BLASTX nr result

ID: Zingiber24_contig00020022 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber24_contig00020022
         (3086 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004308587.1| PREDICTED: beta-galactosidase-like [Fragaria...  1399   0.0  
ref|XP_003634896.1| PREDICTED: beta-galactosidase [Vitis vinifera]   1399   0.0  
ref|XP_002266400.1| PREDICTED: beta-galactosidase isoform 1 [Vit...  1399   0.0  
gb|EOY19805.1| Glycoside hydrolase family 2 protein isoform 1 [T...  1395   0.0  
ref|XP_006645382.1| PREDICTED: beta-galactosidase-like [Oryza br...  1389   0.0  
gb|EOY19806.1| Glycoside hydrolase family 2 protein isoform 2 [T...  1387   0.0  
ref|NP_001045421.1| Os01g0952600 [Oryza sativa Japonica Group] g...  1387   0.0  
ref|XP_004971288.1| PREDICTED: beta-galactosidase-like isoform X...  1379   0.0  
gb|EMJ20103.1| hypothetical protein PRUPE_ppa000532mg [Prunus pe...  1367   0.0  
ref|XP_002513059.1| beta-galactosidase, putative [Ricinus commun...  1365   0.0  
ref|XP_002299206.2| glycoside hydrolase family 2 family protein ...  1364   0.0  
dbj|BAJ86348.1| predicted protein [Hordeum vulgare subsp. vulgare]   1364   0.0  
ref|XP_003565092.1| PREDICTED: beta-galactosidase-like [Brachypo...  1364   0.0  
ref|XP_002303929.2| glycoside hydrolase family 2 family protein ...  1353   0.0  
ref|XP_006487669.1| PREDICTED: beta-galactosidase-like [Citrus s...  1351   0.0  
ref|XP_002877978.1| hydrolase, hydrolyzing O-glycosyl compounds ...  1348   0.0  
ref|NP_001190087.1| glycoside hydrolase family 2 protein [Arabid...  1347   0.0  
ref|NP_680128.1| glycoside hydrolase family 2 protein [Arabidops...  1347   0.0  
gb|EXB62674.1| hypothetical protein L484_023970 [Morus notabilis]    1346   0.0  
ref|NP_001030858.1| glycoside hydrolase family 2 protein [Arabid...  1343   0.0  

>ref|XP_004308587.1| PREDICTED: beta-galactosidase-like [Fragaria vesca subsp. vesca]
          Length = 1113

 Score = 1399 bits (3620), Expect = 0.0
 Identities = 651/975 (66%), Positives = 774/975 (79%), Gaps = 1/975 (0%)
 Frame = -1

Query: 3086 MNPPYVPSDNPTGCYRKSFRIPKEWKGRRILLHFEAVDSAFFVWVNGVLIGYSQDSRLPA 2907
            ++PP+VP DNPTGCYR  F IP+EWKGRR+LLHFEAVDSAF  W+NGV +GYSQDSRLPA
Sbjct: 149  LDPPFVPVDNPTGCYRTDFVIPEEWKGRRVLLHFEAVDSAFCAWINGVPVGYSQDSRLPA 208

Query: 2906 EFEITDHCYPFDLDKDNVLSVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFITDY 2727
            EFEITD+CYP   DK NVL+VQV RWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFI DY
Sbjct: 209  EFEITDYCYPCGSDKKNVLAVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFIGDY 268

Query: 2726 FFKSTLDQNFLTADLQVEVKLD-LWQNCENVSLSNITMEATLYDNSGWSTSGKDNEEMDL 2550
            FF+S L ++F  ADLQVEVK+D   +  +N  + N T+EA L+D+  W + G      DL
Sbjct: 269  FFRSNLAEDFSYADLQVEVKIDNSRETSKNTVIDNFTIEAALFDSGSWYSIGGS---ADL 325

Query: 2549 SSYDVLHLNLRPPPTGSLGSHFYLLEGKLEKPRLWSSEHPNLYTLVLVLKDSSGNILDCE 2370
             S +V +L L   P   LG   Y L G+LE PRLWS+E PNLYTLV++LKD SGNI+DCE
Sbjct: 326  LSSNVANLKLDLSPGSILGFRDYSLVGRLEAPRLWSAEQPNLYTLVVILKDKSGNIVDCE 385

Query: 2369 SCQVGIRQISQAPKQLLVNGQPVVIRGVNRHEHHPRTGKAALEACMIKDLVVMKQNNINA 2190
            SC VGIRQ+S APKQLLVNG P++IRGVNRHEHHPR GK  +E+CMIKDLV+MKQ NINA
Sbjct: 386  SCVVGIRQVSNAPKQLLVNGHPIIIRGVNRHEHHPRLGKTNIESCMIKDLVLMKQYNINA 445

Query: 2189 VRNSHYPQHPRWYELCDLFGVYMIDEANIETHGFDLSSTFKHPTQESRWASSMLDRVIGM 2010
            VRNSHYPQHPRWYELCD+FG+YMIDEANIE HGFD S   KHPT E  WA++MLDRVIGM
Sbjct: 446  VRNSHYPQHPRWYELCDIFGMYMIDEANIEAHGFDYSGHVKHPTLEPSWATAMLDRVIGM 505

Query: 2009 VERDKNHACIISWSLGNESGYGPNHCAMAGWIREKDSSRLLHYEGGGSRTSSTDIICPMY 1830
            VERDKNHACIISWSLGNESGYGPNH A AGW+R KD SRLLHYEGGGSRT STDIICPMY
Sbjct: 506  VERDKNHACIISWSLGNESGYGPNHSASAGWVRGKDPSRLLHYEGGGSRTPSTDIICPMY 565

Query: 1829 MRVWDILKIAKDPNESRPLILCEYSHAMGNSNGNIHKYWEAINGTFGLQGGFIWDWVDQG 1650
            MRVWDI+KIAKDPNE+RPLILCEYSHAMGNSNGNIH+YWEAI+ TFGLQGGFIWDWVDQG
Sbjct: 566  MRVWDIVKIAKDPNETRPLILCEYSHAMGNSNGNIHEYWEAIDSTFGLQGGFIWDWVDQG 625

Query: 1649 LLKVDVDGCKHWAYGGDFGDSPNDLNFCLNGIVWPDRTPHPAMHEVKYLYQPIKIVLAET 1470
            LLK   DG KHWAYGGDFGD PNDLNFCLNG+VWPDRTPHPAMHEVKY+YQPIK+  +E 
Sbjct: 626  LLKDSADGTKHWAYGGDFGDVPNDLNFCLNGLVWPDRTPHPAMHEVKYVYQPIKVSFSEG 685

Query: 1469 KVEITNAQYFNTTQEIEFSWHLHGDGCNLGCGTLNLPVIQAQESYGIEQESAPWYSVWKA 1290
             +++TN  ++ TT+ +EF W  HGDGC LG G L+LP+I+ Q++Y IE +SAPW+++W +
Sbjct: 686  TLKVTNTHFYETTRALEFYWAAHGDGCELGSGNLSLPLIEPQKTYHIESQSAPWHTLWAS 745

Query: 1289 CSAREVFLTIIGKLRHSTRWAKDCHVLASTQLILSAKDDSIPHVVRLISDSNLISKNVGN 1110
             SA E FLTI  KL HST W +  HV++STQ+ L  K + +PHV++   D+  + + VG+
Sbjct: 746  SSAEEFFLTITAKLLHSTCWVEAGHVISSTQVQLPVKREFVPHVIK-TKDATFLREIVGD 804

Query: 1109 IIAIEKQNSWQIKLNTQTGTIESWKVGECSLINQGLLPCLWRAPTDNDKGGAPNSYLCKW 930
             + + +QN+W+I LN + GT+ESWKV    L+ +G+ PC WRAPTDNDKGG  +SY  KW
Sbjct: 805  TLKVSQQNAWEIILNVKMGTVESWKVEGVPLMTKGIFPCFWRAPTDNDKGGGASSYSSKW 864

Query: 929  KAALLDRLTFHTDECSVKELTSNMVHVKTVFIGVPKDQEILLKSKSSNQESEPIHIQCKV 750
            +AA +D L + T  CSV+ ++ +++ V  VF+GVP   E       S  E     I+  +
Sbjct: 865  QAAHIDNLHYITKSCSVENMSDDLLKVAVVFLGVPNSGE------GSGVEDRSALIE--I 916

Query: 749  EVNYYIHSTGDVIVDYMVNPKNDLPPLPRVGVVFHLEKSLDHVSWYGKGPFECYPDRKEA 570
            +V Y I+S+GDV+V+  V P ++LPPLPRVGV FHLEKS+D + WYG+GPFECYPDRK A
Sbjct: 917  DVIYTIYSSGDVVVECNVRPNSNLPPLPRVGVEFHLEKSIDQIKWYGRGPFECYPDRKVA 976

Query: 569  AHVGIYESSVQDLHVPYIFPVESSGRTDVRWVAFQNSSGVGLYASAYGTSPPMQMSASFY 390
            AHVG+YE  V DLHVPYI P E SGR DVRWV FQN  G+G+YAS YG+SPPMQM+AS+Y
Sbjct: 977  AHVGVYEQKVGDLHVPYIVPGECSGRADVRWVTFQNKDGLGIYASIYGSSPPMQMNASYY 1036

Query: 389  GTAELERATHNHYLVEKDYVEVHLDHKHMGIGGDDSWSPCVHDEYLVPPVPYSFSIRLSP 210
             TAEL+RATHN  L+  D +EVHLDHKHMG+ GDDSWSPCVHD+YL+P VP SFSIRLSP
Sbjct: 1037 TTAELDRATHNEDLIRGDDIEVHLDHKHMGLAGDDSWSPCVHDKYLIPAVPSSFSIRLSP 1096

Query: 209  IIPPTTGQDIYRSQI 165
            I P T+G DIY+SQ+
Sbjct: 1097 ITPATSGHDIYKSQV 1111


>ref|XP_003634896.1| PREDICTED: beta-galactosidase [Vitis vinifera]
          Length = 1127

 Score = 1399 bits (3620), Expect = 0.0
 Identities = 651/976 (66%), Positives = 783/976 (80%), Gaps = 2/976 (0%)
 Frame = -1

Query: 3086 MNPPYVPSDNPTGCYRKSFRIPKEWKGRRILLHFEAVDSAFFVWVNGVLIGYSQDSRLPA 2907
            ++PP+VP++NPTGCYR  F IP EWKGRRILLHFEAVDSAFF W+NGV +GYSQDSRLPA
Sbjct: 160  LDPPHVPTENPTGCYRTVFHIPHEWKGRRILLHFEAVDSAFFAWINGVPVGYSQDSRLPA 219

Query: 2906 EFEITDHCYPFDLDKDNVLSVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFITDY 2727
            EFEITD+C+P   +K NVL+VQV RWSDGSYLEDQD WWLSGIHRDVLLL+KPQV+I DY
Sbjct: 220  EFEITDYCHPCGSNKKNVLAVQVFRWSDGSYLEDQDQWWLSGIHRDVLLLAKPQVYIEDY 279

Query: 2726 FFKSTLDQNFLTADLQVEVKLD-LWQNCENVSLSNITMEATLYDNSGWSTSGKDNEEMDL 2550
            FFKS L +NF  AD+QVEVK+D   +  ++  L+  ++EA L+D++ W  S   +E  DL
Sbjct: 280  FFKSNLGENFSYADIQVEVKIDNSLETSKDSILNKFSIEAELFDSAKWHDS---DEYCDL 336

Query: 2549 SSYDVLHLNLRPPP-TGSLGSHFYLLEGKLEKPRLWSSEHPNLYTLVLVLKDSSGNILDC 2373
             S  V H+ L P   T   G   Y+L GKLE P+LWS+E P LYTLV++LKD  G ++DC
Sbjct: 337  HSSSVAHMELDPSSSTAIFGFLGYVLVGKLESPKLWSAEQPYLYTLVVILKDEFGKVVDC 396

Query: 2372 ESCQVGIRQISQAPKQLLVNGQPVVIRGVNRHEHHPRTGKAALEACMIKDLVVMKQNNIN 2193
            ESCQVGIRQ+S+APKQLLVNG PV++RGVNRHEHHPR GK  +E+CM+KDLV+MKQNNIN
Sbjct: 397  ESCQVGIRQVSKAPKQLLVNGHPVILRGVNRHEHHPRLGKTNMESCMVKDLVLMKQNNIN 456

Query: 2192 AVRNSHYPQHPRWYELCDLFGVYMIDEANIETHGFDLSSTFKHPTQESRWASSMLDRVIG 2013
            AVRNSHYPQHPRWYELCDLFG+YMIDEANIETHGF  S   K+PT ES WASSM+DRVI 
Sbjct: 457  AVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFYDSQHLKNPTLESSWASSMMDRVIS 516

Query: 2012 MVERDKNHACIISWSLGNESGYGPNHCAMAGWIREKDSSRLLHYEGGGSRTSSTDIICPM 1833
            MVERDKNHACIISWSLGNESGYGPNH A+AGWIR +DSSRLLHYEGGG+RT STDI+CPM
Sbjct: 517  MVERDKNHACIISWSLGNESGYGPNHSALAGWIRGRDSSRLLHYEGGGARTPSTDIVCPM 576

Query: 1832 YMRVWDILKIAKDPNESRPLILCEYSHAMGNSNGNIHKYWEAINGTFGLQGGFIWDWVDQ 1653
            YMRVWDI+KIAKDP E RPLILCEYSH+MGNSNGNI +YWEAI+ TFGLQGGFIWDWVDQ
Sbjct: 577  YMRVWDIVKIAKDPTEMRPLILCEYSHSMGNSNGNIQEYWEAIDNTFGLQGGFIWDWVDQ 636

Query: 1652 GLLKVDVDGCKHWAYGGDFGDSPNDLNFCLNGIVWPDRTPHPAMHEVKYLYQPIKIVLAE 1473
            GLLKV  DG KHWAYGGDFGD PNDLNFCLNGI WPDRT HPA+HEVKY+YQPIKI L+E
Sbjct: 637  GLLKVGADGAKHWAYGGDFGDIPNDLNFCLNGITWPDRTLHPAVHEVKYVYQPIKISLSE 696

Query: 1472 TKVEITNAQYFNTTQEIEFSWHLHGDGCNLGCGTLNLPVIQAQESYGIEQESAPWYSVWK 1293
            + ++ITN  ++ TT+ +EFSW + GDGC LG GTL+LP+I+ Q SY IE ES PWYS+W 
Sbjct: 697  STLKITNTHFYETTKAMEFSWTVCGDGCKLGSGTLSLPIIEPQSSYSIEFESGPWYSLWA 756

Query: 1292 ACSAREVFLTIIGKLRHSTRWAKDCHVLASTQLILSAKDDSIPHVVRLISDSNLISKNVG 1113
            + SA E FLTI  KL   TRW +  HV++STQ++L AK + +PHV++   D+ +  + +G
Sbjct: 757  SSSAEEHFLTITAKLLQPTRWVEAGHVISSTQILLPAKREFVPHVIK-NKDAPVPGEILG 815

Query: 1112 NIIAIEKQNSWQIKLNTQTGTIESWKVGECSLINQGLLPCLWRAPTDNDKGGAPNSYLCK 933
            N I   +QN W+I+ N QTGTIESWKVG  +++N+G+ PC WRAPTDND GG   SY+ K
Sbjct: 816  NTIRFYQQNVWEIQFNAQTGTIESWKVGGVTVMNKGIFPCFWRAPTDNDNGGGAKSYVSK 875

Query: 932  WKAALLDRLTFHTDECSVKELTSNMVHVKTVFIGVPKDQEILLKSKSSNQESEPIHIQCK 753
            WKAA LD L+F T+ CSV+ +T + V +  V++G+PK +E      +S   SE   +  K
Sbjct: 876  WKAAHLDNLSFITESCSVQNITDHPVKLAVVYLGIPKGEE------NSLSRSENPKVLLK 929

Query: 752  VEVNYYIHSTGDVIVDYMVNPKNDLPPLPRVGVVFHLEKSLDHVSWYGKGPFECYPDRKE 573
            V++ Y ++ +GD+I++  V+P +DLPPLPRVGV F LEK++D + WYGKGPFECYPDRK 
Sbjct: 930  VDITYTVYGSGDIIMECNVHPCSDLPPLPRVGVEFQLEKTIDQIKWYGKGPFECYPDRKA 989

Query: 572  AAHVGIYESSVQDLHVPYIFPVESSGRTDVRWVAFQNSSGVGLYASAYGTSPPMQMSASF 393
            AAHVG+YE +V D+HVPYI PVE SGR DVRWV FQN  G G+YAS YG+SPPMQM+AS+
Sbjct: 990  AAHVGVYEQNVGDMHVPYIVPVECSGRADVRWVTFQNKDGFGIYASMYGSSPPMQMNASY 1049

Query: 392  YGTAELERATHNHYLVEKDYVEVHLDHKHMGIGGDDSWSPCVHDEYLVPPVPYSFSIRLS 213
            Y TAELERATH   L++ D +EVHLDHKHMG+GGDDSWSPCVH++YL+P VPYSFSIRLS
Sbjct: 1050 YSTAELERATHKEKLIKGDDIEVHLDHKHMGLGGDDSWSPCVHEKYLIPAVPYSFSIRLS 1109

Query: 212  PIIPPTTGQDIYRSQI 165
            PI    TG DIY+SQ+
Sbjct: 1110 PITAAITGYDIYKSQL 1125


>ref|XP_002266400.1| PREDICTED: beta-galactosidase isoform 1 [Vitis vinifera]
            gi|296090332|emb|CBI40151.3| unnamed protein product
            [Vitis vinifera]
          Length = 1114

 Score = 1399 bits (3620), Expect = 0.0
 Identities = 651/976 (66%), Positives = 783/976 (80%), Gaps = 2/976 (0%)
 Frame = -1

Query: 3086 MNPPYVPSDNPTGCYRKSFRIPKEWKGRRILLHFEAVDSAFFVWVNGVLIGYSQDSRLPA 2907
            ++PP+VP++NPTGCYR  F IP EWKGRRILLHFEAVDSAFF W+NGV +GYSQDSRLPA
Sbjct: 147  LDPPHVPTENPTGCYRTVFHIPHEWKGRRILLHFEAVDSAFFAWINGVPVGYSQDSRLPA 206

Query: 2906 EFEITDHCYPFDLDKDNVLSVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFITDY 2727
            EFEITD+C+P   +K NVL+VQV RWSDGSYLEDQD WWLSGIHRDVLLL+KPQV+I DY
Sbjct: 207  EFEITDYCHPCGSNKKNVLAVQVFRWSDGSYLEDQDQWWLSGIHRDVLLLAKPQVYIEDY 266

Query: 2726 FFKSTLDQNFLTADLQVEVKLD-LWQNCENVSLSNITMEATLYDNSGWSTSGKDNEEMDL 2550
            FFKS L +NF  AD+QVEVK+D   +  ++  L+  ++EA L+D++ W  S   +E  DL
Sbjct: 267  FFKSNLGENFSYADIQVEVKIDNSLETSKDSILNKFSIEAELFDSAKWHDS---DEYCDL 323

Query: 2549 SSYDVLHLNLRPPP-TGSLGSHFYLLEGKLEKPRLWSSEHPNLYTLVLVLKDSSGNILDC 2373
             S  V H+ L P   T   G   Y+L GKLE P+LWS+E P LYTLV++LKD  G ++DC
Sbjct: 324  HSSSVAHMELDPSSSTAIFGFLGYVLVGKLESPKLWSAEQPYLYTLVVILKDEFGKVVDC 383

Query: 2372 ESCQVGIRQISQAPKQLLVNGQPVVIRGVNRHEHHPRTGKAALEACMIKDLVVMKQNNIN 2193
            ESCQVGIRQ+S+APKQLLVNG PV++RGVNRHEHHPR GK  +E+CM+KDLV+MKQNNIN
Sbjct: 384  ESCQVGIRQVSKAPKQLLVNGHPVILRGVNRHEHHPRLGKTNMESCMVKDLVLMKQNNIN 443

Query: 2192 AVRNSHYPQHPRWYELCDLFGVYMIDEANIETHGFDLSSTFKHPTQESRWASSMLDRVIG 2013
            AVRNSHYPQHPRWYELCDLFG+YMIDEANIETHGF  S   K+PT ES WASSM+DRVI 
Sbjct: 444  AVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFYDSQHLKNPTLESSWASSMMDRVIS 503

Query: 2012 MVERDKNHACIISWSLGNESGYGPNHCAMAGWIREKDSSRLLHYEGGGSRTSSTDIICPM 1833
            MVERDKNHACIISWSLGNESGYGPNH A+AGWIR +DSSRLLHYEGGG+RT STDI+CPM
Sbjct: 504  MVERDKNHACIISWSLGNESGYGPNHSALAGWIRGRDSSRLLHYEGGGARTPSTDIVCPM 563

Query: 1832 YMRVWDILKIAKDPNESRPLILCEYSHAMGNSNGNIHKYWEAINGTFGLQGGFIWDWVDQ 1653
            YMRVWDI+KIAKDP E RPLILCEYSH+MGNSNGNI +YWEAI+ TFGLQGGFIWDWVDQ
Sbjct: 564  YMRVWDIVKIAKDPTEMRPLILCEYSHSMGNSNGNIQEYWEAIDNTFGLQGGFIWDWVDQ 623

Query: 1652 GLLKVDVDGCKHWAYGGDFGDSPNDLNFCLNGIVWPDRTPHPAMHEVKYLYQPIKIVLAE 1473
            GLLKV  DG KHWAYGGDFGD PNDLNFCLNGI WPDRT HPA+HEVKY+YQPIKI L+E
Sbjct: 624  GLLKVGADGAKHWAYGGDFGDIPNDLNFCLNGITWPDRTLHPAVHEVKYVYQPIKISLSE 683

Query: 1472 TKVEITNAQYFNTTQEIEFSWHLHGDGCNLGCGTLNLPVIQAQESYGIEQESAPWYSVWK 1293
            + ++ITN  ++ TT+ +EFSW + GDGC LG GTL+LP+I+ Q SY IE ES PWYS+W 
Sbjct: 684  STLKITNTHFYETTKAMEFSWTVCGDGCKLGSGTLSLPIIEPQSSYSIEFESGPWYSLWA 743

Query: 1292 ACSAREVFLTIIGKLRHSTRWAKDCHVLASTQLILSAKDDSIPHVVRLISDSNLISKNVG 1113
            + SA E FLTI  KL   TRW +  HV++STQ++L AK + +PHV++   D+ +  + +G
Sbjct: 744  SSSAEEHFLTITAKLLQPTRWVEAGHVISSTQILLPAKREFVPHVIK-NKDAPVPGEILG 802

Query: 1112 NIIAIEKQNSWQIKLNTQTGTIESWKVGECSLINQGLLPCLWRAPTDNDKGGAPNSYLCK 933
            N I   +QN W+I+ N QTGTIESWKVG  +++N+G+ PC WRAPTDND GG   SY+ K
Sbjct: 803  NTIRFYQQNVWEIQFNAQTGTIESWKVGGVTVMNKGIFPCFWRAPTDNDNGGGAKSYVSK 862

Query: 932  WKAALLDRLTFHTDECSVKELTSNMVHVKTVFIGVPKDQEILLKSKSSNQESEPIHIQCK 753
            WKAA LD L+F T+ CSV+ +T + V +  V++G+PK +E      +S   SE   +  K
Sbjct: 863  WKAAHLDNLSFITESCSVQNITDHPVKLAVVYLGIPKGEE------NSLSRSENPKVLLK 916

Query: 752  VEVNYYIHSTGDVIVDYMVNPKNDLPPLPRVGVVFHLEKSLDHVSWYGKGPFECYPDRKE 573
            V++ Y ++ +GD+I++  V+P +DLPPLPRVGV F LEK++D + WYGKGPFECYPDRK 
Sbjct: 917  VDITYTVYGSGDIIMECNVHPCSDLPPLPRVGVEFQLEKTIDQIKWYGKGPFECYPDRKA 976

Query: 572  AAHVGIYESSVQDLHVPYIFPVESSGRTDVRWVAFQNSSGVGLYASAYGTSPPMQMSASF 393
            AAHVG+YE +V D+HVPYI PVE SGR DVRWV FQN  G G+YAS YG+SPPMQM+AS+
Sbjct: 977  AAHVGVYEQNVGDMHVPYIVPVECSGRADVRWVTFQNKDGFGIYASMYGSSPPMQMNASY 1036

Query: 392  YGTAELERATHNHYLVEKDYVEVHLDHKHMGIGGDDSWSPCVHDEYLVPPVPYSFSIRLS 213
            Y TAELERATH   L++ D +EVHLDHKHMG+GGDDSWSPCVH++YL+P VPYSFSIRLS
Sbjct: 1037 YSTAELERATHKEKLIKGDDIEVHLDHKHMGLGGDDSWSPCVHEKYLIPAVPYSFSIRLS 1096

Query: 212  PIIPPTTGQDIYRSQI 165
            PI    TG DIY+SQ+
Sbjct: 1097 PITAAITGYDIYKSQL 1112


>gb|EOY19805.1| Glycoside hydrolase family 2 protein isoform 1 [Theobroma cacao]
          Length = 1114

 Score = 1395 bits (3610), Expect = 0.0
 Identities = 642/975 (65%), Positives = 778/975 (79%), Gaps = 1/975 (0%)
 Frame = -1

Query: 3086 MNPPYVPSDNPTGCYRKSFRIPKEWKGRRILLHFEAVDSAFFVWVNGVLIGYSQDSRLPA 2907
            ++PP+VP DNPTGCYR  F IP++W+GRRILLHFEAVDSAF  W+NG+ +GYSQDSRLPA
Sbjct: 148  LDPPHVPIDNPTGCYRTYFHIPEQWQGRRILLHFEAVDSAFCAWINGIPVGYSQDSRLPA 207

Query: 2906 EFEITDHCYPFDLDKDNVLSVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFITDY 2727
            EFEIT++CY  D DK NVL+VQV RWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFI DY
Sbjct: 208  EFEITEYCYSCDSDKKNVLAVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFIADY 267

Query: 2726 FFKSTLDQNFLTADLQVEVKLDLWQNC-ENVSLSNITMEATLYDNSGWSTSGKDNEEMDL 2550
            FFKS+L  NF  AD+QVEVK+D  +   ++  L++ T+EA L+D   W      +  +DL
Sbjct: 268  FFKSSLAYNFSYADIQVEVKIDCSREMSKDKVLTDFTIEAALFDAGVWYNH---DGNVDL 324

Query: 2549 SSYDVLHLNLRPPPTGSLGSHFYLLEGKLEKPRLWSSEHPNLYTLVLVLKDSSGNILDCE 2370
             S +V ++ L+  PTG+LG H Y+L GKLEKP+LWS+E PNLYTLV++LKD+SGN++DCE
Sbjct: 325  LSSNVANIVLKTVPTGTLGFHGYVLVGKLEKPKLWSAEQPNLYTLVIILKDASGNVVDCE 384

Query: 2369 SCQVGIRQISQAPKQLLVNGQPVVIRGVNRHEHHPRTGKAALEACMIKDLVVMKQNNINA 2190
            SC VG+RQ+S+APKQLLVNG PVVIRGVNRHEHHPR GK  +E+CM+KDLVVMKQNNINA
Sbjct: 385  SCLVGVRQVSKAPKQLLVNGHPVVIRGVNRHEHHPRLGKTNIESCMVKDLVVMKQNNINA 444

Query: 2189 VRNSHYPQHPRWYELCDLFGVYMIDEANIETHGFDLSSTFKHPTQESRWASSMLDRVIGM 2010
            VRNSHYPQHPRWYELCDLFG+YMIDEANIETHGFDLS   KH TQE  WA++M+DRVIGM
Sbjct: 445  VRNSHYPQHPRWYELCDLFGIYMIDEANIETHGFDLSGHVKHLTQEPGWAAAMMDRVIGM 504

Query: 2009 VERDKNHACIISWSLGNESGYGPNHCAMAGWIREKDSSRLLHYEGGGSRTSSTDIICPMY 1830
            VERDKNHACI SWSLGNESGYGPNH A AGWIR +D SRL+HYEGGGSRTSSTDIICPMY
Sbjct: 505  VERDKNHACIFSWSLGNESGYGPNHSASAGWIRGRDPSRLVHYEGGGSRTSSTDIICPMY 564

Query: 1829 MRVWDILKIAKDPNESRPLILCEYSHAMGNSNGNIHKYWEAINGTFGLQGGFIWDWVDQG 1650
            MRVWDI+KIAKDPNE+RPLILCEYSHAMGNSNGNIH+YWEAI+  FGLQGGFIWDWVDQG
Sbjct: 565  MRVWDIVKIAKDPNETRPLILCEYSHAMGNSNGNIHEYWEAIDNIFGLQGGFIWDWVDQG 624

Query: 1649 LLKVDVDGCKHWAYGGDFGDSPNDLNFCLNGIVWPDRTPHPAMHEVKYLYQPIKIVLAET 1470
            LLK + DG K+WAYGGDFGDSPNDLNFCLNG+ WPDRTPHPA+ EVKY+YQPIK+ + E+
Sbjct: 625  LLKDNEDGSKYWAYGGDFGDSPNDLNFCLNGLTWPDRTPHPALQEVKYVYQPIKVSIGES 684

Query: 1469 KVEITNAQYFNTTQEIEFSWHLHGDGCNLGCGTLNLPVIQAQESYGIEQESAPWYSVWKA 1290
             ++I N  ++ TT+ +E  W   GDGC LGCG L+LPVI+ Q SY IE +S PWY +W +
Sbjct: 685  MIKIKNTNFYETTEGVELKWAARGDGCELGCGILSLPVIEPQSSYDIEWKSGPWYPLWAS 744

Query: 1289 CSAREVFLTIIGKLRHSTRWAKDCHVLASTQLILSAKDDSIPHVVRLISDSNLISKNVGN 1110
              A E+FLTI  KL HS RW    HV++STQ+ L AK D +PH+++   D  L ++ +G+
Sbjct: 745  SDAEEIFLTITAKLLHSKRWVDAGHVVSSTQVQLLAKRDIVPHIIK-TKDDVLSTEILGD 803

Query: 1109 IIAIEKQNSWQIKLNTQTGTIESWKVGECSLINQGLLPCLWRAPTDNDKGGAPNSYLCKW 930
             I I +Q  W+I LN +TG+++SWKV   S++  G++PC WRAPTDNDKGG P+SY  +W
Sbjct: 804  NIRISQQKLWEITLNVKTGSLDSWKVQGVSILKNGIIPCFWRAPTDNDKGGGPSSYYSRW 863

Query: 929  KAALLDRLTFHTDECSVKELTSNMVHVKTVFIGVPKDQEILLKSKSSNQESEPIHIQCKV 750
            KAA +D + F  + CS++E T + V +  V++GV K +   L       E E      ++
Sbjct: 864  KAAHMDDIVFLRESCSIQEKTDHAVKIVVVYLGVSKGENGPL------NELEKADALVEI 917

Query: 749  EVNYYIHSTGDVIVDYMVNPKNDLPPLPRVGVVFHLEKSLDHVSWYGKGPFECYPDRKEA 570
            ++ Y IH++GD+I+D  V P + LPPLPRVGV FHLEKS+D V WYG+GPFECYPDRK A
Sbjct: 918  DMLYTIHASGDIIIDSNVKPSSSLPPLPRVGVEFHLEKSVDQVKWYGRGPFECYPDRKAA 977

Query: 569  AHVGIYESSVQDLHVPYIFPVESSGRTDVRWVAFQNSSGVGLYASAYGTSPPMQMSASFY 390
            A VG+YE +V D+HVPYI P ES GR DVRWV FQN  G G+YAS YG SPPMQM+AS+Y
Sbjct: 978  AQVGVYEQTVDDMHVPYIVPGESGGRADVRWVTFQNKDGYGIYASTYGKSPPMQMNASYY 1037

Query: 389  GTAELERATHNHYLVEKDYVEVHLDHKHMGIGGDDSWSPCVHDEYLVPPVPYSFSIRLSP 210
             T EL+RAT N  L++ D +EVHLDHKHMGIGGDDSW+PCVH++YL+P VPYSFSIRL P
Sbjct: 1038 STTELDRATRNEELIKGDSIEVHLDHKHMGIGGDDSWTPCVHEKYLIPAVPYSFSIRLCP 1097

Query: 209  IIPPTTGQDIYRSQI 165
            +   T+GQ+IY+SQ+
Sbjct: 1098 VTAATSGQNIYKSQL 1112


>ref|XP_006645382.1| PREDICTED: beta-galactosidase-like [Oryza brachyantha]
          Length = 1117

 Score = 1389 bits (3594), Expect = 0.0
 Identities = 645/977 (66%), Positives = 782/977 (80%), Gaps = 3/977 (0%)
 Frame = -1

Query: 3086 MNPPYVPSDNPTGCYRKSFRIPKEWKGRRILLHFEAVDSAFFVWVNGVLIGYSQDSRLPA 2907
            MNPP+VP+DNPTGCYR  F IPKEWKGRRILLHFEA DSAFF WVNGV +GYSQDSRLPA
Sbjct: 148  MNPPFVPNDNPTGCYRTVFHIPKEWKGRRILLHFEAADSAFFAWVNGVPVGYSQDSRLPA 207

Query: 2906 EFEITDHCYPFDLDKDNVLSVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFITDY 2727
            EFEITD C+P D DK N+L+VQVMRWSDGSYLEDQDHWWLSGIHRDVLL+SKPQ+FITDY
Sbjct: 208  EFEITDFCHPCDSDKKNILAVQVMRWSDGSYLEDQDHWWLSGIHRDVLLVSKPQIFITDY 267

Query: 2726 FFKSTLDQNFLTADLQVEVKLDLWQNCENVSLSNITMEATLYDNSGWSTSGKDNEEMDLS 2547
            FFK+TLD+ F  AD++VEV++D  Q  +   +S +++EATLYDN G      D    DLS
Sbjct: 268  FFKATLDEGFRVADIEVEVEIDS-QKQDQEHVSTLSIEATLYDNYG----PPDGLCSDLS 322

Query: 2546 SYDVLHLNLRPP--PTGSLGSHFYLLEGKLEKPRLWSSEHPNLYTLVLVLKDSSGNILDC 2373
            +  V +L  +P   P    G H Y+L GK+E P+LWSSEHPNLYTLV+VLKDS+G +++C
Sbjct: 323  AASVANLKRKPASRPKHCYGFHGYVLGGKIENPKLWSSEHPNLYTLVIVLKDSNGKLIEC 382

Query: 2372 ESCQVGIRQISQAPKQLLVNGQPVVIRGVNRHEHHPRTGKAALEACMIKDLVVMKQNNIN 2193
            ESCQVGIR +  A KQ+LVNG PVVIRGVNRHEHHPR GK  LEACMIKDLV+M+QNNIN
Sbjct: 383  ESCQVGIRNVVLAHKQMLVNGCPVVIRGVNRHEHHPRVGKTNLEACMIKDLVLMRQNNIN 442

Query: 2192 AVRNSHYPQHPRWYELCDLFGVYMIDEANIETHGFDLSSTFKHPTQESRWASSMLDRVIG 2013
            AVRNSHYPQHPRWYELCD+FG+Y+IDEANIETHGFD SS FKHPT E  WAS+MLDRV+G
Sbjct: 443  AVRNSHYPQHPRWYELCDIFGLYVIDEANIETHGFDESSHFKHPTLEPFWASAMLDRVVG 502

Query: 2012 MVERDKNHACIISWSLGNESGYGPNHCAMAGWIREKDSSRLLHYEGGGSRTSSTDIICPM 1833
            MVERDKNHACII WSLGNES YGPNH AM+GWIR +D +R +HYEGGGSRTSSTDI+CPM
Sbjct: 503  MVERDKNHACIIIWSLGNESSYGPNHSAMSGWIRGRDPTRPIHYEGGGSRTSSTDIVCPM 562

Query: 1832 YMRVWDILKIAKDPNESRPLILCEYSHAMGNSNGNIHKYWEAINGTFGLQGGFIWDWVDQ 1653
            YMRVWDIL IAK+P+E+RPLILCEYSHAMGNSNGNI  YW AI+ T GLQGGFIWDWVDQ
Sbjct: 563  YMRVWDILNIAKEPSENRPLILCEYSHAMGNSNGNIDAYWMAIDNTVGLQGGFIWDWVDQ 622

Query: 1652 GLLKVDVDGCKHWAYGGDFGDSPNDLNFCLNGIVWPDRTPHPAMHEVKYLYQPIKIVLAE 1473
            GLLK D+DG K WAYGGDFGD+PNDLNFCLNGIVWPDRT HPA+HEVKYLYQPIKI + +
Sbjct: 623  GLLKEDMDGSKFWAYGGDFGDTPNDLNFCLNGIVWPDRTIHPAVHEVKYLYQPIKITMVD 682

Query: 1472 TKVEITNAQYFNTTQEIEFSWHLHGDGCNLGCGTLNLPVIQAQESYGIEQESAPWYSVWK 1293
              ++I N  +F TT+ ++FSW LHGDGC+LG G+LN+P +  Q ++ I  ES+PW+++W 
Sbjct: 683  NTLKIENVHFFETTEALDFSWLLHGDGCDLGSGSLNVPSLAPQSTHLINMESSPWFTLWN 742

Query: 1292 ACSAREVFLTIIGKLRHSTRWAKDCHVLASTQLILSAKDDSIPHVVRLISDSNLISKNVG 1113
             C+ +E+FL+I  KLR+ T+WAKD H+LAS Q+ L  K+  +PHV+ L S S+L+S+  G
Sbjct: 743  TCALKEIFLSINVKLRYQTQWAKDGHILASAQICLPQKNGFVPHVIAL-SRSSLVSERAG 801

Query: 1112 NIIAIEKQNSWQIKLNTQTGTIESWKVGECSLINQGLLPCLWRAPTDNDKGGA-PNSYLC 936
            + + I K N+W+IK+N+ +GTI+SW+V    L+++G+ PC WRAPTDND GG+   SY  
Sbjct: 802  DTVIISKNNAWEIKVNSISGTIDSWQVNNIELMSKGIYPCFWRAPTDNDNGGSLTKSYAS 861

Query: 935  KWKAALLDRLTFHTDECSVKELTSNMVHVKTVFIGVPKDQEILLKSKSSNQESEPIHIQC 756
            +W+ A LD ++F++ + S+KEL    V + TV+ G+P     L K   +    E   +  
Sbjct: 862  RWREAFLDNISFYSSKFSLKELPDQTVEISTVYYGLPGK---LPKPDEAALSDESESVLF 918

Query: 755  KVEVNYYIHSTGDVIVDYMVNPKNDLPPLPRVGVVFHLEKSLDHVSWYGKGPFECYPDRK 576
            +V +   I+ +GDVI+DY VNPKNDLPPLPRVGVVF+ +KSL H  WYG+GPFECYPDRK
Sbjct: 919  RVHMRGRIYDSGDVILDYEVNPKNDLPPLPRVGVVFNADKSLSHAKWYGRGPFECYPDRK 978

Query: 575  EAAHVGIYESSVQDLHVPYIFPVESSGRTDVRWVAFQNSSGVGLYASAYGTSPPMQMSAS 396
             AAHVG+YESSV DLHVPYI P E  GR DVRWVA QN+ G GL+ASA G  PPMQMSAS
Sbjct: 979  AAAHVGVYESSVDDLHVPYIVPGECGGRADVRWVALQNADGFGLFASASGEPPPMQMSAS 1038

Query: 395  FYGTAELERATHNHYLVEKDYVEVHLDHKHMGIGGDDSWSPCVHDEYLVPPVPYSFSIRL 216
            +YGTAEL+RATHNH LV+ D +E+HLDHKHMG+GGDDSWSPCVH++YL+ P  Y+FS+RL
Sbjct: 1039 YYGTAELDRATHNHKLVKGDDIELHLDHKHMGLGGDDSWSPCVHEQYLLQPARYAFSVRL 1098

Query: 215  SPIIPPTTGQDIYRSQI 165
             P++P ++  DIY SQ+
Sbjct: 1099 CPLLPSSSCNDIYHSQL 1115


>gb|EOY19806.1| Glycoside hydrolase family 2 protein isoform 2 [Theobroma cacao]
          Length = 1112

 Score = 1387 bits (3589), Expect = 0.0
 Identities = 641/975 (65%), Positives = 776/975 (79%), Gaps = 1/975 (0%)
 Frame = -1

Query: 3086 MNPPYVPSDNPTGCYRKSFRIPKEWKGRRILLHFEAVDSAFFVWVNGVLIGYSQDSRLPA 2907
            ++PP+VP DNPTGCYR  F IP++W+GRRILLHFEAVDSAF  W+NG+ +GYSQDSRLPA
Sbjct: 148  LDPPHVPIDNPTGCYRTYFHIPEQWQGRRILLHFEAVDSAFCAWINGIPVGYSQDSRLPA 207

Query: 2906 EFEITDHCYPFDLDKDNVLSVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFITDY 2727
            EFEIT++CY  D DK NVL+VQV RWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFI DY
Sbjct: 208  EFEITEYCYSCDSDKKNVLAVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFIADY 267

Query: 2726 FFKSTLDQNFLTADLQVEVKLDLWQNC-ENVSLSNITMEATLYDNSGWSTSGKDNEEMDL 2550
            FFKS+L  NF  AD+QVEVK+D  +   ++  L++ T+EA L+D   W      +  +DL
Sbjct: 268  FFKSSLAYNFSYADIQVEVKIDCSREMSKDKVLTDFTIEAALFDAGVWYNH---DGNVDL 324

Query: 2549 SSYDVLHLNLRPPPTGSLGSHFYLLEGKLEKPRLWSSEHPNLYTLVLVLKDSSGNILDCE 2370
             S +V ++ L+  PTG+LG H Y+L GKLEKP+LWS+E PNLYTLV++LKD+SGN++DCE
Sbjct: 325  LSSNVANIVLKTVPTGTLGFHGYVLVGKLEKPKLWSAEQPNLYTLVIILKDASGNVVDCE 384

Query: 2369 SCQVGIRQISQAPKQLLVNGQPVVIRGVNRHEHHPRTGKAALEACMIKDLVVMKQNNINA 2190
            SC VG+RQ+S+APKQLLVNG PVVIRGVNRHEHHPR GK  +E+CM  DLVVMKQNNINA
Sbjct: 385  SCLVGVRQVSKAPKQLLVNGHPVVIRGVNRHEHHPRLGKTNIESCM--DLVVMKQNNINA 442

Query: 2189 VRNSHYPQHPRWYELCDLFGVYMIDEANIETHGFDLSSTFKHPTQESRWASSMLDRVIGM 2010
            VRNSHYPQHPRWYELCDLFG+YMIDEANIETHGFDLS   KH TQE  WA++M+DRVIGM
Sbjct: 443  VRNSHYPQHPRWYELCDLFGIYMIDEANIETHGFDLSGHVKHLTQEPGWAAAMMDRVIGM 502

Query: 2009 VERDKNHACIISWSLGNESGYGPNHCAMAGWIREKDSSRLLHYEGGGSRTSSTDIICPMY 1830
            VERDKNHACI SWSLGNESGYGPNH A AGWIR +D SRL+HYEGGGSRTSSTDIICPMY
Sbjct: 503  VERDKNHACIFSWSLGNESGYGPNHSASAGWIRGRDPSRLVHYEGGGSRTSSTDIICPMY 562

Query: 1829 MRVWDILKIAKDPNESRPLILCEYSHAMGNSNGNIHKYWEAINGTFGLQGGFIWDWVDQG 1650
            MRVWDI+KIAKDPNE+RPLILCEYSHAMGNSNGNIH+YWEAI+  FGLQGGFIWDWVDQG
Sbjct: 563  MRVWDIVKIAKDPNETRPLILCEYSHAMGNSNGNIHEYWEAIDNIFGLQGGFIWDWVDQG 622

Query: 1649 LLKVDVDGCKHWAYGGDFGDSPNDLNFCLNGIVWPDRTPHPAMHEVKYLYQPIKIVLAET 1470
            LLK + DG K+WAYGGDFGDSPNDLNFCLNG+ WPDRTPHPA+ EVKY+YQPIK+ + E+
Sbjct: 623  LLKDNEDGSKYWAYGGDFGDSPNDLNFCLNGLTWPDRTPHPALQEVKYVYQPIKVSIGES 682

Query: 1469 KVEITNAQYFNTTQEIEFSWHLHGDGCNLGCGTLNLPVIQAQESYGIEQESAPWYSVWKA 1290
             ++I N  ++ TT+ +E  W   GDGC LGCG L+LPVI+ Q SY IE +S PWY +W +
Sbjct: 683  MIKIKNTNFYETTEGVELKWAARGDGCELGCGILSLPVIEPQSSYDIEWKSGPWYPLWAS 742

Query: 1289 CSAREVFLTIIGKLRHSTRWAKDCHVLASTQLILSAKDDSIPHVVRLISDSNLISKNVGN 1110
              A E+FLTI  KL HS RW    HV++STQ+ L AK D +PH+++   D  L ++ +G+
Sbjct: 743  SDAEEIFLTITAKLLHSKRWVDAGHVVSSTQVQLLAKRDIVPHIIK-TKDDVLSTEILGD 801

Query: 1109 IIAIEKQNSWQIKLNTQTGTIESWKVGECSLINQGLLPCLWRAPTDNDKGGAPNSYLCKW 930
             I I +Q  W+I LN +TG+++SWKV   S++  G++PC WRAPTDNDKGG P+SY  +W
Sbjct: 802  NIRISQQKLWEITLNVKTGSLDSWKVQGVSILKNGIIPCFWRAPTDNDKGGGPSSYYSRW 861

Query: 929  KAALLDRLTFHTDECSVKELTSNMVHVKTVFIGVPKDQEILLKSKSSNQESEPIHIQCKV 750
            KAA +D + F  + CS++E T + V +  V++GV K +   L       E E      ++
Sbjct: 862  KAAHMDDIVFLRESCSIQEKTDHAVKIVVVYLGVSKGENGPL------NELEKADALVEI 915

Query: 749  EVNYYIHSTGDVIVDYMVNPKNDLPPLPRVGVVFHLEKSLDHVSWYGKGPFECYPDRKEA 570
            ++ Y IH++GD+I+D  V P + LPPLPRVGV FHLEKS+D V WYG+GPFECYPDRK A
Sbjct: 916  DMLYTIHASGDIIIDSNVKPSSSLPPLPRVGVEFHLEKSVDQVKWYGRGPFECYPDRKAA 975

Query: 569  AHVGIYESSVQDLHVPYIFPVESSGRTDVRWVAFQNSSGVGLYASAYGTSPPMQMSASFY 390
            A VG+YE +V D+HVPYI P ES GR DVRWV FQN  G G+YAS YG SPPMQM+AS+Y
Sbjct: 976  AQVGVYEQTVDDMHVPYIVPGESGGRADVRWVTFQNKDGYGIYASTYGKSPPMQMNASYY 1035

Query: 389  GTAELERATHNHYLVEKDYVEVHLDHKHMGIGGDDSWSPCVHDEYLVPPVPYSFSIRLSP 210
             T EL+RAT N  L++ D +EVHLDHKHMGIGGDDSW+PCVH++YL+P VPYSFSIRL P
Sbjct: 1036 STTELDRATRNEELIKGDSIEVHLDHKHMGIGGDDSWTPCVHEKYLIPAVPYSFSIRLCP 1095

Query: 209  IIPPTTGQDIYRSQI 165
            +   T+GQ+IY+SQ+
Sbjct: 1096 VTAATSGQNIYKSQL 1110


>ref|NP_001045421.1| Os01g0952600 [Oryza sativa Japonica Group]
            gi|57899943|dbj|BAD87855.1| putative beta-galactosidase
            [Oryza sativa Japonica Group]
            gi|113534952|dbj|BAF07335.1| Os01g0952600 [Oryza sativa
            Japonica Group] gi|222619883|gb|EEE56015.1| hypothetical
            protein OsJ_04784 [Oryza sativa Japonica Group]
          Length = 1117

 Score = 1387 bits (3589), Expect = 0.0
 Identities = 644/977 (65%), Positives = 785/977 (80%), Gaps = 3/977 (0%)
 Frame = -1

Query: 3086 MNPPYVPSDNPTGCYRKSFRIPKEWKGRRILLHFEAVDSAFFVWVNGVLIGYSQDSRLPA 2907
            MNPP+VP+DNPTGCYR  FRIPKEWKGRRILLHFEAVDSAFF WVNGV +GYSQDSRLPA
Sbjct: 148  MNPPFVPNDNPTGCYRTVFRIPKEWKGRRILLHFEAVDSAFFAWVNGVPVGYSQDSRLPA 207

Query: 2906 EFEITDHCYPFDLDKDNVLSVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFITDY 2727
            EFEITD C+P D +K+NVL+VQVMRWSDGSYLEDQDHWWLSGIHRDVLL+SKPQ+FITDY
Sbjct: 208  EFEITDFCHPCDSEKENVLAVQVMRWSDGSYLEDQDHWWLSGIHRDVLLVSKPQIFITDY 267

Query: 2726 FFKSTLDQNFLTADLQVEVKLDLWQNCENVSLSNITMEATLYDNSGWSTSGKDNEEMDLS 2547
            FFK+TLD+ F  AD++VEV++D  Q  +   +S +++EATLYDN G +    D    D+S
Sbjct: 268  FFKATLDEGFRVADIEVEVEIDS-QKQDREHVSTLSIEATLYDNYGPA----DVLTSDMS 322

Query: 2546 SYDVLHLNLRPP--PTGSLGSHFYLLEGKLEKPRLWSSEHPNLYTLVLVLKDSSGNILDC 2373
            +  V +L L+P   P    G H Y+L GK+E P+LWSSEHPNLYTLV+VLKDS+G +++C
Sbjct: 323  AASVANLKLKPASRPKHCYGFHGYVLGGKVENPKLWSSEHPNLYTLVVVLKDSNGKLIEC 382

Query: 2372 ESCQVGIRQISQAPKQLLVNGQPVVIRGVNRHEHHPRTGKAALEACMIKDLVVMKQNNIN 2193
            ESCQVGIR +  A KQ+LVNG PVVIRGVNRHEHHPR GK  LEACMIKDLV+M+QNNIN
Sbjct: 383  ESCQVGIRNVVLAHKQMLVNGCPVVIRGVNRHEHHPRVGKTNLEACMIKDLVLMRQNNIN 442

Query: 2192 AVRNSHYPQHPRWYELCDLFGVYMIDEANIETHGFDLSSTFKHPTQESRWASSMLDRVIG 2013
            AVRNSHYPQHPRWYELCD+FG+Y+IDEANIETHGFD SS FKHPT E  WAS+MLDRV+G
Sbjct: 443  AVRNSHYPQHPRWYELCDIFGLYVIDEANIETHGFDESSHFKHPTLEPFWASAMLDRVVG 502

Query: 2012 MVERDKNHACIISWSLGNESGYGPNHCAMAGWIREKDSSRLLHYEGGGSRTSSTDIICPM 1833
            MVERDKNHACII WSLGNES YGPNH AM+GWIR KD +R +HYEGGGSRTSSTDI+CPM
Sbjct: 503  MVERDKNHACIIVWSLGNESSYGPNHSAMSGWIRGKDPTRPIHYEGGGSRTSSTDIVCPM 562

Query: 1832 YMRVWDILKIAKDPNESRPLILCEYSHAMGNSNGNIHKYWEAINGTFGLQGGFIWDWVDQ 1653
            YMRVWDILKIA+DP+E+RPLILCEYSHAMGNSNGNI  YW AI+ T GLQGGFIWDWVDQ
Sbjct: 563  YMRVWDILKIAQDPSENRPLILCEYSHAMGNSNGNIDAYWMAIDNTVGLQGGFIWDWVDQ 622

Query: 1652 GLLKVDVDGCKHWAYGGDFGDSPNDLNFCLNGIVWPDRTPHPAMHEVKYLYQPIKIVLAE 1473
            GLLK D DG K+WAYGGDFGD+PNDLNFCLNGIVWPDRT HPA+HEVKYLYQPIKI + +
Sbjct: 623  GLLKEDADGSKNWAYGGDFGDTPNDLNFCLNGIVWPDRTIHPAVHEVKYLYQPIKITMMD 682

Query: 1472 TKVEITNAQYFNTTQEIEFSWHLHGDGCNLGCGTLNLPVIQAQESYGIEQESAPWYSVWK 1293
              ++I N  +F TT+ ++FSW L GDGC LG G+LN+P I  Q ++ I  +S+PW+++W 
Sbjct: 683  NMLKIENVHFFETTEALDFSWLLQGDGCALGSGSLNVPSIAPQSTHLINMKSSPWFTIWS 742

Query: 1292 ACSAREVFLTIIGKLRHSTRWAKDCHVLASTQLILSAKDDSIPHVVRLISDSNLISKNVG 1113
             C  +E+FL+I  KLR+ T+WAKD H+LAS Q+ L  K   +PH + L   S+L+S+ VG
Sbjct: 743  TCVVKEIFLSINVKLRYQTQWAKDGHILASAQICLPPKKGFVPHAIAL-PRSSLVSERVG 801

Query: 1112 NIIAIEKQNSWQIKLNTQTGTIESWKVGECSLINQGLLPCLWRAPTDNDKGG-APNSYLC 936
            + + I K N+WQIK+N+ +GTI+SWKV    L+++G+ PC WR PTDNDKGG     Y+ 
Sbjct: 802  DHVLISKSNAWQIKVNSISGTIDSWKVNNIELMSKGIHPCFWRTPTDNDKGGFYTKPYVS 861

Query: 935  KWKAALLDRLTFHTDECSVKELTSNMVHVKTVFIGVPKDQEILLKSKSSNQESEPIHIQC 756
            +W+ A LD ++F++ + S+KEL    V + T++ G+P +Q    K   ++   E   +  
Sbjct: 862  RWREASLDNISFYSSQFSLKELPDQTVEISTIYYGLPGNQP---KPDETSLSDESESVLF 918

Query: 755  KVEVNYYIHSTGDVIVDYMVNPKNDLPPLPRVGVVFHLEKSLDHVSWYGKGPFECYPDRK 576
            +V++   I+ +GDVI+DY V+PKNDLPPLPRVGVVF+ +KSL    WYG+GPFECYPDRK
Sbjct: 919  RVQMRGRIYDSGDVILDYEVSPKNDLPPLPRVGVVFNADKSLSRAKWYGRGPFECYPDRK 978

Query: 575  EAAHVGIYESSVQDLHVPYIFPVESSGRTDVRWVAFQNSSGVGLYASAYGTSPPMQMSAS 396
             AAHVG+YES V +LHVPYI P E  GR DVRWVA Q++ G GL+ASAYG SPPMQ+SAS
Sbjct: 979  AAAHVGVYESGVDELHVPYIVPGECGGRADVRWVALQDADGFGLFASAYGESPPMQVSAS 1038

Query: 395  FYGTAELERATHNHYLVEKDYVEVHLDHKHMGIGGDDSWSPCVHDEYLVPPVPYSFSIRL 216
            +YG AEL+RATHNH LV+ D +EVHLDHKHMG+GGDDSWSPCVH++YL+PP  Y+FS+RL
Sbjct: 1039 YYGAAELDRATHNHKLVKGDDIEVHLDHKHMGLGGDDSWSPCVHEQYLLPPARYAFSVRL 1098

Query: 215  SPIIPPTTGQDIYRSQI 165
             P++P ++  DIY SQ+
Sbjct: 1099 CPLLPSSSCHDIYHSQL 1115


>ref|XP_004971288.1| PREDICTED: beta-galactosidase-like isoform X1 [Setaria italica]
            gi|514787266|ref|XP_004971289.1| PREDICTED:
            beta-galactosidase-like isoform X2 [Setaria italica]
            gi|514787270|ref|XP_004971290.1| PREDICTED:
            beta-galactosidase-like isoform X3 [Setaria italica]
            gi|514787274|ref|XP_004971291.1| PREDICTED:
            beta-galactosidase-like isoform X4 [Setaria italica]
            gi|514787278|ref|XP_004971292.1| PREDICTED:
            beta-galactosidase-like isoform X5 [Setaria italica]
          Length = 1116

 Score = 1379 bits (3569), Expect = 0.0
 Identities = 645/979 (65%), Positives = 790/979 (80%), Gaps = 5/979 (0%)
 Frame = -1

Query: 3086 MNPPYVPSDNPTGCYRKSFRIPKEWKGRRILLHFEAVDSAFFVWVNGVLIGYSQDSRLPA 2907
            +NPP+V +DNPTGCYR  F IPKEWKGRRILLHFEAVDSAFF WVNGV IGYSQDSRLPA
Sbjct: 148  INPPFVSTDNPTGCYRTVFHIPKEWKGRRILLHFEAVDSAFFAWVNGVPIGYSQDSRLPA 207

Query: 2906 EFEITDHCYPFDLDKDNVLSVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFITDY 2727
            EFE+TD C+P D DK+NVL+VQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQ+FITDY
Sbjct: 208  EFEVTDCCHPCDSDKENVLAVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQIFITDY 267

Query: 2726 FFKSTLDQNFLTADLQVEVKLDLWQNCENVSLSNITMEATLYDNSGWSTSGKDNEEMDLS 2547
            FFK+T+D+NF  AD++VEV++D  +  +   +S +++EATLYDNSG S S     + DLS
Sbjct: 268  FFKATMDENFSLADIEVEVEIDSHKQ-DREHVSTLSIEATLYDNSGPSIS----LDGDLS 322

Query: 2546 SYDVLHLNLRPPPTGS----LGSHFYLLEGKLEKPRLWSSEHPNLYTLVLVLKDSSGNIL 2379
              +V+  NL+P P  S    LG H Y+L GK+E P+LWSSEHPNLYTLV++LKD++G ++
Sbjct: 323  FANVV--NLKPKPKTSRGPCLGFHGYVLGGKIENPKLWSSEHPNLYTLVVLLKDANGKLI 380

Query: 2378 DCESCQVGIRQISQAPKQLLVNGQPVVIRGVNRHEHHPRTGKAALEACMIKDLVVMKQNN 2199
            +CESCQVGIR + +A KQ+LVNG PVV+RGVNRHEHHPR GK  +EACMIKDL++M+QNN
Sbjct: 381  ECESCQVGIRNVVRAHKQMLVNGCPVVLRGVNRHEHHPRLGKTNIEACMIKDLILMRQNN 440

Query: 2198 INAVRNSHYPQHPRWYELCDLFGVYMIDEANIETHGFDLSSTFKHPTQESRWASSMLDRV 2019
            INAVRNSHYPQH RWYELCD+FG+Y+IDEANIETHGFD +S FKHPT E  WA++MLDRV
Sbjct: 441  INAVRNSHYPQHSRWYELCDIFGLYVIDEANIETHGFDENSHFKHPTLEPIWANAMLDRV 500

Query: 2018 IGMVERDKNHACIISWSLGNESGYGPNHCAMAGWIREKDSSRLLHYEGGGSRTSSTDIIC 1839
            +GMVERDKNHACII WSLGNES YGPNH +M+GWIRE+D +RLLHYEGGGSRTSSTDI+C
Sbjct: 501  VGMVERDKNHACIIVWSLGNESSYGPNHASMSGWIRERDPTRLLHYEGGGSRTSSTDIVC 560

Query: 1838 PMYMRVWDILKIAKDPNESRPLILCEYSHAMGNSNGNIHKYWEAINGTFGLQGGFIWDWV 1659
            PMYMRVWDI+KIAKDP+E+RPLILCEYSHAMGNSNGNI  YW AI+ TFGLQGGFIWDWV
Sbjct: 561  PMYMRVWDIIKIAKDPSETRPLILCEYSHAMGNSNGNIDAYWMAIDNTFGLQGGFIWDWV 620

Query: 1658 DQGLLKVDVDGCKHWAYGGDFGDSPNDLNFCLNGIVWPDRTPHPAMHEVKYLYQPIKIVL 1479
            DQGLLK D DG K WAYGGDFGD+PNDLNFCLNGIVWPDRT HPA+HEVKYLYQPIKI  
Sbjct: 621  DQGLLKEDSDGSKFWAYGGDFGDTPNDLNFCLNGIVWPDRTIHPAVHEVKYLYQPIKISS 680

Query: 1478 AETKVEITNAQYFNTTQEIEFSWHLHGDGCNLGCGTLNLPVIQAQESYGIEQESAPWYSV 1299
            A+  ++I N  +F+TT+ ++FSW L GDGC LG G+LN+P +  Q S+ I  ES+PW+++
Sbjct: 681  ADNMLKIENGHFFDTTEALDFSWVLQGDGCILGSGSLNVPTLAPQTSHLINMESSPWFAL 740

Query: 1298 WKACSAREVFLTIIGKLRHSTRWAKDCHVLASTQLILSAKDDSIPHVVRLISDSNLISKN 1119
            W  C+ +EVFL++  K R+ TRWAKD H+LAS QL L  K+  +PH V   S S L+ + 
Sbjct: 741  WSTCAVKEVFLSVNVKQRYHTRWAKDGHLLASAQLCLPQKNGFVPHAV-AFSSSPLVCER 799

Query: 1118 VGNIIAIEKQNSWQIKLNTQTGTIESWKVGECSLINQGLLPCLWRAPTDNDKGG-APNSY 942
             G+ + I K ++W+IK+N+Q GTI+SWKV    L+++G+ PC WRAPTDNDKGG     Y
Sbjct: 800  TGDSVIISKNDAWKIKVNSQLGTIDSWKVSNVELMSKGIFPCFWRAPTDNDKGGFYTKPY 859

Query: 941  LCKWKAALLDRLTFHTDECSVKELTSNMVHVKTVFIGVPKDQEILLKSKSSNQESEPIHI 762
            + +W+ A LD ++F++ + SVKEL  N V + TV+ G+P +   L K   +     P   
Sbjct: 860  VSQWREASLDNVSFYSSQFSVKELPDNTVELSTVYYGLPGN---LPKPDDAALSQAPEST 916

Query: 761  QCKVEVNYYIHSTGDVIVDYMVNPKNDLPPLPRVGVVFHLEKSLDHVSWYGKGPFECYPD 582
              +V +   I+ +GDV+++Y VNPK DLPPLPRVGVVF+ EKSL HV WYG+GPFECYPD
Sbjct: 917  LFQVNMLCRIYESGDVVLEYEVNPKADLPPLPRVGVVFNAEKSLSHVMWYGRGPFECYPD 976

Query: 581  RKEAAHVGIYESSVQDLHVPYIFPVESSGRTDVRWVAFQNSSGVGLYASAYGTSPPMQMS 402
            RK AAHVG+YESSV+DLHVPYI P E  GR DVRWVA +N+ G+GL AS +G SPPMQMS
Sbjct: 977  RKAAAHVGVYESSVEDLHVPYIVPGECGGRADVRWVALRNADGLGLQASVHGESPPMQMS 1036

Query: 401  ASFYGTAELERATHNHYLVEKDYVEVHLDHKHMGIGGDDSWSPCVHDEYLVPPVPYSFSI 222
            AS+YGT EL+RATH H LV+ D +EVHLDH+HMG+GGDDSW+PCVH++YL+PP  Y+FS+
Sbjct: 1037 ASYYGTEELDRATHVHKLVKGDDIEVHLDHRHMGLGGDDSWTPCVHEQYLLPPTRYAFSM 1096

Query: 221  RLSPIIPPTTGQDIYRSQI 165
            RL P++P ++  DIY+SQ+
Sbjct: 1097 RLCPLLPSSSCHDIYKSQL 1115


>gb|EMJ20103.1| hypothetical protein PRUPE_ppa000532mg [Prunus persica]
          Length = 1111

 Score = 1367 bits (3537), Expect = 0.0
 Identities = 633/975 (64%), Positives = 769/975 (78%), Gaps = 1/975 (0%)
 Frame = -1

Query: 3086 MNPPYVPSDNPTGCYRKSFRIPKEWKGRRILLHFEAVDSAFFVWVNGVLIGYSQDSRLPA 2907
            ++PP+VP DNPTGCYR  F IPKEWKGRRILLHFEAVDSAF  W+NGV IGYSQDSRLPA
Sbjct: 148  LDPPFVPVDNPTGCYRTYFHIPKEWKGRRILLHFEAVDSAFCAWLNGVPIGYSQDSRLPA 207

Query: 2906 EFEITDHCYPFDLDKDNVLSVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFITDY 2727
            EFEITD+CYP D+DK NVL+VQV RWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFI DY
Sbjct: 208  EFEITDYCYPSDMDKKNVLAVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFIADY 267

Query: 2726 FFKSTLDQNFLTADLQVEVKLDLWQNCENVS-LSNITMEATLYDNSGWSTSGKDNEEMDL 2550
            FFKSTL ++F  AD+QVEVK+D  +     S L+N  +EA L+D + W +  +     DL
Sbjct: 268  FFKSTLAEDFSYADIQVEVKIDNSRETSKDSVLANYVIEAALFDTACWYSIDR---YADL 324

Query: 2549 SSYDVLHLNLRPPPTGSLGSHFYLLEGKLEKPRLWSSEHPNLYTLVLVLKDSSGNILDCE 2370
               +V  + L    + SLG H YLL G+L+ PRLWS+E P+LYTL + LKD+SGN+LDCE
Sbjct: 325  HLSNVASIKLNLSSSTSLGFHGYLLVGRLDMPRLWSAEQPSLYTLAVTLKDASGNLLDCE 384

Query: 2369 SCQVGIRQISQAPKQLLVNGQPVVIRGVNRHEHHPRTGKAALEACMIKDLVVMKQNNINA 2190
            S  VGIRQ+S+APKQLLVNG P++IRGVNRHEHHPR GK  +E+CM+KDLV+MKQ NINA
Sbjct: 385  SSLVGIRQVSKAPKQLLVNGHPIIIRGVNRHEHHPRLGKTNIESCMVKDLVLMKQYNINA 444

Query: 2189 VRNSHYPQHPRWYELCDLFGVYMIDEANIETHGFDLSSTFKHPTQESRWASSMLDRVIGM 2010
            VRNSHYPQHPRWYELCDLFG+YMIDEANIETHGFDLS   KHPT E  WA++M+DRVIGM
Sbjct: 445  VRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHVKHPTLEPSWATAMMDRVIGM 504

Query: 2009 VERDKNHACIISWSLGNESGYGPNHCAMAGWIREKDSSRLLHYEGGGSRTSSTDIICPMY 1830
            VERDKNHACIISWSLGNE+GYGPNH A+AGW+R KD SRL+HYEGGGSRTSSTDIICPMY
Sbjct: 505  VERDKNHACIISWSLGNEAGYGPNHSALAGWVRGKDPSRLVHYEGGGSRTSSTDIICPMY 564

Query: 1829 MRVWDILKIAKDPNESRPLILCEYSHAMGNSNGNIHKYWEAINGTFGLQGGFIWDWVDQG 1650
            MRVWD+L+I++DPNE+RPLILCEYSHAMGNSNGN+H+YWE I+ TFGLQGGFIWDWVDQ 
Sbjct: 565  MRVWDMLQISRDPNETRPLILCEYSHAMGNSNGNLHEYWEVIDSTFGLQGGFIWDWVDQA 624

Query: 1649 LLKVDVDGCKHWAYGGDFGDSPNDLNFCLNGIVWPDRTPHPAMHEVKYLYQPIKIVLAET 1470
            LLK + DG KHWAYGGDFGD PNDLNFCLNG+ WPDRTPHPA+HEVKY+YQPIK+  ++ 
Sbjct: 625  LLKDNADGSKHWAYGGDFGDVPNDLNFCLNGLTWPDRTPHPALHEVKYVYQPIKVSFSKE 684

Query: 1469 KVEITNAQYFNTTQEIEFSWHLHGDGCNLGCGTLNLPVIQAQESYGIEQESAPWYSVWKA 1290
             + ITN  ++ TTQ +EFSW +HGDGC LG G L  P+I+ Q+SY I+  SA WY +W +
Sbjct: 685  TLRITNTHFYKTTQGLEFSWDVHGDGCKLGSGILPFPLIEPQKSYDIKWRSALWYPLWTS 744

Query: 1289 CSAREVFLTIIGKLRHSTRWAKDCHVLASTQLILSAKDDSIPHVVRLISDSNLISKNVGN 1110
             SA E FLTI  KL  STRW +  HV++STQ+ L +K + +PHV++   D+  +S+ +G+
Sbjct: 745  SSAEEYFLTITAKLLRSTRWVEAGHVISSTQVQLPSKREIVPHVIK-TEDAVFVSETLGD 803

Query: 1109 IIAIEKQNSWQIKLNTQTGTIESWKVGECSLINQGLLPCLWRAPTDNDKGGAPNSYLCKW 930
             I + + + W+I  + QTGT++SW V    L+ +G+ PC WRAPTDNDKGG  +SY   W
Sbjct: 804  KIRVSRHSFWEIIFSVQTGTVDSWTVEGVPLMTKGIFPCFWRAPTDNDKGGGASSYFSLW 863

Query: 929  KAALLDRLTFHTDECSVKELTSNMVHVKTVFIGVPKDQEILLKSKSSNQESEPIHIQCKV 750
            KAA +D L + T  CS++  T ++V +   F GVPK++  L K K          I+ +V
Sbjct: 864  KAAHIDNLHYITQSCSIQNKTDHLVKIAVAFHGVPKEEGALYKGKK---------IKIEV 914

Query: 749  EVNYYIHSTGDVIVDYMVNPKNDLPPLPRVGVVFHLEKSLDHVSWYGKGPFECYPDRKEA 570
            +V Y I+ +GDV+V+  V P ++L  LPRVGV FHL+KS+D + WYG+GPFECYPDRK A
Sbjct: 915  DVIYTIYGSGDVVVECNVRPSSNLRLLPRVGVEFHLDKSMDQIKWYGRGPFECYPDRKAA 974

Query: 569  AHVGIYESSVQDLHVPYIFPVESSGRTDVRWVAFQNSSGVGLYASAYGTSPPMQMSASFY 390
            AHV +YE  V+D+HVPYI P E SGR DVRWV FQN  G G+YAS YG+S PMQ++AS+Y
Sbjct: 975  AHVAVYEQKVEDMHVPYIVPGECSGRADVRWVTFQNKDGFGIYASVYGSSTPMQINASYY 1034

Query: 389  GTAELERATHNHYLVEKDYVEVHLDHKHMGIGGDDSWSPCVHDEYLVPPVPYSFSIRLSP 210
             TAEL+RATHN  L++ D +EVHLDHKHMG+GGDDSWSPCVHD+YLV  VPYSFSIRL P
Sbjct: 1035 TTAELDRATHNEDLIKGDDIEVHLDHKHMGLGGDDSWSPCVHDKYLVHAVPYSFSIRLCP 1094

Query: 209  IIPPTTGQDIYRSQI 165
            I P T+GQ +Y++Q+
Sbjct: 1095 ITPATSGQAVYKTQL 1109


>ref|XP_002513059.1| beta-galactosidase, putative [Ricinus communis]
            gi|223548070|gb|EEF49562.1| beta-galactosidase, putative
            [Ricinus communis]
          Length = 1110

 Score = 1365 bits (3533), Expect = 0.0
 Identities = 640/974 (65%), Positives = 770/974 (79%), Gaps = 1/974 (0%)
 Frame = -1

Query: 3086 MNPPYVPSDNPTGCYRKSFRIPKEWKGRRILLHFEAVDSAFFVWVNGVLIGYSQDSRLPA 2907
            ++PPYVP DNPTGCYR  F+IPKEW+GRRILLHFEAVDSAF  WVNGV +GYSQDSRLPA
Sbjct: 147  LDPPYVPEDNPTGCYRTYFQIPKEWQGRRILLHFEAVDSAFCAWVNGVPVGYSQDSRLPA 206

Query: 2906 EFEITDHCYPFDLDKDNVLSVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFITDY 2727
            EFEIT++CY  D  K NVL+VQV+RWSDGSYLEDQDHWWLSGIHRDVLLL+KPQVFI DY
Sbjct: 207  EFEITEYCYSCDSGKSNVLAVQVIRWSDGSYLEDQDHWWLSGIHRDVLLLAKPQVFIVDY 266

Query: 2726 FFKSTLDQNFLTADLQVEVKLDLWQNC-ENVSLSNITMEATLYDNSGWSTSGKDNEEMDL 2550
            FFKS L ++F +A+++VEVKLD  Q   ++  L N  +EA LYD   W  S   +   +L
Sbjct: 267  FFKSNLAEDFASAEIEVEVKLDSSQEMPKDKILDNFVIEAALYDTESWYNS---DGAANL 323

Query: 2549 SSYDVLHLNLRPPPTGSLGSHFYLLEGKLEKPRLWSSEHPNLYTLVLVLKDSSGNILDCE 2370
             S  V  + + P     LG   Y+L GK+EKP+LWS+E PNLY LVL LKD+ G+++DCE
Sbjct: 324  LSSQVADIKINPSFDAILGFLGYVLVGKVEKPKLWSAEQPNLYILVLTLKDAFGHVVDCE 383

Query: 2369 SCQVGIRQISQAPKQLLVNGQPVVIRGVNRHEHHPRTGKAALEACMIKDLVVMKQNNINA 2190
            SC VGIRQ+S+APKQLLVNGQPV+IRGVNRHEHHPR GK  +E+CMIKDLV+MKQNNINA
Sbjct: 384  SCLVGIRQVSKAPKQLLVNGQPVIIRGVNRHEHHPRIGKTNIESCMIKDLVLMKQNNINA 443

Query: 2189 VRNSHYPQHPRWYELCDLFGVYMIDEANIETHGFDLSSTFKHPTQESRWASSMLDRVIGM 2010
            VRNSHYPQHPRWYELCDLFG+YMIDEANIETHGF LS   KHPT E  WA +M+DRVIGM
Sbjct: 444  VRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFHLSGHIKHPTSEQSWAIAMIDRVIGM 503

Query: 2009 VERDKNHACIISWSLGNESGYGPNHCAMAGWIREKDSSRLLHYEGGGSRTSSTDIICPMY 1830
            VERDKNHACIISWSLGNE+ YGPNH A AGWIR KD+SRL+HYEGGGSRT STDI+CPMY
Sbjct: 504  VERDKNHACIISWSLGNEASYGPNHSAAAGWIRGKDTSRLVHYEGGGSRTPSTDIVCPMY 563

Query: 1829 MRVWDILKIAKDPNESRPLILCEYSHAMGNSNGNIHKYWEAINGTFGLQGGFIWDWVDQG 1650
            MRVWDI+KIA DP E RPLILCEYSHAMGNS+GNI +YWEAI+ TFGLQGGFIWDWVDQG
Sbjct: 564  MRVWDIVKIANDPTELRPLILCEYSHAMGNSSGNICEYWEAIDSTFGLQGGFIWDWVDQG 623

Query: 1649 LLKVDVDGCKHWAYGGDFGDSPNDLNFCLNGIVWPDRTPHPAMHEVKYLYQPIKIVLAET 1470
            LLK + DG K+WAYGGDFGD+PNDLNFCLNG+ WPDR+PHPA+HEVKY+YQPIK+ L  +
Sbjct: 624  LLKENTDGSKYWAYGGDFGDTPNDLNFCLNGLTWPDRSPHPALHEVKYVYQPIKVSLKGS 683

Query: 1469 KVEITNAQYFNTTQEIEFSWHLHGDGCNLGCGTLNLPVIQAQESYGIEQESAPWYSVWKA 1290
             ++ITN  +F TTQ +EFSW  HGDG  LG G L+LP+++ Q SY IE ES PWY +W +
Sbjct: 684  TLKITNTYFFETTQGLEFSWAAHGDGHQLGSGILSLPLMKPQSSYDIELESGPWYPLWAS 743

Query: 1289 CSAREVFLTIIGKLRHSTRWAKDCHVLASTQLILSAKDDSIPHVVRLISDSNLISKNVGN 1110
             S  E+FLT+  KL HST W +  HV++STQ+ L ++ + IPHV++  +D+ L S+ +G+
Sbjct: 744  YSG-EIFLTVTAKLLHSTPWVETGHVISSTQVQLPSRKEIIPHVIK-ATDATLSSEILGD 801

Query: 1109 IIAIEKQNSWQIKLNTQTGTIESWKVGECSLINQGLLPCLWRAPTDNDKGGAPNSYLCKW 930
             + + +Q  W+I LN QTGT+ESWKV   +++N+G+LPC WRAPTDNDKGG  NSY  +W
Sbjct: 802  TVRVSQQTFWEITLNIQTGTVESWKVEGVTIMNKGILPCFWRAPTDNDKGGEENSYYSRW 861

Query: 929  KAALLDRLTFHTDECSVKELTSNMVHVKTVFIGVPKDQEILLKSKSSNQESEPIHIQCKV 750
            KAA +D L F T  CS++E T ++V +K V+IGVP+D++       S+Q S+    +  V
Sbjct: 862  KAARIDNLEFLTKSCSIQEKTDHLVKIKAVYIGVPRDED------DSSQSSKQALFE--V 913

Query: 749  EVNYYIHSTGDVIVDYMVNPKNDLPPLPRVGVVFHLEKSLDHVSWYGKGPFECYPDRKEA 570
            ++ Y I  +GD+I++  V+P +DLPPLPRVGV FHL +S+DHV WYGKGPFECYPDRK A
Sbjct: 914  DIIYEIFGSGDLIIECNVSPSSDLPPLPRVGVEFHLVESVDHVRWYGKGPFECYPDRKAA 973

Query: 569  AHVGIYESSVQDLHVPYIFPVESSGRTDVRWVAFQNSSGVGLYASAYGTSPPMQMSASFY 390
            +HVGIYE +V D+HVPYI P E SGR DVRWV FQN  G G++AS +G SPPMQMS S+Y
Sbjct: 974  SHVGIYEKNVCDMHVPYIVPGECSGRADVRWVTFQNKEGKGIFASMHGNSPPMQMSVSYY 1033

Query: 389  GTAELERATHNHYLVEKDYVEVHLDHKHMGIGGDDSWSPCVHDEYLVPPVPYSFSIRLSP 210
             T EL RA HN  LV  + +EVHLDHKHMGIGGDDSWSPCVH++YLVP VPYSFSIRL P
Sbjct: 1034 STRELHRARHNKELVRGNDIEVHLDHKHMGIGGDDSWSPCVHEKYLVPAVPYSFSIRLCP 1093

Query: 209  IIPPTTGQDIYRSQ 168
            I   T+G  IY  +
Sbjct: 1094 ITAATSGLRIYEPE 1107


>ref|XP_002299206.2| glycoside hydrolase family 2 family protein [Populus trichocarpa]
            gi|550346663|gb|EEE84011.2| glycoside hydrolase family 2
            family protein [Populus trichocarpa]
          Length = 1110

 Score = 1364 bits (3531), Expect = 0.0
 Identities = 635/976 (65%), Positives = 770/976 (78%), Gaps = 2/976 (0%)
 Frame = -1

Query: 3086 MNPPYVPSDNPTGCYRKSFRIPKEWKGRRILLHFEAVDSAFFVWVNGVLIGYSQDSRLPA 2907
            ++PP+VP DNPTGCYR  F IP+EW+GRRILLHFEAVDSAF  W+NGV +GYSQDSRLPA
Sbjct: 147  VDPPHVPDDNPTGCYRTYFDIPEEWQGRRILLHFEAVDSAFCAWINGVPVGYSQDSRLPA 206

Query: 2906 EFEITDHCYPFDLDKDNVLSVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFITDY 2727
            EFEITD+C+P    K NVL+VQV RWSDGSYLEDQDHWWLSG+HRDVLLLSKPQVFI DY
Sbjct: 207  EFEITDYCHPCGSGKKNVLAVQVFRWSDGSYLEDQDHWWLSGVHRDVLLLSKPQVFIADY 266

Query: 2726 FFKSTLDQNFLTADLQVEVKLD--LWQNCENVSLSNITMEATLYDNSGWSTSGKDNEEMD 2553
            FFKS L +NF  AD+QVEVK++  L    E + L+N T+EA LYD   W  S    E  +
Sbjct: 267  FFKSNLAENFTCADIQVEVKIESSLAIPKEKI-LANFTIEAALYDTGSWYDS---EESAN 322

Query: 2552 LSSYDVLHLNLRPPPTGSLGSHFYLLEGKLEKPRLWSSEHPNLYTLVLVLKDSSGNILDC 2373
            L S +V +L L   P G LG    +LEGKLE P+LWS+E PNLY LVL LKD++G ++DC
Sbjct: 323  LLSSNVANLKLTHSPMGLLGFLGNVLEGKLEMPKLWSAEQPNLYILVLSLKDATGQVVDC 382

Query: 2372 ESCQVGIRQISQAPKQLLVNGQPVVIRGVNRHEHHPRTGKAALEACMIKDLVVMKQNNIN 2193
            ESC VGIRQ+S+APKQLLVNG PV++RGVNRHEHHPR GK  +E+CMIKDLV+MKQNN+N
Sbjct: 383  ESCLVGIRQVSKAPKQLLVNGHPVILRGVNRHEHHPRVGKTNIESCMIKDLVLMKQNNMN 442

Query: 2192 AVRNSHYPQHPRWYELCDLFGVYMIDEANIETHGFDLSSTFKHPTQESRWASSMLDRVIG 2013
            AVRNSHYPQH RWYELCDLFG+YMIDEANIETHGF L    KHPTQE  WA++M+DRVI 
Sbjct: 443  AVRNSHYPQHHRWYELCDLFGMYMIDEANIETHGFYLCEHLKHPTQEQSWAAAMMDRVIS 502

Query: 2012 MVERDKNHACIISWSLGNESGYGPNHCAMAGWIREKDSSRLLHYEGGGSRTSSTDIICPM 1833
            MVERDKNHACIISWSLGNE+ YGPNH A AGWIREKD+SRL+HYEGGGSRT+STDI+CPM
Sbjct: 503  MVERDKNHACIISWSLGNEASYGPNHSAAAGWIREKDTSRLVHYEGGGSRTTSTDIVCPM 562

Query: 1832 YMRVWDILKIAKDPNESRPLILCEYSHAMGNSNGNIHKYWEAINGTFGLQGGFIWDWVDQ 1653
            YMRVWDI+KIAKDP ESRPLILCEYSHAMGNSNGNIH+YWEAIN TFGLQGGFIWDWVDQ
Sbjct: 563  YMRVWDIVKIAKDPAESRPLILCEYSHAMGNSNGNIHEYWEAINSTFGLQGGFIWDWVDQ 622

Query: 1652 GLLKVDVDGCKHWAYGGDFGDSPNDLNFCLNGIVWPDRTPHPAMHEVKYLYQPIKIVLAE 1473
            GLLK   DG KHWAYGGDFGD+PNDLNFCLNG+ WPDRTPHPA+HEVKY+YQPIK+ L E
Sbjct: 623  GLLKDSGDGTKHWAYGGDFGDTPNDLNFCLNGLTWPDRTPHPALHEVKYVYQPIKVSLEE 682

Query: 1472 TKVEITNAQYFNTTQEIEFSWHLHGDGCNLGCGTLNLPVIQAQESYGIEQESAPWYSVWK 1293
            ++++IT+  +F TTQ +EFSW   GDG  +G G L+LP+I+ Q SY +E ES PWY +  
Sbjct: 683  SRIKITSTHFFQTTQGLEFSWATQGDGYEIGSGILSLPLIEPQSSYELEWESGPWYPLLA 742

Query: 1292 ACSAREVFLTIIGKLRHSTRWAKDCHVLASTQLILSAKDDSIPHVVRLISDSNLISKNVG 1113
            +  A E+FLTI   L HSTRW +  HV++S+Q+ L      +PHV++  +D+ ++ + +G
Sbjct: 743  SSFAEEIFLTITTTLLHSTRWVEAGHVVSSSQVQLPTTRKILPHVIK-TTDAKVLIETLG 801

Query: 1112 NIIAIEKQNSWQIKLNTQTGTIESWKVGECSLINQGLLPCLWRAPTDNDKGGAPNSYLCK 933
            +I+ +   + W+I  N QTG++ESWKVG   ++N+G+ PC WRAPTDNDKGG   SY  +
Sbjct: 802  DIVRVSLPSFWEITWNIQTGSVESWKVGGVPVMNKGIFPCFWRAPTDNDKGGEKKSYYSR 861

Query: 932  WKAALLDRLTFHTDECSVKELTSNMVHVKTVFIGVPKDQEILLKSKSSNQESEPIHIQCK 753
            WK A +D + +HT  CSVK   +++V ++ V++G P  +E       S+  S  +     
Sbjct: 862  WKEARIDSIVYHTKSCSVKSTANDIVKIEVVYVGAPSCEE------GSSSHSNAVF---T 912

Query: 752  VEVNYYIHSTGDVIVDYMVNPKNDLPPLPRVGVVFHLEKSLDHVSWYGKGPFECYPDRKE 573
            V + Y I+S+GD+I++  V P ++LPPLPRVGV  HLEKS+D + WYG+GPFECYPDRK 
Sbjct: 913  VNMIYTIYSSGDLIIECNVIPSSELPPLPRVGVELHLEKSVDQIKWYGRGPFECYPDRKA 972

Query: 572  AAHVGIYESSVQDLHVPYIFPVESSGRTDVRWVAFQNSSGVGLYASAYGTSPPMQMSASF 393
            AAHVG+YE +V D+HVPYI P E SGR DVRWV FQN +GVG++AS YG+SPPMQMSAS+
Sbjct: 973  AAHVGVYEQNVGDMHVPYIVPGECSGRADVRWVTFQNKNGVGIFASTYGSSPPMQMSASY 1032

Query: 392  YGTAELERATHNHYLVEKDYVEVHLDHKHMGIGGDDSWSPCVHDEYLVPPVPYSFSIRLS 213
            Y TAEL+RATHN  L + + +EVHLDHKHMG+GGDDSWSPCVHD YLVP VPYS+SIRL 
Sbjct: 1033 YSTAELDRATHNEELAQGNDIEVHLDHKHMGVGGDDSWSPCVHDNYLVPAVPYSYSIRLC 1092

Query: 212  PIIPPTTGQDIYRSQI 165
            PI   T+G +IY+SQ+
Sbjct: 1093 PITAATSGLEIYKSQL 1108


>dbj|BAJ86348.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 1122

 Score = 1364 bits (3531), Expect = 0.0
 Identities = 637/981 (64%), Positives = 782/981 (79%), Gaps = 7/981 (0%)
 Frame = -1

Query: 3086 MNPPYVPSDNPTGCYRKSFRIPKEWKGRRILLHFEAVDSAFFVWVNGVLIGYSQDSRLPA 2907
            MNPP+VPS+NPTGCYRK F IPKEWKGRRILLHFEAVDSAF  WVNGV IGYSQDSRLPA
Sbjct: 153  MNPPFVPSENPTGCYRKVFHIPKEWKGRRILLHFEAVDSAFLAWVNGVPIGYSQDSRLPA 212

Query: 2906 EFEITDHCYPFDLDKDNVLSVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFITDY 2727
            EFEITD C+  D  K+NVL+VQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQ+FITDY
Sbjct: 213  EFEITDCCHHCDSGKENVLAVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQIFITDY 272

Query: 2726 FFKSTLDQNFLTADLQVEVKLDLWQNCENVSLSNITMEATLYDNSGWSTSGKDNEEMDLS 2547
            FFKSTLD+NF  AD++VEV++D  +  +   +  +++EATL+DNS       D+   D+S
Sbjct: 273  FFKSTLDENFRVADIEVEVEIDSHKE-DREHIPTLSIEATLFDNS----ESSDDLNSDMS 327

Query: 2546 SYDVLHLNLRPPPTGSL--GSHFYLLEGKLEKPRLWSSEHPNLYTLVLVLKDSSGNILDC 2373
              +V++L  +P P G    G H Y+L GK+E P+LWSSE PNLYTLV++LKD++G ++DC
Sbjct: 328  DANVVNLKTKPKPKGGPCHGFHGYVLGGKVENPKLWSSEKPNLYTLVVLLKDANGKLIDC 387

Query: 2372 ESCQVGIRQISQAPKQLLVNGQPVVIRGVNRHEHHPRTGKAALEACMIKDLVVMKQNNIN 2193
            ESCQVGIR +  A KQ+LVNG PVVIRGVNRHEHHPR GK  LEACMIKDLV+M+QNNIN
Sbjct: 388  ESCQVGIRNVVLAHKQMLVNGSPVVIRGVNRHEHHPRVGKTNLEACMIKDLVLMRQNNIN 447

Query: 2192 AVRNSHYPQHPRWYELCDLFGVYMIDEANIETHGFDLSSTFKHPTQESRWASSMLDRVIG 2013
            AVRNSHYPQHPRWYELCD+FG+Y+IDEANIETHGFD +S FKHPT E  WA+SMLDRV+G
Sbjct: 448  AVRNSHYPQHPRWYELCDIFGLYVIDEANIETHGFDETSHFKHPTLEPIWANSMLDRVVG 507

Query: 2012 MVERDKNHACIISWSLGNESGYGPNHCAMAGWIREKDSSRLLHYEGGGSRTSSTDIICPM 1833
            MVERDKNHACII WSLGNE+ YGPNH AM+GW+R +D +RL+HYEGGGSRTSSTDI+CPM
Sbjct: 508  MVERDKNHACIIIWSLGNEASYGPNHSAMSGWVRGRDPTRLIHYEGGGSRTSSTDIVCPM 567

Query: 1832 YMRVWDILKIAKDPNESRPLILCEYSHAMGNSNGNIHKYWEAINGTFGLQGGFIWDWVDQ 1653
            YMRVWDILKIA DP+E+RPLILCEYSHAMGNSNGNI  YW+AI+ T GLQGGFIWDWVDQ
Sbjct: 568  YMRVWDILKIANDPSENRPLILCEYSHAMGNSNGNIDAYWKAIDNTMGLQGGFIWDWVDQ 627

Query: 1652 GLLKVDVDGCKHWAYGGDFGDSPNDLNFCLNGIVWPDRTPHPAMHEVKYLYQPIKIVLAE 1473
            GLLK +VDG K WAYGGDFGD+PNDLNFC+NGIVWPDRT HPA++EVKYLYQPIK+ L +
Sbjct: 628  GLLKENVDGSKSWAYGGDFGDTPNDLNFCINGIVWPDRTLHPAVNEVKYLYQPIKVSLVD 687

Query: 1472 TKVEITNAQYFNTTQEIEFSWHLHGDGCNLGCGTLNLPVIQAQESYGIEQESAPWYSVWK 1293
              ++I N Q+  TT+ ++FSW +HGDG  LG G+ ++P +  Q S+ I  ES+PW+++W 
Sbjct: 688  NILKIENGQFSETTEALDFSWIIHGDGSVLGSGSFSVPNLAPQSSHLINMESSPWFTLWS 747

Query: 1292 ACSAREVFLTIIGKLRHSTRWAKDCHVLASTQLILSAKDDSIPHVVRLISDSNLISKNVG 1113
            AC+A+E FL++   LR  TRWAK  HVLAS Q+ L      +PHV+ L S S LIS+ VG
Sbjct: 748  ACAAKETFLSVHVTLRDQTRWAKAGHVLASAQVCLPQIKGFVPHVIAL-SQSPLISQQVG 806

Query: 1112 NIIAIEKQNSWQIKLNTQTGTIESWKVGECSLINQGLLPCLWRAPTDNDKGG-APNSYLC 936
            + + I K N WQI++N+Q GTI+SWK+    L+++G+ PC WRAPTDND GG    SY  
Sbjct: 807  DGVIISKNNEWQIRINSQLGTIDSWKINNVELMSKGIFPCFWRAPTDNDNGGFYTKSYAT 866

Query: 935  KWKAALLDRLTFHTDECSVKELTSNMVHVKTVFIG----VPKDQEILLKSKSSNQESEPI 768
             W+ A LD ++F++ + SVKEL  + V V T + G    +PK  +  L S++S+     +
Sbjct: 867  TWREAFLDNVSFYSSQFSVKELPDHTVEVSTTYYGLPGHLPKPDDAAL-SEASDSVLFRV 925

Query: 767  HIQCKVEVNYYIHSTGDVIVDYMVNPKNDLPPLPRVGVVFHLEKSLDHVSWYGKGPFECY 588
            H++C+      I+ +GDV++DY VNPK+DLPPLPRVGV+F+ EKSL+HV+WYG+GPFECY
Sbjct: 926  HMRCR------IYESGDVVLDYEVNPKSDLPPLPRVGVMFNAEKSLNHVTWYGRGPFECY 979

Query: 587  PDRKEAAHVGIYESSVQDLHVPYIFPVESSGRTDVRWVAFQNSSGVGLYASAYGTSPPMQ 408
            PDRK AAHVG+YES V+DLHVPYI P E  GR DVRWVA +N+ G GL+AS +G SPPMQ
Sbjct: 980  PDRKAAAHVGVYESGVEDLHVPYIVPGECGGRADVRWVALKNADGFGLFASTHGESPPMQ 1039

Query: 407  MSASFYGTAELERATHNHYLVEKDYVEVHLDHKHMGIGGDDSWSPCVHDEYLVPPVPYSF 228
             SAS+YG+ EL+RATH H L + D +EVHLDH+HMG+GGDDSW+PCVH+EYL+PP  Y F
Sbjct: 1040 ASASYYGSVELDRATHQHKLTKGDDIEVHLDHRHMGVGGDDSWTPCVHEEYLLPPASYRF 1099

Query: 227  SIRLSPIIPPTTGQDIYRSQI 165
            S+RL P++P ++  DIY SQ+
Sbjct: 1100 SLRLRPLLPSSSCHDIYISQL 1120


>ref|XP_003565092.1| PREDICTED: beta-galactosidase-like [Brachypodium distachyon]
          Length = 1119

 Score = 1364 bits (3530), Expect = 0.0
 Identities = 637/976 (65%), Positives = 779/976 (79%), Gaps = 2/976 (0%)
 Frame = -1

Query: 3086 MNPPYVPSDNPTGCYRKSFRIPKEWKGRRILLHFEAVDSAFFVWVNGVLIGYSQDSRLPA 2907
            MNPP+VPS+NPTGCYRK F IPKEWKGRRILLHFEAVDSAF  WVNGV IGYSQDSRLPA
Sbjct: 149  MNPPFVPSENPTGCYRKVFHIPKEWKGRRILLHFEAVDSAFLAWVNGVPIGYSQDSRLPA 208

Query: 2906 EFEITDHCYPFDLDKDNVLSVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFITDY 2727
            EFEITD C+  D  K+NVL+VQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKP++FITDY
Sbjct: 209  EFEITDCCHHCDSGKENVLAVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPEIFITDY 268

Query: 2726 FFKSTLDQNFLTADLQVEVKLDLWQNCENVSLSNITMEATLYDNSGWSTSGKDNEEMDLS 2547
            FFK+TLD+NFL AD++VEV++D  +  +   +S +++EA L+DNS    SG  N +M  +
Sbjct: 269  FFKATLDENFLVADIEVEVEIDSHKQ-DREHISTLSIEAKLFDNS--VPSGGLNSDMSAA 325

Query: 2546 SYDVLHLNLRPPPTGSLGSHFYLLEGKLEKPRLWSSEHPNLYTLVLVLKDSSGNILDCES 2367
            +   L    +P P+   G H Y+L GK+E P+LWSSE PNLYTLV++LKD+ G +++CES
Sbjct: 326  NVVNLKAKPKPKPSHCHGFHGYVLGGKVENPKLWSSEKPNLYTLVVLLKDAKGKLIECES 385

Query: 2366 CQVGIRQISQAPKQLLVNGQPVVIRGVNRHEHHPRTGKAALEACMIKDLVVMKQNNINAV 2187
            CQVGIR +  A KQ+LVNG PVVIRGVNRHEHHPR GK  +EACMIKDLV+M+QNNINAV
Sbjct: 386  CQVGIRGVVLAHKQMLVNGCPVVIRGVNRHEHHPRVGKTNIEACMIKDLVLMRQNNINAV 445

Query: 2186 RNSHYPQHPRWYELCDLFGVYMIDEANIETHGFDLSSTFKHPTQESRWASSMLDRVIGMV 2007
            RN HYPQHPRWYELCD+FG+Y+IDEANIETHGFD SS+FKHPT ES W + MLDRV+ MV
Sbjct: 446  RNCHYPQHPRWYELCDIFGLYVIDEANIETHGFDESSSFKHPTLESIWTNCMLDRVVSMV 505

Query: 2006 ERDKNHACIISWSLGNESGYGPNHCAMAGWIREKDSSRLLHYEGGGSRTSSTDIICPMYM 1827
            ERDKNHACII WSLGNES YGPNH AM+GW+R +D +RL+HYEGGGSRTSSTDI+CPMYM
Sbjct: 506  ERDKNHACIIVWSLGNESSYGPNHSAMSGWVRGRDPTRLIHYEGGGSRTSSTDIVCPMYM 565

Query: 1826 RVWDILKIAKDPNESRPLILCEYSHAMGNSNGNIHKYWEAINGTFGLQGGFIWDWVDQGL 1647
            RVWDILKIA DP+E+RPLILCEYSHAMGNSNGNI  YW AI+ T GLQGGFIWDWVDQGL
Sbjct: 566  RVWDILKIANDPSENRPLILCEYSHAMGNSNGNIDAYWMAIDSTMGLQGGFIWDWVDQGL 625

Query: 1646 LKVDVDGCKHWAYGGDFGDSPNDLNFCLNGIVWPDRTPHPAMHEVKYLYQPIKIVLAETK 1467
            LK DV+G K WAYGGDFGD+PND NFCLNGIVWPDRT HPA++EVK+LYQPIKI L +  
Sbjct: 626  LKEDVNGSKFWAYGGDFGDTPNDSNFCLNGIVWPDRTIHPAVYEVKHLYQPIKISLMDNT 685

Query: 1466 VEITNAQYFNTTQEIEFSWHLHGDGCNLGCGTLNLPVIQAQESYGIEQESAPWYSVWKAC 1287
            ++I NAQ+  TT+ ++FSW LHGDGC LG G+L+LP I  Q S+ I  +S+PW+++W AC
Sbjct: 686  LKINNAQFSETTEALDFSWILHGDGCVLGSGSLDLPDIAPQSSHLINMQSSPWFTLWSAC 745

Query: 1286 SAREVFLTIIGKLRHSTRWAKDCHVLASTQLILSAKDDSIPHVVRLISDSNLISKNVGNI 1107
            +A+E FL+I  KL   TRW+KD H+LAS QL L   +  +PHV+ L S S L+S++VG+ 
Sbjct: 746  AAKETFLSINVKLGGQTRWSKDGHILASAQLCLPQTNSFVPHVIAL-SRSPLVSEHVGDS 804

Query: 1106 IAIEKQNSWQIKLNTQTGTIESWKVGECSLINQGLLPCLWRAPTDNDKGG-APNSYLCKW 930
            + I K   WQIK+NT++GTI+SWK+    L+++G+LPC WRAPTDND GG    SY  +W
Sbjct: 805  VIISKNKDWQIKVNTRSGTIDSWKINNIELLSKGILPCFWRAPTDNDNGGFYTKSYATRW 864

Query: 929  KAALLDRLTFHTDECSVKELTSNMVHVKTVFIGVPKDQEILLKSKSSNQESEPIHIQCKV 750
            + A LD + F + + SVKEL  + V + T + G+P     L+K   +     P     +V
Sbjct: 865  REAFLDNIVFRSSQFSVKELPDHAVEISTTYYGLPGH---LVKPDDAALSEAPESTLFQV 921

Query: 749  EVNYYIHSTGDVIVDYMVNPKNDLPPLPRVGVVFHLEKSLDHVSWYGKGPFECYPDRKEA 570
             +   ++ +GDVI++Y VNPK+DLPPLPR+G+VF+ EKSL HV+WYG+GPFECYPDRK A
Sbjct: 922  PMRCRMYESGDVILEYEVNPKSDLPPLPRIGIVFNTEKSLSHVTWYGRGPFECYPDRKAA 981

Query: 569  AHVGIYESSVQDLHVPYIFPVESSGRTDVRWVAFQNSSGVGLYASAYGTSPPMQMSASFY 390
            AHVG+YESSV+DLHVPYI P E  GR DVRW A +N+ G GL+AS +G SPPMQMSAS+Y
Sbjct: 982  AHVGVYESSVEDLHVPYIVPGECGGRADVRWTALRNAEGFGLFASVHGESPPMQMSASYY 1041

Query: 389  GTAELERATHNHYLVEKDYVEVHLDHKHMGIGGDDSWSPCVHDEYLVPP-VPYSFSIRLS 213
            G AEL+RATH H LV+ D +EVHLDH+HMG+GGDDSW+PCVH++YL+PP   Y+FS+RL 
Sbjct: 1042 GAAELDRATHKHKLVKGDDIEVHLDHRHMGLGGDDSWTPCVHEQYLLPPDRRYAFSVRLR 1101

Query: 212  PIIPPTTGQDIYRSQI 165
            P++P T+  DIYRSQ+
Sbjct: 1102 PLLPSTSCDDIYRSQL 1117


>ref|XP_002303929.2| glycoside hydrolase family 2 family protein [Populus trichocarpa]
            gi|550343549|gb|EEE78908.2| glycoside hydrolase family 2
            family protein [Populus trichocarpa]
          Length = 1113

 Score = 1353 bits (3503), Expect = 0.0
 Identities = 633/974 (64%), Positives = 763/974 (78%), Gaps = 1/974 (0%)
 Frame = -1

Query: 3086 MNPPYVPSDNPTGCYRKSFRIPKEWKGRRILLHFEAVDSAFFVWVNGVLIGYSQDSRLPA 2907
            ++PP VP DNPTGCYR  F +P+ W+ RRI LHFEAVDSAF  W+NGV +GYSQDSRLPA
Sbjct: 147  VDPPRVPDDNPTGCYRTYFDLPQGWQDRRIFLHFEAVDSAFCAWINGVAVGYSQDSRLPA 206

Query: 2906 EFEITDHCYPFDLDKDNVLSVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFITDY 2727
            EFEITD+CYP    K N+L+VQV RWSDGSYLEDQDHWW+SGIHRDVLLLSK QVFI DY
Sbjct: 207  EFEITDYCYPCGSGKKNLLAVQVFRWSDGSYLEDQDHWWMSGIHRDVLLLSKAQVFIADY 266

Query: 2726 FFKSTLDQNFLTADLQVEVKLDLWQNCENVSL-SNITMEATLYDNSGWSTSGKDNEEMDL 2550
            FFKS L +NF +AD++VEVK++         +  N T+EA LYD   W  S    E  DL
Sbjct: 267  FFKSNLAENFTSADIEVEVKIESALEIPRDKIFDNFTIEAALYDTGSWYNS---EESPDL 323

Query: 2549 SSYDVLHLNLRPPPTGSLGSHFYLLEGKLEKPRLWSSEHPNLYTLVLVLKDSSGNILDCE 2370
             S +V +L L   P G LG     LEGKLEKP+LWS+E PNLY LVL LKD++G ++DCE
Sbjct: 324  LSSNVANLKLTHSPMGILGFLGNFLEGKLEKPKLWSAEQPNLYILVLSLKDATGQVVDCE 383

Query: 2369 SCQVGIRQISQAPKQLLVNGQPVVIRGVNRHEHHPRTGKAALEACMIKDLVVMKQNNINA 2190
            SC VGIRQIS+APKQLLVNG PV+IRGVNRHEHHPR GK  +E+CMIKDLV+MKQNN+NA
Sbjct: 384  SCLVGIRQISKAPKQLLVNGCPVIIRGVNRHEHHPRVGKTNIESCMIKDLVLMKQNNMNA 443

Query: 2189 VRNSHYPQHPRWYELCDLFGVYMIDEANIETHGFDLSSTFKHPTQESRWASSMLDRVIGM 2010
            VRNSHYPQHPRWYELCDLFG+YMIDEANIETHGF L    KHPTQE  WA++M+DRVI M
Sbjct: 444  VRNSHYPQHPRWYELCDLFGLYMIDEANIETHGFHLCEHLKHPTQEQSWAAAMMDRVISM 503

Query: 2009 VERDKNHACIISWSLGNESGYGPNHCAMAGWIREKDSSRLLHYEGGGSRTSSTDIICPMY 1830
            VERDKNHACIISWSLGNES YGPNH A AGWIRE+D SRL+HYEGGGSRT+STDIICPMY
Sbjct: 504  VERDKNHACIISWSLGNESSYGPNHSAAAGWIRERDPSRLVHYEGGGSRTASTDIICPMY 563

Query: 1829 MRVWDILKIAKDPNESRPLILCEYSHAMGNSNGNIHKYWEAINGTFGLQGGFIWDWVDQG 1650
            MRVWDI+KIAKDP E RPLILCEYSHAMGNS+GNI +YW+AI+ TFGLQGGFIW+WVDQ 
Sbjct: 564  MRVWDIVKIAKDPTEPRPLILCEYSHAMGNSSGNIREYWDAIDSTFGLQGGFIWEWVDQA 623

Query: 1649 LLKVDVDGCKHWAYGGDFGDSPNDLNFCLNGIVWPDRTPHPAMHEVKYLYQPIKIVLAET 1470
            LLK   DG KHWAYGGDFGD+PNDLNFCLNG+ WPDRTPHPA+ EVKY+YQPIK+ L E+
Sbjct: 624  LLKESGDGRKHWAYGGDFGDTPNDLNFCLNGLTWPDRTPHPALEEVKYVYQPIKVSLEES 683

Query: 1469 KVEITNAQYFNTTQEIEFSWHLHGDGCNLGCGTLNLPVIQAQESYGIEQESAPWYSVWKA 1290
             ++ITN  +F TTQ +EFSW +HGDG  LG G L+LP+ + Q SY +E E  PWY +  +
Sbjct: 684  TIKITNTHFFQTTQGLEFSWTVHGDGYELGSGILSLPLTEPQSSYKLEWELGPWYPLLAS 743

Query: 1289 CSAREVFLTIIGKLRHSTRWAKDCHVLASTQLILSAKDDSIPHVVRLISDSNLISKNVGN 1110
              A E+F+TI  +L HSTRW +  HV++STQ+ L  +   +PHV++  +D+ + S+ +G+
Sbjct: 744  SFAEEIFVTITTRLLHSTRWVEAGHVISSTQIQLPTRQKIMPHVIK-TTDAKVFSETLGD 802

Query: 1109 IIAIEKQNSWQIKLNTQTGTIESWKVGECSLINQGLLPCLWRAPTDNDKGGAPNSYLCKW 930
             + + + N W+I  N QTG+IESWKVG   +I +G++PC WRAPTDNDKGG  +SY  +W
Sbjct: 803  TVRVSQLNVWEITWNIQTGSIESWKVGGVPVIKEGIIPCFWRAPTDNDKGGEKDSYYSRW 862

Query: 929  KAALLDRLTFHTDECSVKELTSNMVHVKTVFIGVPKDQEILLKSKSSNQESEPIHIQCKV 750
            KAA +D L F T  CSVK  T N+V ++ +++GVP  +E  L S+S+N  +        V
Sbjct: 863  KAAGIDSLVFQTKSCSVKSTTDNLVKIEVIYVGVPSCEERSL-SESTNATA-----LITV 916

Query: 749  EVNYYIHSTGDVIVDYMVNPKNDLPPLPRVGVVFHLEKSLDHVSWYGKGPFECYPDRKEA 570
             + Y I+S+GD+I++    P ++LPPLPRVGV  HLEKS+D + WYG+GPFECYPDRK A
Sbjct: 917  NMIYTIYSSGDLIIECNAIPSSELPPLPRVGVELHLEKSVDQIKWYGRGPFECYPDRKAA 976

Query: 569  AHVGIYESSVQDLHVPYIFPVESSGRTDVRWVAFQNSSGVGLYASAYGTSPPMQMSASFY 390
            AHVG+YE +V D+HVPYI PVE SGR DVRWV FQN  GVG++AS YG+SPPMQMSAS+Y
Sbjct: 977  AHVGVYEQNVGDMHVPYIVPVECSGRADVRWVTFQNKDGVGIFASTYGSSPPMQMSASYY 1036

Query: 389  GTAELERATHNHYLVEKDYVEVHLDHKHMGIGGDDSWSPCVHDEYLVPPVPYSFSIRLSP 210
             TAEL+RATH+  LV+ + +EVHLDHKHMG+GGDDSWSPCVHD+YLVP VP SFSIRL P
Sbjct: 1037 FTAELDRATHHEELVQGNDIEVHLDHKHMGLGGDDSWSPCVHDKYLVPAVPCSFSIRLCP 1096

Query: 209  IIPPTTGQDIYRSQ 168
            I   T+G +IY+SQ
Sbjct: 1097 ITAATSGLEIYKSQ 1110


>ref|XP_006487669.1| PREDICTED: beta-galactosidase-like [Citrus sinensis]
          Length = 1115

 Score = 1351 bits (3496), Expect = 0.0
 Identities = 630/976 (64%), Positives = 768/976 (78%), Gaps = 2/976 (0%)
 Frame = -1

Query: 3086 MNPPYVPSDNPTGCYRKSFRIPKEWKGRRILLHFEAVDSAFFVWVNGVLIGYSQDSRLPA 2907
            ++PP VP++NPTGCYR  F IPKEW+GRRILLHFEAVDSAF  W+NGV +GYSQDSRLPA
Sbjct: 149  LDPPNVPAENPTGCYRTYFHIPKEWQGRRILLHFEAVDSAFCAWINGVPVGYSQDSRLPA 208

Query: 2906 EFEITDHCYPFDLDKDNVLSVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFITDY 2727
            EFEI+D+CYP   DK NVL+VQV RWSDGSYLEDQDHWWLSGIHRDVLLL+KPQVFI DY
Sbjct: 209  EFEISDYCYPHGSDKKNVLAVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLAKPQVFIADY 268

Query: 2726 FFKSTLDQNFLTADLQVEVKLDLWQNCENVS-LSNITMEATLYDNSGWSTSGKDNEEMDL 2550
            FFKS L ++F  AD+QVEV++D        S L+N  +EA LYD   W      +  +DL
Sbjct: 269  FFKSNLAEDFSLADIQVEVEIDCSPEISKDSILANFVIEAGLYDTGSWYNC---DGCIDL 325

Query: 2549 SSYDVLHLNLRPPPTGSLGSHFYLLEGKLEKPRLWSSEHPNLYTLVLVLKDSSGNILDCE 2370
             S  V ++ L P  T S+    Y+L GKLE PRLWS+E PNLYTLV++LK +SG ++DCE
Sbjct: 326  LSSKVANIQLNPS-TASVEFPGYMLVGKLEMPRLWSAEQPNLYTLVVILKHASGPVVDCE 384

Query: 2369 SCQVGIRQISQAPKQLLVNGQPVVIRGVNRHEHHPRTGKAALEACMIKDLVVMKQNNINA 2190
            SC VGIRQ+S+APKQLLVNG PVVIRGVNRHEHHPR GK  +E+CM+KDLV+MKQNNINA
Sbjct: 385  SCLVGIRQVSKAPKQLLVNGNPVVIRGVNRHEHHPRVGKTNIESCMVKDLVLMKQNNINA 444

Query: 2189 VRNSHYPQHPRWYELCDLFGVYMIDEANIETHGFDLSSTFKHPTQESRWASSMLDRVIGM 2010
            VRNSHYPQHPRWYELCDLFG+YMIDEANIETHGF  S   KHPT E  WA++M+DRVIGM
Sbjct: 445  VRNSHYPQHPRWYELCDLFGLYMIDEANIETHGFYFSEHLKHPTMEPSWAAAMMDRVIGM 504

Query: 2009 VERDKNHACIISWSLGNESGYGPNHCAMAGWIREKDSSRLLHYEGGGSRTSSTDIICPMY 1830
            VERDKNHA II WSLGNE+G+GPNH A AGWIR KD SRLLHYEGGGSRT STDI+CPMY
Sbjct: 505  VERDKNHASIICWSLGNEAGHGPNHSAAAGWIRGKDPSRLLHYEGGGSRTPSTDIVCPMY 564

Query: 1829 MRVWDILKIAKDPNESRPLILCEYSHAMGNSNGNIHKYWEAINGTFGLQGGFIWDWVDQG 1650
            MRVWDI+ IAKDP E+RPLILCEYSHAMGNSNGNIH+YWEAI+ TFGLQGGFIWDWVDQG
Sbjct: 565  MRVWDIVMIAKDPTETRPLILCEYSHAMGNSNGNIHEYWEAIDSTFGLQGGFIWDWVDQG 624

Query: 1649 LLKVDVDGCKHWAYGGDFGDSPNDLNFCLNGIVWPDRTPHPAMHEVKYLYQPIKIVLAET 1470
            LL+   DG KHWAYGGDFGD+PNDLNFCLNG++WPDRTPHPA+HEVKY+YQ IK+ L + 
Sbjct: 625  LLRELADGTKHWAYGGDFGDTPNDLNFCLNGLLWPDRTPHPALHEVKYVYQAIKVSLKKG 684

Query: 1469 KVEITNAQYFNTTQEIEFSWHLHGDGCNLGCGTLNLPVIQAQESYGIEQESAPWYSVWKA 1290
             ++I+N  +F TTQ +EFSW  HGDG  LG G L+LP+I+   +Y IE +S+PWYS+W +
Sbjct: 685  TLKISNTNFFETTQGLEFSWVAHGDGYKLGFGILSLPLIKPHSNYEIELKSSPWYSLWNS 744

Query: 1289 CSAREVFLTIIGKLRHSTRWAKDCHVLASTQLILSAKDDSIPHVVRLISDSNLISKNVGN 1110
            CSA E+FLT+  KL +STRWA+  HV+++ Q+ L +K + +PHV+R   D+ ++ +N+GN
Sbjct: 745  CSAEEIFLTVTAKLMNSTRWAEAGHVISTAQVQLPSKRERLPHVIR-TGDAIILQENLGN 803

Query: 1109 IIAIEKQNSWQIKLNTQTGTIESWKVGECSLINQGLLPCLWRAPTDNDKGGAPNSYLCKW 930
             I +  QNSWQIK + QTG +ESWKV   S++ +G+ PC WRAPTDNDKGG  +SY  +W
Sbjct: 804  TIQLSHQNSWQIKFDIQTGAVESWKVEGVSVMKRGIFPCFWRAPTDNDKGGGESSYYSRW 863

Query: 929  KAALLDRLTFHTDECSVKELTSNMVHVKTVFIGVPKDQEILLKSKSSNQESEPIHIQCKV 750
            +AA +D L F T  CS++ +T   V ++ V+ G P+         SS  + E      ++
Sbjct: 864  RAAGIDSLVFLTKSCSIQNVTDYFVKIRVVYDGTPR------VDMSSLTKLEKAKALFEI 917

Query: 749  EVNYYIHSTGDVIVDYMVNPK-NDLPPLPRVGVVFHLEKSLDHVSWYGKGPFECYPDRKE 573
             ++Y I+ +G+VIV+    P  +DLPPLPRVGV FHLE+S+D + +YG+GPFECYPDRK 
Sbjct: 918  VIDYTIYGSGNVIVECNFKPNTSDLPPLPRVGVEFHLEQSMDKIKFYGRGPFECYPDRKA 977

Query: 572  AAHVGIYESSVQDLHVPYIFPVESSGRTDVRWVAFQNSSGVGLYASAYGTSPPMQMSASF 393
            AAHV +YE  V D+HVPYI P E +GR DVRWV FQN  G+G+YAS Y +SPPMQ++AS+
Sbjct: 978  AAHVDVYEQIVGDMHVPYIVPGECAGRADVRWVTFQNKEGIGIYASMYSSSPPMQLNASY 1037

Query: 392  YGTAELERATHNHYLVEKDYVEVHLDHKHMGIGGDDSWSPCVHDEYLVPPVPYSFSIRLS 213
            Y T EL+RATHN  LV++D +EVHLDHKHMG+GGDDSW+PCVHD+YLVP V YSFSIRLS
Sbjct: 1038 YTTTELDRATHNEQLVKEDKIEVHLDHKHMGLGGDDSWTPCVHDKYLVPAVAYSFSIRLS 1097

Query: 212  PIIPPTTGQDIYRSQI 165
            P+   T+G DIY+SQ+
Sbjct: 1098 PVTAATSGYDIYKSQM 1113


>ref|XP_002877978.1| hydrolase, hydrolyzing O-glycosyl compounds [Arabidopsis lyrata
            subsp. lyrata] gi|297323816|gb|EFH54237.1| hydrolase,
            hydrolyzing O-glycosyl compounds [Arabidopsis lyrata
            subsp. lyrata]
          Length = 1107

 Score = 1348 bits (3490), Expect = 0.0
 Identities = 626/974 (64%), Positives = 758/974 (77%), Gaps = 1/974 (0%)
 Frame = -1

Query: 3083 NPPYVPSDNPTGCYRKSFRIPKEWKGRRILLHFEAVDSAFFVWVNGVLIGYSQDSRLPAE 2904
            +PP+VP DNPTGCYR  F+IPKEWK RRILLHFEAVDSAFF W+NG  +GYSQDSRLPAE
Sbjct: 148  DPPHVPEDNPTGCYRTYFQIPKEWKDRRILLHFEAVDSAFFAWINGNPVGYSQDSRLPAE 207

Query: 2903 FEITDHCYPFDLDKDNVLSVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFITDYF 2724
            FEI+D+CYP+D  K NVL+VQV RWSDGSYLEDQDHWWLSGIHRDVLLL+KP+VFI DYF
Sbjct: 208  FEISDYCYPWDSGKQNVLAVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLAKPKVFIADYF 267

Query: 2723 FKSTLDQNFLTADLQVEVKLD-LWQNCENVSLSNITMEATLYDNSGWSTSGKDNEEMDLS 2547
            FKS L  +F  AD+QVEVK+D + ++ +++ LSN  +EA ++D   W  S   N E+   
Sbjct: 268  FKSKLADDFSYADIQVEVKIDNMQESSKHLVLSNFIIEAAVFDTKNWYNSEGFNCEL--- 324

Query: 2546 SYDVLHLNLRPPPTGSLGSHFYLLEGKLEKPRLWSSEHPNLYTLVLVLKDSSGNILDCES 2367
            S  V HL L P P+ +LG H YLLEGKL+ P LWS+E PN+Y LVL LKD+SG +LD ES
Sbjct: 325  SPKVAHLKLNPSPSPTLGFHGYLLEGKLDSPNLWSAEQPNVYILVLTLKDTSGKVLDSES 384

Query: 2366 CQVGIRQISQAPKQLLVNGQPVVIRGVNRHEHHPRTGKAALEACMIKDLVVMKQNNINAV 2187
              VGIRQ+S+A KQLLVNG PVVI+GVNRHEHHPR GK  +EACM+KDL++MK+ NINAV
Sbjct: 385  SIVGIRQVSKAFKQLLVNGHPVVIKGVNRHEHHPRVGKTNIEACMVKDLIMMKEYNINAV 444

Query: 2186 RNSHYPQHPRWYELCDLFGVYMIDEANIETHGFDLSSTFKHPTQESRWASSMLDRVIGMV 2007
            RNSHYPQHPRWYELCDLFG+YMIDEANIETHGFDLS   KHP +E  WA++MLDRV+GMV
Sbjct: 445  RNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHLKHPAKEPSWAAAMLDRVVGMV 504

Query: 2006 ERDKNHACIISWSLGNESGYGPNHCAMAGWIREKDSSRLLHYEGGGSRTSSTDIICPMYM 1827
            ERDKNH CIISWSLGNE+GYGPNH AMAGWIREKD SRL+HYEGGGSRTSSTDI+CPMYM
Sbjct: 505  ERDKNHTCIISWSLGNEAGYGPNHSAMAGWIREKDPSRLVHYEGGGSRTSSTDIVCPMYM 564

Query: 1826 RVWDILKIAKDPNESRPLILCEYSHAMGNSNGNIHKYWEAINGTFGLQGGFIWDWVDQGL 1647
            RVWDI+KIA D NESRPLILCEY HAMGNSNGNI +YW+AI+ TFGLQGGFIWDWVDQGL
Sbjct: 565  RVWDIIKIALDQNESRPLILCEYQHAMGNSNGNIDEYWDAIDNTFGLQGGFIWDWVDQGL 624

Query: 1646 LKVDVDGCKHWAYGGDFGDSPNDLNFCLNGIVWPDRTPHPAMHEVKYLYQPIKIVLAETK 1467
            LK+  DG K WAYGGDFGD PNDLNFCLNG++WPDRTPHPA+HEVK+ YQPIK+ L +  
Sbjct: 625  LKLGSDGIKRWAYGGDFGDQPNDLNFCLNGLIWPDRTPHPALHEVKHCYQPIKVSLTDGL 684

Query: 1466 VEITNAQYFNTTQEIEFSWHLHGDGCNLGCGTLNLPVIQAQESYGIEQESAPWYSVWKAC 1287
            +++ N  +F+TT+E+EFSW +HGDG  LG GTL++PVI+ Q S+ IE +S PW+S W   
Sbjct: 685  IKVANTYFFHTTEELEFSWKIHGDGLELGSGTLSIPVIKPQNSFEIEWKSGPWFSFWNDS 744

Query: 1286 SAREVFLTIIGKLRHSTRWAKDCHVLASTQLILSAKDDSIPHVVRLISDSNLISKNVGNI 1107
            +A E+FLTI  KL + TR  +  H+L+STQ+ L AK   IP  ++  +D+ +  + VG+ 
Sbjct: 745  NAGELFLTINAKLLNPTRSLEAGHLLSSTQIPLPAKRQIIPQAIKK-TDTIITCETVGDF 803

Query: 1106 IAIEKQNSWQIKLNTQTGTIESWKVGECSLINQGLLPCLWRAPTDNDKGGAPNSYLCKWK 927
            I I +Q+SW++ +N + G IE WK+    L+ + +LPC WRAPTDNDKGG  +SY  +WK
Sbjct: 804  IKISQQDSWELMINVRKGAIEGWKIQGVLLMKEDILPCFWRAPTDNDKGGGDSSYFLRWK 863

Query: 926  AALLDRLTFHTDECSVKELTSNMVHVKTVFIGVPKDQEILLKSKSSNQESEPIHIQCKVE 747
            AA LD + F  + CSVK +T   V ++ +++G            SS   S       KV 
Sbjct: 864  AAQLDNVEFLVESCSVKSITDKAVEIEFIYLG------------SSASVSSKTDALFKVN 911

Query: 746  VNYYIHSTGDVIVDYMVNPKNDLPPLPRVGVVFHLEKSLDHVSWYGKGPFECYPDRKEAA 567
            V Y I+ +GD+I ++ V P +DLPPLPRVG+ FH+EK+LD V WYGKGPFECYPDRK AA
Sbjct: 912  VTYLIYGSGDIITNWSVEPNSDLPPLPRVGIEFHIEKTLDRVEWYGKGPFECYPDRKAAA 971

Query: 566  HVGIYESSVQDLHVPYIFPVESSGRTDVRWVAFQNSSGVGLYASAYGTSPPMQMSASFYG 387
            HV IYE +V D+HVPYI P ES GRTDVRWV F+N  GVG+YAS YG S PMQM+AS+Y 
Sbjct: 972  HVAIYEHNVGDMHVPYIVPGESGGRTDVRWVTFRNKDGVGIYASTYGNSSPMQMNASYYT 1031

Query: 386  TAELERATHNHYLVEKDYVEVHLDHKHMGIGGDDSWSPCVHDEYLVPPVPYSFSIRLSPI 207
            T EL RATH   L++   +EVHLDHKHMG+GGDDSW+PCVHD+YL+PP PYSFS+RL PI
Sbjct: 1032 TGELNRATHEEDLIKGQNIEVHLDHKHMGLGGDDSWTPCVHDKYLIPPKPYSFSLRLCPI 1091

Query: 206  IPPTTGQDIYRSQI 165
               T+  DIY+ Q+
Sbjct: 1092 TASTSVLDIYKDQL 1105


>ref|NP_001190087.1| glycoside hydrolase family 2 protein [Arabidopsis thaliana]
            gi|332645711|gb|AEE79232.1| glycoside hydrolase family 2
            protein [Arabidopsis thaliana]
          Length = 1120

 Score = 1347 bits (3487), Expect = 0.0
 Identities = 623/974 (63%), Positives = 765/974 (78%), Gaps = 1/974 (0%)
 Frame = -1

Query: 3083 NPPYVPSDNPTGCYRKSFRIPKEWKGRRILLHFEAVDSAFFVWVNGVLIGYSQDSRLPAE 2904
            +PPYVP DNPTGCYR  F+IPKEWK RRILLHFEAVDSAFF W+NG  +GYSQDSRLPAE
Sbjct: 161  DPPYVPEDNPTGCYRTYFQIPKEWKDRRILLHFEAVDSAFFAWINGNPVGYSQDSRLPAE 220

Query: 2903 FEITDHCYPFDLDKDNVLSVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFITDYF 2724
            FEI+D+CYP+D  K NVL+VQV RWSDGSYLEDQDHWWLSGIHRDVLLL+KP+VFI DYF
Sbjct: 221  FEISDYCYPWDSGKQNVLAVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLAKPKVFIADYF 280

Query: 2723 FKSTLDQNFLTADLQVEVKLD-LWQNCENVSLSNITMEATLYDNSGWSTSGKDNEEMDLS 2547
            FKS L  +F  AD+QVEVK+D + ++ +++ LSN  +EA ++D   W  S   + E+   
Sbjct: 281  FKSKLADDFSYADIQVEVKIDNMQESSKDLVLSNFIIEAAIFDTKNWYNSEGFSCEL--- 337

Query: 2546 SYDVLHLNLRPPPTGSLGSHFYLLEGKLEKPRLWSSEHPNLYTLVLVLKDSSGNILDCES 2367
            S  V +L L P P+ +LG H YLLEGKL+ P LWS+E PN+Y LVL LKD+SG +LD ES
Sbjct: 338  SPKVANLKLNPSPSPTLGFHGYLLEGKLDSPNLWSAEQPNVYILVLTLKDTSGKVLDSES 397

Query: 2366 CQVGIRQISQAPKQLLVNGQPVVIRGVNRHEHHPRTGKAALEACMIKDLVVMKQNNINAV 2187
              VGIRQ+S+A KQLLVNG PVVI+GVNRHEHHPR GK  +EACM+KDL++MK+ NINAV
Sbjct: 398  SIVGIRQVSKAFKQLLVNGHPVVIKGVNRHEHHPRVGKTNIEACMVKDLIMMKEYNINAV 457

Query: 2186 RNSHYPQHPRWYELCDLFGVYMIDEANIETHGFDLSSTFKHPTQESRWASSMLDRVIGMV 2007
            RNSHYPQHPRWYELCDLFG+YMIDEANIETHGFDLS   KHP +E  WA++MLDRV+GMV
Sbjct: 458  RNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHLKHPAKEPSWAAAMLDRVVGMV 517

Query: 2006 ERDKNHACIISWSLGNESGYGPNHCAMAGWIREKDSSRLLHYEGGGSRTSSTDIICPMYM 1827
            ERDKNH CIISWSLGNE+GYGPNH AMAGWIREKD SRL+HYEGGGSRTSSTDI+CPMYM
Sbjct: 518  ERDKNHTCIISWSLGNEAGYGPNHSAMAGWIREKDPSRLVHYEGGGSRTSSTDIVCPMYM 577

Query: 1826 RVWDILKIAKDPNESRPLILCEYSHAMGNSNGNIHKYWEAINGTFGLQGGFIWDWVDQGL 1647
            RVWDI+KIA D NESRPLILCEY HAMGNSNGNI +YWEAI+ TFGLQGGFIWDWVDQGL
Sbjct: 578  RVWDIIKIALDQNESRPLILCEYQHAMGNSNGNIDEYWEAIDNTFGLQGGFIWDWVDQGL 637

Query: 1646 LKVDVDGCKHWAYGGDFGDSPNDLNFCLNGIVWPDRTPHPAMHEVKYLYQPIKIVLAETK 1467
            LK+  DG K WAYGGDFGD PNDLNFCLNG++WPDRTPHPA+HEVK+ YQPIK+ L +  
Sbjct: 638  LKLGSDGIKRWAYGGDFGDQPNDLNFCLNGLIWPDRTPHPALHEVKHCYQPIKVSLTDGM 697

Query: 1466 VEITNAQYFNTTQEIEFSWHLHGDGCNLGCGTLNLPVIQAQESYGIEQESAPWYSVWKAC 1287
            +++ N  +FNTT+E+EFSW +HGDG  LG GTL++PVI+ Q S+ +E +S PW+S W   
Sbjct: 698  IKVANTYFFNTTEELEFSWTIHGDGLELGSGTLSIPVIKPQNSFEMEWKSGPWFSFWNDS 757

Query: 1286 SAREVFLTIIGKLRHSTRWAKDCHVLASTQLILSAKDDSIPHVVRLISDSNLISKNVGNI 1107
            +A E+FLTI  KL + TR  +  H+L+STQ+ L AK   IP  ++  +D+++  + VG+ 
Sbjct: 758  NAGELFLTINAKLLNLTRSLEAGHLLSSTQIPLPAKGQIIPQAIKK-TDTSITCETVGDF 816

Query: 1106 IAIEKQNSWQIKLNTQTGTIESWKVGECSLINQGLLPCLWRAPTDNDKGGAPNSYLCKWK 927
            I I +++SW++ +N + GTIE WK+    L+N+ +LPC WRAPTDNDKGG  +SY  +WK
Sbjct: 817  IKISQKDSWELMVNVRKGTIEGWKIQGVLLMNEAILPCFWRAPTDNDKGGGDSSYFSRWK 876

Query: 926  AALLDRLTFHTDECSVKELTSNMVHVKTVFIGVPKDQEILLKSKSSNQESEPIHIQCKVE 747
            AA LD + F  + CSVK +T   V ++ +++G          S S + +S+ +    KV 
Sbjct: 877  AAQLDNVEFLVESCSVKSITDKSVEIEFIYLG---------SSASGSSKSDALF---KVN 924

Query: 746  VNYYIHSTGDVIVDYMVNPKNDLPPLPRVGVVFHLEKSLDHVSWYGKGPFECYPDRKEAA 567
            V Y I+ +GD+I ++ V P +DLPPLPRVG+ FH+EK+LD V WYGKGPFECYPDRK AA
Sbjct: 925  VTYLIYGSGDIITNWFVEPNSDLPPLPRVGIEFHIEKTLDRVEWYGKGPFECYPDRKAAA 984

Query: 566  HVGIYESSVQDLHVPYIFPVESSGRTDVRWVAFQNSSGVGLYASAYGTSPPMQMSASFYG 387
            HV IYE +V D+HVPYI P E+ GRTDVRWV F+N  GVG+YAS YG+S  MQM+AS+Y 
Sbjct: 985  HVAIYEHNVGDMHVPYIVPGENGGRTDVRWVTFRNKDGVGIYASTYGSSSLMQMNASYYT 1044

Query: 386  TAELERATHNHYLVEKDYVEVHLDHKHMGIGGDDSWSPCVHDEYLVPPVPYSFSIRLSPI 207
            T EL RATH   L++   +EVHLDHKHMG+GGDDSW+PCVHD++L+PP  YSFS+RL PI
Sbjct: 1045 TGELHRATHEEDLIKGQNIEVHLDHKHMGLGGDDSWTPCVHDKFLIPPAQYSFSLRLCPI 1104

Query: 206  IPPTTGQDIYRSQI 165
               T+G +IY+ Q+
Sbjct: 1105 TASTSGLNIYKDQL 1118


>ref|NP_680128.1| glycoside hydrolase family 2 protein [Arabidopsis thaliana]
            gi|20147224|gb|AAM10327.1| At3g54435 [Arabidopsis
            thaliana] gi|332645709|gb|AEE79230.1| glycoside hydrolase
            family 2 protein [Arabidopsis thaliana]
          Length = 1107

 Score = 1347 bits (3487), Expect = 0.0
 Identities = 623/974 (63%), Positives = 765/974 (78%), Gaps = 1/974 (0%)
 Frame = -1

Query: 3083 NPPYVPSDNPTGCYRKSFRIPKEWKGRRILLHFEAVDSAFFVWVNGVLIGYSQDSRLPAE 2904
            +PPYVP DNPTGCYR  F+IPKEWK RRILLHFEAVDSAFF W+NG  +GYSQDSRLPAE
Sbjct: 148  DPPYVPEDNPTGCYRTYFQIPKEWKDRRILLHFEAVDSAFFAWINGNPVGYSQDSRLPAE 207

Query: 2903 FEITDHCYPFDLDKDNVLSVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFITDYF 2724
            FEI+D+CYP+D  K NVL+VQV RWSDGSYLEDQDHWWLSGIHRDVLLL+KP+VFI DYF
Sbjct: 208  FEISDYCYPWDSGKQNVLAVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLAKPKVFIADYF 267

Query: 2723 FKSTLDQNFLTADLQVEVKLD-LWQNCENVSLSNITMEATLYDNSGWSTSGKDNEEMDLS 2547
            FKS L  +F  AD+QVEVK+D + ++ +++ LSN  +EA ++D   W  S   + E+   
Sbjct: 268  FKSKLADDFSYADIQVEVKIDNMQESSKDLVLSNFIIEAAIFDTKNWYNSEGFSCEL--- 324

Query: 2546 SYDVLHLNLRPPPTGSLGSHFYLLEGKLEKPRLWSSEHPNLYTLVLVLKDSSGNILDCES 2367
            S  V +L L P P+ +LG H YLLEGKL+ P LWS+E PN+Y LVL LKD+SG +LD ES
Sbjct: 325  SPKVANLKLNPSPSPTLGFHGYLLEGKLDSPNLWSAEQPNVYILVLTLKDTSGKVLDSES 384

Query: 2366 CQVGIRQISQAPKQLLVNGQPVVIRGVNRHEHHPRTGKAALEACMIKDLVVMKQNNINAV 2187
              VGIRQ+S+A KQLLVNG PVVI+GVNRHEHHPR GK  +EACM+KDL++MK+ NINAV
Sbjct: 385  SIVGIRQVSKAFKQLLVNGHPVVIKGVNRHEHHPRVGKTNIEACMVKDLIMMKEYNINAV 444

Query: 2186 RNSHYPQHPRWYELCDLFGVYMIDEANIETHGFDLSSTFKHPTQESRWASSMLDRVIGMV 2007
            RNSHYPQHPRWYELCDLFG+YMIDEANIETHGFDLS   KHP +E  WA++MLDRV+GMV
Sbjct: 445  RNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHLKHPAKEPSWAAAMLDRVVGMV 504

Query: 2006 ERDKNHACIISWSLGNESGYGPNHCAMAGWIREKDSSRLLHYEGGGSRTSSTDIICPMYM 1827
            ERDKNH CIISWSLGNE+GYGPNH AMAGWIREKD SRL+HYEGGGSRTSSTDI+CPMYM
Sbjct: 505  ERDKNHTCIISWSLGNEAGYGPNHSAMAGWIREKDPSRLVHYEGGGSRTSSTDIVCPMYM 564

Query: 1826 RVWDILKIAKDPNESRPLILCEYSHAMGNSNGNIHKYWEAINGTFGLQGGFIWDWVDQGL 1647
            RVWDI+KIA D NESRPLILCEY HAMGNSNGNI +YWEAI+ TFGLQGGFIWDWVDQGL
Sbjct: 565  RVWDIIKIALDQNESRPLILCEYQHAMGNSNGNIDEYWEAIDNTFGLQGGFIWDWVDQGL 624

Query: 1646 LKVDVDGCKHWAYGGDFGDSPNDLNFCLNGIVWPDRTPHPAMHEVKYLYQPIKIVLAETK 1467
            LK+  DG K WAYGGDFGD PNDLNFCLNG++WPDRTPHPA+HEVK+ YQPIK+ L +  
Sbjct: 625  LKLGSDGIKRWAYGGDFGDQPNDLNFCLNGLIWPDRTPHPALHEVKHCYQPIKVSLTDGM 684

Query: 1466 VEITNAQYFNTTQEIEFSWHLHGDGCNLGCGTLNLPVIQAQESYGIEQESAPWYSVWKAC 1287
            +++ N  +FNTT+E+EFSW +HGDG  LG GTL++PVI+ Q S+ +E +S PW+S W   
Sbjct: 685  IKVANTYFFNTTEELEFSWTIHGDGLELGSGTLSIPVIKPQNSFEMEWKSGPWFSFWNDS 744

Query: 1286 SAREVFLTIIGKLRHSTRWAKDCHVLASTQLILSAKDDSIPHVVRLISDSNLISKNVGNI 1107
            +A E+FLTI  KL + TR  +  H+L+STQ+ L AK   IP  ++  +D+++  + VG+ 
Sbjct: 745  NAGELFLTINAKLLNLTRSLEAGHLLSSTQIPLPAKGQIIPQAIKK-TDTSITCETVGDF 803

Query: 1106 IAIEKQNSWQIKLNTQTGTIESWKVGECSLINQGLLPCLWRAPTDNDKGGAPNSYLCKWK 927
            I I +++SW++ +N + GTIE WK+    L+N+ +LPC WRAPTDNDKGG  +SY  +WK
Sbjct: 804  IKISQKDSWELMVNVRKGTIEGWKIQGVLLMNEAILPCFWRAPTDNDKGGGDSSYFSRWK 863

Query: 926  AALLDRLTFHTDECSVKELTSNMVHVKTVFIGVPKDQEILLKSKSSNQESEPIHIQCKVE 747
            AA LD + F  + CSVK +T   V ++ +++G          S S + +S+ +    KV 
Sbjct: 864  AAQLDNVEFLVESCSVKSITDKSVEIEFIYLG---------SSASGSSKSDALF---KVN 911

Query: 746  VNYYIHSTGDVIVDYMVNPKNDLPPLPRVGVVFHLEKSLDHVSWYGKGPFECYPDRKEAA 567
            V Y I+ +GD+I ++ V P +DLPPLPRVG+ FH+EK+LD V WYGKGPFECYPDRK AA
Sbjct: 912  VTYLIYGSGDIITNWFVEPNSDLPPLPRVGIEFHIEKTLDRVEWYGKGPFECYPDRKAAA 971

Query: 566  HVGIYESSVQDLHVPYIFPVESSGRTDVRWVAFQNSSGVGLYASAYGTSPPMQMSASFYG 387
            HV IYE +V D+HVPYI P E+ GRTDVRWV F+N  GVG+YAS YG+S  MQM+AS+Y 
Sbjct: 972  HVAIYEHNVGDMHVPYIVPGENGGRTDVRWVTFRNKDGVGIYASTYGSSSLMQMNASYYT 1031

Query: 386  TAELERATHNHYLVEKDYVEVHLDHKHMGIGGDDSWSPCVHDEYLVPPVPYSFSIRLSPI 207
            T EL RATH   L++   +EVHLDHKHMG+GGDDSW+PCVHD++L+PP  YSFS+RL PI
Sbjct: 1032 TGELHRATHEEDLIKGQNIEVHLDHKHMGLGGDDSWTPCVHDKFLIPPAQYSFSLRLCPI 1091

Query: 206  IPPTTGQDIYRSQI 165
               T+G +IY+ Q+
Sbjct: 1092 TASTSGLNIYKDQL 1105


>gb|EXB62674.1| hypothetical protein L484_023970 [Morus notabilis]
          Length = 1119

 Score = 1346 bits (3484), Expect = 0.0
 Identities = 631/974 (64%), Positives = 763/974 (78%)
 Frame = -1

Query: 3086 MNPPYVPSDNPTGCYRKSFRIPKEWKGRRILLHFEAVDSAFFVWVNGVLIGYSQDSRLPA 2907
            ++PP+VP +NPTGCYRK FRIPKEWKGRRI LHFEAVDSAFF WVNGVL+GYSQDSRLPA
Sbjct: 176  LDPPFVPEENPTGCYRKYFRIPKEWKGRRIFLHFEAVDSAFFAWVNGVLVGYSQDSRLPA 235

Query: 2906 EFEITDHCYPFDLDKDNVLSVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFITDY 2727
            EFEITD+C+ F  + +NVL+VQV+RWSDGSYLEDQDHWWLSGIHRDVLLL+KP V +   
Sbjct: 236  EFEITDYCHSFGSESENVLAVQVLRWSDGSYLEDQDHWWLSGIHRDVLLLAKPPVEVI-- 293

Query: 2726 FFKSTLDQNFLTADLQVEVKLDLWQNCENVSLSNITMEATLYDNSGWSTSGKDNEEMDLS 2547
                 +D +  T               ENV LS  T+EA+LYD  GW    K++   DL 
Sbjct: 294  -----IDNSLETPK-------------ENV-LSRFTIEASLYDTEGWY---KNDASADLI 331

Query: 2546 SYDVLHLNLRPPPTGSLGSHFYLLEGKLEKPRLWSSEHPNLYTLVLVLKDSSGNILDCES 2367
            S +V ++ L    T  LG H YLL GKLE P+LWS+E PNLYTLV++LKD+SGN++DCES
Sbjct: 332  SSNVANMKLNISSTARLGFHGYLLSGKLETPKLWSAEQPNLYTLVVILKDASGNVIDCES 391

Query: 2366 CQVGIRQISQAPKQLLVNGQPVVIRGVNRHEHHPRTGKAALEACMIKDLVVMKQNNINAV 2187
            C VGIRQ+S+APKQLLVNG+P++IRGVNRHEHHPR GK  +E+CMIKDLV+MKQNN NAV
Sbjct: 392  CIVGIRQVSKAPKQLLVNGRPIMIRGVNRHEHHPRLGKTNIESCMIKDLVLMKQNNFNAV 451

Query: 2186 RNSHYPQHPRWYELCDLFGVYMIDEANIETHGFDLSSTFKHPTQESRWASSMLDRVIGMV 2007
            RNSHYPQH RWYELCDLFG+YMIDEANIETHGFD S   KHPT E  WA++M+DRVIGMV
Sbjct: 452  RNSHYPQHSRWYELCDLFGMYMIDEANIETHGFDHSGDVKHPTLEPSWANAMMDRVIGMV 511

Query: 2006 ERDKNHACIISWSLGNESGYGPNHCAMAGWIREKDSSRLLHYEGGGSRTSSTDIICPMYM 1827
            ERDKNHACIISWSLGNESGYGPNH A AGW+R KD SRLLHYEGGGSRTSSTDI+CPMYM
Sbjct: 512  ERDKNHACIISWSLGNESGYGPNHSASAGWVRGKDPSRLLHYEGGGSRTSSTDIVCPMYM 571

Query: 1826 RVWDILKIAKDPNESRPLILCEYSHAMGNSNGNIHKYWEAINGTFGLQGGFIWDWVDQGL 1647
            RVWDI+KIA DP E RPLILCEYSH+MGNSNGNIH+YWEAI+ TFGLQGGFIWDWVDQGL
Sbjct: 572  RVWDIVKIANDPEEKRPLILCEYSHSMGNSNGNIHEYWEAIDNTFGLQGGFIWDWVDQGL 631

Query: 1646 LKVDVDGCKHWAYGGDFGDSPNDLNFCLNGIVWPDRTPHPAMHEVKYLYQPIKIVLAETK 1467
            LK D +  K WAYGGDFGD PNDLNFCLNG++WPDRTPHPA+HEVKYLYQPIKI   E  
Sbjct: 632  LK-DTEKGKRWAYGGDFGDVPNDLNFCLNGLIWPDRTPHPALHEVKYLYQPIKISFLEGT 690

Query: 1466 VEITNAQYFNTTQEIEFSWHLHGDGCNLGCGTLNLPVIQAQESYGIEQESAPWYSVWKAC 1287
            ++ITN  +F TT+ +EFSW  HGDG  LG G L++PVI+ Q  Y IE +S PWY++W + 
Sbjct: 691  LKITNTHFFETTKGMEFSWSAHGDGFELGSGILSIPVIEPQNGYDIEWKSCPWYNLWFSS 750

Query: 1286 SAREVFLTIIGKLRHSTRWAKDCHVLASTQLILSAKDDSIPHVVRLISDSNLISKNVGNI 1107
            SA E F+TI  KL HST W +  HV++STQ+ L +K     H ++   D  L+++ + ++
Sbjct: 751  SAEETFVTITAKLLHSTLWVEAGHVISSTQVQLPSKGKLARHEIK-TKDGTLVTEILKDV 809

Query: 1106 IAIEKQNSWQIKLNTQTGTIESWKVGECSLINQGLLPCLWRAPTDNDKGGAPNSYLCKWK 927
            I I ++NSW+I LN++TG IESWKVG   ++N+G+ PC WRAPTDNDKGG  NSYL  WK
Sbjct: 810  IKISEENSWEIILNSRTGIIESWKVGGVLVMNKGIFPCFWRAPTDNDKGGGSNSYLSLWK 869

Query: 926  AALLDRLTFHTDECSVKELTSNMVHVKTVFIGVPKDQEILLKSKSSNQESEPIHIQCKVE 747
            A+ +D L + T+ CSV+ +T ++V +  VF+GVP+D+E L     + +      +  KVE
Sbjct: 870  ASRIDSLHYITESCSVQNVTDHLVQIAVVFLGVPRDEEGLSSDLGNRK------VLIKVE 923

Query: 746  VNYYIHSTGDVIVDYMVNPKNDLPPLPRVGVVFHLEKSLDHVSWYGKGPFECYPDRKEAA 567
            V Y I+S+GDVI++  V PK DLPPLPRVGV FHLEKS++ + WYG+GPFECYPDRK +A
Sbjct: 924  VVYTINSSGDVIMNCNVVPKADLPPLPRVGVEFHLEKSVNQIKWYGRGPFECYPDRKASA 983

Query: 566  HVGIYESSVQDLHVPYIFPVESSGRTDVRWVAFQNSSGVGLYASAYGTSPPMQMSASFYG 387
            HV +YE +V ++HVPYI P E SGR DVRW  F N +G G+YAS YG+SPPMQMSAS+Y 
Sbjct: 984  HVDVYEKNVDEMHVPYIVPGECSGRADVRWATFLNRNGFGIYASIYGSSPPMQMSASYYS 1043

Query: 386  TAELERATHNHYLVEKDYVEVHLDHKHMGIGGDDSWSPCVHDEYLVPPVPYSFSIRLSPI 207
             AEL++A HN  LV+ D +EVHLDHKHMG+GGDDSWSPCVH+ YL+P V YSFS+RL P+
Sbjct: 1044 AAELDQAIHNEDLVKGDNIEVHLDHKHMGLGGDDSWSPCVHNNYLIPAVQYSFSMRLCPV 1103

Query: 206  IPPTTGQDIYRSQI 165
             P T+GQ+IY++Q+
Sbjct: 1104 TPATSGQEIYKTQL 1117


>ref|NP_001030858.1| glycoside hydrolase family 2 protein [Arabidopsis thaliana]
            gi|332645710|gb|AEE79231.1| glycoside hydrolase family 2
            protein [Arabidopsis thaliana]
          Length = 1108

 Score = 1343 bits (3475), Expect = 0.0
 Identities = 623/975 (63%), Positives = 765/975 (78%), Gaps = 2/975 (0%)
 Frame = -1

Query: 3083 NPPYVPSDNPTGCYRKSFRIPKEWKGRRILLHFEAVDSAFFVWVNGVLIGYSQDSRLPAE 2904
            +PPYVP DNPTGCYR  F+IPKEWK RRILLHFEAVDSAFF W+NG  +GYSQDSRLPAE
Sbjct: 148  DPPYVPEDNPTGCYRTYFQIPKEWKDRRILLHFEAVDSAFFAWINGNPVGYSQDSRLPAE 207

Query: 2903 FEITDHCYPFDLDKDNVLSVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFITDYF 2724
            FEI+D+CYP+D  K NVL+VQV RWSDGSYLEDQDHWWLSGIHRDVLLL+KP+VFI DYF
Sbjct: 208  FEISDYCYPWDSGKQNVLAVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLAKPKVFIADYF 267

Query: 2723 FKSTLDQNFLTADLQVEVKLD-LWQNCENVSLSNITMEATLYDNSGWSTSGKDNEEMDLS 2547
            FKS L  +F  AD+QVEVK+D + ++ +++ LSN  +EA ++D   W  S   + E+   
Sbjct: 268  FKSKLADDFSYADIQVEVKIDNMQESSKDLVLSNFIIEAAIFDTKNWYNSEGFSCEL--- 324

Query: 2546 SYDVLHLNLRPPPTGSLGSHFYLLEGKLEKPRLWSSEHPNLYTLVLVLKDSSGNILDCES 2367
            S  V +L L P P+ +LG H YLLEGKL+ P LWS+E PN+Y LVL LKD+SG +LD ES
Sbjct: 325  SPKVANLKLNPSPSPTLGFHGYLLEGKLDSPNLWSAEQPNVYILVLTLKDTSGKVLDSES 384

Query: 2366 CQVGIRQISQAPKQLLVNGQPVVIRGVNRHEHHPRTGKAALEACMIKDLVVMKQNNINAV 2187
              VGIRQ+S+A KQLLVNG PVVI+GVNRHEHHPR GK  +EACM+KDL++MK+ NINAV
Sbjct: 385  SIVGIRQVSKAFKQLLVNGHPVVIKGVNRHEHHPRVGKTNIEACMVKDLIMMKEYNINAV 444

Query: 2186 RNSHYPQHPRWYELCDLFGVYMIDEANIETHGFDLSSTFKHPTQESRWASSMLDRVIGMV 2007
            RNSHYPQHPRWYELCDLFG+YMIDEANIETHGFDLS   KHP +E  WA++MLDRV+GMV
Sbjct: 445  RNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHLKHPAKEPSWAAAMLDRVVGMV 504

Query: 2006 ERDKNHACIISWSLGNESGYGPNHCAMAGWIREKDSSRLLHYEGGGSRTSSTDIICPMYM 1827
            ERDKNH CIISWSLGNE+GYGPNH AMAGWIREKD SRL+HYEGGGSRTSSTDI+CPMYM
Sbjct: 505  ERDKNHTCIISWSLGNEAGYGPNHSAMAGWIREKDPSRLVHYEGGGSRTSSTDIVCPMYM 564

Query: 1826 RVWDILKIAKDPNESRPLILCEYSHAMGNSNGNIHKYWEAINGTFGLQGGFIWDWVDQGL 1647
            RVWDI+KIA D NESRPLILCEY HAMGNSNGNI +YWEAI+ TFGLQGGFIWDWVDQGL
Sbjct: 565  RVWDIIKIALDQNESRPLILCEYQHAMGNSNGNIDEYWEAIDNTFGLQGGFIWDWVDQGL 624

Query: 1646 LKVDVDGCKHWAYGGDFGDSPNDLNFCLNGIVWPDRTPHPAMHEVKYLYQPIKIVLAETK 1467
            LK+  DG K WAYGGDFGD PNDLNFCLNG++WPDRTPHPA+HEVK+ YQPIK+ L +  
Sbjct: 625  LKLGSDGIKRWAYGGDFGDQPNDLNFCLNGLIWPDRTPHPALHEVKHCYQPIKVSLTDGM 684

Query: 1466 VEITNAQYFNTTQEIEFSWHLHGDGCNLGCGTLNLPVIQAQESYGIEQESAPWYSVWKAC 1287
            +++ N  +FNTT+E+EFSW +HGDG  LG GTL++PVI+ Q S+ +E +S PW+S W   
Sbjct: 685  IKVANTYFFNTTEELEFSWTIHGDGLELGSGTLSIPVIKPQNSFEMEWKSGPWFSFWNDS 744

Query: 1286 SAREVFLTIIGKLRHSTRWAKDCHVLASTQLILSAKDDSIPHVVRLISDSNLISKNVGNI 1107
            +A E+FLTI  KL + TR  +  H+L+STQ+ L AK   IP  ++  +D+++  + VG+ 
Sbjct: 745  NAGELFLTINAKLLNLTRSLEAGHLLSSTQIPLPAKGQIIPQAIKK-TDTSITCETVGDF 803

Query: 1106 IAIEKQNSWQIKLNTQTGTIESWKVGECSLINQGLLPCLWRAPTDNDKGGAPNSYLCKWK 927
            I I +++SW++ +N + GTIE WK+    L+N+ +LPC WRAPTDNDKGG  +SY  +WK
Sbjct: 804  IKISQKDSWELMVNVRKGTIEGWKIQGVLLMNEAILPCFWRAPTDNDKGGGDSSYFSRWK 863

Query: 926  AALLDRLTFHTDECSVKELTSNMVHVKTVFIGVPKDQEILLKSKSSNQESEPIHIQCKVE 747
            AA LD + F  + CSVK +T   V ++ +++G          S S + +S+ +    KV 
Sbjct: 864  AAQLDNVEFLVESCSVKSITDKSVEIEFIYLG---------SSASGSSKSDALF---KVN 911

Query: 746  VNYYIHSTGDVIVDYMVNPKNDLPPLPRVGVVFHLEKSLDHVSWYGKGPFECYPDRKEAA 567
            V Y I+ +GD+I ++ V P +DLPPLPRVG+ FH+EK+LD V WYGKGPFECYPDRK AA
Sbjct: 912  VTYLIYGSGDIITNWFVEPNSDLPPLPRVGIEFHIEKTLDRVEWYGKGPFECYPDRKAAA 971

Query: 566  HVGIYESSVQDLHVPYIFPVESSGRTDVRWVAFQNSSGVGLYASAYGTSPPMQMSASFYG 387
            HV IYE +V D+HVPYI P E+ GRTDVRWV F+N  GVG+YAS YG+S  MQM+AS+Y 
Sbjct: 972  HVAIYEHNVGDMHVPYIVPGENGGRTDVRWVTFRNKDGVGIYASTYGSSSLMQMNASYYT 1031

Query: 386  TAELERATHNHYLVEKDYVE-VHLDHKHMGIGGDDSWSPCVHDEYLVPPVPYSFSIRLSP 210
            T EL RATH   L++   +E VHLDHKHMG+GGDDSW+PCVHD++L+PP  YSFS+RL P
Sbjct: 1032 TGELHRATHEEDLIKGQNIEVVHLDHKHMGLGGDDSWTPCVHDKFLIPPAQYSFSLRLCP 1091

Query: 209  IIPPTTGQDIYRSQI 165
            I   T+G +IY+ Q+
Sbjct: 1092 ITASTSGLNIYKDQL 1106


Top