BLASTX nr result

ID: Ephedra25_contig00003005 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra25_contig00003005
         (1852 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002299206.2| glycoside hydrolase family 2 family protein ...   896   0.0  
ref|XP_003634896.1| PREDICTED: beta-galactosidase [Vitis vinifera]    890   0.0  
ref|XP_002266400.1| PREDICTED: beta-galactosidase isoform 1 [Vit...   890   0.0  
ref|XP_002513059.1| beta-galactosidase, putative [Ricinus commun...   887   0.0  
ref|XP_002303929.2| glycoside hydrolase family 2 family protein ...   885   0.0  
ref|XP_004971288.1| PREDICTED: beta-galactosidase-like isoform X...   884   0.0  
ref|XP_006403576.1| hypothetical protein EUTSA_v10010080mg [Eutr...   884   0.0  
gb|EOY19805.1| Glycoside hydrolase family 2 protein isoform 1 [T...   884   0.0  
dbj|BAJ86348.1| predicted protein [Hordeum vulgare subsp. vulgare]    881   0.0  
ref|XP_004308587.1| PREDICTED: beta-galactosidase-like [Fragaria...   880   0.0  
ref|NP_001030858.1| glycoside hydrolase family 2 protein [Arabid...   880   0.0  
ref|NP_001190087.1| glycoside hydrolase family 2 protein [Arabid...   880   0.0  
ref|NP_680128.1| glycoside hydrolase family 2 protein [Arabidops...   880   0.0  
ref|XP_006293102.1| hypothetical protein CARUB_v10019396mg [Caps...   879   0.0  
ref|NP_001045421.1| Os01g0952600 [Oryza sativa Japonica Group] g...   878   0.0  
ref|XP_002877978.1| hydrolase, hydrolyzing O-glycosyl compounds ...   878   0.0  
gb|EMJ20103.1| hypothetical protein PRUPE_ppa000532mg [Prunus pe...   877   0.0  
gb|EOY19806.1| Glycoside hydrolase family 2 protein isoform 2 [T...   875   0.0  
ref|XP_004142388.1| PREDICTED: beta-galactosidase-like [Cucumis ...   875   0.0  
ref|XP_006487669.1| PREDICTED: beta-galactosidase-like [Citrus s...   874   0.0  

>ref|XP_002299206.2| glycoside hydrolase family 2 family protein [Populus trichocarpa]
            gi|550346663|gb|EEE84011.2| glycoside hydrolase family 2
            family protein [Populus trichocarpa]
          Length = 1110

 Score =  896 bits (2315), Expect = 0.0
 Identities = 420/617 (68%), Positives = 497/617 (80%), Gaps = 2/617 (0%)
 Frame = -3

Query: 1847 WVKGLCYIRSLSGQWKFLLYDKPEAVHDKFQDETFDDSLWATLPVPSNWQLHGFDRPIYT 1668
            WVK L +++SLSG WKF L   P +V +KF    F+DS W TLPVPSNW++HG+DRPIYT
Sbjct: 80   WVKDLPFVQSLSGLWKFFLAPDPTSVPNKFYGTAFEDSEWETLPVPSNWEMHGYDRPIYT 139

Query: 1667 NTDYPFSFNPPRAPEENPTGCYRKSFLLPKEWTGRRILLHFEAVDSAFFAWVNGQFVGYS 1488
            N  YPF  +PP  P++NPTGCYR  F +P+EW GRRILLHFEAVDSAF AW+NG  VGYS
Sbjct: 140  NVIYPFPVDPPHVPDDNPTGCYRTYFDIPEEWQGRRILLHFEAVDSAFCAWINGVPVGYS 199

Query: 1487 QDSRLPAEFEITKYCYEPGSTKENILAVQVMRWSDGSYLEDQDHWWLSGIHRDVILISKP 1308
            QDSRLPAEFEIT YC+  GS K+N+LAVQV RWSDGSYLEDQDHWWLSG+HRDV+L+SKP
Sbjct: 200  QDSRLPAEFEITDYCHPCGSGKKNVLAVQVFRWSDGSYLEDQDHWWLSGVHRDVLLLSKP 259

Query: 1307 KVFISDYFFKSCLMSDFSSSDIEVEVLIEGSHQMVVDGQLGLYSVECFVYDAERNSAESG 1128
            +VFI+DYFFKS L  +F+ +DI+VEV IE S  +  +  L  +++E  +YD    S    
Sbjct: 260  QVFIADYFFKSNLAENFTCADIQVEVKIESSLAIPKEKILANFTIEAALYDT--GSWYDS 317

Query: 1127 ESFAN--SKLMVTLQLTHPIRGTNLGNQGYTLLNGKLVNPKLWSAEDPYLYVLVVTLKDP 954
            E  AN  S  +  L+LTH   G  LG  G  +L GKL  PKLWSAE P LY+LV++LKD 
Sbjct: 318  EESANLLSSNVANLKLTHSPMGL-LGFLG-NVLEGKLEMPKLWSAEQPNLYILVLSLKDA 375

Query: 953  TGNIVDCESCQVGIRQISRGPKQLLVNGKPVVMHGVNRHEHHPLVGKTNIDSCMIKDIVL 774
            TG +VDCESC VGIRQ+S+ PKQLLVNG PV++ GVNRHEHHP VGKTNI+SCMIKD+VL
Sbjct: 376  TGQVVDCESCLVGIRQVSKAPKQLLVNGHPVILRGVNRHEHHPRVGKTNIESCMIKDLVL 435

Query: 773  MKQHNINAVRNSHYPQHPRWYELCDLFGLYVIDEANIETHGFGPYTERTHPASEPMWAAA 594
            MKQ+N+NAVRNSHYPQH RWYELCDLFG+Y+IDEANIETHGF       HP  E  WAAA
Sbjct: 436  MKQNNMNAVRNSHYPQHHRWYELCDLFGMYMIDEANIETHGFYLCEHLKHPTQEQSWAAA 495

Query: 593  MLDRVINMLERDKNHSCIILWSLGNEASYGPNHAALAGWLRERDSTRLLHYEGGGSRTTS 414
            M+DRVI+M+ERDKNH+CII WSLGNEASYGPNH+A AGW+RE+D++RL+HYEGGGSRTTS
Sbjct: 496  MMDRVISMVERDKNHACIISWSLGNEASYGPNHSAAAGWIREKDTSRLVHYEGGGSRTTS 555

Query: 413  TDVVCPMYMRVWDMIKIAEDPTECRPLIQCEYSHAMGNSNGNLHKYWEAIYNTPGLQGGF 234
            TD+VCPMYMRVWD++KIA+DP E RPLI CEYSHAMGNSNGN+H+YWEAI +T GLQGGF
Sbjct: 556  TDIVCPMYMRVWDIVKIAKDPAESRPLILCEYSHAMGNSNGNIHEYWEAINSTFGLQGGF 615

Query: 233  IWDWVDQGLLKEGSNSKKHWAYGGDFGDKPNDLNFCINGLLWPDRTCHPGLYEVQYCYQP 54
            IWDWVDQGLLK+  +  KHWAYGGDFGD PNDLNFC+NGL WPDRT HP L+EV+Y YQP
Sbjct: 616  IWDWVDQGLLKDSGDGTKHWAYGGDFGDTPNDLNFCLNGLTWPDRTPHPALHEVKYVYQP 675

Query: 53   ICVMQKENAIELRSRNF 3
            I V  +E+ I++ S +F
Sbjct: 676  IKVSLEESRIKITSTHF 692


>ref|XP_003634896.1| PREDICTED: beta-galactosidase [Vitis vinifera]
          Length = 1127

 Score =  890 bits (2300), Expect = 0.0
 Identities = 409/615 (66%), Positives = 490/615 (79%)
 Frame = -3

Query: 1847 WVKGLCYIRSLSGQWKFLLYDKPEAVHDKFQDETFDDSLWATLPVPSNWQLHGFDRPIYT 1668
            WVKGL +++SLSG WKF L   P +V   F D +F+DS W TLPVPSNWQ+HGFDRPIYT
Sbjct: 93   WVKGLPFVKSLSGYWKFYLAPGPTSVPMNFYDSSFEDSTWETLPVPSNWQMHGFDRPIYT 152

Query: 1667 NTDYPFSFNPPRAPEENPTGCYRKSFLLPKEWTGRRILLHFEAVDSAFFAWVNGQFVGYS 1488
            N  YPF  +PP  P ENPTGCYR  F +P EW GRRILLHFEAVDSAFFAW+NG  VGYS
Sbjct: 153  NIVYPFPLDPPHVPTENPTGCYRTVFHIPHEWKGRRILLHFEAVDSAFFAWINGVPVGYS 212

Query: 1487 QDSRLPAEFEITKYCYEPGSTKENILAVQVMRWSDGSYLEDQDHWWLSGIHRDVILISKP 1308
            QDSRLPAEFEIT YC+  GS K+N+LAVQV RWSDGSYLEDQD WWLSGIHRDV+L++KP
Sbjct: 213  QDSRLPAEFEITDYCHPCGSNKKNVLAVQVFRWSDGSYLEDQDQWWLSGIHRDVLLLAKP 272

Query: 1307 KVFISDYFFKSCLMSDFSSSDIEVEVLIEGSHQMVVDGQLGLYSVECFVYDAERNSAESG 1128
            +V+I DYFFKS L  +FS +DI+VEV I+ S +   D  L  +S+E  ++D+ +      
Sbjct: 273  QVYIEDYFFKSNLGENFSYADIQVEVKIDNSLETSKDSILNKFSIEAELFDSAKWHDSDE 332

Query: 1127 ESFANSKLMVTLQLTHPIRGTNLGNQGYTLLNGKLVNPKLWSAEDPYLYVLVVTLKDPTG 948
                +S  +  ++L         G  GY L+ GKL +PKLWSAE PYLY LVV LKD  G
Sbjct: 333  YCDLHSSSVAHMELDPSSSTAIFGFLGYVLV-GKLESPKLWSAEQPYLYTLVVILKDEFG 391

Query: 947  NIVDCESCQVGIRQISRGPKQLLVNGKPVVMHGVNRHEHHPLVGKTNIDSCMIKDIVLMK 768
             +VDCESCQVGIRQ+S+ PKQLLVNG PV++ GVNRHEHHP +GKTN++SCM+KD+VLMK
Sbjct: 392  KVVDCESCQVGIRQVSKAPKQLLVNGHPVILRGVNRHEHHPRLGKTNMESCMVKDLVLMK 451

Query: 767  QHNINAVRNSHYPQHPRWYELCDLFGLYVIDEANIETHGFGPYTERTHPASEPMWAAAML 588
            Q+NINAVRNSHYPQHPRWYELCDLFG+Y+IDEANIETHGF       +P  E  WA++M+
Sbjct: 452  QNNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFYDSQHLKNPTLESSWASSMM 511

Query: 587  DRVINMLERDKNHSCIILWSLGNEASYGPNHAALAGWLRERDSTRLLHYEGGGSRTTSTD 408
            DRVI+M+ERDKNH+CII WSLGNE+ YGPNH+ALAGW+R RDS+RLLHYEGGG+RT STD
Sbjct: 512  DRVISMVERDKNHACIISWSLGNESGYGPNHSALAGWIRGRDSSRLLHYEGGGARTPSTD 571

Query: 407  VVCPMYMRVWDMIKIAEDPTECRPLIQCEYSHAMGNSNGNLHKYWEAIYNTPGLQGGFIW 228
            +VCPMYMRVWD++KIA+DPTE RPLI CEYSH+MGNSNGN+ +YWEAI NT GLQGGFIW
Sbjct: 572  IVCPMYMRVWDIVKIAKDPTEMRPLILCEYSHSMGNSNGNIQEYWEAIDNTFGLQGGFIW 631

Query: 227  DWVDQGLLKEGSNSKKHWAYGGDFGDKPNDLNFCINGLLWPDRTCHPGLYEVQYCYQPIC 48
            DWVDQGLLK G++  KHWAYGGDFGD PNDLNFC+NG+ WPDRT HP ++EV+Y YQPI 
Sbjct: 632  DWVDQGLLKVGADGAKHWAYGGDFGDIPNDLNFCLNGITWPDRTLHPAVHEVKYVYQPIK 691

Query: 47   VMQKENAIELRSRNF 3
            +   E+ +++ + +F
Sbjct: 692  ISLSESTLKITNTHF 706


>ref|XP_002266400.1| PREDICTED: beta-galactosidase isoform 1 [Vitis vinifera]
            gi|296090332|emb|CBI40151.3| unnamed protein product
            [Vitis vinifera]
          Length = 1114

 Score =  890 bits (2300), Expect = 0.0
 Identities = 409/615 (66%), Positives = 490/615 (79%)
 Frame = -3

Query: 1847 WVKGLCYIRSLSGQWKFLLYDKPEAVHDKFQDETFDDSLWATLPVPSNWQLHGFDRPIYT 1668
            WVKGL +++SLSG WKF L   P +V   F D +F+DS W TLPVPSNWQ+HGFDRPIYT
Sbjct: 80   WVKGLPFVKSLSGYWKFYLAPGPTSVPMNFYDSSFEDSTWETLPVPSNWQMHGFDRPIYT 139

Query: 1667 NTDYPFSFNPPRAPEENPTGCYRKSFLLPKEWTGRRILLHFEAVDSAFFAWVNGQFVGYS 1488
            N  YPF  +PP  P ENPTGCYR  F +P EW GRRILLHFEAVDSAFFAW+NG  VGYS
Sbjct: 140  NIVYPFPLDPPHVPTENPTGCYRTVFHIPHEWKGRRILLHFEAVDSAFFAWINGVPVGYS 199

Query: 1487 QDSRLPAEFEITKYCYEPGSTKENILAVQVMRWSDGSYLEDQDHWWLSGIHRDVILISKP 1308
            QDSRLPAEFEIT YC+  GS K+N+LAVQV RWSDGSYLEDQD WWLSGIHRDV+L++KP
Sbjct: 200  QDSRLPAEFEITDYCHPCGSNKKNVLAVQVFRWSDGSYLEDQDQWWLSGIHRDVLLLAKP 259

Query: 1307 KVFISDYFFKSCLMSDFSSSDIEVEVLIEGSHQMVVDGQLGLYSVECFVYDAERNSAESG 1128
            +V+I DYFFKS L  +FS +DI+VEV I+ S +   D  L  +S+E  ++D+ +      
Sbjct: 260  QVYIEDYFFKSNLGENFSYADIQVEVKIDNSLETSKDSILNKFSIEAELFDSAKWHDSDE 319

Query: 1127 ESFANSKLMVTLQLTHPIRGTNLGNQGYTLLNGKLVNPKLWSAEDPYLYVLVVTLKDPTG 948
                +S  +  ++L         G  GY L+ GKL +PKLWSAE PYLY LVV LKD  G
Sbjct: 320  YCDLHSSSVAHMELDPSSSTAIFGFLGYVLV-GKLESPKLWSAEQPYLYTLVVILKDEFG 378

Query: 947  NIVDCESCQVGIRQISRGPKQLLVNGKPVVMHGVNRHEHHPLVGKTNIDSCMIKDIVLMK 768
             +VDCESCQVGIRQ+S+ PKQLLVNG PV++ GVNRHEHHP +GKTN++SCM+KD+VLMK
Sbjct: 379  KVVDCESCQVGIRQVSKAPKQLLVNGHPVILRGVNRHEHHPRLGKTNMESCMVKDLVLMK 438

Query: 767  QHNINAVRNSHYPQHPRWYELCDLFGLYVIDEANIETHGFGPYTERTHPASEPMWAAAML 588
            Q+NINAVRNSHYPQHPRWYELCDLFG+Y+IDEANIETHGF       +P  E  WA++M+
Sbjct: 439  QNNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFYDSQHLKNPTLESSWASSMM 498

Query: 587  DRVINMLERDKNHSCIILWSLGNEASYGPNHAALAGWLRERDSTRLLHYEGGGSRTTSTD 408
            DRVI+M+ERDKNH+CII WSLGNE+ YGPNH+ALAGW+R RDS+RLLHYEGGG+RT STD
Sbjct: 499  DRVISMVERDKNHACIISWSLGNESGYGPNHSALAGWIRGRDSSRLLHYEGGGARTPSTD 558

Query: 407  VVCPMYMRVWDMIKIAEDPTECRPLIQCEYSHAMGNSNGNLHKYWEAIYNTPGLQGGFIW 228
            +VCPMYMRVWD++KIA+DPTE RPLI CEYSH+MGNSNGN+ +YWEAI NT GLQGGFIW
Sbjct: 559  IVCPMYMRVWDIVKIAKDPTEMRPLILCEYSHSMGNSNGNIQEYWEAIDNTFGLQGGFIW 618

Query: 227  DWVDQGLLKEGSNSKKHWAYGGDFGDKPNDLNFCINGLLWPDRTCHPGLYEVQYCYQPIC 48
            DWVDQGLLK G++  KHWAYGGDFGD PNDLNFC+NG+ WPDRT HP ++EV+Y YQPI 
Sbjct: 619  DWVDQGLLKVGADGAKHWAYGGDFGDIPNDLNFCLNGITWPDRTLHPAVHEVKYVYQPIK 678

Query: 47   VMQKENAIELRSRNF 3
            +   E+ +++ + +F
Sbjct: 679  ISLSESTLKITNTHF 693


>ref|XP_002513059.1| beta-galactosidase, putative [Ricinus communis]
            gi|223548070|gb|EEF49562.1| beta-galactosidase, putative
            [Ricinus communis]
          Length = 1110

 Score =  887 bits (2291), Expect = 0.0
 Identities = 414/615 (67%), Positives = 490/615 (79%)
 Frame = -3

Query: 1847 WVKGLCYIRSLSGQWKFLLYDKPEAVHDKFQDETFDDSLWATLPVPSNWQLHGFDRPIYT 1668
            WVK L +++S+SG WKF L   P  V  KF +  F D  W TLPVPSNWQ+HGFDRPIYT
Sbjct: 80   WVKDLPFVKSMSGFWKFFLAPSPTKVPIKFYEPAFQDFEWQTLPVPSNWQMHGFDRPIYT 139

Query: 1667 NTDYPFSFNPPRAPEENPTGCYRKSFLLPKEWTGRRILLHFEAVDSAFFAWVNGQFVGYS 1488
            N  YPF  +PP  PE+NPTGCYR  F +PKEW GRRILLHFEAVDSAF AWVNG  VGYS
Sbjct: 140  NVVYPFPLDPPYVPEDNPTGCYRTYFQIPKEWQGRRILLHFEAVDSAFCAWVNGVPVGYS 199

Query: 1487 QDSRLPAEFEITKYCYEPGSTKENILAVQVMRWSDGSYLEDQDHWWLSGIHRDVILISKP 1308
            QDSRLPAEFEIT+YCY   S K N+LAVQV+RWSDGSYLEDQDHWWLSGIHRDV+L++KP
Sbjct: 200  QDSRLPAEFEITEYCYSCDSGKSNVLAVQVIRWSDGSYLEDQDHWWLSGIHRDVLLLAKP 259

Query: 1307 KVFISDYFFKSCLMSDFSSSDIEVEVLIEGSHQMVVDGQLGLYSVECFVYDAERNSAESG 1128
            +VFI DYFFKS L  DF+S++IEVEV ++ S +M  D  L  + +E  +YD E      G
Sbjct: 260  QVFIVDYFFKSNLAEDFASAEIEVEVKLDSSQEMPKDKILDNFVIEAALYDTESWYNSDG 319

Query: 1127 ESFANSKLMVTLQLTHPIRGTNLGNQGYTLLNGKLVNPKLWSAEDPYLYVLVVTLKDPTG 948
             +   S  +  +++ +P     LG  GY L+ GK+  PKLWSAE P LY+LV+TLKD  G
Sbjct: 320  AANLLSSQVADIKI-NPSFDAILGFLGYVLV-GKVEKPKLWSAEQPNLYILVLTLKDAFG 377

Query: 947  NIVDCESCQVGIRQISRGPKQLLVNGKPVVMHGVNRHEHHPLVGKTNIDSCMIKDIVLMK 768
            ++VDCESC VGIRQ+S+ PKQLLVNG+PV++ GVNRHEHHP +GKTNI+SCMIKD+VLMK
Sbjct: 378  HVVDCESCLVGIRQVSKAPKQLLVNGQPVIIRGVNRHEHHPRIGKTNIESCMIKDLVLMK 437

Query: 767  QHNINAVRNSHYPQHPRWYELCDLFGLYVIDEANIETHGFGPYTERTHPASEPMWAAAML 588
            Q+NINAVRNSHYPQHPRWYELCDLFG+Y+IDEANIETHGF       HP SE  WA AM+
Sbjct: 438  QNNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFHLSGHIKHPTSEQSWAIAMI 497

Query: 587  DRVINMLERDKNHSCIILWSLGNEASYGPNHAALAGWLRERDSTRLLHYEGGGSRTTSTD 408
            DRVI M+ERDKNH+CII WSLGNEASYGPNH+A AGW+R +D++RL+HYEGGGSRT STD
Sbjct: 498  DRVIGMVERDKNHACIISWSLGNEASYGPNHSAAAGWIRGKDTSRLVHYEGGGSRTPSTD 557

Query: 407  VVCPMYMRVWDMIKIAEDPTECRPLIQCEYSHAMGNSNGNLHKYWEAIYNTPGLQGGFIW 228
            +VCPMYMRVWD++KIA DPTE RPLI CEYSHAMGNS+GN+ +YWEAI +T GLQGGFIW
Sbjct: 558  IVCPMYMRVWDIVKIANDPTELRPLILCEYSHAMGNSSGNICEYWEAIDSTFGLQGGFIW 617

Query: 227  DWVDQGLLKEGSNSKKHWAYGGDFGDKPNDLNFCINGLLWPDRTCHPGLYEVQYCYQPIC 48
            DWVDQGLLKE ++  K+WAYGGDFGD PNDLNFC+NGL WPDR+ HP L+EV+Y YQPI 
Sbjct: 618  DWVDQGLLKENTDGSKYWAYGGDFGDTPNDLNFCLNGLTWPDRSPHPALHEVKYVYQPIK 677

Query: 47   VMQKENAIELRSRNF 3
            V  K + +++ +  F
Sbjct: 678  VSLKGSTLKITNTYF 692


>ref|XP_002303929.2| glycoside hydrolase family 2 family protein [Populus trichocarpa]
            gi|550343549|gb|EEE78908.2| glycoside hydrolase family 2
            family protein [Populus trichocarpa]
          Length = 1113

 Score =  885 bits (2287), Expect = 0.0
 Identities = 416/617 (67%), Positives = 495/617 (80%), Gaps = 2/617 (0%)
 Frame = -3

Query: 1847 WVKGLCYIRSLSGQWKFLLYDKPEAVHDKFQDETFDDSLWATLPVPSNWQLHGFDRPIYT 1668
            WVK L +++SLSG W+F L   P++V  KF D  F+DS W TLPVPSNW+LHG+DRPIY 
Sbjct: 80   WVKDLPFVKSLSGFWRFFLAPGPDSVPKKFYDAEFEDSEWNTLPVPSNWELHGYDRPIYA 139

Query: 1667 NTDYPFSFNPPRAPEENPTGCYRKSFLLPKEWTGRRILLHFEAVDSAFFAWVNGQFVGYS 1488
            N  YPF  +PPR P++NPTGCYR  F LP+ W  RRI LHFEAVDSAF AW+NG  VGYS
Sbjct: 140  NVLYPFPVDPPRVPDDNPTGCYRTYFDLPQGWQDRRIFLHFEAVDSAFCAWINGVAVGYS 199

Query: 1487 QDSRLPAEFEITKYCYEPGSTKENILAVQVMRWSDGSYLEDQDHWWLSGIHRDVILISKP 1308
            QDSRLPAEFEIT YCY  GS K+N+LAVQV RWSDGSYLEDQDHWW+SGIHRDV+L+SK 
Sbjct: 200  QDSRLPAEFEITDYCYPCGSGKKNLLAVQVFRWSDGSYLEDQDHWWMSGIHRDVLLLSKA 259

Query: 1307 KVFISDYFFKSCLMSDFSSSDIEVEVLIEGSHQMVVDGQLGLYSVECFVYDAER--NSAE 1134
            +VFI+DYFFKS L  +F+S+DIEVEV IE + ++  D     +++E  +YD     NS E
Sbjct: 260  QVFIADYFFKSNLAENFTSADIEVEVKIESALEIPRDKIFDNFTIEAALYDTGSWYNSEE 319

Query: 1133 SGESFANSKLMVTLQLTHPIRGTNLGNQGYTLLNGKLVNPKLWSAEDPYLYVLVVTLKDP 954
            S +  +++  +  L+LTH   G  LG  G   L GKL  PKLWSAE P LY+LV++LKD 
Sbjct: 320  SPDLLSSN--VANLKLTHSPMGI-LGFLG-NFLEGKLEKPKLWSAEQPNLYILVLSLKDA 375

Query: 953  TGNIVDCESCQVGIRQISRGPKQLLVNGKPVVMHGVNRHEHHPLVGKTNIDSCMIKDIVL 774
            TG +VDCESC VGIRQIS+ PKQLLVNG PV++ GVNRHEHHP VGKTNI+SCMIKD+VL
Sbjct: 376  TGQVVDCESCLVGIRQISKAPKQLLVNGCPVIIRGVNRHEHHPRVGKTNIESCMIKDLVL 435

Query: 773  MKQHNINAVRNSHYPQHPRWYELCDLFGLYVIDEANIETHGFGPYTERTHPASEPMWAAA 594
            MKQ+N+NAVRNSHYPQHPRWYELCDLFGLY+IDEANIETHGF       HP  E  WAAA
Sbjct: 436  MKQNNMNAVRNSHYPQHPRWYELCDLFGLYMIDEANIETHGFHLCEHLKHPTQEQSWAAA 495

Query: 593  MLDRVINMLERDKNHSCIILWSLGNEASYGPNHAALAGWLRERDSTRLLHYEGGGSRTTS 414
            M+DRVI+M+ERDKNH+CII WSLGNE+SYGPNH+A AGW+RERD +RL+HYEGGGSRT S
Sbjct: 496  MMDRVISMVERDKNHACIISWSLGNESSYGPNHSAAAGWIRERDPSRLVHYEGGGSRTAS 555

Query: 413  TDVVCPMYMRVWDMIKIAEDPTECRPLIQCEYSHAMGNSNGNLHKYWEAIYNTPGLQGGF 234
            TD++CPMYMRVWD++KIA+DPTE RPLI CEYSHAMGNS+GN+ +YW+AI +T GLQGGF
Sbjct: 556  TDIICPMYMRVWDIVKIAKDPTEPRPLILCEYSHAMGNSSGNIREYWDAIDSTFGLQGGF 615

Query: 233  IWDWVDQGLLKEGSNSKKHWAYGGDFGDKPNDLNFCINGLLWPDRTCHPGLYEVQYCYQP 54
            IW+WVDQ LLKE  + +KHWAYGGDFGD PNDLNFC+NGL WPDRT HP L EV+Y YQP
Sbjct: 616  IWEWVDQALLKESGDGRKHWAYGGDFGDTPNDLNFCLNGLTWPDRTPHPALEEVKYVYQP 675

Query: 53   ICVMQKENAIELRSRNF 3
            I V  +E+ I++ + +F
Sbjct: 676  IKVSLEESTIKITNTHF 692


>ref|XP_004971288.1| PREDICTED: beta-galactosidase-like isoform X1 [Setaria italica]
            gi|514787266|ref|XP_004971289.1| PREDICTED:
            beta-galactosidase-like isoform X2 [Setaria italica]
            gi|514787270|ref|XP_004971290.1| PREDICTED:
            beta-galactosidase-like isoform X3 [Setaria italica]
            gi|514787274|ref|XP_004971291.1| PREDICTED:
            beta-galactosidase-like isoform X4 [Setaria italica]
            gi|514787278|ref|XP_004971292.1| PREDICTED:
            beta-galactosidase-like isoform X5 [Setaria italica]
          Length = 1116

 Score =  884 bits (2285), Expect = 0.0
 Identities = 416/618 (67%), Positives = 492/618 (79%), Gaps = 2/618 (0%)
 Frame = -3

Query: 1850 VWVKGLCYIRSLSGQWKFLLYDKPEAVHDKFQDETFDDSLWATLPVPSNWQLHGFDRPIY 1671
            +W KGL Y +SLSG WKFLL    E+V +KF D  FDDS W  LPVPSNWQ+HGFDRPIY
Sbjct: 80   LWSKGLPYTKSLSGYWKFLLAPSAESVPEKFFDAHFDDSNWEALPVPSNWQMHGFDRPIY 139

Query: 1670 TNTDYPFSFNPPRAPEENPTGCYRKSFLLPKEWTGRRILLHFEAVDSAFFAWVNGQFVGY 1491
            TNT YPF  NPP    +NPTGCYR  F +PKEW GRRILLHFEAVDSAFFAWVNG  +GY
Sbjct: 140  TNTTYPFPINPPFVSTDNPTGCYRTVFHIPKEWKGRRILLHFEAVDSAFFAWVNGVPIGY 199

Query: 1490 SQDSRLPAEFEITKYCYEPGSTKENILAVQVMRWSDGSYLEDQDHWWLSGIHRDVILISK 1311
            SQDSRLPAEFE+T  C+   S KEN+LAVQVMRWSDGSYLEDQDHWWLSGIHRDV+L+SK
Sbjct: 200  SQDSRLPAEFEVTDCCHPCDSDKENVLAVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSK 259

Query: 1310 PKVFISDYFFKSCLMSDFSSSDIEVEVLIEGSHQMVVDGQLGLYSVECFVYD--AERNSA 1137
            P++FI+DYFFK+ +  +FS +DIEVEV I+ SH+   +  +   S+E  +YD      S 
Sbjct: 260  PQIFITDYFFKATMDENFSLADIEVEVEID-SHKQDRE-HVSTLSIEATLYDNSGPSISL 317

Query: 1136 ESGESFANSKLMVTLQLTHPIRGTNLGNQGYTLLNGKLVNPKLWSAEDPYLYVLVVTLKD 957
            +   SFAN   +     T   RG  LG  GY +L GK+ NPKLWS+E P LY LVV LKD
Sbjct: 318  DGDLSFANVVNLKPKPKTS--RGPCLGFHGY-VLGGKIENPKLWSSEHPNLYTLVVLLKD 374

Query: 956  PTGNIVDCESCQVGIRQISRGPKQLLVNGKPVVMHGVNRHEHHPLVGKTNIDSCMIKDIV 777
              G +++CESCQVGIR + R  KQ+LVNG PVV+ GVNRHEHHP +GKTNI++CMIKD++
Sbjct: 375  ANGKLIECESCQVGIRNVVRAHKQMLVNGCPVVLRGVNRHEHHPRLGKTNIEACMIKDLI 434

Query: 776  LMKQHNINAVRNSHYPQHPRWYELCDLFGLYVIDEANIETHGFGPYTERTHPASEPMWAA 597
            LM+Q+NINAVRNSHYPQH RWYELCD+FGLYVIDEANIETHGF   +   HP  EP+WA 
Sbjct: 435  LMRQNNINAVRNSHYPQHSRWYELCDIFGLYVIDEANIETHGFDENSHFKHPTLEPIWAN 494

Query: 596  AMLDRVINMLERDKNHSCIILWSLGNEASYGPNHAALAGWLRERDSTRLLHYEGGGSRTT 417
            AMLDRV+ M+ERDKNH+CII+WSLGNE+SYGPNHA+++GW+RERD TRLLHYEGGGSRT+
Sbjct: 495  AMLDRVVGMVERDKNHACIIVWSLGNESSYGPNHASMSGWIRERDPTRLLHYEGGGSRTS 554

Query: 416  STDVVCPMYMRVWDMIKIAEDPTECRPLIQCEYSHAMGNSNGNLHKYWEAIYNTPGLQGG 237
            STD+VCPMYMRVWD+IKIA+DP+E RPLI CEYSHAMGNSNGN+  YW AI NT GLQGG
Sbjct: 555  STDIVCPMYMRVWDIIKIAKDPSETRPLILCEYSHAMGNSNGNIDAYWMAIDNTFGLQGG 614

Query: 236  FIWDWVDQGLLKEGSNSKKHWAYGGDFGDKPNDLNFCINGLLWPDRTCHPGLYEVQYCYQ 57
            FIWDWVDQGLLKE S+  K WAYGGDFGD PNDLNFC+NG++WPDRT HP ++EV+Y YQ
Sbjct: 615  FIWDWVDQGLLKEDSDGSKFWAYGGDFGDTPNDLNFCLNGIVWPDRTIHPAVHEVKYLYQ 674

Query: 56   PICVMQKENAIELRSRNF 3
            PI +   +N +++ + +F
Sbjct: 675  PIKISSADNMLKIENGHF 692


>ref|XP_006403576.1| hypothetical protein EUTSA_v10010080mg [Eutrema salsugineum]
            gi|557104695|gb|ESQ45029.1| hypothetical protein
            EUTSA_v10010080mg [Eutrema salsugineum]
          Length = 1107

 Score =  884 bits (2283), Expect = 0.0
 Identities = 411/615 (66%), Positives = 490/615 (79%)
 Frame = -3

Query: 1847 WVKGLCYIRSLSGQWKFLLYDKPEAVHDKFQDETFDDSLWATLPVPSNWQLHGFDRPIYT 1668
            WV+GL +++SLSG WKF L   P  V DKF D  F DS W +LPVPSNWQ HGFDRPIYT
Sbjct: 80   WVEGLPFVKSLSGFWKFFLAPSPANVPDKFYDAAFPDSDWKSLPVPSNWQCHGFDRPIYT 139

Query: 1667 NTDYPFSFNPPRAPEENPTGCYRKSFLLPKEWTGRRILLHFEAVDSAFFAWVNGQFVGYS 1488
            N  YPF  +PP  PE+NPTGCYR  F +PKEW  RRILLHFEAVDSAFFAW+NG+ VGYS
Sbjct: 140  NIVYPFPNDPPHVPEDNPTGCYRTYFQIPKEWKDRRILLHFEAVDSAFFAWINGKPVGYS 199

Query: 1487 QDSRLPAEFEITKYCYEPGSTKENILAVQVMRWSDGSYLEDQDHWWLSGIHRDVILISKP 1308
            QDSRLPAEFEI+ YCY   S K+N+LAVQV RWSDGSYLEDQDHWWLSG+HRDV+L++KP
Sbjct: 200  QDSRLPAEFEISDYCYPWDSGKQNVLAVQVFRWSDGSYLEDQDHWWLSGLHRDVLLLAKP 259

Query: 1307 KVFISDYFFKSCLMSDFSSSDIEVEVLIEGSHQMVVDGQLGLYSVECFVYDAERNSAESG 1128
            KVFI DYFFKS L  DFS +DI+VEV I+   +   D  L  + +E  V+D +      G
Sbjct: 260  KVFIDDYFFKSKLADDFSYADIQVEVKIDNMLETSKDLVLSNFIIEAAVFDTKSWYNSGG 319

Query: 1127 ESFANSKLMVTLQLTHPIRGTNLGNQGYTLLNGKLVNPKLWSAEDPYLYVLVVTLKDPTG 948
             S+  S  + +L+L +P   ++LG  GY LL GKL +P LWSAE P +Y+LV+TLKD +G
Sbjct: 320  FSYELSPKVASLKL-NPSPSSSLGFHGY-LLEGKLDSPNLWSAEQPNVYILVITLKDKSG 377

Query: 947  NIVDCESCQVGIRQISRGPKQLLVNGKPVVMHGVNRHEHHPLVGKTNIDSCMIKDIVLMK 768
             ++D ES  VG+RQ+S+  KQLLVNG PV++ GVNRHEHHP VGKTNI++CMIKD+++MK
Sbjct: 378  KLLDSESSIVGVRQVSKAFKQLLVNGHPVMIKGVNRHEHHPRVGKTNIEACMIKDLIMMK 437

Query: 767  QHNINAVRNSHYPQHPRWYELCDLFGLYVIDEANIETHGFGPYTERTHPASEPMWAAAML 588
            ++NINAVRNSHYPQHPRWYELCDLFG+Y+IDEANIETHGF       HP  EP WAAAML
Sbjct: 438  EYNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHLKHPTKEPSWAAAML 497

Query: 587  DRVINMLERDKNHSCIILWSLGNEASYGPNHAALAGWLRERDSTRLLHYEGGGSRTTSTD 408
            DRV+ M+ERDKNH+CII WSLGNEA+YGPNH+A+AGW+RE+D +RL+HYEGGGSRT STD
Sbjct: 498  DRVVGMVERDKNHACIISWSLGNEANYGPNHSAMAGWIREKDPSRLVHYEGGGSRTDSTD 557

Query: 407  VVCPMYMRVWDMIKIAEDPTECRPLIQCEYSHAMGNSNGNLHKYWEAIYNTPGLQGGFIW 228
            +VCPMYMRVWD++KIA D  E RPLI CEYSHAMGNSNGN+ +YWEAI NT GLQGGFIW
Sbjct: 558  IVCPMYMRVWDIVKIALDKNESRPLILCEYSHAMGNSNGNIDEYWEAIDNTFGLQGGFIW 617

Query: 227  DWVDQGLLKEGSNSKKHWAYGGDFGDKPNDLNFCINGLLWPDRTCHPGLYEVQYCYQPIC 48
            DWVDQGLLK GS+  KHWAYGGDFGD+PNDLNFC+NGL+WPDRT HP L+EV++CYQPI 
Sbjct: 618  DWVDQGLLKLGSDGIKHWAYGGDFGDQPNDLNFCLNGLIWPDRTPHPALHEVKHCYQPIK 677

Query: 47   VMQKENAIELRSRNF 3
            V   +  + + +  F
Sbjct: 678  VSLTDGTMRVANAYF 692


>gb|EOY19805.1| Glycoside hydrolase family 2 protein isoform 1 [Theobroma cacao]
          Length = 1114

 Score =  884 bits (2283), Expect = 0.0
 Identities = 412/615 (66%), Positives = 484/615 (78%)
 Frame = -3

Query: 1847 WVKGLCYIRSLSGQWKFLLYDKPEAVHDKFQDETFDDSLWATLPVPSNWQLHGFDRPIYT 1668
            WV GL +++SLSG WKF L   P AV   F +  F DS W TLPVPSNWQ+HGFDRPIYT
Sbjct: 81   WVNGLPFVKSLSGYWKFFLASNPNAVPKNFYESAFQDSDWETLPVPSNWQMHGFDRPIYT 140

Query: 1667 NTDYPFSFNPPRAPEENPTGCYRKSFLLPKEWTGRRILLHFEAVDSAFFAWVNGQFVGYS 1488
            N  YP   +PP  P +NPTGCYR  F +P++W GRRILLHFEAVDSAF AW+NG  VGYS
Sbjct: 141  NVVYPIPLDPPHVPIDNPTGCYRTYFHIPEQWQGRRILLHFEAVDSAFCAWINGIPVGYS 200

Query: 1487 QDSRLPAEFEITKYCYEPGSTKENILAVQVMRWSDGSYLEDQDHWWLSGIHRDVILISKP 1308
            QDSRLPAEFEIT+YCY   S K+N+LAVQV RWSDGSYLEDQDHWWLSGIHRDV+L+SKP
Sbjct: 201  QDSRLPAEFEITEYCYSCDSDKKNVLAVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLSKP 260

Query: 1307 KVFISDYFFKSCLMSDFSSSDIEVEVLIEGSHQMVVDGQLGLYSVECFVYDAERNSAESG 1128
            +VFI+DYFFKS L  +FS +DI+VEV I+ S +M  D  L  +++E  ++DA       G
Sbjct: 261  QVFIADYFFKSSLAYNFSYADIQVEVKIDCSREMSKDKVLTDFTIEAALFDAGVWYNHDG 320

Query: 1127 ESFANSKLMVTLQLTHPIRGTNLGNQGYTLLNGKLVNPKLWSAEDPYLYVLVVTLKDPTG 948
                 S  +  + L     GT LG  GY L+ GKL  PKLWSAE P LY LV+ LKD +G
Sbjct: 321  NVDLLSSNVANIVLKTVPTGT-LGFHGYVLV-GKLEKPKLWSAEQPNLYTLVIILKDASG 378

Query: 947  NIVDCESCQVGIRQISRGPKQLLVNGKPVVMHGVNRHEHHPLVGKTNIDSCMIKDIVLMK 768
            N+VDCESC VG+RQ+S+ PKQLLVNG PVV+ GVNRHEHHP +GKTNI+SCM+KD+V+MK
Sbjct: 379  NVVDCESCLVGVRQVSKAPKQLLVNGHPVVIRGVNRHEHHPRLGKTNIESCMVKDLVVMK 438

Query: 767  QHNINAVRNSHYPQHPRWYELCDLFGLYVIDEANIETHGFGPYTERTHPASEPMWAAAML 588
            Q+NINAVRNSHYPQHPRWYELCDLFG+Y+IDEANIETHGF       H   EP WAAAM+
Sbjct: 439  QNNINAVRNSHYPQHPRWYELCDLFGIYMIDEANIETHGFDLSGHVKHLTQEPGWAAAMM 498

Query: 587  DRVINMLERDKNHSCIILWSLGNEASYGPNHAALAGWLRERDSTRLLHYEGGGSRTTSTD 408
            DRVI M+ERDKNH+CI  WSLGNE+ YGPNH+A AGW+R RD +RL+HYEGGGSRT+STD
Sbjct: 499  DRVIGMVERDKNHACIFSWSLGNESGYGPNHSASAGWIRGRDPSRLVHYEGGGSRTSSTD 558

Query: 407  VVCPMYMRVWDMIKIAEDPTECRPLIQCEYSHAMGNSNGNLHKYWEAIYNTPGLQGGFIW 228
            ++CPMYMRVWD++KIA+DP E RPLI CEYSHAMGNSNGN+H+YWEAI N  GLQGGFIW
Sbjct: 559  IICPMYMRVWDIVKIAKDPNETRPLILCEYSHAMGNSNGNIHEYWEAIDNIFGLQGGFIW 618

Query: 227  DWVDQGLLKEGSNSKKHWAYGGDFGDKPNDLNFCINGLLWPDRTCHPGLYEVQYCYQPIC 48
            DWVDQGLLK+  +  K+WAYGGDFGD PNDLNFC+NGL WPDRT HP L EV+Y YQPI 
Sbjct: 619  DWVDQGLLKDNEDGSKYWAYGGDFGDSPNDLNFCLNGLTWPDRTPHPALQEVKYVYQPIK 678

Query: 47   VMQKENAIELRSRNF 3
            V   E+ I++++ NF
Sbjct: 679  VSIGESMIKIKNTNF 693


>dbj|BAJ86348.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 1122

 Score =  881 bits (2276), Expect = 0.0
 Identities = 412/615 (66%), Positives = 484/615 (78%)
 Frame = -3

Query: 1847 WVKGLCYIRSLSGQWKFLLYDKPEAVHDKFQDETFDDSLWATLPVPSNWQLHGFDRPIYT 1668
            W +GL Y RSLSG WKF L   PE V DKF D  F+DS W  LPVPSNWQ+HGFDRPIYT
Sbjct: 86   WSEGLPYARSLSGLWKFRLAQSPETVPDKFFDAQFNDSDWDALPVPSNWQMHGFDRPIYT 145

Query: 1667 NTDYPFSFNPPRAPEENPTGCYRKSFLLPKEWTGRRILLHFEAVDSAFFAWVNGQFVGYS 1488
            N  YPF  NPP  P ENPTGCYRK F +PKEW GRRILLHFEAVDSAF AWVNG  +GYS
Sbjct: 146  NVTYPFPMNPPFVPSENPTGCYRKVFHIPKEWKGRRILLHFEAVDSAFLAWVNGVPIGYS 205

Query: 1487 QDSRLPAEFEITKYCYEPGSTKENILAVQVMRWSDGSYLEDQDHWWLSGIHRDVILISKP 1308
            QDSRLPAEFEIT  C+   S KEN+LAVQVMRWSDGSYLEDQDHWWLSGIHRDV+L+SKP
Sbjct: 206  QDSRLPAEFEITDCCHHCDSGKENVLAVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKP 265

Query: 1307 KVFISDYFFKSCLMSDFSSSDIEVEVLIEGSHQMVVDGQLGLYSVECFVYDAERNSAESG 1128
            ++FI+DYFFKS L  +F  +DIEVEV I+ SH+   +  +   S+E  ++D   +S +  
Sbjct: 266  QIFITDYFFKSTLDENFRVADIEVEVEID-SHKEDRE-HIPTLSIEATLFDNSESSDDLN 323

Query: 1127 ESFANSKLMVTLQLTHPIRGTNLGNQGYTLLNGKLVNPKLWSAEDPYLYVLVVTLKDPTG 948
               +++ ++       P  G   G  GY +L GK+ NPKLWS+E P LY LVV LKD  G
Sbjct: 324  SDMSDANVVNLKTKPKPKGGPCHGFHGY-VLGGKVENPKLWSSEKPNLYTLVVLLKDANG 382

Query: 947  NIVDCESCQVGIRQISRGPKQLLVNGKPVVMHGVNRHEHHPLVGKTNIDSCMIKDIVLMK 768
             ++DCESCQVGIR +    KQ+LVNG PVV+ GVNRHEHHP VGKTN+++CMIKD+VLM+
Sbjct: 383  KLIDCESCQVGIRNVVLAHKQMLVNGSPVVIRGVNRHEHHPRVGKTNLEACMIKDLVLMR 442

Query: 767  QHNINAVRNSHYPQHPRWYELCDLFGLYVIDEANIETHGFGPYTERTHPASEPMWAAAML 588
            Q+NINAVRNSHYPQHPRWYELCD+FGLYVIDEANIETHGF   +   HP  EP+WA +ML
Sbjct: 443  QNNINAVRNSHYPQHPRWYELCDIFGLYVIDEANIETHGFDETSHFKHPTLEPIWANSML 502

Query: 587  DRVINMLERDKNHSCIILWSLGNEASYGPNHAALAGWLRERDSTRLLHYEGGGSRTTSTD 408
            DRV+ M+ERDKNH+CII+WSLGNEASYGPNH+A++GW+R RD TRL+HYEGGGSRT+STD
Sbjct: 503  DRVVGMVERDKNHACIIIWSLGNEASYGPNHSAMSGWVRGRDPTRLIHYEGGGSRTSSTD 562

Query: 407  VVCPMYMRVWDMIKIAEDPTECRPLIQCEYSHAMGNSNGNLHKYWEAIYNTPGLQGGFIW 228
            +VCPMYMRVWD++KIA DP+E RPLI CEYSHAMGNSNGN+  YW+AI NT GLQGGFIW
Sbjct: 563  IVCPMYMRVWDILKIANDPSENRPLILCEYSHAMGNSNGNIDAYWKAIDNTMGLQGGFIW 622

Query: 227  DWVDQGLLKEGSNSKKHWAYGGDFGDKPNDLNFCINGLLWPDRTCHPGLYEVQYCYQPIC 48
            DWVDQGLLKE  +  K WAYGGDFGD PNDLNFCING++WPDRT HP + EV+Y YQPI 
Sbjct: 623  DWVDQGLLKENVDGSKSWAYGGDFGDTPNDLNFCINGIVWPDRTLHPAVNEVKYLYQPIK 682

Query: 47   VMQKENAIELRSRNF 3
            V   +N +++ +  F
Sbjct: 683  VSLVDNILKIENGQF 697


>ref|XP_004308587.1| PREDICTED: beta-galactosidase-like [Fragaria vesca subsp. vesca]
          Length = 1113

 Score =  880 bits (2273), Expect = 0.0
 Identities = 407/615 (66%), Positives = 486/615 (79%)
 Frame = -3

Query: 1847 WVKGLCYIRSLSGQWKFLLYDKPEAVHDKFQDETFDDSLWATLPVPSNWQLHGFDRPIYT 1668
            W KGL ++ SLSG WKF L   P  V   F   TF DS W TLPVPSNWQ+HGFDRPIYT
Sbjct: 82   WTKGLPFVESLSGYWKFYLASTPGNVPLNFYHTTFQDSEWETLPVPSNWQMHGFDRPIYT 141

Query: 1667 NTDYPFSFNPPRAPEENPTGCYRKSFLLPKEWTGRRILLHFEAVDSAFFAWVNGQFVGYS 1488
            N  YPF  +PP  P +NPTGCYR  F++P+EW GRR+LLHFEAVDSAF AW+NG  VGYS
Sbjct: 142  NVVYPFPLDPPFVPVDNPTGCYRTDFVIPEEWKGRRVLLHFEAVDSAFCAWINGVPVGYS 201

Query: 1487 QDSRLPAEFEITKYCYEPGSTKENILAVQVMRWSDGSYLEDQDHWWLSGIHRDVILISKP 1308
            QDSRLPAEFEIT YCY  GS K+N+LAVQV RWSDGSYLEDQDHWWLSGIHRDV+L+SKP
Sbjct: 202  QDSRLPAEFEITDYCYPCGSDKKNVLAVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLSKP 261

Query: 1307 KVFISDYFFKSCLMSDFSSSDIEVEVLIEGSHQMVVDGQLGLYSVECFVYDAERNSAESG 1128
            +VFI DYFF+S L  DFS +D++VEV I+ S +   +  +  +++E  ++D+    +  G
Sbjct: 262  QVFIGDYFFRSNLAEDFSYADLQVEVKIDNSRETSKNTVIDNFTIEAALFDSGSWYSIGG 321

Query: 1127 ESFANSKLMVTLQLTHPIRGTNLGNQGYTLLNGKLVNPKLWSAEDPYLYVLVVTLKDPTG 948
             +   S  +  L+L     G+ LG + Y+L+ G+L  P+LWSAE P LY LVV LKD +G
Sbjct: 322  SADLLSSNVANLKLDLS-PGSILGFRDYSLV-GRLEAPRLWSAEQPNLYTLVVILKDKSG 379

Query: 947  NIVDCESCQVGIRQISRGPKQLLVNGKPVVMHGVNRHEHHPLVGKTNIDSCMIKDIVLMK 768
            NIVDCESC VGIRQ+S  PKQLLVNG P+++ GVNRHEHHP +GKTNI+SCMIKD+VLMK
Sbjct: 380  NIVDCESCVVGIRQVSNAPKQLLVNGHPIIIRGVNRHEHHPRLGKTNIESCMIKDLVLMK 439

Query: 767  QHNINAVRNSHYPQHPRWYELCDLFGLYVIDEANIETHGFGPYTERTHPASEPMWAAAML 588
            Q+NINAVRNSHYPQHPRWYELCD+FG+Y+IDEANIE HGF       HP  EP WA AML
Sbjct: 440  QYNINAVRNSHYPQHPRWYELCDIFGMYMIDEANIEAHGFDYSGHVKHPTLEPSWATAML 499

Query: 587  DRVINMLERDKNHSCIILWSLGNEASYGPNHAALAGWLRERDSTRLLHYEGGGSRTTSTD 408
            DRVI M+ERDKNH+CII WSLGNE+ YGPNH+A AGW+R +D +RLLHYEGGGSRT STD
Sbjct: 500  DRVIGMVERDKNHACIISWSLGNESGYGPNHSASAGWVRGKDPSRLLHYEGGGSRTPSTD 559

Query: 407  VVCPMYMRVWDMIKIAEDPTECRPLIQCEYSHAMGNSNGNLHKYWEAIYNTPGLQGGFIW 228
            ++CPMYMRVWD++KIA+DP E RPLI CEYSHAMGNSNGN+H+YWEAI +T GLQGGFIW
Sbjct: 560  IICPMYMRVWDIVKIAKDPNETRPLILCEYSHAMGNSNGNIHEYWEAIDSTFGLQGGFIW 619

Query: 227  DWVDQGLLKEGSNSKKHWAYGGDFGDKPNDLNFCINGLLWPDRTCHPGLYEVQYCYQPIC 48
            DWVDQGLLK+ ++  KHWAYGGDFGD PNDLNFC+NGL+WPDRT HP ++EV+Y YQPI 
Sbjct: 620  DWVDQGLLKDSADGTKHWAYGGDFGDVPNDLNFCLNGLVWPDRTPHPAMHEVKYVYQPIK 679

Query: 47   VMQKENAIELRSRNF 3
            V   E  +++ + +F
Sbjct: 680  VSFSEGTLKVTNTHF 694


>ref|NP_001030858.1| glycoside hydrolase family 2 protein [Arabidopsis thaliana]
            gi|332645710|gb|AEE79231.1| glycoside hydrolase family 2
            protein [Arabidopsis thaliana]
          Length = 1108

 Score =  880 bits (2273), Expect = 0.0
 Identities = 413/615 (67%), Positives = 484/615 (78%)
 Frame = -3

Query: 1847 WVKGLCYIRSLSGQWKFLLYDKPEAVHDKFQDETFDDSLWATLPVPSNWQLHGFDRPIYT 1668
            WV GL +++SLSG WKF L  KP  V DKF D  F DS W  L VPSNWQ HGFDRPIYT
Sbjct: 80   WVDGLPFVKSLSGYWKFFLAPKPANVPDKFYDAAFSDSDWNALQVPSNWQCHGFDRPIYT 139

Query: 1667 NTDYPFSFNPPRAPEENPTGCYRKSFLLPKEWTGRRILLHFEAVDSAFFAWVNGQFVGYS 1488
            N  YPF  +PP  PE+NPTGCYR  F +PKEW  RRILLHFEAVDSAFFAW+NG  VGYS
Sbjct: 140  NVVYPFPNDPPYVPEDNPTGCYRTYFQIPKEWKDRRILLHFEAVDSAFFAWINGNPVGYS 199

Query: 1487 QDSRLPAEFEITKYCYEPGSTKENILAVQVMRWSDGSYLEDQDHWWLSGIHRDVILISKP 1308
            QDSRLPAEFEI+ YCY   S K+N+LAVQV RWSDGSYLEDQDHWWLSGIHRDV+L++KP
Sbjct: 200  QDSRLPAEFEISDYCYPWDSGKQNVLAVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLAKP 259

Query: 1307 KVFISDYFFKSCLMSDFSSSDIEVEVLIEGSHQMVVDGQLGLYSVECFVYDAERNSAESG 1128
            KVFI+DYFFKS L  DFS +DI+VEV I+   +   D  L  + +E  ++D +      G
Sbjct: 260  KVFIADYFFKSKLADDFSYADIQVEVKIDNMQESSKDLVLSNFIIEAAIFDTKNWYNSEG 319

Query: 1127 ESFANSKLMVTLQLTHPIRGTNLGNQGYTLLNGKLVNPKLWSAEDPYLYVLVVTLKDPTG 948
             S   S  +  L+L +P     LG  GY LL GKL +P LWSAE P +Y+LV+TLKD +G
Sbjct: 320  FSCELSPKVANLKL-NPSPSPTLGFHGY-LLEGKLDSPNLWSAEQPNVYILVLTLKDTSG 377

Query: 947  NIVDCESCQVGIRQISRGPKQLLVNGKPVVMHGVNRHEHHPLVGKTNIDSCMIKDIVLMK 768
             ++D ES  VGIRQ+S+  KQLLVNG PVV+ GVNRHEHHP VGKTNI++CM+KD+++MK
Sbjct: 378  KVLDSESSIVGIRQVSKAFKQLLVNGHPVVIKGVNRHEHHPRVGKTNIEACMVKDLIMMK 437

Query: 767  QHNINAVRNSHYPQHPRWYELCDLFGLYVIDEANIETHGFGPYTERTHPASEPMWAAAML 588
            ++NINAVRNSHYPQHPRWYELCDLFG+Y+IDEANIETHGF       HPA EP WAAAML
Sbjct: 438  EYNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHLKHPAKEPSWAAAML 497

Query: 587  DRVINMLERDKNHSCIILWSLGNEASYGPNHAALAGWLRERDSTRLLHYEGGGSRTTSTD 408
            DRV+ M+ERDKNH+CII WSLGNEA YGPNH+A+AGW+RE+D +RL+HYEGGGSRT+STD
Sbjct: 498  DRVVGMVERDKNHTCIISWSLGNEAGYGPNHSAMAGWIREKDPSRLVHYEGGGSRTSSTD 557

Query: 407  VVCPMYMRVWDMIKIAEDPTECRPLIQCEYSHAMGNSNGNLHKYWEAIYNTPGLQGGFIW 228
            +VCPMYMRVWD+IKIA D  E RPLI CEY HAMGNSNGN+ +YWEAI NT GLQGGFIW
Sbjct: 558  IVCPMYMRVWDIIKIALDQNESRPLILCEYQHAMGNSNGNIDEYWEAIDNTFGLQGGFIW 617

Query: 227  DWVDQGLLKEGSNSKKHWAYGGDFGDKPNDLNFCINGLLWPDRTCHPGLYEVQYCYQPIC 48
            DWVDQGLLK GS+  K WAYGGDFGD+PNDLNFC+NGL+WPDRT HP L+EV++CYQPI 
Sbjct: 618  DWVDQGLLKLGSDGIKRWAYGGDFGDQPNDLNFCLNGLIWPDRTPHPALHEVKHCYQPIK 677

Query: 47   VMQKENAIELRSRNF 3
            V   +  I++ +  F
Sbjct: 678  VSLTDGMIKVANTYF 692


>ref|NP_001190087.1| glycoside hydrolase family 2 protein [Arabidopsis thaliana]
            gi|332645711|gb|AEE79232.1| glycoside hydrolase family 2
            protein [Arabidopsis thaliana]
          Length = 1120

 Score =  880 bits (2273), Expect = 0.0
 Identities = 413/615 (67%), Positives = 484/615 (78%)
 Frame = -3

Query: 1847 WVKGLCYIRSLSGQWKFLLYDKPEAVHDKFQDETFDDSLWATLPVPSNWQLHGFDRPIYT 1668
            WV GL +++SLSG WKF L  KP  V DKF D  F DS W  L VPSNWQ HGFDRPIYT
Sbjct: 93   WVDGLPFVKSLSGYWKFFLAPKPANVPDKFYDAAFSDSDWNALQVPSNWQCHGFDRPIYT 152

Query: 1667 NTDYPFSFNPPRAPEENPTGCYRKSFLLPKEWTGRRILLHFEAVDSAFFAWVNGQFVGYS 1488
            N  YPF  +PP  PE+NPTGCYR  F +PKEW  RRILLHFEAVDSAFFAW+NG  VGYS
Sbjct: 153  NVVYPFPNDPPYVPEDNPTGCYRTYFQIPKEWKDRRILLHFEAVDSAFFAWINGNPVGYS 212

Query: 1487 QDSRLPAEFEITKYCYEPGSTKENILAVQVMRWSDGSYLEDQDHWWLSGIHRDVILISKP 1308
            QDSRLPAEFEI+ YCY   S K+N+LAVQV RWSDGSYLEDQDHWWLSGIHRDV+L++KP
Sbjct: 213  QDSRLPAEFEISDYCYPWDSGKQNVLAVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLAKP 272

Query: 1307 KVFISDYFFKSCLMSDFSSSDIEVEVLIEGSHQMVVDGQLGLYSVECFVYDAERNSAESG 1128
            KVFI+DYFFKS L  DFS +DI+VEV I+   +   D  L  + +E  ++D +      G
Sbjct: 273  KVFIADYFFKSKLADDFSYADIQVEVKIDNMQESSKDLVLSNFIIEAAIFDTKNWYNSEG 332

Query: 1127 ESFANSKLMVTLQLTHPIRGTNLGNQGYTLLNGKLVNPKLWSAEDPYLYVLVVTLKDPTG 948
             S   S  +  L+L +P     LG  GY LL GKL +P LWSAE P +Y+LV+TLKD +G
Sbjct: 333  FSCELSPKVANLKL-NPSPSPTLGFHGY-LLEGKLDSPNLWSAEQPNVYILVLTLKDTSG 390

Query: 947  NIVDCESCQVGIRQISRGPKQLLVNGKPVVMHGVNRHEHHPLVGKTNIDSCMIKDIVLMK 768
             ++D ES  VGIRQ+S+  KQLLVNG PVV+ GVNRHEHHP VGKTNI++CM+KD+++MK
Sbjct: 391  KVLDSESSIVGIRQVSKAFKQLLVNGHPVVIKGVNRHEHHPRVGKTNIEACMVKDLIMMK 450

Query: 767  QHNINAVRNSHYPQHPRWYELCDLFGLYVIDEANIETHGFGPYTERTHPASEPMWAAAML 588
            ++NINAVRNSHYPQHPRWYELCDLFG+Y+IDEANIETHGF       HPA EP WAAAML
Sbjct: 451  EYNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHLKHPAKEPSWAAAML 510

Query: 587  DRVINMLERDKNHSCIILWSLGNEASYGPNHAALAGWLRERDSTRLLHYEGGGSRTTSTD 408
            DRV+ M+ERDKNH+CII WSLGNEA YGPNH+A+AGW+RE+D +RL+HYEGGGSRT+STD
Sbjct: 511  DRVVGMVERDKNHTCIISWSLGNEAGYGPNHSAMAGWIREKDPSRLVHYEGGGSRTSSTD 570

Query: 407  VVCPMYMRVWDMIKIAEDPTECRPLIQCEYSHAMGNSNGNLHKYWEAIYNTPGLQGGFIW 228
            +VCPMYMRVWD+IKIA D  E RPLI CEY HAMGNSNGN+ +YWEAI NT GLQGGFIW
Sbjct: 571  IVCPMYMRVWDIIKIALDQNESRPLILCEYQHAMGNSNGNIDEYWEAIDNTFGLQGGFIW 630

Query: 227  DWVDQGLLKEGSNSKKHWAYGGDFGDKPNDLNFCINGLLWPDRTCHPGLYEVQYCYQPIC 48
            DWVDQGLLK GS+  K WAYGGDFGD+PNDLNFC+NGL+WPDRT HP L+EV++CYQPI 
Sbjct: 631  DWVDQGLLKLGSDGIKRWAYGGDFGDQPNDLNFCLNGLIWPDRTPHPALHEVKHCYQPIK 690

Query: 47   VMQKENAIELRSRNF 3
            V   +  I++ +  F
Sbjct: 691  VSLTDGMIKVANTYF 705


>ref|NP_680128.1| glycoside hydrolase family 2 protein [Arabidopsis thaliana]
            gi|20147224|gb|AAM10327.1| At3g54435 [Arabidopsis
            thaliana] gi|332645709|gb|AEE79230.1| glycoside hydrolase
            family 2 protein [Arabidopsis thaliana]
          Length = 1107

 Score =  880 bits (2273), Expect = 0.0
 Identities = 413/615 (67%), Positives = 484/615 (78%)
 Frame = -3

Query: 1847 WVKGLCYIRSLSGQWKFLLYDKPEAVHDKFQDETFDDSLWATLPVPSNWQLHGFDRPIYT 1668
            WV GL +++SLSG WKF L  KP  V DKF D  F DS W  L VPSNWQ HGFDRPIYT
Sbjct: 80   WVDGLPFVKSLSGYWKFFLAPKPANVPDKFYDAAFSDSDWNALQVPSNWQCHGFDRPIYT 139

Query: 1667 NTDYPFSFNPPRAPEENPTGCYRKSFLLPKEWTGRRILLHFEAVDSAFFAWVNGQFVGYS 1488
            N  YPF  +PP  PE+NPTGCYR  F +PKEW  RRILLHFEAVDSAFFAW+NG  VGYS
Sbjct: 140  NVVYPFPNDPPYVPEDNPTGCYRTYFQIPKEWKDRRILLHFEAVDSAFFAWINGNPVGYS 199

Query: 1487 QDSRLPAEFEITKYCYEPGSTKENILAVQVMRWSDGSYLEDQDHWWLSGIHRDVILISKP 1308
            QDSRLPAEFEI+ YCY   S K+N+LAVQV RWSDGSYLEDQDHWWLSGIHRDV+L++KP
Sbjct: 200  QDSRLPAEFEISDYCYPWDSGKQNVLAVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLAKP 259

Query: 1307 KVFISDYFFKSCLMSDFSSSDIEVEVLIEGSHQMVVDGQLGLYSVECFVYDAERNSAESG 1128
            KVFI+DYFFKS L  DFS +DI+VEV I+   +   D  L  + +E  ++D +      G
Sbjct: 260  KVFIADYFFKSKLADDFSYADIQVEVKIDNMQESSKDLVLSNFIIEAAIFDTKNWYNSEG 319

Query: 1127 ESFANSKLMVTLQLTHPIRGTNLGNQGYTLLNGKLVNPKLWSAEDPYLYVLVVTLKDPTG 948
             S   S  +  L+L +P     LG  GY LL GKL +P LWSAE P +Y+LV+TLKD +G
Sbjct: 320  FSCELSPKVANLKL-NPSPSPTLGFHGY-LLEGKLDSPNLWSAEQPNVYILVLTLKDTSG 377

Query: 947  NIVDCESCQVGIRQISRGPKQLLVNGKPVVMHGVNRHEHHPLVGKTNIDSCMIKDIVLMK 768
             ++D ES  VGIRQ+S+  KQLLVNG PVV+ GVNRHEHHP VGKTNI++CM+KD+++MK
Sbjct: 378  KVLDSESSIVGIRQVSKAFKQLLVNGHPVVIKGVNRHEHHPRVGKTNIEACMVKDLIMMK 437

Query: 767  QHNINAVRNSHYPQHPRWYELCDLFGLYVIDEANIETHGFGPYTERTHPASEPMWAAAML 588
            ++NINAVRNSHYPQHPRWYELCDLFG+Y+IDEANIETHGF       HPA EP WAAAML
Sbjct: 438  EYNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHLKHPAKEPSWAAAML 497

Query: 587  DRVINMLERDKNHSCIILWSLGNEASYGPNHAALAGWLRERDSTRLLHYEGGGSRTTSTD 408
            DRV+ M+ERDKNH+CII WSLGNEA YGPNH+A+AGW+RE+D +RL+HYEGGGSRT+STD
Sbjct: 498  DRVVGMVERDKNHTCIISWSLGNEAGYGPNHSAMAGWIREKDPSRLVHYEGGGSRTSSTD 557

Query: 407  VVCPMYMRVWDMIKIAEDPTECRPLIQCEYSHAMGNSNGNLHKYWEAIYNTPGLQGGFIW 228
            +VCPMYMRVWD+IKIA D  E RPLI CEY HAMGNSNGN+ +YWEAI NT GLQGGFIW
Sbjct: 558  IVCPMYMRVWDIIKIALDQNESRPLILCEYQHAMGNSNGNIDEYWEAIDNTFGLQGGFIW 617

Query: 227  DWVDQGLLKEGSNSKKHWAYGGDFGDKPNDLNFCINGLLWPDRTCHPGLYEVQYCYQPIC 48
            DWVDQGLLK GS+  K WAYGGDFGD+PNDLNFC+NGL+WPDRT HP L+EV++CYQPI 
Sbjct: 618  DWVDQGLLKLGSDGIKRWAYGGDFGDQPNDLNFCLNGLIWPDRTPHPALHEVKHCYQPIK 677

Query: 47   VMQKENAIELRSRNF 3
            V   +  I++ +  F
Sbjct: 678  VSLTDGMIKVANTYF 692


>ref|XP_006293102.1| hypothetical protein CARUB_v10019396mg [Capsella rubella]
            gi|482561809|gb|EOA26000.1| hypothetical protein
            CARUB_v10019396mg [Capsella rubella]
          Length = 1107

 Score =  879 bits (2270), Expect = 0.0
 Identities = 410/615 (66%), Positives = 483/615 (78%)
 Frame = -3

Query: 1847 WVKGLCYIRSLSGQWKFLLYDKPEAVHDKFQDETFDDSLWATLPVPSNWQLHGFDRPIYT 1668
            WV GL +++SLSG WKF L  KP  V + F D  F DS W  LPVPSNWQ HGFDRPIYT
Sbjct: 80   WVDGLPFVKSLSGYWKFFLAPKPANVPENFYDAAFPDSDWDALPVPSNWQCHGFDRPIYT 139

Query: 1667 NTDYPFSFNPPRAPEENPTGCYRKSFLLPKEWTGRRILLHFEAVDSAFFAWVNGQFVGYS 1488
            N  YPF  +PP  PE+NPTGCYR  F +PKEW  RRILLHFEAVDSAFFAW+NG  +GYS
Sbjct: 140  NVVYPFPNDPPHVPEDNPTGCYRTYFQIPKEWKDRRILLHFEAVDSAFFAWINGNPIGYS 199

Query: 1487 QDSRLPAEFEITKYCYEPGSTKENILAVQVMRWSDGSYLEDQDHWWLSGIHRDVILISKP 1308
            QDSRLPAEFEI++YCY   S K+N+LAVQV RWSDGSYLEDQDHWWLSGIHRDV+L++KP
Sbjct: 200  QDSRLPAEFEISEYCYPWDSGKQNVLAVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLAKP 259

Query: 1307 KVFISDYFFKSCLMSDFSSSDIEVEVLIEGSHQMVVDGQLGLYSVECFVYDAERNSAESG 1128
            KVFI+DYFFKS L  DFS +DI+VEV I+   +   D  L  + +E  V+  +      G
Sbjct: 260  KVFIADYFFKSKLADDFSYADIQVEVKIDNMQESSKDLVLSNFIIEAAVFSTKNWYNSEG 319

Query: 1127 ESFANSKLMVTLQLTHPIRGTNLGNQGYTLLNGKLVNPKLWSAEDPYLYVLVVTLKDPTG 948
             S   S  +  L L +P     LG  GY LL GKL +P LWSAE P +Y+LV+TLKD +G
Sbjct: 320  FSSELSPKVANLTL-NPSPSPVLGFHGY-LLEGKLDSPNLWSAEQPNVYILVLTLKDTSG 377

Query: 947  NIVDCESCQVGIRQISRGPKQLLVNGKPVVMHGVNRHEHHPLVGKTNIDSCMIKDIVLMK 768
             I+D ES  VGIRQ+S+  KQLLVNG PVV+ GVNRHEHHP VGKTNI+SCM+KD+++MK
Sbjct: 378  KILDSESSIVGIRQVSKAFKQLLVNGHPVVIKGVNRHEHHPRVGKTNIESCMVKDLIMMK 437

Query: 767  QHNINAVRNSHYPQHPRWYELCDLFGLYVIDEANIETHGFGPYTERTHPASEPMWAAAML 588
            ++NINAVRNSHYPQHPRWYELCDLFG+Y+IDEANIETHGF       HPA EP WAAAML
Sbjct: 438  EYNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHLKHPAKEPSWAAAML 497

Query: 587  DRVINMLERDKNHSCIILWSLGNEASYGPNHAALAGWLRERDSTRLLHYEGGGSRTTSTD 408
            DRV+ M+ERDKNH+CI+ WSLGNEA YGPNH+A+AGW+RE+D +RL+HYEGGGSRT+STD
Sbjct: 498  DRVVGMVERDKNHTCIVSWSLGNEAGYGPNHSAMAGWIREKDPSRLVHYEGGGSRTSSTD 557

Query: 407  VVCPMYMRVWDMIKIAEDPTECRPLIQCEYSHAMGNSNGNLHKYWEAIYNTPGLQGGFIW 228
            ++CPMYMRVWD++KIA D  E RPLI CEY HAMGNSNGN+ +YWEAI NT GLQGGFIW
Sbjct: 558  IICPMYMRVWDIVKIALDQNESRPLILCEYQHAMGNSNGNIDEYWEAIDNTFGLQGGFIW 617

Query: 227  DWVDQGLLKEGSNSKKHWAYGGDFGDKPNDLNFCINGLLWPDRTCHPGLYEVQYCYQPIC 48
            DWVDQGLLK GS+  K WAYGGDFGD+PNDLNFC+NGL+WPDRT HP L+EV+YCYQPI 
Sbjct: 618  DWVDQGLLKPGSDGIKRWAYGGDFGDQPNDLNFCLNGLIWPDRTPHPALHEVKYCYQPIN 677

Query: 47   VMQKENAIELRSRNF 3
            V   +  +++ +  F
Sbjct: 678  VSLTDGTMKVANTYF 692


>ref|NP_001045421.1| Os01g0952600 [Oryza sativa Japonica Group]
            gi|57899943|dbj|BAD87855.1| putative beta-galactosidase
            [Oryza sativa Japonica Group]
            gi|113534952|dbj|BAF07335.1| Os01g0952600 [Oryza sativa
            Japonica Group] gi|222619883|gb|EEE56015.1| hypothetical
            protein OsJ_04784 [Oryza sativa Japonica Group]
          Length = 1117

 Score =  878 bits (2269), Expect = 0.0
 Identities = 410/620 (66%), Positives = 487/620 (78%), Gaps = 5/620 (0%)
 Frame = -3

Query: 1847 WVKGLCYIRSLSGQWKFLLYDKPEAVHDKFQDETFDDSLWATLPVPSNWQLHGFDRPIYT 1668
            W KGL Y+++LSG WKFLL   PE+V +KF D  F+DS W  LPVPSNWQ+HGFDRPIYT
Sbjct: 81   WSKGLPYVQTLSGYWKFLLASSPESVPEKFYDAYFNDSDWEALPVPSNWQMHGFDRPIYT 140

Query: 1667 NTDYPFSFNPPRAPEENPTGCYRKSFLLPKEWTGRRILLHFEAVDSAFFAWVNGQFVGYS 1488
            N  YPF+ NPP  P +NPTGCYR  F +PKEW GRRILLHFEAVDSAFFAWVNG  VGYS
Sbjct: 141  NVTYPFTMNPPFVPNDNPTGCYRTVFRIPKEWKGRRILLHFEAVDSAFFAWVNGVPVGYS 200

Query: 1487 QDSRLPAEFEITKYCYEPGSTKENILAVQVMRWSDGSYLEDQDHWWLSGIHRDVILISKP 1308
            QDSRLPAEFEIT +C+   S KEN+LAVQVMRWSDGSYLEDQDHWWLSGIHRDV+L+SKP
Sbjct: 201  QDSRLPAEFEITDFCHPCDSEKENVLAVQVMRWSDGSYLEDQDHWWLSGIHRDVLLVSKP 260

Query: 1307 KVFISDYFFKSCLMSDFSSSDIEVEVLIEGSHQMVVDGQLGLYSVECFVYDAER-----N 1143
            ++FI+DYFFK+ L   F  +DIEVEV I+   Q      +   S+E  +YD         
Sbjct: 261  QIFITDYFFKATLDEGFRVADIEVEVEIDSQKQD--REHVSTLSIEATLYDNYGPADVLT 318

Query: 1142 SAESGESFANSKLMVTLQLTHPIRGTNLGNQGYTLLNGKLVNPKLWSAEDPYLYVLVVTL 963
            S  S  S AN KL    +  H       G  GY +L GK+ NPKLWS+E P LY LVV L
Sbjct: 319  SDMSAASVANLKLKPASRPKHCY-----GFHGY-VLGGKVENPKLWSSEHPNLYTLVVVL 372

Query: 962  KDPTGNIVDCESCQVGIRQISRGPKQLLVNGKPVVMHGVNRHEHHPLVGKTNIDSCMIKD 783
            KD  G +++CESCQVGIR +    KQ+LVNG PVV+ GVNRHEHHP VGKTN+++CMIKD
Sbjct: 373  KDSNGKLIECESCQVGIRNVVLAHKQMLVNGCPVVIRGVNRHEHHPRVGKTNLEACMIKD 432

Query: 782  IVLMKQHNINAVRNSHYPQHPRWYELCDLFGLYVIDEANIETHGFGPYTERTHPASEPMW 603
            +VLM+Q+NINAVRNSHYPQHPRWYELCD+FGLYVIDEANIETHGF   +   HP  EP W
Sbjct: 433  LVLMRQNNINAVRNSHYPQHPRWYELCDIFGLYVIDEANIETHGFDESSHFKHPTLEPFW 492

Query: 602  AAAMLDRVINMLERDKNHSCIILWSLGNEASYGPNHAALAGWLRERDSTRLLHYEGGGSR 423
            A+AMLDRV+ M+ERDKNH+CII+WSLGNE+SYGPNH+A++GW+R +D TR +HYEGGGSR
Sbjct: 493  ASAMLDRVVGMVERDKNHACIIVWSLGNESSYGPNHSAMSGWIRGKDPTRPIHYEGGGSR 552

Query: 422  TTSTDVVCPMYMRVWDMIKIAEDPTECRPLIQCEYSHAMGNSNGNLHKYWEAIYNTPGLQ 243
            T+STD+VCPMYMRVWD++KIA+DP+E RPLI CEYSHAMGNSNGN+  YW AI NT GLQ
Sbjct: 553  TSSTDIVCPMYMRVWDILKIAQDPSENRPLILCEYSHAMGNSNGNIDAYWMAIDNTVGLQ 612

Query: 242  GGFIWDWVDQGLLKEGSNSKKHWAYGGDFGDKPNDLNFCINGLLWPDRTCHPGLYEVQYC 63
            GGFIWDWVDQGLLKE ++  K+WAYGGDFGD PNDLNFC+NG++WPDRT HP ++EV+Y 
Sbjct: 613  GGFIWDWVDQGLLKEDADGSKNWAYGGDFGDTPNDLNFCLNGIVWPDRTIHPAVHEVKYL 672

Query: 62   YQPICVMQKENAIELRSRNF 3
            YQPI +   +N +++ + +F
Sbjct: 673  YQPIKITMMDNMLKIENVHF 692


>ref|XP_002877978.1| hydrolase, hydrolyzing O-glycosyl compounds [Arabidopsis lyrata
            subsp. lyrata] gi|297323816|gb|EFH54237.1| hydrolase,
            hydrolyzing O-glycosyl compounds [Arabidopsis lyrata
            subsp. lyrata]
          Length = 1107

 Score =  878 bits (2268), Expect = 0.0
 Identities = 412/615 (66%), Positives = 484/615 (78%)
 Frame = -3

Query: 1847 WVKGLCYIRSLSGQWKFLLYDKPEAVHDKFQDETFDDSLWATLPVPSNWQLHGFDRPIYT 1668
            WV GL +++SLSG WKF L  KP  V DKF D  F DS W  LPVPSNWQ HGFDRPIYT
Sbjct: 80   WVDGLPFVKSLSGYWKFFLAPKPANVPDKFYDPAFPDSDWNALPVPSNWQCHGFDRPIYT 139

Query: 1667 NTDYPFSFNPPRAPEENPTGCYRKSFLLPKEWTGRRILLHFEAVDSAFFAWVNGQFVGYS 1488
            N  YPF  +PP  PE+NPTGCYR  F +PKEW  RRILLHFEAVDSAFFAW+NG  VGYS
Sbjct: 140  NVVYPFPNDPPHVPEDNPTGCYRTYFQIPKEWKDRRILLHFEAVDSAFFAWINGNPVGYS 199

Query: 1487 QDSRLPAEFEITKYCYEPGSTKENILAVQVMRWSDGSYLEDQDHWWLSGIHRDVILISKP 1308
            QDSRLPAEFEI+ YCY   S K+N+LAVQV RWSDGSYLEDQDHWWLSGIHRDV+L++KP
Sbjct: 200  QDSRLPAEFEISDYCYPWDSGKQNVLAVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLAKP 259

Query: 1307 KVFISDYFFKSCLMSDFSSSDIEVEVLIEGSHQMVVDGQLGLYSVECFVYDAERNSAESG 1128
            KVFI+DYFFKS L  DFS +DI+VEV I+   +      L  + +E  V+D +      G
Sbjct: 260  KVFIADYFFKSKLADDFSYADIQVEVKIDNMQESSKHLVLSNFIIEAAVFDTKNWYNSEG 319

Query: 1127 ESFANSKLMVTLQLTHPIRGTNLGNQGYTLLNGKLVNPKLWSAEDPYLYVLVVTLKDPTG 948
             +   S  +  L+L +P     LG  GY LL GKL +P LWSAE P +Y+LV+TLKD +G
Sbjct: 320  FNCELSPKVAHLKL-NPSPSPTLGFHGY-LLEGKLDSPNLWSAEQPNVYILVLTLKDTSG 377

Query: 947  NIVDCESCQVGIRQISRGPKQLLVNGKPVVMHGVNRHEHHPLVGKTNIDSCMIKDIVLMK 768
             ++D ES  VGIRQ+S+  KQLLVNG PVV+ GVNRHEHHP VGKTNI++CM+KD+++MK
Sbjct: 378  KVLDSESSIVGIRQVSKAFKQLLVNGHPVVIKGVNRHEHHPRVGKTNIEACMVKDLIMMK 437

Query: 767  QHNINAVRNSHYPQHPRWYELCDLFGLYVIDEANIETHGFGPYTERTHPASEPMWAAAML 588
            ++NINAVRNSHYPQHPRWYELCDLFG+Y+IDEANIETHGF       HPA EP WAAAML
Sbjct: 438  EYNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHLKHPAKEPSWAAAML 497

Query: 587  DRVINMLERDKNHSCIILWSLGNEASYGPNHAALAGWLRERDSTRLLHYEGGGSRTTSTD 408
            DRV+ M+ERDKNH+CII WSLGNEA YGPNH+A+AGW+RE+D +RL+HYEGGGSRT+STD
Sbjct: 498  DRVVGMVERDKNHTCIISWSLGNEAGYGPNHSAMAGWIREKDPSRLVHYEGGGSRTSSTD 557

Query: 407  VVCPMYMRVWDMIKIAEDPTECRPLIQCEYSHAMGNSNGNLHKYWEAIYNTPGLQGGFIW 228
            +VCPMYMRVWD+IKIA D  E RPLI CEY HAMGNSNGN+ +YW+AI NT GLQGGFIW
Sbjct: 558  IVCPMYMRVWDIIKIALDQNESRPLILCEYQHAMGNSNGNIDEYWDAIDNTFGLQGGFIW 617

Query: 227  DWVDQGLLKEGSNSKKHWAYGGDFGDKPNDLNFCINGLLWPDRTCHPGLYEVQYCYQPIC 48
            DWVDQGLLK GS+  K WAYGGDFGD+PNDLNFC+NGL+WPDRT HP L+EV++CYQPI 
Sbjct: 618  DWVDQGLLKLGSDGIKRWAYGGDFGDQPNDLNFCLNGLIWPDRTPHPALHEVKHCYQPIK 677

Query: 47   VMQKENAIELRSRNF 3
            V   +  I++ +  F
Sbjct: 678  VSLTDGLIKVANTYF 692


>gb|EMJ20103.1| hypothetical protein PRUPE_ppa000532mg [Prunus persica]
          Length = 1111

 Score =  877 bits (2266), Expect = 0.0
 Identities = 409/622 (65%), Positives = 483/622 (77%), Gaps = 6/622 (0%)
 Frame = -3

Query: 1850 VWVKGLCYIRSLSGQWKFLLYDKPEAVHDKFQDETFDDSLWATLPVPSNWQLHGFDRPIY 1671
            +WVK L +++SLSG WKF L   P  V   F D  F DS W TLPVPSNWQ+HGFDRPIY
Sbjct: 80   LWVKDLPFVKSLSGYWKFFLASSPRNVPVNFYDTAFQDSEWETLPVPSNWQMHGFDRPIY 139

Query: 1670 TNTDYPFSFNPPRAPEENPTGCYRKSFLLPKEWTGRRILLHFEAVDSAFFAWVNGQFVGY 1491
            TN  YPF  +PP  P +NPTGCYR  F +PKEW GRRILLHFEAVDSAF AW+NG  +GY
Sbjct: 140  TNVVYPFPLDPPFVPVDNPTGCYRTYFHIPKEWKGRRILLHFEAVDSAFCAWLNGVPIGY 199

Query: 1490 SQDSRLPAEFEITKYCYEPGSTKENILAVQVMRWSDGSYLEDQDHWWLSGIHRDVILISK 1311
            SQDSRLPAEFEIT YCY     K+N+LAVQV RWSDGSYLEDQDHWWLSGIHRDV+L+SK
Sbjct: 200  SQDSRLPAEFEITDYCYPSDMDKKNVLAVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLSK 259

Query: 1310 PKVFISDYFFKSCLMSDFSSSDIEVEVLIEGSHQMVVDGQLGLYSVECFVYDA------E 1149
            P+VFI+DYFFKS L  DFS +DI+VEV I+ S +   D  L  Y +E  ++D       +
Sbjct: 260  PQVFIADYFFKSTLAEDFSYADIQVEVKIDNSRETSKDSVLANYVIEAALFDTACWYSID 319

Query: 1148 RNSAESGESFANSKLMVTLQLTHPIRGTNLGNQGYTLLNGKLVNPKLWSAEDPYLYVLVV 969
            R +     + A+ KL ++         T+LG  GY LL G+L  P+LWSAE P LY L V
Sbjct: 320  RYADLHLSNVASIKLNLS-------SSTSLGFHGY-LLVGRLDMPRLWSAEQPSLYTLAV 371

Query: 968  TLKDPTGNIVDCESCQVGIRQISRGPKQLLVNGKPVVMHGVNRHEHHPLVGKTNIDSCMI 789
            TLKD +GN++DCES  VGIRQ+S+ PKQLLVNG P+++ GVNRHEHHP +GKTNI+SCM+
Sbjct: 372  TLKDASGNLLDCESSLVGIRQVSKAPKQLLVNGHPIIIRGVNRHEHHPRLGKTNIESCMV 431

Query: 788  KDIVLMKQHNINAVRNSHYPQHPRWYELCDLFGLYVIDEANIETHGFGPYTERTHPASEP 609
            KD+VLMKQ+NINAVRNSHYPQHPRWYELCDLFG+Y+IDEANIETHGF       HP  EP
Sbjct: 432  KDLVLMKQYNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHVKHPTLEP 491

Query: 608  MWAAAMLDRVINMLERDKNHSCIILWSLGNEASYGPNHAALAGWLRERDSTRLLHYEGGG 429
             WA AM+DRVI M+ERDKNH+CII WSLGNEA YGPNH+ALAGW+R +D +RL+HYEGGG
Sbjct: 492  SWATAMMDRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGWVRGKDPSRLVHYEGGG 551

Query: 428  SRTTSTDVVCPMYMRVWDMIKIAEDPTECRPLIQCEYSHAMGNSNGNLHKYWEAIYNTPG 249
            SRT+STD++CPMYMRVWDM++I+ DP E RPLI CEYSHAMGNSNGNLH+YWE I +T G
Sbjct: 552  SRTSSTDIICPMYMRVWDMLQISRDPNETRPLILCEYSHAMGNSNGNLHEYWEVIDSTFG 611

Query: 248  LQGGFIWDWVDQGLLKEGSNSKKHWAYGGDFGDKPNDLNFCINGLLWPDRTCHPGLYEVQ 69
            LQGGFIWDWVDQ LLK+ ++  KHWAYGGDFGD PNDLNFC+NGL WPDRT HP L+EV+
Sbjct: 612  LQGGFIWDWVDQALLKDNADGSKHWAYGGDFGDVPNDLNFCLNGLTWPDRTPHPALHEVK 671

Query: 68   YCYQPICVMQKENAIELRSRNF 3
            Y YQPI V   +  + + + +F
Sbjct: 672  YVYQPIKVSFSKETLRITNTHF 693


>gb|EOY19806.1| Glycoside hydrolase family 2 protein isoform 2 [Theobroma cacao]
          Length = 1112

 Score =  875 bits (2262), Expect = 0.0
 Identities = 411/615 (66%), Positives = 482/615 (78%)
 Frame = -3

Query: 1847 WVKGLCYIRSLSGQWKFLLYDKPEAVHDKFQDETFDDSLWATLPVPSNWQLHGFDRPIYT 1668
            WV GL +++SLSG WKF L   P AV   F +  F DS W TLPVPSNWQ+HGFDRPIYT
Sbjct: 81   WVNGLPFVKSLSGYWKFFLASNPNAVPKNFYESAFQDSDWETLPVPSNWQMHGFDRPIYT 140

Query: 1667 NTDYPFSFNPPRAPEENPTGCYRKSFLLPKEWTGRRILLHFEAVDSAFFAWVNGQFVGYS 1488
            N  YP   +PP  P +NPTGCYR  F +P++W GRRILLHFEAVDSAF AW+NG  VGYS
Sbjct: 141  NVVYPIPLDPPHVPIDNPTGCYRTYFHIPEQWQGRRILLHFEAVDSAFCAWINGIPVGYS 200

Query: 1487 QDSRLPAEFEITKYCYEPGSTKENILAVQVMRWSDGSYLEDQDHWWLSGIHRDVILISKP 1308
            QDSRLPAEFEIT+YCY   S K+N+LAVQV RWSDGSYLEDQDHWWLSGIHRDV+L+SKP
Sbjct: 201  QDSRLPAEFEITEYCYSCDSDKKNVLAVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLSKP 260

Query: 1307 KVFISDYFFKSCLMSDFSSSDIEVEVLIEGSHQMVVDGQLGLYSVECFVYDAERNSAESG 1128
            +VFI+DYFFKS L  +FS +DI+VEV I+ S +M  D  L  +++E  ++DA       G
Sbjct: 261  QVFIADYFFKSSLAYNFSYADIQVEVKIDCSREMSKDKVLTDFTIEAALFDAGVWYNHDG 320

Query: 1127 ESFANSKLMVTLQLTHPIRGTNLGNQGYTLLNGKLVNPKLWSAEDPYLYVLVVTLKDPTG 948
                 S  +  + L     GT LG  GY L+ GKL  PKLWSAE P LY LV+ LKD +G
Sbjct: 321  NVDLLSSNVANIVLKTVPTGT-LGFHGYVLV-GKLEKPKLWSAEQPNLYTLVIILKDASG 378

Query: 947  NIVDCESCQVGIRQISRGPKQLLVNGKPVVMHGVNRHEHHPLVGKTNIDSCMIKDIVLMK 768
            N+VDCESC VG+RQ+S+ PKQLLVNG PVV+ GVNRHEHHP +GKTNI+SCM  D+V+MK
Sbjct: 379  NVVDCESCLVGVRQVSKAPKQLLVNGHPVVIRGVNRHEHHPRLGKTNIESCM--DLVVMK 436

Query: 767  QHNINAVRNSHYPQHPRWYELCDLFGLYVIDEANIETHGFGPYTERTHPASEPMWAAAML 588
            Q+NINAVRNSHYPQHPRWYELCDLFG+Y+IDEANIETHGF       H   EP WAAAM+
Sbjct: 437  QNNINAVRNSHYPQHPRWYELCDLFGIYMIDEANIETHGFDLSGHVKHLTQEPGWAAAMM 496

Query: 587  DRVINMLERDKNHSCIILWSLGNEASYGPNHAALAGWLRERDSTRLLHYEGGGSRTTSTD 408
            DRVI M+ERDKNH+CI  WSLGNE+ YGPNH+A AGW+R RD +RL+HYEGGGSRT+STD
Sbjct: 497  DRVIGMVERDKNHACIFSWSLGNESGYGPNHSASAGWIRGRDPSRLVHYEGGGSRTSSTD 556

Query: 407  VVCPMYMRVWDMIKIAEDPTECRPLIQCEYSHAMGNSNGNLHKYWEAIYNTPGLQGGFIW 228
            ++CPMYMRVWD++KIA+DP E RPLI CEYSHAMGNSNGN+H+YWEAI N  GLQGGFIW
Sbjct: 557  IICPMYMRVWDIVKIAKDPNETRPLILCEYSHAMGNSNGNIHEYWEAIDNIFGLQGGFIW 616

Query: 227  DWVDQGLLKEGSNSKKHWAYGGDFGDKPNDLNFCINGLLWPDRTCHPGLYEVQYCYQPIC 48
            DWVDQGLLK+  +  K+WAYGGDFGD PNDLNFC+NGL WPDRT HP L EV+Y YQPI 
Sbjct: 617  DWVDQGLLKDNEDGSKYWAYGGDFGDSPNDLNFCLNGLTWPDRTPHPALQEVKYVYQPIK 676

Query: 47   VMQKENAIELRSRNF 3
            V   E+ I++++ NF
Sbjct: 677  VSIGESMIKIKNTNF 691


>ref|XP_004142388.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
            gi|449487140|ref|XP_004157508.1| PREDICTED:
            beta-galactosidase-like [Cucumis sativus]
          Length = 1114

 Score =  875 bits (2262), Expect = 0.0
 Identities = 405/615 (65%), Positives = 484/615 (78%)
 Frame = -3

Query: 1847 WVKGLCYIRSLSGQWKFLLYDKPEAVHDKFQDETFDDSLWATLPVPSNWQLHGFDRPIYT 1668
            WVK L +I+SLSG WKF L   P +V   F    F+DS WA LPVPSNWQ+HGFDRPIYT
Sbjct: 80   WVKDLPFIKSLSGYWKFYLAATPTSVPHNFHATVFEDSQWANLPVPSNWQMHGFDRPIYT 139

Query: 1667 NTDYPFSFNPPRAPEENPTGCYRKSFLLPKEWTGRRILLHFEAVDSAFFAWVNGQFVGYS 1488
            N  YPF  +PP  PE+NPTGCYR  F LP+EW GRRILLHFEAVDSAFFAW+NG  VGYS
Sbjct: 140  NVVYPFPLDPPHVPEDNPTGCYRTYFHLPEEWKGRRILLHFEAVDSAFFAWINGSLVGYS 199

Query: 1487 QDSRLPAEFEITKYCYEPGSTKENILAVQVMRWSDGSYLEDQDHWWLSGIHRDVILISKP 1308
            QDSRLPAEFEIT+YC+  GS  +N+LAVQV++WSDGSYLEDQD WWLSGIHRDVIL+SKP
Sbjct: 200  QDSRLPAEFEITEYCHPCGSQSKNVLAVQVLKWSDGSYLEDQDQWWLSGIHRDVILLSKP 259

Query: 1307 KVFISDYFFKSCLMSDFSSSDIEVEVLIEGSHQMVVDGQLGLYSVECFVYDAERNSAESG 1128
            +VFI DYFFKS +  DFS +DI+VEV I+ S +   +  L  + +E  ++D+       G
Sbjct: 260  QVFIGDYFFKSHVGEDFSYADIQVEVKIDSSLEGRKENFLNNFKLEAVLFDSGSWDNHDG 319

Query: 1127 ESFANSKLMVTLQLTHPIRGTNLGNQGYTLLNGKLVNPKLWSAEDPYLYVLVVTLKDPTG 948
                 S  M  ++L+  +  T LG  GY +L G+L  PKLWSAE P+LY L+V LKD + 
Sbjct: 320  NIDLLSSNMANVKLSL-LSVTTLGFHGY-VLGGRLQKPKLWSAEQPHLYTLIVLLKDSSD 377

Query: 947  NIVDCESCQVGIRQISRGPKQLLVNGKPVVMHGVNRHEHHPLVGKTNIDSCMIKDIVLMK 768
             IVDCESC VGIR I++GPKQLLVNG+PVV+ GVNRHEHHP +GKTNI++CM++D+VLMK
Sbjct: 378  QIVDCESCLVGIRSITKGPKQLLVNGRPVVIRGVNRHEHHPRLGKTNIEACMVRDLVLMK 437

Query: 767  QHNINAVRNSHYPQHPRWYELCDLFGLYVIDEANIETHGFGPYTERTHPASEPMWAAAML 588
            QHNINAVRNSHYPQH RWYELCDLFG+Y++DEANIETHGF       HP  +P WAAAML
Sbjct: 438  QHNINAVRNSHYPQHSRWYELCDLFGMYMVDEANIETHGFDFSGHVKHPTLQPSWAAAML 497

Query: 587  DRVINMLERDKNHSCIILWSLGNEASYGPNHAALAGWLRERDSTRLLHYEGGGSRTTSTD 408
            DRVI M+ERDKNH+CII+WSLGNE+ YGPNH+ALAGW+R +DS+R+LHYEGGGSRT+STD
Sbjct: 498  DRVIGMVERDKNHACIIVWSLGNESGYGPNHSALAGWIRGKDSSRVLHYEGGGSRTSSTD 557

Query: 407  VVCPMYMRVWDMIKIAEDPTECRPLIQCEYSHAMGNSNGNLHKYWEAIYNTPGLQGGFIW 228
            ++CPMYMRVWD++ IA DP E RPLI CEYSH+MGNS GNLHKYWEAI NT GLQGGFIW
Sbjct: 558  IICPMYMRVWDIVNIANDPNETRPLILCEYSHSMGNSTGNLHKYWEAIDNTFGLQGGFIW 617

Query: 227  DWVDQGLLKEGSNSKKHWAYGGDFGDKPNDLNFCINGLLWPDRTCHPGLYEVQYCYQPIC 48
            DWVDQ LLKE  N +K WAYGG+FGD PND  FC+NG+ WPDRT HP L+EV+Y +Q I 
Sbjct: 618  DWVDQALLKEVGNGRKRWAYGGEFGDIPNDSTFCLNGVTWPDRTPHPALHEVKYLHQAIK 677

Query: 47   VMQKENAIELRSRNF 3
            +  K+  +E+ + +F
Sbjct: 678  ISSKDGTLEVLNGHF 692


>ref|XP_006487669.1| PREDICTED: beta-galactosidase-like [Citrus sinensis]
          Length = 1115

 Score =  874 bits (2258), Expect = 0.0
 Identities = 412/615 (66%), Positives = 477/615 (77%)
 Frame = -3

Query: 1847 WVKGLCYIRSLSGQWKFLLYDKPEAVHDKFQDETFDDSLWATLPVPSNWQLHGFDRPIYT 1668
            W  GL +++SLSG WKF L   P  V   F   +F DS W  +PVPSNWQ+HGFDRPIYT
Sbjct: 82   WANGLPFVKSLSGHWKFFLASSPPDVPLNFHKSSFQDSKWEAIPVPSNWQMHGFDRPIYT 141

Query: 1667 NTDYPFSFNPPRAPEENPTGCYRKSFLLPKEWTGRRILLHFEAVDSAFFAWVNGQFVGYS 1488
            N  YPF  +PP  P ENPTGCYR  F +PKEW GRRILLHFEAVDSAF AW+NG  VGYS
Sbjct: 142  NVVYPFPLDPPNVPAENPTGCYRTYFHIPKEWQGRRILLHFEAVDSAFCAWINGVPVGYS 201

Query: 1487 QDSRLPAEFEITKYCYEPGSTKENILAVQVMRWSDGSYLEDQDHWWLSGIHRDVILISKP 1308
            QDSRLPAEFEI+ YCY  GS K+N+LAVQV RWSDGSYLEDQDHWWLSGIHRDV+L++KP
Sbjct: 202  QDSRLPAEFEISDYCYPHGSDKKNVLAVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLAKP 261

Query: 1307 KVFISDYFFKSCLMSDFSSSDIEVEVLIEGSHQMVVDGQLGLYSVECFVYDAERNSAESG 1128
            +VFI+DYFFKS L  DFS +DI+VEV I+ S ++  D  L  + +E  +YD        G
Sbjct: 262  QVFIADYFFKSNLAEDFSLADIQVEVEIDCSPEISKDSILANFVIEAGLYDTGSWYNCDG 321

Query: 1127 ESFANSKLMVTLQLTHPIRGTNLGNQGYTLLNGKLVNPKLWSAEDPYLYVLVVTLKDPTG 948
                 S  +  +QL            GY L+ GKL  P+LWSAE P LY LVV LK  +G
Sbjct: 322  CIDLLSSKVANIQLNPSTASVEF--PGYMLV-GKLEMPRLWSAEQPNLYTLVVILKHASG 378

Query: 947  NIVDCESCQVGIRQISRGPKQLLVNGKPVVMHGVNRHEHHPLVGKTNIDSCMIKDIVLMK 768
             +VDCESC VGIRQ+S+ PKQLLVNG PVV+ GVNRHEHHP VGKTNI+SCM+KD+VLMK
Sbjct: 379  PVVDCESCLVGIRQVSKAPKQLLVNGNPVVIRGVNRHEHHPRVGKTNIESCMVKDLVLMK 438

Query: 767  QHNINAVRNSHYPQHPRWYELCDLFGLYVIDEANIETHGFGPYTERTHPASEPMWAAAML 588
            Q+NINAVRNSHYPQHPRWYELCDLFGLY+IDEANIETHGF       HP  EP WAAAM+
Sbjct: 439  QNNINAVRNSHYPQHPRWYELCDLFGLYMIDEANIETHGFYFSEHLKHPTMEPSWAAAMM 498

Query: 587  DRVINMLERDKNHSCIILWSLGNEASYGPNHAALAGWLRERDSTRLLHYEGGGSRTTSTD 408
            DRVI M+ERDKNH+ II WSLGNEA +GPNH+A AGW+R +D +RLLHYEGGGSRT STD
Sbjct: 499  DRVIGMVERDKNHASIICWSLGNEAGHGPNHSAAAGWIRGKDPSRLLHYEGGGSRTPSTD 558

Query: 407  VVCPMYMRVWDMIKIAEDPTECRPLIQCEYSHAMGNSNGNLHKYWEAIYNTPGLQGGFIW 228
            +VCPMYMRVWD++ IA+DPTE RPLI CEYSHAMGNSNGN+H+YWEAI +T GLQGGFIW
Sbjct: 559  IVCPMYMRVWDIVMIAKDPTETRPLILCEYSHAMGNSNGNIHEYWEAIDSTFGLQGGFIW 618

Query: 227  DWVDQGLLKEGSNSKKHWAYGGDFGDKPNDLNFCINGLLWPDRTCHPGLYEVQYCYQPIC 48
            DWVDQGLL+E ++  KHWAYGGDFGD PNDLNFC+NGLLWPDRT HP L+EV+Y YQ I 
Sbjct: 619  DWVDQGLLRELADGTKHWAYGGDFGDTPNDLNFCLNGLLWPDRTPHPALHEVKYVYQAIK 678

Query: 47   VMQKENAIELRSRNF 3
            V  K+  +++ + NF
Sbjct: 679  VSLKKGTLKISNTNF 693

