BLASTX nr result

ID: Papaver27_contig00007569 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver27_contig00007569
         (2140 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004308587.1| PREDICTED: beta-galactosidase-like [Fragaria...  1176   0.0  
ref|XP_007010995.1| Glycoside hydrolase family 2 protein isoform...  1165   0.0  
ref|XP_006487669.1| PREDICTED: beta-galactosidase-like [Citrus s...  1163   0.0  
ref|XP_007010996.1| Glycoside hydrolase family 2 protein isoform...  1157   0.0  
ref|XP_007218904.1| hypothetical protein PRUPE_ppa000532mg [Prun...  1157   0.0  
ref|XP_003634896.1| PREDICTED: beta-galactosidase [Vitis vinifera]   1148   0.0  
ref|XP_002266400.1| PREDICTED: beta-galactosidase isoform 1 [Vit...  1148   0.0  
ref|XP_002513059.1| beta-galactosidase, putative [Ricinus commun...  1145   0.0  
ref|XP_002299206.2| glycoside hydrolase family 2 family protein ...  1144   0.0  
ref|XP_007220592.1| hypothetical protein PRUPE_ppa000508mg [Prun...  1130   0.0  
ref|XP_002303929.2| glycoside hydrolase family 2 family protein ...  1128   0.0  
ref|XP_004142388.1| PREDICTED: beta-galactosidase-like [Cucumis ...  1122   0.0  
ref|NP_001030858.1| glycoside hydrolase family 2 protein [Arabid...  1118   0.0  
ref|NP_001190087.1| glycoside hydrolase family 2 protein [Arabid...  1118   0.0  
ref|NP_680128.1| glycoside hydrolase family 2 protein [Arabidops...  1118   0.0  
ref|XP_003542824.2| PREDICTED: beta-galactosidase-like [Glycine ...  1117   0.0  
ref|XP_006293102.1| hypothetical protein CARUB_v10019396mg [Caps...  1117   0.0  
ref|XP_002877978.1| hydrolase, hydrolyzing O-glycosyl compounds ...  1117   0.0  
ref|XP_007133761.1| hypothetical protein PHAVU_011G206800g [Phas...  1112   0.0  
ref|XP_006403576.1| hypothetical protein EUTSA_v10010080mg [Eutr...  1112   0.0  

>ref|XP_004308587.1| PREDICTED: beta-galactosidase-like [Fragaria vesca subsp. vesca]
          Length = 1113

 Score = 1176 bits (3043), Expect = 0.0
 Identities = 536/716 (74%), Positives = 615/716 (85%), Gaps = 4/716 (0%)
 Frame = +3

Query: 3    FDRPIYTNVVYPFPLDPPNVPVENPTGCYRTYFEIPEEWKNRRIFLHFEAVDSAFYAWVN 182
            FDRPIYTNVVYPFPLDPP VPV+NPTGCYRT F IPEEWK RR+ LHFEAVDSAF AW+N
Sbjct: 135  FDRPIYTNVVYPFPLDPPFVPVDNPTGCYRTDFVIPEEWKGRRVLLHFEAVDSAFCAWIN 194

Query: 183  GIPIGYSQDSRLPAEFEISDFCHPRCSDKKNLLAVQVLRWSDGSYLEDQDHWWLSGIHRD 362
            G+P+GYSQDSRLPAEFEI+D+C+P  SDKKN+LAVQV RWSDGSYLEDQDHWWLSGIHRD
Sbjct: 195  GVPVGYSQDSRLPAEFEITDYCYPCGSDKKNVLAVQVFRWSDGSYLEDQDHWWLSGIHRD 254

Query: 363  VLLLSKPQVFIADHFFKSSLSEDFSSADIQVEVKI----EPVSDCLLTNFSLEAVIYDTA 530
            VLLLSKPQVFI D+FF+S+L+EDFS AD+QVEVKI    E   + ++ NF++EA ++D+ 
Sbjct: 255  VLLLSKPQVFIGDYFFRSNLAEDFSYADLQVEVKIDNSRETSKNTVIDNFTIEAALFDSG 314

Query: 531  SWYERDGKVDLLASSVAHLKLQQVPNPVLGFHGYILGGKLELPKLWSAEQPNLYTLVIIL 710
            SWY   G  DLL+S+VA+LKL   P  +LGF  Y L G+LE P+LWSAEQPNLYTLV+IL
Sbjct: 315  SWYSIGGSADLLSSNVANLKLDLSPGSILGFRDYSLVGRLEAPRLWSAEQPNLYTLVVIL 374

Query: 711  KDASGQQVDCESCQVGFRQISKATKEILVNGHPVIIRGVNRHEHHPRLGKTNVESCMVKD 890
            KD SG  VDCESC VG RQ+S A K++LVNGHP+IIRGVNRHEHHPRLGKTN+ESCM+KD
Sbjct: 375  KDKSGNIVDCESCVVGIRQVSNAPKQLLVNGHPIIIRGVNRHEHHPRLGKTNIESCMIKD 434

Query: 891  LILMKQNNVNAVRNSHYPQHPRWYELCDLFGFYMIDEANIETHGFDYCGYIKHPTLEPSW 1070
            L+LMKQ N+NAVRNSHYPQHPRWYELCD+FG YMIDEANIE HGFDY G++KHPTLEPSW
Sbjct: 435  LVLMKQYNINAVRNSHYPQHPRWYELCDIFGMYMIDEANIEAHGFDYSGHVKHPTLEPSW 494

Query: 1071 ASAMLDRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGWIRGRDPSRIVHYEGGGSR 1250
            A+AMLDRVIGMVERDKNHACIISWSLGNE+GYGPNHSA AGW+RG+DPSR++HYEGGGSR
Sbjct: 495  ATAMLDRVIGMVERDKNHACIISWSLGNESGYGPNHSASAGWVRGKDPSRLLHYEGGGSR 554

Query: 1251 TSSTDIICPMYMRVWDIVKIAKDPNETRPVILCEYSHAMGNSNGNIHEYWEAIDNTFGLQ 1430
            T STDIICPMYMRVWDIVKIAKDPNETRP+ILCEYSHAMGNSNGNIHEYWEAID+TFGLQ
Sbjct: 555  TPSTDIICPMYMRVWDIVKIAKDPNETRPLILCEYSHAMGNSNGNIHEYWEAIDSTFGLQ 614

Query: 1431 GGFIWDWVDQGLLKEGSDGCKHWAYGGDFGDTPNDLNFCLNGLIWPDRTPHPALNEVKYV 1610
            GGFIWDWVDQGLLK+ +DG KHWAYGGDFGD PNDLNFCLNGL+WPDRTPHPA++EVKYV
Sbjct: 615  GGFIWDWVDQGLLKDSADGTKHWAYGGDFGDVPNDLNFCLNGLVWPDRTPHPAMHEVKYV 674

Query: 1611 YQPIKVLSKDGALKILNSHFFDTTKELEFSWAVHGDXXXXXXXXXXXXVIKPQSSYDIEF 1790
            YQPIKV   +G LK+ N+HF++TT+ LEF WA HGD            +I+PQ +Y IE 
Sbjct: 675  YQPIKVSFSEGTLKVTNTHFYETTRALEFYWAAHGDGCELGSGNLSLPLIEPQKTYHIES 734

Query: 1791 VSAPWYSLWASSPTTEVFLTITVKLSNSTRWAVAGHILASTQLQLPTKQECGPHVIKATD 1970
             SAPW++LWASS   E FLTIT KL +ST W  AGH+++STQ+QLP K+E  PHVIK  D
Sbjct: 735  QSAPWHTLWASSSAEEFFLTITAKLLHSTCWVEAGHVISSTQVQLPVKREFVPHVIKTKD 794

Query: 1971 GPVLGECLGDTIRVSKQDSWEIKINSKTGTIESWKVEGVSVMNKGIFPCFWRAPTD 2138
               L E +GDT++VS+Q++WEI +N K GT+ESWKVEGV +M KGIFPCFWRAPTD
Sbjct: 795  ATFLREIVGDTLKVSQQNAWEIILNVKMGTVESWKVEGVPLMTKGIFPCFWRAPTD 850


>ref|XP_007010995.1| Glycoside hydrolase family 2 protein isoform 1 [Theobroma cacao]
            gi|508727908|gb|EOY19805.1| Glycoside hydrolase family 2
            protein isoform 1 [Theobroma cacao]
          Length = 1114

 Score = 1165 bits (3015), Expect = 0.0
 Identities = 537/716 (75%), Positives = 611/716 (85%), Gaps = 4/716 (0%)
 Frame = +3

Query: 3    FDRPIYTNVVYPFPLDPPNVPVENPTGCYRTYFEIPEEWKNRRIFLHFEAVDSAFYAWVN 182
            FDRPIYTNVVYP PLDPP+VP++NPTGCYRTYF IPE+W+ RRI LHFEAVDSAF AW+N
Sbjct: 134  FDRPIYTNVVYPIPLDPPHVPIDNPTGCYRTYFHIPEQWQGRRILLHFEAVDSAFCAWIN 193

Query: 183  GIPIGYSQDSRLPAEFEISDFCHPRCSDKKNLLAVQVLRWSDGSYLEDQDHWWLSGIHRD 362
            GIP+GYSQDSRLPAEFEI+++C+   SDKKN+LAVQV RWSDGSYLEDQDHWWLSGIHRD
Sbjct: 194  GIPVGYSQDSRLPAEFEITEYCYSCDSDKKNVLAVQVFRWSDGSYLEDQDHWWLSGIHRD 253

Query: 363  VLLLSKPQVFIADHFFKSSLSEDFSSADIQVEVKI----EPVSDCLLTNFSLEAVIYDTA 530
            VLLLSKPQVFIAD+FFKSSL+ +FS ADIQVEVKI    E   D +LT+F++EA ++D  
Sbjct: 254  VLLLSKPQVFIADYFFKSSLAYNFSYADIQVEVKIDCSREMSKDKVLTDFTIEAALFDAG 313

Query: 531  SWYERDGKVDLLASSVAHLKLQQVPNPVLGFHGYILGGKLELPKLWSAEQPNLYTLVIIL 710
             WY  DG VDLL+S+VA++ L+ VP   LGFHGY+L GKLE PKLWSAEQPNLYTLVIIL
Sbjct: 314  VWYNHDGNVDLLSSNVANIVLKTVPTGTLGFHGYVLVGKLEKPKLWSAEQPNLYTLVIIL 373

Query: 711  KDASGQQVDCESCQVGFRQISKATKEILVNGHPVIIRGVNRHEHHPRLGKTNVESCMVKD 890
            KDASG  VDCESC VG RQ+SKA K++LVNGHPV+IRGVNRHEHHPRLGKTN+ESCMVKD
Sbjct: 374  KDASGNVVDCESCLVGVRQVSKAPKQLLVNGHPVVIRGVNRHEHHPRLGKTNIESCMVKD 433

Query: 891  LILMKQNNVNAVRNSHYPQHPRWYELCDLFGFYMIDEANIETHGFDYCGYIKHPTLEPSW 1070
            L++MKQNN+NAVRNSHYPQHPRWYELCDLFG YMIDEANIETHGFD  G++KH T EP W
Sbjct: 434  LVVMKQNNINAVRNSHYPQHPRWYELCDLFGIYMIDEANIETHGFDLSGHVKHLTQEPGW 493

Query: 1071 ASAMLDRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGWIRGRDPSRIVHYEGGGSR 1250
            A+AM+DRVIGMVERDKNHACI SWSLGNE+GYGPNHSA AGWIRGRDPSR+VHYEGGGSR
Sbjct: 494  AAAMMDRVIGMVERDKNHACIFSWSLGNESGYGPNHSASAGWIRGRDPSRLVHYEGGGSR 553

Query: 1251 TSSTDIICPMYMRVWDIVKIAKDPNETRPVILCEYSHAMGNSNGNIHEYWEAIDNTFGLQ 1430
            TSSTDIICPMYMRVWDIVKIAKDPNETRP+ILCEYSHAMGNSNGNIHEYWEAIDN FGLQ
Sbjct: 554  TSSTDIICPMYMRVWDIVKIAKDPNETRPLILCEYSHAMGNSNGNIHEYWEAIDNIFGLQ 613

Query: 1431 GGFIWDWVDQGLLKEGSDGCKHWAYGGDFGDTPNDLNFCLNGLIWPDRTPHPALNEVKYV 1610
            GGFIWDWVDQGLLK+  DG K+WAYGGDFGD+PNDLNFCLNGL WPDRTPHPAL EVKYV
Sbjct: 614  GGFIWDWVDQGLLKDNEDGSKYWAYGGDFGDSPNDLNFCLNGLTWPDRTPHPALQEVKYV 673

Query: 1611 YQPIKVLSKDGALKILNSHFFDTTKELEFSWAVHGDXXXXXXXXXXXXVIKPQSSYDIEF 1790
            YQPIKV   +  +KI N++F++TT+ +E  WA  GD            VI+PQSSYDIE+
Sbjct: 674  YQPIKVSIGESMIKIKNTNFYETTEGVELKWAARGDGCELGCGILSLPVIEPQSSYDIEW 733

Query: 1791 VSAPWYSLWASSPTTEVFLTITVKLSNSTRWAVAGHILASTQLQLPTKQECGPHVIKATD 1970
             S PWY LWASS   E+FLTIT KL +S RW  AGH+++STQ+QL  K++  PH+IK  D
Sbjct: 734  KSGPWYPLWASSDAEEIFLTITAKLLHSKRWVDAGHVVSSTQVQLLAKRDIVPHIIKTKD 793

Query: 1971 GPVLGECLGDTIRVSKQDSWEIKINSKTGTIESWKVEGVSVMNKGIFPCFWRAPTD 2138
              +  E LGD IR+S+Q  WEI +N KTG+++SWKV+GVS++  GI PCFWRAPTD
Sbjct: 794  DVLSTEILGDNIRISQQKLWEITLNVKTGSLDSWKVQGVSILKNGIIPCFWRAPTD 849


>ref|XP_006487669.1| PREDICTED: beta-galactosidase-like [Citrus sinensis]
          Length = 1115

 Score = 1163 bits (3009), Expect = 0.0
 Identities = 538/716 (75%), Positives = 617/716 (86%), Gaps = 4/716 (0%)
 Frame = +3

Query: 3    FDRPIYTNVVYPFPLDPPNVPVENPTGCYRTYFEIPEEWKNRRIFLHFEAVDSAFYAWVN 182
            FDRPIYTNVVYPFPLDPPNVP ENPTGCYRTYF IP+EW+ RRI LHFEAVDSAF AW+N
Sbjct: 135  FDRPIYTNVVYPFPLDPPNVPAENPTGCYRTYFHIPKEWQGRRILLHFEAVDSAFCAWIN 194

Query: 183  GIPIGYSQDSRLPAEFEISDFCHPRCSDKKNLLAVQVLRWSDGSYLEDQDHWWLSGIHRD 362
            G+P+GYSQDSRLPAEFEISD+C+P  SDKKN+LAVQV RWSDGSYLEDQDHWWLSGIHRD
Sbjct: 195  GVPVGYSQDSRLPAEFEISDYCYPHGSDKKNVLAVQVFRWSDGSYLEDQDHWWLSGIHRD 254

Query: 363  VLLLSKPQVFIADHFFKSSLSEDFSSADIQVEVKI----EPVSDCLLTNFSLEAVIYDTA 530
            VLLL+KPQVFIAD+FFKS+L+EDFS ADIQVEV+I    E   D +L NF +EA +YDT 
Sbjct: 255  VLLLAKPQVFIADYFFKSNLAEDFSLADIQVEVEIDCSPEISKDSILANFVIEAGLYDTG 314

Query: 531  SWYERDGKVDLLASSVAHLKLQQVPNPVLGFHGYILGGKLELPKLWSAEQPNLYTLVIIL 710
            SWY  DG +DLL+S VA+++L      V  F GY+L GKLE+P+LWSAEQPNLYTLV+IL
Sbjct: 315  SWYNCDGCIDLLSSKVANIQLNPSTASV-EFPGYMLVGKLEMPRLWSAEQPNLYTLVVIL 373

Query: 711  KDASGQQVDCESCQVGFRQISKATKEILVNGHPVIIRGVNRHEHHPRLGKTNVESCMVKD 890
            K ASG  VDCESC VG RQ+SKA K++LVNG+PV+IRGVNRHEHHPR+GKTN+ESCMVKD
Sbjct: 374  KHASGPVVDCESCLVGIRQVSKAPKQLLVNGNPVVIRGVNRHEHHPRVGKTNIESCMVKD 433

Query: 891  LILMKQNNVNAVRNSHYPQHPRWYELCDLFGFYMIDEANIETHGFDYCGYIKHPTLEPSW 1070
            L+LMKQNN+NAVRNSHYPQHPRWYELCDLFG YMIDEANIETHGF +  ++KHPT+EPSW
Sbjct: 434  LVLMKQNNINAVRNSHYPQHPRWYELCDLFGLYMIDEANIETHGFYFSEHLKHPTMEPSW 493

Query: 1071 ASAMLDRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGWIRGRDPSRIVHYEGGGSR 1250
            A+AM+DRVIGMVERDKNHA II WSLGNEAG+GPNHSA AGWIRG+DPSR++HYEGGGSR
Sbjct: 494  AAAMMDRVIGMVERDKNHASIICWSLGNEAGHGPNHSAAAGWIRGKDPSRLLHYEGGGSR 553

Query: 1251 TSSTDIICPMYMRVWDIVKIAKDPNETRPVILCEYSHAMGNSNGNIHEYWEAIDNTFGLQ 1430
            T STDI+CPMYMRVWDIV IAKDP ETRP+ILCEYSHAMGNSNGNIHEYWEAID+TFGLQ
Sbjct: 554  TPSTDIVCPMYMRVWDIVMIAKDPTETRPLILCEYSHAMGNSNGNIHEYWEAIDSTFGLQ 613

Query: 1431 GGFIWDWVDQGLLKEGSDGCKHWAYGGDFGDTPNDLNFCLNGLIWPDRTPHPALNEVKYV 1610
            GGFIWDWVDQGLL+E +DG KHWAYGGDFGDTPNDLNFCLNGL+WPDRTPHPAL+EVKYV
Sbjct: 614  GGFIWDWVDQGLLRELADGTKHWAYGGDFGDTPNDLNFCLNGLLWPDRTPHPALHEVKYV 673

Query: 1611 YQPIKVLSKDGALKILNSHFFDTTKELEFSWAVHGDXXXXXXXXXXXXVIKPQSSYDIEF 1790
            YQ IKV  K G LKI N++FF+TT+ LEFSW  HGD            +IKP S+Y+IE 
Sbjct: 674  YQAIKVSLKKGTLKISNTNFFETTQGLEFSWVAHGDGYKLGFGILSLPLIKPHSNYEIEL 733

Query: 1791 VSAPWYSLWASSPTTEVFLTITVKLSNSTRWAVAGHILASTQLQLPTKQECGPHVIKATD 1970
             S+PWYSLW S    E+FLT+T KL NSTRWA AGH++++ Q+QLP+K+E  PHVI+  D
Sbjct: 734  KSSPWYSLWNSCSAEEIFLTVTAKLMNSTRWAEAGHVISTAQVQLPSKRERLPHVIRTGD 793

Query: 1971 GPVLGECLGDTIRVSKQDSWEIKINSKTGTIESWKVEGVSVMNKGIFPCFWRAPTD 2138
              +L E LG+TI++S Q+SW+IK + +TG +ESWKVEGVSVM +GIFPCFWRAPTD
Sbjct: 794  AIILQENLGNTIQLSHQNSWQIKFDIQTGAVESWKVEGVSVMKRGIFPCFWRAPTD 849


>ref|XP_007010996.1| Glycoside hydrolase family 2 protein isoform 2 [Theobroma cacao]
            gi|508727909|gb|EOY19806.1| Glycoside hydrolase family 2
            protein isoform 2 [Theobroma cacao]
          Length = 1112

 Score = 1157 bits (2993), Expect = 0.0
 Identities = 535/716 (74%), Positives = 609/716 (85%), Gaps = 4/716 (0%)
 Frame = +3

Query: 3    FDRPIYTNVVYPFPLDPPNVPVENPTGCYRTYFEIPEEWKNRRIFLHFEAVDSAFYAWVN 182
            FDRPIYTNVVYP PLDPP+VP++NPTGCYRTYF IPE+W+ RRI LHFEAVDSAF AW+N
Sbjct: 134  FDRPIYTNVVYPIPLDPPHVPIDNPTGCYRTYFHIPEQWQGRRILLHFEAVDSAFCAWIN 193

Query: 183  GIPIGYSQDSRLPAEFEISDFCHPRCSDKKNLLAVQVLRWSDGSYLEDQDHWWLSGIHRD 362
            GIP+GYSQDSRLPAEFEI+++C+   SDKKN+LAVQV RWSDGSYLEDQDHWWLSGIHRD
Sbjct: 194  GIPVGYSQDSRLPAEFEITEYCYSCDSDKKNVLAVQVFRWSDGSYLEDQDHWWLSGIHRD 253

Query: 363  VLLLSKPQVFIADHFFKSSLSEDFSSADIQVEVKI----EPVSDCLLTNFSLEAVIYDTA 530
            VLLLSKPQVFIAD+FFKSSL+ +FS ADIQVEVKI    E   D +LT+F++EA ++D  
Sbjct: 254  VLLLSKPQVFIADYFFKSSLAYNFSYADIQVEVKIDCSREMSKDKVLTDFTIEAALFDAG 313

Query: 531  SWYERDGKVDLLASSVAHLKLQQVPNPVLGFHGYILGGKLELPKLWSAEQPNLYTLVIIL 710
             WY  DG VDLL+S+VA++ L+ VP   LGFHGY+L GKLE PKLWSAEQPNLYTLVIIL
Sbjct: 314  VWYNHDGNVDLLSSNVANIVLKTVPTGTLGFHGYVLVGKLEKPKLWSAEQPNLYTLVIIL 373

Query: 711  KDASGQQVDCESCQVGFRQISKATKEILVNGHPVIIRGVNRHEHHPRLGKTNVESCMVKD 890
            KDASG  VDCESC VG RQ+SKA K++LVNGHPV+IRGVNRHEHHPRLGKTN+ESCM  D
Sbjct: 374  KDASGNVVDCESCLVGVRQVSKAPKQLLVNGHPVVIRGVNRHEHHPRLGKTNIESCM--D 431

Query: 891  LILMKQNNVNAVRNSHYPQHPRWYELCDLFGFYMIDEANIETHGFDYCGYIKHPTLEPSW 1070
            L++MKQNN+NAVRNSHYPQHPRWYELCDLFG YMIDEANIETHGFD  G++KH T EP W
Sbjct: 432  LVVMKQNNINAVRNSHYPQHPRWYELCDLFGIYMIDEANIETHGFDLSGHVKHLTQEPGW 491

Query: 1071 ASAMLDRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGWIRGRDPSRIVHYEGGGSR 1250
            A+AM+DRVIGMVERDKNHACI SWSLGNE+GYGPNHSA AGWIRGRDPSR+VHYEGGGSR
Sbjct: 492  AAAMMDRVIGMVERDKNHACIFSWSLGNESGYGPNHSASAGWIRGRDPSRLVHYEGGGSR 551

Query: 1251 TSSTDIICPMYMRVWDIVKIAKDPNETRPVILCEYSHAMGNSNGNIHEYWEAIDNTFGLQ 1430
            TSSTDIICPMYMRVWDIVKIAKDPNETRP+ILCEYSHAMGNSNGNIHEYWEAIDN FGLQ
Sbjct: 552  TSSTDIICPMYMRVWDIVKIAKDPNETRPLILCEYSHAMGNSNGNIHEYWEAIDNIFGLQ 611

Query: 1431 GGFIWDWVDQGLLKEGSDGCKHWAYGGDFGDTPNDLNFCLNGLIWPDRTPHPALNEVKYV 1610
            GGFIWDWVDQGLLK+  DG K+WAYGGDFGD+PNDLNFCLNGL WPDRTPHPAL EVKYV
Sbjct: 612  GGFIWDWVDQGLLKDNEDGSKYWAYGGDFGDSPNDLNFCLNGLTWPDRTPHPALQEVKYV 671

Query: 1611 YQPIKVLSKDGALKILNSHFFDTTKELEFSWAVHGDXXXXXXXXXXXXVIKPQSSYDIEF 1790
            YQPIKV   +  +KI N++F++TT+ +E  WA  GD            VI+PQSSYDIE+
Sbjct: 672  YQPIKVSIGESMIKIKNTNFYETTEGVELKWAARGDGCELGCGILSLPVIEPQSSYDIEW 731

Query: 1791 VSAPWYSLWASSPTTEVFLTITVKLSNSTRWAVAGHILASTQLQLPTKQECGPHVIKATD 1970
             S PWY LWASS   E+FLTIT KL +S RW  AGH+++STQ+QL  K++  PH+IK  D
Sbjct: 732  KSGPWYPLWASSDAEEIFLTITAKLLHSKRWVDAGHVVSSTQVQLLAKRDIVPHIIKTKD 791

Query: 1971 GPVLGECLGDTIRVSKQDSWEIKINSKTGTIESWKVEGVSVMNKGIFPCFWRAPTD 2138
              +  E LGD IR+S+Q  WEI +N KTG+++SWKV+GVS++  GI PCFWRAPTD
Sbjct: 792  DVLSTEILGDNIRISQQKLWEITLNVKTGSLDSWKVQGVSILKNGIIPCFWRAPTD 847


>ref|XP_007218904.1| hypothetical protein PRUPE_ppa000532mg [Prunus persica]
            gi|462415366|gb|EMJ20103.1| hypothetical protein
            PRUPE_ppa000532mg [Prunus persica]
          Length = 1111

 Score = 1157 bits (2992), Expect = 0.0
 Identities = 533/716 (74%), Positives = 606/716 (84%), Gaps = 4/716 (0%)
 Frame = +3

Query: 3    FDRPIYTNVVYPFPLDPPNVPVENPTGCYRTYFEIPEEWKNRRIFLHFEAVDSAFYAWVN 182
            FDRPIYTNVVYPFPLDPP VPV+NPTGCYRTYF IP+EWK RRI LHFEAVDSAF AW+N
Sbjct: 134  FDRPIYTNVVYPFPLDPPFVPVDNPTGCYRTYFHIPKEWKGRRILLHFEAVDSAFCAWLN 193

Query: 183  GIPIGYSQDSRLPAEFEISDFCHPRCSDKKNLLAVQVLRWSDGSYLEDQDHWWLSGIHRD 362
            G+PIGYSQDSRLPAEFEI+D+C+P   DKKN+LAVQV RWSDGSYLEDQDHWWLSGIHRD
Sbjct: 194  GVPIGYSQDSRLPAEFEITDYCYPSDMDKKNVLAVQVFRWSDGSYLEDQDHWWLSGIHRD 253

Query: 363  VLLLSKPQVFIADHFFKSSLSEDFSSADIQVEVKI----EPVSDCLLTNFSLEAVIYDTA 530
            VLLLSKPQVFIAD+FFKS+L+EDFS ADIQVEVKI    E   D +L N+ +EA ++DTA
Sbjct: 254  VLLLSKPQVFIADYFFKSTLAEDFSYADIQVEVKIDNSRETSKDSVLANYVIEAALFDTA 313

Query: 531  SWYERDGKVDLLASSVAHLKLQQVPNPVLGFHGYILGGKLELPKLWSAEQPNLYTLVIIL 710
             WY  D   DL  S+VA +KL    +  LGFHGY+L G+L++P+LWSAEQP+LYTL + L
Sbjct: 314  CWYSIDRYADLHLSNVASIKLNLSSSTSLGFHGYLLVGRLDMPRLWSAEQPSLYTLAVTL 373

Query: 711  KDASGQQVDCESCQVGFRQISKATKEILVNGHPVIIRGVNRHEHHPRLGKTNVESCMVKD 890
            KDASG  +DCES  VG RQ+SKA K++LVNGHP+IIRGVNRHEHHPRLGKTN+ESCMVKD
Sbjct: 374  KDASGNLLDCESSLVGIRQVSKAPKQLLVNGHPIIIRGVNRHEHHPRLGKTNIESCMVKD 433

Query: 891  LILMKQNNVNAVRNSHYPQHPRWYELCDLFGFYMIDEANIETHGFDYCGYIKHPTLEPSW 1070
            L+LMKQ N+NAVRNSHYPQHPRWYELCDLFG YMIDEANIETHGFD  G++KHPTLEPSW
Sbjct: 434  LVLMKQYNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHVKHPTLEPSW 493

Query: 1071 ASAMLDRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGWIRGRDPSRIVHYEGGGSR 1250
            A+AM+DRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGW+RG+DPSR+VHYEGGGSR
Sbjct: 494  ATAMMDRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGWVRGKDPSRLVHYEGGGSR 553

Query: 1251 TSSTDIICPMYMRVWDIVKIAKDPNETRPVILCEYSHAMGNSNGNIHEYWEAIDNTFGLQ 1430
            TSSTDIICPMYMRVWD+++I++DPNETRP+ILCEYSHAMGNSNGN+HEYWE ID+TFGLQ
Sbjct: 554  TSSTDIICPMYMRVWDMLQISRDPNETRPLILCEYSHAMGNSNGNLHEYWEVIDSTFGLQ 613

Query: 1431 GGFIWDWVDQGLLKEGSDGCKHWAYGGDFGDTPNDLNFCLNGLIWPDRTPHPALNEVKYV 1610
            GGFIWDWVDQ LLK+ +DG KHWAYGGDFGD PNDLNFCLNGL WPDRTPHPAL+EVKYV
Sbjct: 614  GGFIWDWVDQALLKDNADGSKHWAYGGDFGDVPNDLNFCLNGLTWPDRTPHPALHEVKYV 673

Query: 1611 YQPIKVLSKDGALKILNSHFFDTTKELEFSWAVHGDXXXXXXXXXXXXVIKPQSSYDIEF 1790
            YQPIKV      L+I N+HF+ TT+ LEFSW VHGD            +I+PQ SYDI++
Sbjct: 674  YQPIKVSFSKETLRITNTHFYKTTQGLEFSWDVHGDGCKLGSGILPFPLIEPQKSYDIKW 733

Query: 1791 VSAPWYSLWASSPTTEVFLTITVKLSNSTRWAVAGHILASTQLQLPTKQECGPHVIKATD 1970
             SA WY LW SS   E FLTIT KL  STRW  AGH+++STQ+QLP+K+E  PHVIK  D
Sbjct: 734  RSALWYPLWTSSSAEEYFLTITAKLLRSTRWVEAGHVISSTQVQLPSKREIVPHVIKTED 793

Query: 1971 GPVLGECLGDTIRVSKQDSWEIKINSKTGTIESWKVEGVSVMNKGIFPCFWRAPTD 2138
               + E LGD IRVS+   WEI  + +TGT++SW VEGV +M KGIFPCFWRAPTD
Sbjct: 794  AVFVSETLGDKIRVSRHSFWEIIFSVQTGTVDSWTVEGVPLMTKGIFPCFWRAPTD 849


>ref|XP_003634896.1| PREDICTED: beta-galactosidase [Vitis vinifera]
          Length = 1127

 Score = 1148 bits (2969), Expect = 0.0
 Identities = 533/717 (74%), Positives = 606/717 (84%), Gaps = 5/717 (0%)
 Frame = +3

Query: 3    FDRPIYTNVVYPFPLDPPNVPVENPTGCYRTYFEIPEEWKNRRIFLHFEAVDSAFYAWVN 182
            FDRPIYTN+VYPFPLDPP+VP ENPTGCYRT F IP EWK RRI LHFEAVDSAF+AW+N
Sbjct: 146  FDRPIYTNIVYPFPLDPPHVPTENPTGCYRTVFHIPHEWKGRRILLHFEAVDSAFFAWIN 205

Query: 183  GIPIGYSQDSRLPAEFEISDFCHPRCSDKKNLLAVQVLRWSDGSYLEDQDHWWLSGIHRD 362
            G+P+GYSQDSRLPAEFEI+D+CHP  S+KKN+LAVQV RWSDGSYLEDQD WWLSGIHRD
Sbjct: 206  GVPVGYSQDSRLPAEFEITDYCHPCGSNKKNVLAVQVFRWSDGSYLEDQDQWWLSGIHRD 265

Query: 363  VLLLSKPQVFIADHFFKSSLSEDFSSADIQVEVKI----EPVSDCLLTNFSLEAVIYDTA 530
            VLLL+KPQV+I D+FFKS+L E+FS ADIQVEVKI    E   D +L  FS+EA ++D+A
Sbjct: 266  VLLLAKPQVYIEDYFFKSNLGENFSYADIQVEVKIDNSLETSKDSILNKFSIEAELFDSA 325

Query: 531  SWYERDGKVDLLASSVAHLKLQQVPNP-VLGFHGYILGGKLELPKLWSAEQPNLYTLVII 707
             W++ D   DL +SSVAH++L    +  + GF GY+L GKLE PKLWSAEQP LYTLV+I
Sbjct: 326  KWHDSDEYCDLHSSSVAHMELDPSSSTAIFGFLGYVLVGKLESPKLWSAEQPYLYTLVVI 385

Query: 708  LKDASGQQVDCESCQVGFRQISKATKEILVNGHPVIIRGVNRHEHHPRLGKTNVESCMVK 887
            LKD  G+ VDCESCQVG RQ+SKA K++LVNGHPVI+RGVNRHEHHPRLGKTN+ESCMVK
Sbjct: 386  LKDEFGKVVDCESCQVGIRQVSKAPKQLLVNGHPVILRGVNRHEHHPRLGKTNMESCMVK 445

Query: 888  DLILMKQNNVNAVRNSHYPQHPRWYELCDLFGFYMIDEANIETHGFDYCGYIKHPTLEPS 1067
            DL+LMKQNN+NAVRNSHYPQHPRWYELCDLFG YMIDEANIETHGF    ++K+PTLE S
Sbjct: 446  DLVLMKQNNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFYDSQHLKNPTLESS 505

Query: 1068 WASAMLDRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGWIRGRDPSRIVHYEGGGS 1247
            WAS+M+DRVI MVERDKNHACIISWSLGNE+GYGPNHSALAGWIRGRD SR++HYEGGG+
Sbjct: 506  WASSMMDRVISMVERDKNHACIISWSLGNESGYGPNHSALAGWIRGRDSSRLLHYEGGGA 565

Query: 1248 RTSSTDIICPMYMRVWDIVKIAKDPNETRPVILCEYSHAMGNSNGNIHEYWEAIDNTFGL 1427
            RT STDI+CPMYMRVWDIVKIAKDP E RP+ILCEYSH+MGNSNGNI EYWEAIDNTFGL
Sbjct: 566  RTPSTDIVCPMYMRVWDIVKIAKDPTEMRPLILCEYSHSMGNSNGNIQEYWEAIDNTFGL 625

Query: 1428 QGGFIWDWVDQGLLKEGSDGCKHWAYGGDFGDTPNDLNFCLNGLIWPDRTPHPALNEVKY 1607
            QGGFIWDWVDQGLLK G+DG KHWAYGGDFGD PNDLNFCLNG+ WPDRT HPA++EVKY
Sbjct: 626  QGGFIWDWVDQGLLKVGADGAKHWAYGGDFGDIPNDLNFCLNGITWPDRTLHPAVHEVKY 685

Query: 1608 VYQPIKVLSKDGALKILNSHFFDTTKELEFSWAVHGDXXXXXXXXXXXXVIKPQSSYDIE 1787
            VYQPIK+   +  LKI N+HF++TTK +EFSW V GD            +I+PQSSY IE
Sbjct: 686  VYQPIKISLSESTLKITNTHFYETTKAMEFSWTVCGDGCKLGSGTLSLPIIEPQSSYSIE 745

Query: 1788 FVSAPWYSLWASSPTTEVFLTITVKLSNSTRWAVAGHILASTQLQLPTKQECGPHVIKAT 1967
            F S PWYSLWASS   E FLTIT KL   TRW  AGH+++STQ+ LP K+E  PHVIK  
Sbjct: 746  FESGPWYSLWASSSAEEHFLTITAKLLQPTRWVEAGHVISSTQILLPAKREFVPHVIKNK 805

Query: 1968 DGPVLGECLGDTIRVSKQDSWEIKINSKTGTIESWKVEGVSVMNKGIFPCFWRAPTD 2138
            D PV GE LG+TIR  +Q+ WEI+ N++TGTIESWKV GV+VMNKGIFPCFWRAPTD
Sbjct: 806  DAPVPGEILGNTIRFYQQNVWEIQFNAQTGTIESWKVGGVTVMNKGIFPCFWRAPTD 862


>ref|XP_002266400.1| PREDICTED: beta-galactosidase isoform 1 [Vitis vinifera]
            gi|296090332|emb|CBI40151.3| unnamed protein product
            [Vitis vinifera]
          Length = 1114

 Score = 1148 bits (2969), Expect = 0.0
 Identities = 533/717 (74%), Positives = 606/717 (84%), Gaps = 5/717 (0%)
 Frame = +3

Query: 3    FDRPIYTNVVYPFPLDPPNVPVENPTGCYRTYFEIPEEWKNRRIFLHFEAVDSAFYAWVN 182
            FDRPIYTN+VYPFPLDPP+VP ENPTGCYRT F IP EWK RRI LHFEAVDSAF+AW+N
Sbjct: 133  FDRPIYTNIVYPFPLDPPHVPTENPTGCYRTVFHIPHEWKGRRILLHFEAVDSAFFAWIN 192

Query: 183  GIPIGYSQDSRLPAEFEISDFCHPRCSDKKNLLAVQVLRWSDGSYLEDQDHWWLSGIHRD 362
            G+P+GYSQDSRLPAEFEI+D+CHP  S+KKN+LAVQV RWSDGSYLEDQD WWLSGIHRD
Sbjct: 193  GVPVGYSQDSRLPAEFEITDYCHPCGSNKKNVLAVQVFRWSDGSYLEDQDQWWLSGIHRD 252

Query: 363  VLLLSKPQVFIADHFFKSSLSEDFSSADIQVEVKI----EPVSDCLLTNFSLEAVIYDTA 530
            VLLL+KPQV+I D+FFKS+L E+FS ADIQVEVKI    E   D +L  FS+EA ++D+A
Sbjct: 253  VLLLAKPQVYIEDYFFKSNLGENFSYADIQVEVKIDNSLETSKDSILNKFSIEAELFDSA 312

Query: 531  SWYERDGKVDLLASSVAHLKLQQVPNP-VLGFHGYILGGKLELPKLWSAEQPNLYTLVII 707
             W++ D   DL +SSVAH++L    +  + GF GY+L GKLE PKLWSAEQP LYTLV+I
Sbjct: 313  KWHDSDEYCDLHSSSVAHMELDPSSSTAIFGFLGYVLVGKLESPKLWSAEQPYLYTLVVI 372

Query: 708  LKDASGQQVDCESCQVGFRQISKATKEILVNGHPVIIRGVNRHEHHPRLGKTNVESCMVK 887
            LKD  G+ VDCESCQVG RQ+SKA K++LVNGHPVI+RGVNRHEHHPRLGKTN+ESCMVK
Sbjct: 373  LKDEFGKVVDCESCQVGIRQVSKAPKQLLVNGHPVILRGVNRHEHHPRLGKTNMESCMVK 432

Query: 888  DLILMKQNNVNAVRNSHYPQHPRWYELCDLFGFYMIDEANIETHGFDYCGYIKHPTLEPS 1067
            DL+LMKQNN+NAVRNSHYPQHPRWYELCDLFG YMIDEANIETHGF    ++K+PTLE S
Sbjct: 433  DLVLMKQNNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFYDSQHLKNPTLESS 492

Query: 1068 WASAMLDRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGWIRGRDPSRIVHYEGGGS 1247
            WAS+M+DRVI MVERDKNHACIISWSLGNE+GYGPNHSALAGWIRGRD SR++HYEGGG+
Sbjct: 493  WASSMMDRVISMVERDKNHACIISWSLGNESGYGPNHSALAGWIRGRDSSRLLHYEGGGA 552

Query: 1248 RTSSTDIICPMYMRVWDIVKIAKDPNETRPVILCEYSHAMGNSNGNIHEYWEAIDNTFGL 1427
            RT STDI+CPMYMRVWDIVKIAKDP E RP+ILCEYSH+MGNSNGNI EYWEAIDNTFGL
Sbjct: 553  RTPSTDIVCPMYMRVWDIVKIAKDPTEMRPLILCEYSHSMGNSNGNIQEYWEAIDNTFGL 612

Query: 1428 QGGFIWDWVDQGLLKEGSDGCKHWAYGGDFGDTPNDLNFCLNGLIWPDRTPHPALNEVKY 1607
            QGGFIWDWVDQGLLK G+DG KHWAYGGDFGD PNDLNFCLNG+ WPDRT HPA++EVKY
Sbjct: 613  QGGFIWDWVDQGLLKVGADGAKHWAYGGDFGDIPNDLNFCLNGITWPDRTLHPAVHEVKY 672

Query: 1608 VYQPIKVLSKDGALKILNSHFFDTTKELEFSWAVHGDXXXXXXXXXXXXVIKPQSSYDIE 1787
            VYQPIK+   +  LKI N+HF++TTK +EFSW V GD            +I+PQSSY IE
Sbjct: 673  VYQPIKISLSESTLKITNTHFYETTKAMEFSWTVCGDGCKLGSGTLSLPIIEPQSSYSIE 732

Query: 1788 FVSAPWYSLWASSPTTEVFLTITVKLSNSTRWAVAGHILASTQLQLPTKQECGPHVIKAT 1967
            F S PWYSLWASS   E FLTIT KL   TRW  AGH+++STQ+ LP K+E  PHVIK  
Sbjct: 733  FESGPWYSLWASSSAEEHFLTITAKLLQPTRWVEAGHVISSTQILLPAKREFVPHVIKNK 792

Query: 1968 DGPVLGECLGDTIRVSKQDSWEIKINSKTGTIESWKVEGVSVMNKGIFPCFWRAPTD 2138
            D PV GE LG+TIR  +Q+ WEI+ N++TGTIESWKV GV+VMNKGIFPCFWRAPTD
Sbjct: 793  DAPVPGEILGNTIRFYQQNVWEIQFNAQTGTIESWKVGGVTVMNKGIFPCFWRAPTD 849


>ref|XP_002513059.1| beta-galactosidase, putative [Ricinus communis]
            gi|223548070|gb|EEF49562.1| beta-galactosidase, putative
            [Ricinus communis]
          Length = 1110

 Score = 1145 bits (2962), Expect = 0.0
 Identities = 530/716 (74%), Positives = 607/716 (84%), Gaps = 4/716 (0%)
 Frame = +3

Query: 3    FDRPIYTNVVYPFPLDPPNVPVENPTGCYRTYFEIPEEWKNRRIFLHFEAVDSAFYAWVN 182
            FDRPIYTNVVYPFPLDPP VP +NPTGCYRTYF+IP+EW+ RRI LHFEAVDSAF AWVN
Sbjct: 133  FDRPIYTNVVYPFPLDPPYVPEDNPTGCYRTYFQIPKEWQGRRILLHFEAVDSAFCAWVN 192

Query: 183  GIPIGYSQDSRLPAEFEISDFCHPRCSDKKNLLAVQVLRWSDGSYLEDQDHWWLSGIHRD 362
            G+P+GYSQDSRLPAEFEI+++C+   S K N+LAVQV+RWSDGSYLEDQDHWWLSGIHRD
Sbjct: 193  GVPVGYSQDSRLPAEFEITEYCYSCDSGKSNVLAVQVIRWSDGSYLEDQDHWWLSGIHRD 252

Query: 363  VLLLSKPQVFIADHFFKSSLSEDFSSADIQVEVKI----EPVSDCLLTNFSLEAVIYDTA 530
            VLLL+KPQVFI D+FFKS+L+EDF+SA+I+VEVK+    E   D +L NF +EA +YDT 
Sbjct: 253  VLLLAKPQVFIVDYFFKSNLAEDFASAEIEVEVKLDSSQEMPKDKILDNFVIEAALYDTE 312

Query: 531  SWYERDGKVDLLASSVAHLKLQQVPNPVLGFHGYILGGKLELPKLWSAEQPNLYTLVIIL 710
            SWY  DG  +LL+S VA +K+    + +LGF GY+L GK+E PKLWSAEQPNLY LV+ L
Sbjct: 313  SWYNSDGAANLLSSQVADIKINPSFDAILGFLGYVLVGKVEKPKLWSAEQPNLYILVLTL 372

Query: 711  KDASGQQVDCESCQVGFRQISKATKEILVNGHPVIIRGVNRHEHHPRLGKTNVESCMVKD 890
            KDA G  VDCESC VG RQ+SKA K++LVNG PVIIRGVNRHEHHPR+GKTN+ESCM+KD
Sbjct: 373  KDAFGHVVDCESCLVGIRQVSKAPKQLLVNGQPVIIRGVNRHEHHPRIGKTNIESCMIKD 432

Query: 891  LILMKQNNVNAVRNSHYPQHPRWYELCDLFGFYMIDEANIETHGFDYCGYIKHPTLEPSW 1070
            L+LMKQNN+NAVRNSHYPQHPRWYELCDLFG YMIDEANIETHGF   G+IKHPT E SW
Sbjct: 433  LVLMKQNNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFHLSGHIKHPTSEQSW 492

Query: 1071 ASAMLDRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGWIRGRDPSRIVHYEGGGSR 1250
            A AM+DRVIGMVERDKNHACIISWSLGNEA YGPNHSA AGWIRG+D SR+VHYEGGGSR
Sbjct: 493  AIAMIDRVIGMVERDKNHACIISWSLGNEASYGPNHSAAAGWIRGKDTSRLVHYEGGGSR 552

Query: 1251 TSSTDIICPMYMRVWDIVKIAKDPNETRPVILCEYSHAMGNSNGNIHEYWEAIDNTFGLQ 1430
            T STDI+CPMYMRVWDIVKIA DP E RP+ILCEYSHAMGNS+GNI EYWEAID+TFGLQ
Sbjct: 553  TPSTDIVCPMYMRVWDIVKIANDPTELRPLILCEYSHAMGNSSGNICEYWEAIDSTFGLQ 612

Query: 1431 GGFIWDWVDQGLLKEGSDGCKHWAYGGDFGDTPNDLNFCLNGLIWPDRTPHPALNEVKYV 1610
            GGFIWDWVDQGLLKE +DG K+WAYGGDFGDTPNDLNFCLNGL WPDR+PHPAL+EVKYV
Sbjct: 613  GGFIWDWVDQGLLKENTDGSKYWAYGGDFGDTPNDLNFCLNGLTWPDRSPHPALHEVKYV 672

Query: 1611 YQPIKVLSKDGALKILNSHFFDTTKELEFSWAVHGDXXXXXXXXXXXXVIKPQSSYDIEF 1790
            YQPIKV  K   LKI N++FF+TT+ LEFSWA HGD            ++KPQSSYDIE 
Sbjct: 673  YQPIKVSLKGSTLKITNTYFFETTQGLEFSWAAHGDGHQLGSGILSLPLMKPQSSYDIEL 732

Query: 1791 VSAPWYSLWASSPTTEVFLTITVKLSNSTRWAVAGHILASTQLQLPTKQECGPHVIKATD 1970
             S PWY LWAS  + E+FLT+T KL +ST W   GH+++STQ+QLP+++E  PHVIKATD
Sbjct: 733  ESGPWYPLWASY-SGEIFLTVTAKLLHSTPWVETGHVISSTQVQLPSRKEIIPHVIKATD 791

Query: 1971 GPVLGECLGDTIRVSKQDSWEIKINSKTGTIESWKVEGVSVMNKGIFPCFWRAPTD 2138
              +  E LGDT+RVS+Q  WEI +N +TGT+ESWKVEGV++MNKGI PCFWRAPTD
Sbjct: 792  ATLSSEILGDTVRVSQQTFWEITLNIQTGTVESWKVEGVTIMNKGILPCFWRAPTD 847


>ref|XP_002299206.2| glycoside hydrolase family 2 family protein [Populus trichocarpa]
            gi|550346663|gb|EEE84011.2| glycoside hydrolase family 2
            family protein [Populus trichocarpa]
          Length = 1110

 Score = 1144 bits (2958), Expect = 0.0
 Identities = 528/716 (73%), Positives = 610/716 (85%), Gaps = 4/716 (0%)
 Frame = +3

Query: 3    FDRPIYTNVVYPFPLDPPNVPVENPTGCYRTYFEIPEEWKNRRIFLHFEAVDSAFYAWVN 182
            +DRPIYTNV+YPFP+DPP+VP +NPTGCYRTYF+IPEEW+ RRI LHFEAVDSAF AW+N
Sbjct: 133  YDRPIYTNVIYPFPVDPPHVPDDNPTGCYRTYFDIPEEWQGRRILLHFEAVDSAFCAWIN 192

Query: 183  GIPIGYSQDSRLPAEFEISDFCHPRCSDKKNLLAVQVLRWSDGSYLEDQDHWWLSGIHRD 362
            G+P+GYSQDSRLPAEFEI+D+CHP  S KKN+LAVQV RWSDGSYLEDQDHWWLSG+HRD
Sbjct: 193  GVPVGYSQDSRLPAEFEITDYCHPCGSGKKNVLAVQVFRWSDGSYLEDQDHWWLSGVHRD 252

Query: 363  VLLLSKPQVFIADHFFKSSLSEDFSSADIQVEVKIEPV----SDCLLTNFSLEAVIYDTA 530
            VLLLSKPQVFIAD+FFKS+L+E+F+ ADIQVEVKIE       + +L NF++EA +YDT 
Sbjct: 253  VLLLSKPQVFIADYFFKSNLAENFTCADIQVEVKIESSLAIPKEKILANFTIEAALYDTG 312

Query: 531  SWYERDGKVDLLASSVAHLKLQQVPNPVLGFHGYILGGKLELPKLWSAEQPNLYTLVIIL 710
            SWY+ +   +LL+S+VA+LKL   P  +LGF G +L GKLE+PKLWSAEQPNLY LV+ L
Sbjct: 313  SWYDSEESANLLSSNVANLKLTHSPMGLLGFLGNVLEGKLEMPKLWSAEQPNLYILVLSL 372

Query: 711  KDASGQQVDCESCQVGFRQISKATKEILVNGHPVIIRGVNRHEHHPRLGKTNVESCMVKD 890
            KDA+GQ VDCESC VG RQ+SKA K++LVNGHPVI+RGVNRHEHHPR+GKTN+ESCM+KD
Sbjct: 373  KDATGQVVDCESCLVGIRQVSKAPKQLLVNGHPVILRGVNRHEHHPRVGKTNIESCMIKD 432

Query: 891  LILMKQNNVNAVRNSHYPQHPRWYELCDLFGFYMIDEANIETHGFDYCGYIKHPTLEPSW 1070
            L+LMKQNN+NAVRNSHYPQH RWYELCDLFG YMIDEANIETHGF  C ++KHPT E SW
Sbjct: 433  LVLMKQNNMNAVRNSHYPQHHRWYELCDLFGMYMIDEANIETHGFYLCEHLKHPTQEQSW 492

Query: 1071 ASAMLDRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGWIRGRDPSRIVHYEGGGSR 1250
            A+AM+DRVI MVERDKNHACIISWSLGNEA YGPNHSA AGWIR +D SR+VHYEGGGSR
Sbjct: 493  AAAMMDRVISMVERDKNHACIISWSLGNEASYGPNHSAAAGWIREKDTSRLVHYEGGGSR 552

Query: 1251 TSSTDIICPMYMRVWDIVKIAKDPNETRPVILCEYSHAMGNSNGNIHEYWEAIDNTFGLQ 1430
            T+STDI+CPMYMRVWDIVKIAKDP E+RP+ILCEYSHAMGNSNGNIHEYWEAI++TFGLQ
Sbjct: 553  TTSTDIVCPMYMRVWDIVKIAKDPAESRPLILCEYSHAMGNSNGNIHEYWEAINSTFGLQ 612

Query: 1431 GGFIWDWVDQGLLKEGSDGCKHWAYGGDFGDTPNDLNFCLNGLIWPDRTPHPALNEVKYV 1610
            GGFIWDWVDQGLLK+  DG KHWAYGGDFGDTPNDLNFCLNGL WPDRTPHPAL+EVKYV
Sbjct: 613  GGFIWDWVDQGLLKDSGDGTKHWAYGGDFGDTPNDLNFCLNGLTWPDRTPHPALHEVKYV 672

Query: 1611 YQPIKVLSKDGALKILNSHFFDTTKELEFSWAVHGDXXXXXXXXXXXXVIKPQSSYDIEF 1790
            YQPIKV  ++  +KI ++HFF TT+ LEFSWA  GD            +I+PQSSY++E+
Sbjct: 673  YQPIKVSLEESRIKITSTHFFQTTQGLEFSWATQGDGYEIGSGILSLPLIEPQSSYELEW 732

Query: 1791 VSAPWYSLWASSPTTEVFLTITVKLSNSTRWAVAGHILASTQLQLPTKQECGPHVIKATD 1970
             S PWY L ASS   E+FLTIT  L +STRW  AGH+++S+Q+QLPT ++  PHVIK TD
Sbjct: 733  ESGPWYPLLASSFAEEIFLTITTTLLHSTRWVEAGHVVSSSQVQLPTTRKILPHVIKTTD 792

Query: 1971 GPVLGECLGDTIRVSKQDSWEIKINSKTGTIESWKVEGVSVMNKGIFPCFWRAPTD 2138
              VL E LGD +RVS    WEI  N +TG++ESWKV GV VMNKGIFPCFWRAPTD
Sbjct: 793  AKVLIETLGDIVRVSLPSFWEITWNIQTGSVESWKVGGVPVMNKGIFPCFWRAPTD 848


>ref|XP_007220592.1| hypothetical protein PRUPE_ppa000508mg [Prunus persica]
            gi|462417054|gb|EMJ21791.1| hypothetical protein
            PRUPE_ppa000508mg [Prunus persica]
          Length = 1121

 Score = 1130 bits (2922), Expect = 0.0
 Identities = 528/726 (72%), Positives = 599/726 (82%), Gaps = 14/726 (1%)
 Frame = +3

Query: 3    FDRPIYTNVVYPFPLDPPNVPVENPTGCYRTYFEIPEEWKNRRIFLHFEAVDSAFYAWVN 182
            FDRPIYTNVVYPFPLDPP VPV+NPTGCYRTYF IP+EWK RRI LHFEAVDSAF AW+N
Sbjct: 134  FDRPIYTNVVYPFPLDPPVVPVDNPTGCYRTYFHIPKEWKGRRILLHFEAVDSAFCAWLN 193

Query: 183  GIPIGYSQDSRLPAEFEISDFCHPRCSDKKNLLAVQVLRWSDGSYLEDQDHWWLSGIHRD 362
            G+PIGYSQDSRLPAEFEI+D+C+P   DKKN+LAVQV RWSDGSYLEDQDHWWLSGIHRD
Sbjct: 194  GVPIGYSQDSRLPAEFEITDYCYPSDMDKKNVLAVQVFRWSDGSYLEDQDHWWLSGIHRD 253

Query: 363  VLLLSKPQVFIADHFFKSSLSEDFSSADIQVEVKI----EPVSDCLLTNFSLEAVIYDTA 530
            VLLLSKPQVFIAD+FFKS+L+EDFS ADIQVEVKI    E   D +L N+ +EA ++DTA
Sbjct: 254  VLLLSKPQVFIADYFFKSTLAEDFSYADIQVEVKIDNSRETSKDSVLANYVIEAALFDTA 313

Query: 531  SWYERDGKVDLLASSVAHLKLQQVPNPVLGFHGYILGGKLELPKLWSAEQPNLYTLVIIL 710
             WY  DG  DL  S VA +KL    +  LGFHGY+L G+L++P+LWSAEQP+LY L + L
Sbjct: 314  CWYSIDGYGDLHLSYVASIKLNLSSSTSLGFHGYLLVGRLDMPRLWSAEQPSLYALAVTL 373

Query: 711  KDASGQQVDCESCQVGFRQISKATKEILVNGHPVIIRGVNRHEHHPRLGKTNVESCMVKD 890
            KDASG  +DCES  VG RQ+SKA K++LVNGHP+IIRGVNRHEHHPRLGKTN+ESCMVKD
Sbjct: 374  KDASGNLLDCESSLVGIRQVSKAPKQLLVNGHPIIIRGVNRHEHHPRLGKTNIESCMVKD 433

Query: 891  LILMKQNNVNAVRNSHYPQHPRWYELCDLFGFYMIDEANIETHGFDYCGYIKHPTLEPSW 1070
            L+LMKQ N+NAVRNSHYPQHPRWYELCDLFG YMIDEANI THGFD   ++KHPTLEPSW
Sbjct: 434  LVLMKQYNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIGTHGFDLSDHVKHPTLEPSW 493

Query: 1071 ASAMLDRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGWIRG----------RDPSR 1220
            A+AM+DRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAG  R            DPSR
Sbjct: 494  ATAMMDRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGTFRKCYYFVLVRELLDPSR 553

Query: 1221 IVHYEGGGSRTSSTDIICPMYMRVWDIVKIAKDPNETRPVILCEYSHAMGNSNGNIHEYW 1400
            +VHYEGGGSRTSSTDI+CPMYMRVWD++KI++DPNETRP+ILCEYSHAMGNSNGN+HEYW
Sbjct: 554  LVHYEGGGSRTSSTDIVCPMYMRVWDMMKISRDPNETRPLILCEYSHAMGNSNGNLHEYW 613

Query: 1401 EAIDNTFGLQGGFIWDWVDQGLLKEGSDGCKHWAYGGDFGDTPNDLNFCLNGLIWPDRTP 1580
            E ID+TFGLQGGFIWDWVDQ LLK+ +DG KHWAYGGDFGD PNDLNFCLNGLIWPDRTP
Sbjct: 614  ERIDSTFGLQGGFIWDWVDQALLKDNADGSKHWAYGGDFGDVPNDLNFCLNGLIWPDRTP 673

Query: 1581 HPALNEVKYVYQPIKVLSKDGALKILNSHFFDTTKELEFSWAVHGDXXXXXXXXXXXXVI 1760
            HPAL+EVKYVYQPIKV      L+I N+HF+ TT+ LEFSW VHGD            +I
Sbjct: 674  HPALHEVKYVYQPIKVSFSKETLRITNTHFYKTTQGLEFSWDVHGDGCKLGSGILPFPLI 733

Query: 1761 KPQSSYDIEFVSAPWYSLWASSPTTEVFLTITVKLSNSTRWAVAGHILASTQLQLPTKQE 1940
            +PQ SYDI++  A WY LW SS   E FLTIT KL  STRW  AGH+++STQ+QLP+K+E
Sbjct: 734  EPQKSYDIKWRLALWYPLWTSSSAEEYFLTITAKLLRSTRWVEAGHVISSTQVQLPSKRE 793

Query: 1941 CGPHVIKATDGPVLGECLGDTIRVSKQDSWEIKINSKTGTIESWKVEGVSVMNKGIFPCF 2120
              PHVIK  D   + E LGD IRVS+   WEI ++ +TGT++SW VEGV +M KGIFPCF
Sbjct: 794  IVPHVIKTEDATFVSETLGDKIRVSRHSFWEIILSVQTGTVDSWTVEGVPLMTKGIFPCF 853

Query: 2121 WRAPTD 2138
            WRA TD
Sbjct: 854  WRASTD 859


>ref|XP_002303929.2| glycoside hydrolase family 2 family protein [Populus trichocarpa]
            gi|550343549|gb|EEE78908.2| glycoside hydrolase family 2
            family protein [Populus trichocarpa]
          Length = 1113

 Score = 1128 bits (2918), Expect = 0.0
 Identities = 523/716 (73%), Positives = 601/716 (83%), Gaps = 4/716 (0%)
 Frame = +3

Query: 3    FDRPIYTNVVYPFPLDPPNVPVENPTGCYRTYFEIPEEWKNRRIFLHFEAVDSAFYAWVN 182
            +DRPIY NV+YPFP+DPP VP +NPTGCYRTYF++P+ W++RRIFLHFEAVDSAF AW+N
Sbjct: 133  YDRPIYANVLYPFPVDPPRVPDDNPTGCYRTYFDLPQGWQDRRIFLHFEAVDSAFCAWIN 192

Query: 183  GIPIGYSQDSRLPAEFEISDFCHPRCSDKKNLLAVQVLRWSDGSYLEDQDHWWLSGIHRD 362
            G+ +GYSQDSRLPAEFEI+D+C+P  S KKNLLAVQV RWSDGSYLEDQDHWW+SGIHRD
Sbjct: 193  GVAVGYSQDSRLPAEFEITDYCYPCGSGKKNLLAVQVFRWSDGSYLEDQDHWWMSGIHRD 252

Query: 363  VLLLSKPQVFIADHFFKSSLSEDFSSADIQVEVKIEPV----SDCLLTNFSLEAVIYDTA 530
            VLLLSK QVFIAD+FFKS+L+E+F+SADI+VEVKIE       D +  NF++EA +YDT 
Sbjct: 253  VLLLSKAQVFIADYFFKSNLAENFTSADIEVEVKIESALEIPRDKIFDNFTIEAALYDTG 312

Query: 531  SWYERDGKVDLLASSVAHLKLQQVPNPVLGFHGYILGGKLELPKLWSAEQPNLYTLVIIL 710
            SWY  +   DLL+S+VA+LKL   P  +LGF G  L GKLE PKLWSAEQPNLY LV+ L
Sbjct: 313  SWYNSEESPDLLSSNVANLKLTHSPMGILGFLGNFLEGKLEKPKLWSAEQPNLYILVLSL 372

Query: 711  KDASGQQVDCESCQVGFRQISKATKEILVNGHPVIIRGVNRHEHHPRLGKTNVESCMVKD 890
            KDA+GQ VDCESC VG RQISKA K++LVNG PVIIRGVNRHEHHPR+GKTN+ESCM+KD
Sbjct: 373  KDATGQVVDCESCLVGIRQISKAPKQLLVNGCPVIIRGVNRHEHHPRVGKTNIESCMIKD 432

Query: 891  LILMKQNNVNAVRNSHYPQHPRWYELCDLFGFYMIDEANIETHGFDYCGYIKHPTLEPSW 1070
            L+LMKQNN+NAVRNSHYPQHPRWYELCDLFG YMIDEANIETHGF  C ++KHPT E SW
Sbjct: 433  LVLMKQNNMNAVRNSHYPQHPRWYELCDLFGLYMIDEANIETHGFHLCEHLKHPTQEQSW 492

Query: 1071 ASAMLDRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGWIRGRDPSRIVHYEGGGSR 1250
            A+AM+DRVI MVERDKNHACIISWSLGNE+ YGPNHSA AGWIR RDPSR+VHYEGGGSR
Sbjct: 493  AAAMMDRVISMVERDKNHACIISWSLGNESSYGPNHSAAAGWIRERDPSRLVHYEGGGSR 552

Query: 1251 TSSTDIICPMYMRVWDIVKIAKDPNETRPVILCEYSHAMGNSNGNIHEYWEAIDNTFGLQ 1430
            T+STDIICPMYMRVWDIVKIAKDP E RP+ILCEYSHAMGNS+GNI EYW+AID+TFGLQ
Sbjct: 553  TASTDIICPMYMRVWDIVKIAKDPTEPRPLILCEYSHAMGNSSGNIREYWDAIDSTFGLQ 612

Query: 1431 GGFIWDWVDQGLLKEGSDGCKHWAYGGDFGDTPNDLNFCLNGLIWPDRTPHPALNEVKYV 1610
            GGFIW+WVDQ LLKE  DG KHWAYGGDFGDTPNDLNFCLNGL WPDRTPHPAL EVKYV
Sbjct: 613  GGFIWEWVDQALLKESGDGRKHWAYGGDFGDTPNDLNFCLNGLTWPDRTPHPALEEVKYV 672

Query: 1611 YQPIKVLSKDGALKILNSHFFDTTKELEFSWAVHGDXXXXXXXXXXXXVIKPQSSYDIEF 1790
            YQPIKV  ++  +KI N+HFF TT+ LEFSW VHGD            + +PQSSY +E+
Sbjct: 673  YQPIKVSLEESTIKITNTHFFQTTQGLEFSWTVHGDGYELGSGILSLPLTEPQSSYKLEW 732

Query: 1791 VSAPWYSLWASSPTTEVFLTITVKLSNSTRWAVAGHILASTQLQLPTKQECGPHVIKATD 1970
               PWY L ASS   E+F+TIT +L +STRW  AGH+++STQ+QLPT+Q+  PHVIK TD
Sbjct: 733  ELGPWYPLLASSFAEEIFVTITTRLLHSTRWVEAGHVISSTQIQLPTRQKIMPHVIKTTD 792

Query: 1971 GPVLGECLGDTIRVSKQDSWEIKINSKTGTIESWKVEGVSVMNKGIFPCFWRAPTD 2138
              V  E LGDT+RVS+ + WEI  N +TG+IESWKV GV V+ +GI PCFWRAPTD
Sbjct: 793  AKVFSETLGDTVRVSQLNVWEITWNIQTGSIESWKVGGVPVIKEGIIPCFWRAPTD 848


>ref|XP_004142388.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
            gi|449487140|ref|XP_004157508.1| PREDICTED:
            beta-galactosidase-like [Cucumis sativus]
          Length = 1114

 Score = 1122 bits (2903), Expect = 0.0
 Identities = 511/716 (71%), Positives = 607/716 (84%), Gaps = 4/716 (0%)
 Frame = +3

Query: 3    FDRPIYTNVVYPFPLDPPNVPVENPTGCYRTYFEIPEEWKNRRIFLHFEAVDSAFYAWVN 182
            FDRPIYTNVVYPFPLDPP+VP +NPTGCYRTYF +PEEWK RRI LHFEAVDSAF+AW+N
Sbjct: 133  FDRPIYTNVVYPFPLDPPHVPEDNPTGCYRTYFHLPEEWKGRRILLHFEAVDSAFFAWIN 192

Query: 183  GIPIGYSQDSRLPAEFEISDFCHPRCSDKKNLLAVQVLRWSDGSYLEDQDHWWLSGIHRD 362
            G  +GYSQDSRLPAEFEI+++CHP  S  KN+LAVQVL+WSDGSYLEDQD WWLSGIHRD
Sbjct: 193  GSLVGYSQDSRLPAEFEITEYCHPCGSQSKNVLAVQVLKWSDGSYLEDQDQWWLSGIHRD 252

Query: 363  VLLLSKPQVFIADHFFKSSLSEDFSSADIQVEVKI----EPVSDCLLTNFSLEAVIYDTA 530
            V+LLSKPQVFI D+FFKS + EDFS ADIQVEVKI    E   +  L NF LEAV++D+ 
Sbjct: 253  VILLSKPQVFIGDYFFKSHVGEDFSYADIQVEVKIDSSLEGRKENFLNNFKLEAVLFDSG 312

Query: 531  SWYERDGKVDLLASSVAHLKLQQVPNPVLGFHGYILGGKLELPKLWSAEQPNLYTLVIIL 710
            SW   DG +DLL+S++A++KL  +    LGFHGY+LGG+L+ PKLWSAEQP+LYTL+++L
Sbjct: 313  SWDNHDGNIDLLSSNMANVKLSLLSVTTLGFHGYVLGGRLQKPKLWSAEQPHLYTLIVLL 372

Query: 711  KDASGQQVDCESCQVGFRQISKATKEILVNGHPVIIRGVNRHEHHPRLGKTNVESCMVKD 890
            KD+S Q VDCESC VG R I+K  K++LVNG PV+IRGVNRHEHHPRLGKTN+E+CMV+D
Sbjct: 373  KDSSDQIVDCESCLVGIRSITKGPKQLLVNGRPVVIRGVNRHEHHPRLGKTNIEACMVRD 432

Query: 891  LILMKQNNVNAVRNSHYPQHPRWYELCDLFGFYMIDEANIETHGFDYCGYIKHPTLEPSW 1070
            L+LMKQ+N+NAVRNSHYPQH RWYELCDLFG YM+DEANIETHGFD+ G++KHPTL+PSW
Sbjct: 433  LVLMKQHNINAVRNSHYPQHSRWYELCDLFGMYMVDEANIETHGFDFSGHVKHPTLQPSW 492

Query: 1071 ASAMLDRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGWIRGRDPSRIVHYEGGGSR 1250
            A+AMLDRVIGMVERDKNHACII WSLGNE+GYGPNHSALAGWIRG+D SR++HYEGGGSR
Sbjct: 493  AAAMLDRVIGMVERDKNHACIIVWSLGNESGYGPNHSALAGWIRGKDSSRVLHYEGGGSR 552

Query: 1251 TSSTDIICPMYMRVWDIVKIAKDPNETRPVILCEYSHAMGNSNGNIHEYWEAIDNTFGLQ 1430
            TSSTDIICPMYMRVWDIV IA DPNETRP+ILCEYSH+MGNS GN+H+YWEAIDNTFGLQ
Sbjct: 553  TSSTDIICPMYMRVWDIVNIANDPNETRPLILCEYSHSMGNSTGNLHKYWEAIDNTFGLQ 612

Query: 1431 GGFIWDWVDQGLLKEGSDGCKHWAYGGDFGDTPNDLNFCLNGLIWPDRTPHPALNEVKYV 1610
            GGFIWDWVDQ LLKE  +G K WAYGG+FGD PND  FCLNG+ WPDRTPHPAL+EVKY+
Sbjct: 613  GGFIWDWVDQALLKEVGNGRKRWAYGGEFGDIPNDSTFCLNGVTWPDRTPHPALHEVKYL 672

Query: 1611 YQPIKVLSKDGALKILNSHFFDTTKELEFSWAVHGDXXXXXXXXXXXXVIKPQSSYDIEF 1790
            +Q IK+ SKDG L++LN HFF TT++LEFSW+++GD            VI P+ SY+IE+
Sbjct: 673  HQAIKISSKDGTLEVLNGHFFSTTEDLEFSWSIYGDGLELGNGILSLPVIGPRGSYNIEW 732

Query: 1791 VSAPWYSLWASSPTTEVFLTITVKLSNSTRWAVAGHILASTQLQLPTKQECGPHVIKATD 1970
             S+PWY LWASS   E FLTI+VKL +STRWA AGHI++ +Q+QLP K+E  PH IK   
Sbjct: 733  QSSPWYDLWASSSALEFFLTISVKLLHSTRWAEAGHIVSLSQVQLPMKREFFPHSIKNGS 792

Query: 1971 GPVLGECLGDTIRVSKQDSWEIKINSKTGTIESWKVEGVSVMNKGIFPCFWRAPTD 2138
              ++ E LGD++RV +Q+ WEIK++ +TGT+ESWKV+GV ++ KGI P FWRAPT+
Sbjct: 793  STLVNEILGDSVRVYQQNLWEIKLDVQTGTLESWKVKGVPLIIKGIIPSFWRAPTE 848


>ref|NP_001030858.1| glycoside hydrolase family 2 protein [Arabidopsis thaliana]
            gi|332645710|gb|AEE79231.1| glycoside hydrolase family 2
            protein [Arabidopsis thaliana]
          Length = 1108

 Score = 1118 bits (2893), Expect = 0.0
 Identities = 513/716 (71%), Positives = 599/716 (83%), Gaps = 4/716 (0%)
 Frame = +3

Query: 3    FDRPIYTNVVYPFPLDPPNVPVENPTGCYRTYFEIPEEWKNRRIFLHFEAVDSAFYAWVN 182
            FDRPIYTNVVYPFP DPP VP +NPTGCYRTYF+IP+EWK+RRI LHFEAVDSAF+AW+N
Sbjct: 133  FDRPIYTNVVYPFPNDPPYVPEDNPTGCYRTYFQIPKEWKDRRILLHFEAVDSAFFAWIN 192

Query: 183  GIPIGYSQDSRLPAEFEISDFCHPRCSDKKNLLAVQVLRWSDGSYLEDQDHWWLSGIHRD 362
            G P+GYSQDSRLPAEFEISD+C+P  S K+N+LAVQV RWSDGSYLEDQDHWWLSGIHRD
Sbjct: 193  GNPVGYSQDSRLPAEFEISDYCYPWDSGKQNVLAVQVFRWSDGSYLEDQDHWWLSGIHRD 252

Query: 363  VLLLSKPQVFIADHFFKSSLSEDFSSADIQVEVKI----EPVSDCLLTNFSLEAVIYDTA 530
            VLLL+KP+VFIAD+FFKS L++DFS ADIQVEVKI    E   D +L+NF +EA I+DT 
Sbjct: 253  VLLLAKPKVFIADYFFKSKLADDFSYADIQVEVKIDNMQESSKDLVLSNFIIEAAIFDTK 312

Query: 531  SWYERDGKVDLLASSVAHLKLQQVPNPVLGFHGYILGGKLELPKLWSAEQPNLYTLVIIL 710
            +WY  +G    L+  VA+LKL   P+P LGFHGY+L GKL+ P LWSAEQPN+Y LV+ L
Sbjct: 313  NWYNSEGFSCELSPKVANLKLNPSPSPTLGFHGYLLEGKLDSPNLWSAEQPNVYILVLTL 372

Query: 711  KDASGQQVDCESCQVGFRQISKATKEILVNGHPVIIRGVNRHEHHPRLGKTNVESCMVKD 890
            KD SG+ +D ES  VG RQ+SKA K++LVNGHPV+I+GVNRHEHHPR+GKTN+E+CMVKD
Sbjct: 373  KDTSGKVLDSESSIVGIRQVSKAFKQLLVNGHPVVIKGVNRHEHHPRVGKTNIEACMVKD 432

Query: 891  LILMKQNNVNAVRNSHYPQHPRWYELCDLFGFYMIDEANIETHGFDYCGYIKHPTLEPSW 1070
            LI+MK+ N+NAVRNSHYPQHPRWYELCDLFG YMIDEANIETHGFD  G++KHP  EPSW
Sbjct: 433  LIMMKEYNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHLKHPAKEPSW 492

Query: 1071 ASAMLDRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGWIRGRDPSRIVHYEGGGSR 1250
            A+AMLDRV+GMVERDKNH CIISWSLGNEAGYGPNHSA+AGWIR +DPSR+VHYEGGGSR
Sbjct: 493  AAAMLDRVVGMVERDKNHTCIISWSLGNEAGYGPNHSAMAGWIREKDPSRLVHYEGGGSR 552

Query: 1251 TSSTDIICPMYMRVWDIVKIAKDPNETRPVILCEYSHAMGNSNGNIHEYWEAIDNTFGLQ 1430
            TSSTDI+CPMYMRVWDI+KIA D NE+RP+ILCEY HAMGNSNGNI EYWEAIDNTFGLQ
Sbjct: 553  TSSTDIVCPMYMRVWDIIKIALDQNESRPLILCEYQHAMGNSNGNIDEYWEAIDNTFGLQ 612

Query: 1431 GGFIWDWVDQGLLKEGSDGCKHWAYGGDFGDTPNDLNFCLNGLIWPDRTPHPALNEVKYV 1610
            GGFIWDWVDQGLLK GSDG K WAYGGDFGD PNDLNFCLNGLIWPDRTPHPAL+EVK+ 
Sbjct: 613  GGFIWDWVDQGLLKLGSDGIKRWAYGGDFGDQPNDLNFCLNGLIWPDRTPHPALHEVKHC 672

Query: 1611 YQPIKVLSKDGALKILNSHFFDTTKELEFSWAVHGDXXXXXXXXXXXXVIKPQSSYDIEF 1790
            YQPIKV   DG +K+ N++FF+TT+ELEFSW +HGD            VIKPQ+S+++E+
Sbjct: 673  YQPIKVSLTDGMIKVANTYFFNTTEELEFSWTIHGDGLELGSGTLSIPVIKPQNSFEMEW 732

Query: 1791 VSAPWYSLWASSPTTEVFLTITVKLSNSTRWAVAGHILASTQLQLPTKQECGPHVIKATD 1970
             S PW+S W  S   E+FLTI  KL N TR   AGH+L+STQ+ LP K +  P  IK TD
Sbjct: 733  KSGPWFSFWNDSNAGELFLTINAKLLNLTRSLEAGHLLSSTQIPLPAKGQIIPQAIKKTD 792

Query: 1971 GPVLGECLGDTIRVSKQDSWEIKINSKTGTIESWKVEGVSVMNKGIFPCFWRAPTD 2138
              +  E +GD I++S++DSWE+ +N + GTIE WK++GV +MN+ I PCFWRAPTD
Sbjct: 793  TSITCETVGDFIKISQKDSWELMVNVRKGTIEGWKIQGVLLMNEAILPCFWRAPTD 848


>ref|NP_001190087.1| glycoside hydrolase family 2 protein [Arabidopsis thaliana]
            gi|332645711|gb|AEE79232.1| glycoside hydrolase family 2
            protein [Arabidopsis thaliana]
          Length = 1120

 Score = 1118 bits (2893), Expect = 0.0
 Identities = 513/716 (71%), Positives = 599/716 (83%), Gaps = 4/716 (0%)
 Frame = +3

Query: 3    FDRPIYTNVVYPFPLDPPNVPVENPTGCYRTYFEIPEEWKNRRIFLHFEAVDSAFYAWVN 182
            FDRPIYTNVVYPFP DPP VP +NPTGCYRTYF+IP+EWK+RRI LHFEAVDSAF+AW+N
Sbjct: 146  FDRPIYTNVVYPFPNDPPYVPEDNPTGCYRTYFQIPKEWKDRRILLHFEAVDSAFFAWIN 205

Query: 183  GIPIGYSQDSRLPAEFEISDFCHPRCSDKKNLLAVQVLRWSDGSYLEDQDHWWLSGIHRD 362
            G P+GYSQDSRLPAEFEISD+C+P  S K+N+LAVQV RWSDGSYLEDQDHWWLSGIHRD
Sbjct: 206  GNPVGYSQDSRLPAEFEISDYCYPWDSGKQNVLAVQVFRWSDGSYLEDQDHWWLSGIHRD 265

Query: 363  VLLLSKPQVFIADHFFKSSLSEDFSSADIQVEVKI----EPVSDCLLTNFSLEAVIYDTA 530
            VLLL+KP+VFIAD+FFKS L++DFS ADIQVEVKI    E   D +L+NF +EA I+DT 
Sbjct: 266  VLLLAKPKVFIADYFFKSKLADDFSYADIQVEVKIDNMQESSKDLVLSNFIIEAAIFDTK 325

Query: 531  SWYERDGKVDLLASSVAHLKLQQVPNPVLGFHGYILGGKLELPKLWSAEQPNLYTLVIIL 710
            +WY  +G    L+  VA+LKL   P+P LGFHGY+L GKL+ P LWSAEQPN+Y LV+ L
Sbjct: 326  NWYNSEGFSCELSPKVANLKLNPSPSPTLGFHGYLLEGKLDSPNLWSAEQPNVYILVLTL 385

Query: 711  KDASGQQVDCESCQVGFRQISKATKEILVNGHPVIIRGVNRHEHHPRLGKTNVESCMVKD 890
            KD SG+ +D ES  VG RQ+SKA K++LVNGHPV+I+GVNRHEHHPR+GKTN+E+CMVKD
Sbjct: 386  KDTSGKVLDSESSIVGIRQVSKAFKQLLVNGHPVVIKGVNRHEHHPRVGKTNIEACMVKD 445

Query: 891  LILMKQNNVNAVRNSHYPQHPRWYELCDLFGFYMIDEANIETHGFDYCGYIKHPTLEPSW 1070
            LI+MK+ N+NAVRNSHYPQHPRWYELCDLFG YMIDEANIETHGFD  G++KHP  EPSW
Sbjct: 446  LIMMKEYNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHLKHPAKEPSW 505

Query: 1071 ASAMLDRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGWIRGRDPSRIVHYEGGGSR 1250
            A+AMLDRV+GMVERDKNH CIISWSLGNEAGYGPNHSA+AGWIR +DPSR+VHYEGGGSR
Sbjct: 506  AAAMLDRVVGMVERDKNHTCIISWSLGNEAGYGPNHSAMAGWIREKDPSRLVHYEGGGSR 565

Query: 1251 TSSTDIICPMYMRVWDIVKIAKDPNETRPVILCEYSHAMGNSNGNIHEYWEAIDNTFGLQ 1430
            TSSTDI+CPMYMRVWDI+KIA D NE+RP+ILCEY HAMGNSNGNI EYWEAIDNTFGLQ
Sbjct: 566  TSSTDIVCPMYMRVWDIIKIALDQNESRPLILCEYQHAMGNSNGNIDEYWEAIDNTFGLQ 625

Query: 1431 GGFIWDWVDQGLLKEGSDGCKHWAYGGDFGDTPNDLNFCLNGLIWPDRTPHPALNEVKYV 1610
            GGFIWDWVDQGLLK GSDG K WAYGGDFGD PNDLNFCLNGLIWPDRTPHPAL+EVK+ 
Sbjct: 626  GGFIWDWVDQGLLKLGSDGIKRWAYGGDFGDQPNDLNFCLNGLIWPDRTPHPALHEVKHC 685

Query: 1611 YQPIKVLSKDGALKILNSHFFDTTKELEFSWAVHGDXXXXXXXXXXXXVIKPQSSYDIEF 1790
            YQPIKV   DG +K+ N++FF+TT+ELEFSW +HGD            VIKPQ+S+++E+
Sbjct: 686  YQPIKVSLTDGMIKVANTYFFNTTEELEFSWTIHGDGLELGSGTLSIPVIKPQNSFEMEW 745

Query: 1791 VSAPWYSLWASSPTTEVFLTITVKLSNSTRWAVAGHILASTQLQLPTKQECGPHVIKATD 1970
             S PW+S W  S   E+FLTI  KL N TR   AGH+L+STQ+ LP K +  P  IK TD
Sbjct: 746  KSGPWFSFWNDSNAGELFLTINAKLLNLTRSLEAGHLLSSTQIPLPAKGQIIPQAIKKTD 805

Query: 1971 GPVLGECLGDTIRVSKQDSWEIKINSKTGTIESWKVEGVSVMNKGIFPCFWRAPTD 2138
              +  E +GD I++S++DSWE+ +N + GTIE WK++GV +MN+ I PCFWRAPTD
Sbjct: 806  TSITCETVGDFIKISQKDSWELMVNVRKGTIEGWKIQGVLLMNEAILPCFWRAPTD 861


>ref|NP_680128.1| glycoside hydrolase family 2 protein [Arabidopsis thaliana]
            gi|20147224|gb|AAM10327.1| At3g54435 [Arabidopsis
            thaliana] gi|332645709|gb|AEE79230.1| glycoside hydrolase
            family 2 protein [Arabidopsis thaliana]
          Length = 1107

 Score = 1118 bits (2893), Expect = 0.0
 Identities = 513/716 (71%), Positives = 599/716 (83%), Gaps = 4/716 (0%)
 Frame = +3

Query: 3    FDRPIYTNVVYPFPLDPPNVPVENPTGCYRTYFEIPEEWKNRRIFLHFEAVDSAFYAWVN 182
            FDRPIYTNVVYPFP DPP VP +NPTGCYRTYF+IP+EWK+RRI LHFEAVDSAF+AW+N
Sbjct: 133  FDRPIYTNVVYPFPNDPPYVPEDNPTGCYRTYFQIPKEWKDRRILLHFEAVDSAFFAWIN 192

Query: 183  GIPIGYSQDSRLPAEFEISDFCHPRCSDKKNLLAVQVLRWSDGSYLEDQDHWWLSGIHRD 362
            G P+GYSQDSRLPAEFEISD+C+P  S K+N+LAVQV RWSDGSYLEDQDHWWLSGIHRD
Sbjct: 193  GNPVGYSQDSRLPAEFEISDYCYPWDSGKQNVLAVQVFRWSDGSYLEDQDHWWLSGIHRD 252

Query: 363  VLLLSKPQVFIADHFFKSSLSEDFSSADIQVEVKI----EPVSDCLLTNFSLEAVIYDTA 530
            VLLL+KP+VFIAD+FFKS L++DFS ADIQVEVKI    E   D +L+NF +EA I+DT 
Sbjct: 253  VLLLAKPKVFIADYFFKSKLADDFSYADIQVEVKIDNMQESSKDLVLSNFIIEAAIFDTK 312

Query: 531  SWYERDGKVDLLASSVAHLKLQQVPNPVLGFHGYILGGKLELPKLWSAEQPNLYTLVIIL 710
            +WY  +G    L+  VA+LKL   P+P LGFHGY+L GKL+ P LWSAEQPN+Y LV+ L
Sbjct: 313  NWYNSEGFSCELSPKVANLKLNPSPSPTLGFHGYLLEGKLDSPNLWSAEQPNVYILVLTL 372

Query: 711  KDASGQQVDCESCQVGFRQISKATKEILVNGHPVIIRGVNRHEHHPRLGKTNVESCMVKD 890
            KD SG+ +D ES  VG RQ+SKA K++LVNGHPV+I+GVNRHEHHPR+GKTN+E+CMVKD
Sbjct: 373  KDTSGKVLDSESSIVGIRQVSKAFKQLLVNGHPVVIKGVNRHEHHPRVGKTNIEACMVKD 432

Query: 891  LILMKQNNVNAVRNSHYPQHPRWYELCDLFGFYMIDEANIETHGFDYCGYIKHPTLEPSW 1070
            LI+MK+ N+NAVRNSHYPQHPRWYELCDLFG YMIDEANIETHGFD  G++KHP  EPSW
Sbjct: 433  LIMMKEYNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHLKHPAKEPSW 492

Query: 1071 ASAMLDRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGWIRGRDPSRIVHYEGGGSR 1250
            A+AMLDRV+GMVERDKNH CIISWSLGNEAGYGPNHSA+AGWIR +DPSR+VHYEGGGSR
Sbjct: 493  AAAMLDRVVGMVERDKNHTCIISWSLGNEAGYGPNHSAMAGWIREKDPSRLVHYEGGGSR 552

Query: 1251 TSSTDIICPMYMRVWDIVKIAKDPNETRPVILCEYSHAMGNSNGNIHEYWEAIDNTFGLQ 1430
            TSSTDI+CPMYMRVWDI+KIA D NE+RP+ILCEY HAMGNSNGNI EYWEAIDNTFGLQ
Sbjct: 553  TSSTDIVCPMYMRVWDIIKIALDQNESRPLILCEYQHAMGNSNGNIDEYWEAIDNTFGLQ 612

Query: 1431 GGFIWDWVDQGLLKEGSDGCKHWAYGGDFGDTPNDLNFCLNGLIWPDRTPHPALNEVKYV 1610
            GGFIWDWVDQGLLK GSDG K WAYGGDFGD PNDLNFCLNGLIWPDRTPHPAL+EVK+ 
Sbjct: 613  GGFIWDWVDQGLLKLGSDGIKRWAYGGDFGDQPNDLNFCLNGLIWPDRTPHPALHEVKHC 672

Query: 1611 YQPIKVLSKDGALKILNSHFFDTTKELEFSWAVHGDXXXXXXXXXXXXVIKPQSSYDIEF 1790
            YQPIKV   DG +K+ N++FF+TT+ELEFSW +HGD            VIKPQ+S+++E+
Sbjct: 673  YQPIKVSLTDGMIKVANTYFFNTTEELEFSWTIHGDGLELGSGTLSIPVIKPQNSFEMEW 732

Query: 1791 VSAPWYSLWASSPTTEVFLTITVKLSNSTRWAVAGHILASTQLQLPTKQECGPHVIKATD 1970
             S PW+S W  S   E+FLTI  KL N TR   AGH+L+STQ+ LP K +  P  IK TD
Sbjct: 733  KSGPWFSFWNDSNAGELFLTINAKLLNLTRSLEAGHLLSSTQIPLPAKGQIIPQAIKKTD 792

Query: 1971 GPVLGECLGDTIRVSKQDSWEIKINSKTGTIESWKVEGVSVMNKGIFPCFWRAPTD 2138
              +  E +GD I++S++DSWE+ +N + GTIE WK++GV +MN+ I PCFWRAPTD
Sbjct: 793  TSITCETVGDFIKISQKDSWELMVNVRKGTIEGWKIQGVLLMNEAILPCFWRAPTD 848


>ref|XP_003542824.2| PREDICTED: beta-galactosidase-like [Glycine max]
          Length = 1121

 Score = 1117 bits (2890), Expect = 0.0
 Identities = 512/718 (71%), Positives = 598/718 (83%), Gaps = 6/718 (0%)
 Frame = +3

Query: 3    FDRPIYTNVVYPFPLDPPNVPVENPTGCYRTYFEIPEEWKNRRIFLHFEAVDSAFYAWVN 182
            FD PIYTNVVYPFPLDPP +PVENPTGCYRTYF IP+EW+ RR+ LHFEAVDSAF AW+N
Sbjct: 138  FDTPIYTNVVYPFPLDPPFIPVENPTGCYRTYFHIPKEWEGRRVLLHFEAVDSAFCAWIN 197

Query: 183  GIPIGYSQDSRLPAEFEISDFCHPRCSDKKNLLAVQVLRWSDGSYLEDQDHWWLSGIHRD 362
            G P+GYSQDSRLPAEFEI+DFCHP  SD KN+LAVQV RW DGSYLEDQD W LSGIHRD
Sbjct: 198  GHPVGYSQDSRLPAEFEITDFCHPCGSDLKNVLAVQVFRWCDGSYLEDQDQWRLSGIHRD 257

Query: 363  VLLLSKPQVFIADHFFKSSLSEDFSSADIQVEVKI----EPVSDCLLTNFSLEAVIYDTA 530
            VLL++KP+VFI D+FFKS+L+EDFS A+I VEVKI    E   D +LTN+S+EA ++D+ 
Sbjct: 258  VLLMAKPEVFITDYFFKSNLAEDFSCAEIMVEVKIDRLQETSKDNVLTNYSIEATLFDSG 317

Query: 531  SWYERDGKVDLLASSVAHLKLQQVPNPV--LGFHGYILGGKLELPKLWSAEQPNLYTLVI 704
            SWY  DG  DLL+S+VA +KLQ    P   LGFHGY+L GKL+ PKLWSAE+P LYTLV+
Sbjct: 318  SWYTSDGNPDLLSSNVADIKLQSSSAPAQPLGFHGYVLTGKLKSPKLWSAEKPYLYTLVV 377

Query: 705  ILKDASGQQVDCESCQVGFRQISKATKEILVNGHPVIIRGVNRHEHHPRLGKTNVESCMV 884
            +LKD SG+ VDCESC VGFR++SKA K++LVNGH V+IRGVNRHEHHP++GK N+ESCM+
Sbjct: 378  VLKDRSGRIVDCESCPVGFRKVSKAHKQLLVNGHAVVIRGVNRHEHHPQVGKANIESCMI 437

Query: 885  KDLILMKQNNVNAVRNSHYPQHPRWYELCDLFGFYMIDEANIETHGFDYCGYIKHPTLEP 1064
            KDL+LMKQNN+NAVRNSHYPQHPRWYELCDLFG YMIDEANIETH FDY  ++KHPT+EP
Sbjct: 438  KDLVLMKQNNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHHFDYSKHLKHPTMEP 497

Query: 1065 SWASAMLDRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGWIRGRDPSRIVHYEGGG 1244
             WA++MLDRVIGMVERDKNH CIISWSLGNE+G+G NH ALAGWIRGRD SR++HYEGGG
Sbjct: 498  KWATSMLDRVIGMVERDKNHTCIISWSLGNESGFGTNHFALAGWIRGRDSSRVLHYEGGG 557

Query: 1245 SRTSSTDIICPMYMRVWDIVKIAKDPNETRPVILCEYSHAMGNSNGNIHEYWEAIDNTFG 1424
            SRT  TDI+CPMYMRVWD+VKIA DP ETRP+ILCEYSHAMGNSNGN+H YWEAIDNTFG
Sbjct: 558  SRTPCTDIVCPMYMRVWDMVKIANDPTETRPLILCEYSHAMGNSNGNLHIYWEAIDNTFG 617

Query: 1425 LQGGFIWDWVDQGLLKEGSDGCKHWAYGGDFGDTPNDLNFCLNGLIWPDRTPHPALNEVK 1604
            LQGGFIWDWVDQ L+K   DG KHWAYGG+FGD PNDLNFCLNGL +PDRTPHP L+EVK
Sbjct: 618  LQGGFIWDWVDQALVKVYEDGTKHWAYGGEFGDVPNDLNFCLNGLTFPDRTPHPVLHEVK 677

Query: 1605 YVYQPIKVLSKDGALKILNSHFFDTTKELEFSWAVHGDXXXXXXXXXXXXVIKPQSSYDI 1784
            Y+YQPIKV  K+G L+I N+HFF TT+ LEFSW++  D             IKPQSS+ +
Sbjct: 678  YLYQPIKVALKEGKLEIKNTHFFQTTEGLEFSWSISADGYNLGSGLLGLVPIKPQSSHAV 737

Query: 1785 EFVSAPWYSLWASSPTTEVFLTITVKLSNSTRWAVAGHILASTQLQLPTKQECGPHVIKA 1964
            ++ S PWYSLWAS+   E+FLTIT KL NSTRW  AGHI++S Q+QLPT++   PHVI  
Sbjct: 738  DWQSGPWYSLWASTDEEELFLTITAKLLNSTRWVEAGHIVSSAQVQLPTRRNIAPHVIDI 797

Query: 1965 TDGPVLGECLGDTIRVSKQDSWEIKINSKTGTIESWKVEGVSVMNKGIFPCFWRAPTD 2138
              G ++ E LGDTI V +QD+W++ +N+KTG +ESWKV+GV VM KGI PCFWRAP D
Sbjct: 798  NGGTLVAETLGDTIVVKQQDAWDLTLNTKTGLVESWKVKGVHVMKKGILPCFWRAPID 855


>ref|XP_006293102.1| hypothetical protein CARUB_v10019396mg [Capsella rubella]
            gi|482561809|gb|EOA26000.1| hypothetical protein
            CARUB_v10019396mg [Capsella rubella]
          Length = 1107

 Score = 1117 bits (2889), Expect = 0.0
 Identities = 513/716 (71%), Positives = 597/716 (83%), Gaps = 4/716 (0%)
 Frame = +3

Query: 3    FDRPIYTNVVYPFPLDPPNVPVENPTGCYRTYFEIPEEWKNRRIFLHFEAVDSAFYAWVN 182
            FDRPIYTNVVYPFP DPP+VP +NPTGCYRTYF+IP+EWK+RRI LHFEAVDSAF+AW+N
Sbjct: 133  FDRPIYTNVVYPFPNDPPHVPEDNPTGCYRTYFQIPKEWKDRRILLHFEAVDSAFFAWIN 192

Query: 183  GIPIGYSQDSRLPAEFEISDFCHPRCSDKKNLLAVQVLRWSDGSYLEDQDHWWLSGIHRD 362
            G PIGYSQDSRLPAEFEIS++C+P  S K+N+LAVQV RWSDGSYLEDQDHWWLSGIHRD
Sbjct: 193  GNPIGYSQDSRLPAEFEISEYCYPWDSGKQNVLAVQVFRWSDGSYLEDQDHWWLSGIHRD 252

Query: 363  VLLLSKPQVFIADHFFKSSLSEDFSSADIQVEVKI----EPVSDCLLTNFSLEAVIYDTA 530
            VLLL+KP+VFIAD+FFKS L++DFS ADIQVEVKI    E   D +L+NF +EA ++ T 
Sbjct: 253  VLLLAKPKVFIADYFFKSKLADDFSYADIQVEVKIDNMQESSKDLVLSNFIIEAAVFSTK 312

Query: 531  SWYERDGKVDLLASSVAHLKLQQVPNPVLGFHGYILGGKLELPKLWSAEQPNLYTLVIIL 710
            +WY  +G    L+  VA+L L   P+PVLGFHGY+L GKL+ P LWSAEQPN+Y LV+ L
Sbjct: 313  NWYNSEGFSSELSPKVANLTLNPSPSPVLGFHGYLLEGKLDSPNLWSAEQPNVYILVLTL 372

Query: 711  KDASGQQVDCESCQVGFRQISKATKEILVNGHPVIIRGVNRHEHHPRLGKTNVESCMVKD 890
            KD SG+ +D ES  VG RQ+SKA K++LVNGHPV+I+GVNRHEHHPR+GKTN+ESCMVKD
Sbjct: 373  KDTSGKILDSESSIVGIRQVSKAFKQLLVNGHPVVIKGVNRHEHHPRVGKTNIESCMVKD 432

Query: 891  LILMKQNNVNAVRNSHYPQHPRWYELCDLFGFYMIDEANIETHGFDYCGYIKHPTLEPSW 1070
            LI+MK+ N+NAVRNSHYPQHPRWYELCDLFG YMIDEANIETHGFD  G++KHP  EPSW
Sbjct: 433  LIMMKEYNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHLKHPAKEPSW 492

Query: 1071 ASAMLDRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGWIRGRDPSRIVHYEGGGSR 1250
            A+AMLDRV+GMVERDKNH CI+SWSLGNEAGYGPNHSA+AGWIR +DPSR+VHYEGGGSR
Sbjct: 493  AAAMLDRVVGMVERDKNHTCIVSWSLGNEAGYGPNHSAMAGWIREKDPSRLVHYEGGGSR 552

Query: 1251 TSSTDIICPMYMRVWDIVKIAKDPNETRPVILCEYSHAMGNSNGNIHEYWEAIDNTFGLQ 1430
            TSSTDIICPMYMRVWDIVKIA D NE+RP+ILCEY HAMGNSNGNI EYWEAIDNTFGLQ
Sbjct: 553  TSSTDIICPMYMRVWDIVKIALDQNESRPLILCEYQHAMGNSNGNIDEYWEAIDNTFGLQ 612

Query: 1431 GGFIWDWVDQGLLKEGSDGCKHWAYGGDFGDTPNDLNFCLNGLIWPDRTPHPALNEVKYV 1610
            GGFIWDWVDQGLLK GSDG K WAYGGDFGD PNDLNFCLNGLIWPDRTPHPAL+EVKY 
Sbjct: 613  GGFIWDWVDQGLLKPGSDGIKRWAYGGDFGDQPNDLNFCLNGLIWPDRTPHPALHEVKYC 672

Query: 1611 YQPIKVLSKDGALKILNSHFFDTTKELEFSWAVHGDXXXXXXXXXXXXVIKPQSSYDIEF 1790
            YQPI V   DG +K+ N++FF TT+ELEFSW VHGD            VIKPQ+S+D+E+
Sbjct: 673  YQPINVSLTDGTMKVANTYFFHTTEELEFSWTVHGDGLELGSGALSIPVIKPQNSFDMEW 732

Query: 1791 VSAPWYSLWASSPTTEVFLTITVKLSNSTRWAVAGHILASTQLQLPTKQECGPHVIKATD 1970
             S PW+S W  S   E+FLTIT KL + TR    GH+++STQ+ LP K++  P  +K TD
Sbjct: 733  KSGPWFSFWNDSNAGELFLTITAKLLSPTRSLETGHLVSSTQIPLPAKRQIIPQALKKTD 792

Query: 1971 GPVLGECLGDTIRVSKQDSWEIKINSKTGTIESWKVEGVSVMNKGIFPCFWRAPTD 2138
              +  E +GD I++S+QDSWE+ IN + G IE WK++GV +MN+ I PCFWRAPTD
Sbjct: 793  TIIACETVGDFIKISQQDSWELMINVRKGAIEGWKIQGVLLMNEAILPCFWRAPTD 848


>ref|XP_002877978.1| hydrolase, hydrolyzing O-glycosyl compounds [Arabidopsis lyrata
            subsp. lyrata] gi|297323816|gb|EFH54237.1| hydrolase,
            hydrolyzing O-glycosyl compounds [Arabidopsis lyrata
            subsp. lyrata]
          Length = 1107

 Score = 1117 bits (2888), Expect = 0.0
 Identities = 511/716 (71%), Positives = 599/716 (83%), Gaps = 4/716 (0%)
 Frame = +3

Query: 3    FDRPIYTNVVYPFPLDPPNVPVENPTGCYRTYFEIPEEWKNRRIFLHFEAVDSAFYAWVN 182
            FDRPIYTNVVYPFP DPP+VP +NPTGCYRTYF+IP+EWK+RRI LHFEAVDSAF+AW+N
Sbjct: 133  FDRPIYTNVVYPFPNDPPHVPEDNPTGCYRTYFQIPKEWKDRRILLHFEAVDSAFFAWIN 192

Query: 183  GIPIGYSQDSRLPAEFEISDFCHPRCSDKKNLLAVQVLRWSDGSYLEDQDHWWLSGIHRD 362
            G P+GYSQDSRLPAEFEISD+C+P  S K+N+LAVQV RWSDGSYLEDQDHWWLSGIHRD
Sbjct: 193  GNPVGYSQDSRLPAEFEISDYCYPWDSGKQNVLAVQVFRWSDGSYLEDQDHWWLSGIHRD 252

Query: 363  VLLLSKPQVFIADHFFKSSLSEDFSSADIQVEVKIEPVSDC----LLTNFSLEAVIYDTA 530
            VLLL+KP+VFIAD+FFKS L++DFS ADIQVEVKI+ + +     +L+NF +EA ++DT 
Sbjct: 253  VLLLAKPKVFIADYFFKSKLADDFSYADIQVEVKIDNMQESSKHLVLSNFIIEAAVFDTK 312

Query: 531  SWYERDGKVDLLASSVAHLKLQQVPNPVLGFHGYILGGKLELPKLWSAEQPNLYTLVIIL 710
            +WY  +G    L+  VAHLKL   P+P LGFHGY+L GKL+ P LWSAEQPN+Y LV+ L
Sbjct: 313  NWYNSEGFNCELSPKVAHLKLNPSPSPTLGFHGYLLEGKLDSPNLWSAEQPNVYILVLTL 372

Query: 711  KDASGQQVDCESCQVGFRQISKATKEILVNGHPVIIRGVNRHEHHPRLGKTNVESCMVKD 890
            KD SG+ +D ES  VG RQ+SKA K++LVNGHPV+I+GVNRHEHHPR+GKTN+E+CMVKD
Sbjct: 373  KDTSGKVLDSESSIVGIRQVSKAFKQLLVNGHPVVIKGVNRHEHHPRVGKTNIEACMVKD 432

Query: 891  LILMKQNNVNAVRNSHYPQHPRWYELCDLFGFYMIDEANIETHGFDYCGYIKHPTLEPSW 1070
            LI+MK+ N+NAVRNSHYPQHPRWYELCDLFG YMIDEANIETHGFD  G++KHP  EPSW
Sbjct: 433  LIMMKEYNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHLKHPAKEPSW 492

Query: 1071 ASAMLDRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGWIRGRDPSRIVHYEGGGSR 1250
            A+AMLDRV+GMVERDKNH CIISWSLGNEAGYGPNHSA+AGWIR +DPSR+VHYEGGGSR
Sbjct: 493  AAAMLDRVVGMVERDKNHTCIISWSLGNEAGYGPNHSAMAGWIREKDPSRLVHYEGGGSR 552

Query: 1251 TSSTDIICPMYMRVWDIVKIAKDPNETRPVILCEYSHAMGNSNGNIHEYWEAIDNTFGLQ 1430
            TSSTDI+CPMYMRVWDI+KIA D NE+RP+ILCEY HAMGNSNGNI EYW+AIDNTFGLQ
Sbjct: 553  TSSTDIVCPMYMRVWDIIKIALDQNESRPLILCEYQHAMGNSNGNIDEYWDAIDNTFGLQ 612

Query: 1431 GGFIWDWVDQGLLKEGSDGCKHWAYGGDFGDTPNDLNFCLNGLIWPDRTPHPALNEVKYV 1610
            GGFIWDWVDQGLLK GSDG K WAYGGDFGD PNDLNFCLNGLIWPDRTPHPAL+EVK+ 
Sbjct: 613  GGFIWDWVDQGLLKLGSDGIKRWAYGGDFGDQPNDLNFCLNGLIWPDRTPHPALHEVKHC 672

Query: 1611 YQPIKVLSKDGALKILNSHFFDTTKELEFSWAVHGDXXXXXXXXXXXXVIKPQSSYDIEF 1790
            YQPIKV   DG +K+ N++FF TT+ELEFSW +HGD            VIKPQ+S++IE+
Sbjct: 673  YQPIKVSLTDGLIKVANTYFFHTTEELEFSWKIHGDGLELGSGTLSIPVIKPQNSFEIEW 732

Query: 1791 VSAPWYSLWASSPTTEVFLTITVKLSNSTRWAVAGHILASTQLQLPTKQECGPHVIKATD 1970
             S PW+S W  S   E+FLTI  KL N TR   AGH+L+STQ+ LP K++  P  IK TD
Sbjct: 733  KSGPWFSFWNDSNAGELFLTINAKLLNPTRSLEAGHLLSSTQIPLPAKRQIIPQAIKKTD 792

Query: 1971 GPVLGECLGDTIRVSKQDSWEIKINSKTGTIESWKVEGVSVMNKGIFPCFWRAPTD 2138
              +  E +GD I++S+QDSWE+ IN + G IE WK++GV +M + I PCFWRAPTD
Sbjct: 793  TIITCETVGDFIKISQQDSWELMINVRKGAIEGWKIQGVLLMKEDILPCFWRAPTD 848


>ref|XP_007133761.1| hypothetical protein PHAVU_011G206800g [Phaseolus vulgaris]
            gi|561006761|gb|ESW05755.1| hypothetical protein
            PHAVU_011G206800g [Phaseolus vulgaris]
          Length = 1120

 Score = 1112 bits (2875), Expect = 0.0
 Identities = 509/718 (70%), Positives = 598/718 (83%), Gaps = 6/718 (0%)
 Frame = +3

Query: 3    FDRPIYTNVVYPFPLDPPNVPVENPTGCYRTYFEIPEEWKNRRIFLHFEAVDSAFYAWVN 182
            FD PIYTNVVYPFP+DPP +P+ENPTGCYRTYF+IP+EW+ RRI LHFEAVDSAF AW+N
Sbjct: 137  FDIPIYTNVVYPFPVDPPFIPMENPTGCYRTYFQIPKEWEGRRILLHFEAVDSAFCAWIN 196

Query: 183  GIPIGYSQDSRLPAEFEISDFCHPRCSDKKNLLAVQVLRWSDGSYLEDQDHWWLSGIHRD 362
            G P+GYSQDSRLPAEFEI+DFCHP  SD KN+LAVQV RWSDGSYLEDQD W LSGIHRD
Sbjct: 197  GHPVGYSQDSRLPAEFEITDFCHPCGSDLKNVLAVQVYRWSDGSYLEDQDQWRLSGIHRD 256

Query: 363  VLLLSKPQVFIADHFFKSSLSEDFSSADIQVEVKI----EPVSDCLLTNFSLEAVIYDTA 530
            VLL+SKP+VF+ D+FFKS+L+EDFS ADI VEVKI    E   D +LT++S+EA ++D+ 
Sbjct: 257  VLLMSKPEVFVTDYFFKSNLAEDFSYADILVEVKIDRLKETSKDNVLTDYSIEATLFDSG 316

Query: 531  SWYERDGKVDLLASSVAHLKLQ--QVPNPVLGFHGYILGGKLELPKLWSAEQPNLYTLVI 704
            SWY  +G  DLL+S+VA +KLQ    P+P LGFHGY+L GKL+ PKLWSAE+P LYTLV+
Sbjct: 317  SWYTSEGIADLLSSNVADIKLQPSSTPSPTLGFHGYVLTGKLQSPKLWSAEKPYLYTLVV 376

Query: 705  ILKDASGQQVDCESCQVGFRQISKATKEILVNGHPVIIRGVNRHEHHPRLGKTNVESCMV 884
            +LKD SG+ VDCESC VGFR++SKA K++LVNGH V+IRGVNRHEHHP++GK N+ESCM+
Sbjct: 377  VLKDQSGRVVDCESCPVGFRKVSKAHKQLLVNGHAVVIRGVNRHEHHPQVGKANIESCMI 436

Query: 885  KDLILMKQNNVNAVRNSHYPQHPRWYELCDLFGFYMIDEANIETHGFDYCGYIKHPTLEP 1064
            KDL+LMKQNN+NAVRNSHYPQHPRWYELCDLFG YMIDEANIETHGFDY  ++KHPTLEP
Sbjct: 437  KDLVLMKQNNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDYSKHLKHPTLEP 496

Query: 1065 SWASAMLDRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGWIRGRDPSRIVHYEGGG 1244
             WASAMLDRVIGMVERDKNH CIISWSLGNE+G+G NH ALAGWIRGRD SR++HYEGGG
Sbjct: 497  MWASAMLDRVIGMVERDKNHTCIISWSLGNESGFGTNHFALAGWIRGRDSSRVLHYEGGG 556

Query: 1245 SRTSSTDIICPMYMRVWDIVKIAKDPNETRPVILCEYSHAMGNSNGNIHEYWEAIDNTFG 1424
            SRT  TDI+CPMYMRVWD+VKIA DP ETRP+ILCEYSHAMGNSNGN+H YWEAIDNTFG
Sbjct: 557  SRTPCTDIVCPMYMRVWDMVKIANDPTETRPLILCEYSHAMGNSNGNLHTYWEAIDNTFG 616

Query: 1425 LQGGFIWDWVDQGLLKEGSDGCKHWAYGGDFGDTPNDLNFCLNGLIWPDRTPHPALNEVK 1604
            LQGGFIWDWVDQ L+K   DG KHWAYGG+FGD PNDLNFCLNGL +PDRTPHP L+EVK
Sbjct: 617  LQGGFIWDWVDQALVKVYEDGTKHWAYGGEFGDVPNDLNFCLNGLTFPDRTPHPVLHEVK 676

Query: 1605 YVYQPIKVLSKDGALKILNSHFFDTTKELEFSWAVHGDXXXXXXXXXXXXVIKPQSSYDI 1784
            Y+YQPIKV   +G L+I N+HFF TT+ LE SW +  +             IKPQSSY +
Sbjct: 677  YLYQPIKVALNEGKLEIKNTHFFQTTEGLESSWYISANGYNLGSGTLDLAPIKPQSSYAV 736

Query: 1785 EFVSAPWYSLWASSPTTEVFLTITVKLSNSTRWAVAGHILASTQLQLPTKQECGPHVIKA 1964
            ++ S PWYSLWASS   E+FLT+T KL +STRW  AGHI++S Q+QLP ++   PH I  
Sbjct: 737  DWESGPWYSLWASSSEEELFLTLTFKLLDSTRWVEAGHIVSSAQVQLPARRSILPHAIDI 796

Query: 1965 TDGPVLGECLGDTIRVSKQDSWEIKINSKTGTIESWKVEGVSVMNKGIFPCFWRAPTD 2138
            + G ++ E LGDTI V +QD W++ +N+KTG +ESWKV+GV ++ KGI PCFWRAP D
Sbjct: 797  SSGTLVAETLGDTIIVKQQDVWDLTLNTKTGLVESWKVKGVHILKKGILPCFWRAPID 854


>ref|XP_006403576.1| hypothetical protein EUTSA_v10010080mg [Eutrema salsugineum]
            gi|557104695|gb|ESQ45029.1| hypothetical protein
            EUTSA_v10010080mg [Eutrema salsugineum]
          Length = 1107

 Score = 1112 bits (2875), Expect = 0.0
 Identities = 513/716 (71%), Positives = 596/716 (83%), Gaps = 4/716 (0%)
 Frame = +3

Query: 3    FDRPIYTNVVYPFPLDPPNVPVENPTGCYRTYFEIPEEWKNRRIFLHFEAVDSAFYAWVN 182
            FDRPIYTN+VYPFP DPP+VP +NPTGCYRTYF+IP+EWK+RRI LHFEAVDSAF+AW+N
Sbjct: 133  FDRPIYTNIVYPFPNDPPHVPEDNPTGCYRTYFQIPKEWKDRRILLHFEAVDSAFFAWIN 192

Query: 183  GIPIGYSQDSRLPAEFEISDFCHPRCSDKKNLLAVQVLRWSDGSYLEDQDHWWLSGIHRD 362
            G P+GYSQDSRLPAEFEISD+C+P  S K+N+LAVQV RWSDGSYLEDQDHWWLSG+HRD
Sbjct: 193  GKPVGYSQDSRLPAEFEISDYCYPWDSGKQNVLAVQVFRWSDGSYLEDQDHWWLSGLHRD 252

Query: 363  VLLLSKPQVFIADHFFKSSLSEDFSSADIQVEVKI----EPVSDCLLTNFSLEAVIYDTA 530
            VLLL+KP+VFI D+FFKS L++DFS ADIQVEVKI    E   D +L+NF +EA ++DT 
Sbjct: 253  VLLLAKPKVFIDDYFFKSKLADDFSYADIQVEVKIDNMLETSKDLVLSNFIIEAAVFDTK 312

Query: 531  SWYERDGKVDLLASSVAHLKLQQVPNPVLGFHGYILGGKLELPKLWSAEQPNLYTLVIIL 710
            SWY   G    L+  VA LKL   P+  LGFHGY+L GKL+ P LWSAEQPN+Y LVI L
Sbjct: 313  SWYNSGGFSYELSPKVASLKLNPSPSSSLGFHGYLLEGKLDSPNLWSAEQPNVYILVITL 372

Query: 711  KDASGQQVDCESCQVGFRQISKATKEILVNGHPVIIRGVNRHEHHPRLGKTNVESCMVKD 890
            KD SG+ +D ES  VG RQ+SKA K++LVNGHPV+I+GVNRHEHHPR+GKTN+E+CM+KD
Sbjct: 373  KDKSGKLLDSESSIVGVRQVSKAFKQLLVNGHPVMIKGVNRHEHHPRVGKTNIEACMIKD 432

Query: 891  LILMKQNNVNAVRNSHYPQHPRWYELCDLFGFYMIDEANIETHGFDYCGYIKHPTLEPSW 1070
            LI+MK+ N+NAVRNSHYPQHPRWYELCDLFG YMIDEANIETHGFD  G++KHPT EPSW
Sbjct: 433  LIMMKEYNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHLKHPTKEPSW 492

Query: 1071 ASAMLDRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGWIRGRDPSRIVHYEGGGSR 1250
            A+AMLDRV+GMVERDKNHACIISWSLGNEA YGPNHSA+AGWIR +DPSR+VHYEGGGSR
Sbjct: 493  AAAMLDRVVGMVERDKNHACIISWSLGNEANYGPNHSAMAGWIREKDPSRLVHYEGGGSR 552

Query: 1251 TSSTDIICPMYMRVWDIVKIAKDPNETRPVILCEYSHAMGNSNGNIHEYWEAIDNTFGLQ 1430
            T STDI+CPMYMRVWDIVKIA D NE+RP+ILCEYSHAMGNSNGNI EYWEAIDNTFGLQ
Sbjct: 553  TDSTDIVCPMYMRVWDIVKIALDKNESRPLILCEYSHAMGNSNGNIDEYWEAIDNTFGLQ 612

Query: 1431 GGFIWDWVDQGLLKEGSDGCKHWAYGGDFGDTPNDLNFCLNGLIWPDRTPHPALNEVKYV 1610
            GGFIWDWVDQGLLK GSDG KHWAYGGDFGD PNDLNFCLNGLIWPDRTPHPAL+EVK+ 
Sbjct: 613  GGFIWDWVDQGLLKLGSDGIKHWAYGGDFGDQPNDLNFCLNGLIWPDRTPHPALHEVKHC 672

Query: 1611 YQPIKVLSKDGALKILNSHFFDTTKELEFSWAVHGDXXXXXXXXXXXXVIKPQSSYDIEF 1790
            YQPIKV   DG +++ N++FF TT+ELEFSW +HGD            VIKPQ+ YD+E+
Sbjct: 673  YQPIKVSLTDGTMRVANAYFFHTTEELEFSWTIHGDGVELGSGTLSIPVIKPQNIYDMEW 732

Query: 1791 VSAPWYSLWASSPTTEVFLTITVKLSNSTRWAVAGHILASTQLQLPTKQECGPHVIKATD 1970
             S PW+SLW  S T E FLTIT KL N TR   AGH+L+STQ+ LP K++  P  IK TD
Sbjct: 733  KSGPWFSLWNDSNTGESFLTITAKLLNPTRSLQAGHLLSSTQIPLPAKRQIIPQAIKITD 792

Query: 1971 GPVLGECLGDTIRVSKQDSWEIKINSKTGTIESWKVEGVSVMNKGIFPCFWRAPTD 2138
              +  E +GD I++S+QDSWE+ I+ + G IE WK++GV +  + I PCFWRAPTD
Sbjct: 793  AIINCETVGDFIKISQQDSWELMIDVRKGAIEGWKMQGVLLTKEAILPCFWRAPTD 848

