BLASTX nr result

ID: Cocculus22_contig00018639 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus22_contig00018639
         (1916 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002280399.1| PREDICTED: alpha-N-acetylglucosaminidase-lik...   966   0.0  
ref|XP_006476679.1| PREDICTED: alpha-N-acetylglucosaminidase-lik...   958   0.0  
ref|XP_006439738.1| hypothetical protein CICLE_v10018883mg [Citr...   956   0.0  
ref|XP_002511461.1| alpha-n-acetylglucosaminidase, putative [Ric...   947   0.0  
ref|XP_006581937.1| PREDICTED: alpha-N-acetylglucosaminidase-lik...   946   0.0  
ref|XP_006581936.1| PREDICTED: alpha-N-acetylglucosaminidase-lik...   946   0.0  
ref|XP_007036097.1| Alpha-N-acetylglucosaminidase family / NAGLU...   934   0.0  
emb|CBI15090.3| unnamed protein product [Vitis vinifera]              932   0.0  
ref|XP_006345181.1| PREDICTED: alpha-N-acetylglucosaminidase-lik...   931   0.0  
ref|XP_004502129.1| PREDICTED: alpha-N-acetylglucosaminidase-lik...   922   0.0  
ref|XP_007210354.1| hypothetical protein PRUPE_ppa001642mg [Prun...   919   0.0  
ref|XP_004301281.1| PREDICTED: alpha-N-acetylglucosaminidase-lik...   918   0.0  
ref|XP_004138287.1| PREDICTED: alpha-N-acetylglucosaminidase-lik...   913   0.0  
ref|XP_007036096.1| Alpha-N-acetylglucosaminidase family / NAGLU...   907   0.0  
ref|XP_002318632.2| hypothetical protein POPTR_0012s07760g, part...   906   0.0  
ref|XP_006856885.1| hypothetical protein AMTR_s00055p00202230 [A...   906   0.0  
ref|XP_007138123.1| hypothetical protein PHAVU_009G182100g [Phas...   872   0.0  
ref|XP_007138122.1| hypothetical protein PHAVU_009G182100g [Phas...   872   0.0  
dbj|BAK07078.1| predicted protein [Hordeum vulgare subsp. vulgare]    865   0.0  
dbj|BAK03902.1| predicted protein [Hordeum vulgare subsp. vulgare]    863   0.0  

>ref|XP_002280399.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Vitis vinifera]
          Length = 813

 Score =  966 bits (2498), Expect = 0.0
 Identities = 452/577 (78%), Positives = 512/577 (88%)
 Frame = +3

Query: 3    GNLHGWGGPLPESWFVQQLSLQKKILSRMYELGMTPVLPAFSGNVPAALKNRFPSAKITR 182
            GNLHGWGGPLP+SW  QQL LQKKIL+RMYELGMTPVLPAFSGNVPAALK  FPSAKITR
Sbjct: 236  GNLHGWGGPLPQSWLDQQLLLQKKILARMYELGMTPVLPAFSGNVPAALKYIFPSAKITR 295

Query: 183  LGNWFTVGSDPRWCCTYLLDATDPLFIEIGKAFIRQQVKEYGRSGHIYNCDTFDENTPPS 362
            LGNWFTVG +PRWCCTYLLDATDPLFIEIGKAFI+QQ+KEYGR+GHIYNCDTFDENTPP 
Sbjct: 296  LGNWFTVGGNPRWCCTYLLDATDPLFIEIGKAFIQQQLKEYGRTGHIYNCDTFDENTPPV 355

Query: 363  DDSEYISSLGAAIYKAMQSGDDDAVWLMQGWLFTYDPFWKPPQMKALLHSVPIGKLVVLD 542
            DD EYISSLGAAI++ MQSGD +A+WLMQGWLF+YDPFW+PPQMKALLHSVP+G+LVVLD
Sbjct: 356  DDPEYISSLGAAIFRGMQSGDSNAIWLMQGWLFSYDPFWRPPQMKALLHSVPMGRLVVLD 415

Query: 543  LFAEVKPIWISTEQFYGVPYIWCMLYNFAGNIEMYGVLDAIASGPVEASKSENSTMVGVG 722
            LFAEVKPIWI++EQFYGVPYIWCML+NFAGNIEMYG+LDA+ASGPVEA  SENSTMVGVG
Sbjct: 416  LFAEVKPIWITSEQFYGVPYIWCMLHNFAGNIEMYGILDAVASGPVEARTSENSTMVGVG 475

Query: 723  MSMEGIEQNPVVYDLMAEMAFRHEKIDVKMWIDLYPKRRYGRFHPLIQDAWNILYHTIYN 902
            MSMEGIEQNPVVYDLM+EMAF+H K+DVK+WI LY  RRYG+  P IQDAWNILYHT+YN
Sbjct: 476  MSMEGIEQNPVVYDLMSEMAFQHSKVDVKVWIALYSTRRYGKSVPEIQDAWNILYHTVYN 535

Query: 903  CTDGAYDKNRDVIVAFPDIDPSLIPLVEMSTSGKHKLLKDKVSRQASLKDVSDSYDQPHL 1082
            CTDG+YDKNRDVIVAFPDIDPS IP  ++S  G +      VSR+  LK++++S++QPHL
Sbjct: 536  CTDGSYDKNRDVIVAFPDIDPSFIPTPKLSMPGGYHRYGKSVSRRTVLKEITNSFEQPHL 595

Query: 1083 WYSTDEVIHALELFLASGDELSASSTYRYDIVDLTRQALAKYANQVFLKIIEVYQLKNVQ 1262
            WYST EV  AL LF+ASG +L  S+TYRYD+VDLTRQALAKYANQ+FL++IE YQL +V+
Sbjct: 596  WYSTSEVKDALGLFIASGGQLLGSNTYRYDLVDLTRQALAKYANQLFLEVIEAYQLNDVR 655

Query: 1263 EVTFYSQHFLDLVKDMDTLLACHDGFLLGPWLESSKQLAQNSEQEKQNEWNARTQITMWF 1442
                +SQ FL+LV+DMDTLLACHDGFLLGPWLES+KQLAQ+ +QE Q EWNARTQITMWF
Sbjct: 656  GAACHSQKFLELVEDMDTLLACHDGFLLGPWLESAKQLAQDEQQEIQFEWNARTQITMWF 715

Query: 1443 DNTEEEASLLRDYGNKYWSGLLSDYYCPRAAIYFKYLIESLEKGQRFPLKDWRRDWIKLT 1622
            DNTE+EASLLRDYGNKYWSGLL DYY PRAAIYFKYL+ESLE G  F LKDWRR+WIKLT
Sbjct: 716  DNTEDEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYLLESLETGNEFALKDWRREWIKLT 775

Query: 1623 NDWQKGRNIFPVKSEGDALNTSRWLYAKYLWNPGSYD 1733
            NDWQ  RN +PV+S G+A++TSR LY KYL +P  YD
Sbjct: 776  NDWQNSRNAYPVRSSGNAIDTSRRLYNKYLQDPEIYD 812


>ref|XP_006476679.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Citrus sinensis]
          Length = 814

 Score =  958 bits (2477), Expect = 0.0
 Identities = 452/577 (78%), Positives = 505/577 (87%)
 Frame = +3

Query: 6    NLHGWGGPLPESWFVQQLSLQKKILSRMYELGMTPVLPAFSGNVPAALKNRFPSAKITRL 185
            NLHGWGGPLP+SW  QQL LQKKIL R+YELGM PVLPAFSGNVPAAL+N FPSAKIT+L
Sbjct: 242  NLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQL 301

Query: 186  GNWFTVGSDPRWCCTYLLDATDPLFIEIGKAFIRQQVKEYGRSGHIYNCDTFDENTPPSD 365
            GNWF+V SDPRWCCTYLLDATDPLFIEIG+AFI QQ+KEYGR+ HIYNCDTFDENTPP D
Sbjct: 302  GNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVD 361

Query: 366  DSEYISSLGAAIYKAMQSGDDDAVWLMQGWLFTYDPFWKPPQMKALLHSVPIGKLVVLDL 545
              EYISSLGAAIY  MQSGD DAVWLMQGWLF+YDPFW+PPQMKALL+SVP+GKLVVLDL
Sbjct: 362  SPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDL 421

Query: 546  FAEVKPIWISTEQFYGVPYIWCMLYNFAGNIEMYGVLDAIASGPVEASKSENSTMVGVGM 725
            FAEVKPIW +++QFYGVPYIWCML+NFAGNIEMYG+LD+IA GPVEA  SEN+TMVGVGM
Sbjct: 422  FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGM 481

Query: 726  SMEGIEQNPVVYDLMAEMAFRHEKIDVKMWIDLYPKRRYGRFHPLIQDAWNILYHTIYNC 905
            SMEGIEQNPVVYDLM+EMAF+HEK+DVK WI+ Y  RRYGR  P IQDAWN+LYHT+YNC
Sbjct: 482  SMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNC 541

Query: 906  TDGAYDKNRDVIVAFPDIDPSLIPLVEMSTSGKHKLLKDKVSRQASLKDVSDSYDQPHLW 1085
            TDGA DKNRDVIVAFPD+DPS+I +    T GK++     VS++A LK  + SYD PHLW
Sbjct: 542  TDGATDKNRDVIVAFPDVDPSIISV----TEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 597

Query: 1086 YSTDEVIHALELFLASGDELSASSTYRYDIVDLTRQALAKYANQVFLKIIEVYQLKNVQE 1265
            YST EVI ALELF+ASG+ELSAS+TYRYD++DLTRQALAKYAN++FL IIE YQL +   
Sbjct: 598  YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 657

Query: 1266 VTFYSQHFLDLVKDMDTLLACHDGFLLGPWLESSKQLAQNSEQEKQNEWNARTQITMWFD 1445
            V   S+ FL+LV+DMD+LLACHDGFLLGPWLES+KQLAQN EQEKQ EWNARTQITMWFD
Sbjct: 658  VFQLSRRFLELVEDMDSLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 717

Query: 1446 NTEEEASLLRDYGNKYWSGLLSDYYCPRAAIYFKYLIESLEKGQRFPLKDWRRDWIKLTN 1625
            NTEEEASLLRDYGNKYWSGLL DYY PRAAIYFKY+IESLE G  F LKDWRR+WIKLTN
Sbjct: 718  NTEEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 777

Query: 1626 DWQKGRNIFPVKSEGDALNTSRWLYAKYLWNPGSYDH 1736
            DWQ GRN++PV+S GDAL TS+WLY KYL     +DH
Sbjct: 778  DWQNGRNVYPVESNGDALITSQWLYNKYLQGTSVFDH 814


>ref|XP_006439738.1| hypothetical protein CICLE_v10018883mg [Citrus clementina]
            gi|557542000|gb|ESR52978.1| hypothetical protein
            CICLE_v10018883mg [Citrus clementina]
          Length = 814

 Score =  956 bits (2471), Expect = 0.0
 Identities = 451/577 (78%), Positives = 503/577 (87%)
 Frame = +3

Query: 6    NLHGWGGPLPESWFVQQLSLQKKILSRMYELGMTPVLPAFSGNVPAALKNRFPSAKITRL 185
            NLHGWGGPLP+SW  QQL LQKKIL RMYELGM PVLPAFSGNVPAAL+N FPSAKIT+L
Sbjct: 242  NLHGWGGPLPQSWLDQQLVLQKKILVRMYELGMNPVLPAFSGNVPAALQNVFPSAKITQL 301

Query: 186  GNWFTVGSDPRWCCTYLLDATDPLFIEIGKAFIRQQVKEYGRSGHIYNCDTFDENTPPSD 365
            GNWF+V SDPRWCCTYLLDATDPLFIEIG+AFI QQ+KEYGR+ HIYNCDTFDENTPP D
Sbjct: 302  GNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVD 361

Query: 366  DSEYISSLGAAIYKAMQSGDDDAVWLMQGWLFTYDPFWKPPQMKALLHSVPIGKLVVLDL 545
              EYISSLGAAIY  MQSGD DAVWLMQGWLF+YDPFW+PPQMKALLHSVP+GKLVVLDL
Sbjct: 362  SPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLHSVPLGKLVVLDL 421

Query: 546  FAEVKPIWISTEQFYGVPYIWCMLYNFAGNIEMYGVLDAIASGPVEASKSENSTMVGVGM 725
            FAEVKPIW +++QFYGVPYIWCML+NFAGNIEMYG+LD+IA GPVEA  SEN+TMVGVGM
Sbjct: 422  FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGM 481

Query: 726  SMEGIEQNPVVYDLMAEMAFRHEKIDVKMWIDLYPKRRYGRFHPLIQDAWNILYHTIYNC 905
            SMEGIEQNPVVYDLM+EMAF+HE +DVK WI+ Y  RRYGR  P IQDAWN+LYHT+YNC
Sbjct: 482  SMEGIEQNPVVYDLMSEMAFQHENVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNC 541

Query: 906  TDGAYDKNRDVIVAFPDIDPSLIPLVEMSTSGKHKLLKDKVSRQASLKDVSDSYDQPHLW 1085
            TDGA DKNRDVIVAFPD+DPS+I +    T GK++     VS++A LK  + SYD PHLW
Sbjct: 542  TDGATDKNRDVIVAFPDVDPSIISV----TEGKYQNYGKPVSKKAVLKSETSSYDHPHLW 597

Query: 1086 YSTDEVIHALELFLASGDELSASSTYRYDIVDLTRQALAKYANQVFLKIIEVYQLKNVQE 1265
            YST EVI ALELF+ASG+ELSAS+TYRYD++DLTRQALAKYAN++FL I+E YQL +   
Sbjct: 598  YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNILEAYQLNDAHG 657

Query: 1266 VTFYSQHFLDLVKDMDTLLACHDGFLLGPWLESSKQLAQNSEQEKQNEWNARTQITMWFD 1445
            V   S+ FL+LV+DMD LLACHDGFLLGPWLES+KQLAQN EQEKQ EWNARTQITMWFD
Sbjct: 658  VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 717

Query: 1446 NTEEEASLLRDYGNKYWSGLLSDYYCPRAAIYFKYLIESLEKGQRFPLKDWRRDWIKLTN 1625
            NT+EEASLLRDYGNKYWSGLL DYY PRAAIYFKY+IESLE G  F LKDWRR+WIKLTN
Sbjct: 718  NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 777

Query: 1626 DWQKGRNIFPVKSEGDALNTSRWLYAKYLWNPGSYDH 1736
             WQ GRN++PV+S GDAL TS+WLY KYL   G +DH
Sbjct: 778  YWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVFDH 814


>ref|XP_002511461.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
            gi|223550576|gb|EEF52063.1|
            alpha-n-acetylglucosaminidase, putative [Ricinus
            communis]
          Length = 809

 Score =  947 bits (2448), Expect = 0.0
 Identities = 448/579 (77%), Positives = 504/579 (87%), Gaps = 1/579 (0%)
 Frame = +3

Query: 3    GNLHGWGGPLPESWFVQQLSLQKKILSRMYELGMTPVLPAFSGNVPAALKNRFPSAKITR 182
            GNLH WGG LP+SWF QQL LQKKIL+RMYELGM PVLPAFSGNVPAAL+N FPSAKI R
Sbjct: 235  GNLHRWGGSLPQSWFFQQLILQKKILARMYELGMNPVLPAFSGNVPAALRNIFPSAKIAR 294

Query: 183  LGNWFTVGSDPRWCCTYLLDATDPLFIEIGKAFIRQQVKEYGRSGHIYNCDTFDENTPPS 362
            LGNWF+V SD RWCCTYLLDATDPLFIEIG+AFI QQ++EYG + HIYNCDTFDENTPP 
Sbjct: 295  LGNWFSVKSDLRWCCTYLLDATDPLFIEIGRAFIEQQLEEYGSTSHIYNCDTFDENTPPV 354

Query: 363  DDSEYISSLGAAIYKAMQSGDDDAVWLMQGWLFTYDPFWKPPQMKALLHSVPIGKLVVLD 542
            DD +YIS+LGAA++K MQSGD+DAVWLMQGWLF+YDPFW+PPQMKALLHSVP+G+LVVLD
Sbjct: 355  DDPKYISALGAAVFKGMQSGDNDAVWLMQGWLFSYDPFWRPPQMKALLHSVPVGRLVVLD 414

Query: 543  LFAEVKPIWISTEQFYGVPYIWCMLYNFAGNIEMYGVLDAIASGPVEASKSENSTMVGVG 722
            LFAEVKPIW S+ QFYGVPYIWCML+NFAGN+EMYG+LD+IASGPVEA  SENSTMVGVG
Sbjct: 415  LFAEVKPIWTSSYQFYGVPYIWCMLHNFAGNVEMYGILDSIASGPVEARTSENSTMVGVG 474

Query: 723  MSMEGIEQNPVVYDLMAEMAFRHEKIDVKMWIDLYPKRRYGRFHPLIQDAWNILYHTIYN 902
            MSMEGIEQNPVVYDLM+EMAF+H+K+DVK WI+LY  RRYGR  P IQDAW+ILYHT+YN
Sbjct: 475  MSMEGIEQNPVVYDLMSEMAFQHKKVDVKAWINLYSTRRYGRSVPSIQDAWDILYHTVYN 534

Query: 903  CTDGAYDKNRDVIVAFPDIDPSLIPLVEMSTSGKHKLLKDKVSRQASLKDVSDSYDQPHL 1082
            CTDGAYDKNRDVIVAFPD++P    +    +  +H L    VSR+A LK+ SDSYD PHL
Sbjct: 535  CTDGAYDKNRDVIVAFPDVNPFYFSV----SQKRHHLNGKPVSRRAVLKENSDSYDHPHL 590

Query: 1083 WYSTDEVIHALELFLASGDELSASSTYRYDIVDLTRQALAKYANQVFLKIIEVYQLKNVQ 1262
            WYST EV+HALELF+ SG+ELS SSTY YD+VDLTRQALAKY N++FLKIIE YQ  +  
Sbjct: 591  WYSTSEVLHALELFITSGEELSGSSTYSYDLVDLTRQALAKYGNELFLKIIESYQANDGN 650

Query: 1263 EVTFYSQHFLDLVKDMDTLLACHDGFLLGPWLESSKQLAQNSEQEKQNEWNARTQITMWF 1442
             V   SQ FLDLV+DMDTLL CH+GFLLGPWLES+KQLAQ+ EQEKQ EWNARTQITMWF
Sbjct: 651  GVASRSQKFLDLVEDMDTLLGCHEGFLLGPWLESAKQLAQDQEQEKQFEWNARTQITMWF 710

Query: 1443 DNTEEEASLLRDYGNKYWSGLLSDYYCPRAAIYFKYLIESLEKGQRFPLKDWRRDWIKLT 1622
            DNTE+EASLL DYGNKYWSGLL DYY PRAAIYFKYLI+SLE G+ FPLKDWRR+WIKLT
Sbjct: 711  DNTEDEASLLHDYGNKYWSGLLQDYYGPRAAIYFKYLIKSLENGKVFPLKDWRREWIKLT 770

Query: 1623 NDWQKGRNIFPVKSEGDALNTSRWLYAKYLWNPG-SYDH 1736
            N+WQ+ RN FPVKS G+AL  S+WLY KYL NP  +YDH
Sbjct: 771  NEWQRSRNKFPVKSNGNALIISKWLYDKYLRNPDTTYDH 809


>ref|XP_006581937.1| PREDICTED: alpha-N-acetylglucosaminidase-like isoform X2 [Glycine
            max]
          Length = 813

 Score =  946 bits (2446), Expect = 0.0
 Identities = 438/570 (76%), Positives = 502/570 (88%)
 Frame = +3

Query: 3    GNLHGWGGPLPESWFVQQLSLQKKILSRMYELGMTPVLPAFSGNVPAALKNRFPSAKITR 182
            GNLHGWGGPLP+SWF QQL LQKKIL+RM+ELGMTPVLPAFSGNVPAALK+ FPSAKITR
Sbjct: 235  GNLHGWGGPLPQSWFDQQLILQKKILARMFELGMTPVLPAFSGNVPAALKHIFPSAKITR 294

Query: 183  LGNWFTVGSDPRWCCTYLLDATDPLFIEIGKAFIRQQVKEYGRSGHIYNCDTFDENTPPS 362
            LGNWF+V +D +WCCTYLLDATD LF+EIGKAFI +Q++EYGR+ HIYNCDTFDENTPP 
Sbjct: 295  LGNWFSVKNDLKWCCTYLLDATDSLFVEIGKAFIEKQLQEYGRTSHIYNCDTFDENTPPV 354

Query: 363  DDSEYISSLGAAIYKAMQSGDDDAVWLMQGWLFTYDPFWKPPQMKALLHSVPIGKLVVLD 542
            DD EYISSLGAA +K MQSGDDDAVWLMQGWLF+YDPFW+PPQMKALLHSVP+GKLVVLD
Sbjct: 355  DDPEYISSLGAATFKGMQSGDDDAVWLMQGWLFSYDPFWRPPQMKALLHSVPVGKLVVLD 414

Query: 543  LFAEVKPIWISTEQFYGVPYIWCMLYNFAGNIEMYGVLDAIASGPVEASKSENSTMVGVG 722
            LFAEVKPIW+++EQFYGVPYIWCML+NFAGNIEMYG+LDAIASGP++A  S NSTMVGVG
Sbjct: 415  LFAEVKPIWVTSEQFYGVPYIWCMLHNFAGNIEMYGILDAIASGPIDARTSNNSTMVGVG 474

Query: 723  MSMEGIEQNPVVYDLMAEMAFRHEKIDVKMWIDLYPKRRYGRFHPLIQDAWNILYHTIYN 902
            MSMEGIEQNP+VYDLM+EMAF+H+K+DVK W+D+Y  RRYG+  PLIQ+ WN+LYHTIYN
Sbjct: 475  MSMEGIEQNPIVYDLMSEMAFQHKKVDVKAWVDMYSTRRYGQTLPLIQEGWNVLYHTIYN 534

Query: 903  CTDGAYDKNRDVIVAFPDIDPSLIPLVEMSTSGKHKLLKDKVSRQASLKDVSDSYDQPHL 1082
            CTDGAYDKNRDVIVAFPD+DPSLI     S   +     DK      +K+++DS+D+PHL
Sbjct: 535  CTDGAYDKNRDVIVAFPDVDPSLI-----SVQHEQSHHNDKPYSGTIIKEITDSFDRPHL 589

Query: 1083 WYSTDEVIHALELFLASGDELSASSTYRYDIVDLTRQALAKYANQVFLKIIEVYQLKNVQ 1262
            WY T EVI+ALELF+ SGDELS  +TYRYD+VDLTRQ LAKYAN++F K+IE YQ  ++ 
Sbjct: 590  WYPTSEVIYALELFITSGDELSRCNTYRYDLVDLTRQVLAKYANELFFKVIEAYQSHDIH 649

Query: 1263 EVTFYSQHFLDLVKDMDTLLACHDGFLLGPWLESSKQLAQNSEQEKQNEWNARTQITMWF 1442
             +T  SQ FLDLV+D+DTLLACHDGFLLGPWLES+KQLA N EQE+Q EWNARTQITMWF
Sbjct: 650  GMTLLSQRFLDLVEDLDTLLACHDGFLLGPWLESAKQLALNEEQERQFEWNARTQITMWF 709

Query: 1443 DNTEEEASLLRDYGNKYWSGLLSDYYCPRAAIYFKYLIESLEKGQRFPLKDWRRDWIKLT 1622
            DN++EEASLLRDYGNKYW+GLL DYY PRAAIYFKYL ESLE G+ F L+ WRR+WIKLT
Sbjct: 710  DNSDEEASLLRDYGNKYWNGLLHDYYGPRAAIYFKYLRESLESGEDFKLRGWRREWIKLT 769

Query: 1623 NDWQKGRNIFPVKSEGDALNTSRWLYAKYL 1712
            N+WQK RNIFPV+S GDALNTSRWL+ KYL
Sbjct: 770  NEWQKRRNIFPVESSGDALNTSRWLFNKYL 799


>ref|XP_006581936.1| PREDICTED: alpha-N-acetylglucosaminidase-like isoform X1 [Glycine
            max]
          Length = 814

 Score =  946 bits (2446), Expect = 0.0
 Identities = 438/570 (76%), Positives = 502/570 (88%)
 Frame = +3

Query: 3    GNLHGWGGPLPESWFVQQLSLQKKILSRMYELGMTPVLPAFSGNVPAALKNRFPSAKITR 182
            GNLHGWGGPLP+SWF QQL LQKKIL+RM+ELGMTPVLPAFSGNVPAALK+ FPSAKITR
Sbjct: 236  GNLHGWGGPLPQSWFDQQLILQKKILARMFELGMTPVLPAFSGNVPAALKHIFPSAKITR 295

Query: 183  LGNWFTVGSDPRWCCTYLLDATDPLFIEIGKAFIRQQVKEYGRSGHIYNCDTFDENTPPS 362
            LGNWF+V +D +WCCTYLLDATD LF+EIGKAFI +Q++EYGR+ HIYNCDTFDENTPP 
Sbjct: 296  LGNWFSVKNDLKWCCTYLLDATDSLFVEIGKAFIEKQLQEYGRTSHIYNCDTFDENTPPV 355

Query: 363  DDSEYISSLGAAIYKAMQSGDDDAVWLMQGWLFTYDPFWKPPQMKALLHSVPIGKLVVLD 542
            DD EYISSLGAA +K MQSGDDDAVWLMQGWLF+YDPFW+PPQMKALLHSVP+GKLVVLD
Sbjct: 356  DDPEYISSLGAATFKGMQSGDDDAVWLMQGWLFSYDPFWRPPQMKALLHSVPVGKLVVLD 415

Query: 543  LFAEVKPIWISTEQFYGVPYIWCMLYNFAGNIEMYGVLDAIASGPVEASKSENSTMVGVG 722
            LFAEVKPIW+++EQFYGVPYIWCML+NFAGNIEMYG+LDAIASGP++A  S NSTMVGVG
Sbjct: 416  LFAEVKPIWVTSEQFYGVPYIWCMLHNFAGNIEMYGILDAIASGPIDARTSNNSTMVGVG 475

Query: 723  MSMEGIEQNPVVYDLMAEMAFRHEKIDVKMWIDLYPKRRYGRFHPLIQDAWNILYHTIYN 902
            MSMEGIEQNP+VYDLM+EMAF+H+K+DVK W+D+Y  RRYG+  PLIQ+ WN+LYHTIYN
Sbjct: 476  MSMEGIEQNPIVYDLMSEMAFQHKKVDVKAWVDMYSTRRYGQTLPLIQEGWNVLYHTIYN 535

Query: 903  CTDGAYDKNRDVIVAFPDIDPSLIPLVEMSTSGKHKLLKDKVSRQASLKDVSDSYDQPHL 1082
            CTDGAYDKNRDVIVAFPD+DPSLI     S   +     DK      +K+++DS+D+PHL
Sbjct: 536  CTDGAYDKNRDVIVAFPDVDPSLI-----SVQHEQSHHNDKPYSGTIIKEITDSFDRPHL 590

Query: 1083 WYSTDEVIHALELFLASGDELSASSTYRYDIVDLTRQALAKYANQVFLKIIEVYQLKNVQ 1262
            WY T EVI+ALELF+ SGDELS  +TYRYD+VDLTRQ LAKYAN++F K+IE YQ  ++ 
Sbjct: 591  WYPTSEVIYALELFITSGDELSRCNTYRYDLVDLTRQVLAKYANELFFKVIEAYQSHDIH 650

Query: 1263 EVTFYSQHFLDLVKDMDTLLACHDGFLLGPWLESSKQLAQNSEQEKQNEWNARTQITMWF 1442
             +T  SQ FLDLV+D+DTLLACHDGFLLGPWLES+KQLA N EQE+Q EWNARTQITMWF
Sbjct: 651  GMTLLSQRFLDLVEDLDTLLACHDGFLLGPWLESAKQLALNEEQERQFEWNARTQITMWF 710

Query: 1443 DNTEEEASLLRDYGNKYWSGLLSDYYCPRAAIYFKYLIESLEKGQRFPLKDWRRDWIKLT 1622
            DN++EEASLLRDYGNKYW+GLL DYY PRAAIYFKYL ESLE G+ F L+ WRR+WIKLT
Sbjct: 711  DNSDEEASLLRDYGNKYWNGLLHDYYGPRAAIYFKYLRESLESGEDFKLRGWRREWIKLT 770

Query: 1623 NDWQKGRNIFPVKSEGDALNTSRWLYAKYL 1712
            N+WQK RNIFPV+S GDALNTSRWL+ KYL
Sbjct: 771  NEWQKRRNIFPVESSGDALNTSRWLFNKYL 800


>ref|XP_007036097.1| Alpha-N-acetylglucosaminidase family / NAGLU family isoform 2
            [Theobroma cacao] gi|508773342|gb|EOY20598.1|
            Alpha-N-acetylglucosaminidase family / NAGLU family
            isoform 2 [Theobroma cacao]
          Length = 572

 Score =  934 bits (2415), Expect = 0.0
 Identities = 438/575 (76%), Positives = 501/575 (87%)
 Frame = +3

Query: 3    GNLHGWGGPLPESWFVQQLSLQKKILSRMYELGMTPVLPAFSGNVPAALKNRFPSAKITR 182
            GNLHGWGGPLP+SWF  QL+LQKKILSRMYELGMTPVLPAFSGNVPAALKN FPSAKITR
Sbjct: 2    GNLHGWGGPLPQSWFNGQLTLQKKILSRMYELGMTPVLPAFSGNVPAALKNIFPSAKITR 61

Query: 183  LGNWFTVGSDPRWCCTYLLDATDPLFIEIGKAFIRQQVKEYGRSGHIYNCDTFDENTPPS 362
            LGNWF+V  +P+WCCTYLLDATDPLFIEIGKAFI++Q+KEYG++ HIYNCDTFDENTPP 
Sbjct: 62   LGNWFSVKGNPKWCCTYLLDATDPLFIEIGKAFIKEQLKEYGKTSHIYNCDTFDENTPPM 121

Query: 363  DDSEYISSLGAAIYKAMQSGDDDAVWLMQGWLFTYDPFWKPPQMKALLHSVPIGKLVVLD 542
            DD EYI+SLG AI+  MQSGD +A+WLMQGWLF+YDPFW+PPQMKALLHSVP+GKLVVLD
Sbjct: 122  DDPEYITSLGVAIFSGMQSGDVNAMWLMQGWLFSYDPFWRPPQMKALLHSVPLGKLVVLD 181

Query: 543  LFAEVKPIWISTEQFYGVPYIWCMLYNFAGNIEMYGVLDAIASGPVEASKSENSTMVGVG 722
            LFAEVKPIWI++EQFYGVPYIWCML+NFAGNIEMYG LDAIASGP+EA  SENSTMVG+G
Sbjct: 182  LFAEVKPIWITSEQFYGVPYIWCMLHNFAGNIEMYGYLDAIASGPIEALTSENSTMVGIG 241

Query: 723  MSMEGIEQNPVVYDLMAEMAFRHEKIDVKMWIDLYPKRRYGRFHPLIQDAWNILYHTIYN 902
            MSMEGIEQNP+VYDLM+EMAF+H+K+DV+ WI+LY  RRYG+  P I DAW+ILY T+YN
Sbjct: 242  MSMEGIEQNPIVYDLMSEMAFQHKKVDVEAWIELYIARRYGQSIPSISDAWSILYRTLYN 301

Query: 903  CTDGAYDKNRDVIVAFPDIDPSLIPLVEMSTSGKHKLLKDKVSRQASLKDVSDSYDQPHL 1082
            CTDGAYDKNRDVIVAFPD+ PS I L       ++       SR+A L + +D+YDQPHL
Sbjct: 302  CTDGAYDKNRDVIVAFPDVSPSFISL----PRERYHHYGKPTSRRAVLSEKTDAYDQPHL 357

Query: 1083 WYSTDEVIHALELFLASGDELSASSTYRYDIVDLTRQALAKYANQVFLKIIEVYQLKNVQ 1262
            WYST EVI ALELF+ SGD LSAS+TYRYD+VDLTRQALAKYAN++FL+II+ Y+LK+V 
Sbjct: 358  WYSTSEVIRALELFITSGDALSASNTYRYDLVDLTRQALAKYANELFLEIIDAYELKDVN 417

Query: 1263 EVTFYSQHFLDLVKDMDTLLACHDGFLLGPWLESSKQLAQNSEQEKQNEWNARTQITMWF 1442
             VT  SQ FL+LV+DMDTLLACHDGFLLGPWLES+KQLAQN E+EKQ EWNARTQITMWF
Sbjct: 418  RVTTLSQKFLELVEDMDTLLACHDGFLLGPWLESAKQLAQNKEEEKQFEWNARTQITMWF 477

Query: 1443 DNTEEEASLLRDYGNKYWSGLLSDYYCPRAAIYFKYLIESLEKGQRFPLKDWRRDWIKLT 1622
            DNT+EEASLLRDYGNKYWSG++ DYY PRA IYFK LIESLE G+ F +K WR +WIKLT
Sbjct: 478  DNTKEEASLLRDYGNKYWSGVVGDYYGPRATIYFKVLIESLENGEDFKVKKWRGEWIKLT 537

Query: 1623 NDWQKGRNIFPVKSEGDALNTSRWLYAKYLWNPGS 1727
            NDWQ  R ++PV+S G+AL  SRWLY KYL +  S
Sbjct: 538  NDWQTSRKVYPVESNGNALTISRWLYNKYLRSESS 572


>emb|CBI15090.3| unnamed protein product [Vitis vinifera]
          Length = 846

 Score =  932 bits (2410), Expect = 0.0
 Identities = 445/610 (72%), Positives = 508/610 (83%), Gaps = 33/610 (5%)
 Frame = +3

Query: 3    GNLHGWGGPLPESWFVQQLSLQKKILSRMYELGMTPVLPAFSGNVPAALKNRFPSAKITR 182
            GNLHGWGGPLP+SW  QQL LQKKIL+RMYELGMTPVLPAFSGNVPAALK  FPSAKITR
Sbjct: 236  GNLHGWGGPLPQSWLDQQLLLQKKILARMYELGMTPVLPAFSGNVPAALKYIFPSAKITR 295

Query: 183  LGNWFTVGSDPRWCCTYLLDATDPLFIEIGKAFIRQQVKEYGRSGHIYNCDTFDENTPPS 362
            LGNWFTVG +PRWCCTYLLDATDPLFIEIGKAFI+QQ+KEYGR+GHIYNCDTFDENTPP 
Sbjct: 296  LGNWFTVGGNPRWCCTYLLDATDPLFIEIGKAFIQQQLKEYGRTGHIYNCDTFDENTPPV 355

Query: 363  DDSEYISSLGAAIYKAMQSGDDDAVWLMQGWLFTYDPFWKPPQMKALLHSVPIGKLVVLD 542
            DD EYISSLGAAI++ MQSGD +A+WLMQGWLF+YDPFW+PPQMKALLHSVP+G+LVVLD
Sbjct: 356  DDPEYISSLGAAIFRGMQSGDSNAIWLMQGWLFSYDPFWRPPQMKALLHSVPMGRLVVLD 415

Query: 543  LFAEVKPIWISTEQFYGVPYIW--------------------------------CMLYNF 626
            LFAEVKPIWI++EQFYGVPYIW                                CML+NF
Sbjct: 416  LFAEVKPIWITSEQFYGVPYIWKVTKSGRQQSLKFTNEKCCSFFRSHSPDSEVLCMLHNF 475

Query: 627  AGNIEMYGVLDAIASGPVEA-SKSENSTMVGVGMSMEGIEQNPVVYDLMAEMAFRHEKID 803
            AGNIEMYG+LDA+ASGP+   +K   S +VGVGMSMEGIEQNPVVYDLM+EMAF+H K+D
Sbjct: 476  AGNIEMYGILDAVASGPILLRAKYAESAVVGVGMSMEGIEQNPVVYDLMSEMAFQHSKVD 535

Query: 804  VKMWIDLYPKRRYGRFHPLIQDAWNILYHTIYNCTDGAYDKNRDVIVAFPDIDPSLIPLV 983
            VK+WI LY  RRYG+  P IQDAWNILYHT+YNCTDG+YDKNRDVIVAFPDIDPS IP  
Sbjct: 536  VKVWIALYSTRRYGKSVPEIQDAWNILYHTVYNCTDGSYDKNRDVIVAFPDIDPSFIPTP 595

Query: 984  EMSTSGKHKLLKDKVSRQASLKDVSDSYDQPHLWYSTDEVIHALELFLASGDELSASSTY 1163
            ++S  G +      VSR+  LK++++S++QPHLWYST EV  AL LF+ASG +L  S+TY
Sbjct: 596  KLSMPGGYHRYGKSVSRRTVLKEITNSFEQPHLWYSTSEVKDALGLFIASGGQLLGSNTY 655

Query: 1164 RYDIVDLTRQALAKYANQVFLKIIEVYQLKNVQEVTFYSQHFLDLVKDMDTLLACHDGFL 1343
            RYD+VDLTRQALAKYANQ+FL++IE YQL +V+    +SQ FL+LV+DMDTLLACHDGFL
Sbjct: 656  RYDLVDLTRQALAKYANQLFLEVIEAYQLNDVRGAACHSQKFLELVEDMDTLLACHDGFL 715

Query: 1344 LGPWLESSKQLAQNSEQEKQNEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLSDYYC 1523
            LGPWLES+KQLAQ+ +QE Q EWNARTQITMWFDNTE+EASLLRDYGNKYWSGLL DYY 
Sbjct: 716  LGPWLESAKQLAQDEQQEIQFEWNARTQITMWFDNTEDEASLLRDYGNKYWSGLLRDYYG 775

Query: 1524 PRAAIYFKYLIESLEKGQRFPLKDWRRDWIKLTNDWQKGRNIFPVKSEGDALNTSRWLYA 1703
            PRAAIYFKYL+ESLE G  F LKDWRR+WIKLTNDWQ  RN +PV+S G+A++TSR LY 
Sbjct: 776  PRAAIYFKYLLESLETGNEFALKDWRREWIKLTNDWQNSRNAYPVRSSGNAIDTSRRLYN 835

Query: 1704 KYLWNPGSYD 1733
            KYL +P  YD
Sbjct: 836  KYLQDPEIYD 845


>ref|XP_006345181.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Solanum tuberosum]
          Length = 819

 Score =  931 bits (2406), Expect = 0.0
 Identities = 433/582 (74%), Positives = 499/582 (85%), Gaps = 4/582 (0%)
 Frame = +3

Query: 3    GNLHGWGGPLPESWFVQQLSLQKKILSRMYELGMTPVLPAFSGNVPAALKNRFPSAKITR 182
            GNLH WGGPLP+SW  QQL LQKKIL RMYELGMTPVLPAFSGNVPAALK  FPSAKI+R
Sbjct: 238  GNLHKWGGPLPQSWLDQQLILQKKILGRMYELGMTPVLPAFSGNVPAALKRVFPSAKISR 297

Query: 183  LGNWFTVGSDPRWCCTYLLDATDPLFIEIGKAFIRQQVKEYGRSGHIYNCDTFDENTPPS 362
            LGNWFTV SD RWCCTYLLDATDPLF+EIGK FI QQ+KEYGRS HIYNCDTFDENTPP 
Sbjct: 298  LGNWFTVNSDTRWCCTYLLDATDPLFVEIGKTFIEQQLKEYGRSSHIYNCDTFDENTPPV 357

Query: 363  DDSEYISSLGAAIYKAMQSGDDDAVWLMQGWLFTYDPFWKPPQMKALLHSVPIGKLVVLD 542
            DD +YISSLGA I++ MQS D DAVWLMQGWLFTYDPFW+P QMKALLHSVP+GKL+VLD
Sbjct: 358  DDPDYISSLGATIFRGMQSADSDAVWLMQGWLFTYDPFWRPTQMKALLHSVPLGKLIVLD 417

Query: 543  LFAEVKPIWISTEQFYGVPYIW----CMLYNFAGNIEMYGVLDAIASGPVEASKSENSTM 710
            L+AEVKPIW +++QFYG+PYIW    CML+NFAGN+EMYGVLDA+ SGP+EA  SENSTM
Sbjct: 418  LYAEVKPIWATSKQFYGIPYIWKVTLCMLHNFAGNVEMYGVLDAVGSGPIEACTSENSTM 477

Query: 711  VGVGMSMEGIEQNPVVYDLMAEMAFRHEKIDVKMWIDLYPKRRYGRFHPLIQDAWNILYH 890
            VGVGMSMEGIEQNPV+YDLM+EMAF+H  +DVK WIDLY +RRYGRF   +QDAWNILYH
Sbjct: 478  VGVGMSMEGIEQNPVMYDLMSEMAFQHSPVDVKAWIDLYSRRRYGRFVQPMQDAWNILYH 537

Query: 891  TIYNCTDGAYDKNRDVIVAFPDIDPSLIPLVEMSTSGKHKLLKDKVSRQASLKDVSDSYD 1070
            TIYNCTDGAYDKNRDVIV+FPD+DP+ I  ++   +  H+    +  R+A L++ +DSYD
Sbjct: 538  TIYNCTDGAYDKNRDVIVSFPDVDPNSISTLQTVLNDVHEQYGKRYLRRAILEEPNDSYD 597

Query: 1071 QPHLWYSTDEVIHALELFLASGDELSASSTYRYDIVDLTRQALAKYANQVFLKIIEVYQL 1250
            +PHLWYST EVIHAL+LFL SG++LS SSTYRYD++DLTRQALAKYAN++FL  IE Y+L
Sbjct: 598  KPHLWYSTSEVIHALKLFLESGNQLSDSSTYRYDLIDLTRQALAKYANELFLDAIEAYKL 657

Query: 1251 KNVQEVTFYSQHFLDLVKDMDTLLACHDGFLLGPWLESSKQLAQNSEQEKQNEWNARTQI 1430
             ++  V   S+ FL LV+D+D LL CHDGFLLGPW+ES+K+LAQ+ +QE+Q EWNARTQI
Sbjct: 658  DDLHAVAHLSEKFLGLVEDLDMLLGCHDGFLLGPWIESAKELAQDEDQERQFEWNARTQI 717

Query: 1431 TMWFDNTEEEASLLRDYGNKYWSGLLSDYYCPRAAIYFKYLIESLEKGQRFPLKDWRRDW 1610
            TMWFDNTE EASLLRDYGNKYWSGLL DYY PRAAIYFKYL ESLE+G+ F LK WRR+W
Sbjct: 718  TMWFDNTELEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYLTESLEEGKGFDLKAWRREW 777

Query: 1611 IKLTNDWQKGRNIFPVKSEGDALNTSRWLYAKYLWNPGSYDH 1736
            IKLTN WQ  RN+FPVKS G+ALN S+WL+ KYL + GS+DH
Sbjct: 778  IKLTNSWQSSRNVFPVKSTGNALNVSQWLFEKYLQDLGSHDH 819


>ref|XP_004502129.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cicer arietinum]
          Length = 808

 Score =  922 bits (2384), Expect = 0.0
 Identities = 440/570 (77%), Positives = 490/570 (85%)
 Frame = +3

Query: 3    GNLHGWGGPLPESWFVQQLSLQKKILSRMYELGMTPVLPAFSGNVPAALKNRFPSAKITR 182
            GNLHGWGGPLP+SWF QQL LQKKIL+RMYELGMTPVLPAFSGNVPAALK  FPSAKITR
Sbjct: 244  GNLHGWGGPLPQSWFDQQLILQKKILARMYELGMTPVLPAFSGNVPAALKYIFPSAKITR 303

Query: 183  LGNWFTVGSDPRWCCTYLLDATDPLFIEIGKAFIRQQVKEYGRSGHIYNCDTFDENTPPS 362
            LGNWF+V +D +WCCTYLLDATDPLFIEIG+AF+ QQ++EYGR+ HIYNCDTFDENTPP 
Sbjct: 304  LGNWFSVKNDLKWCCTYLLDATDPLFIEIGRAFVEQQLQEYGRTSHIYNCDTFDENTPPI 363

Query: 363  DDSEYISSLGAAIYKAMQSGDDDAVWLMQGWLFTYDPFWKPPQMKALLHSVPIGKLVVLD 542
            DD EYISSLGAAI+  MQSGD+DAVWLMQGWLF+YDPFW+PPQMKALLHSVP+GKLVVLD
Sbjct: 364  DDPEYISSLGAAIFNGMQSGDNDAVWLMQGWLFSYDPFWRPPQMKALLHSVPVGKLVVLD 423

Query: 543  LFAEVKPIWISTEQFYGVPYIWCMLYNFAGNIEMYGVLDAIASGPVEASKSENSTMVGVG 722
            LFAEVKPIWIS+EQFYGVPYIWCML+NFAGNIEMYG+LDA+ASGP+EA  S NSTMVGVG
Sbjct: 424  LFAEVKPIWISSEQFYGVPYIWCMLHNFAGNIEMYGILDAVASGPIEARISFNSTMVGVG 483

Query: 723  MSMEGIEQNPVVYDLMAEMAFRHEKIDVKMWIDLYPKRRYGRFHPLIQDAWNILYHTIYN 902
            MSMEGIEQNP+VYDLM+EMAF+H+KIDVK+W+DLY  RRYGR  PLIQ+ WN+LYHTIYN
Sbjct: 484  MSMEGIEQNPIVYDLMSEMAFQHKKIDVKVWVDLYSTRRYGRQVPLIQEGWNVLYHTIYN 543

Query: 903  CTDGAYDKNRDVIVAFPDIDPSLIPLVEMSTSGKHKLLKDKVSRQASLKDVSDSYDQPHL 1082
            CTDGAYDKNRDVIVAFPD+DPSL      S   +H  L  K   +A +K+V+DS+DQPHL
Sbjct: 544  CTDGAYDKNRDVIVAFPDVDPSL-----FSLQHEHSRLYGKPYSRAIIKEVTDSFDQPHL 598

Query: 1083 WYSTDEVIHALELFLASGDELSASSTYRYDIVDLTRQALAKYANQVFLKIIEVYQLKNVQ 1262
            WYST EVIHALELF++SGDELS SSTYRYD+VD+TRQ LAKYANQ+F K+IE YQ  +V 
Sbjct: 599  WYSTSEVIHALELFISSGDELSKSSTYRYDLVDVTRQVLAKYANQLFFKVIEAYQSHDVH 658

Query: 1263 EVTFYSQHFLDLVKDMDTLLACHDGFLLGPWLESSKQLAQNSEQEKQNEWNARTQITMWF 1442
             VT  SQ FLDLV+D+D LLACHDGFLLGPWLES+KQ AQN EQ++Q EWNARTQITMWF
Sbjct: 659  GVTLLSQRFLDLVEDLDALLACHDGFLLGPWLESAKQQAQNEEQKRQFEWNARTQITMWF 718

Query: 1443 DNTEEEASLLRDYGNKYWSGLLSDYYCPRAAIYFKYLIESLEKGQRFPLKDWRRDWIKLT 1622
            DNT+EEASLL DYGNKYWSGLL DYY PRAAIYFKYLIE LEKG+ F             
Sbjct: 719  DNTDEEASLLHDYGNKYWSGLLHDYYGPRAAIYFKYLIEKLEKGEDF------------- 765

Query: 1623 NDWQKGRNIFPVKSEGDALNTSRWLYAKYL 1712
                  RNIFPV S GDALNTSRWL+ KYL
Sbjct: 766  ----NRRNIFPVVSRGDALNTSRWLFNKYL 791


>ref|XP_007210354.1| hypothetical protein PRUPE_ppa001642mg [Prunus persica]
            gi|462406089|gb|EMJ11553.1| hypothetical protein
            PRUPE_ppa001642mg [Prunus persica]
          Length = 787

 Score =  919 bits (2374), Expect = 0.0
 Identities = 437/570 (76%), Positives = 488/570 (85%)
 Frame = +3

Query: 3    GNLHGWGGPLPESWFVQQLSLQKKILSRMYELGMTPVLPAFSGNVPAALKNRFPSAKITR 182
            GNLHGWGGPLP+SW  QQL LQKKIL RMYELGMTPVLPAFSGNVPAALK  +PSAKITR
Sbjct: 211  GNLHGWGGPLPQSWLDQQLILQKKILVRMYELGMTPVLPAFSGNVPAALKTIYPSAKITR 270

Query: 183  LGNWFTVGSDPRWCCTYLLDATDPLFIEIGKAFIRQQVKEYGRSGHIYNCDTFDENTPPS 362
            LGNWF+V SDPRW CTYLLDATDPLF+EIG+ FI +Q+KEYGR+ HIYNCDTFDENTPP 
Sbjct: 271  LGNWFSVKSDPRWTCTYLLDATDPLFVEIGRTFIEEQLKEYGRTSHIYNCDTFDENTPPD 330

Query: 363  DDSEYISSLGAAIYKAMQSGDDDAVWLMQGWLFTYDPFWKPPQMKALLHSVPIGKLVVLD 542
            DD EYISSLG AI++ MQSGD+D VWLMQGWLF+YDPFW+PPQMKALL SVP G+LVVLD
Sbjct: 331  DDPEYISSLGVAIFRGMQSGDNDGVWLMQGWLFSYDPFWRPPQMKALLQSVPAGRLVVLD 390

Query: 543  LFAEVKPIWISTEQFYGVPYIWCMLYNFAGNIEMYGVLDAIASGPVEASKSENSTMVGVG 722
            LFAEVKPIWI+TEQ        CML+NFAGN+EMYGVLDAIASGP++A  SENSTMVGVG
Sbjct: 391  LFAEVKPIWITTEQ--------CMLHNFAGNVEMYGVLDAIASGPIDARTSENSTMVGVG 442

Query: 723  MSMEGIEQNPVVYDLMAEMAFRHEKIDVKMWIDLYPKRRYGRFHPLIQDAWNILYHTIYN 902
            MSMEGIEQNP+VYDLM+EMAF+H K+D K WID Y  RRYGR  P IQDAWNILYHT+YN
Sbjct: 443  MSMEGIEQNPIVYDLMSEMAFQHNKVDAKAWIDQYSARRYGRSVPSIQDAWNILYHTLYN 502

Query: 903  CTDGAYDKNRDVIVAFPDIDPSLIPLVEMSTSGKHKLLKDKVSRQASLKDVSDSYDQPHL 1082
            CTDGAYDKNRDVIVAFPD+DPS I +   +        ++ V+ +A LK+++DS+DQPHL
Sbjct: 503  CTDGAYDKNRDVIVAFPDVDPSFISIPPEAFQPN----ENPVAGRAVLKEITDSFDQPHL 558

Query: 1083 WYSTDEVIHALELFLASGDELSASSTYRYDIVDLTRQALAKYANQVFLKIIEVYQLKNVQ 1262
            WYST EVIHAL++F+ASGDELS SS YRYD+VDLTRQALAKYANQ+FLK+IE YQ  +  
Sbjct: 559  WYSTSEVIHALDIFIASGDELSESSAYRYDLVDLTRQALAKYANQLFLKVIEAYQFNDAI 618

Query: 1263 EVTFYSQHFLDLVKDMDTLLACHDGFLLGPWLESSKQLAQNSEQEKQNEWNARTQITMWF 1442
             V   SQ FL LV+DMDTLLACHDGFLLGPWLES+K+LAQ+ EQEKQ EWNARTQITMWF
Sbjct: 619  GVARRSQKFLGLVEDMDTLLACHDGFLLGPWLESAKKLAQDEEQEKQFEWNARTQITMWF 678

Query: 1443 DNTEEEASLLRDYGNKYWSGLLSDYYCPRAAIYFKYLIESLEKGQRFPLKDWRRDWIKLT 1622
            DNT+EEASLLRDYGNKYWSGLL DYY PRAAIYFKYL +SLE G  F LKDWRR+WIKLT
Sbjct: 679  DNTKEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYLTQSLEWGSEFRLKDWRREWIKLT 738

Query: 1623 NDWQKGRNIFPVKSEGDALNTSRWLYAKYL 1712
            NDWQ  R  FPVKS G+ALNTSRWL+ KYL
Sbjct: 739  NDWQNSRKEFPVKSSGNALNTSRWLFDKYL 768


>ref|XP_004301281.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Fragaria vesca subsp.
            vesca]
          Length = 834

 Score =  918 bits (2373), Expect = 0.0
 Identities = 437/578 (75%), Positives = 497/578 (85%), Gaps = 8/578 (1%)
 Frame = +3

Query: 3    GNLHGWGGPLPESWFVQQLSLQKKILSRMYELGMTPVLPAFSGNVPAALKNRFPSAKITR 182
            GNLHGWGGPLP+SW  QQL LQK+IL RMYELGMTPVLPAFSGNVPAALK  +P+AKIT+
Sbjct: 245  GNLHGWGGPLPQSWLDQQLILQKRILDRMYELGMTPVLPAFSGNVPAALKTIYPAAKITQ 304

Query: 183  LGNWFTVGSDPRWCCTYLLDATDPLFIEIGKAFIRQQVKEYGRSGHIYNCDTFDENTPPS 362
            LGNWF+V SDPRW CTYLLDATDPLF+EIGK FI +Q+KEYGR+ HIYNCDTFDENTPP 
Sbjct: 305  LGNWFSVKSDPRWTCTYLLDATDPLFVEIGKTFIEEQLKEYGRTSHIYNCDTFDENTPPV 364

Query: 363  DDSEYISSLGAAIYKAMQSGDDDAVWLMQGWLFTYDPFWKPPQMKALLHSVPIGKLVVLD 542
            DD EYIS+LG  I+K +QSGD D VWLMQGWLF+YDPFW+P QMKALLHSVP G++VVLD
Sbjct: 365  DDPEYISALGKTIFKGLQSGDKDGVWLMQGWLFSYDPFWRPAQMKALLHSVPAGRMVVLD 424

Query: 543  LFAEVKPIWISTEQFYGVPYIW-------CMLYNFAGNIEMYGVLDAIASGPVEASKSEN 701
            LFAEVKPIW ++EQFYGVPYIW       CML+NFAGN+EMYGVLDAIASGP++A  SEN
Sbjct: 425  LFAEVKPIWTTSEQFYGVPYIWKFGIHYRCMLHNFAGNVEMYGVLDAIASGPIDAWTSEN 484

Query: 702  STMVGVGMSMEGIEQNPVVYDLMAEMAFRHEKIDVKMWIDLYPKRRYGRFHPLIQDAWNI 881
            STMVGVGMSMEGIEQNPVVYDLM+EMAF+  K+DVK WI+LY  RRYGR  PL+QDAW+I
Sbjct: 485  STMVGVGMSMEGIEQNPVVYDLMSEMAFQQNKVDVKDWINLYSTRRYGRAVPLVQDAWSI 544

Query: 882  LYHTIYNCTDGAYDKNRDVIVAFPDIDPSLIPLVEMSTSGKHKLLKDKVSRQASL-KDVS 1058
            L HT YNCTDGAYDKNRDVIVAFPD+DPS I        G ++  K  VSR+A L ++V+
Sbjct: 545  LRHTTYNCTDGAYDKNRDVIVAFPDVDPSFIA---RPPQGYYQNEKSLVSRRAELLEEVT 601

Query: 1059 DSYDQPHLWYSTDEVIHALELFLASGDELSASSTYRYDIVDLTRQALAKYANQVFLKIIE 1238
            DS+++PHLWYST EV+HALELF+ASGDELS S+TYRYD+VDLTRQALAKYAN++FLK+IE
Sbjct: 602  DSFERPHLWYSTSEVVHALELFIASGDELSGSNTYRYDLVDLTRQALAKYANELFLKVIE 661

Query: 1239 VYQLKNVQEVTFYSQHFLDLVKDMDTLLACHDGFLLGPWLESSKQLAQNSEQEKQNEWNA 1418
             Y L +  EV   SQ FL+LV+DMDTLLACHDGFLLGPWLES+K+LAQ+ EQE Q EWNA
Sbjct: 662  AYHLNDTLEVVGLSQKFLELVEDMDTLLACHDGFLLGPWLESAKKLAQDKEQEIQFEWNA 721

Query: 1419 RTQITMWFDNTEEEASLLRDYGNKYWSGLLSDYYCPRAAIYFKYLIESLEKGQRFPLKDW 1598
            RTQITMWFDNTEEEASLLRDYGNKYWSGLL DYY PRAAIYFKYLI+SL++G  F LK+W
Sbjct: 722  RTQITMWFDNTEEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYLIKSLDEGSDFDLKNW 781

Query: 1599 RRDWIKLTNDWQKGRNIFPVKSEGDALNTSRWLYAKYL 1712
            RR+WIKLTNDWQ  RN FPVKS G+A+ TSR L+ KYL
Sbjct: 782  RREWIKLTNDWQSSRNTFPVKSTGNAVTTSRLLFEKYL 819


>ref|XP_004138287.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis sativus]
          Length = 808

 Score =  913 bits (2360), Expect = 0.0
 Identities = 425/577 (73%), Positives = 493/577 (85%)
 Frame = +3

Query: 3    GNLHGWGGPLPESWFVQQLSLQKKILSRMYELGMTPVLPAFSGNVPAALKNRFPSAKITR 182
            GNLH WGGPLP+SWF QQL LQKK++ RM+ELGMTPVLPAFSGN+PAA K  +P+AKITR
Sbjct: 237  GNLHKWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKITR 296

Query: 183  LGNWFTVGSDPRWCCTYLLDATDPLFIEIGKAFIRQQVKEYGRSGHIYNCDTFDENTPPS 362
            LGNWFTV SDPRWCCTYLLDA DPLF+EIGKAFI QQ KEYGR+ H+YNCDTFDENTPP 
Sbjct: 297  LGNWFTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPV 356

Query: 363  DDSEYISSLGAAIYKAMQSGDDDAVWLMQGWLFTYDPFWKPPQMKALLHSVPIGKLVVLD 542
            DD EYISSLG+AI+  MQ+GD +AVWLMQGW+F+YDPFW+P QMKALLHSVP+G+LVVLD
Sbjct: 357  DDVEYISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLD 416

Query: 543  LFAEVKPIWISTEQFYGVPYIWCMLYNFAGNIEMYGVLDAIASGPVEASKSENSTMVGVG 722
            L+AEVKPIWIS+EQFYG+PYIWCML+NFAGN+EMYG+LD+IASGP+EA  S  STMVGVG
Sbjct: 417  LYAEVKPIWISSEQFYGIPYIWCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVG 476

Query: 723  MSMEGIEQNPVVYDLMAEMAFRHEKIDVKMWIDLYPKRRYGRFHPLIQDAWNILYHTIYN 902
            MSMEGIEQNPVVYDLM+EMAF+H K+DVK W+  Y  RRYG   P IQDAW++LYHT+YN
Sbjct: 477  MSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTVYN 536

Query: 903  CTDGAYDKNRDVIVAFPDIDPSLIPLVEMSTSGKHKLLKDKVSRQASLKDVSDSYDQPHL 1082
            CTDGA DKNRDVIVAFPD+DPS I LV    S +H  L   V R   L+D   ++D+PHL
Sbjct: 537  CTDGANDKNRDVIVAFPDVDPSAI-LVLPEGSNRHGNLDSSVDR---LQDA--TFDRPHL 590

Query: 1083 WYSTDEVIHALELFLASGDELSASSTYRYDIVDLTRQALAKYANQVFLKIIEVYQLKNVQ 1262
            WY T EVI AL+LF+A GD+LS+S+TYRYD+VDLTRQALAKY+N++F +I++ YQL +VQ
Sbjct: 591  WYPTSEVISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQ 650

Query: 1263 EVTFYSQHFLDLVKDMDTLLACHDGFLLGPWLESSKQLAQNSEQEKQNEWNARTQITMWF 1442
             +   SQ FL+LV D+DTLLACH+GFLLGPWL+S+KQLA++ E+EKQ EWNARTQITMWF
Sbjct: 651  TMASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWNARTQITMWF 710

Query: 1443 DNTEEEASLLRDYGNKYWSGLLSDYYCPRAAIYFKYLIESLEKGQRFPLKDWRRDWIKLT 1622
            DNTEEEASLLRDYGNKYWSGLL DYYCPRAAIY K+L ES E G RFPL +WRR+WIKLT
Sbjct: 711  DNTEEEASLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPLSNWRREWIKLT 770

Query: 1623 NDWQKGRNIFPVKSEGDALNTSRWLYAKYLWNPGSYD 1733
            NDWQ  R I+PV+S GDAL+TS WLY KYL  P S D
Sbjct: 771  NDWQSSRKIYPVESNGDALDTSHWLYNKYLQIPESSD 807


>ref|XP_007036096.1| Alpha-N-acetylglucosaminidase family / NAGLU family isoform 1
            [Theobroma cacao] gi|508773341|gb|EOY20597.1|
            Alpha-N-acetylglucosaminidase family / NAGLU family
            isoform 1 [Theobroma cacao]
          Length = 798

 Score =  907 bits (2344), Expect = 0.0
 Identities = 430/575 (74%), Positives = 493/575 (85%)
 Frame = +3

Query: 3    GNLHGWGGPLPESWFVQQLSLQKKILSRMYELGMTPVLPAFSGNVPAALKNRFPSAKITR 182
            GNLHGWGGPLP+SWF  QL+LQKKILSRMYELGMTPVLPAFSGNVPAALKN FPSAKITR
Sbjct: 236  GNLHGWGGPLPQSWFNGQLTLQKKILSRMYELGMTPVLPAFSGNVPAALKNIFPSAKITR 295

Query: 183  LGNWFTVGSDPRWCCTYLLDATDPLFIEIGKAFIRQQVKEYGRSGHIYNCDTFDENTPPS 362
            LGNWF+V  +P+WCCTYLLDATDPLFIEIGKAFI++Q+KEYG++ HIYNCDTFDENTPP 
Sbjct: 296  LGNWFSVKGNPKWCCTYLLDATDPLFIEIGKAFIKEQLKEYGKTSHIYNCDTFDENTPPM 355

Query: 363  DDSEYISSLGAAIYKAMQSGDDDAVWLMQGWLFTYDPFWKPPQMKALLHSVPIGKLVVLD 542
            DD EYI+SLG AI+  MQSGD +A+WLMQGWLF+YDPFW+PPQMKALLHSVP+GKLVVLD
Sbjct: 356  DDPEYITSLGVAIFSGMQSGDVNAMWLMQGWLFSYDPFWRPPQMKALLHSVPLGKLVVLD 415

Query: 543  LFAEVKPIWISTEQFYGVPYIWCMLYNFAGNIEMYGVLDAIASGPVEASKSENSTMVGVG 722
            LFAEVKPIWI++EQ        CML+NFAGNIEMYG LDAIASGP+EA  SENSTMVG+G
Sbjct: 416  LFAEVKPIWITSEQ--------CMLHNFAGNIEMYGYLDAIASGPIEALTSENSTMVGIG 467

Query: 723  MSMEGIEQNPVVYDLMAEMAFRHEKIDVKMWIDLYPKRRYGRFHPLIQDAWNILYHTIYN 902
            MSMEGIEQNP+VYDLM+EMAF+H+K+DV+ WI+LY  RRYG+  P I DAW+ILY T+YN
Sbjct: 468  MSMEGIEQNPIVYDLMSEMAFQHKKVDVEAWIELYIARRYGQSIPSISDAWSILYRTLYN 527

Query: 903  CTDGAYDKNRDVIVAFPDIDPSLIPLVEMSTSGKHKLLKDKVSRQASLKDVSDSYDQPHL 1082
            CTDGAYDKNRDVIVAFPD+ PS I L       ++       SR+A L + +D+YDQPHL
Sbjct: 528  CTDGAYDKNRDVIVAFPDVSPSFISLPRE----RYHHYGKPTSRRAVLSEKTDAYDQPHL 583

Query: 1083 WYSTDEVIHALELFLASGDELSASSTYRYDIVDLTRQALAKYANQVFLKIIEVYQLKNVQ 1262
            WYST EVI ALELF+ SGD LSAS+TYRYD+VDLTRQALAKYAN++FL+II+ Y+LK+V 
Sbjct: 584  WYSTSEVIRALELFITSGDALSASNTYRYDLVDLTRQALAKYANELFLEIIDAYELKDVN 643

Query: 1263 EVTFYSQHFLDLVKDMDTLLACHDGFLLGPWLESSKQLAQNSEQEKQNEWNARTQITMWF 1442
             VT  SQ FL+LV+DMDTLLACHDGFLLGPWLES+KQLAQN E+EKQ EWNARTQITMWF
Sbjct: 644  RVTTLSQKFLELVEDMDTLLACHDGFLLGPWLESAKQLAQNKEEEKQFEWNARTQITMWF 703

Query: 1443 DNTEEEASLLRDYGNKYWSGLLSDYYCPRAAIYFKYLIESLEKGQRFPLKDWRRDWIKLT 1622
            DNT+EEASLLRDYGNKYWSG++ DYY PRA IYFK LIESLE G+ F +K WR +WIKLT
Sbjct: 704  DNTKEEASLLRDYGNKYWSGVVGDYYGPRATIYFKVLIESLENGEDFKVKKWRGEWIKLT 763

Query: 1623 NDWQKGRNIFPVKSEGDALNTSRWLYAKYLWNPGS 1727
            NDWQ  R ++PV+S G+AL  SRWLY KYL +  S
Sbjct: 764  NDWQTSRKVYPVESNGNALTISRWLYNKYLRSESS 798


>ref|XP_002318632.2| hypothetical protein POPTR_0012s07760g, partial [Populus trichocarpa]
            gi|550326604|gb|EEE96852.2| hypothetical protein
            POPTR_0012s07760g, partial [Populus trichocarpa]
          Length = 760

 Score =  906 bits (2341), Expect = 0.0
 Identities = 431/577 (74%), Positives = 493/577 (85%)
 Frame = +3

Query: 6    NLHGWGGPLPESWFVQQLSLQKKILSRMYELGMTPVLPAFSGNVPAALKNRFPSAKITRL 185
            NLH WGGPLP+SWF QQL LQKKIL+RMYELGMTPVLPAFSGNVPAAL+N FPSAKITRL
Sbjct: 199  NLHRWGGPLPQSWFDQQLVLQKKILARMYELGMTPVLPAFSGNVPAALRNIFPSAKITRL 258

Query: 186  GNWFTVGSDPRWCCTYLLDATDPLFIEIGKAFIRQQVKEYGRSGHIYNCDTFDENTPPSD 365
            GNWF+V SD RWCCTYLLDATDPLFIEIG+AFI QQ+ EYG + HIYNCDTFDENTPP D
Sbjct: 259  GNWFSVRSDVRWCCTYLLDATDPLFIEIGRAFIEQQLTEYGSTSHIYNCDTFDENTPPVD 318

Query: 366  DSEYISSLGAAIYKAMQSGDDDAVWLMQGWLFTYDPFWKPPQMKALLHSVPIGKLVVLDL 545
            D EYISSLG +I++ MQSGD +AVWLMQGWLF+YDPFW+PPQ KALLHSVPIG+LVVLDL
Sbjct: 319  DPEYISSLGGSIFEGMQSGDSNAVWLMQGWLFSYDPFWRPPQTKALLHSVPIGRLVVLDL 378

Query: 546  FAEVKPIWISTEQFYGVPYIWCMLYNFAGNIEMYGVLDAIASGPVEASKSENSTMVGVGM 725
            FAEVKPIW ++EQFYGVPYIWCML+NFAGN+EMYG LD++ASGPVEA  SENSTMVGVGM
Sbjct: 379  FAEVKPIWNTSEQFYGVPYIWCMLHNFAGNLEMYGYLDSVASGPVEARTSENSTMVGVGM 438

Query: 726  SMEGIEQNPVVYDLMAEMAFRHEKIDVKMWIDLYPKRRYGRFHPLIQDAWNILYHTIYNC 905
            SMEGIEQNPVVYDLM+EMAF+  K+DVK+ +++          P IQ+AWNILYHT+YNC
Sbjct: 439  SMEGIEQNPVVYDLMSEMAFQKNKVDVKV-MEI----------PTIQNAWNILYHTVYNC 487

Query: 906  TDGAYDKNRDVIVAFPDIDPSLIPLVEMSTSGKHKLLKDKVSRQASLKDVSDSYDQPHLW 1085
            TDGAYDKNRDVIVAFPD++P+L+ +++    G+H      VSR+A+L   +DSY+ PHLW
Sbjct: 488  TDGAYDKNRDVIVAFPDVNPNLVSMLQ----GRHHTDVKLVSRRAALIKNTDSYEHPHLW 543

Query: 1086 YSTDEVIHALELFLASGDELSASSTYRYDIVDLTRQALAKYANQVFLKIIEVYQLKNVQE 1265
            YST EV+ ALELF+A GDELS SSTY YD+VDLTRQ LAKYAN++FLK+IE Y+LK+   
Sbjct: 544  YSTTEVVRALELFIAGGDELSGSSTYSYDLVDLTRQVLAKYANELFLKVIEAYRLKDSHG 603

Query: 1266 VTFYSQHFLDLVKDMDTLLACHDGFLLGPWLESSKQLAQNSEQEKQNEWNARTQITMWFD 1445
            V   SQ FLDLV+D+DTLLACH+GFLLGPWLES+KQLAQ+ EQ+ Q EWNARTQITMW+D
Sbjct: 604  VAHQSQMFLDLVEDIDTLLACHEGFLLGPWLESAKQLAQDEEQQIQFEWNARTQITMWYD 663

Query: 1446 NTEEEASLLRDYGNKYWSGLLSDYYCPRAAIYFKYLIESLEKGQRFPLKDWRRDWIKLTN 1625
            NTE EASLLRDYGNKYWSGLL DYY PRAAIYF +L +SLE G  F LK WRR+WIKLTN
Sbjct: 664  NTEVEASLLRDYGNKYWSGLLKDYYGPRAAIYFNFLTQSLENGHGFQLKAWRREWIKLTN 723

Query: 1626 DWQKGRNIFPVKSEGDALNTSRWLYAKYLWNPGSYDH 1736
             WQK R IFPV+S G+ALN SRWLY KYL NP +YDH
Sbjct: 724  KWQKSRKIFPVESNGNALNISRWLYHKYLGNPDTYDH 760


>ref|XP_006856885.1| hypothetical protein AMTR_s00055p00202230 [Amborella trichopoda]
            gi|548860819|gb|ERN18352.1| hypothetical protein
            AMTR_s00055p00202230 [Amborella trichopoda]
          Length = 727

 Score =  906 bits (2341), Expect = 0.0
 Identities = 418/572 (73%), Positives = 486/572 (84%)
 Frame = +3

Query: 6    NLHGWGGPLPESWFVQQLSLQKKILSRMYELGMTPVLPAFSGNVPAALKNRFPSAKITRL 185
            NLH WGGPLP+SW  QQL LQKKIL+RM+ LGMTPVLPAFSGNVPAALK  +P AKI RL
Sbjct: 154  NLHRWGGPLPQSWHDQQLILQKKILARMHNLGMTPVLPAFSGNVPAALKTIYPDAKIARL 213

Query: 186  GNWFTVGSDPRWCCTYLLDATDPLFIEIGKAFIRQQVKEYGRSGHIYNCDTFDENTPPSD 365
            GNWFTV  DPRWCCTYLLD TDPLF++IGKAFI QQ  EYG++GHIYNCDTFDENTPP D
Sbjct: 214  GNWFTVRGDPRWCCTYLLDPTDPLFVQIGKAFIEQQRIEYGKTGHIYNCDTFDENTPPDD 273

Query: 366  DSEYISSLGAAIYKAMQSGDDDAVWLMQGWLFTYDPFWKPPQMKALLHSVPIGKLVVLDL 545
            D +YI++LG+AIY+AM  GD +AVWLMQGWLF+YDPFW+PPQMKALLHSVPIG+LV+LDL
Sbjct: 274  DPDYIAALGSAIYEAMLKGDSEAVWLMQGWLFSYDPFWRPPQMKALLHSVPIGRLVILDL 333

Query: 546  FAEVKPIWISTEQFYGVPYIWCMLYNFAGNIEMYGVLDAIASGPVEASKSENSTMVGVGM 725
            FAEVKPIW++++QFYGVPYIWCML+NFAGNIEMYGVLDA+ASGP+ A +SENS MVGVGM
Sbjct: 334  FAEVKPIWMTSDQFYGVPYIWCMLHNFAGNIEMYGVLDAVASGPINARQSENSMMVGVGM 393

Query: 726  SMEGIEQNPVVYDLMAEMAFRHEKIDVKMWIDLYPKRRYGRFHPLIQDAWNILYHTIYNC 905
            SMEGIEQNP+VYDLM+EMAF H K+DV+ WI  YP +RYG+    +QDAWNILYHT+YNC
Sbjct: 394  SMEGIEQNPIVYDLMSEMAFHHSKVDVEKWIYKYPNQRYGKTVKALQDAWNILYHTVYNC 453

Query: 906  TDGAYDKNRDVIVAFPDIDPSLIPLVEMSTSGKHKLLKDKVSRQASLKDVSDSYDQPHLW 1085
            TDG YDKNRDVIVAFPD+DP +I   E+    +HK  K +   +ASLK++++SYD+PH+W
Sbjct: 454  TDGKYDKNRDVIVAFPDVDPVMISTSEIFVIEEHKPSKAEGLCRASLKEITESYDRPHIW 513

Query: 1086 YSTDEVIHALELFLASGDELSASSTYRYDIVDLTRQALAKYANQVFLKIIEVYQLKNVQE 1265
            Y T +VI+AL+LFL+S  E+S S  +RYD++DLTRQA+AK+ANQ+FLK+I  YQ K++  
Sbjct: 514  YPTSDVINALKLFLSSASEVSESCNFRYDLIDLTRQAVAKHANQIFLKVIAAYQSKDLHG 573

Query: 1266 VTFYSQHFLDLVKDMDTLLACHDGFLLGPWLESSKQLAQNSEQEKQNEWNARTQITMWFD 1445
            V  YSQ FLD+V D+DTLLACH+GFLLGPWLES+K+LAQN EQEKQ EWNARTQITMWFD
Sbjct: 574  VALYSQLFLDIVNDLDTLLACHEGFLLGPWLESAKELAQNEEQEKQYEWNARTQITMWFD 633

Query: 1446 NTEEEASLLRDYGNKYWSGLLSDYYCPRAAIYFKYLIESLEKGQRFPLKDWRRDWIKLTN 1625
            NTE EASLLRDYGNKYWSGLL DYY PRAAIYF YL+ESLE G+ F L DWRRDWI LTN
Sbjct: 634  NTEVEASLLRDYGNKYWSGLLQDYYGPRAAIYFSYLLESLETGEDFRLIDWRRDWIALTN 693

Query: 1626 DWQKGRNIFPVKSEGDALNTSRWLYAKYLWNP 1721
             WQ  R IFP K EGDALN +  LY KYL NP
Sbjct: 694  KWQNSRKIFPSKGEGDALNIALQLYEKYLQNP 725


>ref|XP_007138123.1| hypothetical protein PHAVU_009G182100g [Phaseolus vulgaris]
            gi|561011210|gb|ESW10117.1| hypothetical protein
            PHAVU_009G182100g [Phaseolus vulgaris]
          Length = 777

 Score =  872 bits (2254), Expect = 0.0
 Identities = 416/570 (72%), Positives = 473/570 (82%)
 Frame = +3

Query: 3    GNLHGWGGPLPESWFVQQLSLQKKILSRMYELGMTPVLPAFSGNVPAALKNRFPSAKITR 182
            GNLHGWGGPLP+SWF +QL LQKKIL+RMYELGMTPVLPAFSGNVPAALK  FPSAKITR
Sbjct: 233  GNLHGWGGPLPQSWFDKQLILQKKILARMYELGMTPVLPAFSGNVPAALKYIFPSAKITR 292

Query: 183  LGNWFTVGSDPRWCCTYLLDATDPLFIEIGKAFIRQQVKEYGRSGHIYNCDTFDENTPPS 362
            LGNWF+V +D +WCCTYLLDATDPLFIEIGKAFI +Q++EYGR+GHIYNCDTFDENTPP 
Sbjct: 293  LGNWFSVKNDLKWCCTYLLDATDPLFIEIGKAFIEKQLQEYGRTGHIYNCDTFDENTPPI 352

Query: 363  DDSEYISSLGAAIYKAMQSGDDDAVWLMQGWLFTYDPFWKPPQMKALLHSVPIGKLVVLD 542
            DD EYISSLGAA +K MQSGDDDAVWLMQGWLF+YDPFW+PPQMKALLHSVP+GKLVVLD
Sbjct: 353  DDPEYISSLGAATFKGMQSGDDDAVWLMQGWLFSYDPFWRPPQMKALLHSVPLGKLVVLD 412

Query: 543  LFAEVKPIWISTEQFYGVPYIWCMLYNFAGNIEMYGVLDAIASGPVEASKSENSTMVGVG 722
            LFAEVKPIW+++EQFYGVPYIW                                  VGVG
Sbjct: 413  LFAEVKPIWVTSEQFYGVPYIW---------------------------------KVGVG 439

Query: 723  MSMEGIEQNPVVYDLMAEMAFRHEKIDVKMWIDLYPKRRYGRFHPLIQDAWNILYHTIYN 902
            MSMEGIEQNP+VYDLM+EMAF+ +KIDVK W+D+Y  RRYG+  PLIQ+ WN+LYHTIYN
Sbjct: 440  MSMEGIEQNPIVYDLMSEMAFQQKKIDVKAWVDMYSTRRYGKSLPLIQEGWNVLYHTIYN 499

Query: 903  CTDGAYDKNRDVIVAFPDIDPSLIPLVEMSTSGKHKLLKDKVSRQASLKDVSDSYDQPHL 1082
            CTDGAYDKNRDVIVAFPD+DPSLI  V+   S  +      V     +K+++D +D+PHL
Sbjct: 500  CTDGAYDKNRDVIVAFPDVDPSLIS-VQYDQSHHYYRPSGTV-----IKEITDPFDRPHL 553

Query: 1083 WYSTDEVIHALELFLASGDELSASSTYRYDIVDLTRQALAKYANQVFLKIIEVYQLKNVQ 1262
            WYST EVI+ALELF+  GDELS S TYRYD+VDLTRQ LAKYAN++F K+IE Y+  +V 
Sbjct: 554  WYSTSEVIYALELFITIGDELSRSKTYRYDLVDLTRQVLAKYANELFFKVIEAYKSHDVH 613

Query: 1263 EVTFYSQHFLDLVKDMDTLLACHDGFLLGPWLESSKQLAQNSEQEKQNEWNARTQITMWF 1442
             +T  SQ FLDLV+D+DTLLACHDGFLLGPWLES+KQLAQN EQE+Q EWNARTQITMWF
Sbjct: 614  GMTLLSQRFLDLVEDLDTLLACHDGFLLGPWLESAKQLAQNEEQERQFEWNARTQITMWF 673

Query: 1443 DNTEEEASLLRDYGNKYWSGLLSDYYCPRAAIYFKYLIESLEKGQRFPLKDWRRDWIKLT 1622
            DNT+EEASLLRDYGNKYWSGLL DYY PRAAIYFKYL ESLE+G+ F L +WRR+WIKLT
Sbjct: 674  DNTKEEASLLRDYGNKYWSGLLHDYYGPRAAIYFKYLRESLERGEDFKLIEWRREWIKLT 733

Query: 1623 NDWQKGRNIFPVKSEGDALNTSRWLYAKYL 1712
            N+WQK RN FPV+S+GDALNTSRWL+ KYL
Sbjct: 734  NEWQKSRNTFPVESKGDALNTSRWLFNKYL 763


>ref|XP_007138122.1| hypothetical protein PHAVU_009G182100g [Phaseolus vulgaris]
            gi|561011209|gb|ESW10116.1| hypothetical protein
            PHAVU_009G182100g [Phaseolus vulgaris]
          Length = 594

 Score =  872 bits (2254), Expect = 0.0
 Identities = 416/570 (72%), Positives = 473/570 (82%)
 Frame = +3

Query: 3    GNLHGWGGPLPESWFVQQLSLQKKILSRMYELGMTPVLPAFSGNVPAALKNRFPSAKITR 182
            GNLHGWGGPLP+SWF +QL LQKKIL+RMYELGMTPVLPAFSGNVPAALK  FPSAKITR
Sbjct: 50   GNLHGWGGPLPQSWFDKQLILQKKILARMYELGMTPVLPAFSGNVPAALKYIFPSAKITR 109

Query: 183  LGNWFTVGSDPRWCCTYLLDATDPLFIEIGKAFIRQQVKEYGRSGHIYNCDTFDENTPPS 362
            LGNWF+V +D +WCCTYLLDATDPLFIEIGKAFI +Q++EYGR+GHIYNCDTFDENTPP 
Sbjct: 110  LGNWFSVKNDLKWCCTYLLDATDPLFIEIGKAFIEKQLQEYGRTGHIYNCDTFDENTPPI 169

Query: 363  DDSEYISSLGAAIYKAMQSGDDDAVWLMQGWLFTYDPFWKPPQMKALLHSVPIGKLVVLD 542
            DD EYISSLGAA +K MQSGDDDAVWLMQGWLF+YDPFW+PPQMKALLHSVP+GKLVVLD
Sbjct: 170  DDPEYISSLGAATFKGMQSGDDDAVWLMQGWLFSYDPFWRPPQMKALLHSVPLGKLVVLD 229

Query: 543  LFAEVKPIWISTEQFYGVPYIWCMLYNFAGNIEMYGVLDAIASGPVEASKSENSTMVGVG 722
            LFAEVKPIW+++EQFYGVPYIW                                  VGVG
Sbjct: 230  LFAEVKPIWVTSEQFYGVPYIW---------------------------------KVGVG 256

Query: 723  MSMEGIEQNPVVYDLMAEMAFRHEKIDVKMWIDLYPKRRYGRFHPLIQDAWNILYHTIYN 902
            MSMEGIEQNP+VYDLM+EMAF+ +KIDVK W+D+Y  RRYG+  PLIQ+ WN+LYHTIYN
Sbjct: 257  MSMEGIEQNPIVYDLMSEMAFQQKKIDVKAWVDMYSTRRYGKSLPLIQEGWNVLYHTIYN 316

Query: 903  CTDGAYDKNRDVIVAFPDIDPSLIPLVEMSTSGKHKLLKDKVSRQASLKDVSDSYDQPHL 1082
            CTDGAYDKNRDVIVAFPD+DPSLI  V+   S  +      V     +K+++D +D+PHL
Sbjct: 317  CTDGAYDKNRDVIVAFPDVDPSLIS-VQYDQSHHYYRPSGTV-----IKEITDPFDRPHL 370

Query: 1083 WYSTDEVIHALELFLASGDELSASSTYRYDIVDLTRQALAKYANQVFLKIIEVYQLKNVQ 1262
            WYST EVI+ALELF+  GDELS S TYRYD+VDLTRQ LAKYAN++F K+IE Y+  +V 
Sbjct: 371  WYSTSEVIYALELFITIGDELSRSKTYRYDLVDLTRQVLAKYANELFFKVIEAYKSHDVH 430

Query: 1263 EVTFYSQHFLDLVKDMDTLLACHDGFLLGPWLESSKQLAQNSEQEKQNEWNARTQITMWF 1442
             +T  SQ FLDLV+D+DTLLACHDGFLLGPWLES+KQLAQN EQE+Q EWNARTQITMWF
Sbjct: 431  GMTLLSQRFLDLVEDLDTLLACHDGFLLGPWLESAKQLAQNEEQERQFEWNARTQITMWF 490

Query: 1443 DNTEEEASLLRDYGNKYWSGLLSDYYCPRAAIYFKYLIESLEKGQRFPLKDWRRDWIKLT 1622
            DNT+EEASLLRDYGNKYWSGLL DYY PRAAIYFKYL ESLE+G+ F L +WRR+WIKLT
Sbjct: 491  DNTKEEASLLRDYGNKYWSGLLHDYYGPRAAIYFKYLRESLERGEDFKLIEWRREWIKLT 550

Query: 1623 NDWQKGRNIFPVKSEGDALNTSRWLYAKYL 1712
            N+WQK RN FPV+S+GDALNTSRWL+ KYL
Sbjct: 551  NEWQKSRNTFPVESKGDALNTSRWLFNKYL 580


>dbj|BAK07078.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 829

 Score =  865 bits (2235), Expect = 0.0
 Identities = 402/589 (68%), Positives = 488/589 (82%), Gaps = 8/589 (1%)
 Frame = +3

Query: 6    NLHGWGGPLPESWFVQQLSLQKKILSRMYELGMTPVLPAFSGNVPAALKNRFPSAKITRL 185
            N+HGWGGPLP++W   QL+LQKKILSRMY  GM+PVLPAFSGN+PAALK +FPSAK+T L
Sbjct: 241  NMHGWGGPLPQTWLDDQLTLQKKILSRMYAFGMSPVLPAFSGNIPAALKLKFPSAKVTHL 300

Query: 186  GNWFTVGSDPRWCCTYLLDATDPLFIEIGKAFIRQQVKEYGRSGHIYNCDTFDENTPPSD 365
            GNWFTV S+PRWCCTYLLDA+DPL++EIGK FI +Q++EYGR+ H+YNCDTFDENTPP  
Sbjct: 301  GNWFTVDSNPRWCCTYLLDASDPLYVEIGKLFIEEQIREYGRTSHVYNCDTFDENTPPLS 360

Query: 366  DSEYISSLGAAIYKAMQSGDDDAVWLMQGWLFTYDPFWKPPQMKALLHSVPIGKLVVLDL 545
            D  YISSLGAA ++AMQSGD+DA+WLMQGWLFTYDPFW+PPQMKALLHSVP+G+++VLDL
Sbjct: 361  DPNYISSLGAATFRAMQSGDNDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGRMIVLDL 420

Query: 546  FAEVKPIWISTEQFYGVPYIWCMLYNFAGNIEMYGVLDAIASGPVEASKSENSTMVGVGM 725
            +AEVKP+WI+++QFYGVPYIWCML+NFA + EMYGVLDA+ASGP++A  SENSTMVGVGM
Sbjct: 421  YAEVKPVWINSDQFYGVPYIWCMLHNFAADFEMYGVLDAVASGPIDARLSENSTMVGVGM 480

Query: 726  SMEGIEQNPVVYDLMAEMAFRHEKIDVKMWIDLYPKRRYGRFHPLIQDAWNILYHTIYNC 905
            SMEGIEQNP+VYDLM+EM F H ++D+K+W++ YP RRYG+    +QDAW IL+ T+YNC
Sbjct: 481  SMEGIEQNPIVYDLMSEMVFHHRQVDLKVWVETYPTRRYGKSVVGLQDAWRILHQTLYNC 540

Query: 906  TDGAYDKNRDVIVAFPDIDPSLI--PLVEMSTSGKHKLLKDKVSRQASLKDV-SDSYDQP 1076
            TDG  DKNRDVIVAFPD++PS+I  P +   TS  +  +   +S    +KD  +D+Y+QP
Sbjct: 541  TDGKNDKNRDVIVAFPDVEPSVIQTPGLYARTSKNYSTM---LSENYVMKDAPNDAYEQP 597

Query: 1077 HLWYSTDEVIHALELFLASGDELSASSTYRYDIVDLTRQALAKYANQVFLKIIEVYQLKN 1256
            H+WY T  VIHALELFL SGDE+S SST+RYD+VDLTRQALAKYANQ+FLKII+ Y+  N
Sbjct: 598  HIWYDTIAVIHALELFLESGDEVSDSSTFRYDLVDLTRQALAKYANQIFLKIIQGYKSNN 657

Query: 1257 VQEVTFYSQHFLDLVKDMDTLLACHDGFLLGPWLESSKQLAQNSEQEKQNEWNARTQITM 1436
            V +VT   + FL+LVKD+D LLA H+GFLLGPWLES+K LA++ EQE Q EWNARTQITM
Sbjct: 658  VNQVTTLCERFLNLVKDLDMLLASHEGFLLGPWLESAKGLARSQEQEIQYEWNARTQITM 717

Query: 1437 WFDNTEEEASLLRDYGNKYWSGLLSDYYCPRAAIYFKYLIESLEKGQRFPLKDWRRDWIK 1616
            WFDNTE +ASLLRDY NKYWSGLL DYY PRAAIYFK+LI SL+K + F L++WRR+WI 
Sbjct: 718  WFDNTETKASLLRDYANKYWSGLLRDYYGPRAAIYFKHLISSLKKKEPFALEEWRREWIS 777

Query: 1617 LTNDWQKGRNIFPVKSEGDALNTSRWLYAKYLWNPGS-----YDHLIKP 1748
            LTN+WQ  R +F   + GDALN SR L+ KYL N  S      D  +KP
Sbjct: 778  LTNNWQSDRKVFATTATGDALNISRALFTKYLRNADSLGLDGMDSFVKP 826


>dbj|BAK03902.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 829

 Score =  863 bits (2231), Expect = 0.0
 Identities = 402/589 (68%), Positives = 487/589 (82%), Gaps = 8/589 (1%)
 Frame = +3

Query: 6    NLHGWGGPLPESWFVQQLSLQKKILSRMYELGMTPVLPAFSGNVPAALKNRFPSAKITRL 185
            N+HGWGGPLP++W   QL+LQKKILSRMY  GM+PVLPAFSGN+PAALK +FPSAK+T L
Sbjct: 241  NMHGWGGPLPQTWLDDQLTLQKKILSRMYAFGMSPVLPAFSGNIPAALKLKFPSAKVTHL 300

Query: 186  GNWFTVGSDPRWCCTYLLDATDPLFIEIGKAFIRQQVKEYGRSGHIYNCDTFDENTPPSD 365
            GNWFTV S+PRWCCTYLLDA+DPL++EIGK FI +Q++EYGR+ H+YNCDTFDENTPP  
Sbjct: 301  GNWFTVDSNPRWCCTYLLDASDPLYVEIGKLFIEEQIREYGRTSHVYNCDTFDENTPPLS 360

Query: 366  DSEYISSLGAAIYKAMQSGDDDAVWLMQGWLFTYDPFWKPPQMKALLHSVPIGKLVVLDL 545
            D  YISSLGAA ++AMQSGD+DA+WLMQGWLFTYDPFW+PPQMKALLHSVP+G+++VLDL
Sbjct: 361  DPNYISSLGAATFRAMQSGDNDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGRMIVLDL 420

Query: 546  FAEVKPIWISTEQFYGVPYIWCMLYNFAGNIEMYGVLDAIASGPVEASKSENSTMVGVGM 725
            +AEVKP WI+++QFYGVPYIWCML+NFA + EMYGVLDA+ASGP++A  SENSTMVGVGM
Sbjct: 421  YAEVKPAWINSDQFYGVPYIWCMLHNFAADFEMYGVLDAVASGPIDARLSENSTMVGVGM 480

Query: 726  SMEGIEQNPVVYDLMAEMAFRHEKIDVKMWIDLYPKRRYGRFHPLIQDAWNILYHTIYNC 905
            SMEGIEQNP+VYDLM+EM F H ++D+K+W++ YP RRYG+    +QDAW IL+ T+YNC
Sbjct: 481  SMEGIEQNPIVYDLMSEMVFHHRQVDLKVWVETYPTRRYGKSVVGLQDAWRILHQTLYNC 540

Query: 906  TDGAYDKNRDVIVAFPDIDPSLI--PLVEMSTSGKHKLLKDKVSRQASLKDV-SDSYDQP 1076
            TDG  DKNRDVIVAFPD++PS+I  P +   TS  +  +   +S    +KD  +D+Y+QP
Sbjct: 541  TDGKNDKNRDVIVAFPDVEPSVIQTPGLYARTSKNYSTM---LSENYVMKDAPNDAYEQP 597

Query: 1077 HLWYSTDEVIHALELFLASGDELSASSTYRYDIVDLTRQALAKYANQVFLKIIEVYQLKN 1256
            H+WY T  VIHALELFL SGDE+S SST+RYD+VDLTRQALAKYANQ+FLKII+ Y+  N
Sbjct: 598  HIWYDTIAVIHALELFLESGDEVSDSSTFRYDLVDLTRQALAKYANQIFLKIIQGYKSNN 657

Query: 1257 VQEVTFYSQHFLDLVKDMDTLLACHDGFLLGPWLESSKQLAQNSEQEKQNEWNARTQITM 1436
            V +VT   + FL+LVKD+D LLA H+GFLLGPWLES+K LA++ EQE Q EWNARTQITM
Sbjct: 658  VNQVTTLCERFLNLVKDLDMLLASHEGFLLGPWLESAKGLARSQEQEIQYEWNARTQITM 717

Query: 1437 WFDNTEEEASLLRDYGNKYWSGLLSDYYCPRAAIYFKYLIESLEKGQRFPLKDWRRDWIK 1616
            WFDNTE +ASLLRDY NKYWSGLL DYY PRAAIYFK+LI SL+K + F L++WRR+WI 
Sbjct: 718  WFDNTETKASLLRDYANKYWSGLLRDYYGPRAAIYFKHLISSLKKKEPFALEEWRREWIS 777

Query: 1617 LTNDWQKGRNIFPVKSEGDALNTSRWLYAKYLWNPGS-----YDHLIKP 1748
            LTN+WQ  R +F   + GDALN SR L+ KYL N  S      D  +KP
Sbjct: 778  LTNNWQSDRKVFATTATGDALNISRALFTKYLRNADSLGLDGMDSFVKP 826

