BLASTX nr result

ID: Ephedra27_contig00015419 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra27_contig00015419
         (2143 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002513892.1| Periplasmic beta-glucosidase precursor, puta...   704   0.0  
ref|XP_006434020.1| hypothetical protein CICLE_v10000352mg [Citr...   698   0.0  
gb|EOY16051.1| Glycosyl hydrolase family protein [Theobroma cacao]    697   0.0  
gb|EOY16050.1| Glycosyl hydrolase family protein isoform 3 [Theo...   697   0.0  
gb|EOY16049.1| Glycosyl hydrolase family protein isoform 2 [Theo...   697   0.0  
gb|EOY16048.1| Glycosyl hydrolase family protein isoform 1 [Theo...   697   0.0  
ref|XP_006472631.1| PREDICTED: probable beta-D-xylosidase 7-like...   697   0.0  
ref|XP_006826952.1| hypothetical protein AMTR_s00010p00188970 [A...   696   0.0  
ref|XP_002452540.1| hypothetical protein SORBIDRAFT_04g027700 [S...   694   0.0  
ref|XP_002306583.2| hypothetical protein POPTR_0005s16660g [Popu...   692   0.0  
gb|EEC74020.1| hypothetical protein OsI_08964 [Oryza sativa Indi...   689   0.0  
ref|XP_006647908.1| PREDICTED: probable beta-D-xylosidase 7-like...   688   0.0  
ref|NP_001048140.1| Os02g0752200 [Oryza sativa Japonica Group] g...   688   0.0  
ref|XP_004953945.1| PREDICTED: probable beta-D-xylosidase 7-like...   687   0.0  
ref|XP_002302285.1| glycosyl hydrolase family 3 family protein [...   687   0.0  
ref|XP_002285805.1| PREDICTED: probable beta-D-xylosidase 7-like...   681   0.0  
ref|NP_001266114.1| SlArf/Xyl4 protein precursor [Solanum lycope...   681   0.0  
ref|XP_002302284.2| glycosyl hydrolase family 3 family protein [...   680   0.0  
gb|EMJ26446.1| hypothetical protein PRUPE_ppa001675mg [Prunus pe...   679   0.0  
ref|XP_006354009.1| PREDICTED: probable beta-D-xylosidase 7-like...   676   0.0  

>ref|XP_002513892.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
            gi|223546978|gb|EEF48475.1| Periplasmic beta-glucosidase
            precursor, putative [Ricinus communis]
          Length = 774

 Score =  704 bits (1816), Expect = 0.0
 Identities = 346/612 (56%), Positives = 440/612 (71%), Gaps = 1/612 (0%)
 Frame = -1

Query: 2143 RWGRGQETPGEDPLVASKYSIAYVRGLQGDSMSGHHYATPLKASACCKHFTAYDLDDWKG 1964
            RWGRGQETPGEDPLV  KY+++YVRG+QGDS  G      L+ASACCKHFTAYDLD+WKG
Sbjct: 166  RWGRGQETPGEDPLVTGKYAVSYVRGVQGDSFQGGKLKGHLQASACCKHFTAYDLDNWKG 225

Query: 1963 YSRYSFDAMVSKQDLLDTYQPPFKSCVEGGRASGIMCSYNSVNGIPTCADYNLLTKIARQ 1784
             +R+ FDA V+ QDL DTYQPPF+SCV+ G+ASGIMC+YN VNGIP+CAD+NLL++ AR 
Sbjct: 226  VNRFVFDARVTMQDLADTYQPPFQSCVQQGKASGIMCAYNRVNGIPSCADFNLLSRTARG 285

Query: 1783 DWKFQGYIVSDCDAVSIIQTAHNYTRTAEDAAADVLKAGMDVNCGGHLQKYTIPAIQKGK 1604
             W F GYI SDCDAVSII     Y ++ EDA  DVLKAGMDVNCG +LQK+T  A+++ K
Sbjct: 286  QWDFHGYIASDCDAVSIIYDNQGYAKSPEDAVVDVLKAGMDVNCGSYLQKHTKAAVEQKK 345

Query: 1603 LKESEVDRALHNLFSVRMRLGLFNKETSNHVYGHFNGKDVCSPKHQALALDAVRQGIVLL 1424
            L E+ +DRALHNLFSVRMRLGLFN   +   + +     VCS +HQ LAL+A R GIVLL
Sbjct: 346  LPEASIDRALHNLFSVRMRLGLFNGNPTEQPFSNIGPDQVCSQEHQILALEAARNGIVLL 405

Query: 1423 KNAPNHLPLSKTAIKSLAVIGPNANNATVMLGNYFGFPCKTVTPLDGLQRYVPNTLYAAG 1244
            KN+   LPL K+   SLAVIGPNAN+   +LGNY G PCKTVTPL  LQ YV NT+Y +G
Sbjct: 406  KNSARLLPLQKSKTVSLAVIGPNANSVQTLLGNYAGPPCKTVTPLQALQYYVKNTIYYSG 465

Query: 1243 CENVACQSDSMIKEAVYISKKVDQVVLVIGIDQTQEKESQDRVNLNLPGAQEXXXXXXXX 1064
            C+ V C S S I +AV I+K VD+VV+++G+DQTQE+E  DR++L LPG Q+        
Sbjct: 466  CDTVKCSSAS-IDKAVDIAKGVDRVVMIMGLDQTQEREELDRLDLVLPGKQQELITNVAK 524

Query: 1063 XXXXXXXXVILSGGPVDVGFAVNDPHIGSIIWAGYPGEAGGQALAETIFGDHNPGGRLPM 884
                    V+LSGGPVD+ FA  D +IGSI+WAGYPGEAGG ALAE IFGDHNPGG+LPM
Sbjct: 525  SAKNPIVLVLLSGGPVDISFAKYDENIGSILWAGYPGEAGGIALAEIIFGDHNPGGKLPM 584

Query: 883  TWYPQEYVKIPMTNMNMRPNPKTGYPGRTYRFYNGHKVFEFGHGLSYSRHSYKFSSNTIN 704
            TWYPQE+VK+PMT+M MRP+P +GYPGRTYRFY G  VFEFG+GLSYS++SY+     ++
Sbjct: 585  TWYPQEFVKVPMTDMRMRPDPSSGYPGRTYRFYKGRNVFEFGYGLSYSKYSYEL--KYVS 642

Query: 703  SLKLKITASNGXXXXXXXXXXXXSKV-NVKSISCKELEFKCDIHVKNHGDLDGRHSVLLF 527
              KL +  S+             + V  + +  CKE +F   + V+N G++ G+H VLLF
Sbjct: 643  QTKLYLNQSSTMRIIDNSDPVRATLVAQLGAEFCKESKFSVKVGVENQGEMAGKHPVLLF 702

Query: 526  SSSPSAHSGVPLKQLIAFENLSLMAGQNAKVSFDVKPCEHLSTFQEDGERILALGSHNLL 347
            +      +G P +QLI F+++ L AG+ A++ F++ PCEH S   EDG R++  G+H L+
Sbjct: 703  ARHARHGNGRPRRQLIGFKSVILNAGEKAEIEFELSPCEHFSRANEDGLRVMEEGTHFLM 762

Query: 346  VGDVQFPLSVEV 311
            VG  ++P+SV V
Sbjct: 763  VGGDKYPISVVV 774


>ref|XP_006434020.1| hypothetical protein CICLE_v10000352mg [Citrus clementina]
            gi|557536142|gb|ESR47260.1| hypothetical protein
            CICLE_v10000352mg [Citrus clementina]
          Length = 776

 Score =  698 bits (1801), Expect = 0.0
 Identities = 351/617 (56%), Positives = 437/617 (70%), Gaps = 6/617 (0%)
 Frame = -1

Query: 2143 RWGRGQETPGEDPLVASKYSIAYVRGLQGDSMSGHHYATPLKASACCKHFTAYDLDDWKG 1964
            RWGRGQETPGEDPLV  KY+++YVRG+QGD+ +G      L+ASACCKHFTAYDLD+WKG
Sbjct: 168  RWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKG 227

Query: 1963 YSRYSFDAMVSKQDLLDTYQPPFKSCVEGGRASGIMCSYNSVNGIPTCADYNLLTKIARQ 1784
             +RY FDA V+ QDL DTYQPPF+SCV+ GRASGIMC+YN VNGIP+CAD NLL+K AR+
Sbjct: 228  TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287

Query: 1783 DWKFQGYIVSDCDAVSIIQTAHNYTRTAEDAAADVLKAGMDVNCGGHLQKYTIPAIQKGK 1604
             W F GYI SDCDAVSII  A  Y ++ EDA  DVLKAGMDVNCG  LQK+T  A+++ K
Sbjct: 288  LWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347

Query: 1603 LKESEVDRALHNLFSVRMRLGLFNKETSNHVYGHFNGKDVCSPKHQALALDAVRQGIVLL 1424
            L ESE+DRALHNLFSVRMRLGLFN   +   +G      VCSP HQ LAL A + GIVLL
Sbjct: 348  LPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407

Query: 1423 KNAPNHLPLSKTAIKSLAVIGPNANNATVMLGNYFGFPCKTVTPLDGLQRYVPNTLYAAG 1244
            KN+   LPL K+   SLA+IGPNAN+A  +LGNY G  C+++TPL  LQ YV NT+Y  G
Sbjct: 408  KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPG 467

Query: 1243 CENVACQSDSMIKEAVYISKKVDQVVLVIGIDQTQEKESQDRVNLNLPGAQEXXXXXXXX 1064
            C+ VAC S S I +AV I+K  D VVL++G+DQTQEKE  DRV+L LPG Q+        
Sbjct: 468  CDTVACSSAS-IDKAVNIAKGADHVVLIMGLDQTQEKEELDRVDLVLPGRQQELITRVAE 526

Query: 1063 XXXXXXXXVILSGGPVDVGFAVNDPHIGSIIWAGYPGEAGGQALAETIFGDHNPGGRLPM 884
                    V+L GGPVD+ FA +D +IGSI+WAGYPGEAG  ALAE IFGDHNPGGRLPM
Sbjct: 527  AAKKPVILVLLCGGPVDITFAKHDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPM 586

Query: 883  TWYPQEYVKIPMTNMNMRPNPKTGYPGRTYRFYNGHKVFEFGHGLSYSRHSYKFSSNTIN 704
            TWYPQ+Y+K+PMT+M MRP   +G PGRTYRFY G +VF FG GLSYS++SYKF S + N
Sbjct: 587  TWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKSVSQN 646

Query: 703  SLKLKITASNGXXXXXXXXXXXXSKVNVKSIS------CKELEFKCDIHVKNHGDLDGRH 542
             L L  ++S                V+ KS+       C+  +F   I VKNHG++ G+H
Sbjct: 647  KLYLNQSSST-------KMVENQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKH 699

Query: 541  SVLLFSSSPSAHSGVPLKQLIAFENLSLMAGQNAKVSFDVKPCEHLSTFQEDGERILALG 362
             VLLF       +G P+KQL+ F+++ L A + A++ F++ PCE LS  +EDG  ++  G
Sbjct: 700  PVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEG 759

Query: 361  SHNLLVGDVQFPLSVEV 311
            +H L+VGD ++P+S+ V
Sbjct: 760  THFLVVGDEEYPISIFV 776


>gb|EOY16051.1| Glycosyl hydrolase family protein [Theobroma cacao]
          Length = 840

 Score =  697 bits (1799), Expect = 0.0
 Identities = 339/615 (55%), Positives = 432/615 (70%), Gaps = 6/615 (0%)
 Frame = -1

Query: 2143 RWGRGQETPGEDPLVASKYSIAYVRGLQGDSMSGHHYATPLKASACCKHFTAYDLDDWKG 1964
            RWGRGQETPGEDPLV  KY++++VRG+QGDS  G      L+ SACCKHFTAYDLD+WKG
Sbjct: 202  RWGRGQETPGEDPLVTGKYAVSFVRGIQGDSFEGGKLGENLQVSACCKHFTAYDLDNWKG 261

Query: 1963 YSRYSFDAMVSKQDLLDTYQPPFKSCVEGGRASGIMCSYNSVNGIPTCADYNLLTKIARQ 1784
             +R+ FDA V+ QDL DTYQPPF+SC++ G+ASG+MC+YN +NG+P CADYNLL+K AR 
Sbjct: 262  INRFVFDANVTLQDLADTYQPPFQSCIQKGKASGVMCAYNRINGVPNCADYNLLSKTARG 321

Query: 1783 DWKFQGYIVSDCDAVSIIQTAHNYTRTAEDAAADVLKAGMDVNCGGHLQKYTIPAIQKGK 1604
             W F GYI +DCDAVSII     Y +  EDA ADVLKAGMD++CG +L+ YT  A++K K
Sbjct: 322  QWGFDGYITADCDAVSIIYDEQGYAKEPEDAVADVLKAGMDIDCGEYLKNYTESAVKKKK 381

Query: 1603 LKESEVDRALHNLFSVRMRLGLFNKETSNHVYGHFNGKDVCSPKHQALALDAVRQGIVLL 1424
            +  +E+DRALHNLFS+RMRLGLFN   +   +G+     VCS +H  LAL+A R GIVLL
Sbjct: 382  VSVTEIDRALHNLFSIRMRLGLFNGNPTKQPFGNVGSDQVCSQEHLNLALEAARNGIVLL 441

Query: 1423 KNAPNHLPLSKTAIKSLAVIGPNANNATVMLGNYFGFPCKTVTPLDGLQRYVPNTLYAAG 1244
            KN  N LPLSKT   SLAVIGPNAN+   ++GNY G PC+ +TPL GLQ Y+ NT Y  G
Sbjct: 442  KNTDNLLPLSKTKTNSLAVIGPNANSTETLVGNYAGPPCEPITPLQGLQSYIKNTNYHPG 501

Query: 1243 CENVACQSDSMIKEAVYISKKVDQVVLVIGIDQTQEKESQDRVNLNLPGAQEXXXXXXXX 1064
            C  V C SD +  +AV I+   D+VVLV+G+DQTQE+E+ DRV+L LPG Q+        
Sbjct: 502  CSTVNCSSD-LTDQAVKIAAGADRVVLVMGLDQTQEREAHDRVDLVLPGNQQKLISSIVR 560

Query: 1063 XXXXXXXXVILSGGPVDVGFAVNDPHIGSIIWAGYPGEAGGQALAETIFGDHNPGGRLPM 884
                    V+L GGPVD+ FA ND +IGSIIWAGYPGEAGGQALAE IFGDHNPGGRLPM
Sbjct: 561  AANKPVILVLLCGGPVDISFAKNDQNIGSIIWAGYPGEAGGQALAEIIFGDHNPGGRLPM 620

Query: 883  TWYPQEYVKIPMTNMNMRPNPKTGYPGRTYRFYNGHKVFEFGHGLSYSRHSYKFSSNTIN 704
            TWYPQ ++KIPMT+M MRP P +GYPGRTYRFY G KVFEFG+GLSYS +SY+    T N
Sbjct: 621  TWYPQSFIKIPMTDMRMRPEPSSGYPGRTYRFYQGPKVFEFGYGLSYSNYSYEILPVTQN 680

Query: 703  SLKLKITASNGXXXXXXXXXXXXSKVNVKSIS------CKELEFKCDIHVKNHGDLDGRH 542
             + L   +S+               V  KS+S      C++ +F   + V+N+G++ G+H
Sbjct: 681  KVYLNNQSSD------------KMAVAYKSVSEMGPELCEKSKFPVTVGVQNNGEMSGKH 728

Query: 541  SVLLFSSSPSAHSGVPLKQLIAFENLSLMAGQNAKVSFDVKPCEHLSTFQEDGERILALG 362
            +VLLF       +G P+KQL+ F ++ L AG+ A++ F++ PCEHLS+  E G  ++  G
Sbjct: 729  AVLLFVRQAKPGNGRPMKQLVGFNSVDLKAGERAEIKFELSPCEHLSSANEGGLMVIDEG 788

Query: 361  SHNLLVGDVQFPLSV 317
            SH L +GD +  ++V
Sbjct: 789  SHFLSIGDKESEITV 803


>gb|EOY16050.1| Glycosyl hydrolase family protein isoform 3 [Theobroma cacao]
          Length = 1593

 Score =  697 bits (1799), Expect = 0.0
 Identities = 340/609 (55%), Positives = 424/609 (69%)
 Frame = -1

Query: 2143 RWGRGQETPGEDPLVASKYSIAYVRGLQGDSMSGHHYATPLKASACCKHFTAYDLDDWKG 1964
            RWGRGQETPGEDPLV  KY++++VRG+QGDS  G      L+ SACCKHFTAYDLD+WKG
Sbjct: 985  RWGRGQETPGEDPLVTGKYAVSFVRGIQGDSFEGGMLGEHLQVSACCKHFTAYDLDNWKG 1044

Query: 1963 YSRYSFDAMVSKQDLLDTYQPPFKSCVEGGRASGIMCSYNSVNGIPTCADYNLLTKIARQ 1784
             +R+ F+A VS QDL DTYQPPF+SC++ G+ASGIMC+YN VNG+P CADYNLL+K AR 
Sbjct: 1045 VNRFVFNAKVSLQDLADTYQPPFQSCIQQGKASGIMCAYNRVNGVPNCADYNLLSKTARG 1104

Query: 1783 DWKFQGYIVSDCDAVSIIQTAHNYTRTAEDAAADVLKAGMDVNCGGHLQKYTIPAIQKGK 1604
             W F GYI SDCDAVSI+     Y +  EDA ADVLKAGMDVNCG +L+ YT  A++K K
Sbjct: 1105 QWGFNGYITSDCDAVSIMHEKQGYAKVPEDAVADVLKAGMDVNCGNYLKNYTKSAVKKRK 1164

Query: 1603 LKESEVDRALHNLFSVRMRLGLFNKETSNHVYGHFNGKDVCSPKHQALALDAVRQGIVLL 1424
            L  SE+DRALHNLFSVRMRLGLFN   +   +G+     VCS +HQ LAL+A R GIVLL
Sbjct: 1165 LPMSEIDRALHNLFSVRMRLGLFNGNPTKQPFGNIGSDQVCSQEHQNLALEAARNGIVLL 1224

Query: 1423 KNAPNHLPLSKTAIKSLAVIGPNANNATVMLGNYFGFPCKTVTPLDGLQRYVPNTLYAAG 1244
            KN  + LPLSKT   SLAVIGPNAN+A  ++GNY G PCK++TPL  LQ Y  +T Y  G
Sbjct: 1225 KNTDSLLPLSKTKTTSLAVIGPNANSAKTLVGNYAGPPCKSITPLQALQSYAKDTRYHPG 1284

Query: 1243 CENVACQSDSMIKEAVYISKKVDQVVLVIGIDQTQEKESQDRVNLNLPGAQEXXXXXXXX 1064
            C  V C S ++  +AV I+K  D VVLV+G+DQTQE+E  DRV+L LP  Q+        
Sbjct: 1285 CSAVNC-SSALTDQAVKIAKGADHVVLVMGLDQTQEREDHDRVDLVLPAKQQNLISSIAR 1343

Query: 1063 XXXXXXXXVILSGGPVDVGFAVNDPHIGSIIWAGYPGEAGGQALAETIFGDHNPGGRLPM 884
                    V+LSGGPVD+ FA  D HIGSI+WAGYPGEAGG ALAE IFGDHNPGGRLP+
Sbjct: 1344 AAKNPVILVLLSGGPVDITFAKYDQHIGSILWAGYPGEAGGLALAEIIFGDHNPGGRLPV 1403

Query: 883  TWYPQEYVKIPMTNMNMRPNPKTGYPGRTYRFYNGHKVFEFGHGLSYSRHSYKFSSNTIN 704
            TWYPQ ++K+PMT+M MRP P +GYPGRTYRFY G KVFEFG+GLSYS++SY+F   T N
Sbjct: 1404 TWYPQSFIKVPMTDMRMRPEPSSGYPGRTYRFYQGPKVFEFGYGLSYSKYSYEFLPVTQN 1463

Query: 703  SLKLKITASNGXXXXXXXXXXXXSKVNVKSISCKELEFKCDIHVKNHGDLDGRHSVLLFS 524
             + L   + N                  K + C + +F   + V+NHG++ G H VLLF 
Sbjct: 1464 KVYLNHQSCNKMVENSNPVRYMPVSEIAKEL-CDKRKFPVKVGVQNHGEMAGTHPVLLFV 1522

Query: 523  SSPSAHSGVPLKQLIAFENLSLMAGQNAKVSFDVKPCEHLSTFQEDGERILALGSHNLLV 344
                  +G P+KQL+ F +++L AG+  ++ F++ PCEHLS   EDG  ++  G H L +
Sbjct: 1523 RQAKVGNGRPMKQLVGFHSVNLNAGERVEIEFELSPCEHLSRANEDGLMVIEEGPHFLSI 1582

Query: 343  GDVQFPLSV 317
            GD +  ++V
Sbjct: 1583 GDKESEITV 1591



 Score =  684 bits (1764), Expect = 0.0
 Identities = 338/593 (56%), Positives = 421/593 (70%), Gaps = 6/593 (1%)
 Frame = -1

Query: 2143 RWGRGQETPGEDPLVASKYSIAYVRGLQGDSMSGHHYATPLKASACCKHFTAYDLDDWKG 1964
            RWGRGQETPGEDPLV  KY+++YVRG+QGD   G      L+ASACCKHFTAYDLD+WKG
Sbjct: 165  RWGRGQETPGEDPLVTGKYAVSYVRGVQGDIFQGGKLNGHLQASACCKHFTAYDLDNWKG 224

Query: 1963 YSRYSFDAMVSKQDLLDTYQPPFKSCVEGGRASGIMCSYNSVNGIPTCADYNLLTKIARQ 1784
             +R+ FDA V+ QDL DTYQPPFKSCV+ GRASGIMC+YN VNG+P+CAD NLL+K  R 
Sbjct: 225  VNRFVFDARVTVQDLADTYQPPFKSCVQDGRASGIMCAYNRVNGVPSCADSNLLSKTLRG 284

Query: 1783 DWKFQGYIVSDCDAVSIIQTAHNYTRTAEDAAADVLKAGMDVNCGGHLQKYTIPAIQKGK 1604
            +W F+GYI SDCDAV+II     Y ++ EDA  DVLKAGMD+NCG +LQKY+  A+ + K
Sbjct: 285  EWDFKGYITSDCDAVAIIHNDQGYAKSPEDAVVDVLKAGMDLNCGSYLQKYSKSAVLQKK 344

Query: 1603 LKESEVDRALHNLFSVRMRLGLFNKETSNHVYGHFNGKDVCSPKHQALALDAVRQGIVLL 1424
            L ESE+DRALHNLF+VRMRLGLFN   + H +G+     VCSP+HQ LAL+A R GIVLL
Sbjct: 345  LPESEIDRALHNLFAVRMRLGLFNGNPAQHPFGNIGTDQVCSPEHQILALEAARNGIVLL 404

Query: 1423 KNAPNHLPLSKTAIKSLAVIGPNANNATVMLGNYFGFPCKTVTPLDGLQRYVPNTLYAAG 1244
            KN    LPL K  + SLAVIGPNAN+   +LGNY G PCK+VTPL  LQ YV NT+Y  G
Sbjct: 405  KNEEKLLPLPKATV-SLAVIGPNANSPQTLLGNYAGPPCKSVTPLQALQSYVKNTVYHPG 463

Query: 1243 CENVACQSDSMIKEAVYISKKVDQVVLVIGIDQTQEKESQDRVNLNLPGAQEXXXXXXXX 1064
            C+ V+C S  +I +AV I+K+ D VVL++G+DQTQEKE  DRV+L LPG Q+        
Sbjct: 464  CDTVSC-STGVIDKAVDIAKQADYVVLIMGLDQTQEKEELDRVDLLLPGRQQELITSVAK 522

Query: 1063 XXXXXXXXVILSGGPVDVGFAVNDPHIGSIIWAGYPGEAGGQALAETIFGDHNPGGRLPM 884
                    V+LSGGP+DV FA +DP IG I WAGYPGE GG ALAE +FGDHNPGGRLP+
Sbjct: 523  AAKRPVVLVLLSGGPIDVSFAKDDPRIGGIFWAGYPGEGGGIALAEIVFGDHNPGGRLPV 582

Query: 883  TWYPQEYVKIPMTNMNMRPNPKTGYPGRTYRFYNGHKVFEFGHGLSYSRHSYKFSSNTIN 704
            TWYPQE+ K+PMT+M MRP   + YPGRTYRFY G KVFEFG+GLSYS++SY+F+  + N
Sbjct: 583  TWYPQEFTKVPMTDMRMRPESSSEYPGRTYRFYKGDKVFEFGYGLSYSKYSYEFTRVSQN 642

Query: 703  SLKLKITASNGXXXXXXXXXXXXSKVNVKSIS------CKELEFKCDIHVKNHGDLDGRH 542
            ++ L  ++S                V  K +S      C + +F   + VKNHG++ G+H
Sbjct: 643  NVYLNHSSS-------FHTTVTSDSVRYKLVSELGAEVCDQRKFTVCVGVKNHGEMAGKH 695

Query: 541  SVLLFSSSPSAHSGVPLKQLIAFENLSLMAGQNAKVSFDVKPCEHLSTFQEDG 383
             VLLF+   +   G P KQL+ F+++ L AG+ A++ F+V PCEHLS   E G
Sbjct: 696  PVLLFARHGNHGDGRPKKQLVGFQSVILSAGEMAEIQFEVSPCEHLSRANEYG 748


>gb|EOY16049.1| Glycosyl hydrolase family protein isoform 2 [Theobroma cacao]
          Length = 1597

 Score =  697 bits (1799), Expect = 0.0
 Identities = 340/609 (55%), Positives = 424/609 (69%)
 Frame = -1

Query: 2143 RWGRGQETPGEDPLVASKYSIAYVRGLQGDSMSGHHYATPLKASACCKHFTAYDLDDWKG 1964
            RWGRGQETPGEDPLV  KY++++VRG+QGDS  G      L+ SACCKHFTAYDLD+WKG
Sbjct: 989  RWGRGQETPGEDPLVTGKYAVSFVRGIQGDSFEGGMLGEHLQVSACCKHFTAYDLDNWKG 1048

Query: 1963 YSRYSFDAMVSKQDLLDTYQPPFKSCVEGGRASGIMCSYNSVNGIPTCADYNLLTKIARQ 1784
             +R+ F+A VS QDL DTYQPPF+SC++ G+ASGIMC+YN VNG+P CADYNLL+K AR 
Sbjct: 1049 VNRFVFNAKVSLQDLADTYQPPFQSCIQQGKASGIMCAYNRVNGVPNCADYNLLSKTARG 1108

Query: 1783 DWKFQGYIVSDCDAVSIIQTAHNYTRTAEDAAADVLKAGMDVNCGGHLQKYTIPAIQKGK 1604
             W F GYI SDCDAVSI+     Y +  EDA ADVLKAGMDVNCG +L+ YT  A++K K
Sbjct: 1109 QWGFNGYITSDCDAVSIMHEKQGYAKVPEDAVADVLKAGMDVNCGNYLKNYTKSAVKKRK 1168

Query: 1603 LKESEVDRALHNLFSVRMRLGLFNKETSNHVYGHFNGKDVCSPKHQALALDAVRQGIVLL 1424
            L  SE+DRALHNLFSVRMRLGLFN   +   +G+     VCS +HQ LAL+A R GIVLL
Sbjct: 1169 LPMSEIDRALHNLFSVRMRLGLFNGNPTKQPFGNIGSDQVCSQEHQNLALEAARNGIVLL 1228

Query: 1423 KNAPNHLPLSKTAIKSLAVIGPNANNATVMLGNYFGFPCKTVTPLDGLQRYVPNTLYAAG 1244
            KN  + LPLSKT   SLAVIGPNAN+A  ++GNY G PCK++TPL  LQ Y  +T Y  G
Sbjct: 1229 KNTDSLLPLSKTKTTSLAVIGPNANSAKTLVGNYAGPPCKSITPLQALQSYAKDTRYHPG 1288

Query: 1243 CENVACQSDSMIKEAVYISKKVDQVVLVIGIDQTQEKESQDRVNLNLPGAQEXXXXXXXX 1064
            C  V C S ++  +AV I+K  D VVLV+G+DQTQE+E  DRV+L LP  Q+        
Sbjct: 1289 CSAVNC-SSALTDQAVKIAKGADHVVLVMGLDQTQEREDHDRVDLVLPAKQQNLISSIAR 1347

Query: 1063 XXXXXXXXVILSGGPVDVGFAVNDPHIGSIIWAGYPGEAGGQALAETIFGDHNPGGRLPM 884
                    V+LSGGPVD+ FA  D HIGSI+WAGYPGEAGG ALAE IFGDHNPGGRLP+
Sbjct: 1348 AAKNPVILVLLSGGPVDITFAKYDQHIGSILWAGYPGEAGGLALAEIIFGDHNPGGRLPV 1407

Query: 883  TWYPQEYVKIPMTNMNMRPNPKTGYPGRTYRFYNGHKVFEFGHGLSYSRHSYKFSSNTIN 704
            TWYPQ ++K+PMT+M MRP P +GYPGRTYRFY G KVFEFG+GLSYS++SY+F   T N
Sbjct: 1408 TWYPQSFIKVPMTDMRMRPEPSSGYPGRTYRFYQGPKVFEFGYGLSYSKYSYEFLPVTQN 1467

Query: 703  SLKLKITASNGXXXXXXXXXXXXSKVNVKSISCKELEFKCDIHVKNHGDLDGRHSVLLFS 524
             + L   + N                  K + C + +F   + V+NHG++ G H VLLF 
Sbjct: 1468 KVYLNHQSCNKMVENSNPVRYMPVSEIAKEL-CDKRKFPVKVGVQNHGEMAGTHPVLLFV 1526

Query: 523  SSPSAHSGVPLKQLIAFENLSLMAGQNAKVSFDVKPCEHLSTFQEDGERILALGSHNLLV 344
                  +G P+KQL+ F +++L AG+  ++ F++ PCEHLS   EDG  ++  G H L +
Sbjct: 1527 RQAKVGNGRPMKQLVGFHSVNLNAGERVEIEFELSPCEHLSRANEDGLMVIEEGPHFLSI 1586

Query: 343  GDVQFPLSV 317
            GD +  ++V
Sbjct: 1587 GDKESEITV 1595



 Score =  684 bits (1764), Expect = 0.0
 Identities = 338/593 (56%), Positives = 421/593 (70%), Gaps = 6/593 (1%)
 Frame = -1

Query: 2143 RWGRGQETPGEDPLVASKYSIAYVRGLQGDSMSGHHYATPLKASACCKHFTAYDLDDWKG 1964
            RWGRGQETPGEDPLV  KY+++YVRG+QGD   G      L+ASACCKHFTAYDLD+WKG
Sbjct: 165  RWGRGQETPGEDPLVTGKYAVSYVRGVQGDIFQGGKLNGHLQASACCKHFTAYDLDNWKG 224

Query: 1963 YSRYSFDAMVSKQDLLDTYQPPFKSCVEGGRASGIMCSYNSVNGIPTCADYNLLTKIARQ 1784
             +R+ FDA V+ QDL DTYQPPFKSCV+ GRASGIMC+YN VNG+P+CAD NLL+K  R 
Sbjct: 225  VNRFVFDARVTVQDLADTYQPPFKSCVQDGRASGIMCAYNRVNGVPSCADSNLLSKTLRG 284

Query: 1783 DWKFQGYIVSDCDAVSIIQTAHNYTRTAEDAAADVLKAGMDVNCGGHLQKYTIPAIQKGK 1604
            +W F+GYI SDCDAV+II     Y ++ EDA  DVLKAGMD+NCG +LQKY+  A+ + K
Sbjct: 285  EWDFKGYITSDCDAVAIIHNDQGYAKSPEDAVVDVLKAGMDLNCGSYLQKYSKSAVLQKK 344

Query: 1603 LKESEVDRALHNLFSVRMRLGLFNKETSNHVYGHFNGKDVCSPKHQALALDAVRQGIVLL 1424
            L ESE+DRALHNLF+VRMRLGLFN   + H +G+     VCSP+HQ LAL+A R GIVLL
Sbjct: 345  LPESEIDRALHNLFAVRMRLGLFNGNPAQHPFGNIGTDQVCSPEHQILALEAARNGIVLL 404

Query: 1423 KNAPNHLPLSKTAIKSLAVIGPNANNATVMLGNYFGFPCKTVTPLDGLQRYVPNTLYAAG 1244
            KN    LPL K  + SLAVIGPNAN+   +LGNY G PCK+VTPL  LQ YV NT+Y  G
Sbjct: 405  KNEEKLLPLPKATV-SLAVIGPNANSPQTLLGNYAGPPCKSVTPLQALQSYVKNTVYHPG 463

Query: 1243 CENVACQSDSMIKEAVYISKKVDQVVLVIGIDQTQEKESQDRVNLNLPGAQEXXXXXXXX 1064
            C+ V+C S  +I +AV I+K+ D VVL++G+DQTQEKE  DRV+L LPG Q+        
Sbjct: 464  CDTVSC-STGVIDKAVDIAKQADYVVLIMGLDQTQEKEELDRVDLLLPGRQQELITSVAK 522

Query: 1063 XXXXXXXXVILSGGPVDVGFAVNDPHIGSIIWAGYPGEAGGQALAETIFGDHNPGGRLPM 884
                    V+LSGGP+DV FA +DP IG I WAGYPGE GG ALAE +FGDHNPGGRLP+
Sbjct: 523  AAKRPVVLVLLSGGPIDVSFAKDDPRIGGIFWAGYPGEGGGIALAEIVFGDHNPGGRLPV 582

Query: 883  TWYPQEYVKIPMTNMNMRPNPKTGYPGRTYRFYNGHKVFEFGHGLSYSRHSYKFSSNTIN 704
            TWYPQE+ K+PMT+M MRP   + YPGRTYRFY G KVFEFG+GLSYS++SY+F+  + N
Sbjct: 583  TWYPQEFTKVPMTDMRMRPESSSEYPGRTYRFYKGDKVFEFGYGLSYSKYSYEFTRVSQN 642

Query: 703  SLKLKITASNGXXXXXXXXXXXXSKVNVKSIS------CKELEFKCDIHVKNHGDLDGRH 542
            ++ L  ++S                V  K +S      C + +F   + VKNHG++ G+H
Sbjct: 643  NVYLNHSSS-------FHTTVTSDSVRYKLVSELGAEVCDQRKFTVCVGVKNHGEMAGKH 695

Query: 541  SVLLFSSSPSAHSGVPLKQLIAFENLSLMAGQNAKVSFDVKPCEHLSTFQEDG 383
             VLLF+   +   G P KQL+ F+++ L AG+ A++ F+V PCEHLS   E G
Sbjct: 696  PVLLFARHGNHGDGRPKKQLVGFQSVILSAGEMAEIQFEVSPCEHLSRANEYG 748


>gb|EOY16048.1| Glycosyl hydrolase family protein isoform 1 [Theobroma cacao]
          Length = 1593

 Score =  697 bits (1799), Expect = 0.0
 Identities = 340/609 (55%), Positives = 424/609 (69%)
 Frame = -1

Query: 2143 RWGRGQETPGEDPLVASKYSIAYVRGLQGDSMSGHHYATPLKASACCKHFTAYDLDDWKG 1964
            RWGRGQETPGEDPLV  KY++++VRG+QGDS  G      L+ SACCKHFTAYDLD+WKG
Sbjct: 985  RWGRGQETPGEDPLVTGKYAVSFVRGIQGDSFEGGMLGEHLQVSACCKHFTAYDLDNWKG 1044

Query: 1963 YSRYSFDAMVSKQDLLDTYQPPFKSCVEGGRASGIMCSYNSVNGIPTCADYNLLTKIARQ 1784
             +R+ F+A VS QDL DTYQPPF+SC++ G+ASGIMC+YN VNG+P CADYNLL+K AR 
Sbjct: 1045 VNRFVFNAKVSLQDLADTYQPPFQSCIQQGKASGIMCAYNRVNGVPNCADYNLLSKTARG 1104

Query: 1783 DWKFQGYIVSDCDAVSIIQTAHNYTRTAEDAAADVLKAGMDVNCGGHLQKYTIPAIQKGK 1604
             W F GYI SDCDAVSI+     Y +  EDA ADVLKAGMDVNCG +L+ YT  A++K K
Sbjct: 1105 QWGFNGYITSDCDAVSIMHEKQGYAKVPEDAVADVLKAGMDVNCGNYLKNYTKSAVKKRK 1164

Query: 1603 LKESEVDRALHNLFSVRMRLGLFNKETSNHVYGHFNGKDVCSPKHQALALDAVRQGIVLL 1424
            L  SE+DRALHNLFSVRMRLGLFN   +   +G+     VCS +HQ LAL+A R GIVLL
Sbjct: 1165 LPMSEIDRALHNLFSVRMRLGLFNGNPTKQPFGNIGSDQVCSQEHQNLALEAARNGIVLL 1224

Query: 1423 KNAPNHLPLSKTAIKSLAVIGPNANNATVMLGNYFGFPCKTVTPLDGLQRYVPNTLYAAG 1244
            KN  + LPLSKT   SLAVIGPNAN+A  ++GNY G PCK++TPL  LQ Y  +T Y  G
Sbjct: 1225 KNTDSLLPLSKTKTTSLAVIGPNANSAKTLVGNYAGPPCKSITPLQALQSYAKDTRYHPG 1284

Query: 1243 CENVACQSDSMIKEAVYISKKVDQVVLVIGIDQTQEKESQDRVNLNLPGAQEXXXXXXXX 1064
            C  V C S ++  +AV I+K  D VVLV+G+DQTQE+E  DRV+L LP  Q+        
Sbjct: 1285 CSAVNC-SSALTDQAVKIAKGADHVVLVMGLDQTQEREDHDRVDLVLPAKQQNLISSIAR 1343

Query: 1063 XXXXXXXXVILSGGPVDVGFAVNDPHIGSIIWAGYPGEAGGQALAETIFGDHNPGGRLPM 884
                    V+LSGGPVD+ FA  D HIGSI+WAGYPGEAGG ALAE IFGDHNPGGRLP+
Sbjct: 1344 AAKNPVILVLLSGGPVDITFAKYDQHIGSILWAGYPGEAGGLALAEIIFGDHNPGGRLPV 1403

Query: 883  TWYPQEYVKIPMTNMNMRPNPKTGYPGRTYRFYNGHKVFEFGHGLSYSRHSYKFSSNTIN 704
            TWYPQ ++K+PMT+M MRP P +GYPGRTYRFY G KVFEFG+GLSYS++SY+F   T N
Sbjct: 1404 TWYPQSFIKVPMTDMRMRPEPSSGYPGRTYRFYQGPKVFEFGYGLSYSKYSYEFLPVTQN 1463

Query: 703  SLKLKITASNGXXXXXXXXXXXXSKVNVKSISCKELEFKCDIHVKNHGDLDGRHSVLLFS 524
             + L   + N                  K + C + +F   + V+NHG++ G H VLLF 
Sbjct: 1464 KVYLNHQSCNKMVENSNPVRYMPVSEIAKEL-CDKRKFPVKVGVQNHGEMAGTHPVLLFV 1522

Query: 523  SSPSAHSGVPLKQLIAFENLSLMAGQNAKVSFDVKPCEHLSTFQEDGERILALGSHNLLV 344
                  +G P+KQL+ F +++L AG+  ++ F++ PCEHLS   EDG  ++  G H L +
Sbjct: 1523 RQAKVGNGRPMKQLVGFHSVNLNAGERVEIEFELSPCEHLSRANEDGLMVIEEGPHFLSI 1582

Query: 343  GDVQFPLSV 317
            GD +  ++V
Sbjct: 1583 GDKESEITV 1591



 Score =  684 bits (1764), Expect = 0.0
 Identities = 338/593 (56%), Positives = 421/593 (70%), Gaps = 6/593 (1%)
 Frame = -1

Query: 2143 RWGRGQETPGEDPLVASKYSIAYVRGLQGDSMSGHHYATPLKASACCKHFTAYDLDDWKG 1964
            RWGRGQETPGEDPLV  KY+++YVRG+QGD   G      L+ASACCKHFTAYDLD+WKG
Sbjct: 165  RWGRGQETPGEDPLVTGKYAVSYVRGVQGDIFQGGKLNGHLQASACCKHFTAYDLDNWKG 224

Query: 1963 YSRYSFDAMVSKQDLLDTYQPPFKSCVEGGRASGIMCSYNSVNGIPTCADYNLLTKIARQ 1784
             +R+ FDA V+ QDL DTYQPPFKSCV+ GRASGIMC+YN VNG+P+CAD NLL+K  R 
Sbjct: 225  VNRFVFDARVTVQDLADTYQPPFKSCVQDGRASGIMCAYNRVNGVPSCADSNLLSKTLRG 284

Query: 1783 DWKFQGYIVSDCDAVSIIQTAHNYTRTAEDAAADVLKAGMDVNCGGHLQKYTIPAIQKGK 1604
            +W F+GYI SDCDAV+II     Y ++ EDA  DVLKAGMD+NCG +LQKY+  A+ + K
Sbjct: 285  EWDFKGYITSDCDAVAIIHNDQGYAKSPEDAVVDVLKAGMDLNCGSYLQKYSKSAVLQKK 344

Query: 1603 LKESEVDRALHNLFSVRMRLGLFNKETSNHVYGHFNGKDVCSPKHQALALDAVRQGIVLL 1424
            L ESE+DRALHNLF+VRMRLGLFN   + H +G+     VCSP+HQ LAL+A R GIVLL
Sbjct: 345  LPESEIDRALHNLFAVRMRLGLFNGNPAQHPFGNIGTDQVCSPEHQILALEAARNGIVLL 404

Query: 1423 KNAPNHLPLSKTAIKSLAVIGPNANNATVMLGNYFGFPCKTVTPLDGLQRYVPNTLYAAG 1244
            KN    LPL K  + SLAVIGPNAN+   +LGNY G PCK+VTPL  LQ YV NT+Y  G
Sbjct: 405  KNEEKLLPLPKATV-SLAVIGPNANSPQTLLGNYAGPPCKSVTPLQALQSYVKNTVYHPG 463

Query: 1243 CENVACQSDSMIKEAVYISKKVDQVVLVIGIDQTQEKESQDRVNLNLPGAQEXXXXXXXX 1064
            C+ V+C S  +I +AV I+K+ D VVL++G+DQTQEKE  DRV+L LPG Q+        
Sbjct: 464  CDTVSC-STGVIDKAVDIAKQADYVVLIMGLDQTQEKEELDRVDLLLPGRQQELITSVAK 522

Query: 1063 XXXXXXXXVILSGGPVDVGFAVNDPHIGSIIWAGYPGEAGGQALAETIFGDHNPGGRLPM 884
                    V+LSGGP+DV FA +DP IG I WAGYPGE GG ALAE +FGDHNPGGRLP+
Sbjct: 523  AAKRPVVLVLLSGGPIDVSFAKDDPRIGGIFWAGYPGEGGGIALAEIVFGDHNPGGRLPV 582

Query: 883  TWYPQEYVKIPMTNMNMRPNPKTGYPGRTYRFYNGHKVFEFGHGLSYSRHSYKFSSNTIN 704
            TWYPQE+ K+PMT+M MRP   + YPGRTYRFY G KVFEFG+GLSYS++SY+F+  + N
Sbjct: 583  TWYPQEFTKVPMTDMRMRPESSSEYPGRTYRFYKGDKVFEFGYGLSYSKYSYEFTRVSQN 642

Query: 703  SLKLKITASNGXXXXXXXXXXXXSKVNVKSIS------CKELEFKCDIHVKNHGDLDGRH 542
            ++ L  ++S                V  K +S      C + +F   + VKNHG++ G+H
Sbjct: 643  NVYLNHSSS-------FHTTVTSDSVRYKLVSELGAEVCDQRKFTVCVGVKNHGEMAGKH 695

Query: 541  SVLLFSSSPSAHSGVPLKQLIAFENLSLMAGQNAKVSFDVKPCEHLSTFQEDG 383
             VLLF+   +   G P KQL+ F+++ L AG+ A++ F+V PCEHLS   E G
Sbjct: 696  PVLLFARHGNHGDGRPKKQLVGFQSVILSAGEMAEIQFEVSPCEHLSRANEYG 748


>ref|XP_006472631.1| PREDICTED: probable beta-D-xylosidase 7-like [Citrus sinensis]
          Length = 776

 Score =  697 bits (1798), Expect = 0.0
 Identities = 350/617 (56%), Positives = 436/617 (70%), Gaps = 6/617 (0%)
 Frame = -1

Query: 2143 RWGRGQETPGEDPLVASKYSIAYVRGLQGDSMSGHHYATPLKASACCKHFTAYDLDDWKG 1964
            RWGRGQETPGEDPLV  KY+++YVRG+QGD+ +G      L+ASACCKHFTAYDLD+WKG
Sbjct: 168  RWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGNLQASACCKHFTAYDLDNWKG 227

Query: 1963 YSRYSFDAMVSKQDLLDTYQPPFKSCVEGGRASGIMCSYNSVNGIPTCADYNLLTKIARQ 1784
             +RY FDA V+ QDL DTYQPPF+SCV+ GRASGIMC+YN VNGIP+CAD NLL+K AR+
Sbjct: 228  TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287

Query: 1783 DWKFQGYIVSDCDAVSIIQTAHNYTRTAEDAAADVLKAGMDVNCGGHLQKYTIPAIQKGK 1604
             W F GYI SDCDAVSII  A  Y ++ EDA  DVLKAGMDVNCG  LQK+T  A+++ K
Sbjct: 288  QWGFHGYITSDCDAVSIIHDAQGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347

Query: 1603 LKESEVDRALHNLFSVRMRLGLFNKETSNHVYGHFNGKDVCSPKHQALALDAVRQGIVLL 1424
            L ESE+DRALHNLFSVRMRLGLFN   +   +G      VCSP HQ LAL A + GIVLL
Sbjct: 348  LPESEIDRALHNLFSVRMRLGLFNGNPTTQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407

Query: 1423 KNAPNHLPLSKTAIKSLAVIGPNANNATVMLGNYFGFPCKTVTPLDGLQRYVPNTLYAAG 1244
            KN+   LPL K+   SLA+IGPNAN+A  +LGNY G  C+++TPL  LQ YV NT+Y  G
Sbjct: 408  KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPG 467

Query: 1243 CENVACQSDSMIKEAVYISKKVDQVVLVIGIDQTQEKESQDRVNLNLPGAQEXXXXXXXX 1064
            C+ VAC S S I +AV I+K  D VVL++G+DQTQEKE  DRV+L LPG Q+        
Sbjct: 468  CDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAE 526

Query: 1063 XXXXXXXXVILSGGPVDVGFAVNDPHIGSIIWAGYPGEAGGQALAETIFGDHNPGGRLPM 884
                    V+L GGPVD+ FA  D +IGSI+WAGYPGEAG  ALAE IFGDHNPGGRLPM
Sbjct: 527  AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGGRLPM 586

Query: 883  TWYPQEYVKIPMTNMNMRPNPKTGYPGRTYRFYNGHKVFEFGHGLSYSRHSYKFSSNTIN 704
            TWYPQ+Y+K+PMT+M MRP   +G PGRTYRFY G +VF FG GLSYS++SYKF + + N
Sbjct: 587  TWYPQDYIKVPMTDMKMRPQATSGNPGRTYRFYEGKEVFPFGCGLSYSKYSYKFKAVSQN 646

Query: 703  SLKLKITASNGXXXXXXXXXXXXSKVNVKSIS------CKELEFKCDIHVKNHGDLDGRH 542
             L L  ++S                V+ KS+       C+  +F   I VKNHG++ G+H
Sbjct: 647  KLYLNQSSST-------KMVESQDVVHYKSVPELGTEFCETRKFLVTIGVKNHGEMAGKH 699

Query: 541  SVLLFSSSPSAHSGVPLKQLIAFENLSLMAGQNAKVSFDVKPCEHLSTFQEDGERILALG 362
             VLLF       +G P+KQL+ F+++ L A + A++ F++ PCE LS  +EDG  ++  G
Sbjct: 700  PVLLFVKPARRGNGRPIKQLVGFQSVILNAKEKAEIVFELSPCESLSRAREDGLMVIEEG 759

Query: 361  SHNLLVGDVQFPLSVEV 311
            +H L+VGD ++P+S+ V
Sbjct: 760  THFLVVGDEEYPISIFV 776


>ref|XP_006826952.1| hypothetical protein AMTR_s00010p00188970 [Amborella trichopoda]
            gi|548831381|gb|ERM94189.1| hypothetical protein
            AMTR_s00010p00188970 [Amborella trichopoda]
          Length = 778

 Score =  696 bits (1797), Expect = 0.0
 Identities = 339/606 (55%), Positives = 433/606 (71%), Gaps = 2/606 (0%)
 Frame = -1

Query: 2143 RWGRGQETPGEDPLVASKYSIAYVRGLQGDSMSGHHYATP-LKASACCKHFTAYDLDDWK 1967
            RWGRGQETPGEDP V  KY++AYVRGLQGDS+ G    T  L+ASACCKH TAYDLD W+
Sbjct: 166  RWGRGQETPGEDPTVTGKYAVAYVRGLQGDSIDGSRGPTVRLRASACCKHLTAYDLDKWE 225

Query: 1966 GYSRYSFDAMVSKQDLLDTYQPPFKSCVEGGRASGIMCSYNSVNGIPTCADYNLLTKIAR 1787
            G +RY+F+A VS QDL DTYQPPF+ CVE G ASGIMC+YN VNG+P CADY+LLTK AR
Sbjct: 226  GTTRYTFNAAVSSQDLADTYQPPFQRCVEEGHASGIMCAYNRVNGVPNCADYDLLTKTAR 285

Query: 1786 QDWKFQGYIVSDCDAVSIIQTAHNYTRTAEDAAADVLKAGMDVNCGGHLQKYTIPAIQKG 1607
              W F GYI SDCDAV II  + +Y  T EDA  DVL+AGMDVNCG ++Q++T+ AIQ+G
Sbjct: 286  SRWGFYGYITSDCDAVGIIHDSQSYAATPEDAVGDVLRAGMDVNCGSYMQQHTMSAIQQG 345

Query: 1606 KLKESEVDRALHNLFSVRMRLGLFNKETSNHVYGHFNGKDVCSPKHQALALDAVRQGIVL 1427
            K KES+V+RALH+LFS+R+RLGLF+   +    G+     VCS +HQ LAL A R+GIVL
Sbjct: 346  KAKESDVNRALHHLFSIRIRLGLFDGNPTKLPDGNIGPNIVCSNEHQYLALQAAREGIVL 405

Query: 1426 LKNAPNHLPLSKTAIKSLAVIGPNANNATVMLGNYFGFPCKTVTPLDGLQRYVPNTLYAA 1247
            LKN+P  LPLSK AIKS AVIGPNANN   +LGNY G PC  ++PL  LQRYV +T +A 
Sbjct: 406  LKNSPKVLPLSKNAIKSFAVIGPNANNPQTLLGNYAGPPCNILSPLQALQRYVKHTQFAH 465

Query: 1246 GCENVACQSDSMIKEAVYISKKVDQVVLVIGIDQTQEKESQDRVNLNLPGAQEXXXXXXX 1067
            GC+ VAC S+++I EAV +++  D VVLV+G+DQ+QE+E  DRV+L+LPG QE       
Sbjct: 466  GCDLVACTSEALIDEAVNVARAADHVVLVMGLDQSQEREELDRVSLSLPGHQERLITMVS 525

Query: 1066 XXXXXXXXXVILSGGPVDVGFAVNDPHIGSIIWAGYPGEAGGQALAETIFGDHNPGGRLP 887
                     V+L GGPVDV FA  +  IG+++WAGYPGEAGG ALAE IFG+HNPGG+LP
Sbjct: 526  QAAKKPVVLVLLCGGPVDVSFAKRNSKIGAMVWAGYPGEAGGTALAEIIFGEHNPGGKLP 585

Query: 886  MTWYPQEYVKIPMTNMNMRPNPKTGYPGRTYRFYNGHKVFEFGHGLSYSRHSYKFSSNTI 707
            MTWYP+E+ KIPMTNM MRP+  +GYPGRTYRFY G +VF +GHGLSYS +SY+F S  I
Sbjct: 586  MTWYPEEFTKIPMTNMRMRPDLNSGYPGRTYRFYRGREVFRYGHGLSYSTYSYEFLSPAI 645

Query: 706  NSLKLKITASNGXXXXXXXXXXXXSKVN-VKSISCKELEFKCDIHVKNHGDLDGRHSVLL 530
              L L ++                  ++ + S +C +++F   + V+N G +DG+H VLL
Sbjct: 646  TPLYLNLSLYRDPSLSNQDSEHLIYDIDQLGSEACDQVQFSTTVRVRNSGQMDGQHVVLL 705

Query: 529  FSSSPSAHSGVPLKQLIAFENLSLMAGQNAKVSFDVKPCEHLSTFQEDGERILALGSHNL 350
            FS  P   +  P +QL+ F+++ L AG+  +V F+++PCEH +   EDG R+   G H L
Sbjct: 706  FSRPPVVSNDAPKRQLVDFKSVYLKAGEMREVHFELRPCEHFAGTLEDGRRVFEGGPHYL 765

Query: 349  LVGDVQ 332
            +VG+V+
Sbjct: 766  MVGNVE 771


>ref|XP_002452540.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
            gi|241932371|gb|EES05516.1| hypothetical protein
            SORBIDRAFT_04g027700 [Sorghum bicolor]
          Length = 784

 Score =  694 bits (1791), Expect = 0.0
 Identities = 343/616 (55%), Positives = 434/616 (70%), Gaps = 8/616 (1%)
 Frame = -1

Query: 2143 RWGRGQETPGEDPLVASKYSIAYVRGLQGDSMSGHHYATPLKASACCKHFTAYDLDDWKG 1964
            RWGRGQETPGEDP +  KY+  +VRG+QG  ++G   +T L+ASACCKHFTAYDL++WKG
Sbjct: 175  RWGRGQETPGEDPTMTGKYAAVFVRGVQGYGVAGPVNSTDLEASACCKHFTAYDLENWKG 234

Query: 1963 YSRYSFDAMVSKQDLLDTYQPPFKSCVEGGRASGIMCSYNSVNGIPTCADYNLLTKIARQ 1784
             +RY +DA V+ QDL DTY PPFKSCVE G ASGIMCSYN VNG+PTCADYNLL+K ARQ
Sbjct: 235  ITRYVYDAKVTAQDLEDTYNPPFKSCVEDGHASGIMCSYNRVNGVPTCADYNLLSKTARQ 294

Query: 1783 DWKFQGYIVSDCDAVSIIQTAHNYTRTAEDAAADVLKAGMDVNCGGHLQKYTIPAIQKGK 1604
             W F GYI SDCDAVSII  A  Y +T+EDA ADVLKAGMDVNCGG++QKY   A+Q+GK
Sbjct: 295  SWGFYGYITSDCDAVSIIHDAQGYAKTSEDAVADVLKAGMDVNCGGYVQKYGASALQQGK 354

Query: 1603 LKESEVDRALHNLFSVRMRLGLFNKETSNHVYGHFNGKDVCSPKHQALALDAVRQGIVLL 1424
            + E +++RALHNLF+VRMRLGLFN +   + YG+     VC+ +HQ LAL+A + GIVLL
Sbjct: 355  ITEQDINRALHNLFTVRMRLGLFNGDPRRNRYGNIGPDQVCTQEHQDLALEAAQDGIVLL 414

Query: 1423 KNAPNHLPLSKTAIKSLAVIGPNANNATVMLGNYFGFPCKTVTPLDGLQRYVPNTLYAAG 1244
            KN    LPLSK+ + SLAVIG NANNAT +LGNYFG PC TVTPL  LQ YV +T + AG
Sbjct: 415  KNDGGALPLSKSGVASLAVIGFNANNATSLLGNYFGPPCVTVTPLQVLQGYVKDTSFVAG 474

Query: 1243 CENVACQSDSMIKEAVYISKKVDQVVLVIGIDQTQEKESQDRVNLNLPGAQEXXXXXXXX 1064
            C + AC + + I EAV  +   D VVL +G+DQ QE+E  DR++L LPG Q+        
Sbjct: 475  CNSAAC-NVTTIPEAVQAASSADSVVLFMGLDQNQEREEVDRLDLTLPGQQQTLIESVAN 533

Query: 1063 XXXXXXXXVILSGGPVDVGFAVNDPHIGSIIWAGYPGEAGGQALAETIFGDHNPGGRLPM 884
                    V+L GGPVDV FA  +P IG+I+WAGYPGEAGG A+A+ +FG+HNPGGRLP+
Sbjct: 534  AAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAGYPGEAGGIAIAQVLFGEHNPGGRLPV 593

Query: 883  TWYPQEYVKIPMTNMNMRPNPKTGYPGRTYRFYNGHKVFEFGHGLSYSRHSYKF------ 722
            TWYPQ++ K+PMT+M MR +P TGYPGRTYRFY G  VF FG+GLSYS++S++F      
Sbjct: 594  TWYPQDFTKVPMTDMRMRADPATGYPGRTYRFYRGPTVFNFGYGLSYSKYSHRFVTKPPP 653

Query: 721  SSNTINSLKLKITASNGXXXXXXXXXXXXSKVNVKSISCKELEFKCDIHVKNHGDLDGRH 542
            S + +  LK   T + G                + S +C  L+F   + V+NHG +DG+H
Sbjct: 654  SMSNVAGLKALATTAGGVATYDVEA--------IGSETCDRLKFPAVVRVQNHGPMDGKH 705

Query: 541  SVLLFSSSPSA--HSGVPLKQLIAFENLSLMAGQNAKVSFDVKPCEHLSTFQEDGERILA 368
             VL+F   P+A   SG P +QLI F++L L A Q A V F+V PC+H S   EDG +++ 
Sbjct: 706  PVLVFLRWPNATDGSGRPARQLIGFQSLHLRATQTAHVEFEVSPCKHFSRATEDGRKVID 765

Query: 367  LGSHNLLVGDVQFPLS 320
             GSH ++VGD +F +S
Sbjct: 766  QGSHFVMVGDDEFEMS 781


>ref|XP_002306583.2| hypothetical protein POPTR_0005s16660g [Populus trichocarpa]
            gi|550339137|gb|EEE93579.2| hypothetical protein
            POPTR_0005s16660g [Populus trichocarpa]
          Length = 773

 Score =  692 bits (1787), Expect = 0.0
 Identities = 343/611 (56%), Positives = 426/611 (69%)
 Frame = -1

Query: 2143 RWGRGQETPGEDPLVASKYSIAYVRGLQGDSMSGHHYATPLKASACCKHFTAYDLDDWKG 1964
            RWGRGQETPGEDPL+  KY+++YVRGLQGDS  G     PL+ASACCKHFTAYDL++W G
Sbjct: 165  RWGRGQETPGEDPLMTGKYAVSYVRGLQGDSFKGGEIKGPLQASACCKHFTAYDLENWNG 224

Query: 1963 YSRYSFDAMVSKQDLLDTYQPPFKSCVEGGRASGIMCSYNSVNGIPTCADYNLLTKIARQ 1784
             SRY FDA V+ QDL DTYQPPFKSCVE GRASGIMC+YN VNGIP CAD N L++ AR 
Sbjct: 225  TSRYVFDAYVTAQDLADTYQPPFKSCVEEGRASGIMCAYNRVNGIPNCADSNFLSRTARA 284

Query: 1783 DWKFQGYIVSDCDAVSIIQTAHNYTRTAEDAAADVLKAGMDVNCGGHLQKYTIPAIQKGK 1604
             W F GYI SDCDAVSII  A  Y +T EDA   VLKAGMDVNCG +LQ++T  A+ + K
Sbjct: 285  QWGFDGYIASDCDAVSIIHDAQGYAKTPEDAVVAVLKAGMDVNCGSYLQQHTKAAVDQKK 344

Query: 1603 LKESEVDRALHNLFSVRMRLGLFNKETSNHVYGHFNGKDVCSPKHQALALDAVRQGIVLL 1424
            L  SE+DRALHNLFSVRMRLGLFN   +   +G+     VCS ++Q LALDA R GIVLL
Sbjct: 345  LTISEIDRALHNLFSVRMRLGLFNGNPTGQQFGNIGPDQVCSQENQILALDAARNGIVLL 404

Query: 1423 KNAPNHLPLSKTAIKSLAVIGPNANNATVMLGNYFGFPCKTVTPLDGLQRYVPNTLYAAG 1244
            KN+   LPLSK+   SLAVIGPNAN+   +LGNY G PCK VTPL  LQ Y+ +T+   G
Sbjct: 405  KNSAGLLPLSKSKTMSLAVIGPNANSVQTLLGNYAGPPCKLVTPLQALQSYIKHTIPYPG 464

Query: 1243 CENVACQSDSMIKEAVYISKKVDQVVLVIGIDQTQEKESQDRVNLNLPGAQEXXXXXXXX 1064
            C++V C S S++  AV ++K  D VVL++G+D TQEKE  DR +L LPG Q+        
Sbjct: 465  CDSVQCSSASIV-GAVNVAKGADHVVLIMGLDDTQEKEGLDRRDLVLPGKQQELIISVAK 523

Query: 1063 XXXXXXXXVILSGGPVDVGFAVNDPHIGSIIWAGYPGEAGGQALAETIFGDHNPGGRLPM 884
                    V+LSGGPVD+ FA ND +IGSI+WAGYPGEAG  ALAE IFGDHNPGG+LPM
Sbjct: 524  AAKNPVVLVLLSGGPVDISFAKNDKNIGSILWAGYPGEAGAIALAEIIFGDHNPGGKLPM 583

Query: 883  TWYPQEYVKIPMTNMNMRPNPKTGYPGRTYRFYNGHKVFEFGHGLSYSRHSYKFSSNTIN 704
            TWYPQE+VK+PMT+M MRP   +GYPGRTYRFY G  VFEFG+GLSYS+++Y+  + + N
Sbjct: 584  TWYPQEFVKVPMTDMRMRPETSSGYPGRTYRFYKGPTVFEFGYGLSYSKYTYELRAVSQN 643

Query: 703  SLKLKITASNGXXXXXXXXXXXXSKVNVKSISCKELEFKCDIHVKNHGDLDGRHSVLLFS 524
             L L   +S                  + +  C+  +F   I VKNHG++ G+H VLLF+
Sbjct: 644  KLYLN-QSSTMHKINNFDSVLSLLVSELGTEFCEHNKFPVRIEVKNHGEMAGKHPVLLFA 702

Query: 523  SSPSAHSGVPLKQLIAFENLSLMAGQNAKVSFDVKPCEHLSTFQEDGERILALGSHNLLV 344
                  +G P KQL+ F ++ L AG+ A++ F+V PCEHLS   EDG  ++  G+H L+V
Sbjct: 703  RQTKQGNGRPRKQLVGFHSVQLSAGERAEIEFEVSPCEHLSRTNEDGLMVMEEGTHFLVV 762

Query: 343  GDVQFPLSVEV 311
               ++P+S+ +
Sbjct: 763  EGQEYPISIVI 773


>gb|EEC74020.1| hypothetical protein OsI_08964 [Oryza sativa Indica Group]
          Length = 774

 Score =  689 bits (1777), Expect = 0.0
 Identities = 339/618 (54%), Positives = 438/618 (70%), Gaps = 10/618 (1%)
 Frame = -1

Query: 2143 RWGRGQETPGEDPLVASKYSIAYVRGLQGDSMSGHHYATPLKASACCKHFTAYDLDDWKG 1964
            RWGRGQETPGEDP V  KY+  +VRG+QG +++G   +T L+ASACCKHFTAYDL++WKG
Sbjct: 163  RWGRGQETPGEDPTVTGKYAAVFVRGVQGYALAGAINSTDLEASACCKHFTAYDLENWKG 222

Query: 1963 YSRYSFDAMVSKQDLLDTYQPPFKSCVEGGRASGIMCSYNSVNGIPTCADYNLLTKIARQ 1784
             +RY+FDA V+ QDL DTY PPF+SCVE G ASGIMCSYN VNG+PTCADYNLL+K AR 
Sbjct: 223  VTRYAFDAKVTAQDLADTYNPPFRSCVEDGGASGIMCSYNRVNGVPTCADYNLLSKTARG 282

Query: 1783 DWKFQGYIVSDCDAVSIIQTAHNYTRTAEDAAADVLKAGMDVNCGGHLQKYTIPAIQKGK 1604
            DW+F GYI SDCDAVSII     Y +TAEDA ADVLKAGMDVNCG ++Q++ + AIQ+GK
Sbjct: 283  DWRFYGYITSDCDAVSIIHDVQGYAKTAEDAVADVLKAGMDVNCGSYVQEHGLSAIQQGK 342

Query: 1603 LKESEVDRALHNLFSVRMRLGLFNKETSNHVYGHFNGKDVCSPKHQALALDAVRQGIVLL 1424
            + E +++RALHNLF+VRMRLGLFN     + YG+     VC+ +HQ LAL+A + G+VLL
Sbjct: 343  ITEQDINRALHNLFAVRMRLGLFNGNPKYNRYGNIGPDQVCTQEHQNLALEAAQHGVVLL 402

Query: 1423 KNAPNHLPLSKTAIKSLAVIGPNANNATVMLGNYFGFPCKTVTPLDGLQRYVPNTLYAAG 1244
            KN  N LPLSK+ + S+AVIG NAN+AT +LGNYFG PC +VTPL  LQ YV +T + AG
Sbjct: 403  KNDANALPLSKSQVSSIAVIGHNANDATRLLGNYFGPPCISVTPLQVLQGYVKDTRFLAG 462

Query: 1243 CENVACQSDSMIKEAVYISKKVDQVVLVIGIDQTQEKESQDRVNLNLPGAQEXXXXXXXX 1064
            C + AC   S I EA  ++  VD VVL +G+DQ QE+E  DR+ L+LPG QE        
Sbjct: 463  CNSAACNVSS-IGEAAQLASSVDYVVLFMGLDQDQEREEVDRLELSLPGMQENLINTVAN 521

Query: 1063 XXXXXXXXVILSGGPVDVGFAVNDPHIGSIIWAGYPGEAGGQALAETIFGDHNPGGRLPM 884
                    V+L GGPVDV FA  +P IG+I+WAGYPGEAGG A+A+ +FG+HNPGGRLP+
Sbjct: 522  AAKKPVILVLLCGGPVDVTFAKYNPKIGAILWAGYPGEAGGIAIAQVLFGEHNPGGRLPV 581

Query: 883  TWYPQEYVKIPMTNMNMRPNPKTGYPGRTYRFYNGHKVFEFGHGLSYSRHSYKFSSN--- 713
            TWYP+E+  +PMT+M MR +P TGYPGRTYRFY G+ V++FG+GLSYS++S+ F +N   
Sbjct: 582  TWYPKEFTSVPMTDMRMRADPSTGYPGRTYRFYRGNTVYKFGYGLSYSKYSHHFVANGTK 641

Query: 712  -----TINSLKLKITASNGXXXXXXXXXXXXSKVNVKSISCKELEFKCDIHVKNHGDLDG 548
                 +I+ LK   TA+ G                + + +C +L+F   + V+NHG +DG
Sbjct: 642  LPSLSSIDGLKAMATAAAGTVSYDVE--------EIGTETCDKLKFPALVRVQNHGPMDG 693

Query: 547  RHSVLLFSSSP--SAHSGVPLKQLIAFENLSLMAGQNAKVSFDVKPCEHLSTFQEDGERI 374
            RH VLLF   P  +A  G P  QLI F++L L + Q   V F+V PC+H S   EDG+++
Sbjct: 694  RHPVLLFLRWPNGAADGGRPASQLIGFQSLHLKSMQTVHVEFEVSPCKHFSRATEDGKKV 753

Query: 373  LALGSHNLLVGDVQFPLS 320
            +  GSH ++VGD +F +S
Sbjct: 754  IDHGSHFMMVGDDEFEMS 771


>ref|XP_006647908.1| PREDICTED: probable beta-D-xylosidase 7-like [Oryza brachyantha]
          Length = 780

 Score =  688 bits (1775), Expect = 0.0
 Identities = 342/617 (55%), Positives = 436/617 (70%), Gaps = 9/617 (1%)
 Frame = -1

Query: 2143 RWGRGQETPGEDPLVASKYSIAYVRGLQGDSMSGHHYATPLKASACCKHFTAYDLDDWKG 1964
            RWGRGQETPGEDP V  KY+  +VRG+QG  ++G    T L+ASACCKHFTAYDL++WKG
Sbjct: 168  RWGRGQETPGEDPTVTGKYAAVFVRGVQGYGLAGAVNTTDLEASACCKHFTAYDLENWKG 227

Query: 1963 YSRYSFDAMVSKQDLLDTYQPPFKSCVEGGRASGIMCSYNSVNGIPTCADYNLLTKIARQ 1784
             +RY FDA V+ QDL DTY PPF+SCVE G ASGIMCSYN VNG+PTCADYNLL+K AR 
Sbjct: 228  VTRYVFDAKVTAQDLADTYNPPFRSCVEDGGASGIMCSYNRVNGVPTCADYNLLSKTARG 287

Query: 1783 DWKFQGYIVSDCDAVSIIQTAHNYTRTAEDAAADVLKAGMDVNCGGHLQKYTIPAIQKGK 1604
            DW+F GYI SDCDAVSII  A  Y +TAEDA ADVLKAGMDVNCG ++Q++ + AIQ+GK
Sbjct: 288  DWRFYGYITSDCDAVSIIHDAQGYAQTAEDAVADVLKAGMDVNCGSYVQQHGLSAIQQGK 347

Query: 1603 LKESEVDRALHNLFSVRMRLGLFNKETSNHVYGHFNGKDVCSPKHQALALDAVRQGIVLL 1424
            + E +++RALHNLF+VRMRLGLFN     + YG+     VC+ +HQ LAL+A + GIVLL
Sbjct: 348  ITEQDINRALHNLFAVRMRLGLFNGNPKYNRYGNIGPDQVCTQEHQNLALEAAQDGIVLL 407

Query: 1423 KNAPNHLPLSKTAIKSLAVIGPNANNATVMLGNYFGFPCKTVTPLDGLQRYVPNTLYAAG 1244
            KN  N LPLSK+ + S+AVIG NAN+AT +LGNYFG PC +VTPL  LQ YV +T + AG
Sbjct: 408  KNDANALPLSKSKVSSIAVIGHNANDATRLLGNYFGPPCISVTPLQVLQGYVKDTRFLAG 467

Query: 1243 CENVACQSDSMIKEAVYISKKVDQVVLVIGIDQTQEKESQDRVNLNLPGAQEXXXXXXXX 1064
            C + AC   S I EA  ++  VD VVL +G+DQ QE+E  DR+ L+LPG QE        
Sbjct: 468  CNSAACNVSS-IGEAAQLASSVDYVVLFMGLDQDQEREEVDRLELSLPGMQENLINAVAN 526

Query: 1063 XXXXXXXXVILSGGPVDVGFAVNDPHIGSIIWAGYPGEAGGQALAETIFGDHNPGGRLPM 884
                    V+L GGPVDV FA  +P IG+I+WAGYPGEAGG A+A+ +FG+HNPGGRLP+
Sbjct: 527  AAKKPVILVLLCGGPVDVTFAKYNPKIGAILWAGYPGEAGGIAIAQVLFGEHNPGGRLPV 586

Query: 883  TWYPQEYVKIPMTNMNMRPNPKTGYPGRTYRFYNGHKVFEFGHGLSYSRHSYKFSSN--- 713
            TWYP+E+  +PMT+M MR +P TGYPGRTYRFY G+ V++FG+GLSYS++S+ F +N   
Sbjct: 587  TWYPKEFTSVPMTDMRMRADPSTGYPGRTYRFYRGNTVYKFGYGLSYSKYSHNFVANGTK 646

Query: 712  -----TINSLKLKITASNGXXXXXXXXXXXXSKVNVKSISCKELEFKCDIHVKNHGDLDG 548
                 +IN LK   TA+                  + S +C +L F   + V+N+G +DG
Sbjct: 647  VPSFGSINGLKAMATAAAAGGTVSYDVE------EIGSETCDKLRFPALVRVQNNGPMDG 700

Query: 547  RHSVLLFSSSPSA-HSGVPLKQLIAFENLSLMAGQNAKVSFDVKPCEHLSTFQEDGERIL 371
            RH VLLF   P+A   G P  QLIAF++L L + Q A V F+V PC+H S   E+G++++
Sbjct: 701  RHPVLLFLRWPNATDGGRPASQLIAFKSLHLKSMQTAHVEFEVSPCKHFSRATEEGKKVI 760

Query: 370  ALGSHNLLVGDVQFPLS 320
              GSH ++VGD +F +S
Sbjct: 761  DHGSHFMMVGDDEFEMS 777


>ref|NP_001048140.1| Os02g0752200 [Oryza sativa Japonica Group]
            gi|46390122|dbj|BAD15557.1| putative beta-D-xylosidase
            [Oryza sativa Japonica Group] gi|46390225|dbj|BAD15656.1|
            putative beta-D-xylosidase [Oryza sativa Japonica Group]
            gi|113537671|dbj|BAF10054.1| Os02g0752200 [Oryza sativa
            Japonica Group] gi|125583710|gb|EAZ24641.1| hypothetical
            protein OsJ_08409 [Oryza sativa Japonica Group]
          Length = 780

 Score =  688 bits (1775), Expect = 0.0
 Identities = 339/618 (54%), Positives = 437/618 (70%), Gaps = 10/618 (1%)
 Frame = -1

Query: 2143 RWGRGQETPGEDPLVASKYSIAYVRGLQGDSMSGHHYATPLKASACCKHFTAYDLDDWKG 1964
            RWGRGQETPGEDP V  KY+  +VRG+QG +++G   +T L+ASACCKHFTAYDL++WKG
Sbjct: 169  RWGRGQETPGEDPTVTGKYAAVFVRGVQGYALAGAINSTDLEASACCKHFTAYDLENWKG 228

Query: 1963 YSRYSFDAMVSKQDLLDTYQPPFKSCVEGGRASGIMCSYNSVNGIPTCADYNLLTKIARQ 1784
             +RY+FDA V+ QDL DTY PPF+SCVE G ASGIMCSYN VNG+PTCADYNLL+K AR 
Sbjct: 229  VTRYAFDAKVTAQDLADTYNPPFRSCVEDGGASGIMCSYNRVNGVPTCADYNLLSKTARG 288

Query: 1783 DWKFQGYIVSDCDAVSIIQTAHNYTRTAEDAAADVLKAGMDVNCGGHLQKYTIPAIQKGK 1604
            DW+F GYI SDCDAVSII     Y +TAEDA ADVLKAGMDVNCG ++Q++ + AIQ+GK
Sbjct: 289  DWRFYGYITSDCDAVSIIHDVQGYAKTAEDAVADVLKAGMDVNCGSYVQEHGLSAIQQGK 348

Query: 1603 LKESEVDRALHNLFSVRMRLGLFNKETSNHVYGHFNGKDVCSPKHQALALDAVRQGIVLL 1424
            + E +++RALHNLF+VRMRLGLFN     + YG+     VC+ +HQ LAL+A + G+VLL
Sbjct: 349  ITEQDINRALHNLFAVRMRLGLFNGNPKYNRYGNIGPDQVCTQEHQNLALEAAQHGVVLL 408

Query: 1423 KNAPNHLPLSKTAIKSLAVIGPNANNATVMLGNYFGFPCKTVTPLDGLQRYVPNTLYAAG 1244
            KN  N LPLSK+ + S+AVIG NAN+AT +LGNYFG PC +VTPL  LQ YV +T + AG
Sbjct: 409  KNDANALPLSKSQVSSIAVIGHNANDATRLLGNYFGPPCISVTPLQVLQGYVKDTRFLAG 468

Query: 1243 CENVACQSDSMIKEAVYISKKVDQVVLVIGIDQTQEKESQDRVNLNLPGAQEXXXXXXXX 1064
            C + AC   S I EA  ++  VD VVL +G+DQ QE+E  DR+ L+LPG QE        
Sbjct: 469  CNSAACNVSS-IGEAAQLASSVDYVVLFMGLDQDQEREEVDRLELSLPGMQENLINTVAN 527

Query: 1063 XXXXXXXXVILSGGPVDVGFAVNDPHIGSIIWAGYPGEAGGQALAETIFGDHNPGGRLPM 884
                    V+L GGPVDV FA  +P IG+I+WAGYPGEAGG A+A+ +FG+HNPGGRLP+
Sbjct: 528  AAKKPVILVLLCGGPVDVTFAKYNPKIGAILWAGYPGEAGGIAIAQVLFGEHNPGGRLPV 587

Query: 883  TWYPQEYVKIPMTNMNMRPNPKTGYPGRTYRFYNGHKVFEFGHGLSYSRHSYKFSSN--- 713
            TWYP+E+  +PMT+M MR +P TGYPGRTYRFY G+ V++FG+GLSYS++S+ F +N   
Sbjct: 588  TWYPKEFTSVPMTDMRMRADPSTGYPGRTYRFYRGNTVYKFGYGLSYSKYSHHFVANGTK 647

Query: 712  -----TINSLKLKITASNGXXXXXXXXXXXXSKVNVKSISCKELEFKCDIHVKNHGDLDG 548
                 +I+ LK   TA+ G                +   +C +L+F   + V+NHG +DG
Sbjct: 648  LPSLSSIDGLKAMATAAAGTVSYDVE--------EIGPETCDKLKFPALVRVQNHGPMDG 699

Query: 547  RHSVLLFSSSP--SAHSGVPLKQLIAFENLSLMAGQNAKVSFDVKPCEHLSTFQEDGERI 374
            RH VLLF   P  +A  G P  QLI F++L L + Q   V F+V PC+H S   EDG+++
Sbjct: 700  RHPVLLFLRWPNGAADGGRPASQLIGFQSLHLKSMQTVHVEFEVSPCKHFSRATEDGKKV 759

Query: 373  LALGSHNLLVGDVQFPLS 320
            +  GSH ++VGD +F +S
Sbjct: 760  IDHGSHFMMVGDDEFEMS 777


>ref|XP_004953945.1| PREDICTED: probable beta-D-xylosidase 7-like [Setaria italica]
          Length = 788

 Score =  687 bits (1773), Expect = 0.0
 Identities = 344/620 (55%), Positives = 437/620 (70%), Gaps = 12/620 (1%)
 Frame = -1

Query: 2143 RWGRGQETPGEDPLVASKYSIAYVRGLQGDSMSGHHYATPLKASACCKHFTAYDLDDWKG 1964
            RWGRGQETPGEDP +  KY+  +VRG+QG +++G   +T L+ASACCKHFTAYDL++WKG
Sbjct: 175  RWGRGQETPGEDPTMTGKYAAVFVRGVQGYAIAGPVNSTDLEASACCKHFTAYDLENWKG 234

Query: 1963 YSRYSFDAMVSKQDLLDTYQPPFKSCVEGGRASGIMCSYNSVNGIPTCADYNLLTKIARQ 1784
             +RY FDA V+ QDL DTY PPFKSCVE G ASGIMCSYN VNG+PTCADYNLL+K ARQ
Sbjct: 235  VTRYVFDAQVTVQDLEDTYNPPFKSCVEDGHASGIMCSYNRVNGVPTCADYNLLSKTARQ 294

Query: 1783 DWKFQGYIVSDCDAVSIIQTAHNYTRTAEDAAADVLKAGMDVNCGGHLQKYTIPAIQKGK 1604
            +W F GYI SDCDAVSII  A  Y +TAEDA ADVLKAGMDVNCG ++Q++   A+Q+GK
Sbjct: 295  NWGFYGYITSDCDAVSIIHDAQGYAKTAEDAVADVLKAGMDVNCGSYVQQHGASALQQGK 354

Query: 1603 LKESEVDRALHNLFSVRMRLGLFNKETSNHVYGHFNGKDVCSPKHQALALDAVRQGIVLL 1424
            + E ++DRALHNLF+VRMRLGLFN +   + YG      VC+ +HQ LAL+A + GIVLL
Sbjct: 355  ITEQDIDRALHNLFAVRMRLGLFNGDPRRNRYGDIGPDQVCTQEHQNLALEAAQDGIVLL 414

Query: 1423 KNAPNHLPLSKTAIKSLAVIGPNANNATVMLGNYFGFPCKTVTPLDGLQRYVPNTLYAAG 1244
            KN    LPLSK+ + SL VIG NANNA  +LGNYFG PC TVTPL  LQ YV +T +AAG
Sbjct: 415  KNDAGALPLSKSKVTSLGVIGFNANNAERLLGNYFGPPCVTVTPLQVLQGYVKDTRFAAG 474

Query: 1243 CENVACQSDSMIKEAVYISKKVDQVVLVIGIDQTQEKESQDRVNLNLPGAQEXXXXXXXX 1064
            C   AC + + I EAV ++  VD VVL +G+DQ QE+E  DR++L LPG Q+        
Sbjct: 475  CNAAAC-NVTAIPEAVQVASSVDSVVLFMGLDQDQEREEIDRLDLTLPGQQQSLIESVAN 533

Query: 1063 XXXXXXXXVILSGGPVDVGFAVNDPHIGSIIWAGYPGEAGGQALAETIFGDHNPGGRLPM 884
                    V+L GGPVDV FA  +P IG+I+WAGYPGEAGG A+A+ +FG+HNPGGRLP+
Sbjct: 534  AANKPVILVLLCGGPVDVSFAKTNPKIGAILWAGYPGEAGGMAIAQVLFGEHNPGGRLPV 593

Query: 883  TWYPQEYVKIPMTNMNMRPNPKTGYPGRTYRFYNGHKVFEFGHGLSYSRHSYKF------ 722
            TWYPQ++ K+PMT+M MR +P TGYPGRTYRFY G  VF+FG+GLSYS++S++F      
Sbjct: 594  TWYPQDFTKVPMTDMRMRADPATGYPGRTYRFYRGPTVFDFGYGLSYSKYSHRFVASGTK 653

Query: 721  --SSNTINSLK-LKITASNGXXXXXXXXXXXXSKVNVKSISCKELEFKCDIHVKNHGDLD 551
              S + I  LK L+ T++ G                + S +C+ L+F   + V+NHG +D
Sbjct: 654  PPSMSDIAGLKALETTSAAGAAMYDVEA--------MGSEACERLKFPAVVRVQNHGPMD 705

Query: 550  GRHSVLLFSSSPSA---HSGVPLKQLIAFENLSLMAGQNAKVSFDVKPCEHLSTFQEDGE 380
            G+H VL+F   P+A    SG P +QLI F  L L A Q A V F+V PC+H S   EDG 
Sbjct: 706  GKHPVLVFLRWPNATDDGSGRPARQLIGFRTLHLRAMQTAHVEFEVSPCKHFSRASEDGR 765

Query: 379  RILALGSHNLLVGDVQFPLS 320
            +++  GSH ++VG+ +F +S
Sbjct: 766  KVIDQGSHIVMVGEDEFEMS 785


>ref|XP_002302285.1| glycosyl hydrolase family 3 family protein [Populus trichocarpa]
            gi|222844011|gb|EEE81558.1| glycosyl hydrolase family 3
            family protein [Populus trichocarpa]
          Length = 773

 Score =  687 bits (1773), Expect = 0.0
 Identities = 337/611 (55%), Positives = 428/611 (70%)
 Frame = -1

Query: 2143 RWGRGQETPGEDPLVASKYSIAYVRGLQGDSMSGHHYATPLKASACCKHFTAYDLDDWKG 1964
            RWGRGQETPGEDPLV   Y+ +YV+G+QGDS  G      L+ASACCKHFTAYDLD+WKG
Sbjct: 165  RWGRGQETPGEDPLVTGLYAASYVKGVQGDSFEGGKIKGHLQASACCKHFTAYDLDNWKG 224

Query: 1963 YSRYSFDAMVSKQDLLDTYQPPFKSCVEGGRASGIMCSYNSVNGIPTCADYNLLTKIARQ 1784
             +R+ FDA V+ QDL DTYQPPFKSCVE GRASGIMC+YN VNG+P+CAD NLL+K AR 
Sbjct: 225  MNRFVFDARVTMQDLADTYQPPFKSCVEQGRASGIMCAYNKVNGVPSCADSNLLSKTARA 284

Query: 1783 DWKFQGYIVSDCDAVSIIQTAHNYTRTAEDAAADVLKAGMDVNCGGHLQKYTIPAIQKGK 1604
             W F+GYI SDCDAVSII     Y ++ EDA  DVLKAGMDVNCG +L K+   A+++ K
Sbjct: 285  QWGFRGYITSDCDAVSIIHDDQGYAKSPEDAVVDVLKAGMDVNCGSYLLKHAKVAVEQKK 344

Query: 1603 LKESEVDRALHNLFSVRMRLGLFNKETSNHVYGHFNGKDVCSPKHQALALDAVRQGIVLL 1424
            L ES++D+ALHNLFSVRMRLGLFN      ++G+     VCS +HQ LAL+A R GIVLL
Sbjct: 345  LSESDIDKALHNLFSVRMRLGLFNGRPEGQLFGNIGPDQVCSQEHQILALEAARNGIVLL 404

Query: 1423 KNAPNHLPLSKTAIKSLAVIGPNANNATVMLGNYFGFPCKTVTPLDGLQRYVPNTLYAAG 1244
            KN+   LPLSK+  KSLAVIGPNAN+  ++LGNY G PC+ VTPL  LQ Y+  T+Y   
Sbjct: 405  KNSARLLPLSKSKTKSLAVIGPNANSGQMLLGNYAGPPCRFVTPLQALQSYIKQTVYHPA 464

Query: 1243 CENVACQSDSMIKEAVYISKKVDQVVLVIGIDQTQEKESQDRVNLNLPGAQEXXXXXXXX 1064
            C+ V C S S +  AV ++K  D VVL++G+DQTQE+E  DR +L LPG Q+        
Sbjct: 465  CDTVQCSSAS-VDRAVDVAKGADNVVLMMGLDQTQEREELDRTDLLLPGKQQELIIAVAK 523

Query: 1063 XXXXXXXXVILSGGPVDVGFAVNDPHIGSIIWAGYPGEAGGQALAETIFGDHNPGGRLPM 884
                    V+ SGGPVD+ FA ND +IGSI+WAGYPGE G  ALAE +FGDHNPGGRLPM
Sbjct: 524  AAKNPVVLVLFSGGPVDISFAKNDKNIGSILWAGYPGEGGAIALAEIVFGDHNPGGRLPM 583

Query: 883  TWYPQEYVKIPMTNMNMRPNPKTGYPGRTYRFYNGHKVFEFGHGLSYSRHSYKFSSNTIN 704
            TWYPQE+VK+PMT+M MRP   +GYPGRTYRFY G  VFEFG+G+SYS++SY+ ++ + N
Sbjct: 584  TWYPQEFVKVPMTDMGMRPEASSGYPGRTYRFYRGRSVFEFGYGISYSKYSYELTAVSQN 643

Query: 703  SLKLKITASNGXXXXXXXXXXXXSKVNVKSISCKELEFKCDIHVKNHGDLDGRHSVLLFS 524
            +L L   +S                  + +  C++ + +  I VKNHG++ G+H VLLF+
Sbjct: 644  TLYLN-QSSTMHIINDFDSVRSTLISELGTEFCEQNKCRARIGVKNHGEMAGKHPVLLFA 702

Query: 523  SSPSAHSGVPLKQLIAFENLSLMAGQNAKVSFDVKPCEHLSTFQEDGERILALGSHNLLV 344
                  +G P KQLI F+++ L AG+ A++ F+V PCEHLS   EDG  ++  G H L+V
Sbjct: 703  RQEKHGNGRPRKQLIGFQSVVLGAGERAEIEFEVSPCEHLSRANEDGLMVMEEGRHFLVV 762

Query: 343  GDVQFPLSVEV 311
               ++P+SV +
Sbjct: 763  DGDEYPISVVI 773


>ref|XP_002285805.1| PREDICTED: probable beta-D-xylosidase 7-like [Vitis vinifera]
          Length = 774

 Score =  681 bits (1758), Expect = 0.0
 Identities = 340/611 (55%), Positives = 421/611 (68%)
 Frame = -1

Query: 2143 RWGRGQETPGEDPLVASKYSIAYVRGLQGDSMSGHHYATPLKASACCKHFTAYDLDDWKG 1964
            RWGRGQETPGEDPLV   Y+++YVRG+QGD + G      L+ASACCKHFTAYDLDDWKG
Sbjct: 166  RWGRGQETPGEDPLVTGSYAVSYVRGVQGDCLRGLKRCGELQASACCKHFTAYDLDDWKG 225

Query: 1963 YSRYSFDAMVSKQDLLDTYQPPFKSCVEGGRASGIMCSYNSVNGIPTCADYNLLTKIARQ 1784
              R+ FDA V+ QDL DTYQPPF  C+E GRASGIMC+YN VNG+P+CAD+NLLT  AR+
Sbjct: 226  IDRFKFDARVTMQDLADTYQPPFHRCIEEGRASGIMCAYNRVNGVPSCADFNLLTNTARK 285

Query: 1783 DWKFQGYIVSDCDAVSIIQTAHNYTRTAEDAAADVLKAGMDVNCGGHLQKYTIPAIQKGK 1604
             W FQGYI SDCDAVS+I  ++ + +T EDA  DVLKAGMDVNCG +L  +T  A+ + K
Sbjct: 286  RWNFQGYITSDCDAVSLIHDSYGFAKTPEDAVVDVLKAGMDVNCGTYLLNHTKSAVMQKK 345

Query: 1603 LKESEVDRALHNLFSVRMRLGLFNKETSNHVYGHFNGKDVCSPKHQALALDAVRQGIVLL 1424
            L ESE+DRAL NLF+VRMRLGLFN       YG      VCS +HQ LALDA R GIVLL
Sbjct: 346  LPESELDRALENLFAVRMRLGLFNGNPKGQPYGDIGPNQVCSVEHQTLALDAARDGIVLL 405

Query: 1423 KNAPNHLPLSKTAIKSLAVIGPNANNATVMLGNYFGFPCKTVTPLDGLQRYVPNTLYAAG 1244
            KN+   LPL K    SLAVIGPNAN+   ++GNY G PCK +TPL  LQ YV +T+Y  G
Sbjct: 406  KNSQRLLPLPKGKTMSLAVIGPNANSPKTLIGNYAGPPCKFITPLQALQSYVKSTMYHPG 465

Query: 1243 CENVACQSDSMIKEAVYISKKVDQVVLVIGIDQTQEKESQDRVNLNLPGAQEXXXXXXXX 1064
            C+ VAC S S I++AV I++K D VVLV+G+DQTQE+E+ DR++L LPG Q+        
Sbjct: 466  CDAVACSSPS-IEKAVEIAQKADYVVLVMGLDQTQEREAHDRLDLVLPGKQQQLIICVAN 524

Query: 1063 XXXXXXXXVILSGGPVDVGFAVNDPHIGSIIWAGYPGEAGGQALAETIFGDHNPGGRLPM 884
                    V+LSGGPVD+ FA    +IGSI+WAGYPG AGG A+AETIFGDHNPGGRLP+
Sbjct: 525  AAKKPVVLVLLSGGPVDISFAKYSNNIGSILWAGYPGGAGGAAIAETIFGDHNPGGRLPV 584

Query: 883  TWYPQEYVKIPMTNMNMRPNPKTGYPGRTYRFYNGHKVFEFGHGLSYSRHSYKFSSNTIN 704
            TWYPQ++ KIPMT+M MRP   +GYPGRTYRFY G KVFEFG+GLSYS +S +    T N
Sbjct: 585  TWYPQDFTKIPMTDMRMRPESNSGYPGRTYRFYTGEKVFEFGYGLSYSTYSCETIPVTRN 644

Query: 703  SLKLKITASNGXXXXXXXXXXXXSKVNVKSISCKELEFKCDIHVKNHGDLDGRHSVLLFS 524
             L     +S              S   +    C        I V+N G++ G+HSVLLF 
Sbjct: 645  KLYFN-QSSTAHVYENTDSIRYTSVAELGKELCDSNNISISIRVRNDGEMAGKHSVLLFV 703

Query: 523  SSPSAHSGVPLKQLIAFENLSLMAGQNAKVSFDVKPCEHLSTFQEDGERILALGSHNLLV 344
                A +G P+KQL+AF+++ L  G++A V F + PCEH S   +DG  ++  G+H L+V
Sbjct: 704  RRLKASAGSPIKQLVAFQSVHLNGGESADVGFLLNPCEHFSGPNKDGLMVIEEGTHFLVV 763

Query: 343  GDVQFPLSVEV 311
            GD + P++V V
Sbjct: 764  GDQEHPVTVVV 774


>ref|NP_001266114.1| SlArf/Xyl4 protein precursor [Solanum lycopersicum]
            gi|371917286|dbj|BAL44719.1| SlArf/Xyl4 [Solanum
            lycopersicum]
          Length = 775

 Score =  681 bits (1757), Expect = 0.0
 Identities = 332/612 (54%), Positives = 429/612 (70%), Gaps = 1/612 (0%)
 Frame = -1

Query: 2143 RWGRGQETPGEDPLVASKYSIAYVRGLQGDSMSGHHYATP-LKASACCKHFTAYDLDDWK 1967
            RWGRGQETPGEDP++  KY+I YVRG+QGDS +G       L+ASACCKHFTAYDLD WK
Sbjct: 166  RWGRGQETPGEDPIMTGKYAIRYVRGVQGDSFNGGQLKKGHLQASACCKHFTAYDLDQWK 225

Query: 1966 GYSRYSFDAMVSKQDLLDTYQPPFKSCVEGGRASGIMCSYNSVNGIPTCADYNLLTKIAR 1787
               R+SF+A+V+ QD+ DT+QPPF+ C++  +ASGIMCSYNSVNGIP+CA+YNLLTK AR
Sbjct: 226  NLDRFSFNAIVTPQDMADTFQPPFQDCIQKAQASGIMCSYNSVNGIPSCANYNLLTKTAR 285

Query: 1786 QDWKFQGYIVSDCDAVSIIQTAHNYTRTAEDAAADVLKAGMDVNCGGHLQKYTIPAIQKG 1607
            Q W F GYI SDCDAV ++   H Y  T ED+ A  LKAGMD++CG +L+KYT  A+ K 
Sbjct: 286  QQWGFHGYITSDCDAVQVMHDNHRYGNTPEDSTAFALKAGMDIDCGDYLKKYTKSAVMKK 345

Query: 1606 KLKESEVDRALHNLFSVRMRLGLFNKETSNHVYGHFNGKDVCSPKHQALALDAVRQGIVL 1427
            K+ +  +DRALHNLFS+RMRLGLFN +    +YG+ +   VC+P+HQ LAL+A R GIVL
Sbjct: 346  KVSQVHIDRALHNLFSIRMRLGLFNGDPRKQLYGNISPSQVCAPQHQQLALEAARNGIVL 405

Query: 1426 LKNAPNHLPLSKTAIKSLAVIGPNANNATVMLGNYFGFPCKTVTPLDGLQRYVPNTLYAA 1247
            LKN    LPLSK    SLAVIG NANNA ++ GNY G PCK +  L  L  Y  +  Y  
Sbjct: 406  LKNTGKLLPLSKAKTNSLAVIGHNANNAYILRGNYDGPPCKYIEILKALVGYAKSVQYQQ 465

Query: 1246 GCENVACQSDSMIKEAVYISKKVDQVVLVIGIDQTQEKESQDRVNLNLPGAQEXXXXXXX 1067
            GC    C S + I +AV I++  D VVL++G+DQTQE+E  DR +L LPG QE       
Sbjct: 466  GCNAANCTS-ANIDQAVNIARNADYVVLIMGLDQTQEREQFDRDDLVLPGQQENLINSVA 524

Query: 1066 XXXXXXXXXVILSGGPVDVGFAVNDPHIGSIIWAGYPGEAGGQALAETIFGDHNPGGRLP 887
                     VILSGGPVD+ FA  +P IGSI+WAGYPGEAGG ALAE IFG+HNPGG+LP
Sbjct: 525  KAAKKPVILVILSGGPVDISFAKYNPKIGSILWAGYPGEAGGIALAEIIFGEHNPGGKLP 584

Query: 886  MTWYPQEYVKIPMTNMNMRPNPKTGYPGRTYRFYNGHKVFEFGHGLSYSRHSYKFSSNTI 707
            +TWYPQ +VKIPMT+M MRP+PKTGYPGRTYRFY G KV+EFG+GLSY+ +SY F S T 
Sbjct: 585  VTWYPQAFVKIPMTDMRMRPDPKTGYPGRTYRFYKGPKVYEFGYGLSYTTYSYGFHSATP 644

Query: 706  NSLKLKITASNGXXXXXXXXXXXXSKVNVKSISCKELEFKCDIHVKNHGDLDGRHSVLLF 527
            N+++L    S                  + S +C++ +F   + V+N G++DG+H VLLF
Sbjct: 645  NTIQLNQLLSVKTVENSDSIRYTFVD-EIGSDNCEKAKFSAHVSVENSGEMDGKHPVLLF 703

Query: 526  SSSPSAHSGVPLKQLIAFENLSLMAGQNAKVSFDVKPCEHLSTFQEDGERILALGSHNLL 347
                 A +G P+KQL+ F+++SL AG+N+++ F++ PCEHLS+  EDG  ++  GS  L+
Sbjct: 704  VKQDKARNGSPIKQLVGFQSVSLKAGENSQLVFEISPCEHLSSANEDGLMMIEEGSRYLV 763

Query: 346  VGDVQFPLSVEV 311
            VGD + P+++ +
Sbjct: 764  VGDAEHPINIMI 775


>ref|XP_002302284.2| glycosyl hydrolase family 3 family protein [Populus trichocarpa]
            gi|550344639|gb|EEE81557.2| glycosyl hydrolase family 3
            family protein [Populus trichocarpa]
          Length = 742

 Score =  680 bits (1754), Expect = 0.0
 Identities = 342/609 (56%), Positives = 416/609 (68%)
 Frame = -1

Query: 2143 RWGRGQETPGEDPLVASKYSIAYVRGLQGDSMSGHHYATPLKASACCKHFTAYDLDDWKG 1964
            RWGRGQETPGEDPLVA KY+++YVRG+QGDS  G      L+ASACCKHFTAYDLD WKG
Sbjct: 172  RWGRGQETPGEDPLVAGKYAVSYVRGVQGDSFGGGTLGEQLQASACCKHFTAYDLDKWKG 231

Query: 1963 YSRYSFDAMVSKQDLLDTYQPPFKSCVEGGRASGIMCSYNSVNGIPTCADYNLLTKIARQ 1784
             +R+ FDA    QDL DTYQPPF+SC++ G+ASGIMC+YN VNG+P CADYNLL+K AR 
Sbjct: 232  MNRFVFDA----QDLADTYQPPFQSCIQEGKASGIMCAYNRVNGVPNCADYNLLSKKARG 287

Query: 1783 DWKFQGYIVSDCDAVSIIQTAHNYTRTAEDAAADVLKAGMDVNCGGHLQKYTIPAIQKGK 1604
             W F GYI SDCDAV+II     Y ++ EDA ADVLKAGMDVNCG +L+ YT  A++K K
Sbjct: 288  QWGFYGYITSDCDAVAIIHDDQGYAKSPEDAVADVLKAGMDVNCGDYLKNYTKSAVKKKK 347

Query: 1603 LKESEVDRALHNLFSVRMRLGLFNKETSNHVYGHFNGKDVCSPKHQALALDAVRQGIVLL 1424
            L ESE+DRALHNLFS+RMRLGLFN   +   YG+     VCS +HQALAL A + GIVLL
Sbjct: 348  LPESEIDRALHNLFSIRMRLGLFNGNPTKQPYGNIAPDQVCSQEHQALALKAAQDGIVLL 407

Query: 1423 KNAPNHLPLSKTAIKSLAVIGPNANNATVMLGNYFGFPCKTVTPLDGLQRYVPNTLYAAG 1244
            KN    LPLSK   KSLAVIGPNANN+T +LGNYFG PCKTVTPL GLQ Y+ NT Y  G
Sbjct: 408  KNPDKLLPLSKLETKSLAVIGPNANNSTKLLGNYFGPPCKTVTPLQGLQNYIKNTRYHPG 467

Query: 1243 CENVACQSDSMIKEAVYISKKVDQVVLVIGIDQTQEKESQDRVNLNLPGAQEXXXXXXXX 1064
            C  VAC S S I +AV I+K  DQV+LV+G+DQTQEKE QDRV+L LPG Q         
Sbjct: 468  CSRVACSSAS-INQAVKIAKGADQVILVMGLDQTQEKEEQDRVDLVLPGKQRELITAVAK 526

Query: 1063 XXXXXXXXVILSGGPVDVGFAVNDPHIGSIIWAGYPGEAGGQALAETIFGDHNPGGRLPM 884
                    V+  GGPVDV FA  D +IGSIIWAGYPGEAGG ALA+ IFGDHNPGGRLPM
Sbjct: 527  AAKKPVVLVLFCGGPVDVSFAKYDQNIGSIIWAGYPGEAGGTALAQIIFGDHNPGGRLPM 586

Query: 883  TWYPQEYVKIPMTNMNMRPNPKTGYPGRTYRFYNGHKVFEFGHGLSYSRHSYKFSSNTIN 704
            TWYPQ++ K+PMT+M MRP   +GYPGRTYRFYNG KVFEFG+GLSYS +SY+ +S+  N
Sbjct: 587  TWYPQDFTKVPMTDMRMRPQLSSGYPGRTYRFYNGKKVFEFGYGLSYSNYSYELASDAQN 646

Query: 703  SLKLKITASNGXXXXXXXXXXXXSKVNVKSISCKELEFKCDIHVKNHGDLDGRHSVLLFS 524
              KL + AS+                N+    C++ +F   + VKNHG++ G +      
Sbjct: 647  --KLYLRASSNQITKNSNTIRHKLISNIGKELCEKTKFTVTVRVKNHGEMAGEN------ 698

Query: 523  SSPSAHSGVPLKQLIAFENLSLMAGQNAKVSFDVKPCEHLSTFQEDGERILALGSHNLLV 344
                                       A++ +++ PCEHLS+  + G  ++  GS  LL+
Sbjct: 699  ---------------------------AEIQYELSPCEHLSSPDDRGMMVMEEGSQFLLI 731

Query: 343  GDVQFPLSV 317
            GD ++P+++
Sbjct: 732  GDKEYPITI 740


>gb|EMJ26446.1| hypothetical protein PRUPE_ppa001675mg [Prunus persica]
          Length = 781

 Score =  679 bits (1751), Expect = 0.0
 Identities = 340/614 (55%), Positives = 428/614 (69%), Gaps = 3/614 (0%)
 Frame = -1

Query: 2143 RWGRGQETPGEDPLVASKYSIAYVRGLQGDSMSGHHYATP--LKASACCKHFTAYDLDDW 1970
            RWGRGQETPGEDPLV  KY+++YVRG+QGDS  G        L+ASACCKHFTAYDLD+W
Sbjct: 170  RWGRGQETPGEDPLVVGKYAVSYVRGVQGDSFEGGKLKVGGRLQASACCKHFTAYDLDNW 229

Query: 1969 KGYSRYSFDAMVSKQDLLDTYQPPFKSCVEGGRASGIMCSYNSVNGIPTCADYNLLTKIA 1790
            K  +R+ FDA VS+QDL DTYQPPFKSCV+ G+ASGIMC+YN VNG+P+CADYNLLTK+A
Sbjct: 230  KSVTRFGFDARVSEQDLADTYQPPFKSCVQQGQASGIMCAYNRVNGVPSCADYNLLTKVA 289

Query: 1789 RQDWKFQGYIVSDCDAVSIIQTAHNYTRTAEDAAADVLKAGMDVNCGGHLQKYTIPAIQK 1610
            R  W F GYI SDCDAVSII+    Y +T EDA  DVLKAGMDVNCG +L+ +T  A+Q+
Sbjct: 290  RGQWDFHGYITSDCDAVSIIRDVQGYAKTPEDAVGDVLKAGMDVNCGSYLKDHTKSAVQQ 349

Query: 1609 GKLKESEVDRALHNLFSVRMRLGLFNKETSNHVYGHFNGKDVCSPKHQALALDAVRQGIV 1430
             KL  SE+DRALHNLFS+RMRLGLF+       YG+      CS +HQALAL+A + GIV
Sbjct: 350  KKLDVSEIDRALHNLFSIRMRLGLFDGSPLEQPYGNIGPDQACSKEHQALALEAAQDGIV 409

Query: 1429 LLKNAPNHLPLSKTAIKSLAVIGPNANNATVMLGNYFGFPCKTVTPLDGLQRYVPNTLYA 1250
            LLKN+   LPL K+   SLAVIGPNAN +  +LGNY G PCK++TPL  LQ Y   T Y 
Sbjct: 410  LLKNSGRLLPLPKSKAISLAVIGPNANASETLLGNYHGRPCKSITPLKALQGYAKYTNYE 469

Query: 1249 AGCENVACQSDSMIKEAVYISKKVDQVVLVIGIDQTQEKESQDRVNLNLPGAQEXXXXXX 1070
            AGC+ V C   + I +AV  +K  D VVL++G+DQ+QE+E+ DR +L LPG Q+      
Sbjct: 470  AGCDTVKC-PQATIDKAVEAAKAADYVVLIMGLDQSQEREAHDRRHLGLPGKQQELISSV 528

Query: 1069 XXXXXXXXXXVILSGGPVDVGFAVNDPHIGSIIWAGYPGEAGGQALAETIFGDHNPGGRL 890
                      VILSGGPVD+  A  D  IG I+WAGYPGEAGG ALAE IFGDHNPGGRL
Sbjct: 529  AKAAKKPVILVILSGGPVDITPAKYDKKIGGILWAGYPGEAGGIALAEIIFGDHNPGGRL 588

Query: 889  PMTWYPQEYVKIPMTNMNMRPNPKTGYPGRTYRFYNGHKVFEFGHGLSYSRHSYKFSSNT 710
            P+TWY Q+YVK+PMT+M MRP+ KTGYPGRTYRFY G  V+ FG GLSYS + Y+F+S  
Sbjct: 589  PVTWYTQDYVKVPMTDMRMRPDTKTGYPGRTYRFYKGGNVYHFGFGLSYSNYIYEFAS-A 647

Query: 709  INSLKLKITASNGXXXXXXXXXXXXSKV-NVKSISCKELEFKCDIHVKNHGDLDGRHSVL 533
            I   KL +  S+               + ++    C++ +F   + VKNHG++ G+H VL
Sbjct: 648  IAQNKLYLNESSISPEVESSDSGHFRLIPDLSEEFCEKKKFPVRVAVKNHGEMVGKHPVL 707

Query: 532  LFSSSPSAHSGVPLKQLIAFENLSLMAGQNAKVSFDVKPCEHLSTFQEDGERILALGSHN 353
            LF    + ++G P+KQL+ F+++ L AG+ A++ F + PCEHLS   E G  ++  GS+ 
Sbjct: 708  LFVGQKNPNNGSPMKQLVGFQSVILSAGERAELEFILNPCEHLSHANEGGLMVVEEGSYF 767

Query: 352  LLVGDVQFPLSVEV 311
            L VGDV++PL + V
Sbjct: 768  LQVGDVEYPLDIIV 781


>ref|XP_006354009.1| PREDICTED: probable beta-D-xylosidase 7-like [Solanum tuberosum]
          Length = 775

 Score =  676 bits (1745), Expect = 0.0
 Identities = 333/610 (54%), Positives = 429/610 (70%), Gaps = 1/610 (0%)
 Frame = -1

Query: 2143 RWGRGQETPGEDPLVASKYSIAYVRGLQGDSMSGHHYATP-LKASACCKHFTAYDLDDWK 1967
            RWGRGQETPGEDP++  KY+I YVRG+QGDS +G       L+ASACCKHFTAYDLD WK
Sbjct: 166  RWGRGQETPGEDPIMTGKYAIRYVRGVQGDSFNGGQLKKGHLQASACCKHFTAYDLDQWK 225

Query: 1966 GYSRYSFDAMVSKQDLLDTYQPPFKSCVEGGRASGIMCSYNSVNGIPTCADYNLLTKIAR 1787
               R+SF+A+V+ QD+ DT+QPPF+ C++  +ASGIMCSYNSVNGIP+CA+YNLLTK AR
Sbjct: 226  NLDRFSFNAIVTPQDMADTFQPPFQDCIQKAQASGIMCSYNSVNGIPSCANYNLLTKTAR 285

Query: 1786 QDWKFQGYIVSDCDAVSIIQTAHNYTRTAEDAAADVLKAGMDVNCGGHLQKYTIPAIQKG 1607
            Q W F GYI SDCDAV ++   H Y  T ED+ A  LKAGMD++CG +L+KYT  A+ K 
Sbjct: 286  QQWGFHGYITSDCDAVQVMHDNHRYGNTPEDSTAFALKAGMDIDCGDYLKKYTKSAVMKK 345

Query: 1606 KLKESEVDRALHNLFSVRMRLGLFNKETSNHVYGHFNGKDVCSPKHQALALDAVRQGIVL 1427
            K+ +  +DRALHNLFS+RMRLGLFN +    +YG+ +   VC+P+HQ LAL+A R GIVL
Sbjct: 346  KVSQVHIDRALHNLFSIRMRLGLFNGDPRKQLYGNISPSLVCAPQHQELALEAARNGIVL 405

Query: 1426 LKNAPNHLPLSKTAIKSLAVIGPNANNATVMLGNYFGFPCKTVTPLDGLQRYVPNTLYAA 1247
            LKN    LPLSK    SLAVIG NANNA ++ GNY G PCK +  L  L  Y  +  Y  
Sbjct: 406  LKNTGKLLPLSKAKTNSLAVIGHNANNAYILRGNYDGPPCKYIEILKALVGYAKSVQYQQ 465

Query: 1246 GCENVACQSDSMIKEAVYISKKVDQVVLVIGIDQTQEKESQDRVNLNLPGAQEXXXXXXX 1067
            GC    C S + I +AV I+   D VVLV+G+DQTQE+E  DR +L LPG QE       
Sbjct: 466  GCNAANCTS-ADINQAVNIATNADYVVLVMGLDQTQEREQFDRDDLVLPGQQENLINSVA 524

Query: 1066 XXXXXXXXXVILSGGPVDVGFAVNDPHIGSIIWAGYPGEAGGQALAETIFGDHNPGGRLP 887
                     VILSGGPVD+ FA  +P IGSI+WAGYPGEAGG ALAE IFG+HNPGG+LP
Sbjct: 525  KAAKKPVILVILSGGPVDISFAKYNPKIGSILWAGYPGEAGGIALAEIIFGEHNPGGKLP 584

Query: 886  MTWYPQEYVKIPMTNMNMRPNPKTGYPGRTYRFYNGHKVFEFGHGLSYSRHSYKFSSNTI 707
            +TWYPQ +VKIPMT+M MRP+PKTGYPGRTYRFY G KV+EFG+GLSY+ +SY F S T 
Sbjct: 585  VTWYPQAFVKIPMTDMRMRPDPKTGYPGRTYRFYKGPKVYEFGYGLSYTTYSYGFHSATP 644

Query: 706  NSLKLKITASNGXXXXXXXXXXXXSKVNVKSISCKELEFKCDIHVKNHGDLDGRHSVLLF 527
            N+++L    S+             S   + S +C++ +F   + V+N G++DG+H VLLF
Sbjct: 645  NTVQLN-QLSSVKTVENSDSIRYTSVDEIGSDNCEKAKFSAHVSVENSGEMDGKHPVLLF 703

Query: 526  SSSPSAHSGVPLKQLIAFENLSLMAGQNAKVSFDVKPCEHLSTFQEDGERILALGSHNLL 347
                 A +G P+KQL+ F+++SL AG+++++ F++ PCEHLS+  EDG  ++  GS  L+
Sbjct: 704  VKQDKARNGRPIKQLVGFQSVSLKAGEDSQLVFEISPCEHLSSANEDGLMMIEEGSRYLV 763

Query: 346  VGDVQFPLSV 317
            VGD + P+++
Sbjct: 764  VGDAEHPINI 773

