BLASTX nr result

ID: Akebia24_contig00028587 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00028587
         (1298 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EXC06151.1| putative glycosyltransferase [Morus notabilis]         596   e-168
ref|XP_006438045.1| hypothetical protein CICLE_v10031600mg [Citr...   596   e-168
ref|XP_002512333.1| catalytic, putative [Ricinus communis] gi|22...   596   e-168
ref|XP_004310070.1| PREDICTED: probable glycosyltransferase At3g...   590   e-166
ref|XP_007044049.1| Exostosin family protein isoform 1 [Theobrom...   587   e-165
ref|XP_004144198.1| PREDICTED: probable glycosyltransferase At3g...   585   e-164
ref|XP_004158257.1| PREDICTED: LOW QUALITY PROTEIN: probable gly...   583   e-164
ref|XP_007227458.1| hypothetical protein PRUPE_ppa005995mg [Prun...   582   e-163
ref|XP_004504444.1| PREDICTED: probable glycosyltransferase At3g...   581   e-163
ref|XP_003524893.1| PREDICTED: probable glucuronosyltransferase ...   581   e-163
ref|XP_003531191.2| PREDICTED: probable glycosyltransferase At3g...   580   e-163
ref|XP_006848395.1| hypothetical protein AMTR_s00013p00218260 [A...   571   e-160
ref|XP_007158691.1| hypothetical protein PHAVU_002G174100g [Phas...   568   e-159
ref|XP_002272591.2| PREDICTED: LOW QUALITY PROTEIN: probable gly...   566   e-159
ref|XP_004502300.1| PREDICTED: probable glycosyltransferase At3g...   563   e-158
ref|XP_003601797.1| hypothetical protein MTR_3g085480 [Medicago ...   549   e-154
ref|XP_006391283.1| hypothetical protein EUTSA_v10018590mg [Eutr...   548   e-153
ref|XP_002311068.2| exostosin family protein [Populus trichocarp...   546   e-153
ref|XP_002888596.1| exostosin family protein [Arabidopsis lyrata...   542   e-151
ref|XP_006302174.1| hypothetical protein CARUB_v10020184mg [Caps...   540   e-151

>gb|EXC06151.1| putative glycosyltransferase [Morus notabilis]
          Length = 469

 Score =  596 bits (1537), Expect = e-168
 Identities = 284/396 (71%), Positives = 332/396 (83%)
 Frame = +2

Query: 110  GTVDLKSHFFPXXXXXXXXXXXXXXXXXCNIESPLRIFMYDLPIRFNLGMINHQISNEIL 289
            GTVDL+S+FFP                      PLR+FMYDLP RFN+GM+N + S++  
Sbjct: 62   GTVDLRSYFFPLLQSPPGARPLCATIAS---PLPLRVFMYDLPRRFNVGMLNRRSSDQAP 118

Query: 290  VNGRNFPPWPSNSGLKKQHSVEYWMMASIIYEGNQKWREVVRVSDPGIADAFFVPFFSSM 469
            V  + +PPWP NSGLK+QHSVEYWMM S++Y+G+   REVVRVSDP +A+AFFVPFFSS+
Sbjct: 119  VTAQTWPPWPKNSGLKRQHSVEYWMMGSLLYDGDG--REVVRVSDPEMAEAFFVPFFSSL 176

Query: 470  SFNTHGHNMTDPETEIDRQLQIDILEFLRKSQYWQRSGGRDHVIPMHHPNAFRFLREQVN 649
            SFNTHGHNMTDP+T ID QLQID+LEFL +S+YW+R GGRDHVIPM HPNAFRFLR ++N
Sbjct: 177  SFNTHGHNMTDPKTRIDHQLQIDLLEFLGESKYWKRYGGRDHVIPMTHPNAFRFLRAELN 236

Query: 650  ASILIVADFGRYPKTMSRLSKDVVAPYVHVVESFIDDNSPNPFESRNTLLFFRGRTVRKD 829
            ASI IV DFGR+P+TMS L KDVVAPYVHVV+SF DD+  +P+ESR TLLFFRGRT RKD
Sbjct: 237  ASIQIVVDFGRHPRTMSNLGKDVVAPYVHVVDSFTDDDLSDPYESRTTLLFFRGRTFRKD 296

Query: 830  EGIVRAKLGKILVGYEDVHYEQSFATGESIKASSQGMRSSKFCLHPAGDTPSSCRLFDAI 1009
            EGIVR KL K+L GY+DVHYE+S ATGE+IKASS GMR SKFCLHPAGDTPSSCRLFDAI
Sbjct: 297  EGIVRVKLAKVLAGYDDVHYERSVATGENIKASSLGMRLSKFCLHPAGDTPSSCRLFDAI 356

Query: 1010 VSHCVPVIVSDKIELPFEDELDYTQFSIFFSVQEALKPDYMIGQLRKFPKERWVEMWRKL 1189
            VSHCVPVIVSD+IELPFEDE+DY+QFS+FFS +EAL+P YM+ QLRKFPKE+WVEMWR+L
Sbjct: 357  VSHCVPVIVSDQIELPFEDEIDYSQFSLFFSFKEALEPGYMVEQLRKFPKEKWVEMWRRL 416

Query: 1190 KNISHHYEFQYPPKKEDAVNMLWRQVRRKVPGAKLS 1297
            KNISHH+EFQYPP KEDAV+MLWRQV+ KVPG  L+
Sbjct: 417  KNISHHFEFQYPPNKEDAVDMLWRQVKHKVPGVNLA 452


>ref|XP_006438045.1| hypothetical protein CICLE_v10031600mg [Citrus clementina]
            gi|567891051|ref|XP_006438046.1| hypothetical protein
            CICLE_v10031600mg [Citrus clementina]
            gi|568861185|ref|XP_006484086.1| PREDICTED: probable
            glycosyltransferase At3g07620-like isoform X1 [Citrus
            sinensis] gi|568861187|ref|XP_006484087.1| PREDICTED:
            probable glycosyltransferase At3g07620-like isoform X2
            [Citrus sinensis] gi|568861189|ref|XP_006484088.1|
            PREDICTED: probable glycosyltransferase At3g07620-like
            isoform X3 [Citrus sinensis] gi|557540241|gb|ESR51285.1|
            hypothetical protein CICLE_v10031600mg [Citrus
            clementina] gi|557540242|gb|ESR51286.1| hypothetical
            protein CICLE_v10031600mg [Citrus clementina]
          Length = 431

 Score =  596 bits (1536), Expect = e-168
 Identities = 277/396 (69%), Positives = 334/396 (84%)
 Frame = +2

Query: 110  GTVDLKSHFFPXXXXXXXXXXXXXXXXXCNIESPLRIFMYDLPIRFNLGMINHQISNEIL 289
            GTVD++SHFFP                  +  +PLR++MYDLP RF++GM++H   + + 
Sbjct: 32   GTVDIRSHFFPLLQSTAQ-----------SCSAPLRVYMYDLPRRFHVGMLDHSSPDGLP 80

Query: 290  VNGRNFPPWPSNSGLKKQHSVEYWMMASIIYEGNQKWREVVRVSDPGIADAFFVPFFSSM 469
            V   N P WP +SG+K+QHSVEYW+MAS++Y+G  + RE VRVSDP  A AFFVPFFSS+
Sbjct: 81   VTSENLPRWPRSSGIKRQHSVEYWLMASLLYDGESEEREAVRVSDPDTAQAFFVPFFSSL 140

Query: 470  SFNTHGHNMTDPETEIDRQLQIDILEFLRKSQYWQRSGGRDHVIPMHHPNAFRFLREQVN 649
            SFNTHGHNMTDP+TE DRQLQI+ILEFLR S+YWQ+SGGRDHVIPM HPNAFRFLR+Q+N
Sbjct: 141  SFNTHGHNMTDPDTEFDRQLQIEILEFLRNSKYWQKSGGRDHVIPMTHPNAFRFLRQQLN 200

Query: 650  ASILIVADFGRYPKTMSRLSKDVVAPYVHVVESFIDDNSPNPFESRNTLLFFRGRTVRKD 829
            ASILIVADFGRYP++MS LSKDVVAPYVHVVESF DDN P+PF +R TLLFF+G T+RKD
Sbjct: 201  ASILIVADFGRYPRSMSNLSKDVVAPYVHVVESFTDDNPPDPFVARKTLLFFQGNTIRKD 260

Query: 830  EGIVRAKLGKILVGYEDVHYEQSFATGESIKASSQGMRSSKFCLHPAGDTPSSCRLFDAI 1009
            EG VRAKL KIL GY+DVHYE+S  T +SIK S++GMRSSKFCLHPAGDTPSSCRLFDAI
Sbjct: 261  EGKVRAKLAKILTGYDDVHYERSAPTTKSIKESTEGMRSSKFCLHPAGDTPSSCRLFDAI 320

Query: 1010 VSHCVPVIVSDKIELPFEDELDYTQFSIFFSVQEALKPDYMIGQLRKFPKERWVEMWRKL 1189
            VSHCVPVIVSD+IELPFEDE+DY++FS+FFS++EA +P YMI QLR+ PK RW+EMW++L
Sbjct: 321  VSHCVPVIVSDRIELPFEDEIDYSEFSVFFSIKEAGQPGYMIDQLRQIPKARWIEMWQRL 380

Query: 1190 KNISHHYEFQYPPKKEDAVNMLWRQVRRKVPGAKLS 1297
            K+ISH+YEFQYPPKKEDAVNM+WRQV+ K+PG +L+
Sbjct: 381  KSISHYYEFQYPPKKEDAVNMVWRQVKNKIPGVQLA 416


>ref|XP_002512333.1| catalytic, putative [Ricinus communis] gi|223548294|gb|EEF49785.1|
            catalytic, putative [Ricinus communis]
          Length = 434

 Score =  596 bits (1536), Expect = e-168
 Identities = 281/398 (70%), Positives = 336/398 (84%), Gaps = 2/398 (0%)
 Frame = +2

Query: 110  GTVDLKSHFFPXXXXXXXXXXXXXXXXXCNIESPLRIFMYDLPIRFNLGMINH--QISNE 283
            GT+D++S+FFP                 C    PL+++MYDLP RF++GM++H     N+
Sbjct: 24   GTLDMRSYFFPLLQQQQSPTTGARSL--CATGPPLKVYMYDLPRRFHVGMMDHGGDAKND 81

Query: 284  ILVNGRNFPPWPSNSGLKKQHSVEYWMMASIIYEGNQKWREVVRVSDPGIADAFFVPFFS 463
              V G N P WP NSGL+KQHSVEYW+MAS++YEG  + RE VRV DP  ADAFFVPFFS
Sbjct: 82   TPVTGENLPTWPKNSGLRKQHSVEYWLMASLLYEGADE-REAVRVLDPEKADAFFVPFFS 140

Query: 464  SMSFNTHGHNMTDPETEIDRQLQIDILEFLRKSQYWQRSGGRDHVIPMHHPNAFRFLREQ 643
            S+SFNTHGH MTDPETEIDRQLQ+D+++ L KS+YWQ+SGGRDHVIPM HPNAFRFLR+Q
Sbjct: 141  SLSFNTHGHTMTDPETEIDRQLQVDVIDMLYKSKYWQKSGGRDHVIPMTHPNAFRFLRQQ 200

Query: 644  VNASILIVADFGRYPKTMSRLSKDVVAPYVHVVESFIDDNSPNPFESRNTLLFFRGRTVR 823
            +NASILIVADFGRYPK+MS LSKDVVAPYVHVV+SF DD   NPFESR TLLFFRG T+R
Sbjct: 201  LNASILIVADFGRYPKSMSTLSKDVVAPYVHVVDSFTDDEVSNPFESRTTLLFFRGNTIR 260

Query: 824  KDEGIVRAKLGKILVGYEDVHYEQSFATGESIKASSQGMRSSKFCLHPAGDTPSSCRLFD 1003
            KDEG VRAKL KIL GY+D+H+E+S AT E+IKAS++GMRSSKFCLHPAGDTPSSCRLFD
Sbjct: 261  KDEGKVRAKLAKILTGYDDIHFERSSATAETIKASTEGMRSSKFCLHPAGDTPSSCRLFD 320

Query: 1004 AIVSHCVPVIVSDKIELPFEDELDYTQFSIFFSVQEALKPDYMIGQLRKFPKERWVEMWR 1183
            AIVSHCVPVIVSD+IELP+EDE+DY+QFS+FFSV EA++P YM+ QLR+ PKERW+EMWR
Sbjct: 321  AIVSHCVPVIVSDQIELPYEDEIDYSQFSVFFSVNEAIQPGYMVDQLRQLPKERWLEMWR 380

Query: 1184 KLKNISHHYEFQYPPKKEDAVNMLWRQVRRKVPGAKLS 1297
            KLK+ISHH+EFQYPP+KEDAV+MLWR+V+ K+PGA+L+
Sbjct: 381  KLKSISHHFEFQYPPEKEDAVDMLWREVKHKLPGAQLA 418


>ref|XP_004310070.1| PREDICTED: probable glycosyltransferase At3g07620-like [Fragaria
            vesca subsp. vesca]
          Length = 446

 Score =  590 bits (1522), Expect = e-166
 Identities = 281/397 (70%), Positives = 327/397 (82%), Gaps = 1/397 (0%)
 Frame = +2

Query: 110  GTVDLKSHFFPXXXXXXXXXXXXXXXXXCNIESPLRIFMYDLPIRFNLGMINHQISNEIL 289
            GTVDL+S+F P                 C    PL++FMYDLP RFN+GM+N + + E  
Sbjct: 38   GTVDLRSYFIPLLKSSPLAPQSL-----CATGPPLKVFMYDLPRRFNVGMLNRKSAEEAP 92

Query: 290  VNGRNFPPWPSNSGLKKQHSVEYWMMASIIYEGNQ-KWREVVRVSDPGIADAFFVPFFSS 466
            V  R +PPWP NSGLKKQHSVEYWMM S+++EGN  +  EVVRVSDP +ADAFFVPFFSS
Sbjct: 93   VTAREWPPWPRNSGLKKQHSVEYWMMGSVLWEGNGGEGSEVVRVSDPEVADAFFVPFFSS 152

Query: 467  MSFNTHGHNMTDPETEIDRQLQIDILEFLRKSQYWQRSGGRDHVIPMHHPNAFRFLREQV 646
            +SFNTHGHNM DPETE+D QLQID+++ L +S+YW RSGGRDHVIPM HPNAFRFLR Q+
Sbjct: 153  LSFNTHGHNMNDPETEVDHQLQIDLVKLLHESKYWNRSGGRDHVIPMTHPNAFRFLRPQI 212

Query: 647  NASILIVADFGRYPKTMSRLSKDVVAPYVHVVESFIDDNSPNPFESRNTLLFFRGRTVRK 826
            NASI IV DFGRYP  MS LSKDVV PYVHVVESF DDNS +P+ESR TLLFF+GRT RK
Sbjct: 213  NASIQIVVDFGRYPHVMSNLSKDVVTPYVHVVESFTDDNSSDPYESRTTLLFFQGRTHRK 272

Query: 827  DEGIVRAKLGKILVGYEDVHYEQSFATGESIKASSQGMRSSKFCLHPAGDTPSSCRLFDA 1006
            DEGIVRAKL K+L GY+DVHYE+S ATGE+IK S+Q MR+SKFCLHPAGDTPSSCRLFDA
Sbjct: 273  DEGIVRAKLAKVLAGYDDVHYERSVATGENIKLSTQRMRASKFCLHPAGDTPSSCRLFDA 332

Query: 1007 IVSHCVPVIVSDKIELPFEDELDYTQFSIFFSVQEALKPDYMIGQLRKFPKERWVEMWRK 1186
            IVSHC+PVIVSD+IELPFEDELDY QFS+FFS +EAL+P YM+ +LRK  KE+W+EM+R 
Sbjct: 333  IVSHCIPVIVSDEIELPFEDELDYNQFSVFFSFKEALQPGYMVNELRKLSKEKWMEMYRH 392

Query: 1187 LKNISHHYEFQYPPKKEDAVNMLWRQVRRKVPGAKLS 1297
            LK+ISHH+EF YPP+KEDAVNMLWRQV+RKVP  KL+
Sbjct: 393  LKSISHHFEFHYPPEKEDAVNMLWRQVKRKVPAVKLA 429


>ref|XP_007044049.1| Exostosin family protein isoform 1 [Theobroma cacao]
            gi|590692416|ref|XP_007044050.1| Exostosin family protein
            isoform 1 [Theobroma cacao]
            gi|590692424|ref|XP_007044051.1| Exostosin family protein
            isoform 1 [Theobroma cacao] gi|508707984|gb|EOX99880.1|
            Exostosin family protein isoform 1 [Theobroma cacao]
            gi|508707985|gb|EOX99881.1| Exostosin family protein
            isoform 1 [Theobroma cacao] gi|508707986|gb|EOX99882.1|
            Exostosin family protein isoform 1 [Theobroma cacao]
          Length = 432

 Score =  587 bits (1513), Expect = e-165
 Identities = 281/398 (70%), Positives = 334/398 (83%), Gaps = 2/398 (0%)
 Frame = +2

Query: 110  GTVDLKSHFFPXXXXXXXXXXXXXXXXXCNIESPLRIFMYDLPIRFNLGMINHQISNEIL 289
            GTVDL+S+FFP                 C    PLR++MYDLP +F++GM++ + S E  
Sbjct: 24   GTVDLRSYFFPLLQSPPVLRSL------CATGRPLRVYMYDLPRKFHVGMLDRRSSEEAA 77

Query: 290  -VNGRNFPPWPSNSGLKKQHSVEYWMMASIIYEG-NQKWREVVRVSDPGIADAFFVPFFS 463
             V   N PPWPSNSG+K+QHSVEYW+MAS++Y+G ++  RE VRV DP  ADAFFVPFFS
Sbjct: 78   PVTMENLPPWPSNSGIKRQHSVEYWLMASLLYDGQDEDGREAVRVLDPEKADAFFVPFFS 137

Query: 464  SMSFNTHGHNMTDPETEIDRQLQIDILEFLRKSQYWQRSGGRDHVIPMHHPNAFRFLREQ 643
            S+SFNTHGHNMTDPETEIDR LQ+++LEFL++S+Y+QRSGGRDHVIPM HPNAFRFLREQ
Sbjct: 138  SLSFNTHGHNMTDPETEIDRHLQVELLEFLQQSKYYQRSGGRDHVIPMTHPNAFRFLREQ 197

Query: 644  VNASILIVADFGRYPKTMSRLSKDVVAPYVHVVESFIDDNSPNPFESRNTLLFFRGRTVR 823
            +NASILIV DFGRYPKTMS LSKDVVAPYVHVV+SF DD+  +P+ESR TLLFFRG TVR
Sbjct: 198  LNASILIVVDFGRYPKTMSSLSKDVVAPYVHVVDSFTDDDPLDPYESRTTLLFFRGNTVR 257

Query: 824  KDEGIVRAKLGKILVGYEDVHYEQSFATGESIKASSQGMRSSKFCLHPAGDTPSSCRLFD 1003
            KDEG +R KL KIL G +DVHYE+S AT ++IK S++GMRSSKFCLHPAGDTPSSCRLFD
Sbjct: 258  KDEGKIRVKLAKILAGSDDVHYEKSVATPKNIKMSTEGMRSSKFCLHPAGDTPSSCRLFD 317

Query: 1004 AIVSHCVPVIVSDKIELPFEDELDYTQFSIFFSVQEALKPDYMIGQLRKFPKERWVEMWR 1183
            AIVSHCVPVIVSDKIELP+EDE+DYT+FSIFFS++EAL+P Y++  LR+FPK RWV+MW+
Sbjct: 318  AIVSHCVPVIVSDKIELPYEDEIDYTEFSIFFSMKEALEPGYLVNHLRQFPKNRWVQMWK 377

Query: 1184 KLKNISHHYEFQYPPKKEDAVNMLWRQVRRKVPGAKLS 1297
             LKNIS HYEFQYPPKKEDAVNMLWRQV+ K+PG +L+
Sbjct: 378  LLKNISRHYEFQYPPKKEDAVNMLWRQVKHKLPGVQLA 415


>ref|XP_004144198.1| PREDICTED: probable glycosyltransferase At3g07620-like [Cucumis
            sativus]
          Length = 429

 Score =  585 bits (1509), Expect = e-164
 Identities = 275/396 (69%), Positives = 326/396 (82%)
 Frame = +2

Query: 110  GTVDLKSHFFPXXXXXXXXXXXXXXXXXCNIESPLRIFMYDLPIRFNLGMINHQISNEIL 289
            GTVD++S+FFP                 C  + PLR++MYDLP RFN+G++N +  ++  
Sbjct: 24   GTVDIRSYFFPLLQSQPISPFP------CTTDPPLRVYMYDLPRRFNVGILNRRNLDQTP 77

Query: 290  VNGRNFPPWPSNSGLKKQHSVEYWMMASIIYEGNQKWREVVRVSDPGIADAFFVPFFSSM 469
            V    +PPWP NSGLK+QHSVEYWMM S+++E     R+ VRV DP  ADAFFVPFFSS+
Sbjct: 78   VTASTWPPWPRNSGLKRQHSVEYWMMGSLLHEATGDGRDAVRVMDPENADAFFVPFFSSL 137

Query: 470  SFNTHGHNMTDPETEIDRQLQIDILEFLRKSQYWQRSGGRDHVIPMHHPNAFRFLREQVN 649
            SFN+HG NMTDP TE+D QLQID+++FL +S+YWQRS GRDHVIPM HPNAFRFLR QVN
Sbjct: 138  SFNSHGRNMTDPATEVDHQLQIDLMKFLSESKYWQRSKGRDHVIPMTHPNAFRFLRNQVN 197

Query: 650  ASILIVADFGRYPKTMSRLSKDVVAPYVHVVESFIDDNSPNPFESRNTLLFFRGRTVRKD 829
            ASI IV DFGRYPKTMS L KDVVAPYVHVV SFIDDN P+PFESR TLLFF+G+T RKD
Sbjct: 198  ASIQIVVDFGRYPKTMSNLGKDVVAPYVHVVSSFIDDNPPDPFESRPTLLFFQGKTFRKD 257

Query: 830  EGIVRAKLGKILVGYEDVHYEQSFATGESIKASSQGMRSSKFCLHPAGDTPSSCRLFDAI 1009
            +GI+R KL KIL GY+DVHYE+S AT +SIK SSQGMRSSKFCLHPAGDTPSSCRLFDAI
Sbjct: 258  DGIIRVKLAKILDGYDDVHYERSAATEKSIKTSSQGMRSSKFCLHPAGDTPSSCRLFDAI 317

Query: 1010 VSHCVPVIVSDKIELPFEDELDYTQFSIFFSVQEALKPDYMIGQLRKFPKERWVEMWRKL 1189
            VSHCVPVIVSD+IELP+EDE+DY+QF++FFS +EAL+P YM+ +LR+FPKERW+EMW++L
Sbjct: 318  VSHCVPVIVSDQIELPYEDEIDYSQFTLFFSFEEALQPGYMVEKLREFPKERWIEMWKQL 377

Query: 1190 KNISHHYEFQYPPKKEDAVNMLWRQVRRKVPGAKLS 1297
            K IS HYEFQYPPKKEDAVNMLWRQV+ K+P  KL+
Sbjct: 378  KEISRHYEFQYPPKKEDAVNMLWRQVKHKLPAVKLA 413


>ref|XP_004158257.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
            At3g07620-like [Cucumis sativus]
          Length = 429

 Score =  583 bits (1504), Expect = e-164
 Identities = 274/396 (69%), Positives = 325/396 (82%)
 Frame = +2

Query: 110  GTVDLKSHFFPXXXXXXXXXXXXXXXXXCNIESPLRIFMYDLPIRFNLGMINHQISNEIL 289
            GTVD++S+FFP                 C  + PLR++MYDLP RFN+G++N +  ++  
Sbjct: 24   GTVDIRSYFFPLLQSQPISPFP------CTTDPPLRVYMYDLPRRFNVGILNRRNLDQTP 77

Query: 290  VNGRNFPPWPSNSGLKKQHSVEYWMMASIIYEGNQKWREVVRVSDPGIADAFFVPFFSSM 469
            V    +PPWP NSGLK+QHSVEYWMM S+++E     R+ VRV DP  ADAFFVPFFSS+
Sbjct: 78   VTASTWPPWPRNSGLKRQHSVEYWMMGSLLHEATGDGRDAVRVMDPENADAFFVPFFSSL 137

Query: 470  SFNTHGHNMTDPETEIDRQLQIDILEFLRKSQYWQRSGGRDHVIPMHHPNAFRFLREQVN 649
            SFN+HG NMTDP TE+D QLQID+++FL +S+YWQRS GRDHVIPM HPNAFRFLR QVN
Sbjct: 138  SFNSHGRNMTDPATEVDHQLQIDLMKFLSESKYWQRSKGRDHVIPMTHPNAFRFLRNQVN 197

Query: 650  ASILIVADFGRYPKTMSRLSKDVVAPYVHVVESFIDDNSPNPFESRNTLLFFRGRTVRKD 829
            ASI IV DFGRYPKTMS L KDVVAPYVHVV SFIDDN P+PFESR TLLFF+G+T RKD
Sbjct: 198  ASIQIVVDFGRYPKTMSNLGKDVVAPYVHVVSSFIDDNPPDPFESRPTLLFFQGKTFRKD 257

Query: 830  EGIVRAKLGKILVGYEDVHYEQSFATGESIKASSQGMRSSKFCLHPAGDTPSSCRLFDAI 1009
            +GI+R KL KIL GY+DVHYE+S AT +SIK SSQGMRSSKFCLHPAGDTPSSCRLFDAI
Sbjct: 258  DGIIRVKLAKILDGYDDVHYERSAATEKSIKTSSQGMRSSKFCLHPAGDTPSSCRLFDAI 317

Query: 1010 VSHCVPVIVSDKIELPFEDELDYTQFSIFFSVQEALKPDYMIGQLRKFPKERWVEMWRKL 1189
            VSHCVPVIVSD+IELP+EDE+DY+QF++FF  +EAL+P YM+ +LR+FPKERW+EMW++L
Sbjct: 318  VSHCVPVIVSDQIELPYEDEIDYSQFTLFFXFEEALQPGYMVEKLREFPKERWIEMWKQL 377

Query: 1190 KNISHHYEFQYPPKKEDAVNMLWRQVRRKVPGAKLS 1297
            K IS HYEFQYPPKKEDAVNMLWRQV+ K+P  KL+
Sbjct: 378  KEISRHYEFQYPPKKEDAVNMLWRQVKHKLPAVKLA 413


>ref|XP_007227458.1| hypothetical protein PRUPE_ppa005995mg [Prunus persica]
            gi|462424394|gb|EMJ28657.1| hypothetical protein
            PRUPE_ppa005995mg [Prunus persica]
          Length = 433

 Score =  582 bits (1499), Expect = e-163
 Identities = 275/397 (69%), Positives = 326/397 (82%), Gaps = 1/397 (0%)
 Frame = +2

Query: 110  GTVDLKSHFFPXXXXXXXXXXXXXXXXXCNIESPLRIFMYDLPIRFNLGMINHQISNEIL 289
            GTVD++S+F P                      PL+++MYDLP RFN+GM+N + + +  
Sbjct: 25   GTVDIRSYFLPLLPSPPPGAQPPRATGP-----PLKVYMYDLPRRFNVGMLNRKSTEQAP 79

Query: 290  VNGRNFPPWPSNSGLKKQHSVEYWMMASIIYEGNQ-KWREVVRVSDPGIADAFFVPFFSS 466
            V  R +P WP NSGLK+QHSVEYWMM S++++G+    R  VRVSDP +ADAFFVPFFSS
Sbjct: 80   VTARTWPTWPRNSGLKRQHSVEYWMMGSLLFDGDGGDGRAAVRVSDPELADAFFVPFFSS 139

Query: 467  MSFNTHGHNMTDPETEIDRQLQIDILEFLRKSQYWQRSGGRDHVIPMHHPNAFRFLREQV 646
            +SFNTHGH+MTDP TEID QLQID+L+ L +S+YWQRSGGRDHVIP+ HPNAFRFLR Q+
Sbjct: 140  LSFNTHGHHMTDPATEIDHQLQIDVLKILGESKYWQRSGGRDHVIPLTHPNAFRFLRPQI 199

Query: 647  NASILIVADFGRYPKTMSRLSKDVVAPYVHVVESFIDDNSPNPFESRNTLLFFRGRTVRK 826
            NASI IV DFGRYP  MS LSKDVV+PYVHVV+SF DDN  NP+ESR TLLFF+GRT RK
Sbjct: 200  NASIQIVVDFGRYPHVMSNLSKDVVSPYVHVVDSFTDDNHSNPYESRTTLLFFQGRTFRK 259

Query: 827  DEGIVRAKLGKILVGYEDVHYEQSFATGESIKASSQGMRSSKFCLHPAGDTPSSCRLFDA 1006
            DEGIVR KL KIL GY+DVHYE+S ATG++IKASSQ MRSSKFCLHPAGDTPSSCRLFDA
Sbjct: 260  DEGIVRVKLAKILAGYDDVHYERSVATGDNIKASSQRMRSSKFCLHPAGDTPSSCRLFDA 319

Query: 1007 IVSHCVPVIVSDKIELPFEDELDYTQFSIFFSVQEALKPDYMIGQLRKFPKERWVEMWRK 1186
            IVSHCVPVIVSD+IELPFEDE+DYT+FS+FFS +EAL+P YM+ QLRKFPK+RW+EMWR+
Sbjct: 320  IVSHCVPVIVSDEIELPFEDEIDYTKFSLFFSFKEALEPGYMVDQLRKFPKDRWIEMWRQ 379

Query: 1187 LKNISHHYEFQYPPKKEDAVNMLWRQVRRKVPGAKLS 1297
            L +ISHH+EF YPP+KEDAVNMLWRQV+ K+P  KL+
Sbjct: 380  LNSISHHFEFHYPPEKEDAVNMLWRQVKHKLPAVKLA 416


>ref|XP_004504444.1| PREDICTED: probable glycosyltransferase At3g07620-like [Cicer
            arietinum]
          Length = 430

 Score =  581 bits (1498), Expect = e-163
 Identities = 275/397 (69%), Positives = 327/397 (82%), Gaps = 1/397 (0%)
 Frame = +2

Query: 110  GTVDLKSHFFPXXXXXXXXXXXXXXXXXCNIESPLRIFMYDLPIRFNLGMINHQISNEIL 289
            GT+D++S+FFP                 C+ + PLR++MYDLP RFN+ MI H+ ++E  
Sbjct: 24   GTLDIRSYFFPHLKSPTLEPAP------CSPDPPLRVYMYDLPRRFNVEMITHRTASESP 77

Query: 290  VNGRNFPPWPSNSGLKKQHSVEYWMMASIIYEGNQ-KWREVVRVSDPGIADAFFVPFFSS 466
            V  +++PPWP N GLKKQHSVEYWMM S+++EG   + RE VRV DP  ADAFFVPFFSS
Sbjct: 78   VTVKDWPPWPDNWGLKKQHSVEYWMMGSLLHEGEDGESREAVRVFDPEFADAFFVPFFSS 137

Query: 467  MSFNTHGHNMTDPETEIDRQLQIDILEFLRKSQYWQRSGGRDHVIPMHHPNAFRFLREQV 646
            +SFN+HGH MTDP TEIDRQLQ+D++EFL KS+YWQRS GRDH+ PM HPNAFRFLR QV
Sbjct: 138  LSFNSHGHTMTDPATEIDRQLQVDVMEFLTKSKYWQRSRGRDHIFPMTHPNAFRFLRNQV 197

Query: 647  NASILIVADFGRYPKTMSRLSKDVVAPYVHVVESFIDDNSPNPFESRNTLLFFRGRTVRK 826
            N +I +V DFGRYPK MS L+KDVV+PYVHVV+SF DD   +P+E+R+TLLFFRGRT RK
Sbjct: 198  NDTIQVVVDFGRYPKGMSNLNKDVVSPYVHVVDSFTDDEPEDPYEARSTLLFFRGRTFRK 257

Query: 827  DEGIVRAKLGKILVGYEDVHYEQSFATGESIKASSQGMRSSKFCLHPAGDTPSSCRLFDA 1006
            DEGIVRAKL KIL GY DVHYE+S ATGE+IKASS+GMRSSKFCLHPAGDTPSSCRLFDA
Sbjct: 258  DEGIVRAKLTKILSGYSDVHYERSVATGENIKASSKGMRSSKFCLHPAGDTPSSCRLFDA 317

Query: 1007 IVSHCVPVIVSDKIELPFEDELDYTQFSIFFSVQEALKPDYMIGQLRKFPKERWVEMWRK 1186
            IVSHCVPVIVSD+IELPFED++DY+QFS+FFS +EAL+P YMI  LRKFPK++W EMWR+
Sbjct: 318  IVSHCVPVIVSDQIELPFEDQIDYSQFSLFFSFKEALQPGYMIDHLRKFPKQKWTEMWRQ 377

Query: 1187 LKNISHHYEFQYPPKKEDAVNMLWRQVRRKVPGAKLS 1297
            LKN SHHYEFQYPPK+ DAVNMLWRQ++ K+P   LS
Sbjct: 378  LKNNSHHYEFQYPPKRGDAVNMLWRQIKHKLPEVTLS 414


>ref|XP_003524893.1| PREDICTED: probable glucuronosyltransferase Os03g0107900-like isoform
            X1 [Glycine max]
          Length = 427

 Score =  581 bits (1497), Expect = e-163
 Identities = 276/396 (69%), Positives = 327/396 (82%)
 Frame = +2

Query: 110  GTVDLKSHFFPXXXXXXXXXXXXXXXXXCNIESPLRIFMYDLPIRFNLGMINHQISNEIL 289
            GT+D++ +FFP                 C  + PLR+FMYDLP RFN+GMI+ + + E+ 
Sbjct: 24   GTLDIRPYFFPRLKLPSGAPAP------CAPDPPLRVFMYDLPRRFNVGMIDRRSAAEMP 77

Query: 290  VNGRNFPPWPSNSGLKKQHSVEYWMMASIIYEGNQKWREVVRVSDPGIADAFFVPFFSSM 469
            V   ++P WP N GLKKQHSVEYWMM S++  G    REVVRVSDP +A AFFVPFFSS+
Sbjct: 78   VTVEDWPAWPVNWGLKKQHSVEYWMMGSLLNVGGG--REVVRVSDPELAQAFFVPFFSSL 135

Query: 470  SFNTHGHNMTDPETEIDRQLQIDILEFLRKSQYWQRSGGRDHVIPMHHPNAFRFLREQVN 649
            SFNTHGH M DP T+IDRQLQ+D++E L+KS YWQRSGGRDHV PM HPNAFRFLR+Q+N
Sbjct: 136  SFNTHGHTMKDPATQIDRQLQVDLMELLKKSNYWQRSGGRDHVFPMTHPNAFRFLRDQLN 195

Query: 650  ASILIVADFGRYPKTMSRLSKDVVAPYVHVVESFIDDNSPNPFESRNTLLFFRGRTVRKD 829
             SI +V DFGRYP+ MS L+KDVV+PYVHVV+SF DD   +P+ESR+TLLFFRGRT RKD
Sbjct: 196  ESIQVVVDFGRYPRGMSNLNKDVVSPYVHVVDSFTDDEPQDPYESRSTLLFFRGRTYRKD 255

Query: 830  EGIVRAKLGKILVGYEDVHYEQSFATGESIKASSQGMRSSKFCLHPAGDTPSSCRLFDAI 1009
            EGIVR KL KIL GY+DVHYE+S AT E+IKASS+GMRSSKFCLHPAGDTPSSCRLFDAI
Sbjct: 256  EGIVRVKLAKILAGYDDVHYERSVATEENIKASSKGMRSSKFCLHPAGDTPSSCRLFDAI 315

Query: 1010 VSHCVPVIVSDKIELPFEDELDYTQFSIFFSVQEALKPDYMIGQLRKFPKERWVEMWRKL 1189
            VSHC+PVIVSD+IELPFEDE+DY+QFS+FFS +EAL+P YMI QLRKFPKE+W EMWR+L
Sbjct: 316  VSHCIPVIVSDQIELPFEDEIDYSQFSVFFSFKEALQPGYMIDQLRKFPKEKWTEMWRQL 375

Query: 1190 KNISHHYEFQYPPKKEDAVNMLWRQVRRKVPGAKLS 1297
            K+ISHHYEF+YPPK+EDAV+MLWRQV+ K+PG KLS
Sbjct: 376  KSISHHYEFRYPPKREDAVDMLWRQVKHKLPGVKLS 411


>ref|XP_003531191.2| PREDICTED: probable glycosyltransferase At3g07620-like isoform X1
            [Glycine max]
          Length = 472

 Score =  580 bits (1496), Expect = e-163
 Identities = 276/396 (69%), Positives = 327/396 (82%)
 Frame = +2

Query: 110  GTVDLKSHFFPXXXXXXXXXXXXXXXXXCNIESPLRIFMYDLPIRFNLGMINHQISNEIL 289
            GT+D++S+FFP                 C  E PLR+FMYDLP RFN+GMI+ + ++E  
Sbjct: 69   GTLDIRSYFFPRLKLPAAAPAP------CAPEPPLRVFMYDLPRRFNVGMIDRRSASETP 122

Query: 290  VNGRNFPPWPSNSGLKKQHSVEYWMMASIIYEGNQKWREVVRVSDPGIADAFFVPFFSSM 469
            V   ++P WP N GLKKQHSVEYWMM S++  G  + RE VRVSDP +A AFFVPFFSS+
Sbjct: 123  VTVEDWPAWPVNWGLKKQHSVEYWMMGSLLNAG--EGREAVRVSDPELAQAFFVPFFSSL 180

Query: 470  SFNTHGHNMTDPETEIDRQLQIDILEFLRKSQYWQRSGGRDHVIPMHHPNAFRFLREQVN 649
            SFNTHGH M DP T+IDRQLQ+D++E L+KS+YWQRSGGRDHV PM HPNAFRFLR Q+N
Sbjct: 181  SFNTHGHTMKDPATQIDRQLQVDLMELLKKSKYWQRSGGRDHVFPMTHPNAFRFLRGQLN 240

Query: 650  ASILIVADFGRYPKTMSRLSKDVVAPYVHVVESFIDDNSPNPFESRNTLLFFRGRTVRKD 829
             SI +V DFGRYP+ MS L+KDVV+PYVHVV+SF DD   +P+ESR+TLLFFRGRT RKD
Sbjct: 241  ESIQVVVDFGRYPRGMSNLNKDVVSPYVHVVDSFTDDEPQDPYESRSTLLFFRGRTYRKD 300

Query: 830  EGIVRAKLGKILVGYEDVHYEQSFATGESIKASSQGMRSSKFCLHPAGDTPSSCRLFDAI 1009
            EGIVR KL KIL GY+DVHYE+S AT E+IKASS+GMRSSKFCLHPAGDTPSSCRLFDAI
Sbjct: 301  EGIVRVKLAKILAGYDDVHYERSVATEENIKASSKGMRSSKFCLHPAGDTPSSCRLFDAI 360

Query: 1010 VSHCVPVIVSDKIELPFEDELDYTQFSIFFSVQEALKPDYMIGQLRKFPKERWVEMWRKL 1189
            VSHCVPVIVSD+IELPFED++DY+QFS+FFS +EAL+P YMI QLRKFPKE+W EMWR+L
Sbjct: 361  VSHCVPVIVSDQIELPFEDDIDYSQFSVFFSFKEALQPGYMIDQLRKFPKEKWTEMWRQL 420

Query: 1190 KNISHHYEFQYPPKKEDAVNMLWRQVRRKVPGAKLS 1297
            K+ISHHYEF+YPPK+EDAV+MLWRQ + K+PG KLS
Sbjct: 421  KSISHHYEFEYPPKREDAVDMLWRQAKHKLPGVKLS 456


>ref|XP_006848395.1| hypothetical protein AMTR_s00013p00218260 [Amborella trichopoda]
            gi|548851701|gb|ERN09976.1| hypothetical protein
            AMTR_s00013p00218260 [Amborella trichopoda]
          Length = 422

 Score =  571 bits (1471), Expect = e-160
 Identities = 275/397 (69%), Positives = 328/397 (82%), Gaps = 1/397 (0%)
 Frame = +2

Query: 110  GTVDLKSHFFPXXXXXXXXXXXXXXXXXCNIESPLRIFMYDLPIRFNLGMINHQISNEIL 289
            G++DL+S FF                      SPL+I+MY+LP  FN+GM+     ++ L
Sbjct: 23   GSLDLRSQFFAPTIIAPS-------------NSPLKIYMYNLPRHFNIGMLRRSDPHQDL 69

Query: 290  VNGRNFPPWPSNSGLKKQHSVEYWMMASIIYE-GNQKWREVVRVSDPGIADAFFVPFFSS 466
                  PPWP NSGLKKQHSVEYWMMAS++YE G  +  E +RVSDP  ADAFFVPFFSS
Sbjct: 70   PFTGQIPPWPQNSGLKKQHSVEYWMMASLLYEDGEGRDMEAIRVSDPEEADAFFVPFFSS 129

Query: 467  MSFNTHGHNMTDPETEIDRQLQIDILEFLRKSQYWQRSGGRDHVIPMHHPNAFRFLREQV 646
            +SFNTHGHNMTDPETE+DRQLQI++LEFLR S++W++SGGRDHVIPMHHPNAFRFLRE+V
Sbjct: 130  LSFNTHGHNMTDPETEVDRQLQIELLEFLRISKFWEQSGGRDHVIPMHHPNAFRFLREKV 189

Query: 647  NASILIVADFGRYPKTMSRLSKDVVAPYVHVVESFIDDNSPNPFESRNTLLFFRGRTVRK 826
            NASIL+VADFGR PK +S LSKDVVAPYVHV +SFIDD+S +PFESR TLLFFRGRTVRK
Sbjct: 190  NASILVVADFGRCPKNISSLSKDVVAPYVHVGDSFIDDDSSDPFESRTTLLFFRGRTVRK 249

Query: 827  DEGIVRAKLGKILVGYEDVHYEQSFATGESIKASSQGMRSSKFCLHPAGDTPSSCRLFDA 1006
             EGIVR+KL KIL G E VH+E+S ATGESIKASS GMRSSKFCL+PAGDTPSSCRLFDA
Sbjct: 250  AEGIVRSKLAKILRGQEGVHFEESVATGESIKASSLGMRSSKFCLNPAGDTPSSCRLFDA 309

Query: 1007 IVSHCVPVIVSDKIELPFEDELDYTQFSIFFSVQEALKPDYMIGQLRKFPKERWVEMWRK 1186
            IVSHC+PVIVSD+IELP+EDE+DY  FS+FFSV+EAL+P YM+ +LR+  +E+WVEMWR+
Sbjct: 310  IVSHCIPVIVSDRIELPYEDEIDYRTFSLFFSVEEALRPGYMLKELRQIKREKWVEMWRR 369

Query: 1187 LKNISHHYEFQYPPKKEDAVNMLWRQVRRKVPGAKLS 1297
            LK ISHH+EFQ+PPK++DAVNM+W+QVR K+P AKL+
Sbjct: 370  LKEISHHFEFQFPPKRDDAVNMIWKQVRHKLPAAKLA 406


>ref|XP_007158691.1| hypothetical protein PHAVU_002G174100g [Phaseolus vulgaris]
            gi|561032106|gb|ESW30685.1| hypothetical protein
            PHAVU_002G174100g [Phaseolus vulgaris]
          Length = 427

 Score =  568 bits (1463), Expect = e-159
 Identities = 270/396 (68%), Positives = 325/396 (82%)
 Frame = +2

Query: 110  GTVDLKSHFFPXXXXXXXXXXXXXXXXXCNIESPLRIFMYDLPIRFNLGMINHQISNEIL 289
            G +D++S+FFP                 C ++  LR+FMYDLP RFN+GMI+ + ++E  
Sbjct: 24   GNLDIRSYFFPHFKLSAVVPAA------CALDPSLRVFMYDLPRRFNVGMIDRRNTSETP 77

Query: 290  VNGRNFPPWPSNSGLKKQHSVEYWMMASIIYEGNQKWREVVRVSDPGIADAFFVPFFSSM 469
            V   ++P WP N GLKKQHSVEYWMM S+I+    + REVVRVSDP +A+AFFVPFFSS+
Sbjct: 78   VTVEDWPRWPENWGLKKQHSVEYWMMGSLIH--GVEGREVVRVSDPELANAFFVPFFSSL 135

Query: 470  SFNTHGHNMTDPETEIDRQLQIDILEFLRKSQYWQRSGGRDHVIPMHHPNAFRFLREQVN 649
            SFNTHGH M DP TEIDRQLQ+D++E L+KS+YWQRSGGRDHV P+ HPNAFRFLR+Q+N
Sbjct: 136  SFNTHGHTMKDPATEIDRQLQVDLMELLKKSKYWQRSGGRDHVFPVTHPNAFRFLRDQLN 195

Query: 650  ASILIVADFGRYPKTMSRLSKDVVAPYVHVVESFIDDNSPNPFESRNTLLFFRGRTVRKD 829
             SI +V DFGRYP+ MS L+KDVV+PYVHVV+S   D   +P+ESR+TLLFFRGRT RKD
Sbjct: 196  DSIQVVVDFGRYPRHMSNLNKDVVSPYVHVVDSLTVDEPQDPYESRSTLLFFRGRTYRKD 255

Query: 830  EGIVRAKLGKILVGYEDVHYEQSFATGESIKASSQGMRSSKFCLHPAGDTPSSCRLFDAI 1009
            EGIVR KL KIL GY+DVHYE+S AT E+IK SS+GMRSSKFCLHPAGDTPSSCRLFDAI
Sbjct: 256  EGIVRVKLAKILSGYDDVHYERSVATEENIKLSSKGMRSSKFCLHPAGDTPSSCRLFDAI 315

Query: 1010 VSHCVPVIVSDKIELPFEDELDYTQFSIFFSVQEALKPDYMIGQLRKFPKERWVEMWRKL 1189
            VSHCVPVIVSD+IELPFEDE++Y+QFSIFFS +EAL+P YM+ QLRKFPK++W EMW++L
Sbjct: 316  VSHCVPVIVSDQIELPFEDEIEYSQFSIFFSFKEALQPGYMVDQLRKFPKQKWTEMWKQL 375

Query: 1190 KNISHHYEFQYPPKKEDAVNMLWRQVRRKVPGAKLS 1297
            KNIS HYEFQYPPK+EDAV+MLWRQV+ K+PG  LS
Sbjct: 376  KNISQHYEFQYPPKREDAVSMLWRQVKHKIPGVSLS 411


>ref|XP_002272591.2| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
            At5g25310-like [Vitis vinifera]
          Length = 437

 Score =  566 bits (1459), Expect = e-159
 Identities = 282/424 (66%), Positives = 324/424 (76%), Gaps = 6/424 (1%)
 Frame = +2

Query: 41   MVGKXXXXXXXXXXXXXXXXXXXGTVDLKSHFFPXXXXXXXXXXXXXXXXXCNIESPLRI 220
            MVGK                    TVDL+S+ +P                      PL +
Sbjct: 1    MVGKATISVLILALLLLTTSIFIATVDLRSYLYPILLPRPGGRFPCSTGG-----GPLMV 55

Query: 221  FMYDLPIRFNLGMINHQI-SNEILVNGRNFPPWPSNSGLKKQHSVEYWMMASIIYEGN-- 391
            +MYDLP RF++GM+  +  ++E  V   N PPWPSNSGLKKQHSVEYWMMAS++Y+G   
Sbjct: 56   YMYDLPRRFHVGMLRRRSPADESPVTAENLPPWPSNSGLKKQHSVEYWMMASLLYDGGGG 115

Query: 392  QKWREVVRVSDPGIADAFFVPFFSSMSFNTHGHNMTDPETEIDRQLQIDILEFLRKSQYW 571
             + RE VRV DP +ADAFFVPFFSS+SFNTHGHNMTDP+TE DRQLQIDIL+ LR+S+YW
Sbjct: 116  NETREAVRVWDPEMADAFFVPFFSSLSFNTHGHNMTDPDTEFDRQLQIDILKILRESKYW 175

Query: 572  QRSGGRDHVIPMHHPNAFRFLREQVNASILIVADFGRYPKTMSRLSKDVVAPYVHVVESF 751
            QRSGGRDHVIPMHHPNAFRF REQVN SILIVADFGRYPK +S L KDVVAPYVHVV+SF
Sbjct: 176  QRSGGRDHVIPMHHPNAFRFFREQVNTSILIVADFGRYPKEISNLRKDVVAPYVHVVDSF 235

Query: 752  IDDNSPNPFESRNTLLFFRGRTVRKDEGIVRAKLGKILVGYED---VHYEQSFATGESIK 922
             DDNSP+P+ESR TLLFFRGRT+RKDEGIVR KL K+L G +D   +H+         + 
Sbjct: 236  TDDNSPDPYESRTTLLFFRGRTIRKDEGIVRDKLVKLLAGXDDYLQLHFHHRSYLSFLVX 295

Query: 923  ASSQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDKIELPFEDELDYTQFSIFFS 1102
             S+QGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSD+IELP+EDE+DYTQFSIFFS
Sbjct: 296  QSTQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYEDEIDYTQFSIFFS 355

Query: 1103 VQEALKPDYMIGQLRKFPKERWVEMWRKLKNISHHYEFQYPPKKEDAVNMLWRQVRRKVP 1282
             +EAL+P YMI QLR+ PKERWVEMWR LK ISHHYEFQYPPKK DA++MLWRQV+ K+P
Sbjct: 356  DKEALEPGYMIEQLRQIPKERWVEMWRHLKYISHHYEFQYPPKKGDAIDMLWRQVKHKLP 415

Query: 1283 GAKL 1294
             A L
Sbjct: 416  RANL 419


>ref|XP_004502300.1| PREDICTED: probable glycosyltransferase At3g07620-like [Cicer
            arietinum]
          Length = 420

 Score =  563 bits (1451), Expect = e-158
 Identities = 262/368 (71%), Positives = 315/368 (85%)
 Frame = +2

Query: 194  CNIESPLRIFMYDLPIRFNLGMINHQISNEILVNGRNFPPWPSNSGLKKQHSVEYWMMAS 373
            C  ++PLR++MYDLP RFN+GM+N + + E  V   ++P WP NSGLKKQHSVEYWMM S
Sbjct: 43   CTGKTPLRVYMYDLPRRFNVGMLNRRNTTEAPVTAVDYPMWPDNSGLKKQHSVEYWMMGS 102

Query: 374  IIYEGNQKWREVVRVSDPGIADAFFVPFFSSMSFNTHGHNMTDPETEIDRQLQIDILEFL 553
            +I  G     EVVRV DP + D FFVPFFSS+SFNTHGH+MTDPET+IDRQLQID++E L
Sbjct: 103  VI-GGGGNGSEVVRVLDPELVDVFFVPFFSSLSFNTHGHHMTDPETQIDRQLQIDLMELL 161

Query: 554  RKSQYWQRSGGRDHVIPMHHPNAFRFLREQVNASILIVADFGRYPKTMSRLSKDVVAPYV 733
            R+S+YWQR GGRDHV P+ HPNAFRFLR+Q+N SI +V DFGR P+ +S L+KDVV+PYV
Sbjct: 162  RQSKYWQRYGGRDHVFPLTHPNAFRFLRDQLNESIQVVVDFGRSPEGVSNLNKDVVSPYV 221

Query: 734  HVVESFIDDNSPNPFESRNTLLFFRGRTVRKDEGIVRAKLGKILVGYEDVHYEQSFATGE 913
            HVV+S+ DD   +PFESR TLLFFRGRT RKD+GI+RA+L KIL G++DVHYE+S ATGE
Sbjct: 222  HVVDSYEDDELQDPFESRTTLLFFRGRTHRKDKGIIRAQLTKILAGFDDVHYERSVATGE 281

Query: 914  SIKASSQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDKIELPFEDELDYTQFSI 1093
            +IK SS+GMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDKIELPFE+E+DY+QFS+
Sbjct: 282  NIKLSSKGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDKIELPFENEIDYSQFSL 341

Query: 1094 FFSVQEALKPDYMIGQLRKFPKERWVEMWRKLKNISHHYEFQYPPKKEDAVNMLWRQVRR 1273
            FFS +EAL+P YMI QLR FPK++W+EMWR+LKNISHHYEFQYP K+EDAVNMLWRQ++ 
Sbjct: 342  FFSFKEALEPGYMINQLRSFPKQKWIEMWRQLKNISHHYEFQYPSKREDAVNMLWRQIKH 401

Query: 1274 KVPGAKLS 1297
            K+PG + S
Sbjct: 402  KLPGIRQS 409


>ref|XP_003601797.1| hypothetical protein MTR_3g085480 [Medicago truncatula]
            gi|355490845|gb|AES72048.1| hypothetical protein
            MTR_3g085480 [Medicago truncatula]
          Length = 425

 Score =  549 bits (1415), Expect = e-154
 Identities = 258/396 (65%), Positives = 316/396 (79%)
 Frame = +2

Query: 110  GTVDLKSHFFPXXXXXXXXXXXXXXXXXCNIESPLRIFMYDLPIRFNLGMINHQISNEIL 289
            GT+D++S++                   C  E+PLR++MYDLP RFN+GM++ + + E  
Sbjct: 24   GTLDIRSYY--------QQSPSIIATTPCADEAPLRVYMYDLPRRFNVGMLDGRNTTEAP 75

Query: 290  VNGRNFPPWPSNSGLKKQHSVEYWMMASIIYEGNQKWREVVRVSDPGIADAFFVPFFSSM 469
            V   ++P WP N GL++QHSVEYWMM S++  G     E VRV DP + D +FVPFFSS+
Sbjct: 76   VTIADYPLWPDNQGLRRQHSVEYWMMGSLL-NGGGNGSEAVRVLDPEVVDVYFVPFFSSL 134

Query: 470  SFNTHGHNMTDPETEIDRQLQIDILEFLRKSQYWQRSGGRDHVIPMHHPNAFRFLREQVN 649
            SFNTHGH+M DPETEID QLQID++  L +S+YWQRSGGRDH+ PM HPNAFRFLR+Q+N
Sbjct: 135  SFNTHGHHMRDPETEIDHQLQIDLMGLLGQSKYWQRSGGRDHIFPMTHPNAFRFLRDQLN 194

Query: 650  ASILIVADFGRYPKTMSRLSKDVVAPYVHVVESFIDDNSPNPFESRNTLLFFRGRTVRKD 829
             SI +V DFGRYPK +S L+KDVV+PYVH V+S++DD   +PFESR TLLFFRG T RKD
Sbjct: 195  ESIQVVVDFGRYPKGVSNLNKDVVSPYVHFVDSYVDDEPHDPFESRTTLLFFRGGTHRKD 254

Query: 830  EGIVRAKLGKILVGYEDVHYEQSFATGESIKASSQGMRSSKFCLHPAGDTPSSCRLFDAI 1009
            +GIVRAK  KIL G++DVHYE+S ATGE+IK SS+GMRSSKFCLHPAGDTPSSCRLFDAI
Sbjct: 255  KGIVRAKFTKILAGFDDVHYERSSATGENIKLSSKGMRSSKFCLHPAGDTPSSCRLFDAI 314

Query: 1010 VSHCVPVIVSDKIELPFEDELDYTQFSIFFSVQEALKPDYMIGQLRKFPKERWVEMWRKL 1189
            VSHCVPVIVSDKIELPFE+E+DY+QFS+FFS +EAL+P YMI QLR FPK+ W EMWR+L
Sbjct: 315  VSHCVPVIVSDKIELPFENEIDYSQFSLFFSFKEALEPGYMINQLRSFPKQNWTEMWRQL 374

Query: 1190 KNISHHYEFQYPPKKEDAVNMLWRQVRRKVPGAKLS 1297
            KNISHHYEF YPP++EDAVNMLWRQ++ K+PG + S
Sbjct: 375  KNISHHYEFHYPPEREDAVNMLWRQIKHKLPGIRQS 410


>ref|XP_006391283.1| hypothetical protein EUTSA_v10018590mg [Eutrema salsugineum]
            gi|557087717|gb|ESQ28569.1| hypothetical protein
            EUTSA_v10018590mg [Eutrema salsugineum]
          Length = 432

 Score =  548 bits (1413), Expect = e-153
 Identities = 249/374 (66%), Positives = 318/374 (85%), Gaps = 6/374 (1%)
 Frame = +2

Query: 194  CNIES-PLRIFMYDLPIRFNLGMINHQISNEILVNGRNFPPWPSNSGLKKQHSVEYWMMA 370
            C+I   PLR+FMYDLP +FN+ M++ Q S+   + G+N P WP  SG+K+QHSVEYW+MA
Sbjct: 45   CSITGRPLRVFMYDLPRKFNVAMMDPQSSDVEPLTGKNLPSWPQTSGIKRQHSVEYWLMA 104

Query: 371  SIIYEGN--QKWREVVRVSDPGIADAFFVPFFSSMSFNTHGHNMTDPETEIDRQLQIDIL 544
            S+++ G   ++ +E  RV DP +ADAF+VPFFSS+SFNTHG NMTDP+TE DRQLQ++++
Sbjct: 105  SLLHGGGGGEEEKEAFRVFDPELADAFYVPFFSSLSFNTHGKNMTDPDTEFDRQLQVELM 164

Query: 545  EFLRKSQYWQRSGGRDHVIPMHHPNAFRFLREQVNASILIVADFGRYPKTMSRLSKDVVA 724
            E+L  S+YWQRSGGRDHVIPM HPNAFRFLR+QVNASIL+V DFGRYP+ M+RL KDVV+
Sbjct: 165  EYLENSKYWQRSGGRDHVIPMTHPNAFRFLRQQVNASILVVVDFGRYPREMARLGKDVVS 224

Query: 725  PYVHVVESFIDD---NSPNPFESRNTLLFFRGRTVRKDEGIVRAKLGKILVGYEDVHYEQ 895
            PYVHVVESF +D   ++P+PFE+R TLL+FRG TVRK EG +R +L K+L G  DVHYE+
Sbjct: 225  PYVHVVESFTEDGGVDTPDPFEARTTLLYFRGNTVRKAEGKIRLRLEKLLAGNSDVHYEK 284

Query: 896  SFATGESIKASSQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDKIELPFEDELD 1075
            S AT ++IK S++GMRSSKFCLHPAGDTPSSCRLFDAIVSHC+PVI+SD+IELPFEDE+D
Sbjct: 285  SVATTQNIKVSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISDRIELPFEDEID 344

Query: 1076 YTQFSIFFSVQEALKPDYMIGQLRKFPKERWVEMWRKLKNISHHYEFQYPPKKEDAVNML 1255
            Y++FS+FFS++EAL+P Y++  LR+FPKE+W++MW  LKN+SHH+EFQYPPK+EDAVNML
Sbjct: 345  YSEFSVFFSIKEALEPGYILNNLRQFPKEKWLQMWENLKNVSHHFEFQYPPKREDAVNML 404

Query: 1256 WRQVRRKVPGAKLS 1297
            WRQV+ K+P  KL+
Sbjct: 405  WRQVKHKIPSVKLA 418


>ref|XP_002311068.2| exostosin family protein [Populus trichocarpa]
            gi|550332343|gb|EEE88435.2| exostosin family protein
            [Populus trichocarpa]
          Length = 379

 Score =  546 bits (1408), Expect = e-153
 Identities = 260/362 (71%), Positives = 311/362 (85%), Gaps = 4/362 (1%)
 Frame = +2

Query: 224  MYDLPIRFNLGMINHQIS---NEILVNGRNFPPWPSNSGLKKQHSVEYWMMASIIYEGNQ 394
            MYDLP RFN+GM+  +     +  +      P WP N G++KQHSVEYW+MAS++  G +
Sbjct: 1    MYDLPRRFNIGMMQWKKGGGDDTPVRTAEELPRWPVNVGVRKQHSVEYWLMASLLGSGGE 60

Query: 395  -KWREVVRVSDPGIADAFFVPFFSSMSFNTHGHNMTDPETEIDRQLQIDILEFLRKSQYW 571
             + RE VRV DP IA+A+FVPFFSS+SFNTHG NMTDPETE DRQLQ+D+++FL+KS+YW
Sbjct: 61   GEEREAVRVLDPEIAEAYFVPFFSSLSFNTHGRNMTDPETEKDRQLQVDLIDFLQKSKYW 120

Query: 572  QRSGGRDHVIPMHHPNAFRFLREQVNASILIVADFGRYPKTMSRLSKDVVAPYVHVVESF 751
            QRSGGRDHVIPM HPNAFRFLR+ VNASILIVADFGRYPK++S LSKDVV+PYVH V+SF
Sbjct: 121  QRSGGRDHVIPMTHPNAFRFLRQLVNASILIVADFGRYPKSLSTLSKDVVSPYVHNVDSF 180

Query: 752  IDDNSPNPFESRNTLLFFRGRTVRKDEGIVRAKLGKILVGYEDVHYEQSFATGESIKASS 931
             DD+  +PFESR TLLFFRG TVRKD+G VRAKL KIL GY+DV YE+S  T E+I+AS+
Sbjct: 181  KDDDLLDPFESRKTLLFFRGNTVRKDKGKVRAKLEKILAGYDDVRYERSSPTAEAIQAST 240

Query: 932  QGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDKIELPFEDELDYTQFSIFFSVQE 1111
            QGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSD IELP+EDE+DY+QFSIFFS+ E
Sbjct: 241  QGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDLIELPYEDEIDYSQFSIFFSINE 300

Query: 1112 ALKPDYMIGQLRKFPKERWVEMWRKLKNISHHYEFQYPPKKEDAVNMLWRQVRRKVPGAK 1291
            A++PDY++ QLRKFPK+RW+EMWR+LK ISHH+EFQYPP KEDAVN+LWRQV+ K+PGA+
Sbjct: 301  AIQPDYLVNQLRKFPKDRWIEMWRQLKKISHHFEFQYPPVKEDAVNLLWRQVKNKLPGAQ 360

Query: 1292 LS 1297
            L+
Sbjct: 361  LA 362


>ref|XP_002888596.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297334437|gb|EFH64855.1| exostosin family protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 429

 Score =  542 bits (1396), Expect = e-151
 Identities = 247/366 (67%), Positives = 311/366 (84%), Gaps = 3/366 (0%)
 Frame = +2

Query: 209  PLRIFMYDLPIRFNLGMINHQISNEILVNGRNFPPWPSNSGLKKQHSVEYWMMASIIYEG 388
            PLR+FMYDLP +FN+ M++   S+   + G+N P WP  SG+K+QHSVEYW+MAS++  G
Sbjct: 51   PLRVFMYDLPRKFNVAMMDPHSSDVEPLTGKNLPSWPQTSGIKRQHSVEYWLMASLLNGG 110

Query: 389  NQKWREVVRVSDPGIADAFFVPFFSSMSFNTHGHNMTDPETEIDRQLQIDILEFLRKSQY 568
            +    E +RV DP +ADAF+VPFFSS+SFNTHG NMTDP+TE DRQLQ++++EFL  S+Y
Sbjct: 111  DDD-NEAIRVFDPDLADAFYVPFFSSLSFNTHGKNMTDPDTEFDRQLQVELMEFLEGSEY 169

Query: 569  WQRSGGRDHVIPMHHPNAFRFLREQVNASILIVADFGRYPKTMSRLSKDVVAPYVHVVES 748
            W RSGG+DHVIPM HPNAFRFLR+QVNASILIV DFGRY K M+RLSKDVV+PYVHVVES
Sbjct: 170  WNRSGGKDHVIPMTHPNAFRFLRQQVNASILIVVDFGRYAKDMARLSKDVVSPYVHVVES 229

Query: 749  FI---DDNSPNPFESRNTLLFFRGRTVRKDEGIVRAKLGKILVGYEDVHYEQSFATGESI 919
                 DD   +PFE+R TLL+FRG TVRKDEG +R +L K+L G  DVH+E+S AT ++I
Sbjct: 230  LNEEDDDGLTDPFEARTTLLYFRGNTVRKDEGKIRLRLEKLLAGNSDVHFEKSVATTQNI 289

Query: 920  KASSQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDKIELPFEDELDYTQFSIFF 1099
            K S++GMRSSKFCLHPAGDTPSSCRLFDAIVSHC+PVI+SDKIELPFEDE+DY++FS+FF
Sbjct: 290  KVSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISDKIELPFEDEIDYSEFSLFF 349

Query: 1100 SVQEALKPDYMIGQLRKFPKERWVEMWRKLKNISHHYEFQYPPKKEDAVNMLWRQVRRKV 1279
            S++E+L+P Y++ +LR+FPKE+W+EMW++LKN+SHH+EFQYPPK+EDAVNMLWRQV+ K+
Sbjct: 350  SIKESLEPGYILNKLRQFPKEKWLEMWKRLKNVSHHFEFQYPPKREDAVNMLWRQVKHKI 409

Query: 1280 PGAKLS 1297
            P  KL+
Sbjct: 410  PNVKLA 415


>ref|XP_006302174.1| hypothetical protein CARUB_v10020184mg [Capsella rubella]
            gi|482570884|gb|EOA35072.1| hypothetical protein
            CARUB_v10020184mg [Capsella rubella]
          Length = 494

 Score =  540 bits (1391), Expect = e-151
 Identities = 244/368 (66%), Positives = 310/368 (84%), Gaps = 5/368 (1%)
 Frame = +2

Query: 209  PLRIFMYDLPIRFNLGMINHQISNEILVNGRNFPPWPSNSGLKKQHSVEYWMMASIIYEG 388
            PLR+FMYDLP +FN+ M++ + S+   + G+N P WP  SG+K+QHSVEYW+MAS++  G
Sbjct: 113  PLRVFMYDLPRKFNVAMMDPRSSDVEPLTGKNLPSWPQTSGIKRQHSVEYWLMASLLQRG 172

Query: 389  NQKWR-EVVRVSDPGIADAFFVPFFSSMSFNTHGHNMTDPETEIDRQLQIDILEFLRKSQ 565
                  E +RV DP +ADAF+VPFFSS+SFNTHG NMTDP+TE DR+LQ++++EFL  S+
Sbjct: 173  GDGGDDEAIRVFDPDLADAFYVPFFSSLSFNTHGKNMTDPDTEFDRKLQVELMEFLENSE 232

Query: 566  YWQRSGGRDHVIPMHHPNAFRFLREQVNASILIVADFGRYPKTMSRLSKDVVAPYVHVVE 745
            YW+RSGG+DHVIPM HPNAFRFLR+QVNASILIV DFGRYPK M+RLSKDVV+PYVHVVE
Sbjct: 233  YWKRSGGKDHVIPMTHPNAFRFLRQQVNASILIVVDFGRYPKDMARLSKDVVSPYVHVVE 292

Query: 746  SFI----DDNSPNPFESRNTLLFFRGRTVRKDEGIVRAKLGKILVGYEDVHYEQSFATGE 913
            +      DD   +PFE+R TLL+FRG T RKDEG +R +L K+L    DVHYE+S AT +
Sbjct: 293  TLTEDGDDDGMTDPFEARTTLLYFRGNTARKDEGKIRLRLEKLLANNSDVHYEKSVATTQ 352

Query: 914  SIKASSQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDKIELPFEDELDYTQFSI 1093
            +IK S++GMRSSKFCLHPAGDTPSSCRLFDAIVSHC+PVI+SDKIELPFEDE+DY++FS+
Sbjct: 353  NIKVSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISDKIELPFEDEIDYSEFSV 412

Query: 1094 FFSVQEALKPDYMIGQLRKFPKERWVEMWRKLKNISHHYEFQYPPKKEDAVNMLWRQVRR 1273
            FFS++E+L+P Y++  LR+FPK++W+EMW++LKN+SHH+EFQYPPK+EDAVNMLWRQV+ 
Sbjct: 413  FFSIKESLEPGYILNNLRQFPKDKWLEMWKRLKNVSHHFEFQYPPKREDAVNMLWRQVKH 472

Query: 1274 KVPGAKLS 1297
            K+P  KL+
Sbjct: 473  KIPNVKLA 480

