BLASTX nr result

ID: Astragalus24_contig00023485 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00023485
         (1334 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU12394.1| hypothetical protein TSUD_253450 [Trifolium subt...   675   0.0  
dbj|GAU42924.1| hypothetical protein TSUD_283470 [Trifolium subt...   674   0.0  
dbj|GAU51965.1| hypothetical protein TSUD_417550 [Trifolium subt...   651   0.0  
dbj|GAU42521.1| hypothetical protein TSUD_376490 [Trifolium subt...   629   0.0  
dbj|GAU30423.1| hypothetical protein TSUD_364690 [Trifolium subt...   629   0.0  
ref|XP_003628138.1| UDP-glucosyltransferase family protein [Medi...   623   0.0  
gb|ABI94026.1| (iso)flavonoid glycosyltransferase [Medicago trun...   623   0.0  
ref|XP_013466939.1| UDP-glucosyltransferase family protein [Medi...   602   0.0  
ref|XP_020231152.1| soyasapogenol B glucuronide galactosyltransf...   588   0.0  
dbj|GAU11951.1| hypothetical protein TSUD_195750 [Trifolium subt...   579   0.0  
gb|AMQ26114.1| UDP-glycosyltransferase 41 [Pueraria montana var....   575   0.0  
gb|KHN10128.1| UDP-glucose flavonoid 3-O-glucosyltransferase 7 [...   572   0.0  
ref|XP_003546674.1| PREDICTED: soyasapogenol B glucuronide galac...   572   0.0  
ref|XP_020211411.1| soyasapogenol B glucuronide galactosyltransf...   558   0.0  
ref|XP_020211413.1| soyasapogenol B glucuronide galactosyltransf...   555   0.0  
ref|XP_007142833.1| hypothetical protein PHAVU_007G020800g [Phas...   542   0.0  
gb|KHN37410.1| UDP-glucose flavonoid 3-O-glucosyltransferase 7 [...   540   0.0  
ref|XP_003536714.1| PREDICTED: soyasapogenol B glucuronide galac...   540   0.0  
ref|XP_014512855.1| soyasapogenol B glucuronide galactosyltransf...   535   0.0  
ref|XP_017413459.1| PREDICTED: soyasapogenol B glucuronide galac...   532   0.0  

>dbj|GAU12394.1| hypothetical protein TSUD_253450 [Trifolium subterraneum]
          Length = 502

 Score =  675 bits (1742), Expect = 0.0
 Identities = 337/420 (80%), Positives = 358/420 (85%), Gaps = 3/420 (0%)
 Frame = +3

Query: 3    KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182
            KFPQ+ GLPHG+ES +ADTPQD+ SKIY             +  MKPDFIVTDMFYPWSV
Sbjct: 69   KFPQIPGLPHGLESLDADTPQDMSSKIYQGLFLLKENFQQLY--MKPDFIVTDMFYPWSV 126

Query: 183  DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362
            DIAAELGIPRLNCTGGSYFSHAARNSIEQF+PH NVGS++ES LLPGLPHKVEMT  QLS
Sbjct: 127  DIAAELGIPRLNCTGGSYFSHAARNSIEQFSPHVNVGSDHESFLLPGLPHKVEMTRSQLS 186

Query: 363  DWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFWV 542
            DW+KEPNDF  LMKMI D+DRKSYGSLF SFYE+EGTYEEHYQRVTGTRSW LGPVS WV
Sbjct: 187  DWVKEPNDFGDLMKMIGDADRKSYGSLFRSFYEMEGTYEEHYQRVTGTRSWSLGPVSLWV 246

Query: 543  NQDDSDKANRGHAKXXXXXXNGVLKWLDSKEED-SVVYVSFGSMNKFPISQLIEIAHALE 719
            NQDD DKANRG AK      NGVLKWLDSKEED SVVYVSFGSMNKFPISQ IEIAHALE
Sbjct: 247  NQDDFDKANRGRAKEKEEEENGVLKWLDSKEEDNSVVYVSFGSMNKFPISQHIEIAHALE 306

Query: 720  DSGYDFIWVVRKAEEG-EYGVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAVVTHC 896
            DSG+DFIWVV+K EEG EYG LE+F+KRVKESNKGYLIWGWAPQL ILEHSAIG VVTHC
Sbjct: 307  DSGFDFIWVVKKTEEGNEYGKLEEFEKRVKESNKGYLIWGWAPQLAILEHSAIGTVVTHC 366

Query: 897  GWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEVVKRE 1076
            GWNT LESV AGLPM TWPLFAEQFYNEKLLVDVLKIGVPVGAK WKNWN++GD+VVKRE
Sbjct: 367  GWNTTLESVYAGLPMVTWPLFAEQFYNEKLLVDVLKIGVPVGAKEWKNWNQYGDKVVKRE 426

Query: 1077 DIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKHQKVN 1253
            DIGKAI LLM GGEE LE+R+RVNEFSDAAKKT KVGGSSHT          SFK+QK N
Sbjct: 427  DIGKAIALLMGGGEECLEIRKRVNEFSDAAKKTIKVGGSSHTNLKELLKELMSFKYQKAN 486


>dbj|GAU42924.1| hypothetical protein TSUD_283470 [Trifolium subterraneum]
          Length = 497

 Score =  674 bits (1738), Expect = 0.0
 Identities = 337/425 (79%), Positives = 361/425 (84%), Gaps = 3/425 (0%)
 Frame = +3

Query: 3    KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182
            KFPQ+ GLP G+ES +A+TP+DI SKIY             FRDMKPDFIVTDMFYPWSV
Sbjct: 73   KFPQIPGLPLGLESVDAETPKDISSKIYQGLFLLKDNFQQLFRDMKPDFIVTDMFYPWSV 132

Query: 183  DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362
            D AAELGIPRLNCTGGSYFSHAARNSIEQFAPH NVGS+ ES LLPGLPHKVEMT  QLS
Sbjct: 133  DTAAELGIPRLNCTGGSYFSHAARNSIEQFAPHVNVGSDYESFLLPGLPHKVEMTRSQLS 192

Query: 363  DWLKE-PNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFW 539
            DW+ E  NDF  +MKMIKD+DR+SYGSLF SFYELEGTYEEHYQRVTGTRSW LGPVS W
Sbjct: 193  DWVNERSNDFGNIMKMIKDADRRSYGSLFRSFYELEGTYEEHYQRVTGTRSWSLGPVSLW 252

Query: 540  VNQDDSDKANRGHAKXXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAHALE 719
            VNQDD DKANRG+AK      NGVLKWLDSKE++SVVYVSFGSMNKFPISQ IEIAHALE
Sbjct: 253  VNQDDFDKANRGNAKEKEE--NGVLKWLDSKEDNSVVYVSFGSMNKFPISQHIEIAHALE 310

Query: 720  DSGYDFIWVVRKAEEGE-YGVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAVVTHC 896
            DSGYDFIWVV+K EEGE YGVLE+F+KRVKESNKGYLIW WAPQLVILEHSA+GAVVTHC
Sbjct: 311  DSGYDFIWVVKKTEEGEEYGVLEEFEKRVKESNKGYLIWDWAPQLVILEHSAVGAVVTHC 370

Query: 897  GWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEVVKRE 1076
            GWNT LESV  GLPM TWPLFAEQFYNEKLLV+VLKIGV +GAK WKNWN +GD+VVKRE
Sbjct: 371  GWNTTLESVYMGLPMVTWPLFAEQFYNEKLLVNVLKIGVSIGAKEWKNWNAYGDKVVKRE 430

Query: 1077 DIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKHQKVN 1253
            DIGKAI LLM GGEE LE+R+RVNE SDAAKKT KVGGSSHT          SFKHQKVN
Sbjct: 431  DIGKAIALLMGGGEECLEIRKRVNELSDAAKKTIKVGGSSHTNLKELLEELKSFKHQKVN 490

Query: 1254 HKMEG 1268
            H+MEG
Sbjct: 491  HQMEG 495


>dbj|GAU51965.1| hypothetical protein TSUD_417550 [Trifolium subterraneum]
          Length = 512

 Score =  651 bits (1679), Expect = 0.0
 Identities = 329/438 (75%), Positives = 354/438 (80%), Gaps = 17/438 (3%)
 Frame = +3

Query: 6    FPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSVD 185
            FPQ+ GLPHG+E  +ADTPQD     Y              RDMKPDFIVTDMFYPWSVD
Sbjct: 74   FPQIPGLPHGLEIIDADTPQDSSKLFYQGLLLLQENFQQIIRDMKPDFIVTDMFYPWSVD 133

Query: 186  IAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLSD 365
            IAAELGIPRLNC GGSYFSHAARNS EQFAPH NV S++E+  LPGLPHK+EMT  QLSD
Sbjct: 134  IAAELGIPRLNCNGGSYFSHAARNSTEQFAPHVNVSSDDETFSLPGLPHKIEMTRSQLSD 193

Query: 366  WLKEPN-DFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFWV 542
            W+KEPN +F Y MKMI D+DRKSYGSLF SFYELEGTYEEHYQRVTGTRSW LGPVS WV
Sbjct: 194  WVKEPNNEFGYWMKMIIDADRKSYGSLFRSFYELEGTYEEHYQRVTGTRSWSLGPVSLWV 253

Query: 543  NQDDSDKANRGHAKXXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAHALED 722
            NQDD DKANRG AK      NGVLKWLDSKE++SVVYVSFGSMNKF ISQ IEIAHALED
Sbjct: 254  NQDDFDKANRGCAKEKEEE-NGVLKWLDSKEDNSVVYVSFGSMNKFSISQQIEIAHALED 312

Query: 723  SGYDFIWVVRKA-EEGEY--------------GVLEKFDKRVKESNKGYLIWGWAPQLVI 857
            SG+DFIWVVRK  +E EY               +LE+F+KRVKESNKGYLIWGWAPQLVI
Sbjct: 313  SGHDFIWVVRKTTKENEYLSCLGAGTVPVPDTSILEEFEKRVKESNKGYLIWGWAPQLVI 372

Query: 858  LEHSAIGAVVTHCGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWK 1037
            LEHSAIGAVVTHCGWNT LES+  GLPM TWPLFAEQFYNEKLLVDVLKIGVPVG+K WK
Sbjct: 373  LEHSAIGAVVTHCGWNTTLESIYMGLPMVTWPLFAEQFYNEKLLVDVLKIGVPVGSKEWK 432

Query: 1038 NWNEFGDEVVKREDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXX 1214
            NWNE+GD+VVKREDIGKAIDLLM GGEE LE+R+RVNE SDAAKKT KVGGSS+T     
Sbjct: 433  NWNEYGDKVVKREDIGKAIDLLMGGGEECLEIRKRVNELSDAAKKTIKVGGSSYTKLKEL 492

Query: 1215 XXXXXSFKHQKVNHKMEG 1268
                 SFKHQKVN+KMEG
Sbjct: 493  LEELKSFKHQKVNNKMEG 510


>dbj|GAU42521.1| hypothetical protein TSUD_376490 [Trifolium subterraneum]
          Length = 458

 Score =  629 bits (1623), Expect = 0.0
 Identities = 321/426 (75%), Positives = 347/426 (81%), Gaps = 4/426 (0%)
 Frame = +3

Query: 3    KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182
            KFPQ+ GLPHG+E+ +ADTPQD+ SKIY              RDMKPDFIVTDMFYPWSV
Sbjct: 43   KFPQIPGLPHGLENVDADTPQDMNSKIYQGLLLLKDDFQQLIRDMKPDFIVTDMFYPWSV 102

Query: 183  DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362
            DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPH NVGS++ES LLPGLPHKVEMT  QLS
Sbjct: 103  DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHVNVGSDDESFLLPGLPHKVEMTRSQLS 162

Query: 363  DWLKEPN-DFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFW 539
            DW+K+PN +F Y MK+IKD+DRKSYGSLF SFYELEGT           RSW LGPVS W
Sbjct: 163  DWVKDPNLEFGYWMKVIKDADRKSYGSLFRSFYELEGT-----------RSWSLGPVSLW 211

Query: 540  VNQDDSDKANRGHAKXXXXXXNGVLKWLDSKEEDSVVYVSFGSM-NKFPISQLIEIAHAL 716
            VNQDD DKANRG AK      +GVLKWLDSKE++SVVYVSFGSM NKFPISQ IEIAHAL
Sbjct: 212  VNQDDFDKANRGCAKEKKEE-HGVLKWLDSKEDNSVVYVSFGSMMNKFPISQHIEIAHAL 270

Query: 717  EDSGYDFIWVVRKAEE-GEYGVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAVVTH 893
            EDSGYDFIWVV+K EE  EYG+LE+F+KRVKESNKGYLIWGWAPQLVILEH AIG VVTH
Sbjct: 271  EDSGYDFIWVVKKTEEVDEYGILEQFEKRVKESNKGYLIWGWAPQLVILEHFAIGTVVTH 330

Query: 894  CGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEVVKR 1073
            CGWNT LESV   LPM TWPLFAEQFYNEKLLVDVLKIGVPVGAK WKNWNE+ D+VVKR
Sbjct: 331  CGWNTTLESVYMSLPMVTWPLFAEQFYNEKLLVDVLKIGVPVGAKEWKNWNEYRDKVVKR 390

Query: 1074 EDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKHQKV 1250
            EDIGKAI LLM G E+ LE+R+RVNE SDAAKKT KVGGSSHT            KHQK 
Sbjct: 391  EDIGKAIALLMDGREKCLEIRKRVNELSDAAKKTIKVGGSSHTKLKELIEELMLLKHQKA 450

Query: 1251 NHKMEG 1268
            NH+M+G
Sbjct: 451  NHEMKG 456


>dbj|GAU30423.1| hypothetical protein TSUD_364690 [Trifolium subterraneum]
          Length = 501

 Score =  629 bits (1621), Expect = 0.0
 Identities = 308/423 (72%), Positives = 344/423 (81%), Gaps = 2/423 (0%)
 Frame = +3

Query: 3    KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182
            KFPQV GLP GMESFNADTP +IRSKIY             FRDMKPDFIVTDMFYPWSV
Sbjct: 76   KFPQVPGLPQGMESFNADTPNEIRSKIYQGLMVLQEQFKQLFRDMKPDFIVTDMFYPWSV 135

Query: 183  DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362
            DIA EL IPRL C  GSYF+H+A NSIE FAPHA V SN+ES LLPGLPHKVEMT LQL 
Sbjct: 136  DIADELRIPRLICISGSYFAHSAMNSIEVFAPHAKVNSNSESFLLPGLPHKVEMTRLQLP 195

Query: 363  DWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFWV 542
            DWL+ PND+TYLMKMIK+S+RKSYGSLFDS++E+EGTYE+HY+   GT+SWG+GPVS WV
Sbjct: 196  DWLRAPNDYTYLMKMIKESERKSYGSLFDSYHEIEGTYEDHYKTAMGTKSWGVGPVSLWV 255

Query: 543  NQDDSDKANRGHAKXXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAHALED 722
            NQ++SDKA+RGH        + VLKWLDSKEEDSV+YVSFGSMNKFP  QL+EIAHALED
Sbjct: 256  NQNNSDKASRGHRIEQDAEEDEVLKWLDSKEEDSVLYVSFGSMNKFPSPQLVEIAHALED 315

Query: 723  SGYDFIWVVRKAEEGE-YGVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAVVTHCG 899
            SG DFIWVVRK E+GE  G L +F+KRVKE NKGYLIWGWAPQL+ILEH+A+GAVVTHCG
Sbjct: 316  SGNDFIWVVRKVEDGEDGGFLREFEKRVKERNKGYLIWGWAPQLLILEHAAVGAVVTHCG 375

Query: 900  WNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEVVKRED 1079
            WNTI+ESVNAGLP+ATWPLFAEQFYNE+LLVDVLKIGV VGA  W+NWNEFGD+VVKRED
Sbjct: 376  WNTIMESVNAGLPLATWPLFAEQFYNERLLVDVLKIGVAVGANEWRNWNEFGDDVVKRED 435

Query: 1080 IGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKHQKVNH 1256
            IGKAI LLM  GEE LEMRRR    S AAKK  + GGSSHT          SFK + V +
Sbjct: 436  IGKAIGLLMGSGEECLEMRRRAKALSGAAKKAIEFGGSSHTKLKELLEDLKSFKLENVKN 495

Query: 1257 KME 1265
            K+E
Sbjct: 496  KLE 498


>ref|XP_003628138.1| UDP-glucosyltransferase family protein [Medicago truncatula]
 gb|AET02614.1| UDP-glucosyltransferase family protein [Medicago truncatula]
          Length = 464

 Score =  623 bits (1606), Expect = 0.0
 Identities = 304/424 (71%), Positives = 346/424 (81%), Gaps = 4/424 (0%)
 Frame = +3

Query: 3    KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182
            KFPQV GLP GMESFNADTP+DI SKIY             FRDMKPDFIVTDMFYPWSV
Sbjct: 38   KFPQVPGLPQGMESFNADTPKDIISKIYQGLAILQEQFTQLFRDMKPDFIVTDMFYPWSV 97

Query: 183  DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362
            D+A ELGIPRL C GGSYF+H+A NSIEQF PHA V SN+ S LLPGLPH VEMT LQL 
Sbjct: 98   DVADELGIPRLICIGGSYFAHSAMNSIEQFEPHAKVKSNSVSFLLPGLPHNVEMTRLQLP 157

Query: 363  DWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFWV 542
            DWL+ PN +TYLMKMIKDS++KSYGSLFDS+YE+EGTYE++Y+   G++SW +GPVS W+
Sbjct: 158  DWLRAPNGYTYLMKMIKDSEKKSYGSLFDSYYEIEGTYEDYYKIAMGSKSWSVGPVSLWM 217

Query: 543  NQDDSDKANRGHAKXXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAHALED 722
            N+DDSDKA RGH K       GVLKWLDSK+ DSV+YVSFGSMNKFP  QL+EIAHALED
Sbjct: 218  NKDDSDKAGRGHGK-EEDEEEGVLKWLDSKKYDSVLYVSFGSMNKFPTPQLVEIAHALED 276

Query: 723  SGYDFIWVVRK---AEEGEYGVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAVVTH 893
            SG+DFIWVVRK   AE+G+ G L +F+KR+KE NKGYLIWGWAPQL+ILEH A+GAVVTH
Sbjct: 277  SGHDFIWVVRKIEDAEDGDDGFLSEFEKRMKERNKGYLIWGWAPQLLILEHGAVGAVVTH 336

Query: 894  CGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEVVKR 1073
            CGWNTI+ESVNAGLP+ATWPLFAEQF+NE+LLVDVLKIGV VGAK W+NWNEFGD+VVKR
Sbjct: 337  CGWNTIMESVNAGLPLATWPLFAEQFFNERLLVDVLKIGVAVGAKEWRNWNEFGDDVVKR 396

Query: 1074 EDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKHQKV 1250
            EDIGKAI LLM GGEE LEMR+RV   S AAKK  +VGGSS+T          SFK +K+
Sbjct: 397  EDIGKAIGLLMGGGEECLEMRKRVKALSGAAKKAIEVGGSSYTKLKELIEELKSFKLEKI 456

Query: 1251 NHKM 1262
            N K+
Sbjct: 457  NKKL 460


>gb|ABI94026.1| (iso)flavonoid glycosyltransferase [Medicago truncatula]
          Length = 502

 Score =  623 bits (1606), Expect = 0.0
 Identities = 304/424 (71%), Positives = 346/424 (81%), Gaps = 4/424 (0%)
 Frame = +3

Query: 3    KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182
            KFPQV GLP GMESFNADTP+DI SKIY             FRDMKPDFIVTDMFYPWSV
Sbjct: 76   KFPQVPGLPQGMESFNADTPKDIISKIYQGLAILQEQFTQLFRDMKPDFIVTDMFYPWSV 135

Query: 183  DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362
            D+A ELGIPRL C GGSYF+H+A NSIEQF PHA V SN+ S LLPGLPH VEMT LQL 
Sbjct: 136  DVADELGIPRLICIGGSYFAHSAMNSIEQFEPHAKVKSNSVSFLLPGLPHNVEMTRLQLP 195

Query: 363  DWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFWV 542
            DWL+ PN +TYLMKMIKDS++KSYGSLFDS+YE+EGTYE++Y+   G++SW +GPVS W+
Sbjct: 196  DWLRAPNGYTYLMKMIKDSEKKSYGSLFDSYYEIEGTYEDYYKIAMGSKSWSVGPVSLWM 255

Query: 543  NQDDSDKANRGHAKXXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAHALED 722
            N+DDSDKA RGH K       GVLKWLDSK+ DSV+YVSFGSMNKFP  QL+EIAHALED
Sbjct: 256  NKDDSDKAGRGHGK-EEDEEEGVLKWLDSKKYDSVLYVSFGSMNKFPTPQLVEIAHALED 314

Query: 723  SGYDFIWVVRK---AEEGEYGVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAVVTH 893
            SG+DFIWVVRK   AE+G+ G L +F+KR+KE NKGYLIWGWAPQL+ILEH A+GAVVTH
Sbjct: 315  SGHDFIWVVRKIEDAEDGDDGFLSEFEKRMKERNKGYLIWGWAPQLLILEHGAVGAVVTH 374

Query: 894  CGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEVVKR 1073
            CGWNTI+ESVNAGLP+ATWPLFAEQF+NE+LLVDVLKIGV VGAK W+NWNEFGD+VVKR
Sbjct: 375  CGWNTIMESVNAGLPLATWPLFAEQFFNERLLVDVLKIGVAVGAKEWRNWNEFGDDVVKR 434

Query: 1074 EDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKHQKV 1250
            EDIGKAI LLM GGEE LEMR+RV   S AAKK  +VGGSS+T          SFK +K+
Sbjct: 435  EDIGKAIGLLMGGGEECLEMRKRVKALSGAAKKAIEVGGSSYTKLKELIEELKSFKLEKI 494

Query: 1251 NHKM 1262
            N K+
Sbjct: 495  NKKL 498


>ref|XP_013466939.1| UDP-glucosyltransferase family protein [Medicago truncatula]
 gb|KEH40975.1| UDP-glucosyltransferase family protein [Medicago truncatula]
          Length = 503

 Score =  602 bits (1553), Expect = 0.0
 Identities = 300/423 (70%), Positives = 340/423 (80%), Gaps = 3/423 (0%)
 Frame = +3

Query: 6    FPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSVD 185
            FPQV GL  GMESFNADTP +IRSKIY             FRDMKPDFIVTDMFYPWSVD
Sbjct: 77   FPQVPGLARGMESFNADTPNEIRSKIYQGLIILQEQFKQQFRDMKPDFIVTDMFYPWSVD 136

Query: 186  IAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLSD 365
            +A ELGIPRL C  GSYF+H+A NSIE F+P A V  N+ES LLPGLPHKVEM  LQL D
Sbjct: 137  VADELGIPRLICISGSYFAHSAMNSIEHFSPQAKVKLNSESFLLPGLPHKVEMKRLQLPD 196

Query: 366  WLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFWVN 545
            WL+ PND+TYLMKMIKDS+RKSYGSLFDS +E+E TYEEHY+   GT+SW LGPVS WVN
Sbjct: 197  WLRAPNDYTYLMKMIKDSERKSYGSLFDS-HEIESTYEEHYKTAMGTKSWSLGPVSLWVN 255

Query: 546  QDDSDKANRGHAKXXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAHALEDS 725
            QDDSDKA RGH K       GVLKWLDSK++DSV+YVSFGSMNKFP  QL+EIAHALE S
Sbjct: 256  QDDSDKAGRGHGKEEDED-EGVLKWLDSKKDDSVLYVSFGSMNKFPTPQLVEIAHALEHS 314

Query: 726  GYDFIWVVRKAEEGEYG-VLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAVVTHCGW 902
            G+DFIWVVRK E+ E G    +F+KR+KESNKGYLIWGWAPQL+ILEH+A+GAVVTHCGW
Sbjct: 315  GHDFIWVVRKIEDVEDGDFFTEFEKRMKESNKGYLIWGWAPQLLILEHAAVGAVVTHCGW 374

Query: 903  NTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEVVKREDI 1082
            NTI+ESVNAGL +ATWPLFAEQF+NE+LLVDVLKIGV VGAK W+NWNEFGD+VVKR++I
Sbjct: 375  NTIMESVNAGLSLATWPLFAEQFFNERLLVDVLKIGVAVGAKEWRNWNEFGDDVVKRDEI 434

Query: 1083 GKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFK-HQKVNH 1256
            GKAI LLM GGEE LEMR++    S AAKK  +VGGSS+T          SFK  +KVN+
Sbjct: 435  GKAIGLLMGGGEECLEMRKKAKALSGAAKKAIEVGGSSYTKLKQLIEELKSFKLEKKVNN 494

Query: 1257 KME 1265
            K+E
Sbjct: 495  KLE 497


>ref|XP_020231152.1| soyasapogenol B glucuronide galactosyltransferase-like [Cajanus
            cajan]
 gb|KYP51621.1| Anthocyanin 3'-O-beta-glucosyltransferase [Cajanus cajan]
          Length = 509

 Score =  588 bits (1517), Expect = 0.0
 Identities = 289/430 (67%), Positives = 340/430 (79%), Gaps = 9/430 (2%)
 Frame = +3

Query: 3    KFP-QVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWS 179
            KFP +  GLP G+ESFN++TPQD+  K+Y             F DM+PDF+VTDMFYPW+
Sbjct: 77   KFPFEQVGLPQGVESFNSNTPQDMVKKVYEGLSILKDQYQQLFHDMQPDFLVTDMFYPWT 136

Query: 180  VDIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQL 359
            VD AA+LGIPRL   GG YF+H+A+N+IEQF+PH  V S++E  L+PGLPH++EMT LQ+
Sbjct: 137  VDAAAKLGIPRLIYVGGGYFAHSAQNAIEQFSPHTKVDSDSERFLIPGLPHELEMTRLQI 196

Query: 360  SDWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFW 539
             DWL+EP D++ LMK++KDS+R+SYGSLF++FYELEGTYEEHY++  G +SW +GPVSFW
Sbjct: 197  PDWLREPKDYSDLMKIMKDSERRSYGSLFNTFYELEGTYEEHYKKAMGVKSWSVGPVSFW 256

Query: 540  VNQDDSDKANRGHAK---XXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAH 710
            VNQD SDKA+RGHAK          G L WLDSK E+SV+YVSFGSMNKFP  QL+EIAH
Sbjct: 257  VNQDASDKADRGHAKEEQEGEGGGEGWLTWLDSKTENSVLYVSFGSMNKFPTPQLVEIAH 316

Query: 711  ALEDSGYDFIWVVRKAEEGE----YGVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIG 878
            ALEDSG+DFIWVVRK  E E       LE+F++RV+ SNKGYLIWGWAPQL+ILEH AIG
Sbjct: 317  ALEDSGHDFIWVVRKKGESEDCDGNEFLEEFEERVRASNKGYLIWGWAPQLLILEHLAIG 376

Query: 879  AVVTHCGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGD 1058
            AVVTHCGWNTI+ESVNAGLPMATWPLFAEQFYNEKLL DVL+IGVPVGAK WKNWNEFGD
Sbjct: 377  AVVTHCGWNTIIESVNAGLPMATWPLFAEQFYNEKLLADVLRIGVPVGAKEWKNWNEFGD 436

Query: 1059 EVVKREDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSF 1235
            EVVKR++IGKAI +LM GGEE LEMRRRV   SDAAKK  +VGGSSH           SF
Sbjct: 437  EVVKRDEIGKAIAVLMGGGEECLEMRRRVKALSDAAKKAIQVGGSSHNKMKQLIQELKSF 496

Query: 1236 KHQKVNHKME 1265
            K QK+N K E
Sbjct: 497  KLQKINLKNE 506


>dbj|GAU11951.1| hypothetical protein TSUD_195750 [Trifolium subterraneum]
          Length = 476

 Score =  579 bits (1493), Expect = 0.0
 Identities = 282/382 (73%), Positives = 318/382 (83%), Gaps = 2/382 (0%)
 Frame = +3

Query: 126  FRDMKPDFIVTDMFYPWSVDIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNE 305
            FR+MKPDFIVT MFYPW+VDIA ELGIPR  C GGSYF+H+A NSIE FAPH  V SN+E
Sbjct: 92   FREMKPDFIVTYMFYPWTVDIADELGIPRFICIGGSYFAHSAMNSIEVFAPHEKVNSNSE 151

Query: 306  SVLLPGLPHKVEMTCLQLSDWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEH 485
            S LLPGLPHKVEMT LQL DWL+ PN++TYLMKMIK+S+RKSYGSLFDS+YE+EGTYE+H
Sbjct: 152  SFLLPGLPHKVEMTRLQLPDWLRAPNNYTYLMKMIKESERKSYGSLFDSYYEIEGTYEDH 211

Query: 486  YQRVTGTRSWGLGPVSFWVNQDDSDKANRGHAKXXXXXXNGVLKWLDSKEEDSVVYVSFG 665
            Y+   GT+SWG+GPVS WVNQDDSDKA RG+ K      +GVLKWLDSKEEDSV+YVSFG
Sbjct: 212  YKTAMGTKSWGVGPVSLWVNQDDSDKAGRGNGKKQDEKEDGVLKWLDSKEEDSVLYVSFG 271

Query: 666  SMNKFPISQLIEIAHALEDSGYDFIWVVRKAEEGEYG-VLEKFDKRVKESNKGYLIWGWA 842
            SM KFP  QL+EIA ALEDSG +FIWVVRK E GE G  L +F+KRVKESNKGYLIWGWA
Sbjct: 272  SMTKFPSPQLVEIAQALEDSGNNFIWVVRKIEHGEDGSFLREFEKRVKESNKGYLIWGWA 331

Query: 843  PQLVILEHSAIGAVVTHCGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVG 1022
            PQL+ILEH+A+GA+VT CGWNTI+ESVNAGLP+ATWPLFAEQFYNE+LLVDVLKIGV VG
Sbjct: 332  PQLLILEHAAVGAMVTRCGWNTIMESVNAGLPLATWPLFAEQFYNERLLVDVLKIGVAVG 391

Query: 1023 AKVWKNWNEFGDEVVKREDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHT 1199
            AK W+NWNEFGD+VVKREDIGKAI LLM  GEE LEMRRR    S AAKK  + GGSSHT
Sbjct: 392  AKEWRNWNEFGDDVVKREDIGKAIGLLMGCGEECLEMRRRAKALSGAAKKAIEFGGSSHT 451

Query: 1200 XXXXXXXXXXSFKHQKVNHKME 1265
                      S K +KVN+K+E
Sbjct: 452  KLKELNEDLKSIKLEKVNNKLE 473


>gb|AMQ26114.1| UDP-glycosyltransferase 41 [Pueraria montana var. lobata]
          Length = 504

 Score =  575 bits (1481), Expect = 0.0
 Identities = 280/427 (65%), Positives = 332/427 (77%), Gaps = 6/427 (1%)
 Frame = +3

Query: 3    KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182
            KFPQV GLP GMESFNA TP D+ +KI              FRDMKPDFIV+DMFYPW+V
Sbjct: 75   KFPQVPGLPQGMESFNASTPTDMVAKISHALSTLEGQFRQVFRDMKPDFIVSDMFYPWTV 134

Query: 183  DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362
            D AAELGIPRL   GG+YF+H A +S+E+F PH N+GS++ES L+PGLPH+ EMT  QL 
Sbjct: 135  DAAAELGIPRLIYVGGTYFAHCAMDSLERFEPHTNLGSDDESFLIPGLPHEFEMTRSQLP 194

Query: 363  DWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFWV 542
            D  K PND TY+MK +K+S+++SYGS+F SFY  EG YEEHY+++ GT+SW +GP+S WV
Sbjct: 195  DRFKAPNDMTYIMKRVKESEKRSYGSVFKSFYAFEGAYEEHYRKIMGTKSWNVGPISSWV 254

Query: 543  NQDDSDKANRGHAK----XXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAH 710
            NQD SDKA+RGH K           G   WLDSK+E+SV+YV FGSMN FP SQL+EIA+
Sbjct: 255  NQDASDKASRGHGKEELQEEGKGKEGWFAWLDSKKEESVLYVCFGSMNNFPTSQLVEIAY 314

Query: 711  ALEDSGYDFIWVVRKAEEGE-YGVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAVV 887
            ALED G+DFIWVVRK +EGE  G +E+F+KRV+ SNKGYLIWGWAPQL+ILEH AIGAVV
Sbjct: 315  ALEDCGHDFIWVVRKIDEGEARGFVEEFEKRVQASNKGYLIWGWAPQLLILEHPAIGAVV 374

Query: 888  THCGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEVV 1067
            THCG NT++ESV+AGLP+ TWPLFAEQF+NE+LLVDVLKIGVP+GAK WKNWNEFGDE+V
Sbjct: 375  THCGMNTVIESVDAGLPLVTWPLFAEQFFNERLLVDVLKIGVPIGAKKWKNWNEFGDEIV 434

Query: 1068 KREDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKHQ 1244
            KREDIGKAI LLM GGEES EMRRRV   SDAAKK  +VGGSSH           S K +
Sbjct: 435  KREDIGKAIALLMGGGEESEEMRRRVKALSDAAKKAIQVGGSSHNSLKDLIEELKSLKLR 494

Query: 1245 KVNHKME 1265
            KVN K++
Sbjct: 495  KVNGKLD 501


>gb|KHN10128.1| UDP-glucose flavonoid 3-O-glucosyltransferase 7 [Glycine soja]
          Length = 498

 Score =  572 bits (1474), Expect = 0.0
 Identities = 283/426 (66%), Positives = 330/426 (77%), Gaps = 7/426 (1%)
 Frame = +3

Query: 3    KFP-QVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWS 179
            KFP +  GLP G+ESFN++TP+D+  KIY             F D++PDF+ TDMFYPW+
Sbjct: 73   KFPCEQVGLPEGVESFNSNTPRDLVPKIYQGLTILQDQYQQLFHDLQPDFLFTDMFYPWT 132

Query: 180  VDIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQL 359
            VD AA+LGIPRL    G Y +H+++N+IEQF+PH  V S+ ES LLPGLPH+++MT LQL
Sbjct: 133  VDAAAKLGIPRLIYVSGGYLAHSSQNTIEQFSPHTKVDSDTESFLLPGLPHELKMTRLQL 192

Query: 360  SDWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFW 539
             DWL+ P  +TYLM M+KDS+RKSYGSL ++FYELEG YEEHY++  GT+SW +GPVSFW
Sbjct: 193  PDWLRAPTGYTYLMNMMKDSERKSYGSLLNTFYELEGDYEEHYKKAMGTKSWSVGPVSFW 252

Query: 540  VNQDDSDKANRGHAK-XXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAHAL 716
            VNQD  DKA+RGHAK        G L WLDSK E+SV+YVSFGSMNKFP  QL+EIAHAL
Sbjct: 253  VNQDALDKADRGHAKEEQGEGEEGWLTWLDSKTENSVLYVSFGSMNKFPTPQLVEIAHAL 312

Query: 717  EDSGYDFIWVVRKAEEGEYG----VLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAV 884
            EDS +DFIWVVRK  E E G     L++FDKRVK SNKGYLIWGWAPQL+ILEH AIGAV
Sbjct: 313  EDSDHDFIWVVRKKGESEDGEGNDFLQEFDKRVKASNKGYLIWGWAPQLLILEHHAIGAV 372

Query: 885  VTHCGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEV 1064
            VTHCGWNTI+ESVNAGLPMATWPLFAEQFYNEKLL +VL+IGVPVGAK W+NWNEFGDEV
Sbjct: 373  VTHCGWNTIIESVNAGLPMATWPLFAEQFYNEKLLAEVLRIGVPVGAKEWRNWNEFGDEV 432

Query: 1065 VKREDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKH 1241
            VKRE+IG AI +LM GGEES+EMRRR    SDAAKK  +VGGSSH           S K 
Sbjct: 433  VKREEIGNAIGVLM-GGEESIEMRRRAKALSDAAKKAIQVGGSSHNNLKELIQELKSLKL 491

Query: 1242 QKVNHK 1259
            QK NHK
Sbjct: 492  QKANHK 497


>ref|XP_003546674.1| PREDICTED: soyasapogenol B glucuronide galactosyltransferase-like
            [Glycine max]
 gb|KRH13189.1| hypothetical protein GLYMA_15G221300 [Glycine max]
          Length = 501

 Score =  572 bits (1474), Expect = 0.0
 Identities = 283/426 (66%), Positives = 330/426 (77%), Gaps = 7/426 (1%)
 Frame = +3

Query: 3    KFP-QVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWS 179
            KFP +  GLP G+ESFN++TP+D+  KIY             F D++PDF+ TDMFYPW+
Sbjct: 76   KFPCEQVGLPEGVESFNSNTPRDLVPKIYQGLTILQDQYQQLFHDLQPDFLFTDMFYPWT 135

Query: 180  VDIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQL 359
            VD AA+LGIPRL    G Y +H+++N+IEQF+PH  V S+ ES LLPGLPH+++MT LQL
Sbjct: 136  VDAAAKLGIPRLIYVSGGYLAHSSQNTIEQFSPHTKVDSDTESFLLPGLPHELKMTRLQL 195

Query: 360  SDWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFW 539
             DWL+ P  +TYLM M+KDS+RKSYGSL ++FYELEG YEEHY++  GT+SW +GPVSFW
Sbjct: 196  PDWLRAPTGYTYLMNMMKDSERKSYGSLLNTFYELEGDYEEHYKKAMGTKSWSVGPVSFW 255

Query: 540  VNQDDSDKANRGHAK-XXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAHAL 716
            VNQD  DKA+RGHAK        G L WLDSK E+SV+YVSFGSMNKFP  QL+EIAHAL
Sbjct: 256  VNQDALDKADRGHAKEEQGEGEEGWLTWLDSKTENSVLYVSFGSMNKFPTPQLVEIAHAL 315

Query: 717  EDSGYDFIWVVRKAEEGEYG----VLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAV 884
            EDS +DFIWVVRK  E E G     L++FDKRVK SNKGYLIWGWAPQL+ILEH AIGAV
Sbjct: 316  EDSDHDFIWVVRKKGESEDGEGNDFLQEFDKRVKASNKGYLIWGWAPQLLILEHHAIGAV 375

Query: 885  VTHCGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEV 1064
            VTHCGWNTI+ESVNAGLPMATWPLFAEQFYNEKLL +VL+IGVPVGAK W+NWNEFGDEV
Sbjct: 376  VTHCGWNTIIESVNAGLPMATWPLFAEQFYNEKLLAEVLRIGVPVGAKEWRNWNEFGDEV 435

Query: 1065 VKREDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKH 1241
            VKRE+IG AI +LM GGEES+EMRRR    SDAAKK  +VGGSSH           S K 
Sbjct: 436  VKREEIGNAIGVLM-GGEESIEMRRRAKALSDAAKKAIQVGGSSHNNLKELIQELKSLKL 494

Query: 1242 QKVNHK 1259
            QK NHK
Sbjct: 495  QKANHK 500


>ref|XP_020211411.1| soyasapogenol B glucuronide galactosyltransferase-like [Cajanus
            cajan]
          Length = 522

 Score =  558 bits (1438), Expect = 0.0
 Identities = 276/430 (64%), Positives = 330/430 (76%), Gaps = 9/430 (2%)
 Frame = +3

Query: 3    KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182
            KFPQ++GLP GMES NA TP+D+ SKI              FRDMKPDFIV+DMFYPWSV
Sbjct: 89   KFPQISGLPEGMESINASTPKDMTSKIVEGMSILERQFRQAFRDMKPDFIVSDMFYPWSV 148

Query: 183  DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362
            ++AAEL IPRL   GG+YF+H A +S+E+F PH  VGS++ES LLPGLPH++EM   QL 
Sbjct: 149  EVAAELEIPRLIYVGGTYFAHCAMDSLERFEPHNKVGSDDESFLLPGLPHQIEMIRSQLP 208

Query: 363  DWLKEPN-DFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFW 539
               + PN  F YLMK +K+S++KSYGS+  SF+E EG YEEHY+++ GT+SW +GP+S W
Sbjct: 209  IRFRNPNHQFGYLMKAVKESEKKSYGSVLKSFHEFEGDYEEHYKKIMGTKSWNVGPISSW 268

Query: 540  VNQDDSDKANRGHAKXXXXXXN------GVLKWLDSKEEDSVVYVSFGSMNKFPISQLIE 701
            VNQD SDKA RGHAK             G L WLDSK+EDSV+YV FGSMN FP +QL+E
Sbjct: 269  VNQDASDKAGRGHAKEEEEEEEEEKGKEGWLAWLDSKKEDSVLYVCFGSMNNFPSAQLVE 328

Query: 702  IAHALEDSGYDFIWVVRKAEEGEY-GVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIG 878
            IAHALEDSG+DF+WVVRK +EGE  G +E+F+KRV+ SNKGYLIWGWAPQL+ILEH AIG
Sbjct: 329  IAHALEDSGHDFLWVVRKVDEGEAKGFVEEFEKRVQASNKGYLIWGWAPQLLILEHPAIG 388

Query: 879  AVVTHCGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGD 1058
            AVVTHCG NT++ESV+AGLP+ TWPLF+EQF+NEKLLVDVLKIGVPVGAK W++WNE GD
Sbjct: 389  AVVTHCGMNTVIESVDAGLPLVTWPLFSEQFFNEKLLVDVLKIGVPVGAKKWRDWNELGD 448

Query: 1059 EVVKREDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSF 1235
            E+VKREDIGKAI LLMSGGEESLE+R+R    S AAKK  +VGGSS+           S 
Sbjct: 449  EIVKREDIGKAIALLMSGGEESLEIRKRAKAMSVAAKKAIQVGGSSYNSLKELIEELRSL 508

Query: 1236 KHQKVNHKME 1265
            K QK N KME
Sbjct: 509  KLQKANRKME 518


>ref|XP_020211413.1| soyasapogenol B glucuronide galactosyltransferase-like [Cajanus
            cajan]
          Length = 519

 Score =  555 bits (1429), Expect = 0.0
 Identities = 275/427 (64%), Positives = 326/427 (76%), Gaps = 6/427 (1%)
 Frame = +3

Query: 3    KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182
            KFPQ+ GLP GMES NA TP+D+ SKI              FRDMKPDFIVTDMFY WSV
Sbjct: 89   KFPQIPGLPEGMESINASTPKDMTSKIVEGMSILELQFRQAFRDMKPDFIVTDMFYVWSV 148

Query: 183  DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362
            ++AAEL IPRL   GG+YF+H A +S+E+F PH  VGS++ES LLPGLPH++EM   QL 
Sbjct: 149  EVAAELDIPRLIYMGGTYFAHCAMDSLERFEPHNKVGSDDESFLLPGLPHEIEMIRSQLP 208

Query: 363  DWLKEPN-DFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFW 539
               + PN  + YLMK +K+S +KSYGS+F SF E EG YEEHY+++ GT+SW +GP+S W
Sbjct: 209  IRFRNPNHQYEYLMKAVKESAKKSYGSVFKSFREFEGVYEEHYKKIMGTKSWNVGPISSW 268

Query: 540  VNQDDSDKANRGHAKXXXXXXNGV---LKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAH 710
            VNQD SDKA RGHAK       G    L WLDSK+EDSV+YV FGSMN FP +QL+EIAH
Sbjct: 269  VNQDASDKAGRGHAKEEEEEEKGKEGWLAWLDSKKEDSVLYVCFGSMNNFPSTQLVEIAH 328

Query: 711  ALEDSGYDFIWVVRKAEEGEY-GVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAVV 887
            ALEDSG+DF+WVVRK +EGE  G +E+F+KRV+ SNKGYLIWGWAPQL+ILEH A+GAVV
Sbjct: 329  ALEDSGHDFLWVVRKVDEGEAKGFVEEFEKRVQASNKGYLIWGWAPQLLILEHPAVGAVV 388

Query: 888  THCGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEVV 1067
            THCG NT++ESV+AGLP+ TWPLFAEQF+NEKLLVDVLKIGVP+GAK W+ WN  GDE+V
Sbjct: 389  THCGMNTVIESVDAGLPLVTWPLFAEQFFNEKLLVDVLKIGVPIGAKKWREWNGLGDEIV 448

Query: 1068 KREDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKHQ 1244
            KREDIGKAI LLMSGGEESLE+R+R    S AAKK  +VGGSS+           S K Q
Sbjct: 449  KREDIGKAIALLMSGGEESLEIRKRAKAMSVAAKKAIQVGGSSYNSLKELIEELRSLKLQ 508

Query: 1245 KVNHKME 1265
            KVN KME
Sbjct: 509  KVNRKME 515


>ref|XP_007142833.1| hypothetical protein PHAVU_007G020800g [Phaseolus vulgaris]
 gb|ESW14827.1| hypothetical protein PHAVU_007G020800g [Phaseolus vulgaris]
          Length = 494

 Score =  542 bits (1397), Expect = 0.0
 Identities = 266/418 (63%), Positives = 315/418 (75%), Gaps = 1/418 (0%)
 Frame = +3

Query: 3    KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182
            KFPQ+ GLP G+E+ N+DTP  +  KI              FR M+PDFIVTDMFYPWS 
Sbjct: 75   KFPQIPGLPEGVETINSDTPPPLTMKIGEALSILQGQYQQLFRLMQPDFIVTDMFYPWSA 134

Query: 183  DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362
            D AAELGIPRL   G SYFSH A N +E+FAPH  V S+ ES  LPGLPHK+EMT LQL 
Sbjct: 135  DAAAELGIPRLVYVGASYFSHCAMNCVEEFAPHDKVDSDGESFELPGLPHKLEMTRLQLP 194

Query: 363  DWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFWV 542
            DWL+ P  +TYL KM+K+S++KSYGS+F SFYE EG YEEHY+RV GT+SW +GPVS WV
Sbjct: 195  DWLRAPKPYTYLKKMMKESEKKSYGSVFKSFYEFEGAYEEHYKRVMGTKSWSIGPVSLWV 254

Query: 543  NQDDSDKANRGHAKXXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAHALED 722
            NQD+SDKA RG AK        +++WLDSK+E+SV+YVSFGSMNKFP +QL+EIAHALED
Sbjct: 255  NQDESDKAGRGQAKEGKGTDEELIRWLDSKKENSVLYVSFGSMNKFPTTQLVEIAHALED 314

Query: 723  SGYDFIWVVRKAEEGEYGVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAVVTHCGW 902
            SG+DFIWVVRK ++G++  LE+F+KRV+ SN+GYLIWGWAPQLVIL+H A GAVVTHCG 
Sbjct: 315  SGHDFIWVVRKNDDGDF--LEEFEKRVQGSNRGYLIWGWAPQLVILDHPATGAVVTHCGM 372

Query: 903  NTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEVVKREDI 1082
            NT+ ESV AGLPM  WPLF+EQF+NEKL+VDVLKIGV VGAK W+N N+FG E VKRE I
Sbjct: 373  NTVFESVIAGLPMVAWPLFSEQFFNEKLVVDVLKIGVSVGAKEWRNLNDFGSETVKREAI 432

Query: 1083 GKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKHQKVN 1253
            G+AI L M GGEE +EMRRRV   SD AKK  +  G+SH           S K QK N
Sbjct: 433  GEAIGLSMGGGEECVEMRRRVKVLSDEAKKAIQSDGTSHNNLQELIQELKSLKLQKDN 490


>gb|KHN37410.1| UDP-glucose flavonoid 3-O-glucosyltransferase 7 [Glycine soja]
          Length = 491

 Score =  540 bits (1390), Expect = 0.0
 Identities = 266/426 (62%), Positives = 322/426 (75%), Gaps = 9/426 (2%)
 Frame = +3

Query: 3    KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182
            KFPQV GLP G+ESFNA TP D+ +KI              FRD+KPDFIV+DMFYPWSV
Sbjct: 65   KFPQVPGLPQGLESFNASTPADMVTKIGHALSILEGPFRQLFRDIKPDFIVSDMFYPWSV 124

Query: 183  DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362
            D A ELGIPRL   GG+YF+H A +S+E+F PH  VGS++ES L+PGLPH+ EMT  Q+ 
Sbjct: 125  DAADELGIPRLIYVGGTYFAHCAMDSLERFEPHTKVGSDDESFLIPGLPHEFEMTRSQIP 184

Query: 363  DWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFWV 542
            D  K P++ TYLMK IK+S+++SYGS+F SFY  EG YE+HY+++ GT+SW LGP+S WV
Sbjct: 185  DRFKAPDNLTYLMKTIKESEKRSYGSVFKSFYAFEGAYEDHYRKIMGTKSWNLGPISSWV 244

Query: 543  NQDDSDKANRG-------HAKXXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIE 701
            NQD SDKA+RG         +         L WLDSK+E SV+YV FGSMN FP +QL+E
Sbjct: 245  NQDASDKASRGSRDNKAKEEQVEEGKDGSWLAWLDSKKEGSVLYVCFGSMNNFPTTQLVE 304

Query: 702  IAHALEDSGYDFIWVVRKAEEGEY-GVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIG 878
            IAHALEDSG+DFIWVV K +EGE  G +++F+KRV+ SNKGYLI GWAPQL+ILEH +IG
Sbjct: 305  IAHALEDSGHDFIWVVGKTDEGETKGFVDEFEKRVQASNKGYLICGWAPQLLILEHPSIG 364

Query: 879  AVVTHCGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGD 1058
            AVVTHCG NT++ESV+AGLP+ TWPLFAEQF+NE+LLVDVLKIGV +GAK W NWN+FGD
Sbjct: 365  AVVTHCGMNTVIESVDAGLPLVTWPLFAEQFFNERLLVDVLKIGVAIGAKKWNNWNDFGD 424

Query: 1059 EVVKREDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSF 1235
            E+VKREDIGKAI LLM GGEES EMR+RV   SDAAKK  +VGGSSH           S 
Sbjct: 425  EIVKREDIGKAIALLMGGGEESEEMRKRVKALSDAAKKAIQVGGSSHNSLKDLIEELKSL 484

Query: 1236 KHQKVN 1253
            K QK++
Sbjct: 485  KLQKLS 490


>ref|XP_003536714.1| PREDICTED: soyasapogenol B glucuronide galactosyltransferase-like
            [Glycine max]
 gb|KRH36042.1| hypothetical protein GLYMA_10G280400 [Glycine max]
          Length = 505

 Score =  540 bits (1391), Expect = 0.0
 Identities = 268/426 (62%), Positives = 321/426 (75%), Gaps = 9/426 (2%)
 Frame = +3

Query: 3    KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182
            KFPQV GLP G+ESFNA TP D+ +KI              FRD+KPDFIV+DMFYPWSV
Sbjct: 79   KFPQVPGLPQGLESFNASTPADMVTKIGHALSILEGPFRQLFRDIKPDFIVSDMFYPWSV 138

Query: 183  DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362
            D A ELGIPRL   GG+YF+H A +S+E+F PH  VGS++ES L+PGLPH+ EMT  Q+ 
Sbjct: 139  DAADELGIPRLIYVGGTYFAHCAMDSLERFEPHTKVGSDDESFLIPGLPHEFEMTRSQIP 198

Query: 363  DWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFWV 542
            D  K P++ TYLMK IK+S+++SYGS+F SFY  EG YE+HY+++ GT+SW LGP+S WV
Sbjct: 199  DRFKAPDNLTYLMKTIKESEKRSYGSVFKSFYAFEGAYEDHYRKIMGTKSWNLGPISSWV 258

Query: 543  NQDDSDKANRG-------HAKXXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIE 701
            NQD SDKA+RG         +         L WLDSK+E SV+YV FGSMN FP +QL E
Sbjct: 259  NQDASDKASRGSRDNKAKEEQVEEGKDGSWLAWLDSKKEGSVLYVCFGSMNNFPTTQLGE 318

Query: 702  IAHALEDSGYDFIWVVRKAEEGEY-GVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIG 878
            IAHALEDSG+DFIWVV K +EGE  G +E+F+KRV+ SNKGYLI GWAPQL+ILEH +IG
Sbjct: 319  IAHALEDSGHDFIWVVGKTDEGETKGFVEEFEKRVQASNKGYLICGWAPQLLILEHPSIG 378

Query: 879  AVVTHCGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGD 1058
            AVVTHCG NT++ESV+AGLP+ TWPLFAEQF+NE+LLVDVLKIGV +GAK W NWN+FGD
Sbjct: 379  AVVTHCGMNTVIESVDAGLPLVTWPLFAEQFFNERLLVDVLKIGVAIGAKKWNNWNDFGD 438

Query: 1059 EVVKREDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSF 1235
            E+VKREDIGKAI LLM GGEES EMR+RV   SDAAKK  +VGGSSH           S 
Sbjct: 439  EIVKREDIGKAIALLMGGGEESEEMRKRVKALSDAAKKAIQVGGSSHNSLKDLIEELKSL 498

Query: 1236 KHQKVN 1253
            K QK+N
Sbjct: 499  KLQKLN 504


>ref|XP_014512855.1| soyasapogenol B glucuronide galactosyltransferase [Vigna radiata var.
            radiata]
          Length = 493

 Score =  535 bits (1379), Expect = 0.0
 Identities = 266/418 (63%), Positives = 314/418 (75%), Gaps = 4/418 (0%)
 Frame = +3

Query: 3    KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182
            KFPQV GLP G+E+ NADTP  +  KI              FR MKPDFIVTDMFYPWS 
Sbjct: 75   KFPQVPGLPEGIETINADTPPLLTMKISEALSILQGQYQELFRVMKPDFIVTDMFYPWSA 134

Query: 183  DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362
            D AAELGIPRL   G SYFSH A N +E+FAPHA V S+ ES  LPGLPHK+EMT  QL 
Sbjct: 135  DAAAELGIPRLVYVGASYFSHCAMNCVEEFAPHAKVDSDGESFELPGLPHKLEMTRSQLP 194

Query: 363  DWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFWV 542
            DWL+ P  +TYL KMIK+S++KSYGSLF SFYE EG YEEHY+RV GT+SW +GPVS WV
Sbjct: 195  DWLRAPKPYTYLKKMIKESEKKSYGSLFKSFYEFEGAYEEHYKRVMGTKSWSIGPVSLWV 254

Query: 543  NQDDSDKANRGHAKXXXXXXNG--VLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAHAL 716
            NQD+ DKA RGHAK          +++WLD+K+E+SV+YVSFGSMNKFP +QL+EIAHAL
Sbjct: 255  NQDELDKAGRGHAKEGEGKGTNEELMRWLDTKKENSVLYVSFGSMNKFPTAQLVEIAHAL 314

Query: 717  EDSGYDFIWVVRKAEE-GEYGVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAVVTH 893
            ED G+DFIWVVRK ++ G+ G LE+F+KRV+ESN+GYLIWGWAPQL IL+H A GAVVTH
Sbjct: 315  EDCGHDFIWVVRKNDDHGDKGFLEEFEKRVQESNRGYLIWGWAPQLAILDHPATGAVVTH 374

Query: 894  CGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEVVKR 1073
            CG NT+ ESV AGLP+  WP+F+EQF+NEKL+VDVLKIGV VGAK W+N N+FG E VKR
Sbjct: 375  CGMNTVFESVIAGLPLVAWPIFSEQFFNEKLVVDVLKIGVSVGAKEWRNLNDFGSETVKR 434

Query: 1074 EDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKHQ 1244
            E+I KA+ L+M GGEE +EMRRRV   SD AKK  + GG+SH           S K Q
Sbjct: 435  EEIRKAVVLVM-GGEECVEMRRRVKVLSDEAKKAIQSGGTSHNNLKELIEELKSLKLQ 491


>ref|XP_017413459.1| PREDICTED: soyasapogenol B glucuronide galactosyltransferase-like
            [Vigna angularis]
 gb|KOM36380.1| hypothetical protein LR48_Vigan02g253000 [Vigna angularis]
 dbj|BAT93707.1| hypothetical protein VIGAN_08023500 [Vigna angularis var. angularis]
 dbj|BAT93708.1| hypothetical protein VIGAN_08023600 [Vigna angularis var. angularis]
          Length = 494

 Score =  532 bits (1370), Expect = 0.0
 Identities = 265/419 (63%), Positives = 312/419 (74%), Gaps = 4/419 (0%)
 Frame = +3

Query: 3    KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182
            KFPQV GLP G+E+ NADTP  +  KI              FR MKPDFIVTDMFYPWS 
Sbjct: 75   KFPQVPGLPEGVETINADTPPLLTMKISEGLSILQGQYQELFRVMKPDFIVTDMFYPWSA 134

Query: 183  DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362
            D AAELGIPRL   G  YFSH A N +EQFAPHA V S+ ES  LPGLPHK+EMT  QL 
Sbjct: 135  DAAAELGIPRLVYVGACYFSHCAMNCVEQFAPHAKVDSDGESFELPGLPHKLEMTRSQLP 194

Query: 363  DWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFWV 542
            DWL+ P  +TYL KMIK+S++KSYGSLF SFYE EG YEEHY+RV GT+SW +GPVS WV
Sbjct: 195  DWLRAPKPYTYLKKMIKESEKKSYGSLFKSFYEFEGAYEEHYKRVMGTKSWSIGPVSLWV 254

Query: 543  NQDDSDKANRGHAKXXXXXXNG--VLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAHAL 716
            N+D+ DKA RGHAK          +++WLDSK+E+ V+YVSFGSMNKFP +QL+EIAHAL
Sbjct: 255  NEDELDKAGRGHAKEGEGKRTDEELMRWLDSKKENCVLYVSFGSMNKFPTAQLVEIAHAL 314

Query: 717  EDSGYDFIWVVRK-AEEGEYGVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAVVTH 893
            ED G+DFIWVVRK  ++G+ G LE+F+KRV+ESN GYLIWGWAPQL IL+H A GAVVTH
Sbjct: 315  EDCGHDFIWVVRKNDDDGDRGFLEEFEKRVQESNNGYLIWGWAPQLAILDHPATGAVVTH 374

Query: 894  CGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEVVKR 1073
            CG NT+ ESV AGLP+  WP+F+EQF+NEKL+VDVLKIGV VGAK W+N N+FG E VKR
Sbjct: 375  CGMNTVFESVIAGLPLVAWPIFSEQFFNEKLVVDVLKIGVSVGAKEWRNLNDFGSETVKR 434

Query: 1074 EDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKHQK 1247
            E+I KA+ L+M GGEE +EMR+RV   SD AKK  + GG+SH           S K QK
Sbjct: 435  EEIRKAVVLVM-GGEECVEMRKRVKVLSDEAKKAIQSGGTSHNNLKELIEELKSVKLQK 492


Top