BLASTX nr result
ID: Astragalus24_contig00023485
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus24_contig00023485 (1334 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|GAU12394.1| hypothetical protein TSUD_253450 [Trifolium subt... 675 0.0 dbj|GAU42924.1| hypothetical protein TSUD_283470 [Trifolium subt... 674 0.0 dbj|GAU51965.1| hypothetical protein TSUD_417550 [Trifolium subt... 651 0.0 dbj|GAU42521.1| hypothetical protein TSUD_376490 [Trifolium subt... 629 0.0 dbj|GAU30423.1| hypothetical protein TSUD_364690 [Trifolium subt... 629 0.0 ref|XP_003628138.1| UDP-glucosyltransferase family protein [Medi... 623 0.0 gb|ABI94026.1| (iso)flavonoid glycosyltransferase [Medicago trun... 623 0.0 ref|XP_013466939.1| UDP-glucosyltransferase family protein [Medi... 602 0.0 ref|XP_020231152.1| soyasapogenol B glucuronide galactosyltransf... 588 0.0 dbj|GAU11951.1| hypothetical protein TSUD_195750 [Trifolium subt... 579 0.0 gb|AMQ26114.1| UDP-glycosyltransferase 41 [Pueraria montana var.... 575 0.0 gb|KHN10128.1| UDP-glucose flavonoid 3-O-glucosyltransferase 7 [... 572 0.0 ref|XP_003546674.1| PREDICTED: soyasapogenol B glucuronide galac... 572 0.0 ref|XP_020211411.1| soyasapogenol B glucuronide galactosyltransf... 558 0.0 ref|XP_020211413.1| soyasapogenol B glucuronide galactosyltransf... 555 0.0 ref|XP_007142833.1| hypothetical protein PHAVU_007G020800g [Phas... 542 0.0 gb|KHN37410.1| UDP-glucose flavonoid 3-O-glucosyltransferase 7 [... 540 0.0 ref|XP_003536714.1| PREDICTED: soyasapogenol B glucuronide galac... 540 0.0 ref|XP_014512855.1| soyasapogenol B glucuronide galactosyltransf... 535 0.0 ref|XP_017413459.1| PREDICTED: soyasapogenol B glucuronide galac... 532 0.0 >dbj|GAU12394.1| hypothetical protein TSUD_253450 [Trifolium subterraneum] Length = 502 Score = 675 bits (1742), Expect = 0.0 Identities = 337/420 (80%), Positives = 358/420 (85%), Gaps = 3/420 (0%) Frame = +3 Query: 3 KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182 KFPQ+ GLPHG+ES +ADTPQD+ SKIY + MKPDFIVTDMFYPWSV Sbjct: 69 KFPQIPGLPHGLESLDADTPQDMSSKIYQGLFLLKENFQQLY--MKPDFIVTDMFYPWSV 126 Query: 183 DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362 DIAAELGIPRLNCTGGSYFSHAARNSIEQF+PH NVGS++ES LLPGLPHKVEMT QLS Sbjct: 127 DIAAELGIPRLNCTGGSYFSHAARNSIEQFSPHVNVGSDHESFLLPGLPHKVEMTRSQLS 186 Query: 363 DWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFWV 542 DW+KEPNDF LMKMI D+DRKSYGSLF SFYE+EGTYEEHYQRVTGTRSW LGPVS WV Sbjct: 187 DWVKEPNDFGDLMKMIGDADRKSYGSLFRSFYEMEGTYEEHYQRVTGTRSWSLGPVSLWV 246 Query: 543 NQDDSDKANRGHAKXXXXXXNGVLKWLDSKEED-SVVYVSFGSMNKFPISQLIEIAHALE 719 NQDD DKANRG AK NGVLKWLDSKEED SVVYVSFGSMNKFPISQ IEIAHALE Sbjct: 247 NQDDFDKANRGRAKEKEEEENGVLKWLDSKEEDNSVVYVSFGSMNKFPISQHIEIAHALE 306 Query: 720 DSGYDFIWVVRKAEEG-EYGVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAVVTHC 896 DSG+DFIWVV+K EEG EYG LE+F+KRVKESNKGYLIWGWAPQL ILEHSAIG VVTHC Sbjct: 307 DSGFDFIWVVKKTEEGNEYGKLEEFEKRVKESNKGYLIWGWAPQLAILEHSAIGTVVTHC 366 Query: 897 GWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEVVKRE 1076 GWNT LESV AGLPM TWPLFAEQFYNEKLLVDVLKIGVPVGAK WKNWN++GD+VVKRE Sbjct: 367 GWNTTLESVYAGLPMVTWPLFAEQFYNEKLLVDVLKIGVPVGAKEWKNWNQYGDKVVKRE 426 Query: 1077 DIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKHQKVN 1253 DIGKAI LLM GGEE LE+R+RVNEFSDAAKKT KVGGSSHT SFK+QK N Sbjct: 427 DIGKAIALLMGGGEECLEIRKRVNEFSDAAKKTIKVGGSSHTNLKELLKELMSFKYQKAN 486 >dbj|GAU42924.1| hypothetical protein TSUD_283470 [Trifolium subterraneum] Length = 497 Score = 674 bits (1738), Expect = 0.0 Identities = 337/425 (79%), Positives = 361/425 (84%), Gaps = 3/425 (0%) Frame = +3 Query: 3 KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182 KFPQ+ GLP G+ES +A+TP+DI SKIY FRDMKPDFIVTDMFYPWSV Sbjct: 73 KFPQIPGLPLGLESVDAETPKDISSKIYQGLFLLKDNFQQLFRDMKPDFIVTDMFYPWSV 132 Query: 183 DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362 D AAELGIPRLNCTGGSYFSHAARNSIEQFAPH NVGS+ ES LLPGLPHKVEMT QLS Sbjct: 133 DTAAELGIPRLNCTGGSYFSHAARNSIEQFAPHVNVGSDYESFLLPGLPHKVEMTRSQLS 192 Query: 363 DWLKE-PNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFW 539 DW+ E NDF +MKMIKD+DR+SYGSLF SFYELEGTYEEHYQRVTGTRSW LGPVS W Sbjct: 193 DWVNERSNDFGNIMKMIKDADRRSYGSLFRSFYELEGTYEEHYQRVTGTRSWSLGPVSLW 252 Query: 540 VNQDDSDKANRGHAKXXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAHALE 719 VNQDD DKANRG+AK NGVLKWLDSKE++SVVYVSFGSMNKFPISQ IEIAHALE Sbjct: 253 VNQDDFDKANRGNAKEKEE--NGVLKWLDSKEDNSVVYVSFGSMNKFPISQHIEIAHALE 310 Query: 720 DSGYDFIWVVRKAEEGE-YGVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAVVTHC 896 DSGYDFIWVV+K EEGE YGVLE+F+KRVKESNKGYLIW WAPQLVILEHSA+GAVVTHC Sbjct: 311 DSGYDFIWVVKKTEEGEEYGVLEEFEKRVKESNKGYLIWDWAPQLVILEHSAVGAVVTHC 370 Query: 897 GWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEVVKRE 1076 GWNT LESV GLPM TWPLFAEQFYNEKLLV+VLKIGV +GAK WKNWN +GD+VVKRE Sbjct: 371 GWNTTLESVYMGLPMVTWPLFAEQFYNEKLLVNVLKIGVSIGAKEWKNWNAYGDKVVKRE 430 Query: 1077 DIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKHQKVN 1253 DIGKAI LLM GGEE LE+R+RVNE SDAAKKT KVGGSSHT SFKHQKVN Sbjct: 431 DIGKAIALLMGGGEECLEIRKRVNELSDAAKKTIKVGGSSHTNLKELLEELKSFKHQKVN 490 Query: 1254 HKMEG 1268 H+MEG Sbjct: 491 HQMEG 495 >dbj|GAU51965.1| hypothetical protein TSUD_417550 [Trifolium subterraneum] Length = 512 Score = 651 bits (1679), Expect = 0.0 Identities = 329/438 (75%), Positives = 354/438 (80%), Gaps = 17/438 (3%) Frame = +3 Query: 6 FPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSVD 185 FPQ+ GLPHG+E +ADTPQD Y RDMKPDFIVTDMFYPWSVD Sbjct: 74 FPQIPGLPHGLEIIDADTPQDSSKLFYQGLLLLQENFQQIIRDMKPDFIVTDMFYPWSVD 133 Query: 186 IAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLSD 365 IAAELGIPRLNC GGSYFSHAARNS EQFAPH NV S++E+ LPGLPHK+EMT QLSD Sbjct: 134 IAAELGIPRLNCNGGSYFSHAARNSTEQFAPHVNVSSDDETFSLPGLPHKIEMTRSQLSD 193 Query: 366 WLKEPN-DFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFWV 542 W+KEPN +F Y MKMI D+DRKSYGSLF SFYELEGTYEEHYQRVTGTRSW LGPVS WV Sbjct: 194 WVKEPNNEFGYWMKMIIDADRKSYGSLFRSFYELEGTYEEHYQRVTGTRSWSLGPVSLWV 253 Query: 543 NQDDSDKANRGHAKXXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAHALED 722 NQDD DKANRG AK NGVLKWLDSKE++SVVYVSFGSMNKF ISQ IEIAHALED Sbjct: 254 NQDDFDKANRGCAKEKEEE-NGVLKWLDSKEDNSVVYVSFGSMNKFSISQQIEIAHALED 312 Query: 723 SGYDFIWVVRKA-EEGEY--------------GVLEKFDKRVKESNKGYLIWGWAPQLVI 857 SG+DFIWVVRK +E EY +LE+F+KRVKESNKGYLIWGWAPQLVI Sbjct: 313 SGHDFIWVVRKTTKENEYLSCLGAGTVPVPDTSILEEFEKRVKESNKGYLIWGWAPQLVI 372 Query: 858 LEHSAIGAVVTHCGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWK 1037 LEHSAIGAVVTHCGWNT LES+ GLPM TWPLFAEQFYNEKLLVDVLKIGVPVG+K WK Sbjct: 373 LEHSAIGAVVTHCGWNTTLESIYMGLPMVTWPLFAEQFYNEKLLVDVLKIGVPVGSKEWK 432 Query: 1038 NWNEFGDEVVKREDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXX 1214 NWNE+GD+VVKREDIGKAIDLLM GGEE LE+R+RVNE SDAAKKT KVGGSS+T Sbjct: 433 NWNEYGDKVVKREDIGKAIDLLMGGGEECLEIRKRVNELSDAAKKTIKVGGSSYTKLKEL 492 Query: 1215 XXXXXSFKHQKVNHKMEG 1268 SFKHQKVN+KMEG Sbjct: 493 LEELKSFKHQKVNNKMEG 510 >dbj|GAU42521.1| hypothetical protein TSUD_376490 [Trifolium subterraneum] Length = 458 Score = 629 bits (1623), Expect = 0.0 Identities = 321/426 (75%), Positives = 347/426 (81%), Gaps = 4/426 (0%) Frame = +3 Query: 3 KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182 KFPQ+ GLPHG+E+ +ADTPQD+ SKIY RDMKPDFIVTDMFYPWSV Sbjct: 43 KFPQIPGLPHGLENVDADTPQDMNSKIYQGLLLLKDDFQQLIRDMKPDFIVTDMFYPWSV 102 Query: 183 DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362 DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPH NVGS++ES LLPGLPHKVEMT QLS Sbjct: 103 DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHVNVGSDDESFLLPGLPHKVEMTRSQLS 162 Query: 363 DWLKEPN-DFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFW 539 DW+K+PN +F Y MK+IKD+DRKSYGSLF SFYELEGT RSW LGPVS W Sbjct: 163 DWVKDPNLEFGYWMKVIKDADRKSYGSLFRSFYELEGT-----------RSWSLGPVSLW 211 Query: 540 VNQDDSDKANRGHAKXXXXXXNGVLKWLDSKEEDSVVYVSFGSM-NKFPISQLIEIAHAL 716 VNQDD DKANRG AK +GVLKWLDSKE++SVVYVSFGSM NKFPISQ IEIAHAL Sbjct: 212 VNQDDFDKANRGCAKEKKEE-HGVLKWLDSKEDNSVVYVSFGSMMNKFPISQHIEIAHAL 270 Query: 717 EDSGYDFIWVVRKAEE-GEYGVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAVVTH 893 EDSGYDFIWVV+K EE EYG+LE+F+KRVKESNKGYLIWGWAPQLVILEH AIG VVTH Sbjct: 271 EDSGYDFIWVVKKTEEVDEYGILEQFEKRVKESNKGYLIWGWAPQLVILEHFAIGTVVTH 330 Query: 894 CGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEVVKR 1073 CGWNT LESV LPM TWPLFAEQFYNEKLLVDVLKIGVPVGAK WKNWNE+ D+VVKR Sbjct: 331 CGWNTTLESVYMSLPMVTWPLFAEQFYNEKLLVDVLKIGVPVGAKEWKNWNEYRDKVVKR 390 Query: 1074 EDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKHQKV 1250 EDIGKAI LLM G E+ LE+R+RVNE SDAAKKT KVGGSSHT KHQK Sbjct: 391 EDIGKAIALLMDGREKCLEIRKRVNELSDAAKKTIKVGGSSHTKLKELIEELMLLKHQKA 450 Query: 1251 NHKMEG 1268 NH+M+G Sbjct: 451 NHEMKG 456 >dbj|GAU30423.1| hypothetical protein TSUD_364690 [Trifolium subterraneum] Length = 501 Score = 629 bits (1621), Expect = 0.0 Identities = 308/423 (72%), Positives = 344/423 (81%), Gaps = 2/423 (0%) Frame = +3 Query: 3 KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182 KFPQV GLP GMESFNADTP +IRSKIY FRDMKPDFIVTDMFYPWSV Sbjct: 76 KFPQVPGLPQGMESFNADTPNEIRSKIYQGLMVLQEQFKQLFRDMKPDFIVTDMFYPWSV 135 Query: 183 DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362 DIA EL IPRL C GSYF+H+A NSIE FAPHA V SN+ES LLPGLPHKVEMT LQL Sbjct: 136 DIADELRIPRLICISGSYFAHSAMNSIEVFAPHAKVNSNSESFLLPGLPHKVEMTRLQLP 195 Query: 363 DWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFWV 542 DWL+ PND+TYLMKMIK+S+RKSYGSLFDS++E+EGTYE+HY+ GT+SWG+GPVS WV Sbjct: 196 DWLRAPNDYTYLMKMIKESERKSYGSLFDSYHEIEGTYEDHYKTAMGTKSWGVGPVSLWV 255 Query: 543 NQDDSDKANRGHAKXXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAHALED 722 NQ++SDKA+RGH + VLKWLDSKEEDSV+YVSFGSMNKFP QL+EIAHALED Sbjct: 256 NQNNSDKASRGHRIEQDAEEDEVLKWLDSKEEDSVLYVSFGSMNKFPSPQLVEIAHALED 315 Query: 723 SGYDFIWVVRKAEEGE-YGVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAVVTHCG 899 SG DFIWVVRK E+GE G L +F+KRVKE NKGYLIWGWAPQL+ILEH+A+GAVVTHCG Sbjct: 316 SGNDFIWVVRKVEDGEDGGFLREFEKRVKERNKGYLIWGWAPQLLILEHAAVGAVVTHCG 375 Query: 900 WNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEVVKRED 1079 WNTI+ESVNAGLP+ATWPLFAEQFYNE+LLVDVLKIGV VGA W+NWNEFGD+VVKRED Sbjct: 376 WNTIMESVNAGLPLATWPLFAEQFYNERLLVDVLKIGVAVGANEWRNWNEFGDDVVKRED 435 Query: 1080 IGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKHQKVNH 1256 IGKAI LLM GEE LEMRRR S AAKK + GGSSHT SFK + V + Sbjct: 436 IGKAIGLLMGSGEECLEMRRRAKALSGAAKKAIEFGGSSHTKLKELLEDLKSFKLENVKN 495 Query: 1257 KME 1265 K+E Sbjct: 496 KLE 498 >ref|XP_003628138.1| UDP-glucosyltransferase family protein [Medicago truncatula] gb|AET02614.1| UDP-glucosyltransferase family protein [Medicago truncatula] Length = 464 Score = 623 bits (1606), Expect = 0.0 Identities = 304/424 (71%), Positives = 346/424 (81%), Gaps = 4/424 (0%) Frame = +3 Query: 3 KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182 KFPQV GLP GMESFNADTP+DI SKIY FRDMKPDFIVTDMFYPWSV Sbjct: 38 KFPQVPGLPQGMESFNADTPKDIISKIYQGLAILQEQFTQLFRDMKPDFIVTDMFYPWSV 97 Query: 183 DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362 D+A ELGIPRL C GGSYF+H+A NSIEQF PHA V SN+ S LLPGLPH VEMT LQL Sbjct: 98 DVADELGIPRLICIGGSYFAHSAMNSIEQFEPHAKVKSNSVSFLLPGLPHNVEMTRLQLP 157 Query: 363 DWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFWV 542 DWL+ PN +TYLMKMIKDS++KSYGSLFDS+YE+EGTYE++Y+ G++SW +GPVS W+ Sbjct: 158 DWLRAPNGYTYLMKMIKDSEKKSYGSLFDSYYEIEGTYEDYYKIAMGSKSWSVGPVSLWM 217 Query: 543 NQDDSDKANRGHAKXXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAHALED 722 N+DDSDKA RGH K GVLKWLDSK+ DSV+YVSFGSMNKFP QL+EIAHALED Sbjct: 218 NKDDSDKAGRGHGK-EEDEEEGVLKWLDSKKYDSVLYVSFGSMNKFPTPQLVEIAHALED 276 Query: 723 SGYDFIWVVRK---AEEGEYGVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAVVTH 893 SG+DFIWVVRK AE+G+ G L +F+KR+KE NKGYLIWGWAPQL+ILEH A+GAVVTH Sbjct: 277 SGHDFIWVVRKIEDAEDGDDGFLSEFEKRMKERNKGYLIWGWAPQLLILEHGAVGAVVTH 336 Query: 894 CGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEVVKR 1073 CGWNTI+ESVNAGLP+ATWPLFAEQF+NE+LLVDVLKIGV VGAK W+NWNEFGD+VVKR Sbjct: 337 CGWNTIMESVNAGLPLATWPLFAEQFFNERLLVDVLKIGVAVGAKEWRNWNEFGDDVVKR 396 Query: 1074 EDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKHQKV 1250 EDIGKAI LLM GGEE LEMR+RV S AAKK +VGGSS+T SFK +K+ Sbjct: 397 EDIGKAIGLLMGGGEECLEMRKRVKALSGAAKKAIEVGGSSYTKLKELIEELKSFKLEKI 456 Query: 1251 NHKM 1262 N K+ Sbjct: 457 NKKL 460 >gb|ABI94026.1| (iso)flavonoid glycosyltransferase [Medicago truncatula] Length = 502 Score = 623 bits (1606), Expect = 0.0 Identities = 304/424 (71%), Positives = 346/424 (81%), Gaps = 4/424 (0%) Frame = +3 Query: 3 KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182 KFPQV GLP GMESFNADTP+DI SKIY FRDMKPDFIVTDMFYPWSV Sbjct: 76 KFPQVPGLPQGMESFNADTPKDIISKIYQGLAILQEQFTQLFRDMKPDFIVTDMFYPWSV 135 Query: 183 DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362 D+A ELGIPRL C GGSYF+H+A NSIEQF PHA V SN+ S LLPGLPH VEMT LQL Sbjct: 136 DVADELGIPRLICIGGSYFAHSAMNSIEQFEPHAKVKSNSVSFLLPGLPHNVEMTRLQLP 195 Query: 363 DWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFWV 542 DWL+ PN +TYLMKMIKDS++KSYGSLFDS+YE+EGTYE++Y+ G++SW +GPVS W+ Sbjct: 196 DWLRAPNGYTYLMKMIKDSEKKSYGSLFDSYYEIEGTYEDYYKIAMGSKSWSVGPVSLWM 255 Query: 543 NQDDSDKANRGHAKXXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAHALED 722 N+DDSDKA RGH K GVLKWLDSK+ DSV+YVSFGSMNKFP QL+EIAHALED Sbjct: 256 NKDDSDKAGRGHGK-EEDEEEGVLKWLDSKKYDSVLYVSFGSMNKFPTPQLVEIAHALED 314 Query: 723 SGYDFIWVVRK---AEEGEYGVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAVVTH 893 SG+DFIWVVRK AE+G+ G L +F+KR+KE NKGYLIWGWAPQL+ILEH A+GAVVTH Sbjct: 315 SGHDFIWVVRKIEDAEDGDDGFLSEFEKRMKERNKGYLIWGWAPQLLILEHGAVGAVVTH 374 Query: 894 CGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEVVKR 1073 CGWNTI+ESVNAGLP+ATWPLFAEQF+NE+LLVDVLKIGV VGAK W+NWNEFGD+VVKR Sbjct: 375 CGWNTIMESVNAGLPLATWPLFAEQFFNERLLVDVLKIGVAVGAKEWRNWNEFGDDVVKR 434 Query: 1074 EDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKHQKV 1250 EDIGKAI LLM GGEE LEMR+RV S AAKK +VGGSS+T SFK +K+ Sbjct: 435 EDIGKAIGLLMGGGEECLEMRKRVKALSGAAKKAIEVGGSSYTKLKELIEELKSFKLEKI 494 Query: 1251 NHKM 1262 N K+ Sbjct: 495 NKKL 498 >ref|XP_013466939.1| UDP-glucosyltransferase family protein [Medicago truncatula] gb|KEH40975.1| UDP-glucosyltransferase family protein [Medicago truncatula] Length = 503 Score = 602 bits (1553), Expect = 0.0 Identities = 300/423 (70%), Positives = 340/423 (80%), Gaps = 3/423 (0%) Frame = +3 Query: 6 FPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSVD 185 FPQV GL GMESFNADTP +IRSKIY FRDMKPDFIVTDMFYPWSVD Sbjct: 77 FPQVPGLARGMESFNADTPNEIRSKIYQGLIILQEQFKQQFRDMKPDFIVTDMFYPWSVD 136 Query: 186 IAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLSD 365 +A ELGIPRL C GSYF+H+A NSIE F+P A V N+ES LLPGLPHKVEM LQL D Sbjct: 137 VADELGIPRLICISGSYFAHSAMNSIEHFSPQAKVKLNSESFLLPGLPHKVEMKRLQLPD 196 Query: 366 WLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFWVN 545 WL+ PND+TYLMKMIKDS+RKSYGSLFDS +E+E TYEEHY+ GT+SW LGPVS WVN Sbjct: 197 WLRAPNDYTYLMKMIKDSERKSYGSLFDS-HEIESTYEEHYKTAMGTKSWSLGPVSLWVN 255 Query: 546 QDDSDKANRGHAKXXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAHALEDS 725 QDDSDKA RGH K GVLKWLDSK++DSV+YVSFGSMNKFP QL+EIAHALE S Sbjct: 256 QDDSDKAGRGHGKEEDED-EGVLKWLDSKKDDSVLYVSFGSMNKFPTPQLVEIAHALEHS 314 Query: 726 GYDFIWVVRKAEEGEYG-VLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAVVTHCGW 902 G+DFIWVVRK E+ E G +F+KR+KESNKGYLIWGWAPQL+ILEH+A+GAVVTHCGW Sbjct: 315 GHDFIWVVRKIEDVEDGDFFTEFEKRMKESNKGYLIWGWAPQLLILEHAAVGAVVTHCGW 374 Query: 903 NTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEVVKREDI 1082 NTI+ESVNAGL +ATWPLFAEQF+NE+LLVDVLKIGV VGAK W+NWNEFGD+VVKR++I Sbjct: 375 NTIMESVNAGLSLATWPLFAEQFFNERLLVDVLKIGVAVGAKEWRNWNEFGDDVVKRDEI 434 Query: 1083 GKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFK-HQKVNH 1256 GKAI LLM GGEE LEMR++ S AAKK +VGGSS+T SFK +KVN+ Sbjct: 435 GKAIGLLMGGGEECLEMRKKAKALSGAAKKAIEVGGSSYTKLKQLIEELKSFKLEKKVNN 494 Query: 1257 KME 1265 K+E Sbjct: 495 KLE 497 >ref|XP_020231152.1| soyasapogenol B glucuronide galactosyltransferase-like [Cajanus cajan] gb|KYP51621.1| Anthocyanin 3'-O-beta-glucosyltransferase [Cajanus cajan] Length = 509 Score = 588 bits (1517), Expect = 0.0 Identities = 289/430 (67%), Positives = 340/430 (79%), Gaps = 9/430 (2%) Frame = +3 Query: 3 KFP-QVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWS 179 KFP + GLP G+ESFN++TPQD+ K+Y F DM+PDF+VTDMFYPW+ Sbjct: 77 KFPFEQVGLPQGVESFNSNTPQDMVKKVYEGLSILKDQYQQLFHDMQPDFLVTDMFYPWT 136 Query: 180 VDIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQL 359 VD AA+LGIPRL GG YF+H+A+N+IEQF+PH V S++E L+PGLPH++EMT LQ+ Sbjct: 137 VDAAAKLGIPRLIYVGGGYFAHSAQNAIEQFSPHTKVDSDSERFLIPGLPHELEMTRLQI 196 Query: 360 SDWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFW 539 DWL+EP D++ LMK++KDS+R+SYGSLF++FYELEGTYEEHY++ G +SW +GPVSFW Sbjct: 197 PDWLREPKDYSDLMKIMKDSERRSYGSLFNTFYELEGTYEEHYKKAMGVKSWSVGPVSFW 256 Query: 540 VNQDDSDKANRGHAK---XXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAH 710 VNQD SDKA+RGHAK G L WLDSK E+SV+YVSFGSMNKFP QL+EIAH Sbjct: 257 VNQDASDKADRGHAKEEQEGEGGGEGWLTWLDSKTENSVLYVSFGSMNKFPTPQLVEIAH 316 Query: 711 ALEDSGYDFIWVVRKAEEGE----YGVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIG 878 ALEDSG+DFIWVVRK E E LE+F++RV+ SNKGYLIWGWAPQL+ILEH AIG Sbjct: 317 ALEDSGHDFIWVVRKKGESEDCDGNEFLEEFEERVRASNKGYLIWGWAPQLLILEHLAIG 376 Query: 879 AVVTHCGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGD 1058 AVVTHCGWNTI+ESVNAGLPMATWPLFAEQFYNEKLL DVL+IGVPVGAK WKNWNEFGD Sbjct: 377 AVVTHCGWNTIIESVNAGLPMATWPLFAEQFYNEKLLADVLRIGVPVGAKEWKNWNEFGD 436 Query: 1059 EVVKREDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSF 1235 EVVKR++IGKAI +LM GGEE LEMRRRV SDAAKK +VGGSSH SF Sbjct: 437 EVVKRDEIGKAIAVLMGGGEECLEMRRRVKALSDAAKKAIQVGGSSHNKMKQLIQELKSF 496 Query: 1236 KHQKVNHKME 1265 K QK+N K E Sbjct: 497 KLQKINLKNE 506 >dbj|GAU11951.1| hypothetical protein TSUD_195750 [Trifolium subterraneum] Length = 476 Score = 579 bits (1493), Expect = 0.0 Identities = 282/382 (73%), Positives = 318/382 (83%), Gaps = 2/382 (0%) Frame = +3 Query: 126 FRDMKPDFIVTDMFYPWSVDIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNE 305 FR+MKPDFIVT MFYPW+VDIA ELGIPR C GGSYF+H+A NSIE FAPH V SN+E Sbjct: 92 FREMKPDFIVTYMFYPWTVDIADELGIPRFICIGGSYFAHSAMNSIEVFAPHEKVNSNSE 151 Query: 306 SVLLPGLPHKVEMTCLQLSDWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEH 485 S LLPGLPHKVEMT LQL DWL+ PN++TYLMKMIK+S+RKSYGSLFDS+YE+EGTYE+H Sbjct: 152 SFLLPGLPHKVEMTRLQLPDWLRAPNNYTYLMKMIKESERKSYGSLFDSYYEIEGTYEDH 211 Query: 486 YQRVTGTRSWGLGPVSFWVNQDDSDKANRGHAKXXXXXXNGVLKWLDSKEEDSVVYVSFG 665 Y+ GT+SWG+GPVS WVNQDDSDKA RG+ K +GVLKWLDSKEEDSV+YVSFG Sbjct: 212 YKTAMGTKSWGVGPVSLWVNQDDSDKAGRGNGKKQDEKEDGVLKWLDSKEEDSVLYVSFG 271 Query: 666 SMNKFPISQLIEIAHALEDSGYDFIWVVRKAEEGEYG-VLEKFDKRVKESNKGYLIWGWA 842 SM KFP QL+EIA ALEDSG +FIWVVRK E GE G L +F+KRVKESNKGYLIWGWA Sbjct: 272 SMTKFPSPQLVEIAQALEDSGNNFIWVVRKIEHGEDGSFLREFEKRVKESNKGYLIWGWA 331 Query: 843 PQLVILEHSAIGAVVTHCGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVG 1022 PQL+ILEH+A+GA+VT CGWNTI+ESVNAGLP+ATWPLFAEQFYNE+LLVDVLKIGV VG Sbjct: 332 PQLLILEHAAVGAMVTRCGWNTIMESVNAGLPLATWPLFAEQFYNERLLVDVLKIGVAVG 391 Query: 1023 AKVWKNWNEFGDEVVKREDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHT 1199 AK W+NWNEFGD+VVKREDIGKAI LLM GEE LEMRRR S AAKK + GGSSHT Sbjct: 392 AKEWRNWNEFGDDVVKREDIGKAIGLLMGCGEECLEMRRRAKALSGAAKKAIEFGGSSHT 451 Query: 1200 XXXXXXXXXXSFKHQKVNHKME 1265 S K +KVN+K+E Sbjct: 452 KLKELNEDLKSIKLEKVNNKLE 473 >gb|AMQ26114.1| UDP-glycosyltransferase 41 [Pueraria montana var. lobata] Length = 504 Score = 575 bits (1481), Expect = 0.0 Identities = 280/427 (65%), Positives = 332/427 (77%), Gaps = 6/427 (1%) Frame = +3 Query: 3 KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182 KFPQV GLP GMESFNA TP D+ +KI FRDMKPDFIV+DMFYPW+V Sbjct: 75 KFPQVPGLPQGMESFNASTPTDMVAKISHALSTLEGQFRQVFRDMKPDFIVSDMFYPWTV 134 Query: 183 DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362 D AAELGIPRL GG+YF+H A +S+E+F PH N+GS++ES L+PGLPH+ EMT QL Sbjct: 135 DAAAELGIPRLIYVGGTYFAHCAMDSLERFEPHTNLGSDDESFLIPGLPHEFEMTRSQLP 194 Query: 363 DWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFWV 542 D K PND TY+MK +K+S+++SYGS+F SFY EG YEEHY+++ GT+SW +GP+S WV Sbjct: 195 DRFKAPNDMTYIMKRVKESEKRSYGSVFKSFYAFEGAYEEHYRKIMGTKSWNVGPISSWV 254 Query: 543 NQDDSDKANRGHAK----XXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAH 710 NQD SDKA+RGH K G WLDSK+E+SV+YV FGSMN FP SQL+EIA+ Sbjct: 255 NQDASDKASRGHGKEELQEEGKGKEGWFAWLDSKKEESVLYVCFGSMNNFPTSQLVEIAY 314 Query: 711 ALEDSGYDFIWVVRKAEEGE-YGVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAVV 887 ALED G+DFIWVVRK +EGE G +E+F+KRV+ SNKGYLIWGWAPQL+ILEH AIGAVV Sbjct: 315 ALEDCGHDFIWVVRKIDEGEARGFVEEFEKRVQASNKGYLIWGWAPQLLILEHPAIGAVV 374 Query: 888 THCGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEVV 1067 THCG NT++ESV+AGLP+ TWPLFAEQF+NE+LLVDVLKIGVP+GAK WKNWNEFGDE+V Sbjct: 375 THCGMNTVIESVDAGLPLVTWPLFAEQFFNERLLVDVLKIGVPIGAKKWKNWNEFGDEIV 434 Query: 1068 KREDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKHQ 1244 KREDIGKAI LLM GGEES EMRRRV SDAAKK +VGGSSH S K + Sbjct: 435 KREDIGKAIALLMGGGEESEEMRRRVKALSDAAKKAIQVGGSSHNSLKDLIEELKSLKLR 494 Query: 1245 KVNHKME 1265 KVN K++ Sbjct: 495 KVNGKLD 501 >gb|KHN10128.1| UDP-glucose flavonoid 3-O-glucosyltransferase 7 [Glycine soja] Length = 498 Score = 572 bits (1474), Expect = 0.0 Identities = 283/426 (66%), Positives = 330/426 (77%), Gaps = 7/426 (1%) Frame = +3 Query: 3 KFP-QVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWS 179 KFP + GLP G+ESFN++TP+D+ KIY F D++PDF+ TDMFYPW+ Sbjct: 73 KFPCEQVGLPEGVESFNSNTPRDLVPKIYQGLTILQDQYQQLFHDLQPDFLFTDMFYPWT 132 Query: 180 VDIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQL 359 VD AA+LGIPRL G Y +H+++N+IEQF+PH V S+ ES LLPGLPH+++MT LQL Sbjct: 133 VDAAAKLGIPRLIYVSGGYLAHSSQNTIEQFSPHTKVDSDTESFLLPGLPHELKMTRLQL 192 Query: 360 SDWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFW 539 DWL+ P +TYLM M+KDS+RKSYGSL ++FYELEG YEEHY++ GT+SW +GPVSFW Sbjct: 193 PDWLRAPTGYTYLMNMMKDSERKSYGSLLNTFYELEGDYEEHYKKAMGTKSWSVGPVSFW 252 Query: 540 VNQDDSDKANRGHAK-XXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAHAL 716 VNQD DKA+RGHAK G L WLDSK E+SV+YVSFGSMNKFP QL+EIAHAL Sbjct: 253 VNQDALDKADRGHAKEEQGEGEEGWLTWLDSKTENSVLYVSFGSMNKFPTPQLVEIAHAL 312 Query: 717 EDSGYDFIWVVRKAEEGEYG----VLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAV 884 EDS +DFIWVVRK E E G L++FDKRVK SNKGYLIWGWAPQL+ILEH AIGAV Sbjct: 313 EDSDHDFIWVVRKKGESEDGEGNDFLQEFDKRVKASNKGYLIWGWAPQLLILEHHAIGAV 372 Query: 885 VTHCGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEV 1064 VTHCGWNTI+ESVNAGLPMATWPLFAEQFYNEKLL +VL+IGVPVGAK W+NWNEFGDEV Sbjct: 373 VTHCGWNTIIESVNAGLPMATWPLFAEQFYNEKLLAEVLRIGVPVGAKEWRNWNEFGDEV 432 Query: 1065 VKREDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKH 1241 VKRE+IG AI +LM GGEES+EMRRR SDAAKK +VGGSSH S K Sbjct: 433 VKREEIGNAIGVLM-GGEESIEMRRRAKALSDAAKKAIQVGGSSHNNLKELIQELKSLKL 491 Query: 1242 QKVNHK 1259 QK NHK Sbjct: 492 QKANHK 497 >ref|XP_003546674.1| PREDICTED: soyasapogenol B glucuronide galactosyltransferase-like [Glycine max] gb|KRH13189.1| hypothetical protein GLYMA_15G221300 [Glycine max] Length = 501 Score = 572 bits (1474), Expect = 0.0 Identities = 283/426 (66%), Positives = 330/426 (77%), Gaps = 7/426 (1%) Frame = +3 Query: 3 KFP-QVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWS 179 KFP + GLP G+ESFN++TP+D+ KIY F D++PDF+ TDMFYPW+ Sbjct: 76 KFPCEQVGLPEGVESFNSNTPRDLVPKIYQGLTILQDQYQQLFHDLQPDFLFTDMFYPWT 135 Query: 180 VDIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQL 359 VD AA+LGIPRL G Y +H+++N+IEQF+PH V S+ ES LLPGLPH+++MT LQL Sbjct: 136 VDAAAKLGIPRLIYVSGGYLAHSSQNTIEQFSPHTKVDSDTESFLLPGLPHELKMTRLQL 195 Query: 360 SDWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFW 539 DWL+ P +TYLM M+KDS+RKSYGSL ++FYELEG YEEHY++ GT+SW +GPVSFW Sbjct: 196 PDWLRAPTGYTYLMNMMKDSERKSYGSLLNTFYELEGDYEEHYKKAMGTKSWSVGPVSFW 255 Query: 540 VNQDDSDKANRGHAK-XXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAHAL 716 VNQD DKA+RGHAK G L WLDSK E+SV+YVSFGSMNKFP QL+EIAHAL Sbjct: 256 VNQDALDKADRGHAKEEQGEGEEGWLTWLDSKTENSVLYVSFGSMNKFPTPQLVEIAHAL 315 Query: 717 EDSGYDFIWVVRKAEEGEYG----VLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAV 884 EDS +DFIWVVRK E E G L++FDKRVK SNKGYLIWGWAPQL+ILEH AIGAV Sbjct: 316 EDSDHDFIWVVRKKGESEDGEGNDFLQEFDKRVKASNKGYLIWGWAPQLLILEHHAIGAV 375 Query: 885 VTHCGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEV 1064 VTHCGWNTI+ESVNAGLPMATWPLFAEQFYNEKLL +VL+IGVPVGAK W+NWNEFGDEV Sbjct: 376 VTHCGWNTIIESVNAGLPMATWPLFAEQFYNEKLLAEVLRIGVPVGAKEWRNWNEFGDEV 435 Query: 1065 VKREDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKH 1241 VKRE+IG AI +LM GGEES+EMRRR SDAAKK +VGGSSH S K Sbjct: 436 VKREEIGNAIGVLM-GGEESIEMRRRAKALSDAAKKAIQVGGSSHNNLKELIQELKSLKL 494 Query: 1242 QKVNHK 1259 QK NHK Sbjct: 495 QKANHK 500 >ref|XP_020211411.1| soyasapogenol B glucuronide galactosyltransferase-like [Cajanus cajan] Length = 522 Score = 558 bits (1438), Expect = 0.0 Identities = 276/430 (64%), Positives = 330/430 (76%), Gaps = 9/430 (2%) Frame = +3 Query: 3 KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182 KFPQ++GLP GMES NA TP+D+ SKI FRDMKPDFIV+DMFYPWSV Sbjct: 89 KFPQISGLPEGMESINASTPKDMTSKIVEGMSILERQFRQAFRDMKPDFIVSDMFYPWSV 148 Query: 183 DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362 ++AAEL IPRL GG+YF+H A +S+E+F PH VGS++ES LLPGLPH++EM QL Sbjct: 149 EVAAELEIPRLIYVGGTYFAHCAMDSLERFEPHNKVGSDDESFLLPGLPHQIEMIRSQLP 208 Query: 363 DWLKEPN-DFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFW 539 + PN F YLMK +K+S++KSYGS+ SF+E EG YEEHY+++ GT+SW +GP+S W Sbjct: 209 IRFRNPNHQFGYLMKAVKESEKKSYGSVLKSFHEFEGDYEEHYKKIMGTKSWNVGPISSW 268 Query: 540 VNQDDSDKANRGHAKXXXXXXN------GVLKWLDSKEEDSVVYVSFGSMNKFPISQLIE 701 VNQD SDKA RGHAK G L WLDSK+EDSV+YV FGSMN FP +QL+E Sbjct: 269 VNQDASDKAGRGHAKEEEEEEEEEKGKEGWLAWLDSKKEDSVLYVCFGSMNNFPSAQLVE 328 Query: 702 IAHALEDSGYDFIWVVRKAEEGEY-GVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIG 878 IAHALEDSG+DF+WVVRK +EGE G +E+F+KRV+ SNKGYLIWGWAPQL+ILEH AIG Sbjct: 329 IAHALEDSGHDFLWVVRKVDEGEAKGFVEEFEKRVQASNKGYLIWGWAPQLLILEHPAIG 388 Query: 879 AVVTHCGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGD 1058 AVVTHCG NT++ESV+AGLP+ TWPLF+EQF+NEKLLVDVLKIGVPVGAK W++WNE GD Sbjct: 389 AVVTHCGMNTVIESVDAGLPLVTWPLFSEQFFNEKLLVDVLKIGVPVGAKKWRDWNELGD 448 Query: 1059 EVVKREDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSF 1235 E+VKREDIGKAI LLMSGGEESLE+R+R S AAKK +VGGSS+ S Sbjct: 449 EIVKREDIGKAIALLMSGGEESLEIRKRAKAMSVAAKKAIQVGGSSYNSLKELIEELRSL 508 Query: 1236 KHQKVNHKME 1265 K QK N KME Sbjct: 509 KLQKANRKME 518 >ref|XP_020211413.1| soyasapogenol B glucuronide galactosyltransferase-like [Cajanus cajan] Length = 519 Score = 555 bits (1429), Expect = 0.0 Identities = 275/427 (64%), Positives = 326/427 (76%), Gaps = 6/427 (1%) Frame = +3 Query: 3 KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182 KFPQ+ GLP GMES NA TP+D+ SKI FRDMKPDFIVTDMFY WSV Sbjct: 89 KFPQIPGLPEGMESINASTPKDMTSKIVEGMSILELQFRQAFRDMKPDFIVTDMFYVWSV 148 Query: 183 DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362 ++AAEL IPRL GG+YF+H A +S+E+F PH VGS++ES LLPGLPH++EM QL Sbjct: 149 EVAAELDIPRLIYMGGTYFAHCAMDSLERFEPHNKVGSDDESFLLPGLPHEIEMIRSQLP 208 Query: 363 DWLKEPN-DFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFW 539 + PN + YLMK +K+S +KSYGS+F SF E EG YEEHY+++ GT+SW +GP+S W Sbjct: 209 IRFRNPNHQYEYLMKAVKESAKKSYGSVFKSFREFEGVYEEHYKKIMGTKSWNVGPISSW 268 Query: 540 VNQDDSDKANRGHAKXXXXXXNGV---LKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAH 710 VNQD SDKA RGHAK G L WLDSK+EDSV+YV FGSMN FP +QL+EIAH Sbjct: 269 VNQDASDKAGRGHAKEEEEEEKGKEGWLAWLDSKKEDSVLYVCFGSMNNFPSTQLVEIAH 328 Query: 711 ALEDSGYDFIWVVRKAEEGEY-GVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAVV 887 ALEDSG+DF+WVVRK +EGE G +E+F+KRV+ SNKGYLIWGWAPQL+ILEH A+GAVV Sbjct: 329 ALEDSGHDFLWVVRKVDEGEAKGFVEEFEKRVQASNKGYLIWGWAPQLLILEHPAVGAVV 388 Query: 888 THCGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEVV 1067 THCG NT++ESV+AGLP+ TWPLFAEQF+NEKLLVDVLKIGVP+GAK W+ WN GDE+V Sbjct: 389 THCGMNTVIESVDAGLPLVTWPLFAEQFFNEKLLVDVLKIGVPIGAKKWREWNGLGDEIV 448 Query: 1068 KREDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKHQ 1244 KREDIGKAI LLMSGGEESLE+R+R S AAKK +VGGSS+ S K Q Sbjct: 449 KREDIGKAIALLMSGGEESLEIRKRAKAMSVAAKKAIQVGGSSYNSLKELIEELRSLKLQ 508 Query: 1245 KVNHKME 1265 KVN KME Sbjct: 509 KVNRKME 515 >ref|XP_007142833.1| hypothetical protein PHAVU_007G020800g [Phaseolus vulgaris] gb|ESW14827.1| hypothetical protein PHAVU_007G020800g [Phaseolus vulgaris] Length = 494 Score = 542 bits (1397), Expect = 0.0 Identities = 266/418 (63%), Positives = 315/418 (75%), Gaps = 1/418 (0%) Frame = +3 Query: 3 KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182 KFPQ+ GLP G+E+ N+DTP + KI FR M+PDFIVTDMFYPWS Sbjct: 75 KFPQIPGLPEGVETINSDTPPPLTMKIGEALSILQGQYQQLFRLMQPDFIVTDMFYPWSA 134 Query: 183 DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362 D AAELGIPRL G SYFSH A N +E+FAPH V S+ ES LPGLPHK+EMT LQL Sbjct: 135 DAAAELGIPRLVYVGASYFSHCAMNCVEEFAPHDKVDSDGESFELPGLPHKLEMTRLQLP 194 Query: 363 DWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFWV 542 DWL+ P +TYL KM+K+S++KSYGS+F SFYE EG YEEHY+RV GT+SW +GPVS WV Sbjct: 195 DWLRAPKPYTYLKKMMKESEKKSYGSVFKSFYEFEGAYEEHYKRVMGTKSWSIGPVSLWV 254 Query: 543 NQDDSDKANRGHAKXXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAHALED 722 NQD+SDKA RG AK +++WLDSK+E+SV+YVSFGSMNKFP +QL+EIAHALED Sbjct: 255 NQDESDKAGRGQAKEGKGTDEELIRWLDSKKENSVLYVSFGSMNKFPTTQLVEIAHALED 314 Query: 723 SGYDFIWVVRKAEEGEYGVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAVVTHCGW 902 SG+DFIWVVRK ++G++ LE+F+KRV+ SN+GYLIWGWAPQLVIL+H A GAVVTHCG Sbjct: 315 SGHDFIWVVRKNDDGDF--LEEFEKRVQGSNRGYLIWGWAPQLVILDHPATGAVVTHCGM 372 Query: 903 NTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEVVKREDI 1082 NT+ ESV AGLPM WPLF+EQF+NEKL+VDVLKIGV VGAK W+N N+FG E VKRE I Sbjct: 373 NTVFESVIAGLPMVAWPLFSEQFFNEKLVVDVLKIGVSVGAKEWRNLNDFGSETVKREAI 432 Query: 1083 GKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKHQKVN 1253 G+AI L M GGEE +EMRRRV SD AKK + G+SH S K QK N Sbjct: 433 GEAIGLSMGGGEECVEMRRRVKVLSDEAKKAIQSDGTSHNNLQELIQELKSLKLQKDN 490 >gb|KHN37410.1| UDP-glucose flavonoid 3-O-glucosyltransferase 7 [Glycine soja] Length = 491 Score = 540 bits (1390), Expect = 0.0 Identities = 266/426 (62%), Positives = 322/426 (75%), Gaps = 9/426 (2%) Frame = +3 Query: 3 KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182 KFPQV GLP G+ESFNA TP D+ +KI FRD+KPDFIV+DMFYPWSV Sbjct: 65 KFPQVPGLPQGLESFNASTPADMVTKIGHALSILEGPFRQLFRDIKPDFIVSDMFYPWSV 124 Query: 183 DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362 D A ELGIPRL GG+YF+H A +S+E+F PH VGS++ES L+PGLPH+ EMT Q+ Sbjct: 125 DAADELGIPRLIYVGGTYFAHCAMDSLERFEPHTKVGSDDESFLIPGLPHEFEMTRSQIP 184 Query: 363 DWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFWV 542 D K P++ TYLMK IK+S+++SYGS+F SFY EG YE+HY+++ GT+SW LGP+S WV Sbjct: 185 DRFKAPDNLTYLMKTIKESEKRSYGSVFKSFYAFEGAYEDHYRKIMGTKSWNLGPISSWV 244 Query: 543 NQDDSDKANRG-------HAKXXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIE 701 NQD SDKA+RG + L WLDSK+E SV+YV FGSMN FP +QL+E Sbjct: 245 NQDASDKASRGSRDNKAKEEQVEEGKDGSWLAWLDSKKEGSVLYVCFGSMNNFPTTQLVE 304 Query: 702 IAHALEDSGYDFIWVVRKAEEGEY-GVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIG 878 IAHALEDSG+DFIWVV K +EGE G +++F+KRV+ SNKGYLI GWAPQL+ILEH +IG Sbjct: 305 IAHALEDSGHDFIWVVGKTDEGETKGFVDEFEKRVQASNKGYLICGWAPQLLILEHPSIG 364 Query: 879 AVVTHCGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGD 1058 AVVTHCG NT++ESV+AGLP+ TWPLFAEQF+NE+LLVDVLKIGV +GAK W NWN+FGD Sbjct: 365 AVVTHCGMNTVIESVDAGLPLVTWPLFAEQFFNERLLVDVLKIGVAIGAKKWNNWNDFGD 424 Query: 1059 EVVKREDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSF 1235 E+VKREDIGKAI LLM GGEES EMR+RV SDAAKK +VGGSSH S Sbjct: 425 EIVKREDIGKAIALLMGGGEESEEMRKRVKALSDAAKKAIQVGGSSHNSLKDLIEELKSL 484 Query: 1236 KHQKVN 1253 K QK++ Sbjct: 485 KLQKLS 490 >ref|XP_003536714.1| PREDICTED: soyasapogenol B glucuronide galactosyltransferase-like [Glycine max] gb|KRH36042.1| hypothetical protein GLYMA_10G280400 [Glycine max] Length = 505 Score = 540 bits (1391), Expect = 0.0 Identities = 268/426 (62%), Positives = 321/426 (75%), Gaps = 9/426 (2%) Frame = +3 Query: 3 KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182 KFPQV GLP G+ESFNA TP D+ +KI FRD+KPDFIV+DMFYPWSV Sbjct: 79 KFPQVPGLPQGLESFNASTPADMVTKIGHALSILEGPFRQLFRDIKPDFIVSDMFYPWSV 138 Query: 183 DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362 D A ELGIPRL GG+YF+H A +S+E+F PH VGS++ES L+PGLPH+ EMT Q+ Sbjct: 139 DAADELGIPRLIYVGGTYFAHCAMDSLERFEPHTKVGSDDESFLIPGLPHEFEMTRSQIP 198 Query: 363 DWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFWV 542 D K P++ TYLMK IK+S+++SYGS+F SFY EG YE+HY+++ GT+SW LGP+S WV Sbjct: 199 DRFKAPDNLTYLMKTIKESEKRSYGSVFKSFYAFEGAYEDHYRKIMGTKSWNLGPISSWV 258 Query: 543 NQDDSDKANRG-------HAKXXXXXXNGVLKWLDSKEEDSVVYVSFGSMNKFPISQLIE 701 NQD SDKA+RG + L WLDSK+E SV+YV FGSMN FP +QL E Sbjct: 259 NQDASDKASRGSRDNKAKEEQVEEGKDGSWLAWLDSKKEGSVLYVCFGSMNNFPTTQLGE 318 Query: 702 IAHALEDSGYDFIWVVRKAEEGEY-GVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIG 878 IAHALEDSG+DFIWVV K +EGE G +E+F+KRV+ SNKGYLI GWAPQL+ILEH +IG Sbjct: 319 IAHALEDSGHDFIWVVGKTDEGETKGFVEEFEKRVQASNKGYLICGWAPQLLILEHPSIG 378 Query: 879 AVVTHCGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGD 1058 AVVTHCG NT++ESV+AGLP+ TWPLFAEQF+NE+LLVDVLKIGV +GAK W NWN+FGD Sbjct: 379 AVVTHCGMNTVIESVDAGLPLVTWPLFAEQFFNERLLVDVLKIGVAIGAKKWNNWNDFGD 438 Query: 1059 EVVKREDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSF 1235 E+VKREDIGKAI LLM GGEES EMR+RV SDAAKK +VGGSSH S Sbjct: 439 EIVKREDIGKAIALLMGGGEESEEMRKRVKALSDAAKKAIQVGGSSHNSLKDLIEELKSL 498 Query: 1236 KHQKVN 1253 K QK+N Sbjct: 499 KLQKLN 504 >ref|XP_014512855.1| soyasapogenol B glucuronide galactosyltransferase [Vigna radiata var. radiata] Length = 493 Score = 535 bits (1379), Expect = 0.0 Identities = 266/418 (63%), Positives = 314/418 (75%), Gaps = 4/418 (0%) Frame = +3 Query: 3 KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182 KFPQV GLP G+E+ NADTP + KI FR MKPDFIVTDMFYPWS Sbjct: 75 KFPQVPGLPEGIETINADTPPLLTMKISEALSILQGQYQELFRVMKPDFIVTDMFYPWSA 134 Query: 183 DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362 D AAELGIPRL G SYFSH A N +E+FAPHA V S+ ES LPGLPHK+EMT QL Sbjct: 135 DAAAELGIPRLVYVGASYFSHCAMNCVEEFAPHAKVDSDGESFELPGLPHKLEMTRSQLP 194 Query: 363 DWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFWV 542 DWL+ P +TYL KMIK+S++KSYGSLF SFYE EG YEEHY+RV GT+SW +GPVS WV Sbjct: 195 DWLRAPKPYTYLKKMIKESEKKSYGSLFKSFYEFEGAYEEHYKRVMGTKSWSIGPVSLWV 254 Query: 543 NQDDSDKANRGHAKXXXXXXNG--VLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAHAL 716 NQD+ DKA RGHAK +++WLD+K+E+SV+YVSFGSMNKFP +QL+EIAHAL Sbjct: 255 NQDELDKAGRGHAKEGEGKGTNEELMRWLDTKKENSVLYVSFGSMNKFPTAQLVEIAHAL 314 Query: 717 EDSGYDFIWVVRKAEE-GEYGVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAVVTH 893 ED G+DFIWVVRK ++ G+ G LE+F+KRV+ESN+GYLIWGWAPQL IL+H A GAVVTH Sbjct: 315 EDCGHDFIWVVRKNDDHGDKGFLEEFEKRVQESNRGYLIWGWAPQLAILDHPATGAVVTH 374 Query: 894 CGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEVVKR 1073 CG NT+ ESV AGLP+ WP+F+EQF+NEKL+VDVLKIGV VGAK W+N N+FG E VKR Sbjct: 375 CGMNTVFESVIAGLPLVAWPIFSEQFFNEKLVVDVLKIGVSVGAKEWRNLNDFGSETVKR 434 Query: 1074 EDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKHQ 1244 E+I KA+ L+M GGEE +EMRRRV SD AKK + GG+SH S K Q Sbjct: 435 EEIRKAVVLVM-GGEECVEMRRRVKVLSDEAKKAIQSGGTSHNNLKELIEELKSLKLQ 491 >ref|XP_017413459.1| PREDICTED: soyasapogenol B glucuronide galactosyltransferase-like [Vigna angularis] gb|KOM36380.1| hypothetical protein LR48_Vigan02g253000 [Vigna angularis] dbj|BAT93707.1| hypothetical protein VIGAN_08023500 [Vigna angularis var. angularis] dbj|BAT93708.1| hypothetical protein VIGAN_08023600 [Vigna angularis var. angularis] Length = 494 Score = 532 bits (1370), Expect = 0.0 Identities = 265/419 (63%), Positives = 312/419 (74%), Gaps = 4/419 (0%) Frame = +3 Query: 3 KFPQVAGLPHGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMKPDFIVTDMFYPWSV 182 KFPQV GLP G+E+ NADTP + KI FR MKPDFIVTDMFYPWS Sbjct: 75 KFPQVPGLPEGVETINADTPPLLTMKISEGLSILQGQYQELFRVMKPDFIVTDMFYPWSA 134 Query: 183 DIAAELGIPRLNCTGGSYFSHAARNSIEQFAPHANVGSNNESVLLPGLPHKVEMTCLQLS 362 D AAELGIPRL G YFSH A N +EQFAPHA V S+ ES LPGLPHK+EMT QL Sbjct: 135 DAAAELGIPRLVYVGACYFSHCAMNCVEQFAPHAKVDSDGESFELPGLPHKLEMTRSQLP 194 Query: 363 DWLKEPNDFTYLMKMIKDSDRKSYGSLFDSFYELEGTYEEHYQRVTGTRSWGLGPVSFWV 542 DWL+ P +TYL KMIK+S++KSYGSLF SFYE EG YEEHY+RV GT+SW +GPVS WV Sbjct: 195 DWLRAPKPYTYLKKMIKESEKKSYGSLFKSFYEFEGAYEEHYKRVMGTKSWSIGPVSLWV 254 Query: 543 NQDDSDKANRGHAKXXXXXXNG--VLKWLDSKEEDSVVYVSFGSMNKFPISQLIEIAHAL 716 N+D+ DKA RGHAK +++WLDSK+E+ V+YVSFGSMNKFP +QL+EIAHAL Sbjct: 255 NEDELDKAGRGHAKEGEGKRTDEELMRWLDSKKENCVLYVSFGSMNKFPTAQLVEIAHAL 314 Query: 717 EDSGYDFIWVVRK-AEEGEYGVLEKFDKRVKESNKGYLIWGWAPQLVILEHSAIGAVVTH 893 ED G+DFIWVVRK ++G+ G LE+F+KRV+ESN GYLIWGWAPQL IL+H A GAVVTH Sbjct: 315 EDCGHDFIWVVRKNDDDGDRGFLEEFEKRVQESNNGYLIWGWAPQLAILDHPATGAVVTH 374 Query: 894 CGWNTILESVNAGLPMATWPLFAEQFYNEKLLVDVLKIGVPVGAKVWKNWNEFGDEVVKR 1073 CG NT+ ESV AGLP+ WP+F+EQF+NEKL+VDVLKIGV VGAK W+N N+FG E VKR Sbjct: 375 CGMNTVFESVIAGLPLVAWPIFSEQFFNEKLVVDVLKIGVSVGAKEWRNLNDFGSETVKR 434 Query: 1074 EDIGKAIDLLMSGGEESLEMRRRVNEFSDAAKKT-KVGGSSHTXXXXXXXXXXSFKHQK 1247 E+I KA+ L+M GGEE +EMR+RV SD AKK + GG+SH S K QK Sbjct: 435 EEIRKAVVLVM-GGEECVEMRKRVKVLSDEAKKAIQSGGTSHNNLKELIEELKSVKLQK 492