BLASTX nr result

ID: Astragalus22_contig00018709 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00018709
         (702 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003628138.1| UDP-glucosyltransferase family protein [Medi...   371   e-125
gb|ABI94026.1| (iso)flavonoid glycosyltransferase [Medicago trun...   371   e-124
dbj|GAU30423.1| hypothetical protein TSUD_364690 [Trifolium subt...   370   e-124
ref|XP_013466939.1| UDP-glucosyltransferase family protein [Medi...   367   e-123
dbj|GAU30424.1| hypothetical protein TSUD_364700, partial [Trifo...   348   e-118
dbj|GAU42924.1| hypothetical protein TSUD_283470 [Trifolium subt...   345   e-114
dbj|GAU12394.1| hypothetical protein TSUD_253450 [Trifolium subt...   342   e-113
gb|PNX59314.1| UDP-glycosyltransferase 73B3-like protein, partia...   330   e-111
dbj|GAU51965.1| hypothetical protein TSUD_417550 [Trifolium subt...   336   e-110
ref|XP_020231152.1| soyasapogenol B glucuronide galactosyltransf...   336   e-110
dbj|GAU11951.1| hypothetical protein TSUD_195750 [Trifolium subt...   332   e-109
gb|KHN10128.1| UDP-glucose flavonoid 3-O-glucosyltransferase 7 [...   324   e-106
ref|XP_003546674.1| PREDICTED: soyasapogenol B glucuronide galac...   324   e-106
dbj|GAU42521.1| hypothetical protein TSUD_376490 [Trifolium subt...   320   e-105
dbj|GAU26052.1| hypothetical protein TSUD_225140 [Trifolium subt...   312   e-101
gb|PNY00105.1| anthocyanin 3'-O-beta-glucosyltransferase [Trifol...   310   e-101
ref|XP_007142833.1| hypothetical protein PHAVU_007G020800g [Phas...   309   e-100
ref|XP_014512855.1| soyasapogenol B glucuronide galactosyltransf...   308   e-100
ref|XP_017413459.1| PREDICTED: soyasapogenol B glucuronide galac...   307   2e-99
ref|NP_001304384.2| soyasapogenol B glucuronide galactosyltransf...   306   4e-99

>ref|XP_003628138.1| UDP-glucosyltransferase family protein [Medicago truncatula]
 gb|AET02614.1| UDP-glucosyltransferase family protein [Medicago truncatula]
          Length = 464

 Score =  371 bits (952), Expect = e-125
 Identities = 178/234 (76%), Positives = 197/234 (84%)
 Frame = -1

Query: 702 LPLGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMNPDFLVTDMFYPWSVDIATELG 523
           LP GMESFNADTP+DI SKIY             FRDM PDF+VTDMFYPWSVD+A ELG
Sbjct: 45  LPQGMESFNADTPKDIISKIYQGLAILQEQFTQLFRDMKPDFIVTDMFYPWSVDVADELG 104

Query: 522 IPRLICNSGSYFAHSAMNSIQQFSPHANADSNDESFLLLGLPHKVEMTRLQLPDWLRSPN 343
           IPRLIC  GSYFAHSAMNSI+QF PHA   SN  SFLL GLPH VEMTRLQLPDWLR+PN
Sbjct: 105 IPRLICIGGSYFAHSAMNSIEQFEPHAKVKSNSVSFLLPGLPHNVEMTRLQLPDWLRAPN 164

Query: 342 DYTNLMKMIKDSERKSYGSLFDSFYELEGTYEEHYKIAMGTKSWGLGPVSFWVNQDDSDK 163
            YT LMKMIKDSE+KSYGSLFDS+YE+EGTYE++YKIAMG+KSW +GPVS W+N+DDSDK
Sbjct: 165 GYTYLMKMIKDSEKKSYGSLFDSYYEIEGTYEDYYKIAMGSKSWSVGPVSLWMNKDDSDK 224

Query: 162 ANRGHAKEKEEENGVLKWLDSKEDDSVVYVSFGSMNKFPISQLIEIAHALEDSG 1
           A RGH KE++EE GVLKWLDSK+ DSV+YVSFGSMNKFP  QL+EIAHALEDSG
Sbjct: 225 AGRGHGKEEDEEEGVLKWLDSKKYDSVLYVSFGSMNKFPTPQLVEIAHALEDSG 278


>gb|ABI94026.1| (iso)flavonoid glycosyltransferase [Medicago truncatula]
          Length = 502

 Score =  371 bits (952), Expect = e-124
 Identities = 178/234 (76%), Positives = 197/234 (84%)
 Frame = -1

Query: 702 LPLGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMNPDFLVTDMFYPWSVDIATELG 523
           LP GMESFNADTP+DI SKIY             FRDM PDF+VTDMFYPWSVD+A ELG
Sbjct: 83  LPQGMESFNADTPKDIISKIYQGLAILQEQFTQLFRDMKPDFIVTDMFYPWSVDVADELG 142

Query: 522 IPRLICNSGSYFAHSAMNSIQQFSPHANADSNDESFLLLGLPHKVEMTRLQLPDWLRSPN 343
           IPRLIC  GSYFAHSAMNSI+QF PHA   SN  SFLL GLPH VEMTRLQLPDWLR+PN
Sbjct: 143 IPRLICIGGSYFAHSAMNSIEQFEPHAKVKSNSVSFLLPGLPHNVEMTRLQLPDWLRAPN 202

Query: 342 DYTNLMKMIKDSERKSYGSLFDSFYELEGTYEEHYKIAMGTKSWGLGPVSFWVNQDDSDK 163
            YT LMKMIKDSE+KSYGSLFDS+YE+EGTYE++YKIAMG+KSW +GPVS W+N+DDSDK
Sbjct: 203 GYTYLMKMIKDSEKKSYGSLFDSYYEIEGTYEDYYKIAMGSKSWSVGPVSLWMNKDDSDK 262

Query: 162 ANRGHAKEKEEENGVLKWLDSKEDDSVVYVSFGSMNKFPISQLIEIAHALEDSG 1
           A RGH KE++EE GVLKWLDSK+ DSV+YVSFGSMNKFP  QL+EIAHALEDSG
Sbjct: 263 AGRGHGKEEDEEEGVLKWLDSKKYDSVLYVSFGSMNKFPTPQLVEIAHALEDSG 316


>dbj|GAU30423.1| hypothetical protein TSUD_364690 [Trifolium subterraneum]
          Length = 501

 Score =  370 bits (950), Expect = e-124
 Identities = 181/235 (77%), Positives = 202/235 (85%), Gaps = 1/235 (0%)
 Frame = -1

Query: 702 LPLGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMNPDFLVTDMFYPWSVDIATELG 523
           LP GMESFNADTP +IRSKIY             FRDM PDF+VTDMFYPWSVDIA EL 
Sbjct: 83  LPQGMESFNADTPNEIRSKIYQGLMVLQEQFKQLFRDMKPDFIVTDMFYPWSVDIADELR 142

Query: 522 IPRLICNSGSYFAHSAMNSIQQFSPHANADSNDESFLLLGLPHKVEMTRLQLPDWLRSPN 343
           IPRLIC SGSYFAHSAMNSI+ F+PHA  +SN ESFLL GLPHKVEMTRLQLPDWLR+PN
Sbjct: 143 IPRLICISGSYFAHSAMNSIEVFAPHAKVNSNSESFLLPGLPHKVEMTRLQLPDWLRAPN 202

Query: 342 DYTNLMKMIKDSERKSYGSLFDSFYELEGTYEEHYKIAMGTKSWGLGPVSFWVNQDDSDK 163
           DYT LMKMIK+SERKSYGSLFDS++E+EGTYE+HYK AMGTKSWG+GPVS WVNQ++SDK
Sbjct: 203 DYTYLMKMIKESERKSYGSLFDSYHEIEGTYEDHYKTAMGTKSWGVGPVSLWVNQNNSDK 262

Query: 162 ANRGHAKEKE-EENGVLKWLDSKEDDSVVYVSFGSMNKFPISQLIEIAHALEDSG 1
           A+RGH  E++ EE+ VLKWLDSKE+DSV+YVSFGSMNKFP  QL+EIAHALEDSG
Sbjct: 263 ASRGHRIEQDAEEDEVLKWLDSKEEDSVLYVSFGSMNKFPSPQLVEIAHALEDSG 317


>ref|XP_013466939.1| UDP-glucosyltransferase family protein [Medicago truncatula]
 gb|KEH40975.1| UDP-glucosyltransferase family protein [Medicago truncatula]
          Length = 503

 Score =  367 bits (943), Expect = e-123
 Identities = 181/234 (77%), Positives = 194/234 (82%)
 Frame = -1

Query: 702 LPLGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMNPDFLVTDMFYPWSVDIATELG 523
           L  GMESFNADTP +IRSKIY             FRDM PDF+VTDMFYPWSVD+A ELG
Sbjct: 83  LARGMESFNADTPNEIRSKIYQGLIILQEQFKQQFRDMKPDFIVTDMFYPWSVDVADELG 142

Query: 522 IPRLICNSGSYFAHSAMNSIQQFSPHANADSNDESFLLLGLPHKVEMTRLQLPDWLRSPN 343
           IPRLIC SGSYFAHSAMNSI+ FSP A    N ESFLL GLPHKVEM RLQLPDWLR+PN
Sbjct: 143 IPRLICISGSYFAHSAMNSIEHFSPQAKVKLNSESFLLPGLPHKVEMKRLQLPDWLRAPN 202

Query: 342 DYTNLMKMIKDSERKSYGSLFDSFYELEGTYEEHYKIAMGTKSWGLGPVSFWVNQDDSDK 163
           DYT LMKMIKDSERKSYGSLFDS +E+E TYEEHYK AMGTKSW LGPVS WVNQDDSDK
Sbjct: 203 DYTYLMKMIKDSERKSYGSLFDS-HEIESTYEEHYKTAMGTKSWSLGPVSLWVNQDDSDK 261

Query: 162 ANRGHAKEKEEENGVLKWLDSKEDDSVVYVSFGSMNKFPISQLIEIAHALEDSG 1
           A RGH KE++E+ GVLKWLDSK+DDSV+YVSFGSMNKFP  QL+EIAHALE SG
Sbjct: 262 AGRGHGKEEDEDEGVLKWLDSKKDDSVLYVSFGSMNKFPTPQLVEIAHALEHSG 315


>dbj|GAU30424.1| hypothetical protein TSUD_364700, partial [Trifolium subterraneum]
          Length = 300

 Score =  348 bits (894), Expect = e-118
 Identities = 169/218 (77%), Positives = 186/218 (85%), Gaps = 2/218 (0%)
 Frame = -1

Query: 702 LPLGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMNPDFLVTDMFYPWSVDIATELG 523
           LP GMESFNADTP +IRSKIY             FRDM PDF+VTDMFYPWSVDIA ELG
Sbjct: 83  LPQGMESFNADTPNEIRSKIYQGLMVLQEQFKQLFRDMKPDFIVTDMFYPWSVDIADELG 142

Query: 522 IPRLICNSGSYFAHSAMNSIQQFSPHANADSNDESFLLLGLPHKVEMTRLQLPDWLRSPN 343
           IPRLIC  GSYFAHSAMNSI+ F+PH   +SN E+FLL GLPHKVEMTRLQLPDWLR+PN
Sbjct: 143 IPRLICIGGSYFAHSAMNSIEVFAPHEKVNSNSETFLLPGLPHKVEMTRLQLPDWLRAPN 202

Query: 342 DYTNLMKMIKDSERKSYGSLFDSFYELEGTYEEHYKIAMGTKSWGLGPVSFWVNQDDSDK 163
           +YT LMKMIK+SERKSYGSLFDS+YE+EGTYE+HYK AMGTKSWG+GPVS WVNQDDSDK
Sbjct: 203 NYTYLMKMIKESERKSYGSLFDSYYEIEGTYEDHYKTAMGTKSWGVGPVSLWVNQDDSDK 262

Query: 162 ANRGHAKE--KEEENGVLKWLDSKEDDSVVYVSFGSMN 55
           A RG+ KE  +EEE+GVLKWLDSKEDDSV+YVSFGSMN
Sbjct: 263 AGRGNGKEEDEEEEDGVLKWLDSKEDDSVLYVSFGSMN 300


>dbj|GAU42924.1| hypothetical protein TSUD_283470 [Trifolium subterraneum]
          Length = 497

 Score =  345 bits (886), Expect = e-114
 Identities = 172/235 (73%), Positives = 192/235 (81%), Gaps = 1/235 (0%)
 Frame = -1

Query: 702 LPLGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMNPDFLVTDMFYPWSVDIATELG 523
           LPLG+ES +A+TP+DI SKIY             FRDM PDF+VTDMFYPWSVD A ELG
Sbjct: 80  LPLGLESVDAETPKDISSKIYQGLFLLKDNFQQLFRDMKPDFIVTDMFYPWSVDTAAELG 139

Query: 522 IPRLICNSGSYFAHSAMNSIQQFSPHANADSNDESFLLLGLPHKVEMTRLQLPDWLRS-P 346
           IPRL C  GSYF+H+A NSI+QF+PH N  S+ ESFLL GLPHKVEMTR QL DW+    
Sbjct: 140 IPRLNCTGGSYFSHAARNSIEQFAPHVNVGSDYESFLLPGLPHKVEMTRSQLSDWVNERS 199

Query: 345 NDYTNLMKMIKDSERKSYGSLFDSFYELEGTYEEHYKIAMGTKSWGLGPVSFWVNQDDSD 166
           ND+ N+MKMIKD++R+SYGSLF SFYELEGTYEEHY+   GT+SW LGPVS WVNQDD D
Sbjct: 200 NDFGNIMKMIKDADRRSYGSLFRSFYELEGTYEEHYQRVTGTRSWSLGPVSLWVNQDDFD 259

Query: 165 KANRGHAKEKEEENGVLKWLDSKEDDSVVYVSFGSMNKFPISQLIEIAHALEDSG 1
           KANRG+AKEK EENGVLKWLDSKED+SVVYVSFGSMNKFPISQ IEIAHALEDSG
Sbjct: 260 KANRGNAKEK-EENGVLKWLDSKEDNSVVYVSFGSMNKFPISQHIEIAHALEDSG 313


>dbj|GAU12394.1| hypothetical protein TSUD_253450 [Trifolium subterraneum]
          Length = 502

 Score =  342 bits (877), Expect = e-113
 Identities = 172/236 (72%), Positives = 191/236 (80%), Gaps = 2/236 (0%)
 Frame = -1

Query: 702 LPLGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMNPDFLVTDMFYPWSVDIATELG 523
           LP G+ES +ADTPQD+ SKIY             +  M PDF+VTDMFYPWSVDIA ELG
Sbjct: 76  LPHGLESLDADTPQDMSSKIYQGLFLLKENFQQLY--MKPDFIVTDMFYPWSVDIAAELG 133

Query: 522 IPRLICNSGSYFAHSAMNSIQQFSPHANADSNDESFLLLGLPHKVEMTRLQLPDWLRSPN 343
           IPRL C  GSYF+H+A NSI+QFSPH N  S+ ESFLL GLPHKVEMTR QL DW++ PN
Sbjct: 134 IPRLNCTGGSYFSHAARNSIEQFSPHVNVGSDHESFLLPGLPHKVEMTRSQLSDWVKEPN 193

Query: 342 DYTNLMKMIKDSERKSYGSLFDSFYELEGTYEEHYKIAMGTKSWGLGPVSFWVNQDDSDK 163
           D+ +LMKMI D++RKSYGSLF SFYE+EGTYEEHY+   GT+SW LGPVS WVNQDD DK
Sbjct: 194 DFGDLMKMIGDADRKSYGSLFRSFYEMEGTYEEHYQRVTGTRSWSLGPVSLWVNQDDFDK 253

Query: 162 ANRGHAKEK-EEENGVLKWLDSKEDD-SVVYVSFGSMNKFPISQLIEIAHALEDSG 1
           ANRG AKEK EEENGVLKWLDSKE+D SVVYVSFGSMNKFPISQ IEIAHALEDSG
Sbjct: 254 ANRGRAKEKEEEENGVLKWLDSKEEDNSVVYVSFGSMNKFPISQHIEIAHALEDSG 309


>gb|PNX59314.1| UDP-glycosyltransferase 73B3-like protein, partial [Trifolium
           pratense]
          Length = 292

 Score =  330 bits (845), Expect = e-111
 Identities = 159/210 (75%), Positives = 179/210 (85%), Gaps = 1/210 (0%)
 Frame = -1

Query: 702 LPLGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMNPDFLVTDMFYPWSVDIATELG 523
           LP GMESFNADTP +IRSKIY             FRDM PDF+VTDMFYPW+VDIA ELG
Sbjct: 83  LPQGMESFNADTPNEIRSKIYQGLMVLQEQFKQLFRDMKPDFIVTDMFYPWTVDIADELG 142

Query: 522 IPRLICNSGSYFAHSAMNSIQQFSPHANADSNDESFLLLGLPHKVEMTRLQLPDWLRSPN 343
           IPRLIC SGSYFAHSAMNSI+ F+PHA   SN ESFLL GLPHKVEMTRLQLPDWLR+PN
Sbjct: 143 IPRLICISGSYFAHSAMNSIEVFAPHAKVKSNSESFLLPGLPHKVEMTRLQLPDWLRAPN 202

Query: 342 DYTNLMKMIKDSERKSYGSLFDSFYELEGTYEEHYKIAMGTKSWGLGPVSFWVNQDDSDK 163
           DYT LMKMIK+SERKSYGSLFDS++E+EGTYE+HYK AMGTKSWG+GP+S WVNQD+SDK
Sbjct: 203 DYTYLMKMIKESERKSYGSLFDSYHEIEGTYEDHYKTAMGTKSWGVGPISLWVNQDNSDK 262

Query: 162 ANRGHAKEKE-EENGVLKWLDSKEDDSVVY 76
           A+RGH  E++ EE+ VLKWLDSKE+DSV+Y
Sbjct: 263 ASRGHRIEQDAEEDEVLKWLDSKEEDSVLY 292


>dbj|GAU51965.1| hypothetical protein TSUD_417550 [Trifolium subterraneum]
          Length = 512

 Score =  336 bits (862), Expect = e-110
 Identities = 165/235 (70%), Positives = 184/235 (78%), Gaps = 1/235 (0%)
 Frame = -1

Query: 702 LPLGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMNPDFLVTDMFYPWSVDIATELG 523
           LP G+E  +ADTPQD     Y              RDM PDF+VTDMFYPWSVDIA ELG
Sbjct: 80  LPHGLEIIDADTPQDSSKLFYQGLLLLQENFQQIIRDMKPDFIVTDMFYPWSVDIAAELG 139

Query: 522 IPRLICNSGSYFAHSAMNSIQQFSPHANADSNDESFLLLGLPHKVEMTRLQLPDWLRSPN 343
           IPRL CN GSYF+H+A NS +QF+PH N  S+DE+F L GLPHK+EMTR QL DW++ PN
Sbjct: 140 IPRLNCNGGSYFSHAARNSTEQFAPHVNVSSDDETFSLPGLPHKIEMTRSQLSDWVKEPN 199

Query: 342 -DYTNLMKMIKDSERKSYGSLFDSFYELEGTYEEHYKIAMGTKSWGLGPVSFWVNQDDSD 166
            ++   MKMI D++RKSYGSLF SFYELEGTYEEHY+   GT+SW LGPVS WVNQDD D
Sbjct: 200 NEFGYWMKMIIDADRKSYGSLFRSFYELEGTYEEHYQRVTGTRSWSLGPVSLWVNQDDFD 259

Query: 165 KANRGHAKEKEEENGVLKWLDSKEDDSVVYVSFGSMNKFPISQLIEIAHALEDSG 1
           KANRG AKEKEEENGVLKWLDSKED+SVVYVSFGSMNKF ISQ IEIAHALEDSG
Sbjct: 260 KANRGCAKEKEEENGVLKWLDSKEDNSVVYVSFGSMNKFSISQQIEIAHALEDSG 314


>ref|XP_020231152.1| soyasapogenol B glucuronide galactosyltransferase-like [Cajanus
           cajan]
 gb|KYP51621.1| Anthocyanin 3'-O-beta-glucosyltransferase [Cajanus cajan]
          Length = 509

 Score =  336 bits (861), Expect = e-110
 Identities = 161/238 (67%), Positives = 190/238 (79%), Gaps = 4/238 (1%)
 Frame = -1

Query: 702 LPLGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMNPDFLVTDMFYPWSVDIATELG 523
           LP G+ESFN++TPQD+  K+Y             F DM PDFLVTDMFYPW+VD A +LG
Sbjct: 85  LPQGVESFNSNTPQDMVKKVYEGLSILKDQYQQLFHDMQPDFLVTDMFYPWTVDAAAKLG 144

Query: 522 IPRLICNSGSYFAHSAMNSIQQFSPHANADSNDESFLLLGLPHKVEMTRLQLPDWLRSPN 343
           IPRLI   G YFAHSA N+I+QFSPH   DS+ E FL+ GLPH++EMTRLQ+PDWLR P 
Sbjct: 145 IPRLIYVGGGYFAHSAQNAIEQFSPHTKVDSDSERFLIPGLPHELEMTRLQIPDWLREPK 204

Query: 342 DYTNLMKMIKDSERKSYGSLFDSFYELEGTYEEHYKIAMGTKSWGLGPVSFWVNQDDSDK 163
           DY++LMK++KDSER+SYGSLF++FYELEGTYEEHYK AMG KSW +GPVSFWVNQD SDK
Sbjct: 205 DYSDLMKIMKDSERRSYGSLFNTFYELEGTYEEHYKKAMGVKSWSVGPVSFWVNQDASDK 264

Query: 162 ANRGHAKEKEE----ENGVLKWLDSKEDDSVVYVSFGSMNKFPISQLIEIAHALEDSG 1
           A+RGHAKE++E      G L WLDSK ++SV+YVSFGSMNKFP  QL+EIAHALEDSG
Sbjct: 265 ADRGHAKEEQEGEGGGEGWLTWLDSKTENSVLYVSFGSMNKFPTPQLVEIAHALEDSG 322


>dbj|GAU11951.1| hypothetical protein TSUD_195750 [Trifolium subterraneum]
          Length = 476

 Score =  332 bits (850), Expect = e-109
 Identities = 157/200 (78%), Positives = 178/200 (89%), Gaps = 1/200 (0%)
 Frame = -1

Query: 597 RDMNPDFLVTDMFYPWSVDIATELGIPRLICNSGSYFAHSAMNSIQQFSPHANADSNDES 418
           R+M PDF+VT MFYPW+VDIA ELGIPR IC  GSYFAHSAMNSI+ F+PH   +SN ES
Sbjct: 93  REMKPDFIVTYMFYPWTVDIADELGIPRFICIGGSYFAHSAMNSIEVFAPHEKVNSNSES 152

Query: 417 FLLLGLPHKVEMTRLQLPDWLRSPNDYTNLMKMIKDSERKSYGSLFDSFYELEGTYEEHY 238
           FLL GLPHKVEMTRLQLPDWLR+PN+YT LMKMIK+SERKSYGSLFDS+YE+EGTYE+HY
Sbjct: 153 FLLPGLPHKVEMTRLQLPDWLRAPNNYTYLMKMIKESERKSYGSLFDSYYEIEGTYEDHY 212

Query: 237 KIAMGTKSWGLGPVSFWVNQDDSDKANRGHAKEKEE-ENGVLKWLDSKEDDSVVYVSFGS 61
           K AMGTKSWG+GPVS WVNQDDSDKA RG+ K+++E E+GVLKWLDSKE+DSV+YVSFGS
Sbjct: 213 KTAMGTKSWGVGPVSLWVNQDDSDKAGRGNGKKQDEKEDGVLKWLDSKEEDSVLYVSFGS 272

Query: 60  MNKFPISQLIEIAHALEDSG 1
           M KFP  QL+EIA ALEDSG
Sbjct: 273 MTKFPSPQLVEIAQALEDSG 292


>gb|KHN10128.1| UDP-glucose flavonoid 3-O-glucosyltransferase 7 [Glycine soja]
          Length = 498

 Score =  324 bits (831), Expect = e-106
 Identities = 159/235 (67%), Positives = 185/235 (78%), Gaps = 2/235 (0%)
 Frame = -1

Query: 702 LPLGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMNPDFLVTDMFYPWSVDIATELG 523
           LP G+ESFN++TP+D+  KIY             F D+ PDFL TDMFYPW+VD A +LG
Sbjct: 81  LPEGVESFNSNTPRDLVPKIYQGLTILQDQYQQLFHDLQPDFLFTDMFYPWTVDAAAKLG 140

Query: 522 IPRLICNSGSYFAHSAMNSIQQFSPHANADSNDESFLLLGLPHKVEMTRLQLPDWLRSPN 343
           IPRLI  SG Y AHS+ N+I+QFSPH   DS+ ESFLL GLPH+++MTRLQLPDWLR+P 
Sbjct: 141 IPRLIYVSGGYLAHSSQNTIEQFSPHTKVDSDTESFLLPGLPHELKMTRLQLPDWLRAPT 200

Query: 342 DYTNLMKMIKDSERKSYGSLFDSFYELEGTYEEHYKIAMGTKSWGLGPVSFWVNQDDSDK 163
            YT LM M+KDSERKSYGSL ++FYELEG YEEHYK AMGTKSW +GPVSFWVNQD  DK
Sbjct: 201 GYTYLMNMMKDSERKSYGSLLNTFYELEGDYEEHYKKAMGTKSWSVGPVSFWVNQDALDK 260

Query: 162 ANRGHAKEK--EEENGVLKWLDSKEDDSVVYVSFGSMNKFPISQLIEIAHALEDS 4
           A+RGHAKE+  E E G L WLDSK ++SV+YVSFGSMNKFP  QL+EIAHALEDS
Sbjct: 261 ADRGHAKEEQGEGEEGWLTWLDSKTENSVLYVSFGSMNKFPTPQLVEIAHALEDS 315


>ref|XP_003546674.1| PREDICTED: soyasapogenol B glucuronide galactosyltransferase-like
           [Glycine max]
 gb|KRH13189.1| hypothetical protein GLYMA_15G221300 [Glycine max]
          Length = 501

 Score =  324 bits (831), Expect = e-106
 Identities = 159/235 (67%), Positives = 185/235 (78%), Gaps = 2/235 (0%)
 Frame = -1

Query: 702 LPLGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMNPDFLVTDMFYPWSVDIATELG 523
           LP G+ESFN++TP+D+  KIY             F D+ PDFL TDMFYPW+VD A +LG
Sbjct: 84  LPEGVESFNSNTPRDLVPKIYQGLTILQDQYQQLFHDLQPDFLFTDMFYPWTVDAAAKLG 143

Query: 522 IPRLICNSGSYFAHSAMNSIQQFSPHANADSNDESFLLLGLPHKVEMTRLQLPDWLRSPN 343
           IPRLI  SG Y AHS+ N+I+QFSPH   DS+ ESFLL GLPH+++MTRLQLPDWLR+P 
Sbjct: 144 IPRLIYVSGGYLAHSSQNTIEQFSPHTKVDSDTESFLLPGLPHELKMTRLQLPDWLRAPT 203

Query: 342 DYTNLMKMIKDSERKSYGSLFDSFYELEGTYEEHYKIAMGTKSWGLGPVSFWVNQDDSDK 163
            YT LM M+KDSERKSYGSL ++FYELEG YEEHYK AMGTKSW +GPVSFWVNQD  DK
Sbjct: 204 GYTYLMNMMKDSERKSYGSLLNTFYELEGDYEEHYKKAMGTKSWSVGPVSFWVNQDALDK 263

Query: 162 ANRGHAKEK--EEENGVLKWLDSKEDDSVVYVSFGSMNKFPISQLIEIAHALEDS 4
           A+RGHAKE+  E E G L WLDSK ++SV+YVSFGSMNKFP  QL+EIAHALEDS
Sbjct: 264 ADRGHAKEEQGEGEEGWLTWLDSKTENSVLYVSFGSMNKFPTPQLVEIAHALEDS 318


>dbj|GAU42521.1| hypothetical protein TSUD_376490 [Trifolium subterraneum]
          Length = 458

 Score =  320 bits (819), Expect = e-105
 Identities = 163/236 (69%), Positives = 184/236 (77%), Gaps = 2/236 (0%)
 Frame = -1

Query: 702 LPLGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMNPDFLVTDMFYPWSVDIATELG 523
           LP G+E+ +ADTPQD+ SKIY              RDM PDF+VTDMFYPWSVDIA ELG
Sbjct: 50  LPHGLENVDADTPQDMNSKIYQGLLLLKDDFQQLIRDMKPDFIVTDMFYPWSVDIAAELG 109

Query: 522 IPRLICNSGSYFAHSAMNSIQQFSPHANADSNDESFLLLGLPHKVEMTRLQLPDWLRSPN 343
           IPRL C  GSYF+H+A NSI+QF+PH N  S+DESFLL GLPHKVEMTR QL DW++ PN
Sbjct: 110 IPRLNCTGGSYFSHAARNSIEQFAPHVNVGSDDESFLLPGLPHKVEMTRSQLSDWVKDPN 169

Query: 342 -DYTNLMKMIKDSERKSYGSLFDSFYELEGTYEEHYKIAMGTKSWGLGPVSFWVNQDDSD 166
            ++   MK+IKD++RKSYGSLF SFYELE           GT+SW LGPVS WVNQDD D
Sbjct: 170 LEFGYWMKVIKDADRKSYGSLFRSFYELE-----------GTRSWSLGPVSLWVNQDDFD 218

Query: 165 KANRGHAKEKEEENGVLKWLDSKEDDSVVYVSFGS-MNKFPISQLIEIAHALEDSG 1
           KANRG AKEK+EE+GVLKWLDSKED+SVVYVSFGS MNKFPISQ IEIAHALEDSG
Sbjct: 219 KANRGCAKEKKEEHGVLKWLDSKEDNSVVYVSFGSMMNKFPISQHIEIAHALEDSG 274


>dbj|GAU26052.1| hypothetical protein TSUD_225140 [Trifolium subterraneum]
          Length = 493

 Score =  312 bits (800), Expect = e-101
 Identities = 148/238 (62%), Positives = 187/238 (78%), Gaps = 4/238 (1%)
 Frame = -1

Query: 702 LPLGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMNPDFLVTDMFYPWSVDIATELG 523
           LP+G+E+FN +TP+++  K+Y             F  + PDF+VTDMF+PWS D+A +LG
Sbjct: 79  LPIGIEAFNVNTPKEMIPKVYMGLYILQPEIESLFETLQPDFIVTDMFFPWSADVAKKLG 138

Query: 522 IPRLICNSGSYFAHSAMNSIQQFSPHANADSNDESFLLLGLPHKVEMTRLQLPDWLRSPN 343
           IPR++ +  SY A SA +S++QF+PH N +S+ E F++  LP K+EMTRLQLPDWLRSPN
Sbjct: 139 IPRIMFHGASYLARSAAHSVEQFAPHLNVESDTEKFVIPDLPDKLEMTRLQLPDWLRSPN 198

Query: 342 DYTNLMKMIKDSERKSYGSLFDSFYELEGTYEEHYKIAMGTKSWGLGPVSFWVNQDDSDK 163
            YT LMK+IK+SERKS+GS+F+SFYELE  Y +HYK AMGTKSWGLGPVS WVNQDDSDK
Sbjct: 199 QYTELMKVIKESERKSFGSVFNSFYELESDYYDHYKKAMGTKSWGLGPVSLWVNQDDSDK 258

Query: 162 ANRGHAKEKE----EENGVLKWLDSKEDDSVVYVSFGSMNKFPISQLIEIAHALEDSG 1
           A RG+AK+++    EE G LKWL+SK + SV+YVSFGSMNKFP SQL+EIAHALEDSG
Sbjct: 259 AARGYAKKEQEGEKEEEGWLKWLNSKPESSVLYVSFGSMNKFPYSQLVEIAHALEDSG 316


>gb|PNY00105.1| anthocyanin 3'-O-beta-glucosyltransferase [Trifolium pratense]
          Length = 465

 Score =  310 bits (793), Expect = e-101
 Identities = 147/237 (62%), Positives = 185/237 (78%), Gaps = 3/237 (1%)
 Frame = -1

Query: 702 LPLGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMNPDFLVTDMFYPWSVDIATELG 523
           LP+G+E+FN +TP+++  K+Y             F  + PDF+VTDMF+PWS D A +LG
Sbjct: 52  LPIGIEAFNVNTPKEMIPKVYMGLYILQPEIESLFETLQPDFIVTDMFFPWSADAAKKLG 111

Query: 522 IPRLICNSGSYFAHSAMNSIQQFSPHANADSNDESFLLLGLPHKVEMTRLQLPDWLRSPN 343
           IPR++ +  SY A SA +S++QF+PH N +S+ E F++  LP K+EMTRLQLPDWLRSPN
Sbjct: 112 IPRIMFHGASYLARSAAHSVEQFAPHLNVESDTEKFVIPDLPDKLEMTRLQLPDWLRSPN 171

Query: 342 DYTNLMKMIKDSERKSYGSLFDSFYELEGTYEEHYKIAMGTKSWGLGPVSFWVNQDDSDK 163
            YT LMK+IK+SERKS+GS+F+SFYELE  Y +HYK  MGTKSWGLGPVS WVNQDDSDK
Sbjct: 172 QYTELMKVIKESERKSFGSVFNSFYELESDYYDHYKKVMGTKSWGLGPVSLWVNQDDSDK 231

Query: 162 ANRGHAKEKE---EENGVLKWLDSKEDDSVVYVSFGSMNKFPISQLIEIAHALEDSG 1
           A RG+AK+++   EE G LKWL+SK + SV+YVSFGSMNKFP SQL+EIAHALEDSG
Sbjct: 232 AARGYAKKEQGEKEEEGWLKWLNSKPESSVLYVSFGSMNKFPYSQLVEIAHALEDSG 288


>ref|XP_007142833.1| hypothetical protein PHAVU_007G020800g [Phaseolus vulgaris]
 gb|ESW14827.1| hypothetical protein PHAVU_007G020800g [Phaseolus vulgaris]
          Length = 494

 Score =  309 bits (791), Expect = e-100
 Identities = 149/235 (63%), Positives = 179/235 (76%), Gaps = 1/235 (0%)
 Frame = -1

Query: 702 LPLGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMNPDFLVTDMFYPWSVDIATELG 523
           LP G+E+ N+DTP  +  KI              FR M PDF+VTDMFYPWS D A ELG
Sbjct: 82  LPEGVETINSDTPPPLTMKIGEALSILQGQYQQLFRLMQPDFIVTDMFYPWSADAAAELG 141

Query: 522 IPRLICNSGSYFAHSAMNSIQQFSPHANADSNDESFLLLGLPHKVEMTRLQLPDWLRSPN 343
           IPRL+    SYF+H AMN +++F+PH   DS+ ESF L GLPHK+EMTRLQLPDWLR+P 
Sbjct: 142 IPRLVYVGASYFSHCAMNCVEEFAPHDKVDSDGESFELPGLPHKLEMTRLQLPDWLRAPK 201

Query: 342 DYTNLMKMIKDSERKSYGSLFDSFYELEGTYEEHYKIAMGTKSWGLGPVSFWVNQDDSDK 163
            YT L KM+K+SE+KSYGS+F SFYE EG YEEHYK  MGTKSW +GPVS WVNQD+SDK
Sbjct: 202 PYTYLKKMMKESEKKSYGSVFKSFYEFEGAYEEHYKRVMGTKSWSIGPVSLWVNQDESDK 261

Query: 162 ANRGHAKE-KEEENGVLKWLDSKEDDSVVYVSFGSMNKFPISQLIEIAHALEDSG 1
           A RG AKE K  +  +++WLDSK+++SV+YVSFGSMNKFP +QL+EIAHALEDSG
Sbjct: 262 AGRGQAKEGKGTDEELIRWLDSKKENSVLYVSFGSMNKFPTTQLVEIAHALEDSG 316


>ref|XP_014512855.1| soyasapogenol B glucuronide galactosyltransferase [Vigna radiata
           var. radiata]
          Length = 493

 Score =  308 bits (789), Expect = e-100
 Identities = 150/237 (63%), Positives = 177/237 (74%), Gaps = 3/237 (1%)
 Frame = -1

Query: 702 LPLGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMNPDFLVTDMFYPWSVDIATELG 523
           LP G+E+ NADTP  +  KI              FR M PDF+VTDMFYPWS D A ELG
Sbjct: 82  LPEGIETINADTPPLLTMKISEALSILQGQYQELFRVMKPDFIVTDMFYPWSADAAAELG 141

Query: 522 IPRLICNSGSYFAHSAMNSIQQFSPHANADSNDESFLLLGLPHKVEMTRLQLPDWLRSPN 343
           IPRL+    SYF+H AMN +++F+PHA  DS+ ESF L GLPHK+EMTR QLPDWLR+P 
Sbjct: 142 IPRLVYVGASYFSHCAMNCVEEFAPHAKVDSDGESFELPGLPHKLEMTRSQLPDWLRAPK 201

Query: 342 DYTNLMKMIKDSERKSYGSLFDSFYELEGTYEEHYKIAMGTKSWGLGPVSFWVNQDDSDK 163
            YT L KMIK+SE+KSYGSLF SFYE EG YEEHYK  MGTKSW +GPVS WVNQD+ DK
Sbjct: 202 PYTYLKKMIKESEKKSYGSLFKSFYEFEGAYEEHYKRVMGTKSWSIGPVSLWVNQDELDK 261

Query: 162 ANRGHAKE---KEEENGVLKWLDSKEDDSVVYVSFGSMNKFPISQLIEIAHALEDSG 1
           A RGHAKE   K     +++WLD+K+++SV+YVSFGSMNKFP +QL+EIAHALED G
Sbjct: 262 AGRGHAKEGEGKGTNEELMRWLDTKKENSVLYVSFGSMNKFPTAQLVEIAHALEDCG 318


>ref|XP_017413459.1| PREDICTED: soyasapogenol B glucuronide galactosyltransferase-like
           [Vigna angularis]
 gb|KOM36380.1| hypothetical protein LR48_Vigan02g253000 [Vigna angularis]
 dbj|BAT93707.1| hypothetical protein VIGAN_08023500 [Vigna angularis var.
           angularis]
 dbj|BAT93708.1| hypothetical protein VIGAN_08023600 [Vigna angularis var.
           angularis]
          Length = 494

 Score =  307 bits (786), Expect = 2e-99
 Identities = 149/237 (62%), Positives = 176/237 (74%), Gaps = 3/237 (1%)
 Frame = -1

Query: 702 LPLGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMNPDFLVTDMFYPWSVDIATELG 523
           LP G+E+ NADTP  +  KI              FR M PDF+VTDMFYPWS D A ELG
Sbjct: 82  LPEGVETINADTPPLLTMKISEGLSILQGQYQELFRVMKPDFIVTDMFYPWSADAAAELG 141

Query: 522 IPRLICNSGSYFAHSAMNSIQQFSPHANADSNDESFLLLGLPHKVEMTRLQLPDWLRSPN 343
           IPRL+     YF+H AMN ++QF+PHA  DS+ ESF L GLPHK+EMTR QLPDWLR+P 
Sbjct: 142 IPRLVYVGACYFSHCAMNCVEQFAPHAKVDSDGESFELPGLPHKLEMTRSQLPDWLRAPK 201

Query: 342 DYTNLMKMIKDSERKSYGSLFDSFYELEGTYEEHYKIAMGTKSWGLGPVSFWVNQDDSDK 163
            YT L KMIK+SE+KSYGSLF SFYE EG YEEHYK  MGTKSW +GPVS WVN+D+ DK
Sbjct: 202 PYTYLKKMIKESEKKSYGSLFKSFYEFEGAYEEHYKRVMGTKSWSIGPVSLWVNEDELDK 261

Query: 162 ANRGHAKE---KEEENGVLKWLDSKEDDSVVYVSFGSMNKFPISQLIEIAHALEDSG 1
           A RGHAKE   K  +  +++WLDSK+++ V+YVSFGSMNKFP +QL+EIAHALED G
Sbjct: 262 AGRGHAKEGEGKRTDEELMRWLDSKKENCVLYVSFGSMNKFPTAQLVEIAHALEDCG 318


>ref|NP_001304384.2| soyasapogenol B glucuronide galactosyltransferase [Glycine max]
 gb|ACJ61480.1| flavonoid glycosyltransferase [Glycine max]
 gb|KHN42101.1| UDP-glucose flavonoid 3-O-glucosyltransferase 7 [Glycine soja]
 gb|KRH28437.1| hypothetical protein GLYMA_11G053400 [Glycine max]
          Length = 495

 Score =  306 bits (784), Expect = 4e-99
 Identities = 144/234 (61%), Positives = 179/234 (76%)
 Frame = -1

Query: 702 LPLGMESFNADTPQDIRSKIYXXXXXXXXXXXXXFRDMNPDFLVTDMFYPWSVDIATELG 523
           LP+G+E+FN DTP+++  +IY             F D+ PDF+VTDMF+PWSVD A +LG
Sbjct: 78  LPVGIEAFNVDTPREMTPRIYMGLSLLQQVFEKLFHDLQPDFIVTDMFHPWSVDAAAKLG 137

Query: 522 IPRLICNSGSYFAHSAMNSIQQFSPHANADSNDESFLLLGLPHKVEMTRLQLPDWLRSPN 343
           IPR++ +  SY A SA +S++Q++PH  A  + + F+L GLP  +EMTRLQLPDWLRSPN
Sbjct: 138 IPRIMFHGASYLARSAAHSVEQYAPHLEAKFDTDKFVLPGLPDNLEMTRLQLPDWLRSPN 197

Query: 342 DYTNLMKMIKDSERKSYGSLFDSFYELEGTYEEHYKIAMGTKSWGLGPVSFWVNQDDSDK 163
            YT LM+ IK SE+KSYGSLF+SFY+LE  Y EHYK  MGTKSWG+GPVS W NQD  DK
Sbjct: 198 QYTELMRTIKQSEKKSYGSLFNSFYDLESAYYEHYKSIMGTKSWGIGPVSLWANQDAQDK 257

Query: 162 ANRGHAKEKEEENGVLKWLDSKEDDSVVYVSFGSMNKFPISQLIEIAHALEDSG 1
           A RG+AKE+EE+ G LKWL+SK + SV+YVSFGSMNKFP SQL+EIA ALEDSG
Sbjct: 258 AARGYAKEEEEKEGWLKWLNSKAESSVLYVSFGSMNKFPYSQLVEIARALEDSG 311


Top