BLASTX nr result

ID: Astragalus22_contig00034781 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00034781
         (1511 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAC78438.1| isoflavonoid glucosyltransferase [Glycyrrhiza ec...   566   0.0  
ref|XP_013441780.1| UDP-glucosyltransferase family protein [Medi...   559   0.0  
ref|XP_004508344.1| PREDICTED: UDP-glucose flavonoid 3-O-glucosy...   548   0.0  
ref|XP_013458218.1| UDP-glucosyltransferase family protein [Medi...   544   0.0  
gb|PNY08795.1| isoflavonoid glucosyltransferase [Trifolium prate...   539   0.0  
ref|XP_020982066.1| UDP-glucose flavonoid 3-O-glucosyltransferas...   517   e-177
gb|PNY11934.1| isoflavonoid glucosyltransferase [Trifolium prate...   516   e-177
ref|XP_016196645.1| UDP-glucose flavonoid 3-O-glucosyltransferas...   514   e-176
ref|XP_013458223.1| UDP-glucosyltransferase family protein [Medi...   514   e-176
ref|XP_007154186.1| hypothetical protein PHAVU_003G097200g [Phas...   513   e-176
gb|KRH02148.1| hypothetical protein GLYMA_17G019400 [Glycine max]     511   e-175
ref|XP_006600323.1| PREDICTED: scopoletin glucosyltransferase-li...   512   e-175
ref|XP_014521976.1| UDP-glucose flavonoid 3-O-glucosyltransferas...   510   e-175
ref|XP_004508345.1| PREDICTED: scopoletin glucosyltransferase [C...   510   e-175
ref|XP_020227520.1| scopoletin glucosyltransferase-like [Cajanus...   508   e-174
dbj|GAU11068.1| hypothetical protein TSUD_196920 [Trifolium subt...   506   e-173
ref|XP_017423634.1| PREDICTED: scopoletin glucosyltransferase-li...   505   e-173
dbj|GAU41610.1| hypothetical protein TSUD_196800 [Trifolium subt...   505   e-173
gb|PNY07429.1| isoflavonoid glucosyltransferase [Trifolium prate...   504   e-172
dbj|GAU41618.1| hypothetical protein TSUD_196880 [Trifolium subt...   504   e-172

>dbj|BAC78438.1| isoflavonoid glucosyltransferase [Glycyrrhiza echinata]
          Length = 482

 Score =  566 bits (1459), Expect = 0.0
 Identities = 277/414 (66%), Positives = 333/414 (80%)
 Frame = +3

Query: 3    FPSQEVGLQEGVENLSDIPDMDKLSRYTQAMTLLQPLIEEVVEHDPPDCIVADFMYPWVE 182
            FPSQEVGL +GVE+LS + D+D L++  QA TLL+  IE  VE +PPDCIVADF+Y WV+
Sbjct: 73   FPSQEVGLPDGVESLSSVTDLDNLAKVFQATTLLRTPIEHFVEENPPDCIVADFIYQWVD 132

Query: 183  ELANKLRIPRLAFTVFSLFAICAMKSVMASSVHGGSGSFLIQGLPHPIRLNATPPMNLRD 362
            ELANKL IPRLAF  FSLFAICA++SV A S++  SGSF+I GLPHPI +NA PP  + D
Sbjct: 133  ELANKLNIPRLAFNGFSLFAICAIESVKAHSLYA-SGSFVIPGLPHPIAMNAAPPKQMSD 191

Query: 363  VNMQEILDIQLKSHSLIVNNFSELDGEEYTEYYEKTINHKAWHIGPAYLIRKTAQEKAER 542
              ++ +L+ +LKSH LIVNNF+ELDGEEY E+YEKT  H+AWH+GP  LIR+T+QEKAER
Sbjct: 192  F-LESMLETELKSHGLIVNNFAELDGEEYIEHYEKTTGHRAWHLGPVSLIRRTSQEKAER 250

Query: 543  GQGSTVSVHECMSWLNSKRDNSVLYISFGTMSYLPNKQLYEIASALEACGHDFIWVVPXX 722
            G+ S VSVHEC+SWL+SKRD+SVLYI FG++ +  +KQLYEIA  +EA GH+FIWVVP  
Sbjct: 251  GEKSVVSVHECLSWLDSKRDDSVLYICFGSLCHFSDKQLYEIACGVEASGHEFIWVVPEK 310

Query: 723  XXXXXXXXXXXXNWLPKGFEERCIGKNKKGLLVKGWAPQLLILEHSAICAFLTHCGWNSI 902
                         W+PKGFEER     KKGL+++GWAPQ+LIL H A+ AF+THCGWNS 
Sbjct: 311  KGKEDESEEEKEKWMPKGFEER-----KKGLIMRGWAPQVLILSHRAVGAFVTHCGWNST 365

Query: 903  LEAVAAGVPMITWPVHGEQFFNEKLVTQVRGIGVEVGATEWSTLGIFDRESVVRRESIEK 1082
            +EAV+AGVPMITWPVHGEQF+NEKLVTQVRGIGVEVGA EWS +G  +RE VV RESIEK
Sbjct: 366  VEAVSAGVPMITWPVHGEQFYNEKLVTQVRGIGVEVGAEEWSAIGFGEREKVVCRESIEK 425

Query: 1083 AVRRLLDGGDEAKKIKQRAREFGDKAKRAAQEGGSSHRNLTTLIEDLKRLRDNK 1244
            AVRRL+DGGDEA+KI++RAREF DKA RA QEGGSSH NLT LI+DL+RLRD K
Sbjct: 426  AVRRLMDGGDEAEKIRRRAREFRDKATRAVQEGGSSHNNLTALIDDLRRLRDRK 479


>ref|XP_013441780.1| UDP-glucosyltransferase family protein [Medicago truncatula]
 gb|KEH15805.1| UDP-glucosyltransferase family protein [Medicago truncatula]
          Length = 489

 Score =  559 bits (1441), Expect = 0.0
 Identities = 276/414 (66%), Positives = 330/414 (79%)
 Frame = +3

Query: 3    FPSQEVGLQEGVENLSDIPDMDKLSRYTQAMTLLQPLIEEVVEHDPPDCIVADFMYPWVE 182
            FPS EVGL +GVE+LS   D++ L +  QA TLLQP I+  VE  PPDCIVADF++PWV+
Sbjct: 76   FPSHEVGLPDGVESLSAATDLENLRKIFQATTLLQPPIQHFVEQHPPDCIVADFLFPWVD 135

Query: 183  ELANKLRIPRLAFTVFSLFAICAMKSVMASSVHGGSGSFLIQGLPHPIRLNATPPMNLRD 362
            ELANKL+IPRL+F  FSLFAICA++SV A S++  S SF+I GLPH I +NA PP  + D
Sbjct: 136  ELANKLQIPRLSFNGFSLFAICAIESVKAHSLYE-SASFVIPGLPHSIAMNAAPPKQMSD 194

Query: 363  VNMQEILDIQLKSHSLIVNNFSELDGEEYTEYYEKTINHKAWHIGPAYLIRKTAQEKAER 542
            + ++ +L+   KS+ +IVNNFSELDGEEY E+YEKT  HKAWH+GPA LIR+T QEKA+R
Sbjct: 195  L-LEALLETVFKSNGIIVNNFSELDGEEYIEHYEKTTGHKAWHLGPASLIRRTVQEKAQR 253

Query: 543  GQGSTVSVHECMSWLNSKRDNSVLYISFGTMSYLPNKQLYEIASALEACGHDFIWVVPXX 722
            GQ S VSVHEC+SWL+SK DNSVLYI FG++   P+KQLYEIA  +EA GH FIWVVP  
Sbjct: 254  GQQSVVSVHECLSWLDSKPDNSVLYICFGSLCLFPDKQLYEIACGIEASGHKFIWVVPEK 313

Query: 723  XXXXXXXXXXXXNWLPKGFEERCIGKNKKGLLVKGWAPQLLILEHSAICAFLTHCGWNSI 902
                         WLPKGFEER IGK KKGL+++GWAPQ++IL H A+ AF+THCGWNS 
Sbjct: 314  KGKEDESEEEKGKWLPKGFEERNIGK-KKGLIIRGWAPQVMILSHKALGAFMTHCGWNST 372

Query: 903  LEAVAAGVPMITWPVHGEQFFNEKLVTQVRGIGVEVGATEWSTLGIFDRESVVRRESIEK 1082
            +EAV+AGVPMITWPVHGEQF+NEKL+TQVR IGV VGA EWS+ GI +RE VV R+SIEK
Sbjct: 373  VEAVSAGVPMITWPVHGEQFYNEKLITQVRRIGVVVGAAEWSSTGIGEREKVVGRDSIEK 432

Query: 1083 AVRRLLDGGDEAKKIKQRAREFGDKAKRAAQEGGSSHRNLTTLIEDLKRLRDNK 1244
            AVRRL+DGGDEA+KIK+ AREFGDKAK AAQEGGSSHRNLT +I+DLK LRD K
Sbjct: 433  AVRRLMDGGDEAEKIKKYAREFGDKAKHAAQEGGSSHRNLTAVIDDLKILRDRK 486


>ref|XP_004508344.1| PREDICTED: UDP-glucose flavonoid 3-O-glucosyltransferase 7 [Cicer
            arietinum]
 gb|AGU14072.1| UDP-glycosyltransferase [Cicer arietinum]
          Length = 483

 Score =  548 bits (1413), Expect = 0.0
 Identities = 267/412 (64%), Positives = 328/412 (79%)
 Frame = +3

Query: 3    FPSQEVGLQEGVENLSDIPDMDKLSRYTQAMTLLQPLIEEVVEHDPPDCIVADFMYPWVE 182
            FPSQEVGL +GVENLS   ++D L++   A TLL+P I+  VE  PPDCIVADF++PWV+
Sbjct: 69   FPSQEVGLPDGVENLSAATNLDNLTKIYHATTLLRPPIQHFVEQHPPDCIVADFLFPWVD 128

Query: 183  ELANKLRIPRLAFTVFSLFAICAMKSVMASSVHGGSGSFLIQGLPHPIRLNATPPMNLRD 362
            ELANKLRI RLAF  FSLFAICA++SV A+S H  S SFLI  LPHPI +NA PP  + +
Sbjct: 129  ELANKLRITRLAFNGFSLFAICAIESVKANS-HYDSASFLIHDLPHPISMNAAPPKKMHE 187

Query: 363  VNMQEILDIQLKSHSLIVNNFSELDGEEYTEYYEKTINHKAWHIGPAYLIRKTAQEKAER 542
            + +  + +   KS+ ++VNNF ELDGEEY ++YEKT  HKAWH+GPA LIRKT QEKAER
Sbjct: 188  L-LVTLFETVFKSNGIVVNNFVELDGEEYIKHYEKTTGHKAWHLGPASLIRKTDQEKAER 246

Query: 543  GQGSTVSVHECMSWLNSKRDNSVLYISFGTMSYLPNKQLYEIASALEACGHDFIWVVPXX 722
            G+ S VSVHEC+SWLNSKR NSV+YI FG++ + P+KQLYEIA  +EA G++F+WVVP  
Sbjct: 247  GEESVVSVHECLSWLNSKRVNSVIYIGFGSLCHFPDKQLYEIACGIEASGYEFVWVVPEK 306

Query: 723  XXXXXXXXXXXXNWLPKGFEERCIGKNKKGLLVKGWAPQLLILEHSAICAFLTHCGWNSI 902
                         WLPKGFEER    ++KGL+V+GWAPQ+LIL H A+ AF+THCGWNSI
Sbjct: 307  KGKEYESEEEKEKWLPKGFEER---NSEKGLIVRGWAPQVLILSHPAVGAFMTHCGWNSI 363

Query: 903  LEAVAAGVPMITWPVHGEQFFNEKLVTQVRGIGVEVGATEWSTLGIFDRESVVRRESIEK 1082
            +EAV+AG+PMITWPVHGEQF+NEKL+TQVRGIGVEVGA EWS  G  +RE +V  ESIEK
Sbjct: 364  VEAVSAGIPMITWPVHGEQFYNEKLITQVRGIGVEVGAGEWSNSGYGEREKLVSGESIEK 423

Query: 1083 AVRRLLDGGDEAKKIKQRAREFGDKAKRAAQEGGSSHRNLTTLIEDLKRLRD 1238
            A+RRL+DGGD+A++I++RAREFGDKAK AAQEGGSS++NLT LI+DLKRLRD
Sbjct: 424  ALRRLMDGGDDAQEIRRRAREFGDKAKEAAQEGGSSYKNLTVLIDDLKRLRD 475


>ref|XP_013458218.1| UDP-glucosyltransferase family protein [Medicago truncatula]
 gb|KEH32249.1| UDP-glucosyltransferase family protein [Medicago truncatula]
          Length = 484

 Score =  544 bits (1401), Expect = 0.0
 Identities = 266/414 (64%), Positives = 324/414 (78%)
 Frame = +3

Query: 3    FPSQEVGLQEGVENLSDIPDMDKLSRYTQAMTLLQPLIEEVVEHDPPDCIVADFMYPWVE 182
            FPSQ VGL +GVE L    D+D L++  +A TLL+P I+  VE  PPDCIVADFMYPWV 
Sbjct: 71   FPSQAVGLPDGVEPLFTTTDLDNLTKIYRAATLLRPTIQHFVEQHPPDCIVADFMYPWVH 130

Query: 183  ELANKLRIPRLAFTVFSLFAICAMKSVMASSVHGGSGSFLIQGLPHPIRLNATPPMNLRD 362
            ELANKL+IPRLAF  FSLFAICAM+SV A S++  S SF+I  LPH I +NA PP  L  
Sbjct: 131  ELANKLQIPRLAFNGFSLFAICAMESVKAHSLYE-SASFVIPHLPHSIAMNAAPPKQLSK 189

Query: 363  VNMQEILDIQLKSHSLIVNNFSELDGEEYTEYYEKTINHKAWHIGPAYLIRKTAQEKAER 542
            + ++ +L+   KS+ ++VNNF+ELDGEEY E+YEKT  HKAWH+GPA LIR+T QEKAER
Sbjct: 190  L-LEALLETVFKSNGILVNNFAELDGEEYIEHYEKTTCHKAWHLGPASLIRRTIQEKAER 248

Query: 543  GQGSTVSVHECMSWLNSKRDNSVLYISFGTMSYLPNKQLYEIASALEACGHDFIWVVPXX 722
            G+ S VSVHEC+SWLNSK+DNSV+YI FG+  +  +KQLYEIA  +EA  H+FIWVVP  
Sbjct: 249  GEESVVSVHECLSWLNSKQDNSVVYICFGSQCHFSDKQLYEIACGIEASSHEFIWVVPEK 308

Query: 723  XXXXXXXXXXXXNWLPKGFEERCIGKNKKGLLVKGWAPQLLILEHSAICAFLTHCGWNSI 902
                         WLPKGFEER IGK KK +++KGWAPQ++IL H+A+ AF+THCGWNS 
Sbjct: 309  KRTENDNEEEKEKWLPKGFEERIIGK-KKAMIIKGWAPQVMILSHTAVGAFMTHCGWNST 367

Query: 903  LEAVAAGVPMITWPVHGEQFFNEKLVTQVRGIGVEVGATEWSTLGIFDRESVVRRESIEK 1082
            +EAV+AGVPMITWP+HGEQF+NEKL+TQV GIGVEVGATEWST GI +RE VV R++IEK
Sbjct: 368  VEAVSAGVPMITWPMHGEQFYNEKLITQVHGIGVEVGATEWSTTGIGEREKVVWRDNIEK 427

Query: 1083 AVRRLLDGGDEAKKIKQRAREFGDKAKRAAQEGGSSHRNLTTLIEDLKRLRDNK 1244
             V+RL+D GDEA+KI+Q AREFG+KAK A +EGGSSH NLT ++  LKRLRDNK
Sbjct: 428  VVKRLMDSGDEAEKIRQHAREFGEKAKHAIKEGGSSHSNLTAVVNYLKRLRDNK 481


>gb|PNY08795.1| isoflavonoid glucosyltransferase [Trifolium pratense]
          Length = 482

 Score =  539 bits (1389), Expect = 0.0
 Identities = 261/414 (63%), Positives = 327/414 (78%)
 Frame = +3

Query: 3    FPSQEVGLQEGVENLSDIPDMDKLSRYTQAMTLLQPLIEEVVEHDPPDCIVADFMYPWVE 182
            FPSQEVGL +G+E+LS   + D L +  Q  TLLQP IE  +E +PPDCIVADF++PWV+
Sbjct: 69   FPSQEVGLPDGMESLSAATNPDNLVKIYQGTTLLQPPIEHFIEQNPPDCIVADFLFPWVD 128

Query: 183  ELANKLRIPRLAFTVFSLFAICAMKSVMASSVHGGSGSFLIQGLPHPIRLNATPPMNLRD 362
            ELANKL+IPRL+F  FS+FAICAM+SV A+S++  S S++I  LPH I +NATPP  + +
Sbjct: 129  ELANKLQIPRLSFNGFSIFAICAMESVKANSLYESS-SYVIPDLPHSIAMNATPPKQMTE 187

Query: 363  VNMQEILDIQLKSHSLIVNNFSELDGEEYTEYYEKTINHKAWHIGPAYLIRKTAQEKAER 542
            + + E+L+   KS  +I+NNF+ELDGEEY E+YEKT  HKAWH+GPA LIR+T Q+KAER
Sbjct: 188  L-LGELLETVFKSKGIIINNFNELDGEEYIEHYEKTTGHKAWHLGPASLIRRTIQQKAER 246

Query: 543  GQGSTVSVHECMSWLNSKRDNSVLYISFGTMSYLPNKQLYEIASALEACGHDFIWVVPXX 722
            G+ S VS+HEC+SWLNSK D SVLYI FG++ +  +KQLYEI+  +E  GH+F+WVVP  
Sbjct: 247  GEQSAVSLHECLSWLNSKPDKSVLYICFGSLCHFQDKQLYEISCGIENSGHEFVWVVPEK 306

Query: 723  XXXXXXXXXXXXNWLPKGFEERCIGKNKKGLLVKGWAPQLLILEHSAICAFLTHCGWNSI 902
                         WLPKGFEER IGK  KGL+V+GWAPQ++IL HSA+  F+THCGWNS 
Sbjct: 307  KGKENETDEDKEKWLPKGFEERNIGK--KGLIVRGWAPQVMILSHSAVGGFMTHCGWNST 364

Query: 903  LEAVAAGVPMITWPVHGEQFFNEKLVTQVRGIGVEVGATEWSTLGIFDRESVVRRESIEK 1082
            +EAV+AGVPMITWPVHGEQF+NEKL+TQVRGIGVEVGA EWST+G  +RE VV RE IEK
Sbjct: 365  VEAVSAGVPMITWPVHGEQFYNEKLITQVRGIGVEVGAAEWSTMGFGEREEVVGRECIEK 424

Query: 1083 AVRRLLDGGDEAKKIKQRAREFGDKAKRAAQEGGSSHRNLTTLIEDLKRLRDNK 1244
            AVRR++DGGDEA++I++ A+E GDKA  AAQEGGSS++NLT LI+DLKR RD K
Sbjct: 425  AVRRMMDGGDEAEEIRRCAQELGDKANLAAQEGGSSYKNLTALIDDLKRSRDRK 478


>ref|XP_020982066.1| UDP-glucose flavonoid 3-O-glucosyltransferase 7 [Arachis duranensis]
          Length = 479

 Score =  517 bits (1331), Expect = e-177
 Identities = 245/412 (59%), Positives = 319/412 (77%), Gaps = 1/412 (0%)
 Frame = +3

Query: 3    FPSQEVGLQEGVENLSDIPDMDKLSRYTQAMTLLQPLIEEVVEHDPPDCIVADFMYPWVE 182
            FPSQEVGL +G+E+LS + DM  L +  QA T+L+  IE+ V + PPDCIV DFM+PWV+
Sbjct: 66   FPSQEVGLPDGLESLSTVTDMVNLYKVYQATTMLRDPIEDFVGNHPPDCIVGDFMFPWVD 125

Query: 183  ELANKLRIPRLAFTVFSLFAICAMKSVMASSVH-GGSGSFLIQGLPHPIRLNATPPMNLR 359
            +LANKLRIPR AF  F LF +CA++S+ A  +    S  FL+ GLPHP+ L   PP N++
Sbjct: 126  DLANKLRIPRFAFNGFCLFTLCAIESLKAHPIPLDASPPFLLHGLPHPVTLKTAPPTNIK 185

Query: 360  DVNMQEILDIQLKSHSLIVNNFSELDGEEYTEYYEKTINHKAWHIGPAYLIRKTAQEKAE 539
            +V +  +++I+L+S+ LIVNNF+ELDGEEY +YYE+T  HKAWH+GPA L+ +T +EKA+
Sbjct: 186  EV-LDAMIEIELRSNGLIVNNFAELDGEEYIQYYERTTGHKAWHLGPACLLHRTVEEKAQ 244

Query: 540  RGQGSTVSVHECMSWLNSKRDNSVLYISFGTMSYLPNKQLYEIASALEACGHDFIWVVPX 719
            RGQ S +S H+CMSWL+SK+ NSV+YI FG++ Y P+ QLYEIA A+EA G +F+WVVP 
Sbjct: 245  RGQKSVLSAHKCMSWLDSKKQNSVVYICFGSLCYFPDNQLYEIACAVEASGCEFVWVVPE 304

Query: 720  XXXXXXXXXXXXXNWLPKGFEERCIGKNKKGLLVKGWAPQLLILEHSAICAFLTHCGWNS 899
                          WLPKGFEER    +KKG++++GWAPQ+LILEH A+ AF+THCGWNS
Sbjct: 305  KKGKENEEEEEKQKWLPKGFEER---NSKKGMIIRGWAPQVLILEHPAVGAFVTHCGWNS 361

Query: 900  ILEAVAAGVPMITWPVHGEQFFNEKLVTQVRGIGVEVGATEWSTLGIFDRESVVRRESIE 1079
             +EAV+AGVPMITWPVHGEQF+NEKLV+ VRGIGVEVGA EW T+G  +RE +V RE IE
Sbjct: 362  TVEAVSAGVPMITWPVHGEQFYNEKLVSDVRGIGVEVGADEWGTVGFGEREKLVGREDIE 421

Query: 1080 KAVRRLLDGGDEAKKIKQRAREFGDKAKRAAQEGGSSHRNLTTLIEDLKRLR 1235
            +A+RRL+DGGDEA++I+ R +EFG KA+ A QEGGSS+ NLT LI++LKRLR
Sbjct: 422  RALRRLMDGGDEAQEIRGRTKEFGKKARVAVQEGGSSYNNLTALIDELKRLR 473


>gb|PNY11934.1| isoflavonoid glucosyltransferase [Trifolium pratense]
          Length = 477

 Score =  516 bits (1328), Expect = e-177
 Identities = 259/414 (62%), Positives = 311/414 (75%)
 Frame = +3

Query: 3    FPSQEVGLQEGVENLSDIPDMDKLSRYTQAMTLLQPLIEEVVEHDPPDCIVADFMYPWVE 182
            FPS +VGL  GVENL  + ++D   R  QA  LLQ  I+  VE D PDCIVADFM+ WV+
Sbjct: 70   FPSHQVGLPPGVENLGSVNNLDNSYRVHQAAMLLQSPIQHFVEQDSPDCIVADFMFLWVD 129

Query: 183  ELANKLRIPRLAFTVFSLFAICAMKSVMASSVHGGSGSFLIQGLPHPIRLNATPPMNLRD 362
            ELANKL IPRLAF  FSLF ICAM+S+ A     G  S +I+GLPH I LNATPP  L  
Sbjct: 130  ELANKLHIPRLAFNGFSLFTICAMESLKAR----GFESSVIKGLPHCITLNATPPKALTK 185

Query: 363  VNMQEILDIQLKSHSLIVNNFSELDGEEYTEYYEKTINHKAWHIGPAYLIRKTAQEKAER 542
              M+ +L+ +LKS+ LIVNNF+ELDG+EY E+Y+KT  H+ WH+GP  LIR+T QEKAER
Sbjct: 186  F-MEPLLETELKSYGLIVNNFAELDGDEYIEHYKKTTGHRVWHLGPVSLIRRTTQEKAER 244

Query: 543  GQGSTVSVHECMSWLNSKRDNSVLYISFGTMSYLPNKQLYEIASALEACGHDFIWVVPXX 722
            GQ S VSVHECMSWLNSK+ NSVLYI FG++ +  NKQLYEIASA+EA  H FIWVVP  
Sbjct: 245  GQPSAVSVHECMSWLNSKQPNSVLYICFGSLCHFSNKQLYEIASAIEATNHQFIWVVPEK 304

Query: 723  XXXXXXXXXXXXNWLPKGFEERCIGKNKKGLLVKGWAPQLLILEHSAICAFLTHCGWNSI 902
                         WLPKGFEER      KG++++GWAPQ++IL H AI AFLTHCGWNS 
Sbjct: 305  KGKEDESNDANEKWLPKGFEER-----NKGMIIRGWAPQVVILGHPAIGAFLTHCGWNST 359

Query: 903  LEAVAAGVPMITWPVHGEQFFNEKLVTQVRGIGVEVGATEWSTLGIFDRESVVRRESIEK 1082
            +EAV+AGVPMITWPVH EQF+NEKL+TQVRGIGVEVGA EWS +G  +R+ +V R+ IEK
Sbjct: 360  VEAVSAGVPMITWPVHDEQFYNEKLITQVRGIGVEVGAEEWSIIGFMERKKLVGRDIIEK 419

Query: 1083 AVRRLLDGGDEAKKIKQRAREFGDKAKRAAQEGGSSHRNLTTLIEDLKRLRDNK 1244
            AVRRL+DGG EA +I+Q A+E+  KAKR+ QEGGSSH+NL  LI+DLKRL+D K
Sbjct: 420  AVRRLMDGGIEANEIRQCAKEYAIKAKRSVQEGGSSHKNLMALIDDLKRLKDYK 473


>ref|XP_016196645.1| UDP-glucose flavonoid 3-O-glucosyltransferase 7 [Arachis ipaensis]
          Length = 479

 Score =  514 bits (1324), Expect = e-176
 Identities = 244/412 (59%), Positives = 319/412 (77%), Gaps = 1/412 (0%)
 Frame = +3

Query: 3    FPSQEVGLQEGVENLSDIPDMDKLSRYTQAMTLLQPLIEEVVEHDPPDCIVADFMYPWVE 182
            FPSQEVGL +G+E+LS + DM  L +  QA T+L+  IE+ V + PPDCIV DFM+PWV+
Sbjct: 66   FPSQEVGLPDGLESLSTVTDMVNLYKVYQATTMLRDPIEDFVGNHPPDCIVGDFMFPWVD 125

Query: 183  ELANKLRIPRLAFTVFSLFAICAMKSVMASSVH-GGSGSFLIQGLPHPIRLNATPPMNLR 359
            +LANKLRIPR AF  F LF++CA++S+ A  +    S  FL+ GLPHP+ L   PP N++
Sbjct: 126  DLANKLRIPRFAFNGFCLFSLCAIESLKAHPIPLDASPPFLLHGLPHPVTLKTAPPTNIK 185

Query: 360  DVNMQEILDIQLKSHSLIVNNFSELDGEEYTEYYEKTINHKAWHIGPAYLIRKTAQEKAE 539
            +V +  +++I+L+S+ LIVNNF+ELDGEEY +YYE+T  HKAWH+GPA L+ +T +EKA+
Sbjct: 186  EV-LDAMIEIELRSNGLIVNNFAELDGEEYIQYYERTTGHKAWHLGPACLLHRTVEEKAQ 244

Query: 540  RGQGSTVSVHECMSWLNSKRDNSVLYISFGTMSYLPNKQLYEIASALEACGHDFIWVVPX 719
            RGQ S +S H+CMSWL+SK+ NSV+YI FG++   P+ QLYEIA A+EA G +F+WVVP 
Sbjct: 245  RGQKSVLSAHKCMSWLDSKKQNSVVYICFGSLCCFPDNQLYEIACAVEASGCEFVWVVPE 304

Query: 720  XXXXXXXXXXXXXNWLPKGFEERCIGKNKKGLLVKGWAPQLLILEHSAICAFLTHCGWNS 899
                          WLPKGFEER    +KKG++++GWAPQ+LILEH A+ AF+THCGWNS
Sbjct: 305  KKGKENEEEEEKQKWLPKGFEER---NSKKGMIIRGWAPQVLILEHPAVGAFVTHCGWNS 361

Query: 900  ILEAVAAGVPMITWPVHGEQFFNEKLVTQVRGIGVEVGATEWSTLGIFDRESVVRRESIE 1079
             +EAV+AGVPMITWPVHGEQF+NEKLV+ VRGIGVEVGA EW T+G  +RE +V RE IE
Sbjct: 362  TVEAVSAGVPMITWPVHGEQFYNEKLVSDVRGIGVEVGADEWGTIGFGEREKLVGREDIE 421

Query: 1080 KAVRRLLDGGDEAKKIKQRAREFGDKAKRAAQEGGSSHRNLTTLIEDLKRLR 1235
            +A+RRL+DGGDEA++I+ R +EFG KA+ A QEGGSS+ NLT LI++LKRLR
Sbjct: 422  RALRRLMDGGDEAQEIRGRTKEFGKKARVAVQEGGSSYNNLTALIDELKRLR 473


>ref|XP_013458223.1| UDP-glucosyltransferase family protein [Medicago truncatula]
 gb|KEH32254.1| UDP-glucosyltransferase family protein [Medicago truncatula]
          Length = 491

 Score =  514 bits (1324), Expect = e-176
 Identities = 253/421 (60%), Positives = 315/421 (74%), Gaps = 7/421 (1%)
 Frame = +3

Query: 3    FPSQEVGLQEGVENLSDIPDMDKLSRYTQAMTLLQPLIEEVVEHDPPDCIVADFMYPWVE 182
            FP Q+VGL  GVE++S+ PD+    +      LLQ  I+  +E DPPDCI+ DF+YPWV 
Sbjct: 71   FPYQQVGLPSGVESMSNTPDLASSGKLYAGAMLLQEPIQNFMEKDPPDCIIGDFLYPWVH 130

Query: 183  ELANKLRIPRLAFTVFSLFAICAMKSVMAS-SVH------GGSGSFLIQGLPHPIRLNAT 341
            +LA+KL +P LAF  FSLF +C M+++  + S++        SGSF+++ LPHPI L+  
Sbjct: 131  DLASKLWVPNLAFNGFSLFTVCLMETLRTNPSIYPHMDSDSDSGSFVVRNLPHPITLSGR 190

Query: 342  PPMNLRDVNMQEILDIQLKSHSLIVNNFSELDGEEYTEYYEKTINHKAWHIGPAYLIRKT 521
             P +  +  M  +L+ +LKS+ LIVNNF+ELDGEEY +YY  +  HKAWH+GPA LIRKT
Sbjct: 191  LPKSSEEF-MGSMLEKELKSNGLIVNNFAELDGEEYIKYYVSSTGHKAWHLGPASLIRKT 249

Query: 522  AQEKAERGQGSTVSVHECMSWLNSKRDNSVLYISFGTMSYLPNKQLYEIASALEACGHDF 701
             QEKAERGQ S VSV EC+SWLNSKR NSVLYISFG++    +KQLYEIA A+EA GHDF
Sbjct: 250  VQEKAERGQESAVSVQECLSWLNSKRHNSVLYISFGSLCRYQDKQLYEIACAIEASGHDF 309

Query: 702  IWVVPXXXXXXXXXXXXXXNWLPKGFEERCIGKNKKGLLVKGWAPQLLILEHSAICAFLT 881
            IWV+P               WLPKGFEER IGK  KGL+++GWAPQ+LIL H A+ AF+T
Sbjct: 310  IWVIPLNNGKEDESEEEKQKWLPKGFEERNIGK--KGLIIRGWAPQVLILSHPAVGAFMT 367

Query: 882  HCGWNSILEAVAAGVPMITWPVHGEQFFNEKLVTQVRGIGVEVGATEWSTLGIFDRESVV 1061
            HCGWNS +EAV AGVPMITWP HGEQF+NEKL+T+VRGIGVEVGATEW   G  +RE +V
Sbjct: 368  HCGWNSTVEAVGAGVPMITWPSHGEQFYNEKLITEVRGIGVEVGATEWCLTGFEEREKLV 427

Query: 1062 RRESIEKAVRRLLDGGDEAKKIKQRAREFGDKAKRAAQEGGSSHRNLTTLIEDLKRLRDN 1241
              +SIEKAVRRL+D GDEA+KI+ RA+EFG+KA+RA QEGGSSH+NL  LI+DLKRLRD+
Sbjct: 428  SSDSIEKAVRRLMDSGDEAEKIRSRAQEFGEKARRAIQEGGSSHKNLLALIDDLKRLRDS 487

Query: 1242 K 1244
            K
Sbjct: 488  K 488


>ref|XP_007154186.1| hypothetical protein PHAVU_003G097200g [Phaseolus vulgaris]
 gb|ESW26180.1| hypothetical protein PHAVU_003G097200g [Phaseolus vulgaris]
          Length = 473

 Score =  513 bits (1321), Expect = e-176
 Identities = 246/414 (59%), Positives = 316/414 (76%)
 Frame = +3

Query: 3    FPSQEVGLQEGVENLSDIPDMDKLSRYTQAMTLLQPLIEEVVEHDPPDCIVADFMYPWVE 182
            FPSQEVGL +G+ENLS + D+D L++   A T+LQP I++ VE +PPDCIVADF++PWV+
Sbjct: 66   FPSQEVGLPDGIENLSSVTDVDNLAKVFNATTMLQPPIQKFVEENPPDCIVADFLFPWVD 125

Query: 183  ELANKLRIPRLAFTVFSLFAICAMKSVMASSVHGGSGSFLIQGLPHPIRLNATPPMNLRD 362
            +LAN L IPRLAF  FSLF ICA+ S   +S      S L   LPHPI LNA+PP  L +
Sbjct: 126  DLANNLNIPRLAFNGFSLFTICAIHSSQPTS------SLLSPTLPHPITLNASPPKELSE 179

Query: 363  VNMQEILDIQLKSHSLIVNNFSELDGEEYTEYYEKTINHKAWHIGPAYLIRKTAQEKAER 542
              + ++L+ +L+S+ LIVNNF+ELDGEEY  YYEKT  HKAWH+GPA LI +T +EKAER
Sbjct: 180  F-LDKLLETELRSYGLIVNNFAELDGEEYIRYYEKTTGHKAWHLGPASLISRTPEEKAER 238

Query: 543  GQGSTVSVHECMSWLNSKRDNSVLYISFGTMSYLPNKQLYEIASALEACGHDFIWVVPXX 722
            G  S VS+ EC+SWL+SK +NSV+YI FG++ Y  +KQLYEIA  +EA GH FIWVVP  
Sbjct: 239  GMKSVVSMQECVSWLDSKAENSVVYICFGSLCYFSDKQLYEIACGIEASGHGFIWVVPEK 298

Query: 723  XXXXXXXXXXXXNWLPKGFEERCIGKNKKGLLVKGWAPQLLILEHSAICAFLTHCGWNSI 902
                         W+P+GFEER     +KG++++GWAPQLLIL H A+ AF++HCGWNS 
Sbjct: 299  KGKEKERQEEKEKWMPEGFEER---NAEKGMVIRGWAPQLLILNHRAVGAFVSHCGWNSS 355

Query: 903  LEAVAAGVPMITWPVHGEQFFNEKLVTQVRGIGVEVGATEWSTLGIFDRESVVRRESIEK 1082
            +EAV+ GVPMITWPVHGEQF+NEKL+++VRGIGVEVGA EW+T+G+ +R+ VV RESIE+
Sbjct: 356  VEAVSGGVPMITWPVHGEQFYNEKLISEVRGIGVEVGAAEWTTIGLGERQMVVCRESIER 415

Query: 1083 AVRRLLDGGDEAKKIKQRAREFGDKAKRAAQEGGSSHRNLTTLIEDLKRLRDNK 1244
             VRR++DGG EA+++++RA+EFG KA+ A  EGGSSH+NLT LI DL RLRD K
Sbjct: 416  GVRRIMDGGVEAEEVRRRAKEFGKKAREAVGEGGSSHKNLTALIHDLTRLRDAK 469


>gb|KRH02148.1| hypothetical protein GLYMA_17G019400 [Glycine max]
          Length = 473

 Score =  511 bits (1317), Expect = e-175
 Identities = 248/411 (60%), Positives = 310/411 (75%)
 Frame = +3

Query: 3    FPSQEVGLQEGVENLSDIPDMDKLSRYTQAMTLLQPLIEEVVEHDPPDCIVADFMYPWVE 182
            FPS EVGL +G+EN+S + D+D L +   A  +LQP IE+ VE  PPDCIVADF++PWV+
Sbjct: 66   FPSHEVGLPDGIENISAVSDLDSLGKVFSATAMLQPPIEDFVEQQPPDCIVADFLFPWVD 125

Query: 183  ELANKLRIPRLAFTVFSLFAICAMKSVMASSVHGGSGSFLIQGLPHPIRLNATPPMNLRD 362
            +LA KLRIPRLAF  FSLF ICA+ S   SS      S +IQ LPHPI LNATPP  L  
Sbjct: 126  DLAKKLRIPRLAFNGFSLFTICAIHSSSESS-----DSPIIQSLPHPITLNATPPKELTK 180

Query: 363  VNMQEILDIQLKSHSLIVNNFSELDGEEYTEYYEKTINHKAWHIGPAYLIRKTAQEKAER 542
              ++ +L+ +LKS+ LIVN+F+ELDGEEYT YYEKT  HKAWH+GPA LI +TAQEKAER
Sbjct: 181  F-LETVLETELKSYGLIVNSFTELDGEEYTRYYEKTTGHKAWHLGPASLIGRTAQEKAER 239

Query: 543  GQGSTVSVHECMSWLNSKRDNSVLYISFGTMSYLPNKQLYEIASALEACGHDFIWVVPXX 722
            GQ S VS+HEC++WL+SKR+NSV+YI FG++ Y  +KQLYEIA  ++A GHDFIWVVP  
Sbjct: 240  GQKSVVSMHECVAWLDSKRENSVVYICFGSLCYFQDKQLYEIACGIQASGHDFIWVVPEK 299

Query: 723  XXXXXXXXXXXXNWLPKGFEERCIGKNKKGLLVKGWAPQLLILEHSAICAFLTHCGWNSI 902
                         WLPKGFEE       KG++++GWAPQ++IL H AI AFLTHCGWNS 
Sbjct: 300  KGKEHEKEEEKEKWLPKGFEET---NEDKGMIIRGWAPQMIILGHPAIGAFLTHCGWNST 356

Query: 903  LEAVAAGVPMITWPVHGEQFFNEKLVTQVRGIGVEVGATEWSTLGIFDRESVVRRESIEK 1082
            +EAV+AG+PM+TWPVHGEQF+NEKL+T+VRGIGVEVGA EW+ +GI DR ++V R+ I+K
Sbjct: 357  VEAVSAGIPMLTWPVHGEQFYNEKLITEVRGIGVEVGAVEWTPIGIGDRLNLVTRDHIQK 416

Query: 1083 AVRRLLDGGDEAKKIKQRAREFGDKAKRAAQEGGSSHRNLTTLIEDLKRLR 1235
             VRRL+D  DEA +I++RA++F  KA++A  EGGSSH NLT LI  L  LR
Sbjct: 417  GVRRLMDASDEALEIRRRAKDFAQKARQAVLEGGSSHNNLTALIHHLILLR 467


>ref|XP_006600323.1| PREDICTED: scopoletin glucosyltransferase-like [Glycine max]
          Length = 500

 Score =  512 bits (1319), Expect = e-175
 Identities = 249/414 (60%), Positives = 311/414 (75%)
 Frame = +3

Query: 3    FPSQEVGLQEGVENLSDIPDMDKLSRYTQAMTLLQPLIEEVVEHDPPDCIVADFMYPWVE 182
            FPS EVGL +G+EN+S + D+D L +   A  +LQP IE+ VE  PPDCIVADF++PWV+
Sbjct: 66   FPSHEVGLPDGIENISAVSDLDSLGKVFSATAMLQPPIEDFVEQQPPDCIVADFLFPWVD 125

Query: 183  ELANKLRIPRLAFTVFSLFAICAMKSVMASSVHGGSGSFLIQGLPHPIRLNATPPMNLRD 362
            +LA KLRIPRLAF  FSLF ICA+ S   SS      S +IQ LPHPI LNATPP  L  
Sbjct: 126  DLAKKLRIPRLAFNGFSLFTICAIHSSSESS-----DSPIIQSLPHPITLNATPPKELTK 180

Query: 363  VNMQEILDIQLKSHSLIVNNFSELDGEEYTEYYEKTINHKAWHIGPAYLIRKTAQEKAER 542
              ++ +L+ +LKS+ LIVN+F+ELDGEEYT YYEKT  HKAWH+GPA LI +TAQEKAER
Sbjct: 181  F-LETVLETELKSYGLIVNSFTELDGEEYTRYYEKTTGHKAWHLGPASLIGRTAQEKAER 239

Query: 543  GQGSTVSVHECMSWLNSKRDNSVLYISFGTMSYLPNKQLYEIASALEACGHDFIWVVPXX 722
            GQ S VS+HEC++WL+SKR+NSV+YI FG++ Y  +KQLYEIA  ++A GHDFIWVVP  
Sbjct: 240  GQKSVVSMHECVAWLDSKRENSVVYICFGSLCYFQDKQLYEIACGIQASGHDFIWVVPEK 299

Query: 723  XXXXXXXXXXXXNWLPKGFEERCIGKNKKGLLVKGWAPQLLILEHSAICAFLTHCGWNSI 902
                         WLPKGFEE       KG++++GWAPQ++IL H AI AFLTHCGWNS 
Sbjct: 300  KGKEHEKEEEKEKWLPKGFEET---NEDKGMIIRGWAPQMIILGHPAIGAFLTHCGWNST 356

Query: 903  LEAVAAGVPMITWPVHGEQFFNEKLVTQVRGIGVEVGATEWSTLGIFDRESVVRRESIEK 1082
            +EAV+AG+PM+TWPVHGEQF+NEKL+T+VRGIGVEVGA EW+ +GI DR ++V R+ I+K
Sbjct: 357  VEAVSAGIPMLTWPVHGEQFYNEKLITEVRGIGVEVGAVEWTPIGIGDRLNLVTRDHIQK 416

Query: 1083 AVRRLLDGGDEAKKIKQRAREFGDKAKRAAQEGGSSHRNLTTLIEDLKRLRDNK 1244
             VRRL+D  DEA +I++RA++F  KA++A  EGGSSH NLT LI  L  LR  K
Sbjct: 417  GVRRLMDASDEALEIRRRAKDFAQKARQAVLEGGSSHNNLTALIHHLILLRMEK 470


>ref|XP_014521976.1| UDP-glucose flavonoid 3-O-glucosyltransferase 7-like [Vigna radiata
            var. radiata]
          Length = 472

 Score =  510 bits (1314), Expect = e-175
 Identities = 247/414 (59%), Positives = 316/414 (76%)
 Frame = +3

Query: 3    FPSQEVGLQEGVENLSDIPDMDKLSRYTQAMTLLQPLIEEVVEHDPPDCIVADFMYPWVE 182
            FPSQEVGL +G+EN+S I D D L +   A T+LQ  I+  VE +PPDCIVADF++PWV+
Sbjct: 66   FPSQEVGLPDGIENISFITDPDHLGKVFNATTMLQTPIQNFVEENPPDCIVADFLFPWVD 125

Query: 183  ELANKLRIPRLAFTVFSLFAICAMKSVMASSVHGGSGSFLIQGLPHPIRLNATPPMNLRD 362
            +LAN L+IPRLAF  FSLF ICA+ S   SS      S L   LPHPI LNA+PP  L +
Sbjct: 126  DLANNLKIPRLAFNGFSLFTICALHSSSNSS-----NSLLCPTLPHPITLNASPPKELTE 180

Query: 363  VNMQEILDIQLKSHSLIVNNFSELDGEEYTEYYEKTINHKAWHIGPAYLIRKTAQEKAER 542
              + ++++ +L+S+ LIVNNF+ELDGEEY +YYEKT  HKAWH+GPA LI +T +EKAER
Sbjct: 181  F-LDKMMETELRSYGLIVNNFAELDGEEYIQYYEKTTGHKAWHLGPASLIPRTPEEKAER 239

Query: 543  GQGSTVSVHECMSWLNSKRDNSVLYISFGTMSYLPNKQLYEIASALEACGHDFIWVVPXX 722
            G  S VS+HEC+SWL+SK  NSV+YI FG++ + P+KQLYEIA  +EA GH FIWVVP  
Sbjct: 240  GMKSVVSMHECLSWLDSKAKNSVVYICFGSLCHFPDKQLYEIACGIEASGHGFIWVVPEK 299

Query: 723  XXXXXXXXXXXXNWLPKGFEERCIGKNKKGLLVKGWAPQLLILEHSAICAFLTHCGWNSI 902
                         W+P+GFEER     +KG++++GWAPQL+IL H A+ AFL+HCGWNS 
Sbjct: 300  KGKEESEEEKE-KWMPEGFEER---NAEKGMVIRGWAPQLVILNHRAVGAFLSHCGWNST 355

Query: 903  LEAVAAGVPMITWPVHGEQFFNEKLVTQVRGIGVEVGATEWSTLGIFDRESVVRRESIEK 1082
            +EAV+ GVPMITWPVHGEQF+NEKL+++VRGIGVEVGA EWS++G  +RE +V RESIE+
Sbjct: 356  VEAVSGGVPMITWPVHGEQFYNEKLISEVRGIGVEVGAAEWSSIGFGEREMLVCRESIER 415

Query: 1083 AVRRLLDGGDEAKKIKQRAREFGDKAKRAAQEGGSSHRNLTTLIEDLKRLRDNK 1244
             VRR++DGGDEA+++++RA+EFG+KA+ A  EGGSSH+NLT LI DL RLRD K
Sbjct: 416  GVRRIMDGGDEAQEVRRRAQEFGEKAREAVGEGGSSHKNLTALIHDLMRLRDAK 469


>ref|XP_004508345.1| PREDICTED: scopoletin glucosyltransferase [Cicer arietinum]
 gb|AGU14073.1| UDP-glycosyltransferase [Cicer arietinum]
          Length = 479

 Score =  510 bits (1314), Expect = e-175
 Identities = 245/415 (59%), Positives = 313/415 (75%), Gaps = 1/415 (0%)
 Frame = +3

Query: 3    FPSQEVGLQEGVENLSDIPDMDKLSRYTQAMTLLQPLIEEVVEHDPPDCIVADFMYPWVE 182
            FPSQ+VGL +GVENL+ + D+D   +   A TLL+  IE  VE  PPDC++ADF++PWV+
Sbjct: 60   FPSQQVGLPDGVENLTSVTDIDNSYKIFFATTLLREHIENFVEQYPPDCVIADFLFPWVD 119

Query: 183  ELANKLRIPRLAFTVFSLFAICAMKSVMASSV-HGGSGSFLIQGLPHPIRLNATPPMNLR 359
            ELANKL IPRL F  FSLF ICAM+S+    +    SGSF+I   PH I +N+TPP+  +
Sbjct: 120  ELANKLHIPRLVFNGFSLFTICAMESLKLHPLPEDASGSFVIPHFPHDIVINSTPPVGSK 179

Query: 360  DVNMQEILDIQLKSHSLIVNNFSELDGEEYTEYYEKTINHKAWHIGPAYLIRKTAQEKAE 539
               +  +L + LKSH  I+N+F ELDGEEY EYYEKT+ HKAWH+GPA L+R+T QEKA+
Sbjct: 180  SF-IDPLLTVALKSHGFIINSFVELDGEEYVEYYEKTMTHKAWHLGPASLVRRTTQEKAD 238

Query: 540  RGQGSTVSVHECMSWLNSKRDNSVLYISFGTMSYLPNKQLYEIASALEACGHDFIWVVPX 719
            RG+ STVSV +C++WLNSKRD SV+YISFGT+ Y P+KQLYEIASA+EA G++FIWVVP 
Sbjct: 239  RGEKSTVSVEKCLAWLNSKRDKSVIYISFGTICYFPDKQLYEIASAIEASGYEFIWVVPE 298

Query: 720  XXXXXXXXXXXXXNWLPKGFEERCIGKNKKGLLVKGWAPQLLILEHSAICAFLTHCGWNS 899
                         NWLPKGFEER      KG++V+GWAPQ++IL HSA+ AFLTHCGWNS
Sbjct: 299  KRGKENESEVEKENWLPKGFEER-----NKGMIVRGWAPQVVILGHSAVGAFLTHCGWNS 353

Query: 900  ILEAVAAGVPMITWPVHGEQFFNEKLVTQVRGIGVEVGATEWSTLGIFDRESVVRRESIE 1079
            ++EA++AGVPMITWPVH +QF+NEKL+TQVRGIGVE+G  EW T    D E +V R+SIE
Sbjct: 354  MVEAISAGVPMITWPVHSDQFYNEKLITQVRGIGVEIGVEEWITTAFRDMEKLVGRDSIE 413

Query: 1080 KAVRRLLDGGDEAKKIKQRAREFGDKAKRAAQEGGSSHRNLTTLIEDLKRLRDNK 1244
            K +RRL+D  DEA +I+++A+ F   A+ A  EGGSSH+NLT LI+++K LRDNK
Sbjct: 414  KTMRRLMDDSDEAVEIRRKAQGFAKLARHAVGEGGSSHQNLTNLIDEIKLLRDNK 468


>ref|XP_020227520.1| scopoletin glucosyltransferase-like [Cajanus cajan]
          Length = 477

 Score =  508 bits (1309), Expect = e-174
 Identities = 247/414 (59%), Positives = 308/414 (74%)
 Frame = +3

Query: 3    FPSQEVGLQEGVENLSDIPDMDKLSRYTQAMTLLQPLIEEVVEHDPPDCIVADFMYPWVE 182
            FPSQE GL +G+EN+S + D + L++   A  +LQP IEE VE  PPDCIVADF++PWV+
Sbjct: 70   FPSQEAGLPDGIENISSVEDAENLAKMFHATAMLQPPIEEFVEQHPPDCIVADFLFPWVD 129

Query: 183  ELANKLRIPRLAFTVFSLFAICAMKSVMASSVHGGSGSFLIQGLPHPIRLNATPPMNLRD 362
            ELANKL IPRLAF  FSLF ICA+ S         + S  I  LPHPI +NATPP  L +
Sbjct: 130  ELANKLCIPRLAFNGFSLFTICAVGS--PHEYEYDNDSLHIPTLPHPITINATPPKELTE 187

Query: 363  VNMQEILDIQLKSHSLIVNNFSELDGEEYTEYYEKTINHKAWHIGPAYLIRKTAQEKAER 542
              +  +L  +LKSH LIVN+F ELDGEEY  YYEK+  HKAWH+GPA LI ++AQEKAER
Sbjct: 188  F-LHTMLQTELKSHGLIVNSFLELDGEEYIGYYEKSTGHKAWHLGPASLIGRSAQEKAER 246

Query: 543  GQGSTVSVHECMSWLNSKRDNSVLYISFGTMSYLPNKQLYEIASALEACGHDFIWVVPXX 722
            GQ S VSV EC++WL+SKR  SV+YI FG++ + P+KQL EIA  +EA GH+FIWVVP  
Sbjct: 247  GQKSVVSVEECLTWLDSKRVESVVYICFGSLCHFPDKQLQEIACGIEASGHEFIWVVPEK 306

Query: 723  XXXXXXXXXXXXNWLPKGFEERCIGKNKKGLLVKGWAPQLLILEHSAICAFLTHCGWNSI 902
                         WLPKGFEE+     +KG++++GWAPQLLIL H A+ AFLTHCGWNS 
Sbjct: 307  KGKENERQEEKEKWLPKGFEEK-----EKGMIIRGWAPQLLILNHHAVGAFLTHCGWNST 361

Query: 903  LEAVAAGVPMITWPVHGEQFFNEKLVTQVRGIGVEVGATEWSTLGIFDRESVVRRESIEK 1082
            +EAV AG+PM+TWPVHGEQF+NEKL+TQVRGIGVEVGA EW+++G  +R  +V R+SI+ 
Sbjct: 362  VEAVTAGIPMLTWPVHGEQFYNEKLITQVRGIGVEVGAAEWTSIGFGERHKLVTRDSIQN 421

Query: 1083 AVRRLLDGGDEAKKIKQRAREFGDKAKRAAQEGGSSHRNLTTLIEDLKRLRDNK 1244
            A+ RL+D GDEA +I++RA+EF  KA++A +EGGSSH NLT LI DLKRLRD K
Sbjct: 422  AITRLMDAGDEATQIRRRAKEFAQKARKAVKEGGSSHNNLTALIHDLKRLRDAK 475


>dbj|GAU11068.1| hypothetical protein TSUD_196920 [Trifolium subterraneum]
          Length = 479

 Score =  506 bits (1303), Expect = e-173
 Identities = 254/415 (61%), Positives = 310/415 (74%)
 Frame = +3

Query: 3    FPSQEVGLQEGVENLSDIPDMDKLSRYTQAMTLLQPLIEEVVEHDPPDCIVADFMYPWVE 182
            FPS +VGL  GVENLS + ++D   R  QA  LLQ  I+  VE D PDCIVADFM+ WV+
Sbjct: 70   FPSHQVGLPPGVENLSSVNNLDNSYRVHQAAMLLQSPIQHFVEQDSPDCIVADFMFLWVD 129

Query: 183  ELANKLRIPRLAFTVFSLFAICAMKSVMASSVHGGSGSFLIQGLPHPIRLNATPPMNLRD 362
            ELANKL IPRLAF  FSLF ICAM+S+ A     G  S +I+GLPH I LNATPP  L  
Sbjct: 130  ELANKLHIPRLAFNGFSLFTICAMESLRAR----GFESSVIKGLPHCITLNATPPKALAK 185

Query: 363  VNMQEILDIQLKSHSLIVNNFSELDGEEYTEYYEKTINHKAWHIGPAYLIRKTAQEKAER 542
              M+ +L+ +LKS+ LIVNNF+ELDGEEY E+Y+KT  H+ WH+GP  LI +T QEKAER
Sbjct: 186  F-MEPLLETELKSYGLIVNNFTELDGEEYIEHYKKTTGHRVWHLGPVSLICRTTQEKAER 244

Query: 543  GQGSTVSVHECMSWLNSKRDNSVLYISFGTMSYLPNKQLYEIASALEACGHDFIWVVPXX 722
            GQ S V+VHECMSWLNSK+ NSVLYI FG++ +  NKQLYEIASA+EA  H FIWVVP  
Sbjct: 245  GQTSAVNVHECMSWLNSKQPNSVLYICFGSLCHFSNKQLYEIASAIEATNHQFIWVVPEK 304

Query: 723  XXXXXXXXXXXXNWLPKGFEERCIGKNKKGLLVKGWAPQLLILEHSAICAFLTHCGWNSI 902
                         WLPKGFEER      KG++++GWAPQ++IL   AI AFLTHCGWNS 
Sbjct: 305  KGKEDESNDENEKWLPKGFEER-----NKGMIIRGWAPQVVILGDPAIGAFLTHCGWNST 359

Query: 903  LEAVAAGVPMITWPVHGEQFFNEKLVTQVRGIGVEVGATEWSTLGIFDRESVVRRESIEK 1082
            +EAV+AGVPMITWPVH EQF+NEKL+TQVRGIGVEVGA EWS +G  +R+ +V R+ IEK
Sbjct: 360  VEAVSAGVPMITWPVHDEQFYNEKLITQVRGIGVEVGAEEWSIIGFMERKKLVGRDIIEK 419

Query: 1083 AVRRLLDGGDEAKKIKQRAREFGDKAKRAAQEGGSSHRNLTTLIEDLKRLRDNKF 1247
            AVRRL+DGG EA +I++ A+E+  KAKR+ QEGGSSH+NL  LI+D+KRL++  +
Sbjct: 420  AVRRLMDGGVEANEIRRCAKEYAIKAKRSVQEGGSSHKNLMALIDDVKRLKEKDY 474


>ref|XP_017423634.1| PREDICTED: scopoletin glucosyltransferase-like [Vigna angularis]
 gb|KOM43816.1| hypothetical protein LR48_Vigan05g142100 [Vigna angularis]
          Length = 472

 Score =  505 bits (1301), Expect = e-173
 Identities = 242/414 (58%), Positives = 316/414 (76%)
 Frame = +3

Query: 3    FPSQEVGLQEGVENLSDIPDMDKLSRYTQAMTLLQPLIEEVVEHDPPDCIVADFMYPWVE 182
            FPSQEVGL +G+EN+S + D D L++   A T+LQ  I+  VE +PPDCIVADF++PWV+
Sbjct: 66   FPSQEVGLPDGIENISSVTDTDHLAKVFNATTMLQTPIQNFVEENPPDCIVADFLFPWVD 125

Query: 183  ELANKLRIPRLAFTVFSLFAICAMKSVMASSVHGGSGSFLIQGLPHPIRLNATPPMNLRD 362
            +LAN L IPRLAF  FSLF ICA+ S   SS      S L   LPHPI LNA+PP  L +
Sbjct: 126  DLANNLNIPRLAFNGFSLFTICALHSSSNSS-----NSLLSPTLPHPITLNASPPKELTE 180

Query: 363  VNMQEILDIQLKSHSLIVNNFSELDGEEYTEYYEKTINHKAWHIGPAYLIRKTAQEKAER 542
              + ++L+ +L+S+ LIVNNF+ELDGEEY  YYEKT  HKAWH+GPA LI +T +EKAER
Sbjct: 181  F-LDKMLETELRSYGLIVNNFAELDGEEYIRYYEKTTGHKAWHLGPASLIPRTLEEKAER 239

Query: 543  GQGSTVSVHECMSWLNSKRDNSVLYISFGTMSYLPNKQLYEIASALEACGHDFIWVVPXX 722
            G  S VS+HEC+SWL+SK  NSV+YI FG++ + P+KQLYEIA  +EA GH FIWVVP  
Sbjct: 240  GMKSVVSMHECLSWLDSKAKNSVVYICFGSLCHFPDKQLYEIACGIEASGHGFIWVVPEK 299

Query: 723  XXXXXXXXXXXXNWLPKGFEERCIGKNKKGLLVKGWAPQLLILEHSAICAFLTHCGWNSI 902
                         W+P+GF+ER     +KG++++GWAPQL+IL H A+ AFL+HCGWNS 
Sbjct: 300  KGKEESEEEKE-KWMPEGFKER---NAEKGMVIRGWAPQLVILNHRAVGAFLSHCGWNST 355

Query: 903  LEAVAAGVPMITWPVHGEQFFNEKLVTQVRGIGVEVGATEWSTLGIFDRESVVRRESIEK 1082
            +EAV+ GVPMITWPVHGEQF+NEKL+++VRGIG+EVGA EWS++G  +R+++V R+SIE+
Sbjct: 356  VEAVSGGVPMITWPVHGEQFYNEKLISEVRGIGLEVGAAEWSSIGFGERQTLVCRDSIER 415

Query: 1083 AVRRLLDGGDEAKKIKQRAREFGDKAKRAAQEGGSSHRNLTTLIEDLKRLRDNK 1244
             VRR++DGGDEA+++++RA+E+G+KA+ A  EGGSSH+NLT LI DL RLRD K
Sbjct: 416  GVRRIMDGGDEAQEVRRRAQEYGEKAREAVGEGGSSHKNLTALIHDLMRLRDAK 469


>dbj|GAU41610.1| hypothetical protein TSUD_196800 [Trifolium subterraneum]
          Length = 482

 Score =  505 bits (1301), Expect = e-173
 Identities = 247/414 (59%), Positives = 314/414 (75%)
 Frame = +3

Query: 3    FPSQEVGLQEGVENLSDIPDMDKLSRYTQAMTLLQPLIEEVVEHDPPDCIVADFMYPWVE 182
            FPSQEVGL  G+E++S   + + L +  Q+  LL+  I+  +E +PPDCIVADF+YPWV 
Sbjct: 69   FPSQEVGLPNGMESISTATNNNNLIKIYQSTNLLRTPIQHFIEQNPPDCIVADFLYPWVH 128

Query: 183  ELANKLRIPRLAFTVFSLFAICAMKSVMASSVHGGSGSFLIQGLPHPIRLNATPPMNLRD 362
            ELANKL+IPR AF   SLFA CAM+SV A+SV+G S S++I  LPH I +  TPP  + +
Sbjct: 129  ELANKLQIPRFAFHALSLFATCAMESVKANSVYGSS-SYVIPDLPHSISMTVTPPKGVAE 187

Query: 363  VNMQEILDIQLKSHSLIVNNFSELDGEEYTEYYEKTINHKAWHIGPAYLIRKTAQEKAER 542
            V +  +L+   KS+  I+N+F+EL+G+EY E+YEKT  HKA H+GPA LIR T QEK+ER
Sbjct: 188  V-LGGLLETVYKSNGFIINSFAELEGQEYIEHYEKTTGHKALHLGPASLIRTTIQEKSER 246

Query: 543  GQGSTVSVHECMSWLNSKRDNSVLYISFGTMSYLPNKQLYEIASALEACGHDFIWVVPXX 722
            G+ S VS HEC+SWLNSK D SVLYI FG++    +KQLYEIA  +EA GH+FIW+VP  
Sbjct: 247  GEPSIVSSHECLSWLNSKPDKSVLYICFGSLCLFQDKQLYEIACGIEASGHEFIWIVPEK 306

Query: 723  XXXXXXXXXXXXNWLPKGFEERCIGKNKKGLLVKGWAPQLLILEHSAICAFLTHCGWNSI 902
                         WLPKGFEER IGK   GL+++GWAPQ++IL H A+  F+THCGWNSI
Sbjct: 307  KGKEDESDEEKEKWLPKGFEERNIGK---GLIIRGWAPQVMILSHRAVGGFMTHCGWNSI 363

Query: 903  LEAVAAGVPMITWPVHGEQFFNEKLVTQVRGIGVEVGATEWSTLGIFDRESVVRRESIEK 1082
             EAV+AGVPMITWPVHGEQFFNEKL+TQVR IG+EVGATEW+ +G  +RE VV RESIEK
Sbjct: 364  TEAVSAGVPMITWPVHGEQFFNEKLITQVRRIGLEVGATEWTHMGFGEREEVVGRESIEK 423

Query: 1083 AVRRLLDGGDEAKKIKQRAREFGDKAKRAAQEGGSSHRNLTTLIEDLKRLRDNK 1244
            AVRRL+D GDEA++I++RAREFG+KAK A QEGGS+++N T +I+++KR RD K
Sbjct: 424  AVRRLMDDGDEAEEIRRRAREFGEKAKLAVQEGGSTYKNFTAVIDEIKRSRDRK 477


>gb|PNY07429.1| isoflavonoid glucosyltransferase [Trifolium pratense]
          Length = 489

 Score =  504 bits (1299), Expect = e-172
 Identities = 247/417 (59%), Positives = 313/417 (75%), Gaps = 3/417 (0%)
 Frame = +3

Query: 3    FPSQEVGLQEGVENLSDIPDMDKLSRYTQAMTLLQPLIEEVVEHDPPDCIVADFMYPWVE 182
            FPSQ+VGL  G+E++S+ PD+D   +  +   LL   I++ +E DPPDCI+ADFMY WV 
Sbjct: 73   FPSQQVGLPNGIESMSNTPDLDSSHKLYRGAMLLHEQIQDFMEADPPDCIIADFMYTWVN 132

Query: 183  ELANKLRIPRLAFTVFSLFAICAMKSVMAS-SVHG--GSGSFLIQGLPHPIRLNATPPMN 353
            +LA KL +P+LAF  FSLF +  M++   + S+H    SGSF++   P+ I L + PP +
Sbjct: 133  DLATKLHVPKLAFNGFSLFTVSLMETFRTNPSLHSLTDSGSFVVPNFPYHITLCSRPPKS 192

Query: 354  LRDVNMQEILDIQLKSHSLIVNNFSELDGEEYTEYYEKTINHKAWHIGPAYLIRKTAQEK 533
                 M+ +L+ ++KS+ LIVNNF+ELDGEE  E+YEK   HKAWH+GPA LIRKT QEK
Sbjct: 193  YTGF-MEPLLEKEVKSNGLIVNNFAELDGEECIEHYEKKTGHKAWHLGPASLIRKTVQEK 251

Query: 534  AERGQGSTVSVHECMSWLNSKRDNSVLYISFGTMSYLPNKQLYEIASALEACGHDFIWVV 713
            AERGQ S VSVHEC+SWLNSKRDN+VLYI FG++ +   KQLYEIA A+EA GH FIWVV
Sbjct: 252  AERGQESAVSVHECLSWLNSKRDNTVLYICFGSICHYSEKQLYEIACAIEASGHKFIWVV 311

Query: 714  PXXXXXXXXXXXXXXNWLPKGFEERCIGKNKKGLLVKGWAPQLLILEHSAICAFLTHCGW 893
            P               WLPKGFEE+ IGK  +GL+++GWAPQ+LIL H A+  F+THCGW
Sbjct: 312  PEKIGKEDESDEEKEKWLPKGFEEKNIGK--QGLIIRGWAPQVLILSHPAVGGFMTHCGW 369

Query: 894  NSILEAVAAGVPMITWPVHGEQFFNEKLVTQVRGIGVEVGATEWSTLGIFDRESVVRRES 1073
            NS +EAV AGVPMITWPVHGEQF+NEKL+T+VRGIGVEVGATEW      +RE++V R+S
Sbjct: 370  NSTVEAVGAGVPMITWPVHGEQFYNEKLITEVRGIGVEVGATEWCLSSFAERETLVTRDS 429

Query: 1074 IEKAVRRLLDGGDEAKKIKQRAREFGDKAKRAAQEGGSSHRNLTTLIEDLKRLRDNK 1244
            IEKAVRRL+DGGDEA+KI++RA EF +KA+ A QE GSSH+NL+ LI++LKRLRD K
Sbjct: 430  IEKAVRRLMDGGDEAEKIRRRAHEFREKARGAVQEDGSSHKNLSALIDELKRLRDRK 486


>dbj|GAU41618.1| hypothetical protein TSUD_196880 [Trifolium subterraneum]
          Length = 471

 Score =  504 bits (1297), Expect = e-172
 Identities = 249/415 (60%), Positives = 308/415 (74%), Gaps = 1/415 (0%)
 Frame = +3

Query: 3    FPSQEVGLQEGVENLSDIPDMDKLSRYTQAMTLLQPLIEEVVEHDPPDCIVADFMYPWVE 182
            FPSQ+VGL EGVENLS + D+D   +  QA TLL+  +E  VE  PPDCIVADF +PWV+
Sbjct: 60   FPSQKVGLPEGVENLSAVTDIDSGYKIYQAATLLREQVEHFVEKHPPDCIVADFCFPWVD 119

Query: 183  ELANKLRIPRLAFTVFSLFAICAMKSVMASSV-HGGSGSFLIQGLPHPIRLNATPPMNLR 359
            E+ANKL IPR AF  FSLF ICAM+S+ +  +    SG F+I   P  I +N+TPP   +
Sbjct: 120  EVANKLHIPRFAFNGFSLFTICAMESLKSHPLPDNASGPFVIPNFPRDIIINSTPPFESK 179

Query: 360  DVNMQEILDIQLKSHSLIVNNFSELDGEEYTEYYEKTINHKAWHIGPAYLIRKTAQEKAE 539
               +   L I LKS   I+N+F ELDGEEY EYYEK I HKAWH+GPA L+RKT QEKAE
Sbjct: 180  SF-VDPHLTIALKSRGFIINSFVELDGEEYVEYYEKIIGHKAWHLGPASLVRKTTQEKAE 238

Query: 540  RGQGSTVSVHECMSWLNSKRDNSVLYISFGTMSYLPNKQLYEIASALEACGHDFIWVVPX 719
            RG+ ST  V + ++WLNSK DNSVLYISFG++ Y P+KQL+EIASA+EA G+DFIWVVP 
Sbjct: 239  RGEKSTKDVQKYLTWLNSKCDNSVLYISFGSICYFPDKQLFEIASAIEASGYDFIWVVPE 298

Query: 720  XXXXXXXXXXXXXNWLPKGFEERCIGKNKKGLLVKGWAPQLLILEHSAICAFLTHCGWNS 899
                          WLPKGFEER      KG++V+GWAPQ++IL H A+ AFLTHCGWNS
Sbjct: 299  KKGKENESEEEKEKWLPKGFEER-----NKGMIVRGWAPQMVILGHPALGAFLTHCGWNS 353

Query: 900  ILEAVAAGVPMITWPVHGEQFFNEKLVTQVRGIGVEVGATEWSTLGIFDRESVVRRESIE 1079
            ++EAV+AGVPMITWPVH +QF+NEKL+TQVRGIGVEVG  EW T    D + +V+R+ IE
Sbjct: 354  VVEAVSAGVPMITWPVHSDQFYNEKLITQVRGIGVEVGVDEWITAAFRDMKKLVKRDQIE 413

Query: 1080 KAVRRLLDGGDEAKKIKQRAREFGDKAKRAAQEGGSSHRNLTTLIEDLKRLRDNK 1244
            KA+RRL+DGGDEA +I+QRA++F   A+ A QEGGSSH +L TLI++LK+LRDNK
Sbjct: 414  KALRRLMDGGDEAVQIRQRAQKFAKIARHAVQEGGSSHESLVTLIDELKQLRDNK 468


Top