BLASTX nr result
ID: Angelica23_contig00010627
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00010627 (1621 letters)

Database: ./nr
23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|AAR06921.1| UDP-glycosyltransferase 89B2 [Stevia rebaudiana]      509   e-141
ref|XP_002514595.1| UDP-glucosyltransferase, putative [Ricinus c...  482   e-133
ref|XP_002318584.1| predicted protein [Populus trichocarpa] gi|2...  459   e-127
ref|XP_002887513.1| UDP-glucoronosyl/UDP-glucosyl transferase fa...  444   e-123
gb|AAG52529.1|AC016662_23 putative glucosyltransferase; 88035-86...  436   e-120

>gb|AAR06921.1| UDP-glycosyltransferase 89B2 [Stevia rebaudiana]
Length = 468

Score = 509 bits (1310), Expect = e-141
Identities = 250/451 (55%), Positives = 327/451 (72%)
Frame = +3

Query: 3    DLTHQLALRNLTITILVTPKNLHYLNPLLTRHPESVHTLVLPFPANHSALLAGVENVRDL 182
            DLTHQLA+RNLTITILVTPKNL ++PLL HP +V L+LP P H A+ +G+ENV+DL
Sbjct: 28   DLTHQLAIRNLTITILVTPKNLPTISPLLAAHPTTVSALLLPLPP-HPAIPSGIENVKDL 86

Query: 183  PASAFRLMMMALTGLHDPIVDWFQNHPSPPVAIVSDMFLGWTHRLACKLKISRFVFSPSG 362
            P AF+ MM+AL L++P+ DWF+N P+PPVAI+SD FLGWTH LA +L I R+ FSPSG
Sbjct: 87   PNDAFKAMMVALGDLYNPLRDWFRNQPNPPVAIISDFFLGWTHHLAVELGIRRYTFSPSG 146

Query: 363  ALALSFIYSLWRDMPKRNDPSDDNELIKFSEIPNCPSYPWWKIPLGFRSYVEGNTQSEAV 542
            ALALS I+SLWR PKR D ++ E IKF +IPN P YPWW++ +RSYVEG+ SE +
Sbjct: 147  ALALSVIFSLWRYQPKRIDVENEKEAIKFPKIPNSPEYPWWQLSPIYRSYVEGDPDSEFI 206

Query: 543  RDSFMENIASHGLVFNSFTALEQVYLEYMQKFLGHVRMWAVGPLLPLEEERVGRGGSTEI 722
            +D F+ +IAS G+V NSFT LEQVY+++++ LGH +++AVGPLLP ++ GRGGS+
Sbjct: 207  KDGFLADIASWGIVINSFTELEQVYVDHLKHELGHDQVFAVGPLLPPGDKTSGRGGSS-- 264

Query: 723  VADGIKSWLDQFEEKTVVYVCFGSQAVLTNKQMEMLALGLENSGVKFLWAYKDPTKGHEA 902
            ++ + SWLD ++TVVYVCFGSQ VLTN QME++ALGLE S VKF+W+ K+PT GHEA
Sbjct: 265  -SNDVLSWLDTCADRTVVYVCFGSQMVLTNGQMEVVALGLEKSRVKFVWSVKEPTVGHEA 323

Query: 903  GDYGMIPSGFKERVAGRGFIVKGWSPQVLILRHPAVGAFFTHCGWNSVLESIVAGVPMLT 1082
            +YG +P GF++RV+GRG +++GW PQV IL H +VG F THCGWNSV+E++ A V MLT
Sbjct: 324  ANYGRVPPGFEDRVSGRGLVIRGWVPQVAILSHDSVGVFLTHCGWNSVMEAVAAEVLMLT 383

Query: 1083 WPMGADQFTNADLLDELALGTRVCEGEETIPDQFTNADLLDELALGTRVCEGEETIPDSD 1262
            WPM ADQF+NA LL EL +G +VC EG +P+SD
Sbjct: 384  WPMSADQFSNATLLHELKVGIKVC--------------------------EGSNIVPNSD 417

Query: 1263 ELACLVARSVSDEKRARIARAKEFSKAALDS 1355
            ELA L ++S+SDE R R KEF+K+A ++
Sbjct: 418  ELAELFSKSLSDETRLERKRVKEFAKSAKEA 448

>ref|XP_002514595.1| UDP-glucosyltransferase, putative [Ricinus communis]
 gi|223546199|gb|EEF47701.1| UDP-glucosyltransferase, putative [Ricinus communis]
Length = 472

Score = 482 bits (1240), Expect = e-133
Identities = 248/455 (54%), Positives = 312/455 (68%), Gaps = 4/455 (0%)
Frame = +3

Query: 3    DLTHQLALRNLTITILVTPKNLHYLNPLLTRHPESVHTLVLPFPANHSALLAGVENVRDL 182
            DLT +LA+ LTITILVTPKNL +L+PLL+ HP S+ TLV PFPA H + +GVEN +DL
Sbjct: 28   DLTRKLAVHGLTITILVTPKNLSFLHPLLSTHP-SIETLVFPFPA-HPLIPSGVENNKDL 85

Query: 183  PASAFRLMMMALTGLHDPIVDWFQNHPSPPVAIVSDMFLGWTHRLACKLKISRFVFSPSG 362
            PA +++ AL GL+DP++ WF +HPSPPVAI+SDMFLGWT LA +L I R VFSPSG
Sbjct: 86   PAECTPVLIRALGGLYDPLLHWFISHPSPPVAIISDMFLGWTQNLASQLNIRRIVFSPSG 145

Query: 363  ALALSFIYSLWRDMPKRNDPSDDNELIKFSEIPNCPSYPWWKIPLGFRSYVEGNTQSEAV 542
            A+ALS IYSLWRDMP+RN NE++ FS IPNCP+YPW +I +RSY+E +T E +
Sbjct: 146  AMALSIIYSLWRDMPRRNQ----NEVVSFSRIPNCPNYPWRQISPIYRSYIENDTNWEFI 201

Query: 543  RDSFMENIASHGLVFNSFTALEQVYLEYMQKFLGHVRMWAVGPLLPLEEERVGR----GG 710
            +DSF N+ S GLV NSFT LE++YL+Y +K LG +WAVGPLLP + + R GG
Sbjct: 202  KDSFRANLVSWGLVVNSFTELEEIYLDYFKKELGSDHVWAVGPLLPPHHDSISRQSERGG 261

Query: 711  STEIVADGIKSWLDQFEEKTVVYVCFGSQAVLTNKQMEMLALGLENSGVKFLWAYKDPTK 890
            + + + +WLD E+ VVYVCFGSQ LT Q+E LAL LE S V F+W K+
Sbjct: 262  PSSVPVHDVMAWLDTCEDHRVVYVCFGSQTWLTKDQIEELALSLEMSKVNFIWCVKE--- 318

Query: 891  GHEAGDYGMIPSGFKERVAGRGFIVKGWSPQVLILRHPAVGAFFTHCGWNSVLESIVAGV 1070
            H G Y +IPSGF++RVAGRG +++GW PQVLIL HPAVGAF THCGWNSVLE +VA V
Sbjct: 319  -HINGKYSVIPSGFEDRVAGRGLVIRGWVPQVLILSHPAVGAFLTHCGWNSVLEGLVAAV 377

Query: 1071 PMLTWPMGADQFTNADLLDELALGTRVCEGEETIPDQFTNADLLDELALGTRVCEGEETI 1250
            PML WPMGADQF NA LL +DEL + RVCEG +T+
Sbjct: 378  PMLAWPMGADQFVNARLL-------------------------VDELQVAVRVCEGAKTV 412

Query: 1251 PDSDELACLVARSVSDEKRARIARAKEFSKAALDS 1355
            P+SDELA ++ SVS E R +AK+ + A+D+
Sbjct: 413  PNSDELARVIMESVS-ENRVEREQAKKLRRVAMDT 446

>ref|XP_002318584.1| predicted protein [Populus trichocarpa]
 gi|222859257|gb|EEE96804.1| predicted protein [Populus trichocarpa]
Length = 472

Score = 459 bits (1181), Expect(2) = e-127
Identities = 239/455 (52%), Positives = 308/455 (67%), Gaps = 4/455 (0%)
Frame = +3

Query: 3    DLTHQLALRNLTITILVTPKNLHYLNPLLTRHPESVHTLVLPFPANHSALLAGVENVRDL 182
            DL H L +R LTITILVTPKNL LNPLL+++ +++TLVLPFP N+ ++ G+EN++DL
Sbjct: 23   DLAHHLVIRGLTITILVTPKNLPILNPLLSKN-STINTLVLPFP-NYPSIPLGIENLKDL 80

Query: 183  PASAFRLMMM-ALTGLHDPIVDWFQNHPSPPVAIVSDMFLGWTHRLACKLKISRFVFSPS 359
            P + M+ AL L+ P++ WF++HPSPPVAI+SDMFLGWTHRLAC+L + RFVFSPS
Sbjct: 81   PPNIRPTSMIHALGELYQPLLSWFRSHPSPPVAIISDMFLGWTHRLACQLGVRRFVFSPS 140

Query: 360  GALALSFIYSLWRDMPKRNDPSDDNELIKFSEIPNCPSYPWWKIPLGFRSYVEGNTQSEA 539
            GA+AL+ +YSLW++MP N P D NEL FS+IP+CP YPW +I +RSYVEG+ SE
Sbjct: 141  GAMALATMYSLWQEMP--NAPKDQNELFSFSKIPSCPKYPWLQISTIYRSYVEGDPVSEF 198

Query: 540  VRDSFMENIASHGLVFNSFTALEQVYLEYMQKFLGHVRMWAVGPLLP---LEEERVGRGG 710
            ++ NIAS GL+ NS T LE +Y E+++K LGH R+WAVGP+LP ++ RG
Sbjct: 199  TKEGMEANIASWGLIVNSLTLLEGIYFEHLRKQLGHDRVWAVGPILPEKTIDMTPPERGV 258

Query: 711  STEIVADGIKSWLDQFEEKTVVYVCFGSQAVLTNKQMEMLALGLENSGVKFLWAYKDPTK 890
            S +K+WLD E+ VVYVC+G+Q VLT QME +A GLE SGV F+W K P+K
Sbjct: 259  SMH----DLKTWLDTCEDHKVVYVCYGTQVVLTKYQMEAVASGLEKSGVHFIWCVKQPSK 314

Query: 891  GHEAGDYGMIPSGFKERVAGRGFIVKGWSPQVLILRHPAVGAFFTHCGWNSVLESIVAGV 1070
            H Y MIPSGF++RVAGRG I++GW+PQV IL H AVGAF THCGWNS+LE IVAGV
Sbjct: 315  EHVGEGYSMIPSGFEDRVAGRGLIIRGWAPQVWILSHRAVGAFLTHCGWNSILEGIVAGV 374

Query: 1071 PMLTWPMGADQFTNADLLDELALGTRVCEGEETIPDQFTNADLLDELALGTRVCEGEETI 1250
            PML PM ADQF A L L+++L + RVC+G +
Sbjct: 375  PMLACPMAADQFVGATL-------------------------LVEDLKVAKRVCDGANLV 409

Query: 1251 PDSDELACLVARSVSDEKRARIARAKEFSKAALDS 1355
            +S +LA + SVSDE + RAKE AALD+
Sbjct: 410  SNSAKLARTLMESVSDESQVEKERAKELRMAALDA 444

Score = 24.3 bits (51), Expect(2) = e-127
Identities = 11/20 (55%), Positives = 17/20 (85%)
Frame = +2

Query: 1457 AALDSIEKDGSSYEALDSLV 1516
            AALD+I++DGSS + L++ V
Sbjct: 440  AALDAIKEDGSSDKHLNAFV 459

>ref|XP_002887513.1| UDP-glucoronosyl/UDP-glucosyl transferase family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297333354|gb|EFH63772.1| UDP-glucoronosyl/UDP-glucosyl transferase family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473

Score = 444 bits (1143), Expect(2) = e-123
Identities = 239/455 (52%), Positives = 309/455 (67%), Gaps = 4/455 (0%)
Frame = +3

Query: 3    DLTHQLALRN---LTITILVTPKNLHYLNPLLTRHPESVHTLVLPFPANHSALLAGVENV 173
            D TH+LALR LTIT+LVTPKNL +L+PLL+ ++ TL+LPFP+ H ++ +GVENV
Sbjct: 31   DFTHRLALRGGAALTITVLVTPKNLPFLSPLLSA-VSNIETLILPFPS-HPSIPSGVENV 88

Query: 174  RDLPASAFRLMMMALTGLHDPIVDWFQNHPSPPVAIVSDMFLGWTHRLACKLKISRFVFS 353
            +DLP S F LM+ AL LH P++ W +HPSPPVAIVSD FLGWT+ L I RF FS
Sbjct: 89   QDLPPSGFPLMIHALGNLHAPLLSWITSHPSPPVAIVSDFFLGWTNNLG----IPRFDFS 144

Query: 354  PSGALALSFIYSLWRDMPKRNDPSDDNELIKFSEIPNCPSYPWWKIPLGFRSYVEGNTQS 533
            PS A+ + +LW +MP + + DDNE+++F +IPNCP YP+ +I +RSYV G+
Sbjct: 145  PSAAITCCILNTLWIEMPTKINEDDDNEILQFPKIPNCPKYPFNQISSLYRSYVHGDPAW 204

Query: 534  EAVRDSFMENIASHGLVFNSFTALEQVYLEYMQKFLGHVRMWAVGPLLPLEEERVGRGGS 713
            E +RDSF +N AS GLV NSFTA+E VYLE++++ +GH +WAVGP+LPL + RGG
Sbjct: 205  EFIRDSFRDNAASWGLVVNSFTAMEGVYLEHLKREMGHDCVWAVGPILPLSDG--NRGGP 262

Query: 714  TEIVADGIKSWLDQFEEKTVVYVCFGSQAVLTNKQMEMLALGLENSGVKFLWAYKDPTKG 893
            T + D + SWLD E+ VVYVCFGSQ VLT +Q LA GLE SGV F+WA K+P +G
Sbjct: 263  TSVSVDHVMSWLDAREDDHVVYVCFGSQTVLTKEQTLALASGLEKSGVHFIWAVKEPVEG 322

Query: 894  HEAGDYGMIPSGFKERVAGRGFIVKGWSPQVLILRHPAVGAFFTHCGWNSVLESIVAGVP 1073
            G I GF +RVAGRG +++GW+PQV +LRH AVGAF THCGWNSV+E++VAGV
Sbjct: 323  E--SPRGNILDGFDDRVAGRGLVIRGWAPQVAVLRHRAVGAFLTHCGWNSVIEAVVAGVL 380

Query: 1074 MLTWPMGADQFTNADL-LDELALGTRVCEGEETIPDQFTNADLLDELALGTRVCEGEETI 1250
            MLTWPM ADQ+T+A L +DEL +G R CEG PD T+
Sbjct: 381  MLTWPMRADQYTDASLVVDELKVGVRACEG----PD----------------------TV 414

Query: 1251 PDSDELACLVARSVSDEKRARIARAKEFSKAALDS 1355
            PD DELA + A SV+ ++ RI +A E KAALD+
Sbjct: 415  PDPDELARVFADSVTGKQTERI-KAVELRKAALDA 448

Score = 25.4 bits (54), Expect(2) = e-123
Identities = 11/21 (52%), Positives = 16/21 (76%)
Frame = +2

Query: 1454 KAALDSIEKDGSSYEALDSLV 1516
            KAALD+I++ GSS + LD +
Sbjct: 443  KAALDAIQERGSSVKDLDGFI 463

>gb|AAG52529.1|AC016662_23 putative glucosyltransferase; 88035-86003 [Arabidopsis thaliana]
Length = 570

Score = 436 bits (1120), Expect(2) = e-120
Identities = 235/455 (51%), Positives = 305/455 (67%), Gaps = 4/455 (0%)
Frame = +3

Query: 3    DLTHQLALRN---LTITILVTPKNLHYLNPLLTRHPESVHTLVLPFPANHSALLAGVENV 173
            D TH+LALR L IT+LVTPKNL +L+PLL+ ++ L+LPFP+ H ++ +GVENV
Sbjct: 31   DFTHRLALRGGAALKITVLVTPKNLPFLSPLLSA-VVNIEPLILPFPS-HPSIPSGVENV 88

Query: 174  RDLPASAFRLMMMALTGLHDPIVDWFQNHPSPPVAIVSDMFLGWTHRLACKLKISRFVFS 353
            +DLP S F LM+ AL LH P++ W +HPSPPVAIVSD FLGWT L I RF FS
Sbjct: 89   QDLPPSGFPLMIHALGNLHAPLISWITSHPSPPVAIVSDFFLGWTKNLG----IPRFDFS 144

Query: 354  PSGALALSFIYSLWRDMPKRNDPSDDNELIKFSEIPNCPSYPWWKIPLGFRSYVEGNTQS 533
            PS A+ + +LW +MP + + DDNE++ F +IPNCP Y + +I +RSYV G+
Sbjct: 145  PSAAITCCILNTLWIEMPTKINEDDDNEILHFPKIPNCPKYRFDQISSLYRSYVHGDPAW 204

Query: 534  EAVRDSFMENIASHGLVFNSFTALEQVYLEYMQKFLGHVRMWAVGPLLPLEEERVGRGGS 713
            E +RDSF +N+AS GLV NSFTA+E VYLE++++ +GH R+WAVGP++PL + RGG
Sbjct: 205  EFIRDSFRDNVASWGLVVNSFTAMEGVYLEHLKREMGHDRVWAVGPIIPLSGD--NRGGP 262

Query: 714  TEIVADGIKSWLDQFEEKTVVYVCFGSQAVLTNKQMEMLALGLENSGVKFLWAYKDPTKG 893
            T + D + SWLD E+ VVYVCFGSQ VLT +Q LA GLE SGV F+WA K+P +
Sbjct: 263  TSVSVDHVMSWLDAREDNHVVYVCFGSQVVLTKEQTLALASGLEKSGVHFIWAVKEPVE- 321

Query: 894  HEAGDYGMIPSGFKERVAGRGFIVKGWSPQVLILRHPAVGAFFTHCGWNSVLESIVAGVP 1073
            + G I GF +RVAGRG +++GW+PQV +LRH AVGAF THCGWNSV+E++VAGV
Sbjct: 322  -KDSTRGNILDGFDDRVAGRGLVIRGWAPQVAVLRHRAVGAFLTHCGWNSVVEAVVAGVL 380

Query: 1074 MLTWPMGADQFTNADL-LDELALGTRVCEGEETIPDQFTNADLLDELALGTRVCEGEETI 1250
            MLTWPM ADQ+T+A L +DEL +G R CEG PD T+
Sbjct: 381  MLTWPMRADQYTDASLVVDELKVGVRACEG----PD----------------------TV 414

Query: 1251 PDSDELACLVARSVSDEKRARIARAKEFSKAALDS 1355
            PD DELA + A SV+ + RI +A E KAALD+
Sbjct: 415  PDPDELARVFADSVTGNQTERI-KAVELRKAALDA 448

Score = 25.0 bits (53), Expect(2) = e-120
Identities = 11/21 (52%), Positives = 15/21 (71%)
Frame = +2

Query: 1454 KAALDSIEKDGSSYEALDSLV 1516
            KAALD+I++ GSS LD +
Sbjct: 443  KAALDAIQERGSSVNDLDGFI 463