BLASTX nr result
ID: Glycyrrhiza23_contig00005531
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza23_contig00005531 (1682 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003535992.1| PREDICTED: heparan-alpha-glucosaminide N-ace... 786 0.0 ref|XP_003591458.1| Heparan-alpha-glucosaminide N-acetyltransfer... 784 0.0 ref|XP_002320987.1| predicted protein [Populus trichocarpa] gi|2... 646 0.0 ref|XP_002515320.1| conserved hypothetical protein [Ricinus comm... 565 e-158 ref|NP_001054693.1| Os05g0155700 [Oryza sativa Japonica Group] g... 558 e-156 >ref|XP_003535992.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Glycine max] Length = 489 Score = 786 bits (2031), Expect = 0.0 Identities = 398/488 (81%), Positives = 420/488 (86%), Gaps = 5/488 (1%) Frame = +1 Query: 58 DVAGEERRPLVVRDFDSSIIIAHQNEDTIF--PLPSESVHTDKSLPSLSVPNQRLASLDV 231 +V EERRPL+ D SSI+I HQNEDTI PLP +S TD S SLS+PNQRL+SLDV Sbjct: 4 EVEEEERRPLIGHDLGSSILIVHQNEDTISICPLP-QSNPTDTS--SLSLPNQRLSSLDV 60 Query: 232 FRGLTVALMILVDNVGRAFPSLNHSPWFGVTLADFVMPFFLFGVGVSIGLVFKKVSSKPN 411 FRGLTVALMILVDNVGRAFPSLNHSPWFGVTLADFVMPFFLF VGVSIGLVFKKVSSKPN Sbjct: 61 FRGLTVALMILVDNVGRAFPSLNHSPWFGVTLADFVMPFFLFVVGVSIGLVFKKVSSKPN 120 Query: 412 ATKKIISRTIKXXXXXXXXXXXYFHGRGNLTYGVDLSKIRWLGVLQRISIGYFLASMSEI 591 ATKK+ISRT+K YFHG G LTYGVDLSKIRWLGVLQRISIGYF AS+SEI Sbjct: 121 ATKKVISRTLKLFLLGLLLQGGYFHGHGKLTYGVDLSKIRWLGVLQRISIGYFFASISEI 180 Query: 592 WLVKNNILVDSPAAFVRKYSIQWMFSILLCSVYLCWLYGLYVPNWKFIHSNSLWSG---R 762 WLV +NILVDSPA FVRKYSIQWMFSILLCSVYLC LYGLYVPNWKF HSN L S Sbjct: 181 WLVNHNILVDSPAGFVRKYSIQWMFSILLCSVYLCLLYGLYVPNWKFKHSNLLSSSDSSH 240 Query: 763 
LSNIQNVHCEMRGSLDPPCNAVGFIDRLILGEHHMYQRPVYIRTKECSVNSPDYGPLPPD 942 LS IQNVHCE+RGSL+PPCN VGFIDRLILGE HMYQRPVYIRTKECSVNSPDYGPLPPD Sbjct: 241 LSIIQNVHCEVRGSLEPPCNVVGFIDRLILGEDHMYQRPVYIRTKECSVNSPDYGPLPPD 300 Query: 943 SPGWCLAPFDPEGILSSLMAAITCFMGLQFGHILVLFQGHKQRXXXXXXXXXXXXXIGYM 1122 SPGWCLAPFDPEGILSSLMAAITCFMGLQ+GHI+V QGHKQR IGY+ Sbjct: 301 SPGWCLAPFDPEGILSSLMAAITCFMGLQYGHIIVHLQGHKQRVLLWSVFSFSLLLIGYI 360 Query: 1123 LEILGIPLSKALYTLSYMFITAGASGLVLTAIYYIVDVEHLRKPTVLLQWMGMNALIVYA 1302 LEILG+PLSKALYTLSY ITAGASGLVLTAIYYIVD+EHLRKPTVLLQWMGMNAL+VYA Sbjct: 361 LEILGMPLSKALYTLSYTCITAGASGLVLTAIYYIVDIEHLRKPTVLLQWMGMNALVVYA 420 Query: 1303 LAACDIFPAATQGFYWRSPENNLVDASEALIQNVLHSKKWGSLAFVIVEILFWGLFAGFL 1482 LAACDIFPA QGFYW SPENNLVDASEAL+Q + HSK+WG+LAFVIVEILFWGLFAGFL Sbjct: 421 LAACDIFPAVIQGFYWHSPENNLVDASEALMQIIFHSKRWGTLAFVIVEILFWGLFAGFL 480 Query: 1483 HKKGIYIK 1506 HKK IYIK Sbjct: 481 HKKRIYIK 488 >ref|XP_003591458.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago truncatula] gi|355480506|gb|AES61709.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago truncatula] Length = 476 Score = 784 bits (2025), Expect = 0.0 Identities = 391/481 (81%), Positives = 414/481 (86%), Gaps = 2/481 (0%) Frame = +1 Query: 70 EERRPLVVRDFDSSIII--AHQNEDTIFPLPSESVHTDKSLPSLSVPNQRLASLDVFRGL 243 EERRPL+ DSSI+ H+NE LP +SVPNQRL SLDVFRGL Sbjct: 15 EERRPLI----DSSILTLTVHENE----------------LPPVSVPNQRLVSLDVFRGL 54 Query: 244 TVALMILVDNVGRAFPSLNHSPWFGVTLADFVMPFFLFGVGVSIGLVFKKVSSKPNATKK 423 TVALMILVD+VGRAFPSLNHSPWFGVTLADFVMPFFLFGVGVSI LVFKKVSSK NATKK Sbjct: 55 TVALMILVDDVGRAFPSLNHSPWFGVTLADFVMPFFLFGVGVSIALVFKKVSSKQNATKK 114 Query: 424 IISRTIKXXXXXXXXXXXYFHGRGNLTYGVDLSKIRWLGVLQRISIGYFLASMSEIWLVK 603 IISRTIK YFHGRGNLTYG+DL+K+RW GVLQRISIGYFLASMSEIWLV Sbjct: 115 IISRTIKLFLLGLLLQGGYFHGRGNLTYGLDLTKLRWFGVLQRISIGYFLASMSEIWLVN 174 Query: 604 NNILVDSPAAFVRKYSIQWMFSILLCSVYLCWLYGLYVPNWKFIHSNSLWSGRLSNIQNV 783 NILVDSPAAFVRKYSIQW+FSILLCSVYLC LYGLYVPNW+F HSN LW GR+S IQNV Sbjct: 175 
GNILVDSPAAFVRKYSIQWIFSILLCSVYLCLLYGLYVPNWEFEHSNLLWPGRVSTIQNV 234 Query: 784 HCEMRGSLDPPCNAVGFIDRLILGEHHMYQRPVYIRTKECSVNSPDYGPLPPDSPGWCLA 963 HC+MRGSLDPPCNAVGFIDRLILGE HMYQRPVY RTKECSVNSPDYGPLPPDSPGWCLA Sbjct: 235 HCDMRGSLDPPCNAVGFIDRLILGEDHMYQRPVYRRTKECSVNSPDYGPLPPDSPGWCLA 294 Query: 964 PFDPEGILSSLMAAITCFMGLQFGHILVLFQGHKQRXXXXXXXXXXXXXIGYMLEILGIP 1143 PFDPEGILSSLMAAITCF+GLQFGHILV+FQ HKQR +GY+LEILGIP Sbjct: 295 PFDPEGILSSLMAAITCFVGLQFGHILVIFQAHKQRVLLWSVFSFSLLVVGYVLEILGIP 354 Query: 1144 LSKALYTLSYMFITAGASGLVLTAIYYIVDVEHLRKPTVLLQWMGMNALIVYALAACDIF 1323 LSKALYTLS+MFITAGASGLVLTAIYYIVD++ LRKPTVLLQWMGMNALIVYALAACDIF Sbjct: 355 LSKALYTLSFMFITAGASGLVLTAIYYIVDIKQLRKPTVLLQWMGMNALIVYALAACDIF 414 Query: 1324 PAATQGFYWRSPENNLVDASEALIQNVLHSKKWGSLAFVIVEILFWGLFAGFLHKKGIYI 1503 PA QGFYWRSPENNLVDASEALIQN+LHS+KWG+LAFVI+EILFWGL AGFLHKKGIYI Sbjct: 415 PAVIQGFYWRSPENNLVDASEALIQNILHSEKWGTLAFVIIEILFWGLLAGFLHKKGIYI 474 Query: 1504 K 1506 K Sbjct: 475 K 475 >ref|XP_002320987.1| predicted protein [Populus trichocarpa] gi|222861760|gb|EEE99302.1| predicted protein [Populus trichocarpa] Length = 481 Score = 646 bits (1666), Expect = 0.0 Identities = 326/488 (66%), Positives = 379/488 (77%), Gaps = 1/488 (0%) Frame = +1 Query: 46 TQIDDVAGEERRPLVVRDFDSSIIIAHQNEDTIFPLPSESVHTDKSLPSLSVPNQRLASL 225 T++D+ +R PL+ + ++++ E+ I PS S ++ S P P QRL SL Sbjct: 7 TELDE---RQREPLL----HNPRSLSNEEEEEITNTPSTS-SSNASPP----PTQRLLSL 54 Query: 226 DVFRGLTVALMILVDNVGRAFPSLNHSPWFGVTLADFVMPFFLFGVGVSIGLVFKKVSSK 405 DVFRGLTVALMILVD+ G AFP +NHSPWFGVTLADFVMPFFLF VGVSI LVFKKVSSK Sbjct: 55 DVFRGLTVALMILVDDAGGAFPCINHSPWFGVTLADFVMPFFLFVVGVSISLVFKKVSSK 114 Query: 406 PNATKKIISRTIKXXXXXXXXXXXYFHGRGNLTYGVDLSKIRWLGVLQRISIGYFLASMS 585 P ATKK+I RTIK YFHGR NLTYGVD+ KIRW+GVLQRISIGY A+MS Sbjct: 115 PMATKKVIQRTIKLFLLGLLLQGGYFHGRHNLTYGVDVGKIRWMGVLQRISIGYLFAAMS 174 Query: 586 EIWLVKNNILVDSPAAFVRKYSIQWMFSILLCSVYLCWLYGLYVPNWKF-IHSNSLWSGR 762 EIWLV ++I VDSP AFV+KY IQWM + L C+ Y+C LYGLYVP+W+F + S +L+ Sbjct: 175 
EIWLV-DSITVDSPMAFVKKYYIQWMVAFLFCTFYMCLLYGLYVPDWEFEVPSTNLFEHE 233 Query: 763 LSNIQNVHCEMRGSLDPPCNAVGFIDRLILGEHHMYQRPVYIRTKECSVNSPDYGPLPPD 942 + V+C +RGSL+PPCNAVG IDR LGEHH+YQ PVY RTK CSVNSPDYGPLPP+ Sbjct: 234 FGT-KIVNCGVRGSLEPPCNAVGLIDRFFLGEHHLYQHPVYRRTKHCSVNSPDYGPLPPN 292 Query: 943 SPGWCLAPFDPEGILSSLMAAITCFMGLQFGHILVLFQGHKQRXXXXXXXXXXXXXIGYM 1122 SPGWCLAPFDPEGILSSLMAAITCF+GLQFGHILV F+GH QR GY+ Sbjct: 293 SPGWCLAPFDPEGILSSLMAAITCFLGLQFGHILVHFKGHMQRLCLWSVCSFIILITGYV 352 Query: 1123 LEILGIPLSKALYTLSYMFITAGASGLVLTAIYYIVDVEHLRKPTVLLQWMGMNALIVYA 1302 E+LG+PL K LYTLSYM ITAGASGL LT I+YIVDV+H RKPT++LQWMGMNALI+YA Sbjct: 353 FELLGVPLCKPLYTLSYMCITAGASGLALTIIFYIVDVKHFRKPTMILQWMGMNALIIYA 412 Query: 1303 LAACDIFPAATQGFYWRSPENNLVDASEALIQNVLHSKKWGSLAFVIVEILFWGLFAGFL 1482 LAACD+FPAA QGFYW SPENNLVD +E+L Q +LHSKKWG+L FVIVEILFWGL AGFL Sbjct: 413 LAACDLFPAAIQGFYWGSPENNLVDDTESLFQVMLHSKKWGTLVFVIVEILFWGLVAGFL 472 Query: 1483 HKKGIYIK 1506 H KGIY++ Sbjct: 473 HLKGIYVR 480 >ref|XP_002515320.1| conserved hypothetical protein [Ricinus communis] gi|223545800|gb|EEF47304.1| conserved hypothetical protein [Ricinus communis] Length = 460 Score = 565 bits (1456), Expect = e-158 Identities = 282/444 (63%), Positives = 333/444 (75%) Frame = +1 Query: 61 VAGEERRPLVVRDFDSSIIIAHQNEDTIFPLPSESVHTDKSLPSLSVPNQRLASLDVFRG 240 VA +E+R ++ ++ + E+ I P S S + P PNQRL SLDVFRG Sbjct: 7 VAEDEQRQSLLHHYNDE----DEKEEEIAPSSSSSDEREALPPP--TPNQRLMSLDVFRG 60 Query: 241 LTVALMILVDNVGRAFPSLNHSPWFGVTLADFVMPFFLFGVGVSIGLVFKKVSSKPNATK 420 LT+ALMILVD+ G AFPS+NHSPWFGVTLADFVMPFFLFGVGVSI LVFKK+SSK ATK Sbjct: 61 LTIALMILVDDAGGAFPSINHSPWFGVTLADFVMPFFLFGVGVSISLVFKKISSKSVATK 120 Query: 421 KIISRTIKXXXXXXXXXXXYFHGRGNLTYGVDLSKIRWLGVLQRISIGYFLASMSEIWLV 600 K++ RTIK YFHGR +LTYG+D+ KIRWLGVLQRISIGY AS+SEIWLV Sbjct: 121 KVMLRTIKLFLLGVLLQGGYFHGRNHLTYGIDVLKIRWLGVLQRISIGYLFASISEIWLV 180 Query: 601 KNNILVDSPAAFVRKYSIQWMFSILLCSVYLCWLYGLYVPNWKFIHSNSLWSGRLSNIQN 780 N+ +VDSP AF++KY QWM S++LCS+Y C LY L+VPNW+F S+ G S Q Sbjct: 181 
-NHCIVDSPLAFMKKYYAQWMVSLILCSLYTCLLYFLFVPNWEFEASSINLFGYGSGTQT 239 Query: 781 VHCEMRGSLDPPCNAVGFIDRLILGEHHMYQRPVYIRTKECSVNSPDYGPLPPDSPGWCL 960 V C +RGSL+PPCNAVG IDR +LGEHH+YQRPVY RTK+CSVNSPDYGPLPP+SP WCL Sbjct: 240 VICGVRGSLEPPCNAVGLIDRFLLGEHHLYQRPVYRRTKQCSVNSPDYGPLPPNSPPWCL 299 Query: 961 APFDPEGILSSLMAAITCFMGLQFGHILVLFQGHKQRXXXXXXXXXXXXXIGYMLEILGI 1140 APFDPEGILSSLMAA+TC +GLQFGH+LV + H QR G++L+++GI Sbjct: 300 APFDPEGILSSLMAAVTCLLGLQFGHVLVHLKDHMQRILVWLISSFSLLVTGFVLKLIGI 359 Query: 1141 PLSKALYTLSYMFITAGASGLVLTAIYYIVDVEHLRKPTVLLQWMGMNALIVYALAACDI 1320 P SK LYTLSY IT GASGL+LT I+Y VDV+H RK +LQWMGMNALI+YALAACD+ Sbjct: 360 PFSKPLYTLSYTCITTGASGLLLTIIFYAVDVKHFRKAIAILQWMGMNALIIYALAACDL 419 Query: 1321 FPAATQGFYWRSPENNLVDASEAL 1392 FPAA QGFYW+SPENNLV S L Sbjct: 420 FPAALQGFYWQSPENNLVRPSSRL 443 >ref|NP_001054693.1| Os05g0155700 [Oryza sativa Japonica Group] gi|54291854|gb|AAV32222.1| unknown protein [Oryza sativa Japonica Group] gi|113578244|dbj|BAF16607.1| Os05g0155700 [Oryza sativa Japonica Group] gi|215694847|dbj|BAG90038.1| unnamed protein product [Oryza sativa Japonica Group] gi|218196128|gb|EEC78555.1| hypothetical protein OsI_18526 [Oryza sativa Indica Group] gi|222630256|gb|EEE62388.1| hypothetical protein OsJ_17178 [Oryza sativa Japonica Group] Length = 491 Score = 558 bits (1438), Expect = e-156 Identities = 277/489 (56%), Positives = 351/489 (71%), Gaps = 6/489 (1%) Frame = +1 Query: 58 DVAGEERRPLVVRDFDSSIIIAHQNEDTIFPLPSESVHTDKSLPSLSVPNQRLASLDVFR 237 D RRPL+ D +D I P P+ S + P +R+ASLDVFR Sbjct: 13 DADAGHRRPLLASADD---------DDEIRPYPASSPSPQHPAGAERKP-RRVASLDVFR 62 Query: 238 GLTVALMILVDNVGRAFPSLNHSPWFGVTLADFVMPFFLFGVGVSIGLVFKKVSSKPNAT 417 GLTVA+MILVD+ G A+P +NHSPW GVT+ADFVMP FLF +GVS LVFKK +K AT Sbjct: 63 GLTVAMMILVDDAGGAWPGMNHSPWLGVTVADFVMPAFLFIIGVSAALVFKKTPNKTVAT 122 Query: 418 KKIISRTIKXXXXXXXXXXXYFHGRGNLTYGVDLSKIRWLGVLQRISIGYFLASMSEIWL 597 KK R IK Y HGR NLTYG+DL IRWLGVLQRI+IGYFLA++SEIWL Sbjct: 123 KKAAIRAIKLFILGVILQGGYIHGRHNLTYGIDLDHIRWLGVLQRIAIGYFLAAISEIWL 182 
Query: 598 VKNNILVDSPAAFVRKYSIQWMFSILLCSVYLCWLYGLYVPNWKFI--HSNSLWS----G 759 V NNI VDS +FV+KY ++W+ ++++ ++Y+ L GLYV NW+F SNS+ + G Sbjct: 183 V-NNISVDSAISFVKKYFMEWIVAVMISALYVGLLLGLYVSNWEFKVQTSNSILTIPTPG 241 Query: 760 RLSNIQNVHCEMRGSLDPPCNAVGFIDRLILGEHHMYQRPVYIRTKECSVNSPDYGPLPP 939 ++ + C +RGSL PPCNAVGF+DR++LGE+H+Y+ PVY RTKECSVNSPDYGPLPP Sbjct: 242 NEIGMKMIQCGVRGSLGPPCNAVGFVDRVLLGENHLYKNPVYKRTKECSVNSPDYGPLPP 301 Query: 940 DSPGWCLAPFDPEGILSSLMAAITCFMGLQFGHILVLFQGHKQRXXXXXXXXXXXXXIGY 1119 ++P WCLAPFDPEG+LS+LMAA+TCF+GL FGH+LV + H R G+ Sbjct: 302 NAPDWCLAPFDPEGLLSTLMAAVTCFVGLHFGHVLVHCKDHSPRMLLWLLASTVLTVSGF 361 Query: 1120 MLEILGIPLSKALYTLSYMFITAGASGLVLTAIYYIVDVEHLRKPTVLLQWMGMNALIVY 1299 +L++LG+P SK LYT+SYM +T G SG +L +YYIVDV +++KP +L QWMGMNALIVY Sbjct: 362 LLQLLGMPFSKPLYTVSYMLLTGGVSGFLLLLLYYIVDVINIKKPFILFQWMGMNALIVY 421 Query: 1300 ALAACDIFPAATQGFYWRSPENNLVDASEALIQNVLHSKKWGSLAFVIVEILFWGLFAGF 1479 LAAC+IFP QGFYWRSPENNLVD +E+L+Q + HSK+WG+LAFV++EI+FW L A F Sbjct: 422 VLAACEIFPTLVQGFYWRSPENNLVDLTESLLQTIFHSKRWGTLAFVVLEIIFWCLAACF 481 Query: 1480 LHKKGIYIK 1506 LH KGIY+K Sbjct: 482 LHMKGIYLK 490