BLASTX nr result

ID: Glycyrrhiza23_contig00005531 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00005531
         (1682 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003535992.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   786   0.0  
ref|XP_003591458.1| Heparan-alpha-glucosaminide N-acetyltransfer...   784   0.0  
ref|XP_002320987.1| predicted protein [Populus trichocarpa] gi|2...   646   0.0  
ref|XP_002515320.1| conserved hypothetical protein [Ricinus comm...   565   e-158
ref|NP_001054693.1| Os05g0155700 [Oryza sativa Japonica Group] g...   558   e-156

>ref|XP_003535992.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
            [Glycine max]
          Length = 489

 Score =  786 bits (2031), Expect = 0.0
 Identities = 398/488 (81%), Positives = 420/488 (86%), Gaps = 5/488 (1%)
 Frame = +1

Query: 58   DVAGEERRPLVVRDFDSSIIIAHQNEDTIF--PLPSESVHTDKSLPSLSVPNQRLASLDV 231
            +V  EERRPL+  D  SSI+I HQNEDTI   PLP +S  TD S  SLS+PNQRL+SLDV
Sbjct: 4    EVEEEERRPLIGHDLGSSILIVHQNEDTISICPLP-QSNPTDTS--SLSLPNQRLSSLDV 60

Query: 232  FRGLTVALMILVDNVGRAFPSLNHSPWFGVTLADFVMPFFLFGVGVSIGLVFKKVSSKPN 411
            FRGLTVALMILVDNVGRAFPSLNHSPWFGVTLADFVMPFFLF VGVSIGLVFKKVSSKPN
Sbjct: 61   FRGLTVALMILVDNVGRAFPSLNHSPWFGVTLADFVMPFFLFVVGVSIGLVFKKVSSKPN 120

Query: 412  ATKKIISRTIKXXXXXXXXXXXYFHGRGNLTYGVDLSKIRWLGVLQRISIGYFLASMSEI 591
            ATKK+ISRT+K           YFHG G LTYGVDLSKIRWLGVLQRISIGYF AS+SEI
Sbjct: 121  ATKKVISRTLKLFLLGLLLQGGYFHGHGKLTYGVDLSKIRWLGVLQRISIGYFFASISEI 180

Query: 592  WLVKNNILVDSPAAFVRKYSIQWMFSILLCSVYLCWLYGLYVPNWKFIHSNSLWSG---R 762
            WLV +NILVDSPA FVRKYSIQWMFSILLCSVYLC LYGLYVPNWKF HSN L S     
Sbjct: 181  WLVNHNILVDSPAGFVRKYSIQWMFSILLCSVYLCLLYGLYVPNWKFKHSNLLSSSDSSH 240

Query: 763  LSNIQNVHCEMRGSLDPPCNAVGFIDRLILGEHHMYQRPVYIRTKECSVNSPDYGPLPPD 942
            LS IQNVHCE+RGSL+PPCN VGFIDRLILGE HMYQRPVYIRTKECSVNSPDYGPLPPD
Sbjct: 241  LSIIQNVHCEVRGSLEPPCNVVGFIDRLILGEDHMYQRPVYIRTKECSVNSPDYGPLPPD 300

Query: 943  SPGWCLAPFDPEGILSSLMAAITCFMGLQFGHILVLFQGHKQRXXXXXXXXXXXXXIGYM 1122
            SPGWCLAPFDPEGILSSLMAAITCFMGLQ+GHI+V  QGHKQR             IGY+
Sbjct: 301  SPGWCLAPFDPEGILSSLMAAITCFMGLQYGHIIVHLQGHKQRVLLWSVFSFSLLLIGYI 360

Query: 1123 LEILGIPLSKALYTLSYMFITAGASGLVLTAIYYIVDVEHLRKPTVLLQWMGMNALIVYA 1302
            LEILG+PLSKALYTLSY  ITAGASGLVLTAIYYIVD+EHLRKPTVLLQWMGMNAL+VYA
Sbjct: 361  LEILGMPLSKALYTLSYTCITAGASGLVLTAIYYIVDIEHLRKPTVLLQWMGMNALVVYA 420

Query: 1303 LAACDIFPAATQGFYWRSPENNLVDASEALIQNVLHSKKWGSLAFVIVEILFWGLFAGFL 1482
            LAACDIFPA  QGFYW SPENNLVDASEAL+Q + HSK+WG+LAFVIVEILFWGLFAGFL
Sbjct: 421  LAACDIFPAVIQGFYWHSPENNLVDASEALMQIIFHSKRWGTLAFVIVEILFWGLFAGFL 480

Query: 1483 HKKGIYIK 1506
            HKK IYIK
Sbjct: 481  HKKRIYIK 488


>ref|XP_003591458.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago truncatula]
            gi|355480506|gb|AES61709.1| Heparan-alpha-glucosaminide
            N-acetyltransferase [Medicago truncatula]
          Length = 476

 Score =  784 bits (2025), Expect = 0.0
 Identities = 391/481 (81%), Positives = 414/481 (86%), Gaps = 2/481 (0%)
 Frame = +1

Query: 70   EERRPLVVRDFDSSIII--AHQNEDTIFPLPSESVHTDKSLPSLSVPNQRLASLDVFRGL 243
            EERRPL+    DSSI+    H+NE                LP +SVPNQRL SLDVFRGL
Sbjct: 15   EERRPLI----DSSILTLTVHENE----------------LPPVSVPNQRLVSLDVFRGL 54

Query: 244  TVALMILVDNVGRAFPSLNHSPWFGVTLADFVMPFFLFGVGVSIGLVFKKVSSKPNATKK 423
            TVALMILVD+VGRAFPSLNHSPWFGVTLADFVMPFFLFGVGVSI LVFKKVSSK NATKK
Sbjct: 55   TVALMILVDDVGRAFPSLNHSPWFGVTLADFVMPFFLFGVGVSIALVFKKVSSKQNATKK 114

Query: 424  IISRTIKXXXXXXXXXXXYFHGRGNLTYGVDLSKIRWLGVLQRISIGYFLASMSEIWLVK 603
            IISRTIK           YFHGRGNLTYG+DL+K+RW GVLQRISIGYFLASMSEIWLV 
Sbjct: 115  IISRTIKLFLLGLLLQGGYFHGRGNLTYGLDLTKLRWFGVLQRISIGYFLASMSEIWLVN 174

Query: 604  NNILVDSPAAFVRKYSIQWMFSILLCSVYLCWLYGLYVPNWKFIHSNSLWSGRLSNIQNV 783
             NILVDSPAAFVRKYSIQW+FSILLCSVYLC LYGLYVPNW+F HSN LW GR+S IQNV
Sbjct: 175  GNILVDSPAAFVRKYSIQWIFSILLCSVYLCLLYGLYVPNWEFEHSNLLWPGRVSTIQNV 234

Query: 784  HCEMRGSLDPPCNAVGFIDRLILGEHHMYQRPVYIRTKECSVNSPDYGPLPPDSPGWCLA 963
            HC+MRGSLDPPCNAVGFIDRLILGE HMYQRPVY RTKECSVNSPDYGPLPPDSPGWCLA
Sbjct: 235  HCDMRGSLDPPCNAVGFIDRLILGEDHMYQRPVYRRTKECSVNSPDYGPLPPDSPGWCLA 294

Query: 964  PFDPEGILSSLMAAITCFMGLQFGHILVLFQGHKQRXXXXXXXXXXXXXIGYMLEILGIP 1143
            PFDPEGILSSLMAAITCF+GLQFGHILV+FQ HKQR             +GY+LEILGIP
Sbjct: 295  PFDPEGILSSLMAAITCFVGLQFGHILVIFQAHKQRVLLWSVFSFSLLVVGYVLEILGIP 354

Query: 1144 LSKALYTLSYMFITAGASGLVLTAIYYIVDVEHLRKPTVLLQWMGMNALIVYALAACDIF 1323
            LSKALYTLS+MFITAGASGLVLTAIYYIVD++ LRKPTVLLQWMGMNALIVYALAACDIF
Sbjct: 355  LSKALYTLSFMFITAGASGLVLTAIYYIVDIKQLRKPTVLLQWMGMNALIVYALAACDIF 414

Query: 1324 PAATQGFYWRSPENNLVDASEALIQNVLHSKKWGSLAFVIVEILFWGLFAGFLHKKGIYI 1503
            PA  QGFYWRSPENNLVDASEALIQN+LHS+KWG+LAFVI+EILFWGL AGFLHKKGIYI
Sbjct: 415  PAVIQGFYWRSPENNLVDASEALIQNILHSEKWGTLAFVIIEILFWGLLAGFLHKKGIYI 474

Query: 1504 K 1506
            K
Sbjct: 475  K 475


>ref|XP_002320987.1| predicted protein [Populus trichocarpa] gi|222861760|gb|EEE99302.1|
            predicted protein [Populus trichocarpa]
          Length = 481

 Score =  646 bits (1666), Expect = 0.0
 Identities = 326/488 (66%), Positives = 379/488 (77%), Gaps = 1/488 (0%)
 Frame = +1

Query: 46   TQIDDVAGEERRPLVVRDFDSSIIIAHQNEDTIFPLPSESVHTDKSLPSLSVPNQRLASL 225
            T++D+    +R PL+     +   ++++ E+ I   PS S  ++ S P    P QRL SL
Sbjct: 7    TELDE---RQREPLL----HNPRSLSNEEEEEITNTPSTS-SSNASPP----PTQRLLSL 54

Query: 226  DVFRGLTVALMILVDNVGRAFPSLNHSPWFGVTLADFVMPFFLFGVGVSIGLVFKKVSSK 405
            DVFRGLTVALMILVD+ G AFP +NHSPWFGVTLADFVMPFFLF VGVSI LVFKKVSSK
Sbjct: 55   DVFRGLTVALMILVDDAGGAFPCINHSPWFGVTLADFVMPFFLFVVGVSISLVFKKVSSK 114

Query: 406  PNATKKIISRTIKXXXXXXXXXXXYFHGRGNLTYGVDLSKIRWLGVLQRISIGYFLASMS 585
            P ATKK+I RTIK           YFHGR NLTYGVD+ KIRW+GVLQRISIGY  A+MS
Sbjct: 115  PMATKKVIQRTIKLFLLGLLLQGGYFHGRHNLTYGVDVGKIRWMGVLQRISIGYLFAAMS 174

Query: 586  EIWLVKNNILVDSPAAFVRKYSIQWMFSILLCSVYLCWLYGLYVPNWKF-IHSNSLWSGR 762
            EIWLV ++I VDSP AFV+KY IQWM + L C+ Y+C LYGLYVP+W+F + S +L+   
Sbjct: 175  EIWLV-DSITVDSPMAFVKKYYIQWMVAFLFCTFYMCLLYGLYVPDWEFEVPSTNLFEHE 233

Query: 763  LSNIQNVHCEMRGSLDPPCNAVGFIDRLILGEHHMYQRPVYIRTKECSVNSPDYGPLPPD 942
                + V+C +RGSL+PPCNAVG IDR  LGEHH+YQ PVY RTK CSVNSPDYGPLPP+
Sbjct: 234  FGT-KIVNCGVRGSLEPPCNAVGLIDRFFLGEHHLYQHPVYRRTKHCSVNSPDYGPLPPN 292

Query: 943  SPGWCLAPFDPEGILSSLMAAITCFMGLQFGHILVLFQGHKQRXXXXXXXXXXXXXIGYM 1122
            SPGWCLAPFDPEGILSSLMAAITCF+GLQFGHILV F+GH QR              GY+
Sbjct: 293  SPGWCLAPFDPEGILSSLMAAITCFLGLQFGHILVHFKGHMQRLCLWSVCSFIILITGYV 352

Query: 1123 LEILGIPLSKALYTLSYMFITAGASGLVLTAIYYIVDVEHLRKPTVLLQWMGMNALIVYA 1302
             E+LG+PL K LYTLSYM ITAGASGL LT I+YIVDV+H RKPT++LQWMGMNALI+YA
Sbjct: 353  FELLGVPLCKPLYTLSYMCITAGASGLALTIIFYIVDVKHFRKPTMILQWMGMNALIIYA 412

Query: 1303 LAACDIFPAATQGFYWRSPENNLVDASEALIQNVLHSKKWGSLAFVIVEILFWGLFAGFL 1482
            LAACD+FPAA QGFYW SPENNLVD +E+L Q +LHSKKWG+L FVIVEILFWGL AGFL
Sbjct: 413  LAACDLFPAAIQGFYWGSPENNLVDDTESLFQVMLHSKKWGTLVFVIVEILFWGLVAGFL 472

Query: 1483 HKKGIYIK 1506
            H KGIY++
Sbjct: 473  HLKGIYVR 480


>ref|XP_002515320.1| conserved hypothetical protein [Ricinus communis]
            gi|223545800|gb|EEF47304.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 460

 Score =  565 bits (1456), Expect = e-158
 Identities = 282/444 (63%), Positives = 333/444 (75%)
 Frame = +1

Query: 61   VAGEERRPLVVRDFDSSIIIAHQNEDTIFPLPSESVHTDKSLPSLSVPNQRLASLDVFRG 240
            VA +E+R  ++  ++       + E+ I P  S S   +   P    PNQRL SLDVFRG
Sbjct: 7    VAEDEQRQSLLHHYNDE----DEKEEEIAPSSSSSDEREALPPP--TPNQRLMSLDVFRG 60

Query: 241  LTVALMILVDNVGRAFPSLNHSPWFGVTLADFVMPFFLFGVGVSIGLVFKKVSSKPNATK 420
            LT+ALMILVD+ G AFPS+NHSPWFGVTLADFVMPFFLFGVGVSI LVFKK+SSK  ATK
Sbjct: 61   LTIALMILVDDAGGAFPSINHSPWFGVTLADFVMPFFLFGVGVSISLVFKKISSKSVATK 120

Query: 421  KIISRTIKXXXXXXXXXXXYFHGRGNLTYGVDLSKIRWLGVLQRISIGYFLASMSEIWLV 600
            K++ RTIK           YFHGR +LTYG+D+ KIRWLGVLQRISIGY  AS+SEIWLV
Sbjct: 121  KVMLRTIKLFLLGVLLQGGYFHGRNHLTYGIDVLKIRWLGVLQRISIGYLFASISEIWLV 180

Query: 601  KNNILVDSPAAFVRKYSIQWMFSILLCSVYLCWLYGLYVPNWKFIHSNSLWSGRLSNIQN 780
             N+ +VDSP AF++KY  QWM S++LCS+Y C LY L+VPNW+F  S+    G  S  Q 
Sbjct: 181  -NHCIVDSPLAFMKKYYAQWMVSLILCSLYTCLLYFLFVPNWEFEASSINLFGYGSGTQT 239

Query: 781  VHCEMRGSLDPPCNAVGFIDRLILGEHHMYQRPVYIRTKECSVNSPDYGPLPPDSPGWCL 960
            V C +RGSL+PPCNAVG IDR +LGEHH+YQRPVY RTK+CSVNSPDYGPLPP+SP WCL
Sbjct: 240  VICGVRGSLEPPCNAVGLIDRFLLGEHHLYQRPVYRRTKQCSVNSPDYGPLPPNSPPWCL 299

Query: 961  APFDPEGILSSLMAAITCFMGLQFGHILVLFQGHKQRXXXXXXXXXXXXXIGYMLEILGI 1140
            APFDPEGILSSLMAA+TC +GLQFGH+LV  + H QR              G++L+++GI
Sbjct: 300  APFDPEGILSSLMAAVTCLLGLQFGHVLVHLKDHMQRILVWLISSFSLLVTGFVLKLIGI 359

Query: 1141 PLSKALYTLSYMFITAGASGLVLTAIYYIVDVEHLRKPTVLLQWMGMNALIVYALAACDI 1320
            P SK LYTLSY  IT GASGL+LT I+Y VDV+H RK   +LQWMGMNALI+YALAACD+
Sbjct: 360  PFSKPLYTLSYTCITTGASGLLLTIIFYAVDVKHFRKAIAILQWMGMNALIIYALAACDL 419

Query: 1321 FPAATQGFYWRSPENNLVDASEAL 1392
            FPAA QGFYW+SPENNLV  S  L
Sbjct: 420  FPAALQGFYWQSPENNLVRPSSRL 443


>ref|NP_001054693.1| Os05g0155700 [Oryza sativa Japonica Group] gi|54291854|gb|AAV32222.1|
            unknown protein [Oryza sativa Japonica Group]
            gi|113578244|dbj|BAF16607.1| Os05g0155700 [Oryza sativa
            Japonica Group] gi|215694847|dbj|BAG90038.1| unnamed
            protein product [Oryza sativa Japonica Group]
            gi|218196128|gb|EEC78555.1| hypothetical protein
            OsI_18526 [Oryza sativa Indica Group]
            gi|222630256|gb|EEE62388.1| hypothetical protein
            OsJ_17178 [Oryza sativa Japonica Group]
          Length = 491

 Score =  558 bits (1438), Expect = e-156
 Identities = 277/489 (56%), Positives = 351/489 (71%), Gaps = 6/489 (1%)
 Frame = +1

Query: 58   DVAGEERRPLVVRDFDSSIIIAHQNEDTIFPLPSESVHTDKSLPSLSVPNQRLASLDVFR 237
            D     RRPL+    D         +D I P P+ S        +   P +R+ASLDVFR
Sbjct: 13   DADAGHRRPLLASADD---------DDEIRPYPASSPSPQHPAGAERKP-RRVASLDVFR 62

Query: 238  GLTVALMILVDNVGRAFPSLNHSPWFGVTLADFVMPFFLFGVGVSIGLVFKKVSSKPNAT 417
            GLTVA+MILVD+ G A+P +NHSPW GVT+ADFVMP FLF +GVS  LVFKK  +K  AT
Sbjct: 63   GLTVAMMILVDDAGGAWPGMNHSPWLGVTVADFVMPAFLFIIGVSAALVFKKTPNKTVAT 122

Query: 418  KKIISRTIKXXXXXXXXXXXYFHGRGNLTYGVDLSKIRWLGVLQRISIGYFLASMSEIWL 597
            KK   R IK           Y HGR NLTYG+DL  IRWLGVLQRI+IGYFLA++SEIWL
Sbjct: 123  KKAAIRAIKLFILGVILQGGYIHGRHNLTYGIDLDHIRWLGVLQRIAIGYFLAAISEIWL 182

Query: 598  VKNNILVDSPAAFVRKYSIQWMFSILLCSVYLCWLYGLYVPNWKFI--HSNSLWS----G 759
            V NNI VDS  +FV+KY ++W+ ++++ ++Y+  L GLYV NW+F    SNS+ +    G
Sbjct: 183  V-NNISVDSAISFVKKYFMEWIVAVMISALYVGLLLGLYVSNWEFKVQTSNSILTIPTPG 241

Query: 760  RLSNIQNVHCEMRGSLDPPCNAVGFIDRLILGEHHMYQRPVYIRTKECSVNSPDYGPLPP 939
                ++ + C +RGSL PPCNAVGF+DR++LGE+H+Y+ PVY RTKECSVNSPDYGPLPP
Sbjct: 242  NEIGMKMIQCGVRGSLGPPCNAVGFVDRVLLGENHLYKNPVYKRTKECSVNSPDYGPLPP 301

Query: 940  DSPGWCLAPFDPEGILSSLMAAITCFMGLQFGHILVLFQGHKQRXXXXXXXXXXXXXIGY 1119
            ++P WCLAPFDPEG+LS+LMAA+TCF+GL FGH+LV  + H  R              G+
Sbjct: 302  NAPDWCLAPFDPEGLLSTLMAAVTCFVGLHFGHVLVHCKDHSPRMLLWLLASTVLTVSGF 361

Query: 1120 MLEILGIPLSKALYTLSYMFITAGASGLVLTAIYYIVDVEHLRKPTVLLQWMGMNALIVY 1299
            +L++LG+P SK LYT+SYM +T G SG +L  +YYIVDV +++KP +L QWMGMNALIVY
Sbjct: 362  LLQLLGMPFSKPLYTVSYMLLTGGVSGFLLLLLYYIVDVINIKKPFILFQWMGMNALIVY 421

Query: 1300 ALAACDIFPAATQGFYWRSPENNLVDASEALIQNVLHSKKWGSLAFVIVEILFWGLFAGF 1479
             LAAC+IFP   QGFYWRSPENNLVD +E+L+Q + HSK+WG+LAFV++EI+FW L A F
Sbjct: 422  VLAACEIFPTLVQGFYWRSPENNLVDLTESLLQTIFHSKRWGTLAFVVLEIIFWCLAACF 481

Query: 1480 LHKKGIYIK 1506
            LH KGIY+K
Sbjct: 482  LHMKGIYLK 490


Top