BLASTX nr result

ID: Atractylodes22_contig00047181 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes22_contig00047181
         (1352 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003535489.1| PREDICTED: uncharacterized protein LOC100779...   437   e-120
ref|XP_003555467.1| PREDICTED: uncharacterized protein LOC100790...   437   e-120
ref|NP_001190226.1| UDP-glycosyltransferase family protein [Arab...   436   e-120
emb|CAB85554.1| putative protein [Arabidopsis thaliana]               436   e-120
ref|NP_568137.1| UDP-glycosyltransferase family protein [Arabido...   436   e-120

>ref|XP_003535489.1| PREDICTED: uncharacterized protein LOC100779157 [Glycine max]
          Length = 1044

 Score =  437 bits (1125), Expect = e-120
 Identities = 200/300 (66%), Positives = 239/300 (79%)
 Frame = +2

Query: 2    RDILLEMGGMFSIANRVDNVHKRPWIGFQSWRAAARKVSLSSKAESILEGIVYQKHRGDV 181
            RDIL EMGGMF+IANRVDN+H+RPWIGFQSWRAA RKV+LS+KAE +LE  + +  RGDV
Sbjct: 744  RDILCEMGGMFAIANRVDNIHRRPWIGFQSWRAAGRKVALSAKAEKVLEETMQENFRGDV 803

Query: 182  IYFWARADMDGELAGSNHVLTFWSMCDILNAGNCRTAFQDTFRRMYLLPSYVEALPPMPE 361
            IYFW R DMD  + G+++  +FW MCDILN GNCR  FQ+ FR+MY LP + EALPPMPE
Sbjct: 804  IYFWGRFDMDQSVIGNHNANSFWYMCDILNGGNCRIVFQEGFRQMYALPPHAEALPPMPE 863

Query: 362  DGGHWSSLHSWVMATPSFLEFMMFSRMFADSLDSLHINASTATECLLGSSASEKQHCYCR 541
            DG +WS+LHSWVM TPSFLEF+MFSRMF DS+D+LH +++  + CLLGSS  EK+HCYCR
Sbjct: 864  DG-YWSALHSWVMPTPSFLEFIMFSRMFVDSIDALHRDSTKYSLCLLGSSEIEKKHCYCR 922

Query: 542  ILELLVNVWAYHSARTMVYVDPNSGSLEEQHPVEQRKGFMWTKYFNATLLKSMXXXXXXX 721
            +LELL+NVWAYHSAR MVY++PN+GS+EEQHP+EQRKGFMW KYFN +LLKSM       
Sbjct: 923  VLELLINVWAYHSARKMVYINPNTGSMEEQHPIEQRKGFMWAKYFNISLLKSMDEDLAEA 982

Query: 722  XXXXXHPYEMWLWPRTGEVHWQGIXXXXXXXXXXIKMDKKRKQKEKILERLKFGYKQKTL 901
                 HP EMWLWP TGEVHWQGI          +KMDKKRK KEK+ ER+K+GYKQK+L
Sbjct: 983  ADDGDHPREMWLWPMTGEVHWQGIYEREREERYRLKMDKKRKTKEKLFERMKYGYKQKSL 1042


>ref|XP_003555467.1| PREDICTED: uncharacterized protein LOC100790929 [Glycine max]
          Length = 1045

 Score =  437 bits (1123), Expect = e-120
 Identities = 198/300 (66%), Positives = 239/300 (79%)
 Frame = +2

Query: 2    RDILLEMGGMFSIANRVDNVHKRPWIGFQSWRAAARKVSLSSKAESILEGIVYQKHRGDV 181
            RDIL EMGGMF+IANRVD++H+RPWIGFQSWRAA RKV+LS+KAE++LE  + +  RGDV
Sbjct: 744  RDILCEMGGMFAIANRVDSIHRRPWIGFQSWRAAGRKVALSAKAENVLEETMQENFRGDV 803

Query: 182  IYFWARADMDGELAGSNHVLTFWSMCDILNAGNCRTAFQDTFRRMYLLPSYVEALPPMPE 361
            IYFW R DMD     +++ ++FW MCDILN GNCR  FQD FR+MY LP + EALPPMPE
Sbjct: 804  IYFWGRLDMDQSAIRNHNAISFWYMCDILNGGNCRIVFQDGFRQMYALPPHAEALPPMPE 863

Query: 362  DGGHWSSLHSWVMATPSFLEFMMFSRMFADSLDSLHINASTATECLLGSSASEKQHCYCR 541
            DGG+WS+LHSWVM T SFLEF+MFSRMF DS+D+ H +++  + CLLGSS  EK+HCYCR
Sbjct: 864  DGGYWSALHSWVMPTSSFLEFIMFSRMFVDSIDAKHRDSTKYSLCLLGSSEIEKKHCYCR 923

Query: 542  ILELLVNVWAYHSARTMVYVDPNSGSLEEQHPVEQRKGFMWTKYFNATLLKSMXXXXXXX 721
            +LELL+NVWAYHSAR MVY++PN+GS+EEQHP+EQRKGFMW+KYFN +LLKSM       
Sbjct: 924  MLELLINVWAYHSARKMVYINPNTGSMEEQHPIEQRKGFMWSKYFNFSLLKSMDEDLAEA 983

Query: 722  XXXXXHPYEMWLWPRTGEVHWQGIXXXXXXXXXXIKMDKKRKQKEKILERLKFGYKQKTL 901
                 HP EMWLWP TGEVHWQGI          +KMDKKRK KEK+ ER+K+GYKQK+L
Sbjct: 984  ADDGDHPREMWLWPMTGEVHWQGIYEREREERYRLKMDKKRKTKEKLFERMKYGYKQKSL 1043


>ref|NP_001190226.1| UDP-glycosyltransferase family protein [Arabidopsis thaliana]
            gi|332003368|gb|AED90751.1| UDP-glycosyltransferase
            family protein [Arabidopsis thaliana]
          Length = 1035

 Score =  436 bits (1122), Expect = e-120
 Identities = 198/302 (65%), Positives = 235/302 (77%)
 Frame = +2

Query: 2    RDILLEMGGMFSIANRVDNVHKRPWIGFQSWRAAARKVSLSSKAESILEGIVYQKHRGDV 181
            RDIL E+GGMFS+AN+VD++H RPWIGFQSWRAA RKVSLSSKAE  LE I+ Q+ +G++
Sbjct: 734  RDILCEIGGMFSVANKVDSIHMRPWIGFQSWRAAGRKVSLSSKAEESLENIIKQETKGEI 793

Query: 182  IYFWARADMDGELAGSNHVLTFWSMCDILNAGNCRTAFQDTFRRMYLLPSYVEALPPMPE 361
            IYFW R D+DG+  GS + LTFWSMCDILN GNCRT F+D FR MY LP ++EALPPMPE
Sbjct: 794  IYFWTRLDIDGDAYGSKNALTFWSMCDILNQGNCRTTFEDAFRHMYGLPEHIEALPPMPE 853

Query: 362  DGGHWSSLHSWVMATPSFLEFMMFSRMFADSLDSLHINASTATECLLGSSASEKQHCYCR 541
            DG HWSSLH+WVM TPSFLEF+MFSRMF++SLD+LH N + +  C L SS  E++HCYCR
Sbjct: 854  DGHHWSSLHNWVMPTPSFLEFVMFSRMFSESLDALHNNLNDSKSCSLASSLLERKHCYCR 913

Query: 542  ILELLVNVWAYHSARTMVYVDPNSGSLEEQHPVEQRKGFMWTKYFNATLLKSMXXXXXXX 721
            +LELLVNVWAYHS R MVY++P  GSLEEQHP++QRKG MW KYFN TLLKSM       
Sbjct: 914  VLELLVNVWAYHSGRKMVYINPRDGSLEEQHPLQQRKGLMWAKYFNFTLLKSMDEDLAEA 973

Query: 722  XXXXXHPYEMWLWPRTGEVHWQGIXXXXXXXXXXIKMDKKRKQKEKILERLKFGYKQKTL 901
                 HP E WLWP TGEVHW+G+          +KMDKKRK KEK+ +R+K GYKQK+L
Sbjct: 974  ADDKDHPRERWLWPLTGEVHWKGVYEREREERYRLKMDKKRKTKEKLYDRIKNGYKQKSL 1033

Query: 902  AG 907
             G
Sbjct: 1034 GG 1035


>emb|CAB85554.1| putative protein [Arabidopsis thaliana]
          Length = 1091

 Score =  436 bits (1122), Expect = e-120
 Identities = 198/302 (65%), Positives = 235/302 (77%)
 Frame = +2

Query: 2    RDILLEMGGMFSIANRVDNVHKRPWIGFQSWRAAARKVSLSSKAESILEGIVYQKHRGDV 181
            RDIL E+GGMFS+AN+VD++H RPWIGFQSWRAA RKVSLSSKAE  LE I+ Q+ +G++
Sbjct: 790  RDILCEIGGMFSVANKVDSIHMRPWIGFQSWRAAGRKVSLSSKAEESLENIIKQETKGEI 849

Query: 182  IYFWARADMDGELAGSNHVLTFWSMCDILNAGNCRTAFQDTFRRMYLLPSYVEALPPMPE 361
            IYFW R D+DG+  GS + LTFWSMCDILN GNCRT F+D FR MY LP ++EALPPMPE
Sbjct: 850  IYFWTRLDIDGDAYGSKNALTFWSMCDILNQGNCRTTFEDAFRHMYGLPEHIEALPPMPE 909

Query: 362  DGGHWSSLHSWVMATPSFLEFMMFSRMFADSLDSLHINASTATECLLGSSASEKQHCYCR 541
            DG HWSSLH+WVM TPSFLEF+MFSRMF++SLD+LH N + +  C L SS  E++HCYCR
Sbjct: 910  DGHHWSSLHNWVMPTPSFLEFVMFSRMFSESLDALHNNLNDSKSCSLASSLLERKHCYCR 969

Query: 542  ILELLVNVWAYHSARTMVYVDPNSGSLEEQHPVEQRKGFMWTKYFNATLLKSMXXXXXXX 721
            +LELLVNVWAYHS R MVY++P  GSLEEQHP++QRKG MW KYFN TLLKSM       
Sbjct: 970  VLELLVNVWAYHSGRKMVYINPRDGSLEEQHPLQQRKGLMWAKYFNFTLLKSMDEDLAEA 1029

Query: 722  XXXXXHPYEMWLWPRTGEVHWQGIXXXXXXXXXXIKMDKKRKQKEKILERLKFGYKQKTL 901
                 HP E WLWP TGEVHW+G+          +KMDKKRK KEK+ +R+K GYKQK+L
Sbjct: 1030 ADDKDHPRERWLWPLTGEVHWKGVYEREREERYRLKMDKKRKTKEKLYDRIKNGYKQKSL 1089

Query: 902  AG 907
             G
Sbjct: 1090 GG 1091


>ref|NP_568137.1| UDP-glycosyltransferase family protein [Arabidopsis thaliana]
            gi|15450503|gb|AAK96544.1| AT5g04480/T32M21_80
            [Arabidopsis thaliana] gi|24111433|gb|AAN46867.1|
            At5g04480/T32M21_80 [Arabidopsis thaliana]
            gi|332003367|gb|AED90750.1| UDP-glycosyltransferase
            family protein [Arabidopsis thaliana]
          Length = 1050

 Score =  436 bits (1122), Expect = e-120
 Identities = 198/302 (65%), Positives = 235/302 (77%)
 Frame = +2

Query: 2    RDILLEMGGMFSIANRVDNVHKRPWIGFQSWRAAARKVSLSSKAESILEGIVYQKHRGDV 181
            RDIL E+GGMFS+AN+VD++H RPWIGFQSWRAA RKVSLSSKAE  LE I+ Q+ +G++
Sbjct: 749  RDILCEIGGMFSVANKVDSIHMRPWIGFQSWRAAGRKVSLSSKAEESLENIIKQETKGEI 808

Query: 182  IYFWARADMDGELAGSNHVLTFWSMCDILNAGNCRTAFQDTFRRMYLLPSYVEALPPMPE 361
            IYFW R D+DG+  GS + LTFWSMCDILN GNCRT F+D FR MY LP ++EALPPMPE
Sbjct: 809  IYFWTRLDIDGDAYGSKNALTFWSMCDILNQGNCRTTFEDAFRHMYGLPEHIEALPPMPE 868

Query: 362  DGGHWSSLHSWVMATPSFLEFMMFSRMFADSLDSLHINASTATECLLGSSASEKQHCYCR 541
            DG HWSSLH+WVM TPSFLEF+MFSRMF++SLD+LH N + +  C L SS  E++HCYCR
Sbjct: 869  DGHHWSSLHNWVMPTPSFLEFVMFSRMFSESLDALHNNLNDSKSCSLASSLLERKHCYCR 928

Query: 542  ILELLVNVWAYHSARTMVYVDPNSGSLEEQHPVEQRKGFMWTKYFNATLLKSMXXXXXXX 721
            +LELLVNVWAYHS R MVY++P  GSLEEQHP++QRKG MW KYFN TLLKSM       
Sbjct: 929  VLELLVNVWAYHSGRKMVYINPRDGSLEEQHPLQQRKGLMWAKYFNFTLLKSMDEDLAEA 988

Query: 722  XXXXXHPYEMWLWPRTGEVHWQGIXXXXXXXXXXIKMDKKRKQKEKILERLKFGYKQKTL 901
                 HP E WLWP TGEVHW+G+          +KMDKKRK KEK+ +R+K GYKQK+L
Sbjct: 989  ADDKDHPRERWLWPLTGEVHWKGVYEREREERYRLKMDKKRKTKEKLYDRIKNGYKQKSL 1048

Query: 902  AG 907
             G
Sbjct: 1049 GG 1050


Top