BLASTX nr result

ID: Scutellaria23_contig00018754 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria23_contig00018754
         (2010 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002270269.1| PREDICTED: uncharacterized protein LOC100254...   671   0.0  
ref|NP_001190226.1| UDP-glycosyltransferase family protein [Arab...   662   0.0  
ref|NP_568137.1| UDP-glycosyltransferase family protein [Arabido...   662   0.0  
ref|XP_002873152.1| hypothetical protein ARALYDRAFT_487229 [Arab...   659   0.0  
emb|CAB85554.1| putative protein [Arabidopsis thaliana]               658   0.0  

>ref|XP_002270269.1| PREDICTED: uncharacterized protein LOC100254795 [Vitis vinifera]
          Length = 1028

 Score =  671 bits (1731), Expect = 0.0
 Identities = 333/574 (58%), Positives = 404/574 (70%), Gaps = 15/574 (2%)
 Frame = -2

Query: 2009 FPKNDPEVLRNSFSLLISDGKLSRFARSVAASGRLHAKNMFAEECIIVHAKLVEDVFHFP 1830
            FPKN+P+ L  +FSLLIS+GKLS+FA++VA SGRL AKNM A EC+  +AKL+E+V  FP
Sbjct: 466  FPKNNPDALMRAFSLLISNGKLSKFAKAVALSGRLLAKNMLASECVNSYAKLLENVLSFP 525

Query: 1829 SDVLLPSRASQMNNIIWEWSLFRKGLDQISRDTEDLLLEDN---TRMNSSIVYDLEEDMT 1659
            SDVLLP   SQ  +  WEW+ FR         T D+ L +N   +   SS+V  LEE ++
Sbjct: 526  SDVLLPGHISQSQHDAWEWNSFR---------TADMPLIENGSASMRKSSVVDVLEETLS 576

Query: 1658 SYVALGNVTQGHSEGLEVEIPTTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEWEEI 1479
            + +  GN++   +E    ++ T                                  W+EI
Sbjct: 577  NQLDSGNISNSETEN---DVLTQLDWDVLREIESIEEMERLEMEELEERMEKNPGIWDEI 633

Query: 1478 YRNARKAEKLRFEPNERDEGELERTGQPICIYEMYNGAGGWPFLHHGSLYRGLSLSTKAR 1299
            YRNARK E+++FE NERDEGELERTGQP+CIYE+YNGAG WPFLHHGS+YRGLSL+T AR
Sbjct: 634  YRNARKVERVKFETNERDEGELERTGQPLCIYEIYNGAGAWPFLHHGSMYRGLSLTTSAR 693

Query: 1298 RLSSDDVDAVSRLPILNDTYYCNILCEIGAMFAIANGIDDIHKGPWIGFQSWRTAGRKVS 1119
            RL SDDVDAV RLP+LNDTYY +I C+IG MF+IA  +D IHK PWIGFQSW   G KVS
Sbjct: 694  RLRSDDVDAVDRLPVLNDTYYRDIFCDIGGMFSIAFRVDKIHKRPWIGFQSWHAVGSKVS 753

Query: 1118 LSKKAEEILEKTIQENTKGDVIYFWACLDMDRGVVGNNDLLTFWSTCDIMNAGRCRTAFE 939
            LS +AE++LE+TIQE TKGDV+YFWA L++D G    N + TFWS CDI+N G CRTAFE
Sbjct: 754  LSSRAEKVLEETIQEETKGDVLYFWAHLNVDDGPTQKNRIPTFWSMCDILNGGNCRTAFE 813

Query: 938  DAFRRMYGLPSNVEALPPMPQGGGHWLALHSWAMPTTSFLEFIMFSRMFIDSLHSLHVNS 759
            DAFR+MY +PS +EALPPMP+ GG+W ALHSW MPT SFLEFIMFSRMF DSL +LH+NS
Sbjct: 814  DAFRQMYAMPSYIEALPPMPEDGGYWSALHSWVMPTPSFLEFIMFSRMFADSLDALHMNS 873

Query: 758  NKT------------SECFLGLSAPEKKHCYCRITELLVNVWAYHSARKMVYIDPHSGLL 615
             ++            + C LG S  EKKHCYCR+ ELLVNVWAYHSARKMVYI+P+SG L
Sbjct: 874  RQSMNLSQSMNSSQPTVCLLGSSKLEKKHCYCRVLELLVNVWAYHSARKMVYINPYSGQL 933

Query: 614  REQHPVDQRKGFMWAKYFNNTLLKNMXXXXXXXXXXXDHPYRRWLWPLTGEIFWQGVXXX 435
             EQHPV+QR+GFMWAKYFN+TLLK+M           DHP  RWLWPLTGE+ WQG+   
Sbjct: 934  EEQHPVEQRRGFMWAKYFNSTLLKSMDEDLAEAADDGDHPRERWLWPLTGEVHWQGIYER 993

Query: 434  XXXXXXRVKMDKKRKTKEKLLDRLKHGYRQKSIG 333
                  R KMDKKRK KEKL++R+KHGY+QK IG
Sbjct: 994  EREERYRSKMDKKRKAKEKLVERMKHGYKQKPIG 1027


>ref|NP_001190226.1| UDP-glycosyltransferase family protein [Arabidopsis thaliana]
            gi|332003368|gb|AED90751.1| UDP-glycosyltransferase
            family protein [Arabidopsis thaliana]
          Length = 1035

 Score =  662 bits (1709), Expect = 0.0
 Identities = 321/562 (57%), Positives = 391/562 (69%), Gaps = 2/562 (0%)
 Frame = -2

Query: 2009 FPKNDPEVLRNSFSLLISDGKLSRFARSVAASGRLHAKNMFAEECIIVHAKLVEDVFHFP 1830
            F +NDP+ L  +FS LISDG+LS+FA+++A+SGRL  KN+ A ECI  +A+L+E++ HFP
Sbjct: 478  FRRNDPDALLKAFSPLISDGRLSKFAQTIASSGRLLTKNLMATECITGYARLLENMLHFP 537

Query: 1829 SDVLLPSRASQMNNIIWEWSLFRKGLDQISRDTEDLLLEDNTRM--NSSIVYDLEEDMTS 1656
            SD  LP   SQ+    WEW+ FR  L+Q     +  +L+        S IV+ +EE    
Sbjct: 538  SDTFLPGSISQLQVAAWEWNFFRSELEQ----PKSFILDSAYAFIGKSGIVFQVEEKFMG 593

Query: 1655 YVALGNVTQGHSEGLEVEIPTTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEWEEIY 1476
             +   N    ++  +  E+P+                                 +WEEIY
Sbjct: 594  VIESTNPVDNNTLFVSDELPSKLDWDVLEEIEGAEEYEKVESEELEDRMERDVEDWEEIY 653

Query: 1475 RNARKAEKLRFEPNERDEGELERTGQPICIYEMYNGAGGWPFLHHGSLYRGLSLSTKARR 1296
            RNARK+EKL+FE NERDEGELERTG+P+CIYE+YNGAG WPFLHHGSLYRGLSLS+K RR
Sbjct: 654  RNARKSEKLKFEVNERDEGELERTGEPLCIYEIYNGAGAWPFLHHGSLYRGLSLSSKDRR 713

Query: 1295 LSSDDVDAVSRLPILNDTYYCNILCEIGAMFAIANGIDDIHKGPWIGFQSWRTAGRKVSL 1116
            LSSDDVDA  RLP+LNDTYY +ILCEIG MF++AN +D IH  PWIGFQSWR AGRKVSL
Sbjct: 714  LSSDDVDAADRLPLLNDTYYRDILCEIGGMFSVANKVDSIHMRPWIGFQSWRAAGRKVSL 773

Query: 1115 SKKAEEILEKTIQENTKGDVIYFWACLDMDRGVVGNNDLLTFWSTCDIMNAGRCRTAFED 936
            S KAEE LE  I++ TKG++IYFW  LD+D    G+ + LTFWS CDI+N G CRT FED
Sbjct: 774  SSKAEESLENIIKQETKGEIIYFWTRLDIDGDAYGSKNALTFWSMCDILNQGNCRTTFED 833

Query: 935  AFRRMYGLPSNVEALPPMPQGGGHWLALHSWAMPTTSFLEFIMFSRMFIDSLHSLHVNSN 756
            AFR MYGLP ++EALPPMP+ G HW +LH+W MPT SFLEF+MFSRMF +SL +LH N N
Sbjct: 834  AFRHMYGLPEHIEALPPMPEDGHHWSSLHNWVMPTPSFLEFVMFSRMFSESLDALHNNLN 893

Query: 755  KTSECFLGLSAPEKKHCYCRITELLVNVWAYHSARKMVYIDPHSGLLREQHPVDQRKGFM 576
             +  C L  S  E+KHCYCR+ ELLVNVWAYHS RKMVYI+P  G L EQHP+ QRKG M
Sbjct: 894  DSKSCSLASSLLERKHCYCRVLELLVNVWAYHSGRKMVYINPRDGSLEEQHPLQQRKGLM 953

Query: 575  WAKYFNNTLLKNMXXXXXXXXXXXDHPYRRWLWPLTGEIFWQGVXXXXXXXXXRVKMDKK 396
            WAKYFN TLLK+M           DHP  RWLWPLTGE+ W+GV         R+KMDKK
Sbjct: 954  WAKYFNFTLLKSMDEDLAEAADDKDHPRERWLWPLTGEVHWKGVYEREREERYRLKMDKK 1013

Query: 395  RKTKEKLLDRLKHGYRQKSIGG 330
            RKTKEKL DR+K+GY+QKS+GG
Sbjct: 1014 RKTKEKLYDRIKNGYKQKSLGG 1035


>ref|NP_568137.1| UDP-glycosyltransferase family protein [Arabidopsis thaliana]
            gi|15450503|gb|AAK96544.1| AT5g04480/T32M21_80
            [Arabidopsis thaliana] gi|24111433|gb|AAN46867.1|
            At5g04480/T32M21_80 [Arabidopsis thaliana]
            gi|332003367|gb|AED90750.1| UDP-glycosyltransferase
            family protein [Arabidopsis thaliana]
          Length = 1050

 Score =  662 bits (1709), Expect = 0.0
 Identities = 321/562 (57%), Positives = 391/562 (69%), Gaps = 2/562 (0%)
 Frame = -2

Query: 2009 FPKNDPEVLRNSFSLLISDGKLSRFARSVAASGRLHAKNMFAEECIIVHAKLVEDVFHFP 1830
            F +NDP+ L  +FS LISDG+LS+FA+++A+SGRL  KN+ A ECI  +A+L+E++ HFP
Sbjct: 493  FRRNDPDALLKAFSPLISDGRLSKFAQTIASSGRLLTKNLMATECITGYARLLENMLHFP 552

Query: 1829 SDVLLPSRASQMNNIIWEWSLFRKGLDQISRDTEDLLLEDNTRM--NSSIVYDLEEDMTS 1656
            SD  LP   SQ+    WEW+ FR  L+Q     +  +L+        S IV+ +EE    
Sbjct: 553  SDTFLPGSISQLQVAAWEWNFFRSELEQ----PKSFILDSAYAFIGKSGIVFQVEEKFMG 608

Query: 1655 YVALGNVTQGHSEGLEVEIPTTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEWEEIY 1476
             +   N    ++  +  E+P+                                 +WEEIY
Sbjct: 609  VIESTNPVDNNTLFVSDELPSKLDWDVLEEIEGAEEYEKVESEELEDRMERDVEDWEEIY 668

Query: 1475 RNARKAEKLRFEPNERDEGELERTGQPICIYEMYNGAGGWPFLHHGSLYRGLSLSTKARR 1296
            RNARK+EKL+FE NERDEGELERTG+P+CIYE+YNGAG WPFLHHGSLYRGLSLS+K RR
Sbjct: 669  RNARKSEKLKFEVNERDEGELERTGEPLCIYEIYNGAGAWPFLHHGSLYRGLSLSSKDRR 728

Query: 1295 LSSDDVDAVSRLPILNDTYYCNILCEIGAMFAIANGIDDIHKGPWIGFQSWRTAGRKVSL 1116
            LSSDDVDA  RLP+LNDTYY +ILCEIG MF++AN +D IH  PWIGFQSWR AGRKVSL
Sbjct: 729  LSSDDVDAADRLPLLNDTYYRDILCEIGGMFSVANKVDSIHMRPWIGFQSWRAAGRKVSL 788

Query: 1115 SKKAEEILEKTIQENTKGDVIYFWACLDMDRGVVGNNDLLTFWSTCDIMNAGRCRTAFED 936
            S KAEE LE  I++ TKG++IYFW  LD+D    G+ + LTFWS CDI+N G CRT FED
Sbjct: 789  SSKAEESLENIIKQETKGEIIYFWTRLDIDGDAYGSKNALTFWSMCDILNQGNCRTTFED 848

Query: 935  AFRRMYGLPSNVEALPPMPQGGGHWLALHSWAMPTTSFLEFIMFSRMFIDSLHSLHVNSN 756
            AFR MYGLP ++EALPPMP+ G HW +LH+W MPT SFLEF+MFSRMF +SL +LH N N
Sbjct: 849  AFRHMYGLPEHIEALPPMPEDGHHWSSLHNWVMPTPSFLEFVMFSRMFSESLDALHNNLN 908

Query: 755  KTSECFLGLSAPEKKHCYCRITELLVNVWAYHSARKMVYIDPHSGLLREQHPVDQRKGFM 576
             +  C L  S  E+KHCYCR+ ELLVNVWAYHS RKMVYI+P  G L EQHP+ QRKG M
Sbjct: 909  DSKSCSLASSLLERKHCYCRVLELLVNVWAYHSGRKMVYINPRDGSLEEQHPLQQRKGLM 968

Query: 575  WAKYFNNTLLKNMXXXXXXXXXXXDHPYRRWLWPLTGEIFWQGVXXXXXXXXXRVKMDKK 396
            WAKYFN TLLK+M           DHP  RWLWPLTGE+ W+GV         R+KMDKK
Sbjct: 969  WAKYFNFTLLKSMDEDLAEAADDKDHPRERWLWPLTGEVHWKGVYEREREERYRLKMDKK 1028

Query: 395  RKTKEKLLDRLKHGYRQKSIGG 330
            RKTKEKL DR+K+GY+QKS+GG
Sbjct: 1029 RKTKEKLYDRIKNGYKQKSLGG 1050


>ref|XP_002873152.1| hypothetical protein ARALYDRAFT_487229 [Arabidopsis lyrata subsp.
            lyrata] gi|297318989|gb|EFH49411.1| hypothetical protein
            ARALYDRAFT_487229 [Arabidopsis lyrata subsp. lyrata]
          Length = 1051

 Score =  659 bits (1700), Expect = 0.0
 Identities = 321/560 (57%), Positives = 386/560 (68%)
 Frame = -2

Query: 2009 FPKNDPEVLRNSFSLLISDGKLSRFARSVAASGRLHAKNMFAEECIIVHAKLVEDVFHFP 1830
            F +NDP+ L  +FS LISDG+LS FA+++A+SGRL  KN+ A ECI  +A+L+E++ HFP
Sbjct: 494  FRRNDPDALLKAFSPLISDGRLSEFAQTIASSGRLLTKNLMATECITGYARLLENILHFP 553

Query: 1829 SDVLLPSRASQMNNIIWEWSLFRKGLDQISRDTEDLLLEDNTRMNSSIVYDLEEDMTSYV 1650
            SD  LP   SQ+    WEWS FR  L+Q      D       +  S IV+ +EE     +
Sbjct: 554  SDTFLPGSISQLQGASWEWSFFRSELEQPKSFILDSAYASIGK--SGIVFQVEEKYMGVI 611

Query: 1649 ALGNVTQGHSEGLEVEIPTTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEWEEIYRN 1470
               N     +  +  E+P+                                 +WEEIYRN
Sbjct: 612  ESTNPVDNSTLFVSDELPSKLDWDVLEEIEGAEEYENVESEELEDRMERDVEDWEEIYRN 671

Query: 1469 ARKAEKLRFEPNERDEGELERTGQPICIYEMYNGAGGWPFLHHGSLYRGLSLSTKARRLS 1290
            ARK+EKL+FE NERDEGELERTGQP+CIYE+Y+GAG WPFLHHGSLYRGLSLS+K RRLS
Sbjct: 672  ARKSEKLKFEVNERDEGELERTGQPVCIYEIYDGAGAWPFLHHGSLYRGLSLSSKDRRLS 731

Query: 1289 SDDVDAVSRLPILNDTYYCNILCEIGAMFAIANGIDDIHKGPWIGFQSWRTAGRKVSLSK 1110
            SDDVDA  RLP+LNDTYY +ILCEIG MF++AN +D IH  PWIGFQSWR AGRKVSLS 
Sbjct: 732  SDDVDAADRLPLLNDTYYRDILCEIGGMFSVANKVDSIHMRPWIGFQSWRAAGRKVSLSS 791

Query: 1109 KAEEILEKTIQENTKGDVIYFWACLDMDRGVVGNNDLLTFWSTCDIMNAGRCRTAFEDAF 930
            KAEE LE  I++ TKG++IYFW  LD+D    G  + LTFWS CDI+N G CRT FEDAF
Sbjct: 792  KAEESLENIIKQETKGEIIYFWTRLDIDGDAYGRKNALTFWSMCDILNQGNCRTTFEDAF 851

Query: 929  RRMYGLPSNVEALPPMPQGGGHWLALHSWAMPTTSFLEFIMFSRMFIDSLHSLHVNSNKT 750
            R +YGLP ++EALPPMP+ G HW +LH+W MPT SFLEF+MFSRMF +SL +LH N N +
Sbjct: 852  RHIYGLPEHIEALPPMPEDGHHWSSLHNWVMPTPSFLEFVMFSRMFSESLDALHNNLNDS 911

Query: 749  SECFLGLSAPEKKHCYCRITELLVNVWAYHSARKMVYIDPHSGLLREQHPVDQRKGFMWA 570
              C L  S  E+KHCYCR+ ELLVNVWAYHS RKMVYI+P  G L EQHP+ QRKG MWA
Sbjct: 912  KSCSLASSLLERKHCYCRVLELLVNVWAYHSGRKMVYINPRDGSLEEQHPLLQRKGLMWA 971

Query: 569  KYFNNTLLKNMXXXXXXXXXXXDHPYRRWLWPLTGEIFWQGVXXXXXXXXXRVKMDKKRK 390
            KYFN TLLK+M           DHP  RWLWPLTGE+ W+GV         R+KMDKKRK
Sbjct: 972  KYFNFTLLKSMDEDLAEAADDKDHPRERWLWPLTGEVHWKGVYEREREERYRLKMDKKRK 1031

Query: 389  TKEKLLDRLKHGYRQKSIGG 330
            TKEKL DR+K+GY+QKS+GG
Sbjct: 1032 TKEKLYDRIKNGYKQKSLGG 1051


>emb|CAB85554.1| putative protein [Arabidopsis thaliana]
          Length = 1091

 Score =  658 bits (1698), Expect = 0.0
 Identities = 321/562 (57%), Positives = 390/562 (69%), Gaps = 2/562 (0%)
 Frame = -2

Query: 2009 FPKNDPEVLRNSFSLLISDGKLSRFARSVAASGRLHAKNMFAEECIIVHAKLVEDVFHFP 1830
            F +NDP+ L  +FS LISDG+LS+FA+++A+SGRL  KN+ A ECI  +A+L+E++ HFP
Sbjct: 536  FRRNDPDALLKAFSPLISDGRLSKFAQTIASSGRLLTKNLMATECITGYARLLENMLHFP 595

Query: 1829 SDVLLPSRASQMNNIIWEWSLFRKGLDQISRDTEDLLLEDNTRM--NSSIVYDLEEDMTS 1656
            SD  LP   SQ+    WEW+ FR  L+Q     +  +L+        S IV+ +EE    
Sbjct: 596  SDTFLPGSISQLQVAAWEWNFFRSELEQ----PKSFILDSAYAFIGKSGIVFQVEEKFMG 651

Query: 1655 YVALGNVTQGHSEGLEVEIPTTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEWEEIY 1476
             +   N    ++  +  E+P+                                  WEEIY
Sbjct: 652  VIESTNPVDNNTLFVSDELPSKLDWDVLEEIEGAEEYEKVESEEDRMERDVED--WEEIY 709

Query: 1475 RNARKAEKLRFEPNERDEGELERTGQPICIYEMYNGAGGWPFLHHGSLYRGLSLSTKARR 1296
            RNARK+EKL+FE NERDEGELERTG+P+CIYE+YNGAG WPFLHHGSLYRGLSLS+K RR
Sbjct: 710  RNARKSEKLKFEVNERDEGELERTGEPLCIYEIYNGAGAWPFLHHGSLYRGLSLSSKDRR 769

Query: 1295 LSSDDVDAVSRLPILNDTYYCNILCEIGAMFAIANGIDDIHKGPWIGFQSWRTAGRKVSL 1116
            LSSDDVDA  RLP+LNDTYY +ILCEIG MF++AN +D IH  PWIGFQSWR AGRKVSL
Sbjct: 770  LSSDDVDAADRLPLLNDTYYRDILCEIGGMFSVANKVDSIHMRPWIGFQSWRAAGRKVSL 829

Query: 1115 SKKAEEILEKTIQENTKGDVIYFWACLDMDRGVVGNNDLLTFWSTCDIMNAGRCRTAFED 936
            S KAEE LE  I++ TKG++IYFW  LD+D    G+ + LTFWS CDI+N G CRT FED
Sbjct: 830  SSKAEESLENIIKQETKGEIIYFWTRLDIDGDAYGSKNALTFWSMCDILNQGNCRTTFED 889

Query: 935  AFRRMYGLPSNVEALPPMPQGGGHWLALHSWAMPTTSFLEFIMFSRMFIDSLHSLHVNSN 756
            AFR MYGLP ++EALPPMP+ G HW +LH+W MPT SFLEF+MFSRMF +SL +LH N N
Sbjct: 890  AFRHMYGLPEHIEALPPMPEDGHHWSSLHNWVMPTPSFLEFVMFSRMFSESLDALHNNLN 949

Query: 755  KTSECFLGLSAPEKKHCYCRITELLVNVWAYHSARKMVYIDPHSGLLREQHPVDQRKGFM 576
             +  C L  S  E+KHCYCR+ ELLVNVWAYHS RKMVYI+P  G L EQHP+ QRKG M
Sbjct: 950  DSKSCSLASSLLERKHCYCRVLELLVNVWAYHSGRKMVYINPRDGSLEEQHPLQQRKGLM 1009

Query: 575  WAKYFNNTLLKNMXXXXXXXXXXXDHPYRRWLWPLTGEIFWQGVXXXXXXXXXRVKMDKK 396
            WAKYFN TLLK+M           DHP  RWLWPLTGE+ W+GV         R+KMDKK
Sbjct: 1010 WAKYFNFTLLKSMDEDLAEAADDKDHPRERWLWPLTGEVHWKGVYEREREERYRLKMDKK 1069

Query: 395  RKTKEKLLDRLKHGYRQKSIGG 330
            RKTKEKL DR+K+GY+QKS+GG
Sbjct: 1070 RKTKEKLYDRIKNGYKQKSLGG 1091


Top