BLASTX nr result

ID: Scutellaria22_contig00001888 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria22_contig00001888
         (2142 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|NP_190612.3| uncharacterized protein [Arabidopsis thaliana] ...   317   8e-84
ref|XP_002876031.1| hypothetical protein ARALYDRAFT_485395 [Arab...   315   3e-83
emb|CAB62472.1| hypothetical protein [Arabidopsis thaliana]           302   2e-79
ref|XP_003519917.1| PREDICTED: uncharacterized protein LOC100812...   282   3e-73
ref|XP_002306450.1| predicted protein [Populus trichocarpa] gi|2...   281   4e-73

>ref|NP_190612.3| uncharacterized protein [Arabidopsis thaliana]
            gi|28973649|gb|AAO64145.1| unknown protein [Arabidopsis
            thaliana] gi|110737253|dbj|BAF00574.1| hypothetical
            protein [Arabidopsis thaliana]
            gi|332645145|gb|AEE78666.1| uncharacterized protein
            [Arabidopsis thaliana]
          Length = 642

 Score =  317 bits (812), Expect = 8e-84
 Identities = 187/492 (38%), Positives = 278/492 (56%), Gaps = 9/492 (1%)
 Frame = +1

Query: 535  ANWSVVAAIFRVLRSIQKYLKQDMDDKIMKAFXXXXXXXXXXXPWDSLGEIYAHYNAEYL 714
            ANWS V+ IFRVLR+I K L Q+ +++I   +           PW  L  I++H +    
Sbjct: 180  ANWSTVSDIFRVLRNILKRLSQEDNEEIFDVYLESVNSTLAKVPWCRLDTIFSHQH---- 235

Query: 715  QGSAEDIAVQVEVVQPRDLTLFFGNFIQFFCSLVTQSSLA-DGLGYSPLFGII---INLV 882
             GS E    Q +     + T+F G+F+QF CS+V Q  +  D   + P + I+   I L+
Sbjct: 236  -GSGER-NFQGQSGNSEEATVFLGSFVQFLCSMVQQVHVVEDSDDFEPSYLILQKTIKLI 293

Query: 883  PKLTAWCHIHLQSPYHVRISHYFRHKVLMLLVNLSSRTQIEQSIRVTWMHVLHKYFEDLL 1062
            P L  WC   L+S     +S Y  HK+L+L++ L+ +++I+ +I ++W+  L +  +  L
Sbjct: 294  PDLLRWCQPKLKSQSGSCMSRYLGHKLLVLMIRLTDKSKIKCTILLSWLQYLQRDSQGFL 353

Query: 1063 LLPISGVKFDQNDFLEGSPFSTSIFDPGKQNISSRHLQRLAIFLFLKCSLNLASTKG--- 1233
               ++  K  Q++ LEGSPF  S+ D     + S HLQRL++FLFL+CS  L  +     
Sbjct: 354  QHTLTKFKPVQDNCLEGSPFFVSLSDREVNEMHSNHLQRLSVFLFLRCSFTLIYSSRHND 413

Query: 1234 -IPEXXXXXXXXXELHGWLLAHVPADILLNDELYLERCMRFAESFLQLFMHEDDILFETL 1410
             + E         E+  W+   +P ++  +  +Y ++ + F+ SF++LFMHEDD+LF+ L
Sbjct: 414  KLCEFDCRKKGMAEMFKWIERQIPGNMFSDHRIYSKKNVEFSASFVRLFMHEDDLLFKVL 473

Query: 1411 LQLFNVPFYL-ERQVMKDKALSEVKNHLSFLVCDLFNPINLFHLFLAEIHYDHQVLLDYL 1587
            LQL +VP +  E   ++  +L + +    F +  LFNP+ LF +FL+E+HYDHQVLLDYL
Sbjct: 474  LQLLSVPLHRQELPNVEGGSLEDEEQITLFRLSTLFNPVRLFCIFLSELHYDHQVLLDYL 533

Query: 1588 ISKDTGSSCAEYLLRSLRIICNSWSLFVEFPGVQEDLGQLRAKRQKVLAECTDFKGIFYP 1767
            ISKD G+SCAEYLLR LR +C+SW+LFVEFP  +        KR+KVL E +        
Sbjct: 534  ISKDIGASCAEYLLRCLRAVCDSWTLFVEFP-FEGSTDAPSPKRRKVLPETS-------- 584

Query: 1768 ASLKDCRSSSFDKEHKEGRAHGKNHTLPFVAARDCLVSLTTSISSLNQKNLFPYNPKVLL 1947
                            E   + + H   F  A+DCL+SL  S+  L+QK LFPYNP+ LL
Sbjct: 585  ----------------EVEQNWRLHAQAFEDAKDCLLSLQNSVVKLHQKKLFPYNPEALL 628

Query: 1948 RRLMRFQELCIS 1983
            RRL RF ELC+S
Sbjct: 629  RRLSRFHELCLS 640


>ref|XP_002876031.1| hypothetical protein ARALYDRAFT_485395 [Arabidopsis lyrata subsp.
            lyrata] gi|297321869|gb|EFH52290.1| hypothetical protein
            ARALYDRAFT_485395 [Arabidopsis lyrata subsp. lyrata]
          Length = 648

 Score =  315 bits (807), Expect = 3e-83
 Identities = 181/492 (36%), Positives = 268/492 (54%), Gaps = 9/492 (1%)
 Frame = +1

Query: 535  ANWSVVAAIFRVLRSIQKYLKQDMDDKIMKAFXXXXXXXXXXXPWDSLGEIYAHYNAEYL 714
            A+WS V+ IFR+LR+I K L Q+ D++++  +           PW  +  +++H +    
Sbjct: 180  ASWSTVSDIFRILRNILKRLSQEEDEELLDVYLESVNSTLAKVPWSRVDTVFSHQHGSGE 239

Query: 715  QGSAEDIAVQVEVVQPRDLTLFFGNFIQFFCSLVTQSSLADGLGYSPLFGII----INLV 882
            +                + T+F GNF+QF CS+V    + +    S    +I    I LV
Sbjct: 240  RNFQGQSGTLGSTANSEEATVFLGNFVQFLCSMVQHVRVVEDSDDSEPSHLILQKTIKLV 299

Query: 883  PKLTAWCHIHLQSPYHVRISHYFRHKVLMLLVNLSSRTQIEQSIRVTWMHVLHKYFEDLL 1062
            P L  WC   L+S     +S Y  HK+L+L++ L+ ++ I+ +I ++W+  L +  +  L
Sbjct: 300  PDLIRWCQPKLKSQSGSCMSRYLGHKLLVLMIRLTDKSNIKCTILLSWLQYLQRDSQGFL 359

Query: 1063 LLPISGVKFDQNDFLEGSPFSTSIFDPGKQNISSRHLQRLAIFLFLKCSLNLASTKGIP- 1239
               ++  K  Q++ LEGSPF  S+ D       S HLQRL++FLFL+CS  L  +     
Sbjct: 360  QHTLTKFKPVQDNCLEGSPFFVSLSDREINETHSNHLQRLSVFLFLRCSFTLIYSSRHNG 419

Query: 1240 ---EXXXXXXXXXELHGWLLAHVPADILLNDELYLERCMRFAESFLQLFMHEDDILFETL 1410
               E         E+  W++  +P  I  +  +Y ++ + F+ SF++LFMHEDD+LF+ L
Sbjct: 420  KQCEFDCRKKGMAEMFKWIVRQIPGIICSDHRIYSKKSVEFSASFVRLFMHEDDLLFKVL 479

Query: 1411 LQLFNVPFYL-ERQVMKDKALSEVKNHLSFLVCDLFNPINLFHLFLAEIHYDHQVLLDYL 1587
            LQL +VP +  E   ++  +L + +    F    LFNP+ LF +FL+E+HYDHQVLLDYL
Sbjct: 480  LQLLSVPLHRQELPNVEGGSLEDEEQITLFRFSTLFNPVTLFCIFLSELHYDHQVLLDYL 539

Query: 1588 ISKDTGSSCAEYLLRSLRIICNSWSLFVEFPGVQEDLGQLRAKRQKVLAECTDFKGIFYP 1767
            ISKD G SCAEYLLR LR +C+SW+LFVEFP  +        KR+KVL E +        
Sbjct: 540  ISKDIGDSCAEYLLRCLRAVCDSWTLFVEFP-FEGSTNASSPKRRKVLPETS-------- 590

Query: 1768 ASLKDCRSSSFDKEHKEGRAHGKNHTLPFVAARDCLVSLTTSISSLNQKNLFPYNPKVLL 1947
                            E   + + H   F  A+DCL+SL  S+  L+QK LFPYNP+ LL
Sbjct: 591  ----------------EVEQNWRLHPQAFEDAKDCLLSLQNSVVKLHQKKLFPYNPEALL 634

Query: 1948 RRLMRFQELCIS 1983
            RRL RFQELC+S
Sbjct: 635  RRLSRFQELCLS 646


>emb|CAB62472.1| hypothetical protein [Arabidopsis thaliana]
          Length = 730

 Score =  302 bits (774), Expect = 2e-79
 Identities = 180/482 (37%), Positives = 270/482 (56%), Gaps = 9/482 (1%)
 Frame = +1

Query: 535  ANWSVVAAIFRVLRSIQKYLKQDMDDKIMKAFXXXXXXXXXXXPWDSLGEIYAHYNAEYL 714
            ANWS V+ IFRVLR+I K L Q+ +++I   +           PW  L  I++H +    
Sbjct: 173  ANWSTVSDIFRVLRNILKRLSQEDNEEIFDVYLESVNSTLAKVPWCRLDTIFSHQH---- 228

Query: 715  QGSAEDIAVQVEVVQPRDLTLFFGNFIQFFCSLVTQSSLA-DGLGYSPLFGII---INLV 882
             GS E    Q +     + T+F G+F+QF CS+V Q  +  D   + P + I+   I L+
Sbjct: 229  -GSGER-NFQGQSGNSEEATVFLGSFVQFLCSMVQQVHVVEDSDDFEPSYLILQKTIKLI 286

Query: 883  PKLTAWCHIHLQSPYHVRISHYFRHKVLMLLVNLSSRTQIEQSIRVTWMHVLHKYFEDLL 1062
            P L  WC   L+S     +S Y  HK+L+L++ L+ +++I+ +I ++W+  L +  +  L
Sbjct: 287  PDLLRWCQPKLKSQSGSCMSRYLGHKLLVLMIRLTDKSKIKCTILLSWLQYLQRDSQGFL 346

Query: 1063 LLPISGVKFDQNDFLEGSPFSTSIFDPGKQNISSRHLQRLAIFLFLKCSLNLASTKG--- 1233
               ++  K  Q++ LEGSPF  S+ D     + S HLQRL++FLFL+CS  L  +     
Sbjct: 347  QHTLTKFKPVQDNCLEGSPFFVSLSDREVNEMHSNHLQRLSVFLFLRCSFTLIYSSRHND 406

Query: 1234 -IPEXXXXXXXXXELHGWLLAHVPADILLNDELYLERCMRFAESFLQLFMHEDDILFETL 1410
             + E         E+  W+   +P ++  +  +Y ++ + F+ SF++LFMHEDD+LF+ L
Sbjct: 407  KLCEFDCRKKGMAEMFKWIERQIPGNMFSDHRIYSKKNVEFSASFVRLFMHEDDLLFKVL 466

Query: 1411 LQLFNVPFYL-ERQVMKDKALSEVKNHLSFLVCDLFNPINLFHLFLAEIHYDHQVLLDYL 1587
            LQL +VP +  E   ++  +L + +    F +  LFNP+ LF +FL+E+HYDHQVLLDYL
Sbjct: 467  LQLLSVPLHRQELPNVEGGSLEDEEQITLFRLSTLFNPVRLFCIFLSELHYDHQVLLDYL 526

Query: 1588 ISKDTGSSCAEYLLRSLRIICNSWSLFVEFPGVQEDLGQLRAKRQKVLAECTDFKGIFYP 1767
            ISKD G+SCAEYLLR LR +C+SW+LFVEFP  +        KR+KVL E +        
Sbjct: 527  ISKDIGASCAEYLLRCLRAVCDSWTLFVEFP-FEGSTDAPSPKRRKVLPETS-------- 577

Query: 1768 ASLKDCRSSSFDKEHKEGRAHGKNHTLPFVAARDCLVSLTTSISSLNQKNLFPYNPKVLL 1947
                            E   + + H   F  A+DCL+SL  S+  L+QK LFPYNP+ LL
Sbjct: 578  ----------------EVEQNWRLHAQAFEDAKDCLLSLQNSVVKLHQKKLFPYNPEALL 621

Query: 1948 RR 1953
            RR
Sbjct: 622  RR 623


>ref|XP_003519917.1| PREDICTED: uncharacterized protein LOC100812484 [Glycine max]
          Length = 639

 Score =  282 bits (721), Expect = 3e-73
 Identities = 186/526 (35%), Positives = 268/526 (50%), Gaps = 36/526 (6%)
 Frame = +1

Query: 406  VCLSLDLAISSTLGSSSQP--SALESRYLDFDPSTLRSXXXXXXSANWSVVAAIFRVLRS 579
            +C SL++AI+  +  SS+P   A  S +  FD   L        + +WS VA + RVLR 
Sbjct: 131  LCCSLEMAIARMISCSSEPPSGAENSEFDCFDVEFLMQYGLK--NFDWSTVAGVVRVLRV 188

Query: 580  IQKYLKQ-DMDDKIMKAFXXXXXXXXXXXPWDSLGEIYAHY------NAEYLQGSAEDIA 738
            I K+LK+ D DD ++K +           PWD L E ++        N+   Q   ++ +
Sbjct: 189  ICKHLKEEDYDDGLIKVYYDSVNSCLLKMPWDLLDEYWSSEFGRMKDNSTINQLHLKNFS 248

Query: 739  VQVEVVQPRDLTLFFGNFIQFFCSLVTQSSLA----DGLGYSPLFGIIINLVPKLTAWCH 906
            V   V+       F G F+Q  CSLV ++       D +   PLF  ++NL+P+L  WC 
Sbjct: 249  VMDPVMN------FLGTFLQLLCSLVYRNDSVETGCDSVDKHPLFLTVVNLIPRLAKWCL 302

Query: 907  IHLQSPYHVRISHYFRHKVLMLLVNLSSRTQIEQSIRVTWMHVLHKYFEDLLLLPISGVK 1086
               ++   +   HY +HK+L+L++ L S T ++  IR++W+ +LH YF++LL  P++   
Sbjct: 303  SEQENNAEMHAIHYLKHKLLILMIRLGSLTGLDCRIRLSWLELLHNYFQELLQQPLTQFL 362

Query: 1087 FDQNDFLEGSPFSTSIFDPGKQNISSRHLQRLAIFLFLKCSLNLASTKGI---------- 1236
             DQ D LE SPF  S+ D       S HL+R A++L L CS +L   +G           
Sbjct: 363  SDQIDCLEDSPFLWSLCDGEACMKRSDHLRRQAVYLLLACSFSLICKRGEIANHCNNSTL 422

Query: 1237 -----------PEXXXXXXXXXELHGWLLAHVPADILLNDELYLERCMRFAESFLQLFMH 1383
                        +         EL  W+L H+P  I +N E Y++ CM F  SFLQL++ 
Sbjct: 423  CSSFTTNPDSEHDYFCRKKGSLELFKWILGHLPTAISINHEKYMQMCMNFISSFLQLYLR 482

Query: 1384 EDDILFETLLQLFNVPFYLERQVMKDKALSEVKNHLSFLVCDLFNPINLFHLFLAEIHYD 1563
            EDD+LFE LL LF++   L+ Q       SE K+               +H    +IHYD
Sbjct: 483  EDDLLFEVLLLLFSISSSLQEQ-------SESKDAA-------------YH----DIHYD 518

Query: 1564 HQVLLDYLISKDTGSSCAEYLLRSLRIICNSWSLFVEFPGVQEDLGQLRAKRQKVLAECT 1743
            HQVLLDYLISKDTG SCA+YLLR L +ICNSW LFVEFP   E L Q   KR+K++ +  
Sbjct: 519  HQVLLDYLISKDTGISCAKYLLRCLHLICNSWKLFVEFPLFGEFLDQSSCKRRKIVGDGL 578

Query: 1744 DFKGIFYPASLKDCRSSSFD-KEHKEGRAHGKNHTL-PFVAARDCL 1875
             F     P S+ +  S     K +KE R   K + + PF  A +C+
Sbjct: 579  HFLADGMPTSIDNSGSIILHIKNYKEDRGGFKYYNIKPFKKAGECI 624


>ref|XP_002306450.1| predicted protein [Populus trichocarpa] gi|222855899|gb|EEE93446.1|
            predicted protein [Populus trichocarpa]
          Length = 622

 Score =  281 bits (720), Expect = 4e-73
 Identities = 188/566 (33%), Positives = 275/566 (48%), Gaps = 36/566 (6%)
 Frame = +1

Query: 394  YVPGVCLSLDLAISSTLGSSSQPSALESRYLDFDPSTLRSXXXXXXSANWSVVAAIFRVL 573
            ++  +   L+LAI++    S +PS  E    + D S+           +WS  A I RVL
Sbjct: 115  FIHSLSTCLELAIANVFLCSWEPSRTEVEDSNCDFSSYEVVKSSLKGGDWSTAAGIVRVL 174

Query: 574  RSIQKYLKQDMDDKIMKAFXXXXXXXXXXXPWDSLGEIYAHYNAEYLQGS-----AEDIA 738
            R+I K+LKQ+ DD++++ +           PW+S+ EI+   + +   G      ++D +
Sbjct: 175  RNILKHLKQECDDQLLEVYLGSVSSFLSNVPWESMDEIHVDQSCDAWDGDPQNCCSKDAS 234

Query: 739  VQVEVVQPRDLTLFFGNFIQFFCSLVTQSSLADGLGYS----PLFGIIINLVPKLTAWCH 906
            V           LF G FIQF CSLV QSS  +    S    P+  ++I+LVPKL  WC 
Sbjct: 235  VFRSFGAKEPKVLFLGIFIQFLCSLVEQSSAVETEVGSQVQYPVLSMVISLVPKLACWCL 294

Query: 907  IHLQSPYHVRISHYFRHKVLMLLVNLSSRTQIEQSIRVTWMHVLHKYFEDLLLLPISGVK 1086
                    + +S YFRHK+LML++ +S  T +  S  + W+ +LH+YFE+LL  PIS ++
Sbjct: 295  CKKGKSVKLSVSQYFRHKLLMLMLRISYVTCLGCSTLILWLQLLHEYFEELLQKPISKLE 354

Query: 1087 FDQNDFLEGSPFSTSIFDPGKQNISSRHLQRLAIFLFLKCSLNLASTKGIP--------- 1239
              Q++ LEGSPF   + +     + S HLQR  + LFL+C  +L S  G           
Sbjct: 355  AGQDECLEGSPFLLGLSNGELDGMHSFHLQRQTLLLFLRCCFSLMSFTGETSKQCVTSKT 414

Query: 1240 --------------EXXXXXXXXXELHGWLLAHVPADILLNDELYLERCMRFAESFLQLF 1377
                          +         EL+ WL  H+P DIL++ E                 
Sbjct: 415  ILKSCLTVASVSDLDYCSRNKGLLELYNWLQGHLPDDILVDHE----------------- 457

Query: 1378 MHEDDILFETLLQLFNVPFYLERQVMKDKALSEVKNHLSFLVCDLFNPINLFHLFLAEIH 1557
                                L  +    + L +  +HL                     H
Sbjct: 458  -------------------RLNGEKQTSQYLKDATHHL---------------------H 477

Query: 1558 YDHQVLLDYLISKDTGSSCAEYLLRSLRIICNSWSLFVEFPGVQEDLGQLRAKRQKVLAE 1737
            YDHQVLLDYLISKD G SCAEYLLR LR++ NSW++F  F    + + Q   K++++L +
Sbjct: 478  YDHQVLLDYLISKDVGISCAEYLLRCLRMVHNSWNVFATFSMDWKVVNQSCCKKRRLLLD 537

Query: 1738 CTDFKGIFYPASLKDCRSSSFDKE-HKEGRAHGKNH---TLPFVAARDCLVSLTTSISSL 1905
             +DF+G    +  + C S S ++E  KE     +NH     PF  A+DCL+SL  S+ SL
Sbjct: 538  VSDFQGEL-SSIPEQCISQSLEEEDEKEFEYTCENHQNKRQPFKEAKDCLISLKASVESL 596

Query: 1906 NQKNLFPYNPKVLLRRLMRFQELCIS 1983
            ++KNLFPYNP VLL+RL +FQELC S
Sbjct: 597  HRKNLFPYNPLVLLKRLSQFQELCHS 622


Top