BLASTX nr result

ID: Scutellaria23_contig00009198 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria23_contig00009198
         (1451 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002276690.1| PREDICTED: uncharacterized protein LOC100248...   480   e-133
ref|XP_002529253.1| conserved hypothetical protein [Ricinus comm...   423   e-116
ref|XP_002869394.1| hypothetical protein ARALYDRAFT_913468 [Arab...   408   e-111
ref|NP_194744.2| uncharacterized protein [Arabidopsis thaliana] ...   407   e-111
ref|XP_004150076.1| PREDICTED: uncharacterized protein LOC101208...   388   e-105

>ref|XP_002276690.1| PREDICTED: uncharacterized protein LOC100248664 [Vitis vinifera]
          Length = 2129

 Score =  480 bits (1235), Expect = e-133
 Identities = 257/506 (50%), Positives = 332/506 (65%), Gaps = 25/506 (4%)
 Frame = -1

Query: 1445 LKKSLLMQVFRGENAEAAYFLRQLFNACSAILRLNLQIDITSSSWSLYVIMVDISESLLL 1266
            L + LL  + +G+N EAA+FLR+LF A SAILRLNLQI+    S     I   IS+ LLL
Sbjct: 1605 LNRPLLRSLLKGDNPEAAFFLRELFIASSAILRLNLQINCIPLSSCFVPIFNGISQLLLL 1664

Query: 1265 EFLR-SQMPHQFAFLWLDGVVKFLEELGSYFPHFDPSLSRDFYVKLIGLHLRTIGKCICL 1089
            E    + +P   + +WLDGV+K+LEELG+ FP  +P+L RD Y KLI LHL+ IGKCI L
Sbjct: 1665 ELANMADVPQPISLVWLDGVLKYLEELGNQFPLTNPTLYRDVYAKLIDLHLKAIGKCISL 1724

Query: 1088 QGKEAKLASQETGSHSKLGDQVQSL----VXXXXXXXXXXXXXXXXSFTKYVSKSSDLHL 921
            QGK A LAS +  S +K  D    L    +                SF  ++ K S+LHL
Sbjct: 1725 QGKRATLASHDAESSTKTLDSHVGLSDASLSHGPYCFDEFKSRLRMSFKVFIKKPSELHL 1784

Query: 920  LSAIQAVERALVGVQEGLMTNYEIXXXXXXXXXXXXXXXXGIDALDLILEFVTGPRRLNM 741
            LSAIQA+ERALVGVQEG M  Y++                GID LDL+LEFV+G +RL++
Sbjct: 1785 LSAIQALERALVGVQEGCMVIYDVNTGSAHGGKVSSITAAGIDCLDLVLEFVSGRKRLSV 1844

Query: 740  IKKHIQSLVGCLINVIVHLQGPSIFYGCVDPLKAYEKPDSGSVILMCIEILTKISGKPSF 561
            +K+H++SL+  L N+++HLQ P IFY  +   K    PD GSVILMCIE+LT+ISGK + 
Sbjct: 1845 VKRHLKSLIAGLFNIVLHLQSPFIFYRKLIHNKGQTDPDPGSVILMCIEVLTRISGKHAL 1904

Query: 560  FQLDACHIAQSLRAPGALFQYFLRLQISEAPIQP--------------------AMDRKI 441
            FQ+D CH+ Q LR P ALFQ F  L++S+AP                        +DR+ 
Sbjct: 1905 FQMDPCHLQQCLRIPAALFQSFRGLRLSDAPASYNFFMFSDNQDNGSLESMDSCTVDRQF 1964

Query: 440  SVELYAACCRMLCTALKHHKSETRQCIALLEDSVGALLHCLEILNTERVAVRESFAWEVQ 261
            +++L+AACCR+L T LKHHKSE  QCIALLEDSV  LL CLE ++ + V  +  F+WEV+
Sbjct: 1965 TIDLFAACCRLLNTVLKHHKSECEQCIALLEDSVCVLLRCLETVDADSVVRKGYFSWEVE 2024

Query: 260  EAVICASSLRRVYEEVRQQKDTFGQCSFQFLSRYIWVYCGFGPARNGIIREVDEALKPGL 81
            E V CA  LRR+YEE+RQQKD F Q  F+FLS YIW+Y G+GP + GI RE+D+AL+PG+
Sbjct: 2025 EGVKCACFLRRIYEEMRQQKDVFRQHCFKFLSNYIWIYSGYGPLKTGIRREIDDALRPGV 2084

Query: 80   YALIDSCSADDLQLLHSLFGEGPCRS 3
            YALID+CSADDLQ LH++FGEGPCRS
Sbjct: 2085 YALIDACSADDLQYLHTVFGEGPCRS 2110


>ref|XP_002529253.1| conserved hypothetical protein [Ricinus communis]
            gi|223531289|gb|EEF33131.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 2057

 Score =  423 bits (1088), Expect = e-116
 Identities = 238/506 (47%), Positives = 315/506 (62%), Gaps = 25/506 (4%)
 Frame = -1

Query: 1445 LKKSLLMQVFRGENAEAAYFLRQLFNACSAILRLNLQIDITSSSWSLYVIMVDISESLLL 1266
            L    L  +  G++ EAA  +RQL  A SA+L+LNLQ + T+S  SL      IS  LLL
Sbjct: 1536 LNNYFLQSLLDGDHPEAAILIRQLLIASSALLKLNLQTNCTTSLSSLVPSFFGISHVLLL 1595

Query: 1265 EFLR-SQMPHQFAFLWLDGVVKFLEELGSYFPH-FDPSLSRDFYVKLIGLHLRTIGKCIC 1092
            +    S++P  F+ +WLDGV+K+L+ELGS+FP   D + +   Y +L+ LHL  +GKCI 
Sbjct: 1596 KLADVSEVPQPFSLIWLDGVLKYLQELGSHFPSKVDSTSTVSVYTRLVELHLNALGKCIT 1655

Query: 1091 LQGKEAKLASQETGSHSKL----GDQVQSLVXXXXXXXXXXXXXXXXSFTKYVSKSSDLH 924
            LQGKEA LAS E  S SK+        +S                  S    +SKS +LH
Sbjct: 1656 LQGKEATLASHEMESSSKILSNNKGSSESSFSHTSFFLDEFKARLRMSLKVLISKSIELH 1715

Query: 923  LLSAIQAVERALVGVQEGLMTNYEIXXXXXXXXXXXXXXXXGIDALDLILEFVTGPRRLN 744
            +  AIQA+ERALVGVQEG    YEI                GID LDL+LE+++G R+ +
Sbjct: 1716 MFPAIQAIERALVGVQEGCTMIYEIKTGTADGGKVSSTVAAGIDCLDLVLEYISGGRQSS 1775

Query: 743  MIKKHIQSLVGCLINVIVHLQGPSIFYGCVDPL-KAYEKPDSGSVILMCIEILTKISGKP 567
            +++ HIQ LV  L N+IVHLQ   +FY  V P    +  PD G+VILMC+E++T+ISGK 
Sbjct: 1776 VVRGHIQKLVAALFNIIVHLQSSLVFY--VRPTGSVHNGPDPGAVILMCVEVVTRISGKR 1833

Query: 566  SFFQLDACHIAQSLRAPGALFQYFLRLQISEAPIQP------------------AMDRKI 441
            +  Q+ + H+AQSL  P ALFQ F +L++S+ P  P                   +DRK 
Sbjct: 1834 AL-QMASWHVAQSLHVPAALFQDFSQLRLSKGPPLPDLFLDNQDCDPVMGKCSSVVDRKF 1892

Query: 440  SVELYAACCRMLCTALKHHKSETRQCIALLEDSVGALLHCLEILNTERVAVRESFAWEVQ 261
            SVELYAACCR+L T LKH K E+ +CIA+L++S   LLHCLE ++ +    +  ++W  Q
Sbjct: 1893 SVELYAACCRLLYTTLKHQKRESEKCIAVLQNSARVLLHCLETVDNDLRVRKGYYSWGAQ 1952

Query: 260  EAVICASSLRRVYEEVRQQKDTFGQCSFQFLSRYIWVYCGFGPARNGIIREVDEALKPGL 81
            E V CA +LRR+YEE+R  KD FGQ  F+FLS YIWVY G+GP + GI RE+DEALKPG+
Sbjct: 1953 EGVKCACALRRIYEELRHHKDDFGQHCFKFLSDYIWVYSGYGPLKTGIRREMDEALKPGV 2012

Query: 80   YALIDSCSADDLQLLHSLFGEGPCRS 3
            YALID+CS DDLQ LHS+FGEGPCR+
Sbjct: 2013 YALIDACSVDDLQYLHSVFGEGPCRN 2038


>ref|XP_002869394.1| hypothetical protein ARALYDRAFT_913468 [Arabidopsis lyrata subsp.
            lyrata] gi|297315230|gb|EFH45653.1| hypothetical protein
            ARALYDRAFT_913468 [Arabidopsis lyrata subsp. lyrata]
          Length = 1967

 Score =  408 bits (1049), Expect = e-111
 Identities = 234/508 (46%), Positives = 315/508 (62%), Gaps = 27/508 (5%)
 Frame = -1

Query: 1445 LKKSLLMQVFRGENAEAAYFLRQLFNACSAILRLNLQIDITSSSWSLYVIMVDISESLLL 1266
            +KK ++  + +G+++E    LR L  A +AILRLNLQID  + S +   ++ +IS  LL 
Sbjct: 1445 VKKKIIESLIKGDSSEVVLALRHLLIASAAILRLNLQIDGIAFSPTFVSVLSNISNDLLS 1504

Query: 1265 EFL-RSQMPHQFAFLWLDGVVKFLEELGSYFPHFDPSLSRDFYVKLIGLHLRTIGKCICL 1089
             F   S+   +F+F+WLDG VK +EELGS F   +P+L+ D Y KLI LHL+ IGKCI L
Sbjct: 1505 VFADMSEASLEFSFIWLDGAVKVVEELGSQFCLSNPTLNIDLYSKLIELHLKVIGKCISL 1564

Query: 1088 QGKEAKLASQETG-----SHSKLGDQVQSLVXXXXXXXXXXXXXXXXSFTKYVSKSSDLH 924
            QGKEA L S ETG      H+KL    ++                  SF  ++  SS+LH
Sbjct: 1565 QGKEATLESHETGFGTNAIHAKLVLSAKN-QSHRLHWLDELKQRLRMSFKVFIQSSSELH 1623

Query: 923  LLSAIQAVERALVGVQEGLMTNYEIXXXXXXXXXXXXXXXXGIDALDLILEFVTGPRRLN 744
            LLS +QA+ERALVGV E     Y I                G+D LDLILE  TG +RLN
Sbjct: 1624 LLSGVQAIERALVGVWEVCPAIYSIQTGNRDGGRISETVAAGLDCLDLILEHATGRKRLN 1683

Query: 743  MIKKHIQSLVGCLINVIVHLQGPSIFY-GCVDPLKAYEKPDSGSVILMCIEILTKISGKP 567
            ++K+HIQ L+  +  ++ H+Q P IF+   V   +    PDSGSVILMC+E+L +I+GK 
Sbjct: 1684 VVKRHIQGLLSAVFGIMAHMQSPFIFFTNAVVGNQGSSSPDSGSVILMCVEVLIRIAGKH 1743

Query: 566  SFFQLDACHIAQSLRAPGALFQYFLRLQ-----------ISEAPIQP---------AMDR 447
            + F++D+ HI+QS+  PGA+F  +L+             +S+   Q           +D+
Sbjct: 1744 ALFRMDSSHISQSIHIPGAIFLDYLQATRVGFSVLDGNLLSKDDQQQDLLGSSKGLQVDK 1803

Query: 446  KISVELYAACCRMLCTALKHHKSETRQCIALLEDSVGALLHCLEILNTERVAVRESFAWE 267
            K SV LYAACCR+L TA+KHHKSET   IA L++SV ALLH LE   T    +    +WE
Sbjct: 1804 KFSVSLYAACCRLLYTAVKHHKSETEGSIATLQESVSALLHSLE---TAGKKLGNCVSWE 1860

Query: 266  VQEAVICASSLRRVYEEVRQQKDTFGQCSFQFLSRYIWVYCGFGPARNGIIREVDEALKP 87
            V+E + CA  LRR+YEE+RQQK+ FGQ  F+FLS YIWV  G+GP + G+ REVDEAL+P
Sbjct: 1861 VEEGIRCACFLRRIYEELRQQKEVFGQHCFKFLSTYIWVSSGYGPLKTGLEREVDEALRP 1920

Query: 86   GLYALIDSCSADDLQLLHSLFGEGPCRS 3
            G+YALIDSCS +DLQ LH++FGEGPCR+
Sbjct: 1921 GVYALIDSCSPNDLQYLHTVFGEGPCRN 1948


>ref|NP_194744.2| uncharacterized protein [Arabidopsis thaliana]
            gi|332660326|gb|AEE85726.1| uncharacterized protein
            [Arabidopsis thaliana]
          Length = 2009

 Score =  407 bits (1047), Expect = e-111
 Identities = 230/508 (45%), Positives = 313/508 (61%), Gaps = 27/508 (5%)
 Frame = -1

Query: 1445 LKKSLLMQVFRGENAEAAYFLRQLFNACSAILRLNLQIDITSSSWSLYVIMVDISESLLL 1266
            +KK ++  + +G+++E    L+ L  A +AILRLNLQID  + S +   ++ +IS  LL 
Sbjct: 1487 VKKKIIESLIKGDSSEVVLALKHLLIASAAILRLNLQIDGITFSPTFVSVLTNISNDLLS 1546

Query: 1265 EFL-RSQMPHQFAFLWLDGVVKFLEELGSYFPHFDPSLSRDFYVKLIGLHLRTIGKCICL 1089
             F   S+ P +F+F+WLDG VK +EELGS F   +P+L+ D Y KLI LHL+ IGKCI L
Sbjct: 1547 VFADMSEAPLEFSFIWLDGAVKVVEELGSQFCLSNPTLNIDLYSKLIELHLKVIGKCISL 1606

Query: 1088 QGKEAKLASQETG-----SHSKLGDQVQSLVXXXXXXXXXXXXXXXXSFTKYVSKSSDLH 924
            QGKEA L S ETG      H+KL    +                   SF  ++  SS+LH
Sbjct: 1607 QGKEATLESHETGFGTNAIHAKL-VLTEKKRSHRLHWLDELKQRLRMSFKVFIHSSSELH 1665

Query: 923  LLSAIQAVERALVGVQEGLMTNYEIXXXXXXXXXXXXXXXXGIDALDLILEFVTGPRRLN 744
            LLS +QA+ERALVGV E     Y I                G+D LDLILE  TG +RLN
Sbjct: 1666 LLSGVQAIERALVGVWEVCPAIYCIQTGNRDGGRISETVAAGLDCLDLILEHATGRKRLN 1725

Query: 743  MIKKHIQSLVGCLINVIVHLQGPSIFY-GCVDPLKAYEKPDSGSVILMCIEILTKISGKP 567
            ++K+HIQ L+  +  ++ H+Q P IF+   V   +    PDSG+VILMC+ +L +I+GK 
Sbjct: 1726 VVKRHIQGLMSAVFGIMAHMQSPFIFFSNAVVGNQGSNSPDSGAVILMCVGVLIRIAGKH 1785

Query: 566  SFFQLDACHIAQSLRAPGALFQYFL--------------------RLQISEAPIQPAMDR 447
            + F++D+ H++QS+  PGA+F  +L                    +  +     +  +DR
Sbjct: 1786 ALFRMDSSHVSQSIHIPGAIFLDYLHATRVGFSVLDGNLLSKDDQQQDLLGCSKELQVDR 1845

Query: 446  KISVELYAACCRMLCTALKHHKSETRQCIALLEDSVGALLHCLEILNTERVAVRESFAWE 267
            K SV LYAACCR+L TA+KHHKS+T   IA L++SV ALLHCLE   T    +    +WE
Sbjct: 1846 KFSVSLYAACCRLLYTAVKHHKSQTEGSIATLQESVSALLHCLE---TAGKNLGNCVSWE 1902

Query: 266  VQEAVICASSLRRVYEEVRQQKDTFGQCSFQFLSRYIWVYCGFGPARNGIIREVDEALKP 87
            V+E + CA  LRR+YEE+RQQK+ FGQ  F+FLS YIWV  G+GP + G+ REVDEAL+P
Sbjct: 1903 VEEGIRCACFLRRIYEELRQQKEVFGQHCFKFLSTYIWVSSGYGPLKTGLEREVDEALRP 1962

Query: 86   GLYALIDSCSADDLQLLHSLFGEGPCRS 3
            G+YALIDSCS +DLQ LH++FGEGPCR+
Sbjct: 1963 GVYALIDSCSPNDLQYLHTVFGEGPCRN 1990


>ref|XP_004150076.1| PREDICTED: uncharacterized protein LOC101208263 [Cucumis sativus]
          Length = 1981

 Score =  388 bits (997), Expect = e-105
 Identities = 221/503 (43%), Positives = 310/503 (61%), Gaps = 22/503 (4%)
 Frame = -1

Query: 1445 LKKSLLMQVFRGENAEAAYFLRQLFNACSAILRLNLQIDITSSSWSLYVIMVDISESLLL 1266
            L +  L  + +G   E  + L+QLF A S ILRL+ Q D T  S S   I++ IS  LLL
Sbjct: 1461 LNQPFLRGLLQGSYPEVNFALKQLFLAASRILRLHKQYDTTPLSSSSMTILIGISRFLLL 1520

Query: 1265 EFLRS-QMPHQFAFLWLDGVVKFLEELGSYFPHFDPSLSRDFYVKLIGLHLRTIGKCICL 1089
            EF+    +P  F     DGV+K+LEELG  F   DP  SR+ Y +LI LHL+ +GKCICL
Sbjct: 1521 EFVDMVDVPQPFLLACFDGVLKYLEELGHLFRFADPVQSRNLYSELINLHLQAVGKCICL 1580

Query: 1088 QGKEAKLASQETGSHSKL--GDQVQSLVXXXXXXXXXXXXXXXXSFTKYVSKSSDLHLLS 915
            QGK A LAS ET S +K   G   +                   SF  ++ ++++LHLLS
Sbjct: 1581 QGKRATLASHETESTTKTLDGGFFKESSFPGVYCMDEFKASLRMSFKVFIREATELHLLS 1640

Query: 914  AIQAVERALVGVQEGLMTNYEIXXXXXXXXXXXXXXXXGIDALDLILEFVTGPRRLNMIK 735
            A+QA+ERALVGVQEG  T Y +                G++ LDL+LE  +G + + +IK
Sbjct: 1641 AVQAIERALVGVQEGCTTIYGLYSGSEDGGKCSSIVAAGVECLDLVLEIFSGRKCMGVIK 1700

Query: 734  KHIQSLVGCLINVIVHLQGPSIFYGCVDPLKAYEKPDSGSVILMCIEILTKISGKPSFFQ 555
            +HI+SL   L+++++HLQ P IFY  +  +K    PD GSVILM IE+LT++SGK + FQ
Sbjct: 1701 RHIESLTAGLLSIVLHLQSPQIFYRMI-AMKDRSDPDPGSVILMSIEVLTRVSGKHALFQ 1759

Query: 554  LDACHIAQSLRAPGALFQYF-LRLQ----ISEAPIQPA--------------MDRKISVE 432
            ++   ++Q LR P ALF+ F L+L      SE  +  A              +D++ +++
Sbjct: 1760 MNVWQVSQCLRIPAALFENFSLKLPGIATESECSLISAQETSSVVVTTSSSTIDKQFTID 1819

Query: 431  LYAACCRMLCTALKHHKSETRQCIALLEDSVGALLHCLEILNTERVAVRESFAWEVQEAV 252
            L+AACCR+L T +KH KSE ++ IA L+ SV  LL  LE ++ +  ++   F+W+V+E V
Sbjct: 1820 LFAACCRLLYTIIKHRKSECKRSIAQLQASVSVLLQSLESVDPDPKSMGGYFSWKVEEGV 1879

Query: 251  ICASSLRRVYEEVRQQKDTFGQCSFQFLSRYIWVYCGFGPARNGIIREVDEALKPGLYAL 72
             CAS LRR+YEE+RQQ+D   +    FLS YIW Y G GP ++GI RE+D+AL+PG+YAL
Sbjct: 1880 KCASFLRRIYEEIRQQRDIVERHCALFLSDYIWFYSGHGPLKSGIRREIDDALRPGVYAL 1939

Query: 71   IDSCSADDLQLLHSLFGEGPCRS 3
            ID+CSA+DLQ LH++FGEGPCR+
Sbjct: 1940 IDACSAEDLQYLHTVFGEGPCRN 1962

