BLASTX nr result

ID: Scutellaria23_contig00011908 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria23_contig00011908
         (2402 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI34399.3| unnamed protein product [Vitis vinifera]              269   3e-69
ref|XP_002509869.1| conserved hypothetical protein [Ricinus comm...   235   4e-59
ref|XP_003532424.1| PREDICTED: uncharacterized protein LOC100819...   226   3e-56
ref|XP_003525371.1| PREDICTED: uncharacterized protein LOC100782...   218   5e-54
ref|XP_004146572.1| PREDICTED: uncharacterized protein LOC101222...   185   6e-44

>emb|CBI34399.3| unnamed protein product [Vitis vinifera]
          Length = 691

 Score =  269 bits (687), Expect = 3e-69
 Identities = 198/620 (31%), Positives = 305/620 (49%), Gaps = 29/620 (4%)
 Frame = +3

Query: 372  WSKITDPELHREGCQSFILQDDDAS---TDHGLVNTSSPLVLGLEAXXXXXXXXXXXXXN 542
            W KI   E+ R   Q   LQ +DA    +  G  N+SS L+ G E              +
Sbjct: 73   WDKIPHIEICRRASQMACLQGEDAPEHLSSEG-TNSSSLLIPGPEGSQLLRKAGKTPRSS 131

Query: 543  G---GCGKRSGMVQMDVSRRKNGVHEVNGTSTSPASSPSCNMSEKTHGVKHKNGHNCRCG 713
            G   GC KR    Q + S R +G  ++ G S+ P         EK+  V+ KN  N + G
Sbjct: 132  GLPSGCFKRPRTAQTEDSTRLSGADDMKGISSYPTKG---TFPEKSQVVRQKNNFNGKRG 188

Query: 714  DKRNSKVH-KNRXXXXXXXXXXXXXXXXXXXXXXXXVYGLKTDVFDITKYVNDLSLDELF 890
            +KRN KV  + +                        +YGLK+D+ D+TK V+++S++ L 
Sbjct: 189  EKRNFKVPTRTKYDSFSLKAGLTSFSSAGGGNSILGIYGLKSDIHDVTKLVDEISVNRLL 248

Query: 891  RGQYSKPSNAEDKGKSAANSNDNLLLSVRKACSVLQDKKQFQAPKYAEIDNTCIQKVSTG 1070
             G Y  PS  +DKGK A ++N+N+L SVRKACS+LQ ++  Q+ +++E D +  +K+ST 
Sbjct: 249  DGTYKCPSLGKDKGKKAVSTNENILHSVRKACSLLQLRRPVQSQQFSETDCSSNRKLSTC 308

Query: 1071 SMTTSSAIEQT-DYDKGDSCPAELPPDKVKKSDYKVAASNSVTDDPLYLPKFVLERXXXX 1247
            S  + S +    + DKGD+   +L     K S  K     ++ D  L+ PK +LER    
Sbjct: 309  SSNSFSCVASNINGDKGDAYRMDLS-SCYKDSCSKPETPFNMLDFSLHQPKDILERLALP 367

Query: 1248 XXXXXXXXXXXSGKTASSSKGCIDPCLGKSNFQRIGLPPFPWSHSISGHNKLGTDSVKLS 1427
                       + K A SSK   D  LGK    R  LPPFPW+H+ SGH K  +D+VKLS
Sbjct: 368  PPKDLESLLLDAVKPAGSSKSTPDQRLGKPISHRANLPPFPWAHTFSGHCKTNSDAVKLS 427

Query: 1428 TNRSMCHGRWSKVKNSTSLEKCSLDLAQDFESLVFNQSLVPKVNL----------TSSEF 1577
            T+RS C GRW ++ ++        D  +D ES  ++QSLVP   L          TS+ F
Sbjct: 428  TSRSTCQGRWQRIGSTAGSLGDVTDCFKDLESFTYDQSLVPSQGLKLGVLENEVGTSASF 487

Query: 1578 PENETAAEQVLSSSRACSTPSVPTGE--------HSP-TYSAAKTLCEMAAYSSKENLSA 1730
            P ++       + S+A   P    G         HSP    AA+TL  +AA+S +++ + 
Sbjct: 488  PLHDWCPSSSTTCSKASHLPPESAGSLKNEGDGGHSPRLIEAAQTLFSLAAHSLRQHPNG 547

Query: 1731 LVKLLEKPCNTAMRALKLNASEKSKNLFDARKTMVRPHNPVKVGDDGLPSKKLRRSIDLT 1910
            ++K  +K     M+A K  ++E+ +++  A K++V      K  +   PSKK +      
Sbjct: 548  IMKWPKKSLQKPMKARKTKSNERPEHI-SATKSVVVSDYASKNTEQITPSKKPKLCATEK 606

Query: 1911 SAYVDHTESIKRRTLHWS-TPESLASSPRKLIRHSNAGTDILGVNLVKKSYMMKPP-RSG 2084
                 H+ S ++  ++WS TP S+ SSP K +R     ++    ++ K+S M+ PP R  
Sbjct: 607  KKDFGHS-SARKGLINWSTTPRSIRSSPSKSVRDPTGDSNHHNASITKQSCMLPPPTRIL 665

Query: 2085 DRPSSSQQKTGKAIPLKWSR 2144
            D+  +SQQK  +  P++WSR
Sbjct: 666  DKAGNSQQKPKRPGPVEWSR 685


>ref|XP_002509869.1| conserved hypothetical protein [Ricinus communis]
            gi|223549768|gb|EEF51256.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 632

 Score =  235 bits (600), Expect = 4e-59
 Identities = 174/538 (32%), Positives = 257/538 (47%), Gaps = 5/538 (0%)
 Frame = +3

Query: 546  GCGKRSGMVQMDVSRRKNGVHEVNGTSTSPASSP-SCNMSEKTHGVKHKNGHNCRCGDKR 722
            GC KR  +  ++ +     +       +   S P  C  +EKT   K +N  + + GD+R
Sbjct: 121  GCSKRPRVTLLEDTTDPATIDNAKEACSKQVSHPIKCESNEKTQSAKQRNNFSSKRGDRR 180

Query: 723  NSKV-HKNRXXXXXXXXXXXXXXXXXXXXXXXXVYGLKTDVFDITKYVNDLSLDELFRGQ 899
            NSKV  K +                        +YGLKTDV DITK V+DLSLD+L +G 
Sbjct: 181  NSKVLTKTKYDSFSVKASLASFSSAAAGNNFFGLYGLKTDVHDITKLVDDLSLDDLLQGI 240

Query: 900  YSKPSNAEDKGKSAANSNDNLLLSVRKACSVLQDKKQFQAPKYAEIDNTCIQKVSTGSMT 1079
            Y  P   +DKGK A N+ +++L SVRKACS+LQ  +  Q   +AEID+   + + TG  T
Sbjct: 241  YECPKLGKDKGKKATNTTESVLHSVRKACSILQLTRSAQFQNFAEIDSCSNEIIPTGQTT 300

Query: 1080 TSSAIEQTDYDKGDSCPAELPPDKVKKSDYKVAASNSVTDDPLYLPKFVLERXXXXXXXX 1259
            + S +   + D GDS   EL     K+S  K  +S +  +     PK  LER        
Sbjct: 301  SISIV--GNGDNGDSSMTELCSYN-KESCSKRESSANFLNLSFEQPKGTLERLALPPPKD 357

Query: 1260 XXXXXXXSGKTASSSKGCIDPCLGKSNFQRIGLPPFPWSHSISGHNKLGTDSVKLSTNRS 1439
                   + K A SS+   DP  GK   +R  LPPFPWSH+  G+ +  +D+ KL T+RS
Sbjct: 358  LEALLLDAAKPAVSSRNAPDPRPGKQASRRPSLPPFPWSHTFGGNCRTNSDANKLLTSRS 417

Query: 1440 MCHGRWSKVKNSTSLEKCSLDLAQDFESLVFNQSLVPKVNLTSSEFPENETAAEQVLSSS 1619
             C GRW K+ N+ +    SL +   + SL    SL       S    +N           
Sbjct: 418  TCQGRWVKLGNTFN----SLGIVPKYTSLTLPSSLGTDFIKESGSDLKNHR--------- 464

Query: 1620 RACSTPSVPTGEHSPTYSAAKTLCEMAAYSSKENLSALVKLLEKPCNTAMRALKLNASEK 1799
                      G+     +AA+TL ++A  +S+ N   +VK  +KP    M+A K  ++EK
Sbjct: 465  --------KVGQCPRLLAAAQTLYDIATSTSRLNQDGMVKWPKKPSQKVMKARKSKSAEK 516

Query: 1800 SKNLFDARKTMVRPHNPVKVG--DDGLPSKKLRRSIDLTSAYVDHTESIKRRTLHWSTPE 1973
             +++F A  T+V   + ++    D  L SK+L+ S      Y+ H   +++  ++WSTP+
Sbjct: 517  PEDIF-APSTLVMGSDQMEKNSVDHTLTSKRLKLSTIENKNYLSHVNGVRKGPINWSTPK 575

Query: 1974 SLASSPRKLIRHSNAGTDILGVNLVKKSYMMKPP-RSGDRPSSSQQKTGKAIPLKWSR 2144
            S  SSP K +R S A        +VK+S M  PP +   R  + QQK  K I   W+R
Sbjct: 576  SSRSSPNKSVRDSTA-------CIVKQSCMTPPPAKVLHRSCNGQQKFHKLIRTDWNR 626


>ref|XP_003532424.1| PREDICTED: uncharacterized protein LOC100819206 [Glycine max]
          Length = 660

 Score =  226 bits (575), Expect = 3e-56
 Identities = 172/563 (30%), Positives = 266/563 (47%), Gaps = 28/563 (4%)
 Frame = +3

Query: 540  NGGCGKRSGMVQMDVSRRKNGVHEVNGTSTSPAS-SPSCNMSEKTHGVKHKNGHNCRCGD 716
            N  C KR  M Q + S   NG+ E    S    S + +C   EK    K K+  + R GD
Sbjct: 101  NSSCSKRPRMSQPEDSLSPNGIEESKDISDKLGSHNLNCTSPEKNQLPKQKSNSSKR-GD 159

Query: 717  KRNSKVH--KNRXXXXXXXXXXXXXXXXXXXXXXXXVYGLKTDVFDITKYVNDLSLDELF 890
            KRN KV   K +                        +YGLK D  D+TK +++  LDEL 
Sbjct: 160  KRNFKVPSAKAKFESSSMKMGASIFSSTSGGNNFFGLYGLKHDFHDVTKLIDEPPLDELL 219

Query: 891  RGQYSKPSNAEDKGKSAANSNDNLLLSVRKACSVLQDKKQFQAPKYAEIDNTCIQKVSTG 1070
            RG +  P  ++DKGK  ++ +D+ L SVRKACS+LQ  K  Q+    E+D +   K+ST 
Sbjct: 220  RGTFECPILSKDKGKKTSSVSDSFLNSVRKACSILQCPKPVQSQNMTEMDYSSNMKMSTC 279

Query: 1071 SMTTSSAIEQT-DYDKGDSCPAELPPDKVKKSDYKVAASNSVTDDPLYLPKFVLERXXXX 1247
             +++  A+E   + DK  SC  ++   + K    +V ++ S  D PL+ PK VLER    
Sbjct: 280  QLSSVCAVESVGNGDKEQSCTLDMSSCQ-KDHCSEVESTTSPLDFPLHQPKDVLERIALH 338

Query: 1248 XXXXXXXXXXXSGKTASSSKGCIDPCLGKSNFQRIGLPPFPWSHSISGHNKLGTDSVKLS 1427
                         K A ++K  ID   GK   +R  LP FPWSH+  GH++  +D+ KLS
Sbjct: 339  PFQDLESLLLDVSKPAVTTKNGIDQRSGKQVSRRPSLPTFPWSHAFGGHSRTNSDTGKLS 398

Query: 1428 TNRSMCHGRWSK---VKNSTSLEKCSLDLAQDFESLVFNQSLVPKVNLTSSEFPENETAA 1598
            T+RSMC G+WS+   + +ST  ++ S     + +S  ++QSLVP    + S   +N ++ 
Sbjct: 399  TSRSMCQGKWSRTCVIASSTDADRSSF---TNLDSFSYDQSLVPS---SGSSDKKNFSSL 452

Query: 1599 EQVL-------SSSRACSTPSVPTGEHS-------------PTYSAAKTLCEMAAYSSKE 1718
               L       SSS +CS  S    E                  +AA+TLCE+A +S ++
Sbjct: 453  FANLPFHLLDSSSSVSCSEDSWAKAEFGGPADTKENDERCPRVLTAAQTLCEIATHSQRQ 512

Query: 1719 NLSALVKLLEKPCNTAMRALKLNASEKSKNLFDARKTMVRPHNPVKVGDDGLPSKKLRRS 1898
            N   +++   K     M+A    ++EK +       +M+      +  +  +PSKK R S
Sbjct: 513  NSDGILRWQRKTSQKTMKACHYKSNEKLEETSSRPISMIGSDMVARSVEQIMPSKKPRLS 572

Query: 1899 IDLTSAYVDHTESIKRRTLHWSTPESLASSPRKLIRHSNAGTDILGVNLVKKSYMMKPPR 2078
            I + +    ++   K+  + W   +S  S P K +R S         +++K+  MM PP 
Sbjct: 573  I-VENKNSGYSNIAKKGHIVWPISKSSRSFPSKQVRDSFVENKRTNASILKQHCMMPPPA 631

Query: 2079 SG-DRPSSSQQKTGKAIPLKWSR 2144
             G D+    QQ+ GK + + W R
Sbjct: 632  RGLDKTRDGQQQVGKLVVMDWKR 654


>ref|XP_003525371.1| PREDICTED: uncharacterized protein LOC100782637 [Glycine max]
          Length = 659

 Score =  218 bits (556), Expect = 5e-54
 Identities = 166/557 (29%), Positives = 258/557 (46%), Gaps = 22/557 (3%)
 Frame = +3

Query: 540  NGGCGKRSGMVQMDVSRRKNGVHEVNGTSTSPA-SSPSCNMSEKTHGVKHKNGHNCRCGD 716
            N  C KR  M Q + S   NG+ E    S      + +C   EK    K K+  + R GD
Sbjct: 101  NSSCSKRPRMSQPEDSLSPNGIEESKDISDKLGLHNLNCTSPEKNQLPKQKSNSSKR-GD 159

Query: 717  KRNSKVH--KNRXXXXXXXXXXXXXXXXXXXXXXXXVYGLKTDVFDITKYVNDLSLDELF 890
            KRN KV   K +                        +YGLK D  D+TK + +  L+EL 
Sbjct: 160  KRNFKVPSVKAKFESSSMKMGASIFSFTSGGNNFFGLYGLKHDFHDVTKLMEEPPLEELL 219

Query: 891  RGQYSKPSNAEDKGKSAANSNDNLLLSVRKACSVLQDKKQFQAPKYAEIDNTCIQKVSTG 1070
            RG +  P  ++DKGK  ++ +D+ L SVRKACS+LQ  K  ++   AE+D +   K+ST 
Sbjct: 220  RGTFDFPILSKDKGKKTSSMSDSFLNSVRKACSILQHPKPIRSQNMAEMDYSSNMKMSTC 279

Query: 1071 SMTTSSAIEQT-DYDKGDSCPAELPPDKVKKSDYKVAASNSVTDDPLYLPKFVLERXXXX 1247
             +++  AIE   + DK  SC  ++   + K    +V ++ S  D PL+ PK VLER    
Sbjct: 280  QLSSVCAIESVGNGDKEQSCTLDMSSCQ-KDHCSEVESTTSPLDFPLHQPKDVLERIALH 338

Query: 1248 XXXXXXXXXXXSGKTASSSKGCIDPCLGKSNFQRIGLPPFPWSHSISGHNKLGTDSVKLS 1427
                         K A ++K   D   GK   +R  LP FPWSH+  GH++  +D+ KLS
Sbjct: 339  PFQDLESLLLDVSKPAVTTKNGNDQRSGKQVSRRPSLPTFPWSHAF-GHSRTNSDAGKLS 397

Query: 1428 TNRSMCHGRWSKVKNSTSLEKCSLDLAQDFESLVFNQSLVPK-------------VNLTS 1568
            T+RSMC G+WS++    S          + +S  ++QSLVP               NL  
Sbjct: 398  TSRSMCQGKWSRIGVIASSTDADRSSFSNLDSFSYDQSLVPSSGSSDKRNFSSLFANLPF 457

Query: 1569 SEFPENETAAEQVLSSSRACSTPSVPTGEHSP----TYSAAKTLCEMAAYSSKENLSALV 1736
             +   + +     +S ++A     V T E+        +AA+TLCE+A +  +++   ++
Sbjct: 458  HQLDSSSSVPCSEISQAKAEFGGQVDTKENDERCPIILTAAQTLCEIATHLMRQSSDGIL 517

Query: 1737 KLLEKPCNTAMRALKLNASEKSKNLFDARKTMVRPHNPVKVGDDGLPSKKLRRSIDLTSA 1916
            K   K    AM++    + EK +       +M+      +  +  +PSKK R SI + + 
Sbjct: 518  KWQRKTSLKAMKSCHYKSDEKLEETSSRPISMIGSDMVARSVEQIMPSKKPRLSI-VENK 576

Query: 1917 YVDHTESIKRRTLHWSTPESLASSPRKLIRHSNAGTDILGVNLVKKSYMMKPP-RSGDRP 2093
               H+   K+  + W   +S  S P K IR S         +++K+ YMM PP R  D+ 
Sbjct: 577  NSGHSNIAKKGHIVWPISKSSRSFPSKQIRDSFVENKRTNASILKQHYMMPPPARDLDKA 636

Query: 2094 SSSQQKTGKAIPLKWSR 2144
               Q++ GK + + W R
Sbjct: 637  HDGQKQVGKVVAMDWKR 653


>ref|XP_004146572.1| PREDICTED: uncharacterized protein LOC101222259 [Cucumis sativus]
          Length = 498

 Score =  185 bits (469), Expect = 6e-44
 Identities = 151/491 (30%), Positives = 226/491 (46%), Gaps = 46/491 (9%)
 Frame = +3

Query: 819  VYGLKTDVFDITKYVNDLSLDELFRGQYSKPSNAEDKGKSAANSNDNLLLSVRKACSVLQ 998
            +YGLK+DV D TK  +D  L+ L  G Y   + ++DKG+   N N+  L S+RKACSVLQ
Sbjct: 10   LYGLKSDVHDFTKLTDDPPLNGLLDGSYDCANLSKDKGRKDTNVNECFLQSIRKACSVLQ 69

Query: 999  DKKQFQAPKYAEIDNTCIQKVSTGSMTTSSAIEQTDYDKGDSCPAELPPDKVKKSDYKVA 1178
                       E ++    K ST  +++ S++E+          A    D    +  + A
Sbjct: 70   LPLPVHPQNMPESESCSNSKPSTSLVSSVSSMEERANFDAKGTSASWATDSPSLNKVQDA 129

Query: 1179 ASNS-----VTDDPLYLPKFVLERXXXXXXXXXXXXXXXSGKTASSSKGCIDPCLGKSNF 1343
             SNS       D  L+ P  +  +               + K++  SK   D    K  F
Sbjct: 130  CSNSEPLANALDFELHKPDDMFVKLGLPLPKDLESLLQDASKSSVPSKNATDLRSAKQQF 189

Query: 1344 QRIGLPPFPWSHSISGHNKLGTDSVKLSTNRSMCHGRWSKVKNSTSLEKCSLD-LAQDFE 1520
            +R  L PFPWSHS +GH+K  +DS KLS NR+ C GRW +V N +++   + D   +D E
Sbjct: 190  RRAMLQPFPWSHSFNGHSKASSDSSKLSANRTTCPGRWWRVGNFSNIPSATTDCFTKDLE 249

Query: 1521 SLVFNQSLVPKVNLT------SSEFPENETAAEQVLSSSRACSTPS-------------- 1640
            SL FN +L P            S    N         SS  CS  S              
Sbjct: 250  SLTFNHNLFPSTMRVVGSKDGGSFVSVNHNQCGWDSLSSATCSKTSSVLVESRGKINHEA 309

Query: 1641 --------VPTGEHSP-TYSAAKTLCEMAAYSS-KENLSALVKLLEKPCNTAMRALKLNA 1790
                    +   +H P   +AA+TLC++A  +S ++N+  +V+  +KP   +M+A KL  
Sbjct: 310  NGMSFNYPICCEQHCPRVMAAAQTLCDIATSASLRQNIDGIVRWPKKPSQKSMKARKLK- 368

Query: 1791 SEKSKNLFDARKTMVR--PHNPVKVGDDG------LPSKKLRRSIDLTSAYVDHTESIKR 1946
            SE+++ L+  + T+ R   +NP K  ++G      L   KL  + +     +  T + +R
Sbjct: 369  SEETEELY-TKPTIYRLWSNNPFK--NEGHQTPHPLKKPKLGTTTENRRDNIAQT-NCRR 424

Query: 1947 RTLHWSTPESLASSPRKLIRHSNAGTDILGVNLVKKSYMMKPPRSG--DRPSSSQQKTGK 2120
              L+WSTP S  SSP K I+ S + T    V  VK+S MM PP +    +    QQKT K
Sbjct: 425  GPLNWSTPRSSRSSPSKFIKDSVSDTKQSTVGTVKQSSMMPPPATTLLCKAGDGQQKTRK 484

Query: 2121 AIPLKWSRPEG 2153
             + + W R  G
Sbjct: 485  LMLMDWKRGGG 495


Top