BLASTX nr result

ID: Scutellaria22_contig00005992

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria22_contig00005992
         (2156 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI35923.3| unnamed protein product [Vitis vinifera]              198   4e-48
ref|XP_002521366.1| conserved hypothetical protein [Ricinus comm...   184   1e-43
ref|XP_002279061.1| PREDICTED: uncharacterized protein LOC100261...   164   1e-37
gb|AAM13859.1| unknown protein [Arabidopsis thaliana]                 155   5e-35
ref|NP_177422.1| hydroxyproline-rich glycoprotein family protein...   155   5e-35

>emb|CBI35923.3| unnamed protein product [Vitis vinifera]
          Length = 628

 Score =  198 bits (504), Expect = 4e-48
 Identities = 201/664 (30%), Positives = 273/664 (41%), Gaps = 47/664 (7%)
 Frame = -3

Query: 2100 CNKNSKTHKKQTQKLQRAKARKQHEMEDGDGEDRPPFWLQNATHLRRGDRLRRGXXXXXX 1921
            C+   ++ +KQ Q       +   E  + +G   P FW+  ++  RR  R  R       
Sbjct: 13   CDCPIQSSQKQNQNQNTRSKKNTLETMEEEGATTP-FWMPASSGHRRR-RSSRSPSSIFL 70

Query: 1920 XXXXXXXXXXXXXXXXXXXXVPSTLSFSAHIFKPNSVKKSWDSLNVVLVIFAVVFGFLSR 1741
                                +P  LSF+++IFKPN VKKSWDSLN+VLV+FA++ GFLSR
Sbjct: 71   SSGFLIIFLPLTALLFIVFVLPPILSFTSYIFKPNMVKKSWDSLNLVLVLFAIICGFLSR 130

Query: 1740 NKNEERDSYFDGFQSSPVKENGSQKSFDFERNVEQKYE---SEQKNLMLKRNSSSYPDLR 1570
                           S V E  +Q+S     N    YE   S    +   R+SSSYPDLR
Sbjct: 131  GGGGGSSDMESSV--SEVPEESTQRS-----NHGHCYEERISGYGGMRRMRSSSSYPDLR 183

Query: 1569 EFSSVNWSYGDYQARFYDDINVDSGRVSDQGLIHHHRR---HRSLEQVDYLXXXXXXXXX 1399
            + S+  W+ GD + R +DD  +D+ RV     +  HR+    R  E  DY          
Sbjct: 184  QESA--WAGGDGRWRSFDDTQLDNHRV-----LGSHRQLYIRRRYEDQDYC--------- 227

Query: 1398 XXVDTLVRESKKXXXXXXXXXXXXXXXAEDEEIPKNAVARKDRLSRKINKELESLSYVAA 1219
                                           E+    V     +S K  K L  +     
Sbjct: 228  -------------------------------EVKNIDVDNTSMISPK-EKVLSHIPPRPP 255

Query: 1218 SKP--PLPPVGESPAPESQENQKRAHERVARRKERSNRK----QIKDVEAIDTVTAPXXX 1057
            S P  P PP    P P  +   KR+ + VAR + R  R+    + K V+A      P   
Sbjct: 256  SPPLPPSPPPPPPPPPVVKRKVKRSFQAVAREERRETRENSSFESKRVQAAPPPPPPPPP 315

Query: 1056 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPSKEKISKSDRKRGGATGSSTKEFLN 877
                                                + +  KSDRKRGGAT    KEFL 
Sbjct: 316  PPPPLAV-----------------------------ERRSEKSDRKRGGAT----KEFLT 342

Query: 876  SLYHXXXXXXXXK--SVDNMESLLHQA-----------EAPLSLQIXXXXPSVFQNLFST 736
            SLY+        +  S++N++++LH +             P    +     SVF NLFS+
Sbjct: 343  SLYYQRNKKKKQRQKSMENLDTILHNSPHSDQPLRPPPSPPPPPPLPPPPNSVFHNLFSS 402

Query: 735  KKQKRKRTITVTLEPLPP-----QRAEARDPEPT-------PRPPEVTAGKPPQPIKMNS 592
            KK K KR +TV   P PP      RA A   +         P    + A KPP P K +S
Sbjct: 403  KKGKSKRFLTVPPPPPPPPPPPASRAYAGKTKTKIALSRSHPYDHPLNASKPPIPEKSSS 462

Query: 591  FDKVEEASNSGGESPLNRIXXXXXXXXXAFFRSPAWKFVVQGDYVRIXXXXXXXXXSPDP 412
            F+ V+    +G ES L  I           F+ P WKFVV GDYVRI         SPD 
Sbjct: 463  FNSVDGNPYAGSESLL--IPVPPPPPPPPPFKMPDWKFVVHGDYVRIKSTNSSRSGSPDL 520

Query: 411  D--DTESDVTPSAAVAFHPS--------PLFCASPDVNTKAESFITNFRAKLKLEKIHSM 262
            D   + S   PS + +            PLFC SPDVNTKA++FI  FRA LKLEKI+S+
Sbjct: 521  DYIGSPSSKGPSRSTSLKSETEGGDSAQPLFCPSPDVNTKADTFIARFRAGLKLEKINSI 580

Query: 261  KKRE 250
            K+++
Sbjct: 581  KEKQ 584


>ref|XP_002521366.1| conserved hypothetical protein [Ricinus communis]
            gi|223539444|gb|EEF41034.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 553

 Score =  184 bits (466), Expect = 1e-43
 Identities = 182/621 (29%), Positives = 250/621 (40%), Gaps = 36/621 (5%)
 Frame = -3

Query: 2007 EDRPPFWLQNATHLRRGDRLRRGXXXXXXXXXXXXXXXXXXXXXXXXXXVPSTLSFSAHI 1828
            ED PPFWLQ      RG RLRR                           VPS ++F++ +
Sbjct: 5    EDVPPFWLQATDQHHRGRRLRRQASSIFLNSGVILIMLLVIAFVFVFVVVPSVVTFTSQV 64

Query: 1827 FKPNSVKKSWDSLNVVLVIFAVVFGFLSRNKNEERDSYFDGFQSSPVKENGSQKSFDFER 1648
            FKPN +KK WDSLN VLV+FA+V GFL RN     +     +Q   +  + S  S + ++
Sbjct: 65   FKPNLIKKGWDSLNFVLVLFAIVCGFLGRNSPNTSNESSTSYQR--LSSSSSASSSNVQQ 122

Query: 1647 NVEQKYES-----------------EQKNLMLKRNSSSYPDLREFSSVNWSYGDYQARFY 1519
            +V++ Y S                         R+  SYPDLR+ S   WS  D + RFY
Sbjct: 123  DVQRSYPSTPAYRWYDDGQYQDRTASYNTFNRLRSFRSYPDLRQESL--WSNNDERWRFY 180

Query: 1518 DDINVDSGRVSD----QGLIHHHRRHRSLEQVDYLXXXXXXXXXXXVDTLVRESKKXXXX 1351
            DD  V+  + S       L   H   +  ++ D                  +E +K    
Sbjct: 181  DDTRVNGYKFSSPLHQDELQDDHPPQQQQQEQD------------------QEPRK---- 218

Query: 1350 XXXXXXXXXXXAEDEEIPKNAVARKDRL--SRKINKELESLSYVAASKPPLPPVGESPA- 1180
                        +D+E  ++ V+ KD    +  I+KE      V    PP+PP   SP  
Sbjct: 219  ------------QDQEQEQD-VSTKDIAVDTFVIHKE----EVVQTPPPPMPPAPVSPPR 261

Query: 1179 -PESQENQKRAHERVARRKERSNRKQIKDVEAIDTVTAPXXXXXXXXXXXXXXXXXXXXX 1003
             P     ++RA        E   R++ K++E + T+  P                     
Sbjct: 262  LPTRSTVKRRAKRTYHDLGEHEKRRENKNLE-VKTINIPPPPPPPQLIS----------- 309

Query: 1002 XXXXXXXXXXXXXXXXPSKEKISKSDRKRGGATGSSTKEFLNSLYHXXXXXXXXKSVDNM 823
                                  SKSD++RG       K+ L SL          KSV+N+
Sbjct: 310  ----------------------SKSDKRRG-------KDLLISL-RRKRKKQRQKSVENL 339

Query: 822  ESLLHQAEAPLSLQIXXXXPS-----VFQNLFSTKKQKRKRTITVTL-EPLPPQRAEARD 661
            ESL +    P  +      P       FQNLFS+KK K K+  + ++ +P PP R     
Sbjct: 340  ESLFNPEPLPSIIPPPPPPPPPPPPHFFQNLFSSKKGKTKKDHSHSVPQPQPPSRTHRS- 398

Query: 660  PEPTPRPPEVTAGKPPQPIKMNSFDKVEEASNSGGESPLNRIXXXXXXXXXAFFRSPAWK 481
               T +   + A KP + +K  +F  VEE    G  SPL  I           F+   WK
Sbjct: 399  -RTTVQEATIEAYKPLKAVKTGNFSSVEENVERGNASPLIPIPPPPPPPPPPPFKMKPWK 457

Query: 480  FVVQGDYVRIXXXXXXXXXSPD-----PDDTESDVTPSAAVAFHPSPLFCASPDVNTKAE 316
            F+  GDYVR+         SPD     P D ES             P FC SPDVNTKAE
Sbjct: 458  FISDGDYVRVASFNSSRSGSPDIDSEDPSDKESSPMARNKEGDSAMPSFCPSPDVNTKAE 517

Query: 315  SFITNFRAKLKLEKIHSMKKR 253
            +FI  FRA LKLEKI+S+K R
Sbjct: 518  NFIARFRAGLKLEKINSVKGR 538


>ref|XP_002279061.1| PREDICTED: uncharacterized protein LOC100261010 [Vitis vinifera]
          Length = 555

 Score =  164 bits (414), Expect = 1e-37
 Identities = 164/568 (28%), Positives = 237/568 (41%), Gaps = 32/568 (5%)
 Frame = -3

Query: 1857 PSTLSFSAHIFKPNSVKKSWDSLNVVLVIFAVVFGFLSRNKNEERDSYFDGFQSS----- 1693
            PS L+F++   +PNSV+KSWDSLNV+LV+FA++ G  +R  +E+ D   +   SS     
Sbjct: 52   PSFLNFTSQFLRPNSVRKSWDSLNVLLVLFAILCGVFARKNDEKNDDVLENHGSSGSVVM 111

Query: 1692 -PVKENGSQKSFDFERNVEQKYESEQKNLMLKRNSSSYPDLREFSSVNWSYGDYQARFYD 1516
                E+ S   F+F          +  ++ L+R+SSSYPDLR+ S   W  GD + RF+D
Sbjct: 112  GKSHESISHSLFEFSDRKIYDPPIQSGSVRLRRSSSSYPDLRQESL--WGAGDDRRRFFD 169

Query: 1515 DINVDSGRVSDQGLIHHHRRHRSLEQVDYLXXXXXXXXXXXVDTLVRESKKXXXXXXXXX 1336
            D  V++ R        + RRHR  E                   L R+            
Sbjct: 170  DFEVNNYRSPASS--DYVRRHRRSE-------------------LERDDS---------- 198

Query: 1335 XXXXXXAEDEEIPKNAVARKDRLSRKINKELESLSYVAASKPPLPPVGESPAPESQENQK 1156
                   E + IP +  A +            S S      PP PP    P P  Q   +
Sbjct: 199  -------EVKVIPVDTFAVRSS---------PSPSPAPPRTPPPPP--PPPPPIVQRKPR 240

Query: 1155 RAHERVARRKERSNRKQIKDVEAIDTVTAPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 976
            R++E VAR+++ SN     D +      +P                              
Sbjct: 241  RSYETVARKEKLSN----SDADQFKKSRSPPAPPPPPPPPPPPRVPGGHLP--------- 287

Query: 975  XXXXXXXPSKEKISKSDRKRGGATGSSTKEFLNSLYHXXXXXXXXKSVDNMESLLHQAEA 796
                     ++K  KS R+ GGAT      F+ SLY+        ++ +  E+ +    +
Sbjct: 288  ---------EQKSRKSARRMGGATKDIATVFV-SLYNQTRKKKKQRTKNIHENAVQSPPS 337

Query: 795  -----PLSLQIXXXXPSVFQNLFSTKKQKRKRTITVTLEPLPPQ--------RAEARD-- 661
                 P         PS+  NLF  K  K KR  +V+  P PP         R+  R   
Sbjct: 338  ATTPTPPPPPPPPPPPSMLHNLFR-KGSKSKRIHSVSAPPPPPPPPPRPPPPRSSKRKTH 396

Query: 660  -----PEPTPRPPEVT-----AGKPPQPIKMNSFDKVEEASNSGGESPLNRIXXXXXXXX 511
                 P P P PP  T     AGKPP P + +SF   ++  NSGG+SPL  +        
Sbjct: 397  IPPAPPTPPPPPPPDTSRRRAAGKPPLPARKSSFYNRDDNVNSGGQSPLIPMPPPPPP-- 454

Query: 510  XAFFRSPAWKFVVQGDYVRIXXXXXXXXXSPDPDDTESDVTPSAAVAFHP-SPLFCASPD 334
               FR P  K+VV+GD+VRI         SP+ DD +     SA          FC SPD
Sbjct: 455  ---FRMPELKYVVRGDFVRIRSTHSSRCSSPELDDVDLSSNKSAMDGGDAIGATFCPSPD 511

Query: 333  VNTKAESFITNFRAKLKLEKIHSMKKRE 250
            VN KA++FI   R + +LEKI+S+++R+
Sbjct: 512  VNVKADTFIARLRGEWRLEKINSLRERK 539


>gb|AAM13859.1| unknown protein [Arabidopsis thaliana]
          Length = 535

 Score =  155 bits (391), Expect = 5e-35
 Identities = 173/619 (27%), Positives = 253/619 (40%), Gaps = 29/619 (4%)
 Frame = -3

Query: 2025 MEDGDGEDRPPFWLQ---NATHLRRGDRLRRGXXXXXXXXXXXXXXXXXXXXXXXXXXVP 1855
            ME+ DG+   PFWLQ   N T+ RR   L                             +P
Sbjct: 1    MEEDDGDASTPFWLQSRRNNTYFRRTASL----GGRTTTIATQIFFAGTAAILIVVFIIP 56

Query: 1854 STLSFSAHIFKPNSVKKSWDSLNVVLVIFAVVFGFLSRNKNEERDSYF------DGFQSS 1693
               S  + IF+P+ V+KSWD LN VLV+FAV+ GFLSRN N +  ++       + F +S
Sbjct: 57   PFFSSVSQIFRPHLVRKSWDYLNFVLVLFAVLCGFLSRNTNNDESNHHKEEDIRNKFSTS 116

Query: 1692 P--------VKENGSQKSF-DFERNVEQKYESEQKNLMLKRNSSSYPDLREFSSVNWSYG 1540
            P        V  +G+   + + +R      ++  K     R+ SSYPDLR    +     
Sbjct: 117  PSIIDRRSRVSNSGTTPRYWNDDRGGGGGDQTVYKRFSRLRSVSSYPDLR----LREYEA 172

Query: 1539 DYQARFYDDINVDSGRVSDQGLIHHHRRHRSLEQVDYLXXXXXXXXXXXVDTLVRESKKX 1360
            D + RFYDD  V   R  D   I+ ++ +R+  +                   V +++  
Sbjct: 173  DERWRFYDDTRVSQCRYEDVDPIYPNQSYRNWHE-----------EGKPPPEDVDQTEDG 221

Query: 1359 XXXXXXXXXXXXXXAEDEEIPKNAVARKDRLSRKINKELE--SLSYVAASKPPLPPVGES 1186
                           E  E+   A A       ++ +EL+  S      S PP PP    
Sbjct: 222  DNGEGSKVRNGGSETEKVEVVATAEA-------EVVEELKVPSAPPYIPSPPPSPP-RPP 273

Query: 1185 PAPESQENQKRAHERVARRKERSNRKQIKDVEAIDTVTAPXXXXXXXXXXXXXXXXXXXX 1006
            PA +++    R ++ V+ ++E+  R    D  A  T   P                    
Sbjct: 274  PAKQAKRKTNRVYQDVSPQEEKKERD---DFVATTTPIPPPATVY--------------- 315

Query: 1005 XXXXXXXXXXXXXXXXXPSKEKISKSDRKRGGATGSSTKEFLNSLYHXXXXXXXXKSVDN 826
                                +K +K ++K+GGAT    K+FL +L          +S+D 
Sbjct: 316  --------------------QKSNKQEKKKGGAT----KDFLIAL-RRKKKKQRQQSIDG 350

Query: 825  MESLLHQAEAPLSLQIXXXXPS---VFQNLFSTKKQKRKRTITVTLEPLPP----QRAEA 667
            ++ LL  ++ PL        P     FQ LFS+KK K K+  +    P PP    +R E+
Sbjct: 351  LD-LLFGSDPPLVYSPPPPPPPPPPFFQGLFSSKKGKSKKNNSNPPPPPPPPPPERRYES 409

Query: 666  RDPEPTPR--PPEVTAGKPPQPIKMNSFDKVEEASNSGGESPLNRIXXXXXXXXXAFFRS 493
            R      R  P E    KP  P K+  +        +G ESPL  I           F+ 
Sbjct: 410  RASTSKLRKAPVESRTSKPNPPAKVTQY------VGTGSESPLMPIPPPPPPPP---FKM 460

Query: 492  PAWKFVVQGDYVRIXXXXXXXXXSPDPDDTESDVTPSAAVAFHPSPLFCASPDVNTKAES 313
            PAWKFV +GDYVR+          PD    + DV  SA        +FC SPDV+TKA+ 
Sbjct: 461  PAWKFVKRGDYVRMASDISISSDEPD----DPDVAQSAGSKEAAGSMFCPSPDVDTKADD 516

Query: 312  FITNFRAKLKLEKIHSMKK 256
            FI  FRA LKLEK++S+K+
Sbjct: 517  FIARFRAGLKLEKMNSVKR 535


>ref|NP_177422.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis
            thaliana] gi|12323765|gb|AAG51845.1|AC010926_8 unknown
            protein; 15669-13984 [Arabidopsis thaliana]
            gi|24030251|gb|AAN41301.1| unknown protein [Arabidopsis
            thaliana] gi|332197252|gb|AEE35373.1| hydroxyproline-rich
            glycoprotein family protein [Arabidopsis thaliana]
          Length = 561

 Score =  155 bits (391), Expect = 5e-35
 Identities = 173/619 (27%), Positives = 253/619 (40%), Gaps = 29/619 (4%)
 Frame = -3

Query: 2025 MEDGDGEDRPPFWLQ---NATHLRRGDRLRRGXXXXXXXXXXXXXXXXXXXXXXXXXXVP 1855
            ME+ DG+   PFWLQ   N T+ RR   L                             +P
Sbjct: 1    MEEDDGDASTPFWLQSRRNNTYFRRTASL----GGRTTTIATQIFFAGTAAILIVVFIIP 56

Query: 1854 STLSFSAHIFKPNSVKKSWDSLNVVLVIFAVVFGFLSRNKNEERDSYF------DGFQSS 1693
               S  + IF+P+ V+KSWD LN VLV+FAV+ GFLSRN N +  ++       + F +S
Sbjct: 57   PFFSSVSQIFRPHLVRKSWDYLNFVLVLFAVLCGFLSRNTNNDESNHHKEEDIRNKFSTS 116

Query: 1692 P--------VKENGSQKSF-DFERNVEQKYESEQKNLMLKRNSSSYPDLREFSSVNWSYG 1540
            P        V  +G+   + + +R      ++  K     R+ SSYPDLR    +     
Sbjct: 117  PSIIDRRSRVSNSGTTPRYWNDDRGGGGGDQTVYKRFSRLRSVSSYPDLR----LREYEA 172

Query: 1539 DYQARFYDDINVDSGRVSDQGLIHHHRRHRSLEQVDYLXXXXXXXXXXXVDTLVRESKKX 1360
            D + RFYDD  V   R  D   I+ ++ +R+  +                   V +++  
Sbjct: 173  DERWRFYDDTRVSQCRYEDVDPIYPNQSYRNWHE-----------EGKPPPEDVDQTEDG 221

Query: 1359 XXXXXXXXXXXXXXAEDEEIPKNAVARKDRLSRKINKELE--SLSYVAASKPPLPPVGES 1186
                           E  E+   A A       ++ +EL+  S      S PP PP    
Sbjct: 222  DNGEGSKVRNGGSETEKVEVVATAEA-------EVVEELKVPSAPPYIPSPPPSPP-RPP 273

Query: 1185 PAPESQENQKRAHERVARRKERSNRKQIKDVEAIDTVTAPXXXXXXXXXXXXXXXXXXXX 1006
            PA +++    R ++ V+ ++E+  R    D  A  T   P                    
Sbjct: 274  PAKQAKRKTNRVYQDVSPQEEKKERD---DFVATTTPIPPPATVY--------------- 315

Query: 1005 XXXXXXXXXXXXXXXXXPSKEKISKSDRKRGGATGSSTKEFLNSLYHXXXXXXXXKSVDN 826
                                +K +K ++K+GGAT    K+FL +L          +S+D 
Sbjct: 316  --------------------QKSNKQEKKKGGAT----KDFLIAL-RRKKKKQRQQSIDG 350

Query: 825  MESLLHQAEAPLSLQIXXXXPS---VFQNLFSTKKQKRKRTITVTLEPLPP----QRAEA 667
            ++ LL  ++ PL        P     FQ LFS+KK K K+  +    P PP    +R E+
Sbjct: 351  LD-LLFGSDPPLVYSPPPPPPPPPPFFQGLFSSKKGKSKKNNSNPPPPPPPPPPERRYES 409

Query: 666  RDPEPTPR--PPEVTAGKPPQPIKMNSFDKVEEASNSGGESPLNRIXXXXXXXXXAFFRS 493
            R      R  P E    KP  P K+  +        +G ESPL  I           F+ 
Sbjct: 410  RASTSKLRKAPVESRTSKPNPPAKVTQY------VGTGSESPLMPIPPPPPPPP---FKM 460

Query: 492  PAWKFVVQGDYVRIXXXXXXXXXSPDPDDTESDVTPSAAVAFHPSPLFCASPDVNTKAES 313
            PAWKFV +GDYVR+          PD    + DV  SA        +FC SPDV+TKA+ 
Sbjct: 461  PAWKFVKRGDYVRMASDISISSDEPD----DPDVAQSAGSKEAAGSMFCPSPDVDTKADD 516

Query: 312  FITNFRAKLKLEKIHSMKK 256
            FI  FRA LKLEK++S+K+
Sbjct: 517  FIARFRAGLKLEKMNSVKR 535

