BLASTX nr result

ID: Salvia21_contig00016791

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Salvia21_contig00016791
         (1097 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002521492.1| conserved hypothetical protein [Ricinus comm...   233   5e-59
ref|XP_002271099.1| PREDICTED: uncharacterized protein LOC100262...   219   1e-54
ref|XP_002873667.1| hypothetical protein ARALYDRAFT_488286 [Arab...   216   6e-54
ref|XP_002298556.1| predicted protein [Populus trichocarpa] gi|2...   214   3e-53
ref|NP_196948.2| Surfeit locus protein 2 (SURF2) [Arabidopsis th...   214   3e-53

>ref|XP_002521492.1| conserved hypothetical protein [Ricinus communis]
            gi|223539391|gb|EEF40982.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 249

 Score =  233 bits (595), Expect = 5e-59
 Identities = 127/243 (52%), Positives = 151/243 (62%), Gaps = 6/243 (2%)
 Frame = -3

Query: 1008 EPKEGRNLLGKPKFKKLENGRFKCVETGHELPAHARDAYAESKHCRLGLIDAALARSKPP 829
            E KEG  LLG P F +LENGRFKCVETGHE+ A  +D+Y++SK CRLGLID ALA SKPP
Sbjct: 8    ESKEGNKLLGSPTFTELENGRFKCVETGHEMLAKDKDSYSQSKRCRLGLIDFALANSKPP 67

Query: 828  LNMFRQDPASSSKLECKLTGLTINKSEEHIWKHMNGKRFLNMLEKFE------SEKETPS 667
            LNMF+QDPAS SKL CKLTG T+NKSEEHIWKH+NGKRFLN LE+ E       +KE   
Sbjct: 68   LNMFKQDPASRSKLICKLTGDTVNKSEEHIWKHINGKRFLNKLEQKEMGKVEDKQKEDSK 127

Query: 666  GVEGKPDEEKKKMKNEDGSLXXXXXXXXXXXXXXXXNVDEIVNEVRDSAGKSSDTEDEAE 487
                 P ++KKK K ++                    ++EI++EVRDS+ K SDTE E +
Sbjct: 128  SSADGPKKKKKKTKKKE---------------KEEKPIEEIISEVRDSSDKDSDTE-ELD 171

Query: 486  FWMPPEGERWDHDDGGDRWSDGSESAPXXXXXXXXXXXXXXXXXXDATVLSNGTKRMSLE 307
            FWMPP GERWD DDGGDRW    E                     ++  LS  TKRMS+E
Sbjct: 172  FWMPPVGERWDFDDGGDRWGSDGE-LEQETEKENGADDAVDENGKESEELSKRTKRMSIE 230

Query: 306  TEP 298
              P
Sbjct: 231  IGP 233


>ref|XP_002271099.1| PREDICTED: uncharacterized protein LOC100262754 [Vitis vinifera]
          Length = 273

 Score =  219 bits (557), Expect = 1e-54
 Identities = 124/266 (46%), Positives = 156/266 (58%), Gaps = 18/266 (6%)
 Frame = -3

Query: 1041 KKKGEENPKSDEPKEGRNLLGKPKFKKLENGRFKCVETGHELPAHARDAYAESKHCRLGL 862
            ++K EE  K     EG NLLG P F+ L NGRFKCVETGHE+P +A D+Y++SK CRLGL
Sbjct: 12   EEKVEEEKKKKVGHEGSNLLGPPTFEALANGRFKCVETGHEMPPNAVDSYSQSKRCRLGL 71

Query: 861  IDAALARSKPPLNMFRQDPASSSKLECKLTGLTINKSEEHIWKHMNGKRFLNMLEKFESE 682
            ID AL+  K PLN+F+QDP S SKL CKLTG TINK+EEHIWKH+NG+RFLN LE+ E+E
Sbjct: 72   IDFALSHKKTPLNLFKQDPVSRSKLICKLTGDTINKTEEHIWKHINGRRFLNKLEQKEAE 131

Query: 681  KETPS--------------GVEGKPDEEKKKMKNEDGSLXXXXXXXXXXXXXXXXNVDEI 544
            K+  +              G + K  ++KKK K ++                    VDE 
Sbjct: 132  KQMSNRKVVDKGDQMQQKDGEKHKKKDKKKKKKKQE------------------KEVDEN 173

Query: 543  VNEVR----DSAGKSSDTEDEAEFWMPPEGERWDHDDGGDRWSDGSESAPXXXXXXXXXX 376
            ++E R    D   K+SDTE EAEFWMPP G+RWD DDGGDRW   S+             
Sbjct: 174  ISEARKSEVDEIDKNSDTE-EAEFWMPPVGDRWDSDDGGDRWGSDSD-LECDIDEVNGTD 231

Query: 375  XXXXXXXXDATVLSNGTKRMSLETEP 298
                    ++  LS  TKR+S+E  P
Sbjct: 232  GAEEEHGNESRELSKRTKRLSIEVGP 257


>ref|XP_002873667.1| hypothetical protein ARALYDRAFT_488286 [Arabidopsis lyrata subsp.
            lyrata] gi|297319504|gb|EFH49926.1| hypothetical protein
            ARALYDRAFT_488286 [Arabidopsis lyrata subsp. lyrata]
          Length = 294

 Score =  216 bits (551), Expect = 6e-54
 Identities = 111/215 (51%), Positives = 139/215 (64%), Gaps = 10/215 (4%)
 Frame = -3

Query: 1026 ENPKSDEPKEGRNLLGKPKFKKLENGRFKCVETGHELPAHARDAYAESKHCRLGLIDAAL 847
            E   +   KEG +LLGKPK+KKL+NGRFKCV+TGHEL    +  Y++SK CRLGLID AL
Sbjct: 5    EEEMTTTTKEGADLLGKPKYKKLDNGRFKCVQTGHELLEKDKKVYSQSKRCRLGLIDYAL 64

Query: 846  ARSKPPLNMFRQDPASSSKLECKLTGLTINKSEEHIWKHMNGKRFLNMLEKFESEKETPS 667
            + SKPPLN+F QDP + SKL+CKLTG T+NK+EEHIWKH+NG+RFLN LE+ E EKE+ S
Sbjct: 65   SHSKPPLNLFEQDPNARSKLKCKLTGDTVNKTEEHIWKHINGRRFLNRLEEKEREKESGS 124

Query: 666  GVE----------GKPDEEKKKMKNEDGSLXXXXXXXXXXXXXXXXNVDEIVNEVRDSAG 517
              E          G  +EEKKKMK                        +++ +EV     
Sbjct: 125  IPEEGGETLAEENGVKEEEKKKMKKRKNKKKEKKKNKKSVEKEKNG--EDVADEVEHEND 182

Query: 516  KSSDTEDEAEFWMPPEGERWDHDDGGDRWSDGSES 412
            ++   E+E EFWMPP+GERWD DDGGDRW   S+S
Sbjct: 183  EA--VEEELEFWMPPDGERWDFDDGGDRWGSDSDS 215


>ref|XP_002298556.1| predicted protein [Populus trichocarpa] gi|222845814|gb|EEE83361.1|
            predicted protein [Populus trichocarpa]
          Length = 221

 Score =  214 bits (545), Expect = 3e-53
 Identities = 120/244 (49%), Positives = 147/244 (60%)
 Frame = -3

Query: 1029 EENPKSDEPKEGRNLLGKPKFKKLENGRFKCVETGHELPAHARDAYAESKHCRLGLIDAA 850
            EE+ K  E KEG NLLG P F +LENGRFKC+E+GHE+ A  +++Y+ SK CRLGLID A
Sbjct: 3    EESTKM-ETKEGSNLLGSPTFTQLENGRFKCLESGHEVLAKDKESYSHSKRCRLGLIDFA 61

Query: 849  LARSKPPLNMFRQDPASSSKLECKLTGLTINKSEEHIWKHMNGKRFLNMLEKFESEKETP 670
            LA +KPPLNMF+QDP S SKL CKLTG T+NKSEEHIWKH+NGKRFLN LEK        
Sbjct: 62   LANNKPPLNMFKQDPLSRSKLICKLTGDTVNKSEEHIWKHINGKRFLNKLEK-------- 113

Query: 669  SGVEGKPDEEKKKMKNEDGSLXXXXXXXXXXXXXXXXNVDEIVNEVRDSAGKSSDTEDEA 490
                     + KK +N+                     V+EI+++VRDS+ K SD E E 
Sbjct: 114  ---------KNKKKQNK---------------------VEEIISQVRDSSDKDSDLE-ET 142

Query: 489  EFWMPPEGERWDHDDGGDRWSDGSESAPXXXXXXXXXXXXXXXXXXDATVLSNGTKRMSL 310
            +FW+PP GERWD DDGGDRW   +ES                    ++  LS   KRMS+
Sbjct: 143  DFWIPPAGERWDFDDGGDRWGSDAES-EHESQEENPADDAVEDNGEESKELSTRAKRMSI 201

Query: 309  ETEP 298
            E  P
Sbjct: 202  EIGP 205


>ref|NP_196948.2| Surfeit locus protein 2 (SURF2) [Arabidopsis thaliana]
            gi|38454156|gb|AAR20772.1| At5g14440 [Arabidopsis
            thaliana] gi|38604030|gb|AAR24758.1| At5g14440
            [Arabidopsis thaliana] gi|332004651|gb|AED92034.1|
            Surfeit locus protein 2 (SURF2) [Arabidopsis thaliana]
          Length = 292

 Score =  214 bits (545), Expect = 3e-53
 Identities = 112/207 (54%), Positives = 135/207 (65%), Gaps = 10/207 (4%)
 Frame = -3

Query: 1002 KEGRNLLGKPKFKKLENGRFKCVETGHELPAHARDAYAESKHCRLGLIDAALARSKPPLN 823
            KEG +LLGKPK+KKLENGRFKCV+TGHEL    +  Y++SK CRLGLID AL+ SKPPLN
Sbjct: 12   KEGADLLGKPKYKKLENGRFKCVQTGHELLEKDKKVYSQSKRCRLGLIDYALSHSKPPLN 71

Query: 822  MFRQDPASSSKLECKLTGLTINKSEEHIWKHMNGKRFLNMLEKFESEK----------ET 673
            +F QDP + SKL+CKLTG T+NK+EEHIWKH+ G+RFLN LE+ E EK          ET
Sbjct: 72   LFEQDPNARSKLKCKLTGDTVNKTEEHIWKHITGRRFLNRLEEKEREKESGSIPAEGGET 131

Query: 672  PSGVEGKPDEEKKKMKNEDGSLXXXXXXXXXXXXXXXXNVDEIVNEVRDSAGKSSDTEDE 493
            P+   G  DE+KKK K ++                     DEI +E  D A      E+E
Sbjct: 132  PAKENGVEDEDKKKKKKKNNK-KKKNKKSVEKKKNGEDVADEIEHE-NDEA-----VEEE 184

Query: 492  AEFWMPPEGERWDHDDGGDRWSDGSES 412
             EFWMPP+GERWD DDG DRW   S+S
Sbjct: 185  LEFWMPPDGERWDFDDGRDRWGSDSDS 211

