BLASTX nr result

ID: Salvia21_contig00035262 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Salvia21_contig00035262
         (476 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002302000.1| predicted protein [Populus trichocarpa] gi|2...   239   1e-61
ref|XP_002280360.1| PREDICTED: pentatricopeptide repeat-containi...   239   1e-61
ref|XP_004152758.1| PREDICTED: pentatricopeptide repeat-containi...   235   3e-60
ref|XP_002893429.1| pentatricopeptide repeat-containing protein ...   234   5e-60
ref|NP_173907.1| pentatricopeptide repeat-containing protein [Ar...   231   5e-59

>ref|XP_002302000.1| predicted protein [Populus trichocarpa] gi|222843726|gb|EEE81273.1|
           predicted protein [Populus trichocarpa]
          Length = 797

 Score =  239 bits (611), Expect = 1e-61
 Identities = 109/158 (68%), Positives = 138/158 (87%)
 Frame = +3

Query: 3   ALEQGRQLHGQLIRVGFDSSLSAGNALITMYGRCGALDAAYDMFLTMPCLDHVSWNAMIA 182
           +L+ GRQLH Q++R G++SSLSAGNALITMY RCG +DAA+ +F+ MPC+D +SWNAMIA
Sbjct: 440 SLKHGRQLHAQVVRYGYESSLSAGNALITMYARCGVVDAAHCLFINMPCVDAISWNAMIA 499

Query: 183 ALGQHGHGEKAIELYEEMLEEHILPDRITFLTVLSACSHAGLVDQGKKYFESMTEVYGIT 362
           ALGQHG G +AIEL+EEML+E ILPDRI+FLTV+SACSHAGLV +G+KYF+SM  VYG+ 
Sbjct: 500 ALGQHGQGTQAIELFEEMLKEGILPDRISFLTVISACSHAGLVKEGRKYFDSMHNVYGVN 559

Query: 363 PGEDHYARLIDLLGRSGKLTEAQNVIQAMPFEPGAQIW 476
           P E+HYAR+IDLL R+GK +EA+ V+++MPFEPGA IW
Sbjct: 560 PDEEHYARIIDLLCRAGKFSEAKEVMESMPFEPGAPIW 597



 Score = 59.3 bits (142), Expect = 3e-07
 Identities = 36/130 (27%), Positives = 68/130 (52%), Gaps = 4/130 (3%)
 Frame = +3

Query: 15  GRQLHGQLIRV----GFDSSLSAGNALITMYGRCGALDAAYDMFLTMPCLDHVSWNAMIA 182
           G+++H   ++       D ++   NALIT Y +CG +D A ++F  MP  D VSWN +++
Sbjct: 308 GKEMHAYFLKTVANPAPDVAMPVNNALITFYWKCGKVDIAQEIFNKMPERDLVSWNIILS 367

Query: 183 ALGQHGHGEKAIELYEEMLEEHILPDRITFLTVLSACSHAGLVDQGKKYFESMTEVYGIT 362
                   ++A   + EM E++IL    +++ ++S  +  G  ++  K+F  M ++ G  
Sbjct: 368 GYVNVRCMDEAKSFFNEMPEKNIL----SWIIMISGLAQIGFAEEALKFFNRM-KLQGFE 422

Query: 363 PGEDHYARLI 392
           P +  +A  I
Sbjct: 423 PCDYAFAGAI 432


>ref|XP_002280360.1| PREDICTED: pentatricopeptide repeat-containing protein At1g25360
           [Vitis vinifera]
          Length = 799

 Score =  239 bits (611), Expect = 1e-61
 Identities = 113/158 (71%), Positives = 137/158 (86%)
 Frame = +3

Query: 3   ALEQGRQLHGQLIRVGFDSSLSAGNALITMYGRCGALDAAYDMFLTMPCLDHVSWNAMIA 182
           AL  GRQLH QL+R+GFDSSLSAGNALITMY +CG ++AA+ +FLTMP LD VSWNAMIA
Sbjct: 442 ALMHGRQLHAQLVRLGFDSSLSAGNALITMYAKCGVVEAAHCLFLTMPYLDSVSWNAMIA 501

Query: 183 ALGQHGHGEKAIELYEEMLEEHILPDRITFLTVLSACSHAGLVDQGKKYFESMTEVYGIT 362
           ALGQHGHG +A+EL+E ML+E ILPDRITFLTVLS CSHAGLV++G +YF+SM+ +YGI 
Sbjct: 502 ALGQHGHGAQALELFELMLKEDILPDRITFLTVLSTCSHAGLVEEGHRYFKSMSGLYGIC 561

Query: 363 PGEDHYARLIDLLGRSGKLTEAQNVIQAMPFEPGAQIW 476
           PGEDHYAR+IDLL R+GK +EA+++I+ MP EPG  IW
Sbjct: 562 PGEDHYARMIDLLCRAGKFSEAKDMIETMPVEPGPPIW 599



 Score = 69.7 bits (169), Expect = 2e-10
 Identities = 39/113 (34%), Positives = 61/113 (53%), Gaps = 4/113 (3%)
 Frame = +3

Query: 15  GRQLHGQLIRV----GFDSSLSAGNALITMYGRCGALDAAYDMFLTMPCLDHVSWNAMIA 182
           G+Q+H  ++R       D SLS  NAL T+Y +CG +D A  +F  MP  D VSWNA+++
Sbjct: 310 GKQVHAYILRTEPRPSLDFSLSVNNALATLYWKCGKVDEARQVFNQMPVKDLVSWNAILS 369

Query: 183 ALGQHGHGEKAIELYEEMLEEHILPDRITFLTVLSACSHAGLVDQGKKYFESM 341
                G  ++A   +EEM E ++L    T+  ++S  +  G  ++  K F  M
Sbjct: 370 GYVNAGRIDEAKSFFEEMPERNLL----TWTVMISGLAQNGFGEESLKLFNRM 418



 Score = 55.5 bits (132), Expect = 5e-06
 Identities = 43/128 (33%), Positives = 60/128 (46%), Gaps = 4/128 (3%)
 Frame = +3

Query: 81  LITMYGRCGALDAAYDMFLTMPCLDHVSWNAMIAALGQHGHGEKAIELYEEMLEEHILPD 260
           +I  Y R G LDAA      M     V+WNAMI+    HG   +A+E++ +M    I  D
Sbjct: 231 MIAGYVRNGELDAARQFLDGMTEKLVVAWNAMISGYVHHGFFLEALEMFRKMYLLGIQWD 290

Query: 261 RITFLTVLSACSHAGLVDQGKKYFESMTEVYGITPGEDHYA----RLIDLLGRSGKLTEA 428
             T+ +VLSAC++AG    GK+    +       P  D        L  L  + GK+ EA
Sbjct: 291 EFTYTSVLSACANAGFFLHGKQVHAYILRTEP-RPSLDFSLSVNNALATLYWKCGKVDEA 349

Query: 429 QNVIQAMP 452
           + V   MP
Sbjct: 350 RQVFNQMP 357


>ref|XP_004152758.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g25360-like [Cucumis sativus]
           gi|449504088|ref|XP_004162249.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At1g25360-like [Cucumis sativus]
          Length = 797

 Score =  235 bits (599), Expect = 3e-60
 Identities = 112/158 (70%), Positives = 132/158 (83%)
 Frame = +3

Query: 3   ALEQGRQLHGQLIRVGFDSSLSAGNALITMYGRCGALDAAYDMFLTMPCLDHVSWNAMIA 182
           ALE GRQLH Q++ +G DS+LS GNA+ITMY RCG ++AA  MFLTMP +D VSWN+MIA
Sbjct: 440 ALENGRQLHAQIVHLGHDSTLSVGNAMITMYARCGIVEAARTMFLTMPFVDPVSWNSMIA 499

Query: 183 ALGQHGHGEKAIELYEEMLEEHILPDRITFLTVLSACSHAGLVDQGKKYFESMTEVYGIT 362
           ALGQHGHG KAIELYE+ML+E ILPDR TFLTVLSACSHAGLV++G +YF SM E YGI 
Sbjct: 500 ALGQHGHGVKAIELYEQMLKEGILPDRRTFLTVLSACSHAGLVEEGNRYFNSMLENYGIA 559

Query: 363 PGEDHYARLIDLLGRSGKLTEAQNVIQAMPFEPGAQIW 476
           PGEDHYAR+IDL  R+GK ++A+NVI +MPFE  A IW
Sbjct: 560 PGEDHYARMIDLFCRAGKFSDAKNVIDSMPFEARAPIW 597



 Score = 65.1 bits (157), Expect = 6e-09
 Identities = 47/186 (25%), Positives = 80/186 (43%), Gaps = 39/186 (20%)
 Frame = +3

Query: 15  GRQLHGQLIRVGF----DSSLSAGNALITMYGRCGALDAAYDMFLTMPCLDHVSWN---- 170
           G+Q+H  +++       D  LS GN LIT+Y + G +D A  +F  MP  D ++WN    
Sbjct: 308 GKQVHAYILKNELNPDRDFLLSVGNTLITLYWKYGKVDGARKIFYEMPVKDIITWNTLLS 367

Query: 171 ---------------------------AMIAALGQHGHGEKAIELYEEMLEEHILPDRIT 269
                                       MI+ L Q+G GE+A++L+ +M  +   P+   
Sbjct: 368 GYVNAGRMEEAKSFFAQMPEKNLLTWTVMISGLAQNGFGEQALKLFNQMKLDGYEPNDYA 427

Query: 270 FLTVLSACSHAGLVDQGKKYFESMTEVYGITPGEDHYA----RLIDLLGRSGKLTEAQNV 437
           F   ++ACS  G ++ G++    +  +     G D        +I +  R G +  A+ +
Sbjct: 428 FAGAITACSVLGALENGRQLHAQIVHL-----GHDSTLSVGNAMITMYARCGIVEAARTM 482

Query: 438 IQAMPF 455
              MPF
Sbjct: 483 FLTMPF 488


>ref|XP_002893429.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297339271|gb|EFH69688.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 790

 Score =  234 bits (597), Expect = 5e-60
 Identities = 107/154 (69%), Positives = 133/154 (86%)
 Frame = +3

Query: 15  GRQLHGQLIRVGFDSSLSAGNALITMYGRCGALDAAYDMFLTMPCLDHVSWNAMIAALGQ 194
           G+Q H QL+++GFDSSLSAGNALITMY +CG ++ A  +F TMPCLD VSWNA+IAALGQ
Sbjct: 436 GQQFHAQLVKIGFDSSLSAGNALITMYAKCGVVEEAQQVFRTMPCLDSVSWNALIAALGQ 495

Query: 195 HGHGEKAIELYEEMLEEHILPDRITFLTVLSACSHAGLVDQGKKYFESMTEVYGITPGED 374
           HGHG +A+++YEEML++ I PDRITFLTVL+ACSHAGLVDQG+KYF SM  VY I PG D
Sbjct: 496 HGHGVEAVDVYEEMLKKGIRPDRITFLTVLTACSHAGLVDQGRKYFNSMETVYRIPPGAD 555

Query: 375 HYARLIDLLGRSGKLTEAQNVIQAMPFEPGAQIW 476
           HYARLIDLL RSGK +EA+++I+++PF+P A+IW
Sbjct: 556 HYARLIDLLCRSGKFSEAESIIESLPFKPTAEIW 589



 Score = 66.2 bits (160), Expect = 3e-09
 Identities = 37/112 (33%), Positives = 65/112 (58%)
 Frame = +3

Query: 6   LEQGRQLHGQLIRVGFDSSLSAGNALITMYGRCGALDAAYDMFLTMPCLDHVSWNAMIAA 185
           L+ G+Q+H  ++R   D S    N+L+T+Y +CG  + A  +F  MP  D VSWNA+++ 
Sbjct: 302 LQLGKQVHAYVLRRE-DFSFHFDNSLVTLYYKCGKFNEARAIFEKMPAKDLVSWNALLSG 360

Query: 186 LGQHGHGEKAIELYEEMLEEHILPDRITFLTVLSACSHAGLVDQGKKYFESM 341
               GH  +A  +++EM E++IL    +++ ++S  +  G  ++G K F  M
Sbjct: 361 YVSSGHIGEAKLIFKEMKEKNIL----SWMIMISGLAENGFGEEGLKLFSCM 408


>ref|NP_173907.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75172213|sp|Q9FRI5.1|PPR57_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At1g25360 gi|11067273|gb|AAG28801.1|AC079374_4
           hypothetical protein [Arabidopsis thaliana]
           gi|332192491|gb|AEE30612.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 790

 Score =  231 bits (589), Expect = 5e-59
 Identities = 106/154 (68%), Positives = 133/154 (86%)
 Frame = +3

Query: 15  GRQLHGQLIRVGFDSSLSAGNALITMYGRCGALDAAYDMFLTMPCLDHVSWNAMIAALGQ 194
           G+Q H QL+++GFDSSLSAGNALITMY +CG ++ A  +F TMPCLD VSWNA+IAALGQ
Sbjct: 436 GQQYHAQLLKIGFDSSLSAGNALITMYAKCGVVEEARQVFRTMPCLDSVSWNALIAALGQ 495

Query: 195 HGHGEKAIELYEEMLEEHILPDRITFLTVLSACSHAGLVDQGKKYFESMTEVYGITPGED 374
           HGHG +A+++YEEML++ I PDRIT LTVL+ACSHAGLVDQG+KYF+SM  VY I PG D
Sbjct: 496 HGHGAEAVDVYEEMLKKGIRPDRITLLTVLTACSHAGLVDQGRKYFDSMETVYRIPPGAD 555

Query: 375 HYARLIDLLGRSGKLTEAQNVIQAMPFEPGAQIW 476
           HYARLIDLL RSGK ++A++VI+++PF+P A+IW
Sbjct: 556 HYARLIDLLCRSGKFSDAESVIESLPFKPTAEIW 589



 Score = 66.6 bits (161), Expect = 2e-09
 Identities = 37/112 (33%), Positives = 65/112 (58%)
 Frame = +3

Query: 6   LEQGRQLHGQLIRVGFDSSLSAGNALITMYGRCGALDAAYDMFLTMPCLDHVSWNAMIAA 185
           L+ G+Q+H  ++R   D S    N+L+++Y +CG  D A  +F  MP  D VSWNA+++ 
Sbjct: 302 LQLGKQVHAYVLRRE-DFSFHFDNSLVSLYYKCGKFDEARAIFEKMPAKDLVSWNALLSG 360

Query: 186 LGQHGHGEKAIELYEEMLEEHILPDRITFLTVLSACSHAGLVDQGKKYFESM 341
               GH  +A  +++EM E++IL    +++ ++S  +  G  ++G K F  M
Sbjct: 361 YVSSGHIGEAKLIFKEMKEKNIL----SWMIMISGLAENGFGEEGLKLFSCM 408


Top