BLASTX nr result

ID: Panax21_contig00032676 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Panax21_contig00032676
         (715 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002328557.1| predicted protein [Populus trichocarpa] gi|2...   296   4e-81
ref|XP_002282084.1| PREDICTED: pentatricopeptide repeat-containi...   305   8e-81
ref|XP_003550780.1| PREDICTED: pentatricopeptide repeat-containi...   299   4e-79
ref|XP_002887548.1| hypothetical protein ARALYDRAFT_339650 [Arab...   277   8e-75
ref|NP_177601.1| pentatricopeptide repeat-containing protein [Ar...   277   1e-74

>ref|XP_002328557.1| predicted protein [Populus trichocarpa] gi|222838272|gb|EEE76637.1|
           predicted protein [Populus trichocarpa]
          Length = 643

 Score =  296 bits (758), Expect(2) = 4e-81
 Identities = 139/187 (74%), Positives = 161/187 (86%)
 Frame = +2

Query: 35  SNLIVGFAQNGCFDEAFCYFRELQRVGMRPNKVSLTGVLSACAQAGAFEFGKILHRFMEK 214
           S +IVGFA NG F+EAF +FRELQR GMRPN+ SLTGVLSACAQAGA EFGKILH F+EK
Sbjct: 238 STMIVGFAHNGYFEEAFSFFRELQRKGMRPNETSLTGVLSACAQAGALEFGKILHGFIEK 297

Query: 215 AGFVWISSVNNALMDTYSKCGNVRMVRLVFERMPGVKSVISWTSMMAGLAMQGYGEEAIA 394
           +G  WI SVNNAL+DTYSKCGNV M +LVFER+   ++++SWTSMMA LAM G+GEEAI 
Sbjct: 298 SGLAWIVSVNNALLDTYSKCGNVLMAQLVFERIMNERNIVSWTSMMAALAMHGHGEEAIG 357

Query: 395 LFSEMEGSGLRPDGITFISILYACSHAGLIEQGCRYFSKMKNIYDIEPAIEHYGCMVDLY 574
           +F +ME SG+RPD I FIS+LYACSHAGL+EQGC YF KMK +Y+IEP+IEHYGCMVDLY
Sbjct: 358 IFHKMEESGIRPDEIAFISLLYACSHAGLVEQGCEYFDKMKGMYNIEPSIEHYGCMVDLY 417

Query: 575 GRAGQLQ 595
           GRAGQLQ
Sbjct: 418 GRAGQLQ 424



 Score = 32.0 bits (71), Expect(2) = 4e-81
 Identities = 17/39 (43%), Positives = 23/39 (58%)
 Frame = +3

Query: 591 CRASEEKAISALWLIYMDVDLAEQVRKRLFELDPNIDFD 707
           C A   + +     ++ DV LAEQV++RL ELDPN   D
Sbjct: 437 CTAIIWRTLLGACSMHGDVKLAEQVKERLSELDPNNSSD 475



 Score = 77.0 bits (188), Expect = 4e-12
 Identities = 40/125 (32%), Positives = 69/125 (55%), Gaps = 1/125 (0%)
 Frame = +2

Query: 122 PNKVSLTGVLSACAQAGAFEFGKILHRFMEKAGFV-WISSVNNALMDTYSKCGNVRMVRL 298
           PN ++   +++AC + G  + G+ L   M     + W     N ++  Y+K G + + R 
Sbjct: 170 PNAIAWNAMVTACCRGGDMKGGRELFDLMPVRNLMSW-----NVMLAGYTKAGELELARE 224

Query: 299 VFERMPGVKSVISWTSMMAGLAMQGYGEEAIALFSEMEGSGLRPDGITFISILYACSHAG 478
           +F  MP +K  +SW++M+ G A  GY EEA + F E++  G+RP+  +   +L AC+ AG
Sbjct: 225 MFLEMP-MKDDVSWSTMIVGFAHNGYFEEAFSFFRELQRKGMRPNETSLTGVLSACAQAG 283

Query: 479 LIEQG 493
            +E G
Sbjct: 284 ALEFG 288


>ref|XP_002282084.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g74630-like [Vitis vinifera]
          Length = 643

 Score =  305 bits (780), Expect = 8e-81
 Identities = 150/198 (75%), Positives = 169/198 (85%)
 Frame = +2

Query: 35  SNLIVGFAQNGCFDEAFCYFRELQRVGMRPNKVSLTGVLSACAQAGAFEFGKILHRFMEK 214
           S +IVGFA NG F EAF +FRELQ+VGMRPN+VSLTG LSACA AGA EFGKILH F+EK
Sbjct: 238 STMIVGFAHNGFFYEAFGFFRELQQVGMRPNEVSLTGALSACADAGAIEFGKILHGFIEK 297

Query: 215 AGFVWISSVNNALMDTYSKCGNVRMVRLVFERMPGVKSVISWTSMMAGLAMQGYGEEAIA 394
           +GF+W+ SVNNAL+DTYSKCGNV M RLVFERMP  +S++SWTSM+AGLAM GYGEEAI 
Sbjct: 298 SGFLWMVSVNNALLDTYSKCGNVGMARLVFERMPEKRSIVSWTSMIAGLAMHGYGEEAIQ 357

Query: 395 LFSEMEGSGLRPDGITFISILYACSHAGLIEQGCRYFSKMKNIYDIEPAIEHYGCMVDLY 574
           LF EME SG+RPDGI FISILYACSHAGLIE+G  YF KMK+IY+IEPAIEHYGCMVDLY
Sbjct: 358 LFHEMEESGIRPDGIAFISILYACSHAGLIEKGYEYFYKMKDIYNIEPAIEHYGCMVDLY 417

Query: 575 GRAGQLQSK*GKGYQCIM 628
           GRAGQL     K Y+ I+
Sbjct: 418 GRAGQLD----KAYEFII 431



 Score = 70.1 bits (170), Expect = 4e-10
 Identities = 46/162 (28%), Positives = 79/162 (48%), Gaps = 1/162 (0%)
 Frame = +2

Query: 11  MGTNFLQVSNLIVGFAQNGCFDEAFCYFRELQRVGMRPNKVSLTGVLSACAQAGAFEFGK 190
           + T+    + L+  +++ G    A   F E+      PN V+   V++AC + G  +   
Sbjct: 137 LDTHLFVGTTLVSMYSECGFVAFAKKVFEEM----FEPNVVAWNAVVTACFRCGDVKGAD 192

Query: 191 ILHRFMEKAGFV-WISSVNNALMDTYSKCGNVRMVRLVFERMPGVKSVISWTSMMAGLAM 367
           ++   M       W     N ++  Y+K G + + R +F  MP VK  +SW++M+ G A 
Sbjct: 193 MMFNRMPFRNLTSW-----NVMLAGYTKAGELELARKLFLEMP-VKDDVSWSTMIVGFAH 246

Query: 368 QGYGEEAIALFSEMEGSGLRPDGITFISILYACSHAGLIEQG 493
            G+  EA   F E++  G+RP+ ++    L AC+ AG IE G
Sbjct: 247 NGFFYEAFGFFRELQQVGMRPNEVSLTGALSACADAGAIEFG 288


>ref|XP_003550780.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g74630-like [Glycine max]
          Length = 640

 Score =  299 bits (765), Expect = 4e-79
 Identities = 145/197 (73%), Positives = 168/197 (85%)
 Frame = +2

Query: 35  SNLIVGFAQNGCFDEAFCYFRELQRVGMRPNKVSLTGVLSACAQAGAFEFGKILHRFMEK 214
           S +IVGFA NGCFDEAF +FREL R  +R N+VSLTGVLSACAQAGAFEFGKILH F+EK
Sbjct: 235 STMIVGFAHNGCFDEAFGFFRELLREEIRTNEVSLTGVLSACAQAGAFEFGKILHGFVEK 294

Query: 215 AGFVWISSVNNALMDTYSKCGNVRMVRLVFERMPGVKSVISWTSMMAGLAMQGYGEEAIA 394
           AGF+++ SVNNAL+DTYSKCGNV M RLVF+ MP  +S++SWTS++AGLAM G GEEAI 
Sbjct: 295 AGFLYVGSVNNALIDTYSKCGNVAMARLVFQNMPVARSIVSWTSIIAGLAMHGCGEEAIQ 354

Query: 395 LFSEMEGSGLRPDGITFISILYACSHAGLIEQGCRYFSKMKNIYDIEPAIEHYGCMVDLY 574
           LF EME SG+RPDGITFIS+LYACSH+GL+E+GC  FSKMKN+Y IEPAIEHYGCMVDLY
Sbjct: 355 LFHEMEESGVRPDGITFISLLYACSHSGLVEEGCGLFSKMKNLYGIEPAIEHYGCMVDLY 414

Query: 575 GRAGQLQSK*GKGYQCI 625
           GRA +LQ    K Y+ I
Sbjct: 415 GRAARLQ----KAYEFI 427


>ref|XP_002887548.1| hypothetical protein ARALYDRAFT_339650 [Arabidopsis lyrata subsp.
           lyrata] gi|297333389|gb|EFH63807.1| hypothetical protein
           ARALYDRAFT_339650 [Arabidopsis lyrata subsp. lyrata]
          Length = 1221

 Score =  277 bits (709), Expect(2) = 8e-75
 Identities = 131/187 (70%), Positives = 159/187 (85%)
 Frame = +2

Query: 35  SNLIVGFAQNGCFDEAFCYFRELQRVGMRPNKVSLTGVLSACAQAGAFEFGKILHRFMEK 214
           S +IVGF+ NG F+E+F YFREL R  MRPN+VSLTGVLSAC+Q+GAFEFGK LH F+EK
Sbjct: 405 STMIVGFSHNGSFNESFSYFRELLRAEMRPNEVSLTGVLSACSQSGAFEFGKTLHGFVEK 464

Query: 215 AGFVWISSVNNALMDTYSKCGNVRMVRLVFERMPGVKSVISWTSMMAGLAMQGYGEEAIA 394
           +G+ WI SVNNAL+D YS+CGNV M RLVFE M   +S++SWTSM+AGLAM G+GEEAI 
Sbjct: 465 SGYSWIVSVNNALIDMYSRCGNVPMARLVFEGMQEKRSIVSWTSMIAGLAMHGHGEEAIR 524

Query: 395 LFSEMEGSGLRPDGITFISILYACSHAGLIEQGCRYFSKMKNIYDIEPAIEHYGCMVDLY 574
           +F+EM  SG+ PD I+FIS+LYACSHAGLI++G  YFSKMK +Y IEPA+EHYGCMVDLY
Sbjct: 525 IFNEMTESGVMPDEISFISLLYACSHAGLIKEGEGYFSKMKRVYHIEPAVEHYGCMVDLY 584

Query: 575 GRAGQLQ 595
           GR+G+LQ
Sbjct: 585 GRSGKLQ 591



 Score = 29.6 bits (65), Expect(2) = 8e-75
 Identities = 12/18 (66%), Positives = 17/18 (94%)
 Frame = +3

Query: 642 DVDLAEQVRKRLFELDPN 695
           +++LAEQV++RL ELDPN
Sbjct: 621 NIELAEQVKQRLNELDPN 638



 Score = 61.6 bits (148), Expect = 2e-07
 Identities = 46/192 (23%), Positives = 91/192 (47%), Gaps = 3/192 (1%)
 Frame = +2

Query: 26  LQVSNLIVG-FAQNGCFDEAFCYFRELQRVGMRPNKVSLTGVLSACAQAGAFEFGK-ILH 199
           L V+  ++G + + GC   A   F E+ +    PN V+   V++AC +       + I  
Sbjct: 308 LFVATTLIGMYGECGCVGFARKVFDEMPQ----PNLVAWNAVVTACFRGNDVSGAREIFD 363

Query: 200 RFMEKAGFVWISSVNNALMDTYSKCGNVRMVRLVFERMPGVKSVISWTSMMAGLAMQGYG 379
           + + +    W     N ++  Y K G +   + +F  MP  +  +SW++M+ G +  G  
Sbjct: 364 KMLVRNHTSW-----NVMLAGYIKAGELECAKRIFSEMPH-RDDVSWSTMIVGFSHNGSF 417

Query: 380 EEAIALFSEMEGSGLRPDGITFISILYACSHAGLIEQGCRYFSKM-KNIYDIEPAIEHYG 556
            E+ + F E+  + +RP+ ++   +L ACS +G  E G      + K+ Y    ++ +  
Sbjct: 418 NESFSYFRELLRAEMRPNEVSLTGVLSACSQSGAFEFGKTLHGFVEKSGYSWIVSVNN-- 475

Query: 557 CMVDLYGRAGQL 592
            ++D+Y R G +
Sbjct: 476 ALIDMYSRCGNV 487


>ref|NP_177601.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75169836|sp|Q9CA54.1|PP122_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At1g74630 gi|12324801|gb|AAG52363.1|AC011765_15
           hypothetical protein; 86841-88772 [Arabidopsis thaliana]
           gi|332197495|gb|AEE35616.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 643

 Score =  277 bits (708), Expect(2) = 1e-74
 Identities = 131/187 (70%), Positives = 157/187 (83%)
 Frame = +2

Query: 35  SNLIVGFAQNGCFDEAFCYFRELQRVGMRPNKVSLTGVLSACAQAGAFEFGKILHRFMEK 214
           S +IVG A NG F+E+F YFRELQR GM PN+VSLTGVLSAC+Q+G+FEFGKILH F+EK
Sbjct: 238 STMIVGIAHNGSFNESFLYFRELQRAGMSPNEVSLTGVLSACSQSGSFEFGKILHGFVEK 297

Query: 215 AGFVWISSVNNALMDTYSKCGNVRMVRLVFERMPGVKSVISWTSMMAGLAMQGYGEEAIA 394
           AG+ WI SVNNAL+D YS+CGNV M RLVFE M   + ++SWTSM+AGLAM G GEEA+ 
Sbjct: 298 AGYSWIVSVNNALIDMYSRCGNVPMARLVFEGMQEKRCIVSWTSMIAGLAMHGQGEEAVR 357

Query: 395 LFSEMEGSGLRPDGITFISILYACSHAGLIEQGCRYFSKMKNIYDIEPAIEHYGCMVDLY 574
           LF+EM   G+ PDGI+FIS+L+ACSHAGLIE+G  YFS+MK +Y IEP IEHYGCMVDLY
Sbjct: 358 LFNEMTAYGVTPDGISFISLLHACSHAGLIEEGEDYFSEMKRVYHIEPEIEHYGCMVDLY 417

Query: 575 GRAGQLQ 595
           GR+G+LQ
Sbjct: 418 GRSGKLQ 424



 Score = 29.6 bits (65), Expect(2) = 1e-74
 Identities = 12/18 (66%), Positives = 17/18 (94%)
 Frame = +3

Query: 642 DVDLAEQVRKRLFELDPN 695
           +++LAEQV++RL ELDPN
Sbjct: 454 NIELAEQVKQRLNELDPN 471



 Score = 65.9 bits (159), Expect = 8e-09
 Identities = 51/195 (26%), Positives = 93/195 (47%), Gaps = 9/195 (4%)
 Frame = +2

Query: 35  SNLIVGFAQNGCFDEAFCYFRELQRVGMRPNKVSLTGVLSACAQ----AGAFEFGKILHR 202
           + LI  +   GC + A   F E+ +    PN V+   V++AC +    AGA E   I  +
Sbjct: 145 TTLIGMYGGCGCVEFARKVFDEMHQ----PNLVAWNAVITACFRGNDVAGARE---IFDK 197

Query: 203 FMEKAGFVWISSVNNALMDTYSKCGNVRMVRLVFERMPGVKSVISWTSMMAGLAMQGYGE 382
            + +    W     N ++  Y K G +   + +F  MP  +  +SW++M+ G+A  G   
Sbjct: 198 MLVRNHTSW-----NVMLAGYIKAGELESAKRIFSEMPH-RDDVSWSTMIVGIAHNGSFN 251

Query: 383 EAIALFSEMEGSGLRPDGITFISILYACSHAGLIEQGCRYFSKMKNIYDIEPA-----IE 547
           E+   F E++ +G+ P+ ++   +L ACS +G  E     F K+ + + +E A     + 
Sbjct: 252 ESFLYFRELQRAGMSPNEVSLTGVLSACSQSGSFE-----FGKILHGF-VEKAGYSWIVS 305

Query: 548 HYGCMVDLYGRAGQL 592
               ++D+Y R G +
Sbjct: 306 VNNALIDMYSRCGNV 320

