BLASTX nr result

ID: Cephaelis21_contig00006319 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00006319
         (1019 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002518527.1| pentatricopeptide repeat-containing protein,...   284   2e-74
ref|XP_002305605.1| predicted protein [Populus trichocarpa] gi|2...   262   1e-67
ref|XP_002264956.1| PREDICTED: pentatricopeptide repeat-containi...   260   4e-67
ref|NP_193849.2| pentatricopeptide repeat-containing protein [Ar...   199   7e-49
sp|O49558.2|PP331_ARATH RecName: Full=Pentatricopeptide repeat-c...   199   7e-49

>ref|XP_002518527.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223542372|gb|EEF43914.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 599

 Score =  284 bits (727), Expect = 2e-74
 Identities = 158/342 (46%), Positives = 212/342 (61%), Gaps = 3/342 (0%)
 Frame = +3

Query: 3    KNSKQSSWPPTNWRLQVQETQLISQVSKILLQRQTKYWEXXXXXXXXXXXXXXXXFLRIL 182
            + +  +S    +WR ++Q+ QL+S++S ILLQR    W                 F +IL
Sbjct: 31   RKTYSTSTSKISWRTRIQQNQLVSEISTILLQRNN--WIPLLQNLNLSSKLTPFLFFQIL 88

Query: 183  QKTQPSPQISLDFFNWAKKNLRFQPDLKSHCKLTHLLFESGLSGLAKPVLASIIQSYPSD 362
             KTQ   QISL+FFNWAK NL F PDLKS C +  L   S L   AK +L S+I++YPS+
Sbjct: 89   HKTQTHAQISLNFFNWAKTNLNFNPDLKSQCHVIQLSLGSDLPRAAKKILDSLIKTYPSN 148

Query: 363  EIVSSFSKVANFDTYSSVLCS---VLKGYCNRGLYWEALKVYVKAKESGHGVISVHVFNS 533
              + +  +       SS+LC+   VL+ Y ++G + E L+VY K +  G    SVH  N 
Sbjct: 149  LFLETMVQACRGK--SSLLCTLNFVLEFYSHKGSFLEGLEVYKKMRVIGC-TPSVHACNV 205

Query: 534  LLHLLQVQNETRLAWCVYSSMIRHVVSENQFTWPIIARILCKDAKFERIGRILDMGINSS 713
            LL  LQ ++E RLAWC Y +MIR  V  ++FTW ++A ILCKD  FERI ++LDMGI +S
Sbjct: 206  LLDALQRESEIRLAWCFYCAMIRVGVLPDKFTWSLVAHILCKDGNFERIVKLLDMGICNS 265

Query: 714  SLYDLIVQNYSERGDFDAAFGYLGKMYDKKLDPSFSIYSSILDGACKCGDTEVIEMVTRT 893
             +Y+ +V  YS+ GDF AAF  L +MYD+K++P FS YSSILDGACKC + +VIE V   
Sbjct: 266  VMYNAVVDYYSKNGDFKAAFCRLNEMYDRKVEPGFSTYSSILDGACKCRNLQVIERVVAI 325

Query: 894  MIEKGYIPKGVTSKYDSVIQKLSDLGKTYAAKLFLSRACVEK 1019
            M+ K  + K  +S YDS+IQKL DLGK  AA LF  RAC E+
Sbjct: 326  MVGKQLLSKCPSSDYDSIIQKLCDLGKVSAATLFFKRACDER 367


>ref|XP_002305605.1| predicted protein [Populus trichocarpa] gi|222848569|gb|EEE86116.1|
            predicted protein [Populus trichocarpa]
          Length = 564

 Score =  262 bits (669), Expect = 1e-67
 Identities = 143/328 (43%), Positives = 206/328 (62%), Gaps = 1/328 (0%)
 Frame = +3

Query: 39   WRLQVQETQLISQVSKILLQRQTKYWEXXXXXXXXXXXXXXXXFLRILQKTQPSPQISLD 218
            WR+Q+++ QL+ Q+S ILLQR    W                 F +IL KTQ +PQISL 
Sbjct: 14   WRIQIRQNQLVFQISSILLQRHN--WVSLLQNFNLSTKLTPPLFNQILHKTQTNPQISLR 71

Query: 219  FFNWAKKNLRFQPDLKSHCKLTHLLFESGLSGLAKPVLASIIQSYPSDEIVSSFSKVANF 398
            FFNW + NL+ +PDLKS C + ++   SGL+   +P++ S+++++    +  +       
Sbjct: 72   FFNWVQTNLKLKPDLKSQCHIINICVNSGLTLPVRPIMDSLVKTHHVSVLGEAMVDSCRG 131

Query: 399  DTYSSVLCS-VLKGYCNRGLYWEALKVYVKAKESGHGVISVHVFNSLLHLLQVQNETRLA 575
             +  S   S VL+ Y ++GL+ E+L+++ K + +G  + S    NS+L +LQ +NE +LA
Sbjct: 132  KSLKSDAFSFVLECYSHKGLFMESLEMFRKMRGNGF-IASGTACNSVLDVLQRENEIKLA 190

Query: 576  WCVYSSMIRHVVSENQFTWPIIARILCKDAKFERIGRILDMGINSSSLYDLIVQNYSERG 755
            WC Y +MI+  V  ++ TW +IA+ILCKD  FERI + LDMG+ +S LY+ ++   S+RG
Sbjct: 191  WCFYCAMIKDGVLPDKLTWSLIAQILCKDGNFERIVKFLDMGVYNSVLYNGVIDCCSKRG 250

Query: 756  DFDAAFGYLGKMYDKKLDPSFSIYSSILDGACKCGDTEVIEMVTRTMIEKGYIPKGVTSK 935
            DF+AAF  L +M ++KLDP FS YS+ILDGACK G+ EVIE V   M EKG +PK   S+
Sbjct: 251  DFEAAFERLNQMCERKLDPGFSTYSAILDGACKHGNEEVIERVMDIMAEKGLLPKCPLSQ 310

Query: 936  YDSVIQKLSDLGKTYAAKLFLSRACVEK 1019
             DSVIQK SDL K   A +F  RAC EK
Sbjct: 311  CDSVIQKFSDLCKMNVATMFFRRACDEK 338


>ref|XP_002264956.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21170-like
            [Vitis vinifera]
          Length = 569

 Score =  260 bits (664), Expect = 4e-67
 Identities = 152/340 (44%), Positives = 212/340 (62%), Gaps = 2/340 (0%)
 Frame = +3

Query: 6    NSKQSSWPPTNWRLQVQETQLISQVSKILLQRQTKYWEXXXXXXXXXXXXXXXXFLRILQ 185
            N    S  P NWR Q+++ QLISQ+S ILLQR    W                 F +IL 
Sbjct: 11   NQFSKSTTPLNWRAQIKQNQLISQISSILLQRHN--WVTLLRNFNLSSKLTPSLFHQILL 68

Query: 186  KTQPSPQISLDFFNWAKKNLRFQPDLKSHCKLTHLLFESGLSGLAKPVLASIIQSYPSDE 365
            KTQ +PQ SL FFNW + NL FQPDL +H ++  +  +SGL   AK +L S+I++     
Sbjct: 69   KTQKNPQSSLSFFNWVRTNLGFQPDLAAHSQIIRISIQSGLFQPAKGILDSLIETQKVSV 128

Query: 366  IVSSFSKVANF-DTYSSVLCSVLKGYCNRGLYWEALKVYVKAKESGHGVISVHVFNSLLH 542
            +V S  +     D+ S VL  VL+ Y ++GL+ EAL+V+ +    G+ V SV   N+LL 
Sbjct: 129  LVDSVIQACRGKDSESPVLGFVLECYSSKGLFIEALEVFRRITIHGY-VPSVRSCNALLD 187

Query: 543  LLQVQNETRLAWCVYSSMIRHVVSENQFTWPIIARILCKDAKFERIGRILDMGINSSSL- 719
             LQ +NE +LAWCV  ++IR+ V  +      IA ILCK+ K ER+ R+LDM I  ++L 
Sbjct: 188  SLQRENEIKLAWCVCGALIRNGVLPDYVR---IALILCKNGKLERVVRLLDMSIVCNALI 244

Query: 720  YDLIVQNYSERGDFDAAFGYLGKMYDKKLDPSFSIYSSILDGACKCGDTEVIEMVTRTMI 899
            Y L++  Y ERG+F AAF YL +M ++K DP F  Y+SILDGACK  + EVI++V  +M+
Sbjct: 245  YKLVIDCYCERGNFSAAFHYLNEMCNRKFDPGFCAYNSILDGACKYENDEVIQIVMGSMV 304

Query: 900  EKGYIPKGVTSKYDSVIQKLSDLGKTYAAKLFLSRACVEK 1019
            EKG +PK + S+YDS+IQK+ +LGKT+AA++F  RA  EK
Sbjct: 305  EKGLLPKLLLSEYDSIIQKICNLGKTHAAQMFFKRARNEK 344


>ref|NP_193849.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|332659015|gb|AEE84415.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 551

 Score =  199 bits (507), Expect = 7e-49
 Identities = 109/328 (33%), Positives = 192/328 (58%), Gaps = 2/328 (0%)
 Frame = +3

Query: 33   TNWRLQVQETQLISQVSKILLQRQTKYWEXXXXXXXXXXXXXXXX-FLRILQKTQPSPQI 209
            ++W+ Q    ++ +++S ILLQR+                      FL+IL++T+  P+ 
Sbjct: 26   SDWKTQQTLFRVATEISSILLQRRNWITHLQYVKSKLPRSTLTSPVFLQILRETRKCPKT 85

Query: 210  SLDFFNWAKKNLRFQPDLKSHCKLTHLLFESGLSGLAKPVLASIIQSYPSDEIVSSFSKV 389
            +LDFF++AK +LRF+PDLKSHC++  +  ESGL   A+ +L  ++++     +V    + 
Sbjct: 86   TLDFFDFAKTHLRFEPDLKSHCRVIEVAAESGLLERAEMLLRPLVETNSVSLVVGEMHRW 145

Query: 390  ANFDTYSSVLCS-VLKGYCNRGLYWEALKVYVKAKESGHGVISVHVFNSLLHLLQVQNET 566
               +   SV  S VL+ Y  +G +   L+V+   +       S   +NSLL  L  +N+ 
Sbjct: 146  FEGEVSLSVSLSLVLEYYALKGSHHNGLEVFGFMRRLRLSP-SQSAYNSLLGSLVKENQF 204

Query: 567  RLAWCVYSSMIRHVVSENQFTWPIIARILCKDAKFERIGRILDMGINSSSLYDLIVQNYS 746
            R+A C+YS+M+R+ +  ++ TW +IA+ILC+  + + + ++++ G+ S  +Y  +V+ YS
Sbjct: 205  RVALCLYSAMVRNGIVSDELTWDLIAQILCEQGRSKSVFKLMETGVESCKIYTNLVECYS 264

Query: 747  ERGDFDAAFGYLGKMYDKKLDPSFSIYSSILDGACKCGDTEVIEMVTRTMIEKGYIPKGV 926
              G+FDA F  + +M DKKL+ SF  Y  +LD AC+ GD E I+ V   M+EK ++  G 
Sbjct: 265  RNGEFDAVFSLIHEMDDKKLELSFCSYGCVLDDACRLGDAEFIDKVLCLMVEKKFVTLGD 324

Query: 927  TSKYDSVIQKLSDLGKTYAAKLFLSRAC 1010
            ++  D +I++L D+GKT+A+++   +AC
Sbjct: 325  SAVNDKIIERLCDMGKTFASEMLFRKAC 352


>sp|O49558.2|PP331_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g21170
          Length = 585

 Score =  199 bits (507), Expect = 7e-49
 Identities = 109/328 (33%), Positives = 192/328 (58%), Gaps = 2/328 (0%)
 Frame = +3

Query: 33   TNWRLQVQETQLISQVSKILLQRQTKYWEXXXXXXXXXXXXXXXX-FLRILQKTQPSPQI 209
            ++W+ Q    ++ +++S ILLQR+                      FL+IL++T+  P+ 
Sbjct: 26   SDWKTQQTLFRVATEISSILLQRRNWITHLQYVKSKLPRSTLTSPVFLQILRETRKCPKT 85

Query: 210  SLDFFNWAKKNLRFQPDLKSHCKLTHLLFESGLSGLAKPVLASIIQSYPSDEIVSSFSKV 389
            +LDFF++AK +LRF+PDLKSHC++  +  ESGL   A+ +L  ++++     +V    + 
Sbjct: 86   TLDFFDFAKTHLRFEPDLKSHCRVIEVAAESGLLERAEMLLRPLVETNSVSLVVGEMHRW 145

Query: 390  ANFDTYSSVLCS-VLKGYCNRGLYWEALKVYVKAKESGHGVISVHVFNSLLHLLQVQNET 566
               +   SV  S VL+ Y  +G +   L+V+   +       S   +NSLL  L  +N+ 
Sbjct: 146  FEGEVSLSVSLSLVLEYYALKGSHHNGLEVFGFMRRLRLSP-SQSAYNSLLGSLVKENQF 204

Query: 567  RLAWCVYSSMIRHVVSENQFTWPIIARILCKDAKFERIGRILDMGINSSSLYDLIVQNYS 746
            R+A C+YS+M+R+ +  ++ TW +IA+ILC+  + + + ++++ G+ S  +Y  +V+ YS
Sbjct: 205  RVALCLYSAMVRNGIVSDELTWDLIAQILCEQGRSKSVFKLMETGVESCKIYTNLVECYS 264

Query: 747  ERGDFDAAFGYLGKMYDKKLDPSFSIYSSILDGACKCGDTEVIEMVTRTMIEKGYIPKGV 926
              G+FDA F  + +M DKKL+ SF  Y  +LD AC+ GD E I+ V   M+EK ++  G 
Sbjct: 265  RNGEFDAVFSLIHEMDDKKLELSFCSYGCVLDDACRLGDAEFIDKVLCLMVEKKFVTLGD 324

Query: 927  TSKYDSVIQKLSDLGKTYAAKLFLSRAC 1010
            ++  D +I++L D+GKT+A+++   +AC
Sbjct: 325  SAVNDKIIERLCDMGKTFASEMLFRKAC 352


Top