BLASTX nr result

ID: Akebia23_contig00025720 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00025720
         (1368 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002265722.2| PREDICTED: pentatricopeptide repeat-containi...   370   e-100
emb|CBI30711.3| unnamed protein product [Vitis vinifera]              370   e-100
ref|XP_002305733.2| hypothetical protein POPTR_0004s05810g, part...   360   6e-97
ref|XP_007025334.1| Pentatricopeptide, putative [Theobroma cacao...   359   1e-96
ref|XP_004150218.1| PREDICTED: pentatricopeptide repeat-containi...   359   1e-96
ref|XP_007214267.1| hypothetical protein PRUPE_ppa025121mg [Prun...   357   7e-96
ref|XP_006467621.1| PREDICTED: pentatricopeptide repeat-containi...   355   3e-95
ref|XP_006449535.1| hypothetical protein CICLE_v10014413mg [Citr...   355   3e-95
gb|EXB84044.1| hypothetical protein L484_005808 [Morus notabilis]     353   1e-94
gb|ACD56635.1| putative pentatricopeptide repeat protein [Gossyp...   349   1e-93
ref|XP_006414062.1| hypothetical protein EUTSA_v10024377mg [Eutr...   346   1e-92
gb|ACD56648.1| putative pentatricopeptide repeat protein [Gossyp...   346   1e-92
gb|ACD56662.1| putative pentatricopeptide [Gossypium arboreum]        345   3e-92
gb|AHB18405.1| pentatricopeptide repeat-containing protein [Goss...   344   6e-92
gb|AAT64030.1| putative pentatricopeptide repeat protein [Gossyp...   344   6e-92
ref|XP_004294643.1| PREDICTED: pentatricopeptide repeat-containi...   343   1e-91
gb|AAT64016.1| putative pentatricopeptide repeat protein [Gossyp...   343   1e-91
dbj|BAD94843.1| putative protein [Arabidopsis thaliana]               340   1e-90
ref|NP_193610.1| pentatricopeptide repeat protein DOT4 [Arabidop...   340   1e-90
ref|XP_006597484.1| PREDICTED: pentatricopeptide repeat-containi...   338   3e-90

>ref|XP_002265722.2| PREDICTED: pentatricopeptide repeat-containing protein At4g18750,
            chloroplastic-like [Vitis vinifera]
          Length = 824

 Score =  370 bits (951), Expect = e-100
 Identities = 174/215 (80%), Positives = 187/215 (86%)
 Frame = -2

Query: 1367 KFIETMPIEPDSTIWGALLCGCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAE 1188
            KFI+ MPIEPD+TIWGALLCGCRI+ DVKLAEKVAE VFELEPENTGYYVLLANIYAEAE
Sbjct: 610  KFIKMMPIEPDATIWGALLCGCRIYHDVKLAEKVAEHVFELEPENTGYYVLLANIYAEAE 669

Query: 1187 KWEEVKKLRETIGRRGLRKKPGCSWIEMKNQVHVFVAGDRSHPQFKTIELLLKRVRMRMK 1008
            KWEEVKKLRE IGRRGLRK PGCSWIE+K +VH+FV GD SHP    IELLLK+ R RMK
Sbjct: 670  KWEEVKKLRERIGRRGLRKNPGCSWIEIKGKVHIFVTGDSSHPLANKIELLLKKTRTRMK 729

Query: 1007 EEGSLPKKRYALINGGDVEKEEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHE 828
            EEG  PK RYALI   D EKE  LCGHSEK+AMA+GI SLPPGKTVRVTKNLRVCGDCHE
Sbjct: 730  EEGHFPKMRYALIKADDTEKEMALCGHSEKIAMAFGILSLPPGKTVRVTKNLRVCGDCHE 789

Query: 827  MAKFMSKMFGKEIVLRDSNRFHHFMDGRCSCRGYW 723
            MAKFMSKM  ++I+LRDSNRFHHF DG CSCRG+W
Sbjct: 790  MAKFMSKMVKRDIILRDSNRFHHFKDGSCSCRGHW 824


>emb|CBI30711.3| unnamed protein product [Vitis vinifera]
          Length = 697

 Score =  370 bits (951), Expect = e-100
 Identities = 174/215 (80%), Positives = 187/215 (86%)
 Frame = -2

Query: 1367 KFIETMPIEPDSTIWGALLCGCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAE 1188
            KFI+ MPIEPD+TIWGALLCGCRI+ DVKLAEKVAE VFELEPENTGYYVLLANIYAEAE
Sbjct: 483  KFIKMMPIEPDATIWGALLCGCRIYHDVKLAEKVAEHVFELEPENTGYYVLLANIYAEAE 542

Query: 1187 KWEEVKKLRETIGRRGLRKKPGCSWIEMKNQVHVFVAGDRSHPQFKTIELLLKRVRMRMK 1008
            KWEEVKKLRE IGRRGLRK PGCSWIE+K +VH+FV GD SHP    IELLLK+ R RMK
Sbjct: 543  KWEEVKKLRERIGRRGLRKNPGCSWIEIKGKVHIFVTGDSSHPLANKIELLLKKTRTRMK 602

Query: 1007 EEGSLPKKRYALINGGDVEKEEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHE 828
            EEG  PK RYALI   D EKE  LCGHSEK+AMA+GI SLPPGKTVRVTKNLRVCGDCHE
Sbjct: 603  EEGHFPKMRYALIKADDTEKEMALCGHSEKIAMAFGILSLPPGKTVRVTKNLRVCGDCHE 662

Query: 827  MAKFMSKMFGKEIVLRDSNRFHHFMDGRCSCRGYW 723
            MAKFMSKM  ++I+LRDSNRFHHF DG CSCRG+W
Sbjct: 663  MAKFMSKMVKRDIILRDSNRFHHFKDGSCSCRGHW 697


>ref|XP_002305733.2| hypothetical protein POPTR_0004s05810g, partial [Populus trichocarpa]
            gi|550340410|gb|EEE86244.2| hypothetical protein
            POPTR_0004s05810g, partial [Populus trichocarpa]
          Length = 778

 Score =  360 bits (925), Expect = 6e-97
 Identities = 166/215 (77%), Positives = 190/215 (88%)
 Frame = -2

Query: 1367 KFIETMPIEPDSTIWGALLCGCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAE 1188
            KFI++MPIEPD+TIWGALL GCRIH DVKLAEKVAE VFELEPENTGYYVLLAN YAEAE
Sbjct: 564  KFIKSMPIEPDATIWGALLSGCRIHHDVKLAEKVAEHVFELEPENTGYYVLLANTYAEAE 623

Query: 1187 KWEEVKKLRETIGRRGLRKKPGCSWIEMKNQVHVFVAGDRSHPQFKTIELLLKRVRMRMK 1008
            KWEEVKKLR+ IGRRGL+K PGCSWIE+K++VH+F+AG+ SHPQ K IE+LLKR+R +MK
Sbjct: 624  KWEEVKKLRQKIGRRGLKKNPGCSWIEVKSKVHIFLAGNSSHPQAKKIEVLLKRLRSKMK 683

Query: 1007 EEGSLPKKRYALINGGDVEKEEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHE 828
            EEG  PK RYALIN   ++KE  LCGHSEKLAMA+GI +LPP +T+RV+KNLRVCGDCHE
Sbjct: 684  EEGYFPKTRYALINADSLQKETALCGHSEKLAMAFGILNLPPARTIRVSKNLRVCGDCHE 743

Query: 827  MAKFMSKMFGKEIVLRDSNRFHHFMDGRCSCRGYW 723
            MAKF+SK  G+EIVLRDSNRFHHF DG C CRG+W
Sbjct: 744  MAKFISKTLGREIVLRDSNRFHHFKDGVCCCRGFW 778


>ref|XP_007025334.1| Pentatricopeptide, putative [Theobroma cacao]
            gi|508780700|gb|EOY27956.1| Pentatricopeptide, putative
            [Theobroma cacao]
          Length = 874

 Score =  359 bits (922), Expect = 1e-96
 Identities = 166/214 (77%), Positives = 189/214 (88%)
 Frame = -2

Query: 1364 FIETMPIEPDSTIWGALLCGCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAEK 1185
            FIE MPI PD+TIWGA+LCGCRI+ DVKLAE+VAERVFELEPENTGYYVLLANIYAEAEK
Sbjct: 661  FIERMPIAPDATIWGAVLCGCRIYHDVKLAERVAERVFELEPENTGYYVLLANIYAEAEK 720

Query: 1184 WEEVKKLRETIGRRGLRKKPGCSWIEMKNQVHVFVAGDRSHPQFKTIELLLKRVRMRMKE 1005
            WEEVK++RE IGR+GLRK PGCSWIE+K +V++FVAGD SHPQ K IE LLK++R +MK 
Sbjct: 721  WEEVKRVRERIGRKGLRKNPGCSWIEIKGKVNLFVAGDSSHPQSKKIESLLKKLRRKMKG 780

Query: 1004 EGSLPKKRYALINGGDVEKEEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHEM 825
            EG  PK +YALIN  D++KE  LCGHSEKLAMA+G+ SLPP KT+RVTKNLR+CGDCHEM
Sbjct: 781  EGYFPKTKYALINADDMQKEMALCGHSEKLAMAFGLLSLPPSKTIRVTKNLRICGDCHEM 840

Query: 824  AKFMSKMFGKEIVLRDSNRFHHFMDGRCSCRGYW 723
            AKFMSK  G+EIVLRDSNRFHHF DG CSCRG+W
Sbjct: 841  AKFMSKETGREIVLRDSNRFHHFKDGYCSCRGFW 874


>ref|XP_004150218.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18750,
            chloroplastic-like [Cucumis sativus]
            gi|449500809|ref|XP_004161200.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g18750,
            chloroplastic-like [Cucumis sativus]
          Length = 926

 Score =  359 bits (922), Expect = 1e-96
 Identities = 166/215 (77%), Positives = 190/215 (88%)
 Frame = -2

Query: 1367 KFIETMPIEPDSTIWGALLCGCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAE 1188
            KFI+ MPI+PD+TIWGALLCGCRIH DVKLAEKVAER+FELEPENTGYYVLLANIYAEAE
Sbjct: 712  KFIKAMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAEAE 771

Query: 1187 KWEEVKKLRETIGRRGLRKKPGCSWIEMKNQVHVFVAGDRSHPQFKTIELLLKRVRMRMK 1008
            KWEEV+KLR+ IG+RGL+K PGCSWIE+K ++++FVAGD S PQ K IELLLKR+R +MK
Sbjct: 772  KWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKINIFVAGDCSKPQAKKIELLLKRLRSKMK 831

Query: 1007 EEGSLPKKRYALINGGDVEKEEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHE 828
            EEG  PK  YAL+N  + EKE  LCGHSEKLAMA+G+ +LPPGKT+RVTKNLRVCGDCHE
Sbjct: 832  EEGYSPKTAYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHE 891

Query: 827  MAKFMSKMFGKEIVLRDSNRFHHFMDGRCSCRGYW 723
            MAKFMSK   +EI+LRDS+RFHHF DG CSCRGYW
Sbjct: 892  MAKFMSKSASREIILRDSSRFHHFKDGSCSCRGYW 926


>ref|XP_007214267.1| hypothetical protein PRUPE_ppa025121mg [Prunus persica]
            gi|462410132|gb|EMJ15466.1| hypothetical protein
            PRUPE_ppa025121mg [Prunus persica]
          Length = 796

 Score =  357 bits (916), Expect = 7e-96
 Identities = 167/215 (77%), Positives = 188/215 (87%)
 Frame = -2

Query: 1367 KFIETMPIEPDSTIWGALLCGCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAE 1188
            KFI  MPIEPD+TIWG+LLCGCRIH DVKLAEKVAERVFELEPENTGYYVLLANIYAEAE
Sbjct: 582  KFINKMPIEPDATIWGSLLCGCRIHHDVKLAEKVAERVFELEPENTGYYVLLANIYAEAE 641

Query: 1187 KWEEVKKLRETIGRRGLRKKPGCSWIEMKNQVHVFVAGDRSHPQFKTIELLLKRVRMRMK 1008
            KWEEVKKLRE IGR+GL+K PGCSWIE+K +V +FVAG+ SHPQ   IE LLKR+R++MK
Sbjct: 642  KWEEVKKLRERIGRQGLKKNPGCSWIEIKGKVQIFVAGNSSHPQATKIESLLKRLRLKMK 701

Query: 1007 EEGSLPKKRYALINGGDVEKEEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHE 828
            EEG  PK +YALIN  ++EKE  LCGHSEKLA+A+GI +LPPGKT+RVTKNLRVC DCHE
Sbjct: 702  EEGYSPKMQYALINADEMEKEVALCGHSEKLAIAFGILNLPPGKTIRVTKNLRVCSDCHE 761

Query: 827  MAKFMSKMFGKEIVLRDSNRFHHFMDGRCSCRGYW 723
            MAKF+SK   +EIVLRDSNRFHH  DG CSCRG+W
Sbjct: 762  MAKFISKTSRREIVLRDSNRFHHMKDGICSCRGFW 796


>ref|XP_006467621.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18750,
            chloroplastic-like [Citrus sinensis]
          Length = 872

 Score =  355 bits (911), Expect = 3e-95
 Identities = 164/215 (76%), Positives = 186/215 (86%)
 Frame = -2

Query: 1367 KFIETMPIEPDSTIWGALLCGCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAE 1188
            +FIE MP+ PD+TIWG+LLCGCRIH +VKLAEKVAE VFELEP+NTGYYVLLAN+YAEAE
Sbjct: 658  RFIEMMPVAPDATIWGSLLCGCRIHHEVKLAEKVAEHVFELEPDNTGYYVLLANVYAEAE 717

Query: 1187 KWEEVKKLRETIGRRGLRKKPGCSWIEMKNQVHVFVAGDRSHPQFKTIELLLKRVRMRMK 1008
            KWEEVKKLRE I RRGL+K PGCSWIE+K +V++FVAG  SHP  K IE LLKR+R+ MK
Sbjct: 718  KWEEVKKLREKISRRGLKKNPGCSWIEIKGKVNIFVAGGSSHPHAKKIESLLKRLRLEMK 777

Query: 1007 EEGSLPKKRYALINGGDVEKEEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHE 828
             EG  PK RYALIN  ++EKE  LCGHSEKLAMA+GI +LP G+T+RVTKNLRVCGDCHE
Sbjct: 778  REGYFPKTRYALINADEMEKEVALCGHSEKLAMAFGILNLPAGQTIRVTKNLRVCGDCHE 837

Query: 827  MAKFMSKMFGKEIVLRDSNRFHHFMDGRCSCRGYW 723
            MAKFMSK   +EIVLRDSNRFHHF DGRCSCRG+W
Sbjct: 838  MAKFMSKTARREIVLRDSNRFHHFKDGRCSCRGFW 872


>ref|XP_006449535.1| hypothetical protein CICLE_v10014413mg [Citrus clementina]
            gi|557552146|gb|ESR62775.1| hypothetical protein
            CICLE_v10014413mg [Citrus clementina]
          Length = 725

 Score =  355 bits (910), Expect = 3e-95
 Identities = 164/215 (76%), Positives = 186/215 (86%)
 Frame = -2

Query: 1367 KFIETMPIEPDSTIWGALLCGCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAE 1188
            +FIE MP+ PD+TIWG+LLCGCRIH +V+LAEKVAE VFELEP+NTGYYVLLAN+YAEAE
Sbjct: 511  RFIEMMPVAPDATIWGSLLCGCRIHHEVQLAEKVAEHVFELEPDNTGYYVLLANVYAEAE 570

Query: 1187 KWEEVKKLRETIGRRGLRKKPGCSWIEMKNQVHVFVAGDRSHPQFKTIELLLKRVRMRMK 1008
            KWEEVKKLRE I RRGL+K PGCSWIE+K +V++FVAG  SHP  K IE LLKR+R+ MK
Sbjct: 571  KWEEVKKLREKISRRGLKKNPGCSWIEIKGKVNIFVAGGSSHPHAKKIESLLKRLRLEMK 630

Query: 1007 EEGSLPKKRYALINGGDVEKEEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHE 828
             EG  PK RYALIN  ++EKE  LCGHSEKLAMA+GI SLP G+T+RVTKNLRVCGDCHE
Sbjct: 631  REGYFPKTRYALINADEMEKEVALCGHSEKLAMAFGILSLPAGQTIRVTKNLRVCGDCHE 690

Query: 827  MAKFMSKMFGKEIVLRDSNRFHHFMDGRCSCRGYW 723
            MAKFMSK   +EIVLRDSNRFHHF DGRCSCRG+W
Sbjct: 691  MAKFMSKTARREIVLRDSNRFHHFKDGRCSCRGFW 725


>gb|EXB84044.1| hypothetical protein L484_005808 [Morus notabilis]
          Length = 877

 Score =  353 bits (906), Expect = 1e-94
 Identities = 164/215 (76%), Positives = 186/215 (86%)
 Frame = -2

Query: 1367 KFIETMPIEPDSTIWGALLCGCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAE 1188
            +FI  MPIEPD+TIWGALLCGCR + DVKLAE+VAE VFELEP+NTGYYVLLANIYAEAE
Sbjct: 663  RFIRKMPIEPDATIWGALLCGCRTYHDVKLAERVAEHVFELEPDNTGYYVLLANIYAEAE 722

Query: 1187 KWEEVKKLRETIGRRGLRKKPGCSWIEMKNQVHVFVAGDRSHPQFKTIELLLKRVRMRMK 1008
            KWEEV+KLRE IGRRGL+K PGCSWIE+K +V++FVAGD S P  K IE LLKR+R +MK
Sbjct: 723  KWEEVRKLREKIGRRGLKKNPGCSWIEIKGKVNIFVAGDDSQPLAKKIESLLKRLRAKMK 782

Query: 1007 EEGSLPKKRYALINGGDVEKEEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHE 828
            EEG  P  +YALIN  ++EKE  LCGHSEKLAMA+G+ SLPPGKT+RVTKNLRVCGDCHE
Sbjct: 783  EEGFYPNMKYALINADEMEKEVALCGHSEKLAMAFGMLSLPPGKTIRVTKNLRVCGDCHE 842

Query: 827  MAKFMSKMFGKEIVLRDSNRFHHFMDGRCSCRGYW 723
             AKF+SKM  +EIVLRDSNRFHHF DG CSCRG+W
Sbjct: 843  TAKFISKMSSREIVLRDSNRFHHFKDGHCSCRGFW 877


>gb|ACD56635.1| putative pentatricopeptide repeat protein [Gossypium raimondii]
          Length = 667

 Score =  349 bits (896), Expect = 1e-93
 Identities = 162/216 (75%), Positives = 190/216 (87%), Gaps = 1/216 (0%)
 Frame = -2

Query: 1367 KFIETMPIEPDSTIWGALLCGCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAE 1188
            KFIET+PI PD+TIWGALLCGCRI+ D++LAEKVAERVFELEPENTGYYVLLANIYAEAE
Sbjct: 452  KFIETLPIAPDATIWGALLCGCRIYHDIELAEKVAERVFELEPENTGYYVLLANIYAEAE 511

Query: 1187 KWEEVKKLRETIGRRGLRKKPGCSWIEMKNQVHVFVAGDRS-HPQFKTIELLLKRVRMRM 1011
            KWEEVK++RE IG++GLRK PGCSWIE+K +V++FV+G+ S HP  K IE LLK++R +M
Sbjct: 512  KWEEVKRMREKIGKKGLRKNPGCSWIEIKGRVNLFVSGNNSSHPHSKKIESLLKKMRRKM 571

Query: 1010 KEEGSLPKKRYALINGGDVEKEEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCH 831
            KEEG  PK +YALIN  +++KE  LCGHSEKLAMA+G+ +LPP KT+RVTKNLRVCGDCH
Sbjct: 572  KEEGYFPKTKYALINADEMQKEMALCGHSEKLAMAFGLLTLPPRKTIRVTKNLRVCGDCH 631

Query: 830  EMAKFMSKMFGKEIVLRDSNRFHHFMDGRCSCRGYW 723
            EMAKFMSK   +EIVLRDSNRFHHF DG CSCRG+W
Sbjct: 632  EMAKFMSKETRREIVLRDSNRFHHFKDGYCSCRGFW 667


>ref|XP_006414062.1| hypothetical protein EUTSA_v10024377mg [Eutrema salsugineum]
            gi|557115232|gb|ESQ55515.1| hypothetical protein
            EUTSA_v10024377mg [Eutrema salsugineum]
          Length = 872

 Score =  346 bits (888), Expect = 1e-92
 Identities = 161/215 (74%), Positives = 185/215 (86%)
 Frame = -2

Query: 1367 KFIETMPIEPDSTIWGALLCGCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAE 1188
            +FIE MPI PD+TIWGALLCGCRIH DVKLAEKVAE+VF LEP+NTGYYVL+ANIYAEAE
Sbjct: 658  RFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFALEPDNTGYYVLMANIYAEAE 717

Query: 1187 KWEEVKKLRETIGRRGLRKKPGCSWIEMKNQVHVFVAGDRSHPQFKTIELLLKRVRMRMK 1008
            KWEEVK+LR+ IGRRGLRK PGCSWIE+K +V++FVAGD SHP+ + IE  L+RVR RM+
Sbjct: 718  KWEEVKRLRKRIGRRGLRKNPGCSWIEIKGKVNIFVAGDSSHPETEKIEAFLRRVRARMR 777

Query: 1007 EEGSLPKKRYALINGGDVEKEEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHE 828
            EEG  P+ +YALI+  ++EKEE LCGHSEKLAMA GI S   GK +RVTKNLRVCGDCHE
Sbjct: 778  EEGYSPQTKYALIDAEEMEKEEALCGHSEKLAMALGILSSGHGKIIRVTKNLRVCGDCHE 837

Query: 827  MAKFMSKMFGKEIVLRDSNRFHHFMDGRCSCRGYW 723
            MAK MSK+  +EIVLRDSNRFHHF DG CSCRG+W
Sbjct: 838  MAKLMSKLTRREIVLRDSNRFHHFKDGHCSCRGFW 872


>gb|ACD56648.1| putative pentatricopeptide repeat protein [Gossypioides kirkii]
          Length = 805

 Score =  346 bits (888), Expect = 1e-92
 Identities = 161/216 (74%), Positives = 190/216 (87%), Gaps = 1/216 (0%)
 Frame = -2

Query: 1367 KFIETMPIEPDSTIWGALLCGCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAE 1188
            +FIET+PI PD+TIWGALLCGCRI+ D++LAEKVAERVFELEPENTGYYVLLANIYAEAE
Sbjct: 590  EFIETLPIAPDATIWGALLCGCRIYHDIELAEKVAERVFELEPENTGYYVLLANIYAEAE 649

Query: 1187 KWEEVKKLRETIGRRGLRKKPGCSWIEMKNQVHVFVAGDRS-HPQFKTIELLLKRVRMRM 1011
            KWEEVK++RE IG++GLRK PGCSWIE+K +V++FV+G+ S HP  K IE LLK++R +M
Sbjct: 650  KWEEVKRMREKIGKKGLRKNPGCSWIEIKGKVNLFVSGNNSSHPHSKKIESLLKKMRRKM 709

Query: 1010 KEEGSLPKKRYALINGGDVEKEEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCH 831
            KEEG  PK +YALIN  +++KE  LCGHSEKLAMA+G+ +LPP KTVRVTKNLRVCGDCH
Sbjct: 710  KEEGYFPKTKYALINADEMQKEMALCGHSEKLAMAFGLLALPPRKTVRVTKNLRVCGDCH 769

Query: 830  EMAKFMSKMFGKEIVLRDSNRFHHFMDGRCSCRGYW 723
            EMAKFMSK   +EIVLRDSNRFHHF +G CSCRG+W
Sbjct: 770  EMAKFMSKETRREIVLRDSNRFHHFKNGYCSCRGFW 805


>gb|ACD56662.1| putative pentatricopeptide [Gossypium arboreum]
          Length = 805

 Score =  345 bits (885), Expect = 3e-92
 Identities = 160/216 (74%), Positives = 189/216 (87%), Gaps = 1/216 (0%)
 Frame = -2

Query: 1367 KFIETMPIEPDSTIWGALLCGCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAE 1188
            +F+ET+PI PD+TIWGALLCGCR + D++LAEKVAERVFELEPENTGYYVLLANIYAEAE
Sbjct: 590  EFMETLPIAPDATIWGALLCGCRNYHDIELAEKVAERVFELEPENTGYYVLLANIYAEAE 649

Query: 1187 KWEEVKKLRETIGRRGLRKKPGCSWIEMKNQVHVFVAGDRS-HPQFKTIELLLKRVRMRM 1011
            KWEEVK+LRE IG++GLRK PGCSWIE+K +V++FV+G+ S HP  K IE LLK++R +M
Sbjct: 650  KWEEVKRLREKIGKQGLRKNPGCSWIEIKGKVNLFVSGNNSSHPHSKNIESLLKKMRRKM 709

Query: 1010 KEEGSLPKKRYALINGGDVEKEEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCH 831
            KEEG  PK +YALIN  +++KE  LCGHSEKLAMA+G+ +LPP KT+RVTKNLRVCGDCH
Sbjct: 710  KEEGHFPKTKYALINADEMQKEMALCGHSEKLAMAFGLLTLPPRKTIRVTKNLRVCGDCH 769

Query: 830  EMAKFMSKMFGKEIVLRDSNRFHHFMDGRCSCRGYW 723
            EMAKFMSK   +EIVLRDSNRFHHF DG CSCRG+W
Sbjct: 770  EMAKFMSKETRREIVLRDSNRFHHFKDGYCSCRGFW 805


>gb|AHB18405.1| pentatricopeptide repeat-containing protein [Gossypium hirsutum]
          Length = 875

 Score =  344 bits (882), Expect = 6e-92
 Identities = 161/216 (74%), Positives = 189/216 (87%), Gaps = 1/216 (0%)
 Frame = -2

Query: 1367 KFIETMPIEPDSTIWGALLCGCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAE 1188
            KFIET+PI PD+TIWGALLCGCRI+ D++LAEKVAERVFELEPENTGYYVLLANIYAEAE
Sbjct: 660  KFIETLPIAPDATIWGALLCGCRIYHDIELAEKVAERVFELEPENTGYYVLLANIYAEAE 719

Query: 1187 KWEEVKKLRETIGRRGLRKKPGCSWIEMKNQVHVFVAGDRS-HPQFKTIELLLKRVRMRM 1011
            K EEVK++RE IG++GLRK PGCSWIE+K +V++FV+G+ S HP  K IE LLK++R +M
Sbjct: 720  KREEVKRMREKIGKKGLRKNPGCSWIEIKGRVNLFVSGNNSSHPHSKKIESLLKKMRRKM 779

Query: 1010 KEEGSLPKKRYALINGGDVEKEEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCH 831
            KEEG  PK +YALIN  +++KE  LCGHSEKLAMA+G+ +LPP KT+RVTKNLRVCGDCH
Sbjct: 780  KEEGYFPKTKYALINADEMQKEMALCGHSEKLAMAFGLLTLPPRKTIRVTKNLRVCGDCH 839

Query: 830  EMAKFMSKMFGKEIVLRDSNRFHHFMDGRCSCRGYW 723
            EMAKFMSK   +EIVLRDSNRFHHF DG CSCRG+W
Sbjct: 840  EMAKFMSKETRREIVLRDSNRFHHFKDGYCSCRGFW 875


>gb|AAT64030.1| putative pentatricopeptide repeat protein [Gossypium hirsutum]
          Length = 805

 Score =  344 bits (882), Expect = 6e-92
 Identities = 161/216 (74%), Positives = 189/216 (87%), Gaps = 1/216 (0%)
 Frame = -2

Query: 1367 KFIETMPIEPDSTIWGALLCGCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAE 1188
            KFIET+PI PD+TIWGALLCGCRI+ D++LAEKVAERVFELEPENTGYYVLLANIYAEAE
Sbjct: 590  KFIETLPIAPDATIWGALLCGCRIYHDIELAEKVAERVFELEPENTGYYVLLANIYAEAE 649

Query: 1187 KWEEVKKLRETIGRRGLRKKPGCSWIEMKNQVHVFVAGDRS-HPQFKTIELLLKRVRMRM 1011
            K EEVK++RE IG++GLRK PGCSWIE+K +V++FV+G+ S HP  K IE LLK++R +M
Sbjct: 650  KREEVKRMREKIGKKGLRKNPGCSWIEIKGRVNLFVSGNNSSHPHSKKIESLLKKMRRKM 709

Query: 1010 KEEGSLPKKRYALINGGDVEKEEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCH 831
            KEEG  PK +YALIN  +++KE  LCGHSEKLAMA+G+ +LPP KT+RVTKNLRVCGDCH
Sbjct: 710  KEEGYFPKTKYALINADEMQKEMALCGHSEKLAMAFGLLTLPPRKTIRVTKNLRVCGDCH 769

Query: 830  EMAKFMSKMFGKEIVLRDSNRFHHFMDGRCSCRGYW 723
            EMAKFMSK   +EIVLRDSNRFHHF DG CSCRG+W
Sbjct: 770  EMAKFMSKETRREIVLRDSNRFHHFKDGYCSCRGFW 805


>ref|XP_004294643.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18750,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 870

 Score =  343 bits (879), Expect = 1e-91
 Identities = 159/215 (73%), Positives = 181/215 (84%)
 Frame = -2

Query: 1367 KFIETMPIEPDSTIWGALLCGCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAE 1188
            KFI  MPIEPD+T+WG+LLCGCRIH DVKLAEKVAE VFELEPENTGYY+LLANIYAEAE
Sbjct: 656  KFINMMPIEPDATVWGSLLCGCRIHHDVKLAEKVAEHVFELEPENTGYYILLANIYAEAE 715

Query: 1187 KWEEVKKLRETIGRRGLRKKPGCSWIEMKNQVHVFVAGDRSHPQFKTIELLLKRVRMRMK 1008
            KWEEVKKLRE IGRR L+K PGCSWIE+K +V++FVAG  SHP    IE L+K+ R RMK
Sbjct: 716  KWEEVKKLRERIGRRSLKKNPGCSWIEIKGKVNIFVAGGTSHPDAMKIESLVKKFRSRMK 775

Query: 1007 EEGSLPKKRYALINGGDVEKEEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHE 828
            E+G  PK +YALIN  +VEKE  LC HSEKLA+A+GI + PP KT+RVTKNLRVCGDCHE
Sbjct: 776  EDGYNPKMQYALINADEVEKEVALCAHSEKLAIAFGILNTPPRKTIRVTKNLRVCGDCHE 835

Query: 827  MAKFMSKMFGKEIVLRDSNRFHHFMDGRCSCRGYW 723
            MAKF+S+   +EIVLRDSNRFHH  DG CSCRG+W
Sbjct: 836  MAKFISRTSRREIVLRDSNRFHHMKDGNCSCRGFW 870


>gb|AAT64016.1| putative pentatricopeptide repeat protein [Gossypium hirsutum]
          Length = 805

 Score =  343 bits (879), Expect = 1e-91
 Identities = 159/216 (73%), Positives = 188/216 (87%), Gaps = 1/216 (0%)
 Frame = -2

Query: 1367 KFIETMPIEPDSTIWGALLCGCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAE 1188
            +FIET+PI PD+TIWGALLCGCR + D++LAEKVAERVFELEPEN+GYYVLLANIYAEAE
Sbjct: 590  EFIETLPIAPDATIWGALLCGCRNYHDIELAEKVAERVFELEPENSGYYVLLANIYAEAE 649

Query: 1187 KWEEVKKLRETIGRRGLRKKPGCSWIEMKNQVHVFVAGDRS-HPQFKTIELLLKRVRMRM 1011
            KWEEVK+LRE IG++GLRK PGCSWIE+K +V++FV+G+ S HP  K IE LLK++R +M
Sbjct: 650  KWEEVKRLREKIGKQGLRKNPGCSWIEIKGKVNLFVSGNNSSHPHSKNIESLLKKMRRKM 709

Query: 1010 KEEGSLPKKRYALINGGDVEKEEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCH 831
            KEEG  PK +YALIN  +++KE  LCGHSEKLAMA+G+ +LPP KT+RVTKNLRVCGDCH
Sbjct: 710  KEEGHFPKTKYALINADEMQKEMALCGHSEKLAMAFGLLTLPPRKTIRVTKNLRVCGDCH 769

Query: 830  EMAKFMSKMFGKEIVLRDSNRFHHFMDGRCSCRGYW 723
            EMAKFMSK   +EIVLRD NRFHHF DG CSCRG+W
Sbjct: 770  EMAKFMSKETRREIVLRDPNRFHHFKDGYCSCRGFW 805


>dbj|BAD94843.1| putative protein [Arabidopsis thaliana]
          Length = 720

 Score =  340 bits (871), Expect = 1e-90
 Identities = 159/215 (73%), Positives = 184/215 (85%)
 Frame = -2

Query: 1367 KFIETMPIEPDSTIWGALLCGCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAE 1188
            +FIE MPI PD+TIWGALLCGCRIH DVKLAEKVAE+VFELEPENTGYYVL+ANIYAEAE
Sbjct: 506  RFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAE 565

Query: 1187 KWEEVKKLRETIGRRGLRKKPGCSWIEMKNQVHVFVAGDRSHPQFKTIELLLKRVRMRMK 1008
            KWE+VK+LR+ IG+RGLRK PGCSWIE+K +V++FVAGD S+P+ + IE  L++VR RM 
Sbjct: 566  KWEQVKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMI 625

Query: 1007 EEGSLPKKRYALINGGDVEKEEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHE 828
            EEG  P  +YALI+  ++EKEE LCGHSEKLAMA GI S   GK +RVTKNLRVCGDCHE
Sbjct: 626  EEGYSPLTKYALIDAEEMEKEEALCGHSEKLAMALGIISSGHGKIIRVTKNLRVCGDCHE 685

Query: 827  MAKFMSKMFGKEIVLRDSNRFHHFMDGRCSCRGYW 723
            MAKFMSK+  +EIVLRDSNRFH F DG CSCRG+W
Sbjct: 686  MAKFMSKLTRREIVLRDSNRFHQFKDGHCSCRGFW 720


>ref|NP_193610.1| pentatricopeptide repeat protein DOT4 [Arabidopsis thaliana]
            gi|75206861|sp|Q9SN39.1|PP320_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g18750, chloroplastic; Flags: Precursor
            gi|4539394|emb|CAB37460.1| putative protein [Arabidopsis
            thaliana] gi|7268669|emb|CAB78877.1| putative protein
            [Arabidopsis thaliana] gi|332658686|gb|AEE84086.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 871

 Score =  340 bits (871), Expect = 1e-90
 Identities = 159/215 (73%), Positives = 184/215 (85%)
 Frame = -2

Query: 1367 KFIETMPIEPDSTIWGALLCGCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAE 1188
            +FIE MPI PD+TIWGALLCGCRIH DVKLAEKVAE+VFELEPENTGYYVL+ANIYAEAE
Sbjct: 657  RFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAE 716

Query: 1187 KWEEVKKLRETIGRRGLRKKPGCSWIEMKNQVHVFVAGDRSHPQFKTIELLLKRVRMRMK 1008
            KWE+VK+LR+ IG+RGLRK PGCSWIE+K +V++FVAGD S+P+ + IE  L++VR RM 
Sbjct: 717  KWEQVKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMI 776

Query: 1007 EEGSLPKKRYALINGGDVEKEEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHE 828
            EEG  P  +YALI+  ++EKEE LCGHSEKLAMA GI S   GK +RVTKNLRVCGDCHE
Sbjct: 777  EEGYSPLTKYALIDAEEMEKEEALCGHSEKLAMALGIISSGHGKIIRVTKNLRVCGDCHE 836

Query: 827  MAKFMSKMFGKEIVLRDSNRFHHFMDGRCSCRGYW 723
            MAKFMSK+  +EIVLRDSNRFH F DG CSCRG+W
Sbjct: 837  MAKFMSKLTRREIVLRDSNRFHQFKDGHCSCRGFW 871


>ref|XP_006597484.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18750,
            chloroplastic-like [Glycine max]
          Length = 873

 Score =  338 bits (867), Expect = 3e-90
 Identities = 157/213 (73%), Positives = 180/213 (84%)
 Frame = -2

Query: 1361 IETMPIEPDSTIWGALLCGCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAEKW 1182
            IETMPI+PD+TIWGALLCGCRIH DV+LAEKVAE VFELEP+N GYYVLLANIYAEAEKW
Sbjct: 661  IETMPIKPDATIWGALLCGCRIHHDVELAEKVAEHVFELEPDNAGYYVLLANIYAEAEKW 720

Query: 1181 EEVKKLRETIGRRGLRKKPGCSWIEMKNQVHVFVAGDRSHPQFKTIELLLKRVRMRMKEE 1002
            EEVKKLRE IG+RGL+K PGCSWIE++ +   FV+ D +HPQ K+I  LL  +R++MK E
Sbjct: 721  EEVKKLRERIGKRGLKKSPGCSWIEVQGKFTTFVSADTAHPQAKSIFSLLNNLRIKMKNE 780

Query: 1001 GSLPKKRYALINGGDVEKEEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHEMA 822
            G  PK RYALIN GD+EKE  LCGHSEKLAMA+GI +LP G+T+RV KNLRVC DCHEMA
Sbjct: 781  GHSPKMRYALINAGDMEKEVALCGHSEKLAMAFGILNLPSGRTIRVAKNLRVCDDCHEMA 840

Query: 821  KFMSKMFGKEIVLRDSNRFHHFMDGRCSCRGYW 723
            KFMSK   +EI+LRDSNRFHHF DG CSCR +W
Sbjct: 841  KFMSKTTRREIILRDSNRFHHFKDGFCSCRDFW 873

