BLASTX nr result

ID: Akebia27_contig00029081 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00029081
         (1240 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002265722.2| PREDICTED: pentatricopeptide repeat-containi...   475   e-131
emb|CBI30711.3| unnamed protein product [Vitis vinifera]              475   e-131
ref|XP_002305733.2| hypothetical protein POPTR_0004s05810g, part...   464   e-128
ref|XP_007214267.1| hypothetical protein PRUPE_ppa025121mg [Prun...   464   e-128
ref|XP_004150218.1| PREDICTED: pentatricopeptide repeat-containi...   464   e-128
ref|XP_007025334.1| Pentatricopeptide, putative [Theobroma cacao...   460   e-127
ref|XP_006467621.1| PREDICTED: pentatricopeptide repeat-containi...   457   e-126
ref|XP_006449535.1| hypothetical protein CICLE_v10014413mg [Citr...   456   e-126
ref|XP_006414062.1| hypothetical protein EUTSA_v10024377mg [Eutr...   456   e-126
gb|EXB84044.1| hypothetical protein L484_005808 [Morus notabilis]     454   e-125
gb|ACD56635.1| putative pentatricopeptide repeat protein [Gossyp...   452   e-124
gb|ACD56648.1| putative pentatricopeptide repeat protein [Gossyp...   450   e-124
gb|AHB18405.1| pentatricopeptide repeat-containing protein [Goss...   447   e-123
gb|AAT64030.1| putative pentatricopeptide repeat protein [Gossyp...   447   e-123
gb|ACD56662.1| putative pentatricopeptide [Gossypium arboreum]        446   e-122
ref|XP_004294643.1| PREDICTED: pentatricopeptide repeat-containi...   445   e-122
gb|AAT64016.1| putative pentatricopeptide repeat protein [Gossyp...   444   e-122
ref|XP_002870024.1| pentatricopeptide repeat-containing protein ...   443   e-122
dbj|BAD94843.1| putative protein [Arabidopsis thaliana]               438   e-120
ref|NP_193610.1| pentatricopeptide repeat protein DOT4 [Arabidop...   438   e-120

>ref|XP_002265722.2| PREDICTED: pentatricopeptide repeat-containing protein At4g18750,
            chloroplastic-like [Vitis vinifera]
          Length = 824

 Score =  475 bits (1222), Expect = e-131
 Identities = 221/310 (71%), Positives = 254/310 (81%)
 Frame = -1

Query: 1222 DKDVASWNIMLTGYAQQGHGALAIDLFGRMIDARIKPDEITFIAVLCACSRSGMVSEGLE 1043
            +KD+ SW +M+ GY   G+G+ AI  F  M ++ I+PDE++FI++L ACS SG++ EG  
Sbjct: 515  EKDLVSWTVMIAGYGMHGYGSEAIAAFNEMRNSGIEPDEVSFISILYACSHSGLLDEGWG 574

Query: 1042 YFNSMEQHYSVTPNLKHYACVVDLLARGGRLTKAYKFIDTMPIEPDSTIWGALLCGCRIH 863
            +FN M  +  + P  +HYAC+VDLLAR G L+KAYKFI  MPIEPD+TIWGALLCGCRI+
Sbjct: 575  FFNMMRNNCCIEPKSEHYACIVDLLARAGNLSKAYKFIKMMPIEPDATIWGALLCGCRIY 634

Query: 862  RDVKLAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKKLRETIGRRGLRKKPGCSW 683
             DVKLAEKVAE VFELEPENTGYYVLLANIYAEAEKWEEVKKLRE IGRRGLRK PGCSW
Sbjct: 635  HDVKLAEKVAEHVFELEPENTGYYVLLANIYAEAEKWEEVKKLRERIGRRGLRKNPGCSW 694

Query: 682  IEMKNQVHVFVAGDRSHPQFKKIELLLKRVRMRMKEEGSLPKKRYALINGGDVEKEEVLC 503
            IE+K +VH+FV GD SHP   KIELLLK+ R RMKEEG  PK RYALI   D EKE  LC
Sbjct: 695  IEIKGKVHIFVTGDSSHPLANKIELLLKKTRTRMKEEGHFPKMRYALIKADDTEKEMALC 754

Query: 502  GHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHEMAKFMSKMFGKEIVLRDSNRFHHFM 323
            GHSEK+AMA+GI SLPPGKTVRVTKNLRVCGDCHEMAKFMSKM  ++I+LRDSNRFHHF 
Sbjct: 755  GHSEKIAMAFGILSLPPGKTVRVTKNLRVCGDCHEMAKFMSKMVKRDIILRDSNRFHHFK 814

Query: 322  DGRCSCRGYW 293
            DG CSCRG+W
Sbjct: 815  DGSCSCRGHW 824



 Score = 60.8 bits (146), Expect = 1e-06
 Identities = 35/116 (30%), Positives = 57/116 (49%)
 Frame = -1

Query: 1222 DKDVASWNIMLTGYAQQGHGALAIDLFGRMIDARIKPDEITFIAVLCACSRSGMVSEGLE 1043
            D+DV SWN M++GY   G     +DLF +M+   I  D  T ++V+  CS +GM+  G  
Sbjct: 213  DRDVISWNSMISGYVSNGLSEKGLDLFEQMLLLGINTDLATMVSVVAGCSNTGMLLLG-R 271

Query: 1042 YFNSMEQHYSVTPNLKHYACVVDLLARGGRLTKAYKFIDTMPIEPDSTIWGALLCG 875
              +      S    L    C++D+ ++ G L  A +  +TM  E     W +++ G
Sbjct: 272  ALHGYAIKASFGKELTLNNCLLDMYSKSGNLNSAIQVFETMG-ERSVVSWTSMIAG 326


>emb|CBI30711.3| unnamed protein product [Vitis vinifera]
          Length = 697

 Score =  475 bits (1222), Expect = e-131
 Identities = 221/310 (71%), Positives = 254/310 (81%)
 Frame = -1

Query: 1222 DKDVASWNIMLTGYAQQGHGALAIDLFGRMIDARIKPDEITFIAVLCACSRSGMVSEGLE 1043
            +KD+ SW +M+ GY   G+G+ AI  F  M ++ I+PDE++FI++L ACS SG++ EG  
Sbjct: 388  EKDLVSWTVMIAGYGMHGYGSEAIAAFNEMRNSGIEPDEVSFISILYACSHSGLLDEGWG 447

Query: 1042 YFNSMEQHYSVTPNLKHYACVVDLLARGGRLTKAYKFIDTMPIEPDSTIWGALLCGCRIH 863
            +FN M  +  + P  +HYAC+VDLLAR G L+KAYKFI  MPIEPD+TIWGALLCGCRI+
Sbjct: 448  FFNMMRNNCCIEPKSEHYACIVDLLARAGNLSKAYKFIKMMPIEPDATIWGALLCGCRIY 507

Query: 862  RDVKLAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKKLRETIGRRGLRKKPGCSW 683
             DVKLAEKVAE VFELEPENTGYYVLLANIYAEAEKWEEVKKLRE IGRRGLRK PGCSW
Sbjct: 508  HDVKLAEKVAEHVFELEPENTGYYVLLANIYAEAEKWEEVKKLRERIGRRGLRKNPGCSW 567

Query: 682  IEMKNQVHVFVAGDRSHPQFKKIELLLKRVRMRMKEEGSLPKKRYALINGGDVEKEEVLC 503
            IE+K +VH+FV GD SHP   KIELLLK+ R RMKEEG  PK RYALI   D EKE  LC
Sbjct: 568  IEIKGKVHIFVTGDSSHPLANKIELLLKKTRTRMKEEGHFPKMRYALIKADDTEKEMALC 627

Query: 502  GHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHEMAKFMSKMFGKEIVLRDSNRFHHFM 323
            GHSEK+AMA+GI SLPPGKTVRVTKNLRVCGDCHEMAKFMSKM  ++I+LRDSNRFHHF 
Sbjct: 628  GHSEKIAMAFGILSLPPGKTVRVTKNLRVCGDCHEMAKFMSKMVKRDIILRDSNRFHHFK 687

Query: 322  DGRCSCRGYW 293
            DG CSCRG+W
Sbjct: 688  DGSCSCRGHW 697


>ref|XP_002305733.2| hypothetical protein POPTR_0004s05810g, partial [Populus trichocarpa]
            gi|550340410|gb|EEE86244.2| hypothetical protein
            POPTR_0004s05810g, partial [Populus trichocarpa]
          Length = 778

 Score =  464 bits (1193), Expect = e-128
 Identities = 214/309 (69%), Positives = 254/309 (82%)
 Frame = -1

Query: 1219 KDVASWNIMLTGYAQQGHGALAIDLFGRMIDARIKPDEITFIAVLCACSRSGMVSEGLEY 1040
            KD+ +W +M+ GY   G G  AI  F  M  A I+PDE++FI++L ACS SG++ EG  +
Sbjct: 470  KDLITWTVMIAGYGMHGFGNNAITTFNEMRQAGIEPDEVSFISILYACSHSGLLDEGWRF 529

Query: 1039 FNSMEQHYSVTPNLKHYACVVDLLARGGRLTKAYKFIDTMPIEPDSTIWGALLCGCRIHR 860
            FN M+   +V P L+HYAC+VDLLAR G+L  AYKFI +MPIEPD+TIWGALL GCRIH 
Sbjct: 530  FNVMQDECNVKPKLEHYACIVDLLARSGKLAMAYKFIKSMPIEPDATIWGALLSGCRIHH 589

Query: 859  DVKLAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKKLRETIGRRGLRKKPGCSWI 680
            DVKLAEKVAE VFELEPENTGYYVLLAN YAEAEKWEEVKKLR+ IGRRGL+K PGCSWI
Sbjct: 590  DVKLAEKVAEHVFELEPENTGYYVLLANTYAEAEKWEEVKKLRQKIGRRGLKKNPGCSWI 649

Query: 679  EMKNQVHVFVAGDRSHPQFKKIELLLKRVRMRMKEEGSLPKKRYALINGGDVEKEEVLCG 500
            E+K++VH+F+AG+ SHPQ KKIE+LLKR+R +MKEEG  PK RYALIN   ++KE  LCG
Sbjct: 650  EVKSKVHIFLAGNSSHPQAKKIEVLLKRLRSKMKEEGYFPKTRYALINADSLQKETALCG 709

Query: 499  HSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHEMAKFMSKMFGKEIVLRDSNRFHHFMD 320
            HSEKLAMA+GI +LPP +T+RV+KNLRVCGDCHEMAKF+SK  G+EIVLRDSNRFHHF D
Sbjct: 710  HSEKLAMAFGILNLPPARTIRVSKNLRVCGDCHEMAKFISKTLGREIVLRDSNRFHHFKD 769

Query: 319  GRCSCRGYW 293
            G C CRG+W
Sbjct: 770  GVCCCRGFW 778


>ref|XP_007214267.1| hypothetical protein PRUPE_ppa025121mg [Prunus persica]
            gi|462410132|gb|EMJ15466.1| hypothetical protein
            PRUPE_ppa025121mg [Prunus persica]
          Length = 796

 Score =  464 bits (1193), Expect = e-128
 Identities = 217/309 (70%), Positives = 255/309 (82%)
 Frame = -1

Query: 1219 KDVASWNIMLTGYAQQGHGALAIDLFGRMIDARIKPDEITFIAVLCACSRSGMVSEGLEY 1040
            KD+ SW +++ GY   G G+ AI  F  M  + IKPD I+FI++L ACS SG++ E   +
Sbjct: 488  KDLISWTVIVAGYGMHGFGSEAITAFNEMRKSGIKPDSISFISILYACSHSGLLDEAWRF 547

Query: 1039 FNSMEQHYSVTPNLKHYACVVDLLARGGRLTKAYKFIDTMPIEPDSTIWGALLCGCRIHR 860
            F+SM   YS+ P L+HYAC+VDLLAR G LTKAYKFI+ MPIEPD+TIWG+LLCGCRIH 
Sbjct: 548  FDSMRNDYSIVPKLEHYACMVDLLARTGNLTKAYKFINKMPIEPDATIWGSLLCGCRIHH 607

Query: 859  DVKLAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKKLRETIGRRGLRKKPGCSWI 680
            DVKLAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKKLRE IGR+GL+K PGCSWI
Sbjct: 608  DVKLAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKKLRERIGRQGLKKNPGCSWI 667

Query: 679  EMKNQVHVFVAGDRSHPQFKKIELLLKRVRMRMKEEGSLPKKRYALINGGDVEKEEVLCG 500
            E+K +V +FVAG+ SHPQ  KIE LLKR+R++MKEEG  PK +YALIN  ++EKE  LCG
Sbjct: 668  EIKGKVQIFVAGNSSHPQATKIESLLKRLRLKMKEEGYSPKMQYALINADEMEKEVALCG 727

Query: 499  HSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHEMAKFMSKMFGKEIVLRDSNRFHHFMD 320
            HSEKLA+A+GI +LPPGKT+RVTKNLRVC DCHEMAKF+SK   +EIVLRDSNRFHH  D
Sbjct: 728  HSEKLAIAFGILNLPPGKTIRVTKNLRVCSDCHEMAKFISKTSRREIVLRDSNRFHHMKD 787

Query: 319  GRCSCRGYW 293
            G CSCRG+W
Sbjct: 788  GICSCRGFW 796



 Score = 58.2 bits (139), Expect = 7e-06
 Identities = 34/135 (25%), Positives = 68/135 (50%), Gaps = 4/135 (2%)
 Frame = -1

Query: 1219 KDVASWNIMLTGYAQQGHGALAIDLFGRMIDARIKPDEITFIAVLCACSRSGMVSEGLEY 1040
            + V SW  M+ GY ++G    AI+LF  M    + PD  T  ++L AC+ +G + +G + 
Sbjct: 287  RSVVSWTSMIAGYVREGLSDEAIELFSEMERNDVSPDVYTITSILHACACNGSLKKGRDI 346

Query: 1039 FNSMEQHYSVTPNLKHYACVVDLLARGGRLTKAYKFIDTMPIEPDSTIWGALLCG----C 872
               + +H  +  +L     ++D+ A+ G +  A+    +MP++ D   W  ++ G    C
Sbjct: 347  HKYIREH-GMDSSLFVCNTLMDMYAKCGSMEDAHSVFSSMPVK-DIVSWNTMIGGYSKNC 404

Query: 871  RIHRDVKLAEKVAER 827
              +  +KL  ++ ++
Sbjct: 405  LPNEALKLFSEMQQK 419


>ref|XP_004150218.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18750,
            chloroplastic-like [Cucumis sativus]
            gi|449500809|ref|XP_004161200.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g18750,
            chloroplastic-like [Cucumis sativus]
          Length = 926

 Score =  464 bits (1193), Expect = e-128
 Identities = 214/310 (69%), Positives = 259/310 (83%)
 Frame = -1

Query: 1222 DKDVASWNIMLTGYAQQGHGALAIDLFGRMIDARIKPDEITFIAVLCACSRSGMVSEGLE 1043
            +KD+ SW +M+ GY   G+G+ AI+ F +M    I+PDE++FI++L ACS SG++ EG +
Sbjct: 617  NKDLVSWTVMIAGYGMHGYGSEAINTFNQMRMTGIEPDEVSFISILYACSHSGLLDEGWK 676

Query: 1042 YFNSMEQHYSVTPNLKHYACVVDLLARGGRLTKAYKFIDTMPIEPDSTIWGALLCGCRIH 863
             FN M++   + PNL+HYAC+VDLLAR G L KA+KFI  MPI+PD+TIWGALLCGCRIH
Sbjct: 677  IFNIMKKECQIEPNLEHYACMVDLLARTGNLVKAHKFIKAMPIKPDATIWGALLCGCRIH 736

Query: 862  RDVKLAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKKLRETIGRRGLRKKPGCSW 683
             DVKLAEKVAER+FELEPENTGYYVLLANIYAEAEKWEEV+KLR+ IG+RGL+K PGCSW
Sbjct: 737  HDVKLAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSW 796

Query: 682  IEMKNQVHVFVAGDRSHPQFKKIELLLKRVRMRMKEEGSLPKKRYALINGGDVEKEEVLC 503
            IE+K ++++FVAGD S PQ KKIELLLKR+R +MKEEG  PK  YAL+N  + EKE  LC
Sbjct: 797  IEIKGKINIFVAGDCSKPQAKKIELLLKRLRSKMKEEGYSPKTAYALLNADEREKEVALC 856

Query: 502  GHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHEMAKFMSKMFGKEIVLRDSNRFHHFM 323
            GHSEKLAMA+G+ +LPPGKT+RVTKNLRVCGDCHEMAKFMSK   +EI+LRDS+RFHHF 
Sbjct: 857  GHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKSASREIILRDSSRFHHFK 916

Query: 322  DGRCSCRGYW 293
            DG CSCRGYW
Sbjct: 917  DGSCSCRGYW 926


>ref|XP_007025334.1| Pentatricopeptide, putative [Theobroma cacao]
            gi|508780700|gb|EOY27956.1| Pentatricopeptide, putative
            [Theobroma cacao]
          Length = 874

 Score =  460 bits (1184), Expect = e-127
 Identities = 213/315 (67%), Positives = 256/315 (81%)
 Frame = -1

Query: 1237 FNAHIDKDVASWNIMLTGYAQQGHGALAIDLFGRMIDARIKPDEITFIAVLCACSRSGMV 1058
            F+    KD+ SW +M+ GY   G    AI  F  M DA I+PDE++FI++L ACS SG++
Sbjct: 560  FDMISSKDLVSWTVMIAGYGMHGFANEAITTFNEMRDAGIEPDEVSFISILYACSHSGLL 619

Query: 1057 SEGLEYFNSMEQHYSVTPNLKHYACVVDLLARGGRLTKAYKFIDTMPIEPDSTIWGALLC 878
             EG  +F  M   Y++ P L+HYAC+VDLL+R G L+KA+ FI+ MPI PD+TIWGA+LC
Sbjct: 620  EEGWRFFYIMRNDYNIEPKLEHYACMVDLLSRTGNLSKAFHFIERMPIAPDATIWGAVLC 679

Query: 877  GCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKKLRETIGRRGLRKK 698
            GCRI+ DVKLAE+VAERVFELEPENTGYYVLLANIYAEAEKWEEVK++RE IGR+GLRK 
Sbjct: 680  GCRIYHDVKLAERVAERVFELEPENTGYYVLLANIYAEAEKWEEVKRVRERIGRKGLRKN 739

Query: 697  PGCSWIEMKNQVHVFVAGDRSHPQFKKIELLLKRVRMRMKEEGSLPKKRYALINGGDVEK 518
            PGCSWIE+K +V++FVAGD SHPQ KKIE LLK++R +MK EG  PK +YALIN  D++K
Sbjct: 740  PGCSWIEIKGKVNLFVAGDSSHPQSKKIESLLKKLRRKMKGEGYFPKTKYALINADDMQK 799

Query: 517  EEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHEMAKFMSKMFGKEIVLRDSNR 338
            E  LCGHSEKLAMA+G+ SLPP KT+RVTKNLR+CGDCHEMAKFMSK  G+EIVLRDSNR
Sbjct: 800  EMALCGHSEKLAMAFGLLSLPPSKTIRVTKNLRICGDCHEMAKFMSKETGREIVLRDSNR 859

Query: 337  FHHFMDGRCSCRGYW 293
            FHHF DG CSCRG+W
Sbjct: 860  FHHFKDGYCSCRGFW 874


>ref|XP_006467621.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18750,
            chloroplastic-like [Citrus sinensis]
          Length = 872

 Score =  457 bits (1175), Expect = e-126
 Identities = 213/309 (68%), Positives = 251/309 (81%)
 Frame = -1

Query: 1219 KDVASWNIMLTGYAQQGHGALAIDLFGRMIDARIKPDEITFIAVLCACSRSGMVSEGLEY 1040
            KD+ SW IM+ GY   G G  AI  F  M  A I+PDE++FI+VL ACS SG+V EG  +
Sbjct: 564  KDLISWTIMIAGYGMHGFGCDAIATFNDMRQAGIEPDEVSFISVLYACSHSGLVDEGWRF 623

Query: 1039 FNSMEQHYSVTPNLKHYACVVDLLARGGRLTKAYKFIDTMPIEPDSTIWGALLCGCRIHR 860
            FN M    ++ P L+HYAC+VDLL+R G L++AY+FI+ MP+ PD+TIWG+LLCGCRIH 
Sbjct: 624  FNMMRYECNIEPKLEHYACMVDLLSRTGNLSEAYRFIEMMPVAPDATIWGSLLCGCRIHH 683

Query: 859  DVKLAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKKLRETIGRRGLRKKPGCSWI 680
            +VKLAEKVAE VFELEP+NTGYYVLLAN+YAEAEKWEEVKKLRE I RRGL+K PGCSWI
Sbjct: 684  EVKLAEKVAEHVFELEPDNTGYYVLLANVYAEAEKWEEVKKLREKISRRGLKKNPGCSWI 743

Query: 679  EMKNQVHVFVAGDRSHPQFKKIELLLKRVRMRMKEEGSLPKKRYALINGGDVEKEEVLCG 500
            E+K +V++FVAG  SHP  KKIE LLKR+R+ MK EG  PK RYALIN  ++EKE  LCG
Sbjct: 744  EIKGKVNIFVAGGSSHPHAKKIESLLKRLRLEMKREGYFPKTRYALINADEMEKEVALCG 803

Query: 499  HSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHEMAKFMSKMFGKEIVLRDSNRFHHFMD 320
            HSEKLAMA+GI +LP G+T+RVTKNLRVCGDCHEMAKFMSK   +EIVLRDSNRFHHF D
Sbjct: 804  HSEKLAMAFGILNLPAGQTIRVTKNLRVCGDCHEMAKFMSKTARREIVLRDSNRFHHFKD 863

Query: 319  GRCSCRGYW 293
            GRCSCRG+W
Sbjct: 864  GRCSCRGFW 872


>ref|XP_006449535.1| hypothetical protein CICLE_v10014413mg [Citrus clementina]
            gi|557552146|gb|ESR62775.1| hypothetical protein
            CICLE_v10014413mg [Citrus clementina]
          Length = 725

 Score =  456 bits (1174), Expect = e-126
 Identities = 213/309 (68%), Positives = 251/309 (81%)
 Frame = -1

Query: 1219 KDVASWNIMLTGYAQQGHGALAIDLFGRMIDARIKPDEITFIAVLCACSRSGMVSEGLEY 1040
            KD+ SW IM+ GY   G G  AI  F  M  A I+PDE++FI+VL ACS SG+V EG  +
Sbjct: 417  KDLISWTIMIAGYGMHGFGCDAIATFNDMRQAGIEPDEVSFISVLYACSHSGLVDEGWRF 476

Query: 1039 FNSMEQHYSVTPNLKHYACVVDLLARGGRLTKAYKFIDTMPIEPDSTIWGALLCGCRIHR 860
            FN M    ++ P L+HYAC+VDLL+R G L++AY+FI+ MP+ PD+TIWG+LLCGCRIH 
Sbjct: 477  FNMMRYECNIEPKLEHYACMVDLLSRTGNLSEAYRFIEMMPVAPDATIWGSLLCGCRIHH 536

Query: 859  DVKLAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKKLRETIGRRGLRKKPGCSWI 680
            +V+LAEKVAE VFELEP+NTGYYVLLAN+YAEAEKWEEVKKLRE I RRGL+K PGCSWI
Sbjct: 537  EVQLAEKVAEHVFELEPDNTGYYVLLANVYAEAEKWEEVKKLREKISRRGLKKNPGCSWI 596

Query: 679  EMKNQVHVFVAGDRSHPQFKKIELLLKRVRMRMKEEGSLPKKRYALINGGDVEKEEVLCG 500
            E+K +V++FVAG  SHP  KKIE LLKR+R+ MK EG  PK RYALIN  ++EKE  LCG
Sbjct: 597  EIKGKVNIFVAGGSSHPHAKKIESLLKRLRLEMKREGYFPKTRYALINADEMEKEVALCG 656

Query: 499  HSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHEMAKFMSKMFGKEIVLRDSNRFHHFMD 320
            HSEKLAMA+GI SLP G+T+RVTKNLRVCGDCHEMAKFMSK   +EIVLRDSNRFHHF D
Sbjct: 657  HSEKLAMAFGILSLPAGQTIRVTKNLRVCGDCHEMAKFMSKTARREIVLRDSNRFHHFKD 716

Query: 319  GRCSCRGYW 293
            GRCSCRG+W
Sbjct: 717  GRCSCRGFW 725


>ref|XP_006414062.1| hypothetical protein EUTSA_v10024377mg [Eutrema salsugineum]
            gi|557115232|gb|ESQ55515.1| hypothetical protein
            EUTSA_v10024377mg [Eutrema salsugineum]
          Length = 872

 Score =  456 bits (1174), Expect = e-126
 Identities = 213/315 (67%), Positives = 255/315 (80%)
 Frame = -1

Query: 1237 FNAHIDKDVASWNIMLTGYAQQGHGALAIDLFGRMIDARIKPDEITFIAVLCACSRSGMV 1058
            F+    KD+ SW +M+ GY   G GA +I LF RM +A I+PDEI+F++VL ACS SG+V
Sbjct: 558  FDEVASKDLVSWTVMIAGYGMHGIGAESIALFNRMREAGIEPDEISFVSVLYACSHSGLV 617

Query: 1057 SEGLEYFNSMEQHYSVTPNLKHYACVVDLLARGGRLTKAYKFIDTMPIEPDSTIWGALLC 878
             EG  +FN M     + P L+HYAC+VD+LAR G L+KAY+FI+ MPI PD+TIWGALLC
Sbjct: 618  DEGWRFFNIMRHECKIEPTLEHYACIVDMLARTGNLSKAYRFIENMPIPPDATIWGALLC 677

Query: 877  GCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKKLRETIGRRGLRKK 698
            GCRIH DVKLAEKVAE+VF LEP+NTGYYVL+ANIYAEAEKWEEVK+LR+ IGRRGLRK 
Sbjct: 678  GCRIHHDVKLAEKVAEKVFALEPDNTGYYVLMANIYAEAEKWEEVKRLRKRIGRRGLRKN 737

Query: 697  PGCSWIEMKNQVHVFVAGDRSHPQFKKIELLLKRVRMRMKEEGSLPKKRYALINGGDVEK 518
            PGCSWIE+K +V++FVAGD SHP+ +KIE  L+RVR RM+EEG  P+ +YALI+  ++EK
Sbjct: 738  PGCSWIEIKGKVNIFVAGDSSHPETEKIEAFLRRVRARMREEGYSPQTKYALIDAEEMEK 797

Query: 517  EEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHEMAKFMSKMFGKEIVLRDSNR 338
            EE LCGHSEKLAMA GI S   GK +RVTKNLRVCGDCHEMAK MSK+  +EIVLRDSNR
Sbjct: 798  EEALCGHSEKLAMALGILSSGHGKIIRVTKNLRVCGDCHEMAKLMSKLTRREIVLRDSNR 857

Query: 337  FHHFMDGRCSCRGYW 293
            FHHF DG CSCRG+W
Sbjct: 858  FHHFKDGHCSCRGFW 872


>gb|EXB84044.1| hypothetical protein L484_005808 [Morus notabilis]
          Length = 877

 Score =  454 bits (1167), Expect = e-125
 Identities = 214/309 (69%), Positives = 251/309 (81%)
 Frame = -1

Query: 1219 KDVASWNIMLTGYAQQGHGALAIDLFGRMIDARIKPDEITFIAVLCACSRSGMVSEGLEY 1040
            KD+ SW +M+ GY   G G  AI  F  M  A I+PDE++FI++L ACS SG+  EG  +
Sbjct: 570  KDLISWTVMIAGYGMHGFGREAIAAFDEMRHAGIEPDEVSFISILYACSHSGL-DEGWSF 628

Query: 1039 FNSMEQHYSVTPNLKHYACVVDLLARGGRLTKAYKFIDTMPIEPDSTIWGALLCGCRIHR 860
            FN M   YS+ P L+HYAC+VDLL+R G L+KAY+FI  MPIEPD+TIWGALLCGCR + 
Sbjct: 629  FNVMRNEYSIEPMLEHYACMVDLLSRTGNLSKAYRFIRKMPIEPDATIWGALLCGCRTYH 688

Query: 859  DVKLAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKKLRETIGRRGLRKKPGCSWI 680
            DVKLAE+VAE VFELEP+NTGYYVLLANIYAEAEKWEEV+KLRE IGRRGL+K PGCSWI
Sbjct: 689  DVKLAERVAEHVFELEPDNTGYYVLLANIYAEAEKWEEVRKLREKIGRRGLKKNPGCSWI 748

Query: 679  EMKNQVHVFVAGDRSHPQFKKIELLLKRVRMRMKEEGSLPKKRYALINGGDVEKEEVLCG 500
            E+K +V++FVAGD S P  KKIE LLKR+R +MKEEG  P  +YALIN  ++EKE  LCG
Sbjct: 749  EIKGKVNIFVAGDDSQPLAKKIESLLKRLRAKMKEEGFYPNMKYALINADEMEKEVALCG 808

Query: 499  HSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHEMAKFMSKMFGKEIVLRDSNRFHHFMD 320
            HSEKLAMA+G+ SLPPGKT+RVTKNLRVCGDCHE AKF+SKM  +EIVLRDSNRFHHF D
Sbjct: 809  HSEKLAMAFGMLSLPPGKTIRVTKNLRVCGDCHETAKFISKMSSREIVLRDSNRFHHFKD 868

Query: 319  GRCSCRGYW 293
            G CSCRG+W
Sbjct: 869  GHCSCRGFW 877


>gb|ACD56635.1| putative pentatricopeptide repeat protein [Gossypium raimondii]
          Length = 667

 Score =  452 bits (1164), Expect = e-124
 Identities = 208/310 (67%), Positives = 258/310 (83%), Gaps = 1/310 (0%)
 Frame = -1

Query: 1219 KDVASWNIMLTGYAQQGHGALAIDLFGRMIDARIKPDEITFIAVLCACSRSGMVSEGLEY 1040
            KD+ SW +M+ GY   G+G  AI  F  M DA I+PDE++FI++L ACS SG++ +G  +
Sbjct: 358  KDLVSWTVMIAGYGMHGYGNEAIATFNEMRDAGIEPDEVSFISILYACSHSGLLEQGWRF 417

Query: 1039 FNSMEQHYSVTPNLKHYACVVDLLARGGRLTKAYKFIDTMPIEPDSTIWGALLCGCRIHR 860
            F  M+  +++ P L+HYAC+VDLL+R G L+KAYKFI+T+PI PD+TIWGALLCGCRI+ 
Sbjct: 418  FYIMKNDFNIEPKLEHYACMVDLLSRTGNLSKAYKFIETLPIAPDATIWGALLCGCRIYH 477

Query: 859  DVKLAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKKLRETIGRRGLRKKPGCSWI 680
            D++LAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVK++RE IG++GLRK PGCSWI
Sbjct: 478  DIELAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKRMREKIGKKGLRKNPGCSWI 537

Query: 679  EMKNQVHVFVAGDR-SHPQFKKIELLLKRVRMRMKEEGSLPKKRYALINGGDVEKEEVLC 503
            E+K +V++FV+G+  SHP  KKIE LLK++R +MKEEG  PK +YALIN  +++KE  LC
Sbjct: 538  EIKGRVNLFVSGNNSSHPHSKKIESLLKKMRRKMKEEGYFPKTKYALINADEMQKEMALC 597

Query: 502  GHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHEMAKFMSKMFGKEIVLRDSNRFHHFM 323
            GHSEKLAMA+G+ +LPP KT+RVTKNLRVCGDCHEMAKFMSK   +EIVLRDSNRFHHF 
Sbjct: 598  GHSEKLAMAFGLLTLPPRKTIRVTKNLRVCGDCHEMAKFMSKETRREIVLRDSNRFHHFK 657

Query: 322  DGRCSCRGYW 293
            DG CSCRG+W
Sbjct: 658  DGYCSCRGFW 667


>gb|ACD56648.1| putative pentatricopeptide repeat protein [Gossypioides kirkii]
          Length = 805

 Score =  450 bits (1157), Expect = e-124
 Identities = 207/310 (66%), Positives = 259/310 (83%), Gaps = 1/310 (0%)
 Frame = -1

Query: 1219 KDVASWNIMLTGYAQQGHGALAIDLFGRMIDARIKPDEITFIAVLCACSRSGMVSEGLEY 1040
            KD+ SW +M++GY   G+G  AI  F  M DA I+PDE++FI++L ACS SG++ +G  +
Sbjct: 496  KDLVSWTVMISGYGMHGYGNEAIATFNEMRDAGIEPDEVSFISILYACSHSGLLEQGWRF 555

Query: 1039 FNSMEQHYSVTPNLKHYACVVDLLARGGRLTKAYKFIDTMPIEPDSTIWGALLCGCRIHR 860
            F  M+  +++ P L+HYAC+VDLL+R G L+KAY+FI+T+PI PD+TIWGALLCGCRI+ 
Sbjct: 556  FYIMKNDFNIEPKLEHYACMVDLLSRTGNLSKAYEFIETLPIAPDATIWGALLCGCRIYH 615

Query: 859  DVKLAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKKLRETIGRRGLRKKPGCSWI 680
            D++LAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVK++RE IG++GLRK PGCSWI
Sbjct: 616  DIELAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKRMREKIGKKGLRKNPGCSWI 675

Query: 679  EMKNQVHVFVAGDR-SHPQFKKIELLLKRVRMRMKEEGSLPKKRYALINGGDVEKEEVLC 503
            E+K +V++FV+G+  SHP  KKIE LLK++R +MKEEG  PK +YALIN  +++KE  LC
Sbjct: 676  EIKGKVNLFVSGNNSSHPHSKKIESLLKKMRRKMKEEGYFPKTKYALINADEMQKEMALC 735

Query: 502  GHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHEMAKFMSKMFGKEIVLRDSNRFHHFM 323
            GHSEKLAMA+G+ +LPP KTVRVTKNLRVCGDCHEMAKFMSK   +EIVLRDSNRFHHF 
Sbjct: 736  GHSEKLAMAFGLLALPPRKTVRVTKNLRVCGDCHEMAKFMSKETRREIVLRDSNRFHHFK 795

Query: 322  DGRCSCRGYW 293
            +G CSCRG+W
Sbjct: 796  NGYCSCRGFW 805


>gb|AHB18405.1| pentatricopeptide repeat-containing protein [Gossypium hirsutum]
          Length = 875

 Score =  447 bits (1150), Expect = e-123
 Identities = 207/310 (66%), Positives = 257/310 (82%), Gaps = 1/310 (0%)
 Frame = -1

Query: 1219 KDVASWNIMLTGYAQQGHGALAIDLFGRMIDARIKPDEITFIAVLCACSRSGMVSEGLEY 1040
            KD+ SW +M+ GY   G+G  AI  F  M DA I+PDE++FI++L ACS SG++ +G  +
Sbjct: 566  KDLVSWTVMIAGYGMHGYGNEAIATFNEMRDAGIEPDEVSFISILYACSHSGLLEQGWRF 625

Query: 1039 FNSMEQHYSVTPNLKHYACVVDLLARGGRLTKAYKFIDTMPIEPDSTIWGALLCGCRIHR 860
            F  M+  +++ P L+HYAC+VDLL+R G L+KAYKFI+T+PI PD+TIWGALLCGCRI+ 
Sbjct: 626  FYIMKNDFNIEPKLEHYACMVDLLSRTGNLSKAYKFIETLPIAPDATIWGALLCGCRIYH 685

Query: 859  DVKLAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKKLRETIGRRGLRKKPGCSWI 680
            D++LAEKVAERVFELEPENTGYYVLLANIYAEAEK EEVK++RE IG++GLRK PGCSWI
Sbjct: 686  DIELAEKVAERVFELEPENTGYYVLLANIYAEAEKREEVKRMREKIGKKGLRKNPGCSWI 745

Query: 679  EMKNQVHVFVAGDR-SHPQFKKIELLLKRVRMRMKEEGSLPKKRYALINGGDVEKEEVLC 503
            E+K +V++FV+G+  SHP  KKIE LLK++R +MKEEG  PK +YALIN  +++KE  LC
Sbjct: 746  EIKGRVNLFVSGNNSSHPHSKKIESLLKKMRRKMKEEGYFPKTKYALINADEMQKEMALC 805

Query: 502  GHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHEMAKFMSKMFGKEIVLRDSNRFHHFM 323
            GHSEKLAMA+G+ +LPP KT+RVTKNLRVCGDCHEMAKFMSK   +EIVLRDSNRFHHF 
Sbjct: 806  GHSEKLAMAFGLLTLPPRKTIRVTKNLRVCGDCHEMAKFMSKETRREIVLRDSNRFHHFK 865

Query: 322  DGRCSCRGYW 293
            DG CSCRG+W
Sbjct: 866  DGYCSCRGFW 875


>gb|AAT64030.1| putative pentatricopeptide repeat protein [Gossypium hirsutum]
          Length = 805

 Score =  447 bits (1150), Expect = e-123
 Identities = 207/310 (66%), Positives = 257/310 (82%), Gaps = 1/310 (0%)
 Frame = -1

Query: 1219 KDVASWNIMLTGYAQQGHGALAIDLFGRMIDARIKPDEITFIAVLCACSRSGMVSEGLEY 1040
            KD+ SW +M+ GY   G+G  AI  F  M DA I+PDE++FI++L ACS SG++ +G  +
Sbjct: 496  KDLVSWTVMIAGYGMHGYGNEAIATFNEMRDAGIEPDEVSFISILYACSHSGLLEQGWRF 555

Query: 1039 FNSMEQHYSVTPNLKHYACVVDLLARGGRLTKAYKFIDTMPIEPDSTIWGALLCGCRIHR 860
            F  M+  +++ P L+HYAC+VDLL+R G L+KAYKFI+T+PI PD+TIWGALLCGCRI+ 
Sbjct: 556  FYIMKNDFNIEPKLEHYACMVDLLSRTGNLSKAYKFIETLPIAPDATIWGALLCGCRIYH 615

Query: 859  DVKLAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKKLRETIGRRGLRKKPGCSWI 680
            D++LAEKVAERVFELEPENTGYYVLLANIYAEAEK EEVK++RE IG++GLRK PGCSWI
Sbjct: 616  DIELAEKVAERVFELEPENTGYYVLLANIYAEAEKREEVKRMREKIGKKGLRKNPGCSWI 675

Query: 679  EMKNQVHVFVAGDR-SHPQFKKIELLLKRVRMRMKEEGSLPKKRYALINGGDVEKEEVLC 503
            E+K +V++FV+G+  SHP  KKIE LLK++R +MKEEG  PK +YALIN  +++KE  LC
Sbjct: 676  EIKGRVNLFVSGNNSSHPHSKKIESLLKKMRRKMKEEGYFPKTKYALINADEMQKEMALC 735

Query: 502  GHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHEMAKFMSKMFGKEIVLRDSNRFHHFM 323
            GHSEKLAMA+G+ +LPP KT+RVTKNLRVCGDCHEMAKFMSK   +EIVLRDSNRFHHF 
Sbjct: 736  GHSEKLAMAFGLLTLPPRKTIRVTKNLRVCGDCHEMAKFMSKETRREIVLRDSNRFHHFK 795

Query: 322  DGRCSCRGYW 293
            DG CSCRG+W
Sbjct: 796  DGYCSCRGFW 805


>gb|ACD56662.1| putative pentatricopeptide [Gossypium arboreum]
          Length = 805

 Score =  446 bits (1147), Expect = e-122
 Identities = 205/310 (66%), Positives = 256/310 (82%), Gaps = 1/310 (0%)
 Frame = -1

Query: 1219 KDVASWNIMLTGYAQQGHGALAIDLFGRMIDARIKPDEITFIAVLCACSRSGMVSEGLEY 1040
            KD+ SW +M+ GY   G+G  AI  F  M DA I+PDE++FI++L ACS SG++ +G  +
Sbjct: 496  KDLVSWTVMIAGYGMHGYGNEAIATFNEMRDAGIEPDEVSFISILYACSHSGLLEQGWRF 555

Query: 1039 FNSMEQHYSVTPNLKHYACVVDLLARGGRLTKAYKFIDTMPIEPDSTIWGALLCGCRIHR 860
            F  M+  +++ P L+HYAC+VDLL+R G L+KAY+F++T+PI PD+TIWGALLCGCR + 
Sbjct: 556  FYIMKNDFNIEPKLEHYACMVDLLSRTGNLSKAYEFMETLPIAPDATIWGALLCGCRNYH 615

Query: 859  DVKLAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKKLRETIGRRGLRKKPGCSWI 680
            D++LAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVK+LRE IG++GLRK PGCSWI
Sbjct: 616  DIELAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKRLREKIGKQGLRKNPGCSWI 675

Query: 679  EMKNQVHVFVAGDR-SHPQFKKIELLLKRVRMRMKEEGSLPKKRYALINGGDVEKEEVLC 503
            E+K +V++FV+G+  SHP  K IE LLK++R +MKEEG  PK +YALIN  +++KE  LC
Sbjct: 676  EIKGKVNLFVSGNNSSHPHSKNIESLLKKMRRKMKEEGHFPKTKYALINADEMQKEMALC 735

Query: 502  GHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHEMAKFMSKMFGKEIVLRDSNRFHHFM 323
            GHSEKLAMA+G+ +LPP KT+RVTKNLRVCGDCHEMAKFMSK   +EIVLRDSNRFHHF 
Sbjct: 736  GHSEKLAMAFGLLTLPPRKTIRVTKNLRVCGDCHEMAKFMSKETRREIVLRDSNRFHHFK 795

Query: 322  DGRCSCRGYW 293
            DG CSCRG+W
Sbjct: 796  DGYCSCRGFW 805


>ref|XP_004294643.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18750,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 870

 Score =  445 bits (1144), Expect = e-122
 Identities = 208/309 (67%), Positives = 246/309 (79%)
 Frame = -1

Query: 1219 KDVASWNIMLTGYAQQGHGALAIDLFGRMIDARIKPDEITFIAVLCACSRSGMVSEGLEY 1040
            KD+ S+ +++ GY   G G  AI  F  M  A I+PD I+FI++L ACS SG+V EG  +
Sbjct: 562  KDLISYTVIIAGYGMHGFGKEAIAAFNEMTKAEIEPDSISFISILYACSHSGLVQEGWRF 621

Query: 1039 FNSMEQHYSVTPNLKHYACVVDLLARGGRLTKAYKFIDTMPIEPDSTIWGALLCGCRIHR 860
            F+ M   Y + P L+HYAC+VDLLAR G LTKAYKFI+ MPIEPD+T+WG+LLCGCRIH 
Sbjct: 622  FDIMRNDYKIEPMLEHYACMVDLLARTGNLTKAYKFINMMPIEPDATVWGSLLCGCRIHH 681

Query: 859  DVKLAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKKLRETIGRRGLRKKPGCSWI 680
            DVKLAEKVAE VFELEPENTGYY+LLANIYAEAEKWEEVKKLRE IGRR L+K PGCSWI
Sbjct: 682  DVKLAEKVAEHVFELEPENTGYYILLANIYAEAEKWEEVKKLRERIGRRSLKKNPGCSWI 741

Query: 679  EMKNQVHVFVAGDRSHPQFKKIELLLKRVRMRMKEEGSLPKKRYALINGGDVEKEEVLCG 500
            E+K +V++FVAG  SHP   KIE L+K+ R RMKE+G  PK +YALIN  +VEKE  LC 
Sbjct: 742  EIKGKVNIFVAGGTSHPDAMKIESLVKKFRSRMKEDGYNPKMQYALINADEVEKEVALCA 801

Query: 499  HSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHEMAKFMSKMFGKEIVLRDSNRFHHFMD 320
            HSEKLA+A+GI + PP KT+RVTKNLRVCGDCHEMAKF+S+   +EIVLRDSNRFHH  D
Sbjct: 802  HSEKLAIAFGILNTPPRKTIRVTKNLRVCGDCHEMAKFISRTSRREIVLRDSNRFHHMKD 861

Query: 319  GRCSCRGYW 293
            G CSCRG+W
Sbjct: 862  GNCSCRGFW 870


>gb|AAT64016.1| putative pentatricopeptide repeat protein [Gossypium hirsutum]
          Length = 805

 Score =  444 bits (1141), Expect = e-122
 Identities = 204/310 (65%), Positives = 255/310 (82%), Gaps = 1/310 (0%)
 Frame = -1

Query: 1219 KDVASWNIMLTGYAQQGHGALAIDLFGRMIDARIKPDEITFIAVLCACSRSGMVSEGLEY 1040
            KD+ SW +M+ GY   G+G  AI  F  M DA I+PDE++FI++L ACS SG++ +G  +
Sbjct: 496  KDLVSWTVMIAGYGMHGYGNEAIATFNEMRDAGIEPDEVSFISILYACSHSGLLEQGWRF 555

Query: 1039 FNSMEQHYSVTPNLKHYACVVDLLARGGRLTKAYKFIDTMPIEPDSTIWGALLCGCRIHR 860
            F  M+  +++ P L+HYAC+VDLL+R G L+KAY+FI+T+PI PD+TIWGALLCGCR + 
Sbjct: 556  FYIMKNDFNIEPKLEHYACMVDLLSRTGNLSKAYEFIETLPIAPDATIWGALLCGCRNYH 615

Query: 859  DVKLAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKKLRETIGRRGLRKKPGCSWI 680
            D++LAEKVAERVFELEPEN+GYYVLLANIYAEAEKWEEVK+LRE IG++GLRK PGCSWI
Sbjct: 616  DIELAEKVAERVFELEPENSGYYVLLANIYAEAEKWEEVKRLREKIGKQGLRKNPGCSWI 675

Query: 679  EMKNQVHVFVAGDR-SHPQFKKIELLLKRVRMRMKEEGSLPKKRYALINGGDVEKEEVLC 503
            E+K +V++FV+G+  SHP  K IE LLK++R +MKEEG  PK +YALIN  +++KE  LC
Sbjct: 676  EIKGKVNLFVSGNNSSHPHSKNIESLLKKMRRKMKEEGHFPKTKYALINADEMQKEMALC 735

Query: 502  GHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHEMAKFMSKMFGKEIVLRDSNRFHHFM 323
            GHSEKLAMA+G+ +LPP KT+RVTKNLRVCGDCHEMAKFMSK   +EIVLRD NRFHHF 
Sbjct: 736  GHSEKLAMAFGLLTLPPRKTIRVTKNLRVCGDCHEMAKFMSKETRREIVLRDPNRFHHFK 795

Query: 322  DGRCSCRGYW 293
            DG CSCRG+W
Sbjct: 796  DGYCSCRGFW 805


>ref|XP_002870024.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297315860|gb|EFH46283.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 871

 Score =  443 bits (1139), Expect = e-122
 Identities = 208/315 (66%), Positives = 251/315 (79%)
 Frame = -1

Query: 1237 FNAHIDKDVASWNIMLTGYAQQGHGALAIDLFGRMIDARIKPDEITFIAVLCACSRSGMV 1058
            F+    KD+ SW +M+ GY   G G  AI LF +M  A I+PDEI+F+++L ACS SG+V
Sbjct: 557  FDDITSKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEPDEISFVSLLYACSHSGLV 616

Query: 1057 SEGLEYFNSMEQHYSVTPNLKHYACVVDLLARGGRLTKAYKFIDTMPIEPDSTIWGALLC 878
             EG  +FN M     + P ++HYAC+VD+LAR G L+KAY+FI+ MPI PD+TIWGALLC
Sbjct: 617  DEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGNLSKAYRFIENMPIPPDATIWGALLC 676

Query: 877  GCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKKLRETIGRRGLRKK 698
            GCRIH DVKLAE+VAE+VFELEPENTGYYVL+ANIYAEAEKWEEVK+LR+ IG+RGLRK 
Sbjct: 677  GCRIHHDVKLAERVAEKVFELEPENTGYYVLMANIYAEAEKWEEVKRLRKRIGQRGLRKN 736

Query: 697  PGCSWIEMKNQVHVFVAGDRSHPQFKKIELLLKRVRMRMKEEGSLPKKRYALINGGDVEK 518
            PGCSWIE+K +V++FVAGD S+P+ +KIE  L+ VR RM EEG  P  +YALI+  ++EK
Sbjct: 737  PGCSWIEIKGRVNIFVAGDSSNPETEKIEAFLRGVRARMIEEGYSPLTKYALIDAEEMEK 796

Query: 517  EEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHEMAKFMSKMFGKEIVLRDSNR 338
            EE LCGHSEKLAMA GI S   GK +RVTKNLRVCGDCHEMAKFMSK+  +EIVLRDSNR
Sbjct: 797  EEALCGHSEKLAMALGIISSGHGKIIRVTKNLRVCGDCHEMAKFMSKLTRREIVLRDSNR 856

Query: 337  FHHFMDGRCSCRGYW 293
            FH F DG CSCRG+W
Sbjct: 857  FHQFKDGHCSCRGFW 871


>dbj|BAD94843.1| putative protein [Arabidopsis thaliana]
          Length = 720

 Score =  438 bits (1126), Expect = e-120
 Identities = 206/315 (65%), Positives = 249/315 (79%)
 Frame = -1

Query: 1237 FNAHIDKDVASWNIMLTGYAQQGHGALAIDLFGRMIDARIKPDEITFIAVLCACSRSGMV 1058
            F+    KD+ SW +M+ GY   G G  AI LF +M  A I+ DEI+F+++L ACS SG+V
Sbjct: 406  FDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLV 465

Query: 1057 SEGLEYFNSMEQHYSVTPNLKHYACVVDLLARGGRLTKAYKFIDTMPIEPDSTIWGALLC 878
             EG  +FN M     + P ++HYAC+VD+LAR G L KAY+FI+ MPI PD+TIWGALLC
Sbjct: 466  DEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLC 525

Query: 877  GCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKKLRETIGRRGLRKK 698
            GCRIH DVKLAEKVAE+VFELEPENTGYYVL+ANIYAEAEKWE+VK+LR+ IG+RGLRK 
Sbjct: 526  GCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKN 585

Query: 697  PGCSWIEMKNQVHVFVAGDRSHPQFKKIELLLKRVRMRMKEEGSLPKKRYALINGGDVEK 518
            PGCSWIE+K +V++FVAGD S+P+ + IE  L++VR RM EEG  P  +YALI+  ++EK
Sbjct: 586  PGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIEEGYSPLTKYALIDAEEMEK 645

Query: 517  EEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHEMAKFMSKMFGKEIVLRDSNR 338
            EE LCGHSEKLAMA GI S   GK +RVTKNLRVCGDCHEMAKFMSK+  +EIVLRDSNR
Sbjct: 646  EEALCGHSEKLAMALGIISSGHGKIIRVTKNLRVCGDCHEMAKFMSKLTRREIVLRDSNR 705

Query: 337  FHHFMDGRCSCRGYW 293
            FH F DG CSCRG+W
Sbjct: 706  FHQFKDGHCSCRGFW 720


>ref|NP_193610.1| pentatricopeptide repeat protein DOT4 [Arabidopsis thaliana]
            gi|75206861|sp|Q9SN39.1|PP320_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g18750, chloroplastic; Flags: Precursor
            gi|4539394|emb|CAB37460.1| putative protein [Arabidopsis
            thaliana] gi|7268669|emb|CAB78877.1| putative protein
            [Arabidopsis thaliana] gi|332658686|gb|AEE84086.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 871

 Score =  438 bits (1126), Expect = e-120
 Identities = 206/315 (65%), Positives = 249/315 (79%)
 Frame = -1

Query: 1237 FNAHIDKDVASWNIMLTGYAQQGHGALAIDLFGRMIDARIKPDEITFIAVLCACSRSGMV 1058
            F+    KD+ SW +M+ GY   G G  AI LF +M  A I+ DEI+F+++L ACS SG+V
Sbjct: 557  FDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLV 616

Query: 1057 SEGLEYFNSMEQHYSVTPNLKHYACVVDLLARGGRLTKAYKFIDTMPIEPDSTIWGALLC 878
             EG  +FN M     + P ++HYAC+VD+LAR G L KAY+FI+ MPI PD+TIWGALLC
Sbjct: 617  DEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLC 676

Query: 877  GCRIHRDVKLAEKVAERVFELEPENTGYYVLLANIYAEAEKWEEVKKLRETIGRRGLRKK 698
            GCRIH DVKLAEKVAE+VFELEPENTGYYVL+ANIYAEAEKWE+VK+LR+ IG+RGLRK 
Sbjct: 677  GCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKN 736

Query: 697  PGCSWIEMKNQVHVFVAGDRSHPQFKKIELLLKRVRMRMKEEGSLPKKRYALINGGDVEK 518
            PGCSWIE+K +V++FVAGD S+P+ + IE  L++VR RM EEG  P  +YALI+  ++EK
Sbjct: 737  PGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIEEGYSPLTKYALIDAEEMEK 796

Query: 517  EEVLCGHSEKLAMAYGIFSLPPGKTVRVTKNLRVCGDCHEMAKFMSKMFGKEIVLRDSNR 338
            EE LCGHSEKLAMA GI S   GK +RVTKNLRVCGDCHEMAKFMSK+  +EIVLRDSNR
Sbjct: 797  EEALCGHSEKLAMALGIISSGHGKIIRVTKNLRVCGDCHEMAKFMSKLTRREIVLRDSNR 856

Query: 337  FHHFMDGRCSCRGYW 293
            FH F DG CSCRG+W
Sbjct: 857  FHQFKDGHCSCRGFW 871


Top