BLASTX nr result

ID: Akebia23_contig00042035 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00042035
         (1210 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003633947.1| PREDICTED: pentatricopeptide repeat-containi...   505   e-140
ref|XP_002301427.2| hypothetical protein POPTR_0002s17640g [Popu...   499   e-139
ref|XP_004230005.1| PREDICTED: pentatricopeptide repeat-containi...   481   e-133
gb|EXB83859.1| hypothetical protein L484_023466 [Morus notabilis]     478   e-132
ref|XP_006339746.1| PREDICTED: pentatricopeptide repeat-containi...   476   e-131
ref|XP_004510637.1| PREDICTED: pentatricopeptide repeat-containi...   474   e-131
ref|XP_007135284.1| hypothetical protein PHAVU_010G116300g [Phas...   469   e-129
gb|EYU32269.1| hypothetical protein MIMGU_mgv1a026672mg [Mimulus...   466   e-129
ref|XP_007051479.1| Pentatricopeptide repeat (PPR) superfamily p...   466   e-129
ref|XP_004155716.1| PREDICTED: pentatricopeptide repeat-containi...   466   e-128
ref|XP_003548250.1| PREDICTED: pentatricopeptide repeat-containi...   464   e-128
ref|XP_004142577.1| PREDICTED: pentatricopeptide repeat-containi...   451   e-124
emb|CAN66974.1| hypothetical protein VITISV_022076 [Vitis vinifera]   441   e-121
ref|XP_006289662.1| hypothetical protein CARUB_v10003220mg [Caps...   435   e-119
ref|NP_196468.1| pentatricopeptide repeat-containing protein [Ar...   434   e-119
dbj|BAE98759.1| hypothetical protein [Arabidopsis thaliana]           434   e-119
emb|CBI40590.3| unnamed protein product [Vitis vinifera]              429   e-118
ref|XP_006399350.1| hypothetical protein EUTSA_v10013320mg [Eutr...   428   e-117
ref|XP_002871343.1| pentatricopeptide repeat-containing protein ...   428   e-117
dbj|BAA90805.1| pentatricopeptide (PPR) repeat-containing protei...   427   e-117

>ref|XP_003633947.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like
            [Vitis vinifera]
          Length = 512

 Score =  505 bits (1300), Expect = e-140
 Identities = 249/360 (69%), Positives = 291/360 (80%)
 Frame = -3

Query: 1208 RDIPAWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRM 1029
            RD+P WNS++AG+AR G+L  A ELF LMP RNV SWTAMISGY+QNG+Y  A+ M++ M
Sbjct: 146  RDVPTWNSMIAGYARCGDLEGALELFRLMPARNVTSWTAMISGYAQNGQYAKALSMFLMM 205

Query: 1028 EKESGVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGN 849
            E+E+ + PNEVT+ASVLPACANLGAL++GERIE YAR  G+ +NL+VSNALLEMYA+CG 
Sbjct: 206  EEETEMRPNEVTLASVLPACANLGALEVGERIEVYARGNGYFKNLYVSNALLEMYARCGR 265

Query: 848  IDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLL 669
            ID A  VF+EI   RNLCSWNSMIMGLAVHGR  E +ELF++ML EG  PDD+TFVGVLL
Sbjct: 266  IDKAWGVFEEIDGRRNLCSWNSMIMGLAVHGRCDEAIELFYKMLREGAAPDDVTFVGVLL 325

Query: 668  ACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYALIKRMPMTPDS 489
            ACTHGG+V +G  FF+SME DFSI PKLEHYGCMVDLLGRAGEL EA+ LI RMPM PDS
Sbjct: 326  ACTHGGMVVEGQHFFESMERDFSIAPKLEHYGCMVDLLGRAGELREAHDLILRMPMEPDS 385

Query: 488  VVWGSLLGACSFYGNVXXXXXXXXXXXXXEPRNPGIYVILSNVYALSGSWDGVAKVRKLM 309
            VVWG+LLGACSF+G+V             EP NPG YVILSN+YA +G WDGVA++RKLM
Sbjct: 386  VVWGTLLGACSFHGHVELAEKAAGALFELEPSNPGNYVILSNIYATAGRWDGVARLRKLM 445

Query: 308  KGNQVTKAAGYSFIEVDGFVHKFIVEDRSHTRSFEIYELLDEISNKMKLLGYEPDLDSDI 129
            KG ++TKAAGYSFIE  G +HKFIVEDRSH+RS EIY LLDE+S KMKL G   D DS+I
Sbjct: 446  KGGKITKAAGYSFIEEGGHIHKFIVEDRSHSRSDEIYALLDEVSMKMKLHGNVNDSDSEI 505



 Score = 95.9 bits (237), Expect = 3e-17
 Identities = 66/273 (24%), Positives = 126/273 (46%), Gaps = 40/273 (14%)
 Frame = -3

Query: 1145 ARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACA 966
            A +LF+ +P   V  +  +I  YS +G +     +Y +M  + G SPNE +   +  ACA
Sbjct: 35   AHKLFDFIPKPTVFLYNKLIQAYSSHGPHHQCFSLYTQMCLQ-GCSPNEHSFTFLFSACA 93

Query: 965  NLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDE----------- 819
            +L + + G  +  +  K GF  ++F   AL++MYAK G + +AR+ FDE           
Sbjct: 94   SLSSHQQGRMLHTHFVKSGFGCDVFALTALVDMYAKLGLLSLARKQFDEMTVRDVPTWNS 153

Query: 818  -------------------IGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSE-GIIP 699
                               +   RN+ SW +MI G A +G++ + L +F  M  E  + P
Sbjct: 154  MIAGYARCGDLEGALELFRLMPARNVTSWTAMISGYAQNGQYAKALSMFLMMEEETEMRP 213

Query: 698  DDITFVGVLLACTHGGLVKQGWQ---------FFKSMELDFSITPKLEHYGCMVDLLGRA 546
            +++T   VL AC + G ++ G +         +FK++ +             ++++  R 
Sbjct: 214  NEVTLASVLPACANLGALEVGERIEVYARGNGYFKNLYVS----------NALLEMYARC 263

Query: 545  GELNEAYALIKRMPMTPDSVVWGSLLGACSFYG 447
            G +++A+ + + +    +   W S++   + +G
Sbjct: 264  GRIDKAWGVFEEIDGRRNLCSWNSMIMGLAVHG 296


>ref|XP_002301427.2| hypothetical protein POPTR_0002s17640g [Populus trichocarpa]
            gi|550345235|gb|EEE80700.2| hypothetical protein
            POPTR_0002s17640g [Populus trichocarpa]
          Length = 514

 Score =  499 bits (1286), Expect = e-139
 Identities = 242/362 (66%), Positives = 288/362 (79%)
 Frame = -3

Query: 1208 RDIPAWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRM 1029
            RDIP WNS++AG++RSG++  A ELF+LMP R+VVSWT MISGYSQNG Y  A+EM+++M
Sbjct: 146  RDIPTWNSLIAGYSRSGDMEGALELFKLMPSRSVVSWTTMISGYSQNGMYTKALEMFLKM 205

Query: 1028 EKESGVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGN 849
            EK+  V PNEVTIASV  ACA LGAL++GERIE YAR  G ++NL+VSN LLEMYA+CG 
Sbjct: 206  EKDKEVRPNEVTIASVFSACAKLGALEVGERIESYARDNGLMKNLYVSNTLLEMYARCGK 265

Query: 848  IDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLL 669
            ID AR VF+EIG+ RNLCSWNSM+MGLAVHGR  E L+L+ +ML EGI PDD+TFVG++L
Sbjct: 266  IDAARHVFNEIGKRRNLCSWNSMMMGLAVHGRSNEALQLYDQMLGEGIEPDDVTFVGLIL 325

Query: 668  ACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYALIKRMPMTPDS 489
            ACTHGGLV +GWQ F+SME +FSI PKLEHYGCMVDLLGRAGEL EAY L+K MPM PDS
Sbjct: 326  ACTHGGLVAKGWQLFQSMETNFSIVPKLEHYGCMVDLLGRAGELQEAYDLVKSMPMKPDS 385

Query: 488  VVWGSLLGACSFYGNVXXXXXXXXXXXXXEPRNPGIYVILSNVYALSGSWDGVAKVRKLM 309
            V+WG+LLGACSF+ NV             EP NPG YVIL N+YA +  WDGVAK+RKLM
Sbjct: 386  VIWGTLLGACSFHSNVEFAEIAAESLFQVEPWNPGNYVILCNIYASAQRWDGVAKLRKLM 445

Query: 308  KGNQVTKAAGYSFIEVDGFVHKFIVEDRSHTRSFEIYELLDEISNKMKLLGYEPDLDSDI 129
            KG Q+TKAAGYS IE +G +HKFIVED+SH R +EIY LL+EIS KMKL   E D   ++
Sbjct: 446  KGGQITKAAGYSVIEGEGEIHKFIVEDKSHPRHYEIYALLNEISTKMKLQITEDDFKPEL 505

Query: 128  AE 123
             E
Sbjct: 506  EE 507



 Score = 95.1 bits (235), Expect = 5e-17
 Identities = 68/272 (25%), Positives = 123/272 (45%), Gaps = 32/272 (11%)
 Frame = -3

Query: 1166 RSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIA 987
            R  ++  A ++F   PY  V  +  +I  YS   +    + +Y +M  + G  PNE+T  
Sbjct: 28   RIPDIPYAHKVFNQSPYPTVFLYNKLIKAYSSQNQPRQCLSLYSQMLLK-GCPPNELTFT 86

Query: 986  SVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRG 807
             + PACA+  +L  G+ I  +  K GF  +++   AL+ MYAK G + +AR+VFDE+   
Sbjct: 87   FLFPACASFYSLLHGKVIHTHFIKSGFDFDVYALTALVNMYAKLGVLMLARQVFDEM-TV 145

Query: 806  RNLCSWNSMIMGLAVHGRWKEGLELFHEMLSEGII------------------------- 702
            R++ +WNS+I G +  G  +  LELF  M S  ++                         
Sbjct: 146  RDIPTWNSLIAGYSRSGDMEGALELFKLMPSRSVVSWTTMISGYSQNGMYTKALEMFLKM 205

Query: 701  -------PDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAG 543
                   P+++T   V  AC   G ++ G +  +S   D  +   L     ++++  R G
Sbjct: 206  EKDKEVRPNEVTIASVFSACAKLGALEVG-ERIESYARDNGLMKNLYVSNTLLEMYARCG 264

Query: 542  ELNEAYALIKRMPMTPDSVVWGSLLGACSFYG 447
            +++ A  +   +    +   W S++   + +G
Sbjct: 265  KIDAARHVFNEIGKRRNLCSWNSMMMGLAVHG 296


>ref|XP_004230005.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like
            [Solanum lycopersicum]
          Length = 508

 Score =  481 bits (1237), Expect = e-133
 Identities = 233/364 (64%), Positives = 288/364 (79%), Gaps = 2/364 (0%)
 Frame = -3

Query: 1208 RDIPAWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRM 1029
            +D+P WNS++AG+A++G + +A +LF +MP RNV+SWTAMISGYSQNG+Y +A+ +Y +M
Sbjct: 146  KDVPIWNSLIAGYAKNGNVVEAFKLFSVMPSRNVISWTAMISGYSQNGKYANALAVYKQM 205

Query: 1028 EKESGVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGN 849
            EK+  V PNEVTIASVLPACANLGAL++GE IE YAR  G+ +N+FV NA+LEMY KCG 
Sbjct: 206  EKDRKVKPNEVTIASVLPACANLGALEVGENIEAYARANGYFKNMFVCNAVLEMYTKCGR 265

Query: 848  IDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLL 669
            ID A ++F EIGR RNLCSWN+MIMGLAVHG+  E L+LF++ML EG  PDD+TFVG +L
Sbjct: 266  IDRAMQLFHEIGRRRNLCSWNTMIMGLAVHGKGDEALKLFNQMLGEGNTPDDVTFVGAIL 325

Query: 668  ACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYALIKRMPMTPDS 489
            ACTHGG+V +GW+  K ME  FSI PKLEHYGCMVDLLGRAG+L EAY LI+ MPM PD 
Sbjct: 326  ACTHGGMVAKGWELLKLMEQRFSIAPKLEHYGCMVDLLGRAGKLQEAYDLIQSMPMRPDC 385

Query: 488  VVWGSLLGACSFYGNVXXXXXXXXXXXXXEPRNPGIYVILSNVYALSGSWDGVAKVRKLM 309
            V+WG++LGACSFYGNV             EP NPG YVILSN+YA +G WDGVA++RKLM
Sbjct: 386  VIWGTILGACSFYGNVELAEKAAEFLSVLEPWNPGNYVILSNIYARAGRWDGVARLRKLM 445

Query: 308  KGNQVTKAAGYSFIEVDGFVHKFIVEDRSHTRSFEIYELLDEISNKMK--LLGYEPDLDS 135
            K +Q+TKAAGYSFIE  G +HKFIVED+SH +S EIY LLD ++ ++K  +   E DLDS
Sbjct: 446  KSSQITKAAGYSFIEEGGDIHKFIVEDKSHPKSNEIYSLLDLVTTRLKFDVSTMEIDLDS 505

Query: 134  DIAE 123
             IAE
Sbjct: 506  -IAE 508



 Score = 80.1 bits (196), Expect = 2e-12
 Identities = 56/213 (26%), Positives = 98/213 (46%), Gaps = 5/213 (2%)
 Frame = -3

Query: 1145 ARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACA 966
            A ++F+ +    V  +  +I  YS +G       +Y++M ++ G SPN  +   +  AC+
Sbjct: 35   AHKVFDNITKPTVFLYNKLIQAYSSHGFPSQCFSLYIKMRRQ-GCSPNPHSFTFLFAACS 93

Query: 965  NLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWN 786
            N      G+    +  K GF  +++   AL++MYAK   +  AR++FDE+   +++  WN
Sbjct: 94   NRSTPIQGQMFHVHFIKWGFEFDIYTLTALVDMYAKMSLLPSARKLFDEM-EMKDVPIWN 152

Query: 785  SMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELD 606
            S+I G A +G   E  +LF  M S  +    I++  ++   +  G        +K ME D
Sbjct: 153  SLIAGYAKNGNVVEAFKLFSVMPSRNV----ISWTAMISGYSQNGKYANALAVYKQMEKD 208

Query: 605  FSITPKLEHYGCMVDLLGRAGELN-----EAYA 522
              + P       ++      G L      EAYA
Sbjct: 209  RKVKPNEVTIASVLPACANLGALEVGENIEAYA 241


>gb|EXB83859.1| hypothetical protein L484_023466 [Morus notabilis]
          Length = 513

 Score =  478 bits (1230), Expect = e-132
 Identities = 236/365 (64%), Positives = 286/365 (78%)
 Frame = -3

Query: 1208 RDIPAWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRM 1029
            R  P WNS+++G+ARSG++  A ELF LMP RNVVSWTAMISGYS+NG+Y  A+ M+++M
Sbjct: 149  RGTPTWNSMLSGYARSGDMEGASELFRLMPQRNVVSWTAMISGYSKNGQYAKALAMFLQM 208

Query: 1028 EKESGVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGN 849
            EKE  V PN +TIASVLPACANLGAL++GER+E YARK GFL++L+VSNA+LEMYAKCG 
Sbjct: 209  EKERDVRPNAITIASVLPACANLGALEVGERVEEYARKVGFLKDLYVSNAVLEMYAKCGR 268

Query: 848  IDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLL 669
            ID ARRVFDEIGR RNLCSWNSMIMGLAVHGR  E L+L+ +M +  I PDD+TFVG++L
Sbjct: 269  IDTARRVFDEIGRRRNLCSWNSMIMGLAVHGRCNEALDLYEQMTTVRIAPDDVTFVGLIL 328

Query: 668  ACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYALIKRMPMTPDS 489
            ACTHGG+  +G Q FKSME  F ITPKLEHYGCMVDLLGRAG+L EAY LI+ M M PD+
Sbjct: 329  ACTHGGMAMKGQQLFKSMEPKFGITPKLEHYGCMVDLLGRAGKLQEAYDLIQGMSMKPDN 388

Query: 488  VVWGSLLGACSFYGNVXXXXXXXXXXXXXEPRNPGIYVILSNVYALSGSWDGVAKVRKLM 309
            V+WG+LLGACSF+GNV             E  NP  YVILSN+YA +  WDGVAK+RK+M
Sbjct: 389  VIWGALLGACSFHGNVELAEKAAESLFELESWNPANYVILSNIYASARRWDGVAKLRKVM 448

Query: 308  KGNQVTKAAGYSFIEVDGFVHKFIVEDRSHTRSFEIYELLDEISNKMKLLGYEPDLDSDI 129
            KG ++TKAAGYSFIE  G VHKFIVED+SH RS EIY LL++   K++L   + D  ++ 
Sbjct: 449  KGGKITKAAGYSFIEEGGQVHKFIVEDKSHPRSDEIYALLNKFYAKVRLYRNDTDCLTED 508

Query: 128  AE*EF 114
             E +F
Sbjct: 509  EEMQF 513



 Score = 95.5 bits (236), Expect = 4e-17
 Identities = 65/264 (24%), Positives = 124/264 (46%), Gaps = 31/264 (11%)
 Frame = -3

Query: 1145 ARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACA 966
            AR LF+L+P   V  +  +I  YS +G++   + +Y RM  + G +PNE +   +   C+
Sbjct: 38   ARNLFDLIPEPTVFLYNRLIKAYSFHGQHHQCLFLYRRMCLQ-GCTPNEHSFTLLFSVCS 96

Query: 965  NLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDE----------- 819
            +L + ++G+ +  +  K G +R++F   AL++MYAK G +D AR+ FDE           
Sbjct: 97   SLSSRQLGQMMHSHFVKLGHVRDIFALTALVDMYAKLGMLDCARKQFDEKRVRGTPTWNS 156

Query: 818  -------------------IGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSE-GIIP 699
                               +   RN+ SW +MI G + +G++ + L +F +M  E  + P
Sbjct: 157  MLSGYARSGDMEGASELFRLMPQRNVVSWTAMISGYSKNGQYAKALAMFLQMEKERDVRP 216

Query: 698  DDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYAL 519
            + IT   VL AC + G ++ G +  +           L     ++++  + G ++ A  +
Sbjct: 217  NAITIASVLPACANLGALEVG-ERVEEYARKVGFLKDLYVSNAVLEMYAKCGRIDTARRV 275

Query: 518  IKRMPMTPDSVVWGSLLGACSFYG 447
               +    +   W S++   + +G
Sbjct: 276  FDEIGRRRNLCSWNSMIMGLAVHG 299


>ref|XP_006339746.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like
            [Solanum tuberosum]
          Length = 508

 Score =  476 bits (1224), Expect = e-131
 Identities = 226/359 (62%), Positives = 282/359 (78%)
 Frame = -3

Query: 1208 RDIPAWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRM 1029
            +D+P WNS++AG+A++G + +A +LF +MP RNV+SWTAMISGYSQNG+Y +A+ +Y  M
Sbjct: 146  KDVPTWNSLIAGYAKNGNVEEAFKLFSVMPSRNVISWTAMISGYSQNGKYANALAVYKEM 205

Query: 1028 EKESGVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGN 849
            EK+  V PNEVTIASVLPACANLGAL++GE IE YAR  G+ +N+FV NA+LEMY KCG 
Sbjct: 206  EKDRRVKPNEVTIASVLPACANLGALEVGENIEAYARANGYFKNMFVCNAILEMYTKCGR 265

Query: 848  IDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLL 669
            ID + ++F EIGR RNLCSWN+MIMGLAVHG+  E L+LF++ML EG  PDD+TFVG +L
Sbjct: 266  IDRSMQLFHEIGRRRNLCSWNTMIMGLAVHGKGDEVLKLFNQMLGEGNAPDDVTFVGAIL 325

Query: 668  ACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYALIKRMPMTPDS 489
            ACTHGG+V +GW+  K ME  FSI PKLEHYGCMVDLLGRAG+L EAY LI+ +PM PD 
Sbjct: 326  ACTHGGMVAKGWELLKLMEQRFSIAPKLEHYGCMVDLLGRAGKLQEAYDLIQSIPMRPDC 385

Query: 488  VVWGSLLGACSFYGNVXXXXXXXXXXXXXEPRNPGIYVILSNVYALSGSWDGVAKVRKLM 309
            V+WG+LLGACSF+GNV             EP NPG YVILSN+YA +G WDGVA++RKLM
Sbjct: 386  VIWGTLLGACSFHGNVELAEKAAEFLSVLEPWNPGNYVILSNIYARTGRWDGVARLRKLM 445

Query: 308  KGNQVTKAAGYSFIEVDGFVHKFIVEDRSHTRSFEIYELLDEISNKMKLLGYEPDLDSD 132
            K +Q+TKAAGYSFIE  G +HKFIVED+SH +S EIY LLD ++ ++K      D+D D
Sbjct: 446  KSSQITKAAGYSFIEEGGDIHKFIVEDKSHPKSNEIYSLLDLVTTRLKFDVSTMDIDLD 504



 Score = 81.6 bits (200), Expect = 6e-13
 Identities = 56/213 (26%), Positives = 100/213 (46%), Gaps = 5/213 (2%)
 Frame = -3

Query: 1145 ARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACA 966
            A ++F+ +    V  +  +I  YS +G       +Y++M ++ G SPN  +   +  AC 
Sbjct: 35   AHKVFDSITKPTVFLYNKLIQAYSSHGLPSRCFSLYIQMRRQ-GCSPNPHSFTFLFAACT 93

Query: 965  NLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWN 786
            N  +   G+    +  K GF  +++   AL++MYAK   +  AR++FDE+   +++ +WN
Sbjct: 94   NSSSPIQGQMFHVHFIKWGFEFDIYTLTALVDMYAKMSLLPSARKLFDEM-EMKDVPTWN 152

Query: 785  SMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELD 606
            S+I G A +G  +E  +LF  M S  +    I++  ++   +  G        +K ME D
Sbjct: 153  SLIAGYAKNGNVEEAFKLFSVMPSRNV----ISWTAMISGYSQNGKYANALAVYKEMEKD 208

Query: 605  FSITPKLEHYGCMVDLLGRAGELN-----EAYA 522
              + P       ++      G L      EAYA
Sbjct: 209  RRVKPNEVTIASVLPACANLGALEVGENIEAYA 241


>ref|XP_004510637.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like
            [Cicer arietinum]
          Length = 512

 Score =  474 bits (1221), Expect = e-131
 Identities = 229/362 (63%), Positives = 289/362 (79%), Gaps = 2/362 (0%)
 Frame = -3

Query: 1208 RDIPAWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRM 1029
            R++P WN+++AG+ R G++ +A ELF LMP RNVVSWT ++SGYSQN +YE A+E+++RM
Sbjct: 147  REVPTWNAMMAGYTRFGDMERALELFGLMPARNVVSWTTVVSGYSQNKQYEKALELFLRM 206

Query: 1028 EKESGVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGN 849
            E E  V PNEVT+ASVLPACANLGAL++G+R+E YAR+ G  +NLFVSNA+LEMYAKCG 
Sbjct: 207  EWEKDVIPNEVTLASVLPACANLGALEIGQRVEAYARENGLFKNLFVSNAVLEMYAKCGK 266

Query: 848  IDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLL 669
            ID+A +VFDE+GR RNLCS+NSMIMGLAVHG+  + +EL+ +ML EG +PDD+TFVG+LL
Sbjct: 267  IDVAWKVFDEMGRFRNLCSFNSMIMGLAVHGQCDKAIELYDQMLREGTLPDDVTFVGLLL 326

Query: 668  ACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYALIKRMPMTPDS 489
            ACTHGG+V+ G   FKSM  DF+I PKLEHYGCMVDLLGRAG+L+EAY +IK MPM PDS
Sbjct: 327  ACTHGGMVETGKHIFKSMTRDFNIIPKLEHYGCMVDLLGRAGKLSEAYEVIKSMPMVPDS 386

Query: 488  VVWGSLLGACSFYGNVXXXXXXXXXXXXXEPRNPGIYVILSNVYALSGSWDGVAKVRKLM 309
            V+WG+LLGACSF+GNV             EP NPG YVILSN+YA +G W+GVAK+RK+M
Sbjct: 387  VIWGALLGACSFHGNVELAEIAAESLFVLEPWNPGNYVILSNIYASAGQWNGVAKLRKVM 446

Query: 308  KGNQVTKAAGYSFIEVDGFVHKFIVEDRSHTRSFEIYELLDEISNKMKL--LGYEPDLDS 135
            KG ++TKAAG+SFIE  G +HKFIVEDRSH+ S +I+ LLD +   +K     YE  LD 
Sbjct: 447  KGGKITKAAGHSFIEEGGRLHKFIVEDRSHSESNQIFALLDGVYEMIKFNKNAYECHLDF 506

Query: 134  DI 129
            D+
Sbjct: 507  DL 508



 Score = 79.7 bits (195), Expect = 2e-12
 Identities = 52/221 (23%), Positives = 107/221 (48%), Gaps = 31/221 (14%)
 Frame = -3

Query: 1016 GVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGNIDIA 837
            G SPN+ T   +  A  ++ ++ +G+ +  +  K GF  ++F S ALL+MYAK G++ +A
Sbjct: 78   GHSPNQHTFNFLFKAGTSVSSISLGQMLHTHFIKSGFKHDVFASTALLDMYAKLGSLKLA 137

Query: 836  RRVFDEIG------------------------------RGRNLCSWNSMIMGLAVHGRWK 747
            R VFDE+                                 RN+ SW +++ G + + +++
Sbjct: 138  RHVFDEMSVREVPTWNAMMAGYTRFGDMERALELFGLMPARNVVSWTTVVSGYSQNKQYE 197

Query: 746  EGLELFHEM-LSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGC 570
            + LELF  M   + +IP+++T   VL AC + G ++ G Q  ++   +  +   L     
Sbjct: 198  KALELFLRMEWEKDVIPNEVTLASVLPACANLGALEIG-QRVEAYARENGLFKNLFVSNA 256

Query: 569  MVDLLGRAGELNEAYALIKRMPMTPDSVVWGSLLGACSFYG 447
            ++++  + G+++ A+ +   M    +   + S++   + +G
Sbjct: 257  VLEMYAKCGKIDVAWKVFDEMGRFRNLCSFNSMIMGLAVHG 297


>ref|XP_007135284.1| hypothetical protein PHAVU_010G116300g [Phaseolus vulgaris]
            gi|561008329|gb|ESW07278.1| hypothetical protein
            PHAVU_010G116300g [Phaseolus vulgaris]
          Length = 510

 Score =  469 bits (1206), Expect = e-129
 Identities = 227/363 (62%), Positives = 285/363 (78%), Gaps = 2/363 (0%)
 Frame = -3

Query: 1208 RDIPAWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRM 1029
            R +P WN++++G+A+ G++  A ELF LMP RN+VSWT MISGYS+N ++ +A+ ++++M
Sbjct: 147  RGVPTWNAMMSGYAKFGDMEGALELFGLMPTRNLVSWTTMISGYSRNKQFGEALGLFLKM 206

Query: 1028 EKESGVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGN 849
            E+E G+ PNEVT+AS+LPAC+NLGAL++G+R+E YARK GF +NL+VSNALLEMYAKCG 
Sbjct: 207  EQEKGIVPNEVTLASILPACSNLGALEIGQRVEAYARKNGFFKNLYVSNALLEMYAKCGK 266

Query: 848  IDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLL 669
            ID+A RVF+EIGR RNLCSWNSMIMGLAVHG+  +  EL+ +ML EG  PDD+TFVG+LL
Sbjct: 267  IDVAWRVFNEIGRFRNLCSWNSMIMGLAVHGQCCKAFELYDQMLGEGTSPDDVTFVGLLL 326

Query: 668  ACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYALIKRMPMTPDS 489
            ACTHGG+V++G   FKSM   F I PKLEHYGCMVDLLGRAG L EAY +I+ MPM PDS
Sbjct: 327  ACTHGGMVEKGRHIFKSMTTAFHIIPKLEHYGCMVDLLGRAGHLREAYEVIQSMPMKPDS 386

Query: 488  VVWGSLLGACSFYGNVXXXXXXXXXXXXXEPRNPGIYVILSNVYALSGSWDGVAKVRKLM 309
            V+WG+LLGACSF+GNV             EP NPG YVILSN+YA  G WDGVAK+RK+M
Sbjct: 387  VIWGALLGACSFHGNVELAEVAAESLFVLEPWNPGNYVILSNIYASVGQWDGVAKLRKVM 446

Query: 308  KGNQVTKAAGYSFIEVDGFVHKFIVEDRSHTRSFEIYELLDEISNKMKL--LGYEPDLDS 135
            KGNQ+TK+AG+SFIE  G +HKFIVEDRSH R  EI  LLD +   +KL    +E  LD 
Sbjct: 447  KGNQITKSAGHSFIEEGGQLHKFIVEDRSHPRRNEILALLDGVYEMIKLNRSAFEYHLDL 506

Query: 134  DIA 126
            D++
Sbjct: 507  DLS 509



 Score = 89.7 bits (221), Expect = 2e-15
 Identities = 64/257 (24%), Positives = 123/257 (47%), Gaps = 32/257 (12%)
 Frame = -3

Query: 1121 PYRNVVSWTAMISGYSQNGRYED-AVEMYVRMEKESGVSPNEVTIASVLPACANLGALKM 945
            P +N+  +  +I  YS + +++     +Y +M    G  PN+ T   +  AC +L +  +
Sbjct: 43   PKQNLFLYNKLIQAYSSHPQHQHRCFSLYYQMRLH-GFLPNQHTFNFLFSACTSLFSHSL 101

Query: 944  GERIERYARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIG-RG------------- 807
            G+ +  +  K GF  +LF + ALL+MY K G + +AR++FDE+  RG             
Sbjct: 102  GQMLHTHFIKSGFEPDLFAATALLDMYCKVGTLGLARQLFDEMPVRGVPTWNAMMSGYAK 161

Query: 806  ----------------RNLCSWNSMIMGLAVHGRWKEGLELFHEMLSE-GIIPDDITFVG 678
                            RNL SW +MI G + + ++ E L LF +M  E GI+P+++T   
Sbjct: 162  FGDMEGALELFGLMPTRNLVSWTTMISGYSRNKQFGEALGLFLKMEQEKGIVPNEVTLAS 221

Query: 677  VLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYALIKRMPMT 498
            +L AC++ G ++ G Q  ++          L     ++++  + G+++ A+ +   +   
Sbjct: 222  ILPACSNLGALEIG-QRVEAYARKNGFFKNLYVSNALLEMYAKCGKIDVAWRVFNEIGRF 280

Query: 497  PDSVVWGSLLGACSFYG 447
             +   W S++   + +G
Sbjct: 281  RNLCSWNSMIMGLAVHG 297


>gb|EYU32269.1| hypothetical protein MIMGU_mgv1a026672mg [Mimulus guttatus]
          Length = 516

 Score =  466 bits (1200), Expect = e-129
 Identities = 225/366 (61%), Positives = 285/366 (77%), Gaps = 1/366 (0%)
 Frame = -3

Query: 1208 RDIPAWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRM 1029
            +D P WNS++AG+AR+G++++A  LF  MP RNV+SWTA+ISG+SQNG+Y++A+EMY+ M
Sbjct: 146  KDAPTWNSLIAGYARNGDMSEALRLFSNMPSRNVISWTAIISGFSQNGKYKEALEMYLAM 205

Query: 1028 EKESGVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGN 849
            E++  V PN VT+ASVLPACANLGAL++G+RIE YAR  G+ +N FV NA+LE+YA+CG 
Sbjct: 206  ERDGKVKPNHVTLASVLPACANLGALEVGQRIEAYARANGYFKNAFVCNAVLELYARCGV 265

Query: 848  IDIARRVFDEIGRG-RNLCSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVL 672
            I+ A +VFDEIG G RNLCSWN++IMGLAVHGR    LE+F++ML++G+ PDD+TFVG +
Sbjct: 266  IEKAMQVFDEIGSGNRNLCSWNTLIMGLAVHGRCDGALEIFNQMLTKGVTPDDVTFVGAI 325

Query: 671  LACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYALIKRMPMTPD 492
            LACTHGG+V +G + F SME  FSITPK+EHYGCMVDLLGRAG L EAY LIK MPM PD
Sbjct: 326  LACTHGGMVNKGREIFDSMEKRFSITPKIEHYGCMVDLLGRAGLLQEAYKLIKAMPMKPD 385

Query: 491  SVVWGSLLGACSFYGNVXXXXXXXXXXXXXEPRNPGIYVILSNVYALSGSWDGVAKVRKL 312
            SVVWG+LLGACSF+GNV             EP NPG YVILSN+YA +G W+GVAK+RKL
Sbjct: 386  SVVWGTLLGACSFHGNVELGEKAAESLFVLEPLNPGNYVILSNIYARAGRWNGVAKLRKL 445

Query: 311  MKGNQVTKAAGYSFIEVDGFVHKFIVEDRSHTRSFEIYELLDEISNKMKLLGYEPDLDSD 132
            MKG+ V K AG+SFIE  G +HKFIVED+SH R  +I+  LD ++ +MK  G     DS 
Sbjct: 446  MKGSNVVKGAGHSFIEEGGLIHKFIVEDKSHERCDDIFTALDCVTAEMKFDGNAIGFDSI 505

Query: 131  IAE*EF 114
            + E  F
Sbjct: 506  VDEIRF 511



 Score = 81.3 bits (199), Expect = 8e-13
 Identities = 56/213 (26%), Positives = 99/213 (46%), Gaps = 5/213 (2%)
 Frame = -3

Query: 1145 ARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACA 966
            A +L +  P   +  ++ +I  YS +G +     +Y ++   S  SPN      +  ACA
Sbjct: 35   AHKLLDKTPDPTLFLYSKLIKAYSSHGPHFQCFSLYSQILHLS-FSPNPNCFTFLFSACA 93

Query: 965  NLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWN 786
             L     G+ +  +  K G   +++   AL++MYAK G +  +R++FDE+   ++  +WN
Sbjct: 94   KLSNPSQGQMLHAHFIKFGLDYDVYALTALVDMYAKMGLLRFSRKIFDEM-NDKDAPTWN 152

Query: 785  SMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELD 606
            S+I G A +G   E L LF  M S  +    I++  ++   +  G  K+  + + +ME D
Sbjct: 153  SLIAGYARNGDMSEALRLFSNMPSRNV----ISWTAIISGFSQNGKYKEALEMYLAMERD 208

Query: 605  FSITPKLEHYGCMVDLLGRAGELN-----EAYA 522
              + P       ++      G L      EAYA
Sbjct: 209  GKVKPNHVTLASVLPACANLGALEVGQRIEAYA 241


>ref|XP_007051479.1| Pentatricopeptide repeat (PPR) superfamily protein, putative
            [Theobroma cacao] gi|508703740|gb|EOX95636.1|
            Pentatricopeptide repeat (PPR) superfamily protein,
            putative [Theobroma cacao]
          Length = 515

 Score =  466 bits (1199), Expect = e-129
 Identities = 225/360 (62%), Positives = 281/360 (78%)
 Frame = -3

Query: 1208 RDIPAWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRM 1029
            R++P WN++++G++  G++ +A ELF+ MP +NVVSWT MISGYSQNG+Y  A++M++RM
Sbjct: 148  RNLPTWNALISGYSMCGDMKEALELFKSMPEKNVVSWTTMISGYSQNGQYSKALDMFLRM 207

Query: 1028 EKESGVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGN 849
            EKE+GV PN VTIASVLPACANLGAL++GERIE YAR+ G   +L+VSN +LEMYA+CG 
Sbjct: 208  EKETGVKPNRVTIASVLPACANLGALEVGERIETYARENGLFEDLYVSNTVLEMYARCGK 267

Query: 848  IDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLL 669
            I++A+ VFDEIG+ RNLC WNSMIMGLA+HG+  E  E + +ML EG  PDD+TFVGVLL
Sbjct: 268  IEVAKLVFDEIGKRRNLCVWNSMIMGLALHGKCIEAFEYYDQMLQEGTAPDDVTFVGVLL 327

Query: 668  ACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYALIKRMPMTPDS 489
            ACTHG LV +G + F+SM   + I+PKLEHYGCMVDLLGR+G L EAY LIK MPM PD+
Sbjct: 328  ACTHGRLVVKGRELFESMGKKYHISPKLEHYGCMVDLLGRSGALQEAYDLIKSMPMKPDA 387

Query: 488  VVWGSLLGACSFYGNVXXXXXXXXXXXXXEPRNPGIYVILSNVYALSGSWDGVAKVRKLM 309
            VVWG+LLGACSF+ NV             EP N G YVILSN+YA  G WDGVAK+RKLM
Sbjct: 388  VVWGALLGACSFHNNVELAEKAAQPLFQLEPWNAGNYVILSNIYASWGWWDGVAKLRKLM 447

Query: 308  KGNQVTKAAGYSFIEVDGFVHKFIVEDRSHTRSFEIYELLDEISNKMKLLGYEPDLDSDI 129
            KG Q+TKAAGYSFIE  G +HKFIVED+SH R  EIY++LD++S  MKL     D +S++
Sbjct: 448  KGGQITKAAGYSFIEEGGRMHKFIVEDKSHPRCDEIYQILDQVSRVMKLQDKLMDSESEL 507



 Score = 97.4 bits (241), Expect = 1e-17
 Identities = 71/265 (26%), Positives = 123/265 (46%), Gaps = 32/265 (12%)
 Frame = -3

Query: 1145 ARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACA 966
            A +LF L+P + V  +  +I  YS   +    + +Y +M   +  SPNE +   + PACA
Sbjct: 37   AHKLFNLIPQKTVFLYNKLIQAYSSINQSHRCLTLYSQMCLNN-CSPNEHSFIFLFPACA 95

Query: 965  NLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWN 786
            +L +L  G+ +     K GF  + +   ALL MYAK   + +AR+VFDE+ R RNL +WN
Sbjct: 96   SLPSLLHGQILHTQFLKSGFGLDCYALTALLVMYAKLRMLPLARKVFDEM-RVRNLPTWN 154

Query: 785  SMIMGLAVHGRWKEGLELFHEMLSE--------------------------------GII 702
            ++I G ++ G  KE LELF  M  +                                G+ 
Sbjct: 155  ALISGYSMCGDMKEALELFKSMPEKNVVSWTTMISGYSQNGQYSKALDMFLRMEKETGVK 214

Query: 701  PDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYA 522
            P+ +T   VL AC + G ++ G +  ++   +  +   L     ++++  R G++  A  
Sbjct: 215  PNRVTIASVLPACANLGALEVG-ERIETYARENGLFEDLYVSNTVLEMYARCGKIEVAKL 273

Query: 521  LIKRMPMTPDSVVWGSLLGACSFYG 447
            +   +    +  VW S++   + +G
Sbjct: 274  VFDEIGKRRNLCVWNSMIMGLALHG 298


>ref|XP_004155716.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like
            [Cucumis sativus]
          Length = 512

 Score =  466 bits (1198), Expect = e-128
 Identities = 221/349 (63%), Positives = 276/349 (79%)
 Frame = -3

Query: 1208 RDIPAWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRM 1029
            RDIP WNS++AG+ARSG +  A ELF  MP RNV+SWTA+ISGY+QNG+Y  A+EM++ +
Sbjct: 146  RDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALEMFIGL 205

Query: 1028 EKESGVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGN 849
            E E G  PNEV+IASVLPAC+ LGAL +G+RIE YAR  GF +N +VSNA+LE++A+CGN
Sbjct: 206  ENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGN 265

Query: 848  IDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLL 669
            I+ A++VFDEIG  RNLCSWN+MIMGLAVHGR  + L+L+ +ML   + PDD+TFVG+LL
Sbjct: 266  IEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLL 325

Query: 668  ACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYALIKRMPMTPDS 489
            ACTHGG+V +G Q F+SME  F + PKLEHYGC+VDLLGRAGEL EAY LI+ MPM PDS
Sbjct: 326  ACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDS 385

Query: 488  VVWGSLLGACSFYGNVXXXXXXXXXXXXXEPRNPGIYVILSNVYALSGSWDGVAKVRKLM 309
            V+WG+LLGACSF+GNV             EP NPG YVILSN+YAL+G W GVA++RK+M
Sbjct: 386  VIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMM 445

Query: 308  KGNQVTKAAGYSFIEVDGFVHKFIVEDRSHTRSFEIYELLDEISNKMKL 162
            KG  +TK AGYS+IEV   +H+FIVEDRSH +S EIY LL +I + +KL
Sbjct: 446  KGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKL 494



 Score = 87.8 bits (216), Expect = 8e-15
 Identities = 62/217 (28%), Positives = 106/217 (48%), Gaps = 5/217 (2%)
 Frame = -3

Query: 1157 ELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVL 978
            +L  A  LF+ +P  +V  +   I  +S  G       +Y +M  + G SPN+ +   + 
Sbjct: 31   DLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQ-GCSPNQYSFTFLF 89

Query: 977  PACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNL 798
            PACA+L  +  G+ +  +  K GF  ++F   ALL+MYAK G +  AR++FDE+   R++
Sbjct: 90   PACASLFNVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEM-PVRDI 148

Query: 797  CSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKS 618
             +WNS+I G A  G  +  LELF++M    +    I++  ++      G   +  + F  
Sbjct: 149  PTWNSLIAGYARSGHMEAALELFNKMPVRNV----ISWTALISGYAQNGKYAKALEMFIG 204

Query: 617  MELDFSITPKLEHYGCMVDLLGRAGELN-----EAYA 522
            +E +    P       ++    + G L+     EAYA
Sbjct: 205  LENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYA 241


>ref|XP_003548250.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like
            [Glycine max]
          Length = 512

 Score =  464 bits (1193), Expect = e-128
 Identities = 225/363 (61%), Positives = 285/363 (78%), Gaps = 2/363 (0%)
 Frame = -3

Query: 1208 RDIPAWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRM 1029
            R +P WN+++AGHAR G++  A ELF LMP RNVVSWT MISGYS++ +Y +A+ +++RM
Sbjct: 147  RGVPTWNAMMAGHARFGDMDVALELFRLMPSRNVVSWTTMISGYSRSKKYGEALGLFLRM 206

Query: 1028 EKESGVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGN 849
            E+E G+ PN VT+AS+ PA ANLGAL++G+R+E YARK GF +NL+VSNA+LEMYAKCG 
Sbjct: 207  EQEKGMMPNAVTLASIFPAFANLGALEIGQRVEAYARKNGFFKNLYVSNAVLEMYAKCGK 266

Query: 848  IDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLL 669
            ID+A +VF+EIG  RNLCSWNSMIMGLAVHG   + L+L+ +ML EG  PDD+TFVG+LL
Sbjct: 267  IDVAWKVFNEIGSLRNLCSWNSMIMGLAVHGECCKTLKLYDQMLGEGTSPDDVTFVGLLL 326

Query: 668  ACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYALIKRMPMTPDS 489
            ACTHGG+V++G   FKSM   F+I PKLEHYGCMVDLLGRAG+L EAY +I+RMPM PDS
Sbjct: 327  ACTHGGMVEKGRHIFKSMTTSFNIIPKLEHYGCMVDLLGRAGQLREAYEVIQRMPMKPDS 386

Query: 488  VVWGSLLGACSFYGNVXXXXXXXXXXXXXEPRNPGIYVILSNVYALSGSWDGVAKVRKLM 309
            V+WG+LLGACSF+ NV             EP NPG YVILSN+YA +G WDGVAK+RK+M
Sbjct: 387  VIWGALLGACSFHDNVELAEIAAESLFALEPWNPGNYVILSNIYASAGQWDGVAKLRKVM 446

Query: 308  KGNQVTKAAGYSFIEVDGFVHKFIVEDRSHTRSFEIYELLDEISNKMKL--LGYEPDLDS 135
            KG+++TK+AG+SFIE  G +HKFIVEDRSH  S EI+ LLD +   +KL    +E  LD 
Sbjct: 447  KGSKITKSAGHSFIEEGGQLHKFIVEDRSHPESNEIFALLDGVYEMIKLNRSAFECHLDL 506

Query: 134  DIA 126
            D++
Sbjct: 507  DLS 509



 Score = 75.9 bits (185), Expect = 3e-11
 Identities = 59/265 (22%), Positives = 120/265 (45%), Gaps = 32/265 (12%)
 Frame = -3

Query: 1145 ARELFELMPYRNVVSWTAMISGYSQNGRYE-DAVEMYVRMEKESGVSPNEVTIASVLPAC 969
            A ++    P   +  +  +I  YS + +++     +Y +M   S + PN+ T   +  AC
Sbjct: 35   AHKVLHHSPKPTLFLYNKLIQAYSSHPQHQHQCFSLYSQMLLHSFL-PNQHTFNFLFSAC 93

Query: 968  ANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIG-RG----- 807
             +L +  +G+ +  +  K GF  +LF + ALL+MY K G +++AR++FD++  RG     
Sbjct: 94   TSLSSPSLGQMLHTHFIKSGFEPDLFAATALLDMYTKVGTLELARKLFDQMPVRGVPTWN 153

Query: 806  ------------------------RNLCSWNSMIMGLAVHGRWKEGLELFHEMLSE-GII 702
                                    RN+ SW +MI G +   ++ E L LF  M  E G++
Sbjct: 154  AMMAGHARFGDMDVALELFRLMPSRNVVSWTTMISGYSRSKKYGEALGLFLRMEQEKGMM 213

Query: 701  PDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYA 522
            P+ +T   +  A  + G ++ G Q  ++          L     ++++  + G+++ A+ 
Sbjct: 214  PNAVTLASIFPAFANLGALEIG-QRVEAYARKNGFFKNLYVSNAVLEMYAKCGKIDVAWK 272

Query: 521  LIKRMPMTPDSVVWGSLLGACSFYG 447
            +   +    +   W S++   + +G
Sbjct: 273  VFNEIGSLRNLCSWNSMIMGLAVHG 297


>ref|XP_004142577.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like
            [Cucumis sativus]
          Length = 589

 Score =  451 bits (1160), Expect = e-124
 Identities = 214/354 (60%), Positives = 273/354 (77%)
 Frame = -3

Query: 1208 RDIPAWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRM 1029
            RDIP WNS++AG+ARSG +  A ELF  MP RNV+SWTA+ISGY+QNG+Y  A+EM++ +
Sbjct: 146  RDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALEMFIGL 205

Query: 1028 EKESGVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGN 849
            E E G  PNEV+IASVLPAC+ LGAL +G+RIE YAR  GF +N +VSNA+LE++A+CGN
Sbjct: 206  ENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGN 265

Query: 848  IDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLL 669
            I+ A++VFDEIG  RNLCSWN+MIMGLAVHGR  + L+L+ +ML   + PDD+TFVG+LL
Sbjct: 266  IEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLL 325

Query: 668  ACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYALIKRMPMTPDS 489
            ACTHGG+V +G Q F+SME  F + PKLEHYGC+VDLLGRAGEL EAY LI+ MPM PDS
Sbjct: 326  ACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDS 385

Query: 488  VVWGSLLGACSFYGNVXXXXXXXXXXXXXEPRNPGIYVILSNVYALSGSWDGVAKVRKLM 309
            V+WG+LLGACSF+GNV             EP NPG YVILSN+YAL+G W GVA++RK+M
Sbjct: 386  VIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMM 445

Query: 308  KGNQVTKAAGYSFIEVDGFVHKFIVEDRSHTRSFEIYELLDEISNKMKLLGYEP 147
            KG  +TK AGYS+IEV   +H+FIVEDR  +R  + + LL  ++   + + + P
Sbjct: 446  KGGHITKRAGYSYIEVGDGIHEFIVEDRITSRCVQPFLLLLHVTLAPQPVSFPP 499



 Score = 87.8 bits (216), Expect = 8e-15
 Identities = 62/217 (28%), Positives = 106/217 (48%), Gaps = 5/217 (2%)
 Frame = -3

Query: 1157 ELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVL 978
            +L  A  LF+ +P  +V  +   I  +S  G       +Y +M  + G SPN+ +   + 
Sbjct: 31   DLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQ-GCSPNQYSFTFLF 89

Query: 977  PACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNL 798
            PACA+L  +  G+ +  +  K GF  ++F   ALL+MYAK G +  AR++FDE+   R++
Sbjct: 90   PACASLFNVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEM-PVRDI 148

Query: 797  CSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKS 618
             +WNS+I G A  G  +  LELF++M    +    I++  ++      G   +  + F  
Sbjct: 149  PTWNSLIAGYARSGHMEAALELFNKMPVRNV----ISWTALISGYAQNGKYAKALEMFIG 204

Query: 617  MELDFSITPKLEHYGCMVDLLGRAGELN-----EAYA 522
            +E +    P       ++    + G L+     EAYA
Sbjct: 205  LENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYA 241


>emb|CAN66974.1| hypothetical protein VITISV_022076 [Vitis vinifera]
          Length = 967

 Score =  441 bits (1134), Expect = e-121
 Identities = 226/360 (62%), Positives = 265/360 (73%)
 Frame = -3

Query: 1208 RDIPAWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRM 1029
            RD+P WNS++AG+AR G+L  A ELF LMP RNV SWTAMISGY+QNG+Y  A+ M++ M
Sbjct: 631  RDVPTWNSMIAGYARCGDLEGALELFRLMPARNVTSWTAMISGYAQNGQYAKALSMFLMM 690

Query: 1028 EKESGVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGN 849
            E+E+ + PNEVT+ASVLPACANLGAL++GERIE YAR  G+ +NL+VSNALLEMYA+CG 
Sbjct: 691  EEETEMRPNEVTLASVLPACANLGALEVGERIEVYARGNGYFKNLYVSNALLEMYARCGR 750

Query: 848  IDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLL 669
            ID A  VF+EI   R                              EG  PDD+TFVGVLL
Sbjct: 751  IDKAWGVFEEIDGRR------------------------------EGAAPDDVTFVGVLL 780

Query: 668  ACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYALIKRMPMTPDS 489
            ACTHGG+V +G  FF+SME DFSI PKLEHYGCMVDLLGRAGEL EA+ LI RMPM PDS
Sbjct: 781  ACTHGGMVVEGQHFFESMERDFSIAPKLEHYGCMVDLLGRAGELREAHDLILRMPMEPDS 840

Query: 488  VVWGSLLGACSFYGNVXXXXXXXXXXXXXEPRNPGIYVILSNVYALSGSWDGVAKVRKLM 309
            VVWG+LLGACSF+G+V             EP NPG YVILSN+YA +G WDGVA++RKLM
Sbjct: 841  VVWGTLLGACSFHGHVELAEKAAGALFELEPSNPGNYVILSNIYATAGRWDGVARLRKLM 900

Query: 308  KGNQVTKAAGYSFIEVDGFVHKFIVEDRSHTRSFEIYELLDEISNKMKLLGYEPDLDSDI 129
            KG ++TKAAGYSFIE  G +HKFIVEDRSH+RS EIY LLDE+S KMKL G   D DS+I
Sbjct: 901  KGGKITKAAGYSFIEEGGHIHKFIVEDRSHSRSDEIYALLDEVSMKMKLHGNVNDSDSEI 960



 Score = 73.2 bits (178), Expect = 2e-10
 Identities = 56/227 (24%), Positives = 104/227 (45%), Gaps = 45/227 (19%)
 Frame = -3

Query: 986  SVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDE---- 819
            +++ ACA+L + + G  +  +  K GF  ++F   AL++MYAK G + +AR+ FDE    
Sbjct: 572  ALISACASLSSHQQGRMLHTHFVKSGFGCDVFALTALVDMYAKLGLLSLARKQFDEMTVR 631

Query: 818  --------------------------IGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEML 717
                                      +   RN+ SW +MI G A +G++ + L +F  M 
Sbjct: 632  DVPTWNSMIAGYARCGDLEGALELFRLMPARNVTSWTAMISGYAQNGQYAKALSMFLMME 691

Query: 716  SE-GIIPDDITFVGVLLACTHGGLVKQGWQ---------FFKSMELDFSITPKLEHYGCM 567
             E  + P+++T   VL AC + G ++ G +         +FK++ +             +
Sbjct: 692  EETEMRPNEVTLASVLPACANLGALEVGERIEVYARGNGYFKNLYVS----------NAL 741

Query: 566  VDLLGRAGELNEAYALI-----KRMPMTPDSVVWGSLLGACSFYGNV 441
            +++  R G +++A+ +      +R    PD V +  +L AC+  G V
Sbjct: 742  LEMYARCGRIDKAWGVFEEIDGRREGAAPDDVTFVGVLLACTHGGMV 788


>ref|XP_006289662.1| hypothetical protein CARUB_v10003220mg [Capsella rubella]
            gi|482558368|gb|EOA22560.1| hypothetical protein
            CARUB_v10003220mg [Capsella rubella]
          Length = 511

 Score =  435 bits (1118), Expect = e-119
 Identities = 207/349 (59%), Positives = 265/349 (75%), Gaps = 1/349 (0%)
 Frame = -3

Query: 1208 RDIPAWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRM 1029
            RD P WN+++ G+ R G++  A ELF+ MP +NV+SWT +ISG+SQNG Y +A+ M++ M
Sbjct: 146  RDAPVWNTMITGYQRQGDMKAAMELFDSMPCKNVISWTTVISGFSQNGNYSEALTMFLCM 205

Query: 1028 EKESGVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGN 849
            EK+  V PN VT+ SVLPACANLG L++G R+E YAR+ GF  N++V NA LEMY+KCG 
Sbjct: 206  EKDKSVKPNHVTLVSVLPACANLGELEIGRRLESYARENGFFDNIYVCNATLEMYSKCGM 265

Query: 848  IDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLL 669
            ID+A+++F EIG  RNLCSWNSMI  LA HG+  E LEL+ +ML EG  PD +TFVG+LL
Sbjct: 266  IDLAKQLFHEIGNQRNLCSWNSMIGSLATHGKHHEALELYAQMLREGEKPDAVTFVGLLL 325

Query: 668  ACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYALIKRMPMTPDS 489
            AC HGG+V +G + FKSME    I+PKLEHYGCM+DLLGR G+L EAY LI+ MPM PD+
Sbjct: 326  ACVHGGMVVKGHELFKSMEEVHKISPKLEHYGCMIDLLGRVGKLQEAYDLIETMPMKPDA 385

Query: 488  VVWGSLLGACSFYGNVXXXXXXXXXXXXXEPRNPGIYVILSNVYALSGSWDGVAKVRKLM 309
            VVWG+LLGACSF+G+V             EP NPG YVI+SN+YA++  WDGV ++RKLM
Sbjct: 386  VVWGTLLGACSFHGHVEIAEIASEALFKLEPTNPGNYVIMSNIYAVNEKWDGVLRMRKLM 445

Query: 308  KGNQVTKAAGYS-FIEVDGFVHKFIVEDRSHTRSFEIYELLDEISNKMK 165
            K   +TKAAGYS F+EV   VH+F VED+SH RS+EIY++LDEIS +MK
Sbjct: 446  KKETMTKAAGYSYFVEVGVEVHRFTVEDKSHPRSYEIYQVLDEISRRMK 494



 Score = 75.5 bits (184), Expect = 4e-11
 Identities = 57/205 (27%), Positives = 94/205 (45%)
 Frame = -3

Query: 1145 ARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACA 966
            AR+LF+L     +  +  +I  YS +    +++ +Y  +  + G+ PN  T   +  A A
Sbjct: 35   ARKLFDLHRNPCIFLYNKLIQAYSVHHHPHESIVLYNLLSFD-GLRPNHHTFNFIFAASA 93

Query: 965  NLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWN 786
            +  + +    +     K GF  + F   AL+  YAK G +  ARRVFDE+   R+   WN
Sbjct: 94   SFSSARPLRLLHSQFFKSGFESDSFCCTALITAYAKLGELCCARRVFDEMS-NRDAPVWN 152

Query: 785  SMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELD 606
            +MI G    G  K  +ELF  M  + +    I++  V+   +  G   +    F  ME D
Sbjct: 153  TMITGYQRQGDMKAAMELFDSMPCKNV----ISWTTVISGFSQNGNYSEALTMFLCMEKD 208

Query: 605  FSITPKLEHYGCMVDLLGRAGELNE 531
             S+ P   ++  +V +L     L E
Sbjct: 209  KSVKP---NHVTLVSVLPACANLGE 230


>ref|NP_196468.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75171895|sp|Q9FNN7.1|PP371_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At5g08510 gi|9759345|dbj|BAB10000.1| unnamed protein
            product [Arabidopsis thaliana] gi|50897238|gb|AAT85758.1|
            At5g08510 [Arabidopsis thaliana]
            gi|332003930|gb|AED91313.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 511

 Score =  434 bits (1115), Expect = e-119
 Identities = 206/350 (58%), Positives = 264/350 (75%), Gaps = 1/350 (0%)
 Frame = -3

Query: 1208 RDIPAWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRM 1029
            RD+P WN+++ G+ R G++  A ELF+ MP +NV SWT +ISG+SQNG Y +A++M++ M
Sbjct: 146  RDVPVWNAMITGYQRRGDMKAAMELFDSMPRKNVTSWTTVISGFSQNGNYSEALKMFLCM 205

Query: 1028 EKESGVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGN 849
            EK+  V PN +T+ SVLPACANLG L++G R+E YAR+ GF  N++V NA +EMY+KCG 
Sbjct: 206  EKDKSVKPNHITVVSVLPACANLGELEIGRRLEGYARENGFFDNIYVCNATIEMYSKCGM 265

Query: 848  IDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLL 669
            ID+A+R+F+E+G  RNLCSWNSMI  LA HG+  E L LF +ML EG  PD +TFVG+LL
Sbjct: 266  IDVAKRLFEELGNQRNLCSWNSMIGSLATHGKHDEALTLFAQMLREGEKPDAVTFVGLLL 325

Query: 668  ACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYALIKRMPMTPDS 489
            AC HGG+V +G + FKSME    I+PKLEHYGCM+DLLGR G+L EAY LIK MPM PD+
Sbjct: 326  ACVHGGMVVKGQELFKSMEEVHKISPKLEHYGCMIDLLGRVGKLQEAYDLIKTMPMKPDA 385

Query: 488  VVWGSLLGACSFYGNVXXXXXXXXXXXXXEPRNPGIYVILSNVYALSGSWDGVAKVRKLM 309
            VVWG+LLGACSF+GNV             EP NPG  VI+SN+YA +  WDGV ++RKLM
Sbjct: 386  VVWGTLLGACSFHGNVEIAEIASEALFKLEPTNPGNCVIMSNIYAANEKWDGVLRMRKLM 445

Query: 308  KGNQVTKAAGYS-FIEVDGFVHKFIVEDRSHTRSFEIYELLDEISNKMKL 162
            K   +TKAAGYS F+EV   VHKF VED+SH RS+EIY++L+EI  +MKL
Sbjct: 446  KKETMTKAAGYSYFVEVGVDVHKFTVEDKSHPRSYEIYQVLEEIFRRMKL 495



 Score = 65.5 bits (158), Expect = 4e-08
 Identities = 48/188 (25%), Positives = 86/188 (45%)
 Frame = -3

Query: 1154 LAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLP 975
            L  AR+LF+         +  +I  Y  + +  +++ +Y  +  + G+ P+  T   +  
Sbjct: 32   LVYARKLFDHHQNSCTFLYNKLIQAYYVHHQPHESIVLYNLLSFD-GLRPSHHTFNFIFA 90

Query: 974  ACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLC 795
            A A+  + +    +     + GF  + F    L+  YAK G +  ARRVFDE+ + R++ 
Sbjct: 91   ASASFSSARPLRLLHSQFFRSGFESDSFCCTTLITAYAKLGALCCARRVFDEMSK-RDVP 149

Query: 794  SWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSM 615
             WN+MI G    G  K  +ELF  M  + +     ++  V+   +  G   +  + F  M
Sbjct: 150  VWNAMITGYQRRGDMKAAMELFDSMPRKNV----TSWTTVISGFSQNGNYSEALKMFLCM 205

Query: 614  ELDFSITP 591
            E D S+ P
Sbjct: 206  EKDKSVKP 213


>dbj|BAE98759.1| hypothetical protein [Arabidopsis thaliana]
          Length = 504

 Score =  434 bits (1115), Expect = e-119
 Identities = 206/350 (58%), Positives = 264/350 (75%), Gaps = 1/350 (0%)
 Frame = -3

Query: 1208 RDIPAWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRM 1029
            RD+P WN+++ G+ R G++  A ELF+ MP +NV SWT +ISG+SQNG Y +A++M++ M
Sbjct: 139  RDVPVWNAMITGYQRRGDMKAAMELFDSMPRKNVTSWTTVISGFSQNGNYSEALKMFLCM 198

Query: 1028 EKESGVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGN 849
            EK+  V PN +T+ SVLPACANLG L++G R+E YAR+ GF  N++V NA +EMY+KCG 
Sbjct: 199  EKDKSVKPNHITVVSVLPACANLGELEIGRRLEGYARENGFFDNIYVCNATIEMYSKCGM 258

Query: 848  IDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLL 669
            ID+A+R+F+E+G  RNLCSWNSMI  LA HG+  E L LF +ML EG  PD +TFVG+LL
Sbjct: 259  IDVAKRLFEELGNQRNLCSWNSMIGSLATHGKHDEALTLFAQMLREGEKPDAVTFVGLLL 318

Query: 668  ACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYALIKRMPMTPDS 489
            AC HGG+V +G + FKSME    I+PKLEHYGCM+DLLGR G+L EAY LIK MPM PD+
Sbjct: 319  ACVHGGMVVKGQELFKSMEEVHKISPKLEHYGCMIDLLGRVGKLQEAYDLIKTMPMKPDA 378

Query: 488  VVWGSLLGACSFYGNVXXXXXXXXXXXXXEPRNPGIYVILSNVYALSGSWDGVAKVRKLM 309
            VVWG+LLGACSF+GNV             EP NPG  VI+SN+YA +  WDGV ++RKLM
Sbjct: 379  VVWGTLLGACSFHGNVEIAEIASEALFKLEPTNPGNCVIMSNIYAANEKWDGVLRMRKLM 438

Query: 308  KGNQVTKAAGYS-FIEVDGFVHKFIVEDRSHTRSFEIYELLDEISNKMKL 162
            K   +TKAAGYS F+EV   VHKF VED+SH RS+EIY++L+EI  +MKL
Sbjct: 439  KKETMTKAAGYSYFVEVGVDVHKFTVEDKSHPRSYEIYQVLEEIFRRMKL 488



 Score = 65.5 bits (158), Expect = 4e-08
 Identities = 48/188 (25%), Positives = 86/188 (45%)
 Frame = -3

Query: 1154 LAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLP 975
            L  AR+LF+         +  +I  Y  + +  +++ +Y  +  + G+ P+  T   +  
Sbjct: 25   LVYARKLFDHHQNSCTFLYNKLIQAYYVHHQPHESIVLYNLLSFD-GLRPSHHTFNFIFA 83

Query: 974  ACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLC 795
            A A+  + +    +     + GF  + F    L+  YAK G +  ARRVFDE+ + R++ 
Sbjct: 84   ASASFSSARPLRLLHSQFFRSGFESDSFCCTTLITAYAKLGALCCARRVFDEMSK-RDVP 142

Query: 794  SWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSM 615
             WN+MI G    G  K  +ELF  M  + +     ++  V+   +  G   +  + F  M
Sbjct: 143  VWNAMITGYQRRGDMKAAMELFDSMPRKNV----TSWTTVISGFSQNGNYSEALKMFLCM 198

Query: 614  ELDFSITP 591
            E D S+ P
Sbjct: 199  EKDKSVKP 206


>emb|CBI40590.3| unnamed protein product [Vitis vinifera]
          Length = 495

 Score =  429 bits (1104), Expect = e-118
 Identities = 223/360 (61%), Positives = 261/360 (72%)
 Frame = -3

Query: 1208 RDIPAWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRM 1029
            RD+P WNS++AG+AR G+L  A ELF LMP RNV SWTAMISGY+QNG+Y  A+ M++ M
Sbjct: 146  RDVPTWNSMIAGYARCGDLEGALELFRLMPARNVTSWTAMISGYAQNGQYAKALSMFLMM 205

Query: 1028 EKESGVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGN 849
            E+E+ + PNEVT+ASVLPACANLGAL++GERIE YAR  G+ +NL+VSNALLEMYA+CG 
Sbjct: 206  EEETEMRPNEVTLASVLPACANLGALEVGERIEVYARGNGYFKNLYVSNALLEMYARCGR 265

Query: 848  IDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLL 669
            ID A  VF+EI   RNLCSWNSMIMGLAVHGR  E +ELF++ML EG  PDD+TFVGVLL
Sbjct: 266  IDKAWGVFEEIDGRRNLCSWNSMIMGLAVHGRCDEAIELFYKMLREGAAPDDVTFVGVLL 325

Query: 668  ACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYALIKRMPMTPDS 489
            ACTHGG+V +G  FF+SME DFSI PKLEHYGCMVDLLG  G L E              
Sbjct: 326  ACTHGGMVVEGQHFFESMERDFSIAPKLEHYGCMVDLLG-PGALFE-------------- 370

Query: 488  VVWGSLLGACSFYGNVXXXXXXXXXXXXXEPRNPGIYVILSNVYALSGSWDGVAKVRKLM 309
                                         EP NPG YVILSN+YA +G WDGVA++RKLM
Sbjct: 371  ----------------------------LEPSNPGNYVILSNIYATAGRWDGVARLRKLM 402

Query: 308  KGNQVTKAAGYSFIEVDGFVHKFIVEDRSHTRSFEIYELLDEISNKMKLLGYEPDLDSDI 129
            KG ++TKAAGYSFIE  G +HKFIVEDRSH+RS EIY LLDE+S KMKL G   D DS+I
Sbjct: 403  KGGKITKAAGYSFIEEGGHIHKFIVEDRSHSRSDEIYALLDEVSMKMKLHGNVNDSDSEI 462



 Score = 95.9 bits (237), Expect = 3e-17
 Identities = 66/273 (24%), Positives = 126/273 (46%), Gaps = 40/273 (14%)
 Frame = -3

Query: 1145 ARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACA 966
            A +LF+ +P   V  +  +I  YS +G +     +Y +M  + G SPNE +   +  ACA
Sbjct: 35   AHKLFDFIPKPTVFLYNKLIQAYSSHGPHHQCFSLYTQMCLQ-GCSPNEHSFTFLFSACA 93

Query: 965  NLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDE----------- 819
            +L + + G  +  +  K GF  ++F   AL++MYAK G + +AR+ FDE           
Sbjct: 94   SLSSHQQGRMLHTHFVKSGFGCDVFALTALVDMYAKLGLLSLARKQFDEMTVRDVPTWNS 153

Query: 818  -------------------IGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSE-GIIP 699
                               +   RN+ SW +MI G A +G++ + L +F  M  E  + P
Sbjct: 154  MIAGYARCGDLEGALELFRLMPARNVTSWTAMISGYAQNGQYAKALSMFLMMEEETEMRP 213

Query: 698  DDITFVGVLLACTHGGLVKQGWQ---------FFKSMELDFSITPKLEHYGCMVDLLGRA 546
            +++T   VL AC + G ++ G +         +FK++ +             ++++  R 
Sbjct: 214  NEVTLASVLPACANLGALEVGERIEVYARGNGYFKNLYVS----------NALLEMYARC 263

Query: 545  GELNEAYALIKRMPMTPDSVVWGSLLGACSFYG 447
            G +++A+ + + +    +   W S++   + +G
Sbjct: 264  GRIDKAWGVFEEIDGRRNLCSWNSMIMGLAVHG 296


>ref|XP_006399350.1| hypothetical protein EUTSA_v10013320mg [Eutrema salsugineum]
            gi|557100440|gb|ESQ40803.1| hypothetical protein
            EUTSA_v10013320mg [Eutrema salsugineum]
          Length = 502

 Score =  428 bits (1100), Expect = e-117
 Identities = 204/349 (58%), Positives = 258/349 (73%), Gaps = 1/349 (0%)
 Frame = -3

Query: 1208 RDIPAWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRM 1029
            RD+  WN+++  + R G++  A ELF+ MP +NV+SWT +ISG+SQNG Y  A+ M++ M
Sbjct: 136  RDLAVWNAMITVYNRQGDMKAAMELFDSMPCKNVISWTTVISGFSQNGNYSKALSMFLCM 195

Query: 1028 EKESGVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGN 849
            E    V PN +T+ASVLPAC NLGAL +G R+E YAR+ GF  N++VSNA LEMY+KCG 
Sbjct: 196  ESNKTVKPNHITVASVLPACGNLGALDIGRRLEGYARENGFFDNIYVSNATLEMYSKCGM 255

Query: 848  IDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLL 669
            ID+A+R+FDEIG  RNLCSWNSM+ GLA HG+  E LEL+ +ML EG  PD +TFVG+LL
Sbjct: 256  IDVAKRIFDEIGNQRNLCSWNSMVSGLATHGKHDEALELYAQMLREGEKPDAVTFVGLLL 315

Query: 668  ACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYALIKRMPMTPDS 489
            AC HGG+V +G + FKSME    I+PKLEHYGCM+DLLGR G+L EAY LIK MPM PD+
Sbjct: 316  ACVHGGMVVKGKELFKSMEQVHKISPKLEHYGCMIDLLGRVGKLQEAYNLIKTMPMKPDA 375

Query: 488  VVWGSLLGACSFYGNVXXXXXXXXXXXXXEPRNPGIYVILSNVYALSGSWDGVAKVRKLM 309
            VVWG+LLGACSF+GNV             EP NPG YVI+SN+YA +  WDGV ++RK+M
Sbjct: 376  VVWGTLLGACSFHGNVEIAEIASEALFKLEPSNPGNYVIMSNIYAANKKWDGVLRMRKMM 435

Query: 308  KGNQVTKAAGYSFIEVDGF-VHKFIVEDRSHTRSFEIYELLDEISNKMK 165
            K   +TKAAGYS++   G  VH F VED+SH RS EIY +LDEI  ++K
Sbjct: 436  KKETMTKAAGYSYLVETGVEVHNFTVEDKSHPRSSEIYHVLDEIFRRIK 484



 Score = 77.4 bits (189), Expect = 1e-11
 Identities = 58/211 (27%), Positives = 97/211 (45%)
 Frame = -3

Query: 1166 RSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIA 987
            R   LA AR LF+L     +  +  +I  YS + +  ++V ++ R+   +G+ PN  T  
Sbjct: 18   RIPNLAYARRLFDLHRNPCIFLYNKLIQAYSVHDQPHESVVLF-RLLSFNGLRPNHHTFN 76

Query: 986  SVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRG 807
             +  A A++ +++          + GF  + F   AL+  YAK G +  ARRVFDEI   
Sbjct: 77   FIFAASASISSVRTLRMFHSQFFRSGFESDSFCCTALITEYAKLGALRCARRVFDEIS-N 135

Query: 806  RNLCSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQF 627
            R+L  WN+MI      G  K  +ELF  M  + +    I++  V+   +  G   +    
Sbjct: 136  RDLAVWNAMITVYNRQGDMKAAMELFDSMPCKNV----ISWTTVISGFSQNGNYSKALSM 191

Query: 626  FKSMELDFSITPKLEHYGCMVDLLGRAGELN 534
            F  ME + ++ P       ++   G  G L+
Sbjct: 192  FLCMESNKTVKPNHITVASVLPACGNLGALD 222


>ref|XP_002871343.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297317180|gb|EFH47602.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 511

 Score =  428 bits (1100), Expect = e-117
 Identities = 206/349 (59%), Positives = 260/349 (74%), Gaps = 1/349 (0%)
 Frame = -3

Query: 1208 RDIPAWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRM 1029
            RD+P WN+++ G+ R G++  A ELF+ MP +NV SWT +ISG+SQNG Y +A+ M++ M
Sbjct: 146  RDVPVWNAMITGYQRRGDMKAAMELFDSMPNKNVTSWTTVISGFSQNGNYSEALTMFLCM 205

Query: 1028 EKESGVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGN 849
            EK+  V PN +T+ SVLPACANLG L++G R+E YAR+ GF  N++V NA LEMY+KCG 
Sbjct: 206  EKDKSVKPNHITLVSVLPACANLGELEIGRRLEGYARENGFFDNIYVRNATLEMYSKCGM 265

Query: 848  IDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLL 669
            ID+A+R+FDEIG  RNL SWNSMI  LA HG+  E LEL+ +ML EG  PD +TFVG+LL
Sbjct: 266  IDVAKRLFDEIGNQRNLISWNSMIGSLATHGKHDEALELYAQMLQEGERPDAVTFVGLLL 325

Query: 668  ACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYALIKRMPMTPDS 489
            AC HGG+V +G +  KSME    I+PKLEHYGCM+DLLGR G+L EA  LIK MPM PD+
Sbjct: 326  ACVHGGMVLKGKELLKSMEEVHKISPKLEHYGCMIDLLGRVGKLQEACDLIKTMPMKPDA 385

Query: 488  VVWGSLLGACSFYGNVXXXXXXXXXXXXXEPRNPGIYVILSNVYALSGSWDGVAKVRKLM 309
            VVWG+LLGACSF+GNV             EP NPG  VI+SN+YA +  WDGV ++RKLM
Sbjct: 386  VVWGTLLGACSFHGNVEIAEIASEALMKLEPTNPGNCVIMSNIYAANEKWDGVLRMRKLM 445

Query: 308  KGNQVTKAAGYS-FIEVDGFVHKFIVEDRSHTRSFEIYELLDEISNKMK 165
            K   +TKAAGYS F+E    VHKF VED+SH RS+EIY++LDEIS +MK
Sbjct: 446  KKETMTKAAGYSYFVEAGVEVHKFTVEDKSHPRSYEIYQVLDEISRRMK 494



 Score = 74.7 bits (182), Expect = 7e-11
 Identities = 56/208 (26%), Positives = 97/208 (46%)
 Frame = -3

Query: 1154 LAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLP 975
            L  AR+LF+L     +  +  +I  YS + +  +++ +Y  +  + G+ PN  T   +  
Sbjct: 32   LVYARKLFDLHRNPCIFLYNKLIQSYSVHHQPHESIVLYNLLSFD-GIRPNHHTFNFIFA 90

Query: 974  ACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLC 795
            A A+  + +    +     + GF  + F   AL+  YAK G +  ARRVFDE+   R++ 
Sbjct: 91   ASASFSSARPLRLLHSQFFRSGFESDSFCCTALITAYAKLGALCCARRVFDEMS-NRDVP 149

Query: 794  SWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSM 615
             WN+MI G    G  K  +ELF  M ++ +     ++  V+   +  G   +    F  M
Sbjct: 150  VWNAMITGYQRRGDMKAAMELFDSMPNKNV----TSWTTVISGFSQNGNYSEALTMFLCM 205

Query: 614  ELDFSITPKLEHYGCMVDLLGRAGELNE 531
            E D S+ P   ++  +V +L     L E
Sbjct: 206  EKDKSVKP---NHITLVSVLPACANLGE 230


>dbj|BAA90805.1| pentatricopeptide (PPR) repeat-containing protein-like [Oryza sativa
            Japonica Group] gi|125553873|gb|EAY99478.1| hypothetical
            protein OsI_21446 [Oryza sativa Indica Group]
          Length = 510

 Score =  427 bits (1099), Expect = e-117
 Identities = 204/353 (57%), Positives = 270/353 (76%)
 Frame = -3

Query: 1208 RDIPAWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRM 1029
            RD   +N++++ +A+ G +  A +LFE MP RNVVSWTAM+SGY+QNGR+E+AVE ++ M
Sbjct: 151  RDTAVYNALLSAYAKGGLVDSAEKLFEEMPDRNVVSWTAMVSGYAQNGRHEEAVETFLEM 210

Query: 1028 EKESGVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCGN 849
             + +GV PNE+T++SVLPACA +GA+++G ++E YAR KG LRN++V+NALLEMY+KCG+
Sbjct: 211  WERAGVQPNELTVSSVLPACAAVGAMELGRKVEEYARGKGLLRNVYVANALLEMYSKCGS 270

Query: 848  IDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVLL 669
            I  A +VF  IGR ++LCSWNSMIM  AVHG W+E L LF+++   G+ PD ITFVGV+L
Sbjct: 271  IRQAWQVFQGIGRQQDLCSWNSMIMAFAVHGLWREALALFYKLRMAGVKPDGITFVGVIL 330

Query: 668  ACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYALIKRMPMTPDS 489
            ACTHGGLV +G  FF SME +FS+ P++EHYGCMVDLLGRAG L E+Y+LI  MP+ PD+
Sbjct: 331  ACTHGGLVNEGKLFFDSMEAEFSLKPRIEHYGCMVDLLGRAGLLIESYSLIASMPVEPDA 390

Query: 488  VVWGSLLGACSFYGNVXXXXXXXXXXXXXEPRNPGIYVILSNVYALSGSWDGVAKVRKLM 309
            V+WG+LLGACSF+GNV             EP+N    VILSN+YA SG WDGVA+V KL+
Sbjct: 391  VIWGALLGACSFHGNVELAELAMDKLIHLEPQNTANLVILSNIYASSGKWDGVAQVWKLL 450

Query: 308  KGNQVTKAAGYSFIEVDGFVHKFIVEDRSHTRSFEIYELLDEISNKMKLLGYE 150
            K     K+AGYSFIE+DG +HKF+VED+SH R  E+Y  L+ ++  MKL+G E
Sbjct: 451  KEKDHKKSAGYSFIELDGTMHKFLVEDKSHPRFEEVYNTLNSVTMTMKLVGLE 503

