BLASTX nr result

ID: Akebia25_contig00020084 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00020084
         (1308 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003633947.1| PREDICTED: pentatricopeptide repeat-containi...   558   e-156
gb|EXB83859.1| hypothetical protein L484_023466 [Morus notabilis]     546   e-153
ref|XP_004230005.1| PREDICTED: pentatricopeptide repeat-containi...   543   e-152
ref|XP_002301427.2| hypothetical protein POPTR_0002s17640g [Popu...   541   e-151
ref|XP_006339746.1| PREDICTED: pentatricopeptide repeat-containi...   538   e-150
ref|XP_004155716.1| PREDICTED: pentatricopeptide repeat-containi...   527   e-147
ref|XP_004142577.1| PREDICTED: pentatricopeptide repeat-containi...   527   e-147
gb|EYU32269.1| hypothetical protein MIMGU_mgv1a026672mg [Mimulus...   525   e-146
ref|XP_004510637.1| PREDICTED: pentatricopeptide repeat-containi...   511   e-142
ref|XP_007135284.1| hypothetical protein PHAVU_010G116300g [Phas...   508   e-141
ref|XP_003548250.1| PREDICTED: pentatricopeptide repeat-containi...   505   e-140
emb|CBI40590.3| unnamed protein product [Vitis vinifera]              499   e-139
ref|XP_007051479.1| Pentatricopeptide repeat (PPR) superfamily p...   495   e-137
ref|XP_003627527.1| Pentatricopeptide repeat-containing protein ...   481   e-133
ref|XP_006289662.1| hypothetical protein CARUB_v10003220mg [Caps...   462   e-127
ref|XP_002871343.1| pentatricopeptide repeat-containing protein ...   461   e-127
ref|NP_196468.1| pentatricopeptide repeat-containing protein [Ar...   457   e-126
dbj|BAE98759.1| hypothetical protein [Arabidopsis thaliana]           441   e-121
ref|XP_006399350.1| hypothetical protein EUTSA_v10013320mg [Eutr...   438   e-120
ref|XP_002523296.1| pentatricopeptide repeat-containing protein,...   432   e-118

>ref|XP_003633947.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like
            [Vitis vinifera]
          Length = 512

 Score =  558 bits (1438), Expect = e-156
 Identities = 273/401 (68%), Positives = 318/401 (79%)
 Frame = +3

Query: 57   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 236
            MN+LKQI A+ LRNGI+HTK LI+ LL+IP IPYAH LFD IP+PT FLYNKLIQAYSSH
Sbjct: 1    MNRLKQIQAYTLRNGIEHTKQLIVSLLQIPSIPYAHKLFDFIPKPTVFLYNKLIQAYSSH 60

Query: 237  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 416
            GP+HQ  SLY+ M   GC PN H               QQG+ +HTHF+K GF  D FAL
Sbjct: 61   GPHHQCFSLYTQMCLQGCSPNEHSFTFLFSACASLSSHQQGRMLHTHFVKSGFGCDVFAL 120

Query: 417  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 596
            TAL+DMYAK GLL  AR+ F EM  RD+P WNS++AG+AR G+L  A ELF LMP RNV 
Sbjct: 121  TALVDMYAKLGLLSLARKQFDEMTVRDVPTWNSMIAGYARCGDLEGALELFRLMPARNVT 180

Query: 597  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 776
            SWTAMISGY+QNG+Y  A+ M++ ME+E+ + PNEVT+ASVLPACANLGAL++GERIE Y
Sbjct: 181  SWTAMISGYAQNGQYAKALSMFLMMEEETEMRPNEVTLASVLPACANLGALEVGERIEVY 240

Query: 777  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKE 956
            AR  G+ +NL+VSNALLEMYA+CG ID A  VF+EI   RNLCSWNSMIMGLAVHGR  E
Sbjct: 241  ARGNGYFKNLYVSNALLEMYARCGRIDKAWGVFEEIDGRRNLCSWNSMIMGLAVHGRCDE 300

Query: 957  GLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMV 1136
             +ELF++ML EG  PDD+TFVGVLLACTHGG+V +G  FF+SME DFSI PKLEHYGCMV
Sbjct: 301  AIELFYKMLREGAAPDDVTFVGVLLACTHGGMVVEGQHFFESMERDFSIAPKLEHYGCMV 360

Query: 1137 DLLGRAGELNEAYALIKRMPMTPDSVVWGSLLGACSFYGNV 1259
            DLLGRAGEL EA+ LI RMPM PDSVVWG+LLGACSF+G+V
Sbjct: 361  DLLGRAGELREAHDLILRMPMEPDSVVWGTLLGACSFHGHV 401


>gb|EXB83859.1| hypothetical protein L484_023466 [Morus notabilis]
          Length = 513

 Score =  546 bits (1408), Expect = e-153
 Identities = 267/401 (66%), Positives = 314/401 (78%)
 Frame = +3

Query: 57   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 236
            MNQLKQIHAH LRNG+DHT  LI+KLLEIP+I YA  LFDLIPEPT FLYN+LI+AYS H
Sbjct: 4    MNQLKQIHAHTLRNGVDHTSILILKLLEIPNILYARNLFDLIPEPTVFLYNRLIKAYSFH 63

Query: 237  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 416
            G +HQ L LY  M   GC PN H               Q GQ +H+HF+KLG   D FAL
Sbjct: 64   GQHHQCLFLYRRMCLQGCTPNEHSFTLLFSVCSSLSSRQLGQMMHSHFVKLGHVRDIFAL 123

Query: 417  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 596
            TAL+DMYAK G+L  AR+ F E   R  P WNS+++G+ARSG++  A ELF LMP RNVV
Sbjct: 124  TALVDMYAKLGMLDCARKQFDEKRVRGTPTWNSMLSGYARSGDMEGASELFRLMPQRNVV 183

Query: 597  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 776
            SWTAMISGYS+NG+Y  A+ M+++MEKE  V PN +TIASVLPACANLGAL++GER+E Y
Sbjct: 184  SWTAMISGYSKNGQYAKALAMFLQMEKERDVRPNAITIASVLPACANLGALEVGERVEEY 243

Query: 777  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKE 956
            ARK GFL++L+VSNA+LEMYAKCG ID ARRVFDEIGR RNLCSWNSMIMGLAVHGR  E
Sbjct: 244  ARKVGFLKDLYVSNAVLEMYAKCGRIDTARRVFDEIGRRRNLCSWNSMIMGLAVHGRCNE 303

Query: 957  GLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMV 1136
             L+L+ +M +  I PDD+TFVG++LACTHGG+  +G Q FKSME  F ITPKLEHYGCMV
Sbjct: 304  ALDLYEQMTTVRIAPDDVTFVGLILACTHGGMAMKGQQLFKSMEPKFGITPKLEHYGCMV 363

Query: 1137 DLLGRAGELNEAYALIKRMPMTPDSVVWGSLLGACSFYGNV 1259
            DLLGRAG+L EAY LI+ M M PD+V+WG+LLGACSF+GNV
Sbjct: 364  DLLGRAGKLQEAYDLIQGMSMKPDNVIWGALLGACSFHGNV 404


>ref|XP_004230005.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like
            [Solanum lycopersicum]
          Length = 508

 Score =  543 bits (1400), Expect = e-152
 Identities = 260/401 (64%), Positives = 313/401 (78%)
 Frame = +3

Query: 57   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 236
            MNQLKQIHA+ LRNGID T+FLI K++EIP+IPYAH +FD I +PT FLYNKLIQAYSSH
Sbjct: 1    MNQLKQIHANTLRNGIDFTQFLISKIIEIPNIPYAHKVFDNITKPTVFLYNKLIQAYSSH 60

Query: 237  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 416
            G   Q  SLY  MR  GC PNPH                QGQ  H HF+K GFEFD + L
Sbjct: 61   GFPSQCFSLYIKMRRQGCSPNPHSFTFLFAACSNRSTPIQGQMFHVHFIKWGFEFDIYTL 120

Query: 417  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 596
            TAL+DMYAK  LL SAR++F EM  +D+P WNS++AG+A++G + +A +LF +MP RNV+
Sbjct: 121  TALVDMYAKMSLLPSARKLFDEMEMKDVPIWNSLIAGYAKNGNVVEAFKLFSVMPSRNVI 180

Query: 597  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 776
            SWTAMISGYSQNG+Y +A+ +Y +MEK+  V PNEVTIASVLPACANLGAL++GE IE Y
Sbjct: 181  SWTAMISGYSQNGKYANALAVYKQMEKDRKVKPNEVTIASVLPACANLGALEVGENIEAY 240

Query: 777  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKE 956
            AR  G+ +N+FV NA+LEMY KCG ID A ++F EIGR RNLCSWN+MIMGLAVHG+  E
Sbjct: 241  ARANGYFKNMFVCNAVLEMYTKCGRIDRAMQLFHEIGRRRNLCSWNTMIMGLAVHGKGDE 300

Query: 957  GLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMV 1136
             L+LF++ML EG  PDD+TFVG +LACTHGG+V +GW+  K ME  FSI PKLEHYGCMV
Sbjct: 301  ALKLFNQMLGEGNTPDDVTFVGAILACTHGGMVAKGWELLKLMEQRFSIAPKLEHYGCMV 360

Query: 1137 DLLGRAGELNEAYALIKRMPMTPDSVVWGSLLGACSFYGNV 1259
            DLLGRAG+L EAY LI+ MPM PD V+WG++LGACSFYGNV
Sbjct: 361  DLLGRAGKLQEAYDLIQSMPMRPDCVIWGTILGACSFYGNV 401


>ref|XP_002301427.2| hypothetical protein POPTR_0002s17640g [Populus trichocarpa]
            gi|550345235|gb|EEE80700.2| hypothetical protein
            POPTR_0002s17640g [Populus trichocarpa]
          Length = 514

 Score =  541 bits (1394), Expect = e-151
 Identities = 261/401 (65%), Positives = 314/401 (78%)
 Frame = +3

Query: 57   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 236
            M+QL +IHAH L+ GI+++K LI++LL IPDIPYAH +F+  P PT FLYNKLI+AYSS 
Sbjct: 1    MSQLNRIHAHTLKKGIEYSKTLIVELLRIPDIPYAHKVFNQSPYPTVFLYNKLIKAYSSQ 60

Query: 237  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 416
                Q LSLYS M   GCPPN                   G+ IHTHF+K GF+FD +AL
Sbjct: 61   NQPRQCLSLYSQMLLKGCPPNELTFTFLFPACASFYSLLHGKVIHTHFIKSGFDFDVYAL 120

Query: 417  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 596
            TAL++MYAK G+L  ARQVF EM  RDIP WNS++AG++RSG++  A ELF+LMP R+VV
Sbjct: 121  TALVNMYAKLGVLMLARQVFDEMTVRDIPTWNSLIAGYSRSGDMEGALELFKLMPSRSVV 180

Query: 597  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 776
            SWT MISGYSQNG Y  A+EM+++MEK+  V PNEVTIASV  ACA LGAL++GERIE Y
Sbjct: 181  SWTTMISGYSQNGMYTKALEMFLKMEKDKEVRPNEVTIASVFSACAKLGALEVGERIESY 240

Query: 777  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKE 956
            AR  G ++NL+VSN LLEMYA+CG ID AR VF+EIG+ RNLCSWNSM+MGLAVHGR  E
Sbjct: 241  ARDNGLMKNLYVSNTLLEMYARCGKIDAARHVFNEIGKRRNLCSWNSMMMGLAVHGRSNE 300

Query: 957  GLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMV 1136
             L+L+ +ML EGI PDD+TFVG++LACTHGGLV +GWQ F+SME +FSI PKLEHYGCMV
Sbjct: 301  ALQLYDQMLGEGIEPDDVTFVGLILACTHGGLVAKGWQLFQSMETNFSIVPKLEHYGCMV 360

Query: 1137 DLLGRAGELNEAYALIKRMPMTPDSVVWGSLLGACSFYGNV 1259
            DLLGRAGEL EAY L+K MPM PDSV+WG+LLGACSF+ NV
Sbjct: 361  DLLGRAGELQEAYDLVKSMPMKPDSVIWGTLLGACSFHSNV 401


>ref|XP_006339746.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like
            [Solanum tuberosum]
          Length = 508

 Score =  538 bits (1385), Expect = e-150
 Identities = 257/401 (64%), Positives = 311/401 (77%)
 Frame = +3

Query: 57   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 236
            MNQLKQIH + LRNGID T+FLI KL+EIP+IPYAH +FD I +PT FLYNKLIQAYSSH
Sbjct: 1    MNQLKQIHGNTLRNGIDFTQFLITKLIEIPNIPYAHKVFDSITKPTVFLYNKLIQAYSSH 60

Query: 237  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 416
            G   +  SLY  MR  GC PNPH                QGQ  H HF+K GFEFD + L
Sbjct: 61   GLPSRCFSLYIQMRRQGCSPNPHSFTFLFAACTNSSSPIQGQMFHVHFIKWGFEFDIYTL 120

Query: 417  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 596
            TAL+DMYAK  LL SAR++F EM  +D+P WNS++AG+A++G + +A +LF +MP RNV+
Sbjct: 121  TALVDMYAKMSLLPSARKLFDEMEMKDVPTWNSLIAGYAKNGNVEEAFKLFSVMPSRNVI 180

Query: 597  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 776
            SWTAMISGYSQNG+Y +A+ +Y  MEK+  V PNEVTIASVLPACANLGAL++GE IE Y
Sbjct: 181  SWTAMISGYSQNGKYANALAVYKEMEKDRRVKPNEVTIASVLPACANLGALEVGENIEAY 240

Query: 777  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKE 956
            AR  G+ +N+FV NA+LEMY KCG ID + ++F EIGR RNLCSWN+MIMGLAVHG+  E
Sbjct: 241  ARANGYFKNMFVCNAILEMYTKCGRIDRSMQLFHEIGRRRNLCSWNTMIMGLAVHGKGDE 300

Query: 957  GLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMV 1136
             L+LF++ML EG  PDD+TFVG +LACTHGG+V +GW+  K ME  FSI PKLEHYGCMV
Sbjct: 301  VLKLFNQMLGEGNAPDDVTFVGAILACTHGGMVAKGWELLKLMEQRFSIAPKLEHYGCMV 360

Query: 1137 DLLGRAGELNEAYALIKRMPMTPDSVVWGSLLGACSFYGNV 1259
            DLLGRAG+L EAY LI+ +PM PD V+WG+LLGACSF+GNV
Sbjct: 361  DLLGRAGKLQEAYDLIQSIPMRPDCVIWGTLLGACSFHGNV 401


>ref|XP_004155716.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like
            [Cucumis sativus]
          Length = 512

 Score =  527 bits (1358), Expect = e-147
 Identities = 251/401 (62%), Positives = 310/401 (77%)
 Frame = +3

Query: 57   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 236
            MNQLKQIHA+ LRNG+DHTKFLI KLL++PD+PYA  LFD IP+P+ +LYNK IQ +SS 
Sbjct: 1    MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60

Query: 237  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 416
            G  H+   LY  M   GC PN +                 GQ +H+HF K GF  D FA+
Sbjct: 61   GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 120

Query: 417  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 596
            TAL+DMYAK G+LRSARQ+F EMP RDIP WNS++AG+ARSG +  A ELF  MP RNV+
Sbjct: 121  TALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVI 180

Query: 597  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 776
            SWTA+ISGY+QNG+Y  A+EM++ +E E G  PNEV+IASVLPAC+ LGAL +G+RIE Y
Sbjct: 181  SWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 240

Query: 777  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKE 956
            AR  GF +N +VSNA+LE++A+CGNI+ A++VFDEIG  RNLCSWN+MIMGLAVHGR  +
Sbjct: 241  ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 300

Query: 957  GLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMV 1136
             L+L+ +ML   + PDD+TFVG+LLACTHGG+V +G Q F+SME  F + PKLEHYGC+V
Sbjct: 301  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 360

Query: 1137 DLLGRAGELNEAYALIKRMPMTPDSVVWGSLLGACSFYGNV 1259
            DLLGRAGEL EAY LI+ MPM PDSV+WG+LLGACSF+GNV
Sbjct: 361  DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNV 401


>ref|XP_004142577.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like
            [Cucumis sativus]
          Length = 589

 Score =  527 bits (1358), Expect = e-147
 Identities = 251/401 (62%), Positives = 310/401 (77%)
 Frame = +3

Query: 57   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 236
            MNQLKQIHA+ LRNG+DHTKFLI KLL++PD+PYA  LFD IP+P+ +LYNK IQ +SS 
Sbjct: 1    MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60

Query: 237  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 416
            G  H+   LY  M   GC PN +                 GQ +H+HF K GF  D FA+
Sbjct: 61   GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 120

Query: 417  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 596
            TAL+DMYAK G+LRSARQ+F EMP RDIP WNS++AG+ARSG +  A ELF  MP RNV+
Sbjct: 121  TALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVI 180

Query: 597  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 776
            SWTA+ISGY+QNG+Y  A+EM++ +E E G  PNEV+IASVLPAC+ LGAL +G+RIE Y
Sbjct: 181  SWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 240

Query: 777  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKE 956
            AR  GF +N +VSNA+LE++A+CGNI+ A++VFDEIG  RNLCSWN+MIMGLAVHGR  +
Sbjct: 241  ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 300

Query: 957  GLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMV 1136
             L+L+ +ML   + PDD+TFVG+LLACTHGG+V +G Q F+SME  F + PKLEHYGC+V
Sbjct: 301  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 360

Query: 1137 DLLGRAGELNEAYALIKRMPMTPDSVVWGSLLGACSFYGNV 1259
            DLLGRAGEL EAY LI+ MPM PDSV+WG+LLGACSF+GNV
Sbjct: 361  DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNV 401


>gb|EYU32269.1| hypothetical protein MIMGU_mgv1a026672mg [Mimulus guttatus]
          Length = 516

 Score =  525 bits (1352), Expect = e-146
 Identities = 250/402 (62%), Positives = 314/402 (78%), Gaps = 1/402 (0%)
 Frame = +3

Query: 57   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 236
            MN LKQIHAH LRNG + T  LI KLLEIP+I YAH L D  P+PT FLY+KLI+AYSSH
Sbjct: 1    MNHLKQIHAHALRNGTNFTNHLITKLLEIPNINYAHKLLDKTPDPTLFLYSKLIKAYSSH 60

Query: 237  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 416
            GP+ Q  SLYS +      PNP+                QGQ +H HF+K G ++D +AL
Sbjct: 61   GPHFQCFSLYSQILHLSFSPNPNCFTFLFSACAKLSNPSQGQMLHAHFIKFGLDYDVYAL 120

Query: 417  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 596
            TAL+DMYAK GLLR +R++F EM ++D P WNS++AG+AR+G++++A  LF  MP RNV+
Sbjct: 121  TALVDMYAKMGLLRFSRKIFDEMNDKDAPTWNSLIAGYARNGDMSEALRLFSNMPSRNVI 180

Query: 597  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 776
            SWTA+ISG+SQNG+Y++A+EMY+ ME++  V PN VT+ASVLPACANLGAL++G+RIE Y
Sbjct: 181  SWTAIISGFSQNGKYKEALEMYLAMERDGKVKPNHVTLASVLPACANLGALEVGQRIEAY 240

Query: 777  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRG-RNLCSWNSMIMGLAVHGRWK 953
            AR  G+ +N FV NA+LE+YA+CG I+ A +VFDEIG G RNLCSWN++IMGLAVHGR  
Sbjct: 241  ARANGYFKNAFVCNAVLELYARCGVIEKAMQVFDEIGSGNRNLCSWNTLIMGLAVHGRCD 300

Query: 954  EGLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCM 1133
              LE+F++ML++G+ PDD+TFVG +LACTHGG+V +G + F SME  FSITPK+EHYGCM
Sbjct: 301  GALEIFNQMLTKGVTPDDVTFVGAILACTHGGMVNKGREIFDSMEKRFSITPKIEHYGCM 360

Query: 1134 VDLLGRAGELNEAYALIKRMPMTPDSVVWGSLLGACSFYGNV 1259
            VDLLGRAG L EAY LIK MPM PDSVVWG+LLGACSF+GNV
Sbjct: 361  VDLLGRAGLLQEAYKLIKAMPMKPDSVVWGTLLGACSFHGNV 402


>ref|XP_004510637.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like
            [Cicer arietinum]
          Length = 512

 Score =  511 bits (1316), Expect = e-142
 Identities = 251/402 (62%), Positives = 309/402 (76%), Gaps = 1/402 (0%)
 Frame = +3

Query: 57   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSS- 233
            MNQ+KQI  + LRNGID+TK LI KLL+IP++ YA  L      PT FLYNKLIQAYSS 
Sbjct: 1    MNQVKQIQCYTLRNGIDNTKILIEKLLQIPNLHYAQLLLHHSHNPTLFLYNKLIQAYSSK 60

Query: 234  HGPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFA 413
            H  +HQ   LYS M  +G  PN H                 GQ +HTHF+K GF+ D FA
Sbjct: 61   HQNHHQCFFLYSQMLLHGHSPNQHTFNFLFKAGTSVSSISLGQMLHTHFIKSGFKHDVFA 120

Query: 414  LTALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNV 593
             TAL+DMYAK G L+ AR VF EM  R++P WN+++AG+ R G++ +A ELF LMP RNV
Sbjct: 121  STALLDMYAKLGSLKLARHVFDEMSVREVPTWNAMMAGYTRFGDMERALELFGLMPARNV 180

Query: 594  VSWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIER 773
            VSWT ++SGYSQN +YE A+E+++RME E  V PNEVT+ASVLPACANLGAL++G+R+E 
Sbjct: 181  VSWTTVVSGYSQNKQYEKALELFLRMEWEKDVIPNEVTLASVLPACANLGALEIGQRVEA 240

Query: 774  YARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWK 953
            YAR+ G  +NLFVSNA+LEMYAKCG ID+A +VFDE+GR RNLCS+NSMIMGLAVHG+  
Sbjct: 241  YARENGLFKNLFVSNAVLEMYAKCGKIDVAWKVFDEMGRFRNLCSFNSMIMGLAVHGQCD 300

Query: 954  EGLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCM 1133
            + +EL+ +ML EG +PDD+TFVG+LLACTHGG+V+ G   FKSM  DF+I PKLEHYGCM
Sbjct: 301  KAIELYDQMLREGTLPDDVTFVGLLLACTHGGMVETGKHIFKSMTRDFNIIPKLEHYGCM 360

Query: 1134 VDLLGRAGELNEAYALIKRMPMTPDSVVWGSLLGACSFYGNV 1259
            VDLLGRAG+L+EAY +IK MPM PDSV+WG+LLGACSF+GNV
Sbjct: 361  VDLLGRAGKLSEAYEVIKSMPMVPDSVIWGALLGACSFHGNV 402


>ref|XP_007135284.1| hypothetical protein PHAVU_010G116300g [Phaseolus vulgaris]
            gi|561008329|gb|ESW07278.1| hypothetical protein
            PHAVU_010G116300g [Phaseolus vulgaris]
          Length = 510

 Score =  508 bits (1308), Expect = e-141
 Identities = 247/402 (61%), Positives = 307/402 (76%), Gaps = 1/402 (0%)
 Frame = +3

Query: 57   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 236
            M Q+KQIH + LRNGID+TK LI KLLEIP++ YAH +    P+   FLYNKLIQAYSSH
Sbjct: 1    MRQVKQIHGYTLRNGIDNTKILIEKLLEIPNLHYAHMVLHHSPKQNLFLYNKLIQAYSSH 60

Query: 237  GPY-HQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFA 413
              + H+  SLY  MR +G  PN H                 GQ +HTHF+K GFE D FA
Sbjct: 61   PQHQHRCFSLYYQMRLHGFLPNQHTFNFLFSACTSLFSHSLGQMLHTHFIKSGFEPDLFA 120

Query: 414  LTALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNV 593
             TAL+DMY K G L  ARQ+F EMP R +P WN++++G+A+ G++  A ELF LMP RN+
Sbjct: 121  ATALLDMYCKVGTLGLARQLFDEMPVRGVPTWNAMMSGYAKFGDMEGALELFGLMPTRNL 180

Query: 594  VSWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIER 773
            VSWT MISGYS+N ++ +A+ ++++ME+E G+ PNEVT+AS+LPAC+NLGAL++G+R+E 
Sbjct: 181  VSWTTMISGYSRNKQFGEALGLFLKMEQEKGIVPNEVTLASILPACSNLGALEIGQRVEA 240

Query: 774  YARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWK 953
            YARK GF +NL+VSNALLEMYAKCG ID+A RVF+EIGR RNLCSWNSMIMGLAVHG+  
Sbjct: 241  YARKNGFFKNLYVSNALLEMYAKCGKIDVAWRVFNEIGRFRNLCSWNSMIMGLAVHGQCC 300

Query: 954  EGLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCM 1133
            +  EL+ +ML EG  PDD+TFVG+LLACTHGG+V++G   FKSM   F I PKLEHYGCM
Sbjct: 301  KAFELYDQMLGEGTSPDDVTFVGLLLACTHGGMVEKGRHIFKSMTTAFHIIPKLEHYGCM 360

Query: 1134 VDLLGRAGELNEAYALIKRMPMTPDSVVWGSLLGACSFYGNV 1259
            VDLLGRAG L EAY +I+ MPM PDSV+WG+LLGACSF+GNV
Sbjct: 361  VDLLGRAGHLREAYEVIQSMPMKPDSVIWGALLGACSFHGNV 402


>ref|XP_003548250.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like
            [Glycine max]
          Length = 512

 Score =  505 bits (1301), Expect = e-140
 Identities = 247/402 (61%), Positives = 305/402 (75%), Gaps = 1/402 (0%)
 Frame = +3

Query: 57   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 236
            M Q+KQIH + LRNGID TK LI KLLEIP++ YAH +    P+PT FLYNKLIQAYSSH
Sbjct: 1    MRQVKQIHGYTLRNGIDQTKILIEKLLEIPNLHYAHKVLHHSPKPTLFLYNKLIQAYSSH 60

Query: 237  GPY-HQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFA 413
              + HQ  SLYS M  +   PN H                 GQ +HTHF+K GFE D FA
Sbjct: 61   PQHQHQCFSLYSQMLLHSFLPNQHTFNFLFSACTSLSSPSLGQMLHTHFIKSGFEPDLFA 120

Query: 414  LTALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNV 593
             TAL+DMY K G L  AR++F +MP R +P WN+++AGHAR G++  A ELF LMP RNV
Sbjct: 121  ATALLDMYTKVGTLELARKLFDQMPVRGVPTWNAMMAGHARFGDMDVALELFRLMPSRNV 180

Query: 594  VSWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIER 773
            VSWT MISGYS++ +Y +A+ +++RME+E G+ PN VT+AS+ PA ANLGAL++G+R+E 
Sbjct: 181  VSWTTMISGYSRSKKYGEALGLFLRMEQEKGMMPNAVTLASIFPAFANLGALEIGQRVEA 240

Query: 774  YARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWK 953
            YARK GF +NL+VSNA+LEMYAKCG ID+A +VF+EIG  RNLCSWNSMIMGLAVHG   
Sbjct: 241  YARKNGFFKNLYVSNAVLEMYAKCGKIDVAWKVFNEIGSLRNLCSWNSMIMGLAVHGECC 300

Query: 954  EGLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCM 1133
            + L+L+ +ML EG  PDD+TFVG+LLACTHGG+V++G   FKSM   F+I PKLEHYGCM
Sbjct: 301  KTLKLYDQMLGEGTSPDDVTFVGLLLACTHGGMVEKGRHIFKSMTTSFNIIPKLEHYGCM 360

Query: 1134 VDLLGRAGELNEAYALIKRMPMTPDSVVWGSLLGACSFYGNV 1259
            VDLLGRAG+L EAY +I+RMPM PDSV+WG+LLGACSF+ NV
Sbjct: 361  VDLLGRAGQLREAYEVIQRMPMKPDSVIWGALLGACSFHDNV 402


>emb|CBI40590.3| unnamed protein product [Vitis vinifera]
          Length = 495

 Score =  499 bits (1286), Expect = e-139
 Identities = 249/387 (64%), Positives = 292/387 (75%)
 Frame = +3

Query: 57   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 236
            MN+LKQI A+ LRNGI+HTK LI+ LL+IP IPYAH LFD IP+PT FLYNKLIQAYSSH
Sbjct: 1    MNRLKQIQAYTLRNGIEHTKQLIVSLLQIPSIPYAHKLFDFIPKPTVFLYNKLIQAYSSH 60

Query: 237  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 416
            GP+HQ  SLY+ M   GC PN H               QQG+ +HTHF+K GF  D FAL
Sbjct: 61   GPHHQCFSLYTQMCLQGCSPNEHSFTFLFSACASLSSHQQGRMLHTHFVKSGFGCDVFAL 120

Query: 417  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 596
            TAL+DMYAK GLL  AR+ F EM  RD+P WNS++AG+AR G+L  A ELF LMP RNV 
Sbjct: 121  TALVDMYAKLGLLSLARKQFDEMTVRDVPTWNSMIAGYARCGDLEGALELFRLMPARNVT 180

Query: 597  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 776
            SWTAMISGY+QNG+Y  A+ M++ ME+E+ + PNEVT+ASVLPACANLGAL++GERIE Y
Sbjct: 181  SWTAMISGYAQNGQYAKALSMFLMMEEETEMRPNEVTLASVLPACANLGALEVGERIEVY 240

Query: 777  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKE 956
            AR  G+ +NL+VSNALLEMYA+CG ID A  VF+EI   RNLCSWNSMIMGLAVHGR  E
Sbjct: 241  ARGNGYFKNLYVSNALLEMYARCGRIDKAWGVFEEIDGRRNLCSWNSMIMGLAVHGRCDE 300

Query: 957  GLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMV 1136
             +ELF++ML EG  PDD+TFVGVLLACTHGG+V +G  FF+SME DFSI PKLEHYGCMV
Sbjct: 301  AIELFYKMLREGAAPDDVTFVGVLLACTHGGMVVEGQHFFESMERDFSIAPKLEHYGCMV 360

Query: 1137 DLLGRAGELNEAYALIKRMPMTPDSVV 1217
            DLLG         AL +  P  P + V
Sbjct: 361  DLLGPG-------ALFELEPSNPGNYV 380


>ref|XP_007051479.1| Pentatricopeptide repeat (PPR) superfamily protein, putative
            [Theobroma cacao] gi|508703740|gb|EOX95636.1|
            Pentatricopeptide repeat (PPR) superfamily protein,
            putative [Theobroma cacao]
          Length = 515

 Score =  495 bits (1275), Expect = e-137
 Identities = 243/403 (60%), Positives = 305/403 (75%), Gaps = 2/403 (0%)
 Frame = +3

Query: 57   MNQLKQIHAHILRNGIDH--TKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYS 230
            MNQLKQ  A+ L+NG++   T+ LII++L+ P+IPYAH LF+LIP+ T FLYNKLIQAYS
Sbjct: 1    MNQLKQSLAYTLKNGMEQNQTQLLIIQILQTPNIPYAHKLFNLIPQKTVFLYNKLIQAYS 60

Query: 231  SHGPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHF 410
            S    H+ L+LYS M  N C PN H                 GQ +HT FLK GF  D +
Sbjct: 61   SINQSHRCLTLYSQMCLNNCSPNEHSFIFLFPACASLPSLLHGQILHTQFLKSGFGLDCY 120

Query: 411  ALTALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRN 590
            ALTAL+ MYAK  +L  AR+VF EM  R++P WN++++G++  G++ +A ELF+ MP +N
Sbjct: 121  ALTALLVMYAKLRMLPLARKVFDEMRVRNLPTWNALISGYSMCGDMKEALELFKSMPEKN 180

Query: 591  VVSWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIE 770
            VVSWT MISGYSQNG+Y  A++M++RMEKE+GV PN VTIASVLPACANLGAL++GERIE
Sbjct: 181  VVSWTTMISGYSQNGQYSKALDMFLRMEKETGVKPNRVTIASVLPACANLGALEVGERIE 240

Query: 771  RYARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRW 950
             YAR+ G   +L+VSN +LEMYA+CG I++A+ VFDEIG+ RNLC WNSMIMGLA+HG+ 
Sbjct: 241  TYARENGLFEDLYVSNTVLEMYARCGKIEVAKLVFDEIGKRRNLCVWNSMIMGLALHGKC 300

Query: 951  KEGLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGC 1130
             E  E + +ML EG  PDD+TFVGVLLACTHG LV +G + F+SM   + I+PKLEHYGC
Sbjct: 301  IEAFEYYDQMLQEGTAPDDVTFVGVLLACTHGRLVVKGRELFESMGKKYHISPKLEHYGC 360

Query: 1131 MVDLLGRAGELNEAYALIKRMPMTPDSVVWGSLLGACSFYGNV 1259
            MVDLLGR+G L EAY LIK MPM PD+VVWG+LLGACSF+ NV
Sbjct: 361  MVDLLGRSGALQEAYDLIKSMPMKPDAVVWGALLGACSFHNNV 403


>ref|XP_003627527.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355521549|gb|AET02003.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 550

 Score =  481 bits (1239), Expect = e-133
 Identities = 248/445 (55%), Positives = 305/445 (68%), Gaps = 44/445 (9%)
 Frame = +3

Query: 57   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 236
            MNQ+KQ H + LRN ID+TK LI KLL+IP++ YA  L     +PTTFLYNKLIQA SS 
Sbjct: 1    MNQVKQFHGYTLRNNIDNTKILIEKLLQIPNLNYAQVLLHHSQKPTTFLYNKLIQACSSK 60

Query: 237  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 416
               HQ  +LYS M  +G  PN +                 GQ IHT F+K GF+ D FA 
Sbjct: 61   ---HQCFTLYSQMYLHGHSPNQYTFNFLFTTCTSLSSLSLGQMIHTQFMKSGFKHDVFAS 117

Query: 417  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 596
            TAL+DMYAK G L+ AR VF EM  +++  WN+++AG  R G++ +A ELF LMP RNVV
Sbjct: 118  TALLDMYAKLGCLKFARNVFDEMSVKELATWNAMMAGCTRFGDMERALELFWLMPSRNVV 177

Query: 597  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 776
            SWT M+SGY QN +YE A+ +++RME+E  VSPNEVT+ASVLPACANLGAL++G+R+E Y
Sbjct: 178  SWTTMVSGYLQNKQYEKALGLFMRMEREKDVSPNEVTLASVLPACANLGALEIGQRVEVY 237

Query: 777  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKE 956
            ARK GF +NLFV NA+LEMYAKCG ID+A +VFDEIGR RNLCSWNSMIMGLAVHG+  +
Sbjct: 238  ARKNGFFKNLFVCNAVLEMYAKCGKIDVAWKVFDEIGRFRNLCSWNSMIMGLAVHGQCHK 297

Query: 957  GLELFHEML--------------------------------------------SEGIIPD 1004
             ++L+ +ML                                             EG +PD
Sbjct: 298  AIQLYDQMLVSYSLYLLFISFAFIMIRGGHGLVNHINRTEPNLSVEMVRNNRTREGTLPD 357

Query: 1005 DITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYALI 1184
            D+TFVG+LLACTHGG+V++G   F+SM  DF+I PKLEHYGCMVDLLGRAG L EAY +I
Sbjct: 358  DVTFVGLLLACTHGGMVEKGKHVFQSMTRDFNIIPKLEHYGCMVDLLGRAGRLTEAYEVI 417

Query: 1185 KRMPMTPDSVVWGSLLGACSFYGNV 1259
            KRMPM PDSV+WG+LLGACSF+GNV
Sbjct: 418  KRMPMKPDSVIWGTLLGACSFHGNV 442


>ref|XP_006289662.1| hypothetical protein CARUB_v10003220mg [Capsella rubella]
            gi|482558368|gb|EOA22560.1| hypothetical protein
            CARUB_v10003220mg [Capsella rubella]
          Length = 511

 Score =  462 bits (1188), Expect = e-127
 Identities = 226/401 (56%), Positives = 286/401 (71%)
 Frame = +3

Query: 57   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 236
            MNQ+KQ+HAH LR G+D TK L+ +LL I +I YA  LFDL   P  FLYNKLIQAYS H
Sbjct: 1    MNQIKQLHAHCLRRGVDETKDLLQRLLLIQNIVYARKLFDLHRNPCIFLYNKLIQAYSVH 60

Query: 237  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 416
               H+S+ LY+ + F+G  PN H               +  + +H+ F K GFE D F  
Sbjct: 61   HHPHESIVLYNLLSFDGLRPNHHTFNFIFAASASFSSARPLRLLHSQFFKSGFESDSFCC 120

Query: 417  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 596
            TALI  YAK G L  AR+VF EM  RD P WN+++ G+ R G++  A ELF+ MP +NV+
Sbjct: 121  TALITAYAKLGELCCARRVFDEMSNRDAPVWNTMITGYQRQGDMKAAMELFDSMPCKNVI 180

Query: 597  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 776
            SWT +ISG+SQNG Y +A+ M++ MEK+  V PN VT+ SVLPACANLG L++G R+E Y
Sbjct: 181  SWTTVISGFSQNGNYSEALTMFLCMEKDKSVKPNHVTLVSVLPACANLGELEIGRRLESY 240

Query: 777  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKE 956
            AR+ GF  N++V NA LEMY+KCG ID+A+++F EIG  RNLCSWNSMI  LA HG+  E
Sbjct: 241  ARENGFFDNIYVCNATLEMYSKCGMIDLAKQLFHEIGNQRNLCSWNSMIGSLATHGKHHE 300

Query: 957  GLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMV 1136
             LEL+ +ML EG  PD +TFVG+LLAC HGG+V +G + FKSME    I+PKLEHYGCM+
Sbjct: 301  ALELYAQMLREGEKPDAVTFVGLLLACVHGGMVVKGHELFKSMEEVHKISPKLEHYGCMI 360

Query: 1137 DLLGRAGELNEAYALIKRMPMTPDSVVWGSLLGACSFYGNV 1259
            DLLGR G+L EAY LI+ MPM PD+VVWG+LLGACSF+G+V
Sbjct: 361  DLLGRVGKLQEAYDLIETMPMKPDAVVWGTLLGACSFHGHV 401


>ref|XP_002871343.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297317180|gb|EFH47602.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 511

 Score =  461 bits (1185), Expect = e-127
 Identities = 224/401 (55%), Positives = 285/401 (71%)
 Frame = +3

Query: 57   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 236
            MNQ+KQ+HAH LR G+D TK L+ +LL IP++ YA  LFDL   P  FLYNKLIQ+YS H
Sbjct: 1    MNQIKQLHAHCLRTGVDETKDLLQRLLLIPNLVYARKLFDLHRNPCIFLYNKLIQSYSVH 60

Query: 237  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 416
               H+S+ LY+ + F+G  PN H               +  + +H+ F + GFE D F  
Sbjct: 61   HQPHESIVLYNLLSFDGIRPNHHTFNFIFAASASFSSARPLRLLHSQFFRSGFESDSFCC 120

Query: 417  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 596
            TALI  YAK G L  AR+VF EM  RD+P WN+++ G+ R G++  A ELF+ MP +NV 
Sbjct: 121  TALITAYAKLGALCCARRVFDEMSNRDVPVWNAMITGYQRRGDMKAAMELFDSMPNKNVT 180

Query: 597  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 776
            SWT +ISG+SQNG Y +A+ M++ MEK+  V PN +T+ SVLPACANLG L++G R+E Y
Sbjct: 181  SWTTVISGFSQNGNYSEALTMFLCMEKDKSVKPNHITLVSVLPACANLGELEIGRRLEGY 240

Query: 777  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKE 956
            AR+ GF  N++V NA LEMY+KCG ID+A+R+FDEIG  RNL SWNSMI  LA HG+  E
Sbjct: 241  ARENGFFDNIYVRNATLEMYSKCGMIDVAKRLFDEIGNQRNLISWNSMIGSLATHGKHDE 300

Query: 957  GLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMV 1136
             LEL+ +ML EG  PD +TFVG+LLAC HGG+V +G +  KSME    I+PKLEHYGCM+
Sbjct: 301  ALELYAQMLQEGERPDAVTFVGLLLACVHGGMVLKGKELLKSMEEVHKISPKLEHYGCMI 360

Query: 1137 DLLGRAGELNEAYALIKRMPMTPDSVVWGSLLGACSFYGNV 1259
            DLLGR G+L EA  LIK MPM PD+VVWG+LLGACSF+GNV
Sbjct: 361  DLLGRVGKLQEACDLIKTMPMKPDAVVWGTLLGACSFHGNV 401


>ref|NP_196468.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75171895|sp|Q9FNN7.1|PP371_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At5g08510 gi|9759345|dbj|BAB10000.1| unnamed protein
            product [Arabidopsis thaliana] gi|50897238|gb|AAT85758.1|
            At5g08510 [Arabidopsis thaliana]
            gi|332003930|gb|AED91313.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 511

 Score =  457 bits (1176), Expect = e-126
 Identities = 220/401 (54%), Positives = 285/401 (71%)
 Frame = +3

Query: 57   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 236
            MN +KQ+HAH LR G+D TK L+ +LL IP++ YA  LFD      TFLYNKLIQAY  H
Sbjct: 1    MNGIKQLHAHCLRTGVDETKDLLQRLLLIPNLVYARKLFDHHQNSCTFLYNKLIQAYYVH 60

Query: 237  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 416
               H+S+ LY+ + F+G  P+ H               +  + +H+ F + GFE D F  
Sbjct: 61   HQPHESIVLYNLLSFDGLRPSHHTFNFIFAASASFSSARPLRLLHSQFFRSGFESDSFCC 120

Query: 417  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 596
            T LI  YAK G L  AR+VF EM +RD+P WN+++ G+ R G++  A ELF+ MP +NV 
Sbjct: 121  TTLITAYAKLGALCCARRVFDEMSKRDVPVWNAMITGYQRRGDMKAAMELFDSMPRKNVT 180

Query: 597  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 776
            SWT +ISG+SQNG Y +A++M++ MEK+  V PN +T+ SVLPACANLG L++G R+E Y
Sbjct: 181  SWTTVISGFSQNGNYSEALKMFLCMEKDKSVKPNHITVVSVLPACANLGELEIGRRLEGY 240

Query: 777  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKE 956
            AR+ GF  N++V NA +EMY+KCG ID+A+R+F+E+G  RNLCSWNSMI  LA HG+  E
Sbjct: 241  ARENGFFDNIYVCNATIEMYSKCGMIDVAKRLFEELGNQRNLCSWNSMIGSLATHGKHDE 300

Query: 957  GLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMV 1136
             L LF +ML EG  PD +TFVG+LLAC HGG+V +G + FKSME    I+PKLEHYGCM+
Sbjct: 301  ALTLFAQMLREGEKPDAVTFVGLLLACVHGGMVVKGQELFKSMEEVHKISPKLEHYGCMI 360

Query: 1137 DLLGRAGELNEAYALIKRMPMTPDSVVWGSLLGACSFYGNV 1259
            DLLGR G+L EAY LIK MPM PD+VVWG+LLGACSF+GNV
Sbjct: 361  DLLGRVGKLQEAYDLIKTMPMKPDAVVWGTLLGACSFHGNV 401


>dbj|BAE98759.1| hypothetical protein [Arabidopsis thaliana]
          Length = 504

 Score =  441 bits (1134), Expect = e-121
 Identities = 213/390 (54%), Positives = 276/390 (70%)
 Frame = +3

Query: 90   LRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSHGPYHQSLSLYS 269
            LR G+D TK L+ +LL IP++ YA  LFD      TFLYNKLIQAY  H   H+S+ LY+
Sbjct: 5    LRTGVDETKDLLQRLLLIPNLVYARKLFDHHQNSCTFLYNKLIQAYYVHHQPHESIVLYN 64

Query: 270  HMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFALTALIDMYAKSG 449
             + F+G  P+ H               +  + +H+ F + GFE D F  T LI  YAK G
Sbjct: 65   LLSFDGLRPSHHTFNFIFAASASFSSARPLRLLHSQFFRSGFESDSFCCTTLITAYAKLG 124

Query: 450  LLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQ 629
             L  AR+VF EM +RD+P WN+++ G+ R G++  A ELF+ MP +NV SWT +ISG+SQ
Sbjct: 125  ALCCARRVFDEMSKRDVPVWNAMITGYQRRGDMKAAMELFDSMPRKNVTSWTTVISGFSQ 184

Query: 630  NGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLF 809
            NG Y +A++M++ MEK+  V PN +T+ SVLPACANLG L++G R+E YAR+ GF  N++
Sbjct: 185  NGNYSEALKMFLCMEKDKSVKPNHITVVSVLPACANLGELEIGRRLEGYARENGFFDNIY 244

Query: 810  VSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSE 989
            V NA +EMY+KCG ID+A+R+F+E+G  RNLCSWNSMI  LA HG+  E L LF +ML E
Sbjct: 245  VCNATIEMYSKCGMIDVAKRLFEELGNQRNLCSWNSMIGSLATHGKHDEALTLFAQMLRE 304

Query: 990  GIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNE 1169
            G  PD +TFVG+LLAC HGG+V +G + FKSME    I+PKLEHYGCM+DLLGR G+L E
Sbjct: 305  GEKPDAVTFVGLLLACVHGGMVVKGQELFKSMEEVHKISPKLEHYGCMIDLLGRVGKLQE 364

Query: 1170 AYALIKRMPMTPDSVVWGSLLGACSFYGNV 1259
            AY LIK MPM PD+VVWG+LLGACSF+GNV
Sbjct: 365  AYDLIKTMPMKPDAVVWGTLLGACSFHGNV 394


>ref|XP_006399350.1| hypothetical protein EUTSA_v10013320mg [Eutrema salsugineum]
            gi|557100440|gb|ESQ40803.1| hypothetical protein
            EUTSA_v10013320mg [Eutrema salsugineum]
          Length = 502

 Score =  438 bits (1127), Expect = e-120
 Identities = 212/377 (56%), Positives = 266/377 (70%)
 Frame = +3

Query: 129  KLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSHGPYHQSLSLYSHMRFNGCPPNPHX 308
            +LL IP++ YA  LFDL   P  FLYNKLIQAYS H   H+S+ L+  + FNG  PN H 
Sbjct: 15   RLLRIPNLAYARRLFDLHRNPCIFLYNKLIQAYSVHDQPHESVVLFRLLSFNGLRPNHHT 74

Query: 309  XXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFALTALIDMYAKSGLLRSARQVFYEMP 488
                          +  +  H+ F + GFE D F  TALI  YAK G LR AR+VF E+ 
Sbjct: 75   FNFIFAASASISSVRTLRMFHSQFFRSGFESDSFCCTALITEYAKLGALRCARRVFDEIS 134

Query: 489  ERDIPAWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVR 668
             RD+  WN+++  + R G++  A ELF+ MP +NV+SWT +ISG+SQNG Y  A+ M++ 
Sbjct: 135  NRDLAVWNAMITVYNRQGDMKAAMELFDSMPCKNVISWTTVISGFSQNGNYSKALSMFLC 194

Query: 669  MEKESGVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCG 848
            ME    V PN +T+ASVLPAC NLGAL +G R+E YAR+ GF  N++VSNA LEMY+KCG
Sbjct: 195  MESNKTVKPNHITVASVLPACGNLGALDIGRRLEGYARENGFFDNIYVSNATLEMYSKCG 254

Query: 849  NIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVL 1028
             ID+A+R+FDEIG  RNLCSWNSM+ GLA HG+  E LEL+ +ML EG  PD +TFVG+L
Sbjct: 255  MIDVAKRIFDEIGNQRNLCSWNSMVSGLATHGKHDEALELYAQMLREGEKPDAVTFVGLL 314

Query: 1029 LACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVDLLGRAGELNEAYALIKRMPMTPD 1208
            LAC HGG+V +G + FKSME    I+PKLEHYGCM+DLLGR G+L EAY LIK MPM PD
Sbjct: 315  LACVHGGMVVKGKELFKSMEQVHKISPKLEHYGCMIDLLGRVGKLQEAYNLIKTMPMKPD 374

Query: 1209 SVVWGSLLGACSFYGNV 1259
            +VVWG+LLGACSF+GNV
Sbjct: 375  AVVWGTLLGACSFHGNV 391


>ref|XP_002523296.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223537384|gb|EEF39012.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 353

 Score =  432 bits (1110), Expect = e-118
 Identities = 208/340 (61%), Positives = 258/340 (75%), Gaps = 1/340 (0%)
 Frame = +3

Query: 57   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 236
            MNQLKQIHA+ LRNGID+ K L  +L++IP++PYAH L DLIP P  FLYNKLIQAYS  
Sbjct: 1    MNQLKQIHAYTLRNGIDYNKTLTERLIQIPNVPYAHKLIDLIPSPNVFLYNKLIQAYSFQ 60

Query: 237  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 416
               HQ  S+YS MR   C  N H                  Q +HTHF K GFE D  AL
Sbjct: 61   NQLHQCFSIYSQMRSRNCTGNQHTFTFLFAACASFFSPLHAQMLHTHFKKSGFESDVIAL 120

Query: 417  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 596
            TAL+DMY K G++  A +VF E+P RDIP WN+++AG++R G++  A ++F+LMP RNVV
Sbjct: 121  TALVDMYCKLGMVAFAHRVFDEIPVRDIPTWNALIAGYSRCGDMEGALKIFKLMPDRNVV 180

Query: 597  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 776
            SWTAMISGYSQNGRY  A+E++++MEKE+G+ PNEVTIAS+LPACANLGAL++G+RIE Y
Sbjct: 181  SWTAMISGYSQNGRYAKALELFLKMEKENGLRPNEVTIASILPACANLGALEVGDRIETY 240

Query: 777  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDE-IGRGRNLCSWNSMIMGLAVHGRWK 953
            AR+ G LRNL+VSNALLEMYA+CG ID+AR+VFD+ IG+ RNLCSWNSMIMGLA+HGR  
Sbjct: 241  ARENGLLRNLYVSNALLEMYARCGKIDMARKVFDKIIGKRRNLCSWNSMIMGLAIHGRSH 300

Query: 954  EGLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQF 1073
            + L L++ ML EGI PDD+TFVG+LLACTHGG++     F
Sbjct: 301  DALHLYNRMLIEGIAPDDVTFVGILLACTHGGMLNSSALF 340


Top