BLASTX nr result

ID: Akebia24_contig00017987 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00017987
         (1142 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003633947.1| PREDICTED: pentatricopeptide repeat-containi...   494   e-137
emb|CBI40590.3| unnamed protein product [Vitis vinifera]              494   e-137
gb|EXB83859.1| hypothetical protein L484_023466 [Morus notabilis]     486   e-135
ref|XP_004230005.1| PREDICTED: pentatricopeptide repeat-containi...   479   e-133
ref|XP_006339746.1| PREDICTED: pentatricopeptide repeat-containi...   476   e-132
ref|XP_002301427.2| hypothetical protein POPTR_0002s17640g [Popu...   476   e-131
ref|XP_004155716.1| PREDICTED: pentatricopeptide repeat-containi...   460   e-127
ref|XP_004142577.1| PREDICTED: pentatricopeptide repeat-containi...   460   e-127
gb|EYU32269.1| hypothetical protein MIMGU_mgv1a026672mg [Mimulus...   459   e-126
ref|XP_004510637.1| PREDICTED: pentatricopeptide repeat-containi...   444   e-122
ref|XP_007135284.1| hypothetical protein PHAVU_010G116300g [Phas...   444   e-122
ref|XP_003548250.1| PREDICTED: pentatricopeptide repeat-containi...   441   e-121
ref|XP_007051479.1| Pentatricopeptide repeat (PPR) superfamily p...   434   e-119
ref|XP_002523296.1| pentatricopeptide repeat-containing protein,...   432   e-118
ref|XP_003627527.1| Pentatricopeptide repeat-containing protein ...   414   e-113
ref|XP_006289662.1| hypothetical protein CARUB_v10003220mg [Caps...   401   e-109
ref|XP_002871343.1| pentatricopeptide repeat-containing protein ...   400   e-109
ref|NP_196468.1| pentatricopeptide repeat-containing protein [Ar...   393   e-107
dbj|BAE98759.1| hypothetical protein [Arabidopsis thaliana]           377   e-102
ref|XP_006399350.1| hypothetical protein EUTSA_v10013320mg [Eutr...   374   e-101

>ref|XP_003633947.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like
            [Vitis vinifera]
          Length = 512

 Score =  494 bits (1271), Expect = e-137
 Identities = 241/361 (66%), Positives = 282/361 (78%)
 Frame = +2

Query: 59   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 238
            MN+LKQI A+ LRNGI+HTK LI+ LL+IP IPYAH LFD IP+PT FLYNKLIQAYSSH
Sbjct: 1    MNRLKQIQAYTLRNGIEHTKQLIVSLLQIPSIPYAHKLFDFIPKPTVFLYNKLIQAYSSH 60

Query: 239  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 418
            GP+HQ  SLY+ M   GC PN H               QQG+ +HTHF+K GF  D FAL
Sbjct: 61   GPHHQCFSLYTQMCLQGCSPNEHSFTFLFSACASLSSHQQGRMLHTHFVKSGFGCDVFAL 120

Query: 419  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 598
            TAL+DMYAK GLL  AR+ F EM  RD+P WNS++AG+AR G+L  A ELF LMP RNV 
Sbjct: 121  TALVDMYAKLGLLSLARKQFDEMTVRDVPTWNSMIAGYARCGDLEGALELFRLMPARNVT 180

Query: 599  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 778
            SWTAMISGY+QNG+Y  A+ M++ ME+E+ + PNEVT+ASVLPACANLGAL++GERIE Y
Sbjct: 181  SWTAMISGYAQNGQYAKALSMFLMMEEETEMRPNEVTLASVLPACANLGALEVGERIEVY 240

Query: 779  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKE 958
            AR  G+ +NL+VSNALLEMYA+CG ID A  VF+EI   RNLCSWNSMIMGLAVHGR  E
Sbjct: 241  ARGNGYFKNLYVSNALLEMYARCGRIDKAWGVFEEIDGRRNLCSWNSMIMGLAVHGRCDE 300

Query: 959  GLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMV 1138
             +ELF++ML EG  PDD+TFVGVLLACTHGG+V +G  FF+SME DFSI PKLEHYGCMV
Sbjct: 301  AIELFYKMLREGAAPDDVTFVGVLLACTHGGMVVEGQHFFESMERDFSIAPKLEHYGCMV 360

Query: 1139 D 1141
            D
Sbjct: 361  D 361


>emb|CBI40590.3| unnamed protein product [Vitis vinifera]
          Length = 495

 Score =  494 bits (1271), Expect = e-137
 Identities = 241/361 (66%), Positives = 282/361 (78%)
 Frame = +2

Query: 59   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 238
            MN+LKQI A+ LRNGI+HTK LI+ LL+IP IPYAH LFD IP+PT FLYNKLIQAYSSH
Sbjct: 1    MNRLKQIQAYTLRNGIEHTKQLIVSLLQIPSIPYAHKLFDFIPKPTVFLYNKLIQAYSSH 60

Query: 239  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 418
            GP+HQ  SLY+ M   GC PN H               QQG+ +HTHF+K GF  D FAL
Sbjct: 61   GPHHQCFSLYTQMCLQGCSPNEHSFTFLFSACASLSSHQQGRMLHTHFVKSGFGCDVFAL 120

Query: 419  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 598
            TAL+DMYAK GLL  AR+ F EM  RD+P WNS++AG+AR G+L  A ELF LMP RNV 
Sbjct: 121  TALVDMYAKLGLLSLARKQFDEMTVRDVPTWNSMIAGYARCGDLEGALELFRLMPARNVT 180

Query: 599  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 778
            SWTAMISGY+QNG+Y  A+ M++ ME+E+ + PNEVT+ASVLPACANLGAL++GERIE Y
Sbjct: 181  SWTAMISGYAQNGQYAKALSMFLMMEEETEMRPNEVTLASVLPACANLGALEVGERIEVY 240

Query: 779  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKE 958
            AR  G+ +NL+VSNALLEMYA+CG ID A  VF+EI   RNLCSWNSMIMGLAVHGR  E
Sbjct: 241  ARGNGYFKNLYVSNALLEMYARCGRIDKAWGVFEEIDGRRNLCSWNSMIMGLAVHGRCDE 300

Query: 959  GLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMV 1138
             +ELF++ML EG  PDD+TFVGVLLACTHGG+V +G  FF+SME DFSI PKLEHYGCMV
Sbjct: 301  AIELFYKMLREGAAPDDVTFVGVLLACTHGGMVVEGQHFFESMERDFSIAPKLEHYGCMV 360

Query: 1139 D 1141
            D
Sbjct: 361  D 361


>gb|EXB83859.1| hypothetical protein L484_023466 [Morus notabilis]
          Length = 513

 Score =  486 bits (1251), Expect = e-135
 Identities = 238/361 (65%), Positives = 279/361 (77%)
 Frame = +2

Query: 59   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 238
            MNQLKQIHAH LRNG+DHT  LI+KLLEIP+I YA  LFDLIPEPT FLYN+LI+AYS H
Sbjct: 4    MNQLKQIHAHTLRNGVDHTSILILKLLEIPNILYARNLFDLIPEPTVFLYNRLIKAYSFH 63

Query: 239  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 418
            G +HQ L LY  M   GC PN H               Q GQ +H+HF+KLG   D FAL
Sbjct: 64   GQHHQCLFLYRRMCLQGCTPNEHSFTLLFSVCSSLSSRQLGQMMHSHFVKLGHVRDIFAL 123

Query: 419  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 598
            TAL+DMYAK G+L  AR+ F E   R  P WNS+++G+ARSG++  A ELF LMP RNVV
Sbjct: 124  TALVDMYAKLGMLDCARKQFDEKRVRGTPTWNSMLSGYARSGDMEGASELFRLMPQRNVV 183

Query: 599  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 778
            SWTAMISGYS+NG+Y  A+ M+++MEKE  V PN +TIASVLPACANLGAL++GER+E Y
Sbjct: 184  SWTAMISGYSKNGQYAKALAMFLQMEKERDVRPNAITIASVLPACANLGALEVGERVEEY 243

Query: 779  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKE 958
            ARK GFL++L+VSNA+LEMYAKCG ID ARRVFDEIGR RNLCSWNSMIMGLAVHGR  E
Sbjct: 244  ARKVGFLKDLYVSNAVLEMYAKCGRIDTARRVFDEIGRRRNLCSWNSMIMGLAVHGRCNE 303

Query: 959  GLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMV 1138
             L+L+ +M +  I PDD+TFVG++LACTHGG+  +G Q FKSME  F ITPKLEHYGCMV
Sbjct: 304  ALDLYEQMTTVRIAPDDVTFVGLILACTHGGMAMKGQQLFKSMEPKFGITPKLEHYGCMV 363

Query: 1139 D 1141
            D
Sbjct: 364  D 364


>ref|XP_004230005.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like
            [Solanum lycopersicum]
          Length = 508

 Score =  479 bits (1233), Expect = e-133
 Identities = 230/361 (63%), Positives = 278/361 (77%)
 Frame = +2

Query: 59   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 238
            MNQLKQIHA+ LRNGID T+FLI K++EIP+IPYAH +FD I +PT FLYNKLIQAYSSH
Sbjct: 1    MNQLKQIHANTLRNGIDFTQFLISKIIEIPNIPYAHKVFDNITKPTVFLYNKLIQAYSSH 60

Query: 239  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 418
            G   Q  SLY  MR  GC PNPH                QGQ  H HF+K GFEFD + L
Sbjct: 61   GFPSQCFSLYIKMRRQGCSPNPHSFTFLFAACSNRSTPIQGQMFHVHFIKWGFEFDIYTL 120

Query: 419  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 598
            TAL+DMYAK  LL SAR++F EM  +D+P WNS++AG+A++G + +A +LF +MP RNV+
Sbjct: 121  TALVDMYAKMSLLPSARKLFDEMEMKDVPIWNSLIAGYAKNGNVVEAFKLFSVMPSRNVI 180

Query: 599  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 778
            SWTAMISGYSQNG+Y +A+ +Y +MEK+  V PNEVTIASVLPACANLGAL++GE IE Y
Sbjct: 181  SWTAMISGYSQNGKYANALAVYKQMEKDRKVKPNEVTIASVLPACANLGALEVGENIEAY 240

Query: 779  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKE 958
            AR  G+ +N+FV NA+LEMY KCG ID A ++F EIGR RNLCSWN+MIMGLAVHG+  E
Sbjct: 241  ARANGYFKNMFVCNAVLEMYTKCGRIDRAMQLFHEIGRRRNLCSWNTMIMGLAVHGKGDE 300

Query: 959  GLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMV 1138
             L+LF++ML EG  PDD+TFVG +LACTHGG+V +GW+  K ME  FSI PKLEHYGCMV
Sbjct: 301  ALKLFNQMLGEGNTPDDVTFVGAILACTHGGMVAKGWELLKLMEQRFSIAPKLEHYGCMV 360

Query: 1139 D 1141
            D
Sbjct: 361  D 361


>ref|XP_006339746.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like
            [Solanum tuberosum]
          Length = 508

 Score =  476 bits (1225), Expect = e-132
 Identities = 228/361 (63%), Positives = 276/361 (76%)
 Frame = +2

Query: 59   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 238
            MNQLKQIH + LRNGID T+FLI KL+EIP+IPYAH +FD I +PT FLYNKLIQAYSSH
Sbjct: 1    MNQLKQIHGNTLRNGIDFTQFLITKLIEIPNIPYAHKVFDSITKPTVFLYNKLIQAYSSH 60

Query: 239  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 418
            G   +  SLY  MR  GC PNPH                QGQ  H HF+K GFEFD + L
Sbjct: 61   GLPSRCFSLYIQMRRQGCSPNPHSFTFLFAACTNSSSPIQGQMFHVHFIKWGFEFDIYTL 120

Query: 419  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 598
            TAL+DMYAK  LL SAR++F EM  +D+P WNS++AG+A++G + +A +LF +MP RNV+
Sbjct: 121  TALVDMYAKMSLLPSARKLFDEMEMKDVPTWNSLIAGYAKNGNVEEAFKLFSVMPSRNVI 180

Query: 599  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 778
            SWTAMISGYSQNG+Y +A+ +Y  MEK+  V PNEVTIASVLPACANLGAL++GE IE Y
Sbjct: 181  SWTAMISGYSQNGKYANALAVYKEMEKDRRVKPNEVTIASVLPACANLGALEVGENIEAY 240

Query: 779  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKE 958
            AR  G+ +N+FV NA+LEMY KCG ID + ++F EIGR RNLCSWN+MIMGLAVHG+  E
Sbjct: 241  ARANGYFKNMFVCNAILEMYTKCGRIDRSMQLFHEIGRRRNLCSWNTMIMGLAVHGKGDE 300

Query: 959  GLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMV 1138
             L+LF++ML EG  PDD+TFVG +LACTHGG+V +GW+  K ME  FSI PKLEHYGCMV
Sbjct: 301  VLKLFNQMLGEGNAPDDVTFVGAILACTHGGMVAKGWELLKLMEQRFSIAPKLEHYGCMV 360

Query: 1139 D 1141
            D
Sbjct: 361  D 361


>ref|XP_002301427.2| hypothetical protein POPTR_0002s17640g [Populus trichocarpa]
            gi|550345235|gb|EEE80700.2| hypothetical protein
            POPTR_0002s17640g [Populus trichocarpa]
          Length = 514

 Score =  476 bits (1224), Expect = e-131
 Identities = 230/361 (63%), Positives = 279/361 (77%)
 Frame = +2

Query: 59   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 238
            M+QL +IHAH L+ GI+++K LI++LL IPDIPYAH +F+  P PT FLYNKLI+AYSS 
Sbjct: 1    MSQLNRIHAHTLKKGIEYSKTLIVELLRIPDIPYAHKVFNQSPYPTVFLYNKLIKAYSSQ 60

Query: 239  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 418
                Q LSLYS M   GCPPN                   G+ IHTHF+K GF+FD +AL
Sbjct: 61   NQPRQCLSLYSQMLLKGCPPNELTFTFLFPACASFYSLLHGKVIHTHFIKSGFDFDVYAL 120

Query: 419  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 598
            TAL++MYAK G+L  ARQVF EM  RDIP WNS++AG++RSG++  A ELF+LMP R+VV
Sbjct: 121  TALVNMYAKLGVLMLARQVFDEMTVRDIPTWNSLIAGYSRSGDMEGALELFKLMPSRSVV 180

Query: 599  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 778
            SWT MISGYSQNG Y  A+EM+++MEK+  V PNEVTIASV  ACA LGAL++GERIE Y
Sbjct: 181  SWTTMISGYSQNGMYTKALEMFLKMEKDKEVRPNEVTIASVFSACAKLGALEVGERIESY 240

Query: 779  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKE 958
            AR  G ++NL+VSN LLEMYA+CG ID AR VF+EIG+ RNLCSWNSM+MGLAVHGR  E
Sbjct: 241  ARDNGLMKNLYVSNTLLEMYARCGKIDAARHVFNEIGKRRNLCSWNSMMMGLAVHGRSNE 300

Query: 959  GLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMV 1138
             L+L+ +ML EGI PDD+TFVG++LACTHGGLV +GWQ F+SME +FSI PKLEHYGCMV
Sbjct: 301  ALQLYDQMLGEGIEPDDVTFVGLILACTHGGLVAKGWQLFQSMETNFSIVPKLEHYGCMV 360

Query: 1139 D 1141
            D
Sbjct: 361  D 361


>ref|XP_004155716.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like
            [Cucumis sativus]
          Length = 512

 Score =  460 bits (1183), Expect = e-127
 Identities = 219/361 (60%), Positives = 274/361 (75%)
 Frame = +2

Query: 59   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 238
            MNQLKQIHA+ LRNG+DHTKFLI KLL++PD+PYA  LFD IP+P+ +LYNK IQ +SS 
Sbjct: 1    MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60

Query: 239  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 418
            G  H+   LY  M   GC PN +                 GQ +H+HF K GF  D FA+
Sbjct: 61   GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 120

Query: 419  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 598
            TAL+DMYAK G+LRSARQ+F EMP RDIP WNS++AG+ARSG +  A ELF  MP RNV+
Sbjct: 121  TALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVI 180

Query: 599  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 778
            SWTA+ISGY+QNG+Y  A+EM++ +E E G  PNEV+IASVLPAC+ LGAL +G+RIE Y
Sbjct: 181  SWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 240

Query: 779  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKE 958
            AR  GF +N +VSNA+LE++A+CGNI+ A++VFDEIG  RNLCSWN+MIMGLAVHGR  +
Sbjct: 241  ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 300

Query: 959  GLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMV 1138
             L+L+ +ML   + PDD+TFVG+LLACTHGG+V +G Q F+SME  F + PKLEHYGC+V
Sbjct: 301  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 360

Query: 1139 D 1141
            D
Sbjct: 361  D 361


>ref|XP_004142577.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like
            [Cucumis sativus]
          Length = 589

 Score =  460 bits (1183), Expect = e-127
 Identities = 219/361 (60%), Positives = 274/361 (75%)
 Frame = +2

Query: 59   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 238
            MNQLKQIHA+ LRNG+DHTKFLI KLL++PD+PYA  LFD IP+P+ +LYNK IQ +SS 
Sbjct: 1    MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60

Query: 239  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 418
            G  H+   LY  M   GC PN +                 GQ +H+HF K GF  D FA+
Sbjct: 61   GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 120

Query: 419  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 598
            TAL+DMYAK G+LRSARQ+F EMP RDIP WNS++AG+ARSG +  A ELF  MP RNV+
Sbjct: 121  TALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVI 180

Query: 599  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 778
            SWTA+ISGY+QNG+Y  A+EM++ +E E G  PNEV+IASVLPAC+ LGAL +G+RIE Y
Sbjct: 181  SWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 240

Query: 779  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKE 958
            AR  GF +N +VSNA+LE++A+CGNI+ A++VFDEIG  RNLCSWN+MIMGLAVHGR  +
Sbjct: 241  ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 300

Query: 959  GLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMV 1138
             L+L+ +ML   + PDD+TFVG+LLACTHGG+V +G Q F+SME  F + PKLEHYGC+V
Sbjct: 301  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 360

Query: 1139 D 1141
            D
Sbjct: 361  D 361


>gb|EYU32269.1| hypothetical protein MIMGU_mgv1a026672mg [Mimulus guttatus]
          Length = 516

 Score =  459 bits (1181), Expect = e-126
 Identities = 217/362 (59%), Positives = 279/362 (77%), Gaps = 1/362 (0%)
 Frame = +2

Query: 59   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 238
            MN LKQIHAH LRNG + T  LI KLLEIP+I YAH L D  P+PT FLY+KLI+AYSSH
Sbjct: 1    MNHLKQIHAHALRNGTNFTNHLITKLLEIPNINYAHKLLDKTPDPTLFLYSKLIKAYSSH 60

Query: 239  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 418
            GP+ Q  SLYS +      PNP+                QGQ +H HF+K G ++D +AL
Sbjct: 61   GPHFQCFSLYSQILHLSFSPNPNCFTFLFSACAKLSNPSQGQMLHAHFIKFGLDYDVYAL 120

Query: 419  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 598
            TAL+DMYAK GLLR +R++F EM ++D P WNS++AG+AR+G++++A  LF  MP RNV+
Sbjct: 121  TALVDMYAKMGLLRFSRKIFDEMNDKDAPTWNSLIAGYARNGDMSEALRLFSNMPSRNVI 180

Query: 599  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 778
            SWTA+ISG+SQNG+Y++A+EMY+ ME++  V PN VT+ASVLPACANLGAL++G+RIE Y
Sbjct: 181  SWTAIISGFSQNGKYKEALEMYLAMERDGKVKPNHVTLASVLPACANLGALEVGQRIEAY 240

Query: 779  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRG-RNLCSWNSMIMGLAVHGRWK 955
            AR  G+ +N FV NA+LE+YA+CG I+ A +VFDEIG G RNLCSWN++IMGLAVHGR  
Sbjct: 241  ARANGYFKNAFVCNAVLELYARCGVIEKAMQVFDEIGSGNRNLCSWNTLIMGLAVHGRCD 300

Query: 956  EGLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCM 1135
              LE+F++ML++G+ PDD+TFVG +LACTHGG+V +G + F SME  FSITPK+EHYGCM
Sbjct: 301  GALEIFNQMLTKGVTPDDVTFVGAILACTHGGMVNKGREIFDSMEKRFSITPKIEHYGCM 360

Query: 1136 VD 1141
            VD
Sbjct: 361  VD 362


>ref|XP_004510637.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like
            [Cicer arietinum]
          Length = 512

 Score =  444 bits (1143), Expect = e-122
 Identities = 220/362 (60%), Positives = 272/362 (75%), Gaps = 1/362 (0%)
 Frame = +2

Query: 59   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSS- 235
            MNQ+KQI  + LRNGID+TK LI KLL+IP++ YA  L      PT FLYNKLIQAYSS 
Sbjct: 1    MNQVKQIQCYTLRNGIDNTKILIEKLLQIPNLHYAQLLLHHSHNPTLFLYNKLIQAYSSK 60

Query: 236  HGPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFA 415
            H  +HQ   LYS M  +G  PN H                 GQ +HTHF+K GF+ D FA
Sbjct: 61   HQNHHQCFFLYSQMLLHGHSPNQHTFNFLFKAGTSVSSISLGQMLHTHFIKSGFKHDVFA 120

Query: 416  LTALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNV 595
             TAL+DMYAK G L+ AR VF EM  R++P WN+++AG+ R G++ +A ELF LMP RNV
Sbjct: 121  STALLDMYAKLGSLKLARHVFDEMSVREVPTWNAMMAGYTRFGDMERALELFGLMPARNV 180

Query: 596  VSWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIER 775
            VSWT ++SGYSQN +YE A+E+++RME E  V PNEVT+ASVLPACANLGAL++G+R+E 
Sbjct: 181  VSWTTVVSGYSQNKQYEKALELFLRMEWEKDVIPNEVTLASVLPACANLGALEIGQRVEA 240

Query: 776  YARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWK 955
            YAR+ G  +NLFVSNA+LEMYAKCG ID+A +VFDE+GR RNLCS+NSMIMGLAVHG+  
Sbjct: 241  YARENGLFKNLFVSNAVLEMYAKCGKIDVAWKVFDEMGRFRNLCSFNSMIMGLAVHGQCD 300

Query: 956  EGLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCM 1135
            + +EL+ +ML EG +PDD+TFVG+LLACTHGG+V+ G   FKSM  DF+I PKLEHYGCM
Sbjct: 301  KAIELYDQMLREGTLPDDVTFVGLLLACTHGGMVETGKHIFKSMTRDFNIIPKLEHYGCM 360

Query: 1136 VD 1141
            VD
Sbjct: 361  VD 362


>ref|XP_007135284.1| hypothetical protein PHAVU_010G116300g [Phaseolus vulgaris]
            gi|561008329|gb|ESW07278.1| hypothetical protein
            PHAVU_010G116300g [Phaseolus vulgaris]
          Length = 510

 Score =  444 bits (1142), Expect = e-122
 Identities = 217/362 (59%), Positives = 272/362 (75%), Gaps = 1/362 (0%)
 Frame = +2

Query: 59   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 238
            M Q+KQIH + LRNGID+TK LI KLLEIP++ YAH +    P+   FLYNKLIQAYSSH
Sbjct: 1    MRQVKQIHGYTLRNGIDNTKILIEKLLEIPNLHYAHMVLHHSPKQNLFLYNKLIQAYSSH 60

Query: 239  GPY-HQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFA 415
              + H+  SLY  MR +G  PN H                 GQ +HTHF+K GFE D FA
Sbjct: 61   PQHQHRCFSLYYQMRLHGFLPNQHTFNFLFSACTSLFSHSLGQMLHTHFIKSGFEPDLFA 120

Query: 416  LTALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNV 595
             TAL+DMY K G L  ARQ+F EMP R +P WN++++G+A+ G++  A ELF LMP RN+
Sbjct: 121  ATALLDMYCKVGTLGLARQLFDEMPVRGVPTWNAMMSGYAKFGDMEGALELFGLMPTRNL 180

Query: 596  VSWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIER 775
            VSWT MISGYS+N ++ +A+ ++++ME+E G+ PNEVT+AS+LPAC+NLGAL++G+R+E 
Sbjct: 181  VSWTTMISGYSRNKQFGEALGLFLKMEQEKGIVPNEVTLASILPACSNLGALEIGQRVEA 240

Query: 776  YARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWK 955
            YARK GF +NL+VSNALLEMYAKCG ID+A RVF+EIGR RNLCSWNSMIMGLAVHG+  
Sbjct: 241  YARKNGFFKNLYVSNALLEMYAKCGKIDVAWRVFNEIGRFRNLCSWNSMIMGLAVHGQCC 300

Query: 956  EGLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCM 1135
            +  EL+ +ML EG  PDD+TFVG+LLACTHGG+V++G   FKSM   F I PKLEHYGCM
Sbjct: 301  KAFELYDQMLGEGTSPDDVTFVGLLLACTHGGMVEKGRHIFKSMTTAFHIIPKLEHYGCM 360

Query: 1136 VD 1141
            VD
Sbjct: 361  VD 362


>ref|XP_003548250.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like
            [Glycine max]
          Length = 512

 Score =  441 bits (1134), Expect = e-121
 Identities = 217/362 (59%), Positives = 269/362 (74%), Gaps = 1/362 (0%)
 Frame = +2

Query: 59   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 238
            M Q+KQIH + LRNGID TK LI KLLEIP++ YAH +    P+PT FLYNKLIQAYSSH
Sbjct: 1    MRQVKQIHGYTLRNGIDQTKILIEKLLEIPNLHYAHKVLHHSPKPTLFLYNKLIQAYSSH 60

Query: 239  GPY-HQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFA 415
              + HQ  SLYS M  +   PN H                 GQ +HTHF+K GFE D FA
Sbjct: 61   PQHQHQCFSLYSQMLLHSFLPNQHTFNFLFSACTSLSSPSLGQMLHTHFIKSGFEPDLFA 120

Query: 416  LTALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNV 595
             TAL+DMY K G L  AR++F +MP R +P WN+++AGHAR G++  A ELF LMP RNV
Sbjct: 121  ATALLDMYTKVGTLELARKLFDQMPVRGVPTWNAMMAGHARFGDMDVALELFRLMPSRNV 180

Query: 596  VSWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIER 775
            VSWT MISGYS++ +Y +A+ +++RME+E G+ PN VT+AS+ PA ANLGAL++G+R+E 
Sbjct: 181  VSWTTMISGYSRSKKYGEALGLFLRMEQEKGMMPNAVTLASIFPAFANLGALEIGQRVEA 240

Query: 776  YARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWK 955
            YARK GF +NL+VSNA+LEMYAKCG ID+A +VF+EIG  RNLCSWNSMIMGLAVHG   
Sbjct: 241  YARKNGFFKNLYVSNAVLEMYAKCGKIDVAWKVFNEIGSLRNLCSWNSMIMGLAVHGECC 300

Query: 956  EGLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCM 1135
            + L+L+ +ML EG  PDD+TFVG+LLACTHGG+V++G   FKSM   F+I PKLEHYGCM
Sbjct: 301  KTLKLYDQMLGEGTSPDDVTFVGLLLACTHGGMVEKGRHIFKSMTTSFNIIPKLEHYGCM 360

Query: 1136 VD 1141
            VD
Sbjct: 361  VD 362


>ref|XP_007051479.1| Pentatricopeptide repeat (PPR) superfamily protein, putative
            [Theobroma cacao] gi|508703740|gb|EOX95636.1|
            Pentatricopeptide repeat (PPR) superfamily protein,
            putative [Theobroma cacao]
          Length = 515

 Score =  434 bits (1115), Expect = e-119
 Identities = 213/363 (58%), Positives = 271/363 (74%), Gaps = 2/363 (0%)
 Frame = +2

Query: 59   MNQLKQIHAHILRNGIDH--TKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYS 232
            MNQLKQ  A+ L+NG++   T+ LII++L+ P+IPYAH LF+LIP+ T FLYNKLIQAYS
Sbjct: 1    MNQLKQSLAYTLKNGMEQNQTQLLIIQILQTPNIPYAHKLFNLIPQKTVFLYNKLIQAYS 60

Query: 233  SHGPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHF 412
            S    H+ L+LYS M  N C PN H                 GQ +HT FLK GF  D +
Sbjct: 61   SINQSHRCLTLYSQMCLNNCSPNEHSFIFLFPACASLPSLLHGQILHTQFLKSGFGLDCY 120

Query: 413  ALTALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRN 592
            ALTAL+ MYAK  +L  AR+VF EM  R++P WN++++G++  G++ +A ELF+ MP +N
Sbjct: 121  ALTALLVMYAKLRMLPLARKVFDEMRVRNLPTWNALISGYSMCGDMKEALELFKSMPEKN 180

Query: 593  VVSWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIE 772
            VVSWT MISGYSQNG+Y  A++M++RMEKE+GV PN VTIASVLPACANLGAL++GERIE
Sbjct: 181  VVSWTTMISGYSQNGQYSKALDMFLRMEKETGVKPNRVTIASVLPACANLGALEVGERIE 240

Query: 773  RYARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRW 952
             YAR+ G   +L+VSN +LEMYA+CG I++A+ VFDEIG+ RNLC WNSMIMGLA+HG+ 
Sbjct: 241  TYARENGLFEDLYVSNTVLEMYARCGKIEVAKLVFDEIGKRRNLCVWNSMIMGLALHGKC 300

Query: 953  KEGLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGC 1132
             E  E + +ML EG  PDD+TFVGVLLACTHG LV +G + F+SM   + I+PKLEHYGC
Sbjct: 301  IEAFEYYDQMLQEGTAPDDVTFVGVLLACTHGRLVVKGRELFESMGKKYHISPKLEHYGC 360

Query: 1133 MVD 1141
            MVD
Sbjct: 361  MVD 363



 Score = 70.9 bits (172), Expect = 1e-09
 Identities = 59/267 (22%), Positives = 110/267 (41%), Gaps = 2/267 (0%)
 Frame = +2

Query: 149 DIPYAHALFDLIPEPTTFLYNKLIQAYSSHGPYHQSLSLYSHM-RFNGCPPNPHXXXXXX 325
           D+  A  LF  +PE     +  +I  YS +G Y ++L ++  M +  G  PN        
Sbjct: 165 DMKEALELFKSMPEKNVVSWTTMISGYSQNGQYSKALDMFLRMEKETGVKPNRVTIASVL 224

Query: 326 XXXXXXXXXQQGQKIHTHFLKLGFEFDHFALTALIDMYAKSGLLRSARQVFYEMPERDIP 505
                    + G++I T+  + G   D +    +++MYA+ G +  A+ VF E+ +R   
Sbjct: 225 PACANLGALEVGERIETYARENGLFEDLYVSNTVLEMYARCGKIEVAKLVFDEIGKR--- 281

Query: 506 AWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVRMEKES 685
                                      RN+  W +MI G + +G+  +A E Y +M +E 
Sbjct: 282 ---------------------------RNLCVWNSMIMGLALHGKCIEAFEYYDQMLQE- 313

Query: 686 GVSPNEVTIASVLPACANLGALKMG-ERIERYARKKGFLRNLFVSNALLEMYAKCGNIDI 862
           G +P++VT   VL AC +   +  G E  E   +K      L     ++++  + G +  
Sbjct: 314 GTAPDDVTFVGVLLACTHGRLVVKGRELFESMGKKYHISPKLEHYGCMVDLLGRSGALQE 373

Query: 863 ARRVFDEIGRGRNLCSWNSMIMGLAVH 943
           A  +   +    +   W +++   + H
Sbjct: 374 AYDLIKSMPMKPDAVVWGALLGACSFH 400


>ref|XP_002523296.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223537384|gb|EEF39012.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 353

 Score =  432 bits (1110), Expect = e-118
 Identities = 208/340 (61%), Positives = 258/340 (75%), Gaps = 1/340 (0%)
 Frame = +2

Query: 59   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 238
            MNQLKQIHA+ LRNGID+ K L  +L++IP++PYAH L DLIP P  FLYNKLIQAYS  
Sbjct: 1    MNQLKQIHAYTLRNGIDYNKTLTERLIQIPNVPYAHKLIDLIPSPNVFLYNKLIQAYSFQ 60

Query: 239  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 418
               HQ  S+YS MR   C  N H                  Q +HTHF K GFE D  AL
Sbjct: 61   NQLHQCFSIYSQMRSRNCTGNQHTFTFLFAACASFFSPLHAQMLHTHFKKSGFESDVIAL 120

Query: 419  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 598
            TAL+DMY K G++  A +VF E+P RDIP WN+++AG++R G++  A ++F+LMP RNVV
Sbjct: 121  TALVDMYCKLGMVAFAHRVFDEIPVRDIPTWNALIAGYSRCGDMEGALKIFKLMPDRNVV 180

Query: 599  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 778
            SWTAMISGYSQNGRY  A+E++++MEKE+G+ PNEVTIAS+LPACANLGAL++G+RIE Y
Sbjct: 181  SWTAMISGYSQNGRYAKALELFLKMEKENGLRPNEVTIASILPACANLGALEVGDRIETY 240

Query: 779  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDE-IGRGRNLCSWNSMIMGLAVHGRWK 955
            AR+ G LRNL+VSNALLEMYA+CG ID+AR+VFD+ IG+ RNLCSWNSMIMGLA+HGR  
Sbjct: 241  ARENGLLRNLYVSNALLEMYARCGKIDMARKVFDKIIGKRRNLCSWNSMIMGLAIHGRSH 300

Query: 956  EGLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQF 1075
            + L L++ ML EGI PDD+TFVG+LLACTHGG++     F
Sbjct: 301  DALHLYNRMLIEGIAPDDVTFVGILLACTHGGMLNSSALF 340


>ref|XP_003627527.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355521549|gb|AET02003.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 550

 Score =  414 bits (1063), Expect = e-113
 Identities = 216/405 (53%), Positives = 269/405 (66%), Gaps = 44/405 (10%)
 Frame = +2

Query: 59   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 238
            MNQ+KQ H + LRN ID+TK LI KLL+IP++ YA  L     +PTTFLYNKLIQA SS 
Sbjct: 1    MNQVKQFHGYTLRNNIDNTKILIEKLLQIPNLNYAQVLLHHSQKPTTFLYNKLIQACSSK 60

Query: 239  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 418
               HQ  +LYS M  +G  PN +                 GQ IHT F+K GF+ D FA 
Sbjct: 61   ---HQCFTLYSQMYLHGHSPNQYTFNFLFTTCTSLSSLSLGQMIHTQFMKSGFKHDVFAS 117

Query: 419  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 598
            TAL+DMYAK G L+ AR VF EM  +++  WN+++AG  R G++ +A ELF LMP RNVV
Sbjct: 118  TALLDMYAKLGCLKFARNVFDEMSVKELATWNAMMAGCTRFGDMERALELFWLMPSRNVV 177

Query: 599  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 778
            SWT M+SGY QN +YE A+ +++RME+E  VSPNEVT+ASVLPACANLGAL++G+R+E Y
Sbjct: 178  SWTTMVSGYLQNKQYEKALGLFMRMEREKDVSPNEVTLASVLPACANLGALEIGQRVEVY 237

Query: 779  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKE 958
            ARK GF +NLFV NA+LEMYAKCG ID+A +VFDEIGR RNLCSWNSMIMGLAVHG+  +
Sbjct: 238  ARKNGFFKNLFVCNAVLEMYAKCGKIDVAWKVFDEIGRFRNLCSWNSMIMGLAVHGQCHK 297

Query: 959  GLELFHEML--------------------------------------------SEGIIPD 1006
             ++L+ +ML                                             EG +PD
Sbjct: 298  AIQLYDQMLVSYSLYLLFISFAFIMIRGGHGLVNHINRTEPNLSVEMVRNNRTREGTLPD 357

Query: 1007 DITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVD 1141
            D+TFVG+LLACTHGG+V++G   F+SM  DF+I PKLEHYGCMVD
Sbjct: 358  DVTFVGLLLACTHGGMVEKGKHVFQSMTRDFNIIPKLEHYGCMVD 402



 Score = 77.4 bits (189), Expect = 1e-11
 Identities = 68/281 (24%), Positives = 123/281 (43%), Gaps = 15/281 (5%)
 Frame = +2

Query: 149  DIPYAHALFDLIPEPTTFLYNKLIQAYSSHGPYHQSLSLYSHM-RFNGCPPNPHXXXXXX 325
            D+  A  LF L+P      +  ++  Y  +  Y ++L L+  M R     PN        
Sbjct: 160  DMERALELFWLMPSRNVVSWTTMVSGYLQNKQYEKALGLFMRMEREKDVSPNEVTLASVL 219

Query: 326  XXXXXXXXXQQGQKIHTHFLKLGFEFDHFALTALIDMYAKSGLLRSARQVFYEMPE-RDI 502
                     + GQ++  +  K GF  + F   A+++MYAK G +  A +VF E+   R++
Sbjct: 220  PACANLGALEIGQRVEVYARKNGFFKNLFVCNAVLEMYAKCGKIDVAWKVFDEIGRFRNL 279

Query: 503  PAWNSIVAGHARSGELAKARELFELMP-----YRNVVSWT-AMISG----YSQNGRYED- 649
             +WNS++ G A  G+  KA +L++ M      Y   +S+   MI G     +   R E  
Sbjct: 280  CSWNSMIMGLAVHGQCHKAIQLYDQMLVSYSLYLLFISFAFIMIRGGHGLVNHINRTEPN 339

Query: 650  -AVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERI-ERYARKKGFLRNLFVSNA 823
             +VEM        G  P++VT   +L AC + G ++ G+ + +   R    +  L     
Sbjct: 340  LSVEMVRNNRTREGTLPDDVTFVGLLLACTHGGMVEKGKHVFQSMTRDFNIIPKLEHYGC 399

Query: 824  LLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHG 946
            ++++  + G +  A  V   +    +   W +++   + HG
Sbjct: 400  MVDLLGRAGRLTEAYEVIKRMPMKPDSVIWGTLLGACSFHG 440


>ref|XP_006289662.1| hypothetical protein CARUB_v10003220mg [Capsella rubella]
            gi|482558368|gb|EOA22560.1| hypothetical protein
            CARUB_v10003220mg [Capsella rubella]
          Length = 511

 Score =  401 bits (1030), Expect = e-109
 Identities = 197/361 (54%), Positives = 251/361 (69%)
 Frame = +2

Query: 59   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 238
            MNQ+KQ+HAH LR G+D TK L+ +LL I +I YA  LFDL   P  FLYNKLIQAYS H
Sbjct: 1    MNQIKQLHAHCLRRGVDETKDLLQRLLLIQNIVYARKLFDLHRNPCIFLYNKLIQAYSVH 60

Query: 239  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 418
               H+S+ LY+ + F+G  PN H               +  + +H+ F K GFE D F  
Sbjct: 61   HHPHESIVLYNLLSFDGLRPNHHTFNFIFAASASFSSARPLRLLHSQFFKSGFESDSFCC 120

Query: 419  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 598
            TALI  YAK G L  AR+VF EM  RD P WN+++ G+ R G++  A ELF+ MP +NV+
Sbjct: 121  TALITAYAKLGELCCARRVFDEMSNRDAPVWNTMITGYQRQGDMKAAMELFDSMPCKNVI 180

Query: 599  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 778
            SWT +ISG+SQNG Y +A+ M++ MEK+  V PN VT+ SVLPACANLG L++G R+E Y
Sbjct: 181  SWTTVISGFSQNGNYSEALTMFLCMEKDKSVKPNHVTLVSVLPACANLGELEIGRRLESY 240

Query: 779  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKE 958
            AR+ GF  N++V NA LEMY+KCG ID+A+++F EIG  RNLCSWNSMI  LA HG+  E
Sbjct: 241  ARENGFFDNIYVCNATLEMYSKCGMIDLAKQLFHEIGNQRNLCSWNSMIGSLATHGKHHE 300

Query: 959  GLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMV 1138
             LEL+ +ML EG  PD +TFVG+LLAC HGG+V +G + FKSME    I+PKLEHYGCM+
Sbjct: 301  ALELYAQMLREGEKPDAVTFVGLLLACVHGGMVVKGHELFKSMEEVHKISPKLEHYGCMI 360

Query: 1139 D 1141
            D
Sbjct: 361  D 361


>ref|XP_002871343.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297317180|gb|EFH47602.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 511

 Score =  400 bits (1027), Expect = e-109
 Identities = 194/361 (53%), Positives = 251/361 (69%)
 Frame = +2

Query: 59   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 238
            MNQ+KQ+HAH LR G+D TK L+ +LL IP++ YA  LFDL   P  FLYNKLIQ+YS H
Sbjct: 1    MNQIKQLHAHCLRTGVDETKDLLQRLLLIPNLVYARKLFDLHRNPCIFLYNKLIQSYSVH 60

Query: 239  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 418
               H+S+ LY+ + F+G  PN H               +  + +H+ F + GFE D F  
Sbjct: 61   HQPHESIVLYNLLSFDGIRPNHHTFNFIFAASASFSSARPLRLLHSQFFRSGFESDSFCC 120

Query: 419  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 598
            TALI  YAK G L  AR+VF EM  RD+P WN+++ G+ R G++  A ELF+ MP +NV 
Sbjct: 121  TALITAYAKLGALCCARRVFDEMSNRDVPVWNAMITGYQRRGDMKAAMELFDSMPNKNVT 180

Query: 599  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 778
            SWT +ISG+SQNG Y +A+ M++ MEK+  V PN +T+ SVLPACANLG L++G R+E Y
Sbjct: 181  SWTTVISGFSQNGNYSEALTMFLCMEKDKSVKPNHITLVSVLPACANLGELEIGRRLEGY 240

Query: 779  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKE 958
            AR+ GF  N++V NA LEMY+KCG ID+A+R+FDEIG  RNL SWNSMI  LA HG+  E
Sbjct: 241  ARENGFFDNIYVRNATLEMYSKCGMIDVAKRLFDEIGNQRNLISWNSMIGSLATHGKHDE 300

Query: 959  GLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMV 1138
             LEL+ +ML EG  PD +TFVG+LLAC HGG+V +G +  KSME    I+PKLEHYGCM+
Sbjct: 301  ALELYAQMLQEGERPDAVTFVGLLLACVHGGMVLKGKELLKSMEEVHKISPKLEHYGCMI 360

Query: 1139 D 1141
            D
Sbjct: 361  D 361


>ref|NP_196468.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75171895|sp|Q9FNN7.1|PP371_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At5g08510 gi|9759345|dbj|BAB10000.1| unnamed protein
            product [Arabidopsis thaliana] gi|50897238|gb|AAT85758.1|
            At5g08510 [Arabidopsis thaliana]
            gi|332003930|gb|AED91313.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 511

 Score =  393 bits (1009), Expect = e-107
 Identities = 189/361 (52%), Positives = 250/361 (69%)
 Frame = +2

Query: 59   MNQLKQIHAHILRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSH 238
            MN +KQ+HAH LR G+D TK L+ +LL IP++ YA  LFD      TFLYNKLIQAY  H
Sbjct: 1    MNGIKQLHAHCLRTGVDETKDLLQRLLLIPNLVYARKLFDHHQNSCTFLYNKLIQAYYVH 60

Query: 239  GPYHQSLSLYSHMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFAL 418
               H+S+ LY+ + F+G  P+ H               +  + +H+ F + GFE D F  
Sbjct: 61   HQPHESIVLYNLLSFDGLRPSHHTFNFIFAASASFSSARPLRLLHSQFFRSGFESDSFCC 120

Query: 419  TALIDMYAKSGLLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVV 598
            T LI  YAK G L  AR+VF EM +RD+P WN+++ G+ R G++  A ELF+ MP +NV 
Sbjct: 121  TTLITAYAKLGALCCARRVFDEMSKRDVPVWNAMITGYQRRGDMKAAMELFDSMPRKNVT 180

Query: 599  SWTAMISGYSQNGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERY 778
            SWT +ISG+SQNG Y +A++M++ MEK+  V PN +T+ SVLPACANLG L++G R+E Y
Sbjct: 181  SWTTVISGFSQNGNYSEALKMFLCMEKDKSVKPNHITVVSVLPACANLGELEIGRRLEGY 240

Query: 779  ARKKGFLRNLFVSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKE 958
            AR+ GF  N++V NA +EMY+KCG ID+A+R+F+E+G  RNLCSWNSMI  LA HG+  E
Sbjct: 241  ARENGFFDNIYVCNATIEMYSKCGMIDVAKRLFEELGNQRNLCSWNSMIGSLATHGKHDE 300

Query: 959  GLELFHEMLSEGIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMV 1138
             L LF +ML EG  PD +TFVG+LLAC HGG+V +G + FKSME    I+PKLEHYGCM+
Sbjct: 301  ALTLFAQMLREGEKPDAVTFVGLLLACVHGGMVVKGQELFKSMEEVHKISPKLEHYGCMI 360

Query: 1139 D 1141
            D
Sbjct: 361  D 361


>dbj|BAE98759.1| hypothetical protein [Arabidopsis thaliana]
          Length = 504

 Score =  377 bits (967), Expect = e-102
 Identities = 182/350 (52%), Positives = 241/350 (68%)
 Frame = +2

Query: 92   LRNGIDHTKFLIIKLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSHGPYHQSLSLYS 271
            LR G+D TK L+ +LL IP++ YA  LFD      TFLYNKLIQAY  H   H+S+ LY+
Sbjct: 5    LRTGVDETKDLLQRLLLIPNLVYARKLFDHHQNSCTFLYNKLIQAYYVHHQPHESIVLYN 64

Query: 272  HMRFNGCPPNPHXXXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFALTALIDMYAKSG 451
             + F+G  P+ H               +  + +H+ F + GFE D F  T LI  YAK G
Sbjct: 65   LLSFDGLRPSHHTFNFIFAASASFSSARPLRLLHSQFFRSGFESDSFCCTTLITAYAKLG 124

Query: 452  LLRSARQVFYEMPERDIPAWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQ 631
             L  AR+VF EM +RD+P WN+++ G+ R G++  A ELF+ MP +NV SWT +ISG+SQ
Sbjct: 125  ALCCARRVFDEMSKRDVPVWNAMITGYQRRGDMKAAMELFDSMPRKNVTSWTTVISGFSQ 184

Query: 632  NGRYEDAVEMYVRMEKESGVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLF 811
            NG Y +A++M++ MEK+  V PN +T+ SVLPACANLG L++G R+E YAR+ GF  N++
Sbjct: 185  NGNYSEALKMFLCMEKDKSVKPNHITVVSVLPACANLGELEIGRRLEGYARENGFFDNIY 244

Query: 812  VSNALLEMYAKCGNIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSE 991
            V NA +EMY+KCG ID+A+R+F+E+G  RNLCSWNSMI  LA HG+  E L LF +ML E
Sbjct: 245  VCNATIEMYSKCGMIDVAKRLFEELGNQRNLCSWNSMIGSLATHGKHDEALTLFAQMLRE 304

Query: 992  GIIPDDITFVGVLLACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVD 1141
            G  PD +TFVG+LLAC HGG+V +G + FKSME    I+PKLEHYGCM+D
Sbjct: 305  GEKPDAVTFVGLLLACVHGGMVVKGQELFKSMEEVHKISPKLEHYGCMID 354


>ref|XP_006399350.1| hypothetical protein EUTSA_v10013320mg [Eutrema salsugineum]
            gi|557100440|gb|ESQ40803.1| hypothetical protein
            EUTSA_v10013320mg [Eutrema salsugineum]
          Length = 502

 Score =  374 bits (960), Expect = e-101
 Identities = 181/337 (53%), Positives = 231/337 (68%)
 Frame = +2

Query: 131  KLLEIPDIPYAHALFDLIPEPTTFLYNKLIQAYSSHGPYHQSLSLYSHMRFNGCPPNPHX 310
            +LL IP++ YA  LFDL   P  FLYNKLIQAYS H   H+S+ L+  + FNG  PN H 
Sbjct: 15   RLLRIPNLAYARRLFDLHRNPCIFLYNKLIQAYSVHDQPHESVVLFRLLSFNGLRPNHHT 74

Query: 311  XXXXXXXXXXXXXXQQGQKIHTHFLKLGFEFDHFALTALIDMYAKSGLLRSARQVFYEMP 490
                          +  +  H+ F + GFE D F  TALI  YAK G LR AR+VF E+ 
Sbjct: 75   FNFIFAASASISSVRTLRMFHSQFFRSGFESDSFCCTALITEYAKLGALRCARRVFDEIS 134

Query: 491  ERDIPAWNSIVAGHARSGELAKARELFELMPYRNVVSWTAMISGYSQNGRYEDAVEMYVR 670
             RD+  WN+++  + R G++  A ELF+ MP +NV+SWT +ISG+SQNG Y  A+ M++ 
Sbjct: 135  NRDLAVWNAMITVYNRQGDMKAAMELFDSMPCKNVISWTTVISGFSQNGNYSKALSMFLC 194

Query: 671  MEKESGVSPNEVTIASVLPACANLGALKMGERIERYARKKGFLRNLFVSNALLEMYAKCG 850
            ME    V PN +T+ASVLPAC NLGAL +G R+E YAR+ GF  N++VSNA LEMY+KCG
Sbjct: 195  MESNKTVKPNHITVASVLPACGNLGALDIGRRLEGYARENGFFDNIYVSNATLEMYSKCG 254

Query: 851  NIDIARRVFDEIGRGRNLCSWNSMIMGLAVHGRWKEGLELFHEMLSEGIIPDDITFVGVL 1030
             ID+A+R+FDEIG  RNLCSWNSM+ GLA HG+  E LEL+ +ML EG  PD +TFVG+L
Sbjct: 255  MIDVAKRIFDEIGNQRNLCSWNSMVSGLATHGKHDEALELYAQMLREGEKPDAVTFVGLL 314

Query: 1031 LACTHGGLVKQGWQFFKSMELDFSITPKLEHYGCMVD 1141
            LAC HGG+V +G + FKSME    I+PKLEHYGCM+D
Sbjct: 315  LACVHGGMVVKGKELFKSMEQVHKISPKLEHYGCMID 351


Top