BLASTX nr result

ID: Mentha29_contig00010294

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00010294
         (1018 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU33203.1| hypothetical protein MIMGU_mgv1a020021mg, partial...   335   1e-89
ref|XP_004232997.1| PREDICTED: pentatricopeptide repeat-containi...   326   1e-86
emb|CBI30774.3| unnamed protein product [Vitis vinifera]              325   2e-86
ref|XP_007025241.1| Pentatricopeptide repeat superfamily protein...   324   3e-86
ref|XP_004164255.1| PREDICTED: pentatricopeptide repeat-containi...   323   5e-86
ref|XP_007211999.1| hypothetical protein PRUPE_ppa011078mg [Prun...   322   1e-85
ref|XP_006355563.1| PREDICTED: pentatricopeptide repeat-containi...   320   4e-85
ref|XP_004294568.1| PREDICTED: pentatricopeptide repeat-containi...   320   6e-85
ref|XP_006449586.1| hypothetical protein CICLE_v10016169mg [Citr...   317   5e-84
ref|XP_002526919.1| conserved hypothetical protein [Ricinus comm...   313   7e-83
gb|EXB58283.1| hypothetical protein L484_015617 [Morus notabilis]     311   4e-82
ref|XP_004499754.1| PREDICTED: pentatricopeptide repeat-containi...   309   1e-81
ref|XP_006467567.1| PREDICTED: pentatricopeptide repeat-containi...   308   2e-81
ref|XP_003546058.1| PREDICTED: pentatricopeptide repeat-containi...   299   1e-78
gb|ACU23441.1| unknown [Glycine max]                                  298   2e-78
ref|XP_007147960.1| hypothetical protein PHAVU_006G168700g [Phas...   296   1e-77
ref|XP_006848146.1| hypothetical protein AMTR_s00029p00227910 [A...   293   6e-77
ref|XP_002867967.1| hypothetical protein ARALYDRAFT_914772 [Arab...   292   2e-76
ref|NP_001031667.1| pentatricopeptide repeat-containing protein ...   289   1e-75
ref|NP_567571.1| pentatricopeptide repeat-containing protein [Ar...   289   1e-75

>gb|EYU33203.1| hypothetical protein MIMGU_mgv1a020021mg, partial [Mimulus
           guttatus]
          Length = 209

 Score =  335 bits (860), Expect = 1e-89
 Identities = 159/203 (78%), Positives = 177/203 (87%)
 Frame = +2

Query: 167 KLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYSALDEWIAWETEFPLI 346
           K++RKSGKKEHHLWQ+RD AGSGHKA NLVR IC LPNEK AVY ALDEWIAWETEFPLI
Sbjct: 1   KVIRKSGKKEHHLWQKRDSAGSGHKALNLVRTICRLPNEKEAVYGALDEWIAWETEFPLI 60

Query: 347 AAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEMI 526
           AAAKAL ILRKR+ WKRIIQV KWMLSKGQGATMSTYD LLLAFDMD R D+A++LW M+
Sbjct: 61  AAAKALRILRKRNHWKRIIQVGKWMLSKGQGATMSTYDSLLLAFDMDGRLDDAEILWNMV 120

Query: 527 LQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFETLGQ 706
           LQ + RS+ K +FSRMISLYDHHNLP+ V+EVFADMEEL VKPDEDTVRR+ARAFE LGQ
Sbjct: 121 LQTYNRSLPKMIFSRMISLYDHHNLPDKVIEVFADMEELEVKPDEDTVRRVARAFEALGQ 180

Query: 707 EDKHTQFLNKYQRRWKYIHFKGE 775
           ++K    + KYQ +WKYIHFKGE
Sbjct: 181 KEKERLVMKKYQSKWKYIHFKGE 203


>ref|XP_004232997.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
           chloroplastic-like [Solanum lycopersicum]
          Length = 265

 Score =  326 bits (835), Expect = 1e-86
 Identities = 165/255 (64%), Positives = 197/255 (77%)
 Frame = +2

Query: 11  SLLIKASNSSEATHKASGGEKSIQLSKAIASPQHCRGSQKEQPLVAPTAVEKKLVRKSGK 190
           ++L+K  NS+  + K       + +S A+   +H +  Q E  L    A ++K V+K+GK
Sbjct: 13  NILLKGINSTGLSDK-------LNVSSAL---KHSK-KQGELSLTISDAADQKKVQKAGK 61

Query: 191 KEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYSALDEWIAWETEFPLIAAAKALGI 370
            EHHLW++R+ AGSG KA NLVRII GLPNEK +VY ALD+WIAWETEFPLIAAAKAL I
Sbjct: 62  VEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGALDKWIAWETEFPLIAAAKALRI 121

Query: 371 LRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEMILQAHGRSI 550
           LR++  WKR+IQVAKWMLSKGQGATM+TYD LLLAFDMD R DEA+ LW MIL    RS+
Sbjct: 122 LRQQRLWKRVIQVAKWMLSKGQGATMATYDALLLAFDMDNRVDEAETLWNMILHTSTRSV 181

Query: 551 SKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFETLGQEDKHTQFL 730
           SKRLFSRMISLYDHH++P+ +VEVFADMEELGVKPDEDTVRR+ARAF+ LGQED     L
Sbjct: 182 SKRLFSRMISLYDHHHVPDKIVEVFADMEELGVKPDEDTVRRVARAFQMLGQEDNQKLVL 241

Query: 731 NKYQRRWKYIHFKGE 775
            KYQ RWKY+HF GE
Sbjct: 242 KKYQSRWKYVHFNGE 256


>emb|CBI30774.3| unnamed protein product [Vitis vinifera]
          Length = 277

 Score =  325 bits (833), Expect = 2e-86
 Identities = 158/207 (76%), Positives = 176/207 (85%)
 Frame = +2

Query: 155 AVEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYSALDEWIAWETE 334
           AVEK++ +K GKKEHHLW++RD  GSG KA NLVRI+  LPNEK AVY ALD+W AWETE
Sbjct: 58  AVEKEISKKVGKKEHHLWRKRDSIGSGQKALNLVRIVSELPNEKEAVYGALDKWTAWETE 117

Query: 335 FPLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKML 514
           FPLIAAAKAL ILRKR+QWKR+IQVAKWMLSKGQGATM TYD LLLAFDMD R DEA+ L
Sbjct: 118 FPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDWRVDEAESL 177

Query: 515 WEMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFE 694
           W MIL  H RSISK+LFSRMISLYDHH++ + V+EVFADMEELGVKPDEDTVRR+A AF+
Sbjct: 178 WNMILHTHTRSISKQLFSRMISLYDHHDMRDKVIEVFADMEELGVKPDEDTVRRVACAFQ 237

Query: 695 TLGQEDKHTQFLNKYQRRWKYIHFKGE 775
           TLGQEDK    L KYQ +WKYIHF GE
Sbjct: 238 TLGQEDKQKLVLKKYQCKWKYIHFNGE 264


>ref|XP_007025241.1| Pentatricopeptide repeat superfamily protein [Theobroma cacao]
           gi|508780607|gb|EOY27863.1| Pentatricopeptide repeat
           superfamily protein [Theobroma cacao]
          Length = 276

 Score =  324 bits (831), Expect = 3e-86
 Identities = 160/221 (72%), Positives = 178/221 (80%)
 Frame = +2

Query: 113 CRGSQKEQPLVAPTAVEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVA 292
           C     EQ L    AVEKK V+K GK EHHLW++RD AGSG KA NLVRII  LPNEK A
Sbjct: 44  CSQKLGEQSLGISEAVEKKPVKKVGKNEHHLWKKRDSAGSGQKALNLVRIISQLPNEKEA 103

Query: 293 VYSALDEWIAWETEFPLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLL 472
           VY ALD+W AWETEFPLIAAAKAL ILRKRSQW R+IQVAKWMLSKGQGATM TYD LLL
Sbjct: 104 VYGALDKWTAWETEFPLIAAAKALRILRKRSQWLRVIQVAKWMLSKGQGATMGTYDTLLL 163

Query: 473 AFDMDRRPDEAKMLWEMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVK 652
           AFDMD+R DEA+ LW MIL  H RSISKRLFSRMISLYDHHN+ + ++EVFADMEEL V+
Sbjct: 164 AFDMDKRVDEAESLWNMILHIHTRSISKRLFSRMISLYDHHNMQDKIIEVFADMEELCVR 223

Query: 653 PDEDTVRRIARAFETLGQEDKHTQFLNKYQRRWKYIHFKGE 775
           PDE+TVR++ARAF+ LGQEDK    L +Y  +WKYIHF GE
Sbjct: 224 PDENTVRKVARAFQKLGQEDKQKLVLRRYLSKWKYIHFNGE 264


>ref|XP_004164255.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
           chloroplastic-like [Cucumis sativus]
          Length = 270

 Score =  323 bits (829), Expect = 5e-86
 Identities = 154/221 (69%), Positives = 180/221 (81%)
 Frame = +2

Query: 113 CRGSQKEQPLVAPTAVEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVA 292
           C  +Q  QPL + T  E+++V+K GK+ HHLW++RD AGSG KA NLVRI+   PNEK A
Sbjct: 36  CVHNQAAQPLTSFTTPERRVVKKVGKETHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEA 95

Query: 293 VYSALDEWIAWETEFPLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLL 472
           VY  L++WIAWETEFPLIAAAKAL ILRKRSQWKR+IQVAKWMLSKGQGATM TYD LLL
Sbjct: 96  VYGELNKWIAWETEFPLIAAAKALRILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLL 155

Query: 473 AFDMDRRPDEAKMLWEMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVK 652
           AFDMD+R DEA+ LW MIL  H RSISKR+FSRMISLY+HH+L + ++E+FADMEELGVK
Sbjct: 156 AFDMDKRVDEAESLWNMILHTHTRSISKRVFSRMISLYEHHDLQDKIIEIFADMEELGVK 215

Query: 653 PDEDTVRRIARAFETLGQEDKHTQFLNKYQRRWKYIHFKGE 775
           PDEDTVRR+ RAF+ LGQED       +Y  +WKYIHFKGE
Sbjct: 216 PDEDTVRRVCRAFQKLGQEDNRKMVYKRYSCQWKYIHFKGE 256


>ref|XP_007211999.1| hypothetical protein PRUPE_ppa011078mg [Prunus persica]
           gi|462407864|gb|EMJ13198.1| hypothetical protein
           PRUPE_ppa011078mg [Prunus persica]
          Length = 224

 Score =  322 bits (826), Expect = 1e-85
 Identities = 154/204 (75%), Positives = 172/204 (84%)
 Frame = +2

Query: 164 KKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYSALDEWIAWETEFPL 343
           +K ++K G+KEHHLWQ+RD AGSG KA NLVRI+ GLPNEK  VY ALD+W AWETEFPL
Sbjct: 4   RKTIKKVGRKEHHLWQKRDSAGSGQKALNLVRIVSGLPNEKETVYGALDKWTAWETEFPL 63

Query: 344 IAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEM 523
           IAA KAL ILRKRSQW R+IQVAKWMLSKGQGATM TYD LLLAFDMD+R DEA+ LW M
Sbjct: 64  IAAVKALRILRKRSQWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDQRVDEAESLWNM 123

Query: 524 ILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFETLG 703
           IL  H RSISKRLFSRMISLYDHH+    ++EVFADMEELGVKPDEDTVRR+ARAF+ LG
Sbjct: 124 ILHTHTRSISKRLFSRMISLYDHHDKQNKIIEVFADMEELGVKPDEDTVRRVARAFKELG 183

Query: 704 QEDKHTQFLNKYQRRWKYIHFKGE 775
           QE+  T  L +YQ +WKYIHFKGE
Sbjct: 184 QEENKTLVLRRYQCKWKYIHFKGE 207


>ref|XP_006355563.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
           chloroplastic-like isoform X1 [Solanum tuberosum]
           gi|565378234|ref|XP_006355564.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g18975,
           chloroplastic-like isoform X2 [Solanum tuberosum]
           gi|565378236|ref|XP_006355565.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g18975,
           chloroplastic-like isoform X3 [Solanum tuberosum]
          Length = 265

 Score =  320 bits (821), Expect = 4e-85
 Identities = 162/240 (67%), Positives = 189/240 (78%), Gaps = 3/240 (1%)
 Frame = +2

Query: 65  GEKSIQLSKAIASPQHCRGSQKEQPL---VAPTAVEKKLVRKSGKKEHHLWQRRDQAGSG 235
           G  S  LS  +      + S+K+  L   ++ TA +KK V+K+GK EHHLW++R+ AGSG
Sbjct: 18  GINSTGLSDKLNVTSALKDSKKQGELSLTISDTADQKK-VQKAGKVEHHLWKKRESAGSG 76

Query: 236 HKAQNLVRIICGLPNEKVAVYSALDEWIAWETEFPLIAAAKALGILRKRSQWKRIIQVAK 415
            KA NLVRII GLPNEK +VY ALD+WIAWE EFPLIAAAKAL ILR++  WKR+IQVAK
Sbjct: 77  QKALNLVRIISGLPNEKESVYGALDKWIAWEAEFPLIAAAKALRILRQQRLWKRVIQVAK 136

Query: 416 WMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEMILQAHGRSISKRLFSRMISLYDHH 595
           WMLSKGQGATM+TYD LLLAFDMD R DEA+ LW MIL    RS+SKRLFSRMISLYDHH
Sbjct: 137 WMLSKGQGATMATYDALLLAFDMDNRVDEAETLWNMILHTSTRSVSKRLFSRMISLYDHH 196

Query: 596 NLPENVVEVFADMEELGVKPDEDTVRRIARAFETLGQEDKHTQFLNKYQRRWKYIHFKGE 775
           ++P+ +VEVFADMEELGVKPDEDTV R+ARAF+ LGQEDK    L KYQ RWKY+HF GE
Sbjct: 197 HVPDKIVEVFADMEELGVKPDEDTVGRVARAFQMLGQEDKQKLVLKKYQSRWKYVHFNGE 256


>ref|XP_004294568.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
           chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 300

 Score =  320 bits (820), Expect = 6e-85
 Identities = 150/221 (67%), Positives = 179/221 (80%)
 Frame = +2

Query: 113 CRGSQKEQPLVAPTAVEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVA 292
           C   Q  Q ++A  A+EKK+++K+G+ EHHLW+++D AGSG KA NL+RI+  LPNEK A
Sbjct: 63  CCQKQSRQTVMASKAMEKKIIKKAGRNEHHLWKKKDSAGSGQKALNLIRIVSDLPNEKEA 122

Query: 293 VYSALDEWIAWETEFPLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLL 472
           ++ ALD+W AWETEFPLIAAAKAL ILR+  QW+R+IQVAKWMLSKGQGATM+TYD LLL
Sbjct: 123 IFGALDKWTAWETEFPLIAAAKALRILRRTCQWRRVIQVAKWMLSKGQGATMATYDTLLL 182

Query: 473 AFDMDRRPDEAKMLWEMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVK 652
           AFDMD R DEA+ LW MIL  H RSISKRLFSRMISLYDHH +   ++EVFADMEEL V+
Sbjct: 183 AFDMDNRLDEAESLWNMILHTHTRSISKRLFSRMISLYDHHEMKTKIIEVFADMEELSVR 242

Query: 653 PDEDTVRRIARAFETLGQEDKHTQFLNKYQRRWKYIHFKGE 775
           PDEDTVRR+ARAF+  GQEDK    L +Y  +WKYIHFKGE
Sbjct: 243 PDEDTVRRVARAFQEFGQEDKSKLVLRRYGCKWKYIHFKGE 283


>ref|XP_006449586.1| hypothetical protein CICLE_v10016169mg [Citrus clementina]
           gi|557552197|gb|ESR62826.1| hypothetical protein
           CICLE_v10016169mg [Citrus clementina]
          Length = 284

 Score =  317 bits (812), Expect = 5e-84
 Identities = 162/240 (67%), Positives = 184/240 (76%), Gaps = 6/240 (2%)
 Frame = +2

Query: 74  SIQLSKAIASPQH--CRGSQKEQPLVAPT----AVEKKLVRKSGKKEHHLWQRRDQAGSG 235
           S+  +K   S  H  C  +Q + P VA +    + + KLV K GKKE HLWQ+RD AGSG
Sbjct: 29  SLLTTKLATSNPHLKCFLNQNKLPPVANSNANASKKNKLVVKVGKKEQHLWQKRDSAGSG 88

Query: 236 HKAQNLVRIICGLPNEKVAVYSALDEWIAWETEFPLIAAAKALGILRKRSQWKRIIQVAK 415
            KA NLVRI+  LPNEK AVY ALD+W AWETEFPLIAAAKAL ILRKR QW R+IQVAK
Sbjct: 89  QKALNLVRIVSELPNEKHAVYGALDKWTAWETEFPLIAAAKALRILRKRGQWLRVIQVAK 148

Query: 416 WMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEMILQAHGRSISKRLFSRMISLYDHH 595
           WMLSKGQGATM TYD LLLAFD D R DEA+ LW MIL  H RSISKRLFSRMISLYDHH
Sbjct: 149 WMLSKGQGATMGTYDTLLLAFDKDHRADEAESLWNMILHTHTRSISKRLFSRMISLYDHH 208

Query: 596 NLPENVVEVFADMEELGVKPDEDTVRRIARAFETLGQEDKHTQFLNKYQRRWKYIHFKGE 775
           ++P  ++EVFADMEELGV+PDEDTVRRIA AF+ +GQ++K    L KY  +WKYIHFKGE
Sbjct: 209 DMPNKIIEVFADMEELGVRPDEDTVRRIASAFQRVGQDEKQKLVLKKYLSKWKYIHFKGE 268


>ref|XP_002526919.1| conserved hypothetical protein [Ricinus communis]
           gi|223533738|gb|EEF35472.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 224

 Score =  313 bits (802), Expect = 7e-83
 Identities = 147/204 (72%), Positives = 169/204 (82%)
 Frame = +2

Query: 164 KKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYSALDEWIAWETEFPL 343
           +K V+K+GK+EHHLW++RD A SG KA +LVRI+C LP+EK  VY ALD+W AWETEFPL
Sbjct: 10  RKPVKKAGKEEHHLWKKRDSARSGEKALSLVRIVCELPDEKECVYGALDKWTAWETEFPL 69

Query: 344 IAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEM 523
           IA AK L ILRK +QW R+IQVAKWMLSKGQG TM TYD LLLAFDMD R DEA  LW M
Sbjct: 70  IAVAKGLRILRKHNQWLRVIQVAKWMLSKGQGTTMGTYDTLLLAFDMDNRVDEAASLWNM 129

Query: 524 ILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFETLG 703
           IL  H RSISKRLFSRMISLYDHHN+P+ ++E+FADMEELGV+PDEDTVRR+ARAF+ LG
Sbjct: 130 ILHTHVRSISKRLFSRMISLYDHHNMPDGIIEIFADMEELGVRPDEDTVRRVARAFKELG 189

Query: 704 QEDKHTQFLNKYQRRWKYIHFKGE 775
           QE+K    L +Y  RWKYIHFKGE
Sbjct: 190 QEEKQKLVLKRYMSRWKYIHFKGE 213


>gb|EXB58283.1| hypothetical protein L484_015617 [Morus notabilis]
          Length = 326

 Score =  311 bits (796), Expect = 4e-82
 Identities = 153/239 (64%), Positives = 182/239 (76%), Gaps = 14/239 (5%)
 Frame = +2

Query: 101 SPQHCRGSQKEQPLVAPTAVEK--------------KLVRKSGKKEHHLWQRRDQAGSGH 238
           S  +C      +PL +  A+EK               LV+K+GKKE+HLW+++D AGSG 
Sbjct: 71  SHHNCSIKGNGEPLTSSKAIEKLQRLCIEFLYMEFRNLVKKTGKKEYHLWKKKDSAGSGQ 130

Query: 239 KAQNLVRIICGLPNEKVAVYSALDEWIAWETEFPLIAAAKALGILRKRSQWKRIIQVAKW 418
           KA NL+RI+  LPNEK  VY AL++WIAWETEFPLIAAAKAL ILRKRSQWKR+IQVAKW
Sbjct: 131 KALNLIRILSVLPNEKEVVYGALNKWIAWETEFPLIAAAKALRILRKRSQWKRVIQVAKW 190

Query: 419 MLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEMILQAHGRSISKRLFSRMISLYDHHN 598
           MLSKGQG TM TYD LLLAFDMD+R DEA+  W MIL  H RSISKRLFSRMI+LYDHH+
Sbjct: 191 MLSKGQGTTMGTYDTLLLAFDMDQRVDEAESFWNMILHTHKRSISKRLFSRMIALYDHHD 250

Query: 599 LPENVVEVFADMEELGVKPDEDTVRRIARAFETLGQEDKHTQFLNKYQRRWKYIHFKGE 775
           + + ++EVFADMEEL V+ DEDTVRR+A AF+ LGQE+K    L KYQ +WKY+HFKGE
Sbjct: 251 VKDKIIEVFADMEELSVRLDEDTVRRVAYAFQKLGQEEKKKLLLRKYQCKWKYVHFKGE 309


>ref|XP_004499754.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
           chloroplastic-like isoform X5 [Cicer arietinum]
          Length = 287

 Score =  309 bits (792), Expect = 1e-81
 Identities = 156/251 (62%), Positives = 181/251 (72%), Gaps = 5/251 (1%)
 Frame = +2

Query: 38  SEATHKASGGEKSIQLSKAIASPQHCRGSQ----KEQPLVA-PTAVEKKLVRKSGKKEHH 202
           SEAT     G K +  S  I+    C   +    K  P V  P   +KK  +  GK EHH
Sbjct: 25  SEATRVFLSGNKFLTTSITISRKTSCTSCRFVQSKSSPNVGRPVEKDKKGNKIKGKVEHH 84

Query: 203 LWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYSALDEWIAWETEFPLIAAAKALGILRKR 382
           LW+RR+ A SG KA  LVR IC LPNEK +VY ALD+W AWETEFPL+AAAKAL ILRKR
Sbjct: 85  LWKRRNSAQSGQKALTLVRTICELPNEKESVYGALDKWTAWETEFPLVAAAKALNILRKR 144

Query: 383 SQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEMILQAHGRSISKRL 562
            QW R+IQ+AKWMLSKGQGATM TYD LLLAFDMD+R DEA+ LW MI+ AH RS+SKRL
Sbjct: 145 GQWVRVIQLAKWMLSKGQGATMGTYDTLLLAFDMDQRIDEAESLWNMIIHAHMRSVSKRL 204

Query: 563 FSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFETLGQEDKHTQFLNKYQ 742
           FSRMISLYDHHNL E +VE+FADMEEL +KPDEDTVR++  AF  LGQE+K    + +Y 
Sbjct: 205 FSRMISLYDHHNLSEKIVEIFADMEELRIKPDEDTVRKVTNAFRKLGQEEKRKSVIKRYG 264

Query: 743 RRWKYIHFKGE 775
            +WKYIHF GE
Sbjct: 265 LKWKYIHFNGE 275


>ref|XP_006467567.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
           chloroplastic-like [Citrus sinensis]
          Length = 281

 Score =  308 bits (790), Expect = 2e-81
 Identities = 164/255 (64%), Positives = 184/255 (72%), Gaps = 6/255 (2%)
 Frame = +2

Query: 29  SNSSEATHKASGGEKSIQLSKAIASPQH--CRGSQKEQPLV----APTAVEKKLVRKSGK 190
           SNS       +    S+  +K   S  H  C  +Q +QP V    A  + + KLV K GK
Sbjct: 14  SNSCRIPPLQTASGFSLLTTKLATSNPHLKCFLNQNKQPPVSNSNANASKKNKLVVKVGK 73

Query: 191 KEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYSALDEWIAWETEFPLIAAAKALGI 370
           KE HLWQ+RD AGSG KA NLV     LPNEK AVY ALD+W AWETEFPLIAAAKAL I
Sbjct: 74  KEQHLWQKRDSAGSGQKALNLVS---ELPNEKHAVYGALDKWTAWETEFPLIAAAKALRI 130

Query: 371 LRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEMILQAHGRSI 550
           LRKR QW R+IQVAKWMLSKGQGATM TYD LLLAFD D R DEA+ LW MIL    RSI
Sbjct: 131 LRKRGQWLRVIQVAKWMLSKGQGATMGTYDTLLLAFDKDHRADEAESLWNMILHTQTRSI 190

Query: 551 SKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFETLGQEDKHTQFL 730
           SKRLFSRMISLYDHH++P  ++EVFADMEELGV+PDEDTVRRIA AF+ +GQ+DK    L
Sbjct: 191 SKRLFSRMISLYDHHDMPNKIIEVFADMEELGVRPDEDTVRRIASAFQRVGQDDKQKLVL 250

Query: 731 NKYQRRWKYIHFKGE 775
            KY  +WKYIHFKGE
Sbjct: 251 KKYLSKWKYIHFKGE 265


>ref|XP_003546058.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
           chloroplastic-like isoform X1 [Glycine max]
           gi|571517206|ref|XP_006597502.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g18975,
           chloroplastic-like isoform X2 [Glycine max]
          Length = 288

 Score =  299 bits (765), Expect = 1e-78
 Identities = 142/206 (68%), Positives = 166/206 (80%)
 Frame = +2

Query: 158 VEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYSALDEWIAWETEF 337
           +EKK  + +GKKEHHLW+ RD A SG KA  LVR +  LPNEK AVY ALD+W AWETEF
Sbjct: 67  MEKKGKKTTGKKEHHLWKSRDSAQSGQKALALVRTVYKLPNEKEAVYGALDKWTAWETEF 126

Query: 338 PLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLW 517
           P+IA +KAL ILRKR  W R+IQVAKWMLSKGQGATM TYD LLLAFDMD+R DEA+ LW
Sbjct: 127 PVIAVSKALKILRKRGHWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLW 186

Query: 518 EMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFET 697
            MI+ AH RS+SKRLFSRMISLYDHHN+P+ +++VFADMEEL +KPDEDTVRR+ARAF  
Sbjct: 187 NMIIHAHMRSVSKRLFSRMISLYDHHNMPDKIIDVFADMEELRLKPDEDTVRRVARAFRE 246

Query: 698 LGQEDKHTQFLNKYQRRWKYIHFKGE 775
           LG E+K    + +Y  +WKYIHF GE
Sbjct: 247 LGDEEKRKLVIKQYGLKWKYIHFNGE 272


>gb|ACU23441.1| unknown [Glycine max]
          Length = 288

 Score =  298 bits (764), Expect = 2e-78
 Identities = 142/206 (68%), Positives = 166/206 (80%)
 Frame = +2

Query: 158 VEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYSALDEWIAWETEF 337
           +EKK  + +GKKEHHLW+ RD A SG KA  LVR +  LPNEK AVY ALD+W AWETEF
Sbjct: 67  MEKKGKKTTGKKEHHLWKSRDSAQSGQKALALVRTVYKLPNEKEAVYGALDKWTAWETEF 126

Query: 338 PLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLW 517
           P+IA +KAL ILRKR  W R+IQVAKWMLSKGQGATM TYD LLLAFDMD+R DEA+ LW
Sbjct: 127 PVIAVSKALKILRKRGHWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLW 186

Query: 518 EMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFET 697
            MI+ AH RS+SKRLFSRMISLYDHHN+P+ +++VFADMEEL +KPDEDTVRR+ARAF  
Sbjct: 187 NMIIHAHLRSVSKRLFSRMISLYDHHNMPDKIIDVFADMEELRLKPDEDTVRRVARAFRE 246

Query: 698 LGQEDKHTQFLNKYQRRWKYIHFKGE 775
           LG E+K    + +Y  +WKYIHF GE
Sbjct: 247 LGDEEKRKLVIKQYGLKWKYIHFNGE 272


>ref|XP_007147960.1| hypothetical protein PHAVU_006G168700g [Phaseolus vulgaris]
           gi|561021183|gb|ESW19954.1| hypothetical protein
           PHAVU_006G168700g [Phaseolus vulgaris]
          Length = 289

 Score =  296 bits (757), Expect = 1e-77
 Identities = 152/261 (58%), Positives = 185/261 (70%), Gaps = 5/261 (1%)
 Frame = +2

Query: 8   PSLLIKASNSSEATHKASGGEKSIQLSKAIASPQ----HCRGSQKEQPLVAPTAVEKKLV 175
           P LL +       T +A   +  +  +    SP+    HC   + +        +EKK  
Sbjct: 13  PILLSRICQVKMDTTRALVWDNKLSTNAITVSPKRSFIHCLIKRSKFSPKGGGPMEKKKG 72

Query: 176 RKS-GKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYSALDEWIAWETEFPLIAA 352
           +K+ GKKEHHLW+ RD A SG KA  LVRI+  LPNEK AVY ALD+WIAWETEFP+IAA
Sbjct: 73  KKTTGKKEHHLWKSRDSAQSGQKALTLVRIVSKLPNEKEAVYGALDKWIAWETEFPVIAA 132

Query: 353 AKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEMILQ 532
           AKAL ILRKR  W R+IQVAKWMLSKGQGATM T+D LLLAFDMD+R DEA+ LW MI+ 
Sbjct: 133 AKALKILRKRGHWVRVIQVAKWMLSKGQGATMGTFDTLLLAFDMDQRVDEAESLWNMIIH 192

Query: 533 AHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFETLGQED 712
            H RS+SKRLFSRMIS+YD+H++P+ ++EVFADMEEL VKPDEDTVRR+ARAF  LG+E+
Sbjct: 193 THMRSVSKRLFSRMISIYDNHDMPDKIIEVFADMEELRVKPDEDTVRRVARAFTELGEEE 252

Query: 713 KHTQFLNKYQRRWKYIHFKGE 775
           K      +Y  +WKYIHF  E
Sbjct: 253 KRKLVARRYGIKWKYIHFNRE 273


>ref|XP_006848146.1| hypothetical protein AMTR_s00029p00227910 [Amborella trichopoda]
           gi|548851451|gb|ERN09727.1| hypothetical protein
           AMTR_s00029p00227910 [Amborella trichopoda]
          Length = 287

 Score =  293 bits (751), Expect = 6e-77
 Identities = 144/218 (66%), Positives = 172/218 (78%)
 Frame = +2

Query: 122 SQKEQPLVAPTAVEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYS 301
           S+K  PL+ P   + K   K  KKEHHLW +RD AGS  KA NLVRI+  + NEK A+Y 
Sbjct: 61  SRKIAPLITPVDEKPK---KLFKKEHHLWMKRDSAGSSQKALNLVRIVSRVSNEKEAIYV 117

Query: 302 ALDEWIAWETEFPLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFD 481
           ALDEW AWETEFP+IAAAKALGILRKR +W R+IQV+KW+LSKGQ  TM TYD LLLAFD
Sbjct: 118 ALDEWAAWETEFPVIAAAKALGILRKRRRWLRVIQVSKWLLSKGQVLTMGTYDTLLLAFD 177

Query: 482 MDRRPDEAKMLWEMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDE 661
           MD R DEA+ +W MIL  + RSISKRLFSRM+SLYDHH++P+ ++EVFADMEELGVKPD+
Sbjct: 178 MDGRVDEAETIWNMILHTYTRSISKRLFSRMMSLYDHHHIPDKLLEVFADMEELGVKPDQ 237

Query: 662 DTVRRIARAFETLGQEDKHTQFLNKYQRRWKYIHFKGE 775
           D+VRR+ARAF+ LG+E+K  Q L KY  + KYIHF GE
Sbjct: 238 DSVRRVARAFQQLGEEEKQKQVLQKYGLKLKYIHFNGE 275


>ref|XP_002867967.1| hypothetical protein ARALYDRAFT_914772 [Arabidopsis lyrata subsp.
           lyrata] gi|297313803|gb|EFH44226.1| hypothetical protein
           ARALYDRAFT_914772 [Arabidopsis lyrata subsp. lyrata]
          Length = 284

 Score =  292 bits (747), Expect = 2e-76
 Identities = 140/218 (64%), Positives = 170/218 (77%)
 Frame = +2

Query: 122 SQKEQPLVAPTAVEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYS 301
           S+K+   +    V    ++K GKKEHHLW++ D AGSG KA NLVR++ GLPNEK AVY 
Sbjct: 55  SEKQAGKLDVATVNSNEIKKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYG 114

Query: 302 ALDEWIAWETEFPLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFD 481
           AL++W+AWE EFP+IAAAKAL ILRKRSQW R+IQ+AKWMLSKGQGATM TYD LLLAFD
Sbjct: 115 ALNKWVAWEVEFPIIAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDTLLLAFD 174

Query: 482 MDRRPDEAKMLWEMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDE 661
           MD+R DEA+ LW MIL  H RSI +RLF+RMI+LY H++L + V+EVFADMEEL V+PDE
Sbjct: 175 MDQRADEAESLWNMILHTHTRSIPRRLFARMIALYAHYDLHDKVIEVFADMEELKVRPDE 234

Query: 662 DTVRRIARAFETLGQEDKHTQFLNKYQRRWKYIHFKGE 775
           DT RR+ARAF  LGQE+     L +Y   +KYI+F GE
Sbjct: 235 DTARRVARAFRELGQEENRKLILRRYLSEFKYIYFNGE 272


>ref|NP_001031667.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|332658716|gb|AEE84116.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 260

 Score =  289 bits (740), Expect = 1e-75
 Identities = 138/206 (66%), Positives = 163/206 (79%)
 Frame = +2

Query: 158 VEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYSALDEWIAWETEF 337
           V  K ++K GKKEHHLW++ D AGSG KA NLVR++ GLPNEK AVY AL++W+AWE EF
Sbjct: 43  VNSKEIKKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEF 102

Query: 338 PLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLW 517
           P+IAAAKAL ILRKRSQW R+IQ+AKWMLSKGQGATM TYD LLLAFDMD R DEA+ LW
Sbjct: 103 PIIAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAESLW 162

Query: 518 EMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFET 697
            MIL  H RSI +RLF+RMI+LY HH+L + V+EVFADMEEL V PDED+ RR+ARAF  
Sbjct: 163 NMILHTHTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAFRE 222

Query: 698 LGQEDKHTQFLNKYQRRWKYIHFKGE 775
           L QE+     L +Y   +KYI+F GE
Sbjct: 223 LNQEENRKLILRRYLSEYKYIYFNGE 248


>ref|NP_567571.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|186512032|ref|NP_001119009.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
           gi|334186688|ref|NP_001190768.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
           gi|223635632|sp|Q2V3H0.2|PP322_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At4g18975, chloroplastic; Flags: Precursor
           gi|332658715|gb|AEE84115.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
           gi|332658717|gb|AEE84117.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
           gi|332658718|gb|AEE84118.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 287

 Score =  289 bits (740), Expect = 1e-75
 Identities = 138/206 (66%), Positives = 163/206 (79%)
 Frame = +2

Query: 158 VEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYSALDEWIAWETEF 337
           V  K ++K GKKEHHLW++ D AGSG KA NLVR++ GLPNEK AVY AL++W+AWE EF
Sbjct: 70  VNSKEIKKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEF 129

Query: 338 PLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLW 517
           P+IAAAKAL ILRKRSQW R+IQ+AKWMLSKGQGATM TYD LLLAFDMD R DEA+ LW
Sbjct: 130 PIIAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAESLW 189

Query: 518 EMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFET 697
            MIL  H RSI +RLF+RMI+LY HH+L + V+EVFADMEEL V PDED+ RR+ARAF  
Sbjct: 190 NMILHTHTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAFRE 249

Query: 698 LGQEDKHTQFLNKYQRRWKYIHFKGE 775
           L QE+     L +Y   +KYI+F GE
Sbjct: 250 LNQEENRKLILRRYLSEYKYIYFNGE 275