BLASTX nr result
ID: Mentha29_contig00010294
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha29_contig00010294 (1018 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EYU33203.1| hypothetical protein MIMGU_mgv1a020021mg, partial... 335 1e-89 ref|XP_004232997.1| PREDICTED: pentatricopeptide repeat-containi... 326 1e-86 emb|CBI30774.3| unnamed protein product [Vitis vinifera] 325 2e-86 ref|XP_007025241.1| Pentatricopeptide repeat superfamily protein... 324 3e-86 ref|XP_004164255.1| PREDICTED: pentatricopeptide repeat-containi... 323 5e-86 ref|XP_007211999.1| hypothetical protein PRUPE_ppa011078mg [Prun... 322 1e-85 ref|XP_006355563.1| PREDICTED: pentatricopeptide repeat-containi... 320 4e-85 ref|XP_004294568.1| PREDICTED: pentatricopeptide repeat-containi... 320 6e-85 ref|XP_006449586.1| hypothetical protein CICLE_v10016169mg [Citr... 317 5e-84 ref|XP_002526919.1| conserved hypothetical protein [Ricinus comm... 313 7e-83 gb|EXB58283.1| hypothetical protein L484_015617 [Morus notabilis] 311 4e-82 ref|XP_004499754.1| PREDICTED: pentatricopeptide repeat-containi... 309 1e-81 ref|XP_006467567.1| PREDICTED: pentatricopeptide repeat-containi... 308 2e-81 ref|XP_003546058.1| PREDICTED: pentatricopeptide repeat-containi... 299 1e-78 gb|ACU23441.1| unknown [Glycine max] 298 2e-78 ref|XP_007147960.1| hypothetical protein PHAVU_006G168700g [Phas... 296 1e-77 ref|XP_006848146.1| hypothetical protein AMTR_s00029p00227910 [A... 293 6e-77 ref|XP_002867967.1| hypothetical protein ARALYDRAFT_914772 [Arab... 292 2e-76 ref|NP_001031667.1| pentatricopeptide repeat-containing protein ... 289 1e-75 ref|NP_567571.1| pentatricopeptide repeat-containing protein [Ar... 289 1e-75 >gb|EYU33203.1| hypothetical protein MIMGU_mgv1a020021mg, partial [Mimulus guttatus] Length = 209 Score = 335 bits (860), Expect = 1e-89 Identities = 159/203 (78%), Positives = 177/203 (87%) Frame = +2 Query: 167 KLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYSALDEWIAWETEFPLI 346 K++RKSGKKEHHLWQ+RD AGSGHKA NLVR IC LPNEK AVY ALDEWIAWETEFPLI Sbjct: 1 KVIRKSGKKEHHLWQKRDSAGSGHKALNLVRTICRLPNEKEAVYGALDEWIAWETEFPLI 60 Query: 347 AAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEMI 526 AAAKAL ILRKR+ WKRIIQV KWMLSKGQGATMSTYD LLLAFDMD R D+A++LW M+ Sbjct: 61 AAAKALRILRKRNHWKRIIQVGKWMLSKGQGATMSTYDSLLLAFDMDGRLDDAEILWNMV 120 Query: 527 LQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFETLGQ 706 LQ + RS+ K +FSRMISLYDHHNLP+ V+EVFADMEEL VKPDEDTVRR+ARAFE LGQ Sbjct: 121 LQTYNRSLPKMIFSRMISLYDHHNLPDKVIEVFADMEELEVKPDEDTVRRVARAFEALGQ 180 Query: 707 EDKHTQFLNKYQRRWKYIHFKGE 775 ++K + KYQ +WKYIHFKGE Sbjct: 181 KEKERLVMKKYQSKWKYIHFKGE 203 >ref|XP_004232997.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like [Solanum lycopersicum] Length = 265 Score = 326 bits (835), Expect = 1e-86 Identities = 165/255 (64%), Positives = 197/255 (77%) Frame = +2 Query: 11 SLLIKASNSSEATHKASGGEKSIQLSKAIASPQHCRGSQKEQPLVAPTAVEKKLVRKSGK 190 ++L+K NS+ + K + +S A+ +H + Q E L A ++K V+K+GK Sbjct: 13 NILLKGINSTGLSDK-------LNVSSAL---KHSK-KQGELSLTISDAADQKKVQKAGK 61 Query: 191 KEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYSALDEWIAWETEFPLIAAAKALGI 370 EHHLW++R+ AGSG KA NLVRII GLPNEK +VY ALD+WIAWETEFPLIAAAKAL I Sbjct: 62 VEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGALDKWIAWETEFPLIAAAKALRI 121 Query: 371 LRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEMILQAHGRSI 550 LR++ WKR+IQVAKWMLSKGQGATM+TYD LLLAFDMD R DEA+ LW MIL RS+ Sbjct: 122 LRQQRLWKRVIQVAKWMLSKGQGATMATYDALLLAFDMDNRVDEAETLWNMILHTSTRSV 181 Query: 551 SKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFETLGQEDKHTQFL 730 SKRLFSRMISLYDHH++P+ +VEVFADMEELGVKPDEDTVRR+ARAF+ LGQED L Sbjct: 182 SKRLFSRMISLYDHHHVPDKIVEVFADMEELGVKPDEDTVRRVARAFQMLGQEDNQKLVL 241 Query: 731 NKYQRRWKYIHFKGE 775 KYQ RWKY+HF GE Sbjct: 242 KKYQSRWKYVHFNGE 256 >emb|CBI30774.3| unnamed protein product [Vitis vinifera] Length = 277 Score = 325 bits (833), Expect = 2e-86 Identities = 158/207 (76%), Positives = 176/207 (85%) Frame = +2 Query: 155 AVEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYSALDEWIAWETE 334 AVEK++ +K GKKEHHLW++RD GSG KA NLVRI+ LPNEK AVY ALD+W AWETE Sbjct: 58 AVEKEISKKVGKKEHHLWRKRDSIGSGQKALNLVRIVSELPNEKEAVYGALDKWTAWETE 117 Query: 335 FPLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKML 514 FPLIAAAKAL ILRKR+QWKR+IQVAKWMLSKGQGATM TYD LLLAFDMD R DEA+ L Sbjct: 118 FPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDWRVDEAESL 177 Query: 515 WEMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFE 694 W MIL H RSISK+LFSRMISLYDHH++ + V+EVFADMEELGVKPDEDTVRR+A AF+ Sbjct: 178 WNMILHTHTRSISKQLFSRMISLYDHHDMRDKVIEVFADMEELGVKPDEDTVRRVACAFQ 237 Query: 695 TLGQEDKHTQFLNKYQRRWKYIHFKGE 775 TLGQEDK L KYQ +WKYIHF GE Sbjct: 238 TLGQEDKQKLVLKKYQCKWKYIHFNGE 264 >ref|XP_007025241.1| Pentatricopeptide repeat superfamily protein [Theobroma cacao] gi|508780607|gb|EOY27863.1| Pentatricopeptide repeat superfamily protein [Theobroma cacao] Length = 276 Score = 324 bits (831), Expect = 3e-86 Identities = 160/221 (72%), Positives = 178/221 (80%) Frame = +2 Query: 113 CRGSQKEQPLVAPTAVEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVA 292 C EQ L AVEKK V+K GK EHHLW++RD AGSG KA NLVRII LPNEK A Sbjct: 44 CSQKLGEQSLGISEAVEKKPVKKVGKNEHHLWKKRDSAGSGQKALNLVRIISQLPNEKEA 103 Query: 293 VYSALDEWIAWETEFPLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLL 472 VY ALD+W AWETEFPLIAAAKAL ILRKRSQW R+IQVAKWMLSKGQGATM TYD LLL Sbjct: 104 VYGALDKWTAWETEFPLIAAAKALRILRKRSQWLRVIQVAKWMLSKGQGATMGTYDTLLL 163 Query: 473 AFDMDRRPDEAKMLWEMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVK 652 AFDMD+R DEA+ LW MIL H RSISKRLFSRMISLYDHHN+ + ++EVFADMEEL V+ Sbjct: 164 AFDMDKRVDEAESLWNMILHIHTRSISKRLFSRMISLYDHHNMQDKIIEVFADMEELCVR 223 Query: 653 PDEDTVRRIARAFETLGQEDKHTQFLNKYQRRWKYIHFKGE 775 PDE+TVR++ARAF+ LGQEDK L +Y +WKYIHF GE Sbjct: 224 PDENTVRKVARAFQKLGQEDKQKLVLRRYLSKWKYIHFNGE 264 >ref|XP_004164255.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like [Cucumis sativus] Length = 270 Score = 323 bits (829), Expect = 5e-86 Identities = 154/221 (69%), Positives = 180/221 (81%) Frame = +2 Query: 113 CRGSQKEQPLVAPTAVEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVA 292 C +Q QPL + T E+++V+K GK+ HHLW++RD AGSG KA NLVRI+ PNEK A Sbjct: 36 CVHNQAAQPLTSFTTPERRVVKKVGKETHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEA 95 Query: 293 VYSALDEWIAWETEFPLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLL 472 VY L++WIAWETEFPLIAAAKAL ILRKRSQWKR+IQVAKWMLSKGQGATM TYD LLL Sbjct: 96 VYGELNKWIAWETEFPLIAAAKALRILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLL 155 Query: 473 AFDMDRRPDEAKMLWEMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVK 652 AFDMD+R DEA+ LW MIL H RSISKR+FSRMISLY+HH+L + ++E+FADMEELGVK Sbjct: 156 AFDMDKRVDEAESLWNMILHTHTRSISKRVFSRMISLYEHHDLQDKIIEIFADMEELGVK 215 Query: 653 PDEDTVRRIARAFETLGQEDKHTQFLNKYQRRWKYIHFKGE 775 PDEDTVRR+ RAF+ LGQED +Y +WKYIHFKGE Sbjct: 216 PDEDTVRRVCRAFQKLGQEDNRKMVYKRYSCQWKYIHFKGE 256 >ref|XP_007211999.1| hypothetical protein PRUPE_ppa011078mg [Prunus persica] gi|462407864|gb|EMJ13198.1| hypothetical protein PRUPE_ppa011078mg [Prunus persica] Length = 224 Score = 322 bits (826), Expect = 1e-85 Identities = 154/204 (75%), Positives = 172/204 (84%) Frame = +2 Query: 164 KKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYSALDEWIAWETEFPL 343 +K ++K G+KEHHLWQ+RD AGSG KA NLVRI+ GLPNEK VY ALD+W AWETEFPL Sbjct: 4 RKTIKKVGRKEHHLWQKRDSAGSGQKALNLVRIVSGLPNEKETVYGALDKWTAWETEFPL 63 Query: 344 IAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEM 523 IAA KAL ILRKRSQW R+IQVAKWMLSKGQGATM TYD LLLAFDMD+R DEA+ LW M Sbjct: 64 IAAVKALRILRKRSQWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDQRVDEAESLWNM 123 Query: 524 ILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFETLG 703 IL H RSISKRLFSRMISLYDHH+ ++EVFADMEELGVKPDEDTVRR+ARAF+ LG Sbjct: 124 ILHTHTRSISKRLFSRMISLYDHHDKQNKIIEVFADMEELGVKPDEDTVRRVARAFKELG 183 Query: 704 QEDKHTQFLNKYQRRWKYIHFKGE 775 QE+ T L +YQ +WKYIHFKGE Sbjct: 184 QEENKTLVLRRYQCKWKYIHFKGE 207 >ref|XP_006355563.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like isoform X1 [Solanum tuberosum] gi|565378234|ref|XP_006355564.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like isoform X2 [Solanum tuberosum] gi|565378236|ref|XP_006355565.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like isoform X3 [Solanum tuberosum] Length = 265 Score = 320 bits (821), Expect = 4e-85 Identities = 162/240 (67%), Positives = 189/240 (78%), Gaps = 3/240 (1%) Frame = +2 Query: 65 GEKSIQLSKAIASPQHCRGSQKEQPL---VAPTAVEKKLVRKSGKKEHHLWQRRDQAGSG 235 G S LS + + S+K+ L ++ TA +KK V+K+GK EHHLW++R+ AGSG Sbjct: 18 GINSTGLSDKLNVTSALKDSKKQGELSLTISDTADQKK-VQKAGKVEHHLWKKRESAGSG 76 Query: 236 HKAQNLVRIICGLPNEKVAVYSALDEWIAWETEFPLIAAAKALGILRKRSQWKRIIQVAK 415 KA NLVRII GLPNEK +VY ALD+WIAWE EFPLIAAAKAL ILR++ WKR+IQVAK Sbjct: 77 QKALNLVRIISGLPNEKESVYGALDKWIAWEAEFPLIAAAKALRILRQQRLWKRVIQVAK 136 Query: 416 WMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEMILQAHGRSISKRLFSRMISLYDHH 595 WMLSKGQGATM+TYD LLLAFDMD R DEA+ LW MIL RS+SKRLFSRMISLYDHH Sbjct: 137 WMLSKGQGATMATYDALLLAFDMDNRVDEAETLWNMILHTSTRSVSKRLFSRMISLYDHH 196 Query: 596 NLPENVVEVFADMEELGVKPDEDTVRRIARAFETLGQEDKHTQFLNKYQRRWKYIHFKGE 775 ++P+ +VEVFADMEELGVKPDEDTV R+ARAF+ LGQEDK L KYQ RWKY+HF GE Sbjct: 197 HVPDKIVEVFADMEELGVKPDEDTVGRVARAFQMLGQEDKQKLVLKKYQSRWKYVHFNGE 256 >ref|XP_004294568.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like [Fragaria vesca subsp. vesca] Length = 300 Score = 320 bits (820), Expect = 6e-85 Identities = 150/221 (67%), Positives = 179/221 (80%) Frame = +2 Query: 113 CRGSQKEQPLVAPTAVEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVA 292 C Q Q ++A A+EKK+++K+G+ EHHLW+++D AGSG KA NL+RI+ LPNEK A Sbjct: 63 CCQKQSRQTVMASKAMEKKIIKKAGRNEHHLWKKKDSAGSGQKALNLIRIVSDLPNEKEA 122 Query: 293 VYSALDEWIAWETEFPLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLL 472 ++ ALD+W AWETEFPLIAAAKAL ILR+ QW+R+IQVAKWMLSKGQGATM+TYD LLL Sbjct: 123 IFGALDKWTAWETEFPLIAAAKALRILRRTCQWRRVIQVAKWMLSKGQGATMATYDTLLL 182 Query: 473 AFDMDRRPDEAKMLWEMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVK 652 AFDMD R DEA+ LW MIL H RSISKRLFSRMISLYDHH + ++EVFADMEEL V+ Sbjct: 183 AFDMDNRLDEAESLWNMILHTHTRSISKRLFSRMISLYDHHEMKTKIIEVFADMEELSVR 242 Query: 653 PDEDTVRRIARAFETLGQEDKHTQFLNKYQRRWKYIHFKGE 775 PDEDTVRR+ARAF+ GQEDK L +Y +WKYIHFKGE Sbjct: 243 PDEDTVRRVARAFQEFGQEDKSKLVLRRYGCKWKYIHFKGE 283 >ref|XP_006449586.1| hypothetical protein CICLE_v10016169mg [Citrus clementina] gi|557552197|gb|ESR62826.1| hypothetical protein CICLE_v10016169mg [Citrus clementina] Length = 284 Score = 317 bits (812), Expect = 5e-84 Identities = 162/240 (67%), Positives = 184/240 (76%), Gaps = 6/240 (2%) Frame = +2 Query: 74 SIQLSKAIASPQH--CRGSQKEQPLVAPT----AVEKKLVRKSGKKEHHLWQRRDQAGSG 235 S+ +K S H C +Q + P VA + + + KLV K GKKE HLWQ+RD AGSG Sbjct: 29 SLLTTKLATSNPHLKCFLNQNKLPPVANSNANASKKNKLVVKVGKKEQHLWQKRDSAGSG 88 Query: 236 HKAQNLVRIICGLPNEKVAVYSALDEWIAWETEFPLIAAAKALGILRKRSQWKRIIQVAK 415 KA NLVRI+ LPNEK AVY ALD+W AWETEFPLIAAAKAL ILRKR QW R+IQVAK Sbjct: 89 QKALNLVRIVSELPNEKHAVYGALDKWTAWETEFPLIAAAKALRILRKRGQWLRVIQVAK 148 Query: 416 WMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEMILQAHGRSISKRLFSRMISLYDHH 595 WMLSKGQGATM TYD LLLAFD D R DEA+ LW MIL H RSISKRLFSRMISLYDHH Sbjct: 149 WMLSKGQGATMGTYDTLLLAFDKDHRADEAESLWNMILHTHTRSISKRLFSRMISLYDHH 208 Query: 596 NLPENVVEVFADMEELGVKPDEDTVRRIARAFETLGQEDKHTQFLNKYQRRWKYIHFKGE 775 ++P ++EVFADMEELGV+PDEDTVRRIA AF+ +GQ++K L KY +WKYIHFKGE Sbjct: 209 DMPNKIIEVFADMEELGVRPDEDTVRRIASAFQRVGQDEKQKLVLKKYLSKWKYIHFKGE 268 >ref|XP_002526919.1| conserved hypothetical protein [Ricinus communis] gi|223533738|gb|EEF35472.1| conserved hypothetical protein [Ricinus communis] Length = 224 Score = 313 bits (802), Expect = 7e-83 Identities = 147/204 (72%), Positives = 169/204 (82%) Frame = +2 Query: 164 KKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYSALDEWIAWETEFPL 343 +K V+K+GK+EHHLW++RD A SG KA +LVRI+C LP+EK VY ALD+W AWETEFPL Sbjct: 10 RKPVKKAGKEEHHLWKKRDSARSGEKALSLVRIVCELPDEKECVYGALDKWTAWETEFPL 69 Query: 344 IAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEM 523 IA AK L ILRK +QW R+IQVAKWMLSKGQG TM TYD LLLAFDMD R DEA LW M Sbjct: 70 IAVAKGLRILRKHNQWLRVIQVAKWMLSKGQGTTMGTYDTLLLAFDMDNRVDEAASLWNM 129 Query: 524 ILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFETLG 703 IL H RSISKRLFSRMISLYDHHN+P+ ++E+FADMEELGV+PDEDTVRR+ARAF+ LG Sbjct: 130 ILHTHVRSISKRLFSRMISLYDHHNMPDGIIEIFADMEELGVRPDEDTVRRVARAFKELG 189 Query: 704 QEDKHTQFLNKYQRRWKYIHFKGE 775 QE+K L +Y RWKYIHFKGE Sbjct: 190 QEEKQKLVLKRYMSRWKYIHFKGE 213 >gb|EXB58283.1| hypothetical protein L484_015617 [Morus notabilis] Length = 326 Score = 311 bits (796), Expect = 4e-82 Identities = 153/239 (64%), Positives = 182/239 (76%), Gaps = 14/239 (5%) Frame = +2 Query: 101 SPQHCRGSQKEQPLVAPTAVEK--------------KLVRKSGKKEHHLWQRRDQAGSGH 238 S +C +PL + A+EK LV+K+GKKE+HLW+++D AGSG Sbjct: 71 SHHNCSIKGNGEPLTSSKAIEKLQRLCIEFLYMEFRNLVKKTGKKEYHLWKKKDSAGSGQ 130 Query: 239 KAQNLVRIICGLPNEKVAVYSALDEWIAWETEFPLIAAAKALGILRKRSQWKRIIQVAKW 418 KA NL+RI+ LPNEK VY AL++WIAWETEFPLIAAAKAL ILRKRSQWKR+IQVAKW Sbjct: 131 KALNLIRILSVLPNEKEVVYGALNKWIAWETEFPLIAAAKALRILRKRSQWKRVIQVAKW 190 Query: 419 MLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEMILQAHGRSISKRLFSRMISLYDHHN 598 MLSKGQG TM TYD LLLAFDMD+R DEA+ W MIL H RSISKRLFSRMI+LYDHH+ Sbjct: 191 MLSKGQGTTMGTYDTLLLAFDMDQRVDEAESFWNMILHTHKRSISKRLFSRMIALYDHHD 250 Query: 599 LPENVVEVFADMEELGVKPDEDTVRRIARAFETLGQEDKHTQFLNKYQRRWKYIHFKGE 775 + + ++EVFADMEEL V+ DEDTVRR+A AF+ LGQE+K L KYQ +WKY+HFKGE Sbjct: 251 VKDKIIEVFADMEELSVRLDEDTVRRVAYAFQKLGQEEKKKLLLRKYQCKWKYVHFKGE 309 >ref|XP_004499754.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like isoform X5 [Cicer arietinum] Length = 287 Score = 309 bits (792), Expect = 1e-81 Identities = 156/251 (62%), Positives = 181/251 (72%), Gaps = 5/251 (1%) Frame = +2 Query: 38 SEATHKASGGEKSIQLSKAIASPQHCRGSQ----KEQPLVA-PTAVEKKLVRKSGKKEHH 202 SEAT G K + S I+ C + K P V P +KK + GK EHH Sbjct: 25 SEATRVFLSGNKFLTTSITISRKTSCTSCRFVQSKSSPNVGRPVEKDKKGNKIKGKVEHH 84 Query: 203 LWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYSALDEWIAWETEFPLIAAAKALGILRKR 382 LW+RR+ A SG KA LVR IC LPNEK +VY ALD+W AWETEFPL+AAAKAL ILRKR Sbjct: 85 LWKRRNSAQSGQKALTLVRTICELPNEKESVYGALDKWTAWETEFPLVAAAKALNILRKR 144 Query: 383 SQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEMILQAHGRSISKRL 562 QW R+IQ+AKWMLSKGQGATM TYD LLLAFDMD+R DEA+ LW MI+ AH RS+SKRL Sbjct: 145 GQWVRVIQLAKWMLSKGQGATMGTYDTLLLAFDMDQRIDEAESLWNMIIHAHMRSVSKRL 204 Query: 563 FSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFETLGQEDKHTQFLNKYQ 742 FSRMISLYDHHNL E +VE+FADMEEL +KPDEDTVR++ AF LGQE+K + +Y Sbjct: 205 FSRMISLYDHHNLSEKIVEIFADMEELRIKPDEDTVRKVTNAFRKLGQEEKRKSVIKRYG 264 Query: 743 RRWKYIHFKGE 775 +WKYIHF GE Sbjct: 265 LKWKYIHFNGE 275 >ref|XP_006467567.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like [Citrus sinensis] Length = 281 Score = 308 bits (790), Expect = 2e-81 Identities = 164/255 (64%), Positives = 184/255 (72%), Gaps = 6/255 (2%) Frame = +2 Query: 29 SNSSEATHKASGGEKSIQLSKAIASPQH--CRGSQKEQPLV----APTAVEKKLVRKSGK 190 SNS + S+ +K S H C +Q +QP V A + + KLV K GK Sbjct: 14 SNSCRIPPLQTASGFSLLTTKLATSNPHLKCFLNQNKQPPVSNSNANASKKNKLVVKVGK 73 Query: 191 KEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYSALDEWIAWETEFPLIAAAKALGI 370 KE HLWQ+RD AGSG KA NLV LPNEK AVY ALD+W AWETEFPLIAAAKAL I Sbjct: 74 KEQHLWQKRDSAGSGQKALNLVS---ELPNEKHAVYGALDKWTAWETEFPLIAAAKALRI 130 Query: 371 LRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEMILQAHGRSI 550 LRKR QW R+IQVAKWMLSKGQGATM TYD LLLAFD D R DEA+ LW MIL RSI Sbjct: 131 LRKRGQWLRVIQVAKWMLSKGQGATMGTYDTLLLAFDKDHRADEAESLWNMILHTQTRSI 190 Query: 551 SKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFETLGQEDKHTQFL 730 SKRLFSRMISLYDHH++P ++EVFADMEELGV+PDEDTVRRIA AF+ +GQ+DK L Sbjct: 191 SKRLFSRMISLYDHHDMPNKIIEVFADMEELGVRPDEDTVRRIASAFQRVGQDDKQKLVL 250 Query: 731 NKYQRRWKYIHFKGE 775 KY +WKYIHFKGE Sbjct: 251 KKYLSKWKYIHFKGE 265 >ref|XP_003546058.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like isoform X1 [Glycine max] gi|571517206|ref|XP_006597502.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like isoform X2 [Glycine max] Length = 288 Score = 299 bits (765), Expect = 1e-78 Identities = 142/206 (68%), Positives = 166/206 (80%) Frame = +2 Query: 158 VEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYSALDEWIAWETEF 337 +EKK + +GKKEHHLW+ RD A SG KA LVR + LPNEK AVY ALD+W AWETEF Sbjct: 67 MEKKGKKTTGKKEHHLWKSRDSAQSGQKALALVRTVYKLPNEKEAVYGALDKWTAWETEF 126 Query: 338 PLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLW 517 P+IA +KAL ILRKR W R+IQVAKWMLSKGQGATM TYD LLLAFDMD+R DEA+ LW Sbjct: 127 PVIAVSKALKILRKRGHWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLW 186 Query: 518 EMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFET 697 MI+ AH RS+SKRLFSRMISLYDHHN+P+ +++VFADMEEL +KPDEDTVRR+ARAF Sbjct: 187 NMIIHAHMRSVSKRLFSRMISLYDHHNMPDKIIDVFADMEELRLKPDEDTVRRVARAFRE 246 Query: 698 LGQEDKHTQFLNKYQRRWKYIHFKGE 775 LG E+K + +Y +WKYIHF GE Sbjct: 247 LGDEEKRKLVIKQYGLKWKYIHFNGE 272 >gb|ACU23441.1| unknown [Glycine max] Length = 288 Score = 298 bits (764), Expect = 2e-78 Identities = 142/206 (68%), Positives = 166/206 (80%) Frame = +2 Query: 158 VEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYSALDEWIAWETEF 337 +EKK + +GKKEHHLW+ RD A SG KA LVR + LPNEK AVY ALD+W AWETEF Sbjct: 67 MEKKGKKTTGKKEHHLWKSRDSAQSGQKALALVRTVYKLPNEKEAVYGALDKWTAWETEF 126 Query: 338 PLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLW 517 P+IA +KAL ILRKR W R+IQVAKWMLSKGQGATM TYD LLLAFDMD+R DEA+ LW Sbjct: 127 PVIAVSKALKILRKRGHWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLW 186 Query: 518 EMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFET 697 MI+ AH RS+SKRLFSRMISLYDHHN+P+ +++VFADMEEL +KPDEDTVRR+ARAF Sbjct: 187 NMIIHAHLRSVSKRLFSRMISLYDHHNMPDKIIDVFADMEELRLKPDEDTVRRVARAFRE 246 Query: 698 LGQEDKHTQFLNKYQRRWKYIHFKGE 775 LG E+K + +Y +WKYIHF GE Sbjct: 247 LGDEEKRKLVIKQYGLKWKYIHFNGE 272 >ref|XP_007147960.1| hypothetical protein PHAVU_006G168700g [Phaseolus vulgaris] gi|561021183|gb|ESW19954.1| hypothetical protein PHAVU_006G168700g [Phaseolus vulgaris] Length = 289 Score = 296 bits (757), Expect = 1e-77 Identities = 152/261 (58%), Positives = 185/261 (70%), Gaps = 5/261 (1%) Frame = +2 Query: 8 PSLLIKASNSSEATHKASGGEKSIQLSKAIASPQ----HCRGSQKEQPLVAPTAVEKKLV 175 P LL + T +A + + + SP+ HC + + +EKK Sbjct: 13 PILLSRICQVKMDTTRALVWDNKLSTNAITVSPKRSFIHCLIKRSKFSPKGGGPMEKKKG 72 Query: 176 RKS-GKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYSALDEWIAWETEFPLIAA 352 +K+ GKKEHHLW+ RD A SG KA LVRI+ LPNEK AVY ALD+WIAWETEFP+IAA Sbjct: 73 KKTTGKKEHHLWKSRDSAQSGQKALTLVRIVSKLPNEKEAVYGALDKWIAWETEFPVIAA 132 Query: 353 AKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEMILQ 532 AKAL ILRKR W R+IQVAKWMLSKGQGATM T+D LLLAFDMD+R DEA+ LW MI+ Sbjct: 133 AKALKILRKRGHWVRVIQVAKWMLSKGQGATMGTFDTLLLAFDMDQRVDEAESLWNMIIH 192 Query: 533 AHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFETLGQED 712 H RS+SKRLFSRMIS+YD+H++P+ ++EVFADMEEL VKPDEDTVRR+ARAF LG+E+ Sbjct: 193 THMRSVSKRLFSRMISIYDNHDMPDKIIEVFADMEELRVKPDEDTVRRVARAFTELGEEE 252 Query: 713 KHTQFLNKYQRRWKYIHFKGE 775 K +Y +WKYIHF E Sbjct: 253 KRKLVARRYGIKWKYIHFNRE 273 >ref|XP_006848146.1| hypothetical protein AMTR_s00029p00227910 [Amborella trichopoda] gi|548851451|gb|ERN09727.1| hypothetical protein AMTR_s00029p00227910 [Amborella trichopoda] Length = 287 Score = 293 bits (751), Expect = 6e-77 Identities = 144/218 (66%), Positives = 172/218 (78%) Frame = +2 Query: 122 SQKEQPLVAPTAVEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYS 301 S+K PL+ P + K K KKEHHLW +RD AGS KA NLVRI+ + NEK A+Y Sbjct: 61 SRKIAPLITPVDEKPK---KLFKKEHHLWMKRDSAGSSQKALNLVRIVSRVSNEKEAIYV 117 Query: 302 ALDEWIAWETEFPLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFD 481 ALDEW AWETEFP+IAAAKALGILRKR +W R+IQV+KW+LSKGQ TM TYD LLLAFD Sbjct: 118 ALDEWAAWETEFPVIAAAKALGILRKRRRWLRVIQVSKWLLSKGQVLTMGTYDTLLLAFD 177 Query: 482 MDRRPDEAKMLWEMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDE 661 MD R DEA+ +W MIL + RSISKRLFSRM+SLYDHH++P+ ++EVFADMEELGVKPD+ Sbjct: 178 MDGRVDEAETIWNMILHTYTRSISKRLFSRMMSLYDHHHIPDKLLEVFADMEELGVKPDQ 237 Query: 662 DTVRRIARAFETLGQEDKHTQFLNKYQRRWKYIHFKGE 775 D+VRR+ARAF+ LG+E+K Q L KY + KYIHF GE Sbjct: 238 DSVRRVARAFQQLGEEEKQKQVLQKYGLKLKYIHFNGE 275 >ref|XP_002867967.1| hypothetical protein ARALYDRAFT_914772 [Arabidopsis lyrata subsp. lyrata] gi|297313803|gb|EFH44226.1| hypothetical protein ARALYDRAFT_914772 [Arabidopsis lyrata subsp. lyrata] Length = 284 Score = 292 bits (747), Expect = 2e-76 Identities = 140/218 (64%), Positives = 170/218 (77%) Frame = +2 Query: 122 SQKEQPLVAPTAVEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYS 301 S+K+ + V ++K GKKEHHLW++ D AGSG KA NLVR++ GLPNEK AVY Sbjct: 55 SEKQAGKLDVATVNSNEIKKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYG 114 Query: 302 ALDEWIAWETEFPLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFD 481 AL++W+AWE EFP+IAAAKAL ILRKRSQW R+IQ+AKWMLSKGQGATM TYD LLLAFD Sbjct: 115 ALNKWVAWEVEFPIIAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDTLLLAFD 174 Query: 482 MDRRPDEAKMLWEMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDE 661 MD+R DEA+ LW MIL H RSI +RLF+RMI+LY H++L + V+EVFADMEEL V+PDE Sbjct: 175 MDQRADEAESLWNMILHTHTRSIPRRLFARMIALYAHYDLHDKVIEVFADMEELKVRPDE 234 Query: 662 DTVRRIARAFETLGQEDKHTQFLNKYQRRWKYIHFKGE 775 DT RR+ARAF LGQE+ L +Y +KYI+F GE Sbjct: 235 DTARRVARAFRELGQEENRKLILRRYLSEFKYIYFNGE 272 >ref|NP_001031667.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332658716|gb|AEE84116.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 260 Score = 289 bits (740), Expect = 1e-75 Identities = 138/206 (66%), Positives = 163/206 (79%) Frame = +2 Query: 158 VEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYSALDEWIAWETEF 337 V K ++K GKKEHHLW++ D AGSG KA NLVR++ GLPNEK AVY AL++W+AWE EF Sbjct: 43 VNSKEIKKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEF 102 Query: 338 PLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLW 517 P+IAAAKAL ILRKRSQW R+IQ+AKWMLSKGQGATM TYD LLLAFDMD R DEA+ LW Sbjct: 103 PIIAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAESLW 162 Query: 518 EMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFET 697 MIL H RSI +RLF+RMI+LY HH+L + V+EVFADMEEL V PDED+ RR+ARAF Sbjct: 163 NMILHTHTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAFRE 222 Query: 698 LGQEDKHTQFLNKYQRRWKYIHFKGE 775 L QE+ L +Y +KYI+F GE Sbjct: 223 LNQEENRKLILRRYLSEYKYIYFNGE 248 >ref|NP_567571.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|186512032|ref|NP_001119009.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|334186688|ref|NP_001190768.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|223635632|sp|Q2V3H0.2|PP322_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g18975, chloroplastic; Flags: Precursor gi|332658715|gb|AEE84115.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332658717|gb|AEE84117.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332658718|gb|AEE84118.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 287 Score = 289 bits (740), Expect = 1e-75 Identities = 138/206 (66%), Positives = 163/206 (79%) Frame = +2 Query: 158 VEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKVAVYSALDEWIAWETEF 337 V K ++K GKKEHHLW++ D AGSG KA NLVR++ GLPNEK AVY AL++W+AWE EF Sbjct: 70 VNSKEIKKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEF 129 Query: 338 PLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLW 517 P+IAAAKAL ILRKRSQW R+IQ+AKWMLSKGQGATM TYD LLLAFDMD R DEA+ LW Sbjct: 130 PIIAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAESLW 189 Query: 518 EMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFET 697 MIL H RSI +RLF+RMI+LY HH+L + V+EVFADMEEL V PDED+ RR+ARAF Sbjct: 190 NMILHTHTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAFRE 249 Query: 698 LGQEDKHTQFLNKYQRRWKYIHFKGE 775 L QE+ L +Y +KYI+F GE Sbjct: 250 LNQEENRKLILRRYLSEYKYIYFNGE 275