BLASTX nr result
ID: Catharanthus23_contig00010388
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00010388
         (1215 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002270184.2| PREDICTED: pentatricopeptide repeat-containi...   535   e-149
ref|XP_006343482.1| PREDICTED: pentatricopeptide repeat-containi...   510   e-142
ref|XP_004250704.1| PREDICTED: pentatricopeptide repeat-containi...   509   e-141
ref|XP_002299667.2| hypothetical protein POPTR_0001s21880g [Popu...   507   e-141
ref|XP_006343484.1| PREDICTED: pentatricopeptide repeat-containi...   505   e-140
ref|XP_006343483.1| PREDICTED: pentatricopeptide repeat-containi...   505   e-140
ref|XP_006343481.1| PREDICTED: pentatricopeptide repeat-containi...   505   e-140
ref|XP_002529286.1| pentatricopeptide repeat-containing protein,...   505   e-140
ref|XP_004141206.1| PREDICTED: pentatricopeptide repeat-containi...   504   e-140
gb|EMJ26349.1| hypothetical protein PRUPE_ppa002505mg [Prunus pe...   501   e-139
ref|XP_004299940.1| PREDICTED: pentatricopeptide repeat-containi...   500   e-139
ref|XP_003531588.2| PREDICTED: pentatricopeptide repeat-containi...   499   e-139
ref|XP_004486236.1| PREDICTED: pentatricopeptide repeat-containi...   496   e-137
ref|XP_006431883.1| hypothetical protein CICLE_v10000525mg [Citr...   494   e-137
gb|ESW20506.1| hypothetical protein PHAVU_006G214900g [Phaseolus...   492   e-136
gb|EOX98058.1| Pentatricopeptide repeat superfamily protein isof...   488   e-135
ref|XP_003593032.1| Pentatricopeptide repeat-containing protein ...   483   e-134
gb|EOX98059.1| Pentatricopeptide repeat (PPR) superfamily protei...
479 e-132 ref|XP_006417404.1| hypothetical protein EUTSA_v10007006mg [Eutr... 463 e-128 ref|NP_172560.2| pentatricopeptide repeat-containing protein [Ar... 460 e-127 >ref|XP_002270184.2| PREDICTED: pentatricopeptide repeat-containing protein At1g10910, chloroplastic [Vitis vinifera] gi|298204537|emb|CBI23812.3| unnamed protein product [Vitis vinifera] Length = 659 Score = 535 bits (1377), Expect = e-149 Identities = 263/361 (72%), Positives = 307/361 (85%) Frame = -1 Query: 1209 GNYKKAENLIQDMRSAGLELNKVILTTLLKVYVRSDLFEKSRVLLNELENLGYAADEMPY 1030 G+YKKA+ L+QDM+SAGL NKVILTTLLKVYVR LFEKSR LL ELE+LGYA DEMPY Sbjct: 290 GDYKKADMLVQDMKSAGLVPNKVILTTLLKVYVRGGLFEKSRELLAELEDLGYAEDEMPY 349 Query: 1029 CILMDGLVKSGQTLEAKSVFDDMMKKKVRTDGYSYSIMISAFCHVGLLQEAKQLISDYEA 850 C+LMDGL KS + LEAKS+F++M KK+V++DGY YSIMISAFC GLL+EAKQL D+EA Sbjct: 350 CLLMDGLAKSRRILEAKSIFEEMKKKQVKSDGYCYSIMISAFCRSGLLKEAKQLARDFEA 409 Query: 849 NYQKYDVVILNSMLCAYCRAGEMDNVXXXXXXMDELAISPDWNTFHILIKYFCSEKLYIL 670 Y KYD+V+LN+MLCAYCRAGEM++V MDELAISPDWNTFHILIKYFC EKLY+L Sbjct: 410 TYDKYDLVMLNTMLCAYCRAGEMESVMQMMRKMDELAISPDWNTFHILIKYFCKEKLYLL 469 Query: 669 AYRTMEDMHKKGHQVEEELCCTLMFHLGKTGAYGEAFSVYNMLRYSKRTMNKSLHEKLLH 490 AYRTMEDMH KGHQ EEELC +L+ HLGK A+ +AFSVYNMLRYSKRTM K+LHEK+LH Sbjct: 470 AYRTMEDMHNKGHQPEEELCSSLISHLGKIRAHSQAFSVYNMLRYSKRTMCKALHEKILH 529 Query: 489 ILVRGRLLKEAYVIVKDNGGLISPSAIKKFATSFMQSGNINLVNDVIKTIHSFGFKINQE 310 ILV GRLLK+AYV+VKDN GLIS +IKKFAT+FM+ GN+NL+NDV+K IH G+KI+QE Sbjct: 530 ILVAGRLLKDAYVVVKDNEGLISKPSIKKFATAFMKFGNVNLINDVMKAIHGSGYKIDQE 589 Query: 309 LFHSAVSRYLEETQKKDXXXXXXQWMPGQGYFVDSSTRNLILKNAHLFGRQTIAELLSKQ 130 LF AV+RY+ E +KK+ QWMPGQGY VDSSTRN+ILKN+HLFGRQ IAE+LSKQ Sbjct: 590 LFQMAVTRYIAEPEKKELLLHLLQWMPGQGYVVDSSTRNMILKNSHLFGRQLIAEMLSKQ 649 Query: 129 H 127 H Sbjct: 650 H 650 >ref|XP_006343482.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910, chloroplastic-like isoform X2 [Solanum tuberosum] Length = 651 Score = 510 bits (1313), Expect = e-142 Identities = 247/363 (68%), Positives = 
302/363 (83%) Frame = -1 Query: 1206 NYKKAENLIQDMRSAGLELNKVILTTLLKVYVRSDLFEKSRVLLNELENLGYAADEMPYC 1027 NY+KAE LI++MRSAGL LNKVI TTLLKVYV+ LFEKS+ LL ELE LGYA DEMP+C Sbjct: 278 NYEKAEVLIEEMRSAGLVLNKVIYTTLLKVYVKGGLFEKSKELLKELEALGYAKDEMPFC 337 Query: 1026 ILMDGLVKSGQTLEAKSVFDDMMKKKVRTDGYSYSIMISAFCHVGLLQEAKQLISDYEAN 847 +LMDGL KSG LEAKSVFD+MM+K V+TDGYSYSIMISAFC GLL++AK++ S++E Sbjct: 338 LLMDGLAKSGHLLEAKSVFDEMMEKHVKTDGYSYSIMISAFCRSGLLEDAKKVASEFEEK 397 Query: 846 YQKYDVVILNSMLCAYCRAGEMDNVXXXXXXMDELAISPDWNTFHILIKYFCSEKLYILA 667 Y KYD+VILN+ML AYCRAG+M+NV MD+ AISPDWNTF+ILI+YFC EKLY+LA Sbjct: 398 YDKYDIVILNAMLSAYCRAGKMENVMSMMKKMDDSAISPDWNTFNILIRYFCKEKLYLLA 457 Query: 666 YRTMEDMHKKGHQVEEELCCTLMFHLGKTGAYGEAFSVYNMLRYSKRTMNKSLHEKLLHI 487 YRTMEDMH KGHQ EE LC +L++HLGKTGA+ EAFSVYNMLRYSKRT++ +LHE +LHI Sbjct: 458 YRTMEDMHSKGHQPEEGLCSSLIYHLGKTGAHSEAFSVYNMLRYSKRTISNALHEHILHI 517 Query: 486 LVRGRLLKEAYVIVKDNGGLISPSAIKKFATSFMQSGNINLVNDVIKTIHSFGFKINQEL 307 L+ GRLLK+AYV+VKDN G IS AIKKF+ +FM+SGN+NL+NDV+ +HS G KI+QEL Sbjct: 518 LIAGRLLKDAYVVVKDNAGFISQPAIKKFSVNFMRSGNVNLINDVMNAMHSSGHKIDQEL 577 Query: 306 FHSAVSRYLEETQKKDXXXXXXQWMPGQGYFVDSSTRNLILKNAHLFGRQTIAELLSKQH 127 F A++RY+ + +KK+ +WMPG+GY +DSSTRNLILKN+HLFG Q IAE LSK Sbjct: 578 FDLAIARYIAKPEKKELLLWLLKWMPGKGYAIDSSTRNLILKNSHLFGHQLIAESLSKHL 637 Query: 126 IVS 118 ++S Sbjct: 638 VMS 640 >ref|XP_004250704.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910, chloroplastic-like [Solanum lycopersicum] Length = 642 Score = 509 bits (1310), Expect = e-141 Identities = 247/363 (68%), Positives = 302/363 (83%) Frame = -1 Query: 1206 NYKKAENLIQDMRSAGLELNKVILTTLLKVYVRSDLFEKSRVLLNELENLGYAADEMPYC 1027 NY+KAE LI++MRSAGL LNKVI TTLLKVYV+ LFEKS+ LL ELE LGYA DEMP+C Sbjct: 278 NYEKAEALIEEMRSAGLVLNKVIYTTLLKVYVKGGLFEKSKELLKELEALGYAKDEMPFC 337 Query: 1026 ILMDGLVKSGQTLEAKSVFDDMMKKKVRTDGYSYSIMISAFCHVGLLQEAKQLISDYEAN 847 +LMDGL KSG LEAKSVFD+MM+K+V+TDGYSYSIMISAFC GLL++AK+L S++E Sbjct: 338 LLMDGLAKSGHLLEAKSVFDEMMEKQVKTDGYSYSIMISAFCRRGLLEDAKKLASEFEEK 397 
Query: 846 YQKYDVVILNSMLCAYCRAGEMDNVXXXXXXMDELAISPDWNTFHILIKYFCSEKLYILA 667 Y KYD+VILN+ML AYCRAG+M+NV MD+ AISPDWNTF+ILI+YFC EKLY+LA Sbjct: 398 YDKYDIVILNAMLSAYCRAGKMENVMSMMKKMDDSAISPDWNTFNILIRYFCKEKLYLLA 457 Query: 666 YRTMEDMHKKGHQVEEELCCTLMFHLGKTGAYGEAFSVYNMLRYSKRTMNKSLHEKLLHI 487 YRTMEDMH KGHQ EE LC +L++HLGKTGA+ EAFSVYNMLRYSKRT++ +LHE +LHI Sbjct: 458 YRTMEDMHSKGHQPEEGLCSSLIYHLGKTGAHSEAFSVYNMLRYSKRTISNALHENILHI 517 Query: 486 LVRGRLLKEAYVIVKDNGGLISPSAIKKFATSFMQSGNINLVNDVIKTIHSFGFKINQEL 307 L+ GRLLK+AYV+VKDN G IS AIKKF+ +FM+SGN+NL+NDV+ +HS G KI+QEL Sbjct: 518 LIAGRLLKDAYVVVKDNAGFISQPAIKKFSVNFMRSGNVNLINDVMNAMHSSGHKIDQEL 577 Query: 306 FHSAVSRYLEETQKKDXXXXXXQWMPGQGYFVDSSTRNLILKNAHLFGRQTIAELLSKQH 127 F A++RY+ + +KK+ +WMP +GY +DSSTRNLILKN+HLFG Q IAE LSK Sbjct: 578 FDLAIARYIAKPEKKELLLWLLKWMPVKGYAIDSSTRNLILKNSHLFGHQLIAESLSKHL 637 Query: 126 IVS 118 ++S Sbjct: 638 VMS 640 >ref|XP_002299667.2| hypothetical protein POPTR_0001s21880g [Populus trichocarpa] gi|550347847|gb|EEE84472.2| hypothetical protein POPTR_0001s21880g [Populus trichocarpa] Length = 673 Score = 507 bits (1305), Expect = e-141 Identities = 250/364 (68%), Positives = 303/364 (83%) Frame = -1 Query: 1209 GNYKKAENLIQDMRSAGLELNKVILTTLLKVYVRSDLFEKSRVLLNELENLGYAADEMPY 1030 GNYKKAE L+QDM+S+GL NKVILTTLLKVYVR LFEKSR LL EL+ LG+A +EMPY Sbjct: 304 GNYKKAEELVQDMKSSGLVPNKVILTTLLKVYVRGGLFEKSRDLLVELDTLGFAKNEMPY 363 Query: 1029 CILMDGLVKSGQTLEAKSVFDDMMKKKVRTDGYSYSIMISAFCHVGLLQEAKQLISDYEA 850 C+LMDGL K+G EA+SVF++M +K+V++ GYSYSIMIS+FC GL +EAK+L ++EA Sbjct: 364 CLLMDGLAKNGLLDEARSVFNEMKEKRVKSGGYSYSIMISSFCRGGLFEEAKELAEEFEA 423 Query: 849 NYQKYDVVILNSMLCAYCRAGEMDNVXXXXXXMDELAISPDWNTFHILIKYFCSEKLYIL 670 Y KYDVVILN++LCAYCR GE ++V MDELAISPD+NTFHILIKYFC EKLY+L Sbjct: 424 KYDKYDVVILNTILCAYCRTGEKESVMRTMRKMDELAISPDYNTFHILIKYFCKEKLYML 483 Query: 669 AYRTMEDMHKKGHQVEEELCCTLMFHLGKTGAYGEAFSVYNMLRYSKRTMNKSLHEKLLH 490 AY+TMEDMH+KGHQ EELC +L+ HLGK A+ EAFSVY+ML+ SKRTM+K+ HE +LH Sbjct: 484 
AYQTMEDMHRKGHQPMEELCSSLILHLGKIKAHAEAFSVYSMLKSSKRTMSKAFHEDILH 543 Query: 489 ILVRGRLLKEAYVIVKDNGGLISPSAIKKFATSFMQSGNINLVNDVIKTIHSFGFKINQE 310 IL+ GRLLK+AYV+VKDN LISP+AIKKFA+SF++ G+INL+NDV+K IH G+KI+QE Sbjct: 544 ILIAGRLLKDAYVVVKDNAELISPAAIKKFASSFVKLGDINLINDVMKVIHGSGYKIDQE 603 Query: 309 LFHSAVSRYLEETQKKDXXXXXXQWMPGQGYFVDSSTRNLILKNAHLFGRQTIAELLSKQ 130 LF AVSRY+ E +KKD QWMPGQGY VDSSTRNLILKN+HLFGRQ IAE+LSKQ Sbjct: 604 LFLMAVSRYIAEPEKKDLLIQLLQWMPGQGYVVDSSTRNLILKNSHLFGRQLIAEILSKQ 663 Query: 129 HIVS 118 H+ S Sbjct: 664 HMTS 667 Score = 58.5 bits (140), Expect = 5e-06 Identities = 52/258 (20%), Positives = 104/258 (40%), Gaps = 2/258 (0%) Frame = -1 Query: 1206 NYKKAENLIQDMRSAGLELNKVILTTLLKVYVRSDLFEKSRVLLNELENLGYAADEMPYC 1027 N KA + + + N I +LL+ VR+ F+ S ++++N G D + Y Sbjct: 164 NPAKALEIYHSIPDESKKTNVFICNSLLRCLVRNTKFDSSMKFFHKMKNNGLTPDAITYS 223 Query: 1026 ILMDGL--VKSGQTLEAKSVFDDMMKKKVRTDGYSYSIMISAFCHVGLLQEAKQLISDYE 853 L+ G +K G + +A + ++ ++ D Y +++ +EA+ + + Sbjct: 224 TLLAGCMKIKDGYS-KALDLVQELNYNGLQMDSIMYGTLLAVCASNNRCEEAQSYFNQMK 282 Query: 852 ANYQKYDVVILNSMLCAYCRAGEMDNVXXXXXXMDELAISPDWNTFHILIKYFCSEKLYI 673 ++ +S+L AY G M + P+ L+K + L+ Sbjct: 283 DEGHSPNIFHYSSLLNAYSSDGNYKKAEELVQDMKSSGLVPNKVILTTLLKVYVRGGLFE 342 Query: 672 LAYRTMEDMHKKGHQVEEELCCTLMFHLGKTGAYGEAFSVYNMLRYSKRTMNKSLHEKLL 493 + + ++ G E C LM L K G EA SV+N ++ + + ++ Sbjct: 343 KSRDLLVELDTLGFAKNEMPYCLLMDGLAKNGLLDEARSVFNEMKEKRVKSGGYSYSIMI 402 Query: 492 HILVRGRLLKEAYVIVKD 439 RG L +EA + ++ Sbjct: 403 SSFCRGGLFEEAKELAEE 420 >ref|XP_006343484.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910, chloroplastic-like isoform X4 [Solanum tuberosum] Length = 539 Score = 505 bits (1301), Expect = e-140 Identities = 247/364 (67%), Positives = 302/364 (82%), Gaps = 1/364 (0%) Frame = -1 Query: 1206 NYKKAENLIQDMRSAGLELNKVILTTLLKVYVRSDLFEKSRVLLNELENLGYAADEMPYC 1027 NY+KAE LI++MRSAGL LNKVI TTLLKVYV+ LFEKS+ LL ELE LGYA DEMP+C Sbjct: 165 NYEKAEVLIEEMRSAGLVLNKVIYTTLLKVYVKGGLFEKSKELLKELEALGYAKDEMPFC 224 Query: 1026 
ILMDGLVKSGQTLEAKSVFDDMMKKKVRT-DGYSYSIMISAFCHVGLLQEAKQLISDYEA 850 +LMDGL KSG LEAKSVFD+MM+K V+T DGYSYSIMISAFC GLL++AK++ S++E Sbjct: 225 LLMDGLAKSGHLLEAKSVFDEMMEKHVKTADGYSYSIMISAFCRSGLLEDAKKVASEFEE 284 Query: 849 NYQKYDVVILNSMLCAYCRAGEMDNVXXXXXXMDELAISPDWNTFHILIKYFCSEKLYIL 670 Y KYD+VILN+ML AYCRAG+M+NV MD+ AISPDWNTF+ILI+YFC EKLY+L Sbjct: 285 KYDKYDIVILNAMLSAYCRAGKMENVMSMMKKMDDSAISPDWNTFNILIRYFCKEKLYLL 344 Query: 669 AYRTMEDMHKKGHQVEEELCCTLMFHLGKTGAYGEAFSVYNMLRYSKRTMNKSLHEKLLH 490 AYRTMEDMH KGHQ EE LC +L++HLGKTGA+ EAFSVYNMLRYSKRT++ +LHE +LH Sbjct: 345 AYRTMEDMHSKGHQPEEGLCSSLIYHLGKTGAHSEAFSVYNMLRYSKRTISNALHEHILH 404 Query: 489 ILVRGRLLKEAYVIVKDNGGLISPSAIKKFATSFMQSGNINLVNDVIKTIHSFGFKINQE 310 IL+ GRLLK+AYV+VKDN G IS AIKKF+ +FM+SGN+NL+NDV+ +HS G KI+QE Sbjct: 405 ILIAGRLLKDAYVVVKDNAGFISQPAIKKFSVNFMRSGNVNLINDVMNAMHSSGHKIDQE 464 Query: 309 LFHSAVSRYLEETQKKDXXXXXXQWMPGQGYFVDSSTRNLILKNAHLFGRQTIAELLSKQ 130 LF A++RY+ + +KK+ +WMPG+GY +DSSTRNLILKN+HLFG Q IAE LSK Sbjct: 465 LFDLAIARYIAKPEKKELLLWLLKWMPGKGYAIDSSTRNLILKNSHLFGHQLIAESLSKH 524 Query: 129 HIVS 118 ++S Sbjct: 525 LVMS 528 >ref|XP_006343483.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910, chloroplastic-like isoform X3 [Solanum tuberosum] Length = 646 Score = 505 bits (1301), Expect = e-140 Identities = 247/364 (67%), Positives = 302/364 (82%), Gaps = 1/364 (0%) Frame = -1 Query: 1206 NYKKAENLIQDMRSAGLELNKVILTTLLKVYVRSDLFEKSRVLLNELENLGYAADEMPYC 1027 NY+KAE LI++MRSAGL LNKVI TTLLKVYV+ LFEKS+ LL ELE LGYA DEMP+C Sbjct: 278 NYEKAEVLIEEMRSAGLVLNKVIYTTLLKVYVKGGLFEKSKELLKELEALGYAKDEMPFC 337 Query: 1026 ILMDGLVKSGQTLEAKSVFDDMMKKKVRT-DGYSYSIMISAFCHVGLLQEAKQLISDYEA 850 +LMDGL KSG LEAKSVFD+MM+K V+T DGYSYSIMISAFC GLL++AK++ S++E Sbjct: 338 LLMDGLAKSGHLLEAKSVFDEMMEKHVKTADGYSYSIMISAFCRSGLLEDAKKVASEFEE 397 Query: 849 NYQKYDVVILNSMLCAYCRAGEMDNVXXXXXXMDELAISPDWNTFHILIKYFCSEKLYIL 670 Y KYD+VILN+ML AYCRAG+M+NV MD+ AISPDWNTF+ILI+YFC EKLY+L Sbjct: 398 KYDKYDIVILNAMLSAYCRAGKMENVMSMMKKMDDSAISPDWNTFNILIRYFCKEKLYLL 457 Query: 669 
AYRTMEDMHKKGHQVEEELCCTLMFHLGKTGAYGEAFSVYNMLRYSKRTMNKSLHEKLLH 490 AYRTMEDMH KGHQ EE LC +L++HLGKTGA+ EAFSVYNMLRYSKRT++ +LHE +LH Sbjct: 458 AYRTMEDMHSKGHQPEEGLCSSLIYHLGKTGAHSEAFSVYNMLRYSKRTISNALHEHILH 517 Query: 489 ILVRGRLLKEAYVIVKDNGGLISPSAIKKFATSFMQSGNINLVNDVIKTIHSFGFKINQE 310 IL+ GRLLK+AYV+VKDN G IS AIKKF+ +FM+SGN+NL+NDV+ +HS G KI+QE Sbjct: 518 ILIAGRLLKDAYVVVKDNAGFISQPAIKKFSVNFMRSGNVNLINDVMNAMHSSGHKIDQE 577 Query: 309 LFHSAVSRYLEETQKKDXXXXXXQWMPGQGYFVDSSTRNLILKNAHLFGRQTIAELLSKQ 130 LF A++RY+ + +KK+ +WMPG+GY +DSSTRNLILKN+HLFG Q IAE LSK Sbjct: 578 LFDLAIARYIAKPEKKELLLWLLKWMPGKGYAIDSSTRNLILKNSHLFGHQLIAESLSKH 637 Query: 129 HIVS 118 ++S Sbjct: 638 LVMS 641 >ref|XP_006343481.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910, chloroplastic-like isoform X1 [Solanum tuberosum] Length = 652 Score = 505 bits (1301), Expect = e-140 Identities = 247/364 (67%), Positives = 302/364 (82%), Gaps = 1/364 (0%) Frame = -1 Query: 1206 NYKKAENLIQDMRSAGLELNKVILTTLLKVYVRSDLFEKSRVLLNELENLGYAADEMPYC 1027 NY+KAE LI++MRSAGL LNKVI TTLLKVYV+ LFEKS+ LL ELE LGYA DEMP+C Sbjct: 278 NYEKAEVLIEEMRSAGLVLNKVIYTTLLKVYVKGGLFEKSKELLKELEALGYAKDEMPFC 337 Query: 1026 ILMDGLVKSGQTLEAKSVFDDMMKKKVRT-DGYSYSIMISAFCHVGLLQEAKQLISDYEA 850 +LMDGL KSG LEAKSVFD+MM+K V+T DGYSYSIMISAFC GLL++AK++ S++E Sbjct: 338 LLMDGLAKSGHLLEAKSVFDEMMEKHVKTADGYSYSIMISAFCRSGLLEDAKKVASEFEE 397 Query: 849 NYQKYDVVILNSMLCAYCRAGEMDNVXXXXXXMDELAISPDWNTFHILIKYFCSEKLYIL 670 Y KYD+VILN+ML AYCRAG+M+NV MD+ AISPDWNTF+ILI+YFC EKLY+L Sbjct: 398 KYDKYDIVILNAMLSAYCRAGKMENVMSMMKKMDDSAISPDWNTFNILIRYFCKEKLYLL 457 Query: 669 AYRTMEDMHKKGHQVEEELCCTLMFHLGKTGAYGEAFSVYNMLRYSKRTMNKSLHEKLLH 490 AYRTMEDMH KGHQ EE LC +L++HLGKTGA+ EAFSVYNMLRYSKRT++ +LHE +LH Sbjct: 458 AYRTMEDMHSKGHQPEEGLCSSLIYHLGKTGAHSEAFSVYNMLRYSKRTISNALHEHILH 517 Query: 489 ILVRGRLLKEAYVIVKDNGGLISPSAIKKFATSFMQSGNINLVNDVIKTIHSFGFKINQE 310 IL+ GRLLK+AYV+VKDN G IS AIKKF+ +FM+SGN+NL+NDV+ +HS G KI+QE Sbjct: 518 ILIAGRLLKDAYVVVKDNAGFISQPAIKKFSVNFMRSGNVNLINDVMNAMHSSGHKIDQE 577 Query: 
309 LFHSAVSRYLEETQKKDXXXXXXQWMPGQGYFVDSSTRNLILKNAHLFGRQTIAELLSKQ 130 LF A++RY+ + +KK+ +WMPG+GY +DSSTRNLILKN+HLFG Q IAE LSK Sbjct: 578 LFDLAIARYIAKPEKKELLLWLLKWMPGKGYAIDSSTRNLILKNSHLFGHQLIAESLSKH 637 Query: 129 HIVS 118 ++S Sbjct: 638 LVMS 641 >ref|XP_002529286.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223531275|gb|EEF33118.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 672 Score = 505 bits (1300), Expect = e-140 Identities = 251/366 (68%), Positives = 301/366 (82%), Gaps = 2/366 (0%) Frame = -1 Query: 1209 GNYKKAENLIQDMRSAGLELNKVILTTLLKVYVRSDLFEKSRVLLNELENLGYAADEMPY 1030 GNYKKAE L+QDM+S GL NKVI TTLLKVYVR LFEKS+ LL ELE LGYA DEMPY Sbjct: 301 GNYKKAEELVQDMKSLGLVPNKVIWTTLLKVYVRGGLFEKSQQLLLELETLGYAEDEMPY 360 Query: 1029 CILMDGLVKSGQTLEAKSVFDDMMKKKVRTDGYSYSIMISAFCHVGLLQEAKQLISDYEA 850 C+LMDGL K+G+ EA+S FD+M +K V++DGY+YSIMISA+C LL+EAKQL ++EA Sbjct: 361 CLLMDGLSKAGRVDEARSFFDEMKEKNVKSDGYAYSIMISAYCRGRLLEEAKQLAKEFEA 420 Query: 849 NYQKYDVVILNSMLCAYCRAGEMDNVXXXXXXMDELAISPDWNTFHILIKYFCSEKLYIL 670 Y KYDVVILN+MLCAYCRAG+M++V MDELAISP + TFHILIKYFC +KLY+L Sbjct: 421 KYDKYDVVILNTMLCAYCRAGDMESVMQTMRKMDELAISPSYCTFHILIKYFCKQKLYLL 480 Query: 669 AYRTMEDMHKKGHQVEEELCCTLMFHLGKTGAYGEAFSVYNMLRYSKRTMNKSLHEKLLH 490 AY+TMEDMH+KGHQ EEELC L+FHLGK AY EAFSVY ML+Y KRTM K+LHEK+LH Sbjct: 481 AYQTMEDMHRKGHQPEEELCSMLIFHLGKAKAYTEAFSVYTMLKYGKRTMCKALHEKILH 540 Query: 489 ILVRGRLLKEAYVIVKDNGGLISPSAIKKFATSFMQSGNINLVNDVIKTIHSFGFKINQ- 313 +L+ G+LLK+AYV+VKDN LIS +AIKKFA +FM+ GNINL+NDV+K IHS G+KI+Q Sbjct: 541 VLLGGQLLKDAYVVVKDNAELISQAAIKKFANAFMKLGNINLINDVMKVIHSSGYKIDQA 600 Query: 312 -ELFHSAVSRYLEETQKKDXXXXXXQWMPGQGYFVDSSTRNLILKNAHLFGRQTIAELLS 136 ELF A+SRY+ + +KKD QWMPG GY VD+STRNLILK++HLFGRQ IAE+LS Sbjct: 601 SELFQMAISRYIAQPEKKDLLVQLLQWMPGHGYVVDASTRNLILKSSHLFGRQLIAEILS 660 Query: 135 KQHIVS 118 KQHI+S Sbjct: 661 KQHIIS 666 >ref|XP_004141206.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910, chloroplastic-like [Cucumis 
sativus] Length = 668 Score = 504 bits (1299), Expect = e-140 Identities = 247/366 (67%), Positives = 299/366 (81%) Frame = -1 Query: 1215 VHGNYKKAENLIQDMRSAGLELNKVILTTLLKVYVRSDLFEKSRVLLNELENLGYAADEM 1036 ++G+YKKA+ LI+DM+ GL NKVILTTLLKVYVR LFEKSR LL+ELE+LGY +EM Sbjct: 290 INGDYKKADELIEDMKLTGLVPNKVILTTLLKVYVRGGLFEKSRKLLSELESLGYGENEM 349 Query: 1035 PYCILMDGLVKSGQTLEAKSVFDDMMKKKVRTDGYSYSIMISAFCHVGLLQEAKQLISDY 856 PYC+LMDGL K+G EAK+VFD+M K V+TDGY++SIMISAFC GLL+EAK L D+ Sbjct: 350 PYCLLMDGLAKAGSIREAKTVFDEMKAKNVKTDGYAHSIMISAFCRGGLLEEAKLLAKDF 409 Query: 855 EANYQKYDVVILNSMLCAYCRAGEMDNVXXXXXXMDELAISPDWNTFHILIKYFCSEKLY 676 EA Y +YD+VILN+MLCAYCRAGEM++V MD+LAISPD+NTFHILIKYF EKLY Sbjct: 410 EATYDRYDIVILNTMLCAYCRAGEMESVMQMLRKMDDLAISPDYNTFHILIKYFFKEKLY 469 Query: 675 ILAYRTMEDMHKKGHQVEEELCCTLMFHLGKTGAYGEAFSVYNMLRYSKRTMNKSLHEKL 496 +L YRT+EDMH+KGHQ EEELC +L+ LG AY EAFSVYN+L+YSKRTM K+LHEK+ Sbjct: 470 LLCYRTLEDMHRKGHQPEEELCSSLILSLGNIRAYSEAFSVYNILKYSKRTMCKALHEKI 529 Query: 495 LHILVRGRLLKEAYVIVKDNGGLISPSAIKKFATSFMQSGNINLVNDVIKTIHSFGFKIN 316 LHIL+ GRLLK+AYV+VKDN G+IS AI+KFA FM+ GN+NL+NDV+K IH G+KI+ Sbjct: 530 LHILIAGRLLKDAYVVVKDNAGVISKPAIRKFAFGFMKFGNVNLINDVMKAIHGSGYKID 589 Query: 315 QELFHSAVSRYLEETQKKDXXXXXXQWMPGQGYFVDSSTRNLILKNAHLFGRQTIAELLS 136 Q+LF A SRY+E +KKD +WMPGQGY VDSSTRNLILKNAHLFGRQ IAE+LS Sbjct: 590 QDLFMIATSRYIELPEKKDLFIQLLKWMPGQGYVVDSSTRNLILKNAHLFGRQLIAEILS 649 Query: 135 KQHIVS 118 K ++S Sbjct: 650 KHSLLS 655 >gb|EMJ26349.1| hypothetical protein PRUPE_ppa002505mg [Prunus persica] Length = 664 Score = 501 bits (1290), Expect = e-139 Identities = 245/366 (66%), Positives = 301/366 (82%) Frame = -1 Query: 1215 VHGNYKKAENLIQDMRSAGLELNKVILTTLLKVYVRSDLFEKSRVLLNELENLGYAADEM 1036 + GNYK+A++L+QDM+SAGL NKVILTTLLKVYVR LFEKSR LL ELE LGYA DEM Sbjct: 287 ISGNYKEADDLVQDMKSAGLVPNKVILTTLLKVYVRGGLFEKSRELLAELEALGYAEDEM 346 Query: 1035 PYCILMDGLVKSGQTLEAKSVFDDMMKKKVRTDGYSYSIMISAFCHVGLLQEAKQLISDY 856 PYC+LMD L K+G+ EAK VFD+M +K +R++GYSYSIMISAFC GLL++AKQL D Sbjct: 347 
PYCLLMDALAKAGRIHEAKLVFDEMKEKSIRSNGYSYSIMISAFCRGGLLEDAKQLSKDV 406 Query: 855 EANYQKYDVVILNSMLCAYCRAGEMDNVXXXXXXMDELAISPDWNTFHILIKYFCSEKLY 676 E + K+D+V+LN+M+CAYCRAGEMD+V MDE I+PD+NTFHILIKYFC EKLY Sbjct: 407 ERTHDKFDLVMLNTMICAYCRAGEMDSVMEMMRKMDEQKITPDYNTFHILIKYFCKEKLY 466 Query: 675 ILAYRTMEDMHKKGHQVEEELCCTLMFHLGKTGAYGEAFSVYNMLRYSKRTMNKSLHEKL 496 +LAY+TMEDMH KGHQ +EELC +LMF LGK AY EA+SVYN+LRYSKRTM K+LHEK+ Sbjct: 467 LLAYQTMEDMHNKGHQPDEELCSSLMFLLGKIRAYSEAYSVYNILRYSKRTMCKALHEKI 526 Query: 495 LHILVRGRLLKEAYVIVKDNGGLISPSAIKKFATSFMQSGNINLVNDVIKTIHSFGFKIN 316 LHIL+ G+LLK+AYV+VKDN GLIS A+KKF+T+F++ GNINL+NDV+K I + G KI+ Sbjct: 527 LHILLAGQLLKDAYVVVKDNAGLISKPAVKKFSTAFLKLGNINLINDVLKVIDASGCKID 586 Query: 315 QELFHSAVSRYLEETQKKDXXXXXXQWMPGQGYFVDSSTRNLILKNAHLFGRQTIAELLS 136 Q LF A+SRY+ +KK+ WMPGQGY VDS+TRNLILKN+HLFGRQ IA++LS Sbjct: 587 QGLFQMAISRYIALPEKKELLIQMLLWMPGQGYVVDSATRNLILKNSHLFGRQHIADVLS 646 Query: 135 KQHIVS 118 KQH++S Sbjct: 647 KQHMIS 652 >ref|XP_004299940.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910, chloroplastic-like [Fragaria vesca subsp. 
vesca] Length = 642 Score = 500 bits (1287), Expect = e-139 Identities = 247/366 (67%), Positives = 295/366 (80%) Frame = -1 Query: 1215 VHGNYKKAENLIQDMRSAGLELNKVILTTLLKVYVRSDLFEKSRVLLNELENLGYAADEM 1036 + GNYKKA++++QDM+SAGL NKV LTTLLK YVR LFEKSR LL ELE LGYA DEM Sbjct: 271 ISGNYKKADDVVQDMKSAGLVPNKVTLTTLLKAYVRGGLFEKSRELLTELEALGYAEDEM 330 Query: 1035 PYCILMDGLVKSGQTLEAKSVFDDMMKKKVRTDGYSYSIMISAFCHVGLLQEAKQLISDY 856 PYCILMD K+G+ +AK VFD++ +K VR+DGYSYSIMISAFC GL+ +AKQL D+ Sbjct: 331 PYCILMDAFAKAGRIEDAKLVFDEIKEKSVRSDGYSYSIMISAFCRGGLVDDAKQLAKDF 390 Query: 855 EANYQKYDVVILNSMLCAYCRAGEMDNVXXXXXXMDELAISPDWNTFHILIKYFCSEKLY 676 E Y KYD+V+LN+M+CAYCRAGEMD+V MDEL I+PD NTFHILIKYFC EKLY Sbjct: 391 ERTYDKYDLVMLNTMICAYCRAGEMDSVMEMLRKMDELKITPDNNTFHILIKYFCKEKLY 450 Query: 675 ILAYRTMEDMHKKGHQVEEELCCTLMFHLGKTGAYGEAFSVYNMLRYSKRTMNKSLHEKL 496 +LAY+TMEDMH KG+ +EELC +LMFHLGK AY EA+S+YN+LRYSKRTM K+LHEK+ Sbjct: 451 MLAYKTMEDMHNKGYPPDEELCSSLMFHLGKIRAYSEAYSIYNILRYSKRTMCKALHEKI 510 Query: 495 LHILVRGRLLKEAYVIVKDNGGLISPSAIKKFATSFMQSGNINLVNDVIKTIHSFGFKIN 316 LHILV GRLLK+AYV+VKDN LIS +A KFAT+FM+ GNINL+NDV+K I G KI+ Sbjct: 511 LHILVAGRLLKDAYVVVKDNPRLISKAATMKFATAFMKLGNINLINDVLKAIDGSGCKID 570 Query: 315 QELFHSAVSRYLEETQKKDXXXXXXQWMPGQGYFVDSSTRNLILKNAHLFGRQTIAELLS 136 Q +F A+SRY+ + KKD QWMPGQGY VDSSTRNLILKN+HLF RQ IAE+LS Sbjct: 571 QGIFQMAISRYISDPDKKDLLLQLLQWMPGQGYTVDSSTRNLILKNSHLFDRQHIAEMLS 630 Query: 135 KQHIVS 118 KQH++S Sbjct: 631 KQHMIS 636 >ref|XP_003531588.2| PREDICTED: pentatricopeptide repeat-containing protein At1g10910, chloroplastic-like [Glycine max] Length = 646 Score = 499 bits (1286), Expect = e-139 Identities = 247/362 (68%), Positives = 294/362 (81%) Frame = -1 Query: 1209 GNYKKAENLIQDMRSAGLELNKVILTTLLKVYVRSDLFEKSRVLLNELENLGYAADEMPY 1030 GNYKKA+ LIQDM+S GL NKVILTTLLKVYV+ LFEKSR LL EL++LGYA DEMPY Sbjct: 276 GNYKKADMLIQDMKSEGLVPNKVILTTLLKVYVKGGLFEKSRELLAELKSLGYAEDEMPY 335 Query: 1029 CILMDGLVKSGQTLEAKSVFDDMMKKKVRTDGYSYSIMISAFCHVGLLQEAKQLISDYEA 850 CI MDGL K+GQ EAK +FD+MMK 
VR+DGY++SIMISAFC L +EAKQL D+E Sbjct: 336 CIFMDGLAKAGQIHEAKLIFDEMMKNHVRSDGYAHSIMISAFCRAKLFREAKQLAKDFET 395 Query: 849 NYQKYDVVILNSMLCAYCRAGEMDNVXXXXXXMDELAISPDWNTFHILIKYFCSEKLYIL 670 KYD+VILNSMLCA+CR GEM+ V MDELAI+P +NTFHILIKYFC EK+Y+L Sbjct: 396 TSNKYDLVILNSMLCAFCRVGEMERVMETLKKMDELAINPGYNTFHILIKYFCREKMYLL 455 Query: 669 AYRTMEDMHKKGHQVEEELCCTLMFHLGKTGAYGEAFSVYNMLRYSKRTMNKSLHEKLLH 490 AYRTM+DMH KGHQ EELC +L+ HLG+ AY EAFSVYNML+YSKRTM KSLHEK+LH Sbjct: 456 AYRTMKDMHSKGHQPVEELCSSLISHLGQVNAYSEAFSVYNMLKYSKRTMCKSLHEKILH 515 Query: 489 ILVRGRLLKEAYVIVKDNGGLISPSAIKKFATSFMQSGNINLVNDVIKTIHSFGFKINQE 310 IL+ G+LLK+AYV+VKDN IS A KKFA++FM+SGN+N +NDV+KT+H G+K++Q+ Sbjct: 516 ILLAGQLLKDAYVVVKDNAKFISRPATKKFASAFMKSGNLNYINDVLKTLHDCGYKLDQD 575 Query: 309 LFHSAVSRYLEETQKKDXXXXXXQWMPGQGYFVDSSTRNLILKNAHLFGRQTIAELLSKQ 130 LF AVSRYL++ +KKD QWM GQGY VDSSTRNLILKN+HLFGRQ IAE+LSKQ Sbjct: 576 LFAMAVSRYLDQPEKKDLLLHLLQWMAGQGYAVDSSTRNLILKNSHLFGRQLIAEVLSKQ 635 Query: 129 HI 124 + Sbjct: 636 QV 637 >ref|XP_004486236.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910, chloroplastic-like [Cicer arietinum] Length = 642 Score = 496 bits (1276), Expect = e-137 Identities = 241/362 (66%), Positives = 297/362 (82%) Frame = -1 Query: 1209 GNYKKAENLIQDMRSAGLELNKVILTTLLKVYVRSDLFEKSRVLLNELENLGYAADEMPY 1030 GN+KKA++LIQDM+S GL NKVILTTLLKVYVR L EKSR LL +LE+L YA DEMPY Sbjct: 272 GNFKKADSLIQDMKSEGLVPNKVILTTLLKVYVRGGLLEKSRELLTKLESLSYAEDEMPY 331 Query: 1029 CILMDGLVKSGQTLEAKSVFDDMMKKKVRTDGYSYSIMISAFCHVGLLQEAKQLISDYEA 850 C+LMDGL K+GQ EAK VFD+MMKK VR+DGY++SIMISAFC L +EAKQL +++ Sbjct: 332 CVLMDGLAKAGQVHEAKIVFDEMMKKHVRSDGYAHSIMISAFCRAKLFEEAKQLAKNFQT 391 Query: 849 NYQKYDVVILNSMLCAYCRAGEMDNVXXXXXXMDELAISPDWNTFHILIKYFCSEKLYIL 670 + KYDVVI+NSMLCA+CRAGEM++V MDELAISPD+NTF+ILIKYFC + +Y+L Sbjct: 392 TFNKYDVVIMNSMLCAFCRAGEMESVMETLRKMDELAISPDYNTFNILIKYFCRQNMYLL 451 Query: 669 AYRTMEDMHKKGHQVEEELCCTLMFHLGKTGAYGEAFSVYNMLRYSKRTMNKSLHEKLLH 490 AY+TMEDMH KG+Q EELC +L++HLG+ AY EAFSVYNML+YSKRT+ K+LHEK+LH Sbjct: 452 
AYQTMEDMHSKGYQPVEELCSSLIYHLGQANAYSEAFSVYNMLKYSKRTIRKTLHEKILH 511 Query: 489 ILVRGRLLKEAYVIVKDNGGLISPSAIKKFATSFMQSGNINLVNDVIKTIHSFGFKINQE 310 IL+ G+LLK+AYV+ KDN IS KKFA++FM+ GNINL+NDV+KT+H+ G+KI+Q+ Sbjct: 512 ILLAGKLLKDAYVVFKDNATFISGHTTKKFASAFMKLGNINLINDVMKTLHNCGYKIDQD 571 Query: 309 LFHSAVSRYLEETQKKDXXXXXXQWMPGQGYFVDSSTRNLILKNAHLFGRQTIAELLSKQ 130 LF AV+RYL + +KKD QWMPGQGY VD STRNLILKN+HLFGRQ IAE+LSKQ Sbjct: 572 LFEMAVTRYLGQPEKKDLLLHLLQWMPGQGYVVDPSTRNLILKNSHLFGRQLIAEVLSKQ 631 Query: 129 HI 124 + Sbjct: 632 RV 633 >ref|XP_006431883.1| hypothetical protein CICLE_v10000525mg [Citrus clementina] gi|557534005|gb|ESR45123.1| hypothetical protein CICLE_v10000525mg [Citrus clementina] Length = 660 Score = 494 bits (1272), Expect = e-137 Identities = 242/364 (66%), Positives = 298/364 (81%) Frame = -1 Query: 1209 GNYKKAENLIQDMRSAGLELNKVILTTLLKVYVRSDLFEKSRVLLNELENLGYAADEMPY 1030 G+Y KA+ LIQDM+S+GL NKVILTTLLKVYVR LFEKSR LL EL+ LGYA +EMPY Sbjct: 289 GDYTKADELIQDMKSSGLVPNKVILTTLLKVYVRGGLFEKSRELLAELDTLGYAENEMPY 348 Query: 1029 CILMDGLVKSGQTLEAKSVFDDMMKKKVRTDGYSYSIMISAFCHVGLLQEAKQLISDYEA 850 C+LMDGL K+G EA+ VF++M +K V++DGY++SIMISAFC G +EAKQL D+EA Sbjct: 349 CLLMDGLSKAGCLDEARVVFNEMQEKCVKSDGYAHSIMISAFCRGGCFEEAKQLAGDFEA 408 Query: 849 NYQKYDVVILNSMLCAYCRAGEMDNVXXXXXXMDELAISPDWNTFHILIKYFCSEKLYIL 670 Y KYDVV+LNSMLCAYCR G+M++V +DELAISPD+NTFHILIKYFC EK+YIL Sbjct: 409 KYDKYDVVLLNSMLCAYCRTGDMESVMHVMRKLDELAISPDYNTFHILIKYFCKEKMYIL 468 Query: 669 AYRTMEDMHKKGHQVEEELCCTLMFHLGKTGAYGEAFSVYNMLRYSKRTMNKSLHEKLLH 490 AYRTM DMH+KGHQ EEELC +L+FHLGK A+ EA SVYNMLRYSKR+M K+LHEK+LH Sbjct: 469 AYRTMVDMHRKGHQPEEELCSSLIFHLGKMRAHSEALSVYNMLRYSKRSMCKALHEKILH 528 Query: 489 ILVRGRLLKEAYVIVKDNGGLISPSAIKKFATSFMQSGNINLVNDVIKTIHSFGFKINQE 310 IL+ G+LLK+AYV+VKDN IS IKKFA++F++ GNINLVNDV+K IH+ G++I+Q Sbjct: 529 ILISGKLLKDAYVVVKDNSESISHPVIKKFASAFVRLGNINLVNDVMKAIHTTGYRIDQG 588 Query: 309 LFHSAVSRYLEETQKKDXXXXXXQWMPGQGYFVDSSTRNLILKNAHLFGRQTIAELLSKQ 130 +FH A++RY+ E +KK+ +WM GQGY VDSSTRNLILKN+HL GRQ IA++LSKQ Sbjct: 589 
IFHIAIARYIAEREKKELLLKLLEWMTGQGYVVDSSTRNLILKNSHLLGRQLIADILSKQ 648 Query: 129 HIVS 118 H+ S Sbjct: 649 HMKS 652 Score = 67.8 bits (164), Expect = 9e-09 Identities = 60/316 (18%), Positives = 122/316 (38%), Gaps = 4/316 (1%) Frame = -1 Query: 1209 GNYKKAENLIQDMRSAGLELNKVILTTLLKVYVRSDLFEKSRVLLNELENLGYAADEMPY 1030 GN KA + + ++N I ++L VR+ FE S L ++++ G D + Y Sbjct: 148 GNSLKALEIYNSITDESDKVNVFICNSILSCLVRNGKFESSLKLFDKMKQSGLTPDAVTY 207 Query: 1029 CILMDGLVKSGQTL-EAKSVFDDMMKKKVRTDGYSYSIMISAFCHVGLLQEAKQLISDYE 853 L+ G +K +A + ++ + D Y I+++ L +A+ + + Sbjct: 208 NTLLTGCIKDKNGYSKALELVQELKYNGAQMDNVMYGILLAICASNNLCAKAQSYFNQMK 267 Query: 852 ANYQKYDVVILNSMLCAYCRAGEMDNVXXXXXXMDELAISPDWNTFHILIKYFCSEKLYI 673 +V +S+L AY G+ M + P+ L+K + L+ Sbjct: 268 VEGHSPNVYHYSSLLNAYSSGGDYTKADELIQDMKSSGLVPNKVILTTLLKVYVRGGLFE 327 Query: 672 LAYRTMEDMHKKGHQVEEELCCTLMFHLGKTGAYGEAFSVYNMLRYSKRTMNKSLHEKLL 493 + + ++ G+ E C LM L K G EA V+N ++ + H ++ Sbjct: 328 KSRELLAELDTLGYAENEMPYCLLMDGLSKAGCLDEARVVFNEMQEKCVKSDGYAHSIMI 387 Query: 492 HILVRGRLLKEAYVIVKD---NGGLISPSAIKKFATSFMQSGNINLVNDVIKTIHSFGFK 322 RG +EA + D + ++ ++G++ V V++ + Sbjct: 388 SAFCRGGCFEEAKQLAGDFEAKYDKYDVVLLNSMLCAYCRTGDMESVMHVMRKLDELAIS 447 Query: 321 INQELFHSAVSRYLEE 274 + FH + + +E Sbjct: 448 PDYNTFHILIKYFCKE 463 >gb|ESW20506.1| hypothetical protein PHAVU_006G214900g [Phaseolus vulgaris] Length = 639 Score = 492 bits (1267), Expect = e-136 Identities = 246/362 (67%), Positives = 289/362 (79%) Frame = -1 Query: 1209 GNYKKAENLIQDMRSAGLELNKVILTTLLKVYVRSDLFEKSRVLLNELENLGYAADEMPY 1030 GNYKKA+ L QDM+S GL NKVILTTLLKVYV+ LF+KSR LL EL++LGYA DEMPY Sbjct: 269 GNYKKADILFQDMKSEGLVPNKVILTTLLKVYVKGGLFDKSRELLAELKSLGYAEDEMPY 328 Query: 1029 CILMDGLVKSGQTLEAKSVFDDMMKKKVRTDGYSYSIMISAFCHVGLLQEAKQLISDYEA 850 CILMDGL K+GQ EAK +FD+MMK VR+DGY++SIMISA C L +EAKQL D+E Sbjct: 329 CILMDGLAKAGQIHEAKLIFDEMMKNHVRSDGYAHSIMISALCRSKLFREAKQLAKDFET 388 Query: 849 NYQKYDVVILNSMLCAYCRAGEMDNVXXXXXXMDELAISPDWNTFHILIKYFCSEKLYIL 670 KYD+VILNSMLCA+CR GEM++V MDELAISP +NTFHILIKYFC EK+Y+L 
Sbjct: 389 TSNKYDIVILNSMLCAFCRVGEMESVMETLKKMDELAISPSYNTFHILIKYFCREKMYLL 448 Query: 669 AYRTMEDMHKKGHQVEEELCCTLMFHLGKTGAYGEAFSVYNMLRYSKRTMNKSLHEKLLH 490 AYRTM+DMH KGHQ EELC TL+ HLG+ AY EAFSVYNMLRY KRTM KSLHEK+L+ Sbjct: 449 AYRTMKDMHSKGHQPGEELCSTLISHLGQVNAYSEAFSVYNMLRYGKRTMCKSLHEKILY 508 Query: 489 ILVRGRLLKEAYVIVKDNGGLISPSAIKKFATSFMQSGNINLVNDVIKTIHSFGFKINQE 310 IL+ G LLK+AYV+VKDN IS KKFA +FM+SGNIN +NDV+KT+H G+K++Q+ Sbjct: 509 ILLAGHLLKDAYVVVKDNAKYISRPPTKKFAIAFMKSGNINYINDVLKTLHDSGYKLDQD 568 Query: 309 LFHSAVSRYLEETQKKDXXXXXXQWMPGQGYFVDSSTRNLILKNAHLFGRQTIAELLSKQ 130 LF AVSRYL E +KKD QWM GQGY VDSSTRNLILK++HLFGRQ IAE+LSKQ Sbjct: 569 LFAMAVSRYLGEPEKKDLLLHLLQWMSGQGYMVDSSTRNLILKHSHLFGRQLIAEVLSKQ 628 Query: 129 HI 124 + Sbjct: 629 QV 630 >gb|EOX98058.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma cacao] Length = 717 Score = 488 bits (1256), Expect = e-135 Identities = 242/364 (66%), Positives = 295/364 (81%) Frame = -1 Query: 1209 GNYKKAENLIQDMRSAGLELNKVILTTLLKVYVRSDLFEKSRVLLNELENLGYAADEMPY 1030 GNY KA+ L++ M+S+GL NKVILTTLLKVYVR LFEKS LL ELE LGYA DEMP+ Sbjct: 284 GNYCKADELVEQMKSSGLVPNKVILTTLLKVYVRGGLFEKSTKLLAELEALGYAEDEMPF 343 Query: 1029 CILMDGLVKSGQTLEAKSVFDDMMKKKVRTDGYSYSIMISAFCHVGLLQEAKQLISDYEA 850 C+LMDGL K+G+ EA+SVF +M +K V++DGYS+SIMISA C GL +EAK+L D+EA Sbjct: 344 CLLMDGLSKAGRLDEARSVFVEMQQKCVKSDGYSHSIMISALCRAGLFEEAKELAQDFEA 403 Query: 849 NYQKYDVVILNSMLCAYCRAGEMDNVXXXXXXMDELAISPDWNTFHILIKYFCSEKLYIL 670 Y KYD+V+LN+MLCAYCRAGEM++V MDELAISPD+NTFHILIKYFC EKLY+L Sbjct: 404 QYNKYDLVMLNTMLCAYCRAGEMESVMQTMKKMDELAISPDYNTFHILIKYFCKEKLYLL 463 Query: 669 AYRTMEDMHKKGHQVEEELCCTLMFHLGKTGAYGEAFSVYNMLRYSKRTMNKSLHEKLLH 490 AY+TMEDMH KG+ EEELC +L+F LGK A+ EAFSVYNMLRYSKRTM K+LHEK+LH Sbjct: 464 AYKTMEDMHGKGYHPEEELCSSLIFQLGKMKAHLEAFSVYNMLRYSKRTMCKALHEKILH 523 Query: 489 ILVRGRLLKEAYVIVKDNGGLISPSAIKKFATSFMQSGNINLVNDVIKTIHSFGFKINQE 310 IL+ G+LLK+AYV+VKDN LIS AI KFAT+FM+ GNIN++NDV+K +H G+KI+Q Sbjct: 524 ILIAGQLLKDAYVVVKDNAELISQPAITKFATAFMKLGNINMINDVLKVLHGSGYKIDQG 583 
Query: 309 LFHSAVSRYLEETQKKDXXXXXXQWMPGQGYFVDSSTRNLILKNAHLFGRQTIAELLSKQ 130 LF A+SRYL + +KK+ QWMPG GY VDSSTRN+ILKN+ L GRQ AE+LSKQ Sbjct: 584 LFQMAISRYLGQPEKKELLLQLLQWMPGHGYVVDSSTRNMILKNSQLLGRQLTAEILSKQ 643 Query: 129 HIVS 118 H++S Sbjct: 644 HMMS 647 >ref|XP_003593032.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355482080|gb|AES63283.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 627 Score = 483 bits (1243), Expect = e-134 Identities = 238/362 (65%), Positives = 292/362 (80%) Frame = -1 Query: 1209 GNYKKAENLIQDMRSAGLELNKVILTTLLKVYVRSDLFEKSRVLLNELENLGYAADEMPY 1030 G++ KA+ LIQDM S GL NKVILTTLLKVYVR LFEKSR LL +LE+LGYA DEMPY Sbjct: 258 GDFTKADALIQDMESEGLAPNKVILTTLLKVYVRGGLFEKSRELLAKLESLGYAEDEMPY 317 Query: 1029 CILMDGLVKSGQTLEAKSVFDDMMKKKVRTDGYSYSIMISAFCHVGLLQEAKQLISDYEA 850 C+LMDGL K+ QT EAK +FD+MMKK V +DGY++SI+ISAFC L QEAKQL D++ Sbjct: 318 CVLMDGLAKARQTHEAKIIFDEMMKKHVMSDGYAHSIIISAFCRAKLFQEAKQLAKDFQT 377 Query: 849 NYQKYDVVILNSMLCAYCRAGEMDNVXXXXXXMDELAISPDWNTFHILIKYFCSEKLYIL 670 + KYDVVI+NSMLCA+CRAGEM++V MDELAISPD+NTF+ILIKYFC + +Y+L Sbjct: 378 TFDKYDVVIMNSMLCAFCRAGEMESVMETLRKMDELAISPDYNTFNILIKYFCRKNMYLL 437 Query: 669 AYRTMEDMHKKGHQVEEELCCTLMFHLGKTGAYGEAFSVYNMLRYSKRTMNKSLHEKLLH 490 AYRT DMH KG+Q EELC +L++HLG+ A EAFS+YNMLRYSKRT+ K+LHEK+LH Sbjct: 438 AYRTTMDMHSKGYQPAEELCSSLIYHLGQENASSEAFSLYNMLRYSKRTIGKALHEKILH 497 Query: 489 ILVRGRLLKEAYVIVKDNGGLISPSAIKKFATSFMQSGNINLVNDVIKTIHSFGFKINQE 310 IL+ G+LLK+AYV+ KDN IS KKFA++FM+SGNINL+NDV+KT+H+ G+KI+Q Sbjct: 498 ILLAGKLLKDAYVVFKDNATSISGPTTKKFASAFMKSGNINLINDVMKTLHNCGYKIDQG 557 Query: 309 LFHSAVSRYLEETQKKDXXXXXXQWMPGQGYFVDSSTRNLILKNAHLFGRQTIAELLSKQ 130 LF AVSRYL + +KKD QWMPGQGY ++ STRNLILKN+HLFGRQ IAE+LSKQ Sbjct: 558 LFEMAVSRYLGQPEKKDLLLHLLQWMPGQGYVINPSTRNLILKNSHLFGRQLIAEVLSKQ 617 Query: 129 HI 124 + Sbjct: 618 RV 619 >gb|EOX98059.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 2 [Theobroma cacao] Length = 649 Score = 479 bits (1233), Expect = e-132 Identities 
= 240/364 (65%), Positives = 293/364 (80%) Frame = -1 Query: 1209 GNYKKAENLIQDMRSAGLELNKVILTTLLKVYVRSDLFEKSRVLLNELENLGYAADEMPY 1030 GNY KA+ L++ M+S+GL NKVILTTLLKVYVR LFEKS LL ELE LGYA DEMP+ Sbjct: 284 GNYCKADELVEQMKSSGLVPNKVILTTLLKVYVRGGLFEKSTKLLAELEALGYAEDEMPF 343 Query: 1029 CILMDGLVKSGQTLEAKSVFDDMMKKKVRTDGYSYSIMISAFCHVGLLQEAKQLISDYEA 850 C+LMDGL K+G+ EA+SVF +M +K V++DGYS+SIMISA C GL +EAK+L D+EA Sbjct: 344 CLLMDGLSKAGRLDEARSVFVEMQQKCVKSDGYSHSIMISALCRAGLFEEAKELAQDFEA 403 Query: 849 NYQKYDVVILNSMLCAYCRAGEMDNVXXXXXXMDELAISPDWNTFHILIKYFCSEKLYIL 670 Y KYD+V+LN+MLCAYCRAGEM++V MDELAISPD+NTFHILIKYFC EKLY+L Sbjct: 404 QYNKYDLVMLNTMLCAYCRAGEMESVMQTMKKMDELAISPDYNTFHILIKYFCKEKLYLL 463 Query: 669 AYRTMEDMHKKGHQVEEELCCTLMFHLGKTGAYGEAFSVYNMLRYSKRTMNKSLHEKLLH 490 AY+TMEDMH KG+ EEELC +L+F LGK A+ EAFSVYNMLRYSKRTM K+LHEK+LH Sbjct: 464 AYKTMEDMHGKGYHPEEELCSSLIFQLGKMKAHLEAFSVYNMLRYSKRTMCKALHEKILH 523 Query: 489 ILVRGRLLKEAYVIVKDNGGLISPSAIKKFATSFMQSGNINLVNDVIKTIHSFGFKINQE 310 IL+ G+LLK+AYV+VKDN LIS AI KFAT+FM+ GNIN++NDV+K +H G+KI+Q Sbjct: 524 ILIAGQLLKDAYVVVKDNAELISQPAITKFATAFMKLGNINMINDVLKVLHGSGYKIDQ- 582 Query: 309 LFHSAVSRYLEETQKKDXXXXXXQWMPGQGYFVDSSTRNLILKNAHLFGRQTIAELLSKQ 130 A+SRYL + +KK+ QWMPG GY VDSSTRN+ILKN+ L GRQ AE+LSKQ Sbjct: 583 ---MAISRYLGQPEKKELLLQLLQWMPGHGYVVDSSTRNMILKNSQLLGRQLTAEILSKQ 639 Query: 129 HIVS 118 H++S Sbjct: 640 HMMS 643 >ref|XP_006417404.1| hypothetical protein EUTSA_v10007006mg [Eutrema salsugineum] gi|557095175|gb|ESQ35757.1| hypothetical protein EUTSA_v10007006mg [Eutrema salsugineum] Length = 666 Score = 463 bits (1192), Expect = e-128 Identities = 225/364 (61%), Positives = 289/364 (79%) Frame = -1 Query: 1209 GNYKKAENLIQDMRSAGLELNKVILTTLLKVYVRSDLFEKSRVLLNELENLGYAADEMPY 1030 G+YKKA+ L+ +M+S G+ NKV++TTLLKVY+R LFE+SR LL+ELE+ GYA +EMPY Sbjct: 287 GDYKKADELMTEMKSVGIVPNKVMMTTLLKVYIRGGLFERSRELLSELESAGYAENEMPY 346 Query: 1029 CILMDGLVKSGQTLEAKSVFDDMMKKKVRTDGYSYSIMISAFCHVGLLQEAKQLISDYEA 850 C+LMDGL K+G+ EA+S+FD+M K V++DGY+ SIMISA C +EAKQL D E+ Sbjct: 347 
CMLMDGLSKAGKFEEARSIFDEMKGKGVKSDGYANSIMISALCRSKRFEEAKQLARDSES 406 Query: 849 NYQKYDVVILNSMLCAYCRAGEMDNVXXXXXXMDELAISPDWNTFHILIKYFCSEKLYIL 670 Y+K D+V+LN+MLCAYCRAGEM++V MDE A+SPD+NTFHILIKYF EKL++L Sbjct: 407 TYEKCDLVMLNTMLCAYCRAGEMESVMRMMKKMDEQAVSPDYNTFHILIKYFIKEKLHLL 466 Query: 669 AYRTMEDMHKKGHQVEEELCCTLMFHLGKTGAYGEAFSVYNMLRYSKRTMNKSLHEKLLH 490 AY+T+ DMH KGH++EEELC +L++HLGK A+ EAFSVY+MLRYSKRT+ K LHEK+LH Sbjct: 467 AYQTLLDMHSKGHRLEEELCSSLIYHLGKIRAHSEAFSVYSMLRYSKRTICKDLHEKILH 526 Query: 489 ILVRGRLLKEAYVIVKDNGGLISPSAIKKFATSFMQSGNINLVNDVIKTIHSFGFKINQE 310 IL+ G+LLK+AYV+VKDN +IS +K+F +FM SGN+NLVNDV+K +H G KI+Q Sbjct: 527 ILIHGKLLKDAYVVVKDNAKMISQPTLKRFGRAFMNSGNVNLVNDVLKVLHGSGHKIDQV 586 Query: 309 LFHSAVSRYLEETQKKDXXXXXXQWMPGQGYFVDSSTRNLILKNAHLFGRQTIAELLSKQ 130 F A+SRY+ + KK+ QWMPGQGY VDSSTRNLILKN++LFGRQ IAE+LSK Sbjct: 587 QFEIAISRYISQPDKKELLLQLLQWMPGQGYVVDSSTRNLILKNSNLFGRQLIAEILSKH 646 Query: 129 HIVS 118 HI S Sbjct: 647 HIAS 650 >ref|NP_172560.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|122242678|sp|Q0WVV0.1|PPR31_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At1g10910, chloroplastic; Flags: Precursor gi|110741600|dbj|BAE98748.1| membrane-associated salt-inducible protein isolog [Arabidopsis thaliana] gi|332190541|gb|AEE28662.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 664 Score = 460 bits (1184), Expect = e-127 Identities = 225/364 (61%), Positives = 285/364 (78%) Frame = -1 Query: 1209 GNYKKAENLIQDMRSAGLELNKVILTTLLKVYVRSDLFEKSRVLLNELENLGYAADEMPY 1030 G+YKKA+ L+ +M+S GL NKV++TTLLKVY++ LF++SR LL+ELE+ GYA +EMPY Sbjct: 286 GDYKKADELMTEMKSIGLVPNKVMMTTLLKVYIKGGLFDRSRELLSELESAGYAENEMPY 345 Query: 1029 CILMDGLVKSGQTLEAKSVFDDMMKKKVRTDGYSYSIMISAFCHVGLLQEAKQLISDYEA 850 C+LMDGL K+G+ EA+S+FDDM K VR+DGY+ SIMISA C +EAK+L D E Sbjct: 346 CMLMDGLSKAGKLEEARSIFDDMKGKGVRSDGYANSIMISALCRSKRFKEAKELSRDSET 405 Query: 849 NYQKYDVVILNSMLCAYCRAGEMDNVXXXXXXMDELAISPDWNTFHILIKYFCSEKLYIL 670 Y+K D+V+LN+MLCAYCRAGEM++V 
MDE A+SPD+NTFHILIKYF EKL++L Sbjct: 406 TYEKCDLVMLNTMLCAYCRAGEMESVMRMMKKMDEQAVSPDYNTFHILIKYFIKEKLHLL 465 Query: 669 AYRTMEDMHKKGHQVEEELCCTLMFHLGKTGAYGEAFSVYNMLRYSKRTMNKSLHEKLLH 490 AY+T DMH KGH++EEELC +L++HLGK A EAFSVYNMLRYSKRT+ K LHEK+LH Sbjct: 466 AYQTTLDMHSKGHRLEEELCSSLIYHLGKIRAQAEAFSVYNMLRYSKRTICKELHEKILH 525 Query: 489 ILVRGRLLKEAYVIVKDNGGLISPSAIKKFATSFMQSGNINLVNDVIKTIHSFGFKINQE 310 IL++G LLK+AY++VKDN +IS +KKF +FM SGNINLVNDV+K +H G KI+Q Sbjct: 526 ILIQGNLLKDAYIVVKDNAKMISQPTLKKFGRAFMISGNINLVNDVLKVLHGSGHKIDQV 585 Query: 309 LFHSAVSRYLEETQKKDXXXXXXQWMPGQGYFVDSSTRNLILKNAHLFGRQTIAELLSKQ 130 F A+SRY+ + KK+ QWMPGQGY VDSSTRNLILKN+H+FGR IAE+LSK Sbjct: 586 QFEIAISRYISQPDKKELLLQLLQWMPGQGYVVDSSTRNLILKNSHMFGRLLIAEILSKH 645 Query: 129 HIVS 118 H+ S Sbjct: 646 HVAS 649