BLASTX nr result
ID: Catharanthus22_contig00030821
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00030821 (1585 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006348098.1| PREDICTED: pentatricopeptide repeat-containi... 668 0.0 ref|XP_002280513.1| PREDICTED: pentatricopeptide repeat-containi... 634 e-179 ref|XP_006494311.1| PREDICTED: pentatricopeptide repeat-containi... 613 e-173 ref|XP_006451265.1| hypothetical protein CICLE_v10008215mg [Citr... 613 e-173 ref|XP_004307867.1| PREDICTED: pentatricopeptide repeat-containi... 611 e-172 ref|XP_004233792.1| PREDICTED: uncharacterized protein LOC101257... 590 e-166 ref|XP_004135378.1| PREDICTED: pentatricopeptide repeat-containi... 572 e-160 ref|XP_004162737.1| PREDICTED: pentatricopeptide repeat-containi... 571 e-160 ref|XP_003630737.1| Pentatricopeptide repeat-containing protein ... 568 e-159 gb|AFK47126.1| unknown [Medicago truncatula] 566 e-159 ref|XP_004503475.1| PREDICTED: pentatricopeptide repeat-containi... 566 e-158 ref|NP_177842.1| pentatricopeptide repeat-containing protein [Ar... 546 e-153 ref|XP_006300895.1| hypothetical protein CARUB_v10021262mg [Caps... 545 e-152 ref|XP_006390118.1| hypothetical protein EUTSA_v10018483mg [Eutr... 540 e-151 ref|XP_002889127.1| pentatricopeptide repeat-containing protein ... 540 e-151 gb|ESW32292.1| hypothetical protein PHAVU_002G309900g [Phaseolus... 511 e-142 gb|EPS67797.1| hypothetical protein M569_06976 [Genlisea aurea] 498 e-138 ref|XP_002514235.1| pentatricopeptide repeat-containing protein,... 400 e-109 ref|XP_003567066.1| PREDICTED: pentatricopeptide repeat-containi... 400 e-109 ref|XP_004972152.1| PREDICTED: pentatricopeptide repeat-containi... 399 e-108 >ref|XP_006348098.1| PREDICTED: pentatricopeptide repeat-containing protein At1g77170-like [Solanum tuberosum] Length = 456 Score = 668 bits (1723), Expect = 0.0 Identities = 315/418 (75%), Positives = 371/418 (88%) Frame = +3 Query: 138 KIIATQISNCSDLKQLRQIHAHIIRTHFLALHPVPFHYNNLIRSYTNLNSPQKAQFLYVF 317 KIIAT+ISNC++L L QI++ IIR HFL L+PV FH+NN+IRSYT LNSP A +Y+ Sbjct: 37 KIIATRISNCNNLHHLDQIYSQIIRNHFLELYPVQFHWNNIIRSYTRLNSPSNALHVYIT 96 Query: 318 MSRAGVRPDTYTLPILLKSVSQFFDFKMGRLIHVIAIKSGMDTNMYCESGFISLYSKAGD 497 MSR GVRPDT+TLPI+LK++ Q ++ + R +H +AIK G++TNMYCESGFISLY+KAG+ Sbjct: 97 MSRTGVRPDTFTLPIVLKAICQVLNYVVARQLHGVAIKLGLETNMYCESGFISLYAKAGE 156 Query: 498 FNSARKVFDKNSDRKLGSWNAIIAGLSQGGRSKEAIKMFIELKQSGLEADDVTMVSVTSA 677 F +ARKVF++NS+RKLGSWNAIIAGLSQGGR+KEAI+MF+EL++SGL+ DDVTMVSVTSA Sbjct: 157 FENARKVFEQNSERKLGSWNAIIAGLSQGGRAKEAIEMFLELRESGLQPDDVTMVSVTSA 216 Query: 678 CGTLGDLNLALQLHRCVFQARILEKSDMLMMNSLIDMYGKCGRMDLANKVFLEMGERNVS 857 CG+LGDL+LA QLH+CVFQA+ +E+SD+LMMNSLIDMYGKCG+MDLA +VF M ERNVS Sbjct: 217 CGSLGDLDLASQLHKCVFQAKEMERSDLLMMNSLIDMYGKCGKMDLAYRVFSRMKERNVS 276 Query: 858 SWTSMIVGYAMHGQVRDALECFGCMRDAGVRPNHVTFVGVLSACVHGGRVDDGKHYFKMM 1037 SWTSMIVGYAMHG VRDA+ECF CMR+AGVRPNHVTF+GVLSACVHGG V +GK+YF MM Sbjct: 277 SWTSMIVGYAMHGYVRDAVECFHCMREAGVRPNHVTFIGVLSACVHGGMVKEGKYYFNMM 336 Query: 1038 KTRYGIAPLLQHYGCMVDLLGRAGLLVEAKGMVDAMPMKPNVVIWGCLMGACEKFGNVEM 1217 K YGIAP+LQHYGCMVDLLGRAGL EA+G ++ M MKPNVVIWGCLMGACEK G+V+M Sbjct: 337 KNEYGIAPMLQHYGCMVDLLGRAGLFEEARGTIEGMSMKPNVVIWGCLMGACEKHGHVKM 396 Query: 1218 GEWVAKHLMELEPWNDGVYVVLSNIYACNDMWEEVGRIRGIMKERKLAKLPAYSLSTN 1391 GEWVAKHL +LEPWNDGVYVVLSNIYA NDMWEEV RIR IMKERKLAK+PAYSLST+ Sbjct: 397 GEWVAKHLQQLEPWNDGVYVVLSNIYASNDMWEEVRRIRAIMKERKLAKIPAYSLSTS 454 >ref|XP_002280513.1| PREDICTED: pentatricopeptide repeat-containing protein At1g77170 [Vitis vinifera] gi|297738768|emb|CBI28013.3| unnamed protein product [Vitis vinifera] Length = 479 Score = 634 bits (1635), Expect = e-179 Identities = 303/463 (65%), Positives = 373/463 (80%), Gaps = 2/463 (0%) Frame = +3 Query: 9 RVPKPLYIFSHL-NHTPSTPSDPQSIAHFVTTTHS-SHQPFQDYAKIIATQISNCSDLKQ 182 ++PKP I HL N+ +T + +F T H S P QD A+ IA+ +S C++L + Sbjct: 15 QIPKPQTISRHLCNYLATTTHSVDAQFNFQPTAHPPSPDPIQDTAQTIASHLSKCANLIE 74 Query: 183 LRQIHAHIIRTHFLALHPVPFHYNNLIRSYTNLNSPQKAQFLYVFMSRAGVRPDTYTLPI 362 L Q+ AHIIRTHFL L+P PF +NN+IRSYT L + A +Y+ MSRAGV PD+YT+PI Sbjct: 75 LNQLLAHIIRTHFLELYPAPFQWNNIIRSYTRLEAHHYALSIYIAMSRAGVSPDSYTIPI 134 Query: 363 LLKSVSQFFDFKMGRLIHVIAIKSGMDTNMYCESGFISLYSKAGDFNSARKVFDKNSDRK 542 +LK+V Q F GR +H +AI+ G++ N YCESGFIS+YSKAG+F +A KVF++N RK Sbjct: 135 VLKAVCQAFATGFGRQVHSVAIRHGLELNEYCESGFISVYSKAGEFQNAHKVFEQNRFRK 194 Query: 543 LGSWNAIIAGLSQGGRSKEAIKMFIELKQSGLEADDVTMVSVTSACGTLGDLNLALQLHR 722 LGSWNAII GLSQGGR+KEA+ MF+E+++ G E D+VTMVSVTSACG+LG L+LALQLH+ Sbjct: 195 LGSWNAIIGGLSQGGRAKEAVTMFMEMRKCGFEPDEVTMVSVTSACGSLGHLDLALQLHK 254 Query: 723 CVFQARILEKSDMLMMNSLIDMYGKCGRMDLANKVFLEMGERNVSSWTSMIVGYAMHGQV 902 CV+QA+ E+SD L +NSL+DMYGKCGRMDLA +VF M E NVSSWTSMIVGYAMHGQ+ Sbjct: 255 CVYQAKTSERSDTLTLNSLVDMYGKCGRMDLAYRVFSRMDEPNVSSWTSMIVGYAMHGQL 314 Query: 903 RDALECFGCMRDAGVRPNHVTFVGVLSACVHGGRVDDGKHYFKMMKTRYGIAPLLQHYGC 1082 DALECF CMR+AGVRPNHVTF+GVLSACVHGG V +GK+YF MM T YG+ P +QHYGC Sbjct: 315 YDALECFRCMREAGVRPNHVTFIGVLSACVHGGAVQEGKYYFDMMTTAYGLVPRMQHYGC 374 Query: 1083 MVDLLGRAGLLVEAKGMVDAMPMKPNVVIWGCLMGACEKFGNVEMGEWVAKHLMELEPWN 1262 MVDLLGRAGLL EA+ MV+ MPMK NV++WGCLMGACEK+GNV+MGEWVA+HL+ELEPWN Sbjct: 375 MVDLLGRAGLLEEARKMVERMPMKANVIVWGCLMGACEKYGNVKMGEWVAEHLLELEPWN 434 Query: 1263 DGVYVVLSNIYACNDMWEEVGRIRGIMKERKLAKLPAYSLSTN 1391 DGV+VVLSNIYA +W EV R+RG+MKERKL K+PAYSL+TN Sbjct: 435 DGVFVVLSNIYASRGLWREVERVRGVMKERKLDKVPAYSLATN 477 >ref|XP_006494311.1| PREDICTED: pentatricopeptide repeat-containing protein At1g77170-like [Citrus sinensis] Length = 471 Score = 613 bits (1582), Expect = e-173 Identities = 294/421 (69%), Positives = 349/421 (82%) Frame = +3 Query: 126 QDYAKIIATQISNCSDLKQLRQIHAHIIRTHFLALHPVPFHYNNLIRSYTNLNSPQKAQF 305 +D AKI+ATQ+S C++L QL QI+AHIIRTH L + FH+NN+IR YT L +P+KA Sbjct: 48 EDPAKIVATQLSKCTNLLQLNQIYAHIIRTHMLHSYSAAFHWNNIIRLYTRLEAPKKALD 107 Query: 306 LYVFMSRAGVRPDTYTLPILLKSVSQFFDFKMGRLIHVIAIKSGMDTNMYCESGFISLYS 485 +Y+FMSRAGV PD YTLPI+LK+ Q F ++GR +H +A++ G+++N +CESGFISLYS Sbjct: 108 IYIFMSRAGVLPDCYTLPIVLKASCQLFALEIGRQLHSLAVRLGLESNEFCESGFISLYS 167 Query: 486 KAGDFNSARKVFDKNSDRKLGSWNAIIAGLSQGGRSKEAIKMFIELKQSGLEADDVTMVS 665 KAGDF ARKVFD+N +RKLGSWNAIIAGLSQ GR+KEAI MFI LK+ G E DDVTMVS Sbjct: 168 KAGDFEKARKVFDENPERKLGSWNAIIAGLSQDGRAKEAIDMFIGLKKCGFEPDDVTMVS 227 Query: 666 VTSACGTLGDLNLALQLHRCVFQARILEKSDMLMMNSLIDMYGKCGRMDLANKVFLEMGE 845 VTSACG+LGDL LALQ+H+ VFQ + +KSD LM+NSLIDMYGKCGRMDLA KVF E+ + Sbjct: 228 VTSACGSLGDLELALQVHKYVFQVKSKQKSDTLMLNSLIDMYGKCGRMDLAYKVFWEIDQ 287 Query: 846 RNVSSWTSMIVGYAMHGQVRDALECFGCMRDAGVRPNHVTFVGVLSACVHGGRVDDGKHY 1025 NVSSWTSMIVGYA +G +AL+CF MR++G+RPNHVTFVGVLSACVHGG+V +GKH+ Sbjct: 288 PNVSSWTSMIVGYAANGLANEALDCFHYMRESGIRPNHVTFVGVLSACVHGGKVQEGKHF 347 Query: 1026 FKMMKTRYGIAPLLQHYGCMVDLLGRAGLLVEAKGMVDAMPMKPNVVIWGCLMGACEKFG 1205 F+MMK Y I P HYGCMVDLLGRAGLL EA+ MV+ MPMK NVVIWGCLMGACEKFG Sbjct: 348 FEMMKNVYQIEPRFAHYGCMVDLLGRAGLLEEARAMVEGMPMKANVVIWGCLMGACEKFG 407 Query: 1206 NVEMGEWVAKHLMELEPWNDGVYVVLSNIYACNDMWEEVGRIRGIMKERKLAKLPAYSLS 1385 NV+MGEWVAKHL ELEPW+DG YVVLSNIYA +WEEV RIR +MK R LAK+PAYSL+ Sbjct: 408 NVKMGEWVAKHLQELEPWSDGAYVVLSNIYASRGLWEEVERIRAVMKHRNLAKIPAYSLA 467 Query: 1386 T 1388 T Sbjct: 468 T 468 >ref|XP_006451265.1| hypothetical protein CICLE_v10008215mg [Citrus clementina] gi|557554491|gb|ESR64505.1| hypothetical protein CICLE_v10008215mg [Citrus clementina] Length = 461 Score = 613 bits (1582), Expect = e-173 Identities = 299/448 (66%), Positives = 359/448 (80%), Gaps = 1/448 (0%) Frame = +3 Query: 48 HTPSTPSDPQSIAH-FVTTTHSSHQPFQDYAKIIATQISNCSDLKQLRQIHAHIIRTHFL 224 H +PS Q++ + ++ TH AKI+ATQ+S C++L QL QI+AHIIRTH L Sbjct: 19 HEILSPSPAQALQNPYIQKTHP--------AKIVATQLSKCTNLLQLNQIYAHIIRTHML 70 Query: 225 ALHPVPFHYNNLIRSYTNLNSPQKAQFLYVFMSRAGVRPDTYTLPILLKSVSQFFDFKMG 404 + FH+NN+IR YT L +P+KA +Y+FMSRAGV PD YTLPI+LK+ Q F ++G Sbjct: 71 HSYSAAFHWNNIIRLYTRLEAPKKALDIYIFMSRAGVLPDCYTLPIVLKASCQLFALEIG 130 Query: 405 RLIHVIAIKSGMDTNMYCESGFISLYSKAGDFNSARKVFDKNSDRKLGSWNAIIAGLSQG 584 R +H +A++ G+++N +CESGFISLYSKAGDF ARKVFD+N +RKLGSWNAIIAGLSQ Sbjct: 131 RQLHSLAVRLGLESNEFCESGFISLYSKAGDFEKARKVFDENPERKLGSWNAIIAGLSQD 190 Query: 585 GRSKEAIKMFIELKQSGLEADDVTMVSVTSACGTLGDLNLALQLHRCVFQARILEKSDML 764 GR+KEAI MFI LK+ G E DDVTMVSVTSACG+LGDL LALQ+H+ VFQ + +KSD L Sbjct: 191 GRAKEAIDMFIGLKKCGFEPDDVTMVSVTSACGSLGDLELALQVHKYVFQVKSKQKSDTL 250 Query: 765 MMNSLIDMYGKCGRMDLANKVFLEMGERNVSSWTSMIVGYAMHGQVRDALECFGCMRDAG 944 M+NSLIDMYGKCGRMDLA KVF E+ + NVSSWTSMIVGYA +G +AL+CF MR++G Sbjct: 251 MLNSLIDMYGKCGRMDLAYKVFWEIDQPNVSSWTSMIVGYAANGLANEALDCFHYMRESG 310 Query: 945 VRPNHVTFVGVLSACVHGGRVDDGKHYFKMMKTRYGIAPLLQHYGCMVDLLGRAGLLVEA 1124 +RPNHVTFVGVLSACVHGG+V +GKH+F+MMK Y I P HYGCMVDLLGRAGLL EA Sbjct: 311 IRPNHVTFVGVLSACVHGGKVQEGKHFFEMMKNVYQIEPRFAHYGCMVDLLGRAGLLEEA 370 Query: 1125 KGMVDAMPMKPNVVIWGCLMGACEKFGNVEMGEWVAKHLMELEPWNDGVYVVLSNIYACN 1304 + MV+ MPMK NVVIWGCLMGACEKFGNV+MGEWVAKHL ELEPW+DG YVVLSNIYA Sbjct: 371 RAMVEGMPMKANVVIWGCLMGACEKFGNVKMGEWVAKHLQELEPWSDGAYVVLSNIYASR 430 Query: 1305 DMWEEVGRIRGIMKERKLAKLPAYSLST 1388 +WEEV RIR +MK R LAK+PAYSL+T Sbjct: 431 GLWEEVERIRAVMKHRNLAKIPAYSLAT 458 >ref|XP_004307867.1| PREDICTED: pentatricopeptide repeat-containing protein At1g77170-like [Fragaria vesca subsp. vesca] Length = 468 Score = 611 bits (1576), Expect = e-172 Identities = 299/458 (65%), Positives = 362/458 (79%), Gaps = 1/458 (0%) Frame = +3 Query: 15 PKPLYIFSHLNHTPSTPSDPQSIAHFVTTTHSSHQPFQDYAKIIATQISNCSDLKQLRQI 194 PKP I LN + +T S TT + P + +A Q+SNCS + QL QI Sbjct: 17 PKPTTISGLLNRSLTTTS----------TTPNVESPSTPDPETLAIQLSNCSCVSQLDQI 66 Query: 195 HAHIIRTHFLALHPVPFHYNNLIRSYTNLNSPQKAQFLYVFMSRAGVRPDTYTLPILLKS 374 + +IRT FL L+P PFH+NNLIRSYT ++P ++ F+YV MSRAGV PD YT+PI+LK+ Sbjct: 67 YCRVIRTQFLHLYPAPFHWNNLIRSYTRRDAPTESLFVYVAMSRAGVLPDCYTIPIVLKT 126 Query: 375 VSQFFDFKMGRLIHVIAIKSGMDTNMYCESGFISLYSKAGDFNSARKVFDKNSDRKLGSW 554 + Q + +GR +H +A++ G+D+N +CESGFI+LYSKAG+F +ARKVFD+N++RKLGSW Sbjct: 127 LCQLYAVDIGRQLHSVAVRIGLDSNEFCESGFINLYSKAGEFENARKVFDQNAERKLGSW 186 Query: 555 NAIIAGLSQGGRSKEAIKMFIELKQSGLEADDVTMVSVTSACGTLGDLNLALQLHRCVFQ 734 NAIIAGLSQ GR+KEAI MFIEL++ GL DDVTMVSVTSACG LGDL LALQLH+CV+Q Sbjct: 187 NAIIAGLSQSGRAKEAIDMFIELRRCGLLPDDVTMVSVTSACGGLGDLRLALQLHKCVYQ 246 Query: 735 ARILE-KSDMLMMNSLIDMYGKCGRMDLANKVFLEMGERNVSSWTSMIVGYAMHGQVRDA 911 A I KSD+LM+NSL+DMYGKCGRMDLA +VF M +NVSSWTSMIVGYAMHG V +A Sbjct: 247 AEIAAGKSDLLMLNSLVDMYGKCGRMDLAYRVFKRMKVQNVSSWTSMIVGYAMHGHVDEA 306 Query: 912 LECFGCMRDAGVRPNHVTFVGVLSACVHGGRVDDGKHYFKMMKTRYGIAPLLQHYGCMVD 1091 LECF CMR+AGVRPNHVTFVG LSACVHGG V +G++YF+MMK YGI P LQHYGCMVD Sbjct: 307 LECFRCMREAGVRPNHVTFVGALSACVHGGTVQEGRYYFEMMKKDYGINPRLQHYGCMVD 366 Query: 1092 LLGRAGLLVEAKGMVDAMPMKPNVVIWGCLMGACEKFGNVEMGEWVAKHLMELEPWNDGV 1271 LLG+AGLL EA+ MV+ MPMK N +WGCLMGACEK GNVEMGEWVAKHL +LEPWNDG Sbjct: 367 LLGKAGLLQEARKMVEEMPMKANSAVWGCLMGACEKHGNVEMGEWVAKHLQQLEPWNDGA 426 Query: 1272 YVVLSNIYACNDMWEEVGRIRGIMKERKLAKLPAYSLS 1385 +VVLSNIYA +W+EV R+R IM +RKLAK+P YSL+ Sbjct: 427 FVVLSNIYASRGLWKEVERVRRIMHQRKLAKVPGYSLA 464 >ref|XP_004233792.1| PREDICTED: uncharacterized protein LOC101257235 [Solanum lycopersicum] Length = 1917 Score = 590 bits (1521), Expect = e-166 Identities = 275/370 (74%), Positives = 325/370 (87%) Frame = +3 Query: 228 LHPVPFHYNNLIRSYTNLNSPQKAQFLYVFMSRAGVRPDTYTLPILLKSVSQFFDFKMGR 407 L+P FH+NN+IRSYT LNSP A +Y+ MSR GVRPDT+TLPI+LK++ Q ++ + R Sbjct: 1372 LYPAQFHWNNIIRSYTRLNSPTNALHVYITMSRTGVRPDTFTLPIVLKAICQVLNYVVAR 1431 Query: 408 LIHVIAIKSGMDTNMYCESGFISLYSKAGDFNSARKVFDKNSDRKLGSWNAIIAGLSQGG 587 +H +AIK G++TNMYCESGFISLY+KAG+F +ARKVF++NS+RKLGSWNAIIAGLSQGG Sbjct: 1432 QLHGVAIKLGLETNMYCESGFISLYAKAGEFENARKVFEQNSERKLGSWNAIIAGLSQGG 1491 Query: 588 RSKEAIKMFIELKQSGLEADDVTMVSVTSACGTLGDLNLALQLHRCVFQARILEKSDMLM 767 R+KEAI+MF+EL++SGL+ DDVTMVS TSACG+LGDL+LA QLH+CVFQA+ +E+SD+LM Sbjct: 1492 RAKEAIEMFLELRESGLQPDDVTMVSATSACGSLGDLDLASQLHKCVFQAKEMERSDLLM 1551 Query: 768 MNSLIDMYGKCGRMDLANKVFLEMGERNVSSWTSMIVGYAMHGQVRDALECFGCMRDAGV 947 MNSLIDMYGKCG+MDLA +VF M ERNVSSWTSMIVG AMHG VRDA+ECF CMR+AGV Sbjct: 1552 MNSLIDMYGKCGKMDLAYRVFSRMKERNVSSWTSMIVGCAMHGYVRDAVECFHCMREAGV 1611 Query: 948 RPNHVTFVGVLSACVHGGRVDDGKHYFKMMKTRYGIAPLLQHYGCMVDLLGRAGLLVEAK 1127 RPNHVTF+GVLSACVHGG V +GK+YF MMK YGIAP+LQHYGCMVDLLGRAGL EA+ Sbjct: 1612 RPNHVTFIGVLSACVHGGMVKEGKYYFNMMKNEYGIAPMLQHYGCMVDLLGRAGLFEEAR 1671 Query: 1128 GMVDAMPMKPNVVIWGCLMGACEKFGNVEMGEWVAKHLMELEPWNDGVYVVLSNIYACND 1307 G ++ M MKPNVVIWGCLMGACEK G+V+MGEWVAKHL +LEPWNDGVYVVLSNIYA ND Sbjct: 1672 GTIEGMSMKPNVVIWGCLMGACEKHGHVKMGEWVAKHLQQLEPWNDGVYVVLSNIYASND 1731 Query: 1308 MWEEVGRIRG 1337 MWEE+ RG Sbjct: 1732 MWEEIVAWRG 1741 >ref|XP_004135378.1| PREDICTED: pentatricopeptide repeat-containing protein At1g77170-like [Cucumis sativus] Length = 609 Score = 572 bits (1475), Expect = e-160 Identities = 271/420 (64%), Positives = 338/420 (80%) Frame = +3 Query: 129 DYAKIIATQISNCSDLKQLRQIHAHIIRTHFLALHPVPFHYNNLIRSYTNLNSPQKAQFL 308 D+A+I+AT + NC+++ +L QIHAH++RT+ L HP F++N +IRSYT L P+ A F+ Sbjct: 187 DHARIVATLLMNCTNVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFV 246 Query: 309 YVFMSRAGVRPDTYTLPILLKSVSQFFDFKMGRLIHVIAIKSGMDTNMYCESGFISLYSK 488 Y+ M RAG+ PD YTLPI+ K++S + F +G +H +AI+ G + + Y ESG ISLYSK Sbjct: 247 YIDMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGLISLYSK 306 Query: 489 AGDFNSARKVFDKNSDRKLGSWNAIIAGLSQGGRSKEAIKMFIELKQSGLEADDVTMVSV 668 GD A KVF++N +RKLGSWNAIIAGLSQGGR+KEA+ MFI+L+QSGLE DD T+VSV Sbjct: 307 IGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGRAKEAVNMFIKLRQSGLEPDDFTIVSV 366 Query: 669 TSACGTLGDLNLALQLHRCVFQARILEKSDMLMMNSLIDMYGKCGRMDLANKVFLEMGER 848 TSACG+LG+L L+LQ+H+ VFQ ++ KS++LM+NSLIDMYGKCGRMDLA KVF MG R Sbjct: 367 TSACGSLGNLELSLQMHKFVFQVKVTGKSNILMLNSLIDMYGKCGRMDLAMKVFSNMGHR 426 Query: 849 NVSSWTSMIVGYAMHGQVRDALECFGCMRDAGVRPNHVTFVGVLSACVHGGRVDDGKHYF 1028 NVSSWTS+IVGYAMHGQV+ ALE F MR+AGV PN VTFVGVLSACVHGG +++GKHYF Sbjct: 427 NVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYF 486 Query: 1029 KMMKTRYGIAPLLQHYGCMVDLLGRAGLLVEAKGMVDAMPMKPNVVIWGCLMGACEKFGN 1208 MMK YG P L HYGCMVDLL +AGLL EA+ M++ MPMK N +IWGCL+G CEK GN Sbjct: 487 DMMKNVYGFKPQLPHYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGN 546 Query: 1209 VEMGEWVAKHLMELEPWNDGVYVVLSNIYACNDMWEEVGRIRGIMKERKLAKLPAYSLST 1388 VE+GEW KHL ELEPWNDGVYVVLSNIYA N MW+E ++R +MK+R+LAK+P YSL+T Sbjct: 547 VEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVPGYSLAT 606 >ref|XP_004162737.1| PREDICTED: pentatricopeptide repeat-containing protein At1g77170-like [Cucumis sativus] Length = 614 Score = 571 bits (1472), Expect = e-160 Identities = 270/420 (64%), Positives = 338/420 (80%) Frame = +3 Query: 129 DYAKIIATQISNCSDLKQLRQIHAHIIRTHFLALHPVPFHYNNLIRSYTNLNSPQKAQFL 308 D+A+I+AT + NC+++ +L QIHAH++RT+ L HP F++N +IRSYT L P+ A F+ Sbjct: 192 DHARIVATLLMNCTNVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFV 251 Query: 309 YVFMSRAGVRPDTYTLPILLKSVSQFFDFKMGRLIHVIAIKSGMDTNMYCESGFISLYSK 488 Y+ M +AG+ PD YTLPI+ K++S + F +G +H +AI+ G + + Y ESG ISLYSK Sbjct: 252 YIAMLQAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGLISLYSK 311 Query: 489 AGDFNSARKVFDKNSDRKLGSWNAIIAGLSQGGRSKEAIKMFIELKQSGLEADDVTMVSV 668 GD A KVF++N +RKLGSWNAIIAGLSQGGR+KEA+ MFI+L+QSGLE DD T+VSV Sbjct: 312 IGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGRAKEAVNMFIKLRQSGLEPDDFTIVSV 371 Query: 669 TSACGTLGDLNLALQLHRCVFQARILEKSDMLMMNSLIDMYGKCGRMDLANKVFLEMGER 848 TSACG+LG+L L+LQ+H+ VFQ ++ KS++LM+NSLIDMYGKCGRMDLA KVF MG R Sbjct: 372 TSACGSLGNLELSLQMHKFVFQVKVTGKSNILMLNSLIDMYGKCGRMDLAMKVFSNMGHR 431 Query: 849 NVSSWTSMIVGYAMHGQVRDALECFGCMRDAGVRPNHVTFVGVLSACVHGGRVDDGKHYF 1028 NVSSWTS+IVGYAMHGQV+ ALE F MR+AGV PN VTFVGVLSACVHGG +++GKHYF Sbjct: 432 NVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYF 491 Query: 1029 KMMKTRYGIAPLLQHYGCMVDLLGRAGLLVEAKGMVDAMPMKPNVVIWGCLMGACEKFGN 1208 MMK YG P L HYGCMVDLL +AGLL EA+ M++ MPMK N +IWGCL+G CEK GN Sbjct: 492 DMMKNVYGFKPQLPHYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGN 551 Query: 1209 VEMGEWVAKHLMELEPWNDGVYVVLSNIYACNDMWEEVGRIRGIMKERKLAKLPAYSLST 1388 VE+GEW KHL ELEPWNDGVYVVLSNIYA N MW+E ++R +MK+R+LAK+P YSL+T Sbjct: 552 VEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVPGYSLAT 611 >ref|XP_003630737.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355524759|gb|AET05213.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 447 Score = 568 bits (1465), Expect = e-159 Identities = 272/421 (64%), Positives = 337/421 (80%) Frame = +3 Query: 129 DYAKIIATQISNCSDLKQLRQIHAHIIRTHFLALHPVPFHYNNLIRSYTNLNSPQKAQFL 308 D +IAT +SN + ++ L QI+AHI+ T FL +P F++NN+IRSYT L SPQ A + Sbjct: 25 DPVTVIATLLSNTTRIRDLNQIYAHILLTRFLESNPASFNWNNIIRSYTRLESPQNALRI 84 Query: 309 YVFMSRAGVRPDTYTLPILLKSVSQFFDFKMGRLIHVIAIKSGMDTNMYCESGFISLYSK 488 YV M RAGV PD YTLPI+LK+VSQ F ++G+ +H IK G+ +N YCESGFI+LY K Sbjct: 85 YVSMLRAGVLPDRYTLPIVLKAVSQSFAIQLGQQVHSYGIKLGLQSNEYCESGFINLYCK 144 Query: 489 AGDFNSARKVFDKNSDRKLGSWNAIIAGLSQGGRSKEAIKMFIELKQSGLEADDVTMVSV 668 AGDF+SA KVFD+N + KLGSWNA+I+GLSQGG + +AI +F+++K+ G E D +TMVSV Sbjct: 145 AGDFDSAHKVFDENHEPKLGSWNALISGLSQGGLAMDAIVVFVDMKRHGFEPDGITMVSV 204 Query: 669 TSACGTLGDLNLALQLHRCVFQARILEKSDMLMMNSLIDMYGKCGRMDLANKVFLEMGER 848 SACG++GDL LALQLH+ VFQA+ E + +LM NSLIDMYGKCGRMDLA +VF M +R Sbjct: 205 MSACGSIGDLYLALQLHKYVFQAKTNEWTVILMSNSLIDMYGKCGRMDLAYEVFATMEDR 264 Query: 849 NVSSWTSMIVGYAMHGQVRDALECFGCMRDAGVRPNHVTFVGVLSACVHGGRVDDGKHYF 1028 NVSSWTSMIVGYAMHG ++AL CF CMR++GV+PN+VTF+GVLSACVHGG V +G+ YF Sbjct: 265 NVSSWTSMIVGYAMHGHAKEALGCFHCMRESGVKPNYVTFIGVLSACVHGGTVQEGRFYF 324 Query: 1029 KMMKTRYGIAPLLQHYGCMVDLLGRAGLLVEAKGMVDAMPMKPNVVIWGCLMGACEKFGN 1208 MMK YGI P LQHYGCMVDLLGRAGL +A+ MV+ MPMKPN V+WGCLMGACEK GN Sbjct: 325 DMMKNIYGITPQLQHYGCMVDLLGRAGLFDDARRMVEEMPMKPNSVVWGCLMGACEKHGN 384 Query: 1209 VEMGEWVAKHLMELEPWNDGVYVVLSNIYACNDMWEEVGRIRGIMKERKLAKLPAYSLST 1388 V+M EWVA++L LEPWN+GVYVVLSNIYA +W+EV RIR MKE +LAK+PAYS++T Sbjct: 385 VDMAEWVAENLQALEPWNEGVYVVLSNIYANKGLWKEVERIRSFMKEGRLAKIPAYSITT 444 Query: 1389 N 1391 N Sbjct: 445 N 445 >gb|AFK47126.1| unknown [Medicago truncatula] Length = 447 Score = 567 bits (1460), Expect = e-159 Identities = 271/421 (64%), Positives = 336/421 (79%) Frame = +3 Query: 129 DYAKIIATQISNCSDLKQLRQIHAHIIRTHFLALHPVPFHYNNLIRSYTNLNSPQKAQFL 308 D +IAT +SN + ++ L QI+AHI+ T FL +P F++NN+IRSYT L SPQ A + Sbjct: 25 DPVTVIATLLSNTTRIRDLNQIYAHILLTRFLESNPASFNWNNIIRSYTRLESPQNALRI 84 Query: 309 YVFMSRAGVRPDTYTLPILLKSVSQFFDFKMGRLIHVIAIKSGMDTNMYCESGFISLYSK 488 YV M RAGV PD YTLPI+LK+VSQ F ++G+ +H IK G+ +N YCESGFI+LY K Sbjct: 85 YVSMLRAGVLPDRYTLPIVLKAVSQSFAIQLGQQVHSYGIKLGLQSNEYCESGFINLYCK 144 Query: 489 AGDFNSARKVFDKNSDRKLGSWNAIIAGLSQGGRSKEAIKMFIELKQSGLEADDVTMVSV 668 AGDF+SA KVFD+N + KLGSWNA+I+GLSQGG + +AI +F+++K+ G E D +TMVSV Sbjct: 145 AGDFDSAHKVFDENHEPKLGSWNALISGLSQGGLAMDAIVVFVDMKRHGFEPDGITMVSV 204 Query: 669 TSACGTLGDLNLALQLHRCVFQARILEKSDMLMMNSLIDMYGKCGRMDLANKVFLEMGER 848 ACG++GDL LALQLH+ VFQA+ E + +LM NSLIDMYGKCGRMDLA +VF M +R Sbjct: 205 MCACGSIGDLYLALQLHKYVFQAKTNEWTVILMSNSLIDMYGKCGRMDLAYEVFATMEDR 264 Query: 849 NVSSWTSMIVGYAMHGQVRDALECFGCMRDAGVRPNHVTFVGVLSACVHGGRVDDGKHYF 1028 NVSSWTSMIVGYAMHG ++AL CF CMR++GV+PN+VTF+GVLSACVHGG V +G+ YF Sbjct: 265 NVSSWTSMIVGYAMHGHAKEALGCFHCMRESGVKPNYVTFIGVLSACVHGGTVQEGRFYF 324 Query: 1029 KMMKTRYGIAPLLQHYGCMVDLLGRAGLLVEAKGMVDAMPMKPNVVIWGCLMGACEKFGN 1208 MMK YGI P LQHYGCMVDLLGRAGL +A+ MV+ MPMKPN V+WGCLMGACEK GN Sbjct: 325 DMMKNIYGITPQLQHYGCMVDLLGRAGLFDDARRMVEEMPMKPNSVVWGCLMGACEKHGN 384 Query: 1209 VEMGEWVAKHLMELEPWNDGVYVVLSNIYACNDMWEEVGRIRGIMKERKLAKLPAYSLST 1388 V+M EWVA++L LEPWN+GVYVVLSNIYA +W+EV RIR MKE +LAK+PAYS++T Sbjct: 385 VDMAEWVAENLQALEPWNEGVYVVLSNIYANKGLWKEVERIRSFMKEGRLAKIPAYSITT 444 Query: 1389 N 1391 N Sbjct: 445 N 445 >ref|XP_004503475.1| PREDICTED: pentatricopeptide repeat-containing protein At1g77170-like [Cicer arietinum] Length = 451 Score = 566 bits (1458), Expect = e-158 Identities = 271/436 (62%), Positives = 341/436 (78%), Gaps = 2/436 (0%) Frame = +3 Query: 90 FVT--TTHSSHQPFQDYAKIIATQISNCSDLKQLRQIHAHIIRTHFLALHPVPFHYNNLI 263 FVT TT S D +I T +SN + + +L Q++AH++RT FL +P F++NN+I Sbjct: 14 FVTANTTQPSITSGNDPVTVITTLLSNSTRIHELNQVYAHLLRTRFLESNPASFNWNNII 73 Query: 264 RSYTNLNSPQKAQFLYVFMSRAGVRPDTYTLPILLKSVSQFFDFKMGRLIHVIAIKSGMD 443 R+YT L +P A ++V M RAGV PD YTLPI+LK++ Q F ++G+ +H IK G+ Sbjct: 74 RAYTRLEAPLNALRVHVLMLRAGVLPDHYTLPIVLKALCQSFAIELGKQVHSFGIKLGLQ 133 Query: 444 TNMYCESGFISLYSKAGDFNSARKVFDKNSDRKLGSWNAIIAGLSQGGRSKEAIKMFIEL 623 +N YCE+GFI+LY KAG+F SARKVFD+N D KLGSWN IIAG SQ G + +AI +F+++ Sbjct: 134 SNEYCETGFINLYCKAGEFYSARKVFDENPDPKLGSWNTIIAGFSQAGLAVDAIYVFVDM 193 Query: 624 KQSGLEADDVTMVSVTSACGTLGDLNLALQLHRCVFQARILEKSDMLMMNSLIDMYGKCG 803 ++ G E + +TMVSV SACG++GDL LALQLH+CVFQA EK+D+LM NSLIDMYGKCG Sbjct: 194 RRQGFEPNGITMVSVMSACGSIGDLYLALQLHKCVFQAEATEKTDILMSNSLIDMYGKCG 253 Query: 804 RMDLANKVFLEMGERNVSSWTSMIVGYAMHGQVRDALECFGCMRDAGVRPNHVTFVGVLS 983 RMDLA KVF M +RNVSSWTSMIVGYAMHG V++AL+CF CMR++ V+PN VTFVGVLS Sbjct: 254 RMDLAYKVFATMEDRNVSSWTSMIVGYAMHGLVKEALDCFRCMRESAVKPNFVTFVGVLS 313 Query: 984 ACVHGGRVDDGKHYFKMMKTRYGIAPLLQHYGCMVDLLGRAGLLVEAKGMVDAMPMKPNV 1163 ACVHGG V +G+ YF MM YGI P LQHYGCMVDLLGRAGLL +A+ MV+ MP+KPN Sbjct: 314 ACVHGGTVQEGRFYFDMMTNVYGITPQLQHYGCMVDLLGRAGLLDDARRMVEEMPVKPNS 373 Query: 1164 VIWGCLMGACEKFGNVEMGEWVAKHLMELEPWNDGVYVVLSNIYACNDMWEEVGRIRGIM 1343 V+WGCLMGACEK+GNV+M EWVA+HL+ LEPW DG YVVLSNIYA +W+EV RIR +M Sbjct: 374 VVWGCLMGACEKYGNVDMAEWVAEHLLALEPWTDGAYVVLSNIYANKGLWKEVERIRSLM 433 Query: 1344 KERKLAKLPAYSLSTN 1391 KE +LAK+PAYSL+T+ Sbjct: 434 KEGRLAKVPAYSLTTS 449 >ref|NP_177842.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|122215262|sp|Q3ECB8.1|PP128_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At1g77170 gi|332197823|gb|AEE35944.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 467 Score = 546 bits (1408), Expect = e-153 Identities = 258/436 (59%), Positives = 341/436 (78%), Gaps = 2/436 (0%) Frame = +3 Query: 87 HFVTTTHSSHQPF--QDYAKIIATQISNCSDLKQLRQIHAHIIRTHFLALHPVPFHYNNL 260 HFVTT+ SS P QD K++AT +SNC+ L ++R+IH I R+ L +P+ F +NN+ Sbjct: 29 HFVTTSSSSVTPLSPQDRNKLLATLLSNCTSLARVRRIHGDIFRSRILDQYPIAFLWNNI 88 Query: 261 IRSYTNLNSPQKAQFLYVFMSRAGVRPDTYTLPILLKSVSQFFDFKMGRLIHVIAIKSGM 440 +RSY SP A +Y+ M R+ V PD Y+LPI++K+ Q DF +G+ +H +A++ G Sbjct: 89 MRSYIRHESPLDAIQVYLGMVRSTVLPDRYSLPIVIKAAVQIHDFTLGKELHSVAVRLGF 148 Query: 441 DTNMYCESGFISLYSKAGDFNSARKVFDKNSDRKLGSWNAIIAGLSQGGRSKEAIKMFIE 620 + +CESGFI+LY KAG+F +ARKVFD+N +RKLGSWNAII GL+ GR+ EA++MF++ Sbjct: 149 VGDEFCESGFITLYCKAGEFENARKVFDENPERKLGSWNAIIGGLNHAGRANEAVEMFVD 208 Query: 621 LKQSGLEADDVTMVSVTSACGTLGDLNLALQLHRCVFQARILEKSDMLMMNSLIDMYGKC 800 +K+SGLE DD TMVSVT++CG LGDL+LA QLH+CV QA+ EKSD++M+NSLIDMYGKC Sbjct: 209 MKRSGLEPDDFTMVSVTASCGGLGDLSLAFQLHKCVLQAKTEEKSDIMMLNSLIDMYGKC 268 Query: 801 GRMDLANKVFLEMGERNVSSWTSMIVGYAMHGQVRDALECFGCMRDAGVRPNHVTFVGVL 980 GRMDLA+ +F EM +RNV SW+SMIVGYA +G +ALECF MR+ GVRPN +TFVGVL Sbjct: 269 GRMDLASHIFEEMRQRNVVSWSSMIVGYAANGNTLEALECFRQMREFGVRPNKITFVGVL 328 Query: 981 SACVHGGRVDDGKHYFKMMKTRYGIAPLLQHYGCMVDLLGRAGLLVEAKGMVDAMPMKPN 1160 SACVHGG V++GK YF MMK+ + + P L HYGC+VDLL R G L EAK +V+ MPMKPN Sbjct: 329 SACVHGGLVEEGKTYFAMMKSEFELEPGLSHYGCIVDLLSRDGQLKEAKKVVEEMPMKPN 388 Query: 1161 VVIWGCLMGACEKFGNVEMGEWVAKHLMELEPWNDGVYVVLSNIYACNDMWEEVGRIRGI 1340 V++WGCLMG CEKFG+VEM EWVA +++ELEPWNDGVYVVL+N+YA MW++V R+R + Sbjct: 389 VMVWGCLMGGCEKFGDVEMAEWVAPYMVELEPWNDGVYVVLANVYALRGMWKDVERVRKL 448 Query: 1341 MKERKLAKLPAYSLST 1388 MK +K+AK+PAYS ++ Sbjct: 449 MKTKKVAKIPAYSYAS 464 >ref|XP_006300895.1| hypothetical protein CARUB_v10021262mg [Capsella rubella] gi|482569605|gb|EOA33793.1| hypothetical protein CARUB_v10021262mg [Capsella rubella] Length = 464 Score = 545 bits (1404), Expect = e-152 Identities = 254/421 (60%), Positives = 333/421 (79%) Frame = +3 Query: 126 QDYAKIIATQISNCSDLKQLRQIHAHIIRTHFLALHPVPFHYNNLIRSYTNLNSPQKAQF 305 QD K++AT +SNC+ L ++R+IH I R+ L +P+PF +NN++RSY +SP A Sbjct: 41 QDRNKLLATLLSNCTSLARVRRIHGDIFRSRILDQYPIPFLWNNIMRSYIRHDSPLDAVQ 100 Query: 306 LYVFMSRAGVRPDTYTLPILLKSVSQFFDFKMGRLIHVIAIKSGMDTNMYCESGFISLYS 485 +Y+ M R+ V PD YTLPI++K+ Q DF MG+ +H +A++ G + +CESGFI+LY Sbjct: 101 VYLDMVRSNVLPDRYTLPIVIKAAVQIHDFPMGKQLHSVAVRLGFVGDEFCESGFITLYF 160 Query: 486 KAGDFNSARKVFDKNSDRKLGSWNAIIAGLSQGGRSKEAIKMFIELKQSGLEADDVTMVS 665 KAG+F +AR VFD+N +RKLGSWNAI+ GL+ GR+ EA++MF+E+K+SG E DD TMVS Sbjct: 161 KAGEFVNARNVFDENPERKLGSWNAIVGGLNHAGRASEAVEMFMEMKKSGFEPDDFTMVS 220 Query: 666 VTSACGTLGDLNLALQLHRCVFQARILEKSDMLMMNSLIDMYGKCGRMDLANKVFLEMGE 845 VTSACG LG+L+LA QLH+CV QA+ +KSD++MMNSLIDMYGKCGRMDLA++VF +M + Sbjct: 221 VTSACGRLGNLSLAFQLHKCVLQAKSEDKSDIMMMNSLIDMYGKCGRMDLASQVFEKMPQ 280 Query: 846 RNVSSWTSMIVGYAMHGQVRDALECFGCMRDAGVRPNHVTFVGVLSACVHGGRVDDGKHY 1025 RNV SW+SMI YA +G +ALECF MRD GV+PN VTFVGVLSACVHGG V++GK Y Sbjct: 281 RNVVSWSSMITSYAANGNTLEALECFRQMRDIGVKPNKVTFVGVLSACVHGGLVEEGKTY 340 Query: 1026 FKMMKTRYGIAPLLQHYGCMVDLLGRAGLLVEAKGMVDAMPMKPNVVIWGCLMGACEKFG 1205 F MMK+ + + P L HYGC+VDLL R G L EAK +V+ MPMKPNV++WGCLMG CEKFG Sbjct: 341 FAMMKSEFELEPSLSHYGCIVDLLSRDGQLEEAKKVVEGMPMKPNVMVWGCLMGGCEKFG 400 Query: 1206 NVEMGEWVAKHLMELEPWNDGVYVVLSNIYACNDMWEEVGRIRGIMKERKLAKLPAYSLS 1385 +VEM EWVA+H++ELEPWNDGVYVVL+N+YA MWE+V R+R +MK++KLAK+PAYS + Sbjct: 401 DVEMAEWVARHMVELEPWNDGVYVVLANVYAVRGMWEDVERVRNMMKQKKLAKVPAYSYA 460 Query: 1386 T 1388 + Sbjct: 461 S 461 >ref|XP_006390118.1| hypothetical protein EUTSA_v10018483mg [Eutrema salsugineum] gi|557086552|gb|ESQ27404.1| hypothetical protein EUTSA_v10018483mg [Eutrema salsugineum] Length = 476 Score = 540 bits (1391), Expect = e-151 Identities = 263/458 (57%), Positives = 346/458 (75%), Gaps = 3/458 (0%) Frame = +3 Query: 24 LYIFSHLNH--TPSTPSDPQ-SIAHFVTTTHSSHQPFQDYAKIIATQISNCSDLKQLRQI 194 L IF NH TP+ +P SI+ TT+ S +D +K +AT +SNC+ L ++R+I Sbjct: 20 LTIFHRSNHSITPAAIDEPYVSISSSSTTSLSP----EDRSKFLATLLSNCNSLARVRRI 75 Query: 195 HAHIIRTHFLALHPVPFHYNNLIRSYTNLNSPQKAQFLYVFMSRAGVRPDTYTLPILLKS 374 H I R+ L + F +NN++RSY SP A +Y+ M R+ V PD Y+LPI++K+ Sbjct: 76 HGDIFRSRILDEDRIAFLWNNIMRSYVRHGSPLDALQVYLGMVRSNVSPDRYSLPIVIKA 135 Query: 375 VSQFFDFKMGRLIHVIAIKSGMDTNMYCESGFISLYSKAGDFNSARKVFDKNSDRKLGSW 554 Q D MG+ +H +A+K G + +CESGFI+LY KAG AR +FD+N +RKLGSW Sbjct: 136 AVQIHDLPMGKQLHSVAVKLGFVRDEFCESGFITLYCKAGKLRDARNLFDENPERKLGSW 195 Query: 555 NAIIAGLSQGGRSKEAIKMFIELKQSGLEADDVTMVSVTSACGTLGDLNLALQLHRCVFQ 734 NAIIAGL+Q R+ EA++MF+E+K++G E DD TMVSVTSACG+LG+L+LA QLHRCV + Sbjct: 196 NAIIAGLNQAARANEAVEMFVEMKRNGFEPDDFTMVSVTSACGSLGNLSLAFQLHRCVLE 255 Query: 735 ARILEKSDMLMMNSLIDMYGKCGRMDLANKVFLEMGERNVSSWTSMIVGYAMHGQVRDAL 914 A+ EKSD +MMNSLID+YGKCGRMDLA++VF EM RNV SWTSMI+GYA HG+ +AL Sbjct: 256 AKPEEKSDTMMMNSLIDVYGKCGRMDLASQVFDEMPHRNVVSWTSMIMGYAAHGKTLEAL 315 Query: 915 ECFGCMRDAGVRPNHVTFVGVLSACVHGGRVDDGKHYFKMMKTRYGIAPLLQHYGCMVDL 1094 ECF MRD+ VRPN VTF+GVLSACVHGG V++GK YF MMK+ +G+ P L HYGC+VDL Sbjct: 316 ECFRDMRDSSVRPNGVTFLGVLSACVHGGLVEEGKTYFAMMKSEFGLEPRLSHYGCIVDL 375 Query: 1095 LGRAGLLVEAKGMVDAMPMKPNVVIWGCLMGACEKFGNVEMGEWVAKHLMELEPWNDGVY 1274 L R G + EAK +V+ MPMKP+VV+WGCLMG CEKFG+VEM EWVA+H++ELEPWNDGVY Sbjct: 376 LSRDGQIKEAKLVVEGMPMKPSVVVWGCLMGGCEKFGDVEMAEWVAQHMIELEPWNDGVY 435 Query: 1275 VVLSNIYACNDMWEEVGRIRGIMKERKLAKLPAYSLST 1388 VVL+N+YA MW++V R+R +M+++KLAK+PAYS ++ Sbjct: 436 VVLANVYATRGMWKDVERVRKMMQDKKLAKIPAYSYAS 473 >ref|XP_002889127.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297334968|gb|EFH65386.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 466 Score = 540 bits (1391), Expect = e-151 Identities = 257/437 (58%), Positives = 336/437 (76%), Gaps = 3/437 (0%) Frame = +3 Query: 87 HFVTTTHSSHQPF---QDYAKIIATQISNCSDLKQLRQIHAHIIRTHFLALHPVPFHYNN 257 HFVT + SS QD K++AT +SNC+ L ++R+IH I R+ L +P+ F +NN Sbjct: 27 HFVTISSSSSITSLSPQDRNKLLATLLSNCTSLARVRRIHGDIFRSCILDQYPIAFLWNN 86 Query: 258 LIRSYTNLNSPQKAQFLYVFMSRAGVRPDTYTLPILLKSVSQFFDFKMGRLIHVIAIKSG 437 ++RSY +SP + +Y+ M R+ V PD YTLPI++K+ Q DF +G+ +H +A++ G Sbjct: 87 IMRSYIRHDSPLDSVQVYLGMVRSNVLPDRYTLPIVIKAAVQIHDFPLGKQLHSVAVRLG 146 Query: 438 MDTNMYCESGFISLYSKAGDFNSARKVFDKNSDRKLGSWNAIIAGLSQGGRSKEAIKMFI 617 + +CESGFI+LY KAG+ +AR VFD+N +RKLGSWNAII GL+ GR+ EA++MF+ Sbjct: 147 FVGDEFCESGFITLYCKAGELENARNVFDENPERKLGSWNAIIGGLNHAGRANEAVEMFM 206 Query: 618 ELKQSGLEADDVTMVSVTSACGTLGDLNLALQLHRCVFQARILEKSDMLMMNSLIDMYGK 797 E+++SG E DD TMVSVTSACG LGDLNLA QLH+CV QA+ EKSD++MMNSLIDMYGK Sbjct: 207 EMRRSGFEPDDFTMVSVTSACGGLGDLNLAFQLHKCVLQAKTEEKSDVMMMNSLIDMYGK 266 Query: 798 CGRMDLANKVFLEMGERNVSSWTSMIVGYAMHGQVRDALECFGCMRDAGVRPNHVTFVGV 977 CGRMD A +VF EM +RNV SW+SMI GYA +G +ALECF MR+ GVRPN +TFVGV Sbjct: 267 CGRMDFAIQVFEEMPQRNVVSWSSMITGYAANGNTLEALECFRQMREFGVRPNKITFVGV 326 Query: 978 LSACVHGGRVDDGKHYFKMMKTRYGIAPLLQHYGCMVDLLGRAGLLVEAKGMVDAMPMKP 1157 LSACVHGG V++GK YF MMK+ + + P L HYGC+VDLL R G L EAK +V+ MPMKP Sbjct: 327 LSACVHGGLVEEGKAYFAMMKSEFNLEPGLSHYGCIVDLLSRDGQLKEAKKVVEEMPMKP 386 Query: 1158 NVVIWGCLMGACEKFGNVEMGEWVAKHLMELEPWNDGVYVVLSNIYACNDMWEEVGRIRG 1337 NV++WGCLMG CEKFG+VEM EWVA +++ELEPWNDGVYVVL+N+YA MW++V R+R Sbjct: 387 NVMVWGCLMGGCEKFGDVEMAEWVAPYMVELEPWNDGVYVVLANVYALKGMWKDVERVRK 446 Query: 1338 IMKERKLAKLPAYSLST 1388 +MKE+K+AK+PAYS ++ Sbjct: 447 VMKEKKVAKIPAYSYAS 463 >gb|ESW32292.1| hypothetical protein PHAVU_002G309900g [Phaseolus vulgaris] Length = 423 Score = 511 bits (1316), Expect = e-142 Identities = 244/421 (57%), Positives = 312/421 (74%) Frame = +3 Query: 129 DYAKIIATQISNCSDLKQLRQIHAHIIRTHFLALHPVPFHYNNLIRSYTNLNSPQKAQFL 308 D ++A + NC+ +++L +++AH++RTHF +P PF++NN+IRSYT L +P+ A + Sbjct: 35 DLVALVAAHLCNCATVRELNRVYAHVLRTHFFISNPAPFNWNNIIRSYTRLEAPRNALRI 94 Query: 309 YVFMSRAGVRPDTYTLPILLKSVSQFFDFKMGRLIHVIAIKSGMDTNMYCESGFISLYSK 488 +V M R GV PD YTLPI+LK+VSQ +D K+G+ +H + IK G+ N +CE+GF+SLY K Sbjct: 95 HVLMLRNGVLPDCYTLPIVLKAVSQTYDVKLGKQVHSVGIKVGLQCNEFCETGFLSLYFK 154 Query: 489 AGDFNSARKVFDKNSDRKLGSWNAIIAGLSQGGRSKEAIKMFIELKQSGLEADDVTMVSV 668 AG+F SAR VFD+N D KLGSWNA+I GLSQ G ++AI +F+++++ G D +TMV+V Sbjct: 155 AGEFGSARMVFDENPDPKLGSWNAVIGGLSQSGLVRDAISVFLDMRRRGFAPDGLTMVNV 214 Query: 669 TSACGTLGDLNLALQLHRCVFQARILEKSDMLMMNSLIDMYGKCGRMDLANKVFLEMGER 848 TSACG +GDLNLALQLH+CVFQ E++D+LM+NSLIDMYGKCGR+DLA KVF M ER Sbjct: 215 TSACGKIGDLNLALQLHKCVFQVEAGERTDVLMLNSLIDMYGKCGRLDLAYKVFATMEER 274 Query: 849 NVSSWTSMIVGYAMHGQVRDALECFGCMRDAGVRPNHVTFVGVLSACVHGGRVDDGKHYF 1028 +VSSWTSMIV ACVHGG V +G+ YF Sbjct: 275 SVSSWTSMIV----------------------------------VACVHGGAVREGRIYF 300 Query: 1029 KMMKTRYGIAPLLQHYGCMVDLLGRAGLLVEAKGMVDAMPMKPNVVIWGCLMGACEKFGN 1208 MMK YGIAP LQHYGCMVDLLGRAGLL +A+ MV+ MPMKPN V+WGCLMGACEK+GN Sbjct: 301 DMMKNVYGIAPQLQHYGCMVDLLGRAGLLEDARRMVEEMPMKPNSVVWGCLMGACEKYGN 360 Query: 1209 VEMGEWVAKHLMELEPWNDGVYVVLSNIYACNDMWEEVGRIRGIMKERKLAKLPAYSLST 1388 V+M EWVAKHL ELEPW+DGVYVVLSNIYA +W+EV RIR +M+E +LAK+PAYSL+T Sbjct: 361 VDMAEWVAKHLQELEPWSDGVYVVLSNIYANRGLWKEVERIRSVMEEGRLAKIPAYSLTT 420 Query: 1389 N 1391 N Sbjct: 421 N 421 >gb|EPS67797.1| hypothetical protein M569_06976 [Genlisea aurea] Length = 503 Score = 498 bits (1281), Expect = e-138 Identities = 243/415 (58%), Positives = 308/415 (74%), Gaps = 2/415 (0%) Frame = +3 Query: 144 IATQISNCSDLKQLRQIHAHIIRTHFLALHPVPFHYNNLIRSYTNLNSPQKAQFLYVFMS 323 +A +S S+L++LRQ+ A + RTHFLAL+P FHYNN+IR + L SP++A LY M Sbjct: 84 VALLVSESSNLRRLRQVFALMFRTHFLALNPESFHYNNVIRIFIRLESPKEALRLYGAMR 143 Query: 324 RAGVRPDTYTLPILLKSVSQFFDFKMGRLIHVIAIKSGMDTNMYCESGFISLYSKAGDFN 503 RAGV PD+YT P++LK+ Q DF + +H I K G++ ++YCESG IS Y KAG Sbjct: 144 RAGVPPDSYTFPMVLKAAGQSPDFTLSEQLHAIPFKYGLEGDVYCESGLISSYCKAGKIE 203 Query: 504 SARKVFDKNSDRKLGSWNAIIAGLSQGGRSKEAIKMFIELKQSGLEADDVTMVSVTSACG 683 S +VF +S RKLG+WNA IAGLSQGGR KEA+ MF+ + ++G+ DD+T+V +TS C Sbjct: 204 SGLRVFSYSSGRKLGAWNAAIAGLSQGGRVKEALDMFLSMMRTGIVPDDITIVCLTSNCA 263 Query: 684 TLGDLNLALQLHRCVFQARILEKSDMLMMNSLIDMYGKCGRMDLANKVFLEMGERNVSSW 863 +G+LNLA+QLH+ V Q +I ++D+LMMNSL+DMYGKCGRMDLA +VF E+ +NVSSW Sbjct: 264 AIGNLNLAMQLHKFVLQVKISSRTDLLMMNSLVDMYGKCGRMDLAYRVFTEIEVKNVSSW 323 Query: 864 TSMIVGYAMHGQVRDALECFGCMRDAGVRPNHVTFVGVLSACVHGGRVDDGKHYFKMMKT 1043 TSMIVGYA G V ++L+ F M +GV PN ++FVGVLSACVHGG V +G+ YF M Sbjct: 324 TSMIVGYAAQGYVHESLDSFHRMIGSGVSPNAISFVGVLSACVHGGMVREGREYFDAMVN 383 Query: 1044 RYGIAPLLQHYGCMVDLLGRAGLLVEAKGMVDAMPMKPNVVIWGCLMGACEKFGNVEMGE 1223 + + P L HYGCM DLLGRAGLL E + MV+ MPMKPN V+WGCLMGACEKFG+V+MGE Sbjct: 384 GFRLEPKLAHYGCMADLLGRAGLLNEVREMVETMPMKPNAVVWGCLMGACEKFGDVKMGE 443 Query: 1224 WVAKHLMELEPWNDGVYVVLSNIYACNDMWEEVGRIRGIMKERKL--AKLPAYSL 1382 WV K LMELEP NDGV+V LSNIYA N MW+EV +R MKER + KLPAYSL Sbjct: 444 WVGKQLMELEPENDGVFVALSNIYATNGMWDEVETMRETMKERLMGSTKLPAYSL 498 >ref|XP_002514235.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223546691|gb|EEF48189.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 352 Score = 400 bits (1029), Expect = e-109 Identities = 197/333 (59%), Positives = 252/333 (75%), Gaps = 4/333 (1%) Frame = +3 Query: 18 KPLYIFSHLNHTPSTPSDPQSIAHFVTTTHSSHQPF----QDYAKIIATQISNCSDLKQL 185 K L I HLNH S+A F ++ + P QD AK ATQ SNC+ L+ L Sbjct: 19 KNLTISCHLNH---------SLAFFSASSDAQTPPLPQSAQDIAKWAATQSSNCTTLRDL 69 Query: 186 RQIHAHIIRTHFLALHPVPFHYNNLIRSYTNLNSPQKAQFLYVFMSRAGVRPDTYTLPIL 365 QI+AHII + L + PFH+NN+IRSYT L++P KA +Y+ MSRAGV PD+YTLPI+ Sbjct: 70 NQIYAHIICSDLLHFYSAPFHWNNIIRSYTRLDAPVKALQVYISMSRAGVLPDSYTLPIV 129 Query: 366 LKSVSQFFDFKMGRLIHVIAIKSGMDTNMYCESGFISLYSKAGDFNSARKVFDKNSDRKL 545 LK+ Q F ++G+ + +AI+ G+++N YCESGFIS YSK+GD +A K+F++N +RKL Sbjct: 130 LKAACQIFSIEIGKQLQSVAIRLGLESNEYCESGFISFYSKSGDIKNAYKMFEENPERKL 189 Query: 546 GSWNAIIAGLSQGGRSKEAIKMFIELKQSGLEADDVTMVSVTSACGTLGDLNLALQLHRC 725 GSWNAII GLSQGG +KEAI++FIE+++ G DDVTMVSV SACG+LGDLNLA+QLH+ Sbjct: 190 GSWNAIIGGLSQGGHAKEAIEIFIEMRKCGFVPDDVTMVSVISACGSLGDLNLAIQLHKY 249 Query: 726 VFQARILEKSDMLMMNSLIDMYGKCGRMDLANKVFLEMGERNVSSWTSMIVGYAMHGQVR 905 VF A + ++++L+MNSLIDMYGKCGRMDLA +VF MGE+NVSSWTSMIVGY MHG V Sbjct: 250 VFHANVFGRTNILVMNSLIDMYGKCGRMDLARRVFCTMGEKNVSSWTSMIVGYGMHGHVN 309 Query: 906 DALECFGCMRDAGVRPNHVTFVGVLSACVHGGR 1004 +A+E F CMR+AGVRPNHVTF+GVLS CVHGG+ Sbjct: 310 EAIEYFHCMREAGVRPNHVTFIGVLSVCVHGGK 342 >ref|XP_003567066.1| PREDICTED: pentatricopeptide repeat-containing protein At1g77170-like [Brachypodium distachyon] Length = 433 Score = 400 bits (1028), Expect = e-109 Identities = 205/429 (47%), Positives = 276/429 (64%), Gaps = 4/429 (0%) Frame = +3 Query: 108 SSHQPFQDYAKIIATQISNCSDLKQLRQIHAHIIRTHFLALHPVPFHYNNLIRSYTNLNS 287 + H+P A + A ++ +C D + HAH++R L L PFH+N L R+Y S Sbjct: 8 AQHEPLGREADLAAARLESCDDGRLPPLFHAHLLRHGLLLL---PFHWNALTRAYLRHGS 64 Query: 288 PQKAQFLYVFMSRAGVRPDTYTLPILLKSVSQFFD--FKMGRLIHVIAIKSGMDTNMYCE 461 + A + M R PD YT P+ LK+ +Q + R H A K G+ + + E Sbjct: 65 SRSALCVAAHMFRCAAHPDRYTFPLALKAAAQGEPPISSLRRQFHAAAAKRGLARHPFTE 124 Query: 462 SGFISLYSKAGDFNSARKVFDKNSDRKLGSWNAIIAGLSQGGRSKEAIKMFIELKQSGLE 641 S IS YSKAGD ++AR+VFD+N R LGSWNAII+GLSQ G SKE + +F+++++ G+ Sbjct: 125 SALISCYSKAGDLDAARRVFDENPHRGLGSWNAIISGLSQAGESKEPLALFVKMRRCGVV 184 Query: 642 ADDVTMVSVTSACGTLGDLNLALQLHRCVFQARILEKSDMLMMNSLIDMYGKCGRMDLAN 821 DD+TMVS+ S+C +GD+ L QLH+C+ Q + + D+ + N+LIDMY KCGR DLA Sbjct: 185 PDDLTMVSLVSSCCAVGDIGLVEQLHKCMLQCKHSSRLDVTLSNALIDMYAKCGRTDLAG 244 Query: 822 KVFLEMGERNVSSWTSMIVGYAMHGQVRDALECFGCMRDAGVRPNHVTFVGVLSACVHGG 1001 +VF M R+VSSWT+MI G A HG+ + AL+ F M+ GV PN VT + VLSAC H G Sbjct: 245 RVFERMPLRDVSSWTTMITGLATHGEEQRALKKFDEMKSEGVPPNRVTMLAVLSACAHRG 304 Query: 1002 RVDDGKHYFKMMKT-RYGIAPLLQHYGCMVDLLGRAGLLVEAKGMVD-AMPMKPNVVIWG 1175 VD G K M+ +AP ++HYGC+VDLLGR G + +A+ +V+ MPM+ NVVIWG Sbjct: 305 LVDTGMGLLKQMEDGEIKVAPTVEHYGCLVDLLGRVGWVDDARALVEHRMPMEANVVIWG 364 Query: 1176 CLMGACEKFGNVEMGEWVAKHLMELEPWNDGVYVVLSNIYACNDMWEEVGRIRGIMKERK 1355 L+GACEK GNV +GEW A+ L E EPWNDGVYVVLSN+YA MW EV R+R +M RK Sbjct: 365 TLLGACEKHGNVSVGEWAAERLQEAEPWNDGVYVVLSNVYAAAGMWGEVERVRKMMSGRK 424 Query: 1356 LAKLPAYSL 1382 + K P SL Sbjct: 425 VTKFPGCSL 433 >ref|XP_004972152.1| PREDICTED: pentatricopeptide repeat-containing protein At1g77170-like [Setaria italica] Length = 437 Score = 399 bits (1024), Expect = e-108 Identities = 210/437 (48%), Positives = 279/437 (63%), Gaps = 4/437 (0%) Frame = +3 Query: 84 AHFVTTTHSSHQPFQDYAKIIATQISNCSDLKQLRQIHAHIIRTHFLALHPVPFHYNNLI 263 AH P A + A ++ +C+D + +IHA ++R L L PFH+N L Sbjct: 4 AHAPAAYRDPLPPHHHAADLAAARLESCADGRLPPRIHARLLRRGLLLL---PFHWNALT 60 Query: 264 RSYTNLNSPQKAQFLYVFMSRAGVRPDTYTLPILLKSVSQFFD--FKMGRLIHVIAIKSG 437 R+Y +L SP+ A M G D YT P+ LK+ +Q + +H A+K G Sbjct: 61 RAYLHLGSPRSALRAAACMLAHGAALDHYTFPLALKAAAQAEPPGSSLRLQLHAAAVKRG 120 Query: 438 MDTNMYCESGFISLYSKAGDFNSARKVFDKNSDRKLGSWNAIIAGLSQGGRSKEAIKMFI 617 + + + ES IS Y+KAGD +AR+VFD+N +R+LGSWNAII+GLSQ G EA+ +F Sbjct: 121 LARHPFTESALISGYAKAGDLGAARRVFDENPNRELGSWNAIISGLSQAGEPMEALALFH 180 Query: 618 ELKQSGLEADDVTMVSVTSACGTLGDLNLALQLHRCVFQARILEKSDMLMMNSLIDMYGK 797 EL++ G+ DD+TMVSV SAC LGD+ LA QLH+C+ Q + + D+ + N+L+DMY K Sbjct: 181 ELRRGGMVPDDLTMVSVASACCVLGDIGLAEQLHKCILQCQRSGRLDVTLSNALVDMYAK 240 Query: 798 CGRMDLANKVFLEMGERNVSSWTSMIVGYAMHGQVRDALECFGCMRDAGVRPNHVTFVGV 977 CGR DLA +VF M R+VSSWT+MI G A HG+ + AL+ F M+ V PN VT + V Sbjct: 241 CGRTDLARRVFDRMPVRDVSSWTTMITGLATHGEEQGALDMFDNMQREPVPPNRVTMLAV 300 Query: 978 LSACVHGGRVDDGKHYFKMMKT-RYGIAPLLQHYGCMVDLLGRAGLLVEAKGMVD-AMPM 1151 LSAC HGG VD G K M+ + P ++HYGC+VD+LGR G + EA+ +V+ MPM Sbjct: 301 LSACAHGGLVDRGLGLLKQMEDGEIKVVPTVEHYGCVVDMLGRVGRVDEARALVEQRMPM 360 Query: 1152 KPNVVIWGCLMGACEKFGNVEMGEWVAKHLMELEPWNDGVYVVLSNIYACNDMWEEVGRI 1331 + NVVIWG L+GACEK GNV +GEW A+ L+E EPWNDGVYVVLSNIYA MW EV R+ Sbjct: 361 EGNVVIWGTLLGACEKHGNVSVGEWAAERLVEAEPWNDGVYVVLSNIYAAAGMWGEVERV 420 Query: 1332 RGIMKERKLAKLPAYSL 1382 R IM ER + K P SL Sbjct: 421 RKIMSERNVVKSPGCSL 437