BLASTX nr result
ID: Akebia23_contig00017785
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia23_contig00017785 (1951 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003635394.1| PREDICTED: putative pentatricopeptide repeat... 944 0.0 gb|EXC12605.1| hypothetical protein L484_012982 [Morus notabilis] 933 0.0 emb|CAN71515.1| hypothetical protein VITISV_021787 [Vitis vinifera] 922 0.0 ref|XP_004149630.1| PREDICTED: pentatricopeptide repeat-containi... 899 0.0 ref|XP_004159605.1| PREDICTED: pentatricopeptide repeat-containi... 897 0.0 ref|XP_006348483.1| PREDICTED: pentatricopeptide repeat-containi... 897 0.0 ref|XP_004513407.1| PREDICTED: putative pentatricopeptide repeat... 895 0.0 ref|XP_006426145.1| hypothetical protein CICLE_v10025134mg [Citr... 889 0.0 ref|XP_006466418.1| PREDICTED: putative pentatricopeptide repeat... 885 0.0 ref|XP_003546958.1| PREDICTED: pentatricopeptide repeat-containi... 882 0.0 ref|XP_007047616.1| Pentatricopeptide repeat (PPR) superfamily p... 880 0.0 ref|XP_006595472.1| PREDICTED: putative pentatricopeptide repeat... 877 0.0 gb|EYU29622.1| hypothetical protein MIMGU_mgv1a023801mg [Mimulus... 846 0.0 ref|XP_002530608.1| pentatricopeptide repeat-containing protein,... 833 0.0 ref|XP_006404107.1| hypothetical protein EUTSA_v10010190mg [Eutr... 813 0.0 ref|NP_201383.1| pentatricopeptide repeat-containing protein [Ar... 805 0.0 gb|EPS62602.1| hypothetical protein M569_12187, partial [Genlise... 800 0.0 ref|XP_002866691.1| pentatricopeptide repeat-containing protein ... 800 0.0 ref|NP_190542.4| pentatricopeptide repeat-containing protein [Ar... 798 0.0 emb|CAB66911.1| putative protein [Arabidopsis thaliana] 798 0.0 >ref|XP_003635394.1| PREDICTED: putative pentatricopeptide repeat-containing protein At5g65820-like [Vitis vinifera] Length = 622 Score = 944 bits (2440), Expect = 0.0 Identities = 461/604 (76%), Positives = 516/604 (85%), Gaps = 6/604 (0%) Frame = +1 Query: 157 NEDESMVGTRISNLS------TDRNRGSRFICLENKPNIYSNHQNSDEYSADVEMVYRIL 318 N D+ R+SN ++R G + LE+ + QN DE+SADVE VYRIL Sbjct: 6 NTDQVHPNHRLSNFGDKNCTISERRGGFGLVRLESNRENCTYDQNYDEFSADVEKVYRIL 65 Query: 319 RKFHSRVPKLELALQESGIIVRSGLVERVLNRCGDAGNLGFRFFNWASKQPGYRHSYEVY 498 RKFHSRVPKLELALQESG+ VRSGL ERVLNRCGDAGNLG+RFF WASKQPGYRHSYEVY Sbjct: 66 RKFHSRVPKLELALQESGVAVRSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSYEVY 125 Query: 499 KSMIKILGKMRQFGAVWALIEEMRKDNPQFLTVDAFVVLMRRFASARMVQKAIEVLDEMP 678 K+MIKILGKMRQFGAVWALIEEMR++NPQF++ FVVLMRRFASARMV+KAIEVLDEMP Sbjct: 126 KAMIKILGKMRQFGAVWALIEEMRRENPQFVSPYVFVVLMRRFASARMVKKAIEVLDEMP 185 Query: 679 KYGCKPDEYVFGCLLDALCKNGSVKEAASLFEDMRIKFKPNVKHFTSLLYGWCKVGKLME 858 KYGC+PDE+VFGCLLDALCKNGSVKEAASLFEDMRI+F P +KHFTSLLYGWC+ GKLME Sbjct: 186 KYGCEPDEHVFGCLLDALCKNGSVKEAASLFEDMRIRFTPTLKHFTSLLYGWCREGKLME 245 Query: 859 AKFILVQMKEAGFEPDIVVYNNLLSGYALAGKMQDAFDLLQEMKRKGCDPNATSYTILIQ 1038 AK++LVQ++EAGFEPDIVVYNNLL+GYA AGKM DA+DLL+EM+RK C+PN S+T LIQ Sbjct: 246 AKYVLVQIREAGFEPDIVVYNNLLTGYAAAGKMVDAYDLLKEMRRKECEPNVMSFTTLIQ 305 Query: 1039 ALCSREKMEEAMRVFVEMRMNGCVADVVTYTTLISGFCKWGKIDKGYELLDSMIQQGCTP 1218 ALC+++KMEEAMRVF EM+ GC AD VTYTTLISGFCKWGKI KGYELLD+MIQQG P Sbjct: 306 ALCAKKKMEEAMRVFFEMQSCGCPADAVTYTTLISGFCKWGKISKGYELLDNMIQQGHIP 365 Query: 1219 NQMTYFHVLVAHXXXXXXXXXXXXXXXXXXIDCIPDLNIYNTVIRLACKLGEVEEGIRAW 1398 N MTY H++ AH I C PDLNIYN VIRLACKLGE++EG+R W Sbjct: 366 NPMTYLHIMAAHEKKEELEECIELMEEMRKIGCTPDLNIYNIVIRLACKLGEIKEGVRVW 425 Query: 1399 NDMEANGFSPGLDTFTIMIHGFLGQGRLIEACSYFKEMVGRGLLSAPQYGTLKELLNSVL 1578 N+MEA G SPGLDTF IMIHGFL Q L+EAC +FKEMVGRGLLSAPQYGTLKELLNS+L Sbjct: 426 NEMEATGLSPGLDTFVIMIHGFLSQRCLVEACEFFKEMVGRGLLSAPQYGTLKELLNSLL 485 Query: 1579 RAEKLEMAKDVWSCIVSKGCDINVFAWTIWIHALFSKGHVKEACSYCLDMLDNGVMPQPD 1758 RAEKLEM+KDVWSCI++KGCD+NV+AWTIWIHALFS GHVKEACSYCLDM+D GVMPQPD Sbjct: 486 RAEKLEMSKDVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPD 545 Query: 1759 TFAKLMRGLRKLYNRQIAAEITEKVRKMAADRQMTFKMYKRRGERDLKEKVKMKKDGRKR 1938 TFAKLMRGLRKLYNRQIAAEITEKVRKMAA+R+MTFKMYKRRGER+LKEK+K KDGRKR Sbjct: 546 TFAKLMRGLRKLYNRQIAAEITEKVRKMAAEREMTFKMYKRRGERNLKEKIKEAKDGRKR 605 Query: 1939 RARR 1950 RARR Sbjct: 606 RARR 609 >gb|EXC12605.1| hypothetical protein L484_012982 [Morus notabilis] Length = 638 Score = 933 bits (2412), Expect = 0.0 Identities = 446/592 (75%), Positives = 512/592 (86%), Gaps = 2/592 (0%) Frame = +1 Query: 181 TRISNLSTDRNRGSRF--ICLENKPNIYSNHQNSDEYSADVEMVYRILRKFHSRVPKLEL 354 T+ S+ NR + F + LE P + + + DE+S DVE +YRILRKFHSRV KLEL Sbjct: 34 TQFSSTQNPHNRATGFSPVHLEQNPVVSDDDETHDEFSGDVEKIYRILRKFHSRVSKLEL 93 Query: 355 ALQESGIIVRSGLVERVLNRCGDAGNLGFRFFNWASKQPGYRHSYEVYKSMIKILGKMRQ 534 ALQESG+++RSGL ERVL RCGDAG+LG+RFF WASKQPGYR SYEVYK+MI+ LGKMRQ Sbjct: 94 ALQESGVVLRSGLTERVLGRCGDAGSLGYRFFVWASKQPGYRPSYEVYKAMIRALGKMRQ 153 Query: 535 FGAVWALIEEMRKDNPQFLTVDAFVVLMRRFASARMVQKAIEVLDEMPKYGCKPDEYVFG 714 FGAVWAL+EEMRK+NPQ +T + FVVLMRRFASARMV+KA+EV DEMPKYGC+PDE+VFG Sbjct: 154 FGAVWALLEEMRKENPQLITPEIFVVLMRRFASARMVKKAVEVFDEMPKYGCEPDEHVFG 213 Query: 715 CLLDALCKNGSVKEAASLFEDMRIKFKPNVKHFTSLLYGWCKVGKLMEAKFILVQMKEAG 894 CLLDALCKNGSVKEAASLFE+MR+KF P++KHFTSLLYGWC+ GKLMEAKF+LVQMKEAG Sbjct: 214 CLLDALCKNGSVKEAASLFEEMRVKFTPSLKHFTSLLYGWCREGKLMEAKFVLVQMKEAG 273 Query: 895 FEPDIVVYNNLLSGYALAGKMQDAFDLLQEMKRKGCDPNATSYTILIQALCSREKMEEAM 1074 FEPD+VVYNNLL GYA AGKM DA+DL++EM+ KGC PNA SYT+LIQALC REKMEEAM Sbjct: 274 FEPDVVVYNNLLGGYAQAGKMADAYDLMKEMRGKGCSPNAASYTVLIQALCKREKMEEAM 333 Query: 1075 RVFVEMRMNGCVADVVTYTTLISGFCKWGKIDKGYELLDSMIQQGCTPNQMTYFHVLVAH 1254 RVFVEM+ +GC ADV+TYTTLISGFCKWGKI++GYE+LDSMIQ+G +PN+ TY H+++AH Sbjct: 334 RVFVEMQRSGCDADVMTYTTLISGFCKWGKIERGYEILDSMIQRGFSPNETTYLHIMLAH 393 Query: 1255 XXXXXXXXXXXXXXXXXXIDCIPDLNIYNTVIRLACKLGEVEEGIRAWNDMEANGFSPGL 1434 I C+PDL IYNTVIRLACKL EV+EG+R WN++EA+G SPGL Sbjct: 394 EKKEEFEECVELIGEMRKIGCVPDLKIYNTVIRLACKLREVKEGVRLWNEIEASGLSPGL 453 Query: 1435 DTFTIMIHGFLGQGRLIEACSYFKEMVGRGLLSAPQYGTLKELLNSVLRAEKLEMAKDVW 1614 DTF +MIHGFLGQG LIEAC YFKEMV RGLLS PQYGTLKELLN++LRA+KLEMAKDVW Sbjct: 454 DTFVVMIHGFLGQGCLIEACQYFKEMVERGLLSGPQYGTLKELLNALLRADKLEMAKDVW 513 Query: 1615 SCIVSKGCDINVFAWTIWIHALFSKGHVKEACSYCLDMLDNGVMPQPDTFAKLMRGLRKL 1794 +CIV+KGC+INV+AWTIWIHALF GHVKEACSYCLDM+D VMPQPDTFAKLMRGL+KL Sbjct: 514 TCIVNKGCEINVYAWTIWIHALFKNGHVKEACSYCLDMMDADVMPQPDTFAKLMRGLKKL 573 Query: 1795 YNRQIAAEITEKVRKMAADRQMTFKMYKRRGERDLKEKVKMKKDGRKRRARR 1950 YNRQIAAEITEKVRKMA DRQMTFKMYKRRGERDLKEK K K++GRKRRARR Sbjct: 574 YNRQIAAEITEKVRKMAEDRQMTFKMYKRRGERDLKEKAKEKQNGRKRRARR 625 >emb|CAN71515.1| hypothetical protein VITISV_021787 [Vitis vinifera] Length = 655 Score = 922 bits (2382), Expect = 0.0 Identities = 444/556 (79%), Positives = 493/556 (88%) Frame = +1 Query: 283 YSADVEMVYRILRKFHSRVPKLELALQESGIIVRSGLVERVLNRCGDAGNLGFRFFNWAS 462 + A + VYRILRKFHSRVPKLELALQESG+ VRSGL ERVLNRCGDAGNLG+RFF WAS Sbjct: 87 WKAIEKTVYRILRKFHSRVPKLELALQESGVAVRSGLTERVLNRCGDAGNLGYRFFVWAS 146 Query: 463 KQPGYRHSYEVYKSMIKILGKMRQFGAVWALIEEMRKDNPQFLTVDAFVVLMRRFASARM 642 KQPGYRHSYEVYK+MIKILGKMRQFGAVWALIEEMR++NPQF++ FVVLMRRFASARM Sbjct: 147 KQPGYRHSYEVYKAMIKILGKMRQFGAVWALIEEMRRENPQFVSPYVFVVLMRRFASARM 206 Query: 643 VQKAIEVLDEMPKYGCKPDEYVFGCLLDALCKNGSVKEAASLFEDMRIKFKPNVKHFTSL 822 V+KAIEVLDEMPKYGC+PDE+VFGCLLDALCKNGSVKEAASLFEDMRI+F P +KHFTSL Sbjct: 207 VKKAIEVLDEMPKYGCEPDEHVFGCLLDALCKNGSVKEAASLFEDMRIRFTPTLKHFTSL 266 Query: 823 LYGWCKVGKLMEAKFILVQMKEAGFEPDIVVYNNLLSGYALAGKMQDAFDLLQEMKRKGC 1002 LYGWC+ GKLMEAK++LVQ++EAGFEPDIVVYNNLL+GYA AGKM DA+DLL+EM+RK C Sbjct: 267 LYGWCREGKLMEAKYVLVQIREAGFEPDIVVYNNLLTGYAAAGKMVDAYDLLKEMRRKEC 326 Query: 1003 DPNATSYTILIQALCSREKMEEAMRVFVEMRMNGCVADVVTYTTLISGFCKWGKIDKGYE 1182 +PN S+T LIQALC+++KMEEAMRVF EM+ GC AD VTYTTLISGFCKWGKI KGYE Sbjct: 327 EPNVMSFTTLIQALCAKKKMEEAMRVFFEMQSCGCPADAVTYTTLISGFCKWGKISKGYE 386 Query: 1183 LLDSMIQQGCTPNQMTYFHVLVAHXXXXXXXXXXXXXXXXXXIDCIPDLNIYNTVIRLAC 1362 LLD+MIQQG PN MTY H++ AH I C PDLNIYN VIRLAC Sbjct: 387 LLDNMIQQGHIPNPMTYLHIMAAHEKKEELEECIELMEEMRKIGCTPDLNIYNIVIRLAC 446 Query: 1363 KLGEVEEGIRAWNDMEANGFSPGLDTFTIMIHGFLGQGRLIEACSYFKEMVGRGLLSAPQ 1542 KLGE++EG+R WN+MEA G SPGLDTF IMIHGFL Q L+EAC +FKEMVGRGLLSAPQ Sbjct: 447 KLGEIKEGVRVWNEMEATGLSPGLDTFVIMIHGFLSQRCLVEACEFFKEMVGRGLLSAPQ 506 Query: 1543 YGTLKELLNSVLRAEKLEMAKDVWSCIVSKGCDINVFAWTIWIHALFSKGHVKEACSYCL 1722 YGTLKELLNS+LRAEKLEM+KDVWSCI++KGCD+NV+AWTIWIHALFS GHVKEACSYCL Sbjct: 507 YGTLKELLNSLLRAEKLEMSKDVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCL 566 Query: 1723 DMLDNGVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVRKMAADRQMTFKMYKRRGERDLK 1902 DM+D GVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVRKMAA+R+MTFKMYKRRGER+LK Sbjct: 567 DMMDAGVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVRKMAAEREMTFKMYKRRGERNLK 626 Query: 1903 EKVKMKKDGRKRRARR 1950 EK+K KDGRKRRARR Sbjct: 627 EKIKEAKDGRKRRARR 642 >ref|XP_004149630.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like [Cucumis sativus] Length = 641 Score = 899 bits (2323), Expect = 0.0 Identities = 430/614 (70%), Positives = 514/614 (83%), Gaps = 1/614 (0%) Frame = +1 Query: 112 ENHKKSSPQT-VPIENNEDESMVGTRISNLSTDRNRGSRFICLENKPNIYSNHQNSDEYS 288 ++ K +SP +P + + S++ ++ S T + G I L+ P+ ++ +++DE+S Sbjct: 15 QSSKSNSPLIFLPKKPHLSLSLISSQTSPNGTTQRGGFGPIHLKTTPHESAHDRDADEFS 74 Query: 289 ADVEMVYRILRKFHSRVPKLELALQESGIIVRSGLVERVLNRCGDAGNLGFRFFNWASKQ 468 DVE VYRILRKFH+RVPKLELALQESG+I+RSGL ERVL+RCGDAGNLG+RFF WASKQ Sbjct: 75 VDVEKVYRILRKFHTRVPKLELALQESGVIMRSGLPERVLSRCGDAGNLGYRFFVWASKQ 134 Query: 469 PGYRHSYEVYKSMIKILGKMRQFGAVWALIEEMRKDNPQFLTVDAFVVLMRRFASARMVQ 648 PGYRHSYEVYK+MIK LGKMRQFGAVWALIEEMRK+NP LT + F+VLMRRFAS RMV+ Sbjct: 135 PGYRHSYEVYKAMIKTLGKMRQFGAVWALIEEMRKENPYMLTPEVFIVLMRRFASVRMVK 194 Query: 649 KAIEVLDEMPKYGCKPDEYVFGCLLDALCKNGSVKEAASLFEDMRIKFKPNVKHFTSLLY 828 KA+EVLDEMPKYGC+PDEYVFGCLLDALCKNGSVKEAASLFEDMR++F PN++HFTSLLY Sbjct: 195 KAVEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAASLFEDMRVRFNPNLRHFTSLLY 254 Query: 829 GWCKVGKLMEAKFILVQMKEAGFEPDIVVYNNLLSGYALAGKMQDAFDLLQEMKRKGCDP 1008 GWC+ GK+MEAK +LVQ+KEAGFEPDIVVYNNLL GYA AGKM+DAFDLL EMK+ C P Sbjct: 255 GWCREGKIMEAKHVLVQIKEAGFEPDIVVYNNLLGGYAQAGKMRDAFDLLAEMKKVNCGP 314 Query: 1009 NATSYTILIQALCSREKMEEAMRVFVEMRMNGCVADVVTYTTLISGFCKWGKIDKGYELL 1188 NA S+TILIQ+ C EKM+EAMR+F EM+ +GC ADVVTYTTLISGFCKWG DK YE+L Sbjct: 315 NAASFTILIQSFCKTEKMDEAMRIFTEMQGSGCEADVVTYTTLISGFCKWGNTDKAYEIL 374 Query: 1189 DSMIQQGCTPNQMTYFHVLVAHXXXXXXXXXXXXXXXXXXIDCIPDLNIYNTVIRLACKL 1368 D MIQ+G P+Q++Y +++AH I C+PDLNIYNT+IRL CKL Sbjct: 375 DDMIQKGHDPSQLSYLCIMMAHEKKEELEECMELIEEMRKIGCVPDLNIYNTMIRLVCKL 434 Query: 1369 GEVEEGIRAWNDMEANGFSPGLDTFTIMIHGFLGQGRLIEACSYFKEMVGRGLLSAPQYG 1548 G+++E +R W +M+A G +PGLDT+ +M+HGFL QG L+EAC YFKEMV RGLLSAPQYG Sbjct: 435 GDLKEAVRLWGEMQAGGLNPGLDTYILMVHGFLSQGCLVEACDYFKEMVERGLLSAPQYG 494 Query: 1549 TLKELLNSVLRAEKLEMAKDVWSCIVSKGCDINVFAWTIWIHALFSKGHVKEACSYCLDM 1728 TLKEL N++LRAEKLEMAK++WSC+ +KGC++NV AWTIWIHALFS GHVKEACSYCLDM Sbjct: 495 TLKELTNALLRAEKLEMAKNMWSCMTTKGCELNVSAWTIWIHALFSNGHVKEACSYCLDM 554 Query: 1729 LDNGVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVRKMAADRQMTFKMYKRRGERDLKEK 1908 +D +MPQPDTFAKLMRGL+KL++RQ+A EITEKVRKMAADRQ+TFKMYKRRGERDLKEK Sbjct: 555 MDADLMPQPDTFAKLMRGLKKLFHRQLAVEITEKVRKMAADRQITFKMYKRRGERDLKEK 614 Query: 1909 VKMKKDGRKRRARR 1950 +K K DGRKRRARR Sbjct: 615 IKAKIDGRKRRARR 628 >ref|XP_004159605.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like [Cucumis sativus] Length = 664 Score = 897 bits (2318), Expect = 0.0 Identities = 430/616 (69%), Positives = 508/616 (82%) Frame = +1 Query: 103 QTTENHKKSSPQTVPIENNEDESMVGTRISNLSTDRNRGSRFICLENKPNIYSNHQNSDE 282 Q H +S + N S+ + S T + G I L+ P+ ++ +++DE Sbjct: 36 QVPSTHPESCSNFLSFMLNCGVSLFPFKTSPNGTTQRGGFGPIHLKTTPHESAHDRDADE 95 Query: 283 YSADVEMVYRILRKFHSRVPKLELALQESGIIVRSGLVERVLNRCGDAGNLGFRFFNWAS 462 +S DVE VYRILRKFH+RVPKLELALQESG+I+RSGL ERVL+RCGDAGNLG+RFF WAS Sbjct: 96 FSVDVEKVYRILRKFHTRVPKLELALQESGVIMRSGLPERVLSRCGDAGNLGYRFFVWAS 155 Query: 463 KQPGYRHSYEVYKSMIKILGKMRQFGAVWALIEEMRKDNPQFLTVDAFVVLMRRFASARM 642 KQPGYRHSYEVYK+MIK LGKMRQFGAVWALIEEMRK+NP LT + F+VLMRRFAS RM Sbjct: 156 KQPGYRHSYEVYKAMIKTLGKMRQFGAVWALIEEMRKENPYMLTPEVFIVLMRRFASVRM 215 Query: 643 VQKAIEVLDEMPKYGCKPDEYVFGCLLDALCKNGSVKEAASLFEDMRIKFKPNVKHFTSL 822 V+KA+EVLDEMPKYGC+PDEYVFGCLLDALCKNGSVKEAASLFEDMR++F PN++HFTSL Sbjct: 216 VKKAVEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAASLFEDMRVRFNPNLRHFTSL 275 Query: 823 LYGWCKVGKLMEAKFILVQMKEAGFEPDIVVYNNLLSGYALAGKMQDAFDLLQEMKRKGC 1002 LYGWC+ GK+MEAK +LVQ+KEAGFEPDIVVYNNLL GYA AGKM+DAFDLL EMK+ C Sbjct: 276 LYGWCREGKIMEAKHVLVQIKEAGFEPDIVVYNNLLGGYAQAGKMRDAFDLLAEMKKVNC 335 Query: 1003 DPNATSYTILIQALCSREKMEEAMRVFVEMRMNGCVADVVTYTTLISGFCKWGKIDKGYE 1182 PNA S+TILIQ+ C EKM+EAMR+F EM+ +GC ADVVTYTTLISGFCKWG DK YE Sbjct: 336 GPNAASFTILIQSFCKTEKMDEAMRIFTEMQGSGCEADVVTYTTLISGFCKWGNTDKAYE 395 Query: 1183 LLDSMIQQGCTPNQMTYFHVLVAHXXXXXXXXXXXXXXXXXXIDCIPDLNIYNTVIRLAC 1362 +LD MIQ+G P+Q++Y +++AH I C+PDLNIYNT+IRL C Sbjct: 396 ILDDMIQKGHDPSQLSYLCIMMAHEKKEELEECMELIEEMRKIGCVPDLNIYNTMIRLVC 455 Query: 1363 KLGEVEEGIRAWNDMEANGFSPGLDTFTIMIHGFLGQGRLIEACSYFKEMVGRGLLSAPQ 1542 KLG+++E +R W +M+A G +PGLDT+ +M+HGFL QG L+EAC YFKEMV RGLLSAPQ Sbjct: 456 KLGDLKEAVRLWGEMQAGGLNPGLDTYILMVHGFLSQGCLVEACDYFKEMVERGLLSAPQ 515 Query: 1543 YGTLKELLNSVLRAEKLEMAKDVWSCIVSKGCDINVFAWTIWIHALFSKGHVKEACSYCL 1722 YGTLKEL N++LRAEKLEMAK++WSC+ +KGC++NV AWTIWIHALFS GHVKEACSYCL Sbjct: 516 YGTLKELTNALLRAEKLEMAKNMWSCMTTKGCELNVSAWTIWIHALFSNGHVKEACSYCL 575 Query: 1723 DMLDNGVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVRKMAADRQMTFKMYKRRGERDLK 1902 DM+D +MPQPDTFAKLMRGL+KL++RQ+A EITEKVRKMAADRQ+TFKMYKRRGERDLK Sbjct: 576 DMMDADLMPQPDTFAKLMRGLKKLFHRQLAVEITEKVRKMAADRQITFKMYKRRGERDLK 635 Query: 1903 EKVKMKKDGRKRRARR 1950 EK+K K DGRKRRARR Sbjct: 636 EKIKAKIDGRKRRARR 651 >ref|XP_006348483.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like [Solanum tuberosum] Length = 625 Score = 897 bits (2317), Expect = 0.0 Identities = 424/562 (75%), Positives = 491/562 (87%) Frame = +1 Query: 265 HQNSDEYSADVEMVYRILRKFHSRVPKLELALQESGIIVRSGLVERVLNRCGDAGNLGFR 444 ++N DE+SADVE VYRILRKFHSRVPKLELAL ESG++ RSGL ERVLNRCGDAGNLG+R Sbjct: 51 NKNHDEFSADVEKVYRILRKFHSRVPKLELALLESGVVARSGLTERVLNRCGDAGNLGYR 110 Query: 445 FFNWASKQPGYRHSYEVYKSMIKILGKMRQFGAVWALIEEMRKDNPQFLTVDAFVVLMRR 624 FF W SKQPGYRHS++ YK+MIKILGKMRQFG VWAL+EEMR +NPQFLT + F+VLMRR Sbjct: 111 FFVWVSKQPGYRHSHDAYKAMIKILGKMRQFGTVWALVEEMRIENPQFLTPEVFIVLMRR 170 Query: 625 FASARMVQKAIEVLDEMPKYGCKPDEYVFGCLLDALCKNGSVKEAASLFEDMRIKFKPNV 804 FAS RMV+KAIEVLDEMPKYG +PDEYVFGCLLDALCKNGSVKEAA+LF++MR +F P + Sbjct: 171 FASGRMVKKAIEVLDEMPKYGVEPDEYVFGCLLDALCKNGSVKEAAALFDEMRFRFSPTI 230 Query: 805 KHFTSLLYGWCKVGKLMEAKFILVQMKEAGFEPDIVVYNNLLSGYALAGKMQDAFDLLQE 984 KHFTSLLYGWCK GKL+EAK +LV+M+EAGFEPDIVVYNNLL+GYA++ KM DAFDLLQE Sbjct: 231 KHFTSLLYGWCKEGKLIEAKVVLVKMREAGFEPDIVVYNNLLNGYAVSRKMADAFDLLQE 290 Query: 985 MKRKGCDPNATSYTILIQALCSREKMEEAMRVFVEMRMNGCVADVVTYTTLISGFCKWGK 1164 M+RKGC+PN TS+TI+IQALC ++KMEEAMRVF++M +GC DVVTYTTLISGFCKWGK Sbjct: 291 MRRKGCNPNETSFTIVIQALCLQDKMEEAMRVFLDMERSGCEGDVVTYTTLISGFCKWGK 350 Query: 1165 IDKGYELLDSMIQQGCTPNQMTYFHVLVAHXXXXXXXXXXXXXXXXXXIDCIPDLNIYNT 1344 I+KGYEL+D+M+Q+G PNQ TY H+++AH I PD +IYN Sbjct: 351 IEKGYELVDTMLQKGYNPNQTTYLHIMLAHEKKEELEECLELVKEMGKIGIPPDHSIYNI 410 Query: 1345 VIRLACKLGEVEEGIRAWNDMEANGFSPGLDTFTIMIHGFLGQGRLIEACSYFKEMVGRG 1524 VIRLACKLGE++EG+R WN +EANG SPG+DTF IMI+GF+ QGRLIEAC +FKEM+GRG Sbjct: 411 VIRLACKLGEIDEGVRVWNQIEANGISPGVDTFIIMINGFVEQGRLIEACDHFKEMIGRG 470 Query: 1525 LLSAPQYGTLKELLNSVLRAEKLEMAKDVWSCIVSKGCDINVFAWTIWIHALFSKGHVKE 1704 LLSAPQYGTLK+LLNS+LRAEKLE+ KDVWSCI++KGC++NV AWTIWIHALFS GHVKE Sbjct: 471 LLSAPQYGTLKDLLNSLLRAEKLELCKDVWSCIMTKGCELNVSAWTIWIHALFSNGHVKE 530 Query: 1705 ACSYCLDMLDNGVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVRKMAADRQMTFKMYKRR 1884 AC+YCLDM+D G+MPQPDTFAKLM+GLRKLYNR+IAAEITEK RKMA R MTFKMYKRR Sbjct: 531 ACAYCLDMMDAGLMPQPDTFAKLMKGLRKLYNREIAAEITEKARKMAEQRNMTFKMYKRR 590 Query: 1885 GERDLKEKVKMKKDGRKRRARR 1950 GERDLKEK K + DGRKRRARR Sbjct: 591 GERDLKEKAKTQIDGRKRRARR 612 >ref|XP_004513407.1| PREDICTED: putative pentatricopeptide repeat-containing protein At5g65820-like isoform X1 [Cicer arietinum] gi|502165084|ref|XP_004513408.1| PREDICTED: putative pentatricopeptide repeat-containing protein At5g65820-like isoform X2 [Cicer arietinum] Length = 655 Score = 895 bits (2314), Expect = 0.0 Identities = 425/575 (73%), Positives = 495/575 (86%), Gaps = 1/575 (0%) Frame = +1 Query: 229 ICLENKPNIYSNHQNSDEYSADVEMVYRILRKFHSRVPKLELALQESGIIVRSGLVERVL 408 I L++ N +++ + DE+++DVE VYRILRK+HSRVPKLELAL+ESG++V SGL ERVL Sbjct: 70 IHLQSNANHFNDQNSDDEFTSDVEKVYRILRKYHSRVPKLELALKESGVVVSSGLTERVL 129 Query: 409 NRCGDAGNLGFRFFNWASKQPGYRHSYEVYKSMIKILGKMRQFGAVWALIEEMRKDNPQF 588 NRCG++GNL +RFF+WASKQ GYRHS EVYK+MIK+L KMRQFGAVWALI+EMR +NPQ Sbjct: 130 NRCGNSGNLAYRFFSWASKQSGYRHSEEVYKAMIKVLSKMRQFGAVWALIDEMRLENPQL 189 Query: 589 LTVDAFVVLMRRFASARMVQKAIEVLDEMPKYGCKPDEYVFGCLLDALCKNGSVKEAASL 768 ++ FV+LMRRFASARMV KAIEVLDEMPKYGC+PDEYVFGCLLDALCKNGS+KEAASL Sbjct: 190 ISPHVFVILMRRFASARMVHKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSIKEAASL 249 Query: 769 FEDMRIKFKPNVKHFTSLLYGWCKVGKLMEAKFILVQMKEAGFEPDIVVYNNLLSGYALA 948 FEDMR +F P VKHFTSLLYGWCK GKL+EAK +LVQMK+AG EPDIVV+NNLL GYA Sbjct: 250 FEDMRYRFPPTVKHFTSLLYGWCKEGKLVEAKHVLVQMKDAGIEPDIVVFNNLLGGYAQG 309 Query: 949 GKMQDAFDLLQEMKRKGCDPNATSYTILIQALCSREKMEEAMRVFVEMRMNGCVADVVTY 1128 GKM DA+DLL+EMKRKGC+PNA SYTILIQ+LC EK+EEAMR+FVEM+ N C DV+TY Sbjct: 310 GKMADAYDLLKEMKRKGCEPNAASYTILIQSLCKHEKLEEAMRIFVEMQRNDCQMDVITY 369 Query: 1129 TTLISGFCKWGKIDKGYELLDSMIQQGCTPNQMTYFHVLVAHXXXXXXXXXXXXXXXXXX 1308 TTLISGFCKWGKI +GYELLD MIQ+G +PNQ+TY H+++AH Sbjct: 370 TTLISGFCKWGKIKRGYELLDQMIQEGHSPNQLTYLHIMLAHEKKEELEECMELVNEMKK 429 Query: 1309 IDCIPDLNIYNTVIRLACKLGEVEEGIRAWNDMEANGFSPGLDTFTIMIHGFLGQGRLIE 1488 I C+P+LNIYNTVIRLACK GEV++G+R WN+MEA+G SPG DTF +MI+GFL Q LIE Sbjct: 430 IGCVPNLNIYNTVIRLACKFGEVKQGVRLWNEMEASGLSPGTDTFVVMINGFLEQDCLIE 489 Query: 1489 ACSYFKEMVGRGLLSAPQYGTLKELLNSVLRAEKLEMAKDVWSCI-VSKGCDINVFAWTI 1665 AC YFKEMVGRGL +APQYGTLKEL+NS+LRAEKLEMAKD W+CI SK C++NV AWTI Sbjct: 490 ACEYFKEMVGRGLFAAPQYGTLKELMNSLLRAEKLEMAKDTWNCITASKSCEMNVAAWTI 549 Query: 1666 WIHALFSKGHVKEACSYCLDMLDNGVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVRKMA 1845 WIHALFSKGHVKEACS+C+DM+DN +MPQPDTFAKL+RGL+KLYNR+ AAEITEKVRKMA Sbjct: 550 WIHALFSKGHVKEACSFCIDMMDNDLMPQPDTFAKLIRGLKKLYNREFAAEITEKVRKMA 609 Query: 1846 ADRQMTFKMYKRRGERDLKEKVKMKKDGRKRRARR 1950 ADR +TFKMYKRRGERDLKEK K KKDGRKRRAR+ Sbjct: 610 ADRHITFKMYKRRGERDLKEKEKEKKDGRKRRARQ 644 >ref|XP_006426145.1| hypothetical protein CICLE_v10025134mg [Citrus clementina] gi|557528135|gb|ESR39385.1| hypothetical protein CICLE_v10025134mg [Citrus clementina] Length = 638 Score = 889 bits (2296), Expect = 0.0 Identities = 422/593 (71%), Positives = 498/593 (83%), Gaps = 6/593 (1%) Frame = +1 Query: 190 SNLSTDRNRGSRFICLENKP------NIYSNHQNSDEYSADVEMVYRILRKFHSRVPKLE 351 S +T S +CL+ K N H + +E+S DVE ++RIL+KFHSR+PKLE Sbjct: 33 STATTTNQLNSNLVCLKTKEDDCKCNNTTDTHGSHNEFSHDVEKIFRILKKFHSRLPKLE 92 Query: 352 LALQESGIIVRSGLVERVLNRCGDAGNLGFRFFNWASKQPGYRHSYEVYKSMIKILGKMR 531 LALQ SG+++R GL ERV+NRCGDAGNLG+R++ WASKQP Y HSY+VY+++IK L KMR Sbjct: 93 LALQHSGVVLRPGLTERVINRCGDAGNLGYRYYMWASKQPNYVHSYDVYRALIKSLSKMR 152 Query: 532 QFGAVWALIEEMRKDNPQFLTVDAFVVLMRRFASARMVQKAIEVLDEMPKYGCKPDEYVF 711 +FGAVWAL+EEMRK+ PQ +T + FV+LMRRFASARMV+KAIEVLDEMPKYGC+PDE+VF Sbjct: 153 KFGAVWALMEEMRKEKPQLITTEVFVILMRRFASARMVKKAIEVLDEMPKYGCEPDEFVF 212 Query: 712 GCLLDALCKNGSVKEAASLFEDMRIKFKPNVKHFTSLLYGWCKVGKLMEAKFILVQMKEA 891 GCLLDALCKN SVKEAA LF++MR +FKP+++HFTSLLYGWCK GKL+EAK++LVQMK+A Sbjct: 213 GCLLDALCKNSSVKEAAKLFDEMRERFKPSLRHFTSLLYGWCKEGKLVEAKYVLVQMKDA 272 Query: 892 GFEPDIVVYNNLLSGYALAGKMQDAFDLLQEMKRKGCDPNATSYTILIQALCSREKMEEA 1071 GFEPDIVVYNNLLSGYA GKM DAF+LL+EM+RKGCDPNA SYT+LIQALC EKMEEA Sbjct: 273 GFEPDIVVYNNLLSGYAQMGKMTDAFELLKEMRRKGCDPNANSYTVLIQALCRMEKMEEA 332 Query: 1072 MRVFVEMRMNGCVADVVTYTTLISGFCKWGKIDKGYELLDSMIQQGCTPNQMTYFHVLVA 1251 R FVEM +GC ADVVTYTTLISGFCK KID+ YE+LDSMIQ+G PNQ+TY H+++A Sbjct: 333 NRAFVEMERSGCEADVVTYTTLISGFCKSRKIDRCYEILDSMIQRGILPNQLTYLHIMLA 392 Query: 1252 HXXXXXXXXXXXXXXXXXXIDCIPDLNIYNTVIRLACKLGEVEEGIRAWNDMEANGFSPG 1431 H I C+PD++ YN VIRLACKLGE++E + WN+MEA SPG Sbjct: 393 HEKKEELEECVELMGEMRKIGCVPDVSNYNVVIRLACKLGELKEAVNVWNEMEAASLSPG 452 Query: 1432 LDTFTIMIHGFLGQGRLIEACSYFKEMVGRGLLSAPQYGTLKELLNSVLRAEKLEMAKDV 1611 D+F +M+HGFLGQG LIEAC YFKEMVGRGLLSAPQYGTLKELLNS+LRA+K+EMAKDV Sbjct: 453 TDSFVVMVHGFLGQGCLIEACEYFKEMVGRGLLSAPQYGTLKELLNSLLRAQKVEMAKDV 512 Query: 1612 WSCIVSKGCDINVFAWTIWIHALFSKGHVKEACSYCLDMLDNGVMPQPDTFAKLMRGLRK 1791 WSCIV+KGC++NV+AWTIWIH+LFS GHVKEACSYCLDM+D VMPQPDTFAKLMRGL+K Sbjct: 513 WSCIVTKGCELNVYAWTIWIHSLFSNGHVKEACSYCLDMMDADVMPQPDTFAKLMRGLKK 572 Query: 1792 LYNRQIAAEITEKVRKMAADRQMTFKMYKRRGERDLKEKVKMKKDGRKRRARR 1950 LYNRQIAAEITEKVRKMAA+RQ+TFKMYKRRGERDLKEK K + DGRKRRAR+ Sbjct: 573 LYNRQIAAEITEKVRKMAAERQITFKMYKRRGERDLKEKAKKQVDGRKRRARQ 625 >ref|XP_006466418.1| PREDICTED: putative pentatricopeptide repeat-containing protein At5g65820-like [Citrus sinensis] Length = 638 Score = 885 bits (2286), Expect = 0.0 Identities = 420/593 (70%), Positives = 497/593 (83%), Gaps = 6/593 (1%) Frame = +1 Query: 190 SNLSTDRNRGSRFICLENKP------NIYSNHQNSDEYSADVEMVYRILRKFHSRVPKLE 351 S +T S +CL+ K N H + +E+S DVE ++RIL+KFHSR+PKLE Sbjct: 33 STATTTNQLNSNLVCLKTKEDDCKCDNTTDTHGSHNEFSHDVEKIFRILKKFHSRLPKLE 92 Query: 352 LALQESGIIVRSGLVERVLNRCGDAGNLGFRFFNWASKQPGYRHSYEVYKSMIKILGKMR 531 LALQ SG+++R GL ERV+NRCGDAGNLG+R++ WASKQP Y HSY+VY+++IK L KMR Sbjct: 93 LALQHSGVVLRPGLTERVINRCGDAGNLGYRYYMWASKQPNYVHSYDVYRALIKSLSKMR 152 Query: 532 QFGAVWALIEEMRKDNPQFLTVDAFVVLMRRFASARMVQKAIEVLDEMPKYGCKPDEYVF 711 +FGAVWAL+EEMRK+ PQ +T + FV+LMRRFASARMV+KAIEVLDEMPKYGC+PDE+VF Sbjct: 153 KFGAVWALMEEMRKEKPQLITTEVFVILMRRFASARMVKKAIEVLDEMPKYGCEPDEFVF 212 Query: 712 GCLLDALCKNGSVKEAASLFEDMRIKFKPNVKHFTSLLYGWCKVGKLMEAKFILVQMKEA 891 GCLLDALCKN SVKEAA LF+++R +FKP+++HFTSLLYGWCK GKL+EAK++LVQMK+A Sbjct: 213 GCLLDALCKNSSVKEAAKLFDEIRERFKPSLRHFTSLLYGWCKEGKLVEAKYVLVQMKDA 272 Query: 892 GFEPDIVVYNNLLSGYALAGKMQDAFDLLQEMKRKGCDPNATSYTILIQALCSREKMEEA 1071 GFEPDIVVYNNLLSGYA GKM DAF+LL+EM+RKGCDPNA SYT+LIQALC EKMEEA Sbjct: 273 GFEPDIVVYNNLLSGYAQMGKMTDAFELLKEMRRKGCDPNANSYTVLIQALCRMEKMEEA 332 Query: 1072 MRVFVEMRMNGCVADVVTYTTLISGFCKWGKIDKGYELLDSMIQQGCTPNQMTYFHVLVA 1251 R FVEM +GC ADVVTYTTLISGFCK KID+ YE+LDSMIQ+G PNQ+TY H+++A Sbjct: 333 NRAFVEMERSGCEADVVTYTTLISGFCKSRKIDRCYEILDSMIQRGILPNQLTYLHIMLA 392 Query: 1252 HXXXXXXXXXXXXXXXXXXIDCIPDLNIYNTVIRLACKLGEVEEGIRAWNDMEANGFSPG 1431 H I C+PD++ YN VIRLACKLGE++E + WN+MEA SPG Sbjct: 393 HEKKEELEECVELMGEMRKIGCVPDVSNYNVVIRLACKLGELKEAVNVWNEMEAASLSPG 452 Query: 1432 LDTFTIMIHGFLGQGRLIEACSYFKEMVGRGLLSAPQYGTLKELLNSVLRAEKLEMAKDV 1611 D+F +M+HGFLGQG LIEAC YFKEMVGRGLLSAPQYGTLK LLNS+LRA+K+EMAKDV Sbjct: 453 TDSFVVMVHGFLGQGCLIEACEYFKEMVGRGLLSAPQYGTLKALLNSLLRAQKVEMAKDV 512 Query: 1612 WSCIVSKGCDINVFAWTIWIHALFSKGHVKEACSYCLDMLDNGVMPQPDTFAKLMRGLRK 1791 WSCIV+KGC++NV+AWTIWIH+LFS GHVKEACSYCLDM+D VMPQPDTFAKLMRGL+K Sbjct: 513 WSCIVTKGCELNVYAWTIWIHSLFSNGHVKEACSYCLDMMDADVMPQPDTFAKLMRGLKK 572 Query: 1792 LYNRQIAAEITEKVRKMAADRQMTFKMYKRRGERDLKEKVKMKKDGRKRRARR 1950 LYNRQIAAEITEKVRKMAA+RQ+TFKMYKRRGERDLKEK K + DGRKRRAR+ Sbjct: 573 LYNRQIAAEITEKVRKMAAERQITFKMYKRRGERDLKEKAKKQVDGRKRRARQ 625 >ref|XP_003546958.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like isoform X1 [Glycine max] gi|571514894|ref|XP_006597171.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like isoform X2 [Glycine max] gi|571514897|ref|XP_006597172.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like isoform X3 [Glycine max] Length = 654 Score = 882 bits (2280), Expect = 0.0 Identities = 420/566 (74%), Positives = 488/566 (86%), Gaps = 1/566 (0%) Frame = +1 Query: 256 YSNHQNSDEYSADVEMVYRILRKFHSRVPKLELALQESGIIVRSGLVERVLNRCGDAGNL 435 +++ DE+++DVE VYRILRK+HSRVPKLELAL+ESG++VR GL ERVL+RCGDAGNL Sbjct: 79 HTDDHTHDEFASDVEKVYRILRKYHSRVPKLELALRESGVVVRPGLTERVLSRCGDAGNL 138 Query: 436 GFRFFNWASKQPGYRHSYEVYKSMIKILGKMRQFGAVWALIEEMRKDNPQFLTVDAFVVL 615 +RF++WASKQ G+R ++ YK+MIK+L +MRQFGAVWALIEEMR++NP +T FV+L Sbjct: 139 AYRFYSWASKQSGHRLDHDAYKAMIKVLSRMRQFGAVWALIEEMRQENPHLITPQVFVIL 198 Query: 616 MRRFASARMVQKAIEVLDEMPKYGCKPDEYVFGCLLDALCKNGSVKEAASLFEDMRIKFK 795 MRRFASARMV KA+EVLDEMPKYGC+PDEYVFGCLLDALCKNGSVKEAASLFEDMR ++K Sbjct: 199 MRRFASARMVHKAVEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAASLFEDMRYRWK 258 Query: 796 PNVKHFTSLLYGWCKVGKLMEAKFILVQMKEAGFEPDIVVYNNLLSGYALAGKMQDAFDL 975 P+VKHFTSLLYGWCK GKLMEAK +LVQMK+ G EPDIVVYNNLL GYA AGKM DA+DL Sbjct: 259 PSVKHFTSLLYGWCKEGKLMEAKHVLVQMKDMGIEPDIVVYNNLLGGYAQAGKMGDAYDL 318 Query: 976 LQEMKRKGCDPNATSYTILIQALCSREKMEEAMRVFVEMRMNGCVADVVTYTTLISGFCK 1155 L+EM+RK C+PNATSYT+LIQ+LC E++EEA R+FVEM+ NGC ADVVTY+TLISGFCK Sbjct: 319 LKEMRRKRCEPNATSYTVLIQSLCKHERLEEATRLFVEMQTNGCQADVVTYSTLISGFCK 378 Query: 1156 WGKIDKGYELLDSMIQQGCTPNQMTYFHVLVAHXXXXXXXXXXXXXXXXXXIDCIPDLNI 1335 WGKI +GYELLD MIQQG PNQ+ Y H+++AH I C PDL+I Sbjct: 379 WGKIKRGYELLDEMIQQGHFPNQVIYQHIMLAHEKKEELEECKELVNEMQKIGCAPDLSI 438 Query: 1336 YNTVIRLACKLGEVEEGIRAWNDMEANGFSPGLDTFTIMIHGFLGQGRLIEACSYFKEMV 1515 YNTVIRLACKLGEV+EGI+ WN+ME++G SPG+DTF IMI+GFL QG L+EAC YFKEMV Sbjct: 439 YNTVIRLACKLGEVKEGIQLWNEMESSGLSPGMDTFVIMINGFLEQGCLVEACEYFKEMV 498 Query: 1516 GRGLLSAPQYGTLKELLNSVLRAEKLEMAKDVWSCI-VSKGCDINVFAWTIWIHALFSKG 1692 GRGL +APQYGTLKEL+NS+LRAEKLEMAKD W+CI SKGC +NV AWTIWIHALFSKG Sbjct: 499 GRGLFTAPQYGTLKELMNSLLRAEKLEMAKDAWNCITASKGCQLNVSAWTIWIHALFSKG 558 Query: 1693 HVKEACSYCLDMLDNGVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVRKMAADRQMTFKM 1872 HVKEACS+C+DM+D +MP PDTFAKLM GL+KLYNRQ AAEITEKVRKMAADRQ+TFKM Sbjct: 559 HVKEACSFCIDMMDKDLMPNPDTFAKLMHGLKKLYNRQFAAEITEKVRKMAADRQITFKM 618 Query: 1873 YKRRGERDLKEKVKMKKDGRKRRARR 1950 YKRRGERDLKEK K KKDGRKRRAR+ Sbjct: 619 YKRRGERDLKEKAKEKKDGRKRRARQ 644 >ref|XP_007047616.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao] gi|508699877|gb|EOX91773.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao] Length = 647 Score = 880 bits (2273), Expect = 0.0 Identities = 418/605 (69%), Positives = 499/605 (82%), Gaps = 2/605 (0%) Frame = +1 Query: 142 VPIENNEDESMVGTRISNLSTDRNRGSRFICLENK-PNIYS-NHQNSDEYSADVEMVYRI 315 +P NN + + ++ LS++ G + LE K P + S N Q +D++++DVE +YRI Sbjct: 32 LPDNNNNNNN--SNSLNLLSSNSKSGFGLVTLETKQPTLKSDNDQQTDDFASDVEKIYRI 89 Query: 316 LRKFHSRVPKLELALQESGIIVRSGLVERVLNRCGDAGNLGFRFFNWASKQPGYRHSYEV 495 LRKFH+RVPKL LALQ+SG++ R GL ERVLNRCGDAGNLG++FF WASKQPGY SYE+ Sbjct: 90 LRKFHTRVPKLNLALQQSGVVFRPGLTERVLNRCGDAGNLGYKFFTWASKQPGYHPSYEI 149 Query: 496 YKSMIKILGKMRQFGAVWALIEEMRKDNPQFLTVDAFVVLMRRFASARMVQKAIEVLDEM 675 YK+MIKILGKMRQFGAVWALIEE++++NP F+T + F++L+RRFAS+RMV+KAIEV DEM Sbjct: 150 YKAMIKILGKMRQFGAVWALIEEIKRENPHFITAELFILLIRRFASSRMVKKAIEVFDEM 209 Query: 676 PKYGCKPDEYVFGCLLDALCKNGSVKEAASLFEDMRIKFKPNVKHFTSLLYGWCKVGKLM 855 PKYGC D+ VFG LLDALCKNG+VKEAA +FE+MR++F PN+KHFTSLLYGWCK G+++ Sbjct: 210 PKYGCLQDDAVFGSLLDALCKNGNVKEAALVFEEMRVRFLPNLKHFTSLLYGWCKEGRIL 269 Query: 856 EAKFILVQMKEAGFEPDIVVYNNLLSGYALAGKMQDAFDLLQEMKRKGCDPNATSYTILI 1035 EAK +LVQMKEAGFEPDIVV+NNLLSGY L KM DAFDLL+EM++KG DPNA SYTI+I Sbjct: 270 EAKHVLVQMKEAGFEPDIVVFNNLLSGYVLGNKMGDAFDLLKEMRKKGIDPNANSYTIVI 329 Query: 1036 QALCSREKMEEAMRVFVEMRMNGCVADVVTYTTLISGFCKWGKIDKGYELLDSMIQQGCT 1215 Q LC ++MEEAMRVFV+M NGC DVV YTTLISGFCKWG+++KGYE+LD MI +G Sbjct: 330 QGLCKADRMEEAMRVFVDMERNGCRGDVVVYTTLISGFCKWGRVEKGYEVLDRMISEGLM 389 Query: 1216 PNQMTYFHVLVAHXXXXXXXXXXXXXXXXXXIDCIPDLNIYNTVIRLACKLGEVEEGIRA 1395 PN +TY H+++AH I C+PD IYN V+RLACKL EV+E R Sbjct: 390 PNSLTYLHIMLAHEKKDELEECLELMEEMRKIGCVPDGGIYNVVVRLACKLEEVKEAARV 449 Query: 1396 WNDMEANGFSPGLDTFTIMIHGFLGQGRLIEACSYFKEMVGRGLLSAPQYGTLKELLNSV 1575 WN+ME GFSPG+D F +MIHGF+GQG L+EAC YFKEM GRGL PQYG LK+LLNS+ Sbjct: 450 WNEMEGRGFSPGVDNFIVMIHGFIGQGCLVEACEYFKEMAGRGLFCVPQYGILKDLLNSL 509 Query: 1576 LRAEKLEMAKDVWSCIVSKGCDINVFAWTIWIHALFSKGHVKEACSYCLDMLDNGVMPQP 1755 LRAEKLEMAK+VWSCIVSKGC++NV AWTIW+HALFSKGHVKEACSYCL+M+D VMPQP Sbjct: 510 LRAEKLEMAKNVWSCIVSKGCELNVSAWTIWVHALFSKGHVKEACSYCLEMMDVDVMPQP 569 Query: 1756 DTFAKLMRGLRKLYNRQIAAEITEKVRKMAADRQMTFKMYKRRGERDLKEKVKMKKDGRK 1935 DTFAKLMRGLRKLYNRQIAAEITEKVRKMAADR++TFKMYKRRG+RDLKEKVK K DGRK Sbjct: 570 DTFAKLMRGLRKLYNRQIAAEITEKVRKMAADREITFKMYKRRGQRDLKEKVKEKADGRK 629 Query: 1936 RRARR 1950 RRARR Sbjct: 630 RRARR 634 >ref|XP_006595472.1| PREDICTED: putative pentatricopeptide repeat-containing protein At5g65820-like, partial [Glycine max] Length = 656 Score = 877 bits (2267), Expect = 0.0 Identities = 420/576 (72%), Positives = 492/576 (85%), Gaps = 1/576 (0%) Frame = +1 Query: 226 FICLENKPNIYSNHQNSDEYSADVEMVYRILRKFHSRVPKLELALQESGIIVRSGLVERV 405 FI L+ +++ Q DE+++DVE VYRILRK+HSRVPKLELAL+ESG++VR GL ERV Sbjct: 71 FIRLQEISINHTDDQTHDEFASDVEKVYRILRKYHSRVPKLELALRESGVVVRPGLTERV 130 Query: 406 LNRCGDAGNLGFRFFNWASKQPGYRHSYEVYKSMIKILGKMRQFGAVWALIEEMRKDNPQ 585 LNRCGDAGNL +RF++WASKQ G+R ++ YK+MIK+L +MRQFGAVWALIEEMR++NP Sbjct: 131 LNRCGDAGNLAYRFYSWASKQSGHRLDHDAYKAMIKVLSRMRQFGAVWALIEEMRQENPH 190 Query: 586 FLTVDAFVVLMRRFASARMVQKAIEVLDEMPKYGCKPDEYVFGCLLDALCKNGSVKEAAS 765 +T FV+LMRRFASARMV KA++VLDEMP YGC+PDEYVFGCLLDAL KNGSVKEAAS Sbjct: 191 LITPQVFVILMRRFASARMVHKAVQVLDEMPNYGCEPDEYVFGCLLDALRKNGSVKEAAS 250 Query: 766 LFEDMRIKFKPNVKHFTSLLYGWCKVGKLMEAKFILVQMKEAGFEPDIVVYNNLLSGYAL 945 LFE++R ++KP+VKHFTSLLYGWCK GKLMEAK +LVQMK+AG EPDIVVYNNLL GYA Sbjct: 251 LFEELRYRWKPSVKHFTSLLYGWCKEGKLMEAKHVLVQMKDAGIEPDIVVYNNLLGGYAQ 310 Query: 946 AGKMQDAFDLLQEMKRKGCDPNATSYTILIQALCSREKMEEAMRVFVEMRMNGCVADVVT 1125 A KM DA+DLL+EM+RKGC+PNATSYT+LIQ+LC E++EEA RVFVEM+ NGC AD+VT Sbjct: 311 ADKMGDAYDLLKEMRRKGCEPNATSYTVLIQSLCKHERLEEATRVFVEMQRNGCQADLVT 370 Query: 1126 YTTLISGFCKWGKIDKGYELLDSMIQQGCTPNQMTYFHVLVAHXXXXXXXXXXXXXXXXX 1305 Y+TLISGFCKWGKI +GYELLD MIQQG PNQ+ Y H++VAH Sbjct: 371 YSTLISGFCKWGKIKRGYELLDEMIQQGHFPNQVIYQHIMVAHEKKEELEECKELVNEMQ 430 Query: 1306 XIDCIPDLNIYNTVIRLACKLGEVEEGIRAWNDMEANGFSPGLDTFTIMIHGFLGQGRLI 1485 I C PDL+IYNTVIRLACKLGEV+EG+R WN+ME++G SP +DTF IMI+GFL QG L+ Sbjct: 431 KIGCAPDLSIYNTVIRLACKLGEVKEGVRLWNEMESSGLSPSIDTFVIMINGFLEQGCLV 490 Query: 1486 EACSYFKEMVGRGLLSAPQYGTLKELLNSVLRAEKLEMAKDVWSCI-VSKGCDINVFAWT 1662 EAC YFKEMVGRGL +APQYGTLKEL+NS+LRAEKLEMAKD W+CI SKGC +NV AWT Sbjct: 491 EACEYFKEMVGRGLFAAPQYGTLKELMNSLLRAEKLEMAKDAWNCITASKGCQLNVSAWT 550 Query: 1663 IWIHALFSKGHVKEACSYCLDMLDNGVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVRKM 1842 IWIHALFSKGHVKEACS+C+ M+D +MPQPDTFAKLMRGL+KLYNR+ AAEITEKVRKM Sbjct: 551 IWIHALFSKGHVKEACSFCIAMMDKDLMPQPDTFAKLMRGLKKLYNREFAAEITEKVRKM 610 Query: 1843 AADRQMTFKMYKRRGERDLKEKVKMKKDGRKRRARR 1950 AADR++TFKMYKRRGERDLKEK K KKDGRKRRAR+ Sbjct: 611 AADRKITFKMYKRRGERDLKEKAKEKKDGRKRRARQ 646 >gb|EYU29622.1| hypothetical protein MIMGU_mgv1a023801mg [Mimulus guttatus] Length = 601 Score = 846 bits (2185), Expect = 0.0 Identities = 403/571 (70%), Positives = 476/571 (83%) Frame = +1 Query: 238 ENKPNIYSNHQNSDEYSADVEMVYRILRKFHSRVPKLELALQESGIIVRSGLVERVLNRC 417 + P + D + ADVE VYRILRKFHSRVPKLELALQ SG++VRSGL ERVLNRC Sbjct: 18 QESPTEREIQEPDDYFFADVEKVYRILRKFHSRVPKLELALQGSGVVVRSGLTERVLNRC 77 Query: 418 GDAGNLGFRFFNWASKQPGYRHSYEVYKSMIKILGKMRQFGAVWALIEEMRKDNPQFLTV 597 GDAGNLG+RFF WASKQPGYRH+ +VYKSMIKIL KMRQFGAVWALIEEMRK++P L+ Sbjct: 78 GDAGNLGYRFFVWASKQPGYRHNRDVYKSMIKILAKMRQFGAVWALIEEMRKESPHLLSP 137 Query: 598 DAFVVLMRRFASARMVQKAIEVLDEMPKYGCKPDEYVFGCLLDALCKNGSVKEAASLFED 777 + FV+LMRRFASARMV+KA+EVLDEMPKYGC+PDEY FGCLLDALCKNGSVKEAA LFED Sbjct: 138 EVFVILMRRFASARMVKKAVEVLDEMPKYGCEPDEYAFGCLLDALCKNGSVKEAALLFED 197 Query: 778 MRIKFKPNVKHFTSLLYGWCKVGKLMEAKFILVQMKEAGFEPDIVVYNNLLSGYALAGKM 957 M+I+F+P +KHFTSLLYGWCK GKL+EAK +LV+M+EAGFEPD+VVYNNLL+GY++AGKM Sbjct: 198 MKIRFEPTIKHFTSLLYGWCKEGKLIEAKVVLVKMREAGFEPDLVVYNNLLNGYSVAGKM 257 Query: 958 QDAFDLLQEMKRKGCDPNATSYTILIQALCSREKMEEAMRVFVEMRMNGCVADVVTYTTL 1137 DA LL EM+R G +PNATSYTI+IQALC REKMEEA RVF EM NGC ADVVTYTTL Sbjct: 258 ADASHLLVEMRRNGVEPNATSYTIMIQALCGREKMEEATRVFSEMEKNGCEADVVTYTTL 317 Query: 1138 ISGFCKWGKIDKGYELLDSMIQQGCTPNQMTYFHVLVAHXXXXXXXXXXXXXXXXXXIDC 1317 ISGFCKWGKI K +ELL++MI++G PN TY + ++AH I Sbjct: 318 ISGFCKWGKIKKAHELLEAMIRKGHIPNATTYLYFMLAHEKKEELEECLELVNEMKKIRV 377 Query: 1318 IPDLNIYNTVIRLACKLGEVEEGIRAWNDMEANGFSPGLDTFTIMIHGFLGQGRLIEACS 1497 PDL IYNT++RL+CKLGE+E GIR N++E NG +PG+DT+ I+I G + Q RL+EAC Sbjct: 378 SPDLFIYNTILRLSCKLGEIESGIRIMNELEENGITPGVDTYIILIGGLVEQARLVEACD 437 Query: 1498 YFKEMVGRGLLSAPQYGTLKELLNSVLRAEKLEMAKDVWSCIVSKGCDINVFAWTIWIHA 1677 YF+EMV RGL SAPQYG +K+LLNS+LR +KL++AKD W CI+ KGC++NV AWTIWIHA Sbjct: 438 YFQEMVERGLFSAPQYGVMKDLLNSLLRDDKLQLAKDAWGCIIEKGCEVNVSAWTIWIHA 497 Query: 1678 LFSKGHVKEACSYCLDMLDNGVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVRKMAADRQ 1857 LFS GHVK+ACSYCLDM+++G MP+PDTF+KLM+GL+KLYNR+IA EITEKVRKMA +R Sbjct: 498 LFSNGHVKDACSYCLDMMESGEMPKPDTFSKLMKGLKKLYNREIAVEITEKVRKMAEERN 557 Query: 1858 MTFKMYKRRGERDLKEKVKMKKDGRKRRARR 1950 +TFKMYKRRGERDLKEK K KKDGRKRRAR+ Sbjct: 558 ITFKMYKRRGERDLKEKDKAKKDGRKRRARQ 588 >ref|XP_002530608.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223529856|gb|EEF31788.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 596 Score = 833 bits (2151), Expect = 0.0 Identities = 392/551 (71%), Positives = 466/551 (84%), Gaps = 4/551 (0%) Frame = +1 Query: 196 LSTDRNRGSRFICLENKPNIYSNHQNS----DEYSADVEMVYRILRKFHSRVPKLELALQ 363 LS + G +CL+ + N S+ NS DE++ DVE VYRILR FHSRVPKLELALQ Sbjct: 39 LSNNLRNGFGVVCLKTQENNTSDRDNSSSKVDEFAKDVEKVYRILRNFHSRVPKLELALQ 98 Query: 364 ESGIIVRSGLVERVLNRCGDAGNLGFRFFNWASKQPGYRHSYEVYKSMIKILGKMRQFGA 543 ESG+ +R+GL ERVLNRCGDAGNLG+RFF WASKQPGYRHSYE YK+M+KI KMRQFGA Sbjct: 99 ESGVTMRAGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSYENYKAMVKIFSKMRQFGA 158 Query: 544 VWALIEEMRKDNPQFLTVDAFVVLMRRFASARMVQKAIEVLDEMPKYGCKPDEYVFGCLL 723 VWAL+EEMRKDN +T + F+VL+RRFASAR+V+KAIEVLDEMPKYGC+PDEYVFGCLL Sbjct: 159 VWALLEEMRKDNSVLITSELFIVLIRRFASARLVEKAIEVLDEMPKYGCEPDEYVFGCLL 218 Query: 724 DALCKNGSVKEAASLFEDMRIKFKPNVKHFTSLLYGWCKVGKLMEAKFILVQMKEAGFEP 903 DALCKNGSVK+AASLFEDMR++F P+++HFTSLLYGWC+ GKL+EAK +LVQM+EAGFEP Sbjct: 219 DALCKNGSVKQAASLFEDMRVRFSPSLRHFTSLLYGWCREGKLIEAKHVLVQMREAGFEP 278 Query: 904 DIVVYNNLLSGYALAGKMQDAFDLLQEMKRKGCDPNATSYTILIQALCSREKMEEAMRVF 1083 DIVV+NNLLS Y++AGKM DAFDLL+EM RKGC+PNA SYTI+IQA CS+EKM+EAMRVF Sbjct: 279 DIVVFNNLLSAYSMAGKMTDAFDLLKEMVRKGCEPNANSYTIMIQAFCSQEKMDEAMRVF 338 Query: 1084 VEMRMNGCVADVVTYTTLISGFCKWGKIDKGYELLDSMIQQGCTPNQMTYFHVLVAHXXX 1263 VEM GC ADVVTYT LISGFCKWGKI++GY++LD+M Q+G PNQ+TY +L+AH Sbjct: 339 VEMERTGCEADVVTYTALISGFCKWGKINRGYQILDAMKQKGHMPNQLTYLRILLAHEKK 398 Query: 1264 XXXXXXXXXXXXXXXIDCIPDLNIYNTVIRLACKLGEVEEGIRAWNDMEANGFSPGLDTF 1443 + C+PDL+IYN VIRLACKLGEV++G++ WN+MEA+ FSP LDTF Sbjct: 399 EELEECLELIESMRMVGCVPDLSIYNVVIRLACKLGEVKQGVQIWNEMEASDFSPELDTF 458 Query: 1444 TIMIHGFLGQGRLIEACSYFKEMVGRGLLSAPQYGTLKELLNSVLRAEKLEMAKDVWSCI 1623 IMIHGFLGQG L+EAC YFKEM+GRGLL+ PQYG LKELLN++LR EKL MAKDVWSCI Sbjct: 459 VIMIHGFLGQGCLVEACEYFKEMIGRGLLTTPQYGILKELLNALLRGEKLGMAKDVWSCI 518 Query: 1624 VSKGCDINVFAWTIWIHALFSKGHVKEACSYCLDMLDNGVMPQPDTFAKLMRGLRKLYNR 1803 V+KGC++N AWTIWIH+LFS GHVKEACSYCLDM++ +MP+P+TFAKLMRGLRKLYNR Sbjct: 519 VTKGCELNADAWTIWIHSLFSNGHVKEACSYCLDMMEADIMPKPETFAKLMRGLRKLYNR 578 Query: 1804 QIAAEITEKVR 1836 + AAEITEK++ Sbjct: 579 EFAAEITEKIK 589 >ref|XP_006404107.1| hypothetical protein EUTSA_v10010190mg [Eutrema salsugineum] gi|557105226|gb|ESQ45560.1| hypothetical protein EUTSA_v10010190mg [Eutrema salsugineum] Length = 645 Score = 813 bits (2101), Expect = 0.0 Identities = 396/586 (67%), Positives = 473/586 (80%), Gaps = 5/586 (0%) Frame = +1 Query: 199 STDRNRGSRFICLENKPNIYSNHQNSDEYSADVEMVYRILRKFHSRVPKLELALQESGII 378 S +R G+ +C E + Q DE++ DVE +YRILR +HSRVPKLEL L ESGI Sbjct: 48 SAERINGAGLVCPEKR-------QQEDEFAGDVEKIYRILRNYHSRVPKLELVLHESGIN 100 Query: 379 VRSGLVERVLNRCGDAGNLGFRFFNWASKQPGYRHSYEVYKSMIKILGKMRQFGAVWALI 558 +R GL+ RVL+RCGDAGNLG+RFF WA+KQPGY HSYEV KSM+KIL KMRQFGAVWALI Sbjct: 101 LRPGLIVRVLSRCGDAGNLGYRFFLWAAKQPGYCHSYEVCKSMVKILSKMRQFGAVWALI 160 Query: 559 EEMRKDNPQFLTVDAFVVLMRRFASARMVQKAIEVLDEMPKYGCKPDEYVFGCLLDALCK 738 EEMRK+NPQ + + FVVLMRRFASA MV+KA+EVLDEMPKYG +PDEY+FGCLLDALCK Sbjct: 161 EEMRKENPQLIEPELFVVLMRRFASANMVKKAVEVLDEMPKYGIEPDEYIFGCLLDALCK 220 Query: 739 NGSVKEAASLFEDMRIKFKPNVKHFTSLLYGWCKVGKLMEAKFILVQMKEAGFEPDIVVY 918 NGSVK+A+ LFEDMR KF PN+++FTSLLYGWC+ GKL+EAK +LVQMKEAG EPDIVV+ Sbjct: 221 NGSVKDASKLFEDMRDKFPPNLRYFTSLLYGWCREGKLIEAKHVLVQMKEAGLEPDIVVF 280 Query: 919 NNLLSGYALAGKMQDAFDLLQEMKRKGCDPNATSYTILIQALCSREK-MEEAMRVFVEMR 1095 NLLSGYA AGKM DA+DL+++M+R+G +PNA YT+LIQALC EK M+EAMRVFVEM Sbjct: 281 TNLLSGYAHAGKMADAYDLMKDMRRRGYEPNANCYTVLIQALCKMEKRMDEAMRVFVEME 340 Query: 1096 MNGCVADVVTYTTLISGFCKWGKIDKGYELLDSMIQQGCTPNQMTYFHVLVAHXXXXXXX 1275 GC AD+VTYT LISGFCKWG IDKGY +LD M ++G P Q+TY ++VAH Sbjct: 341 RYGCEADIVTYTALISGFCKWGMIDKGYSVLDDMRKKGVMPLQVTYMQIMVAHEKKEQFE 400 Query: 1276 XXXXXXXXXXXIDCIPDLNIYNTVIRLACKLGEVEEGIRAWNDMEANGFSPGLDTFTIMI 1455 C+PDL IYN VIRLACKLGEV+E +R WN+MEANG SPG+DTF IMI Sbjct: 401 ECLDLIEKMKQNGCLPDLLIYNVVIRLACKLGEVKEAVRLWNEMEANGLSPGVDTFVIMI 460 Query: 1456 HGFLGQGRLIEACSYFKEMVGRGLLSAPQYGTLKELLNSVLRAEKLEMAKDVWSCIVSK- 1632 +GF QG LIEAC +FKEMV RG+ SAP YGTLK LLN+++R +KLEMAKDVWSC+ +K Sbjct: 461 NGFASQGCLIEACDHFKEMVSRGIFSAPHYGTLKILLNTLVRDDKLEMAKDVWSCLSNKS 520 Query: 1633 -GCDINVFAWTIWIHALFSKGHVKEACSYCLDMLDNGVMPQPDTFAKLMRGLRKLYNRQI 1809 C++NV AWTIWIHALF++GHVKEACSYCLDM++ +MPQPDT+AKLM+GL KLYNR I Sbjct: 521 SSCELNVSAWTIWIHALFARGHVKEACSYCLDMMEMDLMPQPDTYAKLMKGLNKLYNRTI 580 Query: 1810 AAEITEKVRKMAADRQMTFKMYKRRGERDLKEKVKMK--KDGRKRR 1941 AAEITEKVRKMA++R+M+FKMYKRRGE DL EK K K K+G+K++ Sbjct: 581 AAEITEKVRKMASEREMSFKMYKRRGEEDLIEKAKPKGNKEGKKKK 626 >ref|NP_201383.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75170571|sp|Q9FH87.1|PP447_ARATH RecName: Full=Putative pentatricopeptide repeat-containing protein At5g65820 gi|9758569|dbj|BAB09050.1| unnamed protein product [Arabidopsis thaliana] gi|332010728|gb|AED98111.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 637 Score = 805 bits (2080), Expect = 0.0 Identities = 390/584 (66%), Positives = 472/584 (80%), Gaps = 3/584 (0%) Frame = +1 Query: 208 RNRGSRFICLENKPNIYSNHQNSDEYSADVEMVYRILRKFHSRVPKLELALQESGIIVRS 387 R+ G +CLE N + + DE+++DVE YRILRKFHSRVPKLELAL ESG+ +R Sbjct: 54 RSNGIGLVCLEKSHNDRTKNSKYDEFASDVEKSYRILRKFHSRVPKLELALNESGVELRP 113 Query: 388 GLVERVLNRCGDAGNLGFRFFNWASKQPGYRHSYEVYKSMIKILGKMRQFGAVWALIEEM 567 GL+ERVLNRCGDAGNLG+RFF WA+KQP Y HS EVYKSM+KIL KMRQFGAVW LIEEM Sbjct: 114 GLIERVLNRCGDAGNLGYRFFVWAAKQPRYCHSIEVYKSMVKILSKMRQFGAVWGLIEEM 173 Query: 568 RKDNPQFLTVDAFVVLMRRFASARMVQKAIEVLDEMPKYGCKPDEYVFGCLLDALCKNGS 747 RK+NPQ + + FVVL++RFASA MV+KAIEVLDEMPK+G +PDEYVFGCLLDALCK+GS Sbjct: 174 RKENPQLIEPELFVVLVQRFASADMVKKAIEVLDEMPKFGFEPDEYVFGCLLDALCKHGS 233 Query: 748 VKEAASLFEDMRIKFKPNVKHFTSLLYGWCKVGKLMEAKFILVQMKEAGFEPDIVVYNNL 927 VK+AA LFEDMR++F N+++FTSLLYGWC+VGK+MEAK++LVQM EAGFEPDIV Y NL Sbjct: 234 VKDAAKLFEDMRMRFPVNLRYFTSLLYGWCRVGKMMEAKYVLVQMNEAGFEPDIVDYTNL 293 Query: 928 LSGYALAGKMQDAFDLLQEMKRKGCDPNATSYTILIQALCSREKMEEAMRVFVEMRMNGC 1107 LSGYA AGKM DA+DLL++M+R+G +PNA YT+LIQALC ++MEEAM+VFVEM C Sbjct: 294 LSGYANAGKMADAYDLLRDMRRRGFEPNANCYTVLIQALCKVDRMEEAMKVFVEMERYEC 353 Query: 1108 VADVVTYTTLISGFCKWGKIDKGYELLDSMIQQGCTPNQMTYFHVLVAHXXXXXXXXXXX 1287 ADVVTYT L+SGFCKWGKIDK Y +LD MI++G P+++TY H++VAH Sbjct: 354 EADVVTYTALVSGFCKWGKIDKCYIVLDDMIKKGLMPSELTYMHIMVAHEKKESFEECLE 413 Query: 1288 XXXXXXXIDCIPDLNIYNTVIRLACKLGEVEEGIRAWNDMEANGFSPGLDTFTIMIHGFL 1467 I+ PD+ IYN VIRLACKLGEV+E +R WN+ME NG SPG+DTF IMI+G Sbjct: 414 LMEKMRQIEYHPDIGIYNVVIRLACKLGEVKEAVRLWNEMEENGLSPGVDTFVIMINGLA 473 Query: 1468 GQGRLIEACSYFKEMVGRGLLSAPQYGTLKELLNSVLRAEKLEMAKDVWSCIVSKG-CDI 1644 QG L+EA +FKEMV RGL S QYGTLK LLN+VL+ +KLEMAKDVWSCI SKG C++ Sbjct: 474 SQGCLLEASDHFKEMVTRGLFSVSQYGTLKLLLNTVLKDKKLEMAKDVWSCITSKGACEL 533 Query: 1645 NVFAWTIWIHALFSKGHVKEACSYCLDMLDNGVMPQPDTFAKLMRGLRKLYNRQIAAEIT 1824 NV +WTIWIHALFSKG+ KEACSYC++M++ MPQPDTFAKLM+GL+KLYNR+ A EIT Sbjct: 534 NVLSWTIWIHALFSKGYEKEACSYCIEMIEMDFMPQPDTFAKLMKGLKKLYNREFAGEIT 593 Query: 1825 EKVRKMAADRQMTFKMYKRRGERDLKEKVKMKKD--GRKRRARR 1950 EKVR MAA+R+M+FKMYKRRG +DL EK K K+D G+K++ R Sbjct: 594 EKVRNMAAEREMSFKMYKRRGVQDLTEKAKSKQDREGKKKQRSR 637 >gb|EPS62602.1| hypothetical protein M569_12187, partial [Genlisea aurea] Length = 593 Score = 800 bits (2066), Expect = 0.0 Identities = 396/588 (67%), Positives = 477/588 (81%), Gaps = 7/588 (1%) Frame = +1 Query: 208 RNRGSRFICLENKP-----NIYSNHQNSDEYSADVEMVYRILRKFHSRVPKLELALQESG 372 RNRG I +E ++ + SD++SADVE VY+ILRKF+S+VPKLELALQ SG Sbjct: 1 RNRGFDLIRIEEDEQQQDCSVGRRNNISDDFSADVEKVYKILRKFNSKVPKLELALQHSG 60 Query: 373 IIVRSGLVERVLNRCGDAGNLGFRFFNWASKQPGYRHSYEVYKSMIKILGKMRQFGAVWA 552 + VRSGL ERVLNRCGDAGNLG+RFF WASKQPGY HS++VYK+MI+ILGKMRQFGAVWA Sbjct: 61 VSVRSGLTERVLNRCGDAGNLGYRFFVWASKQPGYNHSHDVYKAMIRILGKMRQFGAVWA 120 Query: 553 LIEEMRKDNPQFLTVDAFVVLMRRFASARMVQKAIEVLDEMPKYGCKPDEYVFGCLLDAL 732 LIEEMRK+NPQ LT + F+VLMRRFASARMV+KA+EVLDEMP YGC+PDEYVFGCLLDAL Sbjct: 121 LIEEMRKENPQLLTPEVFIVLMRRFASARMVKKAVEVLDEMPSYGCEPDEYVFGCLLDAL 180 Query: 733 CKNGSVKEAASLFEDMRIKFKPNVKHFTSLLYGWCKVGKLMEAKFILVQMKEAGFEPDIV 912 CKNGSVKEA+ L EDM+++FKP +KHFTSLL+GWC+ GKL+EAK +L +M+EAGF PDIV Sbjct: 181 CKNGSVKEASLLMEDMQMRFKPTMKHFTSLLHGWCREGKLIEAKTVLQKMREAGFLPDIV 240 Query: 913 VYNNLLSGYALAGKMQDAFDLLQEMKRKGCDPNATSYTILIQALCSREKMEEAMRVFVEM 1092 VYN LL+GYA AGK+ DA LL EM+R C P ATSYT +I++LC+REKM EA+++F EM Sbjct: 241 VYNTLLAGYAAAGKIADARHLLLEMRRNSCRPTATSYTAVIRSLCAREKMAEAVQLFSEM 300 Query: 1093 RMNGCVADVVTYTTLISGFCKWGKIDKGYELLDSMIQQGCTPNQMTYFHVLVAHXXXXXX 1272 +GC ADVV YTTLISGFCK GK KGYELLD+MI++G TPN TY +++ AH Sbjct: 301 EADGCEADVVAYTTLISGFCKRGKTGKGYELLDAMIRKGITPNNTTYSYLISAHEKEEEL 360 Query: 1273 XXXXXXXXXXXXIDCIPDLNIYNTVIRLACKLGEVEEGIRAWNDMEANGFSPGLDTFTIM 1452 I PD +YN VIRL+CKLGEVE+GIR N+ME +G SPG+DTF I+ Sbjct: 361 EECLGLAKSMRQIGVTPDSAVYNPVIRLSCKLGEVEDGIRLMNEMEEDGISPGVDTFVIL 420 Query: 1453 IHGFLGQGRLIEACSYFKEMVGRGLLSAPQYGTLKELLNSVLRAEKLEMAKDVWS-CIVS 1629 I+G + G L EAC F+EMVGRGL++APQYG LK+LLNS+LR KL+++KDVWS + S Sbjct: 421 INGLILHGHLDEACLRFEEMVGRGLVAAPQYGLLKDLLNSLLRCGKLQLSKDVWSKMVTS 480 Query: 1630 KG-CDINVFAWTIWIHALFSKGHVKEACSYCLDMLDNGVMPQPDTFAKLMRGLRKLYNRQ 1806 KG CD+NV+AWTIWIHAL SKG+VKEAC Y L+M++ G+MPQPDTFAKL+RGLRKLYNR+ Sbjct: 481 KGCCDVNVYAWTIWIHALLSKGYVKEACFYGLEMMEAGLMPQPDTFAKLIRGLRKLYNRE 540 Query: 1807 IAAEITEKVRKMAADRQMTFKMYKRRGERDLKEKVKMKKDGRKRRARR 1950 IAAEITEKV++MAA+R +TFKMYKRRGERDLK+K K K DGRK RARR Sbjct: 541 IAAEITEKVKRMAAERHITFKMYKRRGERDLKDKTKAKVDGRKVRARR 588 >ref|XP_002866691.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297312526|gb|EFH42950.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 638 Score = 800 bits (2066), Expect = 0.0 Identities = 389/586 (66%), Positives = 472/586 (80%), Gaps = 5/586 (0%) Frame = +1 Query: 208 RNRGSRFICLENKPNIYSNHQNS--DEYSADVEMVYRILRKFHSRVPKLELALQESGIIV 381 R+ G +CLE N +NS DE+++DVE YRILRKFHSRVPKLELAL ESG+ + Sbjct: 53 RSNGIGLVCLEKNHNHNDRTKNSKYDEFASDVEKAYRILRKFHSRVPKLELALNESGVEL 112 Query: 382 RSGLVERVLNRCGDAGNLGFRFFNWASKQPGYRHSYEVYKSMIKILGKMRQFGAVWALIE 561 R GL+ERVLNRCGDAGNLG+RFF WA+KQP Y HS EVYKSM+KIL KMRQFGAVW LIE Sbjct: 113 RPGLIERVLNRCGDAGNLGYRFFVWAAKQPRYCHSIEVYKSMVKILSKMRQFGAVWGLIE 172 Query: 562 EMRKDNPQFLTVDAFVVLMRRFASARMVQKAIEVLDEMPKYGCKPDEYVFGCLLDALCKN 741 EMRK+NPQ + + FVVL++RFASA MV+KAIEVLDEMP +G +PDEYVFGCLLDALCK+ Sbjct: 173 EMRKENPQLIEPELFVVLVQRFASADMVKKAIEVLDEMPTFGLEPDEYVFGCLLDALCKH 232 Query: 742 GSVKEAASLFEDMRIKFKPNVKHFTSLLYGWCKVGKLMEAKFILVQMKEAGFEPDIVVYN 921 GSVK+AA LFEDMR++F N+++FTSLLYGWC+ K+MEAK++LVQMKEAGFEPDIV Y Sbjct: 233 GSVKDAAKLFEDMRLRFPVNLRYFTSLLYGWCREEKMMEAKYVLVQMKEAGFEPDIVDYT 292 Query: 922 NLLSGYALAGKMQDAFDLLQEMKRKGCDPNATSYTILIQALCSREKMEEAMRVFVEMRMN 1101 NLLSGYA AGKM DA+DLL++M+R+G +PNAT YT+LIQALC ++MEEAM+VFVEM Sbjct: 293 NLLSGYANAGKMADAYDLLKDMRRRGFEPNATCYTVLIQALCKVDRMEEAMKVFVEMERY 352 Query: 1102 GCVADVVTYTTLISGFCKWGKIDKGYELLDSMIQQGCTPNQMTYFHVLVAHXXXXXXXXX 1281 C ADVVTYT L+SGFCKWGKIDK Y +LD MI++G P+Q+TY H++ AH Sbjct: 353 ECEADVVTYTALVSGFCKWGKIDKCYLVLDDMIKKGLMPSQLTYMHIMAAHEKKEKLIEC 412 Query: 1282 XXXXXXXXXIDCIPDLNIYNTVIRLACKLGEVEEGIRAWNDMEANGFSPGLDTFTIMIHG 1461 I+ PD+ IYN VIRLACKLGEV+E +R WN+ME NG SPG DTF I+I+G Sbjct: 413 LELMEKMKQIEYHPDIGIYNVVIRLACKLGEVKEAVRLWNEMEGNGLSPGADTFVIIING 472 Query: 1462 FLGQGRLIEACSYFKEMVGRGLLSAPQYGTLKELLNSVLRAEKLEMAKDVWSCIVSKG-C 1638 QG L+EAC +FKEMV RGL S PQYGTLK LLN++L+ +KLEMAKDVWSCI SKG C Sbjct: 473 LTSQGCLLEACDHFKEMVARGLFSVPQYGTLKLLLNTLLKDKKLEMAKDVWSCITSKGSC 532 Query: 1639 DINVFAWTIWIHALFSKGHVKEACSYCLDMLDNGVMPQPDTFAKLMRGLRKLYNRQIAAE 1818 +++V +WTIWIHALFSKG+ KEACSYCL+M++ MPQPDTFAKLM+GL+KLY+R+ A E Sbjct: 533 ELSVSSWTIWIHALFSKGYEKEACSYCLEMIELEFMPQPDTFAKLMKGLKKLYHREFAVE 592 Query: 1819 ITEKVRKMAADRQMTFKMYKRRGERDLKEKVKMKKD--GRKRRARR 1950 ITEKVR MAA+++M+FKMYKRRG +DL EK K K+D G+K++ R Sbjct: 593 ITEKVRNMAAEKEMSFKMYKRRGVQDLTEKAKSKQDREGKKKQRTR 638 >ref|NP_190542.4| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|218546755|sp|P0C8A0.1|PP275_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At3g49730 gi|332645062|gb|AEE78583.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 638 Score = 798 bits (2061), Expect = 0.0 Identities = 391/585 (66%), Positives = 468/585 (80%), Gaps = 5/585 (0%) Frame = +1 Query: 199 STDRNRGSRFICLENKPNIYSNHQNSDEYSADVEMVYRILRKFHSRVPKLELALQESGII 378 ST+R G +C E ++ DE++ +VE +YRILR HSRVPKLELAL ESGI Sbjct: 44 STERKNGVGLVCPE---------KHEDEFAGEVEKIYRILRNHHSRVPKLELALNESGID 94 Query: 379 VRSGLVERVLNRCGDAGNLGFRFFNWASKQPGYRHSYEVYKSMIKILGKMRQFGAVWALI 558 +R GL+ RVL+RCGDAGNLG+RFF WA+KQPGY HSYEV KSM+ IL KMRQFGAVW LI Sbjct: 95 LRPGLIIRVLSRCGDAGNLGYRFFLWATKQPGYFHSYEVCKSMVMILSKMRQFGAVWGLI 154 Query: 559 EEMRKDNPQFLTVDAFVVLMRRFASARMVQKAIEVLDEMPKYGCKPDEYVFGCLLDALCK 738 EEMRK NP+ + + FVVLMRRFASA MV+KA+EVLDEMPKYG +PDEYVFGCLLDALCK Sbjct: 155 EEMRKTNPELIEPELFVVLMRRFASANMVKKAVEVLDEMPKYGLEPDEYVFGCLLDALCK 214 Query: 739 NGSVKEAASLFEDMRIKFKPNVKHFTSLLYGWCKVGKLMEAKFILVQMKEAGFEPDIVVY 918 NGSVKEA+ +FEDMR KF PN+++FTSLLYGWC+ GKLMEAK +LVQMKEAG EPDIVV+ Sbjct: 215 NGSVKEASKVFEDMREKFPPNLRYFTSLLYGWCREGKLMEAKEVLVQMKEAGLEPDIVVF 274 Query: 919 NNLLSGYALAGKMQDAFDLLQEMKRKGCDPNATSYTILIQALCSREK-MEEAMRVFVEMR 1095 NLLSGYA AGKM DA+DL+ +M+++G +PN YT+LIQALC EK M+EAMRVFVEM Sbjct: 275 TNLLSGYAHAGKMADAYDLMNDMRKRGFEPNVNCYTVLIQALCRTEKRMDEAMRVFVEME 334 Query: 1096 MNGCVADVVTYTTLISGFCKWGKIDKGYELLDSMIQQGCTPNQMTYFHVLVAHXXXXXXX 1275 GC AD+VTYT LISGFCKWG IDKGY +LD M ++G P+Q+TY ++VAH Sbjct: 335 RYGCEADIVTYTALISGFCKWGMIDKGYSVLDDMRKKGVMPSQVTYMQIMVAHEKKEQFE 394 Query: 1276 XXXXXXXXXXXIDCIPDLNIYNTVIRLACKLGEVEEGIRAWNDMEANGFSPGLDTFTIMI 1455 C PDL IYN VIRLACKLGEV+E +R WN+MEANG SPG+DTF IMI Sbjct: 395 ECLELIEKMKRRGCHPDLLIYNVVIRLACKLGEVKEAVRLWNEMEANGLSPGVDTFVIMI 454 Query: 1456 HGFLGQGRLIEACSYFKEMVGRGLLSAPQYGTLKELLNSVLRAEKLEMAKDVWSCIVSK- 1632 +GF QG LIEAC++FKEMV RG+ SAPQYGTLK LLN+++R +KLEMAKDVWSCI +K Sbjct: 455 NGFTSQGFLIEACNHFKEMVSRGIFSAPQYGTLKSLLNNLVRDDKLEMAKDVWSCISNKT 514 Query: 1633 -GCDINVFAWTIWIHALFSKGHVKEACSYCLDMLDNGVMPQPDTFAKLMRGLRKLYNRQI 1809 C++NV AWTIWIHAL++KGHVKEACSYCLDM++ +MPQP+T+AKLM+GL KLYNR I Sbjct: 515 SSCELNVSAWTIWIHALYAKGHVKEACSYCLDMMEMDLMPQPNTYAKLMKGLNKLYNRTI 574 Query: 1810 AAEITEKVRKMAADRQMTFKMYKRRGERDLKEKVKMK--KDGRKR 1938 AAEITEKV KMA++R+M+FKMYK++GE DL EK K K K+G+K+ Sbjct: 575 AAEITEKVVKMASEREMSFKMYKKKGEEDLIEKAKPKGNKEGKKK 619 >emb|CAB66911.1| putative protein [Arabidopsis thaliana] Length = 1184 Score = 798 bits (2061), Expect = 0.0 Identities = 391/585 (66%), Positives = 468/585 (80%), Gaps = 5/585 (0%) Frame = +1 Query: 199 STDRNRGSRFICLENKPNIYSNHQNSDEYSADVEMVYRILRKFHSRVPKLELALQESGII 378 ST+R G +C E ++ DE++ +VE +YRILR HSRVPKLELAL ESGI Sbjct: 44 STERKNGVGLVCPE---------KHEDEFAGEVEKIYRILRNHHSRVPKLELALNESGID 94 Query: 379 VRSGLVERVLNRCGDAGNLGFRFFNWASKQPGYRHSYEVYKSMIKILGKMRQFGAVWALI 558 +R GL+ RVL+RCGDAGNLG+RFF WA+KQPGY HSYEV KSM+ IL KMRQFGAVW LI Sbjct: 95 LRPGLIIRVLSRCGDAGNLGYRFFLWATKQPGYFHSYEVCKSMVMILSKMRQFGAVWGLI 154 Query: 559 EEMRKDNPQFLTVDAFVVLMRRFASARMVQKAIEVLDEMPKYGCKPDEYVFGCLLDALCK 738 EEMRK NP+ + + FVVLMRRFASA MV+KA+EVLDEMPKYG +PDEYVFGCLLDALCK Sbjct: 155 EEMRKTNPELIEPELFVVLMRRFASANMVKKAVEVLDEMPKYGLEPDEYVFGCLLDALCK 214 Query: 739 NGSVKEAASLFEDMRIKFKPNVKHFTSLLYGWCKVGKLMEAKFILVQMKEAGFEPDIVVY 918 NGSVKEA+ +FEDMR KF PN+++FTSLLYGWC+ GKLMEAK +LVQMKEAG EPDIVV+ Sbjct: 215 NGSVKEASKVFEDMREKFPPNLRYFTSLLYGWCREGKLMEAKEVLVQMKEAGLEPDIVVF 274 Query: 919 NNLLSGYALAGKMQDAFDLLQEMKRKGCDPNATSYTILIQALCSREK-MEEAMRVFVEMR 1095 NLLSGYA AGKM DA+DL+ +M+++G +PN YT+LIQALC EK M+EAMRVFVEM Sbjct: 275 TNLLSGYAHAGKMADAYDLMNDMRKRGFEPNVNCYTVLIQALCRTEKRMDEAMRVFVEME 334 Query: 1096 MNGCVADVVTYTTLISGFCKWGKIDKGYELLDSMIQQGCTPNQMTYFHVLVAHXXXXXXX 1275 GC AD+VTYT LISGFCKWG IDKGY +LD M ++G P+Q+TY ++VAH Sbjct: 335 RYGCEADIVTYTALISGFCKWGMIDKGYSVLDDMRKKGVMPSQVTYMQIMVAHEKKEQFE 394 Query: 1276 XXXXXXXXXXXIDCIPDLNIYNTVIRLACKLGEVEEGIRAWNDMEANGFSPGLDTFTIMI 1455 C PDL IYN VIRLACKLGEV+E +R WN+MEANG SPG+DTF IMI Sbjct: 395 ECLELIEKMKRRGCHPDLLIYNVVIRLACKLGEVKEAVRLWNEMEANGLSPGVDTFVIMI 454 Query: 1456 HGFLGQGRLIEACSYFKEMVGRGLLSAPQYGTLKELLNSVLRAEKLEMAKDVWSCIVSK- 1632 +GF QG LIEAC++FKEMV RG+ SAPQYGTLK LLN+++R +KLEMAKDVWSCI +K Sbjct: 455 NGFTSQGFLIEACNHFKEMVSRGIFSAPQYGTLKSLLNNLVRDDKLEMAKDVWSCISNKT 514 Query: 1633 -GCDINVFAWTIWIHALFSKGHVKEACSYCLDMLDNGVMPQPDTFAKLMRGLRKLYNRQI 1809 C++NV AWTIWIHAL++KGHVKEACSYCLDM++ +MPQP+T+AKLM+GL KLYNR I Sbjct: 515 SSCELNVSAWTIWIHALYAKGHVKEACSYCLDMMEMDLMPQPNTYAKLMKGLNKLYNRTI 574 Query: 1810 AAEITEKVRKMAADRQMTFKMYKRRGERDLKEKVKMK--KDGRKR 1938 AAEITEKV KMA++R+M+FKMYK++GE DL EK K K K+G+K+ Sbjct: 575 AAEITEKVVKMASEREMSFKMYKKKGEEDLIEKAKPKGNKEGKKK 619