BLASTX nr result
ID: Sinomenium21_contig00017854
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Sinomenium21_contig00017854 (2481 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003635394.1| PREDICTED: putative pentatricopeptide repeat... 917 0.0 gb|EXC12605.1| hypothetical protein L484_012982 [Morus notabilis] 908 0.0 emb|CAN71515.1| hypothetical protein VITISV_021787 [Vitis vinifera] 890 0.0 ref|XP_006426145.1| hypothetical protein CICLE_v10025134mg [Citr... 887 0.0 ref|XP_006466418.1| PREDICTED: putative pentatricopeptide repeat... 885 0.0 ref|XP_004149630.1| PREDICTED: pentatricopeptide repeat-containi... 870 0.0 ref|XP_004159605.1| PREDICTED: pentatricopeptide repeat-containi... 869 0.0 ref|XP_006348483.1| PREDICTED: pentatricopeptide repeat-containi... 857 0.0 ref|XP_004513407.1| PREDICTED: putative pentatricopeptide repeat... 852 0.0 ref|XP_003546958.1| PREDICTED: pentatricopeptide repeat-containi... 847 0.0 ref|XP_006595472.1| PREDICTED: putative pentatricopeptide repeat... 843 0.0 ref|XP_007047616.1| Pentatricopeptide repeat (PPR) superfamily p... 842 0.0 ref|XP_002530608.1| pentatricopeptide repeat-containing protein,... 840 0.0 gb|EYU29622.1| hypothetical protein MIMGU_mgv1a023801mg [Mimulus... 816 0.0 ref|XP_006404107.1| hypothetical protein EUTSA_v10010190mg [Eutr... 805 0.0 ref|XP_006393982.1| hypothetical protein EUTSA_v10003830mg [Eutr... 797 0.0 ref|NP_201383.1| pentatricopeptide repeat-containing protein [Ar... 792 0.0 ref|XP_002866691.1| pentatricopeptide repeat-containing protein ... 790 0.0 ref|XP_006292382.1| hypothetical protein CARUB_v10018595mg [Caps... 786 0.0 ref|NP_190542.4| pentatricopeptide repeat-containing protein [Ar... 786 0.0 >ref|XP_003635394.1| PREDICTED: putative pentatricopeptide repeat-containing protein At5g65820-like [Vitis vinifera] Length = 622 Score = 917 bits (2371), Expect = 0.0 Identities = 446/567 (78%), Positives = 497/567 (87%), Gaps = 1/567 (0%) Frame = +1 Query: 280 TDRTQGSQFVCLE-NRPNCETHEQNADEFASDVEKLYRILKKFHSRVPKLELALQESGVV 456 ++R G V LE NR NC T++QN DEF++DVEK+YRIL+KFHSRVPKLELALQESGV Sbjct: 27 SERRGGFGLVRLESNRENC-TYDQNYDEFSADVEKVYRILRKFHSRVPKLELALQESGVA 85 Query: 457 IRSGLVERVLNRCGDAGSLGYRFFAWASKQPGYRHSYEVYKSMIKTLSKMRQFGAVWALI 636 +RSGL ERVLNRCGDAG+LGYRFF WASKQPGYRHSYEVYK+MIK L KMRQFGAVWALI Sbjct: 86 VRSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSYEVYKAMIKILGKMRQFGAVWALI 145 Query: 637 EEMRKENPQLLTADAFVVLMRRFASARMVKKAVEVLDEMPKYGCEPDEHVFGCLLDALCK 816 EEMR+ENPQ ++ FVVLMRRFASARMVKKA+EVLDEMPKYGCEPDEHVFGCLLDALCK Sbjct: 146 EEMRRENPQFVSPYVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEHVFGCLLDALCK 205 Query: 817 NGSVKEAALLFEDMRFRFTPNLKHFTSLLYGWCKEGKLMEAKFVLVQMREAGFEPDIVVY 996 NGSVKEAA LFEDMR RFTP LKHFTSLLYGWC+EGKLMEAK+VLVQ+REAGFEPDIVVY Sbjct: 206 NGSVKEAASLFEDMRIRFTPTLKHFTSLLYGWCREGKLMEAKYVLVQIREAGFEPDIVVY 265 Query: 997 NNLLSGYAMAGKMQDAFELLADMKNKGCDPNASSYTILIQSLCSREKMEDAMRMFVEMQR 1176 NNLL+GYA AGKM DA++LL +M+ K C+PN S+T LIQ+LC+++KME+AMR+F EMQ Sbjct: 266 NNLLTGYAAAGKMVDAYDLLKEMRRKECEPNVMSFTTLIQALCAKKKMEEAMRVFFEMQS 325 Query: 1177 SGCVADVVTYTTLISGFCKRGKIDKGYELLDVMIQKGCVPNQMTYFHILVAHXXXXXXXX 1356 GC AD VTYTTLISGFCK GKI KGYELLD MIQ+G +PN MTY HI+ AH Sbjct: 326 CGCPADAVTYTTLISGFCKWGKISKGYELLDNMIQQGHIPNPMTYLHIMAAHEKKEELEE 385 Query: 1357 XXXXXXXMQKIGCSPDLSTYNIVIRLACKLGEVGEGVKAWNSMEASGLSPGLDTFVIMVH 1536 M+KIGC+PDL+ YNIVIRLACKLGE+ EGV+ WN MEA+GLSPGLDTFVIM+H Sbjct: 386 CIELMEEMRKIGCTPDLNIYNIVIRLACKLGEIKEGVRVWNEMEATGLSPGLDTFVIMIH 445 Query: 1537 GFLGQGCLVEACKYFKEMVGRGLLSTPQYGILKELLNSLLRDQKLEMAKDVWSCIVGTGC 1716 GFL Q CLVEAC++FKEMVGRGLLS PQYG LKELLNSLLR +KLEM+KDVWSCI+ GC Sbjct: 446 GFLSQRCLVEACEFFKEMVGRGLLSAPQYGTLKELLNSLLRAEKLEMSKDVWSCIMTKGC 505 Query: 1717 DLNVYAWTIWIHALFSNGHVTEACSYCLDMLDAGVMPQPDTFAKLMRGLRKLYNRQIAAE 1896 DLNVYAWTIWIHALFSNGHV EACSYCLDM+DAGVMPQPDTFAKLMRGLRKLYNRQIAAE Sbjct: 506 DLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQIAAE 565 Query: 1897 ITEKVRQMAADRNVTFKMYKRRGERDL 1977 ITEKVR+MAA+R +TFKMYKRRGER+L Sbjct: 566 ITEKVRKMAAEREMTFKMYKRRGERNL 592 >gb|EXC12605.1| hypothetical protein L484_012982 [Morus notabilis] Length = 638 Score = 908 bits (2346), Expect = 0.0 Identities = 437/565 (77%), Positives = 488/565 (86%) Frame = +1 Query: 283 DRTQGSQFVCLENRPNCETHEQNADEFASDVEKLYRILKKFHSRVPKLELALQESGVVIR 462 +R G V LE P ++ DEF+ DVEK+YRIL+KFHSRV KLELALQESGVV+R Sbjct: 44 NRATGFSPVHLEQNPVVSDDDETHDEFSGDVEKIYRILRKFHSRVSKLELALQESGVVLR 103 Query: 463 SGLVERVLNRCGDAGSLGYRFFAWASKQPGYRHSYEVYKSMIKTLSKMRQFGAVWALIEE 642 SGL ERVL RCGDAGSLGYRFF WASKQPGYR SYEVYK+MI+ L KMRQFGAVWAL+EE Sbjct: 104 SGLTERVLGRCGDAGSLGYRFFVWASKQPGYRPSYEVYKAMIRALGKMRQFGAVWALLEE 163 Query: 643 MRKENPQLLTADAFVVLMRRFASARMVKKAVEVLDEMPKYGCEPDEHVFGCLLDALCKNG 822 MRKENPQL+T + FVVLMRRFASARMVKKAVEV DEMPKYGCEPDEHVFGCLLDALCKNG Sbjct: 164 MRKENPQLITPEIFVVLMRRFASARMVKKAVEVFDEMPKYGCEPDEHVFGCLLDALCKNG 223 Query: 823 SVKEAALLFEDMRFRFTPNLKHFTSLLYGWCKEGKLMEAKFVLVQMREAGFEPDIVVYNN 1002 SVKEAA LFE+MR +FTP+LKHFTSLLYGWC+EGKLMEAKFVLVQM+EAGFEPD+VVYNN Sbjct: 224 SVKEAASLFEEMRVKFTPSLKHFTSLLYGWCREGKLMEAKFVLVQMKEAGFEPDVVVYNN 283 Query: 1003 LLSGYAMAGKMQDAFELLADMKNKGCDPNASSYTILIQSLCSREKMEDAMRMFVEMQRSG 1182 LL GYA AGKM DA++L+ +M+ KGC PNA+SYT+LIQ+LC REKME+AMR+FVEMQRSG Sbjct: 284 LLGGYAQAGKMADAYDLMKEMRGKGCSPNAASYTVLIQALCKREKMEEAMRVFVEMQRSG 343 Query: 1183 CVADVVTYTTLISGFCKRGKIDKGYELLDVMIQKGCVPNQMTYFHILVAHXXXXXXXXXX 1362 C ADV+TYTTLISGFCK GKI++GYE+LD MIQ+G PN+ TY HI++AH Sbjct: 344 CDADVMTYTTLISGFCKWGKIERGYEILDSMIQRGFSPNETTYLHIMLAHEKKEEFEECV 403 Query: 1363 XXXXXMQKIGCSPDLSTYNIVIRLACKLGEVGEGVKAWNSMEASGLSPGLDTFVIMVHGF 1542 M+KIGC PDL YN VIRLACKL EV EGV+ WN +EASGLSPGLDTFV+M+HGF Sbjct: 404 ELIGEMRKIGCVPDLKIYNTVIRLACKLREVKEGVRLWNEIEASGLSPGLDTFVVMIHGF 463 Query: 1543 LGQGCLVEACKYFKEMVGRGLLSTPQYGILKELLNSLLRDQKLEMAKDVWSCIVGTGCDL 1722 LGQGCL+EAC+YFKEMV RGLLS PQYG LKELLN+LLR KLEMAKDVW+CIV GC++ Sbjct: 464 LGQGCLIEACQYFKEMVERGLLSGPQYGTLKELLNALLRADKLEMAKDVWTCIVNKGCEI 523 Query: 1723 NVYAWTIWIHALFSNGHVTEACSYCLDMLDAGVMPQPDTFAKLMRGLRKLYNRQIAAEIT 1902 NVYAWTIWIHALF NGHV EACSYCLDM+DA VMPQPDTFAKLMRGL+KLYNRQIAAEIT Sbjct: 524 NVYAWTIWIHALFKNGHVKEACSYCLDMMDADVMPQPDTFAKLMRGLKKLYNRQIAAEIT 583 Query: 1903 EKVRQMAADRNVTFKMYKRRGERDL 1977 EKVR+MA DR +TFKMYKRRGERDL Sbjct: 584 EKVRKMAEDRQMTFKMYKRRGERDL 608 >emb|CAN71515.1| hypothetical protein VITISV_021787 [Vitis vinifera] Length = 655 Score = 890 bits (2300), Expect = 0.0 Identities = 432/547 (78%), Positives = 481/547 (87%), Gaps = 2/547 (0%) Frame = +1 Query: 343 EQNADEFA-SDVEK-LYRILKKFHSRVPKLELALQESGVVIRSGLVERVLNRCGDAGSLG 516 E++ D +A +EK +YRIL+KFHSRVPKLELALQESGV +RSGL ERVLNRCGDAG+LG Sbjct: 79 EEDLDLYAWKAIEKTVYRILRKFHSRVPKLELALQESGVAVRSGLTERVLNRCGDAGNLG 138 Query: 517 YRFFAWASKQPGYRHSYEVYKSMIKTLSKMRQFGAVWALIEEMRKENPQLLTADAFVVLM 696 YRFF WASKQPGYRHSYEVYK+MIK L KMRQFGAVWALIEEMR+ENPQ ++ FVVLM Sbjct: 139 YRFFVWASKQPGYRHSYEVYKAMIKILGKMRQFGAVWALIEEMRRENPQFVSPYVFVVLM 198 Query: 697 RRFASARMVKKAVEVLDEMPKYGCEPDEHVFGCLLDALCKNGSVKEAALLFEDMRFRFTP 876 RRFASARMVKKA+EVLDEMPKYGCEPDEHVFGCLLDALCKNGSVKEAA LFEDMR RFTP Sbjct: 199 RRFASARMVKKAIEVLDEMPKYGCEPDEHVFGCLLDALCKNGSVKEAASLFEDMRIRFTP 258 Query: 877 NLKHFTSLLYGWCKEGKLMEAKFVLVQMREAGFEPDIVVYNNLLSGYAMAGKMQDAFELL 1056 LKHFTSLLYGWC+EGKLMEAK+VLVQ+REAGFEPDIVVYNNLL+GYA AGKM DA++LL Sbjct: 259 TLKHFTSLLYGWCREGKLMEAKYVLVQIREAGFEPDIVVYNNLLTGYAAAGKMVDAYDLL 318 Query: 1057 ADMKNKGCDPNASSYTILIQSLCSREKMEDAMRMFVEMQRSGCVADVVTYTTLISGFCKR 1236 +M+ K C+PN S+T LIQ+LC+++KME+AMR+F EMQ GC AD VTYTTLISGFCK Sbjct: 319 KEMRRKECEPNVMSFTTLIQALCAKKKMEEAMRVFFEMQSCGCPADAVTYTTLISGFCKW 378 Query: 1237 GKIDKGYELLDVMIQKGCVPNQMTYFHILVAHXXXXXXXXXXXXXXXMQKIGCSPDLSTY 1416 GKI KGYELLD MIQ+G +PN MTY HI+ AH M+KIGC+PDL+ Y Sbjct: 379 GKISKGYELLDNMIQQGHIPNPMTYLHIMAAHEKKEELEECIELMEEMRKIGCTPDLNIY 438 Query: 1417 NIVIRLACKLGEVGEGVKAWNSMEASGLSPGLDTFVIMVHGFLGQGCLVEACKYFKEMVG 1596 NIVIRLACKLGE+ EGV+ WN MEA+GLSPGLDTFVIM+HGFL Q CLVEAC++FKEMVG Sbjct: 439 NIVIRLACKLGEIKEGVRVWNEMEATGLSPGLDTFVIMIHGFLSQRCLVEACEFFKEMVG 498 Query: 1597 RGLLSTPQYGILKELLNSLLRDQKLEMAKDVWSCIVGTGCDLNVYAWTIWIHALFSNGHV 1776 RGLLS PQYG LKELLNSLLR +KLEM+KDVWSCI+ GCDLNVYAWTIWIHALFSNGHV Sbjct: 499 RGLLSAPQYGTLKELLNSLLRAEKLEMSKDVWSCIMTKGCDLNVYAWTIWIHALFSNGHV 558 Query: 1777 TEACSYCLDMLDAGVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVRQMAADRNVTFKMYK 1956 EACSYCLDM+DAGVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVR+MAA+R +TFKMYK Sbjct: 559 KEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVRKMAAEREMTFKMYK 618 Query: 1957 RRGERDL 1977 RRGER+L Sbjct: 619 RRGERNL 625 >ref|XP_006426145.1| hypothetical protein CICLE_v10025134mg [Citrus clementina] gi|557528135|gb|ESR39385.1| hypothetical protein CICLE_v10025134mg [Citrus clementina] Length = 638 Score = 887 bits (2293), Expect = 0.0 Identities = 427/577 (74%), Positives = 492/577 (85%), Gaps = 7/577 (1%) Frame = +1 Query: 268 STVATDRTQGSQFVCLENRPN-------CETHEQNADEFASDVEKLYRILKKFHSRVPKL 426 ST T S VCL+ + + +TH + +EF+ DVEK++RILKKFHSR+PKL Sbjct: 33 STATTTNQLNSNLVCLKTKEDDCKCNNTTDTHGSH-NEFSHDVEKIFRILKKFHSRLPKL 91 Query: 427 ELALQESGVVIRSGLVERVLNRCGDAGSLGYRFFAWASKQPGYRHSYEVYKSMIKTLSKM 606 ELALQ SGVV+R GL ERV+NRCGDAG+LGYR++ WASKQP Y HSY+VY+++IK+LSKM Sbjct: 92 ELALQHSGVVLRPGLTERVINRCGDAGNLGYRYYMWASKQPNYVHSYDVYRALIKSLSKM 151 Query: 607 RQFGAVWALIEEMRKENPQLLTADAFVVLMRRFASARMVKKAVEVLDEMPKYGCEPDEHV 786 R+FGAVWAL+EEMRKE PQL+T + FV+LMRRFASARMVKKA+EVLDEMPKYGCEPDE V Sbjct: 152 RKFGAVWALMEEMRKEKPQLITTEVFVILMRRFASARMVKKAIEVLDEMPKYGCEPDEFV 211 Query: 787 FGCLLDALCKNGSVKEAALLFEDMRFRFTPNLKHFTSLLYGWCKEGKLMEAKFVLVQMRE 966 FGCLLDALCKN SVKEAA LF++MR RF P+L+HFTSLLYGWCKEGKL+EAK+VLVQM++ Sbjct: 212 FGCLLDALCKNSSVKEAAKLFDEMRERFKPSLRHFTSLLYGWCKEGKLVEAKYVLVQMKD 271 Query: 967 AGFEPDIVVYNNLLSGYAMAGKMQDAFELLADMKNKGCDPNASSYTILIQSLCSREKMED 1146 AGFEPDIVVYNNLLSGYA GKM DAFELL +M+ KGCDPNA+SYT+LIQ+LC EKME+ Sbjct: 272 AGFEPDIVVYNNLLSGYAQMGKMTDAFELLKEMRRKGCDPNANSYTVLIQALCRMEKMEE 331 Query: 1147 AMRMFVEMQRSGCVADVVTYTTLISGFCKRGKIDKGYELLDVMIQKGCVPNQMTYFHILV 1326 A R FVEM+RSGC ADVVTYTTLISGFCK KID+ YE+LD MIQ+G +PNQ+TY HI++ Sbjct: 332 ANRAFVEMERSGCEADVVTYTTLISGFCKSRKIDRCYEILDSMIQRGILPNQLTYLHIML 391 Query: 1327 AHXXXXXXXXXXXXXXXMQKIGCSPDLSTYNIVIRLACKLGEVGEGVKAWNSMEASGLSP 1506 AH M+KIGC PD+S YN+VIRLACKLGE+ E V WN MEA+ LSP Sbjct: 392 AHEKKEELEECVELMGEMRKIGCVPDVSNYNVVIRLACKLGELKEAVNVWNEMEAASLSP 451 Query: 1507 GLDTFVIMVHGFLGQGCLVEACKYFKEMVGRGLLSTPQYGILKELLNSLLRDQKLEMAKD 1686 G D+FV+MVHGFLGQGCL+EAC+YFKEMVGRGLLS PQYG LKELLNSLLR QK+EMAKD Sbjct: 452 GTDSFVVMVHGFLGQGCLIEACEYFKEMVGRGLLSAPQYGTLKELLNSLLRAQKVEMAKD 511 Query: 1687 VWSCIVGTGCDLNVYAWTIWIHALFSNGHVTEACSYCLDMLDAGVMPQPDTFAKLMRGLR 1866 VWSCIV GC+LNVYAWTIWIH+LFSNGHV EACSYCLDM+DA VMPQPDTFAKLMRGL+ Sbjct: 512 VWSCIVTKGCELNVYAWTIWIHSLFSNGHVKEACSYCLDMMDADVMPQPDTFAKLMRGLK 571 Query: 1867 KLYNRQIAAEITEKVRQMAADRNVTFKMYKRRGERDL 1977 KLYNRQIAAEITEKVR+MAA+R +TFKMYKRRGERDL Sbjct: 572 KLYNRQIAAEITEKVRKMAAERQITFKMYKRRGERDL 608 >ref|XP_006466418.1| PREDICTED: putative pentatricopeptide repeat-containing protein At5g65820-like [Citrus sinensis] Length = 638 Score = 885 bits (2286), Expect = 0.0 Identities = 428/595 (71%), Positives = 497/595 (83%), Gaps = 7/595 (1%) Frame = +1 Query: 214 HRCYINNTEEKSGKSRRISTVATDRTQGSQFVCLENRPN-------CETHEQNADEFASD 372 HR + + + S ST T S VCL+ + + +TH + +EF+ D Sbjct: 15 HRRFSFSPRPNTTTSHVESTATTTNQLNSNLVCLKTKEDDCKCDNTTDTHGSH-NEFSHD 73 Query: 373 VEKLYRILKKFHSRVPKLELALQESGVVIRSGLVERVLNRCGDAGSLGYRFFAWASKQPG 552 VEK++RILKKFHSR+PKLELALQ SGVV+R GL ERV+NRCGDAG+LGYR++ WASKQP Sbjct: 74 VEKIFRILKKFHSRLPKLELALQHSGVVLRPGLTERVINRCGDAGNLGYRYYMWASKQPN 133 Query: 553 YRHSYEVYKSMIKTLSKMRQFGAVWALIEEMRKENPQLLTADAFVVLMRRFASARMVKKA 732 Y HSY+VY+++IK+LSKMR+FGAVWAL+EEMRKE PQL+T + FV+LMRRFASARMVKKA Sbjct: 134 YVHSYDVYRALIKSLSKMRKFGAVWALMEEMRKEKPQLITTEVFVILMRRFASARMVKKA 193 Query: 733 VEVLDEMPKYGCEPDEHVFGCLLDALCKNGSVKEAALLFEDMRFRFTPNLKHFTSLLYGW 912 +EVLDEMPKYGCEPDE VFGCLLDALCKN SVKEAA LF+++R RF P+L+HFTSLLYGW Sbjct: 194 IEVLDEMPKYGCEPDEFVFGCLLDALCKNSSVKEAAKLFDEIRERFKPSLRHFTSLLYGW 253 Query: 913 CKEGKLMEAKFVLVQMREAGFEPDIVVYNNLLSGYAMAGKMQDAFELLADMKNKGCDPNA 1092 CKEGKL+EAK+VLVQM++AGFEPDIVVYNNLLSGYA GKM DAFELL +M+ KGCDPNA Sbjct: 254 CKEGKLVEAKYVLVQMKDAGFEPDIVVYNNLLSGYAQMGKMTDAFELLKEMRRKGCDPNA 313 Query: 1093 SSYTILIQSLCSREKMEDAMRMFVEMQRSGCVADVVTYTTLISGFCKRGKIDKGYELLDV 1272 +SYT+LIQ+LC EKME+A R FVEM+RSGC ADVVTYTTLISGFCK KID+ YE+LD Sbjct: 314 NSYTVLIQALCRMEKMEEANRAFVEMERSGCEADVVTYTTLISGFCKSRKIDRCYEILDS 373 Query: 1273 MIQKGCVPNQMTYFHILVAHXXXXXXXXXXXXXXXMQKIGCSPDLSTYNIVIRLACKLGE 1452 MIQ+G +PNQ+TY HI++AH M+KIGC PD+S YN+VIRLACKLGE Sbjct: 374 MIQRGILPNQLTYLHIMLAHEKKEELEECVELMGEMRKIGCVPDVSNYNVVIRLACKLGE 433 Query: 1453 VGEGVKAWNSMEASGLSPGLDTFVIMVHGFLGQGCLVEACKYFKEMVGRGLLSTPQYGIL 1632 + E V WN MEA+ LSPG D+FV+MVHGFLGQGCL+EAC+YFKEMVGRGLLS PQYG L Sbjct: 434 LKEAVNVWNEMEAASLSPGTDSFVVMVHGFLGQGCLIEACEYFKEMVGRGLLSAPQYGTL 493 Query: 1633 KELLNSLLRDQKLEMAKDVWSCIVGTGCDLNVYAWTIWIHALFSNGHVTEACSYCLDMLD 1812 K LLNSLLR QK+EMAKDVWSCIV GC+LNVYAWTIWIH+LFSNGHV EACSYCLDM+D Sbjct: 494 KALLNSLLRAQKVEMAKDVWSCIVTKGCELNVYAWTIWIHSLFSNGHVKEACSYCLDMMD 553 Query: 1813 AGVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVRQMAADRNVTFKMYKRRGERDL 1977 A VMPQPDTFAKLMRGL+KLYNRQIAAEITEKVR+MAA+R +TFKMYKRRGERDL Sbjct: 554 ADVMPQPDTFAKLMRGLKKLYNRQIAAEITEKVRKMAAERQITFKMYKRRGERDL 608 >ref|XP_004149630.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like [Cucumis sativus] Length = 641 Score = 870 bits (2247), Expect = 0.0 Identities = 419/574 (72%), Positives = 484/574 (84%) Frame = +1 Query: 256 SRRISTVATDRTQGSQFVCLENRPNCETHEQNADEFASDVEKLYRILKKFHSRVPKLELA 435 S + S T + G + L+ P+ H+++ADEF+ DVEK+YRIL+KFH+RVPKLELA Sbjct: 38 SSQTSPNGTTQRGGFGPIHLKTTPHESAHDRDADEFSVDVEKVYRILRKFHTRVPKLELA 97 Query: 436 LQESGVVIRSGLVERVLNRCGDAGSLGYRFFAWASKQPGYRHSYEVYKSMIKTLSKMRQF 615 LQESGV++RSGL ERVL+RCGDAG+LGYRFF WASKQPGYRHSYEVYK+MIKTL KMRQF Sbjct: 98 LQESGVIMRSGLPERVLSRCGDAGNLGYRFFVWASKQPGYRHSYEVYKAMIKTLGKMRQF 157 Query: 616 GAVWALIEEMRKENPQLLTADAFVVLMRRFASARMVKKAVEVLDEMPKYGCEPDEHVFGC 795 GAVWALIEEMRKENP +LT + F+VLMRRFAS RMVKKAVEVLDEMPKYGCEPDE+VFGC Sbjct: 158 GAVWALIEEMRKENPYMLTPEVFIVLMRRFASVRMVKKAVEVLDEMPKYGCEPDEYVFGC 217 Query: 796 LLDALCKNGSVKEAALLFEDMRFRFTPNLKHFTSLLYGWCKEGKLMEAKFVLVQMREAGF 975 LLDALCKNGSVKEAA LFEDMR RF PNL+HFTSLLYGWC+EGK+MEAK VLVQ++EAGF Sbjct: 218 LLDALCKNGSVKEAASLFEDMRVRFNPNLRHFTSLLYGWCREGKIMEAKHVLVQIKEAGF 277 Query: 976 EPDIVVYNNLLSGYAMAGKMQDAFELLADMKNKGCDPNASSYTILIQSLCSREKMEDAMR 1155 EPDIVVYNNLL GYA AGKM+DAF+LLA+MK C PNA+S+TILIQS C EKM++AMR Sbjct: 278 EPDIVVYNNLLGGYAQAGKMRDAFDLLAEMKKVNCGPNAASFTILIQSFCKTEKMDEAMR 337 Query: 1156 MFVEMQRSGCVADVVTYTTLISGFCKRGKIDKGYELLDVMIQKGCVPNQMTYFHILVAHX 1335 +F EMQ SGC ADVVTYTTLISGFCK G DK YE+LD MIQKG P+Q++Y I++AH Sbjct: 338 IFTEMQGSGCEADVVTYTTLISGFCKWGNTDKAYEILDDMIQKGHDPSQLSYLCIMMAHE 397 Query: 1336 XXXXXXXXXXXXXXMQKIGCSPDLSTYNIVIRLACKLGEVGEGVKAWNSMEASGLSPGLD 1515 M+KIGC PDL+ YN +IRL CKLG++ E V+ W M+A GL+PGLD Sbjct: 398 KKEELEECMELIEEMRKIGCVPDLNIYNTMIRLVCKLGDLKEAVRLWGEMQAGGLNPGLD 457 Query: 1516 TFVIMVHGFLGQGCLVEACKYFKEMVGRGLLSTPQYGILKELLNSLLRDQKLEMAKDVWS 1695 T+++MVHGFL QGCLVEAC YFKEMV RGLLS PQYG LKEL N+LLR +KLEMAK++WS Sbjct: 458 TYILMVHGFLSQGCLVEACDYFKEMVERGLLSAPQYGTLKELTNALLRAEKLEMAKNMWS 517 Query: 1696 CIVGTGCDLNVYAWTIWIHALFSNGHVTEACSYCLDMLDAGVMPQPDTFAKLMRGLRKLY 1875 C+ GC+LNV AWTIWIHALFSNGHV EACSYCLDM+DA +MPQPDTFAKLMRGL+KL+ Sbjct: 518 CMTTKGCELNVSAWTIWIHALFSNGHVKEACSYCLDMMDADLMPQPDTFAKLMRGLKKLF 577 Query: 1876 NRQIAAEITEKVRQMAADRNVTFKMYKRRGERDL 1977 +RQ+A EITEKVR+MAADR +TFKMYKRRGERDL Sbjct: 578 HRQLAVEITEKVRKMAADRQITFKMYKRRGERDL 611 >ref|XP_004159605.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like [Cucumis sativus] Length = 664 Score = 869 bits (2245), Expect = 0.0 Identities = 418/572 (73%), Positives = 483/572 (84%) Frame = +1 Query: 262 RISTVATDRTQGSQFVCLENRPNCETHEQNADEFASDVEKLYRILKKFHSRVPKLELALQ 441 + S T + G + L+ P+ H+++ADEF+ DVEK+YRIL+KFH+RVPKLELALQ Sbjct: 63 KTSPNGTTQRGGFGPIHLKTTPHESAHDRDADEFSVDVEKVYRILRKFHTRVPKLELALQ 122 Query: 442 ESGVVIRSGLVERVLNRCGDAGSLGYRFFAWASKQPGYRHSYEVYKSMIKTLSKMRQFGA 621 ESGV++RSGL ERVL+RCGDAG+LGYRFF WASKQPGYRHSYEVYK+MIKTL KMRQFGA Sbjct: 123 ESGVIMRSGLPERVLSRCGDAGNLGYRFFVWASKQPGYRHSYEVYKAMIKTLGKMRQFGA 182 Query: 622 VWALIEEMRKENPQLLTADAFVVLMRRFASARMVKKAVEVLDEMPKYGCEPDEHVFGCLL 801 VWALIEEMRKENP +LT + F+VLMRRFAS RMVKKAVEVLDEMPKYGCEPDE+VFGCLL Sbjct: 183 VWALIEEMRKENPYMLTPEVFIVLMRRFASVRMVKKAVEVLDEMPKYGCEPDEYVFGCLL 242 Query: 802 DALCKNGSVKEAALLFEDMRFRFTPNLKHFTSLLYGWCKEGKLMEAKFVLVQMREAGFEP 981 DALCKNGSVKEAA LFEDMR RF PNL+HFTSLLYGWC+EGK+MEAK VLVQ++EAGFEP Sbjct: 243 DALCKNGSVKEAASLFEDMRVRFNPNLRHFTSLLYGWCREGKIMEAKHVLVQIKEAGFEP 302 Query: 982 DIVVYNNLLSGYAMAGKMQDAFELLADMKNKGCDPNASSYTILIQSLCSREKMEDAMRMF 1161 DIVVYNNLL GYA AGKM+DAF+LLA+MK C PNA+S+TILIQS C EKM++AMR+F Sbjct: 303 DIVVYNNLLGGYAQAGKMRDAFDLLAEMKKVNCGPNAASFTILIQSFCKTEKMDEAMRIF 362 Query: 1162 VEMQRSGCVADVVTYTTLISGFCKRGKIDKGYELLDVMIQKGCVPNQMTYFHILVAHXXX 1341 EMQ SGC ADVVTYTTLISGFCK G DK YE+LD MIQKG P+Q++Y I++AH Sbjct: 363 TEMQGSGCEADVVTYTTLISGFCKWGNTDKAYEILDDMIQKGHDPSQLSYLCIMMAHEKK 422 Query: 1342 XXXXXXXXXXXXMQKIGCSPDLSTYNIVIRLACKLGEVGEGVKAWNSMEASGLSPGLDTF 1521 M+KIGC PDL+ YN +IRL CKLG++ E V+ W M+A GL+PGLDT+ Sbjct: 423 EELEECMELIEEMRKIGCVPDLNIYNTMIRLVCKLGDLKEAVRLWGEMQAGGLNPGLDTY 482 Query: 1522 VIMVHGFLGQGCLVEACKYFKEMVGRGLLSTPQYGILKELLNSLLRDQKLEMAKDVWSCI 1701 ++MVHGFL QGCLVEAC YFKEMV RGLLS PQYG LKEL N+LLR +KLEMAK++WSC+ Sbjct: 483 ILMVHGFLSQGCLVEACDYFKEMVERGLLSAPQYGTLKELTNALLRAEKLEMAKNMWSCM 542 Query: 1702 VGTGCDLNVYAWTIWIHALFSNGHVTEACSYCLDMLDAGVMPQPDTFAKLMRGLRKLYNR 1881 GC+LNV AWTIWIHALFSNGHV EACSYCLDM+DA +MPQPDTFAKLMRGL+KL++R Sbjct: 543 TTKGCELNVSAWTIWIHALFSNGHVKEACSYCLDMMDADLMPQPDTFAKLMRGLKKLFHR 602 Query: 1882 QIAAEITEKVRQMAADRNVTFKMYKRRGERDL 1977 Q+A EITEKVR+MAADR +TFKMYKRRGERDL Sbjct: 603 QLAVEITEKVRKMAADRQITFKMYKRRGERDL 634 >ref|XP_006348483.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like [Solanum tuberosum] Length = 625 Score = 857 bits (2215), Expect = 0.0 Identities = 406/544 (74%), Positives = 474/544 (87%) Frame = +1 Query: 346 QNADEFASDVEKLYRILKKFHSRVPKLELALQESGVVIRSGLVERVLNRCGDAGSLGYRF 525 +N DEF++DVEK+YRIL+KFHSRVPKLELAL ESGVV RSGL ERVLNRCGDAG+LGYRF Sbjct: 52 KNHDEFSADVEKVYRILRKFHSRVPKLELALLESGVVARSGLTERVLNRCGDAGNLGYRF 111 Query: 526 FAWASKQPGYRHSYEVYKSMIKTLSKMRQFGAVWALIEEMRKENPQLLTADAFVVLMRRF 705 F W SKQPGYRHS++ YK+MIK L KMRQFG VWAL+EEMR ENPQ LT + F+VLMRRF Sbjct: 112 FVWVSKQPGYRHSHDAYKAMIKILGKMRQFGTVWALVEEMRIENPQFLTPEVFIVLMRRF 171 Query: 706 ASARMVKKAVEVLDEMPKYGCEPDEHVFGCLLDALCKNGSVKEAALLFEDMRFRFTPNLK 885 AS RMVKKA+EVLDEMPKYG EPDE+VFGCLLDALCKNGSVKEAA LF++MRFRF+P +K Sbjct: 172 ASGRMVKKAIEVLDEMPKYGVEPDEYVFGCLLDALCKNGSVKEAAALFDEMRFRFSPTIK 231 Query: 886 HFTSLLYGWCKEGKLMEAKFVLVQMREAGFEPDIVVYNNLLSGYAMAGKMQDAFELLADM 1065 HFTSLLYGWCKEGKL+EAK VLV+MREAGFEPDIVVYNNLL+GYA++ KM DAF+LL +M Sbjct: 232 HFTSLLYGWCKEGKLIEAKVVLVKMREAGFEPDIVVYNNLLNGYAVSRKMADAFDLLQEM 291 Query: 1066 KNKGCDPNASSYTILIQSLCSREKMEDAMRMFVEMQRSGCVADVVTYTTLISGFCKRGKI 1245 + KGC+PN +S+TI+IQ+LC ++KME+AMR+F++M+RSGC DVVTYTTLISGFCK GKI Sbjct: 292 RRKGCNPNETSFTIVIQALCLQDKMEEAMRVFLDMERSGCEGDVVTYTTLISGFCKWGKI 351 Query: 1246 DKGYELLDVMIQKGCVPNQMTYFHILVAHXXXXXXXXXXXXXXXMQKIGCSPDLSTYNIV 1425 +KGYEL+D M+QKG PNQ TY HI++AH M KIG PD S YNIV Sbjct: 352 EKGYELVDTMLQKGYNPNQTTYLHIMLAHEKKEELEECLELVKEMGKIGIPPDHSIYNIV 411 Query: 1426 IRLACKLGEVGEGVKAWNSMEASGLSPGLDTFVIMVHGFLGQGCLVEACKYFKEMVGRGL 1605 IRLACKLGE+ EGV+ WN +EA+G+SPG+DTF+IM++GF+ QG L+EAC +FKEM+GRGL Sbjct: 412 IRLACKLGEIDEGVRVWNQIEANGISPGVDTFIIMINGFVEQGRLIEACDHFKEMIGRGL 471 Query: 1606 LSTPQYGILKELLNSLLRDQKLEMAKDVWSCIVGTGCDLNVYAWTIWIHALFSNGHVTEA 1785 LS PQYG LK+LLNSLLR +KLE+ KDVWSCI+ GC+LNV AWTIWIHALFSNGHV EA Sbjct: 472 LSAPQYGTLKDLLNSLLRAEKLELCKDVWSCIMTKGCELNVSAWTIWIHALFSNGHVKEA 531 Query: 1786 CSYCLDMLDAGVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVRQMAADRNVTFKMYKRRG 1965 C+YCLDM+DAG+MPQPDTFAKLM+GLRKLYNR+IAAEITEK R+MA RN+TFKMYKRRG Sbjct: 532 CAYCLDMMDAGLMPQPDTFAKLMKGLRKLYNREIAAEITEKARKMAEQRNMTFKMYKRRG 591 Query: 1966 ERDL 1977 ERDL Sbjct: 592 ERDL 595 >ref|XP_004513407.1| PREDICTED: putative pentatricopeptide repeat-containing protein At5g65820-like isoform X1 [Cicer arietinum] gi|502165084|ref|XP_004513408.1| PREDICTED: putative pentatricopeptide repeat-containing protein At5g65820-like isoform X2 [Cicer arietinum] Length = 655 Score = 852 bits (2200), Expect = 0.0 Identities = 409/569 (71%), Positives = 480/569 (84%), Gaps = 3/569 (0%) Frame = +1 Query: 280 TDRTQGSQF--VCLENRPNCETHEQNADEFASDVEKLYRILKKFHSRVPKLELALQESGV 453 T T+ QF + L++ N + + DEF SDVEK+YRIL+K+HSRVPKLELAL+ESGV Sbjct: 59 TTITKNDQFGLIHLQSNANHFNDQNSDDEFTSDVEKVYRILRKYHSRVPKLELALKESGV 118 Query: 454 VIRSGLVERVLNRCGDAGSLGYRFFAWASKQPGYRHSYEVYKSMIKTLSKMRQFGAVWAL 633 V+ SGL ERVLNRCG++G+L YRFF+WASKQ GYRHS EVYK+MIK LSKMRQFGAVWAL Sbjct: 119 VVSSGLTERVLNRCGNSGNLAYRFFSWASKQSGYRHSEEVYKAMIKVLSKMRQFGAVWAL 178 Query: 634 IEEMRKENPQLLTADAFVVLMRRFASARMVKKAVEVLDEMPKYGCEPDEHVFGCLLDALC 813 I+EMR ENPQL++ FV+LMRRFASARMV KA+EVLDEMPKYGCEPDE+VFGCLLDALC Sbjct: 179 IDEMRLENPQLISPHVFVILMRRFASARMVHKAIEVLDEMPKYGCEPDEYVFGCLLDALC 238 Query: 814 KNGSVKEAALLFEDMRFRFTPNLKHFTSLLYGWCKEGKLMEAKFVLVQMREAGFEPDIVV 993 KNGS+KEAA LFEDMR+RF P +KHFTSLLYGWCKEGKL+EAK VLVQM++AG EPDIVV Sbjct: 239 KNGSIKEAASLFEDMRYRFPPTVKHFTSLLYGWCKEGKLVEAKHVLVQMKDAGIEPDIVV 298 Query: 994 YNNLLSGYAMAGKMQDAFELLADMKNKGCDPNASSYTILIQSLCSREKMEDAMRMFVEMQ 1173 +NNLL GYA GKM DA++LL +MK KGC+PNA+SYTILIQSLC EK+E+AMR+FVEMQ Sbjct: 299 FNNLLGGYAQGGKMADAYDLLKEMKRKGCEPNAASYTILIQSLCKHEKLEEAMRIFVEMQ 358 Query: 1174 RSGCVADVVTYTTLISGFCKRGKIDKGYELLDVMIQKGCVPNQMTYFHILVAHXXXXXXX 1353 R+ C DV+TYTTLISGFCK GKI +GYELLD MIQ+G PNQ+TY HI++AH Sbjct: 359 RNDCQMDVITYTTLISGFCKWGKIKRGYELLDQMIQEGHSPNQLTYLHIMLAHEKKEELE 418 Query: 1354 XXXXXXXXMQKIGCSPDLSTYNIVIRLACKLGEVGEGVKAWNSMEASGLSPGLDTFVIMV 1533 M+KIGC P+L+ YN VIRLACK GEV +GV+ WN MEASGLSPG DTFV+M+ Sbjct: 419 ECMELVNEMKKIGCVPNLNIYNTVIRLACKFGEVKQGVRLWNEMEASGLSPGTDTFVVMI 478 Query: 1534 HGFLGQGCLVEACKYFKEMVGRGLLSTPQYGILKELLNSLLRDQKLEMAKDVWSCIVGT- 1710 +GFL Q CL+EAC+YFKEMVGRGL + PQYG LKEL+NSLLR +KLEMAKD W+CI + Sbjct: 479 NGFLEQDCLIEACEYFKEMVGRGLFAAPQYGTLKELMNSLLRAEKLEMAKDTWNCITASK 538 Query: 1711 GCDLNVYAWTIWIHALFSNGHVTEACSYCLDMLDAGVMPQPDTFAKLMRGLRKLYNRQIA 1890 C++NV AWTIWIHALFS GHV EACS+C+DM+D +MPQPDTFAKL+RGL+KLYNR+ A Sbjct: 539 SCEMNVAAWTIWIHALFSKGHVKEACSFCIDMMDNDLMPQPDTFAKLIRGLKKLYNREFA 598 Query: 1891 AEITEKVRQMAADRNVTFKMYKRRGERDL 1977 AEITEKVR+MAADR++TFKMYKRRGERDL Sbjct: 599 AEITEKVRKMAADRHITFKMYKRRGERDL 627 >ref|XP_003546958.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like isoform X1 [Glycine max] gi|571514894|ref|XP_006597171.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like isoform X2 [Glycine max] gi|571514897|ref|XP_006597172.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like isoform X3 [Glycine max] Length = 654 Score = 847 bits (2188), Expect = 0.0 Identities = 406/548 (74%), Positives = 469/548 (85%), Gaps = 1/548 (0%) Frame = +1 Query: 337 THEQNADEFASDVEKLYRILKKFHSRVPKLELALQESGVVIRSGLVERVLNRCGDAGSLG 516 T + DEFASDVEK+YRIL+K+HSRVPKLELAL+ESGVV+R GL ERVL+RCGDAG+L Sbjct: 80 TDDHTHDEFASDVEKVYRILRKYHSRVPKLELALRESGVVVRPGLTERVLSRCGDAGNLA 139 Query: 517 YRFFAWASKQPGYRHSYEVYKSMIKTLSKMRQFGAVWALIEEMRKENPQLLTADAFVVLM 696 YRF++WASKQ G+R ++ YK+MIK LS+MRQFGAVWALIEEMR+ENP L+T FV+LM Sbjct: 140 YRFYSWASKQSGHRLDHDAYKAMIKVLSRMRQFGAVWALIEEMRQENPHLITPQVFVILM 199 Query: 697 RRFASARMVKKAVEVLDEMPKYGCEPDEHVFGCLLDALCKNGSVKEAALLFEDMRFRFTP 876 RRFASARMV KAVEVLDEMPKYGCEPDE+VFGCLLDALCKNGSVKEAA LFEDMR+R+ P Sbjct: 200 RRFASARMVHKAVEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAASLFEDMRYRWKP 259 Query: 877 NLKHFTSLLYGWCKEGKLMEAKFVLVQMREAGFEPDIVVYNNLLSGYAMAGKMQDAFELL 1056 ++KHFTSLLYGWCKEGKLMEAK VLVQM++ G EPDIVVYNNLL GYA AGKM DA++LL Sbjct: 260 SVKHFTSLLYGWCKEGKLMEAKHVLVQMKDMGIEPDIVVYNNLLGGYAQAGKMGDAYDLL 319 Query: 1057 ADMKNKGCDPNASSYTILIQSLCSREKMEDAMRMFVEMQRSGCVADVVTYTTLISGFCKR 1236 +M+ K C+PNA+SYT+LIQSLC E++E+A R+FVEMQ +GC ADVVTY+TLISGFCK Sbjct: 320 KEMRRKRCEPNATSYTVLIQSLCKHERLEEATRLFVEMQTNGCQADVVTYSTLISGFCKW 379 Query: 1237 GKIDKGYELLDVMIQKGCVPNQMTYFHILVAHXXXXXXXXXXXXXXXMQKIGCSPDLSTY 1416 GKI +GYELLD MIQ+G PNQ+ Y HI++AH MQKIGC+PDLS Y Sbjct: 380 GKIKRGYELLDEMIQQGHFPNQVIYQHIMLAHEKKEELEECKELVNEMQKIGCAPDLSIY 439 Query: 1417 NIVIRLACKLGEVGEGVKAWNSMEASGLSPGLDTFVIMVHGFLGQGCLVEACKYFKEMVG 1596 N VIRLACKLGEV EG++ WN ME+SGLSPG+DTFVIM++GFL QGCLVEAC+YFKEMVG Sbjct: 440 NTVIRLACKLGEVKEGIQLWNEMESSGLSPGMDTFVIMINGFLEQGCLVEACEYFKEMVG 499 Query: 1597 RGLLSTPQYGILKELLNSLLRDQKLEMAKDVWSCIVGT-GCDLNVYAWTIWIHALFSNGH 1773 RGL + PQYG LKEL+NSLLR +KLEMAKD W+CI + GC LNV AWTIWIHALFS GH Sbjct: 500 RGLFTAPQYGTLKELMNSLLRAEKLEMAKDAWNCITASKGCQLNVSAWTIWIHALFSKGH 559 Query: 1774 VTEACSYCLDMLDAGVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVRQMAADRNVTFKMY 1953 V EACS+C+DM+D +MP PDTFAKLM GL+KLYNRQ AAEITEKVR+MAADR +TFKMY Sbjct: 560 VKEACSFCIDMMDKDLMPNPDTFAKLMHGLKKLYNRQFAAEITEKVRKMAADRQITFKMY 619 Query: 1954 KRRGERDL 1977 KRRGERDL Sbjct: 620 KRRGERDL 627 >ref|XP_006595472.1| PREDICTED: putative pentatricopeptide repeat-containing protein At5g65820-like, partial [Glycine max] Length = 656 Score = 843 bits (2179), Expect = 0.0 Identities = 407/559 (72%), Positives = 474/559 (84%), Gaps = 1/559 (0%) Frame = +1 Query: 304 FVCLENRPNCETHEQNADEFASDVEKLYRILKKFHSRVPKLELALQESGVVIRSGLVERV 483 F+ L+ T +Q DEFASDVEK+YRIL+K+HSRVPKLELAL+ESGVV+R GL ERV Sbjct: 71 FIRLQEISINHTDDQTHDEFASDVEKVYRILRKYHSRVPKLELALRESGVVVRPGLTERV 130 Query: 484 LNRCGDAGSLGYRFFAWASKQPGYRHSYEVYKSMIKTLSKMRQFGAVWALIEEMRKENPQ 663 LNRCGDAG+L YRF++WASKQ G+R ++ YK+MIK LS+MRQFGAVWALIEEMR+ENP Sbjct: 131 LNRCGDAGNLAYRFYSWASKQSGHRLDHDAYKAMIKVLSRMRQFGAVWALIEEMRQENPH 190 Query: 664 LLTADAFVVLMRRFASARMVKKAVEVLDEMPKYGCEPDEHVFGCLLDALCKNGSVKEAAL 843 L+T FV+LMRRFASARMV KAV+VLDEMP YGCEPDE+VFGCLLDAL KNGSVKEAA Sbjct: 191 LITPQVFVILMRRFASARMVHKAVQVLDEMPNYGCEPDEYVFGCLLDALRKNGSVKEAAS 250 Query: 844 LFEDMRFRFTPNLKHFTSLLYGWCKEGKLMEAKFVLVQMREAGFEPDIVVYNNLLSGYAM 1023 LFE++R+R+ P++KHFTSLLYGWCKEGKLMEAK VLVQM++AG EPDIVVYNNLL GYA Sbjct: 251 LFEELRYRWKPSVKHFTSLLYGWCKEGKLMEAKHVLVQMKDAGIEPDIVVYNNLLGGYAQ 310 Query: 1024 AGKMQDAFELLADMKNKGCDPNASSYTILIQSLCSREKMEDAMRMFVEMQRSGCVADVVT 1203 A KM DA++LL +M+ KGC+PNA+SYT+LIQSLC E++E+A R+FVEMQR+GC AD+VT Sbjct: 311 ADKMGDAYDLLKEMRRKGCEPNATSYTVLIQSLCKHERLEEATRVFVEMQRNGCQADLVT 370 Query: 1204 YTTLISGFCKRGKIDKGYELLDVMIQKGCVPNQMTYFHILVAHXXXXXXXXXXXXXXXMQ 1383 Y+TLISGFCK GKI +GYELLD MIQ+G PNQ+ Y HI+VAH MQ Sbjct: 371 YSTLISGFCKWGKIKRGYELLDEMIQQGHFPNQVIYQHIMVAHEKKEELEECKELVNEMQ 430 Query: 1384 KIGCSPDLSTYNIVIRLACKLGEVGEGVKAWNSMEASGLSPGLDTFVIMVHGFLGQGCLV 1563 KIGC+PDLS YN VIRLACKLGEV EGV+ WN ME+SGLSP +DTFVIM++GFL QGCLV Sbjct: 431 KIGCAPDLSIYNTVIRLACKLGEVKEGVRLWNEMESSGLSPSIDTFVIMINGFLEQGCLV 490 Query: 1564 EACKYFKEMVGRGLLSTPQYGILKELLNSLLRDQKLEMAKDVWSCIVGT-GCDLNVYAWT 1740 EAC+YFKEMVGRGL + PQYG LKEL+NSLLR +KLEMAKD W+CI + GC LNV AWT Sbjct: 491 EACEYFKEMVGRGLFAAPQYGTLKELMNSLLRAEKLEMAKDAWNCITASKGCQLNVSAWT 550 Query: 1741 IWIHALFSNGHVTEACSYCLDMLDAGVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVRQM 1920 IWIHALFS GHV EACS+C+ M+D +MPQPDTFAKLMRGL+KLYNR+ AAEITEKVR+M Sbjct: 551 IWIHALFSKGHVKEACSFCIAMMDKDLMPQPDTFAKLMRGLKKLYNREFAAEITEKVRKM 610 Query: 1921 AADRNVTFKMYKRRGERDL 1977 AADR +TFKMYKRRGERDL Sbjct: 611 AADRKITFKMYKRRGERDL 629 >ref|XP_007047616.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao] gi|508699877|gb|EOX91773.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao] Length = 647 Score = 842 bits (2176), Expect = 0.0 Identities = 398/590 (67%), Positives = 482/590 (81%), Gaps = 2/590 (0%) Frame = +1 Query: 214 HRCYINNTEEKSGKSRRISTVATDRTQGSQFVCLENR-PNCET-HEQNADEFASDVEKLY 387 H + + + S ++ ++++ G V LE + P ++ ++Q D+FASDVEK+Y Sbjct: 28 HFHILPDNNNNNNNSNSLNLLSSNSKSGFGLVTLETKQPTLKSDNDQQTDDFASDVEKIY 87 Query: 388 RILKKFHSRVPKLELALQESGVVIRSGLVERVLNRCGDAGSLGYRFFAWASKQPGYRHSY 567 RIL+KFH+RVPKL LALQ+SGVV R GL ERVLNRCGDAG+LGY+FF WASKQPGY SY Sbjct: 88 RILRKFHTRVPKLNLALQQSGVVFRPGLTERVLNRCGDAGNLGYKFFTWASKQPGYHPSY 147 Query: 568 EVYKSMIKTLSKMRQFGAVWALIEEMRKENPQLLTADAFVVLMRRFASARMVKKAVEVLD 747 E+YK+MIK L KMRQFGAVWALIEE+++ENP +TA+ F++L+RRFAS+RMVKKA+EV D Sbjct: 148 EIYKAMIKILGKMRQFGAVWALIEEIKRENPHFITAELFILLIRRFASSRMVKKAIEVFD 207 Query: 748 EMPKYGCEPDEHVFGCLLDALCKNGSVKEAALLFEDMRFRFTPNLKHFTSLLYGWCKEGK 927 EMPKYGC D+ VFG LLDALCKNG+VKEAAL+FE+MR RF PNLKHFTSLLYGWCKEG+ Sbjct: 208 EMPKYGCLQDDAVFGSLLDALCKNGNVKEAALVFEEMRVRFLPNLKHFTSLLYGWCKEGR 267 Query: 928 LMEAKFVLVQMREAGFEPDIVVYNNLLSGYAMAGKMQDAFELLADMKNKGCDPNASSYTI 1107 ++EAK VLVQM+EAGFEPDIVV+NNLLSGY + KM DAF+LL +M+ KG DPNA+SYTI Sbjct: 268 ILEAKHVLVQMKEAGFEPDIVVFNNLLSGYVLGNKMGDAFDLLKEMRKKGIDPNANSYTI 327 Query: 1108 LIQSLCSREKMEDAMRMFVEMQRSGCVADVVTYTTLISGFCKRGKIDKGYELLDVMIQKG 1287 +IQ LC ++ME+AMR+FV+M+R+GC DVV YTTLISGFCK G+++KGYE+LD MI +G Sbjct: 328 VIQGLCKADRMEEAMRVFVDMERNGCRGDVVVYTTLISGFCKWGRVEKGYEVLDRMISEG 387 Query: 1288 CVPNQMTYFHILVAHXXXXXXXXXXXXXXXMQKIGCSPDLSTYNIVIRLACKLGEVGEGV 1467 +PN +TY HI++AH M+KIGC PD YN+V+RLACKL EV E Sbjct: 388 LMPNSLTYLHIMLAHEKKDELEECLELMEEMRKIGCVPDGGIYNVVVRLACKLEEVKEAA 447 Query: 1468 KAWNSMEASGLSPGLDTFVIMVHGFLGQGCLVEACKYFKEMVGRGLLSTPQYGILKELLN 1647 + WN ME G SPG+D F++M+HGF+GQGCLVEAC+YFKEM GRGL PQYGILK+LLN Sbjct: 448 RVWNEMEGRGFSPGVDNFIVMIHGFIGQGCLVEACEYFKEMAGRGLFCVPQYGILKDLLN 507 Query: 1648 SLLRDQKLEMAKDVWSCIVGTGCDLNVYAWTIWIHALFSNGHVTEACSYCLDMLDAGVMP 1827 SLLR +KLEMAK+VWSCIV GC+LNV AWTIW+HALFS GHV EACSYCL+M+D VMP Sbjct: 508 SLLRAEKLEMAKNVWSCIVSKGCELNVSAWTIWVHALFSKGHVKEACSYCLEMMDVDVMP 567 Query: 1828 QPDTFAKLMRGLRKLYNRQIAAEITEKVRQMAADRNVTFKMYKRRGERDL 1977 QPDTFAKLMRGLRKLYNRQIAAEITEKVR+MAADR +TFKMYKRRG+RDL Sbjct: 568 QPDTFAKLMRGLRKLYNRQIAAEITEKVRKMAADREITFKMYKRRGQRDL 617 >ref|XP_002530608.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223529856|gb|EEF31788.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 596 Score = 840 bits (2169), Expect = 0.0 Identities = 404/577 (70%), Positives = 480/577 (83%), Gaps = 8/577 (1%) Frame = +1 Query: 208 NPHRCYIN----NTEEKSGKSRRISTVATDRTQGSQFVCLENRPNCETHEQNA----DEF 363 N H C N +K + + ++ + G VCL+ + N + N+ DEF Sbjct: 13 NKHCCRFNLIHVQLYQKGQEPIDRNPLSNNLRNGFGVVCLKTQENNTSDRDNSSSKVDEF 72 Query: 364 ASDVEKLYRILKKFHSRVPKLELALQESGVVIRSGLVERVLNRCGDAGSLGYRFFAWASK 543 A DVEK+YRIL+ FHSRVPKLELALQESGV +R+GL ERVLNRCGDAG+LGYRFF WASK Sbjct: 73 AKDVEKVYRILRNFHSRVPKLELALQESGVTMRAGLTERVLNRCGDAGNLGYRFFVWASK 132 Query: 544 QPGYRHSYEVYKSMIKTLSKMRQFGAVWALIEEMRKENPQLLTADAFVVLMRRFASARMV 723 QPGYRHSYE YK+M+K SKMRQFGAVWAL+EEMRK+N L+T++ F+VL+RRFASAR+V Sbjct: 133 QPGYRHSYENYKAMVKIFSKMRQFGAVWALLEEMRKDNSVLITSELFIVLIRRFASARLV 192 Query: 724 KKAVEVLDEMPKYGCEPDEHVFGCLLDALCKNGSVKEAALLFEDMRFRFTPNLKHFTSLL 903 +KA+EVLDEMPKYGCEPDE+VFGCLLDALCKNGSVK+AA LFEDMR RF+P+L+HFTSLL Sbjct: 193 EKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKQAASLFEDMRVRFSPSLRHFTSLL 252 Query: 904 YGWCKEGKLMEAKFVLVQMREAGFEPDIVVYNNLLSGYAMAGKMQDAFELLADMKNKGCD 1083 YGWC+EGKL+EAK VLVQMREAGFEPDIVV+NNLLS Y+MAGKM DAF+LL +M KGC+ Sbjct: 253 YGWCREGKLIEAKHVLVQMREAGFEPDIVVFNNLLSAYSMAGKMTDAFDLLKEMVRKGCE 312 Query: 1084 PNASSYTILIQSLCSREKMEDAMRMFVEMQRSGCVADVVTYTTLISGFCKRGKIDKGYEL 1263 PNA+SYTI+IQ+ CS+EKM++AMR+FVEM+R+GC ADVVTYT LISGFCK GKI++GY++ Sbjct: 313 PNANSYTIMIQAFCSQEKMDEAMRVFVEMERTGCEADVVTYTALISGFCKWGKINRGYQI 372 Query: 1264 LDVMIQKGCVPNQMTYFHILVAHXXXXXXXXXXXXXXXMQKIGCSPDLSTYNIVIRLACK 1443 LD M QKG +PNQ+TY IL+AH M+ +GC PDLS YN+VIRLACK Sbjct: 373 LDAMKQKGHMPNQLTYLRILLAHEKKEELEECLELIESMRMVGCVPDLSIYNVVIRLACK 432 Query: 1444 LGEVGEGVKAWNSMEASGLSPGLDTFVIMVHGFLGQGCLVEACKYFKEMVGRGLLSTPQY 1623 LGEV +GV+ WN MEAS SP LDTFVIM+HGFLGQGCLVEAC+YFKEM+GRGLL+TPQY Sbjct: 433 LGEVKQGVQIWNEMEASDFSPELDTFVIMIHGFLGQGCLVEACEYFKEMIGRGLLTTPQY 492 Query: 1624 GILKELLNSLLRDQKLEMAKDVWSCIVGTGCDLNVYAWTIWIHALFSNGHVTEACSYCLD 1803 GILKELLN+LLR +KL MAKDVWSCIV GC+LN AWTIWIH+LFSNGHV EACSYCLD Sbjct: 493 GILKELLNALLRGEKLGMAKDVWSCIVTKGCELNADAWTIWIHSLFSNGHVKEACSYCLD 552 Query: 1804 MLDAGVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVR 1914 M++A +MP+P+TFAKLMRGLRKLYNR+ AAEITEK++ Sbjct: 553 MMEADIMPKPETFAKLMRGLRKLYNREFAAEITEKIK 589 Score = 65.5 bits (158), Expect = 1e-07 Identities = 61/337 (18%), Positives = 125/337 (37%), Gaps = 1/337 (0%) Frame = +1 Query: 913 CKEGKLMEAKFVLVQMREAGFEPDIVVYNNLLSGYAMAGKMQDAFELLADM-KNKGCDPN 1089 C + + +F + ++ G+ Y ++ ++ + + LL +M K+ Sbjct: 116 CGDAGNLGYRFFVWASKQPGYRHSYENYKAMVKIFSKMRQFGAVWALLEEMRKDNSVLIT 175 Query: 1090 ASSYTILIQSLCSREKMEDAMRMFVEMQRSGCVADVVTYTTLISGFCKRGKIDKGYELLD 1269 + + +LI+ S +E A+ + EM + GC D + L+ CK G + + L + Sbjct: 176 SELFIVLIRRFASARLVEKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKQAASLFE 235 Query: 1270 VMIQKGCVPNQMTYFHILVAHXXXXXXXXXXXXXXXMQKIGCSPDLSTYNIVIRLACKLG 1449 M ++ SP L + ++ C+ G Sbjct: 236 DM------------------------------------RVRFSPSLRHFTSLLYGWCREG 259 Query: 1450 EVGEGVKAWNSMEASGLSPGLDTFVIMVHGFLGQGCLVEACKYFKEMVGRGLLSTPQYGI 1629 ++ E M +G P + F ++ + G + +A KEMV +G P Sbjct: 260 KLIEAKHVLVQMREAGFEPDIVVFNNLLSAYSMAGKMTDAFDLLKEMVRKGC--EPNANS 317 Query: 1630 LKELLNSLLRDQKLEMAKDVWSCIVGTGCDLNVYAWTIWIHALFSNGHVTEACSYCLDML 1809 ++ + +K++ A V+ + TGC+ +V +T I G + M Sbjct: 318 YTIMIQAFCSQEKMDEAMRVFVEMERTGCEADVVTYTALISGFCKWGKINRGYQILDAMK 377 Query: 1810 DAGVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVRQM 1920 G MP T+ +++ K + E+ E +R + Sbjct: 378 QKGHMPNQLTYLRILLAHEKKEELEECLELIESMRMV 414 >gb|EYU29622.1| hypothetical protein MIMGU_mgv1a023801mg [Mimulus guttatus] Length = 601 Score = 816 bits (2107), Expect = 0.0 Identities = 384/541 (70%), Positives = 461/541 (85%) Frame = +1 Query: 355 DEFASDVEKLYRILKKFHSRVPKLELALQESGVVIRSGLVERVLNRCGDAGSLGYRFFAW 534 D F +DVEK+YRIL+KFHSRVPKLELALQ SGVV+RSGL ERVLNRCGDAG+LGYRFF W Sbjct: 31 DYFFADVEKVYRILRKFHSRVPKLELALQGSGVVVRSGLTERVLNRCGDAGNLGYRFFVW 90 Query: 535 ASKQPGYRHSYEVYKSMIKTLSKMRQFGAVWALIEEMRKENPQLLTADAFVVLMRRFASA 714 ASKQPGYRH+ +VYKSMIK L+KMRQFGAVWALIEEMRKE+P LL+ + FV+LMRRFASA Sbjct: 91 ASKQPGYRHNRDVYKSMIKILAKMRQFGAVWALIEEMRKESPHLLSPEVFVILMRRFASA 150 Query: 715 RMVKKAVEVLDEMPKYGCEPDEHVFGCLLDALCKNGSVKEAALLFEDMRFRFTPNLKHFT 894 RMVKKAVEVLDEMPKYGCEPDE+ FGCLLDALCKNGSVKEAALLFEDM+ RF P +KHFT Sbjct: 151 RMVKKAVEVLDEMPKYGCEPDEYAFGCLLDALCKNGSVKEAALLFEDMKIRFEPTIKHFT 210 Query: 895 SLLYGWCKEGKLMEAKFVLVQMREAGFEPDIVVYNNLLSGYAMAGKMQDAFELLADMKNK 1074 SLLYGWCKEGKL+EAK VLV+MREAGFEPD+VVYNNLL+GY++AGKM DA LL +M+ Sbjct: 211 SLLYGWCKEGKLIEAKVVLVKMREAGFEPDLVVYNNLLNGYSVAGKMADASHLLVEMRRN 270 Query: 1075 GCDPNASSYTILIQSLCSREKMEDAMRMFVEMQRSGCVADVVTYTTLISGFCKRGKIDKG 1254 G +PNA+SYTI+IQ+LC REKME+A R+F EM+++GC ADVVTYTTLISGFCK GKI K Sbjct: 271 GVEPNATSYTIMIQALCGREKMEEATRVFSEMEKNGCEADVVTYTTLISGFCKWGKIKKA 330 Query: 1255 YELLDVMIQKGCVPNQMTYFHILVAHXXXXXXXXXXXXXXXMQKIGCSPDLSTYNIVIRL 1434 +ELL+ MI+KG +PN TY + ++AH M+KI SPDL YN ++RL Sbjct: 331 HELLEAMIRKGHIPNATTYLYFMLAHEKKEELEECLELVNEMKKIRVSPDLFIYNTILRL 390 Query: 1435 ACKLGEVGEGVKAWNSMEASGLSPGLDTFVIMVHGFLGQGCLVEACKYFKEMVGRGLLST 1614 +CKLGE+ G++ N +E +G++PG+DT++I++ G + Q LVEAC YF+EMV RGL S Sbjct: 391 SCKLGEIESGIRIMNELEENGITPGVDTYIILIGGLVEQARLVEACDYFQEMVERGLFSA 450 Query: 1615 PQYGILKELLNSLLRDQKLEMAKDVWSCIVGTGCDLNVYAWTIWIHALFSNGHVTEACSY 1794 PQYG++K+LLNSLLRD KL++AKD W CI+ GC++NV AWTIWIHALFSNGHV +ACSY Sbjct: 451 PQYGVMKDLLNSLLRDDKLQLAKDAWGCIIEKGCEVNVSAWTIWIHALFSNGHVKDACSY 510 Query: 1795 CLDMLDAGVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVRQMAADRNVTFKMYKRRGERD 1974 CLDM+++G MP+PDTF+KLM+GL+KLYNR+IA EITEKVR+MA +RN+TFKMYKRRGERD Sbjct: 511 CLDMMESGEMPKPDTFSKLMKGLKKLYNREIAVEITEKVRKMAEERNITFKMYKRRGERD 570 Query: 1975 L 1977 L Sbjct: 571 L 571 >ref|XP_006404107.1| hypothetical protein EUTSA_v10010190mg [Eutrema salsugineum] gi|557105226|gb|ESQ45560.1| hypothetical protein EUTSA_v10010190mg [Eutrema salsugineum] Length = 645 Score = 805 bits (2080), Expect = 0.0 Identities = 392/570 (68%), Positives = 464/570 (81%), Gaps = 3/570 (0%) Frame = +1 Query: 277 ATDRTQGSQFVCLENRPNCETHEQNADEFASDVEKLYRILKKFHSRVPKLELALQESGVV 456 + +R G+ VC E R Q DEFA DVEK+YRIL+ +HSRVPKLEL L ESG+ Sbjct: 48 SAERINGAGLVCPEKR-------QQEDEFAGDVEKIYRILRNYHSRVPKLELVLHESGIN 100 Query: 457 IRSGLVERVLNRCGDAGSLGYRFFAWASKQPGYRHSYEVYKSMIKTLSKMRQFGAVWALI 636 +R GL+ RVL+RCGDAG+LGYRFF WA+KQPGY HSYEV KSM+K LSKMRQFGAVWALI Sbjct: 101 LRPGLIVRVLSRCGDAGNLGYRFFLWAAKQPGYCHSYEVCKSMVKILSKMRQFGAVWALI 160 Query: 637 EEMRKENPQLLTADAFVVLMRRFASARMVKKAVEVLDEMPKYGCEPDEHVFGCLLDALCK 816 EEMRKENPQL+ + FVVLMRRFASA MVKKAVEVLDEMPKYG EPDE++FGCLLDALCK Sbjct: 161 EEMRKENPQLIEPELFVVLMRRFASANMVKKAVEVLDEMPKYGIEPDEYIFGCLLDALCK 220 Query: 817 NGSVKEAALLFEDMRFRFTPNLKHFTSLLYGWCKEGKLMEAKFVLVQMREAGFEPDIVVY 996 NGSVK+A+ LFEDMR +F PNL++FTSLLYGWC+EGKL+EAK VLVQM+EAG EPDIVV+ Sbjct: 221 NGSVKDASKLFEDMRDKFPPNLRYFTSLLYGWCREGKLIEAKHVLVQMKEAGLEPDIVVF 280 Query: 997 NNLLSGYAMAGKMQDAFELLADMKNKGCDPNASSYTILIQSLCSREK-MEDAMRMFVEMQ 1173 NLLSGYA AGKM DA++L+ DM+ +G +PNA+ YT+LIQ+LC EK M++AMR+FVEM+ Sbjct: 281 TNLLSGYAHAGKMADAYDLMKDMRRRGYEPNANCYTVLIQALCKMEKRMDEAMRVFVEME 340 Query: 1174 RSGCVADVVTYTTLISGFCKRGKIDKGYELLDVMIQKGCVPNQMTYFHILVAHXXXXXXX 1353 R GC AD+VTYT LISGFCK G IDKGY +LD M +KG +P Q+TY I+VAH Sbjct: 341 RYGCEADIVTYTALISGFCKWGMIDKGYSVLDDMRKKGVMPLQVTYMQIMVAHEKKEQFE 400 Query: 1354 XXXXXXXXMQKIGCSPDLSTYNIVIRLACKLGEVGEGVKAWNSMEASGLSPGLDTFVIMV 1533 M++ GC PDL YN+VIRLACKLGEV E V+ WN MEA+GLSPG+DTFVIM+ Sbjct: 401 ECLDLIEKMKQNGCLPDLLIYNVVIRLACKLGEVKEAVRLWNEMEANGLSPGVDTFVIMI 460 Query: 1534 HGFLGQGCLVEACKYFKEMVGRGLLSTPQYGILKELLNSLLRDQKLEMAKDVWSCI--VG 1707 +GF QGCL+EAC +FKEMV RG+ S P YG LK LLN+L+RD KLEMAKDVWSC+ Sbjct: 461 NGFASQGCLIEACDHFKEMVSRGIFSAPHYGTLKILLNTLVRDDKLEMAKDVWSCLSNKS 520 Query: 1708 TGCDLNVYAWTIWIHALFSNGHVTEACSYCLDMLDAGVMPQPDTFAKLMRGLRKLYNRQI 1887 + C+LNV AWTIWIHALF+ GHV EACSYCLDM++ +MPQPDT+AKLM+GL KLYNR I Sbjct: 521 SSCELNVSAWTIWIHALFARGHVKEACSYCLDMMEMDLMPQPDTYAKLMKGLNKLYNRTI 580 Query: 1888 AAEITEKVRQMAADRNVTFKMYKRRGERDL 1977 AAEITEKVR+MA++R ++FKMYKRRGE DL Sbjct: 581 AAEITEKVRKMASEREMSFKMYKRRGEEDL 610 >ref|XP_006393982.1| hypothetical protein EUTSA_v10003830mg [Eutrema salsugineum] gi|557090621|gb|ESQ31268.1| hypothetical protein EUTSA_v10003830mg [Eutrema salsugineum] Length = 620 Score = 797 bits (2059), Expect = 0.0 Identities = 385/565 (68%), Positives = 458/565 (81%), Gaps = 1/565 (0%) Frame = +1 Query: 286 RTQGSQFVCLENRPNCETHEQNADEFASDVEKLYRILKKFHSRVPKLELALQESGVVIRS 465 R+ G+ VCL+ T N DEFASDVEK YRIL+KFHSRVPKLELAL ESGV +R Sbjct: 47 RSNGTGLVCLDKSHKERTKNSNHDEFASDVEKAYRILRKFHSRVPKLELALNESGVELRP 106 Query: 466 GLVERVLNRCGDAGSLGYRFFAWASKQPGYRHSYEVYKSMIKTLSKMRQFGAVWALIEEM 645 GL+ERVLNRCGDAG+LGYRFF WA+KQPGY HSY+VYKSM+K LSKMR F AVWALIEEM Sbjct: 107 GLIERVLNRCGDAGNLGYRFFVWAAKQPGYCHSYQVYKSMVKILSKMRHFEAVWALIEEM 166 Query: 646 RKENPQLLTADAFVVLMRRFASARMVKKAVEVLDEMPKYGCEPDEHVFGCLLDALCKNGS 825 RKENPQL+ + FVVL+RRFAS+ MVKKA+EVLDEMPK+G EPDE+VFGCLLDALCKNGS Sbjct: 167 RKENPQLIEPELFVVLVRRFASSNMVKKAIEVLDEMPKFGLEPDEYVFGCLLDALCKNGS 226 Query: 826 VKEAALLFEDMRFRFTPNLKHFTSLLYGWCKEGKLMEAKFVLVQMREAGFEPDIVVYNNL 1005 VK+AA LFE+MR RF PNL++FTSLLYGWC+EGK+MEA+ VLV+M+EA FEPD+VVY NL Sbjct: 227 VKDAAKLFEEMRLRFPPNLRYFTSLLYGWCREGKMMEAEHVLVEMKEARFEPDVVVYTNL 286 Query: 1006 LSGYAMAGKMQDAFELLADMKNKGCDPNASSYTILIQSLCSREKMEDAMRMFVEMQRSGC 1185 LSGYA AGKM +A++LL DM+ +G +PNA+ YT+LIQ+LC ++ME+AMR+FVEM+R C Sbjct: 287 LSGYAHAGKMAEAYDLLKDMRRRGFEPNANCYTVLIQALCKVDRMEEAMRVFVEMERYEC 346 Query: 1186 VADVVTYTTLISGFCKRGKIDKGYELLDVMIQKGCVPNQMTYFHILVAHXXXXXXXXXXX 1365 AD+VTY L+SGFCK GKIDK Y +LD MI+K +P+Q+TY HI+ AH Sbjct: 347 EADIVTYNALVSGFCKWGKIDKCYSVLDDMIKKCLMPSQLTYMHIMAAHEKKEKFEECLE 406 Query: 1366 XXXXMQKIGCSPDLSTYNIVIRLACKLGEVGEGVKAWNSMEASGLSPGLDTFVIMVHGFL 1545 M++IG DL YN+VIRLACKLGEV E V+ WN MEASGLSPG+DTFVIM+ G Sbjct: 407 LMEKMKEIGYHLDLGVYNVVIRLACKLGEVKEAVRLWNEMEASGLSPGVDTFVIMIDGLT 466 Query: 1546 GQGCLVEACKYFKEMVGRGLLSTPQYGILKELLNSLLRDQKLEMAKDVWSCIVGTG-CDL 1722 QGCL+EAC +FK MV RGL S PQYG LK LLN+LLRD KLE AKD+WSCI+ G C+L Sbjct: 467 NQGCLLEACDHFKVMVSRGLFSVPQYGTLKSLLNALLRDGKLETAKDIWSCIMSEGSCEL 526 Query: 1723 NVYAWTIWIHALFSNGHVTEACSYCLDMLDAGVMPQPDTFAKLMRGLRKLYNRQIAAEIT 1902 NV +WTIWIHALFS G+V +ACSYCL+M++ M QPDTFAKLM+GL+KLYNR+ A EIT Sbjct: 527 NVSSWTIWIHALFSKGYVKDACSYCLEMMEMDFMLQPDTFAKLMKGLKKLYNREFAVEIT 586 Query: 1903 EKVRQMAADRNVTFKMYKRRGERDL 1977 EKVR MAA+R ++FKMYKRRG DL Sbjct: 587 EKVRNMAAERELSFKMYKRRGVEDL 611 >ref|NP_201383.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75170571|sp|Q9FH87.1|PP447_ARATH RecName: Full=Putative pentatricopeptide repeat-containing protein At5g65820 gi|9758569|dbj|BAB09050.1| unnamed protein product [Arabidopsis thaliana] gi|332010728|gb|AED98111.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 637 Score = 792 bits (2046), Expect = 0.0 Identities = 386/565 (68%), Positives = 458/565 (81%), Gaps = 1/565 (0%) Frame = +1 Query: 286 RTQGSQFVCLENRPNCETHEQNADEFASDVEKLYRILKKFHSRVPKLELALQESGVVIRS 465 R+ G VCLE N T DEFASDVEK YRIL+KFHSRVPKLELAL ESGV +R Sbjct: 54 RSNGIGLVCLEKSHNDRTKNSKYDEFASDVEKSYRILRKFHSRVPKLELALNESGVELRP 113 Query: 466 GLVERVLNRCGDAGSLGYRFFAWASKQPGYRHSYEVYKSMIKTLSKMRQFGAVWALIEEM 645 GL+ERVLNRCGDAG+LGYRFF WA+KQP Y HS EVYKSM+K LSKMRQFGAVW LIEEM Sbjct: 114 GLIERVLNRCGDAGNLGYRFFVWAAKQPRYCHSIEVYKSMVKILSKMRQFGAVWGLIEEM 173 Query: 646 RKENPQLLTADAFVVLMRRFASARMVKKAVEVLDEMPKYGCEPDEHVFGCLLDALCKNGS 825 RKENPQL+ + FVVL++RFASA MVKKA+EVLDEMPK+G EPDE+VFGCLLDALCK+GS Sbjct: 174 RKENPQLIEPELFVVLVQRFASADMVKKAIEVLDEMPKFGFEPDEYVFGCLLDALCKHGS 233 Query: 826 VKEAALLFEDMRFRFTPNLKHFTSLLYGWCKEGKLMEAKFVLVQMREAGFEPDIVVYNNL 1005 VK+AA LFEDMR RF NL++FTSLLYGWC+ GK+MEAK+VLVQM EAGFEPDIV Y NL Sbjct: 234 VKDAAKLFEDMRMRFPVNLRYFTSLLYGWCRVGKMMEAKYVLVQMNEAGFEPDIVDYTNL 293 Query: 1006 LSGYAMAGKMQDAFELLADMKNKGCDPNASSYTILIQSLCSREKMEDAMRMFVEMQRSGC 1185 LSGYA AGKM DA++LL DM+ +G +PNA+ YT+LIQ+LC ++ME+AM++FVEM+R C Sbjct: 294 LSGYANAGKMADAYDLLRDMRRRGFEPNANCYTVLIQALCKVDRMEEAMKVFVEMERYEC 353 Query: 1186 VADVVTYTTLISGFCKRGKIDKGYELLDVMIQKGCVPNQMTYFHILVAHXXXXXXXXXXX 1365 ADVVTYT L+SGFCK GKIDK Y +LD MI+KG +P+++TY HI+VAH Sbjct: 354 EADVVTYTALVSGFCKWGKIDKCYIVLDDMIKKGLMPSELTYMHIMVAHEKKESFEECLE 413 Query: 1366 XXXXMQKIGCSPDLSTYNIVIRLACKLGEVGEGVKAWNSMEASGLSPGLDTFVIMVHGFL 1545 M++I PD+ YN+VIRLACKLGEV E V+ WN ME +GLSPG+DTFVIM++G Sbjct: 414 LMEKMRQIEYHPDIGIYNVVIRLACKLGEVKEAVRLWNEMEENGLSPGVDTFVIMINGLA 473 Query: 1546 GQGCLVEACKYFKEMVGRGLLSTPQYGILKELLNSLLRDQKLEMAKDVWSCIVGTG-CDL 1722 QGCL+EA +FKEMV RGL S QYG LK LLN++L+D+KLEMAKDVWSCI G C+L Sbjct: 474 SQGCLLEASDHFKEMVTRGLFSVSQYGTLKLLLNTVLKDKKLEMAKDVWSCITSKGACEL 533 Query: 1723 NVYAWTIWIHALFSNGHVTEACSYCLDMLDAGVMPQPDTFAKLMRGLRKLYNRQIAAEIT 1902 NV +WTIWIHALFS G+ EACSYC++M++ MPQPDTFAKLM+GL+KLYNR+ A EIT Sbjct: 534 NVLSWTIWIHALFSKGYEKEACSYCIEMIEMDFMPQPDTFAKLMKGLKKLYNREFAGEIT 593 Query: 1903 EKVRQMAADRNVTFKMYKRRGERDL 1977 EKVR MAA+R ++FKMYKRRG +DL Sbjct: 594 EKVRNMAAEREMSFKMYKRRGVQDL 618 >ref|XP_002866691.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297312526|gb|EFH42950.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 638 Score = 790 bits (2039), Expect = 0.0 Identities = 385/567 (67%), Positives = 460/567 (81%), Gaps = 3/567 (0%) Frame = +1 Query: 286 RTQGSQFVCLENRPNCETHEQNA--DEFASDVEKLYRILKKFHSRVPKLELALQESGVVI 459 R+ G VCLE N +N+ DEFASDVEK YRIL+KFHSRVPKLELAL ESGV + Sbjct: 53 RSNGIGLVCLEKNHNHNDRTKNSKYDEFASDVEKAYRILRKFHSRVPKLELALNESGVEL 112 Query: 460 RSGLVERVLNRCGDAGSLGYRFFAWASKQPGYRHSYEVYKSMIKTLSKMRQFGAVWALIE 639 R GL+ERVLNRCGDAG+LGYRFF WA+KQP Y HS EVYKSM+K LSKMRQFGAVW LIE Sbjct: 113 RPGLIERVLNRCGDAGNLGYRFFVWAAKQPRYCHSIEVYKSMVKILSKMRQFGAVWGLIE 172 Query: 640 EMRKENPQLLTADAFVVLMRRFASARMVKKAVEVLDEMPKYGCEPDEHVFGCLLDALCKN 819 EMRKENPQL+ + FVVL++RFASA MVKKA+EVLDEMP +G EPDE+VFGCLLDALCK+ Sbjct: 173 EMRKENPQLIEPELFVVLVQRFASADMVKKAIEVLDEMPTFGLEPDEYVFGCLLDALCKH 232 Query: 820 GSVKEAALLFEDMRFRFTPNLKHFTSLLYGWCKEGKLMEAKFVLVQMREAGFEPDIVVYN 999 GSVK+AA LFEDMR RF NL++FTSLLYGWC+E K+MEAK+VLVQM+EAGFEPDIV Y Sbjct: 233 GSVKDAAKLFEDMRLRFPVNLRYFTSLLYGWCREEKMMEAKYVLVQMKEAGFEPDIVDYT 292 Query: 1000 NLLSGYAMAGKMQDAFELLADMKNKGCDPNASSYTILIQSLCSREKMEDAMRMFVEMQRS 1179 NLLSGYA AGKM DA++LL DM+ +G +PNA+ YT+LIQ+LC ++ME+AM++FVEM+R Sbjct: 293 NLLSGYANAGKMADAYDLLKDMRRRGFEPNATCYTVLIQALCKVDRMEEAMKVFVEMERY 352 Query: 1180 GCVADVVTYTTLISGFCKRGKIDKGYELLDVMIQKGCVPNQMTYFHILVAHXXXXXXXXX 1359 C ADVVTYT L+SGFCK GKIDK Y +LD MI+KG +P+Q+TY HI+ AH Sbjct: 353 ECEADVVTYTALVSGFCKWGKIDKCYLVLDDMIKKGLMPSQLTYMHIMAAHEKKEKLIEC 412 Query: 1360 XXXXXXMQKIGCSPDLSTYNIVIRLACKLGEVGEGVKAWNSMEASGLSPGLDTFVIMVHG 1539 M++I PD+ YN+VIRLACKLGEV E V+ WN ME +GLSPG DTFVI+++G Sbjct: 413 LELMEKMKQIEYHPDIGIYNVVIRLACKLGEVKEAVRLWNEMEGNGLSPGADTFVIIING 472 Query: 1540 FLGQGCLVEACKYFKEMVGRGLLSTPQYGILKELLNSLLRDQKLEMAKDVWSCIVGTG-C 1716 QGCL+EAC +FKEMV RGL S PQYG LK LLN+LL+D+KLEMAKDVWSCI G C Sbjct: 473 LTSQGCLLEACDHFKEMVARGLFSVPQYGTLKLLLNTLLKDKKLEMAKDVWSCITSKGSC 532 Query: 1717 DLNVYAWTIWIHALFSNGHVTEACSYCLDMLDAGVMPQPDTFAKLMRGLRKLYNRQIAAE 1896 +L+V +WTIWIHALFS G+ EACSYCL+M++ MPQPDTFAKLM+GL+KLY+R+ A E Sbjct: 533 ELSVSSWTIWIHALFSKGYEKEACSYCLEMIELEFMPQPDTFAKLMKGLKKLYHREFAVE 592 Query: 1897 ITEKVRQMAADRNVTFKMYKRRGERDL 1977 ITEKVR MAA++ ++FKMYKRRG +DL Sbjct: 593 ITEKVRNMAAEKEMSFKMYKRRGVQDL 619 >ref|XP_006292382.1| hypothetical protein CARUB_v10018595mg [Capsella rubella] gi|482561089|gb|EOA25280.1| hypothetical protein CARUB_v10018595mg [Capsella rubella] Length = 639 Score = 786 bits (2030), Expect = 0.0 Identities = 378/544 (69%), Positives = 451/544 (82%), Gaps = 3/544 (0%) Frame = +1 Query: 355 DEFASDVEKLYRILKKFHSRVPKLELALQESGVVIRSGLVERVLNRCGDAGSLGYRFFAW 534 DEFA DV+K+YRIL+ +HSRVPKLELAL ES + +R GL+ RVL+RCGDAG+LGYRFF W Sbjct: 62 DEFAGDVDKIYRILRNYHSRVPKLELALNESSIDLRPGLIVRVLSRCGDAGNLGYRFFLW 121 Query: 535 ASKQPGYRHSYEVYKSMIKTLSKMRQFGAVWALIEEMRKENPQLLTADAFVVLMRRFASA 714 A+KQPGY HSYEV KSM+K LSKMRQFGAVW LIEEMRKENP+L+ + FV+LMRRFASA Sbjct: 122 AAKQPGYCHSYEVCKSMVKVLSKMRQFGAVWGLIEEMRKENPELIEPELFVILMRRFASA 181 Query: 715 RMVKKAVEVLDEMPKYGCEPDEHVFGCLLDALCKNGSVKEAALLFEDMRFRFTPNLKHFT 894 MVKKAVEVLDEMPKYG EPDE+VFGCLLDALCKNGSVK+A+ LFEDM+ ++ PNL++FT Sbjct: 182 NMVKKAVEVLDEMPKYGLEPDEYVFGCLLDALCKNGSVKDASKLFEDMKEKYPPNLRYFT 241 Query: 895 SLLYGWCKEGKLMEAKFVLVQMREAGFEPDIVVYNNLLSGYAMAGKMQDAFELLADMKNK 1074 SLLYGWC+EGKLMEAK VLVQM+EAG EPDIVV+ NLLSGYA AGKM DA++L+ DM+ + Sbjct: 242 SLLYGWCREGKLMEAKEVLVQMKEAGLEPDIVVFTNLLSGYAHAGKMADAYDLMKDMRKR 301 Query: 1075 GCDPNASSYTILIQSLCSREK-MEDAMRMFVEMQRSGCVADVVTYTTLISGFCKRGKIDK 1251 G +PNA+ YT+LIQ+LC EK M++AMR+FVEM+R GC AD+VTYT LISGFCK IDK Sbjct: 302 GYEPNANCYTVLIQALCKTEKRMDEAMRVFVEMERYGCEADIVTYTALISGFCKWEMIDK 361 Query: 1252 GYELLDVMIQKGCVPNQMTYFHILVAHXXXXXXXXXXXXXXXMQKIGCSPDLSTYNIVIR 1431 GY +LD M +KG +P+Q+TY I+VAH M++IGC DL YN+VIR Sbjct: 362 GYSVLDDMRKKGVIPSQVTYMQIMVAHEKKEQFEECLDLIEKMKQIGCQLDLLIYNVVIR 421 Query: 1432 LACKLGEVGEGVKAWNSMEASGLSPGLDTFVIMVHGFLGQGCLVEACKYFKEMVGRGLLS 1611 LACKLGEV E V+ WN MEA+GLSPG+DTFVIM++GF QGCLVEAC +FKEMV RG+ S Sbjct: 422 LACKLGEVKEAVRLWNEMEANGLSPGVDTFVIMINGFTSQGCLVEACNHFKEMVSRGIFS 481 Query: 1612 TPQYGILKELLNSLLRDQKLEMAKDVWSCI--VGTGCDLNVYAWTIWIHALFSNGHVTEA 1785 PQYG LK LLN+L+RD+KLEMAKDVWSCI + C+LNV AWTIWIHAL + GHV EA Sbjct: 482 APQYGTLKLLLNNLVRDEKLEMAKDVWSCISNKSSSCELNVSAWTIWIHALLAKGHVKEA 541 Query: 1786 CSYCLDMLDAGVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVRQMAADRNVTFKMYKRRG 1965 CSYCLDM+ +MPQPDT+ KLM+GL KLYNR IAAEITEKV +MA++R ++FKMYK++G Sbjct: 542 CSYCLDMMKMDLMPQPDTYVKLMKGLNKLYNRTIAAEITEKVMKMASEREMSFKMYKKKG 601 Query: 1966 ERDL 1977 E DL Sbjct: 602 EEDL 605 >ref|NP_190542.4| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|218546755|sp|P0C8A0.1|PP275_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At3g49730 gi|332645062|gb|AEE78583.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 638 Score = 786 bits (2030), Expect = 0.0 Identities = 386/570 (67%), Positives = 460/570 (80%), Gaps = 3/570 (0%) Frame = +1 Query: 277 ATDRTQGSQFVCLENRPNCETHEQNADEFASDVEKLYRILKKFHSRVPKLELALQESGVV 456 +T+R G VC E HE DEFA +VEK+YRIL+ HSRVPKLELAL ESG+ Sbjct: 44 STERKNGVGLVCPEK------HE---DEFAGEVEKIYRILRNHHSRVPKLELALNESGID 94 Query: 457 IRSGLVERVLNRCGDAGSLGYRFFAWASKQPGYRHSYEVYKSMIKTLSKMRQFGAVWALI 636 +R GL+ RVL+RCGDAG+LGYRFF WA+KQPGY HSYEV KSM+ LSKMRQFGAVW LI Sbjct: 95 LRPGLIIRVLSRCGDAGNLGYRFFLWATKQPGYFHSYEVCKSMVMILSKMRQFGAVWGLI 154 Query: 637 EEMRKENPQLLTADAFVVLMRRFASARMVKKAVEVLDEMPKYGCEPDEHVFGCLLDALCK 816 EEMRK NP+L+ + FVVLMRRFASA MVKKAVEVLDEMPKYG EPDE+VFGCLLDALCK Sbjct: 155 EEMRKTNPELIEPELFVVLMRRFASANMVKKAVEVLDEMPKYGLEPDEYVFGCLLDALCK 214 Query: 817 NGSVKEAALLFEDMRFRFTPNLKHFTSLLYGWCKEGKLMEAKFVLVQMREAGFEPDIVVY 996 NGSVKEA+ +FEDMR +F PNL++FTSLLYGWC+EGKLMEAK VLVQM+EAG EPDIVV+ Sbjct: 215 NGSVKEASKVFEDMREKFPPNLRYFTSLLYGWCREGKLMEAKEVLVQMKEAGLEPDIVVF 274 Query: 997 NNLLSGYAMAGKMQDAFELLADMKNKGCDPNASSYTILIQSLCSREK-MEDAMRMFVEMQ 1173 NLLSGYA AGKM DA++L+ DM+ +G +PN + YT+LIQ+LC EK M++AMR+FVEM+ Sbjct: 275 TNLLSGYAHAGKMADAYDLMNDMRKRGFEPNVNCYTVLIQALCRTEKRMDEAMRVFVEME 334 Query: 1174 RSGCVADVVTYTTLISGFCKRGKIDKGYELLDVMIQKGCVPNQMTYFHILVAHXXXXXXX 1353 R GC AD+VTYT LISGFCK G IDKGY +LD M +KG +P+Q+TY I+VAH Sbjct: 335 RYGCEADIVTYTALISGFCKWGMIDKGYSVLDDMRKKGVMPSQVTYMQIMVAHEKKEQFE 394 Query: 1354 XXXXXXXXMQKIGCSPDLSTYNIVIRLACKLGEVGEGVKAWNSMEASGLSPGLDTFVIMV 1533 M++ GC PDL YN+VIRLACKLGEV E V+ WN MEA+GLSPG+DTFVIM+ Sbjct: 395 ECLELIEKMKRRGCHPDLLIYNVVIRLACKLGEVKEAVRLWNEMEANGLSPGVDTFVIMI 454 Query: 1534 HGFLGQGCLVEACKYFKEMVGRGLLSTPQYGILKELLNSLLRDQKLEMAKDVWSCIVG-- 1707 +GF QG L+EAC +FKEMV RG+ S PQYG LK LLN+L+RD KLEMAKDVWSCI Sbjct: 455 NGFTSQGFLIEACNHFKEMVSRGIFSAPQYGTLKSLLNNLVRDDKLEMAKDVWSCISNKT 514 Query: 1708 TGCDLNVYAWTIWIHALFSNGHVTEACSYCLDMLDAGVMPQPDTFAKLMRGLRKLYNRQI 1887 + C+LNV AWTIWIHAL++ GHV EACSYCLDM++ +MPQP+T+AKLM+GL KLYNR I Sbjct: 515 SSCELNVSAWTIWIHALYAKGHVKEACSYCLDMMEMDLMPQPNTYAKLMKGLNKLYNRTI 574 Query: 1888 AAEITEKVRQMAADRNVTFKMYKRRGERDL 1977 AAEITEKV +MA++R ++FKMYK++GE DL Sbjct: 575 AAEITEKVVKMASEREMSFKMYKKKGEEDL 604