BLASTX nr result
ID: Catharanthus22_contig00005067
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00005067 (3639 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006363176.1| PREDICTED: pentatricopeptide repeat-containi... 927 0.0 ref|XP_004232626.1| PREDICTED: pentatricopeptide repeat-containi... 924 0.0 ref|XP_002275605.1| PREDICTED: pentatricopeptide repeat-containi... 909 0.0 emb|CBI27232.3| unnamed protein product [Vitis vinifera] 901 0.0 ref|XP_006448599.1| hypothetical protein CICLE_v10014519mg [Citr... 879 0.0 gb|EMJ12567.1| hypothetical protein PRUPE_ppa002507mg [Prunus pe... 870 0.0 ref|XP_006468575.1| PREDICTED: pentatricopeptide repeat-containi... 869 0.0 ref|XP_002304600.2| hypothetical protein POPTR_0003s15360g [Popu... 867 0.0 gb|EOX96827.1| Pentatricopeptide repeat (PPR) superfamily protei... 862 0.0 ref|XP_002297917.1| hypothetical protein POPTR_0001s12190g [Popu... 862 0.0 ref|XP_004295517.1| PREDICTED: pentatricopeptide repeat-containi... 855 0.0 ref|XP_002528143.1| pentatricopeptide repeat-containing protein,... 849 0.0 ref|XP_002867892.1| EMB1025 [Arabidopsis lyrata subsp. lyrata] g... 818 0.0 ref|NP_193742.1| pentatricopeptide repeat-containing protein [Ar... 812 0.0 ref|XP_006283284.1| hypothetical protein CARUB_v10004320mg [Caps... 808 0.0 ref|XP_003534864.1| PREDICTED: pentatricopeptide repeat-containi... 802 0.0 gb|EXB83265.1| hypothetical protein L484_011559 [Morus notabilis] 797 0.0 gb|ESW10855.1| hypothetical protein PHAVU_009G243700g [Phaseolus... 793 0.0 ref|XP_006404148.1| hypothetical protein EUTSA_v10010168mg [Eutr... 790 0.0 ref|XP_003594857.1| Pentatricopeptide repeat-containing protein ... 781 0.0 >ref|XP_006363176.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20090-like isoform X1 [Solanum tuberosum] gi|565395083|ref|XP_006363177.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20090-like isoform X2 [Solanum tuberosum] Length = 717 Score = 927 bits (2396), Expect = 0.0 Identities = 447/635 (70%), Positives = 529/635 (83%), Gaps = 1/635 (0%) Frame = +3 Query: 345 NSCETEKDYEEDIIAIQSTNSHMLPKR-SHKVEVEPPITDRLFKHAPKSGSYKQGDSTFY 521 NSC E E+ ++ S + P S K EVE PI+D+LFK APK GS+K GDSTFY Sbjct: 86 NSCGAEV---EEPLSDNSFKVTLKPNLGSCKTEVEVPISDKLFKEAPKLGSFKLGDSTFY 142 Query: 522 SLIENYANSGDFKSLEMVFSRMWHERRAFVEKSFIVVFRAYGKAHLPEKAVELFDRMVYE 701 SLIE YANSGDF SLE VF RM E+R F+EKSFI+VFRAYGKA LPEKAVELF+RMV E Sbjct: 143 SLIEKYANSGDFTSLEKVFDRMKCEKRVFIEKSFILVFRAYGKARLPEKAVELFERMVDE 202 Query: 702 FQCKRTVKSFNSVLNVIIQEGLYSRALEFHSYVVNCKNIKPNVLTFNLVIKAMCKLQLVD 881 FQCKRTVKSFNSVLNVI+Q GLY AL+F++ VVN +NI PNVL+FNLVIK MCKL++VD Sbjct: 203 FQCKRTVKSFNSVLNVIVQTGLYRHALDFYADVVNNRNIMPNVLSFNLVIKTMCKLRMVD 262 Query: 882 RAVEVFREMPAMKCDADVFTYCTLMDGLCKEDRVEEAVALLDEMQIEGCFPNPATFNVLI 1061 RA+EVFREMP KC+ DV+TYCTLMDGLCK+DR++EAV LLDEMQ+EGC P P TFNVLI Sbjct: 263 RAMEVFREMPTWKCEPDVYTYCTLMDGLCKDDRIDEAVILLDEMQVEGCLPVPVTFNVLI 322 Query: 1062 NGLCKKGDLSRAAKVVDNMFLKGCVPNEVTYNTLIHGLCLQGKLEKAISLLDRMVSDKFI 1241 NGLC+KGDL+RAAK+VDNMFLKGCVPNEVTYNTLIHGLCL+GKLEKA+SL+DRMVS+K+I Sbjct: 323 NGLCRKGDLARAAKLVDNMFLKGCVPNEVTYNTLIHGLCLKGKLEKAVSLVDRMVSNKYI 382 Query: 1242 PNDVTYGTIIDGLVRKGRAVDGVHVMVSMEERGHQGNDHIYSSLVSGLFKEGRSEEALNL 1421 P D+TYGTII+G V++ RA DGV ++++M+E+GH N+++YS+LVSGLFKEG+ EEAL + Sbjct: 383 PTDITYGTIINGFVKQRRATDGVQILLAMQEKGHLANEYVYSALVSGLFKEGKPEEALKI 442 Query: 1422 WKKIMENGHKPNTVVYSALIDGLCRVGKPFEAKEILPEMVNKGCEPNAYTYSSLMKGFFK 1601 WK ++E G KPNTV YSA IDGLCR G+P EAKEIL EM GC PNAYTY SLMKG+FK Sbjct: 443 WKGMIEKGVKPNTVAYSAFIDGLCREGRPDEAKEILSEMNKMGCTPNAYTYCSLMKGYFK 502 Query: 1602 VGNSNMAVLLWKEMTEKGCVHNEFCYSILIHGLCDEGKLKEATMVWRHMLGKGWTPDVVA 1781 G+SN A+LLWK+M G NE CYS+L HGLC +GKLKEA MVW+HMLGKG PDVVA Sbjct: 503 TGDSNKAILLWKDMATSGITCNEICYSVLTHGLCQDGKLKEAMMVWKHMLGKGLVPDVVA 562 Query: 1782 YTSMIHGLCSVGSVEQGLLLFNEMLYKGSDSQPDVFTYNVLFNALCKHEKITPAIHLLNN 1961 Y+SMIHGLC+ GSV+QGL LFNEM +GSDSQPDV YN++ NALCK ++I+ AI LLN Sbjct: 563 YSSMIHGLCNAGSVDQGLRLFNEMQCRGSDSQPDVIAYNIIINALCKVDRISLAIDLLNT 622 Query: 1962 MLDRGCDPDIVTCNIFLTTFKEKMNPSQDGREFLDELFLRLHKRQRIDGASRIIEVMLQK 2141 MLDRGCDPD +TCNIFL T +K NPSQDG +FLD+L L+L++RQRI GASRIIEVMLQK Sbjct: 623 MLDRGCDPDTITCNIFLKTLNDKANPSQDGEDFLDKLVLQLYRRQRIVGASRIIEVMLQK 682 Query: 2142 FLYPKPSTWEIAIRELCKPRKIQVAINKCWNSLFV 2246 +YPK STWE+ IRELCKP+K+Q AINKCW+ LF+ Sbjct: 683 IIYPKSSTWEMIIRELCKPKKVQGAINKCWSDLFI 717 >ref|XP_004232626.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20090-like [Solanum lycopersicum] Length = 717 Score = 924 bits (2389), Expect = 0.0 Identities = 448/635 (70%), Positives = 530/635 (83%), Gaps = 1/635 (0%) Frame = +3 Query: 345 NSCETEKDYEEDIIAIQSTNSHMLPKR-SHKVEVEPPITDRLFKHAPKSGSYKQGDSTFY 521 NSC TE E+ ++ +S + P S + EVE PI+D+LFK APK GS+K GDSTFY Sbjct: 86 NSCVTEV---EEPLSDKSFKVTLKPNLGSCETEVEVPISDKLFKEAPKLGSFKLGDSTFY 142 Query: 522 SLIENYANSGDFKSLEMVFSRMWHERRAFVEKSFIVVFRAYGKAHLPEKAVELFDRMVYE 701 SLIE YANS DF SLE VF RM E+R F+EKSFI+VFRAYGKA LPEKAVELF+RMV E Sbjct: 143 SLIEKYANSEDFTSLEKVFGRMKCEKRVFIEKSFILVFRAYGKARLPEKAVELFERMVDE 202 Query: 702 FQCKRTVKSFNSVLNVIIQEGLYSRALEFHSYVVNCKNIKPNVLTFNLVIKAMCKLQLVD 881 FQCKRTVKSFNSVLNVI+Q GLY RAL+F++ VVN +NI PNVL+FNLVIK MCKL++VD Sbjct: 203 FQCKRTVKSFNSVLNVIVQTGLYHRALDFYADVVNNRNIMPNVLSFNLVIKTMCKLRMVD 262 Query: 882 RAVEVFREMPAMKCDADVFTYCTLMDGLCKEDRVEEAVALLDEMQIEGCFPNPATFNVLI 1061 RA+EVFREMP KC+ DV+TYCTLMDGLCK+DR++EAV LLDEMQ+EGC P P TFNVLI Sbjct: 263 RAMEVFREMPTWKCEPDVYTYCTLMDGLCKDDRIDEAVILLDEMQVEGCLPVPVTFNVLI 322 Query: 1062 NGLCKKGDLSRAAKVVDNMFLKGCVPNEVTYNTLIHGLCLQGKLEKAISLLDRMVSDKFI 1241 NGLC+KGDL+RAAK+VDNMFLKGCVPN+VTYNTLIHGLCL+GKLEKA+SLLDRMVS+K+I Sbjct: 323 NGLCRKGDLARAAKLVDNMFLKGCVPNDVTYNTLIHGLCLKGKLEKAVSLLDRMVSNKYI 382 Query: 1242 PNDVTYGTIIDGLVRKGRAVDGVHVMVSMEERGHQGNDHIYSSLVSGLFKEGRSEEALNL 1421 P D+TYGTII+G V++ RA DGV ++++M+E+GH N+++YS+LVSGLFKEG+ EEAL + Sbjct: 383 PTDITYGTIINGFVKQRRATDGVQILLAMQEKGHLANEYVYSALVSGLFKEGKPEEALKI 442 Query: 1422 WKKIMENGHKPNTVVYSALIDGLCRVGKPFEAKEILPEMVNKGCEPNAYTYSSLMKGFFK 1601 WK+++E G KPN V YSA IDGLCR GKP EAKEIL EM GC PNAYTY SLMKG+FK Sbjct: 443 WKEMIEKGVKPNIVAYSAFIDGLCREGKPDEAKEILSEMNKMGCTPNAYTYCSLMKGYFK 502 Query: 1602 VGNSNMAVLLWKEMTEKGCVHNEFCYSILIHGLCDEGKLKEATMVWRHMLGKGWTPDVVA 1781 +SN A+LLWK+M G NE CYS+LIHGLC +GKLKEA MVW+HMLGKG PD VA Sbjct: 503 TSDSNKAILLWKDMATSGITCNEICYSVLIHGLCQDGKLKEAMMVWKHMLGKGLVPDAVA 562 Query: 1782 YTSMIHGLCSVGSVEQGLLLFNEMLYKGSDSQPDVFTYNVLFNALCKHEKITPAIHLLNN 1961 Y+SMIHGLC+ GSV+QGL LFNEML +GSDSQPDV YN++ NALCK ++I+ AI LLN Sbjct: 563 YSSMIHGLCNAGSVDQGLRLFNEMLCRGSDSQPDVVAYNIIINALCKVDRISLAIDLLNT 622 Query: 1962 MLDRGCDPDIVTCNIFLTTFKEKMNPSQDGREFLDELFLRLHKRQRIDGASRIIEVMLQK 2141 MLDRGCDPD +TCNIFL T EK NPSQDG +FLD+L L+L++RQRI GASRIIEVMLQK Sbjct: 623 MLDRGCDPDKITCNIFLKTLNEKANPSQDGEDFLDKLVLQLYRRQRIIGASRIIEVMLQK 682 Query: 2142 FLYPKPSTWEIAIRELCKPRKIQVAINKCWNSLFV 2246 L PK STWE+ IRELCKP+K+Q AINKCW+ LF+ Sbjct: 683 ILSPKSSTWEMIIRELCKPKKVQGAINKCWSDLFI 717 >ref|XP_002275605.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20090-like [Vitis vinifera] Length = 644 Score = 909 bits (2350), Expect = 0.0 Identities = 438/603 (72%), Positives = 512/603 (84%), Gaps = 1/603 (0%) Frame = +3 Query: 438 EVEPPITDRLFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMVFSRMWHERRAFVEK 617 E + PI D++FK A + GSYK GDSTFYSLIENYANSGDF +L VF RM ERR F+EK Sbjct: 41 ESDAPIPDQIFKSASQMGSYKSGDSTFYSLIENYANSGDFGTLFQVFDRMKRERRVFIEK 100 Query: 618 SFIVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVIIQEGLYSRALEFHSY 797 +FI+VFRAYGKAHLPEKA+ELF RMV EFQC+RTV+SFNSVLNVIIQEGL+ RALEF+ Sbjct: 101 NFILVFRAYGKAHLPEKAIELFGRMVDEFQCRRTVRSFNSVLNVIIQEGLFHRALEFYEC 160 Query: 798 VVNCK-NIKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDADVFTYCTLMDGLCKE 974 V K NI PNVL+FNLVIKAMCKL LVDRA+EVFREM KC+ DVFTYCTLMDGLCKE Sbjct: 161 GVGGKTNISPNVLSFNLVIKAMCKLGLVDRAIEVFREMAIQKCEPDVFTYCTLMDGLCKE 220 Query: 975 DRVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVVDNMFLKGCVPNEVTY 1154 DR++EAV LLDEMQIEGCFP+ TFNVLINGLCKKGD+ R K+VDNMFLKGCVPNEVTY Sbjct: 221 DRIDEAVLLLDEMQIEGCFPSSVTFNVLINGLCKKGDMVRVTKLVDNMFLKGCVPNEVTY 280 Query: 1155 NTLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRKGRAVDGVHVMVSMEE 1334 NT+I+GLCL+GKL+KA+SLLDRMV+ K +PNDVTYGT+I+GLV++GR+VDGVH++ S+EE Sbjct: 281 NTIINGLCLKGKLDKAVSLLDRMVASKCVPNDVTYGTLINGLVKQGRSVDGVHLLSSLEE 340 Query: 1335 RGHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVYSALIDGLCRVGKPFE 1514 RGH N++ YS+L+SGLFKE +SEEA+ LWKK++E G +PN VVYSALIDGLCR GK E Sbjct: 341 RGHHANEYAYSTLISGLFKEEKSEEAMGLWKKMVEKGCQPNIVVYSALIDGLCREGKLDE 400 Query: 1515 AKEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTEKGCVHNEFCYSILIH 1694 AKEIL EMVNKGC PNA+TYSSL+KGFFK GNS A+ +WKEM + CV NE CYS+LIH Sbjct: 401 AKEILCEMVNKGCTPNAFTYSSLIKGFFKTGNSQKAIRVWKEMAKNNCVPNEICYSVLIH 460 Query: 1695 GLCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQGLLLFNEMLYKGSDS 1874 GLC++GKL+EA M+W HMLG+G PDVVAY+SMIHGLC+ GSVE GL LFNEML + SDS Sbjct: 461 GLCEDGKLREAMMMWTHMLGRGLRPDVVAYSSMIHGLCNAGSVEVGLKLFNEMLCQESDS 520 Query: 1875 QPDVFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDIVTCNIFLTTFKEKMNPSQDGR 2054 QPDV TYN+L ALCK I+ AI LLN+MLDRGC+PD++TCNIFL +EK+NP QDGR Sbjct: 521 QPDVVTYNILLRALCKQNSISHAIDLLNSMLDRGCNPDLITCNIFLNALREKLNPPQDGR 580 Query: 2055 EFLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIRELCKPRKIQVAINKCWN 2234 EFLDEL +RLHKRQRI GA++IIEVMLQKFL P STWE I ELCKP+K+Q I+KCW+ Sbjct: 581 EFLDELVVRLHKRQRIVGAAKIIEVMLQKFLPPNASTWERIIPELCKPKKVQAIIDKCWS 640 Query: 2235 SLF 2243 SLF Sbjct: 641 SLF 643 >emb|CBI27232.3| unnamed protein product [Vitis vinifera] Length = 660 Score = 901 bits (2329), Expect = 0.0 Identities = 434/594 (73%), Positives = 506/594 (85%), Gaps = 1/594 (0%) Frame = +3 Query: 465 LFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMVFSRMWHERRAFVEKSFIVVFRAY 644 +FK A + GSYK GDSTFYSLIENYANSGDF +L VF RM ERR F+EK+FI+VFRAY Sbjct: 66 IFKSASQMGSYKSGDSTFYSLIENYANSGDFGTLFQVFDRMKRERRVFIEKNFILVFRAY 125 Query: 645 GKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVIIQEGLYSRALEFHSYVVNCK-NIK 821 GKAHLPEKA+ELF RMV EFQC+RTV+SFNSVLNVIIQEGL+ RALEF+ V K NI Sbjct: 126 GKAHLPEKAIELFGRMVDEFQCRRTVRSFNSVLNVIIQEGLFHRALEFYECGVGGKTNIS 185 Query: 822 PNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDADVFTYCTLMDGLCKEDRVEEAVAL 1001 PNVL+FNLVIKAMCKL LVDRA+EVFREM KC+ DVFTYCTLMDGLCKEDR++EAV L Sbjct: 186 PNVLSFNLVIKAMCKLGLVDRAIEVFREMAIQKCEPDVFTYCTLMDGLCKEDRIDEAVLL 245 Query: 1002 LDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVVDNMFLKGCVPNEVTYNTLIHGLCL 1181 LDEMQIEGCFP+ TFNVLINGLCKKGD+ R K+VDNMFLKGCVPNEVTYNT+I+GLCL Sbjct: 246 LDEMQIEGCFPSSVTFNVLINGLCKKGDMVRVTKLVDNMFLKGCVPNEVTYNTIINGLCL 305 Query: 1182 QGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRKGRAVDGVHVMVSMEERGHQGNDHI 1361 +GKL+KA+SLLDRMV+ K +PNDVTYGT+I+GLV++GR+VDGVH++ S+EERGH N++ Sbjct: 306 KGKLDKAVSLLDRMVASKCVPNDVTYGTLINGLVKQGRSVDGVHLLSSLEERGHHANEYA 365 Query: 1362 YSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVYSALIDGLCRVGKPFEAKEILPEMV 1541 YS+L+SGLFKE +SEEA+ LWKK++E G +PN VVYSALIDGLCR GK EAKEIL EMV Sbjct: 366 YSTLISGLFKEEKSEEAMGLWKKMVEKGCQPNIVVYSALIDGLCREGKLDEAKEILCEMV 425 Query: 1542 NKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTEKGCVHNEFCYSILIHGLCDEGKLK 1721 NKGC PNA+TYSSL+KGFFK GNS A+ +WKEM + CV NE CYS+LIHGLC++GKL+ Sbjct: 426 NKGCTPNAFTYSSLIKGFFKTGNSQKAIRVWKEMAKNNCVPNEICYSVLIHGLCEDGKLR 485 Query: 1722 EATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQGLLLFNEMLYKGSDSQPDVFTYNV 1901 EA M+W HMLG+G PDVVAY+SMIHGLC+ GSVE GL LFNEML + SDSQPDV TYN+ Sbjct: 486 EAMMMWTHMLGRGLRPDVVAYSSMIHGLCNAGSVEVGLKLFNEMLCQESDSQPDVVTYNI 545 Query: 1902 LFNALCKHEKITPAIHLLNNMLDRGCDPDIVTCNIFLTTFKEKMNPSQDGREFLDELFLR 2081 L ALCK I+ AI LLN+MLDRGC+PD++TCNIFL +EK+NP QDGREFLDEL +R Sbjct: 546 LLRALCKQNSISHAIDLLNSMLDRGCNPDLITCNIFLNALREKLNPPQDGREFLDELVVR 605 Query: 2082 LHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIRELCKPRKIQVAINKCWNSLF 2243 LHKRQRI GA++IIEVMLQKFL P STWE I ELCKP+K+Q I+KCW+SLF Sbjct: 606 LHKRQRIVGAAKIIEVMLQKFLPPNASTWERIIPELCKPKKVQAIIDKCWSSLF 659 >ref|XP_006448599.1| hypothetical protein CICLE_v10014519mg [Citrus clementina] gi|557551210|gb|ESR61839.1| hypothetical protein CICLE_v10014519mg [Citrus clementina] Length = 664 Score = 879 bits (2270), Expect = 0.0 Identities = 418/618 (67%), Positives = 508/618 (82%), Gaps = 2/618 (0%) Frame = +3 Query: 396 STNSHMLPKRSHKVEVEPPITDRLFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMV 575 S+N HM + + E P +D +F PK GSY+ GDSTFYSLI++YANSGDFKSLEMV Sbjct: 46 SSNKHMETEPQGNAKSEQPFSDEVFNSTPKLGSYQLGDSTFYSLIQHYANSGDFKSLEMV 105 Query: 576 FSRMWHERRAFVEKSFIVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVII 755 RM E+R +EKSFI +F+AYGKAHL E+AV LF MV EFQCKRTVKSFNSVLNVII Sbjct: 106 LCRMRREKRVALEKSFIFIFKAYGKAHLVEEAVRLFHTMVDEFQCKRTVKSFNSVLNVII 165 Query: 756 QEGLYSRALEFHSYVVNCK--NIKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDA 929 QEGLY RALEF++++VN K NI PN LTFNLVIKA+C+L LVD A+E+FREMP C+ Sbjct: 166 QEGLYHRALEFYNHIVNAKHMNILPNTLTFNLVIKAVCRLGLVDNAIELFREMPVRNCEP 225 Query: 930 DVFTYCTLMDGLCKEDRVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVV 1109 D++TYCTLMDGLCKE+R++EAV LLDEMQ++GCFP P TFNVLINGLCK G L RAAK+V Sbjct: 226 DIYTYCTLMDGLCKENRLDEAVLLLDEMQVDGCFPTPVTFNVLINGLCKNGGLGRAAKLV 285 Query: 1110 DNMFLKGCVPNEVTYNTLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRK 1289 DNMFLKGC+PNEVTYNTLIHGLCL+G L+KA+SLLDRMV+ K +PN+VTYGTII+GLV+ Sbjct: 286 DNMFLKGCLPNEVTYNTLIHGLCLKGDLDKAVSLLDRMVASKCMPNEVTYGTIINGLVKL 345 Query: 1290 GRAVDGVHVMVSMEERGHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVY 1469 GRAVDG V++SMEER N++IYS+L+SGLFKEG++E+A+ LWK++ME G KPNTVVY Sbjct: 346 GRAVDGARVLMSMEERKFHVNEYIYSTLISGLFKEGKAEDAMKLWKQMMEKGCKPNTVVY 405 Query: 1470 SALIDGLCRVGKPFEAKEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTE 1649 SALIDGLCRVGKP EA+EIL EM+N GC NA+TYSSLMKGFF+ G + AV +WK+M + Sbjct: 406 SALIDGLCRVGKPDEAEEILSEMINNGCAANAFTYSSLMKGFFESGKGHKAVEIWKDMAK 465 Query: 1650 KGCVHNEFCYSILIHGLCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQ 1829 CV+NE CYS+LIHGLC++GKL+EA MVW ML +G+ PDVVAY+SMIHGLC+ GS+E+ Sbjct: 466 NNCVYNEVCYSVLIHGLCEDGKLREARMVWTQMLSRGYKPDVVAYSSMIHGLCNAGSLEE 525 Query: 1830 GLLLFNEMLYKGSDSQPDVFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDIVTCNIF 2009 L LFNEML SQPDVFTYN+L NALCK I+ +I LLN+M+DRGCDPD+VTCNIF Sbjct: 526 ALKLFNEMLCPEPKSQPDVFTYNILLNALCKQSNISHSIDLLNSMMDRGCDPDLVTCNIF 585 Query: 2010 LTTFKEKMNPSQDGREFLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIREL 2189 LT KEK+ QDG +FL+EL +RL KRQR G +I+EVMLQKFL PK STWE ++EL Sbjct: 586 LTALKEKLETPQDGTDFLNELAIRLFKRQRTSGGFKIVEVMLQKFLPPKTSTWERVVQEL 645 Query: 2190 CKPRKIQVAINKCWNSLF 2243 C+P++IQ AINKCW++L+ Sbjct: 646 CRPKRIQAAINKCWSNLY 663 >gb|EMJ12567.1| hypothetical protein PRUPE_ppa002507mg [Prunus persica] Length = 664 Score = 870 bits (2247), Expect = 0.0 Identities = 426/643 (66%), Positives = 513/643 (79%), Gaps = 2/643 (0%) Frame = +3 Query: 321 CPFSALPNNSCETEKDYEEDIIAIQSTNSHMLPKRSHKVEVEPPITDRLFKHAPKSGSYK 500 CP S +C + ++AI S N + + + E EPPI++ +FK K GSYK Sbjct: 26 CPISPCELLTCSLHSHFS--VLAIPS-NQALQTEPVNNDETEPPISNEIFKKGTKLGSYK 82 Query: 501 QGDSTFYSLIENYANSGDFKSLEMVFSRMWHERRAFVEKSFIVVFRAYGKAHLPEKAVEL 680 GDSTFYSLIENYAN GDF+SLE V RM ERR F+E+SFI++FRAYGKAHLP KAVEL Sbjct: 83 SGDSTFYSLIENYANLGDFRSLEQVLDRMKRERRVFIEQSFILMFRAYGKAHLPNKAVEL 142 Query: 681 FDRMVYEFQCKRTVKSFNSVLNVIIQEGLYSRALEFHSYVVNCK--NIKPNVLTFNLVIK 854 F RMV EFQC+RTVKSFNSVLNVIIQEG YS ALEF+S+VV NI PNVL+FNL+IK Sbjct: 143 FYRMVDEFQCRRTVKSFNSVLNVIIQEGHYSHALEFYSHVVGTTGMNISPNVLSFNLIIK 202 Query: 855 AMCKLQLVDRAVEVFREMPAMKCDADVFTYCTLMDGLCKEDRVEEAVALLDEMQIEGCFP 1034 +MCKL LVDRAV+VFREMP C DVFTY TLMDGLCKE R++EAV LLDEMQ+EGC P Sbjct: 203 SMCKLGLVDRAVQVFREMPLRNCTPDVFTYSTLMDGLCKEKRIDEAVFLLDEMQLEGCIP 262 Query: 1035 NPATFNVLINGLCKKGDLSRAAKVVDNMFLKGCVPNEVTYNTLIHGLCLQGKLEKAISLL 1214 +P TFNVLIN LCKKGDL RAAK+VDNM LKGCVPNEVTYNTLIHGLCL+GKL KA+SLL Sbjct: 263 SPVTFNVLINALCKKGDLGRAAKLVDNMLLKGCVPNEVTYNTLIHGLCLKGKLAKAVSLL 322 Query: 1215 DRMVSDKFIPNDVTYGTIIDGLVRKGRAVDGVHVMVSMEERGHQGNDHIYSSLVSGLFKE 1394 DRMVS+K +PNDVTYGTII+GLV++GRAVDG V++SMEERG+ N++IYS LVSGLFKE Sbjct: 323 DRMVSNKCVPNDVTYGTIINGLVKRGRAVDGARVLMSMEERGNHANEYIYSVLVSGLFKE 382 Query: 1395 GRSEEALNLWKKIMENGHKPNTVVYSALIDGLCRVGKPFEAKEILPEMVNKGCEPNAYTY 1574 G+SE+A+ LWK+++E G KPNT+ YS LI+GLC GKP EAKE+ EMV+ GC PN++TY Sbjct: 383 GKSEDAMRLWKEMLEKGCKPNTIAYSTLINGLCGEGKPDEAKEVFSEMVSNGCMPNSFTY 442 Query: 1575 SSLMKGFFKVGNSNMAVLLWKEMTEKGCVHNEFCYSILIHGLCDEGKLKEATMVWRHMLG 1754 SSLM+GFF+ G S A+LLWKEM + NE CYS+LIHGLC++G+L EA + W+ MLG Sbjct: 443 SSLMRGFFQTGQSQKAILLWKEMANN--MRNEVCYSVLIHGLCEDGQLNEALIAWQQMLG 500 Query: 1755 KGWTPDVVAYTSMIHGLCSVGSVEQGLLLFNEMLYKGSDSQPDVFTYNVLFNALCKHEKI 1934 +G+ PDVVAY+SMIHGLC+ G VEQGL LFNEML + + QPDV TYN+LFN CK I Sbjct: 501 RGYKPDVVAYSSMIHGLCNAGLVEQGLKLFNEMLCQEPECQPDVITYNILFNVFCKQSSI 560 Query: 1935 TPAIHLLNNMLDRGCDPDIVTCNIFLTTFKEKMNPSQDGREFLDELFLRLHKRQRIDGAS 2114 + AI LN MLDRGCDPD VTC+IFL + +E+++P QDGREFL+EL +RL K+QRI GAS Sbjct: 561 SLAIDHLNRMLDRGCDPDSVTCDIFLRSLRERLDPPQDGREFLNELVVRLFKQQRIVGAS 620 Query: 2115 RIIEVMLQKFLYPKPSTWEIAIRELCKPRKIQVAINKCWNSLF 2243 I+EVMLQKFL PK STW ++ELCKP+ ++ AI+KCW+SL+ Sbjct: 621 IIVEVMLQKFLPPKASTWTRVVQELCKPKMVRAAIDKCWSSLY 663 >ref|XP_006468575.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20090-like [Citrus sinensis] Length = 664 Score = 869 bits (2246), Expect = 0.0 Identities = 413/618 (66%), Positives = 505/618 (81%), Gaps = 2/618 (0%) Frame = +3 Query: 396 STNSHMLPKRSHKVEVEPPITDRLFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMV 575 S+N M + + E P +D +F PK GSY+ GDSTFYSLI++YANSGDFKSLEMV Sbjct: 46 SSNKQMETEPQGNAKSEQPFSDEIFNSTPKLGSYQLGDSTFYSLIQHYANSGDFKSLEMV 105 Query: 576 FSRMWHERRAFVEKSFIVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVII 755 RM E+R +EKSFI +F+AYGKAHL E+A+ LF MV EF CKRTVKSFNSVLNVII Sbjct: 106 LYRMRREKRVVLEKSFIFIFKAYGKAHLVEEAIRLFHTMVDEFHCKRTVKSFNSVLNVII 165 Query: 756 QEGLYSRALEFHSYVVNCK--NIKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDA 929 QEGLY RALEF++++VN K NI PN LTFNLVIK +C+L LVD A+++FREMP C+ Sbjct: 166 QEGLYHRALEFYNHIVNAKHMNILPNTLTFNLVIKTVCRLGLVDNAIQLFREMPVRNCEP 225 Query: 930 DVFTYCTLMDGLCKEDRVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVV 1109 D++TYCTLMDGLCKE+R++EAV LLDEMQ++GCFP P TFNVLINGLCK G+L RAAK+V Sbjct: 226 DIYTYCTLMDGLCKENRLDEAVLLLDEMQVDGCFPTPVTFNVLINGLCKNGELGRAAKLV 285 Query: 1110 DNMFLKGCVPNEVTYNTLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRK 1289 DNMFLKGC+PNEVTYNTLIHGLCL+G L+KA+SLLDRMV+ K +PN+VTYGTII+GLV+ Sbjct: 286 DNMFLKGCLPNEVTYNTLIHGLCLKGNLDKAVSLLDRMVASKCMPNEVTYGTIINGLVKL 345 Query: 1290 GRAVDGVHVMVSMEERGHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVY 1469 GRAVDG V++SMEER N++IYS+L+SGLFKEG++E+A+ LWK++ME G KPNTVVY Sbjct: 346 GRAVDGARVLMSMEERKFHVNEYIYSTLISGLFKEGKAEDAMKLWKQMMEKGCKPNTVVY 405 Query: 1470 SALIDGLCRVGKPFEAKEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTE 1649 SALIDGLCRVGKP EA+EIL EM+N GC NA+TYSSLMKGFF+ G + AV +WK+M + Sbjct: 406 SALIDGLCRVGKPDEAEEILFEMINNGCAANAFTYSSLMKGFFESGKGHKAVEIWKDMAK 465 Query: 1650 KGCVHNEFCYSILIHGLCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQ 1829 CV+NE CYS+LIHGLC++GKL+EA MVW ML +G PDVVAY+SMIHGLC+ GSVE+ Sbjct: 466 NNCVYNEVCYSVLIHGLCEDGKLREARMVWTQMLSRGCKPDVVAYSSMIHGLCNAGSVEE 525 Query: 1830 GLLLFNEMLYKGSDSQPDVFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDIVTCNIF 2009 L LFNEML SQPDVFTYN+L NALCK I+ +I LLN+M+DRGCDPD+VTCNIF Sbjct: 526 ALKLFNEMLCLEPKSQPDVFTYNILLNALCKQSNISHSIDLLNSMMDRGCDPDLVTCNIF 585 Query: 2010 LTTFKEKMNPSQDGREFLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIREL 2189 LT KEK+ QDG +FL+EL +RL KRQR G +I+EVMLQKFL P+ STWE ++EL Sbjct: 586 LTALKEKLEAPQDGTDFLNELAIRLFKRQRTSGGFKIVEVMLQKFLSPQTSTWERVVQEL 645 Query: 2190 CKPRKIQVAINKCWNSLF 2243 C+P++IQ AINKCW++L+ Sbjct: 646 CRPKRIQAAINKCWSNLY 663 >ref|XP_002304600.2| hypothetical protein POPTR_0003s15360g [Populus trichocarpa] gi|550343237|gb|EEE79579.2| hypothetical protein POPTR_0003s15360g [Populus trichocarpa] Length = 672 Score = 867 bits (2241), Expect = 0.0 Identities = 417/609 (68%), Positives = 499/609 (81%), Gaps = 2/609 (0%) Frame = +3 Query: 423 RSHKVEVEPPITDRLFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMVFSRMWHERR 602 R H +E +PPI+D++FK PK GSYK GDSTFYSLI+NYAN GDFKSLE V RM E+R Sbjct: 63 REHGIEHDPPISDKIFKSGPKMGSYKLGDSTFYSLIDNYANLGDFKSLEKVLDRMRCEKR 122 Query: 603 AFVEKSFIVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVIIQEGLYSRAL 782 VEK F+V+F+AYGKAHLPEKAV LFDRM YEF+CKRTVKSFNSVLNVIIQEGL+ RAL Sbjct: 123 VVVEKCFVVIFKAYGKAHLPEKAVGLFDRMAYEFECKRTVKSFNSVLNVIIQEGLFYRAL 182 Query: 783 EFHSYVVNCK--NIKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDADVFTYCTLM 956 EF+++V+ K NI PNVLTFNLVIK MCK+ LVD AV++FR+MP KC DV+TYCTLM Sbjct: 183 EFYNHVIGAKGVNISPNVLTFNLVIKTMCKVGLVDDAVQMFRDMPVSKCQPDVYTYCTLM 242 Query: 957 DGLCKEDRVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVVDNMFLKGCV 1136 DGLCK DR++EAV+LLDEMQI+GCFP+P TFNVLINGLCKKGDL+R AK+VDNMFLKGC Sbjct: 243 DGLCKADRIDEAVSLLDEMQIDGCFPSPVTFNVLINGLCKKGDLARVAKLVDNMFLKGCA 302 Query: 1137 PNEVTYNTLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRKGRAVDGVHV 1316 PNEVTYNTLIHGLCL+GKLEKAISLLDRMVS K +PN VTYGTII+GLV++GRA+DG V Sbjct: 303 PNEVTYNTLIHGLCLKGKLEKAISLLDRMVSSKCVPNVVTYGTIINGLVKQGRALDGARV 362 Query: 1317 MVSMEERGHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVYSALIDGLCR 1496 + MEERG+ N+++YS+L+SGLFKEG+S+EA+ L+K++ + NT+VYSA+IDGLCR Sbjct: 363 LALMEERGYHVNEYVYSALISGLFKEGKSQEAMQLFKEMTVKECELNTIVYSAVIDGLCR 422 Query: 1497 VGKPFEAKEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTEKGCVHNEFC 1676 GKP EA E+L EM N C+PNAYTYSSLMKGFF+ GN + A+ +WK+M + NE C Sbjct: 423 DGKPDEALEVLSEMTNNRCKPNAYTYSSLMKGFFEAGNGHKAIEMWKDMAKHNFTQNEVC 482 Query: 1677 YSILIHGLCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQGLLLFNEML 1856 YS+LIHGLC +GK+KEA MVW MLGKG PDVVAY SMI+GL + G VE L L+NEML Sbjct: 483 YSVLIHGLCKDGKVKEAMMVWAQMLGKGCKPDVVAYGSMINGLSNAGLVEDALQLYNEML 542 Query: 1857 YKGSDSQPDVFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDIVTCNIFLTTFKEKMN 2036 + DSQPDV TYN+L NALCK I+ AI LLN+MLDRGCDPD+VTC IFL T +EK++ Sbjct: 543 CQEPDSQPDVVTYNILLNALCKQSSISRAIDLLNSMLDRGCDPDLVTCIIFLRTLREKLD 602 Query: 2037 PSQDGREFLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIRELCKPRKIQVA 2216 P QDGREFLD L +RL KRQR+ GAS+I+EVMLQK L PKPSTW + +LC P+K+Q A Sbjct: 603 PPQDGREFLDGLVVRLLKRQRVLGASKIVEVMLQKLLPPKPSTWTRVVEDLCNPKKVQAA 662 Query: 2217 INKCWNSLF 2243 I KCW+ L+ Sbjct: 663 IQKCWSILY 671 >gb|EOX96827.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao] Length = 636 Score = 862 bits (2227), Expect = 0.0 Identities = 405/600 (67%), Positives = 498/600 (83%), Gaps = 2/600 (0%) Frame = +3 Query: 450 PITDRLFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMVFSRMWHERRAFVEKSFIV 629 P++D+LF AP+SGS++ GDST YSLI +YA+ DF SL V RM + R F+EK F++ Sbjct: 36 PLSDQLFNSAPQSGSFRLGDSTCYSLIHHYAHKVDFASLHDVLCRMKLQNRVFIEKYFLL 95 Query: 630 VFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVIIQEGLYSRALEFHSYVVNC 809 +F+AYG+AHLPEKAV+LF RM +EF CK TVKSFNSVLNVIIQEG Y RA +F++ V+ Sbjct: 96 IFKAYGRAHLPEKAVDLFHRMPHEFHCKPTVKSFNSVLNVIIQEGFYHRAFDFYNCSVSA 155 Query: 810 KN--IKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDADVFTYCTLMDGLCKEDRV 983 KN I PNVLTFNL++KAMCKL VDRA+EVFREMP KC DV+TYCTLMDGLCKEDR+ Sbjct: 156 KNTNISPNVLTFNLLLKAMCKLGWVDRAIEVFREMPLRKCAPDVYTYCTLMDGLCKEDRI 215 Query: 984 EEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVVDNMFLKGCVPNEVTYNTL 1163 +EAV+LLDEMQ EGCFP P TFNVLINGLCKKGDL+RAAK+VDNMFLKGC+PN+VTYNTL Sbjct: 216 DEAVSLLDEMQTEGCFPTPVTFNVLINGLCKKGDLARAAKLVDNMFLKGCLPNQVTYNTL 275 Query: 1164 IHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRKGRAVDGVHVMVSMEERGH 1343 IHGLCL+GKL+KA+ LLDRMVS IPND+TYGTI++GLV++GR D V ++VSMEERG+ Sbjct: 276 IHGLCLKGKLDKAVILLDRMVSSNCIPNDITYGTIVNGLVKQGRVEDAVMLVVSMEERGY 335 Query: 1344 QGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVYSALIDGLCRVGKPFEAKE 1523 N+++YS+L+SGLFK G+SEEA+ W ++ME G+KPNTVVYS+LIDGLCR GKP EA+E Sbjct: 336 GVNEYVYSALISGLFKGGKSEEAMKRWTEMMEKGYKPNTVVYSSLIDGLCREGKPNEAEE 395 Query: 1524 ILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTEKGCVHNEFCYSILIHGLC 1703 +L EM+ KGC PNAYTYSSLMKGFFK GN + AV +WK+M E C+H++ CYS+LIHGLC Sbjct: 396 VLSEMIEKGCIPNAYTYSSLMKGFFKTGNCHKAVQVWKDMAEHKCIHSQVCYSVLIHGLC 455 Query: 1704 DEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQGLLLFNEMLYKGSDSQPD 1883 ++G L EA M WRHML KG PD VAY+SMI GLC+ GS+E+ L LFNEMLY+ ++SQPD Sbjct: 456 EDGNLSEAMMAWRHMLDKGCKPDAVAYSSMIQGLCNAGSLEEALKLFNEMLYQEAESQPD 515 Query: 1884 VFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDIVTCNIFLTTFKEKMNPSQDGREFL 2063 V TYN+LFNALC + I+ A+ LLN+MLD+ CDPDI TCNIFL T +EK++P QDGREFL Sbjct: 516 VITYNILFNALCNQKSISHAVDLLNSMLDQACDPDIATCNIFLRTLREKVDPPQDGREFL 575 Query: 2064 DELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIRELCKPRKIQVAINKCWNSLF 2243 DEL +RL KRQR+ GAS+I++VMLQKFL PK STW + ELCKP+KIQ AI+KCW +++ Sbjct: 576 DELVIRLFKRQRVFGASKIVQVMLQKFLPPKASTWARVVEELCKPKKIQAAIDKCWRNIY 635 >ref|XP_002297917.1| hypothetical protein POPTR_0001s12190g [Populus trichocarpa] gi|222845175|gb|EEE82722.1| hypothetical protein POPTR_0001s12190g [Populus trichocarpa] Length = 670 Score = 862 bits (2226), Expect = 0.0 Identities = 431/686 (62%), Positives = 527/686 (76%), Gaps = 3/686 (0%) Frame = +3 Query: 195 CIPFVEKVLSVLVIPMLAFTSKSARLILTSNPCKSSFIFLIHCPFSALPNN-SCETEKDY 371 C PF + + + +F SK L + SN F H A+P+ + ETE Sbjct: 4 CQPFNTNSILKALNNLFSFPSKFLSLSMHSN-------FSAH----AIPSTKTIETEP-- 50 Query: 372 EEDIIAIQSTNSHMLPKRSHKVEVEPPITDRLFKHAPKSGSYKQGDSTFYSLIENYANSG 551 + T + + +E +PPI+D++FK PK GSY+ GDSTFYSLI NYAN G Sbjct: 51 ------LNHTQHCNTTDQENGIEPDPPISDKIFKSGPKMGSYRLGDSTFYSLINNYANLG 104 Query: 552 DFKSLEMVFSRMWHERRAFVEKSFIVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSF 731 DFKSLE V RM E+R EK FIV+F+AYGKAHLPEKAV+LFDRM EF+CKRT KSF Sbjct: 105 DFKSLEKVLDRMKCEKRVIFEKCFIVIFKAYGKAHLPEKAVDLFDRMACEFECKRTGKSF 164 Query: 732 NSVLNVIIQEGLYSRALEFHSYVVNCK--NIKPNVLTFNLVIKAMCKLQLVDRAVEVFRE 905 NSVLNVIIQEGL+ RALEF+++V+ K +I PNVLTFNLVIKAMCK+ LVD A++VFR+ Sbjct: 165 NSVLNVIIQEGLFHRALEFYNHVIGAKGVSISPNVLTFNLVIKAMCKVGLVDDAIQVFRD 224 Query: 906 MPAMKCDADVFTYCTLMDGLCKEDRVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGD 1085 M KC+ DV+TYCTLMDGLCK DR++EAV+LLDEMQI+GCFP+P TFNVLINGLCKKGD Sbjct: 225 MTIRKCEPDVYTYCTLMDGLCKADRIDEAVSLLDEMQIDGCFPSPVTFNVLINGLCKKGD 284 Query: 1086 LSRAAKVVDNMFLKGCVPNEVTYNTLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGT 1265 LSRAAK+VDNMFLKGC+PNEVTYNTLIHGLCL+GKLEKAISLLDRMVS K +PN VTYGT Sbjct: 285 LSRAAKLVDNMFLKGCIPNEVTYNTLIHGLCLKGKLEKAISLLDRMVSSKCVPNVVTYGT 344 Query: 1266 IIDGLVRKGRAVDGVHVMVSMEERGHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENG 1445 II+GLV++GRA+DG V+ MEERG+ N+++YS+L+SGLFKEG+S+EA++L+K++ G Sbjct: 345 IINGLVKQGRALDGACVLALMEERGYCVNEYVYSTLISGLFKEGKSQEAMHLFKEMTVKG 404 Query: 1446 HKPNTVVYSALIDGLCRVGKPFEAKEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAV 1625 ++ NT+VYSA+IDGLCR GKP +A E+L EM NKGC PNAYT SSLMKGFF+ GNS+ AV Sbjct: 405 YELNTIVYSAVIDGLCRDGKPDDAVEVLSEMTNKGCTPNAYTCSSLMKGFFEAGNSHRAV 464 Query: 1626 LLWKEMTEKGCVHNEFCYSILIHGLCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGL 1805 +WK+M + NE CYS+LIHGLC +GK+KEA MVW MLGKG PDVVAY+SMI+GL Sbjct: 465 EVWKDMAKHNFTQNEVCYSVLIHGLCKDGKVKEAMMVWTQMLGKGCKPDVVAYSSMINGL 524 Query: 1806 CSVGSVEQGLLLFNEMLYKGSDSQPDVFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDP 1985 G VE + L+NEML +G DSQPDV TYN+L N LCK I+ AI LLN+MLDRGCDP Sbjct: 525 SIAGLVEDAMQLYNEMLCQGPDSQPDVVTYNILLNTLCKQSSISRAIDLLNSMLDRGCDP 584 Query: 1986 DIVTCNIFLTTFKEKMNPSQDGREFLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPST 2165 D+VTC IFL +EK++P QDGREFLDEL +RL KRQR+ GAS+I+EVMLQK L PK ST Sbjct: 585 DLVTCTIFLRMLREKLDPPQDGREFLDELVVRLLKRQRVLGASKIVEVMLQKLLPPKHST 644 Query: 2166 WEIAIRELCKPRKIQVAINKCWNSLF 2243 W + LCKP+K+Q I KCW+ L+ Sbjct: 645 WARVVENLCKPKKVQAVIQKCWSILY 670 >ref|XP_004295517.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20090-like [Fragaria vesca subsp. vesca] Length = 647 Score = 855 bits (2209), Expect = 0.0 Identities = 414/604 (68%), Positives = 494/604 (81%), Gaps = 2/604 (0%) Frame = +3 Query: 438 EVEPPITDRLFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMVFSRMWHERRAFVEK 617 E +PPI++ +F+ P G+YK GDSTFYSLIENYA+ GDF SLE V RM ERR FVE Sbjct: 43 EPDPPISEEIFRKGPNFGAYKSGDSTFYSLIENYASLGDFGSLEKVLDRMKRERRVFVEG 102 Query: 618 SFIVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVIIQEGLYSRALEFHSY 797 SFI VFRA+GKAHLP +AV+LF RMV EFQC+RTVKSFNSVLNVI+QEG Y+ ALEF+ + Sbjct: 103 SFIAVFRAFGKAHLPNQAVDLFHRMVDEFQCRRTVKSFNSVLNVIVQEGHYAHALEFYDH 162 Query: 798 VVNCK--NIKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDADVFTYCTLMDGLCK 971 VV + NI PNVL++NL+IKA+C+ LVD+AVE FREMP C DVFTYCTLMDGLCK Sbjct: 163 VVGDRSMNISPNVLSYNLIIKALCRFGLVDKAVEKFREMPVRDCAPDVFTYCTLMDGLCK 222 Query: 972 EDRVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVVDNMFLKGCVPNEVT 1151 +RV+EAV LLDEMQIEGC P+PA FNVLI+ +CKKGDL RAAK+VDNMFLKGCVPNEVT Sbjct: 223 VNRVDEAVFLLDEMQIEGCSPSPAAFNVLIDAVCKKGDLGRAAKLVDNMFLKGCVPNEVT 282 Query: 1152 YNTLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRKGRAVDGVHVMVSME 1331 YNTLIHGLCLQGKLEKAISLLDRMV +K +PNDVTYGTII+GLV++GR++DGV V++SME Sbjct: 283 YNTLIHGLCLQGKLEKAISLLDRMVLNKCVPNDVTYGTIINGLVKQGRSLDGVRVLISME 342 Query: 1332 ERGHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVYSALIDGLCRVGKPF 1511 ERG + N++IYS LVSGLFKEG+SEEA+ LWK++ME G KPNTVVYSALIDGLC GKP Sbjct: 343 ERGRRANEYIYSVLVSGLFKEGKSEEAMKLWKEMMEKGCKPNTVVYSALIDGLCLDGKPD 402 Query: 1512 EAKEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTEKGCVHNEFCYSILI 1691 EAKE+ EMV GC PN+Y YSSLM+GFF+ G S A+LLWKEM V NE CYS++I Sbjct: 403 EAKEVFCEMVRNGCMPNSYAYSSLMRGFFRTGQSQKAILLWKEMAANNVVRNEVCYSVII 462 Query: 1692 HGLCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQGLLLFNEMLYKGSD 1871 G C EGK+KEA MVW+ +L +G+ DVVAY+SMIHGLC+ G VEQGL LFN+ML + + Sbjct: 463 DGFCKEGKVKEALMVWKQILARGYKLDVVAYSSMIHGLCNDGLVEQGLKLFNDMLSQEPE 522 Query: 1872 SQPDVFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDIVTCNIFLTTFKEKMNPSQDG 2051 QPDV TYN+L NALCK I+ AI LLN+MLD GCDPD+VTC+IFLTT EK++P QDG Sbjct: 523 CQPDVITYNILLNALCKQHTISRAIDLLNSMLDHGCDPDLVTCDIFLTTLGEKLDPPQDG 582 Query: 2052 REFLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIRELCKPRKIQVAINKCW 2231 REFL+EL +RL KRQR GA RI+EVML+KFL P TW ++ELCKP+K++ AI+KCW Sbjct: 583 REFLNELVVRLFKRQRTVGAFRIVEVMLKKFLPPTACTWTTVVQELCKPKKVRAAIDKCW 642 Query: 2232 NSLF 2243 +SL+ Sbjct: 643 SSLY 646 >ref|XP_002528143.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223532441|gb|EEF34234.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 653 Score = 849 bits (2193), Expect = 0.0 Identities = 411/600 (68%), Positives = 491/600 (81%), Gaps = 2/600 (0%) Frame = +3 Query: 450 PITDRLFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMVFSRMWHERRAFVEKSFIV 629 PI+D++F PK GS+K GDSTFYSLIENYA S DF SLE V +RM E R F EKSF V Sbjct: 53 PISDKIFSSPPKMGSFKVGDSTFYSLIENYAYSSDFNSLEKVLNRMRLENRVFSEKSFFV 112 Query: 630 VFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVIIQEGLYSRALEFHSYVVNC 809 +F+AYGKAHLP KA+ELF RM +EF CK TVKSFNSVLNVIIQ G + RALEF+++VV Sbjct: 113 MFKAYGKAHLPNKAIELFYRMSFEFYCKPTVKSFNSVLNVIIQAGFHDRALEFYNHVVGA 172 Query: 810 K--NIKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDADVFTYCTLMDGLCKEDRV 983 K NI PNVL+FNL+IK+MCKL LVD A+E+FREMP KC D +TYCTLMDGLCK DR+ Sbjct: 173 KDMNILPNVLSFNLIIKSMCKLGLVDNAIELFREMPVRKCVPDAYTYCTLMDGLCKVDRI 232 Query: 984 EEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVVDNMFLKGCVPNEVTYNTL 1163 +EAV+LLDEMQIEGCFP+PATFNVLINGLCKKGD +R K+VDNMFLKGCVPNEVTYNTL Sbjct: 233 DEAVSLLDEMQIEGCFPSPATFNVLINGLCKKGDFTRVTKLVDNMFLKGCVPNEVTYNTL 292 Query: 1164 IHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRKGRAVDGVHVMVSMEERGH 1343 IHGLCL+GKL+KA+SLLDRMVS K +PN+VTYGTII+GLV++GRA+DG V+V MEERG+ Sbjct: 293 IHGLCLKGKLDKALSLLDRMVSSKCVPNEVTYGTIINGLVKQGRALDGARVLVLMEERGY 352 Query: 1344 QGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVYSALIDGLCRVGKPFEAKE 1523 N+++YS LVSGLFKEG+SEEA+ L+K+ M+ G K NTV+YSAL+DGLCR KP EA + Sbjct: 353 IVNEYVYSVLVSGLFKEGKSEEAMRLFKESMDKGCKLNTVLYSALVDGLCRDRKPDEAMK 412 Query: 1524 ILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTEKGCVHNEFCYSILIHGLC 1703 IL EM +KGC PNA+T+SSLMKGFF+VGNS+ A+ +WK+MT+ C NE CYS+LIHGLC Sbjct: 413 ILSEMTDKGCAPNAFTFSSLMKGFFEVGNSHKAIEVWKDMTKINCAENEVCYSVLIHGLC 472 Query: 1704 DEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQGLLLFNEMLYKGSDSQPD 1883 +GK+ EA MVW ML G PDVVAY+SMI GLC GSVE+ L L+NEML DSQPD Sbjct: 473 KDGKVMEAMMVWAKMLATGCRPDVVAYSSMIQGLCDAGSVEEALKLYNEMLCLEPDSQPD 532 Query: 1884 VFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDIVTCNIFLTTFKEKMNPSQDGREFL 2063 V TYN+LFNALCK I+ A+ LLN+MLDRGCDPD+VTCNIFL +EK++P QDG +FL Sbjct: 533 VITYNILFNALCKQSSISRAVDLLNSMLDRGCDPDLVTCNIFLRMLREKLDPPQDGAKFL 592 Query: 2064 DELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIRELCKPRKIQVAINKCWNSLF 2243 DEL +RL KRQR GAS+I+EVMLQKFL PK STW + ELC+P+KIQ I+KCW+ L+ Sbjct: 593 DELVVRLLKRQRNLGASKIVEVMLQKFLSPKASTWARVVHELCQPKKIQAVIDKCWSKLY 652 >ref|XP_002867892.1| EMB1025 [Arabidopsis lyrata subsp. lyrata] gi|297313728|gb|EFH44151.1| EMB1025 [Arabidopsis lyrata subsp. lyrata] Length = 658 Score = 818 bits (2112), Expect = 0.0 Identities = 411/665 (61%), Positives = 497/665 (74%), Gaps = 9/665 (1%) Frame = +3 Query: 273 ILTSNPCKSSFIFLIHCPFSAL-----PNNSCETEKDYEEDIIAIQSTNSHMLPKRSHKV 437 +L+SNP K F IH FSA PN S E E Sbjct: 22 LLSSNPVK----FSIHLRFSASSVSVSPNPSMEVE------------------------T 53 Query: 438 EVEPPITDRLFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMVFSRMWHERRAFVEK 617 +E PI++++FK APK GS+K GDST S+IENYAN GDF S+E + SR+ E R +E+ Sbjct: 54 PLEAPISEQMFKSAPKMGSFKLGDSTLSSMIENYANLGDFASVEKLLSRIRLENRVIIER 113 Query: 618 SFIVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVIIQEGLYSRALEFHSY 797 SFIVVFRAYGKAHLPEKAV+LF RMV EF+CKR+VKSFNSVLNVII EGLY R LEF+ Y Sbjct: 114 SFIVVFRAYGKAHLPEKAVDLFHRMVDEFRCKRSVKSFNSVLNVIINEGLYHRGLEFYDY 173 Query: 798 VVNCK---NIKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDADVFTYCTLMDGLC 968 VVN NI PN L+FNLVIKA+CKL VDRA+EVFR MP KC D +TYCTLMDGLC Sbjct: 174 VVNSNMNMNISPNGLSFNLVIKALCKLGFVDRAIEVFRGMPEKKCLPDGYTYCTLMDGLC 233 Query: 969 KEDRVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVVDNMFLKGCVPNEV 1148 KE+R++EAV LLDEMQ EGC P+P +NVLI+GLCKKGDLSR K+VDNMFLKGC PNEV Sbjct: 234 KEERIDEAVLLLDEMQSEGCSPSPVIYNVLIDGLCKKGDLSRVTKLVDNMFLKGCFPNEV 293 Query: 1149 TYNTLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRKGRAVDGVHVMVSM 1328 TYNTLIHGLCL+GKL+KA+SLL+RMVS K IPNDVTYGT+I+GLV++ RA+DG +++SM Sbjct: 294 TYNTLIHGLCLKGKLDKAVSLLERMVSSKCIPNDVTYGTLINGLVKQRRAMDGARLLISM 353 Query: 1329 EERGHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVYSALIDGLCRVGKP 1508 EERG++ N HIYS L+SGLFKEG++EEA+ LWKK+ E G +PN VVYSA+IDGLCR GKP Sbjct: 354 EERGYRLNQHIYSVLISGLFKEGKAEEAMTLWKKMAEKGCRPNIVVYSAVIDGLCREGKP 413 Query: 1509 FEAKEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTEKGCVHNEFCYSIL 1688 EAKEIL M++ GC PN YTYSSLMKGFFK G S A+ +W+EM E GC NEFCYS+L Sbjct: 414 NEAKEILNGMISSGCLPNVYTYSSLMKGFFKTGLSEEAIQVWREMDETGCSRNEFCYSVL 473 Query: 1689 IHGLCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQGLLLFNEML-YKG 1865 I GLC G++KEA MVW ML G PD VAY+SMI GLC +GS++ L L++EML + Sbjct: 474 IDGLCGVGRVKEAMMVWSKMLTIGIKPDTVAYSSMIKGLCGIGSMDAALKLYHEMLCQEE 533 Query: 1866 SDSQPDVFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDIVTCNIFLTTFKEKMNPSQ 2045 SQPDV TYN+L + LC + ++ A+ LLN MLDRGCDPD++TCN FL T EK + + Sbjct: 534 PKSQPDVVTYNILLDGLCMQKDVSRAVDLLNCMLDRGCDPDVITCNTFLNTLSEKSDSCE 593 Query: 2046 DGREFLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIRELCKPRKIQVAINK 2225 +GR FL+EL RL KRQR+ GA +I+EVML K+L PK STW + + E+CKP+KI AINK Sbjct: 594 EGRSFLEELVARLLKRQRVSGACKIVEVMLGKYLAPKTSTWAMIVPEICKPKKINAAINK 653 Query: 2226 CWNSL 2240 CW +L Sbjct: 654 CWRNL 658 >ref|NP_193742.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75098720|sp|O49436.1|PP327_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g20090; AltName: Full=Protein EMBRYO DEFECTIVE 1025 gi|2827663|emb|CAA16617.1| membrane-associated salt-inducible-like protein [Arabidopsis thaliana] gi|7268804|emb|CAB79009.1| membrane-associated salt-inducible-like protein [Arabidopsis thaliana] gi|58013024|gb|AAW62965.1| embryo-defective 1025 [Arabidopsis thaliana] gi|332658871|gb|AEE84271.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 660 Score = 812 bits (2098), Expect = 0.0 Identities = 408/673 (60%), Positives = 498/673 (73%), Gaps = 4/673 (0%) Frame = +3 Query: 234 IPMLAFTSKSARLILTSNPCKSSFIFLIHCPFSALPNNSCETEKDYEEDIIAIQSTNSHM 413 I ++ K +R IL+SNP S S PN S E ++ Sbjct: 10 ISFFSYFLKESR-ILSSNPVNFSIHLRFSSSVSVSPNPSMEVVEN--------------- 53 Query: 414 LPKRSHKVEVEPPITDRLFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMVFSRMWH 593 +E PI++++FK APK GS+K GDST S+IE+YANSGDF S+E + SR+ Sbjct: 54 --------PLEAPISEKMFKSAPKMGSFKLGDSTLSSMIESYANSGDFDSVEKLLSRIRL 105 Query: 594 ERRAFVEKSFIVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVIIQEGLYS 773 E R +E+SFIVVFRAYGKAHLP+KAV+LF RMV EF+CKR+VKSFNSVLNVII EGLY Sbjct: 106 ENRVIIERSFIVVFRAYGKAHLPDKAVDLFHRMVDEFRCKRSVKSFNSVLNVIINEGLYH 165 Query: 774 RALEFHSYVVNCK---NIKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDADVFTY 944 R LEF+ YVVN NI PN L+FNLVIKA+CKL+ VDRA+EVFR MP KC D +TY Sbjct: 166 RGLEFYDYVVNSNMNMNISPNGLSFNLVIKALCKLRFVDRAIEVFRGMPERKCLPDGYTY 225 Query: 945 CTLMDGLCKEDRVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVVDNMFL 1124 CTLMDGLCKE+R++EAV LLDEMQ EGC P+P +NVLI+GLCKKGDL+R K+VDNMFL Sbjct: 226 CTLMDGLCKEERIDEAVLLLDEMQSEGCSPSPVIYNVLIDGLCKKGDLTRVTKLVDNMFL 285 Query: 1125 KGCVPNEVTYNTLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRKGRAVD 1304 KGCVPNEVTYNTLIHGLCL+GKL+KA+SLL+RMVS K IPNDVTYGT+I+GLV++ RA D Sbjct: 286 KGCVPNEVTYNTLIHGLCLKGKLDKAVSLLERMVSSKCIPNDVTYGTLINGLVKQRRATD 345 Query: 1305 GVHVMVSMEERGHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVYSALID 1484 V ++ SMEERG+ N HIYS L+SGLFKEG++EEA++LW+K+ E G KPN VVYS L+D Sbjct: 346 AVRLLSSMEERGYHLNQHIYSVLISGLFKEGKAEEAMSLWRKMAEKGCKPNIVVYSVLVD 405 Query: 1485 GLCRVGKPFEAKEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTEKGCVH 1664 GLCR GKP EAKEIL M+ GC PNAYTYSSLMKGFFK G AV +WKEM + GC Sbjct: 406 GLCREGKPNEAKEILNRMIASGCLPNAYTYSSLMKGFFKTGLCEEAVQVWKEMDKTGCSR 465 Query: 1665 NEFCYSILIHGLCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQGLLLF 1844 N+FCYS+LI GLC G++KEA MVW ML G PD VAY+S+I GLC +GS++ L L+ Sbjct: 466 NKFCYSVLIDGLCGVGRVKEAMMVWSKMLTIGIKPDTVAYSSIIKGLCGIGSMDAALKLY 525 Query: 1845 NEML-YKGSDSQPDVFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDIVTCNIFLTTF 2021 +EML + SQPDV TYN+L + LC + I+ A+ LLN+MLDRGCDPD++TCN FL T Sbjct: 526 HEMLCQEEPKSQPDVVTYNILLDGLCMQKDISRAVDLLNSMLDRGCDPDVITCNTFLNTL 585 Query: 2022 KEKMNPSQDGREFLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIRELCKPR 2201 EK N GR FL+EL +RL KRQR+ GA I+EVML K+L PK STW + +RE+CKP+ Sbjct: 586 SEKSNSCDKGRSFLEELVVRLLKRQRVSGACTIVEVMLGKYLAPKTSTWAMIVREICKPK 645 Query: 2202 KIQVAINKCWNSL 2240 KI AI+KCW +L Sbjct: 646 KINAAIDKCWRNL 658 >ref|XP_006283284.1| hypothetical protein CARUB_v10004320mg [Capsella rubella] gi|482551989|gb|EOA16182.1| hypothetical protein CARUB_v10004320mg [Capsella rubella] Length = 660 Score = 808 bits (2086), Expect = 0.0 Identities = 397/621 (63%), Positives = 485/621 (78%), Gaps = 6/621 (0%) Frame = +3 Query: 396 STNSHMLPKRSHKVE--VEPPITDRLFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLE 569 S++ + P S +VE E PI++ +FK APK GSYK GDST S+IENYANSGDF S+E Sbjct: 38 SSSVSVSPDPSMEVENPSEAPISENMFKSAPKMGSYKLGDSTLSSMIENYANSGDFASVE 97 Query: 570 MVFSRMWHERRAFVEKSFIVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNV 749 V SR+ E R E SFIVVFRAYGKAHLP KAV+LF RMV EFQCKR+VKSFNSVLNV Sbjct: 98 QVLSRVRLENRVISEHSFIVVFRAYGKAHLPGKAVDLFHRMVDEFQCKRSVKSFNSVLNV 157 Query: 750 IIQEGLYSRALEFHSYVVNCK---NIKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMK 920 I+ EGLY R LEF+ YVVN NI PN L+FNLVIKA+CKL V++A+EVFREMP K Sbjct: 158 ILNEGLYHRGLEFYDYVVNSNMNMNIAPNGLSFNLVIKALCKLGFVNKAIEVFREMPEKK 217 Query: 921 CDADVFTYCTLMDGLCKEDRVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAA 1100 C D +TYCTLMDGLCKE+R++EAV LLDEMQ EGC P+ T+NVLI+GLCKKGDL+R Sbjct: 218 CLPDGYTYCTLMDGLCKEERIDEAVLLLDEMQSEGCSPSSVTYNVLIDGLCKKGDLTRVT 277 Query: 1101 KVVDNMFLKGCVPNEVTYNTLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGL 1280 K+VDNMFLKGCVPNEVTYNTLIHGLCL+GKL KA+SLL+RMVS K IPNDVTYGT+I+GL Sbjct: 278 KLVDNMFLKGCVPNEVTYNTLIHGLCLKGKLNKAVSLLERMVSSKCIPNDVTYGTLINGL 337 Query: 1281 VRKGRAVDGVHVMVSMEERGHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNT 1460 V++ RA D V +++SMEERG+ N HIYS L+SGLFKEG++EEA+ LWKK++E G +PN Sbjct: 338 VKQRRATDAVRLLISMEERGYCLNQHIYSVLISGLFKEGKAEEAMTLWKKMVEKGCRPNI 397 Query: 1461 VVYSALIDGLCRVGKPFEAKEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKE 1640 VVYSAL+DGLCR GKP EAKEI M++ GC PNAYTYSSLMKGFF+ G S A+ +W+E Sbjct: 398 VVYSALVDGLCREGKPNEAKEIFRGMISNGCLPNAYTYSSLMKGFFRTGLSEEAIQVWRE 457 Query: 1641 MTEKGCVHNEFCYSILIHGLCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGS 1820 M + GC NEFCYS+LI GLC G++ EA M+W ML G PD VAY+SMI GLC +GS Sbjct: 458 MDDTGCSRNEFCYSVLIDGLCGIGRVNEAMMLWSKMLTIGIKPDTVAYSSMIKGLCGIGS 517 Query: 1821 VEQGLLLFNEMLYKGS-DSQPDVFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDIVT 1997 ++ L L++EML + SQPD+ TYN+LF+ LC + ++ A+ LLN MLDRGCDPD++T Sbjct: 518 MDAALKLYHEMLCEEEPKSQPDIVTYNILFDGLCMQKDVSRAVDLLNFMLDRGCDPDVIT 577 Query: 1998 CNIFLTTFKEKMNPSQDGREFLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIA 2177 CN FL T EK + ++GR FL+EL LRL KRQR+ GA +I+EVML K+L PK STW + Sbjct: 578 CNTFLKTLSEKSDSCEEGRNFLEELVLRLLKRQRVSGACKIVEVMLDKYLTPKISTWVLI 637 Query: 2178 IRELCKPRKIQVAINKCWNSL 2240 + E+CKP+KI AI+KCW +L Sbjct: 638 VPEICKPKKINAAIDKCWRNL 658 >ref|XP_003534864.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20090-like isoform X1 [Glycine max] gi|571476386|ref|XP_006586943.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20090-like isoform X2 [Glycine max] gi|571476388|ref|XP_006586944.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20090-like isoform X3 [Glycine max] gi|571476390|ref|XP_006586945.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20090-like isoform X4 [Glycine max] gi|571476393|ref|XP_006586946.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20090-like isoform X5 [Glycine max] gi|571476395|ref|XP_006586947.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20090-like isoform X6 [Glycine max] Length = 642 Score = 802 bits (2071), Expect = 0.0 Identities = 394/641 (61%), Positives = 499/641 (77%), Gaps = 4/641 (0%) Frame = +3 Query: 330 SALPNNSCET--EKDYEEDIIAIQSTNSHMLPKRSHKVEVEPPITDRLFKHAPKSGSYKQ 503 S+ P N T + + + +I + S +S SHK P + +FK + GSYK Sbjct: 9 SSFPTNLLRTTLHRYFSQTLITLPSYSSS-----SHK----PHPSSEIFKSGTQMGSYKL 59 Query: 504 GDSTFYSLIENYANSGDFKSLEMVFSRMWHERRAFVEKSFIVVFRAYGKAHLPEKAVELF 683 GD +FYSLIE++A+S DF+SLE V +M ERR F+EK+FIV+F+AYGKAHLPEKAV+LF Sbjct: 60 GDLSFYSLIESHASSLDFRSLEEVLHQMKRERRVFLEKNFIVMFKAYGKAHLPEKAVDLF 119 Query: 684 DRMVYEFQCKRTVKSFNSVLNVIIQEGLYSRALEFHSYVVNCK--NIKPNVLTFNLVIKA 857 RM EFQCK+TVKSFNSVLNVI+QEGL++RALEF+++VV K NI PN LTFNLVIKA Sbjct: 120 HRMWGEFQCKQTVKSFNSVLNVIVQEGLFNRALEFYNHVVASKSLNIHPNALTFNLVIKA 179 Query: 858 MCKLQLVDRAVEVFREMPAMKCDADVFTYCTLMDGLCKEDRVEEAVALLDEMQIEGCFPN 1037 MC+L LVD+A+EVFRE+P C D +TY TLM GLCKE+R++EAV+LLDEMQ+EG FPN Sbjct: 180 MCRLGLVDKAIEVFREIPLRNCAPDNYTYSTLMHGLCKEERIDEAVSLLDEMQVEGTFPN 239 Query: 1038 PATFNVLINGLCKKGDLSRAAKVVDNMFLKGCVPNEVTYNTLIHGLCLQGKLEKAISLLD 1217 FNVLI+ LCKKGDL RAAK+VDNMFLKGCVPNEVTYN L+HGLCL+GKLEKA+SLL+ Sbjct: 240 LVAFNVLISALCKKGDLGRAAKLVDNMFLKGCVPNEVTYNALVHGLCLKGKLEKAVSLLN 299 Query: 1218 RMVSDKFIPNDVTYGTIIDGLVRKGRAVDGVHVMVSMEERGHQGNDHIYSSLVSGLFKEG 1397 +MVS+K +PNDVT+GT+I+G V +GRA DG V+VS+E RGH+GN+++YSSL+SGL KEG Sbjct: 300 QMVSNKCVPNDVTFGTLINGFVMQGRASDGTRVLVSLEARGHRGNEYVYSSLISGLCKEG 359 Query: 1398 RSEEALNLWKKIMENGHKPNTVVYSALIDGLCRVGKPFEAKEILPEMVNKGCEPNAYTYS 1577 + +A+ LWK+++ G PNT+VYSALIDGLCR GK EA+ L EM NKG PN++TYS Sbjct: 360 KFNQAMELWKEMVGKGCGPNTIVYSALIDGLCREGKLDEARGFLSEMKNKGYLPNSFTYS 419 Query: 1578 SLMKGFFKVGNSNMAVLLWKEMTEKGCVHNEFCYSILIHGLCDEGKLKEATMVWRHMLGK 1757 SLM+G+F+ G+S+ A+L+WKEM C+HNE CYSILI+GLC +GK EA MVW+ ML + Sbjct: 420 SLMRGYFEAGDSHKAILVWKEMANNNCIHNEVCYSILINGLCKDGKFMEALMVWKQMLSR 479 Query: 1758 GWTPDVVAYTSMIHGLCSVGSVEQGLLLFNEMLYKGSDSQPDVFTYNVLFNALCKHEKIT 1937 G DVVAY+SMIHG C+ VEQGL LFN+ML +G QPDV TYN+L NA C + I Sbjct: 480 GIKLDVVAYSSMIHGFCNANLVEQGLKLFNQMLCQGPVVQPDVITYNILLNAFCIQKSIF 539 Query: 1938 PAIHLLNNMLDRGCDPDIVTCNIFLTTFKEKMNPSQDGREFLDELFLRLHKRQRIDGASR 2117 AI +LN MLD+GCDPD +TC+IFL T +E MNP QDGREFLDEL +RL KRQR GAS+ Sbjct: 540 RAIDILNIMLDQGCDPDFITCDIFLKTLRENMNPPQDGREFLDELVVRLVKRQRTIGASK 599 Query: 2118 IIEVMLQKFLYPKPSTWEIAIRELCKPRKIQVAINKCWNSL 2240 IIEVM+ KFL PK STW + ++++CKP+ ++ AI++CW+ L Sbjct: 600 IIEVMMHKFLLPKASTWAMVVQQVCKPKNVRKAISECWSRL 640 >gb|EXB83265.1| hypothetical protein L484_011559 [Morus notabilis] Length = 699 Score = 797 bits (2058), Expect = 0.0 Identities = 396/604 (65%), Positives = 478/604 (79%), Gaps = 19/604 (3%) Frame = +3 Query: 450 PITDRLF---KHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMVFSRMWHERRAFVEKS 620 P++ +LF +P SGSYK GDSTFYSLI NYA+S DF+SLE V R+ ERR VEK Sbjct: 45 PLSPQLFMPSSSSPDSGSYKLGDSTFYSLIHNYASSADFRSLEKVLDRIKSERRVLVEKC 104 Query: 621 FIVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVIIQEGLYSRALEFH--- 791 FIV+FRAYGKAHLP KAV+LF RM+++F+C+ TVKSFNSVLNVIIQE +S AL+F+ Sbjct: 105 FIVIFRAYGKAHLPNKAVDLFQRMLHDFRCRPTVKSFNSVLNVIIQEHKFSYALDFYYSN 164 Query: 792 ----------SYVVNCKN--IKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDADV 935 ++N KN I PNVLTFNLVIKAMCKL LVDRAV+VFRE+P C DV Sbjct: 165 VVALRSGVCKDNILNMKNMNISPNVLTFNLVIKAMCKLGLVDRAVQVFREIPLRNCTPDV 224 Query: 936 FTYCTLMDGLCKEDRVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVVDN 1115 FTY TLMDGLCKE+R++EAV+LLDEMQIEGCFP+P TFNVLI+ LCKKGD+ RAAK+VDN Sbjct: 225 FTYSTLMDGLCKENRIDEAVSLLDEMQIEGCFPSPVTFNVLISALCKKGDIGRAAKLVDN 284 Query: 1116 MFLKGCVPNEVTYNTLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRKGR 1295 MFLK C+PNE TYN LIHGLCL+GKL KA+SLLDRMV +K +PNDVTYGTII+GLV+ GR Sbjct: 285 MFLKDCLPNEATYNALIHGLCLKGKLNKAVSLLDRMVMNKCVPNDVTYGTIINGLVKHGR 344 Query: 1296 AVDGVHVMVSMEERGHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVYSA 1475 A DG +++VSMEERG N+++YS+L+SGLFKEG+ EEA+ LWK + GHKPN VVYSA Sbjct: 345 AFDGANLLVSMEERGRHANEYVYSALISGLFKEGKYEEAMGLWKDMTGKGHKPNVVVYSA 404 Query: 1476 LIDGLCRVGKPFEAKEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTEKG 1655 LIDGLCR GKP +AKE++ EMV G PN+ TYSSLM+GFFK S+ A+LLWKE+ Sbjct: 405 LIDGLCREGKPDKAKEVMFEMVKNGFNPNSRTYSSLMRGFFKASESHKAILLWKEIVANN 464 Query: 1656 CVHNEFCYSILIHGLCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQGL 1835 + NEFCYS+LI GLC +GKLKEA M+W+ ML +G+ PDVVAY+SMIHGLC+ G VE+G+ Sbjct: 465 -LENEFCYSVLIDGLCGDGKLKEALMMWKQMLYRGFKPDVVAYSSMIHGLCTAGLVEEGM 523 Query: 1836 LLFNEMLYKGSDSQPDVFTYNVLFNALCKH-EKITPAIHLLNNMLDRGCDPDIVTCNIFL 2012 LFNEML +SQPDV TYN+L NALCK+ I+ A+ LLN MLD GCDPD++TC+IFL Sbjct: 524 NLFNEMLCLEPESQPDVITYNILLNALCKNGGSISRAVDLLNYMLDLGCDPDVITCDIFL 583 Query: 2013 TTFKEKMNPSQDGREFLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIRELC 2192 T +EK+ P QDGREFLDEL +RL KR+RI GA I+EVMLQKFL PK STW I++LC Sbjct: 584 RTLREKLEPPQDGREFLDELAVRLLKRERIKGAVTIVEVMLQKFLPPKASTWARVIQQLC 643 Query: 2193 KPRK 2204 KP+K Sbjct: 644 KPKK 647 >gb|ESW10855.1| hypothetical protein PHAVU_009G243700g [Phaseolus vulgaris] Length = 645 Score = 793 bits (2049), Expect = 0.0 Identities = 380/601 (63%), Positives = 483/601 (80%), Gaps = 2/601 (0%) Frame = +3 Query: 444 EPPITDRLFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMVFSRMWHERRAFVEKSF 623 +P + +FK K GSYK GD +FYSLI+N+A++ DF SLE V +M ERR FVE++F Sbjct: 43 QPHPSAEIFKSGTKMGSYKLGDLSFYSLIQNHASTLDFGSLEEVLQQMKRERRVFVERNF 102 Query: 624 IVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVIIQEGLYSRALEFHSYVV 803 IV+F+AYGKAHLPEKAV+LF RM EFQCK+TVKSFNSVL+V+IQEGL++RALE +S+VV Sbjct: 103 IVMFKAYGKAHLPEKAVDLFLRMGGEFQCKQTVKSFNSVLSVVIQEGLFNRALELYSHVV 162 Query: 804 NCK--NIKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDADVFTYCTLMDGLCKED 977 K NI PN LTFNL+IKAMC+L LVD+AVEVFRE+P C D +TY TLM GLC+E Sbjct: 163 ASKSFNIHPNALTFNLLIKAMCRLGLVDQAVEVFREIPLRNCAPDAYTYSTLMHGLCQEG 222 Query: 978 RVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVVDNMFLKGCVPNEVTYN 1157 R++EAV+LLDEMQ+EG FPNP FNVLI+ LCK GDL+RAAK+VDNMFLKGCVPNEVTYN Sbjct: 223 RIDEAVSLLDEMQVEGTFPNPVAFNVLISALCKNGDLARAAKLVDNMFLKGCVPNEVTYN 282 Query: 1158 TLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRKGRAVDGVHVMVSMEER 1337 L+HGLCL+GKLEKA+SLL+RMV +K +PNDVT+GT+I+G V++GRA +G V+VS+EER Sbjct: 283 ALVHGLCLKGKLEKAVSLLNRMVLNKCVPNDVTFGTLINGFVKQGRASEGARVLVSLEER 342 Query: 1338 GHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVYSALIDGLCRVGKPFEA 1517 H GN+++YSSL+SGL KEG+ A+ LWK+++ G KPNTVVYSALIDGLCR GK EA Sbjct: 343 DHCGNEYVYSSLISGLCKEGKFNHAMQLWKEMVGKGCKPNTVVYSALIDGLCREGKLDEA 402 Query: 1518 KEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTEKGCVHNEFCYSILIHG 1697 +E+L EM +KG PN++TYSSLM+G+F+ G S+ A+L+WKEM + C HNE CYSILI+G Sbjct: 403 REVLSEMKSKGYLPNSFTYSSLMRGYFEAGISHKAILVWKEMADNNCNHNEVCYSILING 462 Query: 1698 LCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQGLLLFNEMLYKGSDSQ 1877 LC +GK+ EA MVW+ ML +G DVVAY+SMIHG C+ +E GL LFN+ML + + Q Sbjct: 463 LCKDGKVMEALMVWKQMLSRGIKLDVVAYSSMIHGFCNANLIEHGLKLFNQMLCQEPEVQ 522 Query: 1878 PDVFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDIVTCNIFLTTFKEKMNPSQDGRE 2057 PDV TYN++ NALC H I+ AI +LN MLD+GCDPD +TC++FL T +E +NP QDGRE Sbjct: 523 PDVITYNIILNALCMHNSISRAIDILNIMLDQGCDPDFITCDVFLKTLRENVNPPQDGRE 582 Query: 2058 FLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIRELCKPRKIQVAINKCWNS 2237 FLDEL +RL KRQR GAS+IIEVML KFL PK STW + +++LCKP++++ I++CW+ Sbjct: 583 FLDELVVRLVKRQRTIGASKIIEVMLHKFLLPKASTWAMIVQQLCKPKRVRKVISECWSK 642 Query: 2238 L 2240 L Sbjct: 643 L 643 >ref|XP_006404148.1| hypothetical protein EUTSA_v10010168mg [Eutrema salsugineum] gi|557105267|gb|ESQ45601.1| hypothetical protein EUTSA_v10010168mg [Eutrema salsugineum] Length = 696 Score = 790 bits (2040), Expect = 0.0 Identities = 398/667 (59%), Positives = 493/667 (73%), Gaps = 3/667 (0%) Frame = +3 Query: 249 FTSKSARLILTSNPCKSSFIFL-IHCPFSALPNNSCETEKDYEEDIIAIQSTNSHMLPKR 425 F +KS IL+SNP K S L S P S ETE+ + E+ A Sbjct: 49 FLNKSR--ILSSNPVKLSIHLLCFSSSVSVSPKPSMETEQQHTENPSAA----------- 95 Query: 426 SHKVEVEPPITDRLFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMVFSRMWHERRA 605 PI++++F+ APK GSYK GDST S+IENYANSGDF S+E + SR+ E R Sbjct: 96 --------PISEKMFESAPKMGSYKLGDSTLSSMIENYANSGDFASVEKLLSRIRLENRM 147 Query: 606 FVEKSFIVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVIIQEGLYSRALE 785 E SFIV+FRAYGKAHLPEK +ELF RMV EFQCKRT+KSFNSVLNVII EG Y R LE Sbjct: 148 IREHSFIVLFRAYGKAHLPEKTIELFHRMVDEFQCKRTIKSFNSVLNVIINEGRYHRGLE 207 Query: 786 FHSYVVNCK-NIKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDADVFTYCTLMDG 962 F+ YVVN NI PN L+FNLVIKAMCKL VDRA+EVFR MP KC D +TYCTLMDG Sbjct: 208 FYDYVVNSNMNIAPNGLSFNLVIKAMCKLGFVDRAIEVFRVMPEKKCVPDGYTYCTLMDG 267 Query: 963 LCKEDRVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVVDNMFLKGCVPN 1142 LCKE+R++EAV LLDEMQ EGC P+ T+NVLI+GLCKKGDL+R K+VDNMFLKGCVPN Sbjct: 268 LCKEERIDEAVLLLDEMQSEGCSPSSVTYNVLIDGLCKKGDLTRVTKLVDNMFLKGCVPN 327 Query: 1143 EVTYNTLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRKGRAVDGVHVMV 1322 +VTYNTLIHGLCL+GKL+KA+SLL+RMVS K IPNDVTYGT+I+GLV++ RA+DG +++ Sbjct: 328 KVTYNTLIHGLCLKGKLDKAVSLLERMVSSKCIPNDVTYGTLINGLVKQRRAMDGAGLLI 387 Query: 1323 SMEERGHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVYSALIDGLCRVG 1502 SMEERG++ N H+YS L+SGLFKEG+ EEA++LWKK+ E G +PN VVYSAL+DGLCR G Sbjct: 388 SMEERGYRLNQHVYSILISGLFKEGKVEEAMSLWKKMGEKGCQPNIVVYSALVDGLCRQG 447 Query: 1503 KPFEAKEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTEKGCVHNEFCYS 1682 K EAKEI M++ GC PN YTYSSLMKGFFK G S A+ +W+EM C N+ CYS Sbjct: 448 KTKEAKEIFDIMISNGCLPNVYTYSSLMKGFFKTGLSEEAIQVWREMDNTECSRNKVCYS 507 Query: 1683 ILIHGLCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQGLLLFNEML-Y 1859 +LI GLC G++KEA MVW ML G PD VAY+SMI G C +GS++ + L++EML Sbjct: 508 VLIDGLCGVGRVKEAMMVWSKMLIIGIKPDTVAYSSMIKGFCGIGSMDAAIRLYHEMLCQ 567 Query: 1860 KGSDSQPDVFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDIVTCNIFLTTFKEKMNP 2039 + SQPDV TYN++ + C + I+ A+ LLN MLDRGCDPD +TC+ FL T +K + Sbjct: 568 EDHKSQPDVVTYNIIIDGFCMQKDISRAVDLLNCMLDRGCDPDAITCDTFLKTLSKKSDS 627 Query: 2040 SQDGREFLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIRELCKPRKIQVAI 2219 ++G+ FL+EL +RL KRQR+ GA +I+EVML K+L PK STW + + E+CKP+KI VAI Sbjct: 628 CEEGKSFLEELVVRLLKRQRVSGACKIVEVMLSKYLTPKASTWAMIVPEICKPKKINVAI 687 Query: 2220 NKCWNSL 2240 +KCW ++ Sbjct: 688 DKCWRNM 694 >ref|XP_003594857.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355483905|gb|AES65108.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 647 Score = 781 bits (2016), Expect = 0.0 Identities = 381/619 (61%), Positives = 486/619 (78%), Gaps = 8/619 (1%) Frame = +3 Query: 396 STNSHMLPKRSHKVEVEPPITDRLFKH-----APKSGSYKQGDSTFYSLIENYANSGDFK 560 S +S LP H + PP ++FK + K GSYK GD +FYSLIEN++NS DF Sbjct: 29 SYSSSNLPHTHHSL---PP---QIFKSPSNTSSHKWGSYKLGDLSFYSLIENFSNSLDFT 82 Query: 561 SLEMVFSRMWHERRAFVEKSFIVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSV 740 SLE + +M E R F+EKSFI++F+AYGKAHLP+KA++LF RM EF CK+TVKSFN+V Sbjct: 83 SLEQLLHQMKCENRVFIEKSFIIMFKAYGKAHLPQKALDLFHRMGAEFHCKQTVKSFNTV 142 Query: 741 LNVIIQEGLYSRALEFHSYVVNCK---NIKPNVLTFNLVIKAMCKLQLVDRAVEVFREMP 911 LNV+IQEG + ALEF+++V++ NI+PN L+FNLVIKA+C++ VD+AVEVFR M Sbjct: 143 LNVVIQEGCFDLALEFYNHVIDSNSFSNIQPNGLSFNLVIKALCRVGNVDQAVEVFRGMS 202 Query: 912 AMKCDADVFTYCTLMDGLCKEDRVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLS 1091 C AD +TY TLM GLC E R++EAV+LLDEMQ+EG FPNP FNVLI+ LCKKGDLS Sbjct: 203 DRNCVADGYTYSTLMHGLCNEGRIDEAVSLLDEMQVEGTFPNPVAFNVLISALCKKGDLS 262 Query: 1092 RAAKVVDNMFLKGCVPNEVTYNTLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTII 1271 RA+K+VDNMFLKGCVPNEVTYN+L+HGLCL+GKL+KA+SLL+RMV++K +PND+T+GT++ Sbjct: 263 RASKLVDNMFLKGCVPNEVTYNSLVHGLCLKGKLDKAMSLLNRMVANKCVPNDITFGTLV 322 Query: 1272 DGLVRKGRAVDGVHVMVSMEERGHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHK 1451 DG V+ GRA+DGV V+VS+EE+G++GN+ YSSL+SGLFKEG+ E + LWK+++E G K Sbjct: 323 DGFVKHGRALDGVRVLVSLEEKGYRGNEFSYSSLISGLFKEGKGEHGMQLWKEMVEKGCK 382 Query: 1452 PNTVVYSALIDGLCRVGKPFEAKEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLL 1631 PNT+VYSALIDGLCR GKP EAKE L EM NKG PN++TYSSLM G+F+ G+ + A+L+ Sbjct: 383 PNTIVYSALIDGLCREGKPDEAKEYLIEMKNKGHTPNSFTYSSLMWGYFEAGDIHKAILV 442 Query: 1632 WKEMTEKGCVHNEFCYSILIHGLCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCS 1811 WKEMT+ C H+E CYSILI+GLC GKLKEA +VW+ ML +G DVVAY+SMIHG C+ Sbjct: 443 WKEMTDNDCNHHEVCYSILINGLCKNGKLKEALIVWKQMLSRGIKLDVVAYSSMIHGFCN 502 Query: 1812 VGSVEQGLLLFNEMLYKGSDSQPDVFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDI 1991 VEQG+ LFN+ML QPDV TYN+L NA C ++ AI +LN MLD+GCDPD Sbjct: 503 AQLVEQGMKLFNQMLCHNPKLQPDVVTYNILLNAFCTKNSVSRAIDILNTMLDQGCDPDF 562 Query: 1992 VTCNIFLTTFKEKMNPSQDGREFLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWE 2171 +TC+IFL T ++ M+P QDGREFLDEL +RL KRQR GAS IIEVMLQKFL PKPSTW Sbjct: 563 ITCDIFLKTLRDNMDPPQDGREFLDELVVRLIKRQRTVGASNIIEVMLQKFLLPKPSTWA 622 Query: 2172 IAIRELCKPRKIQVAINKC 2228 +A+++LCKP K++ I++C Sbjct: 623 LAVQQLCKPMKVRKTISEC 641