BLASTX nr result
ID: Ephedra25_contig00000547
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Ephedra25_contig00000547
         (2982 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002511505.1| pentatricopeptide repeat-containing protein,...   738   0.0
gb|EMJ11573.1| hypothetical protein PRUPE_ppa001256mg [Prunus pe...   726   0.0
gb|AAF79278.1|AC068602_1 F14D16.2 [Arabidopsis thaliana]              724   0.0
ref|NP_173324.1| pentatricopeptide repeat-containing protein [Ar...   724   0.0
gb|ESW29652.1| hypothetical protein PHAVU_002G087700g [Phaseolus...   723   0.0
ref|XP_002890305.1| pentatricopeptide repeat-containing protein ...   719   0.0
ref|XP_003525037.2| PREDICTED: pentatricopeptide repeat-containi...   718   0.0
ref|XP_003527053.1| PREDICTED: pentatricopeptide repeat-containi...   716   0.0
ref|XP_002266698.1| PREDICTED: pentatricopeptide repeat-containi...   715   0.0
ref|XP_006416553.1| hypothetical protein EUTSA_v10009547mg [Eutr...   715   0.0
gb|EOY21688.1| Pentatricopeptide repeat-containing protein, puta...   714   0.0
ref|NP_001185030.1| pentatricopeptide repeat-containing protein ...   714   0.0
ref|XP_002321537.1| pentatricopeptide repeat-containing family p...   713   0.0
ref|XP_006476670.1| PREDICTED: pentatricopeptide repeat-containi...   712   0.0
ref|XP_006439668.1| hypothetical protein CICLE_v10018829mg [Citr...   712   0.0
ref|XP_006306742.1| hypothetical protein CARUB_v10008274mg [Caps...   710   0.0
ref|XP_004299605.1| PREDICTED: pentatricopeptide repeat-containi...   710   0.0
gb|EXC34220.1| hypothetical protein L484_010090 [Morus notabilis]     709   0.0
ref|XP_003523047.2| PREDICTED: pentatricopeptide repeat-containi...
709 0.0 ref|XP_006578589.1| PREDICTED: pentatricopeptide repeat-containi... 709 0.0 >ref|XP_002511505.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223550620|gb|EEF52107.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 876 Score = 738 bits (1904), Expect = 0.0 Identities = 350/565 (61%), Positives = 444/565 (78%) Frame = +3 Query: 1014 SNNGNLVDYVSTILRQNRWGPATLEALKNLNVTLNVYQVNQILKLQQDPDLALNFFFWAK 1193 ++NG++V+ V+ ILRQ RWGPA EAL NLN +++ YQ NQ+LK QD +ALNFF+W K Sbjct: 312 ASNGHIVENVAHILRQIRWGPAAEEALANLNYSMDPYQANQVLKQLQDHTVALNFFYWLK 371 Query: 1194 QQEGFRHDEHSYTTMIGILGNSRKFDAIDALLGEMINDGCRPTVVTYNRLIHSYGRANHI 1373 +Q GF HD H+YTTM+GILG +++F AI+ LL +M+ DGC+P VVTYNRLIHSYGRAN++ Sbjct: 372 RQPGFNHDGHTYTTMVGILGRAKQFGAINKLLDQMVKDGCQPNVVTYNRLIHSYGRANYL 431 Query: 1374 REAVDVFHEMQAIGCMPDRTTYGTLINIHAKAGYLDQALRMYSEMQEAGLSPDSFVYSVM 1553 +AVDVF+EMQ +GC PDR TY TLI+IHAKAG+LD AL MY MQ AGLSPD+F YSV+ Sbjct: 432 NDAVDVFNEMQRVGCEPDRVTYCTLIDIHAKAGFLDFALEMYQRMQAAGLSPDTFTYSVI 491 Query: 1554 INCLGKSGDLSAAYKLFCEMTERGCVPTLVTYNNMIDLHAKARKYTIALKLYQDMQNAGF 1733 INCLGK+G L+AA+KLFCEM E+GCVP LVTYN MI L AKAR Y ALKLY+DMQ+AGF Sbjct: 492 INCLGKAGHLAAAHKLFCEMVEQGCVPNLVTYNIMIALQAKARNYQSALKLYRDMQSAGF 551 Query: 1734 QPDKYTFNVVMEVHKYCGRYNEAESIFNEMQKAGWMPDEVVYGVLVDMWGKAGNVQRAFQ 1913 QPDK T+++VMEV +CG +EAE++F+EM++ W+PDE VYG+LVD+WGKAGNV++A+Q Sbjct: 552 QPDKVTYSIVMEVLGHCGYLDEAEAVFSEMKRKNWVPDEPVYGLLVDLWGKAGNVEKAWQ 611 Query: 1914 WFSSMLHSGIRPNVPTCNSLLGAFLRANMFNEALMILQEMLQMNLCPSLQTYTLLISSCT 2093 W+ +ML++G+RPNVPTCNSLL AFLR + +A +LQ ML++ L PSLQTYTLL+S CT Sbjct: 612 WYQTMLNTGLRPNVPTCNSLLSAFLRVHKLADAYNLLQSMLELGLNPSLQTYTLLLSCCT 671 Query: 2094 TNGAHQDMYLFSTLMANTGHPAHSFLISLPTAEPGGQNIMNHAAGFFDLIHSEDRESKRG 2273 + DM ++ LMA TGHPAH FL+SLP+A P GQN+ +HA+ F DL+HSEDRESKRG Sbjct: 672 EARSPYDMGIYCELMAVTGHPAHMFLLSLPSAGPDGQNVRDHASKFLDLMHSEDRESKRG 731 Query: 2274 FTDAVVNFLYSSDLKEEAGFVWEVAMERNLYPNAVTIKAPKYWSINLHVMSMGTXXXXXX 2453 DAVV+FL+ S LKEEAG VWEVA +RN+YP+AV K YW 
INLHVMS GT Sbjct: 732 LVDAVVDFLHKSGLKEEAGSVWEVAAQRNVYPDAVKEKGSCYWLINLHVMSDGTAVTALS 791 Query: 2454 XXXXXXXEKMFMSGVEPDRIDIITGWGKRSRVMGSSLVKQSVEQMLSVFHSPFSLENGNS 2633 ++M +SG+ P RIDI+TGWG+RSRV GSS+V+Q+V+++L +F PF ENGNS Sbjct: 792 RTLAWFRQQMLVSGISPSRIDIVTGWGRRSRVTGSSMVRQAVQELLHIFSFPFFTENGNS 851 Query: 2634 GCFVGFGRPLIEWMNASPLERMHLL 2708 GCFVG G PL W+ ++RMHLL Sbjct: 852 GCFVGCGEPLNRWLLQPYVDRMHLL 876 >gb|EMJ11573.1| hypothetical protein PRUPE_ppa001256mg [Prunus persica] Length = 870 Score = 726 bits (1875), Expect = 0.0 Identities = 347/564 (61%), Positives = 438/564 (77%) Frame = +3 Query: 1017 NNGNLVDYVSTILRQNRWGPATLEALKNLNVTLNVYQVNQILKLQQDPDLALNFFFWAKQ 1196 + GN+V VS IL+Q RWGPA AL NLN +++ YQ NQILK QD +AL+FF+W K+ Sbjct: 307 HTGNVVQNVSHILQQMRWGPAAEAALLNLNCSMDAYQANQILKQLQDHSVALSFFYWLKR 366 Query: 1197 QEGFRHDEHSYTTMIGILGNSRKFDAIDALLGEMINDGCRPTVVTYNRLIHSYGRANHIR 1376 Q GF+HD H+YTTM+GILG SR+F AI+ LL +M+ +GC+P VVTYNRLIHSYGRAN+++ Sbjct: 367 QAGFKHDGHTYTTMVGILGRSRQFGAINKLLNQMVKEGCQPNVVTYNRLIHSYGRANYLK 426 Query: 1377 EAVDVFHEMQAIGCMPDRTTYGTLINIHAKAGYLDQALRMYSEMQEAGLSPDSFVYSVMI 1556 EA++VF++MQ GC PDR TY TLI+IHAKAG+LD ALR+Y MQEAGLSPD+F YSVMI Sbjct: 427 EAMNVFNQMQEAGCEPDRVTYCTLIDIHAKAGFLDVALRLYDGMQEAGLSPDTFTYSVMI 486 Query: 1557 NCLGKSGDLSAAYKLFCEMTERGCVPTLVTYNNMIDLHAKARKYTIALKLYQDMQNAGFQ 1736 NCLGK+G L+AA++LFCEM +GCVP LVTYN MI L AKAR Y ALKLY+DMQ AGF+ Sbjct: 487 NCLGKAGHLAAAHRLFCEMVNQGCVPNLVTYNIMIALQAKARNYETALKLYRDMQGAGFE 546 Query: 1737 PDKYTFNVVMEVHKYCGRYNEAESIFNEMQKAGWMPDEVVYGVLVDMWGKAGNVQRAFQW 1916 PDK T+++VMEV +CG EAE+IF EM++ W+PDE VYG+LVD+WGKAGNV +A+ W Sbjct: 547 PDKVTYSIVMEVLGHCGYLEEAEAIFGEMKRKNWVPDEPVYGLLVDLWGKAGNVGKAWNW 606 Query: 1917 FSSMLHSGIRPNVPTCNSLLGAFLRANMFNEALMILQEMLQMNLCPSLQTYTLLISSCTT 2096 + +MLH+G+RPNVPTCNSLL AFLR + ++A +LQ M+ + L PSLQTYTLL+S CT Sbjct: 607 YQAMLHAGLRPNVPTCNSLLSAFLRVHQLSDAYNLLQSMMGLGLNPSLQTYTLLLSCCTE 666 Query: 2097 NGAHQDMYLFSTLMANTGHPAHSFLISLPTAEPGGQNIMNHAAGFFDLIHSEDRESKRGF 2276 + DM LMA TGHPAH+FL+S+P+A P GQN+ H + F DL+HSEDRESKRG Sbjct: 
667 ARSPYDMDFCCELMAVTGHPAHTFLLSMPSAGPDGQNVREHMSRFLDLMHSEDRESKRGL 726 Query: 2277 TDAVVNFLYSSDLKEEAGFVWEVAMERNLYPNAVTIKAPKYWSINLHVMSMGTXXXXXXX 2456 DAVV+FL+ S LKEEAG VWEVA ++N+YP+A+ K+ YW INLHVMS GT Sbjct: 727 VDAVVDFLHKSGLKEEAGSVWEVAAQKNVYPDAIREKSSCYWLINLHVMSDGTAVTALSR 786 Query: 2457 XXXXXXEKMFMSGVEPDRIDIITGWGKRSRVMGSSLVKQSVEQMLSVFHSPFSLENGNSG 2636 ++M +SG+ P RIDI+TGWG+RSRV GSSLV+Q+VE++L++F PF ENGNSG Sbjct: 787 TLAWFRQQMLISGICPSRIDIVTGWGRRSRVTGSSLVRQAVEELLNMFSFPFFTENGNSG 846 Query: 2637 CFVGFGRPLIEWMNASPLERMHLL 2708 CFVG G PL +W+ S +ERMHLL Sbjct: 847 CFVGCGEPLNKWLLQSYVERMHLL 870 >gb|AAF79278.1|AC068602_1 F14D16.2 [Arabidopsis thaliana] Length = 977 Score = 724 bits (1870), Expect = 0.0 Identities = 368/714 (51%), Positives = 488/714 (68%), Gaps = 12/714 (1%) Frame = +3 Query: 603 PDYLYGVKRPGNPSSCVNNPTL-LNSRRYYCIPNEQTKHFV-------SQESHLLPDING 758 P Y G G P SC+ +PT ++S + + + +HF ++ES + N Sbjct: 272 PSYDGGSDAFGLPKSCMVDPTRPISSVKSSNVKAIRREHFAKIYPRSAAKESSVGTTRNP 331 Query: 759 RDHLQYQITSVPPTGAVPLYHKESHNLQNNNVCSTGNVNG--TTDASRPYTNEN--CPGN 926 + + TG V + + S+++ ++ +T N G T+ RP+ + N P Sbjct: 332 SSNFR-GAKEAERTGFVKGFRQVSNSVVGKSLPTTNNTYGKRTSVLQRPHIDSNRFVPSG 390 Query: 927 YGNNRGRTXXXXXXXXXXXXXXLYQQFPVSNNGNLVDYVSTILRQNRWGPATLEALKNLN 1106 + N+ +Q+ N+G++V+ VS++LR+ RWGPA EAL+NL Sbjct: 391 FSNSSVEMMKGPSGTALTS-----RQY--CNSGHIVENVSSVLRRFRWGPAAEEALQNLG 443 Query: 1107 VTLNVYQVNQILKLQQDPDLALNFFFWAKQQEGFRHDEHSYTTMIGILGNSRKFDAIDAL 1286 + ++ YQ NQ+LK D AL FF+W K+Q GF+HD H+YTTM+G LG +++F AI+ L Sbjct: 444 LRIDAYQANQVLKQMNDYGNALGFFYWLKRQPGFKHDGHTYTTMVGNLGRAKQFGAINKL 503 Query: 1287 LGEMINDGCRPTVVTYNRLIHSYGRANHIREAVDVFHEMQAIGCMPDRTTYGTLINIHAK 1466 L EM+ DGC+P VTYNRLIHSYGRAN++ EA++VF++MQ GC PDR TY TLI+IHAK Sbjct: 504 LDEMVRDGCQPNTVTYNRLIHSYGRANYLNEAMNVFNQMQEAGCKPDRVTYCTLIDIHAK 563 Query: 1467 AGYLDQALRMYSEMQEAGLSPDSFVYSVMINCLGKSGDLSAAYKLFCEMTERGCVPTLVT 1646 AG+LD A+ MY MQ GLSPD+F YSV+INCLGK+G L AA+KLFCEM ++GC P LVT Sbjct: 564 AGFLDIAMDMYQRMQAGGLSPDTFTYSVIINCLGKAGHLPAAHKLFCEMVDQGCTPNLVT 623 
Query: 1647 YNNMIDLHAKARKYTIALKLYQDMQNAGFQPDKYTFNVVMEVHKYCGRYNEAESIFNEMQ 1826 YN M+DLHAKAR Y ALKLY+DMQNAGF+PDK T+++VMEV +CG EAE++F EMQ Sbjct: 624 YNIMMDLHAKARNYQNALKLYRDMQNAGFEPDKVTYSIVMEVLGHCGYLEEAEAVFTEMQ 683 Query: 1827 KAGWMPDEVVYGVLVDMWGKAGNVQRAFQWFSSMLHSGIRPNVPTCNSLLGAFLRANMFN 2006 + W+PDE VYG+LVD+WGKAGNV++A+QW+ +MLH+G+RPNVPTCNSLL FLR N Sbjct: 684 QKNWIPDEPVYGLLVDLWGKAGNVEKAWQWYQAMLHAGLRPNVPTCNSLLSTFLRVNKIA 743 Query: 2007 EALMILQEMLQMNLCPSLQTYTLLISSCTTNGAHQDMYLFSTLMANTGHPAHSFLISLPT 2186 EA +LQ ML + L PSLQTYTLL+S CT + DM LMA+TGHPAH FL+ +P Sbjct: 744 EAYELLQNMLALGLRPSLQTYTLLLSCCTDGRSKLDMGFCGQLMASTGHPAHMFLLKMPA 803 Query: 2187 AEPGGQNIMNHAAGFFDLIHSEDRESKRGFTDAVVNFLYSSDLKEEAGFVWEVAMERNLY 2366 A P G+N+ NHA F DL+HSEDRESKRG DAVV+FL+ S KEEAG VWEVA ++N++ Sbjct: 804 AGPDGENVRNHANNFLDLMHSEDRESKRGLVDAVVDFLHKSGQKEEAGSVWEVAAQKNVF 863 Query: 2367 PNAVTIKAPKYWSINLHVMSMGTXXXXXXXXXXXXXEKMFMSGVEPDRIDIITGWGKRSR 2546 P+A+ K+ YW INLHVMS GT ++M SG P RIDI+TGWG+RSR Sbjct: 864 PDALREKSCSYWLINLHVMSEGTAVTALSRTLAWFRKQMLASGTCPSRIDIVTGWGRRSR 923 Query: 2547 VMGSSLVKQSVEQMLSVFHSPFSLENGNSGCFVGFGRPLIEWMNASPLERMHLL 2708 V G+S+V+Q+VE++L++F SPF E+GNSGCFVG G PL W+ S +ERMHLL Sbjct: 924 VTGTSMVRQAVEELLNIFGSPFFTESGNSGCFVGSGEPLNRWLLQSHVERMHLL 977 >ref|NP_173324.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|42571539|ref|NP_973860.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75151479|sp|Q8GYP6.1|PPR49_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At1g18900 gi|26450017|dbj|BAC42129.1| unknown protein [Arabidopsis thaliana] gi|28827402|gb|AAO50545.1| unknown protein [Arabidopsis thaliana] gi|332191657|gb|AEE29778.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332191658|gb|AEE29779.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 860 Score = 724 bits (1870), Expect = 0.0 Identities = 368/714 (51%), Positives = 488/714 (68%), Gaps = 12/714 (1%) Frame = +3 Query: 603 
PDYLYGVKRPGNPSSCVNNPTL-LNSRRYYCIPNEQTKHFV-------SQESHLLPDING 758 P Y G G P SC+ +PT ++S + + + +HF ++ES + N Sbjct: 155 PSYDGGSDAFGLPKSCMVDPTRPISSVKSSNVKAIRREHFAKIYPRSAAKESSVGTTRNP 214 Query: 759 RDHLQYQITSVPPTGAVPLYHKESHNLQNNNVCSTGNVNG--TTDASRPYTNEN--CPGN 926 + + TG V + + S+++ ++ +T N G T+ RP+ + N P Sbjct: 215 SSNFR-GAKEAERTGFVKGFRQVSNSVVGKSLPTTNNTYGKRTSVLQRPHIDSNRFVPSG 273 Query: 927 YGNNRGRTXXXXXXXXXXXXXXLYQQFPVSNNGNLVDYVSTILRQNRWGPATLEALKNLN 1106 + N+ +Q+ N+G++V+ VS++LR+ RWGPA EAL+NL Sbjct: 274 FSNSSVEMMKGPSGTALTS-----RQY--CNSGHIVENVSSVLRRFRWGPAAEEALQNLG 326 Query: 1107 VTLNVYQVNQILKLQQDPDLALNFFFWAKQQEGFRHDEHSYTTMIGILGNSRKFDAIDAL 1286 + ++ YQ NQ+LK D AL FF+W K+Q GF+HD H+YTTM+G LG +++F AI+ L Sbjct: 327 LRIDAYQANQVLKQMNDYGNALGFFYWLKRQPGFKHDGHTYTTMVGNLGRAKQFGAINKL 386 Query: 1287 LGEMINDGCRPTVVTYNRLIHSYGRANHIREAVDVFHEMQAIGCMPDRTTYGTLINIHAK 1466 L EM+ DGC+P VTYNRLIHSYGRAN++ EA++VF++MQ GC PDR TY TLI+IHAK Sbjct: 387 LDEMVRDGCQPNTVTYNRLIHSYGRANYLNEAMNVFNQMQEAGCKPDRVTYCTLIDIHAK 446 Query: 1467 AGYLDQALRMYSEMQEAGLSPDSFVYSVMINCLGKSGDLSAAYKLFCEMTERGCVPTLVT 1646 AG+LD A+ MY MQ GLSPD+F YSV+INCLGK+G L AA+KLFCEM ++GC P LVT Sbjct: 447 AGFLDIAMDMYQRMQAGGLSPDTFTYSVIINCLGKAGHLPAAHKLFCEMVDQGCTPNLVT 506 Query: 1647 YNNMIDLHAKARKYTIALKLYQDMQNAGFQPDKYTFNVVMEVHKYCGRYNEAESIFNEMQ 1826 YN M+DLHAKAR Y ALKLY+DMQNAGF+PDK T+++VMEV +CG EAE++F EMQ Sbjct: 507 YNIMMDLHAKARNYQNALKLYRDMQNAGFEPDKVTYSIVMEVLGHCGYLEEAEAVFTEMQ 566 Query: 1827 KAGWMPDEVVYGVLVDMWGKAGNVQRAFQWFSSMLHSGIRPNVPTCNSLLGAFLRANMFN 2006 + W+PDE VYG+LVD+WGKAGNV++A+QW+ +MLH+G+RPNVPTCNSLL FLR N Sbjct: 567 QKNWIPDEPVYGLLVDLWGKAGNVEKAWQWYQAMLHAGLRPNVPTCNSLLSTFLRVNKIA 626 Query: 2007 EALMILQEMLQMNLCPSLQTYTLLISSCTTNGAHQDMYLFSTLMANTGHPAHSFLISLPT 2186 EA +LQ ML + L PSLQTYTLL+S CT + DM LMA+TGHPAH FL+ +P Sbjct: 627 EAYELLQNMLALGLRPSLQTYTLLLSCCTDGRSKLDMGFCGQLMASTGHPAHMFLLKMPA 686 Query: 2187 AEPGGQNIMNHAAGFFDLIHSEDRESKRGFTDAVVNFLYSSDLKEEAGFVWEVAMERNLY 2366 A P G+N+ NHA F DL+HSEDRESKRG DAVV+FL+ S KEEAG VWEVA ++N++ Sbjct: 687 
AGPDGENVRNHANNFLDLMHSEDRESKRGLVDAVVDFLHKSGQKEEAGSVWEVAAQKNVF 746 Query: 2367 PNAVTIKAPKYWSINLHVMSMGTXXXXXXXXXXXXXEKMFMSGVEPDRIDIITGWGKRSR 2546 P+A+ K+ YW INLHVMS GT ++M SG P RIDI+TGWG+RSR Sbjct: 747 PDALREKSCSYWLINLHVMSEGTAVTALSRTLAWFRKQMLASGTCPSRIDIVTGWGRRSR 806 Query: 2547 VMGSSLVKQSVEQMLSVFHSPFSLENGNSGCFVGFGRPLIEWMNASPLERMHLL 2708 V G+S+V+Q+VE++L++F SPF E+GNSGCFVG G PL W+ S +ERMHLL Sbjct: 807 VTGTSMVRQAVEELLNIFGSPFFTESGNSGCFVGSGEPLNRWLLQSHVERMHLL 860 >gb|ESW29652.1| hypothetical protein PHAVU_002G087700g [Phaseolus vulgaris] Length = 881 Score = 723 bits (1865), Expect = 0.0 Identities = 345/565 (61%), Positives = 439/565 (77%) Frame = +3 Query: 1014 SNNGNLVDYVSTILRQNRWGPATLEALKNLNVTLNVYQVNQILKLQQDPDLALNFFFWAK 1193 +N+G++VD V +LRQ +WGPAT +AL NLN +++ YQ NQILK QD +AL+FF+W K Sbjct: 317 TNSGHVVDMVKDMLRQLKWGPATEKALCNLNFSIDAYQANQILKQLQDHSVALSFFYWLK 376 Query: 1194 QQEGFRHDEHSYTTMIGILGNSRKFDAIDALLGEMINDGCRPTVVTYNRLIHSYGRANHI 1373 Q GF HD H+YTTM+GILG +R+F AI+ LL +M+ DGC+P VVTYNRLIHSYGRAN++ Sbjct: 377 LQPGFWHDGHTYTTMVGILGRAREFGAINKLLEQMVKDGCQPNVVTYNRLIHSYGRANYL 436 Query: 1374 REAVDVFHEMQAIGCMPDRTTYGTLINIHAKAGYLDQALRMYSEMQEAGLSPDSFVYSVM 1553 REA++VF++MQ +GC PDR TY TLI+IHAKAG+LD A+ MY MQE GLSPD+F YSVM Sbjct: 437 REALNVFNQMQKMGCEPDRVTYCTLIDIHAKAGFLDVAMSMYERMQEVGLSPDTFTYSVM 496 Query: 1554 INCLGKSGDLSAAYKLFCEMTERGCVPTLVTYNNMIDLHAKARKYTIALKLYQDMQNAGF 1733 INCLGKSG+LSAA++LFCEM E+GCVP +VTYN +I L AKAR Y ALKLY+DMQNAGF Sbjct: 497 INCLGKSGNLSAAHRLFCEMVEQGCVPNIVTYNILIALQAKARNYQTALKLYRDMQNAGF 556 Query: 1734 QPDKYTFNVVMEVHKYCGRYNEAESIFNEMQKAGWMPDEVVYGVLVDMWGKAGNVQRAFQ 1913 +PDK T+++VMEV +CG EAE++F +M++ W+PDE VYG+L+D+WGKAGNV++A++ Sbjct: 557 KPDKVTYSIVMEVLGHCGYLEEAEAVFIKMKQNNWIPDEPVYGLLIDLWGKAGNVEKAWE 616 Query: 1914 WFSSMLHSGIRPNVPTCNSLLGAFLRANMFNEALMILQEMLQMNLCPSLQTYTLLISSCT 2093 W+ +M+ +G+ PNVPTCNSLL AFLR + +A +LQ M+ + L PSLQTYTLL+S CT Sbjct: 617 WYQAMVRAGLLPNVPTCNSLLSAFLRVHRLPDAYNLLQNMVALGLNPSLQTYTLLLSCCT 676 Query: 2094 TNGAHQDMYLFSTLMANTGHPAHSFLISLPTAEPGGQNIMNHAAGFFDLIHSEDRESKRG 
2273 + DM LMA TGHPAH+FL S+P A P GQN+ +H + F DL+HSEDRE KRG Sbjct: 677 EAQSMYDMCFCRELMAVTGHPAHTFLQSMPAAGPDGQNVRDHVSRFLDLMHSEDREGKRG 736 Query: 2274 FTDAVVNFLYSSDLKEEAGFVWEVAMERNLYPNAVTIKAPKYWSINLHVMSMGTXXXXXX 2453 DAVV+FL+ S LKEEAG VWEVA ++N+YP+AV K+ YW INLHVMS GT Sbjct: 737 LVDAVVDFLHKSGLKEEAGSVWEVAAQKNVYPDAVKEKSSCYWLINLHVMSDGTAVTALS 796 Query: 2454 XXXXXXXEKMFMSGVEPDRIDIITGWGKRSRVMGSSLVKQSVEQMLSVFHSPFSLENGNS 2633 +KM SGV P+RIDIITGWG+RSRV GSSLV+Q+V ++L++F PF ENGNS Sbjct: 797 RTLASFRQKMLTSGVGPNRIDIITGWGRRSRVTGSSLVRQTVHELLNLFSFPFFTENGNS 856 Query: 2634 GCFVGFGRPLIEWMNASPLERMHLL 2708 GCFVG G PL +W+N S +ERMHLL Sbjct: 857 GCFVGCGEPLSQWLNHSYVERMHLL 881 >ref|XP_002890305.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297336147|gb|EFH66564.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 860 Score = 719 bits (1856), Expect = 0.0 Identities = 367/714 (51%), Positives = 483/714 (67%), Gaps = 12/714 (1%) Frame = +3 Query: 603 PDYLYGVKRPGNPSSCVNNPTL----LNSRRYYCIPNEQTKHF----VSQESHLLPDING 758 P Y G + G P SC+ +PT + S I EQ ++ES + N Sbjct: 155 PSYDGGSEAFGLPKSCMVDPTRPISSVKSSSVKAIRREQFSKVYPRSAAKESSIGKTRNP 214 Query: 759 RDHLQYQITSVPPTGAVPLYHKESHNLQNNNVCSTGNVNG--TTDASRPYTNEN--CPGN 926 + + TG V + + S+++ ++ +T N G T+ RP+ + N P Sbjct: 215 SSNFR-GAKEAERTGFVKGFRQVSNSMVGKSLPTTNNTYGKRTSVLQRPHIDSNRFVPSG 273 Query: 927 YGNNRGRTXXXXXXXXXXXXXXLYQQFPVSNNGNLVDYVSTILRQNRWGPATLEALKNLN 1106 + N+ +Q+ N+G +V+ VS++LR+ RWGPA EAL+NL Sbjct: 274 FSNSSMEMVKGPPGTALTS-----RQY--CNSGYIVENVSSVLRRFRWGPAAEEALQNLG 326 Query: 1107 VTLNVYQVNQILKLQQDPDLALNFFFWAKQQEGFRHDEHSYTTMIGILGNSRKFDAIDAL 1286 + ++ YQ NQ+LK D AL FF+W K+Q GF+HD H+YTTM+G LG +++F AI+ L Sbjct: 327 LRIDAYQANQVLKQMNDYGNALGFFYWLKRQPGFKHDGHTYTTMVGNLGRAKQFGAINKL 386 Query: 1287 LGEMINDGCRPTVVTYNRLIHSYGRANHIREAVDVFHEMQAIGCMPDRTTYGTLINIHAK 1466 L EM+ DGC+P VTYNRLIHSYGRAN++ EA++VF++MQ GC PDR TY TLI+IHAK Sbjct: 387 LDEMVRDGCQPNTVTYNRLIHSYGRANYLNEAMNVFNQMQEAGCKPDRVTYCTLIDIHAK 446 
Query: 1467 AGYLDQALRMYSEMQEAGLSPDSFVYSVMINCLGKSGDLSAAYKLFCEMTERGCVPTLVT 1646 AG+LD A+ MY MQ GLSPD+F YSV+INCLGK+G L AA+KLFCEM ++GC P LVT Sbjct: 447 AGFLDIAMDMYQRMQAGGLSPDTFTYSVIINCLGKAGHLPAAHKLFCEMVDQGCTPNLVT 506 Query: 1647 YNNMIDLHAKARKYTIALKLYQDMQNAGFQPDKYTFNVVMEVHKYCGRYNEAESIFNEMQ 1826 YN M+DLHAKAR Y ALKLY+DMQNAGF+PDK T+++VMEV +CG EAE++F EMQ Sbjct: 507 YNIMMDLHAKARNYQSALKLYRDMQNAGFEPDKVTYSIVMEVLGHCGYLEEAEAVFTEMQ 566 Query: 1827 KAGWMPDEVVYGVLVDMWGKAGNVQRAFQWFSSMLHSGIRPNVPTCNSLLGAFLRANMFN 2006 + W+PDE VYG+LVD+WGKAGNV++A+QW+ +MLH+G+ PNVPTCNSLL FLR N Sbjct: 567 QKNWIPDEPVYGLLVDLWGKAGNVEKAWQWYQAMLHAGLLPNVPTCNSLLSTFLRVNKIA 626 Query: 2007 EALMILQEMLQMNLCPSLQTYTLLISSCTTNGAHQDMYLFSTLMANTGHPAHSFLISLPT 2186 EA +LQ ML + L PSLQTYTLL+S CT + DM LMA+TGHPAH FL+ +P Sbjct: 627 EAYELLQNMLALGLRPSLQTYTLLLSCCTDGRSKLDMGYCGQLMASTGHPAHMFLLKMPA 686 Query: 2187 AEPGGQNIMNHAAGFFDLIHSEDRESKRGFTDAVVNFLYSSDLKEEAGFVWEVAMERNLY 2366 A P G+N+ NHA F DL+HSEDRESKRG DAVV+FL+ S KEEAG VWEVA ++N++ Sbjct: 687 AGPNGENVRNHANNFLDLMHSEDRESKRGLVDAVVDFLHKSGQKEEAGSVWEVAAQKNVF 746 Query: 2367 PNAVTIKAPKYWSINLHVMSMGTXXXXXXXXXXXXXEKMFMSGVEPDRIDIITGWGKRSR 2546 P+A+ K+ YW INLHVMS GT +M +SG P RIDI+TGWG+RSR Sbjct: 747 PDALREKSCSYWLINLHVMSEGTAVTALSRTLAWFRRQMLVSGTCPSRIDIVTGWGRRSR 806 Query: 2547 VMGSSLVKQSVEQMLSVFHSPFSLENGNSGCFVGFGRPLIEWMNASPLERMHLL 2708 V G+S+V+Q+VE++L++F SPF E+GNSGCFVG G L +W+ S +ERMHLL Sbjct: 807 VTGTSMVRQAVEELLNIFGSPFFTESGNSGCFVGCGESLNKWLLQSHVERMHLL 860 >ref|XP_003525037.2| PREDICTED: pentatricopeptide repeat-containing protein At1g74750-like [Glycine max] Length = 876 Score = 718 bits (1853), Expect = 0.0 Identities = 340/563 (60%), Positives = 428/563 (76%) Frame = +3 Query: 1020 NGNLVDYVSTILRQNRWGPATLEALKNLNVTLNVYQVNQILKLQQDPDLALNFFFWAKQQ 1199 N +V+ VS ILRQ RWGP +AL NLN +++ YQ NQILK QDP +AL FF W ++Q Sbjct: 314 NRRIVEVVSDILRQLRWGPTAEKALYNLNFSMDAYQANQILKQLQDPSVALGFFDWLRRQ 373 Query: 1200 EGFRHDEHSYTTMIGILGNSRKFDAIDALLGEMINDGCRPTVVTYNRLIHSYGRANHIRE 1379 GFRHD H+YTTM+GILG +R+FD+I LL +M+ DGC+P 
VVTYNRLIH YG AN+++E Sbjct: 374 PGFRHDGHTYTTMVGILGRARRFDSISKLLEQMVKDGCQPNVVTYNRLIHCYGCANYLKE 433 Query: 1380 AVDVFHEMQAIGCMPDRTTYGTLINIHAKAGYLDQALRMYSEMQEAGLSPDSFVYSVMIN 1559 A++VF+EMQ +GC PDR TY TLI+IHAKAG++D A+ MY MQEAGLSPD+F YSV+IN Sbjct: 434 ALNVFNEMQEVGCEPDRVTYCTLIDIHAKAGFIDVAMSMYKRMQEAGLSPDTFTYSVIIN 493 Query: 1560 CLGKSGDLSAAYKLFCEMTERGCVPTLVTYNNMIDLHAKARKYTIALKLYQDMQNAGFQP 1739 CLGK+G+L+AA+ LFCEM E GCVP LVTYN MI L AKAR Y +ALKLY DMQNAGFQP Sbjct: 494 CLGKAGNLAAAHWLFCEMVEHGCVPNLVTYNIMIALQAKARNYEMALKLYHDMQNAGFQP 553 Query: 1740 DKYTFNVVMEVHKYCGRYNEAESIFNEMQKAGWMPDEVVYGVLVDMWGKAGNVQRAFQWF 1919 DK T+++VME +CG EAES+F EMQ+ W+PDE VYG+LVD+WGKAGNV++A +W+ Sbjct: 554 DKVTYSIVMEALGHCGYLEEAESVFVEMQQKNWVPDEPVYGLLVDLWGKAGNVEKASEWY 613 Query: 1920 SSMLHSGIRPNVPTCNSLLGAFLRANMFNEALMILQEMLQMNLCPSLQTYTLLISSCTTN 2099 +ML++G+ PNVPTCNSLL AFLR + +A ++Q M+ + L PSLQTYTLL+S CT Sbjct: 614 QAMLNAGLLPNVPTCNSLLSAFLRLHRLPDAYNLVQSMVALGLRPSLQTYTLLLSCCTEA 673 Query: 2100 GAHQDMYLFSTLMANTGHPAHSFLISLPTAEPGGQNIMNHAAGFFDLIHSEDRESKRGFT 2279 DM F LMA TGHPAH+FL+S+P A P GQN+ +H + F D++H+EDRE KRG Sbjct: 674 QPAHDMGFFCELMAVTGHPAHAFLLSMPAAGPDGQNVRDHVSKFLDMMHTEDREGKRGLV 733 Query: 2280 DAVVNFLYSSDLKEEAGFVWEVAMERNLYPNAVTIKAPKYWSINLHVMSMGTXXXXXXXX 2459 D+VVNFL S LKEEAG VWE A +RN+YP+AV K+ +YW INLHVMS GT Sbjct: 734 DSVVNFLNKSGLKEEAGSVWEAAAQRNVYPDAVKEKSSRYWLINLHVMSDGTAVTALSRT 793 Query: 2460 XXXXXEKMFMSGVEPDRIDIITGWGKRSRVMGSSLVKQSVEQMLSVFHSPFSLENGNSGC 2639 ++M +SG+ P R+DIITGWG+RS+V GSSLV+Q+V+ +L F PF E GNSGC Sbjct: 794 LAWFRQRMLVSGIRPSRVDIITGWGRRSKVTGSSLVRQAVQDLLHTFSFPFLAEKGNSGC 853 Query: 2640 FVGFGRPLIEWMNASPLERMHLL 2708 FVG G PL +W+N S +ERMHLL Sbjct: 854 FVGCGEPLCQWLNHSYVERMHLL 876 >ref|XP_003527053.1| PREDICTED: pentatricopeptide repeat-containing protein At1g18900-like [Glycine max] Length = 882 Score = 716 bits (1849), Expect = 0.0 Identities = 343/565 (60%), Positives = 437/565 (77%) Frame = +3 Query: 1014 SNNGNLVDYVSTILRQNRWGPATLEALKNLNVTLNVYQVNQILKLQQDPDLALNFFFWAK 1193 +N+G++V+ V IL+Q RWGPAT +AL NLN 
+++ YQ NQILK QD +AL+FF+W K Sbjct: 318 TNSGHVVEGVKDILKQLRWGPATEKALYNLNFSIDAYQANQILKQLQDHSVALSFFYWLK 377 Query: 1194 QQEGFRHDEHSYTTMIGILGNSRKFDAIDALLGEMINDGCRPTVVTYNRLIHSYGRANHI 1373 +Q GF HD H+YTTM+GILG +R+F AI+ LL +M+ DGC+P VVTYNRLIHSYGRAN++ Sbjct: 378 RQPGFWHDGHTYTTMVGILGRAREFGAINKLLEQMVKDGCQPNVVTYNRLIHSYGRANYL 437 Query: 1374 REAVDVFHEMQAIGCMPDRTTYGTLINIHAKAGYLDQALRMYSEMQEAGLSPDSFVYSVM 1553 EA++VF++MQ +GC PDR TY TLI+IHAKAG+LD A+ MY MQE GLSPD+F YSVM Sbjct: 438 GEALNVFNQMQEMGCEPDRVTYCTLIDIHAKAGFLDVAMSMYERMQEVGLSPDTFTYSVM 497 Query: 1554 INCLGKSGDLSAAYKLFCEMTERGCVPTLVTYNNMIDLHAKARKYTIALKLYQDMQNAGF 1733 INCLGKSG+LSAA++LFCEM ++GCVP +VTYN +I L AKAR Y ALKLY+DMQNAGF Sbjct: 498 INCLGKSGNLSAAHRLFCEMVDQGCVPNIVTYNILIALQAKARNYQTALKLYRDMQNAGF 557 Query: 1734 QPDKYTFNVVMEVHKYCGRYNEAESIFNEMQKAGWMPDEVVYGVLVDMWGKAGNVQRAFQ 1913 +PDK T+++VMEV YCG EAE++F EM++ W+PDE VYG+L+D+WGKAGNV++A++ Sbjct: 558 KPDKVTYSIVMEVLGYCGYLEEAEAVFFEMKQNNWVPDEPVYGLLIDLWGKAGNVEKAWE 617 Query: 1914 WFSSMLHSGIRPNVPTCNSLLGAFLRANMFNEALMILQEMLQMNLCPSLQTYTLLISSCT 2093 W+ +ML +G+ PNVPTCNSLL AFLR + +A +LQ M+ + L PSLQTYTLL+S CT Sbjct: 618 WYHAMLRAGLLPNVPTCNSLLSAFLRVHRLPDAYNLLQNMVTLGLNPSLQTYTLLLSCCT 677 Query: 2094 TNGAHQDMYLFSTLMANTGHPAHSFLISLPTAEPGGQNIMNHAAGFFDLIHSEDRESKRG 2273 + DM LMA +GHPAH+FL S+P A P GQN+ +H + F DL+HSEDRE KRG Sbjct: 678 EAQSPYDMGFCCELMAVSGHPAHAFLQSMPAAGPDGQNVRDHVSKFLDLMHSEDREGKRG 737 Query: 2274 FTDAVVNFLYSSDLKEEAGFVWEVAMERNLYPNAVTIKAPKYWSINLHVMSMGTXXXXXX 2453 DAVV+FL+ S LKEEAG VWEVA ++N+YP+A+ K+ YW INLHVMS GT Sbjct: 738 LVDAVVDFLHKSGLKEEAGSVWEVAAQKNVYPDAIREKSTCYWLINLHVMSDGTAVTALS 797 Query: 2454 XXXXXXXEKMFMSGVEPDRIDIITGWGKRSRVMGSSLVKQSVEQMLSVFHSPFSLENGNS 2633 +M SGV P+RIDIITGWG+RSRV GSSLV+Q+V+++L VF PF ENGNS Sbjct: 798 RTLAWFRRQMLASGVGPNRIDIITGWGRRSRVTGSSLVRQAVQELLHVFSFPFFTENGNS 857 Query: 2634 GCFVGFGRPLIEWMNASPLERMHLL 2708 GCFVG G PL +W+ S +ERMHLL Sbjct: 858 GCFVGCGEPLSQWLVHSYVERMHLL 882 >ref|XP_002266698.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74750 [Vitis vinifera] 
Length = 875 Score = 715 bits (1846), Expect = 0.0 Identities = 342/564 (60%), Positives = 431/564 (76%) Frame = +3 Query: 1017 NNGNLVDYVSTILRQNRWGPATLEALKNLNVTLNVYQVNQILKLQQDPDLALNFFFWAKQ 1196 ++G++V+ VS ILRQ WGPA EAL+NLN ++ YQ NQ+LK QD +AL FF+W K+ Sbjct: 312 SSGHVVENVSRILRQLSWGPAAEEALRNLNCLMDAYQANQVLKQIQDHPVALGFFYWLKR 371 Query: 1197 QEGFRHDEHSYTTMIGILGNSRKFDAIDALLGEMINDGCRPTVVTYNRLIHSYGRANHIR 1376 Q GF+HD H+YTTM+GILG +R+F AI+ LL EM+ DGC+P VVTYNRLIHSYGRAN++ Sbjct: 372 QTGFKHDGHTYTTMVGILGRARQFGAINKLLAEMVRDGCQPNVVTYNRLIHSYGRANYLN 431 Query: 1377 EAVDVFHEMQAIGCMPDRTTYGTLINIHAKAGYLDQALRMYSEMQEAGLSPDSFVYSVMI 1556 EAV VF MQ GC PDR TY TLI+IHAKAG+LD AL MY +MQEA LSPD+F YSV+I Sbjct: 432 EAVSVFDRMQEAGCQPDRVTYCTLIDIHAKAGFLDVALHMYQKMQEAHLSPDTFTYSVII 491 Query: 1557 NCLGKSGDLSAAYKLFCEMTERGCVPTLVTYNNMIDLHAKARKYTIALKLYQDMQNAGFQ 1736 NCLGK+G L++A+KLFCEM ++GCVP LVTYN MI L AKAR Y AL+LY+DMQNAGFQ Sbjct: 492 NCLGKAGHLTSAHKLFCEMVDQGCVPNLVTYNIMIALQAKARNYPTALELYRDMQNAGFQ 551 Query: 1737 PDKYTFNVVMEVHKYCGRYNEAESIFNEMQKAGWMPDEVVYGVLVDMWGKAGNVQRAFQW 1916 PDK T+++VMEV +CG EAE+IF EM++ W+PDE VYG+LVD+WGK GNV+++++W Sbjct: 552 PDKVTYSIVMEVLGHCGHLEEAEAIFTEMKRKNWVPDEPVYGLLVDLWGKVGNVEKSWEW 611 Query: 1917 FSSMLHSGIRPNVPTCNSLLGAFLRANMFNEALMILQEMLQMNLCPSLQTYTLLISSCTT 2096 + +ML++G+ PNVPTCNSLL AFLR + ++A +LQ ML++ L PSLQTYTLL+S CT Sbjct: 612 YQAMLNAGLCPNVPTCNSLLSAFLRVHRLSDAYNLLQSMLRLGLQPSLQTYTLLLSCCTE 671 Query: 2097 NGAHQDMYLFSTLMANTGHPAHSFLISLPTAEPGGQNIMNHAAGFFDLIHSEDRESKRGF 2276 + DM LMA TGHPAH FL+S+P A P GQN+ +H + F DL+HSEDRESKRG Sbjct: 672 ARSSFDMGFCGELMAVTGHPAHMFLLSMPAAGPDGQNVRDHVSKFLDLMHSEDRESKRGL 731 Query: 2277 TDAVVNFLYSSDLKEEAGFVWEVAMERNLYPNAVTIKAPKYWSINLHVMSMGTXXXXXXX 2456 DAVV+FL+ S LKEEAG VWEVA ++N+YP+AV K+ YW INLH MS GT Sbjct: 732 VDAVVDFLHKSGLKEEAGSVWEVAAQKNVYPDAVREKSSCYWLINLHFMSDGTAVTALSR 791 Query: 2457 XXXXXXEKMFMSGVEPDRIDIITGWGKRSRVMGSSLVKQSVEQMLSVFHSPFSLENGNSG 2636 +M +SG P RIDI+TGWG+RSRV G+SLV+Q+V+++L +F PF ENGNSG Sbjct: 792 
TLAWFHREMLVSGTVPSRIDIVTGWGRRSRVTGASLVRQAVQELLHIFSFPFFTENGNSG 851 Query: 2637 CFVGFGRPLIEWMNASPLERMHLL 2708 CFVG G PL W+ S +ERMHLL Sbjct: 852 CFVGRGEPLGRWLLQSYVERMHLL 875 >ref|XP_006416553.1| hypothetical protein EUTSA_v10009547mg [Eutrema salsugineum] gi|557094324|gb|ESQ34906.1| hypothetical protein EUTSA_v10009547mg [Eutrema salsugineum] Length = 847 Score = 715 bits (1845), Expect = 0.0 Identities = 336/564 (59%), Positives = 433/564 (76%) Frame = +3 Query: 1017 NNGNLVDYVSTILRQNRWGPATLEALKNLNVTLNVYQVNQILKLQQDPDLALNFFFWAKQ 1196 N+G++V+ VS++LR+ RWGP +AL+NL + ++ YQ NQ+LK D AL FF+W K+ Sbjct: 284 NSGHIVENVSSVLRRFRWGPDAEDALQNLGLRMDPYQANQVLKQMNDHGNALGFFYWLKR 343 Query: 1197 QEGFRHDEHSYTTMIGILGNSRKFDAIDALLGEMINDGCRPTVVTYNRLIHSYGRANHIR 1376 Q GF+HD H+YTTM+G LG +++F AI+ LL EM+ DGC+P VTYNRLIHSYGRAN++ Sbjct: 344 QPGFKHDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQPNTVTYNRLIHSYGRANYLN 403 Query: 1377 EAVDVFHEMQAIGCMPDRTTYGTLINIHAKAGYLDQALRMYSEMQEAGLSPDSFVYSVMI 1556 EA++VF++MQ GC PDR TY TLI+IHAKAG+L+ A+ MY MQ AGLSPD+F YSV+I Sbjct: 404 EAMNVFNQMQEAGCRPDRVTYCTLIDIHAKAGFLEIAMDMYQRMQAAGLSPDTFTYSVII 463 Query: 1557 NCLGKSGDLSAAYKLFCEMTERGCVPTLVTYNNMIDLHAKARKYTIALKLYQDMQNAGFQ 1736 NCLGK+G L AA+KLFCEM ++GC P LVTYN MIDLHAKAR Y ALKLY+DMQNAGF+ Sbjct: 464 NCLGKAGHLPAAHKLFCEMVDQGCTPNLVTYNIMIDLHAKARNYQSALKLYRDMQNAGFE 523 Query: 1737 PDKYTFNVVMEVHKYCGRYNEAESIFNEMQKAGWMPDEVVYGVLVDMWGKAGNVQRAFQW 1916 PDK T+++VMEV +CG EAE++F EMQ W+PDE VYG+LVD+WGK+GNV++A+ W Sbjct: 524 PDKVTYSIVMEVLGHCGYLVEAEAVFTEMQDKNWIPDEPVYGLLVDLWGKSGNVEKAWHW 583 Query: 1917 FSSMLHSGIRPNVPTCNSLLGAFLRANMFNEALMILQEMLQMNLCPSLQTYTLLISSCTT 2096 + +MLH+G+RPNVPTCNSLL FLR NM EA +LQ ML + L PSLQTYTLL+S CT Sbjct: 584 YQAMLHAGLRPNVPTCNSLLSTFLRVNMIAEAYELLQNMLVLGLRPSLQTYTLLLSCCTD 643 Query: 2097 NGAHQDMYLFSTLMANTGHPAHSFLISLPTAEPGGQNIMNHAAGFFDLIHSEDRESKRGF 2276 + DM LMA+TGHPAH+FL+ +P A P GQN+ NH F +L+HSEDRESKRG Sbjct: 644 GRSKLDMGFCGQLMASTGHPAHTFLLKMPPAGPDGQNVRNHVNNFLELMHSEDRESKRGL 703 Query: 2277 
TDAVVNFLYSSDLKEEAGFVWEVAMERNLYPNAVTIKAPKYWSINLHVMSMGTXXXXXXX 2456 DAVV+FL+ S LKEEAG VWEVA ++N++P+A+ K+ YW INLHVMS GT Sbjct: 704 VDAVVDFLHKSGLKEEAGSVWEVAAQKNVFPDALREKSSSYWLINLHVMSEGTAITALSR 763 Query: 2457 XXXXXXEKMFMSGVEPDRIDIITGWGKRSRVMGSSLVKQSVEQMLSVFHSPFSLENGNSG 2636 ++M +SG P RIDI+TGWG+RSRV G+S+V+++VE++L++F SPF +ENGNSG Sbjct: 764 TLAWFRKQMLVSGSCPSRIDIVTGWGRRSRVTGTSMVRKAVEELLNIFGSPFFMENGNSG 823 Query: 2637 CFVGFGRPLIEWMNASPLERMHLL 2708 CFVG G L +W+ S +ERMHLL Sbjct: 824 CFVGSGESLNKWLLQSYVERMHLL 847 >gb|EOY21688.1| Pentatricopeptide repeat-containing protein, putative [Theobroma cacao] Length = 859 Score = 714 bits (1844), Expect = 0.0 Identities = 340/567 (59%), Positives = 434/567 (76%) Frame = +3 Query: 1008 PVSNNGNLVDYVSTILRQNRWGPATLEALKNLNVTLNVYQVNQILKLQQDPDLALNFFFW 1187 P++ ++ + VS IL+Q WGPA +AL+NLN +++ YQ NQ+LK QD +AL FF+W Sbjct: 293 PLAGTRHVTESVSHILQQLNWGPAAEQALENLNFSMDAYQANQVLKQIQDHTVALGFFYW 352 Query: 1188 AKQQEGFRHDEHSYTTMIGILGNSRKFDAIDALLGEMINDGCRPTVVTYNRLIHSYGRAN 1367 KQ+ GF+HD H+YTTM+GILG +R+F AI+ LL +M+ DGC+P VVTYNRLIHSYGRAN Sbjct: 353 LKQRAGFKHDGHTYTTMVGILGRARQFGAINRLLDQMVKDGCQPNVVTYNRLIHSYGRAN 412 Query: 1368 HIREAVDVFHEMQAIGCMPDRTTYGTLINIHAKAGYLDQALRMYSEMQEAGLSPDSFVYS 1547 +++EA++VF++MQ GC PDR TY TLI+IHAKAG+LD A+ +Y MQ GLSPD+F YS Sbjct: 413 YLKEAINVFNQMQEAGCEPDRVTYCTLIDIHAKAGFLDVAMDLYQRMQAVGLSPDTFTYS 472 Query: 1548 VMINCLGKSGDLSAAYKLFCEMTERGCVPTLVTYNNMIDLHAKARKYTIALKLYQDMQNA 1727 V+INCLGK+G L AA++LFCEM +GCVP LVTYN MI L AKAR Y ALKLY+DMQNA Sbjct: 473 VIINCLGKAGHLPAAHRLFCEMVGQGCVPNLVTYNIMIALQAKARNYESALKLYRDMQNA 532 Query: 1728 GFQPDKYTFNVVMEVHKYCGRYNEAESIFNEMQKAGWMPDEVVYGVLVDMWGKAGNVQRA 1907 GF PDK T+++VMEV + G +EAESIF EM+K W+PDE VYG+LVD+WGKAGNV++A Sbjct: 533 GFDPDKVTYSIVMEVLGHYGYLDEAESIFAEMKKKNWVPDEPVYGLLVDLWGKAGNVEKA 592 Query: 1908 FQWFSSMLHSGIRPNVPTCNSLLGAFLRANMFNEALMILQEMLQMNLCPSLQTYTLLISS 2087 +QW+ +MLH+G+RPNVPTCNSLL AFLR + ++A +LQ M+ + L PSLQTYTLL+S Sbjct: 593 WQWYQAMLHAGLRPNVPTCNSLLSAFLRVHRLSDAYNLLQNMVALGLNPSLQTYTLLLSC 652 Query: 2088 
CTTNGAHQDMYLFSTLMANTGHPAHSFLISLPTAEPGGQNIMNHAAGFFDLIHSEDRESK 2267 CT + DM LMA TGHPAH FL+S+P+A P GQN+ +H F D++HSEDRESK Sbjct: 653 CTEARSPYDMGFCCQLMAVTGHPAHMFLLSMPSAGPDGQNVRDHVGKFLDMMHSEDRESK 712 Query: 2268 RGFTDAVVNFLYSSDLKEEAGFVWEVAMERNLYPNAVTIKAPKYWSINLHVMSMGTXXXX 2447 RG D+VV+FL+ S LKEEAG VWEVA ++N+YP+AV K+ YW INLHVMS GT Sbjct: 713 RGLVDSVVDFLHKSGLKEEAGSVWEVAAQKNVYPDAVREKSSCYWLINLHVMSDGTAVTA 772 Query: 2448 XXXXXXXXXEKMFMSGVEPDRIDIITGWGKRSRVMGSSLVKQSVEQMLSVFHSPFSLENG 2627 ++M +SG+ P RIDI+TGWG+RSRV GSSLV+Q+V+ +LS+F PF ENG Sbjct: 773 LSRTLAWFRQQMLVSGISPSRIDIVTGWGRRSRVTGSSLVRQAVQDLLSIFSFPFFTENG 832 Query: 2628 NSGCFVGFGRPLIEWMNASPLERMHLL 2708 NSGCFVG G PL W+ S +ERMHLL Sbjct: 833 NSGCFVGCGEPLNRWLLQSYVERMHLL 859 >ref|NP_001185030.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332191659|gb|AEE29780.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 886 Score = 714 bits (1842), Expect = 0.0 Identities = 363/708 (51%), Positives = 482/708 (68%), Gaps = 12/708 (1%) Frame = +3 Query: 603 PDYLYGVKRPGNPSSCVNNPTL-LNSRRYYCIPNEQTKHFV-------SQESHLLPDING 758 P Y G G P SC+ +PT ++S + + + +HF ++ES + N Sbjct: 155 PSYDGGSDAFGLPKSCMVDPTRPISSVKSSNVKAIRREHFAKIYPRSAAKESSVGTTRNP 214 Query: 759 RDHLQYQITSVPPTGAVPLYHKESHNLQNNNVCSTGNVNG--TTDASRPYTNEN--CPGN 926 + + TG V + + S+++ ++ +T N G T+ RP+ + N P Sbjct: 215 SSNFR-GAKEAERTGFVKGFRQVSNSVVGKSLPTTNNTYGKRTSVLQRPHIDSNRFVPSG 273 Query: 927 YGNNRGRTXXXXXXXXXXXXXXLYQQFPVSNNGNLVDYVSTILRQNRWGPATLEALKNLN 1106 + N+ +Q+ N+G++V+ VS++LR+ RWGPA EAL+NL Sbjct: 274 FSNSSVEMMKGPSGTALTS-----RQY--CNSGHIVENVSSVLRRFRWGPAAEEALQNLG 326 Query: 1107 VTLNVYQVNQILKLQQDPDLALNFFFWAKQQEGFRHDEHSYTTMIGILGNSRKFDAIDAL 1286 + ++ YQ NQ+LK D AL FF+W K+Q GF+HD H+YTTM+G LG +++F AI+ L Sbjct: 327 LRIDAYQANQVLKQMNDYGNALGFFYWLKRQPGFKHDGHTYTTMVGNLGRAKQFGAINKL 386 Query: 1287 LGEMINDGCRPTVVTYNRLIHSYGRANHIREAVDVFHEMQAIGCMPDRTTYGTLINIHAK 1466 L EM+ DGC+P VTYNRLIHSYGRAN++ EA++VF++MQ GC PDR TY TLI+IHAK Sbjct: 387 
LDEMVRDGCQPNTVTYNRLIHSYGRANYLNEAMNVFNQMQEAGCKPDRVTYCTLIDIHAK 446 Query: 1467 AGYLDQALRMYSEMQEAGLSPDSFVYSVMINCLGKSGDLSAAYKLFCEMTERGCVPTLVT 1646 AG+LD A+ MY MQ GLSPD+F YSV+INCLGK+G L AA+KLFCEM ++GC P LVT Sbjct: 447 AGFLDIAMDMYQRMQAGGLSPDTFTYSVIINCLGKAGHLPAAHKLFCEMVDQGCTPNLVT 506 Query: 1647 YNNMIDLHAKARKYTIALKLYQDMQNAGFQPDKYTFNVVMEVHKYCGRYNEAESIFNEMQ 1826 YN M+DLHAKAR Y ALKLY+DMQNAGF+PDK T+++VMEV +CG EAE++F EMQ Sbjct: 507 YNIMMDLHAKARNYQNALKLYRDMQNAGFEPDKVTYSIVMEVLGHCGYLEEAEAVFTEMQ 566 Query: 1827 KAGWMPDEVVYGVLVDMWGKAGNVQRAFQWFSSMLHSGIRPNVPTCNSLLGAFLRANMFN 2006 + W+PDE VYG+LVD+WGKAGNV++A+QW+ +MLH+G+RPNVPTCNSLL FLR N Sbjct: 567 QKNWIPDEPVYGLLVDLWGKAGNVEKAWQWYQAMLHAGLRPNVPTCNSLLSTFLRVNKIA 626 Query: 2007 EALMILQEMLQMNLCPSLQTYTLLISSCTTNGAHQDMYLFSTLMANTGHPAHSFLISLPT 2186 EA +LQ ML + L PSLQTYTLL+S CT + DM LMA+TGHPAH FL+ +P Sbjct: 627 EAYELLQNMLALGLRPSLQTYTLLLSCCTDGRSKLDMGFCGQLMASTGHPAHMFLLKMPA 686 Query: 2187 AEPGGQNIMNHAAGFFDLIHSEDRESKRGFTDAVVNFLYSSDLKEEAGFVWEVAMERNLY 2366 A P G+N+ NHA F DL+HSEDRESKRG DAVV+FL+ S KEEAG VWEVA ++N++ Sbjct: 687 AGPDGENVRNHANNFLDLMHSEDRESKRGLVDAVVDFLHKSGQKEEAGSVWEVAAQKNVF 746 Query: 2367 PNAVTIKAPKYWSINLHVMSMGTXXXXXXXXXXXXXEKMFMSGVEPDRIDIITGWGKRSR 2546 P+A+ K+ YW INLHVMS GT ++M SG P RIDI+TGWG+RSR Sbjct: 747 PDALREKSCSYWLINLHVMSEGTAVTALSRTLAWFRKQMLASGTCPSRIDIVTGWGRRSR 806 Query: 2547 VMGSSLVKQSVEQMLSVFHSPFSLENGNSGCFVGFGRPLIEWMNASPL 2690 V G+S+V+Q+VE++L++F SPF E+GNSGCFVG G PL W+ S L Sbjct: 807 VTGTSMVRQAVEELLNIFGSPFFTESGNSGCFVGSGEPLNRWLLQSHL 854 >ref|XP_002321537.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|222868533|gb|EEF05664.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 834 Score = 713 bits (1841), Expect = 0.0 Identities = 344/564 (60%), Positives = 433/564 (76%) Frame = +3 Query: 1017 NNGNLVDYVSTILRQNRWGPATLEALKNLNVTLNVYQVNQILKLQQDPDLALNFFFWAKQ 1196 + G++V+ VS ILRQ RWGP+ EAL NLN ++ YQ NQ+LK QD +AL FF W KQ Sbjct: 271 
STGHVVENVSQILRQLRWGPSAEEALVNLNCHMDAYQANQVLKQLQDHTVALGFFHWLKQ 330 Query: 1197 QEGFRHDEHSYTTMIGILGNSRKFDAIDALLGEMINDGCRPTVVTYNRLIHSYGRANHIR 1376 GF+HD ++YTTM+GILG +++F AI+ LL +M+ DGC+PTVVTYNRLIHSYGRAN++ Sbjct: 331 LPGFKHDGYTYTTMVGILGRAKQFVAINKLLDQMVRDGCQPTVVTYNRLIHSYGRANYLN 390 Query: 1377 EAVDVFHEMQAIGCMPDRTTYGTLINIHAKAGYLDQALRMYSEMQEAGLSPDSFVYSVMI 1556 +AV+VF++MQ GC PDR TY TLI+IHAKAG+L+ A+ MY MQ AGLSPD+F YSVMI Sbjct: 391 DAVEVFNQMQKAGCEPDRVTYCTLIDIHAKAGFLNFAMEMYQRMQAAGLSPDTFTYSVMI 450 Query: 1557 NCLGKSGDLSAAYKLFCEMTERGCVPTLVTYNNMIDLHAKARKYTIALKLYQDMQNAGFQ 1736 NCLGK+G L+AA KLFCEM E+GCVP LVTYN MI L AKAR Y ALKLY+DMQNAGF+ Sbjct: 451 NCLGKAGHLAAADKLFCEMIEQGCVPNLVTYNIMIALQAKARNYQNALKLYRDMQNAGFE 510 Query: 1737 PDKYTFNVVMEVHKYCGRYNEAESIFNEMQKAGWMPDEVVYGVLVDMWGKAGNVQRAFQW 1916 PDK T+++VMEV + G +EAE+IF+EM++ W+PDE VYG+LVD+WGKAGNV++A++W Sbjct: 511 PDKVTYSIVMEVLGHSGYLDEAEAIFSEMKRKNWVPDEPVYGLLVDLWGKAGNVEKAWEW 570 Query: 1917 FSSMLHSGIRPNVPTCNSLLGAFLRANMFNEALMILQEMLQMNLCPSLQTYTLLISSCTT 2096 + +MLH+G+ PNVPTCNSLL AFLR N +A +LQ ML + L PSLQTYTLL+S CT Sbjct: 571 YQAMLHAGLCPNVPTCNSLLSAFLRVNRLPDAYNLLQSMLNLGLNPSLQTYTLLLSCCTE 630 Query: 2097 NGAHQDMYLFSTLMANTGHPAHSFLISLPTAEPGGQNIMNHAAGFFDLIHSEDRESKRGF 2276 + DM + LM+ TGHPAH FL SLP+A P GQN+ +H + F D++HSEDRESKRG Sbjct: 631 ARSPYDMGCYCELMSVTGHPAHMFLSSLPSAGPDGQNVRHHVSKFLDMMHSEDRESKRGL 690 Query: 2277 TDAVVNFLYSSDLKEEAGFVWEVAMERNLYPNAVTIKAPKYWSINLHVMSMGTXXXXXXX 2456 DAVV+FL+ S LKEEAG VWE+A +RN+YP+AV K+ YW INLHVMS GT Sbjct: 691 VDAVVDFLHKSGLKEEAGSVWEIAAQRNVYPDAVKEKSSCYWLINLHVMSEGTAVTALSR 750 Query: 2457 XXXXXXEKMFMSGVEPDRIDIITGWGKRSRVMGSSLVKQSVEQMLSVFHSPFSLENGNSG 2636 +M +SGV P RIDI+TGWG+RSRV GSSLV+Q+V+++L +F PF ENGN+G Sbjct: 751 TLAWFRRQMLVSGVIPSRIDIVTGWGRRSRVTGSSLVRQAVQELLHIFSFPFFTENGNTG 810 Query: 2637 CFVGFGRPLIEWMNASPLERMHLL 2708 CFVG G PL W+ S +ERMHLL Sbjct: 811 CFVGCGEPLSRWLLQSYVERMHLL 834 >ref|XP_006476670.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74750-like [Citrus sinensis] Length = 856 Score = 712 bits (1839), 
Expect = 0.0 Identities = 337/565 (59%), Positives = 433/565 (76%) Frame = +3 Query: 1014 SNNGNLVDYVSTILRQNRWGPATLEALKNLNVTLNVYQVNQILKLQQDPDLALNFFFWAK 1193 ++ GN+V+ VS ILRQ +WGP EAL N N +++ YQ NQ+LK QD +AL FF W + Sbjct: 292 ASTGNVVESVSRILRQWKWGPLAEEALGNTNYSMDAYQANQVLKQLQDHTVALGFFNWLR 351 Query: 1194 QQEGFRHDEHSYTTMIGILGNSRKFDAIDALLGEMINDGCRPTVVTYNRLIHSYGRANHI 1373 +Q GF+HDEH+YTTM+GILG +R+F AI+ LL +M+ DGC+P VVTYNRLIHSYGRAN++ Sbjct: 352 RQAGFKHDEHTYTTMVGILGRARQFGAINKLLDQMVRDGCQPNVVTYNRLIHSYGRANYL 411 Query: 1374 REAVDVFHEMQAIGCMPDRTTYGTLINIHAKAGYLDQALRMYSEMQEAGLSPDSFVYSVM 1553 EA+DVF +MQ +GC PDR TY TLI+IHAKAG+LD A+ MY +MQ AGLSPD+F YSV+ Sbjct: 412 NEALDVFKQMQVVGCEPDRVTYCTLIDIHAKAGFLDVAMDMYKKMQAAGLSPDTFTYSVI 471 Query: 1554 INCLGKSGDLSAAYKLFCEMTERGCVPTLVTYNNMIDLHAKARKYTIALKLYQDMQNAGF 1733 INCLGK+G L AA++LFCEM +GC+P LVTYN MI L AKAR Y ALKLY+DMQNAGF Sbjct: 472 INCLGKAGHLQAAHQLFCEMVNQGCIPNLVTYNIMIALQAKARNYQSALKLYRDMQNAGF 531 Query: 1734 QPDKYTFNVVMEVHKYCGRYNEAESIFNEMQKAGWMPDEVVYGVLVDMWGKAGNVQRAFQ 1913 +PDK T+++VMEV +CG +EAE++F EM++ W+PDE VYG+LVD+WGKAGNV++A++ Sbjct: 532 EPDKVTYSIVMEVLGHCGYLDEAEAVFAEMRRKNWVPDEPVYGLLVDLWGKAGNVRKAWE 591 Query: 1914 WFSSMLHSGIRPNVPTCNSLLGAFLRANMFNEALMILQEMLQMNLCPSLQTYTLLISSCT 2093 W+ +ML +G+RPNVPTCNSLL AFLR ++A +LQ ML + L PSLQTYTLL+S CT Sbjct: 592 WYEAMLQAGLRPNVPTCNSLLSAFLRVGQLSDAYHLLQGMLNLGLKPSLQTYTLLLSCCT 651 Query: 2094 TNGAHQDMYLFSTLMANTGHPAHSFLISLPTAEPGGQNIMNHAAGFFDLIHSEDRESKRG 2273 + DM LMA +GHPAH FL+S+P+ P GQN+ +H F +++HSEDRESKRG Sbjct: 652 EARSPYDMGFCHELMAVSGHPAHMFLLSMPSPGPDGQNVRDHVGSFLEMMHSEDRESKRG 711 Query: 2274 FTDAVVNFLYSSDLKEEAGFVWEVAMERNLYPNAVTIKAPKYWSINLHVMSMGTXXXXXX 2453 DAVV+FL+ S LKEEAG VWEVA ++N+YP+AV K YW INLHVMS GT Sbjct: 712 LVDAVVDFLHKSGLKEEAGSVWEVAAQKNVYPDAVKEKGMSYWLINLHVMSDGTAVIALS 771 Query: 2454 XXXXXXXEKMFMSGVEPDRIDIITGWGKRSRVMGSSLVKQSVEQMLSVFHSPFSLENGNS 2633 ++M +SGV P RIDI+TGWG+RSRV G+SLV+Q+V+++L +F PF ENGNS Sbjct: 772 RTLAWFRKQMLISGVGPSRIDIVTGWGRRSRVTGTSLVRQAVQELLHMFSFPFFTENGNS 831 Query: 2634 
GCFVGFGRPLIEWMNASPLERMHLL 2708 GCFVG G PL +W+ S ++RMHLL Sbjct: 832 GCFVGCGEPLNKWLLQSYVDRMHLL 856 >ref|XP_006439668.1| hypothetical protein CICLE_v10018829mg [Citrus clementina] gi|557541930|gb|ESR52908.1| hypothetical protein CICLE_v10018829mg [Citrus clementina] Length = 856 Score = 712 bits (1837), Expect = 0.0 Identities = 336/565 (59%), Positives = 434/565 (76%) Frame = +3 Query: 1014 SNNGNLVDYVSTILRQNRWGPATLEALKNLNVTLNVYQVNQILKLQQDPDLALNFFFWAK 1193 ++ GN+V+ VS IL+Q +WGP EAL N N +++ YQ NQ+LK QD +AL FF W + Sbjct: 292 ASTGNVVESVSRILQQWKWGPLAEEALGNTNYSMDAYQANQVLKQLQDHTVALGFFNWLR 351 Query: 1194 QQEGFRHDEHSYTTMIGILGNSRKFDAIDALLGEMINDGCRPTVVTYNRLIHSYGRANHI 1373 +Q GF+HDEH+YTTM+GILG +R+F AI+ LL +M+ DGC+P VVTYNRLIHSYGRAN++ Sbjct: 352 RQAGFKHDEHTYTTMVGILGRARQFGAINKLLDQMVRDGCQPNVVTYNRLIHSYGRANYL 411 Query: 1374 REAVDVFHEMQAIGCMPDRTTYGTLINIHAKAGYLDQALRMYSEMQEAGLSPDSFVYSVM 1553 EA+DVF +MQ +GC PDR TY TLI+IHAKAG+LD A+ MY +MQ AGLSPD+F YSV+ Sbjct: 412 NEALDVFKQMQVVGCEPDRVTYCTLIDIHAKAGFLDVAMDMYKKMQAAGLSPDTFTYSVI 471 Query: 1554 INCLGKSGDLSAAYKLFCEMTERGCVPTLVTYNNMIDLHAKARKYTIALKLYQDMQNAGF 1733 INCLGK+G L AA++LFCEM +GC+P LVTYN MI L AKAR Y ALKLY+DMQNAGF Sbjct: 472 INCLGKAGHLQAAHQLFCEMVNQGCIPNLVTYNIMIALQAKARNYQSALKLYRDMQNAGF 531 Query: 1734 QPDKYTFNVVMEVHKYCGRYNEAESIFNEMQKAGWMPDEVVYGVLVDMWGKAGNVQRAFQ 1913 +PDK T+++VMEV +CG +EAE++F EM++ W+PDE VYG+LVD+WGKAGNV++A++ Sbjct: 532 EPDKVTYSIVMEVLGHCGYLDEAEAVFAEMRRKNWVPDEPVYGLLVDLWGKAGNVRKAWE 591 Query: 1914 WFSSMLHSGIRPNVPTCNSLLGAFLRANMFNEALMILQEMLQMNLCPSLQTYTLLISSCT 2093 W+ +ML +G+RPNVPTCNSLL AFLR ++A +L+ ML + L PSLQTYTLL+S CT Sbjct: 592 WYEAMLQAGLRPNVPTCNSLLSAFLRVGQLSDAFHLLRGMLNLGLKPSLQTYTLLLSCCT 651 Query: 2094 TNGAHQDMYLFSTLMANTGHPAHSFLISLPTAEPGGQNIMNHAAGFFDLIHSEDRESKRG 2273 + DM LMA +GHPAH FL+S+P+ P GQN+ +H + F +++HSEDRESKRG Sbjct: 652 EARSPYDMGFCHELMAVSGHPAHMFLLSMPSPGPDGQNVRDHVSSFLEMMHSEDRESKRG 711 Query: 2274 FTDAVVNFLYSSDLKEEAGFVWEVAMERNLYPNAVTIKAPKYWSINLHVMSMGTXXXXXX 2453 DAVV+FL+ S LKEEAG VWEVA ++N+YP+AV K YW INLHVMS GT Sbjct: 
712 LVDAVVDFLHKSGLKEEAGSVWEVAAQKNVYPDAVREKGMSYWLINLHVMSDGTAVIALS 771 Query: 2454 XXXXXXXEKMFMSGVEPDRIDIITGWGKRSRVMGSSLVKQSVEQMLSVFHSPFSLENGNS 2633 ++M MSGV P RIDI+TGWG+RSRV G+SLV+Q+V+++L +F PF ENGNS Sbjct: 772 RTLAWFRKQMLMSGVGPSRIDIVTGWGRRSRVTGTSLVRQAVQELLHMFSFPFFTENGNS 831 Query: 2634 GCFVGFGRPLIEWMNASPLERMHLL 2708 GCFVG G PL +W+ S ++RMHLL Sbjct: 832 GCFVGCGEPLNKWLLQSYVDRMHLL 856 >ref|XP_006306742.1| hypothetical protein CARUB_v10008274mg [Capsella rubella] gi|482575453|gb|EOA39640.1| hypothetical protein CARUB_v10008274mg [Capsella rubella] Length = 878 Score = 710 bits (1832), Expect = 0.0 Identities = 335/564 (59%), Positives = 429/564 (76%) Frame = +3 Query: 1017 NNGNLVDYVSTILRQNRWGPATLEALKNLNVTLNVYQVNQILKLQQDPDLALNFFFWAKQ 1196 N+G++V+ VS++L++ RWGPA EAL+NL+ ++ YQ NQ+LK D AL FF+W K+ Sbjct: 315 NSGHIVENVSSVLKRFRWGPAAEEALQNLDFRIDAYQANQVLKQMNDYGNALGFFYWLKR 374 Query: 1197 QEGFRHDEHSYTTMIGILGNSRKFDAIDALLGEMINDGCRPTVVTYNRLIHSYGRANHIR 1376 Q GF+HD H+YTTM+G LG +++F AI+ LL EM+ DGC+P VTYNRLIHSYGRAN++ Sbjct: 375 QPGFKHDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQPNTVTYNRLIHSYGRANYLN 434 Query: 1377 EAVDVFHEMQAIGCMPDRTTYGTLINIHAKAGYLDQALRMYSEMQEAGLSPDSFVYSVMI 1556 EA++VF++MQ GC PDR TY TLI+IHAKAG+LD A+ MY MQ GLSPD+F YSV+I Sbjct: 435 EAMNVFNQMQEAGCKPDRVTYCTLIDIHAKAGFLDIAMDMYQRMQAGGLSPDTFTYSVII 494 Query: 1557 NCLGKSGDLSAAYKLFCEMTERGCVPTLVTYNNMIDLHAKARKYTIALKLYQDMQNAGFQ 1736 NCLGK G L AA++LFCEM ++GC P LVTYN MIDLHAKAR Y ALKLY+DMQNAGF+ Sbjct: 495 NCLGKGGHLPAAHRLFCEMVDQGCTPNLVTYNIMIDLHAKARNYPSALKLYRDMQNAGFE 554 Query: 1737 PDKYTFNVVMEVHKYCGRYNEAESIFNEMQKAGWMPDEVVYGVLVDMWGKAGNVQRAFQW 1916 PDK T+++VMEV + G EAE++F EMQ+ W+PDE VYG+LVD+WGKAGNV++A+QW Sbjct: 555 PDKVTYSIVMEVLGHTGYLEEAEAVFTEMQQKNWVPDEPVYGLLVDLWGKAGNVEKAWQW 614 Query: 1917 FSSMLHSGIRPNVPTCNSLLGAFLRANMFNEALMILQEMLQMNLCPSLQTYTLLISSCTT 2096 + +MLH+G+ PNVPTCNSLL FLR N EA +LQ ML + L PSLQTYTLL+S CT Sbjct: 615 YQAMLHAGLLPNVPTCNSLLSTFLRVNKIAEAYDLLQNMLALGLRPSLQTYTLLLSCCTD 674 Query: 2097 
NGAHQDMYLFSTLMANTGHPAHSFLISLPTAEPGGQNIMNHAAGFFDLIHSEDRESKRGF 2276 + DM LMA+TGHPAH FL+ +P A P GQN+ NHA F +L+HSEDRESKRG Sbjct: 675 GRSKLDMGFCGQLMASTGHPAHMFLLKMPAAGPDGQNVRNHANNFLNLMHSEDRESKRGL 734 Query: 2277 TDAVVNFLYSSDLKEEAGFVWEVAMERNLYPNAVTIKAPKYWSINLHVMSMGTXXXXXXX 2456 DAVV+FL+ S KEEAG VWEVA ++N++P+A+ K+ YW INLHVMS GT Sbjct: 735 VDAVVDFLHKSGQKEEAGSVWEVAAQKNVFPDALREKSSSYWLINLHVMSEGTAITALSR 794 Query: 2457 XXXXXXEKMFMSGVEPDRIDIITGWGKRSRVMGSSLVKQSVEQMLSVFHSPFSLENGNSG 2636 ++M +SG P RIDI+TGWG+RSRV G+S+V+Q+VE++L++F SPF E+GNSG Sbjct: 795 TLAWFRKQMLVSGTCPSRIDIVTGWGRRSRVTGTSMVRQAVEELLNIFGSPFFTESGNSG 854 Query: 2637 CFVGFGRPLIEWMNASPLERMHLL 2708 CFVG G L +W+ S +ERMHLL Sbjct: 855 CFVGCGESLNKWLLQSHVERMHLL 878 >ref|XP_004299605.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74750-like [Fragaria vesca subsp. vesca] Length = 879 Score = 710 bits (1832), Expect = 0.0 Identities = 339/564 (60%), Positives = 432/564 (76%) Frame = +3 Query: 1017 NNGNLVDYVSTILRQNRWGPATLEALKNLNVTLNVYQVNQILKLQQDPDLALNFFFWAKQ 1196 +NGN+V VS IL+Q +WGP+ +L+NLN +++ YQ NQILK QD +AL FF W K+ Sbjct: 316 HNGNVVQNVSHILQQLKWGPSAEASLRNLNCSMDAYQANQILKQLQDHTVALGFFNWLKR 375 Query: 1197 QEGFRHDEHSYTTMIGILGNSRKFDAIDALLGEMINDGCRPTVVTYNRLIHSYGRANHIR 1376 Q GFRHD H+YTTM+GILG +R+F AI+ LL +M+N+GC+P VVTYNRLIHSYGRAN+++ Sbjct: 376 QAGFRHDGHTYTTMVGILGRARQFGAINKLLNQMVNEGCQPNVVTYNRLIHSYGRANYLK 435 Query: 1377 EAVDVFHEMQAIGCMPDRTTYGTLINIHAKAGYLDQALRMYSEMQEAGLSPDSFVYSVMI 1556 +A++VF +MQ GC PDR TY TLI+IHAK+G+LD AL +Y MQ+AGLSPD+F YSVMI Sbjct: 436 DAMNVFSQMQEAGCEPDRVTYCTLIDIHAKSGFLDIALGLYDRMQKAGLSPDTFTYSVMI 495 Query: 1557 NCLGKSGDLSAAYKLFCEMTERGCVPTLVTYNNMIDLHAKARKYTIALKLYQDMQNAGFQ 1736 NCLGK+G L+AA++LFCEM + GCVP LVTYN MI L AKAR Y ALKLY+DMQ AGFQ Sbjct: 496 NCLGKAGHLAAAHRLFCEMVDHGCVPNLVTYNIMIALQAKARNYETALKLYRDMQGAGFQ 555 Query: 1737 PDKYTFNVVMEVHKYCGRYNEAESIFNEMQKAGWMPDEVVYGVLVDMWGKAGNVQRAFQW 1916 PDK T+++VMEV +CG EAE++F EM++ W+PDE VYG+LVD+WGKAGNVQ+A+ W Sbjct: 556 
PDKVTYSIVMEVLGHCGYLEEAEALFGEMKRKNWVPDEPVYGLLVDLWGKAGNVQKAWDW 615 Query: 1917 FSSMLHSGIRPNVPTCNSLLGAFLRANMFNEALMILQEMLQMNLCPSLQTYTLLISSCTT 2096 + +MLH+G+RPNVPTCNSLL AFLR + ++A +LQ M+ + L PSLQTYTLL+S CT Sbjct: 616 YQAMLHAGLRPNVPTCNSLLSAFLRVHRLSDAYNLLQSMVGLGLNPSLQTYTLLLSCCTE 675 Query: 2097 NGAHQDMYLFSTLMANTGHPAHSFLISLPTAEPGGQNIMNHAAGFFDLIHSEDRESKRGF 2276 + DM LMA T HPAH+FL+S+P+A P GQN+ H F DL+HSEDRESKRG Sbjct: 676 AQSPYDMEFCCELMAVTRHPAHTFLLSMPSAGPDGQNVREHMNSFLDLMHSEDRESKRGL 735 Query: 2277 TDAVVNFLYSSDLKEEAGFVWEVAMERNLYPNAVTIKAPKYWSINLHVMSMGTXXXXXXX 2456 DAVV+FL+ S LKEEAG VWEVA ++N+YP+AV K YW INLHVMS GT Sbjct: 736 VDAVVDFLHKSGLKEEAGSVWEVAAQKNVYPDAVKEKGSCYWLINLHVMSDGTAVTALSR 795 Query: 2457 XXXXXXEKMFMSGVEPDRIDIITGWGKRSRVMGSSLVKQSVEQMLSVFHSPFSLENGNSG 2636 ++M +SGV P+RIDI+TGWG+RSRV G+S+V+ +V+++L +F PF ENGNSG Sbjct: 796 TLAWFRQQMLISGVCPNRIDIVTGWGRRSRVTGTSMVRHAVQELLHMFSFPFFTENGNSG 855 Query: 2637 CFVGFGRPLIEWMNASPLERMHLL 2708 CFVG G L W+ S +ERMHLL Sbjct: 856 CFVGCGESLNRWLLESYVERMHLL 879 >gb|EXC34220.1| hypothetical protein L484_010090 [Morus notabilis] Length = 872 Score = 709 bits (1831), Expect = 0.0 Identities = 337/567 (59%), Positives = 432/567 (76%) Frame = +3 Query: 1008 PVSNNGNLVDYVSTILRQNRWGPATLEALKNLNVTLNVYQVNQILKLQQDPDLALNFFFW 1187 P +N N+V+ VS +L RWG A EAL+NLN ++ +Q NQ+LK QD ++AL FF+W Sbjct: 306 PYANTANVVERVSHMLHGLRWGRAAEEALENLNYAMDAFQANQVLKQLQDHNVALGFFYW 365 Query: 1188 AKQQEGFRHDEHSYTTMIGILGNSRKFDAIDALLGEMINDGCRPTVVTYNRLIHSYGRAN 1367 K+Q GF+HD H+YTTM+GILG SR+F AI+ LL EM+ +GC+P VVTYNRLIHSYGRAN Sbjct: 366 LKRQAGFKHDGHTYTTMVGILGRSREFGAINKLLHEMVKEGCQPNVVTYNRLIHSYGRAN 425 Query: 1368 HIREAVDVFHEMQAIGCMPDRTTYGTLINIHAKAGYLDQALRMYSEMQEAGLSPDSFVYS 1547 +++EA++VF++MQ GC PDR TY TLI+IHAKAG+LD ALR+Y MQ+AGLSPD+F YS Sbjct: 426 YLKEAINVFNQMQNAGCEPDRVTYCTLIDIHAKAGFLDVALRLYDRMQQAGLSPDTFTYS 485 Query: 1548 VMINCLGKSGDLSAAYKLFCEMTERGCVPTLVTYNNMIDLHAKARKYTIALKLYQDMQNA 1727 V+INCLGK G L+AA+ LFC+M GCVP LVTYN MI L AKAR Y ALKLY+DMQNA Sbjct: 486 
VIINCLGKGGHLTAAHNLFCKMVSEGCVPNLVTYNIMIALQAKARNYETALKLYRDMQNA 545 Query: 1728 GFQPDKYTFNVVMEVHKYCGRYNEAESIFNEMQKAGWMPDEVVYGVLVDMWGKAGNVQRA 1907 GF PDK T+++VMEV +CG EAE++F EM+ W+PDE VYG+LVD+WGK+GN+++A Sbjct: 546 GFDPDKVTYSIVMEVLGHCGYLEEAEAVFVEMRHKNWVPDEPVYGLLVDLWGKSGNIEKA 605 Query: 1908 FQWFSSMLHSGIRPNVPTCNSLLGAFLRANMFNEALMILQEMLQMNLCPSLQTYTLLISS 2087 ++W+ +ML++G++PNVPTCNSLL AFLR + EA +LQ M+ L PSLQTYTLL+S Sbjct: 606 WEWYQAMLNAGLQPNVPTCNSLLSAFLRVHRLTEAYELLQSMVDWGLNPSLQTYTLLLSC 665 Query: 2088 CTTNGAHQDMYLFSTLMANTGHPAHSFLISLPTAEPGGQNIMNHAAGFFDLIHSEDRESK 2267 CT + DM LMA TGHPAH+FL+S+P+A P GQN+ +HA+ F DL+HSEDRE K Sbjct: 666 CTEAQSPYDMGFCCKLMATTGHPAHTFLLSMPSAGPDGQNVRDHASRFLDLMHSEDREGK 725 Query: 2268 RGFTDAVVNFLYSSDLKEEAGFVWEVAMERNLYPNAVTIKAPKYWSINLHVMSMGTXXXX 2447 RG DAVV+FL+ S LKEEAG VWEVA ++N+YP+AV K+ +W INLHVMS GT Sbjct: 726 RGLVDAVVDFLHKSGLKEEAGSVWEVAAQKNVYPDAVKEKSSCHWLINLHVMSDGTAVTA 785 Query: 2448 XXXXXXXXXEKMFMSGVEPDRIDIITGWGKRSRVMGSSLVKQSVEQMLSVFHSPFSLENG 2627 +M +SG+ P RIDI+TGWG+RSRV G+SLV+Q+V+++L +F PF ENG Sbjct: 786 LSRTLAWFRREMLISGICPSRIDIVTGWGRRSRVTGASLVRQAVQELLRMFSFPFFTENG 845 Query: 2628 NSGCFVGFGRPLIEWMNASPLERMHLL 2708 NSGCFVG G PL W+ S +ERMHLL Sbjct: 846 NSGCFVGCGEPLNRWLLQSYVERMHLL 872 >ref|XP_003523047.2| PREDICTED: pentatricopeptide repeat-containing protein At1g18900-like isoform X1 [Glycine max] Length = 882 Score = 709 bits (1829), Expect = 0.0 Identities = 340/565 (60%), Positives = 434/565 (76%) Frame = +3 Query: 1014 SNNGNLVDYVSTILRQNRWGPATLEALKNLNVTLNVYQVNQILKLQQDPDLALNFFFWAK 1193 +N+G++V+ V IL+Q RWGPAT + L NLN +++ YQ NQILK QD +A+ FF W K Sbjct: 318 TNSGHVVEVVKDILKQLRWGPATEKTLYNLNFSIDAYQANQILKQLQDHSVAVGFFCWLK 377 Query: 1194 QQEGFRHDEHSYTTMIGILGNSRKFDAIDALLGEMINDGCRPTVVTYNRLIHSYGRANHI 1373 +Q GF HD H+YTTM+GILG +R+F AI+ LL +M+ DGC+P VVTYNRLIHSYGRAN++ Sbjct: 378 RQPGFWHDGHTYTTMVGILGRAREFGAINKLLEQMVKDGCQPNVVTYNRLIHSYGRANYL 437 Query: 1374 REAVDVFHEMQAIGCMPDRTTYGTLINIHAKAGYLDQALRMYSEMQEAGLSPDSFVYSVM 1553 REA++VF++MQ +GC PDR TY TLI+IHAKAG+LD A+ MY 
MQE GLSPD+F YSVM Sbjct: 438 REALNVFNQMQEMGCEPDRVTYCTLIDIHAKAGFLDVAMSMYERMQEVGLSPDTFTYSVM 497 Query: 1554 INCLGKSGDLSAAYKLFCEMTERGCVPTLVTYNNMIDLHAKARKYTIALKLYQDMQNAGF 1733 INCLGKSG+LSAA++LFCEM ++GCVP +VTYN +I L AKAR Y AL+LY+DMQNAGF Sbjct: 498 INCLGKSGNLSAAHRLFCEMVDQGCVPNIVTYNILIALQAKARNYQTALELYRDMQNAGF 557 Query: 1734 QPDKYTFNVVMEVHKYCGRYNEAESIFNEMQKAGWMPDEVVYGVLVDMWGKAGNVQRAFQ 1913 +PDK T+++VMEV +CG EAE++F EM++ W+PDE VYG+LVD+WGKAGNV++A++ Sbjct: 558 KPDKVTYSIVMEVLGHCGYLEEAEAVFFEMRQNHWVPDEPVYGLLVDLWGKAGNVEKAWE 617 Query: 1914 WFSSMLHSGIRPNVPTCNSLLGAFLRANMFNEALMILQEMLQMNLCPSLQTYTLLISSCT 2093 W+ +ML +G+ PNVPTCNSLL AFLR + +A +LQ M+ + L PSLQTYTLL+S CT Sbjct: 618 WYHTMLRAGLLPNVPTCNSLLSAFLRVHRLPDAYNLLQNMVTLGLNPSLQTYTLLLSCCT 677 Query: 2094 TNGAHQDMYLFSTLMANTGHPAHSFLISLPTAEPGGQNIMNHAAGFFDLIHSEDRESKRG 2273 + DM LMA +GHPAH+FL S+P A P GQN+ +H + F DL+HSEDRE KRG Sbjct: 678 EAQSPYDMGFCCELMAVSGHPAHAFLQSMPAAGPDGQNVRDHVSKFLDLMHSEDREGKRG 737 Query: 2274 FTDAVVNFLYSSDLKEEAGFVWEVAMERNLYPNAVTIKAPKYWSINLHVMSMGTXXXXXX 2453 DAVV+FL+ S LKEEAG VWEVA ++N+YP+AV K+ YW INLHVMS GT Sbjct: 738 LVDAVVDFLHKSGLKEEAGSVWEVAAQKNVYPDAVKEKSTCYWLINLHVMSDGTAVTALS 797 Query: 2454 XXXXXXXEKMFMSGVEPDRIDIITGWGKRSRVMGSSLVKQSVEQMLSVFHSPFSLENGNS 2633 +M SGV P+RIDI+TGWG+RSRV GSSLV+Q+V+++L VF PF EN NS Sbjct: 798 RTLAWFRRQMLASGVGPNRIDIVTGWGRRSRVTGSSLVRQAVQELLHVFSFPFFTENSNS 857 Query: 2634 GCFVGFGRPLIEWMNASPLERMHLL 2708 GCFVG G PL +W+ S +ERMHLL Sbjct: 858 GCFVGCGEPLSQWLVHSYVERMHLL 882 >ref|XP_006578589.1| PREDICTED: pentatricopeptide repeat-containing protein At1g18900-like isoform X2 [Glycine max] Length = 898 Score = 709 bits (1829), Expect = 0.0 Identities = 340/565 (60%), Positives = 434/565 (76%) Frame = +3 Query: 1014 SNNGNLVDYVSTILRQNRWGPATLEALKNLNVTLNVYQVNQILKLQQDPDLALNFFFWAK 1193 +N+G++V+ V IL+Q RWGPAT + L NLN +++ YQ NQILK QD +A+ FF W K Sbjct: 334 TNSGHVVEVVKDILKQLRWGPATEKTLYNLNFSIDAYQANQILKQLQDHSVAVGFFCWLK 393 Query: 1194 QQEGFRHDEHSYTTMIGILGNSRKFDAIDALLGEMINDGCRPTVVTYNRLIHSYGRANHI 1373 +Q GF HD H+YTTM+GILG 
+R+F AI+ LL +M+ DGC+P VVTYNRLIHSYGRAN++ Sbjct: 394 RQPGFWHDGHTYTTMVGILGRAREFGAINKLLEQMVKDGCQPNVVTYNRLIHSYGRANYL 453 Query: 1374 REAVDVFHEMQAIGCMPDRTTYGTLINIHAKAGYLDQALRMYSEMQEAGLSPDSFVYSVM 1553 REA++VF++MQ +GC PDR TY TLI+IHAKAG+LD A+ MY MQE GLSPD+F YSVM Sbjct: 454 REALNVFNQMQEMGCEPDRVTYCTLIDIHAKAGFLDVAMSMYERMQEVGLSPDTFTYSVM 513 Query: 1554 INCLGKSGDLSAAYKLFCEMTERGCVPTLVTYNNMIDLHAKARKYTIALKLYQDMQNAGF 1733 INCLGKSG+LSAA++LFCEM ++GCVP +VTYN +I L AKAR Y AL+LY+DMQNAGF Sbjct: 514 INCLGKSGNLSAAHRLFCEMVDQGCVPNIVTYNILIALQAKARNYQTALELYRDMQNAGF 573 Query: 1734 QPDKYTFNVVMEVHKYCGRYNEAESIFNEMQKAGWMPDEVVYGVLVDMWGKAGNVQRAFQ 1913 +PDK T+++VMEV +CG EAE++F EM++ W+PDE VYG+LVD+WGKAGNV++A++ Sbjct: 574 KPDKVTYSIVMEVLGHCGYLEEAEAVFFEMRQNHWVPDEPVYGLLVDLWGKAGNVEKAWE 633 Query: 1914 WFSSMLHSGIRPNVPTCNSLLGAFLRANMFNEALMILQEMLQMNLCPSLQTYTLLISSCT 2093 W+ +ML +G+ PNVPTCNSLL AFLR + +A +LQ M+ + L PSLQTYTLL+S CT Sbjct: 634 WYHTMLRAGLLPNVPTCNSLLSAFLRVHRLPDAYNLLQNMVTLGLNPSLQTYTLLLSCCT 693 Query: 2094 TNGAHQDMYLFSTLMANTGHPAHSFLISLPTAEPGGQNIMNHAAGFFDLIHSEDRESKRG 2273 + DM LMA +GHPAH+FL S+P A P GQN+ +H + F DL+HSEDRE KRG Sbjct: 694 EAQSPYDMGFCCELMAVSGHPAHAFLQSMPAAGPDGQNVRDHVSKFLDLMHSEDREGKRG 753 Query: 2274 FTDAVVNFLYSSDLKEEAGFVWEVAMERNLYPNAVTIKAPKYWSINLHVMSMGTXXXXXX 2453 DAVV+FL+ S LKEEAG VWEVA ++N+YP+AV K+ YW INLHVMS GT Sbjct: 754 LVDAVVDFLHKSGLKEEAGSVWEVAAQKNVYPDAVKEKSTCYWLINLHVMSDGTAVTALS 813 Query: 2454 XXXXXXXEKMFMSGVEPDRIDIITGWGKRSRVMGSSLVKQSVEQMLSVFHSPFSLENGNS 2633 +M SGV P+RIDI+TGWG+RSRV GSSLV+Q+V+++L VF PF EN NS Sbjct: 814 RTLAWFRRQMLASGVGPNRIDIVTGWGRRSRVTGSSLVRQAVQELLHVFSFPFFTENSNS 873 Query: 2634 GCFVGFGRPLIEWMNASPLERMHLL 2708 GCFVG G PL +W+ S +ERMHLL Sbjct: 874 GCFVGCGEPLSQWLVHSYVERMHLL 898