BLASTX nr result
ID: Rauwolfia21_contig00017729
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00017729
         (1927 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_003635394.1| PREDICTED: putative pentatricopeptide repeat...   897   0.0
ref|XP_006348483.1| PREDICTED: pentatricopeptide repeat-containi...   879   0.0
gb|EXC12605.1| hypothetical protein L484_012982 [Morus notabilis]     877   0.0
emb|CAN71515.1| hypothetical protein VITISV_021787 [Vitis vinifera]   868   0.0
ref|XP_004159605.1| PREDICTED: pentatricopeptide repeat-containi...   855   0.0
ref|XP_004149630.1| PREDICTED: pentatricopeptide repeat-containi...   855   0.0
ref|XP_002530608.1| pentatricopeptide repeat-containing protein,...   832   0.0
ref|XP_004513407.1| PREDICTED: putative pentatricopeptide repeat...   832   0.0
ref|XP_003546958.1| PREDICTED: pentatricopeptide repeat-containi...   825   0.0
ref|XP_006426145.1| hypothetical protein CICLE_v10025134mg [Citr...   822   0.0
ref|XP_006595472.1| PREDICTED: putative pentatricopeptide repeat...   821   0.0
ref|XP_006466418.1| PREDICTED: putative pentatricopeptide repeat...   818   0.0
gb|EOX91773.1| Pentatricopeptide repeat (PPR) superfamily protei...   803   0.0
gb|EPS62602.1| hypothetical protein M569_12187, partial [Genlise...   781   0.0
ref|XP_006404107.1| hypothetical protein EUTSA_v10010190mg [Eutr...   774   0.0
ref|XP_006393982.1| hypothetical protein EUTSA_v10003830mg [Eutr...   760   0.0
ref|XP_002866691.1| pentatricopeptide repeat-containing protein ...   757   0.0
ref|NP_190542.4| pentatricopeptide repeat-containing protein [Ar...
756 0.0 emb|CAB66911.1| putative protein [Arabidopsis thaliana] 756 0.0 ref|XP_006292382.1| hypothetical protein CARUB_v10018595mg [Caps... 756 0.0 >ref|XP_003635394.1| PREDICTED: putative pentatricopeptide repeat-containing protein At5g65820-like [Vitis vinifera] Length = 622 Score = 897 bits (2318), Expect = 0.0 Identities = 434/554 (78%), Positives = 484/554 (87%) Frame = +1 Query: 265 NCFERIAEARRGLDLIRIRTEPEPSSDSQTQDEFTADVEKVYRILRKFHSRIPKLELALQ 444 NC I+E R G L+R+ + E + Q DEF+ADVEKVYRILRKFHSR+PKLELALQ Sbjct: 23 NC--TISERRGGFGLVRLESNRENCTYDQNYDEFSADVEKVYRILRKFHSRVPKLELALQ 80 Query: 445 ESGIVVRSGLTERVLNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGA 624 ESG+ VRSGLTERVLNRCGDAGNLGYRFF+WASKQPGYRHSY+VYKAMIKILGKMRQFGA Sbjct: 81 ESGVAVRSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSYEVYKAMIKILGKMRQFGA 140 Query: 625 VWALIEEMRKENPHLLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLL 804 VWALIEEMR+ENP +SP VFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDE+VFGCLL Sbjct: 141 VWALIEEMRRENPQFVSPYVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEHVFGCLL 200 Query: 805 DALCKNGSVKEAALLFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEP 984 DALCKNGSVKEAA LFEDMR++F PT+KHFTSLLYGWC+EGKLMEAK+VLV++REAGFEP Sbjct: 201 DALCKNGSVKEAASLFEDMRIRFTPTLKHFTSLLYGWCREGKLMEAKYVLVQIREAGFEP 260 Query: 985 DIVVYNNLLNGYAVAGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALCAQNKMEEAMRVF 1164 DIVVYNNLL GYA AGKMVDA+ LL+EM+ K CEPN SFT ++QALCA+ KMEEAMRVF Sbjct: 261 DIVVYNNLLTGYAAAGKMVDAYDLLKEMRRKECEPNVMSFTTLIQALCAKKKMEEAMRVF 320 Query: 1165 SEMERSGCEADVVTYTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXXX 1344 EM+ GC AD VTYTTLISGFCKWG+I++GYELLD+MIQ+GH PN +YL+I+ AH Sbjct: 321 FEMQSCGCPADAVTYTTLISGFCKWGKISKGYELLDNMIQQGHIPNPMTYLHIMAAHEKK 380 Query: 1345 XXXXXXXXXXXXMQKIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTH 1524 M+KIG PDL IYN VIRLACKLGEIKE +R W ++E G+SPG+DT Sbjct: 381 EELEECIELMEEMRKIGCTPDLNIYNIVIRLACKLGEIKEGVRVWNEMEATGLSPGLDTF 440 Query: 1525 VILINGLVEQGCLVEACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCI 1704 VI+I+G + Q CLVEAC++FKEMV RGLLSAPQYGTLK+LLNSLLR++KLEMSK+VWSCI Sbjct: 441 
VIMIHGFLSQRCLVEACEFFKEMVGRGLLSAPQYGTLKELLNSLLRAEKLEMSKDVWSCI 500 Query: 1705 MTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNR 1884 MTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNR Sbjct: 501 MTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNR 560 Query: 1885 QFAAEITEKVRKMA 1926 Q AAEITEKVRKMA Sbjct: 561 QIAAEITEKVRKMA 574 >ref|XP_006348483.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like [Solanum tuberosum] Length = 625 Score = 879 bits (2270), Expect = 0.0 Identities = 421/527 (79%), Positives = 468/527 (88%) Frame = +1 Query: 346 SQTQDEFTADVEKVYRILRKFHSRIPKLELALQESGIVVRSGLTERVLNRCGDAGNLGYR 525 ++ DEF+ADVEKVYRILRKFHSR+PKLELAL ESG+V RSGLTERVLNRCGDAGNLGYR Sbjct: 51 NKNHDEFSADVEKVYRILRKFHSRVPKLELALLESGVVARSGLTERVLNRCGDAGNLGYR 110 Query: 526 FFIWASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALIEEMRKENPHLLSPEVFVVLMRR 705 FF+W SKQPGYRHS+D YKAMIKILGKMRQFG VWAL+EEMR ENP L+PEVF+VLMRR Sbjct: 111 FFVWVSKQPGYRHSHDAYKAMIKILGKMRQFGTVWALVEEMRIENPQFLTPEVFIVLMRR 170 Query: 706 FASARMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAALLFEDMRLKFIPTI 885 FAS RMVKKAIEVLDEMPKYG EPDEYVFGCLLDALCKNGSVKEAA LF++MR +F PTI Sbjct: 171 FASGRMVKKAIEVLDEMPKYGVEPDEYVFGCLLDALCKNGSVKEAAALFDEMRFRFSPTI 230 Query: 886 KHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVYNNLLNGYAVAGKMVDAFVLLQE 1065 KHFTSLLYGWCKEGKL+EAK VLVKMREAGFEPDIVVYNNLLNGYAV+ KM DAF LLQE Sbjct: 231 KHFTSLLYGWCKEGKLIEAKVVLVKMREAGFEPDIVVYNNLLNGYAVSRKMADAFDLLQE 290 Query: 1066 MKSKGCEPNATSFTIVVQALCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGE 1245 M+ KGC PN TSFTIV+QALC Q+KMEEAMRVF +MERSGCE DVVTYTTLISGFCKWG+ Sbjct: 291 MRRKGCNPNETSFTIVIQALCLQDKMEEAMRVFLDMERSGCEGDVVTYTTLISGFCKWGK 350 Query: 1246 INRGYELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIYNT 1425 I +GYEL+D+M+QKG+ PN+T+YL+I+LAH M KIG+ PD +IYN Sbjct: 351 IEKGYELVDTMLQKGYNPNQTTYLHIMLAHEKKEELEECLELVKEMGKIGIPPDHSIYNI 410 Query: 1426 VIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILINGLVEQGCLVEACDYFKEMVYRG 1605 VIRLACKLGEI E +R W QIE NG+SPGVDT +I+ING VEQG L+EACD+FKEM+ RG Sbjct: 411 
VIRLACKLGEIDEGVRVWNQIEANGISPGVDTFIIMINGFVEQGRLIEACDHFKEMIGRG 470 Query: 1606 LLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKE 1785 LLSAPQYGTLKDLLNSLLR++KLE+ K+VWSCIMTKGC+LNV AWTIWIHALFSNGHVKE Sbjct: 471 LLSAPQYGTLKDLLNSLLRAEKLELCKDVWSCIMTKGCELNVSAWTIWIHALFSNGHVKE 530 Query: 1786 ACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFAAEITEKVRKMA 1926 AC+YCLDMMDAG+MPQPDTFAKLM+GLRKLYNR+ AAEITEK RKMA Sbjct: 531 ACAYCLDMMDAGLMPQPDTFAKLMKGLRKLYNREIAAEITEKARKMA 577 >gb|EXC12605.1| hypothetical protein L484_012982 [Morus notabilis] Length = 638 Score = 877 bits (2267), Expect = 0.0 Identities = 416/570 (72%), Positives = 484/570 (84%) Frame = +1 Query: 217 LSSYSHLGLHQNPLNMNCFERIAEARRGLDLIRIRTEPEPSSDSQTQDEFTADVEKVYRI 396 LS + QNP N G + + P S D +T DEF+ DVEK+YRI Sbjct: 30 LSPQTQFSSTQNPHNR---------ATGFSPVHLEQNPVVSDDDETHDEFSGDVEKIYRI 80 Query: 397 LRKFHSRIPKLELALQESGIVVRSGLTERVLNRCGDAGNLGYRFFIWASKQPGYRHSYDV 576 LRKFHSR+ KLELALQESG+V+RSGLTERVL RCGDAG+LGYRFF+WASKQPGYR SY+V Sbjct: 81 LRKFHSRVSKLELALQESGVVLRSGLTERVLGRCGDAGSLGYRFFVWASKQPGYRPSYEV 140 Query: 577 YKAMIKILGKMRQFGAVWALIEEMRKENPHLLSPEVFVVLMRRFASARMVKKAIEVLDEM 756 YKAMI+ LGKMRQFGAVWAL+EEMRKENP L++PE+FVVLMRRFASARMVKKA+EV DEM Sbjct: 141 YKAMIRALGKMRQFGAVWALLEEMRKENPQLITPEIFVVLMRRFASARMVKKAVEVFDEM 200 Query: 757 PKYGCEPDEYVFGCLLDALCKNGSVKEAALLFEDMRLKFIPTIKHFTSLLYGWCKEGKLM 936 PKYGCEPDE+VFGCLLDALCKNGSVKEAA LFE+MR+KF P++KHFTSLLYGWC+EGKLM Sbjct: 201 PKYGCEPDEHVFGCLLDALCKNGSVKEAASLFEEMRVKFTPSLKHFTSLLYGWCREGKLM 260 Query: 937 EAKFVLVKMREAGFEPDIVVYNNLLNGYAVAGKMVDAFVLLQEMKSKGCEPNATSFTIVV 1116 EAKFVLV+M+EAGFEPD+VVYNNLL GYA AGKM DA+ L++EM+ KGC PNA S+T+++ Sbjct: 261 EAKFVLVQMKEAGFEPDVVVYNNLLGGYAQAGKMADAYDLMKEMRGKGCSPNAASYTVLI 320 Query: 1117 QALCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGEINRGYELLDSMIQKGHT 1296 QALC + KMEEAMRVF EM+RSGC+ADV+TYTTLISGFCKWG+I RGYE+LDSMIQ+G + Sbjct: 321 QALCKREKMEEAMRVFVEMQRSGCDADVMTYTTLISGFCKWGKIERGYEILDSMIQRGFS 380 Query: 1297 PNRTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIYNTVIRLACKLGEIKEAIRF 1476 PN T+YL+I+LAH M+KIG VPDL 
IYNTVIRLACKL E+KE +R Sbjct: 381 PNETTYLHIMLAHEKKEEFEECVELIGEMRKIGCVPDLKIYNTVIRLACKLREVKEGVRL 440 Query: 1477 WTQIEVNGMSPGVDTHVILINGLVEQGCLVEACDYFKEMVYRGLLSAPQYGTLKDLLNSL 1656 W +IE +G+SPG+DT V++I+G + QGCL+EAC YFKEMV RGLLS PQYGTLK+LLN+L Sbjct: 441 WNEIEASGLSPGLDTFVVMIHGFLGQGCLIEACQYFKEMVERGLLSGPQYGTLKELLNAL 500 Query: 1657 LRSDKLEMSKEVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQP 1836 LR+DKLEM+K+VW+CI+ KGC++NVYAWTIWIHALF NGHVKEACSYCLDMMDA VMPQP Sbjct: 501 LRADKLEMAKDVWTCIVNKGCEINVYAWTIWIHALFKNGHVKEACSYCLDMMDADVMPQP 560 Query: 1837 DTFAKLMRGLRKLYNRQFAAEITEKVRKMA 1926 DTFAKLMRGL+KLYNRQ AAEITEKVRKMA Sbjct: 561 DTFAKLMRGLKKLYNRQIAAEITEKVRKMA 590 >emb|CAN71515.1| hypothetical protein VITISV_021787 [Vitis vinifera] Length = 655 Score = 868 bits (2242), Expect = 0.0 Identities = 418/518 (80%), Positives = 463/518 (89%), Gaps = 1/518 (0%) Frame = +1 Query: 376 VEK-VYRILRKFHSRIPKLELALQESGIVVRSGLTERVLNRCGDAGNLGYRFFIWASKQP 552 +EK VYRILRKFHSR+PKLELALQESG+ VRSGLTERVLNRCGDAGNLGYRFF+WASKQP Sbjct: 90 IEKTVYRILRKFHSRVPKLELALQESGVAVRSGLTERVLNRCGDAGNLGYRFFVWASKQP 149 Query: 553 GYRHSYDVYKAMIKILGKMRQFGAVWALIEEMRKENPHLLSPEVFVVLMRRFASARMVKK 732 GYRHSY+VYKAMIKILGKMRQFGAVWALIEEMR+ENP +SP VFVVLMRRFASARMVKK Sbjct: 150 GYRHSYEVYKAMIKILGKMRQFGAVWALIEEMRRENPQFVSPYVFVVLMRRFASARMVKK 209 Query: 733 AIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAALLFEDMRLKFIPTIKHFTSLLYG 912 AIEVLDEMPKYGCEPDE+VFGCLLDALCKNGSVKEAA LFEDMR++F PT+KHFTSLLYG Sbjct: 210 AIEVLDEMPKYGCEPDEHVFGCLLDALCKNGSVKEAASLFEDMRIRFTPTLKHFTSLLYG 269 Query: 913 WCKEGKLMEAKFVLVKMREAGFEPDIVVYNNLLNGYAVAGKMVDAFVLLQEMKSKGCEPN 1092 WC+EGKLMEAK+VLV++REAGFEPDIVVYNNLL GYA AGKMVDA+ LL+EM+ K CEPN Sbjct: 270 WCREGKLMEAKYVLVQIREAGFEPDIVVYNNLLTGYAAAGKMVDAYDLLKEMRRKECEPN 329 Query: 1093 ATSFTIVVQALCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGEINRGYELLD 1272 SFT ++QALCA+ KMEEAMRVF EM+ GC AD VTYTTLISGFCKWG+I++GYELLD Sbjct: 330 VMSFTTLIQALCAKKKMEEAMRVFFEMQSCGCPADAVTYTTLISGFCKWGKISKGYELLD 389 Query: 1273 SMIQKGHTPNRTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIYNTVIRLACKLG 1452 
+MIQ+GH PN +YL+I+ AH M+KIG PDL IYN VIRLACKLG Sbjct: 390 NMIQQGHIPNPMTYLHIMAAHEKKEELEECIELMEEMRKIGCTPDLNIYNIVIRLACKLG 449 Query: 1453 EIKEAIRFWTQIEVNGMSPGVDTHVILINGLVEQGCLVEACDYFKEMVYRGLLSAPQYGT 1632 EIKE +R W ++E G+SPG+DT VI+I+G + Q CLVEAC++FKEMV RGLLSAPQYGT Sbjct: 450 EIKEGVRVWNEMEATGLSPGLDTFVIMIHGFLSQRCLVEACEFFKEMVGRGLLSAPQYGT 509 Query: 1633 LKDLLNSLLRSDKLEMSKEVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMM 1812 LK+LLNSLLR++KLEMSK+VWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMM Sbjct: 510 LKELLNSLLRAEKLEMSKDVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMM 569 Query: 1813 DAGVMPQPDTFAKLMRGLRKLYNRQFAAEITEKVRKMA 1926 DAGVMPQPDTFAKLMRGLRKLYNRQ AAEITEKVRKMA Sbjct: 570 DAGVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVRKMA 607 >ref|XP_004159605.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like [Cucumis sativus] Length = 664 Score = 855 bits (2208), Expect = 0.0 Identities = 400/545 (73%), Positives = 471/545 (86%) Frame = +1 Query: 292 RRGLDLIRIRTEPEPSSDSQTQDEFTADVEKVYRILRKFHSRIPKLELALQESGIVVRSG 471 R G I ++T P S+ + DEF+ DVEKVYRILRKFH+R+PKLELALQESG+++RSG Sbjct: 72 RGGFGPIHLKTTPHESAHDRDADEFSVDVEKVYRILRKFHTRVPKLELALQESGVIMRSG 131 Query: 472 LTERVLNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALIEEMR 651 L ERVL+RCGDAGNLGYRFF+WASKQPGYRHSY+VYKAMIK LGKMRQFGAVWALIEEMR Sbjct: 132 LPERVLSRCGDAGNLGYRFFVWASKQPGYRHSYEVYKAMIKTLGKMRQFGAVWALIEEMR 191 Query: 652 KENPHLLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSV 831 KENP++L+PEVF+VLMRRFAS RMVKKA+EVLDEMPKYGCEPDEYVFGCLLDALCKNGSV Sbjct: 192 KENPYMLTPEVFIVLMRRFASVRMVKKAVEVLDEMPKYGCEPDEYVFGCLLDALCKNGSV 251 Query: 832 KEAALLFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVYNNLL 1011 KEAA LFEDMR++F P ++HFTSLLYGWC+EGK+MEAK VLV+++EAGFEPDIVVYNNLL Sbjct: 252 KEAASLFEDMRVRFNPNLRHFTSLLYGWCREGKIMEAKHVLVQIKEAGFEPDIVVYNNLL 311 Query: 1012 NGYAVAGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALCAQNKMEEAMRVFSEMERSGCE 1191 GYA AGKM DAF LL EMK C PNA SFTI++Q+ C KM+EAMR+F+EM+ SGCE Sbjct: 312 GGYAQAGKMRDAFDLLAEMKKVNCGPNAASFTILIQSFCKTEKMDEAMRIFTEMQGSGCE 371 Query: 1192 
ADVVTYTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXXXXXX 1371 ADVVTYTTLISGFCKWG ++ YE+LD MIQKGH P++ SYL I++AH Sbjct: 372 ADVVTYTTLISGFCKWGNTDKAYEILDDMIQKGHDPSQLSYLCIMMAHEKKEELEECMEL 431 Query: 1372 XXXMQKIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILINGLVE 1551 M+KIG VPDL IYNT+IRL CKLG++KEA+R W +++ G++PG+DT++++++G + Sbjct: 432 IEEMRKIGCVPDLNIYNTMIRLVCKLGDLKEAVRLWGEMQAGGLNPGLDTYILMVHGFLS 491 Query: 1552 QGCLVEACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCIMTKGCDLNV 1731 QGCLVEACDYFKEMV RGLLSAPQYGTLK+L N+LLR++KLEM+K +WSC+ TKGC+LNV Sbjct: 492 QGCLVEACDYFKEMVERGLLSAPQYGTLKELTNALLRAEKLEMAKNMWSCMTTKGCELNV 551 Query: 1732 YAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFAAEITEK 1911 AWTIWIHALFSNGHVKEACSYCLDMMDA +MPQPDTFAKLMRGL+KL++RQ A EITEK Sbjct: 552 SAWTIWIHALFSNGHVKEACSYCLDMMDADLMPQPDTFAKLMRGLKKLFHRQLAVEITEK 611 Query: 1912 VRKMA 1926 VRKMA Sbjct: 612 VRKMA 616 >ref|XP_004149630.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like [Cucumis sativus] Length = 641 Score = 855 bits (2208), Expect = 0.0 Identities = 400/545 (73%), Positives = 471/545 (86%) Frame = +1 Query: 292 RRGLDLIRIRTEPEPSSDSQTQDEFTADVEKVYRILRKFHSRIPKLELALQESGIVVRSG 471 R G I ++T P S+ + DEF+ DVEKVYRILRKFH+R+PKLELALQESG+++RSG Sbjct: 49 RGGFGPIHLKTTPHESAHDRDADEFSVDVEKVYRILRKFHTRVPKLELALQESGVIMRSG 108 Query: 472 LTERVLNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALIEEMR 651 L ERVL+RCGDAGNLGYRFF+WASKQPGYRHSY+VYKAMIK LGKMRQFGAVWALIEEMR Sbjct: 109 LPERVLSRCGDAGNLGYRFFVWASKQPGYRHSYEVYKAMIKTLGKMRQFGAVWALIEEMR 168 Query: 652 KENPHLLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSV 831 KENP++L+PEVF+VLMRRFAS RMVKKA+EVLDEMPKYGCEPDEYVFGCLLDALCKNGSV Sbjct: 169 KENPYMLTPEVFIVLMRRFASVRMVKKAVEVLDEMPKYGCEPDEYVFGCLLDALCKNGSV 228 Query: 832 KEAALLFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVYNNLL 1011 KEAA LFEDMR++F P ++HFTSLLYGWC+EGK+MEAK VLV+++EAGFEPDIVVYNNLL Sbjct: 229 KEAASLFEDMRVRFNPNLRHFTSLLYGWCREGKIMEAKHVLVQIKEAGFEPDIVVYNNLL 288 Query: 1012 
NGYAVAGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALCAQNKMEEAMRVFSEMERSGCE 1191 GYA AGKM DAF LL EMK C PNA SFTI++Q+ C KM+EAMR+F+EM+ SGCE Sbjct: 289 GGYAQAGKMRDAFDLLAEMKKVNCGPNAASFTILIQSFCKTEKMDEAMRIFTEMQGSGCE 348 Query: 1192 ADVVTYTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXXXXXX 1371 ADVVTYTTLISGFCKWG ++ YE+LD MIQKGH P++ SYL I++AH Sbjct: 349 ADVVTYTTLISGFCKWGNTDKAYEILDDMIQKGHDPSQLSYLCIMMAHEKKEELEECMEL 408 Query: 1372 XXXMQKIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILINGLVE 1551 M+KIG VPDL IYNT+IRL CKLG++KEA+R W +++ G++PG+DT++++++G + Sbjct: 409 IEEMRKIGCVPDLNIYNTMIRLVCKLGDLKEAVRLWGEMQAGGLNPGLDTYILMVHGFLS 468 Query: 1552 QGCLVEACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCIMTKGCDLNV 1731 QGCLVEACDYFKEMV RGLLSAPQYGTLK+L N+LLR++KLEM+K +WSC+ TKGC+LNV Sbjct: 469 QGCLVEACDYFKEMVERGLLSAPQYGTLKELTNALLRAEKLEMAKNMWSCMTTKGCELNV 528 Query: 1732 YAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFAAEITEK 1911 AWTIWIHALFSNGHVKEACSYCLDMMDA +MPQPDTFAKLMRGL+KL++RQ A EITEK Sbjct: 529 SAWTIWIHALFSNGHVKEACSYCLDMMDADLMPQPDTFAKLMRGLKKLFHRQLAVEITEK 588 Query: 1912 VRKMA 1926 VRKMA Sbjct: 589 VRKMA 593 >ref|XP_002530608.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223529856|gb|EEF31788.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 596 Score = 832 bits (2150), Expect = 0.0 Identities = 405/617 (65%), Positives = 496/617 (80%), Gaps = 4/617 (0%) Frame = +1 Query: 79 MQTLSSKKSLVLCGKYAPLFSSAKRNTPRKEILHLVLYNESSNNRCLSSYSHLGLHQNPL 258 MQ LSSK ++ L K+ F+ ++H+ LY + + +NPL Sbjct: 1 MQRLSSK-TISLLNKHCCRFN----------LIHVQLYQKGQEP----------IDRNPL 39 Query: 259 NMNCFERIAEARRGLDLIRIRTEPEPSSD----SQTQDEFTADVEKVYRILRKFHSRIPK 426 + N R G ++ ++T+ +SD S DEF DVEKVYRILR FHSR+PK Sbjct: 40 SNNL-------RNGFGVVCLKTQENNTSDRDNSSSKVDEFAKDVEKVYRILRNFHSRVPK 92 Query: 427 LELALQESGIVVRSGLTERVLNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGK 606 LELALQESG+ +R+GLTERVLNRCGDAGNLGYRFF+WASKQPGYRHSY+ YKAM+KI K Sbjct: 93 
LELALQESGVTMRAGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSYENYKAMVKIFSK 152 Query: 607 MRQFGAVWALIEEMRKENPHLLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEY 786 MRQFGAVWAL+EEMRK+N L++ E+F+VL+RRFASAR+V+KAIEVLDEMPKYGCEPDEY Sbjct: 153 MRQFGAVWALLEEMRKDNSVLITSELFIVLIRRFASARLVEKAIEVLDEMPKYGCEPDEY 212 Query: 787 VFGCLLDALCKNGSVKEAALLFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMR 966 VFGCLLDALCKNGSVK+AA LFEDMR++F P+++HFTSLLYGWC+EGKL+EAK VLV+MR Sbjct: 213 VFGCLLDALCKNGSVKQAASLFEDMRVRFSPSLRHFTSLLYGWCREGKLIEAKHVLVQMR 272 Query: 967 EAGFEPDIVVYNNLLNGYAVAGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALCAQNKME 1146 EAGFEPDIVV+NNLL+ Y++AGKM DAF LL+EM KGCEPNA S+TI++QA C+Q KM+ Sbjct: 273 EAGFEPDIVVFNNLLSAYSMAGKMTDAFDLLKEMVRKGCEPNANSYTIMIQAFCSQEKMD 332 Query: 1147 EAMRVFSEMERSGCEADVVTYTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYIL 1326 EAMRVF EMER+GCEADVVTYT LISGFCKWG+INRGY++LD+M QKGH PN+ +YL IL Sbjct: 333 EAMRVFVEMERTGCEADVVTYTALISGFCKWGKINRGYQILDAMKQKGHMPNQLTYLRIL 392 Query: 1327 LAHXXXXXXXXXXXXXXXMQKIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMS 1506 LAH M+ +G VPDL+IYN VIRLACKLGE+K+ ++ W ++E + S Sbjct: 393 LAHEKKEELEECLELIESMRMVGCVPDLSIYNVVIRLACKLGEVKQGVQIWNEMEASDFS 452 Query: 1507 PGVDTHVILINGLVEQGCLVEACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSK 1686 P +DT VI+I+G + QGCLVEAC+YFKEM+ RGLL+ PQYG LK+LLN+LLR +KL M+K Sbjct: 453 PELDTFVIMIHGFLGQGCLVEACEYFKEMIGRGLLTTPQYGILKELLNALLRGEKLGMAK 512 Query: 1687 EVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGL 1866 +VWSCI+TKGC+LN AWTIWIH+LFSNGHVKEACSYCLDMM+A +MP+P+TFAKLMRGL Sbjct: 513 DVWSCIVTKGCELNADAWTIWIHSLFSNGHVKEACSYCLDMMEADIMPKPETFAKLMRGL 572 Query: 1867 RKLYNRQFAAEITEKVR 1917 RKLYNR+FAAEITEK++ Sbjct: 573 RKLYNREFAAEITEKIK 589 >ref|XP_004513407.1| PREDICTED: putative pentatricopeptide repeat-containing protein At5g65820-like isoform X1 [Cicer arietinum] gi|502165084|ref|XP_004513408.1| PREDICTED: putative pentatricopeptide repeat-containing protein At5g65820-like isoform X2 [Cicer arietinum] Length = 655 Score = 832 bits (2148), Expect = 0.0 Identities = 390/541 (72%), Positives = 
465/541 (85%), Gaps = 1/541 (0%) Frame = +1 Query: 307 LIRIRTEPEPSSDSQTQDEFTADVEKVYRILRKFHSRIPKLELALQESGIVVRSGLTERV 486 LI +++ +D + DEFT+DVEKVYRILRK+HSR+PKLELAL+ESG+VV SGLTERV Sbjct: 69 LIHLQSNANHFNDQNSDDEFTSDVEKVYRILRKYHSRVPKLELALKESGVVVSSGLTERV 128 Query: 487 LNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALIEEMRKENPH 666 LNRCG++GNL YRFF WASKQ GYRHS +VYKAMIK+L KMRQFGAVWALI+EMR ENP Sbjct: 129 LNRCGNSGNLAYRFFSWASKQSGYRHSEEVYKAMIKVLSKMRQFGAVWALIDEMRLENPQ 188 Query: 667 LLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAAL 846 L+SP VFV+LMRRFASARMV KAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGS+KEAA Sbjct: 189 LISPHVFVILMRRFASARMVHKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSIKEAAS 248 Query: 847 LFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVYNNLLNGYAV 1026 LFEDMR +F PT+KHFTSLLYGWCKEGKL+EAK VLV+M++AG EPDIVV+NNLL GYA Sbjct: 249 LFEDMRYRFPPTVKHFTSLLYGWCKEGKLVEAKHVLVQMKDAGIEPDIVVFNNLLGGYAQ 308 Query: 1027 AGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALCAQNKMEEAMRVFSEMERSGCEADVVT 1206 GKM DA+ LL+EMK KGCEPNA S+TI++Q+LC K+EEAMR+F EM+R+ C+ DV+T Sbjct: 309 GGKMADAYDLLKEMKRKGCEPNAASYTILIQSLCKHEKLEEAMRIFVEMQRNDCQMDVIT 368 Query: 1207 YTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXXXXXXXXXMQ 1386 YTTLISGFCKWG+I RGYELLD MIQ+GH+PN+ +YL+I+LAH M+ Sbjct: 369 YTTLISGFCKWGKIKRGYELLDQMIQEGHSPNQLTYLHIMLAHEKKEELEECMELVNEMK 428 Query: 1387 KIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILINGLVEQGCLV 1566 KIG VP+L IYNTVIRLACK GE+K+ +R W ++E +G+SPG DT V++ING +EQ CL+ Sbjct: 429 KIGCVPNLNIYNTVIRLACKFGEVKQGVRLWNEMEASGLSPGTDTFVVMINGFLEQDCLI 488 Query: 1567 EACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCI-MTKGCDLNVYAWT 1743 EAC+YFKEMV RGL +APQYGTLK+L+NSLLR++KLEM+K+ W+CI +K C++NV AWT Sbjct: 489 EACEYFKEMVGRGLFAAPQYGTLKELMNSLLRAEKLEMAKDTWNCITASKSCEMNVAAWT 548 Query: 1744 IWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFAAEITEKVRKM 1923 IWIHALFS GHVKEACS+C+DMMD +MPQPDTFAKL+RGL+KLYNR+FAAEITEKVRKM Sbjct: 549 IWIHALFSKGHVKEACSFCIDMMDNDLMPQPDTFAKLIRGLKKLYNREFAAEITEKVRKM 608 Query: 1924 A 1926 A Sbjct: 609 A 609 
>ref|XP_003546958.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like isoform X1 [Glycine max] gi|571514894|ref|XP_006597171.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like isoform X2 [Glycine max] gi|571514897|ref|XP_006597172.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like isoform X3 [Glycine max] Length = 654 Score = 825 bits (2131), Expect(2) = 0.0 Identities = 392/541 (72%), Positives = 463/541 (85%), Gaps = 1/541 (0%) Frame = +1 Query: 307 LIRIRTEPEPSSDSQTQDEFTADVEKVYRILRKFHSRIPKLELALQESGIVVRSGLTERV 486 LIR++ +D T DEF +DVEKVYRILRK+HSR+PKLELAL+ESG+VVR GLTERV Sbjct: 69 LIRLQEISINHTDDHTHDEFASDVEKVYRILRKYHSRVPKLELALRESGVVVRPGLTERV 128 Query: 487 LNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALIEEMRKENPH 666 L+RCGDAGNL YRF+ WASKQ G+R +D YKAMIK+L +MRQFGAVWALIEEMR+ENPH Sbjct: 129 LSRCGDAGNLAYRFYSWASKQSGHRLDHDAYKAMIKVLSRMRQFGAVWALIEEMRQENPH 188 Query: 667 LLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAAL 846 L++P+VFV+LMRRFASARMV KA+EVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAA Sbjct: 189 LITPQVFVILMRRFASARMVHKAVEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAAS 248 Query: 847 LFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVYNNLLNGYAV 1026 LFEDMR ++ P++KHFTSLLYGWCKEGKLMEAK VLV+M++ G EPDIVVYNNLL GYA Sbjct: 249 LFEDMRYRWKPSVKHFTSLLYGWCKEGKLMEAKHVLVQMKDMGIEPDIVVYNNLLGGYAQ 308 Query: 1027 AGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALCAQNKMEEAMRVFSEMERSGCEADVVT 1206 AGKM DA+ LL+EM+ K CEPNATS+T+++Q+LC ++EEA R+F EM+ +GC+ADVVT Sbjct: 309 AGKMGDAYDLLKEMRRKRCEPNATSYTVLIQSLCKHERLEEATRLFVEMQTNGCQADVVT 368 Query: 1207 YTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXXXXXXXXXMQ 1386 Y+TLISGFCKWG+I RGYELLD MIQ+GH PN+ Y +I+LAH MQ Sbjct: 369 YSTLISGFCKWGKIKRGYELLDEMIQQGHFPNQVIYQHIMLAHEKKEELEECKELVNEMQ 428 Query: 1387 KIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILINGLVEQGCLV 1566 KIG PDL+IYNTVIRLACKLGE+KE I+ W ++E +G+SPG+DT VI+ING +EQGCLV Sbjct: 429 KIGCAPDLSIYNTVIRLACKLGEVKEGIQLWNEMESSGLSPGMDTFVIMINGFLEQGCLV 488 Query: 1567 
EACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCI-MTKGCDLNVYAWT 1743 EAC+YFKEMV RGL +APQYGTLK+L+NSLLR++KLEM+K+ W+CI +KGC LNV AWT Sbjct: 489 EACEYFKEMVGRGLFTAPQYGTLKELMNSLLRAEKLEMAKDAWNCITASKGCQLNVSAWT 548 Query: 1744 IWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFAAEITEKVRKM 1923 IWIHALFS GHVKEACS+C+DMMD +MP PDTFAKLM GL+KLYNRQFAAEITEKVRKM Sbjct: 549 IWIHALFSKGHVKEACSFCIDMMDKDLMPNPDTFAKLMHGLKKLYNRQFAAEITEKVRKM 608 Query: 1924 A 1926 A Sbjct: 609 A 609 Score = 25.0 bits (53), Expect(2) = 0.0 Identities = 12/37 (32%), Positives = 21/37 (56%) Frame = +2 Query: 116 AVSTLLYFHLPNETHRGRKYCTSSSTMNHQTIVVFPP 226 A+S+LL + +ET +CT+S + +T + PP Sbjct: 13 AISSLLSLVIRHETTVCHFFCTTSEVSSSRTSSLLPP 49 >ref|XP_006426145.1| hypothetical protein CICLE_v10025134mg [Citrus clementina] gi|557528135|gb|ESR39385.1| hypothetical protein CICLE_v10025134mg [Citrus clementina] Length = 638 Score = 822 bits (2122), Expect = 0.0 Identities = 391/547 (71%), Positives = 469/547 (85%), Gaps = 6/547 (1%) Frame = +1 Query: 304 DLIRIRTEPEPSSDSQTQD------EFTADVEKVYRILRKFHSRIPKLELALQESGIVVR 465 +L+ ++T+ + + T D EF+ DVEK++RIL+KFHSR+PKLELALQ SG+V+R Sbjct: 44 NLVCLKTKEDDCKCNNTTDTHGSHNEFSHDVEKIFRILKKFHSRLPKLELALQHSGVVLR 103 Query: 466 SGLTERVLNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALIEE 645 GLTERV+NRCGDAGNLGYR+++WASKQP Y HSYDVY+A+IK L KMR+FGAVWAL+EE Sbjct: 104 PGLTERVINRCGDAGNLGYRYYMWASKQPNYVHSYDVYRALIKSLSKMRKFGAVWALMEE 163 Query: 646 MRKENPHLLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNG 825 MRKE P L++ EVFV+LMRRFASARMVKKAIEVLDEMPKYGCEPDE+VFGCLLDALCKN Sbjct: 164 MRKEKPQLITTEVFVILMRRFASARMVKKAIEVLDEMPKYGCEPDEFVFGCLLDALCKNS 223 Query: 826 SVKEAALLFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVYNN 1005 SVKEAA LF++MR +F P+++HFTSLLYGWCKEGKL+EAK+VLV+M++AGFEPDIVVYNN Sbjct: 224 SVKEAAKLFDEMRERFKPSLRHFTSLLYGWCKEGKLVEAKYVLVQMKDAGFEPDIVVYNN 283 Query: 1006 LLNGYAVAGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALCAQNKMEEAMRVFSEMERSG 1185 LL+GYA GKM DAF LL+EM+ KGC+PNA S+T+++QALC KMEEA R F EMERSG Sbjct: 284 
LLSGYAQMGKMTDAFELLKEMRRKGCDPNANSYTVLIQALCRMEKMEEANRAFVEMERSG 343 Query: 1186 CEADVVTYTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXXXX 1365 CEADVVTYTTLISGFCK +I+R YE+LDSMIQ+G PN+ +YL+I+LAH Sbjct: 344 CEADVVTYTTLISGFCKSRKIDRCYEILDSMIQRGILPNQLTYLHIMLAHEKKEELEECV 403 Query: 1366 XXXXXMQKIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILINGL 1545 M+KIG VPD++ YN VIRLACKLGE+KEA+ W ++E +SPG D+ V++++G Sbjct: 404 ELMGEMRKIGCVPDVSNYNVVIRLACKLGELKEAVNVWNEMEAASLSPGTDSFVVMVHGF 463 Query: 1546 VEQGCLVEACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCIMTKGCDL 1725 + QGCL+EAC+YFKEMV RGLLSAPQYGTLK+LLNSLLR+ K+EM+K+VWSCI+TKGC+L Sbjct: 464 LGQGCLIEACEYFKEMVGRGLLSAPQYGTLKELLNSLLRAQKVEMAKDVWSCIVTKGCEL 523 Query: 1726 NVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFAAEIT 1905 NVYAWTIWIH+LFSNGHVKEACSYCLDMMDA VMPQPDTFAKLMRGL+KLYNRQ AAEIT Sbjct: 524 NVYAWTIWIHSLFSNGHVKEACSYCLDMMDADVMPQPDTFAKLMRGLKKLYNRQIAAEIT 583 Query: 1906 EKVRKMA 1926 EKVRKMA Sbjct: 584 EKVRKMA 590 >ref|XP_006595472.1| PREDICTED: putative pentatricopeptide repeat-containing protein At5g65820-like, partial [Glycine max] Length = 656 Score = 821 bits (2120), Expect = 0.0 Identities = 398/593 (67%), Positives = 483/593 (81%), Gaps = 8/593 (1%) Frame = +1 Query: 172 ILHLVLYNESSNNRCLSSYSHLGLHQNPLNM-------NCFERIAEARRGLDLIRIRTEP 330 +L LV+ +E++ + S L Q P + + F+ A + IR++ Sbjct: 20 LLSLVIRHENTLCHFFCTTSELSSSQTPSSQLPPPHFKSTFDNNALTNQ-FGFIRLQEIS 78 Query: 331 EPSSDSQTQDEFTADVEKVYRILRKFHSRIPKLELALQESGIVVRSGLTERVLNRCGDAG 510 +D QT DEF +DVEKVYRILRK+HSR+PKLELAL+ESG+VVR GLTERVLNRCGDAG Sbjct: 79 INHTDDQTHDEFASDVEKVYRILRKYHSRVPKLELALRESGVVVRPGLTERVLNRCGDAG 138 Query: 511 NLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALIEEMRKENPHLLSPEVFV 690 NL YRF+ WASKQ G+R +D YKAMIK+L +MRQFGAVWALIEEMR+ENPHL++P+VFV Sbjct: 139 NLAYRFYSWASKQSGHRLDHDAYKAMIKVLSRMRQFGAVWALIEEMRQENPHLITPQVFV 198 Query: 691 VLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAALLFEDMRLK 870 +LMRRFASARMV KA++VLDEMP YGCEPDEYVFGCLLDAL KNGSVKEAA LFE++R + Sbjct: 199 
ILMRRFASARMVHKAVQVLDEMPNYGCEPDEYVFGCLLDALRKNGSVKEAASLFEELRYR 258 Query: 871 FIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVYNNLLNGYAVAGKMVDAF 1050 + P++KHFTSLLYGWCKEGKLMEAK VLV+M++AG EPDIVVYNNLL GYA A KM DA+ Sbjct: 259 WKPSVKHFTSLLYGWCKEGKLMEAKHVLVQMKDAGIEPDIVVYNNLLGGYAQADKMGDAY 318 Query: 1051 VLLQEMKSKGCEPNATSFTIVVQALCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGF 1230 LL+EM+ KGCEPNATS+T+++Q+LC ++EEA RVF EM+R+GC+AD+VTY+TLISGF Sbjct: 319 DLLKEMRRKGCEPNATSYTVLIQSLCKHERLEEATRVFVEMQRNGCQADLVTYSTLISGF 378 Query: 1231 CKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDL 1410 CKWG+I RGYELLD MIQ+GH PN+ Y +I++AH MQKIG PDL Sbjct: 379 CKWGKIKRGYELLDEMIQQGHFPNQVIYQHIMVAHEKKEELEECKELVNEMQKIGCAPDL 438 Query: 1411 TIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILINGLVEQGCLVEACDYFKE 1590 +IYNTVIRLACKLGE+KE +R W ++E +G+SP +DT VI+ING +EQGCLVEAC+YFKE Sbjct: 439 SIYNTVIRLACKLGEVKEGVRLWNEMESSGLSPSIDTFVIMINGFLEQGCLVEACEYFKE 498 Query: 1591 MVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCI-MTKGCDLNVYAWTIWIHALFS 1767 MV RGL +APQYGTLK+L+NSLLR++KLEM+K+ W+CI +KGC LNV AWTIWIHALFS Sbjct: 499 MVGRGLFAAPQYGTLKELMNSLLRAEKLEMAKDAWNCITASKGCQLNVSAWTIWIHALFS 558 Query: 1768 NGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFAAEITEKVRKMA 1926 GHVKEACS+C+ MMD +MPQPDTFAKLMRGL+KLYNR+FAAEITEKVRKMA Sbjct: 559 KGHVKEACSFCIAMMDKDLMPQPDTFAKLMRGLKKLYNREFAAEITEKVRKMA 611 >ref|XP_006466418.1| PREDICTED: putative pentatricopeptide repeat-containing protein At5g65820-like [Citrus sinensis] Length = 638 Score = 818 bits (2113), Expect = 0.0 Identities = 390/547 (71%), Positives = 467/547 (85%), Gaps = 6/547 (1%) Frame = +1 Query: 304 DLIRIRTEPEPSSDSQTQD------EFTADVEKVYRILRKFHSRIPKLELALQESGIVVR 465 +L+ ++T+ + T D EF+ DVEK++RIL+KFHSR+PKLELALQ SG+V+R Sbjct: 44 NLVCLKTKEDDCKCDNTTDTHGSHNEFSHDVEKIFRILKKFHSRLPKLELALQHSGVVLR 103 Query: 466 SGLTERVLNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALIEE 645 GLTERV+NRCGDAGNLGYR+++WASKQP Y HSYDVY+A+IK L KMR+FGAVWAL+EE Sbjct: 104 PGLTERVINRCGDAGNLGYRYYMWASKQPNYVHSYDVYRALIKSLSKMRKFGAVWALMEE 163 Query: 646 
MRKENPHLLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNG 825 MRKE P L++ EVFV+LMRRFASARMVKKAIEVLDEMPKYGCEPDE+VFGCLLDALCKN Sbjct: 164 MRKEKPQLITTEVFVILMRRFASARMVKKAIEVLDEMPKYGCEPDEFVFGCLLDALCKNS 223 Query: 826 SVKEAALLFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVYNN 1005 SVKEAA LF+++R +F P+++HFTSLLYGWCKEGKL+EAK+VLV+M++AGFEPDIVVYNN Sbjct: 224 SVKEAAKLFDEIRERFKPSLRHFTSLLYGWCKEGKLVEAKYVLVQMKDAGFEPDIVVYNN 283 Query: 1006 LLNGYAVAGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALCAQNKMEEAMRVFSEMERSG 1185 LL+GYA GKM DAF LL+EM+ KGC+PNA S+T+++QALC KMEEA R F EMERSG Sbjct: 284 LLSGYAQMGKMTDAFELLKEMRRKGCDPNANSYTVLIQALCRMEKMEEANRAFVEMERSG 343 Query: 1186 CEADVVTYTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXXXX 1365 CEADVVTYTTLISGFCK +I+R YE+LDSMIQ+G PN+ +YL+I+LAH Sbjct: 344 CEADVVTYTTLISGFCKSRKIDRCYEILDSMIQRGILPNQLTYLHIMLAHEKKEELEECV 403 Query: 1366 XXXXXMQKIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILINGL 1545 M+KIG VPD++ YN VIRLACKLGE+KEA+ W ++E +SPG D+ V++++G Sbjct: 404 ELMGEMRKIGCVPDVSNYNVVIRLACKLGELKEAVNVWNEMEAASLSPGTDSFVVMVHGF 463 Query: 1546 VEQGCLVEACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCIMTKGCDL 1725 + QGCL+EAC+YFKEMV RGLLSAPQYGTLK LLNSLLR+ K+EM+K+VWSCI+TKGC+L Sbjct: 464 LGQGCLIEACEYFKEMVGRGLLSAPQYGTLKALLNSLLRAQKVEMAKDVWSCIVTKGCEL 523 Query: 1726 NVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFAAEIT 1905 NVYAWTIWIH+LFSNGHVKEACSYCLDMMDA VMPQPDTFAKLMRGL+KLYNRQ AAEIT Sbjct: 524 NVYAWTIWIHSLFSNGHVKEACSYCLDMMDADVMPQPDTFAKLMRGLKKLYNRQIAAEIT 583 Query: 1906 EKVRKMA 1926 EKVRKMA Sbjct: 584 EKVRKMA 590 >gb|EOX91773.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao] Length = 647 Score = 803 bits (2075), Expect = 0.0 Identities = 393/613 (64%), Positives = 480/613 (78%), Gaps = 2/613 (0%) Frame = +1 Query: 94 SKKSLVLCGKYAPLFSSAKRNTPRKEILHLVLYNESSNNRCLSSYSHLGLHQNPLNMNCF 273 S K+L L + L S+ NT H++ N ++NN + N LN+ Sbjct: 5 SSKTLCLIARQRHLSLSSYPNTYH---FHILPDNNNNNN-----------NSNSLNLLS- 49 Query: 274 ERIAEARRGLDLIRIRT-EPEPSSDSQTQ-DEFTADVEKVYRILRKFHSRIPKLELALQE 
447 + ++ G L+ + T +P SD+ Q D+F +DVEK+YRILRKFH+R+PKL LALQ+ Sbjct: 50 ---SNSKSGFGLVTLETKQPTLKSDNDQQTDDFASDVEKIYRILRKFHTRVPKLNLALQQ 106 Query: 448 SGIVVRSGLTERVLNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGAV 627 SG+V R GLTERVLNRCGDAGNLGY+FF WASKQPGY SY++YKAMIKILGKMRQFGAV Sbjct: 107 SGVVFRPGLTERVLNRCGDAGNLGYKFFTWASKQPGYHPSYEIYKAMIKILGKMRQFGAV 166 Query: 628 WALIEEMRKENPHLLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLLD 807 WALIEE+++ENPH ++ E+F++L+RRFAS+RMVKKAIEV DEMPKYGC D+ VFG LLD Sbjct: 167 WALIEEIKRENPHFITAELFILLIRRFASSRMVKKAIEVFDEMPKYGCLQDDAVFGSLLD 226 Query: 808 ALCKNGSVKEAALLFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEPD 987 ALCKNG+VKEAAL+FE+MR++F+P +KHFTSLLYGWCKEG+++EAK VLV+M+EAGFEPD Sbjct: 227 ALCKNGNVKEAALVFEEMRVRFLPNLKHFTSLLYGWCKEGRILEAKHVLVQMKEAGFEPD 286 Query: 988 IVVYNNLLNGYAVAGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALCAQNKMEEAMRVFS 1167 IVV+NNLL+GY + KM DAF LL+EM+ KG +PNA S+TIV+Q LC ++MEEAMRVF Sbjct: 287 IVVFNNLLSGYVLGNKMGDAFDLLKEMRKKGIDPNANSYTIVIQGLCKADRMEEAMRVFV 346 Query: 1168 EMERSGCEADVVTYTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXXXX 1347 +MER+GC DVV YTTLISGFCKWG + +GYE+LD MI +G PN +YL+I+LAH Sbjct: 347 DMERNGCRGDVVVYTTLISGFCKWGRVEKGYEVLDRMISEGLMPNSLTYLHIMLAHEKKD 406 Query: 1348 XXXXXXXXXXXMQKIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTHV 1527 M+KIG VPD IYN V+RLACKL E+KEA R W ++E G SPGVD + Sbjct: 407 ELEECLELMEEMRKIGCVPDGGIYNVVVRLACKLEEVKEAARVWNEMEGRGFSPGVDNFI 466 Query: 1528 ILINGLVEQGCLVEACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCIM 1707 ++I+G + QGCLVEAC+YFKEM RGL PQYG LKDLLNSLLR++KLEM+K VWSCI+ Sbjct: 467 VMIHGFIGQGCLVEACEYFKEMAGRGLFCVPQYGILKDLLNSLLRAEKLEMAKNVWSCIV 526 Query: 1708 TKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQ 1887 +KGC+LNV AWTIW+HALFS GHVKEACSYCL+MMD VMPQPDTFAKLMRGLRKLYNRQ Sbjct: 527 SKGCELNVSAWTIWVHALFSKGHVKEACSYCLEMMDVDVMPQPDTFAKLMRGLRKLYNRQ 586 Query: 1888 FAAEITEKVRKMA 1926 AAEITEKVRKMA Sbjct: 587 IAAEITEKVRKMA 599 >gb|EPS62602.1| hypothetical protein M569_12187, partial [Genlisea aurea] Length = 593 Score = 781 bits 
(2018), Expect = 0.0 Identities = 387/551 (70%), Positives = 457/551 (82%), Gaps = 7/551 (1%) Frame = +1 Query: 295 RGLDLIRIRTEPEPSSDS-----QTQDEFTADVEKVYRILRKFHSRIPKLELALQESGIV 459 RG DLIRI + + S D+F+ADVEKVY+ILRKF+S++PKLELALQ SG+ Sbjct: 3 RGFDLIRIEEDEQQQDCSVGRRNNISDDFSADVEKVYKILRKFNSKVPKLELALQHSGVS 62 Query: 460 VRSGLTERVLNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALI 639 VRSGLTERVLNRCGDAGNLGYRFF+WASKQPGY HS+DVYKAMI+ILGKMRQFGAVWALI Sbjct: 63 VRSGLTERVLNRCGDAGNLGYRFFVWASKQPGYNHSHDVYKAMIRILGKMRQFGAVWALI 122 Query: 640 EEMRKENPHLLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCK 819 EEMRKENP LL+PEVF+VLMRRFASARMVKKA+EVLDEMP YGCEPDEYVFGCLLDALCK Sbjct: 123 EEMRKENPQLLTPEVFIVLMRRFASARMVKKAVEVLDEMPSYGCEPDEYVFGCLLDALCK 182 Query: 820 NGSVKEAALLFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVY 999 NGSVKEA+LL EDM+++F PT+KHFTSLL+GWC+EGKL+EAK VL KMREAGF PDIVVY Sbjct: 183 NGSVKEASLLMEDMQMRFKPTMKHFTSLLHGWCREGKLIEAKTVLQKMREAGFLPDIVVY 242 Query: 1000 NNLLNGYAVAGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALCAQNKMEEAMRVFSEMER 1179 N LL GYA AGK+ DA LL EM+ C P ATS+T V+++LCA+ KM EA+++FSEME Sbjct: 243 NTLLAGYAAAGKIADARHLLLEMRRNSCRPTATSYTAVIRSLCAREKMAEAVQLFSEMEA 302 Query: 1180 SGCEADVVTYTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXX 1359 GCEADVV YTTLISGFCK G+ +GYELLD+MI+KG TPN T+Y Y++ AH Sbjct: 303 DGCEADVVAYTTLISGFCKRGKTGKGYELLDAMIRKGITPNNTTYSYLISAHEKEEELEE 362 Query: 1360 XXXXXXXMQKIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILIN 1539 M++IG+ PD +YN VIRL+CKLGE+++ IR ++E +G+SPGVDT VILIN Sbjct: 363 CLGLAKSMRQIGVTPDSAVYNPVIRLSCKLGEVEDGIRLMNEMEEDGISPGVDTFVILIN 422 Query: 1540 GLVEQGCLVEACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCIMT-KG 1716 GL+ G L EAC F+EMV RGL++APQYG LKDLLNSLLR KL++SK+VWS ++T KG Sbjct: 423 GLILHGHLDEACLRFEEMVGRGLVAAPQYGLLKDLLNSLLRCGKLQLSKDVWSKMVTSKG 482 Query: 1717 -CDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFA 1893 CD+NVYAWTIWIHAL S G+VKEAC Y L+MM+AG+MPQPDTFAKL+RGLRKLYNR+ A Sbjct: 483 CCDVNVYAWTIWIHALLSKGYVKEACFYGLEMMEAGLMPQPDTFAKLIRGLRKLYNREIA 542 Query: 1894 
AEITEKVRKMA 1926 AEITEKV++MA Sbjct: 543 AEITEKVKRMA 553 >ref|XP_006404107.1| hypothetical protein EUTSA_v10010190mg [Eutrema salsugineum] gi|557105226|gb|ESQ45560.1| hypothetical protein EUTSA_v10010190mg [Eutrema salsugineum] Length = 645 Score = 774 bits (1998), Expect = 0.0 Identities = 369/529 (69%), Positives = 439/529 (82%), Gaps = 3/529 (0%) Frame = +1 Query: 349 QTQDEFTADVEKVYRILRKFHSRIPKLELALQESGIVVRSGLTERVLNRCGDAGNLGYRF 528 Q +DEF DVEK+YRILR +HSR+PKLEL L ESGI +R GL RVL+RCGDAGNLGYRF Sbjct: 64 QQEDEFAGDVEKIYRILRNYHSRVPKLELVLHESGINLRPGLIVRVLSRCGDAGNLGYRF 123 Query: 529 FIWASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALIEEMRKENPHLLSPEVFVVLMRRF 708 F+WA+KQPGY HSY+V K+M+KIL KMRQFGAVWALIEEMRKENP L+ PE+FVVLMRRF Sbjct: 124 FLWAAKQPGYCHSYEVCKSMVKILSKMRQFGAVWALIEEMRKENPQLIEPELFVVLMRRF 183 Query: 709 ASARMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAALLFEDMRLKFIPTIK 888 ASA MVKKA+EVLDEMPKYG EPDEY+FGCLLDALCKNGSVK+A+ LFEDMR KF P ++ Sbjct: 184 ASANMVKKAVEVLDEMPKYGIEPDEYIFGCLLDALCKNGSVKDASKLFEDMRDKFPPNLR 243 Query: 889 HFTSLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVYNNLLNGYAVAGKMVDAFVLLQEM 1068 +FTSLLYGWC+EGKL+EAK VLV+M+EAG EPDIVV+ NLL+GYA AGKM DA+ L+++M Sbjct: 244 YFTSLLYGWCREGKLIEAKHVLVQMKEAGLEPDIVVFTNLLSGYAHAGKMADAYDLMKDM 303 Query: 1069 KSKGCEPNATSFTIVVQALCAQNK-MEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGE 1245 + +G EPNA +T+++QALC K M+EAMRVF EMER GCEAD+VTYT LISGFCKWG Sbjct: 304 RRRGYEPNANCYTVLIQALCKMEKRMDEAMRVFVEMERYGCEADIVTYTALISGFCKWGM 363 Query: 1246 INRGYELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIYNT 1425 I++GY +LD M +KG P + +Y+ I++AH M++ G +PDL IYN Sbjct: 364 IDKGYSVLDDMRKKGVMPLQVTYMQIMVAHEKKEQFEECLDLIEKMKQNGCLPDLLIYNV 423 Query: 1426 VIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILINGLVEQGCLVEACDYFKEMVYRG 1605 VIRLACKLGE+KEA+R W ++E NG+SPGVDT VI+ING QGCL+EACD+FKEMV RG Sbjct: 424 VIRLACKLGEVKEAVRLWNEMEANGLSPGVDTFVIMINGFASQGCLIEACDHFKEMVSRG 483 Query: 1606 LLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCIMTK--GCDLNVYAWTIWIHALFSNGHV 1779 + SAP YGTLK LLN+L+R DKLEM+K+VWSC+ K C+LNV AWTIWIHALF+ GHV Sbjct: 484 
IFSAPHYGTLKILLNTLVRDDKLEMAKDVWSCLSNKSSSCELNVSAWTIWIHALFARGHV 543 Query: 1780 KEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFAAEITEKVRKMA 1926 KEACSYCLDMM+ +MPQPDT+AKLM+GL KLYNR AAEITEKVRKMA Sbjct: 544 KEACSYCLDMMEMDLMPQPDTYAKLMKGLNKLYNRTIAAEITEKVRKMA 592 >ref|XP_006393982.1| hypothetical protein EUTSA_v10003830mg [Eutrema salsugineum] gi|557090621|gb|ESQ31268.1| hypothetical protein EUTSA_v10003830mg [Eutrema salsugineum] Length = 620 Score = 760 bits (1962), Expect = 0.0 Identities = 361/544 (66%), Positives = 445/544 (81%), Gaps = 1/544 (0%) Frame = +1 Query: 298 GLDLIRIRTEPEPSSDSQTQDEFTADVEKVYRILRKFHSRIPKLELALQESGIVVRSGLT 477 G L+ + + + + DEF +DVEK YRILRKFHSR+PKLELAL ESG+ +R GL Sbjct: 50 GTGLVCLDKSHKERTKNSNHDEFASDVEKAYRILRKFHSRVPKLELALNESGVELRPGLI 109 Query: 478 ERVLNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALIEEMRKE 657 ERVLNRCGDAGNLGYRFF+WA+KQPGY HSY VYK+M+KIL KMR F AVWALIEEMRKE Sbjct: 110 ERVLNRCGDAGNLGYRFFVWAAKQPGYCHSYQVYKSMVKILSKMRHFEAVWALIEEMRKE 169 Query: 658 NPHLLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKE 837 NP L+ PE+FVVL+RRFAS+ MVKKAIEVLDEMPK+G EPDEYVFGCLLDALCKNGSVK+ Sbjct: 170 NPQLIEPELFVVLVRRFASSNMVKKAIEVLDEMPKFGLEPDEYVFGCLLDALCKNGSVKD 229 Query: 838 AALLFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVYNNLLNG 1017 AA LFE+MRL+F P +++FTSLLYGWC+EGK+MEA+ VLV+M+EA FEPD+VVY NLL+G Sbjct: 230 AAKLFEEMRLRFPPNLRYFTSLLYGWCREGKMMEAEHVLVEMKEARFEPDVVVYTNLLSG 289 Query: 1018 YAVAGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALCAQNKMEEAMRVFSEMERSGCEAD 1197 YA AGKM +A+ LL++M+ +G EPNA +T+++QALC ++MEEAMRVF EMER CEAD Sbjct: 290 YAHAGKMAEAYDLLKDMRRRGFEPNANCYTVLIQALCKVDRMEEAMRVFVEMERYECEAD 349 Query: 1198 VVTYTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXXXXXXXX 1377 +VTY L+SGFCKWG+I++ Y +LD MI+K P++ +Y++I+ AH Sbjct: 350 IVTYNALVSGFCKWGKIDKCYSVLDDMIKKCLMPSQLTYMHIMAAHEKKEKFEECLELME 409 Query: 1378 XMQKIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILINGLVEQG 1557 M++IG DL +YN VIRLACKLGE+KEA+R W ++E +G+SPGVDT VI+I+GL QG Sbjct: 410 
KMKEIGYHLDLGVYNVVIRLACKLGEVKEAVRLWNEMEASGLSPGVDTFVIMIDGLTNQG 469 Query: 1558 CLVEACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCIMTKG-CDLNVY 1734 CL+EACD+FK MV RGL S PQYGTLK LLN+LLR KLE +K++WSCIM++G C+LNV Sbjct: 470 CLLEACDHFKVMVSRGLFSVPQYGTLKSLLNALLRDGKLETAKDIWSCIMSEGSCELNVS 529 Query: 1735 AWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFAAEITEKV 1914 +WTIWIHALFS G+VK+ACSYCL+MM+ M QPDTFAKLM+GL+KLYNR+FA EITEKV Sbjct: 530 SWTIWIHALFSKGYVKDACSYCLEMMEMDFMLQPDTFAKLMKGLKKLYNREFAVEITEKV 589 Query: 1915 RKMA 1926 R MA Sbjct: 590 RNMA 593 >ref|XP_002866691.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297312526|gb|EFH42950.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 638 Score = 757 bits (1954), Expect = 0.0 Identities = 362/524 (69%), Positives = 439/524 (83%), Gaps = 1/524 (0%) Frame = +1 Query: 358 DEFTADVEKVYRILRKFHSRIPKLELALQESGIVVRSGLTERVLNRCGDAGNLGYRFFIW 537 DEF +DVEK YRILRKFHSR+PKLELAL ESG+ +R GL ERVLNRCGDAGNLGYRFF+W Sbjct: 78 DEFASDVEKAYRILRKFHSRVPKLELALNESGVELRPGLIERVLNRCGDAGNLGYRFFVW 137 Query: 538 ASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALIEEMRKENPHLLSPEVFVVLMRRFASA 717 A+KQP Y HS +VYK+M+KIL KMRQFGAVW LIEEMRKENP L+ PE+FVVL++RFASA Sbjct: 138 AAKQPRYCHSIEVYKSMVKILSKMRQFGAVWGLIEEMRKENPQLIEPELFVVLVQRFASA 197 Query: 718 RMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAALLFEDMRLKFIPTIKHFT 897 MVKKAIEVLDEMP +G EPDEYVFGCLLDALCK+GSVK+AA LFEDMRL+F +++FT Sbjct: 198 DMVKKAIEVLDEMPTFGLEPDEYVFGCLLDALCKHGSVKDAAKLFEDMRLRFPVNLRYFT 257 Query: 898 SLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVYNNLLNGYAVAGKMVDAFVLLQEMKSK 1077 SLLYGWC+E K+MEAK+VLV+M+EAGFEPDIV Y NLL+GYA AGKM DA+ LL++M+ + Sbjct: 258 SLLYGWCREEKMMEAKYVLVQMKEAGFEPDIVDYTNLLSGYANAGKMADAYDLLKDMRRR 317 Query: 1078 GCEPNATSFTIVVQALCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGEINRG 1257 G EPNAT +T+++QALC ++MEEAM+VF EMER CEADVVTYT L+SGFCKWG+I++ Sbjct: 318 GFEPNATCYTVLIQALCKVDRMEEAMKVFVEMERYECEADVVTYTALVSGFCKWGKIDKC 377 Query: 1258 
YELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIYNTVIRL 1437 Y +LD MI+KG P++ +Y++I+ AH M++I PD+ IYN VIRL Sbjct: 378 YLVLDDMIKKGLMPSQLTYMHIMAAHEKKEKLIECLELMEKMKQIEYHPDIGIYNVVIRL 437 Query: 1438 ACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILINGLVEQGCLVEACDYFKEMVYRGLLSA 1617 ACKLGE+KEA+R W ++E NG+SPG DT VI+INGL QGCL+EACD+FKEMV RGL S Sbjct: 438 ACKLGEVKEAVRLWNEMEGNGLSPGADTFVIIINGLTSQGCLLEACDHFKEMVARGLFSV 497 Query: 1618 PQYGTLKDLLNSLLRSDKLEMSKEVWSCIMTKG-CDLNVYAWTIWIHALFSNGHVKEACS 1794 PQYGTLK LLN+LL+ KLEM+K+VWSCI +KG C+L+V +WTIWIHALFS G+ KEACS Sbjct: 498 PQYGTLKLLLNTLLKDKKLEMAKDVWSCITSKGSCELSVSSWTIWIHALFSKGYEKEACS 557 Query: 1795 YCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFAAEITEKVRKMA 1926 YCL+M++ MPQPDTFAKLM+GL+KLY+R+FA EITEKVR MA Sbjct: 558 YCLEMIELEFMPQPDTFAKLMKGLKKLYHREFAVEITEKVRNMA 601 >ref|NP_190542.4| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|218546755|sp|P0C8A0.1|PP275_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At3g49730 gi|332645062|gb|AEE78583.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 638 Score = 756 bits (1953), Expect = 0.0 Identities = 369/557 (66%), Positives = 444/557 (79%), Gaps = 3/557 (0%) Frame = +1 Query: 265 NCFERIAEARRGLDLIRIRTEPEPSSDSQTQDEFTADVEKVYRILRKFHSRIPKLELALQ 444 N F E + G+ L+ PE + +DEF +VEK+YRILR HSR+PKLELAL Sbjct: 39 NDFVESTERKNGVGLVC----PE-----KHEDEFAGEVEKIYRILRNHHSRVPKLELALN 89 Query: 445 ESGIVVRSGLTERVLNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGA 624 ESGI +R GL RVL+RCGDAGNLGYRFF+WA+KQPGY HSY+V K+M+ IL KMRQFGA Sbjct: 90 ESGIDLRPGLIIRVLSRCGDAGNLGYRFFLWATKQPGYFHSYEVCKSMVMILSKMRQFGA 149 Query: 625 VWALIEEMRKENPHLLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLL 804 VW LIEEMRK NP L+ PE+FVVLMRRFASA MVKKA+EVLDEMPKYG EPDEYVFGCLL Sbjct: 150 VWGLIEEMRKTNPELIEPELFVVLMRRFASANMVKKAVEVLDEMPKYGLEPDEYVFGCLL 209 Query: 805 DALCKNGSVKEAALLFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEP 984 DALCKNGSVKEA+ +FEDMR KF P +++FTSLLYGWC+EGKLMEAK VLV+M+EAG EP Sbjct: 210 
DALCKNGSVKEASKVFEDMREKFPPNLRYFTSLLYGWCREGKLMEAKEVLVQMKEAGLEP 269 Query: 985 DIVVYNNLLNGYAVAGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALC-AQNKMEEAMRV 1161 DIVV+ NLL+GYA AGKM DA+ L+ +M+ +G EPN +T+++QALC + +M+EAMRV Sbjct: 270 DIVVFTNLLSGYAHAGKMADAYDLMNDMRKRGFEPNVNCYTVLIQALCRTEKRMDEAMRV 329 Query: 1162 FSEMERSGCEADVVTYTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXX 1341 F EMER GCEAD+VTYT LISGFCKWG I++GY +LD M +KG P++ +Y+ I++AH Sbjct: 330 FVEMERYGCEADIVTYTALISGFCKWGMIDKGYSVLDDMRKKGVMPSQVTYMQIMVAHEK 389 Query: 1342 XXXXXXXXXXXXXMQKIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDT 1521 M++ G PDL IYN VIRLACKLGE+KEA+R W ++E NG+SPGVDT Sbjct: 390 KEQFEECLELIEKMKRRGCHPDLLIYNVVIRLACKLGEVKEAVRLWNEMEANGLSPGVDT 449 Query: 1522 HVILINGLVEQGCLVEACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSC 1701 VI+ING QG L+EAC++FKEMV RG+ SAPQYGTLK LLN+L+R DKLEM+K+VWSC Sbjct: 450 FVIMINGFTSQGFLIEACNHFKEMVSRGIFSAPQYGTLKSLLNNLVRDDKLEMAKDVWSC 509 Query: 1702 I--MTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKL 1875 I T C+LNV AWTIWIHAL++ GHVKEACSYCLDMM+ +MPQP+T+AKLM+GL KL Sbjct: 510 ISNKTSSCELNVSAWTIWIHALYAKGHVKEACSYCLDMMEMDLMPQPNTYAKLMKGLNKL 569 Query: 1876 YNRQFAAEITEKVRKMA 1926 YNR AAEITEKV KMA Sbjct: 570 YNRTIAAEITEKVVKMA 586 >emb|CAB66911.1| putative protein [Arabidopsis thaliana] Length = 1184 Score = 756 bits (1953), Expect = 0.0 Identities = 369/557 (66%), Positives = 444/557 (79%), Gaps = 3/557 (0%) Frame = +1 Query: 265 NCFERIAEARRGLDLIRIRTEPEPSSDSQTQDEFTADVEKVYRILRKFHSRIPKLELALQ 444 N F E + G+ L+ PE + +DEF +VEK+YRILR HSR+PKLELAL Sbjct: 39 NDFVESTERKNGVGLVC----PE-----KHEDEFAGEVEKIYRILRNHHSRVPKLELALN 89 Query: 445 ESGIVVRSGLTERVLNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGA 624 ESGI +R GL RVL+RCGDAGNLGYRFF+WA+KQPGY HSY+V K+M+ IL KMRQFGA Sbjct: 90 ESGIDLRPGLIIRVLSRCGDAGNLGYRFFLWATKQPGYFHSYEVCKSMVMILSKMRQFGA 149 Query: 625 VWALIEEMRKENPHLLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLL 804 VW LIEEMRK NP L+ PE+FVVLMRRFASA MVKKA+EVLDEMPKYG EPDEYVFGCLL Sbjct: 150 
VWGLIEEMRKTNPELIEPELFVVLMRRFASANMVKKAVEVLDEMPKYGLEPDEYVFGCLL 209 Query: 805 DALCKNGSVKEAALLFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEP 984 DALCKNGSVKEA+ +FEDMR KF P +++FTSLLYGWC+EGKLMEAK VLV+M+EAG EP Sbjct: 210 DALCKNGSVKEASKVFEDMREKFPPNLRYFTSLLYGWCREGKLMEAKEVLVQMKEAGLEP 269 Query: 985 DIVVYNNLLNGYAVAGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALC-AQNKMEEAMRV 1161 DIVV+ NLL+GYA AGKM DA+ L+ +M+ +G EPN +T+++QALC + +M+EAMRV Sbjct: 270 DIVVFTNLLSGYAHAGKMADAYDLMNDMRKRGFEPNVNCYTVLIQALCRTEKRMDEAMRV 329 Query: 1162 FSEMERSGCEADVVTYTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXX 1341 F EMER GCEAD+VTYT LISGFCKWG I++GY +LD M +KG P++ +Y+ I++AH Sbjct: 330 FVEMERYGCEADIVTYTALISGFCKWGMIDKGYSVLDDMRKKGVMPSQVTYMQIMVAHEK 389 Query: 1342 XXXXXXXXXXXXXMQKIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDT 1521 M++ G PDL IYN VIRLACKLGE+KEA+R W ++E NG+SPGVDT Sbjct: 390 KEQFEECLELIEKMKRRGCHPDLLIYNVVIRLACKLGEVKEAVRLWNEMEANGLSPGVDT 449 Query: 1522 HVILINGLVEQGCLVEACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSC 1701 VI+ING QG L+EAC++FKEMV RG+ SAPQYGTLK LLN+L+R DKLEM+K+VWSC Sbjct: 450 FVIMINGFTSQGFLIEACNHFKEMVSRGIFSAPQYGTLKSLLNNLVRDDKLEMAKDVWSC 509 Query: 1702 I--MTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKL 1875 I T C+LNV AWTIWIHAL++ GHVKEACSYCLDMM+ +MPQP+T+AKLM+GL KL Sbjct: 510 ISNKTSSCELNVSAWTIWIHALYAKGHVKEACSYCLDMMEMDLMPQPNTYAKLMKGLNKL 569 Query: 1876 YNRQFAAEITEKVRKMA 1926 YNR AAEITEKV KMA Sbjct: 570 YNRTIAAEITEKVVKMA 586 >ref|XP_006292382.1| hypothetical protein CARUB_v10018595mg [Capsella rubella] gi|482561089|gb|EOA25280.1| hypothetical protein CARUB_v10018595mg [Capsella rubella] Length = 639 Score = 756 bits (1951), Expect = 0.0 Identities = 360/527 (68%), Positives = 434/527 (82%), Gaps = 3/527 (0%) Frame = +1 Query: 355 QDEFTADVEKVYRILRKFHSRIPKLELALQESGIVVRSGLTERVLNRCGDAGNLGYRFFI 534 +DEF DV+K+YRILR +HSR+PKLELAL ES I +R GL RVL+RCGDAGNLGYRFF+ Sbjct: 61 EDEFAGDVDKIYRILRNYHSRVPKLELALNESSIDLRPGLIVRVLSRCGDAGNLGYRFFL 120 Query: 535 WASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALIEEMRKENPHLLSPEVFVVLMRRFAS 714 
WA+KQPGY HSY+V K+M+K+L KMRQFGAVW LIEEMRKENP L+ PE+FV+LMRRFAS Sbjct: 121 WAAKQPGYCHSYEVCKSMVKVLSKMRQFGAVWGLIEEMRKENPELIEPELFVILMRRFAS 180 Query: 715 ARMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAALLFEDMRLKFIPTIKHF 894 A MVKKA+EVLDEMPKYG EPDEYVFGCLLDALCKNGSVK+A+ LFEDM+ K+ P +++F Sbjct: 181 ANMVKKAVEVLDEMPKYGLEPDEYVFGCLLDALCKNGSVKDASKLFEDMKEKYPPNLRYF 240 Query: 895 TSLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVYNNLLNGYAVAGKMVDAFVLLQEMKS 1074 TSLLYGWC+EGKLMEAK VLV+M+EAG EPDIVV+ NLL+GYA AGKM DA+ L+++M+ Sbjct: 241 TSLLYGWCREGKLMEAKEVLVQMKEAGLEPDIVVFTNLLSGYAHAGKMADAYDLMKDMRK 300 Query: 1075 KGCEPNATSFTIVVQALC-AQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGEIN 1251 +G EPNA +T+++QALC + +M+EAMRVF EMER GCEAD+VTYT LISGFCKW I+ Sbjct: 301 RGYEPNANCYTVLIQALCKTEKRMDEAMRVFVEMERYGCEADIVTYTALISGFCKWEMID 360 Query: 1252 RGYELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIYNTVI 1431 +GY +LD M +KG P++ +Y+ I++AH M++IG DL IYN VI Sbjct: 361 KGYSVLDDMRKKGVIPSQVTYMQIMVAHEKKEQFEECLDLIEKMKQIGCQLDLLIYNVVI 420 Query: 1432 RLACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILINGLVEQGCLVEACDYFKEMVYRGLL 1611 RLACKLGE+KEA+R W ++E NG+SPGVDT VI+ING QGCLVEAC++FKEMV RG+ Sbjct: 421 RLACKLGEVKEAVRLWNEMEANGLSPGVDTFVIMINGFTSQGCLVEACNHFKEMVSRGIF 480 Query: 1612 SAPQYGTLKDLLNSLLRSDKLEMSKEVWSCIMTK--GCDLNVYAWTIWIHALFSNGHVKE 1785 SAPQYGTLK LLN+L+R +KLEM+K+VWSCI K C+LNV AWTIWIHAL + GHVKE Sbjct: 481 SAPQYGTLKLLLNNLVRDEKLEMAKDVWSCISNKSSSCELNVSAWTIWIHALLAKGHVKE 540 Query: 1786 ACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFAAEITEKVRKMA 1926 ACSYCLDMM +MPQPDT+ KLM+GL KLYNR AAEITEKV KMA Sbjct: 541 ACSYCLDMMKMDLMPQPDTYVKLMKGLNKLYNRTIAAEITEKVMKMA 587