BLASTX nr result
ID: Catharanthus22_contig00015571
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00015571 (2166 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002281719.2| PREDICTED: pentatricopeptide repeat-containi... 840 0.0 ref|XP_006475772.1| PREDICTED: pentatricopeptide repeat-containi... 830 0.0 ref|XP_006451021.1| hypothetical protein CICLE_v10007671mg [Citr... 828 0.0 ref|XP_004146349.1| PREDICTED: pentatricopeptide repeat-containi... 825 0.0 ref|XP_004167840.1| PREDICTED: pentatricopeptide repeat-containi... 822 0.0 ref|XP_006351154.1| PREDICTED: pentatricopeptide repeat-containi... 818 0.0 ref|XP_002529360.1| pentatricopeptide repeat-containing protein,... 817 0.0 ref|XP_004250591.1| PREDICTED: pentatricopeptide repeat-containi... 810 0.0 ref|XP_004288819.1| PREDICTED: pentatricopeptide repeat-containi... 791 0.0 gb|EOY30938.1| Pentatricopeptide repeat-containing protein, puta... 790 0.0 ref|XP_006583750.1| PREDICTED: pentatricopeptide repeat-containi... 788 0.0 ref|XP_003520007.1| PREDICTED: pentatricopeptide repeat-containi... 786 0.0 ref|XP_006385618.1| hypothetical protein POPTR_0003s08690g [Popu... 785 0.0 gb|ESW25189.1| hypothetical protein PHAVU_003G014900g [Phaseolus... 783 0.0 ref|XP_004513004.1| PREDICTED: pentatricopeptide repeat-containi... 765 0.0 gb|EXB76274.1| hypothetical protein L484_025631 [Morus notabilis] 758 0.0 gb|ABD96949.1| hypothetical protein [Cleome spinosa] 723 0.0 gb|EPS73045.1| hypothetical protein M569_01711 [Genlisea aurea] 686 0.0 gb|EMJ05120.1| hypothetical protein PRUPE_ppa015065mg [Prunus pe... 668 0.0 ref|XP_006415328.1| hypothetical protein EUTSA_v10007222mg [Eutr... 632 e-178 >ref|XP_002281719.2| PREDICTED: pentatricopeptide repeat-containing protein At1g31430-like [Vitis vinifera] Length = 662 Score = 840 bits (2171), Expect = 0.0 Identities = 403/615 (65%), Positives = 508/615 (82%) Frame = +2 Query: 8 LAKAIHYSNTASSETTHLTKKACIDLLKTCKCMKQLKEIQAQVFILGLHQNIDALHKLIA 187 L+K +H S ++ H TKK+CI LLK CK M+ LK+IQ Q+ G HQ+ D L+K + Sbjct: 18 LSKPLHLSTSS-----HFTKKSCIFLLKNCKSMQHLKQIQTQILRTGFHQSGDTLNKFMV 72 Query: 188 FTVDPVLGNLCYAQKIFNQIDKPSLFTYNVMIKSYTKMGSFRKALCLFDKLRENGLWPDN 367 DP +GNL YA++IFN ID P LF YN++IK++TK GSFRKA+ LF +LRE GL PDN Sbjct: 73 CCTDPSIGNLHYAERIFNYIDIPGLFIYNLVIKAFTKNGSFRKAVLLFRQLREEGLSPDN 132 Query: 368 YTYPFVFKAVVGLRTVKDGETIHGFVLKSGNSFDCYIFNSIMDMYSELGCIDSSVKVFDE 547 +TYPFVFKA+ L V++GE ++GFV+KSG FD Y+ NS+MDMY+E+G + + +VF+E Sbjct: 133 FTYPFVFKAIGCLGEVREGEKVYGFVVKSGLEFDTYVCNSLMDMYAEVGRVQNLRQVFEE 192 Query: 548 LPERDSVSWNILISGFLRCNRLDDAVGVFMNMRREGKSKPDEATVVSTLSACSALKNLEL 727 +P+RD VSWN+LISG+++C R +DAV VF M+++ +P+EATVVSTLSAC ALK LEL Sbjct: 193 MPQRDVVSWNVLISGYVKCRRYEDAVDVFRRMQQQSSLRPNEATVVSTLSACIALKMLEL 252 Query: 728 GKEIHEYVSTELQFTVRIGNALLDMYSKCGCLIIARQIFDTMPQKNVICWTSMVSGYVSC 907 GKEIH YV +L FT++IGNAL+DMY KCG L IAR+IF+ MP K VICWTSMVSGYV+C Sbjct: 253 GKEIHRYVREQLGFTIKIGNALVDMYCKCGHLSIAREIFNDMPIKTVICWTSMVSGYVNC 312 Query: 908 GLLDKARDLFDRSPVRDLVLWTAMINGYVQFNNVDEAIALFRKMQMERISPDKFTLVALL 1087 G LD+AR+LF+RSPVRD+VLWTAMINGYVQFN D+A+ALFR+MQ++R+SPD+FTLVALL Sbjct: 313 GQLDEARELFERSPVRDVVLWTAMINGYVQFNRFDDAVALFREMQIKRVSPDRFTLVALL 372 Query: 1088 TGCAQLGSLEQGEWIHRYIEENRIPVDAVVGTALIEMYAKCGWVGKSLEVFDAIKEKDAA 1267 TGCAQLG+LEQG+WIH YI+EN+I +DAVVGTALIEMYAKCG++ KSLE+F+ +KEKD A Sbjct: 373 TGCAQLGTLEQGKWIHGYIDENKIMIDAVVGTALIEMYAKCGFIEKSLEIFNGLKEKDTA 432 Query: 1268 SWTSIISALAMNGKTSKALELFSKLEQAGIRPDDITFIGVLSACSHGGLVEEGREYFNSM 1447 SWTSII LAMNGKTSKALELF+++ Q G++PDDITFIGVLSACSHGGLVEEGR++F SM Sbjct: 433 SWTSIICGLAMNGKTSKALELFAEMVQTGVKPDDITFIGVLSACSHGGLVEEGRKHFRSM 492 Query: 1448 KNIYQIEPKLEHYGCLIDLFGRAGLLKEAEKMIEKIPNKDEKIVIPLYGALLSACRTYGD 1627 +YQIEPKLEHYGCLIDL GRAG L EAE++IEK PN + ++++PLYGALLSACRT+G+ Sbjct: 493 TAVYQIEPKLEHYGCLIDLLGRAGQLDEAEELIEKSPNVNNEVIVPLYGALLSACRTHGN 552 Query: 1628 VDMGERIAKLLMQMESSDASIHSLLANIYASASRWEDMRKVRRKMRGLRIKKSPGCSSIE 1807 V+MGER+AK L+ +ES D+S+H+LLANIYASA RWED+ KVRRKM+ L +KK PGCSS+E Sbjct: 553 VEMGERVAKRLVGIESGDSSVHTLLANIYASADRWEDVTKVRRKMKDLGVKKVPGCSSVE 612 Query: 1808 VDSDTQEFNASNAFH 1852 V+ EF +A H Sbjct: 613 VNGIVHEFLVGDASH 627 >ref|XP_006475772.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31430-like [Citrus sinensis] Length = 663 Score = 830 bits (2144), Expect = 0.0 Identities = 404/610 (66%), Positives = 490/610 (80%) Frame = +2 Query: 41 SSETTHLTKKACIDLLKTCKCMKQLKEIQAQVFILGLHQNIDALHKLIAFTVDPVLGNLC 220 SS T+ LTKK+CI LLK CK M QLK+IQAQ+F +GL QN + L+KL+ F P GNL Sbjct: 24 SSHTSTLTKKSCIYLLKNCKSMTQLKQIQAQIFQIGLQQNPETLNKLMVFCTHPSHGNLL 83 Query: 221 YAQKIFNQIDKPSLFTYNVMIKSYTKMGSFRKALCLFDKLRENGLWPDNYTYPFVFKAVV 400 YA+KIF I P L YN++IK++ K GSFRK+L LF KLRE G+ PDN+TYPFVFKAV Sbjct: 84 YAEKIFGSIQSPCLLAYNLLIKAFAKKGSFRKSLLLFSKLRERGVSPDNFTYPFVFKAVG 143 Query: 401 GLRTVKDGETIHGFVLKSGNSFDCYIFNSIMDMYSELGCIDSSVKVFDELPERDSVSWNI 580 L VK GE +HG+V+K+G FD Y+ NSIMDMY LG I + K+FDE+P++D VSWN+ Sbjct: 144 WLGEVKKGEKVHGYVVKTGLEFDTYVCNSIMDMYGVLGKICNVKKLFDEMPDKDVVSWNV 203 Query: 581 LISGFLRCNRLDDAVGVFMNMRREGKSKPDEATVVSTLSACSALKNLELGKEIHEYVSTE 760 ISG ++C R +DAV VF MR+ PDE TVVSTLSAC+ALKNLELGKEIH Y++ E Sbjct: 204 SISGHVKCMRFEDAVDVFRRMRQGCNLMPDEGTVVSTLSACTALKNLELGKEIHRYINQE 263 Query: 761 LQFTVRIGNALLDMYSKCGCLIIARQIFDTMPQKNVICWTSMVSGYVSCGLLDKARDLFD 940 L+FT +GNALLDMY KCGCL AR++FD MP KNVICWTSMVSGYV+CG L+KARDLFD Sbjct: 264 LEFTPIMGNALLDMYCKCGCLSEARELFDEMPNKNVICWTSMVSGYVNCGQLEKARDLFD 323 Query: 941 RSPVRDLVLWTAMINGYVQFNNVDEAIALFRKMQMERISPDKFTLVALLTGCAQLGSLEQ 1120 RSPVRD+VLWTAMINGYVQFN DEA+ALFR+MQ+ R+ PDKF LVALLTGCAQLG+LEQ Sbjct: 324 RSPVRDIVLWTAMINGYVQFNRFDEAVALFREMQIIRLKPDKFILVALLTGCAQLGALEQ 383 Query: 1121 GEWIHRYIEENRIPVDAVVGTALIEMYAKCGWVGKSLEVFDAIKEKDAASWTSIISALAM 1300 G+WIH YI ENRI VDAVV TALIEMYAKCG + K+LE+F ++EKDAASWTSII LAM Sbjct: 384 GKWIHGYINENRITVDAVVATALIEMYAKCGLIEKALEIFYELREKDAASWTSIICGLAM 443 Query: 1301 NGKTSKALELFSKLEQAGIRPDDITFIGVLSACSHGGLVEEGREYFNSMKNIYQIEPKLE 1480 NGK +KALELFS++ G +PDDITFIGVLSACSHGGLV+EGR +FN+M +YQI+PKLE Sbjct: 444 NGKINKALELFSQMISGGAKPDDITFIGVLSACSHGGLVDEGRRFFNTMTEVYQIQPKLE 503 Query: 1481 HYGCLIDLFGRAGLLKEAEKMIEKIPNKDEKIVIPLYGALLSACRTYGDVDMGERIAKLL 1660 HYGCLIDL GRAGLL EAE++I KIPN++ +I++PLYGALLSACR YG+VDMGE++A LL Sbjct: 504 HYGCLIDLLGRAGLLDEAEELIRKIPNENNEIIVPLYGALLSACRIYGNVDMGEKLAALL 563 Query: 1661 MQMESSDASIHSLLANIYASASRWEDMRKVRRKMRGLRIKKSPGCSSIEVDSDTQEFNAS 1840 ++ES D+S H+LLANIYASA+RWED+ VR+KM+ + ++K PGCSSIE++ EF Sbjct: 564 EKIESKDSSFHTLLANIYASANRWEDVTNVRQKMKEMGVRKVPGCSSIEINGIIHEFLVG 623 Query: 1841 NAFHRGRERI 1870 + H + I Sbjct: 624 DPSHSEMKEI 633 >ref|XP_006451021.1| hypothetical protein CICLE_v10007671mg [Citrus clementina] gi|557554247|gb|ESR64261.1| hypothetical protein CICLE_v10007671mg [Citrus clementina] Length = 663 Score = 828 bits (2138), Expect = 0.0 Identities = 403/610 (66%), Positives = 490/610 (80%) Frame = +2 Query: 41 SSETTHLTKKACIDLLKTCKCMKQLKEIQAQVFILGLHQNIDALHKLIAFTVDPVLGNLC 220 SS T+ LTKK+CI LLK CK + QLK+IQAQ+F +GL QN + L+KL+ F P GNL Sbjct: 24 SSHTSTLTKKSCIYLLKNCKSITQLKQIQAQIFQIGLQQNPETLNKLMVFCTQPSHGNLL 83 Query: 221 YAQKIFNQIDKPSLFTYNVMIKSYTKMGSFRKALCLFDKLRENGLWPDNYTYPFVFKAVV 400 YA+KIF I P L YN++IK++ K GSFRK+L LF KLRE G+ PDN+TYPFVFKAV Sbjct: 84 YAEKIFGSIQSPCLLAYNLLIKAFAKKGSFRKSLLLFSKLRERGVSPDNFTYPFVFKAVG 143 Query: 401 GLRTVKDGETIHGFVLKSGNSFDCYIFNSIMDMYSELGCIDSSVKVFDELPERDSVSWNI 580 L VK GE +HG+V+K+G FD Y+ NSIMDMY+ LG I + K+FDE+P++D VSWN+ Sbjct: 144 CLGEVKKGEKVHGYVVKTGLEFDTYVCNSIMDMYAVLGKICNVKKLFDEMPDKDVVSWNV 203 Query: 581 LISGFLRCNRLDDAVGVFMNMRREGKSKPDEATVVSTLSACSALKNLELGKEIHEYVSTE 760 ISG ++C R +DAV VF MR+ PDE TVVSTLSAC+ALKNLELGKEIH Y++ E Sbjct: 204 SISGHVKCMRFEDAVDVFRRMRQGCNLMPDEGTVVSTLSACTALKNLELGKEIHRYINQE 263 Query: 761 LQFTVRIGNALLDMYSKCGCLIIARQIFDTMPQKNVICWTSMVSGYVSCGLLDKARDLFD 940 L+FT +GNALLDMY KCGCL AR++FD MP KNVICWTSMVSGYV+CG L+KARDLFD Sbjct: 264 LEFTPIMGNALLDMYCKCGCLSEARELFDEMPNKNVICWTSMVSGYVNCGQLEKARDLFD 323 Query: 941 RSPVRDLVLWTAMINGYVQFNNVDEAIALFRKMQMERISPDKFTLVALLTGCAQLGSLEQ 1120 RSPVRD+VLWTAMINGYVQFN DEA+ALFR+MQ+ R+ PDKF LVALLTGCAQLG+LEQ Sbjct: 324 RSPVRDIVLWTAMINGYVQFNRFDEAVALFREMQIIRLKPDKFILVALLTGCAQLGALEQ 383 Query: 1121 GEWIHRYIEENRIPVDAVVGTALIEMYAKCGWVGKSLEVFDAIKEKDAASWTSIISALAM 1300 G+WIH YI ENRI VDAVV TALIEMYAKCG + K+LE+F ++EKDAASWTSII LAM Sbjct: 384 GKWIHGYINENRITVDAVVATALIEMYAKCGLIEKALEIFYELREKDAASWTSIICGLAM 443 Query: 1301 NGKTSKALELFSKLEQAGIRPDDITFIGVLSACSHGGLVEEGREYFNSMKNIYQIEPKLE 1480 NGK +KALELFS++ G +PDDITFIGVLSACSHGGLV+EGR +FN+M +YQI+PKLE Sbjct: 444 NGKINKALELFSQMISGGAKPDDITFIGVLSACSHGGLVDEGRRFFNTMTEVYQIQPKLE 503 Query: 1481 HYGCLIDLFGRAGLLKEAEKMIEKIPNKDEKIVIPLYGALLSACRTYGDVDMGERIAKLL 1660 HYGCLIDL GRAGLL EAE+ I KIPN++ +I++PLYGALLSACR YG+VDMGE++A LL Sbjct: 504 HYGCLIDLLGRAGLLDEAEEWIRKIPNENNEIIVPLYGALLSACRIYGNVDMGEKLAALL 563 Query: 1661 MQMESSDASIHSLLANIYASASRWEDMRKVRRKMRGLRIKKSPGCSSIEVDSDTQEFNAS 1840 ++ES D+S H+LLANIYASA+RWED+ VR+KM+ + ++K PGCSSIE++ EF Sbjct: 564 EKIESKDSSFHTLLANIYASANRWEDVTNVRQKMKEMGVRKVPGCSSIEINGIIHEFLVG 623 Query: 1841 NAFHRGRERI 1870 + H + I Sbjct: 624 DPSHSEMKEI 633 >ref|XP_004146349.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31430-like [Cucumis sativus] Length = 781 Score = 825 bits (2130), Expect = 0.0 Identities = 391/598 (65%), Positives = 488/598 (81%) Frame = +2 Query: 59 LTKKACIDLLKTCKCMKQLKEIQAQVFILGLHQNIDALHKLIAFTVDPVLGNLCYAQKIF 238 LTKK+CI+ L+ CK M QLK+IQ+Q+F +GL + D ++KL+AF D LGNL YA+KIF Sbjct: 141 LTKKSCIECLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCADSSLGNLRYAEKIF 200 Query: 239 NQIDKPSLFTYNVMIKSYTKMGSFRKALCLFDKLRENGLWPDNYTYPFVFKAVVGLRTVK 418 N + PSLF YNVM+K Y K G RK L LF +LRE+GLWPD +TYPFV KA+ LR V+ Sbjct: 201 NYVQDPSLFVYNVMVKMYAKRGILRKVLLLFQQLREDGLWPDGFTYPFVLKAIGCLRDVR 260 Query: 419 DGETIHGFVLKSGNSFDCYIFNSIMDMYSELGCIDSSVKVFDELPERDSVSWNILISGFL 598 GE + GF++K+G D Y++NS++DMY EL ++++ K+FDE+ RDSVSWN++ISG++ Sbjct: 261 QGEKVRGFIVKTGMDLDNYVYNSLIDMYYELSNVENAKKLFDEMTTRDSVSWNVMISGYV 320 Query: 599 RCNRLDDAVGVFMNMRREGKSKPDEATVVSTLSACSALKNLELGKEIHEYVSTELQFTVR 778 RC R +DA+ F M++EG KPDEATVVSTLSAC+ALKNLELG EIH YV EL FT R Sbjct: 321 RCRRFEDAINTFREMQQEGNEKPDEATVVSTLSACTALKNLELGDEIHNYVRKELGFTTR 380 Query: 779 IGNALLDMYSKCGCLIIARQIFDTMPQKNVICWTSMVSGYVSCGLLDKARDLFDRSPVRD 958 I NALLDMY+KCGCL IAR IFD M KNVICWTSM+SGY++CG L +ARDLFD+SPVRD Sbjct: 381 IDNALLDMYAKCGCLNIARNIFDEMSMKNVICWTSMISGYINCGDLREARDLFDKSPVRD 440 Query: 959 LVLWTAMINGYVQFNNVDEAIALFRKMQMERISPDKFTLVALLTGCAQLGSLEQGEWIHR 1138 +VLWTAMINGYVQF++ D+A+ALFR+MQ++R+ PDKFT+V LLTGCAQLG+LEQG+WIH Sbjct: 441 VVLWTAMINGYVQFHHFDDAVALFREMQIQRVKPDKFTVVTLLTGCAQLGALEQGKWIHG 500 Query: 1139 YIEENRIPVDAVVGTALIEMYAKCGWVGKSLEVFDAIKEKDAASWTSIISALAMNGKTSK 1318 Y++ENRI +D VVGTALIEMY+KCG V KSLE+F +++KD ASWTSII LAMNGKTS+ Sbjct: 501 YLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTSE 560 Query: 1319 ALELFSKLEQAGIRPDDITFIGVLSACSHGGLVEEGREYFNSMKNIYQIEPKLEHYGCLI 1498 AL LFS++E+ G +PDDITFIGVLSACSHGGLVEEGR +FNSMK +++IEPK+EHYGC+I Sbjct: 561 ALRLFSEMERVGAKPDDITFIGVLSACSHGGLVEEGRRFFNSMKKVHRIEPKVEHYGCVI 620 Query: 1499 DLFGRAGLLKEAEKMIEKIPNKDEKIVIPLYGALLSACRTYGDVDMGERIAKLLMQMESS 1678 DL GRAGLL EAE++I++IP ++ +IV+PLYGALLSACR + +VDMGER+AK L +ES Sbjct: 621 DLLGRAGLLDEAEELIQEIPIENCEIVVPLYGALLSACRIHNNVDMGERLAKKLENIESC 680 Query: 1679 DASIHSLLANIYASASRWEDMRKVRRKMRGLRIKKSPGCSSIEVDSDTQEFNASNAFH 1852 D+SIH+LLANIYAS RWED +KVRRKM+ L +KK PGCS IEVD EF + H Sbjct: 681 DSSIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSLIEVDGIVHEFLVGDPSH 738 >ref|XP_004167840.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31430-like, partial [Cucumis sativus] Length = 735 Score = 822 bits (2124), Expect = 0.0 Identities = 390/607 (64%), Positives = 491/607 (80%) Frame = +2 Query: 32 NTASSETTHLTKKACIDLLKTCKCMKQLKEIQAQVFILGLHQNIDALHKLIAFTVDPVLG 211 +++S+ KK+CI+ L+ CK M QLK+IQ+Q+F +GL + D ++KL+AF D LG Sbjct: 86 SSSSASNLQTNKKSCIECLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCADSSLG 145 Query: 212 NLCYAQKIFNQIDKPSLFTYNVMIKSYTKMGSFRKALCLFDKLRENGLWPDNYTYPFVFK 391 NL YA+KIFN + PSLF YNVM+K Y K G RK L LF +LRE+GLWPD +TYPFV K Sbjct: 146 NLRYAEKIFNYVQDPSLFVYNVMVKIYAKRGILRKVLLLFQQLREDGLWPDGFTYPFVLK 205 Query: 392 AVVGLRTVKDGETIHGFVLKSGNSFDCYIFNSIMDMYSELGCIDSSVKVFDELPERDSVS 571 A+ LR V+ GE + GF++K+G D Y++NS++DMY EL ++++ K+FDE+ RDSVS Sbjct: 206 AIGCLRDVRQGEKVRGFIVKTGMDLDNYVYNSLIDMYYELSNVENAKKLFDEMTTRDSVS 265 Query: 572 WNILISGFLRCNRLDDAVGVFMNMRREGKSKPDEATVVSTLSACSALKNLELGKEIHEYV 751 WN++ISG++RC R +DA+ F M++EG KPDEATVVSTLSAC+ALKNLELG EIH YV Sbjct: 266 WNVMISGYVRCRRFEDAINTFREMQQEGNEKPDEATVVSTLSACTALKNLELGDEIHNYV 325 Query: 752 STELQFTVRIGNALLDMYSKCGCLIIARQIFDTMPQKNVICWTSMVSGYVSCGLLDKARD 931 EL FT RI NALLDMY+KCGCL IAR IFD M KNVICWTSM+SGY++CG L +ARD Sbjct: 326 RKELGFTTRIDNALLDMYAKCGCLNIARNIFDEMSMKNVICWTSMISGYINCGDLREARD 385 Query: 932 LFDRSPVRDLVLWTAMINGYVQFNNVDEAIALFRKMQMERISPDKFTLVALLTGCAQLGS 1111 LFD+SPVRD+VLWTAMINGYVQF++ D+A+ALFR+MQ+++I PDKFT+V LLTGCAQLG+ Sbjct: 386 LFDKSPVRDVVLWTAMINGYVQFHHFDDAVALFREMQIQKIKPDKFTVVTLLTGCAQLGA 445 Query: 1112 LEQGEWIHRYIEENRIPVDAVVGTALIEMYAKCGWVGKSLEVFDAIKEKDAASWTSIISA 1291 LEQG+WIH Y++ENRI +D VVGTALIEMY+KCG V KSLE+F +++KD ASWTSII Sbjct: 446 LEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICG 505 Query: 1292 LAMNGKTSKALELFSKLEQAGIRPDDITFIGVLSACSHGGLVEEGREYFNSMKNIYQIEP 1471 LAMNGKTS+AL LFS++E+ G +PDDITFIGVLSACSHGGLVEEGR +FNSMK +++IEP Sbjct: 506 LAMNGKTSEALRLFSEMERVGAKPDDITFIGVLSACSHGGLVEEGRRFFNSMKKVHRIEP 565 Query: 1472 KLEHYGCLIDLFGRAGLLKEAEKMIEKIPNKDEKIVIPLYGALLSACRTYGDVDMGERIA 1651 K+EHYGC+IDL GRAGLL EAE++I++IP ++ +IV+PLYGALLSACR + +VDMGER+A Sbjct: 566 KVEHYGCVIDLLGRAGLLDEAEELIQEIPIENCEIVVPLYGALLSACRIHNNVDMGERLA 625 Query: 1652 KLLMQMESSDASIHSLLANIYASASRWEDMRKVRRKMRGLRIKKSPGCSSIEVDSDTQEF 1831 K L +ES D+SIH+LLANIYAS RWED +KVRRKM+ L +KK PGCS IEVD EF Sbjct: 626 KKLENIESCDSSIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSLIEVDGIVHEF 685 Query: 1832 NASNAFH 1852 + H Sbjct: 686 LVGDPSH 692 >ref|XP_006351154.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31430-like [Solanum tuberosum] Length = 604 Score = 818 bits (2114), Expect = 0.0 Identities = 399/591 (67%), Positives = 485/591 (82%) Frame = +2 Query: 59 LTKKACIDLLKTCKCMKQLKEIQAQVFILGLHQNIDALHKLIAFTVDPVLGNLCYAQKIF 238 L K C +LLKTCK + +LK+I AQV IL H++I LHKL+AFT + YA+KIF Sbjct: 11 LDTKTCFELLKTCKSITKLKQIHAQVIILNFHKHIGILHKLLAFTTHDDT-DFNYAKKIF 69 Query: 239 NQIDKPSLFTYNVMIKSYTKMGSFRKALCLFDKLRENGLWPDNYTYPFVFKAVVGLRTVK 418 + + +LF YNVMIK Y K G F+K L LFD+LR +GL+PDN+TYPFVFKA+ L+ VK Sbjct: 70 SCCENRTLFMYNVMIKGYVKTGQFKKPLFLFDELRIHGLFPDNFTYPFVFKAIGELKMVK 129 Query: 419 DGETIHGFVLKSGNSFDCYIFNSIMDMYSELGCIDSSVKVFDELPERDSVSWNILISGFL 598 GE IHG+VLKSG FD Y+ NS+MDMY G ++S KVFDE+P+RDSV+WNILISGF+ Sbjct: 130 GGEKIHGYVLKSGVLFDNYVGNSVMDMYGLFGYVESLNKVFDEMPQRDSVAWNILISGFV 189 Query: 599 RCNRLDDAVGVFMNMRREGKSKPDEATVVSTLSACSALKNLELGKEIHEYVSTELQFTVR 778 RC R DAV V+ MR E ++PDEATVVSTLSAC+ALK+LELG+EIH YV EL+F++ Sbjct: 190 RCGRFRDAVVVYKKMREENGARPDEATVVSTLSACAALKSLELGREIHGYVVEELEFSLI 249 Query: 779 IGNALLDMYSKCGCLIIARQIFDTMPQKNVICWTSMVSGYVSCGLLDKARDLFDRSPVRD 958 IGNAL+DMY KCGCL++AR+IFD MP KNVICWTSMVSGYV+ G LD+AR LF+RSPVRD Sbjct: 250 IGNALVDMYCKCGCLMVAREIFDDMPMKNVICWTSMVSGYVNSGQLDEARKLFERSPVRD 309 Query: 959 LVLWTAMINGYVQFNNVDEAIALFRKMQMERISPDKFTLVALLTGCAQLGSLEQGEWIHR 1138 LVLWT MINGYVQFN VD+A+ LFR MQM+ I PDK+TLVALLTGCAQLG+L+QGEWIH Sbjct: 310 LVLWTTMINGYVQFNRVDDAMDLFRSMQMQGIKPDKYTLVALLTGCAQLGALQQGEWIHD 369 Query: 1139 YIEENRIPVDAVVGTALIEMYAKCGWVGKSLEVFDAIKEKDAASWTSIISALAMNGKTSK 1318 Y++ENRI V AVVGTALIEMYAKCG + KS+E+FD ++EKD ASWTSII +LAM+G T K Sbjct: 370 YMKENRITVTAVVGTALIEMYAKCGCIEKSMEIFDELEEKDTASWTSIICSLAMSGNTRK 429 Query: 1319 ALELFSKLEQAGIRPDDITFIGVLSACSHGGLVEEGREYFNSMKNIYQIEPKLEHYGCLI 1498 ALELFS++EQAG PDDIT+IGVLSACSHGGLVEEGR+YF++M I+ I+PKLEHYGCLI Sbjct: 430 ALELFSEMEQAGFHPDDITYIGVLSACSHGGLVEEGRKYFHAMSRIHAIQPKLEHYGCLI 489 Query: 1499 DLFGRAGLLKEAEKMIEKIPNKDEKIVIPLYGALLSACRTYGDVDMGERIAKLLMQMESS 1678 DL GRAGLL EAE MI +IPN+D +I++P+YGALLSACR YG+VD+GER+A+LLM++ES Sbjct: 490 DLLGRAGLLSEAEVMISQIPNRDNEIIVPIYGALLSACRIYGNVDVGERVAELLMEIESY 549 Query: 1679 DASIHSLLANIYASASRWEDMRKVRRKMRGLRIKKSPGCSSIEVDSDTQEF 1831 D+S H+LLAN YASA RWED+ KVR MR L +KKSPGCSSI++ + EF Sbjct: 550 DSSTHTLLANTYASAGRWEDVLKVRGTMRDLGVKKSPGCSSIDISGNVHEF 600 >ref|XP_002529360.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223531180|gb|EEF33027.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 683 Score = 817 bits (2111), Expect = 0.0 Identities = 383/607 (63%), Positives = 496/607 (81%) Frame = +2 Query: 32 NTASSETTHLTKKACIDLLKTCKCMKQLKEIQAQVFILGLHQNIDALHKLIAFTVDPVLG 211 +T ++ TT L++++CI LK+CK M LK+I AQ+F +GLHQ+I +L+KL+AF DP G Sbjct: 26 STFTNPTTGLSQQSCISYLKSCKSMTHLKQIHAQIFRVGLHQDIVSLNKLMAFCTDPFNG 85 Query: 212 NLCYAQKIFNQIDKPSLFTYNVMIKSYTKMGSFRKALCLFDKLRENGLWPDNYTYPFVFK 391 NL YA+K+F I P L YN++IK++ K G++++ L LF KLRE+GLWPDN+TYPFVFK Sbjct: 86 NLNYAEKMFKYIRYPCLLIYNLIIKAFAKKGNYKRTLVLFSKLREDGLWPDNFTYPFVFK 145 Query: 392 AVVGLRTVKDGETIHGFVLKSGNSFDCYIFNSIMDMYSELGCIDSSVKVFDELPERDSVS 571 A+ L V E + G V K+G FD Y+ NS++DMY++L D +FDE+P+RD +S Sbjct: 146 AIGYLGEVSKAEKLRGLVTKTGLEFDTYVRNSLIDMYAQLALTDVMKMLFDEMPDRDVIS 205 Query: 572 WNILISGFLRCNRLDDAVGVFMNMRREGKSKPDEATVVSTLSACSALKNLELGKEIHEYV 751 WN++ISG+++C R +DA+ VF M+ E PDEATVVSTLSAC+ALK LELGK+IH YV Sbjct: 206 WNVMISGYVKCRRFEDAINVFCRMQEESGLMPDEATVVSTLSACTALKRLELGKKIHHYV 265 Query: 752 STELQFTVRIGNALLDMYSKCGCLIIARQIFDTMPQKNVICWTSMVSGYVSCGLLDKARD 931 ++FT IGNALLDMY KCGCL IAR +F+ MP KNVICWT+MVSGY +CG L++AR+ Sbjct: 266 RDNVKFTPIIGNALLDMYCKCGCLSIARAVFEEMPSKNVICWTTMVSGYANCGELEEARE 325 Query: 932 LFDRSPVRDLVLWTAMINGYVQFNNVDEAIALFRKMQMERISPDKFTLVALLTGCAQLGS 1111 LF+ SP+RD+V+WTAMINGYVQFN DEA+ALFR+MQ+ ++ PDKF +V+LLTGCAQ G+ Sbjct: 326 LFEGSPIRDVVIWTAMINGYVQFNRFDEAVALFREMQIRKVKPDKFIVVSLLTGCAQTGA 385 Query: 1112 LEQGEWIHRYIEENRIPVDAVVGTALIEMYAKCGWVGKSLEVFDAIKEKDAASWTSIISA 1291 +EQG+WIH +I+ENRIP+DAVVGTALIEMYAKCG++ K+LE+F ++ KD ASWTSII Sbjct: 386 IEQGKWIHEFIDENRIPIDAVVGTALIEMYAKCGFIEKALEIFYGLRVKDTASWTSIICG 445 Query: 1292 LAMNGKTSKALELFSKLEQAGIRPDDITFIGVLSACSHGGLVEEGREYFNSMKNIYQIEP 1471 LAMNGKTSKALELFSK++QAG+RPDDITFIGVLSACSHGGLVEEGR++FNSM+ YQI+P Sbjct: 446 LAMNGKTSKALELFSKMKQAGVRPDDITFIGVLSACSHGGLVEEGRKFFNSMRMEYQIKP 505 Query: 1472 KLEHYGCLIDLFGRAGLLKEAEKMIEKIPNKDEKIVIPLYGALLSACRTYGDVDMGERIA 1651 K+EHYGCL+DL GRAGLL EAE++I+KIP++++ I +PLYG+LLSACR YG+V+MGER+A Sbjct: 506 KVEHYGCLVDLLGRAGLLNEAEELIKKIPDENKAITVPLYGSLLSACRIYGNVEMGERVA 565 Query: 1652 KLLMQMESSDASIHSLLANIYASASRWEDMRKVRRKMRGLRIKKSPGCSSIEVDSDTQEF 1831 K L++ ESSD+S+H+LLANIYA A RWED+ KVRRKM+ L +KK+PGCSSIEVDS EF Sbjct: 566 KQLVKFESSDSSVHTLLANIYAFADRWEDVTKVRRKMKDLGVKKTPGCSSIEVDSIIHEF 625 Query: 1832 NASNAFH 1852 + + H Sbjct: 626 FSGHPSH 632 >ref|XP_004250591.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31430-like [Solanum lycopersicum] Length = 600 Score = 810 bits (2093), Expect = 0.0 Identities = 398/596 (66%), Positives = 481/596 (80%) Frame = +2 Query: 44 SETTHLTKKACIDLLKTCKCMKQLKEIQAQVFILGLHQNIDALHKLIAFTVDPVLGNLCY 223 S + K C +LLKTCK + +LK+I AQV IL H++I LHKL+AFT + Y Sbjct: 2 SSIRSIDTKTCFELLKTCKSITKLKQIHAQVIILNFHKHIGILHKLLAFTTHDDT-DFNY 60 Query: 224 AQKIFNQIDKPSLFTYNVMIKSYTKMGSFRKALCLFDKLRENGLWPDNYTYPFVFKAVVG 403 A+KIF+ + +LF YNVMIK Y K G F+K L LF++L+ +GL+PDN+TYPFVFKA+ Sbjct: 61 AKKIFSCCENRTLFMYNVMIKGYVKTGQFKKPLYLFNELKIHGLFPDNFTYPFVFKAIGE 120 Query: 404 LRTVKDGETIHGFVLKSGNSFDCYIFNSIMDMYSELGCIDSSVKVFDELPERDSVSWNIL 583 L+ VK GE IHG+VLKSG FD Y+ NS+MDMY G ++S KVFDE+P RDSV+WNIL Sbjct: 121 LKMVKGGEKIHGYVLKSGVLFDNYVGNSVMDMYGLFGYVESLNKVFDEMPNRDSVAWNIL 180 Query: 584 ISGFLRCNRLDDAVGVFMNMRREGKSKPDEATVVSTLSACSALKNLELGKEIHEYVSTEL 763 ISGF+RC R DAV V+ MR E KPDEATVVSTLSAC+ALK+LE+G+EIH YV EL Sbjct: 181 ISGFVRCGRFQDAVVVYKKMREENAVKPDEATVVSTLSACTALKSLEIGREIHGYVVEEL 240 Query: 764 QFTVRIGNALLDMYSKCGCLIIARQIFDTMPQKNVICWTSMVSGYVSCGLLDKARDLFDR 943 +F++ IGNAL+DMY KCGCLI+AR+IFD MP KNVICWTSMV GYV+ G LD+AR LF+R Sbjct: 241 EFSLIIGNALVDMYCKCGCLIVAREIFDDMPMKNVICWTSMVLGYVNNGQLDEARKLFER 300 Query: 944 SPVRDLVLWTAMINGYVQFNNVDEAIALFRKMQMERISPDKFTLVALLTGCAQLGSLEQG 1123 SPVRDLVLWT MINGYVQFN VD+A+ LFR MQ++ I PDK+TLVALLTGCAQLG+L+QG Sbjct: 301 SPVRDLVLWTTMINGYVQFNCVDDAMDLFRSMQIQGIKPDKYTLVALLTGCAQLGALQQG 360 Query: 1124 EWIHRYIEENRIPVDAVVGTALIEMYAKCGWVGKSLEVFDAIKEKDAASWTSIISALAMN 1303 EWIH Y++ENRI V AVVGTALIEMYAKCG + KS E+FD ++EKD ASWTSII ALAM+ Sbjct: 361 EWIHDYMKENRITVTAVVGTALIEMYAKCGCIEKSKEIFDELEEKDTASWTSIICALAMS 420 Query: 1304 GKTSKALELFSKLEQAGIRPDDITFIGVLSACSHGGLVEEGREYFNSMKNIYQIEPKLEH 1483 G T KALELFS++EQAG PDDIT+IGVLSACSHGGLVEEGR+YF++M I+ I+PKLEH Sbjct: 421 GNTRKALELFSEMEQAGFHPDDITYIGVLSACSHGGLVEEGRKYFHAMSRIHAIQPKLEH 480 Query: 1484 YGCLIDLFGRAGLLKEAEKMIEKIPNKDEKIVIPLYGALLSACRTYGDVDMGERIAKLLM 1663 YGCLIDL GRAGLL EAE MI +IPNKD KI++P+YGALLSACR YG+VD+GER+A+LLM Sbjct: 481 YGCLIDLLGRAGLLSEAEVMISQIPNKDNKIIVPIYGALLSACRIYGNVDVGERVAELLM 540 Query: 1664 QMESSDASIHSLLANIYASASRWEDMRKVRRKMRGLRIKKSPGCSSIEVDSDTQEF 1831 ++ES D+S H+LLAN YAS+ RWED KVR MR L +KKSPGCSSI ++ + EF Sbjct: 541 EIESYDSSTHTLLANTYASSGRWEDASKVRGTMRDLGVKKSPGCSSININGNVHEF 596 >ref|XP_004288819.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31430-like [Fragaria vesca subsp. vesca] Length = 633 Score = 791 bits (2042), Expect = 0.0 Identities = 376/604 (62%), Positives = 483/604 (79%) Frame = +2 Query: 59 LTKKACIDLLKTCKCMKQLKEIQAQVFILGLHQNIDALHKLIAFTVDPVLGNLCYAQKIF 238 L K +CI L+++CK MKQLK+IQ Q+FILGLHQ L+KL+ F DP L ++ YA+K+F Sbjct: 2 LNKTSCIHLIQSCKSMKQLKQIQTQMFILGLHQCKATLNKLMVFCTDPSLRDIHYAEKVF 61 Query: 239 NQIDKPSLFTYNVMIKSYTKMGSFRKALCLFDKLRENGLWPDNYTYPFVFKAVVGLRTVK 418 I P LF YNVMIK+ K F+ + LF KLRE+GLWPD++TYPF KA+ GL + Sbjct: 62 THIQSPGLFIYNVMIKALAKRKRFKSVIELFRKLREDGLWPDSFTYPFACKAIGGLGDAR 121 Query: 419 DGETIHGFVLKSGNSFDCYIFNSIMDMYSELGCIDSSVKVFDELPERDSVSWNILISGFL 598 +GE IHGF +K+G FD Y+ NS++ MY+ELG + +++ +FD +PERD VSWN++ISG + Sbjct: 122 EGEKIHGFAVKNGFRFDTYVCNSLIYMYAELGQVHNAMNLFDGIPERDVVSWNVMISGCV 181 Query: 599 RCNRLDDAVGVFMNMRREGKSKPDEATVVSTLSACSALKNLELGKEIHEYVSTELQFTVR 778 C R +AV VF MR+E KPDEATVVSTLSAC+ALK+LELGKE+H+YV EL+F+ Sbjct: 182 GCMRFREAVSVFRRMRKESNVKPDEATVVSTLSACAALKDLELGKEVHDYVRAELEFSAI 241 Query: 779 IGNALLDMYSKCGCLIIARQIFDTMPQKNVICWTSMVSGYVSCGLLDKARDLFDRSPVRD 958 IGNA LDMY+KCGCL AR++FD + +NV+CWTSMV GYV+CG+LD+AR+LFDRSP +D Sbjct: 242 IGNAALDMYAKCGCLSEARKVFDEIRVRNVMCWTSMVCGYVNCGMLDEARELFDRSPAKD 301 Query: 959 LVLWTAMINGYVQFNNVDEAIALFRKMQMERISPDKFTLVALLTGCAQLGSLEQGEWIHR 1138 +VLWTAM+NGYVQ+N DEA+ALF++MQ + DKFT+VALLTGCAQLG+LEQG+WIH+ Sbjct: 302 VVLWTAMMNGYVQYNQFDEAVALFQEMQFRGVRVDKFTIVALLTGCAQLGALEQGKWIHK 361 Query: 1139 YIEENRIPVDAVVGTALIEMYAKCGWVGKSLEVFDAIKEKDAASWTSIISALAMNGKTSK 1318 YI+E I +DA+VGTALIEM+AKCG +GKSLE+F+A+ EKDAA+WTS+I ALAMNG TSK Sbjct: 362 YIDEIGIKIDAIVGTALIEMFAKCGCIGKSLEIFNALNEKDAATWTSMICALAMNGMTSK 421 Query: 1319 ALELFSKLEQAGIRPDDITFIGVLSACSHGGLVEEGREYFNSMKNIYQIEPKLEHYGCLI 1498 ALELFSK++Q I PDDITFIGVLSACSHGGLV+EG+ F+SM+ Y +EPKLEHYGCLI Sbjct: 422 ALELFSKMKQVRINPDDITFIGVLSACSHGGLVDEGQNLFDSMRKDYGMEPKLEHYGCLI 481 Query: 1499 DLFGRAGLLKEAEKMIEKIPNKDEKIVIPLYGALLSACRTYGDVDMGERIAKLLMQMESS 1678 DL GRAGLL+EAE++I+ IP+++ KI++PLY ALL ACR +G+VDM ER+AK L +ESS Sbjct: 482 DLLGRAGLLREAEELIDSIPSENNKIIVPLYSALLGACRIHGNVDMSERVAKRLSDIESS 541 Query: 1679 DASIHSLLANIYASASRWEDMRKVRRKMRGLRIKKSPGCSSIEVDSDTQEFNASNAFHRG 1858 +S H+LLANIYA A RWED+ KVRR+MR L +KK PGCSS+E++ EF S++ H Sbjct: 542 GSSSHTLLANIYADAERWEDVTKVRRRMRDLGVKKMPGCSSVEINGVIHEFLVSDSSHSQ 601 Query: 1859 RERI 1870 E+I Sbjct: 602 MEQI 605 >gb|EOY30938.1| Pentatricopeptide repeat-containing protein, putative [Theobroma cacao] Length = 626 Score = 790 bits (2039), Expect = 0.0 Identities = 378/590 (64%), Positives = 477/590 (80%) Frame = +2 Query: 44 SETTHLTKKACIDLLKTCKCMKQLKEIQAQVFILGLHQNIDALHKLIAFTVDPVLGNLCY 223 S + LTK++CI LLK CK M LK+IQAQ F+LGLHQ+ L+KLIAF D +GN Y Sbjct: 29 SRSPPLTKQSCIFLLKNCKSMNHLKQIQAQTFLLGLHQDCHTLNKLIAFCTDSSIGNFRY 88 Query: 224 AQKIFNQIDKPSLFTYNVMIKSYTKMGSFRKALCLFDKLRENGLWPDNYTYPFVFKAVVG 403 A+K+F+ I PSLF YNVMIK++ K GS++ A+ +F KLRE GLWPDN+TYPFVFKA+ Sbjct: 89 AEKVFSLIRNPSLFIYNVMIKTFVKKGSYKNAILVFGKLREQGLWPDNFTYPFVFKAIGS 148 Query: 404 LRTVKDGETIHGFVLKSGNSFDCYIFNSIMDMYSELGCIDSSVKVFDELPERDSVSWNIL 583 L V +GE IHG V KSG FD Y+ NS+MDMY +LG + S K+FD++PERD V+WN+L Sbjct: 149 LGEVFEGEKIHGVVAKSGLEFDAYVINSLMDMYVQLGRVVYSKKIFDKMPERDVVAWNVL 208 Query: 584 ISGFLRCNRLDDAVGVFMNMRREGKSKPDEATVVSTLSACSALKNLELGKEIHEYVSTEL 763 ISG +RC +DAV VF M +EG KP+EAT+VSTLSAC+AL+ LELG EI YV EL Sbjct: 209 ISGLVRCGIFEDAVNVFGLMIKEGLVKPNEATIVSTLSACTALRRLELGNEIDRYVRKEL 268 Query: 764 QFTVRIGNALLDMYSKCGCLIIARQIFDTMPQKNVICWTSMVSGYVSCGLLDKARDLFDR 943 + T +GNALLDMY KCGCL IA ++FD MP KNV CWTSMVSGYV+CGLLD+AR+LFDR Sbjct: 269 ELTTIMGNALLDMYCKCGCLDIAIKVFDEMPIKNVNCWTSMVSGYVNCGLLDEARELFDR 328 Query: 944 SPVRDLVLWTAMINGYVQFNNVDEAIALFRKMQMERISPDKFTLVALLTGCAQLGSLEQG 1123 SPVRD+VLWTAMINGYVQFN DE++ LF++MQ++R+ PD F +V+LLTGCAQ+G+L QG Sbjct: 329 SPVRDVVLWTAMINGYVQFNRFDESMELFKEMQIQRVKPDNFVVVSLLTGCAQMGALGQG 388 Query: 1124 EWIHRYIEENRIPVDAVVGTALIEMYAKCGWVGKSLEVFDAIKEKDAASWTSIISALAMN 1303 +WIH Y+ ENRI VD +VGTALIEMYAKCG V ++LE+F + +KD ASWTS+I LA+N Sbjct: 389 KWIHAYLNENRIVVDTIVGTALIEMYAKCGCVEEALEIFYGLSKKDTASWTSVICGLAVN 448 Query: 1304 GKTSKALELFSKLEQAGIRPDDITFIGVLSACSHGGLVEEGREYFNSMKNIYQIEPKLEH 1483 G+ SKALELFS+++Q +PDDITFIGVLSAC+HGGLVEEGR++F+SM +YQIEPKLEH Sbjct: 449 GEASKALELFSQMKQTEEKPDDITFIGVLSACNHGGLVEEGRQFFDSMSKVYQIEPKLEH 508 Query: 1484 YGCLIDLFGRAGLLKEAEKMIEKIPNKDEKIVIPLYGALLSACRTYGDVDMGERIAKLLM 1663 Y CLIDL GRAG L E EK+I+ IP++D ++V+PLYG+LLSACRTY +V+MGE +A+ L+ Sbjct: 509 YACLIDLLGRAGRLAEVEKLIDDIPSQDNELVVPLYGSLLSACRTYVNVEMGEWVAQRLV 568 Query: 1664 QMESSDASIHSLLANIYASASRWEDMRKVRRKMRGLRIKKSPGCSSIEVD 1813 +++SSD+SIH+LLANIYASA RW D+ +VR KM+ L +KK PGCSSI+V+ Sbjct: 569 EIKSSDSSIHTLLANIYASADRWGDVIRVRAKMKDLGVKKVPGCSSIDVN 618 >ref|XP_006583750.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31430-like isoform X1 [Glycine max] Length = 621 Score = 788 bits (2034), Expect = 0.0 Identities = 373/598 (62%), Positives = 479/598 (80%) Frame = +2 Query: 35 TASSETTHLTKKACIDLLKTCKCMKQLKEIQAQVFILGLHQNIDALHKLIAFTVDPVLGN 214 +A E + L K I LLK+CK M QLK+IQA +F +GL Q+ D L+KL+AF++D LG+ Sbjct: 5 SALIECSKLMKGTYISLLKSCKSMSQLKQIQAHIFCVGLQQDRDTLNKLMAFSMDSSLGD 64 Query: 215 LCYAQKIFNQIDKPSLFTYNVMIKSYTKMGSFRKALCLFDKLRENGLWPDNYTYPFVFKA 394 YA +IFN I PSLF YN+MIK++ K GSFR A+ LF +LRE+G+WPDNYTYP+V K Sbjct: 65 FNYANRIFNYIHDPSLFIYNLMIKAFVKSGSFRSAISLFQQLREHGVWPDNYTYPYVLKG 124 Query: 395 VVGLRTVKDGETIHGFVLKSGNSFDCYIFNSIMDMYSELGCIDSSVKVFDELPERDSVSW 574 + + V++GE +H FV+K+G FD Y+ NS MDMY+ELG ++ +VF+E+P+RD+VSW Sbjct: 125 IGCIGEVREGEKVHAFVVKTGLEFDPYVCNSFMDMYAELGLVEGFTQVFEEMPDRDAVSW 184 Query: 575 NILISGFLRCNRLDDAVGVFMNMRREGKSKPDEATVVSTLSACSALKNLELGKEIHEYVS 754 NI+ISG++RC R ++AV V+ M E KP+EATVVSTLSAC+ L+NLELGKEIH+Y++ Sbjct: 185 NIMISGYVRCKRFEEAVDVYRRMWTESNEKPNEATVVSTLSACAVLRNLELGKEIHDYIA 244 Query: 755 TELQFTVRIGNALLDMYSKCGCLIIARQIFDTMPQKNVICWTSMVSGYVSCGLLDKARDL 934 +EL T +GNALLDMY KCG + +AR+IFD M KNV CWTSMV+GYV CG LD+AR+L Sbjct: 245 SELDLTTIMGNALLDMYCKCGHVSVAREIFDAMTVKNVNCWTSMVTGYVICGQLDQARNL 304 Query: 935 FDRSPVRDLVLWTAMINGYVQFNNVDEAIALFRKMQMERISPDKFTLVALLTGCAQLGSL 1114 F+RSP RD+VLWTAMINGYVQFN +E IALF +MQ+ + PDKF +V LLTGCAQ G+L Sbjct: 305 FERSPSRDIVLWTAMINGYVQFNRFEETIALFGEMQIRGVKPDKFIVVTLLTGCAQSGAL 364 Query: 1115 EQGEWIHRYIEENRIPVDAVVGTALIEMYAKCGWVGKSLEVFDAIKEKDAASWTSIISAL 1294 EQG+WIH YI+ENRI VDAVVGTALIEMYAKCG + KS E+F+ +KEKD SWTSII L Sbjct: 365 EQGKWIHNYIDENRIKVDAVVGTALIEMYAKCGCIEKSFEIFNGLKEKDTTSWTSIICGL 424 Query: 1295 AMNGKTSKALELFSKLEQAGIRPDDITFIGVLSACSHGGLVEEGREYFNSMKNIYQIEPK 1474 AMNGK S+ALELF ++ G++PDDITF+ VLSACSH GLVEEGR+ F+SM ++Y IEP Sbjct: 425 AMNGKPSEALELFKAMQTCGLKPDDITFVAVLSACSHAGLVEEGRKLFHSMSSMYHIEPN 484 Query: 1475 LEHYGCLIDLFGRAGLLKEAEKMIEKIPNKDEKIVIPLYGALLSACRTYGDVDMGERIAK 1654 LEHYGC IDL GRAGLL+EAE++++K+P ++ +I++PLYGALLSACRTYG++DMGER+A Sbjct: 485 LEHYGCFIDLLGRAGLLQEAEELVKKLPAQNNEIIVPLYGALLSACRTYGNIDMGERLAT 544 Query: 1655 LLMQMESSDASIHSLLANIYASASRWEDMRKVRRKMRGLRIKKSPGCSSIEVDSDTQE 1828 L +++SSD+S+H+LLA+IYASA RWED+RKVR KM+ L IKK PG S+IEVD Q+ Sbjct: 545 ALAKVKSSDSSLHTLLASIYASADRWEDVRKVRNKMKDLGIKKVPGYSAIEVDGKWQQ 602 >ref|XP_003520007.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31430-like [Glycine max] Length = 591 Score = 786 bits (2031), Expect = 0.0 Identities = 371/589 (62%), Positives = 476/589 (80%) Frame = +2 Query: 47 ETTHLTKKACIDLLKTCKCMKQLKEIQAQVFILGLHQNIDALHKLIAFTVDPVLGNLCYA 226 + + L K I LLK+CK M QLK+IQA +F GL Q+ D L+KL+AF++D LG+ YA Sbjct: 2 QCSKLLKGTYISLLKSCKSMSQLKQIQAHIFCFGLQQDRDILNKLMAFSMDSSLGDFNYA 61 Query: 227 QKIFNQIDKPSLFTYNVMIKSYTKMGSFRKALCLFDKLRENGLWPDNYTYPFVFKAVVGL 406 +IFN I PSLF YN+MIK++ K GS R A+ LF +LRE G+WPDNYTYP+V K + + Sbjct: 62 NRIFNHIHHPSLFIYNLMIKAFVKRGSLRSAISLFQQLRERGVWPDNYTYPYVLKGIGCI 121 Query: 407 RTVKDGETIHGFVLKSGNSFDCYIFNSIMDMYSELGCIDSSVKVFDELPERDSVSWNILI 586 V++GE IH FV+K+G FD Y+ NS+MDMY+ELG ++ +VF+E+PERD+VSWNI+I Sbjct: 122 GEVREGEKIHAFVVKTGLEFDPYVCNSLMDMYAELGLVEGFTQVFEEMPERDAVSWNIMI 181 Query: 587 SGFLRCNRLDDAVGVFMNMRREGKSKPDEATVVSTLSACSALKNLELGKEIHEYVSTELQ 766 SG++RC R ++AV V+ M+ E KP+EATVVSTLSAC+ L+NLELGKEIH+Y++ EL Sbjct: 182 SGYVRCKRFEEAVDVYRRMQMESNEKPNEATVVSTLSACAVLRNLELGKEIHDYIANELD 241 Query: 767 FTVRIGNALLDMYSKCGCLIIARQIFDTMPQKNVICWTSMVSGYVSCGLLDKARDLFDRS 946 T +GNALLDMY KCGC+ +AR+IFD M KNV CWTSMV+GYV CG LD+AR LF+RS Sbjct: 242 LTPIMGNALLDMYCKCGCVSVAREIFDAMIVKNVNCWTSMVTGYVICGQLDQARYLFERS 301 Query: 947 PVRDLVLWTAMINGYVQFNNVDEAIALFRKMQMERISPDKFTLVALLTGCAQLGSLEQGE 1126 P RD+VLWTAMINGYVQFN+ ++AIALF +MQ+ + PDKF +V LLTGCAQLG+LEQG+ Sbjct: 302 PSRDVVLWTAMINGYVQFNHFEDAIALFGEMQIRGVEPDKFIVVTLLTGCAQLGALEQGK 361 Query: 1127 WIHRYIEENRIPVDAVVGTALIEMYAKCGWVGKSLEVFDAIKEKDAASWTSIISALAMNG 1306 WIH YI+ENRI +DAVV TALIEMYAKCG + KSLE+F+ +K+ D SWTSII LAMNG Sbjct: 362 WIHNYIDENRIKMDAVVSTALIEMYAKCGCIEKSLEIFNGLKDMDTTSWTSIICGLAMNG 421 Query: 1307 KTSKALELFSKLEQAGIRPDDITFIGVLSACSHGGLVEEGREYFNSMKNIYQIEPKLEHY 1486 KTS+ALELF ++ G++PDDITF+ VLSAC H GLVEEGR+ F+SM +IY IEP LEHY Sbjct: 422 KTSEALELFEAMQTCGLKPDDITFVAVLSACGHAGLVEEGRKLFHSMSSIYHIEPNLEHY 481 Query: 1487 GCLIDLFGRAGLLKEAEKMIEKIPNKDEKIVIPLYGALLSACRTYGDVDMGERIAKLLMQ 1666 GC IDL GRAGLL+EAE++++K+P+++ +I++PLYGALLSACRTYG++DMGER+A L + Sbjct: 482 GCFIDLLGRAGLLQEAEELVKKLPDQNNEIIVPLYGALLSACRTYGNIDMGERLATALAK 541 Query: 1667 MESSDASIHSLLANIYASASRWEDMRKVRRKMRGLRIKKSPGCSSIEVD 1813 ++SSD+S+H+LLA+IYASA RWED+RKVR KM+ L IKK PG S+IEVD Sbjct: 542 VKSSDSSLHTLLASIYASADRWEDVRKVRSKMKDLGIKKVPGYSAIEVD 590 >ref|XP_006385618.1| hypothetical protein POPTR_0003s08690g [Populus trichocarpa] gi|550342748|gb|ERP63415.1| hypothetical protein POPTR_0003s08690g [Populus trichocarpa] Length = 609 Score = 785 bits (2027), Expect = 0.0 Identities = 379/580 (65%), Positives = 469/580 (80%) Frame = +2 Query: 104 MKQLKEIQAQVFILGLHQNIDALHKLIAFTVDPVLGNLCYAQKIFNQIDKPSLFTYNVMI 283 M QLK+IQAQ+F GLHQ+ D L KL+ F DP GNL +A++IFN I P LF YN+MI Sbjct: 1 MNQLKQIQAQIFRGGLHQSTDTLKKLMVFCADPSNGNLVHAERIFNYIQNPGLFVYNIMI 60 Query: 284 KSYTKMGSFRKALCLFDKLRENGLWPDNYTYPFVFKAVVGLRTVKDGETIHGFVLKSGNS 463 K++ K G FRK L LF+KLRE+GLWPDN+TYPFV KA+ L V + E +HGFV+K+G Sbjct: 61 KAFAKKGIFRKCLMLFNKLREDGLWPDNFTYPFVLKAIGCLGEVLEAEKLHGFVMKTGLE 120 Query: 464 FDCYIFNSIMDMYSELGCIDSSVKVFDELPERDSVSWNILISGFLRCNRLDDAVGVFMNM 643 D Y+ N ++DMY++LG +D K+FDE+PERD VSWN+LISG+++ R +DA+ VF M Sbjct: 121 SDTYVCNPLIDMYAKLGQVDVMRKLFDEMPERDVVSWNVLISGYVKRRRFEDAIDVFCCM 180 Query: 644 RREGKSKPDEATVVSTLSACSALKNLELGKEIHEYVSTELQFTVRIGNALLDMYSKCGCL 823 R E +P+E TVVSTLSAC+ALK LELGKEIH YV L+ T IG+ALLDMY KCGCL Sbjct: 181 REESYLRPNEPTVVSTLSACAALKCLELGKEIHCYVRDRLELTSIIGSALLDMYCKCGCL 240 Query: 824 IIARQIFDTMPQKNVICWTSMVSGYVSCGLLDKARDLFDRSPVRDLVLWTAMINGYVQFN 1003 +AR+IFD MP KNVICWTSMVSGYV+ G LDKAR+LF+RSPV+D+VLWTAMINGYVQFN Sbjct: 241 SVARKIFDEMPHKNVICWTSMVSGYVNYGELDKARELFERSPVKDVVLWTAMINGYVQFN 300 Query: 1004 NVDEAIALFRKMQMERISPDKFTLVALLTGCAQLGSLEQGEWIHRYIEENRIPVDAVVGT 1183 + DEA+ALF++MQ++R+ PDKF LVALLTGCAQ+G+LEQG WIH YI+E IPVDAVVGT Sbjct: 301 HFDEAVALFQEMQIQRVKPDKFVLVALLTGCAQMGALEQGTWIHGYIDEKGIPVDAVVGT 360 Query: 1184 ALIEMYAKCGWVGKSLEVFDAIKEKDAASWTSIISALAMNGKTSKALELFSKLEQAGIRP 1363 +LIEMY+KCG + K+L +F ++EKD A+WTSII LAMNGKTSKALELFSK++Q P Sbjct: 361 SLIEMYSKCGCIEKALRIFCGLREKDTATWTSIICGLAMNGKTSKALELFSKMKQVEAIP 420 Query: 1364 DDITFIGVLSACSHGGLVEEGREYFNSMKNIYQIEPKLEHYGCLIDLFGRAGLLKEAEKM 1543 D++TFIGVLSACSHGGLVEEGRE+FNSM +IY IEPKLEHYGCLIDL GRAG L EAE++ Sbjct: 421 DEVTFIGVLSACSHGGLVEEGREFFNSMTSIYNIEPKLEHYGCLIDLLGRAGQLDEAEEL 480 Query: 1544 IEKIPNKDEKIVIPLYGALLSACRTYGDVDMGERIAKLLMQMESSDASIHSLLANIYASA 1723 I+KI N + +I++PLYG+LLSACR Y +V MGER+A+ L+++ES D+S+H+LLANIYASA Sbjct: 481 IKKIVNANNEIIVPLYGSLLSACRIYKNVQMGERVAEQLVKIESRDSSVHTLLANIYASA 540 Query: 1724 SRWEDMRKVRRKMRGLRIKKSPGCSSIEVDSDTQEFNASN 1843 RW D+ +VRR+M+ L +KK PGCSSIEVD EF N Sbjct: 541 GRWVDVNRVRREMKDLGVKKVPGCSSIEVDGIVHEFLVGN 580 >gb|ESW25189.1| hypothetical protein PHAVU_003G014900g [Phaseolus vulgaris] Length = 595 Score = 783 bits (2022), Expect = 0.0 Identities = 369/585 (63%), Positives = 476/585 (81%), Gaps = 1/585 (0%) Frame = +2 Query: 77 IDLLKTCKCMKQLKEIQAQVFILGLHQNIDALHKLIAFTVDPVLGNLCYAQKIFNQIDKP 256 I LLK+CK M Q K+IQA +F +GL Q+ D L+KL+AF++D LG+ YA +IF I P Sbjct: 6 ISLLKSCKSMSQFKQIQAHIFSVGLQQDRDTLNKLMAFSMDSSLGDFNYANRIFKHIHNP 65 Query: 257 SLFTYNVMIKSYTKMGSFRKALCLFDKLRENGLWPDNYTYPFVFKAVVGLRTVKDGETIH 436 SLF YN+MIK++ K GSFR A+ LF +LRE+G+WPDNYTYP+V K + + VK+G+ +H Sbjct: 66 SLFIYNLMIKAFVKRGSFRTAISLFHQLREHGVWPDNYTYPYVLKGIGCIGEVKEGQKVH 125 Query: 437 GFVLKSGNSFDCYIFNSIMDMYSELGCIDSSVKVFDELPERDSVSWNILISGFLRCNRLD 616 FV+++G FD Y+ NS+MDMY+ELG ++ +VF+E+PERD+VSWNI+ISG++RC R Sbjct: 126 AFVVRTGLEFDAYVGNSLMDMYAELGLVEGFTQVFEEMPERDTVSWNIMISGYVRCKRFQ 185 Query: 617 DAVGVFMNMRREGKSKPDEATVVSTLSACSALKNLELGKEIHEYVSTELQFTVRIGNALL 796 +AV V+ MR+E KP+EATVVS+LSAC+AL+NLELGKEIH+Y+ EL FT+ +GNALL Sbjct: 186 EAVDVYNRMRKESNEKPNEATVVSSLSACTALRNLELGKEIHDYIVNELDFTIIMGNALL 245 Query: 797 DMYSKCGCLIIARQIFDTMPQKNVICWTSMVSGYVSCGLLDKARDLFDRSPVRDLVLWTA 976 DMY KCG + +A++IFD M KNV CWTSMV+GYV+CG LD+ARD F+RSP RD+VLWTA Sbjct: 246 DMYCKCGHVSVAQEIFDAMRVKNVNCWTSMVTGYVACGWLDQARDYFERSPSRDIVLWTA 305 Query: 977 MINGYVQFNNVDEAIALFRKMQMERISPDKFTLVALLTGCAQLGSLEQGEWIHRYIEENR 1156 MINGYVQFN +EAIALF +MQM + PD F +V LLTGCAQ G+LEQG+WIH YI+ENR Sbjct: 306 MINGYVQFNRFEEAIALFGEMQMRGVRPDNFIVVTLLTGCAQSGALEQGKWIHNYIDENR 365 Query: 1157 IPVDAVVGTALIEMYAKCGWVGKSLEVFDAIKEKDAASWTSIISALAMNGKTSKALELFS 1336 I VDAVVGTALIEMYAKCG + +LE+FDA+KEKD ASWT+II LAMNGKTSKALELF Sbjct: 366 ILVDAVVGTALIEMYAKCGCIDIALEIFDALKEKDTASWTAIICGLAMNGKTSKALELFE 425 Query: 1337 KLEQAGIRPDDITFIGVLSACSHGGLVEEGREYFNSMKNIYQIEPKLEHYGCLIDLFGRA 1516 ++ G +PDD+TFI VLSAC+H GLVEEGR+ F+SM ++Y IEP LEHYGC IDL GRA Sbjct: 426 AMQVCGFKPDDVTFIAVLSACTHAGLVEEGRKLFHSMSSVYHIEPNLEHYGCFIDLLGRA 485 Query: 1517 GLLKEAEKMIEKIPNK-DEKIVIPLYGALLSACRTYGDVDMGERIAKLLMQMESSDASIH 1693 GLL+EAE+++ K+P++ +++I++PLYGALLSACRTY ++DMGER+A L +++SSD+S+H Sbjct: 486 GLLQEAEELVRKLPDENNDEIIVPLYGALLSACRTYSNIDMGERLATALAKVKSSDSSLH 545 Query: 1694 SLLANIYASASRWEDMRKVRRKMRGLRIKKSPGCSSIEVDSDTQE 1828 +LLA+IYASA RWED+RKVR KM+ + IKK PG S+IEVD Q+ Sbjct: 546 TLLASIYASADRWEDVRKVRSKMKDMGIKKVPGYSAIEVDGKWQQ 590 >ref|XP_004513004.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31430-like isoform X1 [Cicer arietinum] Length = 605 Score = 765 bits (1975), Expect = 0.0 Identities = 369/596 (61%), Positives = 461/596 (77%) Frame = +2 Query: 65 KKACIDLLKTCKCMKQLKEIQAQVFILGLHQNIDALHKLIAFTVDPVLGNLCYAQKIFNQ 244 K CI LLK+CK M LK+IQ +F GL Q+ D L+KL+A ++ + YA +IFN Sbjct: 2 KGTCISLLKSCKSMSHLKQIQTLIFSTGLQQDRDTLNKLMAVSIQ----DFHYALRIFNH 57 Query: 245 IDKPSLFTYNVMIKSYTKMGSFRKALCLFDKLRENGLWPDNYTYPFVFKAVVGLRTVKDG 424 PSLF YN++IKS+ K G+F A+ LF++LRE+GLWPDNYTYP+V KA+ + V G Sbjct: 58 TQHPSLFIYNLLIKSFVKRGTFTAAISLFNQLREDGLWPDNYTYPYVLKAIGCMGEVGQG 117 Query: 425 ETIHGFVLKSGNSFDCYIFNSIMDMYSELGCIDSSVKVFDELPERDSVSWNILISGFLRC 604 E +H FV+K+G FD Y+ NS+MDMY+ELG + +F+E+P+RD+VSWNI+ISGF+RC Sbjct: 118 EKVHAFVIKTGLDFDNYVCNSLMDMYAELGRVACLKHMFEEMPDRDNVSWNIMISGFVRC 177 Query: 605 NRLDDAVGVFMNMRREGKSKPDEATVVSTLSACSALKNLELGKEIHEYVSTELQFTVRIG 784 R +AV VF MR E KP EATVVSTL+AC+AL+++ELGKEIH Y++ EL FT +G Sbjct: 178 KRFREAVEVFQQMRMENNEKPSEATVVSTLTACAALRHVELGKEIHSYIANELDFTTIMG 237 Query: 785 NALLDMYSKCGCLIIARQIFDTMPQKNVICWTSMVSGYVSCGLLDKARDLFDRSPVRDLV 964 NALLDMY KCGC+ +AR+IFD M KNV CWTSMV+GYV+CG LD+ARDLFD+SP RD+V Sbjct: 238 NALLDMYCKCGCVSVAREIFDGMTVKNVNCWTSMVTGYVNCGQLDQARDLFDKSPTRDIV 297 Query: 965 LWTAMINGYVQFNNVDEAIALFRKMQMERISPDKFTLVALLTGCAQLGSLEQGEWIHRYI 1144 LWTAMINGYVQFN DEAIALF +MQ+ + PDKF +V+LLT CAQLG+LE G WIH Y+ Sbjct: 298 LWTAMINGYVQFNCFDEAIALFGEMQVRGVKPDKFIVVSLLTCCAQLGALEHGRWIHDYV 357 Query: 1145 EENRIPVDAVVGTALIEMYAKCGWVGKSLEVFDAIKEKDAASWTSIISALAMNGKTSKAL 1324 ENRI VDAVVGT+LIEMYAKCG + KSLEVF+ +KEKD ASWTSII LAMNGKT KAL Sbjct: 358 RENRITVDAVVGTSLIEMYAKCGCIEKSLEVFNGLKEKDTASWTSIICGLAMNGKTKKAL 417 Query: 1325 ELFSKLEQAGIRPDDITFIGVLSACSHGGLVEEGREYFNSMKNIYQIEPKLEHYGCLIDL 1504 ELF +++ G +PDD+TFI +LSACSH GLVEEGR F+SM IY IEP LEHYGC IDL Sbjct: 418 ELFEEMKTFGAKPDDVTFIVLLSACSHAGLVEEGRRLFHSMSCIYDIEPNLEHYGCFIDL 477 Query: 1505 FGRAGLLKEAEKMIEKIPNKDEKIVIPLYGALLSACRTYGDVDMGERIAKLLMQMESSDA 1684 GRAGLL EAE++I K+P++ +I++P+YG+LLSACRTYG+ DMGER+A L +++SSD+ Sbjct: 478 LGRAGLLHEAEELIRKLPDQKNEIIVPIYGSLLSACRTYGNTDMGERLATTLAKVKSSDS 537 Query: 1685 SIHSLLANIYASASRWEDMRKVRRKMRGLRIKKSPGCSSIEVDSDTQEFNASNAFH 1852 S+HSLLA+IYASA RWED K R KM+ L IKK PGCS+IEVD + + H Sbjct: 538 SLHSLLASIYASADRWEDASKTRSKMKDLHIKKVPGCSAIEVDGIGNKVEVGDLSH 593 >gb|EXB76274.1| hypothetical protein L484_025631 [Morus notabilis] Length = 564 Score = 758 bits (1958), Expect = 0.0 Identities = 365/554 (65%), Positives = 447/554 (80%) Frame = +2 Query: 182 IAFTVDPVLGNLCYAQKIFNQIDKPSLFTYNVMIKSYTKMGSFRKALCLFDKLRENGLWP 361 +AF DP LGNL YA+KIF I +P+LF YNVMIK+ TK GSFR+A+ +F++LRE GLWP Sbjct: 1 MAFCTDPSLGNLQYAEKIFGFIQEPTLFVYNVMIKALTKKGSFRRAILVFERLREEGLWP 60 Query: 362 DNYTYPFVFKAVVGLRTVKDGETIHGFVLKSGNSFDCYIFNSIMDMYSELGCIDSSVKVF 541 DN+TYPFV KA+ L V++G+ +HGFV+K+G FD Y+ NS++DMY LG + K+F Sbjct: 61 DNFTYPFVMKAIGCLGEVQEGKKVHGFVVKTGLEFDTYVCNSLIDMYGHLGMLLYVEKLF 120 Query: 542 DELPERDSVSWNILISGFLRCNRLDDAVGVFMNMRREGKSKPDEATVVSTLSACSALKNL 721 +++ ERDSVSWN+ IS F+R R DAV VF MR E KPDE T+VSTLSAC ALK+L Sbjct: 121 EKMSERDSVSWNVTISAFVRWGRFSDAVSVFRRMRLESNVKPDEPTIVSTLSACRALKDL 180 Query: 722 ELGKEIHEYVSTELQFTVRIGNALLDMYSKCGCLIIARQIFDTMPQKNVICWTSMVSGYV 901 ELG+EIH+YV +L FT RI N LLDMY+KCG L +AR+IFD MP KNVICWTSMVSGYV Sbjct: 181 ELGEEIHDYVRNKLGFTDRINNVLLDMYAKCGWLSVAREIFDEMPTKNVICWTSMVSGYV 240 Query: 902 SCGLLDKARDLFDRSPVRDLVLWTAMINGYVQFNNVDEAIALFRKMQMERISPDKFTLVA 1081 G LD+A +LFDRSPVRD+VLWTAMINGYVQ+N D+A+ LF++MQ +R+ DKFT+VA Sbjct: 241 KFGKLDEAIELFDRSPVRDVVLWTAMINGYVQYNRFDDAMDLFQEMQSKRVKADKFTMVA 300 Query: 1082 LLTGCAQLGSLEQGEWIHRYIEENRIPVDAVVGTALIEMYAKCGWVGKSLEVFDAIKEKD 1261 LLTGCAQLG+LEQGEWIH YI EN I +DAVVGTALIEMYAKCG + KSLE+F ++EKD Sbjct: 301 LLTGCAQLGALEQGEWIHGYINENGIDIDAVVGTALIEMYAKCGCIDKSLEIFKEVREKD 360 Query: 1262 AASWTSIISALAMNGKTSKALELFSKLEQAGIRPDDITFIGVLSACSHGGLVEEGREYFN 1441 A+WTSII LAMNG++SKALELFS++ QAGI PDDITFIGVLSACSH GLVEEGR++F+ Sbjct: 361 TAAWTSIICGLAMNGRSSKALELFSEMRQAGINPDDITFIGVLSACSHAGLVEEGRQFFH 420 Query: 1442 SMKNIYQIEPKLEHYGCLIDLFGRAGLLKEAEKMIEKIPNKDEKIVIPLYGALLSACRTY 1621 SM IY IEPK EHYGCLIDLFGRAGLL EAE++IE+IPN +++ LYGALLSACR + Sbjct: 421 SMMEIYMIEPKYEHYGCLIDLFGRAGLLDEAEELIERIPNDSNTVLVSLYGALLSACRIH 480 Query: 1622 GDVDMGERIAKLLMQMESSDASIHSLLANIYASASRWEDMRKVRRKMRGLRIKKSPGCSS 1801 G+V+MGER+A+ L +ESSD+S+H LLANIYASA RWEDM KVR M+ ++K+PGCSS Sbjct: 481 GNVEMGERVARRLADIESSDSSVHMLLANIYASAERWEDMTKVRMNMKDFGVRKTPGCSS 540 Query: 1802 IEVDSDTQEFNASN 1843 IE++ EF A + Sbjct: 541 IEINGVVSEFVAGD 554 >gb|ABD96949.1| hypothetical protein [Cleome spinosa] Length = 639 Score = 723 bits (1866), Expect = 0.0 Identities = 354/598 (59%), Positives = 451/598 (75%) Frame = +2 Query: 59 LTKKACIDLLKTCKCMKQLKEIQAQVFILGLHQNIDALHKLIAFTVDPVLGNLCYAQKIF 238 L+K C+DLL++C+ M L +I A++F +GL N+D L K++ F DP G++ YA+++ Sbjct: 14 LSKNYCVDLLQSCESMAHLTQIHAKIFRVGLQDNMDTLTKIVLFCTDPSRGSIRYAERVL 73 Query: 239 NQIDKPSLFTYNVMIKSYTKMGSFRKALCLFDKLRENGLWPDNYTYPFVFKAVVGLRTVK 418 + P L YN+MIK+ K +FRK L LF +LR+ GL PDN+T P VFKA+ L V Sbjct: 74 GFVQSPCLVMYNLMIKAVAKDENFRKVLVLFSELRKQGLNPDNFTLPPVFKAMGCLGKVV 133 Query: 419 DGETIHGFVLKSGNSFDCYIFNSIMDMYSELGCIDSSVKVFDELPERDSVSWNILISGFL 598 +GE +HG+V+KSG FD + NS+M MY LG ++ + KVFDE+PERD VSWN+LIS ++ Sbjct: 134 EGEKVHGYVVKSG--FDACVCNSVMGMYGALGKMEVAKKVFDEIPERDVVSWNVLISSYV 191 Query: 599 RCNRLDDAVGVFMNMRREGKSKPDEATVVSTLSACSALKNLELGKEIHEYVSTELQFTVR 778 + +DA+ VF MRRE K DEATVVSTLSACS L+N E+G+EIH YV EL+ T + Sbjct: 192 GHRKFEDAIAVFRRMRRESNLKADEATVVSTLSACSVLRNQEVGEEIHRYVDAELEMTTK 251 Query: 779 IGNALLDMYSKCGCLIIARQIFDTMPQKNVICWTSMVSGYVSCGLLDKARDLFDRSPVRD 958 IGNALLDMY KCGC+ AR IFD M KNVICWTSMVSGY S G LD+AR+LF+RSPVRD Sbjct: 252 IGNALLDMYCKCGCVDKARAIFDEMGNKNVICWTSMVSGYASNGSLDEARELFERSPVRD 311 Query: 959 LVLWTAMINGYVQFNNVDEAIALFRKMQMERISPDKFTLVALLTGCAQLGSLEQGEWIHR 1138 +VLWTAMINGYVQFN DEA+ LFRKMQ++R+ PD F LV LL GCAQ G+LEQG+W+H Sbjct: 312 IVLWTAMINGYVQFNLFDEALKLFRKMQIQRLRPDNFILVTLLKGCAQTGALEQGKWLHG 371 Query: 1139 YIEENRIPVDAVVGTALIEMYAKCGWVGKSLEVFDAIKEKDAASWTSIISALAMNGKTSK 1318 YI EN I +D VVGTAL+++YAKCG V K+LEVF +KE+D ASWTS+I LA+NG TSK Sbjct: 372 YIHENSITLDRVVGTALVDVYAKCGCVEKALEVFYEMKERDTASWTSVIYGLAVNGMTSK 431 Query: 1319 ALELFSKLEQAGIRPDDITFIGVLSACSHGGLVEEGREYFNSMKNIYQIEPKLEHYGCLI 1498 AL+ FS++E+AG RPDDITFIGVL+AC+HGGLVEEGR YF+SM Y+I+PK EHY CLI Sbjct: 432 ALDFFSQMEEAGFRPDDITFIGVLTACNHGGLVEEGRRYFDSMTKTYKIQPKSEHYSCLI 491 Query: 1499 DLFGRAGLLKEAEKMIEKIPNKDEKIVIPLYGALLSACRTYGDVDMGERIAKLLMQMESS 1678 DL RAGLL EAE ++E IP + IV+PLY +LLSACR YG++ M ER+ + L ++E Sbjct: 492 DLLCRAGLLDEAELLLEMIPIESSDIVVPLYCSLLSACRNYGNLKMSERVGRRLERVEVK 551 Query: 1679 DASIHSLLANIYASASRWEDMRKVRRKMRGLRIKKSPGCSSIEVDSDTQEFNASNAFH 1852 D+S+H+LLA++YASA+RWED+ VRRKM+ L I+K PGCSSIEV+ EF H Sbjct: 552 DSSVHTLLASVYASANRWEDVTTVRRKMKELGIRKFPGCSSIEVNGVLHEFMVGGPSH 609 >gb|EPS73045.1| hypothetical protein M569_01711 [Genlisea aurea] Length = 664 Score = 686 bits (1771), Expect = 0.0 Identities = 356/625 (56%), Positives = 453/625 (72%), Gaps = 5/625 (0%) Frame = +2 Query: 23 HYSNTASSETTHLTKKACIDLLKTCKCMKQLKEIQAQVFILGLHQNIDALHKLIAFTVDP 202 H+S+ +S LT+KA I LLK CK M +I A F GLH NID LHK++AF D Sbjct: 16 HFSSVSSPR---LTRKAYIQLLKGCKSMTHFYQIHALGFSHGLHDNIDVLHKVVAFAAD- 71 Query: 203 VLGNLCYAQKIFNQIDKPSLFTYNVMIKSYTKMGSFRKALCLFDKLRENGLWPDNYTYPF 382 +L YA+K+F ++++P+LF YNV+IK + K GSFRKAL LFD+LR GLWPDNYTYPF Sbjct: 72 --ADLSYAEKVFKRVERPTLFIYNVLIKRFVKSGSFRKALHLFDELRLRGLWPDNYTYPF 129 Query: 383 VFKAVVGLRTVKDGETIHGFVLKSGNSFDCYIFNSIMDMYSELGCIDSSVKVFDELPERD 562 V KAV GL +V +GE IHGF LKSG +D Y+ NS++DMY ELG + K+FDE+P RD Sbjct: 130 VCKAVAGLSSVAEGEKIHGFALKSGMLYDGYVCNSLLDMYGELGIHGCAGKLFDEMPLRD 189 Query: 563 SVSWNILISGFLRCNRLDDAVGVFMNMRREGKSKPDEATVVSTLSACSALKNLELGKEIH 742 VSWN+LIS F + N D A+ V+ M E PDEATVVSTLSAC+A K+++LG+ IH Sbjct: 190 LVSWNVLISVFAKNNMPDHAIAVYKRMGSETALCPDEATVVSTLSACAAAKDIDLGRGIH 249 Query: 743 EYVSTELQFTVRIGNALLDMYSKCGCLIIARQIFDTMPQKNVICWTSMVSGYVSCGLLDK 922 +YVSTEL FT + NALLDMY+KCG L AR+IFD++ +KNV+CWTSMVS + G L + Sbjct: 250 KYVSTELGFTAIVQNALLDMYAKCGHLETARRIFDSVREKNVVCWTSMVSACANAGNLVE 309 Query: 923 ARDLFDRSPVRDLVLWTAMINGYVQFNNVDEAIALFRKMQMERISPDKFTLVALLTGCAQ 1102 AR LF+RSPVRD+V+WTAMINGYVQFN VD+A+ALFR MQ ++ PD++TLVALLTGCA Sbjct: 310 ARALFERSPVRDVVMWTAMINGYVQFNMVDDAMALFRLMQNAKLEPDRYTLVALLTGCAH 369 Query: 1103 LGSLEQGEWIHRYIEENRIPVDAVVGTALIEMYAKCGWVGKSLEVFDAIKEKDAASWTSI 1282 G+LEQGEWIH Y+ ENRIPVD V+GTAL+EMYAKCG + +SL +F + +D ASWTS+ Sbjct: 370 SGALEQGEWIHNYLIENRIPVDVVLGTALVEMYAKCGCLNESLRIFRRLDRRDTASWTSM 429 Query: 1283 ISALAMNGKTSKALELFSKLEQAGIRPDDITFIGVLSACSHGGLVEEGREYFNSMKNIYQ 1462 I ALAMNG +++LE+F ++ +AGI PDD+ FIGVL+ACSHGGLV+EGR +F SM Y Sbjct: 430 IFALAMNGDVAESLEVFEEMIRAGIPPDDVAFIGVLTACSHGGLVDEGRRHFASMAETYG 489 Query: 1463 IEPKLEHYGCLIDLFGRAGLLKEAEK---MIEKIPNKDEK--IVIPLYGALLSACRTYGD 1627 IEPKLEHYGCLIDLFGRAGLL EAEK M + +D V LYGALL CR Y + Sbjct: 490 IEPKLEHYGCLIDLFGRAGLLDEAEKATAMSMRSSGRDSNGIGVDSLYGALLGGCRKYEN 549 Query: 1628 VDMGERIAKLLMQMESSDASIHSLLANIYASASRWEDMRKVRRKMRGLRIKKSPGCSSIE 1807 VD+ ER+AK ++E D L+++IYA+ASRW+D ++R+ + IKK PGCS++E Sbjct: 550 VDVAERVAK---RVEGGD---KVLMSSIYAAASRWDDAVRIRKGRGKMGIKKLPGCSAVE 603 Query: 1808 VDSDTQEFNASNAFHRGRERIPNDV 1882 D + F RG+ R N + Sbjct: 604 A-GDLISEHLGFCFVRGKRRKGNSM 627 >gb|EMJ05120.1| hypothetical protein PRUPE_ppa015065mg [Prunus persica] Length = 531 Score = 668 bits (1724), Expect = 0.0 Identities = 331/548 (60%), Positives = 415/548 (75%) Frame = +2 Query: 104 MKQLKEIQAQVFILGLHQNIDALHKLIAFTVDPVLGNLCYAQKIFNQIDKPSLFTYNVMI 283 M QLK+IQ+Q+F+LGLHQ+ L KL+AF DP LGNL A+K+F+ I P LF YN MI Sbjct: 1 MNQLKQIQSQMFLLGLHQDRFTLSKLMAFCTDPSLGNLYCAEKVFHYIQNPCLFVYNRMI 60 Query: 284 KSYTKMGSFRKALCLFDKLRENGLWPDNYTYPFVFKAVVGLRTVKDGETIHGFVLKSGNS 463 K++ K GSFR AL LF LRE GLWPD++TYPFVFKA+ LR ++G +HG V+K+G Sbjct: 61 KAFAKRGSFRSALELFRLLREEGLWPDSFTYPFVFKAIGCLREPREGAKVHGLVVKTGFE 120 Query: 464 FDCYIFNSIMDMYSELGCIDSSVKVFDELPERDSVSWNILISGFLRCNRLDDAVGVFMNM 643 FD Y+ NS++D + C+ PER+ + WN+ ISG++RC R +DA +F M Sbjct: 121 FDAYVCNSLID----INCLTK--------PERNWLCWNVTISGYVRCRRFEDAFDMFQRM 168 Query: 644 RREGKSKPDEATVVSTLSACSALKNLELGKEIHEYVSTELQFTVRIGNALLDMYSKCGCL 823 R E KPDEATVVSTLSAC+ALKNLELGK+IH+YV +EL+ T IGNALL+MY+KCGCL Sbjct: 169 RCESNKKPDEATVVSTLSACTALKNLELGKQIHDYVKSELKLTTIIGNALLNMYAKCGCL 228 Query: 824 IIARQIFDTMPQKNVICWTSMVSGYVSCGLLDKARDLFDRSPVRDLVLWTAMINGYVQFN 1003 R+IFD +P KNVIC TSMVSGY D VLWTAMINGY Q+N Sbjct: 229 NEGRRIFDEIPSKNVICCTSMVSGY-------------------DAVLWTAMINGYAQYN 269 Query: 1004 NVDEAIALFRKMQMERISPDKFTLVALLTGCAQLGSLEQGEWIHRYIEENRIPVDAVVGT 1183 DEA+ALF++MQ+ R+ DKFT V LLTGCAQ G+LEQG+WIHRY+EEN I +DA Sbjct: 270 RFDEAVALFQEMQIRRVKGDKFTAVTLLTGCAQSGALEQGKWIHRYMEENGIKIDA---P 326 Query: 1184 ALIEMYAKCGWVGKSLEVFDAIKEKDAASWTSIISALAMNGKTSKALELFSKLEQAGIRP 1363 ALIEMYAKCG + KSLE+F+ ++EKDAA WTSI+ LA NGK SKA+ELFS++ Q GI P Sbjct: 327 ALIEMYAKCGCIDKSLEIFNGLREKDAACWTSIVCGLAKNGKASKAVELFSEMIQIGINP 386 Query: 1364 DDITFIGVLSACSHGGLVEEGREYFNSMKNIYQIEPKLEHYGCLIDLFGRAGLLKEAEKM 1543 DDI FI VL ACSHGGLV+EGR++FNSM+ +Y+IEPKLEHY CL+DL GRAGLL EAE+M Sbjct: 387 DDINFIAVLRACSHGGLVDEGRKFFNSMRKMYEIEPKLEHYACLVDLLGRAGLLDEAEEM 446 Query: 1544 IEKIPNKDEKIVIPLYGALLSACRTYGDVDMGERIAKLLMQMESSDASIHSLLANIYASA 1723 IE++P+++++I+IPLYGALLSACR +G+V+MGER+AK L +ESS +S+H+LLAN YASA Sbjct: 447 IERVPSENKEIMIPLYGALLSACRIHGNVEMGERVAKRLADIESSGSSVHTLLANTYASA 506 Query: 1724 SRWEDMRK 1747 RWED+ K Sbjct: 507 DRWEDVTK 514 >ref|XP_006415328.1| hypothetical protein EUTSA_v10007222mg [Eutrema salsugineum] gi|557093099|gb|ESQ33681.1| hypothetical protein EUTSA_v10007222mg [Eutrema salsugineum] Length = 569 Score = 632 bits (1630), Expect = e-178 Identities = 306/529 (57%), Positives = 400/529 (75%) Frame = +2 Query: 245 IDKPSLFTYNVMIKSYTKMGSFRKALCLFDKLRENGLWPDNYTYPFVFKAVVGLRTVKDG 424 + PSL YN M+KS + SF K L LF +LR GL+PDN+T P V K++ LR V +G Sbjct: 6 VQAPSLIMYNKMLKSLAETKSFTKVLALFGELRGQGLYPDNFTLPVVLKSIGRLRKVLEG 65 Query: 425 ETIHGFVLKSGNSFDCYIFNSIMDMYSELGCIDSSVKVFDELPERDSVSWNILISGFLRC 604 E +HG+ +K+G FD Y+ NS+M MY+ LG ++ + KVFDE+PERD VSWN LIS ++ Sbjct: 66 EKVHGYAMKAGLEFDSYVCNSLMGMYASLGKMEITHKVFDEMPERDVVSWNGLISSYVGH 125 Query: 605 NRLDDAVGVFMNMRREGKSKPDEATVVSTLSACSALKNLELGKEIHEYVSTELQFTVRIG 784 R +DA+ VF MRRE KPDE T+VSTLSACSALKNL++G+ IH YV TE + +V+IG Sbjct: 126 GRFEDAIAVFKRMRRESNLKPDEGTIVSTLSACSALKNLDIGEGIHRYVVTEFEMSVKIG 185 Query: 785 NALLDMYSKCGCLIIARQIFDTMPQKNVICWTSMVSGYVSCGLLDKARDLFDRSPVRDLV 964 NAL+DM+ KCGCL AR +FD++ KNV CWTSMVSGYVS G +D+ R+LF+RSPV+D+V Sbjct: 186 NALVDMFCKCGCLDKARVVFDSVRSKNVKCWTSMVSGYVSNGRIDEGRELFERSPVKDVV 245 Query: 965 LWTAMINGYVQFNNVDEAIALFRKMQMERISPDKFTLVALLTGCAQLGSLEQGEWIHRYI 1144 LWTAM+NGYVQFN DEA+ LFR MQ E + D F LV+LL GC Q G+LEQG+WIH YI Sbjct: 246 LWTAMMNGYVQFNRFDEALELFRCMQTEGVKLDNFVLVSLLKGCGQTGALEQGKWIHGYI 305 Query: 1145 EENRIPVDAVVGTALIEMYAKCGWVGKSLEVFDAIKEKDAASWTSIISALAMNGKTSKAL 1324 ENR+ VD VVGTAL++MYAKCG + +LEVF KE+D ASWTS+I LAMNG + +AL Sbjct: 306 YENRVAVDKVVGTALVDMYAKCGCIETALEVFYETKERDTASWTSLIYGLAMNGMSRRAL 365 Query: 1325 ELFSKLEQAGIRPDDITFIGVLSACSHGGLVEEGREYFNSMKNIYQIEPKLEHYGCLIDL 1504 EL+ ++E +R DDITF+ VL+AC+HGG V EGR F+SM ++I+PK EHY CLIDL Sbjct: 366 ELYYEMENVDVRLDDITFVAVLTACNHGGFVAEGRRIFHSMTKEHKIQPKSEHYSCLIDL 425 Query: 1505 FGRAGLLKEAEKMIEKIPNKDEKIVIPLYGALLSACRTYGDVDMGERIAKLLMQMESSDA 1684 RAGLL EAE++I+KI N++++ ++P+Y +LLSA R YG+V + ER+A+ L +E SD+ Sbjct: 426 LCRAGLLDEAEELIDKIQNENDETLVPVYCSLLSAARNYGNVKLAERVAEKLEIVEVSDS 485 Query: 1685 SIHSLLANIYASASRWEDMRKVRRKMRGLRIKKSPGCSSIEVDSDTQEF 1831 S H+LLA++YASA+RWED+ VRR+M+ L I+K PGCSS+EVD EF Sbjct: 486 SAHTLLASVYASANRWEDVAIVRREMKDLGIRKFPGCSSVEVDGIPHEF 534 Score = 94.0 bits (232), Expect = 2e-16 Identities = 76/354 (21%), Positives = 161/354 (45%), Gaps = 9/354 (2%) Frame = +2 Query: 8 LAKAIHYSNTASSETTHLTKKACIDLLKTCKCMKQLKEIQAQVFILGLHQNIDALHKLIA 187 + + IH E + A +D+ C C+ + + + V +N+ +++ Sbjct: 166 IGEGIHRYVVTEFEMSVKIGNALVDMFCKCGCLDKARVVFDSV----RSKNVKCWTSMVS 221 Query: 188 FTVDPVLGNLCYAQKIFNQIDKPSLFTYNVMIKSYTKMGSFRKALCLFDKLRENGLWPDN 367 V G + +++F + + + M+ Y + F +AL LF ++ G+ DN Sbjct: 222 GYVSN--GRIDEGRELFERSPVKDVVLWTAMMNGYVQFNRFDEALELFRCMQTEGVKLDN 279 Query: 368 YTYPFVFKAVVGLRTVKDGETIHGFVLKSGNSFDCYIFNSIMDMYSELGCIDSSVKVFDE 547 + + K ++ G+ IHG++ ++ + D + +++DMY++ GCI+++++VF E Sbjct: 280 FVLVSLLKGCGQTGALEQGKWIHGYIYENRVAVDKVVGTALVDMYAKCGCIETALEVFYE 339 Query: 548 LPERDSVSWNILISGFLRCNRLDDAVGVFMNMRREGKSKPDEATVVSTLSACSALKNLEL 727 ERD+ SW LI G A+ ++ M + D+ T V+ L+AC+ + Sbjct: 340 TKERDTASWTSLIYGLAMNGMSRRALELYYEMENV-DVRLDDITFVAVLTACNHGGFVAE 398 Query: 728 GKEIHEYVSTE--LQFTVRIGNALLDMYSKCGCLIIARQIFDTMPQKN----VICWTSMV 889 G+ I ++ E +Q + L+D+ + G L A ++ D + +N V + S++ Sbjct: 399 GRRIFHSMTKEHKIQPKSEHYSCLIDLLCRAGLLDEAEELIDKIQNENDETLVPVYCSLL 458 Query: 890 S---GYVSCGLLDKARDLFDRSPVRDLVLWTAMINGYVQFNNVDEAIALFRKMQ 1042 S Y + L ++ + + V D T + + Y N ++ + R+M+ Sbjct: 459 SAARNYGNVKLAERVAEKLEIVEVSDSSAHTLLASVYASANRWEDVAIVRREMK 512