BLASTX nr result
ID: Mentha24_contig00003066
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00003066
         (1173 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|EYU29595.1| hypothetical protein MIMGU_mgv1a025435mg [Mimulus...   539   e-150
ref|XP_007048864.1| Pentatricopeptide repeat (PPR-like) superfam...   496   e-138
ref|XP_004305832.1| PREDICTED: pentatricopeptide repeat-containi...   495   e-137
gb|EXB44509.1| hypothetical protein L484_000760 [Morus notabilis...   490   e-136
ref|XP_002279693.1| PREDICTED: pentatricopeptide repeat-containi...   489   e-136
ref|XP_006356395.1| PREDICTED: pentatricopeptide repeat-containi...   482   e-133
ref|XP_004250888.1| PREDICTED: pentatricopeptide repeat-containi...   478   e-132
ref|XP_006437177.1| hypothetical protein CICLE_v10031197mg [Citr...   474   e-131
ref|XP_006484869.1| PREDICTED: pentatricopeptide repeat-containi...   474   e-131
ref|XP_002534070.1| pentatricopeptide repeat-containing protein,...   466   e-129
ref|XP_003545143.1| PREDICTED: pentatricopeptide repeat-containi...   463   e-128
ref|XP_007141857.1| hypothetical protein PHAVU_008G231600g [Phas...   458   e-126
ref|XP_002306741.1| pentatricopeptide repeat-containing family p...   449   e-123
ref|XP_004491336.1| PREDICTED: pentatricopeptide repeat-containi...   447   e-123
ref|XP_004151347.1| PREDICTED: pentatricopeptide repeat-containi...   443   e-122
ref|XP_003617444.1| Pentatricopeptide repeat-containing protein ...   432   e-118
ref|XP_006411565.1| hypothetical protein EUTSA_v10017572mg [Eutr...   431   e-118
ref|NP_181820.1| pentatricopeptide repeat-containing protein [Ar...
427 e-117 ref|XP_002880012.1| pentatricopeptide repeat-containing protein ... 426 e-116 ref|XP_006293939.1| hypothetical protein CARUB_v10022931mg [Caps... 414 e-113 >gb|EYU29595.1| hypothetical protein MIMGU_mgv1a025435mg [Mimulus guttatus] Length = 505 Score = 539 bits (1388), Expect = e-150 Identities = 262/365 (71%), Positives = 309/365 (84%), Gaps = 5/365 (1%) Frame = +2 Query: 89 ISDQPYLSMLETSCRTITDLKKIHAHLIKTGLASDTIAISRVLSFCATSAAA-DLHYALS 265 I+DQP+LS+LET+C TI DL KIHA LIKTGLA DTIA+SR+L+FCA A DL YA S Sbjct: 2 IADQPFLSLLETNCHTIKDLTKIHAQLIKTGLAKDTIAVSRILAFCAAPGPARDLDYAFS 61 Query: 266 LFRRIDRPNLFTWNTIIRAFSHSSDPNVAISLFLEMLTTSQVLPGKLTYPSVFKAYTQLG 445 +F I++PNLFTWNTIIR F SS P+VAISLF++MLT S + P LTYPSVFKAYTQLG Sbjct: 62 VFSHIEKPNLFTWNTIIRGFCQSSHPHVAISLFVDMLTNSTLEPENLTYPSVFKAYTQLG 121 Query: 446 LAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDENRDLDVVAWNSMIM 625 LA DGAQLHGRI+KLG E DPFIRNSIIHMYA CGL G+A LFDE+ D DVVAWNSM+M Sbjct: 122 LAGDGAQLHGRIIKLGFEHDPFIRNSIIHMYADCGLFGSARKLFDEDEDTDVVAWNSMVM 181 Query: 626 GFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLFHEMQEKQIRPTHFT 805 G AKCGE++ESWRLFCKIP RN+ISWNTMISGYVRNG+W++AL+LF EMQ++QIRP+ FT Sbjct: 182 GLAKCGEVDESWRLFCKIPCRNDISWNTMISGYVRNGKWVDALSLFAEMQQRQIRPSEFT 241 Query: 806 LVSILNACGKLGALEQGKWIHDYIKR---NDFELNVIVITAIIDMYCKCGEIGMAREVFK 976 LVS+LNAC KLGALEQGKWIH YIK+ N+ + N IV+TAIIDMYCKCG+I AREVF+ Sbjct: 242 LVSMLNACAKLGALEQGKWIHRYIKKSDINNIDRNTIVVTAIIDMYCKCGDIKTAREVFE 301 Query: 977 TSPRKGLACWNSMMLGLANNGHYEQVFELFSKLE-SSNLRPDAVSFVAVLTASNHSVRVD 1153 ++P+K L+ WNSM+LGLA NG E+ F+LF++LE SSNL PD+VSF+ VLTASNHSVRVD Sbjct: 302 STPQKALSGWNSMILGLATNGFEEEAFQLFTELEQSSNLNPDSVSFIGVLTASNHSVRVD 361 Query: 1154 DARKY 1168 AR+Y Sbjct: 362 KAREY 366 >ref|XP_007048864.1| Pentatricopeptide repeat (PPR-like) superfamily protein [Theobroma cacao] gi|508701125|gb|EOX93021.1| Pentatricopeptide repeat (PPR-like) superfamily protein [Theobroma cacao] Length = 538 Score = 496 bits (1277), Expect = e-138 Identities = 242/379 (63%), Positives = 298/379 (78%), 
Gaps = 1/379 (0%) Frame = +2 Query: 35 MPPCFCSFNPPSTSISKFISDQPYLSMLETSCRTITDLKKIHAHLIKTGLASDTIAISRV 214 M CFCS P SI+KFISDQPYLS+LE +C ++ DLKK+HA LIKTGL +D IA SRV Sbjct: 1 MVQCFCSLTPSPASITKFISDQPYLSLLENNCTSMKDLKKLHAQLIKTGLVNDIIAASRV 60 Query: 215 LSFCATSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPNVAISLFLEMLTTSQVL 394 L+FC S A D++YA +F +I PNLFTWNTIIR FS SS+P +AISLF++ML S + Sbjct: 61 LAFCV-SPAGDMNYAYLVFTQIKNPNLFTWNTIIRGFSQSSNPQIAISLFIDMLVGSSIQ 119 Query: 395 PGKLTYPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNL 574 P +LTYPSVFKAY QLGLA DG QLHGR++KLGL+ D FIRN+II+MYA+CGLL A + Sbjct: 120 PERLTYPSVFKAYAQLGLACDGRQLHGRVIKLGLDYDQFIRNTIIYMYANCGLLSEAWRM 179 Query: 575 FDENR-DLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEA 751 FDE +LD+VAWNSMI+G AKCGE++ES RLF K+ RN +SWN+MISGYVRNGR++EA Sbjct: 180 FDEEHMELDIVAWNSMIIGLAKCGEVDESRRLFNKMVSRNTVSWNSMISGYVRNGRFLEA 239 Query: 752 LNLFHEMQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDM 931 L LF EMQE+ IRP+ FT+VS+LNAC LGA+ QGKWIHDYI + +FELN IV+TAIIDM Sbjct: 240 LELFQEMQEEHIRPSEFTMVSLLNACACLGAITQGKWIHDYILKQNFELNGIVVTAIIDM 299 Query: 932 YCKCGEIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSF 1111 YCKCG A +VF TSP++GL+CWNSM+LGLA NG + +LFSKLES +L+PD V+F Sbjct: 300 YCKCGNAEKALQVFTTSPKEGLSCWNSMILGLATNGCENEARQLFSKLESLSLKPDHVTF 359 Query: 1112 VAVLTASNHSVRVDDARKY 1168 + VL A N + VD A+ Y Sbjct: 360 IGVLMACNSAGMVDKAKYY 378 >ref|XP_004305832.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920, chloroplastic-like [Fragaria vesca subsp. 
vesca] Length = 550 Score = 495 bits (1274), Expect = e-137 Identities = 243/378 (64%), Positives = 302/378 (79%) Frame = +2 Query: 35 MPPCFCSFNPPSTSISKFISDQPYLSMLETSCRTITDLKKIHAHLIKTGLASDTIAISRV 214 M PC CSF STSISKFISD+P+L MLE C + DL+KIHAHLIKTGLA+DT+A SRV Sbjct: 1 MTPCCCSFTS-STSISKFISDKPHLFMLENQCTNMKDLQKIHAHLIKTGLANDTVAASRV 59 Query: 215 LSFCATSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPNVAISLFLEMLTTSQVL 394 L+FCA S A D++YA +FR I PNLF WNTIIR FS+SS+P AISLF++ML TS V Sbjct: 60 LAFCA-SPAGDINYAYMVFRHIHNPNLFIWNTIIRGFSNSSNPEAAISLFIDMLVTSTVQ 118 Query: 395 PGKLTYPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNL 574 P +LTYPSVFKAY QLGLA DGAQLHGR+VKLGLESD F+RN+IIHMY++CGLL A + Sbjct: 119 PQRLTYPSVFKAYAQLGLAHDGAQLHGRVVKLGLESDQFVRNTIIHMYSNCGLLSEARRV 178 Query: 575 FDENRDLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEAL 754 FDE+ + D+VAWNSMIMG +KCGE+ ES RLF K+P RN ISWN+MI G VRNG + EAL Sbjct: 179 FDEDLEFDIVAWNSMIMGLSKCGEVGESRRLFDKMPQRNSISWNSMIGGSVRNGMYTEAL 238 Query: 755 NLFHEMQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMY 934 +LF EMQ+++I+P+ FT+VS+LNA +LGA+ QG+WIH+YI++N +LN IV+TAII+MY Sbjct: 239 DLFGEMQKQKIKPSEFTMVSLLNASAQLGAIRQGEWIHEYIRKNHIQLNPIVVTAIINMY 298 Query: 935 CKCGEIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFV 1114 KCG I A VF+ +PR GL+CWNS+++GLA NG E+ ELFS+L+SS+ PD VSF+ Sbjct: 299 SKCGSIEKAVHVFEAAPRTGLSCWNSIIMGLATNGCEEEAIELFSRLKSSSFVPDDVSFL 358 Query: 1115 AVLTASNHSVRVDDARKY 1168 VLTA +HS V+ ARKY Sbjct: 359 GVLTACSHSGMVEKARKY 376 >gb|EXB44509.1| hypothetical protein L484_000760 [Morus notabilis] gi|587904202|gb|EXB92403.1| hypothetical protein L484_021387 [Morus notabilis] Length = 530 Score = 490 bits (1262), Expect = e-136 Identities = 245/383 (63%), Positives = 301/383 (78%), Gaps = 5/383 (1%) Frame = +2 Query: 35 MPPCFCSF----NPPSTSISKFISDQPYLSMLETSCRTITDLKKIHAHLIKTGLASDTIA 202 M P CS +P TSI+KFISDQP+LSMLE C T++DL+KIHAHLIKTGL S TIA Sbjct: 1 MTPFCCSQTFTPSPSPTSIAKFISDQPHLSMLEKRCATMSDLRKIHAHLIKTGLISHTIA 60 Query: 203 
ISRVLSFCATSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPNVAISLFLEMLTT 382 SR+L+FCA S A +++YAL +F +I PNLF WNTIIR FS SS P AI LF++ML Sbjct: 61 SSRLLAFCA-SPAGNINYALMVFSQIQNPNLFIWNTIIRGFSRSSTPQTAIFLFIDMLVG 119 Query: 383 SQVLPGKLTYPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGN 562 S + P +LTYPSVFKAY QLGLA GAQLHGR++KLGL+ D F+RN+IIHMY +CG L Sbjct: 120 SPLEPQRLTYPSVFKAYAQLGLACFGAQLHGRVIKLGLDCDRFVRNTIIHMYINCGFLSE 179 Query: 563 AGNLFDENRDLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRW 742 A LFDE+ +LD+VAWNSMIMG +KCGE+ ES RLF ++PLRN +SWN+MISGYVRNG+ Sbjct: 180 ARQLFDESSELDLVAWNSMIMGLSKCGEVGESRRLFDRMPLRNSVSWNSMISGYVRNGKC 239 Query: 743 IEALNLFHEMQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAI 922 +EAL LF +MQ + I+ + FT+VS+LNA G+LGA+ QG+WIH+YI +N ELNVIV+TAI Sbjct: 240 VEALELFGKMQGEGIKASEFTMVSLLNASGRLGAIRQGEWIHEYITKNGIELNVIVVTAI 299 Query: 923 IDMYCKCGEIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESS-NLRPD 1099 IDMYCKCG + A VFKT+P+ GL+CWNSM++GLA NG E+ ELFS+LESS +LRPD Sbjct: 300 IDMYCKCGSVNKALSVFKTAPKLGLSCWNSMVMGLAMNGCEEEALELFSRLESSIDLRPD 359 Query: 1100 AVSFVAVLTASNHSVRVDDARKY 1168 VSF+AVLTA NHS VD AR Y Sbjct: 360 GVSFLAVLTACNHSGMVDKARDY 382 >ref|XP_002279693.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920, chloroplastic [Vitis vinifera] gi|302143555|emb|CBI22116.3| unnamed protein product [Vitis vinifera] Length = 533 Score = 489 bits (1259), Expect = e-136 Identities = 238/375 (63%), Positives = 298/375 (79%) Frame = +2 Query: 44 CFCSFNPPSTSISKFISDQPYLSMLETSCRTITDLKKIHAHLIKTGLASDTIAISRVLSF 223 C + PSTSISKFISD P+LS+LE C T+ DL+KIHAHL+KTGLA +A+S VL+F Sbjct: 6 CSLTSPSPSTSISKFISDHPHLSILEKHCTTMKDLQKIHAHLLKTGLAKHPLAVSPVLAF 65 Query: 224 CATSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPNVAISLFLEMLTTSQVLPGK 403 CATS D++YA +F +I PNLF+WNTIIR FS SS P+ AISLF++ML S V P + Sbjct: 66 CATSPGGDINYAYLVFTQIHSPNLFSWNTIIRGFSQSSTPHHAISLFIDMLIVSSVQPHR 125 Query: 404 LTYPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDE 583 LTYPSVFKAY QLGLA GAQLHGR++KLGL+ DPFIRN+II+MYA+CG L F E Sbjct: 126 
LTYPSVFKAYAQLGLAHYGAQLHGRVIKLGLQFDPFIRNTIIYMYANCGFLSEMWKAFYE 185 Query: 584 NRDLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLF 763 D D+VAWNSMIMG AKCGE++ES +LF ++PLRN +SWN+MISGYVRNGR EAL+LF Sbjct: 186 RMDFDIVAWNSMIMGLAKCGEVDESRKLFDEMPLRNTVSWNSMISGYVRNGRLREALDLF 245 Query: 764 HEMQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKC 943 +MQE++I+P+ FT+VS+LNA +LGAL+QG+WIHDYI++N+FELNVIV +IIDMYCKC Sbjct: 246 GQMQEERIKPSEFTMVSLLNASARLGALKQGEWIHDYIRKNNFELNVIVTASIIDMYCKC 305 Query: 944 GEIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVL 1123 G IG A +VF+ +P KGL+ WN+M+LGLA NG + +LFS+LE SNLRPD V+FV VL Sbjct: 306 GSIGEAFQVFEMAPLKGLSSWNTMILGLAMNGCENEAIQLFSRLECSNLRPDDVTFVGVL 365 Query: 1124 TASNHSVRVDDARKY 1168 TA N+S VD A++Y Sbjct: 366 TACNYSGLVDKAKEY 380 >ref|XP_006356395.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920, chloroplastic-like [Solanum tuberosum] Length = 522 Score = 482 bits (1241), Expect = e-133 Identities = 240/368 (65%), Positives = 287/368 (77%), Gaps = 1/368 (0%) Frame = +2 Query: 68 STSISKFISDQPYLSMLETSCRTITDLKKIHAHLIKTGLASDTIAISRVLSFCATSAA-A 244 STSISKFISDQPYL MLET C T+TDLKKIHAHLIK+GL D IA SRVL+F A S Sbjct: 8 STSISKFISDQPYLHMLETKCTTMTDLKKIHAHLIKSGLIKDKIASSRVLAFSAKSPPIG 67 Query: 245 DLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPNVAISLFLEMLTTSQVLPGKLTYPSVF 424 D++YA +F I+ PNLFTWNTIIR FS SS P AI LF+EML SQV P LTYPSVF Sbjct: 68 DINYANLVFTHIENPNLFTWNTIIRGFSESSTPQYAIHLFIEMLNNSQVQPHLLTYPSVF 127 Query: 425 KAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDENRDLDVV 604 KAY + GL +GAQLHGRI+KLGLE D FIRN++++MYASCG L A LFDE+ DVV Sbjct: 128 KAYARGGLVKNGAQLHGRIIKLGLEFDTFIRNTMLYMYASCGFLVEARKLFDEDEIEDVV 187 Query: 605 AWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLFHEMQEKQ 784 +WNSMIMG AK GEI++SWRLF K+ RN++SWN+MISG+VRNG+W EAL LF MQE+ Sbjct: 188 SWNSMIMGLAKSGEIDDSWRLFSKMSTRNDVSWNSMISGFVRNGKWNEALELFSTMQEEN 247 Query: 785 IRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKCGEIGMAR 964 I+P+ FTLVS+LNACG LGALEQG WI+ Y+K+N+ ELNVIV+TAIIDMYCKCG + 
MA Sbjct: 248 IKPSEFTLVSLLNACGHLGALEQGNWIYKYVKKNNVELNVIVVTAIIDMYCKCGNVEMAW 307 Query: 965 EVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVLTASNHSV 1144 VF + KGL+ WNSM+LGLA NG + +LF++L+ S L+PD+VSF+ VLTA NHS Sbjct: 308 HVFISISNKGLSSWNSMILGLATNGFEDDAIKLFARLQCSILKPDSVSFIGVLTACNHSG 367 Query: 1145 RVDDARKY 1168 VD A+ Y Sbjct: 368 LVDKAKDY 375 Score = 94.7 bits (234), Expect = 7e-17 Identities = 72/308 (23%), Positives = 145/308 (47%), Gaps = 9/308 (2%) Frame = +2 Query: 152 KIHAHLIKTGLASDTIAISRVLSFCATSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSH 331 ++H +IK GL DT + +L A+ L A LF + ++ +WN++I + Sbjct: 141 QLHGRIIKLGLEFDTFIRNTMLYMYASCGF--LVEARKLFDEDEIEDVVSWNSMIMGLAK 198 Query: 332 SSDPNVAISLFLEMLTTSQVLPGKLTYPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPF 511 S + + + LF +M T + V ++ S+ + + G + +L + + ++ F Sbjct: 199 SGEIDDSWRLFSKMSTRNDV-----SWNSMISGFVRNGKWNEALELFSTMQEENIKPSEF 253 Query: 512 IRNSIIHMYASCGLLG--NAGNLF-----DENRDLDVVAWNSMIMGFAKCGEIEESWRLF 670 +++ + +CG LG GN N +L+V+ ++I + KCG +E +W +F Sbjct: 254 ---TLVSLLNACGHLGALEQGNWIYKYVKKNNVELNVIVVTAIIDMYCKCGNVEMAWHVF 310 Query: 671 CKIPLRNEISWNTMISGYVRNGRWIEALNLFHEMQEKQIRPTHFTLVSILNACGKLGALE 850 I + SWN+MI G NG +A+ LF +Q ++P + + +L AC G ++ Sbjct: 311 ISISNKGLSSWNSMILGLATNGFEDDAIKLFARLQCSILKPDSVSFIGVLTACNHSGLVD 370 Query: 851 QGKWIHDYIKRN-DFELNVIVITAIIDMYCKCGEIGMAREVFKTSPRK-GLACWNSMMLG 1024 + K +K+ E ++ ++D+ + G + A EV ++ + W S++ Sbjct: 371 KAKDYFQLMKKEYGIEPSIKHYGCMVDILGRAGLVEEADEVIRSMKMEPDAVIWCSLLSA 430 Query: 1025 LANNGHYE 1048 ++G+ E Sbjct: 431 CRSHGNME 438 >ref|XP_004250888.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920, chloroplastic-like [Solanum lycopersicum] Length = 522 Score = 478 bits (1230), Expect = e-132 Identities = 236/368 (64%), Positives = 287/368 (77%), Gaps = 1/368 (0%) Frame = +2 Query: 68 STSISKFISDQPYLSMLETSCRTITDLKKIHAHLIKTGLASDTIAISRVLSFCATSAA-A 244 STSISKFI DQPYL MLET C T+TDLKKIHAHLIK+GL D IA SRVL+F A S Sbjct: 8 STSISKFILDQPYLHMLETKCTTMTDLKKIHAHLIKSGLIKDKIAASRVLAFSAKSPPIG 67 Query: 245 
DLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPNVAISLFLEMLTTSQVLPGKLTYPSVF 424 D++YA +F I+ PN FTWNTIIR FS SS P AI LF+EML SQV P LTYPSVF Sbjct: 68 DINYANLVFTHIENPNPFTWNTIIRGFSESSTPQYAIHLFIEMLNNSQVQPHLLTYPSVF 127 Query: 425 KAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDENRDLDVV 604 KAY + G+A +GAQLHGRI+KLGLE D FIRN++++MYASCG L A LFDE+ DVV Sbjct: 128 KAYARGGIAKNGAQLHGRIMKLGLEFDTFIRNTLLYMYASCGFLVEARKLFDEDEIEDVV 187 Query: 605 AWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLFHEMQEKQ 784 +WNSMI+G AK GEI++SWRLF K+P RN++SWN+MISG+VRNG+W EAL LF MQE+ Sbjct: 188 SWNSMIIGLAKSGEIDDSWRLFSKMPTRNDVSWNSMISGFVRNGKWNEALELFSTMQEEN 247 Query: 785 IRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKCGEIGMAR 964 ++P+ FTLVS+LNACG LGALEQG WI+ Y+K+N+ ELNVIV+TAIIDMYCKC + MA Sbjct: 248 VKPSEFTLVSLLNACGHLGALEQGNWIYKYVKKNNVELNVIVVTAIIDMYCKCANVEMAW 307 Query: 965 EVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVLTASNHSV 1144 VF +S KGL+ WNSM+LGLA NG + +LF++L+ S L+PD+VSF+ VLTA NHS Sbjct: 308 HVFVSSSNKGLSSWNSMILGLATNGFEDDAIKLFARLQCSILKPDSVSFIGVLTACNHSG 367 Query: 1145 RVDDARKY 1168 V+ A+ Y Sbjct: 368 LVEKAKDY 375 >ref|XP_006437177.1| hypothetical protein CICLE_v10031197mg [Citrus clementina] gi|557539373|gb|ESR50417.1| hypothetical protein CICLE_v10031197mg [Citrus clementina] Length = 534 Score = 474 bits (1220), Expect = e-131 Identities = 234/370 (63%), Positives = 290/370 (78%), Gaps = 1/370 (0%) Frame = +2 Query: 62 PPSTSISKFISDQPYLSMLETSCRTITDLKKIHAHLIKTGLASDTIAISRVLSFCATSAA 241 P TS+SKFISDQP LS+L+ C ++ DLKKIHAHLIKTGL D IA SR+L+FC TS A Sbjct: 8 PSPTSMSKFISDQPLLSLLDKQCTSMKDLKKIHAHLIKTGLPKDPIAASRILAFC-TSPA 66 Query: 242 ADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPNVAISLFLEMLTTSQVLPGKLTYPSV 421 D++YA +F +I +PNLF WNTIIR FS SS P AI LF++ML TS + P +LTYPS+ Sbjct: 67 GDINYAYLVFTQIKKPNLFIWNTIIRGFSQSSTPRNAILLFIDMLVTSPIQPQRLTYPSL 126 Query: 422 FKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDE-NRDLD 598 FKAY QLGLA DGAQLHGR+VK GLE D FI N+II+MYA+CG L A +FDE + + D Sbjct: 127 
FKAYAQLGLARDGAQLHGRVVKQGLEFDQFIHNTIIYMYANCGFLSEARLIFDEVDTEFD 186 Query: 599 VVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLFHEMQE 778 VVAWNSMI+G AKCGEI+ES RLF K+ RN +SWN+MISGYVRN ++ EAL LF EMQE Sbjct: 187 VVAWNSMIIGLAKCGEIDESRRLFDKMVSRNTVSWNSMISGYVRNVKFKEALELFREMQE 246 Query: 779 KQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKCGEIGM 958 + I+P+ FT+VS+LNAC KLGA+ QG+WIH+++ N FELN IV+TAIIDMYCKCG Sbjct: 247 QNIKPSEFTMVSLLNACAKLGAIRQGEWIHNFLVTNCFELNTIVVTAIIDMYCKCGCPER 306 Query: 959 AREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVLTASNH 1138 A +VF T P+KGL+CWNSM+ GLA NG+ + +LFS L+SSNL+PD +SF+AVLTA NH Sbjct: 307 ALQVFNTVPKKGLSCWNSMVFGLAMNGYENEAIKLFSGLQSSNLKPDYISFIAVLTACNH 366 Query: 1139 SVRVDDARKY 1168 S +V+ A+ Y Sbjct: 367 SGKVNQAKDY 376 >ref|XP_006484869.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920, chloroplastic-like [Citrus sinensis] Length = 534 Score = 474 bits (1219), Expect = e-131 Identities = 235/370 (63%), Positives = 289/370 (78%), Gaps = 1/370 (0%) Frame = +2 Query: 62 PPSTSISKFISDQPYLSMLETSCRTITDLKKIHAHLIKTGLASDTIAISRVLSFCATSAA 241 P TS+SKFISDQP LS+L+ C ++ DLKKIHAHLIKTGLA D IA SR+L+FC TS A Sbjct: 8 PSPTSMSKFISDQPLLSLLDKQCTSMKDLKKIHAHLIKTGLAKDPIAASRILTFC-TSPA 66 Query: 242 ADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPNVAISLFLEMLTTSQVLPGKLTYPSV 421 D++YA +F +I +PNLF WNTIIR FS SS P AI LF++ML TS + P +LTYPS+ Sbjct: 67 GDINYAYLVFTQIKKPNLFIWNTIIRGFSQSSTPRNAILLFIDMLVTSPIQPQRLTYPSL 126 Query: 422 FKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDE-NRDLD 598 FKAY QLGLA DGAQLHGR+VK GLE D FI N+II+MYA+CG L A +FDE + + D Sbjct: 127 FKAYAQLGLARDGAQLHGRVVKQGLEFDQFIHNTIIYMYANCGFLSEARLMFDEVDTEFD 186 Query: 599 VVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLFHEMQE 778 VVAWNSMI+G AKCGEI+ES RLF K+ RN +SWN+MISGYVRN ++ EAL LF EMQE Sbjct: 187 VVAWNSMIIGLAKCGEIDESRRLFDKMVSRNTVSWNSMISGYVRNVKFKEALELFREMQE 246 Query: 779 KQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKCGEIGM 958 + I+P+ FT+VS+LNAC KLGA+ QG+WIH+++ N FELN IV+TAIIDMYCKCG 
Sbjct: 247 QNIKPSEFTMVSLLNACAKLGAIRQGEWIHNFLVTNCFELNTIVVTAIIDMYCKCGCPER 306 Query: 959 AREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVLTASNH 1138 A +VF T P+KGL+CWNSM+ GLA NG+ + +LFS L+SSNL PD SF+AVLTA NH Sbjct: 307 ALQVFNTVPKKGLSCWNSMVFGLAMNGYENEAIKLFSGLQSSNLTPDYTSFIAVLTACNH 366 Query: 1139 SVRVDDARKY 1168 S +V+ A+ Y Sbjct: 367 SGKVNQAKDY 376 >ref|XP_002534070.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223525897|gb|EEF28314.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 533 Score = 466 bits (1199), Expect = e-129 Identities = 228/375 (60%), Positives = 290/375 (77%), Gaps = 1/375 (0%) Frame = +2 Query: 47 FCSFNPPS-TSISKFISDQPYLSMLETSCRTITDLKKIHAHLIKTGLASDTIAISRVLSF 223 FC PPS +ISK ISDQ YLSML+ +C T+ DLKKIH+ LIKTGLA DT A SR+L+F Sbjct: 4 FCCLLPPSPATISKLISDQTYLSMLDKNCTTMKDLKKIHSQLIKTGLAKDTNAASRILAF 63 Query: 224 CATSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPNVAISLFLEMLTTSQVLPGK 403 CA S A D++YA +F +I PN+F WNTIIR FS SS P +ISL+++ML TS V P + Sbjct: 64 CA-SPAGDINYAYLVFVQIQNPNIFAWNTIIRGFSRSSVPQNSISLYIDMLLTSPVQPQR 122 Query: 404 LTYPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDE 583 LTYPSVFKA+ QL LA +GAQLHG+++KLGLE+D FIRN+I+ MY +CG A +FD Sbjct: 123 LTYPSVFKAFAQLDLASEGAQLHGKMIKLGLENDSFIRNTILFMYVNCGFTSEARKVFDR 182 Query: 584 NRDLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLF 763 D D+VAWN+MIMG AKCG ++ES RLF K+ LRN +SWN+MISGYVRNGR+ +AL LF Sbjct: 183 GMDFDIVAWNTMIMGVAKCGLVDESRRLFDKMSLRNAVSWNSMISGYVRNGRFFDALELF 242 Query: 764 HEMQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKC 943 +MQ ++I P+ FT+VS+LNAC LGA+ QG+WIHDY+ + FELN IV+TAIIDMY KC Sbjct: 243 QKMQVERIEPSEFTMVSLLNACACLGAIRQGEWIHDYMVKKKFELNPIVVTAIIDMYSKC 302 Query: 944 GEIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVL 1123 G I A +VF+++PR+GL+CWNSM+LGLA NG + +LFS L+SS+LRPD VSF+AVL Sbjct: 303 GSIDKAVQVFQSAPRRGLSCWNSMILGLAMNGQENEALQLFSVLQSSDLRPDDVSFIAVL 362 Query: 1124 TASNHSVRVDDARKY 1168 TA +H+ VD A+ Y Sbjct: 363 
TACDHTGMVDKAKDY 377 >ref|XP_003545143.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920, chloroplastic-like [Glycine max] Length = 534 Score = 463 bits (1192), Expect = e-128 Identities = 233/377 (61%), Positives = 293/377 (77%), Gaps = 4/377 (1%) Frame = +2 Query: 50 CSFNPPSTS----ISKFISDQPYLSMLETSCRTITDLKKIHAHLIKTGLASDTIAISRVL 217 CS PPS+S I+KFISDQP L+ML+T C + DL+KIHAH+IKTGLA T+A SRVL Sbjct: 5 CSALPPSSSSSPSIAKFISDQPCLTMLQTQCTNMKDLQKIHAHIIKTGLAHHTVAASRVL 64 Query: 218 SFCATSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPNVAISLFLEMLTTSQVLP 397 +FCA+S+ D++YA LF I PNL+ WNTIIR FS SS P++AISLF++ML +S VLP Sbjct: 65 TFCASSSG-DINYAYLLFTTIPSPNLYCWNTIIRGFSRSSTPHLAISLFVDMLCSS-VLP 122 Query: 398 GKLTYPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLF 577 +LTYPSVFKAY QLG DGAQLHGR+VKLGLE D FI+N+II+MYA+ GLL A +F Sbjct: 123 QRLTYPSVFKAYAQLGAGYDGAQLHGRVVKLGLEKDQFIQNTIIYMYANSGLLSEARRVF 182 Query: 578 DENRDLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALN 757 DE DLDVVA NSMIMG AKCGE+++S RLF +P R ++WN+MISGYVRN R +EAL Sbjct: 183 DELVDLDVVACNSMIMGLAKCGEVDKSRRLFDNMPTRTRVTWNSMISGYVRNKRLMEALE 242 Query: 758 LFHEMQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYC 937 LF +MQ +++ P+ FT+VS+L+AC LGAL+ G+W+HDY+KR FELNVIV+TAIIDMYC Sbjct: 243 LFRKMQGERVEPSEFTMVSLLSACAHLGALKHGEWVHDYVKRGHFELNVIVLTAIIDMYC 302 Query: 938 KCGEIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVA 1117 KCG I A EVF+ SP +GL+CWNS+++GLA NG+ + E FSKLE+S+L+PD VSF+ Sbjct: 303 KCGVIVKAIEVFEASPTRGLSCWNSIIIGLALNGYERKAIEYFSKLEASDLKPDHVSFIG 362 Query: 1118 VLTASNHSVRVDDARKY 1168 VLTA + V AR Y Sbjct: 363 VLTACKYIGAVGKARDY 379 >ref|XP_007141857.1| hypothetical protein PHAVU_008G231600g [Phaseolus vulgaris] gi|561014990|gb|ESW13851.1| hypothetical protein PHAVU_008G231600g [Phaseolus vulgaris] Length = 525 Score = 458 bits (1179), Expect = e-126 Identities = 230/378 (60%), Positives = 284/378 (75%), Gaps = 2/378 (0%) Frame = +2 Query: 41 PCFCSFNPPSTS--ISKFISDQPYLSMLETSCRTITDLKKIHAHLIKTGLASDTIAISRV 214 P CS PPS+S 
I+ FISD P L+ML+ C + DL+KIH H+IKTGLA D IA SRV Sbjct: 2 PILCSALPPSSSPSIANFISDHPCLTMLQNQCTNMKDLQKIHPHIIKTGLALDHIAASRV 61 Query: 215 LSFCATSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPNVAISLFLEMLTTSQVL 394 L+FCA+S+ D++YA +F I PNL+ WNTIIR FS SS P AISLF++ML S V Sbjct: 62 LTFCASSSG-DINYAYLVFTGIPNPNLYCWNTIIRGFSRSSTPQFAISLFVDMLY-SAVE 119 Query: 395 PGKLTYPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNL 574 P +LTYPSVFKAY QLG DGAQLHGR+VKLGLE D FI N+I++MYA+ GL+ A + Sbjct: 120 PQRLTYPSVFKAYAQLGAGHDGAQLHGRVVKLGLEKDQFISNTILYMYANSGLMSEARRV 179 Query: 575 FDENRDLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEAL 754 FDE +LDVVA NSMIMG AKCGE+++S RLF +P R +SWN+MISGYVRNGR E L Sbjct: 180 FDEPLELDVVACNSMIMGLAKCGEVDKSRRLFDNMPTRTAVSWNSMISGYVRNGRLTEGL 239 Query: 755 NLFHEMQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMY 934 LF +MQE+ + P+ FT+VS+L+AC LGAL+ G+W+HDYIKR +F+LNVIV+TAIIDMY Sbjct: 240 ELFRKMQEEGVEPSEFTMVSLLSACAHLGALQHGEWVHDYIKRGNFKLNVIVLTAIIDMY 299 Query: 935 CKCGEIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFV 1114 CKCG I A EVF SP +GL CWNS+++GLA NGH + E FSKLESSN++PD VSF+ Sbjct: 300 CKCGSIEKAVEVFAASPTRGLPCWNSIIIGLALNGHEREAIEYFSKLESSNIKPDCVSFI 359 Query: 1115 AVLTASNHSVRVDDARKY 1168 VLTA + V +AR Y Sbjct: 360 GVLTACKYLGAVREARDY 377 >ref|XP_002306741.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|222856190|gb|EEE93737.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 509 Score = 449 bits (1155), Expect = e-123 Identities = 222/353 (62%), Positives = 277/353 (78%), Gaps = 1/353 (0%) Frame = +2 Query: 113 MLETSCRTITDLKKIHAHLIKTGLASDTIAISRVLSFCATSAAADLHYALSLFRRIDRPN 292 ML+ +C ++ DL+KIHA LIKTGLA DTIA SRVL+FC TS A D++YA +F +I PN Sbjct: 1 MLDKNCTSMKDLQKIHAQLIKTGLAKDTIAASRVLAFC-TSPAGDINYAYLVFTQIRNPN 59 Query: 293 LFTWNTIIRAFSHSSDPNVAISLFLEMLTTSQVL-PGKLTYPSVFKAYTQLGLAMDGAQL 469 LF WNTIIR FS SS P+ AISLF++M+ TS P +LTYPSVFKAY QLGLA +GAQL Sbjct: 60 LFVWNTIIRGFSQSSTPHNAISLFIDMMFTSPTTQPQRLTYPSVFKAYAQLGLAHEGAQL 119 
Query: 470 HGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDENRDLDVVAWNSMIMGFAKCGEI 649 HGR++KLGLE+D FI+N+I++MY +CG LG A +FD DVV WN+MI+G AKCGEI Sbjct: 120 HGRVIKLGLENDQFIQNTILNMYVNCGFLGEAQRIFDGATGFDVVTWNTMIIGLAKCGEI 179 Query: 650 EESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLFHEMQEKQIRPTHFTLVSILNAC 829 ++S RLF K+ LRN +SWN+MISGYVR GR+ EA+ LF MQE+ I+P+ FT+VS+LNAC Sbjct: 180 DKSRRLFDKMLLRNTVSWNSMISGYVRKGRFFEAMELFSRMQEEGIKPSEFTMVSLLNAC 239 Query: 830 GKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKCGEIGMAREVFKTSPRKGLACWN 1009 LGAL QG+WIHDYI +N+F LN IVITAIIDMY KCG I A +VFK++P+KGL+CWN Sbjct: 240 ACLGALRQGEWIHDYIVKNNFALNSIVITAIIDMYSKCGSIDKALQVFKSAPKKGLSCWN 299 Query: 1010 SMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVLTASNHSVRVDDARKY 1168 S++LGLA +G + LFSKLESSNL+PD VSF+ VLTA NH+ VD A+ Y Sbjct: 300 SLILGLAMSGRGNEAVRLFSKLESSNLKPDHVSFIGVLTACNHAGMVDRAKDY 352 >ref|XP_004491336.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920, chloroplastic-like [Cicer arietinum] Length = 536 Score = 447 bits (1151), Expect = e-123 Identities = 226/382 (59%), Positives = 287/382 (75%), Gaps = 4/382 (1%) Frame = +2 Query: 35 MPPC--FCSFNPPSTSISKFISDQPYLSMLETSCRTITDLKKIHAHLIKTGLASDTIAIS 208 + PC F PP SISKFISDQP L+ML+ C T+ I+ H+IKTGL + IA + Sbjct: 2 LTPCSLFSQSPPPPPSISKFISDQPCLTMLQNHCTTLKHFHMIYPHIIKTGLTHNPIAST 61 Query: 209 RVLSFCATSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPNVAISLFLEMLTTSQ 388 RVL+FCA S + +++YA LF R+ PNL++WNTIIRAFS SS P AISLF++ML SQ Sbjct: 62 RVLTFCA-SPSGNINYAYKLFARMPNPNLYSWNTIIRAFSRSSTPQFAISLFVDMLY-SQ 119 Query: 389 VLPGKLTYPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAG 568 + P LTYPSVFKAY QL G+QLHG +VKLGL+ D FI N+II+MYA+ GLL A Sbjct: 120 IQPQHLTYPSVFKAYAQLSAGDYGSQLHGMVVKLGLQRDQFIHNTIIYMYANSGLLSEAK 179 Query: 569 NLFDENRDL-DVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWI 745 +FDE +L DVVA+NSMIMGFAKCGEI+E+ +LF ++ R ++WN+MISGYVRNG+ + Sbjct: 180 RVFDEKLELGDVVAFNSMIMGFAKCGEIDEARKLFDEMFTRTSVTWNSMISGYVRNGKLM 239 Query: 746 EALNLFHEMQ-EKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAI 922 EAL LFH+MQ E+++ P+ 
FT+VS+LNAC LGAL+ GKW+HDYIKRNDFELNVIV+TAI Sbjct: 240 EALELFHKMQLEERVEPSEFTMVSLLNACAHLGALQHGKWVHDYIKRNDFELNVIVLTAI 299 Query: 923 IDMYCKCGEIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDA 1102 IDMYCKCG + A +VF T P +GL+CWNS+++GLA NGH + FE FS+LE S +PD+ Sbjct: 300 IDMYCKCGSVENAIQVFDTYPGRGLSCWNSIIIGLAMNGHEREAFEFFSELELSKFKPDS 359 Query: 1103 VSFVAVLTASNHSVRVDDARKY 1168 VSF+ VLTA H VD A+ Y Sbjct: 360 VSFIGVLTACKHLGAVDKAKDY 381 >ref|XP_004151347.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920, chloroplastic-like [Cucumis sativus] gi|449530724|ref|XP_004172343.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920, chloroplastic-like [Cucumis sativus] Length = 543 Score = 443 bits (1140), Expect = e-122 Identities = 213/372 (57%), Positives = 289/372 (77%) Frame = +2 Query: 53 SFNPPSTSISKFISDQPYLSMLETSCRTITDLKKIHAHLIKTGLASDTIAISRVLSFCAT 232 S +P STS IS+QPYLSM++ C T+ DL++ HAHLIK+G A ++ A SR+L+FCA Sbjct: 10 SLSPISTS-KLIISNQPYLSMVDKYCTTMRDLQQFHAHLIKSGQAIESFAASRILAFCA- 67 Query: 233 SAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPNVAISLFLEMLTTSQVLPGKLTY 412 S ++ YA +F ++ PNLF+WNT+IR FS SS+P +A+ LF++ML +SQV P +LTY Sbjct: 68 SPLGNMDYAYLVFLQMQNPNLFSWNTVIRGFSQSSNPQIALYLFIDMLVSSQVEPQRLTY 127 Query: 413 PSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDENRD 592 PS+FKAY+QLGLA DGAQLHGRI+KLGL+ DPFIRN+I++MYA+ G L A +F++ + Sbjct: 128 PSIFKAYSQLGLAHDGAQLHGRIIKLGLQFDPFIRNTILYMYATGGFLSEARRIFNQEME 187 Query: 593 LDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLFHEM 772 DVV+WNSMI+G AKCGEI+ES +LF K+P++N ISWN+MI GYVRNG + EAL LF +M Sbjct: 188 FDVVSWNSMILGLAKCGEIDESRKLFDKMPVKNPISWNSMIGGYVRNGMFKEALKLFIKM 247 Query: 773 QEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKCGEI 952 QE++I+P+ FT+VS+LNA ++GAL QG WIH+YIK+N+ +LN IV+TAIIDMYCKCG I Sbjct: 248 QEERIQPSEFTMVSLLNASAQIGALRQGVWIHEYIKKNNLQLNAIVVTAIIDMYCKCGSI 307 Query: 953 GMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVLTAS 1132 G A +VF+ P + L+ WNSM+ GLA NG ++ +F LESS+L+PD +SF+AVLTA Sbjct: 308 
GNALQVFEKIPCRSLSSWNSMIFGLAVNGCEKEAILVFKMLESSSLKPDCISFMAVLTAC 367 Query: 1133 NHSVRVDDARKY 1168 NH VD+ ++ Sbjct: 368 NHGAMVDEGMEF 379 >ref|XP_003617444.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355518779|gb|AET00403.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 542 Score = 432 bits (1111), Expect = e-118 Identities = 224/374 (59%), Positives = 281/374 (75%), Gaps = 5/374 (1%) Frame = +2 Query: 62 PPSTSISKFISDQPYLSMLETSCRTITDLKKIHAHLIKTGLASDTIAISRVLSFCATSAA 241 PPS ISKFIS+ P L+ML+ C TI +I+ H+IKTGL + IA +R L+FCA S + Sbjct: 18 PPS--ISKFISNHPCLTMLQNHCTTINHFHQIYPHIIKTGLTLNPIASTRALTFCA-SPS 74 Query: 242 ADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPNVAISLFLEMLTTSQVLPGKLTYPSV 421 +++YA LF R+ PNL++WNTIIRAFS SS P AISLF++ML SQ+ P LTYPSV Sbjct: 75 GNINYAYKLFVRMPNPNLYSWNTIIRAFSRSSTPQFAISLFVDMLY-SQIQPQYLTYPSV 133 Query: 422 FKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDENR---- 589 FKAY QLG A GAQLHGR+VKLGL++D FI N+II+MYA+ GL+ A +FD + Sbjct: 134 FKAYAQLGHAHYGAQLHGRVVKLGLQNDQFICNTIIYMYANGGLMSEARRVFDGKKLELY 193 Query: 590 DLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLFHE 769 D DVVA NSMIMG+AKCGEI+ES LF + R +SWN+MISGYVRNG+ +EAL LF++ Sbjct: 194 DHDVVAINSMIMGYAKCGEIDESRNLFDDMITRTSVSWNSMISGYVRNGKLMEALELFNK 253 Query: 770 MQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKCGE 949 MQ + + FT+VS+LNAC LGAL+ GKW+HDYIKRN FELNVIV+TAIIDMYCKCG Sbjct: 254 MQVEGFEVSEFTMVSLLNACAHLGALQHGKWVHDYIKRNHFELNVIVVTAIIDMYCKCGS 313 Query: 950 IGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSN-LRPDAVSFVAVLT 1126 + A EVF+T PR+GL+CWNS+++GLA NGH + FE FSKLESS L+PD+VSF+ VLT Sbjct: 314 VENAVEVFETCPRRGLSCWNSIIIGLAMNGHEREAFEFFSKLESSKLLKPDSVSFIGVLT 373 Query: 1127 ASNHSVRVDDARKY 1168 A H ++ AR Y Sbjct: 374 ACKHLGAINKARDY 387 >ref|XP_006411565.1| hypothetical protein EUTSA_v10017572mg [Eutrema salsugineum] gi|557112734|gb|ESQ53018.1| hypothetical protein EUTSA_v10017572mg [Eutrema salsugineum] Length = 546 Score = 431 bits (1108), Expect = e-118 
 Identities = 207/375 (55%), Positives = 282/375 (75%)
 Frame = +2

Query: 44   CFCSFNPPSTSISKFISDQPYLSMLETSCRTITDLKKIHAHLIKTGLASDTIAISRVLSF 223
            C+ P+ S + +S ++ +++T C T+ +LK+IHA+LIKTGL SDTIA SRVL+F
Sbjct: 7    CYSGMTMPTFSSTISVSGNSHIRLIDTQCSTMRELKQIHANLIKTGLISDTIAASRVLAF 66

Query: 224  CATSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPNVAISLFLEMLTTSQVLPGK 403
            C TS + D+ YA LF RI+ N F WNTIIR FS SS P ++I++F++M +++ P +
Sbjct: 67   CCTSPS-DMSYAYLLFTRINHKNPFVWNTIIRGFSRSSFPEMSITIFIDMFSSASAKPQR 125

Query: 404  LTYPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDE 583
            LTYPSVFKAY LG A DG QLHG ++K GLE D FIRN+++HMYA+CG A +F
Sbjct: 126  LTYPSVFKAYASLGKARDGMQLHGMVIKEGLEDDSFIRNTMLHMYATCGCFVEAWRIFMA 185

Query: 584  NRDLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLF 763
            + DVVAWNSM+MG A+ G IE++ +LF ++P RNEISWN+MISG+V+NGR+ +AL +F
Sbjct: 186  MKHFDVVAWNSMMMGLARYGLIEQAQKLFDEMPQRNEISWNSMISGFVKNGRFKDALEMF 245

Query: 764  HEMQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKC 943
            +MQE+ ++P FT+VS+LNAC LGA EQG+WIH+YI +N FELN IVITA+IDMYCKC
Sbjct: 246  RKMQERNVKPDGFTMVSLLNACAYLGASEQGRWIHEYIVKNRFELNSIVITALIDMYCKC 305

Query: 944  GEIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVL 1123
            G I VF+++P K L+CWNSM+LGLANNG+ E+ +LFS+LESS+L PD+VSF+ VL
Sbjct: 306  GCIEEGLRVFESAPNKQLSCWNSMVLGLANNGYEERAMDLFSELESSDLEPDSVSFIGVL 365

Query: 1124 TASNHSVRVDDARKY 1168
            TA +S +VD+A ++
Sbjct: 366  TACAYSGKVDEAGEF 380

>ref|NP_181820.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|75206274|sp|Q9SJG6.1|PP200_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g42920, chloroplastic; Flags: Precursor
 gi|4512663|gb|AAD21717.1| hypothetical protein [Arabidopsis thaliana]
 gi|20197867|gb|AAM15291.1| hypothetical protein [Arabidopsis thaliana]
 gi|110738441|dbj|BAF01146.1| hypothetical protein [Arabidopsis thaliana]
 gi|330255093|gb|AEC10187.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
 Length = 559

 Score = 427 bits (1097), Expect = e-117
 Identities = 216/382 (56%), Positives = 278/382 (72%), Gaps = 4/382 (1%)
 Frame = +2

Query: 35   MPPCFCSFNP---PSTSISKFISDQPYLSMLETSCRTITDLKKIHAHLIKTGLASDTIAI 205
            M P SF+ P+ S +S YL +++T C T+ +LK+IHA LIKTGL SDT+
Sbjct: 1    MSPTILSFSGVTVPAMPSSGSLSGNTYLRLIDTQCSTMRELKQIHASLIKTGLISDTVTA 60

Query: 206  SRVLSFCATSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPNVAISLFLEMLTTS 385
            SRVL+FC S + D++YA +F RI+ N F WNTIIR FS SS P +AIS+F++ML +S
Sbjct: 61   SRVLAFCCASPS-DMNYAYLVFTRINHKNPFVWNTIIRGFSRSSFPEMAISIFIDMLCSS 119

Query: 386  -QVLPGKLTYPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGN 562
            V P +LTYPSVFKAY +LG A DG QLHG ++K GLE D FIRN+++HMY +CG L
Sbjct: 120  PSVKPQRLTYPSVFKAYGRLGQARDGRQLHGMVIKEGLEDDSFIRNTMLHMYVTCGCLIE 179

Query: 563  AGNLFDENRDLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRW 742
            A +F DVVAWNSMIMGFAKCG I+++ LF ++P RN +SWN+MISG+VRNGR+
Sbjct: 180  AWRIFLGMIGFDVVAWNSMIMGFAKCGLIDQAQNLFDEMPQRNGVSWNSMISGFVRNGRF 239

Query: 743  IEALNLFHEMQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAI 922
            +AL++F EMQEK ++P FT+VS+LNAC LGA EQG+WIH+YI RN FELN IV+TA+
Sbjct: 240  KDALDMFREMQEKDVKPDGFTMVSLLNACAYLGASEQGRWIHEYIVRNRFELNSIVVTAL 299

Query: 923  IDMYCKCGEIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDA 1102
            IDMYCKCG I VF+ +P+K L+CWNSM+LGLANNG E+ +LFS+LE S L PD+
Sbjct: 300  IDMYCKCGCIEEGLNVFECAPKKQLSCWNSMILGLANNGFEERAMDLFSELERSGLEPDS 359

Query: 1103 VSFVAVLTASNHSVRVDDARKY 1168
            VSF+ VLTA HS V A ++
Sbjct: 360  VSFIGVLTACAHSGEVHRADEF 381

>ref|XP_002880012.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297325851|gb|EFH56271.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 Length = 542

 Score = 426 bits (1094), Expect = e-116
 Identities = 213/376 (56%), Positives = 273/376 (72%), Gaps = 1/376 (0%)
 Frame = +2

Query: 44   CFCSFNPPSTSISKFISDQPYLSMLETSCRTITDLKKIHAHLIKTGLASDTIAISRVLSF 223
            CF P+ S F+S L +++T C T+ +LK+IHA+LIKTGL SDT+A SRVL+F
Sbjct: 7    CFSGVTVPAIPSSGFVSGNTCLRLIDTRCSTMRELKQIHANLIKTGLISDTVAASRVLAF 66

Query: 224  CATSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPNVAISLFLEMLTTS-QVLPG 400
            C S + D +YA +F RI+ N F WNTIIR FS SS P +AIS+F++ML +S V P
Sbjct: 67   CCASPS-DRNYAYLVFTRINHKNPFVWNTIIRGFSRSSFPEMAISIFIDMLCSSPSVKPQ 125

Query: 401  KLTYPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFD 580
            +LTYPSVFKAY LGLA DG QLHGR++K GLE D FIRN+++HMY +CG L A LF
Sbjct: 126  RLTYPSVFKAYASLGLARDGRQLHGRVIKEGLEDDSFIRNTMLHMYVTCGCLVEAWRLFV 185

Query: 581  ENRDLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNL 760
            DVVAWNS+IMG AKCG I+++ +LF ++P RN +SWN+MISG+VRNGR+ +AL +
Sbjct: 186  GMMGFDVVAWNSIIMGLAKCGLIDQAQKLFDEMPQRNGVSWNSMISGFVRNGRFKDALEM 245

Query: 761  FHEMQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCK 940
            F EMQE+ ++P FT+VS+LNAC LGA EQG+WIH YI RN FELN IVITA+IDMYCK
Sbjct: 246  FREMQERDVKPDGFTMVSLLNACAYLGASEQGRWIHKYIVRNRFELNSIVITALIDMYCK 305

Query: 941  CGEIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAV 1120
            CG +VF+ +P K L+CWNSM+LGLANNG E+ +LF +LE + L PD+VSF+ V
Sbjct: 306  CGCFEEGLKVFECAPTKQLSCWNSMILGLANNGCEERAMDLFLELERTGLEPDSVSFIGV 365

Query: 1121 LTASNHSVRVDDARKY 1168
            LTA HS V A ++
Sbjct: 366  LTACAHSGEVHKAGEF 381

>ref|XP_006293939.1| hypothetical protein CARUB_v10022931mg [Capsella rubella]
 gi|565472276|ref|XP_006293940.1| hypothetical protein CARUB_v10022931mg [Capsella rubella]
 gi|482562647|gb|EOA26837.1| hypothetical protein CARUB_v10022931mg [Capsella rubella]
 gi|482562648|gb|EOA26838.1| hypothetical protein CARUB_v10022931mg [Capsella rubella]
 Length = 555

 Score = 414 bits (1064), Expect = e-113
 Identities = 207/363 (57%), Positives = 270/363 (74%), Gaps = 1/363 (0%)
 Frame = +2

Query: 74   SISKFISDQPYLSMLETSCRTITDLKKIHAHLIKTGLASDTIAISRVLSFCATSAAADLH 253
            +++ F S YL +++T C T+ +LK+IH +LIKTGL SDT+A SRVL+FC S + D++
Sbjct: 12   TVAAFPSPASYLRLIDTQCSTMRELKQIHGNLIKTGLISDTVAASRVLAFCCASPS-DMN 70

Query: 254  YALSLFRRIDRPNLFTWNTIIRAFSHSSDPNVAISLFLEMLTTS-QVLPGKLTYPSVFKA 430
            YA +F RI+ N F WNTIIR FS SS P +AIS+F++ML +S V P LTYPSVFKA
Sbjct: 71   YAYLVFTRINHKNPFVWNTIIRGFSQSSFPEMAISIFIDMLCSSPSVKPQNLTYPSVFKA 130

Query: 431  YTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDENRDLDVVAW 610
            Y +LG A+DG QLHGR++K GLE D FIRN+++ MY + G L A +F D DVVAW
Sbjct: 131  YGRLGQAIDGRQLHGRVLKEGLEDDSFIRNTMLQMYVTSGCLVEAWRIFVGMTDFDVVAW 190

Query: 611  NSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLFHEMQEKQIR 790
            NSMIMG AKCG I ++ +LF ++P RNE+SWN+MISG+VRNGR+ +AL +F EMQE+ ++
Sbjct: 191  NSMIMGLAKCGLISQAQQLFDEMPHRNEVSWNSMISGFVRNGRFKDALEMFREMQERNVK 250

Query: 791  PTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKCGEIGMAREV 970
            P FT+VS+LNAC LGA EQG+WIH+YI RN FELN IVITA+I+MYCKCG I +V
Sbjct: 251  PDGFTMVSLLNACAYLGANEQGRWIHEYIARNRFELNSIVITALIEMYCKCGCIEEGLKV 310

Query: 971  FKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVLTASNHSVRV 1150
            F+ +P+K L+CWNSM+LGLANNG E+ +LF +LE L PD+VSF+ VLTA +S V
Sbjct: 311  FECAPKKQLSCWNSMILGLANNGCEERAMDLFLELERFGLEPDSVSFIGVLTACAYSGEV 370

Query: 1151 DDA 1159
            A
Sbjct: 371  HKA 373