BLASTX nr result
ID: Akebia23_contig00001455
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia23_contig00001455 (1617 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006429524.1| hypothetical protein CICLE_v10013613mg [Citr... 454 e-125 ref|XP_004305097.1| PREDICTED: pentatricopeptide repeat-containi... 449 e-123 ref|XP_007138368.1| hypothetical protein PHAVU_009G202600g [Phas... 447 e-123 ref|XP_007140168.1| hypothetical protein PHAVU_008G089500g [Phas... 443 e-122 ref|XP_002265961.1| PREDICTED: pentatricopeptide repeat-containi... 441 e-121 ref|XP_003533674.1| PREDICTED: pentatricopeptide repeat-containi... 432 e-118 ref|XP_007026524.1| Pentatricopeptide repeat superfamily protein... 430 e-118 gb|EXB42398.1| hypothetical protein L484_021993 [Morus notabilis] 429 e-117 ref|XP_003623530.1| Pentatricopeptide repeat-containing protein ... 428 e-117 ref|XP_002309173.2| pentatricopeptide repeat-containing family p... 425 e-116 gb|AHB18409.1| pentatricopeptide repeat-containing protein [Goss... 418 e-114 ref|XP_004137893.1| PREDICTED: pentatricopeptide repeat-containi... 395 e-107 ref|XP_002533822.1| pentatricopeptide repeat-containing protein,... 379 e-102 ref|XP_004246310.1| PREDICTED: pentatricopeptide repeat-containi... 363 2e-97 ref|XP_006411054.1| hypothetical protein EUTSA_v10017948mg [Eutr... 355 4e-95 ref|XP_006294146.1| hypothetical protein CARUB_v10023139mg [Caps... 338 3e-90 ref|XP_002879744.1| pentatricopeptide repeat-containing protein ... 335 3e-89 ref|NP_181376.3| pentatricopeptide repeat-containing protein [Ar... 335 3e-89 gb|AAM98219.1| unknown protein [Arabidopsis thaliana] gi|3137637... 335 3e-89 ref|XP_006854116.1| hypothetical protein AMTR_s00048p00149840 [A... 298 5e-78 >ref|XP_006429524.1| hypothetical protein CICLE_v10013613mg [Citrus clementina] gi|557531581|gb|ESR42764.1| hypothetical protein CICLE_v10013613mg [Citrus clementina] Length = 506 Score = 454 bits (1169), Expect = e-125 Identities = 225/397 (56%), Positives = 299/397 (75%) Frame = +1 Query: 1 YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180 Y F+IK L +NSQF + +LD +EK E F PE IF++LI+ Y A++ QD++ +F++I Sbjct: 83 YHFVIKTLAENSQFCDISSVLDHIEKRENFETPEFIFIDLIKTYADAHRFQDSVNLFYKI 142 Query: 181 PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360 PKFRC PSV SL+A+LSVLC+ KE ++MV Q+LLKS + MNIR+EE+SFRILI+ LC+IN Sbjct: 143 PKFRCVPSVYSLNALLSVLCRNKEWVKMVPQILLKS-QLMNIRIEESSFRILISTLCRIN 201 Query: 361 KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540 + +AIEILN M + G+ D K S ILSS+C+ RDLSS E+LGF++E++K+GF D Sbjct: 202 RVGFAIEILNCMINDGFCVDGKTCSWILSSVCEQRDLSSDELLGFVQEMKKLGFCFGMVD 261 Query: 541 YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720 Y+NVI LVK + DAL ILNQMK GIKP++VCYT VL+GV+ + KAEE+FDE+L Sbjct: 262 YTNVIRSLVKKEKVFDALGILNQMKSDGIKPDIVCYTMVLNGVIVQEDYVKAEELFDELL 321 Query: 721 VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900 VLG++PD+YTYN+Y+ GLC Q+N E G KM+ CMEELG KP+V TYNTL+ ALCK EL Sbjct: 322 VLGLVPDVYTYNVYINGLCKQNNVEAGIKMIACMEELGSKPDVITYNTLLQALCKVRELN 381 Query: 901 KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080 + RE++K+M KG+ N TY I+ID L +G+++EA GLLE L KGL S +FDE I Sbjct: 382 RLRELVKEMKWKGIVLNLQTYSIMIDGLASKGDIIEACGLLEEALNKGLCTQSSMFDETI 441 Query: 1081 CGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLLS 1191 CGLC+RGLV +A ++L +M +DV+PGA WEALLLS Sbjct: 442 CGLCQRGLVRKALELLKQMADKDVSPGARVWEALLLS 478 >ref|XP_004305097.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial-like [Fragaria vesca subsp. vesca] Length = 491 Score = 449 bits (1156), Expect = e-123 Identities = 218/397 (54%), Positives = 297/397 (74%) Frame = +1 Query: 1 YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180 Y+F++K L + SQ +P +LDRLE IEKF+PPE IF NLIR YG AN+++DAI++F RI Sbjct: 74 YNFVLKTLFKTSQLSHIPSVLDRLESIEKFHPPESIFANLIRFYGSANRVEDAIDVFCRI 133 Query: 181 PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360 PKFRC PS SL+++L VLC EGL+MV QVL+ S AM IRLEE+SFRILI+ LC+I Sbjct: 134 PKFRCDPSAVSLNSLLYVLCGSSEGLKMVPQVLMNSR-AMGIRLEESSFRILISALCRIG 192 Query: 361 KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540 YAIEI+ M GYD D K+ SL+LSSLC+ + + +EV+GF+EE++KVGF P D Sbjct: 193 SVGYAIEIMKCMISNGYDLDVKICSLVLSSLCEQKGVGGLEVVGFVEEMKKVGFCPGMLD 252 Query: 541 YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720 YSNVI LVK G+ LDAL +L +MK+ G+KP++VCYT VL GV++ G + A++VFDE+L Sbjct: 253 YSNVIRCLVKQGKGLDALRVLCKMKVEGMKPDIVCYTMVLYGVIANGDYKNADKVFDELL 312 Query: 721 VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900 VLG++PD+YTYN+Y+ GLC+Q+N E G KM+ CM+ELGC+PN+ TYN L+ ALCK EL Sbjct: 313 VLGLVPDVYTYNVYINGLCNQNNVEAGIKMITCMDELGCRPNLITYNLLLKALCKNEELS 372 Query: 901 KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080 +ARE++ +M L GV N T++I++D L C+G+V EA +E ML K + +D++I Sbjct: 373 RARELVSEMTLNGVGVNLQTHIIMLDGLFCKGDVDEACIFMEEMLDKFMCRRCSAYDDVI 432 Query: 1081 CGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLLS 1191 GLC+RGLV +A +L +M+ ++V PGA +WEALLLS Sbjct: 433 YGLCQRGLVCKAMDLLLKMVDKNVVPGARAWEALLLS 469 >ref|XP_007138368.1| hypothetical protein PHAVU_009G202600g [Phaseolus vulgaris] gi|561011455|gb|ESW10362.1| hypothetical protein PHAVU_009G202600g [Phaseolus vulgaris] Length = 513 Score = 447 bits (1150), Expect = e-123 Identities = 220/427 (51%), Positives = 306/427 (71%) Frame = +1 Query: 1 YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180 Y F+IK LT SQFQ +PP+LD LE +EKF PE V LIR YG ++K+QDA+++F RI Sbjct: 82 YYFLIKTLTCTSQFQDIPPVLDHLEHLEKFETPEFNLVYLIRFYGLSDKVQDAVDLFLRI 141 Query: 181 PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360 P+FRCTP+V SL+ +LS+LC+K+E L+MV ++LLKS MNIR+EE++F++LI LC+I Sbjct: 142 PRFRCTPTVCSLNLVLSLLCRKRECLKMVPEILLKSQH-MNIRVEESTFQVLIKALCRIK 200 Query: 361 KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540 + YAI++LN M GY D + SLI+SSLC+ D++SVE L ++RK+GF P D Sbjct: 201 RVGYAIKMLNYMIEGGYGLDETMCSLIISSLCEQEDMTSVEALVIWRDMRKLGFCPGIMD 260 Query: 541 YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720 Y+N+I FLVK G+ +DALDILNQ K GIKP+VVCYT VL G+++ G++ K EE+FDE+L Sbjct: 261 YTNMIRFLVKEGKGMDALDILNQQKKDGIKPDVVCYTMVLSGIIAEGEYVKLEELFDEIL 320 Query: 721 VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900 V G++PD+YTYN+Y+ GLC Q+N + K++ MEEL CKPNV T N L+GALC AG+LR Sbjct: 321 VFGLVPDVYTYNVYINGLCKQNNVDEALKIVASMEELECKPNVVTCNILLGALCVAGDLR 380 Query: 901 KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080 KAR V+K+MG KGV+ + H+Y I++D L+ +GE+ EA LLE ML K P S FD +I Sbjct: 381 KARGVMKEMGWKGVRLDLHSYRIMLDGLVGKGEIGEACFLLEEMLEKSFFPRSSTFDHII 440 Query: 1081 CGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLLSRFELSFKEAISPDLDGLLSNLPH 1260 +C++GL+ EA ++ +++ + PGA +WEALL S +L F E L G +NL + Sbjct: 441 FQMCQKGLIVEAIELTKKIVAKSFVPGARAWEALLKSGSKLGFSETTFSGLLGQKNNLSY 500 Query: 1261 HVN*NGN 1281 +GN Sbjct: 501 QTG-SGN 506 >ref|XP_007140168.1| hypothetical protein PHAVU_008G089500g [Phaseolus vulgaris] gi|561013301|gb|ESW12162.1| hypothetical protein PHAVU_008G089500g [Phaseolus vulgaris] Length = 514 Score = 443 bits (1140), Expect = e-122 Identities = 218/419 (52%), Positives = 303/419 (72%), Gaps = 1/419 (0%) Frame = +1 Query: 1 YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180 Y F+IK LT S Q +PP+LD LE++E F PE I V LIR YG ++++QDA+++F RI Sbjct: 82 YYFVIKTLTSTSHLQDIPPVLDHLEQLETFETPEFILVYLIRFYGLSDRVQDAVDLFLRI 141 Query: 181 PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360 P+FRCTP+V SL+ +LS+LC+K+E L+MV ++LLKS MNIR+EE++F++LI LC+I Sbjct: 142 PRFRCTPTVWSLNLVLSLLCRKRECLKMVPEILLKSQH-MNIRVEESTFQVLIEALCRIK 200 Query: 361 KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540 + YAI++LN M GY D + SLI+SSLC+ D++SVE L ++RK+GF P D Sbjct: 201 RVGYAIKMLNYMIEGGYGLDETICSLIISSLCEQEDMTSVEALVIWRDMRKLGFCPGVMD 260 Query: 541 YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720 Y+N+I FLVK G+ DALDILNQ K GIKP+VVCYT VL G+V+ G++ K EE+FDE+L Sbjct: 261 YTNMIRFLVKEGKGTDALDILNQQKKDGIKPDVVCYTMVLSGIVAEGEYVKLEELFDEIL 320 Query: 721 VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900 V G++PD+YTYN+Y+ GLC Q+N + K++ MEEL C+PNV T NTL+GALC AG+LR Sbjct: 321 VFGLVPDVYTYNVYINGLCKQNNVDEALKIVASMEELECRPNVVTCNTLLGALCVAGDLR 380 Query: 901 KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080 KAR V+K+MG KGV N H+Y I++D L+ +GE+ EA LLE ML K P S FD +I Sbjct: 381 KARGVMKEMGWKGVGLNLHSYRIMLDGLVGKGEIGEACFLLEEMLEKCFFPRSSTFDHII 440 Query: 1081 CGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLL-SRFELSFKEAISPDLDGLLSNL 1254 +C++GL++EA ++ +++ + PGA +WEALLL S +L F E L G ++NL Sbjct: 441 FQMCQKGLIAEAIELTKKIVAKSFVPGARAWEALLLKSGSKLGFSETTFSGLLGQINNL 499 >ref|XP_002265961.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial-like [Vitis vinifera] Length = 505 Score = 441 bits (1134), Expect = e-121 Identities = 225/425 (52%), Positives = 300/425 (70%), Gaps = 4/425 (0%) Frame = +1 Query: 1 YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180 Y F+I LT+ QF LPP+L RLEK+EKF PE IF NLI++YG AN +DA+++FFRI Sbjct: 80 YRFVISTLTRCRQFHHLPPLLHRLEKVEKFETPEFIFTNLIKVYGNANMFEDAVDLFFRI 139 Query: 181 PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360 P FRC PSV SL+A+L VLCK++EGL MV Q+LLKS +AMNIRLEE+SFRIL+ LC+I Sbjct: 140 PNFRCVPSVYSLNALLYVLCKRREGLVMVPQILLKS-QAMNIRLEESSFRILVAALCRIK 198 Query: 361 KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540 K +YAI ILN M + GY D+K+ S+ILSSLC+ + LS EVL F+EE+RK+GF P D Sbjct: 199 KHNYAIRILNYMLNDGYAVDAKMCSIILSSLCEQKGLSGDEVLRFMEEMRKLGFYPGRVD 258 Query: 541 YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720 +NVI FLVK G +DAL + +QMK GIKP+ V YT +L+GV + G + KA+++FDEML Sbjct: 259 CNNVIRFLVKEGMVMDALGVFDQMKTDGIKPDTVSYTMILNGVTADGDYEKADDLFDEML 318 Query: 721 VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900 VLGV+PDI+ YN+Y+ LC Q+N E G +ML M ELGCKP+ TYN L+ + K +L Sbjct: 319 VLGVVPDIHAYNVYINSLCKQNNIEEGVRMLASMRELGCKPDYVTYNMLLEGMSKVRDLG 378 Query: 901 KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080 RE+ ++M L+GV+ N TY I++D L+ +GE+ E+ LLE ML K FDE+I Sbjct: 379 GMRELAREMELEGVQWNWETYRIMLDGLVGKGEIDESCSLLEEMLDKYFSCWCSTFDEII 438 Query: 1081 CGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLLSRFELSFKEAISPDLDGLL----S 1248 C LC+RGLV +A Q++ +M+ + +APGA +WEALLL E SF E +L + + Sbjct: 439 CELCQRGLVCKALQLVNKMVRKTIAPGARAWEALLLGSVEFSFAETSLTELVNPIQIHPA 498 Query: 1249 NLPHH 1263 LP H Sbjct: 499 RLPEH 503 >ref|XP_003533674.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial-like [Glycine max] Length = 499 Score = 432 bits (1112), Expect = e-118 Identities = 214/414 (51%), Positives = 299/414 (72%), Gaps = 1/414 (0%) Frame = +1 Query: 1 YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180 Y F++K LT SQ Q +PP+L LE +EKF PE I V LIR YG ++++QDA+++FFRI Sbjct: 85 YFFVLKTLTSTSQLQDIPPVLYHLEHLEKFETPESILVYLIRFYGLSDRVQDAVDLFFRI 144 Query: 181 PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360 P+FRCTP+V SL+ +LS+LC+K++ L+MV ++LLKS MNIR+EE++FR+LI LC+I Sbjct: 145 PRFRCTPTVCSLNLVLSLLCRKRDCLEMVPEILLKSQH-MNIRVEESTFRVLIRALCRIK 203 Query: 361 KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540 + YAI++LN M GY D K+ SL++S+LC+ +DL+S E L ++RK+GF P D Sbjct: 204 RVGYAIKMLNFMVEDGYGLDEKICSLVISALCEQKDLTSAEALVVWRDMRKLGFCPGVMD 263 Query: 541 YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720 Y+N+I FLVK GR +DALDILNQ K GIK +VV YT VL G+V+ G++ +E+FDEML Sbjct: 264 YTNMIRFLVKEGRGMDALDILNQQKQDGIKLDVVSYTMVLSGIVAEGEYVMLDELFDEML 323 Query: 721 VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900 V+G+IPD YTYN+Y+ GLC Q+N +++ MEELGCKPNV TYNTL+GAL AG+ Sbjct: 324 VIGLIPDAYTYNVYINGLCKQNNVAEALQIVASMEELGCKPNVVTYNTLLGALSVAGDFV 383 Query: 901 KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080 KARE++K+MG KGV N HTY I++D L+ +GE+ E+ LLE ML K L P S FD +I Sbjct: 384 KARELMKEMGWKGVGLNLHTYRIVLDGLVGKGEIGESCLLLEEMLEKCLFPRSSTFDNII 443 Query: 1081 CGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLL-SRFELSFKEAISPDLDG 1239 +C++ L +EA ++ +++ + PGAS+WEALLL S +L + +A L G Sbjct: 444 FQMCQKDLFTEAMELTKKVVAKSFLPGASTWEALLLNSGSKLGYSKATFSGLLG 497 >ref|XP_007026524.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma cacao] gi|508715129|gb|EOY07026.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma cacao] Length = 542 Score = 430 bits (1106), Expect = e-118 Identities = 209/397 (52%), Positives = 288/397 (72%) Frame = +1 Query: 1 YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180 Y F+IK L QN F +P +L LE +EKF PE+IF +LI YG AN+IQDA++IF+RI Sbjct: 121 YHFLIKTLIQNLHFNHIPSVLHHLEHVEKFQTPEYIFADLITTYGIANRIQDAVDIFYRI 180 Query: 181 PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360 PKFRC PS SL+++L++LC+ + L++V QVLLKS MNIR+EE++ RIL++ LC++N Sbjct: 181 PKFRCVPSAYSLNSLLALLCRNQYSLKLVPQVLLKSL-LMNIRVEESTLRILVSALCRMN 239 Query: 361 KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540 K SYAI+IL M G + K+ S ILSS+C DL +V+G E+ K+GF P D Sbjct: 240 KVSYAIDILQRMIDEGLGVNDKVCSFILSSICAKADLDGEDVMGLWRELGKLGFCPAMSD 299 Query: 541 YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720 Y+ +I FLVK GR LDALD LNQMK GIKP +V YT L+GV++ G + A+E+FDE+L Sbjct: 300 YNCLIRFLVKKGRGLDALDFLNQMKSVGIKPGIVSYTMALNGVIAEGDYMLADELFDELL 359 Query: 721 VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900 +LG++PD+YTYN Y+ LC Q+ E G KM+ CMEEL CKPNV TYN L+ A+CK GE+ Sbjct: 360 MLGLVPDVYTYNAYIDALCKQNKVEEGIKMVACMEELRCKPNVLTYNMLLEAICKVGEIS 419 Query: 901 KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080 +A E++K+M KG++ N +Y ++ID L+ +GE++EA GL+E +L K S+ FDE+I Sbjct: 420 RAMELVKEMKYKGIEMNLVSYTVIIDGLVSKGEILEAHGLVEEVLHKCFCHQSLAFDEVI 479 Query: 1081 CGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLLS 1191 CGLC+RGLV EA ++L +M+ ++V+PGA WEALLLS Sbjct: 480 CGLCQRGLVCEALELLRKMVAKNVSPGARGWEALLLS 516 >gb|EXB42398.1| hypothetical protein L484_021993 [Morus notabilis] Length = 494 Score = 429 bits (1103), Expect = e-117 Identities = 219/411 (53%), Positives = 301/411 (73%), Gaps = 4/411 (0%) Frame = +1 Query: 1 YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180 Y F++K L + SQF + +LDR+E +EKF PE+ F +I YGF ++I+DAI+IF+RI Sbjct: 72 YHFVLKTLIKTSQFDHIHSVLDRIEFVEKFETPEYFFAQIIGFYGFLDRIEDAIDIFWRI 131 Query: 181 PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360 PKFRC PS SL+++L VLC++ EGL+ V +VL+KS + MNIRLEEASFRILIT LCKI Sbjct: 132 PKFRCVPSSYSLNSLLYVLCRRNEGLRFVPEVLIKSRD-MNIRLEEASFRILITALCKIG 190 Query: 361 KFSYAIEILNLMPHYGYDPDSKLYSLILSSLC---KSRDLSSVEVLGFLEEIRKVGFSPN 531 K YAIEIL+ M GYD D+++ SLILS LC K DL+ +VL L+++ K+GF P Sbjct: 191 KVGYAIEILDCMISDGYDIDARICSLILSFLCGKNKELDLAGFDVLELLQKMEKMGFCPR 250 Query: 532 GFDYSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFD 711 DYS VI LV+ R L+ALDIL QMK G+KP+VVCYT VL G+V+ G++ KA+E+FD Sbjct: 251 MGDYSKVIRILVREKRGLEALDILGQMKADGMKPDVVCYTMVLHGIVAEGEYSKADEMFD 310 Query: 712 EMLVLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAG 891 EMLVLG++PD+YTYN Y+ GLC Q++ + ++ MEELGCKPN+ TYN ++ ALCK G Sbjct: 311 EMLVLGLVPDVYTYNAYINGLCKQNDVDGALDTILRMEELGCKPNLITYNLILRALCKNG 370 Query: 892 ELRKAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFD 1071 E +A+E++ +M LKG + TY+I++D L+ +GE+VEA GL+E ML K L ++D Sbjct: 371 EFGRAKELVAEMSLKGFEDYLQTYIIMLDVLLGKGEIVEACGLMEEMLDKLLCRRCSMYD 430 Query: 1072 EMICGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLLSR-FELSFKEAI 1221 E+I GLC+RGL +A ++L +M+ ++VAPGA +W+ALLLS EL+ EAI Sbjct: 431 EIIFGLCRRGLDCKASEMLGKMVGKNVAPGARAWDALLLSSGSELTLPEAI 481 >ref|XP_003623530.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355498545|gb|AES79748.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 653 Score = 428 bits (1101), Expect = e-117 Identities = 216/419 (51%), Positives = 302/419 (72%), Gaps = 3/419 (0%) Frame = +1 Query: 1 YSFIIKILTQ--NSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFF 174 Y F+IK +T S ++P IL+ LE EKF PE IF+ LIR YGF +++QDA+++FF Sbjct: 77 YFFLIKTITNINTSHLHEIPHILNHLEHNEKFETPEFIFMYLIRFYGFNDRVQDAVDLFF 136 Query: 175 RIPKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCK 354 RIP+FRCTP+V SL+ +LS+LC K+E L+MV +LLKS + M IRLEE+SF +LI LC+ Sbjct: 137 RIPRFRCTPTVCSLNLLLSLLCGKRECLRMVPDILLKSRD-MKIRLEESSFWVLIKALCR 195 Query: 355 INKFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNG 534 I + YAI+++N M GY D K+ SLI+SSLC+ DL+SVE L +RK+GF P Sbjct: 196 IKRVDYAIKMMNCMVEDGYCLDDKICSLIISSLCEQNDLTSVEALVVWGNMRKLGFCPGV 255 Query: 535 FDYSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDE 714 D +N+I FLVK G+ +DAL+ILNQ+K GIKP++VCYT VL G+V G + K +E+FDE Sbjct: 256 MDCTNMIRFLVKEGKGMDALEILNQLKEDGIKPDIVCYTIVLSGIVKEGDYVKLDELFDE 315 Query: 715 MLVLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGE 894 +LVLG++PD+YTYN+Y+ GLC Q+NF+ K++V ME+LGCKPNV TYNTL+GALC +G+ Sbjct: 316 ILVLGLVPDVYTYNVYINGLCKQNNFDEALKIVVSMEKLGCKPNVVTYNTLLGALCMSGD 375 Query: 895 LRKAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDE 1074 L KA+ V+K+M LKGV+ N HTY I++D L+ +GE+ EA LLE ML K P S FD Sbjct: 376 LGKAKRVMKEMRLKGVELNLHTYRIMLDGLVGKGEIGEACVLLEEMLEKCFYPRSSTFDS 435 Query: 1075 MICGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLL-SRFELSFKEAISPDLDGLLS 1248 ++ +C++GL+S+A ++ +++ + PGA WEALLL S ++++ E GLLS Sbjct: 436 IVHQMCQKGLISDALVLMNKIVAKSFDPGAKVWEALLLNSESKVTYSET---TFAGLLS 491 >ref|XP_002309173.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550335936|gb|EEE92696.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 490 Score = 425 bits (1092), Expect = e-116 Identities = 204/379 (53%), Positives = 277/379 (73%) Frame = +1 Query: 1 YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180 + FI K L + SQF +P +LD LEK+E F PPE F LI +YG NK +AIE+F+RI Sbjct: 80 FDFIFKTLVKTSQFHHIPSVLDHLEKVESFEPPESTFAYLIEVYGRTNKTHEAIELFYRI 139 Query: 181 PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360 PKFRC PSV SL+ ++SVLC+ +GL++V ++LLKS + MNIR+EE++F++LIT LC+I Sbjct: 140 PKFRCVPSVYSLNTLISVLCRNSKGLKLVPEILLKS-QVMNIRVEESTFQVLITALCRIR 198 Query: 361 KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540 K +AIE+LN M + G+ ++++YSL+LS LC+ +D + EV+GFLE++RK+GF P D Sbjct: 199 KVGFAIEMLNCMVNDGFIVNAEIYSLLLSCLCEQKDATKFEVIGFLEQLRKLGFFPGMVD 258 Query: 541 YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720 YSNVI FLVK R LDAL +LN MK IKP++ CYT VL GV+ + KA+E+FDE+L Sbjct: 259 YSNVIRFLVKGKRGLDALHVLNHMKSDRIKPDIFCYTMVLHGVIEDKDYLKADELFDELL 318 Query: 721 VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900 V G++PD YTYN+Y+ GLC Q+N + G KM+ MEELGCKPN+ TYN L+ LCK GEL Sbjct: 319 VFGLVPDAYTYNVYINGLCKQNNVQAGIKMVASMEELGCKPNLITYNMLVKQLCKVGELS 378 Query: 901 KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080 KA E++++MGLKG+ N TY I+ID L G++VEA GL E L KGL S++FDE+I Sbjct: 379 KAGELVREMGLKGIGLNMQTYRIMIDGLASNGKIVEACGLFEEALDKGLCTQSLMFDEII 438 Query: 1081 CGLCKRGLVSEAQQVLTEM 1137 CGLC R L +A ++L +M Sbjct: 439 CGLCHRDLSCKALKLLEKM 457 >gb|AHB18409.1| pentatricopeptide repeat-containing protein [Gossypium hirsutum] Length = 480 Score = 418 bits (1075), Expect = e-114 Identities = 202/397 (50%), Positives = 288/397 (72%) Frame = +1 Query: 1 YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180 Y F+IK L N QF +P +L L+ ++ F PE+IF +L++ YG AN+IQDA++IF+RI Sbjct: 73 YHFLIKTLLHNRQFHHIPSLLHHLQ-LQHFQTPEYIFTHLVKFYGKANRIQDAVDIFYRI 131 Query: 181 PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360 P+FRC PS SL+A+L++LC+ + GL+++ QVLL S MNIRLEE++FR+L+ LC++N Sbjct: 132 PQFRCFPSAYSLNALLALLCRSQRGLKLLPQVLLNSLH-MNIRLEESTFRLLVCTLCRMN 190 Query: 361 KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540 K +YAIEIL M G + K++S +LSS+C DL +V+GF +RK+GFSP D Sbjct: 191 KVAYAIEILQRMLDDGLGVNDKVFSFVLSSVCAEGDLDGEDVIGFWRGLRKLGFSPAMGD 250 Query: 541 YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720 Y V+ FLVK GR LDA D+LNQMK GI P ++ YT VL+GV + G + A+E+FDE+L Sbjct: 251 YDGVVRFLVKKGRGLDAWDVLNQMKSDGIMPGIISYTMVLNGVTAEGDYILADELFDELL 310 Query: 721 VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900 +LG++P++YTY Y+ LC Q+ E G KM+ CMEELGCKPNV YNTL+ + KAGE+ Sbjct: 311 MLGLVPNVYTYKAYIDALCKQNKVEEGIKMVACMEELGCKPNVLIYNTLLRTISKAGEIS 370 Query: 901 KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080 +ARE++K+M KG++ N +Y I+ID L+ GE++EA L+E +L K + S+ FDE+I Sbjct: 371 RARELVKEMKYKGIEMNWVSYTIIIDGLVSNGEILEACALVEEVLHKCIFIKSLTFDEVI 430 Query: 1081 CGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLLS 1191 CGLC+RGLV +A+++L +M+ R ++PGA WEALLLS Sbjct: 431 CGLCQRGLVCKARELLGKMVERSISPGARVWEALLLS 467 >ref|XP_004137893.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial-like [Cucumis sativus] gi|449483740|ref|XP_004156675.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial-like [Cucumis sativus] Length = 491 Score = 395 bits (1016), Expect = e-107 Identities = 199/416 (47%), Positives = 285/416 (68%), Gaps = 1/416 (0%) Frame = +1 Query: 1 YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180 Y F++K L + SQF +PP+L RL+ +E F PE+IFV+LI++YG N+IQDA+ +F RI Sbjct: 75 YYFVLKTLARTSQFHHIPPVLHRLQFLENFQTPEYIFVDLIKLYGRMNRIQDAVTLFRRI 134 Query: 181 PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360 P FRC PS SL+++LS L + +GL ++ ++L SH +M IRLE ++F+ILIT LCK+N Sbjct: 135 PMFRCVPSTLSLNSLLSQLSRNAQGLPIIPDIILNSH-SMGIRLEHSTFQILITALCKVN 193 Query: 361 KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540 K +A+E+ N M GY + ++ SLIL+SLC+ + S VLGFLEE+R+ GF P D Sbjct: 194 KVGHAMELFNYMITEGYGLNPQICSLILASLCQQKKSSGDVVLGFLEEMRQKGFCPAVVD 253 Query: 541 YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720 YSNVI F V G DA+D+LN+MK G KP++VCYT VL+GV++ G + A+E+FDE+L Sbjct: 254 YSNVIKFFVTRGMGSDAVDLLNKMKADGFKPDIVCYTMVLNGVIADGDYKMADELFDELL 313 Query: 721 VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900 + G++PDIYTYN+Y+ GLC Q + G +M+ ME LGC+PNV TYN ++ +LCK GEL Sbjct: 314 LFGLVPDIYTYNVYIHGLCKQGDSVAGLQMIPHMEALGCQPNVITYNVILKSLCKTGELD 373 Query: 901 KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080 +AR++ KM LKG+ N T+ I+ID L GEV+EA LLE ML P F E++ Sbjct: 374 EARKLRSKMQLKGLAENLRTFRIMIDGLFHNGEVIEACVLLEEMLGSRFPPQISTFSEIL 433 Query: 1081 CGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLL-SRFELSFKEAISPDLDGLL 1245 LCKR +V +A ++L M+ ++ +PG +WE LLL S EL+ +++ L L+ Sbjct: 434 SWLCKRHMVGKAVELLALMVGKNFSPGPKAWEILLLSSESELTSVKSLETTLKDLV 489 >ref|XP_002533822.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223526239|gb|EEF28557.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 373 Score = 379 bits (972), Expect = e-102 Identities = 189/374 (50%), Positives = 263/374 (70%) Frame = +1 Query: 148 IQDAIEIFFRIPKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASF 327 +Q+AI +F+R P FRC PSV L+ +LSVLC+ EGL V +VLLKS + MNIR+EE+SF Sbjct: 1 MQNAIHLFYRTPNFRCVPSVYLLNTLLSVLCRTNEGLNFVPEVLLKSQD-MNIRMEESSF 59 Query: 328 RILITVLCKINKFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEI 507 R+LI LC INK YA+E+ N M + G+ DSK+ SL+LSSLC D+SS EV+ FL E+ Sbjct: 60 RLLINALCSINKVGYAVEMFNCMINDGFSVDSKICSLLLSSLCYQADISSSEVMRFLGEL 119 Query: 508 RKVGFSPNGFDYSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQF 687 RK GF P DYS VI FLV+ G ++AL++LNQMK+ GIKP++VCYT VL+GV++ G + Sbjct: 120 RKFGFCPGIKDYSKVINFLVRRGMGMEALNVLNQMKLDGIKPDIVCYTTVLNGVIANGVY 179 Query: 688 HKAEEVFDEMLVLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTL 867 KA+E+FDE+LV G++PD+YTYN+Y+ GLC Q+N E G +M+ MEELGCKPN+ TYN L Sbjct: 180 SKADELFDELLVFGLVPDVYTYNVYIYGLCKQNNVEAGIEMVTSMEELGCKPNLITYNIL 239 Query: 868 MGALCKAGELRKAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGL 1047 + LCK GE +AR++++ MG KG+ TY ++I L G++V+A LLE L KGL Sbjct: 240 LEDLCKNGEDSRARDLVRDMGSKGIGLGMQTYKVMIHGLTSGGKIVKACSLLEEALDKGL 299 Query: 1048 VPTSIIFDEMICGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLLSRFELSFKEAISP 1227 P + FDE+I GLC+ G + +A ++L +++ ++V+PG WE LLL + ++F E Sbjct: 300 CPRGLRFDEVIYGLCQTGSICKALELLEKVVNKNVSPGVRVWETLLL-KSNINFVEDTFI 358 Query: 1228 DLDGLLSNLPHHVN 1269 DL + PH N Sbjct: 359 DLVWVWETHPHCQN 372 >ref|XP_004246310.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial-like [Solanum lycopersicum] Length = 496 Score = 363 bits (931), Expect = 2e-97 Identities = 183/396 (46%), Positives = 268/396 (67%), Gaps = 1/396 (0%) Frame = +1 Query: 1 YSFIIKILTQN-SQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFR 177 Y FI+K LTQN S + ++P ILD + K E F PE+IF LI+ YG +N A E+FF Sbjct: 94 YYFILKTLTQNPSTWDEIPLILDYIRKFENFETPEYIFTYLIKFYGDSNMTHLAYEMFFT 153 Query: 178 IPKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKI 357 +P +RC PSV SL+ ++ VLCK L++V QVL+KS + +NI +EE++F+ILI LC+I Sbjct: 154 MPAYRCNPSVKSLNCLIWVLCKNNYDLRIVLQVLVKS-QLLNIWVEESTFKILIRALCRI 212 Query: 358 NKFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGF 537 K + A+++L LM G++ D+ + SLILS++ +D VE+ G LEE+RK+G+SP Sbjct: 213 GKTNNAVDLLKLMVDSGFNLDANICSLILSTMPDVKDCVGVEIWGVLEEMRKLGYSPKRV 272 Query: 538 DYSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEM 717 D NVI F V G+ +DAL++LN+MKM G+ P+VVCY VL+G++ G++ A+E+FDE+ Sbjct: 273 DLCNVIRFYVNNGKGIDALEVLNKMKMCGMVPDVVCYNLVLNGLIFEGEYSNADELFDEL 332 Query: 718 LVLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGEL 897 LVLG+ PDI TYN+Y+ GLC Q ++L CME+LGCKP + TY+T++ LC+ G L Sbjct: 333 LVLGLNPDIVTYNVYINGLCKQDKMVEALRVLGCMEDLGCKPEMNTYHTILDGLCRCGML 392 Query: 898 RKAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEM 1077 +EVL +M KG++ +SH Y ++I+ +I GEV EA+ LL M+ G VP SI FD + Sbjct: 393 SSVKEVLGQMKSKGLQLSSHIYGVIINCMIRNGEVDEAYNLLHEMVDMGFVPQSITFDGL 452 Query: 1078 ICGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALL 1185 I LC +G E ++L+ M +++ PG SWEA + Sbjct: 453 IGLLCNKGSFYEVMELLSIMSTKNLVPGIRSWEAFV 488 >ref|XP_006411054.1| hypothetical protein EUTSA_v10017948mg [Eutrema salsugineum] gi|557112223|gb|ESQ52507.1| hypothetical protein EUTSA_v10017948mg [Eutrema salsugineum] Length = 456 Score = 355 bits (910), Expect = 4e-95 Identities = 174/380 (45%), Positives = 268/380 (70%) Frame = +1 Query: 1 YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180 Y F+IK L + SQ + + +L+ +E EKF+ PE IF ++I YGF+ +I++AI++FF+I Sbjct: 78 YKFVIKTLAKTSQLENIASVLNHIEISEKFDTPESIFRDVIFAYGFSGRIEEAIDVFFKI 137 Query: 181 PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360 P FRC PS +L+A+LSVL +K++GL+MV +VLLK+ + + +RLEE++ ILI LC+I Sbjct: 138 PNFRCVPSAYTLNALLSVLVRKRQGLKMVPEVLLKASK-LGVRLEESTLGILIDALCRIG 196 Query: 361 KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540 + A +++ M Y D +LYSL+LSS+CK +D S +V+G+LE +RK FSP+ D Sbjct: 197 EVDCATDLVKDMSDDCYIVDPRLYSLLLSSVCKHKDSSCFDVIGYLEGLRKTRFSPDLRD 256 Query: 541 YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720 Y+ V+ FLV+ GR + + +LNQMK I+P++VCYT +L GV++ + KA+++FDE+L Sbjct: 257 YTAVMRFLVEGGRGKEVVSVLNQMKCDRIEPDIVCYTIILQGVIADEDYKKADKLFDELL 316 Query: 721 VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900 +LG++PD+YTYN+Y+ GLC QS+ E G KM+ CME+LG +PNV TYN L+ AL KAG++ Sbjct: 317 LLGLVPDVYTYNVYINGLCKQSDIECGIKMMSCMEKLGSEPNVVTYNILIKALVKAGDMS 376 Query: 901 KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080 +A+ + ++M GV NSH+Y I+++ I EVV A GLLE + LV S +E+I Sbjct: 377 RAKIIWEEMETNGVDRNSHSYDIMVNASIEADEVVCAHGLLEEAFSRSLVVKSSRTEEVI 436 Query: 1081 CGLCKRGLVSEAQQVLTEMI 1140 C LC +GL+ +A ++L ++ Sbjct: 437 CRLCDKGLMDKAVELLVHLV 456 >ref|XP_006294146.1| hypothetical protein CARUB_v10023139mg [Capsella rubella] gi|482562854|gb|EOA27044.1| hypothetical protein CARUB_v10023139mg [Capsella rubella] Length = 470 Score = 338 bits (868), Expect = 3e-90 Identities = 170/380 (44%), Positives = 255/380 (67%) Frame = +1 Query: 1 YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180 Y F+IK L + SQ + + +L LE EKF+ PE IF ++I YGFA +I +AI++FF+I Sbjct: 92 YRFVIKTLAKTSQLENIASVLSHLEVSEKFDTPESIFRDVIAAYGFAGRIGEAIDVFFKI 151 Query: 181 PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360 P FRC PS +L+A+L VL +K+E L++V ++L+K+ M +RLEE++F ILI LCKI Sbjct: 152 PNFRCVPSAYTLNALLLVLVRKRESLELVPEILVKASR-MGVRLEESTFGILIDALCKIG 210 Query: 361 KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540 + A E++ M D +LYS +LSS+CK +D S +V+G+LE++RK FSP D Sbjct: 211 EVDCATELVRYMSIDCVIVDPRLYSQLLSSVCKHKDSSCFDVVGYLEDLRKTRFSPGLRD 270 Query: 541 YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720 Y+ V+ FLV+ GR + + +LNQMK I+P++VCYT VL GV++ ++ KA++ FDE+L Sbjct: 271 YTVVMSFLVEGGRGKEVVSVLNQMKCDRIEPDIVCYTIVLQGVIADAEYSKADKFFDELL 330 Query: 721 VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900 +LG+ PD+YTYN+Y+ GLC Q++ E KM+ M +LG +PNV TYN L+ AL AG+L Sbjct: 331 LLGLAPDVYTYNVYMNGLCKQNDIEGALKMMSSMNKLGSEPNVITYNILIKALVNAGDLS 390 Query: 901 KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080 +A+ + ++MG+ GV NSHTY I+I I G+VV A G LE + S +E+I Sbjct: 391 QAKTLWEEMGINGVNRNSHTYDIMISAFIEVGDVVSAQGFLEEAFNMNVFAKSSRTEEVI 450 Query: 1081 CGLCKRGLVSEAQQVLTEMI 1140 LC +GL+ +A ++L ++ Sbjct: 451 SRLCDKGLMDKAVELLAHLV 470 >ref|XP_002879744.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297325583|gb|EFH56003.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 444 Score = 335 bits (860), Expect = 3e-89 Identities = 169/380 (44%), Positives = 256/380 (67%) Frame = +1 Query: 1 YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180 Y F+I+ L + SQ + + +LD LE EKF+ PE IF ++I YGF+ +I++AI++FF+I Sbjct: 66 YRFVIETLAKTSQLENIASVLDHLEVSEKFDTPESIFRDVIAAYGFSGRIEEAIDVFFKI 125 Query: 181 PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360 P FRC PS +L+A+L VL +K++ L++V ++L+K+ M +RLEE++F ILI LC+I Sbjct: 126 PNFRCVPSAYTLNALLLVLVRKRQSLELVPEILVKASR-MGVRLEESTFGILINALCRIG 184 Query: 361 KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540 + A E++ M D +LYSL+LSS+CK +D S +V+G+LE++RK F P D Sbjct: 185 EVDCATELVRYMSEDSVIVDPRLYSLLLSSVCKHKDSSCFDVIGYLEDLRKTRFLPGLRD 244 Query: 541 YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720 Y+ V+ FLV+ GR + + +LNQMK I P+VVCYT VL GV++ + KA+++FDE+L Sbjct: 245 YTVVMRFLVEGGRGKEVVSVLNQMKCDRIDPDVVCYTIVLLGVIADEDYPKADKLFDELL 304 Query: 721 VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900 +LG+ PD+YTYN+Y+ GLC Q++ E KM+ M +LG +PNV TYN ++ L KAG+L Sbjct: 305 LLGLDPDVYTYNVYINGLCKQNDIEGAIKMMSSMNKLGSEPNVVTYNIVIKGLVKAGDLS 364 Query: 901 KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080 +A+ + K+M + GV NSHTY I+I I EVV A GLLE L S +E+I Sbjct: 365 RAKTLWKEMEMNGVNRNSHTYDIMISAYIEVDEVVCAQGLLEEAFNMNLFVKSSKIEEVI 424 Query: 1081 CGLCKRGLVSEAQQVLTEMI 1140 LC++GL+ +A ++L ++ Sbjct: 425 SRLCEKGLMDKAVELLAHLV 444 >ref|NP_181376.3| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|218546769|sp|Q8L6Y7.2|PP193_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g38420, mitochondrial; Flags: Precursor gi|3395430|gb|AAC28762.1| hypothetical protein [Arabidopsis thaliana] gi|330254441|gb|AEC09535.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 453 Score = 335 bits (860), Expect = 3e-89 Identities = 168/380 (44%), Positives = 257/380 (67%) Frame = +1 Query: 1 YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180 Y F+IK L ++SQ + + +L LE EKF+ PE IF ++I YGF+ +I++AIE+FF+I Sbjct: 75 YRFVIKTLAKSSQLENISSVLYHLEVSEKFDTPESIFRDVIAAYGFSGRIEEAIEVFFKI 134 Query: 181 PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360 P FRC PS +L+A+L VL +K++ L++V ++L+K+ M +RLEE++F ILI LC+I Sbjct: 135 PNFRCVPSAYTLNALLLVLVRKRQSLELVPEILVKACR-MGVRLEESTFGILIDALCRIG 193 Query: 361 KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540 + A E++ M D +LYS +LSS+CK +D S +V+G+LE++RK FSP D Sbjct: 194 EVDCATELVRYMSQDSVIVDPRLYSRLLSSVCKHKDSSCFDVIGYLEDLRKTRFSPGLRD 253 Query: 541 YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720 Y+ V+ FLV+ GR + + +LNQMK ++P++VCYT VL GV++ + KA+++FDE+L Sbjct: 254 YTVVMRFLVEGGRGKEVVSVLNQMKCDRVEPDLVCYTIVLQGVIADEDYPKADKLFDELL 313 Query: 721 VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900 +LG+ PD+YTYN+Y+ GLC Q++ E KM+ M +LG +PNV TYN L+ AL KAG+L Sbjct: 314 LLGLAPDVYTYNVYINGLCKQNDIEGALKMMSSMNKLGSEPNVVTYNILIKALVKAGDLS 373 Query: 901 KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080 +A+ + K+M GV NSHT+ I+I I EVV A GLLE + S +E+I Sbjct: 374 RAKTLWKEMETNGVNRNSHTFDIMISAYIEVDEVVCAHGLLEEAFNMNVFVKSSRIEEVI 433 Query: 1081 CGLCKRGLVSEAQQVLTEMI 1140 LC++GL+ +A ++L ++ Sbjct: 434 SRLCEKGLMDQAVELLAHLV 453 >gb|AAM98219.1| unknown protein [Arabidopsis thaliana] gi|31376375|gb|AAP49514.1| At2g38420 [Arabidopsis thaliana] Length = 444 Score = 335 bits (860), Expect = 3e-89 Identities = 168/380 (44%), Positives = 257/380 (67%) Frame = +1 Query: 1 YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180 Y F+IK L ++SQ + + +L LE EKF+ PE IF ++I YGF+ +I++AIE+FF+I Sbjct: 66 YRFVIKTLAKSSQLENISSVLYHLEVSEKFDTPESIFRDVIAAYGFSGRIEEAIEVFFKI 125 Query: 181 PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360 P FRC PS +L+A+L VL +K++ L++V ++L+K+ M +RLEE++F ILI LC+I Sbjct: 126 PNFRCVPSAYTLNALLLVLVRKRQSLELVPEILVKACR-MGVRLEESTFGILIDALCRIG 184 Query: 361 KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540 + A E++ M D +LYS +LSS+CK +D S +V+G+LE++RK FSP D Sbjct: 185 EVDCATELVRYMSQDSVIVDPRLYSRLLSSVCKHKDSSCFDVIGYLEDLRKTRFSPGLRD 244 Query: 541 YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720 Y+ V+ FLV+ GR + + +LNQMK ++P++VCYT VL GV++ + KA+++FDE+L Sbjct: 245 YTVVMRFLVEGGRGKEVVSVLNQMKCDRVEPDLVCYTIVLQGVIADEDYPKADKLFDELL 304 Query: 721 VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900 +LG+ PD+YTYN+Y+ GLC Q++ E KM+ M +LG +PNV TYN L+ AL KAG+L Sbjct: 305 LLGLAPDVYTYNVYINGLCKQNDIEGALKMMSSMNKLGSEPNVVTYNILIKALVKAGDLS 364 Query: 901 KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080 +A+ + K+M GV NSHT+ I+I I EVV A GLLE + S +E+I Sbjct: 365 RAKTLWKEMETNGVNRNSHTFDIMISAYIEVDEVVCAHGLLEEAFNMNVFVKSSRIEEVI 424 Query: 1081 CGLCKRGLVSEAQQVLTEMI 1140 LC++GL+ +A ++L ++ Sbjct: 425 SRLCEKGLMDQAVELLAHLV 444 >ref|XP_006854116.1| hypothetical protein AMTR_s00048p00149840 [Amborella trichopoda] gi|548857785|gb|ERN15583.1| hypothetical protein AMTR_s00048p00149840 [Amborella trichopoda] Length = 464 Score = 298 bits (763), Expect = 5e-78 Identities = 161/376 (42%), Positives = 235/376 (62%) Frame = +1 Query: 13 IKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRIPKFR 192 I IL QN QF L +L L+ KF+ PE + LI+ + +++A+++FF +P R Sbjct: 88 IVILAQNPQFSGLKTLLRCLQSNRKFSTPETRIIGLIQSCASSKMVKEALDLFFAMPHLR 147 Query: 193 CTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKINKFSY 372 C PS +SL+A+LSVLC + +V ++L+K+ E MNIRL+ +SFRILI LC+I K + Sbjct: 148 CQPSTTSLNALLSVLCDT-DSFHLVPELLIKTLE-MNIRLDASSFRILIGSLCRIGKLGF 205 Query: 373 AIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFDYSNV 552 AIE+L LMP G PDS Y+ IL LC+ + S E+ GFL+E++ GF P+ Y+ V Sbjct: 206 AIELLRLMPDQGCWPDSGFYAEILCKLCEFGEFS--EIYGFLDEMKDAGFFPDKIAYAIV 263 Query: 553 IGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEMLVLGV 732 I L K GR +A ILN+MK+ G KP+ + YT ++DG G+F +A EVFDEML +G+ Sbjct: 264 IDSLAKGGRLNEARAILNRMKLEGAKPDTITYTSMMDGFYKIGEFKQAGEVFDEMLAMGL 323 Query: 733 IPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELRKARE 912 +PD++TY++Y+ GLC + E ++L M E+GC+PNV TYNTL+ C G LR+A E Sbjct: 324 VPDVFTYSVYINGLCRERKLEEAKEVLCVMREMGCRPNVITYNTLIRTFCSDGNLRRADE 383 Query: 913 VLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMICGLC 1092 ++ +MG GV NS TY LI+ + EG VVEA LL M+ KG P ++ ++ Sbjct: 384 LVAEMGSNGVCGNSVTYRTLINAYLREGMVVEANELLVQMVGKGFFPHFSTWEALLSSTV 443 Query: 1093 KRGLVSEAQQVLTEMI 1140 + + +A L E+I Sbjct: 444 FKWDILQALNALEELI 459 Score = 74.3 bits (181), Expect = 1e-10 Identities = 55/235 (23%), Positives = 93/235 (39%), Gaps = 35/235 (14%) Frame = +1 Query: 586 DALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEMLVLGVIPDIYTYNIYV 765 +ALD+ M +P+ +L + FH E+ + L + + D ++ I + Sbjct: 135 EALDLFFAMPHLRCQPSTTSLNALLSVLCDTDSFHLVPELLIKTLEMNIRLDASSFRILI 194 Query: 766 KGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALC--------------------- 882 LC ++L M + GC P+ Y ++ LC Sbjct: 195 GSLCRIGKLGFAIELLRLMPDQGCWPDSGFYAEILCKLCEFGEFSEIYGFLDEMKDAGFF 254 Query: 883 --------------KAGELRKAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGL 1020 K G L +AR +L +M L+G K ++ TY ++D GE +A + Sbjct: 255 PDKIAYAIVIDSLAKGGRLNEARAILNRMKLEGAKPDTITYTSMMDGFYKIGEFKQAGEV 314 Query: 1021 LEVMLRKGLVPTSIIFDEMICGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALL 1185 + ML GLVP + I GLC+ + EA++VL M P ++ L+ Sbjct: 315 FDEMLAMGLVPDVFTYSVYINGLCRERKLEEAKEVLCVMREMGCRPNVITYNTLI 369 Score = 67.8 bits (164), Expect = 1e-08 Identities = 47/189 (24%), Positives = 93/189 (49%), Gaps = 1/189 (0%) Frame = +1 Query: 655 VLDGVVSAGQFHKAEEVFDEMLVLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELG 834 ++ S+ +A ++F M L P + N + LC +F ++L+ E+ Sbjct: 123 LIQSCASSKMVKEALDLFFAMPHLRCQPSTTSLNALLSVLCDTDSFHLVPELLIKTLEMN 182 Query: 835 CKPNVTTYNTLMGALCKAGELRKAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAF 1014 + + +++ L+G+LC+ G+L A E+L+ M +G +S Y ++ +L GE E + Sbjct: 183 IRLDASSFRILIGSLCRIGKLGFAIELLRLMPDQGCWPDSGFYAEILCKLCEFGEFSEIY 242 Query: 1015 GLLEVMLRKGLVPTSIIFDEMICGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLLSR 1194 G L+ M G P I + +I L K G ++EA+ +L M P ++ +++ Sbjct: 243 GFLDEMKDAGFFPDKIAYAIVIDSLAKGGRLNEARAILNRMKLEGAKPDTITYTSMMDGF 302 Query: 1195 FEL-SFKEA 1218 +++ FK+A Sbjct: 303 YKIGEFKQA 311