BLASTX nr result
ID: Rheum21_contig00008481
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rheum21_contig00008481 (1446 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003632411.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope... 382 e-103 ref|XP_004146883.1| PREDICTED: pentatricopeptide repeat-containi... 374 e-101 ref|XP_006349176.1| PREDICTED: pentatricopeptide repeat-containi... 374 e-101 gb|EOX95289.1| Tetratricopeptide repeat (TPR)-like superfamily p... 372 e-100 ref|XP_004229380.1| PREDICTED: pentatricopeptide repeat-containi... 371 e-100 emb|CBI36964.3| unnamed protein product [Vitis vinifera] 366 2e-98 ref|XP_002515265.1| pentatricopeptide repeat-containing protein,... 360 6e-97 ref|XP_002301516.2| pentatricopeptide repeat-containing family p... 354 5e-95 gb|EMJ22677.1| hypothetical protein PRUPE_ppa005037mg [Prunus pe... 352 3e-94 gb|AFN53641.1| pentatricopeptide repeat-containing protein [Linu... 342 2e-91 ref|XP_006480799.1| PREDICTED: pentatricopeptide repeat-containi... 340 9e-91 ref|XP_002277733.1| PREDICTED: pentatricopeptide repeat-containi... 339 2e-90 ref|XP_006429067.1| hypothetical protein CICLE_v10011521mg [Citr... 338 3e-90 emb|CAN71816.1| hypothetical protein VITISV_023421 [Vitis vinifera] 338 3e-90 ref|XP_006307239.1| hypothetical protein CARUB_v10008848mg [Caps... 332 2e-88 ref|XP_002889397.1| pentatricopeptide repeat-containing protein ... 332 2e-88 ref|XP_004288536.1| PREDICTED: pentatricopeptide repeat-containi... 330 7e-88 ref|XP_006396396.1| hypothetical protein EUTSA_v10029427mg [Eutr... 330 9e-88 ref|NP_171739.1| pentatricopeptide repeat-containing protein [Ar... 330 9e-88 gb|AAL38889.1| unknown protein [Arabidopsis thaliana] 328 3e-87 >ref|XP_003632411.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g02370, mitochondrial-like [Vitis vinifera] Length = 506 Score = 382 bits (981), Expect = e-103 Identities = 199/444 (44%), Positives = 289/444 (65%), Gaps = 2/444 (0%) Frame = +3 Query: 120 RLYRGISAIGATGDKVADVLDGYVREGHVVRKDDLLDCVRKLRNYKRHHHCLEIFEWMDK 299 RLY +SA+GATG VAD L+ +++EG ++ K +L C+++LR Y++ H LEI +WM+ Sbjct: 45 RLYHRLSALGATGGSVADTLNEHIKEGKLIAKHELSSCIKQLRKYRQFQHALEIMDWMEN 104 Query: 300 MKHNNSHKDNALRLDLICKTKGIAEAEKFFESLSAENKNLYTYGALLNCYCTDKLSDKAA 479 K S+ D A+RLDL+ KTKG+A AE++F +LS KNL TYG LLNCYC +K+ +KA Sbjct: 105 RKIFFSYADYAVRLDLLSKTKGLATAEEYFNNLSPSAKNLLTYGTLLNCYCKEKMEEKAL 164 Query: 480 ELLKEMEKLNYVS-SLTFNNLMTLALRSEEPEKVPGLAEEMKRRNFPMNSYTYSLLMQSY 656 L ++M++LN+ S SLTFNNLM+L +R +PE VP L +EMK+R+ +++TY++LMQSY Sbjct: 165 ALFEKMDELNFASTSLTFNNLMSLHMRLGKPEMVPPLVDEMKKRSISPDTFTYNILMQSY 224 Query: 657 QLLNDMEAVDRTLQEIEKEGKN-CHWSVYSNLAAVYIRAGNSXXXXXXXXXXXXSMKNLR 833 LND+E +R L+EI++E ++ W+ YSNLAAVY+ N+ + + Sbjct: 225 ARLNDIEGAERVLEEIKRENEDKLSWTTYSNLAAVYV---NARLFEKAELALKKLEEEMG 281 Query: 834 VRDPVPYHFLISFYASIGNLAEVNRIWESLKSDFKTTSNVSYMTMLQALLKLGQLDELVR 1013 D + YHFLIS YA I NL+EVNR+W SLKS F T+N+SY MLQAL L +D L Sbjct: 282 FHDRLAYHFLISLYAGINNLSEVNRVWNSLKSAFPKTNNMSYFIMLQALANLNDVDGLKI 341 Query: 1014 YFKEWESGYKHYDSRIPSLVMAGYLRQNKIGEAEKVFNDAVEKSGRAFCKGREILIAHFL 1193 F+EW+S +D R+ ++ + +L + I +AE + +AV++S F ++ +AH L Sbjct: 342 CFEEWKSSCFSFDVRLANVAVRAFLGWDMIKDAESILYEAVKRSSGPFYTALDMFMAHHL 401 Query: 1194 GKREISRALEHLEAVADAIKETDKWQPSRERIRDFCKYFVEEKDVDGAEXXXXXXXXXXX 1373 REI AL+++EA A +K ++WQP+ ER+ F KYF EEKDV+GAE Sbjct: 402 KVREIDTALKYMEAAASEVK-NNEWQPAPERVLAFLKYFEEEKDVEGAEKFCKILKNISG 460 Query: 1374 VESKSYLWLLRTYVAAEETAPGMR 1445 ++S +Y LL+T VAA T P MR Sbjct: 461 LDSNAYQLLLQTXVAAGRTEPEMR 484 >ref|XP_004146883.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial-like [Cucumis sativus] gi|449518825|ref|XP_004166436.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial-like [Cucumis sativus] Length = 474 Score = 374 bits (960), Expect = e-101 Identities = 193/442 (43%), Positives = 278/442 (62%), Gaps = 1/442 (0%) Frame = +3 Query: 123 LYRGISAIGATGDKVADVLDGYVREGHVVRKDDLLDCVRKLRNYKRHHHCLEIFEWMDKM 302 LY +SA+GATG VA ++ ++ EG++V+K +L C+++LR Y+R+HHCL+I EWM+ Sbjct: 11 LYPRLSALGATGGSVAKTINQFIMEGNIVKKYELEKCIKELRKYRRYHHCLQIMEWMETR 70 Query: 303 KHNNSHKDNALRLDLICKTKGIAEAEKFFESLSAENKNLYTYGALLNCYCTDKLSDKAAE 482 K N S D ALRLDLI K G+ AEK+F L KN TYGALLNCYC + + +KA Sbjct: 71 KINYSFTDYALRLDLISKVNGVTAAEKYFYDLPPSAKNRCTYGALLNCYCKEMMEEKALT 130 Query: 483 LLKEMEKLNYVSSLTFNNLMTLALRSEEPEKVPGLAEEMKRRNFPMNSYTYSLLMQSYQL 662 L K+M++L +SL+FNNLMT+ +R + PEKVP L EMK+R F + ++TY++ M S Sbjct: 131 LFKKMDELKISTSLSFNNLMTMYMRMDHPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCAS 190 Query: 663 LNDMEAVDRTLQEIEKEGKN-CHWSVYSNLAAVYIRAGNSXXXXXXXXXXXXSMKNLRVR 839 LND+ V+ L+E++ E +N W+ YSNLA+ Y++AG MK+ + Sbjct: 191 LNDIGKVEEILEEMKMEDRNKFDWTTYSNLASFYVKAGQFEKAELALKKLEEEMKSDK-N 249 Query: 840 DPVPYHFLISFYASIGNLAEVNRIWESLKSDFKTTSNVSYMTMLQALLKLGQLDELVRYF 1019 D + YH LIS YAS NL+EVNRIW +LKS + T +N+SY+ MLQAL KL ++ L R + Sbjct: 250 DRLVYHCLISLYASTSNLSEVNRIWNALKSVYSTMTNISYLVMLQALRKLKDIEGLKRTY 309 Query: 1020 KEWESGYKHYDSRIPSLVMAGYLRQNKIGEAEKVFNDAVEKSGRAFCKGREILIAHFLGK 1199 KEWES +++D RI + ++ YL+Q+ +A +F DA ++S F + RE+ + +FL Sbjct: 310 KEWESNCRNFDLRIVNDIIGAYLQQDMYEDAAMIFEDATKRSKGPFSRAREMFMVYFLKL 369 Query: 1200 REISRALEHLEAVADAIKETDKWQPSRERIRDFCKYFVEEKDVDGAEXXXXXXXXXXXVE 1379 +++ A HLE+ KE + W PS F YF EEKDV+GAE ++ Sbjct: 370 KQVDSAFSHLESALSESKEKE-WHPSLATTTAFLNYFEEEKDVEGAEDFARILKRLKCLD 428 Query: 1380 SKSYLWLLRTYVAAEETAPGMR 1445 + Y LL+TYVAA + AP MR Sbjct: 429 ASGYHLLLKTYVAAGKLAPDMR 450 >ref|XP_006349176.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial-like [Solanum tuberosum] Length = 495 Score = 374 bits (959), Expect = e-101 Identities = 200/444 (45%), Positives = 279/444 (62%), Gaps = 2/444 (0%) Frame = +3 Query: 120 RLYRGISAIGATGDKVADVLDGYVREGHVVRKDDLLDCVRKLRNYKRHHHCLEIFEWMDK 299 RLYR +SA+GAT V + ++ Y REG VV+K +L C+++LR YKR+ H LEI EWM+K Sbjct: 33 RLYRRLSALGATKGSVTETINAYTREGRVVKKYELEKCIKELRKYKRYQHALEIMEWMEK 92 Query: 300 MKHNNSHKDNALRLDLICKTKGIAEAEKFFESLSAENKNLYTYGALLNCYCTDKLSDKAA 479 N S+ D +RLDLI K +GI AEK+F LS +N TYGALLNCYC +K++DKA Sbjct: 93 RGINFSYGDYGVRLDLIAKVQGITAAEKYFGGLSPSMQNQSTYGALLNCYCVEKMADKAL 152 Query: 480 ELLKEMEKLNYVS-SLTFNNLMTLALRSEEPEKVPGLAEEMKRRNFPMNSYTYSLLMQSY 656 ++M++L + + SL FNNLM+L +R +PEKV L +EMK R P+ +++Y++ M SY Sbjct: 153 SFFEKMDQLKFTNKSLAFNNLMSLYMRLGQPEKVAPLVQEMKSRKVPLCTFSYNIWMNSY 212 Query: 657 QLLNDMEAVDRTLQEIEKE-GKNCHWSVYSNLAAVYIRAGNSXXXXXXXXXXXXSMKNLR 833 L+D+E V+R +E+++E K C W+ YSNLA Y++AG++ M Sbjct: 213 SCLDDIEGVERVFEELKQENAKECDWTTYSNLAVAYVKAGHNEKAELALKKLEEEM---G 269 Query: 834 VRDPVPYHFLISFYASIGNLAEVNRIWESLKSDFKTTSNVSYMTMLQALLKLGQLDELVR 1013 R+ YH+LIS +A I NL EV RIW SLKS T N SY+ MLQ+L K +D L + Sbjct: 270 PRNRQAYHYLISLHARISNLGEVYRIWGSLKSSLDLT-NSSYLVMLQSLSKHNDMDGLKK 328 Query: 1014 YFKEWESGYKHYDSRIPSLVMAGYLRQNKIGEAEKVFNDAVEKSGRAFCKGREILIAHFL 1193 Y++EWES YD R+ + V+ YLR + + AEKVF+ A+++S F E+ + +L Sbjct: 329 YYEEWESSCSTYDMRLANNVIGAYLRHDMLNNAEKVFHSALKRSQGPFFLAWEMFMLFYL 388 Query: 1194 GKREISRALEHLEAVADAIKETDKWQPSRERIRDFCKYFVEEKDVDGAEXXXXXXXXXXX 1373 KR+I+ A + +EA+A IKE +KW+P E I +F +YFVEEKDVDGAE Sbjct: 389 RKRQINFAQQCMEAIASRIKE-NKWRPKYETISNFLEYFVEEKDVDGAEDFYKFLKKVNC 447 Query: 1374 VESKSYLWLLRTYVAAEETAPGMR 1445 + S Y LLRTY AA T MR Sbjct: 448 LSSDVYSSLLRTYAAANRTTDDMR 471 >gb|EOX95289.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative [Theobroma cacao] Length = 501 Score = 372 bits (955), Expect = e-100 Identities = 204/483 (42%), Positives = 296/483 (61%), Gaps = 2/483 (0%) Frame = +3 Query: 3 LRGFCAATSLQAETAVEAVEHASSAGVSADKEDSMSTMWRLYRGISAIGATGDKVADVLD 182 +R C ATS +A+ A+ A S + RLY +SA+ ATG V++ L+ Sbjct: 14 VRKLCTATSEKAKIK------AAVAAASPMRN-------RLYPRLSALAATGGTVSEALN 60 Query: 183 GYVREGHVVRKDDLLDCVRKLRNYKRHHHCLEIFEWMDKMKHNNSHKDNALRLDLICKTK 362 ++ EG +RKD+L CV++LR Y+R+ H L+I +WM++ + SH D+A+RLDLI KTK Sbjct: 61 DFIMEGKKIRKDELGRCVKELRKYRRYQHALDIMDWMERRNLHLSHVDHAIRLDLIAKTK 120 Query: 363 GIAEAEKFFESLSAENKNLYTYGALLNCYCTDKLSDKAAELLKEMEKLNYV-SSLTFNNL 539 GI AE + +L KN TYGALLNCYC + + DKA+ L ++M++L + ++L FNNL Sbjct: 121 GIDAAENYLSALPPSAKNQLTYGALLNCYCNNLMKDKASSLFQKMDELRFTNNTLPFNNL 180 Query: 540 MTLALRSEEPEKVPGLAEEMKRRNFPMNSYTYSLLMQSYQLLNDMEAVDRTLQEIEKEGK 719 M L +R +PEKVP L +E+K RN P +TY + MQSY LND+E V+R L+E+ ++ + Sbjct: 181 MCLYMRLGQPEKVPELVDELKLRNIPRCRFTYVVWMQSYANLNDIEGVERVLEELAQDSE 240 Query: 720 N-CHWSVYSNLAAVYIRAGNSXXXXXXXXXXXXSMKNLRVRDPVPYHFLISFYASIGNLA 896 + C W+ Y+NLAA+Y++AG K++ R YHFLIS YA NLA Sbjct: 241 DKCTWTTYNNLAAIYVKAG---LFEKAEACLKKLEKDMMPRQREAYHFLISLYAGTSNLA 297 Query: 897 EVNRIWESLKSDFKTTSNVSYMTMLQALLKLGQLDELVRYFKEWESGYKHYDSRIPSLVM 1076 EV+R+WE+LK F T +N SY+ M+QAL KL L+ L + F EWES YD R+ + + Sbjct: 298 EVHRVWEALKRAFSTVTNTSYLVMVQALAKLKDLEGLKKCFAEWESSCSAYDIRLATSTI 357 Query: 1077 AGYLRQNKIGEAEKVFNDAVEKSGRAFCKGREILIAHFLGKREISRALEHLEAVADAIKE 1256 GYL + + EAE V +A+++S F K RE+ + +FL K + AL+H+EAV + E Sbjct: 358 RGYLSGDLLEEAELVLGNAMKRSKGPFHKVRELFMVYFLEKCQFDLALQHVEAV---VSE 414 Query: 1257 TDKWQPSRERIRDFCKYFVEEKDVDGAEXXXXXXXXXXXVESKSYLWLLRTYVAAEETAP 1436 W+P+ E I F YF++E+DVD AE ++S +Y LL+TYVAA + AP Sbjct: 415 MGDWRPAPETITAFFDYFMKERDVDAAEEFCRILKSKNGLDSNAYHLLLKTYVAAGKVAP 474 Query: 1437 GMR 1445 MR Sbjct: 475 DMR 477 >ref|XP_004229380.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial-like [Solanum lycopersicum] Length = 495 Score = 371 bits (953), Expect = e-100 Identities = 200/444 (45%), Positives = 279/444 (62%), Gaps = 2/444 (0%) Frame = +3 Query: 120 RLYRGISAIGATGDKVADVLDGYVREGHVVRKDDLLDCVRKLRNYKRHHHCLEIFEWMDK 299 RLYR +SA+GAT V + ++ Y REG VV+K +L C+++LR YKR+ H LEI EWM+K Sbjct: 33 RLYRRLSALGATKGSVTETINAYTREGRVVKKYELEKCIKELRKYKRYQHALEIMEWMEK 92 Query: 300 MKHNNSHKDNALRLDLICKTKGIAEAEKFFESLSAENKNLYTYGALLNCYCTDKLSDKAA 479 N S+ D +RLDLI K +GI AEK+F SLS +N TYGALLNCYC +K++DKA Sbjct: 93 RGINFSYGDYGVRLDLIAKVQGITAAEKYFGSLSPSMQNQSTYGALLNCYCVEKMTDKAL 152 Query: 480 ELLKEMEKLNYVS-SLTFNNLMTLALRSEEPEKVPGLAEEMKRRNFPMNSYTYSLLMQSY 656 + M++L + + SL FNNLM+L +R +PEKV L +EMK R P+ +++Y++ M SY Sbjct: 153 TFFERMDQLKFTNRSLAFNNLMSLYMRLGQPEKVAPLVQEMKSRKVPLCTFSYNVWMNSY 212 Query: 657 QLLNDMEAVDRTLQEIEKE-GKNCHWSVYSNLAAVYIRAGNSXXXXXXXXXXXXSMKNLR 833 L+D+E V+R +E+++E K C W+ YSNLA Y++AG++ M Sbjct: 213 SCLDDIEGVERVFEELKQENAKECDWTTYSNLAVAYVKAGHNEKAELALKKLEEEM---G 269 Query: 834 VRDPVPYHFLISFYASIGNLAEVNRIWESLKSDFKTTSNVSYMTMLQALLKLGQLDELVR 1013 R+ YH+LIS +A I NL EV RIW SLKS T N SY+ MLQ+L K +D L + Sbjct: 270 PRNRQAYHYLISLHARISNLGEVYRIWGSLKSSLDLT-NSSYLVMLQSLSKHNDMDGLKK 328 Query: 1014 YFKEWESGYKHYDSRIPSLVMAGYLRQNKIGEAEKVFNDAVEKSGRAFCKGREILIAHFL 1193 Y++EWES YD R+ + V+ YLR + + AEKVF+ A+++S F E+ + +L Sbjct: 329 YYEEWESSCSTYDMRLANNVIGAYLRHDMLNNAEKVFHCALKRSQGPFFLAWEMFMLFYL 388 Query: 1194 GKREISRALEHLEAVADAIKETDKWQPSRERIRDFCKYFVEEKDVDGAEXXXXXXXXXXX 1373 KR+I+ A + +EA+A IKE +KW+P E I +F +YFVEEKDVDGAE Sbjct: 389 RKRQINFAQQCMEAIASRIKE-NKWRPKYETISNFLEYFVEEKDVDGAEEFYKFLKKVNC 447 Query: 1374 VESKSYLWLLRTYVAAEETAPGMR 1445 + S Y LLRTY AA T M+ Sbjct: 448 LSSDVYNSLLRTYAAANRTTDDMK 471 >emb|CBI36964.3| unnamed protein product [Vitis vinifera] Length = 526 Score = 366 bits (939), Expect = 2e-98 Identities = 187/409 (45%), Positives = 273/409 (66%), Gaps = 2/409 (0%) Frame = +3 Query: 120 RLYRGISAIGATGDKVADVLDGYVREGHVVRKDDLLDCVRKLRNYKRHHHCLEIFEWMDK 299 RLY +SA+GATG VAD L+ +++EG ++ K +L C+++LR Y++ H LEI +WM+ Sbjct: 84 RLYHRLSALGATGGSVADTLNEHIKEGKLIAKHELSSCIKQLRKYRQFQHALEIMDWMEN 143 Query: 300 MKHNNSHKDNALRLDLICKTKGIAEAEKFFESLSAENKNLYTYGALLNCYCTDKLSDKAA 479 K S+ D A+RLDL+ KTKG+A AE++F +LS KNL TYG LLNCYC +K+ +KA Sbjct: 144 RKIFFSYADYAVRLDLLSKTKGLATAEEYFNNLSPSAKNLLTYGTLLNCYCKEKMEEKAL 203 Query: 480 ELLKEMEKLNYVS-SLTFNNLMTLALRSEEPEKVPGLAEEMKRRNFPMNSYTYSLLMQSY 656 L ++M++LN+ S SLTFNNLM+L +R +PE VP L +EMK+R+ +++TY++LMQSY Sbjct: 204 ALFEKMDELNFASTSLTFNNLMSLHMRLGKPEMVPPLVDEMKKRSISPDTFTYNILMQSY 263 Query: 657 QLLNDMEAVDRTLQEIEKEGKN-CHWSVYSNLAAVYIRAGNSXXXXXXXXXXXXSMKNLR 833 LND+E +R L+EI++E ++ W+ YSNLAAVY+ N+ + + Sbjct: 264 ARLNDIEGAERVLEEIKRENEDKLSWTTYSNLAAVYV---NARLFEKAELALKKLEEEMG 320 Query: 834 VRDPVPYHFLISFYASIGNLAEVNRIWESLKSDFKTTSNVSYMTMLQALLKLGQLDELVR 1013 D + YHFLIS YA I NL+EVNR+W SLKS F T+N+SY MLQAL L +D L Sbjct: 321 FHDRLAYHFLISLYAGINNLSEVNRVWNSLKSAFPKTNNMSYFIMLQALANLNDVDGLKI 380 Query: 1014 YFKEWESGYKHYDSRIPSLVMAGYLRQNKIGEAEKVFNDAVEKSGRAFCKGREILIAHFL 1193 F+EW+S +D R+ ++ + +L + I +AE + +AV++S F ++ +AH L Sbjct: 381 CFEEWKSSCFSFDVRLANVAVRAFLGWDMIKDAESILYEAVKRSSGPFYTALDMFMAHHL 440 Query: 1194 GKREISRALEHLEAVADAIKETDKWQPSRERIRDFCKYFVEEKDVDGAE 1340 REI AL+++EA A +K ++WQP+ ER+ F KYF EEKDV+GAE Sbjct: 441 KVREIDTALKYMEAAASEVK-NNEWQPAPERVLAFLKYFEEEKDVEGAE 488 >ref|XP_002515265.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223545745|gb|EEF47249.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 504 Score = 360 bits (925), Expect = 6e-97 Identities = 197/450 (43%), Positives = 278/450 (61%), Gaps = 8/450 (1%) Frame = +3 Query: 120 RLYRGISAIGATGDKVADVLDGYVREGHVVRKDDLLDCVRKLRNYKRHHHCLEIFEWMDK 299 +LY +SA+GATG V+ L+ ++ EG + K +L C+R+LR Y+R H EI EWM+K Sbjct: 39 KLYHKLSALGATGGSVSRTLNEHIMEGKTITKIELSRCIRELRKYRRFDHAFEIMEWMEK 98 Query: 300 MKHNNSHKDNALRLDLICKTKGIAEAEKFFESLSAENKNLYT-YGALLNCYCTDKLSDKA 476 K N S+ D A+RLDLI K +GIA AE +F LS KN +T YGALLNCYC + +SDKA Sbjct: 99 RKMNFSYADRAIRLDLIGKARGIAAAEDYFNGLSPSAKNHHTSYGALLNCYCKELMSDKA 158 Query: 477 AELLKEMEKLNYV-SSLTFNNLMTLALRSEEPEKVPGLAEEMKRRNFPMNSYTYSLLMQS 653 L +EM++ ++ SSL FNNLM++ +R +PEKVP L +EMK+R S+TY++ MQS Sbjct: 159 LALFQEMDEKKFLYSSLPFNNLMSMYMRLGQPEKVPPLVDEMKKRKVSPCSFTYNIWMQS 218 Query: 654 YQLLNDMEAVDRTLQEIEKEG--KNCHWSVYSNLAAVYIRAGNSXXXXXXXXXXXXSMKN 827 Y LND + VDR L+EI +G N W+ YSNLA +Y++AG ++K Sbjct: 219 YGCLNDFQGVDRVLREIVNDGGKDNLQWTTYSNLATIYLKAG-------IFEKAESALKK 271 Query: 828 LRV----RDPVPYHFLISFYASIGNLAEVNRIWESLKSDFKTTSNVSYMTMLQALLKLGQ 995 L R+ YHFLIS YA GN EVNR+W LKS F +N+SY+ MLQAL KL Sbjct: 272 LEAIMGFRNREAYHFLISIYAGTGNSNEVNRVWGLLKSSFNMINNLSYLVMLQALAKLKD 331 Query: 996 LDELVRYFKEWESGYKHYDSRIPSLVMAGYLRQNKIGEAEKVFNDAVEKSGRAFCKGREI 1175 ++ + + F+EWESG +YD RI ++ + +L+ + EAE +F+DA++++ F K RE Sbjct: 332 VEGVAKCFREWESGCTNYDMRIANVAIRVFLQHDMYEEAELIFDDALKRTRGPFFKARER 391 Query: 1176 LIAHFLGKREISRALEHLEAVADAIKETDKWQPSRERIRDFCKYFVEEKDVDGAEXXXXX 1355 + FL ++ AL+H+ A A + E +W+P +E + + YF EKDVDGAE Sbjct: 392 FMLFFLKIHQLDLALKHMRA-AFSESEKHEWKPLQETVNAYFDYFRTEKDVDGAEKLSKI 450 Query: 1356 XXXXXXVESKSYLWLLRTYVAAEETAPGMR 1445 + S Y LL+TY+AA + AP MR Sbjct: 451 LKHINCLNSSVYSLLLKTYIAAGKLAPEMR 480 >ref|XP_002301516.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550345391|gb|EEE80789.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 511 Score = 354 bits (909), Expect = 5e-95 Identities = 194/484 (40%), Positives = 281/484 (58%), Gaps = 16/484 (3%) Frame = +3 Query: 42 TAVEAVEH----ASSAGVSADKEDSMSTMWRLYRGISAIGATGDKVADVLDGYVREGHVV 209 T AV H A + V+A+K R+YR +S +GA+G V+ L+ + EG Sbjct: 11 TGSRAVRHLCVAAEAVPVAAEKRK------RMYRRLSELGASGGSVSKTLNELILEGGKT 64 Query: 210 RKDDLLDCVRKLRNYKRHHHCLEIFEWMDKMKHNNSHKDNALRLDLICKTKGIAEAEKFF 389 K +L C++KLR Y R H +E+ EWM K K N SH D+A+ LDL KTKGIA AE +F Sbjct: 65 SKINLTTCIKKLRKYGRFDHAIEVMEWMQKRKMNFSHVDHAVYLDLTAKTKGIAAAENYF 124 Query: 390 ESLSAENKNLYTYGALLNCYCTDKLSDKAAELLKEMEKLNYVS-SLTFNNLMTLALRSEE 566 ++L +N TY LLNCYC +++S+KA L ++M+K+ +S S+ F+NLMTL +R + Sbjct: 125 DNLPPSVQNHVTYSTLLNCYCKERMSEKALTLFEKMDKMKLLSTSMPFSNLMTLHMRLGQ 184 Query: 567 PEKVPGLAEEMKRRNFPMNSYTYSLLMQSYQLLNDMEAVDRTLQEIEKEGK-NCHWSVYS 743 PEKV + +EMK+R ++TY++ MQSY LND E V R L E++ +GK N W+ YS Sbjct: 185 PEKVLDIVQEMKQRGVSPGTFTYNIWMQSYGCLNDFEGVQRVLDEMKTDGKENFSWTTYS 244 Query: 744 NLAAVYIRAG----------NSXXXXXXXXXXXXSMKNLRVRDPVPYHFLISFYASIGNL 893 NLA +Y++AG K D YHFLIS YA NL Sbjct: 245 NLATIYVKAGLFDKAESALRKLEEQIECGRDCDFQKKRRHDADREAYHFLISLYAGTSNL 304 Query: 894 AEVNRIWESLKSDFKTTSNVSYMTMLQALLKLGQLDELVRYFKEWESGYKHYDSRIPSLV 1073 +EV+R+W SLKS F+TT+N+SY+ +LQAL KL ++ L++ FKEWES YD R+ ++ Sbjct: 305 SEVHRVWNSLKSSFRTTTNISYLNVLQALAKLKDVEGLLKCFKEWESSCHSYDMRLANVA 364 Query: 1074 MAGYLRQNKIGEAEKVFNDAVEKSGRAFCKGREILIAHFLGKREISRALEHLEAVADAIK 1253 + L + EA +F++A++++ F K RE+ + FL + AL+H++A K Sbjct: 365 IRACLEHDMYEEAASIFDEALKRTKGLFFKAREMFMVFFLKNHQPDLALKHMKAAFSEAK 424 Query: 1254 ETDKWQPSRERIRDFCKYFVEEKDVDGAEXXXXXXXXXXXVESKSYLWLLRTYVAAEETA 1433 E + WQP ++ + F YF + KDVDGAE + S +Y+ LL+TY AA A Sbjct: 425 EIE-WQPDQKTVSAFLNYFEDGKDVDGAERLCKIWKQINRLNSNAYILLLKTYTAAGRLA 483 Query: 1434 PGMR 1445 P MR Sbjct: 484 PEMR 487 >gb|EMJ22677.1| hypothetical protein PRUPE_ppa005037mg [Prunus persica] Length = 480 Score = 352 bits (902), Expect = 3e-94 Identities = 186/411 (45%), Positives = 259/411 (63%), Gaps = 4/411 (0%) Frame = +3 Query: 120 RLYRGISAIGATGDKVADVLDGYVREGHVVRKDDLLDCVRKLRNYKRHHHCLEIFEWMDK 299 RLYR +SA+GATG VA L+ Y+ EG +++K +L C+++LR Y++ H LEI EWM+ Sbjct: 37 RLYRRLSALGATGGSVAKTLNQYIMEGKMLKKYELERCIKELRKYRKFQHALEIMEWMEF 96 Query: 300 MKHNNSHKDNALRLDLICKTKGIAEAEKFFESLSAENKNLYTYGALLNCYCTDKLSDKAA 479 K N S D A+RLDL K KGI AE +F LS K+ +TYGALLNCYC + + +KA Sbjct: 97 RKMNYSKADFAIRLDLTSKVKGIEAAEDYFSGLSPSLKDRFTYGALLNCYCKELMEEKAL 156 Query: 480 ELLKEMEKLNYV-SSLTFNNLMTLALRSEEPEKVPGLAEEMKRRNFPMNSYTYSLLMQSY 656 L + M++L + SSL FNNLM++ +R ++PEKV L +EMK+RN P++++TY++ MQS+ Sbjct: 157 ALYETMDELEFASSSLVFNNLMSMHMRKQQPEKVAPLVQEMKQRNIPLDTFTYNIWMQSF 216 Query: 657 QLLNDMEAVDRTLQEIEK-EGKNCHWSVYSNLAAVYIRAGNSXXXXXXXXXXXXSMKNLR 833 LND E +R L E++K +G C WS YSNLAA+Y++A MK L+ Sbjct: 217 ASLNDFEGAERVLDEMQKQDGNQCSWSTYSNLAAIYVKAKIFDKAELALKKSEEMMKPLK 276 Query: 834 VRDPVPYHFLISFYASIGNLAEVNRIWESLKSDFKTTSNVSYMTMLQALLKLGQLDELVR 1013 R+ YHFLIS YA NL EV R+WESLK F T+N+SY+ MLQAL KL ++ L Sbjct: 277 QRN--TYHFLISLYACTSNLGEVKRVWESLKKAFPATNNMSYLIMLQALCKLNDIEGLKE 334 Query: 1014 YFKEWESGYKHYDSRIPSLVMAGYLRQNKIGEAEKVFNDAVEKSGRAFCKGREILIAHFL 1193 F+EWE YD R+ + + GYL Q+ EA VF DA +++ F K RE+ + +FL Sbjct: 335 CFEEWECKCSSYDMRLANTAIRGYLSQDMYEEAALVFADACKRTKGPFFKAREMFMLYFL 394 Query: 1194 GKREISRALEHLEAVADAIKET--DKWQPSRERIRDFCKYFVEEKDVDGAE 1340 ++ A+ +L A A+ ET +W PS + F KYF EEKDV+ AE Sbjct: 395 KNCQVDLAVSYLGA---AVSETADGEWHPSPDTTSAFFKYFEEEKDVESAE 442 >gb|AFN53641.1| pentatricopeptide repeat-containing protein [Linum usitatissimum] Length = 516 Score = 342 bits (878), Expect = 2e-91 Identities = 193/480 (40%), Positives = 282/480 (58%), Gaps = 9/480 (1%) Frame = +3 Query: 33 QAETAVEAVEHASSAGVSADKEDSMSTMWRLYRGISAIGATGDKVADVLDGYVREGHVVR 212 Q TA EA AS+ S ++ S RLYR +SA+GATG V VL+ YV +G +R Sbjct: 18 QLSTATEA--SASAPDPSYKEQRSSGNPLRLYRRLSALGATGGTVDKVLNDYVMDGMSIR 75 Query: 213 KDDLLDCVRKLRNYKRHHHCLEIFEWMDKMKHNNSHKDNALRLDLICKTKGIAEAEKFFE 392 K +L+ CV++LR Y R +H LEI EWM+K N H D A+RLDLICKTKGI EAE +F Sbjct: 76 KVELMRCVKELRKYGRFNHGLEIMEWMEKRGINLGHGDLAVRLDLICKTKGITEAENYFN 135 Query: 393 SLSAENKNLYTYGALLNCYCTDKLSDKAAELLKEMEKLNYV-SSLTFNNLMTLALRSEEP 569 L KN TYG+LLN YC S+KA +L ++M+KL + +SL FNNLM++ +R + Sbjct: 136 GLVPSAKNPATYGSLLNSYCKKLDSEKALQLFQKMDKLKFFRNSLPFNNLMSMYMRLGQQ 195 Query: 570 EKVPGLAEEMKRRNFPMNSYTYSLLMQSYQLLNDMEAVDRTLQEIEKE---GKNCHWSVY 740 EKVP L +MK+ N P ++TY++ +QS + D E + + L+E+ + G N +W+ Y Sbjct: 196 EKVPELVSQMKQMNLPPCTFTYNIWIQSLGHMRDFEGIKKVLEEMRNDVNFGNNFNWTTY 255 Query: 741 SNLAAVYIRAGNSXXXXXXXXXXXXSMKNLRVRDPVPYHFLISFYASIGNLAEVNRIWES 920 SNLAAVY AG + + D YHFL++ Y I +L EV+R+W Sbjct: 256 SNLAAVYTSAGE---FERAKLALKMMEERIDSHDRNAYHFLLTLYGGIADLEEVHRVWGC 312 Query: 921 LKSDFKTTSNVSYMTMLQALLKLGQLDELVRYFKEWESGYKHYDSRIPSLVMAGYLRQNK 1100 LK+ F +N SY+ MLQAL +L ++ + + F+EWES YD R+ ++ + YL + Sbjct: 313 LKAKFNQVTNASYLVMLQALARLKDVEGISKVFEEWESVCTSYDMRVANVAIRVYLEKGM 372 Query: 1101 IGEAEKVFNDAVEKSGRAFCKGREILIAHFLGKREISRALEHLEAVADAIKETDK----- 1265 EAE VF+ A+E++ + K RE+L+ L +R++ AL+ ++A + + +K Sbjct: 373 YNEAEAVFDGAMERTPGPYFKTREMLMVSLLKRRQLEPALKQMKAAFTEVGQNEKGHEKE 432 Query: 1266 WQPSRERIRDFCKYFVEEKDVDGAEXXXXXXXXXXXVESKSYLWLLRTYVAAEETAPGMR 1445 W+PS E + F YF EEKDV+GAE +S Y L++TY+AA ++A MR Sbjct: 433 WRPSAEIVNAFFGYFEEEKDVEGAEKMWKILKCINRCDSTVYRLLMKTYIAAGKSAVDMR 492 >ref|XP_006480799.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial-like [Citrus sinensis] Length = 508 Score = 340 bits (872), Expect = 9e-91 Identities = 182/446 (40%), Positives = 273/446 (61%), Gaps = 4/446 (0%) Frame = +3 Query: 120 RLYRGISAIGATGDKVADVLDGYVREGHVVRKDDLLDCVRKLRNYKRHHHCLEIFEWMDK 299 +LY+ +SA+GATG V L+ Y+ EG VRKD L CVR LR + R+ H LE+ EWM+ Sbjct: 45 KLYKRLSALGATGGSVTGALNAYIMEGKTVRKDMLEYCVRSLRKFGRYRHALEVIEWMES 104 Query: 300 MKHNNSHKDNALRLDLICKTKGIAEAEKFFESLSAENKNLYTYGALLNCYCTDKLSDKAA 479 K + S+ D A+ LDL KTKGIA AE++F LS KN YTYGALLNCYC + ++++A Sbjct: 105 RKIHFSYTDFAVYLDLTAKTKGIAAAEEYFNGLSEYAKNRYTYGALLNCYCKELMTERAL 164 Query: 480 ELLKEMEKLNYV-SSLTFNNLMTLALRSEEPEKVPGLAEEMKRRNFPMNSYTYSLLMQSY 656 L ++M++L ++ +++ FNNL T+ LR +PEKV L +MK+RN +++ TY + MQSY Sbjct: 165 ALFEKMDELKFLGNTVAFNNLSTMYLRLGQPEKVRPLVNQMKQRNISLDNLTYIVWMQSY 224 Query: 657 QLLNDMEAVDRTLQEIEKEGKN-CHWSVYSNLAAVYIRAGNSXXXXXXXXXXXXSMKNLR 833 LND++ V+R E+ E ++ C W+ YSNLA++Y++A ++ ++ Sbjct: 225 SHLNDIDGVERVFYEMCNECEDKCRWTTYSNLASIYVKA----ELFEKAELALKKLEEMK 280 Query: 834 VRDPVPYHFLISFYASIGNLAEVNRIWESLKSDFKTTSNVSYMTMLQALLKLGQLDELVR 1013 RD YHFLIS Y + NL VNR+W LKS F T N SY+ +LQAL KL +D L + Sbjct: 281 PRDRKAYHFLISLYCNTSNLDAVNRVWGILKSTFPPT-NTSYLVLLQALAKLNAIDILKQ 339 Query: 1014 YFKEWESGYKHYDSRIPSLVMAGYLRQNKIGEAEKVFNDAVEKSGRA--FCKGREILIAH 1187 F+EWES YD R+ +++ YL+++ EA +FN+A +++ + F K RE + + Sbjct: 340 CFEEWESRCSSYDMRLADVIIRAYLQKDMYEEAALIFNNAKKRANASARFFKSRESFMIY 399 Query: 1188 FLGKREISRALEHLEAVADAIKETDKWQPSRERIRDFCKYFVEEKDVDGAEXXXXXXXXX 1367 +L R++ AL +EA K+ W+P + + F ++F EEKDVDGAE Sbjct: 400 YLRSRQLDLALNEMEAALSEAKQF-HWRPMQVTVDTFFRFFEEEKDVDGAEEFCKVLKSL 458 Query: 1368 XXVESKSYLWLLRTYVAAEETAPGMR 1445 ++ +Y L++TY+AA + A MR Sbjct: 459 NCLDFSAYSLLIKTYIAAGKLASDMR 484 >ref|XP_002277733.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial [Vitis vinifera] gi|296089781|emb|CBI39600.3| unnamed protein product [Vitis vinifera] Length = 498 Score = 339 bits (869), Expect = 2e-90 Identities = 183/460 (39%), Positives = 275/460 (59%), Gaps = 2/460 (0%) Frame = +3 Query: 69 SSAGVSADKEDSMSTMWRLYRGISAIGATGDKVADVLDGYVREGHVVRKDDLLDCVRKLR 248 S+A +A K+ S W+L SA+ T VA+ LD +V+EG V++ D++ CV +LR Sbjct: 21 STATATAAKDKEASLYWKL----SALRGTEGDVAETLDKWVKEGKSVKRFDMISCVNQLR 76 Query: 249 NYKRHHHCLEIFEWMDKMKHNNSHKDNALRLDLICKTKGIAEAEKFFESLSAENKNLYTY 428 +K++ H +I+EWM+K K++ ++ D A+R+DL+ KT+GIA+AE +F SL K TY Sbjct: 77 RFKKYKHAAQIYEWMEKSKNDLNNADRAIRIDLLAKTEGIAQAENYFNSLQESAKTNKTY 136 Query: 429 GALLNCYCTDKLSDKAAELLKEMEKLNYVSS-LTFNNLMTLALRSEEPEKVPGLAEEMKR 605 GALLNCYC + + DKA EL K++++LN+VSS L++NN+++L LR +PEKVP L EM+ Sbjct: 137 GALLNCYCKENMLDKAVELFKKLKELNFVSSALSYNNMISLYLRVGQPEKVPSLVHEMEE 196 Query: 606 RNFPMNSYTYSLLMQSYQLLNDMEAVDRTLQEIEKEGKNCHWSVYSNLAAVYIRAGNSXX 785 ++ P + YTY+LLM SY + D EAV++ L +++K G W Y NLA +Y+ AG++ Sbjct: 197 KDIPADLYTYNLLMNSYASVKDFEAVEQVLDKMKKRGVERDWFTYGNLANIYVDAGHTKK 256 Query: 786 XXXXXXXXXXSMKNLRVRDPVPYHFLISFYASIGNLAEVNRIWESLKSDFKTTSNVSYMT 965 +N + DP + LI+ YA NL VNR WESLK +N SY+ Sbjct: 257 ANYALQKLE---QNKNLHDPEAFRMLINLYARTSNLEGVNRAWESLKLAHPKINNKSYLI 313 Query: 966 MLQALLKLGQLDELVRYFKEWESGYKHYDSRIPSLVMAGYLRQNKIGEAEKVFNDAVEKS 1145 ML AL KLG + L + FKEWESG YD R+ ++++ YL + I EA + ++ Sbjct: 314 MLLALSKLGDVAGLEKCFKEWESGCSTYDVRLSNVMLESYLNREIIEEANLLSESIAKRG 373 Query: 1146 GRAFCKGREILIAHFLGKREISRALEHLE-AVADAIKETDKWQPSRERIRDFCKYFVEEK 1322 K ++ + +L K ++ A+++L+ + A E +KW P+ E I F +YF E K Sbjct: 374 PELKLKTLDLFMKFYLKKHQLDLAMKYLDMGASKADPENNKWFPTEETITMFLEYFEEVK 433 Query: 1323 DVDGAEXXXXXXXXXXXVESKSYLWLLRTYVAAEETAPGM 1442 DVD AE ++SK Y LLRTY+AA + P M Sbjct: 434 DVDSAEKFCETMRKISRLDSKIYDSLLRTYIAAGKEEPLM 473 >ref|XP_006429067.1| hypothetical protein CICLE_v10011521mg [Citrus clementina] gi|557531124|gb|ESR42307.1| hypothetical protein CICLE_v10011521mg [Citrus clementina] Length = 508 Score = 338 bits (867), Expect = 3e-90 Identities = 182/446 (40%), Positives = 273/446 (61%), Gaps = 4/446 (0%) Frame = +3 Query: 120 RLYRGISAIGATGDKVADVLDGYVREGHVVRKDDLLDCVRKLRNYKRHHHCLEIFEWMDK 299 +LY+ +SA+GATG V L+ Y+ EG VRKD L CVR LR + R+ H LE+ EWM+ Sbjct: 45 KLYKRLSALGATGGSVTGALNAYIMEGKTVRKDMLEYCVRSLRKFGRYRHALEVIEWMES 104 Query: 300 MKHNNSHKDNALRLDLICKTKGIAEAEKFFESLSAENKNLYTYGALLNCYCTDKLSDKAA 479 K + S+ D A+ LDL KTKGIA AE++F SLS KN YTYGALLNCYC + ++++A Sbjct: 105 RKIHFSYTDFAVYLDLTAKTKGIAAAEEYFNSLSEYAKNRYTYGALLNCYCKELMTERAL 164 Query: 480 ELLKEMEKLNYV-SSLTFNNLMTLALRSEEPEKVPGLAEEMKRRNFPMNSYTYSLLMQSY 656 L ++M++L ++ +++ FNNL T+ LR +PEKV L +MK+RN +++ TY + MQSY Sbjct: 165 ALFEKMDELKFLGNTVAFNNLSTMYLRLGQPEKVRPLVNQMKQRNISLDNLTYIVWMQSY 224 Query: 657 QLLNDMEAVDRTLQEIEKEGKN-CHWSVYSNLAAVYIRAGNSXXXXXXXXXXXXSMKNLR 833 LND++ V+R E+ E ++ C W+ YSNLA++Y++A ++ ++ Sbjct: 225 SHLNDIDGVERVFYEMCNECEDKCRWTTYSNLASIYVKA----ELFEKAELALKKLEEMK 280 Query: 834 VRDPVPYHFLISFYASIGNLAEVNRIWESLKSDFKTTSNVSYMTMLQALLKLGQLDELVR 1013 RD YHFLIS Y + NL VNR+W LKS F T N S + +LQAL KL +D L + Sbjct: 281 PRDRKAYHFLISLYCNTSNLDAVNRVWGILKSTFPPT-NTSSLVLLQALAKLNAIDILKQ 339 Query: 1014 YFKEWESGYKHYDSRIPSLVMAGYLRQNKIGEAEKVFNDAVEKSGRA--FCKGREILIAH 1187 F+EWES YD R+ +++ YL+++ EA +FN+A +++ + F K RE + + Sbjct: 340 CFEEWESRCSSYDMRLADVIIRAYLQKDMYEEAALIFNNAKKRANASARFFKSRESFMIY 399 Query: 1188 FLGKREISRALEHLEAVADAIKETDKWQPSRERIRDFCKYFVEEKDVDGAEXXXXXXXXX 1367 +L R++ AL +EA K+ W+P + + F ++F EEKDVDGAE Sbjct: 400 YLRSRQLDLALNEMEAALSEAKQF-HWRPMQVTVDTFFRFFEEEKDVDGAEEFCKVLKSL 458 Query: 1368 XXVESKSYLWLLRTYVAAEETAPGMR 1445 ++ +Y L++TY+AA + A MR Sbjct: 459 NCLDFSAYSLLIKTYIAAGKLASDMR 484 >emb|CAN71816.1| hypothetical protein VITISV_023421 [Vitis vinifera] Length = 494 Score = 338 bits (867), Expect = 3e-90 Identities = 181/455 (39%), Positives = 273/455 (60%), Gaps = 2/455 (0%) Frame = +3 Query: 84 SADKEDSMSTMWRLYRGISAIGATGDKVADVLDGYVREGHVVRKDDLLDCVRKLRNYKRH 263 +A K+ S W+L SA+ T VA+ LD +V+EG V++ D++ CV +LR +K++ Sbjct: 22 TATKDKEASLYWKL----SALRGTEGDVAETLDKWVKEGKSVKRFDMISCVNQLRRFKKY 77 Query: 264 HHCLEIFEWMDKMKHNNSHKDNALRLDLICKTKGIAEAEKFFESLSAENKNLYTYGALLN 443 H +I+EWM+K K++ ++ D A+R+DL+ KT+GIA+AE +F SL K TYGALLN Sbjct: 78 KHAAQIYEWMEKSKNDLNNADRAIRIDLLAKTEGIAQAENYFNSLQESAKTNKTYGALLN 137 Query: 444 CYCTDKLSDKAAELLKEMEKLNYVSS-LTFNNLMTLALRSEEPEKVPGLAEEMKRRNFPM 620 CYC + + DKA EL K++++LN+VSS L++NN+++L LR +PEKVP L EM+ ++ P Sbjct: 138 CYCKENMVDKAVELFKKLKELNFVSSALSYNNMISLYLRVGQPEKVPSLVHEMEEKDIPA 197 Query: 621 NSYTYSLLMQSYQLLNDMEAVDRTLQEIEKEGKNCHWSVYSNLAAVYIRAGNSXXXXXXX 800 + YTY+LLM SY + D EAV++ L++++K G W Y NLA +Y+ AG++ Sbjct: 198 DLYTYNLLMNSYASVKDFEAVEQVLEKMKKRGVERDWFTYGNLANIYVDAGHTKKANYAL 257 Query: 801 XXXXXSMKNLRVRDPVPYHFLISFYASIGNLAEVNRIWESLKSDFKTTSNVSYMTMLQAL 980 +N + DP + LI+ YA NL VNR WESLK +N SY+ ML AL Sbjct: 258 QKLE---QNKNLHDPEAFRMLINLYARTSNLEGVNRAWESLKLAHPKINNKSYLIMLLAL 314 Query: 981 LKLGQLDELVRYFKEWESGYKHYDSRIPSLVMAGYLRQNKIGEAEKVFNDAVEKSGRAFC 1160 KLG + L + FKEWESG YD R+ ++++ YL + I EA + ++ Sbjct: 315 SKLGDVAGLEKCFKEWESGCSTYDVRLSNVMLESYLNREMIEEANLLSESIAKRGPELKL 374 Query: 1161 KGREILIAHFLGKREISRALEHLE-AVADAIKETDKWQPSRERIRDFCKYFVEEKDVDGA 1337 K ++ + +L K ++ A+++L+ + A E +KW P+ E I F +YF E KDVD A Sbjct: 375 KTLDLFMKFYLKKHQLDLAMKYLDMGASKADPENNKWFPTEETITMFLEYFEEVKDVDSA 434 Query: 1338 EXXXXXXXXXXXVESKSYLWLLRTYVAAEETAPGM 1442 E ++SK Y LLRTY+AA + P M Sbjct: 435 EKFCETMRKISRLDSKIYDSLLRTYIAAGKEEPLM 469 >ref|XP_006307239.1| hypothetical protein CARUB_v10008848mg [Capsella rubella] gi|482575950|gb|EOA40137.1| hypothetical protein CARUB_v10008848mg [Capsella rubella] Length = 523 Score = 332 bits (852), Expect = 2e-88 Identities = 197/487 (40%), Positives = 283/487 (58%), Gaps = 9/487 (1%) Frame = +3 Query: 12 FCA-ATSLQAETAVEAVEHASSA--GV--SADKEDSMSTMWRLYRGISAIGATGDKVADV 176 FCA A S A VEA A +A GV S + S LY+ +S + TG VA+ Sbjct: 17 FCATAVSPAATNGVEASVFAPAAANGVQGSVSAPTAASRQRELYKKLSKLNVTGGTVAET 76 Query: 177 LDGYVREGHVVRKDDLLDCVRKLRNYKRHHHCLEIFEWMDKMKHNNSHKDNALRLDLICK 356 L+ ++ EG VRKDDL C + LR ++RH H LEIF+WM+K K S D+A+RLDLI K Sbjct: 77 LNQFIMEGVTVRKDDLFRCAKDLRKFRRHQHALEIFDWMEKRKMTLSVSDHAIRLDLIGK 136 Query: 357 TKGIAEAEKFFESLSAENKNLY-TYGALLNCYCTDKLSDKAAELLKEMEKLNYVS-SLTF 530 TKG+ EAE +F +L KN TYGAL+NCYC +KA ++M++L + + SL F Sbjct: 137 TKGLEEAENYFNNLDPSVKNHQSTYGALMNCYCVVLEEEKAKAHFEKMDELKFANNSLPF 196 Query: 531 NNLMTLALRSEEPEKVPGLAEEMKRRNFPMNSYTYSLLMQSYQLLNDMEAVDRTLQEIEK 710 NN+M++ +R +PEKVP L EMK+R+ + TYS+ MQS LND++ +++ + E+ K Sbjct: 197 NNMMSMYMRLGQPEKVPVLVNEMKQRDISPSGVTYSIWMQSCGSLNDLDGLEKIIDEMGK 256 Query: 711 EGK-NCHWSVYSNLAAVYIRAGNSXXXXXXXXXXXXSMKNLRVRDPVPYHFLISFYASIG 887 + + W+ +SNLAA+Y +AG M N RD +HFLIS YA I Sbjct: 257 DSEAKTTWNTFSNLAAIYTKAGLYEKAESALKSMEEKM-NPNYRDS--HHFLISLYAGIS 313 Query: 888 NLAEVNRIWESLKSDFKTTSNVSYMTMLQALLKLGQLDELVRYFKEWESGYKHYDSRIPS 1067 EV R+WE LK +N+SY+ MLQA+ KLG LD + + F EWES YD R+ + Sbjct: 314 KAPEVFRVWELLKKARPEVNNMSYLVMLQAMSKLGDLDGIKKIFTEWESKCWAYDMRLAN 373 Query: 1068 LVMAGYLRQNKIGEAEKVFNDAVEKSGRAFCKGREILIAHFLGKREISRALEHLE-AVAD 1244 + + YL+ N EAEK+ + A++KS F K R++L+ H L + A++HLE AV+D Sbjct: 374 IAINTYLKGNMHEEAEKILDGAMKKSKGPFSKARQLLMIHLLQNGKADLAMKHLEAAVSD 433 Query: 1245 AIKETDKWQPSRERIRDFCKYFVEEKDVDGAEXXXXXXXXXXXVESKSYLWLLRTYVAAE 1424 + + D+W S E + F +F KDV+GAE +S + +L++TY A+ Sbjct: 434 STENKDEWSWSSELVSLFYLHFENAKDVNGAEEFSRILSKWRPFDSDTATFLIKTYAASG 493 Query: 1425 ETAPGMR 1445 +T P MR Sbjct: 494 KTRPDMR 500 >ref|XP_002889397.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297335239|gb|EFH65656.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 524 Score = 332 bits (851), Expect = 2e-88 Identities = 191/488 (39%), Positives = 277/488 (56%), Gaps = 10/488 (2%) Frame = +3 Query: 12 FCAATSLQA------ETAVEAVEHASSAGVSADKEDSMSTMWRLYRGISAIGATGDKVAD 173 FCA A E +V + A+ S + S LY+ +S + G VA+ Sbjct: 17 FCATVFAPASATGVVEASVSSPAAANGVEASVPAPTAASRQRELYKKLSKLSVAGGTVAE 76 Query: 174 VLDGYVREGHVVRKDDLLDCVRKLRNYKRHHHCLEIFEWMDKMKHNNSHKDNALRLDLIC 353 L+ ++ EG VRK DL C + LR ++RH H LEIF+WM+K K S D+A+RLDLI Sbjct: 77 TLNQFIMEGITVRKVDLFRCAKDLRKFRRHQHALEIFDWMEKRKMTFSVSDHAIRLDLIA 136 Query: 354 KTKGIAEAEKFFESLSAENKNLY-TYGALLNCYCTDKLSDKAAELLKEMEKLNYVS-SLT 527 K KG+ AE +F +L KN TYGAL+NCYC + KA ++M++LN+V+ SL Sbjct: 137 KAKGLEAAENYFNNLDPSAKNHQSTYGALMNCYCVELEEGKAKAHFEKMDELNFVNNSLP 196 Query: 528 FNNLMTLALRSEEPEKVPGLAEEMKRRNFPMNSYTYSLLMQSYQLLNDMEAVDRTLQEIE 707 FNN+M++ +R +PEKVP L + MK+R TYS+ MQS LND++ +++ + E+ Sbjct: 197 FNNMMSMYMRLSQPEKVPVLVDAMKQRGISPCGVTYSIWMQSCGSLNDLDGLEKIIDEMG 256 Query: 708 KEGK-NCHWSVYSNLAAVYIRAGNSXXXXXXXXXXXXSMKNLRVRDPVPYHFLISFYASI 884 K+ + W+ +SNLAA++ +AG M N RD +HFLIS YA I Sbjct: 257 KDSEAKTTWNTFSNLAAIFTKAGLYEKAESALKSMEKKM-NPNNRDS--HHFLISLYAGI 313 Query: 885 GNLAEVNRIWESLKSDFKTTSNVSYMTMLQALLKLGQLDELVRYFKEWESGYKHYDSRIP 1064 EV R+WESLK +N+SY+ MLQA+ KLG +D + + F EWES YD R+ Sbjct: 314 SKGTEVYRVWESLKKARPEVNNLSYLVMLQAMSKLGDIDGIKKIFTEWESKCWAYDMRLA 373 Query: 1065 SLVMAGYLRQNKIGEAEKVFNDAVEKSGRAFCKGREILIAHFLGKREISRALEHLE-AVA 1241 ++ + YL+ N EAEK+ + A+EKS F K R++L+ H L + A++HLE AV+ Sbjct: 374 NIAINTYLKGNMYEEAEKILDGAMEKSKGPFSKARQLLMIHLLENGKADLAMKHLETAVS 433 Query: 1242 DAIKETDKWQPSRERIRDFCKYFVEEKDVDGAEXXXXXXXXXXXVESKSYLWLLRTYVAA 1421 D + D+W S E + F +F KDVDGAE V+ ++ +L++TY AA Sbjct: 434 DPAENKDEWSWSSELVSLFFLHFKRAKDVDGAEDFCKILSNWKPVDCETMSFLIKTYAAA 493 Query: 1422 EETAPGMR 1445 E+T P MR Sbjct: 494 EKTCPDMR 501 >ref|XP_004288536.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial-like [Fragaria vesca subsp. vesca] Length = 446 Score = 330 bits (847), Expect = 7e-88 Identities = 181/418 (43%), Positives = 255/418 (61%), Gaps = 4/418 (0%) Frame = +3 Query: 195 EGHVVRKDDLLDCVRKLRNYKRHHHCLEIFEWMDKMKHNNSHKDNALRLDLICKTKGIAE 374 EG ++K +L CV++LR Y++ LEI EWM+ K N S D A+RLDL K KGI Sbjct: 2 EGRKIQKYELDRCVKELRKYRQFQTALEIMEWMEFRKINYSLPDYAVRLDLTAKAKGIEA 61 Query: 375 AEKFFESLSAENKNLYTYGALLNCYCTDKLSDKAAELLKEMEKLNYV-SSLTFNNLMTLA 551 AE +F +L KN TYG+LLNCYC + + +KA L K+M++LNYV S+L FNNLM L Sbjct: 62 AESYFSNLPQSAKNKLTYGSLLNCYCKEVMEEKALALYKKMDELNYVDSALVFNNLMALY 121 Query: 552 LRSEEPEKVPGLAEEMKRRNFPMNSYTYSLLMQSYQLLNDMEAVDRTLQEIEKEGKN-CH 728 +R ++PEKV EEMKRR +++++Y++ MQSY LNDM+ V+ ++E++ + ++ C Sbjct: 122 MRKKQPEKVAPFVEEMKRREIRLDTFSYNIWMQSYASLNDMKGVESVVEEMQSQDEDECD 181 Query: 729 WSVYSNLAAVYIRAGNSXXXXXXXXXXXXSMKNLRVRDPVPYHFLISFYASIGNLAEVNR 908 WS YSNLA++Y++A M + + + YHFLI+ YA+ GNL EV R Sbjct: 182 WSTYSNLASIYVKAQLYEKAEVALKLSEKVMMSGKPQRQT-YHFLITLYANTGNLGEVKR 240 Query: 909 IWESLKSDFKTTSNVSYMTMLQALLKLGQLDELVRYFKEWESGYKHYDSRIPSLVMAGYL 1088 IWESLK F T+N+SY+ ++QAL KL ++ L F+EW+S YD R+ ++V+ YL Sbjct: 241 IWESLKLAFPDTNNISYLLVVQALCKLKDVEGLKECFEEWQSNCSSYDMRLANVVIRAYL 300 Query: 1089 RQNKIGEAEKVFNDAVEKSGRAFCKGREILIAHFLGKREISRALEHLEAVADAIKET--D 1262 QN EA +F DA ++ F K REI +A+FL R+ A+ +LE +AI ET D Sbjct: 301 SQNMYEEALLIFKDATKRCRGPFFKAREIFMAYFLDNRQPDLAISYLE---EAILETKDD 357 Query: 1263 KWQPSRERIRDFCKYFVEEKDVDGAEXXXXXXXXXXXVESKSYLWLLRTYVAAEETAP 1436 +W+PS E I F YF E KD+D AE + S Y LL+ YVAA E P Sbjct: 358 EWRPSPETIAAFLNYFEETKDIDSAENFCKILKRLNCLSSNEYCLLLKVYVAAGEFLP 415 >ref|XP_006396396.1| hypothetical protein EUTSA_v10029427mg [Eutrema salsugineum] gi|557097413|gb|ESQ37849.1| hypothetical protein EUTSA_v10029427mg [Eutrema salsugineum] Length = 505 Score = 330 bits (846), Expect = 9e-88 Identities = 189/488 (38%), Positives = 277/488 (56%), Gaps = 8/488 (1%) Frame = +3 Query: 6 RGFCAATSLQAETAVEAVEHASSAGVSADKEDSMSTMWRLYRGISAIGATGDKVADVLDG 185 R FCA + A A+ A+S K +Y+ +S++G G K+ + L+ Sbjct: 10 RRFCATLAAAAPAAIGGEAAAASVPKKTAKNQRS-----VYKKLSSLGKRGGKMEETLNQ 64 Query: 186 YVREGHVVRKDDLLDCVRKLRNYKRHHHCLEIFEWMDKMKHNNSHKDNALRLDLICKTKG 365 + EG V+KDDL+ V+ LR +++ LE+FEWM++ + S D+A+RLDLI KT G Sbjct: 65 FTMEGIPVKKDDLIRYVKDLRKHRQPQRALEVFEWMERKEIAFSGSDHAIRLDLIAKTNG 124 Query: 366 IAEAEKFFESLSAENKNLYTYGALLNCYCTDKLSDKAAELLKEMEKLNYVS-SLTFNNLM 542 + AE +F SL K +YG+LLNCYC + +KA +M LN V+ SL FNNLM Sbjct: 125 LKAAESYFNSLDLSTKTQSSYGSLLNCYCVEGEEEKAKAHFDKMCDLNLVTNSLPFNNLM 184 Query: 543 TLALRSEEPEKVPGLAEEMKRRNFPMNSYTYSLLMQSYQLLNDMEAVDRTLQEIEKEGKN 722 + LR +PEKVP L MK++N TYS+ +QS +LND++ V++ ++E++ +G Sbjct: 185 AMNLRLGQPEKVPALVVAMKQKNISPCDVTYSMWIQSCGILNDLDGVEKVIEEMKADGGE 244 Query: 723 CH--WSVYSNLAAVYIRAGNSXXXXXXXXXXXXSMKNLRVR----DPVPYHFLISFYASI 884 C W ++NLAA+Y +AG ++K+L + + YHFLIS YA I Sbjct: 245 CRSSWDTFANLAAIYSKAG-------LYSKAEAALKSLEEKMNPHERSSYHFLISLYAGI 297 Query: 885 GNLAEVNRIWESLKSDFKTTSNVSYMTMLQALLKLGQLDELVRYFKEWESGYKHYDSRIP 1064 N EV R+W+ LK +N S +TMLQAL KL +D + + F EWES YD R+ Sbjct: 298 SNAPEVYRVWDLLKKGHPKVNNSSCLTMLQALSKLDDIDGMKKIFTEWESTCYTYDMRMA 357 Query: 1065 SLVMAGYLRQNKIGEAEKVFNDAVEKSGRAFCKGREILIAHFLGKREISRALEHLEAVAD 1244 +++++ YL++N EAE VFN A++K F K R++L+ H L + AL+H EA Sbjct: 358 NVMISSYLKENMYEEAEAVFNGAMKKCKGQFSKARQLLMMHLLKNDQADLALKHFEA--- 414 Query: 1245 AIKETDK-WQPSRERIRDFCKYFVEEKDVDGAEXXXXXXXXXXXVESKSYLWLLRTYVAA 1421 A+ DK W S E IR F +F E KDVDGAE ++S +Y L++TY+AA Sbjct: 415 AVSNQDKNWTWSSELIRSFSLHFEESKDVDGAEEFCKTLTIWSPLDSDTYTLLIKTYIAA 474 Query: 1422 EETAPGMR 1445 E+ PGMR Sbjct: 475 EKACPGMR 482 >ref|NP_171739.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75173342|sp|Q9FZ24.1|PPR4_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At1g02370, mitochondrial; Flags: Precursor gi|9857533|gb|AAG00888.1|AC064879_6 Hypothetical protein [Arabidopsis thaliana] gi|332189300|gb|AEE27421.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 537 Score = 330 bits (846), Expect = 9e-88 Identities = 190/480 (39%), Positives = 281/480 (58%), Gaps = 4/480 (0%) Frame = +3 Query: 18 AATSLQAETAVEAVEHASSAGVSADKEDSMSTMWRLYRGISAIGATGDKVADVLDGYVRE 197 AA ++A + A E+ V+A S LY+ +S + TG VA+ L+ ++ E Sbjct: 40 AANVVEASVSSPAAENGVRTSVAAPTVASRQR--ELYKKLSMLSVTGGTVAETLNQFIME 97 Query: 198 GHVVRKDDLLDCVRKLRNYKRHHHCLEIFEWMDKMKHNNSHKDNALRLDLICKTKGIAEA 377 G VRKDDL C + LR ++R H EIF+WM+K K S D+A+ LDLI KTKG+ A Sbjct: 98 GITVRKDDLFRCAKTLRKFRRPQHAFEIFDWMEKRKMTFSVSDHAICLDLIGKTKGLEAA 157 Query: 378 EKFFESLSAENKNLY-TYGALLNCYCTDKLSDKAAELLKEMEKLNYVS-SLTFNNLMTLA 551 E +F +L KN TYGAL+NCYC + +KA + M++LN+V+ SL FNN+M++ Sbjct: 158 ENYFNNLDPSAKNHQSTYGALMNCYCVELEEEKAKAHFEIMDELNFVNNSLPFNNMMSMY 217 Query: 552 LRSEEPEKVPGLAEEMKRRNFPMNSYTYSLLMQSYQLLNDMEAVDRTLQEIEKEGK-NCH 728 +R +PEKVP L + MK+R TYS+ MQS LND++ +++ + E+ K+ + Sbjct: 218 MRLSQPEKVPVLVDAMKQRGISPCGVTYSIWMQSCGSLNDLDGLEKIIDEMGKDSEAKTT 277 Query: 729 WSVYSNLAAVYIRAGNSXXXXXXXXXXXXSMKNLRVRDPVPYHFLISFYASIGNLAEVNR 908 W+ +SNLAA+Y +AG M N RD +HFL+S YA I EV R Sbjct: 278 WNTFSNLAAIYTKAGLYEKADSALKSMEEKM-NPNNRDS--HHFLMSLYAGISKGPEVYR 334 Query: 909 IWESLKSDFKTTSNVSYMTMLQALLKLGQLDELVRYFKEWESGYKHYDSRIPSLVMAGYL 1088 +WESLK +N+SY+ MLQA+ KLG LD + + F EWES YD R+ ++ + YL Sbjct: 335 VWESLKKARPEVNNLSYLVMLQAMSKLGDLDGIKKIFTEWESKCWAYDMRLANIAINTYL 394 Query: 1089 RQNKIGEAEKVFNDAVEKSGRAFCKGREILIAHFLGKREISRALEHLE-AVADAIKETDK 1265 + N EAEK+ + A++KS F K R++L+ H L + A++HLE AV+D+ + D+ Sbjct: 395 KGNMYEEAEKILDGAMKKSKGPFSKARQLLMIHLLENDKADLAMKHLEAAVSDSAENKDE 454 Query: 1266 WQPSRERIRDFCKYFVEEKDVDGAEXXXXXXXXXXXVESKSYLWLLRTYVAAEETAPGMR 1445 W S E + F +F + KDVDGAE ++S++ +L++TY AAE+T+P MR Sbjct: 455 WGWSSELVSLFFLHFEKAKDVDGAEDFCKILSNWKPLDSETMTFLIKTYAAAEKTSPDMR 514 >gb|AAL38889.1| unknown protein [Arabidopsis thaliana] Length = 537 Score = 328 bits (842), Expect = 3e-87 Identities = 189/480 (39%), Positives = 281/480 (58%), Gaps = 4/480 (0%) Frame = +3 Query: 18 AATSLQAETAVEAVEHASSAGVSADKEDSMSTMWRLYRGISAIGATGDKVADVLDGYVRE 197 AA ++A + A E+ V+A S LY+ +S + TG VA+ L+ ++ E Sbjct: 40 AANVVEASVSSPAAENGVRTSVAAPTVASRQR--ELYKKLSMLSVTGGTVAETLNQFIME 97 Query: 198 GHVVRKDDLLDCVRKLRNYKRHHHCLEIFEWMDKMKHNNSHKDNALRLDLICKTKGIAEA 377 G VRKDDL C + LR ++R H EIF+WM+K K S D+A+ LDLI KTKG+ A Sbjct: 98 GITVRKDDLFRCAKTLRKFRRPQHAFEIFDWMEKRKMTFSVSDHAICLDLIGKTKGLEAA 157 Query: 378 EKFFESLSAENKNLY-TYGALLNCYCTDKLSDKAAELLKEMEKLNYVS-SLTFNNLMTLA 551 E +F +L KN TYGAL+NCYC + +KA + M++LN+V+ SL FNN+M++ Sbjct: 158 ENYFNNLDPSAKNHQSTYGALMNCYCVELEEEKAKAHFEIMDELNFVNNSLPFNNMMSMY 217 Query: 552 LRSEEPEKVPGLAEEMKRRNFPMNSYTYSLLMQSYQLLNDMEAVDRTLQEIEKEGK-NCH 728 +R +PEKVP L + +K+R TYS+ MQS LND++ +++ + E+ K+ + Sbjct: 218 MRLSQPEKVPVLVDAIKQRGISPCGVTYSIWMQSCGSLNDLDGLEKIIDEMGKDSEAKTT 277 Query: 729 WSVYSNLAAVYIRAGNSXXXXXXXXXXXXSMKNLRVRDPVPYHFLISFYASIGNLAEVNR 908 W+ +SNLAA+Y +AG M N RD +HFL+S YA I EV R Sbjct: 278 WNTFSNLAAIYTKAGLYEKADSALKSMEEKM-NPNNRDS--HHFLMSLYAGISKGPEVYR 334 Query: 909 IWESLKSDFKTTSNVSYMTMLQALLKLGQLDELVRYFKEWESGYKHYDSRIPSLVMAGYL 1088 +WESLK +N+SY+ MLQA+ KLG LD + + F EWES YD R+ ++ + YL Sbjct: 335 VWESLKKARPEVNNLSYLVMLQAMSKLGDLDGIKKIFTEWESKCWAYDMRLANIAINTYL 394 Query: 1089 RQNKIGEAEKVFNDAVEKSGRAFCKGREILIAHFLGKREISRALEHLE-AVADAIKETDK 1265 + N EAEK+ + A++KS F K R++L+ H L + A++HLE AV+D+ + D+ Sbjct: 395 KGNMYEEAEKILDGAMKKSKGPFSKARQLLMIHLLENDKADLAMKHLEAAVSDSAENKDE 454 Query: 1266 WQPSRERIRDFCKYFVEEKDVDGAEXXXXXXXXXXXVESKSYLWLLRTYVAAEETAPGMR 1445 W S E + F +F + KDVDGAE ++S++ +L++TY AAE+T+P MR Sbjct: 455 WGWSSELVSLFFLHFEKAKDVDGAEDFCKILSNWKPLDSETMTFLIKTYAAAEKTSPDMR 514