BLASTX nr result
ID: Akebia24_contig00012830
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00012830 (962 letters)

Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

Sequences producing significant alignments:  Score (bits)  E Value

ref|XP_006343601.1| PREDICTED: pentatricopeptide repeat-containi...  177  7e-42
ref|XP_004242995.1| PREDICTED: pentatricopeptide repeat-containi...  174  4e-41
ref|XP_002528404.1| pentatricopeptide repeat-containing protein,...  171  5e-40
ref|XP_003634022.1| PREDICTED: pentatricopeptide repeat-containi...  166  2e-38
ref|XP_007013880.1| Tetratricopeptide repeat (TPR)-like superfam...  164  6e-38
ref|XP_004146719.1| PREDICTED: pentatricopeptide repeat-containi...  162  2e-37
ref|XP_006474045.1| PREDICTED: pentatricopeptide repeat-containi...  158  4e-36
ref|XP_004287149.1| PREDICTED: pentatricopeptide repeat-containi...  157  6e-36
ref|XP_006412665.1| hypothetical protein EUTSA_v10024344mg [Eutr...  154  4e-35
ref|XP_006285536.1| hypothetical protein CARUB_v10006977mg [Caps...  152  1e-34
ref|XP_006453565.1| hypothetical protein CICLE_v10007430mg [Citr...  152  3e-34
ref|XP_007203708.1| hypothetical protein PRUPE_ppa019391mg, part...  150  7e-34
gb|EXB42922.1| Pentatricopeptide repeat-containing protein [Moru...  147  8e-33
ref|XP_006857035.1| hypothetical protein AMTR_s00065p00020910 [A...  141  3e-31
gb|EYU37145.1| hypothetical protein MIMGU_mgv1a000931mg [Mimulus...  140  6e-31
ref|NP_567856.1| pentatricopeptide repeat-containing protein [Ar...  139  2e-30
ref|XP_002869359.1| pentatricopeptide repeat-containing protein ...  137  6e-30
emb|CAA18211.1| puative protein [Arabidopsis thaliana] gi|726998...  134  5e-29
gb|EPS64936.1| hypothetical protein M569_09839, partial [Genlise...
131 4e-28 ref|XP_002444089.1| hypothetical protein SORBIDRAFT_07g007540 [S... 129 1e-27 >ref|XP_006343601.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30825, chloroplastic-like isoform X1 [Solanum tuberosum] gi|565353364|ref|XP_006343602.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30825, chloroplastic-like isoform X2 [Solanum tuberosum] Length = 937 Score = 177 bits (448), Expect = 7e-42 Identities = 121/285 (42%), Positives = 157/285 (55%), Gaps = 32/285 (11%) Frame = -3 Query: 759 MASMKFST-VSEIYETRKSNLLG---NFDRRSDWNSVSSLTGCVQITGTCNVNSFIRFCQ 592 MAS+K V +E++K N NF W V S G G V+ F Sbjct: 1 MASLKLPLYVDSSWESKKLNCTVKALNFTDSKCW--VPSFLG----GGAFVVSPFCNLKH 54 Query: 591 VRVSRSSTDSAHVSESI--QEGLVGKKYPIQN-----------RDIKKNGRNLWTRFHTL 451 +RVSR T+ SE EG+ G + + N RD +K N+W RF + Sbjct: 55 IRVSRLETEELETSELSLDNEGVDGFEGELGNDSFVTERPNLGRDSQKGKFNVWKRFRRV 114 Query: 450 K---RENKGESTLR----KNEEEEPSIISNGSISNELMASLAS--------IGTESSVEH 316 K R++ S+ R KN EE +I+ S+E + + IG++SS++ Sbjct: 115 KKVPRDSNHRSSFRLKDRKNGMEENPMIAFDVNSDESVIDSQNGVDFPDENIGSDSSLDQ 174 Query: 315 CNNILKQLERCNDDQTLCFFDWMRKNGKLKENVIACNLALRVLGRRQDWVTAETLLQDLI 136 CN ILK+LER ND + L FF WMRKNGKLK+NV A NL LRVLGRR DW AE +++++ Sbjct: 175 CNAILKELERGNDGKALSFFRWMRKNGKLKQNVTAYNLILRVLGRRGDWDGAEGMIKEMS 234 Query: 135 TNLGSKLNFQVFNTLIYACSKRGLGALGTKWFHLMLENGVQPNIA 1 G KL +QVFNTLIYAC K+GL LG KWFH+MLENGVQPNIA Sbjct: 235 MESGCKLTYQVFNTLIYACHKKGLVELGAKWFHMMLENGVQPNIA 279 >ref|XP_004242995.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30825, chloroplastic-like [Solanum lycopersicum] Length = 1201 Score = 174 bits (442), Expect = 4e-41 Identities = 107/238 (44%), Positives = 139/238 (58%), Gaps = 28/238 (11%) Frame = -3 Query: 630 GTCNVNSFIRFCQVRVSRSSTDSAHVSE-SIQ-------EGLVGKKY-----PIQNRDIK 490 G V+ F +RVSR T+ SE SI EG +G + P RD K Sbjct: 306 GAFVVSPFCNLKHIRVSRLETEELETSELSIDNEGVDGFEGELGNESFVTERPNLGRDSK 365 Query: 489 KNGRNLWTRFHTLKRENKGE---STLRKNE-----EEEPSIISNGSISNELMASL----- 349 K N+W RF +K+ K S+ 
R + EE P I+ + + ++ S Sbjct: 366 KGKFNVWRRFRRVKKVPKDSNYRSSFRLKDRKYGTEENPRIVFDVNSDENVIDSQNGVDF 425 Query: 348 --ASIGTESSVEHCNNILKQLERCNDDQTLCFFDWMRKNGKLKENVIACNLALRVLGRRQ 175 +IG++SS++ CN ILK+LER +D + L FF WMRKNGKLK+NV A NL LRVLGRR Sbjct: 426 HDENIGSDSSLDQCNAILKELERGDDGKALSFFRWMRKNGKLKQNVTAYNLILRVLGRRG 485 Query: 174 DWVTAETLLQDLITNLGSKLNFQVFNTLIYACSKRGLGALGTKWFHLMLENGVQPNIA 1 DW AE +++++ G KL +QVFNTLIYAC K+GL LG KWFH+MLENGVQPNIA Sbjct: 486 DWDGAEGMIKEMSMESGCKLTYQVFNTLIYACHKKGLVELGAKWFHMMLENGVQPNIA 543 >ref|XP_002528404.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223532192|gb|EEF33997.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 955 Score = 171 bits (432), Expect = 5e-40 Identities = 108/272 (39%), Positives = 155/272 (56%), Gaps = 14/272 (5%) Frame = -3 Query: 774 RLQ*IMASMKFSTVSEIYETRKSNLLGNFDRRSDWNSVSSLTGCVQITGTCNVNSFIRFC 595 +L+ MAS++ + + ++++K N N + S S S++ G C + + F Sbjct: 32 KLERTMASLRLTISLDTFDSKKPNFSRNPLQLSTHTSPFSISSSTPSPGACIITTLTTFS 91 Query: 594 QVRVSR-------------SSTDSAHVSESIQEGLVGKKYPIQNRDIKKNGRNLWTRFHT 454 V+VSR +S D H E I EGL+ + P R+I+K R + Sbjct: 92 PVKVSRIETELFEDDVVLSTSNDLPH--ECINEGLIDRN-PNSKREIRKKYRGGAKKRGK 148 Query: 453 LKRENKGESTLRKNEEEEPSIISNGSISNELMASLASIGTESSVEHCNNILKQLERCN-D 277 K K E+E + G EL + + I S+EHCN ILK+LERC+ D Sbjct: 149 RKVGFKFNYKRNGIEQEIEDLFVEGG---ELDVNYSVIHCNLSLEHCNLILKRLERCSSD 205 Query: 276 DQTLCFFDWMRKNGKLKENVIACNLALRVLGRRQDWVTAETLLQDLITNLGSKLNFQVFN 97 D++L FF+WMR NGKL++N+ A N+ LRVLGRR+DW TAE ++ ++ + GS+L+F+VFN Sbjct: 206 DKSLRFFEWMRNNGKLEKNLNAYNVILRVLGRREDWGTAERMIGEVSDSFGSELDFRVFN 265 Query: 96 TLIYACSKRGLGALGTKWFHLMLENGVQPNIA 1 TLIYACS+RG LG KWF +MLE GVQPNIA Sbjct: 266 TLIYACSRRGNMLLGGKWFRMMLELGVQPNIA 297 >ref|XP_003634022.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30825, chloroplastic-like [Vitis vinifera] gi|297745081|emb|CBI38673.3| unnamed protein product [Vitis vinifera] Length = 900 Score = 166 bits (419), Expect = 2e-38 
Identities = 108/268 (40%), Positives = 147/268 (54%), Gaps = 15/268 (5%) Frame = -3 Query: 759 MASMKFSTVSEIYETRKSNLLGNFDRRSDWNSVSSLTGCVQITGTCNVNSFIRFCQVRVS 580 MAS+KFS + Y++ K + S+ + I +NSF R + +S Sbjct: 1 MASLKFSVSVDTYDSNKFHF--------------SVNPSLPI-----INSFARVKPINIS 41 Query: 579 RSSTDSAHVSESIQEGLVGKKYPIQNRDI--------KKNGRN-LWTRFHTLKR------ 445 R +S S+S V N+D N RN +W R +KR Sbjct: 42 RLEAESWDTSDS---NSVVDNIKTWNKDSGSENLILESSNFRNDIWRRVQGVKRVRRRDP 98 Query: 444 ENKGESTLRKNEEEEPSIISNGSISNELMASLASIGTESSVEHCNNILKQLERCNDDQTL 265 +K S N EE +++ +E+ + IG E SVE CN ILK LERC+D +T+ Sbjct: 99 NSKFRSIRNDNGHEEQKSVNH--FDDEIDVNEYGIGPELSVERCNAILKGLERCSDSKTM 156 Query: 264 CFFDWMRKNGKLKENVIACNLALRVLGRRQDWVTAETLLQDLITNLGSKLNFQVFNTLIY 85 FF+WMR+NGKL+ NV A NLALRVLGRR DW AET++ ++ + ++NFQV+NTLIY Sbjct: 157 KFFEWMRENGKLEGNVSAYNLALRVLGRRGDWDAAETMIWEMNGDSDCQVNFQVYNTLIY 216 Query: 84 ACSKRGLGALGTKWFHLMLENGVQPNIA 1 AC K+G LGTKWF LMLENGV+PN+A Sbjct: 217 ACYKQGHVELGTKWFRLMLENGVRPNVA 244 >ref|XP_007013880.1| Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma cacao] gi|508784243|gb|EOY31499.1| Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma cacao] Length = 916 Score = 164 bits (414), Expect = 6e-38 Identities = 107/272 (39%), Positives = 142/272 (52%), Gaps = 19/272 (6%) Frame = -3 Query: 759 MASMKFSTVSEIYETRKSNLLGNFDRRSDWNSVSSLTGCVQIT-GTCNVNSFIRFCQVRV 583 MAS+K + +++K N N D S+ S T C+ +T N+ S R +V Sbjct: 1 MASLKLPISLDTVDSKKLNFYVNPSHVPDHCSIFSFTSCIHVTKAASNLTSLTRLKHFKV 60 Query: 582 SRSSTDSAHVSE----------SIQEGLV--------GKKYPIQNRDIKKNGRNLWTRFH 457 SR T+ ++ E S + LV G+K + I+KN F Sbjct: 61 SRFETEFPNIPEPSPVDKDIHFSSKIDLVNENPKFVEGQKGQNPKKGIRKN-----VGFK 115 Query: 456 TLKRENKGESTLRKNEEEEPSIISNGSISNELMASLASIGTESSVEHCNNILKQLERCND 277 R N+ E E E + N S L ++I ++ HCN ILK+LER ND Sbjct: 116 FRFRRNRNEI------EREDLFVHNNS---GLDVDYSAIKPNLNLPHCNFILKRLERSND 166 Query: 276 DQTLCFFDWMRKNGKLKENVIACNLALRVLGRRQDWVTAETLLQDLITNLGSKLNFQVFN 97 L FF+WMR NGKLK NV A L LRVLGRR+DW AE +L+ + G KLNFQVFN 
Sbjct: 167 SNALRFFEWMRSNGKLKGNVTAYRLVLRVLGRREDWDAAEMMLRQANGDSGCKLNFQVFN 226 Query: 96 TLIYACSKRGLGALGTKWFHLMLENGVQPNIA 1 T+IYACSK+GL LG KWF +MLE+G +PN+A Sbjct: 227 TIIYACSKKGLVELGAKWFRMMLEHGFRPNVA 258 >ref|XP_004146719.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30825, chloroplastic-like [Cucumis sativus] Length = 894 Score = 162 bits (409), Expect = 2e-37 Identities = 100/256 (39%), Positives = 148/256 (57%), Gaps = 3/256 (1%) Frame = -3 Query: 759 MASMKFSTVSEIYETRKSNLLGNFDRRSDWNSVSSLTGCVQITGTCNVNSFIRFCQV-RV 583 MAS+K S +++ K + N SD+ S+ S+ + + + + S R + +V Sbjct: 1 MASLKLSFSLHSFDSNKFDFPLNSPLLSDYCSLFSINAHLHLNKSSIIYSLARVHKPSKV 60 Query: 582 SRSSTDSAHVSESIQEGLVGKKYPIQNRDIKKNGRNLWTRFHTLKRENK--GESTLRKNE 409 S+ D++ VS+S + +V +K ++ T K+ +K S + Sbjct: 61 SQVEQDASDVSQSRFDEIVARK-----------------KYFTSKKPSKRAAGSHFSFSR 103 Query: 408 EEEPSIISNGSISNELMASLASIGTESSVEHCNNILKQLERCNDDQTLCFFDWMRKNGKL 229 +I+ NG EL + ++I ++ S+E CN ILK+LE+CND +TL FF+WMR NGKL Sbjct: 104 NCNDNILFNGG---ELDVNYSTISSDLSLEDCNAILKRLEKCNDSKTLGFFEWMRSNGKL 160 Query: 228 KENVIACNLALRVLGRRQDWVTAETLLQDLITNLGSKLNFQVFNTLIYACSKRGLGALGT 49 K NV A NL LRVLGR++DW AE L++++ LGS+L+FQVFNTLIYAC K GT Sbjct: 161 KHNVSAYNLVLRVLGRQEDWDAAEKLIEEVRAELGSQLDFQVFNTLIYACYKSRFVEQGT 220 Query: 48 KWFHLMLENGVQPNIA 1 KWF +MLE VQPN+A Sbjct: 221 KWFRMMLECQVQPNVA 236 >ref|XP_006474045.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30825, chloroplastic-like [Citrus sinensis] Length = 915 Score = 158 bits (399), Expect = 4e-36 Identities = 104/269 (38%), Positives = 152/269 (56%), Gaps = 16/269 (5%) Frame = -3 Query: 759 MASMKFSTVS-EIYETRKSNLLGNFDRRSDWNSVSSLTGCVQITGTCNVNSFIRFCQVRV 583 MAS+K ++S + ++RK N N + SD + S T +T + V V Sbjct: 1 MASLKLLSISLDTVDSRKLNFAANPPQLSDHFPIFSFTMSCIVTASNRVKHV-----KNV 55 Query: 582 SRSSTDSAHVSESIQ-----EGLVGKKYPIQ-----NRDIKKNGRNLWTRFHTLKRENKG 433 S S TD ++ES + E VG + + +R +KK R+ K + Sbjct: 56 SSSETDLCSMNESKETDIGIENDVGSEVFVGECSNVSRKVKKG------RYGVKKGSKRD 109 Query: 432 
-ESTLR----KNEEEEPSIISNGSISNELMASLASIGTESSVEHCNNILKQLERCNDDQT 268 + +LR E+E +N EL + + IG + S++ CN ILK+LE+ +D ++ Sbjct: 110 VDMSLRFRRSAREQEREYFFAN---DGELDVNYSVIGADLSLDECNAILKRLEKYSDSKS 166 Query: 267 LCFFDWMRKNGKLKENVIACNLALRVLGRRQDWVTAETLLQDLITNLGSKLNFQVFNTLI 88 L FF+WMR NGKL++NV A NL LRV RR+DW AE +++++ +LG+KLNFQ+FNTLI Sbjct: 167 LKFFEWMRTNGKLEKNVTAYNLVLRVFSRREDWDAAEKMIREVRMSLGAKLNFQLFNTLI 226 Query: 87 YACSKRGLGALGTKWFHLMLENGVQPNIA 1 YAC+KRG LG KWFH+MLE VQPN+A Sbjct: 227 YACNKRGCVELGAKWFHMMLECDVQPNVA 255 >ref|XP_004287149.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30825, chloroplastic-like [Fragaria vesca subsp. vesca] Length = 885 Score = 157 bits (397), Expect = 6e-36 Identities = 94/211 (44%), Positives = 132/211 (62%), Gaps = 5/211 (2%) Frame = -3 Query: 618 VNSFIRFCQVRVSRSSTDSAHVSESIQEGLVGKKYPIQNRDIKKN--GRNLWTRFHTLKR 445 VNS R ++V+R ++ +V+ES+ E QN D ++ G+ + KR Sbjct: 31 VNSLNRVNAIKVNRFQSE-LNVAESLNE---------QNPDCSRHEIGKGISGTKRLSKR 80 Query: 444 ENKGESTLRKNE---EEEPSIISNGSISNELMASLASIGTESSVEHCNNILKQLERCNDD 274 E S+ RK++ + E +++G E + I ++ S+EHCN+ILK+LER +D Sbjct: 81 EVGLRSSSRKSKWVRKLENVFVNDG----EFDVDYSVIKSDMSLEHCNDILKRLERSSDF 136 Query: 273 QTLCFFDWMRKNGKLKENVIACNLALRVLGRRQDWVTAETLLQDLITNLGSKLNFQVFNT 94 +TL FF+WMR NGKLK NV A N RVLGRR++W AE L+Q+++T G +LN+QVFNT Sbjct: 137 KTLKFFEWMRINGKLKGNVSAFNSVFRVLGRRENWDAAENLIQEMVTEFGCELNYQVFNT 196 Query: 93 LIYACSKRGLGALGTKWFHLMLENGVQPNIA 1 LIYACSK G LG KWF +MLE GVQPN+A Sbjct: 197 LIYACSKLGRVELGAKWFAMMLEYGVQPNVA 227 >ref|XP_006412665.1| hypothetical protein EUTSA_v10024344mg [Eutrema salsugineum] gi|557113835|gb|ESQ54118.1| hypothetical protein EUTSA_v10024344mg [Eutrema salsugineum] Length = 916 Score = 154 bits (390), Expect = 4e-35 Identities = 94/266 (35%), Positives = 153/266 (57%), Gaps = 13/266 (4%) Frame = -3 Query: 759 MASMKFSTVSEIYETRKSNLLGNFDRRSDWNSVSSLTGCVQITGTCNVNSFIRFCQVRVS 580 M S++ ST + +++++ + N + +D + S+T + T T + S I + RV+ Sbjct: 1 MVSLRLSTPLDPFDSKRFHFSANPFQFTDQFPIFSVTSSISATRTFTIGSPISVNKTRVA 
60 Query: 579 RSST---------DSAHVSESIQEGLVGKKYPIQNRDIKKNGRNLWT-RFHTLKRENKGE 430 R T D + +S+ E VG+ + + K G N+ + +K++ + Sbjct: 61 RLDTEANEAENAIDRSSEDDSVSEASVGRSWSSK----LKGGNNVTSSNKRGIKKDVTRK 116 Query: 429 STLRKNEEE---EPSIISNGSISNELMASLASIGTESSVEHCNNILKQLERCNDDQTLCF 259 S+ R+ E E ++NG E+ + +++ + S+EH N ILK+LE C+D + F Sbjct: 117 SSFRRESNELELEGLFVNNG----EMDVNYSAMKPDLSLEHYNGILKRLECCSDTNAVKF 172 Query: 258 FDWMRKNGKLKENVIACNLALRVLGRRQDWVTAETLLQDLITNLGSKLNFQVFNTLIYAC 79 FDWMR GKL+ N++A +L LRVL RR++W AE L+++L G + +FQVFNT+IYAC Sbjct: 173 FDWMRCKGKLEGNIVAYSLILRVLARREEWDRAEDLIKELCGFQGFQQSFQVFNTVIYAC 232 Query: 78 SKRGLGALGTKWFHLMLENGVQPNIA 1 SK+G LG+KWF LMLE GV+PN+A Sbjct: 233 SKKGNVKLGSKWFQLMLELGVRPNVA 258 >ref|XP_006285536.1| hypothetical protein CARUB_v10006977mg [Capsella rubella] gi|482554241|gb|EOA18434.1| hypothetical protein CARUB_v10006977mg [Capsella rubella] Length = 907 Score = 152 bits (385), Expect = 1e-34 Identities = 92/258 (35%), Positives = 144/258 (55%), Gaps = 5/258 (1%) Frame = -3 Query: 759 MASMKFSTVSEIYETRKSNLLGNFDRRSDWNSVSSLTGCVQITGTCNVNSFIRFCQVRVS 580 M S++FS + +++++ + N + D + S+T + S +R ++RVS Sbjct: 1 MGSLRFSIPLDPFDSKRFHFSANPFQFPDQFPIFSVTS--SYVPATRIGSLVRAEKIRVS 58 Query: 579 RSSTDSAHVSESIQEGLVGKKYPIQNRDIKK-----NGRNLWTRFHTLKRENKGESTLRK 415 R ++ +I K + +K +G T+ +K+ + ++ Sbjct: 59 RLDVEAEETENAIDSASAAKVERSSSSKLKSGKTVSSGNKRGTKKDVVKKFSFRRESI-- 116 Query: 414 NEEEEPSIISNGSISNELMASLASIGTESSVEHCNNILKQLERCNDDQTLCFFDWMRKNG 235 N E E +++NG E+ + ++I S+EHCN ILK+LE C+D + FFDWM NG Sbjct: 117 NLELEELLVNNG----EMDVNYSAIKPTLSLEHCNGILKRLESCSDSNAVKFFDWMSCNG 172 Query: 234 KLKENVIACNLALRVLGRRQDWVTAETLLQDLITNLGSKLNFQVFNTLIYACSKRGLGAL 55 KL+ N A +L LRVLGRRQDW AE L+++L G + +FQVFNT+IYAC+K+G L Sbjct: 173 KLQGNFSAYSLILRVLGRRQDWDRAEDLIKELCGFQGFQQSFQVFNTVIYACAKKGNVKL 232 Query: 54 GTKWFHLMLENGVQPNIA 1 G+KWF LMLE GV+PN+A Sbjct: 233 GSKWFQLMLELGVRPNVA 250 >ref|XP_006453565.1| hypothetical protein CICLE_v10007430mg [Citrus clementina] gi|557556791|gb|ESR66805.1| hypothetical protein 
CICLE_v10007430mg [Citrus clementina] Length = 851 Score = 152 bits (383), Expect = 3e-34 Identities = 84/189 (44%), Positives = 121/189 (64%) Frame = -3 Query: 567 DSAHVSESIQEGLVGKKYPIQNRDIKKNGRNLWTRFHTLKRENKGESTLRKNEEEEPSII 388 + ++VS +++G G K RD+ ++ RF RE +E E Sbjct: 23 ECSNVSRKVKKGRYGVKKG-SKRDV-----DMSLRFRRSARE----------QEREYFFA 66 Query: 387 SNGSISNELMASLASIGTESSVEHCNNILKQLERCNDDQTLCFFDWMRKNGKLKENVIAC 208 ++G EL + + IG + S++ CN ILK+LE+ +D ++L FF+WMR NGKL++NVIA Sbjct: 67 NDG----ELDVNYSVIGADLSLDECNAILKRLEKYSDSKSLKFFEWMRTNGKLEKNVIAY 122 Query: 207 NLALRVLGRRQDWVTAETLLQDLITNLGSKLNFQVFNTLIYACSKRGLGALGTKWFHLML 28 NL LRV RR+DW AE +++++ +LG+KLNFQ+FNTLIYAC+KRG LG KWFH+ML Sbjct: 123 NLVLRVFSRREDWDAAEKMIREVRMSLGTKLNFQLFNTLIYACNKRGCVELGAKWFHMML 182 Query: 27 ENGVQPNIA 1 E VQPN+A Sbjct: 183 ECDVQPNVA 191 >ref|XP_007203708.1| hypothetical protein PRUPE_ppa019391mg, partial [Prunus persica] gi|462399239|gb|EMJ04907.1| hypothetical protein PRUPE_ppa019391mg, partial [Prunus persica] Length = 766 Score = 150 bits (379), Expect = 7e-34 Identities = 71/108 (65%), Positives = 88/108 (81%) Frame = -3 Query: 324 VEHCNNILKQLERCNDDQTLCFFDWMRKNGKLKENVIACNLALRVLGRRQDWVTAETLLQ 145 +EHCN+ILK+LERC+D +TL FF+WMR NGKL+ NV A NL LRV+GRR+DW AE L+Q Sbjct: 1 LEHCNDILKRLERCSDVKTLRFFEWMRSNGKLERNVSAFNLVLRVMGRREDWDGAEKLVQ 60 Query: 144 DLITNLGSKLNFQVFNTLIYACSKRGLGALGTKWFHLMLENGVQPNIA 1 ++I +LG +LN+QVFNTLIYAC K G LG KWF +MLE+ VQPNIA Sbjct: 61 EVIADLGCELNYQVFNTLIYACCKLGRLELGGKWFRMMLEHEVQPNIA 108 >gb|EXB42922.1| Pentatricopeptide repeat-containing protein [Morus notabilis] Length = 889 Score = 147 bits (370), Expect = 8e-33 Identities = 93/255 (36%), Positives = 143/255 (56%), Gaps = 2/255 (0%) Frame = -3 Query: 759 MASMKFSTVSEIYETRKSNLLGNFDRRSDWNSVSSLTGCVQITGTCNVNSFIRFCQVRVS 580 M S+KFS + ++++K N S +S L GC C VNS R ++ + Sbjct: 1 MGSLKFSISLDPFDSKKLN-------SSPISSYFHL-GC----RACIVNSLNRVSNIKAN 48 Query: 579 RSSTDSAHV--SESIQEGLVGKKYPIQNRDIKKNGRNLWTRFHTLKRENKGESTLRKNEE 406 + + S+ + E ++ +K P + R KK + +K+ R E 
Sbjct: 49 PINDEITLSLNSDLVSETIIQQK-PNKFRGSKKEAKRFLGSKVGMKKN-------RWERE 100 Query: 405 EEPSIISNGSISNELMASLASIGTESSVEHCNNILKQLERCNDDQTLCFFDWMRKNGKLK 226 E +++G I + + I ++ S+E CN++LK+LE C+D +TL FF+WMR +GKL+ Sbjct: 101 LENLFVNDGEID----VNYSVIRSDLSLEQCNSVLKRLESCSDSKTLRFFEWMRSHGKLE 156 Query: 225 ENVIACNLALRVLGRRQDWVTAETLLQDLITNLGSKLNFQVFNTLIYACSKRGLGALGTK 46 N+ A NL RVL R++DW TAE ++ +L LG ++ +QVFNTLIYACSK G LG K Sbjct: 157 GNISAYNLVFRVLSRKEDWGTAEKMIWELKNELGCEMGYQVFNTLIYACSKLGRVELGAK 216 Query: 45 WFHLMLENGVQPNIA 1 WF +MLE+GV+PN+A Sbjct: 217 WFRMMLEHGVRPNVA 231 >ref|XP_006857035.1| hypothetical protein AMTR_s00065p00020910 [Amborella trichopoda] gi|548861118|gb|ERN18502.1| hypothetical protein AMTR_s00065p00020910 [Amborella trichopoda] Length = 903 Score = 141 bits (356), Expect = 3e-31 Identities = 78/172 (45%), Positives = 106/172 (61%), Gaps = 7/172 (4%) Frame = -3 Query: 495 IKKNGRNLWTRFHTLKRENKGESTLRK--NEEEEPSII-----SNGSISNELMASLASIG 337 ++ +GR LW R KR + E + R+ E+ PS+ S S +EL A L+++ Sbjct: 69 VRNSGRKLWKRLRGFKRPIESEVSARRLAKTEQCPSLDRKDGDSLSSTESELEAKLSTLE 128 Query: 336 TESSVEHCNNILKQLERCNDDQTLCFFDWMRKNGKLKENVIACNLALRVLGRRQDWVTAE 157 SS+E+CNN LK LE+ ND + L F+WM+ NGKL N A NLALRVL R++DW +E Sbjct: 129 PLSSIENCNNYLKLLEKSNDAKALQLFEWMKSNGKLDRNPTAYNLALRVLSRKEDWKASE 188 Query: 156 TLLQDLITNLGSKLNFQVFNTLIYACSKRGLGALGTKWFHLMLENGVQPNIA 1 LL+++ T + Q+FNTLIY CSKR L GTKWF +ML GV+PN A Sbjct: 189 ELLREMPTVSNCSPSSQMFNTLIYVCSKRELVGWGTKWFRMMLYCGVKPNQA 240 >gb|EYU37145.1| hypothetical protein MIMGU_mgv1a000931mg [Mimulus guttatus] Length = 939 Score = 140 bits (354), Expect = 6e-31 Identities = 89/237 (37%), Positives = 128/237 (54%) Frame = -3 Query: 711 KSNLLGNFDRRSDWNSVSSLTGCVQITGTCNVNSFIRFCQVRVSRSSTDSAHVSESIQEG 532 + L D D S+ +L CV + N + Q + S D A + Sbjct: 64 RDEFLDTSDSILDGYSIDNLEKCVD---AADDNLIV---QEQNSNGEFDRARID------ 111 Query: 531 LVGKKYPIQNRDIKKNGRNLWTRFHTLKRENKGESTLRKNEEEEPSIISNGSISNELMAS 352 + K + N+ + RNL TR + K + KGE E + G + Sbjct: 112 
-IWKTFRGVNKARRSANRNLDTRRNGSKYK-KGEKFTTPFERDRVL----GGDQTLVDID 165 Query: 351 LASIGTESSVEHCNNILKQLERCNDDQTLCFFDWMRKNGKLKENVIACNLALRVLGRRQD 172 L +G + S E CN IL+QLER ND + L FF+WM+ NGKLK+NV A N LRVLGR+ D Sbjct: 166 LDDVGPDLSSERCNLILEQLERSNDSKALTFFEWMKANGKLKKNVAAYNSILRVLGRKTD 225 Query: 171 WVTAETLLQDLITNLGSKLNFQVFNTLIYACSKRGLGALGTKWFHLMLENGVQPNIA 1 W AE +++++I++ +LN+QVFNTLIYAC+K GL +GT+WF +ML+ V+PN+A Sbjct: 226 WNGAEIMIKEMISDSSCELNYQVFNTLIYACNKSGLVDMGTRWFKIMLDYNVRPNVA 282 >ref|NP_567856.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|223635625|sp|O65567.2|PP342_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g30825, chloroplastic; Flags: Precursor gi|332660415|gb|AEE85815.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 904 Score = 139 bits (349), Expect = 2e-30 Identities = 88/259 (33%), Positives = 146/259 (56%), Gaps = 6/259 (2%) Frame = -3 Query: 759 MASMKFSTVSEIYETRKS--NLLGNFDRRSDWNSVSSLTGCVQITGTCNVNSFIRFC-QV 589 M S++FS + +++++ + N + D + +T + T ++ S R ++ Sbjct: 1 MGSLRFSIPLDPFDSKRKRFHFSANPSQFPDQFPIHFVTSSIHATRASSIGSSTRVLDKI 60 Query: 588 RVSRSSTDSAHVSESIQEGLVGKKYPIQ-NRDIKKNGRNLWTRFHTLKREN--KGESTLR 418 RVS T++ + + P++ +R K +G T+ + ++ + +G + L Sbjct: 61 RVSSLGTEANENAINSASAA-----PVERSRSSKLSGDQRGTKKYVARKFSFRRGSNDL- 114 Query: 417 KNEEEEPSIISNGSISNELMASLASIGTESSVEHCNNILKQLERCNDDQTLCFFDWMRKN 238 E E ++NG I + ++I S+EHCN ILK+LE C+D + FFDWMR N Sbjct: 115 ---ELENLFVNNGEID----VNYSAIKPGQSLEHCNGILKRLESCSDTNAIKFFDWMRCN 167 Query: 237 GKLKENVIACNLALRVLGRRQDWVTAETLLQDLITNLGSKLNFQVFNTLIYACSKRGLGA 58 GKL N +A +L LRVLGRR++W AE L+++L + ++QVFNT+IYAC+K+G Sbjct: 168 GKLVGNFVAYSLILRVLGRREEWDRAEDLIKELCGFHEFQKSYQVFNTVIYACTKKGNVK 227 Query: 57 LGTKWFHLMLENGVQPNIA 1 L +KWFH+MLE GV+PN+A Sbjct: 228 LASKWFHMMLEFGVRPNVA 246 >ref|XP_002869359.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297315195|gb|EFH45618.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. 
lyrata] Length = 906 Score = 137 bits (345), Expect = 6e-30 Identities = 88/258 (34%), Positives = 144/258 (55%), Gaps = 5/258 (1%) Frame = -3 Query: 759 MASMKFSTVSEIYETRKSNLLGNFDRRSDWNSVSSLTGCVQITGTCNVNSFIRFCQVRVS 580 M S++ S + +++++ + N + D + S++ V T + S IR ++RVS Sbjct: 1 MGSLRLSIPLDPFDSKRFHFSANPFQFPDQVPIFSVSTSVPAT---RIGSLIRVKKIRVS 57 Query: 579 RSSTDSAHVSESIQEGLVGKKYPIQNRDIKKNGRNLWTRFHT--LKRENKGESTLRKNEE 406 R ++ +I V + ++ + K G N T + K++ + + R+ Sbjct: 58 RLDIEAKEAENAIDSDSVNVE---RSSNSKLKGSNTVTSGNQRGTKKDVARKFSFRRESN 114 Query: 405 E---EPSIISNGSISNELMASLASIGTESSVEHCNNILKQLERCNDDQTLCFFDWMRKNG 235 + E ++NG E+ + ++I S+EH N ILK+LE C+D + FFDWMR G Sbjct: 115 DLELENLFVNNG----EMDVNYSAIKPGLSLEHYNAILKRLESCSDTNAIKFFDWMRCKG 170 Query: 234 KLKENVIACNLALRVLGRRQDWVTAETLLQDLITNLGSKLNFQVFNTLIYACSKRGLGAL 55 KL+ N A +L LRVLGRR++W AE L+++L G + +FQVFNT+IYAC+K+G L Sbjct: 171 KLEGNFGAYSLILRVLGRREEWNRAEDLIEELCGFQGFQQSFQVFNTVIYACTKKGNVKL 230 Query: 54 GTKWFHLMLENGVQPNIA 1 +KWF +MLE GV+PN+A Sbjct: 231 ASKWFQMMLELGVRPNVA 248 >emb|CAA18211.1| puative protein [Arabidopsis thaliana] gi|7269983|emb|CAB79800.1| puative protein [Arabidopsis thaliana] Length = 1075 Score = 134 bits (337), Expect = 5e-29 Identities = 67/136 (49%), Positives = 93/136 (68%) Frame = -3 Query: 408 EEEPSIISNGSISNELMASLASIGTESSVEHCNNILKQLERCNDDQTLCFFDWMRKNGKL 229 E E ++NG I + ++I S+EHCN ILK+LE C+D + FFDWMR NGKL Sbjct: 286 ELENLFVNNGEID----VNYSAIKPGQSLEHCNGILKRLESCSDTNAIKFFDWMRCNGKL 341 Query: 228 KENVIACNLALRVLGRRQDWVTAETLLQDLITNLGSKLNFQVFNTLIYACSKRGLGALGT 49 N +A +L LRVLGRR++W AE L+++L + ++QVFNT+IYAC+K+G L + Sbjct: 342 VGNFVAYSLILRVLGRREEWDRAEDLIKELCGFHEFQKSYQVFNTVIYACTKKGNVKLAS 401 Query: 48 KWFHLMLENGVQPNIA 1 KWFH+MLE GV+PN+A Sbjct: 402 KWFHMMLEFGVRPNVA 417 >gb|EPS64936.1| hypothetical protein M569_09839, partial [Genlisea aurea] Length = 865 Score = 131 bits (330), Expect = 4e-28 Identities = 77/211 (36%), Positives = 121/211 (57%), Gaps = 14/211 (6%) Frame = -3 Query: 591 
VRVSRSSTDSAHVSESIQEGLVGKKYPIQNRDIKKNGRNLWTRFHTLK--RENKGEST-- 424 + VS D SES + L +K +NRD G+++ + K RE+K +S Sbjct: 1 ITVSNLENDVPDSSES-KSNLDSRK---KNRDFTAQGKDVSKQCRIAKMWREHKKQSLDP 56 Query: 423 ---LRKNEEEEPSIISNGSISNELMASLAS-------IGTESSVEHCNNILKQLERCNDD 274 +K+ + P+ + + S + S + E ++E CN IL++LE+ +D Sbjct: 57 HLQSKKSRKVRPTSLQQRASSGSALGSETDLCLDSWDVRPEETIERCNMILERLEKSDDS 116 Query: 273 QTLCFFDWMRKNGKLKENVIACNLALRVLGRRQDWVTAETLLQDLITNLGSKLNFQVFNT 94 + + FF WMR N KLK+NVIA N+ LRVL R+ DW AE L+++++++ G LN+Q+FNT Sbjct: 117 KAISFFKWMRLNQKLKKNVIAHNVILRVLTRKDDWDGAEGLVKEMVSDSGCLLNYQIFNT 176 Query: 93 LIYACSKRGLGALGTKWFHLMLENGVQPNIA 1 +IYAC K+GL + T+WF +ML V PN+A Sbjct: 177 VIYACYKKGLSDVATRWFKMMLNYQVDPNVA 207 >ref|XP_002444089.1| hypothetical protein SORBIDRAFT_07g007540 [Sorghum bicolor] gi|241940439|gb|EES13584.1| hypothetical protein SORBIDRAFT_07g007540 [Sorghum bicolor] Length = 942 Score = 129 bits (325), Expect = 1e-27 Identities = 70/167 (41%), Positives = 100/167 (59%), Gaps = 3/167 (1%) Frame = -3 Query: 492 KKNGRNLWTRFHTLKRENKGEST---LRKNEEEEPSIISNGSISNELMASLASIGTESSV 322 KK G LW R K+ K + L K+ S + + + A L+ I ESS+ Sbjct: 115 KKKGCKLWRRLQGGKKLVKHRAPKHGLGKDRHGHKSAVKDDGVD----ALLSGISKESSI 170 Query: 321 EHCNNILKQLERCNDDQTLCFFDWMRKNGKLKENVIACNLALRVLGRRQDWVTAETLLQD 142 E CN+ L +LE+ +D++ L FFDWM+ NGKLK N A +LAL+ + ++DW AE LL + Sbjct: 171 EECNSALIRLEKLSDEKALNFFDWMKVNGKLKGNPHAYHLALQAIAWKEDWKMAELLLCE 230 Query: 141 LITNLGSKLNFQVFNTLIYACSKRGLGALGTKWFHLMLENGVQPNIA 1 ++ + G L+ + FN LIY C+KR L A TKWFH+MLE VQPN++ Sbjct: 231 MVADSGCTLDARAFNGLIYVCAKRRLDAWATKWFHMMLEREVQPNLS 277