BLASTX nr result
ID: Akebia24_contig00019700
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00019700
         (1669 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

ref|XP_007034168.1| Pentatricopeptide repeat-containing protein,...     484   e-134
ref|XP_006421046.1| hypothetical protein CICLE_v10004784mg [Citr...     479   e-132
ref|XP_007222864.1| hypothetical protein PRUPE_ppa004279mg [Prun...     478   e-132
ref|XP_002268109.1| PREDICTED: pentatricopeptide repeat-containi...     474   e-131
ref|XP_004298231.1| PREDICTED: pentatricopeptide repeat-containi...     464   e-128
ref|XP_002321108.2| hypothetical protein POPTR_0014s14700g [Popu...     460   e-127
ref|XP_006855721.1| hypothetical protein AMTR_s00044p00151840 [A...     459   e-126
gb|EXC14264.1| hypothetical protein L484_021763 [Morus notabilis]       453   e-125
ref|XP_003552730.1| PREDICTED: pentatricopeptide repeat-containi...     451   e-124
ref|XP_002516403.1| pentatricopeptide repeat-containing protein,...     449   e-123
ref|XP_003538531.1| PREDICTED: pentatricopeptide repeat-containi...     444   e-122
ref|XP_007163800.1| hypothetical protein PHAVU_001G265200g [Phas...     440   e-120
ref|XP_006492991.1| PREDICTED: pentatricopeptide repeat-containi...     422   e-115
ref|XP_002868305.1| binding protein [Arabidopsis lyrata subsp. l...     402   e-109
ref|XP_006414780.1| hypothetical protein EUTSA_v10027442mg [Eutr...     402   e-109
ref|NP_193155.4| pentatricopeptide repeat-containing protein [Ar...     397   e-107
ref|XP_006283572.1| hypothetical protein CARUB_v10004636mg [Caps...     392   e-106
emb|CAN80932.1| hypothetical protein VITISV_017362 [Vitis vinifera]     389   e-105
emb|CAB10198.1| salt-inducible protein homolog [Arabidopsis thal...
382 e-103 ref|XP_001762610.1| predicted protein [Physcomitrella patens] gi... 172 4e-40 >ref|XP_007034168.1| Pentatricopeptide repeat-containing protein, putative [Theobroma cacao] gi|508713197|gb|EOY05094.1| Pentatricopeptide repeat-containing protein, putative [Theobroma cacao] Length = 504 Score = 484 bits (1245), Expect = e-134 Identities = 240/453 (52%), Positives = 325/453 (71%), Gaps = 1/453 (0%) Frame = -2 Query: 1494 PQNFSTKLTNYSLLFVDTFHEXXXXXXXXXXXXXXXS-PLQILRNEGNWSEKELWLVMRF 1318 P+ + N++ L V+T+H S PLQ+LR++G+W++ W+V+RF Sbjct: 52 PRPDGSSCKNHTALLVETYHHHRRLKALLERLEKDDSCPLQMLRDDGDWTKDIFWVVIRF 111 Query: 1317 LKETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXXXXXXXXEAVSVLREMKNNNINL 1138 L+ S+ EILQVF +WKN+ KSRINE+NYEKI +AV LREM + Sbjct: 112 LRRASRSNEILQVFHMWKNIEKSRINELNYEKIIGLLGEEGRVGQAVQALREMGGYGLKP 171 Query: 1137 SLKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIPETYNGLIRAYGNYGMYDEMGKCV 958 SL++YNSIIH +A G+F+DAL FL EM + L P +TY+GLI AYG Y MYDE+G C+ Sbjct: 172 SLEVYNSIIHAYARNGKFDDALSFLNEMKEIGLAPETDTYDGLIEAYGKYKMYDEIGTCL 231 Query: 957 KKMELEGCFPDSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNLQTFTLIAMLEAYVDL 778 K MEL+ C PD TYNLLI EF+RGGL +RME+V + LLSK+MNLQ+ +L+AMLEAY + Sbjct: 232 KMMELDRCRPDHFTYNLLIREFSRGGLLQRMEQVYQILLSKQMNLQSSSLVAMLEAYANF 291 Query: 777 GLLEKMEKVYQRVLNSQTPFKYGMIRKMAGFYIENHMFSRLEDLGVDVASRNGVNDLVWC 598 G+L+KMEKVY++V+NS T K IR +A YI+N+MFSRL+DLG+D++SR G NDLVWC Sbjct: 292 GILDKMEKVYRKVVNSMT-LKEDTIRILASVYIKNYMFSRLDDLGIDLSSRTGRNDLVWC 350 Query: 597 LRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNVMALMYLKMKDFRQLDILLSQIETR 418 LRLLSHAC+ SR+G++S+I M AK SWN+T++N++ L Y+KMKDF++L ILLSQ+ + Sbjct: 351 LRLLSHACLLSRKGMDSVILEMCEAKASWNVTISNIILLAYMKMKDFKRLRILLSQLPSH 410 Query: 417 LLKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDPLVLTAFRKGFFLRSC 238 ++PDI+T+G+L DA GFDG E WR+MGLL + EMNTDPLVL AF KG FLR C Sbjct: 411 QVRPDIITIGILSDAIEIGFDGAEALETWRKMGLLYRTVEMNTDPLVLIAFGKGHFLRDC 470 Query: 237 EEMYSSLGQKSSEKRVWTYQDLIDMVCKYNGRK 139 EE+Y+SL K+ +++ WTY LID+V K+ ++ Sbjct: 471 EEIYTSLEPKARKEKRWTYHHLIDLVIKHKAKR 503 
>ref|XP_006421046.1| hypothetical protein CICLE_v10004784mg [Citrus clementina] gi|557522919|gb|ESR34286.1| hypothetical protein CICLE_v10004784mg [Citrus clementina] Length = 510 Score = 479 bits (1232), Expect = e-132 Identities = 241/493 (48%), Positives = 335/493 (67%), Gaps = 1/493 (0%) Frame = -2 Query: 1617 NNNFYKTLITRXXXXXXXXXXXXXXPILHRKKTQNHYHQETPQNFSTKLTNYSLLFVDTF 1438 NN YK + + IL RK P + T ++ L V+++ Sbjct: 13 NNTHYKVRLNKNKKNKLTHNRVFFSKILIRKPISCCCLSSAP-SLDYHSTKHTTLLVESY 71 Query: 1437 HEXXXXXXXXXXXXXXXS-PLQILRNEGNWSEKELWLVMRFLKETSQPQEILQVFDLWKN 1261 HE S PLQIL+++G+W++ W V+RFLK +S+ ++I QVFD+WKN Sbjct: 72 HEHQALNALIQRLNKKVSCPLQILQHDGDWTKDHFWAVIRFLKNSSRSRQIPQVFDMWKN 131 Query: 1260 LNKSRINEINYEKIXXXXXXXXXXXEAVSVLREMKNNNINLSLKLYNSIIHGFAYKGEFE 1081 + KSRINE N +KI EAV +EM+ + SL++YNSIIHG++ G+F Sbjct: 132 IEKSRINEFNSQKIIGMLCEEGLMEEAVRAFQEMEGFALKPSLEIYNSIIHGYSKIGKFN 191 Query: 1080 DALIFLKEMVDFNLKPIPETYNGLIRAYGNYGMYDEMGKCVKKMELEGCFPDSVTYNLLI 901 +AL+FL EM + NL P +TY+GLI+AYG Y MYDE+ C+K M+L+GC PD +TYNLLI Sbjct: 192 EALLFLNEMKEMNLSPQSDTYDGLIQAYGKYKMYDEIDMCLKMMKLDGCSPDHITYNLLI 251 Query: 900 VEFARGGLFKRMERVDRTLLSKKMNLQTFTLIAMLEAYVDLGLLEKMEKVYQRVLNSQTP 721 EFA GL KRME +++L+K+M+L++ T++A+L+AY++ G+L+KMEK Y+R+LNS+TP Sbjct: 252 QEFACAGLLKRMEGTYKSMLTKRMHLRSSTMVAILDAYMNFGMLDKMEKFYKRLLNSRTP 311 Query: 720 FKYGMIRKMAGFYIENHMFSRLEDLGVDVASRNGVNDLVWCLRLLSHACISSRRGIESII 541 K ++RK+A YI+N+MFSRL+DLG D+ASR G +LVWCLRLLSHAC+ S RGI+S++ Sbjct: 312 LKEDLVRKLAEVYIKNYMFSRLDDLGDDLASRIGRTELVWCLRLLSHACLLSHRGIDSVV 371 Query: 540 QAMVVAKFSWNITVTNVMALMYLKMKDFRQLDILLSQIETRLLKPDIVTVGVLFDASVGG 361 + M AK WN+T N++ L YLKMKDF+ L +LLS++ TR +KPDIVT+G+L+DA G Sbjct: 372 REMESAKVRWNVTTANIILLAYLKMKDFKHLRVLLSELPTRHVKPDIVTIGILYDARRIG 431 Query: 360 FDGNTTFERWRRMGLLEKAAEMNTDPLVLTAFRKGFFLRSCEEMYSSLGQKSSEKRVWTY 181 FDG E W+R+G L K E+NTDPLVL + KG FLR CEE+YSSL S EK+ WTY Sbjct: 432 FDGTGALEMWKRIGFLFKTVEINTDPLVLAVYGKGHFLRYCEEVYSSLEPYSREKKRWTY 491 Query: 180 QDLIDMVCKYNGR 142 Q+LID+V K+NG+ Sbjct: 492 QNLIDLVIKHNGK 504 
>ref|XP_007222864.1| hypothetical protein PRUPE_ppa004279mg [Prunus persica] gi|462419800|gb|EMJ24063.1| hypothetical protein PRUPE_ppa004279mg [Prunus persica] Length = 518 Score = 478 bits (1229), Expect = e-132 Identities = 244/448 (54%), Positives = 310/448 (69%) Frame = -2 Query: 1494 PQNFSTKLTNYSLLFVDTFHEXXXXXXXXXXXXXXXSPLQILRNEGNWSEKELWLVMRFL 1315 P + STK T L V+TFHE PLQ+L +G+W++ + W +RFL Sbjct: 65 PDSSSTKHTT---LLVETFHEHQRLKALLQNLINGSCPLQLLGEDGDWTKDQFWAAIRFL 121 Query: 1314 KETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXXXXXXXXEAVSVLREMKNNNINLS 1135 K T + EILQ+FD+WKN+ KSRINE NY KI EAV +EMK++N+ S Sbjct: 122 KHTFRFNEILQLFDMWKNIEKSRINEFNYSKIIGLLGEEGLIEEAVRCFQEMKSHNLRPS 181 Query: 1134 LKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIPETYNGLIRAYGNYGMYDEMGKCVK 955 L++YNS+IH A +G FEDAL FL EM + NL P +TY+GLI AYG Y MYD++G CVK Sbjct: 182 LEVYNSVIHVCARQGNFEDALFFLNEMKEMNLAPETDTYDGLIEAYGKYRMYDQIGMCVK 241 Query: 954 KMELEGCFPDSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNLQTFTLIAMLEAYVDLG 775 KM+L GC PD +TYNLLI EFARGGL KRME V +++LS++M LQ+ TLIAM+E Y G Sbjct: 242 KMKLNGCSPDHITYNLLIREFARGGLLKRMESVYQSMLSRRMALQSSTLIAMVEVYAKFG 301 Query: 774 LLEKMEKVYQRVLNSQTPFKYGMIRKMAGFYIENHMFSRLEDLGVDVASRNGVNDLVWCL 595 +LEKME VY+RVLNS T K +IRK+A YI+N+MFSRLE LGVD++SR G DLVWCL Sbjct: 302 ILEKMENVYRRVLNSGTVVKNDLIRKLAEVYIDNYMFSRLEKLGVDLSSRFGQTDLVWCL 361 Query: 594 RLLSHACISSRRGIESIIQAMVVAKFSWNITVTNVMALMYLKMKDFRQLDILLSQIETRL 415 RLLS A + S+RG++SI+ M WN TV N++ L YLKMKDF L I LSQ+ T+ Sbjct: 362 RLLSQAGVLSQRGMDSIVDEMKEQNVPWNETVANIIMLAYLKMKDFTHLRIFLSQLLTQG 421 Query: 414 LKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDPLVLTAFRKGFFLRSCE 235 ++PDI+TVG++FDA+ G+DG+ T + WR G L KA EMNTDPLVLT F KG FLR+CE Sbjct: 422 VEPDIITVGIVFDANRIGYDGSRTLDTWRENGFLRKAVEMNTDPLVLTTFGKGHFLRNCE 481 Query: 234 EMYSSLGQKSSEKRVWTYQDLIDMVCKY 151 YSSL + E + WTY LID+V K+ Sbjct: 482 AAYSSLEPEDRENKTWTYHHLIDLVFKH 509 >ref|XP_002268109.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic-like [Vitis vinifera] Length = 581 Score = 474 bits (1221), 
Expect = e-131 Identities = 244/451 (54%), Positives = 317/451 (70%), Gaps = 1/451 (0%) Frame = -2 Query: 1512 HYHQETPQNFSTKLTNYSLLFVDTFHEXXXXXXXXXXXXXXXS-PLQILRNEGNWSEKEL 1336 H+ Q + N TK T L V+T HE S PLQ+LR++G+W+++ Sbjct: 76 HHLQLSHNNNPTKHTT---LLVETLHENERLGVLIQKLSNKASSPLQLLRDDGDWNKQHF 132 Query: 1335 WLVMRFLKETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXXXXXXXXEAVSVLREMK 1156 W V+RFLK+ S+ EIL VF LWK+++KSRINE NY KI E+V L MK Sbjct: 133 WAVIRFLKDASRSSEILPVFHLWKDMDKSRINEFNYAKIIGLLSQEDLAEESVLALEGMK 192 Query: 1155 NNNINLSLKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIPETYNGLIRAYGNYGMYD 976 + + SL++YN +IH FA KGEF+ AL FL E+ NL ETY+GLI++YG Y MYD Sbjct: 193 THGLKPSLEIYNLVIHCFARKGEFDRALYFLNELKANNLIADTETYDGLIQSYGKYKMYD 252 Query: 975 EMGKCVKKMELEGCFPDSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNLQTFTLIAML 796 E+ +CVKKME +GC PD +TYNLLI EF+RGGL KRMERV +T+LSKKM LQ+ TL+ ML Sbjct: 253 ELDECVKKMESDGCLPDHITYNLLIQEFSRGGLLKRMERVFQTVLSKKMGLQSSTLVVML 312 Query: 795 EAYVDLGLLEKMEKVYQRVLNSQTPFKYGMIRKMAGFYIENHMFSRLEDLGVDVASRNGV 616 EAY + G++EKME Y+RVLNS+T K +IRK+A YIEN+ FSRL D+G+++AS Sbjct: 313 EAYANFGIIEKMENAYRRVLNSKTSLKDDLIRKLAEVYIENYKFSRLADMGLNLASVTSR 372 Query: 615 NDLVWCLRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNVMALMYLKMKDFRQLDILL 436 DLVWCLRLLSHAC+ SR+G++SI++ M WN TV N + L YLKMKDF +L ILL Sbjct: 373 TDLVWCLRLLSHACLLSRKGLDSIVKEMEAKNVPWNATVANTILLAYLKMKDFTRLRILL 432 Query: 435 SQIETRLLKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDPLVLTAFRKG 256 ++ TR +KPDIVTVG+LFDA+ GF+G WRR G L++A EMNTDPLVL+AF KG Sbjct: 433 LELSTRHVKPDIVTVGILFDANRIGFNGTMALNTWRRTGFLDEAVEMNTDPLVLSAFGKG 492 Query: 255 FFLRSCEEMYSSLGQKSSEKRVWTYQDLIDM 163 FL+SCEEMYSSL ++ +K++WTYQ+LID+ Sbjct: 493 NFLQSCEEMYSSLEPEARKKKIWTYQNLIDL 523 >ref|XP_004298231.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic-like [Fragaria vesca subsp. 
vesca] Length = 509 Score = 464 bits (1193), Expect = e-128 Identities = 238/450 (52%), Positives = 307/450 (68%), Gaps = 1/450 (0%) Frame = -2 Query: 1494 PQNFSTKLTNYSLLFVDTFHEXXXXXXXXXXXXXXXS-PLQILRNEGNWSEKELWLVMRF 1318 P + ++ T ++ L V+ HE PLQ+LR++G+W+ + W V+RF Sbjct: 58 PPHKTSSSTEHTTLHVEPSHEYHKLRALLDILMEKDCCPLQLLRDDGDWTIDQFWAVIRF 117 Query: 1317 LKETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXXXXXXXXEAVSVLREMKNNNINL 1138 L S+P+EILQ+FD+W+N+ KSRINE NY KI EAV ++MK+ + L Sbjct: 118 LIHASRPKEILQLFDIWRNIEKSRINEFNYSKIIGLLVEEDLIEEAVVCFQDMKSQGLGL 177 Query: 1137 SLKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIPETYNGLIRAYGNYGMYDEMGKCV 958 S++LYN+IIHG + G F DA+ FL EM + NL P +TY+GLI AYG Y MYDEMG C+ Sbjct: 178 SVELYNTIIHGLSRNGNFVDAVHFLNEMKEMNLAPDADTYDGLIEAYGKYKMYDEMGMCL 237 Query: 957 KKMELEGCFPDSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNLQTFTLIAMLEAYVDL 778 KKM L GC PD +TYNLLI EFA GGL R+ERV ++++S++M+LQ TLIA+LE Y Sbjct: 238 KKMRLNGCSPDYITYNLLIREFAHGGLLNRVERVYQSMVSRRMDLQVPTLIAILEVYAKF 297 Query: 777 GLLEKMEKVYQRVLNSQTPFKYGMIRKMAGFYIENHMFSRLEDLGVDVASRNGVNDLVWC 598 G+LEKME Y+RVLNS+ K +I+K+A YIEN+MFS+LE+LGVD++ R G DLVWC Sbjct: 298 GILEKMEVFYRRVLNSRAILKEDLIKKVAEVYIENYMFSKLENLGVDLSPRFGQTDLVWC 357 Query: 597 LRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNVMALMYLKMKDFRQLDILLSQIETR 418 LRLLSHA + SRRG+ SII M WN TV N+M L YLKMKDF +L L SQ TR Sbjct: 358 LRLLSHAGLLSRRGMNSIILEMEGKSVPWNATVANIMMLAYLKMKDFTRLRSLFSQSLTR 417 Query: 417 LLKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDPLVLTAFRKGFFLRSC 238 + PDI+T G+LFDA+ G+DG+ T WR+ G+L KA EMNTDPLV+T F KG FLR+C Sbjct: 418 GVDPDIITFGILFDANRIGYDGSATLNTWRKHGILYKAVEMNTDPLVITTFGKGHFLRNC 477 Query: 237 EEMYSSLGQKSSEKRVWTYQDLIDMVCKYN 148 E YSSL + EK+ WTYQDLID V K N Sbjct: 478 EAAYSSLEPEVREKKTWTYQDLIDSVFKDN 507 >ref|XP_002321108.2| hypothetical protein POPTR_0014s14700g [Populus trichocarpa] gi|550324215|gb|EEE99423.2| hypothetical protein POPTR_0014s14700g [Populus trichocarpa] Length = 508 Score = 460 bits (1184), Expect = e-127 Identities = 246/521 (47%), Positives = 330/521 (63%), Gaps = 20/521 (3%) 
Frame = -2 Query: 1662 VSLDFNKTS---FNWRFQNNNFYKTLITRXXXXXXXXXXXXXXPILHRKKTQNHYHQETP 1492 ++L F S + W+F N YKTL+T I H Sbjct: 1 MALKFQNNSIPPWTWKFNN---YKTLVTTHSLSLAFRHPTAHATICHG------------ 45 Query: 1491 QNFSTKLTNYSLLFVDTFHEXXXXXXXXXXXXXXXSPLQILRNEGNWSEKELWLVMRFLK 1312 Q+ STK T L VD+FHE +PLQ+L+ +G+WS+ + W V++FLK Sbjct: 46 QDHSTKHTT---LLVDSFHEHKRLKSLLHNLNSNQNPLQLLQQDGDWSKDDFWSVIKFLK 102 Query: 1311 ETSQPQEILQV-----------------FDLWKNLNKSRINEINYEKIXXXXXXXXXXXE 1183 +++ +ILQV F +W+++ K+RINE NYEKI + Sbjct: 103 LSARSNQILQVHSLAHLFFLAARKIEFVFHMWRDVEKTRINEFNYEKIIGLLGEEGLMED 162 Query: 1182 AVSVLREMKNNNINLSLKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIPETYNGLIR 1003 AV+ EMK+ + LSL++YNSIIHG+A G+F+DAL +L +M + NL P +TY+GLI Sbjct: 163 AVTAFMEMKSFGLCLSLEVYNSIIHGYARNGKFDDALFYLNQMNEMNLSPESDTYDGLIE 222 Query: 1002 AYGNYGMYDEMGKCVKKMELEGCFPDSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNL 823 AYG Y MYDEM C+KKMEL+GC PD TYNLLI +FA+GGL RMERV +++ +K+M L Sbjct: 223 AYGTYRMYDEMAMCLKKMELDGCSPDRYTYNLLIQKFAQGGLLTRMERVYQSMRTKRMKL 282 Query: 822 QTFTLIAMLEAYVDLGLLEKMEKVYQRVLNSQTPFKYGMIRKMAGFYIENHMFSRLEDLG 643 Q+ TLI+MLEAY + G++EKMEK+ + NS+ K ++RK+AG YI N+MFSRL DL Sbjct: 283 QSSTLISMLEAYANFGIVEKMEKILRWAWNSKITVKEDLVRKLAGVYIANYMFSRLHDLA 342 Query: 642 VDVASRNGVNDLVWCLRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNVMALMYLKMK 463 VD+ S G D+VWCL LLSHAC+ SRRG++++++ M AK WNITV N++ L YLKMK Sbjct: 343 VDLTSITGRTDIVWCLHLLSHACLLSRRGMDAVVREMEDAKACWNITVANIILLAYLKMK 402 Query: 462 DFRQLDILLSQIETRLLKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDP 283 DF +L ILLS++ ++PDIVT G+LFDA GFDG E WR+MGLL + EMNTDP Sbjct: 403 DFTRLRILLSKLPEIRVEPDIVTFGILFDAEEIGFDGKECLEMWRKMGLLYRRVEMNTDP 462 Query: 282 LVLTAFRKGFFLRSCEEMYSSLGQKSSEKRVWTYQDLIDMV 160 L L+AF KG FLRSCEE YSSL + EK+ WTY D I++V Sbjct: 463 LALSAFGKGSFLRSCEEGYSSLEPNAREKKRWTYVDFINLV 503 >ref|XP_006855721.1| hypothetical protein AMTR_s00044p00151840 [Amborella trichopoda] gi|548859508|gb|ERN17188.1| hypothetical protein AMTR_s00044p00151840 [Amborella trichopoda] Length = 506 Score = 459 bits (1181), Expect = e-126 
Identities = 233/454 (51%), Positives = 307/454 (67%) Frame = -2 Query: 1503 QETPQNFSTKLTNYSLLFVDTFHEXXXXXXXXXXXXXXXSPLQILRNEGNWSEKELWLVM 1324 ++ PQ+ + + L V F + PL++LR+EG+W++ + W VM Sbjct: 57 EQNPQD-----SKHRALLVQNFFQTQQLLDLIEKIKGGIDPLKLLRDEGDWNKDQFWAVM 111 Query: 1323 RFLKETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXXXXXXXXEAVSVLREMKNNNI 1144 + LKETS+ +E +QVFD W N+ +SR+++ NY K+ EA ++L+E+K+ + Sbjct: 112 KLLKETSRIKEAMQVFDYWVNVERSRLDDSNYTKMIELLVDAGLMDEATTMLKEVKDFGV 171 Query: 1143 NLSLKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIPETYNGLIRAYGNYGMYDEMGK 964 ++ +YN I+HG+A G F+ A +FL+EM D L P ETY+GLIRAYGN+ MYD+M K Sbjct: 172 RPTVAVYNFIVHGYANTGNFDKANLFLREMRDLGLVPESETYDGLIRAYGNHRMYDDMAK 231 Query: 963 CVKKMELEGCFPDSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNLQTFTLIAMLEAYV 784 C KKME EG PD +TYN+LI EFARGGL RME RTLLSKKM LQ TL+AMLEAY Sbjct: 232 CAKKMESEGFTPDHLTYNILIREFARGGLMVRMEGAYRTLLSKKMGLQYSTLVAMLEAYA 291 Query: 783 DLGLLEKMEKVYQRVLNSQTPFKYGMIRKMAGFYIENHMFSRLEDLGVDVASRNGVNDLV 604 LG + +ME V++R+L S+ P K ++RK+A YI+NH FSRLEDLG+ VAS+ G DL Sbjct: 292 ALGCVNEMETVFRRLLKSKIPLKEDLVRKVARAYIKNHRFSRLEDLGLGVASKTGRTDLF 351 Query: 603 WCLRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNVMALMYLKMKDFRQLDILLSQIE 424 WCL LLSHAC+ SR+GI+S+IQ M A N+T N+ AL YLKMKD + LD+LLSQ++ Sbjct: 352 WCLLLLSHACLCSRKGIKSVIQEMKSAMVRPNVTFANITALTYLKMKDVQYLDVLLSQLQ 411 Query: 423 TRLLKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDPLVLTAFRKGFFLR 244 + PDIVTVGV+ DA V GFD WR+ G L + EMNTDPLVLTAF KG+FLR Sbjct: 412 LLNVNPDIVTVGVVMDAYVSGFDDIKALRMWRKTGFLRRPVEMNTDPLVLTAFGKGYFLR 471 Query: 243 SCEEMYSSLGQKSSEKRVWTYQDLIDMVCKYNGR 142 SCEE+Y SLG K E++VWTY DLID+V N R Sbjct: 472 SCEELYLSLGAKGRERKVWTYNDLIDLVFNQNER 505 >gb|EXC14264.1| hypothetical protein L484_021763 [Morus notabilis] Length = 664 Score = 453 bits (1166), Expect = e-125 Identities = 227/443 (51%), Positives = 309/443 (69%), Gaps = 1/443 (0%) Frame = -2 Query: 1470 TNYSLLFVDTFHEXXXXXXXXXXXXXXXS-PLQILRNEGNWSEKELWLVMRFLKETSQPQ 1294 T ++ L V+TFHE S P+++LR +G+W ++ W V+RFL+ S+ + Sbjct: 61 
TEHTTLLVETFHEHRKFKTLLKRLSKNDSCPMRLLREDGDWCKEHFWAVVRFLRHGSRTK 120 Query: 1293 EILQVFDLWKNLNKSRINEINYEKIXXXXXXXXXXXEAVSVLREMKNNNINLSLKLYNSI 1114 EI+QVFDLWKN+ KSRINE+NY KI EAV EMK+ ++ +L++YNS+ Sbjct: 121 EIVQVFDLWKNIEKSRINELNYCKIIKMLGEEGLMEEAVLSFEEMKSCGLSPTLEVYNSM 180 Query: 1113 IHGFAYKGEFEDALIFLKEMVDFNLKPIPETYNGLIRAYGNYGMYDEMGKCVKKMELEGC 934 IHGF+ KG+F+DAL++L EM + N+ P +TY GLI AY Y MYDE+G C+KKM+L GC Sbjct: 181 IHGFSQKGDFDDALVYLNEMREQNVVPETDTYEGLIEAYAKYEMYDEIGLCLKKMKLNGC 240 Query: 933 FPDSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNLQTFTLIAMLEAYVDLGLLEKMEK 754 PD +TYNLL+ +F++GGL KRME V T++SK+M LQ+ TL+AMLE Y G+L+KMEK Sbjct: 241 PPDHITYNLLMRKFSKGGLLKRMESVYHTMISKRMYLQSSTLVAMLETYARFGILDKMEK 300 Query: 753 VYQRVLNSQTPFKYGMIRKMAGFYIENHMFSRLEDLGVDVASRNGVNDLVWCLRLLSHAC 574 Y R L ++TP +IRK+A YI+N++FSRLE LGVD+++ G DL+WCLRLLSHA Sbjct: 301 FYMRTLKTKTPLGEDLIRKLAEVYIDNYLFSRLETLGVDLSTTFGETDLLWCLRLLSHAF 360 Query: 573 ISSRRGIESIIQAMVVAKFSWNITVTNVMALMYLKMKDFRQLDILLSQIETRLLKPDIVT 394 + SR+G++ +IQ M A WN+T N++ L +LKMKDF L I LSQ+ T ++PDIVT Sbjct: 361 LFSRKGMDFVIQEMERAHIPWNVTFANIILLTHLKMKDFTHLRISLSQL-THSVEPDIVT 419 Query: 393 VGVLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDPLVLTAFRKGFFLRSCEEMYSSLG 214 VG+LFDA GFDG T E W+RM KA EMNTDP+V+TAF KG FL++CE YSSL Sbjct: 420 VGILFDAIGMGFDGTRTLETWKRMDFFYKAVEMNTDPVVITAFGKGNFLQNCERAYSSLE 479 Query: 213 QKSSEKRVWTYQDLIDMVCKYNG 145 + E + WTY +L+D+V K+ G Sbjct: 480 SEVRETKSWTYNNLVDLVFKHKG 502 >ref|XP_003552730.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic-like [Glycine max] Length = 509 Score = 451 bits (1160), Expect = e-124 Identities = 221/443 (49%), Positives = 311/443 (70%), Gaps = 1/443 (0%) Frame = -2 Query: 1470 TNYSLLFVDTFH-EXXXXXXXXXXXXXXXSPLQILRNEGNWSEKELWLVMRFLKETSQPQ 1294 T ++ L V+T+H +PL +L +G+WS+ W V+RFLK S+ Sbjct: 59 TKHTTLLVETYHLHDSLRALLAKLQKEDCNPLHVLAEDGDWSKDHFWAVVRFLKSASRFT 118 Query: 1293 EILQVFDLWKNLNKSRINEINYEKIXXXXXXXXXXXEAVSVLREMKNNNINLSLKLYNSI 1114 +ILQVFD+WKN+ KSRI+E NY KI +A+S LR+MK I SL YN I Sbjct: 119 
QILQVFDMWKNIEKSRISEFNYNKIIGLLCEGGKMEDALSALRDMKVQGIKPSLDTYNPI 178 Query: 1113 IHGFAYKGEFEDALIFLKEMVDFNLKPIPETYNGLIRAYGNYGMYDEMGKCVKKMELEGC 934 IHG + +G+F DAL F+ EM + L+ ETY+GL+ AYG + MYDEMG+CVKKMELEGC Sbjct: 179 IHGLSREGKFSDALRFIDEMKESGLELDSETYDGLLGAYGKFQMYDEMGECVKKMELEGC 238 Query: 933 FPDSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNLQTFTLIAMLEAYVDLGLLEKMEK 754 PD +TYN+LI E+AR GL +RME++ + ++SK+M++Q+ TL+AMLEAY G++EKME Sbjct: 239 SPDHITYNILIQEYARAGLLQRMEKLYQRMVSKRMHVQSSTLVAMLEAYTTFGMVEKMEN 298 Query: 753 VYQRVLNSQTPFKYGMIRKMAGFYIENHMFSRLEDLGVDVASRNGVNDLVWCLRLLSHAC 574 Y+++L+S+T + +IRK+A YI+N+MFSRLEDL +D+ G ++LVWCLRLLS+AC Sbjct: 299 FYRKILSSKTCLEDDLIRKVAEVYIKNYMFSRLEDLALDLCPAFGESNLVWCLRLLSYAC 358 Query: 573 ISSRRGIESIIQAMVVAKFSWNITVTNVMALMYLKMKDFRQLDILLSQIETRLLKPDIVT 394 S++G++ +++ M AK +WN+TV N++ L Y+KMKDFR L ILLSQ+ ++PDI+T Sbjct: 359 PLSKKGMDIVVREMRDAKVNWNVTVANIIMLAYVKMKDFRHLKILLSQLPIYRVQPDIIT 418 Query: 393 VGVLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDPLVLTAFRKGFFLRSCEEMYSSLG 214 +G+LFDA+ GFDG+ E WRRMG L + E+ TD LVLTAF KG FL+SCEE+YSSL Sbjct: 419 IGILFDATRIGFDGSGALETWRRMGYLYRVVEIKTDSLVLTAFGKGHFLKSCEEVYSSLH 478 Query: 213 QKSSEKRVWTYQDLIDMVCKYNG 145 + +++ WTY DLI ++ K+ G Sbjct: 479 PEDRKRKTWTYHDLIALLSKHTG 501 >ref|XP_002516403.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223544501|gb|EEF46020.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 502 Score = 449 bits (1154), Expect = e-123 Identities = 224/441 (50%), Positives = 305/441 (69%), Gaps = 1/441 (0%) Frame = -2 Query: 1464 YSLLFVDTFHEXXXXXXXXXXXXXXXS-PLQILRNEGNWSEKELWLVMRFLKETSQPQEI 1288 ++ L V+++HE S PLQ+L+++ +WS+ W V+RFL+ +S+ EI Sbjct: 57 HNTLLVESYHEHQRLKALLARLNKKGSCPLQMLQDDADWSKDHFWAVIRFLRHSSRSDEI 116 Query: 1287 LQVFDLWKNLNKSRINEINYEKIXXXXXXXXXXXEAVSVLREMKNNNINLSLKLYNSIIH 1108 LQVFD+WK++ KSRINE NYEK+ +A S EMK ++ SL++YNS+IH Sbjct: 117 LQVFDMWKDIEKSRINEFNYEKVIEILGEEGLIEDAYSAFIEMKTLCLSPSLQVYNSLIH 176 Query: 1107 
GFAYKGEFEDALIFLKEMVDFNLKPIPETYNGLIRAYGNYGMYDEMGKCVKKMELEGCFP 928 G+A G+F+DA+ +L + + NL P+ +TYNGLI+AYG Y MYDEMG C+KKME+EGC P Sbjct: 177 GYARNGKFDDAVFYLNHLKEINLSPVSDTYNGLIQAYGKYKMYDEMGMCLKKMEMEGCSP 236 Query: 927 DSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNLQTFTLIAMLEAYVDLGLLEKMEKVY 748 D VTYNLLI E A GL RME+V +T +M+L++ TL AMLEAY + G++EKME + Sbjct: 237 DHVTYNLLIQELAEAGLLTRMEKVYQTTRMNRMDLKSTTLTAMLEAYANFGIVEKMELIL 296 Query: 747 QRVLNSQTPFKYGMIRKMAGFYIENHMFSRLEDLGVDVASRNGVNDLVWCLRLLSHACIS 568 +R NS+ K +I+K+A YIEN MFSRLE LG ++ R+G ND+VWCL LLS+AC+ Sbjct: 297 KRTRNSKALLKEDLIKKIALVYIENFMFSRLEKLGHYLSKRSGQNDMVWCLLLLSNACML 356 Query: 567 SRRGIESIIQAMVVAKFSWNITVTNVMALMYLKMKDFRQLDILLSQIETRLLKPDIVTVG 388 S++G++S+++ M VAK SWN+T N++ L YLKMKD +L ILLS + ++KPDIVTVG Sbjct: 357 SQKGMDSVVREMKVAKVSWNVTFINIILLAYLKMKDSMRLGILLSTLTNHIVKPDIVTVG 416 Query: 387 VLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDPLVLTAFRKGFFLRSCEEMYSSLGQK 208 VLFDA+ GF GN E WRR G+L + E TDPLVL AF KG FL+ CEE YSSL Sbjct: 417 VLFDANNIGFHGNGILETWRRTGILYRCVETETDPLVLAAFGKGQFLKKCEEAYSSLEPV 476 Query: 207 SSEKRVWTYQDLIDMVCKYNG 145 + +K WTY +LID+V Y+G Sbjct: 477 ARQKEKWTYCNLIDLVATYDG 497 >ref|XP_003538531.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic-like isoform X1 [Glycine max] Length = 506 Score = 444 bits (1142), Expect = e-122 Identities = 221/443 (49%), Positives = 306/443 (69%), Gaps = 1/443 (0%) Frame = -2 Query: 1470 TNYSLLFVDTFH-EXXXXXXXXXXXXXXXSPLQILRNEGNWSEKELWLVMRFLKETSQPQ 1294 T ++ L V+T+H +PL +L + +WS+ W V+RFLK +S Sbjct: 57 TKHTTLLVETYHLHHSLRALLAKLENEYSNPLHMLAEDADWSKDHFWAVVRFLKSSSNFT 116 Query: 1293 EILQVFDLWKNLNKSRINEINYEKIXXXXXXXXXXXEAVSVLREMKNNNINLSLKLYNSI 1114 ILQVFD+WKN+ KSRI+E NY KI +A+S L++MK I SL YN I Sbjct: 117 HILQVFDMWKNIEKSRISEFNYNKIIGLLCEGGKMKDALSALQDMKVQGIKPSLDTYNPI 176 Query: 1113 IHGFAYKGEFEDALIFLKEMVDFNLKPIPETYNGLIRAYGNYGMYDEMGKCVKKMELEGC 934 IHG + +G+F DAL F+ EM + L+ ETY+GLI AYG + MYDEMG+CVKKMELEGC Sbjct: 177 IHGLSREGKFSDALRFIDEMKESGLELDSETYDGLIGAYGKFQMYDEMGECVKKMELEGC 236 Query: 
933 FPDSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNLQTFTLIAMLEAYVDLGLLEKMEK 754 PD +TYN+LI E+A GGL +RME++ + +LSK+M++++ TL+AMLEAY G++EKMEK Sbjct: 237 SPDPITYNILIQEYAGGGLLQRMEKLYQRMLSKRMHVKSSTLVAMLEAYTTFGMVEKMEK 296 Query: 753 VYQRVLNSQTPFKYGMIRKMAGFYIENHMFSRLEDLGVDVASRNGVNDLVWCLRLLSHAC 574 Y+++LNS+T + +IRK+A YI N MFSRLEDL +D+ G ++L WC RLLS+AC Sbjct: 297 FYRKILNSKTCIEDDLIRKVAEVYINNFMFSRLEDLALDLCPAFGESNLEWCFRLLSYAC 356 Query: 573 ISSRRGIESIIQAMVVAKFSWNITVTNVMALMYLKMKDFRQLDILLSQIETRLLKPDIVT 394 + S++G++ ++Q M AK SWN+TV N++ L Y+KMK+FR L ILLSQ+ ++PDI+T Sbjct: 357 LLSKKGMDIVVQEMQDAKVSWNVTVANIIMLAYVKMKEFRHLRILLSQLPIYRVQPDIIT 416 Query: 393 VGVLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDPLVLTAFRKGFFLRSCEEMYSSLG 214 +G+LFDA+ GFDG+ E WRRMG L + EM TD LVLTAF KG FL+SCEE+YSSL Sbjct: 417 IGILFDATRIGFDGSGALETWRRMGYLYRVVEMKTDSLVLTAFGKGHFLKSCEEVYSSLH 476 Query: 213 QKSSEKRVWTYQDLIDMVCKYNG 145 + +++ TY DLI ++ K+ G Sbjct: 477 PEDRKRKTCTYHDLIPLLSKHTG 499 >ref|XP_007163800.1| hypothetical protein PHAVU_001G265200g [Phaseolus vulgaris] gi|561037264|gb|ESW35794.1| hypothetical protein PHAVU_001G265200g [Phaseolus vulgaris] Length = 496 Score = 440 bits (1131), Expect = e-120 Identities = 215/408 (52%), Positives = 289/408 (70%) Frame = -2 Query: 1383 PLQILRNEGNWSEKELWLVMRFLKETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXX 1204 P+ IL +G+WS+ W +RFLK S+ EILQVFD+WK + KSRI+E NY KI Sbjct: 88 PMYILAQDGDWSKDHFWAAVRFLKNASRFVEILQVFDMWKEIEKSRISEFNYNKIIGLLC 147 Query: 1203 XXXXXXEAVSVLREMKNNNINLSLKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIPE 1024 EA+S +EMK + SL YN IIHG + G+F DAL FL EM + L P E Sbjct: 148 EDEMMEEALSAFQEMKVQGMKPSLDTYNPIIHGLSKAGKFSDALRFLDEMKESGLDPDSE 207 Query: 1023 TYNGLIRAYGNYGMYDEMGKCVKKMELEGCFPDSVTYNLLIVEFARGGLFKRMERVDRTL 844 TY+GLI AYG + +YDEMG+CVKKMELEGC PD +TYN+LI E+AR G+ +RME++ + + Sbjct: 208 TYDGLIGAYGKFQLYDEMGECVKKMELEGCSPDHITYNILIQEYARAGILQRMEKLYQRM 267 Query: 843 LSKKMNLQTFTLIAMLEAYVDLGLLEKMEKVYQRVLNSQTPFKYGMIRKMAGFYIENHMF 664 LSK+M LQ+ T +AML+AY G++EKME +++VLNS++ + IRKMA YI+N+MF Sbjct: 268 
LSKRMRLQSSTFVAMLKAYTTFGIVEKMEFFFRKVLNSKSCLEDDFIRKMAEVYIKNYMF 327 Query: 663 SRLEDLGVDVASRNGVNDLVWCLRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNVMA 484 SRLEDL +D+ S G +DLVWCLRLLS+AC+ S++G++ +++ M AK +WN+ N++ Sbjct: 328 SRLEDLALDLCSAFGESDLVWCLRLLSYACLLSKKGMDIVVKEMQDAKINWNVAFANIIM 387 Query: 483 LMYLKMKDFRQLDILLSQIETRLLKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLEKA 304 L Y+KMKDFR L ILLSQ+ L PDIVT+G++ DAS GFDG E WRRMG L++ Sbjct: 388 LAYVKMKDFRHLRILLSQLRINRLGPDIVTIGIVLDASRIGFDGRGALESWRRMGYLDRV 447 Query: 303 AEMNTDPLVLTAFRKGFFLRSCEEMYSSLGQKSSEKRVWTYQDLIDMV 160 E+ TD LVLTAF KG FL+SCEE+Y+SL + E++ WTY DLI ++ Sbjct: 448 VELKTDSLVLTAFGKGHFLKSCEEVYTSLHPEDRERKKWTYNDLIALL 495 >ref|XP_006492991.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic-like [Citrus sinensis] Length = 477 Score = 422 bits (1085), Expect = e-115 Identities = 214/444 (48%), Positives = 299/444 (67%), Gaps = 1/444 (0%) Frame = -2 Query: 1470 TNYSLLFVDTFHEXXXXXXXXXXXXXXXS-PLQILRNEGNWSEKELWLVMRFLKETSQPQ 1294 T ++ L V+++HE S PLQIL+++G+W++ W V+RFLK +S+ + Sbjct: 61 TKHTTLLVESYHEHQALNALIQRLNKKVSCPLQILQHDGDWTKDHFWAVIRFLKNSSRSR 120 Query: 1293 EILQVFDLWKNLNKSRINEINYEKIXXXXXXXXXXXEAVSVLREMKNNNINLSLKLYNSI 1114 +I QVFD+WKN+ KSRINE NY+KI EAV +EM+ + SL++YNSI Sbjct: 121 QIPQVFDMWKNIEKSRINEFNYQKIIGMLCEEGLMEEAVRAFQEMEGFALKPSLEIYNSI 180 Query: 1113 IHGFAYKGEFEDALIFLKEMVDFNLKPIPETYNGLIRAYGNYGMYDEMGKCVKKMELEGC 934 IHG++ G+F +AL+FL EM + NL P +TY+GLI+AY Sbjct: 181 IHGYSKIGKFNEALLFLNEMKEMNLSPQSDTYDGLIQAY--------------------- 219 Query: 933 FPDSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNLQTFTLIAMLEAYVDLGLLEKMEK 754 EFA GL KRME +++L+K+M+L++ T++A+L+AY++ G+L+KMEK Sbjct: 220 ------------EFACAGLLKRMEGTYKSMLTKRMHLRSSTMVAILDAYMNFGMLDKMEK 267 Query: 753 VYQRVLNSQTPFKYGMIRKMAGFYIENHMFSRLEDLGVDVASRNGVNDLVWCLRLLSHAC 574 Y+R+LNS+TP K ++RK+A YI+N+MFSRL+DLG D+ASR G +LVWCLRLLSHAC Sbjct: 268 FYKRLLNSRTPLKEDLVRKLAEVYIKNYMFSRLDDLGDDLASRIGRTELVWCLRLLSHAC 327 Query: 573 ISSRRGIESIIQAMVVAKFSWNITVTNVMALMYLKMKDFRQLDILLSQIETRLLKPDIVT 394 + S RGI+S+++ M 
AK WN+T N++ L YLKMKDF+ L +LLS++ TR +KPDIVT Sbjct: 328 LLSHRGIDSVVREMESAKVRWNVTTANIILLAYLKMKDFKHLRVLLSELPTRHVKPDIVT 387 Query: 393 VGVLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDPLVLTAFRKGFFLRSCEEMYSSLG 214 +G+L+DA GFDG E WRR+G L K E+NTDPLVL + KG FLR CEE+YSSL Sbjct: 388 IGILYDARRIGFDGTGALEMWRRIGFLSKTVEINTDPLVLAVYGKGHFLRYCEEVYSSLE 447 Query: 213 QKSSEKRVWTYQDLIDMVCKYNGR 142 S EK+ WTYQ+LID+V K+NG+ Sbjct: 448 PYSREKKRWTYQNLIDLVIKHNGK 471 >ref|XP_002868305.1| binding protein [Arabidopsis lyrata subsp. lyrata] gi|297314141|gb|EFH44564.1| binding protein [Arabidopsis lyrata subsp. lyrata] Length = 502 Score = 402 bits (1033), Expect = e-109 Identities = 200/413 (48%), Positives = 286/413 (69%), Gaps = 3/413 (0%) Frame = -2 Query: 1383 PLQILRNEGNWSEKELWLVMRFLKETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXX 1204 PL++L+ G+WS+ W V+RFL+ +S+ EIL VFD WKNL +SRI+E NYE++ Sbjct: 84 PLRLLQEYGDWSKDHFWAVIRFLRHSSRLHEILPVFDAWKNLERSRISEANYERVIRLLC 143 Query: 1203 XXXXXXEAVSVLREM-KNNNINLSLKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIP 1027 EA+ R M ++ ++ SL++YNSIIHG+A +G+FE+A+ +L M + L PI Sbjct: 144 EEKSMNEAIRAFRGMIDDHELSPSLEIYNSIIHGYADEGKFEEAMFYLNHMKENGLLPIT 203 Query: 1026 ETYNGLIRAYGNYGMYDEMGKCVKKMELEGCFPDSVTYNLLIVEFARGGLFKRMERVDRT 847 ETY+GLI AYG + MYDE+ C+K+ME EGC D VTYNLLI EF+RGGL KRME++ ++ Sbjct: 204 ETYDGLIEAYGKWKMYDEIVLCLKRMESEGCVRDHVTYNLLIREFSRGGLLKRMEQMYQS 263 Query: 846 LLSKKMNLQTFTLIAMLEAYVDLGLLEKMEKVYQRVLNSQTPFKYGMIRKMAGFYIENHM 667 L+S+KM L+ TL++MLEAY + GL+EKME+ +++ G++RK+A YI+N M Sbjct: 264 LMSRKMTLEPSTLLSMLEAYAEFGLIEKMEETCNKIIRFGISLDEGLVRKLANVYIDNLM 323 Query: 666 FSRLEDLGVDV-ASRNGVNDLVWCLRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNV 490 FSRL+DLG + +SR DL WCLRLL HA + SR+G++ +I+ M A+ WN T N+ Sbjct: 324 FSRLDDLGRGISSSRTRRTDLAWCLRLLCHARLVSRKGLDYVIKEMKEARVPWNTTFANI 383 Query: 489 MALMYLKMKDFRQLDILLSQIETRLLKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLE 310 L Y KM DF+ +++LLS++ T+ +K D+VTVG++FD S GFD F W+++G L+ Sbjct: 384 TLLAYSKMGDFKSIELLLSELRTKHVKLDLVTVGIIFDLSEAGFDVTGVFMTWKKIGFLD 443 Query: 309 
KAAEMNTDPLVLTAFRKGFFLRSCEEMYS-SLGQKSSEKRVWTYQDLIDMVCK 154 K EM TDPLV AF KG FL+SCEE+ + SLG + E + WTYQ L+++V K Sbjct: 444 KPVEMKTDPLVHAAFGKGKFLKSCEEVKNQSLGMRGEESKAWTYQYLMEVVVK 496 >ref|XP_006414780.1| hypothetical protein EUTSA_v10027442mg [Eutrema salsugineum] gi|557115950|gb|ESQ56233.1| hypothetical protein EUTSA_v10027442mg [Eutrema salsugineum] Length = 495 Score = 402 bits (1032), Expect = e-109 Identities = 200/417 (47%), Positives = 283/417 (67%), Gaps = 2/417 (0%) Frame = -2 Query: 1383 PLQILRNEGNWSEKELWLVMRFLKETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXX 1204 PL++LR +G+WS+ + W V+RFL+ +S+ EIL VFD WKNL SRINE NYEKI Sbjct: 82 PLRLLREDGDWSKHQFWAVVRFLRHSSRLHEILPVFDAWKNLEPSRINEANYEKILRFLC 141 Query: 1203 XXXXXXEAVSVLREM-KNNNINLSLKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIP 1027 EA+ + M + ++ SL++YNSIIHG+A G+FE+A+ ++ M + ++ P Sbjct: 142 EEKSMNEAIRAFQCMIDEHELSPSLEIYNSIIHGYANDGKFEEAMFYMNHMKENDMLPET 201 Query: 1026 ETYNGLIRAYGNYGMYDEMGKCVKKMELEGCFPDSVTYNLLIVEFARGGLFKRMERVDRT 847 ETY+GLI AYG + +YDE+ C+KKME +GC D VTYNLLI EFARGGL KRME++ ++ Sbjct: 202 ETYDGLIEAYGKWKLYDEIVLCIKKMESDGCVRDHVTYNLLIREFARGGLLKRMEQMYQS 261 Query: 846 LLSKKMNLQTFTLIAMLEAYVDLGLLEKMEKVYQRVLNSQTPFKYGMIRKMAGFYIENHM 667 L+S+KM L+ TL++MLEAY + G+LEKME Y +++ ++RK+A YI+N M Sbjct: 262 LMSRKMTLEPCTLLSMLEAYAEFGVLEKMEDTYNKIVRFGISLDEDLVRKVANVYIDNLM 321 Query: 666 FSRLEDLGVDVASRNGVNDLVWCLRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNVM 487 FSRL+DLG + DL WCLRLL HAC+ SR+G++ +++ M A+ WN T N++ Sbjct: 322 FSRLDDLGRGIRR----TDLAWCLRLLCHACLVSRKGLDYVVKEMEEARVPWNATFANIV 377 Query: 486 ALMYLKMKDFRQLDILLSQIETRLLKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLEK 307 L Y KM DFR +++LLS++ T+ +K D+VTVG++ D SV GFDG F W+++G L+K Sbjct: 378 LLAYSKMGDFRSVELLLSELRTKHVKLDLVTVGIVLDLSVDGFDGTGVFMTWKKIGFLDK 437 Query: 306 AAEMNTDPLVLTAFRKGFFLRSCEEMYSS-LGQKSSEKRVWTYQDLIDMVCKYNGRK 139 E TDPLV AF KG FLRSCEE+ + LG + E + WTYQ L+++V K K Sbjct: 438 PVETKTDPLVHAAFGKGRFLRSCEEVKNQVLGTRVEESKSWTYQYLMELVVKNQKNK 494 >ref|NP_193155.4| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] 
gi|223635638|sp|O23278.2|PP310_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g14190, chloroplastic; Flags: Precursor
gi|332657991|gb|AEE83391.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 501

 Score = 397 bits (1019), Expect = e-107
 Identities = 199/413 (48%), Positives = 283/413 (68%), Gaps = 3/413 (0%)
 Frame = -2

Query: 1383 PLQILRNEGNWSEKELWLVMRFLKETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXX 1204
            PL++L+ +G+WS+ W V+RFL+++S+ EIL VFD WKNL SRI+E NYE+I
Sbjct: 83   PLRLLQEDGDWSKDHFWAVIRFLRQSSRLHEILPVFDTWKNLEPSRISENNYERIIRFLC 142

Query: 1203 XXXXXXEAVSVLREM-KNNNINLSLKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIP 1027
            EA+ R M ++ ++ SL++YNSIIH +A G+FE+A+ +L M + L PI
Sbjct: 143  EEKSMSEAIRAFRSMIDDHELSPSLEIYNSIIHSYADDGKFEEAMFYLNHMKENGLLPIT 202

Query: 1026 ETYNGLIRAYGNYGMYDEMGKCVKKMELEGCFPDSVTYNLLIVEFARGGLFKRMERVDRT 847
            ETY+GLI AYG + MYDE+ C+K+ME +GC D VTYNLLI EF+RGGL KRME++ ++
Sbjct: 203  ETYDGLIEAYGKWKMYDEIVLCLKRMESDGCVRDHVTYNLLIREFSRGGLLKRMEQMYQS 262

Query: 846  LLSKKMNLQTFTLIAMLEAYVDLGLLEKMEKVYQRVLNSQTPFKYGMIRKMAGFYIENHM 667
            L+S+KM L+ TL++MLEAY + GL+EKME+ +++ G++RK+A YIEN M
Sbjct: 263  LMSRKMTLEPSTLLSMLEAYAEFGLIEKMEETCNKIIRFGISLDEGLVRKLANVYIENLM 322

Query: 666  FSRLEDLGVDV-ASRNGVNDLVWCLRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNV 490
            FSRL+DLG + ASR +L WCLRLL HA + SR+G++ +++ M A+ WN T N+
Sbjct: 323  FSRLDDLGRGISASRTRRTELAWCLRLLCHARLVSRKGLDYVVKEMEEARVPWNTTFANI 382

Query: 489  MALMYLKMKDFRQLDILLSQIETRLLKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLE 310
            L Y KM DF +++LLS++ + +K D+VTVG++FD S FDG F W+++G L+
Sbjct: 383  ALLAYSKMGDFTSIELLLSELRIKHVKLDLVTVGIVFDLSEARFDGTGVFMTWKKIGFLD 442

Query: 309  KAAEMNTDPLVLTAFRKGFFLRSCEEMYS-SLGQKSSEKRVWTYQDLIDMVCK 154
            K EM TDPLV AF KG FLRSCEE+ + SLG + E + WTYQ L+++V K
Sbjct: 443  KPVEMKTDPLVHAAFGKGQFLRSCEEVKNQSLGTRDGESKSWTYQYLMELVVK 495

>ref|XP_006283572.1| hypothetical protein CARUB_v10004636mg [Capsella rubella]
gi|482552277|gb|EOA16470.1| hypothetical protein CARUB_v10004636mg [Capsella rubella]
          Length = 501

 Score = 392 bits (1006), Expect = e-106
 Identities = 191/410 (46%), Positives = 282/410 (68%), Gaps = 2/410 (0%)
 Frame = -2

Query: 1383 PLQILRNEGNWSEKELWLVMRFLKETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXX 1204
            PLQ+L+ +G+WS+ W V+RFL+ +S+ EIL V+D WKNL SRI+ +NYE++
Sbjct: 90   PLQLLQEDGDWSKDHFWAVIRFLRHSSRLHEILPVYDAWKNLEPSRISVVNYERVIRFLC 149

Query: 1203 XXXXXXEAVSVLREM-KNNNINLSLKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIP 1027
            EA+ R M ++ ++ SL++YNSIIHG+A G+FE+A+ +L +M + L PI
Sbjct: 150  EERSMNEAIRAFRSMIDDDELSPSLEIYNSIIHGYADDGKFEEAMFYLNQMKENGLSPIS 209

Query: 1026 ETYNGLIRAYGNYGMYDEMGKCVKKMELEGCFPDSVTYNLLIVEFARGGLFKRMERVDRT 847
            ETY+GLI AYG + MYDE+ CV++ME +GC D VTYNLLI +F+RGGL KRME++ ++
Sbjct: 210  ETYDGLIEAYGKWKMYDEIVLCVRRMESDGCVRDHVTYNLLIRQFSRGGLLKRMEQMYQS 269

Query: 846  LLSKKMNLQTFTLIAMLEAYVDLGLLEKMEKVYQRVLNSQTPFKYGMIRKMAGFYIENHM 667
            L+S+KM L+ TL++MLEAY + G++EKME+ +++ G++RK+A YI+N M
Sbjct: 270  LMSRKMTLEPCTLLSMLEAYAEFGVIEKMEETCNKIIRFGISLDDGLVRKLAKVYIDNLM 329

Query: 666  FSRLEDLGVDVA-SRNGVNDLVWCLRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNV 490
            FSRL+DLG ++ SR +DL WCLRLL H+ + SR+G++ +++ M AK +WN T N+
Sbjct: 330  FSRLDDLGRGISYSRTRRSDLAWCLRLLCHSRLVSRKGLDYVLKEMTEAKVTWNTTFANI 389

Query: 489  MALMYLKMKDFRQLDILLSQIETRLLKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLE 310
            + L Y KM DF+ +++LL + T+ +K D+VTVG++FD S GFDG F W+++G L+
Sbjct: 390  VLLAYSKMGDFKSIELLLDGLRTKRVKLDLVTVGIVFDLSEAGFDGTGVFMTWKKIGFLD 449

Query: 309  KAAEMNTDPLVLTAFRKGFFLRSCEEMYSSLGQKSSEKRVWTYQDLIDMV 160
            K EM TDPLV AF KG FLR CEEM + + WTYQ+L+++V
Sbjct: 450  KPVEMKTDPLVHAAFGKGQFLRRCEEM------RGEDPTPWTYQNLMELV 493

>emb|CAN80932.1| hypothetical protein VITISV_017362 [Vitis vinifera]
          Length = 1697

 Score = 389 bits (998), Expect = e-105
 Identities = 192/338 (56%), Positives = 249/338 (73%)
 Frame = -2

Query: 1383 PLQILRNEGNWSEKELWLVMRFLKETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXX 1204
            PLQ+LR++G+W+++ W V+RFLK+ S+ EIL VF LWK+++KSRINE NY KI
Sbjct: 1356 PLQLLRDDGDWNKQHFWAVIRFLKDASRSSEILPVFHLWKDMDKSRINEFNYAKIIGLLS 1415

Query: 1203 XXXXXXEAVSVLREMKNNNINLSLKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIPE 1024
            E+V L MK + + SL++YN +IH FA KGEF+ AL FL E+ NL E
Sbjct: 1416 QEDLAEESVLALEXMKTHGLKPSLEIYNLVIHCFARKGEFDRALYFLNELKXNNLIADTE 1475

Query: 1023 TYNGLIRAYGNYGMYDEMGKCVKKMELEGCFPDSVTYNLLIVEFARGGLFKRMERVDRTL 844
            TY+GLI++YG Y MYDE+ +CVKKME +GC PD +TYNLLI EF+RGGL KRMERV +T+
Sbjct: 1476 TYDGLIQSYGKYKMYDELDECVKKMESDGCLPDHITYNLLIQEFSRGGLLKRMERVFQTV 1535

Query: 843  LSKKMNLQTFTLIAMLEAYVDLGLLEKMEKVYQRVLNSQTPFKYGMIRKMAGFYIENHMF 664
            LSKKM LQ+ TL+ MLEAY + G++EKME Y+RVLNS+T K +IRK+A YIEN+ F
Sbjct: 1536 LSKKMGLQSSTLVVMLEAYANFGIIEKMENAYRRVLNSKTSLKDDLIRKLAEVYIENYKF 1595

Query: 663  SRLEDLGVDVASRNGVNDLVWCLRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNVMA 484
            SRL D+G+D+AS DLVWCLRLLSHAC+ SR+G++SI++ M WN TV N +
Sbjct: 1596 SRLADMGLDLASVTSRTDLVWCLRLLSHACLLSRKGLDSIVKEMEAKNVPWNATVANTIL 1655

Query: 483  LMYLKMKDFRQLDILLSQIETRLLKPDIVTVGVLFDAS 370
            L YLKMKDF +L ILL ++ TR +KPDIVTVG+LFDA+
Sbjct: 1656 LAYLKMKDFTRLRILLLELSTRHVKPDIVTVGILFDAN 1693

>emb|CAB10198.1| salt-inducible protein homolog [Arabidopsis thaliana]
gi|7268124|emb|CAB78461.1| salt-inducible protein homolog [Arabidopsis thaliana]
          Length = 561

 Score = 382 bits (982), Expect = e-103
 Identities = 197/419 (47%), Positives = 278/419 (66%), Gaps = 16/419 (3%)
 Frame = -2

Query: 1362 EGNWSEKELWLVMRFLKETSQPQEIL-------------QVFDLWKNLNKSRINEINYEK 1222
            +G+WS+ W V+RFL+++S+ EIL QVFD WKNL SRI+E NYE+
Sbjct: 137  DGDWSKDHFWAVIRFLRQSSRLHEILPNMKMTFCFFFQLQVFDTWKNLEPSRISENNYER 196

Query: 1221 IXXXXXXXXXXXEAVSVLREM-KNNNINLSLKLYNSIIHGFAYKGEFEDALIFLKEMVDF 1045
            I EA+ R M ++ ++ SL++YNSIIH +A G+FE+A+ +L M +
Sbjct: 197  IIRFLCEEKSMSEAIRAFRSMIDDHELSPSLEIYNSIIHSYADDGKFEEAMFYLNHMKEN 256

Query: 1044 NLKPIPETYNGLIRAYGNYGMYDEMGKCVKKMELEGCFPDSVTYNLLIVEFARGGLFKRM 865
            L PI ETY+GLI AYG + MYDE+ C+K+ME +GC D VTYNLLI EF+RGGL KRM
Sbjct: 257  GLLPITETYDGLIEAYGKWKMYDEIVLCLKRMESDGCVRDHVTYNLLIREFSRGGLLKRM 316

Query: 864  ERVDRTLLSKKMNLQTFTLIAMLEAYVDLGLLEKMEKVYQRVLNSQTPFKYGMIRKMAGF 685
            E++ ++L+S+KM L+ TL++MLEAY + GL+EKME+ +++ G++RK+A
Sbjct: 317  EQMYQSLMSRKMTLEPSTLLSMLEAYAEFGLIEKMEETCNKIIRFGISLDEGLVRKLANV 376

Query: 684  YIENHMFSRLEDLGVDV-ASRNGVNDLVWCLRLLSHACISSRRGIESIIQAMVVAKFSWN 508
            YIEN MFSRL+DLG + ASR +L WCLRLL HA + SR+G++ +++ M A+ WN
Sbjct: 377  YIENLMFSRLDDLGRGISASRTRRTELAWCLRLLCHARLVSRKGLDYVVKEMEEARVPWN 436

Query: 507  ITVTNVMALMYLKMKDFRQLDILLSQIETRLLKPDIVTVGVLFDASVGGFDGNTTFERWR 328
            T N+ L Y KM DF +++LLS++ + +K D+VTVG++FD S FDG F W+
Sbjct: 437  TTFANIALLAYSKMGDFTSIELLLSELRIKHVKLDLVTVGIVFDLSEARFDGTGVFMTWK 496

Query: 327  RMGLLEKAAEMNTDPLVLTAFRKGFFLRSCEEMYS-SLGQKSSEKRVWTYQDLIDMVCK 154
            ++G L+K EM TDPLV AF KG FLRSCEE+ + SLG + E + WTYQ L+++V K
Sbjct: 497  KIGFLDKPVEMKTDPLVHAAFGKGQFLRSCEEVKNQSLGTRDGESKSWTYQYLMELVVK 555

>ref|XP_001762610.1| predicted protein [Physcomitrella patens]
gi|162686343|gb|EDQ72733.1| predicted protein [Physcomitrella patens]
          Length = 418

 Score = 172 bits (436), Expect = 4e-40
 Identities = 120/393 (30%), Positives = 197/393 (50%), Gaps = 4/393 (1%)
 Frame = -2

Query: 1335 WLVMRFLKETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXXXXXXXXEAVSVLREMK 1156
            W V+ +L + EIL+VF W+ + + E+ Y KI EA ++ EM
Sbjct: 10   WTVIDYLHGHRRMAEILEVFKWWQQQDGYKPYELYYTKIIRMLGQAHMPTEARTLFIEMC 69

Query: 1155 NNNINLSLKLYNSIIHGFAYKGEFEDALIFLKEMV-DFNLKPIPETYNGLIRAYGNYGMY 979
            + S+ Y ++ G+A +GEFE+A L++M+ + KP TY GLI AYG +GMY
Sbjct: 70   ELGLRPSVVTYTYLLQGYAERGEFEEAEQILRDMILSGDAKPNTTTYAGLIYAYGKHGMY 129

Query: 978  DEMGKCVKKMELEGCFPDSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNLQTFTLIAM 799
            D M + +M+ + D +Y LI +ARGGLF RM++ + + M + T+ A+
Sbjct: 130  DRMWRTFNRMKTQHIPADEFSYRTLIKAYARGGLFSRMQQTMKEMSRNGMYADSATMNAV 189

Query: 798  LEAYVDLGLLEKMEKVYQRVLNSQTPFKYGMIRKMAGFYIENHMFSRLEDL--GVDVASR 625
            + AY + GL+++MEK Y+ + + I+ + Y+++ +F +L V + R
Sbjct: 190  VLAYAEAGLVKEMEKQYEVMWKNSFTAGQETIKAIVRAYVKDSLFFQLSGYVKRVGLRKR 249

Query: 624  NGVNDLVWCLRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNVMALMYLKMKDFRQLD 445
            VN L W LLSHA + + Q M FS ++T N+MAL Y + K L
Sbjct: 250  TMVNYL-WNALLLSHAANLAMDDLGVDFQNMKYLGFSPDVTTCNIMALAYSRAKQLEDLH 308

Query: 444  ILLSQIETRLLKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDPLVLTAF 265
            L++ ++ + PD+VT G + D E L+ AAE+ TDPLV
Sbjct: 309  QLIVTMQDNGIAPDLVTYGAVIDVFTEEKLRPNLLEELVEFRNLDVAAEVETDPLVFEVL 368

Query: 264  RKGFFLRSCEEMYSSL-GQKSSEKRVWTYQDLI 169
            KG F +CE++ ++ G++ +++ TY +L+
Sbjct: 369  GKGRFHVACEKLARNMEGERMNQR---TYGELV 398