BLASTX nr result
ID: Akebia22_contig00006424
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia22_contig00006424 (1675 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_007034168.1| Pentatricopeptide repeat-containing protein,... 484 e-134 ref|XP_006421046.1| hypothetical protein CICLE_v10004784mg [Citr... 481 e-133 ref|XP_002268109.1| PREDICTED: pentatricopeptide repeat-containi... 479 e-132 ref|XP_007222864.1| hypothetical protein PRUPE_ppa004279mg [Prun... 478 e-132 ref|XP_004298231.1| PREDICTED: pentatricopeptide repeat-containi... 465 e-128 ref|XP_006855721.1| hypothetical protein AMTR_s00044p00151840 [A... 465 e-128 ref|XP_002321108.2| hypothetical protein POPTR_0014s14700g [Popu... 459 e-126 gb|EXC14264.1| hypothetical protein L484_021763 [Morus notabilis] 455 e-125 ref|XP_003552730.1| PREDICTED: pentatricopeptide repeat-containi... 451 e-124 ref|XP_002516403.1| pentatricopeptide repeat-containing protein,... 448 e-123 ref|XP_003538531.1| PREDICTED: pentatricopeptide repeat-containi... 443 e-122 ref|XP_007163800.1| hypothetical protein PHAVU_001G265200g [Phas... 440 e-120 ref|XP_006492991.1| PREDICTED: pentatricopeptide repeat-containi... 425 e-116 ref|XP_002868305.1| binding protein [Arabidopsis lyrata subsp. l... 407 e-111 ref|XP_006414780.1| hypothetical protein EUTSA_v10027442mg [Eutr... 407 e-110 ref|NP_193155.4| pentatricopeptide repeat-containing protein [Ar... 402 e-109 ref|XP_006283572.1| hypothetical protein CARUB_v10004636mg [Caps... 396 e-107 emb|CAN80932.1| hypothetical protein VITISV_017362 [Vitis vinifera] 393 e-106 emb|CAB10198.1| salt-inducible protein homolog [Arabidopsis thal... 387 e-105 ref|XP_001762610.1| predicted protein [Physcomitrella patens] gi... 175 5e-41 >ref|XP_007034168.1| Pentatricopeptide repeat-containing protein, putative [Theobroma cacao] gi|508713197|gb|EOY05094.1| Pentatricopeptide repeat-containing protein, putative [Theobroma cacao] Length = 504 Score = 484 bits (1246), Expect = e-134 Identities = 241/453 (53%), Positives = 325/453 (71%), Gaps = 1/453 (0%) Frame = -3 Query: 1475 PQNFSTKLTNYSLLFVDTFHEXXXXXXXXXXXXXXXS-PLQILRNEGNWSEKELWLVMRF 1299 P+ + N++ L V+T+H S PLQ+LR++G+W++ W+V+RF Sbjct: 52 PRPDGSSCKNHTALLVETYHHHRRLKALLERLEKDDSCPLQMLRDDGDWTKDIFWVVIRF 111 Query: 1298 LKETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXXXXXXXXEAVSVLREMKNNNINL 1119 L+ S+ EILQVF +WKN+ KSRINE+NYEKI +AV LREM + Sbjct: 112 LRRASRSNEILQVFHMWKNIEKSRINELNYEKIIGLLGEEGRVGQAVQALREMGGYGLKP 171 Query: 1118 SLKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIPETYNGLIRAYGNYGMYDEMGKCV 939 SL++YNSIIH +A G+F+DAL FL EM + L P +TY+GLI AYG Y MYDE+G C+ Sbjct: 172 SLEVYNSIIHAYARNGKFDDALSFLNEMKEIGLAPETDTYDGLIEAYGKYKMYDEIGTCL 231 Query: 938 KKMESEGCFPDSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNLQTFTLIAMLEAYVDL 759 K ME + C PD TYNLLI EF+RGGL +RME+V + LLSK+MNLQ+ +L+AMLEAY + Sbjct: 232 KMMELDRCRPDHFTYNLLIREFSRGGLLQRMEQVYQILLSKQMNLQSSSLVAMLEAYANF 291 Query: 758 GLLEKMEKVYQRVLNSQTPLKYGMIRKMAGFYIENHMFSRLEDLGVDVASRNGVNDLVWC 579 G+L+KMEKVY++V+NS T LK IR +A YI+N+MFSRL+DLG+D++SR G NDLVWC Sbjct: 292 GILDKMEKVYRKVVNSMT-LKEDTIRILASVYIKNYMFSRLDDLGIDLSSRTGRNDLVWC 350 Query: 578 LRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNIMALMYLKMKDFRQLDILLSQIETR 399 LRLLSHAC+ SR+G++S+I M AK SWN+T++NI+ L Y+KMKDF++L ILLSQ+ + Sbjct: 351 LRLLSHACLLSRKGMDSVILEMCEAKASWNVTISNIILLAYMKMKDFKRLRILLSQLPSH 410 Query: 398 SLKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDPLVLTAFRKGFFLRSC 219 ++PDI+T+G+L DA GFDG E WR+MGLL + EMNTDPLVL AF KG FLR C Sbjct: 411 QVRPDIITIGILSDAIEIGFDGAEALETWRKMGLLYRTVEMNTDPLVLIAFGKGHFLRDC 470 Query: 218 EEMYSSLGQKSSEKRVWTYQDLIDMVCKYNGRK 120 EE+Y+SL K+ +++ WTY LID+V K+ ++ Sbjct: 471 EEIYTSLEPKARKEKRWTYHHLIDLVIKHKAKR 503 >ref|XP_006421046.1| hypothetical protein CICLE_v10004784mg [Citrus clementina] gi|557522919|gb|ESR34286.1| hypothetical protein CICLE_v10004784mg [Citrus clementina] Length = 510 Score = 481 bits (1237), Expect = e-133 Identities = 246/512 (48%), Positives = 343/512 (66%), Gaps = 1/512 (0%) Frame = -3 Query: 1655 MNAIVSLDFNKTSFNWRFQNNNLYKTLITRXXXXXXXXXXXXXXPILHRKKTQNHYYQET 1476 M+++VSL + NN YK + + IL RK Sbjct: 1 MDSVVSLHLHNN-------NNTHYKVRLNKNKKNKLTHNRVFFSKILIRKPISCCCLSSA 53 Query: 1475 PQNFSTKLTNYSLLFVDTFHEXXXXXXXXXXXXXXXS-PLQILRNEGNWSEKELWLVMRF 1299 P + T ++ L V+++HE S PLQIL+++G+W++ W V+RF Sbjct: 54 P-SLDYHSTKHTTLLVESYHEHQALNALIQRLNKKVSCPLQILQHDGDWTKDHFWAVIRF 112 Query: 1298 LKETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXXXXXXXXEAVSVLREMKNNNINL 1119 LK +S+ ++I QVFD+WKN+ KSRINE N +KI EAV +EM+ + Sbjct: 113 LKNSSRSRQIPQVFDMWKNIEKSRINEFNSQKIIGMLCEEGLMEEAVRAFQEMEGFALKP 172 Query: 1118 SLKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIPETYNGLIRAYGNYGMYDEMGKCV 939 SL++YNSIIHG++ G+F +AL+FL EM + NL P +TY+GLI+AYG Y MYDE+ C+ Sbjct: 173 SLEIYNSIIHGYSKIGKFNEALLFLNEMKEMNLSPQSDTYDGLIQAYGKYKMYDEIDMCL 232 Query: 938 KKMESEGCFPDSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNLQTFTLIAMLEAYVDL 759 K M+ +GC PD +TYNLLI EFA GL KRME +++L+K+M+L++ T++A+L+AY++ Sbjct: 233 KMMKLDGCSPDHITYNLLIQEFACAGLLKRMEGTYKSMLTKRMHLRSSTMVAILDAYMNF 292 Query: 758 GLLEKMEKVYQRVLNSQTPLKYGMIRKMAGFYIENHMFSRLEDLGVDVASRNGVNDLVWC 579 G+L+KMEK Y+R+LNS+TPLK ++RK+A YI+N+MFSRL+DLG D+ASR G +LVWC Sbjct: 293 GMLDKMEKFYKRLLNSRTPLKEDLVRKLAEVYIKNYMFSRLDDLGDDLASRIGRTELVWC 352 Query: 578 LRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNIMALMYLKMKDFRQLDILLSQIETR 399 LRLLSHAC+ S RGI+S+++ M AK WN+T NI+ L YLKMKDF+ L +LLS++ TR Sbjct: 353 LRLLSHACLLSHRGIDSVVREMESAKVRWNVTTANIILLAYLKMKDFKHLRVLLSELPTR 412 Query: 398 SLKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDPLVLTAFRKGFFLRSC 219 +KPDIVT+G+L+DA GFDG E W+R+G L K E+NTDPLVL + KG FLR C Sbjct: 413 HVKPDIVTIGILYDARRIGFDGTGALEMWKRIGFLFKTVEINTDPLVLAVYGKGHFLRYC 472 Query: 218 EEMYSSLGQKSSEKRVWTYQDLIDMVCKYNGR 123 EE+YSSL S EK+ WTYQ+LID+V K+NG+ Sbjct: 473 EEVYSSLEPYSREKKRWTYQNLIDLVIKHNGK 504 >ref|XP_002268109.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic-like [Vitis vinifera] Length = 581 Score = 479 bits (1234), Expect = e-132 Identities = 246/451 (54%), Positives = 319/451 (70%), Gaps = 1/451 (0%) Frame = -3 Query: 1493 HYYQETPQNFSTKLTNYSLLFVDTFHEXXXXXXXXXXXXXXXS-PLQILRNEGNWSEKEL 1317 H+ Q + N TK T L V+T HE S PLQ+LR++G+W+++ Sbjct: 76 HHLQLSHNNNPTKHTT---LLVETLHENERLGVLIQKLSNKASSPLQLLRDDGDWNKQHF 132 Query: 1316 WLVMRFLKETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXXXXXXXXEAVSVLREMK 1137 W V+RFLK+ S+ EIL VF LWK+++KSRINE NY KI E+V L MK Sbjct: 133 WAVIRFLKDASRSSEILPVFHLWKDMDKSRINEFNYAKIIGLLSQEDLAEESVLALEGMK 192 Query: 1136 NNNINLSLKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIPETYNGLIRAYGNYGMYD 957 + + SL++YN +IH FA KGEF+ AL FL E+ NL ETY+GLI++YG Y MYD Sbjct: 193 THGLKPSLEIYNLVIHCFARKGEFDRALYFLNELKANNLIADTETYDGLIQSYGKYKMYD 252 Query: 956 EMGKCVKKMESEGCFPDSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNLQTFTLIAML 777 E+ +CVKKMES+GC PD +TYNLLI EF+RGGL KRMERV +T+LSKKM LQ+ TL+ ML Sbjct: 253 ELDECVKKMESDGCLPDHITYNLLIQEFSRGGLLKRMERVFQTVLSKKMGLQSSTLVVML 312 Query: 776 EAYVDLGLLEKMEKVYQRVLNSQTPLKYGMIRKMAGFYIENHMFSRLEDLGVDVASRNGV 597 EAY + G++EKME Y+RVLNS+T LK +IRK+A YIEN+ FSRL D+G+++AS Sbjct: 313 EAYANFGIIEKMENAYRRVLNSKTSLKDDLIRKLAEVYIENYKFSRLADMGLNLASVTSR 372 Query: 596 NDLVWCLRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNIMALMYLKMKDFRQLDILL 417 DLVWCLRLLSHAC+ SR+G++SI++ M WN TV N + L YLKMKDF +L ILL Sbjct: 373 TDLVWCLRLLSHACLLSRKGLDSIVKEMEAKNVPWNATVANTILLAYLKMKDFTRLRILL 432 Query: 416 SQIETRSLKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDPLVLTAFRKG 237 ++ TR +KPDIVTVG+LFDA+ GF+G WRR G L++A EMNTDPLVL+AF KG Sbjct: 433 LELSTRHVKPDIVTVGILFDANRIGFNGTMALNTWRRTGFLDEAVEMNTDPLVLSAFGKG 492 Query: 236 FFLRSCEEMYSSLGQKSSEKRVWTYQDLIDM 144 FL+SCEEMYSSL ++ +K++WTYQ+LID+ Sbjct: 493 NFLQSCEEMYSSLEPEARKKKIWTYQNLIDL 523 >ref|XP_007222864.1| hypothetical protein PRUPE_ppa004279mg [Prunus persica] gi|462419800|gb|EMJ24063.1| hypothetical protein PRUPE_ppa004279mg [Prunus persica] Length = 518 Score = 478 bits (1230), Expect = e-132 Identities = 244/448 (54%), Positives = 310/448 (69%) Frame = -3 Query: 1475 PQNFSTKLTNYSLLFVDTFHEXXXXXXXXXXXXXXXSPLQILRNEGNWSEKELWLVMRFL 1296 P + STK T L V+TFHE PLQ+L +G+W++ + W +RFL Sbjct: 65 PDSSSTKHTT---LLVETFHEHQRLKALLQNLINGSCPLQLLGEDGDWTKDQFWAAIRFL 121 Query: 1295 KETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXXXXXXXXEAVSVLREMKNNNINLS 1116 K T + EILQ+FD+WKN+ KSRINE NY KI EAV +EMK++N+ S Sbjct: 122 KHTFRFNEILQLFDMWKNIEKSRINEFNYSKIIGLLGEEGLIEEAVRCFQEMKSHNLRPS 181 Query: 1115 LKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIPETYNGLIRAYGNYGMYDEMGKCVK 936 L++YNS+IH A +G FEDAL FL EM + NL P +TY+GLI AYG Y MYD++G CVK Sbjct: 182 LEVYNSVIHVCARQGNFEDALFFLNEMKEMNLAPETDTYDGLIEAYGKYRMYDQIGMCVK 241 Query: 935 KMESEGCFPDSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNLQTFTLIAMLEAYVDLG 756 KM+ GC PD +TYNLLI EFARGGL KRME V +++LS++M LQ+ TLIAM+E Y G Sbjct: 242 KMKLNGCSPDHITYNLLIREFARGGLLKRMESVYQSMLSRRMALQSSTLIAMVEVYAKFG 301 Query: 755 LLEKMEKVYQRVLNSQTPLKYGMIRKMAGFYIENHMFSRLEDLGVDVASRNGVNDLVWCL 576 +LEKME VY+RVLNS T +K +IRK+A YI+N+MFSRLE LGVD++SR G DLVWCL Sbjct: 302 ILEKMENVYRRVLNSGTVVKNDLIRKLAEVYIDNYMFSRLEKLGVDLSSRFGQTDLVWCL 361 Query: 575 RLLSHACISSRRGIESIIQAMVVAKFSWNITVTNIMALMYLKMKDFRQLDILLSQIETRS 396 RLLS A + S+RG++SI+ M WN TV NI+ L YLKMKDF L I LSQ+ T+ Sbjct: 362 RLLSQAGVLSQRGMDSIVDEMKEQNVPWNETVANIIMLAYLKMKDFTHLRIFLSQLLTQG 421 Query: 395 LKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDPLVLTAFRKGFFLRSCE 216 ++PDI+TVG++FDA+ G+DG+ T + WR G L KA EMNTDPLVLT F KG FLR+CE Sbjct: 422 VEPDIITVGIVFDANRIGYDGSRTLDTWRENGFLRKAVEMNTDPLVLTTFGKGHFLRNCE 481 Query: 215 EMYSSLGQKSSEKRVWTYQDLIDMVCKY 132 YSSL + E + WTY LID+V K+ Sbjct: 482 AAYSSLEPEDRENKTWTYHHLIDLVFKH 509 >ref|XP_004298231.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic-like [Fragaria vesca subsp. vesca] Length = 509 Score = 465 bits (1197), Expect = e-128 Identities = 240/458 (52%), Positives = 311/458 (67%), Gaps = 3/458 (0%) Frame = -3 Query: 1493 HYYQETP--QNFSTKLTNYSLLFVDTFHEXXXXXXXXXXXXXXXS-PLQILRNEGNWSEK 1323 H++ +P + ++ T ++ L V+ HE PLQ+LR++G+W+ Sbjct: 50 HFFSPSPLPPHKTSSSTEHTTLHVEPSHEYHKLRALLDILMEKDCCPLQLLRDDGDWTID 109 Query: 1322 ELWLVMRFLKETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXXXXXXXXEAVSVLRE 1143 + W V+RFL S+P+EILQ+FD+W+N+ KSRINE NY KI EAV ++ Sbjct: 110 QFWAVIRFLIHASRPKEILQLFDIWRNIEKSRINEFNYSKIIGLLVEEDLIEEAVVCFQD 169 Query: 1142 MKNNNINLSLKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIPETYNGLIRAYGNYGM 963 MK+ + LS++LYN+IIHG + G F DA+ FL EM + NL P +TY+GLI AYG Y M Sbjct: 170 MKSQGLGLSVELYNTIIHGLSRNGNFVDAVHFLNEMKEMNLAPDADTYDGLIEAYGKYKM 229 Query: 962 YDEMGKCVKKMESEGCFPDSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNLQTFTLIA 783 YDEMG C+KKM GC PD +TYNLLI EFA GGL R+ERV ++++S++M+LQ TLIA Sbjct: 230 YDEMGMCLKKMRLNGCSPDYITYNLLIREFAHGGLLNRVERVYQSMVSRRMDLQVPTLIA 289 Query: 782 MLEAYVDLGLLEKMEKVYQRVLNSQTPLKYGMIRKMAGFYIENHMFSRLEDLGVDVASRN 603 +LE Y G+LEKME Y+RVLNS+ LK +I+K+A YIEN+MFS+LE+LGVD++ R Sbjct: 290 ILEVYAKFGILEKMEVFYRRVLNSRAILKEDLIKKVAEVYIENYMFSKLENLGVDLSPRF 349 Query: 602 GVNDLVWCLRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNIMALMYLKMKDFRQLDI 423 G DLVWCLRLLSHA + SRRG+ SII M WN TV NIM L YLKMKDF +L Sbjct: 350 GQTDLVWCLRLLSHAGLLSRRGMNSIILEMEGKSVPWNATVANIMMLAYLKMKDFTRLRS 409 Query: 422 LLSQIETRSLKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDPLVLTAFR 243 L SQ TR + PDI+T G+LFDA+ G+DG+ T WR+ G+L KA EMNTDPLV+T F Sbjct: 410 LFSQSLTRGVDPDIITFGILFDANRIGYDGSATLNTWRKHGILYKAVEMNTDPLVITTFG 469 Query: 242 KGFFLRSCEEMYSSLGQKSSEKRVWTYQDLIDMVCKYN 129 KG FLR+CE YSSL + EK+ WTYQDLID V K N Sbjct: 470 KGHFLRNCEAAYSSLEPEVREKKTWTYQDLIDSVFKDN 507 >ref|XP_006855721.1| hypothetical protein AMTR_s00044p00151840 [Amborella trichopoda] gi|548859508|gb|ERN17188.1| hypothetical protein AMTR_s00044p00151840 [Amborella trichopoda] Length = 506 Score = 465 bits (1196), Expect = e-128 Identities = 236/454 (51%), Positives = 310/454 (68%) Frame = -3 Query: 1484 QETPQNFSTKLTNYSLLFVDTFHEXXXXXXXXXXXXXXXSPLQILRNEGNWSEKELWLVM 1305 ++ PQ+ + + L V F + PL++LR+EG+W++ + W VM Sbjct: 57 EQNPQD-----SKHRALLVQNFFQTQQLLDLIEKIKGGIDPLKLLRDEGDWNKDQFWAVM 111 Query: 1304 RFLKETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXXXXXXXXEAVSVLREMKNNNI 1125 + LKETS+ +E +QVFD W N+ +SR+++ NY K+ EA ++L+E+K+ + Sbjct: 112 KLLKETSRIKEAMQVFDYWVNVERSRLDDSNYTKMIELLVDAGLMDEATTMLKEVKDFGV 171 Query: 1124 NLSLKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIPETYNGLIRAYGNYGMYDEMGK 945 ++ +YN I+HG+A G F+ A +FL+EM D L P ETY+GLIRAYGN+ MYD+M K Sbjct: 172 RPTVAVYNFIVHGYANTGNFDKANLFLREMRDLGLVPESETYDGLIRAYGNHRMYDDMAK 231 Query: 944 CVKKMESEGCFPDSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNLQTFTLIAMLEAYV 765 C KKMESEG PD +TYN+LI EFARGGL RME RTLLSKKM LQ TL+AMLEAY Sbjct: 232 CAKKMESEGFTPDHLTYNILIREFARGGLMVRMEGAYRTLLSKKMGLQYSTLVAMLEAYA 291 Query: 764 DLGLLEKMEKVYQRVLNSQTPLKYGMIRKMAGFYIENHMFSRLEDLGVDVASRNGVNDLV 585 LG + +ME V++R+L S+ PLK ++RK+A YI+NH FSRLEDLG+ VAS+ G DL Sbjct: 292 ALGCVNEMETVFRRLLKSKIPLKEDLVRKVARAYIKNHRFSRLEDLGLGVASKTGRTDLF 351 Query: 584 WCLRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNIMALMYLKMKDFRQLDILLSQIE 405 WCL LLSHAC+ SR+GI+S+IQ M A N+T NI AL YLKMKD + LD+LLSQ++ Sbjct: 352 WCLLLLSHACLCSRKGIKSVIQEMKSAMVRPNVTFANITALTYLKMKDVQYLDVLLSQLQ 411 Query: 404 TRSLKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDPLVLTAFRKGFFLR 225 ++ PDIVTVGV+ DA V GFD WR+ G L + EMNTDPLVLTAF KG+FLR Sbjct: 412 LLNVNPDIVTVGVVMDAYVSGFDDIKALRMWRKTGFLRRPVEMNTDPLVLTAFGKGYFLR 471 Query: 224 SCEEMYSSLGQKSSEKRVWTYQDLIDMVCKYNGR 123 SCEE+Y SLG K E++VWTY DLID+V N R Sbjct: 472 SCEELYLSLGAKGRERKVWTYNDLIDLVFNQNER 505 >ref|XP_002321108.2| hypothetical protein POPTR_0014s14700g [Populus trichocarpa] gi|550324215|gb|EEE99423.2| hypothetical protein POPTR_0014s14700g [Populus trichocarpa] Length = 508 Score = 459 bits (1182), Expect = e-126 Identities = 246/521 (47%), Positives = 330/521 (63%), Gaps = 20/521 (3%) Frame = -3 Query: 1643 VSLDFNKTS---FNWRFQNNNLYKTLITRXXXXXXXXXXXXXXPILHRKKTQNHYYQETP 1473 ++L F S + W+F N YKTL+T I H Sbjct: 1 MALKFQNNSIPPWTWKFNN---YKTLVTTHSLSLAFRHPTAHATICHG------------ 45 Query: 1472 QNFSTKLTNYSLLFVDTFHEXXXXXXXXXXXXXXXSPLQILRNEGNWSEKELWLVMRFLK 1293 Q+ STK T L VD+FHE +PLQ+L+ +G+WS+ + W V++FLK Sbjct: 46 QDHSTKHTT---LLVDSFHEHKRLKSLLHNLNSNQNPLQLLQQDGDWSKDDFWSVIKFLK 102 Query: 1292 ETSQPQEILQV-----------------FDLWKNLNKSRINEINYEKIXXXXXXXXXXXE 1164 +++ +ILQV F +W+++ K+RINE NYEKI + Sbjct: 103 LSARSNQILQVHSLAHLFFLAARKIEFVFHMWRDVEKTRINEFNYEKIIGLLGEEGLMED 162 Query: 1163 AVSVLREMKNNNINLSLKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIPETYNGLIR 984 AV+ EMK+ + LSL++YNSIIHG+A G+F+DAL +L +M + NL P +TY+GLI Sbjct: 163 AVTAFMEMKSFGLCLSLEVYNSIIHGYARNGKFDDALFYLNQMNEMNLSPESDTYDGLIE 222 Query: 983 AYGNYGMYDEMGKCVKKMESEGCFPDSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNL 804 AYG Y MYDEM C+KKME +GC PD TYNLLI +FA+GGL RMERV +++ +K+M L Sbjct: 223 AYGTYRMYDEMAMCLKKMELDGCSPDRYTYNLLIQKFAQGGLLTRMERVYQSMRTKRMKL 282 Query: 803 QTFTLIAMLEAYVDLGLLEKMEKVYQRVLNSQTPLKYGMIRKMAGFYIENHMFSRLEDLG 624 Q+ TLI+MLEAY + G++EKMEK+ + NS+ +K ++RK+AG YI N+MFSRL DL Sbjct: 283 QSSTLISMLEAYANFGIVEKMEKILRWAWNSKITVKEDLVRKLAGVYIANYMFSRLHDLA 342 Query: 623 VDVASRNGVNDLVWCLRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNIMALMYLKMK 444 VD+ S G D+VWCL LLSHAC+ SRRG++++++ M AK WNITV NI+ L YLKMK Sbjct: 343 VDLTSITGRTDIVWCLHLLSHACLLSRRGMDAVVREMEDAKACWNITVANIILLAYLKMK 402 Query: 443 DFRQLDILLSQIETRSLKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDP 264 DF +L ILLS++ ++PDIVT G+LFDA GFDG E WR+MGLL + EMNTDP Sbjct: 403 DFTRLRILLSKLPEIRVEPDIVTFGILFDAEEIGFDGKECLEMWRKMGLLYRRVEMNTDP 462 Query: 263 LVLTAFRKGFFLRSCEEMYSSLGQKSSEKRVWTYQDLIDMV 141 L L+AF KG FLRSCEE YSSL + EK+ WTY D I++V Sbjct: 463 LALSAFGKGSFLRSCEEGYSSLEPNAREKKRWTYVDFINLV 503 >gb|EXC14264.1| hypothetical protein L484_021763 [Morus notabilis] Length = 664 Score = 455 bits (1171), Expect = e-125 Identities = 229/443 (51%), Positives = 310/443 (69%), Gaps = 1/443 (0%) Frame = -3 Query: 1451 TNYSLLFVDTFHEXXXXXXXXXXXXXXXS-PLQILRNEGNWSEKELWLVMRFLKETSQPQ 1275 T ++ L V+TFHE S P+++LR +G+W ++ W V+RFL+ S+ + Sbjct: 61 TEHTTLLVETFHEHRKFKTLLKRLSKNDSCPMRLLREDGDWCKEHFWAVVRFLRHGSRTK 120 Query: 1274 EILQVFDLWKNLNKSRINEINYEKIXXXXXXXXXXXEAVSVLREMKNNNINLSLKLYNSI 1095 EI+QVFDLWKN+ KSRINE+NY KI EAV EMK+ ++ +L++YNS+ Sbjct: 121 EIVQVFDLWKNIEKSRINELNYCKIIKMLGEEGLMEEAVLSFEEMKSCGLSPTLEVYNSM 180 Query: 1094 IHGFAYKGEFEDALIFLKEMVDFNLKPIPETYNGLIRAYGNYGMYDEMGKCVKKMESEGC 915 IHGF+ KG+F+DAL++L EM + N+ P +TY GLI AY Y MYDE+G C+KKM+ GC Sbjct: 181 IHGFSQKGDFDDALVYLNEMREQNVVPETDTYEGLIEAYAKYEMYDEIGLCLKKMKLNGC 240 Query: 914 FPDSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNLQTFTLIAMLEAYVDLGLLEKMEK 735 PD +TYNLL+ +F++GGL KRME V T++SK+M LQ+ TL+AMLE Y G+L+KMEK Sbjct: 241 PPDHITYNLLMRKFSKGGLLKRMESVYHTMISKRMYLQSSTLVAMLETYARFGILDKMEK 300 Query: 734 VYQRVLNSQTPLKYGMIRKMAGFYIENHMFSRLEDLGVDVASRNGVNDLVWCLRLLSHAC 555 Y R L ++TPL +IRK+A YI+N++FSRLE LGVD+++ G DL+WCLRLLSHA Sbjct: 301 FYMRTLKTKTPLGEDLIRKLAEVYIDNYLFSRLETLGVDLSTTFGETDLLWCLRLLSHAF 360 Query: 554 ISSRRGIESIIQAMVVAKFSWNITVTNIMALMYLKMKDFRQLDILLSQIETRSLKPDIVT 375 + SR+G++ +IQ M A WN+T NI+ L +LKMKDF L I LSQ+ T S++PDIVT Sbjct: 361 LFSRKGMDFVIQEMERAHIPWNVTFANIILLTHLKMKDFTHLRISLSQL-THSVEPDIVT 419 Query: 374 VGVLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDPLVLTAFRKGFFLRSCEEMYSSLG 195 VG+LFDA GFDG T E W+RM KA EMNTDP+V+TAF KG FL++CE YSSL Sbjct: 420 VGILFDAIGMGFDGTRTLETWKRMDFFYKAVEMNTDPVVITAFGKGNFLQNCERAYSSLE 479 Query: 194 QKSSEKRVWTYQDLIDMVCKYNG 126 + E + WTY +L+D+V K+ G Sbjct: 480 SEVRETKSWTYNNLVDLVFKHKG 502 >ref|XP_003552730.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic-like [Glycine max] Length = 509 Score = 451 bits (1160), Expect = e-124 Identities = 222/443 (50%), Positives = 311/443 (70%), Gaps = 1/443 (0%) Frame = -3 Query: 1451 TNYSLLFVDTFH-EXXXXXXXXXXXXXXXSPLQILRNEGNWSEKELWLVMRFLKETSQPQ 1275 T ++ L V+T+H +PL +L +G+WS+ W V+RFLK S+ Sbjct: 59 TKHTTLLVETYHLHDSLRALLAKLQKEDCNPLHVLAEDGDWSKDHFWAVVRFLKSASRFT 118 Query: 1274 EILQVFDLWKNLNKSRINEINYEKIXXXXXXXXXXXEAVSVLREMKNNNINLSLKLYNSI 1095 +ILQVFD+WKN+ KSRI+E NY KI +A+S LR+MK I SL YN I Sbjct: 119 QILQVFDMWKNIEKSRISEFNYNKIIGLLCEGGKMEDALSALRDMKVQGIKPSLDTYNPI 178 Query: 1094 IHGFAYKGEFEDALIFLKEMVDFNLKPIPETYNGLIRAYGNYGMYDEMGKCVKKMESEGC 915 IHG + +G+F DAL F+ EM + L+ ETY+GL+ AYG + MYDEMG+CVKKME EGC Sbjct: 179 IHGLSREGKFSDALRFIDEMKESGLELDSETYDGLLGAYGKFQMYDEMGECVKKMELEGC 238 Query: 914 FPDSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNLQTFTLIAMLEAYVDLGLLEKMEK 735 PD +TYN+LI E+AR GL +RME++ + ++SK+M++Q+ TL+AMLEAY G++EKME Sbjct: 239 SPDHITYNILIQEYARAGLLQRMEKLYQRMVSKRMHVQSSTLVAMLEAYTTFGMVEKMEN 298 Query: 734 VYQRVLNSQTPLKYGMIRKMAGFYIENHMFSRLEDLGVDVASRNGVNDLVWCLRLLSHAC 555 Y+++L+S+T L+ +IRK+A YI+N+MFSRLEDL +D+ G ++LVWCLRLLS+AC Sbjct: 299 FYRKILSSKTCLEDDLIRKVAEVYIKNYMFSRLEDLALDLCPAFGESNLVWCLRLLSYAC 358 Query: 554 ISSRRGIESIIQAMVVAKFSWNITVTNIMALMYLKMKDFRQLDILLSQIETRSLKPDIVT 375 S++G++ +++ M AK +WN+TV NI+ L Y+KMKDFR L ILLSQ+ ++PDI+T Sbjct: 359 PLSKKGMDIVVREMRDAKVNWNVTVANIIMLAYVKMKDFRHLKILLSQLPIYRVQPDIIT 418 Query: 374 VGVLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDPLVLTAFRKGFFLRSCEEMYSSLG 195 +G+LFDA+ GFDG+ E WRRMG L + E+ TD LVLTAF KG FL+SCEE+YSSL Sbjct: 419 IGILFDATRIGFDGSGALETWRRMGYLYRVVEIKTDSLVLTAFGKGHFLKSCEEVYSSLH 478 Query: 194 QKSSEKRVWTYQDLIDMVCKYNG 126 + +++ WTY DLI ++ K+ G Sbjct: 479 PEDRKRKTWTYHDLIALLSKHTG 501 >ref|XP_002516403.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223544501|gb|EEF46020.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 502 Score = 448 bits (1152), Expect = e-123 Identities = 226/441 (51%), Positives = 304/441 (68%), Gaps = 1/441 (0%) Frame = -3 Query: 1445 YSLLFVDTFHEXXXXXXXXXXXXXXXS-PLQILRNEGNWSEKELWLVMRFLKETSQPQEI 1269 ++ L V+++HE S PLQ+L+++ +WS+ W V+RFL+ +S+ EI Sbjct: 57 HNTLLVESYHEHQRLKALLARLNKKGSCPLQMLQDDADWSKDHFWAVIRFLRHSSRSDEI 116 Query: 1268 LQVFDLWKNLNKSRINEINYEKIXXXXXXXXXXXEAVSVLREMKNNNINLSLKLYNSIIH 1089 LQVFD+WK++ KSRINE NYEK+ +A S EMK ++ SL++YNS+IH Sbjct: 117 LQVFDMWKDIEKSRINEFNYEKVIEILGEEGLIEDAYSAFIEMKTLCLSPSLQVYNSLIH 176 Query: 1088 GFAYKGEFEDALIFLKEMVDFNLKPIPETYNGLIRAYGNYGMYDEMGKCVKKMESEGCFP 909 G+A G+F+DA+ +L + + NL P+ +TYNGLI+AYG Y MYDEMG C+KKME EGC P Sbjct: 177 GYARNGKFDDAVFYLNHLKEINLSPVSDTYNGLIQAYGKYKMYDEMGMCLKKMEMEGCSP 236 Query: 908 DSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNLQTFTLIAMLEAYVDLGLLEKMEKVY 729 D VTYNLLI E A GL RME+V +T +M+L++ TL AMLEAY + G++EKME + Sbjct: 237 DHVTYNLLIQELAEAGLLTRMEKVYQTTRMNRMDLKSTTLTAMLEAYANFGIVEKMELIL 296 Query: 728 QRVLNSQTPLKYGMIRKMAGFYIENHMFSRLEDLGVDVASRNGVNDLVWCLRLLSHACIS 549 +R NS+ LK +I+K+A YIEN MFSRLE LG ++ R+G ND+VWCL LLS+AC+ Sbjct: 297 KRTRNSKALLKEDLIKKIALVYIENFMFSRLEKLGHYLSKRSGQNDMVWCLLLLSNACML 356 Query: 548 SRRGIESIIQAMVVAKFSWNITVTNIMALMYLKMKDFRQLDILLSQIETRSLKPDIVTVG 369 S++G++S+++ M VAK SWN+T NI+ L YLKMKD +L ILLS + +KPDIVTVG Sbjct: 357 SQKGMDSVVREMKVAKVSWNVTFINIILLAYLKMKDSMRLGILLSTLTNHIVKPDIVTVG 416 Query: 368 VLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDPLVLTAFRKGFFLRSCEEMYSSLGQK 189 VLFDA+ GF GN E WRR G+L + E TDPLVL AF KG FL+ CEE YSSL Sbjct: 417 VLFDANNIGFHGNGILETWRRTGILYRCVETETDPLVLAAFGKGQFLKKCEEAYSSLEPV 476 Query: 188 SSEKRVWTYQDLIDMVCKYNG 126 + +K WTY +LID+V Y+G Sbjct: 477 ARQKEKWTYCNLIDLVATYDG 497 >ref|XP_003538531.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic-like isoform X1 [Glycine max] Length = 506 Score = 443 bits (1140), Expect = e-122 Identities = 221/443 (49%), Positives = 306/443 (69%), Gaps = 1/443 (0%) Frame = -3 Query: 1451 TNYSLLFVDTFH-EXXXXXXXXXXXXXXXSPLQILRNEGNWSEKELWLVMRFLKETSQPQ 1275 T ++ L V+T+H +PL +L + +WS+ W V+RFLK +S Sbjct: 57 TKHTTLLVETYHLHHSLRALLAKLENEYSNPLHMLAEDADWSKDHFWAVVRFLKSSSNFT 116 Query: 1274 EILQVFDLWKNLNKSRINEINYEKIXXXXXXXXXXXEAVSVLREMKNNNINLSLKLYNSI 1095 ILQVFD+WKN+ KSRI+E NY KI +A+S L++MK I SL YN I Sbjct: 117 HILQVFDMWKNIEKSRISEFNYNKIIGLLCEGGKMKDALSALQDMKVQGIKPSLDTYNPI 176 Query: 1094 IHGFAYKGEFEDALIFLKEMVDFNLKPIPETYNGLIRAYGNYGMYDEMGKCVKKMESEGC 915 IHG + +G+F DAL F+ EM + L+ ETY+GLI AYG + MYDEMG+CVKKME EGC Sbjct: 177 IHGLSREGKFSDALRFIDEMKESGLELDSETYDGLIGAYGKFQMYDEMGECVKKMELEGC 236 Query: 914 FPDSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNLQTFTLIAMLEAYVDLGLLEKMEK 735 PD +TYN+LI E+A GGL +RME++ + +LSK+M++++ TL+AMLEAY G++EKMEK Sbjct: 237 SPDPITYNILIQEYAGGGLLQRMEKLYQRMLSKRMHVKSSTLVAMLEAYTTFGMVEKMEK 296 Query: 734 VYQRVLNSQTPLKYGMIRKMAGFYIENHMFSRLEDLGVDVASRNGVNDLVWCLRLLSHAC 555 Y+++LNS+T ++ +IRK+A YI N MFSRLEDL +D+ G ++L WC RLLS+AC Sbjct: 297 FYRKILNSKTCIEDDLIRKVAEVYINNFMFSRLEDLALDLCPAFGESNLEWCFRLLSYAC 356 Query: 554 ISSRRGIESIIQAMVVAKFSWNITVTNIMALMYLKMKDFRQLDILLSQIETRSLKPDIVT 375 + S++G++ ++Q M AK SWN+TV NI+ L Y+KMK+FR L ILLSQ+ ++PDI+T Sbjct: 357 LLSKKGMDIVVQEMQDAKVSWNVTVANIIMLAYVKMKEFRHLRILLSQLPIYRVQPDIIT 416 Query: 374 VGVLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDPLVLTAFRKGFFLRSCEEMYSSLG 195 +G+LFDA+ GFDG+ E WRRMG L + EM TD LVLTAF KG FL+SCEE+YSSL Sbjct: 417 IGILFDATRIGFDGSGALETWRRMGYLYRVVEMKTDSLVLTAFGKGHFLKSCEEVYSSLH 476 Query: 194 QKSSEKRVWTYQDLIDMVCKYNG 126 + +++ TY DLI ++ K+ G Sbjct: 477 PEDRKRKTCTYHDLIPLLSKHTG 499 >ref|XP_007163800.1| hypothetical protein PHAVU_001G265200g [Phaseolus vulgaris] gi|561037264|gb|ESW35794.1| hypothetical protein PHAVU_001G265200g [Phaseolus vulgaris] Length = 496 Score = 440 bits (1131), Expect = e-120 Identities = 216/408 (52%), Positives = 289/408 (70%) Frame = -3 Query: 1364 PLQILRNEGNWSEKELWLVMRFLKETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXX 1185 P+ IL +G+WS+ W +RFLK S+ EILQVFD+WK + KSRI+E NY KI Sbjct: 88 PMYILAQDGDWSKDHFWAAVRFLKNASRFVEILQVFDMWKEIEKSRISEFNYNKIIGLLC 147 Query: 1184 XXXXXXEAVSVLREMKNNNINLSLKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIPE 1005 EA+S +EMK + SL YN IIHG + G+F DAL FL EM + L P E Sbjct: 148 EDEMMEEALSAFQEMKVQGMKPSLDTYNPIIHGLSKAGKFSDALRFLDEMKESGLDPDSE 207 Query: 1004 TYNGLIRAYGNYGMYDEMGKCVKKMESEGCFPDSVTYNLLIVEFARGGLFKRMERVDRTL 825 TY+GLI AYG + +YDEMG+CVKKME EGC PD +TYN+LI E+AR G+ +RME++ + + Sbjct: 208 TYDGLIGAYGKFQLYDEMGECVKKMELEGCSPDHITYNILIQEYARAGILQRMEKLYQRM 267 Query: 824 LSKKMNLQTFTLIAMLEAYVDLGLLEKMEKVYQRVLNSQTPLKYGMIRKMAGFYIENHMF 645 LSK+M LQ+ T +AML+AY G++EKME +++VLNS++ L+ IRKMA YI+N+MF Sbjct: 268 LSKRMRLQSSTFVAMLKAYTTFGIVEKMEFFFRKVLNSKSCLEDDFIRKMAEVYIKNYMF 327 Query: 644 SRLEDLGVDVASRNGVNDLVWCLRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNIMA 465 SRLEDL +D+ S G +DLVWCLRLLS+AC+ S++G++ +++ M AK +WN+ NI+ Sbjct: 328 SRLEDLALDLCSAFGESDLVWCLRLLSYACLLSKKGMDIVVKEMQDAKINWNVAFANIIM 387 Query: 464 LMYLKMKDFRQLDILLSQIETRSLKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLEKA 285 L Y+KMKDFR L ILLSQ+ L PDIVT+G++ DAS GFDG E WRRMG L++ Sbjct: 388 LAYVKMKDFRHLRILLSQLRINRLGPDIVTIGIVLDASRIGFDGRGALESWRRMGYLDRV 447 Query: 284 AEMNTDPLVLTAFRKGFFLRSCEEMYSSLGQKSSEKRVWTYQDLIDMV 141 E+ TD LVLTAF KG FL+SCEE+Y+SL + E++ WTY DLI ++ Sbjct: 448 VELKTDSLVLTAFGKGHFLKSCEEVYTSLHPEDRERKKWTYNDLIALL 495 >ref|XP_006492991.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic-like [Citrus sinensis] Length = 477 Score = 425 bits (1092), Expect = e-116 Identities = 216/444 (48%), Positives = 300/444 (67%), Gaps = 1/444 (0%) Frame = -3 Query: 1451 TNYSLLFVDTFHEXXXXXXXXXXXXXXXS-PLQILRNEGNWSEKELWLVMRFLKETSQPQ 1275 T ++ L V+++HE S PLQIL+++G+W++ W V+RFLK +S+ + Sbjct: 61 TKHTTLLVESYHEHQALNALIQRLNKKVSCPLQILQHDGDWTKDHFWAVIRFLKNSSRSR 120 Query: 1274 EILQVFDLWKNLNKSRINEINYEKIXXXXXXXXXXXEAVSVLREMKNNNINLSLKLYNSI 1095 +I QVFD+WKN+ KSRINE NY+KI EAV +EM+ + SL++YNSI Sbjct: 121 QIPQVFDMWKNIEKSRINEFNYQKIIGMLCEEGLMEEAVRAFQEMEGFALKPSLEIYNSI 180 Query: 1094 IHGFAYKGEFEDALIFLKEMVDFNLKPIPETYNGLIRAYGNYGMYDEMGKCVKKMESEGC 915 IHG++ G+F +AL+FL EM + NL P +TY+GLI+AY Sbjct: 181 IHGYSKIGKFNEALLFLNEMKEMNLSPQSDTYDGLIQAY--------------------- 219 Query: 914 FPDSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNLQTFTLIAMLEAYVDLGLLEKMEK 735 EFA GL KRME +++L+K+M+L++ T++A+L+AY++ G+L+KMEK Sbjct: 220 ------------EFACAGLLKRMEGTYKSMLTKRMHLRSSTMVAILDAYMNFGMLDKMEK 267 Query: 734 VYQRVLNSQTPLKYGMIRKMAGFYIENHMFSRLEDLGVDVASRNGVNDLVWCLRLLSHAC 555 Y+R+LNS+TPLK ++RK+A YI+N+MFSRL+DLG D+ASR G +LVWCLRLLSHAC Sbjct: 268 FYKRLLNSRTPLKEDLVRKLAEVYIKNYMFSRLDDLGDDLASRIGRTELVWCLRLLSHAC 327 Query: 554 ISSRRGIESIIQAMVVAKFSWNITVTNIMALMYLKMKDFRQLDILLSQIETRSLKPDIVT 375 + S RGI+S+++ M AK WN+T NI+ L YLKMKDF+ L +LLS++ TR +KPDIVT Sbjct: 328 LLSHRGIDSVVREMESAKVRWNVTTANIILLAYLKMKDFKHLRVLLSELPTRHVKPDIVT 387 Query: 374 VGVLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDPLVLTAFRKGFFLRSCEEMYSSLG 195 +G+L+DA GFDG E WRR+G L K E+NTDPLVL + KG FLR CEE+YSSL Sbjct: 388 IGILYDARRIGFDGTGALEMWRRIGFLSKTVEINTDPLVLAVYGKGHFLRYCEEVYSSLE 447 Query: 194 QKSSEKRVWTYQDLIDMVCKYNGR 123 S EK+ WTYQ+LID+V K+NG+ Sbjct: 448 PYSREKKRWTYQNLIDLVIKHNGK 471 >ref|XP_002868305.1| binding protein [Arabidopsis lyrata subsp. lyrata] gi|297314141|gb|EFH44564.1| binding protein [Arabidopsis lyrata subsp. lyrata] Length = 502 Score = 407 bits (1046), Expect = e-111 Identities = 203/413 (49%), Positives = 288/413 (69%), Gaps = 3/413 (0%) Frame = -3 Query: 1364 PLQILRNEGNWSEKELWLVMRFLKETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXX 1185 PL++L+ G+WS+ W V+RFL+ +S+ EIL VFD WKNL +SRI+E NYE++ Sbjct: 84 PLRLLQEYGDWSKDHFWAVIRFLRHSSRLHEILPVFDAWKNLERSRISEANYERVIRLLC 143 Query: 1184 XXXXXXEAVSVLREM-KNNNINLSLKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIP 1008 EA+ R M ++ ++ SL++YNSIIHG+A +G+FE+A+ +L M + L PI Sbjct: 144 EEKSMNEAIRAFRGMIDDHELSPSLEIYNSIIHGYADEGKFEEAMFYLNHMKENGLLPIT 203 Query: 1007 ETYNGLIRAYGNYGMYDEMGKCVKKMESEGCFPDSVTYNLLIVEFARGGLFKRMERVDRT 828 ETY+GLI AYG + MYDE+ C+K+MESEGC D VTYNLLI EF+RGGL KRME++ ++ Sbjct: 204 ETYDGLIEAYGKWKMYDEIVLCLKRMESEGCVRDHVTYNLLIREFSRGGLLKRMEQMYQS 263 Query: 827 LLSKKMNLQTFTLIAMLEAYVDLGLLEKMEKVYQRVLNSQTPLKYGMIRKMAGFYIENHM 648 L+S+KM L+ TL++MLEAY + GL+EKME+ +++ L G++RK+A YI+N M Sbjct: 264 LMSRKMTLEPSTLLSMLEAYAEFGLIEKMEETCNKIIRFGISLDEGLVRKLANVYIDNLM 323 Query: 647 FSRLEDLGVDV-ASRNGVNDLVWCLRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNI 471 FSRL+DLG + +SR DL WCLRLL HA + SR+G++ +I+ M A+ WN T NI Sbjct: 324 FSRLDDLGRGISSSRTRRTDLAWCLRLLCHARLVSRKGLDYVIKEMKEARVPWNTTFANI 383 Query: 470 MALMYLKMKDFRQLDILLSQIETRSLKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLE 291 L Y KM DF+ +++LLS++ T+ +K D+VTVG++FD S GFD F W+++G L+ Sbjct: 384 TLLAYSKMGDFKSIELLLSELRTKHVKLDLVTVGIIFDLSEAGFDVTGVFMTWKKIGFLD 443 Query: 290 KAAEMNTDPLVLTAFRKGFFLRSCEEMYS-SLGQKSSEKRVWTYQDLIDMVCK 135 K EM TDPLV AF KG FL+SCEE+ + SLG + E + WTYQ L+++V K Sbjct: 444 KPVEMKTDPLVHAAFGKGKFLKSCEEVKNQSLGMRGEESKAWTYQYLMEVVVK 496 >ref|XP_006414780.1| hypothetical protein EUTSA_v10027442mg [Eutrema salsugineum] gi|557115950|gb|ESQ56233.1| hypothetical protein EUTSA_v10027442mg [Eutrema salsugineum] Length = 495 Score = 407 bits (1045), Expect = e-110 Identities = 203/417 (48%), Positives = 285/417 (68%), Gaps = 2/417 (0%) Frame = -3 Query: 1364 PLQILRNEGNWSEKELWLVMRFLKETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXX 1185 PL++LR +G+WS+ + W V+RFL+ +S+ EIL VFD WKNL SRINE NYEKI Sbjct: 82 PLRLLREDGDWSKHQFWAVVRFLRHSSRLHEILPVFDAWKNLEPSRINEANYEKILRFLC 141 Query: 1184 XXXXXXEAVSVLREM-KNNNINLSLKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIP 1008 EA+ + M + ++ SL++YNSIIHG+A G+FE+A+ ++ M + ++ P Sbjct: 142 EEKSMNEAIRAFQCMIDEHELSPSLEIYNSIIHGYANDGKFEEAMFYMNHMKENDMLPET 201 Query: 1007 ETYNGLIRAYGNYGMYDEMGKCVKKMESEGCFPDSVTYNLLIVEFARGGLFKRMERVDRT 828 ETY+GLI AYG + +YDE+ C+KKMES+GC D VTYNLLI EFARGGL KRME++ ++ Sbjct: 202 ETYDGLIEAYGKWKLYDEIVLCIKKMESDGCVRDHVTYNLLIREFARGGLLKRMEQMYQS 261 Query: 827 LLSKKMNLQTFTLIAMLEAYVDLGLLEKMEKVYQRVLNSQTPLKYGMIRKMAGFYIENHM 648 L+S+KM L+ TL++MLEAY + G+LEKME Y +++ L ++RK+A YI+N M Sbjct: 262 LMSRKMTLEPCTLLSMLEAYAEFGVLEKMEDTYNKIVRFGISLDEDLVRKVANVYIDNLM 321 Query: 647 FSRLEDLGVDVASRNGVNDLVWCLRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNIM 468 FSRL+DLG + DL WCLRLL HAC+ SR+G++ +++ M A+ WN T NI+ Sbjct: 322 FSRLDDLGRGIRR----TDLAWCLRLLCHACLVSRKGLDYVVKEMEEARVPWNATFANIV 377 Query: 467 ALMYLKMKDFRQLDILLSQIETRSLKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLEK 288 L Y KM DFR +++LLS++ T+ +K D+VTVG++ D SV GFDG F W+++G L+K Sbjct: 378 LLAYSKMGDFRSVELLLSELRTKHVKLDLVTVGIVLDLSVDGFDGTGVFMTWKKIGFLDK 437 Query: 287 AAEMNTDPLVLTAFRKGFFLRSCEEMYSS-LGQKSSEKRVWTYQDLIDMVCKYNGRK 120 E TDPLV AF KG FLRSCEE+ + LG + E + WTYQ L+++V K K Sbjct: 438 PVETKTDPLVHAAFGKGRFLRSCEEVKNQVLGTRVEESKSWTYQYLMELVVKNQKNK 494 >ref|NP_193155.4| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|223635638|sp|O23278.2|PP310_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g14190, chloroplastic; Flags: Precursor gi|332657991|gb|AEE83391.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 501 Score = 402 bits (1032), Expect = e-109 Identities = 202/413 (48%), Positives = 285/413 (69%), Gaps = 3/413 (0%) Frame = -3 Query: 1364 PLQILRNEGNWSEKELWLVMRFLKETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXX 1185 PL++L+ +G+WS+ W V+RFL+++S+ EIL VFD WKNL SRI+E NYE+I Sbjct: 83 PLRLLQEDGDWSKDHFWAVIRFLRQSSRLHEILPVFDTWKNLEPSRISENNYERIIRFLC 142 Query: 1184 XXXXXXEAVSVLREM-KNNNINLSLKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIP 1008 EA+ R M ++ ++ SL++YNSIIH +A G+FE+A+ +L M + L PI Sbjct: 143 EEKSMSEAIRAFRSMIDDHELSPSLEIYNSIIHSYADDGKFEEAMFYLNHMKENGLLPIT 202 Query: 1007 ETYNGLIRAYGNYGMYDEMGKCVKKMESEGCFPDSVTYNLLIVEFARGGLFKRMERVDRT 828 ETY+GLI AYG + MYDE+ C+K+MES+GC D VTYNLLI EF+RGGL KRME++ ++ Sbjct: 203 ETYDGLIEAYGKWKMYDEIVLCLKRMESDGCVRDHVTYNLLIREFSRGGLLKRMEQMYQS 262 Query: 827 LLSKKMNLQTFTLIAMLEAYVDLGLLEKMEKVYQRVLNSQTPLKYGMIRKMAGFYIENHM 648 L+S+KM L+ TL++MLEAY + GL+EKME+ +++ L G++RK+A YIEN M Sbjct: 263 LMSRKMTLEPSTLLSMLEAYAEFGLIEKMEETCNKIIRFGISLDEGLVRKLANVYIENLM 322 Query: 647 FSRLEDLGVDV-ASRNGVNDLVWCLRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNI 471 FSRL+DLG + ASR +L WCLRLL HA + SR+G++ +++ M A+ WN T NI Sbjct: 323 FSRLDDLGRGISASRTRRTELAWCLRLLCHARLVSRKGLDYVVKEMEEARVPWNTTFANI 382 Query: 470 MALMYLKMKDFRQLDILLSQIETRSLKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLE 291 L Y KM DF +++LLS++ + +K D+VTVG++FD S FDG F W+++G L+ Sbjct: 383 ALLAYSKMGDFTSIELLLSELRIKHVKLDLVTVGIVFDLSEARFDGTGVFMTWKKIGFLD 442 Query: 290 KAAEMNTDPLVLTAFRKGFFLRSCEEMYS-SLGQKSSEKRVWTYQDLIDMVCK 135 K EM TDPLV AF KG FLRSCEE+ + SLG + E + WTYQ L+++V K Sbjct: 443 KPVEMKTDPLVHAAFGKGQFLRSCEEVKNQSLGTRDGESKSWTYQYLMELVVK 495 >ref|XP_006283572.1| hypothetical protein CARUB_v10004636mg [Capsella rubella] gi|482552277|gb|EOA16470.1| hypothetical protein CARUB_v10004636mg [Capsella rubella] Length = 501 Score = 396 bits (1018), Expect = e-107 Identities = 194/410 (47%), Positives = 284/410 (69%), Gaps = 2/410 (0%) Frame = -3 Query: 1364 PLQILRNEGNWSEKELWLVMRFLKETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXX 1185 PLQ+L+ +G+WS+ W V+RFL+ +S+ EIL V+D WKNL SRI+ +NYE++ Sbjct: 90 PLQLLQEDGDWSKDHFWAVIRFLRHSSRLHEILPVYDAWKNLEPSRISVVNYERVIRFLC 149 Query: 1184 XXXXXXEAVSVLREM-KNNNINLSLKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIP 1008 EA+ R M ++ ++ SL++YNSIIHG+A G+FE+A+ +L +M + L PI Sbjct: 150 EERSMNEAIRAFRSMIDDDELSPSLEIYNSIIHGYADDGKFEEAMFYLNQMKENGLSPIS 209 Query: 1007 ETYNGLIRAYGNYGMYDEMGKCVKKMESEGCFPDSVTYNLLIVEFARGGLFKRMERVDRT 828 ETY+GLI AYG + MYDE+ CV++MES+GC D VTYNLLI +F+RGGL KRME++ ++ Sbjct: 210 ETYDGLIEAYGKWKMYDEIVLCVRRMESDGCVRDHVTYNLLIRQFSRGGLLKRMEQMYQS 269 Query: 827 LLSKKMNLQTFTLIAMLEAYVDLGLLEKMEKVYQRVLNSQTPLKYGMIRKMAGFYIENHM 648 L+S+KM L+ TL++MLEAY + G++EKME+ +++ L G++RK+A YI+N M Sbjct: 270 LMSRKMTLEPCTLLSMLEAYAEFGVIEKMEETCNKIIRFGISLDDGLVRKLAKVYIDNLM 329 Query: 647 FSRLEDLGVDVA-SRNGVNDLVWCLRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNI 471 FSRL+DLG ++ SR +DL WCLRLL H+ + SR+G++ +++ M AK +WN T NI Sbjct: 330 FSRLDDLGRGISYSRTRRSDLAWCLRLLCHSRLVSRKGLDYVLKEMTEAKVTWNTTFANI 389 Query: 470 MALMYLKMKDFRQLDILLSQIETRSLKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLE 291 + L Y KM DF+ +++LL + T+ +K D+VTVG++FD S GFDG F W+++G L+ Sbjct: 390 VLLAYSKMGDFKSIELLLDGLRTKRVKLDLVTVGIVFDLSEAGFDGTGVFMTWKKIGFLD 449 Query: 290 KAAEMNTDPLVLTAFRKGFFLRSCEEMYSSLGQKSSEKRVWTYQDLIDMV 141 K EM TDPLV AF KG FLR CEEM + + WTYQ+L+++V Sbjct: 450 KPVEMKTDPLVHAAFGKGQFLRRCEEM------RGEDPTPWTYQNLMELV 493 >emb|CAN80932.1| hypothetical protein VITISV_017362 [Vitis vinifera] Length = 1697 Score = 393 bits (1009), Expect = e-106 Identities = 194/338 (57%), Positives = 251/338 (74%) Frame = -3 Query: 1364 PLQILRNEGNWSEKELWLVMRFLKETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXX 1185 PLQ+LR++G+W+++ W V+RFLK+ S+ EIL VF LWK+++KSRINE NY KI Sbjct: 1356 PLQLLRDDGDWNKQHFWAVIRFLKDASRSSEILPVFHLWKDMDKSRINEFNYAKIIGLLS 1415 Query: 1184 XXXXXXEAVSVLREMKNNNINLSLKLYNSIIHGFAYKGEFEDALIFLKEMVDFNLKPIPE 1005 E+V L MK + + SL++YN +IH FA KGEF+ AL FL E+ NL E Sbjct: 1416 QEDLAEESVLALEXMKTHGLKPSLEIYNLVIHCFARKGEFDRALYFLNELKXNNLIADTE 1475 Query: 1004 TYNGLIRAYGNYGMYDEMGKCVKKMESEGCFPDSVTYNLLIVEFARGGLFKRMERVDRTL 825 TY+GLI++YG Y MYDE+ +CVKKMES+GC PD +TYNLLI EF+RGGL KRMERV +T+ Sbjct: 1476 TYDGLIQSYGKYKMYDELDECVKKMESDGCLPDHITYNLLIQEFSRGGLLKRMERVFQTV 1535 Query: 824 LSKKMNLQTFTLIAMLEAYVDLGLLEKMEKVYQRVLNSQTPLKYGMIRKMAGFYIENHMF 645 LSKKM LQ+ TL+ MLEAY + G++EKME Y+RVLNS+T LK +IRK+A YIEN+ F Sbjct: 1536 LSKKMGLQSSTLVVMLEAYANFGIIEKMENAYRRVLNSKTSLKDDLIRKLAEVYIENYKF 1595 Query: 644 SRLEDLGVDVASRNGVNDLVWCLRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNIMA 465 SRL D+G+D+AS DLVWCLRLLSHAC+ SR+G++SI++ M WN TV N + Sbjct: 1596 SRLADMGLDLASVTSRTDLVWCLRLLSHACLLSRKGLDSIVKEMEAKNVPWNATVANTIL 1655 Query: 464 LMYLKMKDFRQLDILLSQIETRSLKPDIVTVGVLFDAS 351 L YLKMKDF +L ILL ++ TR +KPDIVTVG+LFDA+ Sbjct: 1656 LAYLKMKDFTRLRILLLELSTRHVKPDIVTVGILFDAN 1693 >emb|CAB10198.1| salt-inducible protein homolog [Arabidopsis thaliana] gi|7268124|emb|CAB78461.1| salt-inducible protein homolog [Arabidopsis thaliana] Length = 561 Score = 387 bits (995), Expect = e-105 Identities = 200/419 (47%), Positives = 280/419 (66%), Gaps = 16/419 (3%) Frame = -3 Query: 1343 EGNWSEKELWLVMRFLKETSQPQEIL-------------QVFDLWKNLNKSRINEINYEK 1203 +G+WS+ W V+RFL+++S+ EIL QVFD WKNL SRI+E NYE+ Sbjct: 137 DGDWSKDHFWAVIRFLRQSSRLHEILPNMKMTFCFFFQLQVFDTWKNLEPSRISENNYER 196 Query: 1202 IXXXXXXXXXXXEAVSVLREM-KNNNINLSLKLYNSIIHGFAYKGEFEDALIFLKEMVDF 1026 I EA+ R M ++ ++ SL++YNSIIH +A G+FE+A+ +L M + Sbjct: 197 IIRFLCEEKSMSEAIRAFRSMIDDHELSPSLEIYNSIIHSYADDGKFEEAMFYLNHMKEN 256 Query: 1025 NLKPIPETYNGLIRAYGNYGMYDEMGKCVKKMESEGCFPDSVTYNLLIVEFARGGLFKRM 846 L PI ETY+GLI AYG + MYDE+ C+K+MES+GC D VTYNLLI EF+RGGL KRM Sbjct: 257 GLLPITETYDGLIEAYGKWKMYDEIVLCLKRMESDGCVRDHVTYNLLIREFSRGGLLKRM 316 Query: 845 ERVDRTLLSKKMNLQTFTLIAMLEAYVDLGLLEKMEKVYQRVLNSQTPLKYGMIRKMAGF 666 E++ ++L+S+KM L+ TL++MLEAY + GL+EKME+ +++ L G++RK+A Sbjct: 317 EQMYQSLMSRKMTLEPSTLLSMLEAYAEFGLIEKMEETCNKIIRFGISLDEGLVRKLANV 376 Query: 665 YIENHMFSRLEDLGVDV-ASRNGVNDLVWCLRLLSHACISSRRGIESIIQAMVVAKFSWN 489 YIEN MFSRL+DLG + ASR +L WCLRLL HA + SR+G++ +++ M A+ WN Sbjct: 377 YIENLMFSRLDDLGRGISASRTRRTELAWCLRLLCHARLVSRKGLDYVVKEMEEARVPWN 436 Query: 488 ITVTNIMALMYLKMKDFRQLDILLSQIETRSLKPDIVTVGVLFDASVGGFDGNTTFERWR 309 T NI L Y KM DF +++LLS++ + +K D+VTVG++FD S FDG F W+ Sbjct: 437 TTFANIALLAYSKMGDFTSIELLLSELRIKHVKLDLVTVGIVFDLSEARFDGTGVFMTWK 496 Query: 308 RMGLLEKAAEMNTDPLVLTAFRKGFFLRSCEEMYS-SLGQKSSEKRVWTYQDLIDMVCK 135 ++G L+K EM TDPLV AF KG FLRSCEE+ + SLG + E + WTYQ L+++V K Sbjct: 497 KIGFLDKPVEMKTDPLVHAAFGKGQFLRSCEEVKNQSLGTRDGESKSWTYQYLMELVVK 555 >ref|XP_001762610.1| predicted protein [Physcomitrella patens] gi|162686343|gb|EDQ72733.1| predicted protein [Physcomitrella patens] Length = 418 Score = 175 bits (444), Expect = 5e-41 Identities = 121/393 (30%), Positives = 198/393 (50%), Gaps = 4/393 (1%) Frame = -3 Query: 1316 WLVMRFLKETSQPQEILQVFDLWKNLNKSRINEINYEKIXXXXXXXXXXXEAVSVLREMK 1137 W V+ +L + EIL+VF W+ + + E+ Y KI EA ++ EM Sbjct: 10 WTVIDYLHGHRRMAEILEVFKWWQQQDGYKPYELYYTKIIRMLGQAHMPTEARTLFIEMC 69 Query: 1136 NNNINLSLKLYNSIIHGFAYKGEFEDALIFLKEMV-DFNLKPIPETYNGLIRAYGNYGMY 960 + S+ Y ++ G+A +GEFE+A L++M+ + KP TY GLI AYG +GMY Sbjct: 70 ELGLRPSVVTYTYLLQGYAERGEFEEAEQILRDMILSGDAKPNTTTYAGLIYAYGKHGMY 129 Query: 959 DEMGKCVKKMESEGCFPDSVTYNLLIVEFARGGLFKRMERVDRTLLSKKMNLQTFTLIAM 780 D M + +M+++ D +Y LI +ARGGLF RM++ + + M + T+ A+ Sbjct: 130 DRMWRTFNRMKTQHIPADEFSYRTLIKAYARGGLFSRMQQTMKEMSRNGMYADSATMNAV 189 Query: 779 LEAYVDLGLLEKMEKVYQRVLNSQTPLKYGMIRKMAGFYIENHMFSRLEDL--GVDVASR 606 + AY + GL+++MEK Y+ + + I+ + Y+++ +F +L V + R Sbjct: 190 VLAYAEAGLVKEMEKQYEVMWKNSFTAGQETIKAIVRAYVKDSLFFQLSGYVKRVGLRKR 249 Query: 605 NGVNDLVWCLRLLSHACISSRRGIESIIQAMVVAKFSWNITVTNIMALMYLKMKDFRQLD 426 VN L W LLSHA + + Q M FS ++T NIMAL Y + K L Sbjct: 250 TMVNYL-WNALLLSHAANLAMDDLGVDFQNMKYLGFSPDVTTCNIMALAYSRAKQLEDLH 308 Query: 425 ILLSQIETRSLKPDIVTVGVLFDASVGGFDGNTTFERWRRMGLLEKAAEMNTDPLVLTAF 246 L+ ++ + PD+VT G + D E L+ AAE+ TDPLV Sbjct: 309 QLIVTMQDNGIAPDLVTYGAVIDVFTEEKLRPNLLEELVEFRNLDVAAEVETDPLVFEVL 368 Query: 245 RKGFFLRSCEEMYSSL-GQKSSEKRVWTYQDLI 150 KG F +CE++ ++ G++ +++ TY +L+ Sbjct: 369 GKGRFHVACEKLARNMEGERMNQR---TYGELV 398