BLASTX nr result
ID: Paeonia23_contig00009808
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Paeonia23_contig00009808 (2137 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002268109.1| PREDICTED: pentatricopeptide repeat-containi... 599 e-168 ref|XP_006421046.1| hypothetical protein CICLE_v10004784mg [Citr... 572 e-160 ref|XP_007034168.1| Pentatricopeptide repeat-containing protein,... 572 e-160 ref|XP_007222864.1| hypothetical protein PRUPE_ppa004279mg [Prun... 563 e-157 ref|XP_003552730.1| PREDICTED: pentatricopeptide repeat-containi... 542 e-151 gb|EXC14264.1| hypothetical protein L484_021763 [Morus notabilis] 540 e-151 ref|XP_004298231.1| PREDICTED: pentatricopeptide repeat-containi... 538 e-150 ref|XP_007163800.1| hypothetical protein PHAVU_001G265200g [Phas... 531 e-148 ref|XP_003538531.1| PREDICTED: pentatricopeptide repeat-containi... 528 e-147 ref|XP_002516403.1| pentatricopeptide repeat-containing protein,... 528 e-147 ref|XP_002321108.2| hypothetical protein POPTR_0014s14700g [Popu... 525 e-146 ref|XP_006492991.1| PREDICTED: pentatricopeptide repeat-containi... 512 e-142 ref|XP_006855721.1| hypothetical protein AMTR_s00044p00151840 [A... 495 e-137 ref|XP_006414780.1| hypothetical protein EUTSA_v10027442mg [Eutr... 478 e-132 emb|CAN80932.1| hypothetical protein VITISV_017362 [Vitis vinifera] 477 e-132 ref|XP_002868305.1| binding protein [Arabidopsis lyrata subsp. l... 476 e-131 ref|NP_193155.4| pentatricopeptide repeat-containing protein [Ar... 473 e-130 ref|XP_006283572.1| hypothetical protein CARUB_v10004636mg [Caps... 464 e-128 emb|CAB10198.1| salt-inducible protein homolog [Arabidopsis thal... 444 e-122 gb|ACU28207.1| At4g14190-like protein [Arabidopsis thaliana] gi|... 196 3e-47 >ref|XP_002268109.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic-like [Vitis vinifera] Length = 581 Score = 599 bits (1544), Expect = e-168 Identities = 295/450 (65%), Positives = 360/450 (80%), Gaps = 2/450 (0%) Frame = -1 Query: 1876 PSH--AIPHNGTFTKRTALLLDTFHEKQRLGSLLEKLNKKGSIPVQILKDEGDWSKYHFW 1703 P+H + HN TK T LL++T HE +RLG L++KL+ K S P+Q+L+D+GDW+K HFW Sbjct: 74 PTHHLQLSHNNNPTKHTTLLVETLHENERLGVLIQKLSNKASSPLQLLRDDGDWNKQHFW 133 Query: 1702 TVIRFLQHASRSKEAPQVFDLWTNIEKSRINEFNYSKIIRXXXXXXXXXXAVSALQKMKS 1523 VIRFL+ ASRS E VF LW +++KSRINEFNY+KII +V AL+ MK+ Sbjct: 134 AVIRFLKDASRSSEILPVFHLWKDMDKSRINEFNYAKIIGLLSQEDLAEESVLALEGMKT 193 Query: 1522 HGLRLSLDIYNLIIHGFSTKGEFGDAMLFLNELEEASLKPNTETYDGLIQAYGKHKMYDE 1343 HGL+ SL+IYNL+IH F+ KGEF A+ FLNEL+ +L +TETYDGLIQ+YGK+KMYDE Sbjct: 194 HGLKPSLEIYNLVIHCFARKGEFDRALYFLNELKANNLIADTETYDGLIQSYGKYKMYDE 253 Query: 1342 IGNCVKKMESSGCVPDHITYNLLIREFSRGGLIKRMESMYQTLLSKRMVLHPSTMVTMLE 1163 + CVKKMES GC+PDHITYNLLI+EFSRGGL+KRME ++QT+LSK+M L ST+V MLE Sbjct: 254 LDECVKKMESDGCLPDHITYNLLIQEFSRGGLLKRMERVFQTVLSKKMGLQSSTLVVMLE 313 Query: 1162 AYAKFGLLGKMERIYRKVLNSKMPLKDGLVRKLAGVYIENYMFSRLNDLALDLYSETGMT 983 AYA FG++ KME YR+VLNSK LKD L+RKLA VYIENY FSRL D+ L+L S T T Sbjct: 314 AYANFGIIEKMENAYRRVLNSKTSLKDDLIRKLAEVYIENYKFSRLADMGLNLASVTSRT 373 Query: 982 DLVWCLRLLSHACILSRKGMDSVVGEMKVEKISWNVTVANIILLAYMKMKDFKHLSILLT 803 DLVWCLRLLSHAC+LSRKG+DS+V EM+ + + WN TVAN ILLAY+KMKDF L ILL Sbjct: 374 DLVWCLRLLSHACLLSRKGLDSIVKEMEAKNVPWNATVANTILLAYLKMKDFTRLRILLL 433 Query: 802 ELPFHRVNADLVTVGIVFDANKIGFDGTGALNTWRKSGVLSQAVEMNTDPLVLTAFGKGD 623 EL V D+VTVGI+FDAN+IGF+GT ALNTWR++G L +AVEMNTDPLVL+AFGKG+ Sbjct: 434 ELSTRHVKPDIVTVGILFDANRIGFNGTMALNTWRRTGFLDEAVEMNTDPLVLSAFGKGN 493 Query: 622 FLRSCEERYSSLEPKAREKKIWTYQNLIDL 533 FL+SCEE YSSLEP+AR+KKIWTYQNLIDL Sbjct: 494 FLQSCEEMYSSLEPEARKKKIWTYQNLIDL 523 >ref|XP_006421046.1| hypothetical protein CICLE_v10004784mg [Citrus clementina] gi|557522919|gb|ESR34286.1| hypothetical protein CICLE_v10004784mg [Citrus clementina] Length = 510 Score = 572 bits (1474), Expect = e-160 Identities = 277/442 (62%), Positives = 349/442 (78%) Frame = -1 Query: 1843 TKRTALLLDTFHEKQRLGSLLEKLNKKGSIPVQILKDEGDWSKYHFWTVIRFLQHASRSK 1664 TK T LL++++HE Q L +L+++LNKK S P+QIL+ +GDW+K HFW VIRFL+++SRS+ Sbjct: 61 TKHTTLLVESYHEHQALNALIQRLNKKVSCPLQILQHDGDWTKDHFWAVIRFLKNSSRSR 120 Query: 1663 EAPQVFDLWTNIEKSRINEFNYSKIIRXXXXXXXXXXAVSALQKMKSHGLRLSLDIYNLI 1484 + PQVFD+W NIEKSRINEFN KII AV A Q+M+ L+ SL+IYN I Sbjct: 121 QIPQVFDMWKNIEKSRINEFNSQKIIGMLCEEGLMEEAVRAFQEMEGFALKPSLEIYNSI 180 Query: 1483 IHGFSTKGEFGDAMLFLNELEEASLKPNTETYDGLIQAYGKHKMYDEIGNCVKKMESSGC 1304 IHG+S G+F +A+LFLNE++E +L P ++TYDGLIQAYGK+KMYDEI C+K M+ GC Sbjct: 181 IHGYSKIGKFNEALLFLNEMKEMNLSPQSDTYDGLIQAYGKYKMYDEIDMCLKMMKLDGC 240 Query: 1303 VPDHITYNLLIREFSRGGLIKRMESMYQTLLSKRMVLHPSTMVTMLEAYAKFGLLGKMER 1124 PDHITYNLLI+EF+ GL+KRME Y+++L+KRM L STMV +L+AY FG+L KME+ Sbjct: 241 SPDHITYNLLIQEFACAGLLKRMEGTYKSMLTKRMHLRSSTMVAILDAYMNFGMLDKMEK 300 Query: 1123 IYRKVLNSKMPLKDGLVRKLAGVYIENYMFSRLNDLALDLYSETGMTDLVWCLRLLSHAC 944 Y+++LNS+ PLK+ LVRKLA VYI+NYMFSRL+DL DL S G T+LVWCLRLLSHAC Sbjct: 301 FYKRLLNSRTPLKEDLVRKLAEVYIKNYMFSRLDDLGDDLASRIGRTELVWCLRLLSHAC 360 Query: 943 ILSRKGMDSVVGEMKVEKISWNVTVANIILLAYMKMKDFKHLSILLTELPFHRVNADLVT 764 +LS +G+DSVV EM+ K+ WNVT ANIILLAY+KMKDFKHL +LL+ELP V D+VT Sbjct: 361 LLSHRGIDSVVREMESAKVRWNVTTANIILLAYLKMKDFKHLRVLLSELPTRHVKPDIVT 420 Query: 763 VGIVFDANKIGFDGTGALNTWRKSGVLSQAVEMNTDPLVLTAFGKGDFLRSCEERYSSLE 584 +GI++DA +IGFDGTGAL W++ G L + VE+NTDPLVL +GKG FLR CEE YSSLE Sbjct: 421 IGILYDARRIGFDGTGALEMWKRIGFLFKTVEINTDPLVLAVYGKGHFLRYCEEVYSSLE 480 Query: 583 PKAREKKIWTYQNLIDLVFKPN 518 P +REKK WTYQNLIDLV K N Sbjct: 481 PYSREKKRWTYQNLIDLVIKHN 502 >ref|XP_007034168.1| Pentatricopeptide repeat-containing protein, putative [Theobroma cacao] gi|508713197|gb|EOY05094.1| Pentatricopeptide repeat-containing protein, putative [Theobroma cacao] Length = 504 Score = 572 bits (1474), Expect = e-160 Identities = 285/457 (62%), Positives = 350/457 (76%) Frame = -1 Query: 1876 PSHAIPHNGTFTKRTALLLDTFHEKQRLGSLLEKLNKKGSIPVQILKDEGDWSKYHFWTV 1697 PS P + TALL++T+H +RL +LLE+L K S P+Q+L+D+GDW+K FW V Sbjct: 49 PSPPRPDGSSCKNHTALLVETYHHHRRLKALLERLEKDDSCPLQMLRDDGDWTKDIFWVV 108 Query: 1696 IRFLQHASRSKEAPQVFDLWTNIEKSRINEFNYSKIIRXXXXXXXXXXAVSALQKMKSHG 1517 IRFL+ ASRS E QVF +W NIEKSRINE NY KII AV AL++M +G Sbjct: 109 IRFLRRASRSNEILQVFHMWKNIEKSRINELNYEKIIGLLGEEGRVGQAVQALREMGGYG 168 Query: 1516 LRLSLDIYNLIIHGFSTKGEFGDAMLFLNELEEASLKPNTETYDGLIQAYGKHKMYDEIG 1337 L+ SL++YN IIH ++ G+F DA+ FLNE++E L P T+TYDGLI+AYGK+KMYDEIG Sbjct: 169 LKPSLEVYNSIIHAYARNGKFDDALSFLNEMKEIGLAPETDTYDGLIEAYGKYKMYDEIG 228 Query: 1336 NCVKKMESSGCVPDHITYNLLIREFSRGGLIKRMESMYQTLLSKRMVLHPSTMVTMLEAY 1157 C+K ME C PDH TYNLLIREFSRGGL++RME +YQ LLSK+M L S++V MLEAY Sbjct: 229 TCLKMMELDRCRPDHFTYNLLIREFSRGGLLQRMEQVYQILLSKQMNLQSSSLVAMLEAY 288 Query: 1156 AKFGLLGKMERIYRKVLNSKMPLKDGLVRKLAGVYIENYMFSRLNDLALDLYSETGMTDL 977 A FG+L KME++YRKV+NS M LK+ +R LA VYI+NYMFSRL+DL +DL S TG DL Sbjct: 289 ANFGILDKMEKVYRKVVNS-MTLKEDTIRILASVYIKNYMFSRLDDLGIDLSSRTGRNDL 347 Query: 976 VWCLRLLSHACILSRKGMDSVVGEMKVEKISWNVTVANIILLAYMKMKDFKHLSILLTEL 797 VWCLRLLSHAC+LSRKGMDSV+ EM K SWNVT++NIILLAYMKMKDFK L ILL++L Sbjct: 348 VWCLRLLSHACLLSRKGMDSVILEMCEAKASWNVTISNIILLAYMKMKDFKRLRILLSQL 407 Query: 796 PFHRVNADLVTVGIVFDANKIGFDGTGALNTWRKSGVLSQAVEMNTDPLVLTAFGKGDFL 617 P H+V D++T+GI+ DA +IGFDG AL TWRK G+L + VEMNTDPLVL AFGKG FL Sbjct: 408 PSHQVRPDIITIGILSDAIEIGFDGAEALETWRKMGLLYRTVEMNTDPLVLIAFGKGHFL 467 Query: 616 RSCEERYSSLEPKAREKKIWTYQNLIDLVFKPNVRQP 506 R CEE Y+SLEPKAR++K WTY +LIDLV K ++P Sbjct: 468 RDCEEIYTSLEPKARKEKRWTYHHLIDLVIKHKAKRP 504 >ref|XP_007222864.1| hypothetical protein PRUPE_ppa004279mg [Prunus persica] gi|462419800|gb|EMJ24063.1| hypothetical protein PRUPE_ppa004279mg [Prunus persica] Length = 518 Score = 563 bits (1450), Expect = e-157 Identities = 275/440 (62%), Positives = 343/440 (77%) Frame = -1 Query: 1843 TKRTALLLDTFHEKQRLGSLLEKLNKKGSIPVQILKDEGDWSKYHFWTVIRFLQHASRSK 1664 TK T LL++TFHE QRL +LL+ L GS P+Q+L ++GDW+K FW IRFL+H R Sbjct: 70 TKHTTLLVETFHEHQRLKALLQNLIN-GSCPLQLLGEDGDWTKDQFWAAIRFLKHTFRFN 128 Query: 1663 EAPQVFDLWTNIEKSRINEFNYSKIIRXXXXXXXXXXAVSALQKMKSHGLRLSLDIYNLI 1484 E Q+FD+W NIEKSRINEFNYSKII AV Q+MKSH LR SL++YN + Sbjct: 129 EILQLFDMWKNIEKSRINEFNYSKIIGLLGEEGLIEEAVRCFQEMKSHNLRPSLEVYNSV 188 Query: 1483 IHGFSTKGEFGDAMLFLNELEEASLKPNTETYDGLIQAYGKHKMYDEIGNCVKKMESSGC 1304 IH + +G F DA+ FLNE++E +L P T+TYDGLI+AYGK++MYD+IG CVKKM+ +GC Sbjct: 189 IHVCARQGNFEDALFFLNEMKEMNLAPETDTYDGLIEAYGKYRMYDQIGMCVKKMKLNGC 248 Query: 1303 VPDHITYNLLIREFSRGGLIKRMESMYQTLLSKRMVLHPSTMVTMLEAYAKFGLLGKMER 1124 PDHITYNLLIREF+RGGL+KRMES+YQ++LS+RM L ST++ M+E YAKFG+L KME Sbjct: 249 SPDHITYNLLIREFARGGLLKRMESVYQSMLSRRMALQSSTLIAMVEVYAKFGILEKMEN 308 Query: 1123 IYRKVLNSKMPLKDGLVRKLAGVYIENYMFSRLNDLALDLYSETGMTDLVWCLRLLSHAC 944 +YR+VLNS +K+ L+RKLA VYI+NYMFSRL L +DL S G TDLVWCLRLLS A Sbjct: 309 VYRRVLNSGTVVKNDLIRKLAEVYIDNYMFSRLEKLGVDLSSRFGQTDLVWCLRLLSQAG 368 Query: 943 ILSRKGMDSVVGEMKVEKISWNVTVANIILLAYMKMKDFKHLSILLTELPFHRVNADLVT 764 +LS++GMDS+V EMK + + WN TVANII+LAY+KMKDF HL I L++L V D++T Sbjct: 369 VLSQRGMDSIVDEMKEQNVPWNETVANIIMLAYLKMKDFTHLRIFLSQLLTQGVEPDIIT 428 Query: 763 VGIVFDANKIGFDGTGALNTWRKSGVLSQAVEMNTDPLVLTAFGKGDFLRSCEERYSSLE 584 VGIVFDAN+IG+DG+ L+TWR++G L +AVEMNTDPLVLT FGKG FLR+CE YSSLE Sbjct: 429 VGIVFDANRIGYDGSRTLDTWRENGFLRKAVEMNTDPLVLTTFGKGHFLRNCEAAYSSLE 488 Query: 583 PKAREKKIWTYQNLIDLVFK 524 P+ RE K WTY +LIDLVFK Sbjct: 489 PEDRENKTWTYHHLIDLVFK 508 >ref|XP_003552730.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic-like [Glycine max] Length = 509 Score = 542 bits (1397), Expect = e-151 Identities = 264/449 (58%), Positives = 345/449 (76%) Frame = -1 Query: 1870 HAIPHNGTFTKRTALLLDTFHEKQRLGSLLEKLNKKGSIPVQILKDEGDWSKYHFWTVIR 1691 H+ H TK T LL++T+H L +LL KL K+ P+ +L ++GDWSK HFW V+R Sbjct: 50 HSSYHRFADTKHTTLLVETYHLHDSLRALLAKLQKEDCNPLHVLAEDGDWSKDHFWAVVR 109 Query: 1690 FLQHASRSKEAPQVFDLWTNIEKSRINEFNYSKIIRXXXXXXXXXXAVSALQKMKSHGLR 1511 FL+ ASR + QVFD+W NIEKSRI+EFNY+KII A+SAL+ MK G++ Sbjct: 110 FLKSASRFTQILQVFDMWKNIEKSRISEFNYNKIIGLLCEGGKMEDALSALRDMKVQGIK 169 Query: 1510 LSLDIYNLIIHGFSTKGEFGDAMLFLNELEEASLKPNTETYDGLIQAYGKHKMYDEIGNC 1331 SLD YN IIHG S +G+F DA+ F++E++E+ L+ ++ETYDGL+ AYGK +MYDE+G C Sbjct: 170 PSLDTYNPIIHGLSREGKFSDALRFIDEMKESGLELDSETYDGLLGAYGKFQMYDEMGEC 229 Query: 1330 VKKMESSGCVPDHITYNLLIREFSRGGLIKRMESMYQTLLSKRMVLHPSTMVTMLEAYAK 1151 VKKME GC PDHITYN+LI+E++R GL++RME +YQ ++SKRM + ST+V MLEAY Sbjct: 230 VKKMELEGCSPDHITYNILIQEYARAGLLQRMEKLYQRMVSKRMHVQSSTLVAMLEAYTT 289 Query: 1150 FGLLGKMERIYRKVLNSKMPLKDGLVRKLAGVYIENYMFSRLNDLALDLYSETGMTDLVW 971 FG++ KME YRK+L+SK L+D L+RK+A VYI+NYMFSRL DLALDL G ++LVW Sbjct: 290 FGMVEKMENFYRKILSSKTCLEDDLIRKVAEVYIKNYMFSRLEDLALDLCPAFGESNLVW 349 Query: 970 CLRLLSHACILSRKGMDSVVGEMKVEKISWNVTVANIILLAYMKMKDFKHLSILLTELPF 791 CLRLLS+AC LS+KGMD VV EM+ K++WNVTVANII+LAY+KMKDF+HL ILL++LP Sbjct: 350 CLRLLSYACPLSKKGMDIVVREMRDAKVNWNVTVANIIMLAYVKMKDFRHLKILLSQLPI 409 Query: 790 HRVNADLVTVGIVFDANKIGFDGTGALNTWRKSGVLSQAVEMNTDPLVLTAFGKGDFLRS 611 +RV D++T+GI+FDA +IGFDG+GAL TWR+ G L + VE+ TD LVLTAFGKG FL+S Sbjct: 410 YRVQPDIITIGILFDATRIGFDGSGALETWRRMGYLYRVVEIKTDSLVLTAFGKGHFLKS 469 Query: 610 CEERYSSLEPKAREKKIWTYQNLIDLVFK 524 CEE YSSL P+ R++K WTY +LI L+ K Sbjct: 470 CEEVYSSLHPEDRKRKTWTYHDLIALLSK 498 >gb|EXC14264.1| hypothetical protein L484_021763 [Morus notabilis] Length = 664 Score = 540 bits (1392), Expect = e-151 Identities = 259/440 (58%), Positives = 340/440 (77%) Frame = -1 Query: 1843 TKRTALLLDTFHEKQRLGSLLEKLNKKGSIPVQILKDEGDWSKYHFWTVIRFLQHASRSK 1664 T+ T LL++TFHE ++ +LL++L+K S P+++L+++GDW K HFW V+RFL+H SR+K Sbjct: 61 TEHTTLLVETFHEHRKFKTLLKRLSKNDSCPMRLLREDGDWCKEHFWAVVRFLRHGSRTK 120 Query: 1663 EAPQVFDLWTNIEKSRINEFNYSKIIRXXXXXXXXXXAVSALQKMKSHGLRLSLDIYNLI 1484 E QVFDLW NIEKSRINE NY KII+ AV + ++MKS GL +L++YN + Sbjct: 121 EIVQVFDLWKNIEKSRINELNYCKIIKMLGEEGLMEEAVLSFEEMKSCGLSPTLEVYNSM 180 Query: 1483 IHGFSTKGEFGDAMLFLNELEEASLKPNTETYDGLIQAYGKHKMYDEIGNCVKKMESSGC 1304 IHGFS KG+F DA+++LNE+ E ++ P T+TY+GLI+AY K++MYDEIG C+KKM+ +GC Sbjct: 181 IHGFSQKGDFDDALVYLNEMREQNVVPETDTYEGLIEAYAKYEMYDEIGLCLKKMKLNGC 240 Query: 1303 VPDHITYNLLIREFSRGGLIKRMESMYQTLLSKRMVLHPSTMVTMLEAYAKFGLLGKMER 1124 PDHITYNLL+R+FS+GGL+KRMES+Y T++SKRM L ST+V MLE YA+FG+L KME+ Sbjct: 241 PPDHITYNLLMRKFSKGGLLKRMESVYHTMISKRMYLQSSTLVAMLETYARFGILDKMEK 300 Query: 1123 IYRKVLNSKMPLKDGLVRKLAGVYIENYMFSRLNDLALDLYSETGMTDLVWCLRLLSHAC 944 Y + L +K PL + L+RKLA VYI+NY+FSRL L +DL + G TDL+WCLRLLSHA Sbjct: 301 FYMRTLKTKTPLGEDLIRKLAEVYIDNYLFSRLETLGVDLSTTFGETDLLWCLRLLSHAF 360 Query: 943 ILSRKGMDSVVGEMKVEKISWNVTVANIILLAYMKMKDFKHLSILLTELPFHRVNADLVT 764 + SRKGMD V+ EM+ I WNVT ANIILL ++KMKDF HL I L++L H V D+VT Sbjct: 361 LFSRKGMDFVIQEMERAHIPWNVTFANIILLTHLKMKDFTHLRISLSQLT-HSVEPDIVT 419 Query: 763 VGIVFDANKIGFDGTGALNTWRKSGVLSQAVEMNTDPLVLTAFGKGDFLRSCEERYSSLE 584 VGI+FDA +GFDGT L TW++ +AVEMNTDP+V+TAFGKG+FL++CE YSSLE Sbjct: 420 VGILFDAIGMGFDGTRTLETWKRMDFFYKAVEMNTDPVVITAFGKGNFLQNCERAYSSLE 479 Query: 583 PKAREKKIWTYQNLIDLVFK 524 + RE K WTY NL+DLVFK Sbjct: 480 SEVRETKSWTYNNLVDLVFK 499 >ref|XP_004298231.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic-like [Fragaria vesca subsp. vesca] Length = 509 Score = 538 bits (1385), Expect = e-150 Identities = 264/456 (57%), Positives = 342/456 (75%), Gaps = 1/456 (0%) Frame = -1 Query: 1882 YDPSHAIPHN-GTFTKRTALLLDTFHEKQRLGSLLEKLNKKGSIPVQILKDEGDWSKYHF 1706 + PS PH + T+ T L ++ HE +L +LL+ L +K P+Q+L+D+GDW+ F Sbjct: 52 FSPSPLPPHKTSSSTEHTTLHVEPSHEYHKLRALLDILMEKDCCPLQLLRDDGDWTIDQF 111 Query: 1705 WTVIRFLQHASRSKEAPQVFDLWTNIEKSRINEFNYSKIIRXXXXXXXXXXAVSALQKMK 1526 W VIRFL HASR KE Q+FD+W NIEKSRINEFNYSKII AV Q MK Sbjct: 112 WAVIRFLIHASRPKEILQLFDIWRNIEKSRINEFNYSKIIGLLVEEDLIEEAVVCFQDMK 171 Query: 1525 SHGLRLSLDIYNLIIHGFSTKGEFGDAMLFLNELEEASLKPNTETYDGLIQAYGKHKMYD 1346 S GL LS+++YN IIHG S G F DA+ FLNE++E +L P+ +TYDGLI+AYGK+KMYD Sbjct: 172 SQGLGLSVELYNTIIHGLSRNGNFVDAVHFLNEMKEMNLAPDADTYDGLIEAYGKYKMYD 231 Query: 1345 EIGNCVKKMESSGCVPDHITYNLLIREFSRGGLIKRMESMYQTLLSKRMVLHPSTMVTML 1166 E+G C+KKM +GC PD+ITYNLLIREF+ GGL+ R+E +YQ+++S+RM L T++ +L Sbjct: 232 EMGMCLKKMRLNGCSPDYITYNLLIREFAHGGLLNRVERVYQSMVSRRMDLQVPTLIAIL 291 Query: 1165 EAYAKFGLLGKMERIYRKVLNSKMPLKDGLVRKLAGVYIENYMFSRLNDLALDLYSETGM 986 E YAKFG+L KME YR+VLNS+ LK+ L++K+A VYIENYMFS+L +L +DL G Sbjct: 292 EVYAKFGILEKMEVFYRRVLNSRAILKEDLIKKVAEVYIENYMFSKLENLGVDLSPRFGQ 351 Query: 985 TDLVWCLRLLSHACILSRKGMDSVVGEMKVEKISWNVTVANIILLAYMKMKDFKHLSILL 806 TDLVWCLRLLSHA +LSR+GM+S++ EM+ + + WN TVANI++LAY+KMKDF L L Sbjct: 352 TDLVWCLRLLSHAGLLSRRGMNSIILEMEGKSVPWNATVANIMMLAYLKMKDFTRLRSLF 411 Query: 805 TELPFHRVNADLVTVGIVFDANKIGFDGTGALNTWRKSGVLSQAVEMNTDPLVLTAFGKG 626 ++ V+ D++T GI+FDAN+IG+DG+ LNTWRK G+L +AVEMNTDPLV+T FGKG Sbjct: 412 SQSLTRGVDPDIITFGILFDANRIGYDGSATLNTWRKHGILYKAVEMNTDPLVITTFGKG 471 Query: 625 DFLRSCEERYSSLEPKAREKKIWTYQNLIDLVFKPN 518 FLR+CE YSSLEP+ REKK WTYQ+LID VFK N Sbjct: 472 HFLRNCEAAYSSLEPEVREKKTWTYQDLIDSVFKDN 507 >ref|XP_007163800.1| hypothetical protein PHAVU_001G265200g [Phaseolus vulgaris] gi|561037264|gb|ESW35794.1| hypothetical protein PHAVU_001G265200g [Phaseolus vulgaris] Length = 496 Score = 531 bits (1368), Expect = e-148 Identities = 262/447 (58%), Positives = 337/447 (75%) Frame = -1 Query: 1870 HAIPHNGTFTKRTALLLDTFHEKQRLGSLLEKLNKKGSIPVQILKDEGDWSKYHFWTVIR 1691 HA + +K LL++T+H L +LL KL ++ S P+ IL +GDWSK HFW +R Sbjct: 49 HAFGDTVSDSKHRTLLVETYHHHDSLRALLAKLEREDSNPMYILAQDGDWSKDHFWAAVR 108 Query: 1690 FLQHASRSKEAPQVFDLWTNIEKSRINEFNYSKIIRXXXXXXXXXXAVSALQKMKSHGLR 1511 FL++ASR E QVFD+W IEKSRI+EFNY+KII A+SA Q+MK G++ Sbjct: 109 FLKNASRFVEILQVFDMWKEIEKSRISEFNYNKIIGLLCEDEMMEEALSAFQEMKVQGMK 168 Query: 1510 LSLDIYNLIIHGFSTKGEFGDAMLFLNELEEASLKPNTETYDGLIQAYGKHKMYDEIGNC 1331 SLD YN IIHG S G+F DA+ FL+E++E+ L P++ETYDGLI AYGK ++YDE+G C Sbjct: 169 PSLDTYNPIIHGLSKAGKFSDALRFLDEMKESGLDPDSETYDGLIGAYGKFQLYDEMGEC 228 Query: 1330 VKKMESSGCVPDHITYNLLIREFSRGGLIKRMESMYQTLLSKRMVLHPSTMVTMLEAYAK 1151 VKKME GC PDHITYN+LI+E++R G+++RME +YQ +LSKRM L ST V ML+AY Sbjct: 229 VKKMELEGCSPDHITYNILIQEYARAGILQRMEKLYQRMLSKRMRLQSSTFVAMLKAYTT 288 Query: 1150 FGLLGKMERIYRKVLNSKMPLKDGLVRKLAGVYIENYMFSRLNDLALDLYSETGMTDLVW 971 FG++ KME +RKVLNSK L+D +RK+A VYI+NYMFSRL DLALDL S G +DLVW Sbjct: 289 FGIVEKMEFFFRKVLNSKSCLEDDFIRKMAEVYIKNYMFSRLEDLALDLCSAFGESDLVW 348 Query: 970 CLRLLSHACILSRKGMDSVVGEMKVEKISWNVTVANIILLAYMKMKDFKHLSILLTELPF 791 CLRLLS+AC+LS+KGMD VV EM+ KI+WNV ANII+LAY+KMKDF+HL ILL++L Sbjct: 349 CLRLLSYACLLSKKGMDIVVKEMQDAKINWNVAFANIIMLAYVKMKDFRHLRILLSQLRI 408 Query: 790 HRVNADLVTVGIVFDANKIGFDGTGALNTWRKSGVLSQAVEMNTDPLVLTAFGKGDFLRS 611 +R+ D+VT+GIV DA++IGFDG GAL +WR+ G L + VE+ TD LVLTAFGKG FL+S Sbjct: 409 NRLGPDIVTIGIVLDASRIGFDGRGALESWRRMGYLDRVVELKTDSLVLTAFGKGHFLKS 468 Query: 610 CEERYSSLEPKAREKKIWTYQNLIDLV 530 CEE Y+SL P+ RE+K WTY +LI L+ Sbjct: 469 CEEVYTSLHPEDRERKKWTYNDLIALL 495 >ref|XP_003538531.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic-like isoform X1 [Glycine max] Length = 506 Score = 528 bits (1361), Expect = e-147 Identities = 261/449 (58%), Positives = 339/449 (75%) Frame = -1 Query: 1870 HAIPHNGTFTKRTALLLDTFHEKQRLGSLLEKLNKKGSIPVQILKDEGDWSKYHFWTVIR 1691 HA H TK T LL++T+H L +LL KL + S P+ +L ++ DWSK HFW V+R Sbjct: 48 HASYHRFADTKHTTLLVETYHLHHSLRALLAKLENEYSNPLHMLAEDADWSKDHFWAVVR 107 Query: 1690 FLQHASRSKEAPQVFDLWTNIEKSRINEFNYSKIIRXXXXXXXXXXAVSALQKMKSHGLR 1511 FL+ +S QVFD+W NIEKSRI+EFNY+KII A+SALQ MK G++ Sbjct: 108 FLKSSSNFTHILQVFDMWKNIEKSRISEFNYNKIIGLLCEGGKMKDALSALQDMKVQGIK 167 Query: 1510 LSLDIYNLIIHGFSTKGEFGDAMLFLNELEEASLKPNTETYDGLIQAYGKHKMYDEIGNC 1331 SLD YN IIHG S +G+F DA+ F++E++E+ L+ ++ETYDGLI AYGK +MYDE+G C Sbjct: 168 PSLDTYNPIIHGLSREGKFSDALRFIDEMKESGLELDSETYDGLIGAYGKFQMYDEMGEC 227 Query: 1330 VKKMESSGCVPDHITYNLLIREFSRGGLIKRMESMYQTLLSKRMVLHPSTMVTMLEAYAK 1151 VKKME GC PD ITYN+LI+E++ GGL++RME +YQ +LSKRM + ST+V MLEAY Sbjct: 228 VKKMELEGCSPDPITYNILIQEYAGGGLLQRMEKLYQRMLSKRMHVKSSTLVAMLEAYTT 287 Query: 1150 FGLLGKMERIYRKVLNSKMPLKDGLVRKLAGVYIENYMFSRLNDLALDLYSETGMTDLVW 971 FG++ KME+ YRK+LNSK ++D L+RK+A VYI N+MFSRL DLALDL G ++L W Sbjct: 288 FGMVEKMEKFYRKILNSKTCIEDDLIRKVAEVYINNFMFSRLEDLALDLCPAFGESNLEW 347 Query: 970 CLRLLSHACILSRKGMDSVVGEMKVEKISWNVTVANIILLAYMKMKDFKHLSILLTELPF 791 C RLLS+AC+LS+KGMD VV EM+ K+SWNVTVANII+LAY+KMK+F+HL ILL++LP Sbjct: 348 CFRLLSYACLLSKKGMDIVVQEMQDAKVSWNVTVANIIMLAYVKMKEFRHLRILLSQLPI 407 Query: 790 HRVNADLVTVGIVFDANKIGFDGTGALNTWRKSGVLSQAVEMNTDPLVLTAFGKGDFLRS 611 +RV D++T+GI+FDA +IGFDG+GAL TWR+ G L + VEM TD LVLTAFGKG FL+S Sbjct: 408 YRVQPDIITIGILFDATRIGFDGSGALETWRRMGYLYRVVEMKTDSLVLTAFGKGHFLKS 467 Query: 610 CEERYSSLEPKAREKKIWTYQNLIDLVFK 524 CEE YSSL P+ R++K TY +LI L+ K Sbjct: 468 CEEVYSSLHPEDRKRKTCTYHDLIPLLSK 496 >ref|XP_002516403.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223544501|gb|EEF46020.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 502 Score = 528 bits (1361), Expect = e-147 Identities = 263/449 (58%), Positives = 331/449 (73%) Frame = -1 Query: 1876 PSHAIPHNGTFTKRTALLLDTFHEKQRLGSLLEKLNKKGSIPVQILKDEGDWSKYHFWTV 1697 P HAI + + K LL++++HE QRL +LL +LNKKGS P+Q+L+D+ DWSK HFW V Sbjct: 45 PLHAISQDNSI-KHNTLLVESYHEHQRLKALLARLNKKGSCPLQMLQDDADWSKDHFWAV 103 Query: 1696 IRFLQHASRSKEAPQVFDLWTNIEKSRINEFNYSKIIRXXXXXXXXXXAVSALQKMKSHG 1517 IRFL+H+SRS E QVFD+W +IEKSRINEFNY K+I A SA +MK+ Sbjct: 104 IRFLRHSSRSDEILQVFDMWKDIEKSRINEFNYEKVIEILGEEGLIEDAYSAFIEMKTLC 163 Query: 1516 LRLSLDIYNLIIHGFSTKGEFGDAMLFLNELEEASLKPNTETYDGLIQAYGKHKMYDEIG 1337 L SL +YN +IHG++ G+F DA+ +LN L+E +L P ++TY+GLIQAYGK+KMYDE+G Sbjct: 164 LSPSLQVYNSLIHGYARNGKFDDAVFYLNHLKEINLSPVSDTYNGLIQAYGKYKMYDEMG 223 Query: 1336 NCVKKMESSGCVPDHITYNLLIREFSRGGLIKRMESMYQTLLSKRMVLHPSTMVTMLEAY 1157 C+KKME GC PDH+TYNLLI+E + GL+ RME +YQT RM L +T+ MLEAY Sbjct: 224 MCLKKMEMEGCSPDHVTYNLLIQELAEAGLLTRMEKVYQTTRMNRMDLKSTTLTAMLEAY 283 Query: 1156 AKFGLLGKMERIYRKVLNSKMPLKDGLVRKLAGVYIENYMFSRLNDLALDLYSETGMTDL 977 A FG++ KME I ++ NSK LK+ L++K+A VYIEN+MFSRL L L +G D+ Sbjct: 284 ANFGIVEKMELILKRTRNSKALLKEDLIKKIALVYIENFMFSRLEKLGHYLSKRSGQNDM 343 Query: 976 VWCLRLLSHACILSRKGMDSVVGEMKVEKISWNVTVANIILLAYMKMKDFKHLSILLTEL 797 VWCL LLS+AC+LS+KGMDSVV EMKV K+SWNVT NIILLAY+KMKD L ILL+ L Sbjct: 344 VWCLLLLSNACMLSQKGMDSVVREMKVAKVSWNVTFINIILLAYLKMKDSMRLGILLSTL 403 Query: 796 PFHRVNADLVTVGIVFDANKIGFDGTGALNTWRKSGVLSQAVEMNTDPLVLTAFGKGDFL 617 H V D+VTVG++FDAN IGF G G L TWR++G+L + VE TDPLVL AFGKG FL Sbjct: 404 TNHIVKPDIVTVGVLFDANNIGFHGNGILETWRRTGILYRCVETETDPLVLAAFGKGQFL 463 Query: 616 RSCEERYSSLEPKAREKKIWTYQNLIDLV 530 + CEE YSSLEP AR+K+ WTY NLIDLV Sbjct: 464 KKCEEAYSSLEPVARQKEKWTYCNLIDLV 492 >ref|XP_002321108.2| hypothetical protein POPTR_0014s14700g [Populus trichocarpa] gi|550324215|gb|EEE99423.2| hypothetical protein POPTR_0014s14700g [Populus trichocarpa] Length = 508 Score = 525 bits (1351), Expect = e-146 Identities = 264/455 (58%), Positives = 338/455 (74%), Gaps = 17/455 (3%) Frame = -1 Query: 1843 TKRTALLLDTFHEKQRLGSLLEKLNKKGSIPVQILKDEGDWSKYHFWTVIRFLQHASRSK 1664 TK T LL+D+FHE +RL SLL LN + P+Q+L+ +GDWSK FW+VI+FL+ ++RS Sbjct: 50 TKHTTLLVDSFHEHKRLKSLLHNLNSNQN-PLQLLQQDGDWSKDDFWSVIKFLKLSARSN 108 Query: 1663 EAPQV-----------------FDLWTNIEKSRINEFNYSKIIRXXXXXXXXXXAVSALQ 1535 + QV F +W ++EK+RINEFNY KII AV+A Sbjct: 109 QILQVHSLAHLFFLAARKIEFVFHMWRDVEKTRINEFNYEKIIGLLGEEGLMEDAVTAFM 168 Query: 1534 KMKSHGLRLSLDIYNLIIHGFSTKGEFGDAMLFLNELEEASLKPNTETYDGLIQAYGKHK 1355 +MKS GL LSL++YN IIHG++ G+F DA+ +LN++ E +L P ++TYDGLI+AYG ++ Sbjct: 169 EMKSFGLCLSLEVYNSIIHGYARNGKFDDALFYLNQMNEMNLSPESDTYDGLIEAYGTYR 228 Query: 1354 MYDEIGNCVKKMESSGCVPDHITYNLLIREFSRGGLIKRMESMYQTLLSKRMVLHPSTMV 1175 MYDE+ C+KKME GC PD TYNLLI++F++GGL+ RME +YQ++ +KRM L ST++ Sbjct: 229 MYDEMAMCLKKMELDGCSPDRYTYNLLIQKFAQGGLLTRMERVYQSMRTKRMKLQSSTLI 288 Query: 1174 TMLEAYAKFGLLGKMERIYRKVLNSKMPLKDGLVRKLAGVYIENYMFSRLNDLALDLYSE 995 +MLEAYA FG++ KME+I R NSK+ +K+ LVRKLAGVYI NYMFSRL+DLA+DL S Sbjct: 289 SMLEAYANFGIVEKMEKILRWAWNSKITVKEDLVRKLAGVYIANYMFSRLHDLAVDLTSI 348 Query: 994 TGMTDLVWCLRLLSHACILSRKGMDSVVGEMKVEKISWNVTVANIILLAYMKMKDFKHLS 815 TG TD+VWCL LLSHAC+LSR+GMD+VV EM+ K WN+TVANIILLAY+KMKDF L Sbjct: 349 TGRTDIVWCLHLLSHACLLSRRGMDAVVREMEDAKACWNITVANIILLAYLKMKDFTRLR 408 Query: 814 ILLTELPFHRVNADLVTVGIVFDANKIGFDGTGALNTWRKSGVLSQAVEMNTDPLVLTAF 635 ILL++LP RV D+VT GI+FDA +IGFDG L WRK G+L + VEMNTDPL L+AF Sbjct: 409 ILLSKLPEIRVEPDIVTFGILFDAEEIGFDGKECLEMWRKMGLLYRRVEMNTDPLALSAF 468 Query: 634 GKGDFLRSCEERYSSLEPKAREKKIWTYQNLIDLV 530 GKG FLRSCEE YSSLEP AREKK WTY + I+LV Sbjct: 469 GKGSFLRSCEEGYSSLEPNAREKKRWTYVDFINLV 503 >ref|XP_006492991.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic-like [Citrus sinensis] Length = 477 Score = 512 bits (1319), Expect = e-142 Identities = 257/442 (58%), Positives = 324/442 (73%) Frame = -1 Query: 1843 TKRTALLLDTFHEKQRLGSLLEKLNKKGSIPVQILKDEGDWSKYHFWTVIRFLQHASRSK 1664 TK T LL++++HE Q L +L+++LNKK S P+QIL+ +GDW+K HFW VIRFL+++SRS+ Sbjct: 61 TKHTTLLVESYHEHQALNALIQRLNKKVSCPLQILQHDGDWTKDHFWAVIRFLKNSSRSR 120 Query: 1663 EAPQVFDLWTNIEKSRINEFNYSKIIRXXXXXXXXXXAVSALQKMKSHGLRLSLDIYNLI 1484 + PQVFD+W NIEKSRINEFNY KII AV A Q+M+ L+ SL+IYN I Sbjct: 121 QIPQVFDMWKNIEKSRINEFNYQKIIGMLCEEGLMEEAVRAFQEMEGFALKPSLEIYNSI 180 Query: 1483 IHGFSTKGEFGDAMLFLNELEEASLKPNTETYDGLIQAYGKHKMYDEIGNCVKKMESSGC 1304 IHG+S G+F +A+LFLNE++E +L P ++TYDGLIQAY Sbjct: 181 IHGYSKIGKFNEALLFLNEMKEMNLSPQSDTYDGLIQAY--------------------- 219 Query: 1303 VPDHITYNLLIREFSRGGLIKRMESMYQTLLSKRMVLHPSTMVTMLEAYAKFGLLGKMER 1124 EF+ GL+KRME Y+++L+KRM L STMV +L+AY FG+L KME+ Sbjct: 220 ------------EFACAGLLKRMEGTYKSMLTKRMHLRSSTMVAILDAYMNFGMLDKMEK 267 Query: 1123 IYRKVLNSKMPLKDGLVRKLAGVYIENYMFSRLNDLALDLYSETGMTDLVWCLRLLSHAC 944 Y+++LNS+ PLK+ LVRKLA VYI+NYMFSRL+DL DL S G T+LVWCLRLLSHAC Sbjct: 268 FYKRLLNSRTPLKEDLVRKLAEVYIKNYMFSRLDDLGDDLASRIGRTELVWCLRLLSHAC 327 Query: 943 ILSRKGMDSVVGEMKVEKISWNVTVANIILLAYMKMKDFKHLSILLTELPFHRVNADLVT 764 +LS +G+DSVV EM+ K+ WNVT ANIILLAY+KMKDFKHL +LL+ELP V D+VT Sbjct: 328 LLSHRGIDSVVREMESAKVRWNVTTANIILLAYLKMKDFKHLRVLLSELPTRHVKPDIVT 387 Query: 763 VGIVFDANKIGFDGTGALNTWRKSGVLSQAVEMNTDPLVLTAFGKGDFLRSCEERYSSLE 584 +GI++DA +IGFDGTGAL WR+ G LS+ VE+NTDPLVL +GKG FLR CEE YSSLE Sbjct: 388 IGILYDARRIGFDGTGALEMWRRIGFLSKTVEINTDPLVLAVYGKGHFLRYCEEVYSSLE 447 Query: 583 PKAREKKIWTYQNLIDLVFKPN 518 P +REKK WTYQNLIDLV K N Sbjct: 448 PYSREKKRWTYQNLIDLVIKHN 469 >ref|XP_006855721.1| hypothetical protein AMTR_s00044p00151840 [Amborella trichopoda] gi|548859508|gb|ERN17188.1| hypothetical protein AMTR_s00044p00151840 [Amborella trichopoda] Length = 506 Score = 495 bits (1274), Expect = e-137 Identities = 246/459 (53%), Positives = 326/459 (71%) Frame = -1 Query: 1888 TQYDPSHAIPHNGTFTKRTALLLDTFHEKQRLGSLLEKLNKKGSIPVQILKDEGDWSKYH 1709 T+ +P+ + N +K ALL+ F + Q+L L+EK+ K G P+++L+DEGDW+K Sbjct: 48 TRTNPAIPLEQNPQDSKHRALLVQNFFQTQQLLDLIEKI-KGGIDPLKLLRDEGDWNKDQ 106 Query: 1708 FWTVIRFLQHASRSKEAPQVFDLWTNIEKSRINEFNYSKIIRXXXXXXXXXXAVSALQKM 1529 FW V++ L+ SR KEA QVFD W N+E+SR+++ NY+K+I A + L+++ Sbjct: 107 FWAVMKLLKETSRIKEAMQVFDYWVNVERSRLDDSNYTKMIELLVDAGLMDEATTMLKEV 166 Query: 1528 KSHGLRLSLDIYNLIIHGFSTKGEFGDAMLFLNELEEASLKPNTETYDGLIQAYGKHKMY 1349 K G+R ++ +YN I+HG++ G F A LFL E+ + L P +ETYDGLI+AYG H+MY Sbjct: 167 KDFGVRPTVAVYNFIVHGYANTGNFDKANLFLREMRDLGLVPESETYDGLIRAYGNHRMY 226 Query: 1348 DEIGNCVKKMESSGCVPDHITYNLLIREFSRGGLIKRMESMYQTLLSKRMVLHPSTMVTM 1169 D++ C KKMES G PDH+TYN+LIREF+RGGL+ RME Y+TLLSK+M L ST+V M Sbjct: 227 DDMAKCAKKMESEGFTPDHLTYNILIREFARGGLMVRMEGAYRTLLSKKMGLQYSTLVAM 286 Query: 1168 LEAYAKFGLLGKMERIYRKVLNSKMPLKDGLVRKLAGVYIENYMFSRLNDLALDLYSETG 989 LEAYA G + +ME ++R++L SK+PLK+ LVRK+A YI+N+ FSRL DL L + S+TG Sbjct: 287 LEAYAALGCVNEMETVFRRLLKSKIPLKEDLVRKVARAYIKNHRFSRLEDLGLGVASKTG 346 Query: 988 MTDLVWCLRLLSHACILSRKGMDSVVGEMKVEKISWNVTVANIILLAYMKMKDFKHLSIL 809 TDL WCL LLSHAC+ SRKG+ SV+ EMK + NVT ANI L Y+KMKD ++L +L Sbjct: 347 RTDLFWCLLLLSHACLCSRKGIKSVIQEMKSAMVRPNVTFANITALTYLKMKDVQYLDVL 406 Query: 808 LTELPFHRVNADLVTVGIVFDANKIGFDGTGALNTWRKSGVLSQAVEMNTDPLVLTAFGK 629 L++L VN D+VTVG+V DA GFD AL WRK+G L + VEMNTDPLVLTAFGK Sbjct: 407 LSQLQLLNVNPDIVTVGVVMDAYVSGFDDIKALRMWRKTGFLRRPVEMNTDPLVLTAFGK 466 Query: 628 GDFLRSCEERYSSLEPKAREKKIWTYQNLIDLVFKPNVR 512 G FLRSCEE Y SL K RE+K+WTY +LIDLVF N R Sbjct: 467 GYFLRSCEELYLSLGAKGRERKVWTYNDLIDLVFNQNER 505 >ref|XP_006414780.1| hypothetical protein EUTSA_v10027442mg [Eutrema salsugineum] gi|557115950|gb|ESQ56233.1| hypothetical protein EUTSA_v10027442mg [Eutrema salsugineum] Length = 495 Score = 478 bits (1231), Expect = e-132 Identities = 243/443 (54%), Positives = 314/443 (70%), Gaps = 3/443 (0%) Frame = -1 Query: 1843 TKRTALLLDTFHEKQR-LGSLLEKLNKKGSIPVQILKDEGDWSKYHFWTVIRFLQHASRS 1667 T+ T+LL D++H R L SL +L++ GS P+++L+++GDWSK+ FW V+RFL+H+SR Sbjct: 51 TQSTSLLSDSYHHHHRFLNSLPRRLSRTGSCPLRLLREDGDWSKHQFWAVVRFLRHSSRL 110 Query: 1666 KEAPQVFDLWTNIEKSRINEFNYSKIIRXXXXXXXXXXAVSALQKM-KSHGLRLSLDIYN 1490 E VFD W N+E SRINE NY KI+R A+ A Q M H L SL+IYN Sbjct: 111 HEILPVFDAWKNLEPSRINEANYEKILRFLCEEKSMNEAIRAFQCMIDEHELSPSLEIYN 170 Query: 1489 LIIHGFSTKGEFGDAMLFLNELEEASLKPNTETYDGLIQAYGKHKMYDEIGNCVKKMESS 1310 IIHG++ G+F +AM ++N ++E + P TETYDGLI+AYGK K+YDEI C+KKMES Sbjct: 171 SIIHGYANDGKFEEAMFYMNHMKENDMLPETETYDGLIEAYGKWKLYDEIVLCIKKMESD 230 Query: 1309 GCVPDHITYNLLIREFSRGGLIKRMESMYQTLLSKRMVLHPSTMVTMLEAYAKFGLLGKM 1130 GCV DH+TYNLLIREF+RGGL+KRME MYQ+L+S++M L P T+++MLEAYA+FG+L KM Sbjct: 231 GCVRDHVTYNLLIREFARGGLLKRMEQMYQSLMSRKMTLEPCTLLSMLEAYAEFGVLEKM 290 Query: 1129 ERIYRKVLNSKMPLKDGLVRKLAGVYIENYMFSRLNDLALDLYSETGMTDLVWCLRLLSH 950 E Y K++ + L + LVRK+A VYI+N MFSRL+DL + TDL WCLRLL H Sbjct: 291 EDTYNKIVRFGISLDEDLVRKVANVYIDNLMFSRLDDLGRGI----RRTDLAWCLRLLCH 346 Query: 949 ACILSRKGMDSVVGEMKVEKISWNVTVANIILLAYMKMKDFKHLSILLTELPFHRVNADL 770 AC++SRKG+D VV EM+ ++ WN T ANI+LLAY KM DF+ + +LL+EL V DL Sbjct: 347 ACLVSRKGLDYVVKEMEEARVPWNATFANIVLLAYSKMGDFRSVELLLSELRTKHVKLDL 406 Query: 769 VTVGIVFDANKIGFDGTGALNTWRKSGVLSQAVEMNTDPLVLTAFGKGDFLRSCEE-RYS 593 VTVGIV D + GFDGTG TW+K G L + VE TDPLV AFGKG FLRSCEE + Sbjct: 407 VTVGIVLDLSVDGFDGTGVFMTWKKIGFLDKPVETKTDPLVHAAFGKGRFLRSCEEVKNQ 466 Query: 592 SLEPKAREKKIWTYQNLIDLVFK 524 L + E K WTYQ L++LV K Sbjct: 467 VLGTRVEESKSWTYQYLMELVVK 489 >emb|CAN80932.1| hypothetical protein VITISV_017362 [Vitis vinifera] Length = 1697 Score = 477 bits (1228), Expect = e-132 Identities = 235/366 (64%), Positives = 288/366 (78%) Frame = -1 Query: 1825 LLDTFHEKQRLGSLLEKLNKKGSIPVQILKDEGDWSKYHFWTVIRFLQHASRSKEAPQVF 1646 L++T HE +RLG L++KL+ K S P+Q+L+D+GDW+K HFW VIRFL+ ASRS E VF Sbjct: 1332 LVETLHENERLGVLIQKLSNKASSPLQLLRDDGDWNKQHFWAVIRFLKDASRSSEILPVF 1391 Query: 1645 DLWTNIEKSRINEFNYSKIIRXXXXXXXXXXAVSALQKMKSHGLRLSLDIYNLIIHGFST 1466 LW +++KSRINEFNY+KII +V AL+ MK+HGL+ SL+IYNL+IH F+ Sbjct: 1392 HLWKDMDKSRINEFNYAKIIGLLSQEDLAEESVLALEXMKTHGLKPSLEIYNLVIHCFAR 1451 Query: 1465 KGEFGDAMLFLNELEEASLKPNTETYDGLIQAYGKHKMYDEIGNCVKKMESSGCVPDHIT 1286 KGEF A+ FLNEL+ +L +TETYDGLIQ+YGK+KMYDE+ CVKKMES GC+PDHIT Sbjct: 1452 KGEFDRALYFLNELKXNNLIADTETYDGLIQSYGKYKMYDELDECVKKMESDGCLPDHIT 1511 Query: 1285 YNLLIREFSRGGLIKRMESMYQTLLSKRMVLHPSTMVTMLEAYAKFGLLGKMERIYRKVL 1106 YNLLI+EFSRGGL+KRME ++QT+LSK+M L ST+V MLEAYA FG++ KME YR+VL Sbjct: 1512 YNLLIQEFSRGGLLKRMERVFQTVLSKKMGLQSSTLVVMLEAYANFGIIEKMENAYRRVL 1571 Query: 1105 NSKMPLKDGLVRKLAGVYIENYMFSRLNDLALDLYSETGMTDLVWCLRLLSHACILSRKG 926 NSK LKD L+RKLA VYIENY FSRL D+ LDL S T TDLVWCLRLLSHAC+LSRKG Sbjct: 1572 NSKTSLKDDLIRKLAEVYIENYKFSRLADMGLDLASVTSRTDLVWCLRLLSHACLLSRKG 1631 Query: 925 MDSVVGEMKVEKISWNVTVANIILLAYMKMKDFKHLSILLTELPFHRVNADLVTVGIVFD 746 +DS+V EM+ + + WN TVAN ILLAY+KMKDF L ILL EL V D+VTVGI+FD Sbjct: 1632 LDSIVKEMEAKNVPWNATVANTILLAYLKMKDFTRLRILLLELSTRHVKPDIVTVGILFD 1691 Query: 745 ANKIGF 728 AN+I F Sbjct: 1692 ANRIEF 1697 >ref|XP_002868305.1| binding protein [Arabidopsis lyrata subsp. lyrata] gi|297314141|gb|EFH44564.1| binding protein [Arabidopsis lyrata subsp. lyrata] Length = 502 Score = 476 bits (1225), Expect = e-131 Identities = 246/447 (55%), Positives = 316/447 (70%), Gaps = 3/447 (0%) Frame = -1 Query: 1855 NGTFTKRTALLLDTFHEKQRLGSLLEKLNKKGSIPVQILKDEGDWSKYHFWTVIRFLQHA 1676 NG ++ T+L+ H + L SL +L GS P+++L++ GDWSK HFW VIRFL+H+ Sbjct: 53 NGDASQSTSLI---HHHHRFLSSLPRRLELPGSCPLRLLQEYGDWSKDHFWAVIRFLRHS 109 Query: 1675 SRSKEAPQVFDLWTNIEKSRINEFNYSKIIRXXXXXXXXXXAVSALQKM-KSHGLRLSLD 1499 SR E VFD W N+E+SRI+E NY ++IR A+ A + M H L SL+ Sbjct: 110 SRLHEILPVFDAWKNLERSRISEANYERVIRLLCEEKSMNEAIRAFRGMIDDHELSPSLE 169 Query: 1498 IYNLIIHGFSTKGEFGDAMLFLNELEEASLKPNTETYDGLIQAYGKHKMYDEIGNCVKKM 1319 IYN IIHG++ +G+F +AM +LN ++E L P TETYDGLI+AYGK KMYDEI C+K+M Sbjct: 170 IYNSIIHGYADEGKFEEAMFYLNHMKENGLLPITETYDGLIEAYGKWKMYDEIVLCLKRM 229 Query: 1318 ESSGCVPDHITYNLLIREFSRGGLIKRMESMYQTLLSKRMVLHPSTMVTMLEAYAKFGLL 1139 ES GCV DH+TYNLLIREFSRGGL+KRME MYQ+L+S++M L PST+++MLEAYA+FGL+ Sbjct: 230 ESEGCVRDHVTYNLLIREFSRGGLLKRMEQMYQSLMSRKMTLEPSTLLSMLEAYAEFGLI 289 Query: 1138 GKMERIYRKVLNSKMPLKDGLVRKLAGVYIENYMFSRLNDLALDL-YSETGMTDLVWCLR 962 KME K++ + L +GLVRKLA VYI+N MFSRL+DL + S T TDL WCLR Sbjct: 290 EKMEETCNKIIRFGISLDEGLVRKLANVYIDNLMFSRLDDLGRGISSSRTRRTDLAWCLR 349 Query: 961 LLSHACILSRKGMDSVVGEMKVEKISWNVTVANIILLAYMKMKDFKHLSILLTELPFHRV 782 LL HA ++SRKG+D V+ EMK ++ WN T ANI LLAY KM DFK + +LL+EL V Sbjct: 350 LLCHARLVSRKGLDYVIKEMKEARVPWNTTFANITLLAYSKMGDFKSIELLLSELRTKHV 409 Query: 781 NADLVTVGIVFDANKIGFDGTGALNTWRKSGVLSQAVEMNTDPLVLTAFGKGDFLRSCEE 602 DLVTVGI+FD ++ GFD TG TW+K G L + VEM TDPLV AFGKG FL+SCEE Sbjct: 410 KLDLVTVGIIFDLSEAGFDVTGVFMTWKKIGFLDKPVEMKTDPLVHAAFGKGKFLKSCEE 469 Query: 601 -RYSSLEPKAREKKIWTYQNLIDLVFK 524 + SL + E K WTYQ L+++V K Sbjct: 470 VKNQSLGMRGEESKAWTYQYLMEVVVK 496 >ref|NP_193155.4| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|223635638|sp|O23278.2|PP310_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g14190, chloroplastic; Flags: Precursor gi|332657991|gb|AEE83391.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 501 Score = 473 bits (1218), Expect = e-130 Identities = 249/447 (55%), Positives = 313/447 (70%), Gaps = 3/447 (0%) Frame = -1 Query: 1855 NGTFTKRTALLLDTFHEKQRLGSLLEKLNKKGSIPVQILKDEGDWSKYHFWTVIRFLQHA 1676 NG T+ T+LL H + L SL +L+ GS P+++L+++GDWSK HFW VIRFL+ + Sbjct: 53 NGDATQPTSLL----HHHRFLSSLTRRLSLSGSCPLRLLQEDGDWSKDHFWAVIRFLRQS 108 Query: 1675 SRSKEAPQVFDLWTNIEKSRINEFNYSKIIRXXXXXXXXXXAVSALQKM-KSHGLRLSLD 1499 SR E VFD W N+E SRI+E NY +IIR A+ A + M H L SL+ Sbjct: 109 SRLHEILPVFDTWKNLEPSRISENNYERIIRFLCEEKSMSEAIRAFRSMIDDHELSPSLE 168 Query: 1498 IYNLIIHGFSTKGEFGDAMLFLNELEEASLKPNTETYDGLIQAYGKHKMYDEIGNCVKKM 1319 IYN IIH ++ G+F +AM +LN ++E L P TETYDGLI+AYGK KMYDEI C+K+M Sbjct: 169 IYNSIIHSYADDGKFEEAMFYLNHMKENGLLPITETYDGLIEAYGKWKMYDEIVLCLKRM 228 Query: 1318 ESSGCVPDHITYNLLIREFSRGGLIKRMESMYQTLLSKRMVLHPSTMVTMLEAYAKFGLL 1139 ES GCV DH+TYNLLIREFSRGGL+KRME MYQ+L+S++M L PST+++MLEAYA+FGL+ Sbjct: 229 ESDGCVRDHVTYNLLIREFSRGGLLKRMEQMYQSLMSRKMTLEPSTLLSMLEAYAEFGLI 288 Query: 1138 GKMERIYRKVLNSKMPLKDGLVRKLAGVYIENYMFSRLNDLALDL-YSETGMTDLVWCLR 962 KME K++ + L +GLVRKLA VYIEN MFSRL+DL + S T T+L WCLR Sbjct: 289 EKMEETCNKIIRFGISLDEGLVRKLANVYIENLMFSRLDDLGRGISASRTRRTELAWCLR 348 Query: 961 LLSHACILSRKGMDSVVGEMKVEKISWNVTVANIILLAYMKMKDFKHLSILLTELPFHRV 782 LL HA ++SRKG+D VV EM+ ++ WN T ANI LLAY KM DF + +LL+EL V Sbjct: 349 LLCHARLVSRKGLDYVVKEMEEARVPWNTTFANIALLAYSKMGDFTSIELLLSELRIKHV 408 Query: 781 NADLVTVGIVFDANKIGFDGTGALNTWRKSGVLSQAVEMNTDPLVLTAFGKGDFLRSCEE 602 DLVTVGIVFD ++ FDGTG TW+K G L + VEM TDPLV AFGKG FLRSCEE Sbjct: 409 KLDLVTVGIVFDLSEARFDGTGVFMTWKKIGFLDKPVEMKTDPLVHAAFGKGQFLRSCEE 468 Query: 601 -RYSSLEPKAREKKIWTYQNLIDLVFK 524 + SL + E K WTYQ L++LV K Sbjct: 469 VKNQSLGTRDGESKSWTYQYLMELVVK 495 >ref|XP_006283572.1| hypothetical protein CARUB_v10004636mg [Capsella rubella] gi|482552277|gb|EOA16470.1| hypothetical protein CARUB_v10004636mg [Capsella rubella] Length = 501 Score = 464 bits (1193), Expect = e-128 Identities = 239/444 (53%), Positives = 309/444 (69%), Gaps = 2/444 (0%) Frame = -1 Query: 1855 NGTFTKRTALLLDTFHEKQRLGSLLEKLNKKGSIPVQILKDEGDWSKYHFWTVIRFLQHA 1676 NG T + LL H++ + SL +L GS P+Q+L+++GDWSK HFW VIRFL+H+ Sbjct: 57 NGYATTQPTSLLHHHHQRF-ISSLPRRLGLPGSCPLQLLQEDGDWSKDHFWAVIRFLRHS 115 Query: 1675 SRSKEAPQVFDLWTNIEKSRINEFNYSKIIRXXXXXXXXXXAVSALQKM-KSHGLRLSLD 1499 SR E V+D W N+E SRI+ NY ++IR A+ A + M L SL+ Sbjct: 116 SRLHEILPVYDAWKNLEPSRISVVNYERVIRFLCEERSMNEAIRAFRSMIDDDELSPSLE 175 Query: 1498 IYNLIIHGFSTKGEFGDAMLFLNELEEASLKPNTETYDGLIQAYGKHKMYDEIGNCVKKM 1319 IYN IIHG++ G+F +AM +LN+++E L P +ETYDGLI+AYGK KMYDEI CV++M Sbjct: 176 IYNSIIHGYADDGKFEEAMFYLNQMKENGLSPISETYDGLIEAYGKWKMYDEIVLCVRRM 235 Query: 1318 ESSGCVPDHITYNLLIREFSRGGLIKRMESMYQTLLSKRMVLHPSTMVTMLEAYAKFGLL 1139 ES GCV DH+TYNLLIR+FSRGGL+KRME MYQ+L+S++M L P T+++MLEAYA+FG++ Sbjct: 236 ESDGCVRDHVTYNLLIRQFSRGGLLKRMEQMYQSLMSRKMTLEPCTLLSMLEAYAEFGVI 295 Query: 1138 GKMERIYRKVLNSKMPLKDGLVRKLAGVYIENYMFSRLNDLALDL-YSETGMTDLVWCLR 962 KME K++ + L DGLVRKLA VYI+N MFSRL+DL + YS T +DL WCLR Sbjct: 296 EKMEETCNKIIRFGISLDDGLVRKLAKVYIDNLMFSRLDDLGRGISYSRTRRSDLAWCLR 355 Query: 961 LLSHACILSRKGMDSVVGEMKVEKISWNVTVANIILLAYMKMKDFKHLSILLTELPFHRV 782 LL H+ ++SRKG+D V+ EM K++WN T ANI+LLAY KM DFK + +LL L RV Sbjct: 356 LLCHSRLVSRKGLDYVLKEMTEAKVTWNTTFANIVLLAYSKMGDFKSIELLLDGLRTKRV 415 Query: 781 NADLVTVGIVFDANKIGFDGTGALNTWRKSGVLSQAVEMNTDPLVLTAFGKGDFLRSCEE 602 DLVTVGIVFD ++ GFDGTG TW+K G L + VEM TDPLV AFGKG FLR CEE Sbjct: 416 KLDLVTVGIVFDLSEAGFDGTGVFMTWKKIGFLDKPVEMKTDPLVHAAFGKGQFLRRCEE 475 Query: 601 RYSSLEPKAREKKIWTYQNLIDLV 530 + + WTYQNL++LV Sbjct: 476 M------RGEDPTPWTYQNLMELV 493 >emb|CAB10198.1| salt-inducible protein homolog [Arabidopsis thaliana] gi|7268124|emb|CAB78461.1| salt-inducible protein homolog [Arabidopsis thaliana] Length = 561 Score = 444 bits (1142), Expect = e-122 Identities = 235/420 (55%), Positives = 290/420 (69%), Gaps = 16/420 (3%) Frame = -1 Query: 1735 DEGDWSKYHFWTVIRFLQHASRSKEAP-------------QVFDLWTNIEKSRINEFNYS 1595 ++GDWSK HFW VIRFL+ +SR E QVFD W N+E SRI+E NY Sbjct: 136 EDGDWSKDHFWAVIRFLRQSSRLHEILPNMKMTFCFFFQLQVFDTWKNLEPSRISENNYE 195 Query: 1594 KIIRXXXXXXXXXXAVSALQKM-KSHGLRLSLDIYNLIIHGFSTKGEFGDAMLFLNELEE 1418 +IIR A+ A + M H L SL+IYN IIH ++ G+F +AM +LN ++E Sbjct: 196 RIIRFLCEEKSMSEAIRAFRSMIDDHELSPSLEIYNSIIHSYADDGKFEEAMFYLNHMKE 255 Query: 1417 ASLKPNTETYDGLIQAYGKHKMYDEIGNCVKKMESSGCVPDHITYNLLIREFSRGGLIKR 1238 L P TETYDGLI+AYGK KMYDEI C+K+MES GCV DH+TYNLLIREFSRGGL+KR Sbjct: 256 NGLLPITETYDGLIEAYGKWKMYDEIVLCLKRMESDGCVRDHVTYNLLIREFSRGGLLKR 315 Query: 1237 MESMYQTLLSKRMVLHPSTMVTMLEAYAKFGLLGKMERIYRKVLNSKMPLKDGLVRKLAG 1058 ME MYQ+L+S++M L PST+++MLEAYA+FGL+ KME K++ + L +GLVRKLA Sbjct: 316 MEQMYQSLMSRKMTLEPSTLLSMLEAYAEFGLIEKMEETCNKIIRFGISLDEGLVRKLAN 375 Query: 1057 VYIENYMFSRLNDLALDL-YSETGMTDLVWCLRLLSHACILSRKGMDSVVGEMKVEKISW 881 VYIEN MFSRL+DL + S T T+L WCLRLL HA ++SRKG+D VV EM+ ++ W Sbjct: 376 VYIENLMFSRLDDLGRGISASRTRRTELAWCLRLLCHARLVSRKGLDYVVKEMEEARVPW 435 Query: 880 NVTVANIILLAYMKMKDFKHLSILLTELPFHRVNADLVTVGIVFDANKIGFDGTGALNTW 701 N T ANI LLAY KM DF + +LL+EL V DLVTVGIVFD ++ FDGTG TW Sbjct: 436 NTTFANIALLAYSKMGDFTSIELLLSELRIKHVKLDLVTVGIVFDLSEARFDGTGVFMTW 495 Query: 700 RKSGVLSQAVEMNTDPLVLTAFGKGDFLRSCEE-RYSSLEPKAREKKIWTYQNLIDLVFK 524 +K G L + VEM TDPLV AFGKG FLRSCEE + SL + E K WTYQ L++LV K Sbjct: 496 KKIGFLDKPVEMKTDPLVHAAFGKGQFLRSCEEVKNQSLGTRDGESKSWTYQYLMELVVK 555 >gb|ACU28207.1| At4g14190-like protein [Arabidopsis thaliana] gi|255685446|gb|ACU28212.1| At4g14190-like protein [Arabidopsis thaliana] gi|255685448|gb|ACU28213.1| At4g14190-like protein [Arabidopsis thaliana] gi|255685450|gb|ACU28214.1| At4g14190-like protein [Arabidopsis thaliana] gi|255685452|gb|ACU28215.1| At4g14190-like protein [Arabidopsis thaliana] gi|255685458|gb|ACU28218.1| At4g14190-like protein [Arabidopsis thaliana] gi|255685460|gb|ACU28219.1| At4g14190-like protein [Arabidopsis thaliana] gi|255685466|gb|ACU28222.1| At4g14190-like protein [Arabidopsis thaliana] gi|255685470|gb|ACU28224.1| At4g14190-like protein [Arabidopsis thaliana] gi|255685474|gb|ACU28226.1| At4g14190-like protein [Arabidopsis thaliana] gi|255685478|gb|ACU28228.1| At4g14190-like protein [Arabidopsis thaliana] gi|255685492|gb|ACU28235.1| At4g14190-like protein [Arabidopsis thaliana] gi|255685498|gb|ACU28238.1| At4g14190-like protein [Arabidopsis thaliana] gi|255685500|gb|ACU28239.1| At4g14190-like protein [Arabidopsis thaliana] gi|255685502|gb|ACU28240.1| At4g14190-like protein [Arabidopsis thaliana] gi|255685508|gb|ACU28243.1| At4g14190-like protein [Arabidopsis thaliana] gi|255685510|gb|ACU28244.1| At4g14190-like protein [Arabidopsis thaliana] gi|255685512|gb|ACU28245.1| At4g14190-like protein [Arabidopsis thaliana] gi|255685516|gb|ACU28247.1| At4g14190-like protein [Arabidopsis thaliana] gi|255685518|gb|ACU28248.1| At4g14190-like protein [Arabidopsis thaliana] gi|255685520|gb|ACU28249.1| At4g14190-like protein [Arabidopsis thaliana] Length = 177 Score = 196 bits (499), Expect = 3e-47 Identities = 101/177 (57%), Positives = 129/177 (72%), Gaps = 1/177 (0%) Frame = -1 Query: 1249 LIKRMESMYQTLLSKRMVLHPSTMVTMLEAYAKFGLLGKMERIYRKVLNSKMPLKDGLVR 1070 L+KRME MYQ+L+S++M L PST+++MLEAYA+FGL+ KME K++ + L +GLVR Sbjct: 1 LLKRMEQMYQSLMSRKMTLEPSTLLSMLEAYAEFGLIEKMEETCNKIIRFGISLDEGLVR 60 Query: 1069 KLAGVYIENYMFSRLNDLALDLY-SETGMTDLVWCLRLLSHACILSRKGMDSVVGEMKVE 893 KLA VYIEN MFSRL+DL + S T T+L WCLRLL HA ++SRKG+D VV EM+ Sbjct: 61 KLANVYIENLMFSRLDDLGRGISASRTRRTELAWCLRLLCHARLVSRKGLDYVVKEMEEA 120 Query: 892 KISWNVTVANIILLAYMKMKDFKHLSILLTELPFHRVNADLVTVGIVFDANKIGFDG 722 ++ WN T ANI LLAY KM DF + +LL+EL V DLVTVGIVFD ++ FDG Sbjct: 121 RVPWNTTFANIALLAYSKMGDFTSIELLLSELRIKHVKLDLVTVGIVFDLSEARFDG 177