BLASTX nr result
ID: Catharanthus23_contig00002642
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00002642
         (2484 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006362890.1| PREDICTED: pentatricopeptide repeat-containi...   620   e-175
ref|XP_004251386.1| PREDICTED: pentatricopeptide repeat-containi...   616   e-173
emb|CAN72416.1| hypothetical protein VITISV_027905 [Vitis vinifera]   595   e-167
gb|EOX95584.1| Pentatricopeptide repeat-containing protein, mito...   590   e-166
ref|XP_006491485.1| PREDICTED: conserved oligomeric Golgi comple...   588   e-165
gb|EMJ20615.1| hypothetical protein PRUPE_ppa021922mg [Prunus pe...   580   e-162
ref|XP_003626608.1| Pentatricopeptide repeat-containing protein ...   578   e-162
gb|ESW11013.1| hypothetical protein PHAVU_009G258200g, partial [...   577   e-162
ref|XP_002320901.2| hypothetical protein POPTR_0014s10150g [Popu...   575   e-161
gb|EPS71710.1| hypothetical protein M569_03047, partial [Genlise...   560   e-156
gb|AHB18410.1| pentatricopeptide repeat-containing protein [Goss...   551   e-154
ref|XP_006444724.1| hypothetical protein CICLE_v10023955mg, part...   548   e-153
gb|EXC13666.1| hypothetical protein L484_019627 [Morus notabilis]     536   e-149
ref|XP_004494974.1| PREDICTED: conserved oligomeric Golgi comple...   521   e-145
ref|XP_004138384.1| PREDICTED: pentatricopeptide repeat-containi...   518   e-144
ref|NP_001154199.1| uncharacterized protein [Arabidopsis thalian...   516   e-143
gb|AAC19289.1| contains similarity to Arabidopsis membrane-assoc...   516   e-143
ref|XP_006605274.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...
491 e-136 ref|XP_004308275.1| PREDICTED: uncharacterized protein LOC101307... 454 e-125 ref|XP_006837400.1| hypothetical protein AMTR_s00111p00140430 [A... 446 e-122 >ref|XP_006362890.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like [Solanum tuberosum] Length = 479 Score = 620 bits (1599), Expect = e-175 Identities = 315/477 (66%), Positives = 368/477 (77%), Gaps = 19/477 (3%) Frame = +3 Query: 309 LRSIANHCSLPQLFA--SCSSTQLK--------------HHPQESHQEKQRKEE--QHLK 434 +R +++H S L A CSST L +H Q+ Q+++R++E +H + Sbjct: 1 MRMLSHHFSSKDLLALVMCSSTWLSKVEPLSAWYKFKSHYHTQQPEQDRKRRQEDEEHKQ 60 Query: 435 RKEES-SIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHF 611 + SIGSPAR+ KLIA QSDPLLAKEIFDLASR+P+F+H YATFH+LILKLGRS F Sbjct: 61 NMNQGPSIGSPARVQKLIASQSDPLLAKEIFDLASREPDFQHSYATFHTLILKLGRSRQF 120 Query: 612 PLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRV 791 LMQ +Y ISPSLFS+IIQIYGDAGLP +ALKTFY IL FNMKPLPKHLN + Sbjct: 121 SLMQSVFSSLKSQHYSISPSLFSRIIQIYGDAGLPDKALKTFYTILEFNMKPLPKHLNLI 180 Query: 792 LDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRD 971 L+ILV HRNFLRPA DLFR AH YGV NT SYNILMRAFCLNDDLSIAYSLFNQM KR+ Sbjct: 181 LEILVTHRNFLRPAFDLFRSAHTYGVLANTESYNILMRAFCLNDDLSIAYSLFNQMFKRE 240 Query: 972 VVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAY 1151 + P++ESYRIL+QG CRKSQVN AVDLLEDMLNKGFVPD SYSTLLNSLCRKK K AY Sbjct: 241 ISPNVESYRILMQGLCRKSQVNTAVDLLEDMLNKGFVPDALSYSTLLNSLCRKKKFKEAY 300 Query: 1152 KLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIGGL 1331 KLLCRMK+KGCNPDIVHYNTVILGFCREGRA DACK+LEDMP NGCLPNLVSY+TL+GGL Sbjct: 301 KLLCRMKVKGCNPDIVHYNTVILGFCREGRAADACKILEDMPSNGCLPNLVSYRTLVGGL 360 Query: 1332 SDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTPHI 1511 S+QG+YDEA+ Y EM+S+GF+PHFSVV+ +VKGFCN+GK+EEACGV L HG+ H Sbjct: 361 SNQGMYDEAKNYMVEMMSKGFSPHFSVVHTVVKGFCNLGKIEEACGVAGSILSHGEPLHT 420 Query: 1512 DTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKKKLTRSK 1682 DTW EI+ RI E D E L E+++ E+KP RIVE A L EYL+ 
++S+ Sbjct: 421 DTWEEIVSRILEWDAAEKIGNTLVELIQAEIKPEMRIVEAGARLGEYLMNSIKSKSR 477 >ref|XP_004251386.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like [Solanum lycopersicum] Length = 479 Score = 616 bits (1589), Expect = e-173 Identities = 313/477 (65%), Positives = 361/477 (75%), Gaps = 19/477 (3%) Frame = +3 Query: 309 LRSIANHCSLPQLFA--SCSSTQL------------KHH-----PQESHQEKQRKEEQHL 431 +R +++H S L CSS +L K H P++ +++Q EE Sbjct: 1 MRMLSHHFSSKDLLVLVMCSSARLSKAEPLSAWYKFKSHYHTQQPEQDRKQRQADEEHKQ 60 Query: 432 KRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHF 611 + SIGSPAR+ KLIA QSDPLLAKEIFDLASR+P+F+H YATFH+LILKLGRS F Sbjct: 61 NTNQGPSIGSPARVQKLIASQSDPLLAKEIFDLASREPDFQHSYATFHTLILKLGRSRQF 120 Query: 612 PLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRV 791 LMQ +Y ISPSLFS IIQIYGDAGLP ALKTFY IL FNMKPLPKHLN + Sbjct: 121 SLMQSVLSSLKSQHYSISPSLFSHIIQIYGDAGLPDRALKTFYTILEFNMKPLPKHLNLI 180 Query: 792 LDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRD 971 L+ILV HRNFLRPA DLFR AH YGV NT SYNILMRAFCLNDDLSIAYSLFNQM KR+ Sbjct: 181 LEILVTHRNFLRPAFDLFRSAHTYGVLANTESYNILMRAFCLNDDLSIAYSLFNQMFKRE 240 Query: 972 VVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAY 1151 + P++ESYRIL+QG CRKSQVN AVDLLEDMLNKGFVPD SYSTLLNSLCRKK K AY Sbjct: 241 ISPNVESYRILMQGLCRKSQVNTAVDLLEDMLNKGFVPDALSYSTLLNSLCRKKKFKEAY 300 Query: 1152 KLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIGGL 1331 KLLCRMK+KGCNPDIVHYNTVILGFCREGRA DACK+LEDMP NGCLPNLVSY+TL+GGL Sbjct: 301 KLLCRMKVKGCNPDIVHYNTVILGFCREGRAADACKILEDMPSNGCLPNLVSYRTLVGGL 360 Query: 1332 SDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTPHI 1511 SDQG+YDEA+ Y EM+S+GF+PHFSVV+ +VKGFCN+GK+EEACGV L HG+ H Sbjct: 361 SDQGMYDEAKNYMVEMMSKGFSPHFSVVHAVVKGFCNLGKIEEACGVAGSILSHGEPLHT 420 Query: 1512 DTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKKKLTRSK 1682 DTW EI+ I E D E L ++++ E+KP TRIVE A L EYL+ ++S+ Sbjct: 421 DTWEEIVSIILEWDAAEKIGNTLVQLIQAEIKPETRIVEAGARLGEYLMNNIKSKSR 477 >emb|CAN72416.1| 
hypothetical protein VITISV_027905 [Vitis vinifera] Length = 422 Score = 595 bits (1533), Expect = e-167 Identities = 288/421 (68%), Positives = 340/421 (80%) Frame = +3 Query: 420 EQHLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGR 599 E H+K S IGSP+R+ KLIA QSDPLLAKEIFDLAS QPNF+H Y++FH LILKLG Sbjct: 3 EPHVK---PSPIGSPSRVQKLIASQSDPLLAKEIFDLASLQPNFKHSYSSFHILILKLGW 59 Query: 600 SGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKH 779 + F LMQ Y I+PSLFS II+IYG+A LP +ALKTF+ +L F+ KPLPKH Sbjct: 60 ARQFSLMQDLLMRLKSEQYSINPSLFSDIIEIYGEANLPDQALKTFHSMLQFHSKPLPKH 119 Query: 780 LNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQM 959 LN +L +LV+HRN++RPA DLF+ AH+YGVSP+T SYNILM AFC N DLSIAY+LFNQM Sbjct: 120 LNXLLQLLVSHRNYIRPAFDLFKSAHRYGVSPDTKSYNILMSAFCFNGDLSIAYTLFNQM 179 Query: 960 SKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHL 1139 KRDV PD+ESYRIL+QG CRKSQVN+AVDLLEDMLNKG+VPD SY+TLLNSLCRKK L Sbjct: 180 FKRDVAPDVESYRILMQGLCRKSQVNRAVDLLEDMLNKGYVPDALSYTTLLNSLCRKKKL 239 Query: 1140 KAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTL 1319 K AYKLLCRMK+KGCNPDIVHYNTVILGFCREGR LDACKVLEDMP NGC PNL+SY TL Sbjct: 240 KEAYKLLCRMKVKGCNPDIVHYNTVILGFCREGRXLDACKVLEDMPSNGCSPNLMSYGTL 299 Query: 1320 IGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQ 1499 + GL DQGLYDEA+ Y EMLS+GF+PHFSV + L+ GFCN+GKLEEAC VL E LRHG+ Sbjct: 300 VSGLCDQGLYDEAKNYVEEMLSKGFSPHFSVFHALINGFCNVGKLEEACEVLXEMLRHGE 359 Query: 1500 TPHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKKKLTRS 1679 H +TW+ I+PRI EVD+ + I +E LK+E+ P+TR+VE GLEEY+I+K +S Sbjct: 360 AXHTETWVAIIPRICEVDKLVRMENIFDEXLKLEITPNTRLVEAGIGLEEYVIRKVRDKS 419 Query: 1680 K 1682 + Sbjct: 420 R 420 >gb|EOX95584.1| Pentatricopeptide repeat-containing protein, mitochondrial [Theobroma cacao] Length = 461 Score = 590 bits (1522), Expect = e-166 Identities = 287/421 (68%), Positives = 343/421 (81%) Frame = +3 Query: 420 EQHLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGR 599 +Q R S+IGSPAR+ KLI+ QSDPLLAKEIFD AS 
Q FRH Y++F LILKLGR Sbjct: 39 KQQPPRTCTSAIGSPARVPKLISAQSDPLLAKEIFDYASNQLGFRHSYSSFLVLILKLGR 98 Query: 600 SGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKH 779 S HF L+ YP++P+LFS +I+IY +A LP ALKTFY++L FN+KPLPKH Sbjct: 99 SKHFSLVDDLLIRLKTDRYPVTPTLFSYLIKIYAEANLPERALKTFYKMLEFNIKPLPKH 158 Query: 780 LNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQM 959 LNR+L++LV+HRNFL PA DLF+ AHK+GV PNT SYNILM AFCLN DLS+AY LFN+M Sbjct: 159 LNRILELLVSHRNFLMPAFDLFKNAHKHGVLPNTKSYNILMGAFCLNGDLSVAYKLFNKM 218 Query: 960 SKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHL 1139 +RDVVPD+ESYRIL+QG CRKSQVN AVDLLED+LNKGF+PD+ SY+TLLNSLCRKK L Sbjct: 219 FERDVVPDVESYRILMQGLCRKSQVNTAVDLLEDILNKGFIPDSLSYTTLLNSLCRKKKL 278 Query: 1140 KAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTL 1319 + AYKLLCRMK+KGCNPD+VHYNTVILGFCREGRALDA KVLEDMP NGCLPNLVSY+TL Sbjct: 279 REAYKLLCRMKVKGCNPDLVHYNTVILGFCREGRALDAVKVLEDMPSNGCLPNLVSYRTL 338 Query: 1320 IGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQ 1499 IGGL DQG++DEA+KY EML +GF+PHFSV + LVKGFCN+GK+EEA GV E L++G+ Sbjct: 339 IGGLCDQGMFDEAKKYMEEMLIKGFSPHFSVSHTLVKGFCNVGKIEEAIGVFGEMLKYGE 398 Query: 1500 TPHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKKKLTRS 1679 PH+DTW+ I+PRI E E E IL EV+K+E+K TRIV+ GLE+YLI+K +RS Sbjct: 399 VPHMDTWVLIIPRICEDYETERMGEILEEVMKVEIKRDTRIVDAGTGLEDYLIRKIRSRS 458 Query: 1680 K 1682 K Sbjct: 459 K 459 >ref|XP_006491485.1| PREDICTED: conserved oligomeric Golgi complex subunit 4-like [Citrus sinensis] Length = 1352 Score = 588 bits (1515), Expect = e-165 Identities = 281/436 (64%), Positives = 350/436 (80%) Frame = +3 Query: 387 QESHQEKQRKEEQHLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYA 566 QES ++++E + + S IGSP R+ KLIA QSDPLLAKEIFD ASRQPNFRH + Sbjct: 38 QESPSSPEQQQESSISNSK-SPIGSPCRVQKLIASQSDPLLAKEIFDYASRQPNFRHSNS 96 Query: 567 TFHSLILKLGRSGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRI 746 T+ LILKLGR+ +F L+ +YP++PSLF+ +I+IY ++ LP ALKTF + Sbjct: 97 
TYLILILKLGRAKYFSLIDDILITLKSEHYPVTPSLFTYLIKIYAESNLPDRALKTFRSM 156 Query: 747 LHFNMKPLPKHLNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDD 926 L FN KPLPK LNR+L++LV HRN+LRPA DLF+ AHK+GV PNT SYNI+MRAFC N D Sbjct: 157 LEFNCKPLPKQLNRILELLVTHRNYLRPAFDLFKSAHKHGVLPNTKSYNIMMRAFCFNGD 216 Query: 927 LSIAYSLFNQMSKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYST 1106 +SIAY+LFN+M +R V+PD+ESYRIL+QG CRKSQVN+AVDLLEDMLNKGFVPDT SY+T Sbjct: 217 ISIAYTLFNKMFERGVMPDVESYRILMQGLCRKSQVNRAVDLLEDMLNKGFVPDTLSYTT 276 Query: 1107 LLNSLCRKKHLKAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNG 1286 LLNSLCRKK L+ AYKLLCRMK+KGCNPDIVHYNTV+LGFCREGRA+DACKVLEDMP NG Sbjct: 277 LLNSLCRKKKLREAYKLLCRMKVKGCNPDIVHYNTVVLGFCREGRAIDACKVLEDMPSNG 336 Query: 1287 CLPNLVSYQTLIGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEAC 1466 CLPNLVSY+TL+GGL DQG++D A+KY M+S+GF+PHFSV + L+KGFCN+GK++EAC Sbjct: 337 CLPNLVSYRTLVGGLCDQGMFDVAKKYMQLMISKGFSPHFSVSHALIKGFCNVGKVDEAC 396 Query: 1467 GVLEEFLRHGQTPHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLE 1646 GVLEE L+ G+ PH DTW+ I+P+I +E E +LNE++K+E+K TRIVE GLE Sbjct: 397 GVLEELLKAGEAPHEDTWVMIVPQICAGEEMEKLGEVLNEIVKVEIKGDTRIVEAGIGLE 456 Query: 1647 EYLIKKKLTRSKNK*F 1694 +YLI K +R + + F Sbjct: 457 DYLIGKTRSRPRREKF 472 >gb|EMJ20615.1| hypothetical protein PRUPE_ppa021922mg [Prunus persica] Length = 465 Score = 580 bits (1494), Expect = e-162 Identities = 277/415 (66%), Positives = 338/415 (81%) Frame = +3 Query: 420 EQHLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGR 599 + H + E SIGSP+RI LIA QSDPLLAKEIFDLA+RQP+FRH Y++F +LILKLGR Sbjct: 42 QPHNQNHEIGSIGSPSRIQNLIASQSDPLLAKEIFDLAARQPHFRHSYSSFFTLILKLGR 101 Query: 600 SGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKH 779 S +F L+ NY +SP+LF+ +I+IYG+A LP +AL+TFY ++ F+ +P KH Sbjct: 102 SKYFSLVDDLLIRLKTQNYSVSPALFAHLIKIYGEANLPQKALRTFYTMVEFDCRPSVKH 161 Query: 780 LNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQM 959 LNR+L ILV+HRNFLRPA D+F+ AH++GV PNT SYNILMRAFCLN DLSIAY LFN+M Sbjct: 162 
LNRILQILVSHRNFLRPAFDVFKDAHRHGVMPNTQSYNILMRAFCLNGDLSIAYQLFNKM 221 Query: 960 SKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHL 1139 +RD+VPD++SYRIL+QG CRK QVN AVD LEDMLNKGFVPD+ SY++LLNSLCRKK L Sbjct: 222 FERDLVPDVQSYRILMQGLCRKGQVNTAVDFLEDMLNKGFVPDSLSYTSLLNSLCRKKKL 281 Query: 1140 KAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTL 1319 + AYKLLCRMK+KGCNPDIVHYNTVILGFCREGR +DACKVLEDM NGCLPNLVSY+TL Sbjct: 282 REAYKLLCRMKVKGCNPDIVHYNTVILGFCREGRPVDACKVLEDMASNGCLPNLVSYRTL 341 Query: 1320 IGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQ 1499 + GL D G+ DEA+ Y M+SRGF+PHFSVV+ LVKGFCN+G++EEA VLEE L+HG+ Sbjct: 342 VSGLCDHGMLDEAKSYMETMISRGFSPHFSVVHALVKGFCNVGRVEEAFAVLEEVLKHGE 401 Query: 1500 TPHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKK 1664 PH DTW+ I+P I E E E + IL EV+K+E++P+TRIVE GLE+YLIKK Sbjct: 402 VPHTDTWLTIVPGICEEIELERLEEILREVMKVEIRPNTRIVEAAIGLEDYLIKK 456 >ref|XP_003626608.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|87240852|gb|ABD32710.1| Tetratricopeptide-like helical [Medicago truncatula] gi|355501623|gb|AES82826.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 451 Score = 578 bits (1490), Expect = e-162 Identities = 282/420 (67%), Positives = 337/420 (80%), Gaps = 1/420 (0%) Frame = +3 Query: 426 HLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSG 605 H S IGSP R+ KLIA QSDPLLAKEIFD AS QPNFRH Y+T+ LILK GRS Sbjct: 30 HSSSSSSSPIGSPTRVQKLIASQSDPLLAKEIFDYASLQPNFRHNYSTYLILILKFGRSK 89 Query: 606 HFPLMQXXXXXXXXXN-YPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHL 782 HF L+ + PI+P+LFS +I+IYG+A LP +AL TFY +L FN+KPL KHL Sbjct: 90 HFSLLDDLLRRLKSESSQPITPTLFSYLIKIYGEANLPDKALNTFYIMLQFNIKPLTKHL 149 Query: 783 NRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMS 962 NR+LDILV+HRN+LRPA DLF+ AHK+GV P+T SYNILMRAFCLN D+SIAY+LFN+M Sbjct: 150 NRILDILVSHRNYLRPAFDLFKDAHKHGVFPDTKSYNILMRAFCLNGDISIAYTLFNKMF 209 Query: 963 KRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLK 1142 
KRDVVPDI+SYRIL+Q CRKSQVN AVDL EDMLNKGFVPD+++Y+TLLNSLCRKK L+ Sbjct: 210 KRDVVPDIQSYRILMQALCRKSQVNGAVDLFEDMLNKGFVPDSFTYTTLLNSLCRKKKLR 269 Query: 1143 AAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLI 1322 AYKLLCRMK+KGCNPDIVHYNTVILGFCREGRA DACKV++DM NGCLPNLVSY+TL+ Sbjct: 270 EAYKLLCRMKVKGCNPDIVHYNTVILGFCREGRAHDACKVIDDMQANGCLPNLVSYRTLV 329 Query: 1323 GGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQT 1502 GL G+ DEA KY EMLS+GF+PHF+V++ LVKGFCN+G++EEACGVL + L H + Sbjct: 330 NGLCHLGMLDEATKYVEEMLSKGFSPHFAVIHALVKGFCNVGRIEEACGVLTKSLEHREA 389 Query: 1503 PHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKKKLTRSK 1682 PH DTWM I+P+I EVD+ D +L EVLKIE+K TRIV+ GLE+YLI+K +S+ Sbjct: 390 PHKDTWMIIVPQICEVDDGVKIDGVLEEVLKIEIKGDTRIVDAGIGLEDYLIRKIRAKSR 449 >gb|ESW11013.1| hypothetical protein PHAVU_009G258200g, partial [Phaseolus vulgaris] Length = 418 Score = 577 bits (1487), Expect = e-162 Identities = 280/409 (68%), Positives = 330/409 (80%) Frame = +3 Query: 456 GSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHFPLMQXXXX 635 GSP R+ KLIA QSDPLLAKEIFD+ASRQPNFRH Y+T+ LILKLGRS +F + Sbjct: 8 GSPTRVQKLIASQSDPLLAKEIFDVASRQPNFRHTYSTYLILILKLGRSKNFSFIDHLLR 67 Query: 636 XXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRVLDILVNHR 815 + PI+P+LF+ +I++Y +A LP +ALKTFY ILHF+ KPLPKHLNR+L++LV+HR Sbjct: 68 CLRSDSQPITPTLFTYLIRVYAEADLPEKALKTFYNILHFDCKPLPKHLNRILELLVSHR 127 Query: 816 NFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRDVVPDIESY 995 N++RPA LF+ AH+YGV PNT SYNILMRAFCLN D+SIAYSLFN+M KRDVVPDIESY Sbjct: 128 NYIRPAFLLFKDAHRYGVEPNTKSYNILMRAFCLNGDISIAYSLFNKMFKRDVVPDIESY 187 Query: 996 RILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAYKLLCRMKL 1175 RIL+Q CRKSQVN AVDLLEDMLNKGFVPD+ +Y+TLLNSLCRKK L+ AYKLLCRMK+ Sbjct: 188 RILMQALCRKSQVNGAVDLLEDMLNKGFVPDSLTYTTLLNSLCRKKKLREAYKLLCRMKV 247 Query: 1176 KGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIGGLSDQGLYDE 1355 KGCNPDIVHYNTVILGFCREGRA DACKV+ DM NGCLPNLVSY+TL GL D G+ DE Sbjct: 248 
KGCNPDIVHYNTVILGFCREGRAHDACKVIADMRANGCLPNLVSYRTLARGLCDMGMLDE 307 Query: 1356 ARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTPHIDTWMEILP 1535 ARKY EML +GF+PHF+VV+ LVKGFCN+G+ E+ACGVL L HG+ PH+DTWM ++P Sbjct: 308 ARKYVEEMLCKGFSPHFAVVHALVKGFCNVGRAEDACGVLTMSLEHGEAPHVDTWMVLMP 367 Query: 1536 RISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKKKLTRSK 1682 I EVD+ L EVLKIE+K TRIV+ GLE YLIKK S+ Sbjct: 368 VICEVDDGGKISGALEEVLKIEIKGHTRIVDAGIGLENYLIKKIRANSR 416 >ref|XP_002320901.2| hypothetical protein POPTR_0014s10150g [Populus trichocarpa] gi|550323886|gb|EEE99216.2| hypothetical protein POPTR_0014s10150g [Populus trichocarpa] Length = 475 Score = 575 bits (1481), Expect = e-161 Identities = 274/429 (63%), Positives = 339/429 (79%) Frame = +3 Query: 378 HHPQESHQEKQRKEEQHLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRH 557 HH Q+ H+ + + H +S IGSP+R+ KLIA QSDPLLAKEIFD ASRQPNF+H Sbjct: 40 HHHQQ-HKRELEPSDSHPNANTKSPIGSPSRVQKLIASQSDPLLAKEIFDYASRQPNFQH 98 Query: 558 PYATFHSLILKLGRSGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTF 737 Y+++ LILKLGR+ +F + NYP++ +LFS II IYG A LP EALK F Sbjct: 99 SYSSYLILILKLGRAKYFSFIDDLLTDLKSKNYPVTQTLFSYIINIYGKANLPDEALKIF 158 Query: 738 YRILHFNMKPLPKHLNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCL 917 Y IL F+ P PKHLN +L+ILV+H N+++PA DLF+ AH Y V PNT SYNIL+RAFCL Sbjct: 159 YTILKFDCNPSPKHLNGILEILVSHHNYIKPAFDLFKDAHTYDVFPNTKSYNILIRAFCL 218 Query: 918 NDDLSIAYSLFNQMSKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYS 1097 N +S+AYSLFNQM KRDV+PD+ESYRIL+Q CRKSQVN AVDLLEDMLNKG+VPD S Sbjct: 219 NGQISMAYSLFNQMFKRDVMPDVESYRILMQALCRKSQVNGAVDLLEDMLNKGYVPDALS 278 Query: 1098 YSTLLNSLCRKKHLKAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMP 1277 Y+TLLNSLCRKK L+ AYKLLCRMK+KGCNPDI+HYNTVILGFCREGRA+DACKVLEDM Sbjct: 279 YTTLLNSLCRKKKLREAYKLLCRMKVKGCNPDIIHYNTVILGFCREGRAMDACKVLEDME 338 Query: 1278 PNGCLPNLVSYQTLIGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLE 1457 NGC+PNLVSY+TL+GGL DQG++DEA+ + EM+ +GF+PHF+V L+KGFCN+GK+E Sbjct: 339 SNGCMPNLVSYRTLVGGLCDQGMFDEAKSHLEEMMMKGFSPHFAVSNALIKGFCNVGKIE 398 Query: 1458 
EACGVLEEFLRHGQTPHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRA 1637 EACGV+EE L+HG+ PH +TW+ ++ RI EVD+ + IL++V K+E+K TRIVE Sbjct: 399 EACGVVEELLKHGEAPHTETWVMMVSRICEVDDLQRIGEILDKVKKVELKGDTRIVEAGI 458 Query: 1638 GLEEYLIKK 1664 GLEEYLIK+ Sbjct: 459 GLEEYLIKR 467 >gb|EPS71710.1| hypothetical protein M569_03047, partial [Genlisea aurea] Length = 407 Score = 560 bits (1443), Expect = e-156 Identities = 277/412 (67%), Positives = 328/412 (79%), Gaps = 3/412 (0%) Frame = +3 Query: 432 KRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHF 611 K +S IGSPARI KLIA Q DPLLAKEIFDLASRQP F+H YATFH+LI KLGRS HF Sbjct: 1 KENAQSCIGSPARIQKLIASQKDPLLAKEIFDLASRQPGFQHSYATFHTLIDKLGRSRHF 60 Query: 612 PLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRV 791 LM+ +SPSLFS+II+ YGDA LP +ALKTFY IL FNMKPL KHLNR+ Sbjct: 61 GLMENIILSLKLQRCSVSPSLFSRIIRFYGDANLPDKALKTFYTILEFNMKPLRKHLNRI 120 Query: 792 LDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRD 971 L+ILV++RN LRPA D+FR AH+YGVSPNT SYNI+MRAFCLNDDLSIAY+LFNQM KRD Sbjct: 121 LEILVSNRNLLRPAFDIFRAAHRYGVSPNTESYNIMMRAFCLNDDLSIAYTLFNQMFKRD 180 Query: 972 VVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAY 1151 +VP++ESYRIL+QG CRKSQVNKAVDLLEDM+NKG+VPD+ SY+TLLNSLCRKK LK AY Sbjct: 181 IVPNVESYRILMQGLCRKSQVNKAVDLLEDMMNKGYVPDSLSYTTLLNSLCRKKKLKEAY 240 Query: 1152 KLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVL-EDMPPNGCLPNLVSYQTLIGG 1328 KLLCRMK++GCNPDIVHYNTVI GFC+ GRA DACK++ EDMP GCLPNLVSYQ L+GG Sbjct: 241 KLLCRMKVRGCNPDIVHYNTVISGFCKSGRASDACKIVEEDMPSKGCLPNLVSYQNLVGG 300 Query: 1329 LSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFL--RHGQT 1502 L DQG+YDEA++Y M+SR F+PHFSVV++LV+G+C G EEAC VL + L + G Sbjct: 301 LCDQGMYDEAKRYVKVMVSRDFSPHFSVVHMLVRGYCKTGSHEEACEVLVDLLMMKRGGC 360 Query: 1503 PHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLI 1658 PH+++W E+LP + + E E + + +L KPSTRIV+ G EYLI Sbjct: 361 PHLESWAEVLPHV--IRESEGLESKMKGIL---AKPSTRIVDSGVGWAEYLI 407 >gb|AHB18410.1| pentatricopeptide repeat-containing protein [Gossypium hirsutum] Length = 
458 Score = 551 bits (1420), Expect = e-154 Identities = 270/412 (65%), Positives = 325/412 (78%) Frame = +3 Query: 447 SSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHFPLMQX 626 S I SP R+ KLI+ SDPLLA+EIFD+A QP FRH Y++F LILKLGRS HF L+ Sbjct: 47 SPIASPTRVLKLISAWSDPLLAEEIFDVAITQPGFRHSYSSFLVLILKLGRSKHFSLVDD 106 Query: 627 XXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRVLDILV 806 Y ++P+LFS +I+IY +A LP +AL FY++L FN+KPLP+HLNR+L++LV Sbjct: 107 LLVCLKSDQYRVTPTLFSYLIKIYAEADLPEKALSVFYKMLEFNVKPLPRHLNRILELLV 166 Query: 807 NHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRDVVPDI 986 +HRNF+ PA DLF+ AHKYGV PNT SYNILM AFCLN DLSIAY LFN+M +RDV+PDI Sbjct: 167 SHRNFIMPAFDLFKTAHKYGVFPNTKSYNILMGAFCLNGDLSIAYKLFNKMLERDVMPDI 226 Query: 987 ESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAYKLLCR 1166 ESY IL+QG CRKSQVN+AVDLLED LNKGF PD+ SYSTLLNSLCRKK L+ AYKLLCR Sbjct: 227 ESYGILMQGLCRKSQVNRAVDLLEDRLNKGFAPDSLSYSTLLNSLCRKKKLREAYKLLCR 286 Query: 1167 MKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIGGLSDQGL 1346 MK+KGCNPDIVHYNTVILGFCREGRA+ A KVLEDMP NGCLPNLVSY+TL+G L DQG+ Sbjct: 287 MKVKGCNPDIVHYNTVILGFCREGRAMGAVKVLEDMPSNGCLPNLVSYRTLVGWLCDQGM 346 Query: 1347 YDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTPHIDTWME 1526 +DEA+K+ EMLS+GF+ HFSV + L+KGFC++GK++ A VL E L + + PH DTW Sbjct: 347 FDEAKKHMEEMLSKGFSSHFSVSHALIKGFCSVGKIDAATEVLGEMLEYREVPHTDTWGT 406 Query: 1527 ILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKKKLTRSK 1682 I+P I E E E + IL EV+KIE+K TRIVE GLE+YLI+K RSK Sbjct: 407 IVPTICEDYETEKMEEILEEVMKIEIKRDTRIVEAGIGLEDYLIRKIRNRSK 458 >ref|XP_006444724.1| hypothetical protein CICLE_v10023955mg, partial [Citrus clementina] gi|557546986|gb|ESR57964.1| hypothetical protein CICLE_v10023955mg, partial [Citrus clementina] Length = 423 Score = 548 bits (1412), Expect = e-153 Identities = 259/385 (67%), Positives = 318/385 (82%) Frame = +3 Query: 387 QESHQEKQRKEEQHLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYA 566 QES ++++E + + S IGSP R+ KLIA QSDPLLAKEIFD ASRQPNFRH + Sbjct: 38 
QESPSSPEQQQESSISNSK-SPIGSPCRVQKLIASQSDPLLAKEIFDYASRQPNFRHSNS 96 Query: 567 TFHSLILKLGRSGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRI 746 T+ LILKLGR+ +F L+ +YP++PSLF+ +I+IY ++ LP ALKTF + Sbjct: 97 TYLILILKLGRAKYFSLIDDILITLKSEHYPVTPSLFTYLIKIYAESNLPDRALKTFRSM 156 Query: 747 LHFNMKPLPKHLNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDD 926 L FN KPLPK LNR+L++LV HRN+LRPA DLF+ AHK+GV PNT SYNI+MRAFC N D Sbjct: 157 LEFNCKPLPKQLNRILELLVTHRNYLRPAFDLFKSAHKHGVLPNTKSYNIMMRAFCFNGD 216 Query: 927 LSIAYSLFNQMSKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYST 1106 +SIAY+LFN+M +R V+PD+ESYRIL+QG CRKSQVN+AVDLLEDMLNKGFVPDT SY+T Sbjct: 217 ISIAYTLFNKMFERGVMPDVESYRILMQGLCRKSQVNRAVDLLEDMLNKGFVPDTLSYTT 276 Query: 1107 LLNSLCRKKHLKAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNG 1286 LLNSLCRKK L+ AYKLLCRMK+KGCNPDIVHYNTV+LGFCREGRA+DACKVLEDMP NG Sbjct: 277 LLNSLCRKKKLREAYKLLCRMKVKGCNPDIVHYNTVVLGFCREGRAIDACKVLEDMPSNG 336 Query: 1287 CLPNLVSYQTLIGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEAC 1466 CLPNLVSY+TL+GGL DQG++D A+KY M+S+GF+PHFSV + L+KGFCN+GK++EAC Sbjct: 337 CLPNLVSYRTLVGGLCDQGMFDVAKKYMQLMISKGFSPHFSVSHALIKGFCNVGKVDEAC 396 Query: 1467 GVLEEFLRHGQTPHIDTWMEILPRI 1541 GVLEE L+ G+ PH DTW+ I+P+I Sbjct: 397 GVLEELLKAGEAPHEDTWVMIVPQI 421 >gb|EXC13666.1| hypothetical protein L484_019627 [Morus notabilis] Length = 458 Score = 536 bits (1381), Expect = e-149 Identities = 259/402 (64%), Positives = 322/402 (80%), Gaps = 1/402 (0%) Frame = +3 Query: 459 SPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHFPLMQXXXXX 638 SP+R+ KLI QSDPLLAKEIFD ASRQPNFRH Y++F LILKLGRS +F L+ Sbjct: 47 SPSRVQKLIVSQSDPLLAKEIFDYASRQPNFRHSYSSFLILILKLGRSKYFSLIDNLLVR 106 Query: 639 XXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRVLDILVNHRN 818 YP++ +LFS +I+IYG+A LP + L+TFY ++ F+ KPLPKHLN++L+ILV++R+ Sbjct: 107 LKAERYPVTSTLFSHLIRIYGEADLPDKVLRTFYMMIEFDFKPLPKHLNQILEILVSYRS 166 Query: 819 FLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRDVVPDIESYR 998 + A DLF+ AH+YGV NT SYNI+MR FCLN DLSIAY LFN+M +RD+VP+ ESYR Sbjct: 167 
HILSAFDLFKSAHRYGVLLNTESYNIMMRVFCLNGDLSIAYQLFNKMFERDLVPNDESYR 226 Query: 999 ILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAYKLLCRMKLK 1178 IL+QG CRK QVN AVD LEDMLNKGF PDT SY+TLLNSLCRKK L+ AYKLLCRMK+K Sbjct: 227 ILMQGLCRKGQVNTAVDFLEDMLNKGFTPDTLSYTTLLNSLCRKKQLREAYKLLCRMKVK 286 Query: 1179 GCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIGGLSDQGLYDEA 1358 GCNPDIVHYNTVI+GFCREGRA+DACKVLEDM NGCLPN+VSY++L+ GL QG DEA Sbjct: 287 GCNPDIVHYNTVIVGFCREGRAMDACKVLEDMAENGCLPNVVSYRSLVSGLCHQGSLDEA 346 Query: 1359 RKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTPHIDTWMEILPR 1538 ++Y EM+S+G +PHFSVV+ LVKGFCN+G++EE CG+L E L+HG+ PH+DTW+ ILPR Sbjct: 347 KRYMEEMMSKGLSPHFSVVHALVKGFCNVGRVEETCGILAESLKHGEVPHMDTWIAILPR 406 Query: 1539 ISEVDEKENFDCILNEVLKI-EVKPSTRIVEIRAGLEEYLIK 1661 I E +E E+ D IL VLKI +V+ T++ E R LE+ L+K Sbjct: 407 ICEENEIESLDEILKGVLKIDQVQLGTKMHEPRTCLEDPLMK 448 >ref|XP_004494974.1| PREDICTED: conserved oligomeric Golgi complex subunit 4-like [Cicer arietinum] Length = 1302 Score = 521 bits (1341), Expect = e-145 Identities = 259/419 (61%), Positives = 311/419 (74%) Frame = +3 Query: 426 HLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSG 605 H S IGSP R+ KLIA QSDPLLAKEIFD AS QPNFRH Y+T+ L+LK GRS Sbjct: 42 HSYSNSSSPIGSPTRVQKLIASQSDPLLAKEIFDYASLQPNFRHTYSTYLILLLKFGRSK 101 Query: 606 HFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLN 785 HF L+ + PI+P+LFS +IQIY A LP +AL TFY +L FN KPL KHLN Sbjct: 102 HFSLLDDLLRRLKSDSQPITPTLFSYLIQIYAQADLPDKALNTFYTMLQFNCKPLTKHLN 161 Query: 786 RVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSK 965 R+L LV+HRN++RPA DLF+ AHK+GV P+T SYNILMRAFCLN D+SIAY+LFN+M + Sbjct: 162 RILVFLVSHRNYVRPAFDLFKDAHKHGVFPDTKSYNILMRAFCLNGDISIAYTLFNKMFQ 221 Query: 966 RDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKA 1145 RDV+PDIESYRIL+Q CRKSQVN AVDLLEDMLNKGFVPD+ +Y+TLLN Sbjct: 222 RDVIPDIESYRILMQALCRKSQVNGAVDLLEDMLNKGFVPDSLTYTTLLNR--------- 272 Query: 1146 AYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIG 1325 
CNPDIVHYNTVILGFCREGRA DACKVL+DM NGCLPNLVSY+TL+ Sbjct: 273 ------------CNPDIVHYNTVILGFCREGRASDACKVLDDMRANGCLPNLVSYRTLVN 320 Query: 1326 GLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTP 1505 GL D G+ DEA KY EM+S+GF+PHF+V++ LVKG CNIG++EEACGVL + L H + P Sbjct: 321 GLCDLGMLDEATKYVEEMMSKGFSPHFAVIHALVKGLCNIGRIEEACGVLTKSLEHREAP 380 Query: 1506 HIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKKKLTRSK 1682 H DTWM ++P+I EVD+ +L EVLKIE+K TRIV+ GLE+YLI+K +S+ Sbjct: 381 HTDTWMIVVPQICEVDDGLKIGGVLEEVLKIEIKGHTRIVDAGIGLEDYLIRKIRAKSR 439 >ref|XP_004138384.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like [Cucumis sativus] gi|449499186|ref|XP_004160743.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like [Cucumis sativus] Length = 482 Score = 518 bits (1335), Expect = e-144 Identities = 265/471 (56%), Positives = 338/471 (71%), Gaps = 8/471 (1%) Frame = +3 Query: 276 HVLPVHSYRSRLRSIANHCS-----LPQLFASCSSTQLKH---HPQESHQEKQRKEEQHL 431 H+L +YR+ A H + L L +S SS H H + K EQ Sbjct: 4 HLLRPCNYRTIETVAAAHVARKSPLLRNLISSSSSLYQPHLNVHNESKFLITNVKHEQ-C 62 Query: 432 KRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHF 611 + + + SIGSP R+ KLIA QSDPLLAKEIFD A RQP+FR ++ LILKLGRS +F Sbjct: 63 EDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSKYF 122 Query: 612 PLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRV 791 L+ YP++P+ FS II+IYG+A LP +ALK FY ++ F P K LNR+ Sbjct: 123 SLIDDLLLSFKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQLNRI 182 Query: 792 LDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRD 971 L+ILV+HRNF+RPA DLF+ A +GV PNT SYNIL+RAFC N ++SIAY+LFN+M +R+ Sbjct: 183 LEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMFERN 242 Query: 972 VVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAY 1151 V+PD+E+YR L+QG CRK+QVN AVDLLEDMLNKG++PDT SY+TLLNSLCRKK L+ AY Sbjct: 243 VIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAY 302 Query: 1152 
KLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIGGL 1331 KLLCRMK+KGCNPDI HYNTVI+GFCREGRALDACK+LEDM NGCLPNLVSY++L GL Sbjct: 303 KLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTNGL 362 Query: 1332 SDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTPHI 1511 DQG+++ A+ Y EM +GF PHFSV++ LVKGF +IG++ E+C VLE+ L+ G+ PH Sbjct: 363 CDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHS 422 Query: 1512 DTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKK 1664 DTW I+ I EV++ F + ++LK +V+ TRIVE GL EYLI+K Sbjct: 423 DTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTRIVEAGTGLGEYLIRK 473 >ref|NP_001154199.1| uncharacterized protein [Arabidopsis thaliana] gi|223635643|sp|Q8LDU5.2|PP298_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g01400, mitochondrial; Flags: Precursor gi|332656621|gb|AEE82021.1| uncharacterized protein AT4G01400 [Arabidopsis thaliana] Length = 466 Score = 516 bits (1330), Expect = e-143 Identities = 248/430 (57%), Positives = 318/430 (73%) Frame = +3 Query: 396 HQEKQRKEEQHLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFH 575 + + + + + +S IGSP R+ KLIA QSDPLLAKEIFD AS+QPNFRH ++ Sbjct: 29 YSSSEHEARKPIVSNPKSPIGSPTRVQKLIASQSDPLLAKEIFDYASQQPNFRHSRSSHL 88 Query: 576 SLILKLGRSGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHF 755 LILKLGR +F L+ YP++ +F+ +I++Y +A LP + L TFY++L F Sbjct: 89 ILILKLGRGRYFNLIDDVLAKHRSSGYPLTGEIFTYLIKVYAEAKLPEKVLSTFYKMLEF 148 Query: 756 NMKPLPKHLNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSI 935 N P PKHLNR+LD+LV+HR +L+ A +LF+ + +GV PNT SYN+LM+AFCLNDDLSI Sbjct: 149 NFTPQPKHLNRILDVLVSHRGYLQKAFELFKSSRLHGVMPNTRSYNLLMQAFCLNDDLSI 208 Query: 936 AYSLFNQMSKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLN 1115 AY LF +M +RDVVPD++SY+ILIQGFCRK QVN A++LL+DMLNKGFVPD SY+TLLN Sbjct: 209 AYQLFGKMLERDVVPDVDSYKILIQGFCRKGQVNGAMELLDDMLNKGFVPDRLSYTTLLN 268 Query: 1116 SLCRKKHLKAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLP 1295 SLCRK L+ AYKLLCRMKLKGCNPD+VHYNT+ILGFCRE RA+DA KVL+DM NGC P Sbjct: 269 
SLCRKTQLREAYKLLCRMKLKGCNPDLVHYNTMILGFCREDRAMDARKVLDDMLSNGCSP 328

Query: 1296 NLVSYQTLIGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVL 1475
            N VSY+TLIGGL DQG++DE +KY EM+S+GF+PHFSV LVKGFC+ GK+EEAC V+
Sbjct: 329  NSVSYRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFSVSNCLVKGFCSFGKVEEACDVV 388

Query: 1476 EEFLRHGQTPHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYL 1655
            E +++G+T H DTW ++P I DE E L + +K E+ TRIV++ GL YL
Sbjct: 389  EVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDAVKEEITGDTRIVDVGIGLGSYL 448

Query: 1656 IKKKLTRSKN 1685
            K + KN
Sbjct: 449  SSKLQMKRKN 458

>gb|AAC19289.1| contains similarity to Arabidopsis membrane-associated salt-inducible-like protein (GB:AL021637) [Arabidopsis thaliana]
Length = 991

Score = 516 bits (1328), Expect = e-143
Identities = 247/424 (58%), Positives = 316/424 (74%)
Frame = +3

Query: 396  HQEKQRKEEQHLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFH 575
            + + + + + +S IGSP R+ KLIA QSDPLLAKEIFD AS+QPNFRH ++
Sbjct: 29   YSSSEHEARKPIVSNPKSPIGSPTRVQKLIASQSDPLLAKEIFDYASQQPNFRHSRSSHL 88

Query: 576  SLILKLGRSGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHF 755
            LILKLGR +F L+ YP++ +F+ +I++Y +A LP + L TFY++L F
Sbjct: 89   ILILKLGRGRYFNLIDDVLAKHRSSGYPLTGEIFTYLIKVYAEAKLPEKVLSTFYKMLEF 148

Query: 756  NMKPLPKHLNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSI 935
            N P PKHLNR+LD+LV+HR +L+ A +LF+ + +GV PNT SYN+LM+AFCLNDDLSI
Sbjct: 149  NFTPQPKHLNRILDVLVSHRGYLQKAFELFKSSRLHGVMPNTRSYNLLMQAFCLNDDLSI 208

Query: 936  AYSLFNQMSKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLN 1115
            AY LF +M +RDVVPD++SY+ILIQGFCRK QVN A++LL+DMLNKGFVPD SY+TLLN
Sbjct: 209  AYQLFGKMLERDVVPDVDSYKILIQGFCRKGQVNGAMELLDDMLNKGFVPDRLSYTTLLN 268

Query: 1116 SLCRKKHLKAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLP 1295
            SLCRK L+ AYKLLCRMKLKGCNPD+VHYNT+ILGFCRE RA+DA KVL+DM NGC P
Sbjct: 269  SLCRKTQLREAYKLLCRMKLKGCNPDLVHYNTMILGFCREDRAMDARKVLDDMLSNGCSP 328

Query: 1296 NLVSYQTLIGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVL 1475
            N VSY+TLIGGL DQG++DE +KY EM+S+GF+PHFSV LVKGFC+ GK+EEAC V+
Sbjct: 329  NSVSYRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFSVSNCLVKGFCSFGKVEEACDVV 388

Query: 1476 EEFLRHGQTPHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYL 1655
            E +++G+T H DTW ++P I DE E L + +K E+ TRIV++ GL YL
Sbjct: 389  EVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDAVKEEITGDTRIVDVGIGLGSYL 448

Query: 1656 IKKK 1667
            K K
Sbjct: 449  SKNK 452

>ref|XP_006605274.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like, partial [Glycine max]
Length = 403

Score = 491 bits (1265), Expect = e-136
Identities = 239/374 (63%), Positives = 293/374 (78%)
Frame = +3

Query: 561  YATFHSLILKLGRSGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFY 740
            Y+++ L+LKLGRS HF + ++PI+P+LF+ + ++Y +A LP +ALKTFY
Sbjct: 29   YSSYLILLLKLGRSKHFTFLDGLLRPLKSDSHPITPTLFTYLFKVYPEADLPDKALKTFY 88

Query: 741  RILHFNMKPLPKHLNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLN 920
            ILHFN KPLPKHLNR+L++LV+HRN+LRPA DLF+ + YGV P+T S NILMR FCLN
Sbjct: 89   TILHFNCKPLPKHLNRILEVLVSHRNYLRPAFDLFKDSRSYGVEPDTKSCNILMRPFCLN 148

Query: 921  DDLSIAYSLFNQMSKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSY 1100
            D+SIAYSLFN M KRDVVPDIESYRIL+Q CRKS+VN AVDLLEDMLN GFVPD+ +Y
Sbjct: 149  GDISIAYSLFNIMFKRDVVPDIESYRILMQALCRKSRVNGAVDLLEDMLN-GFVPDSLTY 207

Query: 1101 STLLNSLCRKKHLKAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPP 1280
            +TLLNSLCRKK + AYKLLCRMK+KGCNPDIVH NTVILGFCR+GR DACKV+ DM
Sbjct: 208  TTLLNSLCRKKKFREAYKLLCRMKVKGCNPDIVHXNTVILGFCRDGRTHDACKVISDMRA 267

Query: 1281 NGCLPNLVSYQTLIGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEE 1460
            NG LPNLVSY+TL+ GL + G+ DEA KY EMLS+ F+PHF+VV+ LVKGFCN+G+ E+
Sbjct: 268  NGSLPNLVSYRTLVSGLCNMGMLDEASKYMEEMLSKDFSPHFAVVHALVKGFCNVGRTED 327

Query: 1461 ACGVLEEFLRHGQTPHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAG 1640
            ACGVL + L HG+ PH+DTWM I+P I EVD++ L EVLKIE+K TRIV+ G
Sbjct: 328  ACGVLTKALEHGEAPHVDTWMIIMPVICEVDDEGKSSGALEEVLKIEIKGHTRIVDAGIG 387

Query: 1641 LEEYLIKKKLTRSK 1682
            LE YLI K +RS+
Sbjct: 388  LENYLIGKIRSRSR 401

>ref|XP_004308275.1| PREDICTED: uncharacterized protein LOC101307637 [Fragaria vesca subsp. vesca]
Length = 2481

Score = 454 bits (1168), Expect = e-125
Identities = 226/386 (58%), Positives = 285/386 (73%), Gaps = 2/386 (0%)
Frame = +3

Query: 444  ESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHFPLMQ 623
            ES +GSPAR+ KLIA QSDPLLAKEIFD A++ P+FRH Y+++ +LILKLGR+ +F L+
Sbjct: 35   ESILGSPARVQKLIASQSDPLLAKEIFDFAAQHPHFRHSYSSYFTLILKLGRAHYFSLVD 94

Query: 624  XXXXXXXXX--NYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRVLD 797
            +Y SP+LF+ +I+IYGDA LP +AL+TFY + FN KP KHLNR+L+
Sbjct: 95   DLLLRLKSQPTSYSPSPALFTHLIKIYGDAHLPQKALRTFYTMFQFNCKPTVKHLNRILE 154

Query: 798  ILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRDVV 977
            ILV HRNFLR A D+FR AH++GV P+T SYNILMRAFCLN DLS+AY LFN+M +RDVV
Sbjct: 155  ILVAHRNFLRSAFDVFRDAHRHGVVPDTKSYNILMRAFCLNGDLSVAYGLFNKMYERDVV 214

Query: 978  PDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAYKL 1157
            PD+ESYRIL+QG CRK QVN +VD LEDM+NKGFVPD+ SY++L
Sbjct: 215  PDVESYRILMQGLCRKGQVNTSVDFLEDMMNKGFVPDSLSYTSL---------------- 258

Query: 1158 LCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIGGLSD 1337
            MK+KGCNPDIVHYNTVI GFCREGRA+DACKVLEDM +TL+ GL D
Sbjct: 259  ---MKVKGCNPDIVHYNTVISGFCREGRAVDACKVLEDM------------ETLVSGLCD 303

Query: 1338 QGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTPHIDT 1517
            QG+ DEA+KY M+ +GF+PHFSVV+ LVKGFCN+G++E+ACGV+EE LRHG+ PH DT
Sbjct: 304  QGMLDEAKKYMEVMILKGFSPHFSVVHGLVKGFCNVGRIEDACGVMEEILRHGEVPHRDT 363

Query: 1518 WMEILPRISEVDEKENFDCILNEVLK 1595
            W+ I+P I E E + + +++K
Sbjct: 364  WITIIPGICEEIELVRLEEVWKQIMK 389

>ref|XP_006837400.1| hypothetical protein AMTR_s00111p00140430 [Amborella trichopoda]
 gi|548840018|gb|ERN00254.1| hypothetical protein AMTR_s00111p00140430 [Amborella trichopoda]
Length = 429

Score = 446 bits (1146), Expect = e-122
Identities = 220/385 (57%), Positives = 281/385 (72%)
Frame = +3

Query: 447  SSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHFPLMQX 626
            S+IGSPAR+ KLIA Q DPLLA EIFDLASRQPNF Y++FHSLILKLGR F LM+
Sbjct: 43   SAIGSPARVQKLIASQPDPLLAYEIFDLASRQPNFTPSYSSFHSLILKLGRHRQFSLMEK 102

Query: 627  XXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRVLDILV 806
            P++P LFS +I IYGD+G+P +++KTF+++L F KP+ KH N ++ +LV
Sbjct: 103  LISKLKSEGRPVTPGLFSDVITIYGDSGMPDQSVKTFFKMLEFQCKPVAKHFNALILVLV 162

Query: 807  NHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRDVVPDI 986
            H N ++ A LF+ K+G+S NT ++NILM+AFC D LSIAY LFNQM K+ +VPD+
Sbjct: 163  EH-NRVQVAYSLFKDLEKFGISANTETFNILMKAFCFYDKLSIAYKLFNQMFKQGLVPDV 221

Query: 987  ESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAYKLLCR 1166
            ESYRIL+QG CRKSQV A++ +DM+NKGFVPD SY+TLLNSLCRKK L+ AYK+LCR
Sbjct: 222  ESYRILMQGLCRKSQVKTALNFFDDMMNKGFVPDALSYNTLLNSLCRKKKLREAYKMLCR 281

Query: 1167 MKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIGGLSDQGL 1346
            MK+KGCNPDI+HYNTVI GF REGRA DACKVLE+MP NGCLPN +SY+TL+ GL +G
Sbjct: 282  MKVKGCNPDILHYNTVITGFVREGRASDACKVLEEMPSNGCLPNSLSYRTLVDGLCKEGK 341

Query: 1347 YDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTPHIDTWME 1526
            EA+ Y EM+ +GF PH S ++ LV C GK++EAC +++ G PH TW
Sbjct: 342  LVEAKHYLGEMICKGFMPHTSSLHFLVVRICGGGKIDEACEMVKAAGNIGMAPHAKTWEL 401

Query: 1527 ILPRISEVDEKENFDCILNEVLKIE 1601
            ++ RI +VDE + IL EV+K E
Sbjct: 402  VMQRIFDVDE-VRIEAILREVVKRE 425