BLASTX nr result

ID: Catharanthus23_contig00002642 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00002642
         (2484 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006362890.1| PREDICTED: pentatricopeptide repeat-containi...   620   e-175
ref|XP_004251386.1| PREDICTED: pentatricopeptide repeat-containi...   616   e-173
emb|CAN72416.1| hypothetical protein VITISV_027905 [Vitis vinifera]   595   e-167
gb|EOX95584.1| Pentatricopeptide repeat-containing protein, mito...   590   e-166
ref|XP_006491485.1| PREDICTED: conserved oligomeric Golgi comple...   588   e-165
gb|EMJ20615.1| hypothetical protein PRUPE_ppa021922mg [Prunus pe...   580   e-162
ref|XP_003626608.1| Pentatricopeptide repeat-containing protein ...   578   e-162
gb|ESW11013.1| hypothetical protein PHAVU_009G258200g, partial [...   577   e-162
ref|XP_002320901.2| hypothetical protein POPTR_0014s10150g [Popu...   575   e-161
gb|EPS71710.1| hypothetical protein M569_03047, partial [Genlise...   560   e-156
gb|AHB18410.1| pentatricopeptide repeat-containing protein [Goss...   551   e-154
ref|XP_006444724.1| hypothetical protein CICLE_v10023955mg, part...   548   e-153
gb|EXC13666.1| hypothetical protein L484_019627 [Morus notabilis]     536   e-149
ref|XP_004494974.1| PREDICTED: conserved oligomeric Golgi comple...   521   e-145
ref|XP_004138384.1| PREDICTED: pentatricopeptide repeat-containi...   518   e-144
ref|NP_001154199.1| uncharacterized protein [Arabidopsis thalian...   516   e-143
gb|AAC19289.1| contains similarity to Arabidopsis membrane-assoc...   516   e-143
ref|XP_006605274.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   491   e-136
ref|XP_004308275.1| PREDICTED: uncharacterized protein LOC101307...   454   e-125
ref|XP_006837400.1| hypothetical protein AMTR_s00111p00140430 [A...   446   e-122

>ref|XP_006362890.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01400,
            mitochondrial-like [Solanum tuberosum]
          Length = 479

 Score =  620 bits (1599), Expect = e-175
 Identities = 315/477 (66%), Positives = 368/477 (77%), Gaps = 19/477 (3%)
 Frame = +3

Query: 309  LRSIANHCSLPQLFA--SCSSTQLK--------------HHPQESHQEKQRKEE--QHLK 434
            +R +++H S   L A   CSST L               +H Q+  Q+++R++E  +H +
Sbjct: 1    MRMLSHHFSSKDLLALVMCSSTWLSKVEPLSAWYKFKSHYHTQQPEQDRKRRQEDEEHKQ 60

Query: 435  RKEES-SIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHF 611
               +  SIGSPAR+ KLIA QSDPLLAKEIFDLASR+P+F+H YATFH+LILKLGRS  F
Sbjct: 61   NMNQGPSIGSPARVQKLIASQSDPLLAKEIFDLASREPDFQHSYATFHTLILKLGRSRQF 120

Query: 612  PLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRV 791
             LMQ         +Y ISPSLFS+IIQIYGDAGLP +ALKTFY IL FNMKPLPKHLN +
Sbjct: 121  SLMQSVFSSLKSQHYSISPSLFSRIIQIYGDAGLPDKALKTFYTILEFNMKPLPKHLNLI 180

Query: 792  LDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRD 971
            L+ILV HRNFLRPA DLFR AH YGV  NT SYNILMRAFCLNDDLSIAYSLFNQM KR+
Sbjct: 181  LEILVTHRNFLRPAFDLFRSAHTYGVLANTESYNILMRAFCLNDDLSIAYSLFNQMFKRE 240

Query: 972  VVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAY 1151
            + P++ESYRIL+QG CRKSQVN AVDLLEDMLNKGFVPD  SYSTLLNSLCRKK  K AY
Sbjct: 241  ISPNVESYRILMQGLCRKSQVNTAVDLLEDMLNKGFVPDALSYSTLLNSLCRKKKFKEAY 300

Query: 1152 KLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIGGL 1331
            KLLCRMK+KGCNPDIVHYNTVILGFCREGRA DACK+LEDMP NGCLPNLVSY+TL+GGL
Sbjct: 301  KLLCRMKVKGCNPDIVHYNTVILGFCREGRAADACKILEDMPSNGCLPNLVSYRTLVGGL 360

Query: 1332 SDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTPHI 1511
            S+QG+YDEA+ Y  EM+S+GF+PHFSVV+ +VKGFCN+GK+EEACGV    L HG+  H 
Sbjct: 361  SNQGMYDEAKNYMVEMMSKGFSPHFSVVHTVVKGFCNLGKIEEACGVAGSILSHGEPLHT 420

Query: 1512 DTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKKKLTRSK 1682
            DTW EI+ RI E D  E     L E+++ E+KP  RIVE  A L EYL+    ++S+
Sbjct: 421  DTWEEIVSRILEWDAAEKIGNTLVELIQAEIKPEMRIVEAGARLGEYLMNSIKSKSR 477


>ref|XP_004251386.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01400,
            mitochondrial-like [Solanum lycopersicum]
          Length = 479

 Score =  616 bits (1589), Expect = e-173
 Identities = 313/477 (65%), Positives = 361/477 (75%), Gaps = 19/477 (3%)
 Frame = +3

Query: 309  LRSIANHCSLPQLFA--SCSSTQL------------KHH-----PQESHQEKQRKEEQHL 431
            +R +++H S   L     CSS +L            K H     P++  +++Q  EE   
Sbjct: 1    MRMLSHHFSSKDLLVLVMCSSARLSKAEPLSAWYKFKSHYHTQQPEQDRKQRQADEEHKQ 60

Query: 432  KRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHF 611
               +  SIGSPAR+ KLIA QSDPLLAKEIFDLASR+P+F+H YATFH+LILKLGRS  F
Sbjct: 61   NTNQGPSIGSPARVQKLIASQSDPLLAKEIFDLASREPDFQHSYATFHTLILKLGRSRQF 120

Query: 612  PLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRV 791
             LMQ         +Y ISPSLFS IIQIYGDAGLP  ALKTFY IL FNMKPLPKHLN +
Sbjct: 121  SLMQSVLSSLKSQHYSISPSLFSHIIQIYGDAGLPDRALKTFYTILEFNMKPLPKHLNLI 180

Query: 792  LDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRD 971
            L+ILV HRNFLRPA DLFR AH YGV  NT SYNILMRAFCLNDDLSIAYSLFNQM KR+
Sbjct: 181  LEILVTHRNFLRPAFDLFRSAHTYGVLANTESYNILMRAFCLNDDLSIAYSLFNQMFKRE 240

Query: 972  VVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAY 1151
            + P++ESYRIL+QG CRKSQVN AVDLLEDMLNKGFVPD  SYSTLLNSLCRKK  K AY
Sbjct: 241  ISPNVESYRILMQGLCRKSQVNTAVDLLEDMLNKGFVPDALSYSTLLNSLCRKKKFKEAY 300

Query: 1152 KLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIGGL 1331
            KLLCRMK+KGCNPDIVHYNTVILGFCREGRA DACK+LEDMP NGCLPNLVSY+TL+GGL
Sbjct: 301  KLLCRMKVKGCNPDIVHYNTVILGFCREGRAADACKILEDMPSNGCLPNLVSYRTLVGGL 360

Query: 1332 SDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTPHI 1511
            SDQG+YDEA+ Y  EM+S+GF+PHFSVV+ +VKGFCN+GK+EEACGV    L HG+  H 
Sbjct: 361  SDQGMYDEAKNYMVEMMSKGFSPHFSVVHAVVKGFCNLGKIEEACGVAGSILSHGEPLHT 420

Query: 1512 DTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKKKLTRSK 1682
            DTW EI+  I E D  E     L ++++ E+KP TRIVE  A L EYL+    ++S+
Sbjct: 421  DTWEEIVSIILEWDAAEKIGNTLVQLIQAEIKPETRIVEAGARLGEYLMNNIKSKSR 477


>emb|CAN72416.1| hypothetical protein VITISV_027905 [Vitis vinifera]
          Length = 422

 Score =  595 bits (1533), Expect = e-167
 Identities = 288/421 (68%), Positives = 340/421 (80%)
 Frame = +3

Query: 420  EQHLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGR 599
            E H+K    S IGSP+R+ KLIA QSDPLLAKEIFDLAS QPNF+H Y++FH LILKLG 
Sbjct: 3    EPHVK---PSPIGSPSRVQKLIASQSDPLLAKEIFDLASLQPNFKHSYSSFHILILKLGW 59

Query: 600  SGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKH 779
            +  F LMQ          Y I+PSLFS II+IYG+A LP +ALKTF+ +L F+ KPLPKH
Sbjct: 60   ARQFSLMQDLLMRLKSEQYSINPSLFSDIIEIYGEANLPDQALKTFHSMLQFHSKPLPKH 119

Query: 780  LNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQM 959
            LN +L +LV+HRN++RPA DLF+ AH+YGVSP+T SYNILM AFC N DLSIAY+LFNQM
Sbjct: 120  LNXLLQLLVSHRNYIRPAFDLFKSAHRYGVSPDTKSYNILMSAFCFNGDLSIAYTLFNQM 179

Query: 960  SKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHL 1139
             KRDV PD+ESYRIL+QG CRKSQVN+AVDLLEDMLNKG+VPD  SY+TLLNSLCRKK L
Sbjct: 180  FKRDVAPDVESYRILMQGLCRKSQVNRAVDLLEDMLNKGYVPDALSYTTLLNSLCRKKKL 239

Query: 1140 KAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTL 1319
            K AYKLLCRMK+KGCNPDIVHYNTVILGFCREGR LDACKVLEDMP NGC PNL+SY TL
Sbjct: 240  KEAYKLLCRMKVKGCNPDIVHYNTVILGFCREGRXLDACKVLEDMPSNGCSPNLMSYGTL 299

Query: 1320 IGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQ 1499
            + GL DQGLYDEA+ Y  EMLS+GF+PHFSV + L+ GFCN+GKLEEAC VL E LRHG+
Sbjct: 300  VSGLCDQGLYDEAKNYVEEMLSKGFSPHFSVFHALINGFCNVGKLEEACEVLXEMLRHGE 359

Query: 1500 TPHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKKKLTRS 1679
              H +TW+ I+PRI EVD+    + I +E LK+E+ P+TR+VE   GLEEY+I+K   +S
Sbjct: 360  AXHTETWVAIIPRICEVDKLVRMENIFDEXLKLEITPNTRLVEAGIGLEEYVIRKVRDKS 419

Query: 1680 K 1682
            +
Sbjct: 420  R 420


>gb|EOX95584.1| Pentatricopeptide repeat-containing protein, mitochondrial [Theobroma
            cacao]
          Length = 461

 Score =  590 bits (1522), Expect = e-166
 Identities = 287/421 (68%), Positives = 343/421 (81%)
 Frame = +3

Query: 420  EQHLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGR 599
            +Q   R   S+IGSPAR+ KLI+ QSDPLLAKEIFD AS Q  FRH Y++F  LILKLGR
Sbjct: 39   KQQPPRTCTSAIGSPARVPKLISAQSDPLLAKEIFDYASNQLGFRHSYSSFLVLILKLGR 98

Query: 600  SGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKH 779
            S HF L+           YP++P+LFS +I+IY +A LP  ALKTFY++L FN+KPLPKH
Sbjct: 99   SKHFSLVDDLLIRLKTDRYPVTPTLFSYLIKIYAEANLPERALKTFYKMLEFNIKPLPKH 158

Query: 780  LNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQM 959
            LNR+L++LV+HRNFL PA DLF+ AHK+GV PNT SYNILM AFCLN DLS+AY LFN+M
Sbjct: 159  LNRILELLVSHRNFLMPAFDLFKNAHKHGVLPNTKSYNILMGAFCLNGDLSVAYKLFNKM 218

Query: 960  SKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHL 1139
             +RDVVPD+ESYRIL+QG CRKSQVN AVDLLED+LNKGF+PD+ SY+TLLNSLCRKK L
Sbjct: 219  FERDVVPDVESYRILMQGLCRKSQVNTAVDLLEDILNKGFIPDSLSYTTLLNSLCRKKKL 278

Query: 1140 KAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTL 1319
            + AYKLLCRMK+KGCNPD+VHYNTVILGFCREGRALDA KVLEDMP NGCLPNLVSY+TL
Sbjct: 279  REAYKLLCRMKVKGCNPDLVHYNTVILGFCREGRALDAVKVLEDMPSNGCLPNLVSYRTL 338

Query: 1320 IGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQ 1499
            IGGL DQG++DEA+KY  EML +GF+PHFSV + LVKGFCN+GK+EEA GV  E L++G+
Sbjct: 339  IGGLCDQGMFDEAKKYMEEMLIKGFSPHFSVSHTLVKGFCNVGKIEEAIGVFGEMLKYGE 398

Query: 1500 TPHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKKKLTRS 1679
             PH+DTW+ I+PRI E  E E    IL EV+K+E+K  TRIV+   GLE+YLI+K  +RS
Sbjct: 399  VPHMDTWVLIIPRICEDYETERMGEILEEVMKVEIKRDTRIVDAGTGLEDYLIRKIRSRS 458

Query: 1680 K 1682
            K
Sbjct: 459  K 459


>ref|XP_006491485.1| PREDICTED: conserved oligomeric Golgi complex subunit 4-like [Citrus
            sinensis]
          Length = 1352

 Score =  588 bits (1515), Expect = e-165
 Identities = 281/436 (64%), Positives = 350/436 (80%)
 Frame = +3

Query: 387  QESHQEKQRKEEQHLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYA 566
            QES    ++++E  +   + S IGSP R+ KLIA QSDPLLAKEIFD ASRQPNFRH  +
Sbjct: 38   QESPSSPEQQQESSISNSK-SPIGSPCRVQKLIASQSDPLLAKEIFDYASRQPNFRHSNS 96

Query: 567  TFHSLILKLGRSGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRI 746
            T+  LILKLGR+ +F L+          +YP++PSLF+ +I+IY ++ LP  ALKTF  +
Sbjct: 97   TYLILILKLGRAKYFSLIDDILITLKSEHYPVTPSLFTYLIKIYAESNLPDRALKTFRSM 156

Query: 747  LHFNMKPLPKHLNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDD 926
            L FN KPLPK LNR+L++LV HRN+LRPA DLF+ AHK+GV PNT SYNI+MRAFC N D
Sbjct: 157  LEFNCKPLPKQLNRILELLVTHRNYLRPAFDLFKSAHKHGVLPNTKSYNIMMRAFCFNGD 216

Query: 927  LSIAYSLFNQMSKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYST 1106
            +SIAY+LFN+M +R V+PD+ESYRIL+QG CRKSQVN+AVDLLEDMLNKGFVPDT SY+T
Sbjct: 217  ISIAYTLFNKMFERGVMPDVESYRILMQGLCRKSQVNRAVDLLEDMLNKGFVPDTLSYTT 276

Query: 1107 LLNSLCRKKHLKAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNG 1286
            LLNSLCRKK L+ AYKLLCRMK+KGCNPDIVHYNTV+LGFCREGRA+DACKVLEDMP NG
Sbjct: 277  LLNSLCRKKKLREAYKLLCRMKVKGCNPDIVHYNTVVLGFCREGRAIDACKVLEDMPSNG 336

Query: 1287 CLPNLVSYQTLIGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEAC 1466
            CLPNLVSY+TL+GGL DQG++D A+KY   M+S+GF+PHFSV + L+KGFCN+GK++EAC
Sbjct: 337  CLPNLVSYRTLVGGLCDQGMFDVAKKYMQLMISKGFSPHFSVSHALIKGFCNVGKVDEAC 396

Query: 1467 GVLEEFLRHGQTPHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLE 1646
            GVLEE L+ G+ PH DTW+ I+P+I   +E E    +LNE++K+E+K  TRIVE   GLE
Sbjct: 397  GVLEELLKAGEAPHEDTWVMIVPQICAGEEMEKLGEVLNEIVKVEIKGDTRIVEAGIGLE 456

Query: 1647 EYLIKKKLTRSKNK*F 1694
            +YLI K  +R + + F
Sbjct: 457  DYLIGKTRSRPRREKF 472


>gb|EMJ20615.1| hypothetical protein PRUPE_ppa021922mg [Prunus persica]
          Length = 465

 Score =  580 bits (1494), Expect = e-162
 Identities = 277/415 (66%), Positives = 338/415 (81%)
 Frame = +3

Query: 420  EQHLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGR 599
            + H +  E  SIGSP+RI  LIA QSDPLLAKEIFDLA+RQP+FRH Y++F +LILKLGR
Sbjct: 42   QPHNQNHEIGSIGSPSRIQNLIASQSDPLLAKEIFDLAARQPHFRHSYSSFFTLILKLGR 101

Query: 600  SGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKH 779
            S +F L+          NY +SP+LF+ +I+IYG+A LP +AL+TFY ++ F+ +P  KH
Sbjct: 102  SKYFSLVDDLLIRLKTQNYSVSPALFAHLIKIYGEANLPQKALRTFYTMVEFDCRPSVKH 161

Query: 780  LNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQM 959
            LNR+L ILV+HRNFLRPA D+F+ AH++GV PNT SYNILMRAFCLN DLSIAY LFN+M
Sbjct: 162  LNRILQILVSHRNFLRPAFDVFKDAHRHGVMPNTQSYNILMRAFCLNGDLSIAYQLFNKM 221

Query: 960  SKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHL 1139
             +RD+VPD++SYRIL+QG CRK QVN AVD LEDMLNKGFVPD+ SY++LLNSLCRKK L
Sbjct: 222  FERDLVPDVQSYRILMQGLCRKGQVNTAVDFLEDMLNKGFVPDSLSYTSLLNSLCRKKKL 281

Query: 1140 KAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTL 1319
            + AYKLLCRMK+KGCNPDIVHYNTVILGFCREGR +DACKVLEDM  NGCLPNLVSY+TL
Sbjct: 282  REAYKLLCRMKVKGCNPDIVHYNTVILGFCREGRPVDACKVLEDMASNGCLPNLVSYRTL 341

Query: 1320 IGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQ 1499
            + GL D G+ DEA+ Y   M+SRGF+PHFSVV+ LVKGFCN+G++EEA  VLEE L+HG+
Sbjct: 342  VSGLCDHGMLDEAKSYMETMISRGFSPHFSVVHALVKGFCNVGRVEEAFAVLEEVLKHGE 401

Query: 1500 TPHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKK 1664
             PH DTW+ I+P I E  E E  + IL EV+K+E++P+TRIVE   GLE+YLIKK
Sbjct: 402  VPHTDTWLTIVPGICEEIELERLEEILREVMKVEIRPNTRIVEAAIGLEDYLIKK 456


>ref|XP_003626608.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|87240852|gb|ABD32710.1| Tetratricopeptide-like helical
            [Medicago truncatula] gi|355501623|gb|AES82826.1|
            Pentatricopeptide repeat-containing protein [Medicago
            truncatula]
          Length = 451

 Score =  578 bits (1490), Expect = e-162
 Identities = 282/420 (67%), Positives = 337/420 (80%), Gaps = 1/420 (0%)
 Frame = +3

Query: 426  HLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSG 605
            H      S IGSP R+ KLIA QSDPLLAKEIFD AS QPNFRH Y+T+  LILK GRS 
Sbjct: 30   HSSSSSSSPIGSPTRVQKLIASQSDPLLAKEIFDYASLQPNFRHNYSTYLILILKFGRSK 89

Query: 606  HFPLMQXXXXXXXXXN-YPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHL 782
            HF L+          +  PI+P+LFS +I+IYG+A LP +AL TFY +L FN+KPL KHL
Sbjct: 90   HFSLLDDLLRRLKSESSQPITPTLFSYLIKIYGEANLPDKALNTFYIMLQFNIKPLTKHL 149

Query: 783  NRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMS 962
            NR+LDILV+HRN+LRPA DLF+ AHK+GV P+T SYNILMRAFCLN D+SIAY+LFN+M 
Sbjct: 150  NRILDILVSHRNYLRPAFDLFKDAHKHGVFPDTKSYNILMRAFCLNGDISIAYTLFNKMF 209

Query: 963  KRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLK 1142
            KRDVVPDI+SYRIL+Q  CRKSQVN AVDL EDMLNKGFVPD+++Y+TLLNSLCRKK L+
Sbjct: 210  KRDVVPDIQSYRILMQALCRKSQVNGAVDLFEDMLNKGFVPDSFTYTTLLNSLCRKKKLR 269

Query: 1143 AAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLI 1322
             AYKLLCRMK+KGCNPDIVHYNTVILGFCREGRA DACKV++DM  NGCLPNLVSY+TL+
Sbjct: 270  EAYKLLCRMKVKGCNPDIVHYNTVILGFCREGRAHDACKVIDDMQANGCLPNLVSYRTLV 329

Query: 1323 GGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQT 1502
             GL   G+ DEA KY  EMLS+GF+PHF+V++ LVKGFCN+G++EEACGVL + L H + 
Sbjct: 330  NGLCHLGMLDEATKYVEEMLSKGFSPHFAVIHALVKGFCNVGRIEEACGVLTKSLEHREA 389

Query: 1503 PHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKKKLTRSK 1682
            PH DTWM I+P+I EVD+    D +L EVLKIE+K  TRIV+   GLE+YLI+K   +S+
Sbjct: 390  PHKDTWMIIVPQICEVDDGVKIDGVLEEVLKIEIKGDTRIVDAGIGLEDYLIRKIRAKSR 449


>gb|ESW11013.1| hypothetical protein PHAVU_009G258200g, partial [Phaseolus vulgaris]
          Length = 418

 Score =  577 bits (1487), Expect = e-162
 Identities = 280/409 (68%), Positives = 330/409 (80%)
 Frame = +3

Query: 456  GSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHFPLMQXXXX 635
            GSP R+ KLIA QSDPLLAKEIFD+ASRQPNFRH Y+T+  LILKLGRS +F  +     
Sbjct: 8    GSPTRVQKLIASQSDPLLAKEIFDVASRQPNFRHTYSTYLILILKLGRSKNFSFIDHLLR 67

Query: 636  XXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRVLDILVNHR 815
                 + PI+P+LF+ +I++Y +A LP +ALKTFY ILHF+ KPLPKHLNR+L++LV+HR
Sbjct: 68   CLRSDSQPITPTLFTYLIRVYAEADLPEKALKTFYNILHFDCKPLPKHLNRILELLVSHR 127

Query: 816  NFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRDVVPDIESY 995
            N++RPA  LF+ AH+YGV PNT SYNILMRAFCLN D+SIAYSLFN+M KRDVVPDIESY
Sbjct: 128  NYIRPAFLLFKDAHRYGVEPNTKSYNILMRAFCLNGDISIAYSLFNKMFKRDVVPDIESY 187

Query: 996  RILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAYKLLCRMKL 1175
            RIL+Q  CRKSQVN AVDLLEDMLNKGFVPD+ +Y+TLLNSLCRKK L+ AYKLLCRMK+
Sbjct: 188  RILMQALCRKSQVNGAVDLLEDMLNKGFVPDSLTYTTLLNSLCRKKKLREAYKLLCRMKV 247

Query: 1176 KGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIGGLSDQGLYDE 1355
            KGCNPDIVHYNTVILGFCREGRA DACKV+ DM  NGCLPNLVSY+TL  GL D G+ DE
Sbjct: 248  KGCNPDIVHYNTVILGFCREGRAHDACKVIADMRANGCLPNLVSYRTLARGLCDMGMLDE 307

Query: 1356 ARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTPHIDTWMEILP 1535
            ARKY  EML +GF+PHF+VV+ LVKGFCN+G+ E+ACGVL   L HG+ PH+DTWM ++P
Sbjct: 308  ARKYVEEMLCKGFSPHFAVVHALVKGFCNVGRAEDACGVLTMSLEHGEAPHVDTWMVLMP 367

Query: 1536 RISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKKKLTRSK 1682
             I EVD+       L EVLKIE+K  TRIV+   GLE YLIKK    S+
Sbjct: 368  VICEVDDGGKISGALEEVLKIEIKGHTRIVDAGIGLENYLIKKIRANSR 416


>ref|XP_002320901.2| hypothetical protein POPTR_0014s10150g [Populus trichocarpa]
            gi|550323886|gb|EEE99216.2| hypothetical protein
            POPTR_0014s10150g [Populus trichocarpa]
          Length = 475

 Score =  575 bits (1481), Expect = e-161
 Identities = 274/429 (63%), Positives = 339/429 (79%)
 Frame = +3

Query: 378  HHPQESHQEKQRKEEQHLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRH 557
            HH Q+ H+ +    + H     +S IGSP+R+ KLIA QSDPLLAKEIFD ASRQPNF+H
Sbjct: 40   HHHQQ-HKRELEPSDSHPNANTKSPIGSPSRVQKLIASQSDPLLAKEIFDYASRQPNFQH 98

Query: 558  PYATFHSLILKLGRSGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTF 737
             Y+++  LILKLGR+ +F  +          NYP++ +LFS II IYG A LP EALK F
Sbjct: 99   SYSSYLILILKLGRAKYFSFIDDLLTDLKSKNYPVTQTLFSYIINIYGKANLPDEALKIF 158

Query: 738  YRILHFNMKPLPKHLNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCL 917
            Y IL F+  P PKHLN +L+ILV+H N+++PA DLF+ AH Y V PNT SYNIL+RAFCL
Sbjct: 159  YTILKFDCNPSPKHLNGILEILVSHHNYIKPAFDLFKDAHTYDVFPNTKSYNILIRAFCL 218

Query: 918  NDDLSIAYSLFNQMSKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYS 1097
            N  +S+AYSLFNQM KRDV+PD+ESYRIL+Q  CRKSQVN AVDLLEDMLNKG+VPD  S
Sbjct: 219  NGQISMAYSLFNQMFKRDVMPDVESYRILMQALCRKSQVNGAVDLLEDMLNKGYVPDALS 278

Query: 1098 YSTLLNSLCRKKHLKAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMP 1277
            Y+TLLNSLCRKK L+ AYKLLCRMK+KGCNPDI+HYNTVILGFCREGRA+DACKVLEDM 
Sbjct: 279  YTTLLNSLCRKKKLREAYKLLCRMKVKGCNPDIIHYNTVILGFCREGRAMDACKVLEDME 338

Query: 1278 PNGCLPNLVSYQTLIGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLE 1457
             NGC+PNLVSY+TL+GGL DQG++DEA+ +  EM+ +GF+PHF+V   L+KGFCN+GK+E
Sbjct: 339  SNGCMPNLVSYRTLVGGLCDQGMFDEAKSHLEEMMMKGFSPHFAVSNALIKGFCNVGKIE 398

Query: 1458 EACGVLEEFLRHGQTPHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRA 1637
            EACGV+EE L+HG+ PH +TW+ ++ RI EVD+ +    IL++V K+E+K  TRIVE   
Sbjct: 399  EACGVVEELLKHGEAPHTETWVMMVSRICEVDDLQRIGEILDKVKKVELKGDTRIVEAGI 458

Query: 1638 GLEEYLIKK 1664
            GLEEYLIK+
Sbjct: 459  GLEEYLIKR 467


>gb|EPS71710.1| hypothetical protein M569_03047, partial [Genlisea aurea]
          Length = 407

 Score =  560 bits (1443), Expect = e-156
 Identities = 277/412 (67%), Positives = 328/412 (79%), Gaps = 3/412 (0%)
 Frame = +3

Query: 432  KRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHF 611
            K   +S IGSPARI KLIA Q DPLLAKEIFDLASRQP F+H YATFH+LI KLGRS HF
Sbjct: 1    KENAQSCIGSPARIQKLIASQKDPLLAKEIFDLASRQPGFQHSYATFHTLIDKLGRSRHF 60

Query: 612  PLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRV 791
             LM+            +SPSLFS+II+ YGDA LP +ALKTFY IL FNMKPL KHLNR+
Sbjct: 61   GLMENIILSLKLQRCSVSPSLFSRIIRFYGDANLPDKALKTFYTILEFNMKPLRKHLNRI 120

Query: 792  LDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRD 971
            L+ILV++RN LRPA D+FR AH+YGVSPNT SYNI+MRAFCLNDDLSIAY+LFNQM KRD
Sbjct: 121  LEILVSNRNLLRPAFDIFRAAHRYGVSPNTESYNIMMRAFCLNDDLSIAYTLFNQMFKRD 180

Query: 972  VVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAY 1151
            +VP++ESYRIL+QG CRKSQVNKAVDLLEDM+NKG+VPD+ SY+TLLNSLCRKK LK AY
Sbjct: 181  IVPNVESYRILMQGLCRKSQVNKAVDLLEDMMNKGYVPDSLSYTTLLNSLCRKKKLKEAY 240

Query: 1152 KLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVL-EDMPPNGCLPNLVSYQTLIGG 1328
            KLLCRMK++GCNPDIVHYNTVI GFC+ GRA DACK++ EDMP  GCLPNLVSYQ L+GG
Sbjct: 241  KLLCRMKVRGCNPDIVHYNTVISGFCKSGRASDACKIVEEDMPSKGCLPNLVSYQNLVGG 300

Query: 1329 LSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFL--RHGQT 1502
            L DQG+YDEA++Y   M+SR F+PHFSVV++LV+G+C  G  EEAC VL + L  + G  
Sbjct: 301  LCDQGMYDEAKRYVKVMVSRDFSPHFSVVHMLVRGYCKTGSHEEACEVLVDLLMMKRGGC 360

Query: 1503 PHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLI 1658
            PH+++W E+LP +  + E E  +  +  +L    KPSTRIV+   G  EYLI
Sbjct: 361  PHLESWAEVLPHV--IRESEGLESKMKGIL---AKPSTRIVDSGVGWAEYLI 407


>gb|AHB18410.1| pentatricopeptide repeat-containing protein [Gossypium hirsutum]
          Length = 458

 Score =  551 bits (1420), Expect = e-154
 Identities = 270/412 (65%), Positives = 325/412 (78%)
 Frame = +3

Query: 447  SSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHFPLMQX 626
            S I SP R+ KLI+  SDPLLA+EIFD+A  QP FRH Y++F  LILKLGRS HF L+  
Sbjct: 47   SPIASPTRVLKLISAWSDPLLAEEIFDVAITQPGFRHSYSSFLVLILKLGRSKHFSLVDD 106

Query: 627  XXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRVLDILV 806
                     Y ++P+LFS +I+IY +A LP +AL  FY++L FN+KPLP+HLNR+L++LV
Sbjct: 107  LLVCLKSDQYRVTPTLFSYLIKIYAEADLPEKALSVFYKMLEFNVKPLPRHLNRILELLV 166

Query: 807  NHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRDVVPDI 986
            +HRNF+ PA DLF+ AHKYGV PNT SYNILM AFCLN DLSIAY LFN+M +RDV+PDI
Sbjct: 167  SHRNFIMPAFDLFKTAHKYGVFPNTKSYNILMGAFCLNGDLSIAYKLFNKMLERDVMPDI 226

Query: 987  ESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAYKLLCR 1166
            ESY IL+QG CRKSQVN+AVDLLED LNKGF PD+ SYSTLLNSLCRKK L+ AYKLLCR
Sbjct: 227  ESYGILMQGLCRKSQVNRAVDLLEDRLNKGFAPDSLSYSTLLNSLCRKKKLREAYKLLCR 286

Query: 1167 MKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIGGLSDQGL 1346
            MK+KGCNPDIVHYNTVILGFCREGRA+ A KVLEDMP NGCLPNLVSY+TL+G L DQG+
Sbjct: 287  MKVKGCNPDIVHYNTVILGFCREGRAMGAVKVLEDMPSNGCLPNLVSYRTLVGWLCDQGM 346

Query: 1347 YDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTPHIDTWME 1526
            +DEA+K+  EMLS+GF+ HFSV + L+KGFC++GK++ A  VL E L + + PH DTW  
Sbjct: 347  FDEAKKHMEEMLSKGFSSHFSVSHALIKGFCSVGKIDAATEVLGEMLEYREVPHTDTWGT 406

Query: 1527 ILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKKKLTRSK 1682
            I+P I E  E E  + IL EV+KIE+K  TRIVE   GLE+YLI+K   RSK
Sbjct: 407  IVPTICEDYETEKMEEILEEVMKIEIKRDTRIVEAGIGLEDYLIRKIRNRSK 458


>ref|XP_006444724.1| hypothetical protein CICLE_v10023955mg, partial [Citrus clementina]
            gi|557546986|gb|ESR57964.1| hypothetical protein
            CICLE_v10023955mg, partial [Citrus clementina]
          Length = 423

 Score =  548 bits (1412), Expect = e-153
 Identities = 259/385 (67%), Positives = 318/385 (82%)
 Frame = +3

Query: 387  QESHQEKQRKEEQHLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYA 566
            QES    ++++E  +   + S IGSP R+ KLIA QSDPLLAKEIFD ASRQPNFRH  +
Sbjct: 38   QESPSSPEQQQESSISNSK-SPIGSPCRVQKLIASQSDPLLAKEIFDYASRQPNFRHSNS 96

Query: 567  TFHSLILKLGRSGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRI 746
            T+  LILKLGR+ +F L+          +YP++PSLF+ +I+IY ++ LP  ALKTF  +
Sbjct: 97   TYLILILKLGRAKYFSLIDDILITLKSEHYPVTPSLFTYLIKIYAESNLPDRALKTFRSM 156

Query: 747  LHFNMKPLPKHLNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDD 926
            L FN KPLPK LNR+L++LV HRN+LRPA DLF+ AHK+GV PNT SYNI+MRAFC N D
Sbjct: 157  LEFNCKPLPKQLNRILELLVTHRNYLRPAFDLFKSAHKHGVLPNTKSYNIMMRAFCFNGD 216

Query: 927  LSIAYSLFNQMSKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYST 1106
            +SIAY+LFN+M +R V+PD+ESYRIL+QG CRKSQVN+AVDLLEDMLNKGFVPDT SY+T
Sbjct: 217  ISIAYTLFNKMFERGVMPDVESYRILMQGLCRKSQVNRAVDLLEDMLNKGFVPDTLSYTT 276

Query: 1107 LLNSLCRKKHLKAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNG 1286
            LLNSLCRKK L+ AYKLLCRMK+KGCNPDIVHYNTV+LGFCREGRA+DACKVLEDMP NG
Sbjct: 277  LLNSLCRKKKLREAYKLLCRMKVKGCNPDIVHYNTVVLGFCREGRAIDACKVLEDMPSNG 336

Query: 1287 CLPNLVSYQTLIGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEAC 1466
            CLPNLVSY+TL+GGL DQG++D A+KY   M+S+GF+PHFSV + L+KGFCN+GK++EAC
Sbjct: 337  CLPNLVSYRTLVGGLCDQGMFDVAKKYMQLMISKGFSPHFSVSHALIKGFCNVGKVDEAC 396

Query: 1467 GVLEEFLRHGQTPHIDTWMEILPRI 1541
            GVLEE L+ G+ PH DTW+ I+P+I
Sbjct: 397  GVLEELLKAGEAPHEDTWVMIVPQI 421


>gb|EXC13666.1| hypothetical protein L484_019627 [Morus notabilis]
          Length = 458

 Score =  536 bits (1381), Expect = e-149
 Identities = 259/402 (64%), Positives = 322/402 (80%), Gaps = 1/402 (0%)
 Frame = +3

Query: 459  SPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHFPLMQXXXXX 638
            SP+R+ KLI  QSDPLLAKEIFD ASRQPNFRH Y++F  LILKLGRS +F L+      
Sbjct: 47   SPSRVQKLIVSQSDPLLAKEIFDYASRQPNFRHSYSSFLILILKLGRSKYFSLIDNLLVR 106

Query: 639  XXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRVLDILVNHRN 818
                 YP++ +LFS +I+IYG+A LP + L+TFY ++ F+ KPLPKHLN++L+ILV++R+
Sbjct: 107  LKAERYPVTSTLFSHLIRIYGEADLPDKVLRTFYMMIEFDFKPLPKHLNQILEILVSYRS 166

Query: 819  FLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRDVVPDIESYR 998
             +  A DLF+ AH+YGV  NT SYNI+MR FCLN DLSIAY LFN+M +RD+VP+ ESYR
Sbjct: 167  HILSAFDLFKSAHRYGVLLNTESYNIMMRVFCLNGDLSIAYQLFNKMFERDLVPNDESYR 226

Query: 999  ILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAYKLLCRMKLK 1178
            IL+QG CRK QVN AVD LEDMLNKGF PDT SY+TLLNSLCRKK L+ AYKLLCRMK+K
Sbjct: 227  ILMQGLCRKGQVNTAVDFLEDMLNKGFTPDTLSYTTLLNSLCRKKQLREAYKLLCRMKVK 286

Query: 1179 GCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIGGLSDQGLYDEA 1358
            GCNPDIVHYNTVI+GFCREGRA+DACKVLEDM  NGCLPN+VSY++L+ GL  QG  DEA
Sbjct: 287  GCNPDIVHYNTVIVGFCREGRAMDACKVLEDMAENGCLPNVVSYRSLVSGLCHQGSLDEA 346

Query: 1359 RKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTPHIDTWMEILPR 1538
            ++Y  EM+S+G +PHFSVV+ LVKGFCN+G++EE CG+L E L+HG+ PH+DTW+ ILPR
Sbjct: 347  KRYMEEMMSKGLSPHFSVVHALVKGFCNVGRVEETCGILAESLKHGEVPHMDTWIAILPR 406

Query: 1539 ISEVDEKENFDCILNEVLKI-EVKPSTRIVEIRAGLEEYLIK 1661
            I E +E E+ D IL  VLKI +V+  T++ E R  LE+ L+K
Sbjct: 407  ICEENEIESLDEILKGVLKIDQVQLGTKMHEPRTCLEDPLMK 448


>ref|XP_004494974.1| PREDICTED: conserved oligomeric Golgi complex subunit 4-like [Cicer
            arietinum]
          Length = 1302

 Score =  521 bits (1341), Expect = e-145
 Identities = 259/419 (61%), Positives = 311/419 (74%)
 Frame = +3

Query: 426  HLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSG 605
            H      S IGSP R+ KLIA QSDPLLAKEIFD AS QPNFRH Y+T+  L+LK GRS 
Sbjct: 42   HSYSNSSSPIGSPTRVQKLIASQSDPLLAKEIFDYASLQPNFRHTYSTYLILLLKFGRSK 101

Query: 606  HFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLN 785
            HF L+          + PI+P+LFS +IQIY  A LP +AL TFY +L FN KPL KHLN
Sbjct: 102  HFSLLDDLLRRLKSDSQPITPTLFSYLIQIYAQADLPDKALNTFYTMLQFNCKPLTKHLN 161

Query: 786  RVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSK 965
            R+L  LV+HRN++RPA DLF+ AHK+GV P+T SYNILMRAFCLN D+SIAY+LFN+M +
Sbjct: 162  RILVFLVSHRNYVRPAFDLFKDAHKHGVFPDTKSYNILMRAFCLNGDISIAYTLFNKMFQ 221

Query: 966  RDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKA 1145
            RDV+PDIESYRIL+Q  CRKSQVN AVDLLEDMLNKGFVPD+ +Y+TLLN          
Sbjct: 222  RDVIPDIESYRILMQALCRKSQVNGAVDLLEDMLNKGFVPDSLTYTTLLNR--------- 272

Query: 1146 AYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIG 1325
                        CNPDIVHYNTVILGFCREGRA DACKVL+DM  NGCLPNLVSY+TL+ 
Sbjct: 273  ------------CNPDIVHYNTVILGFCREGRASDACKVLDDMRANGCLPNLVSYRTLVN 320

Query: 1326 GLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTP 1505
            GL D G+ DEA KY  EM+S+GF+PHF+V++ LVKG CNIG++EEACGVL + L H + P
Sbjct: 321  GLCDLGMLDEATKYVEEMMSKGFSPHFAVIHALVKGLCNIGRIEEACGVLTKSLEHREAP 380

Query: 1506 HIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKKKLTRSK 1682
            H DTWM ++P+I EVD+      +L EVLKIE+K  TRIV+   GLE+YLI+K   +S+
Sbjct: 381  HTDTWMIVVPQICEVDDGLKIGGVLEEVLKIEIKGHTRIVDAGIGLEDYLIRKIRAKSR 439


>ref|XP_004138384.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01400,
            mitochondrial-like [Cucumis sativus]
            gi|449499186|ref|XP_004160743.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g01400,
            mitochondrial-like [Cucumis sativus]
          Length = 482

 Score =  518 bits (1335), Expect = e-144
 Identities = 265/471 (56%), Positives = 338/471 (71%), Gaps = 8/471 (1%)
 Frame = +3

Query: 276  HVLPVHSYRSRLRSIANHCS-----LPQLFASCSSTQLKH---HPQESHQEKQRKEEQHL 431
            H+L   +YR+     A H +     L  L +S SS    H   H +        K EQ  
Sbjct: 4    HLLRPCNYRTIETVAAAHVARKSPLLRNLISSSSSLYQPHLNVHNESKFLITNVKHEQ-C 62

Query: 432  KRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHF 611
            + + + SIGSP R+ KLIA QSDPLLAKEIFD A RQP+FR   ++   LILKLGRS +F
Sbjct: 63   EDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSKYF 122

Query: 612  PLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRV 791
             L+           YP++P+ FS II+IYG+A LP +ALK FY ++ F   P  K LNR+
Sbjct: 123  SLIDDLLLSFKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQLNRI 182

Query: 792  LDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRD 971
            L+ILV+HRNF+RPA DLF+ A  +GV PNT SYNIL+RAFC N ++SIAY+LFN+M +R+
Sbjct: 183  LEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMFERN 242

Query: 972  VVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAY 1151
            V+PD+E+YR L+QG CRK+QVN AVDLLEDMLNKG++PDT SY+TLLNSLCRKK L+ AY
Sbjct: 243  VIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAY 302

Query: 1152 KLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIGGL 1331
            KLLCRMK+KGCNPDI HYNTVI+GFCREGRALDACK+LEDM  NGCLPNLVSY++L  GL
Sbjct: 303  KLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTNGL 362

Query: 1332 SDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTPHI 1511
             DQG+++ A+ Y  EM  +GF PHFSV++ LVKGF +IG++ E+C VLE+ L+ G+ PH 
Sbjct: 363  CDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHS 422

Query: 1512 DTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKK 1664
            DTW  I+  I EV++   F  +  ++LK +V+  TRIVE   GL EYLI+K
Sbjct: 423  DTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTRIVEAGTGLGEYLIRK 473


>ref|NP_001154199.1| uncharacterized protein [Arabidopsis thaliana]
            gi|223635643|sp|Q8LDU5.2|PP298_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g01400, mitochondrial; Flags: Precursor
            gi|332656621|gb|AEE82021.1| uncharacterized protein
            AT4G01400 [Arabidopsis thaliana]
          Length = 466

 Score =  516 bits (1330), Expect = e-143
 Identities = 248/430 (57%), Positives = 318/430 (73%)
 Frame = +3

Query: 396  HQEKQRKEEQHLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFH 575
            +   + +  + +    +S IGSP R+ KLIA QSDPLLAKEIFD AS+QPNFRH  ++  
Sbjct: 29   YSSSEHEARKPIVSNPKSPIGSPTRVQKLIASQSDPLLAKEIFDYASQQPNFRHSRSSHL 88

Query: 576  SLILKLGRSGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHF 755
             LILKLGR  +F L+           YP++  +F+ +I++Y +A LP + L TFY++L F
Sbjct: 89   ILILKLGRGRYFNLIDDVLAKHRSSGYPLTGEIFTYLIKVYAEAKLPEKVLSTFYKMLEF 148

Query: 756  NMKPLPKHLNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSI 935
            N  P PKHLNR+LD+LV+HR +L+ A +LF+ +  +GV PNT SYN+LM+AFCLNDDLSI
Sbjct: 149  NFTPQPKHLNRILDVLVSHRGYLQKAFELFKSSRLHGVMPNTRSYNLLMQAFCLNDDLSI 208

Query: 936  AYSLFNQMSKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLN 1115
            AY LF +M +RDVVPD++SY+ILIQGFCRK QVN A++LL+DMLNKGFVPD  SY+TLLN
Sbjct: 209  AYQLFGKMLERDVVPDVDSYKILIQGFCRKGQVNGAMELLDDMLNKGFVPDRLSYTTLLN 268

Query: 1116 SLCRKKHLKAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLP 1295
            SLCRK  L+ AYKLLCRMKLKGCNPD+VHYNT+ILGFCRE RA+DA KVL+DM  NGC P
Sbjct: 269  SLCRKTQLREAYKLLCRMKLKGCNPDLVHYNTMILGFCREDRAMDARKVLDDMLSNGCSP 328

Query: 1296 NLVSYQTLIGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVL 1475
            N VSY+TLIGGL DQG++DE +KY  EM+S+GF+PHFSV   LVKGFC+ GK+EEAC V+
Sbjct: 329  NSVSYRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFSVSNCLVKGFCSFGKVEEACDVV 388

Query: 1476 EEFLRHGQTPHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYL 1655
            E  +++G+T H DTW  ++P I   DE E     L + +K E+   TRIV++  GL  YL
Sbjct: 389  EVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDAVKEEITGDTRIVDVGIGLGSYL 448

Query: 1656 IKKKLTRSKN 1685
              K   + KN
Sbjct: 449  SSKLQMKRKN 458


>gb|AAC19289.1| contains similarity to Arabidopsis membrane-associated
            salt-inducible-like protein (GB:AL021637) [Arabidopsis
            thaliana]
          Length = 991

 Score =  516 bits (1328), Expect = e-143
 Identities = 247/424 (58%), Positives = 316/424 (74%)
 Frame = +3

Query: 396  HQEKQRKEEQHLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFH 575
            +   + +  + +    +S IGSP R+ KLIA QSDPLLAKEIFD AS+QPNFRH  ++  
Sbjct: 29   YSSSEHEARKPIVSNPKSPIGSPTRVQKLIASQSDPLLAKEIFDYASQQPNFRHSRSSHL 88

Query: 576  SLILKLGRSGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHF 755
             LILKLGR  +F L+           YP++  +F+ +I++Y +A LP + L TFY++L F
Sbjct: 89   ILILKLGRGRYFNLIDDVLAKHRSSGYPLTGEIFTYLIKVYAEAKLPEKVLSTFYKMLEF 148

Query: 756  NMKPLPKHLNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSI 935
            N  P PKHLNR+LD+LV+HR +L+ A +LF+ +  +GV PNT SYN+LM+AFCLNDDLSI
Sbjct: 149  NFTPQPKHLNRILDVLVSHRGYLQKAFELFKSSRLHGVMPNTRSYNLLMQAFCLNDDLSI 208

Query: 936  AYSLFNQMSKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLN 1115
            AY LF +M +RDVVPD++SY+ILIQGFCRK QVN A++LL+DMLNKGFVPD  SY+TLLN
Sbjct: 209  AYQLFGKMLERDVVPDVDSYKILIQGFCRKGQVNGAMELLDDMLNKGFVPDRLSYTTLLN 268

Query: 1116 SLCRKKHLKAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLP 1295
            SLCRK  L+ AYKLLCRMKLKGCNPD+VHYNT+ILGFCRE RA+DA KVL+DM  NGC P
Sbjct: 269  SLCRKTQLREAYKLLCRMKLKGCNPDLVHYNTMILGFCREDRAMDARKVLDDMLSNGCSP 328

Query: 1296 NLVSYQTLIGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVL 1475
            N VSY+TLIGGL DQG++DE +KY  EM+S+GF+PHFSV   LVKGFC+ GK+EEAC V+
Sbjct: 329  NSVSYRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFSVSNCLVKGFCSFGKVEEACDVV 388

Query: 1476 EEFLRHGQTPHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYL 1655
            E  +++G+T H DTW  ++P I   DE E     L + +K E+   TRIV++  GL  YL
Sbjct: 389  EVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDAVKEEITGDTRIVDVGIGLGSYL 448

Query: 1656 IKKK 1667
             K K
Sbjct: 449  SKNK 452


>ref|XP_006605274.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At4g01400, mitochondrial-like, partial [Glycine
            max]
          Length = 403

 Score =  491 bits (1265), Expect = e-136
 Identities = 239/374 (63%), Positives = 293/374 (78%)
 Frame = +3

Query: 561  YATFHSLILKLGRSGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFY 740
            Y+++  L+LKLGRS HF  +          ++PI+P+LF+ + ++Y +A LP +ALKTFY
Sbjct: 29   YSSYLILLLKLGRSKHFTFLDGLLRPLKSDSHPITPTLFTYLFKVYPEADLPDKALKTFY 88

Query: 741  RILHFNMKPLPKHLNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLN 920
             ILHFN KPLPKHLNR+L++LV+HRN+LRPA DLF+ +  YGV P+T S NILMR FCLN
Sbjct: 89   TILHFNCKPLPKHLNRILEVLVSHRNYLRPAFDLFKDSRSYGVEPDTKSCNILMRPFCLN 148

Query: 921  DDLSIAYSLFNQMSKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSY 1100
             D+SIAYSLFN M KRDVVPDIESYRIL+Q  CRKS+VN AVDLLEDMLN GFVPD+ +Y
Sbjct: 149  GDISIAYSLFNIMFKRDVVPDIESYRILMQALCRKSRVNGAVDLLEDMLN-GFVPDSLTY 207

Query: 1101 STLLNSLCRKKHLKAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPP 1280
            +TLLNSLCRKK  + AYKLLCRMK+KGCNPDIVH NTVILGFCR+GR  DACKV+ DM  
Sbjct: 208  TTLLNSLCRKKKFREAYKLLCRMKVKGCNPDIVHXNTVILGFCRDGRTHDACKVISDMRA 267

Query: 1281 NGCLPNLVSYQTLIGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEE 1460
            NG LPNLVSY+TL+ GL + G+ DEA KY  EMLS+ F+PHF+VV+ LVKGFCN+G+ E+
Sbjct: 268  NGSLPNLVSYRTLVSGLCNMGMLDEASKYMEEMLSKDFSPHFAVVHALVKGFCNVGRTED 327

Query: 1461 ACGVLEEFLRHGQTPHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAG 1640
            ACGVL + L HG+ PH+DTWM I+P I EVD++      L EVLKIE+K  TRIV+   G
Sbjct: 328  ACGVLTKALEHGEAPHVDTWMIIMPVICEVDDEGKSSGALEEVLKIEIKGHTRIVDAGIG 387

Query: 1641 LEEYLIKKKLTRSK 1682
            LE YLI K  +RS+
Sbjct: 388  LENYLIGKIRSRSR 401


>ref|XP_004308275.1| PREDICTED: uncharacterized protein LOC101307637 [Fragaria vesca
            subsp. vesca]
          Length = 2481

 Score =  454 bits (1168), Expect = e-125
 Identities = 226/386 (58%), Positives = 285/386 (73%), Gaps = 2/386 (0%)
 Frame = +3

Query: 444  ESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHFPLMQ 623
            ES +GSPAR+ KLIA QSDPLLAKEIFD A++ P+FRH Y+++ +LILKLGR+ +F L+ 
Sbjct: 35   ESILGSPARVQKLIASQSDPLLAKEIFDFAAQHPHFRHSYSSYFTLILKLGRAHYFSLVD 94

Query: 624  XXXXXXXXX--NYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRVLD 797
                       +Y  SP+LF+ +I+IYGDA LP +AL+TFY +  FN KP  KHLNR+L+
Sbjct: 95   DLLLRLKSQPTSYSPSPALFTHLIKIYGDAHLPQKALRTFYTMFQFNCKPTVKHLNRILE 154

Query: 798  ILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRDVV 977
            ILV HRNFLR A D+FR AH++GV P+T SYNILMRAFCLN DLS+AY LFN+M +RDVV
Sbjct: 155  ILVAHRNFLRSAFDVFRDAHRHGVVPDTKSYNILMRAFCLNGDLSVAYGLFNKMYERDVV 214

Query: 978  PDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAYKL 1157
            PD+ESYRIL+QG CRK QVN +VD LEDM+NKGFVPD+ SY++L                
Sbjct: 215  PDVESYRILMQGLCRKGQVNTSVDFLEDMMNKGFVPDSLSYTSL---------------- 258

Query: 1158 LCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIGGLSD 1337
               MK+KGCNPDIVHYNTVI GFCREGRA+DACKVLEDM            +TL+ GL D
Sbjct: 259  ---MKVKGCNPDIVHYNTVISGFCREGRAVDACKVLEDM------------ETLVSGLCD 303

Query: 1338 QGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTPHIDT 1517
            QG+ DEA+KY   M+ +GF+PHFSVV+ LVKGFCN+G++E+ACGV+EE LRHG+ PH DT
Sbjct: 304  QGMLDEAKKYMEVMILKGFSPHFSVVHGLVKGFCNVGRIEDACGVMEEILRHGEVPHRDT 363

Query: 1518 WMEILPRISEVDEKENFDCILNEVLK 1595
            W+ I+P I E  E    + +  +++K
Sbjct: 364  WITIIPGICEEIELVRLEEVWKQIMK 389


>ref|XP_006837400.1| hypothetical protein AMTR_s00111p00140430 [Amborella trichopoda]
            gi|548840018|gb|ERN00254.1| hypothetical protein
            AMTR_s00111p00140430 [Amborella trichopoda]
          Length = 429

 Score =  446 bits (1146), Expect = e-122
 Identities = 220/385 (57%), Positives = 281/385 (72%)
 Frame = +3

Query: 447  SSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHFPLMQX 626
            S+IGSPAR+ KLIA Q DPLLA EIFDLASRQPNF   Y++FHSLILKLGR   F LM+ 
Sbjct: 43   SAIGSPARVQKLIASQPDPLLAYEIFDLASRQPNFTPSYSSFHSLILKLGRHRQFSLMEK 102

Query: 627  XXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRVLDILV 806
                      P++P LFS +I IYGD+G+P +++KTF+++L F  KP+ KH N ++ +LV
Sbjct: 103  LISKLKSEGRPVTPGLFSDVITIYGDSGMPDQSVKTFFKMLEFQCKPVAKHFNALILVLV 162

Query: 807  NHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRDVVPDI 986
             H N ++ A  LF+   K+G+S NT ++NILM+AFC  D LSIAY LFNQM K+ +VPD+
Sbjct: 163  EH-NRVQVAYSLFKDLEKFGISANTETFNILMKAFCFYDKLSIAYKLFNQMFKQGLVPDV 221

Query: 987  ESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAYKLLCR 1166
            ESYRIL+QG CRKSQV  A++  +DM+NKGFVPD  SY+TLLNSLCRKK L+ AYK+LCR
Sbjct: 222  ESYRILMQGLCRKSQVKTALNFFDDMMNKGFVPDALSYNTLLNSLCRKKKLREAYKMLCR 281

Query: 1167 MKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIGGLSDQGL 1346
            MK+KGCNPDI+HYNTVI GF REGRA DACKVLE+MP NGCLPN +SY+TL+ GL  +G 
Sbjct: 282  MKVKGCNPDILHYNTVITGFVREGRASDACKVLEEMPSNGCLPNSLSYRTLVDGLCKEGK 341

Query: 1347 YDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTPHIDTWME 1526
              EA+ Y  EM+ +GF PH S ++ LV   C  GK++EAC +++     G  PH  TW  
Sbjct: 342  LVEAKHYLGEMICKGFMPHTSSLHFLVVRICGGGKIDEACEMVKAAGNIGMAPHAKTWEL 401

Query: 1527 ILPRISEVDEKENFDCILNEVLKIE 1601
            ++ RI +VDE    + IL EV+K E
Sbjct: 402  VMQRIFDVDE-VRIEAILREVVKRE 425

