BLASTX nr result

ID: Catharanthus22_contig00003128 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00003128
         (2849 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006362890.1| PREDICTED: pentatricopeptide repeat-containi...   620   e-174
ref|XP_004251386.1| PREDICTED: pentatricopeptide repeat-containi...   616   e-173
emb|CAN72416.1| hypothetical protein VITISV_027905 [Vitis vinifera]   595   e-167
gb|EOX95584.1| Pentatricopeptide repeat-containing protein, mito...   590   e-166
ref|XP_006491485.1| PREDICTED: conserved oligomeric Golgi comple...   588   e-165
gb|EMJ20615.1| hypothetical protein PRUPE_ppa021922mg [Prunus pe...   580   e-162
ref|XP_003626608.1| Pentatricopeptide repeat-containing protein ...   578   e-162
gb|ESW11013.1| hypothetical protein PHAVU_009G258200g, partial [...   577   e-161
ref|XP_002320901.2| hypothetical protein POPTR_0014s10150g [Popu...   575   e-161
gb|EPS71710.1| hypothetical protein M569_03047, partial [Genlise...   560   e-156
gb|AHB18410.1| pentatricopeptide repeat-containing protein [Goss...   551   e-154
ref|XP_006444724.1| hypothetical protein CICLE_v10023955mg, part...   548   e-153
gb|EXC13666.1| hypothetical protein L484_019627 [Morus notabilis]     536   e-149
ref|XP_004494974.1| PREDICTED: conserved oligomeric Golgi comple...   521   e-145
ref|XP_004138384.1| PREDICTED: pentatricopeptide repeat-containi...   518   e-144
ref|NP_001154199.1| uncharacterized protein [Arabidopsis thalian...   516   e-143
gb|AAC19289.1| contains similarity to Arabidopsis membrane-assoc...   516   e-143
ref|XP_006605274.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   491   e-136
ref|XP_004308275.1| PREDICTED: uncharacterized protein LOC101307...   454   e-124
ref|XP_006837400.1| hypothetical protein AMTR_s00111p00140430 [A...   446   e-122

>ref|XP_006362890.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01400,
            mitochondrial-like [Solanum tuberosum]
          Length = 479

 Score =  620 bits (1599), Expect = e-174
 Identities = 315/477 (66%), Positives = 368/477 (77%), Gaps = 19/477 (3%)
 Frame = +1

Query: 328  LRSIANHCSLPQLFA--SCSSTQLK--------------HHPQESHQEKQRKEE--QHLK 453
            +R +++H S   L A   CSST L               +H Q+  Q+++R++E  +H +
Sbjct: 1    MRMLSHHFSSKDLLALVMCSSTWLSKVEPLSAWYKFKSHYHTQQPEQDRKRRQEDEEHKQ 60

Query: 454  RKEES-SIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHF 630
               +  SIGSPAR+ KLIA QSDPLLAKEIFDLASR+P+F+H YATFH+LILKLGRS  F
Sbjct: 61   NMNQGPSIGSPARVQKLIASQSDPLLAKEIFDLASREPDFQHSYATFHTLILKLGRSRQF 120

Query: 631  PLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRV 810
             LMQ         +Y ISPSLFS+IIQIYGDAGLP +ALKTFY IL FNMKPLPKHLN +
Sbjct: 121  SLMQSVFSSLKSQHYSISPSLFSRIIQIYGDAGLPDKALKTFYTILEFNMKPLPKHLNLI 180

Query: 811  LDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRD 990
            L+ILV HRNFLRPA DLFR AH YGV  NT SYNILMRAFCLNDDLSIAYSLFNQM KR+
Sbjct: 181  LEILVTHRNFLRPAFDLFRSAHTYGVLANTESYNILMRAFCLNDDLSIAYSLFNQMFKRE 240

Query: 991  VVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAY 1170
            + P++ESYRIL+QG CRKSQVN AVDLLEDMLNKGFVPD  SYSTLLNSLCRKK  K AY
Sbjct: 241  ISPNVESYRILMQGLCRKSQVNTAVDLLEDMLNKGFVPDALSYSTLLNSLCRKKKFKEAY 300

Query: 1171 KLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIGGL 1350
            KLLCRMK+KGCNPDIVHYNTVILGFCREGRA DACK+LEDMP NGCLPNLVSY+TL+GGL
Sbjct: 301  KLLCRMKVKGCNPDIVHYNTVILGFCREGRAADACKILEDMPSNGCLPNLVSYRTLVGGL 360

Query: 1351 SDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTPHI 1530
            S+QG+YDEA+ Y  EM+S+GF+PHFSVV+ +VKGFCN+GK+EEACGV    L HG+  H 
Sbjct: 361  SNQGMYDEAKNYMVEMMSKGFSPHFSVVHTVVKGFCNLGKIEEACGVAGSILSHGEPLHT 420

Query: 1531 DTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKKKLTRSK 1701
            DTW EI+ RI E D  E     L E+++ E+KP  RIVE  A L EYL+    ++S+
Sbjct: 421  DTWEEIVSRILEWDAAEKIGNTLVELIQAEIKPEMRIVEAGARLGEYLMNSIKSKSR 477


>ref|XP_004251386.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01400,
            mitochondrial-like [Solanum lycopersicum]
          Length = 479

 Score =  616 bits (1589), Expect = e-173
 Identities = 313/477 (65%), Positives = 361/477 (75%), Gaps = 19/477 (3%)
 Frame = +1

Query: 328  LRSIANHCSLPQLFA--SCSSTQL------------KHH-----PQESHQEKQRKEEQHL 450
            +R +++H S   L     CSS +L            K H     P++  +++Q  EE   
Sbjct: 1    MRMLSHHFSSKDLLVLVMCSSARLSKAEPLSAWYKFKSHYHTQQPEQDRKQRQADEEHKQ 60

Query: 451  KRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHF 630
               +  SIGSPAR+ KLIA QSDPLLAKEIFDLASR+P+F+H YATFH+LILKLGRS  F
Sbjct: 61   NTNQGPSIGSPARVQKLIASQSDPLLAKEIFDLASREPDFQHSYATFHTLILKLGRSRQF 120

Query: 631  PLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRV 810
             LMQ         +Y ISPSLFS IIQIYGDAGLP  ALKTFY IL FNMKPLPKHLN +
Sbjct: 121  SLMQSVLSSLKSQHYSISPSLFSHIIQIYGDAGLPDRALKTFYTILEFNMKPLPKHLNLI 180

Query: 811  LDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRD 990
            L+ILV HRNFLRPA DLFR AH YGV  NT SYNILMRAFCLNDDLSIAYSLFNQM KR+
Sbjct: 181  LEILVTHRNFLRPAFDLFRSAHTYGVLANTESYNILMRAFCLNDDLSIAYSLFNQMFKRE 240

Query: 991  VVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAY 1170
            + P++ESYRIL+QG CRKSQVN AVDLLEDMLNKGFVPD  SYSTLLNSLCRKK  K AY
Sbjct: 241  ISPNVESYRILMQGLCRKSQVNTAVDLLEDMLNKGFVPDALSYSTLLNSLCRKKKFKEAY 300

Query: 1171 KLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIGGL 1350
            KLLCRMK+KGCNPDIVHYNTVILGFCREGRA DACK+LEDMP NGCLPNLVSY+TL+GGL
Sbjct: 301  KLLCRMKVKGCNPDIVHYNTVILGFCREGRAADACKILEDMPSNGCLPNLVSYRTLVGGL 360

Query: 1351 SDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTPHI 1530
            SDQG+YDEA+ Y  EM+S+GF+PHFSVV+ +VKGFCN+GK+EEACGV    L HG+  H 
Sbjct: 361  SDQGMYDEAKNYMVEMMSKGFSPHFSVVHAVVKGFCNLGKIEEACGVAGSILSHGEPLHT 420

Query: 1531 DTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKKKLTRSK 1701
            DTW EI+  I E D  E     L ++++ E+KP TRIVE  A L EYL+    ++S+
Sbjct: 421  DTWEEIVSIILEWDAAEKIGNTLVQLIQAEIKPETRIVEAGARLGEYLMNNIKSKSR 477


>emb|CAN72416.1| hypothetical protein VITISV_027905 [Vitis vinifera]
          Length = 422

 Score =  595 bits (1533), Expect = e-167
 Identities = 288/421 (68%), Positives = 340/421 (80%)
 Frame = +1

Query: 439  EQHLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGR 618
            E H+K    S IGSP+R+ KLIA QSDPLLAKEIFDLAS QPNF+H Y++FH LILKLG 
Sbjct: 3    EPHVK---PSPIGSPSRVQKLIASQSDPLLAKEIFDLASLQPNFKHSYSSFHILILKLGW 59

Query: 619  SGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKH 798
            +  F LMQ          Y I+PSLFS II+IYG+A LP +ALKTF+ +L F+ KPLPKH
Sbjct: 60   ARQFSLMQDLLMRLKSEQYSINPSLFSDIIEIYGEANLPDQALKTFHSMLQFHSKPLPKH 119

Query: 799  LNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQM 978
            LN +L +LV+HRN++RPA DLF+ AH+YGVSP+T SYNILM AFC N DLSIAY+LFNQM
Sbjct: 120  LNXLLQLLVSHRNYIRPAFDLFKSAHRYGVSPDTKSYNILMSAFCFNGDLSIAYTLFNQM 179

Query: 979  SKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHL 1158
             KRDV PD+ESYRIL+QG CRKSQVN+AVDLLEDMLNKG+VPD  SY+TLLNSLCRKK L
Sbjct: 180  FKRDVAPDVESYRILMQGLCRKSQVNRAVDLLEDMLNKGYVPDALSYTTLLNSLCRKKKL 239

Query: 1159 KAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTL 1338
            K AYKLLCRMK+KGCNPDIVHYNTVILGFCREGR LDACKVLEDMP NGC PNL+SY TL
Sbjct: 240  KEAYKLLCRMKVKGCNPDIVHYNTVILGFCREGRXLDACKVLEDMPSNGCSPNLMSYGTL 299

Query: 1339 IGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQ 1518
            + GL DQGLYDEA+ Y  EMLS+GF+PHFSV + L+ GFCN+GKLEEAC VL E LRHG+
Sbjct: 300  VSGLCDQGLYDEAKNYVEEMLSKGFSPHFSVFHALINGFCNVGKLEEACEVLXEMLRHGE 359

Query: 1519 TPHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKKKLTRS 1698
              H +TW+ I+PRI EVD+    + I +E LK+E+ P+TR+VE   GLEEY+I+K   +S
Sbjct: 360  AXHTETWVAIIPRICEVDKLVRMENIFDEXLKLEITPNTRLVEAGIGLEEYVIRKVRDKS 419

Query: 1699 K 1701
            +
Sbjct: 420  R 420


>gb|EOX95584.1| Pentatricopeptide repeat-containing protein, mitochondrial [Theobroma
            cacao]
          Length = 461

 Score =  590 bits (1522), Expect = e-166
 Identities = 287/421 (68%), Positives = 343/421 (81%)
 Frame = +1

Query: 439  EQHLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGR 618
            +Q   R   S+IGSPAR+ KLI+ QSDPLLAKEIFD AS Q  FRH Y++F  LILKLGR
Sbjct: 39   KQQPPRTCTSAIGSPARVPKLISAQSDPLLAKEIFDYASNQLGFRHSYSSFLVLILKLGR 98

Query: 619  SGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKH 798
            S HF L+           YP++P+LFS +I+IY +A LP  ALKTFY++L FN+KPLPKH
Sbjct: 99   SKHFSLVDDLLIRLKTDRYPVTPTLFSYLIKIYAEANLPERALKTFYKMLEFNIKPLPKH 158

Query: 799  LNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQM 978
            LNR+L++LV+HRNFL PA DLF+ AHK+GV PNT SYNILM AFCLN DLS+AY LFN+M
Sbjct: 159  LNRILELLVSHRNFLMPAFDLFKNAHKHGVLPNTKSYNILMGAFCLNGDLSVAYKLFNKM 218

Query: 979  SKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHL 1158
             +RDVVPD+ESYRIL+QG CRKSQVN AVDLLED+LNKGF+PD+ SY+TLLNSLCRKK L
Sbjct: 219  FERDVVPDVESYRILMQGLCRKSQVNTAVDLLEDILNKGFIPDSLSYTTLLNSLCRKKKL 278

Query: 1159 KAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTL 1338
            + AYKLLCRMK+KGCNPD+VHYNTVILGFCREGRALDA KVLEDMP NGCLPNLVSY+TL
Sbjct: 279  REAYKLLCRMKVKGCNPDLVHYNTVILGFCREGRALDAVKVLEDMPSNGCLPNLVSYRTL 338

Query: 1339 IGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQ 1518
            IGGL DQG++DEA+KY  EML +GF+PHFSV + LVKGFCN+GK+EEA GV  E L++G+
Sbjct: 339  IGGLCDQGMFDEAKKYMEEMLIKGFSPHFSVSHTLVKGFCNVGKIEEAIGVFGEMLKYGE 398

Query: 1519 TPHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKKKLTRS 1698
             PH+DTW+ I+PRI E  E E    IL EV+K+E+K  TRIV+   GLE+YLI+K  +RS
Sbjct: 399  VPHMDTWVLIIPRICEDYETERMGEILEEVMKVEIKRDTRIVDAGTGLEDYLIRKIRSRS 458

Query: 1699 K 1701
            K
Sbjct: 459  K 459


>ref|XP_006491485.1| PREDICTED: conserved oligomeric Golgi complex subunit 4-like [Citrus
            sinensis]
          Length = 1352

 Score =  588 bits (1515), Expect = e-165
 Identities = 281/436 (64%), Positives = 350/436 (80%)
 Frame = +1

Query: 406  QESHQEKQRKEEQHLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYA 585
            QES    ++++E  +   + S IGSP R+ KLIA QSDPLLAKEIFD ASRQPNFRH  +
Sbjct: 38   QESPSSPEQQQESSISNSK-SPIGSPCRVQKLIASQSDPLLAKEIFDYASRQPNFRHSNS 96

Query: 586  TFHSLILKLGRSGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRI 765
            T+  LILKLGR+ +F L+          +YP++PSLF+ +I+IY ++ LP  ALKTF  +
Sbjct: 97   TYLILILKLGRAKYFSLIDDILITLKSEHYPVTPSLFTYLIKIYAESNLPDRALKTFRSM 156

Query: 766  LHFNMKPLPKHLNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDD 945
            L FN KPLPK LNR+L++LV HRN+LRPA DLF+ AHK+GV PNT SYNI+MRAFC N D
Sbjct: 157  LEFNCKPLPKQLNRILELLVTHRNYLRPAFDLFKSAHKHGVLPNTKSYNIMMRAFCFNGD 216

Query: 946  LSIAYSLFNQMSKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYST 1125
            +SIAY+LFN+M +R V+PD+ESYRIL+QG CRKSQVN+AVDLLEDMLNKGFVPDT SY+T
Sbjct: 217  ISIAYTLFNKMFERGVMPDVESYRILMQGLCRKSQVNRAVDLLEDMLNKGFVPDTLSYTT 276

Query: 1126 LLNSLCRKKHLKAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNG 1305
            LLNSLCRKK L+ AYKLLCRMK+KGCNPDIVHYNTV+LGFCREGRA+DACKVLEDMP NG
Sbjct: 277  LLNSLCRKKKLREAYKLLCRMKVKGCNPDIVHYNTVVLGFCREGRAIDACKVLEDMPSNG 336

Query: 1306 CLPNLVSYQTLIGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEAC 1485
            CLPNLVSY+TL+GGL DQG++D A+KY   M+S+GF+PHFSV + L+KGFCN+GK++EAC
Sbjct: 337  CLPNLVSYRTLVGGLCDQGMFDVAKKYMQLMISKGFSPHFSVSHALIKGFCNVGKVDEAC 396

Query: 1486 GVLEEFLRHGQTPHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLE 1665
            GVLEE L+ G+ PH DTW+ I+P+I   +E E    +LNE++K+E+K  TRIVE   GLE
Sbjct: 397  GVLEELLKAGEAPHEDTWVMIVPQICAGEEMEKLGEVLNEIVKVEIKGDTRIVEAGIGLE 456

Query: 1666 EYLIKKKLTRSKNK*F 1713
            +YLI K  +R + + F
Sbjct: 457  DYLIGKTRSRPRREKF 472


>gb|EMJ20615.1| hypothetical protein PRUPE_ppa021922mg [Prunus persica]
          Length = 465

 Score =  580 bits (1494), Expect = e-162
 Identities = 277/415 (66%), Positives = 338/415 (81%)
 Frame = +1

Query: 439  EQHLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGR 618
            + H +  E  SIGSP+RI  LIA QSDPLLAKEIFDLA+RQP+FRH Y++F +LILKLGR
Sbjct: 42   QPHNQNHEIGSIGSPSRIQNLIASQSDPLLAKEIFDLAARQPHFRHSYSSFFTLILKLGR 101

Query: 619  SGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKH 798
            S +F L+          NY +SP+LF+ +I+IYG+A LP +AL+TFY ++ F+ +P  KH
Sbjct: 102  SKYFSLVDDLLIRLKTQNYSVSPALFAHLIKIYGEANLPQKALRTFYTMVEFDCRPSVKH 161

Query: 799  LNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQM 978
            LNR+L ILV+HRNFLRPA D+F+ AH++GV PNT SYNILMRAFCLN DLSIAY LFN+M
Sbjct: 162  LNRILQILVSHRNFLRPAFDVFKDAHRHGVMPNTQSYNILMRAFCLNGDLSIAYQLFNKM 221

Query: 979  SKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHL 1158
             +RD+VPD++SYRIL+QG CRK QVN AVD LEDMLNKGFVPD+ SY++LLNSLCRKK L
Sbjct: 222  FERDLVPDVQSYRILMQGLCRKGQVNTAVDFLEDMLNKGFVPDSLSYTSLLNSLCRKKKL 281

Query: 1159 KAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTL 1338
            + AYKLLCRMK+KGCNPDIVHYNTVILGFCREGR +DACKVLEDM  NGCLPNLVSY+TL
Sbjct: 282  REAYKLLCRMKVKGCNPDIVHYNTVILGFCREGRPVDACKVLEDMASNGCLPNLVSYRTL 341

Query: 1339 IGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQ 1518
            + GL D G+ DEA+ Y   M+SRGF+PHFSVV+ LVKGFCN+G++EEA  VLEE L+HG+
Sbjct: 342  VSGLCDHGMLDEAKSYMETMISRGFSPHFSVVHALVKGFCNVGRVEEAFAVLEEVLKHGE 401

Query: 1519 TPHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKK 1683
             PH DTW+ I+P I E  E E  + IL EV+K+E++P+TRIVE   GLE+YLIKK
Sbjct: 402  VPHTDTWLTIVPGICEEIELERLEEILREVMKVEIRPNTRIVEAAIGLEDYLIKK 456


>ref|XP_003626608.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|87240852|gb|ABD32710.1| Tetratricopeptide-like helical
            [Medicago truncatula] gi|355501623|gb|AES82826.1|
            Pentatricopeptide repeat-containing protein [Medicago
            truncatula]
          Length = 451

 Score =  578 bits (1490), Expect = e-162
 Identities = 282/420 (67%), Positives = 337/420 (80%), Gaps = 1/420 (0%)
 Frame = +1

Query: 445  HLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSG 624
            H      S IGSP R+ KLIA QSDPLLAKEIFD AS QPNFRH Y+T+  LILK GRS 
Sbjct: 30   HSSSSSSSPIGSPTRVQKLIASQSDPLLAKEIFDYASLQPNFRHNYSTYLILILKFGRSK 89

Query: 625  HFPLMQXXXXXXXXXN-YPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHL 801
            HF L+          +  PI+P+LFS +I+IYG+A LP +AL TFY +L FN+KPL KHL
Sbjct: 90   HFSLLDDLLRRLKSESSQPITPTLFSYLIKIYGEANLPDKALNTFYIMLQFNIKPLTKHL 149

Query: 802  NRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMS 981
            NR+LDILV+HRN+LRPA DLF+ AHK+GV P+T SYNILMRAFCLN D+SIAY+LFN+M 
Sbjct: 150  NRILDILVSHRNYLRPAFDLFKDAHKHGVFPDTKSYNILMRAFCLNGDISIAYTLFNKMF 209

Query: 982  KRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLK 1161
            KRDVVPDI+SYRIL+Q  CRKSQVN AVDL EDMLNKGFVPD+++Y+TLLNSLCRKK L+
Sbjct: 210  KRDVVPDIQSYRILMQALCRKSQVNGAVDLFEDMLNKGFVPDSFTYTTLLNSLCRKKKLR 269

Query: 1162 AAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLI 1341
             AYKLLCRMK+KGCNPDIVHYNTVILGFCREGRA DACKV++DM  NGCLPNLVSY+TL+
Sbjct: 270  EAYKLLCRMKVKGCNPDIVHYNTVILGFCREGRAHDACKVIDDMQANGCLPNLVSYRTLV 329

Query: 1342 GGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQT 1521
             GL   G+ DEA KY  EMLS+GF+PHF+V++ LVKGFCN+G++EEACGVL + L H + 
Sbjct: 330  NGLCHLGMLDEATKYVEEMLSKGFSPHFAVIHALVKGFCNVGRIEEACGVLTKSLEHREA 389

Query: 1522 PHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKKKLTRSK 1701
            PH DTWM I+P+I EVD+    D +L EVLKIE+K  TRIV+   GLE+YLI+K   +S+
Sbjct: 390  PHKDTWMIIVPQICEVDDGVKIDGVLEEVLKIEIKGDTRIVDAGIGLEDYLIRKIRAKSR 449


>gb|ESW11013.1| hypothetical protein PHAVU_009G258200g, partial [Phaseolus vulgaris]
          Length = 418

 Score =  577 bits (1487), Expect = e-161
 Identities = 280/409 (68%), Positives = 330/409 (80%)
 Frame = +1

Query: 475  GSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHFPLMQXXXX 654
            GSP R+ KLIA QSDPLLAKEIFD+ASRQPNFRH Y+T+  LILKLGRS +F  +     
Sbjct: 8    GSPTRVQKLIASQSDPLLAKEIFDVASRQPNFRHTYSTYLILILKLGRSKNFSFIDHLLR 67

Query: 655  XXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRVLDILVNHR 834
                 + PI+P+LF+ +I++Y +A LP +ALKTFY ILHF+ KPLPKHLNR+L++LV+HR
Sbjct: 68   CLRSDSQPITPTLFTYLIRVYAEADLPEKALKTFYNILHFDCKPLPKHLNRILELLVSHR 127

Query: 835  NFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRDVVPDIESY 1014
            N++RPA  LF+ AH+YGV PNT SYNILMRAFCLN D+SIAYSLFN+M KRDVVPDIESY
Sbjct: 128  NYIRPAFLLFKDAHRYGVEPNTKSYNILMRAFCLNGDISIAYSLFNKMFKRDVVPDIESY 187

Query: 1015 RILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAYKLLCRMKL 1194
            RIL+Q  CRKSQVN AVDLLEDMLNKGFVPD+ +Y+TLLNSLCRKK L+ AYKLLCRMK+
Sbjct: 188  RILMQALCRKSQVNGAVDLLEDMLNKGFVPDSLTYTTLLNSLCRKKKLREAYKLLCRMKV 247

Query: 1195 KGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIGGLSDQGLYDE 1374
            KGCNPDIVHYNTVILGFCREGRA DACKV+ DM  NGCLPNLVSY+TL  GL D G+ DE
Sbjct: 248  KGCNPDIVHYNTVILGFCREGRAHDACKVIADMRANGCLPNLVSYRTLARGLCDMGMLDE 307

Query: 1375 ARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTPHIDTWMEILP 1554
            ARKY  EML +GF+PHF+VV+ LVKGFCN+G+ E+ACGVL   L HG+ PH+DTWM ++P
Sbjct: 308  ARKYVEEMLCKGFSPHFAVVHALVKGFCNVGRAEDACGVLTMSLEHGEAPHVDTWMVLMP 367

Query: 1555 RISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKKKLTRSK 1701
             I EVD+       L EVLKIE+K  TRIV+   GLE YLIKK    S+
Sbjct: 368  VICEVDDGGKISGALEEVLKIEIKGHTRIVDAGIGLENYLIKKIRANSR 416


>ref|XP_002320901.2| hypothetical protein POPTR_0014s10150g [Populus trichocarpa]
            gi|550323886|gb|EEE99216.2| hypothetical protein
            POPTR_0014s10150g [Populus trichocarpa]
          Length = 475

 Score =  575 bits (1481), Expect = e-161
 Identities = 274/429 (63%), Positives = 339/429 (79%)
 Frame = +1

Query: 397  HHPQESHQEKQRKEEQHLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRH 576
            HH Q+ H+ +    + H     +S IGSP+R+ KLIA QSDPLLAKEIFD ASRQPNF+H
Sbjct: 40   HHHQQ-HKRELEPSDSHPNANTKSPIGSPSRVQKLIASQSDPLLAKEIFDYASRQPNFQH 98

Query: 577  PYATFHSLILKLGRSGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTF 756
             Y+++  LILKLGR+ +F  +          NYP++ +LFS II IYG A LP EALK F
Sbjct: 99   SYSSYLILILKLGRAKYFSFIDDLLTDLKSKNYPVTQTLFSYIINIYGKANLPDEALKIF 158

Query: 757  YRILHFNMKPLPKHLNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCL 936
            Y IL F+  P PKHLN +L+ILV+H N+++PA DLF+ AH Y V PNT SYNIL+RAFCL
Sbjct: 159  YTILKFDCNPSPKHLNGILEILVSHHNYIKPAFDLFKDAHTYDVFPNTKSYNILIRAFCL 218

Query: 937  NDDLSIAYSLFNQMSKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYS 1116
            N  +S+AYSLFNQM KRDV+PD+ESYRIL+Q  CRKSQVN AVDLLEDMLNKG+VPD  S
Sbjct: 219  NGQISMAYSLFNQMFKRDVMPDVESYRILMQALCRKSQVNGAVDLLEDMLNKGYVPDALS 278

Query: 1117 YSTLLNSLCRKKHLKAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMP 1296
            Y+TLLNSLCRKK L+ AYKLLCRMK+KGCNPDI+HYNTVILGFCREGRA+DACKVLEDM 
Sbjct: 279  YTTLLNSLCRKKKLREAYKLLCRMKVKGCNPDIIHYNTVILGFCREGRAMDACKVLEDME 338

Query: 1297 PNGCLPNLVSYQTLIGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLE 1476
             NGC+PNLVSY+TL+GGL DQG++DEA+ +  EM+ +GF+PHF+V   L+KGFCN+GK+E
Sbjct: 339  SNGCMPNLVSYRTLVGGLCDQGMFDEAKSHLEEMMMKGFSPHFAVSNALIKGFCNVGKIE 398

Query: 1477 EACGVLEEFLRHGQTPHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRA 1656
            EACGV+EE L+HG+ PH +TW+ ++ RI EVD+ +    IL++V K+E+K  TRIVE   
Sbjct: 399  EACGVVEELLKHGEAPHTETWVMMVSRICEVDDLQRIGEILDKVKKVELKGDTRIVEAGI 458

Query: 1657 GLEEYLIKK 1683
            GLEEYLIK+
Sbjct: 459  GLEEYLIKR 467


>gb|EPS71710.1| hypothetical protein M569_03047, partial [Genlisea aurea]
          Length = 407

 Score =  560 bits (1443), Expect = e-156
 Identities = 277/412 (67%), Positives = 328/412 (79%), Gaps = 3/412 (0%)
 Frame = +1

Query: 451  KRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHF 630
            K   +S IGSPARI KLIA Q DPLLAKEIFDLASRQP F+H YATFH+LI KLGRS HF
Sbjct: 1    KENAQSCIGSPARIQKLIASQKDPLLAKEIFDLASRQPGFQHSYATFHTLIDKLGRSRHF 60

Query: 631  PLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRV 810
             LM+            +SPSLFS+II+ YGDA LP +ALKTFY IL FNMKPL KHLNR+
Sbjct: 61   GLMENIILSLKLQRCSVSPSLFSRIIRFYGDANLPDKALKTFYTILEFNMKPLRKHLNRI 120

Query: 811  LDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRD 990
            L+ILV++RN LRPA D+FR AH+YGVSPNT SYNI+MRAFCLNDDLSIAY+LFNQM KRD
Sbjct: 121  LEILVSNRNLLRPAFDIFRAAHRYGVSPNTESYNIMMRAFCLNDDLSIAYTLFNQMFKRD 180

Query: 991  VVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAY 1170
            +VP++ESYRIL+QG CRKSQVNKAVDLLEDM+NKG+VPD+ SY+TLLNSLCRKK LK AY
Sbjct: 181  IVPNVESYRILMQGLCRKSQVNKAVDLLEDMMNKGYVPDSLSYTTLLNSLCRKKKLKEAY 240

Query: 1171 KLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVL-EDMPPNGCLPNLVSYQTLIGG 1347
            KLLCRMK++GCNPDIVHYNTVI GFC+ GRA DACK++ EDMP  GCLPNLVSYQ L+GG
Sbjct: 241  KLLCRMKVRGCNPDIVHYNTVISGFCKSGRASDACKIVEEDMPSKGCLPNLVSYQNLVGG 300

Query: 1348 LSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFL--RHGQT 1521
            L DQG+YDEA++Y   M+SR F+PHFSVV++LV+G+C  G  EEAC VL + L  + G  
Sbjct: 301  LCDQGMYDEAKRYVKVMVSRDFSPHFSVVHMLVRGYCKTGSHEEACEVLVDLLMMKRGGC 360

Query: 1522 PHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLI 1677
            PH+++W E+LP +  + E E  +  +  +L    KPSTRIV+   G  EYLI
Sbjct: 361  PHLESWAEVLPHV--IRESEGLESKMKGIL---AKPSTRIVDSGVGWAEYLI 407


>gb|AHB18410.1| pentatricopeptide repeat-containing protein [Gossypium hirsutum]
          Length = 458

 Score =  551 bits (1420), Expect = e-154
 Identities = 270/412 (65%), Positives = 325/412 (78%)
 Frame = +1

Query: 466  SSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHFPLMQX 645
            S I SP R+ KLI+  SDPLLA+EIFD+A  QP FRH Y++F  LILKLGRS HF L+  
Sbjct: 47   SPIASPTRVLKLISAWSDPLLAEEIFDVAITQPGFRHSYSSFLVLILKLGRSKHFSLVDD 106

Query: 646  XXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRVLDILV 825
                     Y ++P+LFS +I+IY +A LP +AL  FY++L FN+KPLP+HLNR+L++LV
Sbjct: 107  LLVCLKSDQYRVTPTLFSYLIKIYAEADLPEKALSVFYKMLEFNVKPLPRHLNRILELLV 166

Query: 826  NHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRDVVPDI 1005
            +HRNF+ PA DLF+ AHKYGV PNT SYNILM AFCLN DLSIAY LFN+M +RDV+PDI
Sbjct: 167  SHRNFIMPAFDLFKTAHKYGVFPNTKSYNILMGAFCLNGDLSIAYKLFNKMLERDVMPDI 226

Query: 1006 ESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAYKLLCR 1185
            ESY IL+QG CRKSQVN+AVDLLED LNKGF PD+ SYSTLLNSLCRKK L+ AYKLLCR
Sbjct: 227  ESYGILMQGLCRKSQVNRAVDLLEDRLNKGFAPDSLSYSTLLNSLCRKKKLREAYKLLCR 286

Query: 1186 MKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIGGLSDQGL 1365
            MK+KGCNPDIVHYNTVILGFCREGRA+ A KVLEDMP NGCLPNLVSY+TL+G L DQG+
Sbjct: 287  MKVKGCNPDIVHYNTVILGFCREGRAMGAVKVLEDMPSNGCLPNLVSYRTLVGWLCDQGM 346

Query: 1366 YDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTPHIDTWME 1545
            +DEA+K+  EMLS+GF+ HFSV + L+KGFC++GK++ A  VL E L + + PH DTW  
Sbjct: 347  FDEAKKHMEEMLSKGFSSHFSVSHALIKGFCSVGKIDAATEVLGEMLEYREVPHTDTWGT 406

Query: 1546 ILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKKKLTRSK 1701
            I+P I E  E E  + IL EV+KIE+K  TRIVE   GLE+YLI+K   RSK
Sbjct: 407  IVPTICEDYETEKMEEILEEVMKIEIKRDTRIVEAGIGLEDYLIRKIRNRSK 458


>ref|XP_006444724.1| hypothetical protein CICLE_v10023955mg, partial [Citrus clementina]
            gi|557546986|gb|ESR57964.1| hypothetical protein
            CICLE_v10023955mg, partial [Citrus clementina]
          Length = 423

 Score =  548 bits (1412), Expect = e-153
 Identities = 259/385 (67%), Positives = 318/385 (82%)
 Frame = +1

Query: 406  QESHQEKQRKEEQHLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYA 585
            QES    ++++E  +   + S IGSP R+ KLIA QSDPLLAKEIFD ASRQPNFRH  +
Sbjct: 38   QESPSSPEQQQESSISNSK-SPIGSPCRVQKLIASQSDPLLAKEIFDYASRQPNFRHSNS 96

Query: 586  TFHSLILKLGRSGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRI 765
            T+  LILKLGR+ +F L+          +YP++PSLF+ +I+IY ++ LP  ALKTF  +
Sbjct: 97   TYLILILKLGRAKYFSLIDDILITLKSEHYPVTPSLFTYLIKIYAESNLPDRALKTFRSM 156

Query: 766  LHFNMKPLPKHLNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDD 945
            L FN KPLPK LNR+L++LV HRN+LRPA DLF+ AHK+GV PNT SYNI+MRAFC N D
Sbjct: 157  LEFNCKPLPKQLNRILELLVTHRNYLRPAFDLFKSAHKHGVLPNTKSYNIMMRAFCFNGD 216

Query: 946  LSIAYSLFNQMSKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYST 1125
            +SIAY+LFN+M +R V+PD+ESYRIL+QG CRKSQVN+AVDLLEDMLNKGFVPDT SY+T
Sbjct: 217  ISIAYTLFNKMFERGVMPDVESYRILMQGLCRKSQVNRAVDLLEDMLNKGFVPDTLSYTT 276

Query: 1126 LLNSLCRKKHLKAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNG 1305
            LLNSLCRKK L+ AYKLLCRMK+KGCNPDIVHYNTV+LGFCREGRA+DACKVLEDMP NG
Sbjct: 277  LLNSLCRKKKLREAYKLLCRMKVKGCNPDIVHYNTVVLGFCREGRAIDACKVLEDMPSNG 336

Query: 1306 CLPNLVSYQTLIGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEAC 1485
            CLPNLVSY+TL+GGL DQG++D A+KY   M+S+GF+PHFSV + L+KGFCN+GK++EAC
Sbjct: 337  CLPNLVSYRTLVGGLCDQGMFDVAKKYMQLMISKGFSPHFSVSHALIKGFCNVGKVDEAC 396

Query: 1486 GVLEEFLRHGQTPHIDTWMEILPRI 1560
            GVLEE L+ G+ PH DTW+ I+P+I
Sbjct: 397  GVLEELLKAGEAPHEDTWVMIVPQI 421


>gb|EXC13666.1| hypothetical protein L484_019627 [Morus notabilis]
          Length = 458

 Score =  536 bits (1381), Expect = e-149
 Identities = 259/402 (64%), Positives = 322/402 (80%), Gaps = 1/402 (0%)
 Frame = +1

Query: 478  SPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHFPLMQXXXXX 657
            SP+R+ KLI  QSDPLLAKEIFD ASRQPNFRH Y++F  LILKLGRS +F L+      
Sbjct: 47   SPSRVQKLIVSQSDPLLAKEIFDYASRQPNFRHSYSSFLILILKLGRSKYFSLIDNLLVR 106

Query: 658  XXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRVLDILVNHRN 837
                 YP++ +LFS +I+IYG+A LP + L+TFY ++ F+ KPLPKHLN++L+ILV++R+
Sbjct: 107  LKAERYPVTSTLFSHLIRIYGEADLPDKVLRTFYMMIEFDFKPLPKHLNQILEILVSYRS 166

Query: 838  FLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRDVVPDIESYR 1017
             +  A DLF+ AH+YGV  NT SYNI+MR FCLN DLSIAY LFN+M +RD+VP+ ESYR
Sbjct: 167  HILSAFDLFKSAHRYGVLLNTESYNIMMRVFCLNGDLSIAYQLFNKMFERDLVPNDESYR 226

Query: 1018 ILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAYKLLCRMKLK 1197
            IL+QG CRK QVN AVD LEDMLNKGF PDT SY+TLLNSLCRKK L+ AYKLLCRMK+K
Sbjct: 227  ILMQGLCRKGQVNTAVDFLEDMLNKGFTPDTLSYTTLLNSLCRKKQLREAYKLLCRMKVK 286

Query: 1198 GCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIGGLSDQGLYDEA 1377
            GCNPDIVHYNTVI+GFCREGRA+DACKVLEDM  NGCLPN+VSY++L+ GL  QG  DEA
Sbjct: 287  GCNPDIVHYNTVIVGFCREGRAMDACKVLEDMAENGCLPNVVSYRSLVSGLCHQGSLDEA 346

Query: 1378 RKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTPHIDTWMEILPR 1557
            ++Y  EM+S+G +PHFSVV+ LVKGFCN+G++EE CG+L E L+HG+ PH+DTW+ ILPR
Sbjct: 347  KRYMEEMMSKGLSPHFSVVHALVKGFCNVGRVEETCGILAESLKHGEVPHMDTWIAILPR 406

Query: 1558 ISEVDEKENFDCILNEVLKI-EVKPSTRIVEIRAGLEEYLIK 1680
            I E +E E+ D IL  VLKI +V+  T++ E R  LE+ L+K
Sbjct: 407  ICEENEIESLDEILKGVLKIDQVQLGTKMHEPRTCLEDPLMK 448


>ref|XP_004494974.1| PREDICTED: conserved oligomeric Golgi complex subunit 4-like [Cicer
            arietinum]
          Length = 1302

 Score =  521 bits (1341), Expect = e-145
 Identities = 259/419 (61%), Positives = 311/419 (74%)
 Frame = +1

Query: 445  HLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSG 624
            H      S IGSP R+ KLIA QSDPLLAKEIFD AS QPNFRH Y+T+  L+LK GRS 
Sbjct: 42   HSYSNSSSPIGSPTRVQKLIASQSDPLLAKEIFDYASLQPNFRHTYSTYLILLLKFGRSK 101

Query: 625  HFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLN 804
            HF L+          + PI+P+LFS +IQIY  A LP +AL TFY +L FN KPL KHLN
Sbjct: 102  HFSLLDDLLRRLKSDSQPITPTLFSYLIQIYAQADLPDKALNTFYTMLQFNCKPLTKHLN 161

Query: 805  RVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSK 984
            R+L  LV+HRN++RPA DLF+ AHK+GV P+T SYNILMRAFCLN D+SIAY+LFN+M +
Sbjct: 162  RILVFLVSHRNYVRPAFDLFKDAHKHGVFPDTKSYNILMRAFCLNGDISIAYTLFNKMFQ 221

Query: 985  RDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKA 1164
            RDV+PDIESYRIL+Q  CRKSQVN AVDLLEDMLNKGFVPD+ +Y+TLLN          
Sbjct: 222  RDVIPDIESYRILMQALCRKSQVNGAVDLLEDMLNKGFVPDSLTYTTLLNR--------- 272

Query: 1165 AYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIG 1344
                        CNPDIVHYNTVILGFCREGRA DACKVL+DM  NGCLPNLVSY+TL+ 
Sbjct: 273  ------------CNPDIVHYNTVILGFCREGRASDACKVLDDMRANGCLPNLVSYRTLVN 320

Query: 1345 GLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTP 1524
            GL D G+ DEA KY  EM+S+GF+PHF+V++ LVKG CNIG++EEACGVL + L H + P
Sbjct: 321  GLCDLGMLDEATKYVEEMMSKGFSPHFAVIHALVKGLCNIGRIEEACGVLTKSLEHREAP 380

Query: 1525 HIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKKKLTRSK 1701
            H DTWM ++P+I EVD+      +L EVLKIE+K  TRIV+   GLE+YLI+K   +S+
Sbjct: 381  HTDTWMIVVPQICEVDDGLKIGGVLEEVLKIEIKGHTRIVDAGIGLEDYLIRKIRAKSR 439


>ref|XP_004138384.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01400,
            mitochondrial-like [Cucumis sativus]
            gi|449499186|ref|XP_004160743.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g01400,
            mitochondrial-like [Cucumis sativus]
          Length = 482

 Score =  518 bits (1335), Expect = e-144
 Identities = 265/471 (56%), Positives = 338/471 (71%), Gaps = 8/471 (1%)
 Frame = +1

Query: 295  HVLPVHSYRSRLRSIANHCS-----LPQLFASCSSTQLKH---HPQESHQEKQRKEEQHL 450
            H+L   +YR+     A H +     L  L +S SS    H   H +        K EQ  
Sbjct: 4    HLLRPCNYRTIETVAAAHVARKSPLLRNLISSSSSLYQPHLNVHNESKFLITNVKHEQ-C 62

Query: 451  KRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHF 630
            + + + SIGSP R+ KLIA QSDPLLAKEIFD A RQP+FR   ++   LILKLGRS +F
Sbjct: 63   EDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSKYF 122

Query: 631  PLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRV 810
             L+           YP++P+ FS II+IYG+A LP +ALK FY ++ F   P  K LNR+
Sbjct: 123  SLIDDLLLSFKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQLNRI 182

Query: 811  LDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRD 990
            L+ILV+HRNF+RPA DLF+ A  +GV PNT SYNIL+RAFC N ++SIAY+LFN+M +R+
Sbjct: 183  LEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMFERN 242

Query: 991  VVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAY 1170
            V+PD+E+YR L+QG CRK+QVN AVDLLEDMLNKG++PDT SY+TLLNSLCRKK L+ AY
Sbjct: 243  VIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAY 302

Query: 1171 KLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIGGL 1350
            KLLCRMK+KGCNPDI HYNTVI+GFCREGRALDACK+LEDM  NGCLPNLVSY++L  GL
Sbjct: 303  KLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTNGL 362

Query: 1351 SDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTPHI 1530
             DQG+++ A+ Y  EM  +GF PHFSV++ LVKGF +IG++ E+C VLE+ L+ G+ PH 
Sbjct: 363  CDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHS 422

Query: 1531 DTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYLIKK 1683
            DTW  I+  I EV++   F  +  ++LK +V+  TRIVE   GL EYLI+K
Sbjct: 423  DTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTRIVEAGTGLGEYLIRK 473


>ref|NP_001154199.1| uncharacterized protein [Arabidopsis thaliana]
            gi|223635643|sp|Q8LDU5.2|PP298_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g01400, mitochondrial; Flags: Precursor
            gi|332656621|gb|AEE82021.1| uncharacterized protein
            AT4G01400 [Arabidopsis thaliana]
          Length = 466

 Score =  516 bits (1330), Expect = e-143
 Identities = 248/430 (57%), Positives = 318/430 (73%)
 Frame = +1

Query: 415  HQEKQRKEEQHLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFH 594
            +   + +  + +    +S IGSP R+ KLIA QSDPLLAKEIFD AS+QPNFRH  ++  
Sbjct: 29   YSSSEHEARKPIVSNPKSPIGSPTRVQKLIASQSDPLLAKEIFDYASQQPNFRHSRSSHL 88

Query: 595  SLILKLGRSGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHF 774
             LILKLGR  +F L+           YP++  +F+ +I++Y +A LP + L TFY++L F
Sbjct: 89   ILILKLGRGRYFNLIDDVLAKHRSSGYPLTGEIFTYLIKVYAEAKLPEKVLSTFYKMLEF 148

Query: 775  NMKPLPKHLNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSI 954
            N  P PKHLNR+LD+LV+HR +L+ A +LF+ +  +GV PNT SYN+LM+AFCLNDDLSI
Sbjct: 149  NFTPQPKHLNRILDVLVSHRGYLQKAFELFKSSRLHGVMPNTRSYNLLMQAFCLNDDLSI 208

Query: 955  AYSLFNQMSKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLN 1134
            AY LF +M +RDVVPD++SY+ILIQGFCRK QVN A++LL+DMLNKGFVPD  SY+TLLN
Sbjct: 209  AYQLFGKMLERDVVPDVDSYKILIQGFCRKGQVNGAMELLDDMLNKGFVPDRLSYTTLLN 268

Query: 1135 SLCRKKHLKAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLP 1314
            SLCRK  L+ AYKLLCRMKLKGCNPD+VHYNT+ILGFCRE RA+DA KVL+DM  NGC P
Sbjct: 269  SLCRKTQLREAYKLLCRMKLKGCNPDLVHYNTMILGFCREDRAMDARKVLDDMLSNGCSP 328

Query: 1315 NLVSYQTLIGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVL 1494
            N VSY+TLIGGL DQG++DE +KY  EM+S+GF+PHFSV   LVKGFC+ GK+EEAC V+
Sbjct: 329  NSVSYRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFSVSNCLVKGFCSFGKVEEACDVV 388

Query: 1495 EEFLRHGQTPHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYL 1674
            E  +++G+T H DTW  ++P I   DE E     L + +K E+   TRIV++  GL  YL
Sbjct: 389  EVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDAVKEEITGDTRIVDVGIGLGSYL 448

Query: 1675 IKKKLTRSKN 1704
              K   + KN
Sbjct: 449  SSKLQMKRKN 458


>gb|AAC19289.1| contains similarity to Arabidopsis membrane-associated
            salt-inducible-like protein (GB:AL021637) [Arabidopsis
            thaliana]
          Length = 991

 Score =  516 bits (1328), Expect = e-143
 Identities = 247/424 (58%), Positives = 316/424 (74%)
 Frame = +1

Query: 415  HQEKQRKEEQHLKRKEESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFH 594
            +   + +  + +    +S IGSP R+ KLIA QSDPLLAKEIFD AS+QPNFRH  ++  
Sbjct: 29   YSSSEHEARKPIVSNPKSPIGSPTRVQKLIASQSDPLLAKEIFDYASQQPNFRHSRSSHL 88

Query: 595  SLILKLGRSGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHF 774
             LILKLGR  +F L+           YP++  +F+ +I++Y +A LP + L TFY++L F
Sbjct: 89   ILILKLGRGRYFNLIDDVLAKHRSSGYPLTGEIFTYLIKVYAEAKLPEKVLSTFYKMLEF 148

Query: 775  NMKPLPKHLNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSI 954
            N  P PKHLNR+LD+LV+HR +L+ A +LF+ +  +GV PNT SYN+LM+AFCLNDDLSI
Sbjct: 149  NFTPQPKHLNRILDVLVSHRGYLQKAFELFKSSRLHGVMPNTRSYNLLMQAFCLNDDLSI 208

Query: 955  AYSLFNQMSKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLN 1134
            AY LF +M +RDVVPD++SY+ILIQGFCRK QVN A++LL+DMLNKGFVPD  SY+TLLN
Sbjct: 209  AYQLFGKMLERDVVPDVDSYKILIQGFCRKGQVNGAMELLDDMLNKGFVPDRLSYTTLLN 268

Query: 1135 SLCRKKHLKAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLP 1314
            SLCRK  L+ AYKLLCRMKLKGCNPD+VHYNT+ILGFCRE RA+DA KVL+DM  NGC P
Sbjct: 269  SLCRKTQLREAYKLLCRMKLKGCNPDLVHYNTMILGFCREDRAMDARKVLDDMLSNGCSP 328

Query: 1315 NLVSYQTLIGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVL 1494
            N VSY+TLIGGL DQG++DE +KY  EM+S+GF+PHFSV   LVKGFC+ GK+EEAC V+
Sbjct: 329  NSVSYRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFSVSNCLVKGFCSFGKVEEACDVV 388

Query: 1495 EEFLRHGQTPHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAGLEEYL 1674
            E  +++G+T H DTW  ++P I   DE E     L + +K E+   TRIV++  GL  YL
Sbjct: 389  EVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDAVKEEITGDTRIVDVGIGLGSYL 448

Query: 1675 IKKK 1686
             K K
Sbjct: 449  SKNK 452


>ref|XP_006605274.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At4g01400, mitochondrial-like, partial [Glycine
            max]
          Length = 403

 Score =  491 bits (1265), Expect = e-136
 Identities = 239/374 (63%), Positives = 293/374 (78%)
 Frame = +1

Query: 580  YATFHSLILKLGRSGHFPLMQXXXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFY 759
            Y+++  L+LKLGRS HF  +          ++PI+P+LF+ + ++Y +A LP +ALKTFY
Sbjct: 29   YSSYLILLLKLGRSKHFTFLDGLLRPLKSDSHPITPTLFTYLFKVYPEADLPDKALKTFY 88

Query: 760  RILHFNMKPLPKHLNRVLDILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLN 939
             ILHFN KPLPKHLNR+L++LV+HRN+LRPA DLF+ +  YGV P+T S NILMR FCLN
Sbjct: 89   TILHFNCKPLPKHLNRILEVLVSHRNYLRPAFDLFKDSRSYGVEPDTKSCNILMRPFCLN 148

Query: 940  DDLSIAYSLFNQMSKRDVVPDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSY 1119
             D+SIAYSLFN M KRDVVPDIESYRIL+Q  CRKS+VN AVDLLEDMLN GFVPD+ +Y
Sbjct: 149  GDISIAYSLFNIMFKRDVVPDIESYRILMQALCRKSRVNGAVDLLEDMLN-GFVPDSLTY 207

Query: 1120 STLLNSLCRKKHLKAAYKLLCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPP 1299
            +TLLNSLCRKK  + AYKLLCRMK+KGCNPDIVH NTVILGFCR+GR  DACKV+ DM  
Sbjct: 208  TTLLNSLCRKKKFREAYKLLCRMKVKGCNPDIVHXNTVILGFCRDGRTHDACKVISDMRA 267

Query: 1300 NGCLPNLVSYQTLIGGLSDQGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEE 1479
            NG LPNLVSY+TL+ GL + G+ DEA KY  EMLS+ F+PHF+VV+ LVKGFCN+G+ E+
Sbjct: 268  NGSLPNLVSYRTLVSGLCNMGMLDEASKYMEEMLSKDFSPHFAVVHALVKGFCNVGRTED 327

Query: 1480 ACGVLEEFLRHGQTPHIDTWMEILPRISEVDEKENFDCILNEVLKIEVKPSTRIVEIRAG 1659
            ACGVL + L HG+ PH+DTWM I+P I EVD++      L EVLKIE+K  TRIV+   G
Sbjct: 328  ACGVLTKALEHGEAPHVDTWMIIMPVICEVDDEGKSSGALEEVLKIEIKGHTRIVDAGIG 387

Query: 1660 LEEYLIKKKLTRSK 1701
            LE YLI K  +RS+
Sbjct: 388  LENYLIGKIRSRSR 401


>ref|XP_004308275.1| PREDICTED: uncharacterized protein LOC101307637 [Fragaria vesca
            subsp. vesca]
          Length = 2481

 Score =  454 bits (1168), Expect = e-124
 Identities = 226/386 (58%), Positives = 285/386 (73%), Gaps = 2/386 (0%)
 Frame = +1

Query: 463  ESSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHFPLMQ 642
            ES +GSPAR+ KLIA QSDPLLAKEIFD A++ P+FRH Y+++ +LILKLGR+ +F L+ 
Sbjct: 35   ESILGSPARVQKLIASQSDPLLAKEIFDFAAQHPHFRHSYSSYFTLILKLGRAHYFSLVD 94

Query: 643  XXXXXXXXX--NYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRVLD 816
                       +Y  SP+LF+ +I+IYGDA LP +AL+TFY +  FN KP  KHLNR+L+
Sbjct: 95   DLLLRLKSQPTSYSPSPALFTHLIKIYGDAHLPQKALRTFYTMFQFNCKPTVKHLNRILE 154

Query: 817  ILVNHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRDVV 996
            ILV HRNFLR A D+FR AH++GV P+T SYNILMRAFCLN DLS+AY LFN+M +RDVV
Sbjct: 155  ILVAHRNFLRSAFDVFRDAHRHGVVPDTKSYNILMRAFCLNGDLSVAYGLFNKMYERDVV 214

Query: 997  PDIESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAYKL 1176
            PD+ESYRIL+QG CRK QVN +VD LEDM+NKGFVPD+ SY++L                
Sbjct: 215  PDVESYRILMQGLCRKGQVNTSVDFLEDMMNKGFVPDSLSYTSL---------------- 258

Query: 1177 LCRMKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIGGLSD 1356
               MK+KGCNPDIVHYNTVI GFCREGRA+DACKVLEDM            +TL+ GL D
Sbjct: 259  ---MKVKGCNPDIVHYNTVISGFCREGRAVDACKVLEDM------------ETLVSGLCD 303

Query: 1357 QGLYDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTPHIDT 1536
            QG+ DEA+KY   M+ +GF+PHFSVV+ LVKGFCN+G++E+ACGV+EE LRHG+ PH DT
Sbjct: 304  QGMLDEAKKYMEVMILKGFSPHFSVVHGLVKGFCNVGRIEDACGVMEEILRHGEVPHRDT 363

Query: 1537 WMEILPRISEVDEKENFDCILNEVLK 1614
            W+ I+P I E  E    + +  +++K
Sbjct: 364  WITIIPGICEEIELVRLEEVWKQIMK 389


>ref|XP_006837400.1| hypothetical protein AMTR_s00111p00140430 [Amborella trichopoda]
            gi|548840018|gb|ERN00254.1| hypothetical protein
            AMTR_s00111p00140430 [Amborella trichopoda]
          Length = 429

 Score =  446 bits (1146), Expect = e-122
 Identities = 220/385 (57%), Positives = 281/385 (72%)
 Frame = +1

Query: 466  SSIGSPARIHKLIAVQSDPLLAKEIFDLASRQPNFRHPYATFHSLILKLGRSGHFPLMQX 645
            S+IGSPAR+ KLIA Q DPLLA EIFDLASRQPNF   Y++FHSLILKLGR   F LM+ 
Sbjct: 43   SAIGSPARVQKLIASQPDPLLAYEIFDLASRQPNFTPSYSSFHSLILKLGRHRQFSLMEK 102

Query: 646  XXXXXXXXNYPISPSLFSQIIQIYGDAGLPGEALKTFYRILHFNMKPLPKHLNRVLDILV 825
                      P++P LFS +I IYGD+G+P +++KTF+++L F  KP+ KH N ++ +LV
Sbjct: 103  LISKLKSEGRPVTPGLFSDVITIYGDSGMPDQSVKTFFKMLEFQCKPVAKHFNALILVLV 162

Query: 826  NHRNFLRPALDLFRLAHKYGVSPNTISYNILMRAFCLNDDLSIAYSLFNQMSKRDVVPDI 1005
             H N ++ A  LF+   K+G+S NT ++NILM+AFC  D LSIAY LFNQM K+ +VPD+
Sbjct: 163  EH-NRVQVAYSLFKDLEKFGISANTETFNILMKAFCFYDKLSIAYKLFNQMFKQGLVPDV 221

Query: 1006 ESYRILIQGFCRKSQVNKAVDLLEDMLNKGFVPDTYSYSTLLNSLCRKKHLKAAYKLLCR 1185
            ESYRIL+QG CRKSQV  A++  +DM+NKGFVPD  SY+TLLNSLCRKK L+ AYK+LCR
Sbjct: 222  ESYRILMQGLCRKSQVKTALNFFDDMMNKGFVPDALSYNTLLNSLCRKKKLREAYKMLCR 281

Query: 1186 MKLKGCNPDIVHYNTVILGFCREGRALDACKVLEDMPPNGCLPNLVSYQTLIGGLSDQGL 1365
            MK+KGCNPDI+HYNTVI GF REGRA DACKVLE+MP NGCLPN +SY+TL+ GL  +G 
Sbjct: 282  MKVKGCNPDILHYNTVITGFVREGRASDACKVLEEMPSNGCLPNSLSYRTLVDGLCKEGK 341

Query: 1366 YDEARKYTSEMLSRGFAPHFSVVYLLVKGFCNIGKLEEACGVLEEFLRHGQTPHIDTWME 1545
              EA+ Y  EM+ +GF PH S ++ LV   C  GK++EAC +++     G  PH  TW  
Sbjct: 342  LVEAKHYLGEMICKGFMPHTSSLHFLVVRICGGGKIDEACEMVKAAGNIGMAPHAKTWEL 401

Query: 1546 ILPRISEVDEKENFDCILNEVLKIE 1620
            ++ RI +VDE    + IL EV+K E
Sbjct: 402  VMQRIFDVDE-VRIEAILREVVKRE 425


Top