BLASTX nr result

ID: Zingiber25_contig00005896 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber25_contig00005896
         (1736 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EMJ24063.1| hypothetical protein PRUPE_ppa004279mg [Prunus pe...   411   e-112
gb|ESW35794.1| hypothetical protein PHAVU_001G265200g [Phaseolus...   409   e-111
ref|XP_003552730.1| PREDICTED: pentatricopeptide repeat-containi...   407   e-110
ref|XP_006421046.1| hypothetical protein CICLE_v10004784mg [Citr...   395   e-107
gb|EXC14264.1| hypothetical protein L484_021763 [Morus notabilis]     394   e-107
ref|XP_002268109.1| PREDICTED: pentatricopeptide repeat-containi...   391   e-106
ref|XP_003538531.1| PREDICTED: pentatricopeptide repeat-containi...   391   e-106
ref|XP_006855721.1| hypothetical protein AMTR_s00044p00151840 [A...   389   e-105
gb|EOY05094.1| Pentatricopeptide repeat-containing protein, puta...   387   e-104
ref|XP_004298231.1| PREDICTED: pentatricopeptide repeat-containi...   384   e-104
ref|XP_002516403.1| pentatricopeptide repeat-containing protein,...   384   e-104
ref|XP_002321108.2| hypothetical protein POPTR_0014s14700g [Popu...   381   e-103
ref|NP_193155.4| pentatricopeptide repeat-containing protein [Ar...   352   4e-94
ref|XP_006414780.1| hypothetical protein EUTSA_v10027442mg [Eutr...   349   2e-93
ref|XP_002868305.1| binding protein [Arabidopsis lyrata subsp. l...   346   2e-92
ref|XP_006492991.1| PREDICTED: pentatricopeptide repeat-containi...   342   4e-91
ref|XP_006283572.1| hypothetical protein CARUB_v10004636mg [Caps...   333   1e-88
emb|CAB10198.1| salt-inducible protein homolog [Arabidopsis thal...   328   3e-87
emb|CAN80932.1| hypothetical protein VITISV_017362 [Vitis vinifera]   319   2e-84
ref|XP_002966251.1| hypothetical protein SELMODRAFT_85839 [Selag...   151   1e-33

>gb|EMJ24063.1| hypothetical protein PRUPE_ppa004279mg [Prunus persica]
          Length = 518

 Score =  411 bits (1057), Expect = e-112
 Identities = 206/445 (46%), Positives = 296/445 (66%)
 Frame = +2

Query: 233  PQDWEQRHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLV 412
            P     +H TLLV+TFH +  L++L+      +GSC LQLL +DGDWT+D  WA + FL 
Sbjct: 65   PDSSSTKHTTLLVETFHEHQRLKALLQNLI--NGSCPLQLLGEDGDWTKDQFWAAIRFLK 122

Query: 413  ETGRAEEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSL 592
             T R  E LQ+FD+WKNIE +R N  N+ +II L   EG + EA+  F+  K  ++ PSL
Sbjct: 123  HTFRFNEILQLFDMWKNIEKSRINEFNYSKIIGLLGEEGLIEEAVRCFQEMKSHNLRPSL 182

Query: 593  AIYNTIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKR 772
             +YN++IH  AR+ +F ++      M E  L P  +TY+GL+ AYG + +YD++  CVK+
Sbjct: 183  EVYNSVIHVCARQGNFEDALFFLNEMKEMNLAPETDTYDGLIEAYGKYRMYDQIGMCVKK 242

Query: 773  MELSGCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGN 952
            M+L+GC PD +TYN+LI E+AR GLL++MES Y+ + S+ + LQ+STL++M+E YA+ G 
Sbjct: 243  MKLNGCSPDHITYNLLIREFARGGLLKRMESVYQSMLSRRMALQSSTLIAMVEVYAKFGI 302

Query: 953  LEKIERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYI 1132
            LEK+E VYRR+L     +K DLIRKLA VYI NY F++LE+LG D+ ++ +G+ DLVW +
Sbjct: 303  LEKMENVYRRVLNSGTVVKNDLIRKLAEVYIDNYMFSRLEKLGVDL-SSRFGQTDLVWCL 361

Query: 1133 LLLSSAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCN 1312
             LLS AG++S++G++SI+ EM+E+ VP N  + NI+   YLK++DF  L     Q     
Sbjct: 362  RLLSQAGVLSQRGMDSIVDEMKEQNVPWNETVANIIMLAYLKMKDFTHLRIFLSQLLTQG 421

Query: 1313 IKPDLITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCE 1492
            ++PD+IT GI FDA  IGYDG   L+ WR + +L   V + TD LVL+ FGKG FL+ CE
Sbjct: 422  VEPDIITVGIVFDANRIGYDGSRTLDTWRENGFLRKAVEMNTDPLVLTTFGKGHFLRNCE 481

Query: 1493 KKYITVHSRQKQKKIWRYSDVIRLV 1567
              Y ++    ++ K W Y  +I LV
Sbjct: 482  AAYSSLEPEDRENKTWTYHHLIDLV 506


>gb|ESW35794.1| hypothetical protein PHAVU_001G265200g [Phaseolus vulgaris]
          Length = 496

 Score =  409 bits (1052), Expect = e-111
 Identities = 205/441 (46%), Positives = 300/441 (68%)
 Frame = +2

Query: 245  EQRHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVETGR 424
            + +HRTLLV+T+H ++ LR+L+ +  R   +  + +L +DGDW++D+ WA V FL    R
Sbjct: 57   DSKHRTLLVETYHHHDSLRALLAKLEREDSN-PMYILAQDGDWSKDHFWAAVRFLKNASR 115

Query: 425  AEEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYN 604
              E LQVFD+WK IE +R +  N+ +II L C +  M EA+SAF+  K   + PSL  YN
Sbjct: 116  FVEILQVFDMWKEIEKSRISEFNYNKIIGLLCEDEMMEEALSAFQEMKVQGMKPSLDTYN 175

Query: 605  TIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELS 784
             IIH  ++   F ++      M E+GL P  ETY+GL+ AYG F LYDEM +CVK+MEL 
Sbjct: 176  PIIHGLSKAGKFSDALRFLDEMKESGLDPDSETYDGLIGAYGKFQLYDEMGECVKKMELE 235

Query: 785  GCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKI 964
            GC PD +TYNILI EYAR+G+L++ME  Y+ + SK + LQ+ST V+ML+AY   G +EK+
Sbjct: 236  GCSPDHITYNILIQEYARAGILQRMEKLYQRMLSKRMRLQSSTFVAMLKAYTTFGIVEKM 295

Query: 965  ERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLS 1144
            E  +R++L  K+ +++D IRK+A VYI+NY F++LE+L  D+ +A +G+ DLVW + LLS
Sbjct: 296  EFFFRKVLNSKSCLEDDFIRKMAEVYIKNYMFSRLEDLALDLCSA-FGESDLVWCLRLLS 354

Query: 1145 SAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPD 1324
             A L+SKKG++ ++ EM++ K+  N+   NI+   Y+K++DFR L     Q R+  + PD
Sbjct: 355  YACLLSKKGMDIVVKEMQDAKINWNVAFANIIMLAYVKMKDFRHLRILLSQLRINRLGPD 414

Query: 1325 LITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYI 1504
            ++T GI  DA  IG+DG   LE WRR  YL+ VV ++TD LVL+AFGKG FLK CE+ Y 
Sbjct: 415  IVTIGIVLDASRIGFDGRGALESWRRMGYLDRVVELKTDSLVLTAFGKGHFLKSCEEVYT 474

Query: 1505 TVHSRQKQKKIWRYSDVIRLV 1567
            ++H   +++K W Y+D+I L+
Sbjct: 475  SLHPEDRERKKWTYNDLIALL 495


>ref|XP_003552730.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190,
            chloroplastic-like [Glycine max]
          Length = 509

 Score =  407 bits (1045), Expect = e-110
 Identities = 210/441 (47%), Positives = 298/441 (67%)
 Frame = +2

Query: 245  EQRHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVETGR 424
            + +H TLLV+T+H ++ LR+L+ +  +   +  L +L +DGDW++D+ WA+V FL    R
Sbjct: 58   DTKHTTLLVETYHLHDSLRALLAKLQKEDCN-PLHVLAEDGDWSKDHFWAVVRFLKSASR 116

Query: 425  AEEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYN 604
              + LQVFD+WKNIE +R +  N+ +II L C  G M +A+SA    K   I PSL  YN
Sbjct: 117  FTQILQVFDMWKNIEKSRISEFNYNKIIGLLCEGGKMEDALSALRDMKVQGIKPSLDTYN 176

Query: 605  TIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELS 784
             IIH  +R+  F ++      M E+GL    ETY+GLL AYG F +YDEM +CVK+MEL 
Sbjct: 177  PIIHGLSREGKFSDALRFIDEMKESGLELDSETYDGLLGAYGKFQMYDEMGECVKKMELE 236

Query: 785  GCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKI 964
            GC PD +TYNILI EYAR+GLL++ME  Y+ + SK +++Q+STLV+MLEAY   G +EK+
Sbjct: 237  GCSPDHITYNILIQEYARAGLLQRMEKLYQRMVSKRMHVQSSTLVAMLEAYTTFGMVEKM 296

Query: 965  ERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLS 1144
            E  YR+IL  K  +++DLIRK+A VYI+NY F++LE+L  D+  A +G+ +LVW + LLS
Sbjct: 297  ENFYRKILSSKTCLEDDLIRKVAEVYIKNYMFSRLEDLALDLCPA-FGESNLVWCLRLLS 355

Query: 1145 SAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPD 1324
             A  +SKKG++ ++ EM + KV  N+ + NI+   Y+K++DFR L     Q  +  ++PD
Sbjct: 356  YACPLSKKGMDIVVREMRDAKVNWNVTVANIIMLAYVKMKDFRHLKILLSQLPIYRVQPD 415

Query: 1325 LITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYI 1504
            +IT GI FDA  IG+DG   LE WRR  YL  VV I+TD LVL+AFGKG FLK CE+ Y 
Sbjct: 416  IITIGILFDATRIGFDGSGALETWRRMGYLYRVVEIKTDSLVLTAFGKGHFLKSCEEVYS 475

Query: 1505 TVHSRQKQKKIWRYSDVIRLV 1567
            ++H   +++K W Y D+I L+
Sbjct: 476  SLHPEDRKRKTWTYHDLIALL 496


>ref|XP_006421046.1| hypothetical protein CICLE_v10004784mg [Citrus clementina]
            gi|557522919|gb|ESR34286.1| hypothetical protein
            CICLE_v10004784mg [Citrus clementina]
          Length = 510

 Score =  395 bits (1015), Expect = e-107
 Identities = 191/440 (43%), Positives = 298/440 (67%)
 Frame = +2

Query: 251  RHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVETGRAE 430
            +H TLLV+++H +  L +LI   ++   SC LQ+L+ DGDWT+D+ WA++ FL  + R+ 
Sbjct: 62   KHTTLLVESYHEHQALNALIQRLNKKV-SCPLQILQHDGDWTKDHFWAVIRFLKNSSRSR 120

Query: 431  EALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYNTI 610
            +  QVFD+WKNIE +R N  N  +II + C EG M EA+ AF+  +   + PSL IYN+I
Sbjct: 121  QIPQVFDMWKNIEKSRINEFNSQKIIGMLCEEGLMEEAVRAFQEMEGFALKPSLEIYNSI 180

Query: 611  IHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGC 790
            IH +++   F+ +      M E  L P  +TY+GL++AYG + +YDE+  C+K M+L GC
Sbjct: 181  IHGYSKIGKFNEALLFLNEMKEMNLSPQSDTYDGLIQAYGKYKMYDEIDMCLKMMKLDGC 240

Query: 791  FPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIER 970
             PD +TYN+LI E+A +GLL++ME TY+ + +K ++L++ST+V++L+AY   G L+K+E+
Sbjct: 241  SPDHITYNLLIQEFACAGLLKRMEGTYKSMLTKRMHLRSSTMVAILDAYMNFGMLDKMEK 300

Query: 971  VYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSSA 1150
             Y+R+L  +  +KEDL+RKLA VYI+NY F++L++LG+D+  +  G+ +LVW + LLS A
Sbjct: 301  FYKRLLNSRTPLKEDLVRKLAEVYIKNYMFSRLDDLGDDL-ASRIGRTELVWCLRLLSHA 359

Query: 1151 GLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLI 1330
             L+S +G++S++ EME  KV  N+   NI+   YLK++DF+ L     +    ++KPD++
Sbjct: 360  CLLSHRGIDSVVREMESAKVRWNVTTANIILLAYLKMKDFKHLRVLLSELPTRHVKPDIV 419

Query: 1331 TYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYITV 1510
            T GI +DA  IG+DG   LE W+R  +L   V I TD LVL+ +GKG FL+ CE+ Y ++
Sbjct: 420  TIGILYDARRIGFDGTGALEMWKRIGFLFKTVEINTDPLVLAVYGKGHFLRYCEEVYSSL 479

Query: 1511 HSRQKQKKIWRYSDVIRLVL 1570
                ++KK W Y ++I LV+
Sbjct: 480  EPYSREKKRWTYQNLIDLVI 499


>gb|EXC14264.1| hypothetical protein L484_021763 [Morus notabilis]
          Length = 664

 Score =  394 bits (1013), Expect = e-107
 Identities = 194/444 (43%), Positives = 296/444 (66%)
 Frame = +2

Query: 236  QDWEQRHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVE 415
            Q+    H TLLV+TFH +   ++L+   S+N  SC ++LL +DGDW +++ WA+V FL  
Sbjct: 57   QNSSTEHTTLLVETFHEHRKFKTLLKRLSKND-SCPMRLLREDGDWCKEHFWAVVRFLRH 115

Query: 416  TGRAEEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLA 595
              R +E +QVFDLWKNIE +R N  N+ +IIK+   EG M EA+ +FE  K   + P+L 
Sbjct: 116  GSRTKEIVQVFDLWKNIEKSRINELNYCKIIKMLGEEGLMEEAVLSFEEMKSCGLSPTLE 175

Query: 596  IYNTIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRM 775
            +YN++IH F++K DF ++      M E  ++P  +TY GL+ AY  + +YDE+  C+K+M
Sbjct: 176  VYNSMIHGFSQKGDFDDALVYLNEMREQNVVPETDTYEGLIEAYAKYEMYDEIGLCLKKM 235

Query: 776  ELSGCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNL 955
            +L+GC PD +TYN+L+ ++++ GLL++MES Y  + SK + LQ+STLV+MLE YA  G L
Sbjct: 236  KLNGCPPDHITYNLLMRKFSKGGLLKRMESVYHTMISKRMYLQSSTLVAMLETYARFGIL 295

Query: 956  EKIERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYIL 1135
            +K+E+ Y R LK K  + EDLIRKLA VYI NY F++LE LG D+ T  +G+ DL+W + 
Sbjct: 296  DKMEKFYMRTLKTKTPLGEDLIRKLAEVYIDNYLFSRLETLGVDLST-TFGETDLLWCLR 354

Query: 1136 LLSSAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNI 1315
            LLS A L S+KG++ ++ EME   +P N+   NI+   +LK++DF  L  +  Q    ++
Sbjct: 355  LLSHAFLFSRKGMDFVIQEMERAHIPWNVTFANIILLTHLKMKDFTHLRISLSQL-THSV 413

Query: 1316 KPDLITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEK 1495
            +PD++T GI FDA  +G+DG   LE W+R  +    V + TD +V++AFGKG+FL+ CE+
Sbjct: 414  EPDIVTVGILFDAIGMGFDGTRTLETWKRMDFFYKAVEMNTDPVVITAFGKGNFLQNCER 473

Query: 1496 KYITVHSRQKQKKIWRYSDVIRLV 1567
             Y ++ S  ++ K W Y++++ LV
Sbjct: 474  AYSSLESEVRETKSWTYNNLVDLV 497


>ref|XP_002268109.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190,
            chloroplastic-like [Vitis vinifera]
          Length = 581

 Score =  391 bits (1005), Expect = e-106
 Identities = 197/438 (44%), Positives = 286/438 (65%)
 Frame = +2

Query: 251  RHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVETGRAE 430
            +H TLLV+T H N  L  LI + S N  S  LQLL  DGDW + + WA++ FL +  R+ 
Sbjct: 88   KHTTLLVETLHENERLGVLIQKLS-NKASSPLQLLRDDGDWNKQHFWAVIRFLKDASRSS 146

Query: 431  EALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYNTI 610
            E L VF LWK+++ +R N  N+ +II L   E    E++ A E  K   + PSL IYN +
Sbjct: 147  EILPVFHLWKDMDKSRINEFNYAKIIGLLSQEDLAEESVLALEGMKTHGLKPSLEIYNLV 206

Query: 611  IHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGC 790
            IH FARK +F  +      +    L+   ETY+GL+++YG + +YDE+ +CVK+ME  GC
Sbjct: 207  IHCFARKGEFDRALYFLNELKANNLIADTETYDGLIQSYGKYKMYDELDECVKKMESDGC 266

Query: 791  FPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIER 970
             PD +TYN+LI E++R GLL++ME  ++ + SK + LQ+STLV MLEAYA  G +EK+E 
Sbjct: 267  LPDHITYNLLIQEFSRGGLLKRMERVFQTVLSKKMGLQSSTLVVMLEAYANFGIIEKMEN 326

Query: 971  VYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSSA 1150
             YRR+L  K  +K+DLIRKLA VYI NY+F++L ++G ++ +    + DLVW + LLS A
Sbjct: 327  AYRRVLNSKTSLKDDLIRKLAEVYIENYKFSRLADMGLNLASVT-SRTDLVWCLRLLSHA 385

Query: 1151 GLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLI 1330
             L+S+KG++SI+ EME + VP N  + N +   YLK++DF  L     +    ++KPD++
Sbjct: 386  CLLSRKGLDSIVKEMEAKNVPWNATVANTILLAYLKMKDFTRLRILLLELSTRHVKPDIV 445

Query: 1331 TYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYITV 1510
            T GI FDA  IG++G   L  WRR+ +L++ V + TD LVLSAFGKG+FL+ CE+ Y ++
Sbjct: 446  TVGILFDANRIGFNGTMALNTWRRTGFLDEAVEMNTDPLVLSAFGKGNFLQSCEEMYSSL 505

Query: 1511 HSRQKQKKIWRYSDVIRL 1564
                ++KKIW Y ++I L
Sbjct: 506  EPEARKKKIWTYQNLIDL 523


>ref|XP_003538531.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190,
            chloroplastic-like isoform X1 [Glycine max]
          Length = 506

 Score =  391 bits (1004), Expect = e-106
 Identities = 204/441 (46%), Positives = 294/441 (66%)
 Frame = +2

Query: 245  EQRHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVETGR 424
            + +H TLLV+T+H ++ LR+L+ +   N  S  L +L +D DW++D+ WA+V FL  +  
Sbjct: 56   DTKHTTLLVETYHLHHSLRALLAKLE-NEYSNPLHMLAEDADWSKDHFWAVVRFLKSSSN 114

Query: 425  AEEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYN 604
                LQVFD+WKNIE +R +  N+ +II L C  G M +A+SA +  K   I PSL  YN
Sbjct: 115  FTHILQVFDMWKNIEKSRISEFNYNKIIGLLCEGGKMKDALSALQDMKVQGIKPSLDTYN 174

Query: 605  TIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELS 784
             IIH  +R+  F ++      M E+GL    ETY+GL+ AYG F +YDEM +CVK+MEL 
Sbjct: 175  PIIHGLSREGKFSDALRFIDEMKESGLELDSETYDGLIGAYGKFQMYDEMGECVKKMELE 234

Query: 785  GCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKI 964
            GC PD +TYNILI EYA  GLL++ME  Y+ + SK +++++STLV+MLEAY   G +EK+
Sbjct: 235  GCSPDPITYNILIQEYAGGGLLQRMEKLYQRMLSKRMHVKSSTLVAMLEAYTTFGMVEKM 294

Query: 965  ERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLS 1144
            E+ YR+IL  K  I++DLIRK+A VYI N+ F++LE+L  D+  A +G+ +L W   LLS
Sbjct: 295  EKFYRKILNSKTCIEDDLIRKVAEVYINNFMFSRLEDLALDLCPA-FGESNLEWCFRLLS 353

Query: 1145 SAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPD 1324
             A L+SKKG++ ++ EM++ KV  N+ + NI+   Y+K+++FR L     Q  +  ++PD
Sbjct: 354  YACLLSKKGMDIVVQEMQDAKVSWNVTVANIIMLAYVKMKEFRHLRILLSQLPIYRVQPD 413

Query: 1325 LITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYI 1504
            +IT GI FDA  IG+DG   LE WRR  YL  VV ++TD LVL+AFGKG FLK CE+ Y 
Sbjct: 414  IITIGILFDATRIGFDGSGALETWRRMGYLYRVVEMKTDSLVLTAFGKGHFLKSCEEVYS 473

Query: 1505 TVHSRQKQKKIWRYSDVIRLV 1567
            ++H   +++K   Y D+I L+
Sbjct: 474  SLHPEDRKRKTCTYHDLIPLL 494


>ref|XP_006855721.1| hypothetical protein AMTR_s00044p00151840 [Amborella trichopoda]
            gi|548859508|gb|ERN17188.1| hypothetical protein
            AMTR_s00044p00151840 [Amborella trichopoda]
          Length = 506

 Score =  389 bits (999), Expect = e-105
 Identities = 197/445 (44%), Positives = 291/445 (65%)
 Frame = +2

Query: 233  PQDWEQRHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLV 412
            PQD   +HR LLV  F +   L  LI +     G   L+LL  +GDW +D  WA++  L 
Sbjct: 60   PQD--SKHRALLVQNFFQTQQLLDLIEKIK--GGIDPLKLLRDEGDWNKDQFWAVMKLLK 115

Query: 413  ETGRAEEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSL 592
            ET R +EA+QVFD W N+E +R + SN+ ++I+L    G M EA +  +  K   + P++
Sbjct: 116  ETSRIKEAMQVFDYWVNVERSRLDDSNYTKMIELLVDAGLMDEATTMLKEVKDFGVRPTV 175

Query: 593  AIYNTIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKR 772
            A+YN I+H +A   +F  +N     M + GL+P  ETY+GL+RAYG   +YD+M+KC K+
Sbjct: 176  AVYNFIVHGYANTGNFDKANLFLREMRDLGLVPESETYDGLIRAYGNHRMYDDMAKCAKK 235

Query: 773  MELSGCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGN 952
            ME  G  PD +TYNILI E+AR GL+ +ME  YR L SK + LQ STLV+MLEAYA +G 
Sbjct: 236  MESEGFTPDHLTYNILIREFARGGLMVRMEGAYRTLLSKKMGLQYSTLVAMLEAYAALGC 295

Query: 953  LEKIERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYI 1132
            + ++E V+RR+LK K  +KEDL+RK+A  YI+N+RF++LE+LG  V  +  G+ DL W +
Sbjct: 296  VNEMETVFRRLLKSKIPLKEDLVRKVARAYIKNHRFSRLEDLGLGV-ASKTGRTDLFWCL 354

Query: 1133 LLLSSAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCN 1312
            LLLS A L S+KG++S++ EM+   V  N+   NI A  YLK++D + L+    Q ++ N
Sbjct: 355  LLLSHACLCSRKGIKSVIQEMKSAMVRPNVTFANITALTYLKMKDVQYLDVLLSQLQLLN 414

Query: 1313 IKPDLITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCE 1492
            + PD++T G+  DA   G+D I  L  WR++ +L   V + TD LVL+AFGKG FL+ CE
Sbjct: 415  VNPDIVTVGVVMDAYVSGFDDIKALRMWRKTGFLRRPVEMNTDPLVLTAFGKGYFLRSCE 474

Query: 1493 KKYITVHSRQKQKKIWRYSDVIRLV 1567
            + Y+++ ++ +++K+W Y+D+I LV
Sbjct: 475  ELYLSLGAKGRERKVWTYNDLIDLV 499


>gb|EOY05094.1| Pentatricopeptide repeat-containing protein, putative [Theobroma
            cacao]
          Length = 504

 Score =  387 bits (993), Expect = e-104
 Identities = 201/447 (44%), Positives = 293/447 (65%)
 Frame = +2

Query: 248  QRHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVETGRA 427
            + H  LLV+T+H +  L++L+    ++  SC LQ+L  DGDWT+D  W ++ FL    R+
Sbjct: 60   KNHTALLVETYHHHRRLKALLERLEKDD-SCPLQMLRDDGDWTKDIFWVVIRFLRRASRS 118

Query: 428  EEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYNT 607
             E LQVF +WKNIE +R N  N+ +II L   EG + +A+ A        + PSL +YN+
Sbjct: 119  NEILQVFHMWKNIEKSRINELNYEKIIGLLGEEGRVGQAVQALREMGGYGLKPSLEVYNS 178

Query: 608  IIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSG 787
            IIHA+AR   F ++ +    M E GL P  +TY+GL+ AYG + +YDE+  C+K MEL  
Sbjct: 179  IIHAYARNGKFDDALSFLNEMKEIGLAPETDTYDGLIEAYGKYKMYDEIGTCLKMMELDR 238

Query: 788  CFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIE 967
            C PD  TYN+LI E++R GLL++ME  Y++L SK +NLQ+S+LV+MLEAYA  G L+K+E
Sbjct: 239  CRPDHFTYNLLIREFSRGGLLQRMEQVYQILLSKQMNLQSSSLVAMLEAYANFGILDKME 298

Query: 968  RVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSS 1147
            +VYR+++     +KED IR LA VYI+NY F++L++LG D+ ++  G+ DLVW + LLS 
Sbjct: 299  KVYRKVVN-SMTLKEDTIRILASVYIKNYMFSRLDDLGIDL-SSRTGRNDLVWCLRLLSH 356

Query: 1148 AGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDL 1327
            A L+S+KG++S++ EM E K   N+ I NI+   Y+K++DF+ L     Q     ++PD+
Sbjct: 357  ACLLSRKGMDSVILEMCEAKASWNVTISNIILLAYMKMKDFKRLRILLSQLPSHQVRPDI 416

Query: 1328 ITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYIT 1507
            IT GI  DA +IG+DG   LE WR+   L   V + TD LVL AFGKG FL+ CE+ Y +
Sbjct: 417  ITIGILSDAIEIGFDGAEALETWRKMGLLYRTVEMNTDPLVLIAFGKGHFLRDCEEIYTS 476

Query: 1508 VHSRQKQKKIWRYSDVIRLVLG*KAKR 1588
            +  + +++K W Y  +I LV+  KAKR
Sbjct: 477  LEPKARKEKRWTYHHLIDLVIKHKAKR 503


>ref|XP_004298231.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 509

 Score =  384 bits (985), Expect = e-104
 Identities = 194/435 (44%), Positives = 278/435 (63%)
 Frame = +2

Query: 254  HRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVETGRAEE 433
            H TL V+  H  + LR+L+ +       C LQLL  DGDWT D  WA++ FL+   R +E
Sbjct: 68   HTTLHVEPSHEYHKLRALL-DILMEKDCCPLQLLRDDGDWTIDQFWAVIRFLIHASRPKE 126

Query: 434  ALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYNTII 613
             LQ+FD+W+NIE +R N  N+ +II L   E  + EA+  F+  K   +  S+ +YNTII
Sbjct: 127  ILQLFDIWRNIEKSRINEFNYSKIIGLLVEEDLIEEAVVCFQDMKSQGLGLSVELYNTII 186

Query: 614  HAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGCF 793
            H  +R  +F ++      M E  L P  +TY+GL+ AYG + +YDEM  C+K+M L+GC 
Sbjct: 187  HGLSRNGNFVDAVHFLNEMKEMNLAPDADTYDGLIEAYGKYKMYDEMGMCLKKMRLNGCS 246

Query: 794  PDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIERV 973
            PD +TYN+LI E+A  GLL ++E  Y+ + S+ ++LQ  TL+++LE YA+ G LEK+E  
Sbjct: 247  PDYITYNLLIREFAHGGLLNRVERVYQSMVSRRMDLQVPTLIAILEVYAKFGILEKMEVF 306

Query: 974  YRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSSAG 1153
            YRR+L  +A +KEDLI+K+A VYI NY F++LE LG D+ +  +G+ DLVW + LLS AG
Sbjct: 307  YRRVLNSRAILKEDLIKKVAEVYIENYMFSKLENLGVDL-SPRFGQTDLVWCLRLLSHAG 365

Query: 1154 LVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLIT 1333
            L+S++G+ SI+ EME + VP N  + NI+   YLK++DF  L   F Q+    + PD+IT
Sbjct: 366  LLSRRGMNSIILEMEGKSVPWNATVANIMMLAYLKMKDFTRLRSLFSQSLTRGVDPDIIT 425

Query: 1334 YGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYITVH 1513
            +GI FDA  IGYDG   L  WR+   L   V + TD LV++ FGKG FL+ CE  Y ++ 
Sbjct: 426  FGILFDANRIGYDGSATLNTWRKHGILYKAVEMNTDPLVITTFGKGHFLRNCEAAYSSLE 485

Query: 1514 SRQKQKKIWRYSDVI 1558
               ++KK W Y D+I
Sbjct: 486  PEVREKKTWTYQDLI 500


>ref|XP_002516403.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223544501|gb|EEF46020.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 502

 Score =  384 bits (985), Expect = e-104
 Identities = 192/446 (43%), Positives = 296/446 (66%)
 Frame = +2

Query: 230  LPQDWEQRHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFL 409
            + QD   +H TLLV+++H +  L++L+   ++  GSC LQ+L+ D DW++D+ WA++ FL
Sbjct: 49   ISQDNSIKHNTLLVESYHEHQRLKALLARLNKK-GSCPLQMLQDDADWSKDHFWAVIRFL 107

Query: 410  VETGRAEEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPS 589
              + R++E LQVFD+WK+IE +R N  N+ ++I++   EG + +A SAF   K   + PS
Sbjct: 108  RHSSRSDEILQVFDMWKDIEKSRINEFNYEKVIEILGEEGLIEDAYSAFIEMKTLCLSPS 167

Query: 590  LAIYNTIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVK 769
            L +YN++IH +AR   F ++      + E  L P  +TYNGL++AYG + +YDEM  C+K
Sbjct: 168  LQVYNSLIHGYARNGKFDDAVFYLNHLKEINLSPVSDTYNGLIQAYGKYKMYDEMGMCLK 227

Query: 770  RMELSGCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMG 949
            +ME+ GC PD VTYN+LI E A +GLL +ME  Y+      ++L+++TL +MLEAYA  G
Sbjct: 228  KMEMEGCSPDHVTYNLLIQELAEAGLLTRMEKVYQTTRMNRMDLKSTTLTAMLEAYANFG 287

Query: 950  NLEKIERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWY 1129
             +EK+E + +R    KA +KEDLI+K+ALVYI N+ F++LE+LG+ +   + G+ D+VW 
Sbjct: 288  IVEKMELILKRTRNSKALLKEDLIKKIALVYIENFMFSRLEKLGHYLSKRS-GQNDMVWC 346

Query: 1130 ILLLSSAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVC 1309
            +LLLS+A ++S+KG++S++ EM+  KV  N+  +NI+   YLK++D   L          
Sbjct: 347  LLLLSNACMLSQKGMDSVVREMKVAKVSWNVTFINIILLAYLKMKDSMRLGILLSTLTNH 406

Query: 1310 NIKPDLITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLC 1489
             +KPD++T G+ FDA +IG+ G  +LE WRR+  L   V   TD LVL+AFGKG FLK C
Sbjct: 407  IVKPDIVTVGVLFDANNIGFHGNGILETWRRTGILYRCVETETDPLVLAAFGKGQFLKKC 466

Query: 1490 EKKYITVHSRQKQKKIWRYSDVIRLV 1567
            E+ Y ++    +QK+ W Y ++I LV
Sbjct: 467  EEAYSSLEPVARQKEKWTYCNLIDLV 492


>ref|XP_002321108.2| hypothetical protein POPTR_0014s14700g [Populus trichocarpa]
            gi|550324215|gb|EEE99423.2| hypothetical protein
            POPTR_0014s14700g [Populus trichocarpa]
          Length = 508

 Score =  381 bits (979), Expect = e-103
 Identities = 199/461 (43%), Positives = 292/461 (63%), Gaps = 17/461 (3%)
 Frame = +2

Query: 236  QDWEQRHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVE 415
            QD   +H TLLVD+FH +  L+SL+H    NS    LQLL++DGDW++D+ W+++ FL  
Sbjct: 46   QDHSTKHTTLLVDSFHEHKRLKSLLHNL--NSNQNPLQLLQQDGDWSKDDFWSVIKFLKL 103

Query: 416  TGRAEEALQV-----------------FDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEA 544
            + R+ + LQV                 F +W+++E TR N  N+ +II L   EG M +A
Sbjct: 104  SARSNQILQVHSLAHLFFLAARKIEFVFHMWRDVEKTRINEFNYEKIIGLLGEEGLMEDA 163

Query: 545  MSAFEATKKSDIFPSLAIYNTIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRA 724
            ++AF   K   +  SL +YN+IIH +AR   F ++      M E  L P  +TY+GL+ A
Sbjct: 164  VTAFMEMKSFGLCLSLEVYNSIIHGYARNGKFDDALFYLNQMNEMNLSPESDTYDGLIEA 223

Query: 725  YGCFGLYDEMSKCVKRMELSGCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQ 904
            YG + +YDEM+ C+K+MEL GC PD  TYN+LI ++A+ GLL +ME  Y+ + +K + LQ
Sbjct: 224  YGTYRMYDEMAMCLKKMELDGCSPDRYTYNLLIQKFAQGGLLTRMERVYQSMRTKRMKLQ 283

Query: 905  ASTLVSMLEAYAEMGNLEKIERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGN 1084
            +STL+SMLEAYA  G +EK+E++ R     K  +KEDL+RKLA VYI NY F++L +L  
Sbjct: 284  SSTLISMLEAYANFGIVEKMEKILRWAWNSKITVKEDLVRKLAGVYIANYMFSRLHDLAV 343

Query: 1085 DVRTANWGKIDLVWYILLLSSAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIR 1264
            D+ T+  G+ D+VW + LLS A L+S++G+++++ EME+ K   NI + NI+   YLK++
Sbjct: 344  DL-TSITGRTDIVWCLHLLSHACLLSRRGMDAVVREMEDAKACWNITVANIILLAYLKMK 402

Query: 1265 DFRSLNGAFRQARVCNIKPDLITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQ 1444
            DF  L     +     ++PD++T+GI FDA +IG+DG   LE WR+   L   V + TD 
Sbjct: 403  DFTRLRILLSKLPEIRVEPDIVTFGILFDAEEIGFDGKECLEMWRKMGLLYRRVEMNTDP 462

Query: 1445 LVLSAFGKGSFLKLCEKKYITVHSRQKQKKIWRYSDVIRLV 1567
            L LSAFGKGSFL+ CE+ Y ++    ++KK W Y D I LV
Sbjct: 463  LALSAFGKGSFLRSCEEGYSSLEPNAREKKRWTYVDFINLV 503


>ref|NP_193155.4| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|223635638|sp|O23278.2|PP310_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g14190, chloroplastic; Flags: Precursor
            gi|332657991|gb|AEE83391.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 501

 Score =  352 bits (902), Expect = 4e-94
 Identities = 185/432 (42%), Positives = 273/432 (63%), Gaps = 2/432 (0%)
 Frame = +2

Query: 281  HRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVETGRAEEALQVFDLWK 460
            H +  L SL    S  SGSC L+LL++DGDW++D+ WA++ FL ++ R  E L VFD WK
Sbjct: 64   HHHRFLSSLTRRLSL-SGSCPLRLLQEDGDWSKDHFWAVIRFLRQSSRLHEILPVFDTWK 122

Query: 461  NIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEAT-KKSDIFPSLAIYNTIIHAFARKRD 637
            N+E +R + +N+ RII+  C E  M EA+ AF +     ++ PSL IYN+IIH++A    
Sbjct: 123  NLEPSRISENNYERIIRFLCEEKSMSEAIRAFRSMIDDHELSPSLEIYNSIIHSYADDGK 182

Query: 638  FHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGCFPDEVTYNI 817
            F  +      M E GLLP  ETY+GL+ AYG + +YDE+  C+KRME  GC  D VTYN+
Sbjct: 183  FEEAMFYLNHMKENGLLPITETYDGLIEAYGKWKMYDEIVLCLKRMESDGCVRDHVTYNL 242

Query: 818  LITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIERVYRRILKLK 997
            LI E++R GLL++ME  Y+ L S+ + L+ STL+SMLEAYAE G +EK+E    +I++  
Sbjct: 243  LIREFSRGGLLKRMEQMYQSLMSRKMTLEPSTLLSMLEAYAEFGLIEKMEETCNKIIRFG 302

Query: 998  AYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSSAGLVSKKGVE 1177
              + E L+RKLA VYI N  F++L++LG  +  +   + +L W + LL  A LVS+KG++
Sbjct: 303  ISLDEGLVRKLANVYIENLMFSRLDDLGRGISASRTRRTELAWCLRLLCHARLVSRKGLD 362

Query: 1178 SILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLITYGIFFDAC 1357
             ++ EMEE +VP N    NI    Y K+ DF S+     + R+ ++K DL+T GI FD  
Sbjct: 363  YVVKEMEEARVPWNTTFANIALLAYSKMGDFTSIELLLSELRIKHVKLDLVTVGIVFDLS 422

Query: 1358 DIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEK-KYITVHSRQKQKK 1534
            +  +DG  V   W++  +L+  V ++TD LV +AFGKG FL+ CE+ K  ++ +R  + K
Sbjct: 423  EARFDGTGVFMTWKKIGFLDKPVEMKTDPLVHAAFGKGQFLRSCEEVKNQSLGTRDGESK 482

Query: 1535 IWRYSDVIRLVL 1570
             W Y  ++ LV+
Sbjct: 483  SWTYQYLMELVV 494


>ref|XP_006414780.1| hypothetical protein EUTSA_v10027442mg [Eutrema salsugineum]
            gi|557115950|gb|ESQ56233.1| hypothetical protein
            EUTSA_v10027442mg [Eutrema salsugineum]
          Length = 495

 Score =  349 bits (896), Expect = 2e-93
 Identities = 185/439 (42%), Positives = 273/439 (62%), Gaps = 2/439 (0%)
 Frame = +2

Query: 260  TLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVETGRAEEAL 439
            +LL D++H ++   + +      +GSC L+LL +DGDW++   WA+V FL  + R  E L
Sbjct: 55   SLLSDSYHHHHRFLNSLPRRLSRTGSCPLRLLREDGDWSKHQFWAVVRFLRHSSRLHEIL 114

Query: 440  QVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEAT-KKSDIFPSLAIYNTIIH 616
             VFD WKN+E +R N +N+ +I++  C E  M EA+ AF+    + ++ PSL IYN+IIH
Sbjct: 115  PVFDAWKNLEPSRINEANYEKILRFLCEEKSMNEAIRAFQCMIDEHELSPSLEIYNSIIH 174

Query: 617  AFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGCFP 796
             +A    F  +      M E  +LP  ETY+GL+ AYG + LYDE+  C+K+ME  GC  
Sbjct: 175  GYANDGKFEEAMFYMNHMKENDMLPETETYDGLIEAYGKWKLYDEIVLCIKKMESDGCVR 234

Query: 797  DEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIERVY 976
            D VTYN+LI E+AR GLL++ME  Y+ L S+ + L+  TL+SMLEAYAE G LEK+E  Y
Sbjct: 235  DHVTYNLLIREFARGGLLKRMEQMYQSLMSRKMTLEPCTLLSMLEAYAEFGVLEKMEDTY 294

Query: 977  RRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSSAGL 1156
             +I++    + EDL+RK+A VYI N  F++L++LG  +R     + DL W + LL  A L
Sbjct: 295  NKIVRFGISLDEDLVRKVANVYIDNLMFSRLDDLGRGIR-----RTDLAWCLRLLCHACL 349

Query: 1157 VSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLITY 1336
            VS+KG++ ++ EMEE +VP N    NI+   Y K+ DFRS+     + R  ++K DL+T 
Sbjct: 350  VSRKGLDYVVKEMEEARVPWNATFANIVLLAYSKMGDFRSVELLLSELRTKHVKLDLVTV 409

Query: 1337 GIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEK-KYITVH 1513
            GI  D    G+DG  V   W++  +L+  V  +TD LV +AFGKG FL+ CE+ K   + 
Sbjct: 410  GIVLDLSVDGFDGTGVFMTWKKIGFLDKPVETKTDPLVHAAFGKGRFLRSCEEVKNQVLG 469

Query: 1514 SRQKQKKIWRYSDVIRLVL 1570
            +R ++ K W Y  ++ LV+
Sbjct: 470  TRVEESKSWTYQYLMELVV 488


>ref|XP_002868305.1| binding protein [Arabidopsis lyrata subsp. lyrata]
            gi|297314141|gb|EFH44564.1| binding protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 502

 Score =  346 bits (888), Expect = 2e-92
 Identities = 185/450 (41%), Positives = 276/450 (61%), Gaps = 2/450 (0%)
 Frame = +2

Query: 227  PLPQDWEQRHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSF 406
            PL  + +    T L+   HR     S +       GSC L+LL++ GDW++D+ WA++ F
Sbjct: 49   PLSINGDASQSTSLIHHHHR---FLSSLPRRLELPGSCPLRLLQEYGDWSKDHFWAVIRF 105

Query: 407  LVETGRAEEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEAT-KKSDIF 583
            L  + R  E L VFD WKN+E +R + +N+ R+I+L C E  M EA+ AF       ++ 
Sbjct: 106  LRHSSRLHEILPVFDAWKNLERSRISEANYERVIRLLCEEKSMNEAIRAFRGMIDDHELS 165

Query: 584  PSLAIYNTIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKC 763
            PSL IYN+IIH +A +  F  +      M E GLLP  ETY+GL+ AYG + +YDE+  C
Sbjct: 166  PSLEIYNSIIHGYADEGKFEEAMFYLNHMKENGLLPITETYDGLIEAYGKWKMYDEIVLC 225

Query: 764  VKRMELSGCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAE 943
            +KRME  GC  D VTYN+LI E++R GLL++ME  Y+ L S+ + L+ STL+SMLEAYAE
Sbjct: 226  LKRMESEGCVRDHVTYNLLIREFSRGGLLKRMEQMYQSLMSRKMTLEPSTLLSMLEAYAE 285

Query: 944  MGNLEKIERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLV 1123
             G +EK+E    +I++    + E L+RKLA VYI N  F++L++LG  + ++   + DL 
Sbjct: 286  FGLIEKMEETCNKIIRFGISLDEGLVRKLANVYIDNLMFSRLDDLGRGISSSRTRRTDLA 345

Query: 1124 WYILLLSSAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQAR 1303
            W + LL  A LVS+KG++ ++ EM+E +VP N    NI    Y K+ DF+S+     + R
Sbjct: 346  WCLRLLCHARLVSRKGLDYVIKEMKEARVPWNTTFANITLLAYSKMGDFKSIELLLSELR 405

Query: 1304 VCNIKPDLITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLK 1483
              ++K DL+T GI FD  + G+D   V   W++  +L+  V ++TD LV +AFGKG FLK
Sbjct: 406  TKHVKLDLVTVGIIFDLSEAGFDVTGVFMTWKKIGFLDKPVEMKTDPLVHAAFGKGKFLK 465

Query: 1484 LCEK-KYITVHSRQKQKKIWRYSDVIRLVL 1570
             CE+ K  ++  R ++ K W Y  ++ +V+
Sbjct: 466  SCEEVKNQSLGMRGEESKAWTYQYLMEVVV 495


>ref|XP_006492991.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190,
            chloroplastic-like [Citrus sinensis]
          Length = 477

 Score =  342 bits (876), Expect = 4e-91
 Identities = 175/440 (39%), Positives = 275/440 (62%)
 Frame = +2

Query: 251  RHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVETGRAE 430
            +H TLLV+++H +  L +LI   ++   SC LQ+L+ DGDWT+D+ WA++ FL  + R+ 
Sbjct: 62   KHTTLLVESYHEHQALNALIQRLNKKV-SCPLQILQHDGDWTKDHFWAVIRFLKNSSRSR 120

Query: 431  EALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYNTI 610
            +  QVFD+WKNIE +R N  N+ +II + C EG M EA+ AF+  +   + PSL IYN+I
Sbjct: 121  QIPQVFDMWKNIEKSRINEFNYQKIIGMLCEEGLMEEAVRAFQEMEGFALKPSLEIYNSI 180

Query: 611  IHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGC 790
            IH +++   F+ +      M E  L P  +TY+GL++AY                     
Sbjct: 181  IHGYSKIGKFNEALLFLNEMKEMNLSPQSDTYDGLIQAY--------------------- 219

Query: 791  FPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIER 970
                        E+A +GLL++ME TY+ + +K ++L++ST+V++L+AY   G L+K+E+
Sbjct: 220  ------------EFACAGLLKRMEGTYKSMLTKRMHLRSSTMVAILDAYMNFGMLDKMEK 267

Query: 971  VYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSSA 1150
             Y+R+L  +  +KEDL+RKLA VYI+NY F++L++LG+D+  +  G+ +LVW + LLS A
Sbjct: 268  FYKRLLNSRTPLKEDLVRKLAEVYIKNYMFSRLDDLGDDL-ASRIGRTELVWCLRLLSHA 326

Query: 1151 GLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLI 1330
             L+S +G++S++ EME  KV  N+   NI+   YLK++DF+ L     +    ++KPD++
Sbjct: 327  CLLSHRGIDSVVREMESAKVRWNVTTANIILLAYLKMKDFKHLRVLLSELPTRHVKPDIV 386

Query: 1331 TYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYITV 1510
            T GI +DA  IG+DG   LE WRR  +L   V I TD LVL+ +GKG FL+ CE+ Y ++
Sbjct: 387  TIGILYDARRIGFDGTGALEMWRRIGFLSKTVEINTDPLVLAVYGKGHFLRYCEEVYSSL 446

Query: 1511 HSRQKQKKIWRYSDVIRLVL 1570
                ++KK W Y ++I LV+
Sbjct: 447  EPYSREKKRWTYQNLIDLVI 466


>ref|XP_006283572.1| hypothetical protein CARUB_v10004636mg [Capsella rubella]
            gi|482552277|gb|EOA16470.1| hypothetical protein
            CARUB_v10004636mg [Capsella rubella]
          Length = 501

 Score =  333 bits (854), Expect = 1e-88
 Identities = 174/414 (42%), Positives = 255/414 (61%), Gaps = 1/414 (0%)
 Frame = +2

Query: 332  GSCALQLLEKDGDWTEDNLWAMVSFLVETGRAEEALQVFDLWKNIEMTRNNPSNHLRIIK 511
            GSC LQLL++DGDW++D+ WA++ FL  + R  E L V+D WKN+E +R +  N+ R+I+
Sbjct: 87   GSCPLQLLQEDGDWSKDHFWAVIRFLRHSSRLHEILPVYDAWKNLEPSRISVVNYERVIR 146

Query: 512  LFCGEGFMIEAMSAFEATKKSD-IFPSLAIYNTIIHAFARKRDFHNSNATFAMMLEAGLL 688
              C E  M EA+ AF +    D + PSL IYN+IIH +A    F  +      M E GL 
Sbjct: 147  FLCEERSMNEAIRAFRSMIDDDELSPSLEIYNSIIHGYADDGKFEEAMFYLNQMKENGLS 206

Query: 689  PTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGCFPDEVTYNILITEYARSGLLEKMEST 868
            P  ETY+GL+ AYG + +YDE+  CV+RME  GC  D VTYN+LI +++R GLL++ME  
Sbjct: 207  PISETYDGLIEAYGKWKMYDEIVLCVRRMESDGCVRDHVTYNLLIRQFSRGGLLKRMEQM 266

Query: 869  YRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIERVYRRILKLKAYIKEDLIRKLALVYIR 1048
            Y+ L S+ + L+  TL+SMLEAYAE G +EK+E    +I++    + + L+RKLA VYI 
Sbjct: 267  YQSLMSRKMTLEPCTLLSMLEAYAEFGVIEKMEETCNKIIRFGISLDDGLVRKLAKVYID 326

Query: 1049 NYRFAQLEELGNDVRTANWGKIDLVWYILLLSSAGLVSKKGVESILHEMEEEKVPININI 1228
            N  F++L++LG  +  +   + DL W + LL  + LVS+KG++ +L EM E KV  N   
Sbjct: 327  NLMFSRLDDLGRGISYSRTRRSDLAWCLRLLCHSRLVSRKGLDYVLKEMTEAKVTWNTTF 386

Query: 1229 VNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLITYGIFFDACDIGYDGIHVLEEWRRSR 1408
             NI+   Y K+ DF+S+       R   +K DL+T GI FD  + G+DG  V   W++  
Sbjct: 387  ANIVLLAYSKMGDFKSIELLLDGLRTKRVKLDLVTVGIVFDLSEAGFDGTGVFMTWKKIG 446

Query: 1409 YLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYITVHSRQKQKKIWRYSDVIRLVL 1570
            +L+  V ++TD LV +AFGKG FL+ CE+       R +    W Y +++ LV+
Sbjct: 447  FLDKPVEMKTDPLVHAAFGKGQFLRRCEE------MRGEDPTPWTYQNLMELVV 494


>emb|CAB10198.1| salt-inducible protein homolog [Arabidopsis thaliana]
            gi|7268124|emb|CAB78461.1| salt-inducible protein homolog
            [Arabidopsis thaliana]
          Length = 561

 Score =  328 bits (842), Expect = 3e-87
 Identities = 174/419 (41%), Positives = 259/419 (61%), Gaps = 15/419 (3%)
 Frame = +2

Query: 359  KDGDWTEDNLWAMVSFLVETGRAEEAL-------------QVFDLWKNIEMTRNNPSNHL 499
            +DGDW++D+ WA++ FL ++ R  E L             QVFD WKN+E +R + +N+ 
Sbjct: 136  EDGDWSKDHFWAVIRFLRQSSRLHEILPNMKMTFCFFFQLQVFDTWKNLEPSRISENNYE 195

Query: 500  RIIKLFCGEGFMIEAMSAFEAT-KKSDIFPSLAIYNTIIHAFARKRDFHNSNATFAMMLE 676
            RII+  C E  M EA+ AF +     ++ PSL IYN+IIH++A    F  +      M E
Sbjct: 196  RIIRFLCEEKSMSEAIRAFRSMIDDHELSPSLEIYNSIIHSYADDGKFEEAMFYLNHMKE 255

Query: 677  AGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGCFPDEVTYNILITEYARSGLLEK 856
             GLLP  ETY+GL+ AYG + +YDE+  C+KRME  GC  D VTYN+LI E++R GLL++
Sbjct: 256  NGLLPITETYDGLIEAYGKWKMYDEIVLCLKRMESDGCVRDHVTYNLLIREFSRGGLLKR 315

Query: 857  MESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIERVYRRILKLKAYIKEDLIRKLAL 1036
            ME  Y+ L S+ + L+ STL+SMLEAYAE G +EK+E    +I++    + E L+RKLA 
Sbjct: 316  MEQMYQSLMSRKMTLEPSTLLSMLEAYAEFGLIEKMEETCNKIIRFGISLDEGLVRKLAN 375

Query: 1037 VYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSSAGLVSKKGVESILHEMEEEKVPI 1216
            VYI N  F++L++LG  +  +   + +L W + LL  A LVS+KG++ ++ EMEE +VP 
Sbjct: 376  VYIENLMFSRLDDLGRGISASRTRRTELAWCLRLLCHARLVSRKGLDYVVKEMEEARVPW 435

Query: 1217 NINIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLITYGIFFDACDIGYDGIHVLEEW 1396
            N    NI    Y K+ DF S+     + R+ ++K DL+T GI FD  +  +DG  V   W
Sbjct: 436  NTTFANIALLAYSKMGDFTSIELLLSELRIKHVKLDLVTVGIVFDLSEARFDGTGVFMTW 495

Query: 1397 RRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEK-KYITVHSRQKQKKIWRYSDVIRLVL 1570
            ++  +L+  V ++TD LV +AFGKG FL+ CE+ K  ++ +R  + K W Y  ++ LV+
Sbjct: 496  KKIGFLDKPVEMKTDPLVHAAFGKGQFLRSCEEVKNQSLGTRDGESKSWTYQYLMELVV 554


>emb|CAN80932.1| hypothetical protein VITISV_017362 [Vitis vinifera]
          Length = 1697

 Score =  319 bits (818), Expect = 2e-84
 Identities = 163/363 (44%), Positives = 234/363 (64%)
 Frame = +2

Query: 266  LVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVETGRAEEALQV 445
            LV+T H N  L  LI + S N  S  LQLL  DGDW + + WA++ FL +  R+ E L V
Sbjct: 1332 LVETLHENERLGVLIQKLS-NKASSPLQLLRDDGDWNKQHFWAVIRFLKDASRSSEILPV 1390

Query: 446  FDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYNTIIHAFA 625
            F LWK+++ +R N  N+ +II L   E    E++ A E  K   + PSL IYN +IH FA
Sbjct: 1391 FHLWKDMDKSRINEFNYAKIIGLLSQEDLAEESVLALEXMKTHGLKPSLEIYNLVIHCFA 1450

Query: 626  RKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGCFPDEV 805
            RK +F  +      +    L+   ETY+GL+++YG + +YDE+ +CVK+ME  GC PD +
Sbjct: 1451 RKGEFDRALYFLNELKXNNLIADTETYDGLIQSYGKYKMYDELDECVKKMESDGCLPDHI 1510

Query: 806  TYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIERVYRRI 985
            TYN+LI E++R GLL++ME  ++ + SK + LQ+STLV MLEAYA  G +EK+E  YRR+
Sbjct: 1511 TYNLLIQEFSRGGLLKRMERVFQTVLSKKMGLQSSTLVVMLEAYANFGIIEKMENAYRRV 1570

Query: 986  LKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSSAGLVSK 1165
            L  K  +K+DLIRKLA VYI NY+F++L ++G D+ +    + DLVW + LLS A L+S+
Sbjct: 1571 LNSKTSLKDDLIRKLAEVYIENYKFSRLADMGLDLASVT-SRTDLVWCLRLLSHACLLSR 1629

Query: 1166 KGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLITYGIF 1345
            KG++SI+ EME + VP N  + N +   YLK++DF  L     +    ++KPD++T GI 
Sbjct: 1630 KGLDSIVKEMEAKNVPWNATVANTILLAYLKMKDFTRLRILLLELSTRHVKPDIVTVGIL 1689

Query: 1346 FDA 1354
            FDA
Sbjct: 1690 FDA 1692


>ref|XP_002966251.1| hypothetical protein SELMODRAFT_85839 [Selaginella moellendorffii]
            gi|300165671|gb|EFJ32278.1| hypothetical protein
            SELMODRAFT_85839 [Selaginella moellendorffii]
          Length = 358

 Score =  151 bits (381), Expect = 1e-33
 Identities = 88/317 (27%), Positives = 158/317 (49%)
 Frame = +2

Query: 542  AMSAFEATKKSDIFPSLAIYNTIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLR 721
            A   F+  +   + PS+  ++ ++ ++A   +   + +    ML+ G+ P   TY GL+R
Sbjct: 11   AQGVFDGMEAMQVRPSVVGFSALVQSYAESGEVEGAQSAMKRMLDTGIQPNVVTYGGLIR 70

Query: 722  AYGCFGLYDEMSKCVKRMELSGCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNL 901
            AYG  GL+DEM+K V  M+   C PD   Y  +I  YA  GL+ +M+  ++ + +     
Sbjct: 71   AYGKRGLFDEMAKVVNTMKTVRCEPDFFVYKNVIEAYASGGLVGRMDKAFKAMRADGWIP 130

Query: 902  QASTLVSMLEAYAEMGNLEKIERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELG 1081
             +  L  + + YA MG ++++E     + ++K + +E+ +R  AL YIR+ +F Q+E   
Sbjct: 131  DSDILNLLAQGYASMGMIKEMEGAQGELRRIKGWPREESVRACALAYIRHNQFYQMEGFV 190

Query: 1082 NDVRTANWGKIDLVWYILLLSSAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKI 1261
              +     G  +L+W +LLL+ A   S K ++     M   +   ++   NI A    ++
Sbjct: 191  KSLGMKRIGG-NLLWNLLLLAHAANFSMKSLQREAVNMWSARCAPDVTTFNIRALALSRM 249

Query: 1262 RDFRSLNGAFRQARVCNIKPDLITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTD 1441
            +    L+   +  R  +++PDL+TYG   DA  I      + E+       + +  +RTD
Sbjct: 250  QMLWDLHVLVQHMRAESVRPDLVTYGALVDAYAIARLLPRLPEQLDELDMADTIPDVRTD 309

Query: 1442 QLVLSAFGKGSFLKLCE 1492
             LV  AFG+G F   C+
Sbjct: 310  PLVFQAFGRGRFHAFCD 326


Top