BLASTX nr result

ID: Zingiber23_contig00027173

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber23_contig00027173
         (1678 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ESW35794.1| hypothetical protein PHAVU_001G265200g [Phaseolus...   411   e-112
gb|EMJ24063.1| hypothetical protein PRUPE_ppa004279mg [Prunus pe...   407   e-111
ref|XP_003552730.1| PREDICTED: pentatricopeptide repeat-containi...   407   e-111
ref|XP_002268109.1| PREDICTED: pentatricopeptide repeat-containi...   393   e-106
ref|XP_003538531.1| PREDICTED: pentatricopeptide repeat-containi...   392   e-106
ref|XP_006421046.1| hypothetical protein CICLE_v10004784mg [Citr...   391   e-106
gb|EXC14264.1| hypothetical protein L484_021763 [Morus notabilis]     390   e-106
ref|XP_006855721.1| hypothetical protein AMTR_s00044p00151840 [A...   390   e-106
ref|XP_002321108.2| hypothetical protein POPTR_0014s14700g [Popu...   383   e-103
gb|EOY05094.1| Pentatricopeptide repeat-containing protein, puta...   383   e-103
ref|XP_004298231.1| PREDICTED: pentatricopeptide repeat-containi...   380   e-102
ref|XP_002516403.1| pentatricopeptide repeat-containing protein,...   380   e-102
ref|NP_193155.4| pentatricopeptide repeat-containing protein [Ar...   348   5e-93
ref|XP_006414780.1| hypothetical protein EUTSA_v10027442mg [Eutr...   345   3e-92
ref|XP_002868305.1| binding protein [Arabidopsis lyrata subsp. l...   342   2e-91
ref|XP_006492991.1| PREDICTED: pentatricopeptide repeat-containi...   338   5e-90
ref|XP_006283572.1| hypothetical protein CARUB_v10004636mg [Caps...   329   2e-87
emb|CAB10198.1| salt-inducible protein homolog [Arabidopsis thal...   328   3e-87
emb|CAN80932.1| hypothetical protein VITISV_017362 [Vitis vinifera]   321   5e-85
ref|XP_002966251.1| hypothetical protein SELMODRAFT_85839 [Selag...   151   9e-34

>gb|ESW35794.1| hypothetical protein PHAVU_001G265200g [Phaseolus vulgaris]
          Length = 496

 Score =  411 bits (1056), Expect = e-112
 Identities = 206/441 (46%), Positives = 301/441 (68%)
 Frame = -1

Query: 1471 EQRHRTLLVDTFHRNNGLRSLIHEASRNSGSSALQLLEKDGDWTEDNLWAMVSFLVETGR 1292
            + +HRTLLV+T+H ++ LR+L+ +  R   S+ + +L +DGDW++D+ WA V FL    R
Sbjct: 57   DSKHRTLLVETYHHHDSLRALLAKLERED-SNPMYILAQDGDWSKDHFWAAVRFLKNASR 115

Query: 1291 AEEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYN 1112
              E LQVFD+WK IE +R +  N+ +II L C +  M EA+SAF+  K   + PSL  YN
Sbjct: 116  FVEILQVFDMWKEIEKSRISEFNYNKIIGLLCEDEMMEEALSAFQEMKVQGMKPSLDTYN 175

Query: 1111 TIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELS 932
             IIH  ++   F ++      M E+GL P  ETY+GL+ AYG F LYDEM +CVK+MEL 
Sbjct: 176  PIIHGLSKAGKFSDALRFLDEMKESGLDPDSETYDGLIGAYGKFQLYDEMGECVKKMELE 235

Query: 931  GCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKI 752
            GC PD +TYNILI EYAR+G+L++ME  Y+ + SK + LQ+ST V+ML+AY   G +EK+
Sbjct: 236  GCSPDHITYNILIQEYARAGILQRMEKLYQRMLSKRMRLQSSTFVAMLKAYTTFGIVEKM 295

Query: 751  ERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLS 572
            E  +R++L  K+ +++D IRK+A VYI+NY F++LE+L  D+ +A +G+ DLVW + LLS
Sbjct: 296  EFFFRKVLNSKSCLEDDFIRKMAEVYIKNYMFSRLEDLALDLCSA-FGESDLVWCLRLLS 354

Query: 571  SAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPD 392
             A L+SKKG++ ++ EM++ K+  N+   NI+   Y+K++DFR L     Q R+  + PD
Sbjct: 355  YACLLSKKGMDIVVKEMQDAKINWNVAFANIIMLAYVKMKDFRHLRILLSQLRINRLGPD 414

Query: 391  LITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYI 212
            ++T GI  DA  IG+DG   LE WRR  YL+ VV ++TD LVL+AFGKG FLK CE+ Y 
Sbjct: 415  IVTIGIVLDASRIGFDGRGALESWRRMGYLDRVVELKTDSLVLTAFGKGHFLKSCEEVYT 474

Query: 211  TVHSRQKQKKIWRYSDVIRLV 149
            ++H   +++K W Y+D+I L+
Sbjct: 475  SLHPEDRERKKWTYNDLIALL 495


>gb|EMJ24063.1| hypothetical protein PRUPE_ppa004279mg [Prunus persica]
          Length = 518

 Score =  407 bits (1047), Expect = e-111
 Identities = 205/445 (46%), Positives = 295/445 (66%)
 Frame = -1

Query: 1483 PQDWEQRHRTLLVDTFHRNNGLRSLIHEASRNSGSSALQLLEKDGDWTEDNLWAMVSFLV 1304
            P     +H TLLV+TFH +  L++L+      +GS  LQLL +DGDWT+D  WA + FL 
Sbjct: 65   PDSSSTKHTTLLVETFHEHQRLKALLQNLI--NGSCPLQLLGEDGDWTKDQFWAAIRFLK 122

Query: 1303 ETGRAEEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSL 1124
             T R  E LQ+FD+WKNIE +R N  N+ +II L   EG + EA+  F+  K  ++ PSL
Sbjct: 123  HTFRFNEILQLFDMWKNIEKSRINEFNYSKIIGLLGEEGLIEEAVRCFQEMKSHNLRPSL 182

Query: 1123 AIYNTIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKR 944
             +YN++IH  AR+ +F ++      M E  L P  +TY+GL+ AYG + +YD++  CVK+
Sbjct: 183  EVYNSVIHVCARQGNFEDALFFLNEMKEMNLAPETDTYDGLIEAYGKYRMYDQIGMCVKK 242

Query: 943  MELSGCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGN 764
            M+L+GC PD +TYN+LI E+AR GLL++MES Y+ + S+ + LQ+STL++M+E YA+ G 
Sbjct: 243  MKLNGCSPDHITYNLLIREFARGGLLKRMESVYQSMLSRRMALQSSTLIAMVEVYAKFGI 302

Query: 763  LEKIERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYI 584
            LEK+E VYRR+L     +K DLIRKLA VYI NY F++LE+LG D+ ++ +G+ DLVW +
Sbjct: 303  LEKMENVYRRVLNSGTVVKNDLIRKLAEVYIDNYMFSRLEKLGVDL-SSRFGQTDLVWCL 361

Query: 583  LLLSSAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCN 404
             LLS AG++S++G++SI+ EM+E+ VP N  + NI+   YLK++DF  L     Q     
Sbjct: 362  RLLSQAGVLSQRGMDSIVDEMKEQNVPWNETVANIIMLAYLKMKDFTHLRIFLSQLLTQG 421

Query: 403  IKPDLITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCE 224
            ++PD+IT GI FDA  IGYDG   L+ WR + +L   V + TD LVL+ FGKG FL+ CE
Sbjct: 422  VEPDIITVGIVFDANRIGYDGSRTLDTWRENGFLRKAVEMNTDPLVLTTFGKGHFLRNCE 481

Query: 223  KKYITVHSRQKQKKIWRYSDVIRLV 149
              Y ++    ++ K W Y  +I LV
Sbjct: 482  AAYSSLEPEDRENKTWTYHHLIDLV 506


>ref|XP_003552730.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190,
            chloroplastic-like [Glycine max]
          Length = 509

 Score =  407 bits (1047), Expect = e-111
 Identities = 210/441 (47%), Positives = 298/441 (67%)
 Frame = -1

Query: 1471 EQRHRTLLVDTFHRNNGLRSLIHEASRNSGSSALQLLEKDGDWTEDNLWAMVSFLVETGR 1292
            + +H TLLV+T+H ++ LR+L+ +  +    + L +L +DGDW++D+ WA+V FL    R
Sbjct: 58   DTKHTTLLVETYHLHDSLRALLAKLQKED-CNPLHVLAEDGDWSKDHFWAVVRFLKSASR 116

Query: 1291 AEEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYN 1112
              + LQVFD+WKNIE +R +  N+ +II L C  G M +A+SA    K   I PSL  YN
Sbjct: 117  FTQILQVFDMWKNIEKSRISEFNYNKIIGLLCEGGKMEDALSALRDMKVQGIKPSLDTYN 176

Query: 1111 TIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELS 932
             IIH  +R+  F ++      M E+GL    ETY+GLL AYG F +YDEM +CVK+MEL 
Sbjct: 177  PIIHGLSREGKFSDALRFIDEMKESGLELDSETYDGLLGAYGKFQMYDEMGECVKKMELE 236

Query: 931  GCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKI 752
            GC PD +TYNILI EYAR+GLL++ME  Y+ + SK +++Q+STLV+MLEAY   G +EK+
Sbjct: 237  GCSPDHITYNILIQEYARAGLLQRMEKLYQRMVSKRMHVQSSTLVAMLEAYTTFGMVEKM 296

Query: 751  ERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLS 572
            E  YR+IL  K  +++DLIRK+A VYI+NY F++LE+L  D+  A +G+ +LVW + LLS
Sbjct: 297  ENFYRKILSSKTCLEDDLIRKVAEVYIKNYMFSRLEDLALDLCPA-FGESNLVWCLRLLS 355

Query: 571  SAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPD 392
             A  +SKKG++ ++ EM + KV  N+ + NI+   Y+K++DFR L     Q  +  ++PD
Sbjct: 356  YACPLSKKGMDIVVREMRDAKVNWNVTVANIIMLAYVKMKDFRHLKILLSQLPIYRVQPD 415

Query: 391  LITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYI 212
            +IT GI FDA  IG+DG   LE WRR  YL  VV I+TD LVL+AFGKG FLK CE+ Y 
Sbjct: 416  IITIGILFDATRIGFDGSGALETWRRMGYLYRVVEIKTDSLVLTAFGKGHFLKSCEEVYS 475

Query: 211  TVHSRQKQKKIWRYSDVIRLV 149
            ++H   +++K W Y D+I L+
Sbjct: 476  SLHPEDRKRKTWTYHDLIALL 496


>ref|XP_002268109.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190,
            chloroplastic-like [Vitis vinifera]
          Length = 581

 Score =  393 bits (1010), Expect = e-106
 Identities = 198/438 (45%), Positives = 287/438 (65%)
 Frame = -1

Query: 1465 RHRTLLVDTFHRNNGLRSLIHEASRNSGSSALQLLEKDGDWTEDNLWAMVSFLVETGRAE 1286
            +H TLLV+T H N  L  LI + S N  SS LQLL  DGDW + + WA++ FL +  R+ 
Sbjct: 88   KHTTLLVETLHENERLGVLIQKLS-NKASSPLQLLRDDGDWNKQHFWAVIRFLKDASRSS 146

Query: 1285 EALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYNTI 1106
            E L VF LWK+++ +R N  N+ +II L   E    E++ A E  K   + PSL IYN +
Sbjct: 147  EILPVFHLWKDMDKSRINEFNYAKIIGLLSQEDLAEESVLALEGMKTHGLKPSLEIYNLV 206

Query: 1105 IHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGC 926
            IH FARK +F  +      +    L+   ETY+GL+++YG + +YDE+ +CVK+ME  GC
Sbjct: 207  IHCFARKGEFDRALYFLNELKANNLIADTETYDGLIQSYGKYKMYDELDECVKKMESDGC 266

Query: 925  FPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIER 746
             PD +TYN+LI E++R GLL++ME  ++ + SK + LQ+STLV MLEAYA  G +EK+E 
Sbjct: 267  LPDHITYNLLIQEFSRGGLLKRMERVFQTVLSKKMGLQSSTLVVMLEAYANFGIIEKMEN 326

Query: 745  VYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSSA 566
             YRR+L  K  +K+DLIRKLA VYI NY+F++L ++G ++ +    + DLVW + LLS A
Sbjct: 327  AYRRVLNSKTSLKDDLIRKLAEVYIENYKFSRLADMGLNLASVT-SRTDLVWCLRLLSHA 385

Query: 565  GLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLI 386
             L+S+KG++SI+ EME + VP N  + N +   YLK++DF  L     +    ++KPD++
Sbjct: 386  CLLSRKGLDSIVKEMEAKNVPWNATVANTILLAYLKMKDFTRLRILLLELSTRHVKPDIV 445

Query: 385  TYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYITV 206
            T GI FDA  IG++G   L  WRR+ +L++ V + TD LVLSAFGKG+FL+ CE+ Y ++
Sbjct: 446  TVGILFDANRIGFNGTMALNTWRRTGFLDEAVEMNTDPLVLSAFGKGNFLQSCEEMYSSL 505

Query: 205  HSRQKQKKIWRYSDVIRL 152
                ++KKIW Y ++I L
Sbjct: 506  EPEARKKKIWTYQNLIDL 523


>ref|XP_003538531.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190,
            chloroplastic-like isoform X1 [Glycine max]
          Length = 506

 Score =  392 bits (1008), Expect = e-106
 Identities = 204/441 (46%), Positives = 295/441 (66%)
 Frame = -1

Query: 1471 EQRHRTLLVDTFHRNNGLRSLIHEASRNSGSSALQLLEKDGDWTEDNLWAMVSFLVETGR 1292
            + +H TLLV+T+H ++ LR+L+ +   N  S+ L +L +D DW++D+ WA+V FL  +  
Sbjct: 56   DTKHTTLLVETYHLHHSLRALLAKLE-NEYSNPLHMLAEDADWSKDHFWAVVRFLKSSSN 114

Query: 1291 AEEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYN 1112
                LQVFD+WKNIE +R +  N+ +II L C  G M +A+SA +  K   I PSL  YN
Sbjct: 115  FTHILQVFDMWKNIEKSRISEFNYNKIIGLLCEGGKMKDALSALQDMKVQGIKPSLDTYN 174

Query: 1111 TIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELS 932
             IIH  +R+  F ++      M E+GL    ETY+GL+ AYG F +YDEM +CVK+MEL 
Sbjct: 175  PIIHGLSREGKFSDALRFIDEMKESGLELDSETYDGLIGAYGKFQMYDEMGECVKKMELE 234

Query: 931  GCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKI 752
            GC PD +TYNILI EYA  GLL++ME  Y+ + SK +++++STLV+MLEAY   G +EK+
Sbjct: 235  GCSPDPITYNILIQEYAGGGLLQRMEKLYQRMLSKRMHVKSSTLVAMLEAYTTFGMVEKM 294

Query: 751  ERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLS 572
            E+ YR+IL  K  I++DLIRK+A VYI N+ F++LE+L  D+  A +G+ +L W   LLS
Sbjct: 295  EKFYRKILNSKTCIEDDLIRKVAEVYINNFMFSRLEDLALDLCPA-FGESNLEWCFRLLS 353

Query: 571  SAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPD 392
             A L+SKKG++ ++ EM++ KV  N+ + NI+   Y+K+++FR L     Q  +  ++PD
Sbjct: 354  YACLLSKKGMDIVVQEMQDAKVSWNVTVANIIMLAYVKMKEFRHLRILLSQLPIYRVQPD 413

Query: 391  LITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYI 212
            +IT GI FDA  IG+DG   LE WRR  YL  VV ++TD LVL+AFGKG FLK CE+ Y 
Sbjct: 414  IITIGILFDATRIGFDGSGALETWRRMGYLYRVVEMKTDSLVLTAFGKGHFLKSCEEVYS 473

Query: 211  TVHSRQKQKKIWRYSDVIRLV 149
            ++H   +++K   Y D+I L+
Sbjct: 474  SLHPEDRKRKTCTYHDLIPLL 494


>ref|XP_006421046.1| hypothetical protein CICLE_v10004784mg [Citrus clementina]
            gi|557522919|gb|ESR34286.1| hypothetical protein
            CICLE_v10004784mg [Citrus clementina]
          Length = 510

 Score =  391 bits (1005), Expect = e-106
 Identities = 190/440 (43%), Positives = 297/440 (67%)
 Frame = -1

Query: 1465 RHRTLLVDTFHRNNGLRSLIHEASRNSGSSALQLLEKDGDWTEDNLWAMVSFLVETGRAE 1286
            +H TLLV+++H +  L +LI   ++   S  LQ+L+ DGDWT+D+ WA++ FL  + R+ 
Sbjct: 62   KHTTLLVESYHEHQALNALIQRLNKKV-SCPLQILQHDGDWTKDHFWAVIRFLKNSSRSR 120

Query: 1285 EALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYNTI 1106
            +  QVFD+WKNIE +R N  N  +II + C EG M EA+ AF+  +   + PSL IYN+I
Sbjct: 121  QIPQVFDMWKNIEKSRINEFNSQKIIGMLCEEGLMEEAVRAFQEMEGFALKPSLEIYNSI 180

Query: 1105 IHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGC 926
            IH +++   F+ +      M E  L P  +TY+GL++AYG + +YDE+  C+K M+L GC
Sbjct: 181  IHGYSKIGKFNEALLFLNEMKEMNLSPQSDTYDGLIQAYGKYKMYDEIDMCLKMMKLDGC 240

Query: 925  FPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIER 746
             PD +TYN+LI E+A +GLL++ME TY+ + +K ++L++ST+V++L+AY   G L+K+E+
Sbjct: 241  SPDHITYNLLIQEFACAGLLKRMEGTYKSMLTKRMHLRSSTMVAILDAYMNFGMLDKMEK 300

Query: 745  VYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSSA 566
             Y+R+L  +  +KEDL+RKLA VYI+NY F++L++LG+D+  +  G+ +LVW + LLS A
Sbjct: 301  FYKRLLNSRTPLKEDLVRKLAEVYIKNYMFSRLDDLGDDL-ASRIGRTELVWCLRLLSHA 359

Query: 565  GLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLI 386
             L+S +G++S++ EME  KV  N+   NI+   YLK++DF+ L     +    ++KPD++
Sbjct: 360  CLLSHRGIDSVVREMESAKVRWNVTTANIILLAYLKMKDFKHLRVLLSELPTRHVKPDIV 419

Query: 385  TYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYITV 206
            T GI +DA  IG+DG   LE W+R  +L   V I TD LVL+ +GKG FL+ CE+ Y ++
Sbjct: 420  TIGILYDARRIGFDGTGALEMWKRIGFLFKTVEINTDPLVLAVYGKGHFLRYCEEVYSSL 479

Query: 205  HSRQKQKKIWRYSDVIRLVL 146
                ++KK W Y ++I LV+
Sbjct: 480  EPYSREKKRWTYQNLIDLVI 499


>gb|EXC14264.1| hypothetical protein L484_021763 [Morus notabilis]
          Length = 664

 Score =  390 bits (1003), Expect = e-106
 Identities = 193/444 (43%), Positives = 295/444 (66%)
 Frame = -1

Query: 1480 QDWEQRHRTLLVDTFHRNNGLRSLIHEASRNSGSSALQLLEKDGDWTEDNLWAMVSFLVE 1301
            Q+    H TLLV+TFH +   ++L+   S+N  S  ++LL +DGDW +++ WA+V FL  
Sbjct: 57   QNSSTEHTTLLVETFHEHRKFKTLLKRLSKND-SCPMRLLREDGDWCKEHFWAVVRFLRH 115

Query: 1300 TGRAEEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLA 1121
              R +E +QVFDLWKNIE +R N  N+ +IIK+   EG M EA+ +FE  K   + P+L 
Sbjct: 116  GSRTKEIVQVFDLWKNIEKSRINELNYCKIIKMLGEEGLMEEAVLSFEEMKSCGLSPTLE 175

Query: 1120 IYNTIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRM 941
            +YN++IH F++K DF ++      M E  ++P  +TY GL+ AY  + +YDE+  C+K+M
Sbjct: 176  VYNSMIHGFSQKGDFDDALVYLNEMREQNVVPETDTYEGLIEAYAKYEMYDEIGLCLKKM 235

Query: 940  ELSGCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNL 761
            +L+GC PD +TYN+L+ ++++ GLL++MES Y  + SK + LQ+STLV+MLE YA  G L
Sbjct: 236  KLNGCPPDHITYNLLMRKFSKGGLLKRMESVYHTMISKRMYLQSSTLVAMLETYARFGIL 295

Query: 760  EKIERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYIL 581
            +K+E+ Y R LK K  + EDLIRKLA VYI NY F++LE LG D+ T  +G+ DL+W + 
Sbjct: 296  DKMEKFYMRTLKTKTPLGEDLIRKLAEVYIDNYLFSRLETLGVDLST-TFGETDLLWCLR 354

Query: 580  LLSSAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNI 401
            LLS A L S+KG++ ++ EME   +P N+   NI+   +LK++DF  L  +  Q    ++
Sbjct: 355  LLSHAFLFSRKGMDFVIQEMERAHIPWNVTFANIILLTHLKMKDFTHLRISLSQL-THSV 413

Query: 400  KPDLITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEK 221
            +PD++T GI FDA  +G+DG   LE W+R  +    V + TD +V++AFGKG+FL+ CE+
Sbjct: 414  EPDIVTVGILFDAIGMGFDGTRTLETWKRMDFFYKAVEMNTDPVVITAFGKGNFLQNCER 473

Query: 220  KYITVHSRQKQKKIWRYSDVIRLV 149
             Y ++ S  ++ K W Y++++ LV
Sbjct: 474  AYSSLESEVRETKSWTYNNLVDLV 497


>ref|XP_006855721.1| hypothetical protein AMTR_s00044p00151840 [Amborella trichopoda]
            gi|548859508|gb|ERN17188.1| hypothetical protein
            AMTR_s00044p00151840 [Amborella trichopoda]
          Length = 506

 Score =  390 bits (1002), Expect = e-106
 Identities = 197/445 (44%), Positives = 291/445 (65%)
 Frame = -1

Query: 1483 PQDWEQRHRTLLVDTFHRNNGLRSLIHEASRNSGSSALQLLEKDGDWTEDNLWAMVSFLV 1304
            PQD   +HR LLV  F +   L  LI +     G   L+LL  +GDW +D  WA++  L 
Sbjct: 60   PQD--SKHRALLVQNFFQTQQLLDLIEKIK--GGIDPLKLLRDEGDWNKDQFWAVMKLLK 115

Query: 1303 ETGRAEEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSL 1124
            ET R +EA+QVFD W N+E +R + SN+ ++I+L    G M EA +  +  K   + P++
Sbjct: 116  ETSRIKEAMQVFDYWVNVERSRLDDSNYTKMIELLVDAGLMDEATTMLKEVKDFGVRPTV 175

Query: 1123 AIYNTIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKR 944
            A+YN I+H +A   +F  +N     M + GL+P  ETY+GL+RAYG   +YD+M+KC K+
Sbjct: 176  AVYNFIVHGYANTGNFDKANLFLREMRDLGLVPESETYDGLIRAYGNHRMYDDMAKCAKK 235

Query: 943  MELSGCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGN 764
            ME  G  PD +TYNILI E+AR GL+ +ME  YR L SK + LQ STLV+MLEAYA +G 
Sbjct: 236  MESEGFTPDHLTYNILIREFARGGLMVRMEGAYRTLLSKKMGLQYSTLVAMLEAYAALGC 295

Query: 763  LEKIERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYI 584
            + ++E V+RR+LK K  +KEDL+RK+A  YI+N+RF++LE+LG  V  +  G+ DL W +
Sbjct: 296  VNEMETVFRRLLKSKIPLKEDLVRKVARAYIKNHRFSRLEDLGLGV-ASKTGRTDLFWCL 354

Query: 583  LLLSSAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCN 404
            LLLS A L S+KG++S++ EM+   V  N+   NI A  YLK++D + L+    Q ++ N
Sbjct: 355  LLLSHACLCSRKGIKSVIQEMKSAMVRPNVTFANITALTYLKMKDVQYLDVLLSQLQLLN 414

Query: 403  IKPDLITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCE 224
            + PD++T G+  DA   G+D I  L  WR++ +L   V + TD LVL+AFGKG FL+ CE
Sbjct: 415  VNPDIVTVGVVMDAYVSGFDDIKALRMWRKTGFLRRPVEMNTDPLVLTAFGKGYFLRSCE 474

Query: 223  KKYITVHSRQKQKKIWRYSDVIRLV 149
            + Y+++ ++ +++K+W Y+D+I LV
Sbjct: 475  ELYLSLGAKGRERKVWTYNDLIDLV 499


>ref|XP_002321108.2| hypothetical protein POPTR_0014s14700g [Populus trichocarpa]
            gi|550324215|gb|EEE99423.2| hypothetical protein
            POPTR_0014s14700g [Populus trichocarpa]
          Length = 508

 Score =  383 bits (983), Expect = e-103
 Identities = 199/461 (43%), Positives = 293/461 (63%), Gaps = 17/461 (3%)
 Frame = -1

Query: 1480 QDWEQRHRTLLVDTFHRNNGLRSLIHEASRNSGSSALQLLEKDGDWTEDNLWAMVSFLVE 1301
            QD   +H TLLVD+FH +  L+SL+H    NS  + LQLL++DGDW++D+ W+++ FL  
Sbjct: 46   QDHSTKHTTLLVDSFHEHKRLKSLLHNL--NSNQNPLQLLQQDGDWSKDDFWSVIKFLKL 103

Query: 1300 TGRAEEALQV-----------------FDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEA 1172
            + R+ + LQV                 F +W+++E TR N  N+ +II L   EG M +A
Sbjct: 104  SARSNQILQVHSLAHLFFLAARKIEFVFHMWRDVEKTRINEFNYEKIIGLLGEEGLMEDA 163

Query: 1171 MSAFEATKKSDIFPSLAIYNTIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRA 992
            ++AF   K   +  SL +YN+IIH +AR   F ++      M E  L P  +TY+GL+ A
Sbjct: 164  VTAFMEMKSFGLCLSLEVYNSIIHGYARNGKFDDALFYLNQMNEMNLSPESDTYDGLIEA 223

Query: 991  YGCFGLYDEMSKCVKRMELSGCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQ 812
            YG + +YDEM+ C+K+MEL GC PD  TYN+LI ++A+ GLL +ME  Y+ + +K + LQ
Sbjct: 224  YGTYRMYDEMAMCLKKMELDGCSPDRYTYNLLIQKFAQGGLLTRMERVYQSMRTKRMKLQ 283

Query: 811  ASTLVSMLEAYAEMGNLEKIERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGN 632
            +STL+SMLEAYA  G +EK+E++ R     K  +KEDL+RKLA VYI NY F++L +L  
Sbjct: 284  SSTLISMLEAYANFGIVEKMEKILRWAWNSKITVKEDLVRKLAGVYIANYMFSRLHDLAV 343

Query: 631  DVRTANWGKIDLVWYILLLSSAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIR 452
            D+ T+  G+ D+VW + LLS A L+S++G+++++ EME+ K   NI + NI+   YLK++
Sbjct: 344  DL-TSITGRTDIVWCLHLLSHACLLSRRGMDAVVREMEDAKACWNITVANIILLAYLKMK 402

Query: 451  DFRSLNGAFRQARVCNIKPDLITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQ 272
            DF  L     +     ++PD++T+GI FDA +IG+DG   LE WR+   L   V + TD 
Sbjct: 403  DFTRLRILLSKLPEIRVEPDIVTFGILFDAEEIGFDGKECLEMWRKMGLLYRRVEMNTDP 462

Query: 271  LVLSAFGKGSFLKLCEKKYITVHSRQKQKKIWRYSDVIRLV 149
            L LSAFGKGSFL+ CE+ Y ++    ++KK W Y D I LV
Sbjct: 463  LALSAFGKGSFLRSCEEGYSSLEPNAREKKRWTYVDFINLV 503


>gb|EOY05094.1| Pentatricopeptide repeat-containing protein, putative [Theobroma
            cacao]
          Length = 504

 Score =  383 bits (983), Expect = e-103
 Identities = 200/447 (44%), Positives = 292/447 (65%)
 Frame = -1

Query: 1468 QRHRTLLVDTFHRNNGLRSLIHEASRNSGSSALQLLEKDGDWTEDNLWAMVSFLVETGRA 1289
            + H  LLV+T+H +  L++L+    ++  S  LQ+L  DGDWT+D  W ++ FL    R+
Sbjct: 60   KNHTALLVETYHHHRRLKALLERLEKDD-SCPLQMLRDDGDWTKDIFWVVIRFLRRASRS 118

Query: 1288 EEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYNT 1109
             E LQVF +WKNIE +R N  N+ +II L   EG + +A+ A        + PSL +YN+
Sbjct: 119  NEILQVFHMWKNIEKSRINELNYEKIIGLLGEEGRVGQAVQALREMGGYGLKPSLEVYNS 178

Query: 1108 IIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSG 929
            IIHA+AR   F ++ +    M E GL P  +TY+GL+ AYG + +YDE+  C+K MEL  
Sbjct: 179  IIHAYARNGKFDDALSFLNEMKEIGLAPETDTYDGLIEAYGKYKMYDEIGTCLKMMELDR 238

Query: 928  CFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIE 749
            C PD  TYN+LI E++R GLL++ME  Y++L SK +NLQ+S+LV+MLEAYA  G L+K+E
Sbjct: 239  CRPDHFTYNLLIREFSRGGLLQRMEQVYQILLSKQMNLQSSSLVAMLEAYANFGILDKME 298

Query: 748  RVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSS 569
            +VYR+++     +KED IR LA VYI+NY F++L++LG D+ ++  G+ DLVW + LLS 
Sbjct: 299  KVYRKVVN-SMTLKEDTIRILASVYIKNYMFSRLDDLGIDL-SSRTGRNDLVWCLRLLSH 356

Query: 568  AGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDL 389
            A L+S+KG++S++ EM E K   N+ I NI+   Y+K++DF+ L     Q     ++PD+
Sbjct: 357  ACLLSRKGMDSVILEMCEAKASWNVTISNIILLAYMKMKDFKRLRILLSQLPSHQVRPDI 416

Query: 388  ITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYIT 209
            IT GI  DA +IG+DG   LE WR+   L   V + TD LVL AFGKG FL+ CE+ Y +
Sbjct: 417  ITIGILSDAIEIGFDGAEALETWRKMGLLYRTVEMNTDPLVLIAFGKGHFLRDCEEIYTS 476

Query: 208  VHSRQKQKKIWRYSDVIRLVLG*KAKR 128
            +  + +++K W Y  +I LV+  KAKR
Sbjct: 477  LEPKARKEKRWTYHHLIDLVIKHKAKR 503


>ref|XP_004298231.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 509

 Score =  380 bits (975), Expect = e-102
 Identities = 193/435 (44%), Positives = 277/435 (63%)
 Frame = -1

Query: 1462 HRTLLVDTFHRNNGLRSLIHEASRNSGSSALQLLEKDGDWTEDNLWAMVSFLVETGRAEE 1283
            H TL V+  H  + LR+L+ +         LQLL  DGDWT D  WA++ FL+   R +E
Sbjct: 68   HTTLHVEPSHEYHKLRALL-DILMEKDCCPLQLLRDDGDWTIDQFWAVIRFLIHASRPKE 126

Query: 1282 ALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYNTII 1103
             LQ+FD+W+NIE +R N  N+ +II L   E  + EA+  F+  K   +  S+ +YNTII
Sbjct: 127  ILQLFDIWRNIEKSRINEFNYSKIIGLLVEEDLIEEAVVCFQDMKSQGLGLSVELYNTII 186

Query: 1102 HAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGCF 923
            H  +R  +F ++      M E  L P  +TY+GL+ AYG + +YDEM  C+K+M L+GC 
Sbjct: 187  HGLSRNGNFVDAVHFLNEMKEMNLAPDADTYDGLIEAYGKYKMYDEMGMCLKKMRLNGCS 246

Query: 922  PDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIERV 743
            PD +TYN+LI E+A  GLL ++E  Y+ + S+ ++LQ  TL+++LE YA+ G LEK+E  
Sbjct: 247  PDYITYNLLIREFAHGGLLNRVERVYQSMVSRRMDLQVPTLIAILEVYAKFGILEKMEVF 306

Query: 742  YRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSSAG 563
            YRR+L  +A +KEDLI+K+A VYI NY F++LE LG D+ +  +G+ DLVW + LLS AG
Sbjct: 307  YRRVLNSRAILKEDLIKKVAEVYIENYMFSKLENLGVDL-SPRFGQTDLVWCLRLLSHAG 365

Query: 562  LVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLIT 383
            L+S++G+ SI+ EME + VP N  + NI+   YLK++DF  L   F Q+    + PD+IT
Sbjct: 366  LLSRRGMNSIILEMEGKSVPWNATVANIMMLAYLKMKDFTRLRSLFSQSLTRGVDPDIIT 425

Query: 382  YGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYITVH 203
            +GI FDA  IGYDG   L  WR+   L   V + TD LV++ FGKG FL+ CE  Y ++ 
Sbjct: 426  FGILFDANRIGYDGSATLNTWRKHGILYKAVEMNTDPLVITTFGKGHFLRNCEAAYSSLE 485

Query: 202  SRQKQKKIWRYSDVI 158
               ++KK W Y D+I
Sbjct: 486  PEVREKKTWTYQDLI 500


>ref|XP_002516403.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223544501|gb|EEF46020.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 502

 Score =  380 bits (975), Expect = e-102
 Identities = 191/446 (42%), Positives = 295/446 (66%)
 Frame = -1

Query: 1486 LPQDWEQRHRTLLVDTFHRNNGLRSLIHEASRNSGSSALQLLEKDGDWTEDNLWAMVSFL 1307
            + QD   +H TLLV+++H +  L++L+   ++  GS  LQ+L+ D DW++D+ WA++ FL
Sbjct: 49   ISQDNSIKHNTLLVESYHEHQRLKALLARLNKK-GSCPLQMLQDDADWSKDHFWAVIRFL 107

Query: 1306 VETGRAEEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPS 1127
              + R++E LQVFD+WK+IE +R N  N+ ++I++   EG + +A SAF   K   + PS
Sbjct: 108  RHSSRSDEILQVFDMWKDIEKSRINEFNYEKVIEILGEEGLIEDAYSAFIEMKTLCLSPS 167

Query: 1126 LAIYNTIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVK 947
            L +YN++IH +AR   F ++      + E  L P  +TYNGL++AYG + +YDEM  C+K
Sbjct: 168  LQVYNSLIHGYARNGKFDDAVFYLNHLKEINLSPVSDTYNGLIQAYGKYKMYDEMGMCLK 227

Query: 946  RMELSGCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMG 767
            +ME+ GC PD VTYN+LI E A +GLL +ME  Y+      ++L+++TL +MLEAYA  G
Sbjct: 228  KMEMEGCSPDHVTYNLLIQELAEAGLLTRMEKVYQTTRMNRMDLKSTTLTAMLEAYANFG 287

Query: 766  NLEKIERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWY 587
             +EK+E + +R    KA +KEDLI+K+ALVYI N+ F++LE+LG+ +   + G+ D+VW 
Sbjct: 288  IVEKMELILKRTRNSKALLKEDLIKKIALVYIENFMFSRLEKLGHYLSKRS-GQNDMVWC 346

Query: 586  ILLLSSAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVC 407
            +LLLS+A ++S+KG++S++ EM+  KV  N+  +NI+   YLK++D   L          
Sbjct: 347  LLLLSNACMLSQKGMDSVVREMKVAKVSWNVTFINIILLAYLKMKDSMRLGILLSTLTNH 406

Query: 406  NIKPDLITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLC 227
             +KPD++T G+ FDA +IG+ G  +LE WRR+  L   V   TD LVL+AFGKG FLK C
Sbjct: 407  IVKPDIVTVGVLFDANNIGFHGNGILETWRRTGILYRCVETETDPLVLAAFGKGQFLKKC 466

Query: 226  EKKYITVHSRQKQKKIWRYSDVIRLV 149
            E+ Y ++    +QK+ W Y ++I LV
Sbjct: 467  EEAYSSLEPVARQKEKWTYCNLIDLV 492


>ref|NP_193155.4| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|223635638|sp|O23278.2|PP310_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g14190, chloroplastic; Flags: Precursor
            gi|332657991|gb|AEE83391.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 501

 Score =  348 bits (892), Expect = 5e-93
 Identities = 184/432 (42%), Positives = 272/432 (62%), Gaps = 2/432 (0%)
 Frame = -1

Query: 1435 HRNNGLRSLIHEASRNSGSSALQLLEKDGDWTEDNLWAMVSFLVETGRAEEALQVFDLWK 1256
            H +  L SL    S  SGS  L+LL++DGDW++D+ WA++ FL ++ R  E L VFD WK
Sbjct: 64   HHHRFLSSLTRRLSL-SGSCPLRLLQEDGDWSKDHFWAVIRFLRQSSRLHEILPVFDTWK 122

Query: 1255 NIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEAT-KKSDIFPSLAIYNTIIHAFARKRD 1079
            N+E +R + +N+ RII+  C E  M EA+ AF +     ++ PSL IYN+IIH++A    
Sbjct: 123  NLEPSRISENNYERIIRFLCEEKSMSEAIRAFRSMIDDHELSPSLEIYNSIIHSYADDGK 182

Query: 1078 FHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGCFPDEVTYNI 899
            F  +      M E GLLP  ETY+GL+ AYG + +YDE+  C+KRME  GC  D VTYN+
Sbjct: 183  FEEAMFYLNHMKENGLLPITETYDGLIEAYGKWKMYDEIVLCLKRMESDGCVRDHVTYNL 242

Query: 898  LITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIERVYRRILKLK 719
            LI E++R GLL++ME  Y+ L S+ + L+ STL+SMLEAYAE G +EK+E    +I++  
Sbjct: 243  LIREFSRGGLLKRMEQMYQSLMSRKMTLEPSTLLSMLEAYAEFGLIEKMEETCNKIIRFG 302

Query: 718  AYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSSAGLVSKKGVE 539
              + E L+RKLA VYI N  F++L++LG  +  +   + +L W + LL  A LVS+KG++
Sbjct: 303  ISLDEGLVRKLANVYIENLMFSRLDDLGRGISASRTRRTELAWCLRLLCHARLVSRKGLD 362

Query: 538  SILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLITYGIFFDAC 359
             ++ EMEE +VP N    NI    Y K+ DF S+     + R+ ++K DL+T GI FD  
Sbjct: 363  YVVKEMEEARVPWNTTFANIALLAYSKMGDFTSIELLLSELRIKHVKLDLVTVGIVFDLS 422

Query: 358  DIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEK-KYITVHSRQKQKK 182
            +  +DG  V   W++  +L+  V ++TD LV +AFGKG FL+ CE+ K  ++ +R  + K
Sbjct: 423  EARFDGTGVFMTWKKIGFLDKPVEMKTDPLVHAAFGKGQFLRSCEEVKNQSLGTRDGESK 482

Query: 181  IWRYSDVIRLVL 146
             W Y  ++ LV+
Sbjct: 483  SWTYQYLMELVV 494


>ref|XP_006414780.1| hypothetical protein EUTSA_v10027442mg [Eutrema salsugineum]
            gi|557115950|gb|ESQ56233.1| hypothetical protein
            EUTSA_v10027442mg [Eutrema salsugineum]
          Length = 495

 Score =  345 bits (886), Expect = 3e-92
 Identities = 184/439 (41%), Positives = 272/439 (61%), Gaps = 2/439 (0%)
 Frame = -1

Query: 1456 TLLVDTFHRNNGLRSLIHEASRNSGSSALQLLEKDGDWTEDNLWAMVSFLVETGRAEEAL 1277
            +LL D++H ++   + +      +GS  L+LL +DGDW++   WA+V FL  + R  E L
Sbjct: 55   SLLSDSYHHHHRFLNSLPRRLSRTGSCPLRLLREDGDWSKHQFWAVVRFLRHSSRLHEIL 114

Query: 1276 QVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEAT-KKSDIFPSLAIYNTIIH 1100
             VFD WKN+E +R N +N+ +I++  C E  M EA+ AF+    + ++ PSL IYN+IIH
Sbjct: 115  PVFDAWKNLEPSRINEANYEKILRFLCEEKSMNEAIRAFQCMIDEHELSPSLEIYNSIIH 174

Query: 1099 AFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGCFP 920
             +A    F  +      M E  +LP  ETY+GL+ AYG + LYDE+  C+K+ME  GC  
Sbjct: 175  GYANDGKFEEAMFYMNHMKENDMLPETETYDGLIEAYGKWKLYDEIVLCIKKMESDGCVR 234

Query: 919  DEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIERVY 740
            D VTYN+LI E+AR GLL++ME  Y+ L S+ + L+  TL+SMLEAYAE G LEK+E  Y
Sbjct: 235  DHVTYNLLIREFARGGLLKRMEQMYQSLMSRKMTLEPCTLLSMLEAYAEFGVLEKMEDTY 294

Query: 739  RRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSSAGL 560
             +I++    + EDL+RK+A VYI N  F++L++LG  +R     + DL W + LL  A L
Sbjct: 295  NKIVRFGISLDEDLVRKVANVYIDNLMFSRLDDLGRGIR-----RTDLAWCLRLLCHACL 349

Query: 559  VSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLITY 380
            VS+KG++ ++ EMEE +VP N    NI+   Y K+ DFRS+     + R  ++K DL+T 
Sbjct: 350  VSRKGLDYVVKEMEEARVPWNATFANIVLLAYSKMGDFRSVELLLSELRTKHVKLDLVTV 409

Query: 379  GIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEK-KYITVH 203
            GI  D    G+DG  V   W++  +L+  V  +TD LV +AFGKG FL+ CE+ K   + 
Sbjct: 410  GIVLDLSVDGFDGTGVFMTWKKIGFLDKPVETKTDPLVHAAFGKGRFLRSCEEVKNQVLG 469

Query: 202  SRQKQKKIWRYSDVIRLVL 146
            +R ++ K W Y  ++ LV+
Sbjct: 470  TRVEESKSWTYQYLMELVV 488


>ref|XP_002868305.1| binding protein [Arabidopsis lyrata subsp. lyrata]
            gi|297314141|gb|EFH44564.1| binding protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 502

 Score =  342 bits (878), Expect = 2e-91
 Identities = 184/450 (40%), Positives = 275/450 (61%), Gaps = 2/450 (0%)
 Frame = -1

Query: 1489 PLPQDWEQRHRTLLVDTFHRNNGLRSLIHEASRNSGSSALQLLEKDGDWTEDNLWAMVSF 1310
            PL  + +    T L+   HR     S +       GS  L+LL++ GDW++D+ WA++ F
Sbjct: 49   PLSINGDASQSTSLIHHHHR---FLSSLPRRLELPGSCPLRLLQEYGDWSKDHFWAVIRF 105

Query: 1309 LVETGRAEEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEAT-KKSDIF 1133
            L  + R  E L VFD WKN+E +R + +N+ R+I+L C E  M EA+ AF       ++ 
Sbjct: 106  LRHSSRLHEILPVFDAWKNLERSRISEANYERVIRLLCEEKSMNEAIRAFRGMIDDHELS 165

Query: 1132 PSLAIYNTIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKC 953
            PSL IYN+IIH +A +  F  +      M E GLLP  ETY+GL+ AYG + +YDE+  C
Sbjct: 166  PSLEIYNSIIHGYADEGKFEEAMFYLNHMKENGLLPITETYDGLIEAYGKWKMYDEIVLC 225

Query: 952  VKRMELSGCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAE 773
            +KRME  GC  D VTYN+LI E++R GLL++ME  Y+ L S+ + L+ STL+SMLEAYAE
Sbjct: 226  LKRMESEGCVRDHVTYNLLIREFSRGGLLKRMEQMYQSLMSRKMTLEPSTLLSMLEAYAE 285

Query: 772  MGNLEKIERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLV 593
             G +EK+E    +I++    + E L+RKLA VYI N  F++L++LG  + ++   + DL 
Sbjct: 286  FGLIEKMEETCNKIIRFGISLDEGLVRKLANVYIDNLMFSRLDDLGRGISSSRTRRTDLA 345

Query: 592  WYILLLSSAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQAR 413
            W + LL  A LVS+KG++ ++ EM+E +VP N    NI    Y K+ DF+S+     + R
Sbjct: 346  WCLRLLCHARLVSRKGLDYVIKEMKEARVPWNTTFANITLLAYSKMGDFKSIELLLSELR 405

Query: 412  VCNIKPDLITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLK 233
              ++K DL+T GI FD  + G+D   V   W++  +L+  V ++TD LV +AFGKG FLK
Sbjct: 406  TKHVKLDLVTVGIIFDLSEAGFDVTGVFMTWKKIGFLDKPVEMKTDPLVHAAFGKGKFLK 465

Query: 232  LCEK-KYITVHSRQKQKKIWRYSDVIRLVL 146
             CE+ K  ++  R ++ K W Y  ++ +V+
Sbjct: 466  SCEEVKNQSLGMRGEESKAWTYQYLMEVVV 495


>ref|XP_006492991.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190,
            chloroplastic-like [Citrus sinensis]
          Length = 477

 Score =  338 bits (866), Expect = 5e-90
 Identities = 174/440 (39%), Positives = 274/440 (62%)
 Frame = -1

Query: 1465 RHRTLLVDTFHRNNGLRSLIHEASRNSGSSALQLLEKDGDWTEDNLWAMVSFLVETGRAE 1286
            +H TLLV+++H +  L +LI   ++   S  LQ+L+ DGDWT+D+ WA++ FL  + R+ 
Sbjct: 62   KHTTLLVESYHEHQALNALIQRLNKKV-SCPLQILQHDGDWTKDHFWAVIRFLKNSSRSR 120

Query: 1285 EALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYNTI 1106
            +  QVFD+WKNIE +R N  N+ +II + C EG M EA+ AF+  +   + PSL IYN+I
Sbjct: 121  QIPQVFDMWKNIEKSRINEFNYQKIIGMLCEEGLMEEAVRAFQEMEGFALKPSLEIYNSI 180

Query: 1105 IHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGC 926
            IH +++   F+ +      M E  L P  +TY+GL++AY                     
Sbjct: 181  IHGYSKIGKFNEALLFLNEMKEMNLSPQSDTYDGLIQAY--------------------- 219

Query: 925  FPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIER 746
                        E+A +GLL++ME TY+ + +K ++L++ST+V++L+AY   G L+K+E+
Sbjct: 220  ------------EFACAGLLKRMEGTYKSMLTKRMHLRSSTMVAILDAYMNFGMLDKMEK 267

Query: 745  VYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSSA 566
             Y+R+L  +  +KEDL+RKLA VYI+NY F++L++LG+D+  +  G+ +LVW + LLS A
Sbjct: 268  FYKRLLNSRTPLKEDLVRKLAEVYIKNYMFSRLDDLGDDL-ASRIGRTELVWCLRLLSHA 326

Query: 565  GLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLI 386
             L+S +G++S++ EME  KV  N+   NI+   YLK++DF+ L     +    ++KPD++
Sbjct: 327  CLLSHRGIDSVVREMESAKVRWNVTTANIILLAYLKMKDFKHLRVLLSELPTRHVKPDIV 386

Query: 385  TYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYITV 206
            T GI +DA  IG+DG   LE WRR  +L   V I TD LVL+ +GKG FL+ CE+ Y ++
Sbjct: 387  TIGILYDARRIGFDGTGALEMWRRIGFLSKTVEINTDPLVLAVYGKGHFLRYCEEVYSSL 446

Query: 205  HSRQKQKKIWRYSDVIRLVL 146
                ++KK W Y ++I LV+
Sbjct: 447  EPYSREKKRWTYQNLIDLVI 466


>ref|XP_006283572.1| hypothetical protein CARUB_v10004636mg [Capsella rubella]
            gi|482552277|gb|EOA16470.1| hypothetical protein
            CARUB_v10004636mg [Capsella rubella]
          Length = 501

 Score =  329 bits (844), Expect = 2e-87
 Identities = 173/414 (41%), Positives = 254/414 (61%), Gaps = 1/414 (0%)
 Frame = -1

Query: 1384 GSSALQLLEKDGDWTEDNLWAMVSFLVETGRAEEALQVFDLWKNIEMTRNNPSNHLRIIK 1205
            GS  LQLL++DGDW++D+ WA++ FL  + R  E L V+D WKN+E +R +  N+ R+I+
Sbjct: 87   GSCPLQLLQEDGDWSKDHFWAVIRFLRHSSRLHEILPVYDAWKNLEPSRISVVNYERVIR 146

Query: 1204 LFCGEGFMIEAMSAFEATKKSD-IFPSLAIYNTIIHAFARKRDFHNSNATFAMMLEAGLL 1028
              C E  M EA+ AF +    D + PSL IYN+IIH +A    F  +      M E GL 
Sbjct: 147  FLCEERSMNEAIRAFRSMIDDDELSPSLEIYNSIIHGYADDGKFEEAMFYLNQMKENGLS 206

Query: 1027 PTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGCFPDEVTYNILITEYARSGLLEKMEST 848
            P  ETY+GL+ AYG + +YDE+  CV+RME  GC  D VTYN+LI +++R GLL++ME  
Sbjct: 207  PISETYDGLIEAYGKWKMYDEIVLCVRRMESDGCVRDHVTYNLLIRQFSRGGLLKRMEQM 266

Query: 847  YRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIERVYRRILKLKAYIKEDLIRKLALVYIR 668
            Y+ L S+ + L+  TL+SMLEAYAE G +EK+E    +I++    + + L+RKLA VYI 
Sbjct: 267  YQSLMSRKMTLEPCTLLSMLEAYAEFGVIEKMEETCNKIIRFGISLDDGLVRKLAKVYID 326

Query: 667  NYRFAQLEELGNDVRTANWGKIDLVWYILLLSSAGLVSKKGVESILHEMEEEKVPININI 488
            N  F++L++LG  +  +   + DL W + LL  + LVS+KG++ +L EM E KV  N   
Sbjct: 327  NLMFSRLDDLGRGISYSRTRRSDLAWCLRLLCHSRLVSRKGLDYVLKEMTEAKVTWNTTF 386

Query: 487  VNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLITYGIFFDACDIGYDGIHVLEEWRRSR 308
             NI+   Y K+ DF+S+       R   +K DL+T GI FD  + G+DG  V   W++  
Sbjct: 387  ANIVLLAYSKMGDFKSIELLLDGLRTKRVKLDLVTVGIVFDLSEAGFDGTGVFMTWKKIG 446

Query: 307  YLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYITVHSRQKQKKIWRYSDVIRLVL 146
            +L+  V ++TD LV +AFGKG FL+ CE+       R +    W Y +++ LV+
Sbjct: 447  FLDKPVEMKTDPLVHAAFGKGQFLRRCEE------MRGEDPTPWTYQNLMELVV 494


>emb|CAB10198.1| salt-inducible protein homolog [Arabidopsis thaliana]
            gi|7268124|emb|CAB78461.1| salt-inducible protein homolog
            [Arabidopsis thaliana]
          Length = 561

 Score =  328 bits (842), Expect = 3e-87
 Identities = 174/419 (41%), Positives = 259/419 (61%), Gaps = 15/419 (3%)
 Frame = -1

Query: 1357 KDGDWTEDNLWAMVSFLVETGRAEEAL-------------QVFDLWKNIEMTRNNPSNHL 1217
            +DGDW++D+ WA++ FL ++ R  E L             QVFD WKN+E +R + +N+ 
Sbjct: 136  EDGDWSKDHFWAVIRFLRQSSRLHEILPNMKMTFCFFFQLQVFDTWKNLEPSRISENNYE 195

Query: 1216 RIIKLFCGEGFMIEAMSAFEAT-KKSDIFPSLAIYNTIIHAFARKRDFHNSNATFAMMLE 1040
            RII+  C E  M EA+ AF +     ++ PSL IYN+IIH++A    F  +      M E
Sbjct: 196  RIIRFLCEEKSMSEAIRAFRSMIDDHELSPSLEIYNSIIHSYADDGKFEEAMFYLNHMKE 255

Query: 1039 AGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGCFPDEVTYNILITEYARSGLLEK 860
             GLLP  ETY+GL+ AYG + +YDE+  C+KRME  GC  D VTYN+LI E++R GLL++
Sbjct: 256  NGLLPITETYDGLIEAYGKWKMYDEIVLCLKRMESDGCVRDHVTYNLLIREFSRGGLLKR 315

Query: 859  MESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIERVYRRILKLKAYIKEDLIRKLAL 680
            ME  Y+ L S+ + L+ STL+SMLEAYAE G +EK+E    +I++    + E L+RKLA 
Sbjct: 316  MEQMYQSLMSRKMTLEPSTLLSMLEAYAEFGLIEKMEETCNKIIRFGISLDEGLVRKLAN 375

Query: 679  VYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSSAGLVSKKGVESILHEMEEEKVPI 500
            VYI N  F++L++LG  +  +   + +L W + LL  A LVS+KG++ ++ EMEE +VP 
Sbjct: 376  VYIENLMFSRLDDLGRGISASRTRRTELAWCLRLLCHARLVSRKGLDYVVKEMEEARVPW 435

Query: 499  NINIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLITYGIFFDACDIGYDGIHVLEEW 320
            N    NI    Y K+ DF S+     + R+ ++K DL+T GI FD  +  +DG  V   W
Sbjct: 436  NTTFANIALLAYSKMGDFTSIELLLSELRIKHVKLDLVTVGIVFDLSEARFDGTGVFMTW 495

Query: 319  RRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEK-KYITVHSRQKQKKIWRYSDVIRLVL 146
            ++  +L+  V ++TD LV +AFGKG FL+ CE+ K  ++ +R  + K W Y  ++ LV+
Sbjct: 496  KKIGFLDKPVEMKTDPLVHAAFGKGQFLRSCEEVKNQSLGTRDGESKSWTYQYLMELVV 554


>emb|CAN80932.1| hypothetical protein VITISV_017362 [Vitis vinifera]
          Length = 1697

 Score =  321 bits (823), Expect = 5e-85
 Identities = 164/363 (45%), Positives = 235/363 (64%)
 Frame = -1

Query: 1450 LVDTFHRNNGLRSLIHEASRNSGSSALQLLEKDGDWTEDNLWAMVSFLVETGRAEEALQV 1271
            LV+T H N  L  LI + S N  SS LQLL  DGDW + + WA++ FL +  R+ E L V
Sbjct: 1332 LVETLHENERLGVLIQKLS-NKASSPLQLLRDDGDWNKQHFWAVIRFLKDASRSSEILPV 1390

Query: 1270 FDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYNTIIHAFA 1091
            F LWK+++ +R N  N+ +II L   E    E++ A E  K   + PSL IYN +IH FA
Sbjct: 1391 FHLWKDMDKSRINEFNYAKIIGLLSQEDLAEESVLALEXMKTHGLKPSLEIYNLVIHCFA 1450

Query: 1090 RKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGCFPDEV 911
            RK +F  +      +    L+   ETY+GL+++YG + +YDE+ +CVK+ME  GC PD +
Sbjct: 1451 RKGEFDRALYFLNELKXNNLIADTETYDGLIQSYGKYKMYDELDECVKKMESDGCLPDHI 1510

Query: 910  TYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIERVYRRI 731
            TYN+LI E++R GLL++ME  ++ + SK + LQ+STLV MLEAYA  G +EK+E  YRR+
Sbjct: 1511 TYNLLIQEFSRGGLLKRMERVFQTVLSKKMGLQSSTLVVMLEAYANFGIIEKMENAYRRV 1570

Query: 730  LKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSSAGLVSK 551
            L  K  +K+DLIRKLA VYI NY+F++L ++G D+ +    + DLVW + LLS A L+S+
Sbjct: 1571 LNSKTSLKDDLIRKLAEVYIENYKFSRLADMGLDLASVT-SRTDLVWCLRLLSHACLLSR 1629

Query: 550  KGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLITYGIF 371
            KG++SI+ EME + VP N  + N +   YLK++DF  L     +    ++KPD++T GI 
Sbjct: 1630 KGLDSIVKEMEAKNVPWNATVANTILLAYLKMKDFTRLRILLLELSTRHVKPDIVTVGIL 1689

Query: 370  FDA 362
            FDA
Sbjct: 1690 FDA 1692


>ref|XP_002966251.1| hypothetical protein SELMODRAFT_85839 [Selaginella moellendorffii]
            gi|300165671|gb|EFJ32278.1| hypothetical protein
            SELMODRAFT_85839 [Selaginella moellendorffii]
          Length = 358

 Score =  151 bits (381), Expect = 9e-34
 Identities = 88/317 (27%), Positives = 158/317 (49%)
 Frame = -1

Query: 1174 AMSAFEATKKSDIFPSLAIYNTIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLR 995
            A   F+  +   + PS+  ++ ++ ++A   +   + +    ML+ G+ P   TY GL+R
Sbjct: 11   AQGVFDGMEAMQVRPSVVGFSALVQSYAESGEVEGAQSAMKRMLDTGIQPNVVTYGGLIR 70

Query: 994  AYGCFGLYDEMSKCVKRMELSGCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNL 815
            AYG  GL+DEM+K V  M+   C PD   Y  +I  YA  GL+ +M+  ++ + +     
Sbjct: 71   AYGKRGLFDEMAKVVNTMKTVRCEPDFFVYKNVIEAYASGGLVGRMDKAFKAMRADGWIP 130

Query: 814  QASTLVSMLEAYAEMGNLEKIERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELG 635
             +  L  + + YA MG ++++E     + ++K + +E+ +R  AL YIR+ +F Q+E   
Sbjct: 131  DSDILNLLAQGYASMGMIKEMEGAQGELRRIKGWPREESVRACALAYIRHNQFYQMEGFV 190

Query: 634  NDVRTANWGKIDLVWYILLLSSAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKI 455
              +     G  +L+W +LLL+ A   S K ++     M   +   ++   NI A    ++
Sbjct: 191  KSLGMKRIGG-NLLWNLLLLAHAANFSMKSLQREAVNMWSARCAPDVTTFNIRALALSRM 249

Query: 454  RDFRSLNGAFRQARVCNIKPDLITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTD 275
            +    L+   +  R  +++PDL+TYG   DA  I      + E+       + +  +RTD
Sbjct: 250  QMLWDLHVLVQHMRAESVRPDLVTYGALVDAYAIARLLPRLPEQLDELDMADTIPDVRTD 309

Query: 274  QLVLSAFGKGSFLKLCE 224
             LV  AFG+G F   C+
Sbjct: 310  PLVFQAFGRGRFHAFCD 326

