BLASTX nr result
ID: Zingiber25_contig00005896
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Zingiber25_contig00005896
         (1736 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EMJ24063.1| hypothetical protein PRUPE_ppa004279mg [Prunus pe...  411   e-112
gb|ESW35794.1| hypothetical protein PHAVU_001G265200g [Phaseolus...  409   e-111
ref|XP_003552730.1| PREDICTED: pentatricopeptide repeat-containi...  407   e-110
ref|XP_006421046.1| hypothetical protein CICLE_v10004784mg [Citr...  395   e-107
gb|EXC14264.1| hypothetical protein L484_021763 [Morus notabilis]    394   e-107
ref|XP_002268109.1| PREDICTED: pentatricopeptide repeat-containi...  391   e-106
ref|XP_003538531.1| PREDICTED: pentatricopeptide repeat-containi...  391   e-106
ref|XP_006855721.1| hypothetical protein AMTR_s00044p00151840 [A...  389   e-105
gb|EOY05094.1| Pentatricopeptide repeat-containing protein, puta...  387   e-104
ref|XP_004298231.1| PREDICTED: pentatricopeptide repeat-containi...  384   e-104
ref|XP_002516403.1| pentatricopeptide repeat-containing protein,...  384   e-104
ref|XP_002321108.2| hypothetical protein POPTR_0014s14700g [Popu...  381   e-103
ref|NP_193155.4| pentatricopeptide repeat-containing protein [Ar...  352   4e-94
ref|XP_006414780.1| hypothetical protein EUTSA_v10027442mg [Eutr...  349   2e-93
ref|XP_002868305.1| binding protein [Arabidopsis lyrata subsp. l...  346   2e-92
ref|XP_006492991.1| PREDICTED: pentatricopeptide repeat-containi...  342   4e-91
ref|XP_006283572.1| hypothetical protein CARUB_v10004636mg [Caps...  333   1e-88
emb|CAB10198.1| salt-inducible protein homolog [Arabidopsis thal...  328   3e-87
emb|CAN80932.1| hypothetical protein VITISV_017362 [Vitis vinifera]  319   2e-84
ref|XP_002966251.1| hypothetical protein SELMODRAFT_85839 [Selag...  151   1e-33

>gb|EMJ24063.1| hypothetical protein PRUPE_ppa004279mg [Prunus persica] Length = 518 Score = 411 bits (1057), Expect = e-112 Identities = 206/445 (46%), Positives = 296/445 (66%) Frame = +2 Query: 233 PQDWEQRHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLV 412 P +H TLLV+TFH + L++L+ +GSC LQLL +DGDWT+D WA + FL Sbjct: 65 PDSSSTKHTTLLVETFHEHQRLKALLQNLI--NGSCPLQLLGEDGDWTKDQFWAAIRFLK 122 Query: 413 ETGRAEEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSL 592 T R E LQ+FD+WKNIE +R N N+ +II L EG + EA+ F+ K ++ PSL Sbjct: 123 HTFRFNEILQLFDMWKNIEKSRINEFNYSKIIGLLGEEGLIEEAVRCFQEMKSHNLRPSL 182 Query: 593 AIYNTIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKR 772 +YN++IH AR+ +F ++ M E L P +TY+GL+ AYG + +YD++ CVK+ Sbjct: 183 EVYNSVIHVCARQGNFEDALFFLNEMKEMNLAPETDTYDGLIEAYGKYRMYDQIGMCVKK 242 Query: 773 MELSGCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGN 952 M+L+GC PD +TYN+LI E+AR GLL++MES Y+ + S+ + LQ+STL++M+E YA+ G Sbjct: 243 MKLNGCSPDHITYNLLIREFARGGLLKRMESVYQSMLSRRMALQSSTLIAMVEVYAKFGI 302 Query: 953 LEKIERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYI 1132 LEK+E VYRR+L +K DLIRKLA VYI NY F++LE+LG D+ ++ +G+ DLVW + Sbjct: 303 LEKMENVYRRVLNSGTVVKNDLIRKLAEVYIDNYMFSRLEKLGVDL-SSRFGQTDLVWCL 361 Query: 1133 LLLSSAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCN 1312 LLS AG++S++G++SI+ EM+E+ VP N + NI+ YLK++DF L Q Sbjct: 362 RLLSQAGVLSQRGMDSIVDEMKEQNVPWNETVANIIMLAYLKMKDFTHLRIFLSQLLTQG 421 Query: 1313 IKPDLITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCE 1492 ++PD+IT GI FDA IGYDG L+ WR + +L V + TD LVL+ FGKG FL+ CE Sbjct: 422 VEPDIITVGIVFDANRIGYDGSRTLDTWRENGFLRKAVEMNTDPLVLTTFGKGHFLRNCE 481 Query: 1493 KKYITVHSRQKQKKIWRYSDVIRLV 1567 Y ++ ++ K W Y +I LV Sbjct: 482 AAYSSLEPEDRENKTWTYHHLIDLV 506

>gb|ESW35794.1| hypothetical protein PHAVU_001G265200g [Phaseolus vulgaris] Length = 496 Score =
409 bits (1052), Expect = e-111 Identities = 205/441 (46%), Positives = 300/441 (68%) Frame = +2 Query: 245 EQRHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVETGR 424 + +HRTLLV+T+H ++ LR+L+ + R + + +L +DGDW++D+ WA V FL R Sbjct: 57 DSKHRTLLVETYHHHDSLRALLAKLEREDSN-PMYILAQDGDWSKDHFWAAVRFLKNASR 115 Query: 425 AEEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYN 604 E LQVFD+WK IE +R + N+ +II L C + M EA+SAF+ K + PSL YN Sbjct: 116 FVEILQVFDMWKEIEKSRISEFNYNKIIGLLCEDEMMEEALSAFQEMKVQGMKPSLDTYN 175 Query: 605 TIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELS 784 IIH ++ F ++ M E+GL P ETY+GL+ AYG F LYDEM +CVK+MEL Sbjct: 176 PIIHGLSKAGKFSDALRFLDEMKESGLDPDSETYDGLIGAYGKFQLYDEMGECVKKMELE 235 Query: 785 GCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKI 964 GC PD +TYNILI EYAR+G+L++ME Y+ + SK + LQ+ST V+ML+AY G +EK+ Sbjct: 236 GCSPDHITYNILIQEYARAGILQRMEKLYQRMLSKRMRLQSSTFVAMLKAYTTFGIVEKM 295 Query: 965 ERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLS 1144 E +R++L K+ +++D IRK+A VYI+NY F++LE+L D+ +A +G+ DLVW + LLS Sbjct: 296 EFFFRKVLNSKSCLEDDFIRKMAEVYIKNYMFSRLEDLALDLCSA-FGESDLVWCLRLLS 354 Query: 1145 SAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPD 1324 A L+SKKG++ ++ EM++ K+ N+ NI+ Y+K++DFR L Q R+ + PD Sbjct: 355 YACLLSKKGMDIVVKEMQDAKINWNVAFANIIMLAYVKMKDFRHLRILLSQLRINRLGPD 414 Query: 1325 LITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYI 1504 ++T GI DA IG+DG LE WRR YL+ VV ++TD LVL+AFGKG FLK CE+ Y Sbjct: 415 IVTIGIVLDASRIGFDGRGALESWRRMGYLDRVVELKTDSLVLTAFGKGHFLKSCEEVYT 474 Query: 1505 TVHSRQKQKKIWRYSDVIRLV 1567 ++H +++K W Y+D+I L+ Sbjct: 475 SLHPEDRERKKWTYNDLIALL 495 >ref|XP_003552730.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic-like [Glycine max] Length = 509 Score = 407 bits (1045), Expect = e-110 Identities = 210/441 (47%), Positives = 298/441 (67%) Frame = +2 Query: 245 EQRHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVETGR 424 + +H TLLV+T+H ++ LR+L+ + + + L +L +DGDW++D+ 
WA+V FL R Sbjct: 58 DTKHTTLLVETYHLHDSLRALLAKLQKEDCN-PLHVLAEDGDWSKDHFWAVVRFLKSASR 116 Query: 425 AEEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYN 604 + LQVFD+WKNIE +R + N+ +II L C G M +A+SA K I PSL YN Sbjct: 117 FTQILQVFDMWKNIEKSRISEFNYNKIIGLLCEGGKMEDALSALRDMKVQGIKPSLDTYN 176 Query: 605 TIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELS 784 IIH +R+ F ++ M E+GL ETY+GLL AYG F +YDEM +CVK+MEL Sbjct: 177 PIIHGLSREGKFSDALRFIDEMKESGLELDSETYDGLLGAYGKFQMYDEMGECVKKMELE 236 Query: 785 GCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKI 964 GC PD +TYNILI EYAR+GLL++ME Y+ + SK +++Q+STLV+MLEAY G +EK+ Sbjct: 237 GCSPDHITYNILIQEYARAGLLQRMEKLYQRMVSKRMHVQSSTLVAMLEAYTTFGMVEKM 296 Query: 965 ERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLS 1144 E YR+IL K +++DLIRK+A VYI+NY F++LE+L D+ A +G+ +LVW + LLS Sbjct: 297 ENFYRKILSSKTCLEDDLIRKVAEVYIKNYMFSRLEDLALDLCPA-FGESNLVWCLRLLS 355 Query: 1145 SAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPD 1324 A +SKKG++ ++ EM + KV N+ + NI+ Y+K++DFR L Q + ++PD Sbjct: 356 YACPLSKKGMDIVVREMRDAKVNWNVTVANIIMLAYVKMKDFRHLKILLSQLPIYRVQPD 415 Query: 1325 LITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYI 1504 +IT GI FDA IG+DG LE WRR YL VV I+TD LVL+AFGKG FLK CE+ Y Sbjct: 416 IITIGILFDATRIGFDGSGALETWRRMGYLYRVVEIKTDSLVLTAFGKGHFLKSCEEVYS 475 Query: 1505 TVHSRQKQKKIWRYSDVIRLV 1567 ++H +++K W Y D+I L+ Sbjct: 476 SLHPEDRKRKTWTYHDLIALL 496 >ref|XP_006421046.1| hypothetical protein CICLE_v10004784mg [Citrus clementina] gi|557522919|gb|ESR34286.1| hypothetical protein CICLE_v10004784mg [Citrus clementina] Length = 510 Score = 395 bits (1015), Expect = e-107 Identities = 191/440 (43%), Positives = 298/440 (67%) Frame = +2 Query: 251 RHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVETGRAE 430 +H TLLV+++H + L +LI ++ SC LQ+L+ DGDWT+D+ WA++ FL + R+ Sbjct: 62 KHTTLLVESYHEHQALNALIQRLNKKV-SCPLQILQHDGDWTKDHFWAVIRFLKNSSRSR 120 Query: 431 EALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYNTI 610 + QVFD+WKNIE +R 
N N +II + C EG M EA+ AF+ + + PSL IYN+I Sbjct: 121 QIPQVFDMWKNIEKSRINEFNSQKIIGMLCEEGLMEEAVRAFQEMEGFALKPSLEIYNSI 180 Query: 611 IHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGC 790 IH +++ F+ + M E L P +TY+GL++AYG + +YDE+ C+K M+L GC Sbjct: 181 IHGYSKIGKFNEALLFLNEMKEMNLSPQSDTYDGLIQAYGKYKMYDEIDMCLKMMKLDGC 240 Query: 791 FPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIER 970 PD +TYN+LI E+A +GLL++ME TY+ + +K ++L++ST+V++L+AY G L+K+E+ Sbjct: 241 SPDHITYNLLIQEFACAGLLKRMEGTYKSMLTKRMHLRSSTMVAILDAYMNFGMLDKMEK 300 Query: 971 VYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSSA 1150 Y+R+L + +KEDL+RKLA VYI+NY F++L++LG+D+ + G+ +LVW + LLS A Sbjct: 301 FYKRLLNSRTPLKEDLVRKLAEVYIKNYMFSRLDDLGDDL-ASRIGRTELVWCLRLLSHA 359 Query: 1151 GLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLI 1330 L+S +G++S++ EME KV N+ NI+ YLK++DF+ L + ++KPD++ Sbjct: 360 CLLSHRGIDSVVREMESAKVRWNVTTANIILLAYLKMKDFKHLRVLLSELPTRHVKPDIV 419 Query: 1331 TYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYITV 1510 T GI +DA IG+DG LE W+R +L V I TD LVL+ +GKG FL+ CE+ Y ++ Sbjct: 420 TIGILYDARRIGFDGTGALEMWKRIGFLFKTVEINTDPLVLAVYGKGHFLRYCEEVYSSL 479 Query: 1511 HSRQKQKKIWRYSDVIRLVL 1570 ++KK W Y ++I LV+ Sbjct: 480 EPYSREKKRWTYQNLIDLVI 499 >gb|EXC14264.1| hypothetical protein L484_021763 [Morus notabilis] Length = 664 Score = 394 bits (1013), Expect = e-107 Identities = 194/444 (43%), Positives = 296/444 (66%) Frame = +2 Query: 236 QDWEQRHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVE 415 Q+ H TLLV+TFH + ++L+ S+N SC ++LL +DGDW +++ WA+V FL Sbjct: 57 QNSSTEHTTLLVETFHEHRKFKTLLKRLSKND-SCPMRLLREDGDWCKEHFWAVVRFLRH 115 Query: 416 TGRAEEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLA 595 R +E +QVFDLWKNIE +R N N+ +IIK+ EG M EA+ +FE K + P+L Sbjct: 116 GSRTKEIVQVFDLWKNIEKSRINELNYCKIIKMLGEEGLMEEAVLSFEEMKSCGLSPTLE 175 Query: 596 IYNTIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRM 775 +YN++IH F++K DF ++ M E ++P +TY GL+ AY + +YDE+ C+K+M Sbjct: 176 
VYNSMIHGFSQKGDFDDALVYLNEMREQNVVPETDTYEGLIEAYAKYEMYDEIGLCLKKM 235 Query: 776 ELSGCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNL 955 +L+GC PD +TYN+L+ ++++ GLL++MES Y + SK + LQ+STLV+MLE YA G L Sbjct: 236 KLNGCPPDHITYNLLMRKFSKGGLLKRMESVYHTMISKRMYLQSSTLVAMLETYARFGIL 295 Query: 956 EKIERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYIL 1135 +K+E+ Y R LK K + EDLIRKLA VYI NY F++LE LG D+ T +G+ DL+W + Sbjct: 296 DKMEKFYMRTLKTKTPLGEDLIRKLAEVYIDNYLFSRLETLGVDLST-TFGETDLLWCLR 354 Query: 1136 LLSSAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNI 1315 LLS A L S+KG++ ++ EME +P N+ NI+ +LK++DF L + Q ++ Sbjct: 355 LLSHAFLFSRKGMDFVIQEMERAHIPWNVTFANIILLTHLKMKDFTHLRISLSQL-THSV 413 Query: 1316 KPDLITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEK 1495 +PD++T GI FDA +G+DG LE W+R + V + TD +V++AFGKG+FL+ CE+ Sbjct: 414 EPDIVTVGILFDAIGMGFDGTRTLETWKRMDFFYKAVEMNTDPVVITAFGKGNFLQNCER 473 Query: 1496 KYITVHSRQKQKKIWRYSDVIRLV 1567 Y ++ S ++ K W Y++++ LV Sbjct: 474 AYSSLESEVRETKSWTYNNLVDLV 497 >ref|XP_002268109.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic-like [Vitis vinifera] Length = 581 Score = 391 bits (1005), Expect = e-106 Identities = 197/438 (44%), Positives = 286/438 (65%) Frame = +2 Query: 251 RHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVETGRAE 430 +H TLLV+T H N L LI + S N S LQLL DGDW + + WA++ FL + R+ Sbjct: 88 KHTTLLVETLHENERLGVLIQKLS-NKASSPLQLLRDDGDWNKQHFWAVIRFLKDASRSS 146 Query: 431 EALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYNTI 610 E L VF LWK+++ +R N N+ +II L E E++ A E K + PSL IYN + Sbjct: 147 EILPVFHLWKDMDKSRINEFNYAKIIGLLSQEDLAEESVLALEGMKTHGLKPSLEIYNLV 206 Query: 611 IHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGC 790 IH FARK +F + + L+ ETY+GL+++YG + +YDE+ +CVK+ME GC Sbjct: 207 IHCFARKGEFDRALYFLNELKANNLIADTETYDGLIQSYGKYKMYDELDECVKKMESDGC 266 Query: 791 FPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIER 970 PD +TYN+LI E++R GLL++ME ++ + SK + LQ+STLV MLEAYA G +EK+E Sbjct: 267 
LPDHITYNLLIQEFSRGGLLKRMERVFQTVLSKKMGLQSSTLVVMLEAYANFGIIEKMEN 326 Query: 971 VYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSSA 1150 YRR+L K +K+DLIRKLA VYI NY+F++L ++G ++ + + DLVW + LLS A Sbjct: 327 AYRRVLNSKTSLKDDLIRKLAEVYIENYKFSRLADMGLNLASVT-SRTDLVWCLRLLSHA 385 Query: 1151 GLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLI 1330 L+S+KG++SI+ EME + VP N + N + YLK++DF L + ++KPD++ Sbjct: 386 CLLSRKGLDSIVKEMEAKNVPWNATVANTILLAYLKMKDFTRLRILLLELSTRHVKPDIV 445 Query: 1331 TYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYITV 1510 T GI FDA IG++G L WRR+ +L++ V + TD LVLSAFGKG+FL+ CE+ Y ++ Sbjct: 446 TVGILFDANRIGFNGTMALNTWRRTGFLDEAVEMNTDPLVLSAFGKGNFLQSCEEMYSSL 505 Query: 1511 HSRQKQKKIWRYSDVIRL 1564 ++KKIW Y ++I L Sbjct: 506 EPEARKKKIWTYQNLIDL 523 >ref|XP_003538531.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic-like isoform X1 [Glycine max] Length = 506 Score = 391 bits (1004), Expect = e-106 Identities = 204/441 (46%), Positives = 294/441 (66%) Frame = +2 Query: 245 EQRHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVETGR 424 + +H TLLV+T+H ++ LR+L+ + N S L +L +D DW++D+ WA+V FL + Sbjct: 56 DTKHTTLLVETYHLHHSLRALLAKLE-NEYSNPLHMLAEDADWSKDHFWAVVRFLKSSSN 114 Query: 425 AEEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYN 604 LQVFD+WKNIE +R + N+ +II L C G M +A+SA + K I PSL YN Sbjct: 115 FTHILQVFDMWKNIEKSRISEFNYNKIIGLLCEGGKMKDALSALQDMKVQGIKPSLDTYN 174 Query: 605 TIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELS 784 IIH +R+ F ++ M E+GL ETY+GL+ AYG F +YDEM +CVK+MEL Sbjct: 175 PIIHGLSREGKFSDALRFIDEMKESGLELDSETYDGLIGAYGKFQMYDEMGECVKKMELE 234 Query: 785 GCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKI 964 GC PD +TYNILI EYA GLL++ME Y+ + SK +++++STLV+MLEAY G +EK+ Sbjct: 235 GCSPDPITYNILIQEYAGGGLLQRMEKLYQRMLSKRMHVKSSTLVAMLEAYTTFGMVEKM 294 Query: 965 ERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLS 1144 E+ YR+IL K I++DLIRK+A VYI N+ F++LE+L D+ A +G+ +L W LLS Sbjct: 295 
EKFYRKILNSKTCIEDDLIRKVAEVYINNFMFSRLEDLALDLCPA-FGESNLEWCFRLLS 353 Query: 1145 SAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPD 1324 A L+SKKG++ ++ EM++ KV N+ + NI+ Y+K+++FR L Q + ++PD Sbjct: 354 YACLLSKKGMDIVVQEMQDAKVSWNVTVANIIMLAYVKMKEFRHLRILLSQLPIYRVQPD 413 Query: 1325 LITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYI 1504 +IT GI FDA IG+DG LE WRR YL VV ++TD LVL+AFGKG FLK CE+ Y Sbjct: 414 IITIGILFDATRIGFDGSGALETWRRMGYLYRVVEMKTDSLVLTAFGKGHFLKSCEEVYS 473 Query: 1505 TVHSRQKQKKIWRYSDVIRLV 1567 ++H +++K Y D+I L+ Sbjct: 474 SLHPEDRKRKTCTYHDLIPLL 494 >ref|XP_006855721.1| hypothetical protein AMTR_s00044p00151840 [Amborella trichopoda] gi|548859508|gb|ERN17188.1| hypothetical protein AMTR_s00044p00151840 [Amborella trichopoda] Length = 506 Score = 389 bits (999), Expect = e-105 Identities = 197/445 (44%), Positives = 291/445 (65%) Frame = +2 Query: 233 PQDWEQRHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLV 412 PQD +HR LLV F + L LI + G L+LL +GDW +D WA++ L Sbjct: 60 PQD--SKHRALLVQNFFQTQQLLDLIEKIK--GGIDPLKLLRDEGDWNKDQFWAVMKLLK 115 Query: 413 ETGRAEEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSL 592 ET R +EA+QVFD W N+E +R + SN+ ++I+L G M EA + + K + P++ Sbjct: 116 ETSRIKEAMQVFDYWVNVERSRLDDSNYTKMIELLVDAGLMDEATTMLKEVKDFGVRPTV 175 Query: 593 AIYNTIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKR 772 A+YN I+H +A +F +N M + GL+P ETY+GL+RAYG +YD+M+KC K+ Sbjct: 176 AVYNFIVHGYANTGNFDKANLFLREMRDLGLVPESETYDGLIRAYGNHRMYDDMAKCAKK 235 Query: 773 MELSGCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGN 952 ME G PD +TYNILI E+AR GL+ +ME YR L SK + LQ STLV+MLEAYA +G Sbjct: 236 MESEGFTPDHLTYNILIREFARGGLMVRMEGAYRTLLSKKMGLQYSTLVAMLEAYAALGC 295 Query: 953 LEKIERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYI 1132 + ++E V+RR+LK K +KEDL+RK+A YI+N+RF++LE+LG V + G+ DL W + Sbjct: 296 VNEMETVFRRLLKSKIPLKEDLVRKVARAYIKNHRFSRLEDLGLGV-ASKTGRTDLFWCL 354 Query: 1133 LLLSSAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCN 1312 LLLS A L S+KG++S++ EM+ V N+ 
NI A YLK++D + L+ Q ++ N Sbjct: 355 LLLSHACLCSRKGIKSVIQEMKSAMVRPNVTFANITALTYLKMKDVQYLDVLLSQLQLLN 414 Query: 1313 IKPDLITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCE 1492 + PD++T G+ DA G+D I L WR++ +L V + TD LVL+AFGKG FL+ CE Sbjct: 415 VNPDIVTVGVVMDAYVSGFDDIKALRMWRKTGFLRRPVEMNTDPLVLTAFGKGYFLRSCE 474 Query: 1493 KKYITVHSRQKQKKIWRYSDVIRLV 1567 + Y+++ ++ +++K+W Y+D+I LV Sbjct: 475 ELYLSLGAKGRERKVWTYNDLIDLV 499 >gb|EOY05094.1| Pentatricopeptide repeat-containing protein, putative [Theobroma cacao] Length = 504 Score = 387 bits (993), Expect = e-104 Identities = 201/447 (44%), Positives = 293/447 (65%) Frame = +2 Query: 248 QRHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVETGRA 427 + H LLV+T+H + L++L+ ++ SC LQ+L DGDWT+D W ++ FL R+ Sbjct: 60 KNHTALLVETYHHHRRLKALLERLEKDD-SCPLQMLRDDGDWTKDIFWVVIRFLRRASRS 118 Query: 428 EEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYNT 607 E LQVF +WKNIE +R N N+ +II L EG + +A+ A + PSL +YN+ Sbjct: 119 NEILQVFHMWKNIEKSRINELNYEKIIGLLGEEGRVGQAVQALREMGGYGLKPSLEVYNS 178 Query: 608 IIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSG 787 IIHA+AR F ++ + M E GL P +TY+GL+ AYG + +YDE+ C+K MEL Sbjct: 179 IIHAYARNGKFDDALSFLNEMKEIGLAPETDTYDGLIEAYGKYKMYDEIGTCLKMMELDR 238 Query: 788 CFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIE 967 C PD TYN+LI E++R GLL++ME Y++L SK +NLQ+S+LV+MLEAYA G L+K+E Sbjct: 239 CRPDHFTYNLLIREFSRGGLLQRMEQVYQILLSKQMNLQSSSLVAMLEAYANFGILDKME 298 Query: 968 RVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSS 1147 +VYR+++ +KED IR LA VYI+NY F++L++LG D+ ++ G+ DLVW + LLS Sbjct: 299 KVYRKVVN-SMTLKEDTIRILASVYIKNYMFSRLDDLGIDL-SSRTGRNDLVWCLRLLSH 356 Query: 1148 AGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDL 1327 A L+S+KG++S++ EM E K N+ I NI+ Y+K++DF+ L Q ++PD+ Sbjct: 357 ACLLSRKGMDSVILEMCEAKASWNVTISNIILLAYMKMKDFKRLRILLSQLPSHQVRPDI 416 Query: 1328 ITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYIT 1507 IT GI DA +IG+DG LE WR+ L V + TD LVL AFGKG FL+ CE+ Y + Sbjct: 417 
ITIGILSDAIEIGFDGAEALETWRKMGLLYRTVEMNTDPLVLIAFGKGHFLRDCEEIYTS 476 Query: 1508 VHSRQKQKKIWRYSDVIRLVLG*KAKR 1588 + + +++K W Y +I LV+ KAKR Sbjct: 477 LEPKARKEKRWTYHHLIDLVIKHKAKR 503 >ref|XP_004298231.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic-like [Fragaria vesca subsp. vesca] Length = 509 Score = 384 bits (985), Expect = e-104 Identities = 194/435 (44%), Positives = 278/435 (63%) Frame = +2 Query: 254 HRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVETGRAEE 433 H TL V+ H + LR+L+ + C LQLL DGDWT D WA++ FL+ R +E Sbjct: 68 HTTLHVEPSHEYHKLRALL-DILMEKDCCPLQLLRDDGDWTIDQFWAVIRFLIHASRPKE 126 Query: 434 ALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYNTII 613 LQ+FD+W+NIE +R N N+ +II L E + EA+ F+ K + S+ +YNTII Sbjct: 127 ILQLFDIWRNIEKSRINEFNYSKIIGLLVEEDLIEEAVVCFQDMKSQGLGLSVELYNTII 186 Query: 614 HAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGCF 793 H +R +F ++ M E L P +TY+GL+ AYG + +YDEM C+K+M L+GC Sbjct: 187 HGLSRNGNFVDAVHFLNEMKEMNLAPDADTYDGLIEAYGKYKMYDEMGMCLKKMRLNGCS 246 Query: 794 PDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIERV 973 PD +TYN+LI E+A GLL ++E Y+ + S+ ++LQ TL+++LE YA+ G LEK+E Sbjct: 247 PDYITYNLLIREFAHGGLLNRVERVYQSMVSRRMDLQVPTLIAILEVYAKFGILEKMEVF 306 Query: 974 YRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSSAG 1153 YRR+L +A +KEDLI+K+A VYI NY F++LE LG D+ + +G+ DLVW + LLS AG Sbjct: 307 YRRVLNSRAILKEDLIKKVAEVYIENYMFSKLENLGVDL-SPRFGQTDLVWCLRLLSHAG 365 Query: 1154 LVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLIT 1333 L+S++G+ SI+ EME + VP N + NI+ YLK++DF L F Q+ + PD+IT Sbjct: 366 LLSRRGMNSIILEMEGKSVPWNATVANIMMLAYLKMKDFTRLRSLFSQSLTRGVDPDIIT 425 Query: 1334 YGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYITVH 1513 +GI FDA IGYDG L WR+ L V + TD LV++ FGKG FL+ CE Y ++ Sbjct: 426 FGILFDANRIGYDGSATLNTWRKHGILYKAVEMNTDPLVITTFGKGHFLRNCEAAYSSLE 485 Query: 1514 SRQKQKKIWRYSDVI 1558 ++KK W Y D+I Sbjct: 486 PEVREKKTWTYQDLI 500 >ref|XP_002516403.1| pentatricopeptide 
repeat-containing protein, putative [Ricinus communis] gi|223544501|gb|EEF46020.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 502 Score = 384 bits (985), Expect = e-104 Identities = 192/446 (43%), Positives = 296/446 (66%) Frame = +2 Query: 230 LPQDWEQRHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFL 409 + QD +H TLLV+++H + L++L+ ++ GSC LQ+L+ D DW++D+ WA++ FL Sbjct: 49 ISQDNSIKHNTLLVESYHEHQRLKALLARLNKK-GSCPLQMLQDDADWSKDHFWAVIRFL 107 Query: 410 VETGRAEEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPS 589 + R++E LQVFD+WK+IE +R N N+ ++I++ EG + +A SAF K + PS Sbjct: 108 RHSSRSDEILQVFDMWKDIEKSRINEFNYEKVIEILGEEGLIEDAYSAFIEMKTLCLSPS 167 Query: 590 LAIYNTIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVK 769 L +YN++IH +AR F ++ + E L P +TYNGL++AYG + +YDEM C+K Sbjct: 168 LQVYNSLIHGYARNGKFDDAVFYLNHLKEINLSPVSDTYNGLIQAYGKYKMYDEMGMCLK 227 Query: 770 RMELSGCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMG 949 +ME+ GC PD VTYN+LI E A +GLL +ME Y+ ++L+++TL +MLEAYA G Sbjct: 228 KMEMEGCSPDHVTYNLLIQELAEAGLLTRMEKVYQTTRMNRMDLKSTTLTAMLEAYANFG 287 Query: 950 NLEKIERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWY 1129 +EK+E + +R KA +KEDLI+K+ALVYI N+ F++LE+LG+ + + G+ D+VW Sbjct: 288 IVEKMELILKRTRNSKALLKEDLIKKIALVYIENFMFSRLEKLGHYLSKRS-GQNDMVWC 346 Query: 1130 ILLLSSAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVC 1309 +LLLS+A ++S+KG++S++ EM+ KV N+ +NI+ YLK++D L Sbjct: 347 LLLLSNACMLSQKGMDSVVREMKVAKVSWNVTFINIILLAYLKMKDSMRLGILLSTLTNH 406 Query: 1310 NIKPDLITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLC 1489 +KPD++T G+ FDA +IG+ G +LE WRR+ L V TD LVL+AFGKG FLK C Sbjct: 407 IVKPDIVTVGVLFDANNIGFHGNGILETWRRTGILYRCVETETDPLVLAAFGKGQFLKKC 466 Query: 1490 EKKYITVHSRQKQKKIWRYSDVIRLV 1567 E+ Y ++ +QK+ W Y ++I LV Sbjct: 467 EEAYSSLEPVARQKEKWTYCNLIDLV 492 >ref|XP_002321108.2| hypothetical protein POPTR_0014s14700g [Populus trichocarpa] gi|550324215|gb|EEE99423.2| hypothetical protein POPTR_0014s14700g [Populus trichocarpa] Length = 508 
Score = 381 bits (979), Expect = e-103 Identities = 199/461 (43%), Positives = 292/461 (63%), Gaps = 17/461 (3%) Frame = +2 Query: 236 QDWEQRHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVE 415 QD +H TLLVD+FH + L+SL+H NS LQLL++DGDW++D+ W+++ FL Sbjct: 46 QDHSTKHTTLLVDSFHEHKRLKSLLHNL--NSNQNPLQLLQQDGDWSKDDFWSVIKFLKL 103 Query: 416 TGRAEEALQV-----------------FDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEA 544 + R+ + LQV F +W+++E TR N N+ +II L EG M +A Sbjct: 104 SARSNQILQVHSLAHLFFLAARKIEFVFHMWRDVEKTRINEFNYEKIIGLLGEEGLMEDA 163 Query: 545 MSAFEATKKSDIFPSLAIYNTIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRA 724 ++AF K + SL +YN+IIH +AR F ++ M E L P +TY+GL+ A Sbjct: 164 VTAFMEMKSFGLCLSLEVYNSIIHGYARNGKFDDALFYLNQMNEMNLSPESDTYDGLIEA 223 Query: 725 YGCFGLYDEMSKCVKRMELSGCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQ 904 YG + +YDEM+ C+K+MEL GC PD TYN+LI ++A+ GLL +ME Y+ + +K + LQ Sbjct: 224 YGTYRMYDEMAMCLKKMELDGCSPDRYTYNLLIQKFAQGGLLTRMERVYQSMRTKRMKLQ 283 Query: 905 ASTLVSMLEAYAEMGNLEKIERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGN 1084 +STL+SMLEAYA G +EK+E++ R K +KEDL+RKLA VYI NY F++L +L Sbjct: 284 SSTLISMLEAYANFGIVEKMEKILRWAWNSKITVKEDLVRKLAGVYIANYMFSRLHDLAV 343 Query: 1085 DVRTANWGKIDLVWYILLLSSAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIR 1264 D+ T+ G+ D+VW + LLS A L+S++G+++++ EME+ K NI + NI+ YLK++ Sbjct: 344 DL-TSITGRTDIVWCLHLLSHACLLSRRGMDAVVREMEDAKACWNITVANIILLAYLKMK 402 Query: 1265 DFRSLNGAFRQARVCNIKPDLITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQ 1444 DF L + ++PD++T+GI FDA +IG+DG LE WR+ L V + TD Sbjct: 403 DFTRLRILLSKLPEIRVEPDIVTFGILFDAEEIGFDGKECLEMWRKMGLLYRRVEMNTDP 462 Query: 1445 LVLSAFGKGSFLKLCEKKYITVHSRQKQKKIWRYSDVIRLV 1567 L LSAFGKGSFL+ CE+ Y ++ ++KK W Y D I LV Sbjct: 463 LALSAFGKGSFLRSCEEGYSSLEPNAREKKRWTYVDFINLV 503 >ref|NP_193155.4| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|223635638|sp|O23278.2|PP310_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g14190, chloroplastic; Flags: Precursor gi|332657991|gb|AEE83391.1| pentatricopeptide repeat-containing protein 
[Arabidopsis thaliana] Length = 501 Score = 352 bits (902), Expect = 4e-94 Identities = 185/432 (42%), Positives = 273/432 (63%), Gaps = 2/432 (0%) Frame = +2 Query: 281 HRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVETGRAEEALQVFDLWK 460 H + L SL S SGSC L+LL++DGDW++D+ WA++ FL ++ R E L VFD WK Sbjct: 64 HHHRFLSSLTRRLSL-SGSCPLRLLQEDGDWSKDHFWAVIRFLRQSSRLHEILPVFDTWK 122 Query: 461 NIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEAT-KKSDIFPSLAIYNTIIHAFARKRD 637 N+E +R + +N+ RII+ C E M EA+ AF + ++ PSL IYN+IIH++A Sbjct: 123 NLEPSRISENNYERIIRFLCEEKSMSEAIRAFRSMIDDHELSPSLEIYNSIIHSYADDGK 182 Query: 638 FHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGCFPDEVTYNI 817 F + M E GLLP ETY+GL+ AYG + +YDE+ C+KRME GC D VTYN+ Sbjct: 183 FEEAMFYLNHMKENGLLPITETYDGLIEAYGKWKMYDEIVLCLKRMESDGCVRDHVTYNL 242 Query: 818 LITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIERVYRRILKLK 997 LI E++R GLL++ME Y+ L S+ + L+ STL+SMLEAYAE G +EK+E +I++ Sbjct: 243 LIREFSRGGLLKRMEQMYQSLMSRKMTLEPSTLLSMLEAYAEFGLIEKMEETCNKIIRFG 302 Query: 998 AYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSSAGLVSKKGVE 1177 + E L+RKLA VYI N F++L++LG + + + +L W + LL A LVS+KG++ Sbjct: 303 ISLDEGLVRKLANVYIENLMFSRLDDLGRGISASRTRRTELAWCLRLLCHARLVSRKGLD 362 Query: 1178 SILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLITYGIFFDAC 1357 ++ EMEE +VP N NI Y K+ DF S+ + R+ ++K DL+T GI FD Sbjct: 363 YVVKEMEEARVPWNTTFANIALLAYSKMGDFTSIELLLSELRIKHVKLDLVTVGIVFDLS 422 Query: 1358 DIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEK-KYITVHSRQKQKK 1534 + +DG V W++ +L+ V ++TD LV +AFGKG FL+ CE+ K ++ +R + K Sbjct: 423 EARFDGTGVFMTWKKIGFLDKPVEMKTDPLVHAAFGKGQFLRSCEEVKNQSLGTRDGESK 482 Query: 1535 IWRYSDVIRLVL 1570 W Y ++ LV+ Sbjct: 483 SWTYQYLMELVV 494 >ref|XP_006414780.1| hypothetical protein EUTSA_v10027442mg [Eutrema salsugineum] gi|557115950|gb|ESQ56233.1| hypothetical protein EUTSA_v10027442mg [Eutrema salsugineum] Length = 495 Score = 349 bits (896), Expect = 2e-93 Identities = 185/439 (42%), Positives = 273/439 (62%), Gaps = 2/439 (0%) Frame = +2 Query: 260 
TLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVETGRAEEAL 439 +LL D++H ++ + + +GSC L+LL +DGDW++ WA+V FL + R E L Sbjct: 55 SLLSDSYHHHHRFLNSLPRRLSRTGSCPLRLLREDGDWSKHQFWAVVRFLRHSSRLHEIL 114 Query: 440 QVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEAT-KKSDIFPSLAIYNTIIH 616 VFD WKN+E +R N +N+ +I++ C E M EA+ AF+ + ++ PSL IYN+IIH Sbjct: 115 PVFDAWKNLEPSRINEANYEKILRFLCEEKSMNEAIRAFQCMIDEHELSPSLEIYNSIIH 174 Query: 617 AFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGCFP 796 +A F + M E +LP ETY+GL+ AYG + LYDE+ C+K+ME GC Sbjct: 175 GYANDGKFEEAMFYMNHMKENDMLPETETYDGLIEAYGKWKLYDEIVLCIKKMESDGCVR 234 Query: 797 DEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIERVY 976 D VTYN+LI E+AR GLL++ME Y+ L S+ + L+ TL+SMLEAYAE G LEK+E Y Sbjct: 235 DHVTYNLLIREFARGGLLKRMEQMYQSLMSRKMTLEPCTLLSMLEAYAEFGVLEKMEDTY 294 Query: 977 RRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSSAGL 1156 +I++ + EDL+RK+A VYI N F++L++LG +R + DL W + LL A L Sbjct: 295 NKIVRFGISLDEDLVRKVANVYIDNLMFSRLDDLGRGIR-----RTDLAWCLRLLCHACL 349 Query: 1157 VSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLITY 1336 VS+KG++ ++ EMEE +VP N NI+ Y K+ DFRS+ + R ++K DL+T Sbjct: 350 VSRKGLDYVVKEMEEARVPWNATFANIVLLAYSKMGDFRSVELLLSELRTKHVKLDLVTV 409 Query: 1337 GIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEK-KYITVH 1513 GI D G+DG V W++ +L+ V +TD LV +AFGKG FL+ CE+ K + Sbjct: 410 GIVLDLSVDGFDGTGVFMTWKKIGFLDKPVETKTDPLVHAAFGKGRFLRSCEEVKNQVLG 469 Query: 1514 SRQKQKKIWRYSDVIRLVL 1570 +R ++ K W Y ++ LV+ Sbjct: 470 TRVEESKSWTYQYLMELVV 488 >ref|XP_002868305.1| binding protein [Arabidopsis lyrata subsp. lyrata] gi|297314141|gb|EFH44564.1| binding protein [Arabidopsis lyrata subsp. 
lyrata] Length = 502 Score = 346 bits (888), Expect = 2e-92 Identities = 185/450 (41%), Positives = 276/450 (61%), Gaps = 2/450 (0%) Frame = +2 Query: 227 PLPQDWEQRHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSF 406 PL + + T L+ HR S + GSC L+LL++ GDW++D+ WA++ F Sbjct: 49 PLSINGDASQSTSLIHHHHR---FLSSLPRRLELPGSCPLRLLQEYGDWSKDHFWAVIRF 105 Query: 407 LVETGRAEEALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEAT-KKSDIF 583 L + R E L VFD WKN+E +R + +N+ R+I+L C E M EA+ AF ++ Sbjct: 106 LRHSSRLHEILPVFDAWKNLERSRISEANYERVIRLLCEEKSMNEAIRAFRGMIDDHELS 165 Query: 584 PSLAIYNTIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKC 763 PSL IYN+IIH +A + F + M E GLLP ETY+GL+ AYG + +YDE+ C Sbjct: 166 PSLEIYNSIIHGYADEGKFEEAMFYLNHMKENGLLPITETYDGLIEAYGKWKMYDEIVLC 225 Query: 764 VKRMELSGCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAE 943 +KRME GC D VTYN+LI E++R GLL++ME Y+ L S+ + L+ STL+SMLEAYAE Sbjct: 226 LKRMESEGCVRDHVTYNLLIREFSRGGLLKRMEQMYQSLMSRKMTLEPSTLLSMLEAYAE 285 Query: 944 MGNLEKIERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLV 1123 G +EK+E +I++ + E L+RKLA VYI N F++L++LG + ++ + DL Sbjct: 286 FGLIEKMEETCNKIIRFGISLDEGLVRKLANVYIDNLMFSRLDDLGRGISSSRTRRTDLA 345 Query: 1124 WYILLLSSAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQAR 1303 W + LL A LVS+KG++ ++ EM+E +VP N NI Y K+ DF+S+ + R Sbjct: 346 WCLRLLCHARLVSRKGLDYVIKEMKEARVPWNTTFANITLLAYSKMGDFKSIELLLSELR 405 Query: 1304 VCNIKPDLITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLK 1483 ++K DL+T GI FD + G+D V W++ +L+ V ++TD LV +AFGKG FLK Sbjct: 406 TKHVKLDLVTVGIIFDLSEAGFDVTGVFMTWKKIGFLDKPVEMKTDPLVHAAFGKGKFLK 465 Query: 1484 LCEK-KYITVHSRQKQKKIWRYSDVIRLVL 1570 CE+ K ++ R ++ K W Y ++ +V+ Sbjct: 466 SCEEVKNQSLGMRGEESKAWTYQYLMEVVV 495 >ref|XP_006492991.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic-like [Citrus sinensis] Length = 477 Score = 342 bits (876), Expect = 4e-91 Identities = 175/440 (39%), Positives = 275/440 (62%) Frame = +2 Query: 251 
RHRTLLVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVETGRAE 430 +H TLLV+++H + L +LI ++ SC LQ+L+ DGDWT+D+ WA++ FL + R+ Sbjct: 62 KHTTLLVESYHEHQALNALIQRLNKKV-SCPLQILQHDGDWTKDHFWAVIRFLKNSSRSR 120 Query: 431 EALQVFDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYNTI 610 + QVFD+WKNIE +R N N+ +II + C EG M EA+ AF+ + + PSL IYN+I Sbjct: 121 QIPQVFDMWKNIEKSRINEFNYQKIIGMLCEEGLMEEAVRAFQEMEGFALKPSLEIYNSI 180 Query: 611 IHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGC 790 IH +++ F+ + M E L P +TY+GL++AY Sbjct: 181 IHGYSKIGKFNEALLFLNEMKEMNLSPQSDTYDGLIQAY--------------------- 219 Query: 791 FPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIER 970 E+A +GLL++ME TY+ + +K ++L++ST+V++L+AY G L+K+E+ Sbjct: 220 ------------EFACAGLLKRMEGTYKSMLTKRMHLRSSTMVAILDAYMNFGMLDKMEK 267 Query: 971 VYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSSA 1150 Y+R+L + +KEDL+RKLA VYI+NY F++L++LG+D+ + G+ +LVW + LLS A Sbjct: 268 FYKRLLNSRTPLKEDLVRKLAEVYIKNYMFSRLDDLGDDL-ASRIGRTELVWCLRLLSHA 326 Query: 1151 GLVSKKGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLI 1330 L+S +G++S++ EME KV N+ NI+ YLK++DF+ L + ++KPD++ Sbjct: 327 CLLSHRGIDSVVREMESAKVRWNVTTANIILLAYLKMKDFKHLRVLLSELPTRHVKPDIV 386 Query: 1331 TYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYITV 1510 T GI +DA IG+DG LE WRR +L V I TD LVL+ +GKG FL+ CE+ Y ++ Sbjct: 387 TIGILYDARRIGFDGTGALEMWRRIGFLSKTVEINTDPLVLAVYGKGHFLRYCEEVYSSL 446 Query: 1511 HSRQKQKKIWRYSDVIRLVL 1570 ++KK W Y ++I LV+ Sbjct: 447 EPYSREKKRWTYQNLIDLVI 466 >ref|XP_006283572.1| hypothetical protein CARUB_v10004636mg [Capsella rubella] gi|482552277|gb|EOA16470.1| hypothetical protein CARUB_v10004636mg [Capsella rubella] Length = 501 Score = 333 bits (854), Expect = 1e-88 Identities = 174/414 (42%), Positives = 255/414 (61%), Gaps = 1/414 (0%) Frame = +2 Query: 332 GSCALQLLEKDGDWTEDNLWAMVSFLVETGRAEEALQVFDLWKNIEMTRNNPSNHLRIIK 511 GSC LQLL++DGDW++D+ WA++ FL + R E L V+D WKN+E +R + N+ R+I+ Sbjct: 87 GSCPLQLLQEDGDWSKDHFWAVIRFLRHSSRLHEILPVYDAWKNLEPSRISVVNYERVIR 
146 Query: 512 LFCGEGFMIEAMSAFEATKKSD-IFPSLAIYNTIIHAFARKRDFHNSNATFAMMLEAGLL 688 C E M EA+ AF + D + PSL IYN+IIH +A F + M E GL Sbjct: 147 FLCEERSMNEAIRAFRSMIDDDELSPSLEIYNSIIHGYADDGKFEEAMFYLNQMKENGLS 206 Query: 689 PTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGCFPDEVTYNILITEYARSGLLEKMEST 868 P ETY+GL+ AYG + +YDE+ CV+RME GC D VTYN+LI +++R GLL++ME Sbjct: 207 PISETYDGLIEAYGKWKMYDEIVLCVRRMESDGCVRDHVTYNLLIRQFSRGGLLKRMEQM 266 Query: 869 YRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIERVYRRILKLKAYIKEDLIRKLALVYIR 1048 Y+ L S+ + L+ TL+SMLEAYAE G +EK+E +I++ + + L+RKLA VYI Sbjct: 267 YQSLMSRKMTLEPCTLLSMLEAYAEFGVIEKMEETCNKIIRFGISLDDGLVRKLAKVYID 326 Query: 1049 NYRFAQLEELGNDVRTANWGKIDLVWYILLLSSAGLVSKKGVESILHEMEEEKVPININI 1228 N F++L++LG + + + DL W + LL + LVS+KG++ +L EM E KV N Sbjct: 327 NLMFSRLDDLGRGISYSRTRRSDLAWCLRLLCHSRLVSRKGLDYVLKEMTEAKVTWNTTF 386 Query: 1229 VNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLITYGIFFDACDIGYDGIHVLEEWRRSR 1408 NI+ Y K+ DF+S+ R +K DL+T GI FD + G+DG V W++ Sbjct: 387 ANIVLLAYSKMGDFKSIELLLDGLRTKRVKLDLVTVGIVFDLSEAGFDGTGVFMTWKKIG 446 Query: 1409 YLEDVVGIRTDQLVLSAFGKGSFLKLCEKKYITVHSRQKQKKIWRYSDVIRLVL 1570 +L+ V ++TD LV +AFGKG FL+ CE+ R + W Y +++ LV+ Sbjct: 447 FLDKPVEMKTDPLVHAAFGKGQFLRRCEE------MRGEDPTPWTYQNLMELVV 494 >emb|CAB10198.1| salt-inducible protein homolog [Arabidopsis thaliana] gi|7268124|emb|CAB78461.1| salt-inducible protein homolog [Arabidopsis thaliana] Length = 561 Score = 328 bits (842), Expect = 3e-87 Identities = 174/419 (41%), Positives = 259/419 (61%), Gaps = 15/419 (3%) Frame = +2 Query: 359 KDGDWTEDNLWAMVSFLVETGRAEEAL-------------QVFDLWKNIEMTRNNPSNHL 499 +DGDW++D+ WA++ FL ++ R E L QVFD WKN+E +R + +N+ Sbjct: 136 EDGDWSKDHFWAVIRFLRQSSRLHEILPNMKMTFCFFFQLQVFDTWKNLEPSRISENNYE 195 Query: 500 RIIKLFCGEGFMIEAMSAFEAT-KKSDIFPSLAIYNTIIHAFARKRDFHNSNATFAMMLE 676 RII+ C E M EA+ AF + ++ PSL IYN+IIH++A F + M E Sbjct: 196 RIIRFLCEEKSMSEAIRAFRSMIDDHELSPSLEIYNSIIHSYADDGKFEEAMFYLNHMKE 255 Query: 677 AGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGCFPDEVTYNILITEYARSGLLEK 856 GLLP ETY+GL+ AYG + +YDE+ C+KRME 
GC D VTYN+LI E++R GLL++ Sbjct: 256 NGLLPITETYDGLIEAYGKWKMYDEIVLCLKRMESDGCVRDHVTYNLLIREFSRGGLLKR 315 Query: 857 MESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIERVYRRILKLKAYIKEDLIRKLAL 1036 ME Y+ L S+ + L+ STL+SMLEAYAE G +EK+E +I++ + E L+RKLA Sbjct: 316 MEQMYQSLMSRKMTLEPSTLLSMLEAYAEFGLIEKMEETCNKIIRFGISLDEGLVRKLAN 375 Query: 1037 VYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSSAGLVSKKGVESILHEMEEEKVPI 1216 VYI N F++L++LG + + + +L W + LL A LVS+KG++ ++ EMEE +VP Sbjct: 376 VYIENLMFSRLDDLGRGISASRTRRTELAWCLRLLCHARLVSRKGLDYVVKEMEEARVPW 435 Query: 1217 NINIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLITYGIFFDACDIGYDGIHVLEEW 1396 N NI Y K+ DF S+ + R+ ++K DL+T GI FD + +DG V W Sbjct: 436 NTTFANIALLAYSKMGDFTSIELLLSELRIKHVKLDLVTVGIVFDLSEARFDGTGVFMTW 495 Query: 1397 RRSRYLEDVVGIRTDQLVLSAFGKGSFLKLCEK-KYITVHSRQKQKKIWRYSDVIRLVL 1570 ++ +L+ V ++TD LV +AFGKG FL+ CE+ K ++ +R + K W Y ++ LV+ Sbjct: 496 KKIGFLDKPVEMKTDPLVHAAFGKGQFLRSCEEVKNQSLGTRDGESKSWTYQYLMELVV 554 >emb|CAN80932.1| hypothetical protein VITISV_017362 [Vitis vinifera] Length = 1697 Score = 319 bits (818), Expect = 2e-84 Identities = 163/363 (44%), Positives = 234/363 (64%) Frame = +2 Query: 266 LVDTFHRNNGLRSLIHEASRNSGSCALQLLEKDGDWTEDNLWAMVSFLVETGRAEEALQV 445 LV+T H N L LI + S N S LQLL DGDW + + WA++ FL + R+ E L V Sbjct: 1332 LVETLHENERLGVLIQKLS-NKASSPLQLLRDDGDWNKQHFWAVIRFLKDASRSSEILPV 1390 Query: 446 FDLWKNIEMTRNNPSNHLRIIKLFCGEGFMIEAMSAFEATKKSDIFPSLAIYNTIIHAFA 625 F LWK+++ +R N N+ +II L E E++ A E K + PSL IYN +IH FA Sbjct: 1391 FHLWKDMDKSRINEFNYAKIIGLLSQEDLAEESVLALEXMKTHGLKPSLEIYNLVIHCFA 1450 Query: 626 RKRDFHNSNATFAMMLEAGLLPTPETYNGLLRAYGCFGLYDEMSKCVKRMELSGCFPDEV 805 RK +F + + L+ ETY+GL+++YG + +YDE+ +CVK+ME GC PD + Sbjct: 1451 RKGEFDRALYFLNELKXNNLIADTETYDGLIQSYGKYKMYDELDECVKKMESDGCLPDHI 1510 Query: 806 TYNILITEYARSGLLEKMESTYRVLSSKNLNLQASTLVSMLEAYAEMGNLEKIERVYRRI 985 TYN+LI E++R GLL++ME ++ + SK + LQ+STLV MLEAYA G +EK+E YRR+ Sbjct: 1511 TYNLLIQEFSRGGLLKRMERVFQTVLSKKMGLQSSTLVVMLEAYANFGIIEKMENAYRRV 1570 Query: 986 
LKLKAYIKEDLIRKLALVYIRNYRFAQLEELGNDVRTANWGKIDLVWYILLLSSAGLVSK 1165 L K +K+DLIRKLA VYI NY+F++L ++G D+ + + DLVW + LLS A L+S+ Sbjct: 1571 LNSKTSLKDDLIRKLAEVYIENYKFSRLADMGLDLASVT-SRTDLVWCLRLLSHACLLSR 1629 Query: 1166 KGVESILHEMEEEKVPININIVNILAHFYLKIRDFRSLNGAFRQARVCNIKPDLITYGIF 1345 KG++SI+ EME + VP N + N + YLK++DF L + ++KPD++T GI Sbjct: 1630 KGLDSIVKEMEAKNVPWNATVANTILLAYLKMKDFTRLRILLLELSTRHVKPDIVTVGIL 1689 Query: 1346 FDA 1354 FDA Sbjct: 1690 FDA 1692 >ref|XP_002966251.1| hypothetical protein SELMODRAFT_85839 [Selaginella moellendorffii] gi|300165671|gb|EFJ32278.1| hypothetical protein SELMODRAFT_85839 [Selaginella moellendorffii] Length = 358 Score = 151 bits (381), Expect = 1e-33 Identities = 88/317 (27%), Positives = 158/317 (49%) Frame = +2 Query: 542 AMSAFEATKKSDIFPSLAIYNTIIHAFARKRDFHNSNATFAMMLEAGLLPTPETYNGLLR 721 A F+ + + PS+ ++ ++ ++A + + + ML+ G+ P TY GL+R Sbjct: 11 AQGVFDGMEAMQVRPSVVGFSALVQSYAESGEVEGAQSAMKRMLDTGIQPNVVTYGGLIR 70 Query: 722 AYGCFGLYDEMSKCVKRMELSGCFPDEVTYNILITEYARSGLLEKMESTYRVLSSKNLNL 901 AYG GL+DEM+K V M+ C PD Y +I YA GL+ +M+ ++ + + Sbjct: 71 AYGKRGLFDEMAKVVNTMKTVRCEPDFFVYKNVIEAYASGGLVGRMDKAFKAMRADGWIP 130 Query: 902 QASTLVSMLEAYAEMGNLEKIERVYRRILKLKAYIKEDLIRKLALVYIRNYRFAQLEELG 1081 + L + + YA MG ++++E + ++K + +E+ +R AL YIR+ +F Q+E Sbjct: 131 DSDILNLLAQGYASMGMIKEMEGAQGELRRIKGWPREESVRACALAYIRHNQFYQMEGFV 190 Query: 1082 NDVRTANWGKIDLVWYILLLSSAGLVSKKGVESILHEMEEEKVPININIVNILAHFYLKI 1261 + G +L+W +LLL+ A S K ++ M + ++ NI A ++ Sbjct: 191 KSLGMKRIGG-NLLWNLLLLAHAANFSMKSLQREAVNMWSARCAPDVTTFNIRALALSRM 249 Query: 1262 RDFRSLNGAFRQARVCNIKPDLITYGIFFDACDIGYDGIHVLEEWRRSRYLEDVVGIRTD 1441 + L+ + R +++PDL+TYG DA I + E+ + + +RTD Sbjct: 250 QMLWDLHVLVQHMRAESVRPDLVTYGALVDAYAIARLLPRLPEQLDELDMADTIPDVRTD 309 Query: 1442 QLVLSAFGKGSFLKLCE 1492 LV AFG+G F C+ Sbjct: 310 PLVFQAFGRGRFHAFCD 326