BLASTX nr result
ID: Mentha25_contig00040920
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00040920
         (1268 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|EYU39729.1| hypothetical protein MIMGU_mgv1a004960mg [Mimulus...   523   e-146
emb|CAN73672.1| hypothetical protein VITISV_031859 [Vitis vinifera]   434   e-119
ref|XP_006352332.1| PREDICTED: pentatricopeptide repeat-containi...   422   e-115
ref|XP_003612228.1| Pentatricopeptide repeat-containing protein ...   397   e-108
ref|XP_004512166.1| PREDICTED: pentatricopeptide repeat-containi...   394   e-107
ref|XP_007204770.1| hypothetical protein PRUPE_ppa015604mg [Prun...   394   e-107
ref|XP_003516541.1| PREDICTED: pentatricopeptide repeat-containi...   393   e-107
ref|XP_007157883.1| hypothetical protein PHAVU_002G106000g [Phas...   393   e-106
ref|XP_006383060.1| pentatricopeptide repeat-containing family p...   388   e-105
ref|XP_004135020.1| PREDICTED: pentatricopeptide repeat-containi...   380   e-103
gb|EXC35313.1| hypothetical protein L484_026636 [Morus notabilis]     370   e-100
ref|XP_004158900.1| PREDICTED: pentatricopeptide repeat-containi...   352   3e-94
ref|XP_002877796.1| binding protein [Arabidopsis lyrata subsp. l...   346   1e-92
ref|NP_190700.2| pentatricopeptide repeat-containing protein [Ar...   345   2e-92
ref|XP_006403930.1| hypothetical protein EUTSA_v10010283mg [Eutr...   342   3e-91
ref|XP_006425390.1| hypothetical protein CICLE_v10027592mg [Citr...   331   5e-88
ref|XP_006857380.1| hypothetical protein AMTR_s00067p00130250 [A...   316   1e-83
ref|XP_002531149.1| pentatricopeptide repeat-containing protein,... 
298 4e-78 emb|CAB62654.1| putative protein [Arabidopsis thaliana] 285 4e-74 ref|XP_006844721.1| hypothetical protein AMTR_s00016p00252780 [A... 263 2e-67 >gb|EYU39729.1| hypothetical protein MIMGU_mgv1a004960mg [Mimulus guttatus] Length = 502 Score = 523 bits (1348), Expect = e-146 Identities = 259/377 (68%), Positives = 302/377 (80%), Gaps = 1/377 (0%) Frame = +3 Query: 141 KTSIHLFQIQSYLITSGLFQDPSFAGRLLKLSSNLINDLRCTVLIFKRLDSPDAFCVNTV 320 + IHLFQIQ+ LITSG+FQDPSF+GRLLKLSS+LI+DL T+LIFK +D PDAFCVNTV Sbjct: 14 RNKIHLFQIQAQLITSGVFQDPSFSGRLLKLSSSLIDDLCYTLLIFKCIDFPDAFCVNTV 73 Query: 321 IKKYSCSNHHSEAVLFYVELLRDGDFHPNGFTFPPLISACAKLGCLSLGRMCHGHAVKLG 500 IK Y+CSNHH AV FY E LR GDF+PNGFTFPPLISACAKLGCLSLG+MCHGHA+K G Sbjct: 74 IKGYTCSNHHQIAVSFYAEALRRGDFYPNGFTFPPLISACAKLGCLSLGQMCHGHALKFG 133 Query: 501 V-DSVLPVQNSLVHFYACCGLMDVAYKVFVEMLVRDLVSWNTMVDGFAKVGEMELAHKLF 677 V D VLPVQNSL+HFY CC L+DVA KV EM V+DLVSWNT++ G AK GEME AHK+F Sbjct: 134 VVDHVLPVQNSLLHFYGCCRLVDVAGKVLDEMPVKDLVSWNTVIGGLAKAGEMESAHKMF 193 Query: 678 DAIPERNVVSWNVMITGYLKFGNPGNALKLFREMMVQSFNSNDTTVVQVITACGRSNRLK 857 D +P +NVVSWNVMITGYL F +PGNAL+LFR MM +++ SNDTT VQVI AC RSNRLK Sbjct: 194 DEMPRKNVVSWNVMITGYLNFRSPGNALQLFRRMMSRNYESNDTTKVQVIAACARSNRLK 253 Query: 858 EGRSVHGFLVRSFSCLSLIIDTAMIDMYSKCGRIDLAHLIFERMQRKNSVSWNAMILGHC 1037 EG+S+HGF++++ + SLIIDT MIDMYSKCGR D+A IF++M KN VSWNAMILGHC Sbjct: 254 EGKSIHGFIIKACTDFSLIIDTNMIDMYSKCGRTDIARKIFDKMPIKNLVSWNAMILGHC 313 Query: 1038 IHGNPVEGLSLYAEMLDKSRQKDSLAAESARNFKSNEENWILPDELTFIGVLCACAHLGL 1217 IHG+PV+GLSLY+EM DK I PDELTFIGVLCACA LGL Sbjct: 314 IHGDPVDGLSLYSEMADK----------------------INPDELTFIGVLCACARLGL 351 Query: 1218 LEEGRNHFSQMTDVFCL 1268 L +G+N+FS+M D+F L Sbjct: 352 LTDGKNYFSEMIDLFHL 368 >emb|CAN73672.1| hypothetical protein VITISV_031859 [Vitis vinifera] Length = 901 Score = 434 bits (1115), Expect = e-119 Identities = 215/385 (55%), Positives = 278/385 (72%) Frame = +3 Query: 108 TNRALYLLQCCKTSIHLFQIQSYLITSGLFQDPSFAGRLLKLSSNLINDLRCTVLIFKRL 287 +N L LL+ C+ L QIQ+YLI SGLF+ P A 
++LK+S++ D+ T+LIF+ + Sbjct: 372 SNSCLALLKTCRNMRQLSQIQAYLIISGLFRKPFVASKVLKVSADYA-DVNYTILIFRSI 430 Query: 288 DSPDAFCVNTVIKKYSCSNHHSEAVLFYVELLRDGDFHPNGFTFPPLISACAKLGCLSLG 467 DSPD CVN VIK YS S+ +A++FY E LR+G F N FTFPPL S C K GC+ G Sbjct: 431 DSPDTVCVNAVIKAYSISSVAHQALVFYFETLRNG-FMCNSFTFPPLFSCCRKXGCVEYG 489 Query: 468 RMCHGHAVKLGVDSVLPVQNSLVHFYACCGLMDVAYKVFVEMLVRDLVSWNTMVDGFAKV 647 HG A+K GVD+VL VQNS+VH Y CCG+++ A KVF EM RDLVSWN+++D +AK+ Sbjct: 490 EKFHGQAIKNGVDNVLDVQNSMVHMYGCCGVVEXAEKVFGEMSKRDLVSWNSIIDAYAKL 549 Query: 648 GEMELAHKLFDAIPERNVVSWNVMITGYLKFGNPGNALKLFREMMVQSFNSNDTTVVQVI 827 G + LAH+LFDA+PERN VSWN+M+ GYLK GNPG ALKLFREM +TT+V V+ Sbjct: 550 GHLVLAHRLFDAMPERNAVSWNIMMGGYLKGGNPGCALKLFREMANAGLRGGETTMVSVL 609 Query: 828 TACGRSNRLKEGRSVHGFLVRSFSCLSLIIDTAMIDMYSKCGRIDLAHLIFERMQRKNSV 1007 TAC RS RLKEGRS+HG L+R+F SLI+DTA+IDMYSKC R+D+A ++++RM + N V Sbjct: 610 TACCRSARLKEGRSIHGVLIRTFLKSSLILDTALIDMYSKCERVDVARVVYDRMTKXNLV 669 Query: 1008 SWNAMILGHCIHGNPVEGLSLYAEMLDKSRQKDSLAAESARNFKSNEENWILPDELTFIG 1187 WNAMILGHCIHGN +GL L+ EM+D R +D + K E ++PDE+TFIG Sbjct: 670 CWNAMILGHCIHGNAEDGLKLFEEMVDGIRSEDG-EINLDKGIKRIEGQGLJPDEITFIG 728 Query: 1188 VLCACAHLGLLEEGRNHFSQMTDVF 1262 VLCACA GLL EGR+++SQM + F Sbjct: 729 VLCACAREGLLAEGRSYYSQMINTF 753 >ref|XP_006352332.1| PREDICTED: pentatricopeptide repeat-containing protein At3g51320-like isoform X1 [Solanum tuberosum] gi|565371484|ref|XP_006352333.1| PREDICTED: pentatricopeptide repeat-containing protein At3g51320-like isoform X2 [Solanum tuberosum] Length = 534 Score = 422 bits (1085), Expect = e-115 Identities = 209/394 (53%), Positives = 280/394 (71%), Gaps = 2/394 (0%) Frame = +3 Query: 87 NNLSATITNRALYLLQCCKTSIHLFQIQSYLITSGLFQ--DPSFAGRLLKLSSNLINDLR 260 ++L+ T ++AL L C++ LFQIQ++LI +GL Q +PS++ R LKL + +D+ Sbjct: 23 SSLTPTYQSKALEFLDSCQSLAQLFQIQAHLIITGLLQVQNPSYSCRFLKLCTQHCDDIE 82 Query: 261 CTVLIFKRLDSPDAFCVNTVIKKYSCSNHHSEAVLFYVELLRDGDFHPNGFTFPPLISAC 440 T L+FK + PD F VNTVIK Y+CS+ AV+FY + L++G F PN FTFPPL+SAC Sbjct: 83 
YTALVFKCIHFPDTFSVNTVIKAYACSSLPDNAVVFYFQRLKNG-FLPNSFTFPPLMSAC 141 Query: 441 AKLGCLSLGRMCHGHAVKLGVDSVLPVQNSLVHFYACCGLMDVAYKVFVEMLVRDLVSWN 620 A+ G L G+ CHG VK GVD VL VQNSLVHFY+CCG +D+A KVF EM RD+VSWN Sbjct: 142 ARRGRLDSGQKCHGQVVKNGVDGVLQVQNSLVHFYSCCGFIDLARKVFDEMHQRDVVSWN 201 Query: 621 TMVDGFAKVGEMELAHKLFDAIPERNVVSWNVMITGYLKFGNPGNALKLFREMMVQSFNS 800 ++++G+ KVGE+ +A +LFDA+PE N+V WNVM+TGYL NPG LKLFREM + N Sbjct: 202 SIMNGYVKVGELVVARQLFDAMPECNLVGWNVMMTGYLNSNNPGKCLKLFREMAQRGLNG 261 Query: 801 NDTTVVQVITACGRSNRLKEGRSVHGFLVRSFSCLSLIIDTAMIDMYSKCGRIDLAHLIF 980 NDTT+V +TAC RS R+KEG+SVHG L+++ L+LI+ T +I MYS+CGR ++ LIF Sbjct: 262 NDTTIVIAVTACARSARMKEGKSVHGCLIKASKDLNLIVSTTLIHMYSRCGRAEIGRLIF 321 Query: 981 ERMQRKNSVSWNAMILGHCIHGNPVEGLSLYAEMLDKSRQKDSLAAESARNFKSNEENWI 1160 +R+ KN V WNAMILG+CIHG P +GL+LY+++L + K++ + Sbjct: 322 DRISIKNIVCWNAMILGYCIHGIPKDGLNLYSDLLSSRLESTE---------KNHVKYHA 372 Query: 1161 LPDELTFIGVLCACAHLGLLEEGRNHFSQMTDVF 1262 LPDE+TF+GVLCACA GLL EGR HF M+DVF Sbjct: 373 LPDEITFVGVLCACAREGLLTEGRKHFGNMSDVF 406 >ref|XP_003612228.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355513563|gb|AES95186.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 665 Score = 397 bits (1020), Expect = e-108 Identities = 204/378 (53%), Positives = 261/378 (69%), Gaps = 1/378 (0%) Frame = +3 Query: 138 CKTSIHLFQIQSYLITSGLFQDPSFAGRLLKLSSNLINDLRCTVLIFKRLDSP-DAFCVN 314 C+T+ HL QIQS LITS +++P + LL +SNL + T LIF ++P D FCVN Sbjct: 48 CQTTHHLLQIQSLLITSSFYRNPFLSRTLLSRASNLCT-VDFTFLIFHHFNNPLDTFCVN 106 Query: 315 TVIKKYSCSNHHSEAVLFYVELLRDGDFHPNGFTFPPLISACAKLGCLSLGRMCHGHAVK 494 TVI Y S +A++FY L+ G F N +TF LISAC+K+ C+ G+MCHG AVK Sbjct: 107 TVINSYCNSYVPHKAIVFYFSSLKIG-FFANSYTFVSLISACSKMSCVDNGKMCHGQAVK 165 Query: 495 LGVDSVLPVQNSLVHFYACCGLMDVAYKVFVEMLVRDLVSWNTMVDGFAKVGEMELAHKL 674 GVD VLPV+NSL H Y CG ++VA +F M+ RDLVSWN+M+DG+ KVG++ AHKL Sbjct: 166 NGVDFVLPVENSLAHMYGSCGYVEVARVMFDGMVSRDLVSWNSMIDGYVKVGDLSAAHKL 225 Query: 675 
FDAIPERNVVSWNVMITGYLKFGNPGNALKLFREMMVQSFNSNDTTVVQVITACGRSNRL 854 FD +PERN+V+WN +I+GY K NPG ALKLFREM N T+V +TACGRS RL Sbjct: 226 FDVMPERNLVTWNCLISGYSKGRNPGYALKLFREMGRLRIRENARTMVCAVTACGRSGRL 285 Query: 855 KEGRSVHGFLVRSFSCLSLIIDTAMIDMYSKCGRIDLAHLIFERMQRKNSVSWNAMILGH 1034 KEG+SVHG ++R F SLI+DTA+IDMY KCGR++ A +FERM +N VSWNAMILGH Sbjct: 286 KEGKSVHGSMIRLFMRSSLILDTALIDMYCKCGRVEAASKVFERMSSRNLVSWNAMILGH 345 Query: 1035 CIHGNPVEGLSLYAEMLDKSRQKDSLAAESARNFKSNEENWILPDELTFIGVLCACAHLG 1214 CIHGNP +GLSL+ M+ R K + + + + +LPDE+TFIG+LCACA Sbjct: 346 CIHGNPEDGLSLFDLMVGMERVKGEVEVDESSSADRGLVR-LLPDEITFIGILCACARAE 404 Query: 1215 LLEEGRNHFSQMTDVFCL 1268 LL EGR++F QM DVF L Sbjct: 405 LLSEGRSYFKQMIDVFGL 422 >ref|XP_004512166.1| PREDICTED: pentatricopeptide repeat-containing protein At3g51320-like [Cicer arietinum] Length = 598 Score = 394 bits (1013), Expect = e-107 Identities = 203/378 (53%), Positives = 266/378 (70%), Gaps = 1/378 (0%) Frame = +3 Query: 138 CKTSIHLFQIQSYLITSGLFQDPSFAGRLLKLSSNLINDLRCTVLIFKRLDSP-DAFCVN 314 C+T+ HL QIQ+ LITS +++P LL+ +SNL D+ T LIF+ ++P D FCVN Sbjct: 60 CQTTRHLLQIQALLITSSFYRNPFLVRTLLRRASNLC-DVAFTFLIFQHFNNPLDTFCVN 118 Query: 315 TVIKKYSCSNHHSEAVLFYVELLRDGDFHPNGFTFPPLISACAKLGCLSLGRMCHGHAVK 494 TVI Y S ++A++FY + L+ F PN +TF PLI +C+ +GC+ GRMCH AVK Sbjct: 119 TVINSYCNSYVPNKAIVFYFQSLKIR-FFPNSYTFVPLIGSCSNMGCVDSGRMCHAQAVK 177 Query: 495 LGVDSVLPVQNSLVHFYACCGLMDVAYKVFVEMLVRDLVSWNTMVDGFAKVGEMELAHKL 674 GVD VLPVQNSLVH YA CG + VA +F M+ RD VSWN+M+DG+ KVG++ AH+L Sbjct: 178 NGVDFVLPVQNSLVHMYASCGDVCVARVMFDAMMDRDSVSWNSMIDGYVKVGDLNAAHQL 237 Query: 675 FDAIPERNVVSWNVMITGYLKFGNPGNALKLFREMMVQSFNSNDTTVVQVITACGRSNRL 854 FD +PERN+V+WN MI+G+LK NPG LKLFREM N T+V V+TACGRS RL Sbjct: 238 FDVMPERNLVTWNCMISGFLKGRNPGYGLKLFREMGRLGLRGNVRTMVSVVTACGRSGRL 297 Query: 855 KEGRSVHGFLVRSFSCLSLIIDTAMIDMYSKCGRIDLAHLIFERMQRKNSVSWNAMILGH 1034 KEG+SVHG ++R F+ +LI+DTA+IDMY KC R+++A +FERM +N VSWNAMILGH Sbjct: 298 KEGKSVHGSIIRLFARSNLILDTALIDMYCKCRRVEVASKVFERMGNRNLVSWNAMILGH 357 Query: 1035 
CIHGNPVEGLSLYAEMLDKSRQKDSLAAESARNFKSNEENWILPDELTFIGVLCACAHLG 1214 CI G+P +GLSL+ M+ R K + + + + S + LPDE+TFIGVLCACA Sbjct: 358 CIRGSPEDGLSLFDLMVGMVRVKGEVEIDESPSADSGLVRF-LPDEITFIGVLCACARAE 416 Query: 1215 LLEEGRNHFSQMTDVFCL 1268 LL EGR++F QM DVF L Sbjct: 417 LLSEGRSYFKQMIDVFGL 434 >ref|XP_007204770.1| hypothetical protein PRUPE_ppa015604mg [Prunus persica] gi|462400301|gb|EMJ05969.1| hypothetical protein PRUPE_ppa015604mg [Prunus persica] Length = 568 Score = 394 bits (1011), Expect = e-107 Identities = 198/388 (51%), Positives = 269/388 (69%), Gaps = 2/388 (0%) Frame = +3 Query: 111 NRALY-LLQCCKTSIHLFQIQSYLITSGLFQDPSFAGRLLKLSSNLINDLRCTVLIFKRL 287 NR ++ LL CK I + QI ++LIT GLF D +A +LLK S+ D +LIF+ + Sbjct: 48 NRHIFSLLDACKNLIQITQIHAHLITRGLF-DSFWARKLLKSYSDF-RDFDYVILIFRCI 105 Query: 288 DSPDAFCVNTVIKKYSCSNHHSEAVLFYVELLRDGDFHPNGFTFPPLISACAKLGCLSLG 467 D P FCVNTVIK YS S+ +A++ Y E LR+G F P +TF PLI +CAK+G + G Sbjct: 106 DLPGTFCVNTVIKAYSVSSMPDQALVVYFEWLRNG-FAPTSYTFVPLIGSCAKMGSVESG 164 Query: 468 RMCHGHAVKLGVDSVLPVQNSLVHFYACCGLMDVAYKVFVEMLVRDLVSWNTMVDGFAKV 647 R CHG VK G+DS+L VQNSL+H Y +++A +F EM RDLVSWNT++DG+A+ Sbjct: 165 RKCHGQVVKHGLDSLLQVQNSLIHMYCSSEKVELARMMFDEMSERDLVSWNTILDGYARF 224 Query: 648 GEMELAHKLFDAIPERNVVSWNVMITGYLKFGNPGNALKLFREMMVQSFNSNDTTVVQVI 827 G++++AH LFD +PERNVVSWNVM+ GY K G PG ALKLFR+MM N TT+ ++ Sbjct: 225 GDLDVAHNLFDEMPERNVVSWNVMLGGYWKGGKPGCALKLFRKMMGMELKGNSTTIANML 284 Query: 828 TACGRSNRLKEGRSVHGFLVRSFSCLSLIIDTAMIDMYSKCGRIDLAHLIFERMQRKNSV 1007 ACGRS RL EGRSVHG+L+R +++I TA+IDMY KC R+++A +FE M +N V Sbjct: 285 AACGRSARLNEGRSVHGYLIRKLFEFNIVISTALIDMYCKCKRVEVACRVFESMANRNLV 344 Query: 1008 SWNAMILGHCIHGNPVEGLSLYAEMLDKSRQKDSLAAESARNFKSNEE-NWILPDELTFI 1184 WNA+ILGHCIHGN +GL+LY EM+ + + KD + + + +++ I+PDE+TFI Sbjct: 345 CWNAIILGHCIHGNAKDGLNLYREMVGRMKSKDGETIPAKGSSRPDDDGGGIIPDEITFI 404 Query: 1185 GVLCACAHLGLLEEGRNHFSQMTDVFCL 1268 GVLCACA GL+ E ++FSQM +VFC+ Sbjct: 405 GVLCACARAGLVREAADYFSQMINVFCV 432 >ref|XP_003516541.1| PREDICTED: pentatricopeptide repeat-containing 
protein At3g51320-like [Glycine max] Length = 579 Score = 393 bits (1010), Expect = e-107 Identities = 199/390 (51%), Positives = 270/390 (69%) Frame = +3 Query: 93 LSATITNRALYLLQCCKTSIHLFQIQSYLITSGLFQDPSFAGRLLKLSSNLINDLRCTVL 272 LS+ ++ L C+ + HL QIQ+ L+TS LF++P A +L +S+L D+ T + Sbjct: 36 LSSLFSHFEALLQNSCQNARHLLQIQALLVTSSLFRNPYLARTILSRASHLC-DVAYTRV 94 Query: 273 IFKRLDSPDAFCVNTVIKKYSCSNHHSEAVLFYVELLRDGDFHPNGFTFPPLISACAKLG 452 IF+ ++S D FCVN VI+ YS S+ EA++FY L G F PN +TF PL+++CAK+G Sbjct: 95 IFRSINSLDTFCVNIVIQAYSNSHAPREAIVFYFRSLMRG-FFPNSYTFVPLVASCAKMG 153 Query: 453 CLSLGRMCHGHAVKLGVDSVLPVQNSLVHFYACCGLMDVAYKVFVEMLVRDLVSWNTMVD 632 C+ G+ CH A K GVDSVLPVQNSL+H Y CCG + +A +F ML RDLVSWN++++ Sbjct: 154 CIGSGKECHAQATKNGVDSVLPVQNSLIHMYVCCGGVQLARVLFDGMLSRDLVSWNSIIN 213 Query: 633 GFAKVGEMELAHKLFDAIPERNVVSWNVMITGYLKFGNPGNALKLFREMMVQSFNSNDTT 812 G VGE+ AH+LFD +PERN+V+WNVMI+GYLK NPG A+KLFREM N T Sbjct: 214 GHMMVGELNAAHRLFDKMPERNLVTWNVMISGYLKGRNPGYAMKLFREMGRLGLRGNART 273 Query: 813 VVQVITACGRSNRLKEGRSVHGFLVRSFSCLSLIIDTAMIDMYSKCGRIDLAHLIFERMQ 992 +V V TACGRS RLKE +SVHG +VR SLI+DTA+I MY KC ++++A ++FERM+ Sbjct: 274 MVCVATACGRSGRLKEAKSVHGSIVRMSLRSSLILDTALIGMYCKCRKVEVAQIVFERMR 333 Query: 993 RKNSVSWNAMILGHCIHGNPVEGLSLYAEMLDKSRQKDSLAAESARNFKSNEENWILPDE 1172 +N VSWN MILGHCI G+P +GL L+ M+ + K + +S+E +LP+E Sbjct: 334 ERNLVSWNMMILGHCIRGSPEDGLDLFEVMISMGKMKHGV--------ESDETLRLLPNE 385 Query: 1173 LTFIGVLCACAHLGLLEEGRNHFSQMTDVF 1262 +TFIGVLCACA +L+EGR++F QMTDVF Sbjct: 386 VTFIGVLCACARAEMLDEGRSYFKQMTDVF 415 >ref|XP_007157883.1| hypothetical protein PHAVU_002G106000g [Phaseolus vulgaris] gi|561031298|gb|ESW29877.1| hypothetical protein PHAVU_002G106000g [Phaseolus vulgaris] Length = 583 Score = 393 bits (1009), Expect = e-106 Identities = 199/381 (52%), Positives = 265/381 (69%), Gaps = 2/381 (0%) Frame = +3 Query: 126 LLQCCKTSIHLFQIQSYLITSGLFQDPSFAGRLLKLSSNLINDLRCTVLIFKRLDSPDAF 305 L C+++ HL QIQ+ L+TS LF++P A +L +S L D+ T+LIF+ ++S D F Sbjct: 51 
LRNSCRSARHLLQIQALLVTSSLFRNPFLARTVLSRASRLC-DVAYTLLIFRHINSSDTF 109 Query: 306 CVNTVIKKYSCSNHHSEAVLFYVELLRDGDFHPNGFTFPPLISACAKLGCLSLGRMCHGH 485 CVNTVI Y S+ + V+FY L G F PN +TF PL+ +CA+ GC+ G+ CH Sbjct: 110 CVNTVIHAYCDSDAPHQTVIFYFRSLMRG-FFPNSYTFVPLVGSCARTGCVDSGKECHAQ 168 Query: 486 AVKLGVDSVLPVQNSLVHFYACCGLMDVAYKVFVEMLVRDLVSWNTMVDGFAKVGEMELA 665 A K GVDSVLPVQNSL+H YACCG + +A +F ML RDLVSWN+++DG VGE+ A Sbjct: 169 ATKNGVDSVLPVQNSLIHMYACCGGVQLARVLFDGMLTRDLVSWNSIIDGHMMVGELNAA 228 Query: 666 HKLFDAIPERNVVSWNVMITGYLKFGNPGNALKLFREMMVQSFNSNDTTVVQVITACGRS 845 H+LFD +P+RN+V+WNVMI+GYLK NPG A+KLFR M N T+V + TACGRS Sbjct: 229 HRLFDQMPDRNLVTWNVMISGYLKGRNPGYAMKLFRTMGRLGMRGNARTMVCLATACGRS 288 Query: 846 NRLKEGRSVHGFLVRSFSCLSLIIDTAMIDMYSKCGRIDLAHLIFERMQRKNSVSWNAMI 1025 RLKEGRSVHG +V+ F SLI+DTA+IDMYSKC R+++A +F+RM +N +SWNAMI Sbjct: 289 GRLKEGRSVHGSIVKMFVRSSLILDTALIDMYSKCRRVEVARTVFDRMTERNLISWNAMI 348 Query: 1026 LGHCIHGNPVEGLSLYAEM--LDKSRQKDSLAAESARNFKSNEENWILPDELTFIGVLCA 1199 LG CI G+P +GLSL+ EM +D + +++SL +LPDE+TFIG+LCA Sbjct: 349 LGSCIQGSPEDGLSLFGEMVGIDGNDREESLR--------------LLPDEVTFIGILCA 394 Query: 1200 CAHLGLLEEGRNHFSQMTDVF 1262 CA LL EGR++F +MT+VF Sbjct: 395 CARAELLAEGRSYFKKMTEVF 415 >ref|XP_006383060.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550338637|gb|ERP60857.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 564 Score = 388 bits (997), Expect = e-105 Identities = 195/382 (51%), Positives = 266/382 (69%), Gaps = 2/382 (0%) Frame = +3 Query: 111 NRALYLLQCCKTSIHLFQIQSYLITSGLFQDPSFAGRLLKLSSNLINDLRCTVLIFKRLD 290 N LL HL+QIQ+ LIT GLF ++ RLLK ++ D+ T+ IFK + Sbjct: 52 NPRFELLYSTLNPFHLYQIQAQLITCGLFS--LWSPRLLKHFADF-GDIDYTIFIFKFIA 108 Query: 291 SPDAFCVNTVIKKYSCSNHHSEAVLFYVELLRDGDFHPNGFTFPPLISACAKLGCLSLGR 470 SP F VN V+K YS S+ ++A++FY E+L+ G F PN +TF L CAK+GC LG+ Sbjct: 109 SPGTFVVNNVVKAYSLSSEPNKALVFYFEMLKSG-FCPNSYTFVSLFGCCAKVGCAKLGK 167 Query: 471 MCHGHAVKLGVDSVLPVQNSLVHFYACCGLMDVAYKVFVEMLVRDLVSWNTMVDGFAKVG 650 HG 
AVK GVD +LPV+NSL+H Y CCG M +A KVF EM RDLVSWN+++DG+A +G Sbjct: 168 KYHGQAVKNGVDRILPVENSLIHCYGCCGDMGLAKKVFDEMSHRDLVSWNSIIDGYATLG 227 Query: 651 EMELAHKLFDAIPERNVVSWNVMITGYLKFGNPGNALKLFREMMVQSFNSNDTTVVQVIT 830 E+ +AH LF+ +PERNVVSWN++I+GYLK NPG L LFR+MM ND+T+V V++ Sbjct: 228 ELGIAHGLFEVMPERNVVSWNILISGYLKGNNPGCVLMLFRKMMNDGMRGNDSTIVSVLS 287 Query: 831 ACGRSNRLKEGRSVHGFLVRSFSCLSLIIDTAMIDMYSKCGRIDLAHLIFERMQRKNSVS 1010 ACGRS RL+EGRSVHGF+V+ FS +++I +T +IDMY++C ++++A IF+++ R+N Sbjct: 288 ACGRSARLREGRSVHGFIVKKFSSMNVIHETTLIDMYNRCHKVEMARRIFDKVVRRNLGC 347 Query: 1011 WNAMILGHCIHGNPVEGLSLYAEMLDKS--RQKDSLAAESARNFKSNEENWILPDELTFI 1184 WNAMILGHC+HGNP +GL L+ +M+D++ ++DS + PDE+TFI Sbjct: 348 WNAMILGHCLHGNPDDGLELFKDMVDRAGLGKRDS----------------VHPDEVTFI 391 Query: 1185 GVLCACAHLGLLEEGRNHFSQM 1250 GVLCACA GLL EG+N FSQM Sbjct: 392 GVLCACARAGLLTEGKNFFSQM 413 Score = 78.6 bits (192), Expect = 5e-12 Identities = 67/287 (23%), Positives = 124/287 (43%), Gaps = 15/287 (5%) Frame = +3 Query: 273 IFKRLDSPDAFCVNTVIKKYSCSNHHSEAVLFYVELLRDGDFHPNGFTFPPLISACAKLG 452 +F+ + + N +I Y N+ ++ + +++ DG N T ++SAC + Sbjct: 235 LFEVMPERNVVSWNILISGYLKGNNPGCVLMLFRKMMNDG-MRGNDSTIVSVLSACGRSA 293 Query: 453 CLSLGRMCHGHAVKLGVDSVLPVQNSLVHFYACCGLMDVAYKVFVEMLVRDLVSWNTMVD 632 L GR HG VK F M +++ T++D Sbjct: 294 RLREGRSVHGFIVKK----------------------------FSSM---NVIHETTLID 322 Query: 633 GFAKVGEMELAHKLFDAIPERNVVSWNVMITGYLKFGNPGNALKLFREMMVQ-------S 791 + + ++E+A ++FD + RN+ WN MI G+ GNP + L+LF++M+ + S Sbjct: 323 MYNRCHKVEMARRIFDKVVRRNLGCWNAMILGHCLHGNPDDGLELFKDMVDRAGLGKRDS 382 Query: 792 FNSNDTTVVQVITACGRSNRLKEGRSVHGFLVRSFSCL-SLIIDTAMIDMYSKCGRIDLA 968 + ++ T + V+ AC R+ L EG++ ++ S + M ++Y++ G I A Sbjct: 383 VHPDEVTFIGVLCACARAGLLTEGKNFFSQMIYSHGLKPNFAHFWCMANLYARAGLIQEA 442 Query: 969 HLIFERMQRK------NSVSWNAMILGHC-IHGNPVEGLSLYAEMLD 1088 I Q + S+ W A +L C GN G + ++D Sbjct: 443 EDILRTTQEEEEDMPLESLVW-ANLLNSCRFQGNVALGERIANSLID 488 >ref|XP_004135020.1| PREDICTED: pentatricopeptide repeat-containing protein At3g51320-like [Cucumis 
sativus] Length = 575 Score = 380 bits (975), Expect = e-103 Identities = 191/385 (49%), Positives = 264/385 (68%), Gaps = 1/385 (0%) Frame = +3 Query: 111 NRALYLLQCCKTSIHLFQIQSYLITSGLFQDPSFAGRLLKLSSNLINDLRCTVLIFKRLD 290 N++ LLQ C++ LFQ +LITSGLF D +A R+L L ++ D+ TVLIF+ + Sbjct: 50 NQSHSLLQSCQSVRELFQFHGHLITSGLFNDHFWANRVL-LQASEFGDIVYTVLIFRHIK 108 Query: 291 SPDAFCVNTVIKKYSCSNHHSEAVLFYVELLRDGDFHPNGFTFPPLISACAKLGCLSLGR 470 P+ FCVN VIK YS S EAV Y E L +G P+ +TF L SACA GC + GR Sbjct: 109 VPNTFCVNRVIKAYSLSTVPLEAVFVYFEWLGNG-LRPDSYTFLSLFSACASFGCGASGR 167 Query: 471 MCHGHAVKLGVDSVLPVQNSLVHFYACCGLMDVAYKVFVEMLVRDLVSWNTMVDGFAKVG 650 CHG A K GVDSV+ + NSL+H Y CC +++ KVF EM +DLVSWN++V +A+VG Sbjct: 168 KCHGQAFKNGVDSVMVLGNSLIHMYGCCKHIELGRKVFDEMSTQDLVSWNSIVTAYARVG 227 Query: 651 EMELAHKLFDAIPERNVVSWNVMITGYLKFGNPGNALKLFREMMVQSFNSNDTTVVQVIT 830 ++ AH +FD +PERNVVSWN+MI+ YL+ GNPG A+KLFR M+ N+TT+V V++ Sbjct: 228 DLYTAHDMFDVMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNVGIRGNNTTMVNVLS 287 Query: 831 ACGRSNRLKEGRSVHGFLVRSFSCLSLIIDTAMIDMYSKCGRIDLAHLIFERMQRKNSVS 1010 AC RS RL EGRSVHGF+ R+ + I+TA++DMYSKC R+ +A +F+R+ +N V+ Sbjct: 288 ACSRSARLNEGRSVHGFMYRASMKFCVFINTALVDMYSKCHRVSVARRVFDRLMIRNLVT 347 Query: 1011 WNAMILGHCIHGNPVEGLSLYAEMLDKSRQKDSLAAESARNFKSNE-ENWILPDELTFIG 1187 WNAMILGH +HGNP +GL L+ EM+ + R+ + + + FK +E + + PD++TFIG Sbjct: 348 WNAMILGHSLHGNPKDGLELFEEMVGELREINE-ETGNGKKFKQDEGKRKVFPDQITFIG 406 Query: 1188 VLCACAHLGLLEEGRNHFSQMTDVF 1262 VLCACA GLL++ N+F +M +VF Sbjct: 407 VLCACARAGLLKDAENYFDEMINVF 431 >gb|EXC35313.1| hypothetical protein L484_026636 [Morus notabilis] Length = 577 Score = 370 bits (950), Expect = e-100 Identities = 189/381 (49%), Positives = 259/381 (67%), Gaps = 2/381 (0%) Frame = +3 Query: 126 LLQCCKTSIHLFQIQSYLITSGLFQDPSFAGRLLKLSSNLINDLRCTVLIFKRLDSPDAF 305 LL +T I + Q+ + ++TSG+F +A + LK S+ + T+LIF+ +D P AF Sbjct: 51 LLDASQTLIQVRQVHANMLTSGIFTS-FWARKFLKFYSDF-GHVDYTILIFRYIDFPGAF 108 Query: 306 CVNTVIKKYSCSNHHSEAVLFYVELLRDGDFHPNGFTFPPLISACAKLGCLSLGRMCHGH 485 CVNTV++ YS ++A++FY E LR+G 
F PN +TF ++ CAKLG L G MC G Sbjct: 109 CVNTVLRAYSVGFDSNQALIFYFESLRNG-FSPNSYTFVTVLGCCAKLGSLESGEMCRGQ 167 Query: 486 AVKLGVDSVLPVQNSLVHFYACCGLMDVAYKVFVEMLVRDLVSWNTMVDGFAKVGEMELA 665 A+K GVDS L +QNSL+H Y CCG + +A KV EM RDLVSWN+++D + +VG +++A Sbjct: 168 AIKNGVDSALQIQNSLIHMYGCCGNVGLARKVLDEMSERDLVSWNSLLDVYVRVGRVDVA 227 Query: 666 HKLFDAIPERNVVSWNVMITGYLKFGNPGNALKLFREMMVQSFNSNDTTVVQVITACGRS 845 H++FD +PERNV SWN++ GYL G PG LKL REM + TTVV ITAC R+ Sbjct: 228 HRMFDKMPERNVASWNIIARGYLNGGVPGCVLKLVREMGKLGLRGDGTTVVNAITACARA 287 Query: 846 NRLKEGRSVHGFLVRSFSCLSLIIDTAMIDMYSKCGRIDLAHLIFERMQRKNSVSWNAMI 1025 +RLKEGRSVHG L+R+ S+ IDTA+IDMYSKC R+ +A +F+ M KN VSWNAMI Sbjct: 288 SRLKEGRSVHGSLIRTGLESSVFIDTALIDMYSKCHRVGVACTVFDNMVEKNLVSWNAMI 347 Query: 1026 LGHCIHGNPVEGLSLYAEMLDKSRQKDSLAAESARNFKSNEENW--ILPDELTFIGVLCA 1199 LGHCIHG+P+ G+ LY EM+ K+ +++ + NE+ + PDE+TFIGVLCA Sbjct: 348 LGHCIHGDPLAGIRLYNEMVGIKSSKNE-ESDNCEILRPNEDGGGKLRPDEVTFIGVLCA 406 Query: 1200 CAHLGLLEEGRNHFSQMTDVF 1262 CA LL EG+++F +MT+VF Sbjct: 407 CARARLLPEGKDYFREMTNVF 427 >ref|XP_004158900.1| PREDICTED: pentatricopeptide repeat-containing protein At3g51320-like [Cucumis sativus] Length = 547 Score = 352 bits (902), Expect = 3e-94 Identities = 173/349 (49%), Positives = 240/349 (68%), Gaps = 1/349 (0%) Frame = +3 Query: 219 RLLKLSSNLINDLRCTVLIFKRLDSPDAFCVNTVIKKYSCSNHHSEAVLFYVELLRDGDF 398 R + L ++ D+ TVLIF+ + P+ FCVN VIK YS S EAV Y E L +G Sbjct: 86 RAVLLQASEFGDIVYTVLIFRHIKVPNTFCVNRVIKAYSLSTVPLEAVFVYFEWLGNG-L 144 Query: 399 HPNGFTFPPLISACAKLGCLSLGRMCHGHAVKLGVDSVLPVQNSLVHFYACCGLMDVAYK 578 P+ +TF L SACA GC + GR CHG A K GVDSV+ + NSL+H Y CC +++ K Sbjct: 145 RPDSYTFLSLFSACASFGCGASGRKCHGQAFKNGVDSVMVLGNSLIHMYGCCKHIELGRK 204 Query: 579 VFVEMLVRDLVSWNTMVDGFAKVGEMELAHKLFDAIPERNVVSWNVMITGYLKFGNPGNA 758 VF EM +DLVSWN++V +A+VG++ AH +FD +PERNVVSWN+MI+ YL+ GNPG A Sbjct: 205 VFDEMSTQDLVSWNSIVTAYARVGDLYTAHDMFDVMPERNVVSWNLMISEYLRGGNPGCA 264 Query: 759 LKLFREMMVQSFNSNDTTVVQVITACGRSNRLKEGRSVHGFLVRSFSCLSLIIDTAMIDM 938 +KLFR M+ N+TT+V V++AC RS RL 
EGRSVHGF+ R+ + I+TA++DM Sbjct: 265 MKLFRNMVNVGIRGNNTTMVNVLSACSRSARLNEGRSVHGFMYRASMKFCVFINTALVDM 324 Query: 939 YSKCGRIDLAHLIFERMQRKNSVSWNAMILGHCIHGNPVEGLSLYAEMLDKSRQKDSLAA 1118 YSKC R+ +A +F+R+ +N V+WNAMILGH +HGNP +GL L+ EM+ + R+ + Sbjct: 325 YSKCHRVSVARRVFDRLMIRNLVTWNAMILGHSLHGNPKDGLELFEEMVGELREINE-ET 383 Query: 1119 ESARNFKSNE-ENWILPDELTFIGVLCACAHLGLLEEGRNHFSQMTDVF 1262 + + FK +E + + PD++TFIGVLCACA GLL++ N+F +M +VF Sbjct: 384 GNGKKFKQDEGKRKVFPDQITFIGVLCACARAGLLKDAENYFDEMINVF 432 >ref|XP_002877796.1| binding protein [Arabidopsis lyrata subsp. lyrata] gi|297323634|gb|EFH54055.1| binding protein [Arabidopsis lyrata subsp. lyrata] Length = 530 Score = 346 bits (888), Expect = 1e-92 Identities = 174/370 (47%), Positives = 241/370 (65%) Frame = +3 Query: 153 HLFQIQSYLITSGLFQDPSFAGRLLKLSSNLINDLRCTVLIFKRLDSPDAFCVNTVIKKY 332 HLFQ+ + LITSG F D S+A RLLK SS D T+ IF+ + +C N V K Y Sbjct: 37 HLFQVHARLITSGNFWDSSWAIRLLKCSSRF-GDSSYTLSIFRSIGK--LYCANPVFKAY 93 Query: 333 SCSNHHSEAVLFYVELLRDGDFHPNGFTFPPLISACAKLGCLSLGRMCHGHAVKLGVDSV 512 S+ +A+ FY ++LR G F P+ +TF L+S K C+ G+MCHG A+K G D V Sbjct: 94 LVSSSPKQALGFYFDILRFG-FVPDTYTFVSLVSCIEKTCCVDSGKMCHGQAIKHGCDQV 152 Query: 513 LPVQNSLVHFYACCGLMDVAYKVFVEMLVRDLVSWNTMVDGFAKVGEMELAHKLFDAIPE 692 LPVQNSL+H Y CCG +D+A K+FVE+ RD+VSWN+++ G + G++ AHKLFD +PE Sbjct: 153 LPVQNSLIHMYTCCGALDLAKKLFVEIPKRDIVSWNSIIAGVVRNGDVLYAHKLFDEMPE 212 Query: 693 RNVVSWNVMITGYLKFGNPGNALKLFREMMVQSFNSNDTTVVQVITACGRSNRLKEGRSV 872 +N++SWN+MI+ YL NPG ++ LFREM+ F N+ T+V ++ ACGRS RLKEGRSV Sbjct: 213 KNMISWNIMISAYLGANNPGVSIFLFREMVGAGFQGNENTLVLLLNACGRSARLKEGRSV 272 Query: 873 HGFLVRSFSCLSLIIDTAMIDMYSKCGRIDLAHLIFERMQRKNSVSWNAMILGHCIHGNP 1052 H L+R+F S++IDTA+IDMY KC +DLA IF+ + +N V+WN MIL HC+HG P Sbjct: 273 HASLIRTFLNSSVVIDTALIDMYGKCKEVDLARRIFDSLSVRNKVTWNVMILAHCLHGRP 332 Query: 1053 VEGLSLYAEMLDKSRQKDSLAAESARNFKSNEENWILPDELTFIGVLCACAHLGLLEEGR 1232 +GL L+ M++ + PDE+TF+GVLC CA GL+ +G+ Sbjct: 333 EDGLELFEAMIN---------------------GLLRPDEVTFVGVLCGCARAGLVYQGQ 371 Query: 1233 NHFSQMTDVF 
1262 +++S M D F Sbjct: 372 SYYSLMVDEF 381 >ref|NP_190700.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|122230198|sp|Q0WVU0.1|PP278_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At3g51320 gi|110741620|dbj|BAE98758.1| hypothetical protein [Arabidopsis thaliana] gi|332645257|gb|AEE78778.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 530 Score = 345 bits (886), Expect = 2e-92 Identities = 174/379 (45%), Positives = 245/379 (64%) Frame = +3 Query: 126 LLQCCKTSIHLFQIQSYLITSGLFQDPSFAGRLLKLSSNLINDLRCTVLIFKRLDSPDAF 305 L++ + HLFQ+ + LITSG F D S+A RLLK SS+ D TV I++ + + Sbjct: 28 LVEDSNSITHLFQVHARLITSGNFWDSSWAIRLLK-SSSRFGDSSYTVSIYRSIGK--LY 84 Query: 306 CVNTVIKKYSCSNHHSEAVLFYVELLRDGDFHPNGFTFPPLISACAKLGCLSLGRMCHGH 485 C N V K Y S+ +A+ FY ++LR G F P+ +TF LIS K C+ G+MCHG Sbjct: 85 CANPVFKAYLVSSSPKQALGFYFDILRFG-FVPDSYTFVSLISCIEKTCCVDSGKMCHGQ 143 Query: 486 AVKLGVDSVLPVQNSLVHFYACCGLMDVAYKVFVEMLVRDLVSWNTMVDGFAKVGEMELA 665 A+K G D VLPVQNSL+H Y CCG +D+A K+FVE+ RD+VSWN+++ G + G++ A Sbjct: 144 AIKHGCDQVLPVQNSLMHMYTCCGALDLAKKLFVEIPKRDIVSWNSIIAGMVRNGDVLAA 203 Query: 666 HKLFDAIPERNVVSWNVMITGYLKFGNPGNALKLFREMMVQSFNSNDTTVVQVITACGRS 845 HKLFD +P++N++SWN+MI+ YL NPG ++ LFREM+ F N++T+V ++ ACGRS Sbjct: 204 HKLFDEMPDKNIISWNIMISAYLGANNPGVSISLFREMVRAGFQGNESTLVLLLNACGRS 263 Query: 846 NRLKEGRSVHGFLVRSFSCLSLIIDTAMIDMYSKCGRIDLAHLIFERMQRKNSVSWNAMI 1025 RLKEGRSVH L+R+F S++IDTA+IDMY KC + LA IF+ + +N V+WN MI Sbjct: 264 ARLKEGRSVHASLIRTFLNSSVVIDTALIDMYGKCKEVGLARRIFDSLSIRNKVTWNVMI 323 Query: 1026 LGHCIHGNPVEGLSLYAEMLDKSRQKDSLAAESARNFKSNEENWILPDELTFIGVLCACA 1205 L HC+HG P GL L+ M++ + PDE+TF+GVLC CA Sbjct: 324 LAHCLHGRPEGGLELFEAMIN---------------------GMLRPDEVTFVGVLCGCA 362 Query: 1206 HLGLLEEGRNHFSQMTDVF 1262 GL+ +G++++S M D F Sbjct: 363 RAGLVSQGQSYYSLMVDEF 381 >ref|XP_006403930.1| hypothetical protein EUTSA_v10010283mg [Eutrema salsugineum] gi|557105049|gb|ESQ45383.1| hypothetical protein EUTSA_v10010283mg [Eutrema salsugineum] 
Length = 529 Score = 342 bits (876), Expect = 3e-91 Identities = 171/383 (44%), Positives = 247/383 (64%) Frame = +3 Query: 114 RALYLLQCCKTSIHLFQIQSYLITSGLFQDPSFAGRLLKLSSNLINDLRCTVLIFKRLDS 293 R L++ T HLFQ+ + LI SG F D ++ RLLK SS D TV IF+ + Sbjct: 24 RGFKLVEESTTVRHLFQVHARLIASGNFWDSTWGIRLLKCSSRF-GDASYTVSIFRSIGK 82 Query: 294 PDAFCVNTVIKKYSCSNHHSEAVLFYVELLRDGDFHPNGFTFPPLISACAKLGCLSLGRM 473 +C N V K Y S+ +A+ FY ++ + G F P+ ++F PL K C+ G+M Sbjct: 83 --LYCANPVFKAYLLSSTPQQALGFYFDIRKCG-FVPDTYSFVPLFGCIEKTCCVDSGKM 139 Query: 474 CHGHAVKLGVDSVLPVQNSLVHFYACCGLMDVAYKVFVEMLVRDLVSWNTMVDGFAKVGE 653 CHG A+K G D VLPVQNSL+H Y CCG +++A K+FVE+ RD+VSWN+++ G + G+ Sbjct: 140 CHGQAIKHGCDQVLPVQNSLMHMYTCCGALELAKKLFVEIPKRDIVSWNSIIAGAVRDGD 199 Query: 654 MELAHKLFDAIPERNVVSWNVMITGYLKFGNPGNALKLFREMMVQSFNSNDTTVVQVITA 833 + AHKLFD +PE+N+VSWN+MI+ YL NPG ++KLFREM+ F+ N+ T+V +++A Sbjct: 200 ILYAHKLFDEMPEKNMVSWNIMISAYLGANNPGVSIKLFREMVGAGFHGNERTLVLLMSA 259 Query: 834 CGRSNRLKEGRSVHGFLVRSFSCLSLIIDTAMIDMYSKCGRIDLAHLIFERMQRKNSVSW 1013 CGRS RLKEGRSVH L+R S++IDTA+I+MY KC +DLA IF+ + R+N V+W Sbjct: 260 CGRSARLKEGRSVHASLIRILLNTSVVIDTALINMYGKCKEVDLARRIFDSVSRRNRVTW 319 Query: 1014 NAMILGHCIHGNPVEGLSLYAEMLDKSRQKDSLAAESARNFKSNEENWILPDELTFIGVL 1193 N MIL HC+HG+P +GL L+ +M++ ++PDE+TF+GVL Sbjct: 320 NVMILAHCLHGDPEDGLKLFQDMIN---------------------GMLIPDEVTFVGVL 358 Query: 1194 CACAHLGLLEEGRNHFSQMTDVF 1262 C CA GL+ +G+++++ M D F Sbjct: 359 CGCARSGLVSQGKSYYAMMVDEF 381 >ref|XP_006425390.1| hypothetical protein CICLE_v10027592mg [Citrus clementina] gi|557527380|gb|ESR38630.1| hypothetical protein CICLE_v10027592mg [Citrus clementina] Length = 563 Score = 331 bits (848), Expect = 5e-88 Identities = 175/385 (45%), Positives = 244/385 (63%), Gaps = 1/385 (0%) Frame = +3 Query: 111 NRALYLLQCCKTSIHLFQIQSYLITSGLFQDPSF-AGRLLKLSSNLINDLRCTVLIFKRL 287 +R + L+ C+ L QIQ++LITSGLF + SF LLK S++ TVL+FK + Sbjct: 49 DRTISFLKSCQNMKQLLQIQAHLITSGLFFNNSFWTINLLKHSADF-GSPDYTVLVFKCI 107 Query: 288 
DSPDAFCVNTVIKKYSCSNHHSEAVLFYVELLRDGDFHPNGFTFPPLISACAKLGCLSLG 467 ++P FCVN V+K YS S +AV+FY +++++G F PN +TF L +CAK GC+ G Sbjct: 108 NNPGTFCVNAVVKAYSNSCVPDQAVVFYFQMIKNG-FMPNSYTFVSLFGSCAKTGCVERG 166 Query: 468 RMCHGHAVKLGVDSVLPVQNSLVHFYACCGLMDVAYKVFVEMLVRDLVSWNTMVDGFAKV 647 MCHG A+K GVD LPV NSL++ Y C G MD A FV+M RDL+SWN++V G + Sbjct: 167 GMCHGLALKNGVDFELPVMNSLINMYGCFGAMDCARNTFVQMSHRDLISWNSIVSGHVRS 226 Query: 648 GEMELAHKLFDAIPERNVVSWNVMITGYLKFGNPGNALKLFREMMVQSFNSNDTTVVQVI 827 G+M AH+LFD +PERNVVSWN+MI+GY K GNPG +LKLFREMM F ND T+ V+ Sbjct: 227 GDMSAAHELFDIMPERNVVSWNIMISGYSKSGNPGCSLKLFREMMKSGFRGNDKTMASVL 286 Query: 828 TACGRSNRLKEGRSVHGFLVRSFSCLSLIIDTAMIDMYSKCGRIDLAHLIFERMQRKNSV 1007 TACGRS R EGRSVHG+ VR+ ++I+DTA+ID+YSKC ++++A +F+ M +N Sbjct: 287 TACGRSARFNEGRSVHGYTVRTSLKPNIILDTALIDLYSKCQKVEVAQRVFDSMADRN-- 344 Query: 1008 SWNAMILGHCIHGNPVEGLSLYAEMLDKSRQKDSLAAESARNFKSNEENWILPDELTFIG 1187 +EG+ L+ +++++ S++ PDE+TFIG Sbjct: 345 ---------------LEGIKLFTALVNETVAGGSIS----------------PDEITFIG 373 Query: 1188 VLCACAHLGLLEEGRNHFSQMTDVF 1262 V+CAC LL EGR +F +M D + Sbjct: 374 VICACVRAELLTEGRIYFRKMIDFY 398 >ref|XP_006857380.1| hypothetical protein AMTR_s00067p00130250 [Amborella trichopoda] gi|548861473|gb|ERN18847.1| hypothetical protein AMTR_s00067p00130250 [Amborella trichopoda] Length = 823 Score = 316 bits (810), Expect = 1e-83 Identities = 170/392 (43%), Positives = 243/392 (61%), Gaps = 2/392 (0%) Frame = +3 Query: 90 NLSATITNRALYLLQCCKTSIHLFQIQSYLITSGLFQDPSFAGRLLK-LSSNLINDLRCT 266 N +++ +AL L CKT Q+Q++ IT+GL P + L+K L+++ L Sbjct: 15 NCKSSVVKQALVSLDSCKTMREFKQLQAHTITNGLQNHPLLSTHLVKFLATSDSGCLSYA 74 Query: 267 VLIFKRLDSPDAFCVNTVIKKYSCSNHHSEAVLFYVELLRDGDFHPNGFTFPPLISACAK 446 +++F++L+SP+ NT+IK S S+ +A+ FY E++ G HPN FTFPPL+++CAK Sbjct: 75 LMVFRQLNSPELRAYNTIIKALSLSSDPIQAISFYHEMVLKG-VHPNNFTFPPLVASCAK 133 Query: 447 LGCLSLGRMCHGHAVKLGVDSVLPVQNSLVHFYACCGLMDVAYKVFVEMLVRDLVSWNTM 626 + ++ G CH VK G D V+ V NSLVH YAC L+ A +VF EM+ RD VSWN+M Sbjct: 134 
VTAINEGEKCHTEVVKRGFDQVIFVANSLVHMYACFKLISYARQVFYEMVERDFVSWNSM 193 Query: 627 VDGFAKVGEMELAHKLFDAIPERNVVSWNVMITGYLKFGNPGNALKLFREMMVQSFNSND 806 ++G +G++ A KLFD +PERN +SWNVMI GY + G+PG+ LKLFREM + Sbjct: 194 INGHILLGDIMNARKLFDEMPERNQISWNVMIGGYARSGSPGHGLKLFREMQKKGIKGTI 253 Query: 807 TTVVQVITACGRSNRLKEGRSVHGFLVRSFSCLS-LIIDTAMIDMYSKCGRIDLAHLIFE 983 TT+V ++ AC +S RL EGRSVH +++RS S S +I++TA++DMY KCG++D A +F Sbjct: 254 TTMVSILNACAKSARLLEGRSVHCYIIRSSSMDSGVILETALVDMYCKCGKLDSAKRVFY 313 Query: 984 RMQRKNSVSWNAMILGHCIHGNPVEGLSLYAEMLDKSRQKDSLAAESARNFKSNEENWIL 1163 M +N VSWNAMI G I G+ E L+L F S E + I Sbjct: 314 EMPERNLVSWNAMIFGQAICGDYKEALAL---------------------FDSMELHSIE 352 Query: 1164 PDELTFIGVLCACAHLGLLEEGRNHFSQMTDV 1259 PDE++++GVLCACA L EGR +F QM + Sbjct: 353 PDEVSYVGVLCACARGVALLEGRRYFDQMNRI 384 >ref|XP_002531149.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223529262|gb|EEF31234.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 311 Score = 298 bits (762), Expect = 4e-78 Identities = 142/274 (51%), Positives = 192/274 (70%) Frame = +3 Query: 447 LGCLSLGRMCHGHAVKLGVDSVLPVQNSLVHFYACCGLMDVAYKVFVEMLVRDLVSWNTM 626 +GCL G+ CHG +K GVD +LPVQNSL+HFY CCGL+++A KVF EM DLVSWN++ Sbjct: 1 MGCLQSGQKCHGQVLKNGVDCILPVQNSLIHFYGCCGLVELARKVFDEMSQADLVSWNSI 60 Query: 627 VDGFAKVGEMELAHKLFDAIPERNVVSWNVMITGYLKFGNPGNALKLFREMMVQSFNSND 806 V+ +A VGE++ AH +F+ + + VVSWNVMI GYLK NPG +L LFR+M+ ND Sbjct: 61 VNAYANVGELDTAHDIFNIMLGKTVVSWNVMIYGYLKGNNPGCSLMLFRKMVNSGLRGND 120 Query: 807 TTVVQVITACGRSNRLKEGRSVHGFLVRSFSCLSLIIDTAMIDMYSKCGRIDLAHLIFER 986 T+V V++ACG+S RL EGRS+HGFL+R+ S+I+ T+++DMYSKC +++LA IF+ Sbjct: 121 KTMVSVLSACGKSARLTEGRSIHGFLIRTSLNFSVILLTSLMDMYSKCQKVELARSIFDS 180 Query: 987 MQRKNSVSWNAMILGHCIHGNPVEGLSLYAEMLDKSRQKDSLAAESARNFKSNEENWILP 1166 M +N + WNAMILGHCIHG P +GL L+AEM++ + + ILP Sbjct: 181 MVHRNLICWNAMILGHCIHGKPADGLDLFAEMVNSTGET------------------ILP 222 Query: 1167 DELTFIGVLCACAHLGLLEEGRNHFSQMTDVFCL 1268 DE+T+IGV+ ACA GLL EGR FSQM D 
+ + Sbjct: 223 DEVTYIGVISACARAGLLTEGRKFFSQMMDKYTI 256 >emb|CAB62654.1| putative protein [Arabidopsis thaliana] Length = 486 Score = 285 bits (728), Expect = 4e-74 Identities = 144/343 (41%), Positives = 205/343 (59%) Frame = +3 Query: 234 SSNLINDLRCTVLIFKRLDSPDAFCVNTVIKKYSCSNHHSEAVLFYVELLRDGDFHPNGF 413 SS+ D TV I++ + +C N V K Y S+ +A+ FY ++LR G F P+ + Sbjct: 40 SSSRFGDSSYTVSIYRSIGK--LYCANPVFKAYLVSSSPKQALGFYFDILRFG-FVPDSY 96 Query: 414 TFPPLISACAKLGCLSLGRMCHGHAVKLGVDSVLPVQNSLVHFYACCGLMDVAYKVFVEM 593 TF LIS K C+ G+MCHG A+K G D VLPVQNSL+H Y CCG +D+A K+FVE+ Sbjct: 97 TFVSLISCIEKTCCVDSGKMCHGQAIKHGCDQVLPVQNSLMHMYTCCGALDLAKKLFVEI 156 Query: 594 LVRDLVSWNTMVDGFAKVGEMELAHKLFDAIPERNVVSWNVMITGYLKFGNPGNALKLFR 773 RD+VSWN+++ G + G++ AHKLFD +P++N++SWN+MI+ YL NPG ++ LFR Sbjct: 157 PKRDIVSWNSIIAGMVRNGDVLAAHKLFDEMPDKNIISWNIMISAYLGANNPGVSISLFR 216 Query: 774 EMMVQSFNSNDTTVVQVITACGRSNRLKEGRSVHGFLVRSFSCLSLIIDTAMIDMYSKCG 953 EM+ F N++T+V ++ ACGRS RLKE A+IDMY KC Sbjct: 217 EMVRAGFQGNESTLVLLLNACGRSARLKE---------------------ALIDMYGKCK 255 Query: 954 RIDLAHLIFERMQRKNSVSWNAMILGHCIHGNPVEGLSLYAEMLDKSRQKDSLAAESARN 1133 + LA IF+ + +N V+WN MIL HC+HG P GL L+ M++ Sbjct: 256 EVGLARRIFDSLSIRNKVTWNVMILAHCLHGRPEGGLELFEAMIN--------------- 300 Query: 1134 FKSNEENWILPDELTFIGVLCACAHLGLLEEGRNHFSQMTDVF 1262 + PDE+TF+GVLC CA GL+ +G++++S M D F Sbjct: 301 ------GMLRPDEVTFVGVLCGCARAGLVSQGQSYYSLMVDEF 337 >ref|XP_006844721.1| hypothetical protein AMTR_s00016p00252780 [Amborella trichopoda] gi|548847192|gb|ERN06396.1| hypothetical protein AMTR_s00016p00252780 [Amborella trichopoda] Length = 428 Score = 263 bits (671), Expect = 2e-67 Identities = 149/405 (36%), Positives = 230/405 (56%), Gaps = 6/405 (1%) Frame = +3 Query: 66 IDSFNRNNNLSATITN------RALYLLQCCKTSIHLFQIQSYLITSGLFQDPSFAGRLL 227 I +F+ ++ LS +N +AL LLQ C TS HL QI ++L +GL +D +L+ Sbjct: 11 IPTFSHDHFLSPQSSNPSFSHYQALSLLQKCSTSNHLLQIHAHLFRTGLHRDYILITKLI 70 Query: 228 KLSSNLINDLRCTVLIFKRLDSPDAFCVNTVIKKYSCSNHHSEAVLFYVELLRDGDFHPN 407 L S + + L+F ++++P F NT+I+ Y 
SN+ EA+L Y ++ G F P+ Sbjct: 71 NLCS-IHQKIDHATLVFNQIENPLTFTWNTMIRAYFKSNYPEEAILMYNLMVIHG-FLPD 128 Query: 408 GFTFPPLISACAKLGCLSLGRMCHGHAVKLGVDSVLPVQNSLVHFYACCGLMDVAYKVFV 587 FT+P +I AC L G+ HG A+K G+ + +QN+L+ Y C +A+K+F Sbjct: 129 KFTYPFVIKACVAFSSLEKGKEIHGRAIKAGMVPDIFLQNTLMELYMKCNEKTLAHKLFD 188 Query: 588 EMLVRDLVSWNTMVDGFAKVGEMELAHKLFDAIPERNVVSWNVMITGYLKFGNPGNALKL 767 +M V+ +VSW TMV G G+M A ++FD +PERNVVSW MI GY++ P AL+L Sbjct: 189 KMSVKSVVSWTTMVAGLVSHGDMASARRVFDEMPERNVVSWTAMIHGYVRNNQPHEALEL 248 Query: 768 FREMMVQSFNSNDTTVVQVITACGRSNRLKEGRSVHGFLVRSFSCLSLIIDTAMIDMYSK 947 F M+ + N+ T+V ++ C N L+ GR VH F+ +S LS+ + TA+IDMYS Sbjct: 249 FILMLRANVRPNEFTIVSLLLVCTSLNSLRLGRWVHEFMAKSGFELSVYLGTALIDMYSN 308 Query: 948 CGRIDLAHLIFERMQRKNSVSWNAMILGHCIHGNPVEGLSLYAEMLDKSRQKDSLAAESA 1127 CG I+ A +F+ M ++ +WN+MI +HG E L+++ M Sbjct: 309 CGSINDAKNVFDGMSERSVATWNSMITSLGVHGKGKEALNVFGAM--------------- 353 Query: 1128 RNFKSNEENWILPDELTFIGVLCACAHLGLLEEGRNHFSQMTDVF 1262 E+ + PD++TF+GVLCAC ++GL+EEG +F M V+ Sbjct: 354 ------EKGKVRPDDITFVGVLCACVNMGLVEEGGVYFDSMYSVY 392