BLASTX nr result
ID: Mentha25_contig00054649
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00054649
         (415 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|EYU26534.1| hypothetical protein MIMGU_mgv1a024701mg, partial...   228   9e-58
ref|XP_004231338.1| PREDICTED: pentatricopeptide repeat-containi...   196   2e-48
ref|XP_007218862.1| hypothetical protein PRUPE_ppa002838mg [Prun...   190   2e-46
ref|XP_004308755.1| PREDICTED: pentatricopeptide repeat-containi...   184   8e-45
ref|XP_007042234.1| Mitochondrial editing factor 21 [Theobroma c...   184   1e-44
gb|EPS73257.1| hypothetical protein M569_01496 [Genlisea aurea]       184   1e-44
ref|XP_002269662.2| PREDICTED: putative pentatricopeptide repeat...   177   1e-42
emb|CBI23791.3| unnamed protein product [Vitis vinifera]              177   1e-42
ref|XP_004156102.1| PREDICTED: putative pentatricopeptide repeat...   172   3e-41
ref|XP_004140278.1| PREDICTED: pentatricopeptide repeat-containi...   172   3e-41
gb|EXB37797.1| hypothetical protein L484_002732 [Morus notabilis]     171   1e-40
ref|XP_007051479.1| Pentatricopeptide repeat (PPR) superfamily p...   137   2e-30
ref|XP_002871343.1| pentatricopeptide repeat-containing protein ...   137   2e-30
ref|XP_002523296.1| pentatricopeptide repeat-containing protein,...   135   8e-30
ref|NP_196468.1| pentatricopeptide repeat-containing protein [Ar...   133   2e-29
dbj|BAE98759.1| hypothetical protein [Arabidopsis thaliana]           133   2e-29
ref|XP_006289662.1| hypothetical protein CARUB_v10003220mg [Caps...   132   7e-29
ref|XP_006399350.1| hypothetical protein EUTSA_v10013320mg [Eutr...
130 1e-28 gb|EXB83859.1| hypothetical protein L484_023466 [Morus notabilis] 130 2e-28 gb|EYU32269.1| hypothetical protein MIMGU_mgv1a026672mg [Mimulus... 130 2e-28 >gb|EYU26534.1| hypothetical protein MIMGU_mgv1a024701mg, partial [Mimulus guttatus] Length = 518 Score = 228 bits (580), Expect = 9e-58 Identities = 103/135 (76%), Positives = 125/135 (92%) Frame = +2 Query: 11 SGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDLKPN 190 SGY ++GT ++ARELFD+MP KNIVSWT+MISGYTQNGLA NALQLFD MS +DS++KPN Sbjct: 84 SGYMRDGTFELARELFDKMPVKNIVSWTSMISGYTQNGLADNALQLFDEMSRQDSEMKPN 143 Query: 191 WVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKLCFD 370 WVTI+SVLPACAHSSALE+G++IH FA+++G DSHPSVQTAL+GMYAKCGSL DA+ CFD Sbjct: 144 WVTIMSVLPACAHSSALERGKKIHNFAQEKGLDSHPSVQTALVGMYAKCGSLADARWCFD 203 Query: 371 RIKPSSRNLVAWNSL 415 RIKP+S+NLV+WNS+ Sbjct: 204 RIKPNSKNLVSWNSM 218 Score = 59.7 bits (143), Expect = 4e-07 Identities = 44/149 (29%), Positives = 70/149 (46%), Gaps = 11/149 (7%) Frame = +2 Query: 2 ALISGYTKEGTMDVARELFDEMP--TKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDS 175 AL+ Y K G++ AR FD + +KN+VSW +MI+ Y +GL A+Q F+ M +S Sbjct: 184 ALVGMYAKCGSLADARWCFDRIKPNSKNLVSWNSMITAYASHGLGMEAVQTFEDMI--ES 241 Query: 176 DLKPNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSV---------QTALIGMY 328 + P+ ++ +L C+HS ++ G R FDS SV ++ + Sbjct: 242 GVGPDGISFTGLLSGCSHSGLVDIGLRY--------FDSMSSVYLVEKKHEHYACVVDLL 293 Query: 329 AKCGSLQDAKLCFDRIKPSSRNLVAWNSL 415 + G L +A R+ P W SL Sbjct: 294 GRAGRLVEAYELISRM-PMQAGPSVWGSL 321 >ref|XP_004231338.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like [Solanum lycopersicum] Length = 625 Score = 196 bits (499), Expect = 2e-48 Identities = 89/137 (64%), Positives = 113/137 (82%) Frame = +2 Query: 5 LISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDLK 184 LI+GY K+G A ELF+EMP +NIVSWT MISGY QNGLA +LQLFD+M DS+++ Sbjct: 189 LIAGYMKDGLFKDAEELFEEMPIRNIVSWTAMISGYAQNGLADESLQLFDKMLDPDSEVR 248 Query: 185 PNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKLC 364 
PNWVT++SVLPACAHS+AL++G++IH FAR+ G + +PSVQTALI MYAKCGSL DA+LC Sbjct: 249 PNWVTVMSVLPACAHSAALDRGKKIHSFAREAGLEKNPSVQTALIAMYAKCGSLVDARLC 308 Query: 365 FDRIKPSSRNLVAWNSL 415 FD+I P + LVAWN++ Sbjct: 309 FDQINPREKKLVAWNTM 325 Score = 61.6 bits (148), Expect = 1e-07 Identities = 40/137 (29%), Positives = 65/137 (47%) Frame = +2 Query: 5 LISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDLK 184 +++ Y G +D A +FD + + + MI T G+ +++F +M S + Sbjct: 57 MVAMYASSGEIDSASYIFDSATEPSSLLYNAMIRALTLYGITKRTIEIFFQMHSLG--FR 114 Query: 185 PNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKLC 364 + T V +CA S + G+ +H GF V T+L+ MY KCG L DA+ Sbjct: 115 GDNFTFPFVFKSCADLSDVWCGKCVHSLILRSGFVFDMYVGTSLVDMYVKCGDLIDARKL 174 Query: 365 FDRIKPSSRNLVAWNSL 415 FD + R++ AWN L Sbjct: 175 FDEM--PVRDVSAWNVL 189 >ref|XP_007218862.1| hypothetical protein PRUPE_ppa002838mg [Prunus persica] gi|462415324|gb|EMJ20061.1| hypothetical protein PRUPE_ppa002838mg [Prunus persica] Length = 628 Score = 190 bits (482), Expect = 2e-46 Identities = 89/138 (64%), Positives = 110/138 (79%) Frame = +2 Query: 2 ALISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDL 181 ALI+GY K+G + A +LF MP KNIVSWT MISGYTQNGLA AL LFD M +DS++ Sbjct: 190 ALIAGYMKDGEICFAEDLFRRMPCKNIVSWTAMISGYTQNGLAEQALVLFDEMLRKDSEV 249 Query: 182 KPNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKL 361 KPNWVTI+SVLPACAHS+ALE+GR+IH FA G DS+ S+QTAL+ MYAKCGSL DA+ Sbjct: 250 KPNWVTIMSVLPACAHSAALERGRQIHNFASRTGLDSNTSIQTALLAMYAKCGSLSDARQ 309 Query: 362 CFDRIKPSSRNLVAWNSL 415 CF+R+ + +LVAWN++ Sbjct: 310 CFERVHQTENSLVAWNTM 327 Score = 65.5 bits (158), Expect = 7e-09 Identities = 39/137 (28%), Positives = 71/137 (51%) Frame = +2 Query: 5 LISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDLK 184 +++ Y +D A +F + + + + ++I YT G + ++++ +M LK Sbjct: 59 MVAMYASSDNLDSAVNIFHRVNNPSTLLYNSIIRAYTLYGYSEKTMEIYGQMHRLG--LK 116 Query: 185 PNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKLC 364 + T VL CA+ S++ G+ +H + G S V T+LI MY KCG + DA+ Sbjct: 117 
GDNFTYPFVLKCCANLSSIWLGKCVHSLSLRIGLASDMYVGTSLIDMYVKCGEMSDARSS 176 Query: 365 FDRIKPSSRNLVAWNSL 415 FD K + R++ +WN+L Sbjct: 177 FD--KMTVRDVSSWNAL 191 >ref|XP_004308755.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like [Fragaria vesca subsp. vesca] Length = 631 Score = 184 bits (468), Expect = 8e-45 Identities = 85/138 (61%), Positives = 108/138 (78%) Frame = +2 Query: 2 ALISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDL 181 A+I+GY +EG + VA ELF EM KNIVSWT MISGYTQNG+A AL +FD M DS++ Sbjct: 132 AMIAGYMREGEICVAEELFGEMRCKNIVSWTAMISGYTQNGMAEQALGVFDEMLREDSEV 191 Query: 182 KPNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKL 361 KPNWVT++SVLP CAHS+ALE+GR+IH FAR G + + S+QTAL+ MYAKCGSL +A+ Sbjct: 192 KPNWVTVMSVLPGCAHSAALERGRKIHEFARGIGLEKNASIQTALVAMYAKCGSLSEARQ 251 Query: 362 CFDRIKPSSRNLVAWNSL 415 CF RI S ++LV WN++ Sbjct: 252 CFQRISGSEKSLVVWNTM 269 Score = 61.6 bits (148), Expect = 1e-07 Identities = 36/137 (26%), Positives = 71/137 (51%) Frame = +2 Query: 5 LISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDLK 184 +++ Y G + A +F + + + + ++I +T +G A +++++ +M LK Sbjct: 1 MVAMYASSGDLASAVNVFHSVNYPSTLLYNSIIRAHTLHGYIAKSIEIYGQMHCLG--LK 58 Query: 185 PNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKLC 364 + T VL CA S + G+ +H + G +S V T+LI MY KC + +A++ Sbjct: 59 GDHFTYPFVLKCCAELSDVRIGKCVHGLSLRTGLESDVYVGTSLIDMYVKCCEMSNARMV 118 Query: 365 FDRIKPSSRNLVAWNSL 415 FD + S R + +WN++ Sbjct: 119 FDEL--SVRGVSSWNAM 133 >ref|XP_007042234.1| Mitochondrial editing factor 21 [Theobroma cacao] gi|508706169|gb|EOX98065.1| Mitochondrial editing factor 21 [Theobroma cacao] Length = 613 Score = 184 bits (467), Expect = 1e-44 Identities = 87/138 (63%), Positives = 108/138 (78%) Frame = +2 Query: 2 ALISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDL 181 ALI+GY KEG + VA LF MP +NIVSWT MISGYTQNGLA AL LFD M +S++ Sbjct: 236 ALIAGYMKEGEIGVAEGLFGRMPRRNIVSWTVMISGYTQNGLAKEALSLFDEMMKEESEV 295 Query: 182 
KPNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKL 361 KPNWVTI+SVLPACA+S+ALE+GRRI+ + G +S+PSVQ ALI MYA CGSL DA+ Sbjct: 296 KPNWVTIMSVLPACAYSAALERGRRINEYVNRIGLESNPSVQNALIAMYAACGSLVDARC 355 Query: 362 CFDRIKPSSRNLVAWNSL 415 CF+RI+ + +NL AWN++ Sbjct: 356 CFNRIRENEKNLCAWNAM 373 >gb|EPS73257.1| hypothetical protein M569_01496 [Genlisea aurea] Length = 393 Score = 184 bits (466), Expect = 1e-44 Identities = 89/139 (64%), Positives = 112/139 (80%), Gaps = 1/139 (0%) Frame = +2 Query: 2 ALISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDL 181 A+ISGY +EG+ AR LFDEMP +NIVSWT+MISGYTQ+G + ALQLFD M+ DS + Sbjct: 81 AMISGYMREGSCAHARVLFDEMPVRNIVSWTSMISGYTQSGHSDAALQLFDLMTMEDSTV 140 Query: 182 KPNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKL 361 KPNWVT++S+LPAC +SS LE+GRRIH FAR++G D+H SVQTALI MYAKCGSL DA+ Sbjct: 141 KPNWVTVMSILPACENSSDLERGRRIHSFAREKGLDTHSSVQTALIAMYAKCGSLLDARS 200 Query: 362 CFDRIKPSSRNLVA-WNSL 415 CFD + +SRN + WN++ Sbjct: 201 CFDGMNSTSRNSSSTWNAM 219 Score = 56.6 bits (135), Expect = 3e-06 Identities = 40/142 (28%), Positives = 65/142 (45%), Gaps = 4/142 (2%) Frame = +2 Query: 2 ALISGYTKEGTMDVARELFDEMPTKNIVS---WTTMISGYTQNGLAANALQLFDRMSSRD 172 ALI+ Y K G++ AR FD M + + S W MI+ Y+ +GL A++ F+ M Sbjct: 184 ALIAMYAKCGSLLDARSCFDGMNSTSRNSSSTWNAMITAYSSHGLGKEAVETFEAMIKSG 243 Query: 173 SDLKPNWVTIISVLPACAHSSALEQG-RRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQ 349 S P+ +T +L C HS +EQG + + G + ++ M + G + Sbjct: 244 S-AAPDSITFTGLLSGCGHSGLVEQGLKYFEEMSSTHGVEKTREHYACVVDMLGRAGRVA 302 Query: 350 DAKLCFDRIKPSSRNLVAWNSL 415 +A D + P + W SL Sbjct: 303 EAYEVSDGM-PMAAGASVWGSL 323 Score = 55.5 bits (132), Expect = 8e-06 Identities = 31/69 (44%), Positives = 41/69 (59%) Frame = +2 Query: 209 VLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKLCFDRIKPSS 388 VL + A + GR IH +R +G DS V TALI MYAKCG L DA+ FD I Sbjct: 16 VLKSVADLEFIITGRCIHGLSRKDGLDSDMYVATALIDMYAKCGDLVDARKVFDEI--PV 73 Query: 389 RNLVAWNSL 415 R++ +WN++ Sbjct: 74 RDVSSWNAM 82 >ref|XP_002269662.2| PREDICTED: putative 
pentatricopeptide repeat-containing protein At3g49142-like [Vitis vinifera] Length = 689 Score = 177 bits (449), Expect = 1e-42 Identities = 86/138 (62%), Positives = 104/138 (75%) Frame = +2 Query: 2 ALISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDL 181 ALI+GY KEG + VA +LF+ M +NIVSWT MISGYTQNG A AL LFD M S++ Sbjct: 252 ALIAGYMKEGEIGVAEDLFERMEHRNIVSWTAMISGYTQNGFAEQALGLFDEMLQDGSEM 311 Query: 182 KPNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKL 361 KPNWVTI+SVLPACA S+ALE+GRRIH FA G + SVQTAL GMYAKC SL +A+ Sbjct: 312 KPNWVTIVSVLPACAQSAALERGRRIHDFANGIGLHLNSSVQTALAGMYAKCYSLVEARC 371 Query: 362 CFDRIKPSSRNLVAWNSL 415 CFD I + +NL+AWN++ Sbjct: 372 CFDMIAQNGKNLIAWNTM 389 Score = 57.8 bits (138), Expect = 2e-06 Identities = 41/140 (29%), Positives = 69/140 (49%), Gaps = 3/140 (2%) Frame = +2 Query: 5 LISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANA---LQLFDRMSSRDS 175 +++ Y G +D A +FD + + + + ++I YT++G L+ + RM Sbjct: 118 MVAMYASSGDLDSAVVVFDRIDNPSSLLYNSIIRAYTRHGXXXXXXXXLEAYARMHFLG- 176 Query: 176 DLKPNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDA 355 L + T+ VL +CA S + GR +H G + V +LI MY KCG + DA Sbjct: 177 -LLGDNFTLPFVLKSCADLSRVCMGRCVHGQGLRVGLEGDFYVGASLIDMYVKCGVIGDA 235 Query: 356 KLCFDRIKPSSRNLVAWNSL 415 + FD K R++ +WN+L Sbjct: 236 RKLFD--KMIVRDMASWNAL 253 >emb|CBI23791.3| unnamed protein product [Vitis vinifera] Length = 615 Score = 177 bits (449), Expect = 1e-42 Identities = 86/138 (62%), Positives = 104/138 (75%) Frame = +2 Query: 2 ALISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDL 181 ALI+GY KEG + VA +LF+ M +NIVSWT MISGYTQNG A AL LFD M S++ Sbjct: 178 ALIAGYMKEGEIGVAEDLFERMEHRNIVSWTAMISGYTQNGFAEQALGLFDEMLQDGSEM 237 Query: 182 KPNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKL 361 KPNWVTI+SVLPACA S+ALE+GRRIH FA G + SVQTAL GMYAKC SL +A+ Sbjct: 238 KPNWVTIVSVLPACAQSAALERGRRIHDFANGIGLHLNSSVQTALAGMYAKCYSLVEARC 297 Query: 362 CFDRIKPSSRNLVAWNSL 415 CFD I + +NL+AWN++ Sbjct: 298 CFDMIAQNGKNLIAWNTM 315 >ref|XP_004156102.1| PREDICTED: 
putative pentatricopeptide repeat-containing protein At3g23330-like [Cucumis sativus] Length = 642 Score = 172 bits (437), Expect = 3e-41 Identities = 81/138 (58%), Positives = 104/138 (75%) Frame = +2 Query: 2 ALISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDL 181 AL++GYTK G +D A +F+ MP +NIVSWTTMISGY+Q+GLA AL LFD M DS + Sbjct: 205 ALLAGYTKSGCIDAALAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMVKEDSGV 264 Query: 182 KPNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKL 361 +PNWVTI+SVLPACA S LE+GR+IH A G +S+ SV AL MYAKCGSL DA+ Sbjct: 265 RPNWVTIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALTAMYAKCGSLVDARN 324 Query: 362 CFDRIKPSSRNLVAWNSL 415 CFD++ + +NL+AWN++ Sbjct: 325 CFDKLNRNEKNLIAWNTM 342 >ref|XP_004140278.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like [Cucumis sativus] Length = 679 Score = 172 bits (437), Expect = 3e-41 Identities = 81/138 (58%), Positives = 104/138 (75%) Frame = +2 Query: 2 ALISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDL 181 AL++GYTK G +D A +F+ MP +NIVSWTTMISGY+Q+GLA AL LFD M DS + Sbjct: 242 ALLAGYTKSGCIDAALAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMVKEDSGV 301 Query: 182 KPNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKL 361 +PNWVTI+SVLPACA S LE+GR+IH A G +S+ SV AL MYAKCGSL DA+ Sbjct: 302 RPNWVTIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALTAMYAKCGSLVDARN 361 Query: 362 CFDRIKPSSRNLVAWNSL 415 CFD++ + +NL+AWN++ Sbjct: 362 CFDKLNRNEKNLIAWNTM 379 >gb|EXB37797.1| hypothetical protein L484_002732 [Morus notabilis] Length = 672 Score = 171 bits (433), Expect = 1e-40 Identities = 84/138 (60%), Positives = 102/138 (73%) Frame = +2 Query: 2 ALISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDL 181 ALI+GY K G + +A +LF M +NIVSWT MISGY QNGLA AL LFD+M DS + Sbjct: 235 ALIAGYMKIGEIRLAEDLFGRMVRRNIVSWTAMISGYAQNGLAGQALVLFDKMLEDDSGI 294 Query: 182 KPNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKL 361 KP WVTI+SVLPACAHS+ALE+GR IH+ A G DS SVQ+ALI MYA+CGSL +A Sbjct: 295 
KPTWVTIMSVLPACAHSAALERGREIHKLASRIGLDSDVSVQSALIAMYARCGSLAEACQ 354 Query: 362 CFDRIKPSSRNLVAWNSL 415 CFDRI ++LV WN++ Sbjct: 355 CFDRIHQHKKDLVVWNTM 372 Score = 60.5 bits (145), Expect = 2e-07 Identities = 40/137 (29%), Positives = 68/137 (49%) Frame = +2 Query: 5 LISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDLK 184 +I+ Y G + A +F + + + ++I Y+ + + ++ RM R LK Sbjct: 104 MIAMYASAGDLRSAVAVFRRIKYPSALLCNSIIRAYSWHWFPKKTIGVYFRM--RSLGLK 161 Query: 185 PNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKLC 364 + T VL +CA S + GR H + GF+ V T+LI MY KCG + DA+ Sbjct: 162 ADHFTYPFVLKSCADLSDVRMGRYAHGLSLRTGFEEDFYVGTSLINMYVKCGGIGDARKM 221 Query: 365 FDRIKPSSRNLVAWNSL 415 FD + + R++ +WN+L Sbjct: 222 FDVM--TVRDISSWNAL 236 Score = 55.5 bits (132), Expect = 8e-06 Identities = 40/143 (27%), Positives = 73/143 (51%), Gaps = 5/143 (3%) Frame = +2 Query: 2 ALISGYTKEGTMDVARELFDEMPT--KNIVSWTTMISGYTQNGLAANALQLFDRMSSRDS 175 ALI+ Y + G++ A + FD + K++V W TMIS Y +G ++ F+ M + Sbjct: 338 ALIAMYARCGSLAEACQCFDRIHQHKKDLVVWNTMISAYASHGRGLESVSTFEDMIR--A 395 Query: 176 DLKPNWVTIISVLPACAHSSALEQGRRIHRFARDEG-FDSHPSVQ--TALIGMYAKCGSL 346 ++P+ ++ +L C+HS ++ G I F R + ++ P VQ ++ + + G L Sbjct: 396 RIQPDIISFTGLLSGCSHSGLVDLG--IKYFNRMKTMYNVEPEVQHCACVVDLLGRAGRL 453 Query: 347 QDAKLCFDRIKPSSRNLVAWNSL 415 +AK D++ P AW +L Sbjct: 454 VEAKELIDKM-PMQAGASAWGAL 475 >ref|XP_007051479.1| Pentatricopeptide repeat (PPR) superfamily protein, putative [Theobroma cacao] gi|508703740|gb|EOX95636.1| Pentatricopeptide repeat (PPR) superfamily protein, putative [Theobroma cacao] Length = 515 Score = 137 bits (345), Expect = 2e-30 Identities = 74/138 (53%), Positives = 92/138 (66%) Frame = +2 Query: 2 ALISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDL 181 ALISGY+ G M A ELF MP KN+VSWTTMISGY+QNG + AL +F RM +++ + Sbjct: 155 ALISGYSMCGDMKEALELFKSMPEKNVVSWTTMISGYSQNGQYSKALDMFLRME-KETGV 213 Query: 182 KPNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKL 361 KPN VTI SVLPACA+ ALE G RI +AR+ G V ++ MYA+CG ++ AKL 
Sbjct: 214 KPNRVTIASVLPACANLGALEVGERIETYARENGLFEDLYVSNTVLEMYARCGKIEVAKL 273 Query: 362 CFDRIKPSSRNLVAWNSL 415 FD I RNL WNS+ Sbjct: 274 VFDEI-GKRRNLCVWNSM 290 Score = 58.5 bits (140), Expect = 9e-07 Identities = 39/124 (31%), Positives = 61/124 (49%) Frame = +2 Query: 44 ARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDLKPNWVTIISVLPAC 223 A +LF+ +P K + + +I Y+ + L L+ +M ++ PN + I + PAC Sbjct: 37 AHKLFNLIPQKTVFLYNKLIQAYSSINQSHRCLTLYSQMCL--NNCSPNEHSFIFLFPAC 94 Query: 224 AHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKLCFDRIKPSSRNLVA 403 A +L G+ +H GF TAL+ MYAK L A+ FD ++ RNL Sbjct: 95 ASLPSLLHGQILHTQFLKSGFGLDCYALTALLVMYAKLRMLPLARKVFDEMR--VRNLPT 152 Query: 404 WNSL 415 WN+L Sbjct: 153 WNAL 156 Score = 57.0 bits (136), Expect = 3e-06 Identities = 38/138 (27%), Positives = 71/138 (51%), Gaps = 5/138 (3%) Frame = +2 Query: 17 YTKEGTMDVARELFDEM-PTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDLKPNW 193 Y + G ++VA+ +FDE+ +N+ W +MI G +G A + +D+M + P+ Sbjct: 262 YARCGKIEVAKLVFDEIGKRRNLCVWNSMIMGLALHGKCIEAFEYYDQMLQEGT--APDD 319 Query: 194 VTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQ--TALIGMYAKCGSLQDAKLCF 367 VT + VL AC H + +GR + + + + P ++ ++ + + G+LQ+A + Sbjct: 320 VTFVGVLLACTHGRLVVKGRELFE-SMGKKYHISPKLEHYGCMVDLLGRSGALQEA---Y 375 Query: 368 DRIK--PSSRNLVAWNSL 415 D IK P + V W +L Sbjct: 376 DLIKSMPMKPDAVVWGAL 393 >ref|XP_002871343.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297317180|gb|EFH47602.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. 
lyrata] Length = 511 Score = 137 bits (345), Expect = 2e-30 Identities = 68/138 (49%), Positives = 95/138 (68%) Frame = +2 Query: 2 ALISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDL 181 A+I+GY + G M A ELFD MP KN+ SWTT+ISG++QNG + AL +F M +D + Sbjct: 153 AMITGYQRRGDMKAAMELFDSMPNKNVTSWTTVISGFSQNGNYSEALTMFLCME-KDKSV 211 Query: 182 KPNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKL 361 KPN +T++SVLPACA+ LE GRR+ +AR+ GF + V+ A + MY+KCG + AK Sbjct: 212 KPNHITLVSVLPACANLGELEIGRRLEGYARENGFFDNIYVRNATLEMYSKCGMIDVAKR 271 Query: 362 CFDRIKPSSRNLVAWNSL 415 FD I + RNL++WNS+ Sbjct: 272 LFDEI-GNQRNLISWNSM 288 >ref|XP_002523296.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223537384|gb|EEF39012.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 353 Score = 135 bits (339), Expect = 8e-30 Identities = 69/138 (50%), Positives = 96/138 (69%) Frame = +2 Query: 2 ALISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDL 181 ALI+GY++ G M+ A ++F MP +N+VSWT MISGY+QNG A AL+LF +M +++ L Sbjct: 153 ALIAGYSRCGDMEGALKIFKLMPDRNVVSWTAMISGYSQNGRYAKALELFLKME-KENGL 211 Query: 182 KPNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKL 361 +PN VTI S+LPACA+ ALE G RI +AR+ G + V AL+ MYA+CG + A+ Sbjct: 212 RPNEVTIASILPACANLGALEVGDRIETYARENGLLRNLYVSNALLEMYARCGKIDMARK 271 Query: 362 CFDRIKPSSRNLVAWNSL 415 FD+I RNL +WNS+ Sbjct: 272 VFDKIIGKRRNLCSWNSM 289 >ref|NP_196468.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75171895|sp|Q9FNN7.1|PP371_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At5g08510 gi|9759345|dbj|BAB10000.1| unnamed protein product [Arabidopsis thaliana] gi|50897238|gb|AAT85758.1| At5g08510 [Arabidopsis thaliana] gi|332003930|gb|AED91313.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 511 Score = 133 bits (335), Expect = 2e-29 Identities = 67/138 (48%), Positives = 94/138 (68%) Frame = +2 Query: 2 
ALISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDL 181 A+I+GY + G M A ELFD MP KN+ SWTT+ISG++QNG + AL++F M +D + Sbjct: 153 AMITGYQRRGDMKAAMELFDSMPRKNVTSWTTVISGFSQNGNYSEALKMFLCME-KDKSV 211 Query: 182 KPNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKL 361 KPN +T++SVLPACA+ LE GRR+ +AR+ GF + V A I MY+KCG + AK Sbjct: 212 KPNHITVVSVLPACANLGELEIGRRLEGYARENGFFDNIYVCNATIEMYSKCGMIDVAKR 271 Query: 362 CFDRIKPSSRNLVAWNSL 415 F+ + + RNL +WNS+ Sbjct: 272 LFEEL-GNQRNLCSWNSM 288 Score = 63.9 bits (154), Expect = 2e-08 Identities = 45/143 (31%), Positives = 74/143 (51%), Gaps = 5/143 (3%) Frame = +2 Query: 2 ALISGYTKEGTMDVARELFDEMPT-KNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSD 178 A I Y+K G +DVA+ LF+E+ +N+ SW +MI +G AL LF +M Sbjct: 255 ATIEMYSKCGMIDVAKRLFEELGNQRNLCSWNSMIGSLATHGKHDEALTLFAQMLREGE- 313 Query: 179 LKPNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQ--TALIGMYAKCGSLQD 352 KP+ VT + +L AC H + +G+ + + + +E P ++ +I + + G LQ+ Sbjct: 314 -KPDAVTFVGLLLACVHGGMVVKGQELFK-SMEEVHKISPKLEHYGCMIDLLGRVGKLQE 371 Query: 353 AKLCFDRIK--PSSRNLVAWNSL 415 A +D IK P + V W +L Sbjct: 372 A---YDLIKTMPMKPDAVVWGTL 391 >dbj|BAE98759.1| hypothetical protein [Arabidopsis thaliana] Length = 504 Score = 133 bits (335), Expect = 2e-29 Identities = 67/138 (48%), Positives = 94/138 (68%) Frame = +2 Query: 2 ALISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDL 181 A+I+GY + G M A ELFD MP KN+ SWTT+ISG++QNG + AL++F M +D + Sbjct: 146 AMITGYQRRGDMKAAMELFDSMPRKNVTSWTTVISGFSQNGNYSEALKMFLCME-KDKSV 204 Query: 182 KPNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKL 361 KPN +T++SVLPACA+ LE GRR+ +AR+ GF + V A I MY+KCG + AK Sbjct: 205 KPNHITVVSVLPACANLGELEIGRRLEGYARENGFFDNIYVCNATIEMYSKCGMIDVAKR 264 Query: 362 CFDRIKPSSRNLVAWNSL 415 F+ + + RNL +WNS+ Sbjct: 265 LFEEL-GNQRNLCSWNSM 281 Score = 63.9 bits (154), Expect = 2e-08 Identities = 45/143 (31%), Positives = 74/143 (51%), Gaps = 5/143 (3%) Frame = +2 Query: 2 ALISGYTKEGTMDVARELFDEMPT-KNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSD 178 A I Y+K G +DVA+ LF+E+ +N+ SW 
+MI +G AL LF +M Sbjct: 248 ATIEMYSKCGMIDVAKRLFEELGNQRNLCSWNSMIGSLATHGKHDEALTLFAQMLREGE- 306 Query: 179 LKPNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQ--TALIGMYAKCGSLQD 352 KP+ VT + +L AC H + +G+ + + + +E P ++ +I + + G LQ+ Sbjct: 307 -KPDAVTFVGLLLACVHGGMVVKGQELFK-SMEEVHKISPKLEHYGCMIDLLGRVGKLQE 364 Query: 353 AKLCFDRIK--PSSRNLVAWNSL 415 A +D IK P + V W +L Sbjct: 365 A---YDLIKTMPMKPDAVVWGTL 384 >ref|XP_006289662.1| hypothetical protein CARUB_v10003220mg [Capsella rubella] gi|482558368|gb|EOA22560.1| hypothetical protein CARUB_v10003220mg [Capsella rubella] Length = 511 Score = 132 bits (331), Expect = 7e-29 Identities = 67/137 (48%), Positives = 93/137 (67%) Frame = +2 Query: 5 LISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDLK 184 +I+GY ++G M A ELFD MP KN++SWTT+ISG++QNG + AL +F M +D +K Sbjct: 154 MITGYQRQGDMKAAMELFDSMPCKNVISWTTVISGFSQNGNYSEALTMFLCME-KDKSVK 212 Query: 185 PNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKLC 364 PN VT++SVLPACA+ LE GRR+ +AR+ GF + V A + MY+KCG + AK Sbjct: 213 PNHVTLVSVLPACANLGELEIGRRLESYARENGFFDNIYVCNATLEMYSKCGMIDLAKQL 272 Query: 365 FDRIKPSSRNLVAWNSL 415 F I + RNL +WNS+ Sbjct: 273 FHEI-GNQRNLCSWNSM 288 Score = 59.3 bits (142), Expect = 5e-07 Identities = 41/143 (28%), Positives = 74/143 (51%), Gaps = 5/143 (3%) Frame = +2 Query: 2 ALISGYTKEGTMDVARELFDEMPT-KNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSD 178 A + Y+K G +D+A++LF E+ +N+ SW +MI +G AL+L+ +M Sbjct: 255 ATLEMYSKCGMIDLAKQLFHEIGNQRNLCSWNSMIGSLATHGKHHEALELYAQMLREGE- 313 Query: 179 LKPNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQ--TALIGMYAKCGSLQD 352 KP+ VT + +L AC H + +G + + + +E P ++ +I + + G LQ+ Sbjct: 314 -KPDAVTFVGLLLACVHGGMVVKGHELFK-SMEEVHKISPKLEHYGCMIDLLGRVGKLQE 371 Query: 353 AKLCFDRIK--PSSRNLVAWNSL 415 A +D I+ P + V W +L Sbjct: 372 A---YDLIETMPMKPDAVVWGTL 391 >ref|XP_006399350.1| hypothetical protein EUTSA_v10013320mg [Eutrema salsugineum] gi|557100440|gb|ESQ40803.1| hypothetical protein EUTSA_v10013320mg [Eutrema salsugineum] Length = 502 Score = 130 bits (328), Expect = 
1e-28 Identities = 66/138 (47%), Positives = 93/138 (67%) Frame = +2 Query: 2 ALISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDL 181 A+I+ Y ++G M A ELFD MP KN++SWTT+ISG++QNG + AL +F M S + + Sbjct: 143 AMITVYNRQGDMKAAMELFDSMPCKNVISWTTVISGFSQNGNYSKALSMFLCMESNKT-V 201 Query: 182 KPNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKL 361 KPN +T+ SVLPAC + AL+ GRR+ +AR+ GF + V A + MY+KCG + AK Sbjct: 202 KPNHITVASVLPACGNLGALDIGRRLEGYARENGFFDNIYVSNATLEMYSKCGMIDVAKR 261 Query: 362 CFDRIKPSSRNLVAWNSL 415 FD I + RNL +WNS+ Sbjct: 262 IFDEI-GNQRNLCSWNSM 278 Score = 64.3 bits (155), Expect = 2e-08 Identities = 42/143 (29%), Positives = 77/143 (53%), Gaps = 5/143 (3%) Frame = +2 Query: 2 ALISGYTKEGTMDVARELFDEMPT-KNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSD 178 A + Y+K G +DVA+ +FDE+ +N+ SW +M+SG +G AL+L+ +M Sbjct: 245 ATLEMYSKCGMIDVAKRIFDEIGNQRNLCSWNSMVSGLATHGKHDEALELYAQMLREGE- 303 Query: 179 LKPNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQ--TALIGMYAKCGSLQD 352 KP+ VT + +L AC H + +G+ + + + ++ P ++ +I + + G LQ+ Sbjct: 304 -KPDAVTFVGLLLACVHGGMVVKGKELFK-SMEQVHKISPKLEHYGCMIDLLGRVGKLQE 361 Query: 353 AKLCFDRIK--PSSRNLVAWNSL 415 A ++ IK P + V W +L Sbjct: 362 A---YNLIKTMPMKPDAVVWGTL 381 >gb|EXB83859.1| hypothetical protein L484_023466 [Morus notabilis] Length = 513 Score = 130 bits (327), Expect = 2e-28 Identities = 67/138 (48%), Positives = 92/138 (66%) Frame = +2 Query: 2 ALISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDL 181 +++SGY + G M+ A ELF MP +N+VSWT MISGY++NG A AL +F +M ++ D+ Sbjct: 156 SMLSGYARSGDMEGASELFRLMPQRNVVSWTAMISGYSKNGQYAKALAMFLQME-KERDV 214 Query: 182 KPNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKL 361 +PN +TI SVLPACA+ ALE G R+ +AR GF V A++ MYAKCG + A+ Sbjct: 215 RPNAITIASVLPACANLGALEVGERVEEYARKVGFLKDLYVSNAVLEMYAKCGRIDTARR 274 Query: 362 CFDRIKPSSRNLVAWNSL 415 FD I RNL +WNS+ Sbjct: 275 VFDEI-GRRRNLCSWNSM 291 >gb|EYU32269.1| hypothetical protein MIMGU_mgv1a026672mg [Mimulus guttatus] Length = 516 Score = 130 bits (326), Expect = 2e-28 Identities 
= 63/138 (45%), Positives = 92/138 (66%) Frame = +2 Query: 2 ALISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDL 181 +LI+GY + G M A LF MP++N++SWT +ISG++QNG AL+++ M RD + Sbjct: 153 SLIAGYARNGDMSEALRLFSNMPSRNVISWTAIISGFSQNGKYKEALEMYLAME-RDGKV 211 Query: 182 KPNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKL 361 KPN VT+ SVLPACA+ ALE G+RI +AR G+ + V A++ +YA+CG ++ A Sbjct: 212 KPNHVTLASVLPACANLGALEVGQRIEAYARANGYFKNAFVCNAVLELYARCGVIEKAMQ 271 Query: 362 CFDRIKPSSRNLVAWNSL 415 FD I +RNL +WN+L Sbjct: 272 VFDEIGSGNRNLCSWNTL 289 Score = 60.8 bits (146), Expect = 2e-07 Identities = 39/143 (27%), Positives = 77/143 (53%), Gaps = 5/143 (3%) Frame = +2 Query: 2 ALISGYTKEGTMDVARELFDEMPT--KNIVSWTTMISGYTQNGLAANALQLFDRMSSRDS 175 A++ Y + G ++ A ++FDE+ + +N+ SW T+I G +G AL++F++M ++ Sbjct: 255 AVLELYARCGVIEKAMQVFDEIGSGNRNLCSWNTLIMGLAVHGRCDGALEIFNQMLTKG- 313 Query: 176 DLKPNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQ--TALIGMYAKCGSLQ 349 + P+ VT + + AC H + +GR I + ++ F P ++ ++ + + G LQ Sbjct: 314 -VTPDDVTFVGAILACTHGGMVNKGREIFD-SMEKRFSITPKIEHYGCMVDLLGRAGLLQ 371 Query: 350 DA-KLCFDRIKPSSRNLVAWNSL 415 +A KL + P + V W +L Sbjct: 372 EAYKLI--KAMPMKPDSVVWGTL 392 Score = 55.5 bits (132), Expect = 8e-06 Identities = 37/137 (27%), Positives = 63/137 (45%) Frame = +2 Query: 5 LISGYTKEGTMDVARELFDEMPTKNIVSWTTMISGYTQNGLAANALQLFDRMSSRDSDLK 184 LI+ + ++ A +L D+ P + ++ +I Y+ +G L+ ++ Sbjct: 22 LITKLLEIPNINYAHKLLDKTPDPTLFLYSKLIKAYSSHGPHFQCFSLYSQILHLSFSPN 81 Query: 185 PNWVTIISVLPACAHSSALEQGRRIHRFARDEGFDSHPSVQTALIGMYAKCGSLQDAKLC 364 PN T + ACA S QG+ +H G D TAL+ MYAK G L+ ++ Sbjct: 82 PNCFTFL--FSACAKLSNPSQGQMLHAHFIKFGLDYDVYALTALVDMYAKMGLLRFSRKI 139 Query: 365 FDRIKPSSRNLVAWNSL 415 FD + + ++ WNSL Sbjct: 140 FDEM--NDKDAPTWNSL 154
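
The alignment headers in this report (Score, Expect, Identities, Positives) can be pulled out programmatically. The following is a minimal Python sketch, not part of the BLAST output itself. The names `HSP_RE`, `parse_hsps` and `bit_score` are hypothetical. The statistical parameters are an assumption: the report does not list them, so the sketch uses the commonly quoted gapped BLOSUM62 values (gap open 11, gap extend 1), lambda = 0.267 and K = 0.041, with which the bit scores above can be recovered from the raw scores in parentheses. The percentages are computed with integer division because the report appears to truncate them (125/135 is shown as 92%).

```python
import math
import re

# One HSP header as it appears in the flattened report, for example:
# "Score = 228 bits (580), Expect = 9e-58 Identities = 103/135 (76%), Positives = 125/135 (92%)"
HSP_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s*"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s*\(\d+%\)"
)

# ASSUMPTION: not stated in the report. Gapped BLOSUM62 (11/1) parameters.
LAMBDA = 0.267
K = 0.041


def bit_score(raw):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (LAMBDA * raw - math.log(K)) / math.log(2)


def parse_hsps(text):
    """Return one dict per HSP header found in the report text."""
    hsps = []
    for m in HSP_RE.finditer(text):
        ident, length, pos = int(m["ident"]), int(m["length"]), int(m["pos"])
        hsps.append({
            "bits": float(m["bits"]),
            "raw": int(m["raw"]),
            "evalue": float(m["evalue"]),
            # Integer division mirrors the truncated percentages in the report.
            "identity_pct": 100 * ident // length,
            "positive_pct": 100 * pos // length,
        })
    return hsps
```

Under these assumed parameters, `bit_score(580)` gives roughly 228.0 and `bit_score(143)` roughly 59.7, matching the two alignments reported for gb|EYU26534.1. Scores of 100 bits or more appear to be printed truncated to an integer (raw score 499 gives about 196.8 and is shown as 196).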