BLASTX nr result
ID: Mentha27_contig00011147
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha27_contig00011147 (704 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EYU37498.1| hypothetical protein MIMGU_mgv1a021373mg, partial... 250 3e-64 ref|XP_007015351.1| Pentatricopeptide repeat-containing protein,... 192 9e-47 ref|XP_003516509.1| PREDICTED: pentatricopeptide repeat-containi... 190 4e-46 ref|XP_004246209.1| PREDICTED: pentatricopeptide repeat-containi... 189 8e-46 ref|XP_007158057.1| hypothetical protein PHAVU_002G120500g [Phas... 182 9e-44 ref|XP_004512307.1| PREDICTED: pentatricopeptide repeat-containi... 181 2e-43 gb|AFK33630.1| unknown [Lotus japonicus] 181 2e-43 ref|XP_003612457.1| Pentatricopeptide repeat-containing protein ... 181 3e-43 ref|XP_006437483.1| hypothetical protein CICLE_v10033975mg [Citr... 180 4e-43 ref|XP_003631528.1| PREDICTED: pentatricopeptide repeat-containi... 171 3e-40 gb|EXB70628.1| hypothetical protein L484_023813 [Morus notabilis] 152 8e-35 ref|XP_002519945.1| pentatricopeptide repeat-containing protein,... 152 1e-34 ref|NP_174459.1| pentatricopeptide repeat-containing protein [Ar... 149 7e-34 ref|XP_006415303.1| hypothetical protein EUTSA_v10009456mg [Eutr... 148 1e-33 ref|XP_002893686.1| pentatricopeptide repeat-containing protein ... 144 4e-32 ref|XP_006303657.1| hypothetical protein CARUB_v10011695mg [Caps... 142 1e-31 emb|CAN69066.1| hypothetical protein VITISV_016070 [Vitis vinifera] 131 2e-28 ref|XP_004292904.1| PREDICTED: pentatricopeptide repeat-containi... 114 2e-23 ref|XP_006841553.1| hypothetical protein AMTR_s00003p00175270 [A... 110 6e-22 ref|XP_002517553.1| pentatricopeptide repeat-containing protein,... 108 1e-21 >gb|EYU37498.1| hypothetical protein MIMGU_mgv1a021373mg, partial [Mimulus guttatus] Length = 345 Score = 250 bits (639), Expect = 3e-64 Identities = 120/189 (63%), Positives = 148/189 (78%) Frame = +2 Query: 137 LKLPIPIDIYTSLIKECTESTDPLLAIELHDHIRRSGIRLSLPAMNQILLMHVAVECFKH 316 LKLPIP DIYTSLIKECTE DPL +IELH+H+RRSG R +LP +N++LLM+V+ C Sbjct: 1 LKLPIPPDIYTSLIKECTELGDPLKSIELHEHMRRSGFRFTLPLLNRLLLMYVSSGCLDR 60 Query: 317 ARQLFDEMSVRDFNSWAILIAGSVENGENDEAIRLFVRMLLDKGFENASGDHMELSIPGI 496 ARQLFD+M +RDFNSWA+LIAG VENGE+DEAI LFV ML + N D M S+ GI Sbjct: 61 ARQLFDQMFLRDFNSWAVLIAGFVENGEHDEAINLFVEMLNRQDMGNVGLDRMGFSVSGI 120 Query: 497 VMCVLRACVCTEDFEFGRQVHGWLWKMGFSRNSGLSSFLISFYGKMKHYEGAQSVFEQVN 676 ++CVL+AC+ T DFE G QVHGWLWKMGFS ++ LS FLI+FYG++ +EGAQ+VF+ V Sbjct: 121 LVCVLKACLFTSDFELGTQVHGWLWKMGFSESASLSCFLINFYGRLDCFEGAQTVFDHVR 180 Query: 677 CRDTAVWTS 703 +TAVWTS Sbjct: 181 NPNTAVWTS 189 Score = 59.7 bits (143), Expect = 9e-07 Identities = 45/169 (26%), Positives = 72/169 (42%) Frame = +2 Query: 161 IYTSLIKECTESTDPLLAIELHDHIRRSGIRLSLPAMNQILLMHVAVECFKHARQLFDEM 340 I ++K C ++D L ++H + + G S ++ + ++CF+ A+ +FD + Sbjct: 120 ILVCVLKACLFTSDFELGTQVHGWLWKMGFSESASLSCFLINFYGRLDCFEGAQTVFDHV 179 Query: 341 SVRDFNSWAILIAGSVENGENDEAIRLFVRMLLDKGFENASGDHMELSIPGIVMCVLRAC 520 + W I NG +EA+ +F M G E + S VL+AC Sbjct: 180 RNPNTAVWTSRIVSFCSNGNFEEAVSVFKEM----GREGVRENSYTFS------TVLKAC 229 Query: 521 VCTEDFEFGRQVHGWLWKMGFSRNSGLSSFLISFYGKMKHYEGAQSVFE 667 D G+QVH K G +S + L+ FYGK A VFE Sbjct: 230 RKMGDIRCGQQVHANSIKSGLESDSYVQCALVDFYGKCGFLNDATRVFE 278 >ref|XP_007015351.1| Pentatricopeptide repeat-containing protein, putative [Theobroma cacao] gi|508785714|gb|EOY32970.1| Pentatricopeptide repeat-containing protein, putative [Theobroma cacao] Length = 413 Score = 192 bits (488), Expect = 9e-47 Identities = 110/244 (45%), Positives = 149/244 (61%), Gaps = 11/244 (4%) Frame = +2 Query: 5 ANPKTQIHFPLHRPEYKQ-------PVPAAVKK---RNPKPSSTTSDILRLMDALKLPIP 154 A P +Q+H L KQ P P + K NP S TTSDILRLMD+L LPIP Sbjct: 33 APPISQLHSQLPLRITKQSSKTPPPPTPISTSKPISSNPCSSHTTSDILRLMDSLSLPIP 92 Query: 155 IDIYTSLIKECTESTDPLLAIELHDHIRRSGIRLSLPAMNQILLMHVAVECFKHARQLFD 334 DIY SL+KECT + A+ELH HIR S I+ SLP +N++LLMHV+ AR LFD Sbjct: 93 PDIYASLVKECTVTRHSRRALELHSHIRNSRIKPSLPLLNRLLLMHVSCGHLDIARHLFD 152 Query: 335 EMSVRDFNSWAILIAGSVENGENDEAIRLFVRMLLDKGFENASGDHMELSIPG-IVMCVL 511 +M +RDFNSWAI+I + G++++AI FVRM ++ P I++C+L Sbjct: 153 QMLLRDFNSWAIMIVACLHAGDSEQAIAYFVRM---------ERHNLLFKCPSWIIVCLL 203 Query: 512 RACVCTEDFEFGRQVHGWLWKMGFSRNSGLSSFLISFYGKMKHYEGAQSVFEQVNCRDTA 691 ++CV T++ G+QVHG L K+G S +S LS LI+FYGK + + A VF Q++ R+T Sbjct: 204 KSCVVTKNMGLGKQVHGQLLKLGASNDSSLSGSLINFYGKFRCLDDADFVFNQLSRRNTV 263 Query: 692 VWTS 703 WT+ Sbjct: 264 TWTA 267 >ref|XP_003516509.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790-like [Glycine max] Length = 423 Score = 190 bits (482), Expect = 4e-46 Identities = 110/246 (44%), Positives = 144/246 (58%), Gaps = 14/246 (5%) Frame = +2 Query: 8 NPKTQIHF------PLHR-PEYKQPVPAAV-------KKRNPKPSSTTSDILRLMDALKL 145 +P Q+ F P+H P + P P KK+ + +TTSDIL LM+AL Sbjct: 34 SPNHQLEFRLPLRHPIHNFPNHTSPQPLTQTTTFTKKKKKKKRKGATTSDILHLMEALPF 93 Query: 146 PIPIDIYTSLIKECTESTDPLLAIELHDHIRRSGIRLSLPAMNQILLMHVAVECFKHARQ 325 P+PIDIYTSLIKECT S DP AIEL HI +SGI+ LP +N+IL+M V+ ++AR Sbjct: 94 PVPIDIYTSLIKECTVSGDPETAIELATHISKSGIKPPLPFLNRILVMFVSCGLLENARH 153 Query: 326 LFDEMSVRDFNSWAILIAGSVENGENDEAIRLFVRMLLDKGFENASGDHMELSIPGIVMC 505 +FD+M VRDFN+WA L +N + +EA +FV ML G ME P I C Sbjct: 154 MFDKMRVRDFNTWATLFVAYYDNTDYEEATNVFVNMLTQLGM-------MEFP-PWIWAC 205 Query: 506 VLRACVCTEDFEFGRQVHGWLWKMGFSRNSGLSSFLISFYGKMKHYEGAQSVFEQVNCRD 685 +LRAC CT + G QVHGWL K+G + LSS LI+FYG+ E A VF+ V+ + Sbjct: 206 LLRACACTVNVPLGMQVHGWLLKLGTCDHVLLSSSLINFYGRFTCLEDASVVFDGVSRHN 265 Query: 686 TAVWTS 703 T WT+ Sbjct: 266 TLTWTA 271 >ref|XP_004246209.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790-like [Solanum lycopersicum] Length = 465 Score = 189 bits (480), Expect = 8e-46 Identities = 97/192 (50%), Positives = 133/192 (69%) Frame = +2 Query: 128 MDALKLPIPIDIYTSLIKECTESTDPLLAIELHDHIRRSGIRLSLPAMNQILLMHVAVEC 307 MD+L IP+D+Y SLIKECTES DPL A+E+++H+ +S + SLP +N++LLM V C Sbjct: 1 MDSLGFNIPVDVYVSLIKECTESRDPLNAVEVYEHVCKSDVIPSLPLLNRLLLMLVLCGC 60 Query: 308 FKHARQLFDEMSVRDFNSWAILIAGSVENGENDEAIRLFVRMLLDKGFENASGDHMELSI 487 F+ ARQLFD+M VR+ SWA +IAG VENGE A+RLF+ M + G GD ++ Sbjct: 61 FEQARQLFDKMRVRNSQSWAAMIAGCVENGECVGALRLFMEMQSEAGNLCKCGDLID--- 117 Query: 488 PGIVMCVLRACVCTEDFEFGRQVHGWLWKMGFSRNSGLSSFLISFYGKMKHYEGAQSVFE 667 GI++CVL+ACV + EFGRQ+HGWL K+G + L+SFLI FYG+ + E A +VF+ Sbjct: 118 DGILVCVLKACVELMNLEFGRQIHGWLLKLGNCESMVLNSFLIKFYGEFGYLESADNVFD 177 Query: 668 QVNCRDTAVWTS 703 V +T VWT+ Sbjct: 178 HVPHCNTVVWTA 189 >ref|XP_007158057.1| hypothetical protein PHAVU_002G120500g [Phaseolus vulgaris] gi|561031472|gb|ESW30051.1| hypothetical protein PHAVU_002G120500g [Phaseolus vulgaris] Length = 420 Score = 182 bits (462), Expect = 9e-44 Identities = 106/232 (45%), Positives = 137/232 (59%), Gaps = 5/232 (2%) Frame = +2 Query: 23 IHFPLH---RPEYKQPVPAAVK--KRNPKPSSTTSDILRLMDALKLPIPIDIYTSLIKEC 187 + +P+H RP Q K K+ + +TT DIL LMDAL PI IDIYTSLIKEC Sbjct: 45 LRYPIHTFPRPLIPQTTTFTKKEIKKKKRKEATTLDILHLMDALPFPITIDIYTSLIKEC 104 Query: 188 TESTDPLLAIELHDHIRRSGIRLSLPAMNQILLMHVAVECFKHARQLFDEMSVRDFNSWA 367 T S DP AIEL+ HI +S I+ LP +N+IL+M V+ ++AR +F++M VRDFNSWA Sbjct: 105 TVSGDPETAIELYTHISKSDIKPPLPFLNRILIMFVSCGMLENARHMFEKMRVRDFNSWA 164 Query: 368 ILIAGSVENGENDEAIRLFVRMLLDKGFENASGDHMELSIPGIVMCVLRACVCTEDFEFG 547 L +N E +EA +FV ML G M P I C+LRAC CT + G Sbjct: 165 TLFVAYYDNAEYEEATAVFVNMLGQLG--------MLQFPPWIWACLLRACACTLNVPLG 216 Query: 548 RQVHGWLWKMGFSRNSGLSSFLISFYGKMKHYEGAQSVFEQVNCRDTAVWTS 703 QVHGWL K+G + LSS LI+FYG+ E A +VF V+ +T WT+ Sbjct: 217 LQVHGWLLKLGACDHVLLSSSLINFYGRFTCLEDASAVFNGVSRHNTLTWTA 268 >ref|XP_004512307.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790-like [Cicer arietinum] Length = 418 Score = 181 bits (459), Expect = 2e-43 Identities = 112/242 (46%), Positives = 140/242 (57%), Gaps = 10/242 (4%) Frame = +2 Query: 8 NPKTQIHFPLHRPEYKQPVPAAV---------KKRNPKPSSTTSDILRLMDALKLPIPID 160 NPK I+FP H P QP+ K N + S+TTS IL LMDAL PIPID Sbjct: 35 NPKL-INFP-HHPS-SQPLTVTPPRNKNNTKNKNNNKRKSATTSHILPLMDALHFPIPID 91 Query: 161 IYTSLIKECTESTDPLLAIELHDHIRRSGIRLSLPAMNQILLMHVAVECFKHARQLFDEM 340 IYTSL+KECT S DP A ELH HI RSGI L +N+IL+M V+ + AR +FDEM Sbjct: 92 IYTSLVKECTLSGDPETATELHSHITRSGIGPPLTLLNRILIMFVSCGLLQSARHVFDEM 151 Query: 341 SVRDFNSWAILIAGSVENGENDEAIRLFVRMLLDKGFENASGDHMELS-IPGIVMCVLRA 517 VR+F+SWAIL EN + + AI +F+RML G ME +P C+L A Sbjct: 152 PVRNFHSWAILFVAYYENSDYENAIDVFMRMLRQLGV-------MEFPFLPWFWSCLLTA 204 Query: 518 CVCTEDFEFGRQVHGWLWKMGFSRNSGLSSFLISFYGKMKHYEGAQSVFEQVNCRDTAVW 697 C CT + G QVHG L K+G + +SS LI FYG+ K E A VF +V+ +T W Sbjct: 205 CACTVNVPLGMQVHGSLTKLGACDHVLISSSLIRFYGRFKCLEDANVVFNRVSRHNTLTW 264 Query: 698 TS 703 T+ Sbjct: 265 TA 266 >gb|AFK33630.1| unknown [Lotus japonicus] Length = 356 Score = 181 bits (459), Expect = 2e-43 Identities = 99/209 (47%), Positives = 127/209 (60%) Frame = +2 Query: 77 KKRNPKPSSTTSDILRLMDALKLPIPIDIYTSLIKECTESTDPLLAIELHDHIRRSGIRL 256 KK+ + +TTS IL LMD L PIPIDIYTSLIKECT S DP AIELH HI SGI+ Sbjct: 4 KKKRKRKGATTSHILHLMDVLPFPIPIDIYTSLIKECTLSPDPQTAIELHTHIAHSGIKP 63 Query: 257 SLPAMNQILLMHVAVECFKHARQLFDEMSVRDFNSWAILIAGSVENGENDEAIRLFVRML 436 L +N+IL+M V+ +A QLFD M V+DFNSWA L +N + +EAI +F+ ML Sbjct: 64 PLSFINRILVMFVSCGLLDYACQLFDAMPVKDFNSWATLFIAYYDNADYEEAIDVFLAML 123 Query: 437 LDKGFENASGDHMELSIPGIVMCVLRACVCTEDFEFGRQVHGWLWKMGFSRNSGLSSFLI 616 G M P I C L+AC C E+ G QVHGWL K+G + LSS LI Sbjct: 124 HQLG--------MSEFPPWICACFLKACACIENIPLGMQVHGWLLKLGTCDHVLLSSSLI 175 Query: 617 SFYGKMKHYEGAQSVFEQVNCRDTAVWTS 703 FYG+ + A +VF +++ +T+ WT+ Sbjct: 176 RFYGRFTCVKDANAVFNKLSRHNTSTWTA 204 >ref|XP_003612457.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355513792|gb|AES95415.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 418 Score = 181 bits (458), Expect = 3e-43 Identities = 109/233 (46%), Positives = 137/233 (58%), Gaps = 1/233 (0%) Frame = +2 Query: 8 NPKTQIHFPLHRPEYKQPVPAAVKKRNPKPSSTTSDILRLMDALKLPIPIDIYTSLIKEC 187 NPK + + L P QP+ K + + TTS IL LMDAL PI IDIYTSL+KEC Sbjct: 43 NPKPK-NLSLIHPS-SQPITPPKKSKRRRKCDTTSHILPLMDALHFPITIDIYTSLVKEC 100 Query: 188 TESTDPLLAIELHDHIRRSGIRLSLPAMNQILLMHVAVECFKHARQLFDEMSVRDFNSWA 367 T STDP AIELH I GI L L +N+IL+M V+ ++AR++FD MSVRDF+SWA Sbjct: 101 TLSTDPETAIELHTQIITRGIELPLTLLNRILIMFVSCGLLENARRVFDVMSVRDFHSWA 160 Query: 368 ILIAGSVENGENDEAIRLFVRMLLDKGFENASGDHMELSI-PGIVMCVLRACVCTEDFEF 544 L ENGE + AI +FV ML D M S P I C+L+AC CT + Sbjct: 161 TLFVSYYENGEYENAIDVFVSMLCQL-------DVMGFSFPPWIWSCLLKACACTMNVPL 213 Query: 545 GRQVHGWLWKMGFSRNSGLSSFLISFYGKMKHYEGAQSVFEQVNCRDTAVWTS 703 G QVHG L K+G + +SS LI FYG+ K E A VF +V+ +T WT+ Sbjct: 214 GMQVHGCLLKLGACDHVLISSSLIRFYGRFKCLEDANMVFNRVSRHNTLTWTA 266 >ref|XP_006437483.1| hypothetical protein CICLE_v10033975mg [Citrus clementina] gi|557539679|gb|ESR50723.1| hypothetical protein CICLE_v10033975mg [Citrus clementina] Length = 425 Score = 180 bits (457), Expect = 4e-43 Identities = 107/230 (46%), Positives = 142/230 (61%), Gaps = 6/230 (2%) Frame = +2 Query: 32 PLHRPEYKQPVPAAVKKRNPK---PSSTTS-DILRLMDALKLPIPIDIYTSLIKECTEST 199 PL P+ +P+ + R P++T+S +IL LMD L LPI D+YT LIKECT Sbjct: 40 PLRSPKPTKPLKTSSNWRETTQSIPANTSSANILHLMDNLCLPITTDMYTCLIKECTFQK 99 Query: 200 DPLLAIELHDHIR-RSGIRLSLPAMNQILLMHVAVECFKHARQLFDEMSVRDFNSWAILI 376 D A EL +HIR R I+ +L +N++LLMHV+ ARQLFDEM +RDFNSWA++I Sbjct: 100 DSAGAFELLNHIRKRVNIKPTLLFLNRLLLMHVSCGQLDTARQLFDEMPLRDFNSWAVMI 159 Query: 377 AGSVENGENDEAIRLFVRMLLDKGFENASGDHMELSIPG-IVMCVLRACVCTEDFEFGRQ 553 G V+ + E I LF M+ K HM L P I++CVL+ACVCT + E G+Q Sbjct: 160 VGYVDVADYQECITLFAEMMKRKK------GHMLLVFPAWIIVCVLKACVCTMNMELGKQ 213 Query: 554 VHGWLWKMGFSRNSGLSSFLISFYGKMKHYEGAQSVFEQVNCRDTAVWTS 703 VHG L+K+G SRN L+ LI+FYGK + E A VF Q+ +T VWT+ Sbjct: 214 VHGLLFKLGSSRNISLTGSLINFYGKFRCLEDADFVFSQLKRHNTVVWTA 263 >ref|XP_003631528.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790-like [Vitis vinifera] Length = 414 Score = 171 bits (432), Expect = 3e-40 Identities = 102/214 (47%), Positives = 138/214 (64%), Gaps = 5/214 (2%) Frame = +2 Query: 77 KKRNPKPSSTTS---DILRLMDALKLPIPIDIYTSLIKECTESTDPLLAIELHDHIRRSG 247 KK N + TTS DILRLMD L LPIP DIY SLIKE + + D A +L HI RSG Sbjct: 48 KKSNSNATPTTSTPTDILRLMDGLGLPIPPDIYASLIKESSTTGDATQATQLLAHINRSG 107 Query: 248 IRLSLPAMNQILLMHVAVECFKHARQLFDEMSVRDFN--SWAILIAGSVENGENDEAIRL 421 + LS +N+ILLM+V+ AR +FD+M+V + N SWAI++A ++NG +EAI L Sbjct: 108 LPLSSALLNRILLMYVSCGLIHTARHMFDKMNVLNKNSISWAIMLAAYMDNGFYEEAIFL 167 Query: 422 FVRMLLDKGFENASGDHMELSIPGIVMCVLRACVCTEDFEFGRQVHGWLWKMGFSRNSGL 601 FV+M+ E S +EL I +CVL+ACV T + G+QVHGWL K+G++ N L Sbjct: 168 FVQMM-----ELHSTIMLELP-AWIFICVLKACVHTMNLTLGKQVHGWLLKVGYATNLFL 221 Query: 602 SSFLISFYGKMKHYEGAQSVFEQVNCRDTAVWTS 703 S +LISFYGK + + A VF+Q + R+T +WT+ Sbjct: 222 SCYLISFYGKFRCLDDADFVFDQTSERNTVIWTA 255 >gb|EXB70628.1| hypothetical protein L484_023813 [Morus notabilis] Length = 453 Score = 152 bits (385), Expect = 8e-35 Identities = 90/218 (41%), Positives = 129/218 (59%), Gaps = 6/218 (2%) Frame = +2 Query: 62 VPAAVKKRNP---KPSSTTSDILRLMDALKLPIPIDIYTSLIKECTESTDPLLAIELHDH 232 V ++K+N P+ +TSD+LRLMDAL LPI D+Y S +KECT S D A +LH+H Sbjct: 88 VEKKMRKKNALIAPPACSTSDVLRLMDALCLPISPDMYISFMKECTISADFCGAEDLHNH 147 Query: 233 IRRSGIR-LSLPAMNQILLMHVAVECFKHARQLFDEMSVRDFNSWAILIAGSVENGENDE 409 I R+ ++ L+LP +N++L M+V+ A LF M +DF SWA +I +V N + +E Sbjct: 148 ISRNSLQHLALPLLNRLLFMNVSCGRLDLACDLFYRMPFKDFKSWATMIVANVNNSDYEE 207 Query: 410 AIRLFVRMLLDKGFENASGDHME-LSIPG-IVMCVLRACVCTEDFEFGRQVHGWLWKMGF 583 A LF++ML H+ L P I++C+L+ CVCT + E G+QVH K+G Sbjct: 208 ATSLFLKML----------HHINMLEFPSWIIVCLLKTCVCTRNMELGKQVHACALKLGH 257 Query: 584 SRNSGLSSFLISFYGKMKHYEGAQSVFEQVNCRDTAVW 697 + + L+S LI+FYGK E A VF Q+ DT W Sbjct: 258 ANSLYLASCLINFYGKYGCLESANLVFNQLPRHDTLTW 295 >ref|XP_002519945.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223540991|gb|EEF42549.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 403 Score = 152 bits (383), Expect = 1e-34 Identities = 89/235 (37%), Positives = 138/235 (58%), Gaps = 5/235 (2%) Frame = +2 Query: 14 KTQIHFPLHR--PEYKQPVPAAVKKRNPKPSSTTSDILRLMDALKLPIPIDIYTSLIKEC 187 K I P H P + P + K S ++SDI+RLMD+L PIP DIYTSLIKEC Sbjct: 24 KKNITAPNHTKLPPLRTPNIKPINHLPAKKSCSSSDIMRLMDSLCHPIPPDIYTSLIKEC 83 Query: 188 TESTDPLLAIELHDH-IRRSGIRLSLPAMNQILLMHVAVECFKHARQLFDEMSVR-DFNS 361 T ++D A+ LH H I ++ ++L+ P ++++LLMHV+ AR LFD+M ++ DF S Sbjct: 84 TLTSDSTEALCLHSHLISQTNLKLTPPLVHRLLLMHVSCGQLDIARNLFDKMPLKKDFIS 143 Query: 362 WAILIAGSVENGENDEAIRLFVRMLLDKG-FENASGDHMELSIPGIVMCVLRACVCTEDF 538 W I+I G N + + I LF+ MLL ++ D +I I++C+++ C+ + + Sbjct: 144 WVIVIVGCFSNSKYEAGINLFIDMLLQHSVYDGLMFDLNTWNI--IILCIIKCCIYSMNI 201 Query: 539 EFGRQVHGWLWKMGFSRNSGLSSFLISFYGKMKHYEGAQSVFEQVNCRDTAVWTS 703 G+QVHG L+K+G + + L+ FYGK+ E SVF +++ +TA WT+ Sbjct: 202 SLGKQVHGILFKVGLTSEISFNVSLMDFYGKLGCLEDVNSVFNKLDNHNTATWTA 256 >ref|NP_174459.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75169166|sp|Q9C6R9.1|PPR66_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At1g31790 gi|12321298|gb|AAG50719.1|AC079041_12 hypothetical protein [Arabidopsis thaliana] gi|111074348|gb|ABH04547.1| At1g31790 [Arabidopsis thaliana] gi|332193272|gb|AEE31393.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 409 Score = 149 bits (377), Expect = 7e-34 Identities = 92/232 (39%), Positives = 127/232 (54%), Gaps = 9/232 (3%) Frame = +2 Query: 35 LHRPEYKQPVPAAV------KKRNPKPSSTTSDILRLMDALKLPIPIDIYTSLIKECTES 196 L +P++++ P V + +NP +TSDILRLMD+L LP DIY+ L KE Sbjct: 42 LRKPKHQKSEPVVVIQQPQIQPQNPSSRCSTSDILRLMDSLSLPGNEDIYSCLAKESARE 101 Query: 197 TDPLLAIELHDHIRRSGIRLSLPAMNQILLMHVAVECFKHARQLFDEMSVRDFNSWAILI 376 D A EL HI +S IR ++ +N++LLMHV+ RQ+FD M RDF+SWAI+ Sbjct: 102 NDQRGAHELQVHIMKSSIRPTITFINRLLLMHVSCGRLDITRQMFDRMPHRDFHSWAIVF 161 Query: 377 AGSVENGENDEAIRLFVRMLLDKGFENASGDHMELSIPG-IVMCVLRACVCTEDFEFGRQ 553 G +E G+ ++A LFV ML IP I+ CVL+AC DFE G+Q Sbjct: 162 LGCIEMGDYEDAAFLFVSML-------KHSQKGAFKIPSWILGCVLKACAMIRDFELGKQ 214 Query: 554 VHGWLWKMGF--SRNSGLSSFLISFYGKMKHYEGAQSVFEQVNCRDTAVWTS 703 VH K+GF +S LS LI FYG+ + E A V Q++ +T W + Sbjct: 215 VHALCHKLGFIDEEDSYLSGSLIRFYGEFRCLEDANLVLHQLSNANTVAWAA 266 >ref|XP_006415303.1| hypothetical protein EUTSA_v10009456mg [Eutrema salsugineum] gi|557093074|gb|ESQ33656.1| hypothetical protein EUTSA_v10009456mg [Eutrema salsugineum] Length = 400 Score = 148 bits (374), Expect = 1e-33 Identities = 92/233 (39%), Positives = 131/233 (56%), Gaps = 10/233 (4%) Frame = +2 Query: 35 LHRPEYKQPVPAAVKK------RNPK--PSSTTSDILRLMDALKLPIPIDIYTSLIKECT 190 L RP+++ P V + R PK P +TSDILRLMD+L LP D+Y+ L KE T Sbjct: 41 LRRPKHQISEPVVVIQPQIQIDRAPKSNPRCSTSDILRLMDSLSLPGNEDLYSCLAKEST 100 Query: 191 ESTDPLLAIELHDHIRRSGIRLSLPAMNQILLMHVAVECFKHARQLFDEMSVRDFNSWAI 370 D A +L HI S +R +N++LLMHV+ RQ+FD+M RDF+SWAI Sbjct: 101 TECDQRGAYDLQVHIMNSSVRPRTTFLNRLLLMHVSCGRLDITRQMFDKMPQRDFHSWAI 160 Query: 371 LIAGSVENGENDEAIRLFVRMLLDKGFENASGDHMELSIPGIVMCVLRACVCTEDFEFGR 550 +I G +E G+ +A+ LFV ML ++ + + P I+ CVL+AC D + G+ Sbjct: 161 VILGCIEMGDYQDAVFLFVSMLKNQ-------NRVSKIPPWIMGCVLKACGMIRDLDLGK 213 Query: 551 QVHGWLWKMGF--SRNSGLSSFLISFYGKMKHYEGAQSVFEQVNCRDTAVWTS 703 QVHG K+GF +S LS L+ FYG+ + E A V Q++ +T VW + Sbjct: 214 QVHGLCQKLGFIEVEDSYLSGCLVRFYGEFRCLEDANLVLNQLSNANTVVWAA 266 >ref|XP_002893686.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297339528|gb|EFH69945.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 410 Score = 144 bits (362), Expect = 4e-32 Identities = 91/233 (39%), Positives = 128/233 (54%), Gaps = 10/233 (4%) Frame = +2 Query: 35 LHRPEYKQPVPAAV------KKRNPKPSSTTSDILRLMDALKLPIPIDIYTSLIKECTES 196 L +P++++ P V + + P P +TSDILRLMD+L LP D+Y+ L KE Sbjct: 42 LRKPKHQKSEPVVVIQQPQIQPQKPSPRCSTSDILRLMDSLSLPGNEDLYSCLAKESARE 101 Query: 197 TDPLLAIELHDHIRRSGIRLSLPA-MNQILLMHVAVECFKHARQLFDEMSVRDFNSWAIL 373 D A EL HI +S IR +N++LLMHV+ R +FD+M RDF+SWAI+ Sbjct: 102 NDRRGAYELQVHIMKSSIRRPTTTFVNRLLLMHVSCGRLDITRHMFDKMPHRDFHSWAIV 161 Query: 374 IAGSVENGENDEAIRLFVRMLLDKGFENASGDHMELSIPGIVM-CVLRACVCTEDFEFGR 550 G +E G+ ++A LFV ML K +N + IP +M CVL+AC DFE G+ Sbjct: 162 FLGCIEMGDYEDAALLFVSML--KHSQNGA-----FKIPSWIMGCVLKACAMIRDFELGK 214 Query: 551 QVHGWLWKMGF--SRNSGLSSFLISFYGKMKHYEGAQSVFEQVNCRDTAVWTS 703 QVH K+G +S LS LI FYG+ + E A V Q++ +T W + Sbjct: 215 QVHALCHKLGCIDEEDSYLSGSLIRFYGEFRCLEDANLVLHQLSNANTVAWAA 267 >ref|XP_006303657.1| hypothetical protein CARUB_v10011695mg [Capsella rubella] gi|482572368|gb|EOA36555.1| hypothetical protein CARUB_v10011695mg [Capsella rubella] Length = 411 Score = 142 bits (357), Expect = 1e-31 Identities = 90/233 (38%), Positives = 125/233 (53%), Gaps = 10/233 (4%) Frame = +2 Query: 35 LHRPEYKQPVPAAVKKR-------NPKPSSTTSDILRLMDALKLPIPIDIYTSLIKECTE 193 L +P++++ P V ++ P + SDILRLMD L LP D+Y+ L KE Sbjct: 42 LRKPKHQKSEPVVVIQQPQIQTTQKSSPRCSISDILRLMDTLSLPGNEDLYSCLAKESAR 101 Query: 194 STDPLLAIELHDHIRRSGIRLSLPAMNQILLMHVAVECFKHARQLFDEMSVRDFNSWAIL 373 D A EL HI +S IR S +N++LLMHV+ R +FD+M RDF+SWAI+ Sbjct: 102 ENDRRGAYELQVHIMKSSIRPSTTFVNRLLLMHVSCGRLDITRNMFDKMPHRDFHSWAIV 161 Query: 374 IAGSVENGENDEAIRLFVRMLLDKGFENASGDHMELSIPGIVM-CVLRACVCTEDFEFGR 550 G +E G+ ++A LFV ML K +N IP +M CVL+AC D G+ Sbjct: 162 FLGCIEMGDYEDAALLFVAML--KHSKNGGA----FKIPSWIMGCVLKACAMIRDLALGK 215 Query: 551 QVHGWLWKMGF--SRNSGLSSFLISFYGKMKHYEGAQSVFEQVNCRDTAVWTS 703 QVHG K+GF +S L LI FYG+ + E A V Q++ +T VW + Sbjct: 216 QVHGLCQKLGFIGEEDSYLLGSLIRFYGEFRCLEDANLVLHQLSNANTVVWAA 268 >emb|CAN69066.1| hypothetical protein VITISV_016070 [Vitis vinifera] Length = 543 Score = 131 bits (329), Expect = 2e-28 Identities = 83/171 (48%), Positives = 108/171 (63%), Gaps = 5/171 (2%) Frame = +2 Query: 77 KKRNPKPSSTTS---DILRLMDALKLPIPIDIYTSLIKECTESTDPLLAIELHDHIRRSG 247 KK N + TTS DILRLMD L LPIP DIY SLIKE + + D A +L HI RSG Sbjct: 211 KKSNSNATPTTSTPTDILRLMDGLGLPIPPDIYASLIKESSTTGDATQATQLLAHINRSG 270 Query: 248 IRLSLPAMNQILLMHVAVECFKHARQLFDEMSVRDFN--SWAILIAGSVENGENDEAIRL 421 + LS +N+ILLM+V+ AR +FD+M+V + N SWAI++A ++NG +EAI L Sbjct: 271 LPLSSALLNRILLMYVSCGLIHTARHMFDKMNVLNKNSISWAIMLAAYMDNGFYEEAIFL 330 Query: 422 FVRMLLDKGFENASGDHMELSIPGIVMCVLRACVCTEDFEFGRQVHGWLWK 574 FV+M+ E S +EL I +CVL+ACV T + G+QVHGWL K Sbjct: 331 FVQMM-----ELHSTIMLELP-AWIFICVLKACVHTMNLTLGKQVHGWLTK 375 >ref|XP_004292904.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790-like [Fragaria vesca subsp. vesca] Length = 421 Score = 114 bits (286), Expect = 2e-23 Identities = 78/218 (35%), Positives = 119/218 (54%), Gaps = 9/218 (4%) Frame = +2 Query: 77 KKRNPKPSSTTSDILRLMDALKLPIPID------IYTSLIKECTESTDPLLAIELHDHIR 238 KK +TSDILRLMD L++P+ +Y SLI +C++S A+ L H+ Sbjct: 68 KKNENGSRCSTSDILRLMDGLQVPVTSTTLSDNHMYASLINDCSDSG---AALHLQAHLT 124 Query: 239 RSGIRLSLPAMNQILLMHVAVECFKHARQLFDEMSVRDFNSWAILIAGSVENGENDEAIR 418 R L +N++LL HV +A QLFDEM ++DFNSWA LI +N + EA+R Sbjct: 125 RKSPPPPLHLLNRLLLRHVCNGRLDNAHQLFDEMPLKDFNSWATLIVAYAQNADYAEALR 184 Query: 419 LFVRMLLDKGFENASGDHMELS-IPGIVM-CVLRACVCTEDFEFGRQVHGWLWKMGF-SR 589 LF+ ML + H+++S P +M CVL A T D G Q+HG K+G +R Sbjct: 185 LFLSML------HLQDCHVDISEFPAWIMACVLDA---TMDVGLGEQLHGCCLKLGHANR 235 Query: 590 NSGLSSFLISFYGKMKHYEGAQSVFEQVNCRDTAVWTS 703 + +++ LI+ YG+++ +E AQ ++ + WT+ Sbjct: 236 DMFVATSLINLYGRLRCHEAAQRASLGLSQPNALTWTA 273 >ref|XP_006841553.1| hypothetical protein AMTR_s00003p00175270 [Amborella trichopoda] gi|548843574|gb|ERN03228.1| hypothetical protein AMTR_s00003p00175270 [Amborella trichopoda] Length = 327 Score = 110 bits (274), Expect = 6e-22 Identities = 64/196 (32%), Positives = 105/196 (53%), Gaps = 4/196 (2%) Frame = +2 Query: 128 MDALKLPIPIDIYTSLIKECTESTDPLLAIELHDHIRRSGIRLSLPAMNQILLMHVAVEC 307 M +L++P+ Y+SL+KECT S + E+H HI ++ + + NQI+LM++A C Sbjct: 1 MYSLQIPLTPIAYSSLLKECTSSKSLVEGSEIHAHINKTSLYPGIHIENQIILMYMACRC 60 Query: 308 FKHARQLFDEMSVRDFNSWAILIAGSVENGENDEAIRLFVRMLLDKGFENASGDHMEL-- 481 A Q+FD+MS R+ ++W +I G ++ G N+E + L++RM H E+ Sbjct: 61 PTLAYQVFDKMSHRNTDTWQFMITGLMDLGMNEETLDLYIRM------------HQEMVR 108 Query: 482 --SIPGIVMCVLRACVCTEDFEFGRQVHGWLWKMGFSRNSGLSSFLISFYGKMKHYEGAQ 655 I VLRAC ED G+Q+H K G S+++ L L+ FY +MK A+ Sbjct: 109 MKPNTAIQGGVLRACAFIEDVGLGKQIHAKAIKSGSSKDTYLGCCLVDFYVEMKCLVSAR 168 Query: 656 SVFEQVNCRDTAVWTS 703 F+++ + WT+ Sbjct: 169 KAFDEICKPNVVAWTA 184 >ref|XP_002517553.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223543185|gb|EEF44717.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 653 Score = 108 bits (271), Expect = 1e-21 Identities = 61/187 (32%), Positives = 98/187 (52%), Gaps = 5/187 (2%) Frame = +2 Query: 158 DIYT--SLIKECTESTDPLLAIELHDHIRRSGIRLSLPAMNQILLMHVAVECFKHARQLF 331 DIYT +++ C + D + E+H H+ R G + A+N ++ M+V C AR +F Sbjct: 202 DIYTFPCVLRSCGGANDFIRGKEIHCHVIRFGFETDVSAVNALITMYVKCGCVGSARTVF 261 Query: 332 DEMSVRDFNSWAILIAGSVENGENDEAIRLFVRMLLDKGFENASGDHMELSIPGIVM--- 502 D+M RD SW +I+G ENGE E + LF++ML ELS+ +M Sbjct: 262 DKMLQRDRISWNAMISGYFENGECVEGLNLFLQML-------------ELSVDPDLMTMT 308 Query: 503 CVLRACVCTEDFEFGRQVHGWLWKMGFSRNSGLSSFLISFYGKMKHYEGAQSVFEQVNCR 682 V+ AC D GR++HG++ + G+ + + S LI Y + +++ A+ VF + CR Sbjct: 309 SVISACELLGDDRLGREIHGYVVRTGYGNDVSVHSLLIQMYASLGYWKEAEKVFSETECR 368 Query: 683 DTAVWTS 703 D WT+ Sbjct: 369 DVVSWTA 375 Score = 71.6 bits (174), Expect = 2e-10 Identities = 52/194 (26%), Positives = 90/194 (46%), Gaps = 2/194 (1%) Frame = +2 Query: 128 MDALKLPIPIDIYT--SLIKECTESTDPLLAIELHDHIRRSGIRLSLPAMNQILLMHVAV 301 + L+L + D+ T S+I C D L E+H ++ R+G + + ++ M+ ++ Sbjct: 293 LQMLELSVDPDLMTMTSVISACELLGDDRLGREIHGYVVRTGYGNDVSVHSLLIQMYASL 352 Query: 302 ECFKHARQLFDEMSVRDFNSWAILIAGSVENGENDEAIRLFVRMLLDKGFENASGDHMEL 481 +K A ++F E RD SW +I+G N +D+A+ + K E A E+ Sbjct: 353 GYWKEAEKVFSETECRDVVSWTAMISGYEGNLMHDKALETY------KNMELAGIVPDEI 406 Query: 482 SIPGIVMCVLRACVCTEDFEFGRQVHGWLWKMGFSRNSGLSSFLISFYGKMKHYEGAQSV 661 +I CVL AC + G ++H +MG +++ LI Y K K + A V Sbjct: 407 TI----ACVLSACASLGQLDLGMRLHELANRMGLMSFVIVANSLIDMYSKCKCIDKALEV 462 Query: 662 FEQVNCRDTAVWTS 703 F + ++ WTS Sbjct: 463 FHCIQDKNVISWTS 476 Score = 66.2 bits (160), Expect = 1e-08 Identities = 53/200 (26%), Positives = 89/200 (44%), Gaps = 5/200 (2%) Frame = +2 Query: 119 LRLMDALKLPIPIDIYTSLIKECTEST-----DPLLAIELHDHIRRSGIRLSLPAMNQIL 283 L M LK+ + + + +LI+ C D + L+ + +RL N +L Sbjct: 89 LNSMQELKILVEDETFIALIRLCENKRGYTEGDYVFKAVLNSLVNPLSVRLG----NALL 144 Query: 284 LMHVAVECFKHARQLFDEMSVRDFNSWAILIAGSVENGENDEAIRLFVRMLLDKGFENAS 463 M+V +A +F M R+ SW +L+ G + G DEA+ L+ RML + Sbjct: 145 SMYVRFSDLNNAWNVFGRMGERNLFSWNVLVGGYAKAGFFDEALCLYHRML----WVGIK 200 Query: 464 GDHMELSIPGIVMCVLRACVCTEDFEFGRQVHGWLWKMGFSRNSGLSSFLISFYGKMKHY 643 D CVLR+C DF G+++H + + GF + + LI+ Y K Sbjct: 201 PDIYTFP------CVLRSCGGANDFIRGKEIHCHVIRFGFETDVSAVNALITMYVKCGCV 254 Query: 644 EGAQSVFEQVNCRDTAVWTS 703 A++VF+++ RD W + Sbjct: 255 GSARTVFDKMLQRDRISWNA 274