BLASTX nr result
ID: Mentha25_contig00021304
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00021304 (765 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU27821.1| hypothetical protein MIMGU_mgv1a006926mg [Mimulus...   318   1e-84
ref|XP_006348079.1| PREDICTED: pentatricopeptide repeat-containi...   233   4e-59
ref|XP_004233795.1| PREDICTED: pentatricopeptide repeat-containi...   229   1e-57
ref|XP_006449088.1| hypothetical protein CICLE_v10018367mg [Citr...   219   6e-55
ref|XP_006467990.1| PREDICTED: pentatricopeptide repeat-containi...   218   2e-54
ref|XP_007025994.1| Tetratricopeptide repeat-like superfamily pr...   214   3e-53
ref|XP_007212718.1| hypothetical protein PRUPE_ppa018797mg [Prun...   211   2e-52
ref|XP_002264956.1| PREDICTED: pentatricopeptide repeat-containi...   209   1e-51
ref|XP_002518527.1| pentatricopeptide repeat-containing protein,...   204   3e-50
ref|XP_002305605.1| pentatricopeptide repeat-containing family p...   196   8e-48
ref|XP_006285106.1| hypothetical protein CARUB_v10006439mg [Caps...   165   1e-38
sp|O49558.2|PP331_ARATH RecName: Full=Pentatricopeptide repeat-c...   160   5e-37
ref|XP_006413812.1| hypothetical protein EUTSA_v10024760mg [Eutr...   157   3e-36
ref|NP_193849.2| pentatricopeptide repeat-containing protein [Ar...   135   2e-29
ref|XP_002867861.1| predicted protein [Arabidopsis lyrata subsp....   128   3e-27
ref|XP_006826483.1| hypothetical protein AMTR_s00004p00243870 [A...   126   7e-27
emb|CAA17536.1| putative protein [Arabidopsis thaliana] gi|72689...   126   1e-26
ref|XP_006857674.1| hypothetical protein AMTR_s00061p00160470 [A...    77   5e-12
ref|XP_006487095.1| PREDICTED: pentatricopeptide repeat-containi...
77 7e-12 ref|XP_006579638.1| PREDICTED: pentatricopeptide repeat-containi... 74 7e-11 >gb|EYU27821.1| hypothetical protein MIMGU_mgv1a006926mg [Mimulus guttatus] Length = 426 Score = 318 bits (815), Expect = 1e-84 Identities = 148/254 (58%), Positives = 201/254 (79%) Frame = +3 Query: 3 TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGM 182 TWS IA+IL+KDGKFERI ++FDVG++T EMFD IIDG+SKRGDF AFDY+N +CSK + Sbjct: 70 TWSSIARILHKDGKFERISKVFDVGIFTPEMFDLIIDGHSKRGDFEAAFDYLNRMCSKEI 129 Query: 183 EPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAV 362 PSFSTY+SIL+GAC++ D E+ EN+LS+MV KGHI+ T D+D ++K+LC G+TFAV Sbjct: 130 GPSFSTYSSILNGACKHQDGEIIENMLSLMVEKGHIAETPVCDYDSIVKELCDEGKTFAV 189 Query: 363 DLFFKRARDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILVSERCYSEFVI 542 DLF +RA + +EL++ TYECM ALL E +R+EDA++LY I++ K IL+SE CYSEFV+ Sbjct: 190 DLFSERAYEAKIELQHGTYECMLMALLSEEARLEDAIKLYKIVREKNILLSESCYSEFVV 249 Query: 543 ALCRQDPSLKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILDRGW 722 LC+++PS +I+N LVD+ + G + +ELS +I+KQC E WREAEE+F +L++G+ Sbjct: 250 ILCKENPSREITNLLVDITKQGFFFQ--PKELSGYISKQCAEGRWREAEEIFNAVLNKGF 307 Query: 723 LLDPLCCGSFVKRY 764 LLD CCGS VKR+ Sbjct: 308 LLDSTCCGSIVKRH 321 >ref|XP_006348079.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21170-like isoform X1 [Solanum tuberosum] gi|565362693|ref|XP_006348080.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21170-like isoform X2 [Solanum tuberosum] gi|565362695|ref|XP_006348081.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21170-like isoform X3 [Solanum tuberosum] Length = 584 Score = 233 bits (595), Expect = 4e-59 Identities = 113/254 (44%), Positives = 174/254 (68%) Frame = +3 Query: 3 TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGM 182 TWS+IA++L KDGKFE+I I D GV + M++ +ID YS+RG+F AF Y+N++ SK + Sbjct: 229 TWSLIAQMLCKDGKFEQIVPILDKGVCSPVMYNILIDCYSERGNFEAAFGYLNDMYSKCI 288 Query: 183 EPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAV 362 +P+F+T++SILDGAC+Y + 
EV E+V+S MV KGH+ + D+D VI++ +G+ +A Sbjct: 289 DPTFNTFSSILDGACKYQNAEVIESVMSSMVEKGHLPKVVLPDYDSVIRRFSDMGKAYAA 348 Query: 363 DLFFKRARDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILVSERCYSEFVI 542 +LFF+ A + ++L++ TY M A EG + EDA+ +YNI+ +KI +S++CYS F+ Sbjct: 349 ELFFREAYEKRIKLQDNTYGSMLRAFSKEG-KAEDAIWMYNIIVERKIFISDKCYSAFMS 407 Query: 543 ALCRQDPSLKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILDRGW 722 LC ++PSL++S+ L D+I G + P ++S FI QCE+R W+EAEEL +I R Sbjct: 408 VLCNENPSLEVSSLLKDLIGRGFVP--PVSQVSKFIVSQCEKRQWKEAEELLNVIFQRRL 465 Query: 723 LLDPLCCGSFVKRY 764 + CC S V+ Y Sbjct: 466 QFESFCCCSLVRHY 479 >ref|XP_004233795.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21170-like [Solanum lycopersicum] Length = 584 Score = 229 bits (583), Expect = 1e-57 Identities = 109/254 (42%), Positives = 172/254 (67%) Frame = +3 Query: 3 TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGM 182 TWS+IA++L KDGKFE+I I D GV + +++ +ID YS+RG F AF Y+N++ S+ + Sbjct: 229 TWSLIAQMLCKDGKFEKIVAILDKGVCSPLIYNILIDCYSERGKFDAAFGYLNDMYSERI 288 Query: 183 EPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAV 362 +P+FST++SILDGAC+Y + +V E+V+S MV KGH+ + D+D VI+K +G+ +A Sbjct: 289 DPTFSTFSSILDGACKYQNAQVIESVMSSMVEKGHLPKVVTPDYDSVIQKFSGIGKAYAA 348 Query: 363 DLFFKRARDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILVSERCYSEFVI 542 +LFF+ A + +++L++ TY M A EG + EDA+ +YNI+ +KI ++ +CYS F+ Sbjct: 349 ELFFREAYEKSIKLQDKTYGSMLRAFSKEG-KAEDAIWMYNIIVERKIFINGKCYSAFMS 407 Query: 543 ALCRQDPSLKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILDRGW 722 LC + PS+++S+ L D+I G + P ++S FI QCE+ W+EAEEL +I +G Sbjct: 408 VLCNEIPSVEVSSLLKDLIGRGFVP--PVSQVSKFIVSQCEKHQWKEAEELLNVIFQKGL 465 Query: 723 LLDPLCCGSFVKRY 764 + CC S V+ Y Sbjct: 466 QFESFCCCSLVRHY 479 >ref|XP_006449088.1| hypothetical protein CICLE_v10018367mg [Citrus clementina] gi|557551699|gb|ESR62328.1| hypothetical protein CICLE_v10018367mg [Citrus clementina] Length = 578 Score = 219 bits (559), Expect = 6e-55 Identities = 109/255 (42%), Positives = 168/255 (65%), Gaps 
= 1/255 (0%) Frame = +3 Query: 3 TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCS-KG 179 TWS++A+IL + GKFE + + D G+Y+S M++ +ID YSK+GDFG AFD +NE+C+ + Sbjct: 222 TWSLVAQILCRSGKFEVVLGLLDSGIYSSVMYNLVIDFYSKKGDFGAAFDRLNEMCNGRN 281 Query: 180 MEPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFA 359 + P FSTY+SILDG CRY EV++ ++ +MV K + + S +D VI+KL +G+T+A Sbjct: 282 LTPGFSTYSSILDGGCRYEKTEVSDRIVGLMVEKKLLPKNFLSGNDSVIQKLSDMGKTYA 341 Query: 360 VDLFFKRARDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILVSERCYSEFV 539 ++ FKRA D+ +EL++ TY CM AL EG RV++ +++Y+++ + I V + Y FV Sbjct: 342 AEMIFKRACDEKIELQDDTYGCMLKALSKEG-RVKEVIQIYHLISERGITVKDSDYYAFV 400 Query: 540 IALCRQDPSLKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILDRG 719 LC++ ++ L DV+ G I C + ELS F+ QC + W+E EEL +LD+G Sbjct: 401 NVLCKEHQPEEVCGLLRDVVERGYI-PC-AMELSRFVASQCGKGKWKEVEELLSAVLDQG 458 Query: 720 WLLDPLCCGSFVKRY 764 LLD CC S ++ Y Sbjct: 459 LLLDSFCCSSLMEYY 473 >ref|XP_006467990.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21170-like [Citrus sinensis] Length = 538 Score = 218 bits (554), Expect = 2e-54 Identities = 110/255 (43%), Positives = 169/255 (66%), Gaps = 1/255 (0%) Frame = +3 Query: 3 TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCS-KG 179 TWS++A+IL + GKFE + + D G+Y+S M++ +ID YSK+GDFG AFD +NE+C+ + Sbjct: 182 TWSLVAQILCRSGKFEVVLGLLDSGIYSSVMYNLVIDFYSKKGDFGAAFDRLNEMCNGRN 241 Query: 180 MEPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFA 359 + P FSTY+SILDGA RY EV++ ++ +MV K + + S +D VI+KL +G+T+A Sbjct: 242 LTPGFSTYSSILDGARRYEKTEVSDRIVGLMVEKKLLPKHFLSGNDYVIQKLSDMGKTYA 301 Query: 360 VDLFFKRARDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILVSERCYSEFV 539 ++ FKRA D+ +EL++ TY CM AL EG RV++A+++Y+++ + I V + Y FV Sbjct: 302 AEMIFKRACDEKIELQDDTYGCMLKALSKEG-RVKEAIQIYHLISERGITVRDSDYYAFV 360 Query: 540 IALCRQDPSLKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILDRG 719 LC++ ++ L DV+ G I C + ELS F+ QC + W+E EEL +LD+G Sbjct: 361 NVLCKEHQPEEVCGLLRDVVERGYI-PC-AMELSRFVASQCGKGKWKEVEELLSAVLDKG 418 Query: 
720 WLLDPLCCGSFVKRY 764 LLD CC S ++ Y Sbjct: 419 LLLDSFCCSSLMEYY 433 >ref|XP_007025994.1| Tetratricopeptide repeat-like superfamily protein, putative [Theobroma cacao] gi|508781360|gb|EOY28616.1| Tetratricopeptide repeat-like superfamily protein, putative [Theobroma cacao] Length = 578 Score = 214 bits (545), Expect = 3e-53 Identities = 108/254 (42%), Positives = 172/254 (67%) Frame = +3 Query: 3 TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGM 182 +WS++A+IL K+GK ++ + + G+Y SE++D +ID YSK GDFG AF+ +NE+ ++ + Sbjct: 223 SWSLVAQILCKNGKLGKVVGLLEKGIYNSEIYDLVIDFYSKSGDFGAAFNRLNEMYNRKV 282 Query: 183 EPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAV 362 + SF TY+SILDGAC+Y+D EV +L +MV K + R + S DL+I KLC + +T A Sbjct: 283 DTSFCTYSSILDGACKYNDGEVIGRILRMMVEKELVPRHQFSKKDLIIPKLCDLRKTHAA 342 Query: 363 DLFFKRARDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILVSERCYSEFVI 542 ++ FK+A D+N+ L N TY M AL E +R+++A+E+ ++ ++I+V+E CYS F+ Sbjct: 343 EMLFKKACDENIRLRNDTYGSMLKALSQE-ARIDEAIEVCRMILKRRIIVNESCYSAFIN 401 Query: 543 ALCRQDPSLKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILDRGW 722 ALC++D S LVD+I+ G + C S +LS +I+ QC + +WR+AEEL L+L++G Sbjct: 402 ALCKEDQSDDGYELLVDIIKRG-HNPCAS-KLSKYISSQCSQMNWRKAEELLDLMLEKGL 459 Query: 723 LLDPLCCGSFVKRY 764 L D C ++ Y Sbjct: 460 LPDSFGCCLLIQYY 473 >ref|XP_007212718.1| hypothetical protein PRUPE_ppa018797mg [Prunus persica] gi|462408583|gb|EMJ13917.1| hypothetical protein PRUPE_ppa018797mg [Prunus persica] Length = 584 Score = 211 bits (537), Expect = 2e-52 Identities = 106/254 (41%), Positives = 161/254 (63%) Frame = +3 Query: 3 TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGM 182 TWS++A+IL KDGKFERI R+ D+ +Y S M++ ++DG SK G+F AF ++NE+C + + Sbjct: 213 TWSLVAQILCKDGKFERILRLLDLNIYNSMMYNLLVDGCSKSGNFDAAFSHLNEMCDRKV 272 Query: 183 EPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAV 362 +P FSTY+SILDGAC+ + EV E V S+MV K + S++D +++KLC +G+T A Sbjct: 273 DPDFSTYSSILDGACKLGNVEVVERVTSVMVEKKLLPNCPLSEYDSIVEKLCDLGKTHAA 332 Query: 363 
DLFFKRARDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILVSERCYSEFVI 542 ++FFK+A D+ + L++ TY M AL E R ++A+ +Y ++ + I+V Y F Sbjct: 333 EMFFKKACDEKIGLQDGTYGLMLKALTNE-VRTKEAISVYRLISERGIVVDGSSYHAFAD 391 Query: 543 ALCRQDPSLKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILDRGW 722 LC+++ + L+DVI G + ELS FI+ C WREAE L ++LD+G Sbjct: 392 VLCKEERYEEGFELLMDVISRGCSPS--ASELSCFISFLCRRGRWREAEYLLNVVLDKGL 449 Query: 723 LLDPLCCGSFVKRY 764 L D +CC V RY Sbjct: 450 LPDLICCSPLVGRY 463 >ref|XP_002264956.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21170-like [Vitis vinifera] Length = 569 Score = 209 bits (531), Expect = 1e-51 Identities = 102/251 (40%), Positives = 161/251 (64%), Gaps = 1/251 (0%) Frame = +3 Query: 15 IAKILYKDGKFERICRIFDVGVYTSEM-FDFIIDGYSKRGDFGTAFDYVNELCSKGMEPS 191 IA IL K+GK ER+ R+ D+ + + + + +ID Y +RG+F AF Y+NE+C++ +P Sbjct: 217 IALILCKNGKLERVVRLLDMSIVCNALIYKLVIDCYCERGNFSAAFHYLNEMCNRKFDPG 276 Query: 192 FSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAVDLF 371 F Y SILDGAC+Y + EV + V+ MV KG + + S++D +I+K+C +G+T A +F Sbjct: 277 FCAYNSILDGACKYENDEVIQIVMGSMVEKGLLPKLLLSEYDSIIQKICNLGKTHAAQMF 336 Query: 372 FKRARDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILVSERCYSEFVIALC 551 FKRAR++ +EL+NATY CM AL +G RV++A+ +Y ++ + V + CY FV LC Sbjct: 337 FKRARNEKIELDNATYGCMLRALAKDG-RVKEAIGVYLVILESGVTVKDGCYHAFVNVLC 395 Query: 552 RQDPSLKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILDRGWLLD 731 +DPS ++S + ++I G S C S +LS FI C+ W EA++L + +++G L D Sbjct: 396 EEDPSQEVSKLMGEIIGKG-FSPCGS-KLSKFITSLCKNGRWTEADDLLNVTIEKGLLPD 453 Query: 732 PLCCGSFVKRY 764 CC + V+ Y Sbjct: 454 SFCCSALVEHY 464 >ref|XP_002518527.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223542372|gb|EEF43914.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 599 Score = 204 bits (519), Expect = 3e-50 Identities = 108/254 (42%), Positives = 162/254 (63%) Frame = +3 Query: 3 TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGM 182 TWS++A IL KDG FERI ++ 
D+G+ S M++ ++D YSK GDF AF +NE+ + + Sbjct: 237 TWSLVAHILCKDGNFERIVKLLDMGICNSVMYNAVVDYYSKNGDFKAAFCRLNEMYDRKV 296 Query: 183 EPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAV 362 EP FSTY+SILDGAC+ + +V E V++IMV K +S+ +SD+D +I+KLC +G+ A Sbjct: 297 EPGFSTYSSILDGACKCRNLQVIERVVAIMVGKQLLSKCPSSDYDSIIQKLCDLGKVSAA 356 Query: 363 DLFFKRARDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILVSERCYSEFVI 542 LFFKRA D+ + L++ATY M A EG +E+A+ LY ++ + + + + FV Sbjct: 357 TLFFKRACDERIGLQDATYGRMLRAFSIEGI-LEEAIGLYQVILERGLTIKDNASDAFVD 415 Query: 543 ALCRQDPSLKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILDRGW 722 L +D + + D++R G S C S LS +I C++R W+EAEEL Y++L++G Sbjct: 416 LLSEKDQYAEGYEIVRDIMRRG-FSPCTS-SLSKYITLLCKKRRWKEAEELLYMVLEKGL 473 Query: 723 LLDPLCCGSFVKRY 764 L D L S VK Y Sbjct: 474 LPDTLSFCSLVKHY 487 >ref|XP_002305605.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|222848569|gb|EEE86116.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 564 Score = 196 bits (498), Expect = 8e-48 Identities = 106/254 (41%), Positives = 159/254 (62%) Frame = +3 Query: 3 TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGM 182 TWS+IA+IL KDG FERI + D+GVY S +++ +ID SKRGDF AF+ +N++C + + Sbjct: 208 TWSLIAQILCKDGNFERIVKFLDMGVYNSVLYNGVIDCCSKRGDFEAAFERLNQMCERKL 267 Query: 183 EPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAV 362 +P FSTY++ILDGAC++ + EV E V+ IM KG + + S D VI+K + + Sbjct: 268 DPGFSTYSAILDGACKHGNEEVIERVMDIMAEKGLLPKCPLSQCDSVIQKFSDLCKMNVA 327 Query: 363 DLFFKRARDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILVSERCYSEFVI 542 +FF+RA D+ + L++ATY CM AL E +RV++A+ LY+++ K I V + Y F+ Sbjct: 328 TMFFRRACDEKIGLQDATYGCMLKALSKE-ARVKEAIGLYSLISEKGIRVKDSTYHAFLD 386 Query: 543 ALCRQDPSLKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILDRGW 722 L +D + L D++R G + + LS FI +R WRE E+L L+L++G Sbjct: 387 LLSEEDQYEEGYEILGDMMRRGF--RPGTVGLSKFILLLSRKRRWREVEDLLDLVLEKGL 444 Query: 723 LLDPLCCGSFVKRY 764 L D LCC S V+ Y Sbjct: 445 LPDSLCCCSLVEHY 458 
>ref|XP_006285106.1| hypothetical protein CARUB_v10006439mg [Capsella rubella] gi|482553811|gb|EOA18004.1| hypothetical protein CARUB_v10006439mg [Capsella rubella] Length = 585 Score = 165 bits (418), Expect = 1e-38 Identities = 89/257 (34%), Positives = 156/257 (60%), Gaps = 3/257 (1%) Frame = +3 Query: 3 TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGM 182 TW ++A+IL + G+ + + ++ + GV + +++ +++ YS+ G+F F+ ++E+ +K + Sbjct: 225 TWDLVAQILCEQGRSKSVVKLMETGVESCKIYTNLVECYSRNGEFDAVFNVIHEMDNKKL 284 Query: 183 EPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAV 362 E SFS+Y+ +LD CR D E+ VL +MV K ++ ++ +D +I++LC +G+TFA Sbjct: 285 ELSFSSYSCVLDDVCRLGDAELMGKVLGLMVEKKFLAVDASAVNDEIIERLCDMGKTFAS 344 Query: 363 DLFFKRA-RDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILV-SERCYSEF 536 ++ F++A + V L + TY CM AL +G R ++AV++Y ++ K I V E CY+EF Sbjct: 345 EMLFRKACNGETVRLRDGTYGCMLKALSRKG-RTKEAVDVYRLICRKGITVLDESCYTEF 403 Query: 537 VIALCRQDPSLKIS-NALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILD 713 ALCR D S + LVDVI+ G + C + LS+ + C +R WR AE+L +++ Sbjct: 404 ANALCRDDNSPEEELELLVDVIKRGFV-PC-TRRLSEVLASLCRKRRWRHAEKLLDSVME 461 Query: 714 RGWLLDPLCCGSFVKRY 764 D CG ++RY Sbjct: 462 MEVYFDSFSCGILMERY 478 >sp|O49558.2|PP331_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g21170 Length = 585 Score = 160 bits (405), Expect = 5e-37 Identities = 88/257 (34%), Positives = 152/257 (59%), Gaps = 3/257 (1%) Frame = +3 Query: 3 TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGM 182 TW +IA+IL + G+ + + ++ + GV + +++ +++ YS+ G+F F ++E+ K + Sbjct: 225 TWDLIAQILCEQGRSKSVFKLMETGVESCKIYTNLVECYSRNGEFDAVFSLIHEMDDKKL 284 Query: 183 EPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAV 362 E SF +Y +LD ACR D E + VL +MV K ++ ++ +D +I++LC +G+TFA Sbjct: 285 ELSFCSYGCVLDDACRLGDAEFIDKVLCLMVEKKFVTLGDSAVNDKIIERLCDMGKTFAS 344 Query: 363 DLFFKRA-RDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILV-SERCYSEF 536 ++ F++A + V L ++TY CM A L R ++AV++Y ++ K I V E CY EF Sbjct: 345 
EMLFRKACNGETVRLWDSTYGCMLKA-LSRKKRTKEAVDVYRMICRKGITVLDESCYIEF 403 Query: 537 VIALCRQD-PSLKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILD 713 ALCR D S + LVDVI+ G + C + +LS+ + C +R W+ AE+L +++ Sbjct: 404 ANALCRDDNSSEEEEELLVDVIKRGFV-PC-THKLSEVLASMCRKRRWKSAEKLLDSVME 461 Query: 714 RGWLLDPLCCGSFVKRY 764 D CG ++RY Sbjct: 462 MEVYFDSFACGLLMERY 478 >ref|XP_006413812.1| hypothetical protein EUTSA_v10024760mg [Eutrema salsugineum] gi|557114982|gb|ESQ55265.1| hypothetical protein EUTSA_v10024760mg [Eutrema salsugineum] Length = 584 Score = 157 bits (398), Expect = 3e-36 Identities = 86/257 (33%), Positives = 151/257 (58%), Gaps = 3/257 (1%) Frame = +3 Query: 3 TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGM 182 TW ++A++L + GKF+ + ++ + GV + +++ +++ YS+ G+F F + E+ +K + Sbjct: 225 TWDLVAQVLCEQGKFKSVVKLMETGVESCKIYTNLVECYSRNGEFDAVFSVIQEMDAKKL 284 Query: 183 EPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAV 362 E SF +Y +LD ACR D E+ + VL +MV K ++ ++ +D +I++LC +G+TFA Sbjct: 285 ELSFCSYGYVLDDACRLGDSELIDKVLGLMVEKEFLTLDDSTVNDQIIERLCDMGKTFAS 344 Query: 363 DLFFKRARDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILV-SERCYSEFV 539 ++ F RA + + + TY CM +L G R ++AV++Y ++ K I V E CY EF Sbjct: 345 EMLFHRACNGGT-VRDRTYGCMLKSLSVIG-RTKEAVDVYRLICRKGITVLDESCYKEFA 402 Query: 540 IALCRQD--PSLKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILD 713 ALCR D S + L+DVI+ G + C + +LS+ + C +R W AE+L +++ Sbjct: 403 NALCRDDDNSSEEEGELLIDVIKRGFV-PC-TLKLSEVLASLCRKRRWNRAEKLLDSVME 460 Query: 714 RGWLLDPLCCGSFVKRY 764 D CG ++RY Sbjct: 461 MEVHFDSFSCGLLMERY 477 >ref|NP_193849.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332659015|gb|AEE84415.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 551 Score = 135 bits (339), Expect = 2e-29 Identities = 74/205 (36%), Positives = 125/205 (60%), Gaps = 3/205 (1%) Frame = +3 Query: 3 TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGM 182 TW +IA+IL + G+ + + ++ + GV + +++ +++ YS+ G+F F ++E+ K + Sbjct: 225 
TWDLIAQILCEQGRSKSVFKLMETGVESCKIYTNLVECYSRNGEFDAVFSLIHEMDDKKL 284 Query: 183 EPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAV 362 E SF +Y +LD ACR D E + VL +MV K ++ ++ +D +I++LC +G+TFA Sbjct: 285 ELSFCSYGCVLDDACRLGDAEFIDKVLCLMVEKKFVTLGDSAVNDKIIERLCDMGKTFAS 344 Query: 363 DLFFKRA-RDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILV-SERCYSEF 536 ++ F++A + V L ++TY CM A L R ++AV++Y ++ K I V E CY EF Sbjct: 345 EMLFRKACNGETVRLWDSTYGCMLKA-LSRKKRTKEAVDVYRMICRKGITVLDESCYIEF 403 Query: 537 VIALCRQD-PSLKISNALVDVIRSG 608 ALCR D S + LVDVI+ G Sbjct: 404 ANALCRDDNSSEEEEELLVDVIKRG 428 >ref|XP_002867861.1| predicted protein [Arabidopsis lyrata subsp. lyrata] gi|297313697|gb|EFH44120.1| predicted protein [Arabidopsis lyrata subsp. lyrata] Length = 534 Score = 128 bits (321), Expect = 3e-27 Identities = 71/200 (35%), Positives = 122/200 (61%), Gaps = 3/200 (1%) Frame = +3 Query: 18 AKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGMEPSFS 197 A IL + G+ + + ++ + GV + +++ +++ YS+ G+F F ++E+ K +E SFS Sbjct: 213 AMILCEHGRSKSVVKLMETGVESCKIYTNLVECYSRNGEFDATFSLIHEMDGKKLELSFS 272 Query: 198 TYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAVDLFFK 377 +Y +LD ACR D E+ + VL MV K ++ ++ +D +I++LC +G+TFA ++ F+ Sbjct: 273 SYGCVLDNACRLGDAELIDKVLGSMVEKKFLTLGDSALNDQMIERLCDMGKTFASEMLFR 332 Query: 378 RA-RDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKI-LVSERCYSEFVIALC 551 +A + V L +TY CM A L R ++AV++Y ++ K I ++ E CY+EF ALC Sbjct: 333 KACNGETVRLRESTYGCMLKA-LSRKERTKEAVDVYRMICRKGINVLDESCYNEFANALC 391 Query: 552 RQDPSLKI-SNALVDVIRSG 608 R D S + LVDVI+ G Sbjct: 392 RDDNSSEEGEELLVDVIKRG 411 >ref|XP_006826483.1| hypothetical protein AMTR_s00004p00243870 [Amborella trichopoda] gi|548830797|gb|ERM93720.1| hypothetical protein AMTR_s00004p00243870 [Amborella trichopoda] Length = 359 Score = 126 bits (317), Expect = 7e-27 Identities = 76/246 (30%), Positives = 126/246 (51%), Gaps = 22/246 (8%) Frame = +3 Query: 93 MFDFIIDGYSKRGDFGTAFDYVNELCSKGMEPSFSTYTSILDGACRYHDREVAENVLSIM 272 +++ I+DGY + GDF AF+ + + KG+EP F++Y 
SILDG+CR+ + A VL IM Sbjct: 12 VYNLILDGYCRNGDFVIAFEVIERIYGKGLEPDFASYGSILDGSCRFGNMGTAVRVLRIM 71 Query: 273 VAKGHISRTRAS----------------------DHDLVIKKLCAVGRTFAVDLFFKRAR 386 + K + +D I+KLC +G T A +L F AR Sbjct: 72 LEKRLVPTVGGEFSPNDCFTLNDNNCIVAAISYLHYDAFIRKLCKLGMTHAAELVFGIAR 131 Query: 387 DDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILVSERCYSEFVIALCRQDPS 566 V L+NA Y + A R+++AV +Y +L + I ++ + + AL +++PS Sbjct: 132 SALVPLQNACYIALLKA-FSRDRRIKEAVRMYFLLLQRDIAMNISECNVLLNALFKEEPS 190 Query: 567 LKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILDRGWLLDPLCCG 746 +++ + VI G +S +I+ QC + W+EA EL ++ L+RG + D G Sbjct: 191 EEVNKVIKSVIEKGFYP--DPLAISSYISAQCSKGGWQEANELLWVTLERGVMPDGFVWG 248 Query: 747 SFVKRY 764 SF++ Y Sbjct: 249 SFIRHY 254 >emb|CAA17536.1| putative protein [Arabidopsis thaliana] gi|7268914|emb|CAB79117.1| putative protein [Arabidopsis thaliana] Length = 534 Score = 126 bits (316), Expect = 1e-26 Identities = 71/200 (35%), Positives = 120/200 (60%), Gaps = 3/200 (1%) Frame = +3 Query: 18 AKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGMEPSFS 197 A IL + G+ + + ++ + GV + +++ +++ YS+ G+F F ++E+ K +E SF Sbjct: 213 AMILCEQGRSKSVFKLMETGVESCKIYTNLVECYSRNGEFDAVFSLIHEMDDKKLELSFC 272 Query: 198 TYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAVDLFFK 377 +Y +LD ACR D E + VL +MV K ++ ++ +D +I++LC +G+TFA ++ F+ Sbjct: 273 SYGCVLDDACRLGDAEFIDKVLCLMVEKKFVTLGDSAVNDKIIERLCDMGKTFASEMLFR 332 Query: 378 RA-RDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILV-SERCYSEFVIALC 551 +A + V L ++TY CM A L R ++AV++Y ++ K I V E CY EF ALC Sbjct: 333 KACNGETVRLWDSTYGCMLKA-LSRKKRTKEAVDVYRMICRKGITVLDESCYIEFANALC 391 Query: 552 RQD-PSLKISNALVDVIRSG 608 R D S + LVDVI+ G Sbjct: 392 RDDNSSEEEEELLVDVIKRG 411 >ref|XP_006857674.1| hypothetical protein AMTR_s00061p00160470 [Amborella trichopoda] gi|548861770|gb|ERN19141.1| hypothetical protein AMTR_s00061p00160470 [Amborella trichopoda] Length = 372 Score = 77.4 bits (189), Expect = 5e-12 Identities = 45/152 (29%), Positives = 85/152 (55%) Frame = +3 Query: 309 
DHDLVIKKLCAVGRTFAVDLFFKRARDDNVELENATYECMFAALLCEGSRVEDAVELYNI 488 D+ + I++LC +G T A +L F A + V L+NA+Y + R+++AV +Y + Sbjct: 119 DYGVFIRRLCKLGMTDAAELVFGIAHNALVFLQNASYIALLKGF-SRDKRIKEAVRMYFL 177 Query: 489 LQFKKILVSERCYSEFVIALCRQDPSLKISNALVDVIRSGVISKCPSEELSDFINKQCEE 668 L + I ++ + + AL +++ S +++ + VIR G + +S I+ QC + Sbjct: 178 LLQRDIALNICECNVLLNALFKEEQSEEVNKVIKSVIRKGFYPDPLA--ISSHISSQCSK 235 Query: 669 RHWREAEELFYLILDRGWLLDPLCCGSFVKRY 764 W+EA EL +++L+RG + + CGSF++ Y Sbjct: 236 GGWQEANELLWVMLERGVMPNGFACGSFIRHY 267 >ref|XP_006487095.1| PREDICTED: pentatricopeptide repeat-containing protein At5g64320, mitochondrial-like isoform X1 [Citrus sinensis] gi|568867543|ref|XP_006487096.1| PREDICTED: pentatricopeptide repeat-containing protein At5g64320, mitochondrial-like isoform X2 [Citrus sinensis] Length = 728 Score = 77.0 bits (188), Expect = 7e-12 Identities = 48/182 (26%), Positives = 90/182 (49%) Frame = +3 Query: 96 FDFIIDGYSKRGDFGTAFDYVNELCSKGMEPSFSTYTSILDGACRYHDREVAENVLSIMV 275 F+ +I G K+ FG+A + VN + KG EP+ TYT ++DG C+ E A +++ M+ Sbjct: 395 FNILIHGLCKQRRFGSALELVNAMAVKGCEPNIVTYTILVDGFCKEGQLEKANIIINEML 454 Query: 276 AKGHISRTRASDHDLVIKKLCAVGRTFAVDLFFKRARDDNVELENATYECMFAALLCEGS 455 AKG T ++ +I LC+ G+ F + + T+ + + LC+G Sbjct: 455 AKGLSLNT--VGYNCLIHALCSAGKIIEAMEIFGEMPSKGCKRDIYTFNSIISG-LCKGD 511 Query: 456 RVEDAVELYNILQFKKILVSERCYSEFVIALCRQDPSLKISNALVDVIRSGVISKCPSEE 635 R+E+A+ LY + + + + Y+ + A R+ + + D++ G CP +E Sbjct: 512 RIEEALGLYQDMLLEGVTANTVTYNTLIHAFLRRGSLHEAHKLVNDMLFRG----CPLDE 567 Query: 636 LS 641 ++ Sbjct: 568 IT 569 >ref|XP_006579638.1| PREDICTED: pentatricopeptide repeat-containing protein At2g26790, mitochondrial-like [Glycine max] Length = 801 Score = 73.6 bits (179), Expect = 7e-11 Identities = 66/249 (26%), Positives = 110/249 (44%), Gaps = 32/249 (12%) Frame = +3 Query: 69 DVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGMEPSFSTYTSILDGACRY-HDRE 245 DV YT+ +I+GY +GD TAF+ E+ KG++P TY + G R H RE Sbjct: 421 DVKHYTT-----LINGYCLQGDLVTAFNMFKEMKEKGLKPDIVTYNVLAAGLSRNGHARE 475 
Query: 246 VAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAVDLFFKRARDDNVELENA---- 413 + +L M ++G + ++ H ++I+ LC+ G+ +++F D N+E+ +A Sbjct: 476 TVK-LLDFMESQG--MKPNSTTHKMIIEGLCSGGKVLEAEVYFNSLEDKNIEIYSAMVNG 532 Query: 414 ---------TYECMFAAL-----------------LCEGSRVEDAVELYNILQFKKILVS 515 +YE L LC +E AV+L + + + S Sbjct: 533 YCETDLVKKSYEVFLKLLNQGDMAKKASCFKLLSKLCMTGDIEKAVKLLDRMLLSNVEPS 592 Query: 516 ERCYSEFVIALCRQDPSLKISNALVDV-IRSGVISKCPSEELSDFINKQCEERHWREAEE 692 + YS+ + ALC Q +K + L DV + G + + IN C +EA + Sbjct: 593 KIMYSKILAALC-QAGDMKNARTLFDVFVHRGFTPDVVTYTI--MINSYCRMNCLQEAHD 649 Query: 693 LFYLILDRG 719 LF + RG Sbjct: 650 LFQDMKRRG 658