BLASTX nr result

ID: Mentha25_contig00021304 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00021304
         (765 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU27821.1| hypothetical protein MIMGU_mgv1a006926mg [Mimulus...   318   1e-84
ref|XP_006348079.1| PREDICTED: pentatricopeptide repeat-containi...   233   4e-59
ref|XP_004233795.1| PREDICTED: pentatricopeptide repeat-containi...   229   1e-57
ref|XP_006449088.1| hypothetical protein CICLE_v10018367mg [Citr...   219   6e-55
ref|XP_006467990.1| PREDICTED: pentatricopeptide repeat-containi...   218   2e-54
ref|XP_007025994.1| Tetratricopeptide repeat-like superfamily pr...   214   3e-53
ref|XP_007212718.1| hypothetical protein PRUPE_ppa018797mg [Prun...   211   2e-52
ref|XP_002264956.1| PREDICTED: pentatricopeptide repeat-containi...   209   1e-51
ref|XP_002518527.1| pentatricopeptide repeat-containing protein,...   204   3e-50
ref|XP_002305605.1| pentatricopeptide repeat-containing family p...   196   8e-48
ref|XP_006285106.1| hypothetical protein CARUB_v10006439mg [Caps...   165   1e-38
sp|O49558.2|PP331_ARATH RecName: Full=Pentatricopeptide repeat-c...   160   5e-37
ref|XP_006413812.1| hypothetical protein EUTSA_v10024760mg [Eutr...   157   3e-36
ref|NP_193849.2| pentatricopeptide repeat-containing protein [Ar...   135   2e-29
ref|XP_002867861.1| predicted protein [Arabidopsis lyrata subsp....   128   3e-27
ref|XP_006826483.1| hypothetical protein AMTR_s00004p00243870 [A...   126   7e-27
emb|CAA17536.1| putative protein [Arabidopsis thaliana] gi|72689...   126   1e-26
ref|XP_006857674.1| hypothetical protein AMTR_s00061p00160470 [A...    77   5e-12
ref|XP_006487095.1| PREDICTED: pentatricopeptide repeat-containi...    77   7e-12
ref|XP_006579638.1| PREDICTED: pentatricopeptide repeat-containi...    74   7e-11

>gb|EYU27821.1| hypothetical protein MIMGU_mgv1a006926mg [Mimulus guttatus]
          Length = 426

 Score =  318 bits (815), Expect = 1e-84
 Identities = 148/254 (58%), Positives = 201/254 (79%)
 Frame = +3

Query: 3   TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGM 182
           TWS IA+IL+KDGKFERI ++FDVG++T EMFD IIDG+SKRGDF  AFDY+N +CSK +
Sbjct: 70  TWSSIARILHKDGKFERISKVFDVGIFTPEMFDLIIDGHSKRGDFEAAFDYLNRMCSKEI 129

Query: 183 EPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAV 362
            PSFSTY+SIL+GAC++ D E+ EN+LS+MV KGHI+ T   D+D ++K+LC  G+TFAV
Sbjct: 130 GPSFSTYSSILNGACKHQDGEIIENMLSLMVEKGHIAETPVCDYDSIVKELCDEGKTFAV 189

Query: 363 DLFFKRARDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILVSERCYSEFVI 542
           DLF +RA +  +EL++ TYECM  ALL E +R+EDA++LY I++ K IL+SE CYSEFV+
Sbjct: 190 DLFSERAYEAKIELQHGTYECMLMALLSEEARLEDAIKLYKIVREKNILLSESCYSEFVV 249

Query: 543 ALCRQDPSLKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILDRGW 722
            LC+++PS +I+N LVD+ + G   +   +ELS +I+KQC E  WREAEE+F  +L++G+
Sbjct: 250 ILCKENPSREITNLLVDITKQGFFFQ--PKELSGYISKQCAEGRWREAEEIFNAVLNKGF 307

Query: 723 LLDPLCCGSFVKRY 764
           LLD  CCGS VKR+
Sbjct: 308 LLDSTCCGSIVKRH 321


>ref|XP_006348079.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g21170-like isoform X1 [Solanum tuberosum]
           gi|565362693|ref|XP_006348080.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At4g21170-like isoform X2 [Solanum tuberosum]
           gi|565362695|ref|XP_006348081.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At4g21170-like isoform X3 [Solanum tuberosum]
          Length = 584

 Score =  233 bits (595), Expect = 4e-59
 Identities = 113/254 (44%), Positives = 174/254 (68%)
 Frame = +3

Query: 3   TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGM 182
           TWS+IA++L KDGKFE+I  I D GV +  M++ +ID YS+RG+F  AF Y+N++ SK +
Sbjct: 229 TWSLIAQMLCKDGKFEQIVPILDKGVCSPVMYNILIDCYSERGNFEAAFGYLNDMYSKCI 288

Query: 183 EPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAV 362
           +P+F+T++SILDGAC+Y + EV E+V+S MV KGH+ +    D+D VI++   +G+ +A 
Sbjct: 289 DPTFNTFSSILDGACKYQNAEVIESVMSSMVEKGHLPKVVLPDYDSVIRRFSDMGKAYAA 348

Query: 363 DLFFKRARDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILVSERCYSEFVI 542
           +LFF+ A +  ++L++ TY  M  A   EG + EDA+ +YNI+  +KI +S++CYS F+ 
Sbjct: 349 ELFFREAYEKRIKLQDNTYGSMLRAFSKEG-KAEDAIWMYNIIVERKIFISDKCYSAFMS 407

Query: 543 ALCRQDPSLKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILDRGW 722
            LC ++PSL++S+ L D+I  G +   P  ++S FI  QCE+R W+EAEEL  +I  R  
Sbjct: 408 VLCNENPSLEVSSLLKDLIGRGFVP--PVSQVSKFIVSQCEKRQWKEAEELLNVIFQRRL 465

Query: 723 LLDPLCCGSFVKRY 764
             +  CC S V+ Y
Sbjct: 466 QFESFCCCSLVRHY 479


>ref|XP_004233795.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g21170-like [Solanum lycopersicum]
          Length = 584

 Score =  229 bits (583), Expect = 1e-57
 Identities = 109/254 (42%), Positives = 172/254 (67%)
 Frame = +3

Query: 3   TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGM 182
           TWS+IA++L KDGKFE+I  I D GV +  +++ +ID YS+RG F  AF Y+N++ S+ +
Sbjct: 229 TWSLIAQMLCKDGKFEKIVAILDKGVCSPLIYNILIDCYSERGKFDAAFGYLNDMYSERI 288

Query: 183 EPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAV 362
           +P+FST++SILDGAC+Y + +V E+V+S MV KGH+ +    D+D VI+K   +G+ +A 
Sbjct: 289 DPTFSTFSSILDGACKYQNAQVIESVMSSMVEKGHLPKVVTPDYDSVIQKFSGIGKAYAA 348

Query: 363 DLFFKRARDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILVSERCYSEFVI 542
           +LFF+ A + +++L++ TY  M  A   EG + EDA+ +YNI+  +KI ++ +CYS F+ 
Sbjct: 349 ELFFREAYEKSIKLQDKTYGSMLRAFSKEG-KAEDAIWMYNIIVERKIFINGKCYSAFMS 407

Query: 543 ALCRQDPSLKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILDRGW 722
            LC + PS+++S+ L D+I  G +   P  ++S FI  QCE+  W+EAEEL  +I  +G 
Sbjct: 408 VLCNEIPSVEVSSLLKDLIGRGFVP--PVSQVSKFIVSQCEKHQWKEAEELLNVIFQKGL 465

Query: 723 LLDPLCCGSFVKRY 764
             +  CC S V+ Y
Sbjct: 466 QFESFCCCSLVRHY 479


>ref|XP_006449088.1| hypothetical protein CICLE_v10018367mg [Citrus clementina]
           gi|557551699|gb|ESR62328.1| hypothetical protein
           CICLE_v10018367mg [Citrus clementina]
          Length = 578

 Score =  219 bits (559), Expect = 6e-55
 Identities = 109/255 (42%), Positives = 168/255 (65%), Gaps = 1/255 (0%)
 Frame = +3

Query: 3   TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCS-KG 179
           TWS++A+IL + GKFE +  + D G+Y+S M++ +ID YSK+GDFG AFD +NE+C+ + 
Sbjct: 222 TWSLVAQILCRSGKFEVVLGLLDSGIYSSVMYNLVIDFYSKKGDFGAAFDRLNEMCNGRN 281

Query: 180 MEPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFA 359
           + P FSTY+SILDG CRY   EV++ ++ +MV K  + +   S +D VI+KL  +G+T+A
Sbjct: 282 LTPGFSTYSSILDGGCRYEKTEVSDRIVGLMVEKKLLPKNFLSGNDSVIQKLSDMGKTYA 341

Query: 360 VDLFFKRARDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILVSERCYSEFV 539
            ++ FKRA D+ +EL++ TY CM  AL  EG RV++ +++Y+++  + I V +  Y  FV
Sbjct: 342 AEMIFKRACDEKIELQDDTYGCMLKALSKEG-RVKEVIQIYHLISERGITVKDSDYYAFV 400

Query: 540 IALCRQDPSLKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILDRG 719
             LC++    ++   L DV+  G I  C + ELS F+  QC +  W+E EEL   +LD+G
Sbjct: 401 NVLCKEHQPEEVCGLLRDVVERGYI-PC-AMELSRFVASQCGKGKWKEVEELLSAVLDQG 458

Query: 720 WLLDPLCCGSFVKRY 764
            LLD  CC S ++ Y
Sbjct: 459 LLLDSFCCSSLMEYY 473


>ref|XP_006467990.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g21170-like [Citrus sinensis]
          Length = 538

 Score =  218 bits (554), Expect = 2e-54
 Identities = 110/255 (43%), Positives = 169/255 (66%), Gaps = 1/255 (0%)
 Frame = +3

Query: 3   TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCS-KG 179
           TWS++A+IL + GKFE +  + D G+Y+S M++ +ID YSK+GDFG AFD +NE+C+ + 
Sbjct: 182 TWSLVAQILCRSGKFEVVLGLLDSGIYSSVMYNLVIDFYSKKGDFGAAFDRLNEMCNGRN 241

Query: 180 MEPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFA 359
           + P FSTY+SILDGA RY   EV++ ++ +MV K  + +   S +D VI+KL  +G+T+A
Sbjct: 242 LTPGFSTYSSILDGARRYEKTEVSDRIVGLMVEKKLLPKHFLSGNDYVIQKLSDMGKTYA 301

Query: 360 VDLFFKRARDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILVSERCYSEFV 539
            ++ FKRA D+ +EL++ TY CM  AL  EG RV++A+++Y+++  + I V +  Y  FV
Sbjct: 302 AEMIFKRACDEKIELQDDTYGCMLKALSKEG-RVKEAIQIYHLISERGITVRDSDYYAFV 360

Query: 540 IALCRQDPSLKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILDRG 719
             LC++    ++   L DV+  G I  C + ELS F+  QC +  W+E EEL   +LD+G
Sbjct: 361 NVLCKEHQPEEVCGLLRDVVERGYI-PC-AMELSRFVASQCGKGKWKEVEELLSAVLDKG 418

Query: 720 WLLDPLCCGSFVKRY 764
            LLD  CC S ++ Y
Sbjct: 419 LLLDSFCCSSLMEYY 433


>ref|XP_007025994.1| Tetratricopeptide repeat-like superfamily protein, putative
           [Theobroma cacao] gi|508781360|gb|EOY28616.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative [Theobroma cacao]
          Length = 578

 Score =  214 bits (545), Expect = 3e-53
 Identities = 108/254 (42%), Positives = 172/254 (67%)
 Frame = +3

Query: 3   TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGM 182
           +WS++A+IL K+GK  ++  + + G+Y SE++D +ID YSK GDFG AF+ +NE+ ++ +
Sbjct: 223 SWSLVAQILCKNGKLGKVVGLLEKGIYNSEIYDLVIDFYSKSGDFGAAFNRLNEMYNRKV 282

Query: 183 EPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAV 362
           + SF TY+SILDGAC+Y+D EV   +L +MV K  + R + S  DL+I KLC + +T A 
Sbjct: 283 DTSFCTYSSILDGACKYNDGEVIGRILRMMVEKELVPRHQFSKKDLIIPKLCDLRKTHAA 342

Query: 363 DLFFKRARDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILVSERCYSEFVI 542
           ++ FK+A D+N+ L N TY  M  AL  E +R+++A+E+  ++  ++I+V+E CYS F+ 
Sbjct: 343 EMLFKKACDENIRLRNDTYGSMLKALSQE-ARIDEAIEVCRMILKRRIIVNESCYSAFIN 401

Query: 543 ALCRQDPSLKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILDRGW 722
           ALC++D S      LVD+I+ G  + C S +LS +I+ QC + +WR+AEEL  L+L++G 
Sbjct: 402 ALCKEDQSDDGYELLVDIIKRG-HNPCAS-KLSKYISSQCSQMNWRKAEELLDLMLEKGL 459

Query: 723 LLDPLCCGSFVKRY 764
           L D   C   ++ Y
Sbjct: 460 LPDSFGCCLLIQYY 473


>ref|XP_007212718.1| hypothetical protein PRUPE_ppa018797mg [Prunus persica]
           gi|462408583|gb|EMJ13917.1| hypothetical protein
           PRUPE_ppa018797mg [Prunus persica]
          Length = 584

 Score =  211 bits (537), Expect = 2e-52
 Identities = 106/254 (41%), Positives = 161/254 (63%)
 Frame = +3

Query: 3   TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGM 182
           TWS++A+IL KDGKFERI R+ D+ +Y S M++ ++DG SK G+F  AF ++NE+C + +
Sbjct: 213 TWSLVAQILCKDGKFERILRLLDLNIYNSMMYNLLVDGCSKSGNFDAAFSHLNEMCDRKV 272

Query: 183 EPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAV 362
           +P FSTY+SILDGAC+  + EV E V S+MV K  +     S++D +++KLC +G+T A 
Sbjct: 273 DPDFSTYSSILDGACKLGNVEVVERVTSVMVEKKLLPNCPLSEYDSIVEKLCDLGKTHAA 332

Query: 363 DLFFKRARDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILVSERCYSEFVI 542
           ++FFK+A D+ + L++ TY  M  AL  E  R ++A+ +Y ++  + I+V    Y  F  
Sbjct: 333 EMFFKKACDEKIGLQDGTYGLMLKALTNE-VRTKEAISVYRLISERGIVVDGSSYHAFAD 391

Query: 543 ALCRQDPSLKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILDRGW 722
            LC+++   +    L+DVI  G      + ELS FI+  C    WREAE L  ++LD+G 
Sbjct: 392 VLCKEERYEEGFELLMDVISRGCSPS--ASELSCFISFLCRRGRWREAEYLLNVVLDKGL 449

Query: 723 LLDPLCCGSFVKRY 764
           L D +CC   V RY
Sbjct: 450 LPDLICCSPLVGRY 463


>ref|XP_002264956.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g21170-like [Vitis vinifera]
          Length = 569

 Score =  209 bits (531), Expect = 1e-51
 Identities = 102/251 (40%), Positives = 161/251 (64%), Gaps = 1/251 (0%)
 Frame = +3

Query: 15  IAKILYKDGKFERICRIFDVGVYTSEM-FDFIIDGYSKRGDFGTAFDYVNELCSKGMEPS 191
           IA IL K+GK ER+ R+ D+ +  + + +  +ID Y +RG+F  AF Y+NE+C++  +P 
Sbjct: 217 IALILCKNGKLERVVRLLDMSIVCNALIYKLVIDCYCERGNFSAAFHYLNEMCNRKFDPG 276

Query: 192 FSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAVDLF 371
           F  Y SILDGAC+Y + EV + V+  MV KG + +   S++D +I+K+C +G+T A  +F
Sbjct: 277 FCAYNSILDGACKYENDEVIQIVMGSMVEKGLLPKLLLSEYDSIIQKICNLGKTHAAQMF 336

Query: 372 FKRARDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILVSERCYSEFVIALC 551
           FKRAR++ +EL+NATY CM  AL  +G RV++A+ +Y ++    + V + CY  FV  LC
Sbjct: 337 FKRARNEKIELDNATYGCMLRALAKDG-RVKEAIGVYLVILESGVTVKDGCYHAFVNVLC 395

Query: 552 RQDPSLKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILDRGWLLD 731
            +DPS ++S  + ++I  G  S C S +LS FI   C+   W EA++L  + +++G L D
Sbjct: 396 EEDPSQEVSKLMGEIIGKG-FSPCGS-KLSKFITSLCKNGRWTEADDLLNVTIEKGLLPD 453

Query: 732 PLCCGSFVKRY 764
             CC + V+ Y
Sbjct: 454 SFCCSALVEHY 464


>ref|XP_002518527.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223542372|gb|EEF43914.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 599

 Score =  204 bits (519), Expect = 3e-50
 Identities = 108/254 (42%), Positives = 162/254 (63%)
 Frame = +3

Query: 3   TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGM 182
           TWS++A IL KDG FERI ++ D+G+  S M++ ++D YSK GDF  AF  +NE+  + +
Sbjct: 237 TWSLVAHILCKDGNFERIVKLLDMGICNSVMYNAVVDYYSKNGDFKAAFCRLNEMYDRKV 296

Query: 183 EPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAV 362
           EP FSTY+SILDGAC+  + +V E V++IMV K  +S+  +SD+D +I+KLC +G+  A 
Sbjct: 297 EPGFSTYSSILDGACKCRNLQVIERVVAIMVGKQLLSKCPSSDYDSIIQKLCDLGKVSAA 356

Query: 363 DLFFKRARDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILVSERCYSEFVI 542
            LFFKRA D+ + L++ATY  M  A   EG  +E+A+ LY ++  + + + +     FV 
Sbjct: 357 TLFFKRACDERIGLQDATYGRMLRAFSIEGI-LEEAIGLYQVILERGLTIKDNASDAFVD 415

Query: 543 ALCRQDPSLKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILDRGW 722
            L  +D   +    + D++R G  S C S  LS +I   C++R W+EAEEL Y++L++G 
Sbjct: 416 LLSEKDQYAEGYEIVRDIMRRG-FSPCTS-SLSKYITLLCKKRRWKEAEELLYMVLEKGL 473

Query: 723 LLDPLCCGSFVKRY 764
           L D L   S VK Y
Sbjct: 474 LPDTLSFCSLVKHY 487


>ref|XP_002305605.1| pentatricopeptide repeat-containing family protein [Populus
           trichocarpa] gi|222848569|gb|EEE86116.1|
           pentatricopeptide repeat-containing family protein
           [Populus trichocarpa]
          Length = 564

 Score =  196 bits (498), Expect = 8e-48
 Identities = 106/254 (41%), Positives = 159/254 (62%)
 Frame = +3

Query: 3   TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGM 182
           TWS+IA+IL KDG FERI +  D+GVY S +++ +ID  SKRGDF  AF+ +N++C + +
Sbjct: 208 TWSLIAQILCKDGNFERIVKFLDMGVYNSVLYNGVIDCCSKRGDFEAAFERLNQMCERKL 267

Query: 183 EPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAV 362
           +P FSTY++ILDGAC++ + EV E V+ IM  KG + +   S  D VI+K   + +    
Sbjct: 268 DPGFSTYSAILDGACKHGNEEVIERVMDIMAEKGLLPKCPLSQCDSVIQKFSDLCKMNVA 327

Query: 363 DLFFKRARDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILVSERCYSEFVI 542
            +FF+RA D+ + L++ATY CM  AL  E +RV++A+ LY+++  K I V +  Y  F+ 
Sbjct: 328 TMFFRRACDEKIGLQDATYGCMLKALSKE-ARVKEAIGLYSLISEKGIRVKDSTYHAFLD 386

Query: 543 ALCRQDPSLKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILDRGW 722
            L  +D   +    L D++R G   +  +  LS FI     +R WRE E+L  L+L++G 
Sbjct: 387 LLSEEDQYEEGYEILGDMMRRGF--RPGTVGLSKFILLLSRKRRWREVEDLLDLVLEKGL 444

Query: 723 LLDPLCCGSFVKRY 764
           L D LCC S V+ Y
Sbjct: 445 LPDSLCCCSLVEHY 458


>ref|XP_006285106.1| hypothetical protein CARUB_v10006439mg [Capsella rubella]
           gi|482553811|gb|EOA18004.1| hypothetical protein
           CARUB_v10006439mg [Capsella rubella]
          Length = 585

 Score =  165 bits (418), Expect = 1e-38
 Identities = 89/257 (34%), Positives = 156/257 (60%), Gaps = 3/257 (1%)
 Frame = +3

Query: 3   TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGM 182
           TW ++A+IL + G+ + + ++ + GV + +++  +++ YS+ G+F   F+ ++E+ +K +
Sbjct: 225 TWDLVAQILCEQGRSKSVVKLMETGVESCKIYTNLVECYSRNGEFDAVFNVIHEMDNKKL 284

Query: 183 EPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAV 362
           E SFS+Y+ +LD  CR  D E+   VL +MV K  ++   ++ +D +I++LC +G+TFA 
Sbjct: 285 ELSFSSYSCVLDDVCRLGDAELMGKVLGLMVEKKFLAVDASAVNDEIIERLCDMGKTFAS 344

Query: 363 DLFFKRA-RDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILV-SERCYSEF 536
           ++ F++A   + V L + TY CM  AL  +G R ++AV++Y ++  K I V  E CY+EF
Sbjct: 345 EMLFRKACNGETVRLRDGTYGCMLKALSRKG-RTKEAVDVYRLICRKGITVLDESCYTEF 403

Query: 537 VIALCRQDPSLKIS-NALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILD 713
             ALCR D S +     LVDVI+ G +  C +  LS+ +   C +R WR AE+L   +++
Sbjct: 404 ANALCRDDNSPEEELELLVDVIKRGFV-PC-TRRLSEVLASLCRKRRWRHAEKLLDSVME 461

Query: 714 RGWLLDPLCCGSFVKRY 764
                D   CG  ++RY
Sbjct: 462 MEVYFDSFSCGILMERY 478


>sp|O49558.2|PP331_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g21170
          Length = 585

 Score =  160 bits (405), Expect = 5e-37
 Identities = 88/257 (34%), Positives = 152/257 (59%), Gaps = 3/257 (1%)
 Frame = +3

Query: 3   TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGM 182
           TW +IA+IL + G+ + + ++ + GV + +++  +++ YS+ G+F   F  ++E+  K +
Sbjct: 225 TWDLIAQILCEQGRSKSVFKLMETGVESCKIYTNLVECYSRNGEFDAVFSLIHEMDDKKL 284

Query: 183 EPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAV 362
           E SF +Y  +LD ACR  D E  + VL +MV K  ++   ++ +D +I++LC +G+TFA 
Sbjct: 285 ELSFCSYGCVLDDACRLGDAEFIDKVLCLMVEKKFVTLGDSAVNDKIIERLCDMGKTFAS 344

Query: 363 DLFFKRA-RDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILV-SERCYSEF 536
           ++ F++A   + V L ++TY CM  A L    R ++AV++Y ++  K I V  E CY EF
Sbjct: 345 EMLFRKACNGETVRLWDSTYGCMLKA-LSRKKRTKEAVDVYRMICRKGITVLDESCYIEF 403

Query: 537 VIALCRQD-PSLKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILD 713
             ALCR D  S +    LVDVI+ G +  C + +LS+ +   C +R W+ AE+L   +++
Sbjct: 404 ANALCRDDNSSEEEEELLVDVIKRGFV-PC-THKLSEVLASMCRKRRWKSAEKLLDSVME 461

Query: 714 RGWLLDPLCCGSFVKRY 764
                D   CG  ++RY
Sbjct: 462 MEVYFDSFACGLLMERY 478


>ref|XP_006413812.1| hypothetical protein EUTSA_v10024760mg [Eutrema salsugineum]
           gi|557114982|gb|ESQ55265.1| hypothetical protein
           EUTSA_v10024760mg [Eutrema salsugineum]
          Length = 584

 Score =  157 bits (398), Expect = 3e-36
 Identities = 86/257 (33%), Positives = 151/257 (58%), Gaps = 3/257 (1%)
 Frame = +3

Query: 3   TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGM 182
           TW ++A++L + GKF+ + ++ + GV + +++  +++ YS+ G+F   F  + E+ +K +
Sbjct: 225 TWDLVAQVLCEQGKFKSVVKLMETGVESCKIYTNLVECYSRNGEFDAVFSVIQEMDAKKL 284

Query: 183 EPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAV 362
           E SF +Y  +LD ACR  D E+ + VL +MV K  ++   ++ +D +I++LC +G+TFA 
Sbjct: 285 ELSFCSYGYVLDDACRLGDSELIDKVLGLMVEKEFLTLDDSTVNDQIIERLCDMGKTFAS 344

Query: 363 DLFFKRARDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILV-SERCYSEFV 539
           ++ F RA +    + + TY CM  +L   G R ++AV++Y ++  K I V  E CY EF 
Sbjct: 345 EMLFHRACNGGT-VRDRTYGCMLKSLSVIG-RTKEAVDVYRLICRKGITVLDESCYKEFA 402

Query: 540 IALCRQD--PSLKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILD 713
            ALCR D   S +    L+DVI+ G +  C + +LS+ +   C +R W  AE+L   +++
Sbjct: 403 NALCRDDDNSSEEEGELLIDVIKRGFV-PC-TLKLSEVLASLCRKRRWNRAEKLLDSVME 460

Query: 714 RGWLLDPLCCGSFVKRY 764
                D   CG  ++RY
Sbjct: 461 MEVHFDSFSCGLLMERY 477


>ref|NP_193849.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|332659015|gb|AEE84415.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 551

 Score =  135 bits (339), Expect = 2e-29
 Identities = 74/205 (36%), Positives = 125/205 (60%), Gaps = 3/205 (1%)
 Frame = +3

Query: 3   TWSVIAKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGM 182
           TW +IA+IL + G+ + + ++ + GV + +++  +++ YS+ G+F   F  ++E+  K +
Sbjct: 225 TWDLIAQILCEQGRSKSVFKLMETGVESCKIYTNLVECYSRNGEFDAVFSLIHEMDDKKL 284

Query: 183 EPSFSTYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAV 362
           E SF +Y  +LD ACR  D E  + VL +MV K  ++   ++ +D +I++LC +G+TFA 
Sbjct: 285 ELSFCSYGCVLDDACRLGDAEFIDKVLCLMVEKKFVTLGDSAVNDKIIERLCDMGKTFAS 344

Query: 363 DLFFKRA-RDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILV-SERCYSEF 536
           ++ F++A   + V L ++TY CM  A L    R ++AV++Y ++  K I V  E CY EF
Sbjct: 345 EMLFRKACNGETVRLWDSTYGCMLKA-LSRKKRTKEAVDVYRMICRKGITVLDESCYIEF 403

Query: 537 VIALCRQD-PSLKISNALVDVIRSG 608
             ALCR D  S +    LVDVI+ G
Sbjct: 404 ANALCRDDNSSEEEEELLVDVIKRG 428


>ref|XP_002867861.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
           gi|297313697|gb|EFH44120.1| predicted protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 534

 Score =  128 bits (321), Expect = 3e-27
 Identities = 71/200 (35%), Positives = 122/200 (61%), Gaps = 3/200 (1%)
 Frame = +3

Query: 18  AKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGMEPSFS 197
           A IL + G+ + + ++ + GV + +++  +++ YS+ G+F   F  ++E+  K +E SFS
Sbjct: 213 AMILCEHGRSKSVVKLMETGVESCKIYTNLVECYSRNGEFDATFSLIHEMDGKKLELSFS 272

Query: 198 TYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAVDLFFK 377
           +Y  +LD ACR  D E+ + VL  MV K  ++   ++ +D +I++LC +G+TFA ++ F+
Sbjct: 273 SYGCVLDNACRLGDAELIDKVLGSMVEKKFLTLGDSALNDQMIERLCDMGKTFASEMLFR 332

Query: 378 RA-RDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKI-LVSERCYSEFVIALC 551
           +A   + V L  +TY CM  A L    R ++AV++Y ++  K I ++ E CY+EF  ALC
Sbjct: 333 KACNGETVRLRESTYGCMLKA-LSRKERTKEAVDVYRMICRKGINVLDESCYNEFANALC 391

Query: 552 RQDPSLKI-SNALVDVIRSG 608
           R D S +     LVDVI+ G
Sbjct: 392 RDDNSSEEGEELLVDVIKRG 411


>ref|XP_006826483.1| hypothetical protein AMTR_s00004p00243870 [Amborella trichopoda]
           gi|548830797|gb|ERM93720.1| hypothetical protein
           AMTR_s00004p00243870 [Amborella trichopoda]
          Length = 359

 Score =  126 bits (317), Expect = 7e-27
 Identities = 76/246 (30%), Positives = 126/246 (51%), Gaps = 22/246 (8%)
 Frame = +3

Query: 93  MFDFIIDGYSKRGDFGTAFDYVNELCSKGMEPSFSTYTSILDGACRYHDREVAENVLSIM 272
           +++ I+DGY + GDF  AF+ +  +  KG+EP F++Y SILDG+CR+ +   A  VL IM
Sbjct: 12  VYNLILDGYCRNGDFVIAFEVIERIYGKGLEPDFASYGSILDGSCRFGNMGTAVRVLRIM 71

Query: 273 VAKGHISRTRAS----------------------DHDLVIKKLCAVGRTFAVDLFFKRAR 386
           + K  +                             +D  I+KLC +G T A +L F  AR
Sbjct: 72  LEKRLVPTVGGEFSPNDCFTLNDNNCIVAAISYLHYDAFIRKLCKLGMTHAAELVFGIAR 131

Query: 387 DDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILVSERCYSEFVIALCRQDPS 566
              V L+NA Y  +  A      R+++AV +Y +L  + I ++    +  + AL +++PS
Sbjct: 132 SALVPLQNACYIALLKA-FSRDRRIKEAVRMYFLLLQRDIAMNISECNVLLNALFKEEPS 190

Query: 567 LKISNALVDVIRSGVISKCPSEELSDFINKQCEERHWREAEELFYLILDRGWLLDPLCCG 746
            +++  +  VI  G         +S +I+ QC +  W+EA EL ++ L+RG + D    G
Sbjct: 191 EEVNKVIKSVIEKGFYP--DPLAISSYISAQCSKGGWQEANELLWVTLERGVMPDGFVWG 248

Query: 747 SFVKRY 764
           SF++ Y
Sbjct: 249 SFIRHY 254


>emb|CAA17536.1| putative protein [Arabidopsis thaliana] gi|7268914|emb|CAB79117.1|
           putative protein [Arabidopsis thaliana]
          Length = 534

 Score =  126 bits (316), Expect = 1e-26
 Identities = 71/200 (35%), Positives = 120/200 (60%), Gaps = 3/200 (1%)
 Frame = +3

Query: 18  AKILYKDGKFERICRIFDVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGMEPSFS 197
           A IL + G+ + + ++ + GV + +++  +++ YS+ G+F   F  ++E+  K +E SF 
Sbjct: 213 AMILCEQGRSKSVFKLMETGVESCKIYTNLVECYSRNGEFDAVFSLIHEMDDKKLELSFC 272

Query: 198 TYTSILDGACRYHDREVAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAVDLFFK 377
           +Y  +LD ACR  D E  + VL +MV K  ++   ++ +D +I++LC +G+TFA ++ F+
Sbjct: 273 SYGCVLDDACRLGDAEFIDKVLCLMVEKKFVTLGDSAVNDKIIERLCDMGKTFASEMLFR 332

Query: 378 RA-RDDNVELENATYECMFAALLCEGSRVEDAVELYNILQFKKILV-SERCYSEFVIALC 551
           +A   + V L ++TY CM  A L    R ++AV++Y ++  K I V  E CY EF  ALC
Sbjct: 333 KACNGETVRLWDSTYGCMLKA-LSRKKRTKEAVDVYRMICRKGITVLDESCYIEFANALC 391

Query: 552 RQD-PSLKISNALVDVIRSG 608
           R D  S +    LVDVI+ G
Sbjct: 392 RDDNSSEEEEELLVDVIKRG 411


>ref|XP_006857674.1| hypothetical protein AMTR_s00061p00160470 [Amborella trichopoda]
           gi|548861770|gb|ERN19141.1| hypothetical protein
           AMTR_s00061p00160470 [Amborella trichopoda]
          Length = 372

 Score = 77.4 bits (189), Expect = 5e-12
 Identities = 45/152 (29%), Positives = 85/152 (55%)
 Frame = +3

Query: 309 DHDLVIKKLCAVGRTFAVDLFFKRARDDNVELENATYECMFAALLCEGSRVEDAVELYNI 488
           D+ + I++LC +G T A +L F  A +  V L+NA+Y  +         R+++AV +Y +
Sbjct: 119 DYGVFIRRLCKLGMTDAAELVFGIAHNALVFLQNASYIALLKGF-SRDKRIKEAVRMYFL 177

Query: 489 LQFKKILVSERCYSEFVIALCRQDPSLKISNALVDVIRSGVISKCPSEELSDFINKQCEE 668
           L  + I ++    +  + AL +++ S +++  +  VIR G      +  +S  I+ QC +
Sbjct: 178 LLQRDIALNICECNVLLNALFKEEQSEEVNKVIKSVIRKGFYPDPLA--ISSHISSQCSK 235

Query: 669 RHWREAEELFYLILDRGWLLDPLCCGSFVKRY 764
             W+EA EL +++L+RG + +   CGSF++ Y
Sbjct: 236 GGWQEANELLWVMLERGVMPNGFACGSFIRHY 267


>ref|XP_006487095.1| PREDICTED: pentatricopeptide repeat-containing protein At5g64320,
           mitochondrial-like isoform X1 [Citrus sinensis]
           gi|568867543|ref|XP_006487096.1| PREDICTED:
           pentatricopeptide repeat-containing protein At5g64320,
           mitochondrial-like isoform X2 [Citrus sinensis]
          Length = 728

 Score = 77.0 bits (188), Expect = 7e-12
 Identities = 48/182 (26%), Positives = 90/182 (49%)
 Frame = +3

Query: 96  FDFIIDGYSKRGDFGTAFDYVNELCSKGMEPSFSTYTSILDGACRYHDREVAENVLSIMV 275
           F+ +I G  K+  FG+A + VN +  KG EP+  TYT ++DG C+    E A  +++ M+
Sbjct: 395 FNILIHGLCKQRRFGSALELVNAMAVKGCEPNIVTYTILVDGFCKEGQLEKANIIINEML 454

Query: 276 AKGHISRTRASDHDLVIKKLCAVGRTFAVDLFFKRARDDNVELENATYECMFAALLCEGS 455
           AKG    T    ++ +I  LC+ G+       F        + +  T+  + +  LC+G 
Sbjct: 455 AKGLSLNT--VGYNCLIHALCSAGKIIEAMEIFGEMPSKGCKRDIYTFNSIISG-LCKGD 511

Query: 456 RVEDAVELYNILQFKKILVSERCYSEFVIALCRQDPSLKISNALVDVIRSGVISKCPSEE 635
           R+E+A+ LY  +  + +  +   Y+  + A  R+    +    + D++  G    CP +E
Sbjct: 512 RIEEALGLYQDMLLEGVTANTVTYNTLIHAFLRRGSLHEAHKLVNDMLFRG----CPLDE 567

Query: 636 LS 641
           ++
Sbjct: 568 IT 569


>ref|XP_006579638.1| PREDICTED: pentatricopeptide repeat-containing protein At2g26790,
            mitochondrial-like [Glycine max]
          Length = 801

 Score = 73.6 bits (179), Expect = 7e-11
 Identities = 66/249 (26%), Positives = 110/249 (44%), Gaps = 32/249 (12%)
 Frame = +3

Query: 69   DVGVYTSEMFDFIIDGYSKRGDFGTAFDYVNELCSKGMEPSFSTYTSILDGACRY-HDRE 245
            DV  YT+     +I+GY  +GD  TAF+   E+  KG++P   TY  +  G  R  H RE
Sbjct: 421  DVKHYTT-----LINGYCLQGDLVTAFNMFKEMKEKGLKPDIVTYNVLAAGLSRNGHARE 475

Query: 246  VAENVLSIMVAKGHISRTRASDHDLVIKKLCAVGRTFAVDLFFKRARDDNVELENA---- 413
              + +L  M ++G   +  ++ H ++I+ LC+ G+    +++F    D N+E+ +A    
Sbjct: 476  TVK-LLDFMESQG--MKPNSTTHKMIIEGLCSGGKVLEAEVYFNSLEDKNIEIYSAMVNG 532

Query: 414  ---------TYECMFAAL-----------------LCEGSRVEDAVELYNILQFKKILVS 515
                     +YE     L                 LC    +E AV+L + +    +  S
Sbjct: 533  YCETDLVKKSYEVFLKLLNQGDMAKKASCFKLLSKLCMTGDIEKAVKLLDRMLLSNVEPS 592

Query: 516  ERCYSEFVIALCRQDPSLKISNALVDV-IRSGVISKCPSEELSDFINKQCEERHWREAEE 692
            +  YS+ + ALC Q   +K +  L DV +  G      +  +   IN  C     +EA +
Sbjct: 593  KIMYSKILAALC-QAGDMKNARTLFDVFVHRGFTPDVVTYTI--MINSYCRMNCLQEAHD 649

Query: 693  LFYLILDRG 719
            LF  +  RG
Sbjct: 650  LFQDMKRRG 658

