BLASTX nr result

ID: Mentha25_contig00008819 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00008819
         (821 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU45090.1| hypothetical protein MIMGU_mgv1a021870mg, partial...   221   3e-55
emb|CBI19618.3| unnamed protein product [Vitis vinifera]              202   2e-49
ref|XP_006375040.1| hypothetical protein POPTR_0014s03840g [Popu...   189   8e-46
ref|XP_003589826.1| Pentatricopeptide repeat-containing protein ...   122   2e-25
ref|XP_002864446.1| pentatricopeptide repeat-containing protein ...   120   5e-25
ref|XP_003620912.1| Pentatricopeptide repeat-containing protein ...   120   6e-25
ref|XP_004144134.1| PREDICTED: pentatricopeptide repeat-containi...   117   4e-24
ref|NP_200442.1| pentatricopeptide repeat-containing protein [Ar...   117   7e-24
ref|XP_007020439.1| Tetratricopeptide repeat-like superfamily pr...   116   1e-23
ref|XP_007225613.1| hypothetical protein PRUPE_ppa003215mg [Prun...   116   1e-23
ref|XP_006401350.1| hypothetical protein EUTSA_v10015484mg [Eutr...   115   1e-23
ref|XP_004160258.1| PREDICTED: pentatricopeptide repeat-containi...   115   1e-23
ref|XP_004498089.1| PREDICTED: pentatricopeptide repeat-containi...   114   3e-23
ref|XP_004157162.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   113   1e-22
ref|XP_004142608.1| PREDICTED: pentatricopeptide repeat-containi...   113   1e-22
ref|XP_002282081.2| PREDICTED: pentatricopeptide repeat-containi...   113   1e-22
ref|XP_003532746.1| PREDICTED: pentatricopeptide repeat-containi...   113   1e-22
ref|XP_002529936.1| pentatricopeptide repeat-containing protein,...   113   1e-22
gb|EEE66861.1| hypothetical protein OsJ_23658 [Oryza sativa Japo...   113   1e-22
gb|EXB93457.1| hypothetical protein L484_006119 [Morus notabilis]     112   2e-22

>gb|EYU45090.1| hypothetical protein MIMGU_mgv1a021870mg, partial [Mimulus
           guttatus]
          Length = 277

 Score =  221 bits (563), Expect = 3e-55
 Identities = 123/252 (48%), Positives = 155/252 (61%), Gaps = 2/252 (0%)
 Frame = +2

Query: 71  RPAKLGAHSGRR-NVHSTVEALEEMAKNRVVADSNHLLELMQLTADMESMEGGDRIYEYV 247
           +P+      GR  N++  +E LE M +N   A+   + ELMQ T D +S+  GDRIYEYV
Sbjct: 2   KPSSNSKPFGRESNINLALETLEAMGRNETPAEPIRVSELMQFTVDSKSLPAGDRIYEYV 61

Query: 248 MRFSSNYSVSIFNEMIDMYLKLGDYRRGGRIFEQMLCRNIDSWNMMIMALVENGEAEEAI 427
           MRFSS+Y VS+FNE+IDMY KLGDYRR GR+FEQM+C+NIDSWN MI  L ENG+  EAI
Sbjct: 62  MRFSSSYDVSVFNELIDMYFKLGDYRRAGRVFEQMVCKNIDSWNTMIKGLSENGQENEAI 121

Query: 428 QVFTRLVKGEDKIKPNKTTFASILKACELLGEVEKGRAYFDSMRKDYGITPSSDHYLCYD 607
           Q+F +LVK                                    +DYGITPS DHY  Y 
Sbjct: 122 QLFAKLVK------------------------------------EDYGITPSLDHYTSYV 145

Query: 608 NLVRNSKREANNASRMIQKKPVSGQNRAP-SDRGMAYKKLMSLSEKAKEAGYVPDTRYVL 784
           NL R + R           + VS ++RA  SD+ +AY+KL  LS++AK+AGYV DTRYVL
Sbjct: 146 NLQRKTNR-----------RVVSEKDRAKNSDKSLAYEKLRCLSDEAKKAGYVADTRYVL 194

Query: 785 HDLDQEAKERAL 820
           HD+D+EAKERAL
Sbjct: 195 HDIDEEAKERAL 206


>emb|CBI19618.3| unnamed protein product [Vitis vinifera]
          Length = 576

 Score =  202 bits (513), Expect = 2e-49
 Identities = 116/281 (41%), Positives = 163/281 (58%), Gaps = 43/281 (15%)
 Frame = +2

Query: 107  NVHSTVEALEEMAKNRVVADSNHLLELMQLTADMESMEGGDRIYEYVMRFSSNYSVSIFN 286
            NV + +  ++EM +N V   +  L EL+Q+  D++ +E G R +E VMR SSN SV +FN
Sbjct: 227  NVEAALHVIDEMERNGVTVSALGLAELLQVCIDLKLLEVGKRAHELVMRLSSNPSVIVFN 286

Query: 287  EMIDMYLKLGDYRRGGRIFEQMLCRNIDSWNMMIMALVENGEAEEAIQVFTRLVKGEDKI 466
            ++++MY  LGD R   R+FE+M  R +DSWN MI+ LV+NGE EEA+ +F++L K  D I
Sbjct: 287  KLLEMYFDLGDTRSACRVFEEMRGRTLDSWNRMILGLVKNGEGEEALAIFSKLKK--DGI 344

Query: 467  KPNKTTFASILKACELLGEVEKGRAYFDSMRKDYGITPSSDHY----------------- 595
            +P+ +TF  +L ACE LG VE+G A+F+SM  DYGITPS +H+                 
Sbjct: 345  EPDGSTFIGVLSACECLGAVEEGLAHFNSMSTDYGITPSMEHFAIIVDLFGRLQKIAEAK 404

Query: 596  ------------LCYDNLVRNSKRE---------ANNASRMIQKKP-----VSGQNRAPS 697
                        + +  L +  K E           +  ++  KK      VS Q  A  
Sbjct: 405  EFIASMPLEPSSMIWQTLQKYLKTERVDEPAPLTTGSGLKLSHKKRVKSNFVSKQKNASP 464

Query: 698  DRGMAYKKLMSLSEKAKEAGYVPDTRYVLHDLDQEAKERAL 820
            ++  AY+KL SL +  KEAGYV DTRYVLHDLDQEAKE++L
Sbjct: 465  EKSKAYEKLRSLHKGVKEAGYVSDTRYVLHDLDQEAKEKSL 505


>ref|XP_006375040.1| hypothetical protein POPTR_0014s03840g [Populus trichocarpa]
           gi|550323354|gb|ERP52837.1| hypothetical protein
           POPTR_0014s03840g [Populus trichocarpa]
          Length = 429

 Score =  189 bits (481), Expect = 8e-46
 Identities = 106/283 (37%), Positives = 165/283 (58%), Gaps = 46/283 (16%)
 Frame = +2

Query: 110 VHSTVEALEEMAKNRVVADSNHLLELMQLTADMESMEGGDRIYEYVMRFSSNY--SVSIF 283
           V + +E ++E  +N   AD   +++L+Q+ AD++ +E G ++ EYVMR SS +  SV + 
Sbjct: 79  VEAALEIMDEKERNGGYADLLDIVKLIQVCADLKLLEAGKKVDEYVMRSSSKFKSSVVVL 138

Query: 284 NEMIDMYLKLGDYRRGGRIFEQMLCRNIDSWNMMIMALVENGEAEEAIQVFTRLVKGEDK 463
           N +++MY KLGD      IFEQM  RN+DSWN M++ L EN E E+A+++F+++ KG D 
Sbjct: 139 NNLVEMYCKLGDTNGAREIFEQMGVRNLDSWNKMLLGLAENKEGEKALEIFSQM-KG-DG 196

Query: 464 IKPNKTTFASILKACELLGEVEKGRAYFDSMRKDYGITPSSDHYLCYDNLVRNSKREANN 643
           I+P+ ++F  +L AC  LG  ++G+ +F+SM +DYGITP+ +HY  + +L+  + + A  
Sbjct: 197 IRPDGSSFVGVLMACVCLGAEKEGQKHFESMSRDYGITPTVEHYEVFVDLLGRTGKIA-E 255

Query: 644 ASRMIQKKPV--------------------------------------------SGQNRA 691
           A  ++   P+                                            +   R 
Sbjct: 256 AKELVSNMPIDPNSRIWETLQKYSKARTQGQLGYPVSPPGLKLGDMKRAKDNTNTNHRRV 315

Query: 692 PSDRGMAYKKLMSLSEKAKEAGYVPDTRYVLHDLDQEAKERAL 820
            SDR  AY+KL SLS++ ++AGYVPDTR+VLHDLDQEAKE+AL
Sbjct: 316 TSDRSKAYEKLRSLSKEVRDAGYVPDTRFVLHDLDQEAKEKAL 358


>ref|XP_003589826.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355478874|gb|AES60077.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 526

 Score =  122 bits (305), Expect = 2e-25
 Identities = 66/198 (33%), Positives = 108/198 (54%)
 Frame = +2

Query: 77  AKLGAHSGRRNVHSTVEALEEMAKNRVVADSNHLLELMQLTADMESMEGGDRIYEYVMRF 256
           A +  ++   N +  ++    M    V  D   +L ++   AD+ ++  G+ I+ Y+ + 
Sbjct: 214 AMISGYTQAHNPNEAIKLFRRMQLENVKPDEIAILAVLSACADLGALHLGEWIHNYIEKH 273

Query: 257 SSNYSVSIFNEMIDMYLKLGDYRRGGRIFEQMLCRNIDSWNMMIMALVENGEAEEAIQVF 436
             +  V ++N +IDMY K G+ R+   +FE M  + I +W  MI  L  +G  +EA++VF
Sbjct: 274 KLSKIVPLYNSLIDMYAKSGNIRKALELFENMKHKTIITWTTMIAGLALHGLGKEALRVF 333

Query: 437 TRLVKGEDKIKPNKTTFASILKACELLGEVEKGRAYFDSMRKDYGITPSSDHYLCYDNLV 616
           + + K ED++KPN+ TF +IL AC  +G VE GR YF SMR  YGI P  +HY C  +L+
Sbjct: 334 SCMEK-EDRVKPNEVTFIAILSACSHVGLVELGRDYFTSMRSRYGIEPKIEHYGCMIDLL 392

Query: 617 RNSKREANNASRMIQKKP 670
             +      A  M+ + P
Sbjct: 393 GRA-GHLQEAKEMVLRMP 409


>ref|XP_002864446.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297310281|gb|EFH40705.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 531

 Score =  120 bits (302), Expect = 5e-25
 Identities = 70/194 (36%), Positives = 106/194 (54%)
 Frame = +2

Query: 89  AHSGRRNVHSTVEALEEMAKNRVVADSNHLLELMQLTADMESMEGGDRIYEYVMRFSSNY 268
           A SGR +    +E  + M    V  D   LL ++   AD+ S+E G+RI  YV     N 
Sbjct: 226 ARSGRAS--EAIEVFQRMLMENVDPDEVTLLAVLSACADLGSLELGERICSYVDHRGMNR 283

Query: 269 SVSIFNEMIDMYLKLGDYRRGGRIFEQMLCRNIDSWNMMIMALVENGEAEEAIQVFTRLV 448
           +VS+ N +IDMY K G+  +   +FE +  RN+ +W  +I  L  +G   EA+ +F R+V
Sbjct: 284 AVSLNNAVIDMYAKSGNITKALEVFESVNERNVVTWTTIITGLATHGHGAEALVMFDRMV 343

Query: 449 KGEDKIKPNKTTFASILKACELLGEVEKGRAYFDSMRKDYGITPSSDHYLCYDNLVRNSK 628
           K    +KPN  TF +IL AC  +G V+ G  +F+SMR  YGI P+ +HY C  +L+  + 
Sbjct: 344 KA--GVKPNDVTFIAILSACSHVGWVDLGNRFFNSMRSKYGINPNIEHYGCMIDLLGRAG 401

Query: 629 REANNASRMIQKKP 670
           +    A  +I+  P
Sbjct: 402 K-LREAEEVIKSMP 414


>ref|XP_003620912.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355495927|gb|AES77130.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 415

 Score =  120 bits (301), Expect = 6e-25
 Identities = 85/268 (31%), Positives = 139/268 (51%), Gaps = 30/268 (11%)
 Frame = +2

Query: 107 NVHSTVEALEEMAKNRVVADSNHLLELMQLTADMESMEGGDRIYEYVMRFSSNYSVSIFN 286
           NV+  +E + + A     AD +  L L++L  D++S+E G R++E++ R     +V + N
Sbjct: 83  NVNQVLELMGQGA----FADYSDFLSLLKLCEDLKSLELGKRVHEFLRRSKFGGNVELCN 138

Query: 287 EMIDMYLKLGDYRRGGRIFEQMLCRNIDSWNMMIMALVENGEAEEAIQVFTRLVKGEDKI 466
            +I +Y+K G  +   ++F++M  RN+ SWN+MI     NG   + + VF ++   +  I
Sbjct: 139 RLIGLYVKCGSVKDARKVFDKMPDRNVGSWNLMIGGYNVNGLGIDGLLVFKQM--RQQGI 196

Query: 467 KPNKTTFASILKACELLGEVEKGRAYFDSMRKDYG---------------ITPSSDHYLC 601
            P++ TFA +L  C L+  VE+G  ++  +   +G               I    +   C
Sbjct: 197 VPDEETFALVLAVCALVDGVEEGMEHYLGVVNIFGCAGRLNEAHEFIENIIHGDLEREDC 256

Query: 602 YDNL---VRNSKREANNASRMIQKKPVSG------QNRAPSDR-GMAY-----KKLMSLS 736
            D L   +  SK  A++   + Q+K  S       +NR    R  M Y     +KL  L+
Sbjct: 257 ADELLTVIDPSKAAADDKVPLPQRKKQSAINMMEEKNRVSEYRCNMPYEEEDDEKLRGLT 316

Query: 737 EKAKEAGYVPDTRYVLHDLDQEAKERAL 820
            + +EAGYVPDTRYVLHD+D+E KE+AL
Sbjct: 317 GQMREAGYVPDTRYVLHDIDEEEKEKAL 344


>ref|XP_004144134.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g56310-like [Cucumis sativus]
           gi|449493602|ref|XP_004159370.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At5g56310-like [Cucumis sativus]
          Length = 548

 Score =  117 bits (294), Expect = 4e-24
 Identities = 61/168 (36%), Positives = 97/168 (57%)
 Frame = +2

Query: 113 HSTVEALEEMAKNRVVADSNHLLELMQLTADMESMEGGDRIYEYVMRFSSNYSVSIFNEM 292
           H  +E   +M    V  D   +L ++   AD+ ++E G+ I+ Y+ +      VS++N +
Sbjct: 236 HEAIELFRKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVSLYNAL 295

Query: 293 IDMYLKLGDYRRGGRIFEQMLCRNIDSWNMMIMALVENGEAEEAIQVFTRLVKGEDKIKP 472
           IDMY K G+ RR   +FE M  +++ +W+ +I AL  +G   EAI +F R+ K   K++P
Sbjct: 296 IDMYAKSGNIRRALEVFENMKQKSVITWSTVIAALALHGLGGEAIDMFLRMEKA--KVRP 353

Query: 473 NKTTFASILKACELLGEVEKGRAYFDSMRKDYGITPSSDHYLCYDNLV 616
           N+ TF +IL AC  +G V+ GR YFD M+  Y I P  +HY C  +L+
Sbjct: 354 NEVTFVAILSACSHVGMVDVGRYYFDQMQSMYKIEPKIEHYGCMIDLL 401



 Score = 60.5 bits (145), Expect = 7e-07
 Identities = 38/153 (24%), Positives = 78/153 (50%), Gaps = 1/153 (0%)
 Frame = +2

Query: 80  KLGAHSGRRNVHS-TVEALEEMAKNRVVADSNHLLELMQLTADMESMEGGDRIYEYVMRF 256
           KL A    R +H+ TV +  +M       D N    L+Q+ +    +    +++++V   
Sbjct: 134 KLSAVEVGRQIHTQTVSSALDM-------DVNVATSLIQMYSSCGFVSDARKLFDFV--- 183

Query: 257 SSNYSVSIFNEMIDMYLKLGDYRRGGRIFEQMLCRNIDSWNMMIMALVENGEAEEAIQVF 436
                V+++N M+  Y+K+G+ +   ++F +M  RN+ SW  +I    +     EAI++F
Sbjct: 184 -GFKDVALWNAMVAGYVKVGELKSARKVFNEMPQRNVISWTTLIAGYAQTNRPHEAIELF 242

Query: 437 TRLVKGEDKIKPNKTTFASILKACELLGEVEKG 535
            ++    ++++P++    ++L AC  LG +E G
Sbjct: 243 RKMQL--EEVEPDEIAMLAVLSACADLGALELG 273


>ref|NP_200442.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75171630|sp|Q9FMA1.1|PP433_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At5g56310 gi|10177829|dbj|BAB11258.1| unnamed protein
           product [Arabidopsis thaliana]
           gi|332009364|gb|AED96747.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 530

 Score =  117 bits (292), Expect = 7e-24
 Identities = 69/194 (35%), Positives = 106/194 (54%)
 Frame = +2

Query: 89  AHSGRRNVHSTVEALEEMAKNRVVADSNHLLELMQLTADMESMEGGDRIYEYVMRFSSNY 268
           A SGR +    +E  + M    V  D   LL ++   AD+ S+E G+RI  YV     N 
Sbjct: 226 AKSGRAS--EAIEVFQRMLMENVEPDEVTLLAVLSACADLGSLELGERICSYVDHRGMNR 283

Query: 269 SVSIFNEMIDMYLKLGDYRRGGRIFEQMLCRNIDSWNMMIMALVENGEAEEAIQVFTRLV 448
           +VS+ N +IDMY K G+  +   +FE +  RN+ +W  +I  L  +G   EA+ +F R+V
Sbjct: 284 AVSLNNAVIDMYAKSGNITKALDVFECVNERNVVTWTTIIAGLATHGHGAEALAMFNRMV 343

Query: 449 KGEDKIKPNKTTFASILKACELLGEVEKGRAYFDSMRKDYGITPSSDHYLCYDNLVRNSK 628
           K    ++PN  TF +IL AC  +G V+ G+  F+SMR  YGI P+ +HY C  +L+  + 
Sbjct: 344 KA--GVRPNDVTFIAILSACSHVGWVDLGKRLFNSMRSKYGIHPNIEHYGCMIDLLGRAG 401

Query: 629 REANNASRMIQKKP 670
           +    A  +I+  P
Sbjct: 402 K-LREADEVIKSMP 414



 Score = 57.0 bits (136), Expect = 8e-06
 Identities = 30/90 (33%), Positives = 52/90 (57%), Gaps = 2/90 (2%)
 Frame = +2

Query: 272 VSIFNEMIDMYLKLGDYRRGGRIFEQMLC--RNIDSWNMMIMALVENGEAEEAIQVFTRL 445
           V+++N ++  Y K+G+      + E M C  RN  SW  +I    ++G A EAI+VF R+
Sbjct: 182 VNVWNALLAGYGKVGEMDEARSLLEMMPCWVRNEVSWTCVISGYAKSGRASEAIEVFQRM 241

Query: 446 VKGEDKIKPNKTTFASILKACELLGEVEKG 535
           +   + ++P++ T  ++L AC  LG +E G
Sbjct: 242 LM--ENVEPDEVTLLAVLSACADLGSLELG 269


>ref|XP_007020439.1| Tetratricopeptide repeat-like superfamily protein [Theobroma cacao]
           gi|508720067|gb|EOY11964.1| Tetratricopeptide
           repeat-like superfamily protein [Theobroma cacao]
          Length = 615

 Score =  116 bits (290), Expect = 1e-23
 Identities = 60/164 (36%), Positives = 96/164 (58%)
 Frame = +2

Query: 122 VEALEEMAKNRVVADSNHLLELMQLTADMESMEGGDRIYEYVMRFSSNYSVSIFNEMIDM 301
           VE   EM K  V A+   ++ ++   AD+ +++ G R++EY  R     +V + N +IDM
Sbjct: 228 VELFIEMQKIGVEANEVTVVAVLAACADLGALDLGKRVHEYSKRSGFGKNVRVLNTLIDM 287

Query: 302 YLKLGDYRRGGRIFEQMLCRNIDSWNMMIMALVENGEAEEAIQVFTRLVKGEDKIKPNKT 481
           Y+K G      R+F +M  R + SW+ MI  L  +G+A+EA++VF+ ++  E  + PN  
Sbjct: 288 YVKCGCLEEARRVFNEMEERTVVSWSAMIQGLAMHGQAQEAVRVFSMMI--EMGVMPNGV 345

Query: 482 TFASILKACELLGEVEKGRAYFDSMRKDYGITPSSDHYLCYDNL 613
           TF  +L AC  +G V++GR +F  M +DYGI P  +HY C  +L
Sbjct: 346 TFIGLLHACSHMGLVDEGRRFFSGMIRDYGIIPEIEHYGCMVDL 389


>ref|XP_007225613.1| hypothetical protein PRUPE_ppa003215mg [Prunus persica]
           gi|462422549|gb|EMJ26812.1| hypothetical protein
           PRUPE_ppa003215mg [Prunus persica]
          Length = 592

 Score =  116 bits (290), Expect = 1e-23
 Identities = 65/188 (34%), Positives = 102/188 (54%)
 Frame = +2

Query: 110 VHSTVEALEEMAKNRVVADSNHLLELMQLTADMESMEGGDRIYEYVMRFSSNYSVSIFNE 289
           V   VE L  + K +V  D +   +LMQ   + +++E    ++E + R  S  +VS +N 
Sbjct: 210 VKEAVEILGMLEKQQVQVDLHLYFQLMQACGEAKALEEAKFVHENITRLLSPLNVSTYNR 269

Query: 290 MIDMYLKLGDYRRGGRIFEQMLCRNIDSWNMMIMALVENGEAEEAIQVFTRLVKGEDKIK 469
           +++MY K G       +F QM  RN+ SW++MI  L +NG  E+AI +FT   K    +K
Sbjct: 270 ILEMYSKCGSMDSTFMVFNQMPNRNLTSWDIMIAWLAKNGLGEDAIDLFTEFKKA--GLK 327

Query: 470 PNKTTFASILKACELLGEVEKGRAYFDSMRKDYGITPSSDHYLCYDNLVRNSKREANNAS 649
           P+   F  +  AC +LG+  +G  +F+SM KDYGI PS DHY+   +++  S      A 
Sbjct: 328 PDGQMFIGVFYACSVLGDTTEGLLHFESMSKDYGIVPSMDHYVSVVDML-GSTGYLEEAL 386

Query: 650 RMIQKKPV 673
             I+K P+
Sbjct: 387 EFIEKMPL 394



 Score = 68.2 bits (165), Expect = 4e-09
 Identities = 50/189 (26%), Positives = 90/189 (47%), Gaps = 19/189 (10%)
 Frame = +2

Query: 311 LGDYRRGGRIFEQM-----LCRNIDSWNMMIMALVENGEAEEAIQVFTRLVKGEDKIKPN 475
           LGD   G   FE M     +  ++D +  ++  L   G  EEA++   ++      ++PN
Sbjct: 343 LGDTTEGLLHFESMSKDYGIVPSMDHYVSVVDMLGSTGYLEEALEFIEKM-----PLEPN 397

Query: 476 KTTFASILKACELLGEVEKGRAYFDSMRK----------DYGITPSSDHYLCYDNLVRNS 625
              + +++  C + G++E G    + + +            G+ P  D      +LV+  
Sbjct: 398 VDVWKTLMNLCRVHGQLELGDRCAELVEQLDASSLNEQSKAGLVPVKD-----SDLVKEK 452

Query: 626 KREANNASRMIQKKPVSGQNRAPS----DRGMAYKKLMSLSEKAKEAGYVPDTRYVLHDL 793
           +++   A  +++ +    + RA      +    Y +L  L E+ KEAGY+P+TR+VLHD+
Sbjct: 453 EKKKLAAQNLLEVRSRVHEYRAGDTSHPENDKIYAQLRGLREQMKEAGYIPETRFVLHDI 512

Query: 794 DQEAKERAL 820
           DQE KE AL
Sbjct: 513 DQEGKEDAL 521


>ref|XP_006401350.1| hypothetical protein EUTSA_v10015484mg [Eutrema salsugineum]
           gi|557102440|gb|ESQ42803.1| hypothetical protein
           EUTSA_v10015484mg [Eutrema salsugineum]
          Length = 561

 Score =  115 bits (289), Expect = 1e-23
 Identities = 73/194 (37%), Positives = 103/194 (53%), Gaps = 2/194 (1%)
 Frame = +2

Query: 77  AKLGAHSGRRNVHSTVEALEEMAKNRVVADSNHLLELMQLTADMESMEGGDRIYEYVMRF 256
           AK+G  S        +E  + M  + V  D   LL  +   AD+ S+E G+RI  YV   
Sbjct: 246 AKIGRAS------EAIEVFQRMLLDNVEPDEVTLLAALSACADLGSIEFGERICSYVDHR 299

Query: 257 SSNYSVSIFNEMIDMYLKLGDYRRGGRIFEQMLCRNIDSWNMMIMALVENGEAEEAIQVF 436
             N +VS+ N +IDMY K GD ++    FE +  RN+ +W  MI  L  +G   EA+ +F
Sbjct: 300 GMNRAVSMNNALIDMYAKSGDIKKALVEFESLNERNVVTWTTMITGLATHGLGAEALAMF 359

Query: 437 TRLVKGEDKIKPNKTTFASILKACELLGEVEKGRAYFDSMRKDYGITPSSDHYLCYDNLV 616
            R+VK    +KPN  TF +IL AC  +G V  G+  F SMR  YGI P+ +HY C  +L+
Sbjct: 360 NRMVKA--GVKPNDVTFIAILSACSHVGLVGLGKCLFASMRWKYGIEPNIEHYGCMIDLL 417

Query: 617 RNSK--REANNASR 652
             +   REA   S+
Sbjct: 418 GRAGRIREAEEVSK 431


>ref|XP_004160258.1| PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like
            [Cucumis sativus]
          Length = 583

 Score =  115 bits (289), Expect = 1e-23
 Identities = 83/302 (27%), Positives = 138/302 (45%), Gaps = 56/302 (18%)
 Frame = +2

Query: 83   LGAHSGRRNVHSTVEALEEMAKNRVVADSNHLLELMQLTADMESMEGGDRIYEYVMRFSS 262
            +G ++        ++   EM    ++ +   ++ ++   ADM ++  G RI+++  R   
Sbjct: 217  IGGYAQCGKSKEAIDLFLEMEDAGLLPNEVTVVAVLVACADMGNLVLGRRIHDFSNRSGY 276

Query: 263  NYSVSIFNEMIDMYLKLGDYRRGGRIFEQMLCRNIDSWNMMIMALVENGEAEEAIQVFTR 442
              ++ + N +IDMY+K G      RIF+ M  R + SW+ MI  L  +G AE+A+ +F +
Sbjct: 277  EKNIRVCNTLIDMYVKCGCLEDACRIFDNMEERTVVSWSAMIAGLAAHGRAEDALALFNK 336

Query: 443  LVKGEDKIKPNKTTFASILKACELLGEVEKGRAYFDSMRKDYGITPSSDHYLCYDNL--- 613
            ++     +KPN  TF  IL AC  +G VEKGR YF SM +DYGI P  +HY C  +L   
Sbjct: 337  MIN--TGVKPNAVTFIGILHACSHMGMVEKGRKYFASMTRDYGIVPRIEHYGCMVDLFSR 394

Query: 614  -------------------------------VRNSKREANNASRMIQK------------ 664
                                           V  + + A  A+R + K            
Sbjct: 395  AGLLQEAHEFIMNMPIAPNGVVWGALLGGCKVHKNVKLAEEATRHLSKLDPLNDGYYVVL 454

Query: 665  ----------KPVSGQNRAPSDRGMAYKKLMSLSEKAKEAGYVPDTRYVLHDLDQEAKER 814
                      + V+   +   DRG  ++KL+   ++ K  GYVP+T  VL D++++ KE+
Sbjct: 455  SNIYAEAGRWEDVARVRKLMRDRG-TWEKLL---QRMKLKGYVPNTSVVLLDMEEDQKEK 510

Query: 815  AL 820
             L
Sbjct: 511  FL 512


>ref|XP_004498089.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g56310-like [Cicer arietinum]
          Length = 546

 Score =  114 bits (286), Expect = 3e-23
 Identities = 60/180 (33%), Positives = 100/180 (55%)
 Frame = +2

Query: 77  AKLGAHSGRRNVHSTVEALEEMAKNRVVADSNHLLELMQLTADMESMEGGDRIYEYVMRF 256
           A +  ++   N +  ++    M    V  D   +L ++   AD+ ++  G+ I+ Y+ + 
Sbjct: 234 ALISGYTQAHNPNEAIKLFRRMQLENVKPDEIAILAVLSACADLGALHLGEWIHNYIEKH 293

Query: 257 SSNYSVSIFNEMIDMYLKLGDYRRGGRIFEQMLCRNIDSWNMMIMALVENGEAEEAIQVF 436
             N  V ++N +IDMY K G+  +  ++FE M  + I +W  MI  L  +G  +EA+ VF
Sbjct: 294 KLNKIVPLYNSLIDMYAKSGNISKALKLFENMNHKTIITWTTMIAGLALHGLGKEALHVF 353

Query: 437 TRLVKGEDKIKPNKTTFASILKACELLGEVEKGRAYFDSMRKDYGITPSSDHYLCYDNLV 616
           +R+ K E ++KPN+ TF ++L AC  +  VE+G  YF SMR  YGI P  +HY C  +L+
Sbjct: 354 SRMEK-EGRVKPNEVTFIAVLCACSHVRLVEQGLNYFTSMRSRYGIEPKVEHYGCMIDLL 412


>ref|XP_004157162.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
           protein At2g25580-like [Cucumis sativus]
          Length = 731

 Score =  113 bits (282), Expect = 1e-22
 Identities = 63/198 (31%), Positives = 104/198 (52%)
 Frame = +2

Query: 80  KLGAHSGRRNVHSTVEALEEMAKNRVVADSNHLLELMQLTADMESMEGGDRIYEYVMRFS 259
           KL        +   V+ LE + K  +  D +  L+LM    +  S+E    +  YV++  
Sbjct: 339 KLDEFCKEGKLKEAVQILEVLEKQHIPVDLSRYLDLMNACGEARSLEEAKVVCNYVIKSQ 398

Query: 260 SNYSVSIFNEMIDMYLKLGDYRRGGRIFEQMLCRNIDSWNMMIMALVENGEAEEAIQVFT 439
           ++  VS +N++++MY K G       IF +M  RNI SW+ MI  L +NG  E+AI +F 
Sbjct: 399 THVKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNITSWDTMITWLAKNGLGEDAIDLFY 458

Query: 440 RLVKGEDKIKPNKTTFASILKACELLGEVEKGRAYFDSMRKDYGITPSSDHYLCYDNLVR 619
              K    ++P+   F  +  AC +LG+ ++G  +F+SM K+YGITPS  HY+   +++ 
Sbjct: 459 EFKKA--GLRPDGKMFIGVFSACSVLGDADEGMLHFESMTKNYGITPSMHHYVSIVDML- 515

Query: 620 NSKREANNASRMIQKKPV 673
            S    + A   I+K P+
Sbjct: 516 GSIGFVDEAVEFIEKMPL 533


>ref|XP_004142608.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g25580-like [Cucumis sativus]
          Length = 671

 Score =  113 bits (282), Expect = 1e-22
 Identities = 63/198 (31%), Positives = 104/198 (52%)
 Frame = +2

Query: 80  KLGAHSGRRNVHSTVEALEEMAKNRVVADSNHLLELMQLTADMESMEGGDRIYEYVMRFS 259
           KL        +   V+ LE + K  +  D +  L+LM    +  S+E    +  YV++  
Sbjct: 279 KLDEFCKEGKLKEAVQILEVLEKQHIPVDLSRYLDLMNACGEARSLEEAKVVCNYVIKSQ 338

Query: 260 SNYSVSIFNEMIDMYLKLGDYRRGGRIFEQMLCRNIDSWNMMIMALVENGEAEEAIQVFT 439
           ++  VS +N++++MY K G       IF +M  RNI SW+ MI  L +NG  E+AI +F 
Sbjct: 339 THVKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNITSWDTMITWLAKNGLGEDAIDLFY 398

Query: 440 RLVKGEDKIKPNKTTFASILKACELLGEVEKGRAYFDSMRKDYGITPSSDHYLCYDNLVR 619
              K    ++P+   F  +  AC +LG+ ++G  +F+SM K+YGITPS  HY+   +++ 
Sbjct: 399 EFKKA--GLRPDGKMFIGVFSACSVLGDADEGMLHFESMTKNYGITPSMHHYVSIVDML- 455

Query: 620 NSKREANNASRMIQKKPV 673
            S    + A   I+K P+
Sbjct: 456 GSIGFVDEAVEFIEKMPL 473


>ref|XP_002282081.2| PREDICTED: pentatricopeptide repeat-containing protein
           At2g20540-like [Vitis vinifera]
          Length = 541

 Score =  113 bits (282), Expect = 1e-22
 Identities = 67/196 (34%), Positives = 108/196 (55%), Gaps = 3/196 (1%)
 Frame = +2

Query: 95  SGRRNVHSTVEALEEMAKNRVVA---DSNHLLELMQLTADMESMEGGDRIYEYVMRFSSN 265
           SG   +    +ALE   + ++V    D   L+ ++   A + ++E G  I+ Y  +    
Sbjct: 222 SGYARIGCYADALEFFRRMQMVGIEPDEISLVSVLPACAQLGALELGKWIHFYADKAGFL 281

Query: 266 YSVSIFNEMIDMYLKLGDYRRGGRIFEQMLCRNIDSWNMMIMALVENGEAEEAIQVFTRL 445
            ++ + N +I+MY K G    G R+F+QM  R++ SW+ MI+ L  +G A EAI++F  +
Sbjct: 282 RNICVCNALIEMYAKCGSIDEGRRLFDQMNERDVISWSTMIVGLANHGRAHEAIELFQEM 341

Query: 446 VKGEDKIKPNKTTFASILKACELLGEVEKGRAYFDSMRKDYGITPSSDHYLCYDNLVRNS 625
            K   KI+PN  TF  +L AC   G + +G  YF+SM++DY I P  +HY C  NL+  S
Sbjct: 342 QKA--KIEPNIITFVGLLSACAHAGLLNEGLRYFESMKRDYNIEPGVEHYGCLVNLLGLS 399

Query: 626 KREANNASRMIQKKPV 673
            R  + A  +I+K P+
Sbjct: 400 GR-LDQALELIKKMPM 414


>ref|XP_003532746.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g25580-like isoform X1 [Glycine max]
          Length = 664

 Score =  113 bits (282), Expect = 1e-22
 Identities = 62/189 (32%), Positives = 104/189 (55%)
 Frame = +2

Query: 107 NVHSTVEALEEMAKNRVVADSNHLLELMQLTADMESMEGGDRIYEYVMRFSSNYSVSIFN 286
           NV   VE LE + K  +  D    L+LM    + +S+E    ++ + ++  S   VS +N
Sbjct: 281 NVKEAVEVLELLEKLDIPVDLPRYLQLMHQCGENKSLEEAKNVHRHALQHLSPLQVSTYN 340

Query: 287 EMIDMYLKLGDYRRGGRIFEQMLCRNIDSWNMMIMALVENGEAEEAIQVFTRLVKGEDKI 466
            +++MYL+ G       IF  M  RN+ +W+ MI  L +NG AE++I +FT+       +
Sbjct: 341 RILEMYLECGSVDDALNIFNNMPERNLTTWDTMITQLAKNGFAEDSIDLFTQF--KNLGL 398

Query: 467 KPNKTTFASILKACELLGEVEKGRAYFDSMRKDYGITPSSDHYLCYDNLVRNSKREANNA 646
           KP+   F  +L AC +LG++++G  +F+SM KDYGI PS  H++   +++  S    + A
Sbjct: 399 KPDGQMFIGVLFACGMLGDIDEGMQHFESMNKDYGIVPSMTHFVSVVDMI-GSIGHLDEA 457

Query: 647 SRMIQKKPV 673
              I+K P+
Sbjct: 458 FEFIEKMPM 466


>ref|XP_002529936.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223530566|gb|EEF32444.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 548

 Score =  113 bits (282), Expect = 1e-22
 Identities = 58/190 (30%), Positives = 110/190 (57%)
 Frame = +2

Query: 104 RNVHSTVEALEEMAKNRVVADSNHLLELMQLTADMESMEGGDRIYEYVMRFSSNYSVSIF 283
           + V   VE L  + + RV+ D    L+LM++  + ++ E    ++++++R  S  +VS F
Sbjct: 212 KKVKEAVEVLNLLEERRVLVDLPRFLQLMRICGEAKASEEAKVVHDHLVRLQSPLAVSTF 271

Query: 284 NEMIDMYLKLGDYRRGGRIFEQMLCRNIDSWNMMIMALVENGEAEEAIQVFTRLVKGEDK 463
           N++++MY K GD      +F +M  RN+ +W+ MI  L +NG  E+AI +F++  +    
Sbjct: 272 NKILEMYGKCGDMDSAFAVFNKMPKRNLTTWDTMIAWLAKNGLGEDAIDLFSQFKQA--G 329

Query: 464 IKPNKTTFASILKACELLGEVEKGRAYFDSMRKDYGITPSSDHYLCYDNLVRNSKREANN 643
           + P+   F  +  AC ++G+V +G  +F+SM+KDYGI PS +H++   +++  +    + 
Sbjct: 330 LVPDAQLFIGVFSACGVVGDVIEGMLHFESMKKDYGIVPSMEHFVSIVDML-GTIGHLDE 388

Query: 644 ASRMIQKKPV 673
           A   I+K P+
Sbjct: 389 ALEFIEKMPM 398


>gb|EEE66861.1| hypothetical protein OsJ_23658 [Oryza sativa Japonica Group]
          Length = 728

 Score =  113 bits (282), Expect = 1e-22
 Identities = 69/243 (28%), Positives = 119/243 (48%), Gaps = 10/243 (4%)
 Frame = +2

Query: 122  VEALEEMAKNRVVADSNHLLELMQLTADMESMEGGDRIYEYVMRFSSNYSVSIFNEMIDM 301
            ++    M +  V AD   L  +    A++  +E G +++  V +        + + ++DM
Sbjct: 333  LDLFRRMLREGVAADRFTLTSVAAACANVGMVEQGRQVHGCVEKLWYKLDAPLASAIVDM 392

Query: 302  YLKLGDYRRGGRIFEQMLCRNIDSWNMMIMALVENGEAEEAIQVFTRLVKGEDKIKPNKT 481
            Y K G+      IF++   +NI  W  M+ +   +G+   AI++F R+    +K+ PN+ 
Sbjct: 393  YAKCGNLEDARSIFDRACTKNIAVWTSMLCSYASHGQGRIAIELFERMTA--EKMTPNEI 450

Query: 482  TFASILKACELLGEVEKGRAYFDSMRKDYGITPSSDHYLCYDNL---------VRNSKRE 634
            T   +L AC  +G V +G  YF  M+++YGI PS +HY C  +L          +N   E
Sbjct: 451  TLVGVLSACSHVGLVSEGELYFKQMQEEYGIVPSIEHYNCIVDLYGRSGLLDKAKNFIEE 510

Query: 635  AN-NASRMIQKKPVSGQNRAPSDRGMAYKKLMSLSEKAKEAGYVPDTRYVLHDLDQEAKE 811
             N N   ++ K  ++  N+  ++    Y  L  L E+ KE GY   T  V+HD++ E +E
Sbjct: 511  NNINHEAIVWKTLLNASNQQSAE---IYAYLEKLVERLKEIGYTSRTDLVVHDVEDEQRE 567

Query: 812  RAL 820
             AL
Sbjct: 568  TAL 570



 Score = 58.9 bits (141), Expect = 2e-06
 Identities = 29/101 (28%), Positives = 57/101 (56%)
 Frame = +2

Query: 134 EEMAKNRVVADSNHLLELMQLTADMESMEGGDRIYEYVMRFSSNYSVSIFNEMIDMYLKL 313
           E +A+     ++  L  +++  A M  +E G R++ +++R   +  V + N ++DMY K 
Sbjct: 101 EMLAEGEATPNAFVLAAVVRCCAGMGDVESGKRVHGWMLRNGVHLDVVLCNAVLDMYAKC 160

Query: 314 GDYRRGGRIFEQMLCRNIDSWNMMIMALVENGEAEEAIQVF 436
           G + R  R+F  M  R+  SWN+ I A +++G+   ++Q+F
Sbjct: 161 GQFERARRVFGAMAERDAVSWNIAIGACIQSGDILGSMQLF 201


>gb|EXB93457.1| hypothetical protein L484_006119 [Morus notabilis]
          Length = 612

 Score =  112 bits (280), Expect = 2e-22
 Identities = 60/185 (32%), Positives = 99/185 (53%)
 Frame = +2

Query: 77  AKLGAHSGRRNVHSTVEALEEMAKNRVVADSNHLLELMQLTADMESMEGGDRIYEYVMRF 256
           A L  ++        ++  E M   ++  D+  L+  +   A    +E G+RI  YV   
Sbjct: 300 AMLAGYAQNGRSIEAIKLFERMMNEKIKPDNAALVSALSACAQSGFVEVGERISAYVESH 359

Query: 257 SSNYSVSIFNEMIDMYLKLGDYRRGGRIFEQMLCRNIDSWNMMIMALVENGEAEEAIQVF 436
           S    V + + ++DM+ K G+  +  ++F++M  ++I SWN MI  L  NG AEEAI ++
Sbjct: 360 SFTSDVKVASALLDMHSKFGNIDKARQVFDEMRVKDIVSWNSMISGLAVNGYAEEAIHLY 419

Query: 437 TRLVKGEDKIKPNKTTFASILKACELLGEVEKGRAYFDSMRKDYGITPSSDHYLCYDNLV 616
            ++   E  +KP+  TF+ +L AC   G +E G  +F+SM+ DYGITP  +H+ C  +L 
Sbjct: 420 EKM--KETGLKPDNITFSGLLTACTHAGLIELGLKFFESMKSDYGITPEIEHHACVIDLF 477

Query: 617 RNSKR 631
             S R
Sbjct: 478 CRSGR 482



 Score = 64.7 bits (156), Expect = 4e-08
 Identities = 58/236 (24%), Positives = 107/236 (45%), Gaps = 7/236 (2%)
 Frame = +2

Query: 83  LGAHSGRRNVHSTVEALEEMAKNRVVADSNHLLELMQLTADMESMEGGDRIYE-YVMRFS 259
           +  ++   + H  +  +E M           L+ L+ + A +  +E G  I + Y+   S
Sbjct: 200 ISCYAQNEDYHEALRLIERMQAENFGPSKITLVILLSICAKLGDLEMGLGIKKKYIDDSS 259

Query: 260 SNYSVSIFNEMIDMYLKLGDYRRGGRIFEQMLCRNIDSWNMMIMALVENGEAEEAIQVFT 439
               + I   ++++Y+K G      R F++M  R++ +W+ M+    +NG + EAI++F 
Sbjct: 260 LRSDMIISTAILELYVKCGAVDGARREFDRMDRRDVVAWSAMLAGYAQNGRSIEAIKLFE 319

Query: 440 RLVKGEDKIKPNKTTFASILKACELLGEVEKGR---AYFDSMRKDYGITPSS---DHYLC 601
           R++   +KIKP+     S L AC   G VE G    AY +S      +  +S   D +  
Sbjct: 320 RMM--NEKIKPDNAALVSALSACAQSGFVEVGERISAYVESHSFTSDVKVASALLDMHSK 377

Query: 602 YDNLVRNSKREANNASRMIQKKPVSGQNRAPSDRGMAYKKLMSLSEKAKEAGYVPD 769
           + N+  +  R+  +  R+      +      +  G A ++ + L EK KE G  PD
Sbjct: 378 FGNI--DKARQVFDEMRVKDIVSWNSMISGLAVNGYA-EEAIHLYEKMKETGLKPD 430

