BLASTX nr result

ID: Mentha25_contig00001004 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00001004
         (1062 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI19618.3| unnamed protein product [Vitis vinifera]              246   9e-63
ref|XP_006375040.1| hypothetical protein POPTR_0014s03840g [Popu...   233   1e-58
gb|EYU45090.1| hypothetical protein MIMGU_mgv1a021870mg, partial...   231   4e-58
ref|XP_007202366.1| hypothetical protein PRUPE_ppa009223mg [Prun...   198   3e-48
ref|NP_001239891.1| uncharacterized protein LOC100783921 [Glycin...   197   5e-48
ref|XP_003620888.1| Pentatricopeptide repeat-containing protein ...   196   1e-47
ref|XP_006345330.1| PREDICTED: pentatricopeptide repeat-containi...   196   2e-47
ref|XP_006393565.1| hypothetical protein EUTSA_v10011729mg [Eutr...   189   1e-45
ref|XP_003517821.1| PREDICTED: pentatricopeptide repeat-containi...   189   2e-45
ref|XP_003620912.1| Pentatricopeptide repeat-containing protein ...   189   2e-45
gb|AAD46029.1|AC007519_14 F16N3.14 [Arabidopsis thaliana]             186   1e-44
ref|NP_175189.2| pentatricopeptide repeat  protein DYW1 [Arabido...   186   1e-44
ref|XP_002891372.1| hypothetical protein ARALYDRAFT_473903 [Arab...   186   2e-44
ref|XP_007152924.1| hypothetical protein PHAVU_004G171700g [Phas...   184   7e-44
ref|XP_004505212.1| PREDICTED: pentatricopeptide repeat-containi...   183   9e-44
ref|XP_004505209.1| PREDICTED: pentatricopeptide repeat-containi...   183   9e-44
ref|XP_002279824.1| PREDICTED: pentatricopeptide repeat-containi...   181   5e-43
gb|EXB38821.1| Serine acetyltransferase 5 [Morus notabilis]           181   6e-43
gb|AFK42502.1| unknown [Medicago truncatula]                          181   6e-43
ref|XP_003607988.1| Pentatricopeptide repeat-containing protein ...   181   6e-43

>emb|CBI19618.3| unnamed protein product [Vitis vinifera]
          Length = 576

 Score =  246 bits (629), Expect = 9e-63
 Identities = 129/254 (50%), Positives = 160/254 (62%), Gaps = 43/254 (16%)
 Frame = +1

Query: 1    ENGEAEEAIQVFTRLVKGEDELKPNETTFASILKACELLGEVEKGRAYFASMRKDYGITP 180
            +NGE EEA+ +F++L K  D ++P+ +TF  +L ACE LG VE+G A+F SM  DYGITP
Sbjct: 325  KNGEGEEALAIFSKLKK--DGIEPDGSTFIGVLSACECLGAVEEGLAHFNSMSTDYGITP 382

Query: 181  STDHHLSYNNLVRNSKKEAD---------------------------------------- 240
            S +H     +L    +K A+                                        
Sbjct: 383  SMEHFAIIVDLFGRLQKIAEAKEFIASMPLEPSSMIWQTLQKYLKTERVDEPAPLTTGSG 442

Query: 241  ---NASRMIQKKLVSEKNRAPSDRGMAYKKLMSLSKKAKEAGYVPDTRYVLHDLDHEAKE 411
               +  + ++   VS++  A  ++  AY+KL SL K  KEAGYV DTRYVLHDLD EAKE
Sbjct: 443  LKLSHKKRVKSNFVSKQKNASPEKSKAYEKLRSLHKGVKEAGYVSDTRYVLHDLDQEAKE 502

Query: 412  RALLYHSERLAIAYGLISTPPGTALRVIKNLRICGDCHNFIKILSTFEKREFIVRDTKRF 591
            ++LLYHSERLAIAYGLISTPPGT LR+IKNLRICGDCHNFIKILS  EKRE IVRD KRF
Sbjct: 503  KSLLYHSERLAIAYGLISTPPGTTLRIIKNLRICGDCHNFIKILSNIEKREIIVRDNKRF 562

Query: 592  HHFKDGECSCRDFW 633
            HHF+DG+CSC D+W
Sbjct: 563  HHFRDGKCSCGDYW 576


>ref|XP_006375040.1| hypothetical protein POPTR_0014s03840g [Populus trichocarpa]
           gi|550323354|gb|ERP52837.1| hypothetical protein
           POPTR_0014s03840g [Populus trichocarpa]
          Length = 429

 Score =  233 bits (593), Expect = 1e-58
 Identities = 128/254 (50%), Positives = 165/254 (64%), Gaps = 43/254 (16%)
 Frame = +1

Query: 1   ENGEAEEAIQVFTRLVKGEDELKPNETTFASILKACELLGEVEKGRAYFASMRKDYGITP 180
           EN E E+A+++F+++ KG D ++P+ ++F  +L AC  LG  ++G+ +F SM +DYGITP
Sbjct: 178 ENKEGEKALEIFSQM-KG-DGIRPDGSSFVGVLMACVCLGAEKEGQKHFESMSRDYGITP 235

Query: 181 STDHHLSYNNLVRNSKKEA-----------DNASRM---IQK------------------ 264
           + +H+  + +L+  + K A           D  SR+   +QK                  
Sbjct: 236 TVEHYEVFVDLLGRTGKIAEAKELVSNMPIDPNSRIWETLQKYSKARTQGQLGYPVSPPG 295

Query: 265 -KLVSEKN----------RAPSDRGMAYKKLMSLSKKAKEAGYVPDTRYVLHDLDHEAKE 411
            KL   K           R  SDR  AY+KL SLSK+ ++AGYVPDTR+VLHDLD EAKE
Sbjct: 296 LKLGDMKRAKDNTNTNHRRVTSDRSKAYEKLRSLSKEVRDAGYVPDTRFVLHDLDQEAKE 355

Query: 412 RALLYHSERLAIAYGLISTPPGTALRVIKNLRICGDCHNFIKILSTFEKREFIVRDTKRF 591
           +AL YHSERLAIAYGLI+T PGT LR++KNLRICGDCHNFIKILS  E REFIVRD KRF
Sbjct: 356 KALFYHSERLAIAYGLINTSPGTTLRIMKNLRICGDCHNFIKILSKIEDREFIVRDNKRF 415

Query: 592 HHFKDGECSCRDFW 633
           HHFK G CSCRD+W
Sbjct: 416 HHFKAGNCSCRDYW 429


>gb|EYU45090.1| hypothetical protein MIMGU_mgv1a021870mg, partial [Mimulus
           guttatus]
          Length = 277

 Score =  231 bits (589), Expect = 4e-58
 Identities = 123/212 (58%), Positives = 142/212 (66%), Gaps = 1/212 (0%)
 Frame = +1

Query: 1   ENGEAEEAIQVFTRLVKGEDELKPNETTFASILKACELLGEVEKGRAYFASMRKDYGITP 180
           ENG+  EAIQ+F +LVK                                    +DYGITP
Sbjct: 113 ENGQENEAIQLFAKLVK------------------------------------EDYGITP 136

Query: 181 STDHHLSYNNLVRNSKKEADNASRMIQKKLVSEKNRAP-SDRGMAYKKLMSLSKKAKEAG 357
           S DH+ SY NL R + +           ++VSEK+RA  SD+ +AY+KL  LS +AK+AG
Sbjct: 137 SLDHYTSYVNLQRKTNR-----------RVVSEKDRAKNSDKSLAYEKLRCLSDEAKKAG 185

Query: 358 YVPDTRYVLHDLDHEAKERALLYHSERLAIAYGLISTPPGTALRVIKNLRICGDCHNFIK 537
           YV DTRYVLHD+D EAKERAL+YHSERLAIAYGLI T PGT LR+IKNLRICGDCHNFIK
Sbjct: 186 YVADTRYVLHDIDEEAKERALMYHSERLAIAYGLIRTTPGTTLRIIKNLRICGDCHNFIK 245

Query: 538 ILSTFEKREFIVRDTKRFHHFKDGECSCRDFW 633
           ILS FE+REFIVRD KRFHHFKDG CSCRDFW
Sbjct: 246 ILSKFEEREFIVRDNKRFHHFKDGICSCRDFW 277


>ref|XP_007202366.1| hypothetical protein PRUPE_ppa009223mg [Prunus persica]
           gi|462397897|gb|EMJ03565.1| hypothetical protein
           PRUPE_ppa009223mg [Prunus persica]
          Length = 301

 Score =  198 bits (504), Expect = 3e-48
 Identities = 116/268 (43%), Positives = 147/268 (54%), Gaps = 58/268 (21%)
 Frame = +1

Query: 4   NGEAEEAIQVFTRLVKGEDELKPNETTFASILKACELLGEVEKGRAYFASMRKDYGITPS 183
           NG+ +E + +F ++      LKPN+ TF  +L AC     VE+G  YF SM+ +Y I P 
Sbjct: 36  NGQGDEGLLLFEQM--RNLGLKPNKETFVVVLVACASAEAVEEGLTYFESMKNEYEIVPE 93

Query: 184 TDHHLSYNNLV----------------------------RN------------------- 222
            +H+L   +++                            RN                   
Sbjct: 94  IEHYLGLIDVLGKSGHLNEAEEFIEKMPFEPTAEVWEALRNFARIHGDIELEDRAEDLLV 153

Query: 223 ----SKKEADNASRMIQKK-----LVSEKNRAPSDR--GMAYKKLMSLSKKAKEAGYVPD 369
               SK  A+     ++K+     ++ EKNR    R    AY+KL  L  + +EAGYVPD
Sbjct: 154 SLDPSKANAEKIPLPLRKQHSEINMLGEKNRVSEYRITNEAYEKLKGLKGQMREAGYVPD 213

Query: 370 TRYVLHDLDHEAKERALLYHSERLAIAYGLISTPPGTALRVIKNLRICGDCHNFIKILST 549
           TRYVLHD+D EAKE+AL YHSERLAIAYGLISTP    LR+IKNLRICGDCHN IKI+S 
Sbjct: 214 TRYVLHDIDQEAKEQALQYHSERLAIAYGLISTPARQTLRIIKNLRICGDCHNAIKIMSK 273

Query: 550 FEKREFIVRDTKRFHHFKDGECSCRDFW 633
              RE IVRD KRFHHFKDG+CSC D+W
Sbjct: 274 IVGRELIVRDNKRFHHFKDGKCSCGDYW 301


>ref|NP_001239891.1| uncharacterized protein LOC100783921 [Glycine max]
           gi|255636013|gb|ACU18351.1| unknown [Glycine max]
          Length = 449

 Score =  197 bits (502), Expect = 5e-48
 Identities = 123/272 (45%), Positives = 151/272 (55%), Gaps = 62/272 (22%)
 Frame = +1

Query: 4   NGEAEEAIQVFTRLVKGEDELKPNETTFASILKACELLGEVEKGRAYFASMRKDYGITPS 183
           NG   + + VF ++ + E  L P+  TF  +L AC     VE+G  +F SM K+YGI PS
Sbjct: 181 NGLGCDGLLVFQQMKQAE--LPPDGETFELVLAACSQAEAVEEGFLHFESM-KEYGIVPS 237

Query: 184 TDHHLSYNNLVRNSK--KEAD--------------------------------------- 240
            +H+L   N++ N+   KEA+                                       
Sbjct: 238 MEHYLEVINIMGNAGQLKEAEEFIENVPIELGVEAWESLRKFARIHGDLDLEDCAEELLT 297

Query: 241 --NASRMIQKKL-------------VSEKNRAPSDR-GMAYK-----KLMSLSKKAKEAG 357
             + S+ I  KL             + EKNRA   R  + YK     KL  LS + +EAG
Sbjct: 298 RFDPSKAIADKLPTPPRKKQSDVNMLEEKNRATEYRYSIPYKEEDNEKLGGLSGQMREAG 357

Query: 358 YVPDTRYVLHDLDHEAKERALLYHSERLAIAYGLISTPPGTALRVIKNLRICGDCHNFIK 537
           YVPDTRYVLHD+D E KE+AL YHSERLAIAYGLISTPP T LR+IKNLRICGDCHN IK
Sbjct: 358 YVPDTRYVLHDIDEEEKEKALQYHSERLAIAYGLISTPPRTTLRIIKNLRICGDCHNAIK 417

Query: 538 ILSTFEKREFIVRDTKRFHHFKDGECSCRDFW 633
           I+S    RE IVRD KRFHHFKDG+CSC D+W
Sbjct: 418 IMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW 449


>ref|XP_003620888.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355495903|gb|AES77106.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 446

 Score =  196 bits (499), Expect = 1e-47
 Identities = 121/272 (44%), Positives = 148/272 (54%), Gaps = 62/272 (22%)
 Frame = +1

Query: 4   NGEAEEAIQVFTRLVKGEDELKPNETTFASILKACELLGEVEKGRAYFASMRKDYGITPS 183
           NG   + + VF ++   +  + P+E TFA +L  C L+  VE+G   F SM K+YGI P 
Sbjct: 178 NGLGIDGLLVFKQM--RQQGVVPDEETFALVLAVCALVDGVEEGLMQFESM-KEYGIVPG 234

Query: 184 TDHHLSYNNL----------------------------VRN------------------- 222
            +H+L   N+                            +RN                   
Sbjct: 235 MEHYLGVVNIFGCAGRLDEAHEFIENMPIEAGVELWETLRNFARIHGDLEREDCADELLT 294

Query: 223 ----SKKEADNASRMIQKK-----LVSEKNRAPSDR-GMAYK-----KLMSLSKKAKEAG 357
               SK  AD      +KK     ++ EKNR    R  M YK     KL  L+ + +EAG
Sbjct: 295 VLDPSKAAADKVPLPQRKKQSAINMLEEKNRVSEYRCNMPYKEEGDVKLRGLTGQMREAG 354

Query: 358 YVPDTRYVLHDLDHEAKERALLYHSERLAIAYGLISTPPGTALRVIKNLRICGDCHNFIK 537
           YVPDTRYVLHD+D E KE+AL YHSERLAIAYGLISTPP T LR+IKNLRICGDCHN IK
Sbjct: 355 YVPDTRYVLHDIDEEEKEKALQYHSERLAIAYGLISTPPRTTLRIIKNLRICGDCHNAIK 414

Query: 538 ILSTFEKREFIVRDTKRFHHFKDGECSCRDFW 633
           I+S    RE IVRD KRFHHFKDG+CSC D+W
Sbjct: 415 IMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW 446


>ref|XP_006345330.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15690-like
            [Solanum tuberosum]
          Length = 466

 Score =  196 bits (497), Expect = 2e-47
 Identities = 114/273 (41%), Positives = 146/273 (53%), Gaps = 62/273 (22%)
 Frame = +1

Query: 1    ENGEAEEAIQVFTRLVKGEDELKPNETTFASILKACELLGEVEKGRAYFASMRKDYGITP 180
            E G+ E  + +F ++ K    L+PN  TF+ +L  C   G V+ G  YF  M+ +YGI P
Sbjct: 195  ETGDGESGLLLFEQMRKLR-LLEPNGDTFSVVLSGCASKGSVKDGFTYFELMKNEYGIVP 253

Query: 181  STDHHLSY----------NNLVR------------------------------------- 219
              +H+L            N L+                                      
Sbjct: 254  GVEHYLGIIDVLGKSGHLNELLEFIEDMPIEPTKVVWEAVINFARIHGDIELEDRTEELL 313

Query: 220  ----NSKKEADNASRMIQKK-----LVSEKNRAPS------DRGMAYKKLMSLSKKAKEA 354
                +S+  AD      QK+     ++  K+RA         R  AY+KL  LS + ++A
Sbjct: 314  IRLDSSRNMADKPLAPFQKRHSEFSMLEGKDRANEFKSTIPHRADAYEKLKGLSGQMRDA 373

Query: 355  GYVPDTRYVLHDLDHEAKERALLYHSERLAIAYGLISTPPGTALRVIKNLRICGDCHNFI 534
            GYVPDTRYVLHD+D  AKE+AL+YHSERLAIAYGLISTP  T LR+IKNLRICGDCHN I
Sbjct: 374  GYVPDTRYVLHDIDEAAKEQALMYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHNAI 433

Query: 535  KILSTFEKREFIVRDTKRFHHFKDGECSCRDFW 633
            KI+S    RE IVRD KRFHHF+DG+CSC D+W
Sbjct: 434  KIMSKIVGRELIVRDNKRFHHFRDGKCSCNDYW 466


>ref|XP_006393565.1| hypothetical protein EUTSA_v10011729mg [Eutrema salsugineum]
           gi|557090143|gb|ESQ30851.1| hypothetical protein
           EUTSA_v10011729mg [Eutrema salsugineum]
          Length = 254

 Score =  189 bits (481), Expect = 1e-45
 Identities = 89/145 (61%), Positives = 114/145 (78%), Gaps = 2/145 (1%)
 Frame = +1

Query: 205 NNLVRNSKKEADNASRMIQ--KKLVSEKNRAPSDRGMAYKKLMSLSKKAKEAGYVPDTRY 378
           + +V   K+ + + S  I+  K+ +S + +A  DR  A  KL SL K+ ++AGYVPDT+Y
Sbjct: 110 SEVVEPKKRISSDRSTKIKGDKQEISSQKKAIVDRSKALVKLKSLGKEVRDAGYVPDTKY 169

Query: 379 VLHDLDHEAKERALLYHSERLAIAYGLISTPPGTALRVIKNLRICGDCHNFIKILSTFEK 558
           VLHD+D EAKE+AL++HSERLAIA+GLI+TPPGT +RV+KNLRICGDCHNFIK LS+ E 
Sbjct: 170 VLHDIDEEAKEKALMHHSERLAIAFGLINTPPGTTIRVMKNLRICGDCHNFIKRLSSIED 229

Query: 559 REFIVRDTKRFHHFKDGECSCRDFW 633
           REFIVRD KRFHHF+DG CSC D+W
Sbjct: 230 REFIVRDNKRFHHFRDGSCSCGDYW 254


>ref|XP_003517821.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g15690-like [Glycine max]
          Length = 452

 Score =  189 bits (480), Expect = 2e-45
 Identities = 118/272 (43%), Positives = 147/272 (54%), Gaps = 62/272 (22%)
 Frame = +1

Query: 4   NGEAEEAIQVFTRLVKGEDELKPNETTFASILKACELLGEVEKGRAYFASMRKDYGITPS 183
           NG   + + VF ++ +    + P+  TF  +L AC     VE+G  +F SM K++GI PS
Sbjct: 184 NGLGCDGLLVFQQMKQAG--VPPDGETFELVLAACAQAEAVEEGFLHFESM-KEHGIVPS 240

Query: 184 TDHHLSYNNL----------------------------VRN------------------- 222
            +H+L   N+                            +RN                   
Sbjct: 241 MEHYLEVINILGNTGQLNEAEEFIEKIPIELGVEAWESLRNFAQKHGDLDLEDHAEEVLT 300

Query: 223 ----SKKEADNASRMIQKK-----LVSEKNRAPSDRGM------AYKKLMSLSKKAKEAG 357
               SK  AD      +KK     ++ EKNR    R        A++KL  LS + +EAG
Sbjct: 301 CLDPSKAVADKLPPPPRKKQSDMNMLEEKNRVTEYRYSIPYKEEAHEKLGGLSGQMREAG 360

Query: 358 YVPDTRYVLHDLDHEAKERALLYHSERLAIAYGLISTPPGTALRVIKNLRICGDCHNFIK 537
           YVPDTRYVLHD+D E KE+AL YHSERLAIAYGLISTPP T LR+IKNLRICGDCHN IK
Sbjct: 361 YVPDTRYVLHDIDEEEKEKALQYHSERLAIAYGLISTPPRTTLRIIKNLRICGDCHNAIK 420

Query: 538 ILSTFEKREFIVRDTKRFHHFKDGECSCRDFW 633
           I+S    RE IVRD KRFHHFKDG+CSC D+W
Sbjct: 421 IMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW 452


>ref|XP_003620912.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355495927|gb|AES77130.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 415

 Score =  189 bits (479), Expect = 2e-45
 Identities = 109/240 (45%), Positives = 141/240 (58%), Gaps = 30/240 (12%)
 Frame = +1

Query: 4   NGEAEEAIQVFTRLVKGEDELKPNETTFASILKACELLGEVEKGRAYFASMRKDYGITPS 183
           NG   + + VF ++   +  + P+E TFA +L  C L+  VE+G  ++  +   +G    
Sbjct: 178 NGLGIDGLLVFKQM--RQQGIVPDEETFALVLAVCALVDGVEEGMEHYLGVVNIFGCAGR 235

Query: 184 TDH-HLSYNNLVRN-----------------SKKEADNASRMIQKK------LVSEKNRA 291
            +  H    N++                   SK  AD+   + Q+K      ++ EKNR 
Sbjct: 236 LNEAHEFIENIIHGDLEREDCADELLTVIDPSKAAADDKVPLPQRKKQSAINMMEEKNRV 295

Query: 292 PSDR-GMAYK-----KLMSLSKKAKEAGYVPDTRYVLHDLDHEAKERALLYHSERLAIAY 453
              R  M Y+     KL  L+ + +EAGYVPDTRYVLHD+D E KE+AL YHSE LAIAY
Sbjct: 296 SEYRCNMPYEEEDDEKLRGLTGQMREAGYVPDTRYVLHDIDEEEKEKALQYHSECLAIAY 355

Query: 454 GLISTPPGTALRVIKNLRICGDCHNFIKILSTFEKREFIVRDTKRFHHFKDGECSCRDFW 633
           GLISTPP T LR+IKNLRICGDCHN IKI+S    RE IVRD KRFHHFKDG+CSC D+W
Sbjct: 356 GLISTPPRTTLRIIKNLRICGDCHNAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW 415


>gb|AAD46029.1|AC007519_14 F16N3.14 [Arabidopsis thaliana]
          Length = 571

 Score =  186 bits (472), Expect = 1e-44
 Identities = 87/145 (60%), Positives = 108/145 (74%)
 Frame = +1

Query: 199 SYNNLVRNSKKEADNASRMIQKKLVSEKNRAPSDRGMAYKKLMSLSKKAKEAGYVPDTRY 378
           S++  VR  K E     +           +A  DR  AY KL SL K+ ++AGYVP+T+Y
Sbjct: 438 SHSTKVRGDKPEISGGEK-----------KAIVDRSKAYVKLKSLGKEVRDAGYVPETKY 486

Query: 379 VLHDLDHEAKERALLYHSERLAIAYGLISTPPGTALRVIKNLRICGDCHNFIKILSTFEK 558
           VLHD+D EAKE+AL++HSERLAIA+G+I+TPPGT +RV+KNLRICGDCHNFIKILS+ E 
Sbjct: 487 VLHDIDEEAKEKALMHHSERLAIAFGIINTPPGTTIRVMKNLRICGDCHNFIKILSSIED 546

Query: 559 REFIVRDTKRFHHFKDGECSCRDFW 633
           RE IVRD KRFHHF+DG CSC D+W
Sbjct: 547 REIIVRDNKRFHHFRDGNCSCGDYW 571


>ref|NP_175189.2| pentatricopeptide repeat  protein DYW1 [Arabidopsis thaliana]
           gi|193806408|sp|P0C7R1.1|PPR74_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At1g47580, chloroplastic; Flags: Precursor
           gi|332194069|gb|AEE32190.1| pentatricopeptide repeat
           protein DYW1 [Arabidopsis thaliana]
          Length = 239

 Score =  186 bits (472), Expect = 1e-44
 Identities = 87/145 (60%), Positives = 108/145 (74%)
 Frame = +1

Query: 199 SYNNLVRNSKKEADNASRMIQKKLVSEKNRAPSDRGMAYKKLMSLSKKAKEAGYVPDTRY 378
           S++  VR  K E     +           +A  DR  AY KL SL K+ ++AGYVP+T+Y
Sbjct: 106 SHSTKVRGDKPEISGGEK-----------KAIVDRSKAYVKLKSLGKEVRDAGYVPETKY 154

Query: 379 VLHDLDHEAKERALLYHSERLAIAYGLISTPPGTALRVIKNLRICGDCHNFIKILSTFEK 558
           VLHD+D EAKE+AL++HSERLAIA+G+I+TPPGT +RV+KNLRICGDCHNFIKILS+ E 
Sbjct: 155 VLHDIDEEAKEKALMHHSERLAIAFGIINTPPGTTIRVMKNLRICGDCHNFIKILSSIED 214

Query: 559 REFIVRDTKRFHHFKDGECSCRDFW 633
           RE IVRD KRFHHF+DG CSC D+W
Sbjct: 215 REIIVRDNKRFHHFRDGNCSCGDYW 239


>ref|XP_002891372.1| hypothetical protein ARALYDRAFT_473903 [Arabidopsis lyrata subsp.
           lyrata] gi|297337214|gb|EFH67631.1| hypothetical protein
           ARALYDRAFT_473903 [Arabidopsis lyrata subsp. lyrata]
          Length = 415

 Score =  186 bits (471), Expect = 2e-44
 Identities = 104/206 (50%), Positives = 140/206 (67%)
 Frame = +1

Query: 16  EEAIQVFTRLVKGEDELKPNETTFASILKACELLGEVEKGRAYFASMRKDYGITPSTDHH 195
           E A +VF +L +     +  +T     + A EL G V +      ++RKD     +T  H
Sbjct: 234 ETADKVFNKLPE-----RNLDTWSGGRVTAKELSGSVVRN-----TVRKD-----TTLRH 278

Query: 196 LSYNNLVRNSKKEADNASRMIQKKLVSEKNRAPSDRGMAYKKLMSLSKKAKEAGYVPDTR 375
           +S ++   ++K   D      + K++ EK +A  DR  AY KL SL+K+ ++AGYVP+T+
Sbjct: 279 ISPSS--HSTKIRGD------KPKILGEK-KAIVDRSKAYVKLKSLAKEVRDAGYVPETK 329

Query: 376 YVLHDLDHEAKERALLYHSERLAIAYGLISTPPGTALRVIKNLRICGDCHNFIKILSTFE 555
           YVLHD+D EAKE+AL++HSERLAIA+GLI+TPPGT +RV+KNLRICGDCHNFIKILS+ E
Sbjct: 330 YVLHDIDEEAKEKALMHHSERLAIAFGLINTPPGTTIRVMKNLRICGDCHNFIKILSSIE 389

Query: 556 KREFIVRDTKRFHHFKDGECSCRDFW 633
            RE IVRD KRFHHF+ G CSC D+W
Sbjct: 390 DREIIVRDNKRFHHFRYGSCSCGDYW 415


>ref|XP_007152924.1| hypothetical protein PHAVU_004G171700g [Phaseolus vulgaris]
           gi|561026233|gb|ESW24918.1| hypothetical protein
           PHAVU_004G171700g [Phaseolus vulgaris]
          Length = 460

 Score =  184 bits (466), Expect = 7e-44
 Identities = 114/272 (41%), Positives = 142/272 (52%), Gaps = 62/272 (22%)
 Frame = +1

Query: 4   NGEAEEAIQVFTRLVKGEDELKPNETTFASILKACELLGEVEKGRAYFASMRKDYGITPS 183
           NG A + +  F ++ +       +  TF  +L AC     VE+G  +   M K+ GI PS
Sbjct: 192 NGLARDGLLAFQKMKQAGVPF--DGETFKLVLAACAQAEAVEEGLLHLEYM-KENGIVPS 248

Query: 184 TDHHLSYNNLVRN----------------------------------------------- 222
            +H+L   N++ N                                               
Sbjct: 249 MEHYLEVVNILGNAGRLNEAEEFIEKIPIEVGAEGWESLRNLARIHGNLDLEDRAKELLM 308

Query: 223 ----SKKEADNASRMIQKK-----LVSEKNRAPSDRGM------AYKKLMSLSKKAKEAG 357
               SK  AD  +   +KK     ++ EKNR    R        A++KL  LS + +EAG
Sbjct: 309 YLDPSKSIADELAMPPRKKQHDINMLEEKNRVSEYRYSIPYKEEAHEKLGGLSGQMREAG 368

Query: 358 YVPDTRYVLHDLDHEAKERALLYHSERLAIAYGLISTPPGTALRVIKNLRICGDCHNFIK 537
           YVPDTRYVLHD+D E KE+AL YHSERLAIAYGLISTPP T LR+IKNLRICGDCHN IK
Sbjct: 369 YVPDTRYVLHDIDEEEKEKALQYHSERLAIAYGLISTPPRTTLRIIKNLRICGDCHNAIK 428

Query: 538 ILSTFEKREFIVRDTKRFHHFKDGECSCRDFW 633
           I+S    RE IVRD KRFHHFKDG+CSC D+W
Sbjct: 429 IMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW 460


>ref|XP_004505212.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15690-like
            [Cicer arietinum]
          Length = 567

 Score =  183 bits (465), Expect = 9e-44
 Identities = 114/266 (42%), Positives = 141/266 (53%), Gaps = 60/266 (22%)
 Frame = +1

Query: 16   EEAIQVFTRLVKGEDELKPNETTFASILKACELLGEVEKGRAYFASMRKDYGITPSTDHH 195
            +EA+Q+F  +   E  L+    T  ++L AC     VE    +F SM+  YGI P  +H+
Sbjct: 305  DEALQLFEHM--NELGLEITSETLLAVLSACGSAEAVEDAYLHFESMKNKYGIEPGVEHY 362

Query: 196  LSYNNLVRNSK--KEADN------------------------------------ASRMIQ 261
            +    ++  S   KEA+                                       R+  
Sbjct: 363  MGLLEVLGQSGYLKEAEEFIEKLPFGPTVTVLETLKSYARIHGDIDLEDHVEELIVRLDP 422

Query: 262  KKLVSEKNRAPS-------------DRGMAYK---------KLMSLSKKAKEAGYVPDTR 375
             K V+ K   P              +R + YK         KL +LS   KEAGYVPDTR
Sbjct: 423  SKAVANKIPTPPPKKYSAISMLEGRNRMIEYKNPTLYKDDEKLKALSGM-KEAGYVPDTR 481

Query: 376  YVLHDLDHEAKERALLYHSERLAIAYGLISTPPGTALRVIKNLRICGDCHNFIKILSTFE 555
            YVLHD+D EAKE+ALLYHSERLAIAYGLISTPP T LR+IKNLR+CGDCHN IKI+S   
Sbjct: 482  YVLHDIDQEAKEQALLYHSERLAIAYGLISTPPRTPLRIIKNLRVCGDCHNAIKIMSRIV 541

Query: 556  KREFIVRDTKRFHHFKDGECSCRDFW 633
             RE IVRD KRFHHFKDG+CSC D+W
Sbjct: 542  GRELIVRDNKRFHHFKDGKCSCGDYW 567


>ref|XP_004505209.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15690-like
            [Cicer arietinum]
          Length = 567

 Score =  183 bits (465), Expect = 9e-44
 Identities = 114/266 (42%), Positives = 141/266 (53%), Gaps = 60/266 (22%)
 Frame = +1

Query: 16   EEAIQVFTRLVKGEDELKPNETTFASILKACELLGEVEKGRAYFASMRKDYGITPSTDHH 195
            +EA+Q+F  +   E  L+    T  ++L AC     VE    +F SM+  YGI P  +H+
Sbjct: 305  DEALQLFEHM--NELGLEITSETLLAVLSACGSAEAVEDAYLHFESMKNKYGIEPGVEHY 362

Query: 196  LSYNNLVRNSK--KEADN------------------------------------ASRMIQ 261
            +    ++  S   KEA+                                       R+  
Sbjct: 363  MGLLEVLGQSGYLKEAEEFIEKLPFGPTVTVLETLKSYARIHGDIDLEDHVEELIVRLDP 422

Query: 262  KKLVSEKNRAPS-------------DRGMAYK---------KLMSLSKKAKEAGYVPDTR 375
             K V+ K   P              +R + YK         KL +LS   KEAGYVPDTR
Sbjct: 423  SKAVANKIPTPPPKKYSAISMLEGRNRMIEYKNPTLYKDDEKLKALSGM-KEAGYVPDTR 481

Query: 376  YVLHDLDHEAKERALLYHSERLAIAYGLISTPPGTALRVIKNLRICGDCHNFIKILSTFE 555
            YVLHD+D EAKE+ALLYHSERLAIAYGLISTPP T LR+IKNLR+CGDCHN IKI+S   
Sbjct: 482  YVLHDIDQEAKEQALLYHSERLAIAYGLISTPPRTPLRIIKNLRVCGDCHNAIKIMSRIV 541

Query: 556  KREFIVRDTKRFHHFKDGECSCRDFW 633
             RE IVRD KRFHHFKDG+CSC D+W
Sbjct: 542  GRELIVRDNKRFHHFKDGKCSCGDYW 567


>ref|XP_002279824.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g15690-like [Vitis vinifera]
          Length = 476

 Score =  181 bits (459), Expect = 5e-43
 Identities = 87/124 (70%), Positives = 98/124 (79%)
 Frame = +1

Query: 262 KKLVSEKNRAPSDRGMAYKKLMSLSKKAKEAGYVPDTRYVLHDLDHEAKERALLYHSERL 441
           K  VSE       +G AY+KL  L+ + +EAGYVPDTRYVLHD+D EAKE+ALLYHSERL
Sbjct: 353 KNRVSEYRSTNPYKGDAYEKLKGLNGQMREAGYVPDTRYVLHDIDQEAKEQALLYHSERL 412

Query: 442 AIAYGLISTPPGTALRVIKNLRICGDCHNFIKILSTFEKREFIVRDTKRFHHFKDGECSC 621
           AIAYGLISTP  T LR+IKNLRICGDCHN IKI+S    RE IVRD KRFHHFKDG+CSC
Sbjct: 413 AIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSKIVGRELIVRDNKRFHHFKDGKCSC 472

Query: 622 RDFW 633
            D+W
Sbjct: 473 GDYW 476


>gb|EXB38821.1| Serine acetyltransferase 5 [Morus notabilis]
          Length = 819

 Score =  181 bits (458), Expect = 6e-43
 Identities = 87/143 (60%), Positives = 109/143 (76%), Gaps = 1/143 (0%)
 Frame = +1

Query: 208  NLVRNSKKEADNASRMIQKKLVSEKNRAPSD-RGMAYKKLMSLSKKAKEAGYVPDTRYVL 384
            N +  ++++  + S MI++K    + R P+  +   Y+KL  L+ + +EAGYVPDTRYVL
Sbjct: 677  NKIPLAQRKRHSESSMIEEKSRVSEYRCPNPYKEEVYQKLKGLNGQLREAGYVPDTRYVL 736

Query: 385  HDLDHEAKERALLYHSERLAIAYGLISTPPGTALRVIKNLRICGDCHNFIKILSTFEKRE 564
            HD+D EAKE+AL YHSERLAIAYGLISTPP T LR++KNLRICGDCHN IKI+S    RE
Sbjct: 737  HDIDEEAKEQALQYHSERLAIAYGLISTPPRTTLRIMKNLRICGDCHNAIKIMSKIVGRE 796

Query: 565  FIVRDTKRFHHFKDGECSCRDFW 633
             IVRD KRFHHFKDG+CSC D+W
Sbjct: 797  LIVRDNKRFHHFKDGKCSCGDYW 819


>gb|AFK42502.1| unknown [Medicago truncatula]
          Length = 565

 Score =  181 bits (458), Expect = 6e-43
 Identities = 109/265 (41%), Positives = 142/265 (53%), Gaps = 59/265 (22%)
 Frame = +1

Query: 16   EEAIQVFTRLVKGEDELKPNETTFASILKACELLGEVEKGRAYFASMRKDYGITPSTDHH 195
            +E +Q+F ++   E  L+    T  ++L AC     VE    Y  SM+  YGI P  +H+
Sbjct: 303  DEGLQLFEQM--NELGLEITSETMLAVLSACGSAEAVEDAYIYLESMKSKYGIEPGVEHY 360

Query: 196  LSYNNLVRNSK--KEAD-------------------NASRM---------IQKKLVS--- 276
            +   +++  S   KEA+                   N +R+         +++ +VS   
Sbjct: 361  MGLLDVLGQSGYLKEAEEFIEQLPFEPTVTVFETLKNYARIHGDVDLEDHVEELIVSLDP 420

Query: 277  ---EKNRAPSDRGMAYKKLMSLSKK-----------------------AKEAGYVPDTRY 378
                 N+ P+     Y  +  L  K                        K+AGYVPDTRY
Sbjct: 421  SKAVANKIPTPPPKKYTAISMLDGKNRIIEYKNPTLYKDDEKLIAMNSMKDAGYVPDTRY 480

Query: 379  VLHDLDHEAKERALLYHSERLAIAYGLISTPPGTALRVIKNLRICGDCHNFIKILSTFEK 558
            VLHD+D EAKE+ALLYHSERLAIAYGLISTPP T LR+IKNLR+CGDCHN IKI+S    
Sbjct: 481  VLHDIDQEAKEQALLYHSERLAIAYGLISTPPRTPLRIIKNLRVCGDCHNAIKIMSRIVG 540

Query: 559  REFIVRDTKRFHHFKDGECSCRDFW 633
            RE IVRD KRFHHFKDG+CSC D+W
Sbjct: 541  RELIVRDNKRFHHFKDGKCSCGDYW 565


>ref|XP_003607988.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355509043|gb|AES90185.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 565

 Score =  181 bits (458), Expect = 6e-43
 Identities = 109/265 (41%), Positives = 142/265 (53%), Gaps = 59/265 (22%)
 Frame = +1

Query: 16   EEAIQVFTRLVKGEDELKPNETTFASILKACELLGEVEKGRAYFASMRKDYGITPSTDHH 195
            +E +Q+F ++   E  L+    T  ++L AC     VE    Y  SM+  YGI P  +H+
Sbjct: 303  DEGLQLFEQM--NELGLEITSETMLAVLSACGSAEAVEDAYIYLESMKSKYGIEPGVEHY 360

Query: 196  LSYNNLVRNSK--KEAD-------------------NASRM---------IQKKLVS--- 276
            +   +++  S   KEA+                   N +R+         +++ +VS   
Sbjct: 361  MGLLDVLGQSGYLKEAEEFIEQLPFEPTVTVFETLKNYARIHGDVDLEDHVEELIVSLDP 420

Query: 277  ---EKNRAPSDRGMAYKKLMSLSKK-----------------------AKEAGYVPDTRY 378
                 N+ P+     Y  +  L  K                        K+AGYVPDTRY
Sbjct: 421  SKAVANKIPTPPPKKYTAISMLDGKNRIIEYKNPTLYKDDEKLIAMNSMKDAGYVPDTRY 480

Query: 379  VLHDLDHEAKERALLYHSERLAIAYGLISTPPGTALRVIKNLRICGDCHNFIKILSTFEK 558
            VLHD+D EAKE+ALLYHSERLAIAYGLISTPP T LR+IKNLR+CGDCHN IKI+S    
Sbjct: 481  VLHDIDQEAKEQALLYHSERLAIAYGLISTPPRTPLRIIKNLRVCGDCHNAIKIMSRIVG 540

Query: 559  REFIVRDTKRFHHFKDGECSCRDFW 633
            RE IVRD KRFHHFKDG+CSC D+W
Sbjct: 541  RELIVRDNKRFHHFKDGKCSCGDYW 565


Top