BLASTX nr result

ID: Ephedra27_contig00021733 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra27_contig00021733
         (673 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABR16456.1| unknown [Picea sitchensis]                             163   4e-38
gb|EXC46728.1| hypothetical protein L484_002071 [Morus notabilis]     148   1e-33
ref|XP_006832827.1| hypothetical protein AMTR_s00095p00020800, p...   148   1e-33
ref|XP_006832826.1| hypothetical protein AMTR_s00095p00019470 [A...   147   3e-33
ref|XP_006832830.1| hypothetical protein AMTR_s00095p00027730 [A...   147   4e-33
ref|XP_002522167.1| pentatricopeptide repeat-containing protein,...   147   4e-33
emb|CBI30135.3| unnamed protein product [Vitis vinifera]              145   1e-32
ref|XP_002266581.1| PREDICTED: pentatricopeptide repeat-containi...   145   1e-32
ref|XP_002308888.2| hypothetical protein POPTR_0006s03790g [Popu...   144   2e-32
ref|XP_006644067.1| PREDICTED: pentatricopeptide repeat-containi...   143   6e-32
ref|XP_003567340.1| PREDICTED: pentatricopeptide repeat-containi...   142   9e-32
gb|EOY08858.1| Pentatricopeptide repeat-containing protein isofo...   139   6e-31
gb|EOY08857.1| Pentatricopeptide repeat-containing protein isofo...   139   6e-31
ref|XP_006307055.1| hypothetical protein CARUB_v10008643mg [Caps...   139   1e-30
ref|XP_002890106.1| DNA binding protein [Arabidopsis lyrata subs...   138   1e-30
ref|XP_006348039.1| PREDICTED: pentatricopeptide repeat-containi...   138   2e-30
ref|XP_003518031.1| PREDICTED: pentatricopeptide repeat-containi...   138   2e-30
gb|AAD39676.1|AC007591_41 F9L1.43 [Arabidopsis thaliana]              137   2e-30
tpg|DAA54205.1| TPA: hypothetical protein ZEAMMB73_351899 [Zea m...   137   2e-30
ref|NP_173001.2| pentatricopeptide repeat-containing protein [Ar...   137   2e-30

>gb|ABR16456.1| unknown [Picea sitchensis]
          Length = 600

 Score =  163 bits (413), Expect = 4e-38
 Identities = 87/224 (38%), Positives = 140/224 (62%)
 Frame = -2

Query: 672  GMIGKAEETLGEAERVCNSMNEFSPRRRHIIRNLLQPYALLNREKDVDRHWKEIESDVEI 493
            G+  KAE  L E E V  S+ +    +R  ++ LL  YA L +  +V+R WK+ E+   +
Sbjct: 349  GLAEKAEVVLKELENV--SLKD----KRSRLKMLLPLYAELGKPTEVERIWKDFEAFPAL 402

Query: 492  PMKDYMSGISAWGMLGNIDKAERIFERLLELNHRLSGFHYSALLEVYANHNLQSKAENLM 313
             + +Y +G+ AWG LG I+KAE  FE+LL    +LS  HY+ALL VYA+H+L  K + L+
Sbjct: 403  RLDEYATGVVAWGKLGQIEKAEITFEKLLNSGKKLSAKHYNALLNVYADHHLLLKGKELV 462

Query: 312  EHMAKNGVKEDRMIRSLLVKLYVNMGDLQKAESVLSEDYDGTKLKPTYKSLLLVLQKYAE 133
            + M+ NG   +  I   L++L+VN G+L+KA+S+L +  +  +L+P Y +++ +L+KYAE
Sbjct: 463  KRMSDNGCTIEPPIWDALIRLHVNAGELEKADSILFKACNQKQLRPKYWTMVTILEKYAE 522

Query: 132  IGDIVIAQKIFAATRKLGYKRDIPMYMALIKTHIKANIPAYGLK 1
             GD+  A+KIF   R+ GY      + +L+K++  A +PAYG +
Sbjct: 523  RGDVANAEKIFDRMRQAGYTGSAGAFASLLKSYANARVPAYGFR 566


>gb|EXC46728.1| hypothetical protein L484_002071 [Morus notabilis]
          Length = 623

 Score =  148 bits (374), Expect = 1e-33
 Identities = 84/224 (37%), Positives = 132/224 (58%)
 Frame = -2

Query: 672  GMIGKAEETLGEAERVCNSMNEFSPRRRHIIRNLLQPYALLNREKDVDRHWKEIESDVEI 493
            G+  KAE  L E E   +++NE    RR+    LL  YA L    +V+R WK  ES+   
Sbjct: 374  GLKEKAEAVLKEMEG--DNLNEDPKVRRY----LLLLYAELGHADEVERVWKACESNPRT 427

Query: 492  PMKDYMSGISAWGMLGNIDKAERIFERLLELNHRLSGFHYSALLEVYANHNLQSKAENLM 313
               ++++ I AWG L  + KAE  FE++L+   + S  HY+ALL VYANH + ++ ++L+
Sbjct: 428  --SEFLAAIEAWGKLKKVKKAEDAFEKMLKAVKKPSAVHYNALLRVYANHKMLTRGKDLI 485

Query: 312  EHMAKNGVKEDRMIRSLLVKLYVNMGDLQKAESVLSEDYDGTKLKPTYKSLLLVLQKYAE 133
            + MA+N  K  RM    +VKLYV  G+++KA+SVL +     +LKP Y S + ++ +YA+
Sbjct: 486  KRMAENDGKISRMTLDSVVKLYVEAGEIEKADSVLQKATQQNQLKPLYVSYMAIMDEYAK 545

Query: 132  IGDIVIAQKIFAATRKLGYKRDIPMYMALIKTHIKANIPAYGLK 1
             GDI   +K+F   ++ GY      + ALI+ ++ A  PAYG++
Sbjct: 546  KGDIHNTEKMFLRMKQAGYDARFRQFQALIQAYVNAKTPAYGIR 589


>ref|XP_006832827.1| hypothetical protein AMTR_s00095p00020800, partial [Amborella
           trichopoda] gi|548837327|gb|ERM98105.1| hypothetical
           protein AMTR_s00095p00020800, partial [Amborella
           trichopoda]
          Length = 425

 Score =  148 bits (374), Expect = 1e-33
 Identities = 84/234 (35%), Positives = 133/234 (56%), Gaps = 10/234 (4%)
 Frame = -2

Query: 672 GMIGKAEETLGEAERVCNSMNEFSPRRRHIIRNLLQPYALLNREKDVDRHWKEIESDVEI 493
           G I KAE  L + E       E   R R   + LL  YA L +  +V R WK +E    +
Sbjct: 166 GFIDKAEAVLKDLE------GEDIERNRDACKALLPLYAALGKADEVSRIWKVVEPSPRL 219

Query: 492 PMKDYMSGISAWGMLGNIDKAERIFERLLELNHRLSGFHYSALLEVYANHNLQSKAENLM 313
             ++ ++ I AWG LG+ +KAE +FER+L+    +S  +YSAL++VY  H   +K ++L+
Sbjct: 220 --EECLAVIEAWGKLGDTEKAEAVFERMLKTWKNISSRYYSALIKVYTTHKQLNKGKDLV 277

Query: 312 EHMAKNGVKEDRMIRSLLVKLYVNMGDLQKAESVLSE----------DYDGTKLKPTYKS 163
           + MA NG+    +    LV+LYV  G+++KA+S+L++               +LKP Y S
Sbjct: 278 KRMADNGITIGPVTWDALVRLYVEAGEVEKADSILAKAAAQQNNSTSQSSKNRLKPLYSS 337

Query: 162 LLLVLQKYAEIGDIVIAQKIFAATRKLGYKRDIPMYMALIKTHIKANIPAYGLK 1
            +++++KY E GDI  A+KIF   R+ GY+  +  Y  L++T++ AN PAYG +
Sbjct: 338 YMVIMEKYCERGDIHNAEKIFHRLRQAGYEGRMSDYQTLLQTYVNANTPAYGFR 391


>ref|XP_006832826.1| hypothetical protein AMTR_s00095p00019470 [Amborella trichopoda]
            gi|548837326|gb|ERM98104.1| hypothetical protein
            AMTR_s00095p00019470 [Amborella trichopoda]
          Length = 637

 Score =  147 bits (371), Expect = 3e-33
 Identities = 83/234 (35%), Positives = 131/234 (55%), Gaps = 10/234 (4%)
 Frame = -2

Query: 672  GMIGKAEETLGEAERVCNSMNEFSPRRRHIIRNLLQPYALLNREKDVDRHWKEIESDVEI 493
            G I KAE  L + E       E   R R   + LL  YA L +  +V R WK +E     
Sbjct: 378  GFIDKAEAVLKDLE------GEDMERNRDACKALLPLYAALGKADEVSRIWKVVEPSPRF 431

Query: 492  PMKDYMSGISAWGMLGNIDKAERIFERLLELNHRLSGFHYSALLEVYANHNLQSKAENLM 313
               + ++ I AWG LG+ +KAE +FER+L+    +S  +YSAL++VY  H   +K ++L+
Sbjct: 432  D--ECLAAIEAWGKLGDTEKAEAVFERMLKTWKNISSRYYSALIKVYTTHKQLNKGKDLV 489

Query: 312  EHMAKNGVKEDRMIRSLLVKLYVNMGDLQKAESVLSE----------DYDGTKLKPTYKS 163
            + MA NG+    +    LV+LYV  G+++KA+S+L++               +LKP Y S
Sbjct: 490  KRMADNGITIGPVTWDALVRLYVEAGEVEKADSILAKAAAQQNNSTSQSSKNRLKPLYSS 549

Query: 162  LLLVLQKYAEIGDIVIAQKIFAATRKLGYKRDIPMYMALIKTHIKANIPAYGLK 1
             +++++KY E GDI  A+KIF   ++ GY+  +  Y  L++T++ AN PAYG +
Sbjct: 550  YMVIMEKYCERGDIHNAEKIFHRLKQAGYEGRMSDYQTLLQTYVNANTPAYGFR 603


>ref|XP_006832830.1| hypothetical protein AMTR_s00095p00027730 [Amborella trichopoda]
           gi|548837330|gb|ERM98108.1| hypothetical protein
           AMTR_s00095p00027730 [Amborella trichopoda]
          Length = 443

 Score =  147 bits (370), Expect = 4e-33
 Identities = 84/234 (35%), Positives = 131/234 (55%), Gaps = 10/234 (4%)
 Frame = -2

Query: 672 GMIGKAEETLGEAERVCNSMNEFSPRRRHIIRNLLQPYALLNREKDVDRHWKEIESDVEI 493
           G I KAE  L + E       E   R R   + LL  YA L +  +V R WK +E    +
Sbjct: 184 GFIDKAEAVLKDLE------GEDIERNRDACKALLPLYAALGKADEVSRIWKVVEPSPRL 237

Query: 492 PMKDYMSGISAWGMLGNIDKAERIFERLLELNHRLSGFHYSALLEVYANHNLQSKAENLM 313
              + ++ I AWG LG+ +KAE +FER+L+    +S  +YSAL++VY  H   +K ++L+
Sbjct: 238 D--ECLAVIEAWGKLGDTEKAEAVFERMLKTWKNISSRYYSALIKVYTTHKQLNKGKDLV 295

Query: 312 EHMAKNGVKEDRMIRSLLVKLYVNMGDLQKAESVLSE----------DYDGTKLKPTYKS 163
           + MA NG+    +    LV+LYV  G+++KA+S+L++               +LKP Y S
Sbjct: 296 KRMADNGITIGPVTWDALVRLYVEAGEVEKADSILAKAAAQQNNWTSQSSKNRLKPLYSS 355

Query: 162 LLLVLQKYAEIGDIVIAQKIFAATRKLGYKRDIPMYMALIKTHIKANIPAYGLK 1
            +++++KY E GDI  A+KIF   R+ GY   +  Y  L++T++ AN PAYG +
Sbjct: 356 YMVIMEKYCERGDIHNAEKIFHRLRQAGYVGRMSHYQTLLQTYVNANTPAYGFR 409


>ref|XP_002522167.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223538605|gb|EEF40208.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 342

 Score =  147 bits (370), Expect = 4e-33
 Identities = 84/224 (37%), Positives = 134/224 (59%)
 Frame = -2

Query: 672 GMIGKAEETLGEAERVCNSMNEFSPRRRHIIRNLLQPYALLNREKDVDRHWKEIESDVEI 493
           G+  KAE  L E E    ++ E     R   R LL  YA L +  +V+R WK  ES  ++
Sbjct: 93  GLKEKAEAILKEMEG--GNLEE----HRWACRLLLPLYAALGKADEVERVWKVCESSPQL 146

Query: 492 PMKDYMSGISAWGMLGNIDKAERIFERLLELNHRLSGFHYSALLEVYANHNLQSKAENLM 313
             ++ ++ I AWG L  IDKAE +F R+L    +LS  HYSALL+VYA+H + +  ++L+
Sbjct: 147 --EECVAVIEAWGKLKKIDKAEEVFNRMLTTWKKLSSRHYSALLKVYASHKMLANGKDLI 204

Query: 312 EHMAKNGVKEDRMIRSLLVKLYVNMGDLQKAESVLSEDYDGTKLKPTYKSLLLVLQKYAE 133
           + MA +G +   +    LVKLYV  G+++KA+SVL +      +KP + S ++++ +YA+
Sbjct: 205 KKMADSGCRIGPLTWDSLVKLYVEAGEVEKADSVLHKAAQQNHMKPMFSSYIVIMDQYAK 264

Query: 132 IGDIVIAQKIFAATRKLGYKRDIPMYMALIKTHIKANIPAYGLK 1
            GD+  A+K+F   R+ GY   +  + AL++T+I A  PAYG++
Sbjct: 265 RGDVHNAEKMFHRMRQAGYVARLRQFQALVQTYINAKAPAYGIR 308


>emb|CBI30135.3| unnamed protein product [Vitis vinifera]
          Length = 624

 Score =  145 bits (365), Expect = 1e-32
 Identities = 79/224 (35%), Positives = 134/224 (59%)
 Frame = -2

Query: 672  GMIGKAEETLGEAERVCNSMNEFSPRRRHIIRNLLQPYALLNREKDVDRHWKEIESDVEI 493
            G+  KAE  L E E    ++ E     R + R LL PYA L +  DV+R WK  ES+  +
Sbjct: 375  GLKEKAEAILKEMEG--GNLKE----NRWVCRVLLPPYAALGKADDVERIWKVCESNPRL 428

Query: 492  PMKDYMSGISAWGMLGNIDKAERIFERLLELNHRLSGFHYSALLEVYANHNLQSKAENLM 313
            P  ++++ I A+G L  +++AE IF ++ +   RLS  HYSALL+VYA+H +  K ++L+
Sbjct: 429  P--EFVAAIEAYGKLKKVEEAEAIFNKMSKTFKRLSSKHYSALLKVYADHKMLIKGKDLV 486

Query: 312  EHMAKNGVKEDRMIRSLLVKLYVNMGDLQKAESVLSEDYDGTKLKPTYKSLLLVLQKYAE 133
            + M+ +G +   +    LVKLYV  G+++KA+ +L +    + +KP + + + ++ +YA+
Sbjct: 487  KQMSDSGCRIGPLTWDALVKLYVEAGEVEKADKILQKAMQQSPIKPMFSTYMAIMDQYAK 546

Query: 132  IGDIVIAQKIFAATRKLGYKRDIPMYMALIKTHIKANIPAYGLK 1
             GDI  ++K+F   R+ GY   +  +  LI+ ++ A  PAYG+K
Sbjct: 547  RGDIHNSEKMFHQMRQSGYVSRLRQFQCLIQAYVNAKAPAYGIK 590


>ref|XP_002266581.1| PREDICTED: pentatricopeptide repeat-containing protein At1g80270,
           mitochondrial-like [Vitis vinifera]
          Length = 587

 Score =  145 bits (365), Expect = 1e-32
 Identities = 79/224 (35%), Positives = 134/224 (59%)
 Frame = -2

Query: 672 GMIGKAEETLGEAERVCNSMNEFSPRRRHIIRNLLQPYALLNREKDVDRHWKEIESDVEI 493
           G+  KAE  L E E    ++ E     R + R LL PYA L +  DV+R WK  ES+  +
Sbjct: 338 GLKEKAEAILKEMEG--GNLKE----NRWVCRVLLPPYAALGKADDVERIWKVCESNPRL 391

Query: 492 PMKDYMSGISAWGMLGNIDKAERIFERLLELNHRLSGFHYSALLEVYANHNLQSKAENLM 313
           P  ++++ I A+G L  +++AE IF ++ +   RLS  HYSALL+VYA+H +  K ++L+
Sbjct: 392 P--EFVAAIEAYGKLKKVEEAEAIFNKMSKTFKRLSSKHYSALLKVYADHKMLIKGKDLV 449

Query: 312 EHMAKNGVKEDRMIRSLLVKLYVNMGDLQKAESVLSEDYDGTKLKPTYKSLLLVLQKYAE 133
           + M+ +G +   +    LVKLYV  G+++KA+ +L +    + +KP + + + ++ +YA+
Sbjct: 450 KQMSDSGCRIGPLTWDALVKLYVEAGEVEKADKILQKAMQQSPIKPMFSTYMAIMDQYAK 509

Query: 132 IGDIVIAQKIFAATRKLGYKRDIPMYMALIKTHIKANIPAYGLK 1
            GDI  ++K+F   R+ GY   +  +  LI+ ++ A  PAYG+K
Sbjct: 510 RGDIHNSEKMFHQMRQSGYVSRLRQFQCLIQAYVNAKAPAYGIK 553


>ref|XP_002308888.2| hypothetical protein POPTR_0006s03790g [Populus trichocarpa]
            gi|550335400|gb|EEE92411.2| hypothetical protein
            POPTR_0006s03790g [Populus trichocarpa]
          Length = 623

 Score =  144 bits (364), Expect = 2e-32
 Identities = 83/224 (37%), Positives = 128/224 (57%)
 Frame = -2

Query: 672  GMIGKAEETLGEAERVCNSMNEFSPRRRHIIRNLLQPYALLNREKDVDRHWKEIESDVEI 493
            G+  KAE  L E E    ++ E     R   R +L  Y  L +  +V R WK  E    +
Sbjct: 374  GLKEKAEAILKEMEG--GNLEE----HRWACRFMLPLYGALGKADEVSRVWKFCEKSPRL 427

Query: 492  PMKDYMSGISAWGMLGNIDKAERIFERLLELNHRLSGFHYSALLEVYANHNLQSKAENLM 313
               + M+ I AWG L  ID+AE +FE + +   +LS  HYS LL+VYANH + SK ++L+
Sbjct: 428  D--ECMAAIEAWGRLKKIDEAEAVFELMSKTWKKLSSRHYSTLLKVYANHKMLSKGKDLI 485

Query: 312  EHMAKNGVKEDRMIRSLLVKLYVNMGDLQKAESVLSEDYDGTKLKPTYKSLLLVLQKYAE 133
            + M  +G +   +    LVKLYV  G+++KA+S+L++     K+KP Y S L+++++YA 
Sbjct: 486  KRMGDSGCRIGPLTWDALVKLYVEAGEVEKADSILNKAVQQNKIKPMYSSFLIIMERYAT 545

Query: 132  IGDIVIAQKIFAATRKLGYKRDIPMYMALIKTHIKANIPAYGLK 1
             GDI  A+K+F   R+ GY+  I  +  LI+ +I A  P YG++
Sbjct: 546  KGDIHNAEKMFHRMRQAGYQARIRQFQTLIQAYIIAKAPCYGMR 589


>ref|XP_006644067.1| PREDICTED: pentatricopeptide repeat-containing protein At1g80270,
           mitochondrial-like [Oryza brachyantha]
          Length = 594

 Score =  143 bits (360), Expect = 6e-32
 Identities = 80/220 (36%), Positives = 127/220 (57%)
 Frame = -2

Query: 660 KAEETLGEAERVCNSMNEFSPRRRHIIRNLLQPYALLNREKDVDRHWKEIESDVEIPMKD 481
           KAE  LG+ E    + N      R   + +L  YALL +  DV+RHWK  E++  +  ++
Sbjct: 348 KAEAILGQMEEDDITEN------RSACKFVLPLYALLGKSADVERHWKVCEANPRL--EE 399

Query: 480 YMSGISAWGMLGNIDKAERIFERLLELNHRLSGFHYSALLEVYANHNLQSKAENLMEHMA 301
            +  I A+GMLG ++KAE IFE + +    LS   Y+A+L+VYAN  L  K + L + M 
Sbjct: 400 CLPAIEAFGMLGEVEKAEEIFENMFKTWKTLSSRFYNAMLKVYANKKLFDKGKELAKRMG 459

Query: 300 KNGVKEDRMIRSLLVKLYVNMGDLQKAESVLSEDYDGTKLKPTYKSLLLVLQKYAEIGDI 121
            +G +        LVKLY+N G+++KA+S+L +     K+KP Y + L++L  Y++ GDI
Sbjct: 460 DDGCRVGPSTLDSLVKLYLNAGEVEKADSILHKLSHKDKIKPMYNTYLMLLDSYSKKGDI 519

Query: 120 VIAQKIFAATRKLGYKRDIPMYMALIKTHIKANIPAYGLK 1
             A+K+F+  R++GY   I  Y  L++ ++ A  P YG +
Sbjct: 520 HNAEKLFSKIRQMGYTGRIRQYQLLLEAYLNAKAPIYGFR 559


>ref|XP_003567340.1| PREDICTED: pentatricopeptide repeat-containing protein At1g80270,
           mitochondrial-like [Brachypodium distachyon]
          Length = 612

 Score =  142 bits (358), Expect = 9e-32
 Identities = 73/213 (34%), Positives = 125/213 (58%)
 Frame = -2

Query: 639 EAERVCNSMNEFSPRRRHIIRNLLQPYALLNREKDVDRHWKEIESDVEIPMKDYMSGISA 460
           +AE +  SM       R+  + ++  YA L ++ DV+R WK  +S+  +   + +S I A
Sbjct: 367 KAEAILESMEGDMKENRNACKMVMPLYAFLGKKDDVERIWKVCQSNTRLD--ECLSAIEA 424

Query: 459 WGMLGNIDKAERIFERLLELNHRLSGFHYSALLEVYANHNLQSKAENLMEHMAKNGVKED 280
           +G LG+++KAE +F  + +    LS  +Y+A++ VYAN NL  K + L + M ++G +  
Sbjct: 425 FGRLGDVEKAEEVFGNMFKTWKTLSSKYYNAMMRVYANQNLMDKGKELAKRMEEDGCRLG 484

Query: 279 RMIRSLLVKLYVNMGDLQKAESVLSEDYDGTKLKPTYKSLLLVLQKYAEIGDIVIAQKIF 100
                 LVKLYV+ G+++KAES+L +     K+KP Y S L++L  Y++IGD+  ++K+F
Sbjct: 485 ISTLDSLVKLYVDAGEVEKAESLLHKLSVKNKMKPQYSSYLMLLDSYSKIGDVHNSEKVF 544

Query: 99  AATRKLGYKRDIPMYMALIKTHIKANIPAYGLK 1
           +  R++GY   I  Y  L+  ++ A  P YG +
Sbjct: 545 SKLRQMGYNGRIRQYQLLLHAYLHAKAPVYGFR 577


>gb|EOY08858.1| Pentatricopeptide repeat-containing protein isoform 2, partial
           [Theobroma cacao]
          Length = 621

 Score =  139 bits (351), Expect = 6e-31
 Identities = 69/197 (35%), Positives = 124/197 (62%)
 Frame = -2

Query: 591 RHIIRNLLQPYALLNREKDVDRHWKEIESDVEIPMKDYMSGISAWGMLGNIDKAERIFER 412
           R   R LL  YA L +  +V+R WK  ES+  +  ++YM+ I AWG L  I++AE +FE 
Sbjct: 393 RWACRFLLPLYADLGKAVEVERVWKVCESNPRL--EEYMAAIEAWGKLNKIEEAEAVFEM 450

Query: 411 LLELNHRLSGFHYSALLEVYANHNLQSKAENLMEHMAKNGVKEDRMIRSLLVKLYVNMGD 232
           +L+   +L   +Y++LL+VY+NH +  K ++L++ MA +G +   +    LVKLYV  G+
Sbjct: 451 MLKTWKKLPARYYASLLKVYSNHKMLQKGKDLVKRMADDGCQIGPLTWDALVKLYVEAGE 510

Query: 231 LQKAESVLSEDYDGTKLKPTYKSLLLVLQKYAEIGDIVIAQKIFAATRKLGYKRDIPMYM 52
           ++KA+S+L +     ++KP + S + V+++Y++ GDI  ++K+F   R+ GY   +  + 
Sbjct: 511 VEKADSILQKACQQNQVKPMFSSFMAVMEQYSKRGDIHNSEKMFHRMRQAGYMARLRQFQ 570

Query: 51  ALIKTHIKANIPAYGLK 1
           +L++ ++ A  PAYG++
Sbjct: 571 SLVQAYVNAKAPAYGIR 587


>gb|EOY08857.1| Pentatricopeptide repeat-containing protein isoform 1 [Theobroma
           cacao]
          Length = 623

 Score =  139 bits (351), Expect = 6e-31
 Identities = 69/197 (35%), Positives = 124/197 (62%)
 Frame = -2

Query: 591 RHIIRNLLQPYALLNREKDVDRHWKEIESDVEIPMKDYMSGISAWGMLGNIDKAERIFER 412
           R   R LL  YA L +  +V+R WK  ES+  +  ++YM+ I AWG L  I++AE +FE 
Sbjct: 395 RWACRFLLPLYADLGKAVEVERVWKVCESNPRL--EEYMAAIEAWGKLNKIEEAEAVFEM 452

Query: 411 LLELNHRLSGFHYSALLEVYANHNLQSKAENLMEHMAKNGVKEDRMIRSLLVKLYVNMGD 232
           +L+   +L   +Y++LL+VY+NH +  K ++L++ MA +G +   +    LVKLYV  G+
Sbjct: 453 MLKTWKKLPARYYASLLKVYSNHKMLQKGKDLVKRMADDGCQIGPLTWDALVKLYVEAGE 512

Query: 231 LQKAESVLSEDYDGTKLKPTYKSLLLVLQKYAEIGDIVIAQKIFAATRKLGYKRDIPMYM 52
           ++KA+S+L +     ++KP + S + V+++Y++ GDI  ++K+F   R+ GY   +  + 
Sbjct: 513 VEKADSILQKACQQNQVKPMFSSFMAVMEQYSKRGDIHNSEKMFHRMRQAGYMARLRQFQ 572

Query: 51  ALIKTHIKANIPAYGLK 1
           +L++ ++ A  PAYG++
Sbjct: 573 SLVQAYVNAKAPAYGIR 589


>ref|XP_006307055.1| hypothetical protein CARUB_v10008643mg [Capsella rubella]
           gi|482575766|gb|EOA39953.1| hypothetical protein
           CARUB_v10008643mg [Capsella rubella]
          Length = 597

 Score =  139 bits (349), Expect = 1e-30
 Identities = 75/223 (33%), Positives = 126/223 (56%)
 Frame = -2

Query: 672 GMIGKAEETLGEAERVCNSMNEFSPRRRHIIRNLLQPYALLNREKDVDRHWKEIESDVEI 493
           G+  KAE+ L E E       E     RH+ ++LL  Y  L R  +V R WK  E     
Sbjct: 348 GLKEKAEKVLKEME------GESLEENRHVCKDLLSIYGFLQRADEVTRIWKICEEKPRY 401

Query: 492 PMKDYMSGISAWGMLGNIDKAERIFERLLELNHRLSGFHYSALLEVYANHNLQSKAENLM 313
              + ++ I A+G +  + +AE IFE++L++ HR+S   YS LL+VY +H + S+ ++L+
Sbjct: 402 --NESLAAILAFGKIDKVKEAEAIFEKILKMGHRVSSNVYSVLLKVYIDHKMVSEGKDLV 459

Query: 312 EHMAKNGVKEDRMIRSLLVKLYVNMGDLQKAESVLSEDYDGTKLKPTYKSLLLVLQKYAE 133
           + M+ +G     +    ++KLYV +G+++KAES LS+     ++KP   S + V+ +Y +
Sbjct: 460 KQMSDSGCSIGALTWDAVIKLYVEVGEVEKAESALSKATQSKQIKPLMSSFMHVMDEYVK 519

Query: 132 IGDIVIAQKIFAATRKLGYKRDIPMYMALIKTHIKANIPAYGL 4
            GD+   +KIF   ++ GY+     Y ALI+ ++ A  PAYG+
Sbjct: 520 RGDVHNTEKIFERMKQAGYQSRFRAYQALIQAYVNAKAPAYGM 562



 Score = 60.1 bits (144), Expect = 6e-07
 Identities = 51/218 (23%), Positives = 98/218 (44%), Gaps = 1/218 (0%)
 Frame = -2

Query: 657 AEETLGEAERVCNSMNEFS-PRRRHIIRNLLQPYALLNREKDVDRHWKEIESDVEIPMKD 481
           A   + + E V N M +   P        +L  Y  +N++K  D      + +++  +  
Sbjct: 242 ATSNVRKTEEVFNKMRDLGFPLSTFACDQMLILYRRVNKKKIADVLLLMEKENLKPSLNT 301

Query: 480 YMSGISAWGMLGNIDKAERIFERLLELNHRLSGFHYSALLEVYANHNLQSKAENLMEHMA 301
           Y   I   G++ +I   E+I E +      L     S +   YA+  L+ KAE +++ M 
Sbjct: 302 YKILIDIKGLMNDITGMEQILETMKSEGVELDLRAQSIIARNYASAGLKEKAEKVLKEME 361

Query: 300 KNGVKEDRMIRSLLVKLYVNMGDLQKAESVLSEDYDGTKLKPTYKSLLLVLQKYAEIGDI 121
              ++E+R +   L+ +Y   G LQ+A+ V +  +   + KP Y   L  +  + +I  +
Sbjct: 362 GESLEENRHVCKDLLSIY---GFLQRADEV-TRIWKICEEKPRYNESLAAILAFGKIDKV 417

Query: 120 VIAQKIFAATRKLGYKRDIPMYMALIKTHIKANIPAYG 7
             A+ IF    K+G++    +Y  L+K +I   + + G
Sbjct: 418 KEAEAIFEKILKMGHRVSSNVYSVLLKVYIDHKMVSEG 455


>ref|XP_002890106.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
           gi|297335948|gb|EFH66365.1| DNA binding protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 594

 Score =  138 bits (348), Expect = 1e-30
 Identities = 74/224 (33%), Positives = 124/224 (55%)
 Frame = -2

Query: 672 GMIGKAEETLGEAERVCNSMNEFSPRRRHIIRNLLQPYALLNREKDVDRHWKEIESDVEI 493
           G+  KAE+ L E E       E     RH+ ++LL  Y  L R  +V R WK  E     
Sbjct: 345 GLKEKAEKVLKEME------GESLEENRHVYKDLLSVYGFLQRADEVTRIWKICEEKPRY 398

Query: 492 PMKDYMSGISAWGMLGNIDKAERIFERLLELNHRLSGFHYSALLEVYANHNLQSKAENLM 313
              + ++ I A+G +  + +AE +FE++L+++HR+S   YS LL VY +H + S+ ++L+
Sbjct: 399 --NESLAAILAFGKIDKVKEAEAVFEKMLKMSHRVSSNVYSVLLRVYVDHKMVSEGKDLV 456

Query: 312 EHMAKNGVKEDRMIRSLLVKLYVNMGDLQKAESVLSEDYDGTKLKPTYKSLLLVLQKYAE 133
           + M  +G     +    ++KLY+  G+++KAES LS+     ++KP   S + V+ +Y  
Sbjct: 457 KQMLDSGCNIGALTLDAVIKLYLEAGEVEKAESSLSKAIQSKQIKPLMSSFMYVMGEYVR 516

Query: 132 IGDIVIAQKIFAATRKLGYKRDIPMYMALIKTHIKANIPAYGLK 1
            GD+   +KIF   ++ GY+     Y ALI+ ++ A  PAYG+K
Sbjct: 517 RGDVHNTEKIFQRMKQFGYQSRFRTYQALIQAYVNAKAPAYGMK 560


>ref|XP_006348039.1| PREDICTED: pentatricopeptide repeat-containing protein At1g80270,
            mitochondrial-like [Solanum tuberosum]
          Length = 624

 Score =  138 bits (347), Expect = 2e-30
 Identities = 78/223 (34%), Positives = 126/223 (56%)
 Frame = -2

Query: 672  GMIGKAEETLGEAERVCNSMNEFSPRRRHIIRNLLQPYALLNREKDVDRHWKEIESDVEI 493
            G+  KAE  L E E         +   R   R+LL  YA L R  +V R W+  ES+  +
Sbjct: 375  GLNEKAENVLKEME------GGDTKSTRWACRSLLPHYAALGRADEVARIWQVCESNPRL 428

Query: 492  PMKDYMSGISAWGMLGNIDKAERIFERLLELNHRLSGFHYSALLEVYANHNLQSKAENLM 313
              ++ ++ + AWG L NI++AE IF+++      LS  HYS LL +YANH + SK ++L+
Sbjct: 429  --EECVAAVDAWGKLHNIEEAEAIFDKMAAKWPTLSSKHYSVLLNIYANHKMLSKGKDLV 486

Query: 312  EHMAKNGVKEDRMIRSLLVKLYVNMGDLQKAESVLSEDYDGTKLKPTYKSLLLVLQKYAE 133
            + MA +G +   +    LV+LY+  G+++KA+S+L +  +  +L+P   S L+++ +YA+
Sbjct: 487  KRMADSGCRIGPVTWDALVRLYIEAGEVEKADSILHKAGEQNRLRPMINSYLMIMDQYAK 546

Query: 132  IGDIVIAQKIFAATRKLGYKRDIPMYMALIKTHIKANIPAYGL 4
             GDI   +K+F   R+ GY      Y  LI+ +I A  P YG+
Sbjct: 547  KGDIHNTEKMFHRMRQAGYVSRATQYQHLIRAYINAKAPCYGI 589


>ref|XP_003518031.1| PREDICTED: pentatricopeptide repeat-containing protein At1g80270,
            mitochondrial-like [Glycine max]
          Length = 609

 Score =  138 bits (347), Expect = 2e-30
 Identities = 75/224 (33%), Positives = 132/224 (58%)
 Frame = -2

Query: 672  GMIGKAEETLGEAERVCNSMNEFSPRRRHIIRNLLQPYALLNREKDVDRHWKEIESDVEI 493
            G+  KAE  L E E       E     + +   LL+ YA L +  +V+R WK  ES   +
Sbjct: 361  GLKEKAEAMLKEME------GENLKENQWVCATLLRLYANLGKADEVERIWKVCESKPRV 414

Query: 492  PMKDYMSGISAWGMLGNIDKAERIFERLLELNHRLSGFHYSALLEVYANHNLQSKAENLM 313
              +D ++ + AWG L  I++AE +FE ++    +L+  +YS LL++YAN+ + +K + L+
Sbjct: 415  --EDCLAAVEAWGKLNKIEEAEAVFE-MVSKKWKLNSKNYSVLLKIYANNKMLTKGKELV 471

Query: 312  EHMAKNGVKEDRMIRSLLVKLYVNMGDLQKAESVLSEDYDGTKLKPTYKSLLLVLQKYAE 133
            + MA +GV+   +    LVKLY+  G+++KA+S+L +     +L+P + + L +L++YA+
Sbjct: 472  KLMADSGVRIGPLTWDALVKLYIQAGEVEKADSILHKAIQQNQLQPMFTTYLAILEQYAK 531

Query: 132  IGDIVIAQKIFAATRKLGYKRDIPMYMALIKTHIKANIPAYGLK 1
             GD+  ++KIF   R+ GY   I  +  LI+ ++ A +PAYG++
Sbjct: 532  RGDVHNSEKIFLKMRQAGYTSRISQFQVLIQAYVNAKVPAYGIR 575


>gb|AAD39676.1|AC007591_41 F9L1.43 [Arabidopsis thaliana]
          Length = 623

 Score =  137 bits (346), Expect = 2e-30
 Identities = 74/224 (33%), Positives = 125/224 (55%)
 Frame = -2

Query: 672  GMIGKAEETLGEAERVCNSMNEFSPRRRHIIRNLLQPYALLNREKDVDRHWKEIESDVEI 493
            G+  KAE+ L E E       E     RH+ ++LL  Y  L RE +V R WK  E +   
Sbjct: 374  GLKEKAEKVLKEME------GESLEENRHMCKDLLSVYGYLQREDEVRRVWKICEENPRY 427

Query: 492  PMKDYMSGISAWGMLGNIDKAERIFERLLELNHRLSGFHYSALLEVYANHNLQSKAENLM 313
               + ++ I A+G +  +  AE +FE++L+++HR+S   YS LL VY +H + S+ ++L+
Sbjct: 428  --NEVLAAILAFGKIDKVKDAEAVFEKVLKMSHRVSSNVYSVLLRVYVDHKMVSEGKDLV 485

Query: 312  EHMAKNGVKEDRMIRSLLVKLYVNMGDLQKAESVLSEDYDGTKLKPTYKSLLLVLQKYAE 133
            + M+ +G     +    ++KLYV  G+++KAES LS+     ++KP   S + ++ +Y  
Sbjct: 486  KQMSDSGCNIGALTWDAVIKLYVEAGEVEKAESSLSKAIQSKQIKPLMSSFMYLMHEYVR 545

Query: 132  IGDIVIAQKIFAATRKLGYKRDIPMYMALIKTHIKANIPAYGLK 1
             GD+   +KIF   ++ GY+     Y  LI+ ++ A  PAYG+K
Sbjct: 546  RGDVHNTEKIFQRMKQAGYQSRFWAYQTLIQAYVNAKAPAYGMK 589


>tpg|DAA54205.1| TPA: hypothetical protein ZEAMMB73_351899 [Zea mays]
          Length = 613

 Score =  137 bits (346), Expect = 2e-30
 Identities = 80/214 (37%), Positives = 123/214 (57%), Gaps = 1/214 (0%)
 Frame = -2

Query: 639  EAERVCNSMN-EFSPRRRHIIRNLLQPYALLNREKDVDRHWKEIESDVEIPMKDYMSGIS 463
            +AE +  SM  +   + R   + LL  YA L     V+R WK  E +  I   + +S I 
Sbjct: 367  KAETLLESMEGDDIQKNRAACKFLLPLYAFLGNGDAVERIWKVCEDNTRID--ECLSAID 424

Query: 462  AWGMLGNIDKAERIFERLLELNHRLSGFHYSALLEVYANHNLQSKAENLMEHMAKNGVKE 283
            A+G LGN++KAE++FE +      LS  +Y+ALL+VYAN NL  K E L + M + G K 
Sbjct: 425  AFGKLGNVEKAEKVFEDMFVKWKNLSSKYYTALLKVYANQNLLDKGEELAKRMDEEGAKF 484

Query: 282  DRMIRSLLVKLYVNMGDLQKAESVLSEDYDGTKLKPTYKSLLLVLQKYAEIGDIVIAQKI 103
                 + LVKLY + G+++KAES+L +      +KP+Y S + +L  Y++ GD+  ++K+
Sbjct: 485  GTPTLNALVKLYADAGEVEKAESLLHKLSLKNNVKPSYSSYMTLLDSYSKKGDVHNSEKV 544

Query: 102  FAATRKLGYKRDIPMYMALIKTHIKANIPAYGLK 1
            F   R++GY   I MY  L+ +++ A  PAYG K
Sbjct: 545  FNKLRQIGYTGRIRMYQFLLHSYLHAKAPAYGFK 578


>ref|NP_173001.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|193806396|sp|Q9XI21.2|PPR44_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At1g15480, mitochondrial; Flags: Precursor
           gi|332191207|gb|AEE29328.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 594

 Score =  137 bits (346), Expect = 2e-30
 Identities = 74/224 (33%), Positives = 125/224 (55%)
 Frame = -2

Query: 672 GMIGKAEETLGEAERVCNSMNEFSPRRRHIIRNLLQPYALLNREKDVDRHWKEIESDVEI 493
           G+  KAE+ L E E       E     RH+ ++LL  Y  L RE +V R WK  E +   
Sbjct: 345 GLKEKAEKVLKEME------GESLEENRHMCKDLLSVYGYLQREDEVRRVWKICEENPRY 398

Query: 492 PMKDYMSGISAWGMLGNIDKAERIFERLLELNHRLSGFHYSALLEVYANHNLQSKAENLM 313
              + ++ I A+G +  +  AE +FE++L+++HR+S   YS LL VY +H + S+ ++L+
Sbjct: 399 --NEVLAAILAFGKIDKVKDAEAVFEKVLKMSHRVSSNVYSVLLRVYVDHKMVSEGKDLV 456

Query: 312 EHMAKNGVKEDRMIRSLLVKLYVNMGDLQKAESVLSEDYDGTKLKPTYKSLLLVLQKYAE 133
           + M+ +G     +    ++KLYV  G+++KAES LS+     ++KP   S + ++ +Y  
Sbjct: 457 KQMSDSGCNIGALTWDAVIKLYVEAGEVEKAESSLSKAIQSKQIKPLMSSFMYLMHEYVR 516

Query: 132 IGDIVIAQKIFAATRKLGYKRDIPMYMALIKTHIKANIPAYGLK 1
            GD+   +KIF   ++ GY+     Y  LI+ ++ A  PAYG+K
Sbjct: 517 RGDVHNTEKIFQRMKQAGYQSRFWAYQTLIQAYVNAKAPAYGMK 560


Top