BLASTX nr result

ID: Ephedra25_contig00024310 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra25_contig00024310
         (1754 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABR16456.1| unknown [Picea sitchensis]                             265   6e-68
ref|XP_006832826.1| hypothetical protein AMTR_s00095p00019470 [A...   253   1e-64
ref|XP_004136634.1| PREDICTED: pentatricopeptide repeat-containi...   246   2e-62
gb|ADN33879.1| DNA-binding protein [Cucumis melo subsp. melo]         242   4e-61
ref|XP_002322619.2| hypothetical protein POPTR_0016s03540g [Popu...   241   6e-61
ref|XP_002308888.2| hypothetical protein POPTR_0006s03790g [Popu...   241   9e-61
ref|XP_004967670.1| PREDICTED: pentatricopeptide repeat-containi...   241   9e-61
ref|XP_004162627.1| PREDICTED: pentatricopeptide repeat-containi...   241   9e-61
ref|XP_002890106.1| DNA binding protein [Arabidopsis lyrata subs...   240   1e-60
ref|XP_006430551.1| hypothetical protein CICLE_v10011296mg [Citr...   240   2e-60
ref|XP_004162629.1| PREDICTED: pentatricopeptide repeat-containi...   239   2e-60
gb|AAD39676.1|AC007591_41 F9L1.43 [Arabidopsis thaliana]              239   4e-60
ref|XP_004303594.1| PREDICTED: pentatricopeptide repeat-containi...   239   4e-60
ref|NP_173001.2| pentatricopeptide repeat-containing protein [Ar...   239   4e-60
gb|EXC46728.1| hypothetical protein L484_002071 [Morus notabilis]     238   5e-60
gb|EOY08858.1| Pentatricopeptide repeat-containing protein isofo...   238   6e-60
gb|EOY08857.1| Pentatricopeptide repeat-containing protein isofo...   238   6e-60
ref|XP_006482077.1| PREDICTED: pentatricopeptide repeat-containi...   237   1e-59
ref|XP_002889301.1| hypothetical protein ARALYDRAFT_477224 [Arab...   237   1e-59
ref|XP_006416900.1| hypothetical protein EUTSA_v10007131mg [Eutr...   234   7e-59

>gb|ABR16456.1| unknown [Picea sitchensis]
          Length = 600

 Score =  265 bits (676), Expect = 6e-68
 Identities = 161/452 (35%), Positives = 246/452 (54%), Gaps = 1/452 (0%)
 Frame = -3

Query: 1503 SDGKACKYESPNKTPMVKICFNPHDWNLGASLDKWVEQGNVIIXXXXXXXXXELCKHKFF 1324
            SD +  K E  N +P+ +   +    +L +SL++W+  GNV+           L   + F
Sbjct: 119  SDAEVEKEEVHNASPLYQTVLSCKFIDLPSSLEQWLADGNVLTRREVVITFVHLRNRRMF 178

Query: 1323 RAALQVSSWLWTKKPYQFNEDYYATHLECISKSGNATVAQKYFEEIPSDMKGIKVYRALL 1144
            +  L+VS WL  KKP++  E  YA+ L+ I+K       +KYF  IPSDM+G + Y  LL
Sbjct: 179  KRLLKVSDWLEGKKPFKKTERDYASRLDVITKILGIFKGEKYFASIPSDMRGQRAYGTLL 238

Query: 1143 KTYVARRNVKKAEFCFERMKEFGSKLLIADYNTMLLAYCLSERSKIHNFLVSMRSSSVRT 964
              Y +  NV+KAE  F++MK  G  L   +YN +LL Y   ++ KI + L  M    V+ 
Sbjct: 239  ANYSSTCNVEKAEEIFKKMKAEGFSLTAFEYNQLLLLYKRLDKKKIQDVLKMMEDEGVKP 298

Query: 963  SANTYRNLIYALGLDNDIDGMEKVIKEMTAKGLEPNAAILAFLSKHYIKFSMDRKAEKTI 784
            +  TY+ LI   G   DI GME+V + M ++ +E ++  L  L++HYI+  +  KAE  +
Sbjct: 299  TIFTYKILIDVKGWMGDIGGMEQVAENMKSEDIEMDSGTLELLARHYIRAGLAEKAEVVL 358

Query: 783  SALEKLNGKEXXXXXXXXXXXXXXXXLYGLLKKESDVQRHWKEIESDVFIPSKDYTAGIL 604
              LE ++ K+                LY  L K ++V+R WK+ E+   +   +Y  G++
Sbjct: 359  KELENVSLKD------KRSRLKMLLPLYAELGKPTEVERIWKDFEAFPALRLDEYATGVV 412

Query: 603  AWGNVGNIEKAEKIFETMLNSGQRECTPQYNALLQVYAENILQSKAEDLFEQMSMRNEKL 424
            AWG +G IEKAE  FE +LNSG++     YNALL VYA++ L  K ++L ++MS     +
Sbjct: 413  AWGKLGQIEKAEITFEKLLNSGKKLSAKHYNALLNVYADHHLLLKGKELVKRMSDNGCTI 472

Query: 423  YRLSRHALIRLFLSLGDLIKAESML-DYQGGKSRRPAYSSFFLILKEYAQKEEIRNAERV 247
                  ALIRL ++ G+L KA+S+L      K  RP Y +   IL++YA++ ++ NAE++
Sbjct: 473  EPPIWDALIRLHVNAGELEKADSILFKACNQKQLRPKYWTMVTILEKYAERGDVANAEKI 532

Query: 246  FYRMRGAGCTRRYGMYVPLLHAYIREHTPAQG 151
            F RMR AG T   G +  LL +Y     PA G
Sbjct: 533  FDRMRQAGYTGSAGAFASLLKSYANARVPAYG 564


>ref|XP_006832826.1| hypothetical protein AMTR_s00095p00019470 [Amborella trichopoda]
            gi|548837326|gb|ERM98104.1| hypothetical protein
            AMTR_s00095p00019470 [Amborella trichopoda]
          Length = 637

 Score =  253 bits (647), Expect = 1e-64
 Identities = 156/470 (33%), Positives = 254/470 (54%), Gaps = 17/470 (3%)
 Frame = -3

Query: 1509 EESDGKACKYESPNKTP---MVKICFNPHDWNLGASLDKWVEQGNVIIXXXXXXXXXELC 1339
            EE  G   + E+P K+    + K+  N    ++ +++DKWV  GNV+           L 
Sbjct: 143  EEEGGSCVEKEAPKKSVESRLFKVILNAQHQSIHSAIDKWVADGNVVNREGVWGAMFNLR 202

Query: 1338 KHKFFRAALQVSSWLWTKKPYQFNEDYYATHLECISKSGNATVAQKYFEEIPSDMKGIKV 1159
            K + F  ALQ+S WL   KP++FNE  Y + ++ I+K      A+KY E++P  ++G  +
Sbjct: 203  KRRMFARALQLSDWLDANKPFEFNERDYQSRVDLIAKVHGIHRAEKYIEKLPQPVRGEVL 262

Query: 1158 YRALLKTYVARRNVKKAEFCFERMKEFGSKLLIADYNTMLLAYCLSERSKIHNFLVSMRS 979
            YR LL      RNVKKAE  F +MK+ G K+    Y+ ++L Y   ++ KI + L+ M  
Sbjct: 263  YRTLLANCSTTRNVKKAEEVFNKMKDLGFKMTAFSYDQLILIYKRIDKKKIADVLLMMEK 322

Query: 978  SSVRTSANTYRNLIYALGLDNDIDGMEKVIKEMTAKGLEPNAAILAFLSKHYIKFSMDRK 799
             +V+ S+ TY  L+   G  +DI+GME++I  M ++GL P+    A ++K+YI      K
Sbjct: 323  DNVKPSSFTYMLLVDVKGRSSDIEGMEQIITSMKSEGLVPDIPFQATVAKYYIFGGFIDK 382

Query: 798  AEKTISALEKLNGKEXXXXXXXXXXXXXXXXLYGLLKKESDVQRHWKEIESDVFIPSKDY 619
            AE   + L+ L G++                LY  L K  +V R WK +E     PS  +
Sbjct: 383  AE---AVLKDLEGED---MERNRDACKALLPLYAALGKADEVSRIWKVVE-----PSPRF 431

Query: 618  ---TAGILAWGNVGNIEKAEKIFETMLNSGQRECTPQYNALLQVYAENILQSKAEDLFEQ 448
                A I AWG +G+ EKAE +FE ML + +   +  Y+AL++VY  +   +K +DL ++
Sbjct: 432  DECLAAIEAWGKLGDTEKAEAVFERMLKTWKNISSRYYSALIKVYTTHKQLNKGKDLVKR 491

Query: 447  MSMRNEKLYRLSRHALIRLFLSLGDLIKAESML----------DYQGGKSR-RPAYSSFF 301
            M+     +  ++  AL+RL++  G++ KA+S+L            Q  K+R +P YSS+ 
Sbjct: 492  MADNGITIGPVTWDALVRLYVEAGEVEKADSILAKAAAQQNNSTSQSSKNRLKPLYSSYM 551

Query: 300  LILKEYAQKEEIRNAERVFYRMRGAGCTRRYGMYVPLLHAYIREHTPAQG 151
            +I+++Y ++ +I NAE++F+R++ AG   R   Y  LL  Y+  +TPA G
Sbjct: 552  VIMEKYCERGDIHNAEKIFHRLKQAGYEGRMSDYQTLLQTYVNANTPAYG 601


>ref|XP_004136634.1| PREDICTED: pentatricopeptide repeat-containing protein At1g80270,
            mitochondrial-like [Cucumis sativus]
            gi|449506005|ref|XP_004162626.1| PREDICTED:
            pentatricopeptide repeat-containing protein At1g80270,
            mitochondrial-like [Cucumis sativus]
          Length = 606

 Score =  246 bits (629), Expect = 2e-62
 Identities = 146/460 (31%), Positives = 247/460 (53%), Gaps = 3/460 (0%)
 Frame = -3

Query: 1512 EEESDGKACKYESPNKTPMVKICFNPHDWNLGASLDKWVEQGNVIIXXXXXXXXXELCKH 1333
            E E++    K+     + + K  +N    ++ ++LDKWV +GN +           L + 
Sbjct: 122  EGETELAEKKFTKWVPSELTKAIWNASGLSVSSALDKWVSEGNELSWDDISSTMMSLRRR 181

Query: 1332 KFFRAALQVSSWLWTKKPYQFNEDYYATHLECISKSGNATVAQKYFEEIPSDMKGIKVYR 1153
            + F  ALQ S WL      +FNE+ YA+ L+ I+K      A+ Y  +IP   +G  +YR
Sbjct: 182  RMFGKALQFSEWLEASGQLEFNENDYASRLDLIAKVQGLHKAESYIAKIPKSFQGEVMYR 241

Query: 1152 ALLKTYVARRNVKKAEFCFERMKEFGSKLLIADYNTMLLAYCLSERSKIHNFLVSMRSSS 973
             LL  YVA  NV KAE  F +MK+    +    YN +L+ Y  ++R KI + L+ M   +
Sbjct: 242  TLLANYVAANNVNKAEEVFNKMKDLEFPMTTFAYNQVLVLYKRNDRRKIADVLLLMEKEN 301

Query: 972  VRTSANTYRNLIYALGLDNDIDGMEKVIKEMTAKGLEPNAAILAFLSKHYIKFSMDRKAE 793
            V+ S  TY+ LI A GL  DI GME+V+  M A+G+E +   L  L+KHY+   +  KA+
Sbjct: 302  VKPSPFTYKILIDAKGLSKDISGMEQVVDTMKAEGIELDVFALCLLAKHYVSCGLKDKAK 361

Query: 792  KTISALEKLNGKEXXXXXXXXXXXXXXXXLYGLLKKESDVQRHWKEIESDVFIPSKDYTA 613
             T+  +E++N K                 LYG L+ E +V+R W+  E++  I  ++  A
Sbjct: 362  ATLKEMEEINSK------GSRWPCRLLLPLYGELEMEDEVRRLWEICEANPHI--EECMA 413

Query: 612  GILAWGNVGNIEKAEKIFETMLNSGQRE--CTPQYNALLQVYAENILQSKAEDLFEQMSM 439
             I+AWG + NI +AEKIF+ ++ +  ++   T  Y  +++VY +  + +K ++L  QM+ 
Sbjct: 414  AIVAWGKLKNIHEAEKIFDKVVKTWPKKKISTKHYCTMIKVYGDCKMLTKGKELVNQMAE 473

Query: 438  RNEKLYRLSRHALIRLFLSLGDLIKAESMLDYQGGK-SRRPAYSSFFLILKEYAQKEEIR 262
                +  L+  A+++L++  G++ KA++ L     K   RP Y S+  ++  YA++ ++ 
Sbjct: 474  SGYSIDPLAWDAVVKLYVEAGEVEKADTFLVKAVKKYEMRPLYCSYRTLMNHYARRGDVH 533

Query: 261  NAERVFYRMRGAGCTRRYGMYVPLLHAYIREHTPAQGQTQ 142
            NAE++FY+MR +G    +  +  L+ AY+   TPA G  +
Sbjct: 534  NAEKIFYKMRQSGYGPWFNQFETLIQAYVNSKTPAYGMRE 573


>gb|ADN33879.1| DNA-binding protein [Cucumis melo subsp. melo]
          Length = 608

 Score =  242 bits (617), Expect = 4e-61
 Identities = 142/436 (32%), Positives = 239/436 (54%), Gaps = 7/436 (1%)
 Frame = -3

Query: 1428 WNLGA-----SLDKWVEQGNVIIXXXXXXXXXELCKHKFFRAALQVSSWLWTKKPYQFNE 1264
            WN  A     +LDKWV +G+ +           L K + F  ALQ S WL      +FNE
Sbjct: 149  WNAPALSVTSALDKWVSEGHELSRDDISSTMFGLRKRRMFGKALQFSEWLEASGQLEFNE 208

Query: 1263 DYYATHLECISKSGNATVAQKYFEEIPSDMKGIKVYRALLKTYVARRNVKKAEFCFERMK 1084
              YA+HL+ I+K      A+ Y  +IP+  +G  VYR LL  YV   +VKKAE  F RMK
Sbjct: 209  ADYASHLDLIAKVQGLHKAETYIAKIPNSFRGEAVYRTLLANYVLANDVKKAEEVFNRMK 268

Query: 1083 EFGSKLLIADYNTMLLAYCLSERSKIHNFLVSMRSSSVRTSANTYRNLIYALGLDNDIDG 904
            +    +    Y+ ML+ Y   +R +I + L  M   +V+    TY+ LI A GL NDI G
Sbjct: 269  DLEFPMTTFAYDQMLILYKRIDRRRIADILSLMEKENVKPRPFTYKILIDAKGLSNDISG 328

Query: 903  MEKVIKEMTAKGLEPNAAILAFLSKHYIKFSMDRKAEKTISALEKLNGKEXXXXXXXXXX 724
            ME+V+  M A+G++ +   L  L+KHY+   +  KA   + A E++N K           
Sbjct: 329  MEQVVDTMKAEGIKLDVDTLLLLAKHYVLGGLKDKAMPILKATEEVNSK------GSRWP 382

Query: 723  XXXXXXLYGLLKKESDVQRHWKEIESDVFIPSKDYTAGILAWGNVGNIEKAEKIFETMLN 544
                  LYG L+ E +V+R W+  E +  +  ++  A I+AWG + NI++AEKIF+ ++ 
Sbjct: 383  CRYLLPLYGELQMEDEVRRLWEICEPNPNV--EECMAAIVAWGKLKNIQEAEKIFDRVVK 440

Query: 543  SGQRECTPQYNALLQVYAENILQSKAEDLFEQMSMRNEKLYRLSRHALIRLFLSLGDLIK 364
            + +R  T  Y+ +++VY ++ + +K ++L  QM+    ++  +   A+++L++  G++ K
Sbjct: 441  TWKRLSTKHYSTMIKVYGDSKMLTKGKELVNQMAKSGCRIDPMIWDAVVKLYVEAGEVEK 500

Query: 363  AESMLDYQGGK--SRRPAYSSFFLILKEYAQKEEIRNAERVFYRMRGAGCTRRYGMYVPL 190
            A+S L ++  K    +P + S+  ++  YA+K ++ N+E++F+++R +G    +G +V L
Sbjct: 501  ADSFL-FKAVKQYGMKPLFDSYRTLMVHYARKGDVHNSEKIFHKIRQSGYPTHFGQFVTL 559

Query: 189  LHAYIREHTPAQGQTQ 142
            + AY+   TPA G  +
Sbjct: 560  VQAYLNAKTPAYGMRE 575


>ref|XP_002322619.2| hypothetical protein POPTR_0016s03540g [Populus trichocarpa]
            gi|550320743|gb|EEF04380.2| hypothetical protein
            POPTR_0016s03540g [Populus trichocarpa]
          Length = 618

 Score =  241 bits (616), Expect = 6e-61
 Identities = 146/424 (34%), Positives = 230/424 (54%), Gaps = 1/424 (0%)
 Frame = -3

Query: 1410 LDKWVEQGNVIIXXXXXXXXXELCKHKFFRAALQVSSWLWTKKPYQFNEDYYATHLECIS 1231
            LDKWV +G  +           L K + F  ALQ+S W+   K   F+E  YA+ L+ I+
Sbjct: 170  LDKWVAEGRDLDQLEISNAMFNLRKRRLFGRALQLSEWVEANKRKDFDERDYASRLDLIA 229

Query: 1230 KSGNATVAQKYFEEIPSDMKGIKVYRALLKTYVARRNVKKAEFCFERMKEFGSKLLIADY 1051
            K      A+ Y E+IP  +KG  +YR LL   V+  N KKA   F +MK+    + +  Y
Sbjct: 230  KVRGLQKAEVYIEKIPKSLKGEVIYRTLLANCVSANNAKKAVEVFNKMKDLELPITLFSY 289

Query: 1050 NTMLLAYCLSERSKIHNFLVSMRSSSVRTSANTYRNLIYALGLDNDIDGMEKVIKEMTAK 871
            N +LL Y   ++ KI + L+SM   +V+ S  TY  LI   G  NDI GME++ + M A+
Sbjct: 290  NQLLLLYKRHDKKKIADVLLSMEKENVKPSLFTYILLIDTKGQSNDIAGMEQIAETMKAE 349

Query: 870  GLEPNAAILAFLSKHYIKFSMDRKAEKTISALEKLNGKEXXXXXXXXXXXXXXXXLYGLL 691
            G+EP+    A +++HY+   +  KAE  +  +E  N +E                LYG L
Sbjct: 350  GIEPDIKTQAIMARHYVSGGLKEKAEIVLKEMEGGNLEE------HRWACQFMLPLYGTL 403

Query: 690  KKESDVQRHWKEIESDVFIPSKDYTAGILAWGNVGNIEKAEKIFETMLNSGQRECTPQYN 511
             K  +V R WK  +    +   +  A I AWG +  I +AE +FE M  + ++  +  Y+
Sbjct: 404  GKADEVSRLWKFCKKSPRL--DECMAAIEAWGQLKKIPEAEAVFELMSKTWKKLSSKHYS 461

Query: 510  ALLQVYAENILQSKAEDLFEQMSMRNEKLYRLSRHALIRLFLSLGDLIKAESMLDYQGGK 331
            ALL+VYA N + SK +DL +QM     ++  L+  ALI+L++  G++ KA+S+L+    +
Sbjct: 462  ALLKVYANNKMLSKGKDLIKQMGDSGCRIGPLTWDALIKLYVEAGEVEKADSILNKAVQQ 521

Query: 330  SR-RPAYSSFFLILKEYAQKEEIRNAERVFYRMRGAGCTRRYGMYVPLLHAYIREHTPAQ 154
            ++ +P +SS+ +I+++YA+K +I NAE++F+RMR AG   R   +  L+ AYI    P  
Sbjct: 522  NQMKPMFSSYMIIMEKYAKKGDIHNAEKMFHRMRQAGYQARSKQFQTLIQAYINAKAPCY 581

Query: 153  GQTQ 142
            G  +
Sbjct: 582  GMRE 585


>ref|XP_002308888.2| hypothetical protein POPTR_0006s03790g [Populus trichocarpa]
            gi|550335400|gb|EEE92411.2| hypothetical protein
            POPTR_0006s03790g [Populus trichocarpa]
          Length = 623

 Score =  241 bits (614), Expect = 9e-61
 Identities = 150/462 (32%), Positives = 244/462 (52%), Gaps = 1/462 (0%)
 Frame = -3

Query: 1524 VSHNEEESDGKACKYESPNKTPMVKICFNPHDWNLGASLDKWVEQGNVIIXXXXXXXXXE 1345
            +S NE +S  K+   + P  T +     +  D ++ + LDKWV +G  +           
Sbjct: 138  LSDNETDSVVKSLPRKRPT-TELFNAIVSASDVSVQSVLDKWVAEGKDLDRLEISNAMIN 196

Query: 1344 LCKHKFFRAALQVSSWLWTKKPYQFNEDYYATHLECISKSGNATVAQKYFEEIPSDMKGI 1165
            L K + F  ALQ+S W    KP +F E  YA+ L+ I+K      A+ Y ++IP   KG 
Sbjct: 197  LRKRRMFGRALQLSEWFEANKPQEFVERDYASRLDLIAKVRGLHKAEVYIDKIPKSFKGE 256

Query: 1164 KVYRALLKTYVARRNVKKAEFCFERMKEFGSKLLIADYNTMLLAYCLSERSKIHNFLVSM 985
             +YR LL   V   NVKKAE  F +M++    +     N +LL Y   ++ KI + L+ M
Sbjct: 257  VIYRTLLANCVVDHNVKKAEEVFNKMRDLEFPITPFACNQLLLLYKRLDKKKIADVLLLM 316

Query: 984  RSSSVRTSANTYRNLIYALGLDNDIDGMEKVIKEMTAKGLEPNAAILAFLSKHYIKFSMD 805
               +V+ S  TY+ LI   G  ND+ GM+++++ M A+G+EP+    A +++HY+   + 
Sbjct: 317  EKENVKPSLFTYKILIDTKGQSNDMTGMDQIVETMKAEGIEPDIRTQAIMARHYVSGGLK 376

Query: 804  RKAEKTISALEKLNGKEXXXXXXXXXXXXXXXXLYGLLKKESDVQRHWKEIESDVFIPSK 625
             KAE  +  +E  N +E                LYG L K  +V R WK  E    +   
Sbjct: 377  EKAEAILKEMEGGNLEE------HRWACRFMLPLYGALGKADEVSRVWKFCEKSPRL--D 428

Query: 624  DYTAGILAWGNVGNIEKAEKIFETMLNSGQRECTPQYNALLQVYAENILQSKAEDLFEQM 445
            +  A I AWG +  I++AE +FE M  + ++  +  Y+ LL+VYA + + SK +DL ++M
Sbjct: 429  ECMAAIEAWGRLKKIDEAEAVFELMSKTWKKLSSRHYSTLLKVYANHKMLSKGKDLIKRM 488

Query: 444  SMRNEKLYRLSRHALIRLFLSLGDLIKAESMLDYQGGKSR-RPAYSSFFLILKEYAQKEE 268
                 ++  L+  AL++L++  G++ KA+S+L+    +++ +P YSSF +I++ YA K +
Sbjct: 489  GDSGCRIGPLTWDALVKLYVEAGEVEKADSILNKAVQQNKIKPMYSSFLIIMERYATKGD 548

Query: 267  IRNAERVFYRMRGAGCTRRYGMYVPLLHAYIREHTPAQGQTQ 142
            I NAE++F+RMR AG   R   +  L+ AYI    P  G  +
Sbjct: 549  IHNAEKMFHRMRQAGYQARIRQFQTLIQAYIIAKAPCYGMRE 590


>ref|XP_004967670.1| PREDICTED: pentatricopeptide repeat-containing protein At1g80270,
            mitochondrial-like [Setaria italica]
          Length = 613

 Score =  241 bits (614), Expect = 9e-61
 Identities = 146/422 (34%), Positives = 229/422 (54%), Gaps = 1/422 (0%)
 Frame = -3

Query: 1413 SLDKWVEQGNVIIXXXXXXXXXELCKHKFFRAALQVSSWLWTKKPYQFNEDYYATHLECI 1234
            +L+KW   GNV+           L K ++F  ALQ+  W+   K  +F E  YA+ ++  
Sbjct: 163  ALEKWANDGNVLDRGELFFVLLNLRKRRWFGKALQLLEWVEESKLLEFVERDYASRVDLT 222

Query: 1233 SKSGNATVAQKYFEEIPSDMKGIKVYRALLKTYVARRNVKKAEFCFERMKEFGSKLLIAD 1054
            +K      A++Y E+IP+  +G  VYR LL   V   NV KAE  F RMK+ G +  I  
Sbjct: 223  AKVYGLHKAEQYIEKIPAAHRGEIVYRTLLANCVQEANVNKAEKVFNRMKDLGFQPTIFS 282

Query: 1053 YNTMLLAYCLSERSKIHNFLVSMRSSSVRTSANTYRNLIYALGLDNDIDGMEKVIKEMTA 874
            +N +LL Y   ++ KI + L  M   +V+ S  +Y+ L+ A G   DI+GMEKVI++M  
Sbjct: 283  FNQLLLLYKRVDKKKITDVLAMMEKENVKPSLFSYKLLVDAKGTSRDIEGMEKVIEQMET 342

Query: 873  KGLEPNAAILAFLSKHYIKFSMDRKAEKTISALEKLNGKEXXXXXXXXXXXXXXXXLYGL 694
            +G+EP+    A  ++HYI    D + EK  + LE + G +                LY  
Sbjct: 343  EGVEPDLTFKATAARHYI---FDGQREKAEALLESMEGDD---INTNRAACKILLPLYAF 396

Query: 693  LKKESDVQRHWKEIESDVFIPSKDYTAGILAWGNVGNIEKAEKIFETMLNSGQRECTPQY 514
            L K  DV+R WK  + +  +   +  + I A+G +G++EKAE++FE M    +   +  Y
Sbjct: 397  LGKNDDVERIWKVCKDNTRL--DECHSAIQAFGTLGDVEKAEEVFENMFLRWKTLSSKYY 454

Query: 513  NALLQVYAENILQSKAEDLFEQMSMRNEKLYRLSRHALIRLFLSLGDLIKAESMLDYQGG 334
            NA+L+VYA   L  K ++L ++M   + K    +  AL++L++  G++ KA+S+L     
Sbjct: 455  NAMLKVYANQNLLDKGKELAKRMDENHIKFGNTTLDALVKLYVDAGEVEKADSLLHKLSQ 514

Query: 333  KSR-RPAYSSFFLILKEYAQKEEIRNAERVFYRMRGAGCTRRYGMYVPLLHAYIREHTPA 157
            K R RP YSS+ ++L  Y++K ++ N+ERVF+++R  G T R   Y  LLHAY+    PA
Sbjct: 515  KHRIRPQYSSYLMLLDCYSKKGDVHNSERVFHKLRQIGYTGRIRQYQLLLHAYLHAKAPA 574

Query: 156  QG 151
             G
Sbjct: 575  YG 576


>ref|XP_004162627.1| PREDICTED: pentatricopeptide repeat-containing protein At1g80270,
            mitochondrial-like [Cucumis sativus]
          Length = 610

 Score =  241 bits (614), Expect = 9e-61
 Identities = 136/440 (30%), Positives = 240/440 (54%), Gaps = 1/440 (0%)
 Frame = -3

Query: 1458 MVKICFNPHDWNLGASLDKWVEQGNVIIXXXXXXXXXELCKHKFFRAALQVSSWLWTKKP 1279
            + K  +N  D+++ ++L KWV QGN +           L + + FR ALQ S WL     
Sbjct: 146  LTKAIWNAPDFSVASALVKWVSQGNKLSRDDISSTMISLRRRQMFRKALQFSEWLEANGQ 205

Query: 1278 YQFNEDYYATHLECISKSGNATVAQKYFEEIPSDMKGIKVYRALLKTYVARRNVKKAEFC 1099
             +FNE  YA+ +  I+K      A+ Y  +IP   +G  V+RALL  YV   NV+KAE  
Sbjct: 206  LEFNERDYASRVHLIAKVQGLHKAESYIAKIPKSFQGEVVHRALLANYVVANNVEKAEEV 265

Query: 1098 FERMKEFGSKLLIADYNTMLLAYCLSERSKIHNFLVSMRSSSVRTSANTYRNLIYALGLD 919
            F ++K+    + I  YN ML+ Y   +R KI + L+ M   +++    TY+ LI   GL 
Sbjct: 266  FNKIKDLEFPMSIFAYNQMLVLYKKIDRRKIADVLLLMEKENIKPCPFTYKILIDGKGLS 325

Query: 918  NDIDGMEKVIKEMTAKGLEPNAAILAFLSKHYIKFSMDRKAEKTISALEKLNGKEXXXXX 739
            NDI GME+V+  M A+G+E + + L+ L+KHY+   +  KA+  +  +E+ N        
Sbjct: 326  NDISGMEQVVDSMKAEGIELDVSTLSLLAKHYVSCGLKVKAKAILKEIEETNSN------ 379

Query: 738  XXXXXXXXXXXLYGLLKKESDVQRHWKEIESDVFIPSKDYTAGILAWGNVGNIEKAEKIF 559
                        YG L+ E +V+R W+  E++  I  ++  A I+AWG + N+++AEKIF
Sbjct: 380  GPQWLCRILLPFYGKLQMEDEVRRLWEICEANPHI--EECMAAIVAWGQLKNVQEAEKIF 437

Query: 558  ETMLNSGQRECTPQYNALLQVYAENILQSKAEDLFEQMSMRNEKLYRLSRHALIRLFLSL 379
            + ++ + ++     Y+ ++ VY ++ + +K +++  QM+     +  L+ +A+++L++  
Sbjct: 438  DRVVKTWKKLSARHYSIMMNVYRDSKMLTKGKEVVNQMAESGCHIDLLTCNAIVKLYVEA 497

Query: 378  GDLIKAESMLDYQGGK-SRRPAYSSFFLILKEYAQKEEIRNAERVFYRMRGAGCTRRYGM 202
            G++ KA+S L     K   +P ++S+  ++  YA++ ++ NAE++F +MR +    R G 
Sbjct: 498  GEVEKADSFLVKAVKKYGMKPLFTSYKTLMDHYARRGDVHNAEKIFDKMRQSSYIPRLGQ 557

Query: 201  YVPLLHAYIREHTPAQGQTQ 142
            +  L+ AY+   TPA G  +
Sbjct: 558  FGTLIQAYVNAKTPAYGMRE 577


>ref|XP_002890106.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
            gi|297335948|gb|EFH66365.1| DNA binding protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 594

 Score =  240 bits (613), Expect = 1e-60
 Identities = 148/461 (32%), Positives = 240/461 (52%), Gaps = 4/461 (0%)
 Frame = -3

Query: 1521 SHNEEESDGKACKYESP-NKTP--MVKICFNPHDWNLGASLDKWVEQGNVIIXXXXXXXX 1351
            S +E + +G   +   P +K P  + K   +    ++G++LDKWVEQG            
Sbjct: 106  SGDEGDIEGAELELHVPASKRPSELFKAIVSVSGLSVGSALDKWVEQGKDTSRTEFASAM 165

Query: 1350 XELCKHKFFRAALQVSSWLWTKKPYQFNEDYYATHLECISKSGNATVAQKYFEEIPSDMK 1171
             +L K + +  ALQ++ WL   K ++  E  YA+ L+ ISK       + Y E IP   +
Sbjct: 166  LQLRKRRMYGRALQMTEWLDDNKQFEMKERDYASRLDLISKVRGLYKGEAYIETIPESFR 225

Query: 1170 GIKVYRALLKTYVARRNVKKAEFCFERMKEFGSKLLIADYNTMLLAYCLSERSKIHNFLV 991
            G  VYR LL  YVA  NV+ AE  F +MK+ G  L     N ML+ Y   ++ KI + L+
Sbjct: 226  GELVYRTLLSNYVATSNVRTAEAVFNKMKDLGFPLSTFTCNQMLILYKRVDKKKIADVLL 285

Query: 990  SMRSSSVRTSANTYRNLIYALGLDNDIDGMEKVIKEMTAKGLEPNAAILAFLSKHYIKFS 811
             M   +++ + NTY+ LI   GL NDI GME++++ M ++G+EP+    A ++++Y    
Sbjct: 286  LMEKENLKPNLNTYKILIDTKGLSNDITGMEQIVETMKSEGVEPDLRARALIARNYASAG 345

Query: 810  MDRKAEKTISALEKLNGKEXXXXXXXXXXXXXXXXLYGLLKKESDVQRHWKEIESDVFIP 631
            +  KAEK +  +E  + +E                +YG L++  +V R WK  E      
Sbjct: 346  LKEKAEKVLKEMEGESLEE------NRHVYKDLLSVYGFLQRADEVTRIWKICEEKP--R 397

Query: 630  SKDYTAGILAWGNVGNIEKAEKIFETMLNSGQRECTPQYNALLQVYAENILQSKAEDLFE 451
              +  A ILA+G +  +++AE +FE ML    R  +  Y+ LL+VY ++ + S+ +DL +
Sbjct: 398  YNESLAAILAFGKIDKVKEAEAVFEKMLKMSHRVSSNVYSVLLRVYVDHKMVSEGKDLVK 457

Query: 450  QMSMRNEKLYRLSRHALIRLFLSLGDLIKAESMLDYQ-GGKSRRPAYSSFFLILKEYAQK 274
            QM      +  L+  A+I+L+L  G++ KAES L      K  +P  SSF  ++ EY ++
Sbjct: 458  QMLDSGCNIGALTLDAVIKLYLEAGEVEKAESSLSKAIQSKQIKPLMSSFMYVMGEYVRR 517

Query: 273  EEIRNAERVFYRMRGAGCTRRYGMYVPLLHAYIREHTPAQG 151
             ++ N E++F RM+  G   R+  Y  L+ AY+    PA G
Sbjct: 518  GDVHNTEKIFQRMKQFGYQSRFRTYQALIQAYVNAKAPAYG 558


>ref|XP_006430551.1| hypothetical protein CICLE_v10011296mg [Citrus clementina]
            gi|557532608|gb|ESR43791.1| hypothetical protein
            CICLE_v10011296mg [Citrus clementina]
          Length = 623

 Score =  240 bits (612), Expect = 2e-60
 Identities = 150/456 (32%), Positives = 246/456 (53%), Gaps = 4/456 (0%)
 Frame = -3

Query: 1506 ESDGKACKYESPNK---TPMVKICFNPHDWNLGASLDKWVEQGNVIIXXXXXXXXXELCK 1336
            E++ +A +  +P+K   + + K   +  D ++ ++L K+ E+GN +           L  
Sbjct: 140  ETETEASRKTTPSKRKFSKLFKAIMDAPDISIHSTLTKYAEEGNDLSRAEIALAMANLRT 199

Query: 1335 HKFFRAALQVSSWLWTKKPYQFNEDYYATHLECISKSGNATVAQKYFEEIPSDMKGIKVY 1156
             + +  ALQ+S WL T K   F E  YA+ L+ I+K      A+ Y ++IP   +G  VY
Sbjct: 200  RRMYGKALQLSEWLETNKKLDFIERDYASCLDLIAKLRGLQKAESYIQKIPESFRGEVVY 259

Query: 1155 RALLKTYVARRNVKKAEFCFERMKEFGSKLLIADYNTMLLAYCLSERSKIHNFLVSMRSS 976
            R LL   VA  NVKKAE  F RMK+ G  +     N +L+ Y   ++ K+ + L+ M   
Sbjct: 260  RTLLANCVAGNNVKKAEEVFNRMKDKGFPVTSFACNQLLILYKRLDKKKVADVLLLMEKE 319

Query: 975  SVRTSANTYRNLIYALGLDNDIDGMEKVIKEMTAKGLEPNAAILAFLSKHYIKFSMDRKA 796
            +V+ +  +Y+ LI   G  ND+ GM++V++ M ++G+EP+++  A L+KHY+      KA
Sbjct: 320  NVKLTQFSYKILIDIKGQSNDLTGMDQVVEAMKSEGIEPDSSTQAILAKHYVSGGRKEKA 379

Query: 795  EKTISALEKLNGKEXXXXXXXXXXXXXXXXLYGLLKKESDVQRHWKEIESDVFIPSKDYT 616
            E  +  +E  N KE                LY  L K   V R WK  ES+ ++      
Sbjct: 380  EAMLKEMEGDNLKE------HRWTCRLLLPLYAELGKADQVARIWKLCESNPWLDV--CM 431

Query: 615  AGILAWGNVGNIEKAEKIFETMLNSGQRECTPQYNALLQVYAENILQSKAEDLFEQMSMR 436
            A I AWG +  +E+AE +F+ M  + ++  T  Y ALL+VYA++ + SK +DL +QM+  
Sbjct: 432  AAIEAWGKLNKVEEAEAVFKRMSKTWKKLSTKHYTALLKVYADHKMLSKGKDLVKQMAES 491

Query: 435  NEKLYRLSRHALIRLFLSLGDLIKAES-MLDYQGGKSRRPAYSSFFLILKEYAQKEEIRN 259
               +  L+  AL++L +  G++ KA+S +L  Q     +P +SS+ LI+ +YA++ +I N
Sbjct: 492  GCHIGPLTWDALVKLHVEGGEVEKADSILLKAQQQNKFKPMFSSYMLIMDQYAKRGDIHN 551

Query: 258  AERVFYRMRGAGCTRRYGMYVPLLHAYIREHTPAQG 151
             E++F+RMR  G   R+  +  L+ AYI   TPA G
Sbjct: 552  TEKIFHRMRQVGYVARFKQFQTLVQAYINAKTPAYG 587


>ref|XP_004162629.1| PREDICTED: pentatricopeptide repeat-containing protein At1g80270,
            mitochondrial-like [Cucumis sativus]
          Length = 608

 Score =  239 bits (611), Expect = 2e-60
 Identities = 137/423 (32%), Positives = 233/423 (55%), Gaps = 1/423 (0%)
 Frame = -3

Query: 1416 ASLDKWVEQGNVIIXXXXXXXXXELCKHKFFRAALQVSSWLWTKKPYQFNEDYYATHLEC 1237
            ++LDKWV +G  +           L K + F  ALQ S WL      +FN+  YA+ L+ 
Sbjct: 158  SALDKWVSEGKDLSRAEISLAMLHLRKRRMFGKALQFSEWLEANGQLEFNQRDYASRLDL 217

Query: 1236 ISKSGNATVAQKYFEEIPSDMKGIKVYRALLKTYVARRNVKKAEFCFERMKEFGSKLLIA 1057
            I+K      A+ Y  +IP   +G  ++R LL  YVA  NVKKAE  F +MK+    +   
Sbjct: 218  IAKVQGLPKAESYIAKIPQSFQGEVIHRTLLANYVAANNVKKAEEVFNKMKDLEFPMTPF 277

Query: 1056 DYNTMLLAYCLSERSKIHNFLVSMRSSSVRTSANTYRNLIYALGLDNDIDGMEKVIKEMT 877
             ++ ML+ Y   ++ K+ + L  M   +V+ S  TY+ LI A GL NDI GME+V+  M 
Sbjct: 278  AHDQMLILYKRIDKRKLADILSLMEKENVKPSPFTYKILIDAKGLCNDISGMEQVVDSMK 337

Query: 876  AKGLEPNAAILAFLSKHYIKFSMDRKAEKTISALEKLNGKEXXXXXXXXXXXXXXXXLYG 697
            A+G++P+ + L+ L+KHY+   +  KA+  +  +E+ N K                 LYG
Sbjct: 338  AEGIKPDVSTLSLLAKHYVSNGLKDKAKVILKDMEENNSK------GSRLPCRILLPLYG 391

Query: 696  LLKKESDVQRHWKEIESDVFIPSKDYTAGILAWGNVGNIEKAEKIFETMLNSGQRECTPQ 517
             L+ E +V+R WK  E++  +  ++  A I+AWG + N+++AEKIF+  + + ++  T  
Sbjct: 392  ALQMEDEVRRLWKICEANPHM--EESMAAIVAWGKLKNVQEAEKIFDRFVKTWKKPSTRH 449

Query: 516  YNALLQVYAENILQSKAEDLFEQMSMRNEKLYRLSRHALIRLFLSLGDLIKAESMLDYQG 337
            YN ++ VY  + + +K ++L  QM+    ++  L+  A+++L++  G++ KA+S L    
Sbjct: 450  YNTMMNVYGGSKMLTKGKELVNQMAESGCRMDELTWDAVVKLYVEAGEVEKADSFLVKAV 509

Query: 336  GK-SRRPAYSSFFLILKEYAQKEEIRNAERVFYRMRGAGCTRRYGMYVPLLHAYIREHTP 160
             K   +P ++S+  ++  YA++ ++ NAE++F +M  +G   R G +  LL AY+   TP
Sbjct: 510  QKYGMKPLFTSYKTLMDHYARRGDVHNAEKIFDKMIQSGFVPRLGQFGTLLQAYVNSKTP 569

Query: 159  AQG 151
            A G
Sbjct: 570  AYG 572


>gb|AAD39676.1|AC007591_41 F9L1.43 [Arabidopsis thaliana]
          Length = 623

 Score =  239 bits (609), Expect = 4e-60
 Identities = 143/447 (31%), Positives = 234/447 (52%), Gaps = 1/447 (0%)
 Frame = -3

Query: 1479 ESPNKTPMVKICFNPHDWNLGASLDKWVEQGNVIIXXXXXXXXXELCKHKFFRAALQVSS 1300
            ES   + M K   +    ++G++LDKWVEQG             +L K + F  ALQ++ 
Sbjct: 152  ESKRPSEMFKAIVSVSGLSVGSALDKWVEQGKDTNRKEFESAMLQLRKRRMFGRALQMTE 211

Query: 1299 WLWTKKPYQFNEDYYATHLECISKSGNATVAQKYFEEIPSDMKGIKVYRALLKTYVARRN 1120
            WL   K ++  E  YA  L+ ISK       + Y + IP   +G  VYR LL  +VA  N
Sbjct: 212  WLDENKQFEMEERDYACRLDLISKVRGWYKGEAYIKTIPESFRGELVYRTLLANHVATSN 271

Query: 1119 VKKAEFCFERMKEFGSKLLIADYNTMLLAYCLSERSKIHNFLVSMRSSSVRTSANTYRNL 940
            V+ AE  F +MK+ G  L     N ML+ Y   ++ KI + L+ +   +++ + NTY+ L
Sbjct: 272  VRTAEAVFNKMKDLGFPLSTFTCNQMLILYKRVDKKKIADVLLLLEKENLKPNLNTYKIL 331

Query: 939  IYALGLDNDIDGMEKVIKEMTAKGLEPNAAILAFLSKHYIKFSMDRKAEKTISALEKLNG 760
            I   G  NDI GME++++ M ++G+E +    A +++HY    +  KAEK +  +E  + 
Sbjct: 332  IDTKGSSNDITGMEQIVETMKSEGVELDLRARALIARHYASAGLKEKAEKVLKEMEGESL 391

Query: 759  KEXXXXXXXXXXXXXXXXLYGLLKKESDVQRHWKEIESDVFIPSKDYTAGILAWGNVGNI 580
            +E                +YG L++E +V+R WK  E +      +  A ILA+G +  +
Sbjct: 392  EE------NRHMCKDLLSVYGYLQREDEVRRVWKICEENP--RYNEVLAAILAFGKIDKV 443

Query: 579  EKAEKIFETMLNSGQRECTPQYNALLQVYAENILQSKAEDLFEQMSMRNEKLYRLSRHAL 400
            + AE +FE +L    R  +  Y+ LL+VY ++ + S+ +DL +QMS     +  L+  A+
Sbjct: 444  KDAEAVFEKVLKMSHRVSSNVYSVLLRVYVDHKMVSEGKDLVKQMSDSGCNIGALTWDAV 503

Query: 399  IRLFLSLGDLIKAESMLDYQ-GGKSRRPAYSSFFLILKEYAQKEEIRNAERVFYRMRGAG 223
            I+L++  G++ KAES L      K  +P  SSF  ++ EY ++ ++ N E++F RM+ AG
Sbjct: 504  IKLYVEAGEVEKAESSLSKAIQSKQIKPLMSSFMYLMHEYVRRGDVHNTEKIFQRMKQAG 563

Query: 222  CTRRYGMYVPLLHAYIREHTPAQGQTQ 142
               R+  Y  L+ AY+    PA G  +
Sbjct: 564  YQSRFWAYQTLIQAYVNAKAPAYGMKE 590


>ref|XP_004303594.1| PREDICTED: pentatricopeptide repeat-containing protein At1g80270,
            mitochondrial-like [Fragaria vesca subsp. vesca]
          Length = 625

 Score =  239 bits (609), Expect = 4e-60
 Identities = 144/427 (33%), Positives = 232/427 (54%), Gaps = 1/427 (0%)
 Frame = -3

Query: 1428 WNLGASLDKWVEQGNVIIXXXXXXXXXELCKHKFFRAALQVSSWLWTKKPYQFNEDYYAT 1249
            +++ ++LDKWV++GN +           L + + F  ALQ+S WL   K  +F+E  YA+
Sbjct: 171  FSVHSALDKWVKEGNDLSRAEISLTKINLRRRRMFGRALQLSEWLEAHKQIEFSERDYAS 230

Query: 1248 HLECISKSGNATVAQKYFEEIPSDMKGIKVYRALLKTYVARRNVKKAEFCFERMKEFGSK 1069
             L+ I+K      A+KY E IP   +G K+YR LL   V   N+KKAE  F +MK+    
Sbjct: 231  RLDLIAKVYGLWKAEKYVEMIPQSFRGEKIYRTLLVNCVGANNLKKAEEIFNKMKDLEFP 290

Query: 1068 LLIADYNTMLLAYCLSERSKIHNFLVSMRSSSVRTSANTYRNLIYALGLDNDIDGMEKVI 889
                  N +LL Y   ++ KI + L+ M   +V+ +A TY+ LI   G  NDI GME+V 
Sbjct: 291  FTSFTCNQLLLLYKRLDKKKIADVLLFMEKENVKPTAFTYKLLIATKGESNDIAGMEQVY 350

Query: 888  KEMTAKGLEPNAAILAFLSKHYIKFSMDRKAEKTISALEKLNGKEXXXXXXXXXXXXXXX 709
              M ++G+EP+  + A ++KHY    +  KAE  +  +E  N KE               
Sbjct: 351  GTMKSEGIEPDITVKAVMAKHYASGGLKEKAEAVLKEMEGGNLKE------NRWACRALL 404

Query: 708  XLYGLLKKESDVQRHWKEIESDVFIPSKDYTAGILAWGNVGNIEKAEKIFETMLNSGQRE 529
             LY  L +  +V R W       ++  ++  A I AWG +  IE+AE +F+ ML + +R 
Sbjct: 405  PLYAELGQVDEVGRVWNVCMPKPWV--EECMAAIEAWGKLNKIEEAEAVFDKMLKTWKRL 462

Query: 528  CTPQYNALLQVYAENILQSKAEDLFEQMSMRNEKLYRLSRHALIRLFLSLGDLIKAESML 349
             + QY  LLQVY  + +  K +DL ++M   N ++  L   AL++L++ +G++ KA+S+L
Sbjct: 463  SSRQYAVLLQVYTNHKMIEKGKDLVKRMVDNNCEVDPLIWDALVKLYVEIGEVEKADSIL 522

Query: 348  DYQGGKS-RRPAYSSFFLILKEYAQKEEIRNAERVFYRMRGAGCTRRYGMYVPLLHAYIR 172
                 K+ ++P +SS+  ++ +Y+++ +I N+E++FYRMR  G T R   +  L+ AYI 
Sbjct: 523  RKAAEKNHKKPMFSSYMALMDQYSKRGDIHNSEKIFYRMRRDGYTARLRQFQSLVQAYIN 582

Query: 171  EHTPAQG 151
               PA G
Sbjct: 583  AKAPAYG 589


>ref|NP_173001.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|193806396|sp|Q9XI21.2|PPR44_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g15480, mitochondrial; Flags: Precursor
            gi|332191207|gb|AEE29328.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 594

 Score =  239 bits (609), Expect = 4e-60
 Identities = 143/447 (31%), Positives = 234/447 (52%), Gaps = 1/447 (0%)
 Frame = -3

Query: 1479 ESPNKTPMVKICFNPHDWNLGASLDKWVEQGNVIIXXXXXXXXXELCKHKFFRAALQVSS 1300
            ES   + M K   +    ++G++LDKWVEQG             +L K + F  ALQ++ 
Sbjct: 123  ESKRPSEMFKAIVSVSGLSVGSALDKWVEQGKDTNRKEFESAMLQLRKRRMFGRALQMTE 182

Query: 1299 WLWTKKPYQFNEDYYATHLECISKSGNATVAQKYFEEIPSDMKGIKVYRALLKTYVARRN 1120
            WL   K ++  E  YA  L+ ISK       + Y + IP   +G  VYR LL  +VA  N
Sbjct: 183  WLDENKQFEMEERDYACRLDLISKVRGWYKGEAYIKTIPESFRGELVYRTLLANHVATSN 242

Query: 1119 VKKAEFCFERMKEFGSKLLIADYNTMLLAYCLSERSKIHNFLVSMRSSSVRTSANTYRNL 940
            V+ AE  F +MK+ G  L     N ML+ Y   ++ KI + L+ +   +++ + NTY+ L
Sbjct: 243  VRTAEAVFNKMKDLGFPLSTFTCNQMLILYKRVDKKKIADVLLLLEKENLKPNLNTYKIL 302

Query: 939  IYALGLDNDIDGMEKVIKEMTAKGLEPNAAILAFLSKHYIKFSMDRKAEKTISALEKLNG 760
            I   G  NDI GME++++ M ++G+E +    A +++HY    +  KAEK +  +E  + 
Sbjct: 303  IDTKGSSNDITGMEQIVETMKSEGVELDLRARALIARHYASAGLKEKAEKVLKEMEGESL 362

Query: 759  KEXXXXXXXXXXXXXXXXLYGLLKKESDVQRHWKEIESDVFIPSKDYTAGILAWGNVGNI 580
            +E                +YG L++E +V+R WK  E +      +  A ILA+G +  +
Sbjct: 363  EE------NRHMCKDLLSVYGYLQREDEVRRVWKICEENP--RYNEVLAAILAFGKIDKV 414

Query: 579  EKAEKIFETMLNSGQRECTPQYNALLQVYAENILQSKAEDLFEQMSMRNEKLYRLSRHAL 400
            + AE +FE +L    R  +  Y+ LL+VY ++ + S+ +DL +QMS     +  L+  A+
Sbjct: 415  KDAEAVFEKVLKMSHRVSSNVYSVLLRVYVDHKMVSEGKDLVKQMSDSGCNIGALTWDAV 474

Query: 399  IRLFLSLGDLIKAESMLDYQ-GGKSRRPAYSSFFLILKEYAQKEEIRNAERVFYRMRGAG 223
            I+L++  G++ KAES L      K  +P  SSF  ++ EY ++ ++ N E++F RM+ AG
Sbjct: 475  IKLYVEAGEVEKAESSLSKAIQSKQIKPLMSSFMYLMHEYVRRGDVHNTEKIFQRMKQAG 534

Query: 222  CTRRYGMYVPLLHAYIREHTPAQGQTQ 142
               R+  Y  L+ AY+    PA G  +
Sbjct: 535  YQSRFWAYQTLIQAYVNAKAPAYGMKE 561


>gb|EXC46728.1| hypothetical protein L484_002071 [Morus notabilis]
          Length = 623

 Score =  238 bits (608), Expect = 5e-60
 Identities = 149/460 (32%), Positives = 246/460 (53%), Gaps = 3/460 (0%)
 Frame = -3

Query: 1521 SHNEEESDGKACKYESPNK--TPMVKICFNPHDWNLGASLDKWVEQGNVIIXXXXXXXXX 1348
            S NE E D +A   ES +     +++         L  +LD+WV++GN +          
Sbjct: 136  SKNELEMDTEAGLSESVSSRTNSLLRTVMVTRGMPLPKALDEWVKEGNDLSRGVILFVIR 195

Query: 1347 ELCKHKFFRAALQVSSWLWTKKPYQFNEDYYATHLECISKSGNATVAQKYFEEIPSDMKG 1168
            +L K++ +  ALQ S WL ++K  +  E  YA+ ++ I+K      A+K+ E+IP  +KG
Sbjct: 196  KLRKYRMYGKALQFSEWLESRKKLELVERDYASRVDLIAKVYGPQRAEKFIEKIPKSLKG 255

Query: 1167 IKVYRALLKTYVARRNVKKAEFCFERMKEFGSKLLIADYNTMLLAYCLSERSKIHNFLVS 988
              VYR LL   V   N KKAE  F ++K+    L     N +LL Y  ++R KI + L+ 
Sbjct: 256  ELVYRTLLANCVQANNSKKAEEVFNKIKDLELPLTSFTCNQLLLLYKRTDRKKIADVLLL 315

Query: 987  MRSSSVRTSANTYRNLIYALGLDNDIDGMEKVIKEMTAKGLEPNAAILAFLSKHYIKFSM 808
            M   +V+    TYR LI   G  NDI GME +++ M A+G+E +    A L+++Y    +
Sbjct: 316  MEKENVKPCLMTYRILIDVKGQSNDITGMEHILETMKAEGIETDTHTKAVLARNYAIAGL 375

Query: 807  DRKAEKTISALEKLNGKEXXXXXXXXXXXXXXXXLYGLLKKESDVQRHWKEIESDVFIPS 628
              KAE  +  +E  N  E                LY  L    +V+R WK  ES+    +
Sbjct: 376  KEKAEAVLKEMEGDNLNE------DPKVRRYLLLLYAELGHADEVERVWKACESNP--RT 427

Query: 627  KDYTAGILAWGNVGNIEKAEKIFETMLNSGQRECTPQYNALLQVYAENILQSKAEDLFEQ 448
             ++ A I AWG +  ++KAE  FE ML + ++     YNALL+VYA + + ++ +DL ++
Sbjct: 428  SEFLAAIEAWGKLKKVKKAEDAFEKMLKAVKKPSAVHYNALLRVYANHKMLTRGKDLIKR 487

Query: 447  MSMRNEKLYRLSRHALIRLFLSLGDLIKAESMLDYQGGKSR-RPAYSSFFLILKEYAQKE 271
            M+  + K+ R++  ++++L++  G++ KA+S+L     +++ +P Y S+  I+ EYA+K 
Sbjct: 488  MAENDGKISRMTLDSVVKLYVEAGEIEKADSVLQKATQQNQLKPLYVSYMAIMDEYAKKG 547

Query: 270  EIRNAERVFYRMRGAGCTRRYGMYVPLLHAYIREHTPAQG 151
            +I N E++F RM+ AG   R+  +  L+ AY+   TPA G
Sbjct: 548  DIHNTEKMFLRMKQAGYDARFRQFQALIQAYVNAKTPAYG 587


>gb|EOY08858.1| Pentatricopeptide repeat-containing protein isoform 2, partial
            [Theobroma cacao]
          Length = 621

 Score =  238 bits (607), Expect = 6e-60
 Identities = 140/421 (33%), Positives = 231/421 (54%), Gaps = 1/421 (0%)
 Frame = -3

Query: 1410 LDKWVEQGNVIIXXXXXXXXXELCKHKFFRAALQVSSWLWTKKPYQFNEDYYATHLECIS 1231
            LDKW+E+G              L K + +  ALQ+S WL   K   F E  YA+ L+ I+
Sbjct: 173  LDKWLEEGKAFNRTEISVAMLNLRKRRMYGRALQLSEWLEANKQLDFTERDYASRLDLIA 232

Query: 1230 KSGNATVAQKYFEEIPSDMKGIKVYRALLKTYVARRNVKKAEFCFERMKEFGSKLLIADY 1051
            K      A+ Y E+IP   +G  +YR LL   V   NVKKAE  F +M++    +     
Sbjct: 233  KVRGLLKAEMYIEKIPKSFRGEVIYRTLLANCVVANNVKKAEEVFNKMRDLELPITSFSC 292

Query: 1050 NTMLLAYCLSERSKIHNFLVSMRSSSVRTSANTYRNLIYALGLDNDIDGMEKVIKEMTAK 871
            N +LL Y   ++ KI + L+ M   +V+ S  TY+ LI   GL NDI GM+++++ M A+
Sbjct: 293  NQLLLLYKRLDKKKIADVLLLMEKENVKPSLFTYKILIDTKGLSNDITGMDQIVETMKAE 352

Query: 870  GLEPNAAILAFLSKHYIKFSMDRKAEKTISALEKLNGKEXXXXXXXXXXXXXXXXLYGLL 691
            G+EP+    + L+KHY+   +  KA + +  +E  N KE                LY  L
Sbjct: 353  GVEPDIHTQSILAKHYVSGGLTEKAVEVLKGMEGDNIKE------NRWACRFLLPLYADL 406

Query: 690  KKESDVQRHWKEIESDVFIPSKDYTAGILAWGNVGNIEKAEKIFETMLNSGQRECTPQYN 511
             K  +V+R WK  ES+  +  ++Y A I AWG +  IE+AE +FE ML + ++     Y 
Sbjct: 407  GKAVEVERVWKVCESNPRL--EEYMAAIEAWGKLNKIEEAEAVFEMMLKTWKKLPARYYA 464

Query: 510  ALLQVYAENILQSKAEDLFEQMSMRNEKLYRLSRHALIRLFLSLGDLIKAESMLDYQGGK 331
            +LL+VY+ + +  K +DL ++M+    ++  L+  AL++L++  G++ KA+S+L     +
Sbjct: 465  SLLKVYSNHKMLQKGKDLVKRMADDGCQIGPLTWDALVKLYVEAGEVEKADSILQKACQQ 524

Query: 330  SR-RPAYSSFFLILKEYAQKEEIRNAERVFYRMRGAGCTRRYGMYVPLLHAYIREHTPAQ 154
            ++ +P +SSF  ++++Y+++ +I N+E++F+RMR AG   R   +  L+ AY+    PA 
Sbjct: 525  NQVKPMFSSFMAVMEQYSKRGDIHNSEKMFHRMRQAGYMARLRQFQSLVQAYVNAKAPAY 584

Query: 153  G 151
            G
Sbjct: 585  G 585


>gb|EOY08857.1| Pentatricopeptide repeat-containing protein isoform 1 [Theobroma
            cacao]
          Length = 623

 Score =  238 bits (607), Expect = 6e-60
 Identities = 140/421 (33%), Positives = 231/421 (54%), Gaps = 1/421 (0%)
 Frame = -3

Query: 1410 LDKWVEQGNVIIXXXXXXXXXELCKHKFFRAALQVSSWLWTKKPYQFNEDYYATHLECIS 1231
            LDKW+E+G              L K + +  ALQ+S WL   K   F E  YA+ L+ I+
Sbjct: 175  LDKWLEEGKAFNRTEISVAMLNLRKRRMYGRALQLSEWLEANKQLDFTERDYASRLDLIA 234

Query: 1230 KSGNATVAQKYFEEIPSDMKGIKVYRALLKTYVARRNVKKAEFCFERMKEFGSKLLIADY 1051
            K      A+ Y E+IP   +G  +YR LL   V   NVKKAE  F +M++    +     
Sbjct: 235  KVRGLLKAEMYIEKIPKSFRGEVIYRTLLANCVVANNVKKAEEVFNKMRDLELPITSFSC 294

Query: 1050 NTMLLAYCLSERSKIHNFLVSMRSSSVRTSANTYRNLIYALGLDNDIDGMEKVIKEMTAK 871
            N +LL Y   ++ KI + L+ M   +V+ S  TY+ LI   GL NDI GM+++++ M A+
Sbjct: 295  NQLLLLYKRLDKKKIADVLLLMEKENVKPSLFTYKILIDTKGLSNDITGMDQIVETMKAE 354

Query: 870  GLEPNAAILAFLSKHYIKFSMDRKAEKTISALEKLNGKEXXXXXXXXXXXXXXXXLYGLL 691
            G+EP+    + L+KHY+   +  KA + +  +E  N KE                LY  L
Sbjct: 355  GVEPDIHTQSILAKHYVSGGLTEKAVEVLKGMEGDNIKE------NRWACRFLLPLYADL 408

Query: 690  KKESDVQRHWKEIESDVFIPSKDYTAGILAWGNVGNIEKAEKIFETMLNSGQRECTPQYN 511
             K  +V+R WK  ES+  +  ++Y A I AWG +  IE+AE +FE ML + ++     Y 
Sbjct: 409  GKAVEVERVWKVCESNPRL--EEYMAAIEAWGKLNKIEEAEAVFEMMLKTWKKLPARYYA 466

Query: 510  ALLQVYAENILQSKAEDLFEQMSMRNEKLYRLSRHALIRLFLSLGDLIKAESMLDYQGGK 331
            +LL+VY+ + +  K +DL ++M+    ++  L+  AL++L++  G++ KA+S+L     +
Sbjct: 467  SLLKVYSNHKMLQKGKDLVKRMADDGCQIGPLTWDALVKLYVEAGEVEKADSILQKACQQ 526

Query: 330  SR-RPAYSSFFLILKEYAQKEEIRNAERVFYRMRGAGCTRRYGMYVPLLHAYIREHTPAQ 154
            ++ +P +SSF  ++++Y+++ +I N+E++F+RMR AG   R   +  L+ AY+    PA 
Sbjct: 527  NQVKPMFSSFMAVMEQYSKRGDIHNSEKMFHRMRQAGYMARLRQFQSLVQAYVNAKAPAY 586

Query: 153  G 151
            G
Sbjct: 587  G 587


>ref|XP_006482077.1| PREDICTED: pentatricopeptide repeat-containing protein At1g80270,
            mitochondrial-like [Citrus sinensis]
          Length = 623

 Score =  237 bits (604), Expect = 1e-59
 Identities = 149/456 (32%), Positives = 244/456 (53%), Gaps = 4/456 (0%)
 Frame = -3

Query: 1506 ESDGKACKYESPNKTPMVKI---CFNPHDWNLGASLDKWVEQGNVIIXXXXXXXXXELCK 1336
            E++ +  +  +P+K  + K+     +  D +  ++L  +VE+GN +           L  
Sbjct: 140  ETETEPSRKTTPSKRKLSKLFKAIMDAPDISNNSTLTTYVEEGNDLSRAEISLAMANLRT 199

Query: 1335 HKFFRAALQVSSWLWTKKPYQFNEDYYATHLECISKSGNATVAQKYFEEIPSDMKGIKVY 1156
             + +  ALQ S WL T K   F E  YA+ L+ I+K      A+ Y ++IP   +G  VY
Sbjct: 200  RRMYGKALQFSEWLETNKKLDFIERDYASRLDLIAKLRGLQKAESYIQKIPESFRGEVVY 259

Query: 1155 RALLKTYVARRNVKKAEFCFERMKEFGSKLLIADYNTMLLAYCLSERSKIHNFLVSMRSS 976
            R LL + VA  NVKKAE  F RMK+ G  +     N +L+ Y   ++ K+ + L+ M   
Sbjct: 260  RTLLASCVAGNNVKKAEEVFNRMKDKGFPVTSFACNQLLILYKRLDKKKVADVLLLMEKE 319

Query: 975  SVRTSANTYRNLIYALGLDNDIDGMEKVIKEMTAKGLEPNAAILAFLSKHYIKFSMDRKA 796
            +V+ +  +Y+ LI   G  ND+ GM++V++ M ++G+EP++  LA L+KHY+      KA
Sbjct: 320  NVKLTQFSYKILIDIKGRSNDLTGMDQVVEAMKSEGIEPDSGTLAILAKHYVSGGRKEKA 379

Query: 795  EKTISALEKLNGKEXXXXXXXXXXXXXXXXLYGLLKKESDVQRHWKEIESDVFIPSKDYT 616
            E  +  +E  N KE                LY  L K   V R WK  ES+ ++      
Sbjct: 380  EAILKEMEGDNLKE------HRWTCRLLLPLYAELGKADQVARIWKLCESNPWLDV--CM 431

Query: 615  AGILAWGNVGNIEKAEKIFETMLNSGQRECTPQYNALLQVYAENILQSKAEDLFEQMSMR 436
            A I AWG +  +E+AE +F+ M  + ++  T  Y ALL+VYA++ + SK +DL +QM+  
Sbjct: 432  AAIEAWGKLNKVEEAEAVFKKMSKTWKKLSTKHYTALLKVYADHKMLSKGKDLVKQMAES 491

Query: 435  NEKLYRLSRHALIRLFLSLGDLIKAES-MLDYQGGKSRRPAYSSFFLILKEYAQKEEIRN 259
               +  L+  AL++L +  G++ KA+S +L  Q     +P +SS+ LI+ +YA++ +I +
Sbjct: 492  GCHIGPLAWDALVKLHVEGGEVEKADSILLKAQQQNKFKPMFSSYMLIMDQYAKRGDIHS 551

Query: 258  AERVFYRMRGAGCTRRYGMYVPLLHAYIREHTPAQG 151
             E++F+RMR  G   R+  +  L+ AYI   TPA G
Sbjct: 552  TEKIFHRMRQVGYVARFKQFQTLVQAYINAKTPAYG 587


>ref|XP_002889301.1| hypothetical protein ARALYDRAFT_477224 [Arabidopsis lyrata subsp.
            lyrata] gi|297335142|gb|EFH65560.1| hypothetical protein
            ARALYDRAFT_477224 [Arabidopsis lyrata subsp. lyrata]
          Length = 597

 Score =  237 bits (604), Expect = 1e-59
 Identities = 148/461 (32%), Positives = 238/461 (51%), Gaps = 7/461 (1%)
 Frame = -3

Query: 1512 EEESDGKACKYESPNKT------PMVKICFNPHDWNLGASLDKWVEQGNVIIXXXXXXXX 1351
            +EE + +    +  NKT       + K   +    ++G++LDKWVE+GN I         
Sbjct: 109  DEEEELELDLIDDSNKTVEKKPSELFKTIVSAPGLSIGSALDKWVEEGNEITRVEVAKAM 168

Query: 1350 XELCKHKFFRAALQVSSWLWTKKPYQFNEDYYATHLECISKSGNATVAQKYFEEIPSDMK 1171
             +L + + +  ALQ+S WL   K  + NE  Y++ L+   K       + Y ++IP   K
Sbjct: 169  LQLRRRRMYGRALQLSEWLEANKKIEMNERDYSSRLDLTVKIRGLENGEAYMQKIPKSFK 228

Query: 1170 GIKVYRALLKTYVARRNVKKAEFCFERMKEFGSKLLIADYNTMLLAYCLSERSKIHNFLV 991
            G  +YR LL   VA  NVKK+E  F RMK+ G  L     + MLL Y   +R KI + L+
Sbjct: 229  GEVIYRTLLANCVAAGNVKKSELVFNRMKDLGFPLSGFTCDQMLLLYKRIDRKKIADVLL 288

Query: 990  SMRSSSVRTSANTYRNLIYALGLDNDIDGMEKVIKEMTAKGLEPNAAILAFLSKHYIKFS 811
             M   +V+ S  TY+ LI   G  NDI GME++++ M  +G++P+    A  +KHY    
Sbjct: 289  LMEKENVKPSLLTYKILIDVKGATNDISGMEQILETMKDEGVQPDFQTQALTAKHYSGAG 348

Query: 810  MDRKAEKTISALEKLNGKEXXXXXXXXXXXXXXXXLYGLLKKESDVQRHWKEIESDVFIP 631
            +  KAEK +  +      E                +Y  L +E +V R WK  ES  +  
Sbjct: 349  LKEKAEKVLKEM------EGESLEANRRAFKDLLSIYASLGREDEVTRIWKICESKPYF- 401

Query: 630  SKDYTAGILAWGNVGNIEKAEKIFETMLNSGQRECTPQYNALLQVYAENILQSKAEDLFE 451
              +  A I A+G +  +++AE IFE ++  G+R  +  Y+ LL+VY ++ + SK +DL +
Sbjct: 402  -DESLAAIHAFGKLNKVQEAEAIFEKIVTMGRRASSNTYSVLLRVYVDHKMLSKGKDLVK 460

Query: 450  QMSMRNEKLYRLSRHALIRLFLSLGDLIKAESMLDYQGGKSR-RPAYSSFFLILKEYAQK 274
            +M+    ++   +  ALI+L++  G++ KA+SMLD    +S  +   +SF  I+ EY+++
Sbjct: 461  RMAESGCRIEATTWDALIKLYVEAGEVEKADSMLDKASKQSHTKLMMNSFMYIMDEYSKR 520

Query: 273  EEIRNAERVFYRMRGAGCTRRYGMYVPLLHAYIREHTPAQG 151
             ++ N E++F +MR  G T R   +  LL AYI    PA G
Sbjct: 521  GDVHNTEKIFLKMREVGYTSRLRQFQALLQAYINAKAPAYG 561


>ref|XP_006416900.1| hypothetical protein EUTSA_v10007131mg [Eutrema salsugineum]
            gi|567151345|ref|XP_006416901.1| hypothetical protein
            EUTSA_v10007131mg [Eutrema salsugineum]
            gi|557094671|gb|ESQ35253.1| hypothetical protein
            EUTSA_v10007131mg [Eutrema salsugineum]
            gi|557094672|gb|ESQ35254.1| hypothetical protein
            EUTSA_v10007131mg [Eutrema salsugineum]
          Length = 601

 Score =  234 bits (598), Expect = 7e-59
 Identities = 136/426 (31%), Positives = 229/426 (53%), Gaps = 1/426 (0%)
 Frame = -3

Query: 1425 NLGASLDKWVEQGNVIIXXXXXXXXXELCKHKFFRAALQVSSWLWTKKPYQFNEDYYATH 1246
            ++ ++LDKWVE+G  I          +L + + +  ALQ++ WL   KP++  E  YA+ 
Sbjct: 148  SVASALDKWVEEGKEINRTEIANAMLQLRRRRMYGRALQMAEWLEENKPFELEERDYASR 207

Query: 1245 LECISKSGNATVAQKYFEEIPSDMKGIKVYRALLKTYVARRNVKKAEFCFERMKEFGSKL 1066
            L+ I+K       + Y E IP   +G  VYR LL  Y A  NV+KAE  F +MK  G   
Sbjct: 208  LDLIAKIRGLHKGEVYIERIPESFRGELVYRTLLSNYAATSNVRKAEAVFNKMKGLGFPR 267

Query: 1065 LIADYNTMLLAYCLSERSKIHNFLVSMRSSSVRTSANTYRNLIYALGLDNDIDGMEKVIK 886
                 + ML+ Y   ++ KI + L+ M + +++ S  TY+ LI A G  NDI GME++++
Sbjct: 268  TSYACDQMLMLYKRVDKKKIADVLLLMENENLKPSLYTYKILIDAKGSSNDISGMEQIVE 327

Query: 885  EMTAKGLEPNAAILAFLSKHYIKFSMDRKAEKTISALEKLNGKEXXXXXXXXXXXXXXXX 706
             M ++G+E +    + +++HY    +  KAEK +  +E  N                   
Sbjct: 328  AMKSEGVELDLRAQSIIARHYASAGLKEKAEKVLQEMEGEN------LEANRHVCKILLS 381

Query: 705  LYGLLKKESDVQRHWKEIESDVFIPSKDYTAGILAWGNVGNIEKAEKIFETMLNSGQREC 526
            +YG L++  +V R W+  E + F   ++  A ILA+G +  +++AE +FE  +  G R  
Sbjct: 382  IYGSLQRADEVTRIWRICEENPFY--EESLAAILAFGKINKVKEAEAVFEKSVKMGHRVS 439

Query: 525  TPQYNALLQVYAENILQSKAEDLFEQMSMRNEKLYRLSRHALIRLFLSLGDLIKAESML- 349
            +  Y+ LL+VY ++ + S+ +DL ++M      +  L+  ALI+L++  G++ KA+S L 
Sbjct: 440  SSIYSVLLRVYVDHKMVSEGKDLVKRMLDSGCNIGALTWDALIKLYVEAGEVEKADSTLR 499

Query: 348  DYQGGKSRRPAYSSFFLILKEYAQKEEIRNAERVFYRMRGAGCTRRYGMYVPLLHAYIRE 169
                 K  +P  SSF  ++ EYA+K ++ N+E++F +MR AG   R+  +  L+ AY+  
Sbjct: 500  KATESKQIKPLMSSFMYVMDEYARKGDVHNSEKIFQKMRQAGYQSRFRQFQSLVQAYVNA 559

Query: 168  HTPAQG 151
             TPA G
Sbjct: 560  KTPAYG 565

