BLASTX nr result

ID: Mentha22_contig00008009 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00008009
         (700 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU41644.1| hypothetical protein MIMGU_mgv1a001284mg [Mimulus...   366   4e-99
ref|XP_002273255.1| PREDICTED: pentatricopeptide repeat-containi...   323   3e-86
ref|XP_004246707.1| PREDICTED: pentatricopeptide repeat-containi...   321   1e-85
ref|XP_006367266.1| PREDICTED: pentatricopeptide repeat-containi...   317   2e-84
ref|XP_007208081.1| hypothetical protein PRUPE_ppa001520mg [Prun...   309   5e-82
ref|XP_006381507.1| pentatricopeptide repeat-containing family p...   308   1e-81
ref|XP_007027210.1| Tetratricopeptide repeat (TPR)-like superfam...   307   3e-81
ref|XP_004305248.1| PREDICTED: pentatricopeptide repeat-containi...   306   4e-81
gb|EXB84820.1| hypothetical protein L484_003853 [Morus notabilis]     296   4e-78
ref|XP_006428907.1| hypothetical protein CICLE_v10011055mg [Citr...   294   2e-77
ref|XP_004142106.1| PREDICTED: pentatricopeptide repeat-containi...   294   2e-77
ref|XP_002525196.1| pentatricopeptide repeat-containing protein,...   293   5e-77
ref|XP_006287051.1| hypothetical protein CARUB_v10000200mg [Caps...   291   1e-76
ref|XP_006398739.1| hypothetical protein EUTSA_v10012661mg [Eutr...   288   9e-76
ref|XP_003554352.1| PREDICTED: pentatricopeptide repeat-containi...   285   1e-74
ref|NP_195903.2| pentatricopeptide repeat-containing protein [Ar...   285   1e-74
dbj|BAC42187.2| unknown protein [Arabidopsis thaliana]                283   4e-74
ref|XP_006398740.1| hypothetical protein EUTSA_v10012661mg [Eutr...   282   7e-74
ref|XP_007162713.1| hypothetical protein PHAVU_001G174000g [Phas...   279   6e-73
ref|XP_007162712.1| hypothetical protein PHAVU_001G174000g [Phas...   279   6e-73

>gb|EYU41644.1| hypothetical protein MIMGU_mgv1a001284mg [Mimulus guttatus]
          Length = 847

 Score =  366 bits (940), Expect = 4e-99
 Identities = 181/231 (78%), Positives = 202/231 (87%)
 Frame = -1

Query: 700  CGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVI 521
            CG DYY AK LM+EMK  GL+PN ISWS LIDVCGGSGNV+GA+QIL S+ E+GIQPDVI
Sbjct: 515  CGADYYRAKALMDEMKTLGLSPNQISWSTLIDVCGGSGNVAGAIQILRSLHETGIQPDVI 574

Query: 520  AYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQYLVIY 341
            AYTTAIKICV HKK KLAFMLFAEM+KY+IKPN VTY TIL ARS YGSLQ VQQ L +Y
Sbjct: 575  AYTTAIKICVKHKKPKLAFMLFAEMKKYEIKPNLVTYKTILTARSRYGSLQEVQQSLAVY 634

Query: 340  QQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFASHITDFGRQSLLLEKVIEYLQD 161
            QQMRKAGYK NDYYLKQLIE+WCEGV+Q EH NEG+FAS ITDFG QS+LLEKV E+LQD
Sbjct: 635  QQMRKAGYKPNDYYLKQLIEEWCEGVLQNEHHNEGQFASRITDFGPQSMLLEKVAEHLQD 694

Query: 160  STAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILGQEQ 8
            S AESL IDL+GLTKVEARI+VLAVLRKIKEKY AGNS++DD+ IILG ++
Sbjct: 695  SNAESLSIDLQGLTKVEARIIVLAVLRKIKEKYIAGNSMEDDVSIILGLQE 745


>ref|XP_002273255.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic [Vitis vinifera]
            gi|297741486|emb|CBI32618.3| unnamed protein product
            [Vitis vinifera]
          Length = 842

 Score =  323 bits (828), Expect = 3e-86
 Identities = 164/230 (71%), Positives = 192/230 (83%), Gaps = 2/230 (0%)
 Frame = -1

Query: 700  CGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVI 521
            CGTDYY AK LM+EMK  GL+PNHISWSILID+CGG+GN+ GAV+IL +MRE+GI+PDV+
Sbjct: 511  CGTDYYRAKALMDEMKTAGLSPNHISWSILIDICGGTGNIVGAVRILKTMREAGIKPDVV 570

Query: 520  AYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQYLVIY 341
            AYTTAIK CV  K  K+AF LFAEM++YQI+PN VTYNT+LRARS YGSL  VQQ L IY
Sbjct: 571  AYTTAIKYCVESKNLKIAFSLFAEMKRYQIQPNLVTYNTLLRARSRYGSLHEVQQCLAIY 630

Query: 340  QQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFAS-HITDFGR-QSLLLEKVIEYL 167
            Q MRKAGYK NDYYLK+LIE+WCEGVIQ  + N+ +F+S +  D+GR QSLLLEKV  +L
Sbjct: 631  QHMRKAGYKSNDYYLKELIEEWCEGVIQDNNLNQSKFSSVNRADWGRPQSLLLEKVAAHL 690

Query: 166  QDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILG 17
            Q S AESL IDL+GLT+VEARIVVLAVLR IKE Y  G+ IKDD+LIILG
Sbjct: 691  QKSVAESLAIDLQGLTQVEARIVVLAVLRMIKENYILGHPIKDDILIILG 740


>ref|XP_004246707.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Solanum lycopersicum]
          Length = 857

 Score =  321 bits (823), Expect = 1e-85
 Identities = 161/232 (69%), Positives = 190/232 (81%), Gaps = 1/232 (0%)
 Frame = -1

Query: 700  CGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVI 521
            CG+DYY AK LMEEMK  GL+PNHI+W+ILID+CGGSGNV GA+QIL  MRE+GIQPDV+
Sbjct: 530  CGSDYYRAKALMEEMKEVGLSPNHITWTILIDICGGSGNVEGALQILRVMREAGIQPDVV 589

Query: 520  AYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQYLVIY 341
             YTT IK+CV +K  K AF LFA M++YQIKPN VTYNT+LRARS YGSLQ VQQ L IY
Sbjct: 590  TYTTIIKVCVENKDFKSAFSLFAAMKRYQIKPNMVTYNTLLRARSRYGSLQEVQQCLAIY 649

Query: 340  QQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFAS-HITDFGRQSLLLEKVIEYLQ 164
            Q MRKAGYK NDYYLKQLIE WCEGVIQ  +Q +  F++ + TD G QS++LEKV E+LQ
Sbjct: 650  QDMRKAGYKPNDYYLKQLIEQWCEGVIQNANQRKYNFSTRNRTDLGPQSMILEKVAEHLQ 709

Query: 163  DSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILGQEQ 8
              +A S+ I+LRGLTKVEARIVVLAVLR I+EKY AG+SIKDD+ I LG ++
Sbjct: 710  KDSANSISINLRGLTKVEARIVVLAVLRMIREKYTAGDSIKDDVQIFLGVKE 761


>ref|XP_006367266.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Solanum tuberosum]
          Length = 859

 Score =  317 bits (812), Expect = 2e-84
 Identities = 157/232 (67%), Positives = 191/232 (82%), Gaps = 1/232 (0%)
 Frame = -1

Query: 700  CGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVI 521
            CG+DYY AK LMEEMK  GL+PNHI+W+ILID+CGGSGNV GA+QIL +MRE+GIQPDV+
Sbjct: 528  CGSDYYRAKALMEEMKEVGLSPNHITWTILIDICGGSGNVEGALQILRAMREAGIQPDVV 587

Query: 520  AYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQYLVIY 341
             YTT IK+CV +K  K AF LFA M++YQIKPN VTYNT+LRARS YGSLQ VQQ L IY
Sbjct: 588  TYTTIIKVCVENKDFKSAFSLFAAMKRYQIKPNMVTYNTLLRARSRYGSLQEVQQCLAIY 647

Query: 340  QQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFAS-HITDFGRQSLLLEKVIEYLQ 164
            Q MRKAGYK NDYYLKQLIE WCEGVIQ  +Q +  F++ + TD G +S++L+KV E+LQ
Sbjct: 648  QHMRKAGYKPNDYYLKQLIEQWCEGVIQNGNQRKYNFSTRNRTDLGPESMILDKVAEHLQ 707

Query: 163  DSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILGQEQ 8
              +A S+ I+LRGL+KVEARIVVLAVLR I+EKY AG+SIK+D+ I LG ++
Sbjct: 708  KDSANSISINLRGLSKVEARIVVLAVLRMIREKYTAGDSIKEDVQIFLGVQE 759


>ref|XP_007208081.1| hypothetical protein PRUPE_ppa001520mg [Prunus persica]
            gi|462403723|gb|EMJ09280.1| hypothetical protein
            PRUPE_ppa001520mg [Prunus persica]
          Length = 809

 Score =  309 bits (792), Expect = 5e-82
 Identities = 154/231 (66%), Positives = 187/231 (80%), Gaps = 2/231 (0%)
 Frame = -1

Query: 700  CGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVI 521
            CGTDYYHAK L++EM+  GL PN ISWSIL D+CGGSGNV GA+QIL +MR +G++PDV+
Sbjct: 481  CGTDYYHAKALLDEMRAVGLYPNQISWSILADICGGSGNVEGALQILKNMRAAGMKPDVV 540

Query: 520  AYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQYLVIY 341
            AYTTAIK+CV ++  +LA  LF EM+KYQI PN VTYNT+LRARS YGS+  VQQ L IY
Sbjct: 541  AYTTAIKVCVENENLELALSLFGEMKKYQIHPNLVTYNTLLRARSRYGSVSEVQQCLAIY 600

Query: 340  QQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFAS-HITDFGRQ-SLLLEKVIEYL 167
            Q MRKAGYK NDYYL+QLIE+WCEGVIQ  +  +  F+S + TD GR  SLLLEKV E+L
Sbjct: 601  QDMRKAGYKSNDYYLEQLIEEWCEGVIQDSNAKQEEFSSCNKTDIGRPGSLLLEKVAEHL 660

Query: 166  QDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILGQ 14
            Q   AE+L +DL+GLTKVEARIVVLAVLR IKE Y  G+S+KDD+LI++G+
Sbjct: 661  QTHIAETLAVDLQGLTKVEARIVVLAVLRMIKENYTLGHSVKDDMLIVVGE 711



 Score = 72.4 bits (176), Expect = 1e-10
 Identities = 56/229 (24%), Positives = 102/229 (44%), Gaps = 8/229 (3%)
 Frame = -1

Query: 688  YYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVIAYTT 509
            ++ A ++ E+M   G+TPN ++WS LI  C  +G V  A+Q+   M  +G +P+   +  
Sbjct: 382  WHMALNVKEDMLSAGVTPNTVTWSSLISACANAGIVEKAIQLFEEMLLAGSEPNSQCFNI 441

Query: 508  AIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQY--LVIYQQ 335
             +  CV   +   AF LF  +++   KP + TYNT+++A        G   Y    +  +
Sbjct: 442  LLHACVEANQYDRAFRLFQSLKRLSFKPTTTTYNTLMKA-------CGTDYYHAKALLDE 494

Query: 334  MRKAGYKRNDYYLKQLIEDWC------EGVIQTEHQNEGRFASHITDFGRQSLLLEKVIE 173
            MR  G   N      ++ D C      EG +Q       R A    D    +  ++  +E
Sbjct: 495  MRAVGLYPNQISW-SILADICGGSGNVEGALQI--LKNMRAAGMKPDVVAYTTAIKVCVE 551

Query: 172  YLQDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLI 26
                  A SLF +++   ++   +V    L + + +Y + + ++  L I
Sbjct: 552  NENLELALSLFGEMKKY-QIHPNLVTYNTLLRARSRYGSVSEVQQCLAI 599


>ref|XP_006381507.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550336211|gb|ERP59304.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 828

 Score =  308 bits (789), Expect = 1e-81
 Identities = 158/229 (68%), Positives = 188/229 (82%), Gaps = 2/229 (0%)
 Frame = -1

Query: 700  CGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVI 521
            CG+DY+ AK LM+EMK  G++PNHISWSILID+CG SGNVSGAVQIL +MR +G++PDV+
Sbjct: 522  CGSDYHRAKALMDEMKTVGISPNHISWSILIDICGVSGNVSGAVQILKNMRLAGVEPDVV 581

Query: 520  AYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQYLVIY 341
            AYTTAIK+CV  K  KLAF LFAEM++ QI PN VTYNT+LRAR+ YGSL+ VQQ L IY
Sbjct: 582  AYTTAIKVCVETKNLKLAFSLFAEMKRCQINPNLVTYNTLLRARTRYGSLREVQQCLAIY 641

Query: 340  QQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFAS-HITDFGR-QSLLLEKVIEYL 167
            Q MRKAGYK NDYYLKQLIE+WCEGVIQ  +Q +G FAS   TD GR +SLLLEKV  +L
Sbjct: 642  QDMRKAGYKSNDYYLKQLIEEWCEGVIQDNNQIQGGFASCKRTDLGRPRSLLLEKVAAHL 701

Query: 166  QDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIIL 20
            Q++ +E+L IDL+GLTKVEARIVVLAVLR IKE Y  G S+K+D+ I L
Sbjct: 702  QNNISENLAIDLQGLTKVEARIVVLAVLRMIKENYTLGYSVKEDMWITL 750


>ref|XP_007027210.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative
            [Theobroma cacao] gi|508715815|gb|EOY07712.1|
            Tetratricopeptide repeat (TPR)-like superfamily protein,
            putative [Theobroma cacao]
          Length = 858

 Score =  307 bits (786), Expect = 3e-81
 Identities = 159/236 (67%), Positives = 187/236 (79%), Gaps = 3/236 (1%)
 Frame = -1

Query: 700  CGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVI 521
            C TDYY AK LM+EMK  GL+PNH+SWSILID+C GSGNV GA+QIL +M  +GI+PDV+
Sbjct: 528  CCTDYYRAKALMDEMKSVGLSPNHVSWSILIDICRGSGNVEGAIQILKTMHVTGIKPDVV 587

Query: 520  AYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQYLVIY 341
            AYTTAIK+CV  K  KLAF LF EM++Y+++PN VTYNT+LRARS YGSL  VQQ L IY
Sbjct: 588  AYTTAIKVCVGSKNLKLAFSLFEEMKRYRVQPNLVTYNTLLRARSRYGSLHEVQQCLAIY 647

Query: 340  QQMRKAGYKRNDYYLKQLIEDWCEGVI-QTEHQNEGRFASHITDFGR-QSLLLEKVIEYL 167
            Q MRKAGYK ND YLK+LIE+WCEGVI +  H+ EG  +   TD  R  SLLLEK+  +L
Sbjct: 648  QDMRKAGYKSNDIYLKELIEEWCEGVIKENNHKREGLSSCKRTDLERPHSLLLEKIAVHL 707

Query: 166  QDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILG-QEQHA 2
            Q STAES  IDLRGLTKVEARIVVLAVLR IKE +  G+S+KDD+LIILG  E+HA
Sbjct: 708  QMSTAESPAIDLRGLTKVEARIVVLAVLRMIKENHILGHSVKDDMLIILGVSERHA 763


>ref|XP_004305248.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 847

 Score =  306 bits (784), Expect = 4e-81
 Identities = 156/230 (67%), Positives = 182/230 (79%), Gaps = 2/230 (0%)
 Frame = -1

Query: 700  CGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVI 521
            CG+DYYHAK LM+EMK  GL PN I+WSIL D+CG SGNV GA+QIL SMR +GIQPDV+
Sbjct: 516  CGSDYYHAKALMDEMKTVGLLPNQITWSILADICGSSGNVQGALQILKSMRVAGIQPDVV 575

Query: 520  AYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQYLVIY 341
            AYTTAIKICV  +   LA +LFAEM+KYQI PN VTYNT+LRARS YGS+  VQQ L IY
Sbjct: 576  AYTTAIKICVESENLDLALLLFAEMKKYQIHPNLVTYNTLLRARSRYGSVSEVQQCLAIY 635

Query: 340  QQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFA-SHITDFGRQ-SLLLEKVIEYL 167
            Q MRKAGYK NDYYL+QLIE+WCEGVIQ     +G F+     D GR  SLLLEKV E+L
Sbjct: 636  QDMRKAGYKPNDYYLEQLIEEWCEGVIQDSCPKQGEFSYGDKADIGRPGSLLLEKVAEHL 695

Query: 166  QDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILG 17
            Q   A++L +DL+GLTKVEARIVVLAVLR IKE Y  G+S+KDD+LI++G
Sbjct: 696  QQHIADTLAVDLQGLTKVEARIVVLAVLRMIKENYILGDSVKDDMLIMVG 745


>gb|EXB84820.1| hypothetical protein L484_003853 [Morus notabilis]
          Length = 822

 Score =  296 bits (758), Expect = 4e-78
 Identities = 149/230 (64%), Positives = 183/230 (79%), Gaps = 2/230 (0%)
 Frame = -1

Query: 700  CGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVI 521
            CG+DYYHAK L+EEM+  GL+PN I+WSILID+CG  GNV GA+QIL +MR +GI+PDV+
Sbjct: 498  CGSDYYHAKALIEEMEAVGLSPNQITWSILIDICGDLGNVEGALQILKTMRATGIEPDVV 557

Query: 520  AYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQYLVIY 341
            AYTT IK+CV  K  K AF LFAEM++YQI+PN VTYNT+LRAR+ YGSLQ V+Q L +Y
Sbjct: 558  AYTTVIKVCVESKDLKQAFELFAEMKRYQIQPNLVTYNTLLRARNRYGSLQEVKQCLAVY 617

Query: 340  QQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFAS-HITDFGR-QSLLLEKVIEYL 167
            Q MR+AGY  NDYYLKQLIE+WCEGVIQ  +QN    +S + TD  R QSLLLEKV E+L
Sbjct: 618  QDMRRAGYNSNDYYLKQLIEEWCEGVIQGNNQNREESSSFNKTDKKRPQSLLLEKVAEHL 677

Query: 166  QDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILG 17
            +   AE+L +D++GL KVEARIVVLAVLR +KE Y  G  +KDD+LII+G
Sbjct: 678  EKHIAETLTVDVQGLKKVEARIVVLAVLRMVKENYTMGYLVKDDMLIIIG 727


>ref|XP_006428907.1| hypothetical protein CICLE_v10011055mg [Citrus clementina]
            gi|568853887|ref|XP_006480569.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Citrus sinensis]
            gi|557530964|gb|ESR42147.1| hypothetical protein
            CICLE_v10011055mg [Citrus clementina]
          Length = 850

 Score =  294 bits (752), Expect = 2e-77
 Identities = 149/230 (64%), Positives = 177/230 (76%), Gaps = 2/230 (0%)
 Frame = -1

Query: 700  CGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVI 521
            C TDYY  K LM+EM+  GL+PNHISW+ILID CGGSGNV GA+QIL  MRE G+ PDV+
Sbjct: 525  CCTDYYRVKALMDEMRTVGLSPNHISWTILIDACGGSGNVEGALQILKIMREDGMSPDVV 584

Query: 520  AYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQYLVIY 341
            AYTTAIK+CV  K+ KLAF LF EM+ YQI+PN VTY T+LRARS YGSL  VQQ L +Y
Sbjct: 585  AYTTAIKVCVRSKRLKLAFSLFEEMKHYQIQPNLVTYITLLRARSRYGSLHEVQQCLAVY 644

Query: 340  QQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGR--FASHITDFGRQSLLLEKVIEYL 167
            Q M KAGYK ND YLK++IE+WCEGVIQ ++QN+G             QSLLLEKV  +L
Sbjct: 645  QDMWKAGYKANDTYLKEVIEEWCEGVIQDKNQNQGEVTLCRRTNSQRPQSLLLEKVAVHL 704

Query: 166  QDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILG 17
            Q S AE+L IDL+GLTKVEARIVVLAVL+ +KE Y+ G  +KDDL+I+LG
Sbjct: 705  QKSAAENLAIDLQGLTKVEARIVVLAVLQMMKENYSLGVPVKDDLMIVLG 754


>ref|XP_004142106.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Cucumis sativus]
          Length = 849

 Score =  294 bits (752), Expect = 2e-77
 Identities = 150/229 (65%), Positives = 180/229 (78%), Gaps = 2/229 (0%)
 Frame = -1

Query: 700  CGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVI 521
            CGTDYYHAK LMEEMK  GLTPNHISWSIL+D+CG S +V  AVQILT+MR +G+ PDV+
Sbjct: 528  CGTDYYHAKALMEEMKSVGLTPNHISWSILVDICGRSHDVESAVQILTTMRMAGVDPDVV 587

Query: 520  AYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQYLVIY 341
            AYTTAIK+CV  K  KLAF LF EM++++I+PN VTY+T+LRARS YGSL  VQQ L IY
Sbjct: 588  AYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIY 647

Query: 340  QQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFA-SHITDFGR-QSLLLEKVIEYL 167
            Q MRK+G+K ND+YLK+LI +WCEGVIQ  +Q        +  D G+ + L+LEKV ++L
Sbjct: 648  QDMRKSGFKSNDHYLKELIAEWCEGVIQKNNQQPVEITPCNKIDIGKPRCLILEKVADHL 707

Query: 166  QDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIIL 20
            Q S AESL IDL+ LTKVEARIVVLAVLR IKE YA G S+KDD+ IIL
Sbjct: 708  QKSFAESLTIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIIL 756


>ref|XP_002525196.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223535493|gb|EEF37162.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 786

 Score =  293 bits (749), Expect = 5e-77
 Identities = 150/230 (65%), Positives = 180/230 (78%), Gaps = 2/230 (0%)
 Frame = -1

Query: 700  CGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVI 521
            CG+DY  AK LM+EM+  GL+PNHISWSILID+CG SGN+ GA+QIL +MR +GI+PDVI
Sbjct: 458  CGSDYNRAKALMDEMQAVGLSPNHISWSILIDICGSSGNMEGAIQILKNMRMAGIEPDVI 517

Query: 520  AYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQYLVIY 341
            AYTTAIK+ V  K  K+AF LFAEM++YQ+KPN VTY+T+LRAR+ YGSL+ VQQ L IY
Sbjct: 518  AYTTAIKVSVESKNLKMAFSLFAEMKRYQLKPNLVTYDTLLRARTRYGSLKEVQQCLAIY 577

Query: 340  QQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRF-ASHITDFGR-QSLLLEKVIEYL 167
            Q MRKAGYK ND YLKQLIE+WCEGVIQ   Q +  F      +FGR  SLLLEKV  +L
Sbjct: 578  QDMRKAGYKSNDNYLKQLIEEWCEGVIQDNDQCQDDFKPCKRAEFGRPHSLLLEKVAAHL 637

Query: 166  QDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILG 17
              + AESL +DL+GLTKVEARIVVLAVLR +KE Y  G+ +KDD+ I LG
Sbjct: 638  HHNVAESLSVDLQGLTKVEARIVVLAVLRMVKENYIQGHLVKDDMSITLG 687


>ref|XP_006287051.1| hypothetical protein CARUB_v10000200mg [Capsella rubella]
            gi|482555757|gb|EOA19949.1| hypothetical protein
            CARUB_v10000200mg [Capsella rubella]
          Length = 858

 Score =  291 bits (745), Expect = 1e-76
 Identities = 146/233 (62%), Positives = 180/233 (77%), Gaps = 2/233 (0%)
 Frame = -1

Query: 700  CGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVI 521
            CGTDYY  K+LM+EMK  GLTPN I+WS LID+CGGSG+V GAV+IL +M  +G +PDV+
Sbjct: 536  CGTDYYRGKELMDEMKSLGLTPNQITWSTLIDMCGGSGDVEGAVRILRTMHSAGTRPDVV 595

Query: 520  AYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQYLVIY 341
            AYTTAIKIC  +K  KLAF LF EM +YQIKPN VTYNT+L+ARS YGSL  V+Q L IY
Sbjct: 596  AYTTAIKICAENKSLKLAFSLFEEMRRYQIKPNWVTYNTLLKARSKYGSLLEVRQCLAIY 655

Query: 340  QQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFASHITDFGRQ--SLLLEKVIEYL 167
            Q MRKAGYK ND++LK+LIE+WCEGVIQ   Q++ + +    D   +  SLL+EKV  +L
Sbjct: 656  QDMRKAGYKPNDHFLKELIEEWCEGVIQENGQSQNKISDQEGDHAGRPVSLLIEKVATHL 715

Query: 166  QDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILGQEQ 8
            Q+ TA +L IDL+GLTKVEAR+VVLAVLR IKE Y  G+ + DD+LIILG  +
Sbjct: 716  QERTAGNLAIDLQGLTKVEARLVVLAVLRMIKEDYMRGDVVIDDVLIILGTSE 768


>ref|XP_006398739.1| hypothetical protein EUTSA_v10012661mg [Eutrema salsugineum]
            gi|557099829|gb|ESQ40192.1| hypothetical protein
            EUTSA_v10012661mg [Eutrema salsugineum]
          Length = 858

 Score =  288 bits (738), Expect = 9e-76
 Identities = 147/230 (63%), Positives = 179/230 (77%), Gaps = 2/230 (0%)
 Frame = -1

Query: 700  CGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVI 521
            CGTDYY  K+LM+EM+  GL PN I+WS LID+CGGSG+V GAV IL +M  +G +PDV+
Sbjct: 533  CGTDYYRGKELMDEMRSLGLAPNQITWSTLIDICGGSGDVEGAVGILRTMHSAGTRPDVV 592

Query: 520  AYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQYLVIY 341
            AYTTAIKIC  +K  KLAF LF EM +YQIKPN VTYNT+L+ARS YGSL  V+Q L IY
Sbjct: 593  AYTTAIKICAENKSLKLAFSLFEEMRRYQIKPNWVTYNTLLKARSKYGSLLEVRQCLAIY 652

Query: 340  QQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFASHI-TDFGRQ-SLLLEKVIEYL 167
            Q MRKAGYK ND++LK+LIE+WCEGVIQ   Q++ + +    T+ GR  SLL+EKV  +L
Sbjct: 653  QDMRKAGYKPNDHFLKELIEEWCEGVIQENSQSQIKTSDQEGTNLGRPVSLLIEKVATHL 712

Query: 166  QDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILG 17
            Q+ TA +L IDL+GLTKVEAR+VVLAVLR IKE Y  G+ + DDLLIILG
Sbjct: 713  QERTAGNLAIDLQGLTKVEARLVVLAVLRMIKEDYIRGDVVTDDLLIILG 762


>ref|XP_003554352.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Glycine max]
          Length = 811

 Score =  285 bits (729), Expect = 1e-74
 Identities = 142/230 (61%), Positives = 175/230 (76%), Gaps = 2/230 (0%)
 Frame = -1

Query: 700  CGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVI 521
            CGTDYYHAK L++EM+  GL+PN ISWSILID+CG S NV GA++IL +M ++GI+PDVI
Sbjct: 490  CGTDYYHAKALIKEMETVGLSPNQISWSILIDICGASSNVEGAIEILKTMGDAGIKPDVI 549

Query: 520  AYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQYLVIY 341
            AYTTAIK+CV  K    A  L+ EM+ YQI+PN VTYNT+L+ARS YG L  VQQ L IY
Sbjct: 550  AYTTAIKVCVESKNFMQALTLYEEMKCYQIRPNWVTYNTLLKARSKYGFLHEVQQCLAIY 609

Query: 340  QQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFASHITDFGR--QSLLLEKVIEYL 167
            Q MRKAGYK NDYYL++LIE+WCEGVIQ   + +G F+S         QSLLLEK+  +L
Sbjct: 610  QDMRKAGYKPNDYYLEELIEEWCEGVIQNNREKQGEFSSSNKSESERPQSLLLEKIAAHL 669

Query: 166  QDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILG 17
                A+ L ID++GLTKVEAR+VVLAVLR IKE Y  G+S+ DD+LII+G
Sbjct: 670  LKRVADILAIDVQGLTKVEARLVVLAVLRMIKENYGLGHSVNDDILIIIG 719


>ref|NP_195903.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|332278227|sp|Q8GYL7.3|PP361_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At5g02830, chloroplastic; Flags: Precursor
            gi|332003140|gb|AED90523.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 852

 Score =  285 bits (728), Expect = 1e-74
 Identities = 143/233 (61%), Positives = 181/233 (77%), Gaps = 2/233 (0%)
 Frame = -1

Query: 700  CGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVI 521
            CGTDYY  K+LM+EMK  GL+PN I+WS LID+CGGSG+V GAV+IL +M  +G +PDV+
Sbjct: 536  CGTDYYRGKELMDEMKSLGLSPNQITWSTLIDMCGGSGDVEGAVRILRTMHSAGTRPDVV 595

Query: 520  AYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQYLVIY 341
            AYTTAIKIC  +K  KLAF LF EM +YQIKPN VTYNT+L+ARS YGSL  V+Q L IY
Sbjct: 596  AYTTAIKICAENKCLKLAFSLFEEMRRYQIKPNWVTYNTLLKARSKYGSLLEVRQCLAIY 655

Query: 340  QQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFASHITD-FGRQ-SLLLEKVIEYL 167
            Q MR AGYK ND++LK+LIE+WCEGVIQ   Q++ + +    D  GR  SLL+EKV  ++
Sbjct: 656  QDMRNAGYKPNDHFLKELIEEWCEGVIQENGQSQDKISDQEGDNAGRPVSLLIEKVATHM 715

Query: 166  QDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILGQEQ 8
            Q+ TA +L IDL+GLTK+EAR+VVLAVLR IKE Y  G+ + DD+LII+G ++
Sbjct: 716  QERTAGNLAIDLQGLTKIEARLVVLAVLRMIKEDYMRGDVVIDDVLIIIGTDE 768


>dbj|BAC42187.2| unknown protein [Arabidopsis thaliana]
          Length = 852

 Score =  283 bits (724), Expect = 4e-74
 Identities = 142/233 (60%), Positives = 181/233 (77%), Gaps = 2/233 (0%)
 Frame = -1

Query: 700  CGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVI 521
            CGTDYY  K+LM+EMK  GL+PN I+WS LID+CGGSG+V GAV+IL +M  +G +PDV+
Sbjct: 536  CGTDYYRGKELMDEMKSLGLSPNQITWSTLIDMCGGSGDVEGAVRILRTMHSAGTRPDVV 595

Query: 520  AYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQYLVIY 341
            AYTTAIKIC  +K  KLAF LF EM +YQIKPN VTYNT+L+ARS YGSL  V+Q L IY
Sbjct: 596  AYTTAIKICAENKCLKLAFSLFEEMRRYQIKPNWVTYNTLLKARSKYGSLLEVRQCLAIY 655

Query: 340  QQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFASHITD-FGRQ-SLLLEKVIEYL 167
            Q MR AGYK ND++LK+LIE+WCEGVIQ   +++ + +    D  GR  SLL+EKV  ++
Sbjct: 656  QDMRNAGYKPNDHFLKELIEEWCEGVIQENGRSQDKISDQEGDNAGRPVSLLIEKVATHM 715

Query: 166  QDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILGQEQ 8
            Q+ TA +L IDL+GLTK+EAR+VVLAVLR IKE Y  G+ + DD+LII+G ++
Sbjct: 716  QERTAGNLAIDLQGLTKIEARLVVLAVLRMIKEDYMRGDVVIDDVLIIIGTDE 768


>ref|XP_006398740.1| hypothetical protein EUTSA_v10012661mg [Eutrema salsugineum]
            gi|557099830|gb|ESQ40193.1| hypothetical protein
            EUTSA_v10012661mg [Eutrema salsugineum]
          Length = 863

 Score =  282 bits (722), Expect = 7e-74
 Identities = 147/235 (62%), Positives = 179/235 (76%), Gaps = 7/235 (2%)
 Frame = -1

Query: 700  CGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVI 521
            CGTDYY  K+LM+EM+  GL PN I+WS LID+CGGSG+V GAV IL +M  +G +PDV+
Sbjct: 533  CGTDYYRGKELMDEMRSLGLAPNQITWSTLIDICGGSGDVEGAVGILRTMHSAGTRPDVV 592

Query: 520  AYTTAIK-----ICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQ 356
            AYTTAIK     IC  +K  KLAF LF EM +YQIKPN VTYNT+L+ARS YGSL  V+Q
Sbjct: 593  AYTTAIKHAIFQICAENKSLKLAFSLFEEMRRYQIKPNWVTYNTLLKARSKYGSLLEVRQ 652

Query: 355  YLVIYQQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFASHI-TDFGRQ-SLLLEK 182
             L IYQ MRKAGYK ND++LK+LIE+WCEGVIQ   Q++ + +    T+ GR  SLL+EK
Sbjct: 653  CLAIYQDMRKAGYKPNDHFLKELIEEWCEGVIQENSQSQIKTSDQEGTNLGRPVSLLIEK 712

Query: 181  VIEYLQDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILG 17
            V  +LQ+ TA +L IDL+GLTKVEAR+VVLAVLR IKE Y  G+ + DDLLIILG
Sbjct: 713  VATHLQERTAGNLAIDLQGLTKVEARLVVLAVLRMIKEDYIRGDVVTDDLLIILG 767


>ref|XP_007162713.1| hypothetical protein PHAVU_001G174000g [Phaseolus vulgaris]
            gi|561036177|gb|ESW34707.1| hypothetical protein
            PHAVU_001G174000g [Phaseolus vulgaris]
          Length = 809

 Score =  279 bits (714), Expect = 6e-73
 Identities = 139/230 (60%), Positives = 178/230 (77%), Gaps = 2/230 (0%)
 Frame = -1

Query: 700  CGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVI 521
            CGTDYYHAK L++EM+  GL+PN ISWS LID+CG S NV GA++IL +M ++GI+PDVI
Sbjct: 484  CGTDYYHAKALIKEMETVGLSPNQISWSTLIDICGASANVEGAIEILKNMGDAGIKPDVI 543

Query: 520  AYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQYLVIY 341
            AYTTAIK+CV  K    A  L+ EM+ Y I+PN +TYNT+L+ARS YGSL  VQQ L IY
Sbjct: 544  AYTTAIKVCVESKNFMQALALYKEMKSYHIRPNLITYNTLLKARSKYGSLHEVQQCLAIY 603

Query: 340  QQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRF-ASHITDFGR-QSLLLEKVIEYL 167
            Q MRKAGYK ND YL++LIE+WCEGVIQ   + +G F +S+ ++  + QSLLLEK+  +L
Sbjct: 604  QDMRKAGYKPNDCYLEELIEEWCEGVIQDNREIQGEFSSSNKSELEKSQSLLLEKIAAHL 663

Query: 166  QDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILG 17
                A+ L ID++GLTKVEAR+VVLAVLR IKE Y+ G+SI DD+LI++G
Sbjct: 664  LKRVADILAIDVQGLTKVEARLVVLAVLRMIKENYSLGHSINDDILIVIG 713


>ref|XP_007162712.1| hypothetical protein PHAVU_001G174000g [Phaseolus vulgaris]
           gi|561036176|gb|ESW34706.1| hypothetical protein
           PHAVU_001G174000g [Phaseolus vulgaris]
          Length = 594

 Score =  279 bits (714), Expect = 6e-73
 Identities = 139/230 (60%), Positives = 178/230 (77%), Gaps = 2/230 (0%)
 Frame = -1

Query: 700 CGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVI 521
           CGTDYYHAK L++EM+  GL+PN ISWS LID+CG S NV GA++IL +M ++GI+PDVI
Sbjct: 269 CGTDYYHAKALIKEMETVGLSPNQISWSTLIDICGASANVEGAIEILKNMGDAGIKPDVI 328

Query: 520 AYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQYLVIY 341
           AYTTAIK+CV  K    A  L+ EM+ Y I+PN +TYNT+L+ARS YGSL  VQQ L IY
Sbjct: 329 AYTTAIKVCVESKNFMQALALYKEMKSYHIRPNLITYNTLLKARSKYGSLHEVQQCLAIY 388

Query: 340 QQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRF-ASHITDFGR-QSLLLEKVIEYL 167
           Q MRKAGYK ND YL++LIE+WCEGVIQ   + +G F +S+ ++  + QSLLLEK+  +L
Sbjct: 389 QDMRKAGYKPNDCYLEELIEEWCEGVIQDNREIQGEFSSSNKSELEKSQSLLLEKIAAHL 448

Query: 166 QDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILG 17
               A+ L ID++GLTKVEAR+VVLAVLR IKE Y+ G+SI DD+LI++G
Sbjct: 449 LKRVADILAIDVQGLTKVEARLVVLAVLRMIKENYSLGHSINDDILIVIG 498

