BLASTX nr result

ID: Mentha29_contig00020085 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00020085
         (779 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU21860.1| hypothetical protein MIMGU_mgv1a025261mg, partial...   360   4e-97
ref|XP_006347148.1| PREDICTED: pentatricopeptide repeat-containi...   353   5e-95
ref|XP_003632713.1| PREDICTED: pentatricopeptide repeat-containi...   351   1e-94
ref|XP_004233816.1| PREDICTED: pentatricopeptide repeat-containi...   345   8e-93
emb|CBI29931.3| unnamed protein product [Vitis vinifera]              338   1e-90
ref|XP_006479094.1| PREDICTED: pentatricopeptide repeat-containi...   337   3e-90
ref|XP_007030706.1| Pentatricopeptide repeat (PPR) superfamily p...   336   5e-90
ref|XP_006410248.1| hypothetical protein EUTSA_v10016243mg [Eutr...   333   5e-89
ref|XP_006282436.1| hypothetical protein CARUB_v10004043mg [Caps...   330   3e-88
ref|NP_193101.2| pentatricopeptide repeat-containing protein [Ar...   329   6e-88
emb|CAB36829.1| putative protein [Arabidopsis thaliana] gi|72680...   329   6e-88
ref|XP_004293058.1| PREDICTED: pentatricopeptide repeat-containi...   328   1e-87
ref|XP_002868345.1| pentatricopeptide repeat-containing protein ...   328   2e-87
ref|XP_004142047.1| PREDICTED: pentatricopeptide repeat-containi...   324   2e-86
ref|XP_004160501.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   322   9e-86
ref|XP_006443406.1| hypothetical protein CICLE_v10018850mg [Citr...   321   2e-85
tpg|DAA37919.1| TPA: hypothetical protein ZEAMMB73_411767 [Zea m...   321   2e-85
ref|XP_003619016.1| Pentatricopeptide repeat-containing protein ...   321   2e-85
dbj|BAJ97995.1| predicted protein [Hordeum vulgare subsp. vulgare]    321   2e-85
ref|XP_002446412.1| hypothetical protein SORBIDRAFT_06g015580 [S...   321   2e-85

>gb|EYU21860.1| hypothetical protein MIMGU_mgv1a025261mg, partial [Mimulus guttatus]
          Length = 1007

 Score =  360 bits (923), Expect = 4e-97
 Identities = 168/217 (77%), Positives = 194/217 (89%), Gaps = 1/217 (0%)
 Frame = +2

Query: 2    LGRAGQLFHAKEFVESMPIEPDAMIWRTFLSACTVHKNREFGEVAANNLLELEPNDSATY 181
            LGRAGQ+  A+EFVESMPIEPDAM+WRT LSACTVHKNRE GE+AA NLLELEP DSATY
Sbjct: 787  LGRAGQVSRAREFVESMPIEPDAMVWRTLLSACTVHKNREIGEIAAKNLLELEPKDSATY 846

Query: 182  VLMSNMYAVTGKWDHRDHSRRLMRERGVKKEPGRSWIEVKNSVHSFFVGDRLHPLVHEIY 361
            VLMSNMYAVTGKWD+RD  R+LMR RGV+KEPG+SWIEVKNSVH+FFVGD+LHPL  +IY
Sbjct: 847  VLMSNMYAVTGKWDYRDRVRQLMRNRGVRKEPGQSWIEVKNSVHAFFVGDKLHPLADQIY 906

Query: 362  SYLDELTKKVTAIGYVPDQSSLWNEVELRQKDLTAQIHSEKLAVAFGLLTLPR-IIPLHV 538
            +YL +L ++V AIGYV D SSLWN++EL QKD TA IHSEKLAVAFGL++L   IIPLHV
Sbjct: 907  NYLKDLNERVAAIGYVQDYSSLWNDLELEQKDPTAHIHSEKLAVAFGLMSLSEMIIPLHV 966

Query: 539  MKNLRVCNDCHNWINFISKVVDRTIIVRDAYRFHHFQ 649
            MKNLRVC+DCHNW+ F+SK+VDRT+IVRD+YRFHHF+
Sbjct: 967  MKNLRVCSDCHNWLKFVSKIVDRTVIVRDSYRFHHFE 1003


>ref|XP_006347148.1| PREDICTED: pentatricopeptide repeat-containing protein At4g13650-like
            [Solanum tuberosum]
          Length = 1057

 Score =  353 bits (905), Expect = 5e-95
 Identities = 160/226 (70%), Positives = 189/226 (83%)
 Frame = +2

Query: 2    LGRAGQLFHAKEFVESMPIEPDAMIWRTFLSACTVHKNREFGEVAANNLLELEPNDSATY 181
            LGRAG L  A +FVE+MP+EPDAM+WRT LSAC VHKN E GE   + LLELEP DSATY
Sbjct: 832  LGRAGHLQRAMKFVETMPVEPDAMVWRTLLSACIVHKNIEIGEETGHRLLELEPQDSATY 891

Query: 182  VLMSNMYAVTGKWDHRDHSRRLMRERGVKKEPGRSWIEVKNSVHSFFVGDRLHPLVHEIY 361
            VL+SN+YAV G+WD R+ +R LM++RGVKKEPGRSWIEVKN++H+FFVGDRLHPL + IY
Sbjct: 892  VLLSNLYAVLGRWDSRNQTRLLMKDRGVKKEPGRSWIEVKNTIHAFFVGDRLHPLANHIY 951

Query: 362  SYLDELTKKVTAIGYVPDQSSLWNEVELRQKDLTAQIHSEKLAVAFGLLTLPRIIPLHVM 541
             +++EL K+V  IGYV D +SLWN++EL QKD TA IHSEKLA+AFGLL+LP +IP+ VM
Sbjct: 952  DFVEELNKRVVMIGYVQDNNSLWNDLELGQKDPTAYIHSEKLAIAFGLLSLPEMIPIRVM 1011

Query: 542  KNLRVCNDCHNWINFISKVVDRTIIVRDAYRFHHFQDGSCSCKDYW 679
            KNLRVCNDCHNWI  +SKV DR IIVRDAYRFHHF DG CSC D+W
Sbjct: 1012 KNLRVCNDCHNWIKCVSKVADRAIIVRDAYRFHHFADGQCSCNDFW 1057


>ref|XP_003632713.1| PREDICTED: pentatricopeptide repeat-containing protein At4g13650-like
            [Vitis vinifera]
          Length = 989

 Score =  351 bits (901), Expect = 1e-94
 Identities = 161/226 (71%), Positives = 191/226 (84%)
 Frame = +2

Query: 2    LGRAGQLFHAKEFVESMPIEPDAMIWRTFLSACTVHKNREFGEVAANNLLELEPNDSATY 181
            LGRA  L  A+EF+E MPIEPDAMIWRT LSACTVHKN E GE AA +LLELEP DSATY
Sbjct: 764  LGRAALLCCAREFIEEMPIEPDAMIWRTLLSACTVHKNIEIGEFAARHLLELEPEDSATY 823

Query: 182  VLMSNMYAVTGKWDHRDHSRRLMRERGVKKEPGRSWIEVKNSVHSFFVGDRLHPLVHEIY 361
            VL+SNMYAV+GKWD+RD +R++M++RGVKKEPGRSWIEVKNS+H+FFVGDRLHPL  +IY
Sbjct: 824  VLLSNMYAVSGKWDYRDRTRQMMKDRGVKKEPGRSWIEVKNSIHAFFVGDRLHPLAEQIY 883

Query: 362  SYLDELTKKVTAIGYVPDQSSLWNEVELRQKDLTAQIHSEKLAVAFGLLTLPRIIPLHVM 541
             Y+D+L ++   IGYV D+ +L N+VE  QKD TA IHSEKLAVAFGLL+L   +P+ V+
Sbjct: 884  EYIDDLNERAGEIGYVQDRYNLLNDVEQEQKDPTAYIHSEKLAVAFGLLSLTNTMPIRVI 943

Query: 542  KNLRVCNDCHNWINFISKVVDRTIIVRDAYRFHHFQDGSCSCKDYW 679
            KNLRVCNDCHNWI F+SK+ +R I+VRDAYRFHHF+ G CSCKDYW
Sbjct: 944  KNLRVCNDCHNWIKFVSKISNRAIVVRDAYRFHHFEGGVCSCKDYW 989


>ref|XP_004233816.1| PREDICTED: pentatricopeptide repeat-containing protein At4g13650-like
            [Solanum lycopersicum]
          Length = 1057

 Score =  345 bits (886), Expect = 8e-93
 Identities = 157/226 (69%), Positives = 187/226 (82%)
 Frame = +2

Query: 2    LGRAGQLFHAKEFVESMPIEPDAMIWRTFLSACTVHKNREFGEVAANNLLELEPNDSATY 181
            LGRAG L  A  FVE+MP+EPDAM+WRT LSAC VHKN E GE   + LLELEP DSATY
Sbjct: 832  LGRAGHLQRAMNFVETMPVEPDAMVWRTLLSACIVHKNIEIGEETGHRLLELEPQDSATY 891

Query: 182  VLMSNMYAVTGKWDHRDHSRRLMRERGVKKEPGRSWIEVKNSVHSFFVGDRLHPLVHEIY 361
            VL+SN+YAV G+WD R+ +R LM++RGVKKEPGRSWIEV+N++H+FFVGDRLHPL + IY
Sbjct: 892  VLLSNLYAVLGRWDSRNQTRLLMKDRGVKKEPGRSWIEVQNTIHAFFVGDRLHPLANHIY 951

Query: 362  SYLDELTKKVTAIGYVPDQSSLWNEVELRQKDLTAQIHSEKLAVAFGLLTLPRIIPLHVM 541
             +++EL K+V  IGYV D +SLWN++EL QKD TA IHSEKLA+AFGLL+L  +IP+ VM
Sbjct: 952  DFVEELNKRVVMIGYVQDNNSLWNDLELGQKDPTAYIHSEKLAIAFGLLSLHEMIPIRVM 1011

Query: 542  KNLRVCNDCHNWINFISKVVDRTIIVRDAYRFHHFQDGSCSCKDYW 679
            KNLRVCNDCHNWI  +SKV +R IIVRDAYRFHHF DG CSC D+W
Sbjct: 1012 KNLRVCNDCHNWIKCVSKVANRAIIVRDAYRFHHFADGQCSCNDFW 1057


>emb|CBI29931.3| unnamed protein product [Vitis vinifera]
          Length = 838

 Score =  338 bits (868), Expect = 1e-90
 Identities = 154/214 (71%), Positives = 182/214 (85%)
 Frame = +2

Query: 38   FVESMPIEPDAMIWRTFLSACTVHKNREFGEVAANNLLELEPNDSATYVLMSNMYAVTGK 217
            FV  MPIEPDAMIWRT LSACTVHKN E GE AA +LLELEP DSATYVL+SNMYAV+GK
Sbjct: 625  FVGEMPIEPDAMIWRTLLSACTVHKNIEIGEFAARHLLELEPEDSATYVLLSNMYAVSGK 684

Query: 218  WDHRDHSRRLMRERGVKKEPGRSWIEVKNSVHSFFVGDRLHPLVHEIYSYLDELTKKVTA 397
            WD+RD +R++M++RGVKKEPGRSWIEVKNS+H+FFVGDRLHPL  +IY Y+D+L ++   
Sbjct: 685  WDYRDRTRQMMKDRGVKKEPGRSWIEVKNSIHAFFVGDRLHPLAEQIYEYIDDLNERAGE 744

Query: 398  IGYVPDQSSLWNEVELRQKDLTAQIHSEKLAVAFGLLTLPRIIPLHVMKNLRVCNDCHNW 577
            IGYV D+ +L N+VE  QKD TA IHSEKLAVAFGLL+L   +P+ V+KNLRVCNDCHNW
Sbjct: 745  IGYVQDRYNLLNDVEQEQKDPTAYIHSEKLAVAFGLLSLTNTMPIRVIKNLRVCNDCHNW 804

Query: 578  INFISKVVDRTIIVRDAYRFHHFQDGSCSCKDYW 679
            I F+SK+ +R I+VRDAYRFHHF+ G CSCKDYW
Sbjct: 805  IKFVSKISNRAIVVRDAYRFHHFEGGVCSCKDYW 838


>ref|XP_006479094.1| PREDICTED: pentatricopeptide repeat-containing protein At4g13650-like
            isoform X1 [Citrus sinensis]
            gi|568850820|ref|XP_006479095.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g13650-like isoform X2 [Citrus sinensis]
            gi|568850822|ref|XP_006479096.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g13650-like isoform X3 [Citrus sinensis]
          Length = 1077

 Score =  337 bits (864), Expect = 3e-90
 Identities = 154/226 (68%), Positives = 184/226 (81%)
 Frame = +2

Query: 2    LGRAGQLFHAKEFVESMPIEPDAMIWRTFLSACTVHKNREFGEVAANNLLELEPNDSATY 181
            LGRAG L  A+EF E MPIEPDAM+WRT LSAC VHKN E GE AAN+LLELEP DSATY
Sbjct: 852  LGRAGSLSRAREFTEQMPIEPDAMVWRTLLSACRVHKNMEIGEYAANHLLELEPEDSATY 911

Query: 182  VLMSNMYAVTGKWDHRDHSRRLMRERGVKKEPGRSWIEVKNSVHSFFVGDRLHPLVHEIY 361
            VL+SN+YA  GKWD RD  R++M++RGVKKEPG+SWIEVKNS+H+FFVGDRLHPL  +IY
Sbjct: 912  VLLSNIYAAAGKWDCRDQIRQIMKDRGVKKEPGQSWIEVKNSIHAFFVGDRLHPLADKIY 971

Query: 362  SYLDELTKKVTAIGYVPDQSSLWNEVELRQKDLTAQIHSEKLAVAFGLLTLPRIIPLHVM 541
             YL  L ++V  IGYV  + SLW+++E  QKD    IHSEKLA+AFGLL+L   +P+ V+
Sbjct: 972  DYLGNLNRRVAEIGYVQGRYSLWSDLEQEQKDPCVYIHSEKLAIAFGLLSLSDSMPILVI 1031

Query: 542  KNLRVCNDCHNWINFISKVVDRTIIVRDAYRFHHFQDGSCSCKDYW 679
            KNLRVCNDCHNWI F+SK+ +RTI+VRDA RFHHF+ G CSC+DYW
Sbjct: 1032 KNLRVCNDCHNWIKFVSKISNRTIVVRDANRFHHFEGGVCSCRDYW 1077


>ref|XP_007030706.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 2
            [Theobroma cacao] gi|508719311|gb|EOY11208.1|
            Pentatricopeptide repeat (PPR) superfamily protein
            isoform 2 [Theobroma cacao]
          Length = 1072

 Score =  336 bits (862), Expect = 5e-90
 Identities = 153/226 (67%), Positives = 190/226 (84%)
 Frame = +2

Query: 2    LGRAGQLFHAKEFVESMPIEPDAMIWRTFLSACTVHKNREFGEVAANNLLELEPNDSATY 181
            LGRAG L  A++FVE MPIEPDA+IWRT LSAC VHKN + GE AA++LL+LEP DSA+Y
Sbjct: 847  LGRAGLLCRARKFVEDMPIEPDAIIWRTLLSACAVHKNVDIGEFAAHHLLKLEPQDSASY 906

Query: 182  VLMSNMYAVTGKWDHRDHSRRLMRERGVKKEPGRSWIEVKNSVHSFFVGDRLHPLVHEIY 361
            VL+SN+YAV+ KWD RD +R++M+ERGVKKEP +SWIEVKNS+H+FFVGDRLHPL  +IY
Sbjct: 907  VLLSNLYAVSKKWDSRDQTRQMMKERGVKKEPAQSWIEVKNSIHAFFVGDRLHPLAEKIY 966

Query: 362  SYLDELTKKVTAIGYVPDQSSLWNEVELRQKDLTAQIHSEKLAVAFGLLTLPRIIPLHVM 541
             +L++L K+   IGYV D+ S +++VE  QKD T  IHSEKLA+AFGLL+LP  IP+ V+
Sbjct: 967  EHLEDLNKRAAEIGYVQDRYSRFSDVEQGQKDPTVHIHSEKLAIAFGLLSLPSAIPVRVI 1026

Query: 542  KNLRVCNDCHNWINFISKVVDRTIIVRDAYRFHHFQDGSCSCKDYW 679
            KNLRVCNDCHNWI F+SK+ ++ IIVRDAYRFHHF+ GSCSC+DYW
Sbjct: 1027 KNLRVCNDCHNWIKFVSKISNQLIIVRDAYRFHHFEGGSCSCRDYW 1072


>ref|XP_006410248.1| hypothetical protein EUTSA_v10016243mg [Eutrema salsugineum]
            gi|557111417|gb|ESQ51701.1| hypothetical protein
            EUTSA_v10016243mg [Eutrema salsugineum]
          Length = 844

 Score =  333 bits (853), Expect = 5e-89
 Identities = 154/226 (68%), Positives = 185/226 (81%)
 Frame = +2

Query: 2    LGRAGQLFHAKEFVESMPIEPDAMIWRTFLSACTVHKNREFGEVAANNLLELEPNDSATY 181
            L RAG L  AKEF++ MPIEPDA++WRT LSAC VHKN E GE AA +L+ELEP DSATY
Sbjct: 619  LTRAGLLSRAKEFIQEMPIEPDALVWRTLLSACVVHKNLEIGEFAARHLVELEPEDSATY 678

Query: 182  VLMSNMYAVTGKWDHRDHSRRLMRERGVKKEPGRSWIEVKNSVHSFFVGDRLHPLVHEIY 361
            VL+SN+YAV  KWD RD +R+ M+E+GVKKEPG+SWIEVKNS+HSF+VGD+ HPL  EI+
Sbjct: 679  VLLSNLYAVCRKWDARDQTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIH 738

Query: 362  SYLDELTKKVTAIGYVPDQSSLWNEVELRQKDLTAQIHSEKLAVAFGLLTLPRIIPLHVM 541
             Y  +LTK+ + IGYV D  SL NE +  QKD T  IHSEKLA++FGLL+LP  IP++VM
Sbjct: 739  EYFQDLTKRASEIGYVQDCFSLLNEAQQEQKDPTIFIHSEKLAISFGLLSLPGTIPINVM 798

Query: 542  KNLRVCNDCHNWINFISKVVDRTIIVRDAYRFHHFQDGSCSCKDYW 679
            KNLRVCNDCH+WI F+SKV +R IIVRDAYRFHHF+ G+CSCKDYW
Sbjct: 799  KNLRVCNDCHDWIKFVSKVSNREIIVRDAYRFHHFEGGACSCKDYW 844


>ref|XP_006282436.1| hypothetical protein CARUB_v10004043mg [Capsella rubella]
            gi|565439136|ref|XP_006282437.1| hypothetical protein
            CARUB_v10004043mg [Capsella rubella]
            gi|565439139|ref|XP_006282438.1| hypothetical protein
            CARUB_v10004043mg [Capsella rubella]
            gi|482551141|gb|EOA15334.1| hypothetical protein
            CARUB_v10004043mg [Capsella rubella]
            gi|482551142|gb|EOA15335.1| hypothetical protein
            CARUB_v10004043mg [Capsella rubella]
            gi|482551143|gb|EOA15336.1| hypothetical protein
            CARUB_v10004043mg [Capsella rubella]
          Length = 1050

 Score =  330 bits (846), Expect = 3e-88
 Identities = 152/226 (67%), Positives = 186/226 (82%)
 Frame = +2

Query: 2    LGRAGQLFHAKEFVESMPIEPDAMIWRTFLSACTVHKNREFGEVAANNLLELEPNDSATY 181
            L RAG L  AK+F+  MPIEPDA++WRT LSAC VHKN E GE AA +LLELEP DSATY
Sbjct: 825  LTRAGLLSRAKDFILEMPIEPDALVWRTLLSACVVHKNMEIGEFAARHLLELEPEDSATY 884

Query: 182  VLMSNMYAVTGKWDHRDHSRRLMRERGVKKEPGRSWIEVKNSVHSFFVGDRLHPLVHEIY 361
            VL+SN+YAV  +WD RD +R+ M+++GVKKEPG+SWIEVKNS+HSF+VGD+ HPL  EI+
Sbjct: 885  VLLSNLYAVCKEWDSRDLTRQKMKQKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLTDEIH 944

Query: 362  SYLDELTKKVTAIGYVPDQSSLWNEVELRQKDLTAQIHSEKLAVAFGLLTLPRIIPLHVM 541
             Y  +LTK+ + IGYVPD  SL NE++  QKD    IHSEKLA++FGLL+LPR +P++VM
Sbjct: 945  EYFQDLTKRASDIGYVPDCFSLLNELQQEQKDPMIFIHSEKLAISFGLLSLPRTMPINVM 1004

Query: 542  KNLRVCNDCHNWINFISKVVDRTIIVRDAYRFHHFQDGSCSCKDYW 679
            KNLRVCNDCH+WI F+SKV +R IIVRDAYRFHHF+ G+CSCKDYW
Sbjct: 1005 KNLRVCNDCHDWIKFVSKVSNREIIVRDAYRFHHFEGGACSCKDYW 1050


>ref|NP_193101.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|223635639|sp|Q9SVP7.2|PP307_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g13650 gi|332657909|gb|AEE83309.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 1064

 Score =  329 bits (844), Expect = 6e-88
 Identities = 152/226 (67%), Positives = 186/226 (82%)
 Frame = +2

Query: 2    LGRAGQLFHAKEFVESMPIEPDAMIWRTFLSACTVHKNREFGEVAANNLLELEPNDSATY 181
            L RAG L  AKEF++ MPI+PDA++WRT LSAC VHKN E GE AA++LLELEP DSATY
Sbjct: 839  LTRAGLLSRAKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATY 898

Query: 182  VLMSNMYAVTGKWDHRDHSRRLMRERGVKKEPGRSWIEVKNSVHSFFVGDRLHPLVHEIY 361
            VL+SN+YAV+ KWD RD +R+ M+E+GVKKEPG+SWIEVKNS+HSF+VGD+ HPL  EI+
Sbjct: 899  VLLSNLYAVSKKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIH 958

Query: 362  SYLDELTKKVTAIGYVPDQSSLWNEVELRQKDLTAQIHSEKLAVAFGLLTLPRIIPLHVM 541
             Y  +LTK+ + IGYV D  SL NE++  QKD    IHSEKLA++FGLL+LP  +P++VM
Sbjct: 959  EYFQDLTKRASEIGYVQDCFSLLNELQHEQKDPIIFIHSEKLAISFGLLSLPATVPINVM 1018

Query: 542  KNLRVCNDCHNWINFISKVVDRTIIVRDAYRFHHFQDGSCSCKDYW 679
            KNLRVCNDCH WI F+SKV +R IIVRDAYRFHHF+ G+CSCKDYW
Sbjct: 1019 KNLRVCNDCHAWIKFVSKVSNREIIVRDAYRFHHFEGGACSCKDYW 1064


>emb|CAB36829.1| putative protein [Arabidopsis thaliana] gi|7268069|emb|CAB78407.1|
            putative protein [Arabidopsis thaliana]
          Length = 1024

 Score =  329 bits (844), Expect = 6e-88
 Identities = 152/226 (67%), Positives = 186/226 (82%)
 Frame = +2

Query: 2    LGRAGQLFHAKEFVESMPIEPDAMIWRTFLSACTVHKNREFGEVAANNLLELEPNDSATY 181
            L RAG L  AKEF++ MPI+PDA++WRT LSAC VHKN E GE AA++LLELEP DSATY
Sbjct: 799  LTRAGLLSRAKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATY 858

Query: 182  VLMSNMYAVTGKWDHRDHSRRLMRERGVKKEPGRSWIEVKNSVHSFFVGDRLHPLVHEIY 361
            VL+SN+YAV+ KWD RD +R+ M+E+GVKKEPG+SWIEVKNS+HSF+VGD+ HPL  EI+
Sbjct: 859  VLLSNLYAVSKKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIH 918

Query: 362  SYLDELTKKVTAIGYVPDQSSLWNEVELRQKDLTAQIHSEKLAVAFGLLTLPRIIPLHVM 541
             Y  +LTK+ + IGYV D  SL NE++  QKD    IHSEKLA++FGLL+LP  +P++VM
Sbjct: 919  EYFQDLTKRASEIGYVQDCFSLLNELQHEQKDPIIFIHSEKLAISFGLLSLPATVPINVM 978

Query: 542  KNLRVCNDCHNWINFISKVVDRTIIVRDAYRFHHFQDGSCSCKDYW 679
            KNLRVCNDCH WI F+SKV +R IIVRDAYRFHHF+ G+CSCKDYW
Sbjct: 979  KNLRVCNDCHAWIKFVSKVSNREIIVRDAYRFHHFEGGACSCKDYW 1024


>ref|XP_004293058.1| PREDICTED: pentatricopeptide repeat-containing protein At4g13650-like
            [Fragaria vesca subsp. vesca]
          Length = 1277

 Score =  328 bits (842), Expect = 1e-87
 Identities = 150/226 (66%), Positives = 183/226 (80%)
 Frame = +2

Query: 2    LGRAGQLFHAKEFVESMPIEPDAMIWRTFLSACTVHKNREFGEVAANNLLELEPNDSATY 181
            L RAG L  A++F+  MPI+PD+ IWRT LSAC   KN E GEVAA +LL+LEP DSATY
Sbjct: 1052 LSRAGSLNCARKFITEMPIKPDSTIWRTLLSACIAKKNTEIGEVAARHLLKLEPEDSATY 1111

Query: 182  VLMSNMYAVTGKWDHRDHSRRLMRERGVKKEPGRSWIEVKNSVHSFFVGDRLHPLVHEIY 361
            VL+SNMYAV G W +RD +R+LM+ERGVKKEPGRSWIEVKNSVH+F+VGDRLHPL ++IY
Sbjct: 1112 VLISNMYAVAGLWGYRDQARQLMKERGVKKEPGRSWIEVKNSVHAFYVGDRLHPLANKIY 1171

Query: 362  SYLDELTKKVTAIGYVPDQSSLWNEVELRQKDLTAQIHSEKLAVAFGLLTLPRIIPLHVM 541
             +L +L ++   IGYV D+++LWN++E + KD T  IHSEKLA+ FGL++L   IP+ V+
Sbjct: 1172 EFLGDLNERAAEIGYVEDRNNLWNDMEQQHKDPTVYIHSEKLAITFGLISLSSTIPIRVI 1231

Query: 542  KNLRVCNDCHNWINFISKVVDRTIIVRDAYRFHHFQDGSCSCKDYW 679
            KNLRVCNDCHNWI   SK+  RTIIVRDAYRFHHF+DG CSCKDYW
Sbjct: 1232 KNLRVCNDCHNWIKHTSKISKRTIIVRDAYRFHHFKDGVCSCKDYW 1277


>ref|XP_002868345.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297314181|gb|EFH44604.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 1047

 Score =  328 bits (840), Expect = 2e-87
 Identities = 152/226 (67%), Positives = 186/226 (82%)
 Frame = +2

Query: 2    LGRAGQLFHAKEFVESMPIEPDAMIWRTFLSACTVHKNREFGEVAANNLLELEPNDSATY 181
            L RAG L  AK+F+  MPIEPDA++WRT LSAC VHKN E GE AA++LLELEP DSATY
Sbjct: 822  LTRAGLLSRAKDFILEMPIEPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATY 881

Query: 182  VLMSNMYAVTGKWDHRDHSRRLMRERGVKKEPGRSWIEVKNSVHSFFVGDRLHPLVHEIY 361
            VL+SN+YAV  KWD RD +R+ M+E+GVKKEPG+SWIEVKNS+HSF+VGD+ HPL  EI+
Sbjct: 882  VLLSNLYAVCRKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIH 941

Query: 362  SYLDELTKKVTAIGYVPDQSSLWNEVELRQKDLTAQIHSEKLAVAFGLLTLPRIIPLHVM 541
             Y  +LTK+ + IGYV D  SL +E++  QKD T  IHSEKLA++FGLL+LP  +P++VM
Sbjct: 942  EYFKDLTKRASEIGYVQDCFSLLSELQQEQKDPTIFIHSEKLAISFGLLSLPATMPINVM 1001

Query: 542  KNLRVCNDCHNWINFISKVVDRTIIVRDAYRFHHFQDGSCSCKDYW 679
            KNLRVCNDCH+WI F+SKV +R IIVRDAYRFHHF+ G+CSCKDYW
Sbjct: 1002 KNLRVCNDCHDWIKFVSKVSNREIIVRDAYRFHHFEGGACSCKDYW 1047


>ref|XP_004142047.1| PREDICTED: pentatricopeptide repeat-containing protein At4g13650-like
            [Cucumis sativus]
          Length = 1037

 Score =  324 bits (831), Expect = 2e-86
 Identities = 149/226 (65%), Positives = 183/226 (80%)
 Frame = +2

Query: 2    LGRAGQLFHAKEFVESMPIEPDAMIWRTFLSACTVHKNREFGEVAANNLLELEPNDSATY 181
            LGRAGQL  A E+++ MPI  DAMIWRT LSAC +HKN E GE AA++LLELEP DSATY
Sbjct: 812  LGRAGQLDRAMEYIKEMPIPADAMIWRTLLSACVIHKNIEIGERAAHHLLELEPEDSATY 871

Query: 182  VLMSNMYAVTGKWDHRDHSRRLMRERGVKKEPGRSWIEVKNSVHSFFVGDRLHPLVHEIY 361
            VL+SN+YAV+ +W HRD SR+LM++RGVKKEPGRSWIEVKN+VH+F+ GD+LHPL ++IY
Sbjct: 872  VLISNIYAVSRQWIHRDWSRKLMKDRGVKKEPGRSWIEVKNAVHAFYAGDKLHPLTNQIY 931

Query: 362  SYLDELTKKVTAIGYVPDQSSLWNEVELRQKDLTAQIHSEKLAVAFGLLTLPRIIPLHVM 541
             Y+  L ++ + IGYV D  SL NE E  QKD    +HSEKLA+AFGLL+L   IP+ VM
Sbjct: 932  EYIGHLNRRTSEIGYVQDSFSLLNESEQGQKDPITHVHSEKLAIAFGLLSLGNNIPIRVM 991

Query: 542  KNLRVCNDCHNWINFISKVVDRTIIVRDAYRFHHFQDGSCSCKDYW 679
            KNLRVCNDCHNWI ++SK+ +R+IIVRDA+RFHHF  G CSCKD+W
Sbjct: 992  KNLRVCNDCHNWIKYVSKISNRSIIVRDAHRFHHFDGGVCSCKDFW 1037


>ref|XP_004160501.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At4g13650-like [Cucumis sativus]
          Length = 1037

 Score =  322 bits (825), Expect = 9e-86
 Identities = 148/226 (65%), Positives = 182/226 (80%)
 Frame = +2

Query: 2    LGRAGQLFHAKEFVESMPIEPDAMIWRTFLSACTVHKNREFGEVAANNLLELEPNDSATY 181
            LGRAGQL  A E+++ MPI  DAMIWRT LSAC +HKN E GE AA++LLELEP DSATY
Sbjct: 812  LGRAGQLDRAMEYIKEMPIPADAMIWRTLLSACVIHKNIEIGERAAHHLLELEPEDSATY 871

Query: 182  VLMSNMYAVTGKWDHRDHSRRLMRERGVKKEPGRSWIEVKNSVHSFFVGDRLHPLVHEIY 361
            VL+SN+YAV+ +W HRD SR+LM++ GVKKEPGRSWIEVKN+VH+F+ GD+LHPL ++IY
Sbjct: 872  VLISNIYAVSRQWIHRDWSRKLMKDXGVKKEPGRSWIEVKNAVHAFYAGDKLHPLTNQIY 931

Query: 362  SYLDELTKKVTAIGYVPDQSSLWNEVELRQKDLTAQIHSEKLAVAFGLLTLPRIIPLHVM 541
             Y+  L ++ + IGYV D  SL NE E  QKD    +HSEKLA+AFGLL+L   IP+ VM
Sbjct: 932  EYIGHLNRRTSEIGYVQDSFSLLNESEQGQKDPITHVHSEKLAIAFGLLSLGNNIPIRVM 991

Query: 542  KNLRVCNDCHNWINFISKVVDRTIIVRDAYRFHHFQDGSCSCKDYW 679
            KNLRVCNDCHNWI ++SK+ +R+IIVRDA+RFHHF  G CSCKD+W
Sbjct: 992  KNLRVCNDCHNWIKYVSKISNRSIIVRDAHRFHHFDGGVCSCKDFW 1037


>ref|XP_006443406.1| hypothetical protein CICLE_v10018850mg [Citrus clementina]
            gi|557545668|gb|ESR56646.1| hypothetical protein
            CICLE_v10018850mg [Citrus clementina]
          Length = 840

 Score =  321 bits (823), Expect = 2e-85
 Identities = 146/214 (68%), Positives = 175/214 (81%)
 Frame = +2

Query: 38   FVESMPIEPDAMIWRTFLSACTVHKNREFGEVAANNLLELEPNDSATYVLMSNMYAVTGK 217
            FV  MPIEPDAM+WRT LSAC VHKN E GE AAN+LLELEP DSATYVL+SN+YA  GK
Sbjct: 627  FVGQMPIEPDAMVWRTLLSACRVHKNMEIGEYAANHLLELEPEDSATYVLLSNIYAAAGK 686

Query: 218  WDHRDHSRRLMRERGVKKEPGRSWIEVKNSVHSFFVGDRLHPLVHEIYSYLDELTKKVTA 397
            WD RD  R++M++RGVKKEPG+SWIEVKNS+H+FFVGDRLHPL  +IY YL  L ++V  
Sbjct: 687  WDCRDQIRQIMKDRGVKKEPGQSWIEVKNSIHAFFVGDRLHPLADKIYDYLGNLNRRVAE 746

Query: 398  IGYVPDQSSLWNEVELRQKDLTAQIHSEKLAVAFGLLTLPRIIPLHVMKNLRVCNDCHNW 577
            IGYV  + SLW+++E  QKD    IHSEKLA+AFGLL+L   +P+ V+KNLRVCNDCHNW
Sbjct: 747  IGYVQGRYSLWSDLEQEQKDPCVYIHSEKLAIAFGLLSLSDSMPILVIKNLRVCNDCHNW 806

Query: 578  INFISKVVDRTIIVRDAYRFHHFQDGSCSCKDYW 679
            I F+SK+ +RTI+VRDA RFHHF+ G CSC+DYW
Sbjct: 807  IKFVSKISNRTIVVRDANRFHHFEGGVCSCRDYW 840


>tpg|DAA37919.1| TPA: hypothetical protein ZEAMMB73_411767 [Zea mays]
          Length = 920

 Score =  321 bits (822), Expect = 2e-85
 Identities = 148/226 (65%), Positives = 183/226 (80%)
 Frame = +2

Query: 2    LGRAGQLFHAKEFVESMPIEPDAMIWRTFLSACTVHKNREFGEVAANNLLELEPNDSATY 181
            LGRAGQL  A+ FV+ MPI  +AMIWRT LSAC VHKN E GE+AA +LLELEP+DSA+Y
Sbjct: 695  LGRAGQLDRARRFVDEMPITANAMIWRTLLSACKVHKNIEIGELAAKHLLELEPHDSASY 754

Query: 182  VLMSNMYAVTGKWDHRDHSRRLMRERGVKKEPGRSWIEVKNSVHSFFVGDRLHPLVHEIY 361
            VL+SN YAVTGKW +RD  R++M++RG++KEPGRSWIEVKN+VH+FFVGDRLHPL  +IY
Sbjct: 755  VLLSNAYAVTGKWANRDQVRKMMKDRGIRKEPGRSWIEVKNAVHAFFVGDRLHPLSDQIY 814

Query: 362  SYLDELTKKVTAIGYVPDQSSLWNEVELRQKDLTAQIHSEKLAVAFGLLTLPRIIPLHVM 541
             +L EL  +++ IGY  +  +L++E E  QKD TA +HSEKLAVAFGL+TLP  IPL V+
Sbjct: 815  KFLSELNDRLSKIGYKQENPNLFHEKEQEQKDPTAFVHSEKLAVAFGLMTLPPCIPLRVI 874

Query: 542  KNLRVCNDCHNWINFISKVVDRTIIVRDAYRFHHFQDGSCSCKDYW 679
            KNLRVC+DCH+W+ F S+V  R I++RD YRFHHF  GSCSC DYW
Sbjct: 875  KNLRVCDDCHSWMKFTSEVTRREIVLRDVYRFHHFNSGSCSCGDYW 920


>ref|XP_003619016.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355494031|gb|AES75234.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 999

 Score =  321 bits (822), Expect = 2e-85
 Identities = 150/226 (66%), Positives = 179/226 (79%)
 Frame = +2

Query: 2    LGRAGQLFHAKEFVESMPIEPDAMIWRTFLSACTVHKNREFGEVAANNLLELEPNDSATY 181
            LGR+G L  AK FVE MPI+PDAM+WRT LSAC VHKN + GE AA++LLELEP DSATY
Sbjct: 774  LGRSGLLSRAKRFVEEMPIQPDAMVWRTLLSACNVHKNIDIGEFAASHLLELEPKDSATY 833

Query: 182  VLMSNMYAVTGKWDHRDHSRRLMRERGVKKEPGRSWIEVKNSVHSFFVGDRLHPLVHEIY 361
            VL+SNMYAV+GKWD RD +R++M++RGVKKEPGRSW+EV NSVH+FF GD+ HP    IY
Sbjct: 834  VLVSNMYAVSGKWDCRDRTRQMMKDRGVKKEPGRSWVEVDNSVHAFFAGDQNHPRADMIY 893

Query: 362  SYLDELTKKVTAIGYVPDQSSLWNEVELRQKDLTAQIHSEKLAVAFGLLTLPRIIPLHVM 541
             YL  L  +    GYVP  +SL ++ E+RQKD T  IHSE+LA+AFGLL+L    PL+V 
Sbjct: 894  EYLRGLDFRAAENGYVPRCNSLLSDAEIRQKDPTEIIHSERLAIAFGLLSLTSSTPLYVF 953

Query: 542  KNLRVCNDCHNWINFISKVVDRTIIVRDAYRFHHFQDGSCSCKDYW 679
            KNLRVC DCHNWI  +SK+ DR IIVRD+YRFHHF+ GSCSCKDYW
Sbjct: 954  KNLRVCEDCHNWIKHVSKITDRVIIVRDSYRFHHFKVGSCSCKDYW 999


>dbj|BAJ97995.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 919

 Score =  321 bits (822), Expect = 2e-85
 Identities = 147/226 (65%), Positives = 182/226 (80%)
 Frame = +2

Query: 2    LGRAGQLFHAKEFVESMPIEPDAMIWRTFLSACTVHKNREFGEVAANNLLELEPNDSATY 181
            LGRAGQL  A++FVE MP+  +AM+WRT LSAC VHKN E GE+AA  LLELEP+DSA+Y
Sbjct: 694  LGRAGQLDRARKFVEEMPVSANAMVWRTLLSACRVHKNIEIGELAAKYLLELEPHDSASY 753

Query: 182  VLMSNMYAVTGKWDHRDHSRRLMRERGVKKEPGRSWIEVKNSVHSFFVGDRLHPLVHEIY 361
            VL+SN YAVTGKW  RDH R++M++RGV+KEPGRSWIEVKN VH+FFVGDRLHPL H+IY
Sbjct: 754  VLLSNAYAVTGKWACRDHVRKMMKDRGVRKEPGRSWIEVKNVVHAFFVGDRLHPLAHQIY 813

Query: 362  SYLDELTKKVTAIGYVPDQSSLWNEVELRQKDLTAQIHSEKLAVAFGLLTLPRIIPLHVM 541
             YL +L  ++  IGY+     L++E E  QKD TA +HSEKLAVAFGL++LP  +PL V+
Sbjct: 814  KYLADLDDRLAKIGYIQGNYFLFHEKEKEQKDPTAFVHSEKLAVAFGLMSLPPSMPLRVI 873

Query: 542  KNLRVCNDCHNWINFISKVVDRTIIVRDAYRFHHFQDGSCSCKDYW 679
            KNLRVCNDCH W+ F S+V+ R I++RD YRFHHF +G+CSC D+W
Sbjct: 874  KNLRVCNDCHTWMKFTSEVMGREIVLRDVYRFHHFNNGNCSCGDFW 919


>ref|XP_002446412.1| hypothetical protein SORBIDRAFT_06g015580 [Sorghum bicolor]
           gi|241937595|gb|EES10740.1| hypothetical protein
           SORBIDRAFT_06g015580 [Sorghum bicolor]
          Length = 317

 Score =  321 bits (822), Expect = 2e-85
 Identities = 147/226 (65%), Positives = 182/226 (80%)
 Frame = +2

Query: 2   LGRAGQLFHAKEFVESMPIEPDAMIWRTFLSACTVHKNREFGEVAANNLLELEPNDSATY 181
           LGRAGQL  A+ FV+ MPI  DAM+WRT LSAC VHKN E GE+AA +LLELEP+DSA+Y
Sbjct: 92  LGRAGQLDRARRFVDEMPITADAMVWRTLLSACKVHKNIEIGELAAKHLLELEPHDSASY 151

Query: 182 VLMSNMYAVTGKWDHRDHSRRLMRERGVKKEPGRSWIEVKNSVHSFFVGDRLHPLVHEIY 361
           VL+SN YAVTGKW +RD  R++M++RG++KEPGRSWIE KN+VH+FFVGDRLHPL  +IY
Sbjct: 152 VLLSNAYAVTGKWANRDQVRKMMKDRGIRKEPGRSWIEAKNAVHAFFVGDRLHPLSDQIY 211

Query: 362 SYLDELTKKVTAIGYVPDQSSLWNEVELRQKDLTAQIHSEKLAVAFGLLTLPRIIPLHVM 541
            +L EL  ++  IGY  ++ +L++E E  QKD TA +HSEKLAVAFGL+TLP  IPL V+
Sbjct: 212 KFLSELNDRLAKIGYKQEKPNLFHEKEQEQKDPTAFVHSEKLAVAFGLMTLPPCIPLRVI 271

Query: 542 KNLRVCNDCHNWINFISKVVDRTIIVRDAYRFHHFQDGSCSCKDYW 679
           KNLRVC+DCH+W+ F S+V  R I++RD YRFHHF  GSCSC DYW
Sbjct: 272 KNLRVCDDCHSWMKFTSEVTRREIVLRDVYRFHHFNSGSCSCGDYW 317


Top