BLASTX nr result

ID: Mentha28_contig00011345 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00011345
         (1320 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU41644.1| hypothetical protein MIMGU_mgv1a001284mg [Mimulus...   489   e-135
ref|XP_006367266.1| PREDICTED: pentatricopeptide repeat-containi...   405   e-110
ref|XP_004246707.1| PREDICTED: pentatricopeptide repeat-containi...   399   e-108
ref|XP_002273255.1| PREDICTED: pentatricopeptide repeat-containi...   389   e-105
ref|XP_007208081.1| hypothetical protein PRUPE_ppa001520mg [Prun...   380   e-103
ref|XP_007027210.1| Tetratricopeptide repeat (TPR)-like superfam...   371   e-100
gb|EXB84820.1| hypothetical protein L484_003853 [Morus notabilis]     371   e-100
ref|XP_004305248.1| PREDICTED: pentatricopeptide repeat-containi...   366   1e-98
ref|XP_006381507.1| pentatricopeptide repeat-containing family p...   364   5e-98
ref|XP_006428907.1| hypothetical protein CICLE_v10011055mg [Citr...   363   9e-98
ref|NP_195903.2| pentatricopeptide repeat-containing protein [Ar...   354   4e-95
ref|XP_006287051.1| hypothetical protein CARUB_v10000200mg [Caps...   353   7e-95
dbj|BAC42187.2| unknown protein [Arabidopsis thaliana]                353   1e-94
ref|XP_006398739.1| hypothetical protein EUTSA_v10012661mg [Eutr...   352   2e-94
ref|XP_002525196.1| pentatricopeptide repeat-containing protein,...   350   6e-94
ref|XP_004142106.1| PREDICTED: pentatricopeptide repeat-containi...   347   5e-93
ref|XP_004493936.1| PREDICTED: pentatricopeptide repeat-containi...   347   9e-93
ref|XP_006398740.1| hypothetical protein EUTSA_v10012661mg [Eutr...   346   1e-92
ref|XP_003554352.1| PREDICTED: pentatricopeptide repeat-containi...   345   3e-92
ref|XP_007162713.1| hypothetical protein PHAVU_001G174000g [Phas...   343   1e-91

>gb|EYU41644.1| hypothetical protein MIMGU_mgv1a001284mg [Mimulus guttatus]
          Length = 847

 Score =  489 bits (1258), Expect = e-135
 Identities = 256/382 (67%), Positives = 299/382 (78%), Gaps = 9/382 (2%)
 Frame = -3

Query: 1318 WKERVSEQTLRDDKLSHADKAMDQMIEMGVPLTLCSHLTMRVPFRPTISTYNIMLKACGT 1139
            WKER SEQT  D+ L+ +D ++  +    V     SHLT  VPFRPT STYNI++KACG 
Sbjct: 459  WKERGSEQTTSDNNLNSSDDSL-AVHHTRVTSRPYSHLTTGVPFRPTTSTYNILMKACGA 517

Query: 1138 DYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVVAYT 959
            DYY AK LM+EMK  GL+PN ISWS LIDVCGGSGNV+GA+QIL S+ E+GIQPDV+AYT
Sbjct: 518  DYYRAKALMDEMKTLGLSPNQISWSTLIDVCGGSGNVAGAIQILRSLHETGIQPDVIAYT 577

Query: 958  TAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQYLVIYQQM 779
            TAIKICV HKK KLAFMLFAEM+KY+IKPN VTY TIL ARS YGSLQ VQQ L +YQQM
Sbjct: 578  TAIKICVKHKKPKLAFMLFAEMKKYEIKPNLVTYKTILTARSRYGSLQEVQQSLAVYQQM 637

Query: 778  RKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFASHITDFGRQSLLLEKVIEYLQDSTA 599
            RKAGYK NDYYLKQLIE+WCEGV+Q EH NEG+FAS ITDFG QS+LLEKV E+LQDS A
Sbjct: 638  RKAGYKPNDYYLKQLIEEWCEGVLQNEHHNEGQFASRITDFGPQSMLLEKVAEHLQDSNA 697

Query: 598  ESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILGQEQHAE--------- 446
            ESL IDL+GLTKVEARI+VLAVLRKIKEKY AGNS++DD+ IILG ++            
Sbjct: 698  ESLSIDLQGLTKVEARIIVLAVLRKIKEKYIAGNSMEDDVSIILGLQELGSDFIKGESDG 757

Query: 445  VGQAVIRLLQNDLSLHVIEGGLRAGEGKGMKGSSIVSSSLNLEEVDEKSRSAETSESPTR 266
            V +AVIRLL++DL L V   G R+G G   KGS + SSS+   E  E+S S + SESPTR
Sbjct: 758  VKEAVIRLLEHDLGLQVFAAGSRSGRG---KGSRMYSSSIG--ETIERSESKQASESPTR 812

Query: 265  RPIILRRLKVTRESMHQWLQRK 200
            RP++L+RLKVTRES+H WLQ+K
Sbjct: 813  RPMVLQRLKVTRESLHHWLQKK 834


>ref|XP_006367266.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Solanum tuberosum]
          Length = 859

 Score =  405 bits (1042), Expect = e-110
 Identities = 210/390 (53%), Positives = 274/390 (70%), Gaps = 17/390 (4%)
 Frame = -3

Query: 1318 WKERVSEQTLRDDKLSHADKAMDQ----MIEMGVPLTLCS----HLTMRVPFRPTISTYN 1163
            WKE   ++   +D     D  +D     ++   +P    +    H + RVPFRPT STYN
Sbjct: 463  WKENALQKDNCEDFGGKTDNTIDLSPTLVVSASIPTRTSASSHGHFSTRVPFRPTTSTYN 522

Query: 1162 IMLKACGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGI 983
            I++KACG+DYY AK LMEEMK  GL+PNHI+W+ILID+CGGSGNV GA+QIL +MRE+GI
Sbjct: 523  ILIKACGSDYYRAKALMEEMKEVGLSPNHITWTILIDICGGSGNVEGALQILRAMREAGI 582

Query: 982  QPDVVAYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQ 803
            QPDVV YTT IK+CV +K  K AF LFA M++YQIKPN VTYNT+LRARS YGSLQ VQQ
Sbjct: 583  QPDVVTYTTIIKVCVENKDFKSAFSLFAAMKRYQIKPNMVTYNTLLRARSRYGSLQEVQQ 642

Query: 802  YLVIYQQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFAS-HITDFGRQSLLLEKV 626
             L IYQ MRKAGYK NDYYLKQLIE WCEGVIQ  +Q +  F++ + TD G +S++L+KV
Sbjct: 643  CLAIYQHMRKAGYKPNDYYLKQLIEQWCEGVIQNGNQRKYNFSTRNRTDLGPESMILDKV 702

Query: 625  IEYLQDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILGQE---- 458
             E+LQ  +A S+ I+LRGL+KVEARIVVLAVLR I+EKY AG+SIK+D+ I LG +    
Sbjct: 703  AEHLQKDSANSISINLRGLSKVEARIVVLAVLRMIREKYTAGDSIKEDVQIFLGVQEVGI 762

Query: 457  ----QHAEVGQAVIRLLQNDLSLHVIEGGLRAGEGKGMKGSSIVSSSLNLEEVDEKSRSA 290
                Q + V +A+++LLQ+DL L VI    R G  +   G +      N+EE  E+    
Sbjct: 763  RAVGQESVVKEAIVKLLQHDLGLEVISAASRIGNDRNQDGINHPDKHSNMEENAERVILR 822

Query: 289  ETSESPTRRPIILRRLKVTRESMHQWLQRK 200
                SPTR+P++L+++++T+ES+  WL R+
Sbjct: 823  ANVHSPTRKPVVLQKMRITKESLQSWLTRR 852



 Score = 59.7 bits (143), Expect = 3e-06
 Identities = 32/117 (27%), Positives = 60/117 (51%), Gaps = 2/117 (1%)
 Frame = -3

Query: 1210 HLTMRVPFRPTISTYNIMLK--ACGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGS 1037
            HL M    +  + TY+ ++K  A    +  A ++ ++M   G+TPN ++WS LI  C  +
Sbjct: 356  HLEMAGALKLDVFTYSTLIKVFADAKMWQMALEIKKDMLSAGVTPNIVTWSSLISACANA 415

Query: 1036 GNVSGAVQILTSMRESGIQPDVVAYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNS 866
            G V  A+Q+   M ++G +P+   Y   +  CV   +   AF LF   ++  ++ ++
Sbjct: 416  GLVDQAIQLFEEMLQAGCEPNSQCYNILLHACVEACQYDRAFRLFRSWKENALQKDN 472


>ref|XP_004246707.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Solanum lycopersicum]
          Length = 857

 Score =  399 bits (1025), Expect = e-108
 Identities = 213/390 (54%), Positives = 272/390 (69%), Gaps = 17/390 (4%)
 Frame = -3

Query: 1318 WKERVSEQTLRDDKLSHADKAMDQ----MIEMGVPLTLCS----HLTMRVPFRPTISTYN 1163
            WKE   ++   +D     D  +D     ++   +P    +    H++ RVPF PT STYN
Sbjct: 465  WKENALQKDKCEDYGGKTDNNIDLSPTLVVSASIPTRTSASSHRHISTRVPFIPTTSTYN 524

Query: 1162 IMLKACGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGI 983
            I++KACG+DYY AK LMEEMK  GL+PNHI+W+ILID+CGGSGNV GA+QIL  MRE+GI
Sbjct: 525  ILMKACGSDYYRAKALMEEMKEVGLSPNHITWTILIDICGGSGNVEGALQILRVMREAGI 584

Query: 982  QPDVVAYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQ 803
            QPDVV YTT IK+CV +K  K AF LFA M++YQIKPN VTYNT+LRARS YGSLQ VQQ
Sbjct: 585  QPDVVTYTTIIKVCVENKDFKSAFSLFAAMKRYQIKPNMVTYNTLLRARSRYGSLQEVQQ 644

Query: 802  YLVIYQQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFAS-HITDFGRQSLLLEKV 626
             L IYQ MRKAGYK NDYYLKQLIE WCEGVIQ  +Q +  F++ + TD G QS++LEKV
Sbjct: 645  CLAIYQDMRKAGYKPNDYYLKQLIEQWCEGVIQNANQRKYNFSTRNRTDLGPQSMILEKV 704

Query: 625  IEYLQDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILG------ 464
             E+LQ  +A S+ I+LRGLTKVEARIVVLAVLR I+EKY AG+SIKDD+ I LG      
Sbjct: 705  AEHLQKDSANSISINLRGLTKVEARIVVLAVLRMIREKYTAGDSIKDDVQIFLGVKEVGI 764

Query: 463  --QEQHAEVGQAVIRLLQNDLSLHVIEGGLRAGEGKGMKGSSIVSSSLNLEEVDEKSRSA 290
               +Q + V +A+I+LLQ+DL L VI      G G         +   N+EE  E+    
Sbjct: 765  RAVKQESVVKEAIIQLLQHDLGLEVISAASTIGNGINHPD----NKHSNMEENAERVILR 820

Query: 289  ETSESPTRRPIILRRLKVTRESMHQWLQRK 200
             +  SPTR+P++L+++++T+ES+  WL R+
Sbjct: 821  PSVYSPTRKPVVLQKMRITKESLQSWLTRR 850



 Score = 59.3 bits (142), Expect = 4e-06
 Identities = 32/105 (30%), Positives = 54/105 (51%), Gaps = 2/105 (1%)
 Frame = -3

Query: 1210 HLTMRVPFRPTISTYNIMLK--ACGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGS 1037
            HL M    +  + TY+ ++K  A    +  A ++ ++M   G+TPN ++WS LI  C  +
Sbjct: 358  HLEMAGALKLDVFTYSTLIKVFADAKMWQMALEIKKDMLSAGVTPNIVTWSSLISACANA 417

Query: 1036 GNVSGAVQILTSMRESGIQPDVVAYTTAIKICVNHKKTKLAFMLF 902
            G V  A+Q+   M ++G +P+   Y   +  CV   +   AF LF
Sbjct: 418  GVVDQAIQLFEEMLQAGCEPNSQCYNILLHACVEACQYDRAFRLF 462


>ref|XP_002273255.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic [Vitis vinifera]
            gi|297741486|emb|CBI32618.3| unnamed protein product
            [Vitis vinifera]
          Length = 842

 Score =  389 bits (1000), Expect = e-105
 Identities = 207/341 (60%), Positives = 253/341 (74%), Gaps = 10/341 (2%)
 Frame = -3

Query: 1192 PFRPTISTYNIMLKACGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQ 1013
            PF PT +TYNI++KACGTDYY AK LM+EMK  GL+PNHISWSILID+CGG+GN+ GAV+
Sbjct: 496  PFTPTTTTYNILMKACGTDYYRAKALMDEMKTAGLSPNHISWSILIDICGGTGNIVGAVR 555

Query: 1012 ILTSMRESGIQPDVVAYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARS 833
            IL +MRE+GI+PDVVAYTTAIK CV  K  K+AF LFAEM++YQI+PN VTYNT+LRARS
Sbjct: 556  ILKTMREAGIKPDVVAYTTAIKYCVESKNLKIAFSLFAEMKRYQIQPNLVTYNTLLRARS 615

Query: 832  MYGSLQGVQQYLVIYQQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFAS-HITDF 656
             YGSL  VQQ L IYQ MRKAGYK NDYYLK+LIE+WCEGVIQ  + N+ +F+S +  D+
Sbjct: 616  RYGSLHEVQQCLAIYQHMRKAGYKSNDYYLKELIEEWCEGVIQDNNLNQSKFSSVNRADW 675

Query: 655  GR-QSLLLEKVIEYLQDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDL 479
            GR QSLLLEKV  +LQ S AESL IDL+GLT+VEARIVVLAVLR IKE Y  G+ IKDD+
Sbjct: 676  GRPQSLLLEKVAAHLQKSVAESLAIDLQGLTQVEARIVVLAVLRMIKENYILGHPIKDDI 735

Query: 478  LIILG--------QEQHAEVGQAVIRLLQNDLSLHVIEGGLRAGEGKGMKGSSIVSSSLN 323
            LIILG         E  + V  A+I+LLQ++L L V   G +    K +       S  +
Sbjct: 736  LIILGIKKVDANLVEHESPVKGAIIKLLQDELGLEVAFAGPKIALDKRINLGGPPGSDPD 795

Query: 322  LEEVDEKSRSAETSESPTRRPIILRRLKVTRESMHQWLQRK 200
             +E   ++R     ES TRRP +L+R KVTR+S+  WLQR+
Sbjct: 796  WQEALGRNRLPTELESSTRRPAVLQRFKVTRKSLDHWLQRR 836


>ref|XP_007208081.1| hypothetical protein PRUPE_ppa001520mg [Prunus persica]
            gi|462403723|gb|EMJ09280.1| hypothetical protein
            PRUPE_ppa001520mg [Prunus persica]
          Length = 809

 Score =  380 bits (977), Expect = e-103
 Identities = 195/346 (56%), Positives = 252/346 (72%), Gaps = 8/346 (2%)
 Frame = -3

Query: 1198 RVPFRPTISTYNIMLKACGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGA 1019
            R+ F+PT +TYN ++KACGTDYYHAK L++EM+  GL PN ISWSIL D+CGGSGNV GA
Sbjct: 464  RLSFKPTTTTYNTLMKACGTDYYHAKALLDEMRAVGLYPNQISWSILADICGGSGNVEGA 523

Query: 1018 VQILTSMRESGIQPDVVAYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRA 839
            +QIL +MR +G++PDVVAYTTAIK+CV ++  +LA  LF EM+KYQI PN VTYNT+LRA
Sbjct: 524  LQILKNMRAAGMKPDVVAYTTAIKVCVENENLELALSLFGEMKKYQIHPNLVTYNTLLRA 583

Query: 838  RSMYGSLQGVQQYLVIYQQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFAS-HIT 662
            RS YGS+  VQQ L IYQ MRKAGYK NDYYL+QLIE+WCEGVIQ  +  +  F+S + T
Sbjct: 584  RSRYGSVSEVQQCLAIYQDMRKAGYKSNDYYLEQLIEEWCEGVIQDSNAKQEEFSSCNKT 643

Query: 661  DFGRQ-SLLLEKVIEYLQDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKD 485
            D GR  SLLLEKV E+LQ   AE+L +DL+GLTKVEARIVVLAVLR IKE Y  G+S+KD
Sbjct: 644  DIGRPGSLLLEKVAEHLQTHIAETLAVDLQGLTKVEARIVVLAVLRMIKENYTLGHSVKD 703

Query: 484  DLLIILGQ------EQHAEVGQAVIRLLQNDLSLHVIEGGLRAGEGKGMKGSSIVSSSLN 323
            D+LI++G+       Q+ EV  A+ +LLQ++L L V+  G + G    ++  +   S  +
Sbjct: 704  DMLIVVGEVDGGSTTQNLEVKDAITKLLQDELGLKVLAAGAKVGLDTTIERGNTTDSDQD 763

Query: 322  LEEVDEKSRSAETSESPTRRPIILRRLKVTRESMHQWLQRKTLPSK 185
            L+E+  +          TRRP+ L RLKVTR S+  WL+R++ P +
Sbjct: 764  LDEMSGRDELPAELIYSTRRPVALERLKVTRGSLQHWLRRRSAPRR 809



 Score = 74.7 bits (182), Expect = 8e-11
 Identities = 62/256 (24%), Positives = 113/256 (44%), Gaps = 10/256 (3%)
 Frame = -3

Query: 1210 HLTMRVPFRPTISTYNIMLK--ACGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGS 1037
            HL      +  + TY+ ++K  A    ++ A ++ E+M   G+TPN ++WS LI  C  +
Sbjct: 355  HLESTGVLKLDVFTYSTIVKVFADAKLWHMALNVKEDMLSAGVTPNTVTWSSLISACANA 414

Query: 1036 GNVSGAVQILTSMRESGIQPDVVAYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTY 857
            G V  A+Q+   M  +G +P+   +   +  CV   +   AF LF  +++   KP + TY
Sbjct: 415  GIVEKAIQLFEEMLLAGSEPNSQCFNILLHACVEANQYDRAFRLFQSLKRLSFKPTTTTY 474

Query: 856  NTILRARSMYGSLQGVQQY--LVIYQQMRKAGYKRNDYYLKQLIEDWC------EGVIQT 701
            NT+++A        G   Y    +  +MR  G   N      ++ D C      EG +Q 
Sbjct: 475  NTLMKA-------CGTDYYHAKALLDEMRAVGLYPNQISW-SILADICGGSGNVEGALQI 526

Query: 700  EHQNEGRFASHITDFGRQSLLLEKVIEYLQDSTAESLFIDLRGLTKVEARIVVLAVLRKI 521
                  R A    D    +  ++  +E      A SLF +++   ++   +V    L + 
Sbjct: 527  --LKNMRAAGMKPDVVAYTTAIKVCVENENLELALSLFGEMKKY-QIHPNLVTYNTLLRA 583

Query: 520  KEKYAAGNSIKDDLLI 473
            + +Y + + ++  L I
Sbjct: 584  RSRYGSVSEVQQCLAI 599


>ref|XP_007027210.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative
            [Theobroma cacao] gi|508715815|gb|EOY07712.1|
            Tetratricopeptide repeat (TPR)-like superfamily protein,
            putative [Theobroma cacao]
          Length = 858

 Score =  371 bits (953), Expect = e-100
 Identities = 207/354 (58%), Positives = 257/354 (72%), Gaps = 12/354 (3%)
 Frame = -3

Query: 1225 LTLCSHLTM--RVPFRPTISTYNIMLKACGTDYYHAKDLMEEMKMFGLTPNHISWSILID 1052
            LT   HL+   +  F PT +TYNI++KAC TDYY AK LM+EMK  GL+PNH+SWSILID
Sbjct: 500  LTNSHHLSFAKKFSFTPTTATYNILMKACCTDYYRAKALMDEMKSVGLSPNHVSWSILID 559

Query: 1051 VCGGSGNVSGAVQILTSMRESGIQPDVVAYTTAIKICVNHKKTKLAFMLFAEMEKYQIKP 872
            +C GSGNV GA+QIL +M  +GI+PDVVAYTTAIK+CV  K  KLAF LF EM++Y+++P
Sbjct: 560  ICRGSGNVEGAIQILKTMHVTGIKPDVVAYTTAIKVCVGSKNLKLAFSLFEEMKRYRVQP 619

Query: 871  NSVTYNTILRARSMYGSLQGVQQYLVIYQQMRKAGYKRNDYYLKQLIEDWCEGVI-QTEH 695
            N VTYNT+LRARS YGSL  VQQ L IYQ MRKAGYK ND YLK+LIE+WCEGVI +  H
Sbjct: 620  NLVTYNTLLRARSRYGSLHEVQQCLAIYQDMRKAGYKSNDIYLKELIEEWCEGVIKENNH 679

Query: 694  QNEGRFASHITDFGR-QSLLLEKVIEYLQDSTAESLFIDLRGLTKVEARIVVLAVLRKIK 518
            + EG  +   TD  R  SLLLEK+  +LQ STAES  IDLRGLTKVEARIVVLAVLR IK
Sbjct: 680  KREGLSSCKRTDLERPHSLLLEKIAVHLQMSTAESPAIDLRGLTKVEARIVVLAVLRMIK 739

Query: 517  EKYAAGNSIKDDLLIILG-QEQHA-------EVGQAVIRLLQNDLSLHVIEGGLRAGEGK 362
            E +  G+S+KDD+LIILG  E+HA       EV  AV++LLQ++L L V+    +   G 
Sbjct: 740  ENHILGHSVKDDMLIILGVSERHANAAKQKSEVKDAVMKLLQDELGLEVLLVEPQVKNGL 799

Query: 361  GMKGSSIVSSSLNLEEVDEKSRSAETSESPTRRPIILRRLKVTRESMHQWLQRK 200
                + I +  + LE V + S S++   S TRRP+IL+RLKVTR+S++ WL R+
Sbjct: 800  VDLQTPIDADPVLLETVGKNSLSSKPLSS-TRRPVILQRLKVTRKSLNHWLWRR 852


>gb|EXB84820.1| hypothetical protein L484_003853 [Morus notabilis]
          Length = 822

 Score =  371 bits (952), Expect = e-100
 Identities = 200/382 (52%), Positives = 263/382 (68%), Gaps = 11/382 (2%)
 Frame = -3

Query: 1312 ERVSEQTLRDDKLSHADKAMDQMIEMGVPLTLCS-HLTMRVPFRPTISTYNIMLKACGTD 1136
            +  SE+  R D+ S+    +  + +     TLC  +    +PF PT +TYNI++KACG+D
Sbjct: 445  QETSEEDGRGDRDSNQSAGVTSISQSS---TLCGLNFARELPFTPTTTTYNILMKACGSD 501

Query: 1135 YYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVVAYTT 956
            YYHAK L+EEM+  GL+PN I+WSILID+CG  GNV GA+QIL +MR +GI+PDVVAYTT
Sbjct: 502  YYHAKALIEEMEAVGLSPNQITWSILIDICGDLGNVEGALQILKTMRATGIEPDVVAYTT 561

Query: 955  AIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQYLVIYQQMR 776
             IK+CV  K  K AF LFAEM++YQI+PN VTYNT+LRAR+ YGSLQ V+Q L +YQ MR
Sbjct: 562  VIKVCVESKDLKQAFELFAEMKRYQIQPNLVTYNTLLRARNRYGSLQEVKQCLAVYQDMR 621

Query: 775  KAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFAS-HITDFGR-QSLLLEKVIEYLQDST 602
            +AGY  NDYYLKQLIE+WCEGVIQ  +QN    +S + TD  R QSLLLEKV E+L+   
Sbjct: 622  RAGYNSNDYYLKQLIEEWCEGVIQGNNQNREESSSFNKTDKKRPQSLLLEKVAEHLEKHI 681

Query: 601  AESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILG--------QEQHAE 446
            AE+L +D++GL KVEARIVVLAVLR +KE Y  G  +KDD+LII+G         EQ  E
Sbjct: 682  AETLTVDVQGLKKVEARIVVLAVLRMVKENYTMGYLVKDDMLIIIGACKVDAVPDEQELE 741

Query: 445  VGQAVIRLLQNDLSLHVIEGGLRAGEGKGMKGSSIVSSSLNLEEVDEKSRSAETSESPTR 266
            V  A+ +LL+++L L V+  GL+    + +   S+ SS  + E            +  TR
Sbjct: 742  VKDAITKLLKDELGLEVLSTGLKIEPNRQVDSDSLGSSDFSGE-----------MKYSTR 790

Query: 265  RPIILRRLKVTRESMHQWLQRK 200
            RP++++RLKVT+ES+  WLQRK
Sbjct: 791  RPVVIQRLKVTKESLQHWLQRK 812


>ref|XP_004305248.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 847

 Score =  366 bits (940), Expect = 1e-98
 Identities = 194/354 (54%), Positives = 247/354 (69%), Gaps = 11/354 (3%)
 Frame = -3

Query: 1225 LTLCSHLTMRVPFRPTISTYNIMLKACGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVC 1046
            + L S+    + F+PT +TYN ++KACG+DYYHAK LM+EMK  GL PN I+WSIL D+C
Sbjct: 490  IILPSNFAEGLSFKPTTTTYNTLMKACGSDYYHAKALMDEMKTVGLLPNQITWSILADIC 549

Query: 1045 GGSGNVSGAVQILTSMRESGIQPDVVAYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNS 866
            G SGNV GA+QIL SMR +GIQPDVVAYTTAIKICV  +   LA +LFAEM+KYQI PN 
Sbjct: 550  GSSGNVQGALQILKSMRVAGIQPDVVAYTTAIKICVESENLDLALLLFAEMKKYQIHPNL 609

Query: 865  VTYNTILRARSMYGSLQGVQQYLVIYQQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNE 686
            VTYNT+LRARS YGS+  VQQ L IYQ MRKAGYK NDYYL+QLIE+WCEGVIQ     +
Sbjct: 610  VTYNTLLRARSRYGSVSEVQQCLAIYQDMRKAGYKPNDYYLEQLIEEWCEGVIQDSCPKQ 669

Query: 685  GRFA-SHITDFGRQ-SLLLEKVIEYLQDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEK 512
            G F+     D GR  SLLLEKV E+LQ   A++L +DL+GLTKVEARIVVLAVLR IKE 
Sbjct: 670  GEFSYGDKADIGRPGSLLLEKVAEHLQQHIADTLAVDLQGLTKVEARIVVLAVLRMIKEN 729

Query: 511  YAAGNSIKDDLLIILG---------QEQHAEVGQAVIRLLQNDLSLHVIEGGLRAGEGKG 359
            Y  G+S+KDD+LI++G            + EV  A+ +LLQ++L L V+    +      
Sbjct: 730  YILGDSVKDDMLIMVGVHDEVDGGSTAHNLEVKDAITKLLQDELGLKVLSTVPKVALDTT 789

Query: 358  MKGSSIVSSSLNLEEVDEKSRSAETSESPTRRPIILRRLKVTRESMHQWLQRKT 197
            +   + + S  NL+E   +          TRRP++L RLKV+R+S+ QWL++++
Sbjct: 790  IVSQNTIDSDQNLDEKPLRKELQPELIYSTRRPVVLERLKVSRKSLQQWLRKRS 843


>ref|XP_006381507.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550336211|gb|ERP59304.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 828

 Score =  364 bits (934), Expect = 5e-98
 Identities = 200/370 (54%), Positives = 261/370 (70%), Gaps = 9/370 (2%)
 Frame = -3

Query: 1282 DKLSHADKAMDQMIEMGVPLTLCSHLTMRVPFRPTISTYNIMLKACGTDYYHAKDLMEEM 1103
            D++ HA K    M  + VP +   +   + PF PT +TY++++KACG+DY+ AK LM+EM
Sbjct: 478  DEIEHAQKHCPNMTTI-VPNSHHLNFIKKFPFTPTPATYHMLMKACGSDYHRAKALMDEM 536

Query: 1102 KMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVVAYTTAIKICVNHKKT 923
            K  G++PNHISWSILID+CG SGNVSGAVQIL +MR +G++PDVVAYTTAIK+CV  K  
Sbjct: 537  KTVGISPNHISWSILIDICGVSGNVSGAVQILKNMRLAGVEPDVVAYTTAIKVCVETKNL 596

Query: 922  KLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQYLVIYQQMRKAGYKRNDYYL 743
            KLAF LFAEM++ QI PN VTYNT+LRAR+ YGSL+ VQQ L IYQ MRKAGYK NDYYL
Sbjct: 597  KLAFSLFAEMKRCQINPNLVTYNTLLRARTRYGSLREVQQCLAIYQDMRKAGYKSNDYYL 656

Query: 742  KQLIEDWCEGVIQTEHQNEGRFAS-HITDFGR-QSLLLEKVIEYLQDSTAESLFIDLRGL 569
            KQLIE+WCEGVIQ  +Q +G FAS   TD GR +SLLLEKV  +LQ++ +E+L IDL+GL
Sbjct: 657  KQLIEEWCEGVIQDNNQIQGGFASCKRTDLGRPRSLLLEKVAAHLQNNISENLAIDLQGL 716

Query: 568  TKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIIL-------GQEQHAEVGQAVIRLLQND 410
            TKVEARIVVLAVLR IKE Y  G S+K+D+ I L         ++ +EV  A+I LL+N+
Sbjct: 717  TKVEARIVVLAVLRMIKENYTLGYSVKEDMWITLDVSKVDPASKRDSEVKNAIIELLRNE 776

Query: 409  LSLHVIEGGLRAGEGKGMKGSSIVSSSLNLEEVDEKSRSAETSESPTRRPIILRRLKVTR 230
            L L V                 +V+   +L+++   S+S       +  P++ +RLKV R
Sbjct: 777  LGLEV-----------------LVAVPGHLDDIKTDSKS-------SLDPVVTQRLKVRR 812

Query: 229  ESMHQWLQRK 200
            +S+H+WLQR+
Sbjct: 813  KSLHEWLQRR 822


>ref|XP_006428907.1| hypothetical protein CICLE_v10011055mg [Citrus clementina]
            gi|568853887|ref|XP_006480569.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Citrus sinensis]
            gi|557530964|gb|ESR42147.1| hypothetical protein
            CICLE_v10011055mg [Citrus clementina]
          Length = 850

 Score =  363 bits (932), Expect = 9e-98
 Identities = 204/380 (53%), Positives = 256/380 (67%), Gaps = 15/380 (3%)
 Frame = -3

Query: 1294 TLRDDKLSHADKAMDQMIEMGVPLTLCSHLTMRVPFRPTISTYNIMLKACGTDYYHAKDL 1115
            T R   + H DK         VP +  S    R  F+PT +TYNI++KAC TDYY  K L
Sbjct: 476  TDRISNMEHKDKQSITNTPNFVPNSHYSSFDKRFSFKPTTTTYNILMKACCTDYYRVKAL 535

Query: 1114 MEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDVVAYTTAIKICVN 935
            M+EM+  GL+PNHISW+ILID CGGSGNV GA+QIL  MRE G+ PDVVAYTTAIK+CV 
Sbjct: 536  MDEMRTVGLSPNHISWTILIDACGGSGNVEGALQILKIMREDGMSPDVVAYTTAIKVCVR 595

Query: 934  HKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQYLVIYQQMRKAGYKRN 755
             K+ KLAF LF EM+ YQI+PN VTY T+LRARS YGSL  VQQ L +YQ M KAGYK N
Sbjct: 596  SKRLKLAFSLFEEMKHYQIQPNLVTYITLLRARSRYGSLHEVQQCLAVYQDMWKAGYKAN 655

Query: 754  DYYLKQLIEDWCEGVIQTEHQNEGR--FASHITDFGRQSLLLEKVIEYLQDSTAESLFID 581
            D YLK++IE+WCEGVIQ ++QN+G             QSLLLEKV  +LQ S AE+L ID
Sbjct: 656  DTYLKEVIEEWCEGVIQDKNQNQGEVTLCRRTNSQRPQSLLLEKVAVHLQKSAAENLAID 715

Query: 580  LRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILG-------QEQH-AEVGQAVIR 425
            L+GLTKVEARIVVLAVL+ +KE Y+ G  +KDDL+I+LG       Q +H  EV  A+ +
Sbjct: 716  LQGLTKVEARIVVLAVLQMMKENYSLGVPVKDDLMIVLGPNKVNKIQAKHDLEVKDAITK 775

Query: 424  LLQNDLSLHVIEGGLRAGEGKGMKGSSIVSSSLNLEE-VDEKSRSAET----SESPTRRP 260
            LLQ+DL L V            + G SI   + ++++ +D +S  A+T     +S TRRP
Sbjct: 776  LLQDDLGLKVF-----------LDGPSIQHKNAHMQKLLDSESNMAKTLHIELKSSTRRP 824

Query: 259  IILRRLKVTRESMHQWLQRK 200
             IL+RLKV ++S+H WLQR+
Sbjct: 825  KILQRLKVPKKSLHHWLQRR 844


>ref|NP_195903.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|332278227|sp|Q8GYL7.3|PP361_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At5g02830, chloroplastic; Flags: Precursor
            gi|332003140|gb|AED90523.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 852

 Score =  354 bits (909), Expect = 4e-95
 Identities = 195/392 (49%), Positives = 265/392 (67%), Gaps = 19/392 (4%)
 Frame = -3

Query: 1318 WK-ERVSEQTLRDDKLSHADKAMDQMIEMGVPLTLCSH--------LTMRVPFRPTISTY 1166
            WK   V+E    DD +S    +   +++   P +L +          + R  F+PT +TY
Sbjct: 470  WKGSSVNESLYADDIVSKGRTSSPNILKNNGPGSLVNRNSNSPYIQASKRFCFKPTTATY 529

Query: 1165 NIMLKACGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESG 986
            NI+LKACGTDYY  K+LM+EMK  GL+PN I+WS LID+CGGSG+V GAV+IL +M  +G
Sbjct: 530  NILLKACGTDYYRGKELMDEMKSLGLSPNQITWSTLIDMCGGSGDVEGAVRILRTMHSAG 589

Query: 985  IQPDVVAYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQ 806
             +PDVVAYTTAIKIC  +K  KLAF LF EM +YQIKPN VTYNT+L+ARS YGSL  V+
Sbjct: 590  TRPDVVAYTTAIKICAENKCLKLAFSLFEEMRRYQIKPNWVTYNTLLKARSKYGSLLEVR 649

Query: 805  QYLVIYQQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFASHITD-FGRQ-SLLLE 632
            Q L IYQ MR AGYK ND++LK+LIE+WCEGVIQ   Q++ + +    D  GR  SLL+E
Sbjct: 650  QCLAIYQDMRNAGYKPNDHFLKELIEEWCEGVIQENGQSQDKISDQEGDNAGRPVSLLIE 709

Query: 631  KVIEYLQDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILGQE-- 458
            KV  ++Q+ TA +L IDL+GLTK+EAR+VVLAVLR IKE Y  G+ + DD+LII+G +  
Sbjct: 710  KVATHMQERTAGNLAIDLQGLTKIEARLVVLAVLRMIKEDYMRGDVVIDDVLIIIGTDEA 769

Query: 457  ------QHAEVGQAVIRLLQNDLSLHVIEGGLRAGEGKGMKGSSIVSSSLNLEEVDEKSR 296
                  Q   V +A+++LL+++LSL V+  G R          +I+  +  +++ D+++ 
Sbjct: 770  NTVSGKQEITVQEALVKLLRDELSLVVLPAGQR----------NIIQDAHCVDDADQENT 819

Query: 295  SAETSESPTRRPIILRRLKVTRESMHQWLQRK 200
             +  S S TRRP IL RL VT+ S++QWLQR+
Sbjct: 820  KSFVSISSTRRPAILERLMVTKASLYQWLQRR 851


>ref|XP_006287051.1| hypothetical protein CARUB_v10000200mg [Capsella rubella]
            gi|482555757|gb|EOA19949.1| hypothetical protein
            CARUB_v10000200mg [Capsella rubella]
          Length = 858

 Score =  353 bits (907), Expect = 7e-95
 Identities = 191/340 (56%), Positives = 242/340 (71%), Gaps = 10/340 (2%)
 Frame = -3

Query: 1189 FRPTISTYNIMLKACGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQI 1010
            F+PT +TYNI+LKACGTDYY  K+LM+EMK  GLTPN I+WS LID+CGGSG+V GAV+I
Sbjct: 522  FKPTTATYNILLKACGTDYYRGKELMDEMKSLGLTPNQITWSTLIDMCGGSGDVEGAVRI 581

Query: 1009 LTSMRESGIQPDVVAYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSM 830
            L +M  +G +PDVVAYTTAIKIC  +K  KLAF LF EM +YQIKPN VTYNT+L+ARS 
Sbjct: 582  LRTMHSAGTRPDVVAYTTAIKICAENKSLKLAFSLFEEMRRYQIKPNWVTYNTLLKARSK 641

Query: 829  YGSLQGVQQYLVIYQQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFASHITDFGR 650
            YGSL  V+Q L IYQ MRKAGYK ND++LK+LIE+WCEGVIQ   Q++ + +    D   
Sbjct: 642  YGSLLEVRQCLAIYQDMRKAGYKPNDHFLKELIEEWCEGVIQENGQSQNKISDQEGDHAG 701

Query: 649  Q--SLLLEKVIEYLQDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLL 476
            +  SLL+EKV  +LQ+ TA +L IDL+GLTKVEAR+VVLAVLR IKE Y  G+ + DD+L
Sbjct: 702  RPVSLLIEKVATHLQERTAGNLAIDLQGLTKVEARLVVLAVLRMIKEDYMRGDVVIDDVL 761

Query: 475  IILGQ--------EQHAEVGQAVIRLLQNDLSLHVIEGGLRAGEGKGMKGSSIVSSSLNL 320
            IILG         +Q   V +A+++LLQ +LSL V+  G R           +  ++ + 
Sbjct: 762  IILGTSEANTDSGKQDIAVKEALVKLLQEELSLVVLPAGQR---NIKQDAHCVDDANQDT 818

Query: 319  EEVDEKSRSAETSESPTRRPIILRRLKVTRESMHQWLQRK 200
            E   E ++S   S S TRRP IL RL VT+ S++QWLQRK
Sbjct: 819  EHTLENTKSF-ISISSTRRPAILERLMVTKASLYQWLQRK 857


>dbj|BAC42187.2| unknown protein [Arabidopsis thaliana]
          Length = 852

 Score =  353 bits (905), Expect = 1e-94
 Identities = 194/392 (49%), Positives = 265/392 (67%), Gaps = 19/392 (4%)
 Frame = -3

Query: 1318 WK-ERVSEQTLRDDKLSHADKAMDQMIEMGVPLTLCSH--------LTMRVPFRPTISTY 1166
            WK   V+E    DD +S    +   +++   P +L +          + R  F+PT +TY
Sbjct: 470  WKGSSVNESLYADDIVSKGRTSSPNILKNNGPGSLVNRNSNSPYIQASKRFCFKPTTATY 529

Query: 1165 NIMLKACGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESG 986
            NI+LKACGTDYY  K+LM+EMK  GL+PN I+WS LID+CGGSG+V GAV+IL +M  +G
Sbjct: 530  NILLKACGTDYYRGKELMDEMKSLGLSPNQITWSTLIDMCGGSGDVEGAVRILRTMHSAG 589

Query: 985  IQPDVVAYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQ 806
             +PDVVAYTTAIKIC  +K  KLAF LF EM +YQIKPN VTYNT+L+ARS YGSL  V+
Sbjct: 590  TRPDVVAYTTAIKICAENKCLKLAFSLFEEMRRYQIKPNWVTYNTLLKARSKYGSLLEVR 649

Query: 805  QYLVIYQQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFASHITD-FGRQ-SLLLE 632
            Q L IYQ MR AGYK ND++LK+LIE+WCEGVIQ   +++ + +    D  GR  SLL+E
Sbjct: 650  QCLAIYQDMRNAGYKPNDHFLKELIEEWCEGVIQENGRSQDKISDQEGDNAGRPVSLLIE 709

Query: 631  KVIEYLQDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILGQE-- 458
            KV  ++Q+ TA +L IDL+GLTK+EAR+VVLAVLR IKE Y  G+ + DD+LII+G +  
Sbjct: 710  KVATHMQERTAGNLAIDLQGLTKIEARLVVLAVLRMIKEDYMRGDVVIDDVLIIIGTDEA 769

Query: 457  ------QHAEVGQAVIRLLQNDLSLHVIEGGLRAGEGKGMKGSSIVSSSLNLEEVDEKSR 296
                  Q   V +A+++LL+++LSL V+  G R          +I+  +  +++ D+++ 
Sbjct: 770  NTVSGKQEITVQEALVKLLRDELSLVVLPAGQR----------NIIQDAHCVDDADQENT 819

Query: 295  SAETSESPTRRPIILRRLKVTRESMHQWLQRK 200
             +  S S TRRP IL RL VT+ S++QWLQR+
Sbjct: 820  KSFVSISSTRRPAILERLMVTKASLYQWLQRR 851


>ref|XP_006398739.1| hypothetical protein EUTSA_v10012661mg [Eutrema salsugineum]
            gi|557099829|gb|ESQ40192.1| hypothetical protein
            EUTSA_v10012661mg [Eutrema salsugineum]
          Length = 858

 Score =  352 bits (904), Expect = 2e-94
 Identities = 190/340 (55%), Positives = 240/340 (70%), Gaps = 10/340 (2%)
 Frame = -3

Query: 1189 FRPTISTYNIMLKACGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQI 1010
            F+PT +TYNI+LKACGTDYY  K+LM+EM+  GL PN I+WS LID+CGGSG+V GAV I
Sbjct: 519  FKPTTATYNILLKACGTDYYRGKELMDEMRSLGLAPNQITWSTLIDICGGSGDVEGAVGI 578

Query: 1009 LTSMRESGIQPDVVAYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSM 830
            L +M  +G +PDVVAYTTAIKIC  +K  KLAF LF EM +YQIKPN VTYNT+L+ARS 
Sbjct: 579  LRTMHSAGTRPDVVAYTTAIKICAENKSLKLAFSLFEEMRRYQIKPNWVTYNTLLKARSK 638

Query: 829  YGSLQGVQQYLVIYQQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFASHI-TDFG 653
            YGSL  V+Q L IYQ MRKAGYK ND++LK+LIE+WCEGVIQ   Q++ + +    T+ G
Sbjct: 639  YGSLLEVRQCLAIYQDMRKAGYKPNDHFLKELIEEWCEGVIQENSQSQIKTSDQEGTNLG 698

Query: 652  RQ-SLLLEKVIEYLQDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLL 476
            R  SLL+EKV  +LQ+ TA +L IDL+GLTKVEAR+VVLAVLR IKE Y  G+ + DDLL
Sbjct: 699  RPVSLLIEKVATHLQERTAGNLAIDLQGLTKVEARLVVLAVLRMIKEDYIRGDVVTDDLL 758

Query: 475  IILGQ--------EQHAEVGQAVIRLLQNDLSLHVIEGGLRAGEGKGMKGSSIVSSSLNL 320
            IILG         +Q   V   +++LL+++LSL V+  G R      +    +  +   +
Sbjct: 759  IILGTGEANIDPGKQEIAVKDVLVQLLKDELSLVVLPAGHRHVLDITLDARCVDDADQGI 818

Query: 319  EEVDEKSRSAETSESPTRRPIILRRLKVTRESMHQWLQRK 200
            E   E ++S     S TRRP IL RL VT+ S+HQWLQRK
Sbjct: 819  ELTSENTKSI-VGISSTRRPAILERLMVTKASLHQWLQRK 857


>ref|XP_002525196.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223535493|gb|EEF37162.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 786

 Score =  350 bits (899), Expect = 6e-94
 Identities = 192/344 (55%), Positives = 240/344 (69%), Gaps = 11/344 (3%)
 Frame = -3

Query: 1198 RVPFRPTISTYNIMLKACGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGA 1019
            + PF P+ +TYN ++KACG+DY  AK LM+EM+  GL+PNHISWSILID+CG SGN+ GA
Sbjct: 441  KFPFTPSSATYNTLMKACGSDYNRAKALMDEMQAVGLSPNHISWSILIDICGSSGNMEGA 500

Query: 1018 VQILTSMRESGIQPDVVAYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRA 839
            +QIL +MR +GI+PDV+AYTTAIK+ V  K  K+AF LFAEM++YQ+KPN VTY+T+LRA
Sbjct: 501  IQILKNMRMAGIEPDVIAYTTAIKVSVESKNLKMAFSLFAEMKRYQLKPNLVTYDTLLRA 560

Query: 838  RSMYGSLQGVQQYLVIYQQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRF-ASHIT 662
            R+ YGSL+ VQQ L IYQ MRKAGYK ND YLKQLIE+WCEGVIQ   Q +  F      
Sbjct: 561  RTRYGSLKEVQQCLAIYQDMRKAGYKSNDNYLKQLIEEWCEGVIQDNDQCQDDFKPCKRA 620

Query: 661  DFGR-QSLLLEKVIEYLQDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKD 485
            +FGR  SLLLEKV  +L  + AESL +DL+GLTKVEARIVVLAVLR +KE Y  G+ +KD
Sbjct: 621  EFGRPHSLLLEKVAAHLHHNVAESLSVDLQGLTKVEARIVVLAVLRMVKENYIQGHLVKD 680

Query: 484  DLLIILGQE--------QHAEVGQAVIRLLQNDLSLHVIEGGLRAGEGKGMKGSSIVSSS 329
            D+ I LG +        Q AEV  A+ +LL N+L L V+    R          + +   
Sbjct: 681  DMSITLGIDKVDVLPATQKAEVKDAIFKLLHNELGLEVLIVVPRYTADL----ETDLEIP 736

Query: 328  LNLEEVDEKSRSAETSE-SPTRRPIILRRLKVTRESMHQWLQRK 200
            LN  +   KS   E    S  RRP++L+RLKVTR S+H WLQRK
Sbjct: 737  LNSYQNWSKSSGRENIRVSSARRPLVLQRLKVTRNSLHSWLQRK 780


>ref|XP_004142106.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Cucumis sativus]
          Length = 849

 Score =  347 bits (891), Expect = 5e-93
 Identities = 189/339 (55%), Positives = 238/339 (70%), Gaps = 10/339 (2%)
 Frame = -3

Query: 1189 FRPTISTYNIMLKACGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQI 1010
            F+PTI+TYNI++KACGTDYYHAK LMEEMK  GLTPNHISWSIL+D+CG S +V  AVQI
Sbjct: 514  FKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILVDICGRSHDVESAVQI 573

Query: 1009 LTSMRESGIQPDVVAYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSM 830
            LT+MR +G+ PDVVAYTTAIK+CV  K  KLAF LF EM++++I+PN VTY+T+LRARS 
Sbjct: 574  LTTMRMAGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARST 633

Query: 829  YGSLQGVQQYLVIYQQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFA-SHITDFG 653
            YGSL  VQQ L IYQ MRK+G+K ND+YLK+LI +WCEGVIQ  +Q        +  D G
Sbjct: 634  YGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQKNNQQPVEITPCNKIDIG 693

Query: 652  R-QSLLLEKVIEYLQDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLL 476
            + + L+LEKV ++LQ S AESL IDL+ LTKVEARIVVLAVLR IKE YA G S+KDD+ 
Sbjct: 694  KPRCLILEKVADHLQKSFAESLTIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIF 753

Query: 475  IILGQE--------QHAEVGQAVIRLLQNDLSLHVIEGGLRAGEGKGMKGSSIVSSSLNL 320
            IIL           Q+ EV  A+ RLLQ++L L V+  G      K        S S  +
Sbjct: 754  IILEVNKVETDLVPQNFEVRDAITRLLQDELGLEVLPTGPTIALDKVPN-----SESSKI 808

Query: 319  EEVDEKSRSAETSESPTRRPIILRRLKVTRESMHQWLQR 203
                +   +   ++  TR+P  ++RLKVT++S+  WLQR
Sbjct: 809  SHTTKLKGTMGRNKYFTRKPADVQRLKVTKKSLQDWLQR 847


>ref|XP_004493936.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Cicer arietinum]
          Length = 799

 Score =  347 bits (889), Expect = 9e-93
 Identities = 192/387 (49%), Positives = 256/387 (66%), Gaps = 14/387 (3%)
 Frame = -3

Query: 1318 WKERVSEQTLRDDKLSHADKAMDQMIEMGVPLTLCSH----LTMRVPFRPTISTYNIMLK 1151
            WK   +  +  +   S+A++     +   VP  + S      T R PF+PT STYN +LK
Sbjct: 420  WKGNKTLVSFGESHNSNAEEGGMDSVTTTVPKGISSSHIMSFTERFPFKPTTSTYNTLLK 479

Query: 1150 ACGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQILTSMRESGIQPDV 971
            ACGT+YYHAK L+ EMK  GL+PN ISWSILI++CGGS NV GA++IL +M ++G++PDV
Sbjct: 480  ACGTNYYHAKALINEMKTVGLSPNQISWSILINICGGSENVEGAIEILRTMIDAGVKPDV 539

Query: 970  VAYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRARSMYGSLQGVQQYLVI 791
            VAYTTAIK+CV  K    A  L+ EM+ Y+ +PN VTYNT+LRARS YGSL+ VQQ L I
Sbjct: 540  VAYTTAIKVCVESKNFTKALTLYEEMKSYETQPNLVTYNTLLRARSKYGSLREVQQCLAI 599

Query: 790  YQQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRF-ASHITDFGR-QSLLLEKVIEY 617
            YQ MRKAGYK NDYYL++LIE+WCEGVIQ   + E  F +S   +  R +SLLLEK+  +
Sbjct: 600  YQDMRKAGYKPNDYYLEELIEEWCEGVIQDNEEYEVEFSSSKKPEIERPESLLLEKIAAH 659

Query: 616  LQDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKDDLLIILG--------Q 461
            L    A+ L ID++GL+KVEAR+V+LAVLR IKE YA G+S+ DD+LII+G         
Sbjct: 660  LLKRVADILAIDVQGLSKVEARLVILAVLRMIKENYAFGHSVNDDILIIIGATKADESPA 719

Query: 460  EQHAEVGQAVIRLLQNDLSLHVIEGGLRAGEGKGMKGSSIVSSSLNLEEVDEKSRSAETS 281
            ++  EV +AVI+LL+N+L L  +    R             S S  L+   E +    T 
Sbjct: 720  KEILEVQEAVIKLLRNELGLEALPAKTRFAP----------SDSPKLQNTKENALPT-TM 768

Query: 280  ESPTRRPIILRRLKVTRESMHQWLQRK 200
               TRRP +L+RLKVT++S+H+WLQR+
Sbjct: 769  VFHTRRPAVLQRLKVTKQSLHRWLQRR 795


>ref|XP_006398740.1| hypothetical protein EUTSA_v10012661mg [Eutrema salsugineum]
            gi|557099830|gb|ESQ40193.1| hypothetical protein
            EUTSA_v10012661mg [Eutrema salsugineum]
          Length = 863

 Score =  346 bits (888), Expect = 1e-92
 Identities = 190/345 (55%), Positives = 240/345 (69%), Gaps = 15/345 (4%)
 Frame = -3

Query: 1189 FRPTISTYNIMLKACGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGAVQI 1010
            F+PT +TYNI+LKACGTDYY  K+LM+EM+  GL PN I+WS LID+CGGSG+V GAV I
Sbjct: 519  FKPTTATYNILLKACGTDYYRGKELMDEMRSLGLAPNQITWSTLIDICGGSGDVEGAVGI 578

Query: 1009 LTSMRESGIQPDVVAYTTAIK-----ICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTIL 845
            L +M  +G +PDVVAYTTAIK     IC  +K  KLAF LF EM +YQIKPN VTYNT+L
Sbjct: 579  LRTMHSAGTRPDVVAYTTAIKHAIFQICAENKSLKLAFSLFEEMRRYQIKPNWVTYNTLL 638

Query: 844  RARSMYGSLQGVQQYLVIYQQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFASHI 665
            +ARS YGSL  V+Q L IYQ MRKAGYK ND++LK+LIE+WCEGVIQ   Q++ + +   
Sbjct: 639  KARSKYGSLLEVRQCLAIYQDMRKAGYKPNDHFLKELIEEWCEGVIQENSQSQIKTSDQE 698

Query: 664  -TDFGRQ-SLLLEKVIEYLQDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSI 491
             T+ GR  SLL+EKV  +LQ+ TA +L IDL+GLTKVEAR+VVLAVLR IKE Y  G+ +
Sbjct: 699  GTNLGRPVSLLIEKVATHLQERTAGNLAIDLQGLTKVEARLVVLAVLRMIKEDYIRGDVV 758

Query: 490  KDDLLIILGQ--------EQHAEVGQAVIRLLQNDLSLHVIEGGLRAGEGKGMKGSSIVS 335
             DDLLIILG         +Q   V   +++LL+++LSL V+  G R      +    +  
Sbjct: 759  TDDLLIILGTGEANIDPGKQEIAVKDVLVQLLKDELSLVVLPAGHRHVLDITLDARCVDD 818

Query: 334  SSLNLEEVDEKSRSAETSESPTRRPIILRRLKVTRESMHQWLQRK 200
            +   +E   E ++S     S TRRP IL RL VT+ S+HQWLQRK
Sbjct: 819  ADQGIELTSENTKSI-VGISSTRRPAILERLMVTKASLHQWLQRK 862


>ref|XP_003554352.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Glycine max]
          Length = 811

 Score =  345 bits (885), Expect = 3e-92
 Identities = 185/349 (53%), Positives = 237/349 (67%), Gaps = 16/349 (4%)
 Frame = -3

Query: 1198 RVPFRPTISTYNIMLKACGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGA 1019
            R PF PT +TYNI+LKACGTDYYHAK L++EM+  GL+PN ISWSILID+CG S NV GA
Sbjct: 473  RFPFTPTTTTYNILLKACGTDYYHAKALIKEMETVGLSPNQISWSILIDICGASSNVEGA 532

Query: 1018 VQILTSMRESGIQPDVVAYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRA 839
            ++IL +M ++GI+PDV+AYTTAIK+CV  K    A  L+ EM+ YQI+PN VTYNT+L+A
Sbjct: 533  IEILKTMGDAGIKPDVIAYTTAIKVCVESKNFMQALTLYEEMKCYQIRPNWVTYNTLLKA 592

Query: 838  RSMYGSLQGVQQYLVIYQQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRFASHITD 659
            RS YG L  VQQ L IYQ MRKAGYK NDYYL++LIE+WCEGVIQ   + +G F+S    
Sbjct: 593  RSKYGFLHEVQQCLAIYQDMRKAGYKPNDYYLEELIEEWCEGVIQNNREKQGEFSSSNKS 652

Query: 658  FGR--QSLLLEKVIEYLQDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKD 485
                 QSLLLEK+  +L    A+ L ID++GLTKVEAR+VVLAVLR IKE Y  G+S+ D
Sbjct: 653  ESERPQSLLLEKIAAHLLKRVADILAIDVQGLTKVEARLVVLAVLRMIKENYGLGHSVND 712

Query: 484  DLLIILG--------QEQHAEVGQAVIRLLQNDLSLHVIEGGLRAGEGKGMKGSSIVSSS 329
            D+LII+G         +   EV +A+I+LL+N+L L V     R            +S +
Sbjct: 713  DILIIIGATKVDENPSKHILEVQEAIIKLLRNELGLEVFPAKTRLA----------LSDT 762

Query: 328  LNLEEVDEKSRSAETSES------PTRRPIILRRLKVTRESMHQWLQRK 200
             NLE  +  + S E           TRRP +L RLKVT++S+++WL RK
Sbjct: 763  ANLEYPNFSNLSIEAQPGENALGFQTRRPGVLVRLKVTKKSLYRWLHRK 811


>ref|XP_007162713.1| hypothetical protein PHAVU_001G174000g [Phaseolus vulgaris]
            gi|561036177|gb|ESW34707.1| hypothetical protein
            PHAVU_001G174000g [Phaseolus vulgaris]
          Length = 809

 Score =  343 bits (879), Expect = 1e-91
 Identities = 179/343 (52%), Positives = 239/343 (69%), Gaps = 10/343 (2%)
 Frame = -3

Query: 1198 RVPFRPTISTYNIMLKACGTDYYHAKDLMEEMKMFGLTPNHISWSILIDVCGGSGNVSGA 1019
            R PF PT +TYNI+LKACGTDYYHAK L++EM+  GL+PN ISWS LID+CG S NV GA
Sbjct: 467  RFPFTPTTTTYNILLKACGTDYYHAKALIKEMETVGLSPNQISWSTLIDICGASANVEGA 526

Query: 1018 VQILTSMRESGIQPDVVAYTTAIKICVNHKKTKLAFMLFAEMEKYQIKPNSVTYNTILRA 839
            ++IL +M ++GI+PDV+AYTTAIK+CV  K    A  L+ EM+ Y I+PN +TYNT+L+A
Sbjct: 527  IEILKNMGDAGIKPDVIAYTTAIKVCVESKNFMQALALYKEMKSYHIRPNLITYNTLLKA 586

Query: 838  RSMYGSLQGVQQYLVIYQQMRKAGYKRNDYYLKQLIEDWCEGVIQTEHQNEGRF-ASHIT 662
            RS YGSL  VQQ L IYQ MRKAGYK ND YL++LIE+WCEGVIQ   + +G F +S+ +
Sbjct: 587  RSKYGSLHEVQQCLAIYQDMRKAGYKPNDCYLEELIEEWCEGVIQDNREIQGEFSSSNKS 646

Query: 661  DFGR-QSLLLEKVIEYLQDSTAESLFIDLRGLTKVEARIVVLAVLRKIKEKYAAGNSIKD 485
            +  + QSLLLEK+  +L    A+ L ID++GLTKVEAR+VVLAVLR IKE Y+ G+SI D
Sbjct: 647  ELEKSQSLLLEKIAAHLLKRVADILAIDVQGLTKVEARLVVLAVLRMIKENYSLGHSIND 706

Query: 484  DLLIILG--------QEQHAEVGQAVIRLLQNDLSLHVIEGGLRAGEGKGMKGSSIVSSS 329
            D+LI++G         ++  EV +A+++LL+N+L L       R       K  +   ++
Sbjct: 707  DILIVIGATKVDENPAKRILEVQEAILKLLRNELGLEAFPARTRLALSDTPKLKNPTLAN 766

Query: 328  LNLEEVDEKSRSAETSESPTRRPIILRRLKVTRESMHQWLQRK 200
            L +E V  +     +    TRRP IL RLK+TR+S++ WL RK
Sbjct: 767  LKIEAVPAEDALPTSMGFQTRRPGILVRLKITRKSLYSWLHRK 809

