BLASTX nr result

ID: Catharanthus23_contig00006797 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00006797
         (2620 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006353112.1| PREDICTED: pentatricopeptide repeat-containi...  1063   0.0  
ref|XP_004251992.1| PREDICTED: pentatricopeptide repeat-containi...  1055   0.0  
emb|CBI32743.3| unnamed protein product [Vitis vinifera]             1053   0.0  
ref|XP_002276355.1| PREDICTED: pentatricopeptide repeat-containi...  1053   0.0  
gb|EXC31617.1| hypothetical protein L484_008414 [Morus notabilis]    1043   0.0  
gb|EOY04385.1| Tetratricopeptide repeat (TPR)-like superfamily p...  1037   0.0  
ref|XP_002530985.1| pentatricopeptide repeat-containing protein,...  1021   0.0  
ref|XP_004299746.1| PREDICTED: pentatricopeptide repeat-containi...  1006   0.0  
ref|XP_006481496.1| PREDICTED: pentatricopeptide repeat-containi...   993   0.0  
ref|XP_004163187.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   990   0.0  
ref|XP_004149878.1| PREDICTED: pentatricopeptide repeat-containi...   990   0.0  
ref|XP_006428766.1| hypothetical protein CICLE_v10011107mg [Citr...   989   0.0  
ref|XP_002315730.1| pentatricopeptide repeat-containing family p...   984   0.0  
gb|EMJ26432.1| hypothetical protein PRUPE_ppa001877mg [Prunus pe...   977   0.0  
ref|XP_006296196.1| hypothetical protein CARUB_v10025361mg [Caps...   973   0.0  
ref|XP_006410903.1| hypothetical protein EUTSA_v10017966mg [Eutr...   969   0.0  
ref|NP_181260.1| pentatricopeptide repeat-containing protein [Ar...   968   0.0  
ref|XP_002881498.1| pentatricopeptide repeat-containing protein ...   957   0.0  
ref|XP_003524868.1| PREDICTED: pentatricopeptide repeat-containi...   946   0.0  
ref|XP_003532699.1| PREDICTED: pentatricopeptide repeat-containi...   943   0.0  

>ref|XP_006353112.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            isoform X1 [Solanum tuberosum]
          Length = 731

 Score = 1063 bits (2748), Expect = 0.0
 Identities = 530/698 (75%), Positives = 605/698 (86%), Gaps = 3/698 (0%)
 Frame = +2

Query: 317  FQHADSTAGSERRPRGKH-QVSEKVEDVICRMMDNRAWTTRLQNSIRNLVPTFDHELVYN 493
            F +++S    +R P+G   +  EK+ED+ICRMM  RAWTTRLQNSIRN+VP+FDHELVYN
Sbjct: 33   FYNSESLNNHDRIPKGNSPKPQEKLEDLICRMMSTRAWTTRLQNSIRNIVPSFDHELVYN 92

Query: 494  VLHGAKNSEHALQFFRWVERSGLFQHNRETHLKIIEILGRASKLNHARCILLDMPKKGVE 673
            VLH AKNSEHALQFFRWVERSGLF+H+RETH KII+ILGRA KLNHARCILLDMP KGV+
Sbjct: 93   VLHSAKNSEHALQFFRWVERSGLFRHDRETHFKIIQILGRAEKLNHARCILLDMPNKGVD 152

Query: 674  WDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGVQRTIKSYDALFKVIMRRGRYMMAKRY 853
            WDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGV+RT+KSY+ALF VI RRGRYMMAKRY
Sbjct: 153  WDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGVERTVKSYNALFNVITRRGRYMMAKRY 212

Query: 854  FNKMLREGIEPTRHTYNLMIWGFFLSSKVETALRFFEDMKSREILPDVVTYNTMINGCHR 1033
            FNKM+ +GIEPT HTYNL+IWGFFLSSKV+TA+RFFEDMKS+ I+PDVVTYNTMING  R
Sbjct: 213  FNKMVNQGIEPTGHTYNLLIWGFFLSSKVDTAIRFFEDMKSKGIMPDVVTYNTMINGYIR 272

Query: 1034 IKKMDEAEKYFVEMKGRNIVPTVVSYTTMIKGYVSVGKVDDALRLLEEMKVSGIEPNAVT 1213
            +KK++EAEKYFVEMK RNI PTV+SYTT+IKGY +V ++DDA+RL EEMK  GI+PNA+T
Sbjct: 273  VKKIEEAEKYFVEMKARNIEPTVISYTTLIKGYSAVERIDDAVRLFEEMKSFGIKPNAIT 332

Query: 1214 YTTLLPGLCDAEKMSEARKILDEMVDKSIAPKDNAIFLRLMSGQCKAGDFDAAAKVLKAM 1393
            Y+TLLPGLCDA+KMSEA  IL EM DK IAPKDN+IF+RL+SGQC+AGD DAAA VLK M
Sbjct: 333  YSTLLPGLCDAQKMSEAGAILKEMEDKYIAPKDNSIFIRLISGQCEAGDLDAAADVLKTM 392

Query: 1394 IRLSIPTEAGHYGVLIENFCKAGLYDRAVNXXXXXXXXXXXXRPQSTLHLEPSAYNPMIE 1573
            IRLS+PTEAGHYGVLIENFCKAG+YDRAV             RPQS+  +EPSAYN +I+
Sbjct: 393  IRLSVPTEAGHYGVLIENFCKAGIYDRAVKFLDKLIEKEIVLRPQSSSSMEPSAYNLIID 452

Query: 1574 YLCSNGQTGKAETLLRQLMKIGVQDPVAFNHLICGHSKEGIPDLADELLKIMVRRKIPSE 1753
            YLC+NGQTGKAET  RQLMK GVQDP+AFN+L+CGHS+EG+PD A ELLKIM RRK+ S+
Sbjct: 453  YLCNNGQTGKAETFFRQLMKTGVQDPIAFNNLVCGHSREGVPDSAFELLKIMGRRKVLSD 512

Query: 1754 ESAHKSLIESYLSKGEPADAKMALDSMIESGHLPDSSLYRSVMESLFEDGRVQTASRVMK 1933
              AHKSL+ESYL K EPADAK ALD+M+E GH PDS LYRSVMESL  DGRVQTASRVMK
Sbjct: 513  GIAHKSLVESYLKKREPADAKAALDNMLEHGHDPDSLLYRSVMESLMGDGRVQTASRVMK 572

Query: 1934 TMLEKGVKEHMDLIAKILEALLLRGHVEEALGRIELVMHSGLAPDLDNILSVLCEKGKTI 2113
             MLEKGVKEHMDLI+ ILEALL+RGHVEEALGRIEL++H+ L+PDLD +LSVLCEKGKT 
Sbjct: 573  IMLEKGVKEHMDLISTILEALLMRGHVEEALGRIELLLHNSLSPDLDGLLSVLCEKGKTS 632

Query: 2114 AALKLLDFSLDRDFNIDFSTYDKVLDALLAAGKTLNAYSVLCKIMEKGGVTDKSSRENLI 2293
            AALKLLDF L+R+ NIDFS+YDKVLD+LLAAGKTLNAYS+LCK+ME GGV D  S E LI
Sbjct: 633  AALKLLDFILERNCNIDFSSYDKVLDSLLAAGKTLNAYSILCKMMENGGVKDHKSCEELI 692

Query: 2294 KALNQEGNTKQADILSRMIL--EKVSDNKKGKKQTKMA 2401
            K+LN EGNTKQADIL RMIL  E   D+KKGKK+T +A
Sbjct: 693  KSLNDEGNTKQADILRRMILGKETTLDSKKGKKKTPIA 730


>ref|XP_004251992.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            isoform 1 [Solanum lycopersicum]
            gi|460413221|ref|XP_004251993.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g37230-like isoform 2 [Solanum lycopersicum]
          Length = 731

 Score = 1055 bits (2729), Expect = 0.0
 Identities = 524/698 (75%), Positives = 602/698 (86%), Gaps = 3/698 (0%)
 Frame = +2

Query: 317  FQHADSTAGSERRPRGKH-QVSEKVEDVICRMMDNRAWTTRLQNSIRNLVPTFDHELVYN 493
            F + +S    ER P+G   +  EK+ED+ICRMM  RAWTTRLQNSIRN+VP+FDHELVYN
Sbjct: 33   FYNTESLNNHERIPKGNSPKPQEKLEDLICRMMSTRAWTTRLQNSIRNIVPSFDHELVYN 92

Query: 494  VLHGAKNSEHALQFFRWVERSGLFQHNRETHLKIIEILGRASKLNHARCILLDMPKKGVE 673
            VLH AKNSEHALQFFRWVERSGLF+H+RETH KII+ILGRA KLNHARCILLDMP KGV+
Sbjct: 93   VLHSAKNSEHALQFFRWVERSGLFRHDRETHFKIIQILGRAEKLNHARCILLDMPNKGVD 152

Query: 674  WDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGVQRTIKSYDALFKVIMRRGRYMMAKRY 853
            WDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGV+RT+KSY+ALF VI RRGRYMMAKRY
Sbjct: 153  WDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGVERTVKSYNALFNVITRRGRYMMAKRY 212

Query: 854  FNKMLREGIEPTRHTYNLMIWGFFLSSKVETALRFFEDMKSREILPDVVTYNTMINGCHR 1033
            FN+M+ +GIEPT HTYNL+IWGFFLSSKV+TA+RFFEDMK + I+PDVVTYNTMING + 
Sbjct: 213  FNRMVNQGIEPTGHTYNLLIWGFFLSSKVDTAIRFFEDMKGKGIMPDVVTYNTMINGYNC 272

Query: 1034 IKKMDEAEKYFVEMKGRNIVPTVVSYTTMIKGYVSVGKVDDALRLLEEMKVSGIEPNAVT 1213
            +KK++EAEKYFVEMK RNI P V+SYTT+IKGY +V ++DDAL+L EEMK  GI+PNA+T
Sbjct: 273  VKKIEEAEKYFVEMKARNIEPNVISYTTLIKGYSAVERIDDALKLFEEMKSFGIKPNAIT 332

Query: 1214 YTTLLPGLCDAEKMSEARKILDEMVDKSIAPKDNAIFLRLMSGQCKAGDFDAAAKVLKAM 1393
            Y+TLLPGLCDA+KMSEA  IL EM ++ IAPKDN+IF+RL+SGQC+AGD DAAA VLK M
Sbjct: 333  YSTLLPGLCDAQKMSEAGTILKEMEERYIAPKDNSIFIRLISGQCEAGDLDAAADVLKTM 392

Query: 1394 IRLSIPTEAGHYGVLIENFCKAGLYDRAVNXXXXXXXXXXXXRPQSTLHLEPSAYNPMIE 1573
            IRLS+PTEAGHYGVLIENFCKAG+YDRAV             RPQS+  +E SAYN +I+
Sbjct: 393  IRLSVPTEAGHYGVLIENFCKAGIYDRAVKFLDKLIEKEIVLRPQSSSSMETSAYNLIID 452

Query: 1574 YLCSNGQTGKAETLLRQLMKIGVQDPVAFNHLICGHSKEGIPDLADELLKIMVRRKIPSE 1753
            YLC+NGQTGKAETL RQLMK G+QDP+AFN+L+CGHS+EG+PD A ELLKIM RRK+ S+
Sbjct: 453  YLCNNGQTGKAETLFRQLMKTGIQDPIAFNNLVCGHSREGVPDSAFELLKIMGRRKVLSD 512

Query: 1754 ESAHKSLIESYLSKGEPADAKMALDSMIESGHLPDSSLYRSVMESLFEDGRVQTASRVMK 1933
              AHKSL+ESYL KGEPADAK ALD+M+E GH PDS LYRSVMESL  DGRVQTASRVMK
Sbjct: 513  SIAHKSLVESYLKKGEPADAKAALDNMLEHGHDPDSLLYRSVMESLMGDGRVQTASRVMK 572

Query: 1934 TMLEKGVKEHMDLIAKILEALLLRGHVEEALGRIELVMHSGLAPDLDNILSVLCEKGKTI 2113
             MLEKGVKEHMDLI+ ILEALL+RGHVEEA GRIEL++H+ L+PDLD +LSVLCEKGKT 
Sbjct: 573  IMLEKGVKEHMDLISTILEALLMRGHVEEAFGRIELLLHNSLSPDLDGLLSVLCEKGKTT 632

Query: 2114 AALKLLDFSLDRDFNIDFSTYDKVLDALLAAGKTLNAYSVLCKIMEKGGVTDKSSRENLI 2293
            AALKLLDF L+R+ NIDFS+YDKVLD+LLAAGKTLNAYS+LCK+ME GGV D  S E LI
Sbjct: 633  AALKLLDFILERNCNIDFSSYDKVLDSLLAAGKTLNAYSILCKMMENGGVKDHKSCEELI 692

Query: 2294 KALNQEGNTKQADILSRMIL--EKVSDNKKGKKQTKMA 2401
            K+LN EGNTKQADIL RMIL  E   D+KKGKK+T +A
Sbjct: 693  KSLNDEGNTKQADILRRMILGKETTLDSKKGKKKTPIA 730


>emb|CBI32743.3| unnamed protein product [Vitis vinifera]
          Length = 772

 Score = 1053 bits (2722), Expect = 0.0
 Identities = 540/763 (70%), Positives = 627/763 (82%), Gaps = 7/763 (0%)
 Frame = +2

Query: 134  MAFLSVSKSSHFNPNL--SKFSSPXXXXXXXXXXXXXXXEPTNPETR---PESEQSSNSR 298
            MA++SV+K   + P L  S  S+P                     T    PE+  S +  
Sbjct: 1    MAYISVTKLHQWKPRLFISGASNPSSLNFIQSFSSVDESISAGDLTSSPIPETPVSGSPS 60

Query: 299  ETTRHEFQHADSTAGSERRPRGKHQVSEKVEDVICRMMDNRAWTTRLQNSIRNLVPTFDH 478
            E        A   A S R PRGK +  EK+ED+ICRMM NRAWTTRLQNSIR+LVP FDH
Sbjct: 61   EPGNLTAAEAGEKA-SPRTPRGKLRNPEKIEDIICRMMANRAWTTRLQNSIRSLVPQFDH 119

Query: 479  ELVYNVLHGAKNSEHALQFFRWVERSGLFQHNRETHLKIIEILGRASKLNHARCILLDMP 658
             LV+NVLHG++NS+HALQFFRWVER+GLF+H+R+THLKIIEILGRASKLNHARCILLDMP
Sbjct: 120  SLVWNVLHGSRNSDHALQFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMP 179

Query: 659  KKGVEWDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGVQRTIKSYDALFKVIMRRGRYM 838
            KKGVEWDEDL+VL+IDSYGKAGIVQESVK+FQKM+ELGV+RTIKSYDALFKVI+RRGRYM
Sbjct: 180  KKGVEWDEDLFVLLIDSYGKAGIVQESVKVFQKMKELGVERTIKSYDALFKVILRRGRYM 239

Query: 839  MAKRYFNKMLREGIEPTRHTYNLMIWGFFLSSKVETALRFFEDMKSREILPDVVTYNTMI 1018
            MAKRYFN ML EG+ PT HTYN+MIWGFFLS KVETA RFFE+MK R I PDVVTYNTMI
Sbjct: 240  MAKRYFNAMLNEGVMPTCHTYNIMIWGFFLSLKVETANRFFEEMKERRISPDVVTYNTMI 299

Query: 1019 NGCHRIKKMDEAEKYFVEMKGRNIVPTVVSYTTMIKGYVSVGKVDDALRLLEEMKVSGIE 1198
            NG +RIKKM+EAEK+FVEMKGRNI PTV+SYTTMIKGYVSVG+VDD LRL EEMK  GI+
Sbjct: 300  NGYYRIKKMEEAEKFFVEMKGRNIEPTVISYTTMIKGYVSVGRVDDGLRLFEEMKSFGIK 359

Query: 1199 PNAVTYTTLLPGLCDAEKMSEARKILDEMVDKSIAPKDNAIFLRLMSGQCKAGDFDAAAK 1378
            PNAVTY+TLLPGLCD EKM EA+ ++ EMV++ IAPKDN+IF+RL++ QCKAG  DAAA 
Sbjct: 360  PNAVTYSTLLPGLCDGEKMLEAQNVVKEMVERYIAPKDNSIFMRLITCQCKAGQLDAAAD 419

Query: 1379 VLKAMIRLSIPTEAGHYGVLIENFCKAGLYDRAVNXXXXXXXXXXXXRPQSTLHLEPSAY 1558
            VLKAMIRLSIPTEAGHYGVLIENFCK+G+YDRAV             RPQ++L +E S Y
Sbjct: 420  VLKAMIRLSIPTEAGHYGVLIENFCKSGVYDRAVKLLDKLIEKEIILRPQNSLEMESSGY 479

Query: 1559 NPMIEYLCSNGQTGKAETLLRQLMKIGVQDPVAFNHLICGHSKEGIPDLADELLKIMVRR 1738
            N +IEYLC++GQT KAETL RQLMK GVQDP+AFN+LI GHSKEG P+ A E+LKIM RR
Sbjct: 480  NLIIEYLCNSGQTSKAETLFRQLMKKGVQDPIAFNNLIRGHSKEGAPESAFEILKIMGRR 539

Query: 1739 KIPSEESAHKSLIESYLSKGEPADAKMALDSMIESGHLPDSSLYRSVMESLFEDGRVQTA 1918
            ++P E  A++ LIES+L KGEPADAK ALD MIE+GH+PDSSL+RSVMESLFEDGR+QTA
Sbjct: 540  EVPREADAYRLLIESFLKKGEPADAKTALDGMIENGHIPDSSLFRSVMESLFEDGRIQTA 599

Query: 1919 SRVMKTMLEKGVKEHMDLIAKILEALLLRGHVEEALGRIELVMHSGLAPDLDNILSVLCE 2098
            SRVM  M+EKGVKE+MDL+AKILEALLLRGHVEEALGRI+L+M++G  PD D +LSVLC 
Sbjct: 600  SRVMNNMVEKGVKENMDLVAKILEALLLRGHVEEALGRIDLLMNNGCEPDFDGLLSVLCA 659

Query: 2099 KGKTIAALKLLDFSLDRDFNIDFSTYDKVLDALLAAGKTLNAYSVLCKIMEKGGVTDKSS 2278
            KGKTIAALKLLDF L+RD+NI FS+Y+ VLDALL AGKTLNAYS+LCKIM+KGG TD SS
Sbjct: 660  KGKTIAALKLLDFGLERDYNISFSSYENVLDALLTAGKTLNAYSILCKIMQKGGATDWSS 719

Query: 2279 RENLIKALNQEGNTKQADILSRMIL--EKVSDNKKGKKQTKMA 2401
             ++LI++LN+EGNTKQADILSRMI   EKV  +KKGKKQ  ++
Sbjct: 720  CKDLIRSLNEEGNTKQADILSRMIKGEEKVHGSKKGKKQASVS 762


>ref|XP_002276355.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230
            [Vitis vinifera]
          Length = 763

 Score = 1053 bits (2722), Expect = 0.0
 Identities = 540/764 (70%), Positives = 627/764 (82%), Gaps = 7/764 (0%)
 Frame = +2

Query: 134  MAFLSVSKSSHFNPNL--SKFSSPXXXXXXXXXXXXXXXEPTNPETR---PESEQSSNSR 298
            MA++SV+K   + P L  S  S+P                     T    PE+  S +  
Sbjct: 1    MAYISVTKLHQWKPRLFISGASNPSSLNFIQSFSSVDESISAGDLTSSPIPETPVSGSPS 60

Query: 299  ETTRHEFQHADSTAGSERRPRGKHQVSEKVEDVICRMMDNRAWTTRLQNSIRNLVPTFDH 478
            E        A   A S R PRGK +  EK+ED+ICRMM NRAWTTRLQNSIR+LVP FDH
Sbjct: 61   EPGNLTAAEAGEKA-SPRTPRGKLRNPEKIEDIICRMMANRAWTTRLQNSIRSLVPQFDH 119

Query: 479  ELVYNVLHGAKNSEHALQFFRWVERSGLFQHNRETHLKIIEILGRASKLNHARCILLDMP 658
             LV+NVLHG++NS+HALQFFRWVER+GLF+H+R+THLKIIEILGRASKLNHARCILLDMP
Sbjct: 120  SLVWNVLHGSRNSDHALQFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMP 179

Query: 659  KKGVEWDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGVQRTIKSYDALFKVIMRRGRYM 838
            KKGVEWDEDL+VL+IDSYGKAGIVQESVK+FQKM+ELGV+RTIKSYDALFKVI+RRGRYM
Sbjct: 180  KKGVEWDEDLFVLLIDSYGKAGIVQESVKVFQKMKELGVERTIKSYDALFKVILRRGRYM 239

Query: 839  MAKRYFNKMLREGIEPTRHTYNLMIWGFFLSSKVETALRFFEDMKSREILPDVVTYNTMI 1018
            MAKRYFN ML EG+ PT HTYN+MIWGFFLS KVETA RFFE+MK R I PDVVTYNTMI
Sbjct: 240  MAKRYFNAMLNEGVMPTCHTYNIMIWGFFLSLKVETANRFFEEMKERRISPDVVTYNTMI 299

Query: 1019 NGCHRIKKMDEAEKYFVEMKGRNIVPTVVSYTTMIKGYVSVGKVDDALRLLEEMKVSGIE 1198
            NG +RIKKM+EAEK+FVEMKGRNI PTV+SYTTMIKGYVSVG+VDD LRL EEMK  GI+
Sbjct: 300  NGYYRIKKMEEAEKFFVEMKGRNIEPTVISYTTMIKGYVSVGRVDDGLRLFEEMKSFGIK 359

Query: 1199 PNAVTYTTLLPGLCDAEKMSEARKILDEMVDKSIAPKDNAIFLRLMSGQCKAGDFDAAAK 1378
            PNAVTY+TLLPGLCD EKM EA+ ++ EMV++ IAPKDN+IF+RL++ QCKAG  DAAA 
Sbjct: 360  PNAVTYSTLLPGLCDGEKMLEAQNVVKEMVERYIAPKDNSIFMRLITCQCKAGQLDAAAD 419

Query: 1379 VLKAMIRLSIPTEAGHYGVLIENFCKAGLYDRAVNXXXXXXXXXXXXRPQSTLHLEPSAY 1558
            VLKAMIRLSIPTEAGHYGVLIENFCK+G+YDRAV             RPQ++L +E S Y
Sbjct: 420  VLKAMIRLSIPTEAGHYGVLIENFCKSGVYDRAVKLLDKLIEKEIILRPQNSLEMESSGY 479

Query: 1559 NPMIEYLCSNGQTGKAETLLRQLMKIGVQDPVAFNHLICGHSKEGIPDLADELLKIMVRR 1738
            N +IEYLC++GQT KAETL RQLMK GVQDP+AFN+LI GHSKEG P+ A E+LKIM RR
Sbjct: 480  NLIIEYLCNSGQTSKAETLFRQLMKKGVQDPIAFNNLIRGHSKEGAPESAFEILKIMGRR 539

Query: 1739 KIPSEESAHKSLIESYLSKGEPADAKMALDSMIESGHLPDSSLYRSVMESLFEDGRVQTA 1918
            ++P E  A++ LIES+L KGEPADAK ALD MIE+GH+PDSSL+RSVMESLFEDGR+QTA
Sbjct: 540  EVPREADAYRLLIESFLKKGEPADAKTALDGMIENGHIPDSSLFRSVMESLFEDGRIQTA 599

Query: 1919 SRVMKTMLEKGVKEHMDLIAKILEALLLRGHVEEALGRIELVMHSGLAPDLDNILSVLCE 2098
            SRVM  M+EKGVKE+MDL+AKILEALLLRGHVEEALGRI+L+M++G  PD D +LSVLC 
Sbjct: 600  SRVMNNMVEKGVKENMDLVAKILEALLLRGHVEEALGRIDLLMNNGCEPDFDGLLSVLCA 659

Query: 2099 KGKTIAALKLLDFSLDRDFNIDFSTYDKVLDALLAAGKTLNAYSVLCKIMEKGGVTDKSS 2278
            KGKTIAALKLLDF L+RD+NI FS+Y+ VLDALL AGKTLNAYS+LCKIM+KGG TD SS
Sbjct: 660  KGKTIAALKLLDFGLERDYNISFSSYENVLDALLTAGKTLNAYSILCKIMQKGGATDWSS 719

Query: 2279 RENLIKALNQEGNTKQADILSRMIL--EKVSDNKKGKKQTKMAA 2404
             ++LI++LN+EGNTKQADILSRMI   EKV  +KKGKKQ  + +
Sbjct: 720  CKDLIRSLNEEGNTKQADILSRMIKGEEKVHGSKKGKKQASVVS 763


>gb|EXC31617.1| hypothetical protein L484_008414 [Morus notabilis]
          Length = 768

 Score = 1043 bits (2698), Expect = 0.0
 Identities = 518/688 (75%), Positives = 600/688 (87%), Gaps = 2/688 (0%)
 Frame = +2

Query: 347  ERRPRGKHQVSEKVEDVICRMMDNRAWTTRLQNSIRNLVPTFDHELVYNVLHGAKNSEHA 526
            +R PRGK +  EK+ED+ICRMM NRAWTTRLQNSIR LVP FDH LV+NVLHGA+NS+HA
Sbjct: 81   QRTPRGKSRNPEKIEDIICRMMANRAWTTRLQNSIRRLVPQFDHSLVWNVLHGARNSDHA 140

Query: 527  LQFFRWVERSGLFQHNRETHLKIIEILGRASKLNHARCILLDMPKKGVEWDEDLWVLMID 706
            LQFFRWVERSGLF H+RETHLKIIEIL RASKLNHARCILLDMPKK V+WDEDL+VL ID
Sbjct: 141  LQFFRWVERSGLFNHDRETHLKIIEILTRASKLNHARCILLDMPKKSVQWDEDLFVLFID 200

Query: 707  SYGKAGIVQESVKLFQKMEELGVQRTIKSYDALFKVIMRRGRYMMAKRYFNKMLREGIEP 886
             YGKAGIVQESV++F KM+ELGV+R++KSYDALFKVI+RRGRYMMAKRYFN M+ EGIEP
Sbjct: 201  GYGKAGIVQESVRMFNKMKELGVERSVKSYDALFKVILRRGRYMMAKRYFNAMINEGIEP 260

Query: 887  TRHTYNLMIWGFFLSSKVETALRFFEDMKSREILPDVVTYNTMINGCHRIKKMDEAEKYF 1066
            T+HTYN+M+WGFFLS ++ETA RF+EDMK+R + PDVVTYNTMING +R K MDEAEK F
Sbjct: 261  TKHTYNIMLWGFFLSLRLETAKRFYEDMKNRGVWPDVVTYNTMINGYNRFKMMDEAEKMF 320

Query: 1067 VEMKGRNIVPTVVSYTTMIKGYVSVGKVDDALRLLEEMKVSGIEPNAVTYTTLLPGLCDA 1246
            VEMKGRNI PTV+SYTTMIKGYVS+G+VDD LRL EEMK  GI+PNAVTYTTLLPGLCDA
Sbjct: 321  VEMKGRNIAPTVISYTTMIKGYVSIGRVDDGLRLFEEMKSFGIKPNAVTYTTLLPGLCDA 380

Query: 1247 EKMSEARKILDEMVDKSIAPKDNAIFLRLMSGQCKAGDFDAAAKVLKAMIRLSIPTEAGH 1426
            EKMSEAR +L EMVD+ IAPKDN+IFLRL+S QCK GD DAAA VLKAMIRLSIPTEAGH
Sbjct: 381  EKMSEARTMLKEMVDRYIAPKDNSIFLRLLSSQCKVGDLDAAADVLKAMIRLSIPTEAGH 440

Query: 1427 YGVLIENFCKAGLYDRAVNXXXXXXXXXXXXRPQSTLHLEPSAYNPMIEYLCSNGQTGKA 1606
            YG+LIENFCKA +YDRAV             RPQS+  +E SAYN MI++LC++GQTGKA
Sbjct: 441  YGILIENFCKAAVYDRAVKLLDKLIEKEIVLRPQSSTEMEASAYNAMIQFLCNHGQTGKA 500

Query: 1607 ETLLRQLMKIGVQDPVAFNHLICGHSKEGIPDLADELLKIMVRRKIPSEESAHKSLIESY 1786
            E   RQLMK GVQDPVAFN+LI GHSKEG PD A E+LKIM RR +  +  +++ LI+SY
Sbjct: 501  EIFFRQLMKKGVQDPVAFNNLIRGHSKEGNPDSAFEILKIMGRRGVARDADSYRLLIKSY 560

Query: 1787 LSKGEPADAKMALDSMIESGHLPDSSLYRSVMESLFEDGRVQTASRVMKTMLEKGVKEHM 1966
            LSKGEPADAK ALDSMIE+ HLP+SSL+RSVMESL+EDGR QTASRVMK+M+EKGVKE+M
Sbjct: 561  LSKGEPADAKTALDSMIENDHLPESSLFRSVMESLYEDGRAQTASRVMKSMIEKGVKENM 620

Query: 1967 DLIAKILEALLLRGHVEEALGRIELVMHSGLAPDLDNILSVLCEKGKTIAALKLLDFSLD 2146
            DL+AKILEALL+RGHVEEALGRI+L+M SG AP+ D++LSVLCEKGKTIAALKLLDF L+
Sbjct: 621  DLVAKILEALLVRGHVEEALGRIDLLMQSGCAPNFDSLLSVLCEKGKTIAALKLLDFCLE 680

Query: 2147 RDFNIDFSTYDKVLDALLAAGKTLNAYSVLCKIMEKGGVTDKSSRENLIKALNQEGNTKQ 2326
            RD+ +DFS+YDKVLDALLAAGKTLNAYS+LCKIM KGGVTD S  E+LIK+LN+EGNTKQ
Sbjct: 681  RDYVVDFSSYDKVLDALLAAGKTLNAYSILCKIMGKGGVTDWSGCEDLIKSLNKEGNTKQ 740

Query: 2327 ADILSRMIL--EKVSDNKKGKKQTKMAA 2404
            ADI+SRMI   ++ S ++KGK++  ++A
Sbjct: 741  ADIISRMIKGGQEASGSRKGKRKASLSA 768


>gb|EOY04385.1| Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma
            cacao]
          Length = 743

 Score = 1037 bits (2681), Expect = 0.0
 Identities = 533/760 (70%), Positives = 617/760 (81%), Gaps = 3/760 (0%)
 Frame = +2

Query: 134  MAFLSVSKSSHFNPNL-SKFSSPXXXXXXXXXXXXXXXEPTNPETRPESEQSSNSRETTR 310
            MAF+SVSK+    P    + S+P               E  N   + E E+    R +  
Sbjct: 1    MAFMSVSKTYKLKPRFYHRISNPLHFFTTSQDPSTASQELNNAPPQQEGEKVVTQRTS-- 58

Query: 311  HEFQHADSTAGSERRPRGKHQVSEKVEDVICRMMDNRAWTTRLQNSIRNLVPTFDHELVY 490
                           PRGK +  EKVEDVICRMM+NRAWTTRLQNSIR LVP FDH LVY
Sbjct: 59   ---------------PRGKTRNPEKVEDVICRMMENRAWTTRLQNSIRALVPEFDHALVY 103

Query: 491  NVLHGAKNSEHALQFFRWVERSGLFQHNRETHLKIIEILGRASKLNHARCILLDMPKKGV 670
            NVLHGAKNSE ALQFFRWVER+GL +H+RE H+KII+ILGRASKLNHARCILLDMPKKGV
Sbjct: 104  NVLHGAKNSEQALQFFRWVERAGLIRHDREAHMKIIQILGRASKLNHARCILLDMPKKGV 163

Query: 671  EWDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGVQRTIKSYDALFKVIMRRGRYMMAKR 850
            EWDEDL+V++IDSYGKAGIVQE+VK+FQKM ELGV+RTIKSYDA FKVI+RRGRYMMAKR
Sbjct: 164  EWDEDLFVVLIDSYGKAGIVQEAVKIFQKMNELGVERTIKSYDAFFKVILRRGRYMMAKR 223

Query: 851  YFNKMLREGIEPTRHTYNLMIWGFFLSSKVETALRFFEDMKSREILPDVVTYNTMINGCH 1030
            YFNKML EGI PTRHTYN+M+WGFFLS +++TA RF+EDMK+R I PDVVTYNTMING  
Sbjct: 224  YFNKMLSEGIVPTRHTYNIMLWGFFLSLRLDTANRFYEDMKTRGISPDVVTYNTMINGYS 283

Query: 1031 RIKKMDEAEKYFVEMKGRNIVPTVVSYTTMIKGYVSVGKVDDALRLLEEMKVSGIEPNAV 1210
            R KKM+EAEK FVEMKG+N+ PTV+SYTTMIKGYV+V +VDD LRLLEEMK  GI+PNA 
Sbjct: 284  RFKKMEEAEKLFVEMKGKNLAPTVISYTTMIKGYVAVEQVDDGLRLLEEMKSFGIKPNAT 343

Query: 1211 TYTTLLPGLCDAEKMSEARKILDEMVDKSIAPKDNAIFLRLMSGQCKAGDFDAAAKVLKA 1390
            TY+TLLPGLCDA KM+EA+ IL EMV+  IAPKDN+IF+ L++ QCK+GD DAAA VLKA
Sbjct: 344  TYSTLLPGLCDAGKMTEAKSILKEMVEWYIAPKDNSIFINLLNSQCKSGDLDAAADVLKA 403

Query: 1391 MIRLSIPTEAGHYGVLIENFCKAGLYDRAVNXXXXXXXXXXXXRPQSTLHLEPSAYNPMI 1570
            MIRLSIPTEAGHYGVLIENFCKA L+DRA+             RPQ++L +E SAYN MI
Sbjct: 404  MIRLSIPTEAGHYGVLIENFCKANLFDRAIKLLDKLVEKEIILRPQNSLDMEASAYNAMI 463

Query: 1571 EYLCSNGQTGKAETLLRQLMKIGVQDPVAFNHLICGHSKEGIPDLADELLKIMVRRKIPS 1750
            +YLC +GQTGKAE   RQLMK GV DP AFN+LI GH+KEG P LA E+LKIM RR +P 
Sbjct: 464  QYLCHHGQTGKAEVFFRQLMKKGVLDPTAFNNLIRGHAKEGNPGLAFEILKIMGRRGVPK 523

Query: 1751 EESAHKSLIESYLSKGEPADAKMALDSMIESGHLPDSSLYRSVMESLFEDGRVQTASRVM 1930
            +  A+K LIESYL KGEPADAK +LDSMIE G LP+S +++SVMESLFEDGR+QTASRVM
Sbjct: 524  DADAYKLLIESYLRKGEPADAKTSLDSMIEDGLLPESGIFKSVMESLFEDGRIQTASRVM 583

Query: 1931 KTMLEKGVKEHMDLIAKILEALLLRGHVEEALGRIELVMHSGLAPDLDNILSVLCEKGKT 2110
            K+M+EKGVKEHMDL+AKILEALL+RGHVEEALGRIEL+M +G AP+LD++LSVL EKGKT
Sbjct: 584  KSMVEKGVKEHMDLVAKILEALLMRGHVEEALGRIELLMQNGCAPNLDSLLSVLSEKGKT 643

Query: 2111 IAALKLLDFSLDRDFNIDFSTYDKVLDALLAAGKTLNAYSVLCKIMEKGGVTDKSSRENL 2290
            IAALKLLDF L+RD +IDFS+Y+KVLDALLAAGKTLNAYS+LCKIMEKGG+T+ SS E+L
Sbjct: 644  IAALKLLDFGLERDCSIDFSSYEKVLDALLAAGKTLNAYSILCKIMEKGGITNWSSLEDL 703

Query: 2291 IKALNQEGNTKQADILSRMIL--EKVSDNKKGKKQTKMAA 2404
            IK+LNQEGNTKQADILSRMI   E  S +KKGKKQ  +A+
Sbjct: 704  IKSLNQEGNTKQADILSRMIKGGEAASGSKKGKKQATVAS 743


>ref|XP_002530985.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223529437|gb|EEF31397.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 753

 Score = 1021 bits (2639), Expect = 0.0
 Identities = 516/720 (71%), Positives = 601/720 (83%), Gaps = 8/720 (1%)
 Frame = +2

Query: 269  PESEQSSNSRETTRHEFQHA------DSTAGSERRPRGKHQVSEKVEDVICRMMDNRAWT 430
            P   Q SN +  T ++   A      + T   +R PRGK    EKVED I RMM NR WT
Sbjct: 34   PSVTQISNPQSETLNDAAAAAAATQENQTQTYQRIPRGKRPDPEKVEDTISRMMANRPWT 93

Query: 431  TRLQNSIRNLVPTFDHELVYNVLHGAKNSEHALQFFRWVERSGLFQHNRETHLKIIEILG 610
            TRLQNSIRNLVP FDH LVYNVLH A+NSEHALQFFRWVER+GLF+++R+TH+KIIEILG
Sbjct: 94   TRLQNSIRNLVPHFDHSLVYNVLHAARNSEHALQFFRWVERAGLFKNDRDTHMKIIEILG 153

Query: 611  RASKLNHARCILLDMPKKGVEWDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGVQRTIK 790
            RASKLNHARCILLDMPKKGVEWDE ++V++I+SYGKAGIVQE+VK+F KM ELGV+R+IK
Sbjct: 154  RASKLNHARCILLDMPKKGVEWDEYMFVVLIESYGKAGIVQEAVKIFNKMNELGVERSIK 213

Query: 791  SYDALFKVIMRRGRYMMAKRYFNKMLREGIEPTRHTYNLMIWGFFLSSKVETALRFFEDM 970
            SYDALFKVI+RRGRYMMAKR FNKML +GI+PTRHTYN+M+WGFFLS ++ETA+RF++DM
Sbjct: 214  SYDALFKVILRRGRYMMAKRVFNKMLNDGIQPTRHTYNIMLWGFFLSLRLETAMRFYDDM 273

Query: 971  KSREILPDVVTYNTMINGCHRIKKMDEAEKYFVEMKGRNIVPTVVSYTTMIKGYVSVGKV 1150
            K+R I PDVVTYNTMING +R KKM+EAEK FVEMKG+NI PTV+SYTTMIKGYV+V +V
Sbjct: 274  KNRGISPDVVTYNTMINGFYRFKKMEEAEKLFVEMKGKNIAPTVISYTTMIKGYVAVDRV 333

Query: 1151 DDALRLLEEMKVSGIEPNAVTYTTLLPGLCDAEKMSEARKILDEMVDKSIAPKDNAIFLR 1330
            DD LRLLEEMK   I+PN  TY+TLLPGLCDA KM+EA+ IL EMV + +APKDN+IFLR
Sbjct: 334  DDGLRLLEEMKSFNIKPNVHTYSTLLPGLCDAWKMTEAKDILIEMVARHLAPKDNSIFLR 393

Query: 1331 LMSGQCKAGDFDAAAKVLKAMIRLSIPTEAGHYGVLIENFCKAGLYDRAVNXXXXXXXXX 1510
            L+S QCKAGD  AA  VL  M+RL IPTEAGHYGVLIENFCKA  YDRAV          
Sbjct: 394  LLSCQCKAGDLRAAEDVLNTMMRLHIPTEAGHYGVLIENFCKAEEYDRAVKYLDKLIEKE 453

Query: 1511 XXXRPQSTLHLEPSAYNPMIEYLCSNGQTGKAETLLRQLMKIGVQDPVAFNHLICGHSKE 1690
               RPQSTL +E +AYNPMI+YLCS+GQTGKAE   RQLMK GVQDP+AFN+LICGH+KE
Sbjct: 454  IILRPQSTLEIESNAYNPMIQYLCSHGQTGKAEIFFRQLMKKGVQDPLAFNNLICGHAKE 513

Query: 1691 GIPDLADELLKIMVRRKIPSEESAHKSLIESYLSKGEPADAKMALDSMIESGHLPDSSLY 1870
            G PD A E+ KIM +R +P +  A++ +IESYL KGEPADAK ALD M+E GH+PD S++
Sbjct: 514  GYPDSAFEIFKIMGKRGVPRDADAYRLIIESYLRKGEPADAKTALDGMLEDGHVPDPSVF 573

Query: 1871 RSVMESLFEDGRVQTASRVMKTMLEKGVKEHMDLIAKILEALLLRGHVEEALGRIELVMH 2050
            RSVMESLFEDGRVQTASRVMK+M+EKGVKE+MDL+ KILEALL+RGHVEEALGRIEL+M 
Sbjct: 574  RSVMESLFEDGRVQTASRVMKSMVEKGVKENMDLVGKILEALLMRGHVEEALGRIELLMQ 633

Query: 2051 SGLAPDLDNILSVLCEKGKTIAALKLLDFSLDRDFNIDFSTYDKVLDALLAAGKTLNAYS 2230
            SG   + D++LSVL EKGKTIAALKLLDF+L+RDFN+DF +YDKVLDALLAAGKTLNAYS
Sbjct: 634  SGFHVNFDDLLSVLSEKGKTIAALKLLDFALERDFNLDFKSYDKVLDALLAAGKTLNAYS 693

Query: 2231 VLCKIMEKGGVTDKSSRENLIKALNQEGNTKQADILSRMIL--EKVSDNKKGKKQTKMAA 2404
            +LCKIM+KGGV+D SS ++LIK+LNQEGNTKQADILSRMI   EK  +NKKGKKQ   AA
Sbjct: 694  ILCKIMQKGGVSDWSSSKDLIKSLNQEGNTKQADILSRMIKGGEKSHENKKGKKQASFAA 753


>ref|XP_004299746.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            [Fragaria vesca subsp. vesca]
          Length = 763

 Score = 1006 bits (2602), Expect = 0.0
 Identities = 516/764 (67%), Positives = 610/764 (79%), Gaps = 7/764 (0%)
 Frame = +2

Query: 134  MAFLSVSKSSHFNPNLSKFSSPXXXXXXXXXXXXXXXEPTNPETRPESEQSSNSRETTRH 313
            MAF+S+SK S + P LS   S                +P +    P +E  + S    ++
Sbjct: 1    MAFISLSKPSQWRPRLSNPQS-LPLLRLFCSTETPSPQPGSASDAPPAETPTGSPPDPQN 59

Query: 314  EFQHADSTAGSERRPRGKH----QVSEKVEDVICRMMDNRAWTTRLQNSIRNLVPTFDHE 481
                A S     + P+ +     +  EK ED+ICRMM NRAWTTRLQNSIR+LVP FDH 
Sbjct: 60   GSAAAASAPPPPQTPKPRQLRRARNPEKTEDIICRMMANRAWTTRLQNSIRDLVPEFDHN 119

Query: 482  LVYNVLHGAKNSEHALQFFRWVERSGLFQHNRETHLKIIEILGRASKLNHARCILLDMPK 661
            LV+NVLHGAK S+ ALQFFRWVERS LFQH+RETHLKIIEILGRASKLNHARCILLDMPK
Sbjct: 120  LVWNVLHGAKTSDQALQFFRWVERSRLFQHDRETHLKIIEILGRASKLNHARCILLDMPK 179

Query: 662  KGVEWDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGVQRTIKSYDALFKVIMRRGRYMM 841
            KGV+WDEDL++ +IDSYGKAGIVQESVKLF +M+ELGV+R++KSY+ALFK I+RRGRYMM
Sbjct: 180  KGVQWDEDLFIHLIDSYGKAGIVQESVKLFNQMKELGVERSLKSYEALFKSILRRGRYMM 239

Query: 842  AKRYFNKMLREGIEPTRHTYNLMIWGFFLSSKVETALRFFEDMKSREILPDVVTYNTMIN 1021
             KRYFN ML EGIEPTRHTYN+MIWGFFLS ++ETA RFFEDMK+R + PDVVTYNTMIN
Sbjct: 240  GKRYFNHMLAEGIEPTRHTYNIMIWGFFLSLRLETAKRFFEDMKTRGLSPDVVTYNTMIN 299

Query: 1022 GCHRIKKMDEAEKYFVEMKGRNIVPTVVSYTTMIKGYVSVGKVDDALRLLEEMKVSGIEP 1201
            G +R K MDEAE+ FVE+KG+NI P V+SYTTMIKGYVSVGKVDD  RL +EMK  GI+P
Sbjct: 300  GYNRFKMMDEAEQLFVELKGKNIQPNVISYTTMIKGYVSVGKVDDGYRLFQEMKSFGIKP 359

Query: 1202 NAVTYTTLLPGLCDAEKMSEARKILDEMVDKSIAPKDNAIFLRLMSGQCKAGDFDAAAKV 1381
            N VT++TLLPGLCDAEK  EA+ +L EMV++ IAPKDN++F +L+  QCK+GD DAAA V
Sbjct: 360  NDVTFSTLLPGLCDAEKKDEAQNLLSEMVERHIAPKDNSVFEKLLYCQCKSGDLDAAANV 419

Query: 1382 LKAMIRLSIPTEAGHYGVLIENFCKAGLYDRAVNXXXXXXXXXXXXRPQSTLHLEPSAYN 1561
            LKAMIRL IPTEAGHYG+LIENFCKAG+YDRAV+            R QS++ LE SAYN
Sbjct: 420  LKAMIRLHIPTEAGHYGILIENFCKAGVYDRAVHLLDRLIEKEIIMRSQSSMELEASAYN 479

Query: 1562 PMIEYLCSNGQTGKAETLLRQLMKIGVQDPVAFNHLICGHSKEGIPDLADELLKIMVRRK 1741
            PMIEYLC +GQT KAE L RQLMK GVQD VAFN+LI GH+KEG  D A E+LKIM RR 
Sbjct: 480  PMIEYLCDHGQTDKAEVLFRQLMKKGVQDSVAFNNLIRGHAKEGNSDSAFEILKIMGRRG 539

Query: 1742 IPSEESAHKSLIESYLSKGEPADAKMALDSMIESGHLPDSSLYRSVMESLFEDGRVQTAS 1921
            +P E  ++K LI+SYLSKGEPADAK ALDSMIE+GH+P+SSL+RSVMESLFEDGRVQTAS
Sbjct: 540  VPREADSYKLLIKSYLSKGEPADAKTALDSMIENGHVPESSLFRSVMESLFEDGRVQTAS 599

Query: 1922 RVMKTMLEKGVKEHMDLIAKILEALLLRGHVEEALGRIELVMHSGLAPDLDNILSVLCEK 2101
            R+MK+M+EKGV E+MDL+AKILEAL +RGHVEEALGRI+L+M SG AP+ D++LSVL EK
Sbjct: 600  RIMKSMVEKGVNENMDLVAKILEALFIRGHVEEALGRIDLLMQSGCAPEFDSLLSVLAEK 659

Query: 2102 GKTIAALKLLDFSLDRDFNIDFSTYDKVLDALLAAGKTLNAYSVLCKIMEKGGVTDKSSR 2281
            GKTIAA+KLLDF L+RD  +DF +YDKVLDALL +GKTLNAYS+LCKIM+KGGVTD  S 
Sbjct: 660  GKTIAAVKLLDFCLERDCMVDFKSYDKVLDALLESGKTLNAYSILCKIMDKGGVTDWRST 719

Query: 2282 ENLIKALNQEGNTKQADILSRMIL---EKVSDNKKGKKQTKMAA 2404
            ++LIK+LN EGNTKQAD+LSR I    +    +KKGKKQ  MA+
Sbjct: 720  DDLIKSLNLEGNTKQADVLSRKIKGGEDMAGQSKKGKKQVSMAS 763


>ref|XP_006481496.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            [Citrus sinensis]
          Length = 751

 Score =  993 bits (2566), Expect = 0.0
 Identities = 499/712 (70%), Positives = 589/712 (82%), Gaps = 1/712 (0%)
 Frame = +2

Query: 272  ESEQSSNSRETTRHEFQHADSTAGSERRPRGKHQVSEKVEDVICRMMDNRAWTTRLQNSI 451
            + +Q+ +S       FQ  +  +  +R PRG H+   K+ED IC++M  RAWTTRLQN I
Sbjct: 40   DQQQTQDSPAPNPDPFQADEEPSQRQRIPRGNHRSPVKLEDTICKLMAERAWTTRLQNKI 99

Query: 452  RNLVPTFDHELVYNVLHGAKNSEHALQFFRWVERSGLFQHNRETHLKIIEILGRASKLNH 631
            R LVP FDH LVYNVLHGAKNSEHALQFFRWVER+GLF H+RETHLK+IEILGR  KLNH
Sbjct: 100  RALVPQFDHNLVYNVLHGAKNSEHALQFFRWVERAGLFNHDRETHLKMIEILGRVGKLNH 159

Query: 632  ARCILLDMPKKGVEWDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGVQRTIKSYDALFK 811
            ARCILLDMPKKGV+WDED++ ++I+SYGK GIVQESVK+F  M++LGV+R++KSYDALFK
Sbjct: 160  ARCILLDMPKKGVQWDEDMFEVLIESYGKKGIVQESVKIFDIMKQLGVERSVKSYDALFK 219

Query: 812  VIMRRGRYMMAKRYFNKMLREGIEPTRHTYNLMIWGFFLSSKVETALRFFEDMKSREILP 991
            +I+RRGRYMMAKRYFNKML EGIEPTRHTYN+M+WGFFLS K+ETA+RFFEDMKSR I P
Sbjct: 220  LILRRGRYMMAKRYFNKMLSEGIEPTRHTYNVMLWGFFLSLKLETAIRFFEDMKSRGISP 279

Query: 992  DVVTYNTMINGCHRIKKMDEAEKYFVEMKGRNIVPTVVSYTTMIKGYVSVGKVDDALRLL 1171
            DVVTYNTMING +R KKMDEAEK F EMK +NI PTV+SYTTMIKGYV+V + DDALR+ 
Sbjct: 280  DVVTYNTMINGYNRFKKMDEAEKLFAEMKEKNIEPTVISYTTMIKGYVAVERADDALRIF 339

Query: 1172 EEMKVSGIEPNAVTYTTLLPGLCDAEKMSEARKILDEMVDKSIAPKDNAIFLRLMSGQCK 1351
            +EMK   ++PNAVTYT LLPGLCDA KM E +K+L EMV++ I PKDN++F++L+  QCK
Sbjct: 340  DEMKSFDVKPNAVTYTALLPGLCDAGKMVEVQKVLREMVERYIPPKDNSVFMKLLGVQCK 399

Query: 1352 AGDFDAAAKVLKAMIRLSIPTEAGHYGVLIENFCKAGLYDRAVNXXXXXXXXXXXXRPQS 1531
            +G  +AAA VLKAMIRLSIPTEAGHYG+LIENFCKA +YDRA+             RPQS
Sbjct: 400  SGHLNAAADVLKAMIRLSIPTEAGHYGILIENFCKAEMYDRAIKLLDKLVEKEIILRPQS 459

Query: 1532 TLHLEPSAYNPMIEYLCSNGQTGKAETLLRQLMKIGVQDPVAFNHLICGHSKEGIPDLAD 1711
            TL +E S+YNPMI++LC NGQTGKAE   RQLMK GV DPVAFN+LI GHSKEG PD A 
Sbjct: 460  TLDMEASSYNPMIQHLCHNGQTGKAEIFFRQLMKKGVLDPVAFNNLIRGHSKEGNPDSAF 519

Query: 1712 ELLKIMVRRKIPSEESAHKSLIESYLSKGEPADAKMALDSMIESGHLPDSSLYRSVMESL 1891
            E++KIM RR +P +  A+  LIESYL KGEPADAK ALDSMIE GH P SSL+RSVMESL
Sbjct: 520  EIVKIMGRRGVPRDADAYICLIESYLRKGEPADAKTALDSMIEDGHSPASSLFRSVMESL 579

Query: 1892 FEDGRVQTASRVMKTMLEKGVKEHMDLIAKILEALLLRGHVEEALGRIELVMHSGLAPDL 2071
            FEDGRVQTASRVMK+M+EKGVKE++DL+AKILEALL+RGHVEEALGRI+L+M SG  P+ 
Sbjct: 580  FEDGRVQTASRVMKSMVEKGVKENLDLVAKILEALLMRGHVEEALGRIDLMMQSGSVPNF 639

Query: 2072 DNILSVLCEKGKTIAALKLLDFSLDRDFNIDFSTYDKVLDALLAAGKTLNAYSVLCKIME 2251
            D++LSVL EKGKTIAA+KLLDF L RD  ID ++Y+KVLDALLAAGKTLNAYS+L KIME
Sbjct: 640  DSLLSVLSEKGKTIAAVKLLDFCLGRDCIIDLASYEKVLDALLAAGKTLNAYSILFKIME 699

Query: 2252 KGGVTDKSSRENLIKALNQEGNTKQADILSRMILEKVS-DNKKGKKQTKMAA 2404
            KGGVTD  S + LI  LNQEGNTKQADILSRMI  ++S  ++K KKQ+ +A+
Sbjct: 700  KGGVTDWKSSDKLIAGLNQEGNTKQADILSRMIRGEMSRGSQKEKKQSAVAS 751


>ref|XP_004163187.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At2g37230-like [Cucumis sativus]
          Length = 760

 Score =  990 bits (2559), Expect = 0.0
 Identities = 508/765 (66%), Positives = 604/765 (78%)
 Frame = +2

Query: 107  LHFSLYKRKMAFLSVSKSSHFNPNLSKFSSPXXXXXXXXXXXXXXXEPTNPETRPESEQS 286
            LHF+ Y R ++  S+SK +  N +L  FSS                 P +P    ++   
Sbjct: 9    LHFTHY-RVLSSSSISKPTALN-SLHFFSSTQEPISTATQNG----SPNDPSASSDAALP 62

Query: 287  SNSRETTRHEFQHADSTAGSERRPRGKHQVSEKVEDVICRMMDNRAWTTRLQNSIRNLVP 466
                    +  Q         R PRG+ +  EK+E +IC+MM NR WTTRLQNSIR+LVP
Sbjct: 63   QTGESAAVNGVQQVKG-----RIPRGRPRDPEKLEXIICKMMANREWTTRLQNSIRSLVP 117

Query: 467  TFDHELVYNVLHGAKNSEHALQFFRWVERSGLFQHNRETHLKIIEILGRASKLNHARCIL 646
             FDH LVYNVLH AK SEHAL FFRWVER+GLFQH+RETH KIIEILGRASKLNHARCIL
Sbjct: 118  QFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCIL 177

Query: 647  LDMPKKGVEWDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGVQRTIKSYDALFKVIMRR 826
            LDMP KGV+WDEDL+V++I+SYGKAGIVQE+VK+FQKM+ELGV+R++KSYDALFK IMRR
Sbjct: 178  LDMPNKGVQWDEDLFVVLIESYGKAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRR 237

Query: 827  GRYMMAKRYFNKMLREGIEPTRHTYNLMIWGFFLSSKVETALRFFEDMKSREILPDVVTY 1006
            GRYMMAKRYFN ML EGIEP RHTYN+M+WGFFLS ++ETA RF+EDMKSR I PDVVTY
Sbjct: 238  GRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTY 297

Query: 1007 NTMINGCHRIKKMDEAEKYFVEMKGRNIVPTVVSYTTMIKGYVSVGKVDDALRLLEEMKV 1186
            NTMING  R K M+EAE++F EMKG+NI PTV+SYTTMIKGYVSV + DDALRL EEMK 
Sbjct: 298  NTMINGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKA 357

Query: 1187 SGIEPNAVTYTTLLPGLCDAEKMSEARKILDEMVDKSIAPKDNAIFLRLMSGQCKAGDFD 1366
            +G +PN +TY+TLLPGLCDAEK+ EARKIL EMV +  APKDN+IF+RL+S QCK GD D
Sbjct: 358  AGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDNSIFMRLLSCQCKHGDLD 417

Query: 1367 AAAKVLKAMIRLSIPTEAGHYGVLIENFCKAGLYDRAVNXXXXXXXXXXXXRPQSTLHLE 1546
            AA  VLKAMIRLSIPTEAGHYG+LIEN CKAG+YD+AV             RPQSTL +E
Sbjct: 418  AAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEME 477

Query: 1547 PSAYNPMIEYLCSNGQTGKAETLLRQLMKIGVQDPVAFNHLICGHSKEGIPDLADELLKI 1726
             SAYN +I+YLC++GQTGKA+T  RQL+K G+QD VAFN+LI GH+KEG PDLA E+LKI
Sbjct: 478  ASAYNLIIQYLCNHGQTGKADTFFRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKI 537

Query: 1727 MVRRKIPSEESAHKSLIESYLSKGEPADAKMALDSMIESGHLPDSSLYRSVMESLFEDGR 1906
            M RR +  +  ++K LI+SYLSKGEPADAK ALDSMIE+GH PDS+L+RSVMESLF DGR
Sbjct: 538  MGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGR 597

Query: 1907 VQTASRVMKTMLEKGVKEHMDLIAKILEALLLRGHVEEALGRIELVMHSGLAPDLDNILS 2086
            VQTASRVM +ML+KG+ E++DL+AKILEAL +RGH EEALGRI L+M+    PD +++LS
Sbjct: 598  VQTASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLS 657

Query: 2087 VLCEKGKTIAALKLLDFSLDRDFNIDFSTYDKVLDALLAAGKTLNAYSVLCKIMEKGGVT 2266
            VLCEKGKT +A KLLDF L+R+ NI+FS+Y+KVLDALL AGKTLNAY++LCKIMEKGG  
Sbjct: 658  VLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKTLNAYAILCKIMEKGGAK 717

Query: 2267 DKSSRENLIKALNQEGNTKQADILSRMILEKVSDNKKGKKQTKMA 2401
            D SS ++LIK+LNQEGNTKQADILSRMI  K  D K+ KK +  A
Sbjct: 718  DWSSCDDLIKSLNQEGNTKQADILSRMI--KGGDRKRSKKPSLAA 760


>ref|XP_004149878.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            [Cucumis sativus]
          Length = 760

 Score =  990 bits (2559), Expect = 0.0
 Identities = 508/765 (66%), Positives = 604/765 (78%)
 Frame = +2

Query: 107  LHFSLYKRKMAFLSVSKSSHFNPNLSKFSSPXXXXXXXXXXXXXXXEPTNPETRPESEQS 286
            LHF+ Y R ++  S+SK +  N +L  FSS                 P +P    ++   
Sbjct: 9    LHFTHY-RVLSSSSISKPTALN-SLHFFSSTQEPISTATQNG----SPNDPSASSDAALP 62

Query: 287  SNSRETTRHEFQHADSTAGSERRPRGKHQVSEKVEDVICRMMDNRAWTTRLQNSIRNLVP 466
                    +  Q         R PRG+ +  EK+E +IC+MM NR WTTRLQNSIR+LVP
Sbjct: 63   QTGESAAVNGVQQVKG-----RIPRGRPRDPEKLEKIICKMMANREWTTRLQNSIRSLVP 117

Query: 467  TFDHELVYNVLHGAKNSEHALQFFRWVERSGLFQHNRETHLKIIEILGRASKLNHARCIL 646
             FDH LVYNVLH AK SEHAL FFRWVER+GLFQH+RETH KIIEILGRASKLNHARCIL
Sbjct: 118  QFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCIL 177

Query: 647  LDMPKKGVEWDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGVQRTIKSYDALFKVIMRR 826
            LDMP KGV+WDEDL+V++I+SYGKAGIVQE+VK+FQKM+ELGV+R++KSYDALFK IMRR
Sbjct: 178  LDMPNKGVQWDEDLFVVLIESYGKAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRR 237

Query: 827  GRYMMAKRYFNKMLREGIEPTRHTYNLMIWGFFLSSKVETALRFFEDMKSREILPDVVTY 1006
            GRYMMAKRYFN ML EGIEP RHTYN+M+WGFFLS ++ETA RF+EDMKSR I PDVVTY
Sbjct: 238  GRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTY 297

Query: 1007 NTMINGCHRIKKMDEAEKYFVEMKGRNIVPTVVSYTTMIKGYVSVGKVDDALRLLEEMKV 1186
            NTMING  R K M+EAE++F EMKG+NI PTV+SYTTMIKGYVSV + DDALRL EEMK 
Sbjct: 298  NTMINGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKA 357

Query: 1187 SGIEPNAVTYTTLLPGLCDAEKMSEARKILDEMVDKSIAPKDNAIFLRLMSGQCKAGDFD 1366
            +G +PN +TY+TLLPGLCDAEK+ EARKIL EMV +  APKDN+IF+RL+S QCK GD D
Sbjct: 358  AGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDNSIFMRLLSCQCKHGDLD 417

Query: 1367 AAAKVLKAMIRLSIPTEAGHYGVLIENFCKAGLYDRAVNXXXXXXXXXXXXRPQSTLHLE 1546
            AA  VLKAMIRLSIPTEAGHYG+LIEN CKAG+YD+AV             RPQSTL +E
Sbjct: 418  AAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEME 477

Query: 1547 PSAYNPMIEYLCSNGQTGKAETLLRQLMKIGVQDPVAFNHLICGHSKEGIPDLADELLKI 1726
             SAYN +I+YLC++GQTGKA+T  RQL+K G+QD VAFN+LI GH+KEG PDLA E+LKI
Sbjct: 478  ASAYNLIIQYLCNHGQTGKADTFFRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKI 537

Query: 1727 MVRRKIPSEESAHKSLIESYLSKGEPADAKMALDSMIESGHLPDSSLYRSVMESLFEDGR 1906
            M RR +  +  ++K LI+SYLSKGEPADAK ALDSMIE+GH PDS+L+RSVMESLF DGR
Sbjct: 538  MGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGR 597

Query: 1907 VQTASRVMKTMLEKGVKEHMDLIAKILEALLLRGHVEEALGRIELVMHSGLAPDLDNILS 2086
            VQTASRVM +ML+KG+ E++DL+AKILEAL +RGH EEALGRI L+M+    PD +++LS
Sbjct: 598  VQTASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLS 657

Query: 2087 VLCEKGKTIAALKLLDFSLDRDFNIDFSTYDKVLDALLAAGKTLNAYSVLCKIMEKGGVT 2266
            VLCEKGKT +A KLLDF L+R+ NI+FS+Y+KVLDALL AGKTLNAY++LCKIMEKGG  
Sbjct: 658  VLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKTLNAYAILCKIMEKGGAK 717

Query: 2267 DKSSRENLIKALNQEGNTKQADILSRMILEKVSDNKKGKKQTKMA 2401
            D SS ++LIK+LNQEGNTKQADILSRMI  K  D K+ KK +  A
Sbjct: 718  DWSSCDDLIKSLNQEGNTKQADILSRMI--KGGDRKRSKKPSLAA 760


>ref|XP_006428766.1| hypothetical protein CICLE_v10011107mg [Citrus clementina]
            gi|557530823|gb|ESR42006.1| hypothetical protein
            CICLE_v10011107mg [Citrus clementina]
          Length = 787

 Score =  989 bits (2558), Expect = 0.0
 Identities = 499/712 (70%), Positives = 588/712 (82%), Gaps = 1/712 (0%)
 Frame = +2

Query: 272  ESEQSSNSRETTRHEFQHADSTAGSERRPRGKHQVSEKVEDVICRMMDNRAWTTRLQNSI 451
            + +Q+ +S       FQ  +  +  +R PRG H+   K+ED IC++M  RAWTTRLQN I
Sbjct: 76   DQQQTQDSPAPNPDPFQADEEPSQRQRIPRGNHRSPVKLEDTICKLMAERAWTTRLQNKI 135

Query: 452  RNLVPTFDHELVYNVLHGAKNSEHALQFFRWVERSGLFQHNRETHLKIIEILGRASKLNH 631
            R LVP FDH LVYNVLHGAKNSEHALQFFRWVER+GLF H+RETHLK+IEILGR  KLNH
Sbjct: 136  RALVPQFDHNLVYNVLHGAKNSEHALQFFRWVERAGLFNHDRETHLKMIEILGRVGKLNH 195

Query: 632  ARCILLDMPKKGVEWDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGVQRTIKSYDALFK 811
            ARCILLDMPKKGV+WDEDL+ ++I+SYGK GIVQESVK+F  M++LGV+R++KSYDALFK
Sbjct: 196  ARCILLDMPKKGVQWDEDLFEVLIESYGKKGIVQESVKIFDIMKQLGVERSVKSYDALFK 255

Query: 812  VIMRRGRYMMAKRYFNKMLREGIEPTRHTYNLMIWGFFLSSKVETALRFFEDMKSREILP 991
            +I+RRGRYMMAKRYFNKML EGIEPTRHTYN+M+WGFFLS K+ETA+RFFEDMKSR I P
Sbjct: 256  LILRRGRYMMAKRYFNKMLSEGIEPTRHTYNVMLWGFFLSLKLETAIRFFEDMKSRGISP 315

Query: 992  DVVTYNTMINGCHRIKKMDEAEKYFVEMKGRNIVPTVVSYTTMIKGYVSVGKVDDALRLL 1171
            DVVTYNTMING +R KKMDEAEK F EMK +NI PTV+SYTTMIKGYV+V + DDALR+ 
Sbjct: 316  DVVTYNTMINGYNRFKKMDEAEKLFAEMKEKNIEPTVISYTTMIKGYVAVERADDALRIF 375

Query: 1172 EEMKVSGIEPNAVTYTTLLPGLCDAEKMSEARKILDEMVDKSIAPKDNAIFLRLMSGQCK 1351
            +EMK   ++PNAVTYT LLPGLCDA KM E +K+L EMV++ I PKDN++F++L+  QCK
Sbjct: 376  DEMKSFDVKPNAVTYTALLPGLCDAGKMVEVQKVLREMVERYIPPKDNSVFMKLLDVQCK 435

Query: 1352 AGDFDAAAKVLKAMIRLSIPTEAGHYGVLIENFCKAGLYDRAVNXXXXXXXXXXXXRPQS 1531
            +G  +AAA VLKAMIRLSIPTEAGHYG+LIENFCKA +YDRA+             RPQS
Sbjct: 436  SGHLNAAADVLKAMIRLSIPTEAGHYGILIENFCKAEMYDRAIKLLDKLVEKEIILRPQS 495

Query: 1532 TLHLEPSAYNPMIEYLCSNGQTGKAETLLRQLMKIGVQDPVAFNHLICGHSKEGIPDLAD 1711
            TL +E S+YN MI++LC NGQTGKAE   RQLMK GV DPVAFN+LI GHSKEG PD A 
Sbjct: 496  TLDMEASSYNLMIQHLCHNGQTGKAEIFFRQLMKKGVLDPVAFNNLIRGHSKEGNPDSAF 555

Query: 1712 ELLKIMVRRKIPSEESAHKSLIESYLSKGEPADAKMALDSMIESGHLPDSSLYRSVMESL 1891
            E++KIM RR +P +  A+  LIESYL KGEPADAK ALDSMIE GH P SSL+RSVMESL
Sbjct: 556  EIVKIMGRRGVPRDADAYICLIESYLRKGEPADAKTALDSMIEDGHSPASSLFRSVMESL 615

Query: 1892 FEDGRVQTASRVMKTMLEKGVKEHMDLIAKILEALLLRGHVEEALGRIELVMHSGLAPDL 2071
            FEDGRVQTASRVMK+M+EKGVKE++DL+AKILEALL+RGHVEEALGRI+L+M SG  P+ 
Sbjct: 616  FEDGRVQTASRVMKSMVEKGVKENLDLVAKILEALLMRGHVEEALGRIDLMMQSGSVPNF 675

Query: 2072 DNILSVLCEKGKTIAALKLLDFSLDRDFNIDFSTYDKVLDALLAAGKTLNAYSVLCKIME 2251
            D++LSVL EKGKTIAA+KLLDF L RD  ID ++Y+KVLDALLAAGKTLNAYS+L KIME
Sbjct: 676  DSLLSVLSEKGKTIAAVKLLDFCLGRDCIIDLASYEKVLDALLAAGKTLNAYSILFKIME 735

Query: 2252 KGGVTDKSSRENLIKALNQEGNTKQADILSRMILEKVS-DNKKGKKQTKMAA 2404
            KGGVTD  S + LI  LNQEGNTKQADILSRMI  ++S  ++K KKQ+ +A+
Sbjct: 736  KGGVTDWKSSDKLIAGLNQEGNTKQADILSRMIRGEMSRGSQKEKKQSAVAS 787


>ref|XP_002315730.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|222864770|gb|EEF01901.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 760

 Score =  984 bits (2544), Expect = 0.0
 Identities = 502/725 (69%), Positives = 595/725 (82%), Gaps = 5/725 (0%)
 Frame = +2

Query: 245  EPTNPETRPESEQSSNSRETTRHEFQHADSTAGSERRPRGK--HQVSEKVEDVICRMMDN 418
            +P +P   PE+  S   +   + E  +       +R PR K  H+  EK+ED+ICRMM N
Sbjct: 38   DPISPN--PETTASPGPKPDPKTETPNVAQEKQYQRIPRAKQQHRSPEKLEDIICRMMAN 95

Query: 419  RAWTTRLQNSIRNLVPTFDHELVYNVLHGAKNSEHALQFFRWVERSGLFQHNRETHLKII 598
            R WTTRLQNSIR LVP FDH LVYNVLHGA+  +HALQFFRWVER+GL QH+RETH+KII
Sbjct: 96   RDWTTRLQNSIRALVPEFDHSLVYNVLHGARKPDHALQFFRWVERAGLIQHDRETHMKII 155

Query: 599  EILGRASKLNHARCILL-DMPKKGVEWDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGV 775
            +ILGR S LNHARCI+L DMPKKG E DED++VL+IDSYGKAGIVQESVK+F KM+ELGV
Sbjct: 156  QILGRYSMLNHARCIVLEDMPKKGFELDEDMFVLLIDSYGKAGIVQESVKMFSKMKELGV 215

Query: 776  QRTIKSYDALFKVIMRRGRYMMAKRYFNKMLREGIEPTRHTYNLMIWGFFLSSKVETALR 955
            +R++KSY+ALFKVI+R+GRYMMAKR+FNKML EGI PTRHTYN++IWGFFLS ++ TA+R
Sbjct: 216  ERSVKSYNALFKVIVRKGRYMMAKRFFNKMLDEGIGPTRHTYNVLIWGFFLSMRLRTAVR 275

Query: 956  FFEDMKSREILPDVVTYNTMINGCHRIKKMDEAEKYFVEMKGRNIVPTVVSYTTMIKGYV 1135
            F+EDMK R I PDVVTYNTMING +R K+M+EAEK F EMK ++I PTV+SYTTMIKGY 
Sbjct: 276  FYEDMKVRGISPDVVTYNTMINGYYRHKRMEEAEKLFAEMKAKDIAPTVISYTTMIKGYF 335

Query: 1136 SVGKVDDALRLLEEMKVSGIEPNAVTYTTLLPGLCDAEKMSEARKILDEMVDKSIAPKDN 1315
            +V +++D LRLLEEMK  GI+PN VTYTTLLP LCDA KM+EA+ IL EMV + IAPKDN
Sbjct: 336  AVDRINDGLRLLEEMKSVGIKPNNVTYTTLLPDLCDAGKMTEAKDILKEMVRRRIAPKDN 395

Query: 1316 AIFLRLMSGQCKAGDFDAAAKVLKAMIRLSIPTEAGHYGVLIENFCKAGLYDRAVNXXXX 1495
            +IFL+L++ QCKAGD  AA  VL  MI+LSIP+EAGHYGVLIENFCKA  YD+AV     
Sbjct: 396  SIFLKLLNSQCKAGDLKAAVDVLDGMIKLSIPSEAGHYGVLIENFCKAEEYDQAVKFVDK 455

Query: 1496 XXXXXXXXRPQSTLHLEPSAYNPMIEYLCSNGQTGKAETLLRQLMKIGVQDPVAFNHLIC 1675
                    RPQSTL +E  AYNP+I+YLCS+GQTGKAE L RQL+K GV+DP+AFN+LIC
Sbjct: 456  LIENDIILRPQSTLEMESGAYNPVIQYLCSHGQTGKAEILFRQLLKKGVEDPLAFNNLIC 515

Query: 1676 GHSKEGIPDLADELLKIMVRRKIPSEESAHKSLIESYLSKGEPADAKMALDSMIESGHLP 1855
            GH+KEG PD A E+LKIM R+ IP +  A++ LIESYL KGEPADAK ALDSMIE GHLP
Sbjct: 516  GHAKEGTPDSAFEILKIMGRKGIPRDADAYRLLIESYLRKGEPADAKTALDSMIEDGHLP 575

Query: 1856 DSSLYRSVMESLFEDGRVQTASRVMKTMLEKGVKEHMDLIAKILEALLLRGHVEEALGRI 2035
            DSS++RSVMESL+EDGRVQTASRVMK+M+EKGVKE+MDL+AKILEALL+RGH EEALGRI
Sbjct: 576  DSSVFRSVMESLYEDGRVQTASRVMKSMVEKGVKENMDLVAKILEALLMRGHEEEALGRI 635

Query: 2036 ELVMHSGLAPDLDNILSVLCEKGKTIAALKLLDFSLDRDFNIDFSTYDKVLDALLAAGKT 2215
            +L+M S    + D++LS+L EKGKTIAALKLLDF L RD +IDF +YDKVLDALLAAGKT
Sbjct: 636  DLLMSSQCNVNFDSLLSILSEKGKTIAALKLLDFGLQRDCDIDFKSYDKVLDALLAAGKT 695

Query: 2216 LNAYSVLCKIMEKGGVTDKSSRENLIKALNQEGNTKQADILSRMIL--EKVSDNKKGKKQ 2389
            LNAYS+LCKIMEKGGVT   S E+LIK+LNQEGNTKQADILSRMI   +K  +NKKGKK+
Sbjct: 696  LNAYSILCKIMEKGGVTSWRSYEDLIKSLNQEGNTKQADILSRMIKGDDKSHENKKGKKK 755

Query: 2390 TKMAA 2404
              +AA
Sbjct: 756  ASVAA 760


>gb|EMJ26432.1| hypothetical protein PRUPE_ppa001877mg [Prunus persica]
          Length = 749

 Score =  977 bits (2526), Expect = 0.0
 Identities = 510/762 (66%), Positives = 603/762 (79%), Gaps = 5/762 (0%)
 Frame = +2

Query: 134  MAFLSVSKSSHFNPNLSKFSSPXXXXXXXXXXXXXXXEPTNPETRPESEQSSNSRETTRH 313
            MA++S+SK   + P   + S+P                  + E   E+    +   T  H
Sbjct: 1    MAYISLSKPFQWRP---RPSNPQTLTLFRLFSSTEAATGASTEAPTETPNPQDGSVTPTH 57

Query: 314  EFQHADSTAGSERRPRGKHQVSEKVEDVICRMMDNRAWTTRLQNSIRNLVPTFDHELVYN 493
                        R+ R ++  +EK+ED+ICRMM NR WTTRLQNSIRNLVP FDH LV+N
Sbjct: 58   --------VPKARQHRTRN--AEKIEDIICRMMANRVWTTRLQNSIRNLVPEFDHNLVWN 107

Query: 494  VLHGAKNSEHALQFFRWVERSGLFQHNRETHLKIIEILGRASKLNHARCILLDMPKKGVE 673
            VLHGA++ EHALQFFRWVERSGLF+H+RETHLKIIEIL R SKLNHARCILLDMPKKGV+
Sbjct: 108  VLHGARSWEHALQFFRWVERSGLFKHDRETHLKIIEILSRNSKLNHARCILLDMPKKGVQ 167

Query: 674  WDEDLWVLMIDSYGKAG---IVQESVKLFQKMEELGVQRTIKSYDALFKVIMRRGRYMMA 844
             DEDL++ +ID YGK+    I+QESVKLF KM+ELGV+R++KSY+AL+K I+R GR MMA
Sbjct: 168  LDEDLFIGLIDGYGKSDKGCIIQESVKLFIKMKELGVERSLKSYEALYKAILRWGRCMMA 227

Query: 845  KRYFNKMLREGIEPTRHTYNLMIWGFFLSSKVETALRFFEDMKSREILPDVVTYNTMING 1024
            KRYFN ML EGIEPTRHTYN+MIWGF  S K+ETA RFFEDMKSR I PD+VTYNTMI+G
Sbjct: 228  KRYFNAMLSEGIEPTRHTYNVMIWGFLKSRKLETAKRFFEDMKSRGISPDLVTYNTMIHG 287

Query: 1025 CHRIKKMDEAEKYFVEMKGRNIVPTVVSYTTMIKGYVSVGKVDDALRLLEEMKVSGIEPN 1204
              R+ KMDE+E+ FVE+KGRNI P V+SYTTMIKGYVSVG+VDD LRL  EMK  GI PN
Sbjct: 288  YIRVDKMDESEQLFVELKGRNIEPNVISYTTMIKGYVSVGRVDDGLRLFGEMKSFGIRPN 347

Query: 1205 AVTYTTLLPGLCDAEKMSEARKILDEMVDKSIAPKDNAIFLRLMSGQCKAGDFDAAAKVL 1384
            AVT++TLLPGLCDAEK   A K+L EMV K IAP DN+IF RL+S QCK+GD DAAA VL
Sbjct: 348  AVTFSTLLPGLCDAEKKDAAHKVLMEMVSKYIAPIDNSIFERLLSLQCKSGDMDAAAYVL 407

Query: 1385 KAMIRLSIPTEAGHYGVLIENFCKAGLYDRAVNXXXXXXXXXXXXRPQSTLHLEPSAYNP 1564
            KAMIRL IPTEAGHYG+LIENFCKAG+YD+AV             RPQ+++ LEPSA+NP
Sbjct: 408  KAMIRLRIPTEAGHYGILIENFCKAGVYDQAVKLLDKLIEKEIILRPQNSIELEPSAFNP 467

Query: 1565 MIEYLCSNGQTGKAETLLRQLMKIGVQDPVAFNHLICGHSKEGIPDLADELLKIMVRRKI 1744
            MIEYLC++GQTGKAE   RQLMK GV+D VAFN+L+ GH+KEG  D A E+L+IM RR I
Sbjct: 468  MIEYLCNHGQTGKAEAFFRQLMKKGVEDSVAFNNLLRGHAKEGNSDSAFEILRIMNRRGI 527

Query: 1745 PSEESAHKSLIESYLSKGEPADAKMALDSMIESGHLPDSSLYRSVMESLFEDGRVQTASR 1924
            P E  ++  LI+SYLSKGEPADAK ALDSMIE GH+P+SSL+RSV+ESLFEDGRVQTASR
Sbjct: 528  PGEADSYILLIKSYLSKGEPADAKTALDSMIEGGHIPESSLFRSVIESLFEDGRVQTASR 587

Query: 1925 VMKTMLEKGVKEHMDLIAKILEALLLRGHVEEALGRIELVMHSGLAPDLDNILSVLCEKG 2104
            VMK+M+EKGV E+MDL+AKILEAL +RGHVEEALGRI+L+M SG A   D++LSVL +KG
Sbjct: 588  VMKSMVEKGVMENMDLVAKILEALFMRGHVEEALGRIDLLMQSGCALQFDSLLSVLADKG 647

Query: 2105 KTIAALKLLDFSLDRDFNIDFSTYDKVLDALLAAGKTLNAYSVLCKIMEKGGVTDKSSRE 2284
            KTIAALKLLDF L+RD ++DFS+YDKVLDALLA+GKTLNAYS+LCK+MEKGG+TD SS E
Sbjct: 648  KTIAALKLLDFCLERDCSVDFSSYDKVLDALLASGKTLNAYSILCKLMEKGGITDWSSTE 707

Query: 2285 NLIKALNQEGNTKQADILSRMIL--EKVSDNKKGKKQTKMAA 2404
            +LIK+LNQEGNTKQADILSRMI   EK S  KKGKKQ  +A+
Sbjct: 708  DLIKSLNQEGNTKQADILSRMIKGGEKSSQGKKGKKQASLAS 749


>ref|XP_006296196.1| hypothetical protein CARUB_v10025361mg [Capsella rubella]
            gi|482564904|gb|EOA29094.1| hypothetical protein
            CARUB_v10025361mg [Capsella rubella]
          Length = 757

 Score =  973 bits (2516), Expect = 0.0
 Identities = 490/701 (69%), Positives = 580/701 (82%), Gaps = 2/701 (0%)
 Frame = +2

Query: 254  NPETRPESEQSSNSRETTRHEFQHADSTAGSERRPRGKHQVSEKVEDVICRMMDNRAWTT 433
            NPET     QS++++  T++     ++    ER  RGK Q  EK+ED ICRMMDNRAWTT
Sbjct: 48   NPET-----QSADAKPETKNLGSSTETRPLRERFQRGKRQNHEKLEDTICRMMDNRAWTT 102

Query: 434  RLQNSIRNLVPTFDHELVYNVLHGAKNSEHALQFFRWVERSGLFQHNRETHLKIIEILGR 613
            RLQNSIR+LVP +DH LVYNVLHGAK  EHALQFFRW ERSGL +H+R+TH+K+I++LG 
Sbjct: 103  RLQNSIRDLVPEWDHSLVYNVLHGAKKLEHALQFFRWTERSGLIRHDRDTHMKMIKMLGE 162

Query: 614  ASKLNHARCILLDMPKKGVEWDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGVQRTIKS 793
              K+N+ARCILLDMP+KGV WDED++V++I+SYGKAGIVQESVK+FQKM++LGV+RTIKS
Sbjct: 163  VQKVNYARCILLDMPEKGVPWDEDMFVVLIESYGKAGIVQESVKIFQKMKDLGVERTIKS 222

Query: 794  YDALFKVIMRRGRYMMAKRYFNKMLREGIEPTRHTYNLMIWGFFLSSKVETALRFFEDMK 973
            Y+ LFKVIMRRGRYMMAKRYFNKM+ EG+EPTRHTYNLM+WGFFLS ++ETALRFFEDMK
Sbjct: 223  YNTLFKVIMRRGRYMMAKRYFNKMVSEGVEPTRHTYNLMLWGFFLSLRLETALRFFEDMK 282

Query: 974  SREILPDVVTYNTMINGCHRIKKMDEAEKYFVEMKGRNIVPTVVSYTTMIKGYVSVGKVD 1153
            +R I PD VTYNTMING  R KKMDEAEK FVEMKG NI P+VVSYTTMIKGY+SV +VD
Sbjct: 283  TRGISPDAVTYNTMINGYCRFKKMDEAEKLFVEMKGNNIEPSVVSYTTMIKGYLSVDRVD 342

Query: 1154 DALRLLEEMKVSGIEPNAVTYTTLLPGLCDAEKMSEARKILDEMVDKSIAPKDNAIFLRL 1333
            D LR+ EEM+ SGIEPNA TY+T+LPGLCDA KM EA+ IL  M+ K IAPKDN+IFL+L
Sbjct: 343  DGLRIFEEMRSSGIEPNATTYSTVLPGLCDAGKMVEAKNILKNMMAKHIAPKDNSIFLKL 402

Query: 1334 MSGQCKAGDFDAAAKVLKAMIRLSIPTEAGHYGVLIENFCKAGLYDRAVNXXXXXXXXXX 1513
            +  Q KAGD  AA +VLKAM  L++P EAGHYGVLIEN CKA  Y+RA+           
Sbjct: 403  LVSQSKAGDMAAATEVLKAMATLNVPAEAGHYGVLIENQCKANAYNRAIKLLDTLLEKEI 462

Query: 1514 XXRPQSTLHLEPSAYNPMIEYLCSNGQTGKAETLLRQLMKIGVQDPVAFNHLICGHSKEG 1693
              R Q TL +EPSAYNP+IEYLC+NGQT KAE L RQLMK GVQD  A N+LI GH+KEG
Sbjct: 463  ILRHQDTLEMEPSAYNPIIEYLCNNGQTSKAEVLFRQLMKRGVQDQDALNNLISGHAKEG 522

Query: 1694 IPDLADELLKIMVRRKIPSEESAHKSLIESYLSKGEPADAKMALDSMIESGHLPDSSLYR 1873
             PD + E+LKIM RR +P E +A++ LI+SY+SKGEP DAK ALDSM+E GH+PDSSL+R
Sbjct: 523  NPDSSYEILKIMSRRGVPREANAYELLIKSYMSKGEPGDAKTALDSMVEDGHVPDSSLFR 582

Query: 1874 SVMESLFEDGRVQTASRVMKTMLEK--GVKEHMDLIAKILEALLLRGHVEEALGRIELVM 2047
            SV+ESLFEDGRVQTASRVM  M++K  G++E+MDLIAKILEALL+RGHVEEALGRI+L+ 
Sbjct: 583  SVIESLFEDGRVQTASRVMMIMIDKNVGIEENMDLIAKILEALLMRGHVEEALGRIDLLN 642

Query: 2048 HSGLAPDLDNILSVLCEKGKTIAALKLLDFSLDRDFNIDFSTYDKVLDALLAAGKTLNAY 2227
             +G A DLD++LSVL EKGKTIAALKLLDF L+RD ++DFS+Y+KVLDALL AGKTLNAY
Sbjct: 643  QNGHAADLDSLLSVLSEKGKTIAALKLLDFGLERDLSLDFSSYEKVLDALLGAGKTLNAY 702

Query: 2228 SVLCKIMEKGGVTDKSSRENLIKALNQEGNTKQADILSRMI 2350
            SVLCKIMEKG  TD  S + LIK+LNQEGNTKQAD+LSRMI
Sbjct: 703  SVLCKIMEKGSATDWKSSDELIKSLNQEGNTKQADVLSRMI 743


>ref|XP_006410903.1| hypothetical protein EUTSA_v10017966mg [Eutrema salsugineum]
            gi|557112072|gb|ESQ52356.1| hypothetical protein
            EUTSA_v10017966mg [Eutrema salsugineum]
          Length = 761

 Score =  969 bits (2506), Expect = 0.0
 Identities = 491/722 (68%), Positives = 591/722 (81%), Gaps = 3/722 (0%)
 Frame = +2

Query: 245  EPTNPETRPESEQSSNSR-ETTRHEFQHADSTAGSERRPRGKHQVSEKVEDVICRMMDNR 421
            E  NP   P+++ +   + ETT       +     ER  RGK Q  EK+ED ICRMMDNR
Sbjct: 40   ETQNPVANPQTQSADAVKPETTNLGSIRPEGRPLRERFQRGKRQNHEKLEDTICRMMDNR 99

Query: 422  AWTTRLQNSIRNLVPTFDHELVYNVLHGAKNSEHALQFFRWVERSGLFQHNRETHLKIIE 601
             WTTRLQNSIR+LVP +DH LVYNVLHGA+  +HALQFFRW ERSGL +H+R+TH+K+IE
Sbjct: 100  EWTTRLQNSIRDLVPEWDHSLVYNVLHGARKLDHALQFFRWSERSGLIRHDRDTHMKMIE 159

Query: 602  ILGRASKLNHARCILLDMPKKGVEWDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGVQR 781
            +LG+ASKLNHARCILLDMP+KG+ WDED++V++I+SYGKAGIVQESVK+FQKM++LGV+R
Sbjct: 160  MLGQASKLNHARCILLDMPEKGIPWDEDMFVVLIESYGKAGIVQESVKIFQKMKDLGVER 219

Query: 782  TIKSYDALFKVIMRRGRYMMAKRYFNKMLREGIEPTRHTYNLMIWGFFLSSKVETALRFF 961
            TIKSYD LFKVI+RRGRYMMAKRYFNKM+ EGIEPTRHTYNLM+WGFFLS ++ETALRF+
Sbjct: 220  TIKSYDTLFKVILRRGRYMMAKRYFNKMVSEGIEPTRHTYNLMLWGFFLSLRLETALRFY 279

Query: 962  EDMKSREILPDVVTYNTMINGCHRIKKMDEAEKYFVEMKGRNIVPTVVSYTTMIKGYVSV 1141
            EDM SR I PDVVTYNTMING  R KKMDEAEK FVEMKG+NI P+VVSYTTMIKGY++V
Sbjct: 280  EDMISRGISPDVVTYNTMINGYCRFKKMDEAEKVFVEMKGKNIEPSVVSYTTMIKGYLAV 339

Query: 1142 GKVDDALRLLEEMKVSGIEPNAVTYTTLLPGLCDAEKMSEARKILDEMVDKSIAPKDNAI 1321
             +VDD LR+ +EM+  GIEPNA TY+TLLPGLCDA KM EA+ IL  M+ K IAPKDN+I
Sbjct: 340  ERVDDGLRIFDEMRSFGIEPNATTYSTLLPGLCDAGKMVEAKSILKNMMAKHIAPKDNSI 399

Query: 1322 FLRLMSGQCKAGDFDAAAKVLKAMIRLSIPTEAGHYGVLIENFCKAGLYDRAVNXXXXXX 1501
            FL+L+  Q KAGD  AA +VLKAM  L++P EAGHYGVLIEN CKA  ++RA+       
Sbjct: 400  FLKLLVSQSKAGDMAAATEVLKAMATLNVPAEAGHYGVLIENQCKANAHNRAIKLLDILV 459

Query: 1502 XXXXXXRPQSTLHLEPSAYNPMIEYLCSNGQTGKAETLLRQLMKIGVQDPVAFNHLICGH 1681
                  R Q TL +EP+AYNP+IEYLC+NGQT KAE L RQLMK GVQD  A N+LI GH
Sbjct: 460  EKEIILRHQDTLEMEPNAYNPIIEYLCNNGQTSKAEVLFRQLMKRGVQDQEALNNLIRGH 519

Query: 1682 SKEGIPDLADELLKIMVRRKIPSEESAHKSLIESYLSKGEPADAKMALDSMIESGHLPDS 1861
            +KEG PD + E+LKIM RR +P + +A++ LI+SY+SKGEP DAK ALDSM+E GH+PDS
Sbjct: 520  AKEGNPDSSYEILKIMSRRGVPRDANAYELLIKSYMSKGEPGDAKTALDSMVEDGHVPDS 579

Query: 1862 SLYRSVMESLFEDGRVQTASRVMKTMLEK--GVKEHMDLIAKILEALLLRGHVEEALGRI 2035
            SL+RSV+ESLFEDGRVQTASRVM  M++K  G++++MDL+AKILEALL+RGHVEEALGRI
Sbjct: 580  SLFRSVIESLFEDGRVQTASRVMMIMIDKNVGIEDNMDLVAKILEALLMRGHVEEALGRI 639

Query: 2036 ELVMHSGLAPDLDNILSVLCEKGKTIAALKLLDFSLDRDFNIDFSTYDKVLDALLAAGKT 2215
            +L+  +G + DLD++LSVL EKGKTIAALKLLDF L+RD ++DFS+YDKVLDALL AGKT
Sbjct: 640  DLLNQNGHSADLDSLLSVLSEKGKTIAALKLLDFGLERDLSLDFSSYDKVLDALLGAGKT 699

Query: 2216 LNAYSVLCKIMEKGGVTDKSSRENLIKALNQEGNTKQADILSRMILEKVSDNKKGKKQTK 2395
            LNAYSVLCKIM KG VTD  S ++LIK+LNQEGNTKQAD+LSRMI +K    KK KKQ+ 
Sbjct: 700  LNAYSVLCKIMAKGSVTDWKSCDDLIKSLNQEGNTKQADVLSRMI-KKGEGIKKDKKQST 758

Query: 2396 MA 2401
            ++
Sbjct: 759  VS 760


>ref|NP_181260.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75216851|sp|Q9ZUU3.1|PP190_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g37230 gi|4056478|gb|AAC98044.1| unknown protein
            [Arabidopsis thaliana] gi|28973644|gb|AAO64144.1| unknown
            protein [Arabidopsis thaliana]
            gi|110736716|dbj|BAF00321.1| hypothetical protein
            [Arabidopsis thaliana] gi|330254276|gb|AEC09370.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 757

 Score =  968 bits (2502), Expect = 0.0
 Identities = 493/721 (68%), Positives = 585/721 (81%), Gaps = 5/721 (0%)
 Frame = +2

Query: 254  NPETRPESEQSSNSRETTRHEFQHADSTAGSERRPRGKHQVSEKVEDVICRMMDNRAWTT 433
            NPET     QS +++  T+      ++    ER  RGK Q  EK+ED ICRMMDNRAWTT
Sbjct: 48   NPET-----QSPDAKSETKKNLTSTETRPLRERFQRGKRQNHEKLEDTICRMMDNRAWTT 102

Query: 434  RLQNSIRNLVPTFDHELVYNVLHGAKNSEHALQFFRWVERSGLFQHNRETHLKIIEILGR 613
            RLQNSIR+LVP +DH LVYNVLHGAK  EHALQFFRW ERSGL +H+R+TH+K+I++LG 
Sbjct: 103  RLQNSIRDLVPEWDHSLVYNVLHGAKKLEHALQFFRWTERSGLIRHDRDTHMKMIKMLGE 162

Query: 614  ASKLNHARCILLDMPKKGVEWDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGVQRTIKS 793
             SKLNHARCILLDMP+KGV WDED++V++I+SYGKAGIVQESVK+FQKM++LGV+RTIKS
Sbjct: 163  VSKLNHARCILLDMPEKGVPWDEDMFVVLIESYGKAGIVQESVKIFQKMKDLGVERTIKS 222

Query: 794  YDALFKVIMRRGRYMMAKRYFNKMLREGIEPTRHTYNLMIWGFFLSSKVETALRFFEDMK 973
            Y++LFKVI+RRGRYMMAKRYFNKM+ EG+EPTRHTYNLM+WGFFLS ++ETALRFFEDMK
Sbjct: 223  YNSLFKVILRRGRYMMAKRYFNKMVSEGVEPTRHTYNLMLWGFFLSLRLETALRFFEDMK 282

Query: 974  SREILPDVVTYNTMINGCHRIKKMDEAEKYFVEMKGRNIVPTVVSYTTMIKGYVSVGKVD 1153
            +R I PD  T+NTMING  R KKMDEAEK FVEMKG  I P+VVSYTTMIKGY++V +VD
Sbjct: 283  TRGISPDDATFNTMINGFCRFKKMDEAEKLFVEMKGNKIGPSVVSYTTMIKGYLAVDRVD 342

Query: 1154 DALRLLEEMKVSGIEPNAVTYTTLLPGLCDAEKMSEARKILDEMVDKSIAPKDNAIFLRL 1333
            D LR+ EEM+ SGIEPNA TY+TLLPGLCDA KM EA+ IL  M+ K IAPKDN+IFL+L
Sbjct: 343  DGLRIFEEMRSSGIEPNATTYSTLLPGLCDAGKMVEAKNILKNMMAKHIAPKDNSIFLKL 402

Query: 1334 MSGQCKAGDFDAAAKVLKAMIRLSIPTEAGHYGVLIENFCKAGLYDRAVNXXXXXXXXXX 1513
            +  Q KAGD  AA +VLKAM  L++P EAGHYGVLIEN CKA  Y+RA+           
Sbjct: 403  LVSQSKAGDMAAATEVLKAMATLNVPAEAGHYGVLIENQCKASAYNRAIKLLDTLIEKEI 462

Query: 1514 XXRPQSTLHLEPSAYNPMIEYLCSNGQTGKAETLLRQLMKIGVQDPVAFNHLICGHSKEG 1693
              R Q TL +EPSAYNP+IEYLC+NGQT KAE L RQLMK GVQD  A N+LI GH+KEG
Sbjct: 463  ILRHQDTLEMEPSAYNPIIEYLCNNGQTAKAEVLFRQLMKRGVQDQDALNNLIRGHAKEG 522

Query: 1694 IPDLADELLKIMVRRKIPSEESAHKSLIESYLSKGEPADAKMALDSMIESGHLPDSSLYR 1873
             PD + E+LKIM RR +P E +A++ LI+SY+SKGEP DAK ALDSM+E GH+PDSSL+R
Sbjct: 523  NPDSSYEILKIMSRRGVPRESNAYELLIKSYMSKGEPGDAKTALDSMVEDGHVPDSSLFR 582

Query: 1874 SVMESLFEDGRVQTASRVMKTMLEK--GVKEHMDLIAKILEALLLRGHVEEALGRIELVM 2047
            SV+ESLFEDGRVQTASRVM  M++K  G++++MDLIAKILEALL+RGHVEEALGRI+L+ 
Sbjct: 583  SVIESLFEDGRVQTASRVMMIMIDKNVGIEDNMDLIAKILEALLMRGHVEEALGRIDLLN 642

Query: 2048 HSGLAPDLDNILSVLCEKGKTIAALKLLDFSLDRDFNIDFSTYDKVLDALLAAGKTLNAY 2227
             +G   DLD++LSVL EKGKTIAALKLLDF L+RD +++FS+YDKVLDALL AGKTLNAY
Sbjct: 643  QNGHTADLDSLLSVLSEKGKTIAALKLLDFGLERDLSLEFSSYDKVLDALLGAGKTLNAY 702

Query: 2228 SVLCKIMEKGGVTDKSSRENLIKALNQEGNTKQADILSRMILEKVSDNKKG---KKQTKM 2398
            SVLCKIMEKG  TD  S + LIK+LNQEGNTKQAD+LSRMI       KKG   KKQ  +
Sbjct: 703  SVLCKIMEKGSSTDWKSSDELIKSLNQEGNTKQADVLSRMI-------KKGQGIKKQNNV 755

Query: 2399 A 2401
            +
Sbjct: 756  S 756


>ref|XP_002881498.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297327337|gb|EFH57757.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 756

 Score =  957 bits (2475), Expect = 0.0
 Identities = 483/701 (68%), Positives = 575/701 (82%), Gaps = 2/701 (0%)
 Frame = +2

Query: 254  NPETRPESEQSSNSRETTRHEFQHADSTAGSERRPRGKHQVSEKVEDVICRMMDNRAWTT 433
            NPET     QS +++  T++     ++    ER  RGK Q  EK+ED ICRMMDNRAWTT
Sbjct: 48   NPET-----QSPDAKPETKN-LGSTETRPLRERFQRGKRQNHEKLEDTICRMMDNRAWTT 101

Query: 434  RLQNSIRNLVPTFDHELVYNVLHGAKNSEHALQFFRWVERSGLFQHNRETHLKIIEILGR 613
            RLQNSIR+LVP +DH LVYNVLHGAK  EHALQFFRW ERSGL +H+R+TH+K+I++LG 
Sbjct: 102  RLQNSIRDLVPEWDHSLVYNVLHGAKKLEHALQFFRWTERSGLIRHDRDTHMKMIKMLGE 161

Query: 614  ASKLNHARCILLDMPKKGVEWDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGVQRTIKS 793
              KLNHARCILLDMP+KGV WDED++V++I+SYGKAGIVQESVK+FQKM++LGV+RTIKS
Sbjct: 162  VQKLNHARCILLDMPEKGVPWDEDMFVVLIESYGKAGIVQESVKIFQKMKDLGVERTIKS 221

Query: 794  YDALFKVIMRRGRYMMAKRYFNKMLREGIEPTRHTYNLMIWGFFLSSKVETALRFFEDMK 973
            Y+ LFKVI+RRGRYMMAKRYFNKM+ EG+EPTRHTYNLM+WGFFLS ++ETALRFF+DMK
Sbjct: 222  YNTLFKVILRRGRYMMAKRYFNKMVSEGVEPTRHTYNLMLWGFFLSLRLETALRFFDDMK 281

Query: 974  SREILPDVVTYNTMINGCHRIKKMDEAEKYFVEMKGRNIVPTVVSYTTMIKGYVSVGKVD 1153
            +R I PD VTYNT+ING  R KKMDEAEK FVEMKG N  P+VV+YTTMIKGY+SV +VD
Sbjct: 282  TRGISPDAVTYNTIINGYCRFKKMDEAEKLFVEMKGNNSEPSVVTYTTMIKGYLSVDRVD 341

Query: 1154 DALRLLEEMKVSGIEPNAVTYTTLLPGLCDAEKMSEARKILDEMVDKSIAPKDNAIFLRL 1333
            D LR+ EEM+  GIEPNA TY+TLLPGLCD  KM EA+ IL  M+ K IAPKDN+IFL+L
Sbjct: 342  DGLRIFEEMRSFGIEPNATTYSTLLPGLCDVGKMVEAKNILKNMMAKHIAPKDNSIFLKL 401

Query: 1334 MSGQCKAGDFDAAAKVLKAMIRLSIPTEAGHYGVLIENFCKAGLYDRAVNXXXXXXXXXX 1513
            +  Q KAGD  AA +VLKAM  L++P EAGHYGVLIEN CKA  Y+RA+           
Sbjct: 402  LVSQSKAGDMAAATEVLKAMATLNVPAEAGHYGVLIENQCKASAYNRAIKLLDTLIEKEI 461

Query: 1514 XXRPQSTLHLEPSAYNPMIEYLCSNGQTGKAETLLRQLMKIGVQDPVAFNHLICGHSKEG 1693
              R Q TL +EPSAYNP+IEYLC+NGQT KAE L RQLMK GVQD  A N+LI GH+KEG
Sbjct: 462  ILRHQDTLEMEPSAYNPIIEYLCNNGQTAKAEVLFRQLMKRGVQDQDALNNLIRGHAKEG 521

Query: 1694 IPDLADELLKIMVRRKIPSEESAHKSLIESYLSKGEPADAKMALDSMIESGHLPDSSLYR 1873
             P+ + E+LKIM RR +P E +A++ LI+SY+SKGEP DAK ALDSM+E GH+PDS+L+R
Sbjct: 522  NPESSYEILKIMSRRGVPREANAYELLIKSYMSKGEPGDAKTALDSMVEDGHVPDSALFR 581

Query: 1874 SVMESLFEDGRVQTASRVMKTMLEK--GVKEHMDLIAKILEALLLRGHVEEALGRIELVM 2047
            SV+ESLFEDGRVQTASRVM  M++K  G++++MDLIAKILEALL+RGHVEEALGRI+L+ 
Sbjct: 582  SVIESLFEDGRVQTASRVMMIMIDKNVGIEDNMDLIAKILEALLMRGHVEEALGRIDLLN 641

Query: 2048 HSGLAPDLDNILSVLCEKGKTIAALKLLDFSLDRDFNIDFSTYDKVLDALLAAGKTLNAY 2227
             +G   DLD++LSVL EKGKTIAALKLLDF L+RD ++DFS+YDKVLDALL AGKTLNAY
Sbjct: 642  QNGHTADLDSLLSVLSEKGKTIAALKLLDFGLERDLSLDFSSYDKVLDALLGAGKTLNAY 701

Query: 2228 SVLCKIMEKGGVTDKSSRENLIKALNQEGNTKQADILSRMI 2350
            SVLCKIMEKG  TD  S + LIK+LNQEGNTKQAD+LSRMI
Sbjct: 702  SVLCKIMEKGSSTDWKSSDELIKSLNQEGNTKQADVLSRMI 742



 Score =  138 bits (347), Expect = 1e-29
 Identities = 128/585 (21%), Positives = 247/585 (42%), Gaps = 22/585 (3%)
 Frame = +2

Query: 671  EWDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGVQRTIKSYDALFKVIMRRGRYMM--- 841
            EWD  L   ++    K   ++ +++ F+  E  G+ R  +  D   K+I   G       
Sbjct: 113  EWDHSLVYNVLHGAKK---LEHALQFFRWTERSGLIRHDR--DTHMKMIKMLGEVQKLNH 167

Query: 842  AKRYFNKMLREGIEPTRHTYNLMIWGFFLSSKVETALRFFEDMKSREILPDVVTYNTMIN 1021
            A+     M  +G+      + ++I  +  +  V+ +++ F+ MK   +   + +YNT+  
Sbjct: 168  ARCILLDMPEKGVPWDEDMFVVLIESYGKAGIVQESVKIFQKMKDLGVERTIKSYNTLFK 227

Query: 1022 GCHRIKKMDEAEKYFVEMKGRNIVPTVVSYTTMIKGYVSVGKVDDALRLLEEMKVSGIEP 1201
               R  +   A++YF +M    + PT  +Y  M+ G+    +++ ALR  ++MK  GI P
Sbjct: 228  VILRRGRYMMAKRYFNKMVSEGVEPTRHTYNLMLWGFFLSLRLETALRFFDDMKTRGISP 287

Query: 1202 NAVTYTTLLPGLCDAEKMSEARKILDEMVDKSIAPKDNAIFLRLMSGQCKAGDFDAAAKV 1381
            +AVTY T++ G C  +KM EA K+  EM   +  P     +  ++ G       D   ++
Sbjct: 288  DAVTYNTIINGYCRFKKMDEAEKLFVEMKGNNSEPSV-VTYTTMIKGYLSVDRVDDGLRI 346

Query: 1382 LKAMIRLSIPTEAGHYGVLIENFCKAGLYDRAVNXXXXXXXXXXXXRPQSTLHLEPSAYN 1561
             + M    I   A  Y  L+   C  G    A N            +     H+ P   +
Sbjct: 347  FEEMRSFGIEPNATTYSTLLPGLCDVGKMVEAKNIL----------KNMMAKHIAPKDNS 396

Query: 1562 PMIEYLCSNGQTGK---AETLLRQLMKIGVQDPVAFNH---LICGHSKEGIPDLADELLK 1723
              ++ L S  + G    A  +L+ +  + V  P    H   LI    K    + A +LL 
Sbjct: 397  IFLKLLVSQSKAGDMAAATEVLKAMATLNV--PAEAGHYGVLIENQCKASAYNRAIKLLD 454

Query: 1724 IMVRRKI--------PSEESAHKSLIESYLSKGEPADAKMALDSMIESGHLPDSSLYRSV 1879
             ++ ++I          E SA+  +IE   + G+ A A++    +++ G + D     ++
Sbjct: 455  TLIEKEIILRHQDTLEMEPSAYNPIIEYLCNNGQTAKAEVLFRQLMKRG-VQDQDALNNL 513

Query: 1880 MESLFEDGRVQTASRVMKTMLEKGVKEHMDLIAKILEALLLRGHVEEALGRIELVMHSGL 2059
            +    ++G  +++  ++K M  +GV    +    ++++ + +G   +A   ++ ++  G 
Sbjct: 514  IRGHAKEGNPESSYEILKIMSRRGVPREANAYELLIKSYMSKGEPGDAKTALDSMVEDGH 573

Query: 2060 APD---LDNILSVLCEKGKTIAALKLLDFSLDRDFNID--FSTYDKVLDALLAAGKTLNA 2224
             PD     +++  L E G+   A +++   +D++  I+       K+L+ALL  G    A
Sbjct: 574  VPDSALFRSVIESLFEDGRVQTASRVMMIMIDKNVGIEDNMDLIAKILEALLMRGHVEEA 633

Query: 2225 YSVLCKIMEKGGVTDKSSRENLIKALNQEGNTKQADILSRMILEK 2359
                                  I  LNQ G+T   D L  ++ EK
Sbjct: 634  LG-------------------RIDLLNQNGHTADLDSLLSVLSEK 659


>ref|XP_003524868.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            [Glycine max]
          Length = 733

 Score =  946 bits (2444), Expect = 0.0
 Identities = 471/658 (71%), Positives = 559/658 (84%), Gaps = 3/658 (0%)
 Frame = +2

Query: 386  VEDVICRMMDNRAWTTRLQNSIRNLVPTFDHELVYNVLHGAKNSEHALQFFRWVERSGLF 565
            +E  IC+MM NRAWTTRLQNSIR+LVP FD  LVYNVLHGA + EHALQF+RWVER+GLF
Sbjct: 56   LELTICKMMSNRAWTTRLQNSIRSLVPEFDPSLVYNVLHGAASPEHALQFYRWVERAGLF 115

Query: 566  QHNRETHLKIIEILGRASKLNHARCILLDMPKKGVEW---DEDLWVLMIDSYGKAGIVQE 736
             H  ET LKI++ILGR SKLNHARCIL +  + GV      ED +V +IDSYG+AGIVQE
Sbjct: 116  THTPETTLKIVQILGRYSKLNHARCILFNDTRGGVSRAAVTEDAFVSLIDSYGRAGIVQE 175

Query: 737  SVKLFQKMEELGVQRTIKSYDALFKVIMRRGRYMMAKRYFNKMLREGIEPTRHTYNLMIW 916
            SVKLF+KM+ELG+ RT+KSYDALFKVI+RRGRYMMAKRY+N ML EG++PTRHT+N+++W
Sbjct: 176  SVKLFKKMKELGLDRTVKSYDALFKVILRRGRYMMAKRYYNAMLLEGVDPTRHTFNILLW 235

Query: 917  GFFLSSKVETALRFFEDMKSREILPDVVTYNTMINGCHRIKKMDEAEKYFVEMKGRNIVP 1096
            G FLS +++TA+RF+EDMKSR ILPDVVTYNT+ING  R KK+DEAEK FVEMKGR+IVP
Sbjct: 236  GMFLSLRLDTAVRFYEDMKSRGILPDVVTYNTLINGYFRFKKVDEAEKLFVEMKGRDIVP 295

Query: 1097 TVVSYTTMIKGYVSVGKVDDALRLLEEMKVSGIEPNAVTYTTLLPGLCDAEKMSEARKIL 1276
             V+S+TTM+KGYV+ G++DDAL++ EEMK  G++PN VT++TLLPGLCDAEKM+EAR +L
Sbjct: 296  NVISFTTMLKGYVAAGRIDDALKVFEEMKGCGVKPNVVTFSTLLPGLCDAEKMAEARDVL 355

Query: 1277 DEMVDKSIAPKDNAIFLRLMSGQCKAGDFDAAAKVLKAMIRLSIPTEAGHYGVLIENFCK 1456
             EMV++ IAPKDNA+F+++MS QCKAGD DAAA VLKAM+RLSIPTEAGHYGVLIE+FCK
Sbjct: 356  GEMVERYIAPKDNALFMKMMSCQCKAGDLDAAADVLKAMVRLSIPTEAGHYGVLIESFCK 415

Query: 1457 AGLYDRAVNXXXXXXXXXXXXRPQSTLHLEPSAYNPMIEYLCSNGQTGKAETLLRQLMKI 1636
            A +YD+A              RPQ+   +EPSAYN MI YLC +G+TGKAET  RQL+K 
Sbjct: 416  ANVYDKAEKLLDKLIEKEIVLRPQNDSEMEPSAYNLMIGYLCEHGRTGKAETFFRQLLKK 475

Query: 1637 GVQDPVAFNHLICGHSKEGIPDLADELLKIMVRRKIPSEESAHKSLIESYLSKGEPADAK 1816
            GVQD VAFN+LI GHSKEG PD A E++KIM RR +  +  +++ LIESYL KGEPADAK
Sbjct: 476  GVQDSVAFNNLIRGHSKEGNPDSAFEIMKIMGRRGVARDVDSYRLLIESYLRKGEPADAK 535

Query: 1817 MALDSMIESGHLPDSSLYRSVMESLFEDGRVQTASRVMKTMLEKGVKEHMDLIAKILEAL 1996
             ALD M+ESGHLP+SSLYRSVMESLF+DGRVQTASRVMK+M+EKG KE+MDL+ KILEAL
Sbjct: 536  TALDGMLESGHLPESSLYRSVMESLFDDGRVQTASRVMKSMVEKGAKENMDLVLKILEAL 595

Query: 1997 LLRGHVEEALGRIELVMHSGLAPDLDNILSVLCEKGKTIAALKLLDFSLDRDFNIDFSTY 2176
            LLRGHVEEALGRI+L+MH+G  PD D++LSVLCEK KTIAALKLLDF L+RD  IDFS Y
Sbjct: 596  LLRGHVEEALGRIDLLMHNGCEPDFDHLLSVLCEKEKTIAALKLLDFVLERDCIIDFSIY 655

Query: 2177 DKVLDALLAAGKTLNAYSVLCKIMEKGGVTDKSSRENLIKALNQEGNTKQADILSRMI 2350
            DKVLDALLAAGKTLNAYS+LCKI+EKGG TD SSR+ LIK+LNQEGNTKQAD+LSRMI
Sbjct: 656  DKVLDALLAAGKTLNAYSILCKILEKGGSTDWSSRDELIKSLNQEGNTKQADVLSRMI 713


>ref|XP_003532699.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            [Glycine max]
          Length = 738

 Score =  943 bits (2437), Expect = 0.0
 Identities = 476/682 (69%), Positives = 566/682 (82%), Gaps = 10/682 (1%)
 Frame = +2

Query: 386  VEDVICRMMDNRAWTTRLQNSIRNLVPTFDHELVYNVLHGAKNSEHALQFFRWVERSGLF 565
            +E  IC+MM NRAWTTRLQNSIR+LVP FD  LVYNVLHGA + EHALQF+RWVER+GLF
Sbjct: 56   LELTICKMMSNRAWTTRLQNSIRSLVPEFDPSLVYNVLHGAASPEHALQFYRWVERAGLF 115

Query: 566  QHNRETHLKIIEILGRASKLNHARCILLDMPKKGVEW---DEDLWVLMIDSYGKAGIVQE 736
             H  ET LKI++ILGR SKLNHARCIL D  + G       ED +V +IDSYG+AGIVQE
Sbjct: 116  THTPETTLKIVQILGRYSKLNHARCILFDDTRGGASRATVTEDAFVSLIDSYGRAGIVQE 175

Query: 737  SVKLFQKMEELGVQRTIKSYDALFKVIMRRGRYMMAKRYFNKMLREGIEPTRHTYNLMIW 916
            SVKLF+KM+ELGV RT+KSYDALFKVI+RRGRYMMAKRY+N ML E +EPTRHTYN+++W
Sbjct: 176  SVKLFKKMKELGVDRTVKSYDALFKVILRRGRYMMAKRYYNAMLNESVEPTRHTYNILLW 235

Query: 917  GFFLSSKVETALRFFEDMKSREILPDVVTYNTMINGCHRIKKMDEAEKYFVEMKGRNIVP 1096
            G FLS +++TA+RF+EDMKSR ILPDVVTYNT+ING  R KK++EAEK FVEMKGR+IVP
Sbjct: 236  GMFLSLRLDTAVRFYEDMKSRGILPDVVTYNTLINGYFRFKKVEEAEKLFVEMKGRDIVP 295

Query: 1097 TVVSYTTMIKGYVSVGKVDDALRLLEEMKVSGIEPNAVTYTTLLPGLCDAEKMSEARKIL 1276
             V+S+TTM+KGYV+ G++DDAL++ EEMK  G++PNAVT++TLLPGLCDAEKM+EAR +L
Sbjct: 296  NVISFTTMLKGYVAAGQIDDALKVFEEMKGCGVKPNAVTFSTLLPGLCDAEKMAEARDVL 355

Query: 1277 DEMVDKSIAPKDNAIFLRLMSGQCKAGDFDAAAKVLKAMIRLSIPTEAGHYGVLIENFCK 1456
             EMV++ IAPKDNA+F++LMS QCKAGD DAA  VLKAMIRLSIPTEAGHYGVLIENFCK
Sbjct: 356  GEMVERYIAPKDNAVFMKLMSCQCKAGDLDAAGDVLKAMIRLSIPTEAGHYGVLIENFCK 415

Query: 1457 AGLYDRAVNXXXXXXXXXXXXRPQST-----LHLEPSAYNPMIEYLCSNGQTGKAETLLR 1621
            A LYD+A              R ++        +EPSAYN MI YLC +G+TGKAET  R
Sbjct: 416  ANLYDKAEKLLDKMIEKEIVLRQKNAYETELFEMEPSAYNLMIGYLCEHGRTGKAETFFR 475

Query: 1622 QLMKIGVQDPVAFNHLICGHSKEGIPDLADELLKIMVRRKIPSEESAHKSLIESYLSKGE 1801
            QLMK GVQD V+FN+LICGHSKEG PD A E++KIM RR +  +  +++ LIESYL KGE
Sbjct: 476  QLMKKGVQDSVSFNNLICGHSKEGNPDSAFEIIKIMGRRGVARDADSYRLLIESYLRKGE 535

Query: 1802 PADAKMALDSMIESGHLPDSSLYRSVMESLFEDGRVQTASRVMKTMLEKGVKEHMDLIAK 1981
            PADAK ALD M+ESGHLP+SSLYRSVMESLF+DGRVQTASRVMK+M+EKGVKE+MDL++K
Sbjct: 536  PADAKTALDGMLESGHLPESSLYRSVMESLFDDGRVQTASRVMKSMVEKGVKENMDLVSK 595

Query: 1982 ILEALLLRGHVEEALGRIELVMHSGLAPDLDNILSVLCEKGKTIAALKLLDFSLDRDFNI 2161
            +LEALL+RGHVEEALGRI L+M +G  PD D++LSVLCEK KTIAALKLLDF L+RD  I
Sbjct: 596  VLEALLMRGHVEEALGRIHLLMLNGCEPDFDHLLSVLCEKEKTIAALKLLDFVLERDCII 655

Query: 2162 DFSTYDKVLDALLAAGKTLNAYSVLCKIMEKGGVTDKSSRENLIKALNQEGNTKQADILS 2341
            DFS YDKVLDALLAAGKTLNAYS+LCKI+EKGG TD SSR+ LIK+LNQEGNTKQAD+LS
Sbjct: 656  DFSIYDKVLDALLAAGKTLNAYSILCKILEKGGSTDWSSRDELIKSLNQEGNTKQADVLS 715

Query: 2342 RMI--LEKVSDNKKGKKQTKMA 2401
            RMI   +     + GK++T ++
Sbjct: 716  RMIKGTDGGPPKRGGKRKTTVS 737


Top