BLASTX nr result

ID: Catharanthus22_contig00005067 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00005067
         (3639 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006363176.1| PREDICTED: pentatricopeptide repeat-containi...   927   0.0  
ref|XP_004232626.1| PREDICTED: pentatricopeptide repeat-containi...   924   0.0  
ref|XP_002275605.1| PREDICTED: pentatricopeptide repeat-containi...   909   0.0  
emb|CBI27232.3| unnamed protein product [Vitis vinifera]              901   0.0  
ref|XP_006448599.1| hypothetical protein CICLE_v10014519mg [Citr...   879   0.0  
gb|EMJ12567.1| hypothetical protein PRUPE_ppa002507mg [Prunus pe...   870   0.0  
ref|XP_006468575.1| PREDICTED: pentatricopeptide repeat-containi...   869   0.0  
ref|XP_002304600.2| hypothetical protein POPTR_0003s15360g [Popu...   867   0.0  
gb|EOX96827.1| Pentatricopeptide repeat (PPR) superfamily protei...   862   0.0  
ref|XP_002297917.1| hypothetical protein POPTR_0001s12190g [Popu...   862   0.0  
ref|XP_004295517.1| PREDICTED: pentatricopeptide repeat-containi...   855   0.0  
ref|XP_002528143.1| pentatricopeptide repeat-containing protein,...   849   0.0  
ref|XP_002867892.1| EMB1025 [Arabidopsis lyrata subsp. lyrata] g...   818   0.0  
ref|NP_193742.1| pentatricopeptide repeat-containing protein [Ar...   812   0.0  
ref|XP_006283284.1| hypothetical protein CARUB_v10004320mg [Caps...   808   0.0  
ref|XP_003534864.1| PREDICTED: pentatricopeptide repeat-containi...   802   0.0  
gb|EXB83265.1| hypothetical protein L484_011559 [Morus notabilis]     797   0.0  
gb|ESW10855.1| hypothetical protein PHAVU_009G243700g [Phaseolus...   793   0.0  
ref|XP_006404148.1| hypothetical protein EUTSA_v10010168mg [Eutr...   790   0.0  
ref|XP_003594857.1| Pentatricopeptide repeat-containing protein ...   781   0.0  

>ref|XP_006363176.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20090-like
            isoform X1 [Solanum tuberosum]
            gi|565395083|ref|XP_006363177.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g20090-like isoform X2 [Solanum tuberosum]
          Length = 717

 Score =  927 bits (2396), Expect = 0.0
 Identities = 447/635 (70%), Positives = 529/635 (83%), Gaps = 1/635 (0%)
 Frame = +3

Query: 345  NSCETEKDYEEDIIAIQSTNSHMLPKR-SHKVEVEPPITDRLFKHAPKSGSYKQGDSTFY 521
            NSC  E    E+ ++  S    + P   S K EVE PI+D+LFK APK GS+K GDSTFY
Sbjct: 86   NSCGAEV---EEPLSDNSFKVTLKPNLGSCKTEVEVPISDKLFKEAPKLGSFKLGDSTFY 142

Query: 522  SLIENYANSGDFKSLEMVFSRMWHERRAFVEKSFIVVFRAYGKAHLPEKAVELFDRMVYE 701
            SLIE YANSGDF SLE VF RM  E+R F+EKSFI+VFRAYGKA LPEKAVELF+RMV E
Sbjct: 143  SLIEKYANSGDFTSLEKVFDRMKCEKRVFIEKSFILVFRAYGKARLPEKAVELFERMVDE 202

Query: 702  FQCKRTVKSFNSVLNVIIQEGLYSRALEFHSYVVNCKNIKPNVLTFNLVIKAMCKLQLVD 881
            FQCKRTVKSFNSVLNVI+Q GLY  AL+F++ VVN +NI PNVL+FNLVIK MCKL++VD
Sbjct: 203  FQCKRTVKSFNSVLNVIVQTGLYRHALDFYADVVNNRNIMPNVLSFNLVIKTMCKLRMVD 262

Query: 882  RAVEVFREMPAMKCDADVFTYCTLMDGLCKEDRVEEAVALLDEMQIEGCFPNPATFNVLI 1061
            RA+EVFREMP  KC+ DV+TYCTLMDGLCK+DR++EAV LLDEMQ+EGC P P TFNVLI
Sbjct: 263  RAMEVFREMPTWKCEPDVYTYCTLMDGLCKDDRIDEAVILLDEMQVEGCLPVPVTFNVLI 322

Query: 1062 NGLCKKGDLSRAAKVVDNMFLKGCVPNEVTYNTLIHGLCLQGKLEKAISLLDRMVSDKFI 1241
            NGLC+KGDL+RAAK+VDNMFLKGCVPNEVTYNTLIHGLCL+GKLEKA+SL+DRMVS+K+I
Sbjct: 323  NGLCRKGDLARAAKLVDNMFLKGCVPNEVTYNTLIHGLCLKGKLEKAVSLVDRMVSNKYI 382

Query: 1242 PNDVTYGTIIDGLVRKGRAVDGVHVMVSMEERGHQGNDHIYSSLVSGLFKEGRSEEALNL 1421
            P D+TYGTII+G V++ RA DGV ++++M+E+GH  N+++YS+LVSGLFKEG+ EEAL +
Sbjct: 383  PTDITYGTIINGFVKQRRATDGVQILLAMQEKGHLANEYVYSALVSGLFKEGKPEEALKI 442

Query: 1422 WKKIMENGHKPNTVVYSALIDGLCRVGKPFEAKEILPEMVNKGCEPNAYTYSSLMKGFFK 1601
            WK ++E G KPNTV YSA IDGLCR G+P EAKEIL EM   GC PNAYTY SLMKG+FK
Sbjct: 443  WKGMIEKGVKPNTVAYSAFIDGLCREGRPDEAKEILSEMNKMGCTPNAYTYCSLMKGYFK 502

Query: 1602 VGNSNMAVLLWKEMTEKGCVHNEFCYSILIHGLCDEGKLKEATMVWRHMLGKGWTPDVVA 1781
             G+SN A+LLWK+M   G   NE CYS+L HGLC +GKLKEA MVW+HMLGKG  PDVVA
Sbjct: 503  TGDSNKAILLWKDMATSGITCNEICYSVLTHGLCQDGKLKEAMMVWKHMLGKGLVPDVVA 562

Query: 1782 YTSMIHGLCSVGSVEQGLLLFNEMLYKGSDSQPDVFTYNVLFNALCKHEKITPAIHLLNN 1961
            Y+SMIHGLC+ GSV+QGL LFNEM  +GSDSQPDV  YN++ NALCK ++I+ AI LLN 
Sbjct: 563  YSSMIHGLCNAGSVDQGLRLFNEMQCRGSDSQPDVIAYNIIINALCKVDRISLAIDLLNT 622

Query: 1962 MLDRGCDPDIVTCNIFLTTFKEKMNPSQDGREFLDELFLRLHKRQRIDGASRIIEVMLQK 2141
            MLDRGCDPD +TCNIFL T  +K NPSQDG +FLD+L L+L++RQRI GASRIIEVMLQK
Sbjct: 623  MLDRGCDPDTITCNIFLKTLNDKANPSQDGEDFLDKLVLQLYRRQRIVGASRIIEVMLQK 682

Query: 2142 FLYPKPSTWEIAIRELCKPRKIQVAINKCWNSLFV 2246
             +YPK STWE+ IRELCKP+K+Q AINKCW+ LF+
Sbjct: 683  IIYPKSSTWEMIIRELCKPKKVQGAINKCWSDLFI 717


>ref|XP_004232626.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20090-like
            [Solanum lycopersicum]
          Length = 717

 Score =  924 bits (2389), Expect = 0.0
 Identities = 448/635 (70%), Positives = 530/635 (83%), Gaps = 1/635 (0%)
 Frame = +3

Query: 345  NSCETEKDYEEDIIAIQSTNSHMLPKR-SHKVEVEPPITDRLFKHAPKSGSYKQGDSTFY 521
            NSC TE    E+ ++ +S    + P   S + EVE PI+D+LFK APK GS+K GDSTFY
Sbjct: 86   NSCVTEV---EEPLSDKSFKVTLKPNLGSCETEVEVPISDKLFKEAPKLGSFKLGDSTFY 142

Query: 522  SLIENYANSGDFKSLEMVFSRMWHERRAFVEKSFIVVFRAYGKAHLPEKAVELFDRMVYE 701
            SLIE YANS DF SLE VF RM  E+R F+EKSFI+VFRAYGKA LPEKAVELF+RMV E
Sbjct: 143  SLIEKYANSEDFTSLEKVFGRMKCEKRVFIEKSFILVFRAYGKARLPEKAVELFERMVDE 202

Query: 702  FQCKRTVKSFNSVLNVIIQEGLYSRALEFHSYVVNCKNIKPNVLTFNLVIKAMCKLQLVD 881
            FQCKRTVKSFNSVLNVI+Q GLY RAL+F++ VVN +NI PNVL+FNLVIK MCKL++VD
Sbjct: 203  FQCKRTVKSFNSVLNVIVQTGLYHRALDFYADVVNNRNIMPNVLSFNLVIKTMCKLRMVD 262

Query: 882  RAVEVFREMPAMKCDADVFTYCTLMDGLCKEDRVEEAVALLDEMQIEGCFPNPATFNVLI 1061
            RA+EVFREMP  KC+ DV+TYCTLMDGLCK+DR++EAV LLDEMQ+EGC P P TFNVLI
Sbjct: 263  RAMEVFREMPTWKCEPDVYTYCTLMDGLCKDDRIDEAVILLDEMQVEGCLPVPVTFNVLI 322

Query: 1062 NGLCKKGDLSRAAKVVDNMFLKGCVPNEVTYNTLIHGLCLQGKLEKAISLLDRMVSDKFI 1241
            NGLC+KGDL+RAAK+VDNMFLKGCVPN+VTYNTLIHGLCL+GKLEKA+SLLDRMVS+K+I
Sbjct: 323  NGLCRKGDLARAAKLVDNMFLKGCVPNDVTYNTLIHGLCLKGKLEKAVSLLDRMVSNKYI 382

Query: 1242 PNDVTYGTIIDGLVRKGRAVDGVHVMVSMEERGHQGNDHIYSSLVSGLFKEGRSEEALNL 1421
            P D+TYGTII+G V++ RA DGV ++++M+E+GH  N+++YS+LVSGLFKEG+ EEAL +
Sbjct: 383  PTDITYGTIINGFVKQRRATDGVQILLAMQEKGHLANEYVYSALVSGLFKEGKPEEALKI 442

Query: 1422 WKKIMENGHKPNTVVYSALIDGLCRVGKPFEAKEILPEMVNKGCEPNAYTYSSLMKGFFK 1601
            WK+++E G KPN V YSA IDGLCR GKP EAKEIL EM   GC PNAYTY SLMKG+FK
Sbjct: 443  WKEMIEKGVKPNIVAYSAFIDGLCREGKPDEAKEILSEMNKMGCTPNAYTYCSLMKGYFK 502

Query: 1602 VGNSNMAVLLWKEMTEKGCVHNEFCYSILIHGLCDEGKLKEATMVWRHMLGKGWTPDVVA 1781
              +SN A+LLWK+M   G   NE CYS+LIHGLC +GKLKEA MVW+HMLGKG  PD VA
Sbjct: 503  TSDSNKAILLWKDMATSGITCNEICYSVLIHGLCQDGKLKEAMMVWKHMLGKGLVPDAVA 562

Query: 1782 YTSMIHGLCSVGSVEQGLLLFNEMLYKGSDSQPDVFTYNVLFNALCKHEKITPAIHLLNN 1961
            Y+SMIHGLC+ GSV+QGL LFNEML +GSDSQPDV  YN++ NALCK ++I+ AI LLN 
Sbjct: 563  YSSMIHGLCNAGSVDQGLRLFNEMLCRGSDSQPDVVAYNIIINALCKVDRISLAIDLLNT 622

Query: 1962 MLDRGCDPDIVTCNIFLTTFKEKMNPSQDGREFLDELFLRLHKRQRIDGASRIIEVMLQK 2141
            MLDRGCDPD +TCNIFL T  EK NPSQDG +FLD+L L+L++RQRI GASRIIEVMLQK
Sbjct: 623  MLDRGCDPDKITCNIFLKTLNEKANPSQDGEDFLDKLVLQLYRRQRIIGASRIIEVMLQK 682

Query: 2142 FLYPKPSTWEIAIRELCKPRKIQVAINKCWNSLFV 2246
             L PK STWE+ IRELCKP+K+Q AINKCW+ LF+
Sbjct: 683  ILSPKSSTWEMIIRELCKPKKVQGAINKCWSDLFI 717


>ref|XP_002275605.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20090-like
            [Vitis vinifera]
          Length = 644

 Score =  909 bits (2350), Expect = 0.0
 Identities = 438/603 (72%), Positives = 512/603 (84%), Gaps = 1/603 (0%)
 Frame = +3

Query: 438  EVEPPITDRLFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMVFSRMWHERRAFVEK 617
            E + PI D++FK A + GSYK GDSTFYSLIENYANSGDF +L  VF RM  ERR F+EK
Sbjct: 41   ESDAPIPDQIFKSASQMGSYKSGDSTFYSLIENYANSGDFGTLFQVFDRMKRERRVFIEK 100

Query: 618  SFIVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVIIQEGLYSRALEFHSY 797
            +FI+VFRAYGKAHLPEKA+ELF RMV EFQC+RTV+SFNSVLNVIIQEGL+ RALEF+  
Sbjct: 101  NFILVFRAYGKAHLPEKAIELFGRMVDEFQCRRTVRSFNSVLNVIIQEGLFHRALEFYEC 160

Query: 798  VVNCK-NIKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDADVFTYCTLMDGLCKE 974
             V  K NI PNVL+FNLVIKAMCKL LVDRA+EVFREM   KC+ DVFTYCTLMDGLCKE
Sbjct: 161  GVGGKTNISPNVLSFNLVIKAMCKLGLVDRAIEVFREMAIQKCEPDVFTYCTLMDGLCKE 220

Query: 975  DRVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVVDNMFLKGCVPNEVTY 1154
            DR++EAV LLDEMQIEGCFP+  TFNVLINGLCKKGD+ R  K+VDNMFLKGCVPNEVTY
Sbjct: 221  DRIDEAVLLLDEMQIEGCFPSSVTFNVLINGLCKKGDMVRVTKLVDNMFLKGCVPNEVTY 280

Query: 1155 NTLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRKGRAVDGVHVMVSMEE 1334
            NT+I+GLCL+GKL+KA+SLLDRMV+ K +PNDVTYGT+I+GLV++GR+VDGVH++ S+EE
Sbjct: 281  NTIINGLCLKGKLDKAVSLLDRMVASKCVPNDVTYGTLINGLVKQGRSVDGVHLLSSLEE 340

Query: 1335 RGHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVYSALIDGLCRVGKPFE 1514
            RGH  N++ YS+L+SGLFKE +SEEA+ LWKK++E G +PN VVYSALIDGLCR GK  E
Sbjct: 341  RGHHANEYAYSTLISGLFKEEKSEEAMGLWKKMVEKGCQPNIVVYSALIDGLCREGKLDE 400

Query: 1515 AKEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTEKGCVHNEFCYSILIH 1694
            AKEIL EMVNKGC PNA+TYSSL+KGFFK GNS  A+ +WKEM +  CV NE CYS+LIH
Sbjct: 401  AKEILCEMVNKGCTPNAFTYSSLIKGFFKTGNSQKAIRVWKEMAKNNCVPNEICYSVLIH 460

Query: 1695 GLCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQGLLLFNEMLYKGSDS 1874
            GLC++GKL+EA M+W HMLG+G  PDVVAY+SMIHGLC+ GSVE GL LFNEML + SDS
Sbjct: 461  GLCEDGKLREAMMMWTHMLGRGLRPDVVAYSSMIHGLCNAGSVEVGLKLFNEMLCQESDS 520

Query: 1875 QPDVFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDIVTCNIFLTTFKEKMNPSQDGR 2054
            QPDV TYN+L  ALCK   I+ AI LLN+MLDRGC+PD++TCNIFL   +EK+NP QDGR
Sbjct: 521  QPDVVTYNILLRALCKQNSISHAIDLLNSMLDRGCNPDLITCNIFLNALREKLNPPQDGR 580

Query: 2055 EFLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIRELCKPRKIQVAINKCWN 2234
            EFLDEL +RLHKRQRI GA++IIEVMLQKFL P  STWE  I ELCKP+K+Q  I+KCW+
Sbjct: 581  EFLDELVVRLHKRQRIVGAAKIIEVMLQKFLPPNASTWERIIPELCKPKKVQAIIDKCWS 640

Query: 2235 SLF 2243
            SLF
Sbjct: 641  SLF 643


>emb|CBI27232.3| unnamed protein product [Vitis vinifera]
          Length = 660

 Score =  901 bits (2329), Expect = 0.0
 Identities = 434/594 (73%), Positives = 506/594 (85%), Gaps = 1/594 (0%)
 Frame = +3

Query: 465  LFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMVFSRMWHERRAFVEKSFIVVFRAY 644
            +FK A + GSYK GDSTFYSLIENYANSGDF +L  VF RM  ERR F+EK+FI+VFRAY
Sbjct: 66   IFKSASQMGSYKSGDSTFYSLIENYANSGDFGTLFQVFDRMKRERRVFIEKNFILVFRAY 125

Query: 645  GKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVIIQEGLYSRALEFHSYVVNCK-NIK 821
            GKAHLPEKA+ELF RMV EFQC+RTV+SFNSVLNVIIQEGL+ RALEF+   V  K NI 
Sbjct: 126  GKAHLPEKAIELFGRMVDEFQCRRTVRSFNSVLNVIIQEGLFHRALEFYECGVGGKTNIS 185

Query: 822  PNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDADVFTYCTLMDGLCKEDRVEEAVAL 1001
            PNVL+FNLVIKAMCKL LVDRA+EVFREM   KC+ DVFTYCTLMDGLCKEDR++EAV L
Sbjct: 186  PNVLSFNLVIKAMCKLGLVDRAIEVFREMAIQKCEPDVFTYCTLMDGLCKEDRIDEAVLL 245

Query: 1002 LDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVVDNMFLKGCVPNEVTYNTLIHGLCL 1181
            LDEMQIEGCFP+  TFNVLINGLCKKGD+ R  K+VDNMFLKGCVPNEVTYNT+I+GLCL
Sbjct: 246  LDEMQIEGCFPSSVTFNVLINGLCKKGDMVRVTKLVDNMFLKGCVPNEVTYNTIINGLCL 305

Query: 1182 QGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRKGRAVDGVHVMVSMEERGHQGNDHI 1361
            +GKL+KA+SLLDRMV+ K +PNDVTYGT+I+GLV++GR+VDGVH++ S+EERGH  N++ 
Sbjct: 306  KGKLDKAVSLLDRMVASKCVPNDVTYGTLINGLVKQGRSVDGVHLLSSLEERGHHANEYA 365

Query: 1362 YSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVYSALIDGLCRVGKPFEAKEILPEMV 1541
            YS+L+SGLFKE +SEEA+ LWKK++E G +PN VVYSALIDGLCR GK  EAKEIL EMV
Sbjct: 366  YSTLISGLFKEEKSEEAMGLWKKMVEKGCQPNIVVYSALIDGLCREGKLDEAKEILCEMV 425

Query: 1542 NKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTEKGCVHNEFCYSILIHGLCDEGKLK 1721
            NKGC PNA+TYSSL+KGFFK GNS  A+ +WKEM +  CV NE CYS+LIHGLC++GKL+
Sbjct: 426  NKGCTPNAFTYSSLIKGFFKTGNSQKAIRVWKEMAKNNCVPNEICYSVLIHGLCEDGKLR 485

Query: 1722 EATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQGLLLFNEMLYKGSDSQPDVFTYNV 1901
            EA M+W HMLG+G  PDVVAY+SMIHGLC+ GSVE GL LFNEML + SDSQPDV TYN+
Sbjct: 486  EAMMMWTHMLGRGLRPDVVAYSSMIHGLCNAGSVEVGLKLFNEMLCQESDSQPDVVTYNI 545

Query: 1902 LFNALCKHEKITPAIHLLNNMLDRGCDPDIVTCNIFLTTFKEKMNPSQDGREFLDELFLR 2081
            L  ALCK   I+ AI LLN+MLDRGC+PD++TCNIFL   +EK+NP QDGREFLDEL +R
Sbjct: 546  LLRALCKQNSISHAIDLLNSMLDRGCNPDLITCNIFLNALREKLNPPQDGREFLDELVVR 605

Query: 2082 LHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIRELCKPRKIQVAINKCWNSLF 2243
            LHKRQRI GA++IIEVMLQKFL P  STWE  I ELCKP+K+Q  I+KCW+SLF
Sbjct: 606  LHKRQRIVGAAKIIEVMLQKFLPPNASTWERIIPELCKPKKVQAIIDKCWSSLF 659


>ref|XP_006448599.1| hypothetical protein CICLE_v10014519mg [Citrus clementina]
            gi|557551210|gb|ESR61839.1| hypothetical protein
            CICLE_v10014519mg [Citrus clementina]
          Length = 664

 Score =  879 bits (2270), Expect = 0.0
 Identities = 418/618 (67%), Positives = 508/618 (82%), Gaps = 2/618 (0%)
 Frame = +3

Query: 396  STNSHMLPKRSHKVEVEPPITDRLFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMV 575
            S+N HM  +     + E P +D +F   PK GSY+ GDSTFYSLI++YANSGDFKSLEMV
Sbjct: 46   SSNKHMETEPQGNAKSEQPFSDEVFNSTPKLGSYQLGDSTFYSLIQHYANSGDFKSLEMV 105

Query: 576  FSRMWHERRAFVEKSFIVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVII 755
              RM  E+R  +EKSFI +F+AYGKAHL E+AV LF  MV EFQCKRTVKSFNSVLNVII
Sbjct: 106  LCRMRREKRVALEKSFIFIFKAYGKAHLVEEAVRLFHTMVDEFQCKRTVKSFNSVLNVII 165

Query: 756  QEGLYSRALEFHSYVVNCK--NIKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDA 929
            QEGLY RALEF++++VN K  NI PN LTFNLVIKA+C+L LVD A+E+FREMP   C+ 
Sbjct: 166  QEGLYHRALEFYNHIVNAKHMNILPNTLTFNLVIKAVCRLGLVDNAIELFREMPVRNCEP 225

Query: 930  DVFTYCTLMDGLCKEDRVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVV 1109
            D++TYCTLMDGLCKE+R++EAV LLDEMQ++GCFP P TFNVLINGLCK G L RAAK+V
Sbjct: 226  DIYTYCTLMDGLCKENRLDEAVLLLDEMQVDGCFPTPVTFNVLINGLCKNGGLGRAAKLV 285

Query: 1110 DNMFLKGCVPNEVTYNTLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRK 1289
            DNMFLKGC+PNEVTYNTLIHGLCL+G L+KA+SLLDRMV+ K +PN+VTYGTII+GLV+ 
Sbjct: 286  DNMFLKGCLPNEVTYNTLIHGLCLKGDLDKAVSLLDRMVASKCMPNEVTYGTIINGLVKL 345

Query: 1290 GRAVDGVHVMVSMEERGHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVY 1469
            GRAVDG  V++SMEER    N++IYS+L+SGLFKEG++E+A+ LWK++ME G KPNTVVY
Sbjct: 346  GRAVDGARVLMSMEERKFHVNEYIYSTLISGLFKEGKAEDAMKLWKQMMEKGCKPNTVVY 405

Query: 1470 SALIDGLCRVGKPFEAKEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTE 1649
            SALIDGLCRVGKP EA+EIL EM+N GC  NA+TYSSLMKGFF+ G  + AV +WK+M +
Sbjct: 406  SALIDGLCRVGKPDEAEEILSEMINNGCAANAFTYSSLMKGFFESGKGHKAVEIWKDMAK 465

Query: 1650 KGCVHNEFCYSILIHGLCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQ 1829
              CV+NE CYS+LIHGLC++GKL+EA MVW  ML +G+ PDVVAY+SMIHGLC+ GS+E+
Sbjct: 466  NNCVYNEVCYSVLIHGLCEDGKLREARMVWTQMLSRGYKPDVVAYSSMIHGLCNAGSLEE 525

Query: 1830 GLLLFNEMLYKGSDSQPDVFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDIVTCNIF 2009
             L LFNEML     SQPDVFTYN+L NALCK   I+ +I LLN+M+DRGCDPD+VTCNIF
Sbjct: 526  ALKLFNEMLCPEPKSQPDVFTYNILLNALCKQSNISHSIDLLNSMMDRGCDPDLVTCNIF 585

Query: 2010 LTTFKEKMNPSQDGREFLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIREL 2189
            LT  KEK+   QDG +FL+EL +RL KRQR  G  +I+EVMLQKFL PK STWE  ++EL
Sbjct: 586  LTALKEKLETPQDGTDFLNELAIRLFKRQRTSGGFKIVEVMLQKFLPPKTSTWERVVQEL 645

Query: 2190 CKPRKIQVAINKCWNSLF 2243
            C+P++IQ AINKCW++L+
Sbjct: 646  CRPKRIQAAINKCWSNLY 663


>gb|EMJ12567.1| hypothetical protein PRUPE_ppa002507mg [Prunus persica]
          Length = 664

 Score =  870 bits (2247), Expect = 0.0
 Identities = 426/643 (66%), Positives = 513/643 (79%), Gaps = 2/643 (0%)
 Frame = +3

Query: 321  CPFSALPNNSCETEKDYEEDIIAIQSTNSHMLPKRSHKVEVEPPITDRLFKHAPKSGSYK 500
            CP S     +C     +   ++AI S N  +  +  +  E EPPI++ +FK   K GSYK
Sbjct: 26   CPISPCELLTCSLHSHFS--VLAIPS-NQALQTEPVNNDETEPPISNEIFKKGTKLGSYK 82

Query: 501  QGDSTFYSLIENYANSGDFKSLEMVFSRMWHERRAFVEKSFIVVFRAYGKAHLPEKAVEL 680
             GDSTFYSLIENYAN GDF+SLE V  RM  ERR F+E+SFI++FRAYGKAHLP KAVEL
Sbjct: 83   SGDSTFYSLIENYANLGDFRSLEQVLDRMKRERRVFIEQSFILMFRAYGKAHLPNKAVEL 142

Query: 681  FDRMVYEFQCKRTVKSFNSVLNVIIQEGLYSRALEFHSYVVNCK--NIKPNVLTFNLVIK 854
            F RMV EFQC+RTVKSFNSVLNVIIQEG YS ALEF+S+VV     NI PNVL+FNL+IK
Sbjct: 143  FYRMVDEFQCRRTVKSFNSVLNVIIQEGHYSHALEFYSHVVGTTGMNISPNVLSFNLIIK 202

Query: 855  AMCKLQLVDRAVEVFREMPAMKCDADVFTYCTLMDGLCKEDRVEEAVALLDEMQIEGCFP 1034
            +MCKL LVDRAV+VFREMP   C  DVFTY TLMDGLCKE R++EAV LLDEMQ+EGC P
Sbjct: 203  SMCKLGLVDRAVQVFREMPLRNCTPDVFTYSTLMDGLCKEKRIDEAVFLLDEMQLEGCIP 262

Query: 1035 NPATFNVLINGLCKKGDLSRAAKVVDNMFLKGCVPNEVTYNTLIHGLCLQGKLEKAISLL 1214
            +P TFNVLIN LCKKGDL RAAK+VDNM LKGCVPNEVTYNTLIHGLCL+GKL KA+SLL
Sbjct: 263  SPVTFNVLINALCKKGDLGRAAKLVDNMLLKGCVPNEVTYNTLIHGLCLKGKLAKAVSLL 322

Query: 1215 DRMVSDKFIPNDVTYGTIIDGLVRKGRAVDGVHVMVSMEERGHQGNDHIYSSLVSGLFKE 1394
            DRMVS+K +PNDVTYGTII+GLV++GRAVDG  V++SMEERG+  N++IYS LVSGLFKE
Sbjct: 323  DRMVSNKCVPNDVTYGTIINGLVKRGRAVDGARVLMSMEERGNHANEYIYSVLVSGLFKE 382

Query: 1395 GRSEEALNLWKKIMENGHKPNTVVYSALIDGLCRVGKPFEAKEILPEMVNKGCEPNAYTY 1574
            G+SE+A+ LWK+++E G KPNT+ YS LI+GLC  GKP EAKE+  EMV+ GC PN++TY
Sbjct: 383  GKSEDAMRLWKEMLEKGCKPNTIAYSTLINGLCGEGKPDEAKEVFSEMVSNGCMPNSFTY 442

Query: 1575 SSLMKGFFKVGNSNMAVLLWKEMTEKGCVHNEFCYSILIHGLCDEGKLKEATMVWRHMLG 1754
            SSLM+GFF+ G S  A+LLWKEM     + NE CYS+LIHGLC++G+L EA + W+ MLG
Sbjct: 443  SSLMRGFFQTGQSQKAILLWKEMANN--MRNEVCYSVLIHGLCEDGQLNEALIAWQQMLG 500

Query: 1755 KGWTPDVVAYTSMIHGLCSVGSVEQGLLLFNEMLYKGSDSQPDVFTYNVLFNALCKHEKI 1934
            +G+ PDVVAY+SMIHGLC+ G VEQGL LFNEML +  + QPDV TYN+LFN  CK   I
Sbjct: 501  RGYKPDVVAYSSMIHGLCNAGLVEQGLKLFNEMLCQEPECQPDVITYNILFNVFCKQSSI 560

Query: 1935 TPAIHLLNNMLDRGCDPDIVTCNIFLTTFKEKMNPSQDGREFLDELFLRLHKRQRIDGAS 2114
            + AI  LN MLDRGCDPD VTC+IFL + +E+++P QDGREFL+EL +RL K+QRI GAS
Sbjct: 561  SLAIDHLNRMLDRGCDPDSVTCDIFLRSLRERLDPPQDGREFLNELVVRLFKQQRIVGAS 620

Query: 2115 RIIEVMLQKFLYPKPSTWEIAIRELCKPRKIQVAINKCWNSLF 2243
             I+EVMLQKFL PK STW   ++ELCKP+ ++ AI+KCW+SL+
Sbjct: 621  IIVEVMLQKFLPPKASTWTRVVQELCKPKMVRAAIDKCWSSLY 663


>ref|XP_006468575.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20090-like
            [Citrus sinensis]
          Length = 664

 Score =  869 bits (2246), Expect = 0.0
 Identities = 413/618 (66%), Positives = 505/618 (81%), Gaps = 2/618 (0%)
 Frame = +3

Query: 396  STNSHMLPKRSHKVEVEPPITDRLFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMV 575
            S+N  M  +     + E P +D +F   PK GSY+ GDSTFYSLI++YANSGDFKSLEMV
Sbjct: 46   SSNKQMETEPQGNAKSEQPFSDEIFNSTPKLGSYQLGDSTFYSLIQHYANSGDFKSLEMV 105

Query: 576  FSRMWHERRAFVEKSFIVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVII 755
              RM  E+R  +EKSFI +F+AYGKAHL E+A+ LF  MV EF CKRTVKSFNSVLNVII
Sbjct: 106  LYRMRREKRVVLEKSFIFIFKAYGKAHLVEEAIRLFHTMVDEFHCKRTVKSFNSVLNVII 165

Query: 756  QEGLYSRALEFHSYVVNCK--NIKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDA 929
            QEGLY RALEF++++VN K  NI PN LTFNLVIK +C+L LVD A+++FREMP   C+ 
Sbjct: 166  QEGLYHRALEFYNHIVNAKHMNILPNTLTFNLVIKTVCRLGLVDNAIQLFREMPVRNCEP 225

Query: 930  DVFTYCTLMDGLCKEDRVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVV 1109
            D++TYCTLMDGLCKE+R++EAV LLDEMQ++GCFP P TFNVLINGLCK G+L RAAK+V
Sbjct: 226  DIYTYCTLMDGLCKENRLDEAVLLLDEMQVDGCFPTPVTFNVLINGLCKNGELGRAAKLV 285

Query: 1110 DNMFLKGCVPNEVTYNTLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRK 1289
            DNMFLKGC+PNEVTYNTLIHGLCL+G L+KA+SLLDRMV+ K +PN+VTYGTII+GLV+ 
Sbjct: 286  DNMFLKGCLPNEVTYNTLIHGLCLKGNLDKAVSLLDRMVASKCMPNEVTYGTIINGLVKL 345

Query: 1290 GRAVDGVHVMVSMEERGHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVY 1469
            GRAVDG  V++SMEER    N++IYS+L+SGLFKEG++E+A+ LWK++ME G KPNTVVY
Sbjct: 346  GRAVDGARVLMSMEERKFHVNEYIYSTLISGLFKEGKAEDAMKLWKQMMEKGCKPNTVVY 405

Query: 1470 SALIDGLCRVGKPFEAKEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTE 1649
            SALIDGLCRVGKP EA+EIL EM+N GC  NA+TYSSLMKGFF+ G  + AV +WK+M +
Sbjct: 406  SALIDGLCRVGKPDEAEEILFEMINNGCAANAFTYSSLMKGFFESGKGHKAVEIWKDMAK 465

Query: 1650 KGCVHNEFCYSILIHGLCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQ 1829
              CV+NE CYS+LIHGLC++GKL+EA MVW  ML +G  PDVVAY+SMIHGLC+ GSVE+
Sbjct: 466  NNCVYNEVCYSVLIHGLCEDGKLREARMVWTQMLSRGCKPDVVAYSSMIHGLCNAGSVEE 525

Query: 1830 GLLLFNEMLYKGSDSQPDVFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDIVTCNIF 2009
             L LFNEML     SQPDVFTYN+L NALCK   I+ +I LLN+M+DRGCDPD+VTCNIF
Sbjct: 526  ALKLFNEMLCLEPKSQPDVFTYNILLNALCKQSNISHSIDLLNSMMDRGCDPDLVTCNIF 585

Query: 2010 LTTFKEKMNPSQDGREFLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIREL 2189
            LT  KEK+   QDG +FL+EL +RL KRQR  G  +I+EVMLQKFL P+ STWE  ++EL
Sbjct: 586  LTALKEKLEAPQDGTDFLNELAIRLFKRQRTSGGFKIVEVMLQKFLSPQTSTWERVVQEL 645

Query: 2190 CKPRKIQVAINKCWNSLF 2243
            C+P++IQ AINKCW++L+
Sbjct: 646  CRPKRIQAAINKCWSNLY 663


>ref|XP_002304600.2| hypothetical protein POPTR_0003s15360g [Populus trichocarpa]
            gi|550343237|gb|EEE79579.2| hypothetical protein
            POPTR_0003s15360g [Populus trichocarpa]
          Length = 672

 Score =  867 bits (2241), Expect = 0.0
 Identities = 417/609 (68%), Positives = 499/609 (81%), Gaps = 2/609 (0%)
 Frame = +3

Query: 423  RSHKVEVEPPITDRLFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMVFSRMWHERR 602
            R H +E +PPI+D++FK  PK GSYK GDSTFYSLI+NYAN GDFKSLE V  RM  E+R
Sbjct: 63   REHGIEHDPPISDKIFKSGPKMGSYKLGDSTFYSLIDNYANLGDFKSLEKVLDRMRCEKR 122

Query: 603  AFVEKSFIVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVIIQEGLYSRAL 782
              VEK F+V+F+AYGKAHLPEKAV LFDRM YEF+CKRTVKSFNSVLNVIIQEGL+ RAL
Sbjct: 123  VVVEKCFVVIFKAYGKAHLPEKAVGLFDRMAYEFECKRTVKSFNSVLNVIIQEGLFYRAL 182

Query: 783  EFHSYVVNCK--NIKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDADVFTYCTLM 956
            EF+++V+  K  NI PNVLTFNLVIK MCK+ LVD AV++FR+MP  KC  DV+TYCTLM
Sbjct: 183  EFYNHVIGAKGVNISPNVLTFNLVIKTMCKVGLVDDAVQMFRDMPVSKCQPDVYTYCTLM 242

Query: 957  DGLCKEDRVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVVDNMFLKGCV 1136
            DGLCK DR++EAV+LLDEMQI+GCFP+P TFNVLINGLCKKGDL+R AK+VDNMFLKGC 
Sbjct: 243  DGLCKADRIDEAVSLLDEMQIDGCFPSPVTFNVLINGLCKKGDLARVAKLVDNMFLKGCA 302

Query: 1137 PNEVTYNTLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRKGRAVDGVHV 1316
            PNEVTYNTLIHGLCL+GKLEKAISLLDRMVS K +PN VTYGTII+GLV++GRA+DG  V
Sbjct: 303  PNEVTYNTLIHGLCLKGKLEKAISLLDRMVSSKCVPNVVTYGTIINGLVKQGRALDGARV 362

Query: 1317 MVSMEERGHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVYSALIDGLCR 1496
            +  MEERG+  N+++YS+L+SGLFKEG+S+EA+ L+K++     + NT+VYSA+IDGLCR
Sbjct: 363  LALMEERGYHVNEYVYSALISGLFKEGKSQEAMQLFKEMTVKECELNTIVYSAVIDGLCR 422

Query: 1497 VGKPFEAKEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTEKGCVHNEFC 1676
             GKP EA E+L EM N  C+PNAYTYSSLMKGFF+ GN + A+ +WK+M +     NE C
Sbjct: 423  DGKPDEALEVLSEMTNNRCKPNAYTYSSLMKGFFEAGNGHKAIEMWKDMAKHNFTQNEVC 482

Query: 1677 YSILIHGLCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQGLLLFNEML 1856
            YS+LIHGLC +GK+KEA MVW  MLGKG  PDVVAY SMI+GL + G VE  L L+NEML
Sbjct: 483  YSVLIHGLCKDGKVKEAMMVWAQMLGKGCKPDVVAYGSMINGLSNAGLVEDALQLYNEML 542

Query: 1857 YKGSDSQPDVFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDIVTCNIFLTTFKEKMN 2036
             +  DSQPDV TYN+L NALCK   I+ AI LLN+MLDRGCDPD+VTC IFL T +EK++
Sbjct: 543  CQEPDSQPDVVTYNILLNALCKQSSISRAIDLLNSMLDRGCDPDLVTCIIFLRTLREKLD 602

Query: 2037 PSQDGREFLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIRELCKPRKIQVA 2216
            P QDGREFLD L +RL KRQR+ GAS+I+EVMLQK L PKPSTW   + +LC P+K+Q A
Sbjct: 603  PPQDGREFLDGLVVRLLKRQRVLGASKIVEVMLQKLLPPKPSTWTRVVEDLCNPKKVQAA 662

Query: 2217 INKCWNSLF 2243
            I KCW+ L+
Sbjct: 663  IQKCWSILY 671


>gb|EOX96827.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao]
          Length = 636

 Score =  862 bits (2227), Expect = 0.0
 Identities = 405/600 (67%), Positives = 498/600 (83%), Gaps = 2/600 (0%)
 Frame = +3

Query: 450  PITDRLFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMVFSRMWHERRAFVEKSFIV 629
            P++D+LF  AP+SGS++ GDST YSLI +YA+  DF SL  V  RM  + R F+EK F++
Sbjct: 36   PLSDQLFNSAPQSGSFRLGDSTCYSLIHHYAHKVDFASLHDVLCRMKLQNRVFIEKYFLL 95

Query: 630  VFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVIIQEGLYSRALEFHSYVVNC 809
            +F+AYG+AHLPEKAV+LF RM +EF CK TVKSFNSVLNVIIQEG Y RA +F++  V+ 
Sbjct: 96   IFKAYGRAHLPEKAVDLFHRMPHEFHCKPTVKSFNSVLNVIIQEGFYHRAFDFYNCSVSA 155

Query: 810  KN--IKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDADVFTYCTLMDGLCKEDRV 983
            KN  I PNVLTFNL++KAMCKL  VDRA+EVFREMP  KC  DV+TYCTLMDGLCKEDR+
Sbjct: 156  KNTNISPNVLTFNLLLKAMCKLGWVDRAIEVFREMPLRKCAPDVYTYCTLMDGLCKEDRI 215

Query: 984  EEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVVDNMFLKGCVPNEVTYNTL 1163
            +EAV+LLDEMQ EGCFP P TFNVLINGLCKKGDL+RAAK+VDNMFLKGC+PN+VTYNTL
Sbjct: 216  DEAVSLLDEMQTEGCFPTPVTFNVLINGLCKKGDLARAAKLVDNMFLKGCLPNQVTYNTL 275

Query: 1164 IHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRKGRAVDGVHVMVSMEERGH 1343
            IHGLCL+GKL+KA+ LLDRMVS   IPND+TYGTI++GLV++GR  D V ++VSMEERG+
Sbjct: 276  IHGLCLKGKLDKAVILLDRMVSSNCIPNDITYGTIVNGLVKQGRVEDAVMLVVSMEERGY 335

Query: 1344 QGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVYSALIDGLCRVGKPFEAKE 1523
              N+++YS+L+SGLFK G+SEEA+  W ++ME G+KPNTVVYS+LIDGLCR GKP EA+E
Sbjct: 336  GVNEYVYSALISGLFKGGKSEEAMKRWTEMMEKGYKPNTVVYSSLIDGLCREGKPNEAEE 395

Query: 1524 ILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTEKGCVHNEFCYSILIHGLC 1703
            +L EM+ KGC PNAYTYSSLMKGFFK GN + AV +WK+M E  C+H++ CYS+LIHGLC
Sbjct: 396  VLSEMIEKGCIPNAYTYSSLMKGFFKTGNCHKAVQVWKDMAEHKCIHSQVCYSVLIHGLC 455

Query: 1704 DEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQGLLLFNEMLYKGSDSQPD 1883
            ++G L EA M WRHML KG  PD VAY+SMI GLC+ GS+E+ L LFNEMLY+ ++SQPD
Sbjct: 456  EDGNLSEAMMAWRHMLDKGCKPDAVAYSSMIQGLCNAGSLEEALKLFNEMLYQEAESQPD 515

Query: 1884 VFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDIVTCNIFLTTFKEKMNPSQDGREFL 2063
            V TYN+LFNALC  + I+ A+ LLN+MLD+ CDPDI TCNIFL T +EK++P QDGREFL
Sbjct: 516  VITYNILFNALCNQKSISHAVDLLNSMLDQACDPDIATCNIFLRTLREKVDPPQDGREFL 575

Query: 2064 DELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIRELCKPRKIQVAINKCWNSLF 2243
            DEL +RL KRQR+ GAS+I++VMLQKFL PK STW   + ELCKP+KIQ AI+KCW +++
Sbjct: 576  DELVIRLFKRQRVFGASKIVQVMLQKFLPPKASTWARVVEELCKPKKIQAAIDKCWRNIY 635


>ref|XP_002297917.1| hypothetical protein POPTR_0001s12190g [Populus trichocarpa]
            gi|222845175|gb|EEE82722.1| hypothetical protein
            POPTR_0001s12190g [Populus trichocarpa]
          Length = 670

 Score =  862 bits (2226), Expect = 0.0
 Identities = 431/686 (62%), Positives = 527/686 (76%), Gaps = 3/686 (0%)
 Frame = +3

Query: 195  CIPFVEKVLSVLVIPMLAFTSKSARLILTSNPCKSSFIFLIHCPFSALPNN-SCETEKDY 371
            C PF    +   +  + +F SK   L + SN       F  H    A+P+  + ETE   
Sbjct: 4    CQPFNTNSILKALNNLFSFPSKFLSLSMHSN-------FSAH----AIPSTKTIETEP-- 50

Query: 372  EEDIIAIQSTNSHMLPKRSHKVEVEPPITDRLFKHAPKSGSYKQGDSTFYSLIENYANSG 551
                  +  T       + + +E +PPI+D++FK  PK GSY+ GDSTFYSLI NYAN G
Sbjct: 51   ------LNHTQHCNTTDQENGIEPDPPISDKIFKSGPKMGSYRLGDSTFYSLINNYANLG 104

Query: 552  DFKSLEMVFSRMWHERRAFVEKSFIVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSF 731
            DFKSLE V  RM  E+R   EK FIV+F+AYGKAHLPEKAV+LFDRM  EF+CKRT KSF
Sbjct: 105  DFKSLEKVLDRMKCEKRVIFEKCFIVIFKAYGKAHLPEKAVDLFDRMACEFECKRTGKSF 164

Query: 732  NSVLNVIIQEGLYSRALEFHSYVVNCK--NIKPNVLTFNLVIKAMCKLQLVDRAVEVFRE 905
            NSVLNVIIQEGL+ RALEF+++V+  K  +I PNVLTFNLVIKAMCK+ LVD A++VFR+
Sbjct: 165  NSVLNVIIQEGLFHRALEFYNHVIGAKGVSISPNVLTFNLVIKAMCKVGLVDDAIQVFRD 224

Query: 906  MPAMKCDADVFTYCTLMDGLCKEDRVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGD 1085
            M   KC+ DV+TYCTLMDGLCK DR++EAV+LLDEMQI+GCFP+P TFNVLINGLCKKGD
Sbjct: 225  MTIRKCEPDVYTYCTLMDGLCKADRIDEAVSLLDEMQIDGCFPSPVTFNVLINGLCKKGD 284

Query: 1086 LSRAAKVVDNMFLKGCVPNEVTYNTLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGT 1265
            LSRAAK+VDNMFLKGC+PNEVTYNTLIHGLCL+GKLEKAISLLDRMVS K +PN VTYGT
Sbjct: 285  LSRAAKLVDNMFLKGCIPNEVTYNTLIHGLCLKGKLEKAISLLDRMVSSKCVPNVVTYGT 344

Query: 1266 IIDGLVRKGRAVDGVHVMVSMEERGHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENG 1445
            II+GLV++GRA+DG  V+  MEERG+  N+++YS+L+SGLFKEG+S+EA++L+K++   G
Sbjct: 345  IINGLVKQGRALDGACVLALMEERGYCVNEYVYSTLISGLFKEGKSQEAMHLFKEMTVKG 404

Query: 1446 HKPNTVVYSALIDGLCRVGKPFEAKEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAV 1625
            ++ NT+VYSA+IDGLCR GKP +A E+L EM NKGC PNAYT SSLMKGFF+ GNS+ AV
Sbjct: 405  YELNTIVYSAVIDGLCRDGKPDDAVEVLSEMTNKGCTPNAYTCSSLMKGFFEAGNSHRAV 464

Query: 1626 LLWKEMTEKGCVHNEFCYSILIHGLCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGL 1805
             +WK+M +     NE CYS+LIHGLC +GK+KEA MVW  MLGKG  PDVVAY+SMI+GL
Sbjct: 465  EVWKDMAKHNFTQNEVCYSVLIHGLCKDGKVKEAMMVWTQMLGKGCKPDVVAYSSMINGL 524

Query: 1806 CSVGSVEQGLLLFNEMLYKGSDSQPDVFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDP 1985
               G VE  + L+NEML +G DSQPDV TYN+L N LCK   I+ AI LLN+MLDRGCDP
Sbjct: 525  SIAGLVEDAMQLYNEMLCQGPDSQPDVVTYNILLNTLCKQSSISRAIDLLNSMLDRGCDP 584

Query: 1986 DIVTCNIFLTTFKEKMNPSQDGREFLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPST 2165
            D+VTC IFL   +EK++P QDGREFLDEL +RL KRQR+ GAS+I+EVMLQK L PK ST
Sbjct: 585  DLVTCTIFLRMLREKLDPPQDGREFLDELVVRLLKRQRVLGASKIVEVMLQKLLPPKHST 644

Query: 2166 WEIAIRELCKPRKIQVAINKCWNSLF 2243
            W   +  LCKP+K+Q  I KCW+ L+
Sbjct: 645  WARVVENLCKPKKVQAVIQKCWSILY 670


>ref|XP_004295517.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20090-like
            [Fragaria vesca subsp. vesca]
          Length = 647

 Score =  855 bits (2209), Expect = 0.0
 Identities = 414/604 (68%), Positives = 494/604 (81%), Gaps = 2/604 (0%)
 Frame = +3

Query: 438  EVEPPITDRLFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMVFSRMWHERRAFVEK 617
            E +PPI++ +F+  P  G+YK GDSTFYSLIENYA+ GDF SLE V  RM  ERR FVE 
Sbjct: 43   EPDPPISEEIFRKGPNFGAYKSGDSTFYSLIENYASLGDFGSLEKVLDRMKRERRVFVEG 102

Query: 618  SFIVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVIIQEGLYSRALEFHSY 797
            SFI VFRA+GKAHLP +AV+LF RMV EFQC+RTVKSFNSVLNVI+QEG Y+ ALEF+ +
Sbjct: 103  SFIAVFRAFGKAHLPNQAVDLFHRMVDEFQCRRTVKSFNSVLNVIVQEGHYAHALEFYDH 162

Query: 798  VVNCK--NIKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDADVFTYCTLMDGLCK 971
            VV  +  NI PNVL++NL+IKA+C+  LVD+AVE FREMP   C  DVFTYCTLMDGLCK
Sbjct: 163  VVGDRSMNISPNVLSYNLIIKALCRFGLVDKAVEKFREMPVRDCAPDVFTYCTLMDGLCK 222

Query: 972  EDRVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVVDNMFLKGCVPNEVT 1151
             +RV+EAV LLDEMQIEGC P+PA FNVLI+ +CKKGDL RAAK+VDNMFLKGCVPNEVT
Sbjct: 223  VNRVDEAVFLLDEMQIEGCSPSPAAFNVLIDAVCKKGDLGRAAKLVDNMFLKGCVPNEVT 282

Query: 1152 YNTLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRKGRAVDGVHVMVSME 1331
            YNTLIHGLCLQGKLEKAISLLDRMV +K +PNDVTYGTII+GLV++GR++DGV V++SME
Sbjct: 283  YNTLIHGLCLQGKLEKAISLLDRMVLNKCVPNDVTYGTIINGLVKQGRSLDGVRVLISME 342

Query: 1332 ERGHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVYSALIDGLCRVGKPF 1511
            ERG + N++IYS LVSGLFKEG+SEEA+ LWK++ME G KPNTVVYSALIDGLC  GKP 
Sbjct: 343  ERGRRANEYIYSVLVSGLFKEGKSEEAMKLWKEMMEKGCKPNTVVYSALIDGLCLDGKPD 402

Query: 1512 EAKEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTEKGCVHNEFCYSILI 1691
            EAKE+  EMV  GC PN+Y YSSLM+GFF+ G S  A+LLWKEM     V NE CYS++I
Sbjct: 403  EAKEVFCEMVRNGCMPNSYAYSSLMRGFFRTGQSQKAILLWKEMAANNVVRNEVCYSVII 462

Query: 1692 HGLCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQGLLLFNEMLYKGSD 1871
             G C EGK+KEA MVW+ +L +G+  DVVAY+SMIHGLC+ G VEQGL LFN+ML +  +
Sbjct: 463  DGFCKEGKVKEALMVWKQILARGYKLDVVAYSSMIHGLCNDGLVEQGLKLFNDMLSQEPE 522

Query: 1872 SQPDVFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDIVTCNIFLTTFKEKMNPSQDG 2051
             QPDV TYN+L NALCK   I+ AI LLN+MLD GCDPD+VTC+IFLTT  EK++P QDG
Sbjct: 523  CQPDVITYNILLNALCKQHTISRAIDLLNSMLDHGCDPDLVTCDIFLTTLGEKLDPPQDG 582

Query: 2052 REFLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIRELCKPRKIQVAINKCW 2231
            REFL+EL +RL KRQR  GA RI+EVML+KFL P   TW   ++ELCKP+K++ AI+KCW
Sbjct: 583  REFLNELVVRLFKRQRTVGAFRIVEVMLKKFLPPTACTWTTVVQELCKPKKVRAAIDKCW 642

Query: 2232 NSLF 2243
            +SL+
Sbjct: 643  SSLY 646


>ref|XP_002528143.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223532441|gb|EEF34234.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 653

 Score =  849 bits (2193), Expect = 0.0
 Identities = 411/600 (68%), Positives = 491/600 (81%), Gaps = 2/600 (0%)
 Frame = +3

Query: 450  PITDRLFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMVFSRMWHERRAFVEKSFIV 629
            PI+D++F   PK GS+K GDSTFYSLIENYA S DF SLE V +RM  E R F EKSF V
Sbjct: 53   PISDKIFSSPPKMGSFKVGDSTFYSLIENYAYSSDFNSLEKVLNRMRLENRVFSEKSFFV 112

Query: 630  VFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVIIQEGLYSRALEFHSYVVNC 809
            +F+AYGKAHLP KA+ELF RM +EF CK TVKSFNSVLNVIIQ G + RALEF+++VV  
Sbjct: 113  MFKAYGKAHLPNKAIELFYRMSFEFYCKPTVKSFNSVLNVIIQAGFHDRALEFYNHVVGA 172

Query: 810  K--NIKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDADVFTYCTLMDGLCKEDRV 983
            K  NI PNVL+FNL+IK+MCKL LVD A+E+FREMP  KC  D +TYCTLMDGLCK DR+
Sbjct: 173  KDMNILPNVLSFNLIIKSMCKLGLVDNAIELFREMPVRKCVPDAYTYCTLMDGLCKVDRI 232

Query: 984  EEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVVDNMFLKGCVPNEVTYNTL 1163
            +EAV+LLDEMQIEGCFP+PATFNVLINGLCKKGD +R  K+VDNMFLKGCVPNEVTYNTL
Sbjct: 233  DEAVSLLDEMQIEGCFPSPATFNVLINGLCKKGDFTRVTKLVDNMFLKGCVPNEVTYNTL 292

Query: 1164 IHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRKGRAVDGVHVMVSMEERGH 1343
            IHGLCL+GKL+KA+SLLDRMVS K +PN+VTYGTII+GLV++GRA+DG  V+V MEERG+
Sbjct: 293  IHGLCLKGKLDKALSLLDRMVSSKCVPNEVTYGTIINGLVKQGRALDGARVLVLMEERGY 352

Query: 1344 QGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVYSALIDGLCRVGKPFEAKE 1523
              N+++YS LVSGLFKEG+SEEA+ L+K+ M+ G K NTV+YSAL+DGLCR  KP EA +
Sbjct: 353  IVNEYVYSVLVSGLFKEGKSEEAMRLFKESMDKGCKLNTVLYSALVDGLCRDRKPDEAMK 412

Query: 1524 ILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTEKGCVHNEFCYSILIHGLC 1703
            IL EM +KGC PNA+T+SSLMKGFF+VGNS+ A+ +WK+MT+  C  NE CYS+LIHGLC
Sbjct: 413  ILSEMTDKGCAPNAFTFSSLMKGFFEVGNSHKAIEVWKDMTKINCAENEVCYSVLIHGLC 472

Query: 1704 DEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQGLLLFNEMLYKGSDSQPD 1883
             +GK+ EA MVW  ML  G  PDVVAY+SMI GLC  GSVE+ L L+NEML    DSQPD
Sbjct: 473  KDGKVMEAMMVWAKMLATGCRPDVVAYSSMIQGLCDAGSVEEALKLYNEMLCLEPDSQPD 532

Query: 1884 VFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDIVTCNIFLTTFKEKMNPSQDGREFL 2063
            V TYN+LFNALCK   I+ A+ LLN+MLDRGCDPD+VTCNIFL   +EK++P QDG +FL
Sbjct: 533  VITYNILFNALCKQSSISRAVDLLNSMLDRGCDPDLVTCNIFLRMLREKLDPPQDGAKFL 592

Query: 2064 DELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIRELCKPRKIQVAINKCWNSLF 2243
            DEL +RL KRQR  GAS+I+EVMLQKFL PK STW   + ELC+P+KIQ  I+KCW+ L+
Sbjct: 593  DELVVRLLKRQRNLGASKIVEVMLQKFLSPKASTWARVVHELCQPKKIQAVIDKCWSKLY 652


>ref|XP_002867892.1| EMB1025 [Arabidopsis lyrata subsp. lyrata]
            gi|297313728|gb|EFH44151.1| EMB1025 [Arabidopsis lyrata
            subsp. lyrata]
          Length = 658

 Score =  818 bits (2112), Expect = 0.0
 Identities = 411/665 (61%), Positives = 497/665 (74%), Gaps = 9/665 (1%)
 Frame = +3

Query: 273  ILTSNPCKSSFIFLIHCPFSAL-----PNNSCETEKDYEEDIIAIQSTNSHMLPKRSHKV 437
            +L+SNP K    F IH  FSA      PN S E E                         
Sbjct: 22   LLSSNPVK----FSIHLRFSASSVSVSPNPSMEVE------------------------T 53

Query: 438  EVEPPITDRLFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMVFSRMWHERRAFVEK 617
             +E PI++++FK APK GS+K GDST  S+IENYAN GDF S+E + SR+  E R  +E+
Sbjct: 54   PLEAPISEQMFKSAPKMGSFKLGDSTLSSMIENYANLGDFASVEKLLSRIRLENRVIIER 113

Query: 618  SFIVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVIIQEGLYSRALEFHSY 797
            SFIVVFRAYGKAHLPEKAV+LF RMV EF+CKR+VKSFNSVLNVII EGLY R LEF+ Y
Sbjct: 114  SFIVVFRAYGKAHLPEKAVDLFHRMVDEFRCKRSVKSFNSVLNVIINEGLYHRGLEFYDY 173

Query: 798  VVNCK---NIKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDADVFTYCTLMDGLC 968
            VVN     NI PN L+FNLVIKA+CKL  VDRA+EVFR MP  KC  D +TYCTLMDGLC
Sbjct: 174  VVNSNMNMNISPNGLSFNLVIKALCKLGFVDRAIEVFRGMPEKKCLPDGYTYCTLMDGLC 233

Query: 969  KEDRVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVVDNMFLKGCVPNEV 1148
            KE+R++EAV LLDEMQ EGC P+P  +NVLI+GLCKKGDLSR  K+VDNMFLKGC PNEV
Sbjct: 234  KEERIDEAVLLLDEMQSEGCSPSPVIYNVLIDGLCKKGDLSRVTKLVDNMFLKGCFPNEV 293

Query: 1149 TYNTLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRKGRAVDGVHVMVSM 1328
            TYNTLIHGLCL+GKL+KA+SLL+RMVS K IPNDVTYGT+I+GLV++ RA+DG  +++SM
Sbjct: 294  TYNTLIHGLCLKGKLDKAVSLLERMVSSKCIPNDVTYGTLINGLVKQRRAMDGARLLISM 353

Query: 1329 EERGHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVYSALIDGLCRVGKP 1508
            EERG++ N HIYS L+SGLFKEG++EEA+ LWKK+ E G +PN VVYSA+IDGLCR GKP
Sbjct: 354  EERGYRLNQHIYSVLISGLFKEGKAEEAMTLWKKMAEKGCRPNIVVYSAVIDGLCREGKP 413

Query: 1509 FEAKEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTEKGCVHNEFCYSIL 1688
             EAKEIL  M++ GC PN YTYSSLMKGFFK G S  A+ +W+EM E GC  NEFCYS+L
Sbjct: 414  NEAKEILNGMISSGCLPNVYTYSSLMKGFFKTGLSEEAIQVWREMDETGCSRNEFCYSVL 473

Query: 1689 IHGLCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQGLLLFNEML-YKG 1865
            I GLC  G++KEA MVW  ML  G  PD VAY+SMI GLC +GS++  L L++EML  + 
Sbjct: 474  IDGLCGVGRVKEAMMVWSKMLTIGIKPDTVAYSSMIKGLCGIGSMDAALKLYHEMLCQEE 533

Query: 1866 SDSQPDVFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDIVTCNIFLTTFKEKMNPSQ 2045
              SQPDV TYN+L + LC  + ++ A+ LLN MLDRGCDPD++TCN FL T  EK +  +
Sbjct: 534  PKSQPDVVTYNILLDGLCMQKDVSRAVDLLNCMLDRGCDPDVITCNTFLNTLSEKSDSCE 593

Query: 2046 DGREFLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIRELCKPRKIQVAINK 2225
            +GR FL+EL  RL KRQR+ GA +I+EVML K+L PK STW + + E+CKP+KI  AINK
Sbjct: 594  EGRSFLEELVARLLKRQRVSGACKIVEVMLGKYLAPKTSTWAMIVPEICKPKKINAAINK 653

Query: 2226 CWNSL 2240
            CW +L
Sbjct: 654  CWRNL 658


>ref|NP_193742.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75098720|sp|O49436.1|PP327_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g20090; AltName: Full=Protein EMBRYO DEFECTIVE 1025
            gi|2827663|emb|CAA16617.1| membrane-associated
            salt-inducible-like protein [Arabidopsis thaliana]
            gi|7268804|emb|CAB79009.1| membrane-associated
            salt-inducible-like protein [Arabidopsis thaliana]
            gi|58013024|gb|AAW62965.1| embryo-defective 1025
            [Arabidopsis thaliana] gi|332658871|gb|AEE84271.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 660

 Score =  812 bits (2098), Expect = 0.0
 Identities = 408/673 (60%), Positives = 498/673 (73%), Gaps = 4/673 (0%)
 Frame = +3

Query: 234  IPMLAFTSKSARLILTSNPCKSSFIFLIHCPFSALPNNSCETEKDYEEDIIAIQSTNSHM 413
            I   ++  K +R IL+SNP   S         S  PN S E  ++               
Sbjct: 10   ISFFSYFLKESR-ILSSNPVNFSIHLRFSSSVSVSPNPSMEVVEN--------------- 53

Query: 414  LPKRSHKVEVEPPITDRLFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMVFSRMWH 593
                     +E PI++++FK APK GS+K GDST  S+IE+YANSGDF S+E + SR+  
Sbjct: 54   --------PLEAPISEKMFKSAPKMGSFKLGDSTLSSMIESYANSGDFDSVEKLLSRIRL 105

Query: 594  ERRAFVEKSFIVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVIIQEGLYS 773
            E R  +E+SFIVVFRAYGKAHLP+KAV+LF RMV EF+CKR+VKSFNSVLNVII EGLY 
Sbjct: 106  ENRVIIERSFIVVFRAYGKAHLPDKAVDLFHRMVDEFRCKRSVKSFNSVLNVIINEGLYH 165

Query: 774  RALEFHSYVVNCK---NIKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDADVFTY 944
            R LEF+ YVVN     NI PN L+FNLVIKA+CKL+ VDRA+EVFR MP  KC  D +TY
Sbjct: 166  RGLEFYDYVVNSNMNMNISPNGLSFNLVIKALCKLRFVDRAIEVFRGMPERKCLPDGYTY 225

Query: 945  CTLMDGLCKEDRVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVVDNMFL 1124
            CTLMDGLCKE+R++EAV LLDEMQ EGC P+P  +NVLI+GLCKKGDL+R  K+VDNMFL
Sbjct: 226  CTLMDGLCKEERIDEAVLLLDEMQSEGCSPSPVIYNVLIDGLCKKGDLTRVTKLVDNMFL 285

Query: 1125 KGCVPNEVTYNTLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRKGRAVD 1304
            KGCVPNEVTYNTLIHGLCL+GKL+KA+SLL+RMVS K IPNDVTYGT+I+GLV++ RA D
Sbjct: 286  KGCVPNEVTYNTLIHGLCLKGKLDKAVSLLERMVSSKCIPNDVTYGTLINGLVKQRRATD 345

Query: 1305 GVHVMVSMEERGHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVYSALID 1484
             V ++ SMEERG+  N HIYS L+SGLFKEG++EEA++LW+K+ E G KPN VVYS L+D
Sbjct: 346  AVRLLSSMEERGYHLNQHIYSVLISGLFKEGKAEEAMSLWRKMAEKGCKPNIVVYSVLVD 405

Query: 1485 GLCRVGKPFEAKEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTEKGCVH 1664
            GLCR GKP EAKEIL  M+  GC PNAYTYSSLMKGFFK G    AV +WKEM + GC  
Sbjct: 406  GLCREGKPNEAKEILNRMIASGCLPNAYTYSSLMKGFFKTGLCEEAVQVWKEMDKTGCSR 465

Query: 1665 NEFCYSILIHGLCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQGLLLF 1844
            N+FCYS+LI GLC  G++KEA MVW  ML  G  PD VAY+S+I GLC +GS++  L L+
Sbjct: 466  NKFCYSVLIDGLCGVGRVKEAMMVWSKMLTIGIKPDTVAYSSIIKGLCGIGSMDAALKLY 525

Query: 1845 NEML-YKGSDSQPDVFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDIVTCNIFLTTF 2021
            +EML  +   SQPDV TYN+L + LC  + I+ A+ LLN+MLDRGCDPD++TCN FL T 
Sbjct: 526  HEMLCQEEPKSQPDVVTYNILLDGLCMQKDISRAVDLLNSMLDRGCDPDVITCNTFLNTL 585

Query: 2022 KEKMNPSQDGREFLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIRELCKPR 2201
             EK N    GR FL+EL +RL KRQR+ GA  I+EVML K+L PK STW + +RE+CKP+
Sbjct: 586  SEKSNSCDKGRSFLEELVVRLLKRQRVSGACTIVEVMLGKYLAPKTSTWAMIVREICKPK 645

Query: 2202 KIQVAINKCWNSL 2240
            KI  AI+KCW +L
Sbjct: 646  KINAAIDKCWRNL 658


>ref|XP_006283284.1| hypothetical protein CARUB_v10004320mg [Capsella rubella]
            gi|482551989|gb|EOA16182.1| hypothetical protein
            CARUB_v10004320mg [Capsella rubella]
          Length = 660

 Score =  808 bits (2086), Expect = 0.0
 Identities = 397/621 (63%), Positives = 485/621 (78%), Gaps = 6/621 (0%)
 Frame = +3

Query: 396  STNSHMLPKRSHKVE--VEPPITDRLFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLE 569
            S++  + P  S +VE   E PI++ +FK APK GSYK GDST  S+IENYANSGDF S+E
Sbjct: 38   SSSVSVSPDPSMEVENPSEAPISENMFKSAPKMGSYKLGDSTLSSMIENYANSGDFASVE 97

Query: 570  MVFSRMWHERRAFVEKSFIVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNV 749
             V SR+  E R   E SFIVVFRAYGKAHLP KAV+LF RMV EFQCKR+VKSFNSVLNV
Sbjct: 98   QVLSRVRLENRVISEHSFIVVFRAYGKAHLPGKAVDLFHRMVDEFQCKRSVKSFNSVLNV 157

Query: 750  IIQEGLYSRALEFHSYVVNCK---NIKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMK 920
            I+ EGLY R LEF+ YVVN     NI PN L+FNLVIKA+CKL  V++A+EVFREMP  K
Sbjct: 158  ILNEGLYHRGLEFYDYVVNSNMNMNIAPNGLSFNLVIKALCKLGFVNKAIEVFREMPEKK 217

Query: 921  CDADVFTYCTLMDGLCKEDRVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAA 1100
            C  D +TYCTLMDGLCKE+R++EAV LLDEMQ EGC P+  T+NVLI+GLCKKGDL+R  
Sbjct: 218  CLPDGYTYCTLMDGLCKEERIDEAVLLLDEMQSEGCSPSSVTYNVLIDGLCKKGDLTRVT 277

Query: 1101 KVVDNMFLKGCVPNEVTYNTLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGL 1280
            K+VDNMFLKGCVPNEVTYNTLIHGLCL+GKL KA+SLL+RMVS K IPNDVTYGT+I+GL
Sbjct: 278  KLVDNMFLKGCVPNEVTYNTLIHGLCLKGKLNKAVSLLERMVSSKCIPNDVTYGTLINGL 337

Query: 1281 VRKGRAVDGVHVMVSMEERGHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNT 1460
            V++ RA D V +++SMEERG+  N HIYS L+SGLFKEG++EEA+ LWKK++E G +PN 
Sbjct: 338  VKQRRATDAVRLLISMEERGYCLNQHIYSVLISGLFKEGKAEEAMTLWKKMVEKGCRPNI 397

Query: 1461 VVYSALIDGLCRVGKPFEAKEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKE 1640
            VVYSAL+DGLCR GKP EAKEI   M++ GC PNAYTYSSLMKGFF+ G S  A+ +W+E
Sbjct: 398  VVYSALVDGLCREGKPNEAKEIFRGMISNGCLPNAYTYSSLMKGFFRTGLSEEAIQVWRE 457

Query: 1641 MTEKGCVHNEFCYSILIHGLCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGS 1820
            M + GC  NEFCYS+LI GLC  G++ EA M+W  ML  G  PD VAY+SMI GLC +GS
Sbjct: 458  MDDTGCSRNEFCYSVLIDGLCGIGRVNEAMMLWSKMLTIGIKPDTVAYSSMIKGLCGIGS 517

Query: 1821 VEQGLLLFNEMLYKGS-DSQPDVFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDIVT 1997
            ++  L L++EML +    SQPD+ TYN+LF+ LC  + ++ A+ LLN MLDRGCDPD++T
Sbjct: 518  MDAALKLYHEMLCEEEPKSQPDIVTYNILFDGLCMQKDVSRAVDLLNFMLDRGCDPDVIT 577

Query: 1998 CNIFLTTFKEKMNPSQDGREFLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIA 2177
            CN FL T  EK +  ++GR FL+EL LRL KRQR+ GA +I+EVML K+L PK STW + 
Sbjct: 578  CNTFLKTLSEKSDSCEEGRNFLEELVLRLLKRQRVSGACKIVEVMLDKYLTPKISTWVLI 637

Query: 2178 IRELCKPRKIQVAINKCWNSL 2240
            + E+CKP+KI  AI+KCW +L
Sbjct: 638  VPEICKPKKINAAIDKCWRNL 658


>ref|XP_003534864.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20090-like
            isoform X1 [Glycine max] gi|571476386|ref|XP_006586943.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g20090-like isoform X2 [Glycine max]
            gi|571476388|ref|XP_006586944.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g20090-like isoform X3 [Glycine max]
            gi|571476390|ref|XP_006586945.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g20090-like isoform X4 [Glycine max]
            gi|571476393|ref|XP_006586946.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g20090-like isoform X5 [Glycine max]
            gi|571476395|ref|XP_006586947.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g20090-like isoform X6 [Glycine max]
          Length = 642

 Score =  802 bits (2071), Expect = 0.0
 Identities = 394/641 (61%), Positives = 499/641 (77%), Gaps = 4/641 (0%)
 Frame = +3

Query: 330  SALPNNSCET--EKDYEEDIIAIQSTNSHMLPKRSHKVEVEPPITDRLFKHAPKSGSYKQ 503
            S+ P N   T   + + + +I + S +S      SHK    P  +  +FK   + GSYK 
Sbjct: 9    SSFPTNLLRTTLHRYFSQTLITLPSYSSS-----SHK----PHPSSEIFKSGTQMGSYKL 59

Query: 504  GDSTFYSLIENYANSGDFKSLEMVFSRMWHERRAFVEKSFIVVFRAYGKAHLPEKAVELF 683
            GD +FYSLIE++A+S DF+SLE V  +M  ERR F+EK+FIV+F+AYGKAHLPEKAV+LF
Sbjct: 60   GDLSFYSLIESHASSLDFRSLEEVLHQMKRERRVFLEKNFIVMFKAYGKAHLPEKAVDLF 119

Query: 684  DRMVYEFQCKRTVKSFNSVLNVIIQEGLYSRALEFHSYVVNCK--NIKPNVLTFNLVIKA 857
             RM  EFQCK+TVKSFNSVLNVI+QEGL++RALEF+++VV  K  NI PN LTFNLVIKA
Sbjct: 120  HRMWGEFQCKQTVKSFNSVLNVIVQEGLFNRALEFYNHVVASKSLNIHPNALTFNLVIKA 179

Query: 858  MCKLQLVDRAVEVFREMPAMKCDADVFTYCTLMDGLCKEDRVEEAVALLDEMQIEGCFPN 1037
            MC+L LVD+A+EVFRE+P   C  D +TY TLM GLCKE+R++EAV+LLDEMQ+EG FPN
Sbjct: 180  MCRLGLVDKAIEVFREIPLRNCAPDNYTYSTLMHGLCKEERIDEAVSLLDEMQVEGTFPN 239

Query: 1038 PATFNVLINGLCKKGDLSRAAKVVDNMFLKGCVPNEVTYNTLIHGLCLQGKLEKAISLLD 1217
               FNVLI+ LCKKGDL RAAK+VDNMFLKGCVPNEVTYN L+HGLCL+GKLEKA+SLL+
Sbjct: 240  LVAFNVLISALCKKGDLGRAAKLVDNMFLKGCVPNEVTYNALVHGLCLKGKLEKAVSLLN 299

Query: 1218 RMVSDKFIPNDVTYGTIIDGLVRKGRAVDGVHVMVSMEERGHQGNDHIYSSLVSGLFKEG 1397
            +MVS+K +PNDVT+GT+I+G V +GRA DG  V+VS+E RGH+GN+++YSSL+SGL KEG
Sbjct: 300  QMVSNKCVPNDVTFGTLINGFVMQGRASDGTRVLVSLEARGHRGNEYVYSSLISGLCKEG 359

Query: 1398 RSEEALNLWKKIMENGHKPNTVVYSALIDGLCRVGKPFEAKEILPEMVNKGCEPNAYTYS 1577
            +  +A+ LWK+++  G  PNT+VYSALIDGLCR GK  EA+  L EM NKG  PN++TYS
Sbjct: 360  KFNQAMELWKEMVGKGCGPNTIVYSALIDGLCREGKLDEARGFLSEMKNKGYLPNSFTYS 419

Query: 1578 SLMKGFFKVGNSNMAVLLWKEMTEKGCVHNEFCYSILIHGLCDEGKLKEATMVWRHMLGK 1757
            SLM+G+F+ G+S+ A+L+WKEM    C+HNE CYSILI+GLC +GK  EA MVW+ ML +
Sbjct: 420  SLMRGYFEAGDSHKAILVWKEMANNNCIHNEVCYSILINGLCKDGKFMEALMVWKQMLSR 479

Query: 1758 GWTPDVVAYTSMIHGLCSVGSVEQGLLLFNEMLYKGSDSQPDVFTYNVLFNALCKHEKIT 1937
            G   DVVAY+SMIHG C+   VEQGL LFN+ML +G   QPDV TYN+L NA C  + I 
Sbjct: 480  GIKLDVVAYSSMIHGFCNANLVEQGLKLFNQMLCQGPVVQPDVITYNILLNAFCIQKSIF 539

Query: 1938 PAIHLLNNMLDRGCDPDIVTCNIFLTTFKEKMNPSQDGREFLDELFLRLHKRQRIDGASR 2117
             AI +LN MLD+GCDPD +TC+IFL T +E MNP QDGREFLDEL +RL KRQR  GAS+
Sbjct: 540  RAIDILNIMLDQGCDPDFITCDIFLKTLRENMNPPQDGREFLDELVVRLVKRQRTIGASK 599

Query: 2118 IIEVMLQKFLYPKPSTWEIAIRELCKPRKIQVAINKCWNSL 2240
            IIEVM+ KFL PK STW + ++++CKP+ ++ AI++CW+ L
Sbjct: 600  IIEVMMHKFLLPKASTWAMVVQQVCKPKNVRKAISECWSRL 640


>gb|EXB83265.1| hypothetical protein L484_011559 [Morus notabilis]
          Length = 699

 Score =  797 bits (2058), Expect = 0.0
 Identities = 396/604 (65%), Positives = 478/604 (79%), Gaps = 19/604 (3%)
 Frame = +3

Query: 450  PITDRLF---KHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMVFSRMWHERRAFVEKS 620
            P++ +LF     +P SGSYK GDSTFYSLI NYA+S DF+SLE V  R+  ERR  VEK 
Sbjct: 45   PLSPQLFMPSSSSPDSGSYKLGDSTFYSLIHNYASSADFRSLEKVLDRIKSERRVLVEKC 104

Query: 621  FIVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVIIQEGLYSRALEFH--- 791
            FIV+FRAYGKAHLP KAV+LF RM+++F+C+ TVKSFNSVLNVIIQE  +S AL+F+   
Sbjct: 105  FIVIFRAYGKAHLPNKAVDLFQRMLHDFRCRPTVKSFNSVLNVIIQEHKFSYALDFYYSN 164

Query: 792  ----------SYVVNCKN--IKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDADV 935
                        ++N KN  I PNVLTFNLVIKAMCKL LVDRAV+VFRE+P   C  DV
Sbjct: 165  VVALRSGVCKDNILNMKNMNISPNVLTFNLVIKAMCKLGLVDRAVQVFREIPLRNCTPDV 224

Query: 936  FTYCTLMDGLCKEDRVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVVDN 1115
            FTY TLMDGLCKE+R++EAV+LLDEMQIEGCFP+P TFNVLI+ LCKKGD+ RAAK+VDN
Sbjct: 225  FTYSTLMDGLCKENRIDEAVSLLDEMQIEGCFPSPVTFNVLISALCKKGDIGRAAKLVDN 284

Query: 1116 MFLKGCVPNEVTYNTLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRKGR 1295
            MFLK C+PNE TYN LIHGLCL+GKL KA+SLLDRMV +K +PNDVTYGTII+GLV+ GR
Sbjct: 285  MFLKDCLPNEATYNALIHGLCLKGKLNKAVSLLDRMVMNKCVPNDVTYGTIINGLVKHGR 344

Query: 1296 AVDGVHVMVSMEERGHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVYSA 1475
            A DG +++VSMEERG   N+++YS+L+SGLFKEG+ EEA+ LWK +   GHKPN VVYSA
Sbjct: 345  AFDGANLLVSMEERGRHANEYVYSALISGLFKEGKYEEAMGLWKDMTGKGHKPNVVVYSA 404

Query: 1476 LIDGLCRVGKPFEAKEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTEKG 1655
            LIDGLCR GKP +AKE++ EMV  G  PN+ TYSSLM+GFFK   S+ A+LLWKE+    
Sbjct: 405  LIDGLCREGKPDKAKEVMFEMVKNGFNPNSRTYSSLMRGFFKASESHKAILLWKEIVANN 464

Query: 1656 CVHNEFCYSILIHGLCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQGL 1835
             + NEFCYS+LI GLC +GKLKEA M+W+ ML +G+ PDVVAY+SMIHGLC+ G VE+G+
Sbjct: 465  -LENEFCYSVLIDGLCGDGKLKEALMMWKQMLYRGFKPDVVAYSSMIHGLCTAGLVEEGM 523

Query: 1836 LLFNEMLYKGSDSQPDVFTYNVLFNALCKH-EKITPAIHLLNNMLDRGCDPDIVTCNIFL 2012
             LFNEML    +SQPDV TYN+L NALCK+   I+ A+ LLN MLD GCDPD++TC+IFL
Sbjct: 524  NLFNEMLCLEPESQPDVITYNILLNALCKNGGSISRAVDLLNYMLDLGCDPDVITCDIFL 583

Query: 2013 TTFKEKMNPSQDGREFLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIRELC 2192
             T +EK+ P QDGREFLDEL +RL KR+RI GA  I+EVMLQKFL PK STW   I++LC
Sbjct: 584  RTLREKLEPPQDGREFLDELAVRLLKRERIKGAVTIVEVMLQKFLPPKASTWARVIQQLC 643

Query: 2193 KPRK 2204
            KP+K
Sbjct: 644  KPKK 647


>gb|ESW10855.1| hypothetical protein PHAVU_009G243700g [Phaseolus vulgaris]
          Length = 645

 Score =  793 bits (2049), Expect = 0.0
 Identities = 380/601 (63%), Positives = 483/601 (80%), Gaps = 2/601 (0%)
 Frame = +3

Query: 444  EPPITDRLFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMVFSRMWHERRAFVEKSF 623
            +P  +  +FK   K GSYK GD +FYSLI+N+A++ DF SLE V  +M  ERR FVE++F
Sbjct: 43   QPHPSAEIFKSGTKMGSYKLGDLSFYSLIQNHASTLDFGSLEEVLQQMKRERRVFVERNF 102

Query: 624  IVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVIIQEGLYSRALEFHSYVV 803
            IV+F+AYGKAHLPEKAV+LF RM  EFQCK+TVKSFNSVL+V+IQEGL++RALE +S+VV
Sbjct: 103  IVMFKAYGKAHLPEKAVDLFLRMGGEFQCKQTVKSFNSVLSVVIQEGLFNRALELYSHVV 162

Query: 804  NCK--NIKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDADVFTYCTLMDGLCKED 977
              K  NI PN LTFNL+IKAMC+L LVD+AVEVFRE+P   C  D +TY TLM GLC+E 
Sbjct: 163  ASKSFNIHPNALTFNLLIKAMCRLGLVDQAVEVFREIPLRNCAPDAYTYSTLMHGLCQEG 222

Query: 978  RVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVVDNMFLKGCVPNEVTYN 1157
            R++EAV+LLDEMQ+EG FPNP  FNVLI+ LCK GDL+RAAK+VDNMFLKGCVPNEVTYN
Sbjct: 223  RIDEAVSLLDEMQVEGTFPNPVAFNVLISALCKNGDLARAAKLVDNMFLKGCVPNEVTYN 282

Query: 1158 TLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRKGRAVDGVHVMVSMEER 1337
             L+HGLCL+GKLEKA+SLL+RMV +K +PNDVT+GT+I+G V++GRA +G  V+VS+EER
Sbjct: 283  ALVHGLCLKGKLEKAVSLLNRMVLNKCVPNDVTFGTLINGFVKQGRASEGARVLVSLEER 342

Query: 1338 GHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVYSALIDGLCRVGKPFEA 1517
             H GN+++YSSL+SGL KEG+   A+ LWK+++  G KPNTVVYSALIDGLCR GK  EA
Sbjct: 343  DHCGNEYVYSSLISGLCKEGKFNHAMQLWKEMVGKGCKPNTVVYSALIDGLCREGKLDEA 402

Query: 1518 KEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTEKGCVHNEFCYSILIHG 1697
            +E+L EM +KG  PN++TYSSLM+G+F+ G S+ A+L+WKEM +  C HNE CYSILI+G
Sbjct: 403  REVLSEMKSKGYLPNSFTYSSLMRGYFEAGISHKAILVWKEMADNNCNHNEVCYSILING 462

Query: 1698 LCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQGLLLFNEMLYKGSDSQ 1877
            LC +GK+ EA MVW+ ML +G   DVVAY+SMIHG C+   +E GL LFN+ML +  + Q
Sbjct: 463  LCKDGKVMEALMVWKQMLSRGIKLDVVAYSSMIHGFCNANLIEHGLKLFNQMLCQEPEVQ 522

Query: 1878 PDVFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDIVTCNIFLTTFKEKMNPSQDGRE 2057
            PDV TYN++ NALC H  I+ AI +LN MLD+GCDPD +TC++FL T +E +NP QDGRE
Sbjct: 523  PDVITYNIILNALCMHNSISRAIDILNIMLDQGCDPDFITCDVFLKTLRENVNPPQDGRE 582

Query: 2058 FLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIRELCKPRKIQVAINKCWNS 2237
            FLDEL +RL KRQR  GAS+IIEVML KFL PK STW + +++LCKP++++  I++CW+ 
Sbjct: 583  FLDELVVRLVKRQRTIGASKIIEVMLHKFLLPKASTWAMIVQQLCKPKRVRKVISECWSK 642

Query: 2238 L 2240
            L
Sbjct: 643  L 643


>ref|XP_006404148.1| hypothetical protein EUTSA_v10010168mg [Eutrema salsugineum]
            gi|557105267|gb|ESQ45601.1| hypothetical protein
            EUTSA_v10010168mg [Eutrema salsugineum]
          Length = 696

 Score =  790 bits (2040), Expect = 0.0
 Identities = 398/667 (59%), Positives = 493/667 (73%), Gaps = 3/667 (0%)
 Frame = +3

Query: 249  FTSKSARLILTSNPCKSSFIFL-IHCPFSALPNNSCETEKDYEEDIIAIQSTNSHMLPKR 425
            F +KS   IL+SNP K S   L      S  P  S ETE+ + E+  A            
Sbjct: 49   FLNKSR--ILSSNPVKLSIHLLCFSSSVSVSPKPSMETEQQHTENPSAA----------- 95

Query: 426  SHKVEVEPPITDRLFKHAPKSGSYKQGDSTFYSLIENYANSGDFKSLEMVFSRMWHERRA 605
                    PI++++F+ APK GSYK GDST  S+IENYANSGDF S+E + SR+  E R 
Sbjct: 96   --------PISEKMFESAPKMGSYKLGDSTLSSMIENYANSGDFASVEKLLSRIRLENRM 147

Query: 606  FVEKSFIVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSVLNVIIQEGLYSRALE 785
              E SFIV+FRAYGKAHLPEK +ELF RMV EFQCKRT+KSFNSVLNVII EG Y R LE
Sbjct: 148  IREHSFIVLFRAYGKAHLPEKTIELFHRMVDEFQCKRTIKSFNSVLNVIINEGRYHRGLE 207

Query: 786  FHSYVVNCK-NIKPNVLTFNLVIKAMCKLQLVDRAVEVFREMPAMKCDADVFTYCTLMDG 962
            F+ YVVN   NI PN L+FNLVIKAMCKL  VDRA+EVFR MP  KC  D +TYCTLMDG
Sbjct: 208  FYDYVVNSNMNIAPNGLSFNLVIKAMCKLGFVDRAIEVFRVMPEKKCVPDGYTYCTLMDG 267

Query: 963  LCKEDRVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLSRAAKVVDNMFLKGCVPN 1142
            LCKE+R++EAV LLDEMQ EGC P+  T+NVLI+GLCKKGDL+R  K+VDNMFLKGCVPN
Sbjct: 268  LCKEERIDEAVLLLDEMQSEGCSPSSVTYNVLIDGLCKKGDLTRVTKLVDNMFLKGCVPN 327

Query: 1143 EVTYNTLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTIIDGLVRKGRAVDGVHVMV 1322
            +VTYNTLIHGLCL+GKL+KA+SLL+RMVS K IPNDVTYGT+I+GLV++ RA+DG  +++
Sbjct: 328  KVTYNTLIHGLCLKGKLDKAVSLLERMVSSKCIPNDVTYGTLINGLVKQRRAMDGAGLLI 387

Query: 1323 SMEERGHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHKPNTVVYSALIDGLCRVG 1502
            SMEERG++ N H+YS L+SGLFKEG+ EEA++LWKK+ E G +PN VVYSAL+DGLCR G
Sbjct: 388  SMEERGYRLNQHVYSILISGLFKEGKVEEAMSLWKKMGEKGCQPNIVVYSALVDGLCRQG 447

Query: 1503 KPFEAKEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLLWKEMTEKGCVHNEFCYS 1682
            K  EAKEI   M++ GC PN YTYSSLMKGFFK G S  A+ +W+EM    C  N+ CYS
Sbjct: 448  KTKEAKEIFDIMISNGCLPNVYTYSSLMKGFFKTGLSEEAIQVWREMDNTECSRNKVCYS 507

Query: 1683 ILIHGLCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCSVGSVEQGLLLFNEML-Y 1859
            +LI GLC  G++KEA MVW  ML  G  PD VAY+SMI G C +GS++  + L++EML  
Sbjct: 508  VLIDGLCGVGRVKEAMMVWSKMLIIGIKPDTVAYSSMIKGFCGIGSMDAAIRLYHEMLCQ 567

Query: 1860 KGSDSQPDVFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDIVTCNIFLTTFKEKMNP 2039
            +   SQPDV TYN++ +  C  + I+ A+ LLN MLDRGCDPD +TC+ FL T  +K + 
Sbjct: 568  EDHKSQPDVVTYNIIIDGFCMQKDISRAVDLLNCMLDRGCDPDAITCDTFLKTLSKKSDS 627

Query: 2040 SQDGREFLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWEIAIRELCKPRKIQVAI 2219
             ++G+ FL+EL +RL KRQR+ GA +I+EVML K+L PK STW + + E+CKP+KI VAI
Sbjct: 628  CEEGKSFLEELVVRLLKRQRVSGACKIVEVMLSKYLTPKASTWAMIVPEICKPKKINVAI 687

Query: 2220 NKCWNSL 2240
            +KCW ++
Sbjct: 688  DKCWRNM 694


>ref|XP_003594857.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355483905|gb|AES65108.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 647

 Score =  781 bits (2016), Expect = 0.0
 Identities = 381/619 (61%), Positives = 486/619 (78%), Gaps = 8/619 (1%)
 Frame = +3

Query: 396  STNSHMLPKRSHKVEVEPPITDRLFKH-----APKSGSYKQGDSTFYSLIENYANSGDFK 560
            S +S  LP   H +   PP   ++FK      + K GSYK GD +FYSLIEN++NS DF 
Sbjct: 29   SYSSSNLPHTHHSL---PP---QIFKSPSNTSSHKWGSYKLGDLSFYSLIENFSNSLDFT 82

Query: 561  SLEMVFSRMWHERRAFVEKSFIVVFRAYGKAHLPEKAVELFDRMVYEFQCKRTVKSFNSV 740
            SLE +  +M  E R F+EKSFI++F+AYGKAHLP+KA++LF RM  EF CK+TVKSFN+V
Sbjct: 83   SLEQLLHQMKCENRVFIEKSFIIMFKAYGKAHLPQKALDLFHRMGAEFHCKQTVKSFNTV 142

Query: 741  LNVIIQEGLYSRALEFHSYVVNCK---NIKPNVLTFNLVIKAMCKLQLVDRAVEVFREMP 911
            LNV+IQEG +  ALEF+++V++     NI+PN L+FNLVIKA+C++  VD+AVEVFR M 
Sbjct: 143  LNVVIQEGCFDLALEFYNHVIDSNSFSNIQPNGLSFNLVIKALCRVGNVDQAVEVFRGMS 202

Query: 912  AMKCDADVFTYCTLMDGLCKEDRVEEAVALLDEMQIEGCFPNPATFNVLINGLCKKGDLS 1091
               C AD +TY TLM GLC E R++EAV+LLDEMQ+EG FPNP  FNVLI+ LCKKGDLS
Sbjct: 203  DRNCVADGYTYSTLMHGLCNEGRIDEAVSLLDEMQVEGTFPNPVAFNVLISALCKKGDLS 262

Query: 1092 RAAKVVDNMFLKGCVPNEVTYNTLIHGLCLQGKLEKAISLLDRMVSDKFIPNDVTYGTII 1271
            RA+K+VDNMFLKGCVPNEVTYN+L+HGLCL+GKL+KA+SLL+RMV++K +PND+T+GT++
Sbjct: 263  RASKLVDNMFLKGCVPNEVTYNSLVHGLCLKGKLDKAMSLLNRMVANKCVPNDITFGTLV 322

Query: 1272 DGLVRKGRAVDGVHVMVSMEERGHQGNDHIYSSLVSGLFKEGRSEEALNLWKKIMENGHK 1451
            DG V+ GRA+DGV V+VS+EE+G++GN+  YSSL+SGLFKEG+ E  + LWK+++E G K
Sbjct: 323  DGFVKHGRALDGVRVLVSLEEKGYRGNEFSYSSLISGLFKEGKGEHGMQLWKEMVEKGCK 382

Query: 1452 PNTVVYSALIDGLCRVGKPFEAKEILPEMVNKGCEPNAYTYSSLMKGFFKVGNSNMAVLL 1631
            PNT+VYSALIDGLCR GKP EAKE L EM NKG  PN++TYSSLM G+F+ G+ + A+L+
Sbjct: 383  PNTIVYSALIDGLCREGKPDEAKEYLIEMKNKGHTPNSFTYSSLMWGYFEAGDIHKAILV 442

Query: 1632 WKEMTEKGCVHNEFCYSILIHGLCDEGKLKEATMVWRHMLGKGWTPDVVAYTSMIHGLCS 1811
            WKEMT+  C H+E CYSILI+GLC  GKLKEA +VW+ ML +G   DVVAY+SMIHG C+
Sbjct: 443  WKEMTDNDCNHHEVCYSILINGLCKNGKLKEALIVWKQMLSRGIKLDVVAYSSMIHGFCN 502

Query: 1812 VGSVEQGLLLFNEMLYKGSDSQPDVFTYNVLFNALCKHEKITPAIHLLNNMLDRGCDPDI 1991
               VEQG+ LFN+ML      QPDV TYN+L NA C    ++ AI +LN MLD+GCDPD 
Sbjct: 503  AQLVEQGMKLFNQMLCHNPKLQPDVVTYNILLNAFCTKNSVSRAIDILNTMLDQGCDPDF 562

Query: 1992 VTCNIFLTTFKEKMNPSQDGREFLDELFLRLHKRQRIDGASRIIEVMLQKFLYPKPSTWE 2171
            +TC+IFL T ++ M+P QDGREFLDEL +RL KRQR  GAS IIEVMLQKFL PKPSTW 
Sbjct: 563  ITCDIFLKTLRDNMDPPQDGREFLDELVVRLIKRQRTVGASNIIEVMLQKFLLPKPSTWA 622

Query: 2172 IAIRELCKPRKIQVAINKC 2228
            +A+++LCKP K++  I++C
Sbjct: 623  LAVQQLCKPMKVRKTISEC 641


Top