BLASTX nr result

ID: Rehmannia22_contig00006903 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00006903
         (2440 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002270184.2| PREDICTED: pentatricopeptide repeat-containi...   868   0.0  
ref|XP_006343482.1| PREDICTED: pentatricopeptide repeat-containi...   840   0.0  
ref|XP_006343481.1| PREDICTED: pentatricopeptide repeat-containi...   836   0.0  
ref|XP_006343483.1| PREDICTED: pentatricopeptide repeat-containi...   832   0.0  
ref|XP_004250704.1| PREDICTED: pentatricopeptide repeat-containi...   830   0.0  
ref|XP_002529286.1| pentatricopeptide repeat-containing protein,...   823   0.0  
ref|XP_006431883.1| hypothetical protein CICLE_v10000525mg [Citr...   822   0.0  
ref|XP_002299667.2| hypothetical protein POPTR_0001s21880g [Popu...   810   0.0  
ref|XP_004141206.1| PREDICTED: pentatricopeptide repeat-containi...   809   0.0  
gb|EMJ26349.1| hypothetical protein PRUPE_ppa002505mg [Prunus pe...   809   0.0  
ref|XP_004299940.1| PREDICTED: pentatricopeptide repeat-containi...   808   0.0  
gb|EOX98058.1| Pentatricopeptide repeat superfamily protein isof...   806   0.0  
gb|EOX98059.1| Pentatricopeptide repeat (PPR) superfamily protei...   799   0.0  
gb|ESW20506.1| hypothetical protein PHAVU_006G214900g [Phaseolus...   769   0.0  
ref|XP_004486236.1| PREDICTED: pentatricopeptide repeat-containi...   766   0.0  
ref|XP_006417404.1| hypothetical protein EUTSA_v10007006mg [Eutr...   766   0.0  
ref|NP_172560.2| pentatricopeptide repeat-containing protein [Ar...   764   0.0  
ref|XP_006343484.1| PREDICTED: pentatricopeptide repeat-containi...   763   0.0  
ref|XP_002889841.1| hypothetical protein ARALYDRAFT_888388 [Arab...   763   0.0  
ref|XP_003531588.2| PREDICTED: pentatricopeptide repeat-containi...   761   0.0  

>ref|XP_002270184.2| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic [Vitis vinifera]
            gi|298204537|emb|CBI23812.3| unnamed protein product
            [Vitis vinifera]
          Length = 659

 Score =  868 bits (2243), Expect = 0.0
 Identities = 439/658 (66%), Positives = 534/658 (81%), Gaps = 1/658 (0%)
 Frame = -1

Query: 2380 MEVSSVLGNGFQPAFVFPTTRPIKNSKXXXXXXXXXXPQRVSTPISSNGAEKHSEVRSKP 2201
            ME+S VLG GF+      +  P  +S            +  S+ ++S     H E + +P
Sbjct: 1    MEIS-VLGGGFKQVITRLSPLPSLSSPASPLPSTT---RAKSSHLTSATPPLHKESQIEP 56

Query: 2200 LNVSRNLSKN-PSASYKARESAILDIQHSSDLSSALLRSGEVLKAQDLNVVLRHFGKLNR 2024
             +VS    K   S  YKAR+SAIL++Q SSDL SAL R G++LK QDLNV+LRHFGKL R
Sbjct: 57   THVSVTPRKRCHSVGYKARQSAILEVQQSSDLGSALARLGDMLKVQDLNVILRHFGKLCR 116

Query: 2023 RKDLYQLFDWMRQHGKTNIPSYSSYIKFVGRDSNSMKALEIYNSIKDDSTRNNVSVCNST 1844
             +DL QLFDWM++H K    SYS+YIKF+G+  N +KALEIYNSI+D+S RNNVSVCNS 
Sbjct: 117  WQDLSQLFDWMQKHEKITFSSYSTYIKFMGKSLNPIKALEIYNSIQDESVRNNVSVCNSV 176

Query: 1843 LYCLIKSGKFDSSLKLFNQMKQAGLVPDIVTYSTLLLGCAKVKDGYFKAMELVREIKSRG 1664
            L CLI++GKF++SLKLF+QMKQ GL PD VTYSTLL GC KVK GY KA+ELV+E++   
Sbjct: 177  LSCLIRNGKFENSLKLFHQMKQDGLRPDAVTYSTLLAGCMKVKHGYSKALELVQEMERSR 236

Query: 1663 LHMDSVVYGTLISVCASNNQCEAAEKYFDEMKSEGHSPNVFHYSSLLNAYAVVGNYGKAD 1484
            L MDSV+YGTL++VCASNN+C+ AE YF++MK EGH PNVFHYSSLLNAY+  G+Y KAD
Sbjct: 237  LPMDSVIYGTLLAVCASNNRCKEAENYFNQMKDEGHLPNVFHYSSLLNAYSADGDYKKAD 296

Query: 1483 ELIQQMRSAGLVLNKVILTTLLKVYVKGGLFVKSRELLDELQALGYAEDEMPYCLLMDGL 1304
             L+Q M+SAGLV NKVILTTLLKVYV+GGLF KSRELL EL+ LGYAEDEMPYCLLMDGL
Sbjct: 297  MLVQDMKSAGLVPNKVILTTLLKVYVRGGLFEKSRELLAELEDLGYAEDEMPYCLLMDGL 356

Query: 1303 AKCGKLAVAKAVFDEMREKEVKNDGYSYSIMISALCRSGLIEEAKQLACEFEIKYGKYDV 1124
            AK  ++  AK++F+EM++K+VK+DGY YSIMISA CRSGL++EAKQLA +FE  Y KYD+
Sbjct: 357  AKSRRILEAKSIFEEMKKKQVKSDGYCYSIMISAFCRSGLLKEAKQLARDFEATYDKYDL 416

Query: 1123 VILNSMLCAYCRSGEMENVMKLMKKMDESAISPDWNTFQILIKYFCKEHLYLLAYRTMED 944
            V+LN+MLCAYCR+GEME+VM++M+KMDE AISPDWNTF ILIKYFCKE LYLLAYRTMED
Sbjct: 417  VMLNTMLCAYCRAGEMESVMQMMRKMDELAISPDWNTFHILIKYFCKEKLYLLAYRTMED 476

Query: 943  MHRKGHQPEEDLSASLINHLGRTGAHSEAFSVYNMLKYSKRGINKALHEKILHILLAGGL 764
            MH KGHQPEE+L +SLI+HLG+  AHS+AFSVYNML+YSKR + KALHEKILHIL+AG L
Sbjct: 477  MHNKGHQPEEELCSSLISHLGKIRAHSQAFSVYNMLRYSKRTMCKALHEKILHILVAGRL 536

Query: 763  LKDAYVVVKDNAKLISEPAIKKFATSFMRKGNINLVNDVIKSIHYSGYKIDQDIFYMAVS 584
            LKDAYVVVKDN  LIS+P+IKKFAT+FM+ GN+NL+NDV+K+IH SGYKIDQ++F MAV+
Sbjct: 537  LKDAYVVVKDNEGLISKPSIKKFATAFMKFGNVNLINDVMKAIHGSGYKIDQELFQMAVT 596

Query: 583  RYIEQPGKKELLLHLLQWMPGQGFPIDSSMRNLILKNSHLFGHHSISELMSKHYTALK 410
            RYI +P KKELLLHLLQWMPGQG+ +DSS RN+ILKNSHLFG   I+E++SK +   K
Sbjct: 597  RYIAEPEKKELLLHLLQWMPGQGYVVDSSTRNMILKNSHLFGRQLIAEMLSKQHARAK 654


>ref|XP_006343482.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like isoform X2 [Solanum tuberosum]
          Length = 651

 Score =  840 bits (2171), Expect = 0.0
 Identities = 413/590 (70%), Positives = 495/590 (83%)
 Frame = -1

Query: 2149 RESAILDIQHSSDLSSALLRSGEVLKAQDLNVVLRHFGKLNRRKDLYQLFDWMRQHGKTN 1970
            R+SAIL IQ SSDL+SAL R G+ LK QD+NV+LR+FGKL+RR++LYQ F+WM+Q+ K N
Sbjct: 62   RQSAILHIQDSSDLASALARHGDTLKVQDMNVILRYFGKLSRRRELYQAFEWMQQNQKIN 121

Query: 1969 IPSYSSYIKFVGRDSNSMKALEIYNSIKDDSTRNNVSVCNSTLYCLIKSGKFDSSLKLFN 1790
            + SYSSY+KF+G+  + + A+E+Y  IKD S + NVSVCN+ L  LIK+GK +SSLKLF 
Sbjct: 122  VASYSSYVKFMGKSLSCVDAVEMYRDIKDRSIKYNVSVCNAFLSSLIKNGKSESSLKLFT 181

Query: 1789 QMKQAGLVPDIVTYSTLLLGCAKVKDGYFKAMELVREIKSRGLHMDSVVYGTLISVCASN 1610
            QMK+ GLVPD+ TYSTLL GCAKV  GY+KA+ELV+E+ S GL MDSV YG+L+SVCAS+
Sbjct: 182  QMKRDGLVPDVFTYSTLLAGCAKVNGGYYKALELVQELMSNGLQMDSVTYGSLLSVCASH 241

Query: 1609 NQCEAAEKYFDEMKSEGHSPNVFHYSSLLNAYAVVGNYGKADELIQQMRSAGLVLNKVIL 1430
             +C  A KYF +MK EGHSPNV+HYSSLLNAY+   NY KA+ LI++MRSAGLVLNKVI 
Sbjct: 242  KECNEAAKYFQKMKDEGHSPNVYHYSSLLNAYSADRNYEKAEVLIEEMRSAGLVLNKVIY 301

Query: 1429 TTLLKVYVKGGLFVKSRELLDELQALGYAEDEMPYCLLMDGLAKCGKLAVAKAVFDEMRE 1250
            TTLLKVYVKGGLF KS+ELL EL+ALGYA+DEMP+CLLMDGLAK G L  AK+VFDEM E
Sbjct: 302  TTLLKVYVKGGLFEKSKELLKELEALGYAKDEMPFCLLMDGLAKSGHLLEAKSVFDEMME 361

Query: 1249 KEVKNDGYSYSIMISALCRSGLIEEAKQLACEFEIKYGKYDVVILNSMLCAYCRSGEMEN 1070
            K VK DGYSYSIMISA CRSGL+E+AK++A EFE KY KYD+VILN+ML AYCR+G+MEN
Sbjct: 362  KHVKTDGYSYSIMISAFCRSGLLEDAKKVASEFEEKYDKYDIVILNAMLSAYCRAGKMEN 421

Query: 1069 VMKLMKKMDESAISPDWNTFQILIKYFCKEHLYLLAYRTMEDMHRKGHQPEEDLSASLIN 890
            VM +MKKMD+SAISPDWNTF ILI+YFCKE LYLLAYRTMEDMH KGHQPEE L +SLI 
Sbjct: 422  VMSMMKKMDDSAISPDWNTFNILIRYFCKEKLYLLAYRTMEDMHSKGHQPEEGLCSSLIY 481

Query: 889  HLGRTGAHSEAFSVYNMLKYSKRGINKALHEKILHILLAGGLLKDAYVVVKDNAKLISEP 710
            HLG+TGAHSEAFSVYNML+YSKR I+ ALHE ILHIL+AG LLKDAYVVVKDNA  IS+P
Sbjct: 482  HLGKTGAHSEAFSVYNMLRYSKRTISNALHEHILHILIAGRLLKDAYVVVKDNAGFISQP 541

Query: 709  AIKKFATSFMRKGNINLVNDVIKSIHYSGYKIDQDIFYMAVSRYIEQPGKKELLLHLLQW 530
            AIKKF+ +FMR GN+NL+NDV+ ++H SG+KIDQ++F +A++RYI +P KKELLL LL+W
Sbjct: 542  AIKKFSVNFMRSGNVNLINDVMNAMHSSGHKIDQELFDLAIARYIAKPEKKELLLWLLKW 601

Query: 529  MPGQGFPIDSSMRNLILKNSHLFGHHSISELMSKHYTALKTNKSHEGRTR 380
            MPG+G+ IDSS RNLILKNSHLFGH  I+E +SKH    K  K H+   R
Sbjct: 602  MPGKGYAIDSSTRNLILKNSHLFGHQLIAESLSKHLVMSKKVKLHKENAR 651


>ref|XP_006343481.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like isoform X1 [Solanum tuberosum]
          Length = 652

 Score =  836 bits (2159), Expect = 0.0
 Identities = 413/591 (69%), Positives = 495/591 (83%), Gaps = 1/591 (0%)
 Frame = -1

Query: 2149 RESAILDIQHSSDLSSALLRSGEVLKAQDLNVVLRHFGKLNRRKDLYQLFDWMRQHGKTN 1970
            R+SAIL IQ SSDL+SAL R G+ LK QD+NV+LR+FGKL+RR++LYQ F+WM+Q+ K N
Sbjct: 62   RQSAILHIQDSSDLASALARHGDTLKVQDMNVILRYFGKLSRRRELYQAFEWMQQNQKIN 121

Query: 1969 IPSYSSYIKFVGRDSNSMKALEIYNSIKDDSTRNNVSVCNSTLYCLIKSGKFDSSLKLFN 1790
            + SYSSY+KF+G+  + + A+E+Y  IKD S + NVSVCN+ L  LIK+GK +SSLKLF 
Sbjct: 122  VASYSSYVKFMGKSLSCVDAVEMYRDIKDRSIKYNVSVCNAFLSSLIKNGKSESSLKLFT 181

Query: 1789 QMKQAGLVPDIVTYSTLLLGCAKVKDGYFKAMELVREIKSRGLHMDSVVYGTLISVCASN 1610
            QMK+ GLVPD+ TYSTLL GCAKV  GY+KA+ELV+E+ S GL MDSV YG+L+SVCAS+
Sbjct: 182  QMKRDGLVPDVFTYSTLLAGCAKVNGGYYKALELVQELMSNGLQMDSVTYGSLLSVCASH 241

Query: 1609 NQCEAAEKYFDEMKSEGHSPNVFHYSSLLNAYAVVGNYGKADELIQQMRSAGLVLNKVIL 1430
             +C  A KYF +MK EGHSPNV+HYSSLLNAY+   NY KA+ LI++MRSAGLVLNKVI 
Sbjct: 242  KECNEAAKYFQKMKDEGHSPNVYHYSSLLNAYSADRNYEKAEVLIEEMRSAGLVLNKVIY 301

Query: 1429 TTLLKVYVKGGLFVKSRELLDELQALGYAEDEMPYCLLMDGLAKCGKLAVAKAVFDEMRE 1250
            TTLLKVYVKGGLF KS+ELL EL+ALGYA+DEMP+CLLMDGLAK G L  AK+VFDEM E
Sbjct: 302  TTLLKVYVKGGLFEKSKELLKELEALGYAKDEMPFCLLMDGLAKSGHLLEAKSVFDEMME 361

Query: 1249 KEVKN-DGYSYSIMISALCRSGLIEEAKQLACEFEIKYGKYDVVILNSMLCAYCRSGEME 1073
            K VK  DGYSYSIMISA CRSGL+E+AK++A EFE KY KYD+VILN+ML AYCR+G+ME
Sbjct: 362  KHVKTADGYSYSIMISAFCRSGLLEDAKKVASEFEEKYDKYDIVILNAMLSAYCRAGKME 421

Query: 1072 NVMKLMKKMDESAISPDWNTFQILIKYFCKEHLYLLAYRTMEDMHRKGHQPEEDLSASLI 893
            NVM +MKKMD+SAISPDWNTF ILI+YFCKE LYLLAYRTMEDMH KGHQPEE L +SLI
Sbjct: 422  NVMSMMKKMDDSAISPDWNTFNILIRYFCKEKLYLLAYRTMEDMHSKGHQPEEGLCSSLI 481

Query: 892  NHLGRTGAHSEAFSVYNMLKYSKRGINKALHEKILHILLAGGLLKDAYVVVKDNAKLISE 713
             HLG+TGAHSEAFSVYNML+YSKR I+ ALHE ILHIL+AG LLKDAYVVVKDNA  IS+
Sbjct: 482  YHLGKTGAHSEAFSVYNMLRYSKRTISNALHEHILHILIAGRLLKDAYVVVKDNAGFISQ 541

Query: 712  PAIKKFATSFMRKGNINLVNDVIKSIHYSGYKIDQDIFYMAVSRYIEQPGKKELLLHLLQ 533
            PAIKKF+ +FMR GN+NL+NDV+ ++H SG+KIDQ++F +A++RYI +P KKELLL LL+
Sbjct: 542  PAIKKFSVNFMRSGNVNLINDVMNAMHSSGHKIDQELFDLAIARYIAKPEKKELLLWLLK 601

Query: 532  WMPGQGFPIDSSMRNLILKNSHLFGHHSISELMSKHYTALKTNKSHEGRTR 380
            WMPG+G+ IDSS RNLILKNSHLFGH  I+E +SKH    K  K H+   R
Sbjct: 602  WMPGKGYAIDSSTRNLILKNSHLFGHQLIAESLSKHLVMSKKVKLHKENAR 652


>ref|XP_006343483.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like isoform X3 [Solanum tuberosum]
          Length = 646

 Score =  832 bits (2148), Expect = 0.0
 Identities = 410/581 (70%), Positives = 491/581 (84%), Gaps = 1/581 (0%)
 Frame = -1

Query: 2149 RESAILDIQHSSDLSSALLRSGEVLKAQDLNVVLRHFGKLNRRKDLYQLFDWMRQHGKTN 1970
            R+SAIL IQ SSDL+SAL R G+ LK QD+NV+LR+FGKL+RR++LYQ F+WM+Q+ K N
Sbjct: 62   RQSAILHIQDSSDLASALARHGDTLKVQDMNVILRYFGKLSRRRELYQAFEWMQQNQKIN 121

Query: 1969 IPSYSSYIKFVGRDSNSMKALEIYNSIKDDSTRNNVSVCNSTLYCLIKSGKFDSSLKLFN 1790
            + SYSSY+KF+G+  + + A+E+Y  IKD S + NVSVCN+ L  LIK+GK +SSLKLF 
Sbjct: 122  VASYSSYVKFMGKSLSCVDAVEMYRDIKDRSIKYNVSVCNAFLSSLIKNGKSESSLKLFT 181

Query: 1789 QMKQAGLVPDIVTYSTLLLGCAKVKDGYFKAMELVREIKSRGLHMDSVVYGTLISVCASN 1610
            QMK+ GLVPD+ TYSTLL GCAKV  GY+KA+ELV+E+ S GL MDSV YG+L+SVCAS+
Sbjct: 182  QMKRDGLVPDVFTYSTLLAGCAKVNGGYYKALELVQELMSNGLQMDSVTYGSLLSVCASH 241

Query: 1609 NQCEAAEKYFDEMKSEGHSPNVFHYSSLLNAYAVVGNYGKADELIQQMRSAGLVLNKVIL 1430
             +C  A KYF +MK EGHSPNV+HYSSLLNAY+   NY KA+ LI++MRSAGLVLNKVI 
Sbjct: 242  KECNEAAKYFQKMKDEGHSPNVYHYSSLLNAYSADRNYEKAEVLIEEMRSAGLVLNKVIY 301

Query: 1429 TTLLKVYVKGGLFVKSRELLDELQALGYAEDEMPYCLLMDGLAKCGKLAVAKAVFDEMRE 1250
            TTLLKVYVKGGLF KS+ELL EL+ALGYA+DEMP+CLLMDGLAK G L  AK+VFDEM E
Sbjct: 302  TTLLKVYVKGGLFEKSKELLKELEALGYAKDEMPFCLLMDGLAKSGHLLEAKSVFDEMME 361

Query: 1249 KEVKN-DGYSYSIMISALCRSGLIEEAKQLACEFEIKYGKYDVVILNSMLCAYCRSGEME 1073
            K VK  DGYSYSIMISA CRSGL+E+AK++A EFE KY KYD+VILN+ML AYCR+G+ME
Sbjct: 362  KHVKTADGYSYSIMISAFCRSGLLEDAKKVASEFEEKYDKYDIVILNAMLSAYCRAGKME 421

Query: 1072 NVMKLMKKMDESAISPDWNTFQILIKYFCKEHLYLLAYRTMEDMHRKGHQPEEDLSASLI 893
            NVM +MKKMD+SAISPDWNTF ILI+YFCKE LYLLAYRTMEDMH KGHQPEE L +SLI
Sbjct: 422  NVMSMMKKMDDSAISPDWNTFNILIRYFCKEKLYLLAYRTMEDMHSKGHQPEEGLCSSLI 481

Query: 892  NHLGRTGAHSEAFSVYNMLKYSKRGINKALHEKILHILLAGGLLKDAYVVVKDNAKLISE 713
             HLG+TGAHSEAFSVYNML+YSKR I+ ALHE ILHIL+AG LLKDAYVVVKDNA  IS+
Sbjct: 482  YHLGKTGAHSEAFSVYNMLRYSKRTISNALHEHILHILIAGRLLKDAYVVVKDNAGFISQ 541

Query: 712  PAIKKFATSFMRKGNINLVNDVIKSIHYSGYKIDQDIFYMAVSRYIEQPGKKELLLHLLQ 533
            PAIKKF+ +FMR GN+NL+NDV+ ++H SG+KIDQ++F +A++RYI +P KKELLL LL+
Sbjct: 542  PAIKKFSVNFMRSGNVNLINDVMNAMHSSGHKIDQELFDLAIARYIAKPEKKELLLWLLK 601

Query: 532  WMPGQGFPIDSSMRNLILKNSHLFGHHSISELMSKHYTALK 410
            WMPG+G+ IDSS RNLILKNSHLFGH  I+E +SKH    K
Sbjct: 602  WMPGKGYAIDSSTRNLILKNSHLFGHQLIAESLSKHLVMSK 642


>ref|XP_004250704.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like [Solanum lycopersicum]
          Length = 642

 Score =  830 bits (2145), Expect = 0.0
 Identities = 410/585 (70%), Positives = 490/585 (83%)
 Frame = -1

Query: 2164 ASYKARESAILDIQHSSDLSSALLRSGEVLKAQDLNVVLRHFGKLNRRKDLYQLFDWMRQ 1985
            AS   R+S IL IQ SSDL+SAL R G+ LK QD+NV+LR+FGKLNRR +L Q+F+WM+Q
Sbjct: 57   ASRTDRQSTILHIQDSSDLASALARHGDTLKVQDMNVILRYFGKLNRRPELCQVFEWMQQ 116

Query: 1984 HGKTNIPSYSSYIKFVGRDSNSMKALEIYNSIKDDSTRNNVSVCNSTLYCLIKSGKFDSS 1805
            + K N+ SYSSY+KF+G+  + + A+E+Y  IKD S + NVSVCN+ L  LIK+GK +SS
Sbjct: 117  NQKINVASYSSYVKFMGKSLSCVDAVEMYRDIKDRSIKFNVSVCNAFLSSLIKNGKSESS 176

Query: 1804 LKLFNQMKQAGLVPDIVTYSTLLLGCAKVKDGYFKAMELVREIKSRGLHMDSVVYGTLIS 1625
            LKLF QMK+ GLVPD+ TYSTLL GCAKV  GY+KA+ELV+E+ S GL MDSV YG+L+S
Sbjct: 177  LKLFTQMKRDGLVPDVFTYSTLLAGCAKVNGGYYKALELVQEMMSNGLEMDSVTYGSLLS 236

Query: 1624 VCASNNQCEAAEKYFDEMKSEGHSPNVFHYSSLLNAYAVVGNYGKADELIQQMRSAGLVL 1445
            VCAS+ +C  A KYF +MK EGHSPNV+HYSSLLNAY+   NY KA+ LI++MRSAGLVL
Sbjct: 237  VCASHKECNEAAKYFQKMKDEGHSPNVYHYSSLLNAYSADRNYEKAEALIEEMRSAGLVL 296

Query: 1444 NKVILTTLLKVYVKGGLFVKSRELLDELQALGYAEDEMPYCLLMDGLAKCGKLAVAKAVF 1265
            NKVI TTLLKVYVKGGLF KS+ELL EL+ALGYA+DEMP+CLLMDGLAK G L  AK+VF
Sbjct: 297  NKVIYTTLLKVYVKGGLFEKSKELLKELEALGYAKDEMPFCLLMDGLAKSGHLLEAKSVF 356

Query: 1264 DEMREKEVKNDGYSYSIMISALCRSGLIEEAKQLACEFEIKYGKYDVVILNSMLCAYCRS 1085
            DEM EK+VK DGYSYSIMISA CR GL+E+AK+LA EFE KY KYD+VILN+ML AYCR+
Sbjct: 357  DEMMEKQVKTDGYSYSIMISAFCRRGLLEDAKKLASEFEEKYDKYDIVILNAMLSAYCRA 416

Query: 1084 GEMENVMKLMKKMDESAISPDWNTFQILIKYFCKEHLYLLAYRTMEDMHRKGHQPEEDLS 905
            G+MENVM +MKKMD+SAISPDWNTF ILI+YFCKE LYLLAYRTMEDMH KGHQPEE L 
Sbjct: 417  GKMENVMSMMKKMDDSAISPDWNTFNILIRYFCKEKLYLLAYRTMEDMHSKGHQPEEGLC 476

Query: 904  ASLINHLGRTGAHSEAFSVYNMLKYSKRGINKALHEKILHILLAGGLLKDAYVVVKDNAK 725
            +SLI HLG+TGAHSEAFSVYNML+YSKR I+ ALHE ILHIL+AG LLKDAYVVVKDNA 
Sbjct: 477  SSLIYHLGKTGAHSEAFSVYNMLRYSKRTISNALHENILHILIAGRLLKDAYVVVKDNAG 536

Query: 724  LISEPAIKKFATSFMRKGNINLVNDVIKSIHYSGYKIDQDIFYMAVSRYIEQPGKKELLL 545
             IS+PAIKKF+ +FMR GN+NL+NDV+ ++H SG+KIDQ++F +A++RYI +P KKELLL
Sbjct: 537  FISQPAIKKFSVNFMRSGNVNLINDVMNAMHSSGHKIDQELFDLAIARYIAKPEKKELLL 596

Query: 544  HLLQWMPGQGFPIDSSMRNLILKNSHLFGHHSISELMSKHYTALK 410
             LL+WMP +G+ IDSS RNLILKNSHLFGH  I+E +SKH    K
Sbjct: 597  WLLKWMPVKGYAIDSSTRNLILKNSHLFGHQLIAESLSKHLVMSK 641


>ref|XP_002529286.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223531275|gb|EEF33118.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 672

 Score =  823 bits (2125), Expect = 0.0
 Identities = 410/630 (65%), Positives = 505/630 (80%), Gaps = 7/630 (1%)
 Frame = -1

Query: 2266 QRVSTPISSNGAE-----KHSEVRSKPLNVSRNLSKNPSASYKARESAILDIQHSSDLSS 2102
            QR S  +S+N        K      +P N   ++ +  S SY AR++AIL++Q S DL S
Sbjct: 42   QRSSAVLSTNTTTETPLLKQPHNNEQPPNGQFHVQRRHSKSYLARQAAILEVQQSPDLDS 101

Query: 2101 ALLRSGEVLKAQDLNVVLRHFGKLNRRKDLYQLFDWMRQHGKTNIPSYSSYIKFVGRDSN 1922
            AL R G +LKAQDLNV+LR+ GK +R +DL +LFDWM+QH K ++ SY+SY+KF+G+  N
Sbjct: 102  ALRRLGAILKAQDLNVILRNLGKQSRWQDLSKLFDWMQQHSKISVSSYTSYMKFMGKSLN 161

Query: 1921 SMKALEIYNSIKDDSTRNNVSVCNSTLYCLIKSGKFDSSLKLFNQMKQAGLVPDIVTYST 1742
              KALEIYNSI D+S +NNV +CNS L CL++SGKFD SLKLF++MKQ GL PD +TYST
Sbjct: 162  PAKALEIYNSIADESVKNNVFICNSVLSCLVRSGKFDISLKLFHKMKQNGLTPDTITYST 221

Query: 1741 LLLGCAKVKDGYFKAMELVREIKSRGLHMDSVVYGTLISVCASNNQCEAAEKYFDEMKSE 1562
            LL GC K KDGY K ++ V+E+K  GL MD+V+YGT+++VCAS+N+CE AE YF +MK+E
Sbjct: 222  LLSGCIKAKDGYSKTLDFVQELKYNGLQMDTVIYGTILAVCASHNRCEEAESYFSQMKNE 281

Query: 1561 GHSPNVFHYSSLLNAYAVVGNYGKADELIQQMRSAGLVLNKVILTTLLKVYVKGGLFVKS 1382
            GH PNVFHYSSLLNAYA  GNY KA+EL+Q M+S GLV NKVI TTLLKVYV+GGLF KS
Sbjct: 282  GHLPNVFHYSSLLNAYASSGNYKKAEELVQDMKSLGLVPNKVIWTTLLKVYVRGGLFEKS 341

Query: 1381 RELLDELQALGYAEDEMPYCLLMDGLAKCGKLAVAKAVFDEMREKEVKNDGYSYSIMISA 1202
            ++LL EL+ LGYAEDEMPYCLLMDGL+K G++  A++ FDEM+EK VK+DGY+YSIMISA
Sbjct: 342  QQLLLELETLGYAEDEMPYCLLMDGLSKAGRVDEARSFFDEMKEKNVKSDGYAYSIMISA 401

Query: 1201 LCRSGLIEEAKQLACEFEIKYGKYDVVILNSMLCAYCRSGEMENVMKLMKKMDESAISPD 1022
             CR  L+EEAKQLA EFE KY KYDVVILN+MLCAYCR+G+ME+VM+ M+KMDE AISP 
Sbjct: 402  YCRGRLLEEAKQLAKEFEAKYDKYDVVILNTMLCAYCRAGDMESVMQTMRKMDELAISPS 461

Query: 1021 WNTFQILIKYFCKEHLYLLAYRTMEDMHRKGHQPEEDLSASLINHLGRTGAHSEAFSVYN 842
            + TF ILIKYFCK+ LYLLAY+TMEDMHRKGHQPEE+L + LI HLG+  A++EAFSVY 
Sbjct: 462  YCTFHILIKYFCKQKLYLLAYQTMEDMHRKGHQPEEELCSMLIFHLGKAKAYTEAFSVYT 521

Query: 841  MLKYSKRGINKALHEKILHILLAGGLLKDAYVVVKDNAKLISEPAIKKFATSFMRKGNIN 662
            MLKY KR + KALHEKILH+LL G LLKDAYVVVKDNA+LIS+ AIKKFA +FM+ GNIN
Sbjct: 522  MLKYGKRTMCKALHEKILHVLLGGQLLKDAYVVVKDNAELISQAAIKKFANAFMKLGNIN 581

Query: 661  LVNDVIKSIHYSGYKIDQ--DIFYMAVSRYIEQPGKKELLLHLLQWMPGQGFPIDSSMRN 488
            L+NDV+K IH SGYKIDQ  ++F MA+SRYI QP KK+LL+ LLQWMPG G+ +D+S RN
Sbjct: 582  LINDVMKVIHSSGYKIDQASELFQMAISRYIAQPEKKDLLVQLLQWMPGHGYVVDASTRN 641

Query: 487  LILKNSHLFGHHSISELMSKHYTALKTNKS 398
            LILK+SHLFG   I+E++SK +   KT KS
Sbjct: 642  LILKSSHLFGRQLIAEILSKQHIISKTLKS 671


>ref|XP_006431883.1| hypothetical protein CICLE_v10000525mg [Citrus clementina]
            gi|557534005|gb|ESR45123.1| hypothetical protein
            CICLE_v10000525mg [Citrus clementina]
          Length = 660

 Score =  822 bits (2123), Expect = 0.0
 Identities = 402/593 (67%), Positives = 498/593 (83%)
 Frame = -1

Query: 2176 KNPSASYKARESAILDIQHSSDLSSALLRSGEVLKAQDLNVVLRHFGKLNRRKDLYQLFD 1997
            K  S+SY AR+SAIL++Q SSDL+S+L R G +LK  DLN +LRHFG L R +D+ QLF+
Sbjct: 65   KRQSSSYLARKSAILEVQQSSDLTSSLERLGGILKVPDLNAILRHFGDLGRGRDVLQLFE 124

Query: 1996 WMRQHGKTNIPSYSSYIKFVGRDSNSMKALEIYNSIKDDSTRNNVSVCNSTLYCLIKSGK 1817
            WM+QHGKT+I SYSSYIKF+G+  NS+KALEIYNSI D+S + NV +CNS L CL+++GK
Sbjct: 125  WMQQHGKTSISSYSSYIKFLGKSGNSLKALEIYNSITDESDKVNVFICNSILSCLVRNGK 184

Query: 1816 FDSSLKLFNQMKQAGLVPDIVTYSTLLLGCAKVKDGYFKAMELVREIKSRGLHMDSVVYG 1637
            F+SSLKLF++MKQ+GL PD VTY+TLL GC K K+GY KA+ELV+E+K  G  MD+V+YG
Sbjct: 185  FESSLKLFDKMKQSGLTPDAVTYNTLLTGCIKDKNGYSKALELVQELKYNGAQMDNVMYG 244

Query: 1636 TLISVCASNNQCEAAEKYFDEMKSEGHSPNVFHYSSLLNAYAVVGNYGKADELIQQMRSA 1457
             L+++CASNN C  A+ YF++MK EGHSPNV+HYSSLLNAY+  G+Y KADELIQ M+S+
Sbjct: 245  ILLAICASNNLCAKAQSYFNQMKVEGHSPNVYHYSSLLNAYSSGGDYTKADELIQDMKSS 304

Query: 1456 GLVLNKVILTTLLKVYVKGGLFVKSRELLDELQALGYAEDEMPYCLLMDGLAKCGKLAVA 1277
            GLV NKVILTTLLKVYV+GGLF KSRELL EL  LGYAE+EMPYCLLMDGL+K G L  A
Sbjct: 305  GLVPNKVILTTLLKVYVRGGLFEKSRELLAELDTLGYAENEMPYCLLMDGLSKAGCLDEA 364

Query: 1276 KAVFDEMREKEVKNDGYSYSIMISALCRSGLIEEAKQLACEFEIKYGKYDVVILNSMLCA 1097
            + VF+EM+EK VK+DGY++SIMISA CR G  EEAKQLA +FE KY KYDVV+LNSMLCA
Sbjct: 365  RVVFNEMQEKCVKSDGYAHSIMISAFCRGGCFEEAKQLAGDFEAKYDKYDVVLLNSMLCA 424

Query: 1096 YCRSGEMENVMKLMKKMDESAISPDWNTFQILIKYFCKEHLYLLAYRTMEDMHRKGHQPE 917
            YCR+G+ME+VM +M+K+DE AISPD+NTF ILIKYFCKE +Y+LAYRTM DMHRKGHQPE
Sbjct: 425  YCRTGDMESVMHVMRKLDELAISPDYNTFHILIKYFCKEKMYILAYRTMVDMHRKGHQPE 484

Query: 916  EDLSASLINHLGRTGAHSEAFSVYNMLKYSKRGINKALHEKILHILLAGGLLKDAYVVVK 737
            E+L +SLI HLG+  AHSEA SVYNML+YSKR + KALHEKILHIL++G LLKDAYVVVK
Sbjct: 485  EELCSSLIFHLGKMRAHSEALSVYNMLRYSKRSMCKALHEKILHILISGKLLKDAYVVVK 544

Query: 736  DNAKLISEPAIKKFATSFMRKGNINLVNDVIKSIHYSGYKIDQDIFYMAVSRYIEQPGKK 557
            DN++ IS P IKKFA++F+R GNINLVNDV+K+IH +GY+IDQ IF++A++RYI +  KK
Sbjct: 545  DNSESISHPVIKKFASAFVRLGNINLVNDVMKAIHTTGYRIDQGIFHIAIARYIAEREKK 604

Query: 556  ELLLHLLQWMPGQGFPIDSSMRNLILKNSHLFGHHSISELMSKHYTALKTNKS 398
            ELLL LL+WM GQG+ +DSS RNLILKNSHL G   I++++SK +   K++K+
Sbjct: 605  ELLLKLLEWMTGQGYVVDSSTRNLILKNSHLLGRQLIADILSKQHMKSKSSKT 657


>ref|XP_002299667.2| hypothetical protein POPTR_0001s21880g [Populus trichocarpa]
            gi|550347847|gb|EEE84472.2| hypothetical protein
            POPTR_0001s21880g [Populus trichocarpa]
          Length = 673

 Score =  810 bits (2091), Expect = 0.0
 Identities = 398/615 (64%), Positives = 500/615 (81%)
 Frame = -1

Query: 2242 SNGAEKHSEVRSKPLNVSRNLSKNPSASYKARESAILDIQHSSDLSSALLRSGEVLKAQD 2063
            ++ ++  +  R +P   + +  +  S SY +R++AIL++Q S  L SAL R G +LK QD
Sbjct: 58   NDDSQPATTTRRRPKGGAVDAQRRQSKSYMSRKAAILEVQQSPHLDSALQRLGGMLKVQD 117

Query: 2062 LNVVLRHFGKLNRRKDLYQLFDWMRQHGKTNIPSYSSYIKFVGRDSNSMKALEIYNSIKD 1883
            LN++LR+FG+  R +DL QLFDWM++H K +  SYSSYIKF+G   N  KALEIY+SI D
Sbjct: 118  LNIILRNFGEQCRWQDLSQLFDWMQRHNKISASSYSSYIKFMGTSLNPAKALEIYHSIPD 177

Query: 1882 DSTRNNVSVCNSTLYCLIKSGKFDSSLKLFNQMKQAGLVPDIVTYSTLLLGCAKVKDGYF 1703
            +S + NV +CNS L CL+++ KFDSS+K F++MK  GL PD +TYSTLL GC K+KDGY 
Sbjct: 178  ESKKTNVFICNSLLRCLVRNTKFDSSMKFFHKMKNNGLTPDAITYSTLLAGCMKIKDGYS 237

Query: 1702 KAMELVREIKSRGLHMDSVVYGTLISVCASNNQCEAAEKYFDEMKSEGHSPNVFHYSSLL 1523
            KA++LV+E+   GL MDS++YGTL++VCASNN+CE A+ YF++MK EGHSPN+FHYSSLL
Sbjct: 238  KALDLVQELNYNGLQMDSIMYGTLLAVCASNNRCEEAQSYFNQMKDEGHSPNIFHYSSLL 297

Query: 1522 NAYAVVGNYGKADELIQQMRSAGLVLNKVILTTLLKVYVKGGLFVKSRELLDELQALGYA 1343
            NAY+  GNY KA+EL+Q M+S+GLV NKVILTTLLKVYV+GGLF KSR+LL EL  LG+A
Sbjct: 298  NAYSSDGNYKKAEELVQDMKSSGLVPNKVILTTLLKVYVRGGLFEKSRDLLVELDTLGFA 357

Query: 1342 EDEMPYCLLMDGLAKCGKLAVAKAVFDEMREKEVKNDGYSYSIMISALCRSGLIEEAKQL 1163
            ++EMPYCLLMDGLAK G L  A++VF+EM+EK VK+ GYSYSIMIS+ CR GL EEAK+L
Sbjct: 358  KNEMPYCLLMDGLAKNGLLDEARSVFNEMKEKRVKSGGYSYSIMISSFCRGGLFEEAKEL 417

Query: 1162 ACEFEIKYGKYDVVILNSMLCAYCRSGEMENVMKLMKKMDESAISPDWNTFQILIKYFCK 983
            A EFE KY KYDVVILN++LCAYCR+GE E+VM+ M+KMDE AISPD+NTF ILIKYFCK
Sbjct: 418  AEEFEAKYDKYDVVILNTILCAYCRTGEKESVMRTMRKMDELAISPDYNTFHILIKYFCK 477

Query: 982  EHLYLLAYRTMEDMHRKGHQPEEDLSASLINHLGRTGAHSEAFSVYNMLKYSKRGINKAL 803
            E LY+LAY+TMEDMHRKGHQP E+L +SLI HLG+  AH+EAFSVY+MLK SKR ++KA 
Sbjct: 478  EKLYMLAYQTMEDMHRKGHQPMEELCSSLILHLGKIKAHAEAFSVYSMLKSSKRTMSKAF 537

Query: 802  HEKILHILLAGGLLKDAYVVVKDNAKLISEPAIKKFATSFMRKGNINLVNDVIKSIHYSG 623
            HE ILHIL+AG LLKDAYVVVKDNA+LIS  AIKKFA+SF++ G+INL+NDV+K IH SG
Sbjct: 538  HEDILHILIAGRLLKDAYVVVKDNAELISPAAIKKFASSFVKLGDINLINDVMKVIHGSG 597

Query: 622  YKIDQDIFYMAVSRYIEQPGKKELLLHLLQWMPGQGFPIDSSMRNLILKNSHLFGHHSIS 443
            YKIDQ++F MAVSRYI +P KK+LL+ LLQWMPGQG+ +DSS RNLILKNSHLFG   I+
Sbjct: 598  YKIDQELFLMAVSRYIAEPEKKDLLIQLLQWMPGQGYVVDSSTRNLILKNSHLFGRQLIA 657

Query: 442  ELMSKHYTALKTNKS 398
            E++SK +   K  K+
Sbjct: 658  EILSKQHMTSKALKA 672


>ref|XP_004141206.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like [Cucumis sativus]
          Length = 668

 Score =  809 bits (2090), Expect = 0.0
 Identities = 394/595 (66%), Positives = 493/595 (82%)
 Frame = -1

Query: 2176 KNPSASYKARESAILDIQHSSDLSSALLRSGEVLKAQDLNVVLRHFGKLNRRKDLYQLFD 1997
            K  S SY  R+SAI  ++  S+L+ AL R G +LKAQDLNV+LRHFG L+R KDL QLF+
Sbjct: 68   KRHSKSYLERQSAIAQVKDCSELAPALARYGGLLKAQDLNVILRHFGMLSRWKDLSQLFE 127

Query: 1996 WMRQHGKTNIPSYSSYIKFVGRDSNSMKALEIYNSIKDDSTRNNVSVCNSTLYCLIKSGK 1817
            WM++ GKTN+ SYSSYIKF+GR  N +KALE+YN+I++ S +N++ +CNS L CL+++GK
Sbjct: 128  WMQETGKTNVSSYSSYIKFMGRGLNPLKALEVYNNIEEVSIKNSIFICNSILNCLVRNGK 187

Query: 1816 FDSSLKLFNQMKQAGLVPDIVTYSTLLLGCAKVKDGYFKAMELVREIKSRGLHMDSVVYG 1637
            FD+S+KLF+QMK  GL PD VTYST+L GC +VK GY KAMEL++E++  GL MD V YG
Sbjct: 188  FDTSVKLFHQMKNDGLCPDTVTYSTMLTGCIRVKHGYAKAMELLKELQDNGLCMDCVSYG 247

Query: 1636 TLISVCASNNQCEAAEKYFDEMKSEGHSPNVFHYSSLLNAYAVVGNYGKADELIQQMRSA 1457
            TLI++CAS+N+ E AE++F++M++EGHSPN+FHY SLLNAY++ G+Y KADELI+ M+  
Sbjct: 248  TLIAICASHNRLEDAERFFNQMRAEGHSPNMFHYGSLLNAYSINGDYKKADELIEDMKLT 307

Query: 1456 GLVLNKVILTTLLKVYVKGGLFVKSRELLDELQALGYAEDEMPYCLLMDGLAKCGKLAVA 1277
            GLV NKVILTTLLKVYV+GGLF KSR+LL EL++LGY E+EMPYCLLMDGLAK G +  A
Sbjct: 308  GLVPNKVILTTLLKVYVRGGLFEKSRKLLSELESLGYGENEMPYCLLMDGLAKAGSIREA 367

Query: 1276 KAVFDEMREKEVKNDGYSYSIMISALCRSGLIEEAKQLACEFEIKYGKYDVVILNSMLCA 1097
            K VFDEM+ K VK DGY++SIMISA CR GL+EEAK LA +FE  Y +YD+VILN+MLCA
Sbjct: 368  KTVFDEMKAKNVKTDGYAHSIMISAFCRGGLLEEAKLLAKDFEATYDRYDIVILNTMLCA 427

Query: 1096 YCRSGEMENVMKLMKKMDESAISPDWNTFQILIKYFCKEHLYLLAYRTMEDMHRKGHQPE 917
            YCR+GEME+VM++++KMD+ AISPD+NTF ILIKYF KE LYLL YRT+EDMHRKGHQPE
Sbjct: 428  YCRAGEMESVMQMLRKMDDLAISPDYNTFHILIKYFFKEKLYLLCYRTLEDMHRKGHQPE 487

Query: 916  EDLSASLINHLGRTGAHSEAFSVYNMLKYSKRGINKALHEKILHILLAGGLLKDAYVVVK 737
            E+L +SLI  LG   A+SEAFSVYN+LKYSKR + KALHEKILHIL+AG LLKDAYVVVK
Sbjct: 488  EELCSSLILSLGNIRAYSEAFSVYNILKYSKRTMCKALHEKILHILIAGRLLKDAYVVVK 547

Query: 736  DNAKLISEPAIKKFATSFMRKGNINLVNDVIKSIHYSGYKIDQDIFYMAVSRYIEQPGKK 557
            DNA +IS+PAI+KFA  FM+ GN+NL+NDV+K+IH SGYKIDQD+F +A SRYIE P KK
Sbjct: 548  DNAGVISKPAIRKFAFGFMKFGNVNLINDVMKAIHGSGYKIDQDLFMIATSRYIELPEKK 607

Query: 556  ELLLHLLQWMPGQGFPIDSSMRNLILKNSHLFGHHSISELMSKHYTALKTNKSHE 392
            +L + LL+WMPGQG+ +DSS RNLILKN+HLFG   I+E++SKH    K+ KS E
Sbjct: 608  DLFIQLLKWMPGQGYVVDSSTRNLILKNAHLFGRQLIAEILSKHSLLSKSTKSRE 662


>gb|EMJ26349.1| hypothetical protein PRUPE_ppa002505mg [Prunus persica]
          Length = 664

 Score =  809 bits (2089), Expect = 0.0
 Identities = 411/663 (61%), Positives = 515/663 (77%)
 Frame = -1

Query: 2380 MEVSSVLGNGFQPAFVFPTTRPIKNSKXXXXXXXXXXPQRVSTPISSNGAEKHSEVRSKP 2201
            MEVSSV G G Q     P+  PI +              + S    +    K  E  ++P
Sbjct: 1    MEVSSVHGVGVQHVLCGPS--PISSLSILSATPWRSTRAKNSHLCCATTLVK--ERHTEP 56

Query: 2200 LNVSRNLSKNPSASYKARESAILDIQHSSDLSSALLRSGEVLKAQDLNVVLRHFGKLNRR 2021
             N    + K  S  Y AR+SAIL++Q SSDL SAL R G  LK QDLN ++RHFG L R 
Sbjct: 57   PNNGSGIPKRHSKQYLARQSAILEVQESSDLDSALTRLGGSLKVQDLNAIIRHFGILKRW 116

Query: 2020 KDLYQLFDWMRQHGKTNIPSYSSYIKFVGRDSNSMKALEIYNSIKDDSTRNNVSVCNSTL 1841
             DL QLF+WM+Q+GK +  SYSSYIKF+G+  N +KALEIYN+I+D ST+ NV +CNS L
Sbjct: 117  HDLSQLFEWMQQNGKISASSYSSYIKFMGKSLNPVKALEIYNNIQDASTKKNVHICNSVL 176

Query: 1840 YCLIKSGKFDSSLKLFNQMKQAGLVPDIVTYSTLLLGCAKVKDGYFKAMELVREIKSRGL 1661
              LI+SGKFD S KLF+QMKQ GL PD VTYSTLL GC KVK GY KA+ELV+E++   L
Sbjct: 177  GSLIRSGKFDGSFKLFHQMKQDGLTPDAVTYSTLLAGCNKVKHGYSKALELVQELQRNEL 236

Query: 1660 HMDSVVYGTLISVCASNNQCEAAEKYFDEMKSEGHSPNVFHYSSLLNAYAVVGNYGKADE 1481
             MDSV+YGTL++VCASNN+ E AE YF +MK+EG+ PNVFHYS++LNAY++ GNY +AD+
Sbjct: 237  QMDSVIYGTLLAVCASNNKLEEAEGYFKQMKNEGYLPNVFHYSAMLNAYSISGNYKEADD 296

Query: 1480 LIQQMRSAGLVLNKVILTTLLKVYVKGGLFVKSRELLDELQALGYAEDEMPYCLLMDGLA 1301
            L+Q M+SAGLV NKVILTTLLKVYV+GGLF KSRELL EL+ALGYAEDEMPYCLLMD LA
Sbjct: 297  LVQDMKSAGLVPNKVILTTLLKVYVRGGLFEKSRELLAELEALGYAEDEMPYCLLMDALA 356

Query: 1300 KCGKLAVAKAVFDEMREKEVKNDGYSYSIMISALCRSGLIEEAKQLACEFEIKYGKYDVV 1121
            K G++  AK VFDEM+EK ++++GYSYSIMISA CR GL+E+AKQL+ + E  + K+D+V
Sbjct: 357  KAGRIHEAKLVFDEMKEKSIRSNGYSYSIMISAFCRGGLLEDAKQLSKDVERTHDKFDLV 416

Query: 1120 ILNSMLCAYCRSGEMENVMKLMKKMDESAISPDWNTFQILIKYFCKEHLYLLAYRTMEDM 941
            +LN+M+CAYCR+GEM++VM++M+KMDE  I+PD+NTF ILIKYFCKE LYLLAY+TMEDM
Sbjct: 417  MLNTMICAYCRAGEMDSVMEMMRKMDEQKITPDYNTFHILIKYFCKEKLYLLAYQTMEDM 476

Query: 940  HRKGHQPEEDLSASLINHLGRTGAHSEAFSVYNMLKYSKRGINKALHEKILHILLAGGLL 761
            H KGHQP+E+L +SL+  LG+  A+SEA+SVYN+L+YSKR + KALHEKILHILLAG LL
Sbjct: 477  HNKGHQPDEELCSSLMFLLGKIRAYSEAYSVYNILRYSKRTMCKALHEKILHILLAGQLL 536

Query: 760  KDAYVVVKDNAKLISEPAIKKFATSFMRKGNINLVNDVIKSIHYSGYKIDQDIFYMAVSR 581
            KDAYVVVKDNA LIS+PA+KKF+T+F++ GNINL+NDV+K I  SG KIDQ +F MA+SR
Sbjct: 537  KDAYVVVKDNAGLISKPAVKKFSTAFLKLGNINLINDVLKVIDASGCKIDQGLFQMAISR 596

Query: 580  YIEQPGKKELLLHLLQWMPGQGFPIDSSMRNLILKNSHLFGHHSISELMSKHYTALKTNK 401
            YI  P KKELL+ +L WMPGQG+ +DS+ RNLILKNSHLFG   I++++SK +   K +K
Sbjct: 597  YIALPEKKELLIQMLLWMPGQGYVVDSATRNLILKNSHLFGRQHIADVLSKQHMISKASK 656

Query: 400  SHE 392
            S +
Sbjct: 657  SRK 659


>ref|XP_004299940.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 642

 Score =  808 bits (2086), Expect = 0.0
 Identities = 397/621 (63%), Positives = 500/621 (80%), Gaps = 1/621 (0%)
 Frame = -1

Query: 2257 STPISSNGAEKHSEVRSKPL-NVSRNLSKNPSASYKARESAILDIQHSSDLSSALLRSGE 2081
            ST + ++     + + ++PL + SR +++  S  Y AR+SAIL +QHSSDL SAL R G 
Sbjct: 21   STRVKTSQVSCATTLDNQPLRHDSRRVTRPHSKQYLARQSAILQVQHSSDLESALTRLGG 80

Query: 2080 VLKAQDLNVVLRHFGKLNRRKDLYQLFDWMRQHGKTNIPSYSSYIKFVGRDSNSMKALEI 1901
             L  QDLN ++RHFG L R  DL QLF+WM+Q+GK +  SYSSYIKF+G+  N +KALEI
Sbjct: 81   SLNVQDLNAIIRHFGMLKRWHDLSQLFEWMQQNGKVSASSYSSYIKFMGKSLNPVKALEI 140

Query: 1900 YNSIKDDSTRNNVSVCNSTLYCLIKSGKFDSSLKLFNQMKQAGLVPDIVTYSTLLLGCAK 1721
            YNSI+D+ST+ NV +CNS L  L++SGKFD S+KLF+QMKQ GL PD VTYSTLL GC K
Sbjct: 141  YNSIQDESTKKNVHICNSVLGSLVRSGKFDGSIKLFHQMKQDGLTPDAVTYSTLLAGCIK 200

Query: 1720 VKDGYFKAMELVREIKSRGLHMDSVVYGTLISVCASNNQCEAAEKYFDEMKSEGHSPNVF 1541
             K GY KA+ELV+E+++  L MDSV+YGTL+++CASNN+ E AE YF +MK EGH PN F
Sbjct: 201  FKHGYSKALELVQELQNNELQMDSVIYGTLLAICASNNKWEEAESYFKQMKDEGHLPNEF 260

Query: 1540 HYSSLLNAYAVVGNYGKADELIQQMRSAGLVLNKVILTTLLKVYVKGGLFVKSRELLDEL 1361
            HYSSLLNAY++ GNY KAD+++Q M+SAGLV NKV LTTLLK YV+GGLF KSRELL EL
Sbjct: 261  HYSSLLNAYSISGNYKKADDVVQDMKSAGLVPNKVTLTTLLKAYVRGGLFEKSRELLTEL 320

Query: 1360 QALGYAEDEMPYCLLMDGLAKCGKLAVAKAVFDEMREKEVKNDGYSYSIMISALCRSGLI 1181
            +ALGYAEDEMPYC+LMD  AK G++  AK VFDE++EK V++DGYSYSIMISA CR GL+
Sbjct: 321  EALGYAEDEMPYCILMDAFAKAGRIEDAKLVFDEIKEKSVRSDGYSYSIMISAFCRGGLV 380

Query: 1180 EEAKQLACEFEIKYGKYDVVILNSMLCAYCRSGEMENVMKLMKKMDESAISPDWNTFQIL 1001
            ++AKQLA +FE  Y KYD+V+LN+M+CAYCR+GEM++VM++++KMDE  I+PD NTF IL
Sbjct: 381  DDAKQLAKDFERTYDKYDLVMLNTMICAYCRAGEMDSVMEMLRKMDELKITPDNNTFHIL 440

Query: 1000 IKYFCKEHLYLLAYRTMEDMHRKGHQPEEDLSASLINHLGRTGAHSEAFSVYNMLKYSKR 821
            IKYFCKE LY+LAY+TMEDMH KG+ P+E+L +SL+ HLG+  A+SEA+S+YN+L+YSKR
Sbjct: 441  IKYFCKEKLYMLAYKTMEDMHNKGYPPDEELCSSLMFHLGKIRAYSEAYSIYNILRYSKR 500

Query: 820  GINKALHEKILHILLAGGLLKDAYVVVKDNAKLISEPAIKKFATSFMRKGNINLVNDVIK 641
             + KALHEKILHIL+AG LLKDAYVVVKDN +LIS+ A  KFAT+FM+ GNINL+NDV+K
Sbjct: 501  TMCKALHEKILHILVAGRLLKDAYVVVKDNPRLISKAATMKFATAFMKLGNINLINDVLK 560

Query: 640  SIHYSGYKIDQDIFYMAVSRYIEQPGKKELLLHLLQWMPGQGFPIDSSMRNLILKNSHLF 461
            +I  SG KIDQ IF MA+SRYI  P KK+LLL LLQWMPGQG+ +DSS RNLILKNSHLF
Sbjct: 561  AIDGSGCKIDQGIFQMAISRYISDPDKKDLLLQLLQWMPGQGYTVDSSTRNLILKNSHLF 620

Query: 460  GHHSISELMSKHYTALKTNKS 398
                I+E++SK +   K +KS
Sbjct: 621  DRQHIAEMLSKQHMISKASKS 641


>gb|EOX98058.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma
            cacao]
          Length = 717

 Score =  806 bits (2081), Expect = 0.0
 Identities = 413/661 (62%), Positives = 516/661 (78%), Gaps = 1/661 (0%)
 Frame = -1

Query: 2380 MEVSSVLGNGFQ-PAFVFPTTRPIKNSKXXXXXXXXXXPQRVSTPISSNGAEKHSEVRSK 2204
            ME+SS+LG GF      FP+      S             R S   + N A   +    K
Sbjct: 1    MEISSLLGTGFHLQILTFPSP-----SSFPRIPSLSPPTPRASIS-NLNSATSTTATPVK 54

Query: 2203 PLNVSRNLSKNPSASYKARESAILDIQHSSDLSSALLRSGEVLKAQDLNVVLRHFGKLNR 2024
              N +R  SK    SY  R+SA+L++Q SSDL+SAL   G +LK QDLNV++RHFGKL +
Sbjct: 55   EPNPTRPHSK----SYLQRKSALLEVQQSSDLNSALQNFGGILKPQDLNVIIRHFGKLGK 110

Query: 2023 RKDLYQLFDWMRQHGKTNIPSYSSYIKFVGRDSNSMKALEIYNSIKDDSTRNNVSVCNST 1844
               L +LF WM+QHGKTN  SYSSYIK +G+  + +KALEIYNSI D+STR NV +CNS 
Sbjct: 111  WHHLSELFAWMQQHGKTNGSSYSSYIKIMGKKLSPIKALEIYNSIPDESTRINVFICNSL 170

Query: 1843 LYCLIKSGKFDSSLKLFNQMKQAGLVPDIVTYSTLLLGCAKVKDGYFKAMELVREIKSRG 1664
            L  L+++GKF+S +KLF++MKQ GL PD VTY+TLL GC K+K G+ KA+EL++E+K  G
Sbjct: 171  LSSLVRNGKFESGIKLFDKMKQDGLTPDSVTYNTLLAGCIKIKHGHSKALELIKELKYNG 230

Query: 1663 LHMDSVVYGTLISVCASNNQCEAAEKYFDEMKSEGHSPNVFHYSSLLNAYAVVGNYGKAD 1484
            L MDSV+YGTL++VCAS+   E A+ YF++M+ EGHSPN++HYSSLLNAY+  GNY KAD
Sbjct: 231  LKMDSVMYGTLLAVCASSGLHEEAQNYFNQMREEGHSPNLYHYSSLLNAYSYDGNYCKAD 290

Query: 1483 ELIQQMRSAGLVLNKVILTTLLKVYVKGGLFVKSRELLDELQALGYAEDEMPYCLLMDGL 1304
            EL++QM+S+GLV NKVILTTLLKVYV+GGLF KS +LL EL+ALGYAEDEMP+CLLMDGL
Sbjct: 291  ELVEQMKSSGLVPNKVILTTLLKVYVRGGLFEKSTKLLAELEALGYAEDEMPFCLLMDGL 350

Query: 1303 AKCGKLAVAKAVFDEMREKEVKNDGYSYSIMISALCRSGLIEEAKQLACEFEIKYGKYDV 1124
            +K G+L  A++VF EM++K VK+DGYS+SIMISALCR+GL EEAK+LA +FE +Y KYD+
Sbjct: 351  SKAGRLDEARSVFVEMQQKCVKSDGYSHSIMISALCRAGLFEEAKELAQDFEAQYNKYDL 410

Query: 1123 VILNSMLCAYCRSGEMENVMKLMKKMDESAISPDWNTFQILIKYFCKEHLYLLAYRTMED 944
            V+LN+MLCAYCR+GEME+VM+ MKKMDE AISPD+NTF ILIKYFCKE LYLLAY+TMED
Sbjct: 411  VMLNTMLCAYCRAGEMESVMQTMKKMDELAISPDYNTFHILIKYFCKEKLYLLAYKTMED 470

Query: 943  MHRKGHQPEEDLSASLINHLGRTGAHSEAFSVYNMLKYSKRGINKALHEKILHILLAGGL 764
            MH KG+ PEE+L +SLI  LG+  AH EAFSVYNML+YSKR + KALHEKILHIL+AG L
Sbjct: 471  MHGKGYHPEEELCSSLIFQLGKMKAHLEAFSVYNMLRYSKRTMCKALHEKILHILIAGQL 530

Query: 763  LKDAYVVVKDNAKLISEPAIKKFATSFMRKGNINLVNDVIKSIHYSGYKIDQDIFYMAVS 584
            LKDAYVVVKDNA+LIS+PAI KFAT+FM+ GNIN++NDV+K +H SGYKIDQ +F MA+S
Sbjct: 531  LKDAYVVVKDNAELISQPAITKFATAFMKLGNINMINDVLKVLHGSGYKIDQGLFQMAIS 590

Query: 583  RYIEQPGKKELLLHLLQWMPGQGFPIDSSMRNLILKNSHLFGHHSISELMSKHYTALKTN 404
            RY+ QP KKELLL LLQWMPG G+ +DSS RN+ILKNS L G    +E++SK +   K +
Sbjct: 591  RYLGQPEKKELLLQLLQWMPGHGYVVDSSTRNMILKNSQLLGRQLTAEILSKQHMMSKVS 650

Query: 403  K 401
            +
Sbjct: 651  R 651


>gb|EOX98059.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 2
            [Theobroma cacao]
          Length = 649

 Score =  799 bits (2064), Expect = 0.0
 Identities = 413/662 (62%), Positives = 515/662 (77%), Gaps = 1/662 (0%)
 Frame = -1

Query: 2380 MEVSSVLGNGFQ-PAFVFPTTRPIKNSKXXXXXXXXXXPQRVSTPISSNGAEKHSEVRSK 2204
            ME+SS+LG GF      FP+      S             R S   + N A   +    K
Sbjct: 1    MEISSLLGTGFHLQILTFPSP-----SSFPRIPSLSPPTPRASIS-NLNSATSTTATPVK 54

Query: 2203 PLNVSRNLSKNPSASYKARESAILDIQHSSDLSSALLRSGEVLKAQDLNVVLRHFGKLNR 2024
              N +R  SK    SY  R+SA+L++Q SSDL+SAL   G +LK QDLNV++RHFGKL +
Sbjct: 55   EPNPTRPHSK----SYLQRKSALLEVQQSSDLNSALQNFGGILKPQDLNVIIRHFGKLGK 110

Query: 2023 RKDLYQLFDWMRQHGKTNIPSYSSYIKFVGRDSNSMKALEIYNSIKDDSTRNNVSVCNST 1844
               L +LF WM+QHGKTN  SYSSYIK +G+  + +KALEIYNSI D+STR NV +CNS 
Sbjct: 111  WHHLSELFAWMQQHGKTNGSSYSSYIKIMGKKLSPIKALEIYNSIPDESTRINVFICNSL 170

Query: 1843 LYCLIKSGKFDSSLKLFNQMKQAGLVPDIVTYSTLLLGCAKVKDGYFKAMELVREIKSRG 1664
            L  L+++GKF+S +KLF++MKQ GL PD VTY+TLL GC K+K G+ KA+EL++E+K  G
Sbjct: 171  LSSLVRNGKFESGIKLFDKMKQDGLTPDSVTYNTLLAGCIKIKHGHSKALELIKELKYNG 230

Query: 1663 LHMDSVVYGTLISVCASNNQCEAAEKYFDEMKSEGHSPNVFHYSSLLNAYAVVGNYGKAD 1484
            L MDSV+YGTL++VCAS+   E A+ YF++M+ EGHSPN++HYSSLLNAY+  GNY KAD
Sbjct: 231  LKMDSVMYGTLLAVCASSGLHEEAQNYFNQMREEGHSPNLYHYSSLLNAYSYDGNYCKAD 290

Query: 1483 ELIQQMRSAGLVLNKVILTTLLKVYVKGGLFVKSRELLDELQALGYAEDEMPYCLLMDGL 1304
            EL++QM+S+GLV NKVILTTLLKVYV+GGLF KS +LL EL+ALGYAEDEMP+CLLMDGL
Sbjct: 291  ELVEQMKSSGLVPNKVILTTLLKVYVRGGLFEKSTKLLAELEALGYAEDEMPFCLLMDGL 350

Query: 1303 AKCGKLAVAKAVFDEMREKEVKNDGYSYSIMISALCRSGLIEEAKQLACEFEIKYGKYDV 1124
            +K G+L  A++VF EM++K VK+DGYS+SIMISALCR+GL EEAK+LA +FE +Y KYD+
Sbjct: 351  SKAGRLDEARSVFVEMQQKCVKSDGYSHSIMISALCRAGLFEEAKELAQDFEAQYNKYDL 410

Query: 1123 VILNSMLCAYCRSGEMENVMKLMKKMDESAISPDWNTFQILIKYFCKEHLYLLAYRTMED 944
            V+LN+MLCAYCR+GEME+VM+ MKKMDE AISPD+NTF ILIKYFCKE LYLLAY+TMED
Sbjct: 411  VMLNTMLCAYCRAGEMESVMQTMKKMDELAISPDYNTFHILIKYFCKEKLYLLAYKTMED 470

Query: 943  MHRKGHQPEEDLSASLINHLGRTGAHSEAFSVYNMLKYSKRGINKALHEKILHILLAGGL 764
            MH KG+ PEE+L +SLI  LG+  AH EAFSVYNML+YSKR + KALHEKILHIL+AG L
Sbjct: 471  MHGKGYHPEEELCSSLIFQLGKMKAHLEAFSVYNMLRYSKRTMCKALHEKILHILIAGQL 530

Query: 763  LKDAYVVVKDNAKLISEPAIKKFATSFMRKGNINLVNDVIKSIHYSGYKIDQDIFYMAVS 584
            LKDAYVVVKDNA+LIS+PAI KFAT+FM+ GNIN++NDV+K +H SGYKIDQ    MA+S
Sbjct: 531  LKDAYVVVKDNAELISQPAITKFATAFMKLGNINMINDVLKVLHGSGYKIDQ----MAIS 586

Query: 583  RYIEQPGKKELLLHLLQWMPGQGFPIDSSMRNLILKNSHLFGHHSISELMSKHYTALKTN 404
            RY+ QP KKELLL LLQWMPG G+ +DSS RN+ILKNS L G    +E++SK +   K +
Sbjct: 587  RYLGQPEKKELLLQLLQWMPGHGYVVDSSTRNMILKNSQLLGRQLTAEILSKQHMMSKVS 646

Query: 403  KS 398
            +S
Sbjct: 647  RS 648


>gb|ESW20506.1| hypothetical protein PHAVU_006G214900g [Phaseolus vulgaris]
          Length = 639

 Score =  769 bits (1986), Expect = 0.0
 Identities = 387/617 (62%), Positives = 484/617 (78%)
 Frame = -1

Query: 2260 VSTPISSNGAEKHSEVRSKPLNVSRNLSKNPSASYKARESAILDIQHSSDLSSALLRSGE 2081
            ++ P S++ AE  S+      +V    SK P  S  AR+SA L+IQ SSDL SAL R GE
Sbjct: 20   LAIPASASTAEPLSQTPPHRNSVKLRSSK-PFPS--ARKSATLEIQRSSDLPSALARLGE 76

Query: 2080 VLKAQDLNVVLRHFGKLNRRKDLYQLFDWMRQHGKTNIPSYSSYIKFVGRDSNSMKALEI 1901
             L  +DLN  L HF   N+   + QLF WM+++ K ++ SYS Y++F+  + ++ + L++
Sbjct: 77   TLTVKDLNAALYHFKNSNKFNHISQLFKWMQENNKLDVSSYSHYMRFMANNLDAAEMLQL 136

Query: 1900 YNSIKDDSTRNNVSVCNSTLYCLIKSGKFDSSLKLFNQMKQAGLVPDIVTYSTLLLGCAK 1721
            Y+SI+D+S R N+ VCNS L CLIK GKFDS +KLF QM+  GLVPD VTYSTLL GC K
Sbjct: 137  YHSIQDESARKNILVCNSVLGCLIKKGKFDSGMKLFRQMQLDGLVPDPVTYSTLLAGCIK 196

Query: 1720 VKDGYFKAMELVREIKSRGLHMDSVVYGTLISVCASNNQCEAAEKYFDEMKSEGHSPNVF 1541
            +++GY KA+EL++E++   L MD V+YGT+++VCASN + E AEKYF++MK EGHS NV+
Sbjct: 197  IENGYPKALELIQELQHSKLQMDGVIYGTILAVCASNGKWEEAEKYFNQMKDEGHSRNVY 256

Query: 1540 HYSSLLNAYAVVGNYGKADELIQQMRSAGLVLNKVILTTLLKVYVKGGLFVKSRELLDEL 1361
            HYSSLLNAY+  GNY KAD L Q M+S GLV NKVILTTLLKVYVKGGLF KSRELL EL
Sbjct: 257  HYSSLLNAYSTCGNYKKADILFQDMKSEGLVPNKVILTTLLKVYVKGGLFDKSRELLAEL 316

Query: 1360 QALGYAEDEMPYCLLMDGLAKCGKLAVAKAVFDEMREKEVKNDGYSYSIMISALCRSGLI 1181
            ++LGYAEDEMPYC+LMDGLAK G++  AK +FDEM +  V++DGY++SIMISALCRS L 
Sbjct: 317  KSLGYAEDEMPYCILMDGLAKAGQIHEAKLIFDEMMKNHVRSDGYAHSIMISALCRSKLF 376

Query: 1180 EEAKQLACEFEIKYGKYDVVILNSMLCAYCRSGEMENVMKLMKKMDESAISPDWNTFQIL 1001
             EAKQLA +FE    KYD+VILNSMLCA+CR GEME+VM+ +KKMDE AISP +NTF IL
Sbjct: 377  REAKQLAKDFETTSNKYDIVILNSMLCAFCRVGEMESVMETLKKMDELAISPSYNTFHIL 436

Query: 1000 IKYFCKEHLYLLAYRTMEDMHRKGHQPEEDLSASLINHLGRTGAHSEAFSVYNMLKYSKR 821
            IKYFC+E +YLLAYRTM+DMH KGHQP E+L ++LI+HLG+  A+SEAFSVYNML+Y KR
Sbjct: 437  IKYFCREKMYLLAYRTMKDMHSKGHQPGEELCSTLISHLGQVNAYSEAFSVYNMLRYGKR 496

Query: 820  GINKALHEKILHILLAGGLLKDAYVVVKDNAKLISEPAIKKFATSFMRKGNINLVNDVIK 641
             + K+LHEKIL+ILLAG LLKDAYVVVKDNAK IS P  KKFA +FM+ GNIN +NDV+K
Sbjct: 497  TMCKSLHEKILYILLAGHLLKDAYVVVKDNAKYISRPPTKKFAIAFMKSGNINYINDVLK 556

Query: 640  SIHYSGYKIDQDIFYMAVSRYIEQPGKKELLLHLLQWMPGQGFPIDSSMRNLILKNSHLF 461
            ++H SGYK+DQD+F MAVSRY+ +P KK+LLLHLLQWM GQG+ +DSS RNLILK+SHLF
Sbjct: 557  TLHDSGYKLDQDLFAMAVSRYLGEPEKKDLLLHLLQWMSGQGYMVDSSTRNLILKHSHLF 616

Query: 460  GHHSISELMSKHYTALK 410
            G   I+E++SK    LK
Sbjct: 617  GRQLIAEVLSKQQVQLK 633


>ref|XP_004486236.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like [Cicer arietinum]
          Length = 642

 Score =  766 bits (1978), Expect = 0.0
 Identities = 376/616 (61%), Positives = 483/616 (78%)
 Frame = -1

Query: 2257 STPISSNGAEKHSEVRSKPLNVSRNLSKNPSASYKARESAILDIQHSSDLSSALLRSGEV 2078
            S   S++  E  +     P    +++    S  + AR+SA L +  +SDL+S L + G+ 
Sbjct: 21   SISASASITEPPTPTPPSPSQTKKSIKFVNSKPFSARKSAKLQLHRASDLNSVLSKVGKT 80

Query: 2077 LKAQDLNVVLRHFGKLNRRKDLYQLFDWMRQHGKTNIPSYSSYIKFVGRDSNSMKALEIY 1898
            L  ++LN  L HFG  N+   + QLF WM+++ K ++ SYS+YIKF+    ++   L++Y
Sbjct: 81   LTVKELNSTLHHFGNSNKFNHISQLFLWMQENKKLDVYSYSNYIKFMANKLDASTVLKLY 140

Query: 1897 NSIKDDSTRNNVSVCNSTLYCLIKSGKFDSSLKLFNQMKQAGLVPDIVTYSTLLLGCAKV 1718
            N+I+D+S ++NV VCNS L CLIK GKFD+++KLF+QMKQ GLVPD+VTYS L+ GC KV
Sbjct: 141  NNIQDESAKDNVYVCNSVLSCLIKKGKFDTAIKLFHQMKQDGLVPDLVTYSMLIAGCVKV 200

Query: 1717 KDGYFKAMELVREIKSRGLHMDSVVYGTLISVCASNNQCEAAEKYFDEMKSEGHSPNVFH 1538
            KDGY KA++L++E++   L MD+V+YG +++VCASN + E AE YF+ MK+EGHSPNV+H
Sbjct: 201  KDGYSKALQLIQELQDNKLRMDNVIYGAILAVCASNGKWEEAEHYFNGMKNEGHSPNVYH 260

Query: 1537 YSSLLNAYAVVGNYGKADELIQQMRSAGLVLNKVILTTLLKVYVKGGLFVKSRELLDELQ 1358
            YSSLLNAY+  GN+ KAD LIQ M+S GLV NKVILTTLLKVYV+GGL  KSRELL +L+
Sbjct: 261  YSSLLNAYSASGNFKKADSLIQDMKSEGLVPNKVILTTLLKVYVRGGLLEKSRELLTKLE 320

Query: 1357 ALGYAEDEMPYCLLMDGLAKCGKLAVAKAVFDEMREKEVKNDGYSYSIMISALCRSGLIE 1178
            +L YAEDEMPYC+LMDGLAK G++  AK VFDEM +K V++DGY++SIMISA CR+ L E
Sbjct: 321  SLSYAEDEMPYCVLMDGLAKAGQVHEAKIVFDEMMKKHVRSDGYAHSIMISAFCRAKLFE 380

Query: 1177 EAKQLACEFEIKYGKYDVVILNSMLCAYCRSGEMENVMKLMKKMDESAISPDWNTFQILI 998
            EAKQLA  F+  + KYDVVI+NSMLCA+CR+GEME+VM+ ++KMDE AISPD+NTF ILI
Sbjct: 381  EAKQLAKNFQTTFNKYDVVIMNSMLCAFCRAGEMESVMETLRKMDELAISPDYNTFNILI 440

Query: 997  KYFCKEHLYLLAYRTMEDMHRKGHQPEEDLSASLINHLGRTGAHSEAFSVYNMLKYSKRG 818
            KYFC++++YLLAY+TMEDMH KG+QP E+L +SLI HLG+  A+SEAFSVYNMLKYSKR 
Sbjct: 441  KYFCRQNMYLLAYQTMEDMHSKGYQPVEELCSSLIYHLGQANAYSEAFSVYNMLKYSKRT 500

Query: 817  INKALHEKILHILLAGGLLKDAYVVVKDNAKLISEPAIKKFATSFMRKGNINLVNDVIKS 638
            I K LHEKILHILLAG LLKDAYVV KDNA  IS    KKFA++FM+ GNINL+NDV+K+
Sbjct: 501  IRKTLHEKILHILLAGKLLKDAYVVFKDNATFISGHTTKKFASAFMKLGNINLINDVMKT 560

Query: 637  IHYSGYKIDQDIFYMAVSRYIEQPGKKELLLHLLQWMPGQGFPIDSSMRNLILKNSHLFG 458
            +H  GYKIDQD+F MAV+RY+ QP KK+LLLHLLQWMPGQG+ +D S RNLILKNSHLFG
Sbjct: 561  LHNCGYKIDQDLFEMAVTRYLGQPEKKDLLLHLLQWMPGQGYVVDPSTRNLILKNSHLFG 620

Query: 457  HHSISELMSKHYTALK 410
               I+E++SK   +LK
Sbjct: 621  RQLIAEVLSKQRVSLK 636


>ref|XP_006417404.1| hypothetical protein EUTSA_v10007006mg [Eutrema salsugineum]
            gi|557095175|gb|ESQ35757.1| hypothetical protein
            EUTSA_v10007006mg [Eutrema salsugineum]
          Length = 666

 Score =  766 bits (1977), Expect = 0.0
 Identities = 384/620 (61%), Positives = 486/620 (78%), Gaps = 1/620 (0%)
 Frame = -1

Query: 2263 RVSTPISSNGAEKHSEVRSKPLNVSR-NLSKNPSASYKARESAILDIQHSSDLSSALLRS 2087
            R  T  ++  A   S V   P  V+  + SK  S SY  R+SAI +++ S D  S+L R 
Sbjct: 34   RTLTAATATSAAAVSTVAESPATVAEASRSKRHSKSYLTRKSAISEVERSPDFLSSLQRL 93

Query: 2086 GEVLKAQDLNVVLRHFGKLNRRKDLYQLFDWMRQHGKTNIPSYSSYIKFVGRDSNSMKAL 1907
              VLK QDLNV+LR FG   R +DL QLFDWM+Q GK ++ +YSS IKFVG  S S KAL
Sbjct: 94   AGVLKVQDLNVILRDFGISGRWQDLIQLFDWMQQQGKISVSTYSSCIKFVGAKSVS-KAL 152

Query: 1906 EIYNSIKDDSTRNNVSVCNSTLYCLIKSGKFDSSLKLFNQMKQAGLVPDIVTYSTLLLGC 1727
            EIY SI D+ST+ NV +CNS L CL+K+GK +S  KLF+QMK+ GL PD++TY+TLL GC
Sbjct: 153  EIYQSIPDESTKINVYICNSILSCLVKNGKLESCFKLFDQMKRDGLKPDVITYNTLLAGC 212

Query: 1726 AKVKDGYFKAMELVREIKSRGLHMDSVVYGTLISVCASNNQCEAAEKYFDEMKSEGHSPN 1547
             KVK+GY KAMELV E+   G+ MD V+YGT++++CASN +CE AE +  +MK +GHSPN
Sbjct: 213  IKVKNGYSKAMELVGELPHNGIQMDGVMYGTVLAICASNGRCEEAESFIQQMKVKGHSPN 272

Query: 1546 VFHYSSLLNAYAVVGNYGKADELIQQMRSAGLVLNKVILTTLLKVYVKGGLFVKSRELLD 1367
            ++HYSSLLN+Y+  G+Y KADEL+ +M+S G+V NKV++TTLLKVY++GGLF +SRELL 
Sbjct: 273  IYHYSSLLNSYSWKGDYKKADELMTEMKSVGIVPNKVMMTTLLKVYIRGGLFERSRELLS 332

Query: 1366 ELQALGYAEDEMPYCLLMDGLAKCGKLAVAKAVFDEMREKEVKNDGYSYSIMISALCRSG 1187
            EL++ GYAE+EMPYC+LMDGL+K GK   A+++FDEM+ K VK+DGY+ SIMISALCRS 
Sbjct: 333  ELESAGYAENEMPYCMLMDGLSKAGKFEEARSIFDEMKGKGVKSDGYANSIMISALCRSK 392

Query: 1186 LIEEAKQLACEFEIKYGKYDVVILNSMLCAYCRSGEMENVMKLMKKMDESAISPDWNTFQ 1007
              EEAKQLA + E  Y K D+V+LN+MLCAYCR+GEME+VM++MKKMDE A+SPD+NTF 
Sbjct: 393  RFEEAKQLARDSESTYEKCDLVMLNTMLCAYCRAGEMESVMRMMKKMDEQAVSPDYNTFH 452

Query: 1006 ILIKYFCKEHLYLLAYRTMEDMHRKGHQPEEDLSASLINHLGRTGAHSEAFSVYNMLKYS 827
            ILIKYF KE L+LLAY+T+ DMH KGH+ EE+L +SLI HLG+  AHSEAFSVY+ML+YS
Sbjct: 453  ILIKYFIKEKLHLLAYQTLLDMHSKGHRLEEELCSSLIYHLGKIRAHSEAFSVYSMLRYS 512

Query: 826  KRGINKALHEKILHILLAGGLLKDAYVVVKDNAKLISEPAIKKFATSFMRKGNINLVNDV 647
            KR I K LHEKILHIL+ G LLKDAYVVVKDNAK+IS+P +K+F  +FM  GN+NLVNDV
Sbjct: 513  KRTICKDLHEKILHILIHGKLLKDAYVVVKDNAKMISQPTLKRFGRAFMNSGNVNLVNDV 572

Query: 646  IKSIHYSGYKIDQDIFYMAVSRYIEQPGKKELLLHLLQWMPGQGFPIDSSMRNLILKNSH 467
            +K +H SG+KIDQ  F +A+SRYI QP KKELLL LLQWMPGQG+ +DSS RNLILKNS+
Sbjct: 573  LKVLHGSGHKIDQVQFEIAISRYISQPDKKELLLQLLQWMPGQGYVVDSSTRNLILKNSN 632

Query: 466  LFGHHSISELMSKHYTALKT 407
            LFG   I+E++SKH+ A +T
Sbjct: 633  LFGRQLIAEILSKHHIASRT 652


>ref|NP_172560.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|122242678|sp|Q0WVV0.1|PPR31_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g10910, chloroplastic; Flags: Precursor
            gi|110741600|dbj|BAE98748.1| membrane-associated
            salt-inducible protein isolog [Arabidopsis thaliana]
            gi|332190541|gb|AEE28662.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 664

 Score =  764 bits (1972), Expect = 0.0
 Identities = 386/617 (62%), Positives = 487/617 (78%), Gaps = 1/617 (0%)
 Frame = -1

Query: 2263 RVSTPISSNGAEKHSEVRSKPLNVSRN-LSKNPSASYKARESAILDIQHSSDLSSALLRS 2087
            R+ TP +   A   S V   P NV+    SK  S SY AR+SAI ++Q SSD  S+L R 
Sbjct: 36   RILTPTA---ATTSSAVIELPANVAEAPRSKRHSNSYLARKSAISEVQRSSDFLSSLQRL 92

Query: 2086 GEVLKAQDLNVVLRHFGKLNRRKDLYQLFDWMRQHGKTNIPSYSSYIKFVGRDSNSMKAL 1907
              VLK QDLNV+LR FG   R +DL QLF+WM+QHGK ++ +YSS IKFVG   N  KAL
Sbjct: 93   ATVLKVQDLNVILRDFGISGRWQDLIQLFEWMQQHGKISVSTYSSCIKFVGA-KNVSKAL 151

Query: 1906 EIYNSIKDDSTRNNVSVCNSTLYCLIKSGKFDSSLKLFNQMKQAGLVPDIVTYSTLLLGC 1727
            EIY SI D+ST+ NV +CNS L CL+K+GK DS +KLF+QMK+ GL PD+VTY+TLL GC
Sbjct: 152  EIYQSIPDESTKINVYICNSILSCLVKNGKLDSCIKLFDQMKRDGLKPDVVTYNTLLAGC 211

Query: 1726 AKVKDGYFKAMELVREIKSRGLHMDSVVYGTLISVCASNNQCEAAEKYFDEMKSEGHSPN 1547
             KVK+GY KA+EL+ E+   G+ MDSV+YGT++++CASN + E AE +  +MK EGHSPN
Sbjct: 212  IKVKNGYPKAIELIGELPHNGIQMDSVMYGTVLAICASNGRSEEAENFIQQMKVEGHSPN 271

Query: 1546 VFHYSSLLNAYAVVGNYGKADELIQQMRSAGLVLNKVILTTLLKVYVKGGLFVKSRELLD 1367
            ++HYSSLLN+Y+  G+Y KADEL+ +M+S GLV NKV++TTLLKVY+KGGLF +SRELL 
Sbjct: 272  IYHYSSLLNSYSWKGDYKKADELMTEMKSIGLVPNKVMMTTLLKVYIKGGLFDRSRELLS 331

Query: 1366 ELQALGYAEDEMPYCLLMDGLAKCGKLAVAKAVFDEMREKEVKNDGYSYSIMISALCRSG 1187
            EL++ GYAE+EMPYC+LMDGL+K GKL  A+++FD+M+ K V++DGY+ SIMISALCRS 
Sbjct: 332  ELESAGYAENEMPYCMLMDGLSKAGKLEEARSIFDDMKGKGVRSDGYANSIMISALCRSK 391

Query: 1186 LIEEAKQLACEFEIKYGKYDVVILNSMLCAYCRSGEMENVMKLMKKMDESAISPDWNTFQ 1007
              +EAK+L+ + E  Y K D+V+LN+MLCAYCR+GEME+VM++MKKMDE A+SPD+NTF 
Sbjct: 392  RFKEAKELSRDSETTYEKCDLVMLNTMLCAYCRAGEMESVMRMMKKMDEQAVSPDYNTFH 451

Query: 1006 ILIKYFCKEHLYLLAYRTMEDMHRKGHQPEEDLSASLINHLGRTGAHSEAFSVYNMLKYS 827
            ILIKYF KE L+LLAY+T  DMH KGH+ EE+L +SLI HLG+  A +EAFSVYNML+YS
Sbjct: 452  ILIKYFIKEKLHLLAYQTTLDMHSKGHRLEEELCSSLIYHLGKIRAQAEAFSVYNMLRYS 511

Query: 826  KRGINKALHEKILHILLAGGLLKDAYVVVKDNAKLISEPAIKKFATSFMRKGNINLVNDV 647
            KR I K LHEKILHIL+ G LLKDAY+VVKDNAK+IS+P +KKF  +FM  GNINLVNDV
Sbjct: 512  KRTICKELHEKILHILIQGNLLKDAYIVVKDNAKMISQPTLKKFGRAFMISGNINLVNDV 571

Query: 646  IKSIHYSGYKIDQDIFYMAVSRYIEQPGKKELLLHLLQWMPGQGFPIDSSMRNLILKNSH 467
            +K +H SG+KIDQ  F +A+SRYI QP KKELLL LLQWMPGQG+ +DSS RNLILKNSH
Sbjct: 572  LKVLHGSGHKIDQVQFEIAISRYISQPDKKELLLQLLQWMPGQGYVVDSSTRNLILKNSH 631

Query: 466  LFGHHSISELMSKHYTA 416
            +FG   I+E++SKH+ A
Sbjct: 632  MFGRLLIAEILSKHHVA 648


>ref|XP_006343484.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like isoform X4 [Solanum tuberosum]
          Length = 539

 Score =  763 bits (1971), Expect = 0.0
 Identities = 378/539 (70%), Positives = 450/539 (83%), Gaps = 1/539 (0%)
 Frame = -1

Query: 1993 MRQHGKTNIPSYSSYIKFVGRDSNSMKALEIYNSIKDDSTRNNVSVCNSTLYCLIKSGKF 1814
            M+Q+ K N+ SYSSY+KF+G+  + + A+E+Y  IKD S + NVSVCN+ L  LIK+GK 
Sbjct: 1    MQQNQKINVASYSSYVKFMGKSLSCVDAVEMYRDIKDRSIKYNVSVCNAFLSSLIKNGKS 60

Query: 1813 DSSLKLFNQMKQAGLVPDIVTYSTLLLGCAKVKDGYFKAMELVREIKSRGLHMDSVVYGT 1634
            +SSLKLF QMK+ GLVPD+ TYSTLL GCAKV  GY+KA+ELV+E+ S GL MDSV YG+
Sbjct: 61   ESSLKLFTQMKRDGLVPDVFTYSTLLAGCAKVNGGYYKALELVQELMSNGLQMDSVTYGS 120

Query: 1633 LISVCASNNQCEAAEKYFDEMKSEGHSPNVFHYSSLLNAYAVVGNYGKADELIQQMRSAG 1454
            L+SVCAS+ +C  A KYF +MK EGHSPNV+HYSSLLNAY+   NY KA+ LI++MRSAG
Sbjct: 121  LLSVCASHKECNEAAKYFQKMKDEGHSPNVYHYSSLLNAYSADRNYEKAEVLIEEMRSAG 180

Query: 1453 LVLNKVILTTLLKVYVKGGLFVKSRELLDELQALGYAEDEMPYCLLMDGLAKCGKLAVAK 1274
            LVLNKVI TTLLKVYVKGGLF KS+ELL EL+ALGYA+DEMP+CLLMDGLAK G L  AK
Sbjct: 181  LVLNKVIYTTLLKVYVKGGLFEKSKELLKELEALGYAKDEMPFCLLMDGLAKSGHLLEAK 240

Query: 1273 AVFDEMREKEVKN-DGYSYSIMISALCRSGLIEEAKQLACEFEIKYGKYDVVILNSMLCA 1097
            +VFDEM EK VK  DGYSYSIMISA CRSGL+E+AK++A EFE KY KYD+VILN+ML A
Sbjct: 241  SVFDEMMEKHVKTADGYSYSIMISAFCRSGLLEDAKKVASEFEEKYDKYDIVILNAMLSA 300

Query: 1096 YCRSGEMENVMKLMKKMDESAISPDWNTFQILIKYFCKEHLYLLAYRTMEDMHRKGHQPE 917
            YCR+G+MENVM +MKKMD+SAISPDWNTF ILI+YFCKE LYLLAYRTMEDMH KGHQPE
Sbjct: 301  YCRAGKMENVMSMMKKMDDSAISPDWNTFNILIRYFCKEKLYLLAYRTMEDMHSKGHQPE 360

Query: 916  EDLSASLINHLGRTGAHSEAFSVYNMLKYSKRGINKALHEKILHILLAGGLLKDAYVVVK 737
            E L +SLI HLG+TGAHSEAFSVYNML+YSKR I+ ALHE ILHIL+AG LLKDAYVVVK
Sbjct: 361  EGLCSSLIYHLGKTGAHSEAFSVYNMLRYSKRTISNALHEHILHILIAGRLLKDAYVVVK 420

Query: 736  DNAKLISEPAIKKFATSFMRKGNINLVNDVIKSIHYSGYKIDQDIFYMAVSRYIEQPGKK 557
            DNA  IS+PAIKKF+ +FMR GN+NL+NDV+ ++H SG+KIDQ++F +A++RYI +P KK
Sbjct: 421  DNAGFISQPAIKKFSVNFMRSGNVNLINDVMNAMHSSGHKIDQELFDLAIARYIAKPEKK 480

Query: 556  ELLLHLLQWMPGQGFPIDSSMRNLILKNSHLFGHHSISELMSKHYTALKTNKSHEGRTR 380
            ELLL LL+WMPG+G+ IDSS RNLILKNSHLFGH  I+E +SKH    K  K H+   R
Sbjct: 481  ELLLWLLKWMPGKGYAIDSSTRNLILKNSHLFGHQLIAESLSKHLVMSKKVKLHKENAR 539


>ref|XP_002889841.1| hypothetical protein ARALYDRAFT_888388 [Arabidopsis lyrata subsp.
            lyrata] gi|297335683|gb|EFH66100.1| hypothetical protein
            ARALYDRAFT_888388 [Arabidopsis lyrata subsp. lyrata]
          Length = 665

 Score =  763 bits (1969), Expect = 0.0
 Identities = 384/612 (62%), Positives = 483/612 (78%), Gaps = 1/612 (0%)
 Frame = -1

Query: 2248 ISSNGAEKHSEVRSKPLNVS-RNLSKNPSASYKARESAILDIQHSSDLSSALLRSGEVLK 2072
            ++S  A   + V   P  V+    SK  S SY  R+SAI ++Q SSD  S+L R   VLK
Sbjct: 39   LTSTAATTSTAVVESPATVAGAPRSKRHSNSYLTRKSAISEVQRSSDFLSSLHRLERVLK 98

Query: 2071 AQDLNVVLRHFGKLNRRKDLYQLFDWMRQHGKTNIPSYSSYIKFVGRDSNSMKALEIYNS 1892
             QDLNV+LR FG   R +DL QLFDWM+QHGK ++ +YSS IKFVG   N  KALEIY S
Sbjct: 99   VQDLNVILRDFGISGRWQDLIQLFDWMQQHGKISVSTYSSCIKFVGA-KNVSKALEIYQS 157

Query: 1891 IKDDSTRNNVSVCNSTLYCLIKSGKFDSSLKLFNQMKQAGLVPDIVTYSTLLLGCAKVKD 1712
            I D+ST+ NV +CNS L CL+K+GK DS +KLF+QMK+ GL PD++TY+TLL GC KVK+
Sbjct: 158  IPDESTKINVYICNSILSCLVKNGKLDSCIKLFDQMKRGGLKPDVITYNTLLAGCIKVKN 217

Query: 1711 GYFKAMELVREIKSRGLHMDSVVYGTLISVCASNNQCEAAEKYFDEMKSEGHSPNVFHYS 1532
            GY KA+EL+ E+   G+ MDSV+YGT++++CASN +CE AE +  +MK+EGHSPN++HYS
Sbjct: 218  GYPKAVELIGELPHNGIQMDSVMYGTVLAICASNGRCEEAENFIQQMKAEGHSPNIYHYS 277

Query: 1531 SLLNAYAVVGNYGKADELIQQMRSAGLVLNKVILTTLLKVYVKGGLFVKSRELLDELQAL 1352
            SLLN+Y+  G+Y KADEL+ +M+S GLV NKV++TTLLKVY+KGGLF +SRELL EL++ 
Sbjct: 278  SLLNSYSWKGDYKKADELMTEMKSIGLVPNKVMMTTLLKVYIKGGLFDRSRELLSELESA 337

Query: 1351 GYAEDEMPYCLLMDGLAKCGKLAVAKAVFDEMREKEVKNDGYSYSIMISALCRSGLIEEA 1172
            GYAE+EMPYC+LMDGL+K GKL  A+++FD+M+ K VK+DGY+ SIMISALCRS   EEA
Sbjct: 338  GYAENEMPYCMLMDGLSKAGKLEEARSIFDDMKGKGVKSDGYANSIMISALCRSKRFEEA 397

Query: 1171 KQLACEFEIKYGKYDVVILNSMLCAYCRSGEMENVMKLMKKMDESAISPDWNTFQILIKY 992
            K+L+ + E  Y K D+V+LN+MLCAYCR+GEME+VM++MKKMDE AI PD+NTF ILIKY
Sbjct: 398  KELSRDSETTYEKCDLVMLNTMLCAYCRAGEMESVMRMMKKMDEQAIIPDYNTFHILIKY 457

Query: 991  FCKEHLYLLAYRTMEDMHRKGHQPEEDLSASLINHLGRTGAHSEAFSVYNMLKYSKRGIN 812
            F KE L+LLAY+T  DMH KGH+ EE+L +SLI HLG+  A SEAFSVYNML+YSKR I 
Sbjct: 458  FIKEKLHLLAYQTTLDMHSKGHRLEEELCSSLIYHLGKIRAPSEAFSVYNMLRYSKRTIC 517

Query: 811  KALHEKILHILLAGGLLKDAYVVVKDNAKLISEPAIKKFATSFMRKGNINLVNDVIKSIH 632
            K LHEKILHIL+ G LLKDAY+VVKDNAK+IS+P +KKF  +FM  GNINLVNDV+K +H
Sbjct: 518  KELHEKILHILIHGDLLKDAYIVVKDNAKMISQPTLKKFGRAFMISGNINLVNDVLKVLH 577

Query: 631  YSGYKIDQDIFYMAVSRYIEQPGKKELLLHLLQWMPGQGFPIDSSMRNLILKNSHLFGHH 452
             SG+KIDQ  F +A+SRYI  P KKELLL LLQWMPGQG+ +DSS RNLILKNSH+FG  
Sbjct: 578  GSGHKIDQVQFEIAISRYILLPDKKELLLQLLQWMPGQGYIVDSSTRNLILKNSHMFGRL 637

Query: 451  SISELMSKHYTA 416
             I+E++SKH+ A
Sbjct: 638  LIAEILSKHHVA 649


>ref|XP_003531588.2| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like [Glycine max]
          Length = 646

 Score =  761 bits (1966), Expect = 0.0
 Identities = 376/618 (60%), Positives = 486/618 (78%), Gaps = 1/618 (0%)
 Frame = -1

Query: 2260 VSTPISSNGAEKHSEVRSKPLNVSRNLSKNPSASYKARESAILDIQHSSDLSSALLRSGE 2081
            + + ++ + +   +E  ++P   S  L  + + S  AR++A L+++++SDL+SAL R G+
Sbjct: 24   IPSSLAISASVSTTEPHTQPYRNSVKLGSSKTFS-SARKAATLEVRNASDLASALARVGD 82

Query: 2080 VLKAQDLNVVLRHFGKLNRRKDLYQLFDWMRQHGKTNIPSYSSYIKFVGRDS-NSMKALE 1904
             L  +DLN  L HF K N+   + QLF WM+++ K +  SYS YI+F+   + ++ K L+
Sbjct: 83   ALTVKDLNAALYHFKKSNKFNHISQLFSWMQENNKLDALSYSHYIRFMASHNLDAAKMLQ 142

Query: 1903 IYNSIKDDSTRNNVSVCNSTLYCLIKSGKFDSSLKLFNQMKQAGLVPDIVTYSTLLLGCA 1724
            +Y+SI++ S + NV VCNS L CLIK  KF+S+L LF QMK  GL+PD+VTY+TLL GC 
Sbjct: 143  LYHSIQNQSAKINVLVCNSVLSCLIKKAKFNSALNLFQQMKLDGLLPDLVTYTTLLAGCI 202

Query: 1723 KVKDGYFKAMELVREIKSRGLHMDSVVYGTLISVCASNNQCEAAEKYFDEMKSEGHSPNV 1544
            K+++GY KA+EL++E++   L MD V+YGT+++VCASN + E AE YF++MK EGH+PNV
Sbjct: 203  KIENGYAKALELIQELQHNKLQMDGVIYGTIMAVCASNTKWEEAEYYFNQMKDEGHTPNV 262

Query: 1543 FHYSSLLNAYAVVGNYGKADELIQQMRSAGLVLNKVILTTLLKVYVKGGLFVKSRELLDE 1364
            +HYSSL+NAY+  GNY KAD LIQ M+S GLV NKVILTTLLKVYVKGGLF KSRELL E
Sbjct: 263  YHYSSLINAYSACGNYKKADMLIQDMKSEGLVPNKVILTTLLKVYVKGGLFEKSRELLAE 322

Query: 1363 LQALGYAEDEMPYCLLMDGLAKCGKLAVAKAVFDEMREKEVKNDGYSYSIMISALCRSGL 1184
            L++LGYAEDEMPYC+ MDGLAK G++  AK +FDEM +  V++DGY++SIMISA CR+ L
Sbjct: 323  LKSLGYAEDEMPYCIFMDGLAKAGQIHEAKLIFDEMMKNHVRSDGYAHSIMISAFCRAKL 382

Query: 1183 IEEAKQLACEFEIKYGKYDVVILNSMLCAYCRSGEMENVMKLMKKMDESAISPDWNTFQI 1004
              EAKQLA +FE    KYD+VILNSMLCA+CR GEME VM+ +KKMDE AI+P +NTF I
Sbjct: 383  FREAKQLAKDFETTSNKYDLVILNSMLCAFCRVGEMERVMETLKKMDELAINPGYNTFHI 442

Query: 1003 LIKYFCKEHLYLLAYRTMEDMHRKGHQPEEDLSASLINHLGRTGAHSEAFSVYNMLKYSK 824
            LIKYFC+E +YLLAYRTM+DMH KGHQP E+L +SLI+HLG+  A+SEAFSVYNMLKYSK
Sbjct: 443  LIKYFCREKMYLLAYRTMKDMHSKGHQPVEELCSSLISHLGQVNAYSEAFSVYNMLKYSK 502

Query: 823  RGINKALHEKILHILLAGGLLKDAYVVVKDNAKLISEPAIKKFATSFMRKGNINLVNDVI 644
            R + K+LHEKILHILLAG LLKDAYVVVKDNAK IS PA KKFA++FM+ GN+N +NDV+
Sbjct: 503  RTMCKSLHEKILHILLAGQLLKDAYVVVKDNAKFISRPATKKFASAFMKSGNLNYINDVL 562

Query: 643  KSIHYSGYKIDQDIFYMAVSRYIEQPGKKELLLHLLQWMPGQGFPIDSSMRNLILKNSHL 464
            K++H  GYK+DQD+F MAVSRY++QP KK+LLLHLLQWM GQG+ +DSS RNLILKNSHL
Sbjct: 563  KTLHDCGYKLDQDLFAMAVSRYLDQPEKKDLLLHLLQWMAGQGYAVDSSTRNLILKNSHL 622

Query: 463  FGHHSISELMSKHYTALK 410
            FG   I+E++SK    LK
Sbjct: 623  FGRQLIAEVLSKQQVKLK 640

