BLASTX nr result

ID: Astragalus22_contig00004513 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00004513
         (2214 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004512460.1| PREDICTED: pentatricopeptide repeat-containi...  1038   0.0  
ref|XP_007158217.1| hypothetical protein PHAVU_002G134100g [Phas...  1033   0.0  
ref|XP_017426458.1| PREDICTED: pentatricopeptide repeat-containi...  1030   0.0  
ref|XP_006572946.1| PREDICTED: pentatricopeptide repeat-containi...  1020   0.0  
ref|XP_014521259.1| pentatricopeptide repeat-containing protein ...  1019   0.0  
ref|XP_020226361.1| pentatricopeptide repeat-containing protein ...  1018   0.0  
ref|XP_003533450.1| PREDICTED: pentatricopeptide repeat-containi...  1014   0.0  
gb|KHN40422.1| Pentatricopeptide repeat-containing protein [Glyc...  1014   0.0  
ref|XP_006572948.1| PREDICTED: pentatricopeptide repeat-containi...  1012   0.0  
ref|XP_020226365.1| pentatricopeptide repeat-containing protein ...  1003   0.0  
gb|KYP54428.1| Pentatricopeptide repeat-containing protein At1g3...  1003   0.0  
ref|XP_019422179.1| PREDICTED: pentatricopeptide repeat-containi...  1001   0.0  
ref|XP_020237290.1| pentatricopeptide repeat-containing protein ...   979   0.0  
ref|XP_020226366.1| pentatricopeptide repeat-containing protein ...   977   0.0  
ref|XP_016201186.1| pentatricopeptide repeat-containing protein ...   976   0.0  
ref|XP_015963236.1| pentatricopeptide repeat-containing protein ...   975   0.0  
gb|PNY00351.1| pentatricopeptide repeat-containing protein at1g3...   967   0.0  
gb|KRH39618.1| hypothetical protein GLYMA_09G209700 [Glycine max]     961   0.0  
gb|KHN40416.1| Pentatricopeptide repeat-containing protein [Glyc...   956   0.0  
ref|XP_020997576.1| pentatricopeptide repeat-containing protein ...   928   0.0  

>ref|XP_004512460.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
            [Cicer arietinum]
          Length = 606

 Score = 1038 bits (2683), Expect = 0.0
 Identities = 506/607 (83%), Positives = 551/607 (90%), Gaps = 1/607 (0%)
 Frame = +1

Query: 274  MTGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLF 453
            MTGT+ L+QTHFL L NN  QS ELS +FNEKGW  LLK+C +MEEFKQVHA  LK G+F
Sbjct: 1    MTGTTALNQTHFLLLTNNSHQSFELSKSFNEKGWLCLLKRCNNMEEFKQVHAYFLKCGIF 60

Query: 454  WDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYV 633
            +DSFC SNLVATCALTKWGSM+YACSIF QIEEP SF+YNTMIRGNVN+MKL EALLLYV
Sbjct: 61   FDSFCGSNLVATCALTKWGSMDYACSIFTQIEEPCSFDYNTMIRGNVNNMKLDEALLLYV 120

Query: 634  EMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWG 813
            EMLERGIEP+ FTYPFVLKACSLLG LKE +QIHG V K GLEGD+FV+NSLI+MYGK G
Sbjct: 121  EMLERGIEPDKFTYPFVLKACSLLGALKEGVQIHGHVLKTGLEGDLFVENSLINMYGKCG 180

Query: 814  EIKHACDVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADM-SSEGSCRAEESTLVNV 990
             IK ACDVF+KM  +SVASWSAIIGAH CVEMWHECLVLL DM SSEG CR EESTLV+V
Sbjct: 181  AIKDACDVFDKMGERSVASWSAIIGAHVCVEMWHECLVLLGDMMSSEGRCRPEESTLVSV 240

Query: 991  LSACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNR 1170
            LSACTHLGS NLG+ IHG LLRNISELNVVVKTSLIDMYVKCG LEKGL VF++M EKNR
Sbjct: 241  LSACTHLGSYNLGRFIHGNLLRNISELNVVVKTSLIDMYVKCGCLEKGLHVFRNMPEKNR 300

Query: 1171 YSYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKN 1350
            YSYTVMISGLA+HGHG++ALE+F+EM+ +GL PDDVVYVGVLSACSHAGLV+EGLQCFK 
Sbjct: 301  YSYTVMISGLAVHGHGKEALEVFSEMVEQGLEPDDVVYVGVLSACSHAGLVDEGLQCFKR 360

Query: 1351 MQFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLE 1530
            MQFEHKIKPTIQHYGCMVDLMGR+GM+KEAY+LIKSM IKPNDVVWRSLLSACKVHL+LE
Sbjct: 361  MQFEHKIKPTIQHYGCMVDLMGRSGMLKEAYELIKSMPIKPNDVVWRSLLSACKVHLNLE 420

Query: 1531 IGEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADKKLVQTPGFSLVEAK 1710
            IG+IAA NLF+LN NNPGDYLVLANMYAK QKWD+VAKIRRKMADK LVQTPGFSLVEAK
Sbjct: 421  IGQIAADNLFMLNPNNPGDYLVLANMYAKVQKWDEVAKIRRKMADKHLVQTPGFSLVEAK 480

Query: 1711 RNVYKFVSQDKSQPQWNTIYDMIHQMEWQLKFEGYVPDTSQVLLDVDEEEKRERLKCHSQ 1890
            R VYKFVS DKS PQWN +YDMIHQMEWQLKFEGYV DTSQVLLDVDEEEKRERLKCHSQ
Sbjct: 481  RKVYKFVSLDKSSPQWNIVYDMIHQMEWQLKFEGYVADTSQVLLDVDEEEKRERLKCHSQ 540

Query: 1891 KVAIAFSLIHTSSEGSPIRITRNLRMCSDCHTYTKFISMVYEREITVRDRLRFHHFKDGT 2070
            K+AIAF+LIHT SEG P+RITRNLRMCSDCHTYTK+ISM+Y REIT+RDR RFHHFK+GT
Sbjct: 541  KLAIAFALIHT-SEGCPLRITRNLRMCSDCHTYTKYISMIYNREITIRDRHRFHHFKNGT 599

Query: 2071 CSCKDYW 2091
            C+CKDYW
Sbjct: 600  CTCKDYW 606


>ref|XP_007158217.1| hypothetical protein PHAVU_002G134100g [Phaseolus vulgaris]
 gb|ESW30211.1| hypothetical protein PHAVU_002G134100g [Phaseolus vulgaris]
          Length = 605

 Score = 1033 bits (2672), Expect = 0.0
 Identities = 504/606 (83%), Positives = 553/606 (91%)
 Frame = +1

Query: 274  MTGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLF 453
            M+GTSVL Q+H L LPNNPPQ+SEL+  FNE+GW  LLK+CKSMEEFKQVHAQ+LKLGLF
Sbjct: 1    MSGTSVLCQSHLLSLPNNPPQNSELNAKFNEQGWLSLLKRCKSMEEFKQVHAQILKLGLF 60

Query: 454  WDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYV 633
             DSFC SNLVATCAL++WGSMEYACSIFRQIEEPGSFEYNTMIRGNVN+M L++ALLLYV
Sbjct: 61   LDSFCGSNLVATCALSRWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNNMNLEKALLLYV 120

Query: 634  EMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWG 813
            EMLE+GIE +NFTYPFVLKACSLLG LKE +QIHGQVFKAGLE D FVQN LISMYGK G
Sbjct: 121  EMLEKGIEHDNFTYPFVLKACSLLGALKEGVQIHGQVFKAGLEDDTFVQNGLISMYGKCG 180

Query: 814  EIKHACDVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVL 993
            EI HAC +FE+M+ KSVASWS+IIGAHA VE+W +CL+LL DMSSEG  RAEES LV  L
Sbjct: 181  EINHACALFEQMDEKSVASWSSIIGAHARVELWQDCLMLLGDMSSEGRHRAEESILVTAL 240

Query: 994  SACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRY 1173
            SACTHLGSPNLG+CIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGL VFQSMA KNRY
Sbjct: 241  SACTHLGSPNLGRCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLCVFQSMAVKNRY 300

Query: 1174 SYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNM 1353
            SYTVMISGLA HG GR+AL +F+EM+ EGLAPDDVVYVGVLSACSHAGLVNEGLQCF +M
Sbjct: 301  SYTVMISGLAFHGRGREALRVFSEMVEEGLAPDDVVYVGVLSACSHAGLVNEGLQCFNSM 360

Query: 1354 QFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEI 1533
            Q  HKIKPTIQHYGCMVDLMGRAGM+KEA DLIK M IKPNDV+WRSLLSACKVHL+LEI
Sbjct: 361  QLVHKIKPTIQHYGCMVDLMGRAGMLKEACDLIKGMQIKPNDVIWRSLLSACKVHLNLEI 420

Query: 1534 GEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADKKLVQTPGFSLVEAKR 1713
            GE+AA+N+F LN +NPGDYLVLA+MYA+AQKW DVA+IR +MA+K LVQTPGFSLVEA R
Sbjct: 421  GEVAAENVFKLNQHNPGDYLVLASMYARAQKWTDVARIRTEMAEKHLVQTPGFSLVEANR 480

Query: 1714 NVYKFVSQDKSQPQWNTIYDMIHQMEWQLKFEGYVPDTSQVLLDVDEEEKRERLKCHSQK 1893
             V+KFVSQDKSQPQ +TIYDMIHQMEWQLKFEGY PDTSQVLLDVDEEEKR+RLK HSQK
Sbjct: 481  KVHKFVSQDKSQPQCDTIYDMIHQMEWQLKFEGYAPDTSQVLLDVDEEEKRQRLKYHSQK 540

Query: 1894 VAIAFSLIHTSSEGSPIRITRNLRMCSDCHTYTKFISMVYEREITVRDRLRFHHFKDGTC 2073
            +AIAF+LI T SEGSP+RI+RNLRMCSDCHTYTKFISM+YEREI+VRDR RFHHFKDGTC
Sbjct: 541  LAIAFALIQT-SEGSPVRISRNLRMCSDCHTYTKFISMIYEREISVRDRNRFHHFKDGTC 599

Query: 2074 SCKDYW 2091
            SCKDYW
Sbjct: 600  SCKDYW 605


>ref|XP_017426458.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
            [Vigna angularis]
 gb|KOM45472.1| hypothetical protein LR48_Vigan06g077800 [Vigna angularis]
 dbj|BAT99683.1| hypothetical protein VIGAN_10118700 [Vigna angularis var. angularis]
          Length = 605

 Score = 1030 bits (2662), Expect = 0.0
 Identities = 496/606 (81%), Positives = 555/606 (91%)
 Frame = +1

Query: 274  MTGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLF 453
            M+GTSVL Q+H L LPNNPPQ+SEL+  FNE+GW  LLK+CKSMEEFKQVHAQ+LKLGLF
Sbjct: 1    MSGTSVLCQSHLLSLPNNPPQNSELNAKFNEQGWLSLLKRCKSMEEFKQVHAQILKLGLF 60

Query: 454  WDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYV 633
            WDSFC SNLVATCAL++WGSMEYACSIFRQIEEPGSFEYNTMIRGNVN++ L++ALLLYV
Sbjct: 61   WDSFCGSNLVATCALSRWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNNLNLEKALLLYV 120

Query: 634  EMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWG 813
            EMLE+GIE +NFTYPFVLKACSLLG LKE +Q+HGQVFKAGLE D +V N LISMYGK G
Sbjct: 121  EMLEKGIEHDNFTYPFVLKACSLLGALKEGVQVHGQVFKAGLEDDTYVHNGLISMYGKCG 180

Query: 814  EIKHACDVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVL 993
            EI HACDVFE+M+ +SVASWS+IIGAHA VE+W +CL+LL DMS+EG  RAEES LV+ L
Sbjct: 181  EINHACDVFEQMDERSVASWSSIIGAHASVELWQDCLMLLGDMSNEGRHRAEESILVSAL 240

Query: 994  SACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRY 1173
            SACTHLGSP++G+CIHGILLRNISELNVVVKTSLIDMY+KCG+L+KGL VFQ+MA KNRY
Sbjct: 241  SACTHLGSPDIGRCIHGILLRNISELNVVVKTSLIDMYIKCGNLDKGLCVFQNMAVKNRY 300

Query: 1174 SYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNM 1353
            SYTVMISGLA HG GR+AL +F EM+ EGLAPDDVVYVGVLSACSHAGLVNEGLQCF +M
Sbjct: 301  SYTVMISGLAFHGRGREALRVFCEMVEEGLAPDDVVYVGVLSACSHAGLVNEGLQCFNHM 360

Query: 1354 QFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEI 1533
            Q  HKIKPTIQHYGCMVDLMGRAGM+KEAY+LIK M IKPNDVVWRSLLSACKVHL+LEI
Sbjct: 361  QLVHKIKPTIQHYGCMVDLMGRAGMLKEAYELIKGMPIKPNDVVWRSLLSACKVHLNLEI 420

Query: 1534 GEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADKKLVQTPGFSLVEAKR 1713
            GEIAA+N+F LN +NPGDYLVLA+MYA+AQKW DVA+IR +MA+K LVQTPGFSLVEA R
Sbjct: 421  GEIAAENIFKLNQHNPGDYLVLASMYARAQKWTDVARIRTEMAEKHLVQTPGFSLVEANR 480

Query: 1714 NVYKFVSQDKSQPQWNTIYDMIHQMEWQLKFEGYVPDTSQVLLDVDEEEKRERLKCHSQK 1893
             V+KFVSQDKSQP+ +TIY+MIHQMEWQLKFEGY PDTSQVLLDVDEEEKR+RLK HSQK
Sbjct: 481  KVHKFVSQDKSQPECDTIYEMIHQMEWQLKFEGYAPDTSQVLLDVDEEEKRQRLKHHSQK 540

Query: 1894 VAIAFSLIHTSSEGSPIRITRNLRMCSDCHTYTKFISMVYEREITVRDRLRFHHFKDGTC 2073
            +AIAF+LI T SEGSPIRI+RNLRMCSDCHTYTKFISM+YEREI+VRDR RFHHFKDGTC
Sbjct: 541  LAIAFALIQT-SEGSPIRISRNLRMCSDCHTYTKFISMIYEREISVRDRNRFHHFKDGTC 599

Query: 2074 SCKDYW 2091
            SCKDYW
Sbjct: 600  SCKDYW 605


>ref|XP_006572946.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920-like
            [Glycine max]
 gb|KRH74305.1| hypothetical protein GLYMA_01G011300 [Glycine max]
          Length = 605

 Score = 1020 bits (2637), Expect = 0.0
 Identities = 501/606 (82%), Positives = 548/606 (90%)
 Frame = +1

Query: 274  MTGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLF 453
            M+GTSVL Q+H L LPN+PPQSSEL+  FNE+GW  LLK+CKSMEEFKQVHA +LKLGLF
Sbjct: 1    MSGTSVLCQSHLLSLPNSPPQSSELNAKFNEQGWLSLLKRCKSMEEFKQVHAHILKLGLF 60

Query: 454  WDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYV 633
            +DSFC SNLVA+CAL++WGSMEYACSIF QIEEPGSFEYNTMIRGNVN M L+EALLLYV
Sbjct: 61   YDSFCGSNLVASCALSRWGSMEYACSIFSQIEEPGSFEYNTMIRGNVNSMDLEEALLLYV 120

Query: 634  EMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWG 813
            EMLERGIEP+NFTYPFVLKACSLL  LKE +QIH  VFKAGLE DVFVQN LISMYGK G
Sbjct: 121  EMLERGIEPDNFTYPFVLKACSLLVALKEGVQIHAHVFKAGLEVDVFVQNGLISMYGKCG 180

Query: 814  EIKHACDVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVL 993
             I+HA  VFE+M+ KSVASWS+IIGAHA VEMWHECL+LL DMS EG  RAEES LV+ L
Sbjct: 181  AIEHAGVVFEQMDEKSVASWSSIIGAHASVEMWHECLMLLGDMSGEGRHRAEESILVSAL 240

Query: 994  SACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRY 1173
            SACTHLGSPNLG+CIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGL VFQ+MA KNRY
Sbjct: 241  SACTHLGSPNLGRCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLCVFQNMAHKNRY 300

Query: 1174 SYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNM 1353
            SYTVMI+GLAIHG GR+A+ +F++ML EGL PDDVVYVGVLSACSHAGLVNEGLQCF  M
Sbjct: 301  SYTVMIAGLAIHGRGREAVRVFSDMLEEGLTPDDVVYVGVLSACSHAGLVNEGLQCFNRM 360

Query: 1354 QFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEI 1533
            QFEH IKPTIQHYGCMVDLMGRAGM+KEAYDLIKSM IKPNDVVWRSLLSACKVH +LEI
Sbjct: 361  QFEHMIKPTIQHYGCMVDLMGRAGMLKEAYDLIKSMPIKPNDVVWRSLLSACKVHHNLEI 420

Query: 1534 GEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADKKLVQTPGFSLVEAKR 1713
            GEIAA+N+F LN +NPGDYLVLANMYA+A+KW +VA+IR +MA+K LVQTPGFSLVEA R
Sbjct: 421  GEIAAENIFRLNKHNPGDYLVLANMYARAKKWANVARIRTEMAEKHLVQTPGFSLVEANR 480

Query: 1714 NVYKFVSQDKSQPQWNTIYDMIHQMEWQLKFEGYVPDTSQVLLDVDEEEKRERLKCHSQK 1893
            NVYKFVSQDKSQP   TIYDMI QMEWQLKFEGY PD SQVLLDVDE+EKR+RLK HSQK
Sbjct: 481  NVYKFVSQDKSQPICETIYDMIQQMEWQLKFEGYTPDMSQVLLDVDEDEKRQRLKHHSQK 540

Query: 1894 VAIAFSLIHTSSEGSPIRITRNLRMCSDCHTYTKFISMVYEREITVRDRLRFHHFKDGTC 2073
            +AIAF+LI T SEGSPIRI+RNLRMC+DCHTYTKFIS++YEREITVRDR RFHHFKDGTC
Sbjct: 541  LAIAFALIQT-SEGSPIRISRNLRMCNDCHTYTKFISVIYEREITVRDRNRFHHFKDGTC 599

Query: 2074 SCKDYW 2091
            SCKDYW
Sbjct: 600  SCKDYW 605


>ref|XP_014521259.1| pentatricopeptide repeat-containing protein At1g31920 [Vigna radiata
            var. radiata]
 ref|XP_014521260.1| pentatricopeptide repeat-containing protein At1g31920 [Vigna radiata
            var. radiata]
          Length = 605

 Score = 1019 bits (2636), Expect = 0.0
 Identities = 492/606 (81%), Positives = 551/606 (90%)
 Frame = +1

Query: 274  MTGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLF 453
            M+GTSVL Q+H L LPNNPP +SEL+  FNE+GW  LLK+CKSMEEFK VHAQ+LKLGLF
Sbjct: 1    MSGTSVLCQSHLLSLPNNPPLNSELNAKFNEQGWLSLLKRCKSMEEFKHVHAQILKLGLF 60

Query: 454  WDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYV 633
            WDSFC SNLVATCAL++WGSMEYACSIFRQIEEPGSFEYNTMIRG+VN+M L++ALLLYV
Sbjct: 61   WDSFCGSNLVATCALSRWGSMEYACSIFRQIEEPGSFEYNTMIRGHVNNMNLEKALLLYV 120

Query: 634  EMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWG 813
            EMLE+GIE +NFTYPFVLKACSLLG LKE +QIHGQVFKAGLE D +VQN LISMYGK G
Sbjct: 121  EMLEKGIEHDNFTYPFVLKACSLLGALKEGVQIHGQVFKAGLEDDTYVQNGLISMYGKCG 180

Query: 814  EIKHACDVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVL 993
            EIKHACDVFE+M+ +SVASWS+IIGAHA VE+W +CL+LL DMS+EG  R EES LV+ L
Sbjct: 181  EIKHACDVFEQMDERSVASWSSIIGAHASVELWQDCLMLLGDMSNEGRHRPEESILVSAL 240

Query: 994  SACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRY 1173
            SACTHLGSP++G+CIHGILLRNISELNVVVKTSLIDMYVKCG+LEKGL VFQ+MA KNRY
Sbjct: 241  SACTHLGSPDIGRCIHGILLRNISELNVVVKTSLIDMYVKCGNLEKGLCVFQNMAVKNRY 300

Query: 1174 SYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNM 1353
            SYTVMISGLA HG GR+AL +F EM+ EGLAPDDVVYVGVLSACSHAGLVNEGLQCF  M
Sbjct: 301  SYTVMISGLAFHGRGREALRVFCEMMEEGLAPDDVVYVGVLSACSHAGLVNEGLQCFNRM 360

Query: 1354 QFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEI 1533
            Q  HKIKPTIQHYGCMVDLMGRAGM+ EAY+LIK M IKPNDVVWRSLLSACKVHL+LEI
Sbjct: 361  QLVHKIKPTIQHYGCMVDLMGRAGMLMEAYELIKGMQIKPNDVVWRSLLSACKVHLNLEI 420

Query: 1534 GEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADKKLVQTPGFSLVEAKR 1713
            GEIAA+N+F LN +NPGDYLVLA+MYA+AQKW DVA+IR +MA+K L+QTPGFSLVEA R
Sbjct: 421  GEIAAENIFKLNPHNPGDYLVLASMYARAQKWTDVARIRTEMAEKHLLQTPGFSLVEANR 480

Query: 1714 NVYKFVSQDKSQPQWNTIYDMIHQMEWQLKFEGYVPDTSQVLLDVDEEEKRERLKCHSQK 1893
             V+KFVSQDKSQP+ + +Y+MIHQMEWQLKFEGY PDTSQVLLDVDEEEK++RLK HSQK
Sbjct: 481  KVHKFVSQDKSQPECDIVYEMIHQMEWQLKFEGYAPDTSQVLLDVDEEEKKQRLKYHSQK 540

Query: 1894 VAIAFSLIHTSSEGSPIRITRNLRMCSDCHTYTKFISMVYEREITVRDRLRFHHFKDGTC 2073
            +AIAF+LI T SEGSPIRI+RNLRMCSDCHTYTKFIS++YEREI+VRDR RFHHFKDGTC
Sbjct: 541  LAIAFALIQT-SEGSPIRISRNLRMCSDCHTYTKFISVIYEREISVRDRNRFHHFKDGTC 599

Query: 2074 SCKDYW 2091
            SCKDYW
Sbjct: 600  SCKDYW 605


>ref|XP_020226361.1| pentatricopeptide repeat-containing protein At1g31920-like [Cajanus
            cajan]
 ref|XP_020226362.1| pentatricopeptide repeat-containing protein At1g31920-like [Cajanus
            cajan]
 ref|XP_020226363.1| pentatricopeptide repeat-containing protein At1g31920-like [Cajanus
            cajan]
          Length = 607

 Score = 1018 bits (2631), Expect = 0.0
 Identities = 500/608 (82%), Positives = 551/608 (90%), Gaps = 2/608 (0%)
 Frame = +1

Query: 274  MTGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLF 453
            M+GTSVLSQTH L LPNNPPQSSEL++ FN+KGW  LLK+CKSMEEFKQVHAQ+LKLGLF
Sbjct: 1    MSGTSVLSQTHLLSLPNNPPQSSELNSKFNDKGWLSLLKRCKSMEEFKQVHAQILKLGLF 60

Query: 454  WDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYV 633
             DSFC SNLVATCAL+KWGSMEYACSIFRQIEEPGSFEYNTMIRG+VN++ L+EAL LYV
Sbjct: 61   LDSFCGSNLVATCALSKWGSMEYACSIFRQIEEPGSFEYNTMIRGSVNNVNLEEALFLYV 120

Query: 634  EMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWG 813
            EMLERGIEP+ FTYPFV KACSLLG LKE +QIHG +FKAGL+GD FVQNSLISMYGK  
Sbjct: 121  EMLERGIEPDKFTYPFVFKACSLLGALKEGVQIHGHIFKAGLDGDTFVQNSLISMYGKCR 180

Query: 814  EIKHACDVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVL 993
             IKHA  VFE+M+ +SVASWSAIIGAHA VEMW ECL+LL DMSSEG  RAEES LV+ L
Sbjct: 181  AIKHAYAVFEQMDERSVASWSAIIGAHASVEMWQECLMLLGDMSSEGRHRAEESILVSAL 240

Query: 994  SACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRY 1173
            SACTHLGSP LG+CIHGILLRNIS+LNVVVKTSLIDMYVKCG LEKGL VFQ+MAEKNR+
Sbjct: 241  SACTHLGSPILGRCIHGILLRNISKLNVVVKTSLIDMYVKCGCLEKGLSVFQNMAEKNRF 300

Query: 1174 SYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNM 1353
            SYTVMI+GLAIHG GR+AL +F++ML EGLAPDDVVYVGVLSACSHAGLVNEGLQCF  M
Sbjct: 301  SYTVMIAGLAIHGRGREALRVFSDMLEEGLAPDDVVYVGVLSACSHAGLVNEGLQCFNRM 360

Query: 1354 QFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEI 1533
            +FEHKIKPT+QHYGCMVDLMGRAGM++EAYDLIK M IKPNDVVWRSLLSACKVHL+LEI
Sbjct: 361  RFEHKIKPTVQHYGCMVDLMGRAGMLREAYDLIKRMPIKPNDVVWRSLLSACKVHLNLEI 420

Query: 1534 GEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADKKLVQTP--GFSLVEA 1707
            G IAA+NLF LN +NPGDYL+LANM+AKA+KWDDVA+IR +MA+K LVQ    GFSLVEA
Sbjct: 421  GVIAAENLFKLNQHNPGDYLMLANMFAKAKKWDDVARIRTEMAEKHLVQMQILGFSLVEA 480

Query: 1708 KRNVYKFVSQDKSQPQWNTIYDMIHQMEWQLKFEGYVPDTSQVLLDVDEEEKRERLKCHS 1887
             R +YKFVSQDKS+PQ + IYDMIHQMEWQLKFEGY PD SQVLLDV+EEEKRERLK HS
Sbjct: 481  NRKLYKFVSQDKSKPQCDIIYDMIHQMEWQLKFEGYTPDMSQVLLDVNEEEKRERLKYHS 540

Query: 1888 QKVAIAFSLIHTSSEGSPIRITRNLRMCSDCHTYTKFISMVYEREITVRDRLRFHHFKDG 2067
            QK+AIAF+LI T SEGSPIRI+RNL+MCSDCHTYTKFISM+YEREITVRDR RFHHFKDG
Sbjct: 541  QKLAIAFALIQT-SEGSPIRISRNLKMCSDCHTYTKFISMIYEREITVRDRNRFHHFKDG 599

Query: 2068 TCSCKDYW 2091
            TCSCKDYW
Sbjct: 600  TCSCKDYW 607


>ref|XP_003533450.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920-like
            [Glycine max]
          Length = 604

 Score = 1014 bits (2623), Expect = 0.0
 Identities = 500/606 (82%), Positives = 548/606 (90%)
 Frame = +1

Query: 274  MTGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLF 453
            M+ TSVL Q+HFL LPNNPPQSSEL+  FN +G   LLK+CKSMEEFKQVHA +LKLGLF
Sbjct: 1    MSWTSVLCQSHFLSLPNNPPQSSELNAKFNVQG-LSLLKRCKSMEEFKQVHAHILKLGLF 59

Query: 454  WDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYV 633
            +DSFC SNLVATCAL++WGSMEYACSIFRQIEEPGSFEYNTMIRGNVN M L+EALLLYV
Sbjct: 60   YDSFCGSNLVATCALSRWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNSMNLEEALLLYV 119

Query: 634  EMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWG 813
            EMLERGIEP+NFTYPFVLKACSLLG LKE +QIH  VFKAGLEGDVFVQN LI+MYGK G
Sbjct: 120  EMLERGIEPDNFTYPFVLKACSLLGALKEGVQIHAHVFKAGLEGDVFVQNGLINMYGKCG 179

Query: 814  EIKHACDVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVL 993
             I+HA  VFE+M+ KSVASWS+IIGAHA VEMWHECL+LL DMS EG  RAEES LV+ L
Sbjct: 180  AIEHASVVFEQMDEKSVASWSSIIGAHASVEMWHECLMLLGDMSGEGRHRAEESILVSAL 239

Query: 994  SACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRY 1173
            SACTHLGSPN G+CIHGILLRNISELNV VKTSLIDMYVK GSLEKGL VFQ+MA+KNRY
Sbjct: 240  SACTHLGSPNFGRCIHGILLRNISELNVAVKTSLIDMYVKSGSLEKGLCVFQNMAQKNRY 299

Query: 1174 SYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNM 1353
            SYTV+I+GLAIHG GR+AL +F++ML EGLAPDDVVYVGVLSACSHAGLVNEGLQCF  +
Sbjct: 300  SYTVIITGLAIHGRGREALSVFSDMLEEGLAPDDVVYVGVLSACSHAGLVNEGLQCFNRL 359

Query: 1354 QFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEI 1533
            QFEHKIKPTIQHYGCMVDLMGRAGM+K AYDLIKSM IKPNDVVWRSLLSACKVH +LEI
Sbjct: 360  QFEHKIKPTIQHYGCMVDLMGRAGMLKGAYDLIKSMPIKPNDVVWRSLLSACKVHHNLEI 419

Query: 1534 GEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADKKLVQTPGFSLVEAKR 1713
            GEIAA+N+F LN +NPGDYLVLANMYA+A+KW DVA+IR +MA+K LVQTPGFSLVEA R
Sbjct: 420  GEIAAENIFKLNQHNPGDYLVLANMYARAKKWADVARIRTEMAEKHLVQTPGFSLVEANR 479

Query: 1714 NVYKFVSQDKSQPQWNTIYDMIHQMEWQLKFEGYVPDTSQVLLDVDEEEKRERLKCHSQK 1893
            NVYKFVSQDKSQPQ  TIYDMI QMEWQLKFEGY PD SQVLLDVDE+EKR+RLK HSQK
Sbjct: 480  NVYKFVSQDKSQPQCETIYDMIQQMEWQLKFEGYTPDMSQVLLDVDEDEKRQRLKHHSQK 539

Query: 1894 VAIAFSLIHTSSEGSPIRITRNLRMCSDCHTYTKFISMVYEREITVRDRLRFHHFKDGTC 2073
            +AIAF+LI T SEGS IRI+RN+RMC+DCHTYTKFIS++YEREITVRDR RFHHFKDGTC
Sbjct: 540  LAIAFALIQT-SEGSRIRISRNIRMCNDCHTYTKFISVIYEREITVRDRNRFHHFKDGTC 598

Query: 2074 SCKDYW 2091
            SCKDYW
Sbjct: 599  SCKDYW 604


>gb|KHN40422.1| Pentatricopeptide repeat-containing protein [Glycine soja]
          Length = 605

 Score = 1014 bits (2622), Expect = 0.0
 Identities = 499/606 (82%), Positives = 546/606 (90%)
 Frame = +1

Query: 274  MTGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLF 453
            M+GTSVL Q+H L LPN+P QSSEL+  FNE+GW  LLK+CKSMEEFKQVHA +LKLGLF
Sbjct: 1    MSGTSVLCQSHLLSLPNSPLQSSELNAKFNEQGWLSLLKRCKSMEEFKQVHAHILKLGLF 60

Query: 454  WDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYV 633
            +DSFC SNLVA+CAL++WGSMEYACSIFRQIEEPGSFEYNTMIRGNVN M L+EALLLYV
Sbjct: 61   YDSFCGSNLVASCALSRWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNSMDLEEALLLYV 120

Query: 634  EMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWG 813
            EMLERGIEP+NFTYPFVLKACSLL  LKE +QIH  VFKAGLE DVFVQN LISMYGK G
Sbjct: 121  EMLERGIEPDNFTYPFVLKACSLLVALKEGVQIHAHVFKAGLEVDVFVQNGLISMYGKCG 180

Query: 814  EIKHACDVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVL 993
             I+HA  VFE+M+ KSVASWS+IIGAHA VEMWHECL+LL DMS EG  RAEES LV+ L
Sbjct: 181  AIEHAGVVFEQMDEKSVASWSSIIGAHASVEMWHECLMLLGDMSGEGRHRAEESILVSAL 240

Query: 994  SACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRY 1173
            SACTHLGSPNLG+CIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGL VF +MA KNRY
Sbjct: 241  SACTHLGSPNLGRCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLCVFHNMAHKNRY 300

Query: 1174 SYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNM 1353
            SYTVMI+GLAIHG GR+A+ +F++ML EGL PDDVVYVGVLSACSHAGLV EGLQCF  M
Sbjct: 301  SYTVMIAGLAIHGRGREAVRVFSDMLEEGLTPDDVVYVGVLSACSHAGLVKEGLQCFNRM 360

Query: 1354 QFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEI 1533
            QFEH IKPTIQHYGCMVDLMGRAGM+KEAYDLIKSM IKPNDVVWRSLLSACKVH +LEI
Sbjct: 361  QFEHMIKPTIQHYGCMVDLMGRAGMLKEAYDLIKSMPIKPNDVVWRSLLSACKVHHNLEI 420

Query: 1534 GEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADKKLVQTPGFSLVEAKR 1713
            GEIAA+N+F LN +NPGDYLVLANMYA+A+KW +VA+IR +MA+K LVQTPGFSLVEA R
Sbjct: 421  GEIAAENIFRLNKHNPGDYLVLANMYARAKKWANVARIRTEMAEKHLVQTPGFSLVEANR 480

Query: 1714 NVYKFVSQDKSQPQWNTIYDMIHQMEWQLKFEGYVPDTSQVLLDVDEEEKRERLKCHSQK 1893
            NVYKFVSQDKSQP   TIYDMI QMEWQLKFEGY PD SQVLLDVDE+EKR+RLK HSQK
Sbjct: 481  NVYKFVSQDKSQPICETIYDMIQQMEWQLKFEGYTPDMSQVLLDVDEDEKRQRLKHHSQK 540

Query: 1894 VAIAFSLIHTSSEGSPIRITRNLRMCSDCHTYTKFISMVYEREITVRDRLRFHHFKDGTC 2073
            +AIAF+LI T SEGSPIRI+RNLRMC+DCHTYTKFIS++YEREITVRDR RFHHFKDGTC
Sbjct: 541  LAIAFALIQT-SEGSPIRISRNLRMCNDCHTYTKFISVIYEREITVRDRNRFHHFKDGTC 599

Query: 2074 SCKDYW 2091
            SCKDYW
Sbjct: 600  SCKDYW 605


>ref|XP_006572948.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920-like
            [Glycine max]
 gb|KRH74309.1| hypothetical protein GLYMA_01G011700 [Glycine max]
          Length = 605

 Score = 1012 bits (2616), Expect = 0.0
 Identities = 497/606 (82%), Positives = 543/606 (89%)
 Frame = +1

Query: 274  MTGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLF 453
            M+GTSVL Q+H L LPN+P QSSEL+  FNE+GW  LLK+CKSMEEFK+VHA +LKLGLF
Sbjct: 1    MSGTSVLCQSHLLSLPNSPLQSSELNAKFNEQGWLSLLKRCKSMEEFKKVHAHILKLGLF 60

Query: 454  WDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYV 633
            +DSFC SNLVA+CAL++WGSMEYACSIFRQIEEPGSFEYNTMIRGNVN M L+EALLLYV
Sbjct: 61   YDSFCGSNLVASCALSRWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNSMDLEEALLLYV 120

Query: 634  EMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWG 813
            EMLERGIEP+NFTYPFVLKACSLL  LKE +QIH  VF AGLE DVFVQN LISMYGK G
Sbjct: 121  EMLERGIEPDNFTYPFVLKACSLLVALKEGVQIHAHVFNAGLEVDVFVQNGLISMYGKCG 180

Query: 814  EIKHACDVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVL 993
             I+HA  VFE+M+ KSVASWS+IIGAHA VEMWHECL+LL DMS EG  RAEES LV+ L
Sbjct: 181  AIEHAGVVFEQMDEKSVASWSSIIGAHASVEMWHECLMLLGDMSREGRHRAEESILVSAL 240

Query: 994  SACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRY 1173
            SACTHLGSPNLG+CIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGL VFQ+MA KNRY
Sbjct: 241  SACTHLGSPNLGRCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLCVFQNMAHKNRY 300

Query: 1174 SYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNM 1353
            SYTVMI+GLAIHG GR+AL +F++ML EGL PDDVVYVGVLSACSHAGLV EG QCF  M
Sbjct: 301  SYTVMIAGLAIHGRGREALRVFSDMLEEGLTPDDVVYVGVLSACSHAGLVKEGFQCFNRM 360

Query: 1354 QFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEI 1533
            QFEH IKPTIQHYGCMVDLMGRAGM+KEAYDLIKSM IKPNDVVWRSLLSACKVH +LEI
Sbjct: 361  QFEHMIKPTIQHYGCMVDLMGRAGMLKEAYDLIKSMPIKPNDVVWRSLLSACKVHHNLEI 420

Query: 1534 GEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADKKLVQTPGFSLVEAKR 1713
            GEIAA N+F LN +NPGDYLVLANMYA+AQKW +VA+IR +M +K LVQTPGFSLVEA R
Sbjct: 421  GEIAADNIFKLNKHNPGDYLVLANMYARAQKWANVARIRTEMVEKNLVQTPGFSLVEANR 480

Query: 1714 NVYKFVSQDKSQPQWNTIYDMIHQMEWQLKFEGYVPDTSQVLLDVDEEEKRERLKCHSQK 1893
            NVYKFVSQDKSQPQ  TIYDMI QMEWQLKFEGY PD SQVLLDVDE+EKR+RLK HSQK
Sbjct: 481  NVYKFVSQDKSQPQCETIYDMIQQMEWQLKFEGYTPDMSQVLLDVDEDEKRQRLKHHSQK 540

Query: 1894 VAIAFSLIHTSSEGSPIRITRNLRMCSDCHTYTKFISMVYEREITVRDRLRFHHFKDGTC 2073
            +AIAF+LI T SEGSP+RI+RNLRMC+DCHTYTKFIS++YEREITVRD  RFHHFKDGTC
Sbjct: 541  LAIAFALIQT-SEGSPVRISRNLRMCNDCHTYTKFISVIYEREITVRDSNRFHHFKDGTC 599

Query: 2074 SCKDYW 2091
            SCKDYW
Sbjct: 600  SCKDYW 605


>ref|XP_020226365.1| pentatricopeptide repeat-containing protein At1g31920-like [Cajanus
            cajan]
          Length = 607

 Score = 1003 bits (2593), Expect = 0.0
 Identities = 493/608 (81%), Positives = 546/608 (89%), Gaps = 2/608 (0%)
 Frame = +1

Query: 274  MTGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLF 453
            M+GTSVLSQTH L LPNNPPQSSEL++ FN+KGW  LLK+CK MEEFKQVHAQ+LKLGLF
Sbjct: 1    MSGTSVLSQTHLLSLPNNPPQSSELNSKFNDKGWLSLLKRCKCMEEFKQVHAQILKLGLF 60

Query: 454  WDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYV 633
             DSFC SNLVATCAL+KWGSM YACSIFRQIEEPGSFEYNTMIRG+VN+M L+EAL LYV
Sbjct: 61   LDSFCGSNLVATCALSKWGSMGYACSIFRQIEEPGSFEYNTMIRGSVNNMNLEEALFLYV 120

Query: 634  EMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWG 813
            EMLERGIEP  FTYPFV K CSLLG LKE +QIHG +FKAG +GD FVQNSLISMYGK G
Sbjct: 121  EMLERGIEPEKFTYPFVFKGCSLLGALKEGVQIHGHIFKAGFDGDTFVQNSLISMYGKCG 180

Query: 814  EIKHACDVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVL 993
             IKHA  VFE+M+ +SVASWSAIIGAHA VEMW ECL+LL DMSSEG  +AEES LV+ L
Sbjct: 181  AIKHAYAVFEQMDERSVASWSAIIGAHASVEMWQECLMLLGDMSSEGQHKAEESILVSAL 240

Query: 994  SACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRY 1173
            S CTHLGSP LG+CIHGILLRNISELNVVVKTSLIDMYVKCG LEKGL VFQ+MAEKN++
Sbjct: 241  STCTHLGSPILGRCIHGILLRNISELNVVVKTSLIDMYVKCGCLEKGLSVFQNMAEKNKF 300

Query: 1174 SYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNM 1353
            SYTVMI+GLAI G GR+AL +F++M+ EGLAPDDVVYVGVLSACSHAGLVNEGLQ F  M
Sbjct: 301  SYTVMIAGLAIDGRGREALRVFSDMMEEGLAPDDVVYVGVLSACSHAGLVNEGLQFFNRM 360

Query: 1354 QFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEI 1533
            +FEHKIKPT+QHYGCMVDLMGRAGM++EAYDLIK M IKPNDVVWRSLLSACKVHL+LEI
Sbjct: 361  RFEHKIKPTVQHYGCMVDLMGRAGMLREAYDLIKRMPIKPNDVVWRSLLSACKVHLNLEI 420

Query: 1534 GEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADKKLV--QTPGFSLVEA 1707
            G IAA+NLF LN +NPGDYL+LANM+A+A+KW+DVA+IR +MA+K LV  QT GFSLVEA
Sbjct: 421  GVIAAENLFKLNQHNPGDYLMLANMFARAKKWNDVARIRTEMAEKHLVQLQTLGFSLVEA 480

Query: 1708 KRNVYKFVSQDKSQPQWNTIYDMIHQMEWQLKFEGYVPDTSQVLLDVDEEEKRERLKCHS 1887
             R VYKFVSQDKS+PQ +TIYDMIHQMEWQLKFEGY PD SQVLLDV+EEEKRERLK HS
Sbjct: 481  NRKVYKFVSQDKSKPQCDTIYDMIHQMEWQLKFEGYRPDMSQVLLDVNEEEKRERLKYHS 540

Query: 1888 QKVAIAFSLIHTSSEGSPIRITRNLRMCSDCHTYTKFISMVYEREITVRDRLRFHHFKDG 2067
            QK+AIAF+LI T SEGSPIRI+RNL+MCSDCHTYTKFISM+YE+EITVRDR RFHHFKDG
Sbjct: 541  QKLAIAFALIQT-SEGSPIRISRNLKMCSDCHTYTKFISMIYEQEITVRDRNRFHHFKDG 599

Query: 2068 TCSCKDYW 2091
            TCSCKDYW
Sbjct: 600  TCSCKDYW 607


>gb|KYP54428.1| Pentatricopeptide repeat-containing protein At1g31920 family [Cajanus
            cajan]
          Length = 601

 Score = 1003 bits (2593), Expect = 0.0
 Identities = 495/601 (82%), Positives = 545/601 (90%), Gaps = 2/601 (0%)
 Frame = +1

Query: 274  MTGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLF 453
            M+GTSVLSQTH L LPNNPPQSSEL++ FN+KGW  LLK+CK MEEFKQVHAQ+LKLGLF
Sbjct: 1    MSGTSVLSQTHLLSLPNNPPQSSELNSKFNDKGWLSLLKRCKCMEEFKQVHAQILKLGLF 60

Query: 454  WDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYV 633
             DSFC SNLVATCAL+KWGSMEYACSIFRQIEEPGSFEYNTMIRG+VN+M L+EAL LYV
Sbjct: 61   LDSFCGSNLVATCALSKWGSMEYACSIFRQIEEPGSFEYNTMIRGSVNNMNLEEALFLYV 120

Query: 634  EMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWG 813
            EMLERGIEP+ FTYPFV KACSLLG LKE +QIHG +FKAGL+GD FVQNSLISMYGK G
Sbjct: 121  EMLERGIEPDKFTYPFVFKACSLLGALKEGVQIHGHIFKAGLDGDTFVQNSLISMYGKCG 180

Query: 814  EIKHACDVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVL 993
             IKHA  VFE+M+ +SVASWSAIIGAHA VEMW ECL+LL DMSSEG  RAEES LV+ L
Sbjct: 181  AIKHAYAVFEQMDERSVASWSAIIGAHASVEMWQECLMLLGDMSSEGRHRAEESILVSAL 240

Query: 994  SACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRY 1173
            SACTHLGSP LG+CIHGILLRNISELNVVVKTSLIDMYVKCG LEKGL VFQ+MAEKNR+
Sbjct: 241  SACTHLGSPILGRCIHGILLRNISELNVVVKTSLIDMYVKCGCLEKGLSVFQNMAEKNRF 300

Query: 1174 SYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNM 1353
            SYTVMI+GLAIHG GR+AL +F++M+ EGLAPDDVVYVGVLSACSHAGLVNEGLQ F  M
Sbjct: 301  SYTVMIAGLAIHGRGREALRVFSDMMEEGLAPDDVVYVGVLSACSHAGLVNEGLQFFNRM 360

Query: 1354 QFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEI 1533
            +FEHKIKPT+QHYGCMVDLMGR GM++EAYDLIK + IKPNDVVWRSLLSACKVHL+LEI
Sbjct: 361  RFEHKIKPTVQHYGCMVDLMGRVGMLREAYDLIKRVPIKPNDVVWRSLLSACKVHLNLEI 420

Query: 1534 GEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADKKLV--QTPGFSLVEA 1707
            G IAAKNLF LN +NPGDYL+LANM+A+A+KW+DVA+IR KMA+K LV  QTPGFSLVEA
Sbjct: 421  GVIAAKNLFKLNQHNPGDYLMLANMFARAKKWNDVARIRTKMAEKHLVQLQTPGFSLVEA 480

Query: 1708 KRNVYKFVSQDKSQPQWNTIYDMIHQMEWQLKFEGYVPDTSQVLLDVDEEEKRERLKCHS 1887
             R VYKFVSQDKS+PQ +TIYDMIHQMEWQLKFEGY PD SQVLLDV+EEEKRERLK HS
Sbjct: 481  NRKVYKFVSQDKSKPQCDTIYDMIHQMEWQLKFEGYTPDMSQVLLDVNEEEKRERLKYHS 540

Query: 1888 QKVAIAFSLIHTSSEGSPIRITRNLRMCSDCHTYTKFISMVYEREITVRDRLRFHHFKDG 2067
            QK+AIAF+LI T SEGSPIRI+RNL+MCSDCHTYTKFISM+YEREITVRDR RFHHFKDG
Sbjct: 541  QKLAIAFALIQT-SEGSPIRISRNLKMCSDCHTYTKFISMIYEREITVRDRNRFHHFKDG 599

Query: 2068 T 2070
            T
Sbjct: 600  T 600


>ref|XP_019422179.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
            [Lupinus angustifolius]
          Length = 606

 Score = 1001 bits (2589), Expect = 0.0
 Identities = 486/607 (80%), Positives = 545/607 (89%), Gaps = 1/607 (0%)
 Frame = +1

Query: 274  MTGTSVLSQTHFLPLPNN-PPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGL 450
            MTGT+VL+QTHFLPLPNN PPQ SEL+  FNE+GW  +LK CKS+EE KQVHA +LKLG 
Sbjct: 1    MTGTTVLTQTHFLPLPNNSPPQCSELNIKFNEQGWLSMLKGCKSLEELKQVHAHILKLGF 60

Query: 451  FWDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLY 630
              DSFC SNLVATCAL+KWGSM+YACSIFRQIEEPGSFEYNTMIRGNVN M L+EALLLY
Sbjct: 61   LLDSFCESNLVATCALSKWGSMDYACSIFRQIEEPGSFEYNTMIRGNVNYMNLEEALLLY 120

Query: 631  VEMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKW 810
            +EMLERGIEP+NFTYPFVLKACSLLG + E MQIHG V K GL+GDVFVQNSLISMYGK+
Sbjct: 121  LEMLERGIEPDNFTYPFVLKACSLLGCVNEGMQIHGHVLKGGLKGDVFVQNSLISMYGKF 180

Query: 811  GEIKHACDVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNV 990
            G IKHAC VFEKM+ KSVASWSAIIGAHA VEMWHECL+L  DMSSEG  RAEESTLV V
Sbjct: 181  GGIKHACAVFEKMDEKSVASWSAIIGAHASVEMWHECLMLFGDMSSEGHHRAEESTLVTV 240

Query: 991  LSACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNR 1170
            ++ACTHLGS ++G+CIHGILLRNISELNV+VKTSLI+MYVKCG LEKGL VF +M EKNR
Sbjct: 241  ITACTHLGSLDIGRCIHGILLRNISELNVIVKTSLINMYVKCGCLEKGLSVFDNMVEKNR 300

Query: 1171 YSYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKN 1350
            +SYT+MISGLAIHGHG++AL +F+EML EGL PDDVVYVGVLSACSHAGLV+EGLQCF  
Sbjct: 301  HSYTIMISGLAIHGHGKEALRVFSEMLEEGLEPDDVVYVGVLSACSHAGLVDEGLQCFNR 360

Query: 1351 MQFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLE 1530
            M+FEHKI+PT+QHYGC+VDLMGRA M++EAYDLIKSM IKPNDVVWRSLLSACKVH +LE
Sbjct: 361  MRFEHKIEPTVQHYGCVVDLMGRARMLREAYDLIKSMPIKPNDVVWRSLLSACKVHHNLE 420

Query: 1531 IGEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADKKLVQTPGFSLVEAK 1710
            +GEIAA+NLF+LN  NPGDYL+LANMYA+AQ W + A++R +MA    VQTPGFSLVE K
Sbjct: 421  LGEIAAQNLFMLNPYNPGDYLMLANMYARAQNWANAARVRTEMAGNSSVQTPGFSLVEVK 480

Query: 1711 RNVYKFVSQDKSQPQWNTIYDMIHQMEWQLKFEGYVPDTSQVLLDVDEEEKRERLKCHSQ 1890
            R VYKFVSQDKSQPQ+N+IYDMIHQMEWQLKFEGYV DTSQVLLDVDEEEKR+RLK HSQ
Sbjct: 481  RVVYKFVSQDKSQPQYNSIYDMIHQMEWQLKFEGYVADTSQVLLDVDEEEKRQRLKYHSQ 540

Query: 1891 KVAIAFSLIHTSSEGSPIRITRNLRMCSDCHTYTKFISMVYEREITVRDRLRFHHFKDGT 2070
            K+AIAF+LIHT SEGS IRI+RN+RMC+DCHTYTK IS++Y+REITVRDR RFHHFK GT
Sbjct: 541  KLAIAFALIHT-SEGSTIRISRNIRMCNDCHTYTKLISIIYDREITVRDRNRFHHFKHGT 599

Query: 2071 CSCKDYW 2091
            CSCKDYW
Sbjct: 600  CSCKDYW 606


>ref|XP_020237290.1| pentatricopeptide repeat-containing protein At1g31920-like [Cajanus
            cajan]
          Length = 626

 Score =  979 bits (2531), Expect = 0.0
 Identities = 485/614 (78%), Positives = 541/614 (88%), Gaps = 2/614 (0%)
 Frame = +1

Query: 274  MTGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLF 453
            M+GTSVLSQTH L LPNNPPQSSEL++ FN+KGW  L K+CK MEEFKQVHAQ+LK GLF
Sbjct: 1    MSGTSVLSQTHLLSLPNNPPQSSELNSKFNDKGWLSLHKRCKCMEEFKQVHAQILKFGLF 60

Query: 454  WDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYV 633
              SFC SNLVATCAL+KWGSMEYA SIF+QI+EPGSFEYN MIRG+VN+M L+E L LYV
Sbjct: 61   LYSFCGSNLVATCALSKWGSMEYAFSIFKQIKEPGSFEYNIMIRGSVNNMNLEETLFLYV 120

Query: 634  EMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWG 813
            EMLERGIEP+ FTYPFV KACSLLG  KE +QIHG +FK GL+GD FVQNSLI+MYGK G
Sbjct: 121  EMLERGIEPDKFTYPFVFKACSLLGAFKEGVQIHGHIFKVGLDGDAFVQNSLINMYGKCG 180

Query: 814  EIKHACDVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVL 993
             IKHA  VFE+M  +SVASWSA+IGAHA VEMW ECL+LL DMSSEG  RAEES LV+ L
Sbjct: 181  AIKHAYAVFEQMVERSVASWSAVIGAHASVEMWQECLMLLGDMSSEGRHRAEESILVSAL 240

Query: 994  SACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRY 1173
            SACTHLGSP LG+CIHGILLRNISELNVVVKTSLI MYVKCG LEKGL +FQ++AEKNR+
Sbjct: 241  SACTHLGSPILGRCIHGILLRNISELNVVVKTSLIYMYVKCGCLEKGLSLFQNIAEKNRF 300

Query: 1174 SYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNM 1353
            SYTVMI+GLAIHG GR+AL +F++M+ EGLAPDDVVYVGVLSACSHA LVNEGLQ F  M
Sbjct: 301  SYTVMIAGLAIHGRGREALRVFSDMMEEGLAPDDVVYVGVLSACSHASLVNEGLQFFNRM 360

Query: 1354 QFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEI 1533
            +FEHKIKPT+QHYGCMVDLMGRAGM++EAYDLIK M IKPNDVVWRSLLSACKVHL+LEI
Sbjct: 361  RFEHKIKPTVQHYGCMVDLMGRAGMLREAYDLIKRMPIKPNDVVWRSLLSACKVHLNLEI 420

Query: 1534 GEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADKKLV--QTPGFSLVEA 1707
            G IAA+NLF LN +NPGDYL+LANM+A+A+KW+DVA+IR +MA+K LV  QTPGFSLVEA
Sbjct: 421  GVIAAENLFKLNQHNPGDYLMLANMFARAKKWNDVARIRTEMAEKHLVQLQTPGFSLVEA 480

Query: 1708 KRNVYKFVSQDKSQPQWNTIYDMIHQMEWQLKFEGYVPDTSQVLLDVDEEEKRERLKCHS 1887
             R VYKFVSQDKS+PQ +TIYDMIHQMEWQLKFEGY PD SQVLLDV+EEEKRERLK HS
Sbjct: 481  NRKVYKFVSQDKSKPQCDTIYDMIHQMEWQLKFEGYTPDMSQVLLDVNEEEKRERLKYHS 540

Query: 1888 QKVAIAFSLIHTSSEGSPIRITRNLRMCSDCHTYTKFISMVYEREITVRDRLRFHHFKDG 2067
            QK+AIAF+LI T SE SPIRI+RNL+MCSD HTYTKFISM YE EITVRD  RFHHFKDG
Sbjct: 541  QKLAIAFALIQT-SEDSPIRISRNLKMCSDYHTYTKFISMFYEWEITVRDHNRFHHFKDG 599

Query: 2068 TCSCKDYW*ISSYL 2109
            TCSCKDYW IS +L
Sbjct: 600  TCSCKDYWRISLHL 613


>ref|XP_020226366.1| pentatricopeptide repeat-containing protein At1g31920-like [Cajanus
            cajan]
          Length = 590

 Score =  977 bits (2525), Expect = 0.0
 Identities = 483/588 (82%), Positives = 533/588 (90%), Gaps = 2/588 (0%)
 Frame = +1

Query: 274  MTGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLF 453
            M+GTSVLSQTH L LPNNPPQSSEL++ FN+KGW  LLK+CK MEEFKQVHAQ+LKLGLF
Sbjct: 1    MSGTSVLSQTHLLSLPNNPPQSSELNSKFNDKGWLSLLKRCKCMEEFKQVHAQILKLGLF 60

Query: 454  WDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYV 633
             DSFC SNLVATCAL+KWGSMEYACSIFRQIEEPGSFEYNTMIRG+VN+M L+EAL LYV
Sbjct: 61   LDSFCGSNLVATCALSKWGSMEYACSIFRQIEEPGSFEYNTMIRGSVNNMNLEEALFLYV 120

Query: 634  EMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWG 813
            EMLERGIEP+ FTYPFV KACSLLG LKE +QIHG +FKAGL+GD FVQNSLISMYGK G
Sbjct: 121  EMLERGIEPDKFTYPFVFKACSLLGALKEGVQIHGHIFKAGLDGDTFVQNSLISMYGKCG 180

Query: 814  EIKHACDVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVL 993
             IKHA  VFE+M+ +SVASWSAIIGAHA VEMW ECL+LL DMSSEG  RAEES LV+ L
Sbjct: 181  AIKHAYAVFEQMDERSVASWSAIIGAHASVEMWQECLMLLGDMSSEGRHRAEESILVSAL 240

Query: 994  SACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRY 1173
            SACTHLGSP LG+CIHGILLRNISELNVVVKTSLIDMYVKCG LEKGL VFQ+MAEKNR+
Sbjct: 241  SACTHLGSPILGRCIHGILLRNISELNVVVKTSLIDMYVKCGCLEKGLSVFQNMAEKNRF 300

Query: 1174 SYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNM 1353
            SYTVMI+GLAIHG GR+AL +F++M+ EGLAPDDVVYVGVLSACSHAGLVNEGLQ F  M
Sbjct: 301  SYTVMIAGLAIHGRGREALRVFSDMMEEGLAPDDVVYVGVLSACSHAGLVNEGLQFFNRM 360

Query: 1354 QFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEI 1533
            +FEHKIKPT+QHYGCMVDLMGR GM++EAYDLIK + IKPNDVVWRSLLSACKVHL+LEI
Sbjct: 361  RFEHKIKPTVQHYGCMVDLMGRVGMLREAYDLIKRVPIKPNDVVWRSLLSACKVHLNLEI 420

Query: 1534 GEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADKKLV--QTPGFSLVEA 1707
            G IAAKNLF LN +NPGDYL+LANM+A+A+KW+DVA+IR KMA+K LV  QTPGFSLVEA
Sbjct: 421  GVIAAKNLFKLNQHNPGDYLMLANMFARAKKWNDVARIRTKMAEKHLVQLQTPGFSLVEA 480

Query: 1708 KRNVYKFVSQDKSQPQWNTIYDMIHQMEWQLKFEGYVPDTSQVLLDVDEEEKRERLKCHS 1887
             R VYKFVSQDKS+PQ +TIYDMIHQMEWQLKFEGY PD SQVLLDV+EEEKRERLK HS
Sbjct: 481  NRKVYKFVSQDKSKPQCDTIYDMIHQMEWQLKFEGYTPDMSQVLLDVNEEEKRERLKYHS 540

Query: 1888 QKVAIAFSLIHTSSEGSPIRITRNLRMCSDCHTYTKFISMVYEREITV 2031
            QK+AIAF+LI T SEGSPIRI+RNL+MCSDCHTYTKFISM+YEREITV
Sbjct: 541  QKLAIAFALIQT-SEGSPIRISRNLKMCSDCHTYTKFISMIYEREITV 587


>ref|XP_016201186.1| pentatricopeptide repeat-containing protein At1g31920 isoform X1
            [Arachis ipaensis]
          Length = 608

 Score =  976 bits (2524), Expect = 0.0
 Identities = 475/606 (78%), Positives = 531/606 (87%), Gaps = 1/606 (0%)
 Frame = +1

Query: 277  TGTSVLSQTHFLPLPNN-PPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLF 453
            T T +LS+THFL L NN PP+++EL+  F E GW PLLK+CKSMEEFKQVHA +LKLGLF
Sbjct: 4    TETYILSKTHFLSLSNNIPPKNTELNVRFYEHGWLPLLKRCKSMEEFKQVHAHILKLGLF 63

Query: 454  WDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYV 633
            WD FC SNLVATCAL KWGSMEYACSIFR+IEEP SFEYNTMIRGNVN M L+EAL+LY+
Sbjct: 64   WDHFCGSNLVATCALAKWGSMEYACSIFRRIEEPSSFEYNTMIRGNVNCMNLEEALILYI 123

Query: 634  EMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWG 813
            EML+ GIEP+NFTYPFV KACSLLG LKE MQI+  VFKAGLEGD+FVQNSLISMYGK G
Sbjct: 124  EMLKEGIEPDNFTYPFVFKACSLLGALKEGMQIYSHVFKAGLEGDLFVQNSLISMYGKCG 183

Query: 814  EIKHACDVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVL 993
             I+HA DVF+KM  +SVASWSA+IGAHA  EMWHECL L  DM  +G  R EESTLV+VL
Sbjct: 184  AIEHARDVFDKMSERSVASWSAVIGAHASAEMWHECLKLFNDMMHDGRYRPEESTLVSVL 243

Query: 994  SACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRY 1173
            SACTHLGSPNLG  +HGILLRN +ELNV+VKTSLIDMY KCG +EKGL VF SMAEKN++
Sbjct: 244  SACTHLGSPNLGSSVHGILLRNTTELNVIVKTSLIDMYAKCGCIEKGLCVFHSMAEKNKH 303

Query: 1174 SYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNM 1353
            SYTVMISGLA+HG G +AL IF EML +GLAPDDVVYVGVLSAC+HAGLVNEGL+ F  M
Sbjct: 304  SYTVMISGLAVHGRGSEALRIFAEMLEQGLAPDDVVYVGVLSACTHAGLVNEGLEFFNRM 363

Query: 1354 QFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEI 1533
            QFEHKI PTIQHYGCMVDLMGRAGM++EAY+LIKSM  KPNDV+WRSLL+ACKV  +LE+
Sbjct: 364  QFEHKIDPTIQHYGCMVDLMGRAGMLREAYELIKSMPRKPNDVLWRSLLNACKVQHNLEL 423

Query: 1534 GEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADKKLVQTPGFSLVEAKR 1713
            GEIAA+N+F+LN +NPGDYLVLANMYA+AQKW D+AKIR +M  K LVQTPGFSLVE KR
Sbjct: 424  GEIAAENIFMLNPHNPGDYLVLANMYARAQKWVDMAKIRTEMVRKGLVQTPGFSLVEVKR 483

Query: 1714 NVYKFVSQDKSQPQWNTIYDMIHQMEWQLKFEGYVPDTSQVLLDVDEEEKRERLKCHSQK 1893
             VYKFVS DKSQP +++IY MIHQMEWQLKFEGY+PDTSQVLLDVDEEEKR+RLK HSQK
Sbjct: 484  RVYKFVSHDKSQPHFHSIYAMIHQMEWQLKFEGYIPDTSQVLLDVDEEEKRQRLKYHSQK 543

Query: 1894 VAIAFSLIHTSSEGSPIRITRNLRMCSDCHTYTKFISMVYEREITVRDRLRFHHFKDGTC 2073
            +AIAF+LIHT SEGSPIRI RNLRMC+DCHTYTKFISM+YEREITVRDR  FHHFKDG C
Sbjct: 544  LAIAFALIHT-SEGSPIRIFRNLRMCNDCHTYTKFISMIYEREITVRDRNLFHHFKDGAC 602

Query: 2074 SCKDYW 2091
            SCKDYW
Sbjct: 603  SCKDYW 608


>ref|XP_015963236.1| pentatricopeptide repeat-containing protein At1g31920 isoform X1
            [Arachis duranensis]
          Length = 608

 Score =  975 bits (2521), Expect = 0.0
 Identities = 475/606 (78%), Positives = 533/606 (87%), Gaps = 1/606 (0%)
 Frame = +1

Query: 277  TGTSVLSQTHFLPLPNN-PPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLF 453
            T T +LS+T FL L NN PP++++L+  F E GW PLLK+CKSMEEFKQVHA +LKLGLF
Sbjct: 4    TETYILSRTQFLSLSNNIPPKNTDLNVRFYEHGWLPLLKRCKSMEEFKQVHAHILKLGLF 63

Query: 454  WDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYV 633
            WD FC SNLVATCAL KWGSMEYACSIFR+IEEP SFEYNTMIRGNVN M L+EAL+LYV
Sbjct: 64   WDHFCGSNLVATCALAKWGSMEYACSIFRRIEEPSSFEYNTMIRGNVNCMNLEEALILYV 123

Query: 634  EMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWG 813
            EML++GIEP+NFTYPFV KACSLLG LKE MQI+  VFKAGLEGD+F+QNSLISMYGK G
Sbjct: 124  EMLKKGIEPDNFTYPFVFKACSLLGALKEGMQIYSHVFKAGLEGDLFLQNSLISMYGKCG 183

Query: 814  EIKHACDVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVL 993
             I+HA DVF+KM  +SVASWSAIIGAHA  EMWHECL L  DM  +G  R EESTLV+VL
Sbjct: 184  AIEHARDVFDKMSERSVASWSAIIGAHASAEMWHECLKLFNDMMHDGRYRPEESTLVSVL 243

Query: 994  SACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRY 1173
            SACTHLGSPNLG  +HGILLRN +ELNV+VKTSLIDMY KCG +EKGL VF SMAEKN++
Sbjct: 244  SACTHLGSPNLGSSVHGILLRNTTELNVIVKTSLIDMYAKCGCIEKGLCVFHSMAEKNKH 303

Query: 1174 SYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNM 1353
            SYTVMISGLA+HG G +AL IF EML +GLAPDDVVYVGVLSAC+HAGLVNEGL+ F  M
Sbjct: 304  SYTVMISGLAVHGCGSEALRIFAEMLEQGLAPDDVVYVGVLSACTHAGLVNEGLEFFNRM 363

Query: 1354 QFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEI 1533
            QFEHKI+PTIQHYGCMVDLMGRAGM++EAY+LIKSM  KPNDV+WRSLL+ACKVH +LE+
Sbjct: 364  QFEHKIEPTIQHYGCMVDLMGRAGMLREAYELIKSMPRKPNDVLWRSLLNACKVHHNLEL 423

Query: 1534 GEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADKKLVQTPGFSLVEAKR 1713
            GEIAA+N+F+LN +NPGDYLVLANMYA+AQKW D+AKIR +M  K LVQTPGFSLVE KR
Sbjct: 424  GEIAAENIFMLNPHNPGDYLVLANMYARAQKWVDMAKIRTEMVRKGLVQTPGFSLVEVKR 483

Query: 1714 NVYKFVSQDKSQPQWNTIYDMIHQMEWQLKFEGYVPDTSQVLLDVDEEEKRERLKCHSQK 1893
             VYKFVS DKSQP +++IY MIHQMEWQLKFEGY+PDTSQVLLDVDEEEKR+RLK HSQK
Sbjct: 484  RVYKFVSHDKSQPHFHSIYAMIHQMEWQLKFEGYIPDTSQVLLDVDEEEKRQRLKYHSQK 543

Query: 1894 VAIAFSLIHTSSEGSPIRITRNLRMCSDCHTYTKFISMVYEREITVRDRLRFHHFKDGTC 2073
            +AIAF+LIHT SEGSPIRI RNLRMC+DCHTYTKFISM+YEREITVRDR  FHHFKDG C
Sbjct: 544  LAIAFALIHT-SEGSPIRIFRNLRMCNDCHTYTKFISMIYEREITVRDRNLFHHFKDGAC 602

Query: 2074 SCKDYW 2091
            SCKDYW
Sbjct: 603  SCKDYW 608


>gb|PNY00351.1| pentatricopeptide repeat-containing protein at1g31920-like protein
            [Trifolium pratense]
          Length = 562

 Score =  967 bits (2499), Expect = 0.0
 Identities = 462/563 (82%), Positives = 517/563 (91%)
 Frame = +1

Query: 403  MEEFKQVHAQVLKLGLFWDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMI 582
            MEEFKQVHA VLK G+F+DSFC SNLVATCALTKWGSM+YACSIF QI+EP SF+YNTMI
Sbjct: 1    MEEFKQVHAHVLKWGIFFDSFCMSNLVATCALTKWGSMDYACSIFNQIDEPSSFDYNTMI 60

Query: 583  RGNVNDMKLKEALLLYVEMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLE 762
            RGNV++MKL+EALLLYVEM+E G+EP+ FTYPFVLKACSLLG   E +Q+HG VFK G E
Sbjct: 61   RGNVSEMKLEEALLLYVEMIEEGVEPDKFTYPFVLKACSLLGACDEGIQVHGHVFKMGFE 120

Query: 763  GDVFVQNSLISMYGKWGEIKHACDVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADM 942
            GDVFVQNSL++MYGK G I  A DVF+K+  KSVASWSAIIGA+ACVEMWHECL+LL +M
Sbjct: 121  GDVFVQNSLVNMYGKCGAINCARDVFDKIGEKSVASWSAIIGAYACVEMWHECLMLLGEM 180

Query: 943  SSEGSCRAEESTLVNVLSACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGS 1122
            S EG CR EESTLVNVLSACTHLGSPNLGKCIHGILLRNISELNVVV+TSLIDMYVKCG 
Sbjct: 181  SIEGRCRVEESTLVNVLSACTHLGSPNLGKCIHGILLRNISELNVVVETSLIDMYVKCGC 240

Query: 1123 LEKGLFVFQSMAEKNRYSYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSA 1302
            LEKGL VF++M+EKN +SYTVMISGLAIHGHG++AL++FT+M+ EG  PD VV+VGVLSA
Sbjct: 241  LEKGLRVFENMSEKNIFSYTVMISGLAIHGHGKEALKVFTQMVEEGFEPDHVVFVGVLSA 300

Query: 1303 CSHAGLVNEGLQCFKNMQFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDV 1482
            CSHAGLV+EGLQCFK MQFE KI PT+QHYGCMVDL+GR GM+KEAY+LIKSMSIKPNDV
Sbjct: 301  CSHAGLVDEGLQCFKTMQFEKKINPTVQHYGCMVDLLGRIGMLKEAYELIKSMSIKPNDV 360

Query: 1483 VWRSLLSACKVHLDLEIGEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMA 1662
            +WRSLLSACKVHL+LEI EIAA+NLF+LN +NPGDYLVLANMYAK QKWDDVAKIR+KMA
Sbjct: 361  IWRSLLSACKVHLNLEIAEIAAENLFVLNQSNPGDYLVLANMYAKVQKWDDVAKIRKKMA 420

Query: 1663 DKKLVQTPGFSLVEAKRNVYKFVSQDKSQPQWNTIYDMIHQMEWQLKFEGYVPDTSQVLL 1842
            +K LVQTPGFSL+EAKR VYKFVSQD+S PQWNTIYDMIHQMEWQLKFEGY+PDTSQVLL
Sbjct: 421  EKSLVQTPGFSLIEAKRKVYKFVSQDRSIPQWNTIYDMIHQMEWQLKFEGYIPDTSQVLL 480

Query: 1843 DVDEEEKRERLKCHSQKVAIAFSLIHTSSEGSPIRITRNLRMCSDCHTYTKFISMVYERE 2022
            DV+EEEK+ERLK HSQK+AIAF LIHT SEGSP+RITRNLRMCSDCHTYTK+ISM+YERE
Sbjct: 481  DVEEEEKKERLKYHSQKLAIAFGLIHT-SEGSPLRITRNLRMCSDCHTYTKYISMIYERE 539

Query: 2023 ITVRDRLRFHHFKDGTCSCKDYW 2091
            IT+RDR RFHHFK+GTCSCKDYW
Sbjct: 540  ITIRDRHRFHHFKNGTCSCKDYW 562


>gb|KRH39618.1| hypothetical protein GLYMA_09G209700 [Glycine max]
          Length = 562

 Score =  961 bits (2484), Expect = 0.0
 Identities = 471/563 (83%), Positives = 514/563 (91%)
 Frame = +1

Query: 403  MEEFKQVHAQVLKLGLFWDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMI 582
            MEEFKQVHA +LKLGLF+DSFC SNLVATCAL++WGSMEYACSIFRQIEEPGSFEYNTMI
Sbjct: 1    MEEFKQVHAHILKLGLFYDSFCGSNLVATCALSRWGSMEYACSIFRQIEEPGSFEYNTMI 60

Query: 583  RGNVNDMKLKEALLLYVEMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLE 762
            RGNVN M L+EALLLYVEMLERGIEP+NFTYPFVLKACSLLG LKE +QIH  VFKAGLE
Sbjct: 61   RGNVNSMNLEEALLLYVEMLERGIEPDNFTYPFVLKACSLLGALKEGVQIHAHVFKAGLE 120

Query: 763  GDVFVQNSLISMYGKWGEIKHACDVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADM 942
            GDVFVQN LI+MYGK G I+HA  VFE+M+ KSVASWS+IIGAHA VEMWHECL+LL DM
Sbjct: 121  GDVFVQNGLINMYGKCGAIEHASVVFEQMDEKSVASWSSIIGAHASVEMWHECLMLLGDM 180

Query: 943  SSEGSCRAEESTLVNVLSACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGS 1122
            S EG  RAEES LV+ LSACTHLGSPN G+CIHGILLRNISELNV VKTSLIDMYVK GS
Sbjct: 181  SGEGRHRAEESILVSALSACTHLGSPNFGRCIHGILLRNISELNVAVKTSLIDMYVKSGS 240

Query: 1123 LEKGLFVFQSMAEKNRYSYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSA 1302
            LEKGL VFQ+MA+KNRYSYTV+I+GLAIHG GR+AL +F++ML EGLAPDDVVYVGVLSA
Sbjct: 241  LEKGLCVFQNMAQKNRYSYTVIITGLAIHGRGREALSVFSDMLEEGLAPDDVVYVGVLSA 300

Query: 1303 CSHAGLVNEGLQCFKNMQFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDV 1482
            CSHAGLVNEGLQCF  +QFEHKIKPTIQHYGCMVDLMGRAGM+K AYDLIKSM IKPNDV
Sbjct: 301  CSHAGLVNEGLQCFNRLQFEHKIKPTIQHYGCMVDLMGRAGMLKGAYDLIKSMPIKPNDV 360

Query: 1483 VWRSLLSACKVHLDLEIGEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMA 1662
            VWRSLLSACKVH +LEIGEIAA+N+F LN +NPGDYLVLANMYA+A+KW DVA+IR +MA
Sbjct: 361  VWRSLLSACKVHHNLEIGEIAAENIFKLNQHNPGDYLVLANMYARAKKWADVARIRTEMA 420

Query: 1663 DKKLVQTPGFSLVEAKRNVYKFVSQDKSQPQWNTIYDMIHQMEWQLKFEGYVPDTSQVLL 1842
            +K LVQTPGFSLVEA RNVYKFVSQDKSQPQ  TIYDMI QMEWQLKFEGY PD SQVLL
Sbjct: 421  EKHLVQTPGFSLVEANRNVYKFVSQDKSQPQCETIYDMIQQMEWQLKFEGYTPDMSQVLL 480

Query: 1843 DVDEEEKRERLKCHSQKVAIAFSLIHTSSEGSPIRITRNLRMCSDCHTYTKFISMVYERE 2022
            DVDE+EKR+RLK HSQK+AIAF+LI T SEGS IRI+RN+RMC+DCHTYTKFIS++YERE
Sbjct: 481  DVDEDEKRQRLKHHSQKLAIAFALIQT-SEGSRIRISRNIRMCNDCHTYTKFISVIYERE 539

Query: 2023 ITVRDRLRFHHFKDGTCSCKDYW 2091
            ITVRDR RFHHFKDGTCSCKDYW
Sbjct: 540  ITVRDRNRFHHFKDGTCSCKDYW 562


>gb|KHN40416.1| Pentatricopeptide repeat-containing protein [Glycine soja]
          Length = 562

 Score =  956 bits (2470), Expect = 0.0
 Identities = 470/563 (83%), Positives = 512/563 (90%)
 Frame = +1

Query: 403  MEEFKQVHAQVLKLGLFWDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMI 582
            MEEFKQVHA +LKLGLF+DSFC SNLVA+CAL++WGSMEYACSIFRQIEEPGSFEYNTMI
Sbjct: 1    MEEFKQVHAHILKLGLFYDSFCGSNLVASCALSRWGSMEYACSIFRQIEEPGSFEYNTMI 60

Query: 583  RGNVNDMKLKEALLLYVEMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLE 762
            RGNVN M L+EALLLYVEMLERGIEP+NFTYPFVLKACSLL  LKE +QIH  VFKAGLE
Sbjct: 61   RGNVNSMDLEEALLLYVEMLERGIEPDNFTYPFVLKACSLLVALKEGVQIHAHVFKAGLE 120

Query: 763  GDVFVQNSLISMYGKWGEIKHACDVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADM 942
             DVFVQN LISMYGK G I+HA  VFE+M+ KSVASWS+IIGAHA VEMWHECL+LL DM
Sbjct: 121  VDVFVQNGLISMYGKCGAIEHAGVVFEQMDEKSVASWSSIIGAHASVEMWHECLMLLGDM 180

Query: 943  SSEGSCRAEESTLVNVLSACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGS 1122
            S EG  RAEES LV+ LSACTHLGSPNLG+CIHGILLRNISELNVVVKTSLIDMYVKCGS
Sbjct: 181  SGEGRHRAEESILVSALSACTHLGSPNLGRCIHGILLRNISELNVVVKTSLIDMYVKCGS 240

Query: 1123 LEKGLFVFQSMAEKNRYSYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSA 1302
            LEKGL VFQ+MA KNRYSYTVMI+GLAIHG GR+A+ +F++ML EGL PDDVVYVGVLSA
Sbjct: 241  LEKGLCVFQNMAHKNRYSYTVMIAGLAIHGRGREAVRVFSDMLEEGLTPDDVVYVGVLSA 300

Query: 1303 CSHAGLVNEGLQCFKNMQFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDV 1482
            CSHAGLV EGLQCF  MQFEH IKPTIQHYGCMVDLMGRAGM+KEAYDLIKSM IKPNDV
Sbjct: 301  CSHAGLVKEGLQCFNRMQFEHMIKPTIQHYGCMVDLMGRAGMLKEAYDLIKSMPIKPNDV 360

Query: 1483 VWRSLLSACKVHLDLEIGEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMA 1662
            VWRSLLSACKVH +LEIGEIAA+N+F LN +NPGDYLVLANMYA+A+KW +VA+IR +MA
Sbjct: 361  VWRSLLSACKVHHNLEIGEIAAENIFRLNKHNPGDYLVLANMYARAKKWANVARIRTEMA 420

Query: 1663 DKKLVQTPGFSLVEAKRNVYKFVSQDKSQPQWNTIYDMIHQMEWQLKFEGYVPDTSQVLL 1842
            +K LVQTPGFSLVEA RNVYKFVSQDKSQP   TIYDMI QMEWQLKFEGY PD SQVLL
Sbjct: 421  EKHLVQTPGFSLVEANRNVYKFVSQDKSQPICETIYDMIQQMEWQLKFEGYTPDMSQVLL 480

Query: 1843 DVDEEEKRERLKCHSQKVAIAFSLIHTSSEGSPIRITRNLRMCSDCHTYTKFISMVYERE 2022
            DVDE+EKR+RLK HSQK+AIAF+LI T SEGSP+RI+RNLRMC+DCHTYTKFIS++YERE
Sbjct: 481  DVDEDEKRQRLKHHSQKLAIAFALIQT-SEGSPVRISRNLRMCNDCHTYTKFISVIYERE 539

Query: 2023 ITVRDRLRFHHFKDGTCSCKDYW 2091
            ITVRDR RFHHFKDGTCSCKDYW
Sbjct: 540  ITVRDRNRFHHFKDGTCSCKDYW 562


>ref|XP_020997576.1| pentatricopeptide repeat-containing protein At1g31920 isoform X2
            [Arachis duranensis]
          Length = 562

 Score =  928 bits (2398), Expect = 0.0
 Identities = 451/563 (80%), Positives = 501/563 (88%)
 Frame = +1

Query: 403  MEEFKQVHAQVLKLGLFWDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMI 582
            MEEFKQVHA +LKLGLFWD FC SNLVATCAL KWGSMEYACSIFR+IEEP SFEYNTMI
Sbjct: 1    MEEFKQVHAHILKLGLFWDHFCGSNLVATCALAKWGSMEYACSIFRRIEEPSSFEYNTMI 60

Query: 583  RGNVNDMKLKEALLLYVEMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLE 762
            RGNVN M L+EAL+LYVEML++GIEP+NFTYPFV KACSLLG LKE MQI+  VFKAGLE
Sbjct: 61   RGNVNCMNLEEALILYVEMLKKGIEPDNFTYPFVFKACSLLGALKEGMQIYSHVFKAGLE 120

Query: 763  GDVFVQNSLISMYGKWGEIKHACDVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADM 942
            GD+F+QNSLISMYGK G I+HA DVF+KM  +SVASWSAIIGAHA  EMWHECL L  DM
Sbjct: 121  GDLFLQNSLISMYGKCGAIEHARDVFDKMSERSVASWSAIIGAHASAEMWHECLKLFNDM 180

Query: 943  SSEGSCRAEESTLVNVLSACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGS 1122
              +G  R EESTLV+VLSACTHLGSPNLG  +HGILLRN +ELNV+VKTSLIDMY KCG 
Sbjct: 181  MHDGRYRPEESTLVSVLSACTHLGSPNLGSSVHGILLRNTTELNVIVKTSLIDMYAKCGC 240

Query: 1123 LEKGLFVFQSMAEKNRYSYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSA 1302
            +EKGL VF SMAEKN++SYTVMISGLA+HG G +AL IF EML +GLAPDDVVYVGVLSA
Sbjct: 241  IEKGLCVFHSMAEKNKHSYTVMISGLAVHGCGSEALRIFAEMLEQGLAPDDVVYVGVLSA 300

Query: 1303 CSHAGLVNEGLQCFKNMQFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDV 1482
            C+HAGLVNEGL+ F  MQFEHKI+PTIQHYGCMVDLMGRAGM++EAY+LIKSM  KPNDV
Sbjct: 301  CTHAGLVNEGLEFFNRMQFEHKIEPTIQHYGCMVDLMGRAGMLREAYELIKSMPRKPNDV 360

Query: 1483 VWRSLLSACKVHLDLEIGEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMA 1662
            +WRSLL+ACKVH +LE+GEIAA+N+F+LN +NPGDYLVLANMYA+AQKW D+AKIR +M 
Sbjct: 361  LWRSLLNACKVHHNLELGEIAAENIFMLNPHNPGDYLVLANMYARAQKWVDMAKIRTEMV 420

Query: 1663 DKKLVQTPGFSLVEAKRNVYKFVSQDKSQPQWNTIYDMIHQMEWQLKFEGYVPDTSQVLL 1842
             K LVQTPGFSLVE KR VYKFVS DKSQP +++IY MIHQMEWQLKFEGY+PDTSQVLL
Sbjct: 421  RKGLVQTPGFSLVEVKRRVYKFVSHDKSQPHFHSIYAMIHQMEWQLKFEGYIPDTSQVLL 480

Query: 1843 DVDEEEKRERLKCHSQKVAIAFSLIHTSSEGSPIRITRNLRMCSDCHTYTKFISMVYERE 2022
            DVDEEEKR+RLK HSQK+AIAF+LIHT SEGSPIRI RNLRMC+DCHTYTKFISM+YERE
Sbjct: 481  DVDEEEKRQRLKYHSQKLAIAFALIHT-SEGSPIRIFRNLRMCNDCHTYTKFISMIYERE 539

Query: 2023 ITVRDRLRFHHFKDGTCSCKDYW 2091
            ITVRDR  FHHFKDG CSCKDYW
Sbjct: 540  ITVRDRNLFHHFKDGACSCKDYW 562


Top