BLASTX nr result

ID: Astragalus23_contig00004122 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00004122
         (1677 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_020226361.1| pentatricopeptide repeat-containing protein ...   781   0.0  
ref|XP_020226366.1| pentatricopeptide repeat-containing protein ...   778   0.0  
gb|KYP54428.1| Pentatricopeptide repeat-containing protein At1g3...   778   0.0  
ref|XP_007158217.1| hypothetical protein PHAVU_002G134100g [Phas...   777   0.0  
ref|XP_004512460.1| PREDICTED: pentatricopeptide repeat-containi...   776   0.0  
ref|XP_006572946.1| PREDICTED: pentatricopeptide repeat-containi...   775   0.0  
ref|XP_017426458.1| PREDICTED: pentatricopeptide repeat-containi...   774   0.0  
ref|XP_003533450.1| PREDICTED: pentatricopeptide repeat-containi...   771   0.0  
ref|XP_014521259.1| pentatricopeptide repeat-containing protein ...   770   0.0  
gb|KHN40422.1| Pentatricopeptide repeat-containing protein [Glyc...   769   0.0  
ref|XP_006572948.1| PREDICTED: pentatricopeptide repeat-containi...   766   0.0  
ref|XP_020226365.1| pentatricopeptide repeat-containing protein ...   764   0.0  
gb|OIV94429.1| hypothetical protein TanjilG_25491 [Lupinus angus...   754   0.0  
ref|XP_019422179.1| PREDICTED: pentatricopeptide repeat-containi...   754   0.0  
ref|XP_020237290.1| pentatricopeptide repeat-containing protein ...   744   0.0  
ref|XP_016201186.1| pentatricopeptide repeat-containing protein ...   729   0.0  
ref|XP_015963236.1| pentatricopeptide repeat-containing protein ...   728   0.0  
gb|KRH39618.1| hypothetical protein GLYMA_09G209700 [Glycine max]     718   0.0  
gb|KHN40416.1| Pentatricopeptide repeat-containing protein [Glyc...   712   0.0  
gb|PNY00351.1| pentatricopeptide repeat-containing protein at1g3...   706   0.0  

>ref|XP_020226361.1| pentatricopeptide repeat-containing protein At1g31920-like [Cajanus
            cajan]
 ref|XP_020226362.1| pentatricopeptide repeat-containing protein At1g31920-like [Cajanus
            cajan]
 ref|XP_020226363.1| pentatricopeptide repeat-containing protein At1g31920-like [Cajanus
            cajan]
          Length = 607

 Score =  781 bits (2018), Expect = 0.0
 Identities = 382/464 (82%), Positives = 423/464 (91%)
 Frame = +1

Query: 283  SGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLFW 462
            SGTSVLSQTH L LPNNPPQSSEL++ FN+KGW  LLK+CKSMEEFKQVHAQ+LKLGLF 
Sbjct: 2    SGTSVLSQTHLLSLPNNPPQSSELNSKFNDKGWLSLLKRCKSMEEFKQVHAQILKLGLFL 61

Query: 463  DSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYVE 642
            DSFC SNLVATCAL+KWGSMEYACSIFRQIEEPGSFEYNTMIRG+VN++ L+EAL LYVE
Sbjct: 62   DSFCGSNLVATCALSKWGSMEYACSIFRQIEEPGSFEYNTMIRGSVNNVNLEEALFLYVE 121

Query: 643  MLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWGA 822
            MLERGIEP+ FTYPFV KACSLLG LKE +QIHG +FKAGL+GD FVQNSLISMYGK  A
Sbjct: 122  MLERGIEPDKFTYPFVFKACSLLGALKEGVQIHGHIFKAGLDGDTFVQNSLISMYGKCRA 181

Query: 823  IKHACNVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVLS 1002
            IKHA  VFE+M+ +SVASWSAIIGAHA VEMW ECL+LL DMSSEG  RAEES LV+ LS
Sbjct: 182  IKHAYAVFEQMDERSVASWSAIIGAHASVEMWQECLMLLGDMSSEGRHRAEESILVSALS 241

Query: 1003 ACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRYS 1182
            ACTHLGSP LG+CIHGILLRNIS+LNVVVKTSLIDMYVKCG LEKGL VFQ+MAEKNR+S
Sbjct: 242  ACTHLGSPILGRCIHGILLRNISKLNVVVKTSLIDMYVKCGCLEKGLSVFQNMAEKNRFS 301

Query: 1183 YTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNMQ 1362
            YTVMI+GLAIHG GR+AL +F++ML EGLAPDDVVYVGVLSACSHAGLVNEGLQCF  M+
Sbjct: 302  YTVMIAGLAIHGRGREALRVFSDMLEEGLAPDDVVYVGVLSACSHAGLVNEGLQCFNRMR 361

Query: 1363 FEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEIG 1542
            FEHKIKPT+QHYGCMVDLMGRAGM++EAYDLIK M IKPNDVVWRSLLSACKVHL+LEIG
Sbjct: 362  FEHKIKPTVQHYGCMVDLMGRAGMLREAYDLIKRMPIKPNDVVWRSLLSACKVHLNLEIG 421

Query: 1543 EIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADK 1674
             IAA+NLF LN +NPGDYL+LANM+AKA+KWDDVA+IR +MA+K
Sbjct: 422  VIAAENLFKLNQHNPGDYLMLANMFAKAKKWDDVARIRTEMAEK 465


>ref|XP_020226366.1| pentatricopeptide repeat-containing protein At1g31920-like [Cajanus
            cajan]
          Length = 590

 Score =  778 bits (2008), Expect = 0.0
 Identities = 380/464 (81%), Positives = 421/464 (90%)
 Frame = +1

Query: 283  SGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLFW 462
            SGTSVLSQTH L LPNNPPQSSEL++ FN+KGW  LLK+CK MEEFKQVHAQ+LKLGLF 
Sbjct: 2    SGTSVLSQTHLLSLPNNPPQSSELNSKFNDKGWLSLLKRCKCMEEFKQVHAQILKLGLFL 61

Query: 463  DSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYVE 642
            DSFC SNLVATCAL+KWGSMEYACSIFRQIEEPGSFEYNTMIRG+VN+M L+EAL LYVE
Sbjct: 62   DSFCGSNLVATCALSKWGSMEYACSIFRQIEEPGSFEYNTMIRGSVNNMNLEEALFLYVE 121

Query: 643  MLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWGA 822
            MLERGIEP+ FTYPFV KACSLLG LKE +QIHG +FKAGL+GD FVQNSLISMYGK GA
Sbjct: 122  MLERGIEPDKFTYPFVFKACSLLGALKEGVQIHGHIFKAGLDGDTFVQNSLISMYGKCGA 181

Query: 823  IKHACNVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVLS 1002
            IKHA  VFE+M+ +SVASWSAIIGAHA VEMW ECL+LL DMSSEG  RAEES LV+ LS
Sbjct: 182  IKHAYAVFEQMDERSVASWSAIIGAHASVEMWQECLMLLGDMSSEGRHRAEESILVSALS 241

Query: 1003 ACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRYS 1182
            ACTHLGSP LG+CIHGILLRNISELNVVVKTSLIDMYVKCG LEKGL VFQ+MAEKNR+S
Sbjct: 242  ACTHLGSPILGRCIHGILLRNISELNVVVKTSLIDMYVKCGCLEKGLSVFQNMAEKNRFS 301

Query: 1183 YTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNMQ 1362
            YTVMI+GLAIHG GR+AL +F++M+ EGLAPDDVVYVGVLSACSHAGLVNEGLQ F  M+
Sbjct: 302  YTVMIAGLAIHGRGREALRVFSDMMEEGLAPDDVVYVGVLSACSHAGLVNEGLQFFNRMR 361

Query: 1363 FEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEIG 1542
            FEHKIKPT+QHYGCMVDLMGR GM++EAYDLIK + IKPNDVVWRSLLSACKVHL+LEIG
Sbjct: 362  FEHKIKPTVQHYGCMVDLMGRVGMLREAYDLIKRVPIKPNDVVWRSLLSACKVHLNLEIG 421

Query: 1543 EIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADK 1674
             IAAKNLF LN +NPGDYL+LANM+A+A+KW+DVA+IR KMA+K
Sbjct: 422  VIAAKNLFKLNQHNPGDYLMLANMFARAKKWNDVARIRTKMAEK 465


>gb|KYP54428.1| Pentatricopeptide repeat-containing protein At1g31920 family [Cajanus
            cajan]
          Length = 601

 Score =  778 bits (2008), Expect = 0.0
 Identities = 380/464 (81%), Positives = 421/464 (90%)
 Frame = +1

Query: 283  SGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLFW 462
            SGTSVLSQTH L LPNNPPQSSEL++ FN+KGW  LLK+CK MEEFKQVHAQ+LKLGLF 
Sbjct: 2    SGTSVLSQTHLLSLPNNPPQSSELNSKFNDKGWLSLLKRCKCMEEFKQVHAQILKLGLFL 61

Query: 463  DSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYVE 642
            DSFC SNLVATCAL+KWGSMEYACSIFRQIEEPGSFEYNTMIRG+VN+M L+EAL LYVE
Sbjct: 62   DSFCGSNLVATCALSKWGSMEYACSIFRQIEEPGSFEYNTMIRGSVNNMNLEEALFLYVE 121

Query: 643  MLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWGA 822
            MLERGIEP+ FTYPFV KACSLLG LKE +QIHG +FKAGL+GD FVQNSLISMYGK GA
Sbjct: 122  MLERGIEPDKFTYPFVFKACSLLGALKEGVQIHGHIFKAGLDGDTFVQNSLISMYGKCGA 181

Query: 823  IKHACNVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVLS 1002
            IKHA  VFE+M+ +SVASWSAIIGAHA VEMW ECL+LL DMSSEG  RAEES LV+ LS
Sbjct: 182  IKHAYAVFEQMDERSVASWSAIIGAHASVEMWQECLMLLGDMSSEGRHRAEESILVSALS 241

Query: 1003 ACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRYS 1182
            ACTHLGSP LG+CIHGILLRNISELNVVVKTSLIDMYVKCG LEKGL VFQ+MAEKNR+S
Sbjct: 242  ACTHLGSPILGRCIHGILLRNISELNVVVKTSLIDMYVKCGCLEKGLSVFQNMAEKNRFS 301

Query: 1183 YTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNMQ 1362
            YTVMI+GLAIHG GR+AL +F++M+ EGLAPDDVVYVGVLSACSHAGLVNEGLQ F  M+
Sbjct: 302  YTVMIAGLAIHGRGREALRVFSDMMEEGLAPDDVVYVGVLSACSHAGLVNEGLQFFNRMR 361

Query: 1363 FEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEIG 1542
            FEHKIKPT+QHYGCMVDLMGR GM++EAYDLIK + IKPNDVVWRSLLSACKVHL+LEIG
Sbjct: 362  FEHKIKPTVQHYGCMVDLMGRVGMLREAYDLIKRVPIKPNDVVWRSLLSACKVHLNLEIG 421

Query: 1543 EIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADK 1674
             IAAKNLF LN +NPGDYL+LANM+A+A+KW+DVA+IR KMA+K
Sbjct: 422  VIAAKNLFKLNQHNPGDYLMLANMFARAKKWNDVARIRTKMAEK 465


>ref|XP_007158217.1| hypothetical protein PHAVU_002G134100g [Phaseolus vulgaris]
 gb|ESW30211.1| hypothetical protein PHAVU_002G134100g [Phaseolus vulgaris]
          Length = 605

 Score =  777 bits (2007), Expect = 0.0
 Identities = 380/464 (81%), Positives = 419/464 (90%)
 Frame = +1

Query: 283  SGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLFW 462
            SGTSVL Q+H L LPNNPPQ+SEL+  FNE+GW  LLK+CKSMEEFKQVHAQ+LKLGLF 
Sbjct: 2    SGTSVLCQSHLLSLPNNPPQNSELNAKFNEQGWLSLLKRCKSMEEFKQVHAQILKLGLFL 61

Query: 463  DSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYVE 642
            DSFC SNLVATCAL++WGSMEYACSIFRQIEEPGSFEYNTMIRGNVN+M L++ALLLYVE
Sbjct: 62   DSFCGSNLVATCALSRWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNNMNLEKALLLYVE 121

Query: 643  MLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWGA 822
            MLE+GIE +NFTYPFVLKACSLLG LKE +QIHGQVFKAGLE D FVQN LISMYGK G 
Sbjct: 122  MLEKGIEHDNFTYPFVLKACSLLGALKEGVQIHGQVFKAGLEDDTFVQNGLISMYGKCGE 181

Query: 823  IKHACNVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVLS 1002
            I HAC +FE+M+ KSVASWS+IIGAHA VE+W +CL+LL DMSSEG  RAEES LV  LS
Sbjct: 182  INHACALFEQMDEKSVASWSSIIGAHARVELWQDCLMLLGDMSSEGRHRAEESILVTALS 241

Query: 1003 ACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRYS 1182
            ACTHLGSPNLG+CIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGL VFQSMA KNRYS
Sbjct: 242  ACTHLGSPNLGRCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLCVFQSMAVKNRYS 301

Query: 1183 YTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNMQ 1362
            YTVMISGLA HG GR+AL +F+EM+ EGLAPDDVVYVGVLSACSHAGLVNEGLQCF +MQ
Sbjct: 302  YTVMISGLAFHGRGREALRVFSEMVEEGLAPDDVVYVGVLSACSHAGLVNEGLQCFNSMQ 361

Query: 1363 FEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEIG 1542
              HKIKPTIQHYGCMVDLMGRAGM+KEA DLIK M IKPNDV+WRSLLSACKVHL+LEIG
Sbjct: 362  LVHKIKPTIQHYGCMVDLMGRAGMLKEACDLIKGMQIKPNDVIWRSLLSACKVHLNLEIG 421

Query: 1543 EIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADK 1674
            E+AA+N+F LN +NPGDYLVLA+MYA+AQKW DVA+IR +MA+K
Sbjct: 422  EVAAENVFKLNQHNPGDYLVLASMYARAQKWTDVARIRTEMAEK 465


>ref|XP_004512460.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
            [Cicer arietinum]
          Length = 606

 Score =  776 bits (2005), Expect = 0.0
 Identities = 382/465 (82%), Positives = 420/465 (90%), Gaps = 1/465 (0%)
 Frame = +1

Query: 283  SGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLFW 462
            +GT+ L+QTHFL L NN  QS ELS +FNEKGW  LLK+C +MEEFKQVHA  LK G+F+
Sbjct: 2    TGTTALNQTHFLLLTNNSHQSFELSKSFNEKGWLCLLKRCNNMEEFKQVHAYFLKCGIFF 61

Query: 463  DSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYVE 642
            DSFC SNLVATCALTKWGSM+YACSIF QIEEP SF+YNTMIRGNVN+MKL EALLLYVE
Sbjct: 62   DSFCGSNLVATCALTKWGSMDYACSIFTQIEEPCSFDYNTMIRGNVNNMKLDEALLLYVE 121

Query: 643  MLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWGA 822
            MLERGIEP+ FTYPFVLKACSLLG LKE +QIHG V K GLEGD+FV+NSLI+MYGK GA
Sbjct: 122  MLERGIEPDKFTYPFVLKACSLLGALKEGVQIHGHVLKTGLEGDLFVENSLINMYGKCGA 181

Query: 823  IKHACNVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADM-SSEGSCRAEESTLVNVL 999
            IK AC+VF+KM  +SVASWSAIIGAH CVEMWHECLVLL DM SSEG CR EESTLV+VL
Sbjct: 182  IKDACDVFDKMGERSVASWSAIIGAHVCVEMWHECLVLLGDMMSSEGRCRPEESTLVSVL 241

Query: 1000 SACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRY 1179
            SACTHLGS NLG+ IHG LLRNISELNVVVKTSLIDMYVKCG LEKGL VF++M EKNRY
Sbjct: 242  SACTHLGSYNLGRFIHGNLLRNISELNVVVKTSLIDMYVKCGCLEKGLHVFRNMPEKNRY 301

Query: 1180 SYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNM 1359
            SYTVMISGLA+HGHG++ALE+F+EM+ +GL PDDVVYVGVLSACSHAGLV+EGLQCFK M
Sbjct: 302  SYTVMISGLAVHGHGKEALEVFSEMVEQGLEPDDVVYVGVLSACSHAGLVDEGLQCFKRM 361

Query: 1360 QFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEI 1539
            QFEHKIKPTIQHYGCMVDLMGR+GM+KEAY+LIKSM IKPNDVVWRSLLSACKVHL+LEI
Sbjct: 362  QFEHKIKPTIQHYGCMVDLMGRSGMLKEAYELIKSMPIKPNDVVWRSLLSACKVHLNLEI 421

Query: 1540 GEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADK 1674
            G+IAA NLF+LN NNPGDYLVLANMYAK QKWD+VAKIRRKMADK
Sbjct: 422  GQIAADNLFMLNPNNPGDYLVLANMYAKVQKWDEVAKIRRKMADK 466


>ref|XP_006572946.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920-like
            [Glycine max]
 gb|KRH74305.1| hypothetical protein GLYMA_01G011300 [Glycine max]
          Length = 605

 Score =  775 bits (2001), Expect = 0.0
 Identities = 381/464 (82%), Positives = 419/464 (90%)
 Frame = +1

Query: 283  SGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLFW 462
            SGTSVL Q+H L LPN+PPQSSEL+  FNE+GW  LLK+CKSMEEFKQVHA +LKLGLF+
Sbjct: 2    SGTSVLCQSHLLSLPNSPPQSSELNAKFNEQGWLSLLKRCKSMEEFKQVHAHILKLGLFY 61

Query: 463  DSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYVE 642
            DSFC SNLVA+CAL++WGSMEYACSIF QIEEPGSFEYNTMIRGNVN M L+EALLLYVE
Sbjct: 62   DSFCGSNLVASCALSRWGSMEYACSIFSQIEEPGSFEYNTMIRGNVNSMDLEEALLLYVE 121

Query: 643  MLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWGA 822
            MLERGIEP+NFTYPFVLKACSLL  LKE +QIH  VFKAGLE DVFVQN LISMYGK GA
Sbjct: 122  MLERGIEPDNFTYPFVLKACSLLVALKEGVQIHAHVFKAGLEVDVFVQNGLISMYGKCGA 181

Query: 823  IKHACNVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVLS 1002
            I+HA  VFE+M+ KSVASWS+IIGAHA VEMWHECL+LL DMS EG  RAEES LV+ LS
Sbjct: 182  IEHAGVVFEQMDEKSVASWSSIIGAHASVEMWHECLMLLGDMSGEGRHRAEESILVSALS 241

Query: 1003 ACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRYS 1182
            ACTHLGSPNLG+CIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGL VFQ+MA KNRYS
Sbjct: 242  ACTHLGSPNLGRCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLCVFQNMAHKNRYS 301

Query: 1183 YTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNMQ 1362
            YTVMI+GLAIHG GR+A+ +F++ML EGL PDDVVYVGVLSACSHAGLVNEGLQCF  MQ
Sbjct: 302  YTVMIAGLAIHGRGREAVRVFSDMLEEGLTPDDVVYVGVLSACSHAGLVNEGLQCFNRMQ 361

Query: 1363 FEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEIG 1542
            FEH IKPTIQHYGCMVDLMGRAGM+KEAYDLIKSM IKPNDVVWRSLLSACKVH +LEIG
Sbjct: 362  FEHMIKPTIQHYGCMVDLMGRAGMLKEAYDLIKSMPIKPNDVVWRSLLSACKVHHNLEIG 421

Query: 1543 EIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADK 1674
            EIAA+N+F LN +NPGDYLVLANMYA+A+KW +VA+IR +MA+K
Sbjct: 422  EIAAENIFRLNKHNPGDYLVLANMYARAKKWANVARIRTEMAEK 465


>ref|XP_017426458.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
            [Vigna angularis]
 gb|KOM45472.1| hypothetical protein LR48_Vigan06g077800 [Vigna angularis]
 dbj|BAT99683.1| hypothetical protein VIGAN_10118700 [Vigna angularis var. angularis]
          Length = 605

 Score =  774 bits (1999), Expect = 0.0
 Identities = 372/464 (80%), Positives = 421/464 (90%)
 Frame = +1

Query: 283  SGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLFW 462
            SGTSVL Q+H L LPNNPPQ+SEL+  FNE+GW  LLK+CKSMEEFKQVHAQ+LKLGLFW
Sbjct: 2    SGTSVLCQSHLLSLPNNPPQNSELNAKFNEQGWLSLLKRCKSMEEFKQVHAQILKLGLFW 61

Query: 463  DSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYVE 642
            DSFC SNLVATCAL++WGSMEYACSIFRQIEEPGSFEYNTMIRGNVN++ L++ALLLYVE
Sbjct: 62   DSFCGSNLVATCALSRWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNNLNLEKALLLYVE 121

Query: 643  MLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWGA 822
            MLE+GIE +NFTYPFVLKACSLLG LKE +Q+HGQVFKAGLE D +V N LISMYGK G 
Sbjct: 122  MLEKGIEHDNFTYPFVLKACSLLGALKEGVQVHGQVFKAGLEDDTYVHNGLISMYGKCGE 181

Query: 823  IKHACNVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVLS 1002
            I HAC+VFE+M+ +SVASWS+IIGAHA VE+W +CL+LL DMS+EG  RAEES LV+ LS
Sbjct: 182  INHACDVFEQMDERSVASWSSIIGAHASVELWQDCLMLLGDMSNEGRHRAEESILVSALS 241

Query: 1003 ACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRYS 1182
            ACTHLGSP++G+CIHGILLRNISELNVVVKTSLIDMY+KCG+L+KGL VFQ+MA KNRYS
Sbjct: 242  ACTHLGSPDIGRCIHGILLRNISELNVVVKTSLIDMYIKCGNLDKGLCVFQNMAVKNRYS 301

Query: 1183 YTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNMQ 1362
            YTVMISGLA HG GR+AL +F EM+ EGLAPDDVVYVGVLSACSHAGLVNEGLQCF +MQ
Sbjct: 302  YTVMISGLAFHGRGREALRVFCEMVEEGLAPDDVVYVGVLSACSHAGLVNEGLQCFNHMQ 361

Query: 1363 FEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEIG 1542
              HKIKPTIQHYGCMVDLMGRAGM+KEAY+LIK M IKPNDVVWRSLLSACKVHL+LEIG
Sbjct: 362  LVHKIKPTIQHYGCMVDLMGRAGMLKEAYELIKGMPIKPNDVVWRSLLSACKVHLNLEIG 421

Query: 1543 EIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADK 1674
            EIAA+N+F LN +NPGDYLVLA+MYA+AQKW DVA+IR +MA+K
Sbjct: 422  EIAAENIFKLNQHNPGDYLVLASMYARAQKWTDVARIRTEMAEK 465


>ref|XP_003533450.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920-like
            [Glycine max]
          Length = 604

 Score =  771 bits (1990), Expect = 0.0
 Identities = 381/464 (82%), Positives = 419/464 (90%)
 Frame = +1

Query: 283  SGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLFW 462
            S TSVL Q+HFL LPNNPPQSSEL+  FN +G   LLK+CKSMEEFKQVHA +LKLGLF+
Sbjct: 2    SWTSVLCQSHFLSLPNNPPQSSELNAKFNVQG-LSLLKRCKSMEEFKQVHAHILKLGLFY 60

Query: 463  DSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYVE 642
            DSFC SNLVATCAL++WGSMEYACSIFRQIEEPGSFEYNTMIRGNVN M L+EALLLYVE
Sbjct: 61   DSFCGSNLVATCALSRWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNSMNLEEALLLYVE 120

Query: 643  MLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWGA 822
            MLERGIEP+NFTYPFVLKACSLLG LKE +QIH  VFKAGLEGDVFVQN LI+MYGK GA
Sbjct: 121  MLERGIEPDNFTYPFVLKACSLLGALKEGVQIHAHVFKAGLEGDVFVQNGLINMYGKCGA 180

Query: 823  IKHACNVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVLS 1002
            I+HA  VFE+M+ KSVASWS+IIGAHA VEMWHECL+LL DMS EG  RAEES LV+ LS
Sbjct: 181  IEHASVVFEQMDEKSVASWSSIIGAHASVEMWHECLMLLGDMSGEGRHRAEESILVSALS 240

Query: 1003 ACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRYS 1182
            ACTHLGSPN G+CIHGILLRNISELNV VKTSLIDMYVK GSLEKGL VFQ+MA+KNRYS
Sbjct: 241  ACTHLGSPNFGRCIHGILLRNISELNVAVKTSLIDMYVKSGSLEKGLCVFQNMAQKNRYS 300

Query: 1183 YTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNMQ 1362
            YTV+I+GLAIHG GR+AL +F++ML EGLAPDDVVYVGVLSACSHAGLVNEGLQCF  +Q
Sbjct: 301  YTVIITGLAIHGRGREALSVFSDMLEEGLAPDDVVYVGVLSACSHAGLVNEGLQCFNRLQ 360

Query: 1363 FEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEIG 1542
            FEHKIKPTIQHYGCMVDLMGRAGM+K AYDLIKSM IKPNDVVWRSLLSACKVH +LEIG
Sbjct: 361  FEHKIKPTIQHYGCMVDLMGRAGMLKGAYDLIKSMPIKPNDVVWRSLLSACKVHHNLEIG 420

Query: 1543 EIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADK 1674
            EIAA+N+F LN +NPGDYLVLANMYA+A+KW DVA+IR +MA+K
Sbjct: 421  EIAAENIFKLNQHNPGDYLVLANMYARAKKWADVARIRTEMAEK 464


>ref|XP_014521259.1| pentatricopeptide repeat-containing protein At1g31920 [Vigna radiata
            var. radiata]
 ref|XP_014521260.1| pentatricopeptide repeat-containing protein At1g31920 [Vigna radiata
            var. radiata]
          Length = 605

 Score =  770 bits (1989), Expect = 0.0
 Identities = 373/464 (80%), Positives = 418/464 (90%)
 Frame = +1

Query: 283  SGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLFW 462
            SGTSVL Q+H L LPNNPP +SEL+  FNE+GW  LLK+CKSMEEFK VHAQ+LKLGLFW
Sbjct: 2    SGTSVLCQSHLLSLPNNPPLNSELNAKFNEQGWLSLLKRCKSMEEFKHVHAQILKLGLFW 61

Query: 463  DSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYVE 642
            DSFC SNLVATCAL++WGSMEYACSIFRQIEEPGSFEYNTMIRG+VN+M L++ALLLYVE
Sbjct: 62   DSFCGSNLVATCALSRWGSMEYACSIFRQIEEPGSFEYNTMIRGHVNNMNLEKALLLYVE 121

Query: 643  MLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWGA 822
            MLE+GIE +NFTYPFVLKACSLLG LKE +QIHGQVFKAGLE D +VQN LISMYGK G 
Sbjct: 122  MLEKGIEHDNFTYPFVLKACSLLGALKEGVQIHGQVFKAGLEDDTYVQNGLISMYGKCGE 181

Query: 823  IKHACNVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVLS 1002
            IKHAC+VFE+M+ +SVASWS+IIGAHA VE+W +CL+LL DMS+EG  R EES LV+ LS
Sbjct: 182  IKHACDVFEQMDERSVASWSSIIGAHASVELWQDCLMLLGDMSNEGRHRPEESILVSALS 241

Query: 1003 ACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRYS 1182
            ACTHLGSP++G+CIHGILLRNISELNVVVKTSLIDMYVKCG+LEKGL VFQ+MA KNRYS
Sbjct: 242  ACTHLGSPDIGRCIHGILLRNISELNVVVKTSLIDMYVKCGNLEKGLCVFQNMAVKNRYS 301

Query: 1183 YTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNMQ 1362
            YTVMISGLA HG GR+AL +F EM+ EGLAPDDVVYVGVLSACSHAGLVNEGLQCF  MQ
Sbjct: 302  YTVMISGLAFHGRGREALRVFCEMMEEGLAPDDVVYVGVLSACSHAGLVNEGLQCFNRMQ 361

Query: 1363 FEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEIG 1542
              HKIKPTIQHYGCMVDLMGRAGM+ EAY+LIK M IKPNDVVWRSLLSACKVHL+LEIG
Sbjct: 362  LVHKIKPTIQHYGCMVDLMGRAGMLMEAYELIKGMQIKPNDVVWRSLLSACKVHLNLEIG 421

Query: 1543 EIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADK 1674
            EIAA+N+F LN +NPGDYLVLA+MYA+AQKW DVA+IR +MA+K
Sbjct: 422  EIAAENIFKLNPHNPGDYLVLASMYARAQKWTDVARIRTEMAEK 465


>gb|KHN40422.1| Pentatricopeptide repeat-containing protein [Glycine soja]
          Length = 605

 Score =  769 bits (1986), Expect = 0.0
 Identities = 379/464 (81%), Positives = 417/464 (89%)
 Frame = +1

Query: 283  SGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLFW 462
            SGTSVL Q+H L LPN+P QSSEL+  FNE+GW  LLK+CKSMEEFKQVHA +LKLGLF+
Sbjct: 2    SGTSVLCQSHLLSLPNSPLQSSELNAKFNEQGWLSLLKRCKSMEEFKQVHAHILKLGLFY 61

Query: 463  DSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYVE 642
            DSFC SNLVA+CAL++WGSMEYACSIFRQIEEPGSFEYNTMIRGNVN M L+EALLLYVE
Sbjct: 62   DSFCGSNLVASCALSRWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNSMDLEEALLLYVE 121

Query: 643  MLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWGA 822
            MLERGIEP+NFTYPFVLKACSLL  LKE +QIH  VFKAGLE DVFVQN LISMYGK GA
Sbjct: 122  MLERGIEPDNFTYPFVLKACSLLVALKEGVQIHAHVFKAGLEVDVFVQNGLISMYGKCGA 181

Query: 823  IKHACNVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVLS 1002
            I+HA  VFE+M+ KSVASWS+IIGAHA VEMWHECL+LL DMS EG  RAEES LV+ LS
Sbjct: 182  IEHAGVVFEQMDEKSVASWSSIIGAHASVEMWHECLMLLGDMSGEGRHRAEESILVSALS 241

Query: 1003 ACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRYS 1182
            ACTHLGSPNLG+CIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGL VF +MA KNRYS
Sbjct: 242  ACTHLGSPNLGRCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLCVFHNMAHKNRYS 301

Query: 1183 YTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNMQ 1362
            YTVMI+GLAIHG GR+A+ +F++ML EGL PDDVVYVGVLSACSHAGLV EGLQCF  MQ
Sbjct: 302  YTVMIAGLAIHGRGREAVRVFSDMLEEGLTPDDVVYVGVLSACSHAGLVKEGLQCFNRMQ 361

Query: 1363 FEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEIG 1542
            FEH IKPTIQHYGCMVDLMGRAGM+KEAYDLIKSM IKPNDVVWRSLLSACKVH +LEIG
Sbjct: 362  FEHMIKPTIQHYGCMVDLMGRAGMLKEAYDLIKSMPIKPNDVVWRSLLSACKVHHNLEIG 421

Query: 1543 EIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADK 1674
            EIAA+N+F LN +NPGDYLVLANMYA+A+KW +VA+IR +MA+K
Sbjct: 422  EIAAENIFRLNKHNPGDYLVLANMYARAKKWANVARIRTEMAEK 465


>ref|XP_006572948.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920-like
            [Glycine max]
 gb|KRH74309.1| hypothetical protein GLYMA_01G011700 [Glycine max]
          Length = 605

 Score =  766 bits (1978), Expect = 0.0
 Identities = 378/464 (81%), Positives = 414/464 (89%)
 Frame = +1

Query: 283  SGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLFW 462
            SGTSVL Q+H L LPN+P QSSEL+  FNE+GW  LLK+CKSMEEFK+VHA +LKLGLF+
Sbjct: 2    SGTSVLCQSHLLSLPNSPLQSSELNAKFNEQGWLSLLKRCKSMEEFKKVHAHILKLGLFY 61

Query: 463  DSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYVE 642
            DSFC SNLVA+CAL++WGSMEYACSIFRQIEEPGSFEYNTMIRGNVN M L+EALLLYVE
Sbjct: 62   DSFCGSNLVASCALSRWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNSMDLEEALLLYVE 121

Query: 643  MLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWGA 822
            MLERGIEP+NFTYPFVLKACSLL  LKE +QIH  VF AGLE DVFVQN LISMYGK GA
Sbjct: 122  MLERGIEPDNFTYPFVLKACSLLVALKEGVQIHAHVFNAGLEVDVFVQNGLISMYGKCGA 181

Query: 823  IKHACNVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVLS 1002
            I+HA  VFE+M+ KSVASWS+IIGAHA VEMWHECL+LL DMS EG  RAEES LV+ LS
Sbjct: 182  IEHAGVVFEQMDEKSVASWSSIIGAHASVEMWHECLMLLGDMSREGRHRAEESILVSALS 241

Query: 1003 ACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRYS 1182
            ACTHLGSPNLG+CIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGL VFQ+MA KNRYS
Sbjct: 242  ACTHLGSPNLGRCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLCVFQNMAHKNRYS 301

Query: 1183 YTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNMQ 1362
            YTVMI+GLAIHG GR+AL +F++ML EGL PDDVVYVGVLSACSHAGLV EG QCF  MQ
Sbjct: 302  YTVMIAGLAIHGRGREALRVFSDMLEEGLTPDDVVYVGVLSACSHAGLVKEGFQCFNRMQ 361

Query: 1363 FEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEIG 1542
            FEH IKPTIQHYGCMVDLMGRAGM+KEAYDLIKSM IKPNDVVWRSLLSACKVH +LEIG
Sbjct: 362  FEHMIKPTIQHYGCMVDLMGRAGMLKEAYDLIKSMPIKPNDVVWRSLLSACKVHHNLEIG 421

Query: 1543 EIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADK 1674
            EIAA N+F LN +NPGDYLVLANMYA+AQKW +VA+IR +M +K
Sbjct: 422  EIAADNIFKLNKHNPGDYLVLANMYARAQKWANVARIRTEMVEK 465


>ref|XP_020226365.1| pentatricopeptide repeat-containing protein At1g31920-like [Cajanus
            cajan]
          Length = 607

 Score =  764 bits (1974), Expect = 0.0
 Identities = 373/464 (80%), Positives = 416/464 (89%)
 Frame = +1

Query: 283  SGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLFW 462
            SGTSVLSQTH L LPNNPPQSSEL++ FN+KGW  LLK+CK MEEFKQVHAQ+LKLGLF 
Sbjct: 2    SGTSVLSQTHLLSLPNNPPQSSELNSKFNDKGWLSLLKRCKCMEEFKQVHAQILKLGLFL 61

Query: 463  DSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYVE 642
            DSFC SNLVATCAL+KWGSM YACSIFRQIEEPGSFEYNTMIRG+VN+M L+EAL LYVE
Sbjct: 62   DSFCGSNLVATCALSKWGSMGYACSIFRQIEEPGSFEYNTMIRGSVNNMNLEEALFLYVE 121

Query: 643  MLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWGA 822
            MLERGIEP  FTYPFV K CSLLG LKE +QIHG +FKAG +GD FVQNSLISMYGK GA
Sbjct: 122  MLERGIEPEKFTYPFVFKGCSLLGALKEGVQIHGHIFKAGFDGDTFVQNSLISMYGKCGA 181

Query: 823  IKHACNVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVLS 1002
            IKHA  VFE+M+ +SVASWSAIIGAHA VEMW ECL+LL DMSSEG  +AEES LV+ LS
Sbjct: 182  IKHAYAVFEQMDERSVASWSAIIGAHASVEMWQECLMLLGDMSSEGQHKAEESILVSALS 241

Query: 1003 ACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRYS 1182
             CTHLGSP LG+CIHGILLRNISELNVVVKTSLIDMYVKCG LEKGL VFQ+MAEKN++S
Sbjct: 242  TCTHLGSPILGRCIHGILLRNISELNVVVKTSLIDMYVKCGCLEKGLSVFQNMAEKNKFS 301

Query: 1183 YTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNMQ 1362
            YTVMI+GLAI G GR+AL +F++M+ EGLAPDDVVYVGVLSACSHAGLVNEGLQ F  M+
Sbjct: 302  YTVMIAGLAIDGRGREALRVFSDMMEEGLAPDDVVYVGVLSACSHAGLVNEGLQFFNRMR 361

Query: 1363 FEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEIG 1542
            FEHKIKPT+QHYGCMVDLMGRAGM++EAYDLIK M IKPNDVVWRSLLSACKVHL+LEIG
Sbjct: 362  FEHKIKPTVQHYGCMVDLMGRAGMLREAYDLIKRMPIKPNDVVWRSLLSACKVHLNLEIG 421

Query: 1543 EIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADK 1674
             IAA+NLF LN +NPGDYL+LANM+A+A+KW+DVA+IR +MA+K
Sbjct: 422  VIAAENLFKLNQHNPGDYLMLANMFARAKKWNDVARIRTEMAEK 465


>gb|OIV94429.1| hypothetical protein TanjilG_25491 [Lupinus angustifolius]
          Length = 567

 Score =  754 bits (1947), Expect = 0.0
 Identities = 365/463 (78%), Positives = 414/463 (89%), Gaps = 1/463 (0%)
 Frame = +1

Query: 283  SGTSVLSQTHFLPLPNN-PPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLF 459
            +GT+VL+QTHFLPLPNN PPQ SEL+  FNE+GW  +LK CKS+EE KQVHA +LKLG  
Sbjct: 2    TGTTVLTQTHFLPLPNNSPPQCSELNIKFNEQGWLSMLKGCKSLEELKQVHAHILKLGFL 61

Query: 460  WDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYV 639
             DSFC SNLVATCAL+KWGSM+YACSIFRQIEEPGSFEYNTMIRGNVN M L+EALLLY+
Sbjct: 62   LDSFCESNLVATCALSKWGSMDYACSIFRQIEEPGSFEYNTMIRGNVNYMNLEEALLLYL 121

Query: 640  EMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWG 819
            EMLERGIEP+NFTYPFVLKACSLLG + E MQIHG V K GL+GDVFVQNSLISMYGK+G
Sbjct: 122  EMLERGIEPDNFTYPFVLKACSLLGCVNEGMQIHGHVLKGGLKGDVFVQNSLISMYGKFG 181

Query: 820  AIKHACNVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVL 999
             IKHAC VFEKM+ KSVASWSAIIGAHA VEMWHECL+L  DMSSEG  RAEESTLV V+
Sbjct: 182  GIKHACAVFEKMDEKSVASWSAIIGAHASVEMWHECLMLFGDMSSEGHHRAEESTLVTVI 241

Query: 1000 SACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRY 1179
            +ACTHLGS ++G+CIHGILLRNISELNV+VKTSLI+MYVKCG LEKGL VF +M EKNR+
Sbjct: 242  TACTHLGSLDIGRCIHGILLRNISELNVIVKTSLINMYVKCGCLEKGLSVFDNMVEKNRH 301

Query: 1180 SYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNM 1359
            SYT+MISGLAIHGHG++AL +F+EML EGL PDDVVYVGVLSACSHAGLV+EGLQCF  M
Sbjct: 302  SYTIMISGLAIHGHGKEALRVFSEMLEEGLEPDDVVYVGVLSACSHAGLVDEGLQCFNRM 361

Query: 1360 QFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEI 1539
            +FEHKI+PT+QHYGC+VDLMGRA M++EAYDLIKSM IKPNDVVWRSLLSACKVH +LE+
Sbjct: 362  RFEHKIEPTVQHYGCVVDLMGRARMLREAYDLIKSMPIKPNDVVWRSLLSACKVHHNLEL 421

Query: 1540 GEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMA 1668
            GEIAA+NLF+LN  NPGDYL+LANMYA+AQ W + A++R +MA
Sbjct: 422  GEIAAQNLFMLNPYNPGDYLMLANMYARAQNWANAARVRTEMA 464


>ref|XP_019422179.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
            [Lupinus angustifolius]
          Length = 606

 Score =  754 bits (1947), Expect = 0.0
 Identities = 365/463 (78%), Positives = 414/463 (89%), Gaps = 1/463 (0%)
 Frame = +1

Query: 283  SGTSVLSQTHFLPLPNN-PPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLF 459
            +GT+VL+QTHFLPLPNN PPQ SEL+  FNE+GW  +LK CKS+EE KQVHA +LKLG  
Sbjct: 2    TGTTVLTQTHFLPLPNNSPPQCSELNIKFNEQGWLSMLKGCKSLEELKQVHAHILKLGFL 61

Query: 460  WDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYV 639
             DSFC SNLVATCAL+KWGSM+YACSIFRQIEEPGSFEYNTMIRGNVN M L+EALLLY+
Sbjct: 62   LDSFCESNLVATCALSKWGSMDYACSIFRQIEEPGSFEYNTMIRGNVNYMNLEEALLLYL 121

Query: 640  EMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWG 819
            EMLERGIEP+NFTYPFVLKACSLLG + E MQIHG V K GL+GDVFVQNSLISMYGK+G
Sbjct: 122  EMLERGIEPDNFTYPFVLKACSLLGCVNEGMQIHGHVLKGGLKGDVFVQNSLISMYGKFG 181

Query: 820  AIKHACNVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVL 999
             IKHAC VFEKM+ KSVASWSAIIGAHA VEMWHECL+L  DMSSEG  RAEESTLV V+
Sbjct: 182  GIKHACAVFEKMDEKSVASWSAIIGAHASVEMWHECLMLFGDMSSEGHHRAEESTLVTVI 241

Query: 1000 SACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRY 1179
            +ACTHLGS ++G+CIHGILLRNISELNV+VKTSLI+MYVKCG LEKGL VF +M EKNR+
Sbjct: 242  TACTHLGSLDIGRCIHGILLRNISELNVIVKTSLINMYVKCGCLEKGLSVFDNMVEKNRH 301

Query: 1180 SYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNM 1359
            SYT+MISGLAIHGHG++AL +F+EML EGL PDDVVYVGVLSACSHAGLV+EGLQCF  M
Sbjct: 302  SYTIMISGLAIHGHGKEALRVFSEMLEEGLEPDDVVYVGVLSACSHAGLVDEGLQCFNRM 361

Query: 1360 QFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEI 1539
            +FEHKI+PT+QHYGC+VDLMGRA M++EAYDLIKSM IKPNDVVWRSLLSACKVH +LE+
Sbjct: 362  RFEHKIEPTVQHYGCVVDLMGRARMLREAYDLIKSMPIKPNDVVWRSLLSACKVHHNLEL 421

Query: 1540 GEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMA 1668
            GEIAA+NLF+LN  NPGDYL+LANMYA+AQ W + A++R +MA
Sbjct: 422  GEIAAQNLFMLNPYNPGDYLMLANMYARAQNWANAARVRTEMA 464


>ref|XP_020237290.1| pentatricopeptide repeat-containing protein At1g31920-like [Cajanus
            cajan]
          Length = 626

 Score =  744 bits (1922), Expect = 0.0
 Identities = 364/464 (78%), Positives = 411/464 (88%)
 Frame = +1

Query: 283  SGTSVLSQTHFLPLPNNPPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKLGLFW 462
            SGTSVLSQTH L LPNNPPQSSEL++ FN+KGW  L K+CK MEEFKQVHAQ+LK GLF 
Sbjct: 2    SGTSVLSQTHLLSLPNNPPQSSELNSKFNDKGWLSLHKRCKCMEEFKQVHAQILKFGLFL 61

Query: 463  DSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALLLYVE 642
             SFC SNLVATCAL+KWGSMEYA SIF+QI+EPGSFEYN MIRG+VN+M L+E L LYVE
Sbjct: 62   YSFCGSNLVATCALSKWGSMEYAFSIFKQIKEPGSFEYNIMIRGSVNNMNLEETLFLYVE 121

Query: 643  MLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYGKWGA 822
            MLERGIEP+ FTYPFV KACSLLG  KE +QIHG +FK GL+GD FVQNSLI+MYGK GA
Sbjct: 122  MLERGIEPDKFTYPFVFKACSLLGAFKEGVQIHGHIFKVGLDGDAFVQNSLINMYGKCGA 181

Query: 823  IKHACNVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLVNVLS 1002
            IKHA  VFE+M  +SVASWSA+IGAHA VEMW ECL+LL DMSSEG  RAEES LV+ LS
Sbjct: 182  IKHAYAVFEQMVERSVASWSAVIGAHASVEMWQECLMLLGDMSSEGRHRAEESILVSALS 241

Query: 1003 ACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEKNRYS 1182
            ACTHLGSP LG+CIHGILLRNISELNVVVKTSLI MYVKCG LEKGL +FQ++AEKNR+S
Sbjct: 242  ACTHLGSPILGRCIHGILLRNISELNVVVKTSLIYMYVKCGCLEKGLSLFQNIAEKNRFS 301

Query: 1183 YTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCFKNMQ 1362
            YTVMI+GLAIHG GR+AL +F++M+ EGLAPDDVVYVGVLSACSHA LVNEGLQ F  M+
Sbjct: 302  YTVMIAGLAIHGRGREALRVFSDMMEEGLAPDDVVYVGVLSACSHASLVNEGLQFFNRMR 361

Query: 1363 FEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLDLEIG 1542
            FEHKIKPT+QHYGCMVDLMGRAGM++EAYDLIK M IKPNDVVWRSLLSACKVHL+LEIG
Sbjct: 362  FEHKIKPTVQHYGCMVDLMGRAGMLREAYDLIKRMPIKPNDVVWRSLLSACKVHLNLEIG 421

Query: 1543 EIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADK 1674
             IAA+NLF LN +NPGDYL+LANM+A+A+KW+DVA+IR +MA+K
Sbjct: 422  VIAAENLFKLNQHNPGDYLMLANMFARAKKWNDVARIRTEMAEK 465


>ref|XP_016201186.1| pentatricopeptide repeat-containing protein At1g31920 isoform X1
            [Arachis ipaensis]
          Length = 608

 Score =  729 bits (1881), Expect = 0.0
 Identities = 355/468 (75%), Positives = 404/468 (86%), Gaps = 1/468 (0%)
 Frame = +1

Query: 274  MTGSGTSVLSQTHFLPLPNN-PPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKL 450
            M  + T +LS+THFL L NN PP+++EL+  F E GW PLLK+CKSMEEFKQVHA +LKL
Sbjct: 1    MVCTETYILSKTHFLSLSNNIPPKNTELNVRFYEHGWLPLLKRCKSMEEFKQVHAHILKL 60

Query: 451  GLFWDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALL 630
            GLFWD FC SNLVATCAL KWGSMEYACSIFR+IEEP SFEYNTMIRGNVN M L+EAL+
Sbjct: 61   GLFWDHFCGSNLVATCALAKWGSMEYACSIFRRIEEPSSFEYNTMIRGNVNCMNLEEALI 120

Query: 631  LYVEMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYG 810
            LY+EML+ GIEP+NFTYPFV KACSLLG LKE MQI+  VFKAGLEGD+FVQNSLISMYG
Sbjct: 121  LYIEMLKEGIEPDNFTYPFVFKACSLLGALKEGMQIYSHVFKAGLEGDLFVQNSLISMYG 180

Query: 811  KWGAIKHACNVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLV 990
            K GAI+HA +VF+KM  +SVASWSA+IGAHA  EMWHECL L  DM  +G  R EESTLV
Sbjct: 181  KCGAIEHARDVFDKMSERSVASWSAVIGAHASAEMWHECLKLFNDMMHDGRYRPEESTLV 240

Query: 991  NVLSACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEK 1170
            +VLSACTHLGSPNLG  +HGILLRN +ELNV+VKTSLIDMY KCG +EKGL VF SMAEK
Sbjct: 241  SVLSACTHLGSPNLGSSVHGILLRNTTELNVIVKTSLIDMYAKCGCIEKGLCVFHSMAEK 300

Query: 1171 NRYSYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCF 1350
            N++SYTVMISGLA+HG G +AL IF EML +GLAPDDVVYVGVLSAC+HAGLVNEGL+ F
Sbjct: 301  NKHSYTVMISGLAVHGRGSEALRIFAEMLEQGLAPDDVVYVGVLSACTHAGLVNEGLEFF 360

Query: 1351 KNMQFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLD 1530
              MQFEHKI PTIQHYGCMVDLMGRAGM++EAY+LIKSM  KPNDV+WRSLL+ACKV  +
Sbjct: 361  NRMQFEHKIDPTIQHYGCMVDLMGRAGMLREAYELIKSMPRKPNDVLWRSLLNACKVQHN 420

Query: 1531 LEIGEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADK 1674
            LE+GEIAA+N+F+LN +NPGDYLVLANMYA+AQKW D+AKIR +M  K
Sbjct: 421  LELGEIAAENIFMLNPHNPGDYLVLANMYARAQKWVDMAKIRTEMVRK 468


>ref|XP_015963236.1| pentatricopeptide repeat-containing protein At1g31920 isoform X1
            [Arachis duranensis]
          Length = 608

 Score =  728 bits (1878), Expect = 0.0
 Identities = 355/468 (75%), Positives = 406/468 (86%), Gaps = 1/468 (0%)
 Frame = +1

Query: 274  MTGSGTSVLSQTHFLPLPNN-PPQSSELSTTFNEKGWFPLLKKCKSMEEFKQVHAQVLKL 450
            M  + T +LS+T FL L NN PP++++L+  F E GW PLLK+CKSMEEFKQVHA +LKL
Sbjct: 1    MVCTETYILSRTQFLSLSNNIPPKNTDLNVRFYEHGWLPLLKRCKSMEEFKQVHAHILKL 60

Query: 451  GLFWDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMIRGNVNDMKLKEALL 630
            GLFWD FC SNLVATCAL KWGSMEYACSIFR+IEEP SFEYNTMIRGNVN M L+EAL+
Sbjct: 61   GLFWDHFCGSNLVATCALAKWGSMEYACSIFRRIEEPSSFEYNTMIRGNVNCMNLEEALI 120

Query: 631  LYVEMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLEGDVFVQNSLISMYG 810
            LYVEML++GIEP+NFTYPFV KACSLLG LKE MQI+  VFKAGLEGD+F+QNSLISMYG
Sbjct: 121  LYVEMLKKGIEPDNFTYPFVFKACSLLGALKEGMQIYSHVFKAGLEGDLFLQNSLISMYG 180

Query: 811  KWGAIKHACNVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADMSSEGSCRAEESTLV 990
            K GAI+HA +VF+KM  +SVASWSAIIGAHA  EMWHECL L  DM  +G  R EESTLV
Sbjct: 181  KCGAIEHARDVFDKMSERSVASWSAIIGAHASAEMWHECLKLFNDMMHDGRYRPEESTLV 240

Query: 991  NVLSACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGSLEKGLFVFQSMAEK 1170
            +VLSACTHLGSPNLG  +HGILLRN +ELNV+VKTSLIDMY KCG +EKGL VF SMAEK
Sbjct: 241  SVLSACTHLGSPNLGSSVHGILLRNTTELNVIVKTSLIDMYAKCGCIEKGLCVFHSMAEK 300

Query: 1171 NRYSYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSACSHAGLVNEGLQCF 1350
            N++SYTVMISGLA+HG G +AL IF EML +GLAPDDVVYVGVLSAC+HAGLVNEGL+ F
Sbjct: 301  NKHSYTVMISGLAVHGCGSEALRIFAEMLEQGLAPDDVVYVGVLSACTHAGLVNEGLEFF 360

Query: 1351 KNMQFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDVVWRSLLSACKVHLD 1530
              MQFEHKI+PTIQHYGCMVDLMGRAGM++EAY+LIKSM  KPNDV+WRSLL+ACKVH +
Sbjct: 361  NRMQFEHKIEPTIQHYGCMVDLMGRAGMLREAYELIKSMPRKPNDVLWRSLLNACKVHHN 420

Query: 1531 LEIGEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMADK 1674
            LE+GEIAA+N+F+LN +NPGDYLVLANMYA+AQKW D+AKIR +M  K
Sbjct: 421  LELGEIAAENIFMLNPHNPGDYLVLANMYARAQKWVDMAKIRTEMVRK 468


>gb|KRH39618.1| hypothetical protein GLYMA_09G209700 [Glycine max]
          Length = 562

 Score =  718 bits (1853), Expect = 0.0
 Identities = 352/422 (83%), Positives = 386/422 (91%)
 Frame = +1

Query: 409  MEEFKQVHAQVLKLGLFWDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMI 588
            MEEFKQVHA +LKLGLF+DSFC SNLVATCAL++WGSMEYACSIFRQIEEPGSFEYNTMI
Sbjct: 1    MEEFKQVHAHILKLGLFYDSFCGSNLVATCALSRWGSMEYACSIFRQIEEPGSFEYNTMI 60

Query: 589  RGNVNDMKLKEALLLYVEMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLE 768
            RGNVN M L+EALLLYVEMLERGIEP+NFTYPFVLKACSLLG LKE +QIH  VFKAGLE
Sbjct: 61   RGNVNSMNLEEALLLYVEMLERGIEPDNFTYPFVLKACSLLGALKEGVQIHAHVFKAGLE 120

Query: 769  GDVFVQNSLISMYGKWGAIKHACNVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADM 948
            GDVFVQN LI+MYGK GAI+HA  VFE+M+ KSVASWS+IIGAHA VEMWHECL+LL DM
Sbjct: 121  GDVFVQNGLINMYGKCGAIEHASVVFEQMDEKSVASWSSIIGAHASVEMWHECLMLLGDM 180

Query: 949  SSEGSCRAEESTLVNVLSACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGS 1128
            S EG  RAEES LV+ LSACTHLGSPN G+CIHGILLRNISELNV VKTSLIDMYVK GS
Sbjct: 181  SGEGRHRAEESILVSALSACTHLGSPNFGRCIHGILLRNISELNVAVKTSLIDMYVKSGS 240

Query: 1129 LEKGLFVFQSMAEKNRYSYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSA 1308
            LEKGL VFQ+MA+KNRYSYTV+I+GLAIHG GR+AL +F++ML EGLAPDDVVYVGVLSA
Sbjct: 241  LEKGLCVFQNMAQKNRYSYTVIITGLAIHGRGREALSVFSDMLEEGLAPDDVVYVGVLSA 300

Query: 1309 CSHAGLVNEGLQCFKNMQFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDV 1488
            CSHAGLVNEGLQCF  +QFEHKIKPTIQHYGCMVDLMGRAGM+K AYDLIKSM IKPNDV
Sbjct: 301  CSHAGLVNEGLQCFNRLQFEHKIKPTIQHYGCMVDLMGRAGMLKGAYDLIKSMPIKPNDV 360

Query: 1489 VWRSLLSACKVHLDLEIGEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMA 1668
            VWRSLLSACKVH +LEIGEIAA+N+F LN +NPGDYLVLANMYA+A+KW DVA+IR +MA
Sbjct: 361  VWRSLLSACKVHHNLEIGEIAAENIFKLNQHNPGDYLVLANMYARAKKWADVARIRTEMA 420

Query: 1669 DK 1674
            +K
Sbjct: 421  EK 422


>gb|KHN40416.1| Pentatricopeptide repeat-containing protein [Glycine soja]
          Length = 562

 Score =  712 bits (1837), Expect = 0.0
 Identities = 351/422 (83%), Positives = 384/422 (90%)
 Frame = +1

Query: 409  MEEFKQVHAQVLKLGLFWDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMI 588
            MEEFKQVHA +LKLGLF+DSFC SNLVA+CAL++WGSMEYACSIFRQIEEPGSFEYNTMI
Sbjct: 1    MEEFKQVHAHILKLGLFYDSFCGSNLVASCALSRWGSMEYACSIFRQIEEPGSFEYNTMI 60

Query: 589  RGNVNDMKLKEALLLYVEMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLE 768
            RGNVN M L+EALLLYVEMLERGIEP+NFTYPFVLKACSLL  LKE +QIH  VFKAGLE
Sbjct: 61   RGNVNSMDLEEALLLYVEMLERGIEPDNFTYPFVLKACSLLVALKEGVQIHAHVFKAGLE 120

Query: 769  GDVFVQNSLISMYGKWGAIKHACNVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADM 948
             DVFVQN LISMYGK GAI+HA  VFE+M+ KSVASWS+IIGAHA VEMWHECL+LL DM
Sbjct: 121  VDVFVQNGLISMYGKCGAIEHAGVVFEQMDEKSVASWSSIIGAHASVEMWHECLMLLGDM 180

Query: 949  SSEGSCRAEESTLVNVLSACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGS 1128
            S EG  RAEES LV+ LSACTHLGSPNLG+CIHGILLRNISELNVVVKTSLIDMYVKCGS
Sbjct: 181  SGEGRHRAEESILVSALSACTHLGSPNLGRCIHGILLRNISELNVVVKTSLIDMYVKCGS 240

Query: 1129 LEKGLFVFQSMAEKNRYSYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSA 1308
            LEKGL VFQ+MA KNRYSYTVMI+GLAIHG GR+A+ +F++ML EGL PDDVVYVGVLSA
Sbjct: 241  LEKGLCVFQNMAHKNRYSYTVMIAGLAIHGRGREAVRVFSDMLEEGLTPDDVVYVGVLSA 300

Query: 1309 CSHAGLVNEGLQCFKNMQFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDV 1488
            CSHAGLV EGLQCF  MQFEH IKPTIQHYGCMVDLMGRAGM+KEAYDLIKSM IKPNDV
Sbjct: 301  CSHAGLVKEGLQCFNRMQFEHMIKPTIQHYGCMVDLMGRAGMLKEAYDLIKSMPIKPNDV 360

Query: 1489 VWRSLLSACKVHLDLEIGEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMA 1668
            VWRSLLSACKVH +LEIGEIAA+N+F LN +NPGDYLVLANMYA+A+KW +VA+IR +MA
Sbjct: 361  VWRSLLSACKVHHNLEIGEIAAENIFRLNKHNPGDYLVLANMYARAKKWANVARIRTEMA 420

Query: 1669 DK 1674
            +K
Sbjct: 421  EK 422


>gb|PNY00351.1| pentatricopeptide repeat-containing protein at1g31920-like protein
            [Trifolium pratense]
          Length = 562

 Score =  706 bits (1821), Expect = 0.0
 Identities = 339/422 (80%), Positives = 384/422 (90%)
 Frame = +1

Query: 409  MEEFKQVHAQVLKLGLFWDSFCCSNLVATCALTKWGSMEYACSIFRQIEEPGSFEYNTMI 588
            MEEFKQVHA VLK G+F+DSFC SNLVATCALTKWGSM+YACSIF QI+EP SF+YNTMI
Sbjct: 1    MEEFKQVHAHVLKWGIFFDSFCMSNLVATCALTKWGSMDYACSIFNQIDEPSSFDYNTMI 60

Query: 589  RGNVNDMKLKEALLLYVEMLERGIEPNNFTYPFVLKACSLLGELKERMQIHGQVFKAGLE 768
            RGNV++MKL+EALLLYVEM+E G+EP+ FTYPFVLKACSLLG   E +Q+HG VFK G E
Sbjct: 61   RGNVSEMKLEEALLLYVEMIEEGVEPDKFTYPFVLKACSLLGACDEGIQVHGHVFKMGFE 120

Query: 769  GDVFVQNSLISMYGKWGAIKHACNVFEKMEGKSVASWSAIIGAHACVEMWHECLVLLADM 948
            GDVFVQNSL++MYGK GAI  A +VF+K+  KSVASWSAIIGA+ACVEMWHECL+LL +M
Sbjct: 121  GDVFVQNSLVNMYGKCGAINCARDVFDKIGEKSVASWSAIIGAYACVEMWHECLMLLGEM 180

Query: 949  SSEGSCRAEESTLVNVLSACTHLGSPNLGKCIHGILLRNISELNVVVKTSLIDMYVKCGS 1128
            S EG CR EESTLVNVLSACTHLGSPNLGKCIHGILLRNISELNVVV+TSLIDMYVKCG 
Sbjct: 181  SIEGRCRVEESTLVNVLSACTHLGSPNLGKCIHGILLRNISELNVVVETSLIDMYVKCGC 240

Query: 1129 LEKGLFVFQSMAEKNRYSYTVMISGLAIHGHGRKALEIFTEMLAEGLAPDDVVYVGVLSA 1308
            LEKGL VF++M+EKN +SYTVMISGLAIHGHG++AL++FT+M+ EG  PD VV+VGVLSA
Sbjct: 241  LEKGLRVFENMSEKNIFSYTVMISGLAIHGHGKEALKVFTQMVEEGFEPDHVVFVGVLSA 300

Query: 1309 CSHAGLVNEGLQCFKNMQFEHKIKPTIQHYGCMVDLMGRAGMIKEAYDLIKSMSIKPNDV 1488
            CSHAGLV+EGLQCFK MQFE KI PT+QHYGCMVDL+GR GM+KEAY+LIKSMSIKPNDV
Sbjct: 301  CSHAGLVDEGLQCFKTMQFEKKINPTVQHYGCMVDLLGRIGMLKEAYELIKSMSIKPNDV 360

Query: 1489 VWRSLLSACKVHLDLEIGEIAAKNLFLLNSNNPGDYLVLANMYAKAQKWDDVAKIRRKMA 1668
            +WRSLLSACKVHL+LEI EIAA+NLF+LN +NPGDYLVLANMYAK QKWDDVAKIR+KMA
Sbjct: 361  IWRSLLSACKVHLNLEIAEIAAENLFVLNQSNPGDYLVLANMYAKVQKWDDVAKIRKKMA 420

Query: 1669 DK 1674
            +K
Sbjct: 421  EK 422


Top