BLASTX nr result

ID: Astragalus24_contig00019227 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00019227
         (747 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004504624.1| PREDICTED: putative pentatricopeptide repeat...   296   3e-92
ref|XP_006585050.1| PREDICTED: putative pentatricopeptide repeat...   266   7e-81
gb|KHN28906.1| Putative pentatricopeptide repeat-containing prot...   266   4e-80
gb|KYP61290.1| Putative pentatricopeptide repeat-containing prot...   259   3e-79
ref|XP_020222219.1| putative pentatricopeptide repeat-containing...   259   3e-78
ref|XP_014505571.1| putative pentatricopeptide repeat-containing...   255   2e-76
ref|XP_007158843.1| hypothetical protein PHAVU_002G186700g [Phas...   248   1e-73
ref|XP_017405632.1| PREDICTED: putative pentatricopeptide repeat...   244   2e-72
gb|KRH58559.1| hypothetical protein GLYMA_05G135600 [Glycine max]     232   3e-69
ref|XP_016191251.1| putative pentatricopeptide repeat-containing...   229   8e-67
ref|XP_015957940.1| putative pentatricopeptide repeat-containing...   228   3e-66
ref|XP_019464819.1| PREDICTED: putative pentatricopeptide repeat...   226   2e-65
ref|XP_019459942.1| PREDICTED: putative pentatricopeptide repeat...   213   8e-61
dbj|GAU13767.1| hypothetical protein TSUD_82770 [Trifolium subte...   201   2e-58
ref|XP_006585057.1| PREDICTED: putative pentatricopeptide repeat...   197   6e-55
dbj|GAU13768.1| hypothetical protein TSUD_82760 [Trifolium subte...   195   2e-54
ref|XP_020409876.1| putative pentatricopeptide repeat-containing...   195   9e-54
ref|XP_008389961.1| PREDICTED: putative pentatricopeptide repeat...   194   1e-53
ref|XP_021811422.1| putative pentatricopeptide repeat-containing...   193   4e-53
ref|XP_008221377.2| PREDICTED: putative pentatricopeptide repeat...   193   4e-53

>ref|XP_004504624.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 [Cicer arietinum]
 ref|XP_004504625.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 [Cicer arietinum]
 ref|XP_004504626.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 [Cicer arietinum]
          Length = 745

 Score =  296 bits (758), Expect = 3e-92
 Identities = 150/215 (69%), Positives = 164/215 (76%), Gaps = 4/215 (1%)
 Frame = +2

Query: 113 MFFLRNFMHTNGRRVSSFHSPFPRNXXXXXXXXXXXXXQNSIFVSPIIWFTSFFYVIRYP 292
           + F RN  HT G RVSSF+SPF  N             QN IF  P+I FTSFF VIRYP
Sbjct: 2   LIFARNLFHTTGSRVSSFYSPFHSNPFPLFFTLSSQSSQNPIFGFPMILFTSFFCVIRYP 61

Query: 293 FVSKPPFDNTVSDSVRSFSQRDGPHMLDSALSPTRVSEFLVNLKGDPKSALKFFKSAGAG 472
           F SK  FD+TVS+SVRSF+QRD PHM DSAL+P  VS+ LVNLKGDPKSALKFF SAG  
Sbjct: 62  FSSKSSFDDTVSESVRSFAQRDDPHMFDSALAPIWVSKVLVNLKGDPKSALKFFHSAGNQ 121

Query: 473 VGICHTNESYCILIHILFFGRFYFDAKNVIKEWILLRK----CDGFDMLWFTRDVCQAGF 640
           VG  HT ESYCIL+HILF G FYFDAKNVIKEWILLR+    CD FDMLW TR+VC+ GF
Sbjct: 122 VGFSHTAESYCILVHILFCGMFYFDAKNVIKEWILLRREIPGCDWFDMLWLTRNVCRTGF 181

Query: 641 GVFDALFGVLVELGMLEEAMQCFWKMKKFGVLPKV 745
           GVFDALFGVLVELGML+E+ QCFWKMKKF VLPKV
Sbjct: 182 GVFDALFGVLVELGMLDESRQCFWKMKKFRVLPKV 216


>ref|XP_006585050.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 isoform X1 [Glycine max]
 ref|XP_006585051.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 isoform X1 [Glycine max]
 ref|XP_006585052.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 isoform X1 [Glycine max]
 ref|XP_006585054.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 isoform X1 [Glycine max]
 ref|XP_006585055.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 isoform X1 [Glycine max]
 ref|XP_014634267.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 isoform X1 [Glycine max]
 ref|XP_014634268.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 isoform X1 [Glycine max]
 ref|XP_014634269.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 isoform X1 [Glycine max]
 ref|XP_014634270.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 isoform X1 [Glycine max]
 ref|XP_014634271.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 isoform X1 [Glycine max]
 gb|KRH42454.1| hypothetical protein GLYMA_08G090700 [Glycine max]
 gb|KRH42455.1| hypothetical protein GLYMA_08G090700 [Glycine max]
 gb|KRH42456.1| hypothetical protein GLYMA_08G090700 [Glycine max]
 gb|KRH42457.1| hypothetical protein GLYMA_08G090700 [Glycine max]
 gb|KRH42458.1| hypothetical protein GLYMA_08G090700 [Glycine max]
          Length = 751

 Score =  266 bits (681), Expect = 7e-81
 Identities = 136/215 (63%), Positives = 153/215 (71%), Gaps = 4/215 (1%)
 Frame = +2

Query: 113 MFFLRNFMHTNGRRVSSFHSPFPRNXXXXXXXXXXXXXQNSIFVSPIIWFTSFFYVIRYP 292
           + F RN       RVSSFHS   +N             QNSIF  P+IWFTSF  VIRYP
Sbjct: 2   LLFARNIGGRASLRVSSFHSSPLQNPFPLFLTPSSLSSQNSIFARPVIWFTSFLCVIRYP 61

Query: 293 FVSKPPFDNTVSDSVRSFSQRDGPHMLDSALSPTRVSEFLVNLKGDPKSALKFFKSAGAG 472
           FVSKP FD+  S+S+RSF Q+DGPH+ DSAL+P  VS+ LV LKGDPKSALKFFK AGA 
Sbjct: 62  FVSKPSFDDIASESMRSFLQQDGPHLSDSALAPIWVSKALVKLKGDPKSALKFFKEAGAR 121

Query: 473 VGICHTNESYCILIHILFFGRFYFDAKNVIKEWILLRK----CDGFDMLWFTRDVCQAGF 640
            G  H  ESYC+L HILF G FY DA++VIKEWILL +    CD FDMLW TR+VC+ GF
Sbjct: 122 AGFRHAAESYCVLAHILFCGMFYLDARSVIKEWILLGREFPGCDFFDMLWSTRNVCRPGF 181

Query: 641 GVFDALFGVLVELGMLEEAMQCFWKMKKFGVLPKV 745
           GVFD LF VLV+LGMLEEA QCFWKM KF VLPKV
Sbjct: 182 GVFDTLFNVLVDLGMLEEARQCFWKMNKFRVLPKV 216


>gb|KHN28906.1| Putative pentatricopeptide repeat-containing protein [Glycine soja]
          Length = 852

 Score =  266 bits (681), Expect = 4e-80
 Identities = 136/215 (63%), Positives = 153/215 (71%), Gaps = 4/215 (1%)
 Frame = +2

Query: 113 MFFLRNFMHTNGRRVSSFHSPFPRNXXXXXXXXXXXXXQNSIFVSPIIWFTSFFYVIRYP 292
           + F RN       RVSSFHS   +N             QNSIF  P+IWFTSF  VIRYP
Sbjct: 2   LLFARNIGGRASLRVSSFHSSPLQNPFPLFLTPSSLSSQNSIFARPVIWFTSFLCVIRYP 61

Query: 293 FVSKPPFDNTVSDSVRSFSQRDGPHMLDSALSPTRVSEFLVNLKGDPKSALKFFKSAGAG 472
           FVSKP FD+  S+S+RSF Q+DGPH+ DSAL+P  VS+ LV LKGDPKSALKFFK AGA 
Sbjct: 62  FVSKPSFDDIASESMRSFLQQDGPHLSDSALAPIWVSKALVKLKGDPKSALKFFKEAGAR 121

Query: 473 VGICHTNESYCILIHILFFGRFYFDAKNVIKEWILLRK----CDGFDMLWFTRDVCQAGF 640
            G  H  ESYC+L HILF G FY DA++VIKEWILL +    CD FDMLW TR+VC+ GF
Sbjct: 122 AGFRHAAESYCVLAHILFCGMFYLDARSVIKEWILLGREFPGCDFFDMLWSTRNVCRPGF 181

Query: 641 GVFDALFGVLVELGMLEEAMQCFWKMKKFGVLPKV 745
           GVFD LF VLV+LGMLEEA QCFWKM KF VLPKV
Sbjct: 182 GVFDTLFNVLVDLGMLEEARQCFWKMNKFRVLPKV 216


>gb|KYP61290.1| Putative pentatricopeptide repeat-containing protein At2g02150
           family [Cajanus cajan]
          Length = 637

 Score =  259 bits (663), Expect = 3e-79
 Identities = 134/215 (62%), Positives = 153/215 (71%), Gaps = 4/215 (1%)
 Frame = +2

Query: 113 MFFLRNFMHTNGRRVSSFHSPFPRNXXXXXXXXXXXXXQNSIFVSPIIWFTSFFYVIRYP 292
           + F RN       RVSSF+S   +N             QNSIF  P+IWFTSF  VIRYP
Sbjct: 2   LLFARNIGGRASPRVSSFYSSPLQNPFPLFFTPSFPSSQNSIFARPMIWFTSFLCVIRYP 61

Query: 293 FVSKPPFDNTVSDSVRSFSQRDGPHMLDSALSPTRVSEFLVNLKGDPKSALKFFKSAGAG 472
           FVSKP FD+  S+S+RS  Q+DGPHM +SAL+P  VS+ LV LKGDPKSALKFFK AGA 
Sbjct: 62  FVSKPSFDDIASESMRSALQKDGPHMFESALAPIWVSKALVKLKGDPKSALKFFKEAGAR 121

Query: 473 VGICHTNESYCILIHILFFGRFYFDAKNVIKEWILLRK----CDGFDMLWFTRDVCQAGF 640
            G  HT ESYCIL HILF G FY DA++VI+EWILL +    CD FD+LW TR+VC+ GF
Sbjct: 122 PGFRHTAESYCILAHILFCGVFYLDARSVIREWILLGREIPGCDFFDLLWSTRNVCRPGF 181

Query: 641 GVFDALFGVLVELGMLEEAMQCFWKMKKFGVLPKV 745
           GVFD LF VLVELGMLEEA QCFWKM +F VLPKV
Sbjct: 182 GVFDTLFSVLVELGMLEEARQCFWKMNRFRVLPKV 216


>ref|XP_020222219.1| putative pentatricopeptide repeat-containing protein At2g02150
           [Cajanus cajan]
 ref|XP_020222220.1| putative pentatricopeptide repeat-containing protein At2g02150
           [Cajanus cajan]
 ref|XP_020222222.1| putative pentatricopeptide repeat-containing protein At2g02150
           [Cajanus cajan]
 ref|XP_020222223.1| putative pentatricopeptide repeat-containing protein At2g02150
           [Cajanus cajan]
          Length = 751

 Score =  259 bits (663), Expect = 3e-78
 Identities = 134/215 (62%), Positives = 153/215 (71%), Gaps = 4/215 (1%)
 Frame = +2

Query: 113 MFFLRNFMHTNGRRVSSFHSPFPRNXXXXXXXXXXXXXQNSIFVSPIIWFTSFFYVIRYP 292
           + F RN       RVSSF+S   +N             QNSIF  P+IWFTSF  VIRYP
Sbjct: 2   LLFARNIGGRASPRVSSFYSSPLQNPFPLFFTPSFPSSQNSIFARPMIWFTSFLCVIRYP 61

Query: 293 FVSKPPFDNTVSDSVRSFSQRDGPHMLDSALSPTRVSEFLVNLKGDPKSALKFFKSAGAG 472
           FVSKP FD+  S+S+RS  Q+DGPHM +SAL+P  VS+ LV LKGDPKSALKFFK AGA 
Sbjct: 62  FVSKPSFDDIASESMRSALQKDGPHMFESALAPIWVSKALVKLKGDPKSALKFFKEAGAR 121

Query: 473 VGICHTNESYCILIHILFFGRFYFDAKNVIKEWILLRK----CDGFDMLWFTRDVCQAGF 640
            G  HT ESYCIL HILF G FY DA++VI+EWILL +    CD FD+LW TR+VC+ GF
Sbjct: 122 PGFRHTAESYCILAHILFCGVFYLDARSVIREWILLGREIPGCDFFDLLWSTRNVCRPGF 181

Query: 641 GVFDALFGVLVELGMLEEAMQCFWKMKKFGVLPKV 745
           GVFD LF VLVELGMLEEA QCFWKM +F VLPKV
Sbjct: 182 GVFDTLFSVLVELGMLEEARQCFWKMNRFRVLPKV 216


>ref|XP_014505571.1| putative pentatricopeptide repeat-containing protein At2g02150
           isoform X1 [Vigna radiata var. radiata]
 ref|XP_014505572.1| putative pentatricopeptide repeat-containing protein At2g02150
           isoform X1 [Vigna radiata var. radiata]
 ref|XP_014505573.1| putative pentatricopeptide repeat-containing protein At2g02150
           isoform X1 [Vigna radiata var. radiata]
 ref|XP_014505574.1| putative pentatricopeptide repeat-containing protein At2g02150
           isoform X1 [Vigna radiata var. radiata]
 ref|XP_014505575.1| putative pentatricopeptide repeat-containing protein At2g02150
           isoform X1 [Vigna radiata var. radiata]
 ref|XP_022638986.1| putative pentatricopeptide repeat-containing protein At2g02150
           isoform X1 [Vigna radiata var. radiata]
          Length = 777

 Score =  255 bits (652), Expect = 2e-76
 Identities = 137/235 (58%), Positives = 158/235 (67%), Gaps = 5/235 (2%)
 Frame = +2

Query: 56  QHSLSI*RIKIQLKLF*PNMF-FLRNFMHTNGRRVSSFHSPFPRNXXXXXXXXXXXXXQN 232
           QH   I R+KIQ  LF  NM  F R+       RVSS +S   +N             +N
Sbjct: 8   QHQSPIRRVKIQFTLFSLNMLLFARSISGKASLRVSSSYSSSLQNPFTLFSSPSFPSSKN 67

Query: 233 SIFVSPIIWFTSFFYVIRYPFVSKPPFDNTVSDSVRSFSQRDGPHMLDSALSPTRVSEFL 412
           SIF  P+IWFTSF  V+R PFVSK  FD+  S+S+RSF Q+DGPHM DSA +P  VS  L
Sbjct: 68  SIFARPMIWFTSFLCVMRCPFVSKSSFDDIASESMRSFLQQDGPHMFDSAPAPIWVSMVL 127

Query: 413 VNLKGDPKSALKFFKSAGAGVGICHTNESYCILIHILFFGRFYFDAKNVIKEWILLRK-- 586
             LKGDPKSALKFFK AGA  G  H  ESYC+L HILF G+FY DA+NVI+EWILL +  
Sbjct: 128 EKLKGDPKSALKFFKEAGARTGFRHAAESYCVLAHILFCGKFYLDARNVIREWILLGREF 187

Query: 587 --CDGFDMLWFTRDVCQAGFGVFDALFGVLVELGMLEEAMQCFWKMKKFGVLPKV 745
              D FDMLW TR+VC+ GFGVFD LF VLV+LGML+EA QCFWKM KF VLPKV
Sbjct: 188 PGVDFFDMLWSTRNVCRPGFGVFDTLFSVLVDLGMLDEARQCFWKMNKFMVLPKV 242


>ref|XP_007158843.1| hypothetical protein PHAVU_002G186700g [Phaseolus vulgaris]
 gb|ESW30837.1| hypothetical protein PHAVU_002G186700g [Phaseolus vulgaris]
          Length = 751

 Score =  248 bits (632), Expect = 1e-73
 Identities = 129/215 (60%), Positives = 147/215 (68%), Gaps = 4/215 (1%)
 Frame = +2

Query: 113 MFFLRNFMHTNGRRVSSFHSPFPRNXXXXXXXXXXXXXQNSIFVSPIIWFTSFFYVIRYP 292
           + F RN       RVSS  S   +N             +NSIF  P+IWFTSF  V+RYP
Sbjct: 2   LLFARNIRGMASLRVSSSCSSSLQNPFPLFFTPSFLSSKNSIFARPMIWFTSFLCVMRYP 61

Query: 293 FVSKPPFDNTVSDSVRSFSQRDGPHMLDSALSPTRVSEFLVNLKGDPKSALKFFKSAGAG 472
           FVSK   D+  S+S+RSF  +DGPHM DSAL+P  VS  LV LKGDPKSALKFFK AGA 
Sbjct: 62  FVSKSSSDDIASESMRSFLLQDGPHMFDSALAPVWVSVALVKLKGDPKSALKFFKEAGAR 121

Query: 473 VGICHTNESYCILIHILFFGRFYFDAKNVIKEWILLRK----CDGFDMLWFTRDVCQAGF 640
            G  H  ESYC+L HILF GRFY DA+NVI+EWILL +     D FDMLW TR+VC+ GF
Sbjct: 122 AGFRHAAESYCVLAHILFCGRFYLDARNVIREWILLGREFPGVDFFDMLWSTRNVCRPGF 181

Query: 641 GVFDALFGVLVELGMLEEAMQCFWKMKKFGVLPKV 745
           GVFD LF VLV+LGML+EA QCFWKM KF VLPKV
Sbjct: 182 GVFDTLFSVLVDLGMLDEARQCFWKMNKFRVLPKV 216


>ref|XP_017405632.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 isoform X1 [Vigna angularis]
 ref|XP_017405633.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 isoform X1 [Vigna angularis]
 ref|XP_017405634.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 isoform X1 [Vigna angularis]
 ref|XP_017405635.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 isoform X1 [Vigna angularis]
 ref|XP_017405636.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 isoform X1 [Vigna angularis]
 gb|KOM25524.1| hypothetical protein LR48_Vigan107s004300 [Vigna angularis]
 dbj|BAT74209.1| hypothetical protein VIGAN_01182900 [Vigna angularis var.
           angularis]
          Length = 751

 Score =  244 bits (623), Expect = 2e-72
 Identities = 126/215 (58%), Positives = 147/215 (68%), Gaps = 4/215 (1%)
 Frame = +2

Query: 113 MFFLRNFMHTNGRRVSSFHSPFPRNXXXXXXXXXXXXXQNSIFVSPIIWFTSFFYVIRYP 292
           + F RN       RVSS +S   +N             +NSIF  P+IWFTSF  V+R P
Sbjct: 2   LLFARNIGGKVSLRVSSSYSSSLQNPFPLFSTPSFPSTKNSIFARPVIWFTSFLCVMRCP 61

Query: 293 FVSKPPFDNTVSDSVRSFSQRDGPHMLDSALSPTRVSEFLVNLKGDPKSALKFFKSAGAG 472
           FVSK  FD+  S+S+RSF Q+DGPHM DSA +P  VS  L  LKGDPKSALKFFK A A 
Sbjct: 62  FVSKSSFDDIASESMRSFLQQDGPHMFDSAPAPIWVSMVLEKLKGDPKSALKFFKEAAAR 121

Query: 473 VGICHTNESYCILIHILFFGRFYFDAKNVIKEWILLRK----CDGFDMLWFTRDVCQAGF 640
            G  H  ESYC+L HILF G+FY DA+NVI+EWILLR+     D FD+LW TR+VC+ GF
Sbjct: 122 AGFRHAAESYCVLAHILFCGKFYLDARNVIREWILLRREFPGVDFFDVLWSTRNVCRPGF 181

Query: 641 GVFDALFGVLVELGMLEEAMQCFWKMKKFGVLPKV 745
           GVFD LF VLV+LGML+EA QCFWKM KF VLPKV
Sbjct: 182 GVFDTLFSVLVDLGMLDEARQCFWKMNKFRVLPKV 216


>gb|KRH58559.1| hypothetical protein GLYMA_05G135600 [Glycine max]
          Length = 586

 Score =  232 bits (592), Expect = 3e-69
 Identities = 120/199 (60%), Positives = 138/199 (69%), Gaps = 4/199 (2%)
 Frame = +2

Query: 113 MFFLRNFMHTNGRRVSSFHSPFPRNXXXXXXXXXXXXXQNSIFVSPIIWFTSFFYVIRYP 292
           + F RN       RVSSFHS   +N             QNSIF  P+IWF SF  V+RYP
Sbjct: 2   LLFARNIGSRASLRVSSFHSSPLQNPFPLFFTPSSLSSQNSIFARPMIWFASFLCVMRYP 61

Query: 293 FVSKPPFDNTVSDSVRSFSQRDGPHMLDSALSPTRVSEFLVNLKGDPKSALKFFKSAGAG 472
           FVSKP FD+  S+S+RSF Q+D PH+ DSAL P  VS+ L+NLKGDPKSALKFFK AGA 
Sbjct: 62  FVSKPSFDDIASESMRSFLQQDRPHLFDSALVPIWVSKDLLNLKGDPKSALKFFKEAGAR 121

Query: 473 VGICHTNESYCILIHILFFGRFYFDAKNVIKEWILLRK----CDGFDMLWFTRDVCQAGF 640
            G  H  ESYC+L HILF G FY DA++VIKEWILL +    CD FDMLW TR+VC+ GF
Sbjct: 122 AGFRHAAESYCVLAHILFCGMFYLDARSVIKEWILLGREFPGCDFFDMLWSTRNVCRPGF 181

Query: 641 GVFDALFGVLVELGMLEEA 697
           GVFD LF VLV+LGMLEEA
Sbjct: 182 GVFDTLFSVLVDLGMLEEA 200


>ref|XP_016191251.1| putative pentatricopeptide repeat-containing protein At2g02150
           [Arachis ipaensis]
 ref|XP_016191254.1| putative pentatricopeptide repeat-containing protein At2g02150
           [Arachis ipaensis]
 ref|XP_016191255.1| putative pentatricopeptide repeat-containing protein At2g02150
           [Arachis ipaensis]
 ref|XP_020975612.1| putative pentatricopeptide repeat-containing protein At2g02150
           [Arachis ipaensis]
 ref|XP_020975613.1| putative pentatricopeptide repeat-containing protein At2g02150
           [Arachis ipaensis]
          Length = 761

 Score =  229 bits (585), Expect = 8e-67
 Identities = 128/226 (56%), Positives = 145/226 (64%), Gaps = 15/226 (6%)
 Frame = +2

Query: 113 MFFLRNFMHTNGRRVSSFHSPF-------PRNXXXXXXXXXXXXXQNSIFVSPIIWFTSF 271
           + FLRNF +T GRR S   S F       P +             QN  F  P+IWFT F
Sbjct: 2   LLFLRNFFYT-GRRASPSVSSFSYFFHQNPTHSLPVIFTALTPSSQNPNFACPVIWFTGF 60

Query: 272 FYVIRYPFVSKPPFDNTVSDSVRSF-SQRDGP---HMLDSALSPTRVSEFLVNLKGDPKS 439
             VIRYPF SKP FD+  S+SVRSF  Q+D P    + DSAL+P  V + L  L+GDP S
Sbjct: 61  MCVIRYPFSSKPSFDDIASESVRSFLQQQDSPLIDRIFDSALAPVLVPKILEKLQGDPVS 120

Query: 440 ALKFFKSAGAGVGICHTNESYCILIHILFFGRFYFDAKNVIKEWILLR----KCDGFDML 607
           ALKFF+ AG   G  HT ESYCIL HILF  +FYFDAK VI+EWILLR     CD FDML
Sbjct: 121 ALKFFQLAGTRTGFRHTTESYCILAHILFREKFYFDAKRVIREWILLRGEFPGCDFFDML 180

Query: 608 WFTRDVCQAGFGVFDALFGVLVELGMLEEAMQCFWKMKKFGVLPKV 745
           W TR+VC++GFGVFD LF V VELGMLEEA QCF KM  F VLPKV
Sbjct: 181 WLTRNVCRSGFGVFDTLFSVFVELGMLEEASQCFLKMANFKVLPKV 226


>ref|XP_015957940.1| putative pentatricopeptide repeat-containing protein At2g02150
           [Arachis duranensis]
 ref|XP_015957941.1| putative pentatricopeptide repeat-containing protein At2g02150
           [Arachis duranensis]
 ref|XP_015957944.1| putative pentatricopeptide repeat-containing protein At2g02150
           [Arachis duranensis]
 ref|XP_020995111.1| putative pentatricopeptide repeat-containing protein At2g02150
           [Arachis duranensis]
 ref|XP_020995112.1| putative pentatricopeptide repeat-containing protein At2g02150
           [Arachis duranensis]
 ref|XP_020995113.1| putative pentatricopeptide repeat-containing protein At2g02150
           [Arachis duranensis]
 ref|XP_020995114.1| putative pentatricopeptide repeat-containing protein At2g02150
           [Arachis duranensis]
 ref|XP_020995115.1| putative pentatricopeptide repeat-containing protein At2g02150
           [Arachis duranensis]
          Length = 761

 Score =  228 bits (581), Expect = 3e-66
 Identities = 126/226 (55%), Positives = 145/226 (64%), Gaps = 15/226 (6%)
 Frame = +2

Query: 113 MFFLRNFMHTNGRRVSSFHSPF-------PRNXXXXXXXXXXXXXQNSIFVSPIIWFTSF 271
           + FLRNF +T GRR S   S F       P +             QN  F  P+IWFT F
Sbjct: 2   LLFLRNFFYT-GRRASPSVSSFSYFFHQNPTHSFPVIFTASTPSSQNPNFACPVIWFTGF 60

Query: 272 FYVIRYPFVSKPPFDNTVSDSVRSF-SQRDGP---HMLDSALSPTRVSEFLVNLKGDPKS 439
             VIRYPF SKP FD+  S+SVRSF  Q+D P    + DSAL+P  V + L  L+GDP S
Sbjct: 61  MCVIRYPFSSKPSFDDIASESVRSFLQQQDSPLIDRIFDSALAPVLVPKILEKLQGDPVS 120

Query: 440 ALKFFKSAGAGVGICHTNESYCILIHILFFGRFYFDAKNVIKEWILLR----KCDGFDML 607
           AL+FF+ AG   G  HT ESYCIL HILF  +FYFDAK +I+EWILLR     CD FDML
Sbjct: 121 ALRFFQLAGTRTGFRHTTESYCILAHILFREKFYFDAKRIIREWILLRGEFPGCDFFDML 180

Query: 608 WFTRDVCQAGFGVFDALFGVLVELGMLEEAMQCFWKMKKFGVLPKV 745
           W TR+VC++GFGVFD LF V VELGMLEEA QCF KM  F VLPKV
Sbjct: 181 WLTRNVCRSGFGVFDTLFSVFVELGMLEEASQCFLKMANFKVLPKV 226


>ref|XP_019464819.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 [Lupinus angustifolius]
          Length = 763

 Score =  226 bits (575), Expect = 2e-65
 Identities = 128/227 (56%), Positives = 146/227 (64%), Gaps = 16/227 (7%)
 Frame = +2

Query: 113 MFFLRNFMHTNGR---RVSSFHSPFPRNXXXXXXXXXXXXX---QNSIFVSPIIWFTSFF 274
           + FLRNF H   R    VSSF S  P+N                QNSI   PII FTSF 
Sbjct: 2   LLFLRNFFHIPRRVSPSVSSFSSSIPKNTSYPFPIFLSHSSPTSQNSILTCPIICFTSFL 61

Query: 275 YVIRYPFVSKPPFDN--TVSDSVRSFSQR----DGPHMLDSALSPTRVSEFLVNLKGDPK 436
           YVIRY F SKP FD+    S S+RS  ++    +   + DSAL+P+ VS+ LV LKGDPK
Sbjct: 62  YVIRYSFTSKPHFDHDHVDSQSMRSLLKKQFGIETDTISDSALTPSWVSKVLVKLKGDPK 121

Query: 437 SALKFFKSAGAGVGICHTNESYCILIHILFFGRFYFDAKNVIKEWILLRK----CDGFDM 604
            ALKFFK AGA  G  H  +SYCIL HILF G FY DAK +I E I LR+    CD FDM
Sbjct: 122 LALKFFKWAGARTGFRHATDSYCILAHILFCGMFYLDAKKIITELIFLRREDPGCDFFDM 181

Query: 605 LWFTRDVCQAGFGVFDALFGVLVELGMLEEAMQCFWKMKKFGVLPKV 745
           LW TRDVC+ GFGVFD+LF VL+ELGMLEEA +CFWKMKKF V PKV
Sbjct: 182 LWLTRDVCRPGFGVFDSLFSVLIELGMLEEASECFWKMKKFRVFPKV 228


>ref|XP_019459942.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 [Lupinus angustifolius]
          Length = 760

 Score =  213 bits (543), Expect = 8e-61
 Identities = 123/225 (54%), Positives = 142/225 (63%), Gaps = 16/225 (7%)
 Frame = +2

Query: 119 FLRNFMHTNGR---RVSSFHSPFPRNXXXXXXXXXXXXX---QNSIFVSPIIWFTSFFYV 280
           FLRN+ H   R    V+SF S  P+N                QNSI   P I    F  V
Sbjct: 4   FLRNYFHIARRVSPSVTSFSSSIPKNITYPFPLFLYPSPPSSQNSILTYPTI---CFLCV 60

Query: 281 IRYPFVSKPPFD--NTVSDSVRSFSQR----DGPHMLDSALSPTRVSEFLVNLKGDPKSA 442
           +RYPF SKP FD  N  S S+ +  ++    +  ++ DSAL+P  VSE LV LKGDPK A
Sbjct: 61  VRYPFTSKPYFDDDNIDSQSMCTLLKKQFGIEIDNIFDSALAPIWVSEVLVKLKGDPKLA 120

Query: 443 LKFFKSAGAGVGICHTNESYCILIHILFFGRFYFDAKNVIKEWILLRK----CDGFDMLW 610
           LKFFK A A    CHT +SYCIL HILF G FYFDA+ +I +WILLR+    CD FD LW
Sbjct: 121 LKFFKWAEARTTFCHTTDSYCILAHILFCGMFYFDARKIIMKWILLRREIPGCDFFDTLW 180

Query: 611 FTRDVCQAGFGVFDALFGVLVELGMLEEAMQCFWKMKKFGVLPKV 745
            TRDVC+ GFGVFD LF VLVELGMLEEA QCFWKMKKF VLP+V
Sbjct: 181 LTRDVCRPGFGVFDTLFSVLVELGMLEEARQCFWKMKKFRVLPRV 225


>dbj|GAU13767.1| hypothetical protein TSUD_82770 [Trifolium subterraneum]
          Length = 456

 Score =  201 bits (511), Expect = 2e-58
 Identities = 102/135 (75%), Positives = 110/135 (81%), Gaps = 5/135 (3%)
 Frame = +2

Query: 356 DGPHMLDSALSPTRVSEFLVNLKGDPKSALKFF-KSAGAGVGICHTNESYCILIHILFFG 532
           + PHM DSAL+P  VS+ LVNLKGDPKSALKFF  SAG  VG  HTNESYCIL+HILF G
Sbjct: 17  NAPHMFDSALAPIWVSKVLVNLKGDPKSALKFFYSSAGNQVGFRHTNESYCILVHILFCG 76

Query: 533 RFYFDAKNVIKEWILLRK----CDGFDMLWFTRDVCQAGFGVFDALFGVLVELGMLEEAM 700
            FYFDAKNVI+EWILLR+    CD FDMLW TR+VC+  F VFDALFGVLVELGMLEEA 
Sbjct: 77  MFYFDAKNVIEEWILLRREIPGCDLFDMLWLTRNVCRTDFRVFDALFGVLVELGMLEEAR 136

Query: 701 QCFWKMKKFGVLPKV 745
           QCFWKMK F VLPKV
Sbjct: 137 QCFWKMKNFRVLPKV 151


>ref|XP_006585057.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 isoform X2 [Glycine max]
 gb|KRH42452.1| hypothetical protein GLYMA_08G090700 [Glycine max]
 gb|KRH42453.1| hypothetical protein GLYMA_08G090700 [Glycine max]
          Length = 702

 Score =  197 bits (500), Expect = 6e-55
 Identities = 97/138 (70%), Positives = 108/138 (78%), Gaps = 4/138 (2%)
 Frame = +2

Query: 344 FSQRDGPHMLDSALSPTRVSEFLVNLKGDPKSALKFFKSAGAGVGICHTNESYCILIHIL 523
           F Q+DGPH+ DSAL+P  VS+ LV LKGDPKSALKFFK AGA  G  H  ESYC+L HIL
Sbjct: 30  FLQQDGPHLSDSALAPIWVSKALVKLKGDPKSALKFFKEAGARAGFRHAAESYCVLAHIL 89

Query: 524 FFGRFYFDAKNVIKEWILLRK----CDGFDMLWFTRDVCQAGFGVFDALFGVLVELGMLE 691
           F G FY DA++VIKEWILL +    CD FDMLW TR+VC+ GFGVFD LF VLV+LGMLE
Sbjct: 90  FCGMFYLDARSVIKEWILLGREFPGCDFFDMLWSTRNVCRPGFGVFDTLFNVLVDLGMLE 149

Query: 692 EAMQCFWKMKKFGVLPKV 745
           EA QCFWKM KF VLPKV
Sbjct: 150 EARQCFWKMNKFRVLPKV 167


>dbj|GAU13768.1| hypothetical protein TSUD_82760 [Trifolium subterraneum]
          Length = 659

 Score =  195 bits (495), Expect = 2e-54
 Identities = 100/131 (76%), Positives = 107/131 (81%), Gaps = 5/131 (3%)
 Frame = +2

Query: 368 MLDSALSPTRVSEFLVNLKGDPKSALKFF-KSAGAGVGICHTNESYCILIHILFFGRFYF 544
           M DSAL+P  VS+ LVNLKGDPKSALKFF  SAG  VG  HTNESYCIL+HILF G FYF
Sbjct: 1   MFDSALAPIWVSKVLVNLKGDPKSALKFFYSSAGNQVGFRHTNESYCILVHILFCGMFYF 60

Query: 545 DAKNVIKEWILLRK----CDGFDMLWFTRDVCQAGFGVFDALFGVLVELGMLEEAMQCFW 712
           DAKNVI+EWILLR+    CD FDMLW TR+VC+  F VFDALFGVLVELGMLEEA QCFW
Sbjct: 61  DAKNVIEEWILLRREIPGCDLFDMLWLTRNVCRTDFRVFDALFGVLVELGMLEEARQCFW 120

Query: 713 KMKKFGVLPKV 745
           KMK F VLPKV
Sbjct: 121 KMKNFRVLPKV 131


>ref|XP_020409876.1| putative pentatricopeptide repeat-containing protein At2g02150
           [Prunus persica]
 ref|XP_020409877.1| putative pentatricopeptide repeat-containing protein At2g02150
           [Prunus persica]
 ref|XP_020409878.1| putative pentatricopeptide repeat-containing protein At2g02150
           [Prunus persica]
 gb|ONI31976.1| hypothetical protein PRUPE_1G342600 [Prunus persica]
          Length = 807

 Score =  195 bits (495), Expect = 9e-54
 Identities = 99/186 (53%), Positives = 131/186 (70%), Gaps = 14/186 (7%)
 Frame = +2

Query: 230 NSIFVSPIIWFTSFFYVIRYPFVSKPPF----DNTVSDSVRSFSQRD---GPHMLD---S 379
           +S+   P++WFTSF ++ R+PFV+K       DN  ++S+R   Q D    P +++   S
Sbjct: 87  SSLIACPLVWFTSFLFITRFPFVTKSNPNSFPDNLNTESLRIIIQHDYWDDPRIVNLFGS 146

Query: 380 ALSPTRVSEFLVNLKGDPKSALKFFKSAGAGVGICHTNESYCILIHILFFGRFYFDAKNV 559
           AL+P  VS+FLV L+GDPK ALK F+ +   +G CHT ESYCIL+HILF+ R YFDA  +
Sbjct: 147 ALAPIWVSKFLVELRGDPKLALKLFRWSKTRIGFCHTTESYCILVHILFYARMYFDAHEI 206

Query: 560 IKEWILLRK----CDGFDMLWFTRDVCQAGFGVFDALFGVLVELGMLEEAMQCFWKMKKF 727
           +KE + LR+    CD FD+LW TR+VC+ GFGVFDALF VLVE GMLE+A +CF +MKKF
Sbjct: 207 LKELVSLRRVLPGCDVFDVLWSTRNVCRLGFGVFDALFSVLVEFGMLEKASECFLRMKKF 266

Query: 728 GVLPKV 745
            VLPKV
Sbjct: 267 RVLPKV 272


>ref|XP_008389961.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 [Malus domestica]
 ref|XP_017192130.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 [Malus domestica]
          Length = 796

 Score =  194 bits (494), Expect = 1e-53
 Identities = 115/245 (46%), Positives = 141/245 (57%), Gaps = 27/245 (11%)
 Frame = +2

Query: 92  LKLF*PNMFFLRNFMHTNGRRVSSFHS---------PFPRNXXXXXXXXXXXXXQN--SI 238
           +  F   + FLRN   T  R  SS  S          +P N              +  S+
Sbjct: 19  ISFFSEMLLFLRNLFRTGCRASSSASSRVSXLSSIPQYPSNCRFINLSSLTSSSSHATSL 78

Query: 239 FVSPIIWFTSFFYVIRYPFVSK------PPFDNTVSDSVRSFSQRDG------PHMLDSA 382
              P +WFT F  + R+PFV+K      P   NT  DS+    Q D        ++ DSA
Sbjct: 79  IACPFVWFTGFLCIFRFPFVTKSQPSSFPESLNT--DSLSRIVQHDYWDDPRIVNLFDSA 136

Query: 383 LSPTRVSEFLVNLKGDPKSALKFFKSAGAGVGICHTNESYCILIHILFFGRFYFDAKNVI 562
           L+P  VS FLV LKGDPK ALK FK A   +G  HT ESYCIL+HILFF R Y DA  V+
Sbjct: 137 LAPIWVSRFLVELKGDPKLALKLFKWAKTQIGFRHTTESYCILVHILFFARMYVDAHEVL 196

Query: 563 KEWILLRK----CDGFDMLWFTRDVCQAGFGVFDALFGVLVELGMLEEAMQCFWKMKKFG 730
           +E +LL +    CD FD+LW+TR+VC+ GFGVFDALFGVLVE+GMLEEA +CF +MKKF 
Sbjct: 197 RELVLLSRALPGCDVFDVLWWTRNVCRVGFGVFDALFGVLVEVGMLEEASECFLRMKKFR 256

Query: 731 VLPKV 745
           VLPKV
Sbjct: 257 VLPKV 261


>ref|XP_021811422.1| putative pentatricopeptide repeat-containing protein At2g02150
           [Prunus avium]
 ref|XP_021811423.1| putative pentatricopeptide repeat-containing protein At2g02150
           [Prunus avium]
          Length = 807

 Score =  193 bits (490), Expect = 4e-53
 Identities = 99/185 (53%), Positives = 129/185 (69%), Gaps = 14/185 (7%)
 Frame = +2

Query: 233 SIFVSPIIWFTSFFYVIRYPFVSKPPF----DNTVSDSVRSFSQRD---GPHMLD---SA 382
           S+   P++WFTSF  + R+PFV+K       DN  ++S+R   Q D    P +++   SA
Sbjct: 88  SLIACPLVWFTSFLCITRFPFVTKSNPNSFPDNINTESLRIIIQHDYWDDPRIVNLFGSA 147

Query: 383 LSPTRVSEFLVNLKGDPKSALKFFKSAGAGVGICHTNESYCILIHILFFGRFYFDAKNVI 562
           L+P  VS+FLV L+GDPK ALK F+ +   +G CHT ESYCIL+HILF+ R YFDA  ++
Sbjct: 148 LAPIWVSKFLVELRGDPKLALKLFRWSKTQIGFCHTTESYCILVHILFYARMYFDAHEIL 207

Query: 563 KEWILLRK----CDGFDMLWFTRDVCQAGFGVFDALFGVLVELGMLEEAMQCFWKMKKFG 730
           +E + LR+    CD FD+LW TR+VC+ GFGVFDALF VLVE GMLEEA +CF +MKKF 
Sbjct: 208 RELVSLRRVLPGCDVFDVLWSTRNVCRLGFGVFDALFSVLVEFGMLEEASECFLRMKKFR 267

Query: 731 VLPKV 745
           VLPKV
Sbjct: 268 VLPKV 272


>ref|XP_008221377.2| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 [Prunus mume]
 ref|XP_016647858.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 [Prunus mume]
 ref|XP_016647859.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At2g02150 [Prunus mume]
          Length = 807

 Score =  193 bits (490), Expect = 4e-53
 Identities = 109/239 (45%), Positives = 146/239 (61%), Gaps = 23/239 (9%)
 Frame = +2

Query: 98  LF*PNMFFLRNFMHTNGRRVSSFHSPFPRNXXXXXXXXXXXXXQNSIFVS---------P 250
           +F   + FLRN +    R  +SFH   P +              +S+ +S         P
Sbjct: 36  IFSKMLIFLRNLLQMGCR--ASFHRVSPLSSIPQHSSNCLFINVSSLSLSSSHGSLIACP 93

Query: 251 IIWFTSFFYVIRYPFVSKPP----FDNTVSDSVRSFSQRD---GPHMLD---SALSPTRV 400
           ++WFTSF  + R+PFV+K       DN  ++S+R   Q D    P +++   SAL+P   
Sbjct: 94  LVWFTSFLCITRFPFVTKSNPNSFRDNLNTESLRIIIQHDYWDDPRIVNLFGSALAPIWA 153

Query: 401 SEFLVNLKGDPKSALKFFKSAGAGVGICHTNESYCILIHILFFGRFYFDAKNVIKEWILL 580
           S+FLV L+GDPK ALK F+ +   +G CHT ESYCIL+HILF+ R YFDA  ++KE + L
Sbjct: 154 SKFLVELRGDPKLALKLFRWSKTRIGFCHTTESYCILVHILFYARMYFDAHEILKELVSL 213

Query: 581 RK----CDGFDMLWFTRDVCQAGFGVFDALFGVLVELGMLEEAMQCFWKMKKFGVLPKV 745
           R+    CD FD+LW TR+VC+ GFGVFDALF VLVE GMLE+A +CF +MKKF VLPKV
Sbjct: 214 RRVSLGCDVFDVLWSTRNVCRLGFGVFDALFSVLVEFGMLEKASECFLRMKKFRVLPKV 272


Top