BLASTX nr result

ID: Astragalus24_contig00005788 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00005788
         (948 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_012568596.1| PREDICTED: pentatricopeptide repeat-containi...   347   e-109
dbj|GAU32154.1| hypothetical protein TSUD_68250 [Trifolium subte...   345   e-109
ref|XP_003617156.2| PPR containing plant-like protein [Medicago ...   335   e-105
gb|PNY12861.1| PPR containing plant-like protein [Trifolium prat...   336   e-103
ref|XP_019434547.1| PREDICTED: pentatricopeptide repeat-containi...   325   e-101
gb|PNY12930.1| pentatricopeptide repeat-containing protein [Trif...   323   e-100
dbj|GAU21648.1| hypothetical protein TSUD_251310 [Trifolium subt...   321   e-100
ref|XP_003617158.1| PPR containing plant-like protein [Medicago ...   319   6e-98
gb|KYP73199.1| Pentatricopeptide repeat-containing protein At1g7...   313   1e-96
ref|XP_020204964.1| pentatricopeptide repeat-containing protein ...   313   2e-96
gb|KHN26011.1| Pentatricopeptide repeat-containing protein [Glyc...   292   5e-91
ref|XP_007141545.1| hypothetical protein PHAVU_008G205300g [Phas...   298   7e-91
ref|XP_006595790.1| PREDICTED: pentatricopeptide repeat-containi...   292   1e-88
gb|KHN38419.1| Pentatricopeptide repeat-containing protein [Glyc...   284   2e-87
ref|XP_003518473.1| PREDICTED: pentatricopeptide repeat-containi...   284   1e-85
ref|XP_017430727.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   283   5e-85
ref|XP_016187228.1| pentatricopeptide repeat-containing protein ...   269   7e-81
ref|XP_020962746.1| pentatricopeptide repeat-containing protein ...   263   3e-79
ref|XP_020982874.1| pentatricopeptide repeat-containing protein ...   264   6e-79
ref|XP_020982873.1| pentatricopeptide repeat-containing protein ...   264   8e-79

>ref|XP_012568596.1| PREDICTED: pentatricopeptide repeat-containing protein At1g71210
            [Cicer arietinum]
          Length = 874

 Score =  347 bits (891), Expect = e-109
 Identities = 172/255 (67%), Positives = 209/255 (81%)
 Frame = +2

Query: 2    FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181
            F+LM RNNI PN+SS++L+LNSY+KSGR  DA  FF S+R  QG +SRRLY+SMI GL K
Sbjct: 610  FELMLRNNIAPNVSSQILLLNSYVKSGRLADALTFFNSIR-HQGAVSRRLYDSMIQGLCK 668

Query: 182  SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361
            SNK D+A   LFEML AGLNP I  YE L+QKLCSLKRYHEAINLV++Y++ GRRLTS+L
Sbjct: 669  SNKVDIAHNLLFEMLKAGLNPGIGCYENLVQKLCSLKRYHEAINLVHMYLKTGRRLTSYL 728

Query: 362  GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541
            GNILL+HSL +  VY+TC++SRGA+EGE S IS L F+IGAFSG LRVNHSI+ELE+LIA
Sbjct: 729  GNILLFHSLQSQEVYDTCIQSRGAKEGESSAISTLSFVIGAFSGCLRVNHSIEELEKLIA 788

Query: 542  ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721
            +CFP D+YTYNLLL + +DFD  QA  LF RI Q  +GYEPNGWTY+IMV GF N+GR+D
Sbjct: 789  LCFPLDLYTYNLLLRRVSDFDFDQALRLFDRIRQ--RGYEPNGWTYNIMVHGFANNGRRD 846

Query: 722  DAKHWIGEMNRKGFH 766
            +AK W  EM++KGF+
Sbjct: 847  EAKQWSEEMSQKGFY 861


>dbj|GAU32154.1| hypothetical protein TSUD_68250 [Trifolium subterraneum]
          Length = 850

 Score =  345 bits (885), Expect = e-109
 Identities = 170/260 (65%), Positives = 212/260 (81%)
 Frame = +2

Query: 2    FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181
            ++LMRR+N+ P + S+ L+LNSYLKSG+   A +FF SLR  QG++S++LY SM++GL K
Sbjct: 589  YELMRRSNMVPTIVSQALVLNSYLKSGKIFYALSFFDSLR-RQGVVSKKLYTSMVIGLCK 647

Query: 182  SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361
            +NK ++A  FLFEMLNAGLN  I+ YE L+QKLCSLK+YHEAINLV VYM+ GRRLTSFL
Sbjct: 648  NNKANIAHDFLFEMLNAGLNTGIECYESLVQKLCSLKKYHEAINLVQVYMKTGRRLTSFL 707

Query: 362  GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541
            GNILL+HSL +P++Y  CV+  GA+EGE S IS L F+IGAFSG LRVNHS++ELEELI 
Sbjct: 708  GNILLFHSLMSPDLYEICVQMGGAKEGESSPISTLNFVIGAFSGCLRVNHSVEELEELIV 767

Query: 542  ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721
             CFP D+ TYNLLL K T++DM QACELF+RI Q  +GYEPNGWTY+IMV GF NHGR D
Sbjct: 768  TCFPLDMCTYNLLLRKITNYDMNQACELFNRICQ--RGYEPNGWTYNIMVNGFSNHGRND 825

Query: 722  DAKHWIGEMNRKGFHPKENT 781
            +AK W+ EM++KGF+P ENT
Sbjct: 826  EAKQWVEEMHQKGFYPTENT 845


>ref|XP_003617156.2| PPR containing plant-like protein [Medicago truncatula]
 gb|AET00115.2| PPR containing plant-like protein [Medicago truncatula]
          Length = 879

 Score =  335 bits (858), Expect = e-105
 Identities = 166/266 (62%), Positives = 210/266 (78%)
 Frame = +2

Query: 2    FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181
            ++LM RNNI P+ SS+ L+L  YLKSG+ +DA NFF SLR  QG +S+++Y S+I  L K
Sbjct: 609  YELMLRNNIVPSSSSQRLVLIGYLKSGKISDALNFFHSLR-RQGTVSKKVYQSIIFALCK 667

Query: 182  SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361
            S K D+A  FLF+M  AGLNPSI+ +E+L+Q LCSL+RYHEAINLV+VY++MGRRLT+FL
Sbjct: 668  SCKADIAHDFLFQMFKAGLNPSIECFEILVQTLCSLERYHEAINLVHVYIKMGRRLTNFL 727

Query: 362  GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541
            GNILL HSL +P++Y+ CVR RGA+E ECS +S L FIIGAFS  LRVN S++ELE+LI+
Sbjct: 728  GNILLSHSLISPDIYHACVRLRGAKEEECSPMSTLSFIIGAFSRCLRVNPSVEELEKLIS 787

Query: 542  ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721
             CFP D YTYN LL + T +DM QACELF+RI Q  +GYEPN WTY+IMV GF NHGR D
Sbjct: 788  TCFPLDFYTYNQLLRRVTQYDMNQACELFNRIRQ--RGYEPNDWTYNIMVSGFSNHGRND 845

Query: 722  DAKHWIGEMNRKGFHPKENTLSMYQK 799
            +AK W+ EM++KGF+P+ENT    QK
Sbjct: 846  EAKQWVEEMHQKGFYPRENTKRNVQK 871


>gb|PNY12861.1| PPR containing plant-like protein [Trifolium pratense]
          Length = 1080

 Score =  336 bits (861), Expect = e-103
 Identities = 172/259 (66%), Positives = 205/259 (79%)
 Frame = +2

Query: 2    FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181
            +DLM R NI P + S+ L+L SYLKSGR   A  FF SLR  QG++S++LY SM++GL K
Sbjct: 819  YDLMVRCNIVPTIISQALVLISYLKSGRIHYALKFFDSLR-RQGVVSKKLYTSMVIGLCK 877

Query: 182  SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361
            +NK D+A  FLFEMLNA LNP I+ YE L+QKLCSLKRY EAINLV VYM+ GRRLTSFL
Sbjct: 878  NNKADIARDFLFEMLNAKLNPGIECYESLVQKLCSLKRYDEAINLVQVYMKRGRRLTSFL 937

Query: 362  GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541
            GNI L HSLT+P+VY+ CV+   AEEGE S IS L F+IGAFSGRLRVN SI+ELEELIA
Sbjct: 938  GNIFLCHSLTSPDVYDICVQIGRAEEGESSPISTLSFVIGAFSGRLRVNRSIEELEELIA 997

Query: 542  ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721
            +CFP D YTYNLLL + T++DM QACEL +RI Q  +GYEPN WTY+IMV GF NHGR D
Sbjct: 998  MCFPLDTYTYNLLLRRITNYDMNQACELVNRICQ--RGYEPNDWTYNIMVDGFKNHGRND 1055

Query: 722  DAKHWIGEMNRKGFHPKEN 778
            +AK W+ EM++KGF+P EN
Sbjct: 1056 EAKQWVEEMHKKGFYPIEN 1074


>ref|XP_019434547.1| PREDICTED: pentatricopeptide repeat-containing protein At1g71210
            [Lupinus angustifolius]
 gb|OIW16274.1| hypothetical protein TanjilG_18989 [Lupinus angustifolius]
          Length = 869

 Score =  325 bits (834), Expect = e-101
 Identities = 159/260 (61%), Positives = 209/260 (80%)
 Frame = +2

Query: 2    FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181
            F+LM+RN ++P++SS+VLML SYLKS   ++A  FF +LRC QG++SR+L+N++IVGL K
Sbjct: 607  FELMQRNGVEPDMSSQVLMLKSYLKSESISEALTFFHNLRC-QGIVSRKLFNTLIVGLCK 665

Query: 182  SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361
            SNK D+A +FLFEM+ A LNPSI+ YEVL+Q+LCS +RY +AI++VN+Y +MGRRLTSF+
Sbjct: 666  SNKVDIAREFLFEMIKAELNPSIECYEVLVQQLCSSQRYRDAIHVVNLYEKMGRRLTSFI 725

Query: 362  GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541
            GN+LLYHSL +  +Y+ C + RG  +GE SG S+L  IIGAFSG LRVNH I++LEELI+
Sbjct: 726  GNVLLYHSLISRELYDACAQLRGVGDGEFSGSSMLTLIIGAFSGHLRVNHFIEDLEELIS 785

Query: 542  ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721
             CFP DIYTYNLLL KA+  DM QA ELF R+ Q  +GYEPN WTYD+MV GF  HGR++
Sbjct: 786  KCFPLDIYTYNLLLRKASHGDMDQAFELFGRMCQ--RGYEPNWWTYDVMVHGFSKHGRQN 843

Query: 722  DAKHWIGEMNRKGFHPKENT 781
            +AK W+ EM+ KG +PKE+T
Sbjct: 844  EAKRWVEEMSHKGLYPKEST 863


>gb|PNY12930.1| pentatricopeptide repeat-containing protein [Trifolium pratense]
          Length = 893

 Score =  323 bits (827), Expect = e-100
 Identities = 168/274 (61%), Positives = 202/274 (73%), Gaps = 14/274 (5%)
 Frame = +2

Query: 2    FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181
            ++LM  NNI P + S+ L+LNSYL S R ++A NFF SLR DQG++S+RLY+SMI GL K
Sbjct: 613  YELMLLNNIVPTIVSQALLLNSYLGSERISEALNFFYSLR-DQGVVSKRLYSSMINGLCK 671

Query: 182  SNKPDMAL--------------QFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLV 319
             NK D+A               + L +MLNAGLNP I+ YE L+QKLCSLKRY EAINLV
Sbjct: 672  HNKADIACDKSDIAHDKADIARRILVDMLNAGLNPGIECYENLVQKLCSLKRYPEAINLV 731

Query: 320  NVYMRMGRRLTSFLGNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRL 499
             VYM+MGRRLTSFLGNILL+HSL TPNVY+TCV+ RG +EGE S  S L  +IG FS  L
Sbjct: 732  QVYMKMGRRLTSFLGNILLFHSLITPNVYHTCVKMRGEKEGESSPFSTLTVVIGVFSDCL 791

Query: 500  RVNHSIKELEELIAICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTY 679
            +VNHSI   EEL+A+CFP DIYTYNLLL + T +DM QACELF+RI Q  +GYEPNGWTY
Sbjct: 792  KVNHSI---EELVALCFPLDIYTYNLLLRRTTSYDMNQACELFNRIRQ--RGYEPNGWTY 846

Query: 680  DIMVVGFLNHGRKDDAKHWIGEMNRKGFHPKENT 781
            DIMV GF  HGR  + K W+ EM+ +GF+P E T
Sbjct: 847  DIMVHGFSKHGRNYETKQWLEEMHHEGFYPTETT 880


>dbj|GAU21648.1| hypothetical protein TSUD_251310 [Trifolium subterraneum]
          Length = 835

 Score =  321 bits (823), Expect = e-100
 Identities = 167/263 (63%), Positives = 200/263 (76%), Gaps = 3/263 (1%)
 Frame = +2

Query: 2    FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181
            ++LM R+NI P + S+ L+LNSYL S R  DA NFF SLR  QG++S+RLY+SMI+GL K
Sbjct: 575  YELMLRSNIVPTIVSQSLLLNSYLGSERIYDALNFFNSLR-RQGVVSKRLYSSMIIGLCK 633

Query: 182  SNKPDMAL---QFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLT 352
             N  DMA      LF+MLNAGLNP I+ YE L+Q LCSL++Y EAINLV VYM+ GRRLT
Sbjct: 634  HNMDDMAHIAHDILFDMLNAGLNPGIECYESLVQTLCSLEKYREAINLVQVYMKTGRRLT 693

Query: 353  SFLGNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEE 532
            SFLGNILL+HS    +VY+TCV+ RGA+EGE S  S L  +IG F+G +RVNHSI   EE
Sbjct: 694  SFLGNILLFHS---SDVYHTCVQMRGAKEGESSPFSTLTAVIGVFTGCVRVNHSI---EE 747

Query: 533  LIAICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHG 712
            LIA+CFP DIYTYNLLL + T +DM QACELF+RIHQ  +GYEPN WTYDIMV  F  HG
Sbjct: 748  LIALCFPLDIYTYNLLLRRKTSYDMNQACELFNRIHQ--RGYEPNRWTYDIMVHAFAKHG 805

Query: 713  RKDDAKHWIGEMNRKGFHPKENT 781
            RKD+AK W+ EM+ KGFHP E T
Sbjct: 806  RKDEAKQWVNEMHHKGFHPTETT 828


>ref|XP_003617158.1| PPR containing plant-like protein [Medicago truncatula]
 gb|AET00117.1| PPR containing plant-like protein [Medicago truncatula]
          Length = 978

 Score =  319 bits (817), Expect = 6e-98
 Identities = 161/260 (61%), Positives = 202/260 (77%)
 Frame = +2

Query: 2    FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181
            ++LM RNNI P L S+ L+LNSYL++G+  DA NFF SLR   G++S++LY SM++GL K
Sbjct: 621  YELMPRNNIVPTLLSQRLVLNSYLRNGKIIDALNFFNSLR-RLGVVSKKLYCSMVIGLCK 679

Query: 182  SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361
            SNK D+A  FLFEMLNAG+NP I+ +E L+ KLCSL+RYH+AINLV VYM+ GRRLTSFL
Sbjct: 680  SNKVDIAHDFLFEMLNAGVNPDIECFESLVWKLCSLRRYHKAINLVQVYMKGGRRLTSFL 739

Query: 362  GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541
            GN LL+HS  +P+VY   V  RGAEEGE S IS L F+IGAFSG L VN SI+ELE+LIA
Sbjct: 740  GNTLLWHSSLSPDVYGILVHLRGAEEGENSPISTLSFVIGAFSGCLSVNRSIEELEKLIA 799

Query: 542  ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721
            +CFP D +TYN LL +   +DM QACELF+R+ Q  +G +PNGWTYD MV GFLNHGR D
Sbjct: 800  MCFPLDTHTYNQLLRRVASYDMNQACELFNRMCQ--RGCKPNGWTYDFMVRGFLNHGRND 857

Query: 722  DAKHWIGEMNRKGFHPKENT 781
            +AK W+ EM++KGF   ++T
Sbjct: 858  EAKQWVEEMHQKGFDLTDST 877


>gb|KYP73199.1| Pentatricopeptide repeat-containing protein At1g71210 family [Cajanus
            cajan]
          Length = 873

 Score =  313 bits (802), Expect = 1e-96
 Identities = 153/260 (58%), Positives = 198/260 (76%), Gaps = 1/260 (0%)
 Frame = +2

Query: 2    FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181
            F+LM+RN I PNL S + ML  YLKSGR +DA NFF  +R  +G+  +RLYN++I+GL K
Sbjct: 604  FELMQRNGITPNLCSCIFMLQGYLKSGRISDALNFFNDVRL-RGLAGKRLYNALIIGLCK 662

Query: 182  SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361
            SNK D+A + LF ML  GLNPS++ YE+++QKLCSL+RYHEA+++VNVY +MGR LTSF+
Sbjct: 663  SNKADIAREMLFSMLRVGLNPSVECYELVVQKLCSLRRYHEAMHIVNVYEKMGRPLTSFI 722

Query: 362  GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541
            GN+LLYHSL +P +Y+TCV  RG EEG  SG S+L  +IGAFSG +RV H +K+LE+LI 
Sbjct: 723  GNVLLYHSLISPQLYDTCVHLRGVEEGGFSGNSLLTLMIGAFSGCVRVRHYVKDLEQLIE 782

Query: 542  ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721
             CFP DI+TYNLLL +    DM  AC LF RI Q  +GYEPN WTYDIM+ GF +HGR+D
Sbjct: 783  KCFPLDIFTYNLLLKQVAKSDMNIACMLFGRICQ--RGYEPNCWTYDIMIRGFSDHGRRD 840

Query: 722  DAKHWIGEMNRKGF-HPKEN 778
             AK W+ +M R+GF H ++N
Sbjct: 841  KAKRWLEKMFRRGFYHDRQN 860


>ref|XP_020204964.1| pentatricopeptide repeat-containing protein At1g71210 [Cajanus cajan]
          Length = 885

 Score =  313 bits (802), Expect = 2e-96
 Identities = 153/260 (58%), Positives = 198/260 (76%), Gaps = 1/260 (0%)
 Frame = +2

Query: 2    FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181
            F+LM+RN I PNL S + ML  YLKSGR +DA NFF  +R  +G+  +RLYN++I+GL K
Sbjct: 616  FELMQRNGITPNLCSCIFMLQGYLKSGRISDALNFFNDVRL-RGLAGKRLYNALIIGLCK 674

Query: 182  SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361
            SNK D+A + LF ML  GLNPS++ YE+++QKLCSL+RYHEA+++VNVY +MGR LTSF+
Sbjct: 675  SNKADIAREMLFSMLRVGLNPSVECYELVVQKLCSLRRYHEAMHIVNVYEKMGRPLTSFI 734

Query: 362  GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541
            GN+LLYHSL +P +Y+TCV  RG EEG  SG S+L  +IGAFSG +RV H +K+LE+LI 
Sbjct: 735  GNVLLYHSLISPQLYDTCVHLRGVEEGGFSGNSLLTLMIGAFSGCVRVRHYVKDLEQLIE 794

Query: 542  ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721
             CFP DI+TYNLLL +    DM  AC LF RI Q  +GYEPN WTYDIM+ GF +HGR+D
Sbjct: 795  KCFPLDIFTYNLLLKQVAKSDMNIACMLFGRICQ--RGYEPNCWTYDIMIRGFSDHGRRD 852

Query: 722  DAKHWIGEMNRKGF-HPKEN 778
             AK W+ +M R+GF H ++N
Sbjct: 853  KAKRWLEKMFRRGFYHDRQN 872


>gb|KHN26011.1| Pentatricopeptide repeat-containing protein [Glycine soja]
          Length = 599

 Score =  292 bits (747), Expect = 5e-91
 Identities = 145/255 (56%), Positives = 194/255 (76%)
 Frame = +2

Query: 2    FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181
            F+LM+RN IKPNLSS +LML+ YL SGR +DA NFF  +R  QG+ +++LY ++I GL K
Sbjct: 344  FELMQRNGIKPNLSSLILMLHVYLLSGRISDALNFFNGVR-RQGLATKKLYVALITGLCK 402

Query: 182  SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361
             NK D++ ++ F ML  GLNPS++ YE+L+QKLCSL++Y EAI+++NV  +MGR ++SF+
Sbjct: 403  FNKIDISREYFFSMLRVGLNPSLECYELLVQKLCSLQKYSEAIHIINVSQKMGRPVSSFI 462

Query: 362  GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541
            GN+LLYHSL +P +Y+TC   RGAEEG  SG S L ++IGAFSGRLRV+H I +LE L+ 
Sbjct: 463  GNVLLYHSLISPQLYDTCNYLRGAEEGVFSGNSTLCWMIGAFSGRLRVSHYIADLERLVE 522

Query: 542  ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721
             CFPP+I+TYNLLL +    DM +A  LF R+ Q  +GY+PN WTYDIMV GF  HGRK 
Sbjct: 523  RCFPPNIFTYNLLLKQVAKSDMDKARLLFARMCQ--RGYQPNCWTYDIMVRGFSIHGRKH 580

Query: 722  DAKHWIGEMNRKGFH 766
            +A+ W+ EM RKGF+
Sbjct: 581  EARRWLEEMFRKGFY 595


>ref|XP_007141545.1| hypothetical protein PHAVU_008G205300g [Phaseolus vulgaris]
 ref|XP_007141546.1| hypothetical protein PHAVU_008G205300g [Phaseolus vulgaris]
 gb|ESW13539.1| hypothetical protein PHAVU_008G205300g [Phaseolus vulgaris]
 gb|ESW13540.1| hypothetical protein PHAVU_008G205300g [Phaseolus vulgaris]
          Length = 875

 Score =  298 bits (763), Expect = 7e-91
 Identities = 145/255 (56%), Positives = 192/255 (75%)
 Frame = +2

Query: 2    FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181
            F+LM+R+ ++PNL S++ +L  YL SGR  DA NFF  +R +QG+  + LY +++ GL K
Sbjct: 617  FELMQRSGVEPNLLSRIFVLRGYLFSGRIADALNFFNVVR-NQGLARKALYTTLVSGLCK 675

Query: 182  SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361
            SN+ DM+L+F F M   GL P ++ +E+L+QKLCSL+RYHEAI++VN Y +MGR ++SF+
Sbjct: 676  SNRIDMSLEFFFTMFRVGLYPGLECFELLVQKLCSLRRYHEAIHIVNAYEKMGRPVSSFI 735

Query: 362  GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541
            GN+LLYHSL +P +YNTCV  +G EEG  SG S L  +IGAFSG LRV+H I +LE+LI 
Sbjct: 736  GNVLLYHSLISPQLYNTCVHLKGVEEGGFSGNSALSLVIGAFSGCLRVSHYISDLEQLIE 795

Query: 542  ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721
             CFPPDI+TYNLLL + +  DM +A  LF RI Q  KGY+P+ WTYDIMV GF NHGRKD
Sbjct: 796  KCFPPDIFTYNLLLKELSKSDMDKARLLFARICQ--KGYKPDDWTYDIMVRGFSNHGRKD 853

Query: 722  DAKHWIGEMNRKGFH 766
            +AK W+ EM RKGF+
Sbjct: 854  EAKQWLEEMLRKGFY 868


>ref|XP_006595790.1| PREDICTED: pentatricopeptide repeat-containing protein At1g71210-like
            [Glycine max]
 ref|XP_006595792.1| PREDICTED: pentatricopeptide repeat-containing protein At1g71210-like
            [Glycine max]
 ref|XP_014622675.1| PREDICTED: pentatricopeptide repeat-containing protein At1g71210-like
            [Glycine max]
 ref|XP_014622676.1| PREDICTED: pentatricopeptide repeat-containing protein At1g71210-like
            [Glycine max]
 gb|KRH14653.1| hypothetical protein GLYMA_14G039600 [Glycine max]
          Length = 868

 Score =  292 bits (747), Expect = 1e-88
 Identities = 145/255 (56%), Positives = 194/255 (76%)
 Frame = +2

Query: 2    FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181
            F+LM+RN IKPNLSS +LML+ YL SGR +DA NFF  +R  QG+ +++LY ++I GL K
Sbjct: 613  FELMQRNGIKPNLSSLILMLHVYLLSGRISDALNFFNGVR-RQGLATKKLYVALITGLCK 671

Query: 182  SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361
             NK D++ ++ F ML  GLNPS++ YE+L+QKLCSL++Y EAI+++NV  +MGR ++SF+
Sbjct: 672  FNKIDISREYFFSMLRVGLNPSLECYELLVQKLCSLQKYSEAIHIINVSQKMGRPVSSFI 731

Query: 362  GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541
            GN+LLYHSL +P +Y+TC   RGAEEG  SG S L ++IGAFSGRLRV+H I +LE L+ 
Sbjct: 732  GNVLLYHSLISPQLYDTCNYLRGAEEGVFSGNSTLCWMIGAFSGRLRVSHYIADLERLVE 791

Query: 542  ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721
             CFPP+I+TYNLLL +    DM +A  LF R+ Q  +GY+PN WTYDIMV GF  HGRK 
Sbjct: 792  RCFPPNIFTYNLLLKQVAKSDMDKARLLFARMCQ--RGYQPNCWTYDIMVRGFSIHGRKH 849

Query: 722  DAKHWIGEMNRKGFH 766
            +A+ W+ EM RKGF+
Sbjct: 850  EARRWLEEMFRKGFY 864


>gb|KHN38419.1| Pentatricopeptide repeat-containing protein [Glycine soja]
          Length = 662

 Score =  284 bits (727), Expect = 2e-87
 Identities = 139/251 (55%), Positives = 188/251 (74%)
 Frame = +2

Query: 2    FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181
            F+LM+RN I PN+ S +LM+N YL SGR +DA NFF  ++  +G+ +++LY ++I GL K
Sbjct: 414  FELMQRNGITPNMCSLILMMNGYLISGRISDALNFFNDVQ-RRGLATKKLYVALITGLCK 472

Query: 182  SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361
            SNK D++ ++ F ML  GLNPS++ YE+L+QKLCSL+RY EA++++NV  +MGR ++SF+
Sbjct: 473  SNKVDISREYFFRMLRVGLNPSLECYELLVQKLCSLQRYSEAMHIINVSQKMGRPVSSFI 532

Query: 362  GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541
            GN+LLYHSL +P +Y+TCV  RG EEG  SG S L  +IGAFSGRLRV+H I +LE LI 
Sbjct: 533  GNVLLYHSLISPQLYDTCVNLRGVEEGVFSGNSTLCLMIGAFSGRLRVSHYITDLERLIE 592

Query: 542  ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721
             CFPP+I+TYNLLL +    DM +A  LF R+ Q  +GY+PN WTYDIMV GF  HGR D
Sbjct: 593  KCFPPNIFTYNLLLKQVARSDMDKARLLFARMCQ--RGYQPNSWTYDIMVRGFSIHGRND 650

Query: 722  DAKHWIGEMNR 754
            +A+ W+ EM R
Sbjct: 651  EARRWLKEMFR 661


>ref|XP_003518473.1| PREDICTED: pentatricopeptide repeat-containing protein At1g71210-like
            [Glycine max]
 gb|KRH73491.1| hypothetical protein GLYMA_02G276200 [Glycine max]
          Length = 872

 Score =  284 bits (727), Expect = 1e-85
 Identities = 139/251 (55%), Positives = 188/251 (74%)
 Frame = +2

Query: 2    FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181
            F+LM+RN I PN+ S +LM+N YL SGR +DA NFF  ++  +G+ +++LY ++I GL K
Sbjct: 624  FELMQRNGITPNMCSLILMMNGYLISGRISDALNFFNDVQ-RRGLATKKLYVALITGLCK 682

Query: 182  SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361
            SNK D++ ++ F ML  GLNPS++ YE+L+QKLCSL+RY EA++++NV  +MGR ++SF+
Sbjct: 683  SNKVDISREYFFRMLRVGLNPSLECYELLVQKLCSLQRYSEAMHIINVSQKMGRPVSSFI 742

Query: 362  GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541
            GN+LLYHSL +P +Y+TCV  RG EEG  SG S L  +IGAFSGRLRV+H I +LE LI 
Sbjct: 743  GNVLLYHSLISPQLYDTCVNLRGVEEGVFSGNSTLCLMIGAFSGRLRVSHYITDLERLIE 802

Query: 542  ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721
             CFPP+I+TYNLLL +    DM +A  LF R+ Q  +GY+PN WTYDIMV GF  HGR D
Sbjct: 803  KCFPPNIFTYNLLLKQVARSDMDKARLLFARMCQ--RGYQPNSWTYDIMVRGFSIHGRND 860

Query: 722  DAKHWIGEMNR 754
            +A+ W+ EM R
Sbjct: 861  EARRWLKEMFR 871


>ref|XP_017430727.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At1g71210 [Vigna angularis]
          Length = 875

 Score =  283 bits (723), Expect = 5e-85
 Identities = 144/254 (56%), Positives = 184/254 (72%)
 Frame = +2

Query: 2    FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181
            F+LM+ N ++PNL+S+VL+L  YL+SGR +DA +FF  +R  QG+  +RLY +++ GL K
Sbjct: 618  FELMQINGVEPNLNSRVLVLRGYLRSGRISDALSFFNVVR-GQGLECKRLYTTLVNGLCK 676

Query: 182  SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361
             N+ DM+L F   M   GLNPS++ YE+L+Q+LCSL+RY EAI +VN Y +MGR ++SF+
Sbjct: 677  CNRIDMSLGFFLSMFRVGLNPSLECYELLVQELCSLRRYQEAIRIVNAYEKMGRPISSFM 736

Query: 362  GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541
            GN LL HSL +P +Y+TCV  RG  EGE S  S L  +IGAFSG LRV H I +LE LI 
Sbjct: 737  GNQLLQHSLISPKLYDTCVYLRGVGEGEFSANSTLNLVIGAFSGCLRVTHYISDLERLIE 796

Query: 542  ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721
             CFPPDI+TYNLLL + +  DM +A  LF RI Q  KGYEP+GWTY IMV GF NHGRKD
Sbjct: 797  KCFPPDIFTYNLLLKELSKSDMDKARLLFGRICQ--KGYEPDGWTYHIMVRGFSNHGRKD 854

Query: 722  DAKHWIGEMNRKGF 763
            +AK W  EM R GF
Sbjct: 855  EAKRWNKEMLRTGF 868


>ref|XP_016187228.1| pentatricopeptide repeat-containing protein At1g71210 isoform X2
            [Arachis ipaensis]
          Length = 753

 Score =  269 bits (688), Expect = 7e-81
 Identities = 138/251 (54%), Positives = 180/251 (71%)
 Frame = +2

Query: 2    FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181
            F+LM+RN I+P  SS +LML +YL+SGR  DA NFF ++   +G+ SR+LYN ++V L K
Sbjct: 496  FELMQRNGIQPTSSSLILMLKAYLESGRTYDALNFFNNV-WSRGLASRKLYNCLVVSLCK 554

Query: 182  SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361
            S  P+ A   L +ML  G +PSI+ YE L+ +LCS KRYHEA+NLVNVY +MGR+LTSFL
Sbjct: 555  SKNPEPAYLLLQQMLRDGFHPSIECYENLVLELCSSKRYHEAVNLVNVYEKMGRQLTSFL 614

Query: 362  GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541
            GN+LLY S+ +P VYN CVR RG +E   +  S+L F++GAF    RV+H +++LEELIA
Sbjct: 615  GNVLLYQSMFSPEVYNACVRLRGVKEEGKTDWSMLSFVVGAFYDHRRVSH-VEDLEELIA 673

Query: 542  ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721
             CFP DIYTYNLLL KA   DMGQA ELF R+ +  +G+EPN WTY I+V GF  H  +D
Sbjct: 674  KCFPLDIYTYNLLLRKACKSDMGQAYELFERMRR--RGFEPNRWTYTILVYGFKRHEMRD 731

Query: 722  DAKHWIGEMNR 754
            +A+ W  E  R
Sbjct: 732  EAERWFQESKR 742


>ref|XP_020962746.1| pentatricopeptide repeat-containing protein At1g71210-like isoform X3
            [Arachis ipaensis]
 ref|XP_020962747.1| pentatricopeptide repeat-containing protein At1g71210-like isoform X4
            [Arachis ipaensis]
          Length = 668

 Score =  263 bits (672), Expect = 3e-79
 Identities = 135/251 (53%), Positives = 178/251 (70%)
 Frame = +2

Query: 2    FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181
            F+LM+RN I+P  SS  LML +YLKSGR  DA  FF ++   +G+ +++LYN  ++ L K
Sbjct: 411  FELMQRNGIQPTSSSLSLMLKAYLKSGRTYDALIFFSNV-WSRGLATKKLYNCFVIFLCK 469

Query: 182  SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361
            S  P+ A +F  +ML  GLNPSI+ YE+L+Q+LCS +RYHEA+NLV++Y +MGRRLTSFL
Sbjct: 470  SKNPEPAYRFFLQMLEDGLNPSIECYEILVQELCSSERYHEAVNLVDMYEKMGRRLTSFL 529

Query: 362  GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541
            GNILLY S+ +P VYN CVR RG +E   +  S+L F++GAF    RV+H +++LE+LIA
Sbjct: 530  GNILLYRSMFSPEVYNACVRLRGVKEEGKTDWSMLSFVVGAFRRHHRVSH-VEDLEKLIA 588

Query: 542  ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721
             CFP DIYTYNLLL KA   D  QAC+LF R+ Q  +G+EPN W Y  MV GF  HG +D
Sbjct: 589  KCFPLDIYTYNLLLRKAWKSDKEQACQLFERMCQ--RGFEPNQWIYSTMVDGFERHGMRD 646

Query: 722  DAKHWIGEMNR 754
             A+ W  E  R
Sbjct: 647  KAERWFKESKR 657


>ref|XP_020982874.1| pentatricopeptide repeat-containing protein At1g71210-like isoform X3
            [Arachis duranensis]
          Length = 733

 Score =  264 bits (674), Expect = 6e-79
 Identities = 136/251 (54%), Positives = 179/251 (71%)
 Frame = +2

Query: 2    FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181
            F+LM+RN I+P  SS +LML +YL+SGR  DA NFF ++   +G+ +R+LYN ++V L K
Sbjct: 472  FELMQRNGIQPTSSSLILMLKAYLESGRTYDALNFFNNV-WSRGLATRKLYNCLVVSLCK 530

Query: 182  SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361
            S  P  A   L +ML  G +PSI+ YE L+Q+LCS KRYHEA++LVNVY +MGRRLTS L
Sbjct: 531  SKNPGPAYLLLQQMLRDGFHPSIECYENLVQELCSSKRYHEAVDLVNVYEKMGRRLTSSL 590

Query: 362  GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541
            GNILL  S+++  VYN CVR RG +E   +  S+L F++GAF    RV+H +++LE+L A
Sbjct: 591  GNILLSQSMSSLEVYNACVRLRGVKEEGKTDWSMLSFVVGAFYDHHRVSH-VEDLEKLTA 649

Query: 542  ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721
             CFP DIYTYNLLL KA   DMGQACELF R+ +  +G+EPN WTY I+V GF  HG +D
Sbjct: 650  KCFPLDIYTYNLLLRKACKSDMGQACELFERMRR--RGFEPNRWTYTILVYGFKRHGTRD 707

Query: 722  DAKHWIGEMNR 754
            +A+ W  E  R
Sbjct: 708  EAERWFQESKR 718


>ref|XP_020982873.1| pentatricopeptide repeat-containing protein At1g71210-like isoform X2
            [Arachis duranensis]
          Length = 753

 Score =  264 bits (674), Expect = 8e-79
 Identities = 136/251 (54%), Positives = 179/251 (71%)
 Frame = +2

Query: 2    FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181
            F+LM+RN I+P  SS +LML +YL+SGR  DA NFF ++   +G+ +R+LYN ++V L K
Sbjct: 496  FELMQRNGIQPTSSSLILMLKAYLESGRTYDALNFFNNV-WSRGLATRKLYNCLVVSLCK 554

Query: 182  SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361
            S  P  A   L +ML  G +PSI+ YE L+Q+LCS KRYHEA++LVNVY +MGRRLTS L
Sbjct: 555  SKNPGPAYLLLQQMLRDGFHPSIECYENLVQELCSSKRYHEAVDLVNVYEKMGRRLTSSL 614

Query: 362  GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541
            GNILL  S+++  VYN CVR RG +E   +  S+L F++GAF    RV+H +++LE+L A
Sbjct: 615  GNILLSQSMSSLEVYNACVRLRGVKEEGKTDWSMLSFVVGAFYDHHRVSH-VEDLEKLTA 673

Query: 542  ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721
             CFP DIYTYNLLL KA   DMGQACELF R+ +  +G+EPN WTY I+V GF  HG +D
Sbjct: 674  KCFPLDIYTYNLLLRKACKSDMGQACELFERMRR--RGFEPNRWTYTILVYGFKRHGTRD 731

Query: 722  DAKHWIGEMNR 754
            +A+ W  E  R
Sbjct: 732  EAERWFQESKR 742

