BLASTX nr result
ID: Astragalus24_contig00005788
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00005788 (948 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_012568596.1| PREDICTED: pentatricopeptide repeat-containi...   347   e-109
dbj|GAU32154.1| hypothetical protein TSUD_68250 [Trifolium subte...   345   e-109
ref|XP_003617156.2| PPR containing plant-like protein [Medicago ...   335   e-105
gb|PNY12861.1| PPR containing plant-like protein [Trifolium prat...   336   e-103
ref|XP_019434547.1| PREDICTED: pentatricopeptide repeat-containi...   325   e-101
gb|PNY12930.1| pentatricopeptide repeat-containing protein [Trif...   323   e-100
dbj|GAU21648.1| hypothetical protein TSUD_251310 [Trifolium subt...   321   e-100
ref|XP_003617158.1| PPR containing plant-like protein [Medicago ...   319   6e-98
gb|KYP73199.1| Pentatricopeptide repeat-containing protein At1g7...   313   1e-96
ref|XP_020204964.1| pentatricopeptide repeat-containing protein ...   313   2e-96
gb|KHN26011.1| Pentatricopeptide repeat-containing protein [Glyc...   292   5e-91
ref|XP_007141545.1| hypothetical protein PHAVU_008G205300g [Phas...   298   7e-91
ref|XP_006595790.1| PREDICTED: pentatricopeptide repeat-containi...   292   1e-88
gb|KHN38419.1| Pentatricopeptide repeat-containing protein [Glyc...   284   2e-87
ref|XP_003518473.1| PREDICTED: pentatricopeptide repeat-containi...   284   1e-85
ref|XP_017430727.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   283   5e-85
ref|XP_016187228.1| pentatricopeptide repeat-containing protein ...
269 7e-81 ref|XP_020962746.1| pentatricopeptide repeat-containing protein ... 263 3e-79 ref|XP_020982874.1| pentatricopeptide repeat-containing protein ... 264 6e-79 ref|XP_020982873.1| pentatricopeptide repeat-containing protein ... 264 8e-79 >ref|XP_012568596.1| PREDICTED: pentatricopeptide repeat-containing protein At1g71210 [Cicer arietinum] Length = 874 Score = 347 bits (891), Expect = e-109 Identities = 172/255 (67%), Positives = 209/255 (81%) Frame = +2 Query: 2 FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181 F+LM RNNI PN+SS++L+LNSY+KSGR DA FF S+R QG +SRRLY+SMI GL K Sbjct: 610 FELMLRNNIAPNVSSQILLLNSYVKSGRLADALTFFNSIR-HQGAVSRRLYDSMIQGLCK 668 Query: 182 SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361 SNK D+A LFEML AGLNP I YE L+QKLCSLKRYHEAINLV++Y++ GRRLTS+L Sbjct: 669 SNKVDIAHNLLFEMLKAGLNPGIGCYENLVQKLCSLKRYHEAINLVHMYLKTGRRLTSYL 728 Query: 362 GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541 GNILL+HSL + VY+TC++SRGA+EGE S IS L F+IGAFSG LRVNHSI+ELE+LIA Sbjct: 729 GNILLFHSLQSQEVYDTCIQSRGAKEGESSAISTLSFVIGAFSGCLRVNHSIEELEKLIA 788 Query: 542 ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721 +CFP D+YTYNLLL + +DFD QA LF RI Q +GYEPNGWTY+IMV GF N+GR+D Sbjct: 789 LCFPLDLYTYNLLLRRVSDFDFDQALRLFDRIRQ--RGYEPNGWTYNIMVHGFANNGRRD 846 Query: 722 DAKHWIGEMNRKGFH 766 +AK W EM++KGF+ Sbjct: 847 EAKQWSEEMSQKGFY 861 >dbj|GAU32154.1| hypothetical protein TSUD_68250 [Trifolium subterraneum] Length = 850 Score = 345 bits (885), Expect = e-109 Identities = 170/260 (65%), Positives = 212/260 (81%) Frame = +2 Query: 2 FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181 ++LMRR+N+ P + S+ L+LNSYLKSG+ A +FF SLR QG++S++LY SM++GL K Sbjct: 589 YELMRRSNMVPTIVSQALVLNSYLKSGKIFYALSFFDSLR-RQGVVSKKLYTSMVIGLCK 647 Query: 182 SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361 +NK ++A FLFEMLNAGLN I+ YE L+QKLCSLK+YHEAINLV VYM+ GRRLTSFL Sbjct: 648 NNKANIAHDFLFEMLNAGLNTGIECYESLVQKLCSLKKYHEAINLVQVYMKTGRRLTSFL 707 Query: 
362 GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541 GNILL+HSL +P++Y CV+ GA+EGE S IS L F+IGAFSG LRVNHS++ELEELI Sbjct: 708 GNILLFHSLMSPDLYEICVQMGGAKEGESSPISTLNFVIGAFSGCLRVNHSVEELEELIV 767 Query: 542 ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721 CFP D+ TYNLLL K T++DM QACELF+RI Q +GYEPNGWTY+IMV GF NHGR D Sbjct: 768 TCFPLDMCTYNLLLRKITNYDMNQACELFNRICQ--RGYEPNGWTYNIMVNGFSNHGRND 825 Query: 722 DAKHWIGEMNRKGFHPKENT 781 +AK W+ EM++KGF+P ENT Sbjct: 826 EAKQWVEEMHQKGFYPTENT 845 >ref|XP_003617156.2| PPR containing plant-like protein [Medicago truncatula] gb|AET00115.2| PPR containing plant-like protein [Medicago truncatula] Length = 879 Score = 335 bits (858), Expect = e-105 Identities = 166/266 (62%), Positives = 210/266 (78%) Frame = +2 Query: 2 FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181 ++LM RNNI P+ SS+ L+L YLKSG+ +DA NFF SLR QG +S+++Y S+I L K Sbjct: 609 YELMLRNNIVPSSSSQRLVLIGYLKSGKISDALNFFHSLR-RQGTVSKKVYQSIIFALCK 667 Query: 182 SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361 S K D+A FLF+M AGLNPSI+ +E+L+Q LCSL+RYHEAINLV+VY++MGRRLT+FL Sbjct: 668 SCKADIAHDFLFQMFKAGLNPSIECFEILVQTLCSLERYHEAINLVHVYIKMGRRLTNFL 727 Query: 362 GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541 GNILL HSL +P++Y+ CVR RGA+E ECS +S L FIIGAFS LRVN S++ELE+LI+ Sbjct: 728 GNILLSHSLISPDIYHACVRLRGAKEEECSPMSTLSFIIGAFSRCLRVNPSVEELEKLIS 787 Query: 542 ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721 CFP D YTYN LL + T +DM QACELF+RI Q +GYEPN WTY+IMV GF NHGR D Sbjct: 788 TCFPLDFYTYNQLLRRVTQYDMNQACELFNRIRQ--RGYEPNDWTYNIMVSGFSNHGRND 845 Query: 722 DAKHWIGEMNRKGFHPKENTLSMYQK 799 +AK W+ EM++KGF+P+ENT QK Sbjct: 846 EAKQWVEEMHQKGFYPRENTKRNVQK 871 >gb|PNY12861.1| PPR containing plant-like protein [Trifolium pratense] Length = 1080 Score = 336 bits (861), Expect = e-103 Identities = 172/259 (66%), Positives = 205/259 (79%) Frame = +2 Query: 2 FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181 +DLM R NI P + 
S+ L+L SYLKSGR A FF SLR QG++S++LY SM++GL K Sbjct: 819 YDLMVRCNIVPTIISQALVLISYLKSGRIHYALKFFDSLR-RQGVVSKKLYTSMVIGLCK 877 Query: 182 SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361 +NK D+A FLFEMLNA LNP I+ YE L+QKLCSLKRY EAINLV VYM+ GRRLTSFL Sbjct: 878 NNKADIARDFLFEMLNAKLNPGIECYESLVQKLCSLKRYDEAINLVQVYMKRGRRLTSFL 937 Query: 362 GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541 GNI L HSLT+P+VY+ CV+ AEEGE S IS L F+IGAFSGRLRVN SI+ELEELIA Sbjct: 938 GNIFLCHSLTSPDVYDICVQIGRAEEGESSPISTLSFVIGAFSGRLRVNRSIEELEELIA 997 Query: 542 ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721 +CFP D YTYNLLL + T++DM QACEL +RI Q +GYEPN WTY+IMV GF NHGR D Sbjct: 998 MCFPLDTYTYNLLLRRITNYDMNQACELVNRICQ--RGYEPNDWTYNIMVDGFKNHGRND 1055 Query: 722 DAKHWIGEMNRKGFHPKEN 778 +AK W+ EM++KGF+P EN Sbjct: 1056 EAKQWVEEMHKKGFYPIEN 1074 >ref|XP_019434547.1| PREDICTED: pentatricopeptide repeat-containing protein At1g71210 [Lupinus angustifolius] gb|OIW16274.1| hypothetical protein TanjilG_18989 [Lupinus angustifolius] Length = 869 Score = 325 bits (834), Expect = e-101 Identities = 159/260 (61%), Positives = 209/260 (80%) Frame = +2 Query: 2 FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181 F+LM+RN ++P++SS+VLML SYLKS ++A FF +LRC QG++SR+L+N++IVGL K Sbjct: 607 FELMQRNGVEPDMSSQVLMLKSYLKSESISEALTFFHNLRC-QGIVSRKLFNTLIVGLCK 665 Query: 182 SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361 SNK D+A +FLFEM+ A LNPSI+ YEVL+Q+LCS +RY +AI++VN+Y +MGRRLTSF+ Sbjct: 666 SNKVDIAREFLFEMIKAELNPSIECYEVLVQQLCSSQRYRDAIHVVNLYEKMGRRLTSFI 725 Query: 362 GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541 GN+LLYHSL + +Y+ C + RG +GE SG S+L IIGAFSG LRVNH I++LEELI+ Sbjct: 726 GNVLLYHSLISRELYDACAQLRGVGDGEFSGSSMLTLIIGAFSGHLRVNHFIEDLEELIS 785 Query: 542 ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721 CFP DIYTYNLLL KA+ DM QA ELF R+ Q +GYEPN WTYD+MV GF HGR++ Sbjct: 786 KCFPLDIYTYNLLLRKASHGDMDQAFELFGRMCQ--RGYEPNWWTYDVMVHGFSKHGRQN 843 Query: 722 
DAKHWIGEMNRKGFHPKENT 781 +AK W+ EM+ KG +PKE+T Sbjct: 844 EAKRWVEEMSHKGLYPKEST 863 >gb|PNY12930.1| pentatricopeptide repeat-containing protein [Trifolium pratense] Length = 893 Score = 323 bits (827), Expect = e-100 Identities = 168/274 (61%), Positives = 202/274 (73%), Gaps = 14/274 (5%) Frame = +2 Query: 2 FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181 ++LM NNI P + S+ L+LNSYL S R ++A NFF SLR DQG++S+RLY+SMI GL K Sbjct: 613 YELMLLNNIVPTIVSQALLLNSYLGSERISEALNFFYSLR-DQGVVSKRLYSSMINGLCK 671 Query: 182 SNKPDMAL--------------QFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLV 319 NK D+A + L +MLNAGLNP I+ YE L+QKLCSLKRY EAINLV Sbjct: 672 HNKADIACDKSDIAHDKADIARRILVDMLNAGLNPGIECYENLVQKLCSLKRYPEAINLV 731 Query: 320 NVYMRMGRRLTSFLGNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRL 499 VYM+MGRRLTSFLGNILL+HSL TPNVY+TCV+ RG +EGE S S L +IG FS L Sbjct: 732 QVYMKMGRRLTSFLGNILLFHSLITPNVYHTCVKMRGEKEGESSPFSTLTVVIGVFSDCL 791 Query: 500 RVNHSIKELEELIAICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTY 679 +VNHSI EEL+A+CFP DIYTYNLLL + T +DM QACELF+RI Q +GYEPNGWTY Sbjct: 792 KVNHSI---EELVALCFPLDIYTYNLLLRRTTSYDMNQACELFNRIRQ--RGYEPNGWTY 846 Query: 680 DIMVVGFLNHGRKDDAKHWIGEMNRKGFHPKENT 781 DIMV GF HGR + K W+ EM+ +GF+P E T Sbjct: 847 DIMVHGFSKHGRNYETKQWLEEMHHEGFYPTETT 880 >dbj|GAU21648.1| hypothetical protein TSUD_251310 [Trifolium subterraneum] Length = 835 Score = 321 bits (823), Expect = e-100 Identities = 167/263 (63%), Positives = 200/263 (76%), Gaps = 3/263 (1%) Frame = +2 Query: 2 FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181 ++LM R+NI P + S+ L+LNSYL S R DA NFF SLR QG++S+RLY+SMI+GL K Sbjct: 575 YELMLRSNIVPTIVSQSLLLNSYLGSERIYDALNFFNSLR-RQGVVSKRLYSSMIIGLCK 633 Query: 182 SNKPDMAL---QFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLT 352 N DMA LF+MLNAGLNP I+ YE L+Q LCSL++Y EAINLV VYM+ GRRLT Sbjct: 634 HNMDDMAHIAHDILFDMLNAGLNPGIECYESLVQTLCSLEKYREAINLVQVYMKTGRRLT 693 Query: 353 SFLGNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEE 532 SFLGNILL+HS +VY+TCV+ RGA+EGE S S L 
+IG F+G +RVNHSI EE Sbjct: 694 SFLGNILLFHS---SDVYHTCVQMRGAKEGESSPFSTLTAVIGVFTGCVRVNHSI---EE 747 Query: 533 LIAICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHG 712 LIA+CFP DIYTYNLLL + T +DM QACELF+RIHQ +GYEPN WTYDIMV F HG Sbjct: 748 LIALCFPLDIYTYNLLLRRKTSYDMNQACELFNRIHQ--RGYEPNRWTYDIMVHAFAKHG 805 Query: 713 RKDDAKHWIGEMNRKGFHPKENT 781 RKD+AK W+ EM+ KGFHP E T Sbjct: 806 RKDEAKQWVNEMHHKGFHPTETT 828 >ref|XP_003617158.1| PPR containing plant-like protein [Medicago truncatula] gb|AET00117.1| PPR containing plant-like protein [Medicago truncatula] Length = 978 Score = 319 bits (817), Expect = 6e-98 Identities = 161/260 (61%), Positives = 202/260 (77%) Frame = +2 Query: 2 FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181 ++LM RNNI P L S+ L+LNSYL++G+ DA NFF SLR G++S++LY SM++GL K Sbjct: 621 YELMPRNNIVPTLLSQRLVLNSYLRNGKIIDALNFFNSLR-RLGVVSKKLYCSMVIGLCK 679 Query: 182 SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361 SNK D+A FLFEMLNAG+NP I+ +E L+ KLCSL+RYH+AINLV VYM+ GRRLTSFL Sbjct: 680 SNKVDIAHDFLFEMLNAGVNPDIECFESLVWKLCSLRRYHKAINLVQVYMKGGRRLTSFL 739 Query: 362 GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541 GN LL+HS +P+VY V RGAEEGE S IS L F+IGAFSG L VN SI+ELE+LIA Sbjct: 740 GNTLLWHSSLSPDVYGILVHLRGAEEGENSPISTLSFVIGAFSGCLSVNRSIEELEKLIA 799 Query: 542 ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721 +CFP D +TYN LL + +DM QACELF+R+ Q +G +PNGWTYD MV GFLNHGR D Sbjct: 800 MCFPLDTHTYNQLLRRVASYDMNQACELFNRMCQ--RGCKPNGWTYDFMVRGFLNHGRND 857 Query: 722 DAKHWIGEMNRKGFHPKENT 781 +AK W+ EM++KGF ++T Sbjct: 858 EAKQWVEEMHQKGFDLTDST 877 >gb|KYP73199.1| Pentatricopeptide repeat-containing protein At1g71210 family [Cajanus cajan] Length = 873 Score = 313 bits (802), Expect = 1e-96 Identities = 153/260 (58%), Positives = 198/260 (76%), Gaps = 1/260 (0%) Frame = +2 Query: 2 FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181 F+LM+RN I PNL S + ML YLKSGR +DA NFF +R +G+ +RLYN++I+GL K Sbjct: 604 
FELMQRNGITPNLCSCIFMLQGYLKSGRISDALNFFNDVRL-RGLAGKRLYNALIIGLCK 662 Query: 182 SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361 SNK D+A + LF ML GLNPS++ YE+++QKLCSL+RYHEA+++VNVY +MGR LTSF+ Sbjct: 663 SNKADIAREMLFSMLRVGLNPSVECYELVVQKLCSLRRYHEAMHIVNVYEKMGRPLTSFI 722 Query: 362 GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541 GN+LLYHSL +P +Y+TCV RG EEG SG S+L +IGAFSG +RV H +K+LE+LI Sbjct: 723 GNVLLYHSLISPQLYDTCVHLRGVEEGGFSGNSLLTLMIGAFSGCVRVRHYVKDLEQLIE 782 Query: 542 ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721 CFP DI+TYNLLL + DM AC LF RI Q +GYEPN WTYDIM+ GF +HGR+D Sbjct: 783 KCFPLDIFTYNLLLKQVAKSDMNIACMLFGRICQ--RGYEPNCWTYDIMIRGFSDHGRRD 840 Query: 722 DAKHWIGEMNRKGF-HPKEN 778 AK W+ +M R+GF H ++N Sbjct: 841 KAKRWLEKMFRRGFYHDRQN 860 >ref|XP_020204964.1| pentatricopeptide repeat-containing protein At1g71210 [Cajanus cajan] Length = 885 Score = 313 bits (802), Expect = 2e-96 Identities = 153/260 (58%), Positives = 198/260 (76%), Gaps = 1/260 (0%) Frame = +2 Query: 2 FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181 F+LM+RN I PNL S + ML YLKSGR +DA NFF +R +G+ +RLYN++I+GL K Sbjct: 616 FELMQRNGITPNLCSCIFMLQGYLKSGRISDALNFFNDVRL-RGLAGKRLYNALIIGLCK 674 Query: 182 SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361 SNK D+A + LF ML GLNPS++ YE+++QKLCSL+RYHEA+++VNVY +MGR LTSF+ Sbjct: 675 SNKADIAREMLFSMLRVGLNPSVECYELVVQKLCSLRRYHEAMHIVNVYEKMGRPLTSFI 734 Query: 362 GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541 GN+LLYHSL +P +Y+TCV RG EEG SG S+L +IGAFSG +RV H +K+LE+LI Sbjct: 735 GNVLLYHSLISPQLYDTCVHLRGVEEGGFSGNSLLTLMIGAFSGCVRVRHYVKDLEQLIE 794 Query: 542 ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721 CFP DI+TYNLLL + DM AC LF RI Q +GYEPN WTYDIM+ GF +HGR+D Sbjct: 795 KCFPLDIFTYNLLLKQVAKSDMNIACMLFGRICQ--RGYEPNCWTYDIMIRGFSDHGRRD 852 Query: 722 DAKHWIGEMNRKGF-HPKEN 778 AK W+ +M R+GF H ++N Sbjct: 853 KAKRWLEKMFRRGFYHDRQN 872 >gb|KHN26011.1| Pentatricopeptide repeat-containing protein 
[Glycine soja] Length = 599 Score = 292 bits (747), Expect = 5e-91 Identities = 145/255 (56%), Positives = 194/255 (76%) Frame = +2 Query: 2 FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181 F+LM+RN IKPNLSS +LML+ YL SGR +DA NFF +R QG+ +++LY ++I GL K Sbjct: 344 FELMQRNGIKPNLSSLILMLHVYLLSGRISDALNFFNGVR-RQGLATKKLYVALITGLCK 402 Query: 182 SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361 NK D++ ++ F ML GLNPS++ YE+L+QKLCSL++Y EAI+++NV +MGR ++SF+ Sbjct: 403 FNKIDISREYFFSMLRVGLNPSLECYELLVQKLCSLQKYSEAIHIINVSQKMGRPVSSFI 462 Query: 362 GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541 GN+LLYHSL +P +Y+TC RGAEEG SG S L ++IGAFSGRLRV+H I +LE L+ Sbjct: 463 GNVLLYHSLISPQLYDTCNYLRGAEEGVFSGNSTLCWMIGAFSGRLRVSHYIADLERLVE 522 Query: 542 ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721 CFPP+I+TYNLLL + DM +A LF R+ Q +GY+PN WTYDIMV GF HGRK Sbjct: 523 RCFPPNIFTYNLLLKQVAKSDMDKARLLFARMCQ--RGYQPNCWTYDIMVRGFSIHGRKH 580 Query: 722 DAKHWIGEMNRKGFH 766 +A+ W+ EM RKGF+ Sbjct: 581 EARRWLEEMFRKGFY 595 >ref|XP_007141545.1| hypothetical protein PHAVU_008G205300g [Phaseolus vulgaris] ref|XP_007141546.1| hypothetical protein PHAVU_008G205300g [Phaseolus vulgaris] gb|ESW13539.1| hypothetical protein PHAVU_008G205300g [Phaseolus vulgaris] gb|ESW13540.1| hypothetical protein PHAVU_008G205300g [Phaseolus vulgaris] Length = 875 Score = 298 bits (763), Expect = 7e-91 Identities = 145/255 (56%), Positives = 192/255 (75%) Frame = +2 Query: 2 FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181 F+LM+R+ ++PNL S++ +L YL SGR DA NFF +R +QG+ + LY +++ GL K Sbjct: 617 FELMQRSGVEPNLLSRIFVLRGYLFSGRIADALNFFNVVR-NQGLARKALYTTLVSGLCK 675 Query: 182 SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361 SN+ DM+L+F F M GL P ++ +E+L+QKLCSL+RYHEAI++VN Y +MGR ++SF+ Sbjct: 676 SNRIDMSLEFFFTMFRVGLYPGLECFELLVQKLCSLRRYHEAIHIVNAYEKMGRPVSSFI 735 Query: 362 GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541 GN+LLYHSL +P +YNTCV +G EEG SG S L 
+IGAFSG LRV+H I +LE+LI Sbjct: 736 GNVLLYHSLISPQLYNTCVHLKGVEEGGFSGNSALSLVIGAFSGCLRVSHYISDLEQLIE 795 Query: 542 ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721 CFPPDI+TYNLLL + + DM +A LF RI Q KGY+P+ WTYDIMV GF NHGRKD Sbjct: 796 KCFPPDIFTYNLLLKELSKSDMDKARLLFARICQ--KGYKPDDWTYDIMVRGFSNHGRKD 853 Query: 722 DAKHWIGEMNRKGFH 766 +AK W+ EM RKGF+ Sbjct: 854 EAKQWLEEMLRKGFY 868 >ref|XP_006595790.1| PREDICTED: pentatricopeptide repeat-containing protein At1g71210-like [Glycine max] ref|XP_006595792.1| PREDICTED: pentatricopeptide repeat-containing protein At1g71210-like [Glycine max] ref|XP_014622675.1| PREDICTED: pentatricopeptide repeat-containing protein At1g71210-like [Glycine max] ref|XP_014622676.1| PREDICTED: pentatricopeptide repeat-containing protein At1g71210-like [Glycine max] gb|KRH14653.1| hypothetical protein GLYMA_14G039600 [Glycine max] Length = 868 Score = 292 bits (747), Expect = 1e-88 Identities = 145/255 (56%), Positives = 194/255 (76%) Frame = +2 Query: 2 FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181 F+LM+RN IKPNLSS +LML+ YL SGR +DA NFF +R QG+ +++LY ++I GL K Sbjct: 613 FELMQRNGIKPNLSSLILMLHVYLLSGRISDALNFFNGVR-RQGLATKKLYVALITGLCK 671 Query: 182 SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361 NK D++ ++ F ML GLNPS++ YE+L+QKLCSL++Y EAI+++NV +MGR ++SF+ Sbjct: 672 FNKIDISREYFFSMLRVGLNPSLECYELLVQKLCSLQKYSEAIHIINVSQKMGRPVSSFI 731 Query: 362 GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541 GN+LLYHSL +P +Y+TC RGAEEG SG S L ++IGAFSGRLRV+H I +LE L+ Sbjct: 732 GNVLLYHSLISPQLYDTCNYLRGAEEGVFSGNSTLCWMIGAFSGRLRVSHYIADLERLVE 791 Query: 542 ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721 CFPP+I+TYNLLL + DM +A LF R+ Q +GY+PN WTYDIMV GF HGRK Sbjct: 792 RCFPPNIFTYNLLLKQVAKSDMDKARLLFARMCQ--RGYQPNCWTYDIMVRGFSIHGRKH 849 Query: 722 DAKHWIGEMNRKGFH 766 +A+ W+ EM RKGF+ Sbjct: 850 EARRWLEEMFRKGFY 864 >gb|KHN38419.1| Pentatricopeptide repeat-containing protein [Glycine soja] Length = 662 Score = 284 
bits (727), Expect = 2e-87 Identities = 139/251 (55%), Positives = 188/251 (74%) Frame = +2 Query: 2 FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181 F+LM+RN I PN+ S +LM+N YL SGR +DA NFF ++ +G+ +++LY ++I GL K Sbjct: 414 FELMQRNGITPNMCSLILMMNGYLISGRISDALNFFNDVQ-RRGLATKKLYVALITGLCK 472 Query: 182 SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361 SNK D++ ++ F ML GLNPS++ YE+L+QKLCSL+RY EA++++NV +MGR ++SF+ Sbjct: 473 SNKVDISREYFFRMLRVGLNPSLECYELLVQKLCSLQRYSEAMHIINVSQKMGRPVSSFI 532 Query: 362 GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541 GN+LLYHSL +P +Y+TCV RG EEG SG S L +IGAFSGRLRV+H I +LE LI Sbjct: 533 GNVLLYHSLISPQLYDTCVNLRGVEEGVFSGNSTLCLMIGAFSGRLRVSHYITDLERLIE 592 Query: 542 ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721 CFPP+I+TYNLLL + DM +A LF R+ Q +GY+PN WTYDIMV GF HGR D Sbjct: 593 KCFPPNIFTYNLLLKQVARSDMDKARLLFARMCQ--RGYQPNSWTYDIMVRGFSIHGRND 650 Query: 722 DAKHWIGEMNR 754 +A+ W+ EM R Sbjct: 651 EARRWLKEMFR 661 >ref|XP_003518473.1| PREDICTED: pentatricopeptide repeat-containing protein At1g71210-like [Glycine max] gb|KRH73491.1| hypothetical protein GLYMA_02G276200 [Glycine max] Length = 872 Score = 284 bits (727), Expect = 1e-85 Identities = 139/251 (55%), Positives = 188/251 (74%) Frame = +2 Query: 2 FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181 F+LM+RN I PN+ S +LM+N YL SGR +DA NFF ++ +G+ +++LY ++I GL K Sbjct: 624 FELMQRNGITPNMCSLILMMNGYLISGRISDALNFFNDVQ-RRGLATKKLYVALITGLCK 682 Query: 182 SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361 SNK D++ ++ F ML GLNPS++ YE+L+QKLCSL+RY EA++++NV +MGR ++SF+ Sbjct: 683 SNKVDISREYFFRMLRVGLNPSLECYELLVQKLCSLQRYSEAMHIINVSQKMGRPVSSFI 742 Query: 362 GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541 GN+LLYHSL +P +Y+TCV RG EEG SG S L +IGAFSGRLRV+H I +LE LI Sbjct: 743 GNVLLYHSLISPQLYDTCVNLRGVEEGVFSGNSTLCLMIGAFSGRLRVSHYITDLERLIE 802 Query: 542 ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721 CFPP+I+TYNLLL 
+ DM +A LF R+ Q +GY+PN WTYDIMV GF HGR D Sbjct: 803 KCFPPNIFTYNLLLKQVARSDMDKARLLFARMCQ--RGYQPNSWTYDIMVRGFSIHGRND 860 Query: 722 DAKHWIGEMNR 754 +A+ W+ EM R Sbjct: 861 EARRWLKEMFR 871 >ref|XP_017430727.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g71210 [Vigna angularis] Length = 875 Score = 283 bits (723), Expect = 5e-85 Identities = 144/254 (56%), Positives = 184/254 (72%) Frame = +2 Query: 2 FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181 F+LM+ N ++PNL+S+VL+L YL+SGR +DA +FF +R QG+ +RLY +++ GL K Sbjct: 618 FELMQINGVEPNLNSRVLVLRGYLRSGRISDALSFFNVVR-GQGLECKRLYTTLVNGLCK 676 Query: 182 SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361 N+ DM+L F M GLNPS++ YE+L+Q+LCSL+RY EAI +VN Y +MGR ++SF+ Sbjct: 677 CNRIDMSLGFFLSMFRVGLNPSLECYELLVQELCSLRRYQEAIRIVNAYEKMGRPISSFM 736 Query: 362 GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541 GN LL HSL +P +Y+TCV RG EGE S S L +IGAFSG LRV H I +LE LI Sbjct: 737 GNQLLQHSLISPKLYDTCVYLRGVGEGEFSANSTLNLVIGAFSGCLRVTHYISDLERLIE 796 Query: 542 ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721 CFPPDI+TYNLLL + + DM +A LF RI Q KGYEP+GWTY IMV GF NHGRKD Sbjct: 797 KCFPPDIFTYNLLLKELSKSDMDKARLLFGRICQ--KGYEPDGWTYHIMVRGFSNHGRKD 854 Query: 722 DAKHWIGEMNRKGF 763 +AK W EM R GF Sbjct: 855 EAKRWNKEMLRTGF 868 >ref|XP_016187228.1| pentatricopeptide repeat-containing protein At1g71210 isoform X2 [Arachis ipaensis] Length = 753 Score = 269 bits (688), Expect = 7e-81 Identities = 138/251 (54%), Positives = 180/251 (71%) Frame = +2 Query: 2 FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181 F+LM+RN I+P SS +LML +YL+SGR DA NFF ++ +G+ SR+LYN ++V L K Sbjct: 496 FELMQRNGIQPTSSSLILMLKAYLESGRTYDALNFFNNV-WSRGLASRKLYNCLVVSLCK 554 Query: 182 SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361 S P+ A L +ML G +PSI+ YE L+ +LCS KRYHEA+NLVNVY +MGR+LTSFL Sbjct: 555 SKNPEPAYLLLQQMLRDGFHPSIECYENLVLELCSSKRYHEAVNLVNVYEKMGRQLTSFL 614 Query: 362 
GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541 GN+LLY S+ +P VYN CVR RG +E + S+L F++GAF RV+H +++LEELIA Sbjct: 615 GNVLLYQSMFSPEVYNACVRLRGVKEEGKTDWSMLSFVVGAFYDHRRVSH-VEDLEELIA 673 Query: 542 ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721 CFP DIYTYNLLL KA DMGQA ELF R+ + +G+EPN WTY I+V GF H +D Sbjct: 674 KCFPLDIYTYNLLLRKACKSDMGQAYELFERMRR--RGFEPNRWTYTILVYGFKRHEMRD 731 Query: 722 DAKHWIGEMNR 754 +A+ W E R Sbjct: 732 EAERWFQESKR 742 >ref|XP_020962746.1| pentatricopeptide repeat-containing protein At1g71210-like isoform X3 [Arachis ipaensis] ref|XP_020962747.1| pentatricopeptide repeat-containing protein At1g71210-like isoform X4 [Arachis ipaensis] Length = 668 Score = 263 bits (672), Expect = 3e-79 Identities = 135/251 (53%), Positives = 178/251 (70%) Frame = +2 Query: 2 FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181 F+LM+RN I+P SS LML +YLKSGR DA FF ++ +G+ +++LYN ++ L K Sbjct: 411 FELMQRNGIQPTSSSLSLMLKAYLKSGRTYDALIFFSNV-WSRGLATKKLYNCFVIFLCK 469 Query: 182 SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361 S P+ A +F +ML GLNPSI+ YE+L+Q+LCS +RYHEA+NLV++Y +MGRRLTSFL Sbjct: 470 SKNPEPAYRFFLQMLEDGLNPSIECYEILVQELCSSERYHEAVNLVDMYEKMGRRLTSFL 529 Query: 362 GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541 GNILLY S+ +P VYN CVR RG +E + S+L F++GAF RV+H +++LE+LIA Sbjct: 530 GNILLYRSMFSPEVYNACVRLRGVKEEGKTDWSMLSFVVGAFRRHHRVSH-VEDLEKLIA 588 Query: 542 ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721 CFP DIYTYNLLL KA D QAC+LF R+ Q +G+EPN W Y MV GF HG +D Sbjct: 589 KCFPLDIYTYNLLLRKAWKSDKEQACQLFERMCQ--RGFEPNQWIYSTMVDGFERHGMRD 646 Query: 722 DAKHWIGEMNR 754 A+ W E R Sbjct: 647 KAERWFKESKR 657 >ref|XP_020982874.1| pentatricopeptide repeat-containing protein At1g71210-like isoform X3 [Arachis duranensis] Length = 733 Score = 264 bits (674), Expect = 6e-79 Identities = 136/251 (54%), Positives = 179/251 (71%) Frame = +2 Query: 2 FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181 
F+LM+RN I+P SS +LML +YL+SGR DA NFF ++ +G+ +R+LYN ++V L K Sbjct: 472 FELMQRNGIQPTSSSLILMLKAYLESGRTYDALNFFNNV-WSRGLATRKLYNCLVVSLCK 530 Query: 182 SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361 S P A L +ML G +PSI+ YE L+Q+LCS KRYHEA++LVNVY +MGRRLTS L Sbjct: 531 SKNPGPAYLLLQQMLRDGFHPSIECYENLVQELCSSKRYHEAVDLVNVYEKMGRRLTSSL 590 Query: 362 GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541 GNILL S+++ VYN CVR RG +E + S+L F++GAF RV+H +++LE+L A Sbjct: 591 GNILLSQSMSSLEVYNACVRLRGVKEEGKTDWSMLSFVVGAFYDHHRVSH-VEDLEKLTA 649 Query: 542 ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721 CFP DIYTYNLLL KA DMGQACELF R+ + +G+EPN WTY I+V GF HG +D Sbjct: 650 KCFPLDIYTYNLLLRKACKSDMGQACELFERMRR--RGFEPNRWTYTILVYGFKRHGTRD 707 Query: 722 DAKHWIGEMNR 754 +A+ W E R Sbjct: 708 EAERWFQESKR 718 >ref|XP_020982873.1| pentatricopeptide repeat-containing protein At1g71210-like isoform X2 [Arachis duranensis] Length = 753 Score = 264 bits (674), Expect = 8e-79 Identities = 136/251 (54%), Positives = 179/251 (71%) Frame = +2 Query: 2 FDLMRRNNIKPNLSSKVLMLNSYLKSGRFTDAWNFFRSLRCDQGMISRRLYNSMIVGLSK 181 F+LM+RN I+P SS +LML +YL+SGR DA NFF ++ +G+ +R+LYN ++V L K Sbjct: 496 FELMQRNGIQPTSSSLILMLKAYLESGRTYDALNFFNNV-WSRGLATRKLYNCLVVSLCK 554 Query: 182 SNKPDMALQFLFEMLNAGLNPSIDSYEVLMQKLCSLKRYHEAINLVNVYMRMGRRLTSFL 361 S P A L +ML G +PSI+ YE L+Q+LCS KRYHEA++LVNVY +MGRRLTS L Sbjct: 555 SKNPGPAYLLLQQMLRDGFHPSIECYENLVQELCSSKRYHEAVDLVNVYEKMGRRLTSSL 614 Query: 362 GNILLYHSLTTPNVYNTCVRSRGAEEGECSGISILIFIIGAFSGRLRVNHSIKELEELIA 541 GNILL S+++ VYN CVR RG +E + S+L F++GAF RV+H +++LE+L A Sbjct: 615 GNILLSQSMSSLEVYNACVRLRGVKEEGKTDWSMLSFVVGAFYDHHRVSH-VEDLEKLTA 673 Query: 542 ICFPPDIYTYNLLLTKATDFDMGQACELFHRIHQKGKGYEPNGWTYDIMVVGFLNHGRKD 721 CFP DIYTYNLLL KA DMGQACELF R+ + +G+EPN WTY I+V GF HG +D Sbjct: 674 KCFPLDIYTYNLLLRKACKSDMGQACELFERMRR--RGFEPNRWTYTILVYGFKRHGTRD 731 Query: 722 DAKHWIGEMNR 754 +A+ W E R Sbjct: 732 EAERWFQESKR 742
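
Every alignment in the report above is introduced by the same header pattern ("Length = ... Score = ... bits (...), Expect = ... Identities = a/b (p%), Positives = ... Frame = ..."), with an optional "Gaps = ..." field. The following is a minimal Python sketch, not part of the BLAST output, for pulling those fields out of the flattened text. The names `HEADER`, `parse_expect`, and `parse_headers` are hypothetical, and reading a mantissa-less E-value such as "e-109" as 1e-109 is an assumption about how this BLAST version abbreviates very small values.

```python
import re

# One flattened alignment header as it appears in this report, for example:
# "Length = 874 Score = 347 bits (891), Expect = e-109 Identities = 172/255 (67%),
#  Positives = 209/255 (81%) Frame = +2"
# The "Gaps" field is only present for gapped alignments, so it is optional here.
HEADER = re.compile(
    r"Length = (?P<length>\d+)\s+"
    r"Score = (?P<bits>\d+) bits \((?P<raw>\d+)\), "
    r"Expect = (?P<expect>\S+)\s+"
    r"Identities = (?P<ident>\d+)/(?P<aln>\d+) \(\d+%\), "
    r"Positives = (?P<pos>\d+)/\d+ \(\d+%\)"
    r"(?:, Gaps = (?P<gaps>\d+)/\d+ \(\d+%\))?\s+"
    r"Frame = (?P<frame>[+-]\d)"
)


def parse_expect(text):
    """Convert an E-value string to float.

    Assumption: a value printed without a mantissa ("e-109") means 1e-109.
    """
    return float("1" + text if text.startswith("e") else text)


def parse_headers(report):
    """Return one dict per alignment header found in the report text."""
    hits = []
    for m in HEADER.finditer(report):
        ident, aln = int(m["ident"]), int(m["aln"])
        hits.append({
            "subject_length": int(m["length"]),
            "bits": int(m["bits"]),
            "raw_score": int(m["raw"]),
            "expect": parse_expect(m["expect"]),
            "identities": ident,
            "alignment_length": aln,
            "percent_identity": round(100.0 * ident / aln, 1),
            "positives": int(m["pos"]),
            "gaps": int(m["gaps"]) if m["gaps"] else 0,
            "frame": int(m["frame"]),
        })
    return hits


if __name__ == "__main__":
    sample = (
        "Length = 874 Score = 347 bits (891), Expect = e-109 "
        "Identities = 172/255 (67%), Positives = 209/255 (81%) Frame = +2"
    )
    for hit in parse_headers(sample):
        print(hit)
```

The sketch only reads the per-alignment statistics. It does not attempt to rebuild the Query/Sbjct alignment rows, because the match line between them depends on column spacing that the flattened text no longer carries.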