BLASTX nr result
ID: Astragalus23_contig00029214
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00029214
(966 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

Sequences producing significant alignments:                        Score (bits)  E Value

gb|PNY13423.1| pentatricopeptide repeat-containing protein at4g2...   394   e-129
ref|XP_004515007.1| PREDICTED: pentatricopeptide repeat-containi...   393   e-129
ref|XP_012569772.1| PREDICTED: pentatricopeptide repeat-containi...   387   e-126
ref|XP_004511291.1| PREDICTED: pentatricopeptide repeat-containi...   384   e-125
ref|XP_003598903.2| PPR containing plant-like protein [Medicago ...   380   e-124
dbj|BAT93460.1| hypothetical protein VIGAN_07242700 [Vigna angul...   380   e-123
ref|XP_017425383.1| PREDICTED: pentatricopeptide repeat-containi...   379   e-123
ref|XP_014498722.2| pentatricopeptide repeat-containing protein ...   377   e-122
gb|KHN11425.1| Pentatricopeptide repeat-containing protein [Glyc...   364   e-121
ref|XP_019432613.1| PREDICTED: pentatricopeptide repeat-containi...   373   e-121
gb|KYP41047.1| Pentatricopeptide repeat-containing protein At4g2...   367   e-119
gb|KHN23162.1| Pentatricopeptide repeat-containing protein [Glyc...   366   e-119
ref|XP_020240563.1| pentatricopeptide repeat-containing protein ...   367   e-118
gb|KRG90809.1| hypothetical protein GLYMA_20G115000 [Glycine max]   366   e-118
ref|XP_006605814.1| PREDICTED: pentatricopeptide repeat-containi...   367   e-118
ref|XP_016204326.1| pentatricopeptide repeat-containing protein ...   364   e-118
ref|XP_015969135.1| pentatricopeptide repeat-containing protein ...
364 e-117 ref|XP_014632180.1| PREDICTED: pentatricopeptide repeat-containi... 364 e-117 ref|XP_007149018.1| hypothetical protein PHAVU_005G033500g [Phas... 360 e-116 dbj|GAU10008.1| hypothetical protein TSUD_120070 [Trifolium subt... 353 e-114 >gb|PNY13423.1| pentatricopeptide repeat-containing protein at4g20740-like protein [Trifolium pratense] Length = 721 Score = 394 bits (1012), Expect = e-129 Identities = 194/228 (85%), Positives = 212/228 (92%) Frame = +3 Query: 3 YLKEKGYVSIEIYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGE 182 +LKEK YVS+EIYNILMDSL G+++KALSL DEI GSDLKPD+ST++IAILCLVD GE Sbjct: 492 HLKEKSYVSVEIYNILMDSLRLSGDVEKALSLFDEIKGSDLKPDSSTFNIAILCLVDRGE 551 Query: 183 IKEACVYHNKIIEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCLGSVTSGPMEFKYS 362 IKEAC HNKIIE SCIPSV AY CLAKGLC+IGEIDEAMMLVRDCLG+VT+GPMEFKY Sbjct: 552 IKEACECHNKIIEMSCIPSVAAYCCLAKGLCEIGEIDEAMMLVRDCLGNVTNGPMEFKYC 611 Query: 363 LTVIHMCKANDAEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNLRER 542 LT+IH+CKANDAEKVIDVLNEMMQQGCPL +VVCSA+ISGMCK+GTIEEAR VFSNLRER Sbjct: 612 LTIIHICKANDAEKVIDVLNEMMQQGCPLGSVVCSAIISGMCKYGTIEEARNVFSNLRER 671 Query: 543 KLLTESDTIVYDELLIDHMKKKTADLVISGVKFFGLESKLKSKGCKLI 686 KLLTESDTIVYDE LIDHMKKKTADLVISG+KFFGLESKLKSKGCKL+ Sbjct: 672 KLLTESDTIVYDEFLIDHMKKKTADLVISGLKFFGLESKLKSKGCKLL 719 >ref|XP_004515007.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like [Cicer arietinum] Length = 720 Score = 393 bits (1010), Expect = e-129 Identities = 196/228 (85%), Positives = 210/228 (92%) Frame = +3 Query: 3 YLKEKGYVSIEIYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGE 182 +LKEKGYVS+EIYN+LMDSL GELKKALSL DEI GSD+KPD+STY+IAILCLVD GE Sbjct: 491 HLKEKGYVSVEIYNVLMDSLRLSGELKKALSLFDEIKGSDMKPDSSTYNIAILCLVDCGE 550 Query: 183 IKEACVYHNKIIEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCLGSVTSGPMEFKYS 362 I+EACV HNKIIE SCIPSV Y LAKGLC+IGEIDEAMMLVRDCLG+ TSGPMEFKY Sbjct: 551 IQEACVCHNKIIEMSCIPSVAVYHRLAKGLCEIGEIDEAMMLVRDCLGNATSGPMEFKYC 610 Query: 363 LTVIHMCKANDAEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNLRER 542 
LT+IH+CK NDAEKVIDVLNEMMQQG PL NVVCSA+ISGMCKHGTIEEARKVFSNLR R Sbjct: 611 LTLIHICKFNDAEKVIDVLNEMMQQGFPLCNVVCSAIISGMCKHGTIEEARKVFSNLRNR 670 Query: 543 KLLTESDTIVYDELLIDHMKKKTADLVISGVKFFGLESKLKSKGCKLI 686 KLLTESDTIVYDELLIDHMKKKTADLVISG+KFFGLESKLKSKGCKL+ Sbjct: 671 KLLTESDTIVYDELLIDHMKKKTADLVISGLKFFGLESKLKSKGCKLL 718 >ref|XP_012569772.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like [Cicer arietinum] ref|XP_012569773.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like [Cicer arietinum] ref|XP_012569774.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like [Cicer arietinum] ref|XP_012569775.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like [Cicer arietinum] ref|XP_012569776.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like [Cicer arietinum] Length = 720 Score = 387 bits (993), Expect = e-126 Identities = 193/228 (84%), Positives = 209/228 (91%) Frame = +3 Query: 3 YLKEKGYVSIEIYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGE 182 +LKEKGYVS+EIYN+LMDSL GE+KKALSL DEI GSD+KPD+STY+IAILCLV GE Sbjct: 491 HLKEKGYVSVEIYNVLMDSLRLSGEVKKALSLFDEIKGSDMKPDSSTYNIAILCLVARGE 550 Query: 183 IKEACVYHNKIIEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCLGSVTSGPMEFKYS 362 I+EACV HNKIIE SCIPSV Y LAKGLC+IGEIDEAMMLVRDCLG+ TSGPMEFKY Sbjct: 551 IQEACVCHNKIIEMSCIPSVAVYHRLAKGLCEIGEIDEAMMLVRDCLGNATSGPMEFKYC 610 Query: 363 LTVIHMCKANDAEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNLRER 542 LT+IH+CK NDAEKVIDVLNEMMQQG PL NVVCSA+ISGMCKHGTIEEARKVFSNLR+R Sbjct: 611 LTLIHICKFNDAEKVIDVLNEMMQQGFPLCNVVCSAIISGMCKHGTIEEARKVFSNLRDR 670 Query: 543 KLLTESDTIVYDELLIDHMKKKTADLVISGVKFFGLESKLKSKGCKLI 686 KLLTESDTIVYDELLIDHMKKKTADLVISG+KFFGLESKLK KGCKL+ Sbjct: 671 KLLTESDTIVYDELLIDHMKKKTADLVISGLKFFGLESKLKLKGCKLL 718 >ref|XP_004511291.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like [Cicer arietinum] ref|XP_004511445.2| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like [Cicer arietinum] 
ref|XP_012574365.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like [Cicer arietinum] ref|XP_012574366.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like [Cicer arietinum] ref|XP_012574367.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like [Cicer arietinum] ref|XP_012574368.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like [Cicer arietinum] Length = 720 Score = 384 bits (987), Expect = e-125 Identities = 190/228 (83%), Positives = 210/228 (92%) Frame = +3 Query: 3 YLKEKGYVSIEIYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGE 182 +LKEKGYVS+EIYN+LMDSL GE+KKALSL DEI GS +KPD+STY+IAILCL+ GE Sbjct: 491 HLKEKGYVSVEIYNVLMDSLRLSGEVKKALSLFDEIKGSGMKPDSSTYNIAILCLIARGE 550 Query: 183 IKEACVYHNKIIEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCLGSVTSGPMEFKYS 362 I+EACV HNKIIE SCIPSVV Y LAKGLC+IGEI+EAMMLVRDCLG+ TSGPMEFKY Sbjct: 551 IQEACVCHNKIIEMSCIPSVVVYHRLAKGLCEIGEIEEAMMLVRDCLGNATSGPMEFKYC 610 Query: 363 LTVIHMCKANDAEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNLRER 542 LT++H+CK NDAEKVIDVLNEMMQQG PL NVVCSA+ISGMCKHGTIEEARKVFSNLR+R Sbjct: 611 LTLVHICKFNDAEKVIDVLNEMMQQGFPLCNVVCSAIISGMCKHGTIEEARKVFSNLRDR 670 Query: 543 KLLTESDTIVYDELLIDHMKKKTADLVISGVKFFGLESKLKSKGCKLI 686 KLLTESDTIVYDELLIDHMKKKTADLVISG+KFFGLESKLKSKGCK++ Sbjct: 671 KLLTESDTIVYDELLIDHMKKKTADLVISGLKFFGLESKLKSKGCKVL 718 >ref|XP_003598903.2| PPR containing plant-like protein [Medicago truncatula] gb|AES69154.2| PPR containing plant-like protein [Medicago truncatula] Length = 723 Score = 380 bits (977), Expect = e-124 Identities = 188/228 (82%), Positives = 210/228 (92%) Frame = +3 Query: 3 YLKEKGYVSIEIYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGE 182 +LKEK YVS+EIYNI M+SL+ G+++KALSL DEI GSDL+PD+STY+IAILCLVD G+ Sbjct: 494 HLKEKSYVSVEIYNIFMESLHLSGKVEKALSLFDEIKGSDLEPDSSTYNIAILCLVDHGQ 553 Query: 183 IKEACVYHNKIIEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCLGSVTSGPMEFKYS 362 IKEAC HNKIIE S IPSV AY+CLAKGLC IGEIDEAM+LVRDCLG+VTSGPMEFKY Sbjct: 554 
IKEACECHNKIIEMSSIPSVAAYNCLAKGLCNIGEIDEAMLLVRDCLGNVTSGPMEFKYC 613 Query: 363 LTVIHMCKANDAEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNLRER 542 LT+I MCK+N AEK+IDVLNEMMQ+GC LDNVVCSA+ISGMCK+GTIEEARKVFS LRER Sbjct: 614 LTIIRMCKSNVAEKLIDVLNEMMQEGCSLDNVVCSAIISGMCKYGTIEEARKVFSILRER 673 Query: 543 KLLTESDTIVYDELLIDHMKKKTADLVISGVKFFGLESKLKSKGCKLI 686 KLLTESDTIVYDELLIDHMKKKTADLVISG+KFFGLESKLKSKGCKL+ Sbjct: 674 KLLTESDTIVYDELLIDHMKKKTADLVISGLKFFGLESKLKSKGCKLL 721 Score = 63.5 bits (153), Expect = 2e-07 Identities = 61/251 (24%), Positives = 102/251 (40%), Gaps = 34/251 (13%) Frame = +3 Query: 36 IYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGEIKEACVYHNKI 215 +YN +MD+L G L ALS+ ++ L ++ T+ I I L G+I E ++ Sbjct: 227 LYNRIMDALVKTGHLDLALSVYNDFREDGLVEESVTFMILIKGLCKGGKIDEMLEVLGRM 286 Query: 216 IEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCL---------------------GSV 332 EK C P V AY+ L + + K G +D + + ++ G V Sbjct: 287 REKLCKPDVFAYTALVRIMVKEGNLDGCLRVWKEMKRDRVDPDVMAYGTIIGGLAKGGRV 346 Query: 333 TSGPMEFK-------------YSLTVIHMCKANDAEKVIDVLNEMMQQGCPLDNVVCSAV 473 + G FK Y V N D+L +++ G D + + + Sbjct: 347 SEGYELFKEMKSKGHLIDRAIYGSLVESFVAGNKVGLAFDLLKDLVSSGYRADLGMYNNL 406 Query: 474 ISGMCKHGTIEEARKVFSNLRERKLLTESDTIVYDELLIDHMKKKTADLVISGVKFFGLE 653 I G+C +E+A K+F + L E D + LL+ + + K + +FF L Sbjct: 407 IEGLCNLNKVEKAYKLFQVTIQEGL--EPDFLSVKPLLLAYAEAKRME------EFFMLL 458 Query: 654 SKLKSKGCKLI 686 K+K G +I Sbjct: 459 EKMKKLGFPVI 469 >dbj|BAT93460.1| hypothetical protein VIGAN_07242700 [Vigna angularis var. 
angularis] Length = 716 Score = 380 bits (975), Expect = e-123 Identities = 187/228 (82%), Positives = 206/228 (90%) Frame = +3 Query: 3 YLKEKGYVSIEIYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGE 182 +LKEKG+VS+EIYNILMDSLY GE+KKALSL DE+ G ++PD+ TYSI ILCLVD+GE Sbjct: 487 HLKEKGHVSVEIYNILMDSLYKTGEVKKALSLFDEMKGLGMEPDSITYSIVILCLVDLGE 546 Query: 183 IKEACVYHNKIIEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCLGSVTSGPMEFKYS 362 I+EACV HNKIIE SCIPSV AY L KGLCKIGEIDEAMMLVRDCLGSV+ GP EFKYS Sbjct: 547 IQEACVCHNKIIEMSCIPSVAAYRSLTKGLCKIGEIDEAMMLVRDCLGSVSDGPTEFKYS 606 Query: 363 LTVIHMCKANDAEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNLRER 542 LTVIH CK+NDAEKVIDVLNEMM+QGC LDNV+ SAVISGMCKHGTIEEARKVFSNLRER Sbjct: 607 LTVIHACKSNDAEKVIDVLNEMMEQGCSLDNVIYSAVISGMCKHGTIEEARKVFSNLRER 666 Query: 543 KLLTESDTIVYDELLIDHMKKKTADLVISGVKFFGLESKLKSKGCKLI 686 LTESDTIVYDELLIDHMK+KTADLV+S +KFFGLESKLK+KGCKL+ Sbjct: 667 NYLTESDTIVYDELLIDHMKRKTADLVLSSLKFFGLESKLKAKGCKLL 714 Score = 61.6 bits (148), Expect = 9e-07 Identities = 41/159 (25%), Positives = 73/159 (45%) Frame = +3 Query: 111 NGSDLKPDASTYSIAILCLVDVGEIKEACVYHNKIIEKSCIPSVVAYSCLAKGLCKIGEI 290 N +KP Y+ + L G + ++ E + V + L KGLCK G I Sbjct: 210 NKFGVKPRVFLYNRVMDALFRTGHLDLGLSVYDDFKEDGLVEESVTFMVLVKGLCKGGRI 269 Query: 291 DEAMMLVRDCLGSVTSGPMEFKYSLTVIHMCKANDAEKVIDVLNEMMQQGCPLDNVVCSA 470 DE M+ V + P F Y+ V + +A D + + V EM + G +D + Sbjct: 270 DE-MLEVLGRMRERLCKPDVFAYTALVRILVRAGDLDACLRVWEEMKRDGVVMDPKAYAT 328 Query: 471 VISGMCKHGTIEEARKVFSNLRERKLLTESDTIVYDELL 587 +I G+ K G ++E ++F ++ + +L D ++Y +L+ Sbjct: 329 MIVGLAKGGKVQEGYELFKEMKSKGILV--DRVIYGKLV 365 Score = 61.2 bits (147), Expect = 1e-06 Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 1/167 (0%) Frame = +3 Query: 36 IYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGEIKEACVYHNKI 215 +YN +MD+L+ G L LS+ D+ L ++ T+ + + L G I E ++ Sbjct: 220 LYNRVMDALFRTGHLDLGLSVYDDFKEDGLVEESVTFMVLVKGLCKGGRIDEMLEVLGRM 279 Query: 216 IEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDC-LGSVTSGPMEFKYSLTVIHMCKAN 392 E+ C P V AY+ L + L + G++D + + + 
V P Y+ ++ + K Sbjct: 280 RERLCKPDVFAYTALVRILVRAGDLDACLRVWEEMKRDGVVMDPK--AYATMIVGLAKGG 337 Query: 393 DAEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNL 533 ++ ++ EM +G +D V+ ++ G + A + +L Sbjct: 338 KVQEGYELFKEMKSKGILVDRVIYGKLVEAFVAAGKVGLAFDLLKDL 384 >ref|XP_017425383.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740 [Vigna angularis] gb|KOM42525.1| hypothetical protein LR48_Vigan05g012900 [Vigna angularis] Length = 716 Score = 379 bits (973), Expect = e-123 Identities = 186/228 (81%), Positives = 206/228 (90%) Frame = +3 Query: 3 YLKEKGYVSIEIYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGE 182 +LKEKG+VS+EIYNILMDSLY GE+KKALSL DE+ G ++PD+ TYSI ILCLVD+GE Sbjct: 487 HLKEKGHVSVEIYNILMDSLYKTGEVKKALSLFDEMKGLGMEPDSITYSIVILCLVDLGE 546 Query: 183 IKEACVYHNKIIEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCLGSVTSGPMEFKYS 362 I+EACV HNKIIE SCIPSV AY L KGLCKIGEIDEAMMLVRDCLGSV+ GP EFKYS Sbjct: 547 IQEACVCHNKIIEMSCIPSVAAYRSLTKGLCKIGEIDEAMMLVRDCLGSVSDGPTEFKYS 606 Query: 363 LTVIHMCKANDAEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNLRER 542 +TVIH CK+NDAEKVIDVLNEMM+QGC LDNV+ SAVISGMCKHGTIEEARKVFSNLRER Sbjct: 607 ITVIHACKSNDAEKVIDVLNEMMEQGCSLDNVIYSAVISGMCKHGTIEEARKVFSNLRER 666 Query: 543 KLLTESDTIVYDELLIDHMKKKTADLVISGVKFFGLESKLKSKGCKLI 686 LTESDTIVYDELLIDHMK+KTADLV+S +KFFGLESKLK+KGCKL+ Sbjct: 667 NYLTESDTIVYDELLIDHMKRKTADLVLSSLKFFGLESKLKAKGCKLL 714 Score = 61.6 bits (148), Expect = 9e-07 Identities = 41/159 (25%), Positives = 73/159 (45%) Frame = +3 Query: 111 NGSDLKPDASTYSIAILCLVDVGEIKEACVYHNKIIEKSCIPSVVAYSCLAKGLCKIGEI 290 N +KP Y+ + L G + ++ E + V + L KGLCK G I Sbjct: 210 NKFGVKPRVFLYNRVMDALFRTGHLDLGLSVYDDFKEDGLVEESVTFMVLVKGLCKGGRI 269 Query: 291 DEAMMLVRDCLGSVTSGPMEFKYSLTVIHMCKANDAEKVIDVLNEMMQQGCPLDNVVCSA 470 DE M+ V + P F Y+ V + +A D + + V EM + G +D + Sbjct: 270 DE-MLEVLGRMRERLCKPDVFAYTALVRILVRAGDLDACLRVWEEMKRDGVVMDPKAYAT 328 Query: 471 VISGMCKHGTIEEARKVFSNLRERKLLTESDTIVYDELL 587 +I G+ K G ++E ++F ++ + +L D ++Y +L+ Sbjct: 329 MIVGLAKGGKVQEGYELFKEMKSKGILV--DRVIYGKLV 365 Score = 
61.2 bits (147), Expect = 1e-06 Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 1/167 (0%) Frame = +3 Query: 36 IYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGEIKEACVYHNKI 215 +YN +MD+L+ G L LS+ D+ L ++ T+ + + L G I E ++ Sbjct: 220 LYNRVMDALFRTGHLDLGLSVYDDFKEDGLVEESVTFMVLVKGLCKGGRIDEMLEVLGRM 279 Query: 216 IEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDC-LGSVTSGPMEFKYSLTVIHMCKAN 392 E+ C P V AY+ L + L + G++D + + + V P Y+ ++ + K Sbjct: 280 RERLCKPDVFAYTALVRILVRAGDLDACLRVWEEMKRDGVVMDPK--AYATMIVGLAKGG 337 Query: 393 DAEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNL 533 ++ ++ EM +G +D V+ ++ G + A + +L Sbjct: 338 KVQEGYELFKEMKSKGILVDRVIYGKLVEAFVAAGKVGLAFDLLKDL 384 >ref|XP_014498722.2| pentatricopeptide repeat-containing protein At4g20740 [Vigna radiata var. radiata] Length = 758 Score = 377 bits (969), Expect = e-122 Identities = 185/228 (81%), Positives = 206/228 (90%) Frame = +3 Query: 3 YLKEKGYVSIEIYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGE 182 +LKEKG+VS+EIYNILMDSLY GE+KKALSL DE+ G ++PD+ TYSI ILCLVD+GE Sbjct: 529 HLKEKGHVSVEIYNILMDSLYKTGEVKKALSLFDEMKGLSMEPDSITYSIVILCLVDLGE 588 Query: 183 IKEACVYHNKIIEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCLGSVTSGPMEFKYS 362 I+EACV HNKIIE SCIPSV AY+ L KGLCKIGEI+EAMMLVRDCLG V+ GP EFKYS Sbjct: 589 IQEACVCHNKIIEMSCIPSVAAYTSLTKGLCKIGEIEEAMMLVRDCLGCVSDGPTEFKYS 648 Query: 363 LTVIHMCKANDAEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNLRER 542 LTVIH CK+NDAEKVIDVLNEMM+QGC LDNV+ SAVISGMCKHGTIEEARKVFSNLRER Sbjct: 649 LTVIHACKSNDAEKVIDVLNEMMEQGCSLDNVIYSAVISGMCKHGTIEEARKVFSNLRER 708 Query: 543 KLLTESDTIVYDELLIDHMKKKTADLVISGVKFFGLESKLKSKGCKLI 686 LTESDTIVYDELLIDHMK+KTADLV+S +KFFGLESKLK+KGCKL+ Sbjct: 709 NYLTESDTIVYDELLIDHMKRKTADLVLSSLKFFGLESKLKAKGCKLL 756 Score = 62.0 bits (149), Expect = 7e-07 Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 1/167 (0%) Frame = +3 Query: 36 IYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGEIKEACVYHNKI 215 +YN +MD+L+ G L LS+ D+ L ++ T+ + + L G I E ++ Sbjct: 262 
LYNRVMDALFRTGHLDLGLSVYDDFKEDGLVEESVTFMVLVKGLCQAGRIDEMLEVLGRM 321 Query: 216 IEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDC-LGSVTSGPMEFKYSLTVIHMCKAN 392 E+ C P V AY+ L + L + G++D + + + V P Y+ ++ + K Sbjct: 322 RERLCKPDVFAYTALVRILVRAGDLDACLRVWEEMKRDGVVVDPK--AYATMIVGLAKGG 379 Query: 393 DAEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNL 533 ++ ++ EM +G +D V+ ++ G + A + +L Sbjct: 380 KVQEGYELFKEMKSKGILVDRVIYGKLVEAFVAVGKVGLAFDLLKDL 426 >gb|KHN11425.1| Pentatricopeptide repeat-containing protein [Glycine soja] Length = 425 Score = 364 bits (934), Expect = e-121 Identities = 176/227 (77%), Positives = 202/227 (88%) Frame = +3 Query: 6 LKEKGYVSIEIYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGEI 185 LKEKG+VS+EIYNI MDSL+ IGE+KKALSL DE+ G LKPD+ TY AILCLVD+GEI Sbjct: 197 LKEKGHVSVEIYNIFMDSLHKIGEVKKALSLFDEMKGLSLKPDSFTYCTAILCLVDLGEI 256 Query: 186 KEACVYHNKIIEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCLGSVTSGPMEFKYSL 365 KEAC HN+IIE SCIPSV AYS L KGLC+IGEIDEAM+LV DCLG+V+ GP+EFKYSL Sbjct: 257 KEACACHNRIIEMSCIPSVAAYSSLTKGLCQIGEIDEAMLLVHDCLGNVSDGPLEFKYSL 316 Query: 366 TVIHMCKANDAEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNLRERK 545 T+IH CK+N AEKVIDVLNEM++QGC +DNV+ ++ISGMCKHGTIEEARKVFSNLRER Sbjct: 317 TIIHACKSNVAEKVIDVLNEMIEQGCSIDNVIYCSIISGMCKHGTIEEARKVFSNLRERN 376 Query: 546 LLTESDTIVYDELLIDHMKKKTADLVISGVKFFGLESKLKSKGCKLI 686 LTES+TIVYDELLIDHMKKKTADLV+S +KFFGLESKLK+KGCKL+ Sbjct: 377 FLTESNTIVYDELLIDHMKKKTADLVLSSLKFFGLESKLKAKGCKLL 423 >ref|XP_019432613.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740 [Lupinus angustifolius] gb|OIW21254.1| hypothetical protein TanjilG_31184 [Lupinus angustifolius] Length = 725 Score = 373 bits (958), Expect = e-121 Identities = 185/227 (81%), Positives = 207/227 (91%) Frame = +3 Query: 3 YLKEKGYVSIEIYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGE 182 +LKEKGYVS++IYNILMDSL+ IGE+KKAL LLDE+N S+LKPD+ TYS AILC VD+GE Sbjct: 497 HLKEKGYVSVKIYNILMDSLHKIGEMKKALLLLDEMNDSNLKPDSFTYSTAILCHVDLGE 556 Query: 183 
IKEACVYHNKIIEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCLGSVTSGPMEFKYS 362 I+EACV HNKIIE SC+PSV AYSCLAKGLCKIGEID AMMLVRDCLG+VTSGPMEFK+S Sbjct: 557 IQEACVCHNKIIEMSCVPSVAAYSCLAKGLCKIGEIDPAMMLVRDCLGNVTSGPMEFKHS 616 Query: 363 LTVIHMCKANDAEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNLRER 542 LT+IH CK+NDA KVIDVLNEM+QQGCP NV SAVISGM K+GTIEEARKVFSNLRER Sbjct: 617 LTIIHACKSNDAAKVIDVLNEMIQQGCPPGNVAYSAVISGMSKYGTIEEARKVFSNLRER 676 Query: 543 KLLTESDTIVYDELLIDHMKKKTADLVISGVKFFGLESKLKSKGCKL 683 KLLTE++TIVYDE LIDHMKKKTADLV++G+KFF LESKLKSKGC L Sbjct: 677 KLLTEAETIVYDEFLIDHMKKKTADLVLAGLKFFDLESKLKSKGCML 723 Score = 61.2 bits (147), Expect = 1e-06 Identities = 43/168 (25%), Positives = 75/168 (44%) Frame = +3 Query: 36 IYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGEIKEACVYHNKI 215 +YN +MD+L G L ALS+ D+ + L + T+ I I L G I E ++ Sbjct: 230 LYNRVMDALVKTGHLNLALSVYDDFSEDGLVEETVTFMILIKGLCKAGRIDEMLETLGRM 289 Query: 216 IEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCLGSVTSGPMEFKYSLTVIHMCKAND 395 K C P V AY+ L K L G +D + + + P Y+ +I + K Sbjct: 290 RTKLCKPDVFAYTALVKMLVPEGNLDGCLRVWEEMKRDRVE-PDVMAYATIIIGLSKVGR 348 Query: 396 AEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNLRE 539 E+ ++ EM ++G +D + ++I T+++ F L++ Sbjct: 349 VEEGYELFKEMKKKGHLIDRAIYGSLIDSFV---TVKKLGSAFDLLKD 393 >gb|KYP41047.1| Pentatricopeptide repeat-containing protein At4g20740 family [Cajanus cajan] Length = 679 Score = 367 bits (941), Expect = e-119 Identities = 180/228 (78%), Positives = 207/228 (90%) Frame = +3 Query: 3 YLKEKGYVSIEIYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGE 182 +LKEKG+VS+EIYNILM+SL+ IGE KKALSL DE+ LKPD+ TYSIAILCLVD+GE Sbjct: 450 FLKEKGHVSVEIYNILMESLHKIGEGKKALSLFDEMKDLSLKPDSLTYSIAILCLVDLGE 509 Query: 183 IKEACVYHNKIIEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCLGSVTSGPMEFKYS 362 I+EACV HNKIIE SC PSV AYS LAKGLCKIGEI+EAM+LVRDCLG+V+ GPM FKYS Sbjct: 510 IQEACVCHNKIIEMSCFPSVAAYSSLAKGLCKIGEIEEAMLLVRDCLGNVSDGPMVFKYS 569 Query: 363 LTVIHMCKANDAEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNLRER 542 LT+IH CK+NDA+KVIDVL+EM++QGC LD+VV 
SA+ISGMCK+GTIEEARKVFSNLRER Sbjct: 570 LTIIHACKSNDAKKVIDVLDEMLEQGCSLDSVVYSAIISGMCKYGTIEEARKVFSNLRER 629 Query: 543 KLLTESDTIVYDELLIDHMKKKTADLVISGVKFFGLESKLKSKGCKLI 686 LTES+TIVYDELLIDHMKKKTADLV+S +KFFGLESKLK+KGCKL+ Sbjct: 630 NFLTESETIVYDELLIDHMKKKTADLVLSSLKFFGLESKLKAKGCKLL 677 >gb|KHN23162.1| Pentatricopeptide repeat-containing protein [Glycine soja] Length = 680 Score = 366 bits (939), Expect = e-119 Identities = 177/227 (77%), Positives = 203/227 (89%) Frame = +3 Query: 6 LKEKGYVSIEIYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGEI 185 LKEKG+VS+EIYNI MDSL+ IGE+KKALSL DE+ G LKPD+ TY AILCLVD+GEI Sbjct: 452 LKEKGHVSVEIYNIFMDSLHKIGEVKKALSLFDEMKGLSLKPDSFTYCTAILCLVDLGEI 511 Query: 186 KEACVYHNKIIEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCLGSVTSGPMEFKYSL 365 KEAC HN+IIE SCIPSV AYS L KGLC+IGEIDEAM+LVRDCLG+V+ GP+EFKYSL Sbjct: 512 KEACACHNRIIEMSCIPSVAAYSSLTKGLCQIGEIDEAMLLVRDCLGNVSDGPLEFKYSL 571 Query: 366 TVIHMCKANDAEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNLRERK 545 T+IH CK+N AEKVIDVLNEM++QGC +DNV+ ++ISGMCKHGTIEEARKVFSNLRER Sbjct: 572 TIIHACKSNVAEKVIDVLNEMIEQGCSIDNVIYCSIISGMCKHGTIEEARKVFSNLRERN 631 Query: 546 LLTESDTIVYDELLIDHMKKKTADLVISGVKFFGLESKLKSKGCKLI 686 LTES+TIVYDELLIDHMKKKTADLV+S +KFFGLESKLK+KGCKL+ Sbjct: 632 FLTESNTIVYDELLIDHMKKKTADLVLSSLKFFGLESKLKAKGCKLL 678 >ref|XP_020240563.1| pentatricopeptide repeat-containing protein At4g20740 [Cajanus cajan] ref|XP_020240564.1| pentatricopeptide repeat-containing protein At4g20740 [Cajanus cajan] ref|XP_020240565.1| pentatricopeptide repeat-containing protein At4g20740 [Cajanus cajan] Length = 720 Score = 367 bits (941), Expect = e-118 Identities = 180/228 (78%), Positives = 207/228 (90%) Frame = +3 Query: 3 YLKEKGYVSIEIYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGE 182 +LKEKG+VS+EIYNILM+SL+ IGE KKALSL DE+ LKPD+ TYSIAILCLVD+GE Sbjct: 491 FLKEKGHVSVEIYNILMESLHKIGEGKKALSLFDEMKDLSLKPDSLTYSIAILCLVDLGE 550 Query: 183 IKEACVYHNKIIEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCLGSVTSGPMEFKYS 362 I+EACV HNKIIE SC PSV AYS LAKGLCKIGEI+EAM+LVRDCLG+V+ 
GPM FKYS Sbjct: 551 IQEACVCHNKIIEMSCFPSVAAYSSLAKGLCKIGEIEEAMLLVRDCLGNVSDGPMVFKYS 610 Query: 363 LTVIHMCKANDAEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNLRER 542 LT+IH CK+NDA+KVIDVL+EM++QGC LD+VV SA+ISGMCK+GTIEEARKVFSNLRER Sbjct: 611 LTIIHACKSNDAKKVIDVLDEMLEQGCSLDSVVYSAIISGMCKYGTIEEARKVFSNLRER 670 Query: 543 KLLTESDTIVYDELLIDHMKKKTADLVISGVKFFGLESKLKSKGCKLI 686 LTES+TIVYDELLIDHMKKKTADLV+S +KFFGLESKLK+KGCKL+ Sbjct: 671 NFLTESETIVYDELLIDHMKKKTADLVLSSLKFFGLESKLKAKGCKLL 718 >gb|KRG90809.1| hypothetical protein GLYMA_20G115000 [Glycine max] Length = 695 Score = 366 bits (939), Expect = e-118 Identities = 178/227 (78%), Positives = 202/227 (88%) Frame = +3 Query: 6 LKEKGYVSIEIYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGEI 185 LKEKG+VS+EIYNI MDSL+ IGE+KKALSL DE+ G LKPD+ TY AILCLVD+GEI Sbjct: 467 LKEKGHVSVEIYNIFMDSLHKIGEVKKALSLFDEMKGLSLKPDSFTYCTAILCLVDLGEI 526 Query: 186 KEACVYHNKIIEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCLGSVTSGPMEFKYSL 365 KEAC HN+IIE SCIPSV AYS L KGLC+IGEIDEAM+LVRDCLG+V+ GPMEFKYSL Sbjct: 527 KEACACHNRIIEMSCIPSVAAYSSLTKGLCQIGEIDEAMLLVRDCLGNVSDGPMEFKYSL 586 Query: 366 TVIHMCKANDAEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNLRERK 545 T+IH CK+N EKVIDVLNEM++QGC LDNV+ ++ISGMCKHGTIEEARKVFSNLRER Sbjct: 587 TIIHACKSNVPEKVIDVLNEMIEQGCSLDNVIYCSIISGMCKHGTIEEARKVFSNLRERN 646 Query: 546 LLTESDTIVYDELLIDHMKKKTADLVISGVKFFGLESKLKSKGCKLI 686 LTES+TIVYDELLIDHMKKKTADLV+S +KFFGLESKLK+KGCKL+ Sbjct: 647 FLTESNTIVYDELLIDHMKKKTADLVLSSLKFFGLESKLKAKGCKLL 693 Score = 68.2 bits (165), Expect = 7e-09 Identities = 43/166 (25%), Positives = 79/166 (47%) Frame = +3 Query: 36 IYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGEIKEACVYHNKI 215 +YN +MD+L G L ALS+ D++ L ++ T+ + + L G I E ++ Sbjct: 199 LYNRVMDALVRTGHLDLALSVYDDLKEDGLVEESVTFMVLVKGLCKCGRIDEMLKVLGRM 258 Query: 216 IEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCLGSVTSGPMEFKYSLTVIHMCKAND 395 E+ C P V AY+ L K L G +D A + V + + P Y+ ++ + K Sbjct: 259 RERLCKPDVFAYTALVKILVPAGNLD-ACLRVWEEMKRDRVEPDVKAYATMIVGLAKGGR 317 Query: 396 
AEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNL 533 ++ ++ EM +GC +D+V+ A++ G + A + +L Sbjct: 318 VQEGYELFREMKGKGCLVDSVIYGALVEAFVAEGKVGLAFDLLKDL 363 >ref|XP_006605814.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like [Glycine max] gb|KRG90529.1| hypothetical protein GLYMA_20G097200 [Glycine max] gb|KRG90530.1| hypothetical protein GLYMA_20G097200 [Glycine max] Length = 764 Score = 367 bits (941), Expect = e-118 Identities = 178/227 (78%), Positives = 203/227 (89%) Frame = +3 Query: 6 LKEKGYVSIEIYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGEI 185 LKEKG+VS+EIYNI MDSL+ IGE+KKALSL DE+ G LKPD+ TY AILCLVD+GEI Sbjct: 536 LKEKGHVSVEIYNIFMDSLHKIGEVKKALSLFDEMKGLSLKPDSFTYCTAILCLVDLGEI 595 Query: 186 KEACVYHNKIIEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCLGSVTSGPMEFKYSL 365 KEAC HN+IIE SCIPSV AYS L KGLC+IGEIDEAM+LVRDCLG+V+ GP+EFKYSL Sbjct: 596 KEACACHNRIIEMSCIPSVAAYSSLTKGLCQIGEIDEAMLLVRDCLGNVSDGPLEFKYSL 655 Query: 366 TVIHMCKANDAEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNLRERK 545 T+IH CK+N AEKVIDVLNEM++QGC LDNV+ ++ISGMCKHGTIEEARKVFSNLRER Sbjct: 656 TIIHACKSNVAEKVIDVLNEMIEQGCSLDNVIYCSIISGMCKHGTIEEARKVFSNLRERN 715 Query: 546 LLTESDTIVYDELLIDHMKKKTADLVISGVKFFGLESKLKSKGCKLI 686 LTES+TIVYDELLIDHMKKKTADLV+S +KFFGLESKLK+KGCKL+ Sbjct: 716 FLTESNTIVYDELLIDHMKKKTADLVLSSLKFFGLESKLKAKGCKLL 762 Score = 70.5 bits (171), Expect = 1e-09 Identities = 44/166 (26%), Positives = 79/166 (47%) Frame = +3 Query: 36 IYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGEIKEACVYHNKI 215 +YN +MD+L G L ALS+ D++ L ++ T+ + + L G I E ++ Sbjct: 268 LYNRVMDALVRTGHLDLALSVYDDLKEDGLVEESVTFMVLVKGLCKCGRIDEMLEVLGRM 327 Query: 216 IEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCLGSVTSGPMEFKYSLTVIHMCKAND 395 E+ C P V AY+ L K L G +D A + V + + P Y+ ++ + K Sbjct: 328 RERLCKPDVFAYTALVKILVPAGNLD-ACLRVWEEMKRDRVEPDVKAYATMIVGLAKGGR 386 Query: 396 AEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNL 533 ++ ++ EM +GC +D V+ A++ G +E A + +L Sbjct: 387 VQEGYELFREMKGKGCLVDRVIYGALVEAFVAEGKVELAFDLLKDL 432 >ref|XP_016204326.1| 
pentatricopeptide repeat-containing protein At4g20740, partial [Arachis ipaensis] Length = 687 Score = 364 bits (934), Expect = e-118 Identities = 176/228 (77%), Positives = 204/228 (89%) Frame = +3 Query: 3 YLKEKGYVSIEIYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGE 182 +LK KGY+S+E+YNILMDSL+ +GE+KKAL L DE+N S+LKPD+ TYSI ILC VD+G+ Sbjct: 458 HLKAKGYISVEMYNILMDSLHKVGEIKKALLLFDEMNASNLKPDSFTYSITILCHVDLGK 517 Query: 183 IKEACVYHNKIIEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCLGSVTSGPMEFKYS 362 I+EAC YHNKIIE SCIPSV AYS LAKGLCKIGEID MMLVRDCL ++ SGPMEFKYS Sbjct: 518 IQEACEYHNKIIEMSCIPSVAAYSFLAKGLCKIGEIDAGMMLVRDCLANIDSGPMEFKYS 577 Query: 363 LTVIHMCKANDAEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNLRER 542 LT+IH CK+NDAEKVI+VLNEMMQQGCPLD V C+AVISGM KHGTIEEARKVFSNLR+R Sbjct: 578 LTIIHSCKSNDAEKVIEVLNEMMQQGCPLDIVACAAVISGMSKHGTIEEARKVFSNLRDR 637 Query: 543 KLLTESDTIVYDELLIDHMKKKTADLVISGVKFFGLESKLKSKGCKLI 686 +LLTE+DTIVYDELLI+H K++TADLV+SG+K FGLESKLKSKG K + Sbjct: 638 RLLTEADTIVYDELLINHTKQRTADLVLSGIKLFGLESKLKSKGFKFL 685 >ref|XP_015969135.1| pentatricopeptide repeat-containing protein At4g20740 [Arachis duranensis] Length = 720 Score = 364 bits (934), Expect = e-117 Identities = 176/228 (77%), Positives = 203/228 (89%) Frame = +3 Query: 3 YLKEKGYVSIEIYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGE 182 +LK KGY+S+E+YNILMDSL+ +GE+KKAL L DE+N S+LKPD+ TYSI ILC VD+G+ Sbjct: 491 HLKAKGYISVEMYNILMDSLHKVGEMKKALLLFDEMNASNLKPDSFTYSITILCHVDLGK 550 Query: 183 IKEACVYHNKIIEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCLGSVTSGPMEFKYS 362 I+EAC YHNKIIE SCIPSV AYS LAKGLCKIGEID MMLVRDCL ++ SGPMEFKYS Sbjct: 551 IQEACEYHNKIIEMSCIPSVAAYSSLAKGLCKIGEIDAGMMLVRDCLANIDSGPMEFKYS 610 Query: 363 LTVIHMCKANDAEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNLRER 542 LT+IH CK+NDAEKVI+VLNEMMQQGCPLD V C+AVISGM KHGTIEEARKVFSNLR+R Sbjct: 611 LTIIHSCKSNDAEKVIEVLNEMMQQGCPLDIVACAAVISGMSKHGTIEEARKVFSNLRDR 670 Query: 543 KLLTESDTIVYDELLIDHMKKKTADLVISGVKFFGLESKLKSKGCKLI 686 +LLTE+DTIVYDELLI H K++TADLV+SG+K FGLESKLKSKG K + Sbjct: 671 
RLLTEADTIVYDELLISHTKQRTADLVLSGIKLFGLESKLKSKGFKFL 718 >ref|XP_014632180.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like [Glycine max] gb|KRH55198.1| hypothetical protein GLYMA_06G236700 [Glycine max] Length = 764 Score = 364 bits (934), Expect = e-117 Identities = 176/227 (77%), Positives = 202/227 (88%) Frame = +3 Query: 6 LKEKGYVSIEIYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGEI 185 LKEKG+VS+EIYNI MDSL+ IGE+KKALSL DE+ G LKPD+ TY AILCLVD+GEI Sbjct: 536 LKEKGHVSVEIYNIFMDSLHKIGEVKKALSLFDEMKGLSLKPDSFTYCTAILCLVDLGEI 595 Query: 186 KEACVYHNKIIEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCLGSVTSGPMEFKYSL 365 KEAC HN+IIE SCIPSV AYS L KGLC+IGEIDEAM+LV DCLG+V+ GP+EFKYSL Sbjct: 596 KEACACHNRIIEMSCIPSVAAYSSLTKGLCQIGEIDEAMLLVHDCLGNVSDGPLEFKYSL 655 Query: 366 TVIHMCKANDAEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNLRERK 545 T+IH CK+N AEKVIDVLNEM++QGC +DNV+ ++ISGMCKHGTIEEARKVFSNLRER Sbjct: 656 TIIHACKSNVAEKVIDVLNEMIEQGCSIDNVIYCSIISGMCKHGTIEEARKVFSNLRERN 715 Query: 546 LLTESDTIVYDELLIDHMKKKTADLVISGVKFFGLESKLKSKGCKLI 686 LTES+TIVYDELLIDHMKKKTADLV+S +KFFGLESKLK+KGCKL+ Sbjct: 716 FLTESNTIVYDELLIDHMKKKTADLVLSSLKFFGLESKLKAKGCKLL 762 Score = 67.4 bits (163), Expect = 1e-08 Identities = 43/166 (25%), Positives = 78/166 (46%) Frame = +3 Query: 36 IYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGEIKEACVYHNKI 215 +YN +MD+L G L ALS+ D++ L ++ T+ + + L G I E ++ Sbjct: 268 LYNRVMDALVRTGHLDLALSVYDDLKEDGLVEESVTFMVLVKGLCKCGRIDEMLEVLGRM 327 Query: 216 IEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCLGSVTSGPMEFKYSLTVIHMCKAND 395 E+ C P V AY+ L K L G +D A + V + + P Y+ ++ + K Sbjct: 328 RERLCKPDVFAYTALVKILVPAGNLD-ACLRVWEEMKRDRVVPDVKAYATMIVGLAKGGR 386 Query: 396 AEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNL 533 ++ ++ EM +GC +D V+ A++ G + A + +L Sbjct: 387 VQEGYELFREMKGKGCLVDRVIYGALVEAFVAEGKVGLAFDLLKDL 432 >ref|XP_007149018.1| hypothetical protein PHAVU_005G033500g [Phaseolus vulgaris] gb|ESW21012.1| hypothetical protein PHAVU_005G033500g [Phaseolus vulgaris] Length = 715 Score = 360 bits (925), 
Expect = e-116 Identities = 181/228 (79%), Positives = 202/228 (88%) Frame = +3 Query: 3 YLKEKGYVSIEIYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGE 182 +LKEKG+VS+EIYNIL DSLY IGE KKALSL DE+ S ++PD+ TYSI I CLVD+GE Sbjct: 487 HLKEKGHVSVEIYNILTDSLYKIGEEKKALSLFDEMK-SMMEPDSITYSIVIQCLVDLGE 545 Query: 183 IKEACVYHNKIIEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCLGSVTSGPMEFKYS 362 I+EACV HNKIIE SCIPSV AY LAKGLCKIGEIDEAMMLVRDCLGSV+ GPMEFKYS Sbjct: 546 IQEACVCHNKIIEMSCIPSVAAYRSLAKGLCKIGEIDEAMMLVRDCLGSVSDGPMEFKYS 605 Query: 363 LTVIHMCKANDAEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNLRER 542 LT+IH CK+NDAEKVI VLNEMM+QGC LDNV+ SA+ISGMCKHGTIEEARKVFSNLRER Sbjct: 606 LTIIHACKSNDAEKVIGVLNEMMEQGCSLDNVIYSAIISGMCKHGTIEEARKVFSNLRER 665 Query: 543 KLLTESDTIVYDELLIDHMKKKTADLVISGVKFFGLESKLKSKGCKLI 686 LTESDTIVY+ELLIDH K+KTADLV+ +KFFGLESKLK+KG KL+ Sbjct: 666 NYLTESDTIVYEELLIDHTKRKTADLVLLSLKFFGLESKLKAKGSKLL 713 Score = 61.2 bits (147), Expect = 1e-06 Identities = 41/159 (25%), Positives = 72/159 (45%) Frame = +3 Query: 111 NGSDLKPDASTYSIAILCLVDVGEIKEACVYHNKIIEKSCIPSVVAYSCLAKGLCKIGEI 290 N +KP Y+ + L G + ++ E + V + L KGLCK G I Sbjct: 210 NKFGVKPRVFLYNRVMDALFKTGHLDLGLSVYDDFKEDGLVEESVTFMLLVKGLCKGGRI 269 Query: 291 DEAMMLVRDCLGSVTSGPMEFKYSLTVIHMCKANDAEKVIDVLNEMMQQGCPLDNVVCSA 470 DE M+ V + P F Y+ V + +A D + + V EM + G +D + Sbjct: 270 DE-MLEVLGRMRESLCKPDVFAYTALVRILVRAGDLDACLRVWEEMKRDGVVVDPKAYAT 328 Query: 471 VISGMCKHGTIEEARKVFSNLRERKLLTESDTIVYDELL 587 +I G+ K G ++E ++F ++ + L D ++Y +L+ Sbjct: 329 MIVGLAKGGRVQEGYELFKEMKSKGFLV--DRVIYGKLV 365 Score = 58.9 bits (141), Expect = 7e-06 Identities = 38/167 (22%), Positives = 75/167 (44%), Gaps = 1/167 (0%) Frame = +3 Query: 36 IYNILMDSLYAIGELKKALSLLDEINGSDLKPDASTYSIAILCLVDVGEIKEACVYHNKI 215 +YN +MD+L+ G L LS+ D+ L ++ T+ + + L G I E ++ Sbjct: 220 LYNRVMDALFKTGHLDLGLSVYDDFKEDGLVEESVTFMLLVKGLCKGGRIDEMLEVLGRM 279 Query: 216 IEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDC-LGSVTSGPMEFKYSLTVIHMCKAN 392 E C P V AY+ L + L + G++D + + + V P Y+ ++ + K Sbjct: 280 
RESLCKPDVFAYTALVRILVRAGDLDACLRVWEEMKRDGVVVDPK--AYATMIVGLAKGG 337 Query: 393 DAEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEARKVFSNL 533 ++ ++ EM +G +D V+ ++ G + A + +L Sbjct: 338 RVQEGYELFKEMKSKGFLVDRVIYGKLVEAFVAGGKVGLAFDLLKDL 384 >dbj|GAU10008.1| hypothetical protein TSUD_120070 [Trifolium subterraneum] Length = 626 Score = 353 bits (906), Expect = e-114 Identities = 179/238 (75%), Positives = 202/238 (84%), Gaps = 11/238 (4%) Frame = +3 Query: 6 LKEKGY-VSIEIYNILMDSLYAIGELKKALSLL----------DEINGSDLKPDASTYSI 152 L GY + IYN L++ L + +++KA L D ++GSDLKPD+ST++I Sbjct: 387 LVSSGYRADLGIYNNLIEGLCNLNKVEKAYKLFQVTIQEGLEPDFLSGSDLKPDSSTFNI 446 Query: 153 AILCLVDVGEIKEACVYHNKIIEKSCIPSVVAYSCLAKGLCKIGEIDEAMMLVRDCLGSV 332 AILCLVD GEIKEAC HNKIIE SCIPS+ AY CLAKGLC+IGEIDEAMMLVRDCLG+V Sbjct: 447 AILCLVDRGEIKEACECHNKIIEMSCIPSIAAYCCLAKGLCEIGEIDEAMMLVRDCLGNV 506 Query: 333 TSGPMEFKYSLTVIHMCKANDAEKVIDVLNEMMQQGCPLDNVVCSAVISGMCKHGTIEEA 512 T+GPMEFKY LTVIH+CK+NDAEKVIDVLNEMMQQGCPL +VVCSA+ISGMCK+GTIEEA Sbjct: 507 TNGPMEFKYCLTVIHICKSNDAEKVIDVLNEMMQQGCPLGSVVCSAIISGMCKYGTIEEA 566 Query: 513 RKVFSNLRERKLLTESDTIVYDELLIDHMKKKTADLVISGVKFFGLESKLKSKGCKLI 686 RKVFSNLRE KLLTESDTIVYDE LIDHMKKKTADLVISG+KFFGLESKLKSKGCKL+ Sbjct: 567 RKVFSNLREHKLLTESDTIVYDEFLIDHMKKKTADLVISGLKFFGLESKLKSKGCKLL 624