BLASTX nr result
ID: Astragalus22_contig00000993
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus22_contig00000993 (741 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_004490153.1| PREDICTED: pentatricopeptide repeat-containi... 406 e-138 ref|XP_003614017.1| PPR containing plant-like protein [Medicago ... 400 e-135 gb|KYP65771.1| Pentatricopeptide repeat-containing protein At2g2... 388 e-131 ref|XP_020218171.1| pentatricopeptide repeat-containing protein ... 388 e-130 ref|XP_003520106.1| PREDICTED: pentatricopeptide repeat-containi... 387 e-130 gb|OIW02168.1| hypothetical protein TanjilG_02392 [Lupinus angus... 378 e-127 ref|XP_019459635.1| PREDICTED: pentatricopeptide repeat-containi... 378 e-126 ref|XP_007157581.1| hypothetical protein PHAVU_002G081200g [Phas... 377 e-126 ref|XP_017435484.1| PREDICTED: pentatricopeptide repeat-containi... 373 e-125 gb|KHN02018.1| Pentatricopeptide repeat-containing protein [Glyc... 373 e-125 ref|XP_014492913.1| pentatricopeptide repeat-containing protein ... 371 e-124 gb|KRH75002.1| hypothetical protein GLYMA_01G056200 [Glycine max] 360 e-120 ref|XP_016203711.1| pentatricopeptide repeat-containing protein ... 356 e-118 gb|PNX95537.1| pentatricopeptide repeat-containing protein at2g2... 355 e-117 ref|XP_015966522.1| pentatricopeptide repeat-containing protein ... 354 e-117 dbj|GAU50198.1| hypothetical protein TSUD_408860 [Trifolium subt... 357 e-117 gb|KHN03629.1| Pentatricopeptide repeat-containing protein [Glyc... 345 e-115 gb|OMO71979.1| hypothetical protein COLO4_27920 [Corchorus olito... 335 e-110 ref|XP_021621830.1| pentatricopeptide repeat-containing protein ... 335 e-109 ref|XP_021299781.1| pentatricopeptide repeat-containing protein ... 332 e-108 >ref|XP_004490153.1| PREDICTED: pentatricopeptide repeat-containing protein At2g20540-like [Cicer arietinum] Length = 523 Score = 406 bits (1044), Expect = e-138 Identities = 197/238 (82%), Positives = 214/238 (89%) Frame = -3 Query: 739 DMYAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDI 560 DMYAKCGNL+LAKRLFD M E+DIVCWNAMISGMAMHGDGK A+KLF MEK+G+KPDDI Sbjct: 283 DMYAKCGNLELAKRLFDSMQERDIVCWNAMISGMAMHGDGKGAVKLFYDMEKVGMKPDDI 342 Query: 559 TFIAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRR 380 TFIAVFTACSYSGMA EGLKLLD+MCSVYNIEPKSEHYGCLVDLLSRAGLFEEA+VI+RR Sbjct: 343 TFIAVFTACSYSGMAFEGLKLLDKMCSVYNIEPKSEHYGCLVDLLSRAGLFEEAMVIIRR 402 Query: 379 ITNSWSGSEEILAWRAFLSACCNHGXXXXXXXXXXXXLRLDNHSGVYVLLSNLYAASGKH 200 ITNSW G+ E LAWRAFLSACCNHG L+L+NHSGVYVLLSNLYA+SGKH Sbjct: 403 ITNSWIGNAETLAWRAFLSACCNHGKTQLAEVAAEKLLQLENHSGVYVLLSNLYASSGKH 462 Query: 199 TDARRVRDVMKIKGANKAPGCSSVEIDGVVSEFIAGEKTHPHIDEIHSILEKMHMQLD 26 +DARRVRD+MKIKGANKAPGCSS EIDGVVSEFIAGEK HP ++EIHS+LEKMHMQLD Sbjct: 463 SDARRVRDMMKIKGANKAPGCSSTEIDGVVSEFIAGEKIHPQMEEIHSVLEKMHMQLD 520 Score = 73.9 bits (180), Expect = 2e-11 Identities = 46/144 (31%), Positives = 74/144 (51%), Gaps = 1/144 (0%) Frame = -3 Query: 733 YAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDITF 554 YAK GN+D A+ FD PEKD W AMISG + K +L LF M+ + PD+ F Sbjct: 183 YAKMGNVDSARLFFDEAPEKDKGIWGAMISGYVQNSCFKESLYLFRLMQLTDIVPDESIF 242 Query: 553 IAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYG-CLVDLLSRAGLFEEALVIMRRI 377 +++ +AC++ G G+ + R + + P S L+D+ ++ G E + +R+ Sbjct: 243 VSILSACAHLGALDIGV-WIHRYLNRSKMLPLSVRLSTSLLDMYAKCGNLE----LAKRL 297 Query: 376 TNSWSGSEEILAWRAFLSACCNHG 305 +S +I+ W A +S HG Sbjct: 298 FDSMQ-ERDIVCWNAMISGMAMHG 320 >ref|XP_003614017.1| PPR containing plant-like protein [Medicago truncatula] gb|AES96975.1| PPR containing plant-like protein [Medicago truncatula] Length = 525 Score = 400 bits (1029), Expect = e-135 Identities = 196/240 (81%), Positives = 214/240 (89%), Gaps = 2/240 (0%) Frame = -3 Query: 739 DMYAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDI 560 DMYAKCGNL+LAKRLFD M +D+VCWNAMISGMAMHGDGK ALKLF MEK+GVKPDDI Sbjct: 283 DMYAKCGNLELAKRLFDSMNMRDVVCWNAMISGMAMHGDGKGALKLFYDMEKVGVKPDDI 342 Query: 559 TFIAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRR 380 TFIAVFTACSYSGMA EGL LLD+MCSVYNI PKSEHYGCLVDLLSRAGLFEEA+V++R+ Sbjct: 343 TFIAVFTACSYSGMAYEGLMLLDKMCSVYNIVPKSEHYGCLVDLLSRAGLFEEAMVMIRK 402 Query: 379 ITNSWSGSEEILAWRAFLSACCNHGXXXXXXXXXXXXLRLDN--HSGVYVLLSNLYAASG 206 ITNSW+GSEE LAWRAFLSACCNHG L+LDN HSGVYVLLSNLYAASG Sbjct: 403 ITNSWNGSEETLAWRAFLSACCNHGETQLAELAAEKVLQLDNHIHSGVYVLLSNLYAASG 462 Query: 205 KHTDARRVRDVMKIKGANKAPGCSSVEIDGVVSEFIAGEKTHPHIDEIHSILEKMHMQLD 26 KH+DARRVRD+MKIKG NKAPGCSSVEIDGV+SEFIAGEKTHP ++EIHS+L+KMHMQLD Sbjct: 463 KHSDARRVRDMMKIKGTNKAPGCSSVEIDGVISEFIAGEKTHPQMEEIHSVLKKMHMQLD 522 Score = 72.8 bits (177), Expect = 5e-11 Identities = 41/143 (28%), Positives = 73/143 (51%) Frame = -3 Query: 733 YAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDITF 554 YAK G++D A+ FD PEKD W AMISG + K +L LF M+ + PD+ F Sbjct: 183 YAKVGDVDSARLFFDEAPEKDKGIWGAMISGYVQNSCFKESLYLFRLMQLTDIVPDESIF 242 Query: 553 IAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRRIT 374 +++ +AC++ G G+ + + + + L+D+ ++ G E + +R+ Sbjct: 243 VSILSACAHLGALEIGVWIHQHLNQLKLVPLSVRLSTSLLDMYAKCGNLE----LAKRLF 298 Query: 373 NSWSGSEEILAWRAFLSACCNHG 305 +S + +++ W A +S HG Sbjct: 299 DSMN-MRDVVCWNAMISGMAMHG 320 >gb|KYP65771.1| Pentatricopeptide repeat-containing protein At2g20540 family [Cajanus cajan] Length = 500 Score = 388 bits (996), Expect = e-131 Identities = 190/238 (79%), Positives = 206/238 (86%) Frame = -3 Query: 739 DMYAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDI 560 DMYAKCGNLD+AKRLFD PEKDIVCWNAMISGMAMHGDG SALK+F ME+ G+KPDDI Sbjct: 263 DMYAKCGNLDMAKRLFDSAPEKDIVCWNAMISGMAMHGDGASALKMFLDMERTGMKPDDI 322 Query: 559 TFIAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRR 380 TFIAVFTACSYSG+A EGL+LL +MCSVY I PKSEHYGCLVDLLSR+G FEEA+V+MRR Sbjct: 323 TFIAVFTACSYSGLAHEGLQLLHKMCSVYKIVPKSEHYGCLVDLLSRSGHFEEAMVMMRR 382 Query: 379 ITNSWSGSEEILAWRAFLSACCNHGXXXXXXXXXXXXLRLDNHSGVYVLLSNLYAASGKH 200 ITNSW+ SEE LAWRAFLSACCN G L LDNHSGVYVLLSNLYAASGKH Sbjct: 383 ITNSWNASEETLAWRAFLSACCNQGQAQLAESAAERLLLLDNHSGVYVLLSNLYAASGKH 442 Query: 199 TDARRVRDVMKIKGANKAPGCSSVEIDGVVSEFIAGEKTHPHIDEIHSILEKMHMQLD 26 +DARRVRD+MK KG KAPGCSSVEIDGVVSEFIAGE+THP + EIHS+LEKMHMQLD Sbjct: 443 SDARRVRDMMKNKGVEKAPGCSSVEIDGVVSEFIAGEETHPQMKEIHSVLEKMHMQLD 500 Score = 72.8 bits (177), Expect = 5e-11 Identities = 44/143 (30%), Positives = 74/143 (51%) Frame = -3 Query: 733 YAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDITF 554 YAK GN+D A+ FD PEKD W AMISG + K L LF ++ V PD+ F Sbjct: 164 YAKAGNVDSARLFFDEAPEKDRGIWGAMISGYVQNSCFKEGLHLFRLLQLTEVVPDESIF 223 Query: 553 IAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRRIT 374 +++ +AC++ G G+ + R + + + L+D+ ++ G + A +R+ Sbjct: 224 VSILSACAHLGALDIGI-WIHRYLNRAMVPLSTRLSTSLLDMYAKCGNLDMA----KRLF 278 Query: 373 NSWSGSEEILAWRAFLSACCNHG 305 +S + ++I+ W A +S HG Sbjct: 279 DS-APEKDIVCWNAMISGMAMHG 300 >ref|XP_020218171.1| pentatricopeptide repeat-containing protein At2g20540-like [Cajanus cajan] Length = 517 Score = 388 bits (996), Expect = e-130 Identities = 190/238 (79%), Positives = 206/238 (86%) Frame = -3 Query: 739 DMYAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDI 560 DMYAKCGNLD+AKRLFD PEKDIVCWNAMISGMAMHGDG SALK+F ME+ G+KPDDI Sbjct: 280 DMYAKCGNLDMAKRLFDSAPEKDIVCWNAMISGMAMHGDGASALKMFLDMERTGMKPDDI 339 Query: 559 TFIAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRR 380 TFIAVFTACSYSG+A EGL+LL +MCSVY I PKSEHYGCLVDLLSR+G FEEA+V+MRR Sbjct: 340 TFIAVFTACSYSGLAHEGLQLLHKMCSVYKIVPKSEHYGCLVDLLSRSGHFEEAMVMMRR 399 Query: 379 ITNSWSGSEEILAWRAFLSACCNHGXXXXXXXXXXXXLRLDNHSGVYVLLSNLYAASGKH 200 ITNSW+ SEE LAWRAFLSACCN G L LDNHSGVYVLLSNLYAASGKH Sbjct: 400 ITNSWNASEETLAWRAFLSACCNQGQAQLAESAAERLLLLDNHSGVYVLLSNLYAASGKH 459 Query: 199 TDARRVRDVMKIKGANKAPGCSSVEIDGVVSEFIAGEKTHPHIDEIHSILEKMHMQLD 26 +DARRVRD+MK KG KAPGCSSVEIDGVVSEFIAGE+THP + EIHS+LEKMHMQLD Sbjct: 460 SDARRVRDMMKNKGVEKAPGCSSVEIDGVVSEFIAGEETHPQMKEIHSVLEKMHMQLD 517 Score = 72.8 bits (177), Expect = 5e-11 Identities = 44/143 (30%), Positives = 74/143 (51%) Frame = -3 Query: 733 YAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDITF 554 YAK GN+D A+ FD PEKD W AMISG + K L LF ++ V PD+ F Sbjct: 181 YAKAGNVDSARLFFDEAPEKDRGIWGAMISGYVQNSCFKEGLHLFRLLQLTEVVPDESIF 240 Query: 553 IAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRRIT 374 +++ +AC++ G G+ + R + + + L+D+ ++ G + A +R+ Sbjct: 241 VSILSACAHLGALDIGI-WIHRYLNRAMVPLSTRLSTSLLDMYAKCGNLDMA----KRLF 295 Query: 373 NSWSGSEEILAWRAFLSACCNHG 305 +S + ++I+ W A +S HG Sbjct: 296 DS-APEKDIVCWNAMISGMAMHG 317 >ref|XP_003520106.1| PREDICTED: pentatricopeptide repeat-containing protein At2g20540-like [Glycine max] gb|KRH70861.1| hypothetical protein GLYMA_02G114500 [Glycine max] Length = 518 Score = 387 bits (994), Expect = e-130 Identities = 189/239 (79%), Positives = 213/239 (89%), Gaps = 1/239 (0%) Frame = -3 Query: 739 DMYAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDI 560 DMYAKCGNL+LAKRLFD MPE+DIVCWNAMISG+AMHGDG SALK+FS MEK G+KPDDI Sbjct: 280 DMYAKCGNLELAKRLFDSMPERDIVCWNAMISGLAMHGDGASALKMFSEMEKTGIKPDDI 339 Query: 559 TFIAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRR 380 TFIAVFTACSYSGMA EGL+LLD+M S+Y IEPKSEHYGCLVDLLSRAGLF EA+V++RR Sbjct: 340 TFIAVFTACSYSGMAHEGLQLLDKMSSLYEIEPKSEHYGCLVDLLSRAGLFGEAMVMIRR 399 Query: 379 ITN-SWSGSEEILAWRAFLSACCNHGXXXXXXXXXXXXLRLDNHSGVYVLLSNLYAASGK 203 IT+ SW+GSEE LAWRAFLSACCNHG LRL+NHSGVYVLLSNLYAASGK Sbjct: 400 ITSTSWNGSEETLAWRAFLSACCNHGQAQLAERAAKRLLRLENHSGVYVLLSNLYAASGK 459 Query: 202 HTDARRVRDVMKIKGANKAPGCSSVEIDGVVSEFIAGEKTHPHIDEIHSILEKMHMQLD 26 H+DARRVR++M+ KG +KAPGCSSVEIDGVVSEFIAGE+THP ++EIHS+LE +HMQLD Sbjct: 460 HSDARRVRNMMRNKGVDKAPGCSSVEIDGVVSEFIAGEETHPQMEEIHSVLEILHMQLD 518 Score = 70.9 bits (172), Expect = 2e-10 Identities = 45/146 (30%), Positives = 74/146 (50%), Gaps = 3/146 (2%) Frame = -3 Query: 733 YAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDITF 554 YAK G++D A+ FD PEKD W AMISG + K L LF ++ V PD+ F Sbjct: 181 YAKVGDVDSARLFFDEAPEKDRGIWGAMISGYVQNSCFKEGLYLFRLLQLTHVVPDESIF 240 Query: 553 IAVFTACSYSGMASEGL---KLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMR 383 +++ +AC++ G G+ + L+R +I + L+D+ ++ G E + + Sbjct: 241 VSILSACAHLGALDIGIWIHRYLNRKTVSLSIRLSTS----LLDMYAKCGNLE----LAK 292 Query: 382 RITNSWSGSEEILAWRAFLSACCNHG 305 R+ +S +I+ W A +S HG Sbjct: 293 RLFDSMP-ERDIVCWNAMISGLAMHG 317 >gb|OIW02168.1| hypothetical protein TanjilG_02392 [Lupinus angustifolius] Length = 500 Score = 378 bits (970), Expect = e-127 Identities = 184/238 (77%), Positives = 210/238 (88%) Frame = -3 Query: 739 DMYAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDI 560 DMYAKCG+L LAK LFDLMPE+DIVCWNAMISGMAMHG+G SALKLFS MEK G++PDDI Sbjct: 263 DMYAKCGHLKLAKALFDLMPERDIVCWNAMISGMAMHGNGTSALKLFSDMEKAGIEPDDI 322 Query: 559 TFIAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRR 380 TFIAVFTACS+SGMA EGL+LL++MCSVY IEPKSEHYGC+VDLLSRA LFEEA+ ++RR Sbjct: 323 TFIAVFTACSHSGMAYEGLQLLEKMCSVYKIEPKSEHYGCIVDLLSRASLFEEAMAVIRR 382 Query: 379 ITNSWSGSEEILAWRAFLSACCNHGXXXXXXXXXXXXLRLDNHSGVYVLLSNLYAASGKH 200 IT+S +GSEE LAWRAFLSACCNHG L+L+NHSGVYVLLSNLYAAS KH Sbjct: 383 ITSSSNGSEETLAWRAFLSACCNHGQAQLAEFAAERLLQLENHSGVYVLLSNLYAASEKH 442 Query: 199 TDARRVRDVMKIKGANKAPGCSSVEIDGVVSEFIAGEKTHPHIDEIHSILEKMHMQLD 26 +DARRVRD+MK KGA+K PGCSSVEIDGVV+EFIAGEK HP +++I+S+LEKMHMQLD Sbjct: 443 SDARRVRDMMKNKGADKTPGCSSVEIDGVVNEFIAGEKVHPLMEDIYSVLEKMHMQLD 500 Score = 65.1 bits (157), Expect = 2e-08 Identities = 39/143 (27%), Positives = 68/143 (47%) Frame = -3 Query: 733 YAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDITF 554 YAK G+++ A+ FD PEKD W AMISG + K L +F M+ + PD+ Sbjct: 164 YAKVGDVNSARFFFDEAPEKDRGIWGAMISGYVQNNCFKEGLYMFHLMQLTDIVPDESIL 223 Query: 553 IAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRRIT 374 +++F+AC++ G G+ + R + I ++D+ ++ G + A + + Sbjct: 224 VSIFSACAHLGALDIGI-WIHRYLNQARIPLSVRLSTSILDMYAKCGHLKLAKALFDLMP 282 Query: 373 NSWSGSEEILAWRAFLSACCNHG 305 +I+ W A +S HG Sbjct: 283 -----ERDIVCWNAMISGMAMHG 300 >ref|XP_019459635.1| PREDICTED: pentatricopeptide repeat-containing protein At2g20540-like [Lupinus angustifolius] Length = 522 Score = 378 bits (970), Expect = e-126 Identities = 184/238 (77%), Positives = 210/238 (88%) Frame = -3 Query: 739 DMYAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDI 560 DMYAKCG+L LAK LFDLMPE+DIVCWNAMISGMAMHG+G SALKLFS MEK G++PDDI Sbjct: 285 DMYAKCGHLKLAKALFDLMPERDIVCWNAMISGMAMHGNGTSALKLFSDMEKAGIEPDDI 344 Query: 559 TFIAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRR 380 TFIAVFTACS+SGMA EGL+LL++MCSVY IEPKSEHYGC+VDLLSRA LFEEA+ ++RR Sbjct: 345 TFIAVFTACSHSGMAYEGLQLLEKMCSVYKIEPKSEHYGCIVDLLSRASLFEEAMAVIRR 404 Query: 379 ITNSWSGSEEILAWRAFLSACCNHGXXXXXXXXXXXXLRLDNHSGVYVLLSNLYAASGKH 200 IT+S +GSEE LAWRAFLSACCNHG L+L+NHSGVYVLLSNLYAAS KH Sbjct: 405 ITSSSNGSEETLAWRAFLSACCNHGQAQLAEFAAERLLQLENHSGVYVLLSNLYAASEKH 464 Query: 199 TDARRVRDVMKIKGANKAPGCSSVEIDGVVSEFIAGEKTHPHIDEIHSILEKMHMQLD 26 +DARRVRD+MK KGA+K PGCSSVEIDGVV+EFIAGEK HP +++I+S+LEKMHMQLD Sbjct: 465 SDARRVRDMMKNKGADKTPGCSSVEIDGVVNEFIAGEKVHPLMEDIYSVLEKMHMQLD 522 Score = 65.1 bits (157), Expect = 2e-08 Identities = 39/143 (27%), Positives = 68/143 (47%) Frame = -3 Query: 733 YAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDITF 554 YAK G+++ A+ FD PEKD W AMISG + K L +F M+ + PD+ Sbjct: 186 YAKVGDVNSARFFFDEAPEKDRGIWGAMISGYVQNNCFKEGLYMFHLMQLTDIVPDESIL 245 Query: 553 IAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRRIT 374 +++F+AC++ G G+ + R + I ++D+ ++ G + A + + Sbjct: 246 VSIFSACAHLGALDIGI-WIHRYLNQARIPLSVRLSTSILDMYAKCGHLKLAKALFDLMP 304 Query: 373 NSWSGSEEILAWRAFLSACCNHG 305 +I+ W A +S HG Sbjct: 305 -----ERDIVCWNAMISGMAMHG 322 >ref|XP_007157581.1| hypothetical protein PHAVU_002G081200g [Phaseolus vulgaris] gb|ESW29575.1| hypothetical protein PHAVU_002G081200g [Phaseolus vulgaris] Length = 517 Score = 377 bits (968), Expect = e-126 Identities = 183/238 (76%), Positives = 207/238 (86%) Frame = -3 Query: 739 DMYAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDI 560 DMYAKCGNLDLAKRLFDLMPE+DIVCWNAMISG AMHGDG SALK+FS MEK G++PDD+ Sbjct: 280 DMYAKCGNLDLAKRLFDLMPERDIVCWNAMISGTAMHGDGASALKMFSDMEKAGIRPDDV 339 Query: 559 TFIAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRR 380 TFIAVFTACSYSGMA EGL+LL +MCSVY IEPK+EHY CLVDLLSRAGLF+EA+V+MRR Sbjct: 340 TFIAVFTACSYSGMAHEGLQLLCKMCSVYKIEPKNEHYSCLVDLLSRAGLFQEAMVMMRR 399 Query: 379 ITNSWSGSEEILAWRAFLSACCNHGXXXXXXXXXXXXLRLDNHSGVYVLLSNLYAASGKH 200 I+N + S+E LAWRAFLSACCNHG +RL+NHSGVYVLLSNLY+ASGKH Sbjct: 400 ISNLGNVSDETLAWRAFLSACCNHGQAQLAERAAERLIRLENHSGVYVLLSNLYSASGKH 459 Query: 199 TDARRVRDVMKIKGANKAPGCSSVEIDGVVSEFIAGEKTHPHIDEIHSILEKMHMQLD 26 +DARRVRD+M+ KG +K PG SSVEI GVVSEFIAGE+THP + EIHS+LEKMHMQLD Sbjct: 460 SDARRVRDMMRNKGVDKVPGSSSVEIGGVVSEFIAGEETHPMMKEIHSVLEKMHMQLD 517 Score = 64.7 bits (156), Expect = 3e-08 Identities = 42/146 (28%), Positives = 70/146 (47%), Gaps = 3/146 (2%) Frame = -3 Query: 733 YAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDITF 554 YAK G++D A+ FD PEKD W AMISG + K L LF ++ V PD+ Sbjct: 181 YAKVGDVDSARLFFDEAPEKDRGIWGAMISGYVQNSCFKEGLYLFRLLQLTEVVPDESIC 240 Query: 553 IAVFTACSYSGMASEGL---KLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMR 383 +++ +AC++ G G+ + L+R +I + L+D+ ++ G + A + Sbjct: 241 VSILSACAHLGALDIGIWIHRYLNRAAVPLSIRLSTS----LLDMYAKCGNLDLAKRLFD 296 Query: 382 RITNSWSGSEEILAWRAFLSACCNHG 305 + +I+ W A +S HG Sbjct: 297 LMP-----ERDIVCWNAMISGTAMHG 317 >ref|XP_017435484.1| PREDICTED: pentatricopeptide repeat-containing protein At2g20540-like [Vigna angularis] gb|KOM53127.1| hypothetical protein LR48_Vigan09g178600 [Vigna angularis] dbj|BAT99107.1| hypothetical protein VIGAN_10049200 [Vigna angularis var. angularis] Length = 517 Score = 373 bits (957), Expect = e-125 Identities = 184/238 (77%), Positives = 206/238 (86%) Frame = -3 Query: 739 DMYAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDI 560 DMYAKCGNLDLAKRLFDLMP++DIVCWNAMISGMA+HGDG SALKLFS MEK G+KPDDI Sbjct: 280 DMYAKCGNLDLAKRLFDLMPQRDIVCWNAMISGMAIHGDGASALKLFSDMEKAGIKPDDI 339 Query: 559 TFIAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRR 380 TFIAVFTACSYSGMA EGL+LL +MCSVY IEPKSEHY CLVDLLSRAG FEEA+V++RR Sbjct: 340 TFIAVFTACSYSGMAHEGLQLLYKMCSVYKIEPKSEHYSCLVDLLSRAGHFEEAMVMLRR 399 Query: 379 ITNSWSGSEEILAWRAFLSACCNHGXXXXXXXXXXXXLRLDNHSGVYVLLSNLYAASGKH 200 I++S + SEE LAWRAFLSACCNHG LRL NHSGVYVLLSN+Y+ASGKH Sbjct: 400 ISSSGNVSEETLAWRAFLSACCNHGQAQLAERAAERLLRLQNHSGVYVLLSNVYSASGKH 459 Query: 199 TDARRVRDVMKIKGANKAPGCSSVEIDGVVSEFIAGEKTHPHIDEIHSILEKMHMQLD 26 +DARRVRD+M+ K +K PG SSVEI GVVSEFIAGE+THP + EIHS+LEKMH+QLD Sbjct: 460 SDARRVRDMMRNKRVDKVPGSSSVEIGGVVSEFIAGEETHPMMKEIHSVLEKMHLQLD 517 Score = 60.1 bits (144), Expect = 1e-06 Identities = 41/146 (28%), Positives = 68/146 (46%), Gaps = 3/146 (2%) Frame = -3 Query: 733 YAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDITF 554 YAK G++D A+ F PEKD W AMISG + K L LF ++ V PD+ Sbjct: 181 YAKVGDVDSARLFFYEAPEKDRGIWGAMISGYVQNSCFKEGLYLFRLLQLTEVVPDESIC 240 Query: 553 IAVFTACSYSGMASEGL---KLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMR 383 +++ +AC++ G G+ + L R +I + L+D+ ++ G + A + Sbjct: 241 VSILSACAHLGALDIGIWIHRYLKRAALPLSIRLSTS----LLDMYAKCGNLDLAKRLFD 296 Query: 382 RITNSWSGSEEILAWRAFLSACCNHG 305 + +I+ W A +S HG Sbjct: 297 LMP-----QRDIVCWNAMISGMAIHG 317 >gb|KHN02018.1| Pentatricopeptide repeat-containing protein [Glycine soja] Length = 518 Score = 373 bits (957), Expect = e-125 Identities = 183/239 (76%), Positives = 210/239 (87%), Gaps = 1/239 (0%) Frame = -3 Query: 739 DMYAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDI 560 D+YAKC NL+L KRLF+ MPE++IV WNAMISG+AMHGDG SALK+FS MEK G+KPDDI Sbjct: 280 DIYAKCRNLELTKRLFNSMPERNIVFWNAMISGLAMHGDGASALKMFSEMEKTGIKPDDI 339 Query: 559 TFIAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRR 380 TFIAVFTACSYSGMA EGL+LLD+M S+Y IEPKSEHYGCLVDLLSRAGLF EA+V++RR Sbjct: 340 TFIAVFTACSYSGMAHEGLQLLDKMSSLYEIEPKSEHYGCLVDLLSRAGLFGEAMVMIRR 399 Query: 379 ITN-SWSGSEEILAWRAFLSACCNHGXXXXXXXXXXXXLRLDNHSGVYVLLSNLYAASGK 203 IT+ SW+GSEE LAWRAFLSACCNHG LRL+NHSGVYVLLSNLYAASGK Sbjct: 400 ITSTSWNGSEETLAWRAFLSACCNHGQAQLAERAAKRLLRLENHSGVYVLLSNLYAASGK 459 Query: 202 HTDARRVRDVMKIKGANKAPGCSSVEIDGVVSEFIAGEKTHPHIDEIHSILEKMHMQLD 26 H+DARRVR++M+ KG +KAPGCSSVEIDGVVSEFIAGE+THP ++EIHS+LE +HMQLD Sbjct: 460 HSDARRVRNMMRNKGVDKAPGCSSVEIDGVVSEFIAGEETHPQMEEIHSVLEILHMQLD 518 Score = 68.2 bits (165), Expect = 2e-09 Identities = 45/146 (30%), Positives = 72/146 (49%), Gaps = 3/146 (2%) Frame = -3 Query: 733 YAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDITF 554 YAK G++D A+ FD PEKD W AMISG + K L LF ++ V PD+ F Sbjct: 181 YAKVGDVDSARLFFDEAPEKDRGIWGAMISGYVQNSCFKEGLYLFRLLQLTHVVPDESIF 240 Query: 553 IAVFTACSYSGMASEGL---KLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMR 383 +++ +AC++ G G+ + L+R +I + L+D+ ++ L + + Sbjct: 241 VSILSACAHLGALDIGIWIHRYLNRKTVSLSIRLSTS----LLDIYAKC----RNLELTK 292 Query: 382 RITNSWSGSEEILAWRAFLSACCNHG 305 R+ NS I+ W A +S HG Sbjct: 293 RLFNSMP-ERNIVFWNAMISGLAMHG 317 >ref|XP_014492913.1| pentatricopeptide repeat-containing protein At2g20540-like [Vigna radiata var. radiata] Length = 517 Score = 371 bits (952), Expect = e-124 Identities = 183/238 (76%), Positives = 205/238 (86%) Frame = -3 Query: 739 DMYAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDI 560 DMYAKCGNL+LAKRLFDLMPE+DIVCWNAMISGMAMHGDG SALKLFS MEK G+KPDDI Sbjct: 280 DMYAKCGNLELAKRLFDLMPERDIVCWNAMISGMAMHGDGASALKLFSDMEKAGIKPDDI 339 Query: 559 TFIAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRR 380 TFIAVFTACSYSGMA EGL+LL +MCS Y IEPKSEHY CLVDLLSRAGLFEEA+V++RR Sbjct: 340 TFIAVFTACSYSGMAHEGLQLLHKMCSEYKIEPKSEHYSCLVDLLSRAGLFEEAMVMIRR 399 Query: 379 ITNSWSGSEEILAWRAFLSACCNHGXXXXXXXXXXXXLRLDNHSGVYVLLSNLYAASGKH 200 I++S + SEE LAWRAFLSACCNHG LRL NHSGVYVLLSN+Y+ASGKH Sbjct: 400 ISSSGNVSEETLAWRAFLSACCNHGQAQLAERVAERLLRLQNHSGVYVLLSNVYSASGKH 459 Query: 199 TDARRVRDVMKIKGANKAPGCSSVEIDGVVSEFIAGEKTHPHIDEIHSILEKMHMQLD 26 +DARRVR++M+ K +K PG SSVEI GVVSEFIAGE+THP EIHS+LEK+H+QLD Sbjct: 460 SDARRVREMMRNKRVDKVPGSSSVEIGGVVSEFIAGEETHPMTKEIHSVLEKIHLQLD 517 Score = 63.9 bits (154), Expect = 5e-08 Identities = 41/143 (28%), Positives = 67/143 (46%) Frame = -3 Query: 733 YAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDITF 554 YAK G++D A+ FD PEKD W AMISG + K L LF ++ V PD+ Sbjct: 181 YAKVGDVDSARLFFDEAPEKDRGIWGAMISGYVQNSCFKEGLYLFRLLQLTEVVPDESIC 240 Query: 553 IAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRRIT 374 +++ +AC++ G G+ + R + + L+D+ ++ G E A + + Sbjct: 241 VSILSACAHLGALDIGI-WIHRYLKRAALPLSTRLSTSLLDMYAKCGNLELAKRLFDLMP 299 Query: 373 NSWSGSEEILAWRAFLSACCNHG 305 +I+ W A +S HG Sbjct: 300 -----ERDIVCWNAMISGMAMHG 317 >gb|KRH75002.1| hypothetical protein GLYMA_01G056200 [Glycine max] Length = 477 Score = 360 bits (923), Expect = e-120 Identities = 176/239 (73%), Positives = 205/239 (85%), Gaps = 1/239 (0%) Frame = -3 Query: 739 DMYAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDI 560 D+YAKC NL+L KRLF+ MPE++IV WNAMISG+AMHGDG SALKLFS MEK G++PD+I Sbjct: 239 DIYAKCRNLELTKRLFNSMPERNIVFWNAMISGLAMHGDGASALKLFSDMEKAGIRPDNI 298 Query: 559 TFIAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRR 380 FIAVFTAC YSGMA EGL+LL +MCSVY IEPKSE YGCLVDLL+RAGLFEEA+V+MRR Sbjct: 299 AFIAVFTACRYSGMAHEGLQLLHKMCSVYKIEPKSEQYGCLVDLLTRAGLFEEAMVMMRR 358 Query: 379 IT-NSWSGSEEILAWRAFLSACCNHGXXXXXXXXXXXXLRLDNHSGVYVLLSNLYAASGK 203 IT NSW+GSEE LAWRAFLSACCNHG LRL+NHSGVYVLLS+LY ASGK Sbjct: 359 ITSNSWNGSEETLAWRAFLSACCNHGHAQLAQCAAERLLRLENHSGVYVLLSSLYGASGK 418 Query: 202 HTDARRVRDVMKIKGANKAPGCSSVEIDGVVSEFIAGEKTHPHIDEIHSILEKMHMQLD 26 H+++RRVRD+M+ KG +KAPGCS+VE DGVV+EFIAGE+TH ++EIH ILEK+HMQLD Sbjct: 419 HSNSRRVRDMMRNKGVDKAPGCSTVESDGVVNEFIAGEETHSQMEEIHPILEKLHMQLD 477 Score = 68.9 bits (167), Expect = 1e-09 Identities = 45/144 (31%), Positives = 71/144 (49%), Gaps = 1/144 (0%) Frame = -3 Query: 733 YAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDITF 554 YAK G++D A+ FD PEKD W AMISG + K L LF ++ V PDD F Sbjct: 142 YAKVGDVDSARLFFDEAPEKDRGTWGAMISGYVQNSCFKEGLHLFRLLQLAHVVPDDSIF 201 Query: 553 IAVFTACSYSGMASEGL-KLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRRI 377 +++ +AC++ G G+ L+R ++ + L+D+ ++ L + +R+ Sbjct: 202 VSILSACAHLGALDIGIYTYLNRKTVPLSLRLSTS----LLDIYAKC----RNLELTKRL 253 Query: 376 TNSWSGSEEILAWRAFLSACCNHG 305 NS I+ W A +S HG Sbjct: 254 FNSMP-ERNIVFWNAMISGLAMHG 276 >ref|XP_016203711.1| pentatricopeptide repeat-containing protein At5g06540-like [Arachis ipaensis] Length = 518 Score = 356 bits (914), Expect = e-118 Identities = 174/239 (72%), Positives = 204/239 (85%), Gaps = 1/239 (0%) Frame = -3 Query: 739 DMYAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDI 560 DMYAKCG+LD+AKRLFD M E+D+VCWNAMI GMAMHGDG SAL+LFS MEK G+K DD+ Sbjct: 280 DMYAKCGHLDMAKRLFDSMKERDVVCWNAMIFGMAMHGDGISALQLFSDMEKAGIKVDDL 339 Query: 559 TFIAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRR 380 TFIA+FTACSYSGM EGLKLLD+MC+V+ I+PK EHYGCLVDLLSRAGLF+EA+V++RR Sbjct: 340 TFIAIFTACSYSGMVLEGLKLLDKMCNVHKIKPKIEHYGCLVDLLSRAGLFKEAMVMIRR 399 Query: 379 ITNSWSG-SEEILAWRAFLSACCNHGXXXXXXXXXXXXLRLDNHSGVYVLLSNLYAASGK 203 I +S SG SEE L+WRAFLSACCNHG L+L+NHSGVYVLLSNLYA +GK Sbjct: 400 IRDSSSGTSEEALSWRAFLSACCNHGQTKIAEFAAAKLLKLENHSGVYVLLSNLYATTGK 459 Query: 202 HTDARRVRDVMKIKGANKAPGCSSVEIDGVVSEFIAGEKTHPHIDEIHSILEKMHMQLD 26 +++ RRVRD+MK KG+ KAPGCSSVEIDGV+SEFIA EKTH + EIHS+L+KMHMQLD Sbjct: 460 YSETRRVRDMMKNKGSEKAPGCSSVEIDGVISEFIASEKTHLQMHEIHSVLKKMHMQLD 518 Score = 60.8 bits (146), Expect = 6e-07 Identities = 38/146 (26%), Positives = 70/146 (47%), Gaps = 3/146 (2%) Frame = -3 Query: 733 YAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDITF 554 YAK G++D A+ FD PE+D W AMISG + L LF ++ PD+ Sbjct: 181 YAKVGDIDSARLFFDEAPERDRGIWGAMISGYVKNNCFNEGLHLFRLIQSTDEVPDESIL 240 Query: 553 IAVFTACSYSGMASEGL---KLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMR 383 +++ +AC++ G G+ + L+ ++I + L+D+ ++ G + A + Sbjct: 241 VSILSACAHLGALDIGIWVHRYLNEARVTFSIRLSTS----LLDMYAKCGHLDMA----K 292 Query: 382 RITNSWSGSEEILAWRAFLSACCNHG 305 R+ +S +++ W A + HG Sbjct: 293 RLFDSMK-ERDVVCWNAMIFGMAMHG 317 >gb|PNX95537.1| pentatricopeptide repeat-containing protein at2g20540-like protein [Trifolium pratense] Length = 539 Score = 355 bits (911), Expect = e-117 Identities = 172/215 (80%), Positives = 189/215 (87%), Gaps = 2/215 (0%) Frame = -3 Query: 739 DMYAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDI 560 DMYAKCGNL+LAKRLFD M E+DI+CWNAMISGMAMHGDGK ALKLF MEK+G+KPDDI Sbjct: 186 DMYAKCGNLELAKRLFDSMQERDIICWNAMISGMAMHGDGKGALKLFYEMEKVGMKPDDI 245 Query: 559 TFIAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRR 380 TFIAVFTACSYSGMA EGL LLD+MCSVY+IEPKSEHY CLVDLLSR G FEEA+V++R+ Sbjct: 246 TFIAVFTACSYSGMAYEGLMLLDKMCSVYHIEPKSEHYSCLVDLLSRKGFFEEAMVVIRK 305 Query: 379 ITNSWSGSEEILAWRAFLSACCNHGXXXXXXXXXXXXLRLDN--HSGVYVLLSNLYAASG 206 +TNSW+GSEE LAWRAFLSACCNHG RLDN HSGVYVLLSNLYAASG Sbjct: 306 MTNSWNGSEETLAWRAFLSACCNHGETQLAELAAEKVFRLDNHIHSGVYVLLSNLYAASG 365 Query: 205 KHTDARRVRDVMKIKGANKAPGCSSVEIDGVVSEF 101 KHTDARRVRD+MKIKGANK+PGCSSVEIDGVV++F Sbjct: 366 KHTDARRVRDMMKIKGANKSPGCSSVEIDGVVTQF 400 Score = 73.2 bits (178), Expect = 4e-11 Identities = 45/144 (31%), Positives = 74/144 (51%), Gaps = 1/144 (0%) Frame = -3 Query: 733 YAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDITF 554 YAK G++D A+ FD PEKD W AMISG + K +L LF M+ + PD+ F Sbjct: 86 YAKVGDVDSARLFFDEAPEKDKGIWGAMISGYVQNSCFKESLYLFRLMQLTDIVPDESIF 145 Query: 553 IAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYG-CLVDLLSRAGLFEEALVIMRRI 377 +++ +AC++ G G+ + R + + P S L+D+ ++ G E + +R+ Sbjct: 146 VSILSACAHLGALDIGV-WIHRCLNRSKLVPLSVRLSTSLLDMYAKCGNLE----LAKRL 200 Query: 376 TNSWSGSEEILAWRAFLSACCNHG 305 +S +I+ W A +S HG Sbjct: 201 FDSMQ-ERDIICWNAMISGMAMHG 223 >ref|XP_015966522.1| pentatricopeptide repeat-containing protein At5g06540-like [Arachis duranensis] ref|XP_020998694.1| pentatricopeptide repeat-containing protein At5g06540-like [Arachis duranensis] ref|XP_020998695.1| pentatricopeptide repeat-containing protein At5g06540-like [Arachis duranensis] Length = 520 Score = 354 bits (908), Expect = e-117 Identities = 174/239 (72%), Positives = 203/239 (84%), Gaps = 1/239 (0%) Frame = -3 Query: 739 DMYAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDI 560 DMYAKCG+L +AKRLFD M E+D+VCWNAMI GMAMHGDG SAL+LF MEK G+K DD+ Sbjct: 282 DMYAKCGHLVMAKRLFDSMKERDVVCWNAMIFGMAMHGDGISALQLFLDMEKAGIKLDDL 341 Query: 559 TFIAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRR 380 TFIA+FTACSYSGM EGLKLLD+MC+V+ IEPK EHYGCLVDLLSRAGLF+EA+V++RR Sbjct: 342 TFIAIFTACSYSGMVLEGLKLLDKMCNVHKIEPKIEHYGCLVDLLSRAGLFKEAMVMIRR 401 Query: 379 ITNSWSG-SEEILAWRAFLSACCNHGXXXXXXXXXXXXLRLDNHSGVYVLLSNLYAASGK 203 I +S SG SEE L+WRAFLSACCNHG L+L+NHSGVYVLLSNLYA +GK Sbjct: 402 IRDSSSGTSEEALSWRAFLSACCNHGQTKFAELAAAKLLKLENHSGVYVLLSNLYATTGK 461 Query: 202 HTDARRVRDVMKIKGANKAPGCSSVEIDGVVSEFIAGEKTHPHIDEIHSILEKMHMQLD 26 +++ RRVRD+MK KG+ KAPGCSSVEIDGV+SEFIAGEKTH + EIHS+L+KMHMQLD Sbjct: 462 YSETRRVRDMMKNKGSEKAPGCSSVEIDGVISEFIAGEKTHLQMHEIHSVLKKMHMQLD 520 Score = 63.2 bits (152), Expect = 9e-08 Identities = 38/146 (26%), Positives = 72/146 (49%), Gaps = 3/146 (2%) Frame = -3 Query: 733 YAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDITF 554 YAK G++D A+ FD PE+D+ W AMISG + L LF ++ PD+ Sbjct: 183 YAKVGDIDSARLFFDEAPERDMGIWGAMISGYVKNNCFNEGLHLFRLIQSTDEVPDESIL 242 Query: 553 IAVFTACSYSGMASEGL---KLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMR 383 +++ ++C++ G G+ + L+ ++I + L+D+ ++ G LV+ + Sbjct: 243 VSILSSCAHLGALDIGIWVHRYLNEARITFSIRLSTS----LLDMYAKCG----HLVMAK 294 Query: 382 RITNSWSGSEEILAWRAFLSACCNHG 305 R+ +S +++ W A + HG Sbjct: 295 RLFDSMK-ERDVVCWNAMIFGMAMHG 319 >dbj|GAU50198.1| hypothetical protein TSUD_408860 [Trifolium subterraneum] Length = 660 Score = 357 bits (916), Expect = e-117 Identities = 176/214 (82%), Positives = 190/214 (88%), Gaps = 2/214 (0%) Frame = -3 Query: 739 DMYAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDI 560 DMYAKCGNL+LAKRLFD + E+DIVCWNAMISGMAMHGDGK ALKLF MEK+G+KPDDI Sbjct: 283 DMYAKCGNLELAKRLFDSVQERDIVCWNAMISGMAMHGDGKGALKLFYEMEKVGMKPDDI 342 Query: 559 TFIAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRR 380 TFIAVFTACSYSGMA EGLKLLD+MCSVYN+EPKSEHY CLVDLLSR GLFE+A+VI+R+ Sbjct: 343 TFIAVFTACSYSGMAYEGLKLLDKMCSVYNMEPKSEHYSCLVDLLSRKGLFEKAMVIIRK 402 Query: 379 ITNSWSGSEEILAWRAFLSACCNHGXXXXXXXXXXXXLRLDN--HSGVYVLLSNLYAASG 206 +TNSW+GSEE LAWRAFLSACCNHG LRLDN HSGVYVLLSNLYA SG Sbjct: 403 MTNSWNGSEETLAWRAFLSACCNHGETQLAELAAEKVLRLDNHIHSGVYVLLSNLYATSG 462 Query: 205 KHTDARRVRDVMKIKGANKAPGCSSVEIDGVVSE 104 KHTDARRVRDVMKIKGANKAPGCSSVEIDGVV+E Sbjct: 463 KHTDARRVRDVMKIKGANKAPGCSSVEIDGVVNE 496 Score = 72.4 bits (176), Expect = 8e-11 Identities = 46/144 (31%), Positives = 74/144 (51%), Gaps = 1/144 (0%) Frame = -3 Query: 733 YAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDITF 554 YAK G++D A+ FD PEKD W AMISG + K +L LF M+ + PD+ F Sbjct: 183 YAKVGDVDSARLFFDEAPEKDKGIWGAMISGYVQNSCFKESLYLFRLMQLTDIVPDESIF 242 Query: 553 IAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYG-CLVDLLSRAGLFEEALVIMRRI 377 +++ +AC++ G G+ + R S + P S L+D+ ++ G E + +R+ Sbjct: 243 VSILSACAHLGALDIGV-WIHRCLSRSKLIPLSIRLSTSLLDMYAKCGNLE----LAKRL 297 Query: 376 TNSWSGSEEILAWRAFLSACCNHG 305 +S +I+ W A +S HG Sbjct: 298 FDSVQ-ERDIVCWNAMISGMAMHG 320 >gb|KHN03629.1| Pentatricopeptide repeat-containing protein [Glycine soja] Length = 416 Score = 345 bits (884), Expect = e-115 Identities = 170/229 (74%), Positives = 196/229 (85%), Gaps = 1/229 (0%) Frame = -3 Query: 709 LAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDITFIAVFTACS 530 L KRLF+ MPE++IV WNAMISG+AMHGDG SALKLFS MEK G++PD+I FIAVFTAC Sbjct: 188 LTKRLFNSMPERNIVFWNAMISGLAMHGDGASALKLFSDMEKAGIRPDNIAFIAVFTACR 247 Query: 529 YSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRRIT-NSWSGSE 353 YSGMA EGL+LL +MCSVY IEPKSE YGCLVDLL+RAGLFEEA+V+MRRIT NSW+GSE Sbjct: 248 YSGMAHEGLQLLHKMCSVYKIEPKSEQYGCLVDLLTRAGLFEEAMVMMRRITSNSWNGSE 307 Query: 352 EILAWRAFLSACCNHGXXXXXXXXXXXXLRLDNHSGVYVLLSNLYAASGKHTDARRVRDV 173 E LAWRAFLSACCNHG LRL+NHSGVYVLLS+LY ASGKH+++RRVRD+ Sbjct: 308 ETLAWRAFLSACCNHGHAQLAQCAAERLLRLENHSGVYVLLSSLYGASGKHSNSRRVRDM 367 Query: 172 MKIKGANKAPGCSSVEIDGVVSEFIAGEKTHPHIDEIHSILEKMHMQLD 26 M+ KG +KAPGCS+VE DGVVSEFIAGE+TH ++EIH ILEK+HMQLD Sbjct: 368 MRNKGVDKAPGCSTVESDGVVSEFIAGEETHSQMEEIHPILEKLHMQLD 416 >gb|OMO71979.1| hypothetical protein COLO4_27920 [Corchorus olitorius] Length = 523 Score = 335 bits (859), Expect = e-110 Identities = 158/238 (66%), Positives = 199/238 (83%) Frame = -3 Query: 739 DMYAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDI 560 DMYAKCGNLD+AK+LFD M ++D+V WN MISGMAMHGDG+SAL+LF ME+ GV+PDDI Sbjct: 277 DMYAKCGNLDMAKKLFDEMQQRDVVSWNVMISGMAMHGDGESALELFRQMEEDGVRPDDI 336 Query: 559 TFIAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRR 380 TFIAVF+ACSYSGM EGL++LDRMC+VY IEPKSEH+GC++DLLSR GLFEEA I++ Sbjct: 337 TFIAVFSACSYSGMVFEGLRILDRMCNVYKIEPKSEHFGCIIDLLSRGGLFEEANQIIQG 396 Query: 379 ITNSWSGSEEILAWRAFLSACCNHGXXXXXXXXXXXXLRLDNHSGVYVLLSNLYAASGKH 200 I +S + S+E +AWRA LS+CC++G ++L++HSGVYVLLSNLYAASGKH Sbjct: 397 IPDSSNPSDEAIAWRALLSSCCSNGQTKLAEVAAKKLMQLEHHSGVYVLLSNLYAASGKH 456 Query: 199 TDARRVRDVMKIKGANKAPGCSSVEIDGVVSEFIAGEKTHPHIDEIHSILEKMHMQLD 26 DA+R++ +MK +G NKAPGCSSV+I+GVV EFIAGEK+HP +++IHSILEK+ QLD Sbjct: 457 NDAKRIKQMMKNRGVNKAPGCSSVKINGVVHEFIAGEKSHPQLEDIHSILEKLDNQLD 514 Score = 72.8 bits (177), Expect = 5e-11 Identities = 42/146 (28%), Positives = 72/146 (49%), Gaps = 3/146 (2%) Frame = -3 Query: 733 YAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDITF 554 Y K G++D A+ LFD P KD W AMISG + K L +F M+ G++PD+ F Sbjct: 178 YVKMGDIDNARLLFDEAPLKDRGIWGAMISGYVQNNCFKEGLYMFRLMQMSGIEPDEGIF 237 Query: 553 IAVFTACSYSGMASEGL---KLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMR 383 +++ AC++ G G+ K LD+ ++ + CLVD+ ++ G + A + Sbjct: 238 VSIICACAHLGALDTGIWVHKYLDQQKFPLSLRLST----CLVDMYAKCGNLDMAKKLFD 293 Query: 382 RITNSWSGSEEILAWRAFLSACCNHG 305 + ++++W +S HG Sbjct: 294 EMQ-----QRDVVSWNVMISGMAMHG 314 >ref|XP_021621830.1| pentatricopeptide repeat-containing protein At2g20540-like [Manihot esculenta] gb|OAY59401.1| hypothetical protein MANES_01G029700 [Manihot esculenta] Length = 535 Score = 335 bits (859), Expect = e-109 Identities = 158/238 (66%), Positives = 194/238 (81%) Frame = -3 Query: 739 DMYAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDI 560 DMYAKCGNLDLAK+LFD MP++D VCWNAMISG+AMHGDG+ ALKLF M++ G KPDD+ Sbjct: 284 DMYAKCGNLDLAKKLFDEMPQRDTVCWNAMISGLAMHGDGEGALKLFWEMQEAGFKPDDV 343 Query: 559 TFIAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRR 380 T +AVF+ACSYSG A EGL++LDRMC+V+NIEPKSEHYGC+VDLLSRAGL +EA I++R Sbjct: 344 TLMAVFSACSYSGRAHEGLRILDRMCNVHNIEPKSEHYGCMVDLLSRAGLLQEAKEIIQR 403 Query: 379 ITNSWSGSEEILAWRAFLSACCNHGXXXXXXXXXXXXLRLDNHSGVYVLLSNLYAASGKH 200 + NS S SEE +AWRA LSACCN G L+L+ HSG YVLLSNLYA +GKH Sbjct: 404 MPNSRSSSEEAIAWRALLSACCNQGQAQLAEVAAERLLQLELHSGAYVLLSNLYATAGKH 463 Query: 199 TDARRVRDVMKIKGANKAPGCSSVEIDGVVSEFIAGEKTHPHIDEIHSILEKMHMQLD 26 DA+R++ +M+ KG NKAPGCSS++IDG+V EF+AGEKTH ++EI S+LEKM QL+ Sbjct: 464 NDAKRIKKMMRNKGVNKAPGCSSIKIDGIVHEFVAGEKTHKQMEEIESVLEKMKKQLN 521 Score = 64.7 bits (156), Expect = 3e-08 Identities = 41/145 (28%), Positives = 68/145 (46%), Gaps = 3/145 (2%) Frame = -3 Query: 730 AKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDITFI 551 AK G++D A+ FD PEKD W AMISG + K L +F M+ + PD+ F+ Sbjct: 186 AKVGDIDSARLFFDGAPEKDRGIWGAMISGYVQNNCFKECLYMFRLMQMTDMVPDEGIFL 245 Query: 550 AVFTACSYSGMASEGL---KLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRR 380 ++ AC+ G G+ + LDR+ +I + L+D+ ++ G + A + Sbjct: 246 SILCACAQLGALDTGIWIHRYLDRIGQPLSIRLTTS----LIDMYAKCGNLDLAKKLFDE 301 Query: 379 ITNSWSGSEEILAWRAFLSACCNHG 305 + + + W A +S HG Sbjct: 302 MP-----QRDTVCWNAMISGLAMHG 321 >ref|XP_021299781.1| pentatricopeptide repeat-containing protein At5g66520-like [Herrania umbratica] Length = 523 Score = 332 bits (850), Expect = e-108 Identities = 157/238 (65%), Positives = 198/238 (83%) Frame = -3 Query: 739 DMYAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDI 560 DMYAKCGNLD+AK+LFD M ++D+V WNAMISGMAMHGDG+SAL+LF MEK GV+PDDI Sbjct: 280 DMYAKCGNLDIAKKLFDGMQQRDVVSWNAMISGMAMHGDGESALELFRQMEKDGVRPDDI 339 Query: 559 TFIAVFTACSYSGMASEGLKLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMRR 380 TFIAVF+ACSYSGMA EGL +LDRMC+VYNIEPKSEH+GC++DLLSR G FEEA I+RR Sbjct: 340 TFIAVFSACSYSGMAFEGLTILDRMCNVYNIEPKSEHFGCIIDLLSRGGFFEEANKIIRR 399 Query: 379 ITNSWSGSEEILAWRAFLSACCNHGXXXXXXXXXXXXLRLDNHSGVYVLLSNLYAASGKH 200 + +S + S+E +AWRA LS+CC++G ++L++HSGVYVLLSNLYAASGK+ Sbjct: 400 MPDSSNPSDEAIAWRALLSSCCSNGQSKMAEVAAEKLMQLEDHSGVYVLLSNLYAASGKY 459 Query: 199 TDARRVRDVMKIKGANKAPGCSSVEIDGVVSEFIAGEKTHPHIDEIHSILEKMHMQLD 26 DA+ ++ +MK +G NK PGCSSV+I+GVV EFIAGEK+HP +++IH ILEK+ Q+D Sbjct: 460 YDAKIIKQMMKNRGVNKVPGCSSVKINGVVHEFIAGEKSHPQMEDIHLILEKLEKQMD 517 Score = 71.6 bits (174), Expect = 1e-10 Identities = 41/146 (28%), Positives = 74/146 (50%), Gaps = 3/146 (2%) Frame = -3 Query: 733 YAKCGNLDLAKRLFDLMPEKDIVCWNAMISGMAMHGDGKSALKLFSYMEKIGVKPDDITF 554 Y K G++D A+ LFD P KD W AMISG + K L +F M+ ++PD+ + Sbjct: 181 YGKIGDIDTARLLFDEAPVKDTGIWGAMISGYVKNNCFKEGLYMFRLMQMSDIEPDEAIY 240 Query: 553 IAVFTACSYSGMASEGL---KLLDRMCSVYNIEPKSEHYGCLVDLLSRAGLFEEALVIMR 383 +++ AC++ G G+ K LD+ ++ + CL+D+ ++ G L I + Sbjct: 241 VSILCACAHLGALDTGIWIHKYLDKQKFSLSLRLST----CLLDMYAKCG----NLDIAK 292 Query: 382 RITNSWSGSEEILAWRAFLSACCNHG 305 ++ + ++++W A +S HG Sbjct: 293 KLFDGMQ-QRDVVSWNAMISGMAMHG 317