BLASTX nr result
ID: Astragalus23_contig00012417
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus23_contig00012417 (2899 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|KYP39482.1| Pentatricopeptide repeat-containing protein At4g2... 105 2e-19 ref|XP_020203036.1| pentatricopeptide repeat-containing protein ... 105 2e-19 ref|XP_003630936.1| PPR containing plant-like protein [Medicago ... 103 1e-18 gb|KHN05282.1| Pentatricopeptide repeat-containing protein [Glyc... 102 2e-18 gb|KRH12777.1| hypothetical protein GLYMA_15G193700 [Glycine max] 102 2e-18 ref|XP_003547574.1| PREDICTED: pentatricopeptide repeat-containi... 102 2e-18 gb|KHN41349.1| Pentatricopeptide repeat-containing protein [Glyc... 102 3e-18 ref|XP_006587119.1| PREDICTED: pentatricopeptide repeat-containi... 102 3e-18 ref|XP_004503357.1| PREDICTED: pentatricopeptide repeat-containi... 100 2e-17 gb|PNY11660.1| pentatricopeptide repeat-containing protein at4g2... 99 3e-17 ref|XP_017410279.1| PREDICTED: pentatricopeptide repeat-containi... 98 4e-17 dbj|BAT79937.1| hypothetical protein VIGAN_02288100 [Vigna angul... 98 5e-17 ref|XP_014501087.1| pentatricopeptide repeat-containing protein ... 97 8e-17 ref|XP_007138522.1| hypothetical protein PHAVU_009G216300g [Phas... 97 1e-16 ref|XP_019415839.1| PREDICTED: pentatricopeptide repeat-containi... 93 2e-15 gb|OIV98311.1| hypothetical protein TanjilG_16638 [Lupinus angus... 93 2e-15 ref|XP_020414411.1| pentatricopeptide repeat-containing protein ... 79 5e-11 ref|XP_008245930.2| PREDICTED: pentatricopeptide repeat-containi... 77 1e-10 ref|XP_021614895.1| pentatricopeptide repeat-containing protein ... 77 2e-10 ref|XP_018826801.1| PREDICTED: pentatricopeptide repeat-containi... 77 2e-10 >gb|KYP39482.1| Pentatricopeptide repeat-containing protein At4g21300 family [Cajanus cajan] Length = 724 Score = 105 bits (262), Expect = 2e-19 Identities = 75/244 (30%), Positives = 119/244 (48%), Gaps = 19/244 (7%) Frame = -3 Query: 1694 VTSYLRVASTNEILPLFIAINFTKIIEGLISFACFLPSVLEYGGFKYYWKSFNYMVQHGL 1515 + Y++ T+E PLF A+ + ++FA FLPS+L+ G K+ + +Y+V+H + Sbjct: 238 IAGYVQNGFTDEAAPLFNAMISVGVKPDSVTFASFLPSILKSGSLKHCKEVHSYIVRHRI 297 Query: 1514 TIDGHLKSSILDIYFSNGDVEMIGNILHHNVLYGVAVCSSNTMILG--------KCLNIF 1359 D +LKS+++DIYF GDV+M I N L VAVC++ MI G +NIF Sbjct: 298 PFDVYLKSALIDIYFKGGDVKMARKIFQQNTLVDVAVCTA--MISGYVLNGLNIDAINIF 355 Query: 1358 DWWVQEEKMPNFLTTTGSFLACVALDILKLQVILFAHVAMAKSASVKKLEAYV------- 1200 W +QE +PN LT AC AL LKL L + + SV + + + Sbjct: 356 RWLIQEGMVPNSLTMASVLPACAALAALKLGKELHCDILKKQLGSVVNVGSAITDMYAKC 415 Query: 1199 --INSIDEYLKFLLNSNTLCFYSKHNTCS--GKDEFKFQLVNTGIAMYSNYGDPFDSCKL 1032 ++ E+ K + +++C+ S ++ S GK E L + G FDS L Sbjct: 416 GRLDLAYEFFKRMSERDSVCWNSMISSFSQNGKPEMAIDLFH----QMGMSGAKFDSVSL 471 Query: 1031 INVM 1020 + + Sbjct: 472 SSAL 475 >ref|XP_020203036.1| pentatricopeptide repeat-containing protein At4g21300 [Cajanus cajan] ref|XP_020203037.1| pentatricopeptide repeat-containing protein At4g21300 [Cajanus cajan] ref|XP_020203038.1| pentatricopeptide repeat-containing protein At4g21300 [Cajanus cajan] ref|XP_020203039.1| pentatricopeptide repeat-containing protein At4g21300 [Cajanus cajan] ref|XP_020203041.1| pentatricopeptide repeat-containing protein At4g21300 [Cajanus cajan] Length = 849 Score = 105 bits (262), Expect = 2e-19 Identities = 75/244 (30%), Positives = 119/244 (48%), Gaps = 19/244 (7%) Frame = -3 Query: 1694 VTSYLRVASTNEILPLFIAINFTKIIEGLISFACFLPSVLEYGGFKYYWKSFNYMVQHGL 1515 + Y++ T+E PLF A+ + ++FA FLPS+L+ G K+ + +Y+V+H + Sbjct: 322 IAGYVQNGFTDEAAPLFNAMISVGVKPDSVTFASFLPSILKSGSLKHCKEVHSYIVRHRI 381 Query: 1514 TIDGHLKSSILDIYFSNGDVEMIGNILHHNVLYGVAVCSSNTMILG--------KCLNIF 1359 D +LKS+++DIYF GDV+M I N L VAVC++ MI G +NIF Sbjct: 382 PFDVYLKSALIDIYFKGGDVKMARKIFQQNTLVDVAVCTA--MISGYVLNGLNIDAINIF 439 Query: 1358 DWWVQEEKMPNFLTTTGSFLACVALDILKLQVILFAHVAMAKSASVKKLEAYV------- 1200 W +QE +PN LT AC AL LKL L + + SV + + + Sbjct: 440 RWLIQEGMVPNSLTMASVLPACAALAALKLGKELHCDILKKQLGSVVNVGSAITDMYAKC 499 Query: 1199 --INSIDEYLKFLLNSNTLCFYSKHNTCS--GKDEFKFQLVNTGIAMYSNYGDPFDSCKL 1032 ++ E+ K + +++C+ S ++ S GK E L + G FDS L Sbjct: 500 GRLDLAYEFFKRMSERDSVCWNSMISSFSQNGKPEMAIDLFH----QMGMSGAKFDSVSL 555 Query: 1031 INVM 1020 + + Sbjct: 556 SSAL 559 >ref|XP_003630936.1| PPR containing plant-like protein [Medicago truncatula] gb|AET05412.1| PPR containing plant-like protein [Medicago truncatula] Length = 959 Score = 103 bits (257), Expect = 1e-18 Identities = 61/150 (40%), Positives = 86/150 (57%), Gaps = 8/150 (5%) Frame = -3 Query: 1694 VTSYLRVASTNEILPLFIAINFTKIIEGLISFACFLPSVLEYGGFKYYWKSFNYMVQHGL 1515 + Y++ T+E + LF A+ + + I+FA FLPSVL+ G KY + +Y+V+HG+ Sbjct: 351 IAGYVQNGFTDEAVALFKAMVTSGVKLDSITFASFLPSVLKSGSLKYCKEVHSYIVRHGV 410 Query: 1514 TIDGHLKSSILDIYFSNGDVEMIGNILHHNVLYGVAVCSSNTMILG--------KCLNIF 1359 D +LKS+++DIYF GDVEM N L VAVC++ MI G + LN+F Sbjct: 411 PFDVYLKSALVDIYFKGGDVEMACKTFQQNTLVDVAVCTA--MISGYVLNGLNVEALNLF 468 Query: 1358 DWWVQEEKMPNFLTTTGSFLACVALDILKL 1269 W +QE +PN LT AC AL LKL Sbjct: 469 RWLIQEGMVPNCLTMASVLPACAALASLKL 498 >gb|KHN05282.1| Pentatricopeptide repeat-containing protein [Glycine soja] Length = 772 Score = 102 bits (254), Expect = 2e-18 Identities = 72/244 (29%), Positives = 118/244 (48%), Gaps = 19/244 (7%) Frame = -3 Query: 1694 VTSYLRVASTNEILPLFIAINFTKIIEGLISFACFLPSVLEYGGFKYYWKSFNYMVQHGL 1515 + Y++ T+E PLF A+ + ++FA FLPS+LE G ++ + +Y+V+H + Sbjct: 245 IAGYVQNGFTDEAAPLFNAMISAGVKPDSVTFASFLPSILESGSLRHCKEVHSYIVRHRV 304 Query: 1514 TIDGHLKSSILDIYFSNGDVEMIGNILHHNVLYGVAVCSSNTMILG--------KCLNIF 1359 D +LKS+++DIYF GDVEM I N L VAVC++ MI G +N F Sbjct: 305 PFDVYLKSALIDIYFKGGDVEMARKIFQQNTLVDVAVCTA--MISGYVLHGLNIDAINTF 362 Query: 1358 DWWVQEEKMPNFLTTTGSFLACVALDILKLQVILFAHVAMAKSASVKKLEAYV------- 1200 W +QE +PN LT AC AL LKL L + + ++ + + + Sbjct: 363 RWLIQEGMVPNSLTMASVLPACAALAALKLGKELHCDILKKQLENIVNVGSAITDMYAKC 422 Query: 1199 --INSIDEYLKFLLNSNTLCFYSKHNTCS--GKDEFKFQLVNTGIAMYSNYGDPFDSCKL 1032 ++ E+ + + ++++C+ S ++ S GK E L G FDS L Sbjct: 423 GRLDLAYEFFRRMSETDSICWNSMISSFSQNGKPEMAVDL----FRQMGMSGAKFDSVSL 478 Query: 1031 INVM 1020 + + Sbjct: 479 SSAL 482 >gb|KRH12777.1| hypothetical protein GLYMA_15G193700 [Glycine max] Length = 825 Score = 102 bits (254), Expect = 2e-18 Identities = 72/244 (29%), Positives = 118/244 (48%), Gaps = 19/244 (7%) Frame = -3 Query: 1694 VTSYLRVASTNEILPLFIAINFTKIIEGLISFACFLPSVLEYGGFKYYWKSFNYMVQHGL 1515 + Y++ T+E PLF A+ + ++FA FLPS+LE G ++ + +Y+V+H + Sbjct: 298 IAGYVQNGFTDEAAPLFNAMISAGVKPDSVTFASFLPSILESGSLRHCKEVHSYIVRHRV 357 Query: 1514 TIDGHLKSSILDIYFSNGDVEMIGNILHHNVLYGVAVCSSNTMILG--------KCLNIF 1359 D +LKS+++DIYF GDVEM I N L VAVC++ MI G +N F Sbjct: 358 PFDVYLKSALIDIYFKGGDVEMARKIFQQNTLVDVAVCTA--MISGYVLHGLNIDAINTF 415 Query: 1358 DWWVQEEKMPNFLTTTGSFLACVALDILKLQVILFAHVAMAKSASVKKLEAYV------- 1200 W +QE +PN LT AC AL LKL L + + ++ + + + Sbjct: 416 RWLIQEGMVPNSLTMASVLPACAALAALKLGKELHCDILKKQLENIVNVGSAITDMYAKC 475 Query: 1199 --INSIDEYLKFLLNSNTLCFYSKHNTCS--GKDEFKFQLVNTGIAMYSNYGDPFDSCKL 1032 ++ E+ + + ++++C+ S ++ S GK E L G FDS L Sbjct: 476 GRLDLAYEFFRRMSETDSICWNSMISSFSQNGKPEMAVDL----FRQMGMSGAKFDSVSL 531 Query: 1031 INVM 1020 + + Sbjct: 532 SSAL 535 >ref|XP_003547574.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21300-like [Glycine max] Length = 846 Score = 102 bits (254), Expect = 2e-18 Identities = 72/244 (29%), Positives = 118/244 (48%), Gaps = 19/244 (7%) Frame = -3 Query: 1694 VTSYLRVASTNEILPLFIAINFTKIIEGLISFACFLPSVLEYGGFKYYWKSFNYMVQHGL 1515 + Y++ T+E PLF A+ + ++FA FLPS+LE G ++ + +Y+V+H + Sbjct: 319 IAGYVQNGFTDEAAPLFNAMISAGVKPDSVTFASFLPSILESGSLRHCKEVHSYIVRHRV 378 Query: 1514 TIDGHLKSSILDIYFSNGDVEMIGNILHHNVLYGVAVCSSNTMILG--------KCLNIF 1359 D +LKS+++DIYF GDVEM I N L VAVC++ MI G +N F Sbjct: 379 PFDVYLKSALIDIYFKGGDVEMARKIFQQNTLVDVAVCTA--MISGYVLHGLNIDAINTF 436 Query: 1358 DWWVQEEKMPNFLTTTGSFLACVALDILKLQVILFAHVAMAKSASVKKLEAYV------- 1200 W +QE +PN LT AC AL LKL L + + ++ + + + Sbjct: 437 RWLIQEGMVPNSLTMASVLPACAALAALKLGKELHCDILKKQLENIVNVGSAITDMYAKC 496 Query: 1199 --INSIDEYLKFLLNSNTLCFYSKHNTCS--GKDEFKFQLVNTGIAMYSNYGDPFDSCKL 1032 ++ E+ + + ++++C+ S ++ S GK E L G FDS L Sbjct: 497 GRLDLAYEFFRRMSETDSICWNSMISSFSQNGKPEMAVDL----FRQMGMSGAKFDSVSL 552 Query: 1031 INVM 1020 + + Sbjct: 553 SSAL 556 >gb|KHN41349.1| Pentatricopeptide repeat-containing protein [Glycine soja] Length = 788 Score = 102 bits (253), Expect = 3e-18 Identities = 82/294 (27%), Positives = 136/294 (46%), Gaps = 19/294 (6%) Frame = -3 Query: 1844 FDPHSFESLYLVALGSSVGEQELVLIYAGKHKEASHAYGPVNTNLSCGVYVTSYLRVAST 1665 FDP +L VA+ S G L+YA K P ++ + Y++ T Sbjct: 221 FDPQVANTL--VAMYSKCGN----LLYARKLFNTM----PQTDTVTWNGLIAGYVQNGFT 270 Query: 1664 NEILPLFIAINFTKIIEGLISFACFLPSVLEYGGFKYYWKSFNYMVQHGLTIDGHLKSSI 1485 +E PLF A+ + ++FA FLPS+LE G ++ + +Y+V+H + D +LKS++ Sbjct: 271 DEAAPLFNAMISAGVKPDSVTFASFLPSILESGSLRHCKEVHSYIVRHRVPFDVYLKSAL 330 Query: 1484 LDIYFSNGDVEMIGNILHHNVLYGVAVCSSNTMILG--------KCLNIFDWWVQEEKMP 1329 +D+YF GDVEM I N+L VAVC++ MI G +N F W +QE + Sbjct: 331 IDVYFKGGDVEMARKIFQQNILVDVAVCTA--MISGYVLHGLNIDAINTFRWLIQEGMVT 388 Query: 1328 NFLTTTGSFLACVALDILKLQVILFAHVAMAKSASVKKLEAYV---------INSIDEYL 1176 N LT AC A+ LK L H+ + ++ + + + ++ E+ Sbjct: 389 NSLTMASVLPACAAVAALKPGKELHCHILKKRLENIVNVGSAITDMYAKCGRLDLAYEFF 448 Query: 1175 KFLLNSNTLCFYSKHNTCS--GKDEFKFQLVNTGIAMYSNYGDPFDSCKLINVM 1020 + + + +++C+ S ++ S GK E L G FDS L + + Sbjct: 449 RRMSDRDSVCWNSMISSFSQNGKPEIAIDL----FRQMGMSGAKFDSVSLSSAL 498 >ref|XP_006587119.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21300-like [Glycine max] ref|XP_014617512.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21300-like [Glycine max] ref|XP_014617513.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21300-like [Glycine max] ref|XP_014617514.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21300-like [Glycine max] ref|XP_014617515.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21300-like [Glycine max] ref|XP_014617516.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21300-like [Glycine max] gb|KRH37767.1| hypothetical protein GLYMA_09G088000 [Glycine max] gb|KRH37768.1| hypothetical protein GLYMA_09G088000 [Glycine max] Length = 848 Score = 102 bits (253), Expect = 3e-18 Identities = 82/294 (27%), Positives = 136/294 (46%), Gaps = 19/294 (6%) Frame = -3 Query: 1844 FDPHSFESLYLVALGSSVGEQELVLIYAGKHKEASHAYGPVNTNLSCGVYVTSYLRVAST 1665 FDP +L VA+ S G L+YA K P ++ + Y++ T Sbjct: 281 FDPQVANTL--VAMYSKCGN----LLYARKLFNTM----PQTDTVTWNGLIAGYVQNGFT 330 Query: 1664 NEILPLFIAINFTKIIEGLISFACFLPSVLEYGGFKYYWKSFNYMVQHGLTIDGHLKSSI 1485 +E PLF A+ + ++FA FLPS+LE G ++ + +Y+V+H + D +LKS++ Sbjct: 331 DEAAPLFNAMISAGVKPDSVTFASFLPSILESGSLRHCKEVHSYIVRHRVPFDVYLKSAL 390 Query: 1484 LDIYFSNGDVEMIGNILHHNVLYGVAVCSSNTMILG--------KCLNIFDWWVQEEKMP 1329 +D+YF GDVEM I N+L VAVC++ MI G +N F W +QE + Sbjct: 391 IDVYFKGGDVEMARKIFQQNILVDVAVCTA--MISGYVLHGLNIDAINTFRWLIQEGMVT 448 Query: 1328 NFLTTTGSFLACVALDILKLQVILFAHVAMAKSASVKKLEAYV---------INSIDEYL 1176 N LT AC A+ LK L H+ + ++ + + + ++ E+ Sbjct: 449 NSLTMASVLPACAAVAALKPGKELHCHILKKRLENIVNVGSAITDMYAKCGRLDLAYEFF 508 Query: 1175 KFLLNSNTLCFYSKHNTCS--GKDEFKFQLVNTGIAMYSNYGDPFDSCKLINVM 1020 + + + +++C+ S ++ S GK E L G FDS L + + Sbjct: 509 RRMSDRDSVCWNSMISSFSQNGKPEIAIDL----FRQMGMSGAKFDSVSLSSAL 558 >ref|XP_004503357.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21300 [Cicer arietinum] ref|XP_012572043.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21300 [Cicer arietinum] Length = 875 Score = 99.8 bits (247), Expect = 2e-17 Identities = 59/149 (39%), Positives = 83/149 (55%), Gaps = 8/149 (5%) Frame = -3 Query: 1694 VTSYLRVASTNEILPLFIAINFTKIIEGLISFACFLPSVLEYGGFKYYWKSFNYMVQHGL 1515 + Y++ T+E + LF A+ + + I+FA FLPS+LE G + +Y+V+HG+ Sbjct: 348 IAGYVQNGFTDEAVTLFKAMIASGVKPDSITFASFLPSILESGSLNNCKEVHSYIVRHGV 407 Query: 1514 TIDGHLKSSILDIYFSNGDVEMIGNILHHNVLYGVAVCSSNTMILG--------KCLNIF 1359 D +LKS+++DIYF GDVEM N L +AVC++ MI G + +NIF Sbjct: 408 PFDVYLKSALVDIYFKGGDVEMARKTFQQNTLVDIAVCTA--MISGYVLNGMNIEAINIF 465 Query: 1358 DWWVQEEKMPNFLTTTGSFLACVALDILK 1272 W VQE MPN LT AC AL LK Sbjct: 466 RWLVQEGIMPNCLTMASVLPACAALASLK 494 >gb|PNY11660.1| pentatricopeptide repeat-containing protein at4g21300-like protein [Trifolium pratense] Length = 768 Score = 98.6 bits (244), Expect = 3e-17 Identities = 57/150 (38%), Positives = 86/150 (57%), Gaps = 8/150 (5%) Frame = -3 Query: 1694 VTSYLRVASTNEILPLFIAINFTKIIEGLISFACFLPSVLEYGGFKYYWKSFNYMVQHGL 1515 + Y++ T+E + LF A+ + + I+FA FLPS+LE G K+ + +Y+V+H + Sbjct: 235 IAGYVQNGFTDEAVALFKAMIASGVKLDSITFASFLPSILESGTLKHCKEVHSYIVRHDV 294 Query: 1514 TIDGHLKSSILDIYFSNGDVEMIGNILHHNVLYGVAVCSSNTMILG--------KCLNIF 1359 D +LKS+++DIYF GDVEM N L VAVC++ MI G + +N+F Sbjct: 295 PFDVYLKSALVDIYFKGGDVEMARKTFQQNTLVDVAVCTA--MISGYALNGLNVEAINMF 352 Query: 1358 DWWVQEEKMPNFLTTTGSFLACVALDILKL 1269 W +QE+ +PN LT +C AL LKL Sbjct: 353 RWLIQEKMIPNCLTMASVLPSCAALASLKL 382 >ref|XP_017410279.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21300 [Vigna angularis] gb|KOM29530.1| hypothetical protein LR48_Vigan721s001200 [Vigna angularis] Length = 848 Score = 98.2 bits (243), Expect = 4e-17 Identities = 84/328 (25%), Positives = 144/328 (43%), Gaps = 45/328 (13%) Frame = -3 Query: 1694 VTSYLRVASTNEILPLFIAINFTKIIEGLISFACFLPSVLEYGGFKYYWKSFNYMVQHGL 1515 + Y++ ++E PLF A+ + ++FA FLPS+L+ G K+ + Y+V+H + Sbjct: 321 MAGYVQNGFSDEAAPLFNAMISAGVKPDAVTFASFLPSLLKSGSLKHCKEVHGYIVRHRV 380 Query: 1514 TIDGHLKSSILDIYFSNGDVEMIGNILHHNVLYGVAVCSSNTMILG--------KCLNIF 1359 D +LKS+++DIYF GDV+M I N L VAVC++ MI G +NIF Sbjct: 381 PFDVYLKSALIDIYFKGGDVKMAFKIFQLNTLVDVAVCTA--MISGYVLNGLNRDAINIF 438 Query: 1358 DWWVQEEKMPNFLTTTGSFLACVALDILKLQVILFAHVAMAKSASVKKLEAYV------- 1200 W ++E +PN LT AC A+ +KL L + + ++ + + + Sbjct: 439 RWLIKEGMVPNCLTMASVLPACAAMAAIKLGKELHCDILKKRLENIVNVGSAITDMYAKC 498 Query: 1199 --INSIDEYLKFLLNSNTLCFYSKHNTCS--GKDEFKFQL---------------VNTGI 1077 ++ E+ + + +++C+ S ++ S GK E L +++ + Sbjct: 499 GRVDLAYEFFRRMSQRDSVCWNSMLSSFSQNGKPEMAIDLFRQMGISGAKYDTVSLSSAL 558 Query: 1076 AMYSN-----YGDPFDSCKLINVMLLTDVGTWIALIDGYDQ------TRFINKEQDARTF 930 + SN YG +C + N D ALID Y + R + D + Sbjct: 559 SAASNLSALYYGKEMHACVIRNAFSF-DTFVASALIDMYSKCGKLALARCVFDLMDGKNE 617 Query: 929 ASWGIDYLKYDKGGCLVQSNSLLKEVVG 846 SW Y G + L E++G Sbjct: 618 VSWNSIIAAYGNHGFPRECLDLFHEMLG 645 >dbj|BAT79937.1| hypothetical protein VIGAN_02288100 [Vigna angularis var. angularis] Length = 863 Score = 98.2 bits (243), Expect = 5e-17 Identities = 84/328 (25%), Positives = 144/328 (43%), Gaps = 45/328 (13%) Frame = -3 Query: 1694 VTSYLRVASTNEILPLFIAINFTKIIEGLISFACFLPSVLEYGGFKYYWKSFNYMVQHGL 1515 + Y++ ++E PLF A+ + ++FA FLPS+L+ G K+ + Y+V+H + Sbjct: 321 MAGYVQNGFSDEAAPLFNAMISAGVKPDAVTFASFLPSLLKSGSLKHCKEVHGYIVRHRV 380 Query: 1514 TIDGHLKSSILDIYFSNGDVEMIGNILHHNVLYGVAVCSSNTMILG--------KCLNIF 1359 D +LKS+++DIYF GDV+M I N L VAVC++ MI G +NIF Sbjct: 381 PFDVYLKSALIDIYFKGGDVKMAFKIFQLNTLVDVAVCTA--MISGYVLNGLNRDAINIF 438 Query: 1358 DWWVQEEKMPNFLTTTGSFLACVALDILKLQVILFAHVAMAKSASVKKLEAYV------- 1200 W ++E +PN LT AC A+ +KL L + + ++ + + + Sbjct: 439 RWLIKEGMVPNCLTMASVLPACAAMAAIKLGKELHCDILKKRLENIVNVGSAITDMYAKC 498 Query: 1199 --INSIDEYLKFLLNSNTLCFYSKHNTCS--GKDEFKFQL---------------VNTGI 1077 ++ E+ + + +++C+ S ++ S GK E L +++ + Sbjct: 499 GRVDLAYEFFRRMSQRDSVCWNSMLSSFSQNGKPEMAIDLFRQMGISGAKYDTVSLSSAL 558 Query: 1076 AMYSN-----YGDPFDSCKLINVMLLTDVGTWIALIDGYDQ------TRFINKEQDARTF 930 + SN YG +C + N D ALID Y + R + D + Sbjct: 559 SAASNLSALYYGKEMHACVIRNAFSF-DTFVASALIDMYSKCGKLALARCVFDLMDGKNE 617 Query: 929 ASWGIDYLKYDKGGCLVQSNSLLKEVVG 846 SW Y G + L E++G Sbjct: 618 VSWNSIIAAYGNHGFPRECLDLFHEMLG 645 >ref|XP_014501087.1| pentatricopeptide repeat-containing protein At4g21300 [Vigna radiata var. radiata] Length = 848 Score = 97.4 bits (241), Expect = 8e-17 Identities = 65/220 (29%), Positives = 110/220 (50%), Gaps = 19/220 (8%) Frame = -3 Query: 1694 VTSYLRVASTNEILPLFIAINFTKIIEGLISFACFLPSVLEYGGFKYYWKSFNYMVQHGL 1515 + Y++ ++E PLF A+ + ++FA FLPSVL+ G K+ + Y+V+H + Sbjct: 321 MAGYVQNGFSDEAAPLFNAMISAGVKPDAVTFASFLPSVLKTGSLKHCKEVHGYIVRHRV 380 Query: 1514 TIDGHLKSSILDIYFSNGDVEMIGNILHHNVLYGVAVCSSNTMILG--------KCLNIF 1359 D +LKS+++DIYF GDV+M I N L VAVC++ MI G +NIF Sbjct: 381 PFDVYLKSALIDIYFKGGDVKMAYKIFQQNTLVDVAVCTA--MISGYVLNGLNRDAINIF 438 Query: 1358 DWWVQEEKMPNFLTTTGSFLACVALDILKLQVILFAHVAMAKSASVKKLEAYV------- 1200 W ++E +PN LT AC A+ +KL L + + ++ + + + Sbjct: 439 RWLIKEGMVPNCLTMASVLPACAAMAAIKLGKELHCDILKKRLENIVNVGSAITDMYAKC 498 Query: 1199 --INSIDEYLKFLLNSNTLCFYSKHNTCS--GKDEFKFQL 1092 ++ E+ K + +++C+ S ++ S GK E L Sbjct: 499 GRVDLAYEFFKRMSQRDSVCWNSMLSSFSQNGKPEMAIDL 538 >ref|XP_007138522.1| hypothetical protein PHAVU_009G216300g [Phaseolus vulgaris] gb|ESW10516.1| hypothetical protein PHAVU_009G216300g [Phaseolus vulgaris] Length = 848 Score = 96.7 bits (239), Expect = 1e-16 Identities = 54/150 (36%), Positives = 85/150 (56%), Gaps = 8/150 (5%) Frame = -3 Query: 1694 VTSYLRVASTNEILPLFIAINFTKIIEGLISFACFLPSVLEYGGFKYYWKSFNYMVQHGL 1515 + Y++ T+E PLF A+ + ++FA FLPS+L+ G K+ + +Y+V+H + Sbjct: 322 IAGYVQNGFTDEAAPLFNAMISAGVKPDAVTFASFLPSILKSGSLKHCKEVHSYIVRHRV 381 Query: 1514 TIDGHLKSSILDIYFSNGDVEMIGNILHHNVLYGVAVCSSNTMILG--------KCLNIF 1359 D +LKS+++DIYF +GDV+ I N L VAVC++ MI G + +NIF Sbjct: 382 PFDVYLKSALIDIYFKSGDVKTAYKIFQQNTLVDVAVCTA--MISGYVLNGLNMEAINIF 439 Query: 1358 DWWVQEEKMPNFLTTTGSFLACVALDILKL 1269 W ++E +PN LT AC A+ +KL Sbjct: 440 RWLIKEGMVPNCLTMASVLPACAAVAAMKL 469 >ref|XP_019415839.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21300 [Lupinus angustifolius] ref|XP_019415840.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21300 [Lupinus angustifolius] ref|XP_019415841.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21300 [Lupinus angustifolius] Length = 857 Score = 93.2 bits (230), Expect = 2e-15 Identities = 58/158 (36%), Positives = 83/158 (52%), Gaps = 8/158 (5%) Frame = -3 Query: 1694 VTSYLRVASTNEILPLFIAINFTKIIEGLISFACFLPSVLEYGGFKYYWKSFNYMVQHGL 1515 + Y++ T+E +PLF + T + I+FA FLPS++E G K + +Y+V+H L Sbjct: 325 IAGYVQNGFTDEAVPLFKEMISTGVKPDSITFASFLPSIVESGSIKRGKEIHSYIVRHRL 384 Query: 1514 TIDGHLKSSILDIYFSNGDVEMIGNILHHNVLYGVAVCSSNTMILGKCLN--------IF 1359 D +LKS+++DIY GDV+M I N VAVC++ MI G LN + Sbjct: 385 PFDVYLKSALIDIYLKGGDVDMALKIFQQNASVDVAVCTA--MISGYVLNGLNIDAITVL 442 Query: 1358 DWWVQEEKMPNFLTTTGSFLACVALDILKLQVILFAHV 1245 W +QE PN LT AC AL LKL L ++ Sbjct: 443 RWLIQEGMTPNRLTMASVLPACAALASLKLGKELHCYI 480 >gb|OIV98311.1| hypothetical protein TanjilG_16638 [Lupinus angustifolius] Length = 929 Score = 93.2 bits (230), Expect = 2e-15 Identities = 58/158 (36%), Positives = 83/158 (52%), Gaps = 8/158 (5%) Frame = -3 Query: 1694 VTSYLRVASTNEILPLFIAINFTKIIEGLISFACFLPSVLEYGGFKYYWKSFNYMVQHGL 1515 + Y++ T+E +PLF + T + I+FA FLPS++E G K + +Y+V+H L Sbjct: 325 IAGYVQNGFTDEAVPLFKEMISTGVKPDSITFASFLPSIVESGSIKRGKEIHSYIVRHRL 384 Query: 1514 TIDGHLKSSILDIYFSNGDVEMIGNILHHNVLYGVAVCSSNTMILGKCLN--------IF 1359 D +LKS+++DIY GDV+M I N VAVC++ MI G LN + Sbjct: 385 PFDVYLKSALIDIYLKGGDVDMALKIFQQNASVDVAVCTA--MISGYVLNGLNIDAITVL 442 Query: 1358 DWWVQEEKMPNFLTTTGSFLACVALDILKLQVILFAHV 1245 W +QE PN LT AC AL LKL L ++ Sbjct: 443 RWLIQEGMTPNRLTMASVLPACAALASLKLGKELHCYI 480 >ref|XP_020414411.1| pentatricopeptide repeat-containing protein At4g21300 [Prunus persica] ref|XP_020414412.1| pentatricopeptide repeat-containing protein At4g21300 [Prunus persica] Length = 840 Score = 78.6 bits (192), Expect = 5e-11 Identities = 79/328 (24%), Positives = 139/328 (42%), Gaps = 45/328 (13%) Frame = -3 Query: 1694 VTSYLRVASTNEILPLFIAINFTKIIEGLISFACFLPSVLEYGGFKYYWKSFNYMVQHGL 1515 ++ Y++ E LF A+ + + I+FA FLPSV E K + + Y+V+H + Sbjct: 314 ISGYIQNGFMVEASRLFQAMISSSVKPDSITFASFLPSVAELANLKQGKEIYGYIVRHCV 373 Query: 1514 TIDGHLKSSILDIYFSNGDVEMIGNILHHNVLYGVAVCSSNTMILGKCLN--------IF 1359 +D LKS+++D+YF +V+M I + + + +C++ MI G LN IF Sbjct: 374 PLDVFLKSALIDVYFKCRNVDMARKIFNQSTRTDIVMCTA--MISGLVLNGMNHDALEIF 431 Query: 1358 DWWVQEEKMPNFLTTTGSFLACVALDILKLQVILFAHVA---------MAKSASVKKLEA 1206 W ++E+ PN LT AC L LKL L ++ + + + ++ Sbjct: 432 RWLLKEKMRPNSLTLASVLPACAGLVALKLGKELHGNILKHGLDGRLHLGSALTDMYAKS 491 Query: 1205 YVINSIDEYLKFLLNSNTLCFYSKHNTCS--GKDE-------------FKFQLVNTGIAM 1071 ++ + + + +T+C+ S + S GK E K+ V+ A+ Sbjct: 492 GRLDLAHQVFERMFERDTICWNSMITSYSQNGKPEEAIDIFRQMGMAGAKYDCVSISAAL 551 Query: 1070 YS-------NYGDPFDSCKLINVMLLTDVGTWIALIDGYDQ------TRFINKEQDARTF 930 + +YG +I +D+ ALID Y + R + + + Sbjct: 552 SACANLPALHYGKEIHGF-MIRSAFSSDLFAESALIDVYAKCGNLVFARRVFDMMEEKNE 610 Query: 929 ASWGIDYLKYDKGGCLVQSNSLLKEVVG 846 SW Y GCL S L +E++G Sbjct: 611 VSWNSIISAYGSHGCLQDSLVLFREMLG 638 >ref|XP_008245930.2| PREDICTED: pentatricopeptide repeat-containing protein At4g21300-like, partial [Prunus mume] Length = 646 Score = 77.4 bits (189), Expect = 1e-10 Identities = 80/328 (24%), Positives = 138/328 (42%), Gaps = 45/328 (13%) Frame = -3 Query: 1694 VTSYLRVASTNEILPLFIAINFTKIIEGLISFACFLPSVLEYGGFKYYWKSFNYMVQHGL 1515 ++ Y++ E LF A+ + + I+FA FLPSV E K + Y+V+H + Sbjct: 120 ISGYIQNGFMVEASRLFQAMISSSVKPDSITFASFLPSVAELASLKQGKEIHGYIVRHCV 179 Query: 1514 TIDGHLKSSILDIYFSNGDVEMIGNILHHNVLYGVAVCSSNTMILGKCLN--------IF 1359 +D LKS+++D+YF +V+M I + + V +C++ MI G LN IF Sbjct: 180 PLDVFLKSALIDVYFKCRNVDMARKIFNQSTRTDVVMCTA--MISGLVLNGMNNDALDIF 237 Query: 1358 DWWVQEEKMPNFLTTTGSFLACVALDILKLQVILFAHVA---------MAKSASVKKLEA 1206 W ++E+ PN LT AC L LKL L ++ + + + ++ Sbjct: 238 RWLLKEKMRPNSLTLASVLPACAGLVALKLGKELHGNILKHGLDGRLHLGSALTDMYAKS 297 Query: 1205 YVINSIDEYLKFLLNSNTLCFYSKHNTCS--GKDE-------------FKFQLVNTGIAM 1071 ++ + + + +T+C+ S + S GK E K+ V+ A+ Sbjct: 298 GRLDLAHQVFERMFERDTICWNSMITSYSQNGKPEEAIDIFRQMGMAGAKYDCVSISAAL 357 Query: 1070 YS-------NYGDPFDSCKLINVMLLTDVGTWIALIDGYDQ------TRFINKEQDARTF 930 + +YG +I +D+ ALID Y + R + + + Sbjct: 358 SACGNLPALHYGKEIHGF-MIRSAFRSDLFAESALIDVYAKCGNLVLARRVFDMMEEKNE 416 Query: 929 ASWGIDYLKYDKGGCLVQSNSLLKEVVG 846 SW Y GCL S L +E++G Sbjct: 417 VSWNSIISAYGSHGCLQDSVVLFREMLG 444 >ref|XP_021614895.1| pentatricopeptide repeat-containing protein At4g21300 [Manihot esculenta] ref|XP_021614896.1| pentatricopeptide repeat-containing protein At4g21300 [Manihot esculenta] ref|XP_021614897.1| pentatricopeptide repeat-containing protein At4g21300 [Manihot esculenta] gb|OAY47087.1| hypothetical protein MANES_06G051200 [Manihot esculenta] gb|OAY47088.1| hypothetical protein MANES_06G051200 [Manihot esculenta] gb|OAY47089.1| hypothetical protein MANES_06G051200 [Manihot esculenta] gb|OAY47090.1| hypothetical protein MANES_06G051200 [Manihot esculenta] Length = 845 Score = 77.0 bits (188), Expect = 2e-10 Identities = 50/158 (31%), Positives = 79/158 (50%), Gaps = 8/158 (5%) Frame = -3 Query: 1694 VTSYLRVASTNEILPLFIAINFTKIIEGLISFACFLPSVLEYGGFKYYWKSFNYMVQHGL 1515 + +++ NE LF + + I+ A FLPSV+E K + Y+++HG+ Sbjct: 318 IAGHVQNGFMNEASHLFSEMIAAGVKPDSITLASFLPSVVESANIKQGKEIHGYVLRHGV 377 Query: 1514 TIDGHLKSSILDIYFSNGDVEMIGNILHHNVLYGVAVCSSNTMILGKCLN--------IF 1359 +D LKS+++DIYF DV+M I + + L + VC++ MI G LN F Sbjct: 378 NLDIFLKSALIDIYFKCRDVKMACKIFNQSTLIDIVVCTA--MISGYVLNGLNYEALDTF 435 Query: 1358 DWWVQEEKMPNFLTTTGSFLACVALDILKLQVILFAHV 1245 W ++E+ PN +T AC L LKL L A++ Sbjct: 436 RWLLEEKMCPNAVTLASILPACAGLATLKLGKELHANI 473 >ref|XP_018826801.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21300 [Juglans regia] Length = 850 Score = 77.0 bits (188), Expect = 2e-10 Identities = 51/150 (34%), Positives = 75/150 (50%), Gaps = 8/150 (5%) Frame = -3 Query: 1694 VTSYLRVASTNEILPLFIAINFTKIIEGLISFACFLPSVLEYGGFKYYWKSFNYMVQHGL 1515 ++ +++ E LF + + I+FA FLPSV E G K + YMV+HG+ Sbjct: 324 ISGFVQNGFMREASNLFREMISVSVKPDSITFASFLPSVTEIAGLKQGKEIHGYMVRHGV 383 Query: 1514 TIDGHLKSSILDIYFSNGDVEMIGNILHHNVLYGVAVCSSNTMILGKCLN--------IF 1359 +D +KS+++DIYF DV M + + V VC++ MI G LN IF Sbjct: 384 PLDLFVKSALIDIYFKCRDVGMARKVFGQSNTVDVIVCTA--MISGFVLNGINSDALEIF 441 Query: 1358 DWWVQEEKMPNFLTTTGSFLACVALDILKL 1269 W ++E+ PN +T AC AL LKL Sbjct: 442 RWLLKEKMRPNSVTLASVLPACAALAALKL 471