BLASTX nr result
ID: Perilla23_contig00015806
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Perilla23_contig00015806 (631 letters)

Database: ./nr
77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_012836976.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...  288   2e-75
gb|EYU37716.1| hypothetical protein MIMGU_mgv1a003131mg [Erythra...  288   2e-75
ref|XP_011088297.1| PREDICTED: pentatricopeptide repeat-containi...  287   4e-75
ref|XP_011088296.1| PREDICTED: pentatricopeptide repeat-containi...  287   4e-75
ref|XP_010313330.1| PREDICTED: pentatricopeptide repeat-containi...  246   1e-62
ref|XP_004251011.1| PREDICTED: pentatricopeptide repeat-containi...  246   1e-62
ref|XP_006349130.1| PREDICTED: pentatricopeptide repeat-containi...  244   4e-62
gb|EPS74001.1| hypothetical protein M569_00754 [Genlisea aurea]      239   7e-61
emb|CBI32449.3| unnamed protein product [Vitis vinifera]             238   2e-60
ref|XP_002281969.1| PREDICTED: pentatricopeptide repeat-containi...  238   2e-60
ref|XP_012450253.1| PREDICTED: pentatricopeptide repeat-containi...  236   1e-59
ref|XP_007042349.1| Endonucleases isoform 2 [Theobroma cacao] gi...  234   4e-59
ref|XP_007042348.1| Pentatricopeptide repeat-containing protein ...  234   4e-59
gb|KHG30621.1| hypothetical protein F383_13349 [Gossypium arboreum]  233   7e-59
ref|XP_010258255.1| PREDICTED: pentatricopeptide repeat-containi...  231   3e-58
ref|XP_009803628.1| PREDICTED: pentatricopeptide repeat-containi...  228   2e-57
ref|XP_009616749.1| PREDICTED: pentatricopeptide repeat-containi...  227   4e-57
ref|XP_008236630.1| PREDICTED: pentatricopeptide repeat-containi...  226   6e-57
ref|XP_012072457.1| PREDICTED: pentatricopeptide repeat-containi...  226   6e-57
ref|XP_004292454.1| PREDICTED: pentatricopeptide repeat-containi...  226   8e-57

>ref|XP_012836976.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g15820 [Erythranthe guttatus]
Length = 781

Score = 288 bits (736), Expect = 2e-75
Identities = 139/211 (65%), Positives = 171/211 (81%), Gaps = 3/211 (1%)
Frame = -2

Query: 630 QEQREXXXXXXXXXLRVKLDEEKRSYAIDFVFREDNRIHSFLKRHIHNQFHEWLAHNVQI 451
+EQRE LRV+LDE+K+S A++FVFRE+ +IHSFLKRHI+NQFHEWLA +
Sbjct: 571 KEQREILIGLLLGGLRVELDEKKKSCAVNFVFRENLKIHSFLKRHIYNQFHEWLASKLMD 630

Query: 450 DD---VPCRFTTIPHSCFQLYADQFWPQGLPVIPKLIHRWLTPRVLAYWYMYGGYRTSSG 280
+D +P +FTT+ H+ F+ +ADQF P+G PVIPKLIHRWLTPRVLAYWYMYGGYRTSSG
Sbjct: 631 NDDENIPYQFTTVSHTYFKFFADQFCPEGQPVIPKLIHRWLTPRVLAYWYMYGGYRTSSG 690

Query: 279 DILLKLKFDKEDVVMIAKTIKAKSLNCRVKRRGSVFWMGFLGSDAAEFWKLTEPFVLADL 100
DILLKL+F KEDV+ I K++KA SLNCRVKR+G+VFW+GFLGSDAAEFWKLTEPFVLADL
Sbjct: 691 DILLKLRFGKEDVLRITKSLKANSLNCRVKRKGTVFWVGFLGSDAAEFWKLTEPFVLADL 750

Query: 99 KDSLEANFEPLDGRLGFNDISFSSDSDTNGN 7
K+ LEA E +G ++++FSSD D+ N
Sbjct: 751 KEFLEAGIESSNGSSETSNVNFSSDDDSPPN 781

>gb|EYU37716.1| hypothetical protein MIMGU_mgv1a003131mg [Erythranthe guttata]
Length = 606

Score = 288 bits (736), Expect = 2e-75
Identities = 139/211 (65%), Positives = 171/211 (81%), Gaps = 3/211 (1%)
Frame = -2

Query: 630 QEQREXXXXXXXXXLRVKLDEEKRSYAIDFVFREDNRIHSFLKRHIHNQFHEWLAHNVQI 451
+EQRE LRV+LDE+K+S A++FVFRE+ +IHSFLKRHI+NQFHEWLA +
Sbjct: 396 KEQREILIGLLLGGLRVELDEKKKSCAVNFVFRENLKIHSFLKRHIYNQFHEWLASKLMD 455

Query: 450 DD---VPCRFTTIPHSCFQLYADQFWPQGLPVIPKLIHRWLTPRVLAYWYMYGGYRTSSG 280
+D +P +FTT+ H+ F+ +ADQF P+G PVIPKLIHRWLTPRVLAYWYMYGGYRTSSG
Sbjct: 456 NDDENIPYQFTTVSHTYFKFFADQFCPEGQPVIPKLIHRWLTPRVLAYWYMYGGYRTSSG 515

Query: 279 DILLKLKFDKEDVVMIAKTIKAKSLNCRVKRRGSVFWMGFLGSDAAEFWKLTEPFVLADL 100
DILLKL+F KEDV+ I K++KA SLNCRVKR+G+VFW+GFLGSDAAEFWKLTEPFVLADL
Sbjct: 516 DILLKLRFGKEDVLRITKSLKANSLNCRVKRKGTVFWVGFLGSDAAEFWKLTEPFVLADL 575

Query: 99 KDSLEANFEPLDGRLGFNDISFSSDSDTNGN 7
K+ LEA E +G
++++FSSD D+ N Sbjct: 576 KEFLEAGIESSNGSSETSNVNFSSDDDSPPN 606 >ref|XP_011088297.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820 isoform X2 [Sesamum indicum] gi|747081997|ref|XP_011088298.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820 isoform X2 [Sesamum indicum] gi|747081999|ref|XP_011088299.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820 isoform X2 [Sesamum indicum] Length = 711 Score = 287 bits (734), Expect = 4e-75 Identities = 142/217 (65%), Positives = 164/217 (75%), Gaps = 7/217 (3%) Frame = -2 Query: 630 QEQREXXXXXXXXXLRVKLDEEKRSYAIDFVFREDNRIHSFLKRHIHNQFHEWLAHNVQI 451 +EQRE LRVK+DEEK+SYA+ FVFRE++ HSFLKRHI+NQFHEWLA + + Sbjct: 483 KEQREILIGLLLGGLRVKIDEEKKSYAVHFVFRENSNTHSFLKRHIYNQFHEWLADKLPV 542 Query: 450 DD------VPCRFTTIPHSCFQLYADQFWPQGLPVIPKLIHRWLTPRVLAYWYMYGGYRT 289 DD +PC F TI HS F+ YADQFWPQG+P IP LIHRWLTPR+LAYWYMYGGYRT Sbjct: 543 DDNDRGNDIPCEFMTISHSYFKFYADQFWPQGIPSIPNLIHRWLTPRILAYWYMYGGYRT 602 Query: 288 SSGDILLKLKFDKEDVVMIAKTIKAKSLNCRVKRRGSVFWMGFLGSDAAEFWKLTEPFVL 109 SS DILLKL+ KEDV IAK KAKSLN R+K +G VFW+GFLGS A EFWKL EPF+L Sbjct: 603 SSRDILLKLQSRKEDVPRIAKAFKAKSLNSRIKWKGRVFWVGFLGSHATEFWKLIEPFIL 662 Query: 108 ADLKDSLEANFE-PLDGRLGFNDISFSSDSDTNGNRS 1 +DLK SLEA E P G +I+FSSDSDT+GN S Sbjct: 663 SDLKASLEAGIESPSKGVSRLKNINFSSDSDTDGNTS 699 >ref|XP_011088296.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820 isoform X1 [Sesamum indicum] Length = 832 Score = 287 bits (734), Expect = 4e-75 Identities = 142/217 (65%), Positives = 164/217 (75%), Gaps = 7/217 (3%) Frame = -2 Query: 630 QEQREXXXXXXXXXLRVKLDEEKRSYAIDFVFREDNRIHSFLKRHIHNQFHEWLAHNVQI 451 +EQRE LRVK+DEEK+SYA+ FVFRE++ HSFLKRHI+NQFHEWLA + + Sbjct: 604 KEQREILIGLLLGGLRVKIDEEKKSYAVHFVFRENSNTHSFLKRHIYNQFHEWLADKLPV 663 Query: 450 DD------VPCRFTTIPHSCFQLYADQFWPQGLPVIPKLIHRWLTPRVLAYWYMYGGYRT 289 DD +PC F TI HS F+ YADQFWPQG+P IP LIHRWLTPR+LAYWYMYGGYRT Sbjct: 664 DDNDRGNDIPCEFMTISHSYFKFYADQFWPQGIPSIPNLIHRWLTPRILAYWYMYGGYRT 723 Query: 288 
SSGDILLKLKFDKEDVVMIAKTIKAKSLNCRVKRRGSVFWMGFLGSDAAEFWKLTEPFVL 109 SS DILLKL+ KEDV IAK KAKSLN R+K +G VFW+GFLGS A EFWKL EPF+L Sbjct: 724 SSRDILLKLQSRKEDVPRIAKAFKAKSLNSRIKWKGRVFWVGFLGSHATEFWKLIEPFIL 783 Query: 108 ADLKDSLEANFE-PLDGRLGFNDISFSSDSDTNGNRS 1 +DLK SLEA E P G +I+FSSDSDT+GN S Sbjct: 784 SDLKASLEAGIESPSKGVSRLKNINFSSDSDTDGNTS 820 >ref|XP_010313330.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820 isoform X2 [Solanum lycopersicum] Length = 660 Score = 246 bits (627), Expect = 1e-62 Identities = 120/213 (56%), Positives = 153/213 (71%), Gaps = 4/213 (1%) Frame = -2 Query: 627 EQREXXXXXXXXXLRVKLDEEKRSYAIDFVFREDNRIHSFLKRHIHNQFHEWL-AHNVQI 451 EQRE ++++ DEE+R +AI + F E++RIHS KRHIH++FHEWL +H++ + Sbjct: 438 EQREVLTGSLLGGVQIRSDEERRLHAIHYEFNEESRIHSIFKRHIHDEFHEWLGSHDMMV 497 Query: 450 D---DVPCRFTTIPHSCFQLYADQFWPQGLPVIPKLIHRWLTPRVLAYWYMYGGYRTSSG 280 D D+P FTTI HS F YADQFWP G P IPKLIHRWL+PRVLAYWYMYGGYRTSSG Sbjct: 498 DSTADIPSSFTTISHSDFTFYADQFWPNGRPCIPKLIHRWLSPRVLAYWYMYGGYRTSSG 557 Query: 279 DILLKLKFDKEDVVMIAKTIKAKSLNCRVKRRGSVFWMGFLGSDAAEFWKLTEPFVLADL 100 DILL++K E VV I K +KAKSL+CRVK RG VFW+GFLG +A FWK+ EPF++ +L Sbjct: 558 DILLRVKGSSEGVVSILKALKAKSLHCRVKNRGKVFWIGFLGDNATFFWKVVEPFIIDEL 617 Query: 99 KDSLEANFEPLDGRLGFNDISFSSDSDTNGNRS 1 K L+A + +G L I+F S S++N S Sbjct: 618 KGLLKAGGDS-NGSLETQCINFDSGSESNEKNS 649 >ref|XP_004251011.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820 isoform X1 [Solanum lycopersicum] gi|723745286|ref|XP_010313327.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820 isoform X1 [Solanum lycopersicum] gi|723745290|ref|XP_010313328.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820 isoform X1 [Solanum lycopersicum] gi|723745297|ref|XP_010313329.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820 isoform X1 [Solanum lycopersicum] Length = 816 Score = 246 bits (627), Expect = 1e-62 Identities = 120/213 (56%), Positives = 153/213 (71%), Gaps = 4/213 (1%) Frame = -2 Query: 627 
EQREXXXXXXXXXLRVKLDEEKRSYAIDFVFREDNRIHSFLKRHIHNQFHEWL-AHNVQI 451 EQRE ++++ DEE+R +AI + F E++RIHS KRHIH++FHEWL +H++ + Sbjct: 594 EQREVLTGSLLGGVQIRSDEERRLHAIHYEFNEESRIHSIFKRHIHDEFHEWLGSHDMMV 653 Query: 450 D---DVPCRFTTIPHSCFQLYADQFWPQGLPVIPKLIHRWLTPRVLAYWYMYGGYRTSSG 280 D D+P FTTI HS F YADQFWP G P IPKLIHRWL+PRVLAYWYMYGGYRTSSG Sbjct: 654 DSTADIPSSFTTISHSDFTFYADQFWPNGRPCIPKLIHRWLSPRVLAYWYMYGGYRTSSG 713 Query: 279 DILLKLKFDKEDVVMIAKTIKAKSLNCRVKRRGSVFWMGFLGSDAAEFWKLTEPFVLADL 100 DILL++K E VV I K +KAKSL+CRVK RG VFW+GFLG +A FWK+ EPF++ +L Sbjct: 714 DILLRVKGSSEGVVSILKALKAKSLHCRVKNRGKVFWIGFLGDNATFFWKVVEPFIIDEL 773 Query: 99 KDSLEANFEPLDGRLGFNDISFSSDSDTNGNRS 1 K L+A + +G L I+F S S++N S Sbjct: 774 KGLLKAGGDS-NGSLETQCINFDSGSESNEKNS 805 >ref|XP_006349130.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820-like, partial [Solanum tuberosum] Length = 813 Score = 244 bits (622), Expect = 4e-62 Identities = 119/213 (55%), Positives = 154/213 (72%), Gaps = 4/213 (1%) Frame = -2 Query: 627 EQREXXXXXXXXXLRVKLDEEKRSYAIDFVFREDNRIHSFLKRHIHNQFHEWL-AHNVQI 451 E+RE ++++ D E+R +AI + F E++RIHS K+HIH++FHEWL +H++ + Sbjct: 594 EKREVLTGLLLGGVQIRSDGERRLHAIHYEFNEESRIHSIFKKHIHDEFHEWLGSHDMMV 653 Query: 450 D---DVPCRFTTIPHSCFQLYADQFWPQGLPVIPKLIHRWLTPRVLAYWYMYGGYRTSSG 280 D D+P FTTI HS F YADQFWP G P IPKLIHRWL+PRVLAYWYMYGGYRTSSG Sbjct: 654 DSTADIPNSFTTISHSYFTFYADQFWPNGRPCIPKLIHRWLSPRVLAYWYMYGGYRTSSG 713 Query: 279 DILLKLKFDKEDVVMIAKTIKAKSLNCRVKRRGSVFWMGFLGSDAAEFWKLTEPFVLADL 100 DILL++K E VV I K +KAKSLNCRVK+RG VFW+GFLG +A FWK+ EPF+L +L Sbjct: 714 DILLRVKGSSEGVVSILKALKAKSLNCRVKKRGRVFWIGFLGDNATFFWKVVEPFILDEL 773 Query: 99 KDSLEANFEPLDGRLGFNDISFSSDSDTNGNRS 1 K+ L+A + +G L I+F S S+T+ S Sbjct: 774 KELLKAGGDS-NGSLETQCINFDSGSETDEKNS 805 >gb|EPS74001.1| hypothetical protein M569_00754 [Genlisea aurea] Length = 850 Score = 239 bits (611), Expect = 7e-61 Identities = 116/215 (53%), Positives = 152/215 (70%), Gaps = 7/215 (3%) Frame = -2 Query: 627 
EQREXXXXXXXXXLRVKLDEEK-RSYAIDFVFREDNRIHSFLKRHIHNQFHEWLAHNVQI 451 +QRE L V+ DEE + + ++F FRED++IHS LKRHI+N FHEW+A + + Sbjct: 632 KQREIIIGLLLSGLSVQQDEEDPKRFLLNFRFREDSKIHSVLKRHIYNTFHEWIAFQMLV 691 Query: 450 DD---VPCRFTTIPHSCFQLYADQFWPQGLPVIPKLIHRWLTPRVLAYWYMYGGYRTSSG 280 DD +P F+T+ HSC++ YA+QFWPQG PVIPKLIHRWLTPR LAYWYMYGG R +G Sbjct: 692 DDSDRIPHHFSTVSHSCYRFYAEQFWPQGRPVIPKLIHRWLTPRSLAYWYMYGGCRIRTG 751 Query: 279 DILLKLKFDKEDVVMIAKTIKAKSLNCRVKRRGSVFWMGFLGSDAAEFWKLTEPFVLADL 100 DI+L+LK KE+V+ IAK ++A+S NC VKR+G+ FW+GF+G+DAAEFWKL EPFV+ADL Sbjct: 752 DIVLRLKHQKEEVLGIAKILRARSFNCLVKRKGNAFWLGFIGTDAAEFWKLVEPFVIADL 811 Query: 99 KDSLEANFEPLDGRLGFND---ISFSSDSDTNGNR 4 K L+A +D FS+D D + R Sbjct: 812 KGDLQAGSSSGISSTSNSDEGSFEFSTDDDDDDER 846 >emb|CBI32449.3| unnamed protein product [Vitis vinifera] Length = 790 Score = 238 bits (608), Expect = 2e-60 Identities = 115/212 (54%), Positives = 155/212 (73%), Gaps = 4/212 (1%) Frame = -2 Query: 630 QEQREXXXXXXXXXLRVKLDEEKRSYAIDFVFREDNRIHSFLKRHIHNQFHEWLAHNVQI 451 +EQRE L+++ DEE++++ I F F E++ HS L+RHIH Q+HEWL + ++ Sbjct: 577 KEQREILIGLLLGGLQMESDEERKNHVIYFEFNENSGAHSVLRRHIHEQYHEWLNSSSKL 636 Query: 450 ----DDVPCRFTTIPHSCFQLYADQFWPQGLPVIPKLIHRWLTPRVLAYWYMYGGYRTSS 283 DDVP +F+TI HS F YADQFWP+G P+IPKLIHRWL+PRVLAYWYMYGG+RTSS Sbjct: 637 SDDNDDVPYKFSTISHSYFGFYADQFWPRGRPMIPKLIHRWLSPRVLAYWYMYGGHRTSS 696 Query: 282 GDILLKLKFDKEDVVMIAKTIKAKSLNCRVKRRGSVFWMGFLGSDAAEFWKLTEPFVLAD 103 GDILLKLK +E V + +T+KA+S++CRVKR+G+VFW+G LGS++ FWKL EP++L D Sbjct: 697 GDILLKLKGSREGVEKVVRTLKAQSMDCRVKRKGTVFWIGLLGSNSTWFWKLIEPYILDD 756 Query: 102 LKDSLEANFEPLDGRLGFNDISFSSDSDTNGN 7 +KD ++A + N ISF S SDT+ N Sbjct: 757 VKDFVKAGCQ--------NTISFGSGSDTDEN 780 >ref|XP_002281969.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820 [Vitis vinifera] Length = 823 Score = 238 bits (608), Expect = 2e-60 Identities = 115/212 (54%), Positives = 155/212 (73%), Gaps = 4/212 (1%) Frame = -2 Query: 630 QEQREXXXXXXXXXLRVKLDEEKRSYAIDFVFREDNRIHSFLKRHIHNQFHEWLAHNVQI 451 +EQRE L+++ DEE++++ 
I F F E++ HS L+RHIH Q+HEWL + ++ Sbjct: 610 KEQREILIGLLLGGLQMESDEERKNHVIYFEFNENSGAHSVLRRHIHEQYHEWLNSSSKL 669 Query: 450 ----DDVPCRFTTIPHSCFQLYADQFWPQGLPVIPKLIHRWLTPRVLAYWYMYGGYRTSS 283 DDVP +F+TI HS F YADQFWP+G P+IPKLIHRWL+PRVLAYWYMYGG+RTSS Sbjct: 670 SDDNDDVPYKFSTISHSYFGFYADQFWPRGRPMIPKLIHRWLSPRVLAYWYMYGGHRTSS 729 Query: 282 GDILLKLKFDKEDVVMIAKTIKAKSLNCRVKRRGSVFWMGFLGSDAAEFWKLTEPFVLAD 103 GDILLKLK +E V + +T+KA+S++CRVKR+G+VFW+G LGS++ FWKL EP++L D Sbjct: 730 GDILLKLKGSREGVEKVVRTLKAQSMDCRVKRKGTVFWIGLLGSNSTWFWKLIEPYILDD 789 Query: 102 LKDSLEANFEPLDGRLGFNDISFSSDSDTNGN 7 +KD ++A + N ISF S SDT+ N Sbjct: 790 VKDFVKAGCQ--------NTISFGSGSDTDEN 813 >ref|XP_012450253.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820 [Gossypium raimondii] gi|763744783|gb|KJB12222.1| hypothetical protein B456_002G007000 [Gossypium raimondii] Length = 835 Score = 236 bits (601), Expect = 1e-59 Identities = 118/211 (55%), Positives = 149/211 (70%), Gaps = 5/211 (2%) Frame = -2 Query: 630 QEQREXXXXXXXXXLRVKLDEEKRSYAIDFVFREDNRIHSFLKRHIHNQFHEWLAHNVQI 451 +EQRE LR+ DEE++++ I F F + HS LKRHIH+Q+HEWL + ++ Sbjct: 612 KEQREILMGLLLGGLRIDSDEERKNHMIRFEFNPSSIPHSILKRHIHDQYHEWLHPSSKL 671 Query: 450 D----DVPCRFTTIPHSCFQLYADQFWPQGLPVIPKLIHRWLTPRVLAYWYMYGGYRTSS 283 D+P +F TI HS F YADQFWP+G PVIPKLIHRWL+P VLAYWYMYGGYRTS+ Sbjct: 672 TAGNGDIPHKFNTISHSYFGFYADQFWPKGQPVIPKLIHRWLSPIVLAYWYMYGGYRTSA 731 Query: 282 GDILLKLKFDKEDVVMIAKTIKAKSLNCRVKRRGSVFWMGFLGSDAAEFWKLTEPFVLAD 103 GDILLKLK E V + KT+K+KSLNCRVKR+G VFW+GFL +D+ FWKL EP+VL + Sbjct: 732 GDILLKLKGSSEGVKKVVKTLKSKSLNCRVKRKGRVFWIGFLRTDSMWFWKLVEPYVLDE 791 Query: 102 LKDSLEANFEPLDG-RLGFNDISFSSDSDTN 13 LKD L+A E D + DI+F S SD++ Sbjct: 792 LKDFLKAGSETADDCAVKSRDINFDSASDSD 822 >ref|XP_007042349.1| Endonucleases isoform 2 [Theobroma cacao] gi|508706284|gb|EOX98180.1| Endonucleases isoform 2 [Theobroma cacao] Length = 621 Score = 234 bits (596), Expect = 4e-59 Identities = 114/215 (53%), Positives = 153/215 (71%), Gaps = 5/215 (2%) Frame = -2 Query: 630 
QEQREXXXXXXXXXLRVKLDEEKRSYAIDFVFREDNRIHSFLKRHIHNQFHEWLAHNVQI 451 +EQR+ L++ D E++++ I F F +++ HS LKRHIH+Q+HEWL + + Sbjct: 400 KEQRQILVGLLLGGLKIDSDGERKNHMIRFEFNQNSVTHSILKRHIHDQYHEWLHPSSKP 459 Query: 450 ----DDVPCRFTTIPHSCFQLYADQFWPQGLPVIPKLIHRWLTPRVLAYWYMYGGYRTSS 283 DD+P +F+TI HS F YADQFWP+G PVIPKLIHRWL+P VLAYWYMYGGY+TS Sbjct: 460 TDGNDDIPHKFSTISHSYFGFYADQFWPRGQPVIPKLIHRWLSPLVLAYWYMYGGYKTSY 519 Query: 282 GDILLKLKFDKEDVVMIAKTIKAKSLNCRVKRRGSVFWMGFLGSDAAEFWKLTEPFVLAD 103 GDILLKLK +E V + KT+KAK+L+CRVKR+G V+W+GFLGS++ FWKL EP++L D Sbjct: 520 GDILLKLKGSREGVEKVVKTLKAKTLHCRVKRKGKVYWIGFLGSNSMWFWKLVEPYILDD 579 Query: 102 LKDSLEANFEPLDG-RLGFNDISFSSDSDTNGNRS 1 LKD L+ + DG + DI+F S SD++ S Sbjct: 580 LKDFLKIGSDTTDGYAVESQDINFDSASDSDEKAS 614 >ref|XP_007042348.1| Pentatricopeptide repeat-containing protein isoform 1 [Theobroma cacao] gi|508706283|gb|EOX98179.1| Pentatricopeptide repeat-containing protein isoform 1 [Theobroma cacao] Length = 823 Score = 234 bits (596), Expect = 4e-59 Identities = 114/215 (53%), Positives = 153/215 (71%), Gaps = 5/215 (2%) Frame = -2 Query: 630 QEQREXXXXXXXXXLRVKLDEEKRSYAIDFVFREDNRIHSFLKRHIHNQFHEWLAHNVQI 451 +EQR+ L++ D E++++ I F F +++ HS LKRHIH+Q+HEWL + + Sbjct: 602 KEQRQILVGLLLGGLKIDSDGERKNHMIRFEFNQNSVTHSILKRHIHDQYHEWLHPSSKP 661 Query: 450 ----DDVPCRFTTIPHSCFQLYADQFWPQGLPVIPKLIHRWLTPRVLAYWYMYGGYRTSS 283 DD+P +F+TI HS F YADQFWP+G PVIPKLIHRWL+P VLAYWYMYGGY+TS Sbjct: 662 TDGNDDIPHKFSTISHSYFGFYADQFWPRGQPVIPKLIHRWLSPLVLAYWYMYGGYKTSY 721 Query: 282 GDILLKLKFDKEDVVMIAKTIKAKSLNCRVKRRGSVFWMGFLGSDAAEFWKLTEPFVLAD 103 GDILLKLK +E V + KT+KAK+L+CRVKR+G V+W+GFLGS++ FWKL EP++L D Sbjct: 722 GDILLKLKGSREGVEKVVKTLKAKTLHCRVKRKGKVYWIGFLGSNSMWFWKLVEPYILDD 781 Query: 102 LKDSLEANFEPLDG-RLGFNDISFSSDSDTNGNRS 1 LKD L+ + DG + DI+F S SD++ S Sbjct: 782 LKDFLKIGSDTTDGYAVESQDINFDSASDSDEKAS 816 >gb|KHG30621.1| hypothetical protein F383_13349 [Gossypium arboreum] Length = 836 Score = 233 bits (594), Expect = 7e-59 Identities = 117/211 (55%), Positives = 148/211 (70%), Gaps = 5/211 
(2%) Frame = -2 Query: 630 QEQREXXXXXXXXXLRVKLDEEKRSYAIDFVFREDNRIHSFLKRHIHNQFHEWLAHNVQI 451 +EQRE LR+ DEE++++ I F F + HS LKRHIH+Q+HEWL + ++ Sbjct: 613 KEQREILMGLLLGGLRIDSDEERKNHMIRFEFNPSSIPHSILKRHIHDQYHEWLHPSSKL 672 Query: 450 D----DVPCRFTTIPHSCFQLYADQFWPQGLPVIPKLIHRWLTPRVLAYWYMYGGYRTSS 283 D+ +F TI HS F YADQFWP+G PVIPKLIHRWL+P VLAYWYMYGGYRTS+ Sbjct: 673 TAGNGDILHKFNTISHSYFGFYADQFWPKGQPVIPKLIHRWLSPIVLAYWYMYGGYRTSA 732 Query: 282 GDILLKLKFDKEDVVMIAKTIKAKSLNCRVKRRGSVFWMGFLGSDAAEFWKLTEPFVLAD 103 GDILLKLK E V + KT+K+KSLNCRVKR+G VFW+GFL +D+ FWKL EP++L D Sbjct: 733 GDILLKLKGSSEGVEKVVKTLKSKSLNCRVKRKGRVFWIGFLRTDSMWFWKLVEPYILDD 792 Query: 102 LKDSLEANFEPLDG-RLGFNDISFSSDSDTN 13 LKD L+A E D + DI+F S SD++ Sbjct: 793 LKDFLKAGSETADDCAVESRDINFDSASDSD 823 >ref|XP_010258255.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820 [Nelumbo nucifera] Length = 838 Score = 231 bits (589), Expect = 3e-58 Identities = 117/214 (54%), Positives = 150/214 (70%), Gaps = 4/214 (1%) Frame = -2 Query: 630 QEQREXXXXXXXXXLRVKLDEEKRSYAIDFVFREDNRIHSFLKRHIHNQFHEWLAHNVQI 451 +EQRE L ++ DEE+R++AI F F E++ +HS L+RHIH+Q+HEWL Sbjct: 617 KEQREILVGLLLGGLCIETDEERRNHAIHFEFNENSDVHSVLRRHIHDQYHEWLNSPGMP 676 Query: 450 DD---VPCRFTTIPHSCFQLYADQFWPQGLPVIPKLIHRWLTPRVLAYWYMYGGYRTSSG 280 +D +P RF+TI HS F YADQFWP+G PVIPKLIHRWL+PRVLAYWYM+GG R +SG Sbjct: 677 NDEENLPFRFSTIRHSYFGFYADQFWPKGQPVIPKLIHRWLSPRVLAYWYMHGGQRMASG 736 Query: 279 DILLKLK-FDKEDVVMIAKTIKAKSLNCRVKRRGSVFWMGFLGSDAAEFWKLTEPFVLAD 103 DILLKLK +EDV + KT+K KSL+CRVKR+G VFW+GF GS+A FWKLTEP++L Sbjct: 737 DILLKLKSATREDVERVVKTLKTKSLDCRVKRKGRVFWIGFFGSNAVWFWKLTEPYILDG 796 Query: 102 LKDSLEANFEPLDGRLGFNDISFSSDSDTNGNRS 1 LK+ L+ G +I+F SDSD + S Sbjct: 797 LKELLKPGHPLEKGVTEDQNINFDSDSDFDDRAS 830 >ref|XP_009803628.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820 [Nicotiana sylvestris] Length = 797 Score = 228 bits (581), Expect = 2e-57 Identities = 114/216 (52%), Positives = 152/216 (70%), Gaps = 6/216 (2%) Frame = -2 Query: 630 
QEQREXXXXXXXXXLRVKL--DEEKRSYAIDFVFREDNRIHSFLKRHIHNQFHEWL-AHN 460 +EQRE L+++ + +R +AI F F ++RIH+ LK HIHN+FH+WL +H+ Sbjct: 575 KEQREVLMGLLLGGLQIRSYGERRRRVHAIHFEFNAESRIHAILKGHIHNEFHQWLGSHD 634 Query: 459 VQID---DVPCRFTTIPHSCFQLYADQFWPQGLPVIPKLIHRWLTPRVLAYWYMYGGYRT 289 + +D D+P FTTI HSCF YADQFWP G P IPKLIHRWL+P VLAYWYMYGGYRT Sbjct: 635 MMVDGTDDIPSSFTTISHSCFTFYADQFWPNGRPGIPKLIHRWLSPCVLAYWYMYGGYRT 694 Query: 288 SSGDILLKLKFDKEDVVMIAKTIKAKSLNCRVKRRGSVFWMGFLGSDAAEFWKLTEPFVL 109 SSGDILL++K +E+V+ I K +K KSL+CRVK+RG+VFW+GFLG +A FW + +PF+L Sbjct: 695 SSGDILLRVKGSREEVLNIVKALKDKSLDCRVKKRGTVFWIGFLGDNATWFWTVVKPFIL 754 Query: 108 ADLKDSLEANFEPLDGRLGFNDISFSSDSDTNGNRS 1 +LKD L+A +G L I+F S S ++ S Sbjct: 755 GELKDLLKAG-GCSNGSLENQRINFYSGSGSDEKNS 789 >ref|XP_009616749.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820 [Nicotiana tomentosiformis] Length = 802 Score = 227 bits (579), Expect = 4e-57 Identities = 108/195 (55%), Positives = 142/195 (72%), Gaps = 4/195 (2%) Frame = -2 Query: 573 DEEKRSYAIDFVFREDNRIHSFLKRHIHNQFHEWL-AHNVQ---IDDVPCRFTTIPHSCF 406 + +R +AI F F E++RIH+ LK H+H++FHEWL +H++ DD+P FTTI HSCF Sbjct: 597 ERRRRVHAIHFDFNEESRIHAILKGHVHDEFHEWLGSHDLTGDGTDDIPSSFTTISHSCF 656 Query: 405 QLYADQFWPQGLPVIPKLIHRWLTPRVLAYWYMYGGYRTSSGDILLKLKFDKEDVVMIAK 226 YADQFWP G P IPKLIHRWL+P VLAYWYMYGGYRTSSGDILL++K +E+V I K Sbjct: 657 TFYADQFWPNGCPGIPKLIHRWLSPCVLAYWYMYGGYRTSSGDILLRVKGSREEVANIVK 716 Query: 225 TIKAKSLNCRVKRRGSVFWMGFLGSDAAEFWKLTEPFVLADLKDSLEANFEPLDGRLGFN 46 +KA+SL+ VK+RG VFW+GFLG +A FWK+ EP++L LKD L+A + DG L Sbjct: 717 VLKAQSLDSLVKKRGRVFWIGFLGDNATWFWKVVEPYILDKLKDLLKAGGKS-DGSLEIQ 775 Query: 45 DISFSSDSDTNGNRS 1 ++F S S+++ S Sbjct: 776 SVNFDSGSESDEKNS 790 >ref|XP_008236630.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820 [Prunus mume] Length = 830 Score = 226 bits (577), Expect = 6e-57 Identities = 107/214 (50%), Positives = 152/214 (71%), Gaps = 4/214 (1%) Frame = -2 Query: 630 QEQREXXXXXXXXXLRVKLDEEKRSYAIDFVFREDNRIHSFLKRHIHNQFHEWLAHNVQI 451 +EQRE 
L+++ DE+++++ I F F E++ HS L+RH+++Q+HEWL + + Sbjct: 606 KEQREVLVGMLLGGLQIESDEDRKNHMIRFEFSENSSTHSLLRRHMYDQYHEWLHPSCKT 665 Query: 450 ----DDVPCRFTTIPHSCFQLYADQFWPQGLPVIPKLIHRWLTPRVLAYWYMYGGYRTSS 283 DD+P +F+TI HSC YADQFWP+G VIPKLIHRWL+P LAYWYMYGG+R+SS Sbjct: 666 SESTDDIPYKFSTISHSCLGFYADQFWPKGRQVIPKLIHRWLSPCALAYWYMYGGHRSSS 725 Query: 282 GDILLKLKFDKEDVVMIAKTIKAKSLNCRVKRRGSVFWMGFLGSDAAEFWKLTEPFVLAD 103 GDILLK+K ++E V I + +KAKSL+C+VKR+G VFW+GFLGS++ FWKL EP++L D Sbjct: 726 GDILLKIKGNEEGVEKIVRALKAKSLDCKVKRKGRVFWIGFLGSNSTWFWKLVEPYILDD 785 Query: 102 LKDSLEANFEPLDGRLGFNDISFSSDSDTNGNRS 1 LK L+ + + +++F S SDT+ N S Sbjct: 786 LKHLLKGGQISDNSAVETENVNFGSGSDTDENAS 819 >ref|XP_012072457.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820 [Jatropha curcas] gi|643730809|gb|KDP38241.1| hypothetical protein JCGZ_04884 [Jatropha curcas] Length = 846 Score = 226 bits (577), Expect = 6e-57 Identities = 109/215 (50%), Positives = 149/215 (69%), Gaps = 5/215 (2%) Frame = -2 Query: 630 QEQREXXXXXXXXXLRVKLDEEKRSYAIDFVFREDNRIHSFLKRHIHNQFHEWLAHNVQI 451 + QRE L+++ DEE++ + I F F E++ +HS L+RH+++++HEWL + ++ Sbjct: 621 KNQREILVGLLLGGLQIESDEERKRHMIRFEFNENSSVHSVLRRHLYDEYHEWLHPSCKL 680 Query: 450 ----DDVPCRFTTIPHSCFQLYADQFWPQGLPVIPKLIHRWLTPRVLAYWYMYGGYRTSS 283 DD+ RF+TI HS F YADQFWP+G +IPKLIHRWL+P+VLAYWYMYGG+RTSS Sbjct: 681 NDGSDDISYRFSTISHSYFGFYADQFWPKGRAIIPKLIHRWLSPQVLAYWYMYGGHRTSS 740 Query: 282 GDILLKLKFDKEDVVMIAKTIKAKSLNCRVKRRGSVFWMGFLGSDAAEFWKLTEPFVLAD 103 GDILLKLK +E V + K KAKSL+CRVK +G VFW+GFLGSD+ FWKL EP+++ D Sbjct: 741 GDILLKLKGSREGVAKVVKAFKAKSLSCRVKVKGRVFWIGFLGSDSIWFWKLVEPYIIDD 800 Query: 102 LKDSLEANFEPLDGR-LGFNDISFSSDSDTNGNRS 1 LKD L + D + I+F S+SD + S Sbjct: 801 LKDYLRVGDQMSDNNAVETQHINFDSESDIDAAES 835 >ref|XP_004292454.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820 [Fragaria vesca subsp. 
vesca] Length = 794 Score = 226 bits (576), Expect = 8e-57 Identities = 106/215 (49%), Positives = 157/215 (73%), Gaps = 5/215 (2%) Frame = -2 Query: 630 QEQREXXXXXXXXXLRVKLDEEKRSYAIDFVFREDNRIHSFLKRHIHNQFHEWLAHNVQI 451 +EQRE LR++ D++++++ I F F E++ H+ L+RHI++Q+HEWL + ++ Sbjct: 572 KEQREILVGLLLGGLRIESDDDRKNHMIRFEFSENSSAHAVLRRHIYDQYHEWLHPSCKL 631 Query: 450 DD----VPCRFTTIPHSCFQLYADQFWPQGLPVIPKLIHRWLTPRVLAYWYMYGGYRTSS 283 + +P +F++I HS F YAD+FWP G +IPKL+HRWL+P VLAYWYMYGGYRT+S Sbjct: 632 GENTEHIPYKFSSISHSYFGFYADKFWPNGRQMIPKLVHRWLSPCVLAYWYMYGGYRTAS 691 Query: 282 GDILLKLKFDKEDVVMIAKTIKAKSLNCRVKRRGSVFWMGFLGSDAAEFWKLTEPFVLAD 103 GDILLK+K +E V I +T+K +SL+C+VKR+G VFW+GFLGS++ FWKLTEP++L D Sbjct: 692 GDILLKIKGSEEGVQKIVRTLKTRSLDCKVKRKGRVFWIGFLGSNSTLFWKLTEPYILDD 751 Query: 102 LKDSLEANFEPLDGRLGFND-ISFSSDSDTNGNRS 1 LK L+++ E + G N+ ++FSS SD++ N S Sbjct: 752 LKQVLKSDGESSENSTGSNENMNFSSGSDSDENAS 786