BLASTX nr result
ID: Chrysanthemum22_contig00035164
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Chrysanthemum22_contig00035164 (980 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|KVH93256.1| LAGLIDADG DNA endonuclease [Cynara cardunculus va... 333 e-105 ref|XP_021989156.1| pentatricopeptide repeat-containing protein ... 332 e-104 ref|XP_023730198.1| pentatricopeptide repeat-containing protein ... 323 e-100 gb|OMP01566.1| hypothetical protein COLO4_11724 [Corchorus olito... 271 3e-81 ref|XP_022726240.1| pentatricopeptide repeat-containing protein ... 271 1e-80 emb|CBI32449.3| unnamed protein product, partial [Vitis vinifera] 269 2e-80 ref|XP_002281969.1| PREDICTED: pentatricopeptide repeat-containi... 269 4e-80 gb|PPD69577.1| hypothetical protein GOBAR_DD33538 [Gossypium bar... 267 3e-79 ref|XP_016715659.1| PREDICTED: pentatricopeptide repeat-containi... 267 3e-79 gb|EOX98180.1| Endonucleases isoform 2 [Theobroma cacao] 262 4e-79 ref|XP_018809767.1| PREDICTED: pentatricopeptide repeat-containi... 265 2e-78 gb|OMO74186.1| hypothetical protein CCACVL1_16922 [Corchorus cap... 264 2e-78 ref|XP_017638502.1| PREDICTED: pentatricopeptide repeat-containi... 265 3e-78 ref|XP_016732545.1| PREDICTED: pentatricopeptide repeat-containi... 264 4e-78 gb|PPR89814.1| hypothetical protein GOBAR_AA30879 [Gossypium bar... 263 6e-78 gb|KCW63832.1| hypothetical protein EUGRSUZ_G01504 [Eucalyptus g... 261 8e-78 ref|XP_012450253.1| PREDICTED: pentatricopeptide repeat-containi... 263 1e-77 ref|XP_015874239.1| PREDICTED: pentatricopeptide repeat-containi... 262 1e-77 ref|XP_007042348.2| PREDICTED: pentatricopeptide repeat-containi... 262 2e-77 gb|EOX98179.1| Pentatricopeptide repeat-containing protein isofo... 262 2e-77 >gb|KVH93256.1| LAGLIDADG DNA endonuclease [Cynara cardunculus var. scolymus] Length = 785 Score = 333 bits (855), Expect = e-105 Identities = 163/212 (76%), Positives = 176/212 (83%), Gaps = 11/212 (5%) Frame = +2 Query: 2 EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181 EQRE QI+SDE+ KNH LVFKFNEDS VHKVLKRHI ++Y +WLD S KQD Sbjct: 572 EQREALVGLLLGGLQIESDEQGKNHKLVFKFNEDSGVHKVLKRHIRNQYHKWLDSSKKQD 631 Query: 182 D------KVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGD 343 + TTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGD Sbjct: 632 GNEDKSCQFTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGD 691 Query: 344 ILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDLK 523 ILLK+RGSEDGVD++VK L KKSL+CKVKRKGR FWIGLLG NSTWFWKLVDPYIVGDLK Sbjct: 692 ILLKLRGSEDGVDRIVKTLGKKSLSCKVKRKGRFFWIGLLGSNSTWFWKLVDPYIVGDLK 751 Query: 524 DLVKP-----DLKEDLRTIDFDRSDSDNSEDD 604 DL+KP DLKE+ RTI+FDRSDSD SEDD Sbjct: 752 DLLKPENISSDLKEEARTINFDRSDSDYSEDD 783 >ref|XP_021989156.1| pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Helianthus annuus] gb|OTG11840.1| putative endonuclease [Helianthus annuus] Length = 808 Score = 332 bits (851), Expect = e-104 Identities = 159/207 (76%), Positives = 176/207 (85%), Gaps = 5/207 (2%) Frame = +2 Query: 2 EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181 EQRE +I+SDE RKNH LVFKFNE+S VH VLK+ I +EY EWLDLSS++D Sbjct: 601 EQREALVGLLLGGVKIESDETRKNHTLVFKFNENSGVHNVLKKRIRNEYNEWLDLSSEKD 660 Query: 182 DKVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIR 361 D+ TTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGY+TSSGDILLK+R Sbjct: 661 DQFTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYKTSSGDILLKLR 720 Query: 362 GSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDLKDLVKP- 538 GSEDGVD++VKALQKKSL CKVKRKG LFWIGLLG NS WFWKLVDPYIV +LKD +KP Sbjct: 721 GSEDGVDRIVKALQKKSLTCKVKRKGGLFWIGLLGSNSVWFWKLVDPYIVAELKDHLKPE 780 Query: 539 ----DLKEDLRTIDFDRSDSDNSEDDA 607 DLKE+ +TIDFD+SDSD SEDDA Sbjct: 781 RFSSDLKEESQTIDFDKSDSDYSEDDA 807 >ref|XP_023730198.1| pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Lactuca sativa] ref|XP_023730202.1| pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Lactuca sativa] ref|XP_023730208.1| pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Lactuca sativa] gb|PLY97530.1| hypothetical protein LSAT_5X113460 [Lactuca sativa] Length = 851 Score = 323 bits (827), Expect = e-100 Identities = 156/208 (75%), Positives = 172/208 (82%), Gaps = 5/208 (2%) Frame = +2 Query: 2 EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181 EQRE QI+SDE+ KNH LVFKFNEDS VHKVLKRHIS EY EWLD S++Q Sbjct: 647 EQREALVGLLLGGVQIESDEKGKNHTLVFKFNEDSGVHKVLKRHISYEYHEWLDSSNEQ- 705 Query: 182 DKVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIR 361 TTI HSYFGFYADQFWPQGQP IPKLIHRWLSPRVLAYWYMYGGY+TSSGDILLK++ Sbjct: 706 --FTTIPHSYFGFYADQFWPQGQPAIPKLIHRWLSPRVLAYWYMYGGYKTSSGDILLKVK 763 Query: 362 GSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDLKDLVKP- 538 GSEDGVD++VK LQKKSL CKVKRKGR+FWIGLLG NS WFWKLVDPYIV DLKD++KP Sbjct: 764 GSEDGVDRIVKTLQKKSLTCKVKRKGRVFWIGLLGSNSEWFWKLVDPYIVRDLKDVLKPG 823 Query: 539 ----DLKEDLRTIDFDRSDSDNSEDDAL 610 DLKE+ + ++FDRSDSD SEDD L Sbjct: 824 NIASDLKEEAQNVEFDRSDSDYSEDDIL 851 >gb|OMP01566.1| hypothetical protein COLO4_11724 [Corchorus olitorius] Length = 750 Score = 271 bits (692), Expect = 3e-81 Identities = 137/223 (61%), Positives = 162/223 (72%), Gaps = 18/223 (8%) Frame = +2 Query: 2 EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSK-- 175 EQRE QIDSD ERKNHML F+FN++S VH +LKRHI +Y EWL SSK Sbjct: 525 EQREILVGFLLGGLQIDSDGERKNHMLRFEFNQNSVVHSLLKRHIHDQYHEWLHPSSKLV 584 Query: 176 ---QDD---KVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSS 337 DD K +TISH+YFGFYADQFWP+GQ VIPKLIHRWLSP VLAYWYMYGGYRTSS Sbjct: 585 TYGNDDIPHKFSTISHTYFGFYADQFWPKGQQVIPKLIHRWLSPLVLAYWYMYGGYRTSS 644 Query: 338 GDILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGD 517 GDILLK++GS +GV+KVVK L+ KSLNC+VKRKG++FWIG +G +STWFWKLV+PYI+ D Sbjct: 645 GDILLKLKGSREGVEKVVKTLRAKSLNCRVKRKGKVFWIGFIGSDSTWFWKLVEPYILDD 704 Query: 518 LKDLVK-------PDLKEDLRTIDFD---RSDSDNSEDDAL*D 616 LKDL+K D + + +FD SDSD D + D Sbjct: 705 LKDLLKAGSHDSAEDYAAESQDFNFDSASESDSDEKASDNIDD 747 >ref|XP_022726240.1| pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Durio zibethinus] Length = 824 Score = 271 bits (692), Expect = 1e-80 Identities = 137/221 (61%), Positives = 159/221 (71%), Gaps = 20/221 (9%) Frame = +2 Query: 2 EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSK-- 175 EQRE QI SD ERKNH + F+FN++S H +LKRHI +Y EWL S K Sbjct: 603 EQREILVGLLLGGLQIYSDAERKNHTIRFEFNQNSVTHSILKRHIHDQYHEWLHPSGKPT 662 Query: 176 --QDD---KVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340 DD K +TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGYRTSSG Sbjct: 663 AGSDDIPHKFSTISHSYFGFYADQFWPRGQPVIPKLIHRWLSPLVLAYWYMYGGYRTSSG 722 Query: 341 DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520 DILLK++GS++GV+KVVK L+ KSLNC+VKRKGR+FWIG LG NSTWFWKLV+P+I+ DL Sbjct: 723 DILLKLKGSQEGVEKVVKTLKAKSLNCRVKRKGRVFWIGFLGSNSTWFWKLVEPHILDDL 782 Query: 521 KDLVK-------------PDLKEDLRTIDFDRSDSDNSEDD 604 KD +K D+ D + D D DSD SEDD Sbjct: 783 KDFLKAGSDTMDNYAVESQDINFDSAS-DSDEKDSDYSEDD 822 >emb|CBI32449.3| unnamed protein product, partial [Vitis vinifera] Length = 790 Score = 269 bits (688), Expect = 2e-80 Identities = 129/209 (61%), Positives = 160/209 (76%), Gaps = 7/209 (3%) Frame = +2 Query: 2 EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181 EQRE Q++SDEERKNH++ F+FNE+S H VL+RHI +Y EWL+ SSK Sbjct: 578 EQREILIGLLLGGLQMESDEERKNHVIYFEFNENSGAHSVLRRHIHEQYHEWLNSSSKLS 637 Query: 182 D-------KVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340 D K +TISHSYFGFYADQFWP+G+P+IPKLIHRWLSPRVLAYWYMYGG+RTSSG Sbjct: 638 DDNDDVPYKFSTISHSYFGFYADQFWPRGRPMIPKLIHRWLSPRVLAYWYMYGGHRTSSG 697 Query: 341 DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520 DILLK++GS +GV+KVV+ L+ +S++C+VKRKG +FWIGLLG NSTWFWKL++PYI+ D+ Sbjct: 698 DILLKLKGSREGVEKVVRTLKAQSMDCRVKRKGTVFWIGLLGSNSTWFWKLIEPYILDDV 757 Query: 521 KDLVKPDLKEDLRTIDFDRSDSDNSEDDA 607 KD VK + TI F S SD E+ A Sbjct: 758 KDFVKAGCQ---NTISFG-SGSDTDENAA 782 >ref|XP_002281969.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Vitis vinifera] ref|XP_019074619.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Vitis vinifera] ref|XP_019074621.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Vitis vinifera] ref|XP_019074622.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Vitis vinifera] Length = 823 Score = 269 bits (688), Expect = 4e-80 Identities = 129/209 (61%), Positives = 160/209 (76%), Gaps = 7/209 (3%) Frame = +2 Query: 2 EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181 EQRE Q++SDEERKNH++ F+FNE+S H VL+RHI +Y EWL+ SSK Sbjct: 611 EQREILIGLLLGGLQMESDEERKNHVIYFEFNENSGAHSVLRRHIHEQYHEWLNSSSKLS 670 Query: 182 D-------KVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340 D K +TISHSYFGFYADQFWP+G+P+IPKLIHRWLSPRVLAYWYMYGG+RTSSG Sbjct: 671 DDNDDVPYKFSTISHSYFGFYADQFWPRGRPMIPKLIHRWLSPRVLAYWYMYGGHRTSSG 730 Query: 341 DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520 DILLK++GS +GV+KVV+ L+ +S++C+VKRKG +FWIGLLG NSTWFWKL++PYI+ D+ Sbjct: 731 DILLKLKGSREGVEKVVRTLKAQSMDCRVKRKGTVFWIGLLGSNSTWFWKLIEPYILDDV 790 Query: 521 KDLVKPDLKEDLRTIDFDRSDSDNSEDDA 607 KD VK + TI F S SD E+ A Sbjct: 791 KDFVKAGCQ---NTISFG-SGSDTDENAA 815 >gb|PPD69577.1| hypothetical protein GOBAR_DD33538 [Gossypium barbadense] Length = 836 Score = 267 bits (682), Expect = 3e-79 Identities = 135/221 (61%), Positives = 156/221 (70%), Gaps = 20/221 (9%) Frame = +2 Query: 2 EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181 EQRE +IDSDEERKNHM+ F+FN S H +LKRHI +Y EWL SSK Sbjct: 614 EQREILMGLLLGGLRIDSDEERKNHMIRFEFNPSSIPHSILKRHIHDQYHEWLHPSSKLT 673 Query: 182 -------DKVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340 K TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGYRTS+G Sbjct: 674 AGNGDIPHKFNTISHSYFGFYADQFWPKGQPVIPKLIHRWLSPIVLAYWYMYGGYRTSAG 733 Query: 341 DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520 DILLK++GS +GV KVVKAL+ KSLNC+VKRKGR+FWIG L +S WFWKLV+PY++ DL Sbjct: 734 DILLKLKGSSEGVKKVVKALKSKSLNCRVKRKGRVFWIGFLRTDSMWFWKLVEPYVLDDL 793 Query: 521 KDLVK------PDLKEDLRTIDFD-------RSDSDNSEDD 604 KD +K D + R I+FD + SD SEDD Sbjct: 794 KDFLKAGSETADDCAVESRDINFDSASDSDEKGSSDYSEDD 834 >ref|XP_016715659.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like [Gossypium hirsutum] Length = 836 Score = 267 bits (682), Expect = 3e-79 Identities = 135/221 (61%), Positives = 156/221 (70%), Gaps = 20/221 (9%) Frame = +2 Query: 2 EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181 EQRE +IDSDEERKNHM+ F+FN S H +LKRHI +Y EWL SSK Sbjct: 614 EQREILMGLLLGGLRIDSDEERKNHMIRFEFNPSSIPHSILKRHIHDQYHEWLHPSSKLT 673 Query: 182 -------DKVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340 K TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGYRTS+G Sbjct: 674 AGNGDIPHKFNTISHSYFGFYADQFWPKGQPVIPKLIHRWLSPIVLAYWYMYGGYRTSAG 733 Query: 341 DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520 DILLK++GS +GV KVVKAL+ KSLNC+VKRKGR+FWIG L +S WFWKLV+PY++ DL Sbjct: 734 DILLKLKGSSEGVKKVVKALKSKSLNCRVKRKGRVFWIGFLRTDSMWFWKLVEPYVLDDL 793 Query: 521 KDLVK------PDLKEDLRTIDFD-------RSDSDNSEDD 604 KD +K D + R I+FD + SD SEDD Sbjct: 794 KDFLKAGSETADDCAVESRDINFDSASDSDEKGSSDYSEDD 834 >gb|EOX98180.1| Endonucleases isoform 2 [Theobroma cacao] Length = 621 Score = 262 bits (669), Expect = 4e-79 Identities = 128/215 (59%), Positives = 156/215 (72%), Gaps = 14/215 (6%) Frame = +2 Query: 2 EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181 EQR+ +IDSD ERKNHM+ F+FN++S H +LKRHI +Y EWL SSK Sbjct: 401 EQRQILVGLLLGGLKIDSDGERKNHMIRFEFNQNSVTHSILKRHIHDQYHEWLHPSSKPT 460 Query: 182 D-------KVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340 D K +TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGY+TS G Sbjct: 461 DGNDDIPHKFSTISHSYFGFYADQFWPRGQPVIPKLIHRWLSPLVLAYWYMYGGYKTSYG 520 Query: 341 DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520 DILLK++GS +GV+KVVK L+ K+L+C+VKRKG+++WIG LG NS WFWKLV+PYI+ DL Sbjct: 521 DILLKLKGSREGVEKVVKTLKAKTLHCRVKRKGKVYWIGFLGSNSMWFWKLVEPYILDDL 580 Query: 521 KDLVK------PDLKEDLRTIDFD-RSDSDNSEDD 604 KD +K + + I+FD SDSD D Sbjct: 581 KDFLKIGSDTTDGYAVESQDINFDSASDSDEKASD 615 >ref|XP_018809767.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Juglans regia] Length = 834 Score = 265 bits (677), Expect = 2e-78 Identities = 130/220 (59%), Positives = 157/220 (71%), Gaps = 18/220 (8%) Frame = +2 Query: 2 EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181 EQRE QI+SDEERKNHML F+FNE+S H VLKRHI +Y EWL S K Sbjct: 614 EQREILVGLLLGGLQIESDEERKNHMLRFEFNENSSSHFVLKRHIHEQYYEWLHPSCKPS 673 Query: 182 D-------KVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340 + + TISHSYFGFYADQFWP+G+P+IPKLIHRWLSP LAYWYMYGGYRTSSG Sbjct: 674 EDAVDIPCRFCTISHSYFGFYADQFWPKGRPMIPKLIHRWLSPCALAYWYMYGGYRTSSG 733 Query: 341 DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520 DILLK++G+ +GVDKVVKAL+ KSL C+VKRKGR+FWIG LG NS+WFWKL++PY++ D+ Sbjct: 734 DILLKLKGNPEGVDKVVKALKAKSLECRVKRKGRVFWIGFLGSNSSWFWKLIEPYVLDDM 793 Query: 521 KDLVKPDL---------KEDLRTIDFDRSD--SDNSEDDA 607 KD +K + ED+ D +D + N DDA Sbjct: 794 KDFLKAGVATSENISGETEDMNYDDVSETDEMASNCSDDA 833 >gb|OMO74186.1| hypothetical protein CCACVL1_16922 [Corchorus capsularis] Length = 799 Score = 264 bits (675), Expect = 2e-78 Identities = 134/223 (60%), Positives = 160/223 (71%), Gaps = 18/223 (8%) Frame = +2 Query: 2 EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSK-- 175 +QRE QIDSD ER NHML F+FN++S VH +LKRHI +Y EWL SSK Sbjct: 574 QQREILVGLLLGGLQIDSDGERMNHMLRFEFNQNSVVHSLLKRHIHDQYHEWLHPSSKAV 633 Query: 176 ---QDD---KVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSS 337 DD K +TISH+YFGFYADQFWP+GQ VIPKLIHRWLSP VLAYWYMYGGYRTSS Sbjct: 634 TYGNDDIPHKFSTISHTYFGFYADQFWPKGQQVIPKLIHRWLSPLVLAYWYMYGGYRTSS 693 Query: 338 GDILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGD 517 GDILLK++GS +GV+KVVK L+ KSLNC+VKRKG++FW+G LG +S WFWKLV+PYI+ D Sbjct: 694 GDILLKLKGSHEGVEKVVKTLRAKSLNCRVKRKGKVFWLGFLGSDSIWFWKLVEPYILDD 753 Query: 518 LKDLV-------KPDLKEDLRTIDFD---RSDSDNSEDDAL*D 616 LKDL+ D + + I+FD SDSD D + D Sbjct: 754 LKDLLMAGNHDSAEDYAAESQDINFDSASESDSDEKASDNIED 796 >ref|XP_017638502.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Gossypium arboreum] gb|KHG30621.1| hypothetical protein F383_13349 [Gossypium arboreum] Length = 836 Score = 265 bits (676), Expect = 3e-78 Identities = 133/210 (63%), Positives = 153/210 (72%), Gaps = 14/210 (6%) Frame = +2 Query: 2 EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181 EQRE +IDSDEERKNHM+ F+FN S H +LKRHI +Y EWL SSK Sbjct: 614 EQREILMGLLLGGLRIDSDEERKNHMIRFEFNPSSIPHSILKRHIHDQYHEWLHPSSKLT 673 Query: 182 -------DKVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340 K TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGYRTS+G Sbjct: 674 AGNGDILHKFNTISHSYFGFYADQFWPKGQPVIPKLIHRWLSPIVLAYWYMYGGYRTSAG 733 Query: 341 DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520 DILLK++GS +GV+KVVK L+ KSLNC+VKRKGR+FWIG L +S WFWKLV+PYI+ DL Sbjct: 734 DILLKLKGSSEGVEKVVKTLKSKSLNCRVKRKGRVFWIGFLRTDSMWFWKLVEPYILDDL 793 Query: 521 KDLVK------PDLKEDLRTIDFD-RSDSD 589 KD +K D + R I+FD SDSD Sbjct: 794 KDFLKAGSETADDCAVESRDINFDSASDSD 823 >ref|XP_016732545.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like [Gossypium hirsutum] Length = 835 Score = 264 bits (675), Expect = 4e-78 Identities = 135/224 (60%), Positives = 157/224 (70%), Gaps = 14/224 (6%) Frame = +2 Query: 2 EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181 EQRE +IDSDE+RKNHM+ F+FN S H +LKRHI +Y EWL SSK Sbjct: 614 EQREILMGLLLGGLRIDSDEKRKNHMIRFEFNPSSIPHSILKRHIHDQYHEWLHPSSKLT 673 Query: 182 -------DKVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340 K TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGYRTS+G Sbjct: 674 AGDGDIPHKFNTISHSYFGFYADQFWPKGQPVIPKLIHRWLSPIVLAYWYMYGGYRTSAG 733 Query: 341 DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520 DILLK++GS +GV+KVVK L+ KSLNC+VKRKGR+FWIG L +S WFWKLV+PYI+ DL Sbjct: 734 DILLKLKGSSEGVEKVVKTLKSKSLNCRVKRKGRVFWIGFLRTDSMWFWKLVEPYILDDL 793 Query: 521 KDLVK------PDLKEDLRTIDFD-RSDSDNSEDDAL*DFFCYN 631 KD +K D + R I+FD SDSD D + YN Sbjct: 794 KDFLKAGSETADDCAVESRDINFDSASDSDEKGSS---DCYTYN 834 >gb|PPR89814.1| hypothetical protein GOBAR_AA30879 [Gossypium barbadense] Length = 806 Score = 263 bits (672), Expect = 6e-78 Identities = 131/207 (63%), Positives = 152/207 (73%), Gaps = 20/207 (9%) Frame = +2 Query: 44 QIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD-------DKVTTIS 202 +IDSDEERKNHM+ F+FN S H +LKRHI +Y EWL SSK K TIS Sbjct: 598 RIDSDEERKNHMIRFEFNPSSIPHSILKRHIHDQYHEWLHPSSKLTAGNGDIPHKFNTIS 657 Query: 203 HSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIRGSEDGVD 382 HSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGYRTS+GDILLK++GS +GV Sbjct: 658 HSYFGFYADQFWPKGQPVIPKLIHRWLSPIVLAYWYMYGGYRTSAGDILLKLKGSSEGVK 717 Query: 383 KVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDLKDLVK------PDL 544 KVVKAL+ KSLNC+VKRKGR+FWIG L +S WFWKLV+PY++ DLKD +K D Sbjct: 718 KVVKALKSKSLNCRVKRKGRVFWIGFLRTDSMWFWKLVEPYVLDDLKDFLKAGSETADDC 777 Query: 545 KEDLRTIDFD-------RSDSDNSEDD 604 + R I+FD + SD SEDD Sbjct: 778 AVESRDINFDSASDSDEKGSSDYSEDD 804 >gb|KCW63832.1| hypothetical protein EUGRSUZ_G01504 [Eucalyptus grandis] Length = 708 Score = 261 bits (666), Expect = 8e-78 Identities = 131/210 (62%), Positives = 156/210 (74%), Gaps = 9/210 (4%) Frame = +2 Query: 2 EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181 EQRE QIDSDE+RKNHM+ FKFNE+S +H LKRHI Y EWL S K D Sbjct: 500 EQREILVGMLLGGLQIDSDEQRKNHMIKFKFNENSGMHSALKRHIYEHYHEWLHPSCKLD 559 Query: 182 DK-------VTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340 D +TI HSYFGFYADQFWP+G+PVIPKLIHRWLSP LAYWYMYGGYR SSG Sbjct: 560 DNSNEIPNSFSTIRHSYFGFYADQFWPRGKPVIPKLIHRWLSPCALAYWYMYGGYRMSSG 619 Query: 341 DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520 DILLK+RGS++GV++VVKAL+ KSL+C+VKRKG+++WIGLLG NSTWFWKL++PY++ DL Sbjct: 620 DILLKLRGSQEGVERVVKALKAKSLDCRVKRKGQVYWIGLLGSNSTWFWKLIEPYVL-DL 678 Query: 521 KDLVKPDLKEDLRTIDFDRSD--SDNSEDD 604 + + D E L SD SDNSE+D Sbjct: 679 -NFAQEDDGEILSFNSGSDSDKNSDNSEED 707 >ref|XP_012450253.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820 [Gossypium raimondii] gb|KJB12222.1| hypothetical protein B456_002G007000 [Gossypium raimondii] Length = 835 Score = 263 bits (671), Expect = 1e-77 Identities = 133/221 (60%), Positives = 154/221 (69%), Gaps = 20/221 (9%) Frame = +2 Query: 2 EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181 EQRE +IDSDEERKNHM+ F+FN S H +LKRHI +Y EWL SSK Sbjct: 613 EQREILMGLLLGGLRIDSDEERKNHMIRFEFNPSSIPHSILKRHIHDQYHEWLHPSSKLT 672 Query: 182 -------DKVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340 K TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGYRTS+G Sbjct: 673 AGNGDIPHKFNTISHSYFGFYADQFWPKGQPVIPKLIHRWLSPIVLAYWYMYGGYRTSAG 732 Query: 341 DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520 DILLK++GS +GV KVVK L+ KSLNC+VKRKGR+FWIG L +S WFWKLV+PY++ +L Sbjct: 733 DILLKLKGSSEGVKKVVKTLKSKSLNCRVKRKGRVFWIGFLRTDSMWFWKLVEPYVLDEL 792 Query: 521 KDLVK------PDLKEDLRTIDFD-------RSDSDNSEDD 604 KD +K D R I+FD + SD SEDD Sbjct: 793 KDFLKAGSETADDCAVKSRDINFDSASDSDEKGSSDYSEDD 833 >ref|XP_015874239.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Ziziphus jujuba] Length = 821 Score = 262 bits (670), Expect = 1e-77 Identities = 126/215 (58%), Positives = 156/215 (72%), Gaps = 14/215 (6%) Frame = +2 Query: 2 EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181 EQRE +I+SDEERKNHML F+FNE+S +H +LKRHI +Y EWL S K + Sbjct: 600 EQREILVGLLLGGLKIESDEERKNHMLRFEFNENSGLHSILKRHIHDQYHEWLHPSCKTN 659 Query: 182 DKV-------TTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340 D + +TISHSYFGFYADQFWP+G+ IPKLIHRWLSPRVLAYWYMYGG+RTSSG Sbjct: 660 DAIEDIPCRFSTISHSYFGFYADQFWPKGRQTIPKLIHRWLSPRVLAYWYMYGGHRTSSG 719 Query: 341 DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520 DILLK++G+++ V+K+VK L+ +SLNC+VK+KGR+FWIG LG NSTWFWKL +PYI+ DL Sbjct: 720 DILLKLKGNQEAVEKIVKTLKARSLNCRVKKKGRVFWIGFLGNNSTWFWKLTEPYIIDDL 779 Query: 521 KDLVK------PDLKEDLRTIDFDR-SDSDNSEDD 604 KD +K + I F+ SDSD D Sbjct: 780 KDSLKVGGETIGSSTYETENISFESGSDSDEKASD 814 >ref|XP_007042348.2| PREDICTED: pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Theobroma cacao] Length = 823 Score = 262 bits (669), Expect = 2e-77 Identities = 128/215 (59%), Positives = 156/215 (72%), Gaps = 14/215 (6%) Frame = +2 Query: 2 EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181 EQR+ +IDSD ERKNHM+ F+FN++S H +LKRHI +Y EWL SSK Sbjct: 603 EQRQILVGLLLGGLKIDSDGERKNHMIRFEFNQNSVTHSILKRHIHDQYHEWLHPSSKPT 662 Query: 182 D-------KVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340 D K +TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGY+TS G Sbjct: 663 DGNDDIPHKFSTISHSYFGFYADQFWPRGQPVIPKLIHRWLSPLVLAYWYMYGGYKTSYG 722 Query: 341 DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520 DILLK++GS +GV+KVVK L+ K+L+C+VKRKG+++WIG LG NS WFWKLV+PYI+ DL Sbjct: 723 DILLKLKGSREGVEKVVKTLKAKTLHCRVKRKGKVYWIGFLGSNSMWFWKLVEPYILDDL 782 Query: 521 KDLVK------PDLKEDLRTIDFD-RSDSDNSEDD 604 KD +K + + I+FD SDSD D Sbjct: 783 KDFLKIGSDTTDGYAVESQDINFDSASDSDEKASD 817 >gb|EOX98179.1| Pentatricopeptide repeat-containing protein isoform 1 [Theobroma cacao] Length = 823 Score = 262 bits (669), Expect = 2e-77 Identities = 128/215 (59%), Positives = 156/215 (72%), Gaps = 14/215 (6%) Frame = +2 Query: 2 EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181 EQR+ +IDSD ERKNHM+ F+FN++S H +LKRHI +Y EWL SSK Sbjct: 603 EQRQILVGLLLGGLKIDSDGERKNHMIRFEFNQNSVTHSILKRHIHDQYHEWLHPSSKPT 662 Query: 182 D-------KVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340 D K +TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGY+TS G Sbjct: 663 DGNDDIPHKFSTISHSYFGFYADQFWPRGQPVIPKLIHRWLSPLVLAYWYMYGGYKTSYG 722 Query: 341 DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520 DILLK++GS +GV+KVVK L+ K+L+C+VKRKG+++WIG LG NS WFWKLV+PYI+ DL Sbjct: 723 DILLKLKGSREGVEKVVKTLKAKTLHCRVKRKGKVYWIGFLGSNSMWFWKLVEPYILDDL 782 Query: 521 KDLVK------PDLKEDLRTIDFD-RSDSDNSEDD 604 KD +K + + I+FD SDSD D Sbjct: 783 KDFLKIGSDTTDGYAVESQDINFDSASDSDEKASD 817