BLASTX nr result

ID: Chrysanthemum22_contig00035164 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum22_contig00035164
         (980 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KVH93256.1| LAGLIDADG DNA endonuclease [Cynara cardunculus va...   333   e-105
ref|XP_021989156.1| pentatricopeptide repeat-containing protein ...   332   e-104
ref|XP_023730198.1| pentatricopeptide repeat-containing protein ...   323   e-100
gb|OMP01566.1| hypothetical protein COLO4_11724 [Corchorus olito...   271   3e-81
ref|XP_022726240.1| pentatricopeptide repeat-containing protein ...   271   1e-80
emb|CBI32449.3| unnamed protein product, partial [Vitis vinifera]     269   2e-80
ref|XP_002281969.1| PREDICTED: pentatricopeptide repeat-containi...   269   4e-80
gb|PPD69577.1| hypothetical protein GOBAR_DD33538 [Gossypium bar...   267   3e-79
ref|XP_016715659.1| PREDICTED: pentatricopeptide repeat-containi...   267   3e-79
gb|EOX98180.1| Endonucleases isoform 2 [Theobroma cacao]              262   4e-79
ref|XP_018809767.1| PREDICTED: pentatricopeptide repeat-containi...   265   2e-78
gb|OMO74186.1| hypothetical protein CCACVL1_16922 [Corchorus cap...   264   2e-78
ref|XP_017638502.1| PREDICTED: pentatricopeptide repeat-containi...   265   3e-78
ref|XP_016732545.1| PREDICTED: pentatricopeptide repeat-containi...   264   4e-78
gb|PPR89814.1| hypothetical protein GOBAR_AA30879 [Gossypium bar...   263   6e-78
gb|KCW63832.1| hypothetical protein EUGRSUZ_G01504 [Eucalyptus g...   261   8e-78
ref|XP_012450253.1| PREDICTED: pentatricopeptide repeat-containi...   263   1e-77
ref|XP_015874239.1| PREDICTED: pentatricopeptide repeat-containi...   262   1e-77
ref|XP_007042348.2| PREDICTED: pentatricopeptide repeat-containi...   262   2e-77
gb|EOX98179.1| Pentatricopeptide repeat-containing protein isofo...   262   2e-77

>gb|KVH93256.1| LAGLIDADG DNA endonuclease [Cynara cardunculus var. scolymus]
          Length = 785

 Score =  333 bits (855), Expect = e-105
 Identities = 163/212 (76%), Positives = 176/212 (83%), Gaps = 11/212 (5%)
 Frame = +2

Query: 2    EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181
            EQRE          QI+SDE+ KNH LVFKFNEDS VHKVLKRHI ++Y +WLD S KQD
Sbjct: 572  EQREALVGLLLGGLQIESDEQGKNHKLVFKFNEDSGVHKVLKRHIRNQYHKWLDSSKKQD 631

Query: 182  D------KVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGD 343
                   + TTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGD
Sbjct: 632  GNEDKSCQFTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGD 691

Query: 344  ILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDLK 523
            ILLK+RGSEDGVD++VK L KKSL+CKVKRKGR FWIGLLG NSTWFWKLVDPYIVGDLK
Sbjct: 692  ILLKLRGSEDGVDRIVKTLGKKSLSCKVKRKGRFFWIGLLGSNSTWFWKLVDPYIVGDLK 751

Query: 524  DLVKP-----DLKEDLRTIDFDRSDSDNSEDD 604
            DL+KP     DLKE+ RTI+FDRSDSD SEDD
Sbjct: 752  DLLKPENISSDLKEEARTINFDRSDSDYSEDD 783


>ref|XP_021989156.1| pentatricopeptide repeat-containing protein At2g15820, chloroplastic
            [Helianthus annuus]
 gb|OTG11840.1| putative endonuclease [Helianthus annuus]
          Length = 808

 Score =  332 bits (851), Expect = e-104
 Identities = 159/207 (76%), Positives = 176/207 (85%), Gaps = 5/207 (2%)
 Frame = +2

Query: 2    EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181
            EQRE          +I+SDE RKNH LVFKFNE+S VH VLK+ I +EY EWLDLSS++D
Sbjct: 601  EQREALVGLLLGGVKIESDETRKNHTLVFKFNENSGVHNVLKKRIRNEYNEWLDLSSEKD 660

Query: 182  DKVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIR 361
            D+ TTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGY+TSSGDILLK+R
Sbjct: 661  DQFTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYKTSSGDILLKLR 720

Query: 362  GSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDLKDLVKP- 538
            GSEDGVD++VKALQKKSL CKVKRKG LFWIGLLG NS WFWKLVDPYIV +LKD +KP 
Sbjct: 721  GSEDGVDRIVKALQKKSLTCKVKRKGGLFWIGLLGSNSVWFWKLVDPYIVAELKDHLKPE 780

Query: 539  ----DLKEDLRTIDFDRSDSDNSEDDA 607
                DLKE+ +TIDFD+SDSD SEDDA
Sbjct: 781  RFSSDLKEESQTIDFDKSDSDYSEDDA 807


>ref|XP_023730198.1| pentatricopeptide repeat-containing protein At2g15820, chloroplastic
            [Lactuca sativa]
 ref|XP_023730202.1| pentatricopeptide repeat-containing protein At2g15820, chloroplastic
            [Lactuca sativa]
 ref|XP_023730208.1| pentatricopeptide repeat-containing protein At2g15820, chloroplastic
            [Lactuca sativa]
 gb|PLY97530.1| hypothetical protein LSAT_5X113460 [Lactuca sativa]
          Length = 851

 Score =  323 bits (827), Expect = e-100
 Identities = 156/208 (75%), Positives = 172/208 (82%), Gaps = 5/208 (2%)
 Frame = +2

Query: 2    EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181
            EQRE          QI+SDE+ KNH LVFKFNEDS VHKVLKRHIS EY EWLD S++Q 
Sbjct: 647  EQREALVGLLLGGVQIESDEKGKNHTLVFKFNEDSGVHKVLKRHISYEYHEWLDSSNEQ- 705

Query: 182  DKVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIR 361
               TTI HSYFGFYADQFWPQGQP IPKLIHRWLSPRVLAYWYMYGGY+TSSGDILLK++
Sbjct: 706  --FTTIPHSYFGFYADQFWPQGQPAIPKLIHRWLSPRVLAYWYMYGGYKTSSGDILLKVK 763

Query: 362  GSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDLKDLVKP- 538
            GSEDGVD++VK LQKKSL CKVKRKGR+FWIGLLG NS WFWKLVDPYIV DLKD++KP 
Sbjct: 764  GSEDGVDRIVKTLQKKSLTCKVKRKGRVFWIGLLGSNSEWFWKLVDPYIVRDLKDVLKPG 823

Query: 539  ----DLKEDLRTIDFDRSDSDNSEDDAL 610
                DLKE+ + ++FDRSDSD SEDD L
Sbjct: 824  NIASDLKEEAQNVEFDRSDSDYSEDDIL 851


>gb|OMP01566.1| hypothetical protein COLO4_11724 [Corchorus olitorius]
          Length = 750

 Score =  271 bits (692), Expect = 3e-81
 Identities = 137/223 (61%), Positives = 162/223 (72%), Gaps = 18/223 (8%)
 Frame = +2

Query: 2    EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSK-- 175
            EQRE          QIDSD ERKNHML F+FN++S VH +LKRHI  +Y EWL  SSK  
Sbjct: 525  EQREILVGFLLGGLQIDSDGERKNHMLRFEFNQNSVVHSLLKRHIHDQYHEWLHPSSKLV 584

Query: 176  ---QDD---KVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSS 337
                DD   K +TISH+YFGFYADQFWP+GQ VIPKLIHRWLSP VLAYWYMYGGYRTSS
Sbjct: 585  TYGNDDIPHKFSTISHTYFGFYADQFWPKGQQVIPKLIHRWLSPLVLAYWYMYGGYRTSS 644

Query: 338  GDILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGD 517
            GDILLK++GS +GV+KVVK L+ KSLNC+VKRKG++FWIG +G +STWFWKLV+PYI+ D
Sbjct: 645  GDILLKLKGSREGVEKVVKTLRAKSLNCRVKRKGKVFWIGFIGSDSTWFWKLVEPYILDD 704

Query: 518  LKDLVK-------PDLKEDLRTIDFD---RSDSDNSEDDAL*D 616
            LKDL+K        D   + +  +FD    SDSD    D + D
Sbjct: 705  LKDLLKAGSHDSAEDYAAESQDFNFDSASESDSDEKASDNIDD 747


>ref|XP_022726240.1| pentatricopeptide repeat-containing protein At2g15820, chloroplastic
            [Durio zibethinus]
          Length = 824

 Score =  271 bits (692), Expect = 1e-80
 Identities = 137/221 (61%), Positives = 159/221 (71%), Gaps = 20/221 (9%)
 Frame = +2

Query: 2    EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSK-- 175
            EQRE          QI SD ERKNH + F+FN++S  H +LKRHI  +Y EWL  S K  
Sbjct: 603  EQREILVGLLLGGLQIYSDAERKNHTIRFEFNQNSVTHSILKRHIHDQYHEWLHPSGKPT 662

Query: 176  --QDD---KVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340
               DD   K +TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGYRTSSG
Sbjct: 663  AGSDDIPHKFSTISHSYFGFYADQFWPRGQPVIPKLIHRWLSPLVLAYWYMYGGYRTSSG 722

Query: 341  DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520
            DILLK++GS++GV+KVVK L+ KSLNC+VKRKGR+FWIG LG NSTWFWKLV+P+I+ DL
Sbjct: 723  DILLKLKGSQEGVEKVVKTLKAKSLNCRVKRKGRVFWIGFLGSNSTWFWKLVEPHILDDL 782

Query: 521  KDLVK-------------PDLKEDLRTIDFDRSDSDNSEDD 604
            KD +K              D+  D  + D D  DSD SEDD
Sbjct: 783  KDFLKAGSDTMDNYAVESQDINFDSAS-DSDEKDSDYSEDD 822


>emb|CBI32449.3| unnamed protein product, partial [Vitis vinifera]
          Length = 790

 Score =  269 bits (688), Expect = 2e-80
 Identities = 129/209 (61%), Positives = 160/209 (76%), Gaps = 7/209 (3%)
 Frame = +2

Query: 2    EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181
            EQRE          Q++SDEERKNH++ F+FNE+S  H VL+RHI  +Y EWL+ SSK  
Sbjct: 578  EQREILIGLLLGGLQMESDEERKNHVIYFEFNENSGAHSVLRRHIHEQYHEWLNSSSKLS 637

Query: 182  D-------KVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340
            D       K +TISHSYFGFYADQFWP+G+P+IPKLIHRWLSPRVLAYWYMYGG+RTSSG
Sbjct: 638  DDNDDVPYKFSTISHSYFGFYADQFWPRGRPMIPKLIHRWLSPRVLAYWYMYGGHRTSSG 697

Query: 341  DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520
            DILLK++GS +GV+KVV+ L+ +S++C+VKRKG +FWIGLLG NSTWFWKL++PYI+ D+
Sbjct: 698  DILLKLKGSREGVEKVVRTLKAQSMDCRVKRKGTVFWIGLLGSNSTWFWKLIEPYILDDV 757

Query: 521  KDLVKPDLKEDLRTIDFDRSDSDNSEDDA 607
            KD VK   +    TI F  S SD  E+ A
Sbjct: 758  KDFVKAGCQ---NTISFG-SGSDTDENAA 782


>ref|XP_002281969.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820,
            chloroplastic [Vitis vinifera]
 ref|XP_019074619.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820,
            chloroplastic [Vitis vinifera]
 ref|XP_019074621.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820,
            chloroplastic [Vitis vinifera]
 ref|XP_019074622.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820,
            chloroplastic [Vitis vinifera]
          Length = 823

 Score =  269 bits (688), Expect = 4e-80
 Identities = 129/209 (61%), Positives = 160/209 (76%), Gaps = 7/209 (3%)
 Frame = +2

Query: 2    EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181
            EQRE          Q++SDEERKNH++ F+FNE+S  H VL+RHI  +Y EWL+ SSK  
Sbjct: 611  EQREILIGLLLGGLQMESDEERKNHVIYFEFNENSGAHSVLRRHIHEQYHEWLNSSSKLS 670

Query: 182  D-------KVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340
            D       K +TISHSYFGFYADQFWP+G+P+IPKLIHRWLSPRVLAYWYMYGG+RTSSG
Sbjct: 671  DDNDDVPYKFSTISHSYFGFYADQFWPRGRPMIPKLIHRWLSPRVLAYWYMYGGHRTSSG 730

Query: 341  DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520
            DILLK++GS +GV+KVV+ L+ +S++C+VKRKG +FWIGLLG NSTWFWKL++PYI+ D+
Sbjct: 731  DILLKLKGSREGVEKVVRTLKAQSMDCRVKRKGTVFWIGLLGSNSTWFWKLIEPYILDDV 790

Query: 521  KDLVKPDLKEDLRTIDFDRSDSDNSEDDA 607
            KD VK   +    TI F  S SD  E+ A
Sbjct: 791  KDFVKAGCQ---NTISFG-SGSDTDENAA 815


>gb|PPD69577.1| hypothetical protein GOBAR_DD33538 [Gossypium barbadense]
          Length = 836

 Score =  267 bits (682), Expect = 3e-79
 Identities = 135/221 (61%), Positives = 156/221 (70%), Gaps = 20/221 (9%)
 Frame = +2

Query: 2    EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181
            EQRE          +IDSDEERKNHM+ F+FN  S  H +LKRHI  +Y EWL  SSK  
Sbjct: 614  EQREILMGLLLGGLRIDSDEERKNHMIRFEFNPSSIPHSILKRHIHDQYHEWLHPSSKLT 673

Query: 182  -------DKVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340
                    K  TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGYRTS+G
Sbjct: 674  AGNGDIPHKFNTISHSYFGFYADQFWPKGQPVIPKLIHRWLSPIVLAYWYMYGGYRTSAG 733

Query: 341  DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520
            DILLK++GS +GV KVVKAL+ KSLNC+VKRKGR+FWIG L  +S WFWKLV+PY++ DL
Sbjct: 734  DILLKLKGSSEGVKKVVKALKSKSLNCRVKRKGRVFWIGFLRTDSMWFWKLVEPYVLDDL 793

Query: 521  KDLVK------PDLKEDLRTIDFD-------RSDSDNSEDD 604
            KD +K       D   + R I+FD       +  SD SEDD
Sbjct: 794  KDFLKAGSETADDCAVESRDINFDSASDSDEKGSSDYSEDD 834


>ref|XP_016715659.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820,
            chloroplastic-like [Gossypium hirsutum]
          Length = 836

 Score =  267 bits (682), Expect = 3e-79
 Identities = 135/221 (61%), Positives = 156/221 (70%), Gaps = 20/221 (9%)
 Frame = +2

Query: 2    EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181
            EQRE          +IDSDEERKNHM+ F+FN  S  H +LKRHI  +Y EWL  SSK  
Sbjct: 614  EQREILMGLLLGGLRIDSDEERKNHMIRFEFNPSSIPHSILKRHIHDQYHEWLHPSSKLT 673

Query: 182  -------DKVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340
                    K  TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGYRTS+G
Sbjct: 674  AGNGDIPHKFNTISHSYFGFYADQFWPKGQPVIPKLIHRWLSPIVLAYWYMYGGYRTSAG 733

Query: 341  DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520
            DILLK++GS +GV KVVKAL+ KSLNC+VKRKGR+FWIG L  +S WFWKLV+PY++ DL
Sbjct: 734  DILLKLKGSSEGVKKVVKALKSKSLNCRVKRKGRVFWIGFLRTDSMWFWKLVEPYVLDDL 793

Query: 521  KDLVK------PDLKEDLRTIDFD-------RSDSDNSEDD 604
            KD +K       D   + R I+FD       +  SD SEDD
Sbjct: 794  KDFLKAGSETADDCAVESRDINFDSASDSDEKGSSDYSEDD 834


>gb|EOX98180.1| Endonucleases isoform 2 [Theobroma cacao]
          Length = 621

 Score =  262 bits (669), Expect = 4e-79
 Identities = 128/215 (59%), Positives = 156/215 (72%), Gaps = 14/215 (6%)
 Frame = +2

Query: 2    EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181
            EQR+          +IDSD ERKNHM+ F+FN++S  H +LKRHI  +Y EWL  SSK  
Sbjct: 401  EQRQILVGLLLGGLKIDSDGERKNHMIRFEFNQNSVTHSILKRHIHDQYHEWLHPSSKPT 460

Query: 182  D-------KVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340
            D       K +TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGY+TS G
Sbjct: 461  DGNDDIPHKFSTISHSYFGFYADQFWPRGQPVIPKLIHRWLSPLVLAYWYMYGGYKTSYG 520

Query: 341  DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520
            DILLK++GS +GV+KVVK L+ K+L+C+VKRKG+++WIG LG NS WFWKLV+PYI+ DL
Sbjct: 521  DILLKLKGSREGVEKVVKTLKAKTLHCRVKRKGKVYWIGFLGSNSMWFWKLVEPYILDDL 580

Query: 521  KDLVK------PDLKEDLRTIDFD-RSDSDNSEDD 604
            KD +K           + + I+FD  SDSD    D
Sbjct: 581  KDFLKIGSDTTDGYAVESQDINFDSASDSDEKASD 615


>ref|XP_018809767.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820,
            chloroplastic [Juglans regia]
          Length = 834

 Score =  265 bits (677), Expect = 2e-78
 Identities = 130/220 (59%), Positives = 157/220 (71%), Gaps = 18/220 (8%)
 Frame = +2

Query: 2    EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181
            EQRE          QI+SDEERKNHML F+FNE+S  H VLKRHI  +Y EWL  S K  
Sbjct: 614  EQREILVGLLLGGLQIESDEERKNHMLRFEFNENSSSHFVLKRHIHEQYYEWLHPSCKPS 673

Query: 182  D-------KVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340
            +       +  TISHSYFGFYADQFWP+G+P+IPKLIHRWLSP  LAYWYMYGGYRTSSG
Sbjct: 674  EDAVDIPCRFCTISHSYFGFYADQFWPKGRPMIPKLIHRWLSPCALAYWYMYGGYRTSSG 733

Query: 341  DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520
            DILLK++G+ +GVDKVVKAL+ KSL C+VKRKGR+FWIG LG NS+WFWKL++PY++ D+
Sbjct: 734  DILLKLKGNPEGVDKVVKALKAKSLECRVKRKGRVFWIGFLGSNSSWFWKLIEPYVLDDM 793

Query: 521  KDLVKPDL---------KEDLRTIDFDRSD--SDNSEDDA 607
            KD +K  +          ED+   D   +D  + N  DDA
Sbjct: 794  KDFLKAGVATSENISGETEDMNYDDVSETDEMASNCSDDA 833


>gb|OMO74186.1| hypothetical protein CCACVL1_16922 [Corchorus capsularis]
          Length = 799

 Score =  264 bits (675), Expect = 2e-78
 Identities = 134/223 (60%), Positives = 160/223 (71%), Gaps = 18/223 (8%)
 Frame = +2

Query: 2    EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSK-- 175
            +QRE          QIDSD ER NHML F+FN++S VH +LKRHI  +Y EWL  SSK  
Sbjct: 574  QQREILVGLLLGGLQIDSDGERMNHMLRFEFNQNSVVHSLLKRHIHDQYHEWLHPSSKAV 633

Query: 176  ---QDD---KVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSS 337
                DD   K +TISH+YFGFYADQFWP+GQ VIPKLIHRWLSP VLAYWYMYGGYRTSS
Sbjct: 634  TYGNDDIPHKFSTISHTYFGFYADQFWPKGQQVIPKLIHRWLSPLVLAYWYMYGGYRTSS 693

Query: 338  GDILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGD 517
            GDILLK++GS +GV+KVVK L+ KSLNC+VKRKG++FW+G LG +S WFWKLV+PYI+ D
Sbjct: 694  GDILLKLKGSHEGVEKVVKTLRAKSLNCRVKRKGKVFWLGFLGSDSIWFWKLVEPYILDD 753

Query: 518  LKDLV-------KPDLKEDLRTIDFD---RSDSDNSEDDAL*D 616
            LKDL+         D   + + I+FD    SDSD    D + D
Sbjct: 754  LKDLLMAGNHDSAEDYAAESQDINFDSASESDSDEKASDNIED 796


>ref|XP_017638502.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820,
            chloroplastic [Gossypium arboreum]
 gb|KHG30621.1| hypothetical protein F383_13349 [Gossypium arboreum]
          Length = 836

 Score =  265 bits (676), Expect = 3e-78
 Identities = 133/210 (63%), Positives = 153/210 (72%), Gaps = 14/210 (6%)
 Frame = +2

Query: 2    EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181
            EQRE          +IDSDEERKNHM+ F+FN  S  H +LKRHI  +Y EWL  SSK  
Sbjct: 614  EQREILMGLLLGGLRIDSDEERKNHMIRFEFNPSSIPHSILKRHIHDQYHEWLHPSSKLT 673

Query: 182  -------DKVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340
                    K  TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGYRTS+G
Sbjct: 674  AGNGDILHKFNTISHSYFGFYADQFWPKGQPVIPKLIHRWLSPIVLAYWYMYGGYRTSAG 733

Query: 341  DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520
            DILLK++GS +GV+KVVK L+ KSLNC+VKRKGR+FWIG L  +S WFWKLV+PYI+ DL
Sbjct: 734  DILLKLKGSSEGVEKVVKTLKSKSLNCRVKRKGRVFWIGFLRTDSMWFWKLVEPYILDDL 793

Query: 521  KDLVK------PDLKEDLRTIDFD-RSDSD 589
            KD +K       D   + R I+FD  SDSD
Sbjct: 794  KDFLKAGSETADDCAVESRDINFDSASDSD 823


>ref|XP_016732545.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820,
            chloroplastic-like [Gossypium hirsutum]
          Length = 835

 Score =  264 bits (675), Expect = 4e-78
 Identities = 135/224 (60%), Positives = 157/224 (70%), Gaps = 14/224 (6%)
 Frame = +2

Query: 2    EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181
            EQRE          +IDSDE+RKNHM+ F+FN  S  H +LKRHI  +Y EWL  SSK  
Sbjct: 614  EQREILMGLLLGGLRIDSDEKRKNHMIRFEFNPSSIPHSILKRHIHDQYHEWLHPSSKLT 673

Query: 182  -------DKVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340
                    K  TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGYRTS+G
Sbjct: 674  AGDGDIPHKFNTISHSYFGFYADQFWPKGQPVIPKLIHRWLSPIVLAYWYMYGGYRTSAG 733

Query: 341  DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520
            DILLK++GS +GV+KVVK L+ KSLNC+VKRKGR+FWIG L  +S WFWKLV+PYI+ DL
Sbjct: 734  DILLKLKGSSEGVEKVVKTLKSKSLNCRVKRKGRVFWIGFLRTDSMWFWKLVEPYILDDL 793

Query: 521  KDLVK------PDLKEDLRTIDFD-RSDSDNSEDDAL*DFFCYN 631
            KD +K       D   + R I+FD  SDSD        D + YN
Sbjct: 794  KDFLKAGSETADDCAVESRDINFDSASDSDEKGSS---DCYTYN 834


>gb|PPR89814.1| hypothetical protein GOBAR_AA30879 [Gossypium barbadense]
          Length = 806

 Score =  263 bits (672), Expect = 6e-78
 Identities = 131/207 (63%), Positives = 152/207 (73%), Gaps = 20/207 (9%)
 Frame = +2

Query: 44   QIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD-------DKVTTIS 202
            +IDSDEERKNHM+ F+FN  S  H +LKRHI  +Y EWL  SSK          K  TIS
Sbjct: 598  RIDSDEERKNHMIRFEFNPSSIPHSILKRHIHDQYHEWLHPSSKLTAGNGDIPHKFNTIS 657

Query: 203  HSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIRGSEDGVD 382
            HSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGYRTS+GDILLK++GS +GV 
Sbjct: 658  HSYFGFYADQFWPKGQPVIPKLIHRWLSPIVLAYWYMYGGYRTSAGDILLKLKGSSEGVK 717

Query: 383  KVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDLKDLVK------PDL 544
            KVVKAL+ KSLNC+VKRKGR+FWIG L  +S WFWKLV+PY++ DLKD +K       D 
Sbjct: 718  KVVKALKSKSLNCRVKRKGRVFWIGFLRTDSMWFWKLVEPYVLDDLKDFLKAGSETADDC 777

Query: 545  KEDLRTIDFD-------RSDSDNSEDD 604
              + R I+FD       +  SD SEDD
Sbjct: 778  AVESRDINFDSASDSDEKGSSDYSEDD 804


>gb|KCW63832.1| hypothetical protein EUGRSUZ_G01504 [Eucalyptus grandis]
          Length = 708

 Score =  261 bits (666), Expect = 8e-78
 Identities = 131/210 (62%), Positives = 156/210 (74%), Gaps = 9/210 (4%)
 Frame = +2

Query: 2    EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181
            EQRE          QIDSDE+RKNHM+ FKFNE+S +H  LKRHI   Y EWL  S K D
Sbjct: 500  EQREILVGMLLGGLQIDSDEQRKNHMIKFKFNENSGMHSALKRHIYEHYHEWLHPSCKLD 559

Query: 182  DK-------VTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340
            D         +TI HSYFGFYADQFWP+G+PVIPKLIHRWLSP  LAYWYMYGGYR SSG
Sbjct: 560  DNSNEIPNSFSTIRHSYFGFYADQFWPRGKPVIPKLIHRWLSPCALAYWYMYGGYRMSSG 619

Query: 341  DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520
            DILLK+RGS++GV++VVKAL+ KSL+C+VKRKG+++WIGLLG NSTWFWKL++PY++ DL
Sbjct: 620  DILLKLRGSQEGVERVVKALKAKSLDCRVKRKGQVYWIGLLGSNSTWFWKLIEPYVL-DL 678

Query: 521  KDLVKPDLKEDLRTIDFDRSD--SDNSEDD 604
             +  + D  E L       SD  SDNSE+D
Sbjct: 679  -NFAQEDDGEILSFNSGSDSDKNSDNSEED 707


>ref|XP_012450253.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820
            [Gossypium raimondii]
 gb|KJB12222.1| hypothetical protein B456_002G007000 [Gossypium raimondii]
          Length = 835

 Score =  263 bits (671), Expect = 1e-77
 Identities = 133/221 (60%), Positives = 154/221 (69%), Gaps = 20/221 (9%)
 Frame = +2

Query: 2    EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181
            EQRE          +IDSDEERKNHM+ F+FN  S  H +LKRHI  +Y EWL  SSK  
Sbjct: 613  EQREILMGLLLGGLRIDSDEERKNHMIRFEFNPSSIPHSILKRHIHDQYHEWLHPSSKLT 672

Query: 182  -------DKVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340
                    K  TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGYRTS+G
Sbjct: 673  AGNGDIPHKFNTISHSYFGFYADQFWPKGQPVIPKLIHRWLSPIVLAYWYMYGGYRTSAG 732

Query: 341  DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520
            DILLK++GS +GV KVVK L+ KSLNC+VKRKGR+FWIG L  +S WFWKLV+PY++ +L
Sbjct: 733  DILLKLKGSSEGVKKVVKTLKSKSLNCRVKRKGRVFWIGFLRTDSMWFWKLVEPYVLDEL 792

Query: 521  KDLVK------PDLKEDLRTIDFD-------RSDSDNSEDD 604
            KD +K       D     R I+FD       +  SD SEDD
Sbjct: 793  KDFLKAGSETADDCAVKSRDINFDSASDSDEKGSSDYSEDD 833


>ref|XP_015874239.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820,
            chloroplastic [Ziziphus jujuba]
          Length = 821

 Score =  262 bits (670), Expect = 1e-77
 Identities = 126/215 (58%), Positives = 156/215 (72%), Gaps = 14/215 (6%)
 Frame = +2

Query: 2    EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181
            EQRE          +I+SDEERKNHML F+FNE+S +H +LKRHI  +Y EWL  S K +
Sbjct: 600  EQREILVGLLLGGLKIESDEERKNHMLRFEFNENSGLHSILKRHIHDQYHEWLHPSCKTN 659

Query: 182  DKV-------TTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340
            D +       +TISHSYFGFYADQFWP+G+  IPKLIHRWLSPRVLAYWYMYGG+RTSSG
Sbjct: 660  DAIEDIPCRFSTISHSYFGFYADQFWPKGRQTIPKLIHRWLSPRVLAYWYMYGGHRTSSG 719

Query: 341  DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520
            DILLK++G+++ V+K+VK L+ +SLNC+VK+KGR+FWIG LG NSTWFWKL +PYI+ DL
Sbjct: 720  DILLKLKGNQEAVEKIVKTLKARSLNCRVKKKGRVFWIGFLGNNSTWFWKLTEPYIIDDL 779

Query: 521  KDLVK------PDLKEDLRTIDFDR-SDSDNSEDD 604
            KD +K           +   I F+  SDSD    D
Sbjct: 780  KDSLKVGGETIGSSTYETENISFESGSDSDEKASD 814


>ref|XP_007042348.2| PREDICTED: pentatricopeptide repeat-containing protein At2g15820,
            chloroplastic [Theobroma cacao]
          Length = 823

 Score =  262 bits (669), Expect = 2e-77
 Identities = 128/215 (59%), Positives = 156/215 (72%), Gaps = 14/215 (6%)
 Frame = +2

Query: 2    EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181
            EQR+          +IDSD ERKNHM+ F+FN++S  H +LKRHI  +Y EWL  SSK  
Sbjct: 603  EQRQILVGLLLGGLKIDSDGERKNHMIRFEFNQNSVTHSILKRHIHDQYHEWLHPSSKPT 662

Query: 182  D-------KVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340
            D       K +TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGY+TS G
Sbjct: 663  DGNDDIPHKFSTISHSYFGFYADQFWPRGQPVIPKLIHRWLSPLVLAYWYMYGGYKTSYG 722

Query: 341  DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520
            DILLK++GS +GV+KVVK L+ K+L+C+VKRKG+++WIG LG NS WFWKLV+PYI+ DL
Sbjct: 723  DILLKLKGSREGVEKVVKTLKAKTLHCRVKRKGKVYWIGFLGSNSMWFWKLVEPYILDDL 782

Query: 521  KDLVK------PDLKEDLRTIDFD-RSDSDNSEDD 604
            KD +K           + + I+FD  SDSD    D
Sbjct: 783  KDFLKIGSDTTDGYAVESQDINFDSASDSDEKASD 817


>gb|EOX98179.1| Pentatricopeptide repeat-containing protein isoform 1 [Theobroma
            cacao]
          Length = 823

 Score =  262 bits (669), Expect = 2e-77
 Identities = 128/215 (59%), Positives = 156/215 (72%), Gaps = 14/215 (6%)
 Frame = +2

Query: 2    EQREXXXXXXXXXXQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD 181
            EQR+          +IDSD ERKNHM+ F+FN++S  H +LKRHI  +Y EWL  SSK  
Sbjct: 603  EQRQILVGLLLGGLKIDSDGERKNHMIRFEFNQNSVTHSILKRHIHDQYHEWLHPSSKPT 662

Query: 182  D-------KVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSG 340
            D       K +TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGY+TS G
Sbjct: 663  DGNDDIPHKFSTISHSYFGFYADQFWPRGQPVIPKLIHRWLSPLVLAYWYMYGGYKTSYG 722

Query: 341  DILLKIRGSEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGDL 520
            DILLK++GS +GV+KVVK L+ K+L+C+VKRKG+++WIG LG NS WFWKLV+PYI+ DL
Sbjct: 723  DILLKLKGSREGVEKVVKTLKAKTLHCRVKRKGKVYWIGFLGSNSMWFWKLVEPYILDDL 782

Query: 521  KDLVK------PDLKEDLRTIDFD-RSDSDNSEDD 604
            KD +K           + + I+FD  SDSD    D
Sbjct: 783  KDFLKIGSDTTDGYAVESQDINFDSASDSDEKASD 817

