BLASTX nr result

ID: Chrysanthemum21_contig00015892 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00015892
         (1415 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_021989156.1| pentatricopeptide repeat-containing protein ...   580   0.0  
gb|KVH93256.1| LAGLIDADG DNA endonuclease [Cynara cardunculus va...   559   0.0  
ref|XP_023730198.1| pentatricopeptide repeat-containing protein ...   550   0.0  
ref|XP_021896589.1| pentatricopeptide repeat-containing protein ...   464   e-152
gb|OMP01566.1| hypothetical protein COLO4_11724 [Corchorus olito...   459   e-151
ref|XP_022726240.1| pentatricopeptide repeat-containing protein ...   461   e-151
emb|CBI32449.3| unnamed protein product, partial [Vitis vinifera]     457   e-150
ref|XP_002281969.1| PREDICTED: pentatricopeptide repeat-containi...   457   e-150
gb|EOX98180.1| Endonucleases isoform 2 [Theobroma cacao]              449   e-149
gb|KCW63832.1| hypothetical protein EUGRSUZ_G01504 [Eucalyptus g...   448   e-148
ref|XP_015874239.1| PREDICTED: pentatricopeptide repeat-containi...   451   e-148
gb|OMO74186.1| hypothetical protein CCACVL1_16922 [Corchorus cap...   450   e-147
ref|XP_007042348.2| PREDICTED: pentatricopeptide repeat-containi...   450   e-147
gb|EOX98179.1| Pentatricopeptide repeat-containing protein isofo...   449   e-147
ref|XP_021287294.1| pentatricopeptide repeat-containing protein ...   448   e-146
ref|XP_018809767.1| PREDICTED: pentatricopeptide repeat-containi...   447   e-146
gb|PPD69577.1| hypothetical protein GOBAR_DD33538 [Gossypium bar...   445   e-145
ref|XP_017638502.1| PREDICTED: pentatricopeptide repeat-containi...   445   e-145
ref|XP_016715659.1| PREDICTED: pentatricopeptide repeat-containi...   444   e-145
ref|XP_016732545.1| PREDICTED: pentatricopeptide repeat-containi...   443   e-144

>ref|XP_021989156.1| pentatricopeptide repeat-containing protein At2g15820, chloroplastic
            [Helianthus annuus]
 gb|OTG11840.1| putative endonuclease [Helianthus annuus]
          Length = 808

 Score =  580 bits (1494), Expect = 0.0
 Identities = 281/378 (74%), Positives = 317/378 (83%), Gaps = 5/378 (1%)
 Frame = -1

Query: 1415 SASTAGFHKIIEILSKAQAAELAGSVMEEFIESGKKPLMPSFIDLMDMYFTLGMHDKLEY 1236
            S ST+GFHKIIE+L KAQAAELA SVM+EFIESGKKPL PS+I+LMDMYFTLGMHDKLEY
Sbjct: 430  SPSTSGFHKIIEVLCKAQAAELAESVMKEFIESGKKPLTPSYINLMDMYFTLGMHDKLEY 489

Query: 1235 TFLQSLEKCHPNRTIYNIYLKSLVHTGNLGKAEEILREMQNLEDIGVDTKSCNTILRGYL 1056
            TFLQS+E C PNRT+YN+YLKS+VH GNL KAE++LR+MQNLED+GVD+KSCN IL GYL
Sbjct: 490  TFLQSVENCRPNRTVYNLYLKSMVHIGNLEKAEDVLRQMQNLEDVGVDSKSCNAILMGYL 549

Query: 1055 DGREYAKVEQIYDLMSEKKFHIEPDLMENLERVLKANKEALKNPVTKKLSKEQREXXXXX 876
            DGREY K E+IY LM EKKFHIEPDL+E LE+VL +N+E +KNP T KL+KEQRE     
Sbjct: 550  DGREYVKAEKIYALMKEKKFHIEPDLIETLEQVLSSNEEVVKNPTTLKLNKEQREALVGL 609

Query: 875  XXXXLQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQDDKVTTISHS 696
                ++I+SDE RKNH LVFKFNE+S VH VLK+ I +EY EWLDLSS++DD+ TTISHS
Sbjct: 610  LLGGVKIESDETRKNHTLVFKFNENSGVHNVLKKRIRNEYNEWLDLSSEKDDQFTTISHS 669

Query: 695  YFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIRGSEDGVDKV 516
            YFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGY+TSSGDILLK+RGSEDGVD++
Sbjct: 670  YFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYKTSSGDILLKLRGSEDGVDRI 729

Query: 515  VKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVG-----XXXXXXXXXXXXX 351
            VKALQKKSL CKVKRKG LFWIGLLG NS WFWKLVDPYIV                   
Sbjct: 730  VKALQKKSLTCKVKRKGGLFWIGLLGSNSVWFWKLVDPYIVAELKDHLKPERFSSDLKEE 789

Query: 350  LRTIDFDRSDSDNSEDDA 297
             +TIDFD+SDSD SEDDA
Sbjct: 790  SQTIDFDKSDSDYSEDDA 807


>gb|KVH93256.1| LAGLIDADG DNA endonuclease [Cynara cardunculus var. scolymus]
          Length = 785

 Score =  559 bits (1440), Expect = 0.0
 Identities = 282/383 (73%), Positives = 308/383 (80%), Gaps = 11/383 (2%)
 Frame = -1

Query: 1415 SASTAGFHKIIEILSKAQAAELAGSVMEEFIESGKKPLMPSFIDLMDMYFTLGMHDKLEY 1236
            SAST  FHKIIE+L KA   ELA S+M+EFI SGKK LMPSFI+LM+MY TLGMHDKLEY
Sbjct: 401  SASTVAFHKIIEVLCKAHTTELAESLMKEFIGSGKKALMPSFINLMEMYLTLGMHDKLEY 460

Query: 1235 TFLQSLEKCHPNRTIYNIYLKSLVHTGNLGKAEEILREMQNLEDIGVDTKSCNTILRGYL 1056
             F QS EKC PNRTIYNIYLKSLVH+G+L KAEEILR+MQ+ E +GVDTKSCNTILRGYL
Sbjct: 461  YFFQSFEKCRPNRTIYNIYLKSLVHSGSLDKAEEILRQMQSDETVGVDTKSCNTILRGYL 520

Query: 1055 DGREYAKVEQIYDLMSEKKFHIEPDLMENLERVLKANKEALKNPVTKKLSKEQREXXXXX 876
            DGRE  K EQIY LM EKKF IEP LME LE+VL+AN+EA+KNP+  KLSKEQRE     
Sbjct: 521  DGRENVKAEQIYGLMREKKFQIEPALMEKLEKVLRANEEAVKNPIILKLSKEQREALVGL 580

Query: 875  XXXXLQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQDD------KV 714
                LQI+SDE+ KNH LVFKFNEDS VHKVLKRHI ++Y +WLD S KQD       + 
Sbjct: 581  LLGGLQIESDEQGKNHKLVFKFNEDSGVHKVLKRHIRNQYHKWLDSSKKQDGNEDKSCQF 640

Query: 713  TTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIRGSE 534
            TTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLK+RGSE
Sbjct: 641  TTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKLRGSE 700

Query: 533  DGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVG-----XXXXXXX 369
            DGVD++VK L KKSL+CKVKRKGR FWIGLLG NSTWFWKLVDPYIVG            
Sbjct: 701  DGVDRIVKTLGKKSLSCKVKRKGRFFWIGLLGSNSTWFWKLVDPYIVGDLKDLLKPENIS 760

Query: 368  XXXXXXLRTIDFDRSDSDNSEDD 300
                   RTI+FDRSDSD SEDD
Sbjct: 761  SDLKEEARTINFDRSDSDYSEDD 783


>ref|XP_023730198.1| pentatricopeptide repeat-containing protein At2g15820, chloroplastic
            [Lactuca sativa]
 ref|XP_023730202.1| pentatricopeptide repeat-containing protein At2g15820, chloroplastic
            [Lactuca sativa]
 ref|XP_023730208.1| pentatricopeptide repeat-containing protein At2g15820, chloroplastic
            [Lactuca sativa]
 gb|PLY97530.1| hypothetical protein LSAT_5X113460 [Lactuca sativa]
          Length = 851

 Score =  550 bits (1418), Expect = 0.0
 Identities = 276/379 (72%), Positives = 308/379 (81%), Gaps = 5/379 (1%)
 Frame = -1

Query: 1415 SASTAGFHKIIEILSKAQAAELAGSVMEEFIESGKKPLMPSFIDLMDMYFTLGMHDKLEY 1236
            SAS AGFHKIIEIL K+ + ELA SVM+EFIESGKK LMP F+DLM+MY TL MHD+LEY
Sbjct: 476  SASIAGFHKIIEILCKSNSHELAESVMKEFIESGKKTLMPPFLDLMNMYLTLRMHDRLEY 535

Query: 1235 TFLQSLEKCHPNRTIYNIYLKSLVHTGNLGKAEEILREMQNLEDIGVDTKSCNTILRGYL 1056
            TFLQSLEKC PNRT+YNIYLKSLV + NL KAE+ LREMQ+ E +GVDT+SCNTILRGYL
Sbjct: 536  TFLQSLEKCPPNRTLYNIYLKSLVDSMNLQKAEKTLREMQSHEAVGVDTESCNTILRGYL 595

Query: 1055 DGREYAKVEQIYDLMSEKKFHIEPDLMENLERVLKANKEALKNPVTKKLSKEQREXXXXX 876
            DG+EYAK E+IY LM EKKFHIEPDL+ENLE VL +N+EA+KNPV  KLSKEQRE     
Sbjct: 596  DGKEYAKAEKIYTLMKEKKFHIEPDLIENLEHVLSSNEEAVKNPVILKLSKEQREALVGL 655

Query: 875  XXXXLQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQDDKVTTISHS 696
                +QI+SDE+ KNH LVFKFNEDS VHKVLKRHIS EY EWLD S++Q    TTI HS
Sbjct: 656  LLGGVQIESDEKGKNHTLVFKFNEDSGVHKVLKRHISYEYHEWLDSSNEQ---FTTIPHS 712

Query: 695  YFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIRGSEDGVDKV 516
            YFGFYADQFWPQGQP IPKLIHRWLSPRVLAYWYMYGGY+TSSGDILLK++GSEDGVD++
Sbjct: 713  YFGFYADQFWPQGQPAIPKLIHRWLSPRVLAYWYMYGGYKTSSGDILLKVKGSEDGVDRI 772

Query: 515  VKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVG-----XXXXXXXXXXXXX 351
            VK LQKKSL CKVKRKGR+FWIGLLG NS WFWKLVDPYIV                   
Sbjct: 773  VKTLQKKSLTCKVKRKGRVFWIGLLGSNSEWFWKLVDPYIVRDLKDVLKPGNIASDLKEE 832

Query: 350  LRTIDFDRSDSDNSEDDAL 294
             + ++FDRSDSD SEDD L
Sbjct: 833  AQNVEFDRSDSDYSEDDIL 851


>ref|XP_021896589.1| pentatricopeptide repeat-containing protein At2g15820, chloroplastic
            [Carica papaya]
          Length = 847

 Score =  464 bits (1194), Expect = e-152
 Identities = 232/387 (59%), Positives = 288/387 (74%), Gaps = 15/387 (3%)
 Frame = -1

Query: 1415 SASTAGFHKIIEILSKAQAAELAGSVMEEFIESGKKPLMPSFIDLMDMYFTLGMHDKLEY 1236
            S S A +HKIIEI  KAQ  ELA S+M+EF+ESGKKPLMPS+IDL+DMY +LG+H+KLE 
Sbjct: 458  SPSVAAYHKIIEIFCKAQQIELAESLMKEFVESGKKPLMPSYIDLVDMYLSLGLHNKLEL 517

Query: 1235 TFLQSLEKCHPNRTIYNIYLKSLVHTGNLGKAEEILREMQNLEDIGVDTKSCNTILRGYL 1056
            TF++ +EKC PNRTIYNIYL SLV  GNLGKAEEI  EMQ+   +GV T+SCNTILRGYL
Sbjct: 518  TFVECMEKCRPNRTIYNIYLDSLVRVGNLGKAEEIFNEMQSNGTVGVSTRSCNTILRGYL 577

Query: 1055 DGREYAKVEQIYDLMSEKKFHIEPDLMENLERVLKANKEALKNPVTKKLSKEQREXXXXX 876
               +Y K E+IYD+M + K+ IE  LME L+RVL   ++ +K PV+ KLSKEQRE     
Sbjct: 578  ASGDYVKAEKIYDIMCQNKYDIESPLMEQLDRVLSLVRKDVKKPVSLKLSKEQREILVGL 637

Query: 875  XXXXLQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQDD-------K 717
                LQI SDEERKNHM+ F F E+  VH +L+++I+++Y EWL  SSK  +       K
Sbjct: 638  LLGGLQIVSDEERKNHMIRFDFRENYGVHSILRQYINNQYHEWLHPSSKPSNDSDETPFK 697

Query: 716  VTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIRGS 537
             +TISHSYFGFYADQF+P+G PVIPKLIHRWLSPRVLAYWYMYGG+RTSSGDILL++RGS
Sbjct: 698  FSTISHSYFGFYADQFFPRGVPVIPKLIHRWLSPRVLAYWYMYGGHRTSSGDILLRLRGS 757

Query: 536  EDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGXXXXXXXXXXX 357
             +GV+KVVK L+ KSL+C+VK+KG++FWIGL+G NSTWFWKL +PYI+            
Sbjct: 758  LEGVEKVVKTLKAKSLDCRVKKKGKVFWIGLMGSNSTWFWKLTEPYILEDLKDFLKAGGT 817

Query: 356  XXLRTIDF--------DRSDSDNSEDD 300
              ++ I+F        D+  SD SEDD
Sbjct: 818  GEIQDINFDSGSDLDLDQKTSDYSEDD 844


>gb|OMP01566.1| hypothetical protein COLO4_11724 [Corchorus olitorius]
          Length = 750

 Score =  459 bits (1180), Expect = e-151
 Identities = 226/349 (64%), Positives = 269/349 (77%), Gaps = 8/349 (2%)
 Frame = -1

Query: 1415 SASTAGFHKIIEILSKAQAAELAGSVMEEFIESGKKPLMPSFIDLMDMYFTLGMHDKLEY 1236
            SAS A +HKIIE+L K+Q  +LA S M+EFIESGKKPLMPS+I+L DMY  L +HDK+E 
Sbjct: 354  SASVAAYHKIIEVLCKSQQIDLAESFMKEFIESGKKPLMPSYIELTDMYLNLSLHDKVES 413

Query: 1235 TFLQSLEKCHPNRTIYNIYLKSLVHTGNLGKAEEILREMQNLEDIGVDTKSCNTILRGYL 1056
            TFL+ LEKC PNR IYNIYL SLV  GN+GKA EI R+M     +GV  KSCNTIL GYL
Sbjct: 414  TFLECLEKCQPNRAIYNIYLDSLVKVGNIGKAREIFRQMHQNVAVGVSAKSCNTILGGYL 473

Query: 1055 DGREYAKVEQIYDLMSEKKFHIEPDLMENLERVLKANKEALKNPVTKKLSKEQREXXXXX 876
               ++ K E+IYD+M  KK+ IE  LME LE VL  +++ +K PV+ KLSKEQRE     
Sbjct: 474  SSGDFLKAEKIYDMMCLKKYEIESPLMEKLEYVLSLSRKEVKKPVSLKLSKEQREILVGF 533

Query: 875  XXXXLQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSK-----QDD--- 720
                LQIDSD ERKNHML F+FN++S VH +LKRHI  +Y EWL  SSK      DD   
Sbjct: 534  LLGGLQIDSDGERKNHMLRFEFNQNSVVHSLLKRHIHDQYHEWLHPSSKLVTYGNDDIPH 593

Query: 719  KVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIRG 540
            K +TISH+YFGFYADQFWP+GQ VIPKLIHRWLSP VLAYWYMYGGYRTSSGDILLK++G
Sbjct: 594  KFSTISHTYFGFYADQFWPKGQQVIPKLIHRWLSPLVLAYWYMYGGYRTSSGDILLKLKG 653

Query: 539  SEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIV 393
            S +GV+KVVK L+ KSLNC+VKRKG++FWIG +G +STWFWKLV+PYI+
Sbjct: 654  SREGVEKVVKTLRAKSLNCRVKRKGKVFWIGFIGSDSTWFWKLVEPYIL 702


>ref|XP_022726240.1| pentatricopeptide repeat-containing protein At2g15820, chloroplastic
            [Durio zibethinus]
          Length = 824

 Score =  461 bits (1185), Expect = e-151
 Identities = 234/391 (59%), Positives = 278/391 (71%), Gaps = 19/391 (4%)
 Frame = -1

Query: 1415 SASTAGFHKIIEILSKAQAAELAGSVMEEFIESGKKPLMPSFIDLMDMYFTLGMHDKLEY 1236
            SAS A +HKIIE+  K+Q  + A S+M+EFIE GKKPLMPS+I+L++MY  L +HDKLE 
Sbjct: 432  SASVAAYHKIIEVFCKSQQMDRAESLMKEFIEGGKKPLMPSYIELVEMYLNLSLHDKLEL 491

Query: 1235 TFLQSLEKCHPNRTIYNIYLKSLVHTGNLGKAEEILREMQNLEDIGVDTKSCNTILRGYL 1056
            TFL+ LEKC PNRTIYNIYL SLV  GNLGKAEEI  +M     IGV+ KSCNTIL GYL
Sbjct: 492  TFLECLEKCRPNRTIYNIYLNSLVKVGNLGKAEEIFNQMHGNVTIGVNGKSCNTILDGYL 551

Query: 1055 DGREYAKVEQIYDLMSEKKFHIEPDLMENLERVLKANKEALKNPVTKKLSKEQREXXXXX 876
               ++ K E+IYDLM +KK+ IE  LME L+ VL  +++ +K PV+ KLSKEQRE     
Sbjct: 552  SSGDFFKAEKIYDLMCQKKYEIESSLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILVGL 611

Query: 875  XXXXLQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSK----QDD---K 717
                LQI SD ERKNH + F+FN++S  H +LKRHI  +Y EWL  S K     DD   K
Sbjct: 612  LLGGLQIYSDAERKNHTIRFEFNQNSVTHSILKRHIHDQYHEWLHPSGKPTAGSDDIPHK 671

Query: 716  VTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIRGS 537
             +TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGYRTSSGDILLK++GS
Sbjct: 672  FSTISHSYFGFYADQFWPRGQPVIPKLIHRWLSPLVLAYWYMYGGYRTSSGDILLKLKGS 731

Query: 536  EDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGXXXXXXXXXXX 357
            ++GV+KVVK L+ KSLNC+VKRKGR+FWIG LG NSTWFWKLV+P+I+            
Sbjct: 732  QEGVEKVVKTLKAKSLNCRVKRKGRVFWIGFLGSNSTWFWKLVEPHILDDLKDFLKAGSD 791

Query: 356  XXLR------------TIDFDRSDSDNSEDD 300
                              D D  DSD SEDD
Sbjct: 792  TMDNYAVESQDINFDSASDSDEKDSDYSEDD 822


>emb|CBI32449.3| unnamed protein product, partial [Vitis vinifera]
          Length = 790

 Score =  457 bits (1177), Expect = e-150
 Identities = 220/348 (63%), Positives = 270/348 (77%), Gaps = 7/348 (2%)
 Frame = -1

Query: 1415 SASTAGFHKIIEILSKAQAAELAGSVMEEFIESGKKPLMPSFIDLMDMYFTLGMHDKLEY 1236
            S S   +HKIIE+LSKAQ  EL  S+M EFI SG KPLMPS+IDLM+MYF L +HDKLE 
Sbjct: 407  STSVVAYHKIIEVLSKAQEIELVESLMTEFINSGMKPLMPSYIDLMNMYFNLSLHDKLEA 466

Query: 1235 TFLQSLEKCHPNRTIYNIYLKSLVHTGNLGKAEEILREMQNLEDIGVDTKSCNTILRGYL 1056
             F + LEKC PNR IYNIY+ SLV  GNL KAEEI  +M +   IGV+TKSCNTIL GYL
Sbjct: 467  AFYECLEKCRPNRAIYNIYMDSLVQIGNLDKAEEIFNQMYSNGAIGVNTKSCNTILSGYL 526

Query: 1055 DGREYAKVEQIYDLMSEKKFHIEPDLMENLERVLKANKEALKNPVTKKLSKEQREXXXXX 876
               +Y K E+IYDLM +KK+ I+  LME L+ VL  +++ +K PV+ KLSKEQRE     
Sbjct: 527  SCGDYLKAEKIYDLMCQKKYAIDAPLMEKLDYVLSLSRKVVKRPVSLKLSKEQREILIGL 586

Query: 875  XXXXLQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQDD-------K 717
                LQ++SDEERKNH++ F+FNE+S  H VL+RHI  +Y EWL+ SSK  D       K
Sbjct: 587  LLGGLQMESDEERKNHVIYFEFNENSGAHSVLRRHIHEQYHEWLNSSSKLSDDNDDVPYK 646

Query: 716  VTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIRGS 537
             +TISHSYFGFYADQFWP+G+P+IPKLIHRWLSPRVLAYWYMYGG+RTSSGDILLK++GS
Sbjct: 647  FSTISHSYFGFYADQFWPRGRPMIPKLIHRWLSPRVLAYWYMYGGHRTSSGDILLKLKGS 706

Query: 536  EDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIV 393
             +GV+KVV+ L+ +S++C+VKRKG +FWIGLLG NSTWFWKL++PYI+
Sbjct: 707  REGVEKVVRTLKAQSMDCRVKRKGTVFWIGLLGSNSTWFWKLIEPYIL 754


>ref|XP_002281969.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820,
            chloroplastic [Vitis vinifera]
 ref|XP_019074619.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820,
            chloroplastic [Vitis vinifera]
 ref|XP_019074621.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820,
            chloroplastic [Vitis vinifera]
 ref|XP_019074622.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820,
            chloroplastic [Vitis vinifera]
          Length = 823

 Score =  457 bits (1177), Expect = e-150
 Identities = 220/348 (63%), Positives = 270/348 (77%), Gaps = 7/348 (2%)
 Frame = -1

Query: 1415 SASTAGFHKIIEILSKAQAAELAGSVMEEFIESGKKPLMPSFIDLMDMYFTLGMHDKLEY 1236
            S S   +HKIIE+LSKAQ  EL  S+M EFI SG KPLMPS+IDLM+MYF L +HDKLE 
Sbjct: 440  STSVVAYHKIIEVLSKAQEIELVESLMTEFINSGMKPLMPSYIDLMNMYFNLSLHDKLEA 499

Query: 1235 TFLQSLEKCHPNRTIYNIYLKSLVHTGNLGKAEEILREMQNLEDIGVDTKSCNTILRGYL 1056
             F + LEKC PNR IYNIY+ SLV  GNL KAEEI  +M +   IGV+TKSCNTIL GYL
Sbjct: 500  AFYECLEKCRPNRAIYNIYMDSLVQIGNLDKAEEIFNQMYSNGAIGVNTKSCNTILSGYL 559

Query: 1055 DGREYAKVEQIYDLMSEKKFHIEPDLMENLERVLKANKEALKNPVTKKLSKEQREXXXXX 876
               +Y K E+IYDLM +KK+ I+  LME L+ VL  +++ +K PV+ KLSKEQRE     
Sbjct: 560  SCGDYLKAEKIYDLMCQKKYAIDAPLMEKLDYVLSLSRKVVKRPVSLKLSKEQREILIGL 619

Query: 875  XXXXLQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQDD-------K 717
                LQ++SDEERKNH++ F+FNE+S  H VL+RHI  +Y EWL+ SSK  D       K
Sbjct: 620  LLGGLQMESDEERKNHVIYFEFNENSGAHSVLRRHIHEQYHEWLNSSSKLSDDNDDVPYK 679

Query: 716  VTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIRGS 537
             +TISHSYFGFYADQFWP+G+P+IPKLIHRWLSPRVLAYWYMYGG+RTSSGDILLK++GS
Sbjct: 680  FSTISHSYFGFYADQFWPRGRPMIPKLIHRWLSPRVLAYWYMYGGHRTSSGDILLKLKGS 739

Query: 536  EDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIV 393
             +GV+KVV+ L+ +S++C+VKRKG +FWIGLLG NSTWFWKL++PYI+
Sbjct: 740  REGVEKVVRTLKAQSMDCRVKRKGTVFWIGLLGSNSTWFWKLIEPYIL 787


>gb|EOX98180.1| Endonucleases isoform 2 [Theobroma cacao]
          Length = 621

 Score =  449 bits (1155), Expect = e-149
 Identities = 217/348 (62%), Positives = 268/348 (77%), Gaps = 7/348 (2%)
 Frame = -1

Query: 1415 SASTAGFHKIIEILSKAQAAELAGSVMEEFIESGKKPLMPSFIDLMDMYFTLGMHDKLEY 1236
            SAS A +HKIIE+L K+Q  +LA S+M+EF+ESGKKPLMPS+I+L DMY  + +HDKLE 
Sbjct: 230  SASVAAYHKIIEVLCKSQQMDLAESLMKEFMESGKKPLMPSYIELTDMYLNMSLHDKLES 289

Query: 1235 TFLQSLEKCHPNRTIYNIYLKSLVHTGNLGKAEEILREMQNLEDIGVDTKSCNTILRGYL 1056
            TFL+ LEKC PNRTIYNIYL SLV  GNL KA EI  +M     IGV+ +SCNTIL GYL
Sbjct: 290  TFLECLEKCRPNRTIYNIYLNSLVKVGNLEKAGEIFGQMHGNSTIGVNARSCNTILGGYL 349

Query: 1055 DGREYAKVEQIYDLMSEKKFHIEPDLMENLERVLKANKEALKNPVTKKLSKEQREXXXXX 876
               ++ K E+IYDLM +KK+ IE  L+E L+ VL  +++ +K PV+ KLSKEQR+     
Sbjct: 350  SSGDFLKAEKIYDLMCQKKYEIESLLIEKLDYVLSLSRKEVKKPVSLKLSKEQRQILVGL 409

Query: 875  XXXXLQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQDD-------K 717
                L+IDSD ERKNHM+ F+FN++S  H +LKRHI  +Y EWL  SSK  D       K
Sbjct: 410  LLGGLKIDSDGERKNHMIRFEFNQNSVTHSILKRHIHDQYHEWLHPSSKPTDGNDDIPHK 469

Query: 716  VTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIRGS 537
             +TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGY+TS GDILLK++GS
Sbjct: 470  FSTISHSYFGFYADQFWPRGQPVIPKLIHRWLSPLVLAYWYMYGGYKTSYGDILLKLKGS 529

Query: 536  EDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIV 393
             +GV+KVVK L+ K+L+C+VKRKG+++WIG LG NS WFWKLV+PYI+
Sbjct: 530  REGVEKVVKTLKAKTLHCRVKRKGKVYWIGFLGSNSMWFWKLVEPYIL 577


>gb|KCW63832.1| hypothetical protein EUGRSUZ_G01504 [Eucalyptus grandis]
          Length = 708

 Score =  448 bits (1152), Expect = e-148
 Identities = 219/379 (57%), Positives = 273/379 (72%), Gaps = 7/379 (1%)
 Frame = -1

Query: 1415 SASTAGFHKIIEILSKAQAAELAGSVMEEFIESGKKPLMPSFIDLMDMYFTLGMHDKLEY 1236
            SA+ A +HKIIE++ KAQ  ELA S+M+EF +SG KPL PSFID+M+MYF LG+HDKLE 
Sbjct: 329  SATVAAYHKIIEVICKAQDVELAESLMKEFKDSGLKPLGPSFIDMMNMYFKLGLHDKLES 388

Query: 1235 TFLQSLEKCHPNRTIYNIYLKSLVHTGNLGKAEEILREMQNLEDIGVDTKSCNTILRGYL 1056
             F Q +EKC PNR IY IYL SLV  G++ KAEEI  EM +   IG+  ++CN+IL GYL
Sbjct: 389  AFSQCVEKCQPNRVIYGIYLDSLVRIGDISKAEEIFSEMHSSGAIGIGGRNCNSILGGYL 448

Query: 1055 DGREYAKVEQIYDLMSEKKFHIEPDLMENLERVLKANKEALKNPVTKKLSKEQREXXXXX 876
               +Y K E++Y LM +KK+ IEP  ME L+ +L    +A+K PV+ KL+KEQRE     
Sbjct: 449  SAGDYVKAEKVYHLMCQKKYEIEPASMEKLDPILSLRGKAVKKPVSLKLTKEQREILVGM 508

Query: 875  XXXXLQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQDD-------K 717
                LQIDSDE+RKNHM+ FKFNE+S +H  LKRHI   Y EWL  S K DD        
Sbjct: 509  LLGGLQIDSDEQRKNHMIKFKFNENSGMHSALKRHIYEHYHEWLHPSCKLDDNSNEIPNS 568

Query: 716  VTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIRGS 537
             +TI HSYFGFYADQFWP+G+PVIPKLIHRWLSP  LAYWYMYGGYR SSGDILLK+RGS
Sbjct: 569  FSTIRHSYFGFYADQFWPRGKPVIPKLIHRWLSPCALAYWYMYGGYRMSSGDILLKLRGS 628

Query: 536  EDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVGXXXXXXXXXXX 357
            ++GV++VVKAL+ KSL+C+VKRKG+++WIGLLG NSTWFWKL++PY++            
Sbjct: 629  QEGVERVVKALKAKSLDCRVKRKGQVYWIGLLGSNSTWFWKLIEPYVLDLNFAQEDDGEI 688

Query: 356  XXLRTIDFDRSDSDNSEDD 300
                +      +SDNSE+D
Sbjct: 689  LSFNSGSDSDKNSDNSEED 707


>ref|XP_015874239.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820,
            chloroplastic [Ziziphus jujuba]
          Length = 821

 Score =  451 bits (1160), Expect = e-148
 Identities = 214/348 (61%), Positives = 267/348 (76%), Gaps = 7/348 (2%)
 Frame = -1

Query: 1415 SASTAGFHKIIEILSKAQAAELAGSVMEEFIESGKKPLMPSFIDLMDMYFTLGMHDKLEY 1236
            S S   +HK+IEIL +AQ  ELA SVM EF+ SG KPLMPS++DLM MYF LG+HDK+E 
Sbjct: 429  STSYLAYHKVIEILCRAQEVELAESVMVEFLNSGLKPLMPSYVDLMSMYFDLGLHDKVEL 488

Query: 1235 TFLQSLEKCHPNRTIYNIYLKSLVHTGNLGKAEEILREMQNLEDIGVDTKSCNTILRGYL 1056
             F+Q L+KC PNRTIY IYL SLV   NL KAEEI  +MQN   IGVD +SCN IL GYL
Sbjct: 489  AFIQCLQKCRPNRTIYTIYLDSLVKGSNLEKAEEIFDQMQNSGAIGVDARSCNIILSGYL 548

Query: 1055 DGREYAKVEQIYDLMSEKKFHIEPDLMENLERVLKANKEALKNPVTKKLSKEQREXXXXX 876
               +Y K E+IYDLM +K++ IE +LME ++ VL  +++ +K P++ KLSKEQRE     
Sbjct: 549  SSGDYVKAEKIYDLMCQKRYDIESELMEKIDYVLSLSRKVVKKPLSLKLSKEQREILVGL 608

Query: 875  XXXXLQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQDDKV------ 714
                L+I+SDEERKNHML F+FNE+S +H +LKRHI  +Y EWL  S K +D +      
Sbjct: 609  LLGGLKIESDEERKNHMLRFEFNENSGLHSILKRHIHDQYHEWLHPSCKTNDAIEDIPCR 668

Query: 713  -TTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIRGS 537
             +TISHSYFGFYADQFWP+G+  IPKLIHRWLSPRVLAYWYMYGG+RTSSGDILLK++G+
Sbjct: 669  FSTISHSYFGFYADQFWPKGRQTIPKLIHRWLSPRVLAYWYMYGGHRTSSGDILLKLKGN 728

Query: 536  EDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIV 393
            ++ V+K+VK L+ +SLNC+VK+KGR+FWIG LG NSTWFWKL +PYI+
Sbjct: 729  QEAVEKIVKTLKARSLNCRVKKKGRVFWIGFLGNNSTWFWKLTEPYII 776


>gb|OMO74186.1| hypothetical protein CCACVL1_16922 [Corchorus capsularis]
          Length = 799

 Score =  450 bits (1157), Expect = e-147
 Identities = 219/349 (62%), Positives = 269/349 (77%), Gaps = 8/349 (2%)
 Frame = -1

Query: 1415 SASTAGFHKIIEILSKAQAAELAGSVMEEFIESGKKPLMPSFIDLMDMYFTLGMHDKLEY 1236
            SAS A +HKIIE+L K+Q  +LA S M+EFIESGKKPLMPS+I+L DMY  L +HDK+E 
Sbjct: 403  SASVAAYHKIIEVLCKSQQIDLAESFMKEFIESGKKPLMPSYIELTDMYLNLSLHDKVES 462

Query: 1235 TFLQSLEKCHPNRTIYNIYLKSLVHTGNLGKAEEILREMQNLEDIGVDTKSCNTILRGYL 1056
            TFL+ LEKC PNR IYNIYL SLV  GN+GKA EI ++M     +GV+ KSCNT+L GYL
Sbjct: 463  TFLECLEKCQPNRAIYNIYLDSLVKVGNIGKAVEIFKQMLQNVAVGVNAKSCNTMLGGYL 522

Query: 1055 DGREYAKVEQIYDLMSEKKFHIEPDLMENLERVLKANKEALKNPVTKKLSKEQREXXXXX 876
               ++ K E++YDLM +KK+ IE  LME L+ +L  +++ +K PV+ KLSK+QRE     
Sbjct: 523  SSGDFLKAEKLYDLMCQKKYEIESPLMEKLDYILSLSRKEVKKPVSLKLSKQQREILVGL 582

Query: 875  XXXXLQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSK-----QDD--- 720
                LQIDSD ER NHML F+FN++S VH +LKRHI  +Y EWL  SSK      DD   
Sbjct: 583  LLGGLQIDSDGERMNHMLRFEFNQNSVVHSLLKRHIHDQYHEWLHPSSKAVTYGNDDIPH 642

Query: 719  KVTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIRG 540
            K +TISH+YFGFYADQFWP+GQ VIPKLIHRWLSP VLAYWYMYGGYRTSSGDILLK++G
Sbjct: 643  KFSTISHTYFGFYADQFWPKGQQVIPKLIHRWLSPLVLAYWYMYGGYRTSSGDILLKLKG 702

Query: 539  SEDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIV 393
            S +GV+KVVK L+ KSLNC+VKRKG++FW+G LG +S WFWKLV+PYI+
Sbjct: 703  SHEGVEKVVKTLRAKSLNCRVKRKGKVFWLGFLGSDSIWFWKLVEPYIL 751


>ref|XP_007042348.2| PREDICTED: pentatricopeptide repeat-containing protein At2g15820,
            chloroplastic [Theobroma cacao]
          Length = 823

 Score =  450 bits (1158), Expect = e-147
 Identities = 217/348 (62%), Positives = 268/348 (77%), Gaps = 7/348 (2%)
 Frame = -1

Query: 1415 SASTAGFHKIIEILSKAQAAELAGSVMEEFIESGKKPLMPSFIDLMDMYFTLGMHDKLEY 1236
            SAS A +HKIIE+L K+Q  +LA S+M+EF+ESGKKPLMPS+I+L DMY  + +HDKLE 
Sbjct: 432  SASVAAYHKIIEVLCKSQQMDLAESLMKEFMESGKKPLMPSYIELTDMYLNMSLHDKLES 491

Query: 1235 TFLQSLEKCHPNRTIYNIYLKSLVHTGNLGKAEEILREMQNLEDIGVDTKSCNTILRGYL 1056
            TFL+ LEKC PNRTIYNIYL SLV  GNL KA EI  +M     IGV+ +SCNTIL GYL
Sbjct: 492  TFLECLEKCRPNRTIYNIYLNSLVKVGNLEKAGEIFGQMHGNSTIGVNARSCNTILGGYL 551

Query: 1055 DGREYAKVEQIYDLMSEKKFHIEPDLMENLERVLKANKEALKNPVTKKLSKEQREXXXXX 876
               ++ K E+IYDLM +KK+ IE  L+E L+ VL  +++ +K PV+ KLSKEQR+     
Sbjct: 552  SSGDFLKAEKIYDLMCQKKYEIESPLIEKLDYVLSLSRKEVKKPVSLKLSKEQRQILVGL 611

Query: 875  XXXXLQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQDD-------K 717
                L+IDSD ERKNHM+ F+FN++S  H +LKRHI  +Y EWL  SSK  D       K
Sbjct: 612  LLGGLKIDSDGERKNHMIRFEFNQNSVTHSILKRHIHDQYHEWLHPSSKPTDGNDDIPHK 671

Query: 716  VTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIRGS 537
             +TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGY+TS GDILLK++GS
Sbjct: 672  FSTISHSYFGFYADQFWPRGQPVIPKLIHRWLSPLVLAYWYMYGGYKTSYGDILLKLKGS 731

Query: 536  EDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIV 393
             +GV+KVVK L+ K+L+C+VKRKG+++WIG LG NS WFWKLV+PYI+
Sbjct: 732  REGVEKVVKTLKAKTLHCRVKRKGKVYWIGFLGSNSMWFWKLVEPYIL 779


>gb|EOX98179.1| Pentatricopeptide repeat-containing protein isoform 1 [Theobroma
            cacao]
          Length = 823

 Score =  449 bits (1155), Expect = e-147
 Identities = 217/348 (62%), Positives = 268/348 (77%), Gaps = 7/348 (2%)
 Frame = -1

Query: 1415 SASTAGFHKIIEILSKAQAAELAGSVMEEFIESGKKPLMPSFIDLMDMYFTLGMHDKLEY 1236
            SAS A +HKIIE+L K+Q  +LA S+M+EF+ESGKKPLMPS+I+L DMY  + +HDKLE 
Sbjct: 432  SASVAAYHKIIEVLCKSQQMDLAESLMKEFMESGKKPLMPSYIELTDMYLNMSLHDKLES 491

Query: 1235 TFLQSLEKCHPNRTIYNIYLKSLVHTGNLGKAEEILREMQNLEDIGVDTKSCNTILRGYL 1056
            TFL+ LEKC PNRTIYNIYL SLV  GNL KA EI  +M     IGV+ +SCNTIL GYL
Sbjct: 492  TFLECLEKCRPNRTIYNIYLNSLVKVGNLEKAGEIFGQMHGNSTIGVNARSCNTILGGYL 551

Query: 1055 DGREYAKVEQIYDLMSEKKFHIEPDLMENLERVLKANKEALKNPVTKKLSKEQREXXXXX 876
               ++ K E+IYDLM +KK+ IE  L+E L+ VL  +++ +K PV+ KLSKEQR+     
Sbjct: 552  SSGDFLKAEKIYDLMCQKKYEIESLLIEKLDYVLSLSRKEVKKPVSLKLSKEQRQILVGL 611

Query: 875  XXXXLQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQDD-------K 717
                L+IDSD ERKNHM+ F+FN++S  H +LKRHI  +Y EWL  SSK  D       K
Sbjct: 612  LLGGLKIDSDGERKNHMIRFEFNQNSVTHSILKRHIHDQYHEWLHPSSKPTDGNDDIPHK 671

Query: 716  VTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIRGS 537
             +TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGY+TS GDILLK++GS
Sbjct: 672  FSTISHSYFGFYADQFWPRGQPVIPKLIHRWLSPLVLAYWYMYGGYKTSYGDILLKLKGS 731

Query: 536  EDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIV 393
             +GV+KVVK L+ K+L+C+VKRKG+++WIG LG NS WFWKLV+PYI+
Sbjct: 732  REGVEKVVKTLKAKTLHCRVKRKGKVYWIGFLGSNSMWFWKLVEPYIL 779


>ref|XP_021287294.1| pentatricopeptide repeat-containing protein At2g15820, chloroplastic
            [Herrania umbratica]
          Length = 823

 Score =  448 bits (1152), Expect = e-146
 Identities = 218/348 (62%), Positives = 269/348 (77%), Gaps = 7/348 (2%)
 Frame = -1

Query: 1415 SASTAGFHKIIEILSKAQAAELAGSVMEEFIESGKKPLMPSFIDLMDMYFTLGMHDKLEY 1236
            SAS A +HKIIE+L K+Q  +LA S+M+EF+ESGKKPLMPS+I+L DMY  + +HDKLE 
Sbjct: 432  SASVAAYHKIIEVLCKSQQMDLAESLMKEFMESGKKPLMPSYIELTDMYLNVSLHDKLES 491

Query: 1235 TFLQSLEKCHPNRTIYNIYLKSLVHTGNLGKAEEILREMQNLEDIGVDTKSCNTILRGYL 1056
            TFL+ LEKC PNR+IYNIYL SLV  GNL KA EI  +M     IGV+ KSCNTIL GYL
Sbjct: 492  TFLECLEKCRPNRSIYNIYLNSLVKVGNLEKAGEIFSQMHGNATIGVNAKSCNTILGGYL 551

Query: 1055 DGREYAKVEQIYDLMSEKKFHIEPDLMENLERVLKANKEALKNPVTKKLSKEQREXXXXX 876
               ++ K E+IYDLM +KK+ IE  L+E L+ VL  +++ +K PV+ KLSKEQRE     
Sbjct: 552  SSGDFLKAEKIYDLMWQKKYEIESPLIEKLDYVLSLSRKEVKKPVSLKLSKEQREILVGL 611

Query: 875  XXXXLQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSK----QDD---K 717
                L+IDSD ERKNHM+ F+FN++S  H +LKRHI  +Y EWL  S+K     DD   +
Sbjct: 612  LLGGLKIDSDGERKNHMIRFEFNQNSFTHSILKRHIHDQYHEWLHPSTKPTGGNDDIPHR 671

Query: 716  VTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIRGS 537
             +TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGY+TS GDILLK++GS
Sbjct: 672  FSTISHSYFGFYADQFWPRGQPVIPKLIHRWLSPLVLAYWYMYGGYKTSCGDILLKLKGS 731

Query: 536  EDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIV 393
             +GV+KVVK L+ K+L+C+VKRKG++FWIG LG NS WFWKLV+PYI+
Sbjct: 732  HEGVEKVVKTLKAKTLHCRVKRKGKVFWIGFLGSNSMWFWKLVEPYIL 779


>ref|XP_018809767.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820,
            chloroplastic [Juglans regia]
          Length = 834

 Score =  447 bits (1151), Expect = e-146
 Identities = 217/348 (62%), Positives = 268/348 (77%), Gaps = 7/348 (2%)
 Frame = -1

Query: 1415 SASTAGFHKIIEILSKAQAAELAGSVMEEFIESGKKPLMPSFIDLMDMYFTLGMHDKLEY 1236
            S+S A +H+IIE+L KAQ  ELA S+M EFI+S  KPL PS+ID+M+MYF L +HDKLE 
Sbjct: 443  SSSVAAYHEIIEVLCKAQEVELAESLMVEFIKSNLKPLTPSYIDVMNMYFNLSLHDKLEL 502

Query: 1235 TFLQSLEKCHPNRTIYNIYLKSLVHTGNLGKAEEILREMQNLEDIGVDTKSCNTILRGYL 1056
             F Q LEKC PNRT+Y+IYL SLV  GNL +AEEI   M++ + IGV+++SCNTIL GYL
Sbjct: 503  VFSQCLEKCQPNRTVYSIYLDSLVKVGNLDRAEEIFNVMRSNQAIGVNSRSCNTILGGYL 562

Query: 1055 DGREYAKVEQIYDLMSEKKFHIEPDLMENLERVLKANKEALKNPVTKKLSKEQREXXXXX 876
               EY K E+IYDLM +K++ I+  LME L+ VL  +++ +K PV+ KLSKEQRE     
Sbjct: 563  SSGEYVKAEKIYDLMCQKRYGIDSPLMEKLDYVLSLSRKQVKKPVSLKLSKEQREILVGL 622

Query: 875  XXXXLQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQDD-------K 717
                LQI+SDEERKNHML F+FNE+S  H VLKRHI  +Y EWL  S K  +       +
Sbjct: 623  LLGGLQIESDEERKNHMLRFEFNENSSSHFVLKRHIHEQYYEWLHPSCKPSEDAVDIPCR 682

Query: 716  VTTISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIRGS 537
              TISHSYFGFYADQFWP+G+P+IPKLIHRWLSP  LAYWYMYGGYRTSSGDILLK++G+
Sbjct: 683  FCTISHSYFGFYADQFWPKGRPMIPKLIHRWLSPCALAYWYMYGGYRTSSGDILLKLKGN 742

Query: 536  EDGVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIV 393
             +GVDKVVKAL+ KSL C+VKRKGR+FWIG LG NS+WFWKL++PY++
Sbjct: 743  PEGVDKVVKALKAKSLECRVKRKGRVFWIGFLGSNSSWFWKLIEPYVL 790


>gb|PPD69577.1| hypothetical protein GOBAR_DD33538 [Gossypium barbadense]
          Length = 836

 Score =  445 bits (1144), Expect = e-145
 Identities = 231/390 (59%), Positives = 272/390 (69%), Gaps = 20/390 (5%)
 Frame = -1

Query: 1409 STAGFHKIIEILSKAQAAELAGSVMEEFIESGKKPLMPSFIDLMDMYFTLGMHDKLEYTF 1230
            S A +HKIIE+L +++  +LA S M+E IESG KPLMPS+I L D Y  L  HDKLE TF
Sbjct: 445  SIASYHKIIEVLCESEQMDLAESFMKELIESGMKPLMPSYIKLTDTYLRLNYHDKLESTF 504

Query: 1229 LQSLEKCHPNRTIYNIYLKSLVHTGNLGKAEEILREMQNLEDIGVDTKSCNTILRGYLDG 1050
            L+ LEKC PNRTIY+IYL SLV  GNLGKAEEI   M     IGV+ KSCNTIL GYL  
Sbjct: 505  LECLEKCRPNRTIYSIYLSSLVKVGNLGKAEEIFNHMGKNVTIGVNAKSCNTILYGYLSS 564

Query: 1049 REYAKVEQIYDLMSEKKFHIEPDLMENLERVLKANKEALKNPVTKKLSKEQREXXXXXXX 870
             + +K E+IYDLM +KKF IE  LME LE VL+++++ +K PV+ KLSKEQRE       
Sbjct: 565  GDNSKAEKIYDLMCQKKFEIESPLMEKLESVLRSSRKEVKKPVSLKLSKEQREILMGLLL 624

Query: 869  XXLQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQ-------DDKVT 711
              L+IDSDEERKNHM+ F+FN  S  H +LKRHI  +Y EWL  SSK          K  
Sbjct: 625  GGLRIDSDEERKNHMIRFEFNPSSIPHSILKRHIHDQYHEWLHPSSKLTAGNGDIPHKFN 684

Query: 710  TISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIRGSED 531
            TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGYRTS+GDILLK++GS +
Sbjct: 685  TISHSYFGFYADQFWPKGQPVIPKLIHRWLSPIVLAYWYMYGGYRTSAGDILLKLKGSSE 744

Query: 530  GVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVG------XXXXXXX 369
            GV KVVKAL+ KSLNC+VKRKGR+FWIG L  +S WFWKLV+PY++              
Sbjct: 745  GVKKVVKALKSKSLNCRVKRKGRVFWIGFLRTDSMWFWKLVEPYVLDDLKDFLKAGSETA 804

Query: 368  XXXXXXLRTIDFD-------RSDSDNSEDD 300
                   R I+FD       +  SD SEDD
Sbjct: 805  DDCAVESRDINFDSASDSDEKGSSDYSEDD 834


>ref|XP_017638502.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820,
            chloroplastic [Gossypium arboreum]
 gb|KHG30621.1| hypothetical protein F383_13349 [Gossypium arboreum]
          Length = 836

 Score =  445 bits (1144), Expect = e-145
 Identities = 221/346 (63%), Positives = 260/346 (75%), Gaps = 7/346 (2%)
 Frame = -1

Query: 1409 STAGFHKIIEILSKAQAAELAGSVMEEFIESGKKPLMPSFIDLMDMYFTLGMHDKLEYTF 1230
            S A +HKIIE+L +++  +LA S M+E IESG KPLMPS+I L D Y  L  HDKLE TF
Sbjct: 445  SIASYHKIIEVLCESEQMDLAESFMKELIESGMKPLMPSYIKLTDTYLRLNCHDKLESTF 504

Query: 1229 LQSLEKCHPNRTIYNIYLKSLVHTGNLGKAEEILREMQNLEDIGVDTKSCNTILRGYLDG 1050
            L+ LEKC PNRTIYNIYL SLV  GNLGKAEEI   M     IGV+ KSCNTIL GYL  
Sbjct: 505  LECLEKCRPNRTIYNIYLSSLVKVGNLGKAEEIFNHMGENVTIGVNAKSCNTILCGYLSS 564

Query: 1049 REYAKVEQIYDLMSEKKFHIEPDLMENLERVLKANKEALKNPVTKKLSKEQREXXXXXXX 870
             + +K E+IYDLM +KKF IE  LME LE VL+++++ +K P++ KLSKEQRE       
Sbjct: 565  GDNSKAEKIYDLMCQKKFEIESPLMEKLENVLRSSRKEVKKPLSLKLSKEQREILMGLLL 624

Query: 869  XXLQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQD-------DKVT 711
              L+IDSDEERKNHM+ F+FN  S  H +LKRHI  +Y EWL  SSK          K  
Sbjct: 625  GGLRIDSDEERKNHMIRFEFNPSSIPHSILKRHIHDQYHEWLHPSSKLTAGNGDILHKFN 684

Query: 710  TISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIRGSED 531
            TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGYRTS+GDILLK++GS +
Sbjct: 685  TISHSYFGFYADQFWPKGQPVIPKLIHRWLSPIVLAYWYMYGGYRTSAGDILLKLKGSSE 744

Query: 530  GVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIV 393
            GV+KVVK L+ KSLNC+VKRKGR+FWIG L  +S WFWKLV+PYI+
Sbjct: 745  GVEKVVKTLKSKSLNCRVKRKGRVFWIGFLRTDSMWFWKLVEPYIL 790


>ref|XP_016715659.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820,
            chloroplastic-like [Gossypium hirsutum]
          Length = 836

 Score =  444 bits (1142), Expect = e-145
 Identities = 231/390 (59%), Positives = 271/390 (69%), Gaps = 20/390 (5%)
 Frame = -1

Query: 1409 STAGFHKIIEILSKAQAAELAGSVMEEFIESGKKPLMPSFIDLMDMYFTLGMHDKLEYTF 1230
            S A +HKIIE+L +++  +LA S M+E IESG KPLMPS+I L D Y  L  HDKLE TF
Sbjct: 445  SIASYHKIIEVLCESEQMDLAESFMKELIESGMKPLMPSYIKLTDTYLRLNYHDKLESTF 504

Query: 1229 LQSLEKCHPNRTIYNIYLKSLVHTGNLGKAEEILREMQNLEDIGVDTKSCNTILRGYLDG 1050
            L+ LEKC PNRTIY+IYL SLV  GNLGKAEEI   M     IGV+ KSCNTIL GYL  
Sbjct: 505  LECLEKCRPNRTIYSIYLSSLVKVGNLGKAEEIFNHMGKNVTIGVNAKSCNTILYGYLSS 564

Query: 1049 REYAKVEQIYDLMSEKKFHIEPDLMENLERVLKANKEALKNPVTKKLSKEQREXXXXXXX 870
             +  K E+IYDLM +KKF IE  LME LE VL+++++ +K PV+ KLSKEQRE       
Sbjct: 565  GDNLKAEKIYDLMCQKKFEIESPLMEKLESVLRSSRKEVKKPVSLKLSKEQREILMGLLL 624

Query: 869  XXLQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQ-------DDKVT 711
              L+IDSDEERKNHM+ F+FN  S  H +LKRHI  +Y EWL  SSK          K  
Sbjct: 625  GGLRIDSDEERKNHMIRFEFNPSSIPHSILKRHIHDQYHEWLHPSSKLTAGNGDIPHKFN 684

Query: 710  TISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIRGSED 531
            TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGYRTS+GDILLK++GS +
Sbjct: 685  TISHSYFGFYADQFWPKGQPVIPKLIHRWLSPIVLAYWYMYGGYRTSAGDILLKLKGSSE 744

Query: 530  GVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIVG------XXXXXXX 369
            GV KVVKAL+ KSLNC+VKRKGR+FWIG L  +S WFWKLV+PY++              
Sbjct: 745  GVKKVVKALKSKSLNCRVKRKGRVFWIGFLRTDSMWFWKLVEPYVLDDLKDFLKAGSETA 804

Query: 368  XXXXXXLRTIDFD-------RSDSDNSEDD 300
                   R I+FD       +  SD SEDD
Sbjct: 805  DDCAVESRDINFDSASDSDEKGSSDYSEDD 834


>ref|XP_016732545.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820,
            chloroplastic-like [Gossypium hirsutum]
          Length = 835

 Score =  443 bits (1140), Expect = e-144
 Identities = 220/346 (63%), Positives = 260/346 (75%), Gaps = 7/346 (2%)
 Frame = -1

Query: 1409 STAGFHKIIEILSKAQAAELAGSVMEEFIESGKKPLMPSFIDLMDMYFTLGMHDKLEYTF 1230
            S A +HKIIE+L +++  +LA S M+E IESG KPLMPS+I L D Y  L  HDKLE TF
Sbjct: 445  SIASYHKIIEVLCESEQMDLAESFMKELIESGMKPLMPSYIKLTDTYLRLNCHDKLESTF 504

Query: 1229 LQSLEKCHPNRTIYNIYLKSLVHTGNLGKAEEILREMQNLEDIGVDTKSCNTILRGYLDG 1050
            L+ LEKC PNRTIYNIYL SLV  GNLGKAEEI   M     IGV+ KSCNTIL GYL  
Sbjct: 505  LECLEKCRPNRTIYNIYLSSLVKVGNLGKAEEIFNHMGENVTIGVNAKSCNTILCGYLSS 564

Query: 1049 REYAKVEQIYDLMSEKKFHIEPDLMENLERVLKANKEALKNPVTKKLSKEQREXXXXXXX 870
             + +K E+IYDLM +KKF IE  LME LE VL+++++ +K P++ KLSKEQRE       
Sbjct: 565  GDNSKAEKIYDLMCQKKFEIESPLMEKLENVLRSSRKEVKKPLSLKLSKEQREILMGLLL 624

Query: 869  XXLQIDSDEERKNHMLVFKFNEDSDVHKVLKRHISSEYKEWLDLSSKQ-------DDKVT 711
              L+IDSDE+RKNHM+ F+FN  S  H +LKRHI  +Y EWL  SSK          K  
Sbjct: 625  GGLRIDSDEKRKNHMIRFEFNPSSIPHSILKRHIHDQYHEWLHPSSKLTAGDGDIPHKFN 684

Query: 710  TISHSYFGFYADQFWPQGQPVIPKLIHRWLSPRVLAYWYMYGGYRTSSGDILLKIRGSED 531
            TISHSYFGFYADQFWP+GQPVIPKLIHRWLSP VLAYWYMYGGYRTS+GDILLK++GS +
Sbjct: 685  TISHSYFGFYADQFWPKGQPVIPKLIHRWLSPIVLAYWYMYGGYRTSAGDILLKLKGSSE 744

Query: 530  GVDKVVKALQKKSLNCKVKRKGRLFWIGLLGKNSTWFWKLVDPYIV 393
            GV+KVVK L+ KSLNC+VKRKGR+FWIG L  +S WFWKLV+PYI+
Sbjct: 745  GVEKVVKTLKSKSLNCRVKRKGRVFWIGFLRTDSMWFWKLVEPYIL 790


Top