BLASTX nr result

ID: Chrysanthemum22_contig00000328 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum22_contig00000328
         (2155 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_023751596.1| protein CHUP1, chloroplastic [Lactuca sativa...   572   0.0  
ref|XP_021997473.1| protein CHUP1, chloroplastic [Helianthus ann...   567   0.0  
gb|KVI05336.1| hypothetical protein Ccrd_016339 [Cynara carduncu...   548   e-179
emb|CDP00563.1| unnamed protein product [Coffea canephora]            493   e-158
emb|CBI27077.3| unnamed protein product, partial [Vitis vinifera]     487   e-156
ref|XP_002281154.2| PREDICTED: protein CHUP1, chloroplastic [Vit...   487   e-156
ref|XP_004298311.1| PREDICTED: protein CHUP1, chloroplastic [Fra...   482   e-154
ref|XP_022956771.1| protein CHUP1, chloroplastic-like [Cucurbita...   481   e-154
ref|XP_007046334.2| PREDICTED: protein CHUP1, chloroplastic [The...   481   e-154
gb|EOY02162.1| Hydroxyproline-rich glycoprotein family protein i...   479   e-154
ref|XP_017218711.1| PREDICTED: protein CHUP1, chloroplastic [Dau...   480   e-153
ref|XP_019177562.1| PREDICTED: protein CHUP1, chloroplastic [Ipo...   480   e-153
gb|EOY02159.1| Hydroxyproline-rich glycoprotein family protein i...   479   e-153
ref|XP_024019384.1| protein CHUP1, chloroplastic [Morus notabilis]    479   e-153
ref|XP_021300387.1| protein CHUP1, chloroplastic [Herrania umbra...   478   e-153
ref|XP_024194997.1| protein CHUP1, chloroplastic [Rosa chinensis...   478   e-152
ref|XP_023529698.1| protein CHUP1, chloroplastic-like [Cucurbita...   478   e-152
ref|XP_023001067.1| protein CHUP1, chloroplastic-like [Cucurbita...   478   e-152
gb|KJB50773.1| hypothetical protein B456_008G187000 [Gossypium r...   473   e-152
gb|KJB50776.1| hypothetical protein B456_008G187000 [Gossypium r...   473   e-152

>ref|XP_023751596.1| protein CHUP1, chloroplastic [Lactuca sativa]
 gb|PLY94732.1| hypothetical protein LSAT_8X37441 [Lactuca sativa]
          Length = 963

 Score =  572 bits (1475), Expect = 0.0
 Identities = 320/445 (71%), Positives = 344/445 (77%), Gaps = 7/445 (1%)
 Frame = -1

Query: 2155 ERKKLQEEVNHGVVYRKELEAARNKIKELQRQFQLEANXXXXXXXXXXXXXXXXTTKEQE 1976
            ERKKLQEEV +G  Y+KEL+AARNKIKELQRQFQLEAN                 TKEQ+
Sbjct: 177  ERKKLQEEVVNGANYKKELDAARNKIKELQRQFQLEANQTKGQLLLLKQQVGILQTKEQD 236

Query: 1975 AFKKDTDIXXXXXXXXXXXXXXXXXXXKNRELQHEKRELVIKLDAAESRVATLSSTTETE 1796
            AFKKDTDI                   KNRELQHEKR+LV+KLDAAESRVA LSSTTETE
Sbjct: 237  AFKKDTDIERKLKSLKELEVEVVELKRKNRELQHEKRQLVVKLDAAESRVAVLSSTTETE 296

Query: 1795 MVARVREEVNKLQRTNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYETPAGK 1616
            MVARV+EEVNKL  TNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYETP+GK
Sbjct: 297  MVARVKEEVNKLAHTNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYETPSGK 356

Query: 1615 TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDLDSNFSQPSSPGSEDFDTAXXXX 1436
            TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDL+SNFSQPSSPGSEDFDTA    
Sbjct: 357  TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDLESNFSQPSSPGSEDFDTASIDS 416

Query: 1435 XXXXXXXXSKKPGLIQKLKKWGSKGKDDSH--XXXXXXXXXXXXXXXXXXXSQKPRGPLE 1262
                    SKKP LIQKLKKWG K KDDS+                     SQKPRGPLE
Sbjct: 417  SMSRYSSFSKKPSLIQKLKKWG-KSKDDSNYSSALTSPARSFTGGSPRVSVSQKPRGPLE 475

Query: 1261 ALMLRNAGESVAITTFGAAEQDSPTSP-APSDVSSSFQLMSKSVEGVLDEKYPAYKDRHK 1085
            ALMLRNAGESVAITTFG A+QDS  SP  P++V+SSF LMSKSVEGVLDEKYPAYKDRHK
Sbjct: 476  ALMLRNAGESVAITTFGVADQDSANSPETPNNVASSFHLMSKSVEGVLDEKYPAYKDRHK 535

Query: 1084 LALEREKKIKEKADQARAARFGD-TSSFKPPKDNRTVSLPPKLAQVKERVVLPIDA---S 917
            LALEREKKIKEKADQARAARFGD T+SFKPP  +++VSLPPKLAQVKER V+   A   S
Sbjct: 536  LALEREKKIKEKADQARAARFGDSTTSFKPPSYSKSVSLPPKLAQVKERAVISPSADVIS 595

Query: 916  GAQSSDDKAANLSAVSKMQFAHIEK 842
            G QS+D K+ ++  VSK+ FA IEK
Sbjct: 596  GDQSTDGKSTSMPPVSKIPFADIEK 620



 Score =  399 bits (1026), Expect = e-123
 Identities = 202/214 (94%), Positives = 209/214 (97%)
 Frame = -1

Query: 643  DKVHRAPELVEFYQSLMKREAKKDTSIISSTLANTADARSNMIGEIENRSTFLLAVKADV 464
            DKVHRAPELVEFYQSLMKREAKKDTSIISS+ +NTADARSNMIGEIENRSTFLLAVKADV
Sbjct: 687  DKVHRAPELVEFYQSLMKREAKKDTSIISSSASNTADARSNMIGEIENRSTFLLAVKADV 746

Query: 463  ETQGDFVESLASEVRAASFTDVEDLLTFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 284
            ETQGDFVESLASEVRAASFTD+EDL+TFVNWLDEELSFLVDERAVLKHFDWPEGKADA R
Sbjct: 747  ETQGDFVESLASEVRAASFTDIEDLVTFVNWLDEELSFLVDERAVLKHFDWPEGKADAFR 806

Query: 283  EAAFEYQDLKKLEKQVSNFVDDSSLACEPALKKMYKLLEKVENSVYALLRTRDMAMSRYK 104
            EAAFEYQDL KLEKQVSNFVDD S+ CEPALKKMYKLLEKVENSVYALLRTRDMAMSRYK
Sbjct: 807  EAAFEYQDLMKLEKQVSNFVDDPSVPCEPALKKMYKLLEKVENSVYALLRTRDMAMSRYK 866

Query: 103  EFGIPVNWLQDSGVVGKIKLASVQLARKYMKRVA 2
            EFGIP+NWLQDSGVVGKIKL+SVQLARKYMKRVA
Sbjct: 867  EFGIPINWLQDSGVVGKIKLSSVQLARKYMKRVA 900


>ref|XP_021997473.1| protein CHUP1, chloroplastic [Helianthus annuus]
 gb|OTG04703.1| hypothetical protein HannXRQ_Chr12g0365341 [Helianthus annuus]
          Length = 939

 Score =  567 bits (1462), Expect = 0.0
 Identities = 313/440 (71%), Positives = 337/440 (76%), Gaps = 2/440 (0%)
 Frame = -1

Query: 2155 ERKKLQEEVNHGVVYRKELEAARNKIKELQRQFQLEANXXXXXXXXXXXXXXXXTTKEQE 1976
            ERKKLQEEV     Y+KEL +AR+KIKELQRQFQLEAN                 TKE++
Sbjct: 161  ERKKLQEEVLQAASYKKELNSARSKIKELQRQFQLEANQTKGQLLLLKQQVGILQTKEKD 220

Query: 1975 AFKKDTDIXXXXXXXXXXXXXXXXXXXKNRELQHEKRELVIKLDAAESRVATLSSTTETE 1796
            A +KD DI                   KNRELQHEKRELVIKLDAAE+R+ATLSSTTETE
Sbjct: 221  AVRKDIDIDKKLKTLKELEVDVVELKRKNRELQHEKRELVIKLDAAEARIATLSSTTETE 280

Query: 1795 MVARVREEVNKLQRTNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYETPAGK 1616
            MVA VREEVNKL+ TNEDLLKQVEGLQ+NRFSEVEELVYLRWVNACLRFELRNYETPAGK
Sbjct: 281  MVANVREEVNKLKHTNEDLLKQVEGLQINRFSEVEELVYLRWVNACLRFELRNYETPAGK 340

Query: 1615 TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDLDSNFSQPSSPGSEDFDTAXXXX 1436
            TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDLDSNFS PSSPGSEDFDTA    
Sbjct: 341  TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDLDSNFSLPSSPGSEDFDTASIDS 400

Query: 1435 XXXXXXXXSKKPGLIQKLKKWGSKGKDD-SHXXXXXXXXXXXXXXXXXXXSQKPRGPLEA 1259
                    SKKP LIQKLKKWG KGKDD                      SQKPRGPLEA
Sbjct: 401  SMSRYSSLSKKPSLIQKLKKWG-KGKDDYQSSALSSPSRSFSGGSPRPSSSQKPRGPLEA 459

Query: 1258 LMLRNAGESVAITTFGAAEQDSPTSPA-PSDVSSSFQLMSKSVEGVLDEKYPAYKDRHKL 1082
            LMLRNAGESVAITTFG  +Q+SP SP+ P++V+SSFQLMS+SVEGVLDEKYPAYKDRHKL
Sbjct: 460  LMLRNAGESVAITTFGEGDQESPNSPSDPNNVASSFQLMSRSVEGVLDEKYPAYKDRHKL 519

Query: 1081 ALEREKKIKEKADQARAARFGDTSSFKPPKDNRTVSLPPKLAQVKERVVLPIDASGAQSS 902
            ALEREKKIKEKA+QARA RFGDTS +KPP +++TVSLPPKLAQVKERV L  DAS  QS+
Sbjct: 520  ALEREKKIKEKANQARAIRFGDTSGYKPPINSKTVSLPPKLAQVKERVTLTPDASANQST 579

Query: 901  DDKAANLSAVSKMQFAHIEK 842
            D      S VSKMQ+A IEK
Sbjct: 580  DGSELGSSTVSKMQYADIEK 599



 Score =  392 bits (1006), Expect = e-120
 Identities = 199/214 (92%), Positives = 206/214 (96%)
 Frame = -1

Query: 643  DKVHRAPELVEFYQSLMKREAKKDTSIISSTLANTADARSNMIGEIENRSTFLLAVKADV 464
            DKVHRAPELVEFYQSLMKREAKKDTS+IS+  + TADARSNMIGEIENRSTFLLAVKADV
Sbjct: 663  DKVHRAPELVEFYQSLMKREAKKDTSVISTITSTTADARSNMIGEIENRSTFLLAVKADV 722

Query: 463  ETQGDFVESLASEVRAASFTDVEDLLTFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 284
            ETQGDFVESLASEVRAASFTD+EDLL FVNWLDEELSFLVDERAVLKHFDWPEGKADALR
Sbjct: 723  ETQGDFVESLASEVRAASFTDIEDLLAFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 782

Query: 283  EAAFEYQDLKKLEKQVSNFVDDSSLACEPALKKMYKLLEKVENSVYALLRTRDMAMSRYK 104
            EAAFEYQDL KLE QV+NFVDD +L+CE ALKKMYKLLEKVENSVYALLRTRDMAMSRYK
Sbjct: 783  EAAFEYQDLMKLESQVTNFVDDPNLSCEAALKKMYKLLEKVENSVYALLRTRDMAMSRYK 842

Query: 103  EFGIPVNWLQDSGVVGKIKLASVQLARKYMKRVA 2
            EFGIPVNWLQDSGVVGKIKLASVQLARKYMKRVA
Sbjct: 843  EFGIPVNWLQDSGVVGKIKLASVQLARKYMKRVA 876


>gb|KVI05336.1| hypothetical protein Ccrd_016339 [Cynara cardunculus var. scolymus]
          Length = 1000

 Score =  548 bits (1411), Expect = e-179
 Identities = 310/453 (68%), Positives = 329/453 (72%), Gaps = 15/453 (3%)
 Frame = -1

Query: 2155 ERKKLQEEVNHGVVYRKELEAARNKIKELQRQFQLEANXXXXXXXXXXXXXXXXTTKEQE 1976
            ERKKLQEEV HG  Y+KELEAARNKIKELQRQFQLEAN                 TKEQ+
Sbjct: 202  ERKKLQEEVLHGASYKKELEAARNKIKELQRQFQLEANQTKGQLLLLKQQVGILQTKEQD 261

Query: 1975 AFKKDTDIXXXXXXXXXXXXXXXXXXXKNRELQHEKRELVIKLDAAESRVATLSSTTETE 1796
            AFKKDTDI                   KNRELQHEKR+LV+KLDAAESR+ATLS TTE  
Sbjct: 262  AFKKDTDIDKKLKTLKELEVDVVELKRKNRELQHEKRQLVVKLDAAESRIATLSCTTE-- 319

Query: 1795 MVARVREEVNKLQRTNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYETPAGK 1616
                          TN+DLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYETPAG+
Sbjct: 320  -------------HTNDDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYETPAGR 366

Query: 1615 TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDLDSNFSQPSSPGSEDFDTAXXXX 1436
            TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDL+SNFSQPSSPGSEDFDTA    
Sbjct: 367  TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDLESNFSQPSSPGSEDFDTASIDS 426

Query: 1435 XXXXXXXXSKKPGLIQKLKKWGSKGKDDSHXXXXXXXXXXXXXXXXXXXSQKPRGPLEAL 1256
                    SKKP LIQKLKKWG K K+DS                    SQKPRGPLEAL
Sbjct: 427  SMSRYSSFSKKPSLIQKLKKWG-KSKEDS-SALSSPARSFSGGSPRVSISQKPRGPLEAL 484

Query: 1255 MLRNAGESVAITTFGAAEQDSPTSP------------APSD---VSSSFQLMSKSVEGVL 1121
            MLRNAGESVAITTFG  +QDS  SP            AP+D   V+SSF LMSKSVEGVL
Sbjct: 485  MLRNAGESVAITTFGEGDQDSINSPETPNLPRINTGNAPTDLNNVASSFHLMSKSVEGVL 544

Query: 1120 DEKYPAYKDRHKLALEREKKIKEKADQARAARFGDTSSFKPPKDNRTVSLPPKLAQVKER 941
            DEKYPAYKDRHKLALEREKKIKEKADQARA RFGDTSSFKPP ++R VSLPPKLAQVKER
Sbjct: 545  DEKYPAYKDRHKLALEREKKIKEKADQARAVRFGDTSSFKPPNNSRPVSLPPKLAQVKER 604

Query: 940  VVLPIDASGAQSSDDKAANLSAVSKMQFAHIEK 842
            VV+P + SG Q +D    +  +VSKMQFAHIEK
Sbjct: 605  VVIPANTSGDQPTDGIVTSSPSVSKMQFAHIEK 637



 Score =  367 bits (942), Expect = e-110
 Identities = 196/238 (82%), Positives = 204/238 (85%), Gaps = 24/238 (10%)
 Frame = -1

Query: 643  DKVHRAPELVEFYQSLMKREAKKDTSIISSTLANTADARSNMIGEIENRSTFLLAVKADV 464
            DKVHRAPELVEFYQSLMKREAKKDTSI+S++ A+TADARSNMIGEIENRSTFLLAVKADV
Sbjct: 706  DKVHRAPELVEFYQSLMKREAKKDTSIVSASAASTADARSNMIGEIENRSTFLLAVKADV 765

Query: 463  ETQGDFVESLASEVRAASFTDVEDLLTFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 284
            ETQGDFVESLASEVRAASFTD+ DLLTFVNWLDEELSFLVDERAVLKHFDWPEGKADA R
Sbjct: 766  ETQGDFVESLASEVRAASFTDIADLLTFVNWLDEELSFLVDERAVLKHFDWPEGKADAFR 825

Query: 283  EAAFEYQDLKKLEKQVSNFVDDSSLACEPALKKMYKLLEKVENSVYALLRTRDMAMSRYK 104
            EAAFEYQDL KLEKQVSNFVDD SL+CE ALKKMYKLLE    +VYALLRTRDMAMSRYK
Sbjct: 826  EAAFEYQDLMKLEKQVSNFVDDPSLSCEAALKKMYKLLE----NVYALLRTRDMAMSRYK 881

Query: 103  EFGIPVNWLQDSGVVGK------------------------IKLASVQLARKYMKRVA 2
            EFGIPVNWLQDSGVVGK                        IKL+SVQLARKYMKRVA
Sbjct: 882  EFGIPVNWLQDSGVVGKVWSIWKIIYKRNRTQRLKGKHFLQIKLSSVQLARKYMKRVA 939


>emb|CDP00563.1| unnamed protein product [Coffea canephora]
          Length = 987

 Score =  493 bits (1269), Expect = e-158
 Identities = 275/450 (61%), Positives = 323/450 (71%), Gaps = 12/450 (2%)
 Frame = -1

Query: 2155 ERKKLQEEVNHGVVYRKELEAARNKIKELQRQFQLEANXXXXXXXXXXXXXXXXTTKEQE 1976
            +RKKLQEEV+ G   R+ELE ARNKIKELQ+Q QLEAN                 +KE E
Sbjct: 204  QRKKLQEEVSQGASTRRELEIARNKIKELQKQIQLEANQTKGQLLLLKQQVSGLQSKETE 263

Query: 1975 AFKKDTDIXXXXXXXXXXXXXXXXXXXKNRELQHEKRELVIKLDAAESRVATLSSTTETE 1796
             F+KD ++                   KN+ELQHEKREL++KLDAAE++VA+LS+ TETE
Sbjct: 264  TFRKDAEVENKLKALKELEVEVMELKRKNKELQHEKRELIVKLDAAEAKVASLSNMTETE 323

Query: 1795 MVARVREEVNKLQRTNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYETPAGK 1616
            MVA+VREEVN +++ NEDLLKQVEGLQMNRFSEVEELVYLRWVNACLR+ELRNY+TP+GK
Sbjct: 324  MVAQVREEVNNMRQKNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPSGK 383

Query: 1615 TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDLDSNFSQPSSPGSEDFDTAXXXX 1436
             SARDL+K+LSPRS+E+AK+LMLEYA SERGQ GDTDL+SNFS PSSPGSEDFD      
Sbjct: 384  ISARDLSKSLSPRSRERAKRLMLEYAESERGQ-GDTDLESNFSHPSSPGSEDFDNTSIDS 442

Query: 1435 XXXXXXXXSKKPGLIQKLKKWGSKGKDDSHXXXXXXXXXXXXXXXXXXXSQKPRGPLEAL 1256
                    SKKP LIQKLKKWG K KDDS                    S +P+GPLEAL
Sbjct: 443  SMSRYSSLSKKPSLIQKLKKWG-KNKDDSSALSSPTRSLGGKSPSRASTSIRPKGPLEAL 501

Query: 1255 MLRNAGESVAITTFGAAEQ--DSPTSPAP----------SDVSSSFQLMSKSVEGVLDEK 1112
            MLRNAG+SVAIT+FG AEQ  DSP +PAP          + V SSFQLMSKSVEGVLDEK
Sbjct: 502  MLRNAGDSVAITSFGTAEQDPDSPETPAPLQIRTQDGSLNSVVSSFQLMSKSVEGVLDEK 561

Query: 1111 YPAYKDRHKLALEREKKIKEKADQARAARFGDTSSFKPPKDNRTVSLPPKLAQVKERVVL 932
            YPAYKDRHKLALEREKKIKEKA+QAR ARFGDTSSFKP +   +++LPPKL+ +KER  +
Sbjct: 562  YPAYKDRHKLALEREKKIKEKAEQARVARFGDTSSFKPDR-TTSITLPPKLSHIKERTSI 620

Query: 931  PIDASGAQSSDDKAANLSAVSKMQFAHIEK 842
              D++  +  +D   +   VSKM+ AHIEK
Sbjct: 621  SGDSN--EQPNDSKDDSQTVSKMKLAHIEK 648



 Score =  364 bits (935), Expect = e-109
 Identities = 182/214 (85%), Positives = 203/214 (94%)
 Frame = -1

Query: 643  DKVHRAPELVEFYQSLMKREAKKDTSIISSTLANTADARSNMIGEIENRSTFLLAVKADV 464
            DKVHRAPE+VEFYQSLMKREAKKD+S + S+ ++T++ARSNMIGEIENRS+FLLAVKADV
Sbjct: 712  DKVHRAPEVVEFYQSLMKREAKKDSSPLISSTSSTSEARSNMIGEIENRSSFLLAVKADV 771

Query: 463  ETQGDFVESLASEVRAASFTDVEDLLTFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 284
            ETQGDFV+SLA+EVRAASFT++EDL+ FVNWLDEELSFLVDERAVLKHFDWPEGKADALR
Sbjct: 772  ETQGDFVQSLATEVRAASFTNIEDLVAFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 831

Query: 283  EAAFEYQDLKKLEKQVSNFVDDSSLACEPALKKMYKLLEKVENSVYALLRTRDMAMSRYK 104
            EAAFEYQDL KLEKQVS FVDD +L CE ALKKMYKLLEKVE SVYALLRTRDMA+SRYK
Sbjct: 832  EAAFEYQDLVKLEKQVSTFVDDPNLPCESALKKMYKLLEKVEQSVYALLRTRDMAISRYK 891

Query: 103  EFGIPVNWLQDSGVVGKIKLASVQLARKYMKRVA 2
            EFGIPV+WL D+G++GKIKL+SVQLARKYMKRVA
Sbjct: 892  EFGIPVDWLSDTGLIGKIKLSSVQLARKYMKRVA 925


>emb|CBI27077.3| unnamed protein product, partial [Vitis vinifera]
          Length = 969

 Score =  487 bits (1254), Expect = e-156
 Identities = 274/459 (59%), Positives = 321/459 (69%), Gaps = 21/459 (4%)
 Frame = -1

Query: 2155 ERKKLQEEVNHGVVYRKELEAARNKIKELQRQFQLEANXXXXXXXXXXXXXXXXTTKEQE 1976
            ERKKLQ+EV  GV  RKELE ARNKIKELQRQ Q+EAN                 TKEQE
Sbjct: 172  ERKKLQDEVALGVSARKELEVARNKIKELQRQIQVEANQTKGHLLLLKQQVSGLQTKEQE 231

Query: 1975 AFKKDTDIXXXXXXXXXXXXXXXXXXXKNRELQHEKRELVIKLDAAESRVATLSSTTETE 1796
            A KKD +I                   +N+ELQHEKREL++KLD AE+RVA LS+ TE+E
Sbjct: 232  AIKKDAEIEKKLKAAKELEVEVVELKRRNKELQHEKRELLVKLDGAEARVAALSNMTESE 291

Query: 1795 MVARVREEVNKLQRTNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYETPAGK 1616
            MVA+ RE+VN L+  NEDLLKQVEGLQMNRFSEVEELVYLRWVNACLR+ELRNY+TP GK
Sbjct: 292  MVAKAREDVNNLRHANEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPGGK 351

Query: 1615 TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDLDSNFSQPSSPGSEDFDTAXXXX 1436
             SARDL+K+LSPRSQE+AKQLMLEYAGSERGQ GDTDL+SNFS PSSPGSEDFD A    
Sbjct: 352  ISARDLSKSLSPRSQERAKQLMLEYAGSERGQ-GDTDLESNFSHPSSPGSEDFDNASIDS 410

Query: 1435 XXXXXXXXSKKPGLIQKLKKWGSKGKDDSHXXXXXXXXXXXXXXXXXXXSQKPRGPLEAL 1256
                    SKKP LIQKLKKWG K +DDS                    S +PRGPLEAL
Sbjct: 411  STSRYSSLSKKPSLIQKLKKWG-KSRDDSSVLSSPARSFGGGSPGRTSISLRPRGPLEAL 469

Query: 1255 MLRNAGESVAITTFGAAEQDSPTSP----------------APSDVSSSFQLMSKSVEGV 1124
            MLRNAG+ VAITTFG  +Q++P SP                + ++V++SFQLMSKSVEGV
Sbjct: 470  MLRNAGDGVAITTFGKIDQEAPESPETPNLSHIRTRVSSSDSLNNVAASFQLMSKSVEGV 529

Query: 1123 LDEKYPAYKDRHKLALEREKKIKEKADQARAARFGDTSSFK-----PPKDNRTVSLPPKL 959
            LDEKYPAYKDRHKLALEREK+IKEKA++ARA RFGD+S  K       + +++V+LPPKL
Sbjct: 530  LDEKYPAYKDRHKLALEREKQIKEKAEKARAERFGDSSDLKYESRAKAERDKSVTLPPKL 589

Query: 958  AQVKERVVLPIDASGAQSSDDKAANLSAVSKMQFAHIEK 842
            A++KE+ ++  D+S  QS D K  +    SKM+ AHIEK
Sbjct: 590  AKIKEKPLVSADSSD-QSIDSKMEDSQVASKMKLAHIEK 627



 Score =  357 bits (917), Expect = e-107
 Identities = 180/214 (84%), Positives = 197/214 (92%)
 Frame = -1

Query: 643  DKVHRAPELVEFYQSLMKREAKKDTSIISSTLANTADARSNMIGEIENRSTFLLAVKADV 464
            DKVHRAPELVEFYQ+LMKREAKKDT  + S+ +N ADARSNMIGEI N+S+FLLAVKADV
Sbjct: 693  DKVHRAPELVEFYQTLMKREAKKDTPSLVSSTSNAADARSNMIGEIANKSSFLLAVKADV 752

Query: 463  ETQGDFVESLASEVRAASFTDVEDLLTFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 284
            ETQGDFV+SLA+EVRAASFT +EDL+ FVNWLDEELSFLVDERAVLKHFDWPEGKADALR
Sbjct: 753  ETQGDFVQSLATEVRAASFTKIEDLVAFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 812

Query: 283  EAAFEYQDLKKLEKQVSNFVDDSSLACEPALKKMYKLLEKVENSVYALLRTRDMAMSRYK 104
            EAAFEYQDL KLEK+VS F DD  L+CE ALKKMY LLEKVE SVYALLRTRDMA+SRY+
Sbjct: 813  EAAFEYQDLMKLEKRVSTFEDDPKLSCEAALKKMYSLLEKVEQSVYALLRTRDMAISRYR 872

Query: 103  EFGIPVNWLQDSGVVGKIKLASVQLARKYMKRVA 2
            EFGIPV+WL DSGVVGKIKL+SVQLARKYMKRV+
Sbjct: 873  EFGIPVDWLLDSGVVGKIKLSSVQLARKYMKRVS 906


>ref|XP_002281154.2| PREDICTED: protein CHUP1, chloroplastic [Vitis vinifera]
          Length = 1003

 Score =  487 bits (1254), Expect = e-156
 Identities = 274/459 (59%), Positives = 321/459 (69%), Gaps = 21/459 (4%)
 Frame = -1

Query: 2155 ERKKLQEEVNHGVVYRKELEAARNKIKELQRQFQLEANXXXXXXXXXXXXXXXXTTKEQE 1976
            ERKKLQ+EV  GV  RKELE ARNKIKELQRQ Q+EAN                 TKEQE
Sbjct: 206  ERKKLQDEVALGVSARKELEVARNKIKELQRQIQVEANQTKGHLLLLKQQVSGLQTKEQE 265

Query: 1975 AFKKDTDIXXXXXXXXXXXXXXXXXXXKNRELQHEKRELVIKLDAAESRVATLSSTTETE 1796
            A KKD +I                   +N+ELQHEKREL++KLD AE+RVA LS+ TE+E
Sbjct: 266  AIKKDAEIEKKLKAAKELEVEVVELKRRNKELQHEKRELLVKLDGAEARVAALSNMTESE 325

Query: 1795 MVARVREEVNKLQRTNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYETPAGK 1616
            MVA+ RE+VN L+  NEDLLKQVEGLQMNRFSEVEELVYLRWVNACLR+ELRNY+TP GK
Sbjct: 326  MVAKAREDVNNLRHANEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPGGK 385

Query: 1615 TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDLDSNFSQPSSPGSEDFDTAXXXX 1436
             SARDL+K+LSPRSQE+AKQLMLEYAGSERGQ GDTDL+SNFS PSSPGSEDFD A    
Sbjct: 386  ISARDLSKSLSPRSQERAKQLMLEYAGSERGQ-GDTDLESNFSHPSSPGSEDFDNASIDS 444

Query: 1435 XXXXXXXXSKKPGLIQKLKKWGSKGKDDSHXXXXXXXXXXXXXXXXXXXSQKPRGPLEAL 1256
                    SKKP LIQKLKKWG K +DDS                    S +PRGPLEAL
Sbjct: 445  STSRYSSLSKKPSLIQKLKKWG-KSRDDSSVLSSPARSFGGGSPGRTSISLRPRGPLEAL 503

Query: 1255 MLRNAGESVAITTFGAAEQDSPTSP----------------APSDVSSSFQLMSKSVEGV 1124
            MLRNAG+ VAITTFG  +Q++P SP                + ++V++SFQLMSKSVEGV
Sbjct: 504  MLRNAGDGVAITTFGKIDQEAPESPETPNLSHIRTRVSSSDSLNNVAASFQLMSKSVEGV 563

Query: 1123 LDEKYPAYKDRHKLALEREKKIKEKADQARAARFGDTSSFK-----PPKDNRTVSLPPKL 959
            LDEKYPAYKDRHKLALEREK+IKEKA++ARA RFGD+S  K       + +++V+LPPKL
Sbjct: 564  LDEKYPAYKDRHKLALEREKQIKEKAEKARAERFGDSSDLKYESRAKAERDKSVTLPPKL 623

Query: 958  AQVKERVVLPIDASGAQSSDDKAANLSAVSKMQFAHIEK 842
            A++KE+ ++  D+S  QS D K  +    SKM+ AHIEK
Sbjct: 624  AKIKEKPLVSADSSD-QSIDSKMEDSQVASKMKLAHIEK 661



 Score =  357 bits (917), Expect = e-106
 Identities = 180/214 (84%), Positives = 197/214 (92%)
 Frame = -1

Query: 643  DKVHRAPELVEFYQSLMKREAKKDTSIISSTLANTADARSNMIGEIENRSTFLLAVKADV 464
            DKVHRAPELVEFYQ+LMKREAKKDT  + S+ +N ADARSNMIGEI N+S+FLLAVKADV
Sbjct: 727  DKVHRAPELVEFYQTLMKREAKKDTPSLVSSTSNAADARSNMIGEIANKSSFLLAVKADV 786

Query: 463  ETQGDFVESLASEVRAASFTDVEDLLTFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 284
            ETQGDFV+SLA+EVRAASFT +EDL+ FVNWLDEELSFLVDERAVLKHFDWPEGKADALR
Sbjct: 787  ETQGDFVQSLATEVRAASFTKIEDLVAFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 846

Query: 283  EAAFEYQDLKKLEKQVSNFVDDSSLACEPALKKMYKLLEKVENSVYALLRTRDMAMSRYK 104
            EAAFEYQDL KLEK+VS F DD  L+CE ALKKMY LLEKVE SVYALLRTRDMA+SRY+
Sbjct: 847  EAAFEYQDLMKLEKRVSTFEDDPKLSCEAALKKMYSLLEKVEQSVYALLRTRDMAISRYR 906

Query: 103  EFGIPVNWLQDSGVVGKIKLASVQLARKYMKRVA 2
            EFGIPV+WL DSGVVGKIKL+SVQLARKYMKRV+
Sbjct: 907  EFGIPVDWLLDSGVVGKIKLSSVQLARKYMKRVS 940


>ref|XP_004298311.1| PREDICTED: protein CHUP1, chloroplastic [Fragaria vesca subsp. vesca]
          Length = 1001

 Score =  482 bits (1240), Expect = e-154
 Identities = 277/461 (60%), Positives = 317/461 (68%), Gaps = 23/461 (4%)
 Frame = -1

Query: 2155 ERKKLQEEVNHGVVYRKELEAARNKIKELQRQFQLEANXXXXXXXXXXXXXXXXTTKEQE 1976
            ERKKLQEE+  G   +KELEAARNKIKELQRQ QLEAN                  KE+E
Sbjct: 203  ERKKLQEEIAQGATTKKELEAARNKIKELQRQIQLEANQTKGQLLLLKQQVSGLQEKEEE 262

Query: 1975 AFKKDTDIXXXXXXXXXXXXXXXXXXXKNRELQHEKRELVIKLDAAESRVATLSSTTETE 1796
            A +KD++I                   KN+ELQ EKREL IKL+AAESRVA LS+ TETE
Sbjct: 263  AVRKDSEIEKKLKAVKDLEVEVMELKRKNKELQIEKRELSIKLNAAESRVAELSNMTETE 322

Query: 1795 MVARVREEVNKLQRTNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYETPAGK 1616
            MVA VR EVN L+  NEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNY+TP GK
Sbjct: 323  MVANVRSEVNNLKHANEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPQGK 382

Query: 1615 TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDLDSNFSQPSSPGSEDFDTAXXXX 1436
             SARDLNKNLSP+SQEKAKQLMLEYAGSERGQ GDTD++SN+SQPSSPGSEDFD A    
Sbjct: 383  ISARDLNKNLSPKSQEKAKQLMLEYAGSERGQ-GDTDMESNYSQPSSPGSEDFDNASIDS 441

Query: 1435 XXXXXXXXSKKPGLIQKLKKWGSKGKDDSHXXXXXXXXXXXXXXXXXXXSQKPRGPLEAL 1256
                    +K+P LIQKLKKWG K KDDS                    S +PRGPLE+L
Sbjct: 442  STSRYSALTKRPSLIQKLKKWG-KSKDDSSALSSPARSFSGSSPGRASMSVRPRGPLESL 500

Query: 1255 MLRNAGESVAITTFGAAEQDSPTSP----------------APSDVSSSFQLMSKSVEGV 1124
            MLRNA + VAITTFG  +Q+ P SP                +P+ VSSSFQLMSKSVEGV
Sbjct: 501  MLRNASDGVAITTFGKMDQELPDSPQTPTLPSIRTQMPSSDSPNSVSSSFQLMSKSVEGV 560

Query: 1123 LDEKYPAYKDRHKLALEREKKIKEKADQARAARFGDTS----SFKP---PKDNRTVSLPP 965
            LDEKYPAYKDRHKLALERE++IKE+A+QARA +FGD S    S++P      +RTVSLPP
Sbjct: 561  LDEKYPAYKDRHKLALERERQIKERAEQARAEKFGDKSNVSFSYEPRTKGDKDRTVSLPP 620

Query: 964  KLAQVKERVVLPIDASGAQSSDDKAANLSAVSKMQFAHIEK 842
            KL  +KE+ V+  D+S  Q+   KA +   +SKM+ A IEK
Sbjct: 621  KLTLIKEKTVISGDSSN-QADGGKAFDPQEISKMKLAQIEK 660



 Score =  353 bits (907), Expect = e-105
 Identities = 180/214 (84%), Positives = 196/214 (91%)
 Frame = -1

Query: 643  DKVHRAPELVEFYQSLMKREAKKDTSIISSTLANTADARSNMIGEIENRSTFLLAVKADV 464
            DKVHRAPELVEFYQSLMKREAKKDTS + ST +N + ARSNMIGEIEN+S+FLLAVKADV
Sbjct: 725  DKVHRAPELVEFYQSLMKREAKKDTSSLISTSSNVSSARSNMIGEIENKSSFLLAVKADV 784

Query: 463  ETQGDFVESLASEVRAASFTDVEDLLTFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 284
            E QGDFV SLA+EVRAASFT++EDL+ FVNWLDEELSFLVDERAVLKHFDWPEGK DALR
Sbjct: 785  EAQGDFVMSLATEVRAASFTNIEDLVAFVNWLDEELSFLVDERAVLKHFDWPEGKVDALR 844

Query: 283  EAAFEYQDLKKLEKQVSNFVDDSSLACEPALKKMYKLLEKVENSVYALLRTRDMAMSRYK 104
            EAAFEYQDL KLE++VS FVDD  L+CE ALKKM+ LLEKVE SVYALLRTRDMA+SR K
Sbjct: 845  EAAFEYQDLIKLEQKVSTFVDDPKLSCEAALKKMFSLLEKVEQSVYALLRTRDMAISRCK 904

Query: 103  EFGIPVNWLQDSGVVGKIKLASVQLARKYMKRVA 2
            EFGIPV+WL DSGVVGKIKL+SVQLARKYMKRVA
Sbjct: 905  EFGIPVDWLLDSGVVGKIKLSSVQLARKYMKRVA 938


>ref|XP_022956771.1| protein CHUP1, chloroplastic-like [Cucurbita moschata]
          Length = 988

 Score =  481 bits (1239), Expect = e-154
 Identities = 275/460 (59%), Positives = 315/460 (68%), Gaps = 22/460 (4%)
 Frame = -1

Query: 2155 ERKKLQEEVNHGVVYRKELEAARNKIKELQRQFQLEANXXXXXXXXXXXXXXXXTTKEQE 1976
            ERKKLQEE+      +KELE ARNKIKELQRQ QL+AN                  KEQE
Sbjct: 187  ERKKLQEEIAQAATVKKELEFARNKIKELQRQIQLDANQTKGQLLLLKQQVSGLQAKEQE 246

Query: 1975 AFKKDTDIXXXXXXXXXXXXXXXXXXXKNRELQHEKRELVIKLDAAESRVATLSSTTETE 1796
              KKD +I                   KN+ELQ EKREL IKLDAAE+R++TLS+ TE+E
Sbjct: 247  TIKKDAEIEKKLKAVKELEVEVMELKRKNKELQIEKRELTIKLDAAENRISTLSNMTESE 306

Query: 1795 MVARVREEVNKLQRTNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYETPAGK 1616
            MV++ REEVN L+ TNEDL+KQVEGLQMNRFSEVEELVYLRWVNACLR+ELRNY+ P GK
Sbjct: 307  MVSQTREEVNNLRHTNEDLIKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQAPTGK 366

Query: 1615 TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDLDSNFSQPSSPGSEDFDTAXXXX 1436
             SARDLNKNLSP+SQEKAKQLMLEYAGSERGQ GDTDL+SNFSQPSSPGSEDFD A    
Sbjct: 367  VSARDLNKNLSPKSQEKAKQLMLEYAGSERGQ-GDTDLESNFSQPSSPGSEDFDNASIDS 425

Query: 1435 XXXXXXXXSKKPGLIQKLKKWGSKGKDDSHXXXXXXXXXXXXXXXXXXXSQKPRGPLEAL 1256
                    SKKP LIQKLKKWG + KDDS                    SQKPRGPLEAL
Sbjct: 426  SFSRYSSLSKKPSLIQKLKKWGGRSKDDSSVVSSPARSFSGGSPSRMSMSQKPRGPLEAL 485

Query: 1255 MLRNAGESVAITTFGAAEQDSPTSP-----------APSD----VSSSFQLMSKSVEGVL 1121
            MLRN  +SVAIT+FG  EQ+ P SP            P+D    V+SSFQLMSKSV GVL
Sbjct: 486  MLRNTSDSVAITSFGTMEQEVPDSPGTPNLPSIRTQTPNDSLNSVASSFQLMSKSVGGVL 545

Query: 1120 DEKYPAYKDRHKLALEREKKIKEKADQARAARFGDTSS------FKPPKD-NRTVSLPPK 962
            DEKYPAYKDRHKLAL REK+IKE+ADQARA RFG+ S+      FK   + +R V LPPK
Sbjct: 546  DEKYPAYKDRHKLALAREKQIKERADQARAERFGNISNSNLNPEFKGKTERDRPVVLPPK 605

Query: 961  LAQVKERVVLPIDASGAQSSDDKAANLSAVSKMQFAHIEK 842
            L+Q+KE+ V+  DA+   S ++K    SA+S+M+ A IEK
Sbjct: 606  LSQIKEKPVVSSDAADV-SGENKKIESSAISRMKLAEIEK 644



 Score =  358 bits (920), Expect = e-107
 Identities = 178/214 (83%), Positives = 200/214 (93%)
 Frame = -1

Query: 643  DKVHRAPELVEFYQSLMKREAKKDTSIISSTLANTADARSNMIGEIENRSTFLLAVKADV 464
            DKVHRAPELVEFYQSLMKREAKKDT ++SST +N +DARSNMIGEIENRS+FL+AVKADV
Sbjct: 711  DKVHRAPELVEFYQSLMKREAKKDTPLLSSTSSNVSDARSNMIGEIENRSSFLIAVKADV 770

Query: 463  ETQGDFVESLASEVRAASFTDVEDLLTFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 284
            ETQGDFV SLA+EVRAA+F+++ED++ FVNWLDEELSFLVDERAVLKHFDWPEGKADALR
Sbjct: 771  ETQGDFVISLAAEVRAATFSNIEDVVAFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 830

Query: 283  EAAFEYQDLKKLEKQVSNFVDDSSLACEPALKKMYKLLEKVENSVYALLRTRDMAMSRYK 104
            EA+FEYQDL KLEK+V+ FVD+  L CE ALKKMY LLEKVE SVYALLRTRDMA+SRY+
Sbjct: 831  EASFEYQDLMKLEKRVTTFVDEPKLPCEAALKKMYSLLEKVEQSVYALLRTRDMAISRYR 890

Query: 103  EFGIPVNWLQDSGVVGKIKLASVQLARKYMKRVA 2
            EFGIPV+WL D+GVVGKIKL+SVQLARKYMKRVA
Sbjct: 891  EFGIPVDWLSDTGVVGKIKLSSVQLARKYMKRVA 924


>ref|XP_007046334.2| PREDICTED: protein CHUP1, chloroplastic [Theobroma cacao]
          Length = 996

 Score =  481 bits (1239), Expect = e-154
 Identities = 269/455 (59%), Positives = 314/455 (69%), Gaps = 17/455 (3%)
 Frame = -1

Query: 2155 ERKKLQEEVNHGVVYRKELEAARNKIKELQRQFQLEANXXXXXXXXXXXXXXXXTTKEQE 1976
            ERKKLQE++ HG   +KELE ARNKIKELQRQ QL+AN                  KEQE
Sbjct: 201  ERKKLQEDIAHGASVKKELEVARNKIKELQRQIQLDANQTKAQLLFLKQQVSGLQAKEQE 260

Query: 1975 AFKKDTDIXXXXXXXXXXXXXXXXXXXKNRELQHEKRELVIKLDAAESRVATLSSTTETE 1796
            A K D+++                   KN+ELQHEKREL +KLDAAE+++A LS+ TETE
Sbjct: 261  AIKNDSEVEKKLKAVKELEMEVMELRRKNKELQHEKRELTVKLDAAEAKIAALSNMTETE 320

Query: 1795 MVARVREEVNKLQRTNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYETPAGK 1616
            +  R REEV+ L+  NEDLLKQVEGLQMNRFSEVEELVYLRWVNACLR+ELRNY+TP GK
Sbjct: 321  IDVRAREEVSNLRHANEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPEGK 380

Query: 1615 TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDLDSNFSQPSSPGSEDFDTAXXXX 1436
             SARDLNK+LSP+SQE AKQL+LEYAGSERGQ GDTD++SNFS PSS GSED D A    
Sbjct: 381  ISARDLNKSLSPKSQETAKQLLLEYAGSERGQ-GDTDIESNFSHPSSTGSEDLDNASIYS 439

Query: 1435 XXXXXXXXSKKPGLIQKLKKWGSKGKDDSHXXXXXXXXXXXXXXXXXXXSQKPRGPLEAL 1256
                    SKKP LIQKLKKWG + KDDS                    SQ P+GPLEAL
Sbjct: 440  SNSRYSSLSKKPSLIQKLKKWG-RSKDDSSAVSSPARSLSGGSPSRISMSQHPQGPLEAL 498

Query: 1255 MLRNAGESVAITTFGAAEQ---DSPTSP-------------APSDVSSSFQLMSKSVEGV 1124
            MLRNAG+ VAITTFG  EQ   DSP +P             +P+ V++SF LMS+SV+G 
Sbjct: 499  MLRNAGDGVAITTFGKNEQEFTDSPETPTIPNIRTQVSSGDSPNSVATSFHLMSRSVDGS 558

Query: 1123 LDEKYPAYKDRHKLALEREKKIKEKADQARAARFGDTSSFKPPKD-NRTVSLPPKLAQVK 947
            L+EKYPAYKDRHKLALEREK+IK+KA QARA RFGD S+F    +  + V LPPKLAQ+K
Sbjct: 559  LEEKYPAYKDRHKLALEREKQIKQKAQQARAERFGDKSNFSSKAEREKPVILPPKLAQIK 618

Query: 946  ERVVLPIDASGAQSSDDKAANLSAVSKMQFAHIEK 842
            ER V P D+SG QS+DDKA +   +SKM+ AHIEK
Sbjct: 619  ERTVFPGDSSG-QSNDDKAVDSQTISKMKLAHIEK 652



 Score =  370 bits (950), Expect = e-111
 Identities = 186/214 (86%), Positives = 201/214 (93%)
 Frame = -1

Query: 643  DKVHRAPELVEFYQSLMKREAKKDTSIISSTLANTADARSNMIGEIENRSTFLLAVKADV 464
            DKVHRAPELVEFYQ+LMKREAKKDTS + S  +N +DARSNMIGEIENRS+FLLAVKADV
Sbjct: 720  DKVHRAPELVEFYQTLMKREAKKDTSSLISPTSNPSDARSNMIGEIENRSSFLLAVKADV 779

Query: 463  ETQGDFVESLASEVRAASFTDVEDLLTFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 284
            ETQGDFV+SLA+E+RAASFT +EDL+ FVNWLDEELSFLVDERAVLKHFDWPEGKADALR
Sbjct: 780  ETQGDFVQSLATEIRAASFTSIEDLVAFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 839

Query: 283  EAAFEYQDLKKLEKQVSNFVDDSSLACEPALKKMYKLLEKVENSVYALLRTRDMAMSRYK 104
            EAAFEYQDL KLEKQ+S+FVDD SL CE ALKKMYKLLEKVE S+YALLRTRDMA+SRYK
Sbjct: 840  EAAFEYQDLVKLEKQISSFVDDPSLPCEVALKKMYKLLEKVEQSIYALLRTRDMAISRYK 899

Query: 103  EFGIPVNWLQDSGVVGKIKLASVQLARKYMKRVA 2
            EFGIPVNWL DSGVVGKIKL+SVQLARKYMKRVA
Sbjct: 900  EFGIPVNWLLDSGVVGKIKLSSVQLARKYMKRVA 933


>gb|EOY02162.1| Hydroxyproline-rich glycoprotein family protein isoform 4 [Theobroma
            cacao]
          Length = 933

 Score =  479 bits (1234), Expect = e-154
 Identities = 269/455 (59%), Positives = 312/455 (68%), Gaps = 17/455 (3%)
 Frame = -1

Query: 2155 ERKKLQEEVNHGVVYRKELEAARNKIKELQRQFQLEANXXXXXXXXXXXXXXXXTTKEQE 1976
            ERKKLQE++ HG   +KELE ARNKIKELQRQ QL+AN                  KEQE
Sbjct: 201  ERKKLQEDIAHGASVKKELEVARNKIKELQRQIQLDANQTKAQLLFLKQQVSGLQAKEQE 260

Query: 1975 AFKKDTDIXXXXXXXXXXXXXXXXXXXKNRELQHEKRELVIKLDAAESRVATLSSTTETE 1796
            A K D ++                   KN+ELQHEKREL +KLDAAE+++A LS+ TETE
Sbjct: 261  AIKNDAEVEKKLKAVKELEMEVMELRRKNKELQHEKRELTVKLDAAEAKIAALSNMTETE 320

Query: 1795 MVARVREEVNKLQRTNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYETPAGK 1616
            +  R REEV+ L+  NEDLLKQVEGLQMNRFSEVEELVYLRWVNACLR+ELRNY+TP GK
Sbjct: 321  IDVRAREEVSNLRHANEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPEGK 380

Query: 1615 TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDLDSNFSQPSSPGSEDFDTAXXXX 1436
             SARDLNK+LSP+SQE AKQL+LEYAGSERGQ GDTD++SNFS PSS GSED D A    
Sbjct: 381  ISARDLNKSLSPKSQETAKQLLLEYAGSERGQ-GDTDIESNFSHPSSTGSEDLDNASIYS 439

Query: 1435 XXXXXXXXSKKPGLIQKLKKWGSKGKDDSHXXXXXXXXXXXXXXXXXXXSQKPRGPLEAL 1256
                    SKKP LIQKLKKWG + KDDS                    SQ  RGPLEAL
Sbjct: 440  SNSRYSSLSKKPSLIQKLKKWG-RSKDDSSAVSSPARSLSGGSPSRISMSQHSRGPLEAL 498

Query: 1255 MLRNAGESVAITTFGAAEQ---DSPTSP-------------APSDVSSSFQLMSKSVEGV 1124
            MLRNAG+ VAITTFG  EQ   DSP +P             +P+ V++SF LMS+SV+G 
Sbjct: 499  MLRNAGDGVAITTFGKNEQEFTDSPETPTIPNIRTQVSSGDSPNSVATSFHLMSRSVDGS 558

Query: 1123 LDEKYPAYKDRHKLALEREKKIKEKADQARAARFGDTSSFKPPKD-NRTVSLPPKLAQVK 947
            L+EKYPAYKDRHKLALEREK+IK+KA QARA RFGD S+F    +  + V LPPKLAQ+K
Sbjct: 559  LEEKYPAYKDRHKLALEREKQIKQKAQQARAERFGDKSNFSSKAEREKPVILPPKLAQIK 618

Query: 946  ERVVLPIDASGAQSSDDKAANLSAVSKMQFAHIEK 842
            ER V P D+SG QS+DDKA +   +SKM+ AHIEK
Sbjct: 619  ERTVFPGDSSG-QSNDDKAVDSQTISKMKLAHIEK 652



 Score =  285 bits (729), Expect = 2e-80
 Identities = 142/160 (88%), Positives = 152/160 (95%)
 Frame = -1

Query: 481  AVKADVETQGDFVESLASEVRAASFTDVEDLLTFVNWLDEELSFLVDERAVLKHFDWPEG 302
            +VKADVETQGDFV+SLA+E+RAASFT +EDL+ FVNWLDEELSFLVDERAVLKHFDWPEG
Sbjct: 711  SVKADVETQGDFVQSLATEIRAASFTSIEDLVAFVNWLDEELSFLVDERAVLKHFDWPEG 770

Query: 301  KADALREAAFEYQDLKKLEKQVSNFVDDSSLACEPALKKMYKLLEKVENSVYALLRTRDM 122
            KADALREAAFEYQDL KLEKQ+S+FVDD SL CE ALKKMYKLLEKVE SVYALLRTRDM
Sbjct: 771  KADALREAAFEYQDLVKLEKQISSFVDDPSLPCEAALKKMYKLLEKVEQSVYALLRTRDM 830

Query: 121  AMSRYKEFGIPVNWLQDSGVVGKIKLASVQLARKYMKRVA 2
            A+SRYKEFGIPVNWL DSGVVGKIKL+SVQLARKYMKRVA
Sbjct: 831  AISRYKEFGIPVNWLLDSGVVGKIKLSSVQLARKYMKRVA 870


>ref|XP_017218711.1| PREDICTED: protein CHUP1, chloroplastic [Daucus carota subsp.
            sativus]
 ref|XP_017218712.1| PREDICTED: protein CHUP1, chloroplastic [Daucus carota subsp.
            sativus]
 gb|KZM86557.1| hypothetical protein DCAR_023691 [Daucus carota subsp. sativus]
          Length = 982

 Score =  480 bits (1236), Expect = e-153
 Identities = 270/453 (59%), Positives = 314/453 (69%), Gaps = 15/453 (3%)
 Frame = -1

Query: 2155 ERKKLQEEVNHGVVYRKELEAARNKIKELQRQFQLEANXXXXXXXXXXXXXXXXTTKEQE 1976
            ERK+LQEEV+ G   +K+LE AR KIKELQRQ Q+EA                   KE+E
Sbjct: 192  ERKRLQEEVSLGASAKKDLEVARKKIKELQRQMQMEATQTKGQLLLLKQQVIGLQVKEEE 251

Query: 1975 AFKKDTDIXXXXXXXXXXXXXXXXXXXKNRELQHEKRELVIKLDAAESRVATLSSTTETE 1796
            AFKKDT++                   KNRELQHEKREL +KLD AE+++ +LS+ TE+E
Sbjct: 252  AFKKDTEVEKMLKSLKTLEMEVAELKRKNRELQHEKRELAVKLDVAEAKITSLSNMTESE 311

Query: 1795 MVARVREEVNKLQRTNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYETPAGK 1616
            +VA VREEVN L+ TNEDL KQVEGLQMNRFSEVEELVYLRWVNACLRFEL+NY+TPAGK
Sbjct: 312  LVASVREEVNNLKHTNEDLSKQVEGLQMNRFSEVEELVYLRWVNACLRFELKNYQTPAGK 371

Query: 1615 TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDLDSNFSQPSSPGSEDFDTAXXXX 1436
             SARDLNKNLSPRSQE+AKQLMLEYAGSERGQ GDTDL+SN+S PSSPGS+DFD      
Sbjct: 372  MSARDLNKNLSPRSQERAKQLMLEYAGSERGQ-GDTDLESNYSHPSSPGSDDFDNTSIDS 430

Query: 1435 XXXXXXXXSKKPGLIQKLKKWGSKGKDDSHXXXXXXXXXXXXXXXXXXXSQKPRGPLEAL 1256
                    SKKP +IQKLKKWG K KDDS                    S +PRGPLE+L
Sbjct: 431  STSRFSSVSKKPSIIQKLKKWG-KVKDDSSALSSPARSFAGGSPSRSITSNRPRGPLESL 489

Query: 1255 MLRNAGESVAITTFGAAEQDSPTSP---------------APSDVSSSFQLMSKSVEGVL 1121
            MLRNA +SVAITTFG  EQD  ++P               + ++V+SSF LMS+SV+G +
Sbjct: 490  MLRNASDSVAITTFGMQEQDDSSAPQTPRLPPIRTQASADSLNNVASSFGLMSRSVDGAI 549

Query: 1120 DEKYPAYKDRHKLALEREKKIKEKADQARAARFGDTSSFKPPKDNRTVSLPPKLAQVKER 941
            D KYP YKDRHKLALEREK IKEKADQARA +FGD S+FKP K   + SLPPKLAQVKE+
Sbjct: 550  DGKYPVYKDRHKLALEREKHIKEKADQARAVKFGDPSTFKPLK---SASLPPKLAQVKEK 606

Query: 940  VVLPIDASGAQSSDDKAANLSAVSKMQFAHIEK 842
            VV   D+S  QS D K  +  AVS+M+FA IEK
Sbjct: 607  VVFTGDSSD-QSGDGKMVDSQAVSRMKFADIEK 638



 Score =  369 bits (946), Expect = e-111
 Identities = 183/214 (85%), Positives = 204/214 (95%)
 Frame = -1

Query: 643  DKVHRAPELVEFYQSLMKREAKKDTSIISSTLANTADARSNMIGEIENRSTFLLAVKADV 464
            +KVHRAPE+VEFYQSLMKREAKKDT+ + ++ +NTA+ARSNMIGEIENRSTFLLAVKADV
Sbjct: 706  EKVHRAPEVVEFYQSLMKREAKKDTTSLITSTSNTANARSNMIGEIENRSTFLLAVKADV 765

Query: 463  ETQGDFVESLASEVRAASFTDVEDLLTFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 284
            ETQGDFV+SLA+EVRAA+FTD+EDL+ FVNWLDEELSFLVDERAVLKHFDWPEGKADA R
Sbjct: 766  ETQGDFVQSLAAEVRAATFTDIEDLVVFVNWLDEELSFLVDERAVLKHFDWPEGKADAFR 825

Query: 283  EAAFEYQDLKKLEKQVSNFVDDSSLACEPALKKMYKLLEKVENSVYALLRTRDMAMSRYK 104
            EA+FEYQDL KLEKQV++FVDD ++ CE ALKKMYKLLEK+E SVYALLRTRDMA+SRYK
Sbjct: 826  EASFEYQDLMKLEKQVTSFVDDPNVPCEAALKKMYKLLEKLEQSVYALLRTRDMAVSRYK 885

Query: 103  EFGIPVNWLQDSGVVGKIKLASVQLARKYMKRVA 2
            EFGIPVNWLQDSGVVGKIKL+SVQLARKYMKRVA
Sbjct: 886  EFGIPVNWLQDSGVVGKIKLSSVQLARKYMKRVA 919


>ref|XP_019177562.1| PREDICTED: protein CHUP1, chloroplastic [Ipomoea nil]
          Length = 980

 Score =  480 bits (1235), Expect = e-153
 Identities = 269/455 (59%), Positives = 318/455 (69%), Gaps = 17/455 (3%)
 Frame = -1

Query: 2155 ERKKLQEEVNHGVVYRKELEAARNKIKELQRQFQLEANXXXXXXXXXXXXXXXXTTKEQE 1976
            E+K+LQEEV+  V  RKELE ARNKIKE+QRQ QLEA+                  KEQE
Sbjct: 190  EKKRLQEEVSKTVDARKELEVARNKIKEMQRQVQLEASQTKGQLLLLKQQVSGLHAKEQE 249

Query: 1975 AFKKDTDIXXXXXXXXXXXXXXXXXXXKNRELQHEKRELVIKLDAAESRVATLSSTTETE 1796
            AFK+D ++                   KN+ELQ EKRELV+KLDAA+++V +LS+ TE+E
Sbjct: 250  AFKRDAEVEKKLKLLKELEVEVMELKRKNKELQIEKRELVMKLDAAQAKVTSLSNMTESE 309

Query: 1795 MVARVREEVNKLQRTNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYETPAGK 1616
            +VA VREEV  L+ TNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNY+ P GK
Sbjct: 310  VVANVREEVTALRHTNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYQAPTGK 369

Query: 1615 TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDLDSNFSQPSSPGSEDFDTAXXXX 1436
             +ARDL+K+LSPRSQEKAKQLMLEYAGSERGQ GDTDL+SNFS PSSPGSEDFD      
Sbjct: 370  VTARDLSKSLSPRSQEKAKQLMLEYAGSERGQ-GDTDLESNFSHPSSPGSEDFDNTSIDS 428

Query: 1435 XXXXXXXXSKKPGLIQKLKKWGSKGKDDSHXXXXXXXXXXXXXXXXXXXSQKPRGPLEAL 1256
                    SKKPGL+QKLKKWG + KDDS                      +PRGPLEAL
Sbjct: 429  STSRYSSLSKKPGLLQKLKKWGGRSKDDS-VFTSPSRSFGGSPSRMSMSHSRPRGPLEAL 487

Query: 1255 MLRNAGESVAITTFGAAEQD---SPTSPAPSD--------------VSSSFQLMSKSVEG 1127
            MLRNAGESVAIT+FG AEQ+   SP +P  S               V+SSF LMSKSVEG
Sbjct: 488  MLRNAGESVAITSFGTAEQEFLNSPETPKLSQASRVHDMSPDSLNTVASSFHLMSKSVEG 547

Query: 1126 VLDEKYPAYKDRHKLALEREKKIKEKADQARAARFGDTSSFKPPKDNRTVSLPPKLAQVK 947
            V++EKYPAYKDRHKLA+EREK++KEKA++ARAA+FGDTSSFK    +R+++LPPKL Q+K
Sbjct: 548  VMEEKYPAYKDRHKLAVEREKQLKEKAERARAAKFGDTSSFKV---DRSITLPPKLTQIK 604

Query: 946  ERVVLPIDASGAQSSDDKAANLSAVSKMQFAHIEK 842
            E+ V  +     + S D  A+  ++SKMQ AHIEK
Sbjct: 605  EKSV--VSGESTEQSSDPKADSQSISKMQLAHIEK 637



 Score =  358 bits (918), Expect = e-107
 Identities = 183/216 (84%), Positives = 204/216 (94%), Gaps = 2/216 (0%)
 Frame = -1

Query: 643  DKVHRAPELVEFYQSLMKREAKKDTS--IISSTLANTADARSNMIGEIENRSTFLLAVKA 470
            DKVHRAPELVEFYQ+LMKRE+KKD+S  +ISST +NT+DARSNMIGEI N+S+FLLAVKA
Sbjct: 704  DKVHRAPELVEFYQTLMKRESKKDSSSPLISST-SNTSDARSNMIGEIANKSSFLLAVKA 762

Query: 469  DVETQGDFVESLASEVRAASFTDVEDLLTFVNWLDEELSFLVDERAVLKHFDWPEGKADA 290
            DVETQGDFV+SLASE+RAASF++++DL+ FVNWLDEELSFLVDERAVLKHFDWPEGKADA
Sbjct: 763  DVETQGDFVQSLASEIRAASFSNIDDLVAFVNWLDEELSFLVDERAVLKHFDWPEGKADA 822

Query: 289  LREAAFEYQDLKKLEKQVSNFVDDSSLACEPALKKMYKLLEKVENSVYALLRTRDMAMSR 110
            LREAAFEYQDL KLEKQVS FVDD +L+CE ALKKMYKLLEKVE SVYALLRTRDMA+SR
Sbjct: 823  LREAAFEYQDLMKLEKQVSLFVDDPNLSCEAALKKMYKLLEKVEQSVYALLRTRDMAISR 882

Query: 109  YKEFGIPVNWLQDSGVVGKIKLASVQLARKYMKRVA 2
            Y+EFGIP +WL DSGVVGKIKL+SVQLARKYMKRVA
Sbjct: 883  YREFGIPTDWLLDSGVVGKIKLSSVQLARKYMKRVA 918


>gb|EOY02159.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao]
 gb|EOY02160.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao]
 gb|EOY02161.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao]
 gb|EOY02163.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao]
 gb|EOY02164.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao]
 gb|EOY02165.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao]
 gb|EOY02166.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao]
          Length = 996

 Score =  479 bits (1234), Expect = e-153
 Identities = 269/455 (59%), Positives = 312/455 (68%), Gaps = 17/455 (3%)
 Frame = -1

Query: 2155 ERKKLQEEVNHGVVYRKELEAARNKIKELQRQFQLEANXXXXXXXXXXXXXXXXTTKEQE 1976
            ERKKLQE++ HG   +KELE ARNKIKELQRQ QL+AN                  KEQE
Sbjct: 201  ERKKLQEDIAHGASVKKELEVARNKIKELQRQIQLDANQTKAQLLFLKQQVSGLQAKEQE 260

Query: 1975 AFKKDTDIXXXXXXXXXXXXXXXXXXXKNRELQHEKRELVIKLDAAESRVATLSSTTETE 1796
            A K D ++                   KN+ELQHEKREL +KLDAAE+++A LS+ TETE
Sbjct: 261  AIKNDAEVEKKLKAVKELEMEVMELRRKNKELQHEKRELTVKLDAAEAKIAALSNMTETE 320

Query: 1795 MVARVREEVNKLQRTNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYETPAGK 1616
            +  R REEV+ L+  NEDLLKQVEGLQMNRFSEVEELVYLRWVNACLR+ELRNY+TP GK
Sbjct: 321  IDVRAREEVSNLRHANEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPEGK 380

Query: 1615 TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDLDSNFSQPSSPGSEDFDTAXXXX 1436
             SARDLNK+LSP+SQE AKQL+LEYAGSERGQ GDTD++SNFS PSS GSED D A    
Sbjct: 381  ISARDLNKSLSPKSQETAKQLLLEYAGSERGQ-GDTDIESNFSHPSSTGSEDLDNASIYS 439

Query: 1435 XXXXXXXXSKKPGLIQKLKKWGSKGKDDSHXXXXXXXXXXXXXXXXXXXSQKPRGPLEAL 1256
                    SKKP LIQKLKKWG + KDDS                    SQ  RGPLEAL
Sbjct: 440  SNSRYSSLSKKPSLIQKLKKWG-RSKDDSSAVSSPARSLSGGSPSRISMSQHSRGPLEAL 498

Query: 1255 MLRNAGESVAITTFGAAEQ---DSPTSP-------------APSDVSSSFQLMSKSVEGV 1124
            MLRNAG+ VAITTFG  EQ   DSP +P             +P+ V++SF LMS+SV+G 
Sbjct: 499  MLRNAGDGVAITTFGKNEQEFTDSPETPTIPNIRTQVSSGDSPNSVATSFHLMSRSVDGS 558

Query: 1123 LDEKYPAYKDRHKLALEREKKIKEKADQARAARFGDTSSFKPPKD-NRTVSLPPKLAQVK 947
            L+EKYPAYKDRHKLALEREK+IK+KA QARA RFGD S+F    +  + V LPPKLAQ+K
Sbjct: 559  LEEKYPAYKDRHKLALEREKQIKQKAQQARAERFGDKSNFSSKAEREKPVILPPKLAQIK 618

Query: 946  ERVVLPIDASGAQSSDDKAANLSAVSKMQFAHIEK 842
            ER V P D+SG QS+DDKA +   +SKM+ AHIEK
Sbjct: 619  ERTVFPGDSSG-QSNDDKAVDSQTISKMKLAHIEK 652



 Score =  371 bits (952), Expect = e-112
 Identities = 187/214 (87%), Positives = 201/214 (93%)
 Frame = -1

Query: 643  DKVHRAPELVEFYQSLMKREAKKDTSIISSTLANTADARSNMIGEIENRSTFLLAVKADV 464
            DKVHRAPELVEFYQ+LMKREAKKDTS + S  +N +DARSNMIGEIENRS+FLLAVKADV
Sbjct: 720  DKVHRAPELVEFYQTLMKREAKKDTSSLISPTSNPSDARSNMIGEIENRSSFLLAVKADV 779

Query: 463  ETQGDFVESLASEVRAASFTDVEDLLTFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 284
            ETQGDFV+SLA+E+RAASFT +EDL+ FVNWLDEELSFLVDERAVLKHFDWPEGKADALR
Sbjct: 780  ETQGDFVQSLATEIRAASFTSIEDLVAFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 839

Query: 283  EAAFEYQDLKKLEKQVSNFVDDSSLACEPALKKMYKLLEKVENSVYALLRTRDMAMSRYK 104
            EAAFEYQDL KLEKQ+S+FVDD SL CE ALKKMYKLLEKVE SVYALLRTRDMA+SRYK
Sbjct: 840  EAAFEYQDLVKLEKQISSFVDDPSLPCEAALKKMYKLLEKVEQSVYALLRTRDMAISRYK 899

Query: 103  EFGIPVNWLQDSGVVGKIKLASVQLARKYMKRVA 2
            EFGIPVNWL DSGVVGKIKL+SVQLARKYMKRVA
Sbjct: 900  EFGIPVNWLLDSGVVGKIKLSSVQLARKYMKRVA 933


>ref|XP_024019384.1| protein CHUP1, chloroplastic [Morus notabilis]
          Length = 997

 Score =  479 bits (1234), Expect = e-153
 Identities = 271/455 (59%), Positives = 311/455 (68%), Gaps = 17/455 (3%)
 Frame = -1

Query: 2155 ERKKLQEEVNHGVVYRKELEAARNKIKELQRQFQLEANXXXXXXXXXXXXXXXXTTKEQE 1976
            ERKKLQ+E+  G   RKELEAARNKIKELQRQ QL+AN                  KE+E
Sbjct: 201  ERKKLQDEIAQGASARKELEAARNKIKELQRQIQLDANQTKGQLLLLKQQVSGLQAKEEE 260

Query: 1975 AFKKDTDIXXXXXXXXXXXXXXXXXXXKNRELQHEKRELVIKLDAAESRVATLSSTTETE 1796
            A KKD ++                   KN+ELQHEKREL++KLDAA++RV  LSS TE+E
Sbjct: 261  AVKKDAELEKKLKAVKELEVEVVELKRKNKELQHEKRELIVKLDAAQARVTALSSMTESE 320

Query: 1795 MVARVREEVNKLQRTNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYETPAGK 1616
             VA  REEVN L+  NEDLLKQVEGLQMNRFSEVEELVYLRWVNACLR+ELRNY+ P GK
Sbjct: 321  KVANAREEVNNLRHANEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQAPPGK 380

Query: 1615 TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDLDSNFSQPSSPGSEDFDTAXXXX 1436
             SARDLNK+LSPRSQEKAKQLMLEYAGSERGQ GDTD++SNFS PSSPGSEDFD A    
Sbjct: 381  MSARDLNKSLSPRSQEKAKQLMLEYAGSERGQ-GDTDIESNFSHPSSPGSEDFDNASIDS 439

Query: 1435 XXXXXXXXSKKPGLIQKLKKWGSKGKDDSHXXXXXXXXXXXXXXXXXXXSQKPRGPLEAL 1256
                     KK  LIQKLKKWG + KDDS                    S +P+GPLE L
Sbjct: 440  FTSRVSSLGKKTSLIQKLKKWG-RSKDDSSALLSPSRSLSGGSPSRMSMSVRPKGPLEVL 498

Query: 1255 MLRNAGESVAITTFGAAEQDSPTSP-----------APSD----VSSSFQLMSKSVEGVL 1121
            MLRN G+SVAITT+G  EQD P SP           A SD    V+SSFQLMSKSVEGVL
Sbjct: 499  MLRNVGDSVAITTYGTMEQDLPASPETPTLPNMKRQASSDSLNSVASSFQLMSKSVEGVL 558

Query: 1120 DEKYPAYKDRHKLALEREKKIKEKADQARAARFGDTSSFKPPKDNR--TVSLPPKLAQVK 947
            DEKYPAYKDRHKLALEREK+IKEKAD+ARA +F D+S+    K  R   V LPPKL+Q+K
Sbjct: 559  DEKYPAYKDRHKLALEREKQIKEKADRARAKKFSDSSNLSSTKGERANAVVLPPKLSQIK 618

Query: 946  ERVVLPIDASGAQSSDDKAANLSAVSKMQFAHIEK 842
            E+ V+  D +  QS+D K+ +  ++SKM+ A IEK
Sbjct: 619  EKPVVSADTND-QSNDGKSVDSQSISKMKLAEIEK 652



 Score =  355 bits (912), Expect = e-106
 Identities = 181/215 (84%), Positives = 201/215 (93%), Gaps = 1/215 (0%)
 Frame = -1

Query: 643  DKVHRAPELVEFYQSLMKREAKKDTSIISSTLANTA-DARSNMIGEIENRSTFLLAVKAD 467
            DKVHRAPELVEFYQ+LMKREAKKDTS + S+++N A +ARSNMIGEI N+S+FLLAVKAD
Sbjct: 715  DKVHRAPELVEFYQTLMKREAKKDTSSLLSSVSNNASEARSNMIGEIANKSSFLLAVKAD 774

Query: 466  VETQGDFVESLASEVRAASFTDVEDLLTFVNWLDEELSFLVDERAVLKHFDWPEGKADAL 287
            VETQGDFV SLA+EVRAASFT++EDL+ FVNWLDEELSFLVDERAVLKHFDWPEGKADAL
Sbjct: 775  VETQGDFVMSLATEVRAASFTNIEDLVAFVNWLDEELSFLVDERAVLKHFDWPEGKADAL 834

Query: 286  REAAFEYQDLKKLEKQVSNFVDDSSLACEPALKKMYKLLEKVENSVYALLRTRDMAMSRY 107
            REAAFEYQDL KLEK+V++FVDD  L+CE ALKKMY LLEKVE SVYALLRTRDMA+SRY
Sbjct: 835  REAAFEYQDLVKLEKRVTSFVDDPKLSCEAALKKMYSLLEKVEQSVYALLRTRDMAISRY 894

Query: 106  KEFGIPVNWLQDSGVVGKIKLASVQLARKYMKRVA 2
            +EFGIPV+WL DSGVVGKIKL+SVQLARKYMKRVA
Sbjct: 895  REFGIPVDWLLDSGVVGKIKLSSVQLARKYMKRVA 929


>ref|XP_021300387.1| protein CHUP1, chloroplastic [Herrania umbratica]
 ref|XP_021300388.1| protein CHUP1, chloroplastic [Herrania umbratica]
          Length = 998

 Score =  478 bits (1231), Expect = e-153
 Identities = 267/455 (58%), Positives = 312/455 (68%), Gaps = 17/455 (3%)
 Frame = -1

Query: 2155 ERKKLQEEVNHGVVYRKELEAARNKIKELQRQFQLEANXXXXXXXXXXXXXXXXTTKEQE 1976
            ERKKLQE++ HG   +KELE A+NKIKELQRQ Q+EAN                  KEQE
Sbjct: 201  ERKKLQEDIAHGASVKKELEVAKNKIKELQRQIQVEANQTKAQLLFLKQQVSGLQAKEQE 260

Query: 1975 AFKKDTDIXXXXXXXXXXXXXXXXXXXKNRELQHEKRELVIKLDAAESRVATLSSTTETE 1796
            A K D ++                   KN+ELQHEKREL +KLDAAE+++A LS  TETE
Sbjct: 261  AIKNDAEVEKKLKAVQELEMEVMELRRKNKELQHEKRELTVKLDAAEAKIAALSGMTETE 320

Query: 1795 MVARVREEVNKLQRTNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYETPAGK 1616
            + AR REEV+ L+  NEDLLKQVEGLQMNRFSEVEELVYLRWVNACLR+ELRNY+TP GK
Sbjct: 321  IDARAREEVSNLRHANEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPRGK 380

Query: 1615 TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDLDSNFSQPSSPGSEDFDTAXXXX 1436
             SARDLNK+LSP+SQE+AKQL+LEYAGS RGQ GDTDL+SNFS PSSPGSED D      
Sbjct: 381  ISARDLNKSLSPKSQERAKQLLLEYAGSGRGQ-GDTDLESNFSHPSSPGSEDLDNTSIYS 439

Query: 1435 XXXXXXXXSKKPGLIQKLKKWGSKGKDDSHXXXXXXXXXXXXXXXXXXXSQKPRGPLEAL 1256
                    SKKP LIQKLKKWG + KDDS                    SQ PRGPLEAL
Sbjct: 440  SNSRYSSLSKKPSLIQKLKKWG-RSKDDSSAVSSPARSLSGGSPSRISMSQHPRGPLEAL 498

Query: 1255 MLRNAGESVAITTFGAAEQ---DSPTSP-------------APSDVSSSFQLMSKSVEGV 1124
            MLRNAG+ VAITTFG  EQ   DSP +P             +P+ V++SF LMS+S++G+
Sbjct: 499  MLRNAGDGVAITTFGKNEQEFTDSPETPTLPNIRTQASSGDSPNSVATSFHLMSRSMDGI 558

Query: 1123 LDEKYPAYKDRHKLALEREKKIKEKADQARAARFGDTSSFKPPKD-NRTVSLPPKLAQVK 947
            L+EKYPAYKDRHKLALEREK+IK+KA QARA RFGD S+F    +  + V LPPKLAQ+K
Sbjct: 559  LEEKYPAYKDRHKLALEREKQIKQKAQQARAERFGDKSNFSSKAEREKPVILPPKLAQIK 618

Query: 946  ERVVLPIDASGAQSSDDKAANLSAVSKMQFAHIEK 842
            ER V   D+S  QS+DDKA +   +SKM+ AHIEK
Sbjct: 619  ERTVFSGDSS-EQSNDDKAVDSQTISKMKLAHIEK 652



 Score =  366 bits (940), Expect = e-110
 Identities = 185/214 (86%), Positives = 200/214 (93%)
 Frame = -1

Query: 643  DKVHRAPELVEFYQSLMKREAKKDTSIISSTLANTADARSNMIGEIENRSTFLLAVKADV 464
            DKVHRAPE+VEFYQ+LMKREAKKD S + S  +N +DARSNMIGEIENRS+FLLAVKADV
Sbjct: 722  DKVHRAPEVVEFYQTLMKREAKKDMSSLISPTSNPSDARSNMIGEIENRSSFLLAVKADV 781

Query: 463  ETQGDFVESLASEVRAASFTDVEDLLTFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 284
            ETQGDFV SLA+E+RAASFT++EDL+ FVNWLDEELSFLVDERAV+KHFDWPEGKADALR
Sbjct: 782  ETQGDFVHSLATEIRAASFTNIEDLVAFVNWLDEELSFLVDERAVVKHFDWPEGKADALR 841

Query: 283  EAAFEYQDLKKLEKQVSNFVDDSSLACEPALKKMYKLLEKVENSVYALLRTRDMAMSRYK 104
            EAAFEYQDL KLEKQVS+FVDD SL CE ALKKMYKLLEKVE SVYALLRTRDMA+SRYK
Sbjct: 842  EAAFEYQDLVKLEKQVSSFVDDPSLPCEAALKKMYKLLEKVEQSVYALLRTRDMAISRYK 901

Query: 103  EFGIPVNWLQDSGVVGKIKLASVQLARKYMKRVA 2
            EFGIPVNWL DSGVVGKIKL+SVQLARKYMKRVA
Sbjct: 902  EFGIPVNWLLDSGVVGKIKLSSVQLARKYMKRVA 935


>ref|XP_024194997.1| protein CHUP1, chloroplastic [Rosa chinensis]
 gb|PRQ40435.1| hypothetical protein RchiOBHm_Chr4g0435971 [Rosa chinensis]
          Length = 1001

 Score =  478 bits (1231), Expect = e-152
 Identities = 277/461 (60%), Positives = 314/461 (68%), Gaps = 23/461 (4%)
 Frame = -1

Query: 2155 ERKKLQEEVNHGVVYRKELEAARNKIKELQRQFQLEANXXXXXXXXXXXXXXXXTTKEQE 1976
            ERKKLQEEV  G   +KELEAARNKIKELQRQ QL+AN                  KE+E
Sbjct: 203  ERKKLQEEVAQGATTKKELEAARNKIKELQRQIQLDANQTKGQLLLLKQQVSGLQEKEEE 262

Query: 1975 AFKKDTDIXXXXXXXXXXXXXXXXXXXKNRELQHEKRELVIKLDAAESRVATLSSTTETE 1796
            A KKD++I                   KN+ELQ EKREL IKL+ AESRV TLS+ TETE
Sbjct: 263  AVKKDSEIEKKLKAVKDLEVEVMELKRKNKELQIEKRELSIKLNTAESRVVTLSNMTETE 322

Query: 1795 MVARVREEVNKLQRTNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYETPAGK 1616
            MVA VR EVN L+  NEDLLKQVEGLQMNRFSEVEELVYLRW+NACLRFELRNY+TP GK
Sbjct: 323  MVANVRGEVNNLKHANEDLLKQVEGLQMNRFSEVEELVYLRWLNACLRFELRNYQTPQGK 382

Query: 1615 TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDLDSNFSQPSSPGSEDFDTAXXXX 1436
             SARDLNKNLSP+SQEKAKQLMLEYAGSERGQ GDTD++SN+SQPSSPGSEDFD A    
Sbjct: 383  ISARDLNKNLSPKSQEKAKQLMLEYAGSERGQ-GDTDMESNYSQPSSPGSEDFDNASIDS 441

Query: 1435 XXXXXXXXSKKPGLIQKLKKWGSKGKDDSHXXXXXXXXXXXXXXXXXXXSQKPRGPLEAL 1256
                    SK+P LIQKLKKWG K KDDS                    S +PRGPLEAL
Sbjct: 442  STSRYSSLSKRPSLIQKLKKWG-KSKDDSSSLSSPVRSLSGSSPGRASMSVRPRGPLEAL 500

Query: 1255 MLRNAGESVAITTFGAAEQDSPTSP----------------APSDVSSSFQLMSKSVEGV 1124
            MLRNA + VAITTFG  +Q+ P SP                +P+ VSSSF +MSKSVEGV
Sbjct: 501  MLRNASDGVAITTFGKMDQELPDSPQTPTLPNIRTQMPSSNSPNSVSSSFHVMSKSVEGV 560

Query: 1123 LDEKYPAYKDRHKLALEREKKIKEKADQARAARFGDTS----SFKP---PKDNRTVSLPP 965
            LDEKYPAYKDRHKLALEREK+IKE+ADQARA +FGD S    S++P      +R VSLPP
Sbjct: 561  LDEKYPAYKDRHKLALEREKQIKERADQARAEKFGDKSNVSFSYEPRTKADKDRIVSLPP 620

Query: 964  KLAQVKERVVLPIDASGAQSSDDKAANLSAVSKMQFAHIEK 842
            KL  +KE+ V+  D+S  Q+   KA +   +SKM+ A IEK
Sbjct: 621  KLTLIKEKAVISGDSSN-QADGGKAFDPQEISKMKLAQIEK 660



 Score =  355 bits (910), Expect = e-105
 Identities = 180/214 (84%), Positives = 196/214 (91%)
 Frame = -1

Query: 643  DKVHRAPELVEFYQSLMKREAKKDTSIISSTLANTADARSNMIGEIENRSTFLLAVKADV 464
            DKVHRAPELVEFYQSLMKREAKKDTS + ST +N + ARSNMIGEIEN+S+FLLAVKADV
Sbjct: 725  DKVHRAPELVEFYQSLMKREAKKDTSSLISTSSNVSSARSNMIGEIENKSSFLLAVKADV 784

Query: 463  ETQGDFVESLASEVRAASFTDVEDLLTFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 284
            E QGDFV SLA+EVRAASFT+++DL+ FVNWLDEELSFLVDERAVLKHFDWPEGK DALR
Sbjct: 785  EAQGDFVMSLATEVRAASFTNIDDLVAFVNWLDEELSFLVDERAVLKHFDWPEGKVDALR 844

Query: 283  EAAFEYQDLKKLEKQVSNFVDDSSLACEPALKKMYKLLEKVENSVYALLRTRDMAMSRYK 104
            EAAFEYQDL KLE++VS FVDD  L+CE ALKKMY LLEKVE SVYALLRTRDMA+SR K
Sbjct: 845  EAAFEYQDLMKLEQKVSTFVDDPKLSCEAALKKMYSLLEKVEQSVYALLRTRDMAISRCK 904

Query: 103  EFGIPVNWLQDSGVVGKIKLASVQLARKYMKRVA 2
            EFGIPV+WL DSGVVGKIKL+SVQLARKYMKRVA
Sbjct: 905  EFGIPVDWLLDSGVVGKIKLSSVQLARKYMKRVA 938


>ref|XP_023529698.1| protein CHUP1, chloroplastic-like [Cucurbita pepo subsp. pepo]
 ref|XP_023529708.1| protein CHUP1, chloroplastic-like [Cucurbita pepo subsp. pepo]
 ref|XP_023529716.1| protein CHUP1, chloroplastic-like [Cucurbita pepo subsp. pepo]
          Length = 988

 Score =  478 bits (1230), Expect = e-152
 Identities = 273/460 (59%), Positives = 315/460 (68%), Gaps = 22/460 (4%)
 Frame = -1

Query: 2155 ERKKLQEEVNHGVVYRKELEAARNKIKELQRQFQLEANXXXXXXXXXXXXXXXXTTKEQE 1976
            ERKKLQEE+      +KELE ARNKIKELQRQ QL+AN                  KEQE
Sbjct: 187  ERKKLQEEIAQAATVKKELEFARNKIKELQRQIQLDANQTKGQLLLLKQQVSGLQAKEQE 246

Query: 1975 AFKKDTDIXXXXXXXXXXXXXXXXXXXKNRELQHEKRELVIKLDAAESRVATLSSTTETE 1796
              KKD++I                   KN+ELQ EKREL IKL AAE+R++TLS+ TE+E
Sbjct: 247  TIKKDSEIEKKLKAVKELEVEVMELKRKNKELQIEKRELTIKLGAAENRISTLSNMTESE 306

Query: 1795 MVARVREEVNKLQRTNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYETPAGK 1616
            +V++ REEVN L+ TNEDL+KQVEGLQMNRFSEVEELVYLRWVNACLR+ELRNY+ P GK
Sbjct: 307  LVSQTREEVNNLRHTNEDLIKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQAPTGK 366

Query: 1615 TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDLDSNFSQPSSPGSEDFDTAXXXX 1436
             SARDLNKNLSP+SQEKAKQLMLEYAGSERGQ GDTDL+SNFSQPSSPGSEDFD A    
Sbjct: 367  VSARDLNKNLSPKSQEKAKQLMLEYAGSERGQ-GDTDLESNFSQPSSPGSEDFDNASIDS 425

Query: 1435 XXXXXXXXSKKPGLIQKLKKWGSKGKDDSHXXXXXXXXXXXXXXXXXXXSQKPRGPLEAL 1256
                    SKKP LIQKLKKWG + KDDS                    SQKPRGPLEAL
Sbjct: 426  SFSRYSSLSKKPSLIQKLKKWGGRSKDDSSVVSSPARSFSGGSPSRMSMSQKPRGPLEAL 485

Query: 1255 MLRNAGESVAITTFGAAEQDSPTSP-----------APSD----VSSSFQLMSKSVEGVL 1121
            MLRN  +SVAIT+FG  EQ+ P SP            P+D    V+SSFQLMSKSV GVL
Sbjct: 486  MLRNTSDSVAITSFGTMEQEIPDSPGTPNLPSIRTQTPNDSLNSVASSFQLMSKSVGGVL 545

Query: 1120 DEKYPAYKDRHKLALEREKKIKEKADQARAARFGDTSS------FKPPKD-NRTVSLPPK 962
            DEKYPAYKDRHKLAL REK+IKE+ADQARA RFG+ S+      FK   + +R V LPPK
Sbjct: 546  DEKYPAYKDRHKLALAREKQIKERADQARAERFGNISNSNLNPEFKGKTERDRPVVLPPK 605

Query: 961  LAQVKERVVLPIDASGAQSSDDKAANLSAVSKMQFAHIEK 842
            L+Q+KE+ V+  DA+   S ++K    SA+S+M+ A IEK
Sbjct: 606  LSQIKEKPVVSSDAADV-SGENKKIESSAISRMKLAEIEK 644



 Score =  358 bits (920), Expect = e-107
 Identities = 178/214 (83%), Positives = 200/214 (93%)
 Frame = -1

Query: 643  DKVHRAPELVEFYQSLMKREAKKDTSIISSTLANTADARSNMIGEIENRSTFLLAVKADV 464
            DKVHRAPELVEFYQSLMKREAKKDT ++SST +N +DARSNMIGEIENRS+FL+AVKADV
Sbjct: 711  DKVHRAPELVEFYQSLMKREAKKDTPLLSSTSSNVSDARSNMIGEIENRSSFLIAVKADV 770

Query: 463  ETQGDFVESLASEVRAASFTDVEDLLTFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 284
            ETQGDFV SLA+EVRAA+F+++ED++ FVNWLDEELSFLVDERAVLKHFDWPEGKADALR
Sbjct: 771  ETQGDFVISLAAEVRAATFSNIEDVVAFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 830

Query: 283  EAAFEYQDLKKLEKQVSNFVDDSSLACEPALKKMYKLLEKVENSVYALLRTRDMAMSRYK 104
            EA+FEYQDL KLEK+V+ FVD+  L CE ALKKMY LLEKVE SVYALLRTRDMA+SRY+
Sbjct: 831  EASFEYQDLMKLEKRVTTFVDEPKLPCEAALKKMYSLLEKVEQSVYALLRTRDMAISRYR 890

Query: 103  EFGIPVNWLQDSGVVGKIKLASVQLARKYMKRVA 2
            EFGIPV+WL D+GVVGKIKL+SVQLARKYMKRVA
Sbjct: 891  EFGIPVDWLSDTGVVGKIKLSSVQLARKYMKRVA 924


>ref|XP_023001067.1| protein CHUP1, chloroplastic-like [Cucurbita maxima]
 ref|XP_023001820.1| protein CHUP1, chloroplastic-like [Cucurbita maxima]
 ref|XP_023002609.1| protein CHUP1, chloroplastic-like [Cucurbita maxima]
          Length = 987

 Score =  478 bits (1229), Expect = e-152
 Identities = 272/460 (59%), Positives = 314/460 (68%), Gaps = 22/460 (4%)
 Frame = -1

Query: 2155 ERKKLQEEVNHGVVYRKELEAARNKIKELQRQFQLEANXXXXXXXXXXXXXXXXTTKEQE 1976
            ERKKLQEE+      +KELE ARNKIKELQRQ QL+AN                  KEQE
Sbjct: 186  ERKKLQEEIAQAATVKKELEFARNKIKELQRQIQLDANQTKGQLLLLKQQVSGLQAKEQE 245

Query: 1975 AFKKDTDIXXXXXXXXXXXXXXXXXXXKNRELQHEKRELVIKLDAAESRVATLSSTTETE 1796
              KKD +I                   KN+ELQ EKREL IKLDAAE+R++TLS+ TE+E
Sbjct: 246  TIKKDAEIEKKLKAVKELEVEVMELKRKNKELQIEKRELTIKLDAAENRISTLSNMTESE 305

Query: 1795 MVARVREEVNKLQRTNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYETPAGK 1616
            +V++ RE+VN L+ TNEDL+KQVEGLQMNRFSEVEELVYLRWVNACLR+ELRNY+ P GK
Sbjct: 306  LVSQTREDVNNLRHTNEDLIKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQAPTGK 365

Query: 1615 TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDLDSNFSQPSSPGSEDFDTAXXXX 1436
             SARDLNKNLSP+SQEKAKQLMLEYAGSERGQ GDTDL+SNFSQPSSPGSEDFD A    
Sbjct: 366  VSARDLNKNLSPKSQEKAKQLMLEYAGSERGQ-GDTDLESNFSQPSSPGSEDFDNASIDS 424

Query: 1435 XXXXXXXXSKKPGLIQKLKKWGSKGKDDSHXXXXXXXXXXXXXXXXXXXSQKPRGPLEAL 1256
                    SKKP LIQKLKKWG + KDDS                    SQKPRGPLEAL
Sbjct: 425  SFSRYSSLSKKPSLIQKLKKWGGRSKDDSSVVSSPARSFSGGSPSRMSMSQKPRGPLEAL 484

Query: 1255 MLRNAGESVAITTFGAAEQDSPTSP-----------APSD----VSSSFQLMSKSVEGVL 1121
            MLRN  +SVAIT+FG  EQ+ P SP            P+D    V+SSFQLMSKSV GVL
Sbjct: 485  MLRNTSDSVAITSFGTMEQEVPDSPGTPNLPSIRTQTPNDSLNSVASSFQLMSKSVGGVL 544

Query: 1120 DEKYPAYKDRHKLALEREKKIKEKADQARAARFGDTSS------FKPPKD-NRTVSLPPK 962
            DEKYPAYKDRHKLAL REK+IKE+ADQARA RFG+ S+      FK   + +R V LPPK
Sbjct: 545  DEKYPAYKDRHKLALAREKQIKERADQARAERFGNISNSNLNPEFKGKTERDRPVVLPPK 604

Query: 961  LAQVKERVVLPIDASGAQSSDDKAANLSAVSKMQFAHIEK 842
            L+Q+KE+ V+  DA+   S ++K    S +S+M+ A IEK
Sbjct: 605  LSQIKEKPVVSSDAADV-SGENKKIESSTISRMKLAEIEK 643



 Score =  358 bits (920), Expect = e-107
 Identities = 178/214 (83%), Positives = 200/214 (93%)
 Frame = -1

Query: 643  DKVHRAPELVEFYQSLMKREAKKDTSIISSTLANTADARSNMIGEIENRSTFLLAVKADV 464
            DKVHRAPELVEFYQSLMKREAKKDT ++SST +N +DARSNMIGEIENRS+FL+AVKADV
Sbjct: 710  DKVHRAPELVEFYQSLMKREAKKDTPLLSSTSSNVSDARSNMIGEIENRSSFLIAVKADV 769

Query: 463  ETQGDFVESLASEVRAASFTDVEDLLTFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 284
            ETQGDFV SLA+EVRAA+F+++ED++ FVNWLDEELSFLVDERAVLKHFDWPEGKADALR
Sbjct: 770  ETQGDFVISLAAEVRAATFSNIEDVVAFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 829

Query: 283  EAAFEYQDLKKLEKQVSNFVDDSSLACEPALKKMYKLLEKVENSVYALLRTRDMAMSRYK 104
            EA+FEYQDL KLEK+V+ FVD+  L CE ALKKMY LLEKVE SVYALLRTRDMA+SRY+
Sbjct: 830  EASFEYQDLMKLEKRVTTFVDEPKLPCEAALKKMYSLLEKVEQSVYALLRTRDMAISRYR 889

Query: 103  EFGIPVNWLQDSGVVGKIKLASVQLARKYMKRVA 2
            EFGIPV+WL D+GVVGKIKL+SVQLARKYMKRVA
Sbjct: 890  EFGIPVDWLSDTGVVGKIKLSSVQLARKYMKRVA 923


>gb|KJB50773.1| hypothetical protein B456_008G187000 [Gossypium raimondii]
          Length = 852

 Score =  473 bits (1218), Expect = e-152
 Identities = 265/454 (58%), Positives = 310/454 (68%), Gaps = 16/454 (3%)
 Frame = -1

Query: 2155 ERKKLQEEVNHGVVYRKELEAARNKIKELQRQFQLEANXXXXXXXXXXXXXXXXTTKEQE 1976
            ERKKLQEE+ HG   +KELE ARNKIKELQRQ QL+AN                  KEQE
Sbjct: 69   ERKKLQEEIAHGASIKKELEVARNKIKELQRQIQLDANQTKAQLLFLKQQVSGLQAKEQE 128

Query: 1975 AFKKDTDIXXXXXXXXXXXXXXXXXXXKNRELQHEKRELVIKLDAAESRVATLSSTTETE 1796
            A K D +I                   KN+ELQHEKREL +KLDAAE+++ +LS+ TE E
Sbjct: 129  AIKSDAEIEKKLKALKDLEIEVVELRRKNKELQHEKRELTVKLDAAEAKIVSLSNMTENE 188

Query: 1795 MVARVREEVNKLQRTNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYETPAGK 1616
            + A  REEVN L+  NEDLLKQVEGLQMNRFSEVEELVYLRWVNACLR+ELRNY+TP GK
Sbjct: 189  IAATAREEVNNLKHANEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPGGK 248

Query: 1615 TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDLDSNFSQPSSPGSEDFDTAXXXX 1436
             SARDLNK+LSP+SQEKAK+L+LEYAGSERGQ GDTDL+SN+S PSSPGSEDFD A    
Sbjct: 249  ISARDLNKSLSPKSQEKAKRLLLEYAGSERGQ-GDTDLESNYSHPSSPGSEDFDNASIDS 307

Query: 1435 XXXXXXXXSKKPGLIQKLKKWGSKGKDDSHXXXXXXXXXXXXXXXXXXXSQKPRGPLEAL 1256
                    SKKPGLIQKLKKWG K KDDS                    S + RGPLE+L
Sbjct: 308  SMSRYSSLSKKPGLIQKLKKWG-KSKDDSSALSSPARSFSGGSPSRTSMSLRQRGPLESL 366

Query: 1255 MLRNAGESVAITTFGAAEQDSPTSPAPS----------------DVSSSFQLMSKSVEGV 1124
            MLRNAG+ VAITTFG  EQ+   SP  S                +V++SFQLMSKSVEG 
Sbjct: 367  MLRNAGDGVAITTFGKMEQELTGSPETSTLPNIRTQPSSGDSLNNVAASFQLMSKSVEGT 426

Query: 1123 LDEKYPAYKDRHKLALEREKKIKEKADQARAARFGDTSSFKPPKDNRTVSLPPKLAQVKE 944
            L+EKYPA+KDRHKLA+EREK+IK+KA+QARA RFG+ +  + P     V+LPPKLAQ+KE
Sbjct: 427  LEEKYPAFKDRHKLAMEREKQIKKKAEQARAERFGEKTEREKP-----VNLPPKLAQIKE 481

Query: 943  RVVLPIDASGAQSSDDKAANLSAVSKMQFAHIEK 842
            + V+    S  QS+DDKA +   +SKM+ AHIEK
Sbjct: 482  KTVVS-GNSNEQSNDDKAVDSQTISKMKLAHIEK 514



 Score =  373 bits (958), Expect = e-114
 Identities = 188/214 (87%), Positives = 203/214 (94%)
 Frame = -1

Query: 643  DKVHRAPELVEFYQSLMKREAKKDTSIISSTLANTADARSNMIGEIENRSTFLLAVKADV 464
            DKVHRAPELVEFYQ+LMKREAKKDTS + ST +NT+DARSNMIGEIENRSTFLLAVKADV
Sbjct: 576  DKVHRAPELVEFYQTLMKREAKKDTSSLLSTTSNTSDARSNMIGEIENRSTFLLAVKADV 635

Query: 463  ETQGDFVESLASEVRAASFTDVEDLLTFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 284
            ETQGDFV+SLA+E+RAASFT+VEDL+ FVNWLDEELSFLVDERAVLKHFDWPEGKADALR
Sbjct: 636  ETQGDFVQSLAAEIRAASFTNVEDLVAFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 695

Query: 283  EAAFEYQDLKKLEKQVSNFVDDSSLACEPALKKMYKLLEKVENSVYALLRTRDMAMSRYK 104
            EAAFEYQDL KLEK VS+FVDD +L CE ALKKMYKLLEKVE SVYALLRTRDMA+SRY+
Sbjct: 696  EAAFEYQDLMKLEKLVSSFVDDPNLPCEAALKKMYKLLEKVEQSVYALLRTRDMAISRYR 755

Query: 103  EFGIPVNWLQDSGVVGKIKLASVQLARKYMKRVA 2
            EFGIPVNWL DSG+VGKIKL+SVQLARKYMKRVA
Sbjct: 756  EFGIPVNWLLDSGIVGKIKLSSVQLARKYMKRVA 789


>gb|KJB50776.1| hypothetical protein B456_008G187000 [Gossypium raimondii]
          Length = 859

 Score =  473 bits (1218), Expect = e-152
 Identities = 265/454 (58%), Positives = 310/454 (68%), Gaps = 16/454 (3%)
 Frame = -1

Query: 2155 ERKKLQEEVNHGVVYRKELEAARNKIKELQRQFQLEANXXXXXXXXXXXXXXXXTTKEQE 1976
            ERKKLQEE+ HG   +KELE ARNKIKELQRQ QL+AN                  KEQE
Sbjct: 193  ERKKLQEEIAHGASIKKELEVARNKIKELQRQIQLDANQTKAQLLFLKQQVSGLQAKEQE 252

Query: 1975 AFKKDTDIXXXXXXXXXXXXXXXXXXXKNRELQHEKRELVIKLDAAESRVATLSSTTETE 1796
            A K D +I                   KN+ELQHEKREL +KLDAAE+++ +LS+ TE E
Sbjct: 253  AIKSDAEIEKKLKALKDLEIEVVELRRKNKELQHEKRELTVKLDAAEAKIVSLSNMTENE 312

Query: 1795 MVARVREEVNKLQRTNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRFELRNYETPAGK 1616
            + A  REEVN L+  NEDLLKQVEGLQMNRFSEVEELVYLRWVNACLR+ELRNY+TP GK
Sbjct: 313  IAATAREEVNNLKHANEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPGGK 372

Query: 1615 TSARDLNKNLSPRSQEKAKQLMLEYAGSERGQGGDTDLDSNFSQPSSPGSEDFDTAXXXX 1436
             SARDLNK+LSP+SQEKAK+L+LEYAGSERGQ GDTDL+SN+S PSSPGSEDFD A    
Sbjct: 373  ISARDLNKSLSPKSQEKAKRLLLEYAGSERGQ-GDTDLESNYSHPSSPGSEDFDNASIDS 431

Query: 1435 XXXXXXXXSKKPGLIQKLKKWGSKGKDDSHXXXXXXXXXXXXXXXXXXXSQKPRGPLEAL 1256
                    SKKPGLIQKLKKWG K KDDS                    S + RGPLE+L
Sbjct: 432  SMSRYSSLSKKPGLIQKLKKWG-KSKDDSSALSSPARSFSGGSPSRTSMSLRQRGPLESL 490

Query: 1255 MLRNAGESVAITTFGAAEQDSPTSPAPS----------------DVSSSFQLMSKSVEGV 1124
            MLRNAG+ VAITTFG  EQ+   SP  S                +V++SFQLMSKSVEG 
Sbjct: 491  MLRNAGDGVAITTFGKMEQELTGSPETSTLPNIRTQPSSGDSLNNVAASFQLMSKSVEGT 550

Query: 1123 LDEKYPAYKDRHKLALEREKKIKEKADQARAARFGDTSSFKPPKDNRTVSLPPKLAQVKE 944
            L+EKYPA+KDRHKLA+EREK+IK+KA+QARA RFG+ +  + P     V+LPPKLAQ+KE
Sbjct: 551  LEEKYPAFKDRHKLAMEREKQIKKKAEQARAERFGEKTEREKP-----VNLPPKLAQIKE 605

Query: 943  RVVLPIDASGAQSSDDKAANLSAVSKMQFAHIEK 842
            + V+    S  QS+DDKA +   +SKM+ AHIEK
Sbjct: 606  KTVVS-GNSNEQSNDDKAVDSQTISKMKLAHIEK 638



 Score =  279 bits (713), Expect = 1e-78
 Identities = 140/160 (87%), Positives = 151/160 (94%)
 Frame = -1

Query: 643  DKVHRAPELVEFYQSLMKREAKKDTSIISSTLANTADARSNMIGEIENRSTFLLAVKADV 464
            DKVHRAPELVEFYQ+LMKREAKKDTS + ST +NT+DARSNMIGEIENRSTFLLAVKADV
Sbjct: 700  DKVHRAPELVEFYQTLMKREAKKDTSSLLSTTSNTSDARSNMIGEIENRSTFLLAVKADV 759

Query: 463  ETQGDFVESLASEVRAASFTDVEDLLTFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 284
            ETQGDFV+SLA+E+RAASFT+VEDL+ FVNWLDEELSFLVDERAVLKHFDWPEGKADALR
Sbjct: 760  ETQGDFVQSLAAEIRAASFTNVEDLVAFVNWLDEELSFLVDERAVLKHFDWPEGKADALR 819

Query: 283  EAAFEYQDLKKLEKQVSNFVDDSSLACEPALKKMYKLLEK 164
            EAAFEYQDL KLEK VS+FVDD +L CE ALKKMYKLLEK
Sbjct: 820  EAAFEYQDLMKLEKLVSSFVDDPNLPCEAALKKMYKLLEK 859

