BLASTX nr result

ID: Acanthopanax21_contig00003969 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Acanthopanax21_contig00003969
         (1232 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_017247383.1| PREDICTED: uncharacterized protein LOC108218...   540   0.0  
ref|XP_017247377.1| PREDICTED: uncharacterized protein LOC108218...   540   0.0  
gb|POF00173.1| general transcription factor 3c polypeptide 2 [Qu...   515   e-179
ref|XP_023920540.1| uncharacterized protein LOC112032069 isoform...   525   e-179
ref|XP_023920539.1| uncharacterized protein LOC112032069 isoform...   525   e-176
gb|OVA03597.1| WD40 repeat [Macleaya cordata]                         507   e-167
ref|XP_021728695.1| uncharacterized protein LOC110695770 [Chenop...   504   e-167
ref|XP_021859677.1| uncharacterized protein LOC110798793 isoform...   493   e-164
ref|XP_021764765.1| uncharacterized protein LOC110729340 [Chenop...   497   e-164
ref|XP_010242589.1| PREDICTED: uncharacterized protein LOC104586...   491   e-163
ref|XP_019051477.1| PREDICTED: uncharacterized protein LOC104586...   491   e-163
ref|XP_021859676.1| uncharacterized protein LOC110798793 isoform...   493   e-163
gb|OMO95816.1| hypothetical protein COLO4_15658 [Corchorus olito...   492   e-162
dbj|GAV66087.1| hypothetical protein CFOL_v3_09597 [Cephalotus f...   487   e-162
gb|EOX93901.1| DNA binding protein, putative isoform 1 [Theobrom...   487   e-162
ref|XP_017969461.1| PREDICTED: uncharacterized protein LOC186127...   485   e-161
ref|XP_017969459.1| PREDICTED: uncharacterized protein LOC186127...   485   e-161
ref|XP_017969458.1| PREDICTED: uncharacterized protein LOC186127...   485   e-161
ref|XP_017969462.1| PREDICTED: uncharacterized protein LOC186127...   485   e-161
emb|CDP15391.1| unnamed protein product [Coffea canephora]            486   e-161

>ref|XP_017247383.1| PREDICTED: uncharacterized protein LOC108218785 isoform X2 [Daucus
            carota subsp. sativus]
          Length = 764

 Score =  540 bits (1390), Expect = 0.0
 Identities = 265/377 (70%), Positives = 301/377 (79%)
 Frame = -2

Query: 1231 RQSMPLTLEWSASPPHDLILAGCHDGVVALWKFSVNVSPKDTRPLLCFSADTVPIRALAW 1052
            RQS+PLT+EWSASPPHDLILAGCHDGVVALWKFS NVS KDTRPLLCFSADTVPIRALAW
Sbjct: 406  RQSVPLTVEWSASPPHDLILAGCHDGVVALWKFSANVSCKDTRPLLCFSADTVPIRALAW 465

Query: 1051 APIQSDPESANVFVTGGHKGLKFWDMRDPFRPLWDLNPVQRIICSLDWLPDPRCIIVSYD 872
            AP QSDP SANV VT GH  LKFWD+RDPFRPLWDLNP+Q++ICSLDW+PDPR II+SY+
Sbjct: 466  APSQSDPGSANVIVTAGHGCLKFWDIRDPFRPLWDLNPIQKVICSLDWVPDPRGIIISYE 525

Query: 871  DGTIRILSLSKAAYDVPVTGRPFVGTQQQVLHSYYCSPYTIWSIQVSRLTGMVAYCTADG 692
            DGTIRILSLS+AA ++PVTG+PFVGT Q+ LH Y CS YTIWSIQVSR+TGMVAYC+ADG
Sbjct: 526  DGTIRILSLSEAANNIPVTGKPFVGTPQEGLHRYCCSSYTIWSIQVSRITGMVAYCSADG 585

Query: 691  TVLCFQLTTRAVEKDPLRNRAPHFLCSSLADEESTLTMYTPLPDIPFPMKKSLNEWANAP 512
            TVL FQLTT+A+ +DPLR+RAPHFLC SL +E+STL MYTPLPD+P   K          
Sbjct: 586  TVLRFQLTTKAMGRDPLRHRAPHFLCGSLTEEDSTLIMYTPLPDVPLLFK---------- 635

Query: 511  RTIRGFLSVSNQEKRVKEEMAKGQTSNQPPLALCYGDDPGIDSGSEDKTVEEXXXXXXXX 332
                   ++SNQEK+VK EM  GQ SNQ  LAL +GDDPGI+SGSED  + E        
Sbjct: 636  -------NLSNQEKKVKIEMNGGQPSNQQALALSHGDDPGIESGSED-IMAEKSKKSSKS 687

Query: 331  XXXXXKMPKAEQASVCRADDPEPVQREDHEKVEVRDEIEVLPPKIIAMHRVRWNMNKGSE 152
                 K+P A QA VCR D+PEPVQ  +    E  D+ E+LPPKIIAMHRVRWN+NKGSE
Sbjct: 688  KTKSTKVPNASQALVCRDDNPEPVQLREGNANEENDKFEILPPKIIAMHRVRWNINKGSE 747

Query: 151  RWLCYGGASGILRCQEI 101
            RWLCYGGASGI+RCQEI
Sbjct: 748  RWLCYGGASGIIRCQEI 764


>ref|XP_017247377.1| PREDICTED: uncharacterized protein LOC108218785 isoform X1 [Daucus
            carota subsp. sativus]
 ref|XP_017247378.1| PREDICTED: uncharacterized protein LOC108218785 isoform X1 [Daucus
            carota subsp. sativus]
 ref|XP_017247380.1| PREDICTED: uncharacterized protein LOC108218785 isoform X1 [Daucus
            carota subsp. sativus]
 ref|XP_017247381.1| PREDICTED: uncharacterized protein LOC108218785 isoform X1 [Daucus
            carota subsp. sativus]
 ref|XP_017247382.1| PREDICTED: uncharacterized protein LOC108218785 isoform X1 [Daucus
            carota subsp. sativus]
          Length = 925

 Score =  540 bits (1390), Expect = 0.0
 Identities = 265/377 (70%), Positives = 301/377 (79%)
 Frame = -2

Query: 1231 RQSMPLTLEWSASPPHDLILAGCHDGVVALWKFSVNVSPKDTRPLLCFSADTVPIRALAW 1052
            RQS+PLT+EWSASPPHDLILAGCHDGVVALWKFS NVS KDTRPLLCFSADTVPIRALAW
Sbjct: 567  RQSVPLTVEWSASPPHDLILAGCHDGVVALWKFSANVSCKDTRPLLCFSADTVPIRALAW 626

Query: 1051 APIQSDPESANVFVTGGHKGLKFWDMRDPFRPLWDLNPVQRIICSLDWLPDPRCIIVSYD 872
            AP QSDP SANV VT GH  LKFWD+RDPFRPLWDLNP+Q++ICSLDW+PDPR II+SY+
Sbjct: 627  APSQSDPGSANVIVTAGHGCLKFWDIRDPFRPLWDLNPIQKVICSLDWVPDPRGIIISYE 686

Query: 871  DGTIRILSLSKAAYDVPVTGRPFVGTQQQVLHSYYCSPYTIWSIQVSRLTGMVAYCTADG 692
            DGTIRILSLS+AA ++PVTG+PFVGT Q+ LH Y CS YTIWSIQVSR+TGMVAYC+ADG
Sbjct: 687  DGTIRILSLSEAANNIPVTGKPFVGTPQEGLHRYCCSSYTIWSIQVSRITGMVAYCSADG 746

Query: 691  TVLCFQLTTRAVEKDPLRNRAPHFLCSSLADEESTLTMYTPLPDIPFPMKKSLNEWANAP 512
            TVL FQLTT+A+ +DPLR+RAPHFLC SL +E+STL MYTPLPD+P   K          
Sbjct: 747  TVLRFQLTTKAMGRDPLRHRAPHFLCGSLTEEDSTLIMYTPLPDVPLLFK---------- 796

Query: 511  RTIRGFLSVSNQEKRVKEEMAKGQTSNQPPLALCYGDDPGIDSGSEDKTVEEXXXXXXXX 332
                   ++SNQEK+VK EM  GQ SNQ  LAL +GDDPGI+SGSED  + E        
Sbjct: 797  -------NLSNQEKKVKIEMNGGQPSNQQALALSHGDDPGIESGSED-IMAEKSKKSSKS 848

Query: 331  XXXXXKMPKAEQASVCRADDPEPVQREDHEKVEVRDEIEVLPPKIIAMHRVRWNMNKGSE 152
                 K+P A QA VCR D+PEPVQ  +    E  D+ E+LPPKIIAMHRVRWN+NKGSE
Sbjct: 849  KTKSTKVPNASQALVCRDDNPEPVQLREGNANEENDKFEILPPKIIAMHRVRWNINKGSE 908

Query: 151  RWLCYGGASGILRCQEI 101
            RWLCYGGASGI+RCQEI
Sbjct: 909  RWLCYGGASGIIRCQEI 925


>gb|POF00173.1| general transcription factor 3c polypeptide 2 [Quercus suber]
          Length = 398

 Score =  515 bits (1326), Expect = e-179
 Identities = 242/373 (64%), Positives = 293/373 (78%)
 Frame = -2

Query: 1210 LEWSASPPHDLILAGCHDGVVALWKFSVNVSPKDTRPLLCFSADTVPIRALAWAPIQSDP 1031
            +EWSASPPHD +LAGCHDG VALWKFS + S +DTRPLLCFSADTVPIRALAWAP++SDP
Sbjct: 17   VEWSASPPHDYLLAGCHDGTVALWKFSASCSSEDTRPLLCFSADTVPIRALAWAPLESDP 76

Query: 1030 ESANVFVTGGHKGLKFWDMRDPFRPLWDLNPVQRIICSLDWLPDPRCIIVSYDDGTIRIL 851
            ESANV VT GH GLKFWD+RDP+RPLWDL+PV RII SLDWL +PRC+I+S+DDGT+RIL
Sbjct: 77   ESANVIVTAGHGGLKFWDLRDPYRPLWDLHPVPRIIYSLDWLSNPRCVILSFDDGTMRIL 136

Query: 850  SLSKAAYDVPVTGRPFVGTQQQVLHSYYCSPYTIWSIQVSRLTGMVAYCTADGTVLCFQL 671
            SL KAAYDVPVTG+PF GT+QQ LHSYYCS + IWS+QVSR+TGM AYCTADGTVL FQL
Sbjct: 137  SLLKAAYDVPVTGKPFGGTKQQGLHSYYCSSFAIWSVQVSRITGMAAYCTADGTVLRFQL 196

Query: 670  TTRAVEKDPLRNRAPHFLCSSLADEESTLTMYTPLPDIPFPMKKSLNEWANAPRTIRGFL 491
            T++AV+KDP RNR PHFLC SL +EES +T+ TP+P+ PFP+KKSLN+  + P ++R F 
Sbjct: 197  TSKAVDKDPSRNRTPHFLCGSLTEEESLITINTPVPNTPFPLKKSLNKGGDTPLSMREFS 256

Query: 490  SVSNQEKRVKEEMAKGQTSNQPPLALCYGDDPGIDSGSEDKTVEEXXXXXXXXXXXXXKM 311
            S     KR  ++MAK  +++   LALCYGDDPG +SG+E+                  K 
Sbjct: 257  SEPQHVKRANDKMAKSPSTDATTLALCYGDDPGTESGTEEALTRPKSKKRPNSRSSNKKN 316

Query: 310  PKAEQASVCRADDPEPVQREDHEKVEVRDEIEVLPPKIIAMHRVRWNMNKGSERWLCYGG 131
            P+ + A VCR ++P   Q +++ K E R  IEV PPKI+AM RVRWNMNKGSERWLCYGG
Sbjct: 317  PEDDLALVCRDEEPPNTQEKENGKAEAR-TIEVFPPKIVAMRRVRWNMNKGSERWLCYGG 375

Query: 130  ASGILRCQEINLS 92
             +G++RCQEI LS
Sbjct: 376  EAGVVRCQEIVLS 388


>ref|XP_023920540.1| uncharacterized protein LOC112032069 isoform X2 [Quercus suber]
          Length = 733

 Score =  525 bits (1352), Expect = e-179
 Identities = 247/379 (65%), Positives = 299/379 (78%)
 Frame = -2

Query: 1228 QSMPLTLEWSASPPHDLILAGCHDGVVALWKFSVNVSPKDTRPLLCFSADTVPIRALAWA 1049
            QS+PLT+EWSASPPHD +LAGCHDG VALWKFS + S +DTRPLLCFSADTVPIRALAWA
Sbjct: 346  QSIPLTVEWSASPPHDYLLAGCHDGTVALWKFSASCSSEDTRPLLCFSADTVPIRALAWA 405

Query: 1048 PIQSDPESANVFVTGGHKGLKFWDMRDPFRPLWDLNPVQRIICSLDWLPDPRCIIVSYDD 869
            P++SDPESANV VT GH GLKFWD+RDP+RPLWDL+PV RII SLDWL +PRC+I+S+DD
Sbjct: 406  PLESDPESANVIVTAGHGGLKFWDLRDPYRPLWDLHPVPRIIYSLDWLSNPRCVILSFDD 465

Query: 868  GTIRILSLSKAAYDVPVTGRPFVGTQQQVLHSYYCSPYTIWSIQVSRLTGMVAYCTADGT 689
            GT+RILSL KAAYDVPVTG+PF GT+QQ LHSYYCS + IWS+QVSR+TGM AYCTADGT
Sbjct: 466  GTMRILSLLKAAYDVPVTGKPFGGTKQQGLHSYYCSSFAIWSVQVSRITGMAAYCTADGT 525

Query: 688  VLCFQLTTRAVEKDPLRNRAPHFLCSSLADEESTLTMYTPLPDIPFPMKKSLNEWANAPR 509
            VL FQLT++AV+KDP RNR PHFLC SL +EES +T+ TP+P+ PFP+KKSLN+  + P 
Sbjct: 526  VLRFQLTSKAVDKDPSRNRTPHFLCGSLTEEESLITINTPVPNTPFPLKKSLNKGGDTPL 585

Query: 508  TIRGFLSVSNQEKRVKEEMAKGQTSNQPPLALCYGDDPGIDSGSEDKTVEEXXXXXXXXX 329
            ++R F S     KR  ++MAK  +++   LALCYGDDPG +SG+E+              
Sbjct: 586  SMREFSSEPQHVKRANDKMAKSPSTDATTLALCYGDDPGTESGTEEALTRPKSKKRPNSR 645

Query: 328  XXXXKMPKAEQASVCRADDPEPVQREDHEKVEVRDEIEVLPPKIIAMHRVRWNMNKGSER 149
                K P+ + A VCR ++P   Q +++ K E R  IEV PPKI+AM RVRWNMNKGSER
Sbjct: 646  SSNKKNPEDDLALVCRDEEPPNTQEKENGKAEAR-TIEVFPPKIVAMRRVRWNMNKGSER 704

Query: 148  WLCYGGASGILRCQEINLS 92
            WLCYGG +G++RCQEI LS
Sbjct: 705  WLCYGGEAGVVRCQEIVLS 723


>ref|XP_023920539.1| uncharacterized protein LOC112032069 isoform X1 [Quercus suber]
 gb|POF00175.1| general transcription factor 3c polypeptide 2 [Quercus suber]
          Length = 908

 Score =  525 bits (1352), Expect = e-176
 Identities = 247/379 (65%), Positives = 299/379 (78%)
 Frame = -2

Query: 1228 QSMPLTLEWSASPPHDLILAGCHDGVVALWKFSVNVSPKDTRPLLCFSADTVPIRALAWA 1049
            QS+PLT+EWSASPPHD +LAGCHDG VALWKFS + S +DTRPLLCFSADTVPIRALAWA
Sbjct: 521  QSIPLTVEWSASPPHDYLLAGCHDGTVALWKFSASCSSEDTRPLLCFSADTVPIRALAWA 580

Query: 1048 PIQSDPESANVFVTGGHKGLKFWDMRDPFRPLWDLNPVQRIICSLDWLPDPRCIIVSYDD 869
            P++SDPESANV VT GH GLKFWD+RDP+RPLWDL+PV RII SLDWL +PRC+I+S+DD
Sbjct: 581  PLESDPESANVIVTAGHGGLKFWDLRDPYRPLWDLHPVPRIIYSLDWLSNPRCVILSFDD 640

Query: 868  GTIRILSLSKAAYDVPVTGRPFVGTQQQVLHSYYCSPYTIWSIQVSRLTGMVAYCTADGT 689
            GT+RILSL KAAYDVPVTG+PF GT+QQ LHSYYCS + IWS+QVSR+TGM AYCTADGT
Sbjct: 641  GTMRILSLLKAAYDVPVTGKPFGGTKQQGLHSYYCSSFAIWSVQVSRITGMAAYCTADGT 700

Query: 688  VLCFQLTTRAVEKDPLRNRAPHFLCSSLADEESTLTMYTPLPDIPFPMKKSLNEWANAPR 509
            VL FQLT++AV+KDP RNR PHFLC SL +EES +T+ TP+P+ PFP+KKSLN+  + P 
Sbjct: 701  VLRFQLTSKAVDKDPSRNRTPHFLCGSLTEEESLITINTPVPNTPFPLKKSLNKGGDTPL 760

Query: 508  TIRGFLSVSNQEKRVKEEMAKGQTSNQPPLALCYGDDPGIDSGSEDKTVEEXXXXXXXXX 329
            ++R F S     KR  ++MAK  +++   LALCYGDDPG +SG+E+              
Sbjct: 761  SMREFSSEPQHVKRANDKMAKSPSTDATTLALCYGDDPGTESGTEEALTRPKSKKRPNSR 820

Query: 328  XXXXKMPKAEQASVCRADDPEPVQREDHEKVEVRDEIEVLPPKIIAMHRVRWNMNKGSER 149
                K P+ + A VCR ++P   Q +++ K E R  IEV PPKI+AM RVRWNMNKGSER
Sbjct: 821  SSNKKNPEDDLALVCRDEEPPNTQEKENGKAEAR-TIEVFPPKIVAMRRVRWNMNKGSER 879

Query: 148  WLCYGGASGILRCQEINLS 92
            WLCYGG +G++RCQEI LS
Sbjct: 880  WLCYGGEAGVVRCQEIVLS 898


>gb|OVA03597.1| WD40 repeat [Macleaya cordata]
          Length = 1088

 Score =  507 bits (1305), Expect = e-167
 Identities = 250/383 (65%), Positives = 293/383 (76%), Gaps = 1/383 (0%)
 Frame = -2

Query: 1231 RQSMPLTLEWSASPPHDLILAGCHDGVVALWKFSVNVSPKDTRPLLCFSADTVPIRALAW 1052
            RQS+PLTLEWS S PHDLILAGCHDG VALWKFS + S +DT+PLL FSAD VPIRALAW
Sbjct: 712  RQSIPLTLEWSRSSPHDLILAGCHDGTVALWKFSPSGSSQDTKPLLYFSADNVPIRALAW 771

Query: 1051 APIQSDPESANVFVTGGHKGLKFWDMRDPFRPLWDLNPVQRIICSLDWLPDPRCIIVSYD 872
            AP +SD ESANV  T GH GL+FWD+RDP+RPLWDLNPV+R+I SLDWLPDPRC+I+S+D
Sbjct: 772  APCESDAESANVIATAGHGGLRFWDLRDPYRPLWDLNPVRRVIYSLDWLPDPRCVIMSFD 831

Query: 871  DGTIRILSLSKAAYDVPVTGRPFVGTQQQVLHSYYCSPYTIWSIQVSRLTGMVAYCTADG 692
            DGT+RILSLSKAAYDVPVTG PFVGTQQQ LHSY+CS + IWS+  SR  GMVAYC+ADG
Sbjct: 832  DGTLRILSLSKAAYDVPVTGTPFVGTQQQGLHSYFCSSFPIWSVHASRPAGMVAYCSADG 891

Query: 691  TVLCFQLTTRAVEKDPLRNRAPHFLCSSLADEESTLTMYTPLPDIPFPMKKSLNEWANAP 512
             V+ FQLT++AV+KDP RNRAPHFLC SL +E+STL +  PLPD+PFPMKKSLNEW + P
Sbjct: 892  NVVRFQLTSKAVDKDPSRNRAPHFLCGSLREEDSTLAVNIPLPDVPFPMKKSLNEWGDTP 951

Query: 511  RTIRGFLSVSNQEKRVKEEMAKGQTSNQPPLALCYGDDPGIDSGSEDKTVEEXXXXXXXX 332
            R+IRGFLS   Q KR     A  Q S+   LALCYGDDP  + GS      +        
Sbjct: 952  RSIRGFLSDVYQAKR-----ANDQASDDTKLALCYGDDPS-NFGSGMTLASQKCKTTPKP 1005

Query: 331  XXXXXKMPKAEQASVCRADDPEPVQR-EDHEKVEVRDEIEVLPPKIIAMHRVRWNMNKGS 155
                 K P ++QA   R ++PE  QR  ++ K E   EIEV P KI+AMHRVRWNMN+GS
Sbjct: 1006 KVAKKKTPASDQALAIR-EEPENSQRGGENRKGEKETEIEVFPSKIVAMHRVRWNMNEGS 1064

Query: 154  ERWLCYGGASGILRCQEINLSGL 86
            ERWLCYGGA+GI+RCQEI +  +
Sbjct: 1065 ERWLCYGGAAGIVRCQEIAIRSI 1087


>ref|XP_021728695.1| uncharacterized protein LOC110695770 [Chenopodium quinoa]
          Length = 1034

 Score =  504 bits (1297), Expect = e-167
 Identities = 238/378 (62%), Positives = 285/378 (75%), Gaps = 1/378 (0%)
 Frame = -2

Query: 1231 RQSMPLTLEWSASPPHDLILAGCHDGVVALWKFSVNVSPKDTRPLLCFSADTVPIRALAW 1052
            RQSMPLT+EWSASPPHD +LAGCHDGVVALWKFSV+V  +D RPLLCFSA+T PIRAL W
Sbjct: 650  RQSMPLTVEWSASPPHDFLLAGCHDGVVALWKFSVDVQSEDIRPLLCFSAETSPIRALTW 709

Query: 1051 APIQSDPESANVFVTGGHKGLKFWDMRDPFRPLWDLNPVQRIICSLDWLPDPRCIIVSYD 872
            AP + DPES+N+ VT GH GLKFWD+RDPFRPLWD NP QR I  LDW+PDPRC++VSYD
Sbjct: 710  APFEGDPESSNIIVTAGHGGLKFWDLRDPFRPLWDANPSQRFIYGLDWVPDPRCVLVSYD 769

Query: 871  DGTIRILSLSKAAYDVPVTGRPFVGTQQQVLHSYYCSPYTIWSIQVSRLTGMVAYCTADG 692
            DGT+R+LS+S+A+YDVPVTG+PF GTQQQ LHSYYCS + IW+I VSRLTGMVAYC ADG
Sbjct: 770  DGTLRMLSISRASYDVPVTGQPFSGTQQQGLHSYYCSSFAIWNIHVSRLTGMVAYCCADG 829

Query: 691  TVLCFQLTTRAVEKDPLRNRAPHFLCSSLADEESTLTMYTPLPDIPFPMKKSLNEWANAP 512
            TVL FQLT +AV+KDPLRNRAPHF+CSS + EES +TM T LP  P PM+KS  EWAN P
Sbjct: 830  TVLYFQLTHKAVDKDPLRNRAPHFICSSFSSEESAVTMNTQLPSSPQPMRKSATEWANTP 889

Query: 511  RTIRGFLSVSNQEKRVKEEMAKGQTSNQPPLALCYGDD-PGIDSGSEDKTVEEXXXXXXX 335
            R  R   +  NQ KR ++   KGQ  +   LALCYGDD  G+D+     +V+        
Sbjct: 890  RPARTVATGLNQAKRTRKGTEKGQKPDDQVLALCYGDDAEGVDTNEAQTSVK-----AGK 944

Query: 334  XXXXXXKMPKAEQASVCRADDPEPVQREDHEKVEVRDEIEVLPPKIIAMHRVRWNMNKGS 155
                  +    +QA +C   D E +  +   K EV  EI+V PPK ++M++VRWNMNKGS
Sbjct: 945  GKSIKSRKSSDDQALICADRDGESIDGKPEPKAEVVAEIDVFPPKAVSMNKVRWNMNKGS 1004

Query: 154  ERWLCYGGASGILRCQEI 101
            E+WLCYGGA+GI+RCQEI
Sbjct: 1005 EKWLCYGGAAGIVRCQEI 1022


>ref|XP_021859677.1| uncharacterized protein LOC110798793 isoform X2 [Spinacia oleracea]
          Length = 868

 Score =  493 bits (1270), Expect = e-164
 Identities = 237/377 (62%), Positives = 280/377 (74%)
 Frame = -2

Query: 1231 RQSMPLTLEWSASPPHDLILAGCHDGVVALWKFSVNVSPKDTRPLLCFSADTVPIRALAW 1052
            RQSMPLT+EWSASPPHDLILAGCHDGVVALWKFSV+V  +D RPLLCFSA+T PIRAL W
Sbjct: 481  RQSMPLTVEWSASPPHDLILAGCHDGVVALWKFSVDVPSEDARPLLCFSAETGPIRALTW 540

Query: 1051 APIQSDPESANVFVTGGHKGLKFWDMRDPFRPLWDLNPVQRIICSLDWLPDPRCIIVSYD 872
            AP + D ES N+ VT GH GLKFWD+RDPFRPLWD NP QR I  LDW+PDPRC++VSYD
Sbjct: 541  APYEGDRESTNIIVTAGHGGLKFWDLRDPFRPLWDANPSQRFIYGLDWVPDPRCLLVSYD 600

Query: 871  DGTIRILSLSKAAYDVPVTGRPFVGTQQQVLHSYYCSPYTIWSIQVSRLTGMVAYCTADG 692
            DGT+R+LSLS+AAYDVPVTG+PF GTQQQ LHSY+CS + +W+I VSRLTGMVAYC ADG
Sbjct: 601  DGTLRMLSLSRAAYDVPVTGQPFTGTQQQGLHSYHCSSFAVWNIHVSRLTGMVAYCCADG 660

Query: 691  TVLCFQLTTRAVEKDPLRNRAPHFLCSSLADEESTLTMYTPLPDIPFPMKKSLNEWANAP 512
            TVL FQLT +AV+KDPLRNRAPHFLC SL  EES + M T L   P  MKKS  EW+N P
Sbjct: 661  TVLYFQLTHKAVDKDPLRNRAPHFLCGSLTSEESAVAMNTQLSSFPQRMKKSATEWSNTP 720

Query: 511  RTIRGFLSVSNQEKRVKEEMAKGQTSNQPPLALCYGDDPGIDSGSEDKTVEEXXXXXXXX 332
            R  R   S  NQ KR ++   K Q  + P LALCYGDD G +  +E +T           
Sbjct: 721  RPARTVASGLNQAKRNRKGTEKFQKVDDPVLALCYGDDTGKEDTNEAET-STSVQEGSKG 779

Query: 331  XXXXXKMPKAEQASVCRADDPEPVQREDHEKVEVRDEIEVLPPKIIAMHRVRWNMNKGSE 152
                 +    +Q  +C   + E +  +   K EV  E+EVLPPK+++M++VRWNMNKGS+
Sbjct: 780  KTGNSRKGNDDQVLICPVGNGENMNGKAEVKQEVAAEMEVLPPKVVSMYKVRWNMNKGSQ 839

Query: 151  RWLCYGGASGILRCQEI 101
            RWLCYGGA+GI+RCQEI
Sbjct: 840  RWLCYGGAAGIVRCQEI 856


>ref|XP_021764765.1| uncharacterized protein LOC110729340 [Chenopodium quinoa]
          Length = 1029

 Score =  497 bits (1279), Expect = e-164
 Identities = 237/378 (62%), Positives = 284/378 (75%), Gaps = 1/378 (0%)
 Frame = -2

Query: 1231 RQSMPLTLEWSASPPHDLILAGCHDGVVALWKFSVNVSPKDTRPLLCFSADTVPIRALAW 1052
            RQSMPLT+EWSASPPHD +LAGCHDGVVALWKFSV+V  +D RPLLCFSA+T PIRAL+W
Sbjct: 647  RQSMPLTVEWSASPPHDFLLAGCHDGVVALWKFSVDVQSEDARPLLCFSAETSPIRALSW 706

Query: 1051 APIQSDPESANVFVTGGHKGLKFWDMRDPFRPLWDLNPVQRIICSLDWLPDPRCIIVSYD 872
            AP + DPES+N+ VT GH GLKFWD+RDPFRPLWD NP QR I  LDW+PDPRC++VSYD
Sbjct: 707  APFEGDPESSNIIVTAGHGGLKFWDLRDPFRPLWDANPSQRFIYGLDWVPDPRCVLVSYD 766

Query: 871  DGTIRILSLSKAAYDVPVTGRPFVGTQQQVLHSYYCSPYTIWSIQVSRLTGMVAYCTADG 692
            DGT+R+LS+S+AAYDVPVTG+PF GTQQQ LHSY+CS + +W+I VSRLTGMVAYC ADG
Sbjct: 767  DGTLRMLSISRAAYDVPVTGQPFSGTQQQGLHSYHCSAFAVWNIHVSRLTGMVAYCCADG 826

Query: 691  TVLCFQLTTRAVEKDPLRNRAPHFLCSSLADEESTLTMYTPLPDIPFPMKKSLNEWANAP 512
            TVL FQLT +AV+KDPLRNRAPHF+CSS + EES +TM T LP  P PM+KS  EWAN P
Sbjct: 827  TVLYFQLTHKAVDKDPLRNRAPHFICSSFSSEESAVTMNTQLPSSPQPMRKSATEWANTP 886

Query: 511  RTIRGFLSVSNQEKRVKEEMAKGQTSNQPPLALCYGDDP-GIDSGSEDKTVEEXXXXXXX 335
            R  R   +  NQ KR ++   KGQ  +   LALCYGDD  G+D+     +V+        
Sbjct: 887  RPARTVATGLNQAKRTRKGTGKGQKPDDQVLALCYGDDAGGVDTNEAQTSVK-----GGK 941

Query: 334  XXXXXXKMPKAEQASVCRADDPEPVQREDHEKVEVRDEIEVLPPKIIAMHRVRWNMNKGS 155
                  +    +QA VC   D + V  +   K     EIEV PPK ++M++VRWNMNKGS
Sbjct: 942  GKSIKSRKASDDQALVCADRDGDNVDGKPEPKAVA--EIEVFPPKAVSMNKVRWNMNKGS 999

Query: 154  ERWLCYGGASGILRCQEI 101
            E+WLCYGGA+GI+RCQEI
Sbjct: 1000 EKWLCYGGAAGIVRCQEI 1017


>ref|XP_010242589.1| PREDICTED: uncharacterized protein LOC104586906 isoform X2 [Nelumbo
            nucifera]
          Length = 882

 Score =  491 bits (1263), Expect = e-163
 Identities = 242/381 (63%), Positives = 284/381 (74%), Gaps = 1/381 (0%)
 Frame = -2

Query: 1231 RQSMPLTLEWSASPPHDLILAGCHDGVVALWKFSVNVSPKDTRPLLCFSADTVPIRALAW 1052
            RQS+PLT+EWS S PHDLILAGCHDG VALWKF    S +DTRPLLCFSADTVPIRAL+W
Sbjct: 502  RQSIPLTMEWSPSAPHDLILAGCHDGTVALWKFFPGGSSQDTRPLLCFSADTVPIRALSW 561

Query: 1051 APIQSDPESANVFVTGGHKGLKFWDMRDPFRPLWDLNPVQRIICSLDWLPDPRCIIVSYD 872
            AP +SD E ANV VT GH  L+FWD+RDP+RPLW++N V+R++ SLDWL DPRCII++YD
Sbjct: 562  APDESDAEGANVIVTAGHGSLRFWDLRDPYRPLWEINSVRRVVYSLDWLLDPRCIILAYD 621

Query: 871  DGTIRILSLSKAAYDVPVTGRPFVGTQQQVLHSYYCSPYTIWSIQVSRLTGMVAYCTADG 692
            DGT+RILSLSKAAYDVPVTG+PF GTQQQ LHSYYCS +TIWS+ VSRLTGMVAYC ADG
Sbjct: 622  DGTLRILSLSKAAYDVPVTGKPFSGTQQQGLHSYYCSSFTIWSVHVSRLTGMVAYCNADG 681

Query: 691  TVLCFQLTTRAVEKDPLRNRAPHFLCSSLADEESTLTMYTPLPDIPFPMKKSLNEWANAP 512
            TVL FQLT +AV+KDP RN+ PHFLC SL +++STL++ TPLP  PFPMKKSLNEW + P
Sbjct: 682  TVLHFQLTAKAVDKDPSRNKTPHFLCGSLTEDDSTLSVNTPLPCTPFPMKKSLNEWGDTP 741

Query: 511  RTIRGFLSVSNQEKRVKEEMAKGQTSNQPPLALCYGDDPGIDSGSEDKTVEEXXXXXXXX 332
            R+IRG LS SNQ K+  +E+          LALCYGDDP    G ++             
Sbjct: 742  RSIRGILSGSNQAKKANDEV----------LALCYGDDPEPGFGYDNSPAN---PNRRTQ 788

Query: 331  XXXXXKMPKAEQASVCRADDP-EPVQREDHEKVEVRDEIEVLPPKIIAMHRVRWNMNKGS 155
                 K  K      C A++    +QR  +EK     EIE+ PPKIIAMHRVRWNMNKGS
Sbjct: 789  KPNTCKKKKLGSDLACSAEEELGNLQRGGNEKSAAMSEIEIFPPKIIAMHRVRWNMNKGS 848

Query: 154  ERWLCYGGASGILRCQEINLS 92
             R LCYGGA+GI+RCQ+I  S
Sbjct: 849  GRLLCYGGAAGIVRCQDIAAS 869


>ref|XP_019051477.1| PREDICTED: uncharacterized protein LOC104586906 isoform X1 [Nelumbo
            nucifera]
          Length = 891

 Score =  491 bits (1263), Expect = e-163
 Identities = 242/381 (63%), Positives = 284/381 (74%), Gaps = 1/381 (0%)
 Frame = -2

Query: 1231 RQSMPLTLEWSASPPHDLILAGCHDGVVALWKFSVNVSPKDTRPLLCFSADTVPIRALAW 1052
            RQS+PLT+EWS S PHDLILAGCHDG VALWKF    S +DTRPLLCFSADTVPIRAL+W
Sbjct: 511  RQSIPLTMEWSPSAPHDLILAGCHDGTVALWKFFPGGSSQDTRPLLCFSADTVPIRALSW 570

Query: 1051 APIQSDPESANVFVTGGHKGLKFWDMRDPFRPLWDLNPVQRIICSLDWLPDPRCIIVSYD 872
            AP +SD E ANV VT GH  L+FWD+RDP+RPLW++N V+R++ SLDWL DPRCII++YD
Sbjct: 571  APDESDAEGANVIVTAGHGSLRFWDLRDPYRPLWEINSVRRVVYSLDWLLDPRCIILAYD 630

Query: 871  DGTIRILSLSKAAYDVPVTGRPFVGTQQQVLHSYYCSPYTIWSIQVSRLTGMVAYCTADG 692
            DGT+RILSLSKAAYDVPVTG+PF GTQQQ LHSYYCS +TIWS+ VSRLTGMVAYC ADG
Sbjct: 631  DGTLRILSLSKAAYDVPVTGKPFSGTQQQGLHSYYCSSFTIWSVHVSRLTGMVAYCNADG 690

Query: 691  TVLCFQLTTRAVEKDPLRNRAPHFLCSSLADEESTLTMYTPLPDIPFPMKKSLNEWANAP 512
            TVL FQLT +AV+KDP RN+ PHFLC SL +++STL++ TPLP  PFPMKKSLNEW + P
Sbjct: 691  TVLHFQLTAKAVDKDPSRNKTPHFLCGSLTEDDSTLSVNTPLPCTPFPMKKSLNEWGDTP 750

Query: 511  RTIRGFLSVSNQEKRVKEEMAKGQTSNQPPLALCYGDDPGIDSGSEDKTVEEXXXXXXXX 332
            R+IRG LS SNQ K+  +E+          LALCYGDDP    G ++             
Sbjct: 751  RSIRGILSGSNQAKKANDEV----------LALCYGDDPEPGFGYDNSPAN---PNRRTQ 797

Query: 331  XXXXXKMPKAEQASVCRADDP-EPVQREDHEKVEVRDEIEVLPPKIIAMHRVRWNMNKGS 155
                 K  K      C A++    +QR  +EK     EIE+ PPKIIAMHRVRWNMNKGS
Sbjct: 798  KPNTCKKKKLGSDLACSAEEELGNLQRGGNEKSAAMSEIEIFPPKIIAMHRVRWNMNKGS 857

Query: 154  ERWLCYGGASGILRCQEINLS 92
             R LCYGGA+GI+RCQ+I  S
Sbjct: 858  GRLLCYGGAAGIVRCQDIAAS 878


>ref|XP_021859676.1| uncharacterized protein LOC110798793 isoform X1 [Spinacia oleracea]
          Length = 1029

 Score =  493 bits (1270), Expect = e-163
 Identities = 237/377 (62%), Positives = 280/377 (74%)
 Frame = -2

Query: 1231 RQSMPLTLEWSASPPHDLILAGCHDGVVALWKFSVNVSPKDTRPLLCFSADTVPIRALAW 1052
            RQSMPLT+EWSASPPHDLILAGCHDGVVALWKFSV+V  +D RPLLCFSA+T PIRAL W
Sbjct: 642  RQSMPLTVEWSASPPHDLILAGCHDGVVALWKFSVDVPSEDARPLLCFSAETGPIRALTW 701

Query: 1051 APIQSDPESANVFVTGGHKGLKFWDMRDPFRPLWDLNPVQRIICSLDWLPDPRCIIVSYD 872
            AP + D ES N+ VT GH GLKFWD+RDPFRPLWD NP QR I  LDW+PDPRC++VSYD
Sbjct: 702  APYEGDRESTNIIVTAGHGGLKFWDLRDPFRPLWDANPSQRFIYGLDWVPDPRCLLVSYD 761

Query: 871  DGTIRILSLSKAAYDVPVTGRPFVGTQQQVLHSYYCSPYTIWSIQVSRLTGMVAYCTADG 692
            DGT+R+LSLS+AAYDVPVTG+PF GTQQQ LHSY+CS + +W+I VSRLTGMVAYC ADG
Sbjct: 762  DGTLRMLSLSRAAYDVPVTGQPFTGTQQQGLHSYHCSSFAVWNIHVSRLTGMVAYCCADG 821

Query: 691  TVLCFQLTTRAVEKDPLRNRAPHFLCSSLADEESTLTMYTPLPDIPFPMKKSLNEWANAP 512
            TVL FQLT +AV+KDPLRNRAPHFLC SL  EES + M T L   P  MKKS  EW+N P
Sbjct: 822  TVLYFQLTHKAVDKDPLRNRAPHFLCGSLTSEESAVAMNTQLSSFPQRMKKSATEWSNTP 881

Query: 511  RTIRGFLSVSNQEKRVKEEMAKGQTSNQPPLALCYGDDPGIDSGSEDKTVEEXXXXXXXX 332
            R  R   S  NQ KR ++   K Q  + P LALCYGDD G +  +E +T           
Sbjct: 882  RPARTVASGLNQAKRNRKGTEKFQKVDDPVLALCYGDDTGKEDTNEAET-STSVQEGSKG 940

Query: 331  XXXXXKMPKAEQASVCRADDPEPVQREDHEKVEVRDEIEVLPPKIIAMHRVRWNMNKGSE 152
                 +    +Q  +C   + E +  +   K EV  E+EVLPPK+++M++VRWNMNKGS+
Sbjct: 941  KTGNSRKGNDDQVLICPVGNGENMNGKAEVKQEVAAEMEVLPPKVVSMYKVRWNMNKGSQ 1000

Query: 151  RWLCYGGASGILRCQEI 101
            RWLCYGGA+GI+RCQEI
Sbjct: 1001 RWLCYGGAAGIVRCQEI 1017


>gb|OMO95816.1| hypothetical protein COLO4_15658 [Corchorus olitorius]
          Length = 1008

 Score =  492 bits (1267), Expect = e-162
 Identities = 233/378 (61%), Positives = 289/378 (76%)
 Frame = -2

Query: 1228 QSMPLTLEWSASPPHDLILAGCHDGVVALWKFSVNVSPKDTRPLLCFSADTVPIRALAWA 1049
            QS+PLT+EWS SPPHD +LAGCHDG+VALWKFS + SPKDTRPLLCFSADTVPIR++AWA
Sbjct: 624  QSIPLTVEWSTSPPHDYLLAGCHDGMVALWKFSASASPKDTRPLLCFSADTVPIRSVAWA 683

Query: 1048 PIQSDPESANVFVTGGHKGLKFWDMRDPFRPLWDLNPVQRIICSLDWLPDPRCIIVSYDD 869
            P  SD ES NV +T GH GLKFWD+RDPF PLWD++P  + I SLDWLP+PRC+I+S+DD
Sbjct: 684  PSGSDMESTNVILTAGHGGLKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDD 743

Query: 868  GTIRILSLSKAAYDVPVTGRPFVGTQQQVLHSYYCSPYTIWSIQVSRLTGMVAYCTADGT 689
            GT+++LSLS+A  DVPVTG+PF GT+QQ LH Y CS + IW+IQVSRLTGMVAYC ADGT
Sbjct: 744  GTMKLLSLSQAVSDVPVTGKPFTGTKQQGLHLYNCSSFAIWNIQVSRLTGMVAYCGADGT 803

Query: 688  VLCFQLTTRAVEKDPLRNRAPHFLCSSLADEESTLTMYTPLPDIPFPMKKSLNEWANAPR 509
            V  FQLT++AV+KD  RNRAPHF+C SL +EES +T+ TPLPDIP  MKKS +++   PR
Sbjct: 804  VSHFQLTSKAVDKDFSRNRAPHFVCGSLIEEESVITINTPLPDIPLTMKKSTSDYGEGPR 863

Query: 508  TIRGFLSVSNQEKRVKEEMAKGQTSNQPPLALCYGDDPGIDSGSEDKTVEEXXXXXXXXX 329
            ++R FL+ +NQ K  K++ AK QTS++  LALCYGDDPG++S SE+              
Sbjct: 864  SMRAFLTETNQAKNAKDKKAKVQTSDKQTLALCYGDDPGVESDSEETLAALKCKKKQNSQ 923

Query: 328  XXXXKMPKAEQASVCRADDPEPVQREDHEKVEVRDEIEVLPPKIIAMHRVRWNMNKGSER 149
                K    +QA   R ++       + +K E  +EIEV P K++AMHRVRWNMNKGSER
Sbjct: 924  SERNKKADNDQALAIRIEE----ATNNTQKEETGNEIEVFPAKMVAMHRVRWNMNKGSER 979

Query: 148  WLCYGGASGILRCQEINL 95
            WLCYGGA+GI+RCQEI +
Sbjct: 980  WLCYGGAAGIVRCQEIKV 997


>dbj|GAV66087.1| hypothetical protein CFOL_v3_09597 [Cephalotus follicularis]
          Length = 863

 Score =  487 bits (1254), Expect = e-162
 Identities = 233/379 (61%), Positives = 288/379 (75%), Gaps = 1/379 (0%)
 Frame = -2

Query: 1228 QSMPLTLEWSASPPHDLILAGCHDGVVALWKFSVNVSPKDTRPLLCFSADTVPIRALAWA 1049
            QS+PLTLEWS+SPPHD +LAGCHDG VALWKFS++ S KDTRPLL F+AD++PIRA+AWA
Sbjct: 470  QSIPLTLEWSSSPPHDYLLAGCHDGTVALWKFSISNSSKDTRPLLRFTADSLPIRAVAWA 529

Query: 1048 PIQSDPESANVFVTGGHKGLKFWDMRDPFRPLWDLNPVQRIICSLDWLPDPRCIIVSYDD 869
            P++SD E ANV +T GH GLKFWD+RDPFRPLW+L+PV R I SLDWLPDPRC+I+S+DD
Sbjct: 530  PVESDLERANVILTAGHGGLKFWDIRDPFRPLWELHPVPRFIYSLDWLPDPRCVILSFDD 589

Query: 868  GTIRILSLSKAAYDVPVTGRPFVGTQQQVLHSYYCSPYTIWSIQVSRLTGMVAYCTADGT 689
            GT+RILSL  AAYD PVTG+ F GT+QQ LH Y CS + IWS+QVSRLTGMVAYC+ADGT
Sbjct: 590  GTMRILSLVNAAYDTPVTGKAFTGTKQQGLHLYNCSSFAIWSVQVSRLTGMVAYCSADGT 649

Query: 688  VLCFQLTTRAVEKDPLRNRAPHFLCSSLADEESTLTMYTPLPDIPFPMKKSLNEWANAPR 509
            VL FQLTTRAVEKDP RNRAPHF+C +L+ EES +T+ TPLP  P  +KKS+N   ++ R
Sbjct: 650  VLHFQLTTRAVEKDPSRNRAPHFMCGALSAEESAVTVKTPLPHTPVALKKSINGCGDSSR 709

Query: 508  TIRGFLSVSNQEK-RVKEEMAKGQTSNQPPLALCYGDDPGIDSGSEDKTVEEXXXXXXXX 332
            ++R  L  +N+ K R  ++ A   +S+   LALCYGDDPGI+  SE+             
Sbjct: 710  SMRSLLYEANRAKTRANDKKANAPSSDNQTLALCYGDDPGIEFESEETLAALKRKKNSKS 769

Query: 331  XXXXXKMPKAEQASVCRADDPEPVQREDHEKVEVRDEIEVLPPKIIAMHRVRWNMNKGSE 152
                 K  K +Q  VC  ++   ++ +++E+ E   E E  PPKI+AMHRVRWNMNKGSE
Sbjct: 770  KSSSKKKTKNDQPLVCIPEESTNLKGKENERGETETETEKFPPKIVAMHRVRWNMNKGSE 829

Query: 151  RWLCYGGASGILRCQEINL 95
            RWLCYGGA+GILRCQEI +
Sbjct: 830  RWLCYGGAAGILRCQEIRV 848


>gb|EOX93901.1| DNA binding protein, putative isoform 1 [Theobroma cacao]
          Length = 868

 Score =  487 bits (1254), Expect = e-162
 Identities = 232/378 (61%), Positives = 287/378 (75%), Gaps = 2/378 (0%)
 Frame = -2

Query: 1228 QSMPLTLEWSASPPHDLILAGCHDGVVALWKFSVNVSPKDTRPLLCFSADTVPIRALAWA 1049
            QS+PLT+EWS SPPH+ +LAGCHDG+VALWKFS + SP DTRPLLCFSADTVPIR++AWA
Sbjct: 483  QSIPLTVEWSTSPPHNYLLAGCHDGMVALWKFSASGSPTDTRPLLCFSADTVPIRSVAWA 542

Query: 1048 PIQSDPESANVFVTGGHKGLKFWDMRDPFRPLWDLNPVQRIICSLDWLPDPRCIIVSYDD 869
            P  SD ESANV +T GH GLKFWD+RDPF PLWD++P  + I SLDWLP+PRC+I+S+DD
Sbjct: 543  PSGSDMESANVVLTAGHGGLKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDD 602

Query: 868  GTIRILSLSKAAYDVPVTGRPFVGTQQQVLHSYYCSPYTIWSIQVSRLTGMVAYCTADGT 689
            GT+++LSL +AA DVPVTG+PF GT+QQ LH Y CS + IW++QVSRLTGMVAYC ADG 
Sbjct: 603  GTMKMLSLIQAACDVPVTGKPFTGTKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGADGN 662

Query: 688  VLCFQLTTRAVEKDPLRNRAPHFLCSSLADEESTLTMYTPLPDIPFPMKKSLNEWANAPR 509
            V  FQLT++AV+KD  RNRAPHF+C SL +EES + + TPLPDIP  +KK  N++   PR
Sbjct: 663  VTRFQLTSKAVDKDFSRNRAPHFVCGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGEGPR 722

Query: 508  TIRGFLSVSNQEKRVKEEMAKGQTSNQPPLALCYGDDPGIDSGSEDKTVEEXXXXXXXXX 329
            ++R FL+ SNQ K  K+  AK  T ++  LALCYG+DPG++S SE+              
Sbjct: 723  SMRAFLTESNQAKNAKDNKAKVPTPDKQTLALCYGNDPGVESESEETLTLAALKGKIKQK 782

Query: 328  XXXXKMPKA--EQASVCRADDPEPVQREDHEKVEVRDEIEVLPPKIIAMHRVRWNMNKGS 155
                +M KA  +QA   R ++P   Q+E     E  +EIEV PPKI+AMHRVRWNMNKGS
Sbjct: 783  SKSDRMKKAGDDQALAVRINEPANTQKE-----EAGNEIEVFPPKIVAMHRVRWNMNKGS 837

Query: 154  ERWLCYGGASGILRCQEI 101
            ERWLCYGGA+GI+RCQEI
Sbjct: 838  ERWLCYGGAAGIVRCQEI 855


>ref|XP_017969461.1| PREDICTED: uncharacterized protein LOC18612763 isoform X3 [Theobroma
            cacao]
          Length = 865

 Score =  485 bits (1248), Expect = e-161
 Identities = 231/378 (61%), Positives = 288/378 (76%), Gaps = 2/378 (0%)
 Frame = -2

Query: 1228 QSMPLTLEWSASPPHDLILAGCHDGVVALWKFSVNVSPKDTRPLLCFSADTVPIRALAWA 1049
            QS+PLT+EWS SPP++ +LAGCHDG+VALWKFS + SP DTRPLLCFSADTVPIR++AWA
Sbjct: 480  QSIPLTVEWSTSPPYNYLLAGCHDGMVALWKFSASGSPTDTRPLLCFSADTVPIRSVAWA 539

Query: 1048 PIQSDPESANVFVTGGHKGLKFWDMRDPFRPLWDLNPVQRIICSLDWLPDPRCIIVSYDD 869
            P  SD ESANV +T GH GLKFWD+RDPF PLWD++P  + I SLDWLP+PRC+I+S+DD
Sbjct: 540  PSGSDMESANVVLTAGHGGLKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDD 599

Query: 868  GTIRILSLSKAAYDVPVTGRPFVGTQQQVLHSYYCSPYTIWSIQVSRLTGMVAYCTADGT 689
            GT+++LSL +AA DVPVTG+PF GT+QQ LH Y CS + IW++QVSRLTGMVAYC ADG 
Sbjct: 600  GTMKMLSLIQAACDVPVTGKPFTGTKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGADGN 659

Query: 688  VLCFQLTTRAVEKDPLRNRAPHFLCSSLADEESTLTMYTPLPDIPFPMKKSLNEWANAPR 509
            V  FQLT++AV+KD  RNRAPHF+C SL +EES + + TPLPDIP  +KK  N++  +PR
Sbjct: 660  VTRFQLTSKAVDKDFSRNRAPHFVCGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGESPR 719

Query: 508  TIRGFLSVSNQEKRVKEEMAKGQTSNQPPLALCYGDDPGIDSGSEDKTVEEXXXXXXXXX 329
            ++R FL+ SNQ K  K+  AK  T ++  LALCYG+DPG++S SE+              
Sbjct: 720  SMRAFLTESNQAKNAKDNKAKVPTPDKRTLALCYGNDPGVESESEETLTLAALKGKIKQK 779

Query: 328  XXXXKMPKA--EQASVCRADDPEPVQREDHEKVEVRDEIEVLPPKIIAMHRVRWNMNKGS 155
                +M KA  +QA   R ++P   Q+E     E  +EIEV PPKI+AMHRVRWNMNKGS
Sbjct: 780  SKSDRMKKAGDDQALAVRINEPTNTQKE-----EAGNEIEVFPPKIVAMHRVRWNMNKGS 834

Query: 154  ERWLCYGGASGILRCQEI 101
            ERWLCYGGA+GI+RCQEI
Sbjct: 835  ERWLCYGGAAGIVRCQEI 852


>ref|XP_017969459.1| PREDICTED: uncharacterized protein LOC18612763 isoform X2 [Theobroma
            cacao]
 ref|XP_017969460.1| PREDICTED: uncharacterized protein LOC18612763 isoform X2 [Theobroma
            cacao]
          Length = 869

 Score =  485 bits (1248), Expect = e-161
 Identities = 231/378 (61%), Positives = 288/378 (76%), Gaps = 2/378 (0%)
 Frame = -2

Query: 1228 QSMPLTLEWSASPPHDLILAGCHDGVVALWKFSVNVSPKDTRPLLCFSADTVPIRALAWA 1049
            QS+PLT+EWS SPP++ +LAGCHDG+VALWKFS + SP DTRPLLCFSADTVPIR++AWA
Sbjct: 484  QSIPLTVEWSTSPPYNYLLAGCHDGMVALWKFSASGSPTDTRPLLCFSADTVPIRSVAWA 543

Query: 1048 PIQSDPESANVFVTGGHKGLKFWDMRDPFRPLWDLNPVQRIICSLDWLPDPRCIIVSYDD 869
            P  SD ESANV +T GH GLKFWD+RDPF PLWD++P  + I SLDWLP+PRC+I+S+DD
Sbjct: 544  PSGSDMESANVVLTAGHGGLKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDD 603

Query: 868  GTIRILSLSKAAYDVPVTGRPFVGTQQQVLHSYYCSPYTIWSIQVSRLTGMVAYCTADGT 689
            GT+++LSL +AA DVPVTG+PF GT+QQ LH Y CS + IW++QVSRLTGMVAYC ADG 
Sbjct: 604  GTMKMLSLIQAACDVPVTGKPFTGTKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGADGN 663

Query: 688  VLCFQLTTRAVEKDPLRNRAPHFLCSSLADEESTLTMYTPLPDIPFPMKKSLNEWANAPR 509
            V  FQLT++AV+KD  RNRAPHF+C SL +EES + + TPLPDIP  +KK  N++  +PR
Sbjct: 664  VTRFQLTSKAVDKDFSRNRAPHFVCGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGESPR 723

Query: 508  TIRGFLSVSNQEKRVKEEMAKGQTSNQPPLALCYGDDPGIDSGSEDKTVEEXXXXXXXXX 329
            ++R FL+ SNQ K  K+  AK  T ++  LALCYG+DPG++S SE+              
Sbjct: 724  SMRAFLTESNQAKNAKDNKAKVPTPDKRTLALCYGNDPGVESESEETLTLAALKGKIKQK 783

Query: 328  XXXXKMPKA--EQASVCRADDPEPVQREDHEKVEVRDEIEVLPPKIIAMHRVRWNMNKGS 155
                +M KA  +QA   R ++P   Q+E     E  +EIEV PPKI+AMHRVRWNMNKGS
Sbjct: 784  SKSDRMKKAGDDQALAVRINEPTNTQKE-----EAGNEIEVFPPKIVAMHRVRWNMNKGS 838

Query: 154  ERWLCYGGASGILRCQEI 101
            ERWLCYGGA+GI+RCQEI
Sbjct: 839  ERWLCYGGAAGIVRCQEI 856


>ref|XP_017969458.1| PREDICTED: uncharacterized protein LOC18612763 isoform X1 [Theobroma
            cacao]
          Length = 877

 Score =  485 bits (1248), Expect = e-161
 Identities = 231/378 (61%), Positives = 288/378 (76%), Gaps = 2/378 (0%)
 Frame = -2

Query: 1228 QSMPLTLEWSASPPHDLILAGCHDGVVALWKFSVNVSPKDTRPLLCFSADTVPIRALAWA 1049
            QS+PLT+EWS SPP++ +LAGCHDG+VALWKFS + SP DTRPLLCFSADTVPIR++AWA
Sbjct: 492  QSIPLTVEWSTSPPYNYLLAGCHDGMVALWKFSASGSPTDTRPLLCFSADTVPIRSVAWA 551

Query: 1048 PIQSDPESANVFVTGGHKGLKFWDMRDPFRPLWDLNPVQRIICSLDWLPDPRCIIVSYDD 869
            P  SD ESANV +T GH GLKFWD+RDPF PLWD++P  + I SLDWLP+PRC+I+S+DD
Sbjct: 552  PSGSDMESANVVLTAGHGGLKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDD 611

Query: 868  GTIRILSLSKAAYDVPVTGRPFVGTQQQVLHSYYCSPYTIWSIQVSRLTGMVAYCTADGT 689
            GT+++LSL +AA DVPVTG+PF GT+QQ LH Y CS + IW++QVSRLTGMVAYC ADG 
Sbjct: 612  GTMKMLSLIQAACDVPVTGKPFTGTKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGADGN 671

Query: 688  VLCFQLTTRAVEKDPLRNRAPHFLCSSLADEESTLTMYTPLPDIPFPMKKSLNEWANAPR 509
            V  FQLT++AV+KD  RNRAPHF+C SL +EES + + TPLPDIP  +KK  N++  +PR
Sbjct: 672  VTRFQLTSKAVDKDFSRNRAPHFVCGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGESPR 731

Query: 508  TIRGFLSVSNQEKRVKEEMAKGQTSNQPPLALCYGDDPGIDSGSEDKTVEEXXXXXXXXX 329
            ++R FL+ SNQ K  K+  AK  T ++  LALCYG+DPG++S SE+              
Sbjct: 732  SMRAFLTESNQAKNAKDNKAKVPTPDKRTLALCYGNDPGVESESEETLTLAALKGKIKQK 791

Query: 328  XXXXKMPKA--EQASVCRADDPEPVQREDHEKVEVRDEIEVLPPKIIAMHRVRWNMNKGS 155
                +M KA  +QA   R ++P   Q+E     E  +EIEV PPKI+AMHRVRWNMNKGS
Sbjct: 792  SKSDRMKKAGDDQALAVRINEPTNTQKE-----EAGNEIEVFPPKIVAMHRVRWNMNKGS 846

Query: 154  ERWLCYGGASGILRCQEI 101
            ERWLCYGGA+GI+RCQEI
Sbjct: 847  ERWLCYGGAAGIVRCQEI 864


>ref|XP_017969462.1| PREDICTED: uncharacterized protein LOC18612763 isoform X4 [Theobroma
            cacao]
          Length = 878

 Score =  485 bits (1248), Expect = e-161
 Identities = 231/378 (61%), Positives = 288/378 (76%), Gaps = 2/378 (0%)
 Frame = -2

Query: 1228 QSMPLTLEWSASPPHDLILAGCHDGVVALWKFSVNVSPKDTRPLLCFSADTVPIRALAWA 1049
            QS+PLT+EWS SPP++ +LAGCHDG+VALWKFS + SP DTRPLLCFSADTVPIR++AWA
Sbjct: 493  QSIPLTVEWSTSPPYNYLLAGCHDGMVALWKFSASGSPTDTRPLLCFSADTVPIRSVAWA 552

Query: 1048 PIQSDPESANVFVTGGHKGLKFWDMRDPFRPLWDLNPVQRIICSLDWLPDPRCIIVSYDD 869
            P  SD ESANV +T GH GLKFWD+RDPF PLWD++P  + I SLDWLP+PRC+I+S+DD
Sbjct: 553  PSGSDMESANVVLTAGHGGLKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDD 612

Query: 868  GTIRILSLSKAAYDVPVTGRPFVGTQQQVLHSYYCSPYTIWSIQVSRLTGMVAYCTADGT 689
            GT+++LSL +AA DVPVTG+PF GT+QQ LH Y CS + IW++QVSRLTGMVAYC ADG 
Sbjct: 613  GTMKMLSLIQAACDVPVTGKPFTGTKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGADGN 672

Query: 688  VLCFQLTTRAVEKDPLRNRAPHFLCSSLADEESTLTMYTPLPDIPFPMKKSLNEWANAPR 509
            V  FQLT++AV+KD  RNRAPHF+C SL +EES + + TPLPDIP  +KK  N++  +PR
Sbjct: 673  VTRFQLTSKAVDKDFSRNRAPHFVCGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGESPR 732

Query: 508  TIRGFLSVSNQEKRVKEEMAKGQTSNQPPLALCYGDDPGIDSGSEDKTVEEXXXXXXXXX 329
            ++R FL+ SNQ K  K+  AK  T ++  LALCYG+DPG++S SE+              
Sbjct: 733  SMRAFLTESNQAKNAKDNKAKVPTPDKRTLALCYGNDPGVESESEETLTLAALKGKIKQK 792

Query: 328  XXXXKMPKA--EQASVCRADDPEPVQREDHEKVEVRDEIEVLPPKIIAMHRVRWNMNKGS 155
                +M KA  +QA   R ++P   Q+E     E  +EIEV PPKI+AMHRVRWNMNKGS
Sbjct: 793  SKSDRMKKAGDDQALAVRINEPTNTQKE-----EAGNEIEVFPPKIVAMHRVRWNMNKGS 847

Query: 154  ERWLCYGGASGILRCQEI 101
            ERWLCYGGA+GI+RCQEI
Sbjct: 848  ERWLCYGGAAGIVRCQEI 865


>emb|CDP15391.1| unnamed protein product [Coffea canephora]
          Length = 942

 Score =  486 bits (1250), Expect = e-161
 Identities = 237/381 (62%), Positives = 280/381 (73%)
 Frame = -2

Query: 1231 RQSMPLTLEWSASPPHDLILAGCHDGVVALWKFSVNVSPKDTRPLLCFSADTVPIRALAW 1052
            RQS+PLTLEWSAS PHD+ILAGCHDGVVALWKF    S ++TRPLLCFSADTV IRAL W
Sbjct: 571  RQSIPLTLEWSASSPHDMILAGCHDGVVALWKFCATGSLQETRPLLCFSADTVTIRALTW 630

Query: 1051 APIQSDPESANVFVTGGHKGLKFWDMRDPFRPLWDLNPVQRIICSLDWLPDPRCIIVSYD 872
             P+ S  ESAN+ VT GH+GLKFWD+RDPFRPLWD  P QR+I SLDWLPDPRCIIVS+D
Sbjct: 631  VPVSSYSESANIIVTAGHRGLKFWDLRDPFRPLWDFYPFQRVIYSLDWLPDPRCIIVSFD 690

Query: 871  DGTIRILSLSKAAYDVPVTGRPFVGTQQQVLHSYYCSPYTIWSIQVSRLTGMVAYCTADG 692
            DG +RILSL KAA D PVTG+PF G QQ+  HSY CSP+ IWS+  SRLTGMVAYC ADG
Sbjct: 691  DGALRILSLLKAANDAPVTGKPFEGAQQKGFHSYLCSPFQIWSVHTSRLTGMVAYCGADG 750

Query: 691  TVLCFQLTTRAVEKDPLRNRAPHFLCSSLADEESTLTMYTPLPDIPFPMKKSLNEWANAP 512
            T L FQLTTRAVEKDPLRNRAPHFLC +L +E STLTM+T LP+ PFPM+KSL EW  AP
Sbjct: 751  TALRFQLTTRAVEKDPLRNRAPHFLCGALTEENSTLTMFTSLPNTPFPMRKSLREWGEAP 810

Query: 511  RTIRGFLSVSNQEKRVKEEMAKGQTSNQPPLALCYGDDPGIDSGSEDKTVEEXXXXXXXX 332
            RT+RG++SVSNQEKR K+++ K + S +   ALC   D   + G +   V E        
Sbjct: 811  RTVRGYISVSNQEKRAKQKVVKVR-SEEKHKALCKRGDLDSEFGPDCMAVTETREAGKVK 869

Query: 331  XXXXXKMPKAEQASVCRADDPEPVQREDHEKVEVRDEIEVLPPKIIAMHRVRWNMNKGSE 152
                    +A+Q  +   +D   + R + E      E+EV P K +AMHRVRWN NKGSE
Sbjct: 870  TSSN---SEADQRPIMVGEDNPDIMRGEVE------EVEVFPSKTVAMHRVRWNTNKGSE 920

Query: 151  RWLCYGGASGILRCQEINLSG 89
             WLCYGGA+G++R QEI++ G
Sbjct: 921  NWLCYGGAAGVVRFQEIDMCG 941

