BLASTX nr result

ID: Chrysanthemum21_contig00025254 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00025254
         (1079 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_022017923.1| uncharacterized protein LOC110917784 isoform...   370   e-114
ref|XP_022017922.1| uncharacterized protein LOC110917784 isoform...   370   e-113
ref|XP_022017920.1| uncharacterized protein LOC110917784 isoform...   370   e-112
gb|OTF91434.1| putative agenet-like domain, Agenet domain, plant...   370   e-112
gb|KVH89807.1| Agenet-like domain-containing protein [Cynara car...   332   3e-99
gb|KVH95523.1| Agenet-like domain-containing protein [Cynara car...   221   9e-61
gb|KVH91932.1| Agenet-like domain-containing protein, partial [C...   198   1e-52
ref|XP_023755699.1| uncharacterized protein LOC111904150 [Lactuc...   189   1e-49
ref|XP_022029598.1| uncharacterized protein LOC110930577 isoform...   178   9e-46
gb|OTG32540.1| putative agenet-like domain, Agenet domain, plant...   177   1e-45
ref|XP_022029595.1| uncharacterized protein LOC110930577 isoform...   177   1e-45
ref|XP_022022783.1| uncharacterized protein LOC110922895 isoform...   150   2e-36
ref|XP_022022782.1| uncharacterized protein LOC110922895 isoform...   150   2e-36
ref|XP_022022781.1| uncharacterized protein LOC110922895 isoform...   150   2e-36
ref|XP_022022780.1| uncharacterized protein LOC110922895 isoform...   150   2e-36
ref|XP_017973233.1| PREDICTED: serine-rich adhesin for platelets...   125   1e-27
ref|XP_021281568.1| uncharacterized protein LOC110414569 isoform...   124   3e-27
ref|XP_021281560.1| uncharacterized protein LOC110414569 isoform...   124   3e-27
emb|CDP09978.1| unnamed protein product [Coffea canephora]            124   3e-27
gb|EOY24312.1| G2484-1 protein, putative isoform 4 [Theobroma ca...   124   4e-27

>ref|XP_022017923.1| uncharacterized protein LOC110917784 isoform X3 [Helianthus annuus]
          Length = 1309

 Score =  370 bits (950), Expect = e-114
 Identities = 216/403 (53%), Positives = 260/403 (64%), Gaps = 52/403 (12%)
 Frame = -2

Query: 1057 ETDDQLKSPILGVSLVHQDNKETMENDG---------ILQVESAVASGLRNPSFIVRNEA 905
            E ++Q KSPI GVSLVHQ +KE ++  G         I QVE  VAS  +NPS +V  EA
Sbjct: 17   ENEEQPKSPIFGVSLVHQGSKENIQVGGPSEGNSQNEISQVECVVASDSKNPSALVEKEA 76

Query: 904  STDLVEKVVLGIDGHCVKPVESRNTSQSEQEPESNSGGQECIKRLESL-CELSEKNGNNS 728
            S DL EK +  ID HCV PVES N SQ E+EP + +  Q+C ++LE+   +L EK  NNS
Sbjct: 77   SPDLAEKAIHEIDAHCVTPVESCNASQVEREPGTKNDSQDCSEKLETNPSDLLEKTVNNS 136

Query: 727  QARGLHDPKESMVEVNKTIEHQTSADVVRPSDFKLKGKQ---EVTKDVRHGI-------A 578
            Q   L DPKE+++EVNK+ EHQT  D  RPSD    G     +   DV++         A
Sbjct: 137  QTGCLQDPKETVIEVNKSNEHQTFVDGSRPSDDVADGSSLSCKTADDVQNKALSVSVDNA 196

Query: 577  TQEGKNFSFEVDASAGLAQDGKGFQSYPTFQVSNLPKIL-----DSSSSHLDATKLHEVP 413
            T E K+ +FEV+  + L QDGKG  SYP FQVSNLPKI      DSSS+  D  K HEV 
Sbjct: 197  TPEEKDLTFEVNKRSDLGQDGKGSLSYPAFQVSNLPKITEGPSKDSSSNQRDPKKFHEVS 256

Query: 412  HSPQNTSGLTAQTGVKGKAERKPRRKSVPKENARKS----------LSLTSPATI----- 278
             SPQN SGLT+Q G+KGK+ERKPRRKSV KENARK           + LTSPAT      
Sbjct: 257  VSPQNPSGLTSQIGIKGKSERKPRRKSVGKENARKGTVPARGEVSPVLLTSPATSQVTRF 316

Query: 277  ------------GFLSSAKLPDSNNTPSIFQQPFTDSQQVQLRAQILVYGSVISGSPPEE 134
                        G L S K+PD NN+ SI QQPFTD+QQVQLRAQILVYGS+ISGSPPEE
Sbjct: 317  EGVKSRENATKPGVLPSPKIPDLNNSTSILQQPFTDNQQVQLRAQILVYGSLISGSPPEE 376

Query: 133  PHMLAAFGQSDGGRIWEGVWRAYLERLHVQKAQAKSNIPLQTR 5
             +M+AAFGQ DGGR WEGVWRA +ERLH+QK+Q KS+ P+Q+R
Sbjct: 377  TYMIAAFGQPDGGRTWEGVWRACVERLHIQKSQPKSSTPIQSR 419


>ref|XP_022017922.1| uncharacterized protein LOC110917784 isoform X2 [Helianthus annuus]
          Length = 1756

 Score =  370 bits (950), Expect = e-113
 Identities = 216/403 (53%), Positives = 260/403 (64%), Gaps = 52/403 (12%)
 Frame = -2

Query: 1057 ETDDQLKSPILGVSLVHQDNKETMENDG---------ILQVESAVASGLRNPSFIVRNEA 905
            E ++Q KSPI GVSLVHQ +KE ++  G         I QVE  VAS  +NPS +V  EA
Sbjct: 464  ENEEQPKSPIFGVSLVHQGSKENIQVGGPSEGNSQNEISQVECVVASDSKNPSALVEKEA 523

Query: 904  STDLVEKVVLGIDGHCVKPVESRNTSQSEQEPESNSGGQECIKRLESL-CELSEKNGNNS 728
            S DL EK +  ID HCV PVES N SQ E+EP + +  Q+C ++LE+   +L EK  NNS
Sbjct: 524  SPDLAEKAIHEIDAHCVTPVESCNASQVEREPGTKNDSQDCSEKLETNPSDLLEKTVNNS 583

Query: 727  QARGLHDPKESMVEVNKTIEHQTSADVVRPSDFKLKGKQ---EVTKDVRHGI-------A 578
            Q   L DPKE+++EVNK+ EHQT  D  RPSD    G     +   DV++         A
Sbjct: 584  QTGCLQDPKETVIEVNKSNEHQTFVDGSRPSDDVADGSSLSCKTADDVQNKALSVSVDNA 643

Query: 577  TQEGKNFSFEVDASAGLAQDGKGFQSYPTFQVSNLPKIL-----DSSSSHLDATKLHEVP 413
            T E K+ +FEV+  + L QDGKG  SYP FQVSNLPKI      DSSS+  D  K HEV 
Sbjct: 644  TPEEKDLTFEVNKRSDLGQDGKGSLSYPAFQVSNLPKITEGPSKDSSSNQRDPKKFHEVS 703

Query: 412  HSPQNTSGLTAQTGVKGKAERKPRRKSVPKENARKS----------LSLTSPATI----- 278
             SPQN SGLT+Q G+KGK+ERKPRRKSV KENARK           + LTSPAT      
Sbjct: 704  VSPQNPSGLTSQIGIKGKSERKPRRKSVGKENARKGTVPARGEVSPVLLTSPATSQVTRF 763

Query: 277  ------------GFLSSAKLPDSNNTPSIFQQPFTDSQQVQLRAQILVYGSVISGSPPEE 134
                        G L S K+PD NN+ SI QQPFTD+QQVQLRAQILVYGS+ISGSPPEE
Sbjct: 764  EGVKSRENATKPGVLPSPKIPDLNNSTSILQQPFTDNQQVQLRAQILVYGSLISGSPPEE 823

Query: 133  PHMLAAFGQSDGGRIWEGVWRAYLERLHVQKAQAKSNIPLQTR 5
             +M+AAFGQ DGGR WEGVWRA +ERLH+QK+Q KS+ P+Q+R
Sbjct: 824  TYMIAAFGQPDGGRTWEGVWRACVERLHIQKSQPKSSTPIQSR 866


>ref|XP_022017920.1| uncharacterized protein LOC110917784 isoform X1 [Helianthus annuus]
 ref|XP_022017921.1| uncharacterized protein LOC110917784 isoform X1 [Helianthus annuus]
          Length = 1768

 Score =  370 bits (950), Expect = e-112
 Identities = 216/403 (53%), Positives = 260/403 (64%), Gaps = 52/403 (12%)
 Frame = -2

Query: 1057 ETDDQLKSPILGVSLVHQDNKETMENDG---------ILQVESAVASGLRNPSFIVRNEA 905
            E ++Q KSPI GVSLVHQ +KE ++  G         I QVE  VAS  +NPS +V  EA
Sbjct: 476  ENEEQPKSPIFGVSLVHQGSKENIQVGGPSEGNSQNEISQVECVVASDSKNPSALVEKEA 535

Query: 904  STDLVEKVVLGIDGHCVKPVESRNTSQSEQEPESNSGGQECIKRLESL-CELSEKNGNNS 728
            S DL EK +  ID HCV PVES N SQ E+EP + +  Q+C ++LE+   +L EK  NNS
Sbjct: 536  SPDLAEKAIHEIDAHCVTPVESCNASQVEREPGTKNDSQDCSEKLETNPSDLLEKTVNNS 595

Query: 727  QARGLHDPKESMVEVNKTIEHQTSADVVRPSDFKLKGKQ---EVTKDVRHGI-------A 578
            Q   L DPKE+++EVNK+ EHQT  D  RPSD    G     +   DV++         A
Sbjct: 596  QTGCLQDPKETVIEVNKSNEHQTFVDGSRPSDDVADGSSLSCKTADDVQNKALSVSVDNA 655

Query: 577  TQEGKNFSFEVDASAGLAQDGKGFQSYPTFQVSNLPKIL-----DSSSSHLDATKLHEVP 413
            T E K+ +FEV+  + L QDGKG  SYP FQVSNLPKI      DSSS+  D  K HEV 
Sbjct: 656  TPEEKDLTFEVNKRSDLGQDGKGSLSYPAFQVSNLPKITEGPSKDSSSNQRDPKKFHEVS 715

Query: 412  HSPQNTSGLTAQTGVKGKAERKPRRKSVPKENARKS----------LSLTSPATI----- 278
             SPQN SGLT+Q G+KGK+ERKPRRKSV KENARK           + LTSPAT      
Sbjct: 716  VSPQNPSGLTSQIGIKGKSERKPRRKSVGKENARKGTVPARGEVSPVLLTSPATSQVTRF 775

Query: 277  ------------GFLSSAKLPDSNNTPSIFQQPFTDSQQVQLRAQILVYGSVISGSPPEE 134
                        G L S K+PD NN+ SI QQPFTD+QQVQLRAQILVYGS+ISGSPPEE
Sbjct: 776  EGVKSRENATKPGVLPSPKIPDLNNSTSILQQPFTDNQQVQLRAQILVYGSLISGSPPEE 835

Query: 133  PHMLAAFGQSDGGRIWEGVWRAYLERLHVQKAQAKSNIPLQTR 5
             +M+AAFGQ DGGR WEGVWRA +ERLH+QK+Q KS+ P+Q+R
Sbjct: 836  TYMIAAFGQPDGGRTWEGVWRACVERLHIQKSQPKSSTPIQSR 878


>gb|OTF91434.1| putative agenet-like domain, Agenet domain, plant type,
            Glutamine-Leucine-Glutamine, QLQ [Helianthus annuus]
          Length = 1785

 Score =  370 bits (950), Expect = e-112
 Identities = 216/403 (53%), Positives = 260/403 (64%), Gaps = 52/403 (12%)
 Frame = -2

Query: 1057 ETDDQLKSPILGVSLVHQDNKETMENDG---------ILQVESAVASGLRNPSFIVRNEA 905
            E ++Q KSPI GVSLVHQ +KE ++  G         I QVE  VAS  +NPS +V  EA
Sbjct: 493  ENEEQPKSPIFGVSLVHQGSKENIQVGGPSEGNSQNEISQVECVVASDSKNPSALVEKEA 552

Query: 904  STDLVEKVVLGIDGHCVKPVESRNTSQSEQEPESNSGGQECIKRLESL-CELSEKNGNNS 728
            S DL EK +  ID HCV PVES N SQ E+EP + +  Q+C ++LE+   +L EK  NNS
Sbjct: 553  SPDLAEKAIHEIDAHCVTPVESCNASQVEREPGTKNDSQDCSEKLETNPSDLLEKTVNNS 612

Query: 727  QARGLHDPKESMVEVNKTIEHQTSADVVRPSDFKLKGKQ---EVTKDVRHGI-------A 578
            Q   L DPKE+++EVNK+ EHQT  D  RPSD    G     +   DV++         A
Sbjct: 613  QTGCLQDPKETVIEVNKSNEHQTFVDGSRPSDDVADGSSLSCKTADDVQNKALSVSVDNA 672

Query: 577  TQEGKNFSFEVDASAGLAQDGKGFQSYPTFQVSNLPKIL-----DSSSSHLDATKLHEVP 413
            T E K+ +FEV+  + L QDGKG  SYP FQVSNLPKI      DSSS+  D  K HEV 
Sbjct: 673  TPEEKDLTFEVNKRSDLGQDGKGSLSYPAFQVSNLPKITEGPSKDSSSNQRDPKKFHEVS 732

Query: 412  HSPQNTSGLTAQTGVKGKAERKPRRKSVPKENARKS----------LSLTSPATI----- 278
             SPQN SGLT+Q G+KGK+ERKPRRKSV KENARK           + LTSPAT      
Sbjct: 733  VSPQNPSGLTSQIGIKGKSERKPRRKSVGKENARKGTVPARGEVSPVLLTSPATSQVTRF 792

Query: 277  ------------GFLSSAKLPDSNNTPSIFQQPFTDSQQVQLRAQILVYGSVISGSPPEE 134
                        G L S K+PD NN+ SI QQPFTD+QQVQLRAQILVYGS+ISGSPPEE
Sbjct: 793  EGVKSRENATKPGVLPSPKIPDLNNSTSILQQPFTDNQQVQLRAQILVYGSLISGSPPEE 852

Query: 133  PHMLAAFGQSDGGRIWEGVWRAYLERLHVQKAQAKSNIPLQTR 5
             +M+AAFGQ DGGR WEGVWRA +ERLH+QK+Q KS+ P+Q+R
Sbjct: 853  TYMIAAFGQPDGGRTWEGVWRACVERLHIQKSQPKSSTPIQSR 895


>gb|KVH89807.1| Agenet-like domain-containing protein [Cynara cardunculus var.
            scolymus]
          Length = 2206

 Score =  332 bits (851), Expect = 3e-99
 Identities = 230/477 (48%), Positives = 260/477 (54%), Gaps = 123/477 (25%)
 Frame = -2

Query: 1063 NKETDDQLKSPILGVSLVHQDNKETME---------NDGILQVESAVASGLRNPSFIVRN 911
            N ETDDQ KSPI GVSLVHQD  E M+          + I QVES VAS   NPS     
Sbjct: 589  NSETDDQPKSPIFGVSLVHQDYNEKMKVGFSCEGNSQNEIPQVESVVASASMNPSSFDEK 648

Query: 910  EASTDLVEKVVLGIDGHCVKPVESRNTSQSEQEPESNSGGQECIKRLE-SLCELSEKNGN 734
            EAS DLV+KVV     H V PVE  N SQ EQE  +N+ GQEC K LE S    S K G+
Sbjct: 649  EASPDLVKKVV-----HRVTPVECCNASQIEQEL-TNTDGQECFKILETSPFASSAKGGS 702

Query: 733  NSQARGLHDPKESMVEVN--------------KTIEHQTSADVVRPSDF----------- 629
            NS+A GL +PKESM+E +              K+ EHQT   V R S+            
Sbjct: 703  NSEAGGLEEPKESMIEDHHLNTISTPVLGFAVKSDEHQTFVGVARSSECDADHIEPDGGS 762

Query: 628  ----------------------KLKGKQEVTKDVRHG---------------------IA 578
                                    K KQE TKDVRHG                      A
Sbjct: 763  FSSLDKPHVVSPTCVSSMELSESTKDKQEATKDVRHGGVLPSKVADDVEDNIRSVPSSNA 822

Query: 577  TQEGKNFSFEVDASAGLAQDGKGFQSYPTFQVSNLPKIL-----DSSSSHLDATKLHEVP 413
            TQE  NF+FEV+ SAGL Q  KGF SYPTFQVS  PKI+     DSS+S +DA KLH   
Sbjct: 823  TQEETNFTFEVNKSAGLGQADKGFPSYPTFQVSVSPKIMEGPPTDSSTSQVDAAKLHGSS 882

Query: 412  HSPQNTSGLTAQTGVKGKAERKPRRKSVPKENARK-------------------SLSLTS 290
             +PQN S +T Q GVK  +ERK RRKSV KENARK                   S+SLTS
Sbjct: 883  LTPQNLSCMTPQIGVK--SERKTRRKSVGKENARKGNHLKETPPARDSVRVEKSSVSLTS 940

Query: 289  PATI---------------------GFLSSAKLPDSNNTPSIFQQPFTDSQQVQLRAQIL 173
            PAT                      G     KLPD NN+ SIFQQPFTD+QQVQLRAQIL
Sbjct: 941  PATGHVIQFEEVKSNDIERGGTKPGGIFPLPKLPDLNNSTSIFQQPFTDNQQVQLRAQIL 1000

Query: 172  VYGSVISGSPPEEPHMLAAFGQSDGGRIWEGVWRAYLERLHVQKAQAKSNIPLQTRS 2
            VYGS+ISG+ PEE HM+AAFGQSDGGR WEGVWRA +ERLHVQKAQA S  P+++RS
Sbjct: 1001 VYGSLISGTTPEESHMIAAFGQSDGGRTWEGVWRACVERLHVQKAQANSATPMKSRS 1057


>gb|KVH95523.1| Agenet-like domain-containing protein [Cynara cardunculus var.
            scolymus]
          Length = 2260

 Score =  221 bits (564), Expect = 9e-61
 Identities = 173/468 (36%), Positives = 226/468 (48%), Gaps = 122/468 (26%)
 Frame = -2

Query: 1039 KSPILGVSLVHQDNKETME---------NDGILQVESAVASGLRNPSFIVRNEASTDLVE 887
            KSPI GVS VH DN++  E          D   +V S +      P  +V   ASTD   
Sbjct: 572  KSPIHGVSSVHHDNEKKEEAGFFGDGTSGDESPKVASMIDCAAVEPLPVVEKSASTDRDG 631

Query: 886  KVVLGIDGHCVKPVESRNTSQSEQEPESNSGGQECIKR--LESL-CELSEKNGNNSQARG 716
             VV  + GHC +PV++ +   SEQ  E+N  G  C     + SL    S K G+  +A G
Sbjct: 632  NVVHQMAGHCARPVDNNHAIMSEQTQEANPDGLGCSTTSVMSSLPFNSSAKVGDIGEAGG 691

Query: 715  LHDPKESM------VEVNKTIEHQTSADVVRPSDFKLKGKQ------------------- 611
            L DPKES+        ++   E Q S +VV PS+     K+                   
Sbjct: 692  LQDPKESISGYHRAAPLSLAAEQQISTEVVPPSECDASHKRDNEDSSSSVDKTQYVSPSN 751

Query: 610  ----------------------------EVTKDVR-------HGIATQEGKNFSFEVDAS 536
                                        EVT+D++         I  +  K+ +FEV  +
Sbjct: 752  TNNTELLQSTKVTHEMAEGAIYENASLLEVTEDLKGKMQSVSSNIDIRRDKSSTFEVTTT 811

Query: 535  AGLAQDGKGFQSYPTFQVSNLP-----KILDSSSSHLDATKLHEVPHS-PQNTSGLTAQT 374
            A L Q GKG QS+P  Q  N+        ++SSSS LD  K  EV H+ PQN    T Q 
Sbjct: 812  AALEQTGKGQQSFPAIQACNMSMNMEGSSINSSSSILDPQKPEEVRHAIPQNPVSATIQV 871

Query: 373  GVKGKAERKPRRKSVPKENARKS------------------LSLTSPAT---IGF----- 272
            G KG +ERK RRKSV KE+A+ S                  + LTS AT   I F     
Sbjct: 872  GSKGNSERKTRRKSVGKESAKNSNRLKETTPRQSGRIEKSSVRLTSSATGHAIQFEELKP 931

Query: 271  ----------------LSSAKLPDSNNTPSIFQQPFTDSQQVQLRAQILVYGSVISGSPP 140
                            + ++ LPD NN+ +IF+QPFTD+QQVQLRAQILVYGS+ISGSPP
Sbjct: 932  QEKIECSIKKPSGILPIPTSNLPDLNNSTAIFKQPFTDNQQVQLRAQILVYGSLISGSPP 991

Query: 139  EEPHMLAAFGQSDGGR-IWEGVWRAYLERLHVQKAQAKS-NIPLQTRS 2
            EE HM+AAFGQSDGGR  WEG W A +ER+H +K+QA + +  +Q+RS
Sbjct: 992  EEAHMIAAFGQSDGGRETWEGAWHACVERVHGKKSQANNPSTLMQSRS 1039


>gb|KVH91932.1| Agenet-like domain-containing protein, partial [Cynara cardunculus
            var. scolymus]
          Length = 1819

 Score =  198 bits (503), Expect = 1e-52
 Identities = 156/420 (37%), Positives = 200/420 (47%), Gaps = 113/420 (26%)
 Frame = -2

Query: 922  IVRNEASTDLVEKVVLGIDGHCVKPVESRNTSQSEQEPESNSGGQECIKRLESL---CEL 752
            +   +A +D  EKVV  +DG C  PV + N S SEQ  E N G  +C K LE++   CE 
Sbjct: 686  VFEEDAVSDDAEKVVHKLDGDCDPPVGNCNASPSEQIAEDNPGDLKCSKTLETIPLPCES 745

Query: 751  SEKNGNNSQARGLHDPKESM---------VEVNKTIEHQTSADVVRPS------------ 635
            S + GN+++A GL  P+ES+         + VN + E Q S DV  PS            
Sbjct: 746  SAEGGNDAEAGGLLTPRESIAGDGVEPQNLAVNMSDEQQASVDVAGPSERDANHVVQNGV 805

Query: 634  --------------DFK-------LKGKQEVTKDVRH--------------------GIA 578
                          D K          K EV KD R+                      A
Sbjct: 806  GSTLLDKPQIFSFSDIKSIELSQTASDKHEVPKDTRYVNAPLSDASNDKEGLQSVSSSSA 865

Query: 577  TQEGKNFSFEVDASAGLAQDGKGFQSYPTFQVSNLPKI--LDSSSSHLDATKLHEVP-HS 407
            T+E K+F+FEV ASAG  Q G   +S     +S   K+  + S +S LD  KLHE    S
Sbjct: 866  TKEEKSFTFEVIASAGPGQTGIQNRS-----LSKNTKVSSVSSHASLLDPNKLHEDSLFS 920

Query: 406  PQNTSGLTAQTGVKGKAERKPRRKSVPKENARKSLSLT------------------SPAT 281
             Q  S    + G  G +ERKPRRKS  KE A+K  +L                   +P  
Sbjct: 921  QQTPSSAAIEAGSNGNSERKPRRKSAGKETAKKGNNLKETTPRRRSGRVEKSPGVLNPPA 980

Query: 280  IGFLSSAK-------------------------LPDSNNTPSIFQQPFTDSQQVQLRAQI 176
            IG ++S +                         LPD NN+ S+FQQ FTD+QQVQLRAQI
Sbjct: 981  IGHVTSVEGLKPSESVECSNQRPGDIPPILTSNLPDLNNSTSMFQQSFTDTQQVQLRAQI 1040

Query: 175  LVYGSVISGSPPEEPHMLAAFGQSDGG-RIWEGVWRAYLERLHVQKAQAKS-NIPLQTRS 2
            LVYGS+ISG  PEE HM+AAFGQSDGG R WE  W A LER+  +K+QA + + P+Q RS
Sbjct: 1041 LVYGSLISGMAPEESHMIAAFGQSDGGRRAWEAAWHACLERVRGRKSQANNPDTPMQPRS 1100


>ref|XP_023755699.1| uncharacterized protein LOC111904150 [Lactuca sativa]
 ref|XP_023755706.1| uncharacterized protein LOC111904150 [Lactuca sativa]
 gb|PLY99044.1| hypothetical protein LSAT_6X90740 [Lactuca sativa]
          Length = 2164

 Score =  189 bits (480), Expect = 1e-49
 Identities = 154/424 (36%), Positives = 197/424 (46%), Gaps = 74/424 (17%)
 Frame = -2

Query: 1051 DDQLKSPILGVSLVHQDNKETMENDGILQVESAVASGLRNPSFIVRNEASTDLVEKVVLG 872
            +DQ +SPILGVSL+H DNKE  E  G  Q  +       +P  +V+ ++           
Sbjct: 814  EDQPRSPILGVSLLHDDNKEKAEVGGNCQKGAPQVDS--SPLPLVKKDS----------- 860

Query: 871  IDGHCVKPVESRNTSQSEQEPESNSGGQECIKRLESL---CELSEKNGNNSQARGLHDPK 701
             D H   PV S +   SEQ  + N    +C K LE+     +LSE+  NN+ A GL DPK
Sbjct: 861  -DSHDDLPVGSSDVGPSEQTVDVNQSDLKCSKPLETTPLSYDLSEE-ANNADAGGLLDPK 918

Query: 700  ESM----------VEVNKTIEHQTSADVVRPSDFKLKG---------------------- 617
            ESM          + VN + E Q S  V  PS  + +G                      
Sbjct: 919  ESMAGEGGVEPLNLTVNISNEQQASVSVAVPSVHETEGGSASLDKDKPQIFSFLDTKSIE 978

Query: 616  -------KQEVTK--------------------DVRHGIATQEGKNFSFEVDASAGLAQD 518
                   K EVTK                     V    A +EGK+F+FEV    G+   
Sbjct: 979  PSQSAKDKHEVTKGTMENTPLSKISNEGYGGLLSVSSSSARKEGKSFTFEVTGQTGI--- 1035

Query: 517  GKGFQSYPTFQVSNLPKILDSSSSHLDATKLHEVPHSPQNTSGLTAQTGVKGKAERKPRR 338
             +         VS+   +LDS+  H D         +PQ  S    ++G K  +ERKPRR
Sbjct: 1036 -QNIDFSKVSPVSSSGGLLDSNKPHQDTLV------TPQTPSVAPVESGAKKTSERKPRR 1088

Query: 337  KSVPKENARKS----------LSLTSPATIGFLSSAKLPDSNNTPSIFQQPFTDSQQVQL 188
            KSV KE A+K            S T  +    ++S   PD N   SI  Q FTD QQVQL
Sbjct: 1089 KSVGKETAKKGNHSKETTPTRRSGTEKSPSPLITSENQPDLNIPTSIPHQSFTDIQQVQL 1148

Query: 187  RAQILVYGSVISGSPPEEPHMLAAFGQSDGGR-IWEGVWRAYLERLHVQKAQAKSNI-PL 14
            RAQILVYGS+ISG  PEEPHM+AAFGQSD GR  WE  W A +ER+H  K Q  + + PL
Sbjct: 1149 RAQILVYGSLISGMSPEEPHMIAAFGQSDPGRKAWEAAWHACIERVHGHKTQVNNPLTPL 1208

Query: 13   QTRS 2
            Q+RS
Sbjct: 1209 QSRS 1212


>ref|XP_022029598.1| uncharacterized protein LOC110930577 isoform X2 [Helianthus annuus]
          Length = 1514

 Score =  178 bits (451), Expect = 9e-46
 Identities = 141/381 (37%), Positives = 183/381 (48%), Gaps = 35/381 (9%)
 Frame = -2

Query: 1054 TDDQLKSPILGVSLVHQDNKETMENDGILQVESAVASGLRNPSFIVRNEASTDLVEKVVL 875
            T+DQ +SPILGVSL+  DNKE +E      V S      +  S  V     +D  EK+V 
Sbjct: 311  TEDQPRSPILGVSLLSDDNKEKVE------VGSTSEGAFQKQSPEVEKGTVSDGAEKIV- 363

Query: 874  GIDGHCVKPVESRNTSQSEQEPESNSGGQECIKRLESLCELSEKNGNNSQARGLHDPKES 695
                         N  QSEQ  +  SG                          L DPKES
Sbjct: 364  -----------HINAGQSEQTGDDESG-------------------------SLPDPKES 387

Query: 694  MVEVNKTIEHQTSADVVRPSDFKLKGKQEVTKDVRHGI--ATQEGKNFSFEVDASAGLAQ 521
            M         +  A+ +  +D K     +  KD +H +   T+E KNF+FEV+AS+    
Sbjct: 388  MT-------GEDGAEPLNLADAKSVELSQTAKD-KHDVTKGTKEDKNFTFEVNASSSQN- 438

Query: 520  DGKGFQSYPTFQVSNLPKILDSSSSHLDATKLHEVPHSPQNTSGLTAQTGVKGKAERKPR 341
                  ++ T  VS+    + S  S LD  KLHE     Q+    +A T  KG +ERKPR
Sbjct: 439  -----INFSTNTVSS----VSSHGSQLDPNKLHEDSPVTQSPITTSAATRAKGTSERKPR 489

Query: 340  RKSVPKENARKSLSL-------------------TSPAT-------------IGFLSSAK 257
            +KSV  E A++S +L                   TSP T             +  +S++ 
Sbjct: 490  KKSVGTETAKQSNNLKEKTPARRSRKTEKSPPLQTSPTTGHVTPVSNPKPGGMTPISTST 549

Query: 256  LPDSNNTPSIFQQPFTDSQQVQLRAQILVYGSVISGSPPEEPHMLAAFGQSDGG-RIWEG 80
            LP  NN+ S+F   FTD+QQVQLRAQILVYGS++SG PPEEPHM+AAFGQSDGG R WE 
Sbjct: 550  LPILNNSTSVFHHSFTDNQQVQLRAQILVYGSILSGVPPEEPHMIAAFGQSDGGRRAWEA 609

Query: 79   VWRAYLERLHVQKAQAKSNIP 17
             W AYLER+   KAQ  +  P
Sbjct: 610  TWHAYLERVRGPKAQPNNPGP 630


>gb|OTG32540.1| putative agenet-like domain, Agenet domain, plant type [Helianthus
            annuus]
          Length = 1543

 Score =  177 bits (450), Expect = 1e-45
 Identities = 138/393 (35%), Positives = 192/393 (48%), Gaps = 47/393 (11%)
 Frame = -2

Query: 1054 TDDQLKSPILGVSLVHQDNKETMENDGILQVESAVASGLRNPSFIVRNEASTDLVEKVVL 875
            T+DQ +SPILGVSL+  DNKE +E      V S      +  S  V     +D  EK+V 
Sbjct: 306  TEDQPRSPILGVSLLSDDNKEKVE------VGSTSEGAFQKQSPEVEKGTVSDGAEKIV- 358

Query: 874  GIDGHCVKPVESRNTSQSEQ----EPESNSGGQECIKRL----------ESLCELSEKNG 737
                  +   +S  T   E     +P+ +  G++  + L          ++  ++   +G
Sbjct: 359  -----HINAGQSEQTGDDESGSLPDPKESMTGEDGAEPLNLAVNMPDQQQASVDIDSPSG 413

Query: 736  NNSQARGLHDPKESMVEVNKTIEHQTSADVVRPSDFKLKGKQEVTKDVRHGIATQEGKNF 557
             ++    L  P+ S+ +       QT+ D           K +VTK       T+E KNF
Sbjct: 414  CDTNPSSLDKPQTSLSDAKSVELSQTAKD-----------KHDVTK------GTKEDKNF 456

Query: 556  SFEVDASAGLAQDGKGFQSYPTFQVSNLPKILDSSSSHLDATKLHEVPHSPQNTSGLTAQ 377
            +FEV+AS+          ++ T  VS+    + S  S LD  KLHE     Q+    +A 
Sbjct: 457  TFEVNASSSQN------INFSTNTVSS----VSSHGSQLDPNKLHEDSPVTQSPITTSAA 506

Query: 376  TGVKGKAERKPRRKSVPKENARKSLSL-------------------TSPAT--------- 281
            T  KG +ERKPR+KSV  E A++S +L                   TSP T         
Sbjct: 507  TRAKGTSERKPRKKSVGTETAKQSNNLKEKTPARRSRKTEKSPPLQTSPTTGHVTPVSNP 566

Query: 280  ----IGFLSSAKLPDSNNTPSIFQQPFTDSQQVQLRAQILVYGSVISGSPPEEPHMLAAF 113
                +  +S++ LP  NN+ S+F   FTD+QQVQLRAQILVYGS++SG PPEEPHM+AAF
Sbjct: 567  KPGGMTPISTSTLPILNNSTSVFHHSFTDNQQVQLRAQILVYGSILSGVPPEEPHMIAAF 626

Query: 112  GQSDGG-RIWEGVWRAYLERLHVQKAQAKSNIP 17
            GQSDGG R WE  W AYLER+   KAQ  +  P
Sbjct: 627  GQSDGGRRAWEATWHAYLERVRGPKAQPNNPGP 659


>ref|XP_022029595.1| uncharacterized protein LOC110930577 isoform X1 [Helianthus annuus]
 ref|XP_022029596.1| uncharacterized protein LOC110930577 isoform X1 [Helianthus annuus]
 ref|XP_022029597.1| uncharacterized protein LOC110930577 isoform X1 [Helianthus annuus]
          Length = 1548

 Score =  177 bits (450), Expect = 1e-45
 Identities = 138/393 (35%), Positives = 192/393 (48%), Gaps = 47/393 (11%)
 Frame = -2

Query: 1054 TDDQLKSPILGVSLVHQDNKETMENDGILQVESAVASGLRNPSFIVRNEASTDLVEKVVL 875
            T+DQ +SPILGVSL+  DNKE +E      V S      +  S  V     +D  EK+V 
Sbjct: 311  TEDQPRSPILGVSLLSDDNKEKVE------VGSTSEGAFQKQSPEVEKGTVSDGAEKIV- 363

Query: 874  GIDGHCVKPVESRNTSQSEQ----EPESNSGGQECIKRL----------ESLCELSEKNG 737
                  +   +S  T   E     +P+ +  G++  + L          ++  ++   +G
Sbjct: 364  -----HINAGQSEQTGDDESGSLPDPKESMTGEDGAEPLNLAVNMPDQQQASVDIDSPSG 418

Query: 736  NNSQARGLHDPKESMVEVNKTIEHQTSADVVRPSDFKLKGKQEVTKDVRHGIATQEGKNF 557
             ++    L  P+ S+ +       QT+ D           K +VTK       T+E KNF
Sbjct: 419  CDTNPSSLDKPQTSLSDAKSVELSQTAKD-----------KHDVTK------GTKEDKNF 461

Query: 556  SFEVDASAGLAQDGKGFQSYPTFQVSNLPKILDSSSSHLDATKLHEVPHSPQNTSGLTAQ 377
            +FEV+AS+          ++ T  VS+    + S  S LD  KLHE     Q+    +A 
Sbjct: 462  TFEVNASSSQN------INFSTNTVSS----VSSHGSQLDPNKLHEDSPVTQSPITTSAA 511

Query: 376  TGVKGKAERKPRRKSVPKENARKSLSL-------------------TSPAT--------- 281
            T  KG +ERKPR+KSV  E A++S +L                   TSP T         
Sbjct: 512  TRAKGTSERKPRKKSVGTETAKQSNNLKEKTPARRSRKTEKSPPLQTSPTTGHVTPVSNP 571

Query: 280  ----IGFLSSAKLPDSNNTPSIFQQPFTDSQQVQLRAQILVYGSVISGSPPEEPHMLAAF 113
                +  +S++ LP  NN+ S+F   FTD+QQVQLRAQILVYGS++SG PPEEPHM+AAF
Sbjct: 572  KPGGMTPISTSTLPILNNSTSVFHHSFTDNQQVQLRAQILVYGSILSGVPPEEPHMIAAF 631

Query: 112  GQSDGG-RIWEGVWRAYLERLHVQKAQAKSNIP 17
            GQSDGG R WE  W AYLER+   KAQ  +  P
Sbjct: 632  GQSDGGRRAWEATWHAYLERVRGPKAQPNNPGP 664


>ref|XP_022022783.1| uncharacterized protein LOC110922895 isoform X4 [Helianthus annuus]
 ref|XP_022022784.1| uncharacterized protein LOC110922895 isoform X4 [Helianthus annuus]
          Length = 2400

 Score =  150 bits (380), Expect = 2e-36
 Identities = 130/398 (32%), Positives = 188/398 (47%), Gaps = 49/398 (12%)
 Frame = -2

Query: 1072 ESDNKETDDQLKSPILGVSLVHQDNKETMENDGILQVESAVASGLRNPSFIVRNEASTDL 893
            +S  +  +D   SP+L  +         +E + +   E+   + L  P  ++ +E     
Sbjct: 1018 DSHMRNEEDSSVSPLLYTNNTEPSQSARVEPNDVAINEN---NSLPVPDVVLTSEGDDSH 1074

Query: 892  VEKVVLGIDG-HCVKPVESRNTSQSEQEPESNSGGQECIKRL-----ESLCELSEKNGNN 731
             E   + +D    V P+ + N ++  Q    NSG    I+       E +  +SE + N+
Sbjct: 1075 KEDSSVSMDKLQSVSPLLNTNNTEPSQSTRVNSGDGAIIENTALPEAEDVVRISECDDNH 1134

Query: 730  SQ-ARGLHDPKESMVEVNKTIEHQTSADVVRPSDFKLK---GKQEVTKDVR---HGIATQ 572
             +   G       +++ N T   Q++   V PS+  +       E T+DV+     I++ 
Sbjct: 1135 VRHEEGKLQSVSPLLDTNNTELSQSTR--VGPSEGAINETASLPEATEDVQGKEQSISSN 1192

Query: 571  EG----KNFSFEVDASAGLAQDGKGFQSYPTFQVSNLPKILDSSSSHLDATKLHEVPHSP 404
                   +F+FEV  +AGL +  KG Q +P  Q SNL    +     +D+ K  E P   
Sbjct: 1193 SDMKTDNSFTFEVSGTAGLTETCKGLQLFPASQSSNLSTTTEVPC--IDSNKPIEDPAVT 1250

Query: 403  QNTSGLTAQTGVKGKAERKPRRKSVPKENARKS----------------LSLTSP----- 287
              T G  + T + G  ERK RRKSV KE+A KS                L++T+P     
Sbjct: 1251 LQTPG--SGTVITGGKERKTRRKSVGKESATKSKETTTIRQGRVKKSSPLTVTTPQSPPA 1308

Query: 286  ATIGFL----------SSAKLPDSNNTPSIFQQPFTDSQQVQLRAQILVYGSVISGSPPE 137
            A  G             S+ LPD NN   IF QPFTD+QQVQLRAQILVYGS+ISG PPE
Sbjct: 1309 AAAGHAIQFQELGSKEKSSPLPDLNNATCIFHQPFTDNQQVQLRAQILVYGSLISGMPPE 1368

Query: 136  EPHMLAAFGQSDGGR-IWEGVWRAYLERLHVQKAQAKS 26
            EPHM+AAFGQSD  R  W+  W A L R + QK+ A +
Sbjct: 1369 EPHMIAAFGQSDAERKTWKDAWHACLNRDNGQKSHANT 1406


>ref|XP_022022782.1| uncharacterized protein LOC110922895 isoform X3 [Helianthus annuus]
          Length = 2411

 Score =  150 bits (380), Expect = 2e-36
 Identities = 130/398 (32%), Positives = 188/398 (47%), Gaps = 49/398 (12%)
 Frame = -2

Query: 1072 ESDNKETDDQLKSPILGVSLVHQDNKETMENDGILQVESAVASGLRNPSFIVRNEASTDL 893
            +S  +  +D   SP+L  +         +E + +   E+   + L  P  ++ +E     
Sbjct: 1030 DSHMRNEEDSSVSPLLYTNNTEPSQSARVEPNDVAINEN---NSLPVPDVVLTSEGDDSH 1086

Query: 892  VEKVVLGIDG-HCVKPVESRNTSQSEQEPESNSGGQECIKRL-----ESLCELSEKNGNN 731
             E   + +D    V P+ + N ++  Q    NSG    I+       E +  +SE + N+
Sbjct: 1087 KEDSSVSMDKLQSVSPLLNTNNTEPSQSTRVNSGDGAIIENTALPEAEDVVRISECDDNH 1146

Query: 730  SQ-ARGLHDPKESMVEVNKTIEHQTSADVVRPSDFKLK---GKQEVTKDVR---HGIATQ 572
             +   G       +++ N T   Q++   V PS+  +       E T+DV+     I++ 
Sbjct: 1147 VRHEEGKLQSVSPLLDTNNTELSQSTR--VGPSEGAINETASLPEATEDVQGKEQSISSN 1204

Query: 571  EG----KNFSFEVDASAGLAQDGKGFQSYPTFQVSNLPKILDSSSSHLDATKLHEVPHSP 404
                   +F+FEV  +AGL +  KG Q +P  Q SNL    +     +D+ K  E P   
Sbjct: 1205 SDMKTDNSFTFEVSGTAGLTETCKGLQLFPASQSSNLSTTTEVPC--IDSNKPIEDPAVT 1262

Query: 403  QNTSGLTAQTGVKGKAERKPRRKSVPKENARKS----------------LSLTSP----- 287
              T G  + T + G  ERK RRKSV KE+A KS                L++T+P     
Sbjct: 1263 LQTPG--SGTVITGGKERKTRRKSVGKESATKSKETTTIRQGRVKKSSPLTVTTPQSPPA 1320

Query: 286  ATIGFL----------SSAKLPDSNNTPSIFQQPFTDSQQVQLRAQILVYGSVISGSPPE 137
            A  G             S+ LPD NN   IF QPFTD+QQVQLRAQILVYGS+ISG PPE
Sbjct: 1321 AAAGHAIQFQELGSKEKSSPLPDLNNATCIFHQPFTDNQQVQLRAQILVYGSLISGMPPE 1380

Query: 136  EPHMLAAFGQSDGGR-IWEGVWRAYLERLHVQKAQAKS 26
            EPHM+AAFGQSD  R  W+  W A L R + QK+ A +
Sbjct: 1381 EPHMIAAFGQSDAERKTWKDAWHACLNRDNGQKSHANT 1418


>ref|XP_022022781.1| uncharacterized protein LOC110922895 isoform X2 [Helianthus annuus]
 gb|OTF86304.1| putative agenet-like domain, Agenet domain, plant type [Helianthus
            annuus]
          Length = 2411

 Score =  150 bits (380), Expect = 2e-36
 Identities = 130/398 (32%), Positives = 188/398 (47%), Gaps = 49/398 (12%)
 Frame = -2

Query: 1072 ESDNKETDDQLKSPILGVSLVHQDNKETMENDGILQVESAVASGLRNPSFIVRNEASTDL 893
            +S  +  +D   SP+L  +         +E + +   E+   + L  P  ++ +E     
Sbjct: 1029 DSHMRNEEDSSVSPLLYTNNTEPSQSARVEPNDVAINEN---NSLPVPDVVLTSEGDDSH 1085

Query: 892  VEKVVLGIDG-HCVKPVESRNTSQSEQEPESNSGGQECIKRL-----ESLCELSEKNGNN 731
             E   + +D    V P+ + N ++  Q    NSG    I+       E +  +SE + N+
Sbjct: 1086 KEDSSVSMDKLQSVSPLLNTNNTEPSQSTRVNSGDGAIIENTALPEAEDVVRISECDDNH 1145

Query: 730  SQ-ARGLHDPKESMVEVNKTIEHQTSADVVRPSDFKLK---GKQEVTKDVR---HGIATQ 572
             +   G       +++ N T   Q++   V PS+  +       E T+DV+     I++ 
Sbjct: 1146 VRHEEGKLQSVSPLLDTNNTELSQSTR--VGPSEGAINETASLPEATEDVQGKEQSISSN 1203

Query: 571  EG----KNFSFEVDASAGLAQDGKGFQSYPTFQVSNLPKILDSSSSHLDATKLHEVPHSP 404
                   +F+FEV  +AGL +  KG Q +P  Q SNL    +     +D+ K  E P   
Sbjct: 1204 SDMKTDNSFTFEVSGTAGLTETCKGLQLFPASQSSNLSTTTEVPC--IDSNKPIEDPAVT 1261

Query: 403  QNTSGLTAQTGVKGKAERKPRRKSVPKENARKS----------------LSLTSP----- 287
              T G  + T + G  ERK RRKSV KE+A KS                L++T+P     
Sbjct: 1262 LQTPG--SGTVITGGKERKTRRKSVGKESATKSKETTTIRQGRVKKSSPLTVTTPQSPPA 1319

Query: 286  ATIGFL----------SSAKLPDSNNTPSIFQQPFTDSQQVQLRAQILVYGSVISGSPPE 137
            A  G             S+ LPD NN   IF QPFTD+QQVQLRAQILVYGS+ISG PPE
Sbjct: 1320 AAAGHAIQFQELGSKEKSSPLPDLNNATCIFHQPFTDNQQVQLRAQILVYGSLISGMPPE 1379

Query: 136  EPHMLAAFGQSDGGR-IWEGVWRAYLERLHVQKAQAKS 26
            EPHM+AAFGQSD  R  W+  W A L R + QK+ A +
Sbjct: 1380 EPHMIAAFGQSDAERKTWKDAWHACLNRDNGQKSHANT 1417


>ref|XP_022022780.1| uncharacterized protein LOC110922895 isoform X1 [Helianthus annuus]
          Length = 2412

 Score =  150 bits (380), Expect = 2e-36
 Identities = 130/398 (32%), Positives = 188/398 (47%), Gaps = 49/398 (12%)
 Frame = -2

Query: 1072 ESDNKETDDQLKSPILGVSLVHQDNKETMENDGILQVESAVASGLRNPSFIVRNEASTDL 893
            +S  +  +D   SP+L  +         +E + +   E+   + L  P  ++ +E     
Sbjct: 1030 DSHMRNEEDSSVSPLLYTNNTEPSQSARVEPNDVAINEN---NSLPVPDVVLTSEGDDSH 1086

Query: 892  VEKVVLGIDG-HCVKPVESRNTSQSEQEPESNSGGQECIKRL-----ESLCELSEKNGNN 731
             E   + +D    V P+ + N ++  Q    NSG    I+       E +  +SE + N+
Sbjct: 1087 KEDSSVSMDKLQSVSPLLNTNNTEPSQSTRVNSGDGAIIENTALPEAEDVVRISECDDNH 1146

Query: 730  SQ-ARGLHDPKESMVEVNKTIEHQTSADVVRPSDFKLK---GKQEVTKDVR---HGIATQ 572
             +   G       +++ N T   Q++   V PS+  +       E T+DV+     I++ 
Sbjct: 1147 VRHEEGKLQSVSPLLDTNNTELSQSTR--VGPSEGAINETASLPEATEDVQGKEQSISSN 1204

Query: 571  EG----KNFSFEVDASAGLAQDGKGFQSYPTFQVSNLPKILDSSSSHLDATKLHEVPHSP 404
                   +F+FEV  +AGL +  KG Q +P  Q SNL    +     +D+ K  E P   
Sbjct: 1205 SDMKTDNSFTFEVSGTAGLTETCKGLQLFPASQSSNLSTTTEVPC--IDSNKPIEDPAVT 1262

Query: 403  QNTSGLTAQTGVKGKAERKPRRKSVPKENARKS----------------LSLTSP----- 287
              T G  + T + G  ERK RRKSV KE+A KS                L++T+P     
Sbjct: 1263 LQTPG--SGTVITGGKERKTRRKSVGKESATKSKETTTIRQGRVKKSSPLTVTTPQSPPA 1320

Query: 286  ATIGFL----------SSAKLPDSNNTPSIFQQPFTDSQQVQLRAQILVYGSVISGSPPE 137
            A  G             S+ LPD NN   IF QPFTD+QQVQLRAQILVYGS+ISG PPE
Sbjct: 1321 AAAGHAIQFQELGSKEKSSPLPDLNNATCIFHQPFTDNQQVQLRAQILVYGSLISGMPPE 1380

Query: 136  EPHMLAAFGQSDGGR-IWEGVWRAYLERLHVQKAQAKS 26
            EPHM+AAFGQSD  R  W+  W A L R + QK+ A +
Sbjct: 1381 EPHMIAAFGQSDAERKTWKDAWHACLNRDNGQKSHANT 1418


>ref|XP_017973233.1| PREDICTED: serine-rich adhesin for platelets isoform X2 [Theobroma
            cacao]
          Length = 2124

 Score =  125 bits (313), Expect = 1e-27
 Identities = 101/321 (31%), Positives = 159/321 (49%), Gaps = 32/321 (9%)
 Frame = -2

Query: 871  IDGHCVKPVESRNTSQSEQEPESNSGGQECIKRLESLCELSEKNGNNSQARGLHDPKESM 692
            +DG   K   S +TS    E ++       I+   S  +L   +  +       +  +S 
Sbjct: 722  VDGDPAKTHSSSSTSVISSESQTKF---HMIESGSSSVDLDNPSCGSPIVIRTSEQSQSK 778

Query: 691  VEVNKTIEHQTSADVVRPSDFKLKGKQEVTKDVRHGIATQEGKNFSFEVDASAGLAQD-- 518
            +E         SA      + +   +Q +++D +   A+   ++F+F+V   A +++   
Sbjct: 779  IEEGVKRSADQSASASGVINGEASEEQSISQDTKGNDASPGDRSFTFKVPPLADMSEKEA 838

Query: 517  GKGFQSYPTFQVSNLPKILD-----SSSSHLDATKLHEVPHS-PQNTSGLTAQTGVKGKA 356
            GK +Q + T Q   L  +++     S SS + A    +  H+ PQ +     + G +G +
Sbjct: 839  GKNWQPFSTMQHDKLSSVVEGTPSTSGSSKVAAKTAQDASHANPQASEREKVRVGSRGTS 898

Query: 355  ERKPRR---KSVPKENARKSLSL--TSPA---------TIGFLSSA---KLPDSNNTP-- 233
            ERK RR   K+  K+ A+K ++   T+PA         +   LSSA   +L  SN     
Sbjct: 899  ERKTRRTGGKNTGKDAAKKGIAAKETTPARQSERSDRSSNASLSSAGIGQLIQSNEMQHY 958

Query: 232  ---SIFQQPFTDSQQVQLRAQILVYGSVISGSPPEEPHMLAAFGQSDGGR-IWEGVWRAY 65
                +F QPFTD QQVQLRAQI VYG++I G+ P+E +M++AFG  DGGR IWE  WRA 
Sbjct: 959  GHIEVFHQPFTDLQQVQLRAQIFVYGALIQGTAPDEAYMISAFGGPDGGRSIWENAWRAC 1018

Query: 64   LERLHVQKAQAKS-NIPLQTR 5
            +ER+H QK+   S   PLQ+R
Sbjct: 1019 IERVHGQKSHLVSPETPLQSR 1039


>ref|XP_021281568.1| uncharacterized protein LOC110414569 isoform X4 [Herrania umbratica]
          Length = 2112

 Score =  124 bits (311), Expect = 3e-27
 Identities = 97/270 (35%), Positives = 148/270 (54%), Gaps = 35/270 (12%)
 Frame = -2

Query: 709  DPKESMVE--VNKTIEHQTSADVVRPSDFKLKGKQEVTKDVRHGIATQEGKNFSFEVDAS 536
            +  +S +E  V ++ +   SA VV   + +   +Q +++D +   A+   ++F+F+V   
Sbjct: 773  EQSQSKIEEGVKRSTDQSASASVV--INREASKEQSISQDAKGNDASLGDRSFTFKVPPL 830

Query: 535  AGLA--QDGKGFQSYPTFQVSNLP-KILD-----SSSSHLDATKLHEVPHS-PQNTSGLT 383
            A L+  + GK +Q + T Q   L  K+++     S +S + A    +  H+ PQ +    
Sbjct: 831  ADLSVKEAGKNWQPFSTMQHDKLSSKVVEGTPSTSGTSKVAAKTAQDASHANPQASGREK 890

Query: 382  AQTGVKGKAERKPRR---KSVPKENARKSLSL--TSPA---------TIGFLSSA---KL 254
             + G +G +ERK RR   KS  KE A+K ++   T+PA         +   LSSA   +L
Sbjct: 891  VRVGSRGASERKTRRTGGKSTGKEAAKKGIAAKETTPARQSERSDRSSNASLSSAGIGQL 950

Query: 253  PDSNNTP-----SIFQQPFTDSQQVQLRAQILVYGSVISGSPPEEPHMLAAFGQSDGGR- 92
              SN         +F QPFTD QQVQLRAQI VYG++I G+ P+E +M++AFG  DGGR 
Sbjct: 951  IQSNEMQHYGHVEVFHQPFTDLQQVQLRAQIFVYGALIQGTAPDEAYMISAFGGPDGGRS 1010

Query: 91   IWEGVWRAYLERLHVQKAQAKS-NIPLQTR 5
            IWE  WRA +ER+H QK+   S   PLQ+R
Sbjct: 1011 IWENAWRACIERVHGQKSHLVSPETPLQSR 1040


>ref|XP_021281560.1| uncharacterized protein LOC110414569 isoform X3 [Herrania umbratica]
          Length = 2125

 Score =  124 bits (311), Expect = 3e-27
 Identities = 97/270 (35%), Positives = 148/270 (54%), Gaps = 35/270 (12%)
 Frame = -2

Query: 709  DPKESMVE--VNKTIEHQTSADVVRPSDFKLKGKQEVTKDVRHGIATQEGKNFSFEVDAS 536
            +  +S +E  V ++ +   SA VV   + +   +Q +++D +   A+   ++F+F+V   
Sbjct: 773  EQSQSKIEEGVKRSTDQSASASVV--INREASKEQSISQDAKGNDASLGDRSFTFKVPPL 830

Query: 535  AGLA--QDGKGFQSYPTFQVSNLP-KILD-----SSSSHLDATKLHEVPHS-PQNTSGLT 383
            A L+  + GK +Q + T Q   L  K+++     S +S + A    +  H+ PQ +    
Sbjct: 831  ADLSVKEAGKNWQPFSTMQHDKLSSKVVEGTPSTSGTSKVAAKTAQDASHANPQASGREK 890

Query: 382  AQTGVKGKAERKPRR---KSVPKENARKSLSL--TSPA---------TIGFLSSA---KL 254
             + G +G +ERK RR   KS  KE A+K ++   T+PA         +   LSSA   +L
Sbjct: 891  VRVGSRGASERKTRRTGGKSTGKEAAKKGIAAKETTPARQSERSDRSSNASLSSAGIGQL 950

Query: 253  PDSNNTP-----SIFQQPFTDSQQVQLRAQILVYGSVISGSPPEEPHMLAAFGQSDGGR- 92
              SN         +F QPFTD QQVQLRAQI VYG++I G+ P+E +M++AFG  DGGR 
Sbjct: 951  IQSNEMQHYGHVEVFHQPFTDLQQVQLRAQIFVYGALIQGTAPDEAYMISAFGGPDGGRS 1010

Query: 91   IWEGVWRAYLERLHVQKAQAKS-NIPLQTR 5
            IWE  WRA +ER+H QK+   S   PLQ+R
Sbjct: 1011 IWENAWRACIERVHGQKSHLVSPETPLQSR 1040


>emb|CDP09978.1| unnamed protein product [Coffea canephora]
          Length = 2176

 Score =  124 bits (311), Expect = 3e-27
 Identities = 123/385 (31%), Positives = 175/385 (45%), Gaps = 61/385 (15%)
 Frame = -2

Query: 973  ILQVESAVASGLRNPSFIVRNEASTDLVEKVVLGI-DGHCVKPVESRNTSQSEQEPESNS 797
            IL   SA AS +     +V   AS +L+        +G     VE+ N  + ++E +  +
Sbjct: 717  ILAETSAAASNVEQ---VVAERASVELLVHCQPNAKEGEGGDVVENLNPDEPQKEKKRVA 773

Query: 796  GGQECIKRLESLCELSEKNGNNSQARGLHDPKESMVEVNKTIEHQTSADVVR---PSDFK 626
               E   +  S+    EK  + S   G+  P+ S  E+NK  +   +  + +   PSD K
Sbjct: 774  ASSEV--QGGSISPAIEKPDDTSDGIGV--PELSECEMNK--QAGVTGGMTKNFPPSDCK 827

Query: 625  LKGKQEVTKD---VRHGIATQEGKNFSFEVDASAGLAQDG--KGFQSYPTFQVSNLPKIL 461
             +   + +     ++  +A+++  +F+F+V     L + G  KG+QS P  Q      ++
Sbjct: 828  ERNDGDTSSSDVALQVNVASKDEGSFAFDVSPLERLPEGGTSKGWQSDPHIQAHKRSTVV 887

Query: 460  D-----SSSSHLDATKLHEVPHSPQNTSGLTAQT-GVKGKAERKPRRKSVP--KENARKS 305
            D     S  S +D   + E+ H  Q T    A     KG +ERK RR S    KENARK 
Sbjct: 888  DKFPSTSGGSQVDPIVVQEISHGSQQTPDKGAPPQAAKGTSERKTRRSSAKSGKENARKG 947

Query: 304  --LSLTSP-----------ATIGFLSSAKL--------------------------PDSN 242
              L  T+P           A IG   S +L                          PD N
Sbjct: 948  NPLKETAPLKHSERGDRLSAPIGSAGSCQLKQLEVTSVERSGAKQGVVLPVSVSSLPDLN 1007

Query: 241  NTPSI---FQQPFTDSQQVQLRAQILVYGSVISGSPPEEPHMLAAFGQSDGGR-IWEGVW 74
             +  +   FQQPFTD QQVQLRAQI VYGS+I G  P+E  M++AFG  +GGR  WE  W
Sbjct: 1008 TSAQVSLFFQQPFTDLQQVQLRAQIFVYGSLIQGVAPDEACMVSAFGMCEGGRSFWEPAW 1067

Query: 73   RAYLERLHVQKAQ-AKSNIPLQTRS 2
            RA LERLH  K     S  P+Q+RS
Sbjct: 1068 RACLERLHGPKLHPGSSETPVQSRS 1092


>gb|EOY24312.1| G2484-1 protein, putative isoform 4 [Theobroma cacao]
          Length = 2110

 Score =  124 bits (310), Expect = 4e-27
 Identities = 103/322 (31%), Positives = 162/322 (50%), Gaps = 33/322 (10%)
 Frame = -2

Query: 871  IDGHCVKPVESRNTSQSEQEPESNSGGQECIKRLESLCELSEKNGNNSQARGLHDPKESM 692
            +DG   K   S  TS    E ++       I+   S  +L   +  +       +  +S 
Sbjct: 722  VDGDPAKTHSSSFTSVISSESQTKF---HMIESGSSSVDLDNPSCGSPIVIRTSEQSQSK 778

Query: 691  VE-VNKTIEHQTSADVVRPSDFKLKGKQEVTKDVRHGIATQEGKNFSFEVDASAGLAQD- 518
            +E V ++ +   SA  V   +     +Q +++D +   A+   ++F+F+V   A +++  
Sbjct: 779  IEGVKRSADQSASASGVINGE--ASKEQSISQDTKGNDASPGDRSFTFKVPPLADMSEKE 836

Query: 517  -GKGFQSYPTFQVSNLPKILD-----SSSSHLDATKLHEVPHS-PQNTSGLTAQTGVKGK 359
             GK +Q + T Q   L  +++     S SS + A    +  H+ PQ +     + G +G 
Sbjct: 837  AGKNWQPFSTMQHDKLSSVVEGTPSTSGSSKVAAKTAQDASHANPQASEREKVRVGSRGT 896

Query: 358  AERKPRR---KSVPKENARKSLSL--TSPA---------TIGFLSSA---KLPDSNNTP- 233
            +ERK RR   K+  K+ A+K ++   T+PA         +   LSSA   +L  SN    
Sbjct: 897  SERKTRRTGGKNTGKDAAKKGIAAKETTPARQSERSDRSSNASLSSAGIGQLIQSNEMQH 956

Query: 232  ----SIFQQPFTDSQQVQLRAQILVYGSVISGSPPEEPHMLAAFGQSDGGR-IWEGVWRA 68
                 +F QPFTD QQVQLRAQI VYG++I G+ P+E +M++AFG  DGGR IWE  WRA
Sbjct: 957  YGHIEVFHQPFTDLQQVQLRAQIFVYGALIQGTAPDEAYMISAFGGPDGGRSIWENAWRA 1016

Query: 67   YLERLHVQKAQAKS-NIPLQTR 5
             +ER+H QK+   S   PLQ+R
Sbjct: 1017 CIERVHGQKSHLVSPETPLQSR 1038


Top