BLASTX nr result

ID: Chrysanthemum22_contig00003378 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum22_contig00003378
         (2473 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KVI08289.1| Zinc finger, C2H2, partial [Cynara cardunculus va...   333   e-100
ref|XP_023739610.1| protein IWS1 homolog [Lactuca sativa] >gi|13...   218   7e-59
gb|PIN17942.1| hypothetical protein CDL12_09382 [Handroanthus im...   164   4e-38
gb|PIN19072.1| hypothetical protein CDL12_08241 [Handroanthus im...   160   9e-37
gb|EYU38966.1| hypothetical protein MIMGU_mgv1a004239mg [Erythra...   156   1e-36
ref|XP_012835550.1| PREDICTED: centromere protein F isoform X2 [...   156   2e-36
ref|XP_012835549.1| PREDICTED: centromere protein F isoform X1 [...   156   3e-36
emb|CDP15126.1| unnamed protein product [Coffea canephora]            159   3e-36
ref|XP_011084140.1| uncharacterized protein LOC105166467 isoform...   159   4e-36
ref|XP_011084139.1| uncharacterized protein LOC105166467 isoform...   159   4e-36
ref|XP_017236352.1| PREDICTED: uncharacterized protein LOC108209...   157   1e-35
ref|XP_017236351.1| PREDICTED: uncharacterized protein LOC108209...   155   4e-35
ref|XP_006358822.2| PREDICTED: uncharacterized protein LOC102585...   151   9e-34
ref|XP_021972241.1| SNF2 domain-containing protein CLASSY 3-like...   150   1e-33
ref|XP_022026623.1| uncharacterized protein LOC110927273 isoform...   149   1e-33
gb|PHU21313.1| hypothetical protein BC332_06420 [Capsicum chinense]   150   1e-33
gb|EEF41771.1| hypothetical protein RCOM_1554430, partial [Ricin...   135   3e-33
ref|XP_022026624.1| uncharacterized protein LOC110927273 isoform...   147   9e-33
ref|XP_022026622.1| uncharacterized protein LOC110927273 isoform...   147   1e-32
ref|XP_021972240.1| SNF2 domain-containing protein CLASSY 3-like...   146   2e-32

>gb|KVI08289.1| Zinc finger, C2H2, partial [Cynara cardunculus var. scolymus]
          Length = 631

 Score =  333 bits (854), Expect = e-100
 Identities = 199/393 (50%), Positives = 249/393 (63%), Gaps = 25/393 (6%)
 Frame = -2

Query: 2286 DIQDQDQTKHTTSSGHEVHICHRCGWPFPNSHPSSKHRRAHKRICGTIEGYTKLIHSDPL 2107
            D+  QD +K T   GH  HICHRCGWPFPNSHPS KHRRAHKRICGTIEGYTKLI S+ +
Sbjct: 41   DMNTQDHSK-TGDEGHGPHICHRCGWPFPNSHPSPKHRRAHKRICGTIEGYTKLIESEAI 99

Query: 2106 SDDD-------EEENTPSPNIEKRIVKESGSGAVGILERSNRSEEDLFSDAVTEFPDAGI 1948
            SDD+       +++ TPSP IEK I+KESG+ A G       +E+D FSDA+TEF D+GI
Sbjct: 100  SDDEHHSDEDKDKDKTPSPKIEKGIIKESGNSAGG-------TEDDSFSDAMTEFSDSGI 152

Query: 1947 SPVSEERLDDTKLLDSLTEARNVDHNLEVSGTPDNHLKE-DKPAENDINALPHLADSKIM 1771
            SPV+      TKLLDS  E    D NLEV  TPD H K+ D+  ENDIN + +  D++I 
Sbjct: 153  SPVA------TKLLDSPIEVSKADDNLEVFKTPDTHTKDADETEENDINTVCNSVDTQIK 206

Query: 1770 LPESDTN-SVPLVEDRNGSHKDKAIDIGEPLNTVQTEQDSAKVI---------------D 1639
            L +SD   S PL+EDR+GS KDK +D+ E  NTV+ + DSA  +               D
Sbjct: 207  LLDSDMKPSDPLIEDRDGSCKDKLMDLVEISNTVEMKPDSANAMDVSSKNVVIFEVSDKD 266

Query: 1638 QGEEAVYVLSVPHDIPLVENAEPLLNDFKDHKSIHSSAPLCLDLDGDKDVERRSMVDTFE 1459
            Q E  VYVLSVP DIPLV+++E L++DFKDH++I+S+ P+ LD             DTFE
Sbjct: 267  QEEPVVYVLSVPSDIPLVDHSETLIDDFKDHETIYSNVPMVLD------------HDTFE 314

Query: 1458 VKTTKEIIQESQTSENGESSI-ESAVSEHTNTGFSASLPAVVIDEAKVPETRPLETSEID 1282
            VKT +  IQESQ SE GE SI ES VSEH NTG S  +P  V +E K  ET+PLE S+  
Sbjct: 315  VKTEEHKIQESQASETGEFSIDESIVSEHPNTG-SILVPTDVTEEVKESETKPLEMSKTV 373

Query: 1281 TESKDSLSEAEVSLDKHSVATLSETKENFEHEH 1183
             E+K SL E   +  K     +++ K  F+HEH
Sbjct: 374  PEAKVSLGERTETASKE----INQEKLEFDHEH 402


>ref|XP_023739610.1| protein IWS1 homolog [Lactuca sativa]
 gb|PLY96876.1| hypothetical protein LSAT_2X47881 [Lactuca sativa]
          Length = 490

 Score =  218 bits (556), Expect = 7e-59
 Identities = 159/377 (42%), Positives = 205/377 (54%), Gaps = 16/377 (4%)
 Frame = -2

Query: 2274 QDQTKHTTSSGHEVHICHRCGWPFPNSHPSSKHRRAHKRICGTIEGYTKLIHSDPLSDD- 2098
            QDQ+K TTSSGH   ICHRCGWPFP SHPSSKHRRAHKRICGTIEGY KLI S+ +SDD 
Sbjct: 4    QDQSKTTTSSGH---ICHRCGWPFPKSHPSSKHRRAHKRICGTIEGYPKLIDSEAVSDDE 60

Query: 2097 ---DEE--ENTPSPNIEKRIVKESGSGAVGILERSNRSEEDLFSDAVTEFPDAGIS-PVS 1936
               DEE  E TPSP IEK+I+           ERS++SE+DLFSDAVTEF D G S P +
Sbjct: 61   HYSDEEDIEITPSPKIEKKIID----------ERSSKSEDDLFSDAVTEFSDGGFSTPGA 110

Query: 1935 EER-LDDTKLLDSLTEARNVDHNLEVSGTPDNHLKEDKPAENDINALPHLADSKIMLPES 1759
            EER  D TKLL S  E    D NLEV  TPD    EDK                   P S
Sbjct: 111  EERFFDTTKLLFSPVELIKGDDNLEVFQTPDT---EDKNG-----------------PHS 150

Query: 1758 DTNSVPLVEDRNGSHKDKAIDIGEPLNTVQTEQDSAKVIDQGEEAVYVLSVPHDIPLVEN 1579
              + + LV+D   S  D A     P  + + ++   K  ++ EEA Y+LSVP DIP+V+ 
Sbjct: 151  HKDKIKLVDDVAPSKSDVA-----PSKSDKDQEKKEK--EEEEEAEYILSVPSDIPIVDQ 203

Query: 1578 AEPLLNDFKDH-KSIHSSAPLCLDLDGDKDVERRSMVDTFEVKTTKEIIQESQTSENGES 1402
            AE LL DFK+H K+IHS+                           ++ I+ESQT +NGES
Sbjct: 204  AETLLQDFKNHDKTIHSN--------------------------VEDKIEESQTYKNGES 237

Query: 1401 SIESAVSEHTNTGFSASLPAVVIDEAKVPETRPLETSEIDT-----ESKDSLSEAEVSLD 1237
            ++ES   +   +  +      ++ E++       E   +++     +SKDS S +  SL+
Sbjct: 238  TMESEKVDDGVSVLNEESNHEIVKESESESVIECEEKIVESVIEGVDSKDSGSSSRNSLE 297

Query: 1236 KH--SVATLSETKENFE 1192
             +  SV+ LS    + E
Sbjct: 298  ANWGSVSVLSTASYDVE 314


>gb|PIN17942.1| hypothetical protein CDL12_09382 [Handroanthus impetiginosus]
          Length = 944

 Score =  164 bits (416), Expect = 4e-38
 Identities = 132/376 (35%), Positives = 191/376 (50%), Gaps = 26/376 (6%)
 Frame = -2

Query: 2289 MDIQDQDQTKHTTS-SGHEVHICHRCGWPFPNSHPSSKHRRAHKRICGTIEGYTKLIHSD 2113
            MD QD   T  +    GH  H+CHRCGWPFPN HPS+KHRRAHK++CGTIEGY K+IHS+
Sbjct: 1    MDGQDHKMTAPSAGHEGHGTHLCHRCGWPFPNPHPSAKHRRAHKKVCGTIEGY-KIIHSE 59

Query: 2112 ------PLSDD----DEEENTPSPNIEKRIVK-ESGSGAVGILERSNRSEEDLFSDAVTE 1966
                   +SDD    DE E+TPSP + K+  +  S SG VG   +SN+SE+D+FSDAVTE
Sbjct: 60   EHDQHLAVSDDEHGSDENEHTPSPMVVKKNAEFASASGGVG--GKSNKSEDDMFSDAVTE 117

Query: 1965 FPDAGISPVSEERLDDTKLLDSLTEARNVD------HNLEVSGTPDNHLKEDKPAENDIN 1804
            F D+GISP  EE+ +  + +D   E ++V+      ++L+V  + D   K D P  +D  
Sbjct: 118  FSDSGISPHLEEQFESMRGVDKSVEEKSVESDLYANNSLKVDESADRTEKFDVPTRSDET 177

Query: 1803 ALPHLA-----DSKIMLPESDTNSVPL-VEDRNGSHKDKAIDIGEPL--NTVQTEQDSAK 1648
            + P         S   +P +DT++  + VE  N   +D   ++   L  N      D  K
Sbjct: 178  SNPGAPVNAKDQSGSAIPTTDTSAEAVSVELVNRLQQDCKSEMPRDLLENNGNECGDGNK 237

Query: 1647 VIDQGEEAVYVLSVPHDIPLVENAEPLLNDFKDHKSIHSSAPLCLDLDGDKDVERRSMVD 1468
               QGEE   + S+  D    + ++P L   ++      S  +  DL    +  + S   
Sbjct: 238  ---QGEED-KLASLTLDSEEGKISDPALTATEELCDKSVSELVEHDLPRQNETPQNSDAS 293

Query: 1467 TFEVKTTKEIIQESQTSENGESSIESAVSEHTNTGFSASLPAVVIDEAKVPETRPLETSE 1288
              EVK+    +Q S ++  G S I  A   H NT  S  +  V      +P  +P   +E
Sbjct: 294  A-EVKSVAHSVQISSSTGTG-SEIPLAEETHENTDASLGVKEVCYSTEDMPSVKPSHAAE 351

Query: 1287 IDTESKDSLSEAEVSL 1240
            +   S   L + EV+L
Sbjct: 352  L---SNTVLVDKEVNL 364


>gb|PIN19072.1| hypothetical protein CDL12_08241 [Handroanthus impetiginosus]
          Length = 941

 Score =  160 bits (405), Expect = 9e-37
 Identities = 126/374 (33%), Positives = 187/374 (50%), Gaps = 24/374 (6%)
 Frame = -2

Query: 2289 MDIQDQDQTKHTTS-SGHEVHICHRCGWPFPNSHPSSKHRRAHKRICGTIEGYTKLIHSD 2113
            MD QD   T  +    GH  H+CHRCGWPFPN HPS+KHRRAHK++CGTIEGY K+IHS+
Sbjct: 1    MDGQDHKMTAPSAGHEGHGTHLCHRCGWPFPNPHPSAKHRRAHKKVCGTIEGY-KIIHSE 59

Query: 2112 ------PLSDD----DEEENTPSPNIEKRIVK-ESGSGAVGILERSNRSEEDLFSDAVTE 1966
                   +SDD    DE E+TPSP + K+  +  S SG VG   +SN+SE+D+FSDAVTE
Sbjct: 60   EHDQHLAVSDDEHGSDENEHTPSPMVVKKNAEFASASGGVG--GKSNKSEDDMFSDAVTE 117

Query: 1965 FPDAGISPVSEERLDDTKLLDSLTEARNVD------HNLEVSGTPDNHLKEDKPAENDIN 1804
            F D+GISP  EE+    + +D   E ++V+       +L+V    D   K D P  +D  
Sbjct: 118  FSDSGISPHLEEQFKSVRGVDKSVEEKSVESDLYPNDSLKVDENADRTEKFDDPTRSDET 177

Query: 1803 ALPHLA-----DSKIMLPESDTNSVPL-VEDRNGSHKDKAIDIGEPLNTVQTEQDSAKVI 1642
            + P         S   +P +DT++  + VE  N   +D   ++  P + ++   +     
Sbjct: 178  SNPGAPVNAKDQSGSAIPITDTSAEAVSVELVNRLQQDCKSEM--PRDLLENNGNECGDG 235

Query: 1641 DQGEEAVYVLSVPHDIPLVENAEPLLNDFKDHKSIHSSAPLCLDLDGDKDVERRSMVDTF 1462
            D+  E   + S+  D    + ++P L   ++      S  +  DL    +  + S     
Sbjct: 236  DKQGEEDKLASLTLDSEEGKISDPALTATEELCDKSVSELVEHDLPCQNETPQNSDASA- 294

Query: 1461 EVKTTKEIIQESQTSENGESSIESAVSEHTNTGFSASLPAVVIDEAKVPETRPLETSEID 1282
            EVK+    +Q S ++  G S I  A   H N   S  +  V      +P  +P   +E+ 
Sbjct: 295  EVKSVAHSVQISSSTGTG-SEIPLAEETHENADASLGVKEVCDSTEDMPSVKPSHAAEL- 352

Query: 1281 TESKDSLSEAEVSL 1240
              S   L + EV+L
Sbjct: 353  --SNTVLVDKEVNL 364


>gb|EYU38966.1| hypothetical protein MIMGU_mgv1a004239mg [Erythranthe guttata]
          Length = 538

 Score =  156 bits (394), Expect = 1e-36
 Identities = 119/316 (37%), Positives = 160/316 (50%), Gaps = 24/316 (7%)
 Frame = -2

Query: 2253 TSSGHEVHICHRCGWPFPNSHPSSKHRRAHKRICGTIEGYTKLIHSD-------PLSDD- 2098
            TS+GHEVHIC RC WPFPN HPS+KHRRAHKR+CGT+EGY KLIHS+        +SDD 
Sbjct: 11   TSTGHEVHICSRCKWPFPNPHPSAKHRRAHKRVCGTVEGY-KLIHSEEEHDRHLSISDDE 69

Query: 2097 ---DEEENTPSPNIEKRIVKESGSGAVGILERSNRSEEDLFSDAVTEFPDAGISPVSEER 1927
               D E +TPSPN+ K+  ++  SG  G   +SNRSE+D+FSDAVTEF D+GISP   ER
Sbjct: 70   HASDSENHTPSPNLVKKKAEDFASGE-GAGAKSNRSEDDVFSDAVTEFSDSGISPSLVER 128

Query: 1926 L--------DD----TKLLDSLTEARNVDHNLEVSGTPDNHLKEDKPAENDINALPHLAD 1783
            L        DD    T     +TE  N    +       N +K D   E  + +   L +
Sbjct: 129  LVMEENPLEDDDPIKTAEKPDITEQVNDPTRIVEMNLQPNIIKSDVSREIAVES-QSLNE 187

Query: 1782 SKIMLPESDTNSVPLVEDRNGSHKDKAIDIGEPLNTVQTEQDSAKVIDQGEEAVYVLSVP 1603
            S I   E    S+ L      S K + + + EP  TVQ E   A V  +         +P
Sbjct: 188  SDIQREEDKLASITL-----DSEKGEVVSVSEPAFTVQHESLHASVTAK--------RIP 234

Query: 1602 HDIPLVENAEPLLNDFKDHKSIHSSAPLCLDLDGDK-DVERRSMVDTFEVKTTKEIIQES 1426
             +  + ENA   + +              + +D DK  +E +  VD  E K    I++  
Sbjct: 235  TE-TVCENAPVEVKE--------------VSVDEDKTTIEFKKDVDHVEEKPNSVILE-- 277

Query: 1425 QTSENGESSIESAVSE 1378
              +ENGE+   + VS+
Sbjct: 278  TVNENGEAGGSAVVSD 293


>ref|XP_012835550.1| PREDICTED: centromere protein F isoform X2 [Erythranthe guttata]
          Length = 584

 Score =  156 bits (394), Expect = 2e-36
 Identities = 119/316 (37%), Positives = 160/316 (50%), Gaps = 24/316 (7%)
 Frame = -2

Query: 2253 TSSGHEVHICHRCGWPFPNSHPSSKHRRAHKRICGTIEGYTKLIHSD-------PLSDD- 2098
            TS+GHEVHIC RC WPFPN HPS+KHRRAHKR+CGT+EGY KLIHS+        +SDD 
Sbjct: 11   TSTGHEVHICSRCKWPFPNPHPSAKHRRAHKRVCGTVEGY-KLIHSEEEHDRHLSISDDE 69

Query: 2097 ---DEEENTPSPNIEKRIVKESGSGAVGILERSNRSEEDLFSDAVTEFPDAGISPVSEER 1927
               D E +TPSPN+ K+  ++  SG  G   +SNRSE+D+FSDAVTEF D+GISP   ER
Sbjct: 70   HASDSENHTPSPNLVKKKAEDFASGE-GAGAKSNRSEDDVFSDAVTEFSDSGISPSLVER 128

Query: 1926 L--------DD----TKLLDSLTEARNVDHNLEVSGTPDNHLKEDKPAENDINALPHLAD 1783
            L        DD    T     +TE  N    +       N +K D   E  + +   L +
Sbjct: 129  LVMEENPLEDDDPIKTAEKPDITEQVNDPTRIVEMNLQPNIIKSDVSREIAVES-QSLNE 187

Query: 1782 SKIMLPESDTNSVPLVEDRNGSHKDKAIDIGEPLNTVQTEQDSAKVIDQGEEAVYVLSVP 1603
            S I   E    S+ L      S K + + + EP  TVQ E   A V  +         +P
Sbjct: 188  SDIQREEDKLASITL-----DSEKGEVVSVSEPAFTVQHESLHASVTAK--------RIP 234

Query: 1602 HDIPLVENAEPLLNDFKDHKSIHSSAPLCLDLDGDK-DVERRSMVDTFEVKTTKEIIQES 1426
             +  + ENA   + +              + +D DK  +E +  VD  E K    I++  
Sbjct: 235  TE-TVCENAPVEVKE--------------VSVDEDKTTIEFKKDVDHVEEKPNSVILE-- 277

Query: 1425 QTSENGESSIESAVSE 1378
              +ENGE+   + VS+
Sbjct: 278  TVNENGEAGGSAVVSD 293


>ref|XP_012835549.1| PREDICTED: centromere protein F isoform X1 [Erythranthe guttata]
          Length = 592

 Score =  156 bits (394), Expect = 3e-36
 Identities = 119/316 (37%), Positives = 160/316 (50%), Gaps = 24/316 (7%)
 Frame = -2

Query: 2253 TSSGHEVHICHRCGWPFPNSHPSSKHRRAHKRICGTIEGYTKLIHSD-------PLSDD- 2098
            TS+GHEVHIC RC WPFPN HPS+KHRRAHKR+CGT+EGY KLIHS+        +SDD 
Sbjct: 11   TSTGHEVHICSRCKWPFPNPHPSAKHRRAHKRVCGTVEGY-KLIHSEEEHDRHLSISDDE 69

Query: 2097 ---DEEENTPSPNIEKRIVKESGSGAVGILERSNRSEEDLFSDAVTEFPDAGISPVSEER 1927
               D E +TPSPN+ K+  ++  SG  G   +SNRSE+D+FSDAVTEF D+GISP   ER
Sbjct: 70   HASDSENHTPSPNLVKKKAEDFASGE-GAGAKSNRSEDDVFSDAVTEFSDSGISPSLVER 128

Query: 1926 L--------DD----TKLLDSLTEARNVDHNLEVSGTPDNHLKEDKPAENDINALPHLAD 1783
            L        DD    T     +TE  N    +       N +K D   E  + +   L +
Sbjct: 129  LVMEENPLEDDDPIKTAEKPDITEQVNDPTRIVEMNLQPNIIKSDVSREIAVES-QSLNE 187

Query: 1782 SKIMLPESDTNSVPLVEDRNGSHKDKAIDIGEPLNTVQTEQDSAKVIDQGEEAVYVLSVP 1603
            S I   E    S+ L      S K + + + EP  TVQ E   A V  +         +P
Sbjct: 188  SDIQREEDKLASITL-----DSEKGEVVSVSEPAFTVQHESLHASVTAK--------RIP 234

Query: 1602 HDIPLVENAEPLLNDFKDHKSIHSSAPLCLDLDGDK-DVERRSMVDTFEVKTTKEIIQES 1426
             +  + ENA   + +              + +D DK  +E +  VD  E K    I++  
Sbjct: 235  TE-TVCENAPVEVKE--------------VSVDEDKTTIEFKKDVDHVEEKPNSVILE-- 277

Query: 1425 QTSENGESSIESAVSE 1378
              +ENGE+   + VS+
Sbjct: 278  TVNENGEAGGSAVVSD 293


>emb|CDP15126.1| unnamed protein product [Coffea canephora]
          Length = 1107

 Score =  159 bits (402), Expect = 3e-36
 Identities = 130/393 (33%), Positives = 193/393 (49%), Gaps = 30/393 (7%)
 Frame = -2

Query: 2289 MDIQDQDQTKHTTSS--GHEVHICHRCGWPFPNSHPSSKHRRAHKRICGTIEGYTKLIHS 2116
            MD+QD  +T  +     GH VH+CH+CGWPFPN HPS+KHRRAHKR+CG +EGY KL+ S
Sbjct: 1    MDVQDHKKTPSSAGGHEGHGVHVCHKCGWPFPNPHPSAKHRRAHKRVCGKVEGY-KLVDS 59

Query: 2115 --DPLSDDDEEEN-----TPSPNIEKRIVKESGSGAVGILERSNRSEEDLFSDAVTEFPD 1957
              D +SDDD   +     TPSP +EK  VKE GSGA GI  +S++SE+D+FSDAVTEF D
Sbjct: 60   ETDHISDDDHLSDDDIVKTPSPKMEKGSVKEVGSGA-GIGLKSSKSEDDVFSDAVTEFSD 118

Query: 1956 AGISPVSEERLDDTKLLDSLTEARNVDHNLEVSGTPDNHLKEDKPAENDINALPHLADSK 1777
            +GISP  EERL+  + +D+   A  V H L      D+   ED  A++    L  L   +
Sbjct: 119  SGISPSIEERLESVREVDNTVGAELV-HELN-----DSQKSEDCRADDTTKQLDDLTTGR 172

Query: 1776 IMLPESDTNSVPLVEDRNGSHKDKAIDIGEPLNTVQTEQDSAKVIDQGEEAVYVLSVPHD 1597
                  + ++  +VE            I E  NT     + A+ +  G E    L +   
Sbjct: 173  ------EISNAEVVES----------VINEAENTKPASDNRAEEVSFGVEQTDGLQINSS 216

Query: 1596 IPLVEN-AEPLLNDFKD--HKSIHSSAPLCLDLDGDKDVERRSMVDTFEVKTTKEIIQES 1426
              + E  +E L+ + +    K I SS         + +++ +  V+  E  T   ++   
Sbjct: 217  PNVFETISEDLVANAESGKQKEIGSS-------KSETNIQVKESVNEVESSTESVVLLSK 269

Query: 1425 QTSENGESSIESAVSEHTNTGFSASLPAVVID---EAKVPETRPLE-------------- 1297
               E   S  +S V+E    G S  L    ++   + KV +T  +E              
Sbjct: 270  SPDEASLSKSKSDVAE----GSSGCLVVETMEHEADRKVSDTMTMEPKLHEASGSISHAA 325

Query: 1296 -TSEIDTESKDSLSEAEVSLDKHSVATLSETKE 1201
               EI  + K+  +++E  +   SV+T++E  E
Sbjct: 326  AVKEIVEQEKEPSNKSEARMT--SVSTINEIIE 356


>ref|XP_011084140.1| uncharacterized protein LOC105166467 isoform X2 [Sesamum indicum]
          Length = 1081

 Score =  159 bits (401), Expect = 4e-36
 Identities = 118/369 (31%), Positives = 181/369 (49%), Gaps = 10/369 (2%)
 Frame = -2

Query: 2289 MDIQDQDQTKHTTSSGHEVHICHRCGWPFPNSHPSSKHRRAHKRICGTIEGYTKLIHSD- 2113
            MD QD   T  T   GH VH+CHRCGWPFPN+HPS+KHRRAHKR+CGTIEGY K+IHS+ 
Sbjct: 1    MDSQDHKMTA-TGHEGHGVHLCHRCGWPFPNAHPSAKHRRAHKRVCGTIEGY-KIIHSEE 58

Query: 2112 -----PLSDD----DEEENTPSPNIEKRIVKESGSGAVGILERSNRSEEDLFSDAVTEFP 1960
                  +SDD    D++E+TP P + K+  +E  S + G  E+SNRSE+D+FSDAVTEF 
Sbjct: 59   HDDHLAVSDDEHASDDDEHTPVPQLVKKNSEEFRSSS-GAGEKSNRSEDDVFSDAVTEFS 117

Query: 1959 DAGISPVSEERLDDTKLLDSLTEARNVDHNLEVSGTPDNHLKEDKPAENDINALPHLADS 1780
            D+GISP  EER +  + LD   E ++V+ +L        +  E    +  ++    L D 
Sbjct: 118  DSGISPRLEERFESVRGLDKRMEQKSVEGDL--------YRTESLKVDETVDKTEQLED- 168

Query: 1779 KIMLPESDTNSVPLVEDRNGSHKDKAIDIGEPLNTVQTEQDSAKVIDQGEEAVYVLSVPH 1600
                 E     V  + +   ++         P+     E  S ++I+  +  +     P 
Sbjct: 169  PTRCEEMSNRVVASIANNQSANV-------LPVTDSSAEAVSVELINGLQPDLIKSETPT 221

Query: 1599 DIPLVENAEPLLNDFKDHKSIHSSAPLCLDLDGDKDVERRSMVDTFEVKTTKEIIQESQT 1420
            D   V N     N++ D   +   +    D+ G++D      +D+           E + 
Sbjct: 222  D---VNNT----NEYGDGGILKGQSGHNADIQGEEDNLASVTLDS-----------EGKI 263

Query: 1419 SENGESSIESAVSEHTNTGFSASLPAVVIDEAKVPETRPLETSEIDTESKDSLSEAEVSL 1240
            S  G  ++E+  + H        L + V+ E   P++  L+  +   ES+D    AE S 
Sbjct: 264  SGPGIKAVETKEASHD------KLVSGVVLEYLPPKSETLQNLDAPAESRDVADSAENSC 317

Query: 1239 DKHSVATLS 1213
              ++V  ++
Sbjct: 318  SANTVGEIA 326


>ref|XP_011084139.1| uncharacterized protein LOC105166467 isoform X1 [Sesamum indicum]
          Length = 1092

 Score =  159 bits (401), Expect = 4e-36
 Identities = 118/369 (31%), Positives = 181/369 (49%), Gaps = 10/369 (2%)
 Frame = -2

Query: 2289 MDIQDQDQTKHTTSSGHEVHICHRCGWPFPNSHPSSKHRRAHKRICGTIEGYTKLIHSD- 2113
            MD QD   T  T   GH VH+CHRCGWPFPN+HPS+KHRRAHKR+CGTIEGY K+IHS+ 
Sbjct: 1    MDSQDHKMTA-TGHEGHGVHLCHRCGWPFPNAHPSAKHRRAHKRVCGTIEGY-KIIHSEE 58

Query: 2112 -----PLSDD----DEEENTPSPNIEKRIVKESGSGAVGILERSNRSEEDLFSDAVTEFP 1960
                  +SDD    D++E+TP P + K+  +E  S + G  E+SNRSE+D+FSDAVTEF 
Sbjct: 59   HDDHLAVSDDEHASDDDEHTPVPQLVKKNSEEFRSSS-GAGEKSNRSEDDVFSDAVTEFS 117

Query: 1959 DAGISPVSEERLDDTKLLDSLTEARNVDHNLEVSGTPDNHLKEDKPAENDINALPHLADS 1780
            D+GISP  EER +  + LD   E ++V+ +L        +  E    +  ++    L D 
Sbjct: 118  DSGISPRLEERFESVRGLDKRMEQKSVEGDL--------YRTESLKVDETVDKTEQLED- 168

Query: 1779 KIMLPESDTNSVPLVEDRNGSHKDKAIDIGEPLNTVQTEQDSAKVIDQGEEAVYVLSVPH 1600
                 E     V  + +   ++         P+     E  S ++I+  +  +     P 
Sbjct: 169  PTRCEEMSNRVVASIANNQSANV-------LPVTDSSAEAVSVELINGLQPDLIKSETPT 221

Query: 1599 DIPLVENAEPLLNDFKDHKSIHSSAPLCLDLDGDKDVERRSMVDTFEVKTTKEIIQESQT 1420
            D   V N     N++ D   +   +    D+ G++D      +D+           E + 
Sbjct: 222  D---VNNT----NEYGDGGILKGQSGHNADIQGEEDNLASVTLDS-----------EGKI 263

Query: 1419 SENGESSIESAVSEHTNTGFSASLPAVVIDEAKVPETRPLETSEIDTESKDSLSEAEVSL 1240
            S  G  ++E+  + H        L + V+ E   P++  L+  +   ES+D    AE S 
Sbjct: 264  SGPGIKAVETKEASHD------KLVSGVVLEYLPPKSETLQNLDAPAESRDVADSAENSC 317

Query: 1239 DKHSVATLS 1213
              ++V  ++
Sbjct: 318  SANTVGEIA 326


>ref|XP_017236352.1| PREDICTED: uncharacterized protein LOC108209769 isoform X2 [Daucus
            carota subsp. sativus]
          Length = 968

 Score =  157 bits (396), Expect = 1e-35
 Identities = 111/355 (31%), Positives = 178/355 (50%), Gaps = 9/355 (2%)
 Frame = -2

Query: 2235 VHICHRCGWPFPNSHPSSKHRRAHKRICGTIEGYTKLIHSDPLSDDDEEENTPSPNIEKR 2056
            VH+CH+CGWPFP  HPSSKHRRAHKRICGTI+GY       P+SDD +   TPSP  +KR
Sbjct: 15   VHVCHKCGWPFPKLHPSSKHRRAHKRICGTIQGY-------PVSDDAQICETPSPEADKR 67

Query: 2055 IVKESGSGAVGILERSNRSEEDLFSDAVTEFPDAGISPVSEER-LDDTKLLDSLTEARNV 1879
             +        G+ ERS RSE+++F+DAV EF D+G+SP SEER L+D K L+       V
Sbjct: 68   SI-------AGVGERSYRSEDEVFADAVAEFSDSGMSPASEERQLEDVKELEKNVSMTVV 120

Query: 1878 DHNLEVSGTPDNHLKEDKPAENDINALPHLADSKIMLPESDTNSVPLVEDRNGSHKDKAI 1699
            D ++  +GT    LK D   ++ +  +       I        S P V+++    +D   
Sbjct: 121  DDDVFSTGT----LKLDDVCDS-VKPMSSQPAHNIAPSTGLAESSPSVQNKGYLTQDTID 175

Query: 1698 DIGEPLNTVQTEQDSAKVIDQGEEAVYVLSVPHDIPLVENAEPLLNDFKDHKSIHSSAPL 1519
                 L+ +       K      EA +V +V  D+P+VE+A+ +L D ++ K + S  PL
Sbjct: 176  GSSSRLSEIHGMNGEEK------EATHVQAVFSDLPIVEDADIMLKDVENQKLLKSEIPL 229

Query: 1518 CL---DLDGDKDVERRSMVDTFEVKTTKEIIQ-ESQTSENGESSIESAVSEHTNTGFSAS 1351
             L    +D     + ++M ++  ++  + + + ESQ++E+  S +     +  +      
Sbjct: 230  VLGSVTVDRTLTKDNKNMPESQSIEPGRYLTELESQSTEHELSHLSGRAKQEASAVTVLI 289

Query: 1350 LPAVVIDEAKVPETRPLETSEIDTESKDSLSEAEVSLD----KHSVATLSETKEN 1198
               V  DE        +E    + E ++++    V+ D     H+   L + K++
Sbjct: 290  GEVVTQDEKSGTHCDSVEVCNSNREPEENMHVLSVASDLPIVNHADLMLQDFKDH 344


>ref|XP_017236351.1| PREDICTED: uncharacterized protein LOC108209769 isoform X1 [Daucus
            carota subsp. sativus]
          Length = 996

 Score =  155 bits (392), Expect = 4e-35
 Identities = 113/377 (29%), Positives = 179/377 (47%), Gaps = 31/377 (8%)
 Frame = -2

Query: 2235 VHICHRCGWPFPNSHPSSKHRRAHKRICGTIEGYTKLIHSDPLSDDDEEENTPSPNIEKR 2056
            VH+CH+CGWPFP  HPSSKHRRAHKRICGTI+GY       P+SDD +   TPSP  +KR
Sbjct: 15   VHVCHKCGWPFPKLHPSSKHRRAHKRICGTIQGY-------PVSDDAQICETPSPEADKR 67

Query: 2055 IVKESGSGAVGILERSNRSEEDLFSDAVTEFPDAGISPVSEER-LDDTKLLDSLTEARNV 1879
             +        G+ ERS RSE+++F+DAV EF D+G+SP SEER L+D K L+       V
Sbjct: 68   SI-------AGVGERSYRSEDEVFADAVAEFSDSGMSPASEERQLEDVKELEKNVSMTVV 120

Query: 1878 DHNLEVSGT----------------------PDNHLKEDKPAENDINALPHLADSKIMLP 1765
            D ++  +GT                      P   L E  P+  D   L +     +  P
Sbjct: 121  DDDVFSTGTLKLDDVCDSVKPMSSQPAHNIAPSTGLAESSPSGKDEAELSNNGLFLLSAP 180

Query: 1764 ESDTNSVPLVEDRNGSHKDKAIDIGEPLNTVQTEQDSAKVIDQGEEAVYVLSVPHDIPLV 1585
              +T  V       G      ID      +     +   +  + +EA +V +V  D+P+V
Sbjct: 181  VENTEVVVDAVQNKGYLTQDTID-----GSSSRLSEIHGMNGEEKEATHVQAVFSDLPIV 235

Query: 1584 ENAEPLLNDFKDHKSIHSSAPLCL---DLDGDKDVERRSMVDTFEVKTTKEIIQ-ESQTS 1417
            E+A+ +L D ++ K + S  PL L    +D     + ++M ++  ++  + + + ESQ++
Sbjct: 236  EDADIMLKDVENQKLLKSEIPLVLGSVTVDRTLTKDNKNMPESQSIEPGRYLTELESQST 295

Query: 1416 ENGESSIESAVSEHTNTGFSASLPAVVIDEAKVPETRPLETSEIDTESKDSLSEAEVSLD 1237
            E+  S +     +  +         V  DE        +E    + E ++++    V+ D
Sbjct: 296  EHELSHLSGRAKQEASAVTVLIGEVVTQDEKSGTHCDSVEVCNSNREPEENMHVLSVASD 355

Query: 1236 ----KHSVATLSETKEN 1198
                 H+   L + K++
Sbjct: 356  LPIVNHADLMLQDFKDH 372


>ref|XP_006358822.2| PREDICTED: uncharacterized protein LOC102585759 [Solanum tuberosum]
          Length = 1014

 Score =  151 bits (381), Expect = 9e-34
 Identities = 132/393 (33%), Positives = 196/393 (49%), Gaps = 25/393 (6%)
 Frame = -2

Query: 2289 MDIQDQDQTKHTTSSGHE---VHICHRCGWPFPNSHPSSKHRRAHKRICGTIEGYTKLIH 2119
            M ++ QD  K TT SGHE    H+CH+C WPFPN HPS++HRRAHK++CG IEGY KL  
Sbjct: 1    MLMESQDH-KMTTPSGHENHGSHLCHKCSWPFPNPHPSARHRRAHKKVCGKIEGY-KLSE 58

Query: 2118 SD-------PLSDD----DEEENTPSPNIEKRIVKESGSGAVGILERSNRSEEDLFSDAV 1972
            S+        +SDD    D ++ TPSP  +K  VK+  SG     ++S RSE++ FSDAV
Sbjct: 59   SEAGNSTHSAVSDDEHHSDGDQQTPSPIGKKTSVKDGSSG-----DKSYRSEDETFSDAV 113

Query: 1971 TEFPDAGISPVSEERLDDTKLLDSLTEARNVDHNLEVSGTPDNHLKED-----KPAENDI 1807
             EF D+GISP  EER +  K L+  T  + VD         D  LK D       + ND 
Sbjct: 114  MEFSDSGISPGMEERPEGVKSLN--TNVKKVD---------DELLKADAIGGISVSVNDK 162

Query: 1806 NALPHLADSKIMLPESDTNSVPLVEDRNGSHKDKAIDIGEPLNTVQTEQDSAKVIDQGEE 1627
            +    + D +   PES TN  P+ +   GS  D+++D+       Q +  + K    G+ 
Sbjct: 163  HLTAEVNDPE--SPESATNQ-PVADKSLGSKLDRSVDL-------QVDASAVKSEISGDA 212

Query: 1626 AVYVLSVPHDIPL--VENAEPLLNDFKDHKSIHSSAPLCLDLDGDKDVERRSMVDTFEVK 1453
            ++  ++ P  I    ++ +    ND K  + I+++  L   ++    V +  + DT E K
Sbjct: 213  SLQEMNAPESIEAKQMQMSSDQPNDLKAIEDINANEGLADAVEASVQVSQSVVSDTDE-K 271

Query: 1452 TTKEIIQESQTSENGESSIESAVSEHTNTGFSASLPAVVIDEAKVPETRPLETSEID--- 1282
            T  E   + Q +E   S +ES + E  +                VP    L+ SE +   
Sbjct: 272  TCYE--SKPQEAEGKFSVVESKLLEAEDQA-----------TENVPNKAELQHSERENPD 318

Query: 1281 -TESKDSLSEAEVSLDKHSVATLSETKENFEHE 1186
             TE K +LSEAEV     S+  ++  KE+ +H+
Sbjct: 319  STELKFALSEAEVK----SLDGVNVDKEHEQHD 347


>ref|XP_021972241.1| SNF2 domain-containing protein CLASSY 3-like isoform X2 [Helianthus
            annuus]
          Length = 745

 Score =  150 bits (378), Expect = 1e-33
 Identities = 127/406 (31%), Positives = 194/406 (47%), Gaps = 42/406 (10%)
 Frame = -2

Query: 2274 QDQTKHTTSSGHEVHICHRCGWPFPNSHPSSKHRRAHKRICGTIEGYTKLIHSDPLSDDD 2095
            +DQ  HT  SG  VH+CHRCGWPFPN HPS+KHRRAHK+ICGTI+GYT LI S+  SDDD
Sbjct: 4    KDQITHTPPSG--VHLCHRCGWPFPNPHPSAKHRRAHKKICGTIDGYTSLIGSEVGSDDD 61

Query: 2094 EEENTPSPNIEKRIVKESGSGAVGILERSNRSEEDLFSDAVTEFPDAGISPVSEERLDDT 1915
            +E+ TPSP I +++         G+    +RSE+++FSDAVTEFPD+G    S  +  D 
Sbjct: 62   KEK-TPSPKIVEKV--------GGMRVSFSRSEDEVFSDAVTEFPDSG----SASKTLDR 108

Query: 1914 KLLDSLTEARN-----VDHNLEVSGTPDNHLKEDKPAE----NDINALPHLADSKIMLPE 1762
             L  S  +A N     VD ++EVS   + H+++ +  E     D +   H      ++ +
Sbjct: 109  DLFFSFKDAENDEISKVDASVEVSEKINTHVEKSEETEIATVGDSSESNHELTDTGLVKD 168

Query: 1761 SDTNSVPLVEDRNGSHKDKAIDI----GEPLNTVQTEQDSAKVIDQGE------EAVYVL 1612
                  P     + S + K+ID+     E     QT + S K     E      E   ++
Sbjct: 169  GVMGHAPEPGHISHSQETKSIDVVEAEEEKGQESQTSESSTKEAIDSELTKTPPEVSAIV 228

Query: 1611 SVPHDIPLVE------NAEPLLNDFKDHKSIHSSAPLCLDLDGDKDVERRSMVDTFEVKT 1450
               HD    E      +   ++ND+K  +  +      L+ +   +V + S  D  E + 
Sbjct: 229  GGYHDNVKQEIKSDYVHVSEVVNDYKTVEETNIE-KFGLEYEVSSEVVKESETDRLE-EV 286

Query: 1449 TKEIIQESQTSENGE--------SSIE--SAVSEHTNTGFSASLPAVVIDE--AKVPETR 1306
             +EI+ +   S++ +        +S++  S + E  N+G        V DE  A + E +
Sbjct: 287  AEEIMDKKSESDHKQDPEVAKEPASVQNVSVLIEKENSGAPNVQIQEVFDEPVAVITEKK 346

Query: 1305 PLETSEIDTESKDSLSEA-----EVSLDKHSVATLSETKENFEHEH 1183
             LE  +++  S   + E       V  +K  +A     KE  EH H
Sbjct: 347  DLEAHKLEKSSIQQIQEVVREPDTVLTEKEDLAGPHLEKELKEHTH 392


>ref|XP_022026623.1| uncharacterized protein LOC110927273 isoform X2 [Helianthus annuus]
          Length = 750

 Score =  149 bits (377), Expect = 1e-33
 Identities = 123/391 (31%), Positives = 187/391 (47%), Gaps = 22/391 (5%)
 Frame = -2

Query: 2289 MDIQDQDQTKHTTSSGHE----VHICHRCGWPFPNSHPSSKHRRAHKRICGTIEGYTKLI 2122
            MD+QD   T HTT SGH     VH+CHRCGWPFPN HPS+KHRRAHK+ICGTIEGYT +I
Sbjct: 1    MDVQDH--TTHTTHSGHHEGHGVHLCHRCGWPFPNPHPSAKHRRAHKKICGTIEGYTNII 58

Query: 2121 HSDPLSD-----DDEEENTPSPNIEKRIVKESGSGAVGILERSNRSEEDLFSDAVTEFPD 1957
             S+ +SD     DD++E +PSP +E +          G L RS    ED F+DAVTEF D
Sbjct: 59   GSEVVSDDDHLSDDDKEKSPSPKVEIK----------GSLARS----EDEFTDAVTEFAD 104

Query: 1956 AGISPVSEERLDDTKLLDSLTEARN-----VDHNLEVSGTPDNHLKEDKPAENDINALPH 1792
             G    S ++  D  L  S  +A N     VD ++EVS   D+H+ E    + +I  +  
Sbjct: 105  TG----SAKKALDRDLFFSFKDAENDETSKVDTSIEVSEKTDSHI-EKTEEKTEIKTVVD 159

Query: 1791 LADSKIMLPESDTNSVPLVEDRNGSHKDKAIDIGEPLN-TVQTEQDSAKVIDQGEEAVYV 1615
             ++S + L    T +   V+D             EP + ++  E  S  V++ GEE    
Sbjct: 160  SSESNVEL----TGTGEPVKDGVAGPA-----ASEPTHASLPQEAKSIDVMEAGEEKGQE 210

Query: 1614 LSVPHDIPLVENAEPLLNDFKDHKSIHSSAPLCLDLDGDKDVERRSMVDTFEVKTTKEII 1435
                      + +E +  + KD +   ++  +   + G  D     +    E+K   E +
Sbjct: 211  ---------SQTSETITEEAKDSEITKTTPEVSAIVGGCDDNVSEEI--KHEIKPDCEHV 259

Query: 1434 QESQTSENGESSIESAVSEHTNTGFSASLPAVVIDE--AKVPETRPLETSEIDTESKDSL 1261
             E       + ++     +  + G    L   V++E  A + ET+ LE   ++  S + +
Sbjct: 260  SEVAKETEVDQTVSVPNEKVEDLGAPKELIQEVVNEPVAVLTETKDLEEPALEKSSCEQI 319

Query: 1260 SEA-----EVSLDKHSVATLSETKENFEHEH 1183
             EA      V ++K  +      K++ EH+H
Sbjct: 320  QEAVNEPVAVLIEKEDLGGPHLEKQSIEHKH 350


>gb|PHU21313.1| hypothetical protein BC332_06420 [Capsicum chinense]
          Length = 965

 Score =  150 bits (379), Expect = 1e-33
 Identities = 115/377 (30%), Positives = 188/377 (49%), Gaps = 11/377 (2%)
 Frame = -2

Query: 2283 IQDQDQTKHTTSSGHE---VHICHRCGWPFPNSHPSSKHRRAHKRICGTIEGYTKLIHSD 2113
            +++QD  K TT SGHE    H+CH+CGWPFPN HPS++HRRAHK++CG IEGY +     
Sbjct: 1    MENQDH-KVTTPSGHENHGPHLCHKCGWPFPNPHPSARHRRAHKKVCGKIEGYKRGESEA 59

Query: 2112 PLSDD----DEEENTPSPNIEKRIVKESGSGAVGILERSNRSEEDLFSDAVTEFPDAGIS 1945
             +SDD    D ++ TPSPN +K  VKE G    GI ++S RSE++ FSDAVTEF D GI+
Sbjct: 60   AVSDDEHHSDGDQQTPSPNDKKTSVKEVG----GIGDKSYRSEDETFSDAVTEFSDCGIN 115

Query: 1944 PVSEERLDDTKLLDSLTEARNVDHNLEVSGTPDNHLKEDKPAENDINALPHLADSKIMLP 1765
             V EER +  K L+  T  + +D +++                   NA+  +++    L 
Sbjct: 116  LVMEERPEGVKSLN--TNVQKIDDDVKA------------------NAIGGISNDPESL- 154

Query: 1764 ESDTNSVPLVEDRNGSHKDKAIDIGEPLNTVQTEQDSAKVIDQGEEAVYVLSVPHDIPLV 1585
            ES TN  P+ +   G+  D+ +D+   L+   T+ +        ++     S+      +
Sbjct: 155  ESATNQ-PVADKSLGTKLDRPVDL--QLDASPTKSEIPGDASLQDDMSATASIEAKQVQM 211

Query: 1584 ENAEPLLNDFKDHKSIHSSAPLCLDLDGDKDVERRSMVDTFEVKTTKEIIQESQTSENGE 1405
             + +P  NDFK  + I++   L   ++   +V +  + +T    +  E  +     E G+
Sbjct: 212  SSGQP--NDFKGIEDINAGGGLVDTVEASVEVSQSVVSETDGKTSNNESYESKPQEEEGK 269

Query: 1404 SSIESAVSEH---TNTGFSASLPAVVIDEAKVPETRPLETSEIDTESKDSLS-EAEVSLD 1237
            SS    +S      N  FS     +  +E +  ++ P E   +  E ++  S + +++  
Sbjct: 270  SSDPLILSSDLLGANRKFSVVESKLPEEEEQAIDSVPNEADLLHKERENPKSADMKLNFS 329

Query: 1236 KHSVATLSETKENFEHE 1186
            +  V +L     + EHE
Sbjct: 330  EAEVKSLDGVDVDKEHE 346


>gb|EEF41771.1| hypothetical protein RCOM_1554430, partial [Ricinus communis]
          Length = 160

 Score =  135 bits (341), Expect = 3e-33
 Identities = 73/142 (51%), Positives = 93/142 (65%), Gaps = 10/142 (7%)
 Frame = -2

Query: 2241 HEVHICHRCGWPFPNSHPSSKHRRAHKRICGTIEGYTKLIHSD-----PLSDD----DEE 2089
            H VH+CH+CGWPFPN HPS+KHRRAHK+ICGTIEGY KL+ S+      +S+D    DE+
Sbjct: 4    HGVHVCHKCGWPFPNPHPSAKHRRAHKKICGTIEGY-KLVQSEGSTHSTMSEDEHQSDED 62

Query: 2088 ENTPSPNIEKRIVKESGSGAVGILERSNRSEEDLFSDAVTEFPDAGISPVSEERLDDTKL 1909
              TPSP I +R   E GSGA+G  +RS  SE+++F+DAV EFPD+G   V EE  +D K 
Sbjct: 63   HKTPSPQILERSSNEKGSGAIG--DRSGISEDEVFADAVAEFPDSGSRKVIEESPEDVKK 120

Query: 1908 LDS-LTEARNVDHNLEVSGTPD 1846
            L + L    N D    +S   D
Sbjct: 121  LATFLASVANNDTRTTLSYEDD 142


>ref|XP_022026624.1| uncharacterized protein LOC110927273 isoform X3 [Helianthus annuus]
          Length = 732

 Score =  147 bits (370), Expect = 9e-33
 Identities = 121/396 (30%), Positives = 186/396 (46%), Gaps = 27/396 (6%)
 Frame = -2

Query: 2289 MDIQDQDQTKHTTSSGHE----VHICHRCGWPFPNSHPSSKHRRAHKRICGTIEGYTKLI 2122
            MD+QD   T HTT SGH     VH+CHRCGWPFPN HPS+KHRRAHK+ICGTIEGYT +I
Sbjct: 1    MDVQDH--TTHTTHSGHHEGHGVHLCHRCGWPFPNPHPSAKHRRAHKKICGTIEGYTNII 58

Query: 2121 HSDPLSD-----DDEEENTPSPNIEKRIVKESGSGAVGILERSNRSEEDLFSDAVTEFPD 1957
             S+ +SD     DD++E +PSP +E +          G L RS    ED F+DAVTEF D
Sbjct: 59   GSEVVSDDDHLSDDDKEKSPSPKVEIK----------GSLARS----EDEFTDAVTEFAD 104

Query: 1956 AGISPVSEER----------LDDTKLLDSLTEARNVDHNLEVSGTPDNHLKEDKPAENDI 1807
             G +  + +R           D    L +  E   VD ++EVS   D+H+ E    + +I
Sbjct: 105  TGSAKKALDRDLFFSFKDAENDGANELLNAVETSKVDTSIEVSEKTDSHI-EKTEEKTEI 163

Query: 1806 NALPHLADSKIMLPESDTNSVPLVEDRNGSHKDKAIDIGEPLN-TVQTEQDSAKVIDQGE 1630
              +   ++S + L    T +   V+D             EP + ++  E  S  V++ GE
Sbjct: 164  KTVVDSSESNVEL----TGTGEPVKDGVAGPA-----ASEPTHASLPQEAKSIDVMEAGE 214

Query: 1629 EAVYVLSVPHDIPLVENAEPLLNDFKDHKSIHSSAPLCLDLDGDKDVERRSMVDTFEVKT 1450
            E              + +E +  + KD +   ++  +   + G  D     +    E+K 
Sbjct: 215  EKGQE---------SQTSETITEEAKDSEITKTTPEVSAIVGGCDDNVSEEI--KHEIKP 263

Query: 1449 TKEIIQESQTSENGESSIESAVSEHTNTGFSASLPAVVIDE--AKVPETRPLETSEIDTE 1276
              E + E       + ++     +  + G    L   V++E  A + ET+ LE   ++  
Sbjct: 264  DCEHVSEVAKETEVDQTVSVPNEKVEDLGAPKELIQEVVNEPVAVLTETKDLEEPALEKS 323

Query: 1275 SKDSLSEA-----EVSLDKHSVATLSETKENFEHEH 1183
            S + + EA      V ++K  +      K++ EH+H
Sbjct: 324  SCEQIQEAVNEPVAVLIEKEDLGGPHLEKQSIEHKH 359


>ref|XP_022026622.1| uncharacterized protein LOC110927273 isoform X1 [Helianthus annuus]
 gb|OTG35596.1| putative zinc finger, C2H2 [Helianthus annuus]
          Length = 759

 Score =  147 bits (370), Expect = 1e-32
 Identities = 121/396 (30%), Positives = 186/396 (46%), Gaps = 27/396 (6%)
 Frame = -2

Query: 2289 MDIQDQDQTKHTTSSGHE----VHICHRCGWPFPNSHPSSKHRRAHKRICGTIEGYTKLI 2122
            MD+QD   T HTT SGH     VH+CHRCGWPFPN HPS+KHRRAHK+ICGTIEGYT +I
Sbjct: 1    MDVQDH--TTHTTHSGHHEGHGVHLCHRCGWPFPNPHPSAKHRRAHKKICGTIEGYTNII 58

Query: 2121 HSDPLSD-----DDEEENTPSPNIEKRIVKESGSGAVGILERSNRSEEDLFSDAVTEFPD 1957
             S+ +SD     DD++E +PSP +E +          G L RS    ED F+DAVTEF D
Sbjct: 59   GSEVVSDDDHLSDDDKEKSPSPKVEIK----------GSLARS----EDEFTDAVTEFAD 104

Query: 1956 AGISPVSEER----------LDDTKLLDSLTEARNVDHNLEVSGTPDNHLKEDKPAENDI 1807
             G +  + +R           D    L +  E   VD ++EVS   D+H+ E    + +I
Sbjct: 105  TGSAKKALDRDLFFSFKDAENDGANELLNAVETSKVDTSIEVSEKTDSHI-EKTEEKTEI 163

Query: 1806 NALPHLADSKIMLPESDTNSVPLVEDRNGSHKDKAIDIGEPLN-TVQTEQDSAKVIDQGE 1630
              +   ++S + L    T +   V+D             EP + ++  E  S  V++ GE
Sbjct: 164  KTVVDSSESNVEL----TGTGEPVKDGVAGPA-----ASEPTHASLPQEAKSIDVMEAGE 214

Query: 1629 EAVYVLSVPHDIPLVENAEPLLNDFKDHKSIHSSAPLCLDLDGDKDVERRSMVDTFEVKT 1450
            E              + +E +  + KD +   ++  +   + G  D     +    E+K 
Sbjct: 215  EKGQE---------SQTSETITEEAKDSEITKTTPEVSAIVGGCDDNVSEEI--KHEIKP 263

Query: 1449 TKEIIQESQTSENGESSIESAVSEHTNTGFSASLPAVVIDE--AKVPETRPLETSEIDTE 1276
              E + E       + ++     +  + G    L   V++E  A + ET+ LE   ++  
Sbjct: 264  DCEHVSEVAKETEVDQTVSVPNEKVEDLGAPKELIQEVVNEPVAVLTETKDLEEPALEKS 323

Query: 1275 SKDSLSEA-----EVSLDKHSVATLSETKENFEHEH 1183
            S + + EA      V ++K  +      K++ EH+H
Sbjct: 324  SCEQIQEAVNEPVAVLIEKEDLGGPHLEKQSIEHKH 359


>ref|XP_021972240.1| SNF2 domain-containing protein CLASSY 3-like isoform X1 [Helianthus
            annuus]
 gb|OTG19807.1| putative zinc finger, C2H2 [Helianthus annuus]
          Length = 755

 Score =  146 bits (368), Expect = 2e-32
 Identities = 127/416 (30%), Positives = 194/416 (46%), Gaps = 52/416 (12%)
 Frame = -2

Query: 2274 QDQTKHTTSSGHEVHICHRCGWPFPNSHPSSKHRRAHKRICGTIEGYTKLIHSDPLSDDD 2095
            +DQ  HT  SG  VH+CHRCGWPFPN HPS+KHRRAHK+ICGTI+GYT LI S+  SDDD
Sbjct: 4    KDQITHTPPSG--VHLCHRCGWPFPNPHPSAKHRRAHKKICGTIDGYTSLIGSEVGSDDD 61

Query: 2094 EEENTPSPNIEKRIVKESGSGAVGILERSNRSEEDLFSDAVTEFPDAGISPVSEERLDDT 1915
            +E+ TPSP I +++         G+    +RSE+++FSDAVTEFPD+G    S  +  D 
Sbjct: 62   KEK-TPSPKIVEKV--------GGMRVSFSRSEDEVFSDAVTEFPDSG----SASKTLDR 108

Query: 1914 KLLDSLTEARN---------------VDHNLEVSGTPDNHLKEDKPAE----NDINALPH 1792
             L  S  +A N               VD ++EVS   + H+++ +  E     D +   H
Sbjct: 109  DLFFSFKDAENDGANEFLNDPIEISKVDASVEVSEKINTHVEKSEETEIATVGDSSESNH 168

Query: 1791 LADSKIMLPESDTNSVPLVEDRNGSHKDKAIDI----GEPLNTVQTEQDSAKVIDQGE-- 1630
                  ++ +      P     + S + K+ID+     E     QT + S K     E  
Sbjct: 169  ELTDTGLVKDGVMGHAPEPGHISHSQETKSIDVVEAEEEKGQESQTSESSTKEAIDSELT 228

Query: 1629 ----EAVYVLSVPHDIPLVE------NAEPLLNDFKDHKSIHSSAPLCLDLDGDKDVERR 1480
                E   ++   HD    E      +   ++ND+K  +  +      L+ +   +V + 
Sbjct: 229  KTPPEVSAIVGGYHDNVKQEIKSDYVHVSEVVNDYKTVEETNIE-KFGLEYEVSSEVVKE 287

Query: 1479 SMVDTFEVKTTKEIIQESQTSENGE--------SSIE--SAVSEHTNTGFSASLPAVVID 1330
            S  D  E +  +EI+ +   S++ +        +S++  S + E  N+G        V D
Sbjct: 288  SETDRLE-EVAEEIMDKKSESDHKQDPEVAKEPASVQNVSVLIEKENSGAPNVQIQEVFD 346

Query: 1329 E--AKVPETRPLETSEIDTESKDSLSEA-----EVSLDKHSVATLSETKENFEHEH 1183
            E  A + E + LE  +++  S   + E       V  +K  +A     KE  EH H
Sbjct: 347  EPVAVITEKKDLEAHKLEKSSIQQIQEVVREPDTVLTEKEDLAGPHLEKELKEHTH 402