BLASTX nr result

ID: Chrysanthemum22_contig00005217 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum22_contig00005217
         (1673 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_022009214.1| uncharacterized protein LOC110908582 isoform...   452   e-150
ref|XP_022009215.1| uncharacterized protein LOC110908582 isoform...   443   e-146
gb|KVI08298.1| Zein-binding domain-containing protein [Cynara ca...   411   e-133
gb|PLY64780.1| hypothetical protein LSAT_2X44920 [Lactuca sativa]     363   e-116
ref|XP_023745705.1| myosin-binding protein 2-like isoform X2 [La...   363   e-116
ref|XP_023745704.1| myosin-binding protein 2-like isoform X1 [La...   363   e-116
ref|XP_022012351.1| myosin-binding protein 3-like [Helianthus an...   335   e-105
ref|XP_011091435.1| uncharacterized protein LOC105171881 isoform...   235   7e-65
ref|XP_010650354.1| PREDICTED: myosin-binding protein 3 isoform ...   233   7e-64
ref|XP_011091436.1| uncharacterized protein LOC105171881 isoform...   232   7e-64
ref|XP_011091434.1| uncharacterized protein LOC105171881 isoform...   232   8e-64
ref|XP_010650353.1| PREDICTED: myosin-binding protein 3 isoform ...   233   9e-64
ref|XP_012844596.1| PREDICTED: probable myosin-binding protein 6...   224   9e-63
gb|EYU31416.1| hypothetical protein MIMGU_mgv1a004094mg [Erythra...   224   1e-62
ref|XP_010241371.1| PREDICTED: uncharacterized protein LOC104585...   223   3e-60
ref|XP_010241370.1| PREDICTED: uncharacterized protein LOC104585...   223   4e-60
gb|KZV45378.1| hypothetical protein F511_05542 [Dorcoceras hygro...   220   1e-59
gb|PIA36041.1| hypothetical protein AQUCO_03400145v1 [Aquilegia ...   219   7e-59
gb|KZM85966.1| hypothetical protein DCAR_026612 [Daucus carota s...   216   5e-58
ref|XP_017223020.1| PREDICTED: myosin-binding protein 3 [Daucus ...   216   5e-58

>ref|XP_022009214.1| uncharacterized protein LOC110908582 isoform X1 [Helianthus annuus]
 gb|OTF97560.1| Protein of unknown function, DUF593 [Helianthus annuus]
          Length = 582

 Score =  452 bits (1164), Expect = e-150
 Identities = 279/480 (58%), Positives = 323/480 (67%), Gaps = 42/480 (8%)
 Frame = -1

Query: 1382 MNRSFWTFESLVGAFLDLFIAYFLLCGSAIALFAIKILGLFGLTLPSAYDI-------ST 1224
            MNR FWTF++LVGAFLDLFIAYFLLCGS IALFA+K LGLFGL+LP   +        S 
Sbjct: 1    MNRRFWTFDTLVGAFLDLFIAYFLLCGSTIALFAVKFLGLFGLSLPVNNNNGLFGNPNSG 60

Query: 1223 FRNLLVDYPTQRVSSVQYSVIRKFPFDAIFSRANSTN-NGNEN-DDGVNLDNGNRFRELE 1050
            FR+LLVDYPT +VS+VQ+S  RKFPFD++F RA ++N NG  N D GV    GN F ELE
Sbjct: 61   FRSLLVDYPTDKVSAVQFSASRKFPFDSVFFRAQNSNLNGELNVDRGV----GNGFMELE 116

Query: 1049 GEGEASCSSKSDARKVVRNEV-------------DVKGKGGLNYRMXXXXXXXXXXXFDS 909
            GE  ASC SKSD RKV R+ +             DVKGKG LNYR+           FDS
Sbjct: 117  GE--ASCGSKSDGRKV-RSRIGDSGIPMDKERGFDVKGKGALNYRLRGGFRRRRKAVFDS 173

Query: 908  G--KESSVSSSTNWVTCVVDQGRNDDNKEVGGGFDGSSVASGGNAGTYEAESHLVEKSYN 735
            G  K SSVSSS NW+TCV  Q  ND++    GG DGSS+ SG N G YEAE+ +  KS N
Sbjct: 174  GLQKHSSVSSSPNWITCVDQQSSNDNDS---GGPDGSSIVSGANNGNYEAETPV--KSDN 228

Query: 734  L------NDIIKMIPVNEADKDSMIVQLARELEDEQATRAALYVELEKERNXXXXXXXXA 573
            +       D  KMIPVNEADKD MIV L RELE+ +  R+ALYVELEKERN        A
Sbjct: 229  IFEGIVFGDPQKMIPVNEADKDKMIVILTRELEESETARSALYVELEKERNAAATAADEA 288

Query: 572  MSMILRLQEEKASIEMELRQYQRMIEEKSAYDEEEMNIMKEIVLRREREKHFLEKEVEAY 393
            MSMILRLQE+KASIEME RQYQRMIEEKSAYDEEEMNI+KEIVLRREREKHFLEKEV+AY
Sbjct: 289  MSMILRLQEDKASIEMEARQYQRMIEEKSAYDEEEMNILKEIVLRREREKHFLEKEVDAY 348

Query: 392  RQMTHHENDQFYHGDDQDSNEDPDLM-----------KNARLYEDAVADSVKEEEPEKTI 246
            RQM   ENDQF  G ++D  +DP+ M           KN++L+ED   D  K EEPEKTI
Sbjct: 349  RQMLRIENDQFNGGSNEDFTQDPEFMLQQLSMNISEKKNSKLFED--VDFSKTEEPEKTI 406

Query: 245  AIVGEEKSREMEANDATDSRVYDVH-IDHGPKSSEKVNGAKKQSDQNRSGSECSVGLPPV 69
             IV EEK  E++A+   ++   DVH ID+  KS+      KKQ D+  SGSE S GLPPV
Sbjct: 407  PIVEEEKGSEVDASRVGETDSRDVHVIDNESKST---GSKKKQIDRKPSGSETSSGLPPV 463


>ref|XP_022009215.1| uncharacterized protein LOC110908582 isoform X2 [Helianthus annuus]
          Length = 580

 Score =  443 bits (1139), Expect = e-146
 Identities = 273/476 (57%), Positives = 317/476 (66%), Gaps = 38/476 (7%)
 Frame = -1

Query: 1382 MNRSFWTFESLVGAFLDLFIAYFLLCGSAIALFAIKILGLFGLTLPSAYDI-------ST 1224
            MNR FWTF++LVGAFLDLFIAYFLLCGS IALFA+K LGLFGL+LP   +        S 
Sbjct: 1    MNRRFWTFDTLVGAFLDLFIAYFLLCGSTIALFAVKFLGLFGLSLPVNNNNGLFGNPNSG 60

Query: 1223 FRNLLVDYPTQRVSSVQYSVIRKFPFDAIFSRANSTN-NGNEN-DDGVNLDNGNRFRELE 1050
            FR+LLVDYPT +VS+VQ+S  RKFPFD++F RA ++N NG  N D GV    GN F ELE
Sbjct: 61   FRSLLVDYPTDKVSAVQFSASRKFPFDSVFFRAQNSNLNGELNVDRGV----GNGFMELE 116

Query: 1049 GEGEASCSSKSDARKVVRNEV-------------DVKGKGGLNYRMXXXXXXXXXXXFDS 909
            GE  ASC SKSD RKV R+ +             DVKGKG LNYR+           FDS
Sbjct: 117  GE--ASCGSKSDGRKV-RSRIGDSGIPMDKERGFDVKGKGALNYRLRGGFRRRRKAVFDS 173

Query: 908  G--KESSVSSSTNWVTCVVDQGRNDDNKEVGGGFDGSSVASGGNAGTYEA--ESHLVEKS 741
            G  K SSVSSS NW+TCV  Q  ND++    GG DGSS+ SG N    E   +S  + + 
Sbjct: 174  GLQKHSSVSSSPNWITCVDQQSSNDNDS---GGPDGSSIVSGANNDEAETPVKSDNIFEG 230

Query: 740  YNLNDIIKMIPVNEADKDSMIVQLARELEDEQATRAALYVELEKERNXXXXXXXXAMSMI 561
                D  KMIPVNEADKD MIV L RELE+ +  R+ALYVELEKERN        AMSMI
Sbjct: 231  IVFGDPQKMIPVNEADKDKMIVILTRELEESETARSALYVELEKERNAAATAADEAMSMI 290

Query: 560  LRLQEEKASIEMELRQYQRMIEEKSAYDEEEMNIMKEIVLRREREKHFLEKEVEAYRQMT 381
            LRLQE+KASIEME RQYQRMIEEKSAYDEEEMNI+KEIVLRREREKHFLEKEV+AYRQM 
Sbjct: 291  LRLQEDKASIEMEARQYQRMIEEKSAYDEEEMNILKEIVLRREREKHFLEKEVDAYRQML 350

Query: 380  HHENDQFYHGDDQDSNEDPDLM-----------KNARLYEDAVADSVKEEEPEKTIAIVG 234
              ENDQF  G ++D  +DP+ M           KN++L+ED   D  K EEPEKTI IV 
Sbjct: 351  RIENDQFNGGSNEDFTQDPEFMLQQLSMNISEKKNSKLFED--VDFSKTEEPEKTIPIVE 408

Query: 233  EEKSREMEANDATDSRVYDVH-IDHGPKSSEKVNGAKKQSDQNRSGSECSVGLPPV 69
            EEK  E++A+   ++   DVH ID+  KS+      KKQ D+  SGSE S GLPPV
Sbjct: 409  EEKGSEVDASRVGETDSRDVHVIDNESKST---GSKKKQIDRKPSGSETSSGLPPV 461


>gb|KVI08298.1| Zein-binding domain-containing protein [Cynara cardunculus var.
            scolymus]
          Length = 609

 Score =  411 bits (1056), Expect = e-133
 Identities = 254/503 (50%), Positives = 314/503 (62%), Gaps = 69/503 (13%)
 Frame = -1

Query: 1370 FWTFESLVGAFLDLFIAYFLLCGSAIALFAIKILGLFGLTLPSAYDI---STFRNLLVDY 1200
            FWTF +LVGAFLDLF+AYFLLCGS IA FA K LG FG +L + Y++   S F NLL DY
Sbjct: 9    FWTFNTLVGAFLDLFVAYFLLCGSTIAFFADKFLGFFGFSLSTPYNVFFDSDFTNLLFDY 68

Query: 1199 PTQRVSSVQYSVIRKFPFDAIFSRANSTNNGNENDDGVNLDNGNRFRELEGEGEASCSSK 1020
            PT ++S VQ++V RKFPFD+IF R     NG+   DG+NLD G+ FRELEGE  ASCSS 
Sbjct: 69   PTDKISDVQFAVARKFPFDSIFFRIQ---NGH-GSDGLNLDRGDGFRELEGE--ASCSSI 122

Query: 1019 SDARKVVRNEVD-------------VKGKGGLNYRMXXXXXXXXXXXFDSGKESSVSSST 879
            SDARKVVRNE D             +KGKG  NYR+            DSGK SSVS+S 
Sbjct: 123  SDARKVVRNETDDSAVRFEKGRGFDMKGKGAANYRVRGSIRRRRKTSLDSGKHSSVSTSP 182

Query: 878  NWVTCVVDQGRNDDNKEVGGGFDGSSVASGGNAGTYEAESHLVEKS-------------Y 738
             W+TCV DQ   +++KE     +GS + SG N+  YEAE+ +V KS              
Sbjct: 183  TWITCVEDQ---NNHKESNAQLEGSLILSGANSSNYEAETPMVVKSDGRFLDDVSNESGN 239

Query: 737  NLNDIIKMIPVNEADKDSMIVQLARELEDEQATRAALYVELEKERNXXXXXXXXAMSMIL 558
            N + + + I  ++ D+D+ IV L +ELE E+A R+ALYVEL++ERN        AM+MIL
Sbjct: 240  NPSGLKERIATDDGDEDNRIVSLTQELEVERAARSALYVELDEERNAAATAADEAMAMIL 299

Query: 557  RLQEEKASIEMELRQYQRMIEEKSAYDEEEMNIMKEIVLRREREKHFLEKEVEAYRQMTH 378
            RLQEEKASIEME RQY+RMIEEKSAYD EEMNI+KEI+LRREREKHFLEKEVEAYRQM  
Sbjct: 300  RLQEEKASIEMESRQYKRMIEEKSAYDLEEMNILKEILLRREREKHFLEKEVEAYRQMDR 359

Query: 377  HENDQFYHGDDQDSNEDPDL--------MKNARLYEDAVADSVKEEEPEKTIAIVGEEKS 222
             ENDQ    + QD NEDPDL        + N +   +   +  K E+ EK IAIVGE   
Sbjct: 360  LENDQLSGINVQDFNEDPDLILHELSMSIANRKNSGNEDLELSKREDIEKPIAIVGEVPD 419

Query: 221  REMEANDA-------------TDSRVYDVH-IDHGPKSSEKVNGAK-------------- 126
             EM+A  A              +S VY+VH ID+ PK+S++  G+K              
Sbjct: 420  LEMKAGHAFNGNKELYKQRTEKESLVYNVHMIDNEPKTSDESKGSKKRPTMDETDGSFLR 479

Query: 125  ---KQSDQNRSGSECSVG-LPPV 69
               K++DQNRSGSE +VG LPP+
Sbjct: 480  RLEKEADQNRSGSEAAVGRLPPI 502


>gb|PLY64780.1| hypothetical protein LSAT_2X44920 [Lactuca sativa]
          Length = 527

 Score =  363 bits (933), Expect = e-116
 Identities = 234/474 (49%), Positives = 289/474 (60%), Gaps = 18/474 (3%)
 Frame = -1

Query: 1370 FWTFESLVGAFLDLFIAYFLLCGSAIALFAIKILGLFGLTLPSAYDI--STFRNLLVDYP 1197
            FWTF SLVGAFLDL IAYFLLCGS IA FA K LG FGL+LPS + I  S   +LL+DYP
Sbjct: 8    FWTFNSLVGAFLDLSIAYFLLCGSTIAFFAGKFLGFFGLSLPSPFGIPNSDLNSLLLDYP 67

Query: 1196 TQRVSSVQYSVIRKFPFDAIFSRANSTNNGNENDDGVNLDNGNRFRELEGEGEASCSSKS 1017
            T ++S+VQ+SV RKFPFD+IF    + N      DG+NL  G R +ELEGE  AS SS S
Sbjct: 68   TDKISAVQFSVTRKFPFDSIFFSVQNCNAS----DGLNLGRGERVKELEGE--ASSSSIS 121

Query: 1016 DARKVVRNEVD-------------VKGKGGLNYRMXXXXXXXXXXXFDSGKESSVSSSTN 876
            DARK V+NE D             +KGKG LN+R+            DSGK SSVSSS N
Sbjct: 122  DARKAVKNETDDSGVKIEKERGFDMKGKGALNHRVRGSFRRRKKSSLDSGKRSSVSSSPN 181

Query: 875  WVTCVVDQGRNDDNKEVGGGFDGSSVASGGNAGTYEAESHLVEKSYNLNDIIKMIPVNEA 696
            W+TCV DQ  N+  KE   G   +S+ SG N+  YEA +   E     ND  + IP+NE 
Sbjct: 182  WITCV-DQQSNE--KEHVAGVAENSILSGANSCNYEAHTPTAE-----NDSQERIPINEE 233

Query: 695  DKDSMIVQLARELEDEQATRAALYVELEKERNXXXXXXXXAMSMILRLQEEKASIEMELR 516
            DK + I+ L REL +EQ  RAALY+ELEKER+        AM+MILRLQEEKASIEME R
Sbjct: 234  DKANTILLLTRELIEEQDARAALYIELEKERSAAATAADEAMAMILRLQEEKASIEMESR 293

Query: 515  QYQRMIEEKSAYDEEEMNIMKEIVLRREREKHFLEKEVEAYRQMTHHENDQFYHGDDQDS 336
            QYQRMIEEKSAYD EEMNI+KEIVLRRE EKHFLEKEVEAYRQ T  E++Q Y+      
Sbjct: 294  QYQRMIEEKSAYDAEEMNILKEIVLRREMEKHFLEKEVEAYRQ-TSPESNQNYNNTLASG 352

Query: 335  NEDPD-LMKNARLYEDAVADSVK--EEEPEKTIAIVGEEKSREMEANDATDSRVYDVHID 165
            + DPD ++++  +  D     +K  ++  EKTIAIV EE  +E +         Y+ +  
Sbjct: 353  DNDPDRILRDLSMSIDLSKQKLKIDKQLSEKTIAIVEEEHKKETDV-------FYETNGK 405

Query: 164  HGPKSSEKVNGAKKQSDQNRSGSECSVGLPPVXXXXXXXXXXXXXXXLDNERTK 3
               ++     G  K+++ +       +GLPP+               LDNERTK
Sbjct: 406  EEEEAEVSSGGVVKKAEVSG-----GMGLPPM-GSKWKILRRNSTSALDNERTK 453


>ref|XP_023745705.1| myosin-binding protein 2-like isoform X2 [Lactuca sativa]
          Length = 532

 Score =  363 bits (933), Expect = e-116
 Identities = 234/474 (49%), Positives = 289/474 (60%), Gaps = 18/474 (3%)
 Frame = -1

Query: 1370 FWTFESLVGAFLDLFIAYFLLCGSAIALFAIKILGLFGLTLPSAYDI--STFRNLLVDYP 1197
            FWTF SLVGAFLDL IAYFLLCGS IA FA K LG FGL+LPS + I  S   +LL+DYP
Sbjct: 8    FWTFNSLVGAFLDLSIAYFLLCGSTIAFFAGKFLGFFGLSLPSPFGIPNSDLNSLLLDYP 67

Query: 1196 TQRVSSVQYSVIRKFPFDAIFSRANSTNNGNENDDGVNLDNGNRFRELEGEGEASCSSKS 1017
            T ++S+VQ+SV RKFPFD+IF    + N      DG+NL  G R +ELEGE  AS SS S
Sbjct: 68   TDKISAVQFSVTRKFPFDSIFFSVQNCNAS----DGLNLGRGERVKELEGE--ASSSSIS 121

Query: 1016 DARKVVRNEVD-------------VKGKGGLNYRMXXXXXXXXXXXFDSGKESSVSSSTN 876
            DARK V+NE D             +KGKG LN+R+            DSGK SSVSSS N
Sbjct: 122  DARKAVKNETDDSGVKIEKERGFDMKGKGALNHRVRGSFRRRKKSSLDSGKRSSVSSSPN 181

Query: 875  WVTCVVDQGRNDDNKEVGGGFDGSSVASGGNAGTYEAESHLVEKSYNLNDIIKMIPVNEA 696
            W+TCV DQ  N+  KE   G   +S+ SG N+  YEA +   E     ND  + IP+NE 
Sbjct: 182  WITCV-DQQSNE--KEHVAGVAENSILSGANSCNYEAHTPTAE-----NDSQERIPINEE 233

Query: 695  DKDSMIVQLARELEDEQATRAALYVELEKERNXXXXXXXXAMSMILRLQEEKASIEMELR 516
            DK + I+ L REL +EQ  RAALY+ELEKER+        AM+MILRLQEEKASIEME R
Sbjct: 234  DKANTILLLTRELIEEQDARAALYIELEKERSAAATAADEAMAMILRLQEEKASIEMESR 293

Query: 515  QYQRMIEEKSAYDEEEMNIMKEIVLRREREKHFLEKEVEAYRQMTHHENDQFYHGDDQDS 336
            QYQRMIEEKSAYD EEMNI+KEIVLRRE EKHFLEKEVEAYRQ T  E++Q Y+      
Sbjct: 294  QYQRMIEEKSAYDAEEMNILKEIVLRREMEKHFLEKEVEAYRQ-TSPESNQNYNNTLASG 352

Query: 335  NEDPD-LMKNARLYEDAVADSVK--EEEPEKTIAIVGEEKSREMEANDATDSRVYDVHID 165
            + DPD ++++  +  D     +K  ++  EKTIAIV EE  +E +         Y+ +  
Sbjct: 353  DNDPDRILRDLSMSIDLSKQKLKIDKQLSEKTIAIVEEEHKKETDV-------FYETNGK 405

Query: 164  HGPKSSEKVNGAKKQSDQNRSGSECSVGLPPVXXXXXXXXXXXXXXXLDNERTK 3
               ++     G  K+++ +       +GLPP+               LDNERTK
Sbjct: 406  EEEEAEVSSGGVVKKAEVSG-----GMGLPPM-GSKWKILRRNSTSALDNERTK 453


>ref|XP_023745704.1| myosin-binding protein 2-like isoform X1 [Lactuca sativa]
          Length = 541

 Score =  363 bits (933), Expect = e-116
 Identities = 234/474 (49%), Positives = 289/474 (60%), Gaps = 18/474 (3%)
 Frame = -1

Query: 1370 FWTFESLVGAFLDLFIAYFLLCGSAIALFAIKILGLFGLTLPSAYDI--STFRNLLVDYP 1197
            FWTF SLVGAFLDL IAYFLLCGS IA FA K LG FGL+LPS + I  S   +LL+DYP
Sbjct: 8    FWTFNSLVGAFLDLSIAYFLLCGSTIAFFAGKFLGFFGLSLPSPFGIPNSDLNSLLLDYP 67

Query: 1196 TQRVSSVQYSVIRKFPFDAIFSRANSTNNGNENDDGVNLDNGNRFRELEGEGEASCSSKS 1017
            T ++S+VQ+SV RKFPFD+IF    + N      DG+NL  G R +ELEGE  AS SS S
Sbjct: 68   TDKISAVQFSVTRKFPFDSIFFSVQNCNAS----DGLNLGRGERVKELEGE--ASSSSIS 121

Query: 1016 DARKVVRNEVD-------------VKGKGGLNYRMXXXXXXXXXXXFDSGKESSVSSSTN 876
            DARK V+NE D             +KGKG LN+R+            DSGK SSVSSS N
Sbjct: 122  DARKAVKNETDDSGVKIEKERGFDMKGKGALNHRVRGSFRRRKKSSLDSGKRSSVSSSPN 181

Query: 875  WVTCVVDQGRNDDNKEVGGGFDGSSVASGGNAGTYEAESHLVEKSYNLNDIIKMIPVNEA 696
            W+TCV DQ  N+  KE   G   +S+ SG N+  YEA +   E     ND  + IP+NE 
Sbjct: 182  WITCV-DQQSNE--KEHVAGVAENSILSGANSCNYEAHTPTAE-----NDSQERIPINEE 233

Query: 695  DKDSMIVQLARELEDEQATRAALYVELEKERNXXXXXXXXAMSMILRLQEEKASIEMELR 516
            DK + I+ L REL +EQ  RAALY+ELEKER+        AM+MILRLQEEKASIEME R
Sbjct: 234  DKANTILLLTRELIEEQDARAALYIELEKERSAAATAADEAMAMILRLQEEKASIEMESR 293

Query: 515  QYQRMIEEKSAYDEEEMNIMKEIVLRREREKHFLEKEVEAYRQMTHHENDQFYHGDDQDS 336
            QYQRMIEEKSAYD EEMNI+KEIVLRRE EKHFLEKEVEAYRQ T  E++Q Y+      
Sbjct: 294  QYQRMIEEKSAYDAEEMNILKEIVLRREMEKHFLEKEVEAYRQ-TSPESNQNYNNTLASG 352

Query: 335  NEDPD-LMKNARLYEDAVADSVK--EEEPEKTIAIVGEEKSREMEANDATDSRVYDVHID 165
            + DPD ++++  +  D     +K  ++  EKTIAIV EE  +E +         Y+ +  
Sbjct: 353  DNDPDRILRDLSMSIDLSKQKLKIDKQLSEKTIAIVEEEHKKETDV-------FYETNGK 405

Query: 164  HGPKSSEKVNGAKKQSDQNRSGSECSVGLPPVXXXXXXXXXXXXXXXLDNERTK 3
               ++     G  K+++ +       +GLPP+               LDNERTK
Sbjct: 406  EEEEAEVSSGGVVKKAEVSG-----GMGLPPM-GSKWKILRRNSTSALDNERTK 453


>ref|XP_022012351.1| myosin-binding protein 3-like [Helianthus annuus]
 gb|OTF95547.1| putative zein-binding domain-containing protein [Helianthus annuus]
          Length = 485

 Score =  335 bits (859), Expect = e-105
 Identities = 221/479 (46%), Positives = 267/479 (55%), Gaps = 19/479 (3%)
 Frame = -1

Query: 1382 MNRSFWTFESLVGAFLDLFIAYFLLCGSAIALFAIKILGLFGLTLPS---------AYDI 1230
            M   FWTF +LVGAF+DLFIAYFLLCGS +ALFA+  LGLFGL+LPS         ++D 
Sbjct: 1    MKNRFWTFNTLVGAFVDLFIAYFLLCGSTVALFAVNFLGLFGLSLPSNGYGFVGMISFD- 59

Query: 1229 STFRNLLVDYPTQRVSSVQYSVIRKFPFDAIFSRANSTNNGNENDDGVNLDNGNRFRELE 1050
              + +LLVDYPT +VS V++S                                       
Sbjct: 60   --YTSLLVDYPTDKVSGVRFSA-------------------------------------- 79

Query: 1049 GEGEASCSSKSDARKVVRNEVDVKGKGGLNYRMXXXXXXXXXXXFDSGKESSVSSSTNWV 870
                         RK   + + V G GG +               D   E   S  +NW+
Sbjct: 80   ------------TRKFPFDSIFVSGCGGCDLE-------------DDLSEGEGSCCSNWI 114

Query: 869  TCVVDQGRNDDNKEVGGGFDGSSVASGGNAGTYEAESHLVEKSYNLNDIIKMIPVNEADK 690
            TCV D+  ND      GG + +S+ S  N   Y  +  L +         K IPVNEADK
Sbjct: 115  TCVDDEQSND------GGQEATSIPSDCNFSRYHEDDVLSDPKATQ----KRIPVNEADK 164

Query: 689  DSMIVQLARELEDEQATRAALYVELEKERNXXXXXXXXAMSMILRLQEEKASIEMELRQY 510
            D MIV L RELE+ +A RAALYVELEKERN        AMSMILRLQEEKA ++ME RQY
Sbjct: 165  DKMIVLLTRELEESKAARAALYVELEKERNASASAADEAMSMILRLQEEKAVVKMESRQY 224

Query: 509  QRMIEEKSAYDEEEMNIMKEIVLRREREKHFLEKEVEAYRQMTHHENDQFYHGDDQDSNE 330
            QRMIEEKSAYD EEMNI+KEIVLRREREKHFLEK+VEAYRQM   END+   G++QD   
Sbjct: 225  QRMIEEKSAYDAEEMNILKEIVLRREREKHFLEKQVEAYRQMISVENDRINGGNNQDFVH 284

Query: 329  DPDLM----KNARLYEDAVADSVKEEEPEKTIAIVGEEKSREMEANDA-----TDSRVYD 177
            DPDLM     ++RL+ED   D  ++EEPEKTI   GE+K  ++   DA     + S +YD
Sbjct: 285  DPDLMLHQLSSSRLFED--IDFSEQEEPEKTI---GEDKGLDITPGDAYRGGESGSHIYD 339

Query: 176  VH-IDHGPKSSEKVNGAKKQSDQNRSGSECSVGLPPVXXXXXXXXXXXXXXXLDNERTK 3
            VH ID+ PKS EK NG  KQSDQ ++GS   VGLPPV               LDNERT+
Sbjct: 340  VHVIDNEPKSREKSNGYMKQSDQEQNGSVTRVGLPPV--SSKSSLRRYSTSALDNERTR 396


>ref|XP_011091435.1| uncharacterized protein LOC105171881 isoform X3 [Sesamum indicum]
          Length = 755

 Score =  235 bits (599), Expect = 7e-65
 Identities = 184/483 (38%), Positives = 241/483 (49%), Gaps = 55/483 (11%)
 Frame = -1

Query: 1367 WTFESLVGAFLDLFIAYFLLCGSAIALFAIKILGLFGLTLPSAYDISTF---------RN 1215
            W+   LV AFLDL IAY LLC SA+A  A K +G FGL LP   D   F           
Sbjct: 9    WSLSGLVAAFLDLVIAYLLLCASAVAYLASKFMGFFGLNLPCPCDGIVFNIHSKSFCLNR 68

Query: 1214 LLVDYPTQRVSSVQYSVIRKFPFDAIFSRANSTNN--GNENDDGVNLDNGNRFRELEGEG 1041
            LLVD+PTQRV  VQ SV  KFPF    S  N  ++  G+   +G+          LE EG
Sbjct: 69   LLVDFPTQRVLDVQLSVKEKFPFSDCISGKNRVSDLVGDNYGNGI----------LEIEG 118

Query: 1040 EASCSSKSDARK---VVRNEV-------DVKGKGGLNYRMXXXXXXXXXXXFDSGKESSV 891
            EASCSS SDARK   V R E+       D+KGKG +N+R               GK SSV
Sbjct: 119  EASCSSVSDARKPADVARKELGSRAEKYDMKGKGVINHRPKSRLRQRRKGGGGLGKYSSV 178

Query: 890  SSS----TNWVTCVVDQGRNDDNKEVGGGFDGSSVASGGNAGTYEAESHL---------- 753
            +SS          V        NKE  G    SS+    +A  +  ES            
Sbjct: 179  ASSDPPLLEGGLVVEPYSHCSTNKEGNGLVGDSSLPVENDAEVHNLESTTEAEVGPRAVT 238

Query: 752  -VEKSYNLNDII----KMIPVNE------------ADKDSMIVQLARELEDEQATRAALY 624
              E +++L++ +     ++ + E             D+ S I  L + LE+E+  RAALY
Sbjct: 239  SCEMNHSLDEDMPIKKNVLSIEELQSNPQGLQSFSGDEKSTIRLLEQTLEEERNARAALY 298

Query: 623  VELEKERNXXXXXXXXAMSMILRLQEEKASIEMELRQYQRMIEEKSAYDEEEMNIMKEIV 444
            VELEKER+        AM+MILRLQEEKASIEME RQYQRMIEEKS YD EEM+I+KEI+
Sbjct: 299  VELEKERSAAATAADEAMAMILRLQEEKASIEMEARQYQRMIEEKSVYDAEEMDILKEIL 358

Query: 443  LRREREKHFLEKEVEAYRQMTHHENDQFYHGDDQDSNEDPDLMKNARLYEDAVADSVKEE 264
            LRRE+EKHFLEKEVEAYR +    ++Q   GD  D +        AR   D   D+   +
Sbjct: 359  LRREKEKHFLEKEVEAYRMIVSVGDEQL-AGDGSDKS-------YARQLFDLPLDT--ND 408

Query: 263  EPEKTIAIVGEEKSREMEANDATDSRVYDVH-IDHGP--KSSEKVNGAKKQSDQNRSGSE 93
            +P   +  +     +    N  ++ +    H   +G   +  ++ +  KKQ D ++   +
Sbjct: 409  DPVLILRRLAASTDKRTLENKCSEDKPNSEHAFGNGTLVECQDETSSFKKQGDSDKQSVQ 468

Query: 92   CSV 84
             SV
Sbjct: 469  LSV 471


>ref|XP_010650354.1| PREDICTED: myosin-binding protein 3 isoform X2 [Vitis vinifera]
 emb|CBI17315.3| unnamed protein product, partial [Vitis vinifera]
          Length = 797

 Score =  233 bits (594), Expect = 7e-64
 Identities = 176/459 (38%), Positives = 228/459 (49%), Gaps = 62/459 (13%)
 Frame = -1

Query: 1367 WTFESLVGAFLDLFIAYFLLCGSAIALFAIKILGLFGLTLPSAYD--------ISTFRNL 1212
            WTF  LVGA+LDL IAY LLCGS +A FA K L  FGL LP   +         +  +  
Sbjct: 9    WTFGGLVGAYLDLAIAYLLLCGSTLAFFASKFLSFFGLCLPCPCNGFFGNPNGDNCLQKF 68

Query: 1211 LVDYPTQRVSSVQYSVIRKFPFDAIFSRANSTNNGNENDDGVNLDNGNRFRELEGEGEAS 1032
            LVDYPT+R+SSVQ  V  KFPFD++++   S +   +   G N D+G     +  EGEAS
Sbjct: 69   LVDYPTERISSVQLCVKSKFPFDSVWANEGSPHPNWKLLKGRNSDDG----AVGLEGEAS 124

Query: 1031 CSSKSDARK---------VVRN--------------EVDVKGKGGLNYRMXXXXXXXXXX 921
            CSS  D  +         + RN              + D KGK   N R           
Sbjct: 125  CSSFWDVMRSPDIAGKDSISRNGSCGVMNTPALKEGKSDTKGKRVSNQRPKTGVRRRRRS 184

Query: 920  XFDSGKESSVSSSTNWVTCVVDQGRNDDN-KEVGGGFDGSSVASGGNAGTYEAESHLVEK 744
              D GK SSVSS            R+  +  E G  F G ++    + G    +  LV  
Sbjct: 185  AVDHGKFSSVSSFDPPRLDAPSGLRSPSSVSETGEAFVGKTLVPDASGGEDGFQDELVPI 244

Query: 743  SYNLND-IIKMIPVNE-------ADKDSMIVQ----------------------LARELE 654
              +L +  +  I +NE       ++KD+   +                      L + LE
Sbjct: 245  LIDLGERALHGIKLNEHIDEDKPSEKDASSAEEVKCNARGKLSFNGNTENTVRVLEQALE 304

Query: 653  DEQATRAALYVELEKERNXXXXXXXXAMSMILRLQEEKASIEMELRQYQRMIEEKSAYDE 474
            +E A RAALY ELEKER+        AM+MILR+QEEKASIEME RQ+QR+IEEKSAYD 
Sbjct: 305  EEHAARAALYHELEKERSAAASAADEAMAMILRIQEEKASIEMEARQFQRIIEEKSAYDA 364

Query: 473  EEMNIMKEIVLRREREKHFLEKEVEAYRQMTHHENDQFYHGDDQDSNEDPDLMKNARLYE 294
            EEMN++KEI+LRREREKHFLEKEVEAYRQM   END    G+  D  + P+    + LY 
Sbjct: 365  EEMNLLKEILLRREREKHFLEKEVEAYRQMMFSEND-LLEGNTHDIVDTPEQRPISSLY- 422

Query: 293  DAVADSVKEEEPEKTIAIVGEEKSREMEANDATDSRVYD 177
                     E+P   +  + E   +E +  DA    VY+
Sbjct: 423  -------LSEDPVLMLRRISESIDKEEKVKDADRCSVYE 454


>ref|XP_011091436.1| uncharacterized protein LOC105171881 isoform X2 [Sesamum indicum]
          Length = 755

 Score =  232 bits (592), Expect = 7e-64
 Identities = 184/487 (37%), Positives = 241/487 (49%), Gaps = 59/487 (12%)
 Frame = -1

Query: 1367 WTFESLVGAFLDLFIAYFLLCGSAIALFAIKILGLFGLTLPSAYDISTF---------RN 1215
            W+   LV AFLDL IAY LLC SA+A  A K +G FGL LP   D   F           
Sbjct: 9    WSLSGLVAAFLDLVIAYLLLCASAVAYLASKFMGFFGLNLPCPCDGIVFNIHSKSFCLNR 68

Query: 1214 LLVDYPTQRVSSVQYSVIRKFPFDAIFSRANSTNN--GNENDDGVNLDNGNRFRELEGEG 1041
            LLVD+PTQRV  VQ SV  KFPF    S  N  ++  G+   +G+          LE EG
Sbjct: 69   LLVDFPTQRVLDVQLSVKEKFPFSDCISGKNRVSDLVGDNYGNGI----------LEIEG 118

Query: 1040 EASCSSKSDARK---VVRNEV-------DVKGKGGLNYRMXXXXXXXXXXXFDSGKESSV 891
            EASCSS SDARK   V R E+       D+KGKG +N+R               GK SSV
Sbjct: 119  EASCSSVSDARKPADVARKELGSRAEKYDMKGKGVINHRPKSRLRQRRKGGGGLGKYSSV 178

Query: 890  SSSTNWVT---CVVDQGRNDDNKEVGGGFDGSSVASGGN-----------AGTYEAE--- 762
            +SS   +     VV+   +    + G G  G S     N             T EAE   
Sbjct: 179  ASSDPPLLEGGLVVEPYSHCSTNKEGNGLVGDSSLPVENDAEVHNLEYDDKATTEAEVGP 238

Query: 761  --------SHLVEKSYNLNDIIKMIPVNEA----------DKDSMIVQLARELEDEQATR 636
                    +H +++   +   +  I   ++          D+ S I  L + LE+E+  R
Sbjct: 239  RAVTSCEMNHSLDEDMPIKKNVLSIEELQSNPQGLQSFSGDEKSTIRLLEQTLEEERNAR 298

Query: 635  AALYVELEKERNXXXXXXXXAMSMILRLQEEKASIEMELRQYQRMIEEKSAYDEEEMNIM 456
            AALYVELEKER+        AM+MILRLQEEKASIEME RQYQRMIEEKS YD EEM+I+
Sbjct: 299  AALYVELEKERSAAATAADEAMAMILRLQEEKASIEMEARQYQRMIEEKSVYDAEEMDIL 358

Query: 455  KEIVLRREREKHFLEKEVEAYRQMTHHENDQFYHGDDQDSNEDPDLMKNARLYEDAVADS 276
            KEI+LRRE+EKHFLEKEVEAYR +    ++Q   GD  D +        AR   D   D+
Sbjct: 359  KEILLRREKEKHFLEKEVEAYRMIVSVGDEQL-AGDGSDKS-------YARQLFDLPLDT 410

Query: 275  VKEEEPEKTIAIVGEEKSREMEANDATDSRVYDVH-IDHGP--KSSEKVNGAKKQSDQNR 105
               ++P   +  +     +    N  ++ +    H   +G   +  ++ +  KKQ D ++
Sbjct: 411  --NDDPVLILRRLAASTDKRTLENKCSEDKPNSEHAFGNGTLVECQDETSSFKKQGDSDK 468

Query: 104  SGSECSV 84
               + SV
Sbjct: 469  QSVQLSV 475


>ref|XP_011091434.1| uncharacterized protein LOC105171881 isoform X1 [Sesamum indicum]
          Length = 759

 Score =  232 bits (592), Expect = 8e-64
 Identities = 184/487 (37%), Positives = 241/487 (49%), Gaps = 59/487 (12%)
 Frame = -1

Query: 1367 WTFESLVGAFLDLFIAYFLLCGSAIALFAIKILGLFGLTLPSAYDISTF---------RN 1215
            W+   LV AFLDL IAY LLC SA+A  A K +G FGL LP   D   F           
Sbjct: 9    WSLSGLVAAFLDLVIAYLLLCASAVAYLASKFMGFFGLNLPCPCDGIVFNIHSKSFCLNR 68

Query: 1214 LLVDYPTQRVSSVQYSVIRKFPFDAIFSRANSTNN--GNENDDGVNLDNGNRFRELEGEG 1041
            LLVD+PTQRV  VQ SV  KFPF    S  N  ++  G+   +G+          LE EG
Sbjct: 69   LLVDFPTQRVLDVQLSVKEKFPFSDCISGKNRVSDLVGDNYGNGI----------LEIEG 118

Query: 1040 EASCSSKSDARK---VVRNEV-------DVKGKGGLNYRMXXXXXXXXXXXFDSGKESSV 891
            EASCSS SDARK   V R E+       D+KGKG +N+R               GK SSV
Sbjct: 119  EASCSSVSDARKPADVARKELGSRAEKYDMKGKGVINHRPKSRLRQRRKGGGGLGKYSSV 178

Query: 890  SSSTNWVT---CVVDQGRNDDNKEVGGGFDGSSVASGGN-----------AGTYEAE--- 762
            +SS   +     VV+   +    + G G  G S     N             T EAE   
Sbjct: 179  ASSDPPLLEGGLVVEPYSHCSTNKEGNGLVGDSSLPVENDAEVHNLEYDDKATTEAEVGP 238

Query: 761  --------SHLVEKSYNLNDIIKMIPVNEA----------DKDSMIVQLARELEDEQATR 636
                    +H +++   +   +  I   ++          D+ S I  L + LE+E+  R
Sbjct: 239  RAVTSCEMNHSLDEDMPIKKNVLSIEELQSNPQGLQSFSGDEKSTIRLLEQTLEEERNAR 298

Query: 635  AALYVELEKERNXXXXXXXXAMSMILRLQEEKASIEMELRQYQRMIEEKSAYDEEEMNIM 456
            AALYVELEKER+        AM+MILRLQEEKASIEME RQYQRMIEEKS YD EEM+I+
Sbjct: 299  AALYVELEKERSAAATAADEAMAMILRLQEEKASIEMEARQYQRMIEEKSVYDAEEMDIL 358

Query: 455  KEIVLRREREKHFLEKEVEAYRQMTHHENDQFYHGDDQDSNEDPDLMKNARLYEDAVADS 276
            KEI+LRRE+EKHFLEKEVEAYR +    ++Q   GD  D +        AR   D   D+
Sbjct: 359  KEILLRREKEKHFLEKEVEAYRMIVSVGDEQL-AGDGSDKS-------YARQLFDLPLDT 410

Query: 275  VKEEEPEKTIAIVGEEKSREMEANDATDSRVYDVH-IDHGP--KSSEKVNGAKKQSDQNR 105
               ++P   +  +     +    N  ++ +    H   +G   +  ++ +  KKQ D ++
Sbjct: 411  --NDDPVLILRRLAASTDKRTLENKCSEDKPNSEHAFGNGTLVECQDETSSFKKQGDSDK 468

Query: 104  SGSECSV 84
               + SV
Sbjct: 469  QSVQLSV 475


>ref|XP_010650353.1| PREDICTED: myosin-binding protein 3 isoform X1 [Vitis vinifera]
          Length = 812

 Score =  233 bits (594), Expect = 9e-64
 Identities = 176/459 (38%), Positives = 228/459 (49%), Gaps = 62/459 (13%)
 Frame = -1

Query: 1367 WTFESLVGAFLDLFIAYFLLCGSAIALFAIKILGLFGLTLPSAYD--------ISTFRNL 1212
            WTF  LVGA+LDL IAY LLCGS +A FA K L  FGL LP   +         +  +  
Sbjct: 9    WTFGGLVGAYLDLAIAYLLLCGSTLAFFASKFLSFFGLCLPCPCNGFFGNPNGDNCLQKF 68

Query: 1211 LVDYPTQRVSSVQYSVIRKFPFDAIFSRANSTNNGNENDDGVNLDNGNRFRELEGEGEAS 1032
            LVDYPT+R+SSVQ  V  KFPFD++++   S +   +   G N D+G     +  EGEAS
Sbjct: 69   LVDYPTERISSVQLCVKSKFPFDSVWANEGSPHPNWKLLKGRNSDDG----AVGLEGEAS 124

Query: 1031 CSSKSDARK---------VVRN--------------EVDVKGKGGLNYRMXXXXXXXXXX 921
            CSS  D  +         + RN              + D KGK   N R           
Sbjct: 125  CSSFWDVMRSPDIAGKDSISRNGSCGVMNTPALKEGKSDTKGKRVSNQRPKTGVRRRRRS 184

Query: 920  XFDSGKESSVSSSTNWVTCVVDQGRNDDN-KEVGGGFDGSSVASGGNAGTYEAESHLVEK 744
              D GK SSVSS            R+  +  E G  F G ++    + G    +  LV  
Sbjct: 185  AVDHGKFSSVSSFDPPRLDAPSGLRSPSSVSETGEAFVGKTLVPDASGGEDGFQDELVPI 244

Query: 743  SYNLND-IIKMIPVNE-------ADKDSMIVQ----------------------LARELE 654
              +L +  +  I +NE       ++KD+   +                      L + LE
Sbjct: 245  LIDLGERALHGIKLNEHIDEDKPSEKDASSAEEVKCNARGKLSFNGNTENTVRVLEQALE 304

Query: 653  DEQATRAALYVELEKERNXXXXXXXXAMSMILRLQEEKASIEMELRQYQRMIEEKSAYDE 474
            +E A RAALY ELEKER+        AM+MILR+QEEKASIEME RQ+QR+IEEKSAYD 
Sbjct: 305  EEHAARAALYHELEKERSAAASAADEAMAMILRIQEEKASIEMEARQFQRIIEEKSAYDA 364

Query: 473  EEMNIMKEIVLRREREKHFLEKEVEAYRQMTHHENDQFYHGDDQDSNEDPDLMKNARLYE 294
            EEMN++KEI+LRREREKHFLEKEVEAYRQM   END    G+  D  + P+    + LY 
Sbjct: 365  EEMNLLKEILLRREREKHFLEKEVEAYRQMMFSEND-LLEGNTHDIVDTPEQRPISSLY- 422

Query: 293  DAVADSVKEEEPEKTIAIVGEEKSREMEANDATDSRVYD 177
                     E+P   +  + E   +E +  DA    VY+
Sbjct: 423  -------LSEDPVLMLRRISESIDKEEKVKDADRCSVYE 454


>ref|XP_012844596.1| PREDICTED: probable myosin-binding protein 6 [Erythranthe guttata]
          Length = 534

 Score =  224 bits (572), Expect = 9e-63
 Identities = 171/459 (37%), Positives = 232/459 (50%), Gaps = 26/459 (5%)
 Frame = -1

Query: 1367 WTFESLVGAFLDLFIAYFLLCGSAIALFAIKILGLFGLTLPSAYDISTFR---------N 1215
            W+  +L  A+LDL IAY LL  S +A  A K LG  GL LP   +   F          +
Sbjct: 9    WSLSNLAAAYLDLAIAYILLFASVVAYVASKFLGFLGLNLPCPCNGMFFNIHSRNICLNS 68

Query: 1214 LLVDYPTQRVSSVQYSVIRKFPFDAIFSRANSTNNGNENDDGVNLDNGNRFRELEGEGEA 1035
            LLVD+PTQ+VS+VQ S+  +FPF      ++ST   N +   +   N N    LE EG+A
Sbjct: 69   LLVDFPTQKVSNVQLSIKHRFPF------SDSTCPKNHDYSIIGGGNSNVNGVLEIEGDA 122

Query: 1034 SCSSKSDARKVVRNEVDVKGKGGLNYRMXXXXXXXXXXXFDSGKESSVSS-----STNWV 870
            SCSS SDARK     VD+KGKG ++YR               GK SSVSS        + 
Sbjct: 123  SCSSVSDARK----PVDMKGKGAVSYRQRGRFRKHRKASGSIGKYSSVSSYDLPLHEPYC 178

Query: 869  TCVVDQGRNDDNKEVGGGFDGSSVASGGNAGTYEAESHLVEKSYNLNDIIKMIPVNEADK 690
                D+G N        G D     +     + + E+H+   ++      + + ++  D 
Sbjct: 179  HSSTDKGENGFTN----GDDSKPSTTLETNRSSDEETHVKRSTH------EELQISSLDD 228

Query: 689  DSMIVQLARELEDEQATRAALYVELEKERNXXXXXXXXAMSMILRLQEEKASIEMELRQY 510
             + I  L   LE+E+  RAALY ELEKER+        AM+MILRLQ EKA++EME RQY
Sbjct: 229  KTAIRLLEETLEEERTARAALYTELEKERSAAASAADEAMAMILRLQAEKAAVEMEARQY 288

Query: 509  QRMIEEKSAYDEEEMNIMKEIVLRREREKHFLEKEVEAYRQMTHHENDQFYHGDDQ---- 342
            QRMIEEKSAYD EEMNI+KEI++RRE EKHFLEK+VE Y   +H E D     D +    
Sbjct: 289  QRMIEEKSAYDAEEMNILKEILVRREMEKHFLEKQVEGYN--SHFEVDSSDKSDGRQSFG 346

Query: 341  ----DSNEDPDLMKNARLYEDAVADSVKEEEPEKTIAIVGEEKSREMEANDAT----DSR 186
                D NEDP           ++   + E   +K IA V +  +R  E  + T      R
Sbjct: 347  SSWFDPNEDP----------VSILHQLAEATDKKEIASV-DNSTRPQECEEITPLPLGGR 395

Query: 185  VYDVHIDHGPKSSEKVNGAKKQSDQNRSGSECSVGLPPV 69
            V ++  +      EK+ G   +++  R+      GLPP+
Sbjct: 396  VQEIGEN---LVVEKIIGTCNEAETKRAN-----GLPPI 426


>gb|EYU31416.1| hypothetical protein MIMGU_mgv1a004094mg [Erythranthe guttata]
          Length = 544

 Score =  224 bits (572), Expect = 1e-62
 Identities = 171/459 (37%), Positives = 232/459 (50%), Gaps = 26/459 (5%)
 Frame = -1

Query: 1367 WTFESLVGAFLDLFIAYFLLCGSAIALFAIKILGLFGLTLPSAYDISTFR---------N 1215
            W+  +L  A+LDL IAY LL  S +A  A K LG  GL LP   +   F          +
Sbjct: 9    WSLSNLAAAYLDLAIAYILLFASVVAYVASKFLGFLGLNLPCPCNGMFFNIHSRNICLNS 68

Query: 1214 LLVDYPTQRVSSVQYSVIRKFPFDAIFSRANSTNNGNENDDGVNLDNGNRFRELEGEGEA 1035
            LLVD+PTQ+VS+VQ S+  +FPF      ++ST   N +   +   N N    LE EG+A
Sbjct: 69   LLVDFPTQKVSNVQLSIKHRFPF------SDSTCPKNHDYSIIGGGNSNVNGVLEIEGDA 122

Query: 1034 SCSSKSDARKVVRNEVDVKGKGGLNYRMXXXXXXXXXXXFDSGKESSVSS-----STNWV 870
            SCSS SDARK     VD+KGKG ++YR               GK SSVSS        + 
Sbjct: 123  SCSSVSDARK----PVDMKGKGAVSYRQRGRFRKHRKASGSIGKYSSVSSYDLPLHEPYC 178

Query: 869  TCVVDQGRNDDNKEVGGGFDGSSVASGGNAGTYEAESHLVEKSYNLNDIIKMIPVNEADK 690
                D+G N        G D     +     + + E+H+   ++      + + ++  D 
Sbjct: 179  HSSTDKGENGFTN----GDDSKPSTTLETNRSSDEETHVKRSTH------EELQISSLDD 228

Query: 689  DSMIVQLARELEDEQATRAALYVELEKERNXXXXXXXXAMSMILRLQEEKASIEMELRQY 510
             + I  L   LE+E+  RAALY ELEKER+        AM+MILRLQ EKA++EME RQY
Sbjct: 229  KTAIRLLEETLEEERTARAALYTELEKERSAAASAADEAMAMILRLQAEKAAVEMEARQY 288

Query: 509  QRMIEEKSAYDEEEMNIMKEIVLRREREKHFLEKEVEAYRQMTHHENDQFYHGDDQ---- 342
            QRMIEEKSAYD EEMNI+KEI++RRE EKHFLEK+VE Y   +H E D     D +    
Sbjct: 289  QRMIEEKSAYDAEEMNILKEILVRREMEKHFLEKQVEGYN--SHFEVDSSDKSDGRQSFG 346

Query: 341  ----DSNEDPDLMKNARLYEDAVADSVKEEEPEKTIAIVGEEKSREMEANDAT----DSR 186
                D NEDP           ++   + E   +K IA V +  +R  E  + T      R
Sbjct: 347  SSWFDPNEDP----------VSILHQLAEATDKKEIASV-DNSTRPQECEEITPLPLGGR 395

Query: 185  VYDVHIDHGPKSSEKVNGAKKQSDQNRSGSECSVGLPPV 69
            V ++  +      EK+ G   +++  R+      GLPP+
Sbjct: 396  VQEIGEN---LVVEKIIGTCNEAETKRAN-----GLPPI 426


>ref|XP_010241371.1| PREDICTED: uncharacterized protein LOC104585995 isoform X2 [Nelumbo
            nucifera]
          Length = 806

 Score =  223 bits (568), Expect = 3e-60
 Identities = 172/411 (41%), Positives = 217/411 (52%), Gaps = 65/411 (15%)
 Frame = -1

Query: 1367 WTFESLVGAFLDLFIAYFLLCGSAIALFAIKILGLFGLTLPSAYDI--------STFRNL 1212
            WTF  LVGAFLDL +AY LLCGSA+A FA K LG+FGL LP   +            + L
Sbjct: 9    WTFCGLVGAFLDLALAYLLLCGSALAFFASKFLGIFGLYLPCPCNGFFGVPNGGKCLQRL 68

Query: 1211 LVDYPTQRVSSVQYSVIRKFPFDAIFSRANSTNNGNENDDGVNLDNGNRFRELEGEGEAS 1032
            LVD PT ++SSVQ SV  KFPFD  + +    + G + +  +  D       LE EGEAS
Sbjct: 69   LVDCPTGKISSVQMSVKSKFPFDTFWMK----DQGCQLNVKLLRDRDRCDGLLEMEGEAS 124

Query: 1031 CSSKSDARK----------VVRNEV----------------DVKGKGGLNYRMXXXXXXX 930
            CSS S+ R+            RNE+                DVKGKG +  +        
Sbjct: 125  CSSFSETRRRSHSLALRDLSPRNEMIRFGLTNSPVARESRSDVKGKGVVTQKPRSTLRRR 184

Query: 929  XXXXFDSGKESSVSSS------------TNWVTCVVDQGRNDDNKE-VGGGFDGSSVASG 789
                 + GK SSVSSS            + +         N+++ E V  G DG +   G
Sbjct: 185  RRSAVEHGKFSSVSSSDPPRFNCRNAPRSPYSVSETGHEINEESSEPVNYGGDGLNDDRG 244

Query: 788  GNAGT---------YE-----AESHLVEKSYNLNDIIKMIPVNEADKD----SMIVQLAR 663
             + G          +E      ES  +E+   L + +     NE D D    + I  L +
Sbjct: 245  VSRGIGLGEGKLHGFEQNDPFGESKSMEEGGLLVEELVCHDRNELDFDGKDANAIRVLEQ 304

Query: 662  ELEDEQATRAALYVELEKERNXXXXXXXXAMSMILRLQEEKASIEMELRQYQRMIEEKSA 483
             LE+EQA RAALYVELEKER+        AM+MILRLQ+EKASIEME RQYQRMIEEKSA
Sbjct: 305  ALEEEQAARAALYVELEKERSAAATAADEAMAMILRLQKEKASIEMEARQYQRMIEEKSA 364

Query: 482  YDEEEMNIMKEIVLRREREKHFLEKEVEAYRQMTHHENDQFYHGDDQDSNE 330
            YD EEM+I+KEI++RREREKHFLEKEVE+YRQ+     D+   G+  D  E
Sbjct: 365  YDAEEMDILKEILIRREREKHFLEKEVESYRQVV-LARDEKLQGNKHDLAE 414


>ref|XP_010241370.1| PREDICTED: uncharacterized protein LOC104585995 isoform X1 [Nelumbo
            nucifera]
          Length = 808

 Score =  223 bits (568), Expect = 4e-60
 Identities = 172/411 (41%), Positives = 217/411 (52%), Gaps = 65/411 (15%)
 Frame = -1

Query: 1367 WTFESLVGAFLDLFIAYFLLCGSAIALFAIKILGLFGLTLPSAYDI--------STFRNL 1212
            WTF  LVGAFLDL +AY LLCGSA+A FA K LG+FGL LP   +            + L
Sbjct: 9    WTFCGLVGAFLDLALAYLLLCGSALAFFASKFLGIFGLYLPCPCNGFFGVPNGGKCLQRL 68

Query: 1211 LVDYPTQRVSSVQYSVIRKFPFDAIFSRANSTNNGNENDDGVNLDNGNRFRELEGEGEAS 1032
            LVD PT ++SSVQ SV  KFPFD  + +    + G + +  +  D       LE EGEAS
Sbjct: 69   LVDCPTGKISSVQMSVKSKFPFDTFWMK----DQGCQLNVKLLRDRDRCDGLLEMEGEAS 124

Query: 1031 CSSKSDARK----------VVRNEV----------------DVKGKGGLNYRMXXXXXXX 930
            CSS S+ R+            RNE+                DVKGKG +  +        
Sbjct: 125  CSSFSETRRRSHSLALRDLSPRNEMIRFGLTNSPVARESRSDVKGKGVVTQKPRSTLRRR 184

Query: 929  XXXXFDSGKESSVSSS------------TNWVTCVVDQGRNDDNKE-VGGGFDGSSVASG 789
                 + GK SSVSSS            + +         N+++ E V  G DG +   G
Sbjct: 185  RRSAVEHGKFSSVSSSDPPRFNCRNAPRSPYSVSETGHEINEESSEPVNYGGDGLNDDRG 244

Query: 788  GNAGT---------YE-----AESHLVEKSYNLNDIIKMIPVNEADKD----SMIVQLAR 663
             + G          +E      ES  +E+   L + +     NE D D    + I  L +
Sbjct: 245  VSRGIGLGEGKLHGFEQNDPFGESKSMEEGGLLVEELVCHDRNELDFDGKDANAIRVLEQ 304

Query: 662  ELEDEQATRAALYVELEKERNXXXXXXXXAMSMILRLQEEKASIEMELRQYQRMIEEKSA 483
             LE+EQA RAALYVELEKER+        AM+MILRLQ+EKASIEME RQYQRMIEEKSA
Sbjct: 305  ALEEEQAARAALYVELEKERSAAATAADEAMAMILRLQKEKASIEMEARQYQRMIEEKSA 364

Query: 482  YDEEEMNIMKEIVLRREREKHFLEKEVEAYRQMTHHENDQFYHGDDQDSNE 330
            YD EEM+I+KEI++RREREKHFLEKEVE+YRQ+     D+   G+  D  E
Sbjct: 365  YDAEEMDILKEILIRREREKHFLEKEVESYRQVV-LARDEKLQGNKHDLAE 414


>gb|KZV45378.1| hypothetical protein F511_05542 [Dorcoceras hygrometricum]
          Length = 718

 Score =  220 bits (560), Expect = 1e-59
 Identities = 169/410 (41%), Positives = 208/410 (50%), Gaps = 59/410 (14%)
 Frame = -1

Query: 1367 WTFESLVGAFLDLFIAYFLLCGSAIALFAIKILGLFGLTLPSAYDIST-----FRNLLVD 1203
            W+   LV A  +L IAY  LC SAIA FA K LG FGL LP     +      F  LLVD
Sbjct: 8    WSLSGLVAAIFNLAIAYLFLCVSAIAFFASKFLGFFGLELPCPCKNTPSKEHCFNRLLVD 67

Query: 1202 YPTQRVSSVQYSVIRKFPF-DAIFSRANSTNNGNENDDGVNLDNGNRFRELEGEGEASCS 1026
            +P Q+VS+VQ SV  KFPF D+I++R +  N G +N       NG     LE EG+ASCS
Sbjct: 68   FPAQQVSNVQLSVKEKFPFNDSIWARNHDNNIGRDN-----YANGI----LEIEGDASCS 118

Query: 1025 SKSDARKVVRN-----------EVDVKGKGGLNYR-MXXXXXXXXXXXFDSGKESSVSSS 882
            S SD R+  RN           E DVKGKG ++YR              D GK S+VSS 
Sbjct: 119  SVSDVRQS-RNLVGKDFGQWDEEYDVKGKGAISYRPRSRLHRRSRKGSVDHGKYSAVSSY 177

Query: 881  T----NWVTCVVDQGRNDDNKEVGGGFDGSSVASGGNAGTYEAE-----SHLVEKSYNLN 729
                   +   +   R+  NK  G GF G S        +Y  E     S +  +  NL+
Sbjct: 178  DPSLHEEILGSIPHSRSSSNKG-GDGFAGGSSFLDDYGSSYNIEYKRAPSVVGRRKSNLS 236

Query: 728  DI----------------IKMIPVNEA-----DKDSMIVQLARELEDEQATRAALYVELE 612
             +                + +  + EA      + + I  L + LE+  A R ALY+ELE
Sbjct: 237  SVQINNSSDDDTEVRKTVLSIEDLQEAKYFCGQEGNTIQLLEQALEEANAARDALYIELE 296

Query: 611  KERNXXXXXXXXAMSMILRLQEEKASIEMELRQYQRMIEEKSAYDEEEMNIMKEIVLRRE 432
            KERN        AM+MILRLQEEKASIEME RQ+QR+ EEKSAYD EEM+I+KEI++RRE
Sbjct: 297  KERNAAASAAEEAMAMILRLQEEKASIEMEARQHQRIFEEKSAYDAEEMDILKEILVRRE 356

Query: 431  REKHFLEKEVEAYRQMTHHENDQFYHGDDQ-----------DSNEDPDLM 315
             EKH LE EVE YRQM    N Q                  D NEDP LM
Sbjct: 357  MEKHLLEMEVEGYRQMASLGNQQLVDDGPSGAPTNVLGLLIDQNEDPVLM 406


>gb|PIA36041.1| hypothetical protein AQUCO_03400145v1 [Aquilegia coerulea]
          Length = 763

 Score =  219 bits (557), Expect = 7e-59
 Identities = 180/488 (36%), Positives = 245/488 (50%), Gaps = 73/488 (14%)
 Frame = -1

Query: 1367 WTFESLVGAFLDLFIAYFLLCGSAIALFAIKILGLFGLTLPSAYD------ISTFRN--- 1215
            WTF +LVGAFLDLFIAYFLLC S    FA K L +FGL LP   +       S  RN   
Sbjct: 12   WTFSALVGAFLDLFIAYFLLCASTFTFFASKFLAIFGLYLPCPCNGLFSDPRSNRRNCIE 71

Query: 1214 -LLVDYPTQRVSSVQYSVIRKFPFDAIFSRANSTNNGNENDDGVNLDNGNRFRELEGEGE 1038
              LVD P + +SSVQ SV  KFPFD IF++ +   +  +     + D+ + +  L+ EGE
Sbjct: 72   RFLVDTPIENISSVQMSVKGKFPFDPIFTKDHYQGSQLKVKLVKDKDHSS-YGFLDMEGE 130

Query: 1037 ASCSSKSDARK-------------VVRNEVDVKGKGGLNYR-----MXXXXXXXXXXXFD 912
            ASCSS SD +              V +   D KGK  LN R     +             
Sbjct: 131  ASCSSYSDPQNDKLLRLGSINLSPVHKGVDDYKGKKVLNQRPRAGTLRRRRRTFPHGYRM 190

Query: 911  SGKESSV----SSSTNWVTCVVDQG--RNDDNKEVGG---------GFDGSSVASGG-NA 780
             G+ESS+    S      T   D+   R DD++   G          F  + +  G  N 
Sbjct: 191  IGRESSLPPLGSDPIQSETTENDESSVRQDDDQVNAGILSDEGPQHSFQWNDLMGGSKNF 250

Query: 779  GTYEAESHLVEKSYNLNDIIKMIP---VNEADKDSMIVQLARELEDEQATRAALYVELEK 609
            G  E+ +         +D+I  I    V +  + + +  L + LE+E A RAALY +LEK
Sbjct: 251  GRDESSA---------SDVISHIQQELVYDRSEVNAVRILEQALEEEHAARAALYQDLEK 301

Query: 608  ERNXXXXXXXXAMSMILRLQEEKASIEMELRQYQRMIEEKSAYDEEEMNIMKEIVLRRER 429
            ER+        AM+MILRLQ+EKAS EME +QYQRMIEEKSAYD+EEMNI+KEI+LRRER
Sbjct: 302  ERSAAATAADEAMAMILRLQKEKASTEMEAKQYQRMIEEKSAYDQEEMNILKEILLRRER 361

Query: 428  EKHFLEKEVEAYRQMTHHENDQFYHGDDQ------DSNEDPDLM-----------KNARL 300
            EKHFLEKE+EAYR++    N++      Q      D +EDP LM           + A++
Sbjct: 362  EKHFLEKELEAYRRIMTPGNEELVGISTQRPRMSFDPSEDPALMLQQISEYIDKKEMAKM 421

Query: 299  YEDAVADSVKEEEPEKTIAIVGEEKSRE---------MEANDATDSRVYDVHIDHGPKSS 147
              ++ AD       ++   +  EE+S           +E  DA  S  Y+    HGP+  
Sbjct: 422  MNNSPADYDALSVEKQGRVLTFEEESPSPYWNETDDLLEQGDARMSLSYETDQGHGPQCI 481

Query: 146  EKVNGAKK 123
            ++   ++K
Sbjct: 482  DEYKHSQK 489


>gb|KZM85966.1| hypothetical protein DCAR_026612 [Daucus carota subsp. sativus]
          Length = 762

 Score =  216 bits (551), Expect = 5e-58
 Identities = 162/399 (40%), Positives = 204/399 (51%), Gaps = 60/399 (15%)
 Frame = -1

Query: 1379 NRSFWTFESLVGAFLDLFIAYFLLCGSAIALFAIKILGLFGLTLPSAYDISTFR------ 1218
            N  +WTF  LVGAFLDL IAY +LC S++A F  + LGLFGL LP   D    R      
Sbjct: 5    NMQYWTFTGLVGAFLDLGIAYCVLCASSLAFFVSEFLGLFGLRLPCPCDGMFTRPRNHYC 64

Query: 1217 --NLLVDYPTQRVSSVQYSVIRKFPFDAIFSRANSTNNGNEN-DDGVNLDNGNRFRELEG 1047
               LL DYP ++V  V  SV  KFPF +  +  N  N G +  ++  NL NG     ++ 
Sbjct: 65   LQRLLFDYPHEKVGYVLCSVRNKFPFHSELT--NDENLGVKLVEERSNLGNGYY---VDF 119

Query: 1046 EGEASCSSKSDARKVVRN------------------------EVDVKGKGGLNYRMXXXX 939
            E EASCSS SDA KV RN                        ++D+KGKG   +R     
Sbjct: 120  EAEASCSSVSDA-KVSRNTGEKEMVPRKGIGFEFNVGRWPDKKIDMKGKGISIHRPRANM 178

Query: 938  XXXXXXXFDSGKESSVSSSTNWVTCVVDQGRNDDNKEVGGGFDGS--------------- 804
                   +D  + SSVSS               +  E  GG   S               
Sbjct: 179  RRRRKGGYDHKRSSSVSSHDQVPVPRYSSSIKKEKNEDLGGMSASDEADTNYLSDDRRAP 238

Query: 803  SVASGGNAGTYEAESHLVEKSYNLNDIIKMIPVNE------------ADKDSMIVQLARE 660
            SV S G +G+ + + +      NL +  K    NE             D+ + I  L + 
Sbjct: 239  SVMSAGGSGSRDVDLNSSLPGMNLKE--KYESSNEDFMSDARGALSLGDEINTIGVLEQA 296

Query: 659  LEDEQATRAALYVELEKERNXXXXXXXXAMSMILRLQEEKASIEMELRQYQRMIEEKSAY 480
            L++E A R ALY+EL+KER+        AM+MILRLQEEKAS+EM+ RQYQR IEEKSAY
Sbjct: 297  LKEEHAARVALYIELDKERHAAASAADEAMAMILRLQEEKASVEMQARQYQREIEEKSAY 356

Query: 479  DEEEMNIMKEIVLRREREKHFLEKEVEAYRQMTHHENDQ 363
            D EEMNI+KEIVLRREREKHFLEKEVEAYRQ+ +   +Q
Sbjct: 357  DTEEMNILKEIVLRREREKHFLEKEVEAYRQLLYSGKEQ 395


>ref|XP_017223020.1| PREDICTED: myosin-binding protein 3 [Daucus carota subsp. sativus]
          Length = 772

 Score =  216 bits (551), Expect = 5e-58
 Identities = 162/399 (40%), Positives = 204/399 (51%), Gaps = 60/399 (15%)
 Frame = -1

Query: 1379 NRSFWTFESLVGAFLDLFIAYFLLCGSAIALFAIKILGLFGLTLPSAYDISTFR------ 1218
            N  +WTF  LVGAFLDL IAY +LC S++A F  + LGLFGL LP   D    R      
Sbjct: 5    NMQYWTFTGLVGAFLDLGIAYCVLCASSLAFFVSEFLGLFGLRLPCPCDGMFTRPRNHYC 64

Query: 1217 --NLLVDYPTQRVSSVQYSVIRKFPFDAIFSRANSTNNGNEN-DDGVNLDNGNRFRELEG 1047
               LL DYP ++V  V  SV  KFPF +  +  N  N G +  ++  NL NG     ++ 
Sbjct: 65   LQRLLFDYPHEKVGYVLCSVRNKFPFHSELT--NDENLGVKLVEERSNLGNGYY---VDF 119

Query: 1046 EGEASCSSKSDARKVVRN------------------------EVDVKGKGGLNYRMXXXX 939
            E EASCSS SDA KV RN                        ++D+KGKG   +R     
Sbjct: 120  EAEASCSSVSDA-KVSRNTGEKEMVPRKGIGFEFNVGRWPDKKIDMKGKGISIHRPRANM 178

Query: 938  XXXXXXXFDSGKESSVSSSTNWVTCVVDQGRNDDNKEVGGGFDGS--------------- 804
                   +D  + SSVSS               +  E  GG   S               
Sbjct: 179  RRRRKGGYDHKRSSSVSSHDQVPVPRYSSSIKKEKNEDLGGMSASDEADTNYLSDDRRAP 238

Query: 803  SVASGGNAGTYEAESHLVEKSYNLNDIIKMIPVNE------------ADKDSMIVQLARE 660
            SV S G +G+ + + +      NL +  K    NE             D+ + I  L + 
Sbjct: 239  SVMSAGGSGSRDVDLNSSLPGMNLKE--KYESSNEDFMSDARGALSLGDEINTIGVLEQA 296

Query: 659  LEDEQATRAALYVELEKERNXXXXXXXXAMSMILRLQEEKASIEMELRQYQRMIEEKSAY 480
            L++E A R ALY+EL+KER+        AM+MILRLQEEKAS+EM+ RQYQR IEEKSAY
Sbjct: 297  LKEEHAARVALYIELDKERHAAASAADEAMAMILRLQEEKASVEMQARQYQREIEEKSAY 356

Query: 479  DEEEMNIMKEIVLRREREKHFLEKEVEAYRQMTHHENDQ 363
            D EEMNI+KEIVLRREREKHFLEKEVEAYRQ+ +   +Q
Sbjct: 357  DTEEMNILKEIVLRREREKHFLEKEVEAYRQLLYSGKEQ 395

