BLASTX nr result

ID: Chrysanthemum21_contig00018777 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00018777
         (2024 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KVH96099.1| protein of unknown function DUF1336 [Cynara cardu...   594   0.0  
ref|XP_022020642.1| uncharacterized protein LOC110920751 isoform...   593   0.0  
ref|XP_022020641.1| uncharacterized protein LOC110920751 isoform...   590   0.0  
ref|XP_023740319.1| uncharacterized protein LOC111888363 isoform...   586   0.0  
ref|XP_023740318.1| uncharacterized protein LOC111888363 isoform...   584   0.0  
ref|XP_021999037.1| uncharacterized protein LOC110895955 isoform...   580   0.0  
ref|XP_021999038.1| uncharacterized protein LOC110895955 isoform...   580   0.0  
gb|PLY68838.1| hypothetical protein LSAT_3X50601 [Lactuca sativa]     580   0.0  
gb|KVH97752.1| protein of unknown function DUF1336 [Cynara cardu...   562   0.0  
ref|XP_021999039.1| uncharacterized protein LOC110895955 isoform...   521   e-177
gb|EEF42229.1| conserved hypothetical protein [Ricinus communis]      522   e-177
ref|XP_019164732.1| PREDICTED: uncharacterized protein LOC109160...   514   e-173
ref|XP_017612608.1| PREDICTED: uncharacterized protein LOC108457...   508   e-171
ref|XP_008339963.1| PREDICTED: uncharacterized protein LOC103402...   509   e-171
ref|XP_012458818.1| PREDICTED: uncharacterized protein LOC105779...   505   e-170
ref|XP_016739139.1| PREDICTED: uncharacterized protein LOC107948...   505   e-170
ref|XP_015965313.1| uncharacterized protein LOC107489041 [Arachi...   503   e-169
ref|XP_016680860.1| PREDICTED: uncharacterized protein LOC107899...   503   e-169
ref|XP_016202565.1| uncharacterized protein LOC107643434 isoform...   502   e-169
ref|XP_011071045.1| uncharacterized protein LOC105156572 [Sesamu...   503   e-168

>gb|KVH96099.1| protein of unknown function DUF1336 [Cynara cardunculus var.
            scolymus]
          Length = 494

 Score =  594 bits (1531), Expect = 0.0
 Identities = 308/441 (69%), Positives = 344/441 (78%), Gaps = 15/441 (3%)
 Frame = +2

Query: 203  GSIEDIWYDSAAILESDGSDEDFHSVLNDMLPLDGSEGVSRPNISAVRD-----GENRRS 367
            GSI++ +YDSAA+LESD S+EDFHSVL+D++ L+GSEG SR +I+++RD     GE+RRS
Sbjct: 59   GSIDEFFYDSAAVLESDCSEEDFHSVLDDVVSLNGSEGASRASIASLRDVNHGDGESRRS 118

Query: 368  SVHPVDVSRCXXXXXXXXXXXXXXXXE-----RDNGG----LFDCGIIPSNCLPCLATID 520
            SVHP +++                  E      DN G    L DCG+IP NCLPCLA   
Sbjct: 119  SVHPEEMNPRSRSDGPNNDFQPVYIDEISSSVDDNAGRENDLLDCGVIPGNCLPCLAATV 178

Query: 521  TSVDKRXXXXXXXXXX-KKAAHKLSFKWKDGHPNASFVSSKKHIQRPIAGSQVPFCSIEK 697
             SV+KR           KKA HKLSFKW+DGHPNA+  SSK H+QRPIAGSQVPFC +EK
Sbjct: 179  PSVEKRRSLSSSPPSARKKAVHKLSFKWRDGHPNANIFSSKMHLQRPIAGSQVPFCPVEK 238

Query: 698  RMPDSWSDIEPQTFRIRGKNYLRDKKKEHAPNYAAYYPFGVDVFLSQRKIDHIARFVELP 877
             + DSWS +EP+TFR+RG NYLRDKKKEHAPNYAAYYPFGVDVFLSQRKIDHIARFVELP
Sbjct: 239  TVLDSWSHVEPKTFRVRGVNYLRDKKKEHAPNYAAYYPFGVDVFLSQRKIDHIARFVELP 298

Query: 878  VVGSSSGELPSILVVNVQIPLYPASFFKSEIDGEGISFVLYFKLSESYAKELSSQFQDNM 1057
            VVGSSS ELPSILVVNVQ+PLYPA+FF+ EIDGEG++ VLYFKLS+SY+KELSSQFQDNM
Sbjct: 299  VVGSSSTELPSILVVNVQVPLYPAAFFQGEIDGEGMNVVLYFKLSDSYSKELSSQFQDNM 358

Query: 1058 RRILDDEIEKVKGFPVDTLVPCRERLKILGRVVNIDDLQLSAPERKLMHAYNGKPVLSRP 1237
            RRILDDEIEKVKGFPVDTLVP RERLKILGRVVN++DLQLSAPERKLMHAYN KPVLSRP
Sbjct: 359  RRILDDEIEKVKGFPVDTLVPFRERLKILGRVVNVEDLQLSAPERKLMHAYNEKPVLSRP 418

Query: 1238 QHEFYQGENYMEIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNKXXXXXXXXXX 1417
            QHEFYQGENY EIDLDMHRFSYISRKGFEAFQDRLKNCILD      GNK          
Sbjct: 419  QHEFYQGENYFEIDLDMHRFSYISRKGFEAFQDRLKNCILD------GNKVEELPEQILC 472

Query: 1418 XXXXNCIDYMNYHMLELNQEP 1480
                N ID M Y ML LNQEP
Sbjct: 473  CVRLNGIDRMRYQMLGLNQEP 493


>ref|XP_022020642.1| uncharacterized protein LOC110920751 isoform X2 [Helianthus annuus]
          Length = 488

 Score =  593 bits (1528), Expect = 0.0
 Identities = 305/445 (68%), Positives = 346/445 (77%), Gaps = 10/445 (2%)
 Frame = +2

Query: 173  DQSNQTSASKGSIEDIWYDSAAILESDGSDEDFHSVLNDMLPLDGSEGVSRPNISAVRD- 349
            D+S      +GS ++ WYDSAA+LESD S+EDF SVL+D++ L+GSEG SR +I+++RD 
Sbjct: 43   DRSIANPTFRGSTDESWYDSAAVLESDCSEEDFQSVLDDVVSLNGSEGASRASIASLRDV 102

Query: 350  ----GENRRSSVHPVDVS-RCXXXXXXXXXXXXXXXXERDNG---GLFDCGIIPSNCLPC 505
                GE+RRS+  P + + R                 +   G   GL DCG+IPSNCLPC
Sbjct: 103  THGDGESRRSTALPEETNPRGPNEIRPVYLDEISSSVDESTGREDGLLDCGVIPSNCLPC 162

Query: 506  LATIDTSVDKRXXXXXXXXXX-KKAAHKLSFKWKDGHPNASFVSSKKHIQRPIAGSQVPF 682
            LA    SV+KR           KK  HKLSFKWKDGHPNA+  SSK  +QRPIAGSQVPF
Sbjct: 163  LAATVPSVEKRTSLSSSPSSARKKPVHKLSFKWKDGHPNANIFSSKMQLQRPIAGSQVPF 222

Query: 683  CSIEKRMPDSWSDIEPQTFRIRGKNYLRDKKKEHAPNYAAYYPFGVDVFLSQRKIDHIAR 862
            C ++K + DSWS IEP+TFR+R +NYLRDKKKEHAPNYAAYYPFGVDVFLSQRKIDHIAR
Sbjct: 223  CPVDKTVLDSWSHIEPKTFRVRAENYLRDKKKEHAPNYAAYYPFGVDVFLSQRKIDHIAR 282

Query: 863  FVELPVVGSSSGELPSILVVNVQIPLYPASFFKSEIDGEGISFVLYFKLSESYAKELSSQ 1042
            FVELP V SSSGELPSILVVNVQ+PLYPA+FF+ EIDGEG++ VLYF+LS+ Y+KELSSQ
Sbjct: 283  FVELPAV-SSSGELPSILVVNVQVPLYPAAFFQGEIDGEGMNIVLYFRLSDGYSKELSSQ 341

Query: 1043 FQDNMRRILDDEIEKVKGFPVDTLVPCRERLKILGRVVNIDDLQLSAPERKLMHAYNGKP 1222
            FQDNMRRILDDEIEKVKGFPVDTLVP RERLKILGRVVN++DLQL+APERKLMHAYN KP
Sbjct: 342  FQDNMRRILDDEIEKVKGFPVDTLVPFRERLKILGRVVNVEDLQLNAPERKLMHAYNEKP 401

Query: 1223 VLSRPQHEFYQGENYMEIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNKXXXXX 1402
            VLSRPQHEFYQGENY EIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNK     
Sbjct: 402  VLSRPQHEFYQGENYFEIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNKAEELP 461

Query: 1403 XXXXXXXXXNCIDYMNYHMLELNQE 1477
                     N ID M YHML +NQE
Sbjct: 462  EQILCCVRLNGIDRMRYHMLAVNQE 486


>ref|XP_022020641.1| uncharacterized protein LOC110920751 isoform X1 [Helianthus annuus]
 gb|OTF85867.1| Protein of unknown function (DUF1336) [Helianthus annuus]
          Length = 489

 Score =  590 bits (1522), Expect = 0.0
 Identities = 303/435 (69%), Positives = 342/435 (78%), Gaps = 10/435 (2%)
 Frame = +2

Query: 203  GSIEDIWYDSAAILESDGSDEDFHSVLNDMLPLDGSEGVSRPNISAVRD-----GENRRS 367
            GS ++ WYDSAA+LESD S+EDF SVL+D++ L+GSEG SR +I+++RD     GE+RRS
Sbjct: 54   GSTDESWYDSAAVLESDCSEEDFQSVLDDVVSLNGSEGASRASIASLRDVTHGDGESRRS 113

Query: 368  SVHPVDVS-RCXXXXXXXXXXXXXXXXERDNG---GLFDCGIIPSNCLPCLATIDTSVDK 535
            +  P + + R                 +   G   GL DCG+IPSNCLPCLA    SV+K
Sbjct: 114  TALPEETNPRGPNEIRPVYLDEISSSVDESTGREDGLLDCGVIPSNCLPCLAATVPSVEK 173

Query: 536  RXXXXXXXXXX-KKAAHKLSFKWKDGHPNASFVSSKKHIQRPIAGSQVPFCSIEKRMPDS 712
            R           KK  HKLSFKWKDGHPNA+  SSK  +QRPIAGSQVPFC ++K + DS
Sbjct: 174  RTSLSSSPSSARKKPVHKLSFKWKDGHPNANIFSSKMQLQRPIAGSQVPFCPVDKTVLDS 233

Query: 713  WSDIEPQTFRIRGKNYLRDKKKEHAPNYAAYYPFGVDVFLSQRKIDHIARFVELPVVGSS 892
            WS IEP+TFR+R +NYLRDKKKEHAPNYAAYYPFGVDVFLSQRKIDHIARFVELP V SS
Sbjct: 234  WSHIEPKTFRVRAENYLRDKKKEHAPNYAAYYPFGVDVFLSQRKIDHIARFVELPAV-SS 292

Query: 893  SGELPSILVVNVQIPLYPASFFKSEIDGEGISFVLYFKLSESYAKELSSQFQDNMRRILD 1072
            SGELPSILVVNVQ+PLYPA+FF+ EIDGEG++ VLYF+LS+ Y+KELSSQFQDNMRRILD
Sbjct: 293  SGELPSILVVNVQVPLYPAAFFQGEIDGEGMNIVLYFRLSDGYSKELSSQFQDNMRRILD 352

Query: 1073 DEIEKVKGFPVDTLVPCRERLKILGRVVNIDDLQLSAPERKLMHAYNGKPVLSRPQHEFY 1252
            DEIEKVKGFPVDTLVP RERLKILGRVVN++DLQL+APERKLMHAYN KPVLSRPQHEFY
Sbjct: 353  DEIEKVKGFPVDTLVPFRERLKILGRVVNVEDLQLNAPERKLMHAYNEKPVLSRPQHEFY 412

Query: 1253 QGENYMEIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNKXXXXXXXXXXXXXXN 1432
            QGENY EIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNK              N
Sbjct: 413  QGENYFEIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNKAEELPEQILCCVRLN 472

Query: 1433 CIDYMNYHMLELNQE 1477
             ID M YHML +NQE
Sbjct: 473  GIDRMRYHMLAVNQE 487


>ref|XP_023740319.1| uncharacterized protein LOC111888363 isoform X2 [Lactuca sativa]
          Length = 482

 Score =  586 bits (1510), Expect = 0.0
 Identities = 298/439 (67%), Positives = 341/439 (77%), Gaps = 4/439 (0%)
 Frame = +2

Query: 173  DQSNQTSASKGSIEDIWYDSAAILESDGSDEDFHSVLNDMLPLDGSEGVSRPNISAVRDG 352
            D+S      +GS ++ WYDSAA+L+SD S++DF SVL+D+  L+GSEG SR +IS+V   
Sbjct: 48   DKSFVNPTFRGSTDESWYDSAAVLDSDCSEDDFQSVLDDVSSLNGSEGASRASISSVHPE 107

Query: 353  ENRRSSVHPVDVSRCXXXXXXXXXXXXXXXXERDNG---GLFDCGIIPSNCLPCLATIDT 523
            E     ++P   S                  +  +G   GL DCG+IPSNCLPCLA    
Sbjct: 108  E-----MNPRSRSEGPNEIKPVYLDEISSSVDETSGREDGLLDCGVIPSNCLPCLAATVP 162

Query: 524  SVDKRXXXXXXXXXX-KKAAHKLSFKWKDGHPNASFVSSKKHIQRPIAGSQVPFCSIEKR 700
            S++KR           KK+ HKLSFKWKDGHPNA+  SSK H+QRP AGSQVPFC ++K+
Sbjct: 163  SIEKRRSLSSSPPSVRKKSTHKLSFKWKDGHPNANIFSSKIHLQRPKAGSQVPFCPLDKK 222

Query: 701  MPDSWSDIEPQTFRIRGKNYLRDKKKEHAPNYAAYYPFGVDVFLSQRKIDHIARFVELPV 880
            + DSWS++EP+TFR+RG+NYLRDKKKEHAPNYAAYYPFGVDVFLSQ KIDHIARFVELPV
Sbjct: 223  VLDSWSNVEPKTFRVRGENYLRDKKKEHAPNYAAYYPFGVDVFLSQTKIDHIARFVELPV 282

Query: 881  VGSSSGELPSILVVNVQIPLYPASFFKSEIDGEGISFVLYFKLSESYAKELSSQFQDNMR 1060
            + SSSG+LP ILVVNVQ+PLYP +FF+ EIDGEG++ VLYFKLSE+Y+KELSSQFQDNMR
Sbjct: 283  LESSSGDLPCILVVNVQVPLYPCAFFQGEIDGEGMNVVLYFKLSETYSKELSSQFQDNMR 342

Query: 1061 RILDDEIEKVKGFPVDTLVPCRERLKILGRVVNIDDLQLSAPERKLMHAYNGKPVLSRPQ 1240
            RILDDEIEKVKGFPVDTLVP RERLKILGRVVN+D+LQLSAPERKLMHAYN KPVLSRPQ
Sbjct: 343  RILDDEIEKVKGFPVDTLVPFRERLKILGRVVNVDELQLSAPERKLMHAYNEKPVLSRPQ 402

Query: 1241 HEFYQGENYMEIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNKXXXXXXXXXXX 1420
            HEFYQGENY EIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNK           
Sbjct: 403  HEFYQGENYFEIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNKAEELPEQILCC 462

Query: 1421 XXXNCIDYMNYHMLELNQE 1477
               N ID M YHML LNQE
Sbjct: 463  VRLNGIDRMRYHMLGLNQE 481


>ref|XP_023740318.1| uncharacterized protein LOC111888363 isoform X1 [Lactuca sativa]
          Length = 483

 Score =  584 bits (1505), Expect = 0.0
 Identities = 296/429 (68%), Positives = 337/429 (78%), Gaps = 4/429 (0%)
 Frame = +2

Query: 203  GSIEDIWYDSAAILESDGSDEDFHSVLNDMLPLDGSEGVSRPNISAVRDGENRRSSVHPV 382
            GS ++ WYDSAA+L+SD S++DF SVL+D+  L+GSEG SR +IS+V   E     ++P 
Sbjct: 59   GSTDESWYDSAAVLDSDCSEDDFQSVLDDVSSLNGSEGASRASISSVHPEE-----MNPR 113

Query: 383  DVSRCXXXXXXXXXXXXXXXXERDNG---GLFDCGIIPSNCLPCLATIDTSVDKRXXXXX 553
              S                  +  +G   GL DCG+IPSNCLPCLA    S++KR     
Sbjct: 114  SRSEGPNEIKPVYLDEISSSVDETSGREDGLLDCGVIPSNCLPCLAATVPSIEKRRSLSS 173

Query: 554  XXXXX-KKAAHKLSFKWKDGHPNASFVSSKKHIQRPIAGSQVPFCSIEKRMPDSWSDIEP 730
                  KK+ HKLSFKWKDGHPNA+  SSK H+QRP AGSQVPFC ++K++ DSWS++EP
Sbjct: 174  SPPSVRKKSTHKLSFKWKDGHPNANIFSSKIHLQRPKAGSQVPFCPLDKKVLDSWSNVEP 233

Query: 731  QTFRIRGKNYLRDKKKEHAPNYAAYYPFGVDVFLSQRKIDHIARFVELPVVGSSSGELPS 910
            +TFR+RG+NYLRDKKKEHAPNYAAYYPFGVDVFLSQ KIDHIARFVELPV+ SSSG+LP 
Sbjct: 234  KTFRVRGENYLRDKKKEHAPNYAAYYPFGVDVFLSQTKIDHIARFVELPVLESSSGDLPC 293

Query: 911  ILVVNVQIPLYPASFFKSEIDGEGISFVLYFKLSESYAKELSSQFQDNMRRILDDEIEKV 1090
            ILVVNVQ+PLYP +FF+ EIDGEG++ VLYFKLSE+Y+KELSSQFQDNMRRILDDEIEKV
Sbjct: 294  ILVVNVQVPLYPCAFFQGEIDGEGMNVVLYFKLSETYSKELSSQFQDNMRRILDDEIEKV 353

Query: 1091 KGFPVDTLVPCRERLKILGRVVNIDDLQLSAPERKLMHAYNGKPVLSRPQHEFYQGENYM 1270
            KGFPVDTLVP RERLKILGRVVN+D+LQLSAPERKLMHAYN KPVLSRPQHEFYQGENY 
Sbjct: 354  KGFPVDTLVPFRERLKILGRVVNVDELQLSAPERKLMHAYNEKPVLSRPQHEFYQGENYF 413

Query: 1271 EIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNKXXXXXXXXXXXXXXNCIDYMN 1450
            EIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNK              N ID M 
Sbjct: 414  EIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNKAEELPEQILCCVRLNGIDRMR 473

Query: 1451 YHMLELNQE 1477
            YHML LNQE
Sbjct: 474  YHMLGLNQE 482


>ref|XP_021999037.1| uncharacterized protein LOC110895955 isoform X1 [Helianthus annuus]
          Length = 467

 Score =  580 bits (1496), Expect = 0.0
 Identities = 292/440 (66%), Positives = 336/440 (76%), Gaps = 1/440 (0%)
 Frame = +2

Query: 164  VAGDQSNQTSASKGSIEDIWYDSAAILESDGSDEDFHSVLNDMLPLDGSEGVSRPNISAV 343
            +  DQS + + S GSI++ WYDS A+LES+ SDEDFHSVL+D+L L+GSE  SRP+I   
Sbjct: 37   ILSDQS-KFAPSAGSIDEHWYDSVAVLESECSDEDFHSVLDDVLSLNGSEVASRPSIDLR 95

Query: 344  RDGENRRSSVHPVDVSRCXXXXXXXXXXXXXXXXERDNGGLFDCGIIPSNCLPCLATIDT 523
               +   +   PV +                      + GL DCG+IP NCLPCLA    
Sbjct: 96   LKSDEHSNESKPVYLDEISSSIDENAGM---------DSGLLDCGMIPGNCLPCLANTIP 146

Query: 524  SVDKRXXXXXXXXXX-KKAAHKLSFKWKDGHPNASFVSSKKHIQRPIAGSQVPFCSIEKR 700
            S++KR           KK +HKLSFK KDGHP+ +  S KK ++RPIAGSQVPFC  EK+
Sbjct: 147  SIEKRRSSSSSPPNTRKKISHKLSFKLKDGHPSTTIFSLKKRLERPIAGSQVPFCPAEKK 206

Query: 701  MPDSWSDIEPQTFRIRGKNYLRDKKKEHAPNYAAYYPFGVDVFLSQRKIDHIARFVELPV 880
            + DSWS +EPQ FR+RGKNY RDKKKEHAPNYAAYYPFGVDVFLSQRK+DHIARFVELP+
Sbjct: 207  VLDSWSHVEPQIFRVRGKNYFRDKKKEHAPNYAAYYPFGVDVFLSQRKVDHIARFVELPI 266

Query: 881  VGSSSGELPSILVVNVQIPLYPASFFKSEIDGEGISFVLYFKLSESYAKELSSQFQDNMR 1060
            VG SSGELP ILVVN+Q+PLYPA+FF+ EIDGEG+SFVLYFKLS++Y+KELSSQFQDNMR
Sbjct: 267  VGPSSGELPPILVVNIQVPLYPAAFFQGEIDGEGVSFVLYFKLSDNYSKELSSQFQDNMR 326

Query: 1061 RILDDEIEKVKGFPVDTLVPCRERLKILGRVVNIDDLQLSAPERKLMHAYNGKPVLSRPQ 1240
            RILDDE+EKVKGFP+DTL P RERLKILGRVVN+DDLQLSAPERK+M+AYN KPVLSRPQ
Sbjct: 327  RILDDEMEKVKGFPLDTLAPFRERLKILGRVVNVDDLQLSAPERKIMNAYNEKPVLSRPQ 386

Query: 1241 HEFYQGENYMEIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNKXXXXXXXXXXX 1420
            HEFYQG NY EIDLDMHRFSYISRKGFEAFQ+RLKNCILD GLTIQGNK           
Sbjct: 387  HEFYQGVNYFEIDLDMHRFSYISRKGFEAFQERLKNCILDFGLTIQGNKQEELPEQILCC 446

Query: 1421 XXXNCIDYMNYHMLELNQEP 1480
               N IDYMNY ML+LN EP
Sbjct: 447  VRLNGIDYMNYQMLKLNSEP 466


>ref|XP_021999038.1| uncharacterized protein LOC110895955 isoform X2 [Helianthus annuus]
 gb|OTG06227.1| hypothetical protein HannXRQ_Chr12g0382371 [Helianthus annuus]
          Length = 466

 Score =  580 bits (1494), Expect = 0.0
 Identities = 292/440 (66%), Positives = 334/440 (75%), Gaps = 1/440 (0%)
 Frame = +2

Query: 164  VAGDQSNQTSASKGSIEDIWYDSAAILESDGSDEDFHSVLNDMLPLDGSEGVSRPNISAV 343
            +  DQS    A  GSI++ WYDS A+LES+ SDEDFHSVL+D+L L+GSE  SRP+I   
Sbjct: 37   ILSDQSK--FAPSGSIDEHWYDSVAVLESECSDEDFHSVLDDVLSLNGSEVASRPSIDLR 94

Query: 344  RDGENRRSSVHPVDVSRCXXXXXXXXXXXXXXXXERDNGGLFDCGIIPSNCLPCLATIDT 523
               +   +   PV +                      + GL DCG+IP NCLPCLA    
Sbjct: 95   LKSDEHSNESKPVYLDEISSSIDENAGM---------DSGLLDCGMIPGNCLPCLANTIP 145

Query: 524  SVDKRXXXXXXXXXX-KKAAHKLSFKWKDGHPNASFVSSKKHIQRPIAGSQVPFCSIEKR 700
            S++KR           KK +HKLSFK KDGHP+ +  S KK ++RPIAGSQVPFC  EK+
Sbjct: 146  SIEKRRSSSSSPPNTRKKISHKLSFKLKDGHPSTTIFSLKKRLERPIAGSQVPFCPAEKK 205

Query: 701  MPDSWSDIEPQTFRIRGKNYLRDKKKEHAPNYAAYYPFGVDVFLSQRKIDHIARFVELPV 880
            + DSWS +EPQ FR+RGKNY RDKKKEHAPNYAAYYPFGVDVFLSQRK+DHIARFVELP+
Sbjct: 206  VLDSWSHVEPQIFRVRGKNYFRDKKKEHAPNYAAYYPFGVDVFLSQRKVDHIARFVELPI 265

Query: 881  VGSSSGELPSILVVNVQIPLYPASFFKSEIDGEGISFVLYFKLSESYAKELSSQFQDNMR 1060
            VG SSGELP ILVVN+Q+PLYPA+FF+ EIDGEG+SFVLYFKLS++Y+KELSSQFQDNMR
Sbjct: 266  VGPSSGELPPILVVNIQVPLYPAAFFQGEIDGEGVSFVLYFKLSDNYSKELSSQFQDNMR 325

Query: 1061 RILDDEIEKVKGFPVDTLVPCRERLKILGRVVNIDDLQLSAPERKLMHAYNGKPVLSRPQ 1240
            RILDDE+EKVKGFP+DTL P RERLKILGRVVN+DDLQLSAPERK+M+AYN KPVLSRPQ
Sbjct: 326  RILDDEMEKVKGFPLDTLAPFRERLKILGRVVNVDDLQLSAPERKIMNAYNEKPVLSRPQ 385

Query: 1241 HEFYQGENYMEIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNKXXXXXXXXXXX 1420
            HEFYQG NY EIDLDMHRFSYISRKGFEAFQ+RLKNCILD GLTIQGNK           
Sbjct: 386  HEFYQGVNYFEIDLDMHRFSYISRKGFEAFQERLKNCILDFGLTIQGNKQEELPEQILCC 445

Query: 1421 XXXNCIDYMNYHMLELNQEP 1480
               N IDYMNY ML+LN EP
Sbjct: 446  VRLNGIDYMNYQMLKLNSEP 465


>gb|PLY68838.1| hypothetical protein LSAT_3X50601 [Lactuca sativa]
          Length = 486

 Score =  580 bits (1495), Expect = 0.0
 Identities = 298/443 (67%), Positives = 341/443 (76%), Gaps = 8/443 (1%)
 Frame = +2

Query: 173  DQSNQTSASKGSIEDIWYDSAAILESDGSDEDFHSVLNDMLPLDGSEGVSRPNISAVRDG 352
            D+S      +GS ++ WYDSAA+L+SD S++DF SVL+D+  L+GSEG SR +IS+V   
Sbjct: 48   DKSFVNPTFRGSTDESWYDSAAVLDSDCSEDDFQSVLDDVSSLNGSEGASRASISSVHPE 107

Query: 353  ENRRSSVHPVDVSRCXXXXXXXXXXXXXXXXERDNG---GLFDCGIIPSNCLPCLATIDT 523
            E     ++P   S                  +  +G   GL DCG+IPSNCLPCLA    
Sbjct: 108  E-----MNPRSRSEGPNEIKPVYLDEISSSVDETSGREDGLLDCGVIPSNCLPCLAATVP 162

Query: 524  SVDKRXXXXXXXXXX-KKAAHKLSFKWKDGHPNASFVSSKKHIQRPIAGSQVPFCSIEKR 700
            S++KR           KK+ HKLSFKWKDGHPNA+  SSK H+QRP AGSQVPFC ++K+
Sbjct: 163  SIEKRRSLSSSPPSVRKKSTHKLSFKWKDGHPNANIFSSKIHLQRPKAGSQVPFCPLDKK 222

Query: 701  MPDSWSDIEPQTFRIRGKNYLRDKKKEHAPNYAAYYPFGVDVFLSQRKIDHIARFVELPV 880
            + DSWS++EP+TFR+RG+NYLRDKKKEHAPNYAAYYPFGVDVFLSQ KIDHIARFVELPV
Sbjct: 223  VLDSWSNVEPKTFRVRGENYLRDKKKEHAPNYAAYYPFGVDVFLSQTKIDHIARFVELPV 282

Query: 881  VGSSSGELPSILVVNVQIPLYPASFFKSEIDGEGISFVLYFKLSESYAKELSSQFQDNMR 1060
            + SSSG+LP ILVVNVQ+PLYP +FF+ EIDGEG++ VLYFKLSE+Y+KELSSQFQDNMR
Sbjct: 283  LESSSGDLPCILVVNVQVPLYPCAFFQGEIDGEGMNVVLYFKLSETYSKELSSQFQDNMR 342

Query: 1061 RILDDEIEKVKGFPVDTLVPCRERLKILGRVVNIDDLQLSAPERKLMHAYNGKPVLSRPQ 1240
            RILDDEIEKVKGFPVDTLVP RERLKILGRVVN+D+LQLSAPERKLMHAYN KPVLSRPQ
Sbjct: 343  RILDDEIEKVKGFPVDTLVPFRERLKILGRVVNVDELQLSAPERKLMHAYNEKPVLSRPQ 402

Query: 1241 HEFYQGENYMEIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQ----GNKXXXXXXX 1408
            HEFYQGENY EIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQ    GNK       
Sbjct: 403  HEFYQGENYFEIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQACLFGNKAEELPEQ 462

Query: 1409 XXXXXXXNCIDYMNYHMLELNQE 1477
                   N ID M YHML LNQE
Sbjct: 463  ILCCVRLNGIDRMRYHMLGLNQE 485


>gb|KVH97752.1| protein of unknown function DUF1336 [Cynara cardunculus var.
            scolymus]
          Length = 491

 Score =  562 bits (1449), Expect = 0.0
 Identities = 288/410 (70%), Positives = 323/410 (78%), Gaps = 15/410 (3%)
 Frame = +2

Query: 203  GSIEDIWYDSAAILESDGSDEDFHSVLNDMLPLDGSEGVSRPNISAVR-----DGENRRS 367
            GS ++ WYDSAAILESD SDEDF SVL+D+L L+ S+G SRP+I+++R     DGE RRS
Sbjct: 48   GSTDESWYDSAAILESDCSDEDFRSVLDDVLSLNSSDGASRPSIASLRDVNLGDGELRRS 107

Query: 368  SVHPVDV---SRCXXXXXXXXXXXXXXXXERDN------GGLFDCGIIPSNCLPCLATID 520
            SVHP D+   SR                    N       GL DCGI+P NCLP LAT  
Sbjct: 108  SVHPEDMDFRSRFDGRSNQTRPVYLDEISSSINDSAGREDGLLDCGIVPGNCLPFLATTV 167

Query: 521  TSVDK-RXXXXXXXXXXKKAAHKLSFKWKDGHPNASFVSSKKHIQRPIAGSQVPFCSIEK 697
             SV+K R          KK AHKLS K KDGHPNA+  SSK H++RPI GSQVPFC  EK
Sbjct: 168  PSVEKRRSLSSSPPSARKKTAHKLSLKLKDGHPNAAMFSSKNHLERPIGGSQVPFCPAEK 227

Query: 698  RMPDSWSDIEPQTFRIRGKNYLRDKKKEHAPNYAAYYPFGVDVFLSQRKIDHIARFVELP 877
            ++ DSWS +EP+TFR+RGKNY RDK+KEHAPNYAAYYPFGVDVFLSQRKIDHIARF+ELP
Sbjct: 228  KVFDSWSYVEPRTFRVRGKNYFRDKRKEHAPNYAAYYPFGVDVFLSQRKIDHIARFIELP 287

Query: 878  VVGSSSGELPSILVVNVQIPLYPASFFKSEIDGEGISFVLYFKLSESYAKELSSQFQDNM 1057
            VVG  SGELP IL+VN+QIPLYPA+FF+ EIDGEG+S++LYFKLS+SY KE SSQFQDNM
Sbjct: 288  VVG-PSGELPPILIVNIQIPLYPAAFFQGEIDGEGMSYILYFKLSDSYTKEFSSQFQDNM 346

Query: 1058 RRILDDEIEKVKGFPVDTLVPCRERLKILGRVVNIDDLQLSAPERKLMHAYNGKPVLSRP 1237
            RRI +DEIEKVKGFPVDTLVP RERLKILGRVVN+DDLQLSAPERK+MHAYN KPVLSRP
Sbjct: 347  RRIFNDEIEKVKGFPVDTLVPFRERLKILGRVVNVDDLQLSAPERKIMHAYNEKPVLSRP 406

Query: 1238 QHEFYQGENYMEIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNK 1387
            QHEFYQGENY EIDLDMHRFSYISRKGFE FQ+RLKNCILD GLTIQ  K
Sbjct: 407  QHEFYQGENYFEIDLDMHRFSYISRKGFEVFQERLKNCILDFGLTIQARK 456


>ref|XP_021999039.1| uncharacterized protein LOC110895955 isoform X3 [Helianthus annuus]
          Length = 441

 Score =  521 bits (1341), Expect = e-177
 Identities = 270/440 (61%), Positives = 310/440 (70%), Gaps = 1/440 (0%)
 Frame = +2

Query: 164  VAGDQSNQTSASKGSIEDIWYDSAAILESDGSDEDFHSVLNDMLPLDGSEGVSRPNISAV 343
            +  DQS + + S GSI++ WYDS A+LES+ SDEDFHSVL+D+L L+GSE  SRP+I   
Sbjct: 37   ILSDQS-KFAPSAGSIDEHWYDSVAVLESECSDEDFHSVLDDVLSLNGSEVASRPSIDLR 95

Query: 344  RDGENRRSSVHPVDVSRCXXXXXXXXXXXXXXXXERDNGGLFDCGIIPSNCLPCLATIDT 523
               +   +   PV +                      + GL DCG+IP NCLPCLA    
Sbjct: 96   LKSDEHSNESKPVYLDEISSSIDENAGM---------DSGLLDCGMIPGNCLPCLANTIP 146

Query: 524  SVDKRXXXXXXXXXX-KKAAHKLSFKWKDGHPNASFVSSKKHIQRPIAGSQVPFCSIEKR 700
            S++KR           KK +HKLSFK KDGHP+ +  S KK ++RPIAGSQVPFC  EK+
Sbjct: 147  SIEKRRSSSSSPPNTRKKISHKLSFKLKDGHPSTTIFSLKKRLERPIAGSQVPFCPAEKK 206

Query: 701  MPDSWSDIEPQTFRIRGKNYLRDKKKEHAPNYAAYYPFGVDVFLSQRKIDHIARFVELPV 880
            + DSWS +EPQ FR+RGKNY RDKKKEHAPNYAAYYPFGVDVFLSQRK+DHIARFVELP+
Sbjct: 207  VLDSWSHVEPQIFRVRGKNYFRDKKKEHAPNYAAYYPFGVDVFLSQRKVDHIARFVELPI 266

Query: 881  VGSSSGELPSILVVNVQIPLYPASFFKSEIDGEGISFVLYFKLSESYAKELSSQFQDNMR 1060
            VG SSGELP ILVVN+Q+PLYPA+FF+ EIDGEG                          
Sbjct: 267  VGPSSGELPPILVVNIQVPLYPAAFFQGEIDGEG-------------------------- 300

Query: 1061 RILDDEIEKVKGFPVDTLVPCRERLKILGRVVNIDDLQLSAPERKLMHAYNGKPVLSRPQ 1240
            RILDDE+EKVKGFP+DTL P RERLKILGRVVN+DDLQLSAPERK+M+AYN KPVLSRPQ
Sbjct: 301  RILDDEMEKVKGFPLDTLAPFRERLKILGRVVNVDDLQLSAPERKIMNAYNEKPVLSRPQ 360

Query: 1241 HEFYQGENYMEIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNKXXXXXXXXXXX 1420
            HEFYQG NY EIDLDMHRFSYISRKGFEAFQ+RLKNCILD GLTIQGNK           
Sbjct: 361  HEFYQGVNYFEIDLDMHRFSYISRKGFEAFQERLKNCILDFGLTIQGNKQEELPEQILCC 420

Query: 1421 XXXNCIDYMNYHMLELNQEP 1480
               N IDYMNY ML+LN EP
Sbjct: 421  VRLNGIDYMNYQMLKLNSEP 440


>gb|EEF42229.1| conserved hypothetical protein [Ricinus communis]
          Length = 512

 Score =  522 bits (1345), Expect = e-177
 Identities = 273/439 (62%), Positives = 324/439 (73%), Gaps = 12/439 (2%)
 Frame = +2

Query: 200  KGSIEDIWYDSAAILESDGSDEDFHSVLNDMLPLDGSEGVSRPNISAVRD---GENRRSS 370
            +GSIED W+DS AI ESD  +ED+ SV +D+L L+GS+G+    +    D   G + R+S
Sbjct: 70   QGSIEDAWFDSVAIFESD-CEEDYESVPDDLLSLNGSDGLPHDQMKKAGDLSAGNSARNS 128

Query: 371  VHPVDVSRCXXXXXXXXXXXXXXXXE--------RDNGGLFDCGIIPSNCLPCLATIDTS 526
            V    VS+                          ++ G L +CGI+P NCLPCLA+  + 
Sbjct: 129  VSEAPVSKFDGPSNEAKQPVFLDEIASSADENAGKEEGLLENCGILPGNCLPCLASTVSQ 188

Query: 527  VDKRXXXXXXXXXX-KKAAHKLSFKWKDGHPNASFVSSKKHIQRPIAGSQVPFCSIEKRM 703
            V+KR           KKAA KLSFKWK+GH N S  SSK  +QRPIAGSQVPFC ++K+M
Sbjct: 189  VEKRRSLSSSPPSARKKAALKLSFKWKEGHANNSLFSSKPILQRPIAGSQVPFCPMDKKM 248

Query: 704  PDSWSDIEPQTFRIRGKNYLRDKKKEHAPNYAAYYPFGVDVFLSQRKIDHIARFVELPVV 883
             D WS IEP +F++RG+NYLRDKKKE AP +AAYYPFGVDVFLS RKIDHIARFVELPV+
Sbjct: 249  LDCWSHIEPGSFKVRGQNYLRDKKKEFAPAHAAYYPFGVDVFLSPRKIDHIARFVELPVI 308

Query: 884  GSSSGELPSILVVNVQIPLYPASFFKSEIDGEGISFVLYFKLSESYAKELSSQFQDNMRR 1063
             +SSG+LP+ILVVNVQIPLY A+ F+SE+DGEG++FVLYFKLSESY+KEL + FQ+++RR
Sbjct: 309  -NSSGKLPTILVVNVQIPLYTAALFQSEVDGEGMNFVLYFKLSESYSKELPAHFQESIRR 367

Query: 1064 ILDDEIEKVKGFPVDTLVPCRERLKILGRVVNIDDLQLSAPERKLMHAYNGKPVLSRPQH 1243
            I+DDE+EKVKGFPVDT+VP RERLKILGRVVN+DDL LS+ ERKLM AYN KPVLSRPQH
Sbjct: 368  IIDDEVEKVKGFPVDTIVPYRERLKILGRVVNVDDLHLSSAERKLMQAYNEKPVLSRPQH 427

Query: 1244 EFYQGENYMEIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNKXXXXXXXXXXXX 1423
            EFY GENY EID+DMHRFSYISRKGFEAF DRLK CILDVGLTIQGNK            
Sbjct: 428  EFYLGENYFEIDIDMHRFSYISRKGFEAFLDRLKICILDVGLTIQGNKAEELPEQILCCV 487

Query: 1424 XXNCIDYMNYHMLELNQEP 1480
              N IDYMNYH L LNQ+P
Sbjct: 488  RLNGIDYMNYHQLGLNQDP 506


>ref|XP_019164732.1| PREDICTED: uncharacterized protein LOC109160928 [Ipomoea nil]
          Length = 506

 Score =  514 bits (1324), Expect = e-173
 Identities = 272/452 (60%), Positives = 317/452 (70%), Gaps = 17/452 (3%)
 Frame = +2

Query: 176  QSNQTSASKGSIEDIWYDSAAILESDGSDEDFHSVLNDMLPLDGSEGVS--RPNISAVRD 349
            +S+      GS+E+ WYDSA I E D SDE+F SV +D+  L+GSE  +  RP  S+   
Sbjct: 55   RSHNNPTLHGSVEEAWYDSATIFECDCSDEEFQSVTDDVHSLNGSEAENAHRPGNSSTSH 114

Query: 350  GE------------NRRSSVHPVDV--SRCXXXXXXXXXXXXXXXXERDNGGLFDCGIIP 487
                          ++ SS+H  D   S+C                 R NG L DCGI+P
Sbjct: 115  SARSSVSGNTKSFIHQHSSMHSKDADGSQCEIKPNEISSCATESC-SRGNGLLDDCGILP 173

Query: 488  SNCLPCLATIDTSVDKRXXXXXXXXXX-KKAAHKLSFKWKDGHPNASFVSSKKHIQRPIA 664
             NCLPCLA+    ++KR           KKAA KLSFKWK+GHP+++ +SSK  ++RPIA
Sbjct: 174  HNCLPCLASAVAPIEKRRSVDSSPPSARKKAALKLSFKWKEGHPHSTLLSSKSLLRRPIA 233

Query: 665  GSQVPFCSIEKRMPDSWSDIEPQTFRIRGKNYLRDKKKEHAPNYAAYYPFGVDVFLSQRK 844
            GSQVPFC +EK+MPDSWS IE  TFR+RG+NY RDKKK+ APN AAYYPFGVDVFLSQRK
Sbjct: 234  GSQVPFCPLEKKMPDSWSHIEAGTFRVRGENYFRDKKKDFAPNCAAYYPFGVDVFLSQRK 293

Query: 845  IDHIARFVELPVVGSSSGELPSILVVNVQIPLYPASFFKSEIDGEGISFVLYFKLSESYA 1024
            IDH+AR VELP+   SSG LP ILVVN Q+PLYP S F+SE DGEGISFV YFKLSESY 
Sbjct: 294  IDHVARLVELPIT-ESSGRLPHILVVNCQVPLYPTSIFQSETDGEGISFVFYFKLSESYT 352

Query: 1025 KELSSQFQDNMRRILDDEIEKVKGFPVDTLVPCRERLKILGRVVNIDDLQLSAPERKLMH 1204
            KEL S FQ+++RR++DDE+EKVKGFPVD++VP RERLKILGRV N+DDL LSA ERKLMH
Sbjct: 353  KELPSHFQESIRRLIDDEVEKVKGFPVDSIVPFRERLKILGRVANVDDLPLSAAERKLMH 412

Query: 1205 AYNGKPVLSRPQHEFYQGENYMEIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGN 1384
            AYN KPVLSRPQHEFY GENY EIDLDMHRFSYISRKGFEAF DRLK+  LD GLTIQGN
Sbjct: 413  AYNEKPVLSRPQHEFYTGENYFEIDLDMHRFSYISRKGFEAFFDRLKHFNLDFGLTIQGN 472

Query: 1385 KXXXXXXXXXXXXXXNCIDYMNYHMLELNQEP 1480
            K              N IDY NY  L LNQ+P
Sbjct: 473  KSEEMPEQILCCLRLNEIDYANYQQLGLNQDP 504


>ref|XP_017612608.1| PREDICTED: uncharacterized protein LOC108457911 isoform X2 [Gossypium
            arboreum]
          Length = 480

 Score =  508 bits (1308), Expect = e-171
 Identities = 266/436 (61%), Positives = 316/436 (72%), Gaps = 1/436 (0%)
 Frame = +2

Query: 176  QSNQTSASKGSIEDIWYDSAAILESDGSDEDFHSVLNDMLPLDGSEGVSRPNISAVRDGE 355
            +S+ T+ +    +++W+D  ++ +SD  +E+F SV  D L L+G EGV+  NIS++RD  
Sbjct: 49   RSSFTTPTFQGSQELWFDPVSVFDSD-CEEEFESVQEDTLSLNGLEGVASSNISSLRDAN 107

Query: 356  NRRSSVHPVDVSRCXXXXXXXXXXXXXXXXERDNGGLFDCGIIPSNCLPCLATIDTSVDK 535
                S     + +                  ++ G L +CGI+PSNCLPCLA+  +SV+K
Sbjct: 108  YGEHSSLVDQIQK---------PGDLSTGPGKEVGLLDNCGILPSNCLPCLASTVSSVEK 158

Query: 536  RXXXXXXXXXX-KKAAHKLSFKWKDGHPNASFVSSKKHIQRPIAGSQVPFCSIEKRMPDS 712
            R           KK A KL FKWK+GHPNA+  SSK+ +QRP AGSQVPFC  EKRM D 
Sbjct: 159  RRSLSSSPPSARKKNALKLPFKWKEGHPNAALFSSKRLLQRPKAGSQVPFCPTEKRMFDC 218

Query: 713  WSDIEPQTFRIRGKNYLRDKKKEHAPNYAAYYPFGVDVFLSQRKIDHIARFVELPVVGSS 892
            WS IEP TF++R +NY RDKKK+ A N+AAYYPFGVDVFLS RKIDHIARFVELPVV S 
Sbjct: 219  WSHIEPGTFKVRSENYFRDKKKDFAHNHAAYYPFGVDVFLSPRKIDHIARFVELPVV-SH 277

Query: 893  SGELPSILVVNVQIPLYPASFFKSEIDGEGISFVLYFKLSESYAKELSSQFQDNMRRILD 1072
            SG+LPSILVVNVQIPLYP + F SEIDGEG++FVLYFKLS+SY KEL   FQ+N+RRI+D
Sbjct: 278  SGKLPSILVVNVQIPLYPPALFHSEIDGEGMNFVLYFKLSDSYLKELPPHFQENIRRIID 337

Query: 1073 DEIEKVKGFPVDTLVPCRERLKILGRVVNIDDLQLSAPERKLMHAYNGKPVLSRPQHEFY 1252
            DE+EKVKGFPVDT VP RERLKILGRV N++DL +SA ERKLM AYN KPVLSRPQHEFY
Sbjct: 338  DEVEKVKGFPVDTNVPFRERLKILGRVANVEDLHMSAAERKLMQAYNEKPVLSRPQHEFY 397

Query: 1253 QGENYMEIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNKXXXXXXXXXXXXXXN 1432
             GENY EID+DMHRFSYISRKGF+AF DRLK CILDVGLTIQGNK              +
Sbjct: 398  SGENYFEIDIDMHRFSYISRKGFDAFLDRLKFCILDVGLTIQGNKPEELPEQILCCVRLS 457

Query: 1433 CIDYMNYHMLELNQEP 1480
             IDYMNYH L LNQEP
Sbjct: 458  GIDYMNYHQLSLNQEP 473


>ref|XP_008339963.1| PREDICTED: uncharacterized protein LOC103402953 [Malus domestica]
          Length = 534

 Score =  509 bits (1312), Expect = e-171
 Identities = 276/474 (58%), Positives = 330/474 (69%), Gaps = 41/474 (8%)
 Frame = +2

Query: 182  NQTSASKGSIEDIWYDSAAILESDGSDEDFHSVLNDMLPLDGSEGVSRPNISAVRD---- 349
            N  +  +GS ED W+D  A  ESD  DEDFHSV +++L ++G E VS  +  ++RD    
Sbjct: 63   NNPTFQEGS-EDAWFDPVARFESD-CDEDFHSVQDEVLSVNGFERVSVSSNLSLRDANCG 120

Query: 350  ------------------GENRRSSVHPV------------DV---SRCXXXXXXXXXXX 430
                              G++  +SV  V            DV   S             
Sbjct: 121  EYNIIDLHASSADQMHKRGDSANNSVSVVSQKSINHIMSGNDVDGHSTAEANQPVFLDEI 180

Query: 431  XXXXXE---RDNGGLFDCGIIPSNCLPCLATIDTSVDKRXXXXXXXXXX-KKAAHKLSFK 598
                 E   ++ G L +CGI+PS+CLPCLA+   SV+KR           KKAA KL FK
Sbjct: 181  SSSVDESSTKEEGILDNCGILPSHCLPCLASTVPSVEKRRSLSSSPPSARKKAAIKLPFK 240

Query: 599  WKDGHPNASFVSSKKHIQRPIAGSQVPFCSIEKRMPDSWSDIEPQTFRIRGKNYLRDKKK 778
            WK+GHPNAS +SSK  +QRPIAGSQVPFC +EK+M DSWS IEP +F++RG NY +D+KK
Sbjct: 241  WKEGHPNASLLSSKMLLQRPIAGSQVPFCPMEKKMFDSWSHIEPNSFKVRGPNYFKDRKK 300

Query: 779  EHAPNYAAYYPFGVDVFLSQRKIDHIARFVELPVVGSSSGELPSILVVNVQIPLYPASFF 958
            EHAP+YAAYYPFG+DVFLSQRKIDHIARFVELPVV SSSG+LP+ILVVNVQ+PLYPA+ F
Sbjct: 301  EHAPSYAAYYPFGLDVFLSQRKIDHIARFVELPVV-SSSGDLPAILVVNVQVPLYPAAIF 359

Query: 959  KSEIDGEGISFVLYFKLSESYAKELSSQFQDNMRRILDDEIEKVKGFPVDTLVPCRERLK 1138
            + E DGEG++FVLYFKL++ Y+KEL   FQ+N+RR++ DE+EKVKGFPVDT+VP RERLK
Sbjct: 360  QGETDGEGMNFVLYFKLNDMYSKELPPNFQENIRRLIGDEVEKVKGFPVDTIVPFRERLK 419

Query: 1139 ILGRVVNIDDLQLSAPERKLMHAYNGKPVLSRPQHEFYQGENYMEIDLDMHRFSYISRKG 1318
            ILGRV N++DL LSAPERKLM AYN KPVLSRPQHEFY GENY+EIDLDMHRFSYISRKG
Sbjct: 420  ILGRVANVEDLHLSAPERKLMQAYNEKPVLSRPQHEFYMGENYLEIDLDMHRFSYISRKG 479

Query: 1319 FEAFQDRLKNCILDVGLTIQGNKXXXXXXXXXXXXXXNCIDYMNYHMLELNQEP 1480
            FEAF DRLK+CILDVGLTIQGNK              N IDYMNYH L L Q+P
Sbjct: 480  FEAFLDRLKHCILDVGLTIQGNKPEELPEQILCCIRLNGIDYMNYHQLGLTQDP 533


>ref|XP_012458818.1| PREDICTED: uncharacterized protein LOC105779560 isoform X2 [Gossypium
            raimondii]
 gb|KJB77104.1| hypothetical protein B456_012G120400 [Gossypium raimondii]
          Length = 480

 Score =  505 bits (1301), Expect = e-170
 Identities = 265/436 (60%), Positives = 315/436 (72%), Gaps = 1/436 (0%)
 Frame = +2

Query: 176  QSNQTSASKGSIEDIWYDSAAILESDGSDEDFHSVLNDMLPLDGSEGVSRPNISAVRDGE 355
            +S+ T+ +    +++W+D  ++ +SD  +E+F SV  D L L+G EGV+  NIS++RD  
Sbjct: 49   RSSFTNPAFQGSQELWFDPVSVFDSD-CEEEFESVQEDTLSLNGLEGVASSNISSLRDAN 107

Query: 356  NRRSSVHPVDVSRCXXXXXXXXXXXXXXXXERDNGGLFDCGIIPSNCLPCLATIDTSVDK 535
                S     + +                  ++ G L +CGI+PSNCLPCLA+  +SV+K
Sbjct: 108  YGEHSSLVDQMQK---------PGGLSTGPGKEVGLLDNCGILPSNCLPCLASTVSSVEK 158

Query: 536  RXXXXXXXXXX-KKAAHKLSFKWKDGHPNASFVSSKKHIQRPIAGSQVPFCSIEKRMPDS 712
            R           KK A KL FKWK+GHPNA+  SSK+ +QRP AGSQVPFC  EKRM D 
Sbjct: 159  RRSLSSSPPSARKKNALKLPFKWKEGHPNAALFSSKRLLQRPKAGSQVPFCPTEKRMFDC 218

Query: 713  WSDIEPQTFRIRGKNYLRDKKKEHAPNYAAYYPFGVDVFLSQRKIDHIARFVELPVVGSS 892
            WS IEP TF++R +NY RDKKK+ A N+AAYYPFGVDVFLS RKIDHIARFVELPVVG S
Sbjct: 219  WSHIEPGTFKVRSENYFRDKKKDFAHNHAAYYPFGVDVFLSPRKIDHIARFVELPVVGHS 278

Query: 893  SGELPSILVVNVQIPLYPASFFKSEIDGEGISFVLYFKLSESYAKELSSQFQDNMRRILD 1072
             G+LPSILVVNVQIPLYP + F SEIDGEG++FVLYFKLS+SY KEL   FQ+N+RRI+D
Sbjct: 279  -GKLPSILVVNVQIPLYPPALFHSEIDGEGMNFVLYFKLSDSYLKELPPHFQENIRRIID 337

Query: 1073 DEIEKVKGFPVDTLVPCRERLKILGRVVNIDDLQLSAPERKLMHAYNGKPVLSRPQHEFY 1252
            D +EKVKGFPVDT VP RERLKILGRV N++DL +SA ERKLM AYN KPVLSRPQHEFY
Sbjct: 338  DGVEKVKGFPVDTNVPFRERLKILGRVANVEDLHMSAAERKLMQAYNEKPVLSRPQHEFY 397

Query: 1253 QGENYMEIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNKXXXXXXXXXXXXXXN 1432
             GENY EID+DMHRFSYISRKGF+AF DRLK CILDVGLTIQGNK              +
Sbjct: 398  SGENYFEIDIDMHRFSYISRKGFDAFLDRLKFCILDVGLTIQGNKPEELPEQILCCVRLS 457

Query: 1433 CIDYMNYHMLELNQEP 1480
             IDYMNYH L LNQEP
Sbjct: 458  GIDYMNYHQLSLNQEP 473


>ref|XP_016739139.1| PREDICTED: uncharacterized protein LOC107948969 isoform X2 [Gossypium
            hirsutum]
          Length = 480

 Score =  505 bits (1300), Expect = e-170
 Identities = 268/442 (60%), Positives = 318/442 (71%), Gaps = 7/442 (1%)
 Frame = +2

Query: 176  QSNQTSASKGSIEDIWYDSAAILESDGSDEDFHSVLNDMLPLDGSEGVSRPNISAVRDGE 355
            +S+ T+ +    +++W+D  ++ +SD  +E+F SV  D L L+G EGV+  NIS++RD  
Sbjct: 49   RSSFTNPTFQGSQELWFDPVSVFDSD-CEEEFESVQEDTLSLNGLEGVASSNISSLRDAN 107

Query: 356  -NRRSSV-----HPVDVSRCXXXXXXXXXXXXXXXXERDNGGLFDCGIIPSNCLPCLATI 517
                SS+      P D+S                   ++ G L +CGI+PSNCLPCLA+ 
Sbjct: 108  YGEHSSLVDQMQKPGDLST---------------GPGKEVGLLDNCGILPSNCLPCLAST 152

Query: 518  DTSVDKRXXXXXXXXXX-KKAAHKLSFKWKDGHPNASFVSSKKHIQRPIAGSQVPFCSIE 694
             +SV+KR           KK A KL FKWK+GHPNA+  SSK+ +QRP AGSQVPFC  E
Sbjct: 153  VSSVEKRRSLSSSPPSARKKNALKLPFKWKEGHPNAALFSSKRLLQRPKAGSQVPFCPTE 212

Query: 695  KRMPDSWSDIEPQTFRIRGKNYLRDKKKEHAPNYAAYYPFGVDVFLSQRKIDHIARFVEL 874
            KRM D WS IEP TF++R +NY RDKKK+ A N+AAYYPFGVDVFLS RKIDHIARFVEL
Sbjct: 213  KRMFDCWSHIEPGTFKVRSENYFRDKKKDFAHNHAAYYPFGVDVFLSPRKIDHIARFVEL 272

Query: 875  PVVGSSSGELPSILVVNVQIPLYPASFFKSEIDGEGISFVLYFKLSESYAKELSSQFQDN 1054
            PVVG S G+LPSILVVNVQIPLYP + F SEIDGEG++FVLYFKLS+SY KEL   FQ+N
Sbjct: 273  PVVGHS-GKLPSILVVNVQIPLYPPALFHSEIDGEGMNFVLYFKLSDSYLKELPPHFQEN 331

Query: 1055 MRRILDDEIEKVKGFPVDTLVPCRERLKILGRVVNIDDLQLSAPERKLMHAYNGKPVLSR 1234
            +RRI+DDE+EKVKGFPVDT VP RERLKILGRV N++DL +SA ERKLM AYN KPVLSR
Sbjct: 332  IRRIIDDEVEKVKGFPVDTNVPFRERLKILGRVANVEDLHMSAAERKLMQAYNEKPVLSR 391

Query: 1235 PQHEFYQGENYMEIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNKXXXXXXXXX 1414
            PQHEFY GENY EID+DMHRF Y SRKGF+AF DRLK CILDVGLTIQGNK         
Sbjct: 392  PQHEFYSGENYFEIDIDMHRFRYTSRKGFDAFLDRLKFCILDVGLTIQGNKPEELPEQIL 451

Query: 1415 XXXXXNCIDYMNYHMLELNQEP 1480
                 + IDYMNYH L LNQEP
Sbjct: 452  CCVRLSGIDYMNYHQLSLNQEP 473


>ref|XP_015965313.1| uncharacterized protein LOC107489041 [Arachis duranensis]
          Length = 486

 Score =  503 bits (1294), Expect = e-169
 Identities = 261/451 (57%), Positives = 324/451 (71%), Gaps = 4/451 (0%)
 Frame = +2

Query: 140  KRKSNCLYVAGDQ--SNQTSASKGSIEDIWYDSAAILESDGSDEDFHSVLNDMLPLDGSE 313
            +R S+ LY       +N      GSIE+ W+DS  + +SD  D+D+ SV +D+L L+G++
Sbjct: 40   RRVSSKLYKGSSSLDNNVLDLLCGSIEEAWFDSNVVFDSD-CDDDYQSVPDDLLSLNGND 98

Query: 314  GVSRPNISAVRDGENR-RSSVHPVDVSRCXXXXXXXXXXXXXXXXERDNGGLFDCGIIPS 490
               R + ++V   +++ +S  + VD +                   ++ G L +CGI+P+
Sbjct: 99   ANHRVSTASVDATDHQSKSDGNIVDANE---PVFVDEISSVDANSNKEEGILDNCGILPN 155

Query: 491  NCLPCLATIDTSVDKRXXXXXXXXXX-KKAAHKLSFKWKDGHPNASFVSSKKHIQRPIAG 667
            NCLPCLA+   S++KR           KKA  KLSFKWK+GH NA+ +S+K  +QRPIAG
Sbjct: 156  NCLPCLASTVPSIEKRRSSSSSPPSARKKAPMKLSFKWKEGHGNATLLSTKTLLQRPIAG 215

Query: 668  SQVPFCSIEKRMPDSWSDIEPQTFRIRGKNYLRDKKKEHAPNYAAYYPFGVDVFLSQRKI 847
            SQVPFC I+K+M D WS I+P TF++RG NY +DKKK+ APNY+AYYPFGVDVFLS RK+
Sbjct: 216  SQVPFCPIDKKMLDCWSHIDPSTFKVRGVNYFKDKKKDFAPNYSAYYPFGVDVFLSPRKV 275

Query: 848  DHIARFVELPVVGSSSGELPSILVVNVQIPLYPASFFKSEIDGEGISFVLYFKLSESYAK 1027
            DHIARFVELP + SSSG+LP ILVVNVQIPLYPA+ F+ E DG+G+SFVLYFKLSE Y+K
Sbjct: 276  DHIARFVELPFI-SSSGKLPPILVVNVQIPLYPATIFQGETDGDGMSFVLYFKLSEGYSK 334

Query: 1028 ELSSQFQDNMRRILDDEIEKVKGFPVDTLVPCRERLKILGRVVNIDDLQLSAPERKLMHA 1207
            EL    Q+++R+++DDE+EKVKGFPVDT+ P RERLKILGRVVN++DL LSA ERKLMHA
Sbjct: 335  ELPLHLQESIRKLMDDEVEKVKGFPVDTIAPFRERLKILGRVVNLEDLHLSAAERKLMHA 394

Query: 1208 YNGKPVLSRPQHEFYQGENYMEIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNK 1387
            YN KPVLSRPQHEFY GENY EIDLDMHRFSYISRKGFEAF DRLK C LDVGLTIQGNK
Sbjct: 395  YNEKPVLSRPQHEFYSGENYFEIDLDMHRFSYISRKGFEAFLDRLKICTLDVGLTIQGNK 454

Query: 1388 XXXXXXXXXXXXXXNCIDYMNYHMLELNQEP 1480
                          N IDYMNYH L L Q+P
Sbjct: 455  AEELPEQVLCCVRLNGIDYMNYHQLGLTQDP 485


>ref|XP_016680860.1| PREDICTED: uncharacterized protein LOC107899605 isoform X2 [Gossypium
            hirsutum]
          Length = 493

 Score =  503 bits (1294), Expect = e-169
 Identities = 264/436 (60%), Positives = 314/436 (72%), Gaps = 1/436 (0%)
 Frame = +2

Query: 176  QSNQTSASKGSIEDIWYDSAAILESDGSDEDFHSVLNDMLPLDGSEGVSRPNISAVRDGE 355
            +S+ T+ +    +++W+D  ++ +SD  +E+F SV  D L L+G EGV+  NIS++RD  
Sbjct: 62   RSSFTNPAFQGSQELWFDPVSVFDSD-CEEEFESVQEDTLSLNGLEGVASSNISSLRDAN 120

Query: 356  NRRSSVHPVDVSRCXXXXXXXXXXXXXXXXERDNGGLFDCGIIPSNCLPCLATIDTSVDK 535
                S     + +                  ++ G L +CGI+PSNCLPCLA+  +SV+K
Sbjct: 121  YGEHSSLVDQMQK---------PGGLSTGPGKEVGLLDNCGILPSNCLPCLASTVSSVEK 171

Query: 536  RXXXXXXXXXX-KKAAHKLSFKWKDGHPNASFVSSKKHIQRPIAGSQVPFCSIEKRMPDS 712
            R           KK A KL FKWK+GHPNA+  SSK+ +QRP AGSQVPFC  EKRM D 
Sbjct: 172  RRSLSSSPPSARKKNALKLPFKWKEGHPNAALFSSKRLLQRPKAGSQVPFCPTEKRMFDC 231

Query: 713  WSDIEPQTFRIRGKNYLRDKKKEHAPNYAAYYPFGVDVFLSQRKIDHIARFVELPVVGSS 892
            WS IEP TF++R +NY RDKKK+ A N+AAYYPFGVDVFLS RKIDHIARFVELPVVG S
Sbjct: 232  WSHIEPGTFKVRSENYFRDKKKDFAHNHAAYYPFGVDVFLSPRKIDHIARFVELPVVGHS 291

Query: 893  SGELPSILVVNVQIPLYPASFFKSEIDGEGISFVLYFKLSESYAKELSSQFQDNMRRILD 1072
             G+LPSILVVNVQIPLYP + F SEIDGEG++FVLYFKLS+SY K L   FQ+N+RRI+D
Sbjct: 292  -GKLPSILVVNVQIPLYPPALFHSEIDGEGMNFVLYFKLSDSYLKVLPPHFQENIRRIID 350

Query: 1073 DEIEKVKGFPVDTLVPCRERLKILGRVVNIDDLQLSAPERKLMHAYNGKPVLSRPQHEFY 1252
            D +EKVKGFPVDT VP RERLKILGRV N++DL +SA ERKLM AYN KPVLSRPQHEFY
Sbjct: 351  DGVEKVKGFPVDTNVPFRERLKILGRVANVEDLHMSAAERKLMQAYNEKPVLSRPQHEFY 410

Query: 1253 QGENYMEIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNKXXXXXXXXXXXXXXN 1432
             GENY EID+DMHRFSYISRKGF+AF DRLK CILDVGLTIQGNK              +
Sbjct: 411  SGENYFEIDIDMHRFSYISRKGFDAFLDRLKFCILDVGLTIQGNKPEELPEQILCCVRLS 470

Query: 1433 CIDYMNYHMLELNQEP 1480
             IDYMNYH L LNQEP
Sbjct: 471  GIDYMNYHQLSLNQEP 486


>ref|XP_016202565.1| uncharacterized protein LOC107643434 isoform X2 [Arachis ipaensis]
          Length = 498

 Score =  502 bits (1293), Expect = e-169
 Identities = 256/428 (59%), Positives = 316/428 (73%), Gaps = 2/428 (0%)
 Frame = +2

Query: 203  GSIEDIWYDSAAILESDGSDEDFHSVLNDMLPLDGSEGVSRPNISAVRDGENR-RSSVHP 379
            GSIE+ W+DS  + +SD  D+D+ SV +D+L L+G++   R + ++V   +++ +S  + 
Sbjct: 75   GSIEEAWFDSNVVFDSD-CDDDYQSVPDDLLSLNGNDANHRVSTASVDATDHQSKSDGNI 133

Query: 380  VDVSRCXXXXXXXXXXXXXXXXERDNGGLFDCGIIPSNCLPCLATIDTSVDKRXXXXXXX 559
            VD +                   ++ G L +CGI+P+NCLPCLA+   S++KR       
Sbjct: 134  VDANE---PVFVDEISSVDANSNKEEGILDNCGILPNNCLPCLASTVPSIEKRRSSSSSP 190

Query: 560  XXX-KKAAHKLSFKWKDGHPNASFVSSKKHIQRPIAGSQVPFCSIEKRMPDSWSDIEPQT 736
                KKA  KLSFKWK+GH NA+ +S+K  +QRPIAGSQVPFC I+K+M D WS I+P T
Sbjct: 191  PSARKKAPMKLSFKWKEGHGNATLLSTKTLLQRPIAGSQVPFCPIDKKMLDCWSHIDPST 250

Query: 737  FRIRGKNYLRDKKKEHAPNYAAYYPFGVDVFLSQRKIDHIARFVELPVVGSSSGELPSIL 916
            F++RG NY +DKKK+ APNY+AYYPFGVDVFLS RK+DHIARFVELP + SSSG+LP IL
Sbjct: 251  FKVRGVNYFKDKKKDFAPNYSAYYPFGVDVFLSPRKVDHIARFVELPFI-SSSGKLPPIL 309

Query: 917  VVNVQIPLYPASFFKSEIDGEGISFVLYFKLSESYAKELSSQFQDNMRRILDDEIEKVKG 1096
            VVNVQIPLYPA+ F+ E DG+G+SFVLYFKLSE Y+KEL    Q+++R+++DDE+EKVKG
Sbjct: 310  VVNVQIPLYPATLFQGETDGDGMSFVLYFKLSEGYSKELPLHLQESIRKLMDDEVEKVKG 369

Query: 1097 FPVDTLVPCRERLKILGRVVNIDDLQLSAPERKLMHAYNGKPVLSRPQHEFYQGENYMEI 1276
            FPVDT+ P RERLKILGRVVN++DL LSA ERKLMHAYN KPVLSRPQHEFY GENY EI
Sbjct: 370  FPVDTIAPFRERLKILGRVVNLEDLHLSAAERKLMHAYNEKPVLSRPQHEFYSGENYFEI 429

Query: 1277 DLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNKXXXXXXXXXXXXXXNCIDYMNYH 1456
            DLDMHRFSYISRKGFEAF DRLK C LDVGLTIQGNK              N IDYMNYH
Sbjct: 430  DLDMHRFSYISRKGFEAFLDRLKICTLDVGLTIQGNKAEELPEQVLCCVRLNGIDYMNYH 489

Query: 1457 MLELNQEP 1480
             L L Q+P
Sbjct: 490  QLGLTQDP 497


>ref|XP_011071045.1| uncharacterized protein LOC105156572 [Sesamum indicum]
          Length = 550

 Score =  503 bits (1294), Expect = e-168
 Identities = 262/439 (59%), Positives = 312/439 (71%), Gaps = 14/439 (3%)
 Frame = +2

Query: 203  GSIEDIWYDSAAILESDGSDEDFHSVLNDMLPLDGSEGVS--------RPNISAVRDGEN 358
            GS E+ W+DSAA+LESD SDEDF S+ +D++ + G +G S          + SA      
Sbjct: 110  GSSEEAWFDSAAVLESDWSDEDFQSIPDDVISVSGCDGTSVSGSVEHLENSSSANSLSGA 169

Query: 359  RRSSVHPVDVSRCXXXXXXXXXXXXXXXXE------RDNGGLFDCGIIPSNCLPCLATID 520
             RSSVHP D                    E       D+G L +CGI+P+NCLPCLA+  
Sbjct: 170  ARSSVHPSDYDFKVKSDEPINGKKPVFVDEISCSAGGDDGLLNNCGILPNNCLPCLASTV 229

Query: 521  TSVDKRXXXXXXXXXXKKAAHKLSFKWKDGHPNASFVSSKKHIQRPIAGSQVPFCSIEKR 700
                ++          KKAA KL FKWK+G+P A+F+SSK  +QRPIAGSQVPFC + KR
Sbjct: 230  PIEKRQSLSSSPPSMRKKAAVKLPFKWKEGNPTANFLSSKPLLQRPIAGSQVPFCPLGKR 289

Query: 701  MPDSWSDIEPQTFRIRGKNYLRDKKKEHAPNYAAYYPFGVDVFLSQRKIDHIARFVELPV 880
            +PDSWSD++P TFR+RG NYLRDK+KE APN AAYYPFG+DVFLSQRKI HI RFVELP+
Sbjct: 290  VPDSWSDVQPGTFRVRGVNYLRDKRKEFAPNCAAYYPFGLDVFLSQRKIHHIGRFVELPL 349

Query: 881  VGSSSGELPSILVVNVQIPLYPASFFKSEIDGEGISFVLYFKLSESYAKELSSQFQDNMR 1060
            + +S G+LP ILVVNVQIPLYPA+ F+ E DGEGISFVLYFKLSES+AK+L + FQ+N++
Sbjct: 350  I-NSLGKLPPILVVNVQIPLYPAAIFQGETDGEGISFVLYFKLSESFAKDLPAHFQENIK 408

Query: 1061 RILDDEIEKVKGFPVDTLVPCRERLKILGRVVNIDDLQLSAPERKLMHAYNGKPVLSRPQ 1240
            R++DDE+EKVKGF  DT+VP RERLKILGRVVN+DDL +SA ERKLMHAYN KPVLSRPQ
Sbjct: 409  RLIDDEVEKVKGFRTDTVVPFRERLKILGRVVNVDDLPMSAAERKLMHAYNEKPVLSRPQ 468

Query: 1241 HEFYQGENYMEIDLDMHRFSYISRKGFEAFQDRLKNCILDVGLTIQGNKXXXXXXXXXXX 1420
            HEFY GENY EIDLDMHRFSYISRKGFE F DRLK C+LD GLTIQ NK           
Sbjct: 469  HEFYAGENYFEIDLDMHRFSYISRKGFETFLDRLKLCVLDFGLTIQDNKAEELPEQILCC 528

Query: 1421 XXXNCIDYMNYHMLELNQE 1477
               N IDY+NY  L   +E
Sbjct: 529  IRLNEIDYVNYQQLGFCEE 547

