BLASTX nr result

ID: Chrysanthemum22_contig00043591 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum22_contig00043591
         (828 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_022008785.1| uncharacterized protein LOC110908193 [Helian...   382   e-127
gb|KVH92524.1| protein of unknown function DUF547 [Cynara cardun...   372   e-123
ref|XP_023743276.1| uncharacterized protein LOC111891456 [Lactuc...   365   e-120
ref|XP_011039198.1| PREDICTED: uncharacterized protein LOC105135...   231   8e-69
ref|XP_011039196.1| PREDICTED: uncharacterized protein LOC105135...   231   4e-68
ref|XP_011039195.1| PREDICTED: uncharacterized protein LOC105135...   231   6e-68
ref|XP_011039194.1| PREDICTED: uncharacterized protein LOC105135...   231   6e-68
gb|PLY66546.1| hypothetical protein LSAT_4X167461 [Lactuca sativa]    217   4e-66
ref|XP_006380714.1| hypothetical protein POPTR_0007s11280g [Popu...   226   5e-66
ref|XP_006380715.1| hypothetical protein POPTR_0007s11280g [Popu...   226   5e-66
ref|XP_018850501.1| PREDICTED: uncharacterized protein LOC109013...   225   6e-66
ref|XP_023875256.1| uncharacterized protein LOC111987747 isoform...   224   1e-65
ref|XP_017980875.1| PREDICTED: uncharacterized protein LOC186110...   223   3e-65
ref|XP_017980873.1| PREDICTED: uncharacterized protein LOC186110...   223   5e-65
ref|XP_007047200.1| PREDICTED: uncharacterized protein LOC186110...   223   5e-65
ref|XP_003635502.2| PREDICTED: uncharacterized protein LOC100855...   221   1e-64
ref|XP_002268917.3| PREDICTED: uncharacterized protein LOC100256...   222   1e-64
ref|XP_018850503.1| PREDICTED: uncharacterized protein LOC109013...   219   5e-64
ref|XP_018850500.1| PREDICTED: uncharacterized protein LOC109013...   219   1e-63
ref|XP_023875257.1| uncharacterized protein LOC111987747 isoform...   218   2e-63

>ref|XP_022008785.1| uncharacterized protein LOC110908193 [Helianthus annuus]
 ref|XP_022008786.1| uncharacterized protein LOC110908193 [Helianthus annuus]
 gb|OTF97072.1| Protein of unknown function, DUF547 [Helianthus annuus]
          Length = 586

 Score =  382 bits (980), Expect = e-127
 Identities = 198/279 (70%), Positives = 220/279 (78%), Gaps = 4/279 (1%)
 Frame = +2

Query: 2    DVMKSDASNGDDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPLVKFEVSEKCIPP 181
            D+ KSD+   DD + GKENRSS NSNSK+ + PERV  E+Q+ V++P +KFE + KC PP
Sbjct: 182  DLKKSDSVISDDGVLGKENRSSANSNSKVKSSPERVPEEMQSLVRKPQIKFETAGKCSPP 241

Query: 182  GPQLQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGIFLRLSKLKAKTM 361
             PQ  NRL DQE+AQE       GDR++EAESECNKISE+VLKCLIGIFLRLSKLKAKTM
Sbjct: 242  KPQ--NRLADQEKAQESCSSSSSGDRIVEAESECNKISENVLKCLIGIFLRLSKLKAKTM 299

Query: 362  DAEAFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEASS----XXXXAS 529
            DAEAFSNLMSV LT GD+GP+FRDPYGVCLKSKRRDIGPYKDLFAIEA S        AS
Sbjct: 300  DAEAFSNLMSVDLTGGDQGPAFRDPYGVCLKSKRRDIGPYKDLFAIEAGSIDFKKKMNAS 359

Query: 530  XXXXXXXXXXXXXASVNLEGLTHQQKLAFWINTYNICMMNAYLGHGINESPEMLATLMQK 709
                         ASVNLEGLTHQQKLAFWINTYNICMMNAYL HGI ESPEM+ TLMQK
Sbjct: 360  LLIRRLKLLLEKLASVNLEGLTHQQKLAFWINTYNICMMNAYLEHGIPESPEMMPTLMQK 419

Query: 710  ATINVGGLLLNAVTIEHSILRLPYRLKLSCSKSPERNET 826
            ATIN GG LLNA++IEH ILRLP RLKLSC +SPE+NET
Sbjct: 420  ATINAGGYLLNAISIEHFILRLPNRLKLSCPQSPEQNET 458


>gb|KVH92524.1| protein of unknown function DUF547 [Cynara cardunculus var.
           scolymus]
          Length = 580

 Score =  372 bits (954), Expect = e-123
 Identities = 197/275 (71%), Positives = 217/275 (78%)
 Frame = +2

Query: 2   DVMKSDASNGDDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPLVKFEVSEKCIPP 181
           D+ K D+S GDD + GKENRSS+NSNSK+ N PER+AN+VQNSVKR  VK   +EK  PP
Sbjct: 190 DLKKPDSSVGDDGVLGKENRSSSNSNSKLKNSPERIANKVQNSVKRIPVKSVTAEKRTPP 249

Query: 182 GPQLQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGIFLRLSKLKAKTM 361
             QLQNRL DQERAQE       GDR+LEAESECNKISE+ LKCLIGIFLRLSKLKAKTM
Sbjct: 250 KLQLQNRLADQERAQESCSSSSSGDRMLEAESECNKISENALKCLIGIFLRLSKLKAKTM 309

Query: 362 DAEAFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEASSXXXXASXXXX 541
           DAEAFSNLMS+ LT GDRGP+FRDPYG+CLKSKRRDIGPYK LFAIEA S          
Sbjct: 310 DAEAFSNLMSLDLTGGDRGPAFRDPYGICLKSKRRDIGPYKHLFAIEAGSIDFKKK---- 365

Query: 542 XXXXXXXXXASVNLEGLTHQQKLAFWINTYNICMMNAYLGHGINESPEMLATLMQKATIN 721
                    AS+ L  LTHQQKLAFWIN+YNICMMNAYL HGI E+PEM+ TL+QKATI 
Sbjct: 366 -------TNASL-LTRLTHQQKLAFWINSYNICMMNAYLEHGIPENPEMMPTLIQKATIT 417

Query: 722 VGGLLLNAVTIEHSILRLPYRLKLSCSKSPERNET 826
           VGG LLNA TIEH ILRLPYRLKLSCSKSPE++ET
Sbjct: 418 VGGHLLNAATIEHFILRLPYRLKLSCSKSPEKDET 452


>ref|XP_023743276.1| uncharacterized protein LOC111891456 [Lactuca sativa]
 ref|XP_023743277.1| uncharacterized protein LOC111891456 [Lactuca sativa]
 ref|XP_023743278.1| uncharacterized protein LOC111891456 [Lactuca sativa]
          Length = 596

 Score =  365 bits (938), Expect = e-120
 Identities = 195/280 (69%), Positives = 217/280 (77%), Gaps = 5/280 (1%)
 Frame = +2

Query: 2    DVMKSDASNGDDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPLVKFEVSEKCIPP 181
            D+ K D+S GDD + GKENRSS+NSNSKITN PE++ NEVQNSVKRP  K E+ EK  P 
Sbjct: 189  DLKKLDSSIGDDGVLGKENRSSSNSNSKITNSPEKITNEVQNSVKRPQNKPEIVEKRTPS 248

Query: 182  GPQLQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGIFLRLSKLKAKTM 361
             PQ+  RL+DQERAQE        DR++EAESECNKISE+VL CLIGIFLRLSKLKAKTM
Sbjct: 249  KPQI--RLMDQERAQESCSSSSSSDRIMEAESECNKISENVLNCLIGIFLRLSKLKAKTM 306

Query: 362  DAEAFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEASS----XXXXAS 529
            DAEAFSNLM++ LT  DRGP FRDPYGVCLKS++RDIGPYKDLFAIE+ S        AS
Sbjct: 307  DAEAFSNLMTLDLTGVDRGPGFRDPYGVCLKSRKRDIGPYKDLFAIESGSIDFKKKKNAS 366

Query: 530  XXXXXXXXXXXXXASVNLEGL-THQQKLAFWINTYNICMMNAYLGHGINESPEMLATLMQ 706
                         ASVNLEGL THQQKLAFWIN YNICMMNAYL HG+ ESPEM+ TLMQ
Sbjct: 367  LLIRRLKLLLEKLASVNLEGLVTHQQKLAFWINVYNICMMNAYLEHGMPESPEMMPTLMQ 426

Query: 707  KATINVGGLLLNAVTIEHSILRLPYRLKLSCSKSPERNET 826
            KATIN GG LLNAVTIEH ILRLPYR KLS  KSP+++ET
Sbjct: 427  KATINTGGYLLNAVTIEHFILRLPYRFKLSYPKSPKKDET 466


>ref|XP_011039198.1| PREDICTED: uncharacterized protein LOC105135828 isoform X4 [Populus
            euphratica]
          Length = 522

 Score =  231 bits (588), Expect = 8e-69
 Identities = 141/275 (51%), Positives = 169/275 (61%), Gaps = 5/275 (1%)
 Frame = +2

Query: 14   SDASNGDDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPLVKFEVSEKCIPPGP-Q 190
            S +S  DD   GKENRS  N   K    P+++A ++   VKR   K E  EK + P   Q
Sbjct: 220  SSSSLVDDGR-GKENRSCINY-VKDKQSPDKMA-KITTPVKRTPNKRESEEKSLEPSKLQ 276

Query: 191  LQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGIFLRLSKLKAKTMDAE 370
            L+ RLV+QERAQE        DR+ E     NK++E ++KCL  IFLR+S LK K ++  
Sbjct: 277  LECRLVEQERAQESTSAYM-NDRICENNITPNKLTEDIVKCLSSIFLRMSTLKDKVVELG 335

Query: 371  AFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEASSXXXX----ASXXX 538
             FS+  ++    GDRG   RDPYG+  + K RDIG YK L+AIEASS        A    
Sbjct: 336  TFSSRATLTSPEGDRGNEIRDPYGMSAEFKIRDIGSYKHLYAIEASSIDLNRTTSALFLL 395

Query: 539  XXXXXXXXXXASVNLEGLTHQQKLAFWINTYNICMMNAYLGHGINESPEMLATLMQKATI 718
                      AS NLEGLTHQQKLAFWINTYN CMMNA L HGI E+PEM+  LMQKATI
Sbjct: 396  QRLKFLLGKLASANLEGLTHQQKLAFWINTYNSCMMNAILEHGIPETPEMVVALMQKATI 455

Query: 719  NVGGLLLNAVTIEHSILRLPYRLKLSCSKSPERNE 823
             VGG LLNA+TIEH ILRLPY LK +C K+ E NE
Sbjct: 456  TVGGHLLNAITIEHFILRLPYHLKFTCPKAVENNE 490


>ref|XP_011039196.1| PREDICTED: uncharacterized protein LOC105135828 isoform X3 [Populus
            euphratica]
          Length = 594

 Score =  231 bits (588), Expect = 4e-68
 Identities = 141/275 (51%), Positives = 169/275 (61%), Gaps = 5/275 (1%)
 Frame = +2

Query: 14   SDASNGDDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPLVKFEVSEKCIPPGP-Q 190
            S +S  DD   GKENRS  N   K    P+++A ++   VKR   K E  EK + P   Q
Sbjct: 195  SSSSLVDDGR-GKENRSCINY-VKDKQSPDKMA-KITTPVKRTPNKRESEEKSLEPSKLQ 251

Query: 191  LQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGIFLRLSKLKAKTMDAE 370
            L+ RLV+QERAQE        DR+ E     NK++E ++KCL  IFLR+S LK K ++  
Sbjct: 252  LECRLVEQERAQESTSAYM-NDRICENNITPNKLTEDIVKCLSSIFLRMSTLKDKVVELG 310

Query: 371  AFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEASSXXXX----ASXXX 538
             FS+  ++    GDRG   RDPYG+  + K RDIG YK L+AIEASS        A    
Sbjct: 311  TFSSRATLTSPEGDRGNEIRDPYGMSAEFKIRDIGSYKHLYAIEASSIDLNRTTSALFLL 370

Query: 539  XXXXXXXXXXASVNLEGLTHQQKLAFWINTYNICMMNAYLGHGINESPEMLATLMQKATI 718
                      AS NLEGLTHQQKLAFWINTYN CMMNA L HGI E+PEM+  LMQKATI
Sbjct: 371  QRLKFLLGKLASANLEGLTHQQKLAFWINTYNSCMMNAILEHGIPETPEMVVALMQKATI 430

Query: 719  NVGGLLLNAVTIEHSILRLPYRLKLSCSKSPERNE 823
             VGG LLNA+TIEH ILRLPY LK +C K+ E NE
Sbjct: 431  TVGGHLLNAITIEHFILRLPYHLKFTCPKAVENNE 465


>ref|XP_011039195.1| PREDICTED: uncharacterized protein LOC105135828 isoform X2 [Populus
            euphratica]
          Length = 618

 Score =  231 bits (588), Expect = 6e-68
 Identities = 141/275 (51%), Positives = 169/275 (61%), Gaps = 5/275 (1%)
 Frame = +2

Query: 14   SDASNGDDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPLVKFEVSEKCIPPGP-Q 190
            S +S  DD   GKENRS  N   K    P+++A ++   VKR   K E  EK + P   Q
Sbjct: 219  SSSSLVDDGR-GKENRSCINY-VKDKQSPDKMA-KITTPVKRTPNKRESEEKSLEPSKLQ 275

Query: 191  LQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGIFLRLSKLKAKTMDAE 370
            L+ RLV+QERAQE        DR+ E     NK++E ++KCL  IFLR+S LK K ++  
Sbjct: 276  LECRLVEQERAQESTSAYM-NDRICENNITPNKLTEDIVKCLSSIFLRMSTLKDKVVELG 334

Query: 371  AFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEASSXXXX----ASXXX 538
             FS+  ++    GDRG   RDPYG+  + K RDIG YK L+AIEASS        A    
Sbjct: 335  TFSSRATLTSPEGDRGNEIRDPYGMSAEFKIRDIGSYKHLYAIEASSIDLNRTTSALFLL 394

Query: 539  XXXXXXXXXXASVNLEGLTHQQKLAFWINTYNICMMNAYLGHGINESPEMLATLMQKATI 718
                      AS NLEGLTHQQKLAFWINTYN CMMNA L HGI E+PEM+  LMQKATI
Sbjct: 395  QRLKFLLGKLASANLEGLTHQQKLAFWINTYNSCMMNAILEHGIPETPEMVVALMQKATI 454

Query: 719  NVGGLLLNAVTIEHSILRLPYRLKLSCSKSPERNE 823
             VGG LLNA+TIEH ILRLPY LK +C K+ E NE
Sbjct: 455  TVGGHLLNAITIEHFILRLPYHLKFTCPKAVENNE 489


>ref|XP_011039194.1| PREDICTED: uncharacterized protein LOC105135828 isoform X1 [Populus
            euphratica]
          Length = 619

 Score =  231 bits (588), Expect = 6e-68
 Identities = 141/275 (51%), Positives = 169/275 (61%), Gaps = 5/275 (1%)
 Frame = +2

Query: 14   SDASNGDDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPLVKFEVSEKCIPPGP-Q 190
            S +S  DD   GKENRS  N   K    P+++A ++   VKR   K E  EK + P   Q
Sbjct: 220  SSSSLVDDGR-GKENRSCINY-VKDKQSPDKMA-KITTPVKRTPNKRESEEKSLEPSKLQ 276

Query: 191  LQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGIFLRLSKLKAKTMDAE 370
            L+ RLV+QERAQE        DR+ E     NK++E ++KCL  IFLR+S LK K ++  
Sbjct: 277  LECRLVEQERAQESTSAYM-NDRICENNITPNKLTEDIVKCLSSIFLRMSTLKDKVVELG 335

Query: 371  AFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEASSXXXX----ASXXX 538
             FS+  ++    GDRG   RDPYG+  + K RDIG YK L+AIEASS        A    
Sbjct: 336  TFSSRATLTSPEGDRGNEIRDPYGMSAEFKIRDIGSYKHLYAIEASSIDLNRTTSALFLL 395

Query: 539  XXXXXXXXXXASVNLEGLTHQQKLAFWINTYNICMMNAYLGHGINESPEMLATLMQKATI 718
                      AS NLEGLTHQQKLAFWINTYN CMMNA L HGI E+PEM+  LMQKATI
Sbjct: 396  QRLKFLLGKLASANLEGLTHQQKLAFWINTYNSCMMNAILEHGIPETPEMVVALMQKATI 455

Query: 719  NVGGLLLNAVTIEHSILRLPYRLKLSCSKSPERNE 823
             VGG LLNA+TIEH ILRLPY LK +C K+ E NE
Sbjct: 456  TVGGHLLNAITIEHFILRLPYHLKFTCPKAVENNE 490


>gb|PLY66546.1| hypothetical protein LSAT_4X167461 [Lactuca sativa]
          Length = 291

 Score =  217 bits (552), Expect = 4e-66
 Identities = 114/161 (70%), Positives = 124/161 (77%), Gaps = 5/161 (3%)
 Frame = +2

Query: 359 MDAEAFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEASSXXXX----A 526
           MDAEAFSNLM++ LT  DRGP FRDPYGVCLKS++RDIGPYKDLFAIE+ S        A
Sbjct: 1   MDAEAFSNLMTLDLTGVDRGPGFRDPYGVCLKSRKRDIGPYKDLFAIESGSIDFKKKKNA 60

Query: 527 SXXXXXXXXXXXXXASVNLEGL-THQQKLAFWINTYNICMMNAYLGHGINESPEMLATLM 703
           S             ASVNLEGL THQQKLAFWIN YNICMMNAYL HG+ ESPEM+ TLM
Sbjct: 61  SLLIRRLKLLLEKLASVNLEGLVTHQQKLAFWINVYNICMMNAYLEHGMPESPEMMPTLM 120

Query: 704 QKATINVGGLLLNAVTIEHSILRLPYRLKLSCSKSPERNET 826
           QKATIN GG LLNAVTIEH ILRLPYR KLS  KSP+++ET
Sbjct: 121 QKATINTGGYLLNAVTIEHFILRLPYRFKLSYPKSPKKDET 161


>ref|XP_006380714.1| hypothetical protein POPTR_0007s11280g [Populus trichocarpa]
 gb|PNT27011.1| hypothetical protein POPTR_007G041100v3 [Populus trichocarpa]
 gb|PNT27015.1| hypothetical protein POPTR_007G041100v3 [Populus trichocarpa]
          Length = 618

 Score =  226 bits (575), Expect = 5e-66
 Identities = 137/275 (49%), Positives = 169/275 (61%), Gaps = 5/275 (1%)
 Frame = +2

Query: 14   SDASNGDDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPLVKFEVSEKCIPPGP-Q 190
            S +S  DD   GKENRS  N   K    P+++A ++   VKR   K E  EK + P   Q
Sbjct: 219  SSSSLVDDGR-GKENRSCINY-VKDKQSPDKMA-KITTPVKRTPNKRESEEKSLEPSKLQ 275

Query: 191  LQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGIFLRLSKLKAKTMDAE 370
            L+ RL++QERAQE        DR+ E     NK++E ++KCL  IFLR+S LK K ++  
Sbjct: 276  LECRLIEQERAQESTSAYM-NDRICENNITPNKLTEDIVKCLSSIFLRMSTLKDKVVELG 334

Query: 371  AFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEASSXXXX----ASXXX 538
             FS+  ++    GDRG   RDPYG+  + K RDIG YK L+AIEASS        A    
Sbjct: 335  TFSSRATLTSPEGDRGNEIRDPYGMSAEFKIRDIGSYKHLYAIEASSIDLNRTTSALFLL 394

Query: 539  XXXXXXXXXXASVNLEGLTHQQKLAFWINTYNICMMNAYLGHGINESPEMLATLMQKATI 718
                      A+ NLEGLTHQQKLAFWINTYN CMMNA L HGI E+PEM+  LMQKATI
Sbjct: 395  QRLRFLLGKLAAANLEGLTHQQKLAFWINTYNSCMMNAILEHGIPETPEMVVALMQKATI 454

Query: 719  NVGGLLLNAVTIEHSILRLPYRLKLSCSKSPERNE 823
             VGG LLNA+TIEH ILRLPY LK +C K+ + +E
Sbjct: 455  TVGGHLLNAITIEHFILRLPYHLKFTCPKAVKNDE 489


>ref|XP_006380715.1| hypothetical protein POPTR_0007s11280g [Populus trichocarpa]
 ref|XP_006380716.1| hypothetical protein POPTR_0007s11280g [Populus trichocarpa]
 ref|XP_002310156.2| hypothetical protein POPTR_0007s11280g [Populus trichocarpa]
 ref|XP_006380717.1| hypothetical protein POPTR_0007s11280g [Populus trichocarpa]
 gb|PNT27010.1| hypothetical protein POPTR_007G041100v3 [Populus trichocarpa]
 gb|PNT27013.1| hypothetical protein POPTR_007G041100v3 [Populus trichocarpa]
 gb|PNT27014.1| hypothetical protein POPTR_007G041100v3 [Populus trichocarpa]
          Length = 619

 Score =  226 bits (575), Expect = 5e-66
 Identities = 137/275 (49%), Positives = 169/275 (61%), Gaps = 5/275 (1%)
 Frame = +2

Query: 14   SDASNGDDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPLVKFEVSEKCIPPGP-Q 190
            S +S  DD   GKENRS  N   K    P+++A ++   VKR   K E  EK + P   Q
Sbjct: 220  SSSSLVDDGR-GKENRSCINY-VKDKQSPDKMA-KITTPVKRTPNKRESEEKSLEPSKLQ 276

Query: 191  LQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGIFLRLSKLKAKTMDAE 370
            L+ RL++QERAQE        DR+ E     NK++E ++KCL  IFLR+S LK K ++  
Sbjct: 277  LECRLIEQERAQESTSAYM-NDRICENNITPNKLTEDIVKCLSSIFLRMSTLKDKVVELG 335

Query: 371  AFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEASSXXXX----ASXXX 538
             FS+  ++    GDRG   RDPYG+  + K RDIG YK L+AIEASS        A    
Sbjct: 336  TFSSRATLTSPEGDRGNEIRDPYGMSAEFKIRDIGSYKHLYAIEASSIDLNRTTSALFLL 395

Query: 539  XXXXXXXXXXASVNLEGLTHQQKLAFWINTYNICMMNAYLGHGINESPEMLATLMQKATI 718
                      A+ NLEGLTHQQKLAFWINTYN CMMNA L HGI E+PEM+  LMQKATI
Sbjct: 396  QRLRFLLGKLAAANLEGLTHQQKLAFWINTYNSCMMNAILEHGIPETPEMVVALMQKATI 455

Query: 719  NVGGLLLNAVTIEHSILRLPYRLKLSCSKSPERNE 823
             VGG LLNA+TIEH ILRLPY LK +C K+ + +E
Sbjct: 456  TVGGHLLNAITIEHFILRLPYHLKFTCPKAVKNDE 490


>ref|XP_018850501.1| PREDICTED: uncharacterized protein LOC109013035 isoform X2 [Juglans
           regia]
          Length = 589

 Score =  225 bits (573), Expect = 6e-66
 Identities = 135/264 (51%), Positives = 168/264 (63%), Gaps = 5/264 (1%)
 Frame = +2

Query: 47  GKENRSSTNSNSKITNPPERVANEVQNSVKRPLVKFEVSEKCIPP-GPQLQNRLVDQERA 223
           GKEN   +NS  K    PE+   +V  S+KR  +K E  EKC  P   +L++RLV+ ERA
Sbjct: 203 GKENLLCSNS-VKDKQSPEKKYAKVVASMKRAPIKHESMEKCADPFRSKLESRLVE-ERA 260

Query: 224 QEXXXXXXXGDRVLEAESECNKISESVLKCLIGIFLRLSKLKAKTMDAEAFSNLMSVGLT 403
           +E       GDRVLEA+S  NK+SE ++KCL GIF R+S LK K  +  AF    SV   
Sbjct: 261 EESSSGSS-GDRVLEADSTPNKVSEDIVKCLSGIFARMSTLKDKVAELGAFQ---SVASH 316

Query: 404 SGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEASSXXXX----ASXXXXXXXXXXXXXA 571
           + +R   FRDPY +C + + RDIGPYK L +IEASS        A              A
Sbjct: 317 ASNRETEFRDPYDICPEYRNRDIGPYKHLCSIEASSIDLNRTTNALFLIHRLKLLFGKLA 376

Query: 572 SVNLEGLTHQQKLAFWINTYNICMMNAYLGHGINESPEMLATLMQKATINVGGLLLNAVT 751
           +VNLEGLT QQKLAFWIN+YN CMMNA+L HGI  +PEM+  LMQKATI VGG LLNA+T
Sbjct: 377 TVNLEGLTQQQKLAFWINSYNSCMMNAFLEHGIPNTPEMVVALMQKATIVVGGHLLNAIT 436

Query: 752 IEHSILRLPYRLKLSCSKSPERNE 823
           IEH ILRLPY LK +C+K+ + +E
Sbjct: 437 IEHFILRLPYHLKFTCAKTAKNDE 460


>ref|XP_023875256.1| uncharacterized protein LOC111987747 isoform X2 [Quercus suber]
 gb|POE82600.1| hypothetical protein CFP56_46449 [Quercus suber]
          Length = 610

 Score =  224 bits (572), Expect = 1e-65
 Identities = 134/275 (48%), Positives = 170/275 (61%), Gaps = 9/275 (3%)
 Frame = +2

Query: 26   NGDDSLP----GKENRSSTNSNSKITNPPERVANEVQNSVKRPLVKFEVSEKCIPP-GPQ 190
            N   S P    GKENR  TNS  K    PE+  +++   VK+P +K E  EKC+     Q
Sbjct: 212  NSSPSFPEDGRGKENRLCTNS-VKDKPSPEKKYSKIATPVKKPPIKRESVEKCVDNFKSQ 270

Query: 191  LQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGIFLRLSKLKAKTMDAE 370
            L++RLVDQE+AQE        D +LEA+S  NK+SE+++KCL  IF+R+S +K K + + 
Sbjct: 271  LESRLVDQEKAQESSSGSSD-DSLLEADSNPNKVSENIVKCLSSIFMRISTVKDKVVHSG 329

Query: 371  AFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEASSXXXX----ASXXX 538
               +  S    + +R   F DPY +C + + RDIGPYK L+AIEASS        A    
Sbjct: 330  TSQSATS---HASNRETEFGDPYDICSEFRTRDIGPYKHLYAIEASSIDLNRKTNALFLI 386

Query: 539  XXXXXXXXXXASVNLEGLTHQQKLAFWINTYNICMMNAYLGHGINESPEMLATLMQKATI 718
                      A+VNLEGLTHQQKLAFWIN YN CMMNA L HGI E PEM+  LMQKATI
Sbjct: 387  QRLKFLFGRLAAVNLEGLTHQQKLAFWINIYNSCMMNAILEHGIPEMPEMVVALMQKATI 446

Query: 719  NVGGLLLNAVTIEHSILRLPYRLKLSCSKSPERNE 823
             VGG LL A+TIEH ILRLPY LK +C+K+ + +E
Sbjct: 447  VVGGHLLYAITIEHFILRLPYHLKFACAKAAKNDE 481


>ref|XP_017980875.1| PREDICTED: uncharacterized protein LOC18611093 isoform X3 [Theobroma
            cacao]
          Length = 590

 Score =  223 bits (568), Expect = 3e-65
 Identities = 133/276 (48%), Positives = 170/276 (61%), Gaps = 5/276 (1%)
 Frame = +2

Query: 11   KSDASNGDDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPLVKFEVSEKCIPP-GP 187
            K ++++GD  + GKEN+S  N+  K    PE+   +V   VKR   K E + KC+     
Sbjct: 190  KLNSASGD--VRGKENQSFANA-VKDKQSPEKKITKVVTPVKRLPTKHESANKCLDALKS 246

Query: 188  QLQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGIFLRLSKLKAKTMDA 367
            QL  RLVDQERAQE        D+V EA+S  NKISE  ++CL  IF+RLS LK +++++
Sbjct: 247  QLDGRLVDQERAQESPSGSSD-DKVSEADSTPNKISEDTVRCLCSIFVRLSTLKDRSVES 305

Query: 368  EAFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEAS----SXXXXASXX 535
                +  +       R   F+DPYG+C  SK RDIGPYK+L  IEA+    S    A   
Sbjct: 306  GILPSQSAANSYEISRESEFQDPYGICSDSKTRDIGPYKNLCTIEANTVDLSRRMNALFL 365

Query: 536  XXXXXXXXXXXASVNLEGLTHQQKLAFWINTYNICMMNAYLGHGINESPEMLATLMQKAT 715
                        SVNL+GL+HQQKLAFWINTYN CMMNA L HGI E+PE +  LMQKAT
Sbjct: 366  IHRLKFLLGKLTSVNLDGLSHQQKLAFWINTYNSCMMNAILEHGIPETPESVVGLMQKAT 425

Query: 716  INVGGLLLNAVTIEHSILRLPYRLKLSCSKSPERNE 823
            I VGG LLNA+TIEH ILRLP+ LK +CSK+ + +E
Sbjct: 426  IVVGGHLLNAITIEHFILRLPFHLKFTCSKAAKNDE 461


>ref|XP_017980873.1| PREDICTED: uncharacterized protein LOC18611093 isoform X2 [Theobroma
            cacao]
          Length = 615

 Score =  223 bits (568), Expect = 5e-65
 Identities = 133/276 (48%), Positives = 170/276 (61%), Gaps = 5/276 (1%)
 Frame = +2

Query: 11   KSDASNGDDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPLVKFEVSEKCIPP-GP 187
            K ++++GD  + GKEN+S  N+  K    PE+   +V   VKR   K E + KC+     
Sbjct: 215  KLNSASGD--VRGKENQSFANA-VKDKQSPEKKITKVVTPVKRLPTKHESANKCLDALKS 271

Query: 188  QLQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGIFLRLSKLKAKTMDA 367
            QL  RLVDQERAQE        D+V EA+S  NKISE  ++CL  IF+RLS LK +++++
Sbjct: 272  QLDGRLVDQERAQESPSGSSD-DKVSEADSTPNKISEDTVRCLCSIFVRLSTLKDRSVES 330

Query: 368  EAFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEAS----SXXXXASXX 535
                +  +       R   F+DPYG+C  SK RDIGPYK+L  IEA+    S    A   
Sbjct: 331  GILPSQSAANSYEISRESEFQDPYGICSDSKTRDIGPYKNLCTIEANTVDLSRRMNALFL 390

Query: 536  XXXXXXXXXXXASVNLEGLTHQQKLAFWINTYNICMMNAYLGHGINESPEMLATLMQKAT 715
                        SVNL+GL+HQQKLAFWINTYN CMMNA L HGI E+PE +  LMQKAT
Sbjct: 391  IHRLKFLLGKLTSVNLDGLSHQQKLAFWINTYNSCMMNAILEHGIPETPESVVGLMQKAT 450

Query: 716  INVGGLLLNAVTIEHSILRLPYRLKLSCSKSPERNE 823
            I VGG LLNA+TIEH ILRLP+ LK +CSK+ + +E
Sbjct: 451  IVVGGHLLNAITIEHFILRLPFHLKFTCSKAAKNDE 486


>ref|XP_007047200.1| PREDICTED: uncharacterized protein LOC18611093 isoform X1 [Theobroma
            cacao]
 ref|XP_017980870.1| PREDICTED: uncharacterized protein LOC18611093 isoform X1 [Theobroma
            cacao]
 gb|EOX91357.1| Uncharacterized protein TCM_000576 isoform 1 [Theobroma cacao]
 gb|EOX91358.1| Uncharacterized protein TCM_000576 isoform 1 [Theobroma cacao]
          Length = 616

 Score =  223 bits (568), Expect = 5e-65
 Identities = 133/276 (48%), Positives = 170/276 (61%), Gaps = 5/276 (1%)
 Frame = +2

Query: 11   KSDASNGDDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPLVKFEVSEKCIPP-GP 187
            K ++++GD  + GKEN+S  N+  K    PE+   +V   VKR   K E + KC+     
Sbjct: 216  KLNSASGD--VRGKENQSFANA-VKDKQSPEKKITKVVTPVKRLPTKHESANKCLDALKS 272

Query: 188  QLQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGIFLRLSKLKAKTMDA 367
            QL  RLVDQERAQE        D+V EA+S  NKISE  ++CL  IF+RLS LK +++++
Sbjct: 273  QLDGRLVDQERAQESPSGSSD-DKVSEADSTPNKISEDTVRCLCSIFVRLSTLKDRSVES 331

Query: 368  EAFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEAS----SXXXXASXX 535
                +  +       R   F+DPYG+C  SK RDIGPYK+L  IEA+    S    A   
Sbjct: 332  GILPSQSAANSYEISRESEFQDPYGICSDSKTRDIGPYKNLCTIEANTVDLSRRMNALFL 391

Query: 536  XXXXXXXXXXXASVNLEGLTHQQKLAFWINTYNICMMNAYLGHGINESPEMLATLMQKAT 715
                        SVNL+GL+HQQKLAFWINTYN CMMNA L HGI E+PE +  LMQKAT
Sbjct: 392  IHRLKFLLGKLTSVNLDGLSHQQKLAFWINTYNSCMMNAILEHGIPETPESVVGLMQKAT 451

Query: 716  INVGGLLLNAVTIEHSILRLPYRLKLSCSKSPERNE 823
            I VGG LLNA+TIEH ILRLP+ LK +CSK+ + +E
Sbjct: 452  IVVGGHLLNAITIEHFILRLPFHLKFTCSKAAKNDE 487


>ref|XP_003635502.2| PREDICTED: uncharacterized protein LOC100855363 [Vitis vinifera]
          Length = 593

 Score =  221 bits (564), Expect = 1e-64
 Identities = 130/269 (48%), Positives = 166/269 (61%), Gaps = 5/269 (1%)
 Frame = +2

Query: 32  DDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPLVKFEVSEKCIPPGP-QLQNRLV 208
           ++   GKENR+  NS SK    P++ +  V   VK+  +K E  EK   P   QL+ RLV
Sbjct: 198 EEDRQGKENRTCMNS-SKNKQSPDKKSPRVITLVKKSPIKHEPVEKFRDPLKLQLECRLV 256

Query: 209 DQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGIFLRLSKLKAKTMDAEAFSNLM 388
           DQERAQE        +RV EA+S  NKISE ++KCL  IFLR+S L+ K ++++A    +
Sbjct: 257 DQERAQESSCASLD-ERVSEADSGPNKISEDIVKCLSSIFLRMSTLREKVVESDATPPPL 315

Query: 389 SVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEASSXXXX----ASXXXXXXXXX 556
           +      +      DPYG+CL+   R++GPYK L  I+A S        A          
Sbjct: 316 AFASNESNGEAESLDPYGICLEFGARNVGPYKHLCDIQAGSVDLNRKTNALFLIHRLKLL 375

Query: 557 XXXXASVNLEGLTHQQKLAFWINTYNICMMNAYLGHGINESPEMLATLMQKATINVGGLL 736
               A VNLEGLTHQQKLAFWIN YN CMMNA+L HG+ E+PEM+  LMQKATINVGG L
Sbjct: 376 LGKLACVNLEGLTHQQKLAFWINIYNSCMMNAFLEHGVPENPEMVVALMQKATINVGGCL 435

Query: 737 LNAVTIEHSILRLPYRLKLSCSKSPERNE 823
           LNA+TIEH ILRLPY LK +CSK+ + +E
Sbjct: 436 LNAITIEHFILRLPYHLKYTCSKAAKTDE 464


>ref|XP_002268917.3| PREDICTED: uncharacterized protein LOC100256691 [Vitis vinifera]
          Length = 616

 Score =  222 bits (565), Expect = 1e-64
 Identities = 130/269 (48%), Positives = 166/269 (61%), Gaps = 5/269 (1%)
 Frame = +2

Query: 32   DDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPLVKFEVSEKCIPPGP-QLQNRLV 208
            ++   GKENR+  NS SK    P++ +  V   VK+  +K E  EK   P   QL+ RLV
Sbjct: 221  EEDRQGKENRTCMNS-SKNKQSPDKKSPRVITPVKKSPIKHEPVEKFRDPLKLQLECRLV 279

Query: 209  DQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGIFLRLSKLKAKTMDAEAFSNLM 388
            DQERAQE        +RV EA+S  NKISE ++KCL  IFLR+S L+ K ++++A    +
Sbjct: 280  DQERAQESSCASLD-ERVSEADSGPNKISEDIVKCLSSIFLRMSTLREKVVESDATPPPL 338

Query: 389  SVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEASSXXXX----ASXXXXXXXXX 556
            +      +      DPYG+CL+   R++GPYK L  I+A S        A          
Sbjct: 339  AFASNESNGEAESLDPYGICLEFGARNVGPYKHLCDIQAGSVDLNRKTNALFLIHRLKLL 398

Query: 557  XXXXASVNLEGLTHQQKLAFWINTYNICMMNAYLGHGINESPEMLATLMQKATINVGGLL 736
                A VNLEGLTHQQKLAFWIN YN CMMNA+L HG+ E+PEM+  LMQKATINVGG L
Sbjct: 399  LGKLACVNLEGLTHQQKLAFWINIYNSCMMNAFLEHGVPENPEMVVALMQKATINVGGCL 458

Query: 737  LNAVTIEHSILRLPYRLKLSCSKSPERNE 823
            LNA+TIEH ILRLPY LK +CSK+ + +E
Sbjct: 459  LNAITIEHFILRLPYHLKYTCSKAAKTDE 487


>ref|XP_018850503.1| PREDICTED: uncharacterized protein LOC109013035 isoform X4 [Juglans
           regia]
          Length = 563

 Score =  219 bits (558), Expect = 5e-64
 Identities = 135/268 (50%), Positives = 168/268 (62%), Gaps = 9/268 (3%)
 Frame = +2

Query: 47  GKENRSSTNSNSKITNPPERVANEVQNSVKRPLVKFEVSEKCIPP-GPQLQNRLVDQERA 223
           GKEN   +NS  K    PE+   +V  S+KR  +K E  EKC  P   +L++RLV+ ERA
Sbjct: 173 GKENLLCSNS-VKDKQSPEKKYAKVVASMKRAPIKHESMEKCADPFRSKLESRLVE-ERA 230

Query: 224 QEXXXXXXXGDRVLEAESECNKISESVLKCLIGIFLRLSKLKAKTMDAEAFSNLMSVGLT 403
           +E       GDRVLEA+S  NK+SE ++KCL GIF R+S LK K  +  AF    SV   
Sbjct: 231 EESSSGSS-GDRVLEADSTPNKVSEDIVKCLSGIFARMSTLKDKVAELGAFQ---SVASH 286

Query: 404 SGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEASSXXXX----ASXXXXXXXXXXXXXA 571
           + +R   FRDPY +C + + RDIGPYK L +IEASS        A              A
Sbjct: 287 ASNRETEFRDPYDICPEYRNRDIGPYKHLCSIEASSIDLNRTTNALFLIHRLKLLFGKLA 346

Query: 572 SVNLEGLTHQQKLAFWINTYNICMMNAYLGHGINESPEMLATLMQK----ATINVGGLLL 739
           +VNLEGLT QQKLAFWIN+YN CMMNA+L HGI  +PEM+  LMQK    ATI VGG LL
Sbjct: 347 TVNLEGLTQQQKLAFWINSYNSCMMNAFLEHGIPNTPEMVVALMQKCLPQATIVVGGHLL 406

Query: 740 NAVTIEHSILRLPYRLKLSCSKSPERNE 823
           NA+TIEH ILRLPY LK +C+K+ + +E
Sbjct: 407 NAITIEHFILRLPYHLKFTCAKTAKNDE 434


>ref|XP_018850500.1| PREDICTED: uncharacterized protein LOC109013035 isoform X1 [Juglans
           regia]
          Length = 593

 Score =  219 bits (558), Expect = 1e-63
 Identities = 135/268 (50%), Positives = 168/268 (62%), Gaps = 9/268 (3%)
 Frame = +2

Query: 47  GKENRSSTNSNSKITNPPERVANEVQNSVKRPLVKFEVSEKCIPP-GPQLQNRLVDQERA 223
           GKEN   +NS  K    PE+   +V  S+KR  +K E  EKC  P   +L++RLV+ ERA
Sbjct: 203 GKENLLCSNS-VKDKQSPEKKYAKVVASMKRAPIKHESMEKCADPFRSKLESRLVE-ERA 260

Query: 224 QEXXXXXXXGDRVLEAESECNKISESVLKCLIGIFLRLSKLKAKTMDAEAFSNLMSVGLT 403
           +E       GDRVLEA+S  NK+SE ++KCL GIF R+S LK K  +  AF    SV   
Sbjct: 261 EESSSGSS-GDRVLEADSTPNKVSEDIVKCLSGIFARMSTLKDKVAELGAFQ---SVASH 316

Query: 404 SGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEASSXXXX----ASXXXXXXXXXXXXXA 571
           + +R   FRDPY +C + + RDIGPYK L +IEASS        A              A
Sbjct: 317 ASNRETEFRDPYDICPEYRNRDIGPYKHLCSIEASSIDLNRTTNALFLIHRLKLLFGKLA 376

Query: 572 SVNLEGLTHQQKLAFWINTYNICMMNAYLGHGINESPEMLATLMQK----ATINVGGLLL 739
           +VNLEGLT QQKLAFWIN+YN CMMNA+L HGI  +PEM+  LMQK    ATI VGG LL
Sbjct: 377 TVNLEGLTQQQKLAFWINSYNSCMMNAFLEHGIPNTPEMVVALMQKCLPQATIVVGGHLL 436

Query: 740 NAVTIEHSILRLPYRLKLSCSKSPERNE 823
           NA+TIEH ILRLPY LK +C+K+ + +E
Sbjct: 437 NAITIEHFILRLPYHLKFTCAKTAKNDE 464


>ref|XP_023875257.1| uncharacterized protein LOC111987747 isoform X3 [Quercus suber]
          Length = 596

 Score =  218 bits (556), Expect = 2e-63
 Identities = 134/280 (47%), Positives = 170/280 (60%), Gaps = 14/280 (5%)
 Frame = +2

Query: 26   NGDDSLP----GKENRSSTNSNSKITNPPERVANEVQNSVKRPLVKFEVSEKCIPP-GPQ 190
            N   S P    GKENR  TNS  K    PE+  +++   VK+P +K E  EKC+     Q
Sbjct: 193  NSSPSFPEDGRGKENRLCTNS-VKDKPSPEKKYSKIATPVKKPPIKRESVEKCVDNFKSQ 251

Query: 191  LQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGIFLRLSKLKAKTMDAE 370
            L++RLVDQE+AQE        D +LEA+S  NK+SE+++KCL  IF+R+S +K K + + 
Sbjct: 252  LESRLVDQEKAQESSSGSSD-DSLLEADSNPNKVSENIVKCLSSIFMRISTVKDKVVHSG 310

Query: 371  AFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEASSXXXX----ASXXX 538
               +  S    + +R   F DPY +C + + RDIGPYK L+AIEASS        A    
Sbjct: 311  TSQSATS---HASNRETEFGDPYDICSEFRTRDIGPYKHLYAIEASSIDLNRKTNALFLI 367

Query: 539  XXXXXXXXXXASVNLEGLTHQQKLAFWINTYNICMMNAYLGHGINESPEMLATLMQK--- 709
                      A+VNLEGLTHQQKLAFWIN YN CMMNA L HGI E PEM+  LMQK   
Sbjct: 368  QRLKFLFGRLAAVNLEGLTHQQKLAFWINIYNSCMMNAILEHGIPEMPEMVVALMQKWCL 427

Query: 710  --ATINVGGLLLNAVTIEHSILRLPYRLKLSCSKSPERNE 823
              ATI VGG LL A+TIEH ILRLPY LK +C+K+ + +E
Sbjct: 428  PQATIVVGGHLLYAITIEHFILRLPYHLKFACAKAAKNDE 467

