BLASTX nr result

ID: Chrysanthemum21_contig00038754 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00038754
         (1539 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KVH92524.1| protein of unknown function DUF547 [Cynara cardun...   481   e-162
ref|XP_022008785.1| uncharacterized protein LOC110908193 [Helian...   474   e-159
ref|XP_023743276.1| uncharacterized protein LOC111891456 [Lactuc...   468   e-156
ref|XP_011039198.1| PREDICTED: uncharacterized protein LOC105135...   275   9e-83
ref|XP_018850501.1| PREDICTED: uncharacterized protein LOC109013...   276   2e-82
ref|XP_011039195.1| PREDICTED: uncharacterized protein LOC105135...   275   1e-81
ref|XP_011039194.1| PREDICTED: uncharacterized protein LOC105135...   275   1e-81
ref|XP_011039196.1| PREDICTED: uncharacterized protein LOC105135...   275   1e-81
ref|XP_022740042.1| uncharacterized protein LOC111292091 isoform...   269   3e-80
ref|XP_018850500.1| PREDICTED: uncharacterized protein LOC109013...   271   3e-80
ref|XP_017980873.1| PREDICTED: uncharacterized protein LOC186110...   270   8e-80
ref|XP_007047200.1| PREDICTED: uncharacterized protein LOC186110...   270   8e-80
ref|XP_023875256.1| uncharacterized protein LOC111987747 isoform...   270   1e-79
ref|XP_017980875.1| PREDICTED: uncharacterized protein LOC186110...   269   2e-79
ref|XP_018850503.1| PREDICTED: uncharacterized protein LOC109013...   268   2e-79
ref|XP_022740041.1| uncharacterized protein LOC111292091 isoform...   269   3e-79
ref|XP_022740038.1| uncharacterized protein LOC111292091 isoform...   269   3e-79
ref|XP_022740037.1| uncharacterized protein LOC111292091 isoform...   269   3e-79
ref|XP_022740039.1| uncharacterized protein LOC111292091 isoform...   268   8e-79
ref|XP_003635502.2| PREDICTED: uncharacterized protein LOC100855...   264   1e-77

>gb|KVH92524.1| protein of unknown function DUF547 [Cynara cardunculus var. scolymus]
          Length = 580

 Score =  481 bits (1239), Expect = e-162
 Identities = 270/457 (59%), Positives = 307/457 (67%), Gaps = 3/457 (0%)
 Frame = +1

Query: 178  MDIERCKRSKSAGKPVVV--ITRRERXXXXXXXXXXXXXXXRQEENVHRALERAFSRRLG 351
            M+I+ CK S+  GKP++   +TRRER               RQEENVH+ALERAFSRRLG
Sbjct: 1    MEIQGCK-SRLTGKPILNRRLTRRERKLALLEDVDKLKKKLRQEENVHKALERAFSRRLG 59

Query: 352  TLPHLPPYLPPQTXXXXXXXXXXXXXXXXXXXXXXSFKQGLNQEPVPVSSKTNIGNLDNK 531
            TLPHLPPYLPPQT                      SF+QGL QE V +SS+ +IG+ DN 
Sbjct: 60   TLPHLPPYLPPQTLELLAEVAVLEEEVVRLEEEVVSFRQGLYQEAVYLSSRISIGDSDNN 119

Query: 532  -LIEQSVKPTMSKIDESTPLMSPVEVQSEASKVAAQKPXXXXXXXXXXXXXXXXXXGNDV 708
              +EQS K T SK D     MSPVEV +E+S+ + QKP                  G+D 
Sbjct: 120  NSLEQSSKSTTSKHDALRSSMSPVEVDAESSQESQQKPLNLLSRSASSRMTYSHQIGSDF 179

Query: 709  LNRVVDTKPADVMKSDASNGDDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPLVK 888
            LNR V+ K  D+ K D+S GDD + GKENRSS+NSNSK+ N PER+AN+VQNSVKR  VK
Sbjct: 180  LNRWVERKHTDLKKPDSSVGDDGVLGKENRSSSNSNSKLKNSPERIANKVQNSVKRIPVK 239

Query: 889  FEVSEKCIPPGPQLQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGIFL 1068
               +EK  PP  QLQNRL DQERAQE       GDR+LEAESECNKISE+ LKCLIGIFL
Sbjct: 240  SVTAEKRTPPKLQLQNRLADQERAQESCSSSSSGDRMLEAESECNKISENALKCLIGIFL 299

Query: 1069 RLSKLKAKTMDAEAFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEASS 1248
            RLSKLKAKTMDAEAFSNLMS+ LT GDRGP+FRDPYG+CLKSKRRDIGPYK LFAIEA S
Sbjct: 300  RLSKLKAKTMDAEAFSNLMSLDLTGGDRGPAFRDPYGICLKSKRRDIGPYKHLFAIEAGS 359

Query: 1249 IDFXXXXXXXXXXXXXXXXXXXXXXXXXXGLTHQQKLAFWINTYNICMMNAYLGHGINES 1428
            IDF                           LTHQQKLAFWIN+YNICMMNAYL HGI E+
Sbjct: 360  IDFKKKTNASLLTR----------------LTHQQKLAFWINSYNICMMNAYLEHGIPEN 403

Query: 1429 PEMLATLMQKATINVGGLLLNAVTVEHSILRLPYRLK 1539
            PEM+ TL+QKATI VGG LLNA T+EH ILRLPYRLK
Sbjct: 404  PEMMPTLIQKATITVGGHLLNAATIEHFILRLPYRLK 440


>ref|XP_022008785.1| uncharacterized protein LOC110908193 [Helianthus annuus]
 ref|XP_022008786.1| uncharacterized protein LOC110908193 [Helianthus annuus]
 gb|OTF97072.1| Protein of unknown function, DUF547 [Helianthus annuus]
          Length = 586

 Score =  474 bits (1220), Expect = e-159
 Identities = 266/459 (57%), Positives = 304/459 (66%), Gaps = 5/459 (1%)
 Frame = +1

Query: 178  MDIERCKRSKSAGKPVVV--ITRRERXXXXXXXXXXXXXXXRQEENVHRALERAFSRRLG 351
            M++ RCK SK+AGKP++   +TR ER               RQEENVHRALERAFSRRLG
Sbjct: 1    MEVPRCK-SKAAGKPILNRRLTRTERKLALLEDVDKLKKKLRQEENVHRALERAFSRRLG 59

Query: 352  TLPHLPPYLPPQTXXXXXXXXXXXXXXXXXXXXXXSFKQGLNQEPVPVSSKTNIGNLDNK 531
            TLPHLPPYLPPQT                      SF+ GL QE V +SS+ NI + +N 
Sbjct: 60   TLPHLPPYLPPQTLELLAEVAVLEEEVVRLEEEVVSFRHGLYQEAVGLSSRININSDNNS 119

Query: 532  L--IEQSVKPTMSKIDESTPLMSPVEVQSEASKVAAQK-PXXXXXXXXXXXXXXXXXXGN 702
            L  +EQS K               VE+ SE+SK++ QK P                  G 
Sbjct: 120  LEPLEQSAKTGDGS----------VEMASESSKLSQQKVPLNSLSRSASSRLSYSNRIGT 169

Query: 703  DVLNRVVDTKPADVMKSDASNGDDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPL 882
            DVLNRVV+ KPAD+ KSD+   DD + GKENRSS NSNSK+ + PERV  E+Q+ V++P 
Sbjct: 170  DVLNRVVEIKPADLKKSDSVISDDGVLGKENRSSANSNSKVKSSPERVPEEMQSLVRKPQ 229

Query: 883  VKFEVSEKCIPPGPQLQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGI 1062
            +KFE + KC PP PQ  NRL DQE+AQE       GDR++EAESECNKISE+VLKCLIGI
Sbjct: 230  IKFETAGKCSPPKPQ--NRLADQEKAQESCSSSSSGDRIVEAESECNKISENVLKCLIGI 287

Query: 1063 FLRLSKLKAKTMDAEAFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEA 1242
            FLRLSKLKAKTMDAEAFSNLMSV LT GD+GP+FRDPYGVCLKSKRRDIGPYKDLFAIEA
Sbjct: 288  FLRLSKLKAKTMDAEAFSNLMSVDLTGGDQGPAFRDPYGVCLKSKRRDIGPYKDLFAIEA 347

Query: 1243 SSIDFXXXXXXXXXXXXXXXXXXXXXXXXXXGLTHQQKLAFWINTYNICMMNAYLGHGIN 1422
             SIDF                          GLTHQQKLAFWINTYNICMMNAYL HGI 
Sbjct: 348  GSIDFKKKMNASLLIRRLKLLLEKLASVNLEGLTHQQKLAFWINTYNICMMNAYLEHGIP 407

Query: 1423 ESPEMLATLMQKATINVGGLLLNAVTVEHSILRLPYRLK 1539
            ESPEM+ TLMQKATIN GG LLNA+++EH ILRLP RLK
Sbjct: 408  ESPEMMPTLMQKATINAGGYLLNAISIEHFILRLPNRLK 446


>ref|XP_023743276.1| uncharacterized protein LOC111891456 [Lactuca sativa]
 ref|XP_023743277.1| uncharacterized protein LOC111891456 [Lactuca sativa]
 ref|XP_023743278.1| uncharacterized protein LOC111891456 [Lactuca sativa]
          Length = 596

 Score =  468 bits (1203), Expect = e-156
 Identities = 267/459 (58%), Positives = 303/459 (66%), Gaps = 5/459 (1%)
 Frame = +1

Query: 178  MDIERCKRSKSAGKPVVV--ITRRERXXXXXXXXXXXXXXXRQEENVHRALERAFSRRLG 351
            M+I+R K S   GKP++   +TRRER               RQEENVHRALERAFSRRLG
Sbjct: 1    MEIQRHK-STPPGKPILNRRLTRRERKLALLEDVDKLKKKLRQEENVHRALERAFSRRLG 59

Query: 352  TLPHLPPYLPPQTXXXXXXXXXXXXXXXXXXXXXXSFKQGLNQEPVPVSSKTNIGNLDNK 531
            TLPHLPPYLPPQT                      +F+QGL QE V +SS+ +IGN DN 
Sbjct: 60   TLPHLPPYLPPQTLELLAEVAVLEEEVVRLEEEVVNFRQGLYQEAVYLSSRISIGNSDND 119

Query: 532  LIEQ--SVKPTMSKIDESTPLMSPVEVQSEASKVAAQKPXXXXXXXXXXXXXXXXXXGND 705
              E+  S   T  + DES   +S +EV+SEASK    KP                  G+D
Sbjct: 120  PSEEQSSKTTTFKQNDESGSSISQIEVESEASKPPQHKPLNSLLRSGSCRSSYSHRIGSD 179

Query: 706  VLNRVVDTKPADVMKSDASNGDDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPLV 885
             LNRVVDT  AD+ K D+S GDD + GKENRSS+NSNSKITN PE++ NEVQNSVKRP  
Sbjct: 180  FLNRVVDT--ADLKKLDSSIGDDGVLGKENRSSSNSNSKITNSPEKITNEVQNSVKRPQN 237

Query: 886  KFEVSEKCIPPGPQLQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGIF 1065
            K E+ EK  P  PQ+  RL+DQERAQE        DR++EAESECNKISE+VL CLIGIF
Sbjct: 238  KPEIVEKRTPSKPQI--RLMDQERAQESCSSSSSSDRIMEAESECNKISENVLNCLIGIF 295

Query: 1066 LRLSKLKAKTMDAEAFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEAS 1245
            LRLSKLKAKTMDAEAFSNLM++ LT  DRGP FRDPYGVCLKS++RDIGPYKDLFAIE+ 
Sbjct: 296  LRLSKLKAKTMDAEAFSNLMTLDLTGVDRGPGFRDPYGVCLKSRKRDIGPYKDLFAIESG 355

Query: 1246 SIDFXXXXXXXXXXXXXXXXXXXXXXXXXXGL-THQQKLAFWINTYNICMMNAYLGHGIN 1422
            SIDF                          GL THQQKLAFWIN YNICMMNAYL HG+ 
Sbjct: 356  SIDFKKKKNASLLIRRLKLLLEKLASVNLEGLVTHQQKLAFWINVYNICMMNAYLEHGMP 415

Query: 1423 ESPEMLATLMQKATINVGGLLLNAVTVEHSILRLPYRLK 1539
            ESPEM+ TLMQKATIN GG LLNAVT+EH ILRLPYR K
Sbjct: 416  ESPEMMPTLMQKATINTGGYLLNAVTIEHFILRLPYRFK 454


>ref|XP_011039198.1| PREDICTED: uncharacterized protein LOC105135828 isoform X4 [Populus
            euphratica]
          Length = 522

 Score =  275 bits (704), Expect = 9e-83
 Identities = 191/468 (40%), Positives = 233/468 (49%), Gaps = 13/468 (2%)
 Frame = +1

Query: 175  KMDIERCKRSKSAGKPVVVI----TRRERXXXXXXXXXXXXXXXRQEENVHRALERAFSR 342
            K  +E   RS++ G P   I      RER               R EENVHRALERAF+R
Sbjct: 23   KEKMEAQGRSRAVGGPKSAIKWRKANRERKLALLQDVDKLKKKLRHEENVHRALERAFTR 82

Query: 343  RLGTLPHLPPYLPPQTXXXXXXXXXXXXXXXXXXXXXXSFKQGLNQEPVPVSSKTNIGNL 522
             LG LP LPPYLPP                        +F+QGL QE V VSSK N+ N 
Sbjct: 83   PLGALPRLPPYLPPYILELLAEVAVLEEEVVRLEEQVVNFRQGLYQEAVYVSSKKNVEN- 141

Query: 523  DNKLIEQSVKPTMSK--------IDESTPLMSPVEVQSEASKVAAQKPXXXXXXXXXXXX 678
                I+Q    T SK        ++E+ P       Q   ++  + K             
Sbjct: 142  SKDAIDQQPSTTRSKHARSKSLSLNETNPANFAARPQPSLARCTSSKRLFSTDPIIERSG 201

Query: 679  XXXXXXGNDVLNRVVDTKPADVMKSDASNGDDSLPGKENRSSTNSNSKITNPPERVANEV 858
                   N    +    KP     S +S  DD   GKENRS  N   K    P+++A ++
Sbjct: 202  QCSNRPANR--GKYASGKP----NSSSSLVDDGR-GKENRSCINY-VKDKQSPDKMA-KI 252

Query: 859  QNSVKRPLVKFEVSEKCIPPGP-QLQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISE 1035
               VKR   K E  EK + P   QL+ RLV+QERAQE        DR+ E     NK++E
Sbjct: 253  TTPVKRTPNKRESEEKSLEPSKLQLECRLVEQERAQESTSAYM-NDRICENNITPNKLTE 311

Query: 1036 SVLKCLIGIFLRLSKLKAKTMDAEAFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGP 1215
             ++KCL  IFLR+S LK K ++   FS+  ++    GDRG   RDPYG+  + K RDIG 
Sbjct: 312  DIVKCLSSIFLRMSTLKDKVVELGTFSSRATLTSPEGDRGNEIRDPYGMSAEFKIRDIGS 371

Query: 1216 YKDLFAIEASSIDFXXXXXXXXXXXXXXXXXXXXXXXXXXGLTHQQKLAFWINTYNICMM 1395
            YK L+AIEASSID                           GLTHQQKLAFWINTYN CMM
Sbjct: 372  YKHLYAIEASSIDLNRTTSALFLLQRLKFLLGKLASANLEGLTHQQKLAFWINTYNSCMM 431

Query: 1396 NAYLGHGINESPEMLATLMQKATINVGGLLLNAVTVEHSILRLPYRLK 1539
            NA L HGI E+PEM+  LMQKATI VGG LLNA+T+EH ILRLPY LK
Sbjct: 432  NAILEHGIPETPEMVVALMQKATITVGGHLLNAITIEHFILRLPYHLK 479


>ref|XP_018850501.1| PREDICTED: uncharacterized protein LOC109013035 isoform X2 [Juglans
            regia]
          Length = 589

 Score =  276 bits (707), Expect = 2e-82
 Identities = 188/458 (41%), Positives = 234/458 (51%), Gaps = 10/458 (2%)
 Frame = +1

Query: 196  KRSKSAGKPVVVITR--RERXXXXXXXXXXXXXXXRQEENVHRALERAFSRRLGTLPHLP 369
            ++ + A KPV    R  RER               R EENVHRALERAF+R LG LP LP
Sbjct: 6    RKVRGAEKPVTNRRRSNRERKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLP 65

Query: 370  PYLPPQTXXXXXXXXXXXXXXXXXXXXXXSFKQGLNQEPVPVSSKTNIGNLDNKLIEQSV 549
            PYLPP T                      +F+QGL QE V +SSK N+ N  + +   SV
Sbjct: 66   PYLPPYTLELLAEVAVLEEEVVRLEEQVVNFRQGLYQEAVYISSKKNVENSSDAIDRNSV 125

Query: 550  KPTMSKIDESTPLMSPVEVQSEASKVAAQKPXXXXXXXXXXXXXXXXXXGNDVLNRVVDT 729
            K + +K   S    SP      A+  A  +P                       +R  + 
Sbjct: 126  KSSSTKHQRSKS--SPQNELHSATSTARPQPCLARSTSRKLSSPDTFP------DRAANC 177

Query: 730  KPADVMKSDASNGDDSLP-------GKENRSSTNSNSKITNPPERVANEVQNSVKRPLVK 888
                VM+       +S P       GKEN   +NS  K    PE+   +V  S+KR  +K
Sbjct: 178  SSVPVMEKQTLKKRNSPPSFPEDRQGKENLLCSNS-VKDKQSPEKKYAKVVASMKRAPIK 236

Query: 889  FEVSEKCIPP-GPQLQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGIF 1065
             E  EKC  P   +L++RLV+ ERA+E       GDRVLEA+S  NK+SE ++KCL GIF
Sbjct: 237  HESMEKCADPFRSKLESRLVE-ERAEESSSGSS-GDRVLEADSTPNKVSEDIVKCLSGIF 294

Query: 1066 LRLSKLKAKTMDAEAFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEAS 1245
             R+S LK K  +  AF    SV   + +R   FRDPY +C + + RDIGPYK L +IEAS
Sbjct: 295  ARMSTLKDKVAELGAFQ---SVASHASNRETEFRDPYDICPEYRNRDIGPYKHLCSIEAS 351

Query: 1246 SIDFXXXXXXXXXXXXXXXXXXXXXXXXXXGLTHQQKLAFWINTYNICMMNAYLGHGINE 1425
            SID                           GLT QQKLAFWIN+YN CMMNA+L HGI  
Sbjct: 352  SIDLNRTTNALFLIHRLKLLFGKLATVNLEGLTQQQKLAFWINSYNSCMMNAFLEHGIPN 411

Query: 1426 SPEMLATLMQKATINVGGLLLNAVTVEHSILRLPYRLK 1539
            +PEM+  LMQKATI VGG LLNA+T+EH ILRLPY LK
Sbjct: 412  TPEMVVALMQKATIVVGGHLLNAITIEHFILRLPYHLK 449


>ref|XP_011039195.1| PREDICTED: uncharacterized protein LOC105135828 isoform X2 [Populus
            euphratica]
          Length = 618

 Score =  275 bits (704), Expect = 1e-81
 Identities = 191/468 (40%), Positives = 233/468 (49%), Gaps = 13/468 (2%)
 Frame = +1

Query: 175  KMDIERCKRSKSAGKPVVVI----TRRERXXXXXXXXXXXXXXXRQEENVHRALERAFSR 342
            K  +E   RS++ G P   I      RER               R EENVHRALERAF+R
Sbjct: 22   KEKMEAQGRSRAVGGPKSAIKWRKANRERKLALLQDVDKLKKKLRHEENVHRALERAFTR 81

Query: 343  RLGTLPHLPPYLPPQTXXXXXXXXXXXXXXXXXXXXXXSFKQGLNQEPVPVSSKTNIGNL 522
             LG LP LPPYLPP                        +F+QGL QE V VSSK N+ N 
Sbjct: 82   PLGALPRLPPYLPPYILELLAEVAVLEEEVVRLEEQVVNFRQGLYQEAVYVSSKKNVEN- 140

Query: 523  DNKLIEQSVKPTMSK--------IDESTPLMSPVEVQSEASKVAAQKPXXXXXXXXXXXX 678
                I+Q    T SK        ++E+ P       Q   ++  + K             
Sbjct: 141  SKDAIDQQPSTTRSKHARSKSLSLNETNPANFAARPQPSLARCTSSKRLFSTDPIIERSG 200

Query: 679  XXXXXXGNDVLNRVVDTKPADVMKSDASNGDDSLPGKENRSSTNSNSKITNPPERVANEV 858
                   N    +    KP     S +S  DD   GKENRS  N   K    P+++A ++
Sbjct: 201  QCSNRPANR--GKYASGKP----NSSSSLVDDGR-GKENRSCINY-VKDKQSPDKMA-KI 251

Query: 859  QNSVKRPLVKFEVSEKCIPPGP-QLQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISE 1035
               VKR   K E  EK + P   QL+ RLV+QERAQE        DR+ E     NK++E
Sbjct: 252  TTPVKRTPNKRESEEKSLEPSKLQLECRLVEQERAQESTSAYM-NDRICENNITPNKLTE 310

Query: 1036 SVLKCLIGIFLRLSKLKAKTMDAEAFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGP 1215
             ++KCL  IFLR+S LK K ++   FS+  ++    GDRG   RDPYG+  + K RDIG 
Sbjct: 311  DIVKCLSSIFLRMSTLKDKVVELGTFSSRATLTSPEGDRGNEIRDPYGMSAEFKIRDIGS 370

Query: 1216 YKDLFAIEASSIDFXXXXXXXXXXXXXXXXXXXXXXXXXXGLTHQQKLAFWINTYNICMM 1395
            YK L+AIEASSID                           GLTHQQKLAFWINTYN CMM
Sbjct: 371  YKHLYAIEASSIDLNRTTSALFLLQRLKFLLGKLASANLEGLTHQQKLAFWINTYNSCMM 430

Query: 1396 NAYLGHGINESPEMLATLMQKATINVGGLLLNAVTVEHSILRLPYRLK 1539
            NA L HGI E+PEM+  LMQKATI VGG LLNA+T+EH ILRLPY LK
Sbjct: 431  NAILEHGIPETPEMVVALMQKATITVGGHLLNAITIEHFILRLPYHLK 478


>ref|XP_011039194.1| PREDICTED: uncharacterized protein LOC105135828 isoform X1 [Populus
            euphratica]
          Length = 619

 Score =  275 bits (704), Expect = 1e-81
 Identities = 191/468 (40%), Positives = 233/468 (49%), Gaps = 13/468 (2%)
 Frame = +1

Query: 175  KMDIERCKRSKSAGKPVVVI----TRRERXXXXXXXXXXXXXXXRQEENVHRALERAFSR 342
            K  +E   RS++ G P   I      RER               R EENVHRALERAF+R
Sbjct: 23   KEKMEAQGRSRAVGGPKSAIKWRKANRERKLALLQDVDKLKKKLRHEENVHRALERAFTR 82

Query: 343  RLGTLPHLPPYLPPQTXXXXXXXXXXXXXXXXXXXXXXSFKQGLNQEPVPVSSKTNIGNL 522
             LG LP LPPYLPP                        +F+QGL QE V VSSK N+ N 
Sbjct: 83   PLGALPRLPPYLPPYILELLAEVAVLEEEVVRLEEQVVNFRQGLYQEAVYVSSKKNVEN- 141

Query: 523  DNKLIEQSVKPTMSK--------IDESTPLMSPVEVQSEASKVAAQKPXXXXXXXXXXXX 678
                I+Q    T SK        ++E+ P       Q   ++  + K             
Sbjct: 142  SKDAIDQQPSTTRSKHARSKSLSLNETNPANFAARPQPSLARCTSSKRLFSTDPIIERSG 201

Query: 679  XXXXXXGNDVLNRVVDTKPADVMKSDASNGDDSLPGKENRSSTNSNSKITNPPERVANEV 858
                   N    +    KP     S +S  DD   GKENRS  N   K    P+++A ++
Sbjct: 202  QCSNRPANR--GKYASGKP----NSSSSLVDDGR-GKENRSCINY-VKDKQSPDKMA-KI 252

Query: 859  QNSVKRPLVKFEVSEKCIPPGP-QLQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISE 1035
               VKR   K E  EK + P   QL+ RLV+QERAQE        DR+ E     NK++E
Sbjct: 253  TTPVKRTPNKRESEEKSLEPSKLQLECRLVEQERAQESTSAYM-NDRICENNITPNKLTE 311

Query: 1036 SVLKCLIGIFLRLSKLKAKTMDAEAFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGP 1215
             ++KCL  IFLR+S LK K ++   FS+  ++    GDRG   RDPYG+  + K RDIG 
Sbjct: 312  DIVKCLSSIFLRMSTLKDKVVELGTFSSRATLTSPEGDRGNEIRDPYGMSAEFKIRDIGS 371

Query: 1216 YKDLFAIEASSIDFXXXXXXXXXXXXXXXXXXXXXXXXXXGLTHQQKLAFWINTYNICMM 1395
            YK L+AIEASSID                           GLTHQQKLAFWINTYN CMM
Sbjct: 372  YKHLYAIEASSIDLNRTTSALFLLQRLKFLLGKLASANLEGLTHQQKLAFWINTYNSCMM 431

Query: 1396 NAYLGHGINESPEMLATLMQKATINVGGLLLNAVTVEHSILRLPYRLK 1539
            NA L HGI E+PEM+  LMQKATI VGG LLNA+T+EH ILRLPY LK
Sbjct: 432  NAILEHGIPETPEMVVALMQKATITVGGHLLNAITIEHFILRLPYHLK 479


>ref|XP_011039196.1| PREDICTED: uncharacterized protein LOC105135828 isoform X3 [Populus
            euphratica]
          Length = 594

 Score =  275 bits (702), Expect = 1e-81
 Identities = 189/460 (41%), Positives = 230/460 (50%), Gaps = 13/460 (2%)
 Frame = +1

Query: 199  RSKSAGKPVVVI----TRRERXXXXXXXXXXXXXXXRQEENVHRALERAFSRRLGTLPHL 366
            RS++ G P   I      RER               R EENVHRALERAF+R LG LP L
Sbjct: 6    RSRAVGGPKSAIKWRKANRERKLALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRL 65

Query: 367  PPYLPPQTXXXXXXXXXXXXXXXXXXXXXXSFKQGLNQEPVPVSSKTNIGNLDNKLIEQS 546
            PPYLPP                        +F+QGL QE V VSSK N+ N     I+Q 
Sbjct: 66   PPYLPPYILELLAEVAVLEEEVVRLEEQVVNFRQGLYQEAVYVSSKKNVEN-SKDAIDQQ 124

Query: 547  VKPTMSK--------IDESTPLMSPVEVQSEASKVAAQKPXXXXXXXXXXXXXXXXXXGN 702
               T SK        ++E+ P       Q   ++  + K                    N
Sbjct: 125  PSTTRSKHARSKSLSLNETNPANFAARPQPSLARCTSSKRLFSTDPIIERSGQCSNRPAN 184

Query: 703  DVLNRVVDTKPADVMKSDASNGDDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPL 882
                +    KP     S +S  DD   GKENRS  N   K    P+++A ++   VKR  
Sbjct: 185  R--GKYASGKP----NSSSSLVDDGR-GKENRSCINY-VKDKQSPDKMA-KITTPVKRTP 235

Query: 883  VKFEVSEKCIPPGP-QLQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIG 1059
             K E  EK + P   QL+ RLV+QERAQE        DR+ E     NK++E ++KCL  
Sbjct: 236  NKRESEEKSLEPSKLQLECRLVEQERAQESTSAYM-NDRICENNITPNKLTEDIVKCLSS 294

Query: 1060 IFLRLSKLKAKTMDAEAFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIE 1239
            IFLR+S LK K ++   FS+  ++    GDRG   RDPYG+  + K RDIG YK L+AIE
Sbjct: 295  IFLRMSTLKDKVVELGTFSSRATLTSPEGDRGNEIRDPYGMSAEFKIRDIGSYKHLYAIE 354

Query: 1240 ASSIDFXXXXXXXXXXXXXXXXXXXXXXXXXXGLTHQQKLAFWINTYNICMMNAYLGHGI 1419
            ASSID                           GLTHQQKLAFWINTYN CMMNA L HGI
Sbjct: 355  ASSIDLNRTTSALFLLQRLKFLLGKLASANLEGLTHQQKLAFWINTYNSCMMNAILEHGI 414

Query: 1420 NESPEMLATLMQKATINVGGLLLNAVTVEHSILRLPYRLK 1539
             E+PEM+  LMQKATI VGG LLNA+T+EH ILRLPY LK
Sbjct: 415  PETPEMVVALMQKATITVGGHLLNAITIEHFILRLPYHLK 454


>ref|XP_022740042.1| uncharacterized protein LOC111292091 isoform X5 [Durio zibethinus]
          Length = 516

 Score =  269 bits (687), Expect = 3e-80
 Identities = 187/460 (40%), Positives = 234/460 (50%), Gaps = 5/460 (1%)
 Frame = +1

Query: 175  KMDIERCKRSKSAGKPVVVITR--RERXXXXXXXXXXXXXXXRQEENVHRALERAFSRRL 348
            KM+  +  R+   GK V    R  RER               R EENVHRALERAF+R L
Sbjct: 25   KMEKSQGSRALGTGKAVTNRRRSNRERKMALLQDVDKLKRKLRHEENVHRALERAFTRPL 84

Query: 349  GTLPHLPPYLPPQTXXXXXXXXXXXXXXXXXXXXXXSFKQGLNQEPVPVSSKTNIGNLDN 528
            G LP LPPYLPP T                      +F+QGL QE V  SSK N+ NL N
Sbjct: 85   GALPRLPPYLPPYTLELLAEVAVLEEEVVRLEEQVVNFRQGLYQEAVYASSKRNVENL-N 143

Query: 529  KLIEQSVKPTMSKIDESTPLMSPVEVQSEASKVAAQKPXXXXXXXXXXXXXXXXXX--GN 702
            + IEQS  P  S   + +  +S V   S  + +A  +P                      
Sbjct: 144  ESIEQS--PVRSSKHQRSKSLS-VNEMSSVTTIARSQPSLARSVSSRKLLPPDSIYDRAG 200

Query: 703  DVLNRVVDTKPADVMKSDASNGDDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPL 882
               +R  + + A   K ++++GD  + GKEN+S TN+  K    PE+  ++V   VKR  
Sbjct: 201  QCFSRSTNGRQAST-KPNSTSGD--IRGKENQSFTNA-VKDKRSPEKKISKVVTPVKRLT 256

Query: 883  VKFEVSEKCIPP-GPQLQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIG 1059
             K E   KC+ P   QL  RLVDQE+AQE        D+  EA+S  NKISE  ++CL  
Sbjct: 257  TKHESPNKCLDPLKVQLDGRLVDQEKAQESPSGSSD-DKPSEADSTPNKISEDTVRCLFS 315

Query: 1060 IFLRLSKLKAKTMDAEAFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIE 1239
            IF+RLS LK K +++    +   V     +R     DPYG+    K RDIGPYK L AIE
Sbjct: 316  IFVRLSTLKDKAVESGTLPSRSVVSSHERNRESECPDPYGIYSDLKTRDIGPYKHLCAIE 375

Query: 1240 ASSIDFXXXXXXXXXXXXXXXXXXXXXXXXXXGLTHQQKLAFWINTYNICMMNAYLGHGI 1419
            A++ID                           GL+HQQKLAFWINTYN CMMNA L HGI
Sbjct: 376  ANTIDLNRSTNALFLIHRLKFLLEKLASVILEGLSHQQKLAFWINTYNSCMMNAILEHGI 435

Query: 1420 NESPEMLATLMQKATINVGGLLLNAVTVEHSILRLPYRLK 1539
             ESPE +  LMQKATI VGG  LNA+T+EH ILRLP+ LK
Sbjct: 436  PESPETVVALMQKATIVVGGHSLNAITIEHFILRLPFHLK 475


>ref|XP_018850500.1| PREDICTED: uncharacterized protein LOC109013035 isoform X1 [Juglans
            regia]
          Length = 593

 Score =  271 bits (692), Expect = 3e-80
 Identities = 188/462 (40%), Positives = 234/462 (50%), Gaps = 14/462 (3%)
 Frame = +1

Query: 196  KRSKSAGKPVVVITR--RERXXXXXXXXXXXXXXXRQEENVHRALERAFSRRLGTLPHLP 369
            ++ + A KPV    R  RER               R EENVHRALERAF+R LG LP LP
Sbjct: 6    RKVRGAEKPVTNRRRSNRERKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLP 65

Query: 370  PYLPPQTXXXXXXXXXXXXXXXXXXXXXXSFKQGLNQEPVPVSSKTNIGNLDNKLIEQSV 549
            PYLPP T                      +F+QGL QE V +SSK N+ N  + +   SV
Sbjct: 66   PYLPPYTLELLAEVAVLEEEVVRLEEQVVNFRQGLYQEAVYISSKKNVENSSDAIDRNSV 125

Query: 550  KPTMSKIDESTPLMSPVEVQSEASKVAAQKPXXXXXXXXXXXXXXXXXXGNDVLNRVVDT 729
            K + +K   S    SP      A+  A  +P                       +R  + 
Sbjct: 126  KSSSTKHQRSKS--SPQNELHSATSTARPQPCLARSTSRKLSSPDTFP------DRAANC 177

Query: 730  KPADVMKSDASNGDDSLP-------GKENRSSTNSNSKITNPPERVANEVQNSVKRPLVK 888
                VM+       +S P       GKEN   +NS  K    PE+   +V  S+KR  +K
Sbjct: 178  SSVPVMEKQTLKKRNSPPSFPEDRQGKENLLCSNS-VKDKQSPEKKYAKVVASMKRAPIK 236

Query: 889  FEVSEKCIPP-GPQLQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGIF 1065
             E  EKC  P   +L++RLV+ ERA+E       GDRVLEA+S  NK+SE ++KCL GIF
Sbjct: 237  HESMEKCADPFRSKLESRLVE-ERAEESSSGSS-GDRVLEADSTPNKVSEDIVKCLSGIF 294

Query: 1066 LRLSKLKAKTMDAEAFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEAS 1245
             R+S LK K  +  AF    SV   + +R   FRDPY +C + + RDIGPYK L +IEAS
Sbjct: 295  ARMSTLKDKVAELGAFQ---SVASHASNRETEFRDPYDICPEYRNRDIGPYKHLCSIEAS 351

Query: 1246 SIDFXXXXXXXXXXXXXXXXXXXXXXXXXXGLTHQQKLAFWINTYNICMMNAYLGHGINE 1425
            SID                           GLT QQKLAFWIN+YN CMMNA+L HGI  
Sbjct: 352  SIDLNRTTNALFLIHRLKLLFGKLATVNLEGLTQQQKLAFWINSYNSCMMNAFLEHGIPN 411

Query: 1426 SPEMLATLMQK----ATINVGGLLLNAVTVEHSILRLPYRLK 1539
            +PEM+  LMQK    ATI VGG LLNA+T+EH ILRLPY LK
Sbjct: 412  TPEMVVALMQKCLPQATIVVGGHLLNAITIEHFILRLPYHLK 453


>ref|XP_017980873.1| PREDICTED: uncharacterized protein LOC18611093 isoform X2 [Theobroma
            cacao]
          Length = 615

 Score =  270 bits (691), Expect = 8e-80
 Identities = 185/459 (40%), Positives = 238/459 (51%), Gaps = 4/459 (0%)
 Frame = +1

Query: 175  KMDIERCKRSKSAGKPVVVITR--RERXXXXXXXXXXXXXXXRQEENVHRALERAFSRRL 348
            KM+  +  R+   GK +    R  RER               R EENVHRALERAF+R L
Sbjct: 25   KMEKSQGGRALGTGKALTNRRRSNRERKMALLQDVDKLKRKLRHEENVHRALERAFTRPL 84

Query: 349  GTLPHLPPYLPPQTXXXXXXXXXXXXXXXXXXXXXXSFKQGLNQEPVPVSSKTNIGNLDN 528
            G LP LPPYLPP T                      +F+QGL QE V  SSK N+ NL N
Sbjct: 85   GALPRLPPYLPPYTLELLAEVAVLEEEVVRLEEQVVNFRQGLYQEAVYASSKRNVENL-N 143

Query: 529  KLIEQSVKPTMSKIDESTPLMSPVEVQSEASKVAAQKPXXXXXXXXXXXXXXXXXXGNDV 708
            + IEQS  P  S   + +  +S  E+ S  +    Q                     N +
Sbjct: 144  ESIEQS--PVRSSKHQRSKSLSVNEMSSVTTIGKPQPSLARSVSSRKLLPPDTTNERNGL 201

Query: 709  -LNRVVDTKPADVMKSDASNGDDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPLV 885
              +R  + + A   K ++++GD  + GKEN+S  N+  K    PE+   +V   VKR   
Sbjct: 202  CFSRPTNGRQAST-KLNSASGD--VRGKENQSFANA-VKDKQSPEKKITKVVTPVKRLPT 257

Query: 886  KFEVSEKCIPP-GPQLQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGI 1062
            K E + KC+     QL  RLVDQERAQE        D+V EA+S  NKISE  ++CL  I
Sbjct: 258  KHESANKCLDALKSQLDGRLVDQERAQESPSGSSD-DKVSEADSTPNKISEDTVRCLCSI 316

Query: 1063 FLRLSKLKAKTMDAEAFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEA 1242
            F+RLS LK +++++    +  +       R   F+DPYG+C  SK RDIGPYK+L  IEA
Sbjct: 317  FVRLSTLKDRSVESGILPSQSAANSYEISRESEFQDPYGICSDSKTRDIGPYKNLCTIEA 376

Query: 1243 SSIDFXXXXXXXXXXXXXXXXXXXXXXXXXXGLTHQQKLAFWINTYNICMMNAYLGHGIN 1422
            +++D                           GL+HQQKLAFWINTYN CMMNA L HGI 
Sbjct: 377  NTVDLSRRMNALFLIHRLKFLLGKLTSVNLDGLSHQQKLAFWINTYNSCMMNAILEHGIP 436

Query: 1423 ESPEMLATLMQKATINVGGLLLNAVTVEHSILRLPYRLK 1539
            E+PE +  LMQKATI VGG LLNA+T+EH ILRLP+ LK
Sbjct: 437  ETPESVVGLMQKATIVVGGHLLNAITIEHFILRLPFHLK 475


>ref|XP_007047200.1| PREDICTED: uncharacterized protein LOC18611093 isoform X1 [Theobroma
            cacao]
 ref|XP_017980870.1| PREDICTED: uncharacterized protein LOC18611093 isoform X1 [Theobroma
            cacao]
 gb|EOX91357.1| Uncharacterized protein TCM_000576 isoform 1 [Theobroma cacao]
 gb|EOX91358.1| Uncharacterized protein TCM_000576 isoform 1 [Theobroma cacao]
          Length = 616

 Score =  270 bits (691), Expect = 8e-80
 Identities = 185/459 (40%), Positives = 238/459 (51%), Gaps = 4/459 (0%)
 Frame = +1

Query: 175  KMDIERCKRSKSAGKPVVVITR--RERXXXXXXXXXXXXXXXRQEENVHRALERAFSRRL 348
            KM+  +  R+   GK +    R  RER               R EENVHRALERAF+R L
Sbjct: 26   KMEKSQGGRALGTGKALTNRRRSNRERKMALLQDVDKLKRKLRHEENVHRALERAFTRPL 85

Query: 349  GTLPHLPPYLPPQTXXXXXXXXXXXXXXXXXXXXXXSFKQGLNQEPVPVSSKTNIGNLDN 528
            G LP LPPYLPP T                      +F+QGL QE V  SSK N+ NL N
Sbjct: 86   GALPRLPPYLPPYTLELLAEVAVLEEEVVRLEEQVVNFRQGLYQEAVYASSKRNVENL-N 144

Query: 529  KLIEQSVKPTMSKIDESTPLMSPVEVQSEASKVAAQKPXXXXXXXXXXXXXXXXXXGNDV 708
            + IEQS  P  S   + +  +S  E+ S  +    Q                     N +
Sbjct: 145  ESIEQS--PVRSSKHQRSKSLSVNEMSSVTTIGKPQPSLARSVSSRKLLPPDTTNERNGL 202

Query: 709  -LNRVVDTKPADVMKSDASNGDDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPLV 885
              +R  + + A   K ++++GD  + GKEN+S  N+  K    PE+   +V   VKR   
Sbjct: 203  CFSRPTNGRQAST-KLNSASGD--VRGKENQSFANA-VKDKQSPEKKITKVVTPVKRLPT 258

Query: 886  KFEVSEKCIPP-GPQLQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGI 1062
            K E + KC+     QL  RLVDQERAQE        D+V EA+S  NKISE  ++CL  I
Sbjct: 259  KHESANKCLDALKSQLDGRLVDQERAQESPSGSSD-DKVSEADSTPNKISEDTVRCLCSI 317

Query: 1063 FLRLSKLKAKTMDAEAFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEA 1242
            F+RLS LK +++++    +  +       R   F+DPYG+C  SK RDIGPYK+L  IEA
Sbjct: 318  FVRLSTLKDRSVESGILPSQSAANSYEISRESEFQDPYGICSDSKTRDIGPYKNLCTIEA 377

Query: 1243 SSIDFXXXXXXXXXXXXXXXXXXXXXXXXXXGLTHQQKLAFWINTYNICMMNAYLGHGIN 1422
            +++D                           GL+HQQKLAFWINTYN CMMNA L HGI 
Sbjct: 378  NTVDLSRRMNALFLIHRLKFLLGKLTSVNLDGLSHQQKLAFWINTYNSCMMNAILEHGIP 437

Query: 1423 ESPEMLATLMQKATINVGGLLLNAVTVEHSILRLPYRLK 1539
            E+PE +  LMQKATI VGG LLNA+T+EH ILRLP+ LK
Sbjct: 438  ETPESVVGLMQKATIVVGGHLLNAITIEHFILRLPFHLK 476


>ref|XP_023875256.1| uncharacterized protein LOC111987747 isoform X2 [Quercus suber]
 gb|POE82600.1| hypothetical protein CFP56_46449 [Quercus suber]
          Length = 610

 Score =  270 bits (690), Expect = 1e-79
 Identities = 183/457 (40%), Positives = 231/457 (50%), Gaps = 10/457 (2%)
 Frame = +1

Query: 199  RSKSAGKPVVVITR--RERXXXXXXXXXXXXXXXRQEENVHRALERAFSRRLGTLPHLPP 372
            R  +AGKPV    R  RER               RQEENVHRALERAF+R LG LP LPP
Sbjct: 29   RVMAAGKPVTNRRRSNRERKMALLQDVDKLKKKLRQEENVHRALERAFTRPLGALPRLPP 88

Query: 373  YLPPQTXXXXXXXXXXXXXXXXXXXXXXSFKQGLNQEPVPVSSKTNIGNLDNKLIEQSVK 552
            YLPP T                      +F+QGL QE +  SSK N  N  +++   S++
Sbjct: 89   YLPPYTLELLAEVAVLEEEVVRLEEQVVNFRQGLYQEAIYTSSKRNAEN-SSEIDHNSIR 147

Query: 553  PTMSKIDESTP---LMSPVEVQSEASKVAAQKPXXXXXXXXXXXXXXXXXXGNDVLNRVV 723
             +  +  +S P   L S   +      +A                         V+ +  
Sbjct: 148  SSKHQRSKSLPQNELNSATSMPRPLPSLARSTSSRKLLFSDTTPDRTANCSSRLVVGKQS 207

Query: 724  DTKPADVMKSDASNGDDSLP----GKENRSSTNSNSKITNPPERVANEVQNSVKRPLVKF 891
              KP         N   S P    GKENR  TNS  K    PE+  +++   VK+P +K 
Sbjct: 208  PKKP---------NSSPSFPEDGRGKENRLCTNS-VKDKPSPEKKYSKIATPVKKPPIKR 257

Query: 892  EVSEKCIPP-GPQLQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGIFL 1068
            E  EKC+     QL++RLVDQE+AQE        D +LEA+S  NK+SE+++KCL  IF+
Sbjct: 258  ESVEKCVDNFKSQLESRLVDQEKAQESSSGSSD-DSLLEADSNPNKVSENIVKCLSSIFM 316

Query: 1069 RLSKLKAKTMDAEAFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEASS 1248
            R+S +K K + +    +  S    + +R   F DPY +C + + RDIGPYK L+AIEASS
Sbjct: 317  RISTVKDKVVHSGTSQSATS---HASNRETEFGDPYDICSEFRTRDIGPYKHLYAIEASS 373

Query: 1249 IDFXXXXXXXXXXXXXXXXXXXXXXXXXXGLTHQQKLAFWINTYNICMMNAYLGHGINES 1428
            ID                           GLTHQQKLAFWIN YN CMMNA L HGI E 
Sbjct: 374  IDLNRKTNALFLIQRLKFLFGRLAAVNLEGLTHQQKLAFWINIYNSCMMNAILEHGIPEM 433

Query: 1429 PEMLATLMQKATINVGGLLLNAVTVEHSILRLPYRLK 1539
            PEM+  LMQKATI VGG LL A+T+EH ILRLPY LK
Sbjct: 434  PEMVVALMQKATIVVGGHLLYAITIEHFILRLPYHLK 470


>ref|XP_017980875.1| PREDICTED: uncharacterized protein LOC18611093 isoform X3 [Theobroma
            cacao]
          Length = 590

 Score =  269 bits (687), Expect = 2e-79
 Identities = 179/437 (40%), Positives = 229/437 (52%), Gaps = 2/437 (0%)
 Frame = +1

Query: 235  TRRERXXXXXXXXXXXXXXXRQEENVHRALERAFSRRLGTLPHLPPYLPPQTXXXXXXXX 414
            + RER               R EENVHRALERAF+R LG LP LPPYLPP T        
Sbjct: 22   SNRERKMALLQDVDKLKRKLRHEENVHRALERAFTRPLGALPRLPPYLPPYTLELLAEVA 81

Query: 415  XXXXXXXXXXXXXXSFKQGLNQEPVPVSSKTNIGNLDNKLIEQSVKPTMSKIDESTPLMS 594
                          +F+QGL QE V  SSK N+ NL N+ IEQS  P  S   + +  +S
Sbjct: 82   VLEEEVVRLEEQVVNFRQGLYQEAVYASSKRNVENL-NESIEQS--PVRSSKHQRSKSLS 138

Query: 595  PVEVQSEASKVAAQKPXXXXXXXXXXXXXXXXXXGNDV-LNRVVDTKPADVMKSDASNGD 771
              E+ S  +    Q                     N +  +R  + + A   K ++++GD
Sbjct: 139  VNEMSSVTTIGKPQPSLARSVSSRKLLPPDTTNERNGLCFSRPTNGRQAST-KLNSASGD 197

Query: 772  DSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPLVKFEVSEKCIPP-GPQLQNRLVD 948
              + GKEN+S  N+  K    PE+   +V   VKR   K E + KC+     QL  RLVD
Sbjct: 198  --VRGKENQSFANA-VKDKQSPEKKITKVVTPVKRLPTKHESANKCLDALKSQLDGRLVD 254

Query: 949  QERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGIFLRLSKLKAKTMDAEAFSNLMS 1128
            QERAQE        D+V EA+S  NKISE  ++CL  IF+RLS LK +++++    +  +
Sbjct: 255  QERAQESPSGSSD-DKVSEADSTPNKISEDTVRCLCSIFVRLSTLKDRSVESGILPSQSA 313

Query: 1129 VGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEASSIDFXXXXXXXXXXXXXXXXX 1308
                   R   F+DPYG+C  SK RDIGPYK+L  IEA+++D                  
Sbjct: 314  ANSYEISRESEFQDPYGICSDSKTRDIGPYKNLCTIEANTVDLSRRMNALFLIHRLKFLL 373

Query: 1309 XXXXXXXXXGLTHQQKLAFWINTYNICMMNAYLGHGINESPEMLATLMQKATINVGGLLL 1488
                     GL+HQQKLAFWINTYN CMMNA L HGI E+PE +  LMQKATI VGG LL
Sbjct: 374  GKLTSVNLDGLSHQQKLAFWINTYNSCMMNAILEHGIPETPESVVGLMQKATIVVGGHLL 433

Query: 1489 NAVTVEHSILRLPYRLK 1539
            NA+T+EH ILRLP+ LK
Sbjct: 434  NAITIEHFILRLPFHLK 450


>ref|XP_018850503.1| PREDICTED: uncharacterized protein LOC109013035 isoform X4 [Juglans
            regia]
          Length = 563

 Score =  268 bits (685), Expect = 2e-79
 Identities = 180/427 (42%), Positives = 223/427 (52%), Gaps = 12/427 (2%)
 Frame = +1

Query: 295  RQEENVHRALERAFSRRLGTLPHLPPYLPPQTXXXXXXXXXXXXXXXXXXXXXXSFKQGL 474
            R EENVHRALERAF+R LG LP LPPYLPP T                      +F+QGL
Sbjct: 11   RHEENVHRALERAFTRPLGALPRLPPYLPPYTLELLAEVAVLEEEVVRLEEQVVNFRQGL 70

Query: 475  NQEPVPVSSKTNIGNLDNKLIEQSVKPTMSKIDESTPLMSPVEVQSEASKVAAQKPXXXX 654
             QE V +SSK N+ N  + +   SVK + +K   S    SP      A+  A  +P    
Sbjct: 71   YQEAVYISSKKNVENSSDAIDRNSVKSSSTKHQRSKS--SPQNELHSATSTARPQPCLAR 128

Query: 655  XXXXXXXXXXXXXXGNDVLNRVVDTKPADVMKSDASNGDDSLP-------GKENRSSTNS 813
                               +R  +     VM+       +S P       GKEN   +NS
Sbjct: 129  STSRKLSSPDTFP------DRAANCSSVPVMEKQTLKKRNSPPSFPEDRQGKENLLCSNS 182

Query: 814  NSKITNPPERVANEVQNSVKRPLVKFEVSEKCIPP-GPQLQNRLVDQERAQEXXXXXXXG 990
              K    PE+   +V  S+KR  +K E  EKC  P   +L++RLV+ ERA+E       G
Sbjct: 183  -VKDKQSPEKKYAKVVASMKRAPIKHESMEKCADPFRSKLESRLVE-ERAEESSSGSS-G 239

Query: 991  DRVLEAESECNKISESVLKCLIGIFLRLSKLKAKTMDAEAFSNLMSVGLTSGDRGPSFRD 1170
            DRVLEA+S  NK+SE ++KCL GIF R+S LK K  +  AF    SV   + +R   FRD
Sbjct: 240  DRVLEADSTPNKVSEDIVKCLSGIFARMSTLKDKVAELGAFQ---SVASHASNRETEFRD 296

Query: 1171 PYGVCLKSKRRDIGPYKDLFAIEASSIDFXXXXXXXXXXXXXXXXXXXXXXXXXXGLTHQ 1350
            PY +C + + RDIGPYK L +IEASSID                           GLT Q
Sbjct: 297  PYDICPEYRNRDIGPYKHLCSIEASSIDLNRTTNALFLIHRLKLLFGKLATVNLEGLTQQ 356

Query: 1351 QKLAFWINTYNICMMNAYLGHGINESPEMLATLMQK----ATINVGGLLLNAVTVEHSIL 1518
            QKLAFWIN+YN CMMNA+L HGI  +PEM+  LMQK    ATI VGG LLNA+T+EH IL
Sbjct: 357  QKLAFWINSYNSCMMNAFLEHGIPNTPEMVVALMQKCLPQATIVVGGHLLNAITIEHFIL 416

Query: 1519 RLPYRLK 1539
            RLPY LK
Sbjct: 417  RLPYHLK 423


>ref|XP_022740041.1| uncharacterized protein LOC111292091 isoform X4 [Durio zibethinus]
          Length = 612

 Score =  269 bits (687), Expect = 3e-79
 Identities = 187/460 (40%), Positives = 234/460 (50%), Gaps = 5/460 (1%)
 Frame = +1

Query: 175  KMDIERCKRSKSAGKPVVVITR--RERXXXXXXXXXXXXXXXRQEENVHRALERAFSRRL 348
            KM+  +  R+   GK V    R  RER               R EENVHRALERAF+R L
Sbjct: 22   KMEKSQGSRALGTGKAVTNRRRSNRERKMALLQDVDKLKRKLRHEENVHRALERAFTRPL 81

Query: 349  GTLPHLPPYLPPQTXXXXXXXXXXXXXXXXXXXXXXSFKQGLNQEPVPVSSKTNIGNLDN 528
            G LP LPPYLPP T                      +F+QGL QE V  SSK N+ NL N
Sbjct: 82   GALPRLPPYLPPYTLELLAEVAVLEEEVVRLEEQVVNFRQGLYQEAVYASSKRNVENL-N 140

Query: 529  KLIEQSVKPTMSKIDESTPLMSPVEVQSEASKVAAQKPXXXXXXXXXXXXXXXXXX--GN 702
            + IEQS  P  S   + +  +S V   S  + +A  +P                      
Sbjct: 141  ESIEQS--PVRSSKHQRSKSLS-VNEMSSVTTIARSQPSLARSVSSRKLLPPDSIYDRAG 197

Query: 703  DVLNRVVDTKPADVMKSDASNGDDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPL 882
               +R  + + A   K ++++GD  + GKEN+S TN+  K    PE+  ++V   VKR  
Sbjct: 198  QCFSRSTNGRQAST-KPNSTSGD--IRGKENQSFTNA-VKDKRSPEKKISKVVTPVKRLT 253

Query: 883  VKFEVSEKCIPP-GPQLQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIG 1059
             K E   KC+ P   QL  RLVDQE+AQE        D+  EA+S  NKISE  ++CL  
Sbjct: 254  TKHESPNKCLDPLKVQLDGRLVDQEKAQESPSGSSD-DKPSEADSTPNKISEDTVRCLFS 312

Query: 1060 IFLRLSKLKAKTMDAEAFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIE 1239
            IF+RLS LK K +++    +   V     +R     DPYG+    K RDIGPYK L AIE
Sbjct: 313  IFVRLSTLKDKAVESGTLPSRSVVSSHERNRESECPDPYGIYSDLKTRDIGPYKHLCAIE 372

Query: 1240 ASSIDFXXXXXXXXXXXXXXXXXXXXXXXXXXGLTHQQKLAFWINTYNICMMNAYLGHGI 1419
            A++ID                           GL+HQQKLAFWINTYN CMMNA L HGI
Sbjct: 373  ANTIDLNRSTNALFLIHRLKFLLEKLASVILEGLSHQQKLAFWINTYNSCMMNAILEHGI 432

Query: 1420 NESPEMLATLMQKATINVGGLLLNAVTVEHSILRLPYRLK 1539
             ESPE +  LMQKATI VGG  LNA+T+EH ILRLP+ LK
Sbjct: 433  PESPETVVALMQKATIVVGGHSLNAITIEHFILRLPFHLK 472


>ref|XP_022740038.1| uncharacterized protein LOC111292091 isoform X2 [Durio zibethinus]
          Length = 614

 Score =  269 bits (687), Expect = 3e-79
 Identities = 187/460 (40%), Positives = 234/460 (50%), Gaps = 5/460 (1%)
 Frame = +1

Query: 175  KMDIERCKRSKSAGKPVVVITR--RERXXXXXXXXXXXXXXXRQEENVHRALERAFSRRL 348
            KM+  +  R+   GK V    R  RER               R EENVHRALERAF+R L
Sbjct: 24   KMEKSQGSRALGTGKAVTNRRRSNRERKMALLQDVDKLKRKLRHEENVHRALERAFTRPL 83

Query: 349  GTLPHLPPYLPPQTXXXXXXXXXXXXXXXXXXXXXXSFKQGLNQEPVPVSSKTNIGNLDN 528
            G LP LPPYLPP T                      +F+QGL QE V  SSK N+ NL N
Sbjct: 84   GALPRLPPYLPPYTLELLAEVAVLEEEVVRLEEQVVNFRQGLYQEAVYASSKRNVENL-N 142

Query: 529  KLIEQSVKPTMSKIDESTPLMSPVEVQSEASKVAAQKPXXXXXXXXXXXXXXXXXX--GN 702
            + IEQS  P  S   + +  +S V   S  + +A  +P                      
Sbjct: 143  ESIEQS--PVRSSKHQRSKSLS-VNEMSSVTTIARSQPSLARSVSSRKLLPPDSIYDRAG 199

Query: 703  DVLNRVVDTKPADVMKSDASNGDDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPL 882
               +R  + + A   K ++++GD  + GKEN+S TN+  K    PE+  ++V   VKR  
Sbjct: 200  QCFSRSTNGRQAST-KPNSTSGD--IRGKENQSFTNA-VKDKRSPEKKISKVVTPVKRLT 255

Query: 883  VKFEVSEKCIPP-GPQLQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIG 1059
             K E   KC+ P   QL  RLVDQE+AQE        D+  EA+S  NKISE  ++CL  
Sbjct: 256  TKHESPNKCLDPLKVQLDGRLVDQEKAQESPSGSSD-DKPSEADSTPNKISEDTVRCLFS 314

Query: 1060 IFLRLSKLKAKTMDAEAFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIE 1239
            IF+RLS LK K +++    +   V     +R     DPYG+    K RDIGPYK L AIE
Sbjct: 315  IFVRLSTLKDKAVESGTLPSRSVVSSHERNRESECPDPYGIYSDLKTRDIGPYKHLCAIE 374

Query: 1240 ASSIDFXXXXXXXXXXXXXXXXXXXXXXXXXXGLTHQQKLAFWINTYNICMMNAYLGHGI 1419
            A++ID                           GL+HQQKLAFWINTYN CMMNA L HGI
Sbjct: 375  ANTIDLNRSTNALFLIHRLKFLLEKLASVILEGLSHQQKLAFWINTYNSCMMNAILEHGI 434

Query: 1420 NESPEMLATLMQKATINVGGLLLNAVTVEHSILRLPYRLK 1539
             ESPE +  LMQKATI VGG  LNA+T+EH ILRLP+ LK
Sbjct: 435  PESPETVVALMQKATIVVGGHSLNAITIEHFILRLPFHLK 474


>ref|XP_022740037.1| uncharacterized protein LOC111292091 isoform X1 [Durio zibethinus]
          Length = 615

 Score =  269 bits (687), Expect = 3e-79
 Identities = 187/460 (40%), Positives = 234/460 (50%), Gaps = 5/460 (1%)
 Frame = +1

Query: 175  KMDIERCKRSKSAGKPVVVITR--RERXXXXXXXXXXXXXXXRQEENVHRALERAFSRRL 348
            KM+  +  R+   GK V    R  RER               R EENVHRALERAF+R L
Sbjct: 25   KMEKSQGSRALGTGKAVTNRRRSNRERKMALLQDVDKLKRKLRHEENVHRALERAFTRPL 84

Query: 349  GTLPHLPPYLPPQTXXXXXXXXXXXXXXXXXXXXXXSFKQGLNQEPVPVSSKTNIGNLDN 528
            G LP LPPYLPP T                      +F+QGL QE V  SSK N+ NL N
Sbjct: 85   GALPRLPPYLPPYTLELLAEVAVLEEEVVRLEEQVVNFRQGLYQEAVYASSKRNVENL-N 143

Query: 529  KLIEQSVKPTMSKIDESTPLMSPVEVQSEASKVAAQKPXXXXXXXXXXXXXXXXXX--GN 702
            + IEQS  P  S   + +  +S V   S  + +A  +P                      
Sbjct: 144  ESIEQS--PVRSSKHQRSKSLS-VNEMSSVTTIARSQPSLARSVSSRKLLPPDSIYDRAG 200

Query: 703  DVLNRVVDTKPADVMKSDASNGDDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPL 882
               +R  + + A   K ++++GD  + GKEN+S TN+  K    PE+  ++V   VKR  
Sbjct: 201  QCFSRSTNGRQAST-KPNSTSGD--IRGKENQSFTNA-VKDKRSPEKKISKVVTPVKRLT 256

Query: 883  VKFEVSEKCIPP-GPQLQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIG 1059
             K E   KC+ P   QL  RLVDQE+AQE        D+  EA+S  NKISE  ++CL  
Sbjct: 257  TKHESPNKCLDPLKVQLDGRLVDQEKAQESPSGSSD-DKPSEADSTPNKISEDTVRCLFS 315

Query: 1060 IFLRLSKLKAKTMDAEAFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIE 1239
            IF+RLS LK K +++    +   V     +R     DPYG+    K RDIGPYK L AIE
Sbjct: 316  IFVRLSTLKDKAVESGTLPSRSVVSSHERNRESECPDPYGIYSDLKTRDIGPYKHLCAIE 375

Query: 1240 ASSIDFXXXXXXXXXXXXXXXXXXXXXXXXXXGLTHQQKLAFWINTYNICMMNAYLGHGI 1419
            A++ID                           GL+HQQKLAFWINTYN CMMNA L HGI
Sbjct: 376  ANTIDLNRSTNALFLIHRLKFLLEKLASVILEGLSHQQKLAFWINTYNSCMMNAILEHGI 435

Query: 1420 NESPEMLATLMQKATINVGGLLLNAVTVEHSILRLPYRLK 1539
             ESPE +  LMQKATI VGG  LNA+T+EH ILRLP+ LK
Sbjct: 436  PESPETVVALMQKATIVVGGHSLNAITIEHFILRLPFHLK 475


>ref|XP_022740039.1| uncharacterized protein LOC111292091 isoform X3 [Durio zibethinus]
          Length = 613

 Score =  268 bits (684), Expect = 8e-79
 Identities = 186/459 (40%), Positives = 234/459 (50%), Gaps = 4/459 (0%)
 Frame = +1

Query: 175  KMDIERCKRSKSAGKPVVVITR--RERXXXXXXXXXXXXXXXRQEENVHRALERAFSRRL 348
            KM+  +  R+   GK V    R  RER               R EENVHRALERAF+R L
Sbjct: 25   KMEKSQGSRALGTGKAVTNRRRSNRERKMALLQDVDKLKRKLRHEENVHRALERAFTRPL 84

Query: 349  GTLPHLPPYLPPQTXXXXXXXXXXXXXXXXXXXXXXSFKQGLNQEPVPVSSKTNIGNLDN 528
            G LP LPPYLPP T                      +F+QGL QE V  SSK N+ NL N
Sbjct: 85   GALPRLPPYLPPYTLELLAEVAVLEEEVVRLEEQVVNFRQGLYQEAVYASSKRNVENL-N 143

Query: 529  KLIEQSVKPTMSKIDESTPLMSPVEVQSEASKVAAQKPXXXXXXXXXXXXXXXXXX--GN 702
            + IEQS  P  S   + +  +S V   S  + +A  +P                      
Sbjct: 144  ESIEQS--PVRSSKHQRSKSLS-VNEMSSVTTIARSQPSLARSVSSRKLLPPDSIYDRAG 200

Query: 703  DVLNRVVDTKPADVMKSDASNGDDSLPGKENRSSTNSNSKITNPPERVANEVQNSVKRPL 882
               +R  + + A   K ++++GD  + GKEN+S TN+  K    PE+  ++V   VKR  
Sbjct: 201  QCFSRSTNGRQAST-KPNSTSGD--IRGKENQSFTNA-VKDKRSPEKKISKVVTPVKRLT 256

Query: 883  VKFEVSEKCIPPGPQLQNRLVDQERAQEXXXXXXXGDRVLEAESECNKISESVLKCLIGI 1062
             K E   KC+ P  +L  RLVDQE+AQE        D+  EA+S  NKISE  ++CL  I
Sbjct: 257  TKHESPNKCLDP-LKLDGRLVDQEKAQESPSGSSD-DKPSEADSTPNKISEDTVRCLFSI 314

Query: 1063 FLRLSKLKAKTMDAEAFSNLMSVGLTSGDRGPSFRDPYGVCLKSKRRDIGPYKDLFAIEA 1242
            F+RLS LK K +++    +   V     +R     DPYG+    K RDIGPYK L AIEA
Sbjct: 315  FVRLSTLKDKAVESGTLPSRSVVSSHERNRESECPDPYGIYSDLKTRDIGPYKHLCAIEA 374

Query: 1243 SSIDFXXXXXXXXXXXXXXXXXXXXXXXXXXGLTHQQKLAFWINTYNICMMNAYLGHGIN 1422
            ++ID                           GL+HQQKLAFWINTYN CMMNA L HGI 
Sbjct: 375  NTIDLNRSTNALFLIHRLKFLLEKLASVILEGLSHQQKLAFWINTYNSCMMNAILEHGIP 434

Query: 1423 ESPEMLATLMQKATINVGGLLLNAVTVEHSILRLPYRLK 1539
            ESPE +  LMQKATI VGG  LNA+T+EH ILRLP+ LK
Sbjct: 435  ESPETVVALMQKATIVVGGHSLNAITIEHFILRLPFHLK 473


>ref|XP_003635502.2| PREDICTED: uncharacterized protein LOC100855363 [Vitis vinifera]
          Length = 593

 Score =  264 bits (675), Expect = 1e-77
 Identities = 167/417 (40%), Positives = 222/417 (53%), Gaps = 2/417 (0%)
 Frame = +1

Query: 295  RQEENVHRALERAFSRRLGTLPHLPPYLPPQTXXXXXXXXXXXXXXXXXXXXXXSFKQGL 474
            R EENVHRALERAF+R LG LP LPPYLPP T                      +F+QGL
Sbjct: 41   RHEENVHRALERAFTRPLGALPRLPPYLPPYTLELLAEVAVLEEEVVRLEEQVVNFRQGL 100

Query: 475  NQEPVPVSSKTNIGNLDNKLIEQSVKPTMSKIDESTPL-MSPVEVQSEASKVAAQKPXXX 651
             QE V +S K ++ N  + + + SV    SK ++S  L  + + +++ A++         
Sbjct: 101  YQEAVYISCKRHVENSTDVIDQSSVGS--SKQEQSKSLSQNEINLETSATRPLPSLTRST 158

Query: 652  XXXXXXXXXXXXXXXGNDVLNRVVDTKPADVMKSDASNGDDSLPGKENRSSTNSNSKITN 831
                           G+     V  T+      S +   ++   GKENR+  NS SK   
Sbjct: 159  SSRKLLSADSVSDRAGHCSTRTVNGTQALKKRNSSSPLLEEDRQGKENRTCMNS-SKNKQ 217

Query: 832  PPERVANEVQNSVKRPLVKFEVSEKCIPPGP-QLQNRLVDQERAQEXXXXXXXGDRVLEA 1008
             P++ +  V   VK+  +K E  EK   P   QL+ RLVDQERAQE        +RV EA
Sbjct: 218  SPDKKSPRVITLVKKSPIKHEPVEKFRDPLKLQLECRLVDQERAQESSCASLD-ERVSEA 276

Query: 1009 ESECNKISESVLKCLIGIFLRLSKLKAKTMDAEAFSNLMSVGLTSGDRGPSFRDPYGVCL 1188
            +S  NKISE ++KCL  IFLR+S L+ K ++++A    ++      +      DPYG+CL
Sbjct: 277  DSGPNKISEDIVKCLSSIFLRMSTLREKVVESDATPPPLAFASNESNGEAESLDPYGICL 336

Query: 1189 KSKRRDIGPYKDLFAIEASSIDFXXXXXXXXXXXXXXXXXXXXXXXXXXGLTHQQKLAFW 1368
            +   R++GPYK L  I+A S+D                           GLTHQQKLAFW
Sbjct: 337  EFGARNVGPYKHLCDIQAGSVDLNRKTNALFLIHRLKLLLGKLACVNLEGLTHQQKLAFW 396

Query: 1369 INTYNICMMNAYLGHGINESPEMLATLMQKATINVGGLLLNAVTVEHSILRLPYRLK 1539
            IN YN CMMNA+L HG+ E+PEM+  LMQKATINVGG LLNA+T+EH ILRLPY LK
Sbjct: 397  INIYNSCMMNAFLEHGVPENPEMVVALMQKATINVGGCLLNAITIEHFILRLPYHLK 453


Top