BLASTX nr result

ID: Chrysanthemum22_contig00019136 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum22_contig00019136
         (2350 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|OTG38382.1| putative transducin/WD40 repeat-like superfamily ...   594   0.0  
ref|XP_023756470.1| general transcription factor 3C polypeptide ...   587   0.0  
ref|XP_021986505.1| uncharacterized protein LOC110882925 isoform...   594   0.0  
gb|KVI07947.1| AT hook, DNA-binding motif-containing protein [Cy...   541   e-175
ref|XP_023920540.1| uncharacterized protein LOC112032069 isoform...   503   e-164
gb|OMO89342.1| hypothetical protein CCACVL1_07901 [Corchorus cap...   508   e-163
ref|XP_021300958.1| uncharacterized protein LOC110429313 [Herran...   504   e-163
ref|XP_010242589.1| PREDICTED: uncharacterized protein LOC104586...   505   e-163
ref|XP_022765155.1| uncharacterized protein LOC111310200 isoform...   497   e-163
ref|XP_019051477.1| PREDICTED: uncharacterized protein LOC104586...   505   e-163
gb|EOX93901.1| DNA binding protein, putative isoform 1 [Theobrom...   504   e-163
gb|OMO95816.1| hypothetical protein COLO4_15658 [Corchorus olito...   508   e-163
ref|XP_017969458.1| PREDICTED: uncharacterized protein LOC186127...   503   e-163
ref|XP_023920539.1| uncharacterized protein LOC112032069 isoform...   503   e-162
ref|XP_022765154.1| uncharacterized protein LOC111310200 isoform...   497   e-162
gb|KDO61089.1| hypothetical protein CISIN_1g008363mg [Citrus sin...   490   e-162
ref|XP_022765150.1| uncharacterized protein LOC111310200 isoform...   497   e-161
ref|XP_022765149.1| uncharacterized protein LOC111310200 isoform...   497   e-161
ref|XP_017969461.1| PREDICTED: uncharacterized protein LOC186127...   498   e-161
ref|XP_024037794.1| uncharacterized protein LOC18039905 isoform ...   490   e-158

>gb|OTG38382.1| putative transducin/WD40 repeat-like superfamily protein [Helianthus
            annuus]
          Length = 922

 Score =  594 bits (1531), Expect = 0.0
 Identities = 297/451 (65%), Positives = 338/451 (74%), Gaps = 8/451 (1%)
 Frame = +1

Query: 796  ISEDVALPRLVMGLAHNGKVAWDVKWKPIDSHNISKYRMGYLAVLLGSGALEVWEIPLPR 975
            I ED  LPRLVMGLAHNGKVAWDVKW+P D  ++SK+ MGYLAVLLG+GALEVWE+PLPR
Sbjct: 472  IDEDDVLPRLVMGLAHNGKVAWDVKWRPSDFRHVSKHVMGYLAVLLGNGALEVWEVPLPR 531

Query: 976  ATKAIFSSCETEGADPRFVKLKPVFMCSMLKCGDIKSIPITLEWSTSAPHDLILAGCHDG 1155
             TKAIFS C+ EG DPRF+KLKPVF+CSMLKCGD +SIP+TLEWSTSAPHDLILAGCHDG
Sbjct: 532  VTKAIFS-CQKEGKDPRFLKLKPVFLCSMLKCGDRQSIPLTLEWSTSAPHDLILAGCHDG 590

Query: 1156 MVALWKFSSNGPSKDTRPLICFSADTVPIRALKWAPRASDLESANIIVTSGHKGLKFWDL 1335
            +VALWKFS++  SKDTRPL+CFSADTVPIRAL WAP ASD ESANII T  HKGLKFWD+
Sbjct: 591  VVALWKFSASSSSKDTRPLLCFSADTVPIRALTWAPLASDPESANIIATGSHKGLKFWDI 650

Query: 1336 RDPFRPLWDIPTQKIIYSLDWLPDPSCVILSFDDGEIRIMSLLRAASDVPITGMPCEKTQ 1515
            RDPF PLWDIP QK++ SL+WLPDP CV+LSFDDGEI+I+SLL+AASDVP+TGMPC+K  
Sbjct: 651  RDPFHPLWDIPYQKVVNSLEWLPDPRCVVLSFDDGEIKIISLLKAASDVPVTGMPCDKKP 710

Query: 1516 -QXXXXXXXXXXXXXXXXXXXXTGMVAYCCSDGKVIHFQLTARALGKDHHRNREPHYLCG 1692
                                  TGMVAYCCSDGKVI+FQLT +A+ KD HRNREPHYLCG
Sbjct: 711  LHGSYSYYCSSSSIWSVQVSSLTGMVAYCCSDGKVINFQLTIKAVEKDPHRNREPHYLCG 770

Query: 1693 SLAVEE-STLTVFSPLPNVPFTLKRSLNEWGNTPRSKRGFLSHCNQEKRAKKEISKCQIT 1869
            SL  EE S+LTV SPLP+VP  +KRS  EWG+TP++ RG  S  NQEKRAK +ISK Q  
Sbjct: 771  SLTEEEDSSLTVLSPLPDVPIPMKRSSTEWGDTPKTSRGVKSRSNQEKRAKGQISKFQTP 830

Query: 1870 GNSSDSEGTM-----VXXXXXXXXXXXXXXXXLVCI-DDNKSXXXXXXXXXXXTLPPKII 2031
               S  +  +                      LVCI DDNK              PPKI+
Sbjct: 831  PKPSSGDPLVNTQKNNNDTSREAQIDNQTSQALVCIDDDNKVEETNVKEEETDVYPPKIV 890

Query: 2032 AMHRVRWNMNKGSERLLCYGGASGIVRCQLI 2124
            AMHRVRWNMNKGSERL+CYGGA+GI+RCQ I
Sbjct: 891  AMHRVRWNMNKGSERLVCYGGAAGILRCQKI 921



 Score =  122 bits (306), Expect = 9e-25
 Identities = 91/230 (39%), Positives = 109/230 (47%), Gaps = 8/230 (3%)
 Frame = +3

Query: 3   DWCPRVHQSSDYDCNVEFIAVSAHPPESSYHIIGAPLTGRGLIQIWCVLNTGVKDQDVTS 182
           DWCP VHQSS +D NVEFIAVSAHPPES+YH IGAPLTGRGLIQIWCVLNTG   +  T 
Sbjct: 216 DWCPGVHQSSTFDTNVEFIAVSAHPPESTYHKIGAPLTGRGLIQIWCVLNTGQSQEGETR 275

Query: 183 LVKFKPRKYTKSKEAKKPKENQXXXXXXXXXXXXLSEIDGIDQLLQDISVQSSENSNNLL 362
           LVK K RK +   +  K    +              E D  D        QS ++ NNLL
Sbjct: 276 LVKVKKRKSSTITDPTKSTRPRGRPRKTPR-----KETDHGD--------QSPKSRNNLL 322

Query: 363 QLVVKADTN------XXXXXXXXXXXXXXXQSVNNVD--NXXXXXXXXXXXXXXNQSPDL 518
           QL  + +T+                     +S NN D  N                SP+ 
Sbjct: 323 QLFSETETDEKFKTQKSPKPRGRPRKKPIKESSNNFDDSNNNMQLLTESTSNSPKTSPEP 382

Query: 519 LDVYKNDPFIPETVTKEDGVSNKLHEHVTKKEYIVYERRPKSYKKRRDIK 668
           L V   D F   T     G+  K    VTK+   VY RRPK ++K +  K
Sbjct: 383 LAVKFPDNFTLLT----PGILAK--AQVTKEVTNVYTRRPKKHRKDQSPK 426


>ref|XP_023756470.1| general transcription factor 3C polypeptide 2 [Lactuca sativa]
 ref|XP_023756471.1| general transcription factor 3C polypeptide 2 [Lactuca sativa]
 ref|XP_023756472.1| general transcription factor 3C polypeptide 2 [Lactuca sativa]
 ref|XP_023756473.1| general transcription factor 3C polypeptide 2 [Lactuca sativa]
 ref|XP_023756474.1| general transcription factor 3C polypeptide 2 [Lactuca sativa]
 gb|PLY90892.1| hypothetical protein LSAT_1X48480 [Lactuca sativa]
          Length = 735

 Score =  587 bits (1513), Expect = 0.0
 Identities = 294/458 (64%), Positives = 341/458 (74%), Gaps = 7/458 (1%)
 Frame = +1

Query: 772  EPVQNIDLISEDVALPRLVMGLAHNGKVAWDVKWKPIDSHNISKYRMGYLAVLLGSGALE 951
            E V N + +S+D+ALPRLVMGLAHNGKVAWDVKW+P DSH+IS YRMGYLAVLLG+GALE
Sbjct: 277  ESVPNSNSVSKDIALPRLVMGLAHNGKVAWDVKWRP-DSHDISSYRMGYLAVLLGNGALE 335

Query: 952  VWEIPLPRATKAIFSSCETEGADPRFVKLKPVFMCSMLKCGDIKSIPITLEWSTSAPHDL 1131
            VWE+P+  ATKA+FSSC+ EG DPRF+KLKPVFMCS LKCGD +SIP+TLEWSTSAPHDL
Sbjct: 336  VWEVPVLSATKALFSSCQKEGTDPRFLKLKPVFMCSKLKCGDRQSIPLTLEWSTSAPHDL 395

Query: 1132 ILAGCHDGMVALWKFSSNGPSKDTRPLICFSADTVPIRALKWAPRASDLESANIIVTSGH 1311
            ILAGCHDG+VALWKFS+N  S DT+PL+CFSADTVPIRALKWAP  SD ESANII TSGH
Sbjct: 396  ILAGCHDGVVALWKFSTNDSSIDTKPLLCFSADTVPIRALKWAPLPSDPESANIIATSGH 455

Query: 1312 KGLKFWDLRDPFRPLWDIPTQKIIYSLDWLPDPSCVILSFDDGEIRIMSLLRAASDVPIT 1491
            KG++FWD+RDP+ PLWDIP QKI YSLDW PDP CVILS DDGEI+I++L +A SD P+T
Sbjct: 456  KGVRFWDIRDPYHPLWDIPLQKITYSLDWHPDPRCVILSSDDGEIKIINLSKAVSDTPVT 515

Query: 1492 GMPCEKTQ-QXXXXXXXXXXXXXXXXXXXXTGMVAYCCSDGKVIHFQLTARALGKDHHRN 1668
              P  KTQ                      T MVAYCCSDGKVIHFQLT +++ +D +RN
Sbjct: 516  ATPTVKTQHHGSHSYYCSSSSIWCVHVSRLTDMVAYCCSDGKVIHFQLTTKSVERDPNRN 575

Query: 1669 REPHYLCGSLAVEE--STLTVFSPLPNVPFTLKRSLNEWGNTPRSKRGFLSHCNQEKRAK 1842
            REPHYLCGS+  EE  STLTV +P PN+PF +K+S NEWG+TPRSKRGF+S  NQEKRAK
Sbjct: 576  REPHYLCGSVTKEEEKSTLTVLTPSPNIPFPMKKSSNEWGDTPRSKRGFVSITNQEKRAK 635

Query: 1843 KEISKCQITGNSSDSEGTMVXXXXXXXXXXXXXXXXLVCIDD----NKSXXXXXXXXXXX 2010
            + +SK      +  S+                    LVCIDD     K            
Sbjct: 636  EYVSKENPKNKNKSSQA-------------------LVCIDDVANNVKDHMAHKEEDERE 676

Query: 2011 TLPPKIIAMHRVRWNMNKGSERLLCYGGASGIVRCQLI 2124
            TLPPKI+AMHRVRWNMNKGSE+ LCYGGA+GI+R Q I
Sbjct: 677  TLPPKIVAMHRVRWNMNKGSEKWLCYGGAAGILRFQEI 714



 Score =  112 bits (280), Expect = 9e-22
 Identities = 51/81 (62%), Positives = 65/81 (80%), Gaps = 1/81 (1%)
 Frame = +3

Query: 3   DWCPRVHQSSDYDCNVEFIAVSAHPPESSYHIIGAPLTGRGLIQIWCVLNTGVKDQDVTS 182
           DWCP +HQ+S+ D NVEF+AV+AHPPESSYH IGAPLTGRGLIQIWC+LN   K+QD+  
Sbjct: 127 DWCPILHQNSNTDINVEFVAVAAHPPESSYHKIGAPLTGRGLIQIWCLLNAYTKEQDMIP 186

Query: 183 LVKFKPRKYTKSK-EAKKPKE 242
           LVK KP++  ++  E  +PK+
Sbjct: 187 LVKVKPKRNEETNLETNQPKK 207


>ref|XP_021986505.1| uncharacterized protein LOC110882925 isoform X1 [Helianthus annuus]
 ref|XP_021986511.1| uncharacterized protein LOC110882925 isoform X1 [Helianthus annuus]
          Length = 980

 Score =  594 bits (1531), Expect = 0.0
 Identities = 297/451 (65%), Positives = 338/451 (74%), Gaps = 8/451 (1%)
 Frame = +1

Query: 796  ISEDVALPRLVMGLAHNGKVAWDVKWKPIDSHNISKYRMGYLAVLLGSGALEVWEIPLPR 975
            I ED  LPRLVMGLAHNGKVAWDVKW+P D  ++SK+ MGYLAVLLG+GALEVWE+PLPR
Sbjct: 530  IDEDDVLPRLVMGLAHNGKVAWDVKWRPSDFRHVSKHVMGYLAVLLGNGALEVWEVPLPR 589

Query: 976  ATKAIFSSCETEGADPRFVKLKPVFMCSMLKCGDIKSIPITLEWSTSAPHDLILAGCHDG 1155
             TKAIFS C+ EG DPRF+KLKPVF+CSMLKCGD +SIP+TLEWSTSAPHDLILAGCHDG
Sbjct: 590  VTKAIFS-CQKEGKDPRFLKLKPVFLCSMLKCGDRQSIPLTLEWSTSAPHDLILAGCHDG 648

Query: 1156 MVALWKFSSNGPSKDTRPLICFSADTVPIRALKWAPRASDLESANIIVTSGHKGLKFWDL 1335
            +VALWKFS++  SKDTRPL+CFSADTVPIRAL WAP ASD ESANII T  HKGLKFWD+
Sbjct: 649  VVALWKFSASSSSKDTRPLLCFSADTVPIRALTWAPLASDPESANIIATGSHKGLKFWDI 708

Query: 1336 RDPFRPLWDIPTQKIIYSLDWLPDPSCVILSFDDGEIRIMSLLRAASDVPITGMPCEKTQ 1515
            RDPF PLWDIP QK++ SL+WLPDP CV+LSFDDGEI+I+SLL+AASDVP+TGMPC+K  
Sbjct: 709  RDPFHPLWDIPYQKVVNSLEWLPDPRCVVLSFDDGEIKIISLLKAASDVPVTGMPCDKKP 768

Query: 1516 -QXXXXXXXXXXXXXXXXXXXXTGMVAYCCSDGKVIHFQLTARALGKDHHRNREPHYLCG 1692
                                  TGMVAYCCSDGKVI+FQLT +A+ KD HRNREPHYLCG
Sbjct: 769  LHGSYSYYCSSSSIWSVQVSSLTGMVAYCCSDGKVINFQLTIKAVEKDPHRNREPHYLCG 828

Query: 1693 SLAVEE-STLTVFSPLPNVPFTLKRSLNEWGNTPRSKRGFLSHCNQEKRAKKEISKCQIT 1869
            SL  EE S+LTV SPLP+VP  +KRS  EWG+TP++ RG  S  NQEKRAK +ISK Q  
Sbjct: 829  SLTEEEDSSLTVLSPLPDVPIPMKRSSTEWGDTPKTSRGVKSRSNQEKRAKGQISKFQTP 888

Query: 1870 GNSSDSEGTM-----VXXXXXXXXXXXXXXXXLVCI-DDNKSXXXXXXXXXXXTLPPKII 2031
               S  +  +                      LVCI DDNK              PPKI+
Sbjct: 889  PKPSSGDPLVNTQKNNNDTSREAQIDNQTSQALVCIDDDNKVEETNVKEEETDVYPPKIV 948

Query: 2032 AMHRVRWNMNKGSERLLCYGGASGIVRCQLI 2124
            AMHRVRWNMNKGSERL+CYGGA+GI+RCQ I
Sbjct: 949  AMHRVRWNMNKGSERLVCYGGAAGILRCQKI 979



 Score =  122 bits (306), Expect = 9e-25
 Identities = 91/230 (39%), Positives = 109/230 (47%), Gaps = 8/230 (3%)
 Frame = +3

Query: 3   DWCPRVHQSSDYDCNVEFIAVSAHPPESSYHIIGAPLTGRGLIQIWCVLNTGVKDQDVTS 182
           DWCP VHQSS +D NVEFIAVSAHPPES+YH IGAPLTGRGLIQIWCVLNTG   +  T 
Sbjct: 274 DWCPGVHQSSTFDTNVEFIAVSAHPPESTYHKIGAPLTGRGLIQIWCVLNTGQSQEGETR 333

Query: 183 LVKFKPRKYTKSKEAKKPKENQXXXXXXXXXXXXLSEIDGIDQLLQDISVQSSENSNNLL 362
           LVK K RK +   +  K    +              E D  D        QS ++ NNLL
Sbjct: 334 LVKVKKRKSSTITDPTKSTRPRGRPRKTPR-----KETDHGD--------QSPKSRNNLL 380

Query: 363 QLVVKADTN------XXXXXXXXXXXXXXXQSVNNVD--NXXXXXXXXXXXXXXNQSPDL 518
           QL  + +T+                     +S NN D  N                SP+ 
Sbjct: 381 QLFSETETDEKFKTQKSPKPRGRPRKKPIKESSNNFDDSNNNMQLLTESTSNSPKTSPEP 440

Query: 519 LDVYKNDPFIPETVTKEDGVSNKLHEHVTKKEYIVYERRPKSYKKRRDIK 668
           L V   D F   T     G+  K    VTK+   VY RRPK ++K +  K
Sbjct: 441 LAVKFPDNFTLLT----PGILAK--AQVTKEVTNVYTRRPKKHRKDQSPK 484


>gb|KVI07947.1| AT hook, DNA-binding motif-containing protein [Cynara cardunculus
            var. scolymus]
          Length = 1062

 Score =  541 bits (1394), Expect = e-175
 Identities = 279/487 (57%), Positives = 341/487 (70%), Gaps = 33/487 (6%)
 Frame = +1

Query: 763  LVLEPVQNIDLISEDVALPRLVMGLAHNGKVAWDVKWKPIDSHNISKYRMGYLAVLLGSG 942
            ++LE   +   I EDVALPRLV+ LAHNGKVAWDVKW+P D++  SK+RMGYLAVLLG+G
Sbjct: 575  VLLETDMDSRCIPEDVALPRLVLCLAHNGKVAWDVKWRPSDTYFNSKHRMGYLAVLLGNG 634

Query: 943  ALEVWEIPLPRATKAIFSSCETEGADPRFVKLKPVFMCSMLKCGDIKSIPITLEWSTSAP 1122
            ALEVWE+P P A + +FS+C  EG DPRF+KL+PVF CSMLKCGD +SIP+TLEWSTS+P
Sbjct: 635  ALEVWEVPAPHAVEVMFSACRKEGTDPRFIKLEPVFRCSMLKCGDRQSIPLTLEWSTSSP 694

Query: 1123 HDLILAGCHDGMVALWKFSSNGPSKDTRPLICFSADTVPIRALKWAPRASDLESANIIVT 1302
            HDLILAGCHDG+VALWKFS++GP KDTRPL+ F+ADTVPIRAL WAP  SD ESANIIVT
Sbjct: 695  HDLILAGCHDGVVALWKFSADGPLKDTRPLLRFTADTVPIRALAWAPVPSDSESANIIVT 754

Query: 1303 SGHKGLKFWDLRDPFRPLWDI-PTQKIIYSLDWLPDPSCVILSFDDGEIRIMSLLRAASD 1479
            +GHKG KFWDLRDPFRPLWD+ P Q+IIY LDW PDP CV+LSFDDGEI+I+SL +AA D
Sbjct: 755  AGHKGAKFWDLRDPFRPLWDVNPAQRIIYGLDWHPDPRCVVLSFDDGEIQIISLSKAACD 814

Query: 1480 VPITGMPCEKTQQ-XXXXXXXXXXXXXXXXXXXXTGMVAYCCSDGKVIHFQLTARALGKD 1656
            VP+TG P    Q+                     TGMVAYCCSDGKV+HFQLT +A+ KD
Sbjct: 815  VPVTGAPFVAAQRHASHSYHCSSSSIWSVQVSRLTGMVAYCCSDGKVVHFQLTMKAVEKD 874

Query: 1657 HHRNREPHYLCGSLAVEESTLTVFSPLPNVPFTLKRSLNEWGNTPRSKRGFLSHCNQEKR 1836
              RNREPHYLCG++++EES LT+ SPLP+VPF +K+S  EWG+TPR++RG+ S  NQEKR
Sbjct: 875  PSRNREPHYLCGAMSMEESGLTILSPLPDVPFLMKKSSKEWGDTPRTRRGYRSLSNQEKR 934

Query: 1837 AKKEISK-CQ------ITGNS-SDSEGTMVXXXXXXXXXXXXXXXXLVCIDDNK------ 1974
            AK+++ K CQ        GNS S+++ +                  +V   D++      
Sbjct: 935  AKEQMLKECQQPLAVCYDGNSDSETQQSSSSKKGKRDDEEEELPSKIVGKRDDEDEDEEE 994

Query: 1975 ---SXXXXXXXXXXXTLPPK--------------IIAMHRVRWNMNKGSERLLCYGGASG 2103
               +            LP K              I+ M+ VRWN NKGSER LCYGGA+G
Sbjct: 995  QELASKIVGRREDEEELPSKIVGKRDDEEELPSKIVGMYGVRWNTNKGSERWLCYGGAAG 1054

Query: 2104 IVRCQLI 2124
            I+RCQ I
Sbjct: 1055 ILRCQHI 1061



 Score =  113 bits (283), Expect = 6e-22
 Identities = 53/82 (64%), Positives = 62/82 (75%)
 Frame = +3

Query: 3   DWCPRVHQSSDYDCNVEFIAVSAHPPESSYHIIGAPLTGRGLIQIWCVLNTGVKDQDVTS 182
           DWCPRVH+  D D N+EFIAV+AHPPESSYH IGAPLTGRG+IQIW +LN G+KD DV  
Sbjct: 181 DWCPRVHERPDCDINLEFIAVAAHPPESSYHKIGAPLTGRGVIQIWGLLNRGLKDNDVIP 240

Query: 183 LVKFKPRKYTKSKEAKKPKENQ 248
            VK K +  + S +A KPK  Q
Sbjct: 241 HVKRKSKTNSSSNKATKPKSTQ 262


>ref|XP_023920540.1| uncharacterized protein LOC112032069 isoform X2 [Quercus suber]
          Length = 733

 Score =  503 bits (1294), Expect = e-164
 Identities = 262/472 (55%), Positives = 317/472 (67%), Gaps = 28/472 (5%)
 Frame = +1

Query: 793  LISEDVALPRLVMGLAHNGKVAWDVKWKPIDS-HNISKYRMGYLAVLLGSGALEVWEIPL 969
            LIS+DVALPR+V+ LAHNGKVAWDVKW+P ++  +  K+RMGYLAVLLG+G+LEVWE+PL
Sbjct: 249  LISKDVALPRVVLCLAHNGKVAWDVKWRPSNACQSKCKHRMGYLAVLLGNGSLEVWEVPL 308

Query: 970  PRATKAIFSSCETEGADPRFVKLKPVFMCSMLKCGDIKSIPITLEWSTSAPHDLILAGCH 1149
            PR  K I+SS   EG DPRFVKL+PVF  S+LKCG I+SIP+T+EWS S PHD +LAGCH
Sbjct: 309  PRTMKVIYSSVHQEGTDPRFVKLEPVFRGSLLKCGGIQSIPLTVEWSASPPHDYLLAGCH 368

Query: 1150 DGMVALWKFSSNGPSKDTRPLICFSADTVPIRALKWAPRASDLESANIIVTSGHKGLKFW 1329
            DG VALWKFS++  S+DTRPL+CFSADTVPIRAL WAP  SD ESAN+IVT+GH GLKFW
Sbjct: 369  DGTVALWKFSASCSSEDTRPLLCFSADTVPIRALAWAPLESDPESANVIVTAGHGGLKFW 428

Query: 1330 DLRDPFRPLWDI-PTQKIIYSLDWLPDPSCVILSFDDGEIRIMSLLRAASDVPITGMPCE 1506
            DLRDP+RPLWD+ P  +IIYSLDWL +P CVILSFDDG +RI+SLL+AA DVP+TG P  
Sbjct: 429  DLRDPYRPLWDLHPVPRIIYSLDWLSNPRCVILSFDDGTMRILSLLKAAYDVPVTGKPFG 488

Query: 1507 KT-QQXXXXXXXXXXXXXXXXXXXXTGMVAYCCSDGKVIHFQLTARALGKDHHRNREPHY 1683
             T QQ                    TGM AYC +DG V+ FQLT++A+ KD  RNR PH+
Sbjct: 489  GTKQQGLHSYYCSSFAIWSVQVSRITGMAAYCTADGTVLRFQLTSKAVDKDPSRNRTPHF 548

Query: 1684 LCGSLAVEESTLTVFSPLPNVPFTLKRSLNEWGNTPRSKRGFLSHCNQEKRAKKEISKCQ 1863
            LCGSL  EES +T+ +P+PN PF LK+SLN+ G+TP S R F S     KRA  +++K  
Sbjct: 549  LCGSLTEEESLITINTPVPNTPFPLKKSLNKGGDTPLSMREFSSEPQHVKRANDKMAKSP 608

Query: 1864 IT-------------GNSSDSEGTMV-------XXXXXXXXXXXXXXXXLVCIDD----- 1968
             T             G  S +E  +                        LVC D+     
Sbjct: 609  STDATTLALCYGDDPGTESGTEEALTRPKSKKRPNSRSSNKKNPEDDLALVCRDEEPPNT 668

Query: 1969 NKSXXXXXXXXXXXTLPPKIIAMHRVRWNMNKGSERLLCYGGASGIVRCQLI 2124
             +              PPKI+AM RVRWNMNKGSER LCYGG +G+VRCQ I
Sbjct: 669  QEKENGKAEARTIEVFPPKIVAMRRVRWNMNKGSERWLCYGGEAGVVRCQEI 720



 Score = 73.6 bits (179), Expect(2) = 1e-09
 Identities = 33/60 (55%), Positives = 47/60 (78%)
 Frame = +3

Query: 48  VEFIAVSAHPPESSYHIIGAPLTGRGLIQIWCVLNTGVKDQDVTSLVKFKPRKYTKSKEA 227
           + FIAV+AHPP +SYH +GAPLTGRG+IQIWC++N GV +++V   +  KP++ TK+  A
Sbjct: 23  INFIAVAAHPPGTSYHKMGAPLTGRGVIQIWCLMNVGVNEEEVPPSLA-KPKQGTKNNGA 81



 Score = 20.4 bits (41), Expect(2) = 1e-09
 Identities = 7/9 (77%), Positives = 8/9 (88%)
 Frame = +1

Query: 1  WIGVPEFIK 27
          WIGV EF+K
Sbjct: 9  WIGVLEFVK 17


>gb|OMO89342.1| hypothetical protein CCACVL1_07901 [Corchorus capsularis]
          Length = 983

 Score =  508 bits (1309), Expect = e-163
 Identities = 260/490 (53%), Positives = 322/490 (65%), Gaps = 29/490 (5%)
 Frame = +1

Query: 751  LQQNLVLEPVQNIDLISEDVALPRLVMGLAHNGKVAWDVKWKPIDSHNISK--YRMGYLA 924
            +Q +  LE       I  D+ALPR V+ LAHNGKVAWDVKW+P D  N+SK   RMGYLA
Sbjct: 485  IQDSNSLEVGPGSSSIPADMALPRAVLCLAHNGKVAWDVKWRPYDI-NVSKCNQRMGYLA 543

Query: 925  VLLGSGALEVWEIPLPRATKAIFSSCETEGADPRFVKLKPVFMCSMLKCGDIKSIPITLE 1104
            VLLG+G+LEVWE+PLP   + ++SS   +G DPRFVKL+PVF CS LKCGDI+SIP+T+E
Sbjct: 544  VLLGNGSLEVWEVPLPHMVRTVYSSSAKQGTDPRFVKLEPVFKCSKLKCGDIQSIPLTVE 603

Query: 1105 WSTSAPHDLILAGCHDGMVALWKFSSNGPSKDTRPLICFSADTVPIRALKWAPRASDLES 1284
            WSTS PHD +LAGCHDGMVALWKFS++   KDTRPL+CFSADTVPIR++ WAP  SD+ES
Sbjct: 604  WSTSPPHDYLLAGCHDGMVALWKFSASASPKDTRPLLCFSADTVPIRSVAWAPSGSDMES 663

Query: 1285 ANIIVTSGHKGLKFWDLRDPFRPLWDI-PTQKIIYSLDWLPDPSCVILSFDDGEIRIMSL 1461
             N+I+T+GH GLKFWD+RDPF PLWD+ P  K IYSLDWLP+P CVILSFDDG ++++SL
Sbjct: 664  TNVILTAGHGGLKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDDGTMKLLSL 723

Query: 1462 LRAASDVPITGMPCEKT-QQXXXXXXXXXXXXXXXXXXXXTGMVAYCCSDGKVIHFQLTA 1638
             +A SDVP+TG P   T QQ                    TGMVAYC +DG V HFQLT+
Sbjct: 724  SQAVSDVPVTGKPFTGTKQQGLHLYNCSSFAIWHIQVSRLTGMVAYCGADGTVSHFQLTS 783

Query: 1639 RALGKDHHRNREPHYLCGSLAVEESTLTVFSPLPNVPFTLKRSLNEWGNTPRSKRGFLSH 1818
            +A+ KD  RNR PH+LCGSL  EES + + +PLP++P T+K+S  ++G  PRS R FL+ 
Sbjct: 784  KAVDKDFSRNRAPHFLCGSLTEEESAIIINTPLPDIPLTMKKSTGDYGEGPRSMRAFLTE 843

Query: 1819 CNQEKRAKKEISKCQI-----------------TGNSSDSEGTMV--------XXXXXXX 1923
             NQ K AK + +K Q                   G  SDSE T+                
Sbjct: 844  TNQAKNAKDKKAKVQTCDKQTLALCYGDDPDPDPGVESDSEETLAALKCKKKQKSQSERN 903

Query: 1924 XXXXXXXXXLVCIDDNKSXXXXXXXXXXXTLPPKIIAMHRVRWNMNKGSERLLCYGGASG 2103
                      + I++  +             P K++AMHRVRWNMNKGSER LCYGGA+G
Sbjct: 904  KKADNDQALAIRIEEATNTQKEETGNEIEVFPGKMVAMHRVRWNMNKGSERWLCYGGAAG 963

Query: 2104 IVRCQLIR*P 2133
            IVRCQ I+ P
Sbjct: 964  IVRCQEIKVP 973



 Score = 89.4 bits (220), Expect = 2e-14
 Identities = 40/75 (53%), Positives = 54/75 (72%)
 Frame = +3

Query: 3   DWCPRVHQSSDYDCNVEFIAVSAHPPESSYHIIGAPLTGRGLIQIWCVLNTGVKDQDVTS 182
           DWCPRVH++       EFIAV+AHPPES YH +G P+TGRG++QIWC+LN GV  ++   
Sbjct: 130 DWCPRVHENPSSHVKCEFIAVAAHPPESYYHKMGTPVTGRGIVQIWCMLNVGVNVEE-PL 188

Query: 183 LVKFKPRKYTKSKEA 227
           L K KP + +++ EA
Sbjct: 189 LSKKKPNQRSQNTEA 203


>ref|XP_021300958.1| uncharacterized protein LOC110429313 [Herrania umbratica]
          Length = 856

 Score =  504 bits (1298), Expect = e-163
 Identities = 257/497 (51%), Positives = 325/497 (65%), Gaps = 26/497 (5%)
 Frame = +1

Query: 751  LQQNLVLEPVQNIDLISEDVALPRLVMGLAHNGKVAWDVKWKPIDSHNIS-KYRMGYLAV 927
            L  NL+  P  +I     D+ LPR V+ LAHNGKVAWDVKW+P D +     +RMGYLAV
Sbjct: 363  LDSNLLETPGSSIP---RDIELPRTVLCLAHNGKVAWDVKWQPYDINGCECNHRMGYLAV 419

Query: 928  LLGSGALEVWEIPLPRATKAIFSSCETEGADPRFVKLKPVFMCSMLKCGDIKSIPITLEW 1107
            LLG+G+LEVWE+PLP   + ++SS   +G DPRFVKL+PVF CS LKCGD++SIP+T+EW
Sbjct: 420  LLGNGSLEVWEVPLPSMIRIVYSSSPKQGTDPRFVKLEPVFKCSKLKCGDVQSIPLTVEW 479

Query: 1108 STSAPHDLILAGCHDGMVALWKFSSNGPSKDTRPLICFSADTVPIRALKWAPRASDLESA 1287
            STS PH+ +LAGCHDG VALWKFS++G   DTRPL+CFSADTVPIR++ WAP  SD+ESA
Sbjct: 480  STSPPHNYLLAGCHDGKVALWKFSASGSPTDTRPLLCFSADTVPIRSVAWAPSGSDMESA 539

Query: 1288 NIIVTSGHKGLKFWDLRDPFRPLWDI-PTQKIIYSLDWLPDPSCVILSFDDGEIRIMSLL 1464
            N+++T+GH GLKFWD+RDPF PLWD+ P  K IYSLDWLP+P CVILSFDDG ++++SL+
Sbjct: 540  NVVLTAGHGGLKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDDGTMKMLSLI 599

Query: 1465 RAASDVPITGMPCEKT-QQXXXXXXXXXXXXXXXXXXXXTGMVAYCCSDGKVIHFQLTAR 1641
            +AA DVP+TG P   T QQ                    TGMVAYC +DG V  FQLT++
Sbjct: 600  QAACDVPVTGKPFTGTKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGADGNVTRFQLTSK 659

Query: 1642 ALGKDHHRNREPHYLCGSLAVEESTLTVFSPLPNVPFTLKRSLNEWGNTPRSKRGFLSHC 1821
            A+ KD  RNR PH++CGSL  EES + V +PLP++P TLK+  N++G  PRS R FL+  
Sbjct: 660  AVDKDFSRNRAPHFVCGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGEGPRSMRAFLTES 719

Query: 1822 NQEKRAKKEISKCQI-------------TGNSSDSEGTMV----------XXXXXXXXXX 1932
            NQ K AK + +K                 G  S+SE T+                     
Sbjct: 720  NQAKNAKDKKAKVPTPDKRTFALCYGNDRGVESESEETLTLAALKGKIKQKSKSDRTKKA 779

Query: 1933 XXXXXXLVCIDDNKSXXXXXXXXXXXTLPPKIIAMHRVRWNMNKGSERLLCYGGASGIVR 2112
                   V I++ ++             PPKI+AMHRVRWNMNKGSER LCYGGA+GIVR
Sbjct: 780  GDDQALAVRINEPRNTQKEEAGYEIEVFPPKIVAMHRVRWNMNKGSERWLCYGGAAGIVR 839

Query: 2113 CQLIR*PRLFTK*CKRS 2163
            CQ I  P +  K  ++S
Sbjct: 840  CQEIIVPDVAKKSARKS 856



 Score = 92.0 bits (227), Expect = 2e-15
 Identities = 42/75 (56%), Positives = 56/75 (74%)
 Frame = +3

Query: 3   DWCPRVHQSSDYDCNVEFIAVSAHPPESSYHIIGAPLTGRGLIQIWCVLNTGVKDQDVTS 182
           DWCPRVH++ +     EFIAV+AHP +S YH IG PLTGRG+IQIWC+LN GVK+++   
Sbjct: 130 DWCPRVHENPNSTVKCEFIAVAAHPADSYYHKIGTPLTGRGIIQIWCMLNVGVKEEE-AP 188

Query: 183 LVKFKPRKYTKSKEA 227
           L K KP+  +++ EA
Sbjct: 189 LSKKKPKWRSQNTEA 203


>ref|XP_010242589.1| PREDICTED: uncharacterized protein LOC104586906 isoform X2 [Nelumbo
            nucifera]
          Length = 882

 Score =  505 bits (1300), Expect = e-163
 Identities = 257/474 (54%), Positives = 315/474 (66%), Gaps = 16/474 (3%)
 Frame = +1

Query: 745  LPLQQNLVLEPVQNIDLISEDVALPRLVMGLAHNGKVAWDVKWKPIDSHNISKYRMGYLA 924
            L   +N     + N   + +DV LPR+V+ LAHNGKVAWDVKW+P++     K  MGYLA
Sbjct: 392  LASDKNTTNNGLGNNSHLPKDVTLPRVVLCLAHNGKVAWDVKWRPLNDSGY-KNSMGYLA 450

Query: 925  VLLGSGALEVWEIPLPRATKAIFSSCETEGADPRFVKLKPVFMCSMLKCGDIKSIPITLE 1104
            VLLG+G+LEVW++PLP   K ++SSC  +G DPRFVKL+PVF CS LKCGD +SIP+T+E
Sbjct: 451  VLLGNGSLEVWDVPLPNTIKVLYSSCRKDGTDPRFVKLEPVFRCSKLKCGDRQSIPLTME 510

Query: 1105 WSTSAPHDLILAGCHDGMVALWKFSSNGPSKDTRPLICFSADTVPIRALKWAPRASDLES 1284
            WS SAPHDLILAGCHDG VALWKF   G S+DTRPL+CFSADTVPIRAL WAP  SD E 
Sbjct: 511  WSPSAPHDLILAGCHDGTVALWKFFPGGSSQDTRPLLCFSADTVPIRALSWAPDESDAEG 570

Query: 1285 ANIIVTSGHKGLKFWDLRDPFRPLWDI-PTQKIIYSLDWLPDPSCVILSFDDGEIRIMSL 1461
            AN+IVT+GH  L+FWDLRDP+RPLW+I   ++++YSLDWL DP C+IL++DDG +RI+SL
Sbjct: 571  ANVIVTAGHGSLRFWDLRDPYRPLWEINSVRRVVYSLDWLLDPRCIILAYDDGTLRILSL 630

Query: 1462 LRAASDVPITGMPCEKT-QQXXXXXXXXXXXXXXXXXXXXTGMVAYCCSDGKVIHFQLTA 1638
             +AA DVP+TG P   T QQ                    TGMVAYC +DG V+HFQLTA
Sbjct: 631  SKAAYDVPVTGKPFSGTQQQGLHSYYCSSFTIWSVHVSRLTGMVAYCNADGTVLHFQLTA 690

Query: 1639 RALGKDHHRNREPHYLCGSLAVEESTLTVFSPLPNVPFTLKRSLNEWGNTPRSKRGFLSH 1818
            +A+ KD  RN+ PH+LCGSL  ++STL+V +PLP  PF +K+SLNEWG+TPRS RG LS 
Sbjct: 691  KAVDKDPSRNKTPHFLCGSLTEDDSTLSVNTPLPCTPFPMKKSLNEWGDTPRSIRGILSG 750

Query: 1819 CNQEKRAKKEI-SKCQIT------GNSSDSEGTMVXXXXXXXXXXXXXXXXLVCIDDNK- 1974
             NQ K+A  E+ + C         G  +                       L C  + + 
Sbjct: 751  SNQAKKANDEVLALCYGDDPEPGFGYDNSPANPNRRTQKPNTCKKKKLGSDLACSAEEEL 810

Query: 1975 ------SXXXXXXXXXXXTLPPKIIAMHRVRWNMNKGSERLLCYGGASGIVRCQ 2118
                                PPKIIAMHRVRWNMNKGS RLLCYGGA+GIVRCQ
Sbjct: 811  GNLQRGGNEKSAAMSEIEIFPPKIIAMHRVRWNMNKGSGRLLCYGGAAGIVRCQ 864



 Score =  103 bits (257), Expect = 6e-19
 Identities = 50/85 (58%), Positives = 60/85 (70%), Gaps = 6/85 (7%)
 Frame = +3

Query: 3   DWCPRVHQSSDYDCNVEFIAVSAHPPESSYHIIGAPLTGRGLIQIWCVLNTGVKDQDVTS 182
           DWCPR+H+SSD   N E++AV+AHPPE+SYH IG PLTGRG+IQIWC+LN  VKD+ VT 
Sbjct: 175 DWCPRLHRSSDCHINCEYLAVAAHPPEASYHKIGVPLTGRGVIQIWCILNQNVKDEVVTP 234

Query: 183 LVKFKPRK------YTKSKEAKKPK 239
           L K K R         +S   KKPK
Sbjct: 235 LNKAKGRPGKPNVLKDESSALKKPK 259


>ref|XP_022765155.1| uncharacterized protein LOC111310200 isoform X7 [Durio zibethinus]
 ref|XP_022765156.1| uncharacterized protein LOC111310200 isoform X7 [Durio zibethinus]
 ref|XP_022765157.1| uncharacterized protein LOC111310200 isoform X7 [Durio zibethinus]
          Length = 646

 Score =  497 bits (1279), Expect = e-163
 Identities = 256/479 (53%), Positives = 318/479 (66%), Gaps = 23/479 (4%)
 Frame = +1

Query: 796  ISEDVALPRLVMGLAHNGKVAWDVKWKPIDSHNISKY--RMGYLAVLLGSGALEVWEIPL 969
            I  D+ALPR V+ LAHNGKVAWDVKWKP D  N SK+  RMGYLAVLLG+G+LEVWE+PL
Sbjct: 169  IPGDIALPRAVLCLAHNGKVAWDVKWKPYDI-NDSKFNQRMGYLAVLLGNGSLEVWEVPL 227

Query: 970  PRATKAIFSSCETEGADPRFVKLKPVFMCSMLKCGDIKSIPITLEWSTSAPHDLILAGCH 1149
            P   + ++S    +G DPRFVKL+PV  CS LKCGDI+SIP+T+EWSTS+PHD +LAGCH
Sbjct: 228  PNMIRNVYSLSPKQGTDPRFVKLEPVLKCSKLKCGDIQSIPLTVEWSTSSPHDYLLAGCH 287

Query: 1150 DGMVALWKFSSNGPSKDTRPLICFSADTVPIRALKWAPRASDLESANIIVTSGHKGLKFW 1329
            DGMVALWKFS++   KDTRPL+CFSAD+VPIR++ WAP  SD+ES N+I+T+GH GLKFW
Sbjct: 288  DGMVALWKFSASDTPKDTRPLLCFSADSVPIRSVAWAPSGSDMESRNVILTAGHGGLKFW 347

Query: 1330 DLRDPFRPLWDI-PTQKIIYSLDWLPDPSCVILSFDDGEIRIMSLLRAASDVPITGMPCE 1506
            D+RDPF PLWD+ P  K IYSLDWLP+P CVILSFDDG ++++SL+ AA DVP+TG P  
Sbjct: 348  DIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDDGTMKMLSLVHAACDVPVTGKPFT 407

Query: 1507 KT-QQXXXXXXXXXXXXXXXXXXXXTGMVAYCCSDGKVIHFQLTARALGKDHHRNREPHY 1683
             T QQ                    TGMVAYC +DG V  FQLT++A+ KD  R+R PH+
Sbjct: 408  GTKQQGLHLYNCSSFAIWSVQVSRLTGMVAYCGADGTVTRFQLTSKAVDKDFSRHRTPHF 467

Query: 1684 LCGSLAVEESTLTVFSPLPNVPFTLKRSLNEWGNTPRSKRGFLSHCNQEKRAKKEISKCQ 1863
             CGSL  EES + V +PLP++P TLK+  N++G  PRS R FL+  NQ K AK   +K  
Sbjct: 468  PCGSLTEEESAVIVNTPLPDIPVTLKKPSNDYGEGPRSMRAFLTESNQGKNAKDRKAKVA 527

Query: 1864 IT-------------GNSSDSEGTMV------XXXXXXXXXXXXXXXXLVCIDDNKSXXX 1986
             +             G  S+SE T+                        + I++  +   
Sbjct: 528  ASNKQTLALYYGNDPGVESESEETLAALQSKRKQKSNGKKKADDDQVLAIRIEEPTNTQK 587

Query: 1987 XXXXXXXXTLPPKIIAMHRVRWNMNKGSERLLCYGGASGIVRCQLIR*PRLFTK*CKRS 2163
                      PPKI+AMHRVRWNMNKGSER LCYGGA+GIVRCQ I  P +  K  ++S
Sbjct: 588  EETGNEIEVFPPKIVAMHRVRWNMNKGSERWLCYGGAAGIVRCQEIIVPDVDKKSARKS 646



 Score = 75.5 bits (184), Expect = 3e-10
 Identities = 35/60 (58%), Positives = 46/60 (76%)
 Frame = +3

Query: 48  VEFIAVSAHPPESSYHIIGAPLTGRGLIQIWCVLNTGVKDQDVTSLVKFKPRKYTKSKEA 227
           ++FIAV+ HPPES YH +G PLTGRG+IQIWCVLN GV +++   L K KP++  +S EA
Sbjct: 9   IQFIAVATHPPESYYHKMGTPLTGRGIIQIWCVLNVGVNEEE-DPLSKKKPKRGFQSTEA 67


>ref|XP_019051477.1| PREDICTED: uncharacterized protein LOC104586906 isoform X1 [Nelumbo
            nucifera]
          Length = 891

 Score =  505 bits (1300), Expect = e-163
 Identities = 257/474 (54%), Positives = 315/474 (66%), Gaps = 16/474 (3%)
 Frame = +1

Query: 745  LPLQQNLVLEPVQNIDLISEDVALPRLVMGLAHNGKVAWDVKWKPIDSHNISKYRMGYLA 924
            L   +N     + N   + +DV LPR+V+ LAHNGKVAWDVKW+P++     K  MGYLA
Sbjct: 401  LASDKNTTNNGLGNNSHLPKDVTLPRVVLCLAHNGKVAWDVKWRPLNDSGY-KNSMGYLA 459

Query: 925  VLLGSGALEVWEIPLPRATKAIFSSCETEGADPRFVKLKPVFMCSMLKCGDIKSIPITLE 1104
            VLLG+G+LEVW++PLP   K ++SSC  +G DPRFVKL+PVF CS LKCGD +SIP+T+E
Sbjct: 460  VLLGNGSLEVWDVPLPNTIKVLYSSCRKDGTDPRFVKLEPVFRCSKLKCGDRQSIPLTME 519

Query: 1105 WSTSAPHDLILAGCHDGMVALWKFSSNGPSKDTRPLICFSADTVPIRALKWAPRASDLES 1284
            WS SAPHDLILAGCHDG VALWKF   G S+DTRPL+CFSADTVPIRAL WAP  SD E 
Sbjct: 520  WSPSAPHDLILAGCHDGTVALWKFFPGGSSQDTRPLLCFSADTVPIRALSWAPDESDAEG 579

Query: 1285 ANIIVTSGHKGLKFWDLRDPFRPLWDI-PTQKIIYSLDWLPDPSCVILSFDDGEIRIMSL 1461
            AN+IVT+GH  L+FWDLRDP+RPLW+I   ++++YSLDWL DP C+IL++DDG +RI+SL
Sbjct: 580  ANVIVTAGHGSLRFWDLRDPYRPLWEINSVRRVVYSLDWLLDPRCIILAYDDGTLRILSL 639

Query: 1462 LRAASDVPITGMPCEKT-QQXXXXXXXXXXXXXXXXXXXXTGMVAYCCSDGKVIHFQLTA 1638
             +AA DVP+TG P   T QQ                    TGMVAYC +DG V+HFQLTA
Sbjct: 640  SKAAYDVPVTGKPFSGTQQQGLHSYYCSSFTIWSVHVSRLTGMVAYCNADGTVLHFQLTA 699

Query: 1639 RALGKDHHRNREPHYLCGSLAVEESTLTVFSPLPNVPFTLKRSLNEWGNTPRSKRGFLSH 1818
            +A+ KD  RN+ PH+LCGSL  ++STL+V +PLP  PF +K+SLNEWG+TPRS RG LS 
Sbjct: 700  KAVDKDPSRNKTPHFLCGSLTEDDSTLSVNTPLPCTPFPMKKSLNEWGDTPRSIRGILSG 759

Query: 1819 CNQEKRAKKEI-SKCQIT------GNSSDSEGTMVXXXXXXXXXXXXXXXXLVCIDDNK- 1974
             NQ K+A  E+ + C         G  +                       L C  + + 
Sbjct: 760  SNQAKKANDEVLALCYGDDPEPGFGYDNSPANPNRRTQKPNTCKKKKLGSDLACSAEEEL 819

Query: 1975 ------SXXXXXXXXXXXTLPPKIIAMHRVRWNMNKGSERLLCYGGASGIVRCQ 2118
                                PPKIIAMHRVRWNMNKGS RLLCYGGA+GIVRCQ
Sbjct: 820  GNLQRGGNEKSAAMSEIEIFPPKIIAMHRVRWNMNKGSGRLLCYGGAAGIVRCQ 873



 Score =  103 bits (257), Expect = 7e-19
 Identities = 50/85 (58%), Positives = 60/85 (70%), Gaps = 6/85 (7%)
 Frame = +3

Query: 3   DWCPRVHQSSDYDCNVEFIAVSAHPPESSYHIIGAPLTGRGLIQIWCVLNTGVKDQDVTS 182
           DWCPR+H+SSD   N E++AV+AHPPE+SYH IG PLTGRG+IQIWC+LN  VKD+ VT 
Sbjct: 184 DWCPRLHRSSDCHINCEYLAVAAHPPEASYHKIGVPLTGRGVIQIWCILNQNVKDEVVTP 243

Query: 183 LVKFKPRK------YTKSKEAKKPK 239
           L K K R         +S   KKPK
Sbjct: 244 LNKAKGRPGKPNVLKDESSALKKPK 268


>gb|EOX93901.1| DNA binding protein, putative isoform 1 [Theobroma cacao]
          Length = 868

 Score =  504 bits (1298), Expect = e-163
 Identities = 254/482 (52%), Positives = 318/482 (65%), Gaps = 26/482 (5%)
 Frame = +1

Query: 796  ISEDVALPRLVMGLAHNGKVAWDVKWKPIDSHNIS-KYRMGYLAVLLGSGALEVWEIPLP 972
            I  D+ LPR V+ LAHNGKVAWDVKW+P D ++     RMGYLAVLLG+G+LEVWE+PLP
Sbjct: 387  IPRDIELPRTVLCLAHNGKVAWDVKWQPYDINDCECNQRMGYLAVLLGNGSLEVWEVPLP 446

Query: 973  RATKAIFSSCETEGADPRFVKLKPVFMCSMLKCGDIKSIPITLEWSTSAPHDLILAGCHD 1152
                 ++SS   +G DPRFVKL+PVF CS LKCGD++SIP+T+EWSTS PH+ +LAGCHD
Sbjct: 447  HMISIVYSSSPKQGTDPRFVKLEPVFKCSKLKCGDVQSIPLTVEWSTSPPHNYLLAGCHD 506

Query: 1153 GMVALWKFSSNGPSKDTRPLICFSADTVPIRALKWAPRASDLESANIIVTSGHKGLKFWD 1332
            GMVALWKFS++G   DTRPL+CFSADTVPIR++ WAP  SD+ESAN+++T+GH GLKFWD
Sbjct: 507  GMVALWKFSASGSPTDTRPLLCFSADTVPIRSVAWAPSGSDMESANVVLTAGHGGLKFWD 566

Query: 1333 LRDPFRPLWDI-PTQKIIYSLDWLPDPSCVILSFDDGEIRIMSLLRAASDVPITGMPCEK 1509
            +RDPF PLWD+ P  K IYSLDWLP+P CVILSFDDG ++++SL++AA DVP+TG P   
Sbjct: 567  IRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDDGTMKMLSLIQAACDVPVTGKPFTG 626

Query: 1510 T-QQXXXXXXXXXXXXXXXXXXXXTGMVAYCCSDGKVIHFQLTARALGKDHHRNREPHYL 1686
            T QQ                    TGMVAYC +DG V  FQLT++A+ KD  RNR PH++
Sbjct: 627  TKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGADGNVTRFQLTSKAVDKDFSRNRAPHFV 686

Query: 1687 CGSLAVEESTLTVFSPLPNVPFTLKRSLNEWGNTPRSKRGFLSHCNQEKRAKKEISKCQI 1866
            CGSL  EES + V +PLP++P TLK+  N++G  PRS R FL+  NQ K AK   +K   
Sbjct: 687  CGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGEGPRSMRAFLTESNQAKNAKDNKAKVPT 746

Query: 1867 -------------TGNSSDSEGTMVXXXXXXXXXXXXXXXXL----------VCIDDNKS 1977
                          G  S+SE T+                 +          V I++  +
Sbjct: 747  PDKQTLALCYGNDPGVESESEETLTLAALKGKIKQKSKSDRMKKAGDDQALAVRINEPAN 806

Query: 1978 XXXXXXXXXXXTLPPKIIAMHRVRWNMNKGSERLLCYGGASGIVRCQLIR*PRLFTK*CK 2157
                         PPKI+AMHRVRWNMNKGSER LCYGGA+GIVRCQ I  P +  K  +
Sbjct: 807  TQKEEAGNEIEVFPPKIVAMHRVRWNMNKGSERWLCYGGAAGIVRCQEIIVPDVAKKSAR 866

Query: 2158 RS 2163
            +S
Sbjct: 867  KS 868



 Score = 92.4 bits (228), Expect = 2e-15
 Identities = 41/75 (54%), Positives = 57/75 (76%)
 Frame = +3

Query: 3   DWCPRVHQSSDYDCNVEFIAVSAHPPESSYHIIGAPLTGRGLIQIWCVLNTGVKDQDVTS 182
           DWCPRVH++ +     EFIAV+AHPP+S YH IG PLTGRG+IQIWC+LN GV++++   
Sbjct: 129 DWCPRVHENPNSTVKCEFIAVAAHPPDSYYHKIGTPLTGRGIIQIWCMLNVGVEEEE-AP 187

Query: 183 LVKFKPRKYTKSKEA 227
           L K +P+  +++ EA
Sbjct: 188 LSKKRPKWRSQTTEA 202


>gb|OMO95816.1| hypothetical protein COLO4_15658 [Corchorus olitorius]
          Length = 1008

 Score =  508 bits (1307), Expect = e-163
 Identities = 258/472 (54%), Positives = 317/472 (67%), Gaps = 26/472 (5%)
 Frame = +1

Query: 796  ISEDVALPRLVMGLAHNGKVAWDVKWKPIDSHNISK--YRMGYLAVLLGSGALEVWEIPL 969
            I  D+ALPR V+ LAHNGKVAWDVKW+P D  NISK   RMGYLAVLLG+G+LEVWE+PL
Sbjct: 528  IPADMALPRGVLCLAHNGKVAWDVKWRPYDI-NISKCNQRMGYLAVLLGNGSLEVWEVPL 586

Query: 970  PRATKAIFSSCETEGADPRFVKLKPVFMCSMLKCGDIKSIPITLEWSTSAPHDLILAGCH 1149
            P   + ++SS   +G DPRFVKL+PVF CS LKCGDI+SIP+T+EWSTS PHD +LAGCH
Sbjct: 587  PHMIRTVYSSSAKQGTDPRFVKLEPVFKCSKLKCGDIQSIPLTVEWSTSPPHDYLLAGCH 646

Query: 1150 DGMVALWKFSSNGPSKDTRPLICFSADTVPIRALKWAPRASDLESANIIVTSGHKGLKFW 1329
            DGMVALWKFS++   KDTRPL+CFSADTVPIR++ WAP  SD+ES N+I+T+GH GLKFW
Sbjct: 647  DGMVALWKFSASASPKDTRPLLCFSADTVPIRSVAWAPSGSDMESTNVILTAGHGGLKFW 706

Query: 1330 DLRDPFRPLWDI-PTQKIIYSLDWLPDPSCVILSFDDGEIRIMSLLRAASDVPITGMPCE 1506
            D+RDPF PLWD+ P  K IYSLDWLP+P CVILSFDDG ++++SL +A SDVP+TG P  
Sbjct: 707  DIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDDGTMKLLSLSQAVSDVPVTGKPFT 766

Query: 1507 KT-QQXXXXXXXXXXXXXXXXXXXXTGMVAYCCSDGKVIHFQLTARALGKDHHRNREPHY 1683
             T QQ                    TGMVAYC +DG V HFQLT++A+ KD  RNR PH+
Sbjct: 767  GTKQQGLHLYNCSSFAIWNIQVSRLTGMVAYCGADGTVSHFQLTSKAVDKDFSRNRAPHF 826

Query: 1684 LCGSLAVEESTLTVFSPLPNVPFTLKRSLNEWGNTPRSKRGFLSHCNQEKRAKKEISKCQ 1863
            +CGSL  EES +T+ +PLP++P T+K+S +++G  PRS R FL+  NQ K AK + +K Q
Sbjct: 827  VCGSLIEEESVITINTPLPDIPLTMKKSTSDYGEGPRSMRAFLTETNQAKNAKDKKAKVQ 886

Query: 1864 IT-------------GNSSDSEGTMVXXXXXXXXXXXXXXXXLVCIDD---------NKS 1977
             +             G  SDSE T+                     D            +
Sbjct: 887  TSDKQTLALCYGDDPGVESDSEETLAALKCKKKQNSQSERNKKADNDQALAIRIEEATNN 946

Query: 1978 XXXXXXXXXXXTLPPKIIAMHRVRWNMNKGSERLLCYGGASGIVRCQLIR*P 2133
                         P K++AMHRVRWNMNKGSER LCYGGA+GIVRCQ I+ P
Sbjct: 947  TQKEETGNEIEVFPAKMVAMHRVRWNMNKGSERWLCYGGAAGIVRCQEIKVP 998



 Score = 89.4 bits (220), Expect = 2e-14
 Identities = 40/75 (53%), Positives = 54/75 (72%)
 Frame = +3

Query: 3   DWCPRVHQSSDYDCNVEFIAVSAHPPESSYHIIGAPLTGRGLIQIWCVLNTGVKDQDVTS 182
           DWCPRVH++       EFIAV+AHPPES YH +G P+TGRG++QIWC+LN GV  ++   
Sbjct: 129 DWCPRVHENPSSHVKCEFIAVAAHPPESYYHKMGTPVTGRGIVQIWCMLNVGVNVEE-PL 187

Query: 183 LVKFKPRKYTKSKEA 227
           L K KP + +++ EA
Sbjct: 188 LSKKKPNQRSQNTEA 202


>ref|XP_017969458.1| PREDICTED: uncharacterized protein LOC18612763 isoform X1 [Theobroma
            cacao]
          Length = 877

 Score =  503 bits (1295), Expect = e-163
 Identities = 253/482 (52%), Positives = 319/482 (66%), Gaps = 26/482 (5%)
 Frame = +1

Query: 796  ISEDVALPRLVMGLAHNGKVAWDVKWKPIDSHNIS-KYRMGYLAVLLGSGALEVWEIPLP 972
            I  D+ LPR V+ LAHNGKVAWDVKW+P D ++     RMGYLAVLLG+G+LEVWE+PLP
Sbjct: 396  IPRDIELPRTVLCLAHNGKVAWDVKWQPYDINDCECNQRMGYLAVLLGNGSLEVWEVPLP 455

Query: 973  RATKAIFSSCETEGADPRFVKLKPVFMCSMLKCGDIKSIPITLEWSTSAPHDLILAGCHD 1152
                 ++SS   +G DPRFVKL+PVF CS LKCGD++SIP+T+EWSTS P++ +LAGCHD
Sbjct: 456  HMISIVYSSSPKQGTDPRFVKLEPVFKCSKLKCGDVQSIPLTVEWSTSPPYNYLLAGCHD 515

Query: 1153 GMVALWKFSSNGPSKDTRPLICFSADTVPIRALKWAPRASDLESANIIVTSGHKGLKFWD 1332
            GMVALWKFS++G   DTRPL+CFSADTVPIR++ WAP  SD+ESAN+++T+GH GLKFWD
Sbjct: 516  GMVALWKFSASGSPTDTRPLLCFSADTVPIRSVAWAPSGSDMESANVVLTAGHGGLKFWD 575

Query: 1333 LRDPFRPLWDI-PTQKIIYSLDWLPDPSCVILSFDDGEIRIMSLLRAASDVPITGMPCEK 1509
            +RDPF PLWD+ P  K IYSLDWLP+P CVILSFDDG ++++SL++AA DVP+TG P   
Sbjct: 576  IRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDDGTMKMLSLIQAACDVPVTGKPFTG 635

Query: 1510 T-QQXXXXXXXXXXXXXXXXXXXXTGMVAYCCSDGKVIHFQLTARALGKDHHRNREPHYL 1686
            T QQ                    TGMVAYC +DG V  FQLT++A+ KD  RNR PH++
Sbjct: 636  TKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGADGNVTRFQLTSKAVDKDFSRNRAPHFV 695

Query: 1687 CGSLAVEESTLTVFSPLPNVPFTLKRSLNEWGNTPRSKRGFLSHCNQEKRAKKEISKCQI 1866
            CGSL  EES + V +PLP++P TLK+  N++G +PRS R FL+  NQ K AK   +K   
Sbjct: 696  CGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGESPRSMRAFLTESNQAKNAKDNKAKVPT 755

Query: 1867 -------------TGNSSDSEGTMVXXXXXXXXXXXXXXXXL----------VCIDDNKS 1977
                          G  S+SE T+                 +          V I++  +
Sbjct: 756  PDKRTLALCYGNDPGVESESEETLTLAALKGKIKQKSKSDRMKKAGDDQALAVRINEPTN 815

Query: 1978 XXXXXXXXXXXTLPPKIIAMHRVRWNMNKGSERLLCYGGASGIVRCQLIR*PRLFTK*CK 2157
                         PPKI+AMHRVRWNMNKGSER LCYGGA+GIVRCQ I  P +  K  +
Sbjct: 816  TQKEEAGNEIEVFPPKIVAMHRVRWNMNKGSERWLCYGGAAGIVRCQEIIVPDVAKKSAR 875

Query: 2158 RS 2163
            +S
Sbjct: 876  KS 877



 Score = 92.4 bits (228), Expect = 2e-15
 Identities = 41/75 (54%), Positives = 57/75 (76%)
 Frame = +3

Query: 3   DWCPRVHQSSDYDCNVEFIAVSAHPPESSYHIIGAPLTGRGLIQIWCVLNTGVKDQDVTS 182
           DWCPRVH++ +     EFIAV+AHPP+S YH IG PLTGRG+IQIWC+LN GV++++   
Sbjct: 138 DWCPRVHENPNSTVKCEFIAVAAHPPDSYYHKIGTPLTGRGIIQIWCMLNVGVEEEE-AP 196

Query: 183 LVKFKPRKYTKSKEA 227
           L K +P+  +++ EA
Sbjct: 197 LSKKRPKWRSQTTEA 211


>ref|XP_023920539.1| uncharacterized protein LOC112032069 isoform X1 [Quercus suber]
 gb|POF00175.1| general transcription factor 3c polypeptide 2 [Quercus suber]
          Length = 908

 Score =  503 bits (1294), Expect = e-162
 Identities = 262/472 (55%), Positives = 317/472 (67%), Gaps = 28/472 (5%)
 Frame = +1

Query: 793  LISEDVALPRLVMGLAHNGKVAWDVKWKPIDS-HNISKYRMGYLAVLLGSGALEVWEIPL 969
            LIS+DVALPR+V+ LAHNGKVAWDVKW+P ++  +  K+RMGYLAVLLG+G+LEVWE+PL
Sbjct: 424  LISKDVALPRVVLCLAHNGKVAWDVKWRPSNACQSKCKHRMGYLAVLLGNGSLEVWEVPL 483

Query: 970  PRATKAIFSSCETEGADPRFVKLKPVFMCSMLKCGDIKSIPITLEWSTSAPHDLILAGCH 1149
            PR  K I+SS   EG DPRFVKL+PVF  S+LKCG I+SIP+T+EWS S PHD +LAGCH
Sbjct: 484  PRTMKVIYSSVHQEGTDPRFVKLEPVFRGSLLKCGGIQSIPLTVEWSASPPHDYLLAGCH 543

Query: 1150 DGMVALWKFSSNGPSKDTRPLICFSADTVPIRALKWAPRASDLESANIIVTSGHKGLKFW 1329
            DG VALWKFS++  S+DTRPL+CFSADTVPIRAL WAP  SD ESAN+IVT+GH GLKFW
Sbjct: 544  DGTVALWKFSASCSSEDTRPLLCFSADTVPIRALAWAPLESDPESANVIVTAGHGGLKFW 603

Query: 1330 DLRDPFRPLWDI-PTQKIIYSLDWLPDPSCVILSFDDGEIRIMSLLRAASDVPITGMPCE 1506
            DLRDP+RPLWD+ P  +IIYSLDWL +P CVILSFDDG +RI+SLL+AA DVP+TG P  
Sbjct: 604  DLRDPYRPLWDLHPVPRIIYSLDWLSNPRCVILSFDDGTMRILSLLKAAYDVPVTGKPFG 663

Query: 1507 KT-QQXXXXXXXXXXXXXXXXXXXXTGMVAYCCSDGKVIHFQLTARALGKDHHRNREPHY 1683
             T QQ                    TGM AYC +DG V+ FQLT++A+ KD  RNR PH+
Sbjct: 664  GTKQQGLHSYYCSSFAIWSVQVSRITGMAAYCTADGTVLRFQLTSKAVDKDPSRNRTPHF 723

Query: 1684 LCGSLAVEESTLTVFSPLPNVPFTLKRSLNEWGNTPRSKRGFLSHCNQEKRAKKEISKCQ 1863
            LCGSL  EES +T+ +P+PN PF LK+SLN+ G+TP S R F S     KRA  +++K  
Sbjct: 724  LCGSLTEEESLITINTPVPNTPFPLKKSLNKGGDTPLSMREFSSEPQHVKRANDKMAKSP 783

Query: 1864 IT-------------GNSSDSEGTMV-------XXXXXXXXXXXXXXXXLVCIDD----- 1968
             T             G  S +E  +                        LVC D+     
Sbjct: 784  STDATTLALCYGDDPGTESGTEEALTRPKSKKRPNSRSSNKKNPEDDLALVCRDEEPPNT 843

Query: 1969 NKSXXXXXXXXXXXTLPPKIIAMHRVRWNMNKGSERLLCYGGASGIVRCQLI 2124
             +              PPKI+AM RVRWNMNKGSER LCYGG +G+VRCQ I
Sbjct: 844  QEKENGKAEARTIEVFPPKIVAMRRVRWNMNKGSERWLCYGGEAGVVRCQEI 895



 Score = 94.0 bits (232), Expect = 7e-16
 Identities = 41/75 (54%), Positives = 57/75 (76%)
 Frame = +3

Query: 3   DWCPRVHQSSDYDCNVEFIAVSAHPPESSYHIIGAPLTGRGLIQIWCVLNTGVKDQDVTS 182
           DWCPR+ ++ DY    EFIAV+AHPP +SYH +GAPLTGRG+IQIWC++N GV +++V  
Sbjct: 183 DWCPRIRETPDYHNKCEFIAVAAHPPGTSYHKMGAPLTGRGVIQIWCLMNVGVNEEEVPP 242

Query: 183 LVKFKPRKYTKSKEA 227
            +  KP++ TK+  A
Sbjct: 243 SLA-KPKQGTKNNGA 256


>ref|XP_022765154.1| uncharacterized protein LOC111310200 isoform X6 [Durio zibethinus]
          Length = 731

 Score =  497 bits (1279), Expect = e-162
 Identities = 256/479 (53%), Positives = 318/479 (66%), Gaps = 23/479 (4%)
 Frame = +1

Query: 796  ISEDVALPRLVMGLAHNGKVAWDVKWKPIDSHNISKY--RMGYLAVLLGSGALEVWEIPL 969
            I  D+ALPR V+ LAHNGKVAWDVKWKP D  N SK+  RMGYLAVLLG+G+LEVWE+PL
Sbjct: 254  IPGDIALPRAVLCLAHNGKVAWDVKWKPYDI-NDSKFNQRMGYLAVLLGNGSLEVWEVPL 312

Query: 970  PRATKAIFSSCETEGADPRFVKLKPVFMCSMLKCGDIKSIPITLEWSTSAPHDLILAGCH 1149
            P   + ++S    +G DPRFVKL+PV  CS LKCGDI+SIP+T+EWSTS+PHD +LAGCH
Sbjct: 313  PNMIRNVYSLSPKQGTDPRFVKLEPVLKCSKLKCGDIQSIPLTVEWSTSSPHDYLLAGCH 372

Query: 1150 DGMVALWKFSSNGPSKDTRPLICFSADTVPIRALKWAPRASDLESANIIVTSGHKGLKFW 1329
            DGMVALWKFS++   KDTRPL+CFSAD+VPIR++ WAP  SD+ES N+I+T+GH GLKFW
Sbjct: 373  DGMVALWKFSASDTPKDTRPLLCFSADSVPIRSVAWAPSGSDMESRNVILTAGHGGLKFW 432

Query: 1330 DLRDPFRPLWDI-PTQKIIYSLDWLPDPSCVILSFDDGEIRIMSLLRAASDVPITGMPCE 1506
            D+RDPF PLWD+ P  K IYSLDWLP+P CVILSFDDG ++++SL+ AA DVP+TG P  
Sbjct: 433  DIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDDGTMKMLSLVHAACDVPVTGKPFT 492

Query: 1507 KT-QQXXXXXXXXXXXXXXXXXXXXTGMVAYCCSDGKVIHFQLTARALGKDHHRNREPHY 1683
             T QQ                    TGMVAYC +DG V  FQLT++A+ KD  R+R PH+
Sbjct: 493  GTKQQGLHLYNCSSFAIWSVQVSRLTGMVAYCGADGTVTRFQLTSKAVDKDFSRHRTPHF 552

Query: 1684 LCGSLAVEESTLTVFSPLPNVPFTLKRSLNEWGNTPRSKRGFLSHCNQEKRAKKEISKCQ 1863
             CGSL  EES + V +PLP++P TLK+  N++G  PRS R FL+  NQ K AK   +K  
Sbjct: 553  PCGSLTEEESAVIVNTPLPDIPVTLKKPSNDYGEGPRSMRAFLTESNQGKNAKDRKAKVA 612

Query: 1864 IT-------------GNSSDSEGTMV------XXXXXXXXXXXXXXXXLVCIDDNKSXXX 1986
             +             G  S+SE T+                        + I++  +   
Sbjct: 613  ASNKQTLALYYGNDPGVESESEETLAALQSKRKQKSNGKKKADDDQVLAIRIEEPTNTQK 672

Query: 1987 XXXXXXXXTLPPKIIAMHRVRWNMNKGSERLLCYGGASGIVRCQLIR*PRLFTK*CKRS 2163
                      PPKI+AMHRVRWNMNKGSER LCYGGA+GIVRCQ I  P +  K  ++S
Sbjct: 673  EETGNEIEVFPPKIVAMHRVRWNMNKGSERWLCYGGAAGIVRCQEIIVPDVDKKSARKS 731



 Score = 75.5 bits (184), Expect = 3e-10
 Identities = 35/60 (58%), Positives = 46/60 (76%)
 Frame = +3

Query: 48  VEFIAVSAHPPESSYHIIGAPLTGRGLIQIWCVLNTGVKDQDVTSLVKFKPRKYTKSKEA 227
           ++FIAV+ HPPES YH +G PLTGRG+IQIWCVLN GV +++   L K KP++  +S EA
Sbjct: 94  IQFIAVATHPPESYYHKMGTPLTGRGIIQIWCVLNVGVNEEE-DPLSKKKPKRGFQSTEA 152


>gb|KDO61089.1| hypothetical protein CISIN_1g008363mg [Citrus sinensis]
          Length = 568

 Score =  490 bits (1261), Expect = e-162
 Identities = 259/481 (53%), Positives = 313/481 (65%), Gaps = 30/481 (6%)
 Frame = +1

Query: 796  ISEDVALPRLVMGLAHNGKVAWDVKWKPIDSHNIS-KYRMGYLAVLLGSGALEVWEIPLP 972
            I +D+ALPR+V+ LAHNGKVAWDVKWKP ++ +   K R+GYLAVLLG+G+LEVWE+PL 
Sbjct: 83   IPKDIALPRVVLCLAHNGKVAWDVKWKPYNAVDCKCKQRLGYLAVLLGNGSLEVWEVPLL 142

Query: 973  RATKAIFSSCETEGADPRFVKLKPVFMCSMLKCGDIKSIPITLEWSTSAPHDLILAGCHD 1152
            R  KAI+ S   EG DPRFVKL+PVF CSMLKCG  +SIP+T+EWSTS PHD +LAGCHD
Sbjct: 143  RTMKAIYLSSMKEGTDPRFVKLEPVFRCSMLKCGGTQSIPLTMEWSTSPPHDYLLAGCHD 202

Query: 1153 GMVALWKFSSNGPSKDTRPLICFSADTVPIRALKWAPRASDLESANIIVTSGHKGLKFWD 1332
            G VALWKF ++  S D+RPL+CFSADT+PIRA+ WAP  SD +SAN+I+T+GH GLKFWD
Sbjct: 203  GTVALWKFVASDSSIDSRPLLCFSADTLPIRAVSWAPAESDSDSANVILTAGHGGLKFWD 262

Query: 1333 LRDPFRPLWDI-PTQKIIYSLDWLPDPSCVILSFDDGEIRIMSLLRAASDVPITGMPCEK 1509
            +RDPFRPLWDI P  K IY LDWLPDP CVILSFDDG +RI+SLL+AA DVP TG P   
Sbjct: 263  IRDPFRPLWDIHPAPKFIYGLDWLPDPGCVILSFDDGAMRIVSLLKAAYDVPATGKPFAG 322

Query: 1510 T-QQXXXXXXXXXXXXXXXXXXXXTGMVAYCCSDGKVIHFQLTARALGKDHHRNREPHYL 1686
            T QQ                    TGMVAYC +DG V  FQLTA+A+ KDH RNR  H+L
Sbjct: 323  TKQQGLHLVNCSSFAIWSVQVSRLTGMVAYCSADGTVHRFQLTAKAVEKDHSRNRPMHFL 382

Query: 1687 CGSLAVEESTLTVFSPLPNVPFTLKRSLNEWGNTPRSKRGFLSHCN-----QEKRAKKEI 1851
            CGS+  +ES +TV +PL N P  LK+++++ G   RS R FL   N      +K+ K  +
Sbjct: 383  CGSVTEDESAITVNTPLDNTPVPLKKTVHDAGE--RSMRSFLIESNSSKSPNDKKGKNVL 440

Query: 1852 SK-------CQITGNSSDSEGTMV---------XXXXXXXXXXXXXXXXLVCIDD----- 1968
            S        C       +SEG M                          +VCID+     
Sbjct: 441  SSDNQPLALCYGNEPGEESEGDMTLAALKNKQKPKSRSSSKKKEEDDQAMVCIDEEATDI 500

Query: 1969 -NKSXXXXXXXXXXXTLPPKIIAMHRVRWNMNKGSERLLCYGGASGIVRCQLIR*PRLFT 2145
              K             LPPK++AMHRVRWNMNKGSER LCYGGA GI+RCQ IR P +  
Sbjct: 501  QGKENEKGEAGNGIEVLPPKVVAMHRVRWNMNKGSERWLCYGGAGGIIRCQEIRVPDIDK 560

Query: 2146 K 2148
            K
Sbjct: 561  K 561


>ref|XP_022765150.1| uncharacterized protein LOC111310200 isoform X2 [Durio zibethinus]
          Length = 782

 Score =  497 bits (1279), Expect = e-161
 Identities = 256/479 (53%), Positives = 318/479 (66%), Gaps = 23/479 (4%)
 Frame = +1

Query: 796  ISEDVALPRLVMGLAHNGKVAWDVKWKPIDSHNISKY--RMGYLAVLLGSGALEVWEIPL 969
            I  D+ALPR V+ LAHNGKVAWDVKWKP D  N SK+  RMGYLAVLLG+G+LEVWE+PL
Sbjct: 305  IPGDIALPRAVLCLAHNGKVAWDVKWKPYDI-NDSKFNQRMGYLAVLLGNGSLEVWEVPL 363

Query: 970  PRATKAIFSSCETEGADPRFVKLKPVFMCSMLKCGDIKSIPITLEWSTSAPHDLILAGCH 1149
            P   + ++S    +G DPRFVKL+PV  CS LKCGDI+SIP+T+EWSTS+PHD +LAGCH
Sbjct: 364  PNMIRNVYSLSPKQGTDPRFVKLEPVLKCSKLKCGDIQSIPLTVEWSTSSPHDYLLAGCH 423

Query: 1150 DGMVALWKFSSNGPSKDTRPLICFSADTVPIRALKWAPRASDLESANIIVTSGHKGLKFW 1329
            DGMVALWKFS++   KDTRPL+CFSAD+VPIR++ WAP  SD+ES N+I+T+GH GLKFW
Sbjct: 424  DGMVALWKFSASDTPKDTRPLLCFSADSVPIRSVAWAPSGSDMESRNVILTAGHGGLKFW 483

Query: 1330 DLRDPFRPLWDI-PTQKIIYSLDWLPDPSCVILSFDDGEIRIMSLLRAASDVPITGMPCE 1506
            D+RDPF PLWD+ P  K IYSLDWLP+P CVILSFDDG ++++SL+ AA DVP+TG P  
Sbjct: 484  DIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDDGTMKMLSLVHAACDVPVTGKPFT 543

Query: 1507 KT-QQXXXXXXXXXXXXXXXXXXXXTGMVAYCCSDGKVIHFQLTARALGKDHHRNREPHY 1683
             T QQ                    TGMVAYC +DG V  FQLT++A+ KD  R+R PH+
Sbjct: 544  GTKQQGLHLYNCSSFAIWSVQVSRLTGMVAYCGADGTVTRFQLTSKAVDKDFSRHRTPHF 603

Query: 1684 LCGSLAVEESTLTVFSPLPNVPFTLKRSLNEWGNTPRSKRGFLSHCNQEKRAKKEISKCQ 1863
             CGSL  EES + V +PLP++P TLK+  N++G  PRS R FL+  NQ K AK   +K  
Sbjct: 604  PCGSLTEEESAVIVNTPLPDIPVTLKKPSNDYGEGPRSMRAFLTESNQGKNAKDRKAKVA 663

Query: 1864 IT-------------GNSSDSEGTMV------XXXXXXXXXXXXXXXXLVCIDDNKSXXX 1986
             +             G  S+SE T+                        + I++  +   
Sbjct: 664  ASNKQTLALYYGNDPGVESESEETLAALQSKRKQKSNGKKKADDDQVLAIRIEEPTNTQK 723

Query: 1987 XXXXXXXXTLPPKIIAMHRVRWNMNKGSERLLCYGGASGIVRCQLIR*PRLFTK*CKRS 2163
                      PPKI+AMHRVRWNMNKGSER LCYGGA+GIVRCQ I  P +  K  ++S
Sbjct: 724  EETGNEIEVFPPKIVAMHRVRWNMNKGSERWLCYGGAAGIVRCQEIIVPDVDKKSARKS 782



 Score = 94.0 bits (232), Expect = 6e-16
 Identities = 43/75 (57%), Positives = 55/75 (73%)
 Frame = +3

Query: 3   DWCPRVHQSSDYDCNVEFIAVSAHPPESSYHIIGAPLTGRGLIQIWCVLNTGVKDQDVTS 182
           DWCPRVH++ +     EFIAV+ HPPES YH +G PLTGRG+IQIWCVLN GV +++   
Sbjct: 130 DWCPRVHENPNRPVKCEFIAVATHPPESYYHKMGTPLTGRGIIQIWCVLNVGVNEEE-DP 188

Query: 183 LVKFKPRKYTKSKEA 227
           L K KP++  +S EA
Sbjct: 189 LSKKKPKRGFQSTEA 203


>ref|XP_022765149.1| uncharacterized protein LOC111310200 isoform X1 [Durio zibethinus]
          Length = 783

 Score =  497 bits (1279), Expect = e-161
 Identities = 256/479 (53%), Positives = 318/479 (66%), Gaps = 23/479 (4%)
 Frame = +1

Query: 796  ISEDVALPRLVMGLAHNGKVAWDVKWKPIDSHNISKY--RMGYLAVLLGSGALEVWEIPL 969
            I  D+ALPR V+ LAHNGKVAWDVKWKP D  N SK+  RMGYLAVLLG+G+LEVWE+PL
Sbjct: 306  IPGDIALPRAVLCLAHNGKVAWDVKWKPYDI-NDSKFNQRMGYLAVLLGNGSLEVWEVPL 364

Query: 970  PRATKAIFSSCETEGADPRFVKLKPVFMCSMLKCGDIKSIPITLEWSTSAPHDLILAGCH 1149
            P   + ++S    +G DPRFVKL+PV  CS LKCGDI+SIP+T+EWSTS+PHD +LAGCH
Sbjct: 365  PNMIRNVYSLSPKQGTDPRFVKLEPVLKCSKLKCGDIQSIPLTVEWSTSSPHDYLLAGCH 424

Query: 1150 DGMVALWKFSSNGPSKDTRPLICFSADTVPIRALKWAPRASDLESANIIVTSGHKGLKFW 1329
            DGMVALWKFS++   KDTRPL+CFSAD+VPIR++ WAP  SD+ES N+I+T+GH GLKFW
Sbjct: 425  DGMVALWKFSASDTPKDTRPLLCFSADSVPIRSVAWAPSGSDMESRNVILTAGHGGLKFW 484

Query: 1330 DLRDPFRPLWDI-PTQKIIYSLDWLPDPSCVILSFDDGEIRIMSLLRAASDVPITGMPCE 1506
            D+RDPF PLWD+ P  K IYSLDWLP+P CVILSFDDG ++++SL+ AA DVP+TG P  
Sbjct: 485  DIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDDGTMKMLSLVHAACDVPVTGKPFT 544

Query: 1507 KT-QQXXXXXXXXXXXXXXXXXXXXTGMVAYCCSDGKVIHFQLTARALGKDHHRNREPHY 1683
             T QQ                    TGMVAYC +DG V  FQLT++A+ KD  R+R PH+
Sbjct: 545  GTKQQGLHLYNCSSFAIWSVQVSRLTGMVAYCGADGTVTRFQLTSKAVDKDFSRHRTPHF 604

Query: 1684 LCGSLAVEESTLTVFSPLPNVPFTLKRSLNEWGNTPRSKRGFLSHCNQEKRAKKEISKCQ 1863
             CGSL  EES + V +PLP++P TLK+  N++G  PRS R FL+  NQ K AK   +K  
Sbjct: 605  PCGSLTEEESAVIVNTPLPDIPVTLKKPSNDYGEGPRSMRAFLTESNQGKNAKDRKAKVA 664

Query: 1864 IT-------------GNSSDSEGTMV------XXXXXXXXXXXXXXXXLVCIDDNKSXXX 1986
             +             G  S+SE T+                        + I++  +   
Sbjct: 665  ASNKQTLALYYGNDPGVESESEETLAALQSKRKQKSNGKKKADDDQVLAIRIEEPTNTQK 724

Query: 1987 XXXXXXXXTLPPKIIAMHRVRWNMNKGSERLLCYGGASGIVRCQLIR*PRLFTK*CKRS 2163
                      PPKI+AMHRVRWNMNKGSER LCYGGA+GIVRCQ I  P +  K  ++S
Sbjct: 725  EETGNEIEVFPPKIVAMHRVRWNMNKGSERWLCYGGAAGIVRCQEIIVPDVDKKSARKS 783



 Score = 94.0 bits (232), Expect = 6e-16
 Identities = 43/75 (57%), Positives = 55/75 (73%)
 Frame = +3

Query: 3   DWCPRVHQSSDYDCNVEFIAVSAHPPESSYHIIGAPLTGRGLIQIWCVLNTGVKDQDVTS 182
           DWCPRVH++ +     EFIAV+ HPPES YH +G PLTGRG+IQIWCVLN GV +++   
Sbjct: 131 DWCPRVHENPNRPVKCEFIAVATHPPESYYHKMGTPLTGRGIIQIWCVLNVGVNEEE-DP 189

Query: 183 LVKFKPRKYTKSKEA 227
           L K KP++  +S EA
Sbjct: 190 LSKKKPKRGFQSTEA 204


>ref|XP_017969461.1| PREDICTED: uncharacterized protein LOC18612763 isoform X3 [Theobroma
            cacao]
          Length = 865

 Score =  498 bits (1283), Expect = e-161
 Identities = 253/483 (52%), Positives = 319/483 (66%), Gaps = 27/483 (5%)
 Frame = +1

Query: 796  ISEDVALPRLVMGLAHNGKVAWDVKWKPIDSHNIS-KYRMGYLAVLLGSGALEV-WEIPL 969
            I  D+ LPR V+ LAHNGKVAWDVKW+P D ++     RMGYLAVLLG+G+LEV WE+PL
Sbjct: 383  IPRDIELPRTVLCLAHNGKVAWDVKWQPYDINDCECNQRMGYLAVLLGNGSLEVRWEVPL 442

Query: 970  PRATKAIFSSCETEGADPRFVKLKPVFMCSMLKCGDIKSIPITLEWSTSAPHDLILAGCH 1149
            P     ++SS   +G DPRFVKL+PVF CS LKCGD++SIP+T+EWSTS P++ +LAGCH
Sbjct: 443  PHMISIVYSSSPKQGTDPRFVKLEPVFKCSKLKCGDVQSIPLTVEWSTSPPYNYLLAGCH 502

Query: 1150 DGMVALWKFSSNGPSKDTRPLICFSADTVPIRALKWAPRASDLESANIIVTSGHKGLKFW 1329
            DGMVALWKFS++G   DTRPL+CFSADTVPIR++ WAP  SD+ESAN+++T+GH GLKFW
Sbjct: 503  DGMVALWKFSASGSPTDTRPLLCFSADTVPIRSVAWAPSGSDMESANVVLTAGHGGLKFW 562

Query: 1330 DLRDPFRPLWDI-PTQKIIYSLDWLPDPSCVILSFDDGEIRIMSLLRAASDVPITGMPCE 1506
            D+RDPF PLWD+ P  K IYSLDWLP+P CVILSFDDG ++++SL++AA DVP+TG P  
Sbjct: 563  DIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILSFDDGTMKMLSLIQAACDVPVTGKPFT 622

Query: 1507 KT-QQXXXXXXXXXXXXXXXXXXXXTGMVAYCCSDGKVIHFQLTARALGKDHHRNREPHY 1683
             T QQ                    TGMVAYC +DG V  FQLT++A+ KD  RNR PH+
Sbjct: 623  GTKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGADGNVTRFQLTSKAVDKDFSRNRAPHF 682

Query: 1684 LCGSLAVEESTLTVFSPLPNVPFTLKRSLNEWGNTPRSKRGFLSHCNQEKRAKKEISKCQ 1863
            +CGSL  EES + V +PLP++P TLK+  N++G +PRS R FL+  NQ K AK   +K  
Sbjct: 683  VCGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGESPRSMRAFLTESNQAKNAKDNKAKVP 742

Query: 1864 I-------------TGNSSDSEGTMVXXXXXXXXXXXXXXXXL----------VCIDDNK 1974
                           G  S+SE T+                 +          V I++  
Sbjct: 743  TPDKRTLALCYGNDPGVESESEETLTLAALKGKIKQKSKSDRMKKAGDDQALAVRINEPT 802

Query: 1975 SXXXXXXXXXXXTLPPKIIAMHRVRWNMNKGSERLLCYGGASGIVRCQLIR*PRLFTK*C 2154
            +             PPKI+AMHRVRWNMNKGSER LCYGGA+GIVRCQ I  P +  K  
Sbjct: 803  NTQKEEAGNEIEVFPPKIVAMHRVRWNMNKGSERWLCYGGAAGIVRCQEIIVPDVAKKSA 862

Query: 2155 KRS 2163
            ++S
Sbjct: 863  RKS 865



 Score = 92.4 bits (228), Expect = 2e-15
 Identities = 41/75 (54%), Positives = 57/75 (76%)
 Frame = +3

Query: 3   DWCPRVHQSSDYDCNVEFIAVSAHPPESSYHIIGAPLTGRGLIQIWCVLNTGVKDQDVTS 182
           DWCPRVH++ +     EFIAV+AHPP+S YH IG PLTGRG+IQIWC+LN GV++++   
Sbjct: 138 DWCPRVHENPNSTVKCEFIAVAAHPPDSYYHKIGTPLTGRGIIQIWCMLNVGVEEEE-AP 196

Query: 183 LVKFKPRKYTKSKEA 227
           L K +P+  +++ EA
Sbjct: 197 LSKKRPKWRSQTTEA 211


>ref|XP_024037794.1| uncharacterized protein LOC18039905 isoform X3 [Citrus clementina]
          Length = 801

 Score =  490 bits (1261), Expect = e-158
 Identities = 259/481 (53%), Positives = 313/481 (65%), Gaps = 30/481 (6%)
 Frame = +1

Query: 796  ISEDVALPRLVMGLAHNGKVAWDVKWKPIDSHNIS-KYRMGYLAVLLGSGALEVWEIPLP 972
            I +D+ALPR+V+ LAHNGKVAWDVKWKP ++ +   K R+GYLAVLLG+G+LEVWE+PL 
Sbjct: 316  IPKDIALPRVVLCLAHNGKVAWDVKWKPYNAVDCKCKQRLGYLAVLLGNGSLEVWEVPLL 375

Query: 973  RATKAIFSSCETEGADPRFVKLKPVFMCSMLKCGDIKSIPITLEWSTSAPHDLILAGCHD 1152
            R  KAI+ S   EG DPRFVKL+PVF CSMLKCG  +SIP+T+EWSTS PHD +LAGCHD
Sbjct: 376  RTMKAIYLSSMKEGTDPRFVKLEPVFRCSMLKCGGTQSIPLTMEWSTSPPHDYLLAGCHD 435

Query: 1153 GMVALWKFSSNGPSKDTRPLICFSADTVPIRALKWAPRASDLESANIIVTSGHKGLKFWD 1332
            G VALWKF ++  S D+RPL+CFSADT+PIRA+ WAP  SD +SAN+I+T+GH GLKFWD
Sbjct: 436  GTVALWKFVASDSSIDSRPLLCFSADTLPIRAVSWAPAESDSDSANVILTAGHGGLKFWD 495

Query: 1333 LRDPFRPLWDI-PTQKIIYSLDWLPDPSCVILSFDDGEIRIMSLLRAASDVPITGMPCEK 1509
            +RDPFRPLWDI P  K IY LDWLPDP CVILSFDDG +RI+SLL+AA DVP TG P   
Sbjct: 496  IRDPFRPLWDIHPAPKFIYGLDWLPDPGCVILSFDDGAMRIVSLLKAAYDVPATGKPFAG 555

Query: 1510 T-QQXXXXXXXXXXXXXXXXXXXXTGMVAYCCSDGKVIHFQLTARALGKDHHRNREPHYL 1686
            T QQ                    TGMVAYC +DG V  FQLTA+A+ KDH RNR  H+L
Sbjct: 556  TKQQGLHLVNCSSFAIWSVQVSRLTGMVAYCSADGTVHRFQLTAKAVEKDHSRNRPMHFL 615

Query: 1687 CGSLAVEESTLTVFSPLPNVPFTLKRSLNEWGNTPRSKRGFLSHCN-----QEKRAKKEI 1851
            CGS+  +ES +TV +PL N P  LK+++++ G   RS R FL   N      +K+ K  +
Sbjct: 616  CGSVTEDESAITVNTPLDNTPVPLKKTVHDAGE--RSMRSFLIESNSSKSPNDKKGKNVL 673

Query: 1852 SK-------CQITGNSSDSEGTMV---------XXXXXXXXXXXXXXXXLVCIDD----- 1968
            S        C       +SEG M                          +VCID+     
Sbjct: 674  SSDNQPLALCYGNEPGEESEGDMTLAALKNKQKPKSRSSSKKKEEDDQAMVCIDEEAMDI 733

Query: 1969 -NKSXXXXXXXXXXXTLPPKIIAMHRVRWNMNKGSERLLCYGGASGIVRCQLIR*PRLFT 2145
              K             LPPK++AMHRVRWNMNKGSER LCYGGA GI+RCQ IR P +  
Sbjct: 734  QGKENEKGEAGNGIEVLPPKVVAMHRVRWNMNKGSERWLCYGGAGGIIRCQEIRVPDIDK 793

Query: 2146 K 2148
            K
Sbjct: 794  K 794



 Score =  102 bits (253), Expect = 2e-18
 Identities = 60/162 (37%), Positives = 81/162 (50%), Gaps = 40/162 (24%)
 Frame = +3

Query: 3   DWCPRVHQSSDYDCNVEFIAVSAHPPESSYHIIGAPLTGRGLIQIWCVLNTGVKDQ---- 170
           DWCPRVH+  D     EFIAV+AHPPES YH +GAPLTGRG+IQIWC+LN GV ++    
Sbjct: 21  DWCPRVHEKPDCQVKCEFIAVAAHPPESCYHKLGAPLTGRGMIQIWCMLNVGVNEEEARS 80

Query: 171 ----------------DVTSLVKFKPR---------------KYTKSKE-----AKKPKE 242
                           D T   + +PR               K T+SK       KKPK+
Sbjct: 81  PKRNLKRKSQNFEDSDDKTKRPRGRPRKKPTDEALDDYATKDKLTQSKRPRGRPRKKPKD 140

Query: 243 NQXXXXXXXXXXXXLSEIDGIDQLLQDISVQSSENSNNLLQL 368
                            +DG++Q +Q ++VQ  E+S+N+L +
Sbjct: 141 ESS------------GNLDGVEQFVQPLAVQYPEDSSNMLTI 170

