BLASTX nr result

ID: Rehmannia31_contig00019480 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia31_contig00019480
         (808 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PIN06182.1| hypothetical protein CDL12_21265 [Handroanthus im...   381   e-128
ref|XP_020553638.1| uncharacterized protein LOC105173178 isoform...   371   e-124
ref|XP_011093153.1| uncharacterized protein LOC105173178 isoform...   371   e-122
ref|XP_022870762.1| uncharacterized protein LOC111390006 isoform...   346   e-112
ref|XP_022870761.1| uncharacterized protein LOC111390006 isoform...   346   e-112
gb|EYU35076.1| hypothetical protein MIMGU_mgv1a002868mg [Erythra...   335   e-108
ref|XP_012840183.1| PREDICTED: uncharacterized protein LOC105960...   335   e-108
gb|KZV57637.1| hypothetical protein F511_03097 [Dorcoceras hygro...   327   e-105
ref|XP_022870763.1| uncharacterized protein LOC111390006 isoform...   325   e-104
gb|EOX96023.1| Uncharacterized protein TCM_005376 isoform 2 [The...   314   e-102
gb|EOX96024.1| Uncharacterized protein TCM_005376 isoform 3 [The...   314   e-101
gb|EOX96026.1| Uncharacterized protein TCM_005376 isoform 5 [The...   309   e-100
ref|XP_017969521.1| PREDICTED: uncharacterized protein LOC186141...   314   e-100
ref|XP_007051865.1| PREDICTED: uncharacterized protein LOC186141...   314   e-100
ref|XP_022035812.1| uncharacterized protein LOC110937671 isoform...   310   1e-98
ref|XP_022035811.1| uncharacterized protein LOC110937671 isoform...   310   2e-98
ref|XP_022757275.1| uncharacterized protein LOC111304704 isoform...   308   5e-98
gb|EPS67204.1| hypothetical protein M569_07572, partial [Genlise...   309   6e-98
ref|XP_022757274.1| uncharacterized protein LOC111304704 isoform...   308   1e-97
emb|CDP08843.1| unnamed protein product [Coffea canephora]            310   1e-97

>gb|PIN06182.1| hypothetical protein CDL12_21265 [Handroanthus impetiginosus]
          Length = 455

 Score =  381 bits (978), Expect = e-128
 Identities = 191/267 (71%), Positives = 215/267 (80%)
 Frame = -3

Query: 806 LPFAEVTFNGTVCVAKAEGTGGVLNFSTCAEQLLYEVGDPGAYITPDVIVDFQDVTFQPL 627
           LPFAEVTF+GTVCVAKAEGTGGVLNFSTCAEQLLYEVGDP AYITPDVIVDF+DVTFQPL
Sbjct: 56  LPFAEVTFDGTVCVAKAEGTGGVLNFSTCAEQLLYEVGDPSAYITPDVIVDFRDVTFQPL 115

Query: 626 SDSKVLCFGAKPSPEPIPQNLLLLRSKECGWKGWGEISYGGYECAQRAKAAEFLVRAWME 447
           S+SKVLC GAKPSPE +PQ LLLLRSK+ GWKGWGEISYGGYE  QRAKAAEFLVRAWME
Sbjct: 116 SNSKVLCSGAKPSPESVPQKLLLLRSKDSGWKGWGEISYGGYESMQRAKAAEFLVRAWME 175

Query: 446 ESYPGISKNILSYIIGLDSLKTMCIGNILSKPCEDIRLRIDGLFEKEEHAIHFTKEFTAL 267
           E YPG+  NI+SYIIGLDSLK  CI +IL KP EDIRLR+DGLFEKEEHAI FT+EFTAL
Sbjct: 176 EVYPGLRNNIISYIIGLDSLKAACIDDILPKPSEDIRLRMDGLFEKEEHAIQFTREFTAL 235

Query: 266 YTNXXXXXXXXXXGWKKEIFLEKGLVGREYIHWQVSVARXXXXXXXXXXXSHVATNQNSS 87
           YTN          G KKEIFLEKGLV RE++HWQ+S+ R           S + TN+ S 
Sbjct: 236 YTNGPAGGGGISTGCKKEIFLEKGLVWREHVHWQISMGRNNTVTSTNQNISDIITNEKSL 295

Query: 86  RHESNSPSVPRETIHNSSKESLTTENR 6
             +S+SP V +ET ++S++ES+  E R
Sbjct: 296 SPKSDSPLVGKETKYSSNQESIPPETR 322


>ref|XP_020553638.1| uncharacterized protein LOC105173178 isoform X2 [Sesamum indicum]
          Length = 518

 Score =  371 bits (953), Expect = e-124
 Identities = 185/265 (69%), Positives = 211/265 (79%)
 Frame = -3

Query: 806 LPFAEVTFNGTVCVAKAEGTGGVLNFSTCAEQLLYEVGDPGAYITPDVIVDFQDVTFQPL 627
           LPFAEVTF+G VCVAKAE TGGVL+FSTCA+QLLYEVGDP AYITPDVIVD + VTFQPL
Sbjct: 120 LPFAEVTFDGKVCVAKAESTGGVLDFSTCAQQLLYEVGDPSAYITPDVIVDVRHVTFQPL 179

Query: 626 SDSKVLCFGAKPSPEPIPQNLLLLRSKECGWKGWGEISYGGYECAQRAKAAEFLVRAWME 447
           S+SKVLC GAKPSPE +PQ LLLL SK+CGWKGWGEISYGGYE  QRAKAAEFLVRAWM+
Sbjct: 180 SNSKVLCCGAKPSPESVPQKLLLLGSKDCGWKGWGEISYGGYESVQRAKAAEFLVRAWMD 239

Query: 446 ESYPGISKNILSYIIGLDSLKTMCIGNILSKPCEDIRLRIDGLFEKEEHAIHFTKEFTAL 267
           E YPGI+ +I+SYIIGLDS+K  CIG I+ KP ED+RLR+DGLFEKEEHA+ FTKEFTAL
Sbjct: 240 ELYPGINNHIISYIIGLDSVKATCIGEIMLKPSEDVRLRMDGLFEKEEHAVQFTKEFTAL 299

Query: 266 YTNXXXXXXXXXXGWKKEIFLEKGLVGREYIHWQVSVARXXXXXXXXXXXSHVATNQNSS 87
           YTN          G KKEIFLEKGLVGREY+ WQ+S+AR           S   T QN +
Sbjct: 300 YTNGPAGGGGISTGCKKEIFLEKGLVGREYVRWQISMARNNIISPTDKSASRSVTKQNYA 359

Query: 86  RHESNSPSVPRETIHNSSKESLTTE 12
             ES+S  VPRET+ ++S+ES+  E
Sbjct: 360 WLESDSIPVPRETVDSASQESVAQE 384


>ref|XP_011093153.1| uncharacterized protein LOC105173178 isoform X1 [Sesamum indicum]
          Length = 646

 Score =  371 bits (953), Expect = e-122
 Identities = 185/265 (69%), Positives = 211/265 (79%)
 Frame = -3

Query: 806  LPFAEVTFNGTVCVAKAEGTGGVLNFSTCAEQLLYEVGDPGAYITPDVIVDFQDVTFQPL 627
            LPFAEVTF+G VCVAKAE TGGVL+FSTCA+QLLYEVGDP AYITPDVIVD + VTFQPL
Sbjct: 248  LPFAEVTFDGKVCVAKAESTGGVLDFSTCAQQLLYEVGDPSAYITPDVIVDVRHVTFQPL 307

Query: 626  SDSKVLCFGAKPSPEPIPQNLLLLRSKECGWKGWGEISYGGYECAQRAKAAEFLVRAWME 447
            S+SKVLC GAKPSPE +PQ LLLL SK+CGWKGWGEISYGGYE  QRAKAAEFLVRAWM+
Sbjct: 308  SNSKVLCCGAKPSPESVPQKLLLLGSKDCGWKGWGEISYGGYESVQRAKAAEFLVRAWMD 367

Query: 446  ESYPGISKNILSYIIGLDSLKTMCIGNILSKPCEDIRLRIDGLFEKEEHAIHFTKEFTAL 267
            E YPGI+ +I+SYIIGLDS+K  CIG I+ KP ED+RLR+DGLFEKEEHA+ FTKEFTAL
Sbjct: 368  ELYPGINNHIISYIIGLDSVKATCIGEIMLKPSEDVRLRMDGLFEKEEHAVQFTKEFTAL 427

Query: 266  YTNXXXXXXXXXXGWKKEIFLEKGLVGREYIHWQVSVARXXXXXXXXXXXSHVATNQNSS 87
            YTN          G KKEIFLEKGLVGREY+ WQ+S+AR           S   T QN +
Sbjct: 428  YTNGPAGGGGISTGCKKEIFLEKGLVGREYVRWQISMARNNIISPTDKSASRSVTKQNYA 487

Query: 86   RHESNSPSVPRETIHNSSKESLTTE 12
              ES+S  VPRET+ ++S+ES+  E
Sbjct: 488  WLESDSIPVPRETVDSASQESVAQE 512


>ref|XP_022870762.1| uncharacterized protein LOC111390006 isoform X2 [Olea europaea var.
            sylvestris]
          Length = 643

 Score =  346 bits (887), Expect = e-112
 Identities = 175/266 (65%), Positives = 200/266 (75%)
 Frame = -3

Query: 806  LPFAEVTFNGTVCVAKAEGTGGVLNFSTCAEQLLYEVGDPGAYITPDVIVDFQDVTFQPL 627
            LPFAEVTF+GTV VAKAE T GVLNFSTC+EQLLYEVGDPGAYITPDVI+D +DV+FQPL
Sbjct: 239  LPFAEVTFDGTVHVAKAEATAGVLNFSTCSEQLLYEVGDPGAYITPDVIIDIRDVSFQPL 298

Query: 626  SDSKVLCFGAKPSPEPIPQNLLLLRSKECGWKGWGEISYGGYECAQRAKAAEFLVRAWME 447
            S SKV C GAKPS EP+PQ +L+L SK+CGWKGWGEISYGGYE  +RAKAAEFLVRAWME
Sbjct: 299  SKSKVHCSGAKPSSEPLPQKMLVLSSKDCGWKGWGEISYGGYESVKRAKAAEFLVRAWME 358

Query: 446  ESYPGISKNILSYIIGLDSLKTMCIGNILSKPCEDIRLRIDGLFEKEEHAIHFTKEFTAL 267
            E YPG SK ILSYIIG DSLK +   N L +  EDIRLR+DGLFE+E+HAI F KEFTAL
Sbjct: 359  EMYPGTSKRILSYIIGFDSLKAVSTDNKLPRTSEDIRLRMDGLFEQEKHAIQFIKEFTAL 418

Query: 266  YTNXXXXXXXXXXGWKKEIFLEKGLVGREYIHWQVSVARXXXXXXXXXXXSHVATNQNSS 87
            YTN          G KKEI+LEKGLVGREY++WQ++ AR                 Q  +
Sbjct: 419  YTNGPAGGGGISTGHKKEIYLEKGLVGREYVYWQIAAARNNVISSIDQNVDSKIKIQTGT 478

Query: 86   RHESNSPSVPRETIHNSSKESLTTEN 9
             HES+S  V RET ++ SKE L  E+
Sbjct: 479  YHESDSQPVSRETTYSLSKELLVPES 504


>ref|XP_022870761.1| uncharacterized protein LOC111390006 isoform X1 [Olea europaea var.
            sylvestris]
          Length = 655

 Score =  346 bits (887), Expect = e-112
 Identities = 175/266 (65%), Positives = 200/266 (75%)
 Frame = -3

Query: 806  LPFAEVTFNGTVCVAKAEGTGGVLNFSTCAEQLLYEVGDPGAYITPDVIVDFQDVTFQPL 627
            LPFAEVTF+GTV VAKAE T GVLNFSTC+EQLLYEVGDPGAYITPDVI+D +DV+FQPL
Sbjct: 251  LPFAEVTFDGTVHVAKAEATAGVLNFSTCSEQLLYEVGDPGAYITPDVIIDIRDVSFQPL 310

Query: 626  SDSKVLCFGAKPSPEPIPQNLLLLRSKECGWKGWGEISYGGYECAQRAKAAEFLVRAWME 447
            S SKV C GAKPS EP+PQ +L+L SK+CGWKGWGEISYGGYE  +RAKAAEFLVRAWME
Sbjct: 311  SKSKVHCSGAKPSSEPLPQKMLVLSSKDCGWKGWGEISYGGYESVKRAKAAEFLVRAWME 370

Query: 446  ESYPGISKNILSYIIGLDSLKTMCIGNILSKPCEDIRLRIDGLFEKEEHAIHFTKEFTAL 267
            E YPG SK ILSYIIG DSLK +   N L +  EDIRLR+DGLFE+E+HAI F KEFTAL
Sbjct: 371  EMYPGTSKRILSYIIGFDSLKAVSTDNKLPRTSEDIRLRMDGLFEQEKHAIQFIKEFTAL 430

Query: 266  YTNXXXXXXXXXXGWKKEIFLEKGLVGREYIHWQVSVARXXXXXXXXXXXSHVATNQNSS 87
            YTN          G KKEI+LEKGLVGREY++WQ++ AR                 Q  +
Sbjct: 431  YTNGPAGGGGISTGHKKEIYLEKGLVGREYVYWQIAAARNNVISSIDQNVDSKIKIQTGT 490

Query: 86   RHESNSPSVPRETIHNSSKESLTTEN 9
             HES+S  V RET ++ SKE L  E+
Sbjct: 491  YHESDSQPVSRETTYSLSKELLVPES 516


>gb|EYU35076.1| hypothetical protein MIMGU_mgv1a002868mg [Erythranthe guttata]
          Length = 629

 Score =  335 bits (858), Expect = e-108
 Identities = 166/247 (67%), Positives = 189/247 (76%)
 Frame = -3

Query: 806 LPFAEVTFNGTVCVAKAEGTGGVLNFSTCAEQLLYEVGDPGAYITPDVIVDFQDVTFQPL 627
           LPFAEVTF+G +CVAKAE +GGVLNF+TCAEQLLYE+GDP AYITPDVI+D QDVTFQPL
Sbjct: 248 LPFAEVTFDGKICVAKAEASGGVLNFNTCAEQLLYEIGDPSAYITPDVIIDIQDVTFQPL 307

Query: 626 SDSKVLCFGAKPSPEPIPQNLLLLRSKECGWKGWGEISYGGYECAQRAKAAEFLVRAWME 447
           S+SKVLC GAKPSP  IPQ LLLLRSK+ GWKGWGEISYGGY   QRAKAAEFLVRAW+E
Sbjct: 308 SESKVLCLGAKPSPASIPQKLLLLRSKDNGWKGWGEISYGGYASIQRAKAAEFLVRAWVE 367

Query: 446 ESYPGISKNILSYIIGLDSLKTMCIGNILSKPCEDIRLRIDGLFEKEEHAIHFTKEFTAL 267
           E YPG S  I+SYIIGLDSLKT C+ ++ SK  EDIRLR+DGLFEKEEHAI  TKEFTAL
Sbjct: 368 ELYPGTSNKIVSYIIGLDSLKTSCVEDLSSKTSEDIRLRMDGLFEKEEHAIQLTKEFTAL 427

Query: 266 YTNXXXXXXXXXXGWKKEIFLEKGLVGREYIHWQVSVARXXXXXXXXXXXSHVATNQNSS 87
           YTN          G +KEIFLEK LV R+++HWQ   AR           + +   + +S
Sbjct: 428 YTNGPAGGGGISTGHRKEIFLEKALVERKHVHWQTYAARNNNITSLSTTKNTIHVGKENS 487

Query: 86  RHESNSP 66
             ES +P
Sbjct: 488 TKESRAP 494


>ref|XP_012840183.1| PREDICTED: uncharacterized protein LOC105960541 [Erythranthe
           guttata]
          Length = 639

 Score =  335 bits (858), Expect = e-108
 Identities = 166/247 (67%), Positives = 189/247 (76%)
 Frame = -3

Query: 806 LPFAEVTFNGTVCVAKAEGTGGVLNFSTCAEQLLYEVGDPGAYITPDVIVDFQDVTFQPL 627
           LPFAEVTF+G +CVAKAE +GGVLNF+TCAEQLLYE+GDP AYITPDVI+D QDVTFQPL
Sbjct: 258 LPFAEVTFDGKICVAKAEASGGVLNFNTCAEQLLYEIGDPSAYITPDVIIDIQDVTFQPL 317

Query: 626 SDSKVLCFGAKPSPEPIPQNLLLLRSKECGWKGWGEISYGGYECAQRAKAAEFLVRAWME 447
           S+SKVLC GAKPSP  IPQ LLLLRSK+ GWKGWGEISYGGY   QRAKAAEFLVRAW+E
Sbjct: 318 SESKVLCLGAKPSPASIPQKLLLLRSKDNGWKGWGEISYGGYASIQRAKAAEFLVRAWVE 377

Query: 446 ESYPGISKNILSYIIGLDSLKTMCIGNILSKPCEDIRLRIDGLFEKEEHAIHFTKEFTAL 267
           E YPG S  I+SYIIGLDSLKT C+ ++ SK  EDIRLR+DGLFEKEEHAI  TKEFTAL
Sbjct: 378 ELYPGTSNKIVSYIIGLDSLKTSCVEDLSSKTSEDIRLRMDGLFEKEEHAIQLTKEFTAL 437

Query: 266 YTNXXXXXXXXXXGWKKEIFLEKGLVGREYIHWQVSVARXXXXXXXXXXXSHVATNQNSS 87
           YTN          G +KEIFLEK LV R+++HWQ   AR           + +   + +S
Sbjct: 438 YTNGPAGGGGISTGHRKEIFLEKALVERKHVHWQTYAARNNNITSLSTTKNTIHVGKENS 497

Query: 86  RHESNSP 66
             ES +P
Sbjct: 498 TKESRAP 504


>gb|KZV57637.1| hypothetical protein F511_03097 [Dorcoceras hygrometricum]
          Length = 656

 Score =  327 bits (838), Expect = e-105
 Identities = 169/264 (64%), Positives = 197/264 (74%), Gaps = 3/264 (1%)
 Frame = -3

Query: 806  LPFAEVTFNGTVCVAKAEGTGGVLNFSTCAEQLLYEVGDPGAYITPDVIVDFQDVTFQPL 627
            LPFAEVTF+GTV VAKAEGTGGV+NF TCAEQLLYE+GDP AYITPDVI+DF++VTFQPL
Sbjct: 257  LPFAEVTFDGTVSVAKAEGTGGVINFCTCAEQLLYEIGDPSAYITPDVIIDFRNVTFQPL 316

Query: 626  SDSKVLCFGAKPSPEPIPQNLLLLRSKECGWKGWGEISYGGYECAQRAKAAEFLVRAWME 447
            S+SKVLC GAKPS EP+PQ +L L SK+ GWKGWGEISYGG E  QRAKAAEFLVRAW++
Sbjct: 317  SNSKVLCSGAKPSFEPVPQKMLCLGSKDGGWKGWGEISYGGRESIQRAKAAEFLVRAWID 376

Query: 446  ESYPGISKNILSYIIGLDSLKTMCIGNILSKPCEDIRLRIDGLFEKEEHAIHFTKEFTAL 267
            E YPGIS  I SYIIGLDSLK   I + L K  EDIR R+DG+FEKEEHA+ FTKEF AL
Sbjct: 377  EIYPGISSCIFSYIIGLDSLKASSIDDTLPKNIEDIRFRMDGVFEKEEHAVQFTKEFMAL 436

Query: 266  YTNXXXXXXXXXXGWKKEIFLEKGLVGREYIHWQVSVARXXXXXXXXXXXSHVATNQNSS 87
            YTN          G +KEIFLEKGLV R+Y+HWQ+ V R           S   ++  S 
Sbjct: 437  YTNGPAGGGGISTGCRKEIFLEKGLVDRKYVHWQIGVVRNNASNSTDQQTS--ISSSASK 494

Query: 86   RHESNSPS---VPRETIHNSSKES 24
            +H  N P+   + + T +NSS+ES
Sbjct: 495  KHIYNEPNLKCIEKGTTYNSSEES 518


>ref|XP_022870763.1| uncharacterized protein LOC111390006 isoform X3 [Olea europaea var.
           sylvestris]
          Length = 628

 Score =  325 bits (834), Expect = e-104
 Identities = 165/255 (64%), Positives = 189/255 (74%)
 Frame = -3

Query: 773 VCVAKAEGTGGVLNFSTCAEQLLYEVGDPGAYITPDVIVDFQDVTFQPLSDSKVLCFGAK 594
           V VAKAE T GVLNFSTC+EQLLYEVGDPGAYITPDVI+D +DV+FQPLS SKV C GAK
Sbjct: 235 VHVAKAEATAGVLNFSTCSEQLLYEVGDPGAYITPDVIIDIRDVSFQPLSKSKVHCSGAK 294

Query: 593 PSPEPIPQNLLLLRSKECGWKGWGEISYGGYECAQRAKAAEFLVRAWMEESYPGISKNIL 414
           PS EP+PQ +L+L SK+CGWKGWGEISYGGYE  +RAKAAEFLVRAWMEE YPG SK IL
Sbjct: 295 PSSEPLPQKMLVLSSKDCGWKGWGEISYGGYESVKRAKAAEFLVRAWMEEMYPGTSKRIL 354

Query: 413 SYIIGLDSLKTMCIGNILSKPCEDIRLRIDGLFEKEEHAIHFTKEFTALYTNXXXXXXXX 234
           SYIIG DSLK +   N L +  EDIRLR+DGLFE+E+HAI F KEFTALYTN        
Sbjct: 355 SYIIGFDSLKAVSTDNKLPRTSEDIRLRMDGLFEQEKHAIQFIKEFTALYTNGPAGGGGI 414

Query: 233 XXGWKKEIFLEKGLVGREYIHWQVSVARXXXXXXXXXXXSHVATNQNSSRHESNSPSVPR 54
             G KKEI+LEKGLVGREY++WQ++ AR                 Q  + HES+S  V R
Sbjct: 415 STGHKKEIYLEKGLVGREYVYWQIAAARNNVISSIDQNVDSKIKIQTGTYHESDSQPVSR 474

Query: 53  ETIHNSSKESLTTEN 9
           ET ++ SKE L  E+
Sbjct: 475 ETTYSLSKELLVPES 489


>gb|EOX96023.1| Uncharacterized protein TCM_005376 isoform 2 [Theobroma cacao]
 gb|EOX96025.1| Uncharacterized protein TCM_005376 isoform 2 [Theobroma cacao]
          Length = 448

 Score =  314 bits (804), Expect = e-102
 Identities = 157/264 (59%), Positives = 187/264 (70%), Gaps = 2/264 (0%)
 Frame = -3

Query: 806 LPFAEVTFNGTVCVAKAEGTGGVLNFSTCAEQLLYEVGDPGAYITPDVIVDFQDVTFQPL 627
           LP+AE++F+G VCV KAEG+GGVLNFSTCAEQLLYEVGDP AYITPDV++DFQ V+FQPL
Sbjct: 56  LPYAEISFSGEVCVMKAEGSGGVLNFSTCAEQLLYEVGDPSAYITPDVVIDFQGVSFQPL 115

Query: 626 SDSKVLCFGAKPSPEPIPQNLLLLRSKECGWKGWGEISYGGYECAQRAKAAEFLVRAWME 447
           + SKVLC GAKPS  P+P  LL L  K+CGWKGWGEISYGGYEC +RAKAAEFLVR+WME
Sbjct: 116 TSSKVLCIGAKPSAHPVPDKLLQLVPKDCGWKGWGEISYGGYECVKRAKAAEFLVRSWME 175

Query: 446 ESYPGISKNILSYIIGLDSLKTMCIGNILS--KPCEDIRLRIDGLFEKEEHAIHFTKEFT 273
           E +PG+S  +LSYIIGLDSLK   I N  S  K  EDIRLR+DGLF++++HA    KEFT
Sbjct: 176 EVFPGVSCCVLSYIIGLDSLKATSIDNYSSTWKASEDIRLRMDGLFQEKKHAEQLVKEFT 235

Query: 272 ALYTNXXXXXXXXXXGWKKEIFLEKGLVGREYIHWQVSVARXXXXXXXXXXXSHVATNQN 93
           ALYTN          G KKEI LEK L+GRE+I W+++  +                 ++
Sbjct: 236 ALYTNGPASGGGISTGLKKEIVLEKQLIGREHIFWRIAAKQTEVSESKCQKHVFRDVMKD 295

Query: 92  SSRHESNSPSVPRETIHNSSKESL 21
              HE   P  P E IHNSS   +
Sbjct: 296 CVLHEPTLPPFPEEDIHNSSSPEI 319


>gb|EOX96024.1| Uncharacterized protein TCM_005376 isoform 3 [Theobroma cacao]
          Length = 494

 Score =  314 bits (804), Expect = e-101
 Identities = 157/264 (59%), Positives = 187/264 (70%), Gaps = 2/264 (0%)
 Frame = -3

Query: 806 LPFAEVTFNGTVCVAKAEGTGGVLNFSTCAEQLLYEVGDPGAYITPDVIVDFQDVTFQPL 627
           LP+AE++F+G VCV KAEG+GGVLNFSTCAEQLLYEVGDP AYITPDV++DFQ V+FQPL
Sbjct: 102 LPYAEISFSGEVCVMKAEGSGGVLNFSTCAEQLLYEVGDPSAYITPDVVIDFQGVSFQPL 161

Query: 626 SDSKVLCFGAKPSPEPIPQNLLLLRSKECGWKGWGEISYGGYECAQRAKAAEFLVRAWME 447
           + SKVLC GAKPS  P+P  LL L  K+CGWKGWGEISYGGYEC +RAKAAEFLVR+WME
Sbjct: 162 TSSKVLCIGAKPSAHPVPDKLLQLVPKDCGWKGWGEISYGGYECVKRAKAAEFLVRSWME 221

Query: 446 ESYPGISKNILSYIIGLDSLKTMCIGNILS--KPCEDIRLRIDGLFEKEEHAIHFTKEFT 273
           E +PG+S  +LSYIIGLDSLK   I N  S  K  EDIRLR+DGLF++++HA    KEFT
Sbjct: 222 EVFPGVSCCVLSYIIGLDSLKATSIDNYSSTWKASEDIRLRMDGLFQEKKHAEQLVKEFT 281

Query: 272 ALYTNXXXXXXXXXXGWKKEIFLEKGLVGREYIHWQVSVARXXXXXXXXXXXSHVATNQN 93
           ALYTN          G KKEI LEK L+GRE+I W+++  +                 ++
Sbjct: 282 ALYTNGPASGGGISTGLKKEIVLEKQLIGREHIFWRIAAKQTEVSESKCQKHVFRDVMKD 341

Query: 92  SSRHESNSPSVPRETIHNSSKESL 21
              HE   P  P E IHNSS   +
Sbjct: 342 CVLHEPTLPPFPEEDIHNSSSPEI 365


>gb|EOX96026.1| Uncharacterized protein TCM_005376 isoform 5 [Theobroma cacao]
          Length = 449

 Score =  309 bits (792), Expect = e-100
 Identities = 157/265 (59%), Positives = 187/265 (70%), Gaps = 3/265 (1%)
 Frame = -3

Query: 806 LPFAEVTFNGTVCVAKAEGTGGVLNFSTCAEQLLYEVGDPGAYITPDVIVDFQDVTFQPL 627
           LP+AE++F+G VCV KAEG+GGVLNFSTCAEQLLYEVGDP AYITPDV++DFQ V+FQPL
Sbjct: 56  LPYAEISFSGEVCVMKAEGSGGVLNFSTCAEQLLYEVGDPSAYITPDVVIDFQGVSFQPL 115

Query: 626 SDSKVLCFGAKPSPEPIPQNLLLLRSKECGWKGWGEISYGGYECAQRAKAAEFLVRAWME 447
           + SKVLC GAKPS  P+P  LL L  K+CGWKGWGEISYGGYEC +RAKAAEFLVR+WME
Sbjct: 116 TSSKVLCIGAKPSAHPVPDKLLQLVPKDCGWKGWGEISYGGYECVKRAKAAEFLVRSWME 175

Query: 446 ESYPGISKNILSYIIGLDSLKTMCIGNILS--KPCEDIRLRIDGLFEKEEHAIHFTKEFT 273
           E +PG+S  +LSYIIGLDSLK   I N  S  K  EDIRLR+DGLF++++HA    KEFT
Sbjct: 176 EVFPGVSCCVLSYIIGLDSLKATSIDNYSSTWKASEDIRLRMDGLFQEKKHAEQLVKEFT 235

Query: 272 ALYTN-XXXXXXXXXXGWKKEIFLEKGLVGREYIHWQVSVARXXXXXXXXXXXSHVATNQ 96
           ALYTN           G KKEI LEK L+GRE+I W+++  +                 +
Sbjct: 236 ALYTNGPASGGGISSTGLKKEIVLEKQLIGREHIFWRIAAKQTEVSESKCQKHVFRDVMK 295

Query: 95  NSSRHESNSPSVPRETIHNSSKESL 21
           +   HE   P  P E IHNSS   +
Sbjct: 296 DCVLHEPTLPPFPEEDIHNSSSPEI 320


>ref|XP_017969521.1| PREDICTED: uncharacterized protein LOC18614176 isoform X2 [Theobroma
            cacao]
          Length = 641

 Score =  314 bits (804), Expect = e-100
 Identities = 157/264 (59%), Positives = 187/264 (70%), Gaps = 2/264 (0%)
 Frame = -3

Query: 806  LPFAEVTFNGTVCVAKAEGTGGVLNFSTCAEQLLYEVGDPGAYITPDVIVDFQDVTFQPL 627
            LP+AE++F+G VCV KAEG+GGVLNFSTCAEQLLYEVGDP AYITPDV++DFQ V+FQPL
Sbjct: 249  LPYAEISFSGEVCVMKAEGSGGVLNFSTCAEQLLYEVGDPSAYITPDVVIDFQGVSFQPL 308

Query: 626  SDSKVLCFGAKPSPEPIPQNLLLLRSKECGWKGWGEISYGGYECAQRAKAAEFLVRAWME 447
            + SKVLC GAKPS  P+P  LL L  K+CGWKGWGEISYGGYEC +RAKAAEFLVR+WME
Sbjct: 309  TSSKVLCIGAKPSAHPVPDKLLQLVPKDCGWKGWGEISYGGYECVKRAKAAEFLVRSWME 368

Query: 446  ESYPGISKNILSYIIGLDSLKTMCIGNILS--KPCEDIRLRIDGLFEKEEHAIHFTKEFT 273
            E +PG+S  +LSYIIGLDSLK   I N  S  K  EDIRLR+DGLF++++HA    KEFT
Sbjct: 369  EVFPGVSCCVLSYIIGLDSLKATSIDNYSSTWKASEDIRLRMDGLFQEKKHAEQLVKEFT 428

Query: 272  ALYTNXXXXXXXXXXGWKKEIFLEKGLVGREYIHWQVSVARXXXXXXXXXXXSHVATNQN 93
            ALYTN          G KKEI LEK L+GRE+I W+++  +                 ++
Sbjct: 429  ALYTNGPASGGGISTGLKKEIVLEKQLIGREHIFWRIAAKQTEVSESKCQKHVFRDVMKD 488

Query: 92   SSRHESNSPSVPRETIHNSSKESL 21
               HE   P  P E IHNSS   +
Sbjct: 489  CVLHEPTLPPFPEEDIHNSSSPEI 512


>ref|XP_007051865.1| PREDICTED: uncharacterized protein LOC18614176 isoform X1 [Theobroma
            cacao]
 gb|EOX96022.1| Uncharacterized protein TCM_005376 isoform 1 [Theobroma cacao]
          Length = 642

 Score =  314 bits (804), Expect = e-100
 Identities = 157/264 (59%), Positives = 187/264 (70%), Gaps = 2/264 (0%)
 Frame = -3

Query: 806  LPFAEVTFNGTVCVAKAEGTGGVLNFSTCAEQLLYEVGDPGAYITPDVIVDFQDVTFQPL 627
            LP+AE++F+G VCV KAEG+GGVLNFSTCAEQLLYEVGDP AYITPDV++DFQ V+FQPL
Sbjct: 250  LPYAEISFSGEVCVMKAEGSGGVLNFSTCAEQLLYEVGDPSAYITPDVVIDFQGVSFQPL 309

Query: 626  SDSKVLCFGAKPSPEPIPQNLLLLRSKECGWKGWGEISYGGYECAQRAKAAEFLVRAWME 447
            + SKVLC GAKPS  P+P  LL L  K+CGWKGWGEISYGGYEC +RAKAAEFLVR+WME
Sbjct: 310  TSSKVLCIGAKPSAHPVPDKLLQLVPKDCGWKGWGEISYGGYECVKRAKAAEFLVRSWME 369

Query: 446  ESYPGISKNILSYIIGLDSLKTMCIGNILS--KPCEDIRLRIDGLFEKEEHAIHFTKEFT 273
            E +PG+S  +LSYIIGLDSLK   I N  S  K  EDIRLR+DGLF++++HA    KEFT
Sbjct: 370  EVFPGVSCCVLSYIIGLDSLKATSIDNYSSTWKASEDIRLRMDGLFQEKKHAEQLVKEFT 429

Query: 272  ALYTNXXXXXXXXXXGWKKEIFLEKGLVGREYIHWQVSVARXXXXXXXXXXXSHVATNQN 93
            ALYTN          G KKEI LEK L+GRE+I W+++  +                 ++
Sbjct: 430  ALYTNGPASGGGISTGLKKEIVLEKQLIGREHIFWRIAAKQTEVSESKCQKHVFRDVMKD 489

Query: 92   SSRHESNSPSVPRETIHNSSKESL 21
               HE   P  P E IHNSS   +
Sbjct: 490  CVLHEPTLPPFPEEDIHNSSSPEI 513


>ref|XP_022035812.1| uncharacterized protein LOC110937671 isoform X2 [Helianthus annuus]
          Length = 635

 Score =  310 bits (795), Expect = 1e-98
 Identities = 157/266 (59%), Positives = 191/266 (71%)
 Frame = -3

Query: 806 LPFAEVTFNGTVCVAKAEGTGGVLNFSTCAEQLLYEVGDPGAYITPDVIVDFQDVTFQPL 627
           LPFAEV ++G +CVAKA+G+GGVLNFSTCA+QLLYEVGDPGAYITPDV++DFQDV+F  L
Sbjct: 251 LPFAEVNYDGNMCVAKADGSGGVLNFSTCAQQLLYEVGDPGAYITPDVVIDFQDVSFHSL 310

Query: 626 SDSKVLCFGAKPSPEPIPQNLLLLRSKECGWKGWGEISYGGYECAQRAKAAEFLVRAWME 447
           S  KV+C GAKP+   +P NLL L SKE GWKGWGEISYGGY+C +RAKAAE+LV++WME
Sbjct: 311 STDKVVCTGAKPAATSVPDNLLALASKEAGWKGWGEISYGGYKCVERAKAAEYLVKSWME 370

Query: 446 ESYPGISKNILSYIIGLDSLKTMCIGNILSKPCEDIRLRIDGLFEKEEHAIHFTKEFTAL 267
           E  PG S  I+SYIIGLDSLK   + N  +    DIRLR+DGLFE+E+HAI FTKEFTAL
Sbjct: 371 EVCPGTSARIMSYIIGLDSLKATSLEN-YTLVTNDIRLRMDGLFEQEQHAIEFTKEFTAL 429

Query: 266 YTNXXXXXXXXXXGWKKEIFLEKGLVGREYIHWQVSVARXXXXXXXXXXXSHVATNQNSS 87
           YTN          G KKEI LEKGLVGRE+I+W++S                   NQ + 
Sbjct: 430 YTNGPAGGGGISTGHKKEILLEKGLVGREHIYWKISAKE----------------NQPTK 473

Query: 86  RHESNSPSVPRETIHNSSKESLTTEN 9
            +   + ++P ET  N+ KE L+ EN
Sbjct: 474 SNNHQTNTLPTETKSNNPKEFLSPEN 499


>ref|XP_022035811.1| uncharacterized protein LOC110937671 isoform X1 [Helianthus annuus]
 gb|OTG29386.1| hypothetical protein HannXRQ_Chr04g0121651 [Helianthus annuus]
          Length = 642

 Score =  310 bits (795), Expect = 2e-98
 Identities = 157/266 (59%), Positives = 191/266 (71%)
 Frame = -3

Query: 806  LPFAEVTFNGTVCVAKAEGTGGVLNFSTCAEQLLYEVGDPGAYITPDVIVDFQDVTFQPL 627
            LPFAEV ++G +CVAKA+G+GGVLNFSTCA+QLLYEVGDPGAYITPDV++DFQDV+F  L
Sbjct: 258  LPFAEVNYDGNMCVAKADGSGGVLNFSTCAQQLLYEVGDPGAYITPDVVIDFQDVSFHSL 317

Query: 626  SDSKVLCFGAKPSPEPIPQNLLLLRSKECGWKGWGEISYGGYECAQRAKAAEFLVRAWME 447
            S  KV+C GAKP+   +P NLL L SKE GWKGWGEISYGGY+C +RAKAAE+LV++WME
Sbjct: 318  STDKVVCTGAKPAATSVPDNLLALASKEAGWKGWGEISYGGYKCVERAKAAEYLVKSWME 377

Query: 446  ESYPGISKNILSYIIGLDSLKTMCIGNILSKPCEDIRLRIDGLFEKEEHAIHFTKEFTAL 267
            E  PG S  I+SYIIGLDSLK   + N  +    DIRLR+DGLFE+E+HAI FTKEFTAL
Sbjct: 378  EVCPGTSARIMSYIIGLDSLKATSLEN-YTLVTNDIRLRMDGLFEQEQHAIEFTKEFTAL 436

Query: 266  YTNXXXXXXXXXXGWKKEIFLEKGLVGREYIHWQVSVARXXXXXXXXXXXSHVATNQNSS 87
            YTN          G KKEI LEKGLVGRE+I+W++S                   NQ + 
Sbjct: 437  YTNGPAGGGGISTGHKKEILLEKGLVGREHIYWKISAKE----------------NQPTK 480

Query: 86   RHESNSPSVPRETIHNSSKESLTTEN 9
             +   + ++P ET  N+ KE L+ EN
Sbjct: 481  SNNHQTNTLPTETKSNNPKEFLSPEN 506


>ref|XP_022757275.1| uncharacterized protein LOC111304704 isoform X6 [Durio zibethinus]
 ref|XP_022757276.1| uncharacterized protein LOC111304704 isoform X6 [Durio zibethinus]
          Length = 589

 Score =  308 bits (788), Expect = 5e-98
 Identities = 155/267 (58%), Positives = 186/267 (69%), Gaps = 2/267 (0%)
 Frame = -3

Query: 806 LPFAEVTFNGTVCVAKAEGTGGVLNFSTCAEQLLYEVGDPGAYITPDVIVDFQDVTFQPL 627
           LP+AE++F G VCV KAEG+GGVLNFSTCAEQLLYEVGDP +YITPDVIVDF+ V+FQPL
Sbjct: 193 LPYAEISFTGKVCVMKAEGSGGVLNFSTCAEQLLYEVGDPSSYITPDVIVDFRGVSFQPL 252

Query: 626 SDSKVLCFGAKPSPEPIPQNLLLLRSKECGWKGWGEISYGGYECAQRAKAAEFLVRAWME 447
           S SK+LC GAKPS  P+P  LL L  KECGWKGWGEISYGGYEC +RAKAAEFLV++WME
Sbjct: 253 SSSKILCIGAKPSAHPVPDKLLQLVPKECGWKGWGEISYGGYECVKRAKAAEFLVKSWME 312

Query: 446 ESYPGISKNILSYIIGLDSLKTMCIGN--ILSKPCEDIRLRIDGLFEKEEHAIHFTKEFT 273
           E++P +S  +LSYIIGLDSLK   I N  +  +  EDIRLR+DGLF++++HA    KEFT
Sbjct: 313 EAFPDVSCGVLSYIIGLDSLKATSIDNYPLTWRANEDIRLRMDGLFQEKKHAAQLAKEFT 372

Query: 272 ALYTNXXXXXXXXXXGWKKEIFLEKGLVGREYIHWQVSVARXXXXXXXXXXXSHVATNQN 93
           ALYTN          G KKEI LEK L+GRE+I W+V   R                 + 
Sbjct: 373 ALYTNGPAGGGGISTGLKKEIVLEKQLIGREHIFWRVGAKRTKVGESKCQKHVVEDVMKA 432

Query: 92  SSRHESNSPSVPRETIHNSSKESLTTE 12
           +  HE   P  P+E +HNS  +    E
Sbjct: 433 NVLHEPALPPFPQEDMHNSCLDGSVAE 459


>gb|EPS67204.1| hypothetical protein M569_07572, partial [Genlisea aurea]
          Length = 637

 Score =  309 bits (791), Expect = 6e-98
 Identities = 152/218 (69%), Positives = 176/218 (80%)
 Frame = -3

Query: 806 LPFAEVTFNGTVCVAKAEGTGGVLNFSTCAEQLLYEVGDPGAYITPDVIVDFQDVTFQPL 627
           LPFAEVTF+G VCVAK EG+GGVL+F+TCAEQLLYEVGDPGAYITPDVIVDF DVTF+PL
Sbjct: 277 LPFAEVTFDGKVCVAKPEGSGGVLSFATCAEQLLYEVGDPGAYITPDVIVDFLDVTFEPL 336

Query: 626 SDSKVLCFGAKPSPEPIPQNLLLLRSKECGWKGWGEISYGGYECAQRAKAAEFLVRAWME 447
           S  +V+CFGAKPSPE IPQ LL+LRSKE GWKGWGEISYGG E  +RAKAAEFLVRAWME
Sbjct: 337 SADRVVCFGAKPSPESIPQELLMLRSKEKGWKGWGEISYGGRESIRRAKAAEFLVRAWME 396

Query: 446 ESYPGISKNILSYIIGLDSLKTMCIGNILSKPCEDIRLRIDGLFEKEEHAIHFTKEFTAL 267
           ES PG  + ++SYIIG DSL+     ++ SK   D+RLR+DGLFE+EEHA+ F KEF AL
Sbjct: 397 ESCPGKGEKVISYIIGFDSLRIPSPDDLHSK-VSDVRLRMDGLFEEEEHALRFVKEFAAL 455

Query: 266 YTNXXXXXXXXXXGWKKEIFLEKGLVGREYIHWQVSVA 153
           YTN          GWKKEIFL+K LV RE +HW++S +
Sbjct: 456 YTNGPAGGGGISTGWKKEIFLQKELVKRESVHWEISAS 493


>ref|XP_022757274.1| uncharacterized protein LOC111304704 isoform X5 [Durio zibethinus]
          Length = 621

 Score =  308 bits (788), Expect = 1e-97
 Identities = 155/267 (58%), Positives = 186/267 (69%), Gaps = 2/267 (0%)
 Frame = -3

Query: 806  LPFAEVTFNGTVCVAKAEGTGGVLNFSTCAEQLLYEVGDPGAYITPDVIVDFQDVTFQPL 627
            LP+AE++F G VCV KAEG+GGVLNFSTCAEQLLYEVGDP +YITPDVIVDF+ V+FQPL
Sbjct: 225  LPYAEISFTGKVCVMKAEGSGGVLNFSTCAEQLLYEVGDPSSYITPDVIVDFRGVSFQPL 284

Query: 626  SDSKVLCFGAKPSPEPIPQNLLLLRSKECGWKGWGEISYGGYECAQRAKAAEFLVRAWME 447
            S SK+LC GAKPS  P+P  LL L  KECGWKGWGEISYGGYEC +RAKAAEFLV++WME
Sbjct: 285  SSSKILCIGAKPSAHPVPDKLLQLVPKECGWKGWGEISYGGYECVKRAKAAEFLVKSWME 344

Query: 446  ESYPGISKNILSYIIGLDSLKTMCIGN--ILSKPCEDIRLRIDGLFEKEEHAIHFTKEFT 273
            E++P +S  +LSYIIGLDSLK   I N  +  +  EDIRLR+DGLF++++HA    KEFT
Sbjct: 345  EAFPDVSCGVLSYIIGLDSLKATSIDNYPLTWRANEDIRLRMDGLFQEKKHAAQLAKEFT 404

Query: 272  ALYTNXXXXXXXXXXGWKKEIFLEKGLVGREYIHWQVSVARXXXXXXXXXXXSHVATNQN 93
            ALYTN          G KKEI LEK L+GRE+I W+V   R                 + 
Sbjct: 405  ALYTNGPAGGGGISTGLKKEIVLEKQLIGREHIFWRVGAKRTKVGESKCQKHVVEDVMKA 464

Query: 92   SSRHESNSPSVPRETIHNSSKESLTTE 12
            +  HE   P  P+E +HNS  +    E
Sbjct: 465  NVLHEPALPPFPQEDMHNSCLDGSVAE 491


>emb|CDP08843.1| unnamed protein product [Coffea canephora]
          Length = 716

 Score =  310 bits (794), Expect = 1e-97
 Identities = 151/219 (68%), Positives = 177/219 (80%)
 Frame = -3

Query: 806 LPFAEVTFNGTVCVAKAEGTGGVLNFSTCAEQLLYEVGDPGAYITPDVIVDFQDVTFQPL 627
           LPFAEV F+GTVCVAKAEG+ GVLN STCAEQLLYEVGDP AYITPDV+++ QDV+FQPL
Sbjct: 323 LPFAEVRFDGTVCVAKAEGSRGVLNPSTCAEQLLYEVGDPSAYITPDVVINLQDVSFQPL 382

Query: 626 SDSKVLCFGAKPSPEPIPQNLLLLRSKECGWKGWGEISYGGYECAQRAKAAEFLVRAWME 447
           SD KVLC GAKPS EP+P  LLLL SK+ GWKGWGEIS GG++C +RA AAE LVR+WME
Sbjct: 383 SDCKVLCSGAKPSAEPLPDKLLLLASKDQGWKGWGEISCGGHKCVERANAAEHLVRSWME 442

Query: 446 ESYPGISKNILSYIIGLDSLKTMCIGNILSKPCEDIRLRIDGLFEKEEHAIHFTKEFTAL 267
           E+YPG++ NI+SYIIGLDSL+       L++  EDIRLR+DGLFE+E+HAI FTKEFTAL
Sbjct: 443 ETYPGVNNNIISYIIGLDSLRANSKIGGLTRAVEDIRLRMDGLFEREDHAIQFTKEFTAL 502

Query: 266 YTNXXXXXXXXXXGWKKEIFLEKGLVGREYIHWQVSVAR 150
           YTN          G+KKEI LEKGLV RE++HW +  AR
Sbjct: 503 YTNGPAGGGGISIGYKKEIILEKGLVCREHVHWHIMAAR 541