BLASTX nr result

ID: Ophiopogon27_contig00025027 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ophiopogon27_contig00025027
         (1043 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ONK79247.1| uncharacterized protein A4U43_C01F4420 [Asparagus...   238   1e-69
ref|XP_020252860.1| uncharacterized protein LOC109830112 [Aspara...   238   1e-69
gb|ONK69025.1| uncharacterized protein A4U43_C05F18500 [Asparagu...   229   3e-67
ref|XP_008789379.1| PREDICTED: uncharacterized protein LOC103706...   222   9e-63
ref|XP_010920169.1| PREDICTED: uncharacterized protein LOC105044...   212   4e-59
ref|XP_020674452.1| uncharacterized protein LOC110093799 [Dendro...   191   2e-51
ref|XP_010277003.1| PREDICTED: uncharacterized protein LOC104611...   166   2e-42
ref|XP_010277001.1| PREDICTED: uncharacterized protein LOC104611...   166   2e-42
ref|XP_018683632.1| PREDICTED: uncharacterized protein LOC103989...   162   5e-41
ref|XP_020584723.1| uncharacterized protein LOC110027575 [Phalae...   157   2e-39
gb|OVA13617.1| Protein of unknown function DUF688 [Macleaya cord...   155   1e-38
ref|XP_007048702.2| PREDICTED: uncharacterized protein LOC186120...   152   3e-37
gb|EOX92859.1| Uncharacterized protein TCM_001716 isoform 2 [The...   152   3e-37
gb|EOX92858.1| Uncharacterized protein TCM_001716 isoform 1 [The...   152   4e-37
gb|POE99680.1| hypothetical protein CFP56_19009 [Quercus suber]       149   3e-36
ref|XP_023921054.1| uncharacterized protein LOC112032522 [Quercu...   149   3e-36
ref|XP_021274682.1| LOW QUALITY PROTEIN: uncharacterized protein...   146   3e-35
gb|OMO89020.1| hypothetical protein CCACVL1_08059, partial [Corc...   142   4e-34
gb|OMO61058.1| hypothetical protein COLO4_33589 [Corchorus olito...   142   6e-34
gb|PKA46869.1| hypothetical protein AXF42_Ash015763 [Apostasia s...   140   1e-33

>gb|ONK79247.1| uncharacterized protein A4U43_C01F4420 [Asparagus officinalis]
          Length = 621

 Score =  238 bits (607), Expect = 1e-69
 Identities = 139/249 (55%), Positives = 156/249 (62%), Gaps = 11/249 (4%)
 Frame = -1

Query: 716 FSDALETLSRTESYFMNCSISGLSETTERMQPSGSFSTDPQVRDFMMGRFLPAAKAVATA 537
           F DA+ETLS TES F+NCS+SGLSE  E    SGSFSTDPQVRDFMMGRFLPAAKA   A
Sbjct: 149 FCDAVETLSPTESSFVNCSVSGLSEMPES---SGSFSTDPQVRDFMMGRFLPAAKAADVA 205

Query: 536 S------PHSSSRKPAAAAVVLAREPVRTLEERAENGEQRRVRFPNGANFAQEGEGASCS 375
                  PH+SSRKP           +       +NG+Q RVRFPNGA F +E E     
Sbjct: 206 KAMAADVPHASSRKPTPTPNWDRNRVI------IQNGDQNRVRFPNGAEFIEEDEDDE-- 257

Query: 374 CGXXXXXXXXXXXXXXXHLTAKGCGFLTKFCLKSSFCLLNPVPGMKVSGNRLPPTSVRQI 195
                            +L  KGCG + +FCLK S CLLNPV GMKVSG RLPP+SVR+ 
Sbjct: 258 -----DEEEEEEYYGNGNLLLKGCGLIPRFCLKGSLCLLNPVTGMKVSGRRLPPSSVRRN 312

Query: 194 RGPQVKAMQNGSLGQTEEEYSWEAVYKHKL-----MHGNPPPQGEDGSKLKSESNHLTYC 30
             PQ KAM++ S GQ EEEYSWEAVYKHKL     +HG+ P Q ED SKL SESN  T  
Sbjct: 313 YVPQFKAMEHESFGQVEEEYSWEAVYKHKLLHGHMLHGHSPIQEEDVSKLTSESNQFTCW 372

Query: 29  SDSLTADGS 3
           SDSLT DGS
Sbjct: 373 SDSLTTDGS 381


>ref|XP_020252860.1| uncharacterized protein LOC109830112 [Asparagus officinalis]
          Length = 629

 Score =  238 bits (607), Expect = 1e-69
 Identities = 139/249 (55%), Positives = 156/249 (62%), Gaps = 11/249 (4%)
 Frame = -1

Query: 716 FSDALETLSRTESYFMNCSISGLSETTERMQPSGSFSTDPQVRDFMMGRFLPAAKAVATA 537
           F DA+ETLS TES F+NCS+SGLSE  E    SGSFSTDPQVRDFMMGRFLPAAKA   A
Sbjct: 157 FCDAVETLSPTESSFVNCSVSGLSEMPES---SGSFSTDPQVRDFMMGRFLPAAKAADVA 213

Query: 536 S------PHSSSRKPAAAAVVLAREPVRTLEERAENGEQRRVRFPNGANFAQEGEGASCS 375
                  PH+SSRKP           +       +NG+Q RVRFPNGA F +E E     
Sbjct: 214 KAMAADVPHASSRKPTPTPNWDRNRVI------IQNGDQNRVRFPNGAEFIEEDEDDE-- 265

Query: 374 CGXXXXXXXXXXXXXXXHLTAKGCGFLTKFCLKSSFCLLNPVPGMKVSGNRLPPTSVRQI 195
                            +L  KGCG + +FCLK S CLLNPV GMKVSG RLPP+SVR+ 
Sbjct: 266 -----DEEEEEEYYGNGNLLLKGCGLIPRFCLKGSLCLLNPVTGMKVSGRRLPPSSVRRN 320

Query: 194 RGPQVKAMQNGSLGQTEEEYSWEAVYKHKL-----MHGNPPPQGEDGSKLKSESNHLTYC 30
             PQ KAM++ S GQ EEEYSWEAVYKHKL     +HG+ P Q ED SKL SESN  T  
Sbjct: 321 YVPQFKAMEHESFGQVEEEYSWEAVYKHKLLHGHMLHGHSPIQEEDVSKLTSESNQFTCW 380

Query: 29  SDSLTADGS 3
           SDSLT DGS
Sbjct: 381 SDSLTTDGS 389


>gb|ONK69025.1| uncharacterized protein A4U43_C05F18500 [Asparagus officinalis]
          Length = 517

 Score =  229 bits (584), Expect = 3e-67
 Identities = 136/254 (53%), Positives = 156/254 (61%), Gaps = 31/254 (12%)
 Frame = -1

Query: 671 MNCSISGLSETTERMQPSGSFSTDPQVRDFMMGRFLPAAKAVATASPHSSSRKPAAAAVV 492
           MNCS+SGLSE    M+ SGSFSTDP VRDFMMGRFLPA     T  PHSSSRKP A+  +
Sbjct: 1   MNCSVSGLSEAPGEMEASGSFSTDPGVRDFMMGRFLPAXXXXET--PHSSSRKPPAS--I 56

Query: 491 LAREPVRTLEERAENGEQRRVRFPNGANFAQEGEGASCSCGXXXXXXXXXXXXXXXHLTA 312
            AREP + + ER E+GEQRRVRFPNGA+FA E EG SCSC                   +
Sbjct: 57  RAREPGK-MVERTESGEQRRVRFPNGADFAHEEEGTSCSC--DEDEDEEECHDKNRPFLS 113

Query: 311 KGCGFLTKFCLKSSFCLLNPVPGMKVSGNRLPPTSVRQIR-------------------- 192
           KGCG + K      FCLLNP+PGMKVSG RL P SVR++R                    
Sbjct: 114 KGCGLIPK------FCLLNPIPGMKVSGQRLRPDSVRRVRGAGAEEEGLFCVWNPPGSGQ 167

Query: 191 -----------GPQVKAMQNGSLGQTEEEYSWEAVYKHKLMHGNPPPQGEDGSKLKSESN 45
                      G QVK M++GSL + EEEYSWEA+YKHKL++GN P Q ED SKL +ESN
Sbjct: 168 RLSQNSVRKVGGAQVKKMEHGSLVKAEEEYSWEAIYKHKLLNGNTPAQEEDISKL-AESN 226

Query: 44  HLTYCSDSLTADGS 3
           H TY SDSLT D S
Sbjct: 227 HHTYLSDSLTTDQS 240


>ref|XP_008789379.1| PREDICTED: uncharacterized protein LOC103706890 [Phoenix
           dactylifera]
 ref|XP_008789380.1| PREDICTED: uncharacterized protein LOC103706890 [Phoenix
           dactylifera]
 ref|XP_008789381.1| PREDICTED: uncharacterized protein LOC103706890 [Phoenix
           dactylifera]
          Length = 719

 Score =  222 bits (565), Expect = 9e-63
 Identities = 138/247 (55%), Positives = 160/247 (64%), Gaps = 9/247 (3%)
 Frame = -1

Query: 716 FSDALETLSRTESYFMNCSISGLSETTERMQPSGSFSTDPQVRDFMMGRFLPAAKAVATA 537
           FSDAL+TLSRTES+FMNCS+SGLS   E   PSGSFSTDPQVRDFMMGRFLPAA+A+AT 
Sbjct: 191 FSDALDTLSRTESFFMNCSVSGLSGIPESAMPSGSFSTDPQVRDFMMGRFLPAAQAMATG 250

Query: 536 SPHSSSRKPAAAAVVLAREPVRTLEERAENGEQRRV------RFPN-GANFAQE-GEGAS 381
           SP  + RK A+    LAREP     ER  +G+ RR+      + PN G  +AQ+ GEG S
Sbjct: 251 SPQYTFRKAAS----LAREPPMRPAERFVSGDHRRLLPLPYQKRPNFGLQYAQKHGEGDS 306

Query: 380 CSCGXXXXXXXXXXXXXXXHLTAKGCGFLTKFCLKSSFCLLNPVPGMKVSGNRLPPTSVR 201
                              HL +K CG L + CLKSSFCLLNPVPGMKV G RLP    R
Sbjct: 307 -----YDDEEEAEDCDETDHLPSKACGLLPRLCLKSSFCLLNPVPGMKVRG-RLPAPPGR 360

Query: 200 QIRGPQVKAMQNGSLGQTEEEYSWEAVYKHKLMHGNPPPQGEDG-SKLKSESNHLTYCSD 24
           +I GP++K   +GS GQ  +E SWEAV+KHKL      PQ EDG S+  SES  LTY SD
Sbjct: 361 RIGGPRIKTFHHGSFGQDGDEDSWEAVHKHKLGQ-RYQPQVEDGRSRSTSESKQLTYWSD 419

Query: 23  SLTADGS 3
           S TADGS
Sbjct: 420 SPTADGS 426


>ref|XP_010920169.1| PREDICTED: uncharacterized protein LOC105044068 [Elaeis guineensis]
 ref|XP_010920176.1| PREDICTED: uncharacterized protein LOC105044068 [Elaeis guineensis]
          Length = 720

 Score =  212 bits (540), Expect = 4e-59
 Identities = 132/246 (53%), Positives = 151/246 (61%), Gaps = 8/246 (3%)
 Frame = -1

Query: 716 FSDALETLSRTESYFMNCSISGLSETTERMQPSGSFSTDPQVRDFMMGRFLPAAKAVATA 537
           FSDAL+TLSRTES+FMNCS+SGLS   E   PSGSFSTDPQVRDFMMGRFLPAA+A+AT 
Sbjct: 192 FSDALDTLSRTESFFMNCSVSGLSGIPESAMPSGSFSTDPQVRDFMMGRFLPAAQAMATG 251

Query: 536 SPHSSSRKPAAAAVVLAREPVRTLEERAENGEQRR------VRFPNGA-NFAQEGEGASC 378
           SP  + RK        AREP     ER  + + RR       + PN    +AQE EG   
Sbjct: 252 SPQYTFRKGTPP----AREPPTRPAERVVSRDHRRPLPLPYQKRPNFVQQYAQEHEGGD- 306

Query: 377 SCGXXXXXXXXXXXXXXXHLTAKGCGFLTKFCLKSSFCLLNPVPGMKVSGNRLPPTSVRQ 198
                              L +K CG L +FC+KSSFCLLNPVPGMKV   RLP    R+
Sbjct: 307 ---SYDDEEEEEDCDETDRLPSKACGLLPRFCVKSSFCLLNPVPGMKVR-PRLPAPLGRR 362

Query: 197 IRGPQVKAMQNGSLGQTEEEYSWEAVYKHKLMHGNPPPQGEDG-SKLKSESNHLTYCSDS 21
           I  P++K   +GSLG+  +E SWEAVYKHKL      PQ EDG SK  SES  LTY SDS
Sbjct: 363 IGNPRIKTFHHGSLGEAGDEDSWEAVYKHKLGQ-RYQPQVEDGRSKSTSESKQLTYWSDS 421

Query: 20  LTADGS 3
            TADGS
Sbjct: 422 PTADGS 427


>ref|XP_020674452.1| uncharacterized protein LOC110093799 [Dendrobium catenatum]
 gb|PKU77942.1| hypothetical protein MA16_Dca011562 [Dendrobium catenatum]
          Length = 670

 Score =  191 bits (484), Expect = 2e-51
 Identities = 123/244 (50%), Positives = 140/244 (57%), Gaps = 6/244 (2%)
 Frame = -1

Query: 716 FSDALETLSRTESYFMNCSISGLSETTERMQPSGSFSTDPQVRDFMMGRFLPAAKAVATA 537
           FSDA+ETLSRTES+FMNCS+SG+S   +    SGSFSTDPQVRD MM RFLPAA+A+AT 
Sbjct: 143 FSDAVETLSRTESFFMNCSLSGISGIPDAAVSSGSFSTDPQVRDLMMHRFLPAAQAMATG 202

Query: 536 SPHSSSRKPAAAAVVLAREPVRTLEERAENGEQRRVRFP----NGANFAQEGEGASCSCG 369
           SP    RKPA             L ER E+GE R  R P    +  NF   G        
Sbjct: 203 SPQCPMRKPAK------------LVERVEDGENRLRRVPLPYQHRPNFV--GRYPHDEFD 248

Query: 368 XXXXXXXXXXXXXXXHLTAKGCGFLTKFCLKSSFCLLNPVPGMKVSGNR--LPPTSVRQI 195
                          H T+K CG L + C+KSSFCLLNP+PGMKV G R  LPP++ R+I
Sbjct: 249 SSDGDDNGDYYEGSGHFTSKACGLLPRLCMKSSFCLLNPMPGMKVGGRRRGLPPSTSRKI 308

Query: 194 RGPQVKAMQNGSLGQTEEEYSWEAVYKHKLMHGNPPPQGEDGSKLKSESNHLTYCSDSLT 15
                    NG L Q E E  WEAVYKHKL  G  P QGE GSK  S+SN L   SDS T
Sbjct: 309 SS---ALHNNGPLSQAENEL-WEAVYKHKLRMG-IPYQGEGGSKSTSKSNQLGQWSDSQT 363

Query: 14  ADGS 3
            DGS
Sbjct: 364 PDGS 367


>ref|XP_010277003.1| PREDICTED: uncharacterized protein LOC104611577 isoform X2 [Nelumbo
           nucifera]
          Length = 681

 Score =  166 bits (420), Expect = 2e-42
 Identities = 112/245 (45%), Positives = 143/245 (58%), Gaps = 7/245 (2%)
 Frame = -1

Query: 716 FSDALETLSRTESYFMNCSISGLSETT-ERMQPSGSFSTDPQVRDFMMGRFLPAAKAVAT 540
           F+DALETLSRTES F+NCS++G+S       + SG+F TDPQ RDFM+GRFLPAAKAVA 
Sbjct: 167 FTDALETLSRTES-FLNCSVTGMSAWDGPNTRSSGTFLTDPQTRDFMLGRFLPAAKAVAA 225

Query: 539 ASPHSSSRKPAAAAVVLAREPVRTLEERAENGEQRRVRFPNGANFAQ-----EGEGASCS 375
             P  +SRK      +   +P  T  ++  +G+ R  ++    N  Q     EGE  S  
Sbjct: 226 EMPQYASRKQP----LPYEQPRET--KKVVSGDTRPPQYKYRPNMIQQFPQDEGEEES-- 277

Query: 374 CGXXXXXXXXXXXXXXXHLTAKGCGFLTKFCLKSSFCLLNPVPGMKVSGNRLPPTSVRQI 195
                            +L+A  CG   +FCLK SFCLLNPVPGMKV   R+P +SVR++
Sbjct: 278 -----EDEDEDDYGDTGNLSANACGLFPRFCLKGSFCLLNPVPGMKVR-TRVPVSSVRKV 331

Query: 194 RGPQVKAMQNGSLGQTEEEYSWEAVYKHKLMHG-NPPPQGEDGSKLKSESNHLTYCSDSL 18
            G QVK     S  ++++E+SW+AVYKHKL          ED SKL S+SN LTY SDS 
Sbjct: 332 -GKQVKTTYARSHKESKDEHSWDAVYKHKLASRIQRTGVLEDESKLTSQSNRLTYWSDSQ 390

Query: 17  TADGS 3
           T D S
Sbjct: 391 TPDES 395


>ref|XP_010277001.1| PREDICTED: uncharacterized protein LOC104611577 isoform X1 [Nelumbo
           nucifera]
 ref|XP_010277002.1| PREDICTED: uncharacterized protein LOC104611577 isoform X1 [Nelumbo
           nucifera]
 ref|XP_010277004.1| PREDICTED: uncharacterized protein LOC104611577 isoform X1 [Nelumbo
           nucifera]
          Length = 689

 Score =  166 bits (420), Expect = 2e-42
 Identities = 112/245 (45%), Positives = 143/245 (58%), Gaps = 7/245 (2%)
 Frame = -1

Query: 716 FSDALETLSRTESYFMNCSISGLSETT-ERMQPSGSFSTDPQVRDFMMGRFLPAAKAVAT 540
           F+DALETLSRTES F+NCS++G+S       + SG+F TDPQ RDFM+GRFLPAAKAVA 
Sbjct: 167 FTDALETLSRTES-FLNCSVTGMSAWDGPNTRSSGTFLTDPQTRDFMLGRFLPAAKAVAA 225

Query: 539 ASPHSSSRKPAAAAVVLAREPVRTLEERAENGEQRRVRFPNGANFAQ-----EGEGASCS 375
             P  +SRK      +   +P  T  ++  +G+ R  ++    N  Q     EGE  S  
Sbjct: 226 EMPQYASRKQP----LPYEQPRET--KKVVSGDTRPPQYKYRPNMIQQFPQDEGEEES-- 277

Query: 374 CGXXXXXXXXXXXXXXXHLTAKGCGFLTKFCLKSSFCLLNPVPGMKVSGNRLPPTSVRQI 195
                            +L+A  CG   +FCLK SFCLLNPVPGMKV   R+P +SVR++
Sbjct: 278 -----EDEDEDDYGDTGNLSANACGLFPRFCLKGSFCLLNPVPGMKVR-TRVPVSSVRKV 331

Query: 194 RGPQVKAMQNGSLGQTEEEYSWEAVYKHKLMHG-NPPPQGEDGSKLKSESNHLTYCSDSL 18
            G QVK     S  ++++E+SW+AVYKHKL          ED SKL S+SN LTY SDS 
Sbjct: 332 -GKQVKTTYARSHKESKDEHSWDAVYKHKLASRIQRTGVLEDESKLTSQSNRLTYWSDSQ 390

Query: 17  TADGS 3
           T D S
Sbjct: 391 TPDES 395


>ref|XP_018683632.1| PREDICTED: uncharacterized protein LOC103989171 [Musa acuminata
           subsp. malaccensis]
          Length = 656

 Score =  162 bits (409), Expect = 5e-41
 Identities = 109/243 (44%), Positives = 135/243 (55%), Gaps = 5/243 (2%)
 Frame = -1

Query: 716 FSDALETLSRTESYFMNCSISGLSETTERMQPSGSFSTDPQVRDFMMGRFLPAAKAVATA 537
           FSDAL+TLSR ES  MNCS+SG+S   +  + SGS   DPQVRDFMM RFLPAA+A+   
Sbjct: 184 FSDALDTLSRAESLLMNCSVSGVSGMPDAAKLSGSVPKDPQVRDFMMERFLPAAQAMVCE 243

Query: 536 SPHSSSRKPAAAAVVLAREPVRTLEERAENGEQRR----VRFPNGAN-FAQEGEGASCSC 372
           S   + RK A       RE  + +E  A N   RR    +R+ N  + F Q  +      
Sbjct: 244 STQYTFRKAAGP----PREATKPVERVAINDRNRRPPAPMRYHNMPDYFPQYAKELEEGD 299

Query: 371 GXXXXXXXXXXXXXXXHLTAKGCGFLTKFCLKSSFCLLNPVPGMKVSGNRLPPTSVRQIR 192
                           HL +K CG L++FCLK+SFCLLNPV G+K  G RLPP    + R
Sbjct: 300 SNDDDEDDEYSDDDAGHLPSKACGLLSRFCLKNSFCLLNPVSGIKDRG-RLPPPPRGRTR 358

Query: 191 GPQVKAMQNGSLGQTEEEYSWEAVYKHKLMHGNPPPQGEDGSKLKSESNHLTYCSDSLTA 12
            PQ+  +   SLG  E+E SWE VY+HKL   N  PQ  + SKL+SE       SDS TA
Sbjct: 359 DPQLWNLHRLSLGPVEDENSWETVYRHKLGQ-NSRPQVAEVSKLRSE-------SDSQTA 410

Query: 11  DGS 3
           DGS
Sbjct: 411 DGS 413


>ref|XP_020584723.1| uncharacterized protein LOC110027575 [Phalaenopsis equestris]
          Length = 649

 Score =  157 bits (398), Expect = 2e-39
 Identities = 105/248 (42%), Positives = 130/248 (52%), Gaps = 10/248 (4%)
 Frame = -1

Query: 716 FSDALETLSRTESYFMNCSISGLSETTERMQPSGSFSTDPQVRDFMMGRFLPAAKAVATA 537
           FS+A+ETLSR+ES+ MNCSISGLS   +   PSGSFSTDP VR+ MM RFLPAA+A+   
Sbjct: 139 FSEAVETLSRSESFLMNCSISGLSGIPDAFVPSGSFSTDPDVRNLMMDRFLPAAQAMVAG 198

Query: 536 SPHSSSRKPAAAAVVLAREPVRTLEERAENGEQRRVRFPNG--------ANFAQEGEGAS 381
           SP     KPA               ER ENGE R  R P          A + ++  G+S
Sbjct: 199 SPQCPPWKPAKVV------------ERVENGEHRLRRVPLPYQHRPYFVARYPRDELGSS 246

Query: 380 CSCGXXXXXXXXXXXXXXXHLTAKGCGFLTKFCLKSSFCLLNPVPGMKVSG--NRLPPTS 207
                              +  +K CG L + CLK+SFCLLNPV GMKV G   RLPP++
Sbjct: 247 ------DGDDDGDYDEGSGNFASKACGLLPRLCLKNSFCLLNPVTGMKVGGRQRRLPPST 300

Query: 206 VRQIRGPQVKAMQNGSLGQTEEEYSWEAVYKHKLMHGNPPPQGEDGSKLKSESNHLTYCS 27
            ++I         +G L Q E E  WEAVYKHKL  G P  +  +     ++SN     S
Sbjct: 301 SQKIGS---ILHDDGPLSQAENEL-WEAVYKHKLRLGIPFHEEGESKSTTTKSNQHGQLS 356

Query: 26  DSLTADGS 3
           DS T D S
Sbjct: 357 DSQTLDRS 364


>gb|OVA13617.1| Protein of unknown function DUF688 [Macleaya cordata]
          Length = 711

 Score =  155 bits (393), Expect = 1e-38
 Identities = 103/239 (43%), Positives = 136/239 (56%), Gaps = 1/239 (0%)
 Frame = -1

Query: 716 FSDALETLSRTESYFMNCSISGLSETT-ERMQPSGSFSTDPQVRDFMMGRFLPAAKAVAT 540
           F DA +TLSR+ES+F NCS+SG+S        PSG+FSTDPQ RDFMMGRFLPAAKA+A+
Sbjct: 165 FMDAPDTLSRSESFFFNCSVSGVSGLDGPDATPSGTFSTDPQTRDFMMGRFLPAAKAMAS 224

Query: 539 ASPHSSSRKPAAAAVVLAREPVRTLEERAENGEQRRVRFPNGANFAQEGEGASCSCGXXX 360
            +P  + R+     VV  REP      R  N + R         + +E            
Sbjct: 225 DTPQYAPRRQPQP-VVRDREPTPRQVTRVVNAD-RSHPLSQYRPYVEEYVDVIGEEESED 282

Query: 359 XXXXXXXXXXXXHLTAKGCGFLTKFCLKSSFCLLNPVPGMKVSGNRLPPTSVRQIRGPQV 180
                        L+AKGCG   + CL+SS CLLNPVPGMKV  +R P +SVR++R  Q 
Sbjct: 283 EDEDDDDYNEAGTLSAKGCGLFPRLCLRSSLCLLNPVPGMKVR-SRAPMSSVRKVR-TQT 340

Query: 179 KAMQNGSLGQTEEEYSWEAVYKHKLMHGNPPPQGEDGSKLKSESNHLTYCSDSLTADGS 3
           ++  +G+L ++ +E + E  YK+KL+  +     +D SK KSESN LT  S S T DGS
Sbjct: 341 RSSHSGALSRSVDEKNAEVAYKNKLI--SDLQLLDDESKRKSESNELTCWSGSQTPDGS 397


>ref|XP_007048702.2| PREDICTED: uncharacterized protein LOC18612057 [Theobroma cacao]
          Length = 723

 Score =  152 bits (383), Expect = 3e-37
 Identities = 101/247 (40%), Positives = 135/247 (54%), Gaps = 9/247 (3%)
 Frame = -1

Query: 716 FSDALETLSRTESYFMNCSISGLSETT-ERMQPSGSFSTDPQVRDFMMGRFLPAAKAVAT 540
           + DAL+T SRTES+F+NCSISG+S      ++PSG F+TDPQ RDFMMGRFLPAAKAVA+
Sbjct: 161 YVDALDTFSRTESFFLNCSISGVSGFDGPEIKPSGIFTTDPQTRDFMMGRFLPAAKAVAS 220

Query: 539 ASPHSSSRKPAAAAVVLAREPVRTLEERAENGEQRRV------RFPNGA--NFAQEGEGA 384
             P  +SRK       +AREP R +++     +Q+ +      +FPN A  ++ +E EG 
Sbjct: 221 EIPPYASRKQP-----VAREPQRQVKKVVIVDKQQPLYVSSPNKFPNHAQDDWLEESEGE 275

Query: 383 SCSCGXXXXXXXXXXXXXXXHLTAKGCGFLTKFCLKSSFCLLNPVPGMKVSGNRLPPTSV 204
               G                 +AK CG   +F LKSSFCLLNPVPGMK+   + P    
Sbjct: 276 DDYSGSQNS-------------SAKVCGLFPQFLLKSSFCLLNPVPGMKIQAQK-PAKPA 321

Query: 203 RQIRGPQVKAMQNGSLGQTEEEYSWEAVYKHKLMHGNPPPQGEDGSKLKSESNHLTYCSD 24
             +R  Q K+    S  +TE EY+  A  K            ED + LKS S+H++Y SD
Sbjct: 322 HSVRRRQAKSSYLRSGNETESEYAKAATEKGLTRISRTEELIEDKNNLKSGSSHMSYRSD 381

Query: 23  SLTADGS 3
               D +
Sbjct: 382 CQNPDAA 388


>gb|EOX92859.1| Uncharacterized protein TCM_001716 isoform 2 [Theobroma cacao]
          Length = 723

 Score =  152 bits (383), Expect = 3e-37
 Identities = 101/247 (40%), Positives = 135/247 (54%), Gaps = 9/247 (3%)
 Frame = -1

Query: 716 FSDALETLSRTESYFMNCSISGLSETT-ERMQPSGSFSTDPQVRDFMMGRFLPAAKAVAT 540
           + DAL+T SRTES+F+NCSISG+S      ++PSG F+TDPQ RDFMMGRFLPAAKAVA+
Sbjct: 161 YVDALDTFSRTESFFLNCSISGVSGFDGPEIKPSGIFTTDPQTRDFMMGRFLPAAKAVAS 220

Query: 539 ASPHSSSRKPAAAAVVLAREPVRTLEERAENGEQRRV------RFPNGA--NFAQEGEGA 384
             P  +SRK       +AREP R +++     +Q+ +      +FPN A  ++ +E EG 
Sbjct: 221 EIPPYASRKQP-----VAREPQRQVKKVVIVDKQQPLYVSSPNKFPNHAQDDWLEESEGE 275

Query: 383 SCSCGXXXXXXXXXXXXXXXHLTAKGCGFLTKFCLKSSFCLLNPVPGMKVSGNRLPPTSV 204
               G                 +AK CG   +F LKSSFCLLNPVPGMK+   + P    
Sbjct: 276 DDYSGSQNS-------------SAKVCGLFPQFLLKSSFCLLNPVPGMKIQAQK-PAKPA 321

Query: 203 RQIRGPQVKAMQNGSLGQTEEEYSWEAVYKHKLMHGNPPPQGEDGSKLKSESNHLTYCSD 24
             +R  Q K+    S  +TE EY+  A  K            ED + LKS S+H++Y SD
Sbjct: 322 HSVRRRQAKSSYLRSGNETESEYAKAATEKGLTRISRTEELIEDKNNLKSGSSHMSYRSD 381

Query: 23  SLTADGS 3
               D +
Sbjct: 382 CQNPDAA 388


>gb|EOX92858.1| Uncharacterized protein TCM_001716 isoform 1 [Theobroma cacao]
          Length = 759

 Score =  152 bits (383), Expect = 4e-37
 Identities = 101/247 (40%), Positives = 135/247 (54%), Gaps = 9/247 (3%)
 Frame = -1

Query: 716 FSDALETLSRTESYFMNCSISGLSETT-ERMQPSGSFSTDPQVRDFMMGRFLPAAKAVAT 540
           + DAL+T SRTES+F+NCSISG+S      ++PSG F+TDPQ RDFMMGRFLPAAKAVA+
Sbjct: 197 YVDALDTFSRTESFFLNCSISGVSGFDGPEIKPSGIFTTDPQTRDFMMGRFLPAAKAVAS 256

Query: 539 ASPHSSSRKPAAAAVVLAREPVRTLEERAENGEQRRV------RFPNGA--NFAQEGEGA 384
             P  +SRK       +AREP R +++     +Q+ +      +FPN A  ++ +E EG 
Sbjct: 257 EIPPYASRKQP-----VAREPQRQVKKVVIVDKQQPLYVSSPNKFPNHAQDDWLEESEGE 311

Query: 383 SCSCGXXXXXXXXXXXXXXXHLTAKGCGFLTKFCLKSSFCLLNPVPGMKVSGNRLPPTSV 204
               G                 +AK CG   +F LKSSFCLLNPVPGMK+   + P    
Sbjct: 312 DDYSGSQNS-------------SAKVCGLFPQFLLKSSFCLLNPVPGMKIQAQK-PAKPA 357

Query: 203 RQIRGPQVKAMQNGSLGQTEEEYSWEAVYKHKLMHGNPPPQGEDGSKLKSESNHLTYCSD 24
             +R  Q K+    S  +TE EY+  A  K            ED + LKS S+H++Y SD
Sbjct: 358 HSVRRRQAKSSYLRSGNETESEYAKAATEKGLTRISRTEELIEDKNNLKSGSSHMSYRSD 417

Query: 23  SLTADGS 3
               D +
Sbjct: 418 CQNPDAA 424


>gb|POE99680.1| hypothetical protein CFP56_19009 [Quercus suber]
          Length = 745

 Score =  149 bits (376), Expect = 3e-36
 Identities = 98/243 (40%), Positives = 142/243 (58%), Gaps = 5/243 (2%)
 Frame = -1

Query: 716 FSDALETLSRTESYFMNCSISGLSET-TERMQPSGSFSTDPQVRDFMMGRFLPAAKAVAT 540
           + DAL+T+SRTES+ MNCSISGLS      ++PSGSFSTDPQ RDFMMGRFLPAAKA+A+
Sbjct: 160 YLDALDTISRTESFCMNCSISGLSGLDVPDVKPSGSFSTDPQARDFMMGRFLPAAKAMAS 219

Query: 539 AS-PHSSSRKPAAAAVVLAREPVRTLEERAENGEQRRVRFPNGA--NFAQEGEGASCSCG 369
            + PH+S ++P     +   E V + + R+      +   P  A  N  +E E  +    
Sbjct: 220 ETPPHASWKQPVQREQLRQGEKVVSRDRRSPCNHYGQNFVPQYAHNNDGEESENEA---- 275

Query: 368 XXXXXXXXXXXXXXXHLTAKGCGFLTKFCLKSSFCLLNPVPGMKVSGNRLPPTSVRQIRG 189
                            +AK CG L +FCL +SFCLLNPVPGM++  +R   +SVR++R 
Sbjct: 276 --------DDYDGSETSSAKVCGLLPRFCLSNSFCLLNPVPGMRMQSSR-KVSSVRRVR- 325

Query: 188 PQVKAMQNGSLGQTEEEYSWEAVYKHKLMHGNPPPQ-GEDGSKLKSESNHLTYCSDSLTA 12
              K+  + S  + E+++   AV++ + ++G+   +  ED ++ KSESN +TY SD    
Sbjct: 326 --AKSSYSVSRSEPEKKHGGAAVHEQRSLNGHQKVEPHEDRNEQKSESNQITYKSDCEKP 383

Query: 11  DGS 3
           D S
Sbjct: 384 DQS 386


>ref|XP_023921054.1| uncharacterized protein LOC112032522 [Quercus suber]
 ref|XP_023921055.1| uncharacterized protein LOC112032522 [Quercus suber]
          Length = 751

 Score =  149 bits (376), Expect = 3e-36
 Identities = 98/243 (40%), Positives = 142/243 (58%), Gaps = 5/243 (2%)
 Frame = -1

Query: 716 FSDALETLSRTESYFMNCSISGLSET-TERMQPSGSFSTDPQVRDFMMGRFLPAAKAVAT 540
           + DAL+T+SRTES+ MNCSISGLS      ++PSGSFSTDPQ RDFMMGRFLPAAKA+A+
Sbjct: 166 YLDALDTISRTESFCMNCSISGLSGLDVPDVKPSGSFSTDPQARDFMMGRFLPAAKAMAS 225

Query: 539 AS-PHSSSRKPAAAAVVLAREPVRTLEERAENGEQRRVRFPNGA--NFAQEGEGASCSCG 369
            + PH+S ++P     +   E V + + R+      +   P  A  N  +E E  +    
Sbjct: 226 ETPPHASWKQPVQREQLRQGEKVVSRDRRSPCNHYGQNFVPQYAHNNDGEESENEA---- 281

Query: 368 XXXXXXXXXXXXXXXHLTAKGCGFLTKFCLKSSFCLLNPVPGMKVSGNRLPPTSVRQIRG 189
                            +AK CG L +FCL +SFCLLNPVPGM++  +R   +SVR++R 
Sbjct: 282 --------DDYDGSETSSAKVCGLLPRFCLSNSFCLLNPVPGMRMQSSR-KVSSVRRVR- 331

Query: 188 PQVKAMQNGSLGQTEEEYSWEAVYKHKLMHGNPPPQ-GEDGSKLKSESNHLTYCSDSLTA 12
              K+  + S  + E+++   AV++ + ++G+   +  ED ++ KSESN +TY SD    
Sbjct: 332 --AKSSYSVSRSEPEKKHGGAAVHEQRSLNGHQKVEPHEDRNEQKSESNQITYKSDCEKP 389

Query: 11  DGS 3
           D S
Sbjct: 390 DQS 392


>ref|XP_021274682.1| LOW QUALITY PROTEIN: uncharacterized protein LOC110409605 [Herrania
           umbratica]
          Length = 728

 Score =  146 bits (369), Expect = 3e-35
 Identities = 98/239 (41%), Positives = 128/239 (53%), Gaps = 1/239 (0%)
 Frame = -1

Query: 716 FSDALETLSRTESYFMNCSISGLSETT-ERMQPSGSFSTDPQVRDFMMGRFLPAAKAVAT 540
           + DAL+TLSRTES+F+NCSISG+S      ++PSG FSTDPQ RD MMGRFLPAAKAVA+
Sbjct: 166 YVDALDTLSRTESFFLNCSISGVSGFDGPEVKPSGIFSTDPQTRDIMMGRFLPAAKAVAS 225

Query: 539 ASPHSSSRKPAAAAVVLAREPVRTLEERAENGEQRRVRFPNGANFAQEGEGASCSCGXXX 360
             P  +SRK       +AREP R +++     +Q+ +      NF    +          
Sbjct: 226 EIPPYASRKQP-----VAREPQRQVKKVVIVDKQQPLYVSRPNNFPNHAQD-----DWLE 275

Query: 359 XXXXXXXXXXXXHLTAKGCGFLTKFCLKSSFCLLNPVPGMKVSGNRLPPTSVRQIRGPQV 180
                       + +AK CG   +F LKSSFCLLNPVPGMK+   + P      +R  Q 
Sbjct: 276 ESEDEDDYSXSQNSSAKVCGLFHQFLLKSSFCLLNPVPGMKIQAQK-PAKPAHSVRRRQA 334

Query: 179 KAMQNGSLGQTEEEYSWEAVYKHKLMHGNPPPQGEDGSKLKSESNHLTYCSDSLTADGS 3
           K+    S  +TE EY+  A  K            ED + LKS S H++Y SD    DG+
Sbjct: 335 KSSYLRSGNETENEYAKAATEKGLTGISRTEELIEDKNNLKSGSIHMSYRSDCQNPDGA 393


>gb|OMO89020.1| hypothetical protein CCACVL1_08059, partial [Corchorus capsularis]
          Length = 577

 Score =  142 bits (357), Expect = 4e-34
 Identities = 92/239 (38%), Positives = 127/239 (53%), Gaps = 1/239 (0%)
 Frame = -1

Query: 716 FSDALETLSRTESYFMNCSISGLSETTE-RMQPSGSFSTDPQVRDFMMGRFLPAAKAVAT 540
           + DAL TLSRTES+F NCSISG+S   E  ++PSG FSTDPQ RDFMMGRFLPAAKAVA+
Sbjct: 167 YVDALSTLSRTESFFFNCSISGVSGLDELEIKPSGIFSTDPQTRDFMMGRFLPAAKAVAS 226

Query: 539 ASPHSSSRKPAAAAVVLAREPVRTLEERAENGEQRRVRFPNGANFAQEGEGASCSCGXXX 360
            +P  ++RK       +AREP R +++     +Q+ +   +  NF    +   C      
Sbjct: 227 ETPPYATRKQP-----VAREPEREIKKLVIVDKQQPLYVSSPNNFQDHAQDGWCD----- 276

Query: 359 XXXXXXXXXXXXHLTAKGCGFLTKFCLKSSFCLLNPVPGMKVSGNRLPPTSVRQIRGPQV 180
                         +AK CG    F LKSSFCLLNPVPGMK+   + P      +R  Q 
Sbjct: 277 ESEDEDDYSQSSDYSAKVCGLFPPFLLKSSFCLLNPVPGMKIPAQK-PVKPAYSVRTRQA 335

Query: 179 KAMQNGSLGQTEEEYSWEAVYKHKLMHGNPPPQGEDGSKLKSESNHLTYCSDSLTADGS 3
           K+       +T++E++  +  K   +        ED   L   S+ ++Y SD   A+G+
Sbjct: 336 KSSYLRYSSETKDEHAKASAEKGLTVVSRMEGLTEDKKNLNIGSSLMSYKSDCQKANGA 394


>gb|OMO61058.1| hypothetical protein COLO4_33589 [Corchorus olitorius]
          Length = 725

 Score =  142 bits (359), Expect = 6e-34
 Identities = 93/239 (38%), Positives = 127/239 (53%), Gaps = 1/239 (0%)
 Frame = -1

Query: 716 FSDALETLSRTESYFMNCSISGLSETTE-RMQPSGSFSTDPQVRDFMMGRFLPAAKAVAT 540
           + DAL TLSRTES+F NCSISG+S   E  ++PSG FSTDPQ RDFMMGRFLPAAKAVA+
Sbjct: 167 YVDALSTLSRTESFFFNCSISGVSGLDEPEIKPSGIFSTDPQTRDFMMGRFLPAAKAVAS 226

Query: 539 ASPHSSSRKPAAAAVVLAREPVRTLEERAENGEQRRVRFPNGANFAQEGEGASCSCGXXX 360
            +P  ++RK       +AREP R +++     +Q+ +   +  NF    +   C      
Sbjct: 227 ETPPYATRKQP-----VAREPEREIKKLVIVDKQQPLYLSSPNNFQYHAQDGWCE----- 276

Query: 359 XXXXXXXXXXXXHLTAKGCGFLTKFCLKSSFCLLNPVPGMKVSGNRLPPTSVRQIRGPQV 180
                         +AK CG    F LKSSFCLLNPVPGMK+   + P      +R  Q 
Sbjct: 277 ESEDEDDHSQSSDYSAKVCGLFPPFLLKSSFCLLNPVPGMKIPAQK-PVKPAYSVRTRQA 335

Query: 179 KAMQNGSLGQTEEEYSWEAVYKHKLMHGNPPPQGEDGSKLKSESNHLTYCSDSLTADGS 3
           K+       +TE+E++  +  K   +        ED   L   S+ ++Y SD   A+G+
Sbjct: 336 KSSYLRYGSETEDEHAKASAEKGLTVVSRLEGLTEDKKNLNIGSSLMSYRSDGQKANGA 394


>gb|PKA46869.1| hypothetical protein AXF42_Ash015763 [Apostasia shenzhenica]
          Length = 567

 Score =  140 bits (353), Expect = 1e-33
 Identities = 102/243 (41%), Positives = 126/243 (51%), Gaps = 5/243 (2%)
 Frame = -1

Query: 716 FSDALETLSRTESYFMNCSISGLSETTERMQPSGSFSTDPQVRDFMMGRFLPAAKAVATA 537
           FSDA +TLS TES+ M CS +G+S           FS D Q + FM+ RFLPAA+A+A  
Sbjct: 148 FSDARDTLSCTESFSMKCSATGMSANFGARNSFSDFSKDQQGKGFMIDRFLPAAQAMAAR 207

Query: 536 SPHSSSRKPAAAAVVLAREP---VRTLEERAENGE-QRRVRFP-NGANFAQEGEGASCSC 372
           SP  + RKP  A  V A  P   ++T+  R  +   + +V FP   AN   E E      
Sbjct: 208 SPRRTWRKPPRAPEVAAANPNPTIKTVRRRESSFHYECQVGFPAPNANDQDEVEN----- 262

Query: 371 GXXXXXXXXXXXXXXXHLTAKGCGFLTKFCLKSSFCLLNPVPGMKVSGNRLPPTSVRQIR 192
                           + + K CG L + CLKSSF LLNPVPGMKV         VR   
Sbjct: 263 ---DDDGDHRNHDDNGYFSPKVCGLLPRCCLKSSFFLLNPVPGMKVQSRH----PVRPPN 315

Query: 191 GPQVKAMQNGSLGQTEEEYSWEAVYKHKLMHGNPPPQGEDGSKLKSESNHLTYCSDSLTA 12
            P   +  + +L     E SWEAVY++KL HG+   Q EDGSKL SESN LTY SDS T 
Sbjct: 316 KPSPLSHNDANL-----EDSWEAVYRYKLNHGD-YIQKEDGSKLASESNQLTYWSDSQTL 369

Query: 11  DGS 3
           D S
Sbjct: 370 DDS 372

