BLASTX nr result

ID: Ophiopogon25_contig00006428 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ophiopogon25_contig00006428
         (1292 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_020269678.1| uncharacterized protein LOC109844919 [Aspara...   560   0.0  
ref|XP_008811331.1| PREDICTED: uncharacterized protein LOC103722...   359   e-115
ref|XP_009413687.1| PREDICTED: uncharacterized protein LOC103994...   337   e-106
ref|XP_019709892.1| PREDICTED: uncharacterized protein LOC105056...   333   e-105
gb|PKA56714.1| hypothetical protein AXF42_Ash012844 [Apostasia s...   278   4e-84
ref|XP_010254726.1| PREDICTED: uncharacterized protein LOC104595...   259   5e-77
gb|OVA08733.1| Ubiquitin-conjugating enzyme E2-binding protein [...   258   2e-76
ref|XP_010649104.1| PREDICTED: uncharacterized protein LOC100260...   254   1e-74
ref|XP_007046828.2| PREDICTED: uncharacterized protein LOC186108...   252   2e-74
gb|EOX90985.1| Uncharacterized protein TCM_000303 isoform 1 [The...   252   2e-74
gb|EOX90986.1| Uncharacterized protein TCM_000303 isoform 2 [The...   252   2e-74
ref|XP_020703622.1| uncharacterized protein LOC110114919 [Dendro...   250   8e-74
ref|XP_021273814.1| uncharacterized protein LOC110408968 [Herran...   246   5e-72
ref|XP_022751404.1| uncharacterized protein LOC111300062 isoform...   244   1e-71
gb|OMO84492.1| hypothetical protein COLO4_22012 [Corchorus olito...   242   1e-70
ref|XP_008379646.1| PREDICTED: uncharacterized protein LOC103442...   241   4e-70
ref|XP_021604018.1| uncharacterized protein LOC110609020 [Maniho...   241   5e-70
ref|XP_017631359.1| PREDICTED: uncharacterized protein LOC108474...   240   6e-70
ref|XP_015889972.1| PREDICTED: uncharacterized protein LOC107424...   239   1e-69
ref|XP_015889905.1| PREDICTED: uncharacterized protein LOC107424...   239   2e-69

>ref|XP_020269678.1| uncharacterized protein LOC109844919 [Asparagus officinalis]
 ref|XP_020269679.1| uncharacterized protein LOC109844919 [Asparagus officinalis]
 ref|XP_020269680.1| uncharacterized protein LOC109844919 [Asparagus officinalis]
 ref|XP_020269681.1| uncharacterized protein LOC109844919 [Asparagus officinalis]
 gb|ONK66251.1| uncharacterized protein A4U43_C06F5780 [Asparagus officinalis]
          Length = 610

 Score =  560 bits (1442), Expect = 0.0
 Identities = 288/429 (67%), Positives = 329/429 (76%), Gaps = 9/429 (2%)
 Frame = +2

Query: 32   TLNPRNRNQWRFTYETLAHIPTLRLYLFNSAVNPSLQCLNIRSDLQLARSLLVVSWIDSE 211
            +LNP+   QWRFT+ETLAHIPTLRLYLFN  VNPS QCL++ SDLQL+RSLLVVSWIDSE
Sbjct: 3    SLNPKK--QWRFTFETLAHIPTLRLYLFNPNVNPSTQCLSLDSDLQLSRSLLVVSWIDSE 60

Query: 212  KAXXXXXXXXXXXXLIDPGSGVEIKARDDHVEVRMVLVLPVDHPVAVNLRGAIGSGGDLE 391
            K             LIDP SG E++ RDD +E++M LVLPVDHPVA+++RGA+G   DLE
Sbjct: 61   KPEGVSVRVPIPRVLIDPESGSEVRVRDDCIEIKMRLVLPVDHPVALSVRGALGGDVDLE 120

Query: 392  RGRPLELDSDLKNLSSGGVHLFCKVCSARLTKQPLRRFMEMPSVNWREVADNWFGACCCS 571
            R RPLELDS L NLSSG VHLFCK+CS RLTK+PLRRFMEMPSVNWREVADNWFGACCCS
Sbjct: 121  RVRPLELDSGLTNLSSGDVHLFCKICSTRLTKEPLRRFMEMPSVNWREVADNWFGACCCS 180

Query: 572  FGGISEKLVLQYIKTYDCAEGTCLLDSASVIICKDDLEGYTFQLYLEEQSDNSSKIDVEP 751
            FGGISEKLVLQYIKTYDCAEGTCLLD ASVIICKDDLEGYTFQ   +E  DNS K D EP
Sbjct: 181  FGGISEKLVLQYIKTYDCAEGTCLLDYASVIICKDDLEGYTFQKCSDELIDNSYKFDTEP 240

Query: 752  SGNRTDTMKEGCKNMSDNIPFVCATSS-VANDVERNVLESDSFKC--EKHTNPLACQLTS 922
            +    D ++EGC+N S+++P VCATS+ V N+V    L+ +S  C  EKHT PLA ++T 
Sbjct: 241  NDYAKD-VEEGCRNKSEDLPCVCATSTCVDNEVGEKTLDLESASCKREKHTYPLASRMTL 299

Query: 923  SADCNGLVNRHKSLFMESSRLAQLSLNEILSTVTVDSLSLNNDS--ENHEKLIQLSVDEL 1096
            S DCNG V    SL ME+S L Q SLN   STV  DSL  NND+  +N EKLI  S DEL
Sbjct: 300  SPDCNGFVREQDSLLMENSSLPQQSLNGTQSTVIADSLRYNNDNVPKNLEKLILQSADEL 359

Query: 1097 SSIDHCHCCPDEAQYVVD----ANSQKLAENSRPEKIQKWLHDCSLGSGFMNRIPNLSDN 1264
            SS  HCHCC DEA+   D    AN+QKL++ S PEKIQKWLHDCSLGSGFMNRIPNLSD+
Sbjct: 360  SSTGHCHCCTDEAKCDTDLFTNANAQKLSKESGPEKIQKWLHDCSLGSGFMNRIPNLSDD 419

Query: 1265 VEWVTFSCR 1291
            ++WV F CR
Sbjct: 420  IKWVEFLCR 428


>ref|XP_008811331.1| PREDICTED: uncharacterized protein LOC103722524 [Phoenix dactylifera]
          Length = 613

 Score =  359 bits (922), Expect = e-115
 Identities = 201/434 (46%), Positives = 270/434 (62%), Gaps = 16/434 (3%)
 Frame = +2

Query: 38   NPRNRNQWRFTYETLAHIPTLRLYLFNSAVNPSLQCLNIRSDLQLARSLLVVSWIDSEKA 217
            NPR+   WRFT+E LAHIPTLRLYLF +  +PS +C  +R+ L+   SLL+VSW D E+ 
Sbjct: 8    NPRS---WRFTWENLAHIPTLRLYLFRAGFDPSARCSILRASLRFDESLLLVSWTDEEEG 64

Query: 218  XXXXXXXXXXXX-LIDPGSGVEIKARDDHVEVRMVLVLPVDHPVAVNLRGAIGSGGD--- 385
                         LIDPG  ++  A+ DH+E+++VL+LPVDH V  N RG + S G+   
Sbjct: 65   NHEVSLRVPVPRVLIDPGCPIDCTAKSDHIEIKLVLILPVDHAVVANFRGVLDSAGEEVE 124

Query: 386  ------LERGRPLELDSDLKNLSSGGVHLFCKVCSARLTKQPLRRFMEMPSVNWREVADN 547
                   +R  PL LDSD++ LS+ GVH FCK CS +LT+QPLR F+EMPSVNWREVADN
Sbjct: 125  EDGDRSSDRLLPLTLDSDIRKLSADGVHFFCKACSTKLTRQPLRCFVEMPSVNWREVADN 184

Query: 548  WFGACCCSFGGISEKLVLQYIKTYDCAEGTCLLDSASVIICKDDLEGYTFQLYLEEQSD- 724
            WFGACCCSFGGISEKLV +YI +Y+CAEGTCL+D+ASVIICKDDL+  TFQ   EE S+ 
Sbjct: 185  WFGACCCSFGGISEKLVSEYINSYNCAEGTCLVDAASVIICKDDLQDDTFQQCTEEYSEW 244

Query: 725  NSSKIDVEPSGNRTDTMKEGC-KNMSDNIPFVCATSSVANDVERNVLE--SDSFKCEKHT 895
            N S +      N T+  KEG   N+ ++IP    TS+ A  V R   +  ++S   +  T
Sbjct: 245  NKSNL---MENNVTEAPKEGVYGNLCESIPDAGLTSTCAAQVGRVDFDVGTESSGYKNGT 301

Query: 896  NPLACQLTSSADCNGLVNRHKSLFMESSRLAQLSLNEILSTVTVDSLSLN--NDSENHEK 1069
            N L+C + +S+DC G V    S   E++ LAQ SL+ + + + VD L LN    S+  E 
Sbjct: 302  NVLSCPVLNSSDCCGTVT---SFSKETTNLAQPSLDVMQTAIDVDLLKLNIKQCSDVLEN 358

Query: 1070 LIQLSVDELSSIDHCHCCPDEAQYVVDANSQKLAENSRPEKIQKWLHDCSLGSGFMNRIP 1249
                S +E S  DHCHCC D+  Y ++ +  + A       ++  + +  LG GFM R  
Sbjct: 359  PPPASAEETSK-DHCHCCEDKTNYSLNFHHGRSALGMSITPMKSQIDNSPLGGGFMIRTS 417

Query: 1250 NLSDNVEWVTFSCR 1291
            N+S ++EW+ FSCR
Sbjct: 418  NISSDIEWIEFSCR 431


>ref|XP_009413687.1| PREDICTED: uncharacterized protein LOC103994945 [Musa acuminata
            subsp. malaccensis]
          Length = 635

 Score =  337 bits (863), Expect = e-106
 Identities = 205/451 (45%), Positives = 263/451 (58%), Gaps = 33/451 (7%)
 Frame = +2

Query: 38   NPRNRNQWRFTYETLAHIPTLRLYLFNSAVNPSLQCLNIRSDLQLARSLLVVSWIDSEKA 217
            NP  R +WRFT+ETLAHIPTLRLYLF+  V+PS  C N+ + L+L +SLL+VSWID    
Sbjct: 8    NPSPR-RWRFTWETLAHIPTLRLYLFHPDVHPSALCGNLSASLRLDQSLLLVSWIDDRNG 66

Query: 218  XXXXXXXXXXXX----LIDPGSGVEIKARDDHVEVRMVLVLPVDHPVAVNLRGAIGS--- 376
                            LIDP   VE +A DDH+E+++ LVLPVDHPV ++LRG + S   
Sbjct: 67   GGTGDVVSLKAPVPKVLIDPSCPVECRAMDDHIEIKLALVLPVDHPVMMDLRGVLDSYMG 126

Query: 377  ------GGDLERGRPLELDSDLKNLSSGGVHLFCKVCSARLTKQPLRRFMEMPSVNWREV 538
                  GG  +   PL LD D+KNLS+GGVH FCK CS +LTKQPLR F+EMPSVNWREV
Sbjct: 127  EMGRQEGGLSDPLVPLPLDLDIKNLSAGGVHFFCKSCSTKLTKQPLRHFVEMPSVNWREV 186

Query: 539  ADNWFGACCCSFGGISEKLVLQYIKTYDCAEGTCLLDSASVIICKDDLEGYTFQLYLEEQ 718
            ADNWFG CCCSFGGISEKLV QY+  Y C EGTCLLD AS+IIC+DDLEGY+FQ  L   
Sbjct: 187  ADNWFGTCCCSFGGISEKLVRQYVNRYSCHEGTCLLDGASIIICQDDLEGYSFQELLAGF 246

Query: 719  SDNSSKIDVEPSGNRTDTMKEGCKN--MSDNIPF-VCATSSVANDVERNVLESDSFKCEK 889
            SD+ +K D        D++K G     + DN    V   SS   DV  ++  +      K
Sbjct: 247  SDHKNK-DQVACIVINDSVKGGSGQDFVEDNADVGVMPGSSSKEDVCMDL--ATELPIRK 303

Query: 890  HTNPLACQLTSSAD-CNGLVNRHKSLFMESSRLAQLSLNEILSTVTVDSLSLNND--SEN 1060
            +T+ L+  +  S+D CN +  R +      S     SL+EI S    D L+LN D    +
Sbjct: 304  NTDLLSSPILKSSDICNNVRRRTECGLKGCSNPIGSSLDEIQSPQHADFLNLNLDHCCGS 363

Query: 1061 HEKLIQLSVDELSSIDHCHCCPDEAQYVVD--------------ANSQKLAENSRPEKIQ 1198
              K +    D   S  HC+ C  ++  +V+              A  Q+   +S     Q
Sbjct: 364  SGKPLPEPSDGFPSEAHCNICDHKSNNLVNYASGGSMLDVPIMPAKVQESMTSSGLSGSQ 423

Query: 1199 KWLHDCSLGSGFMNRIPNLSDNVEWVTFSCR 1291
            KWLH+ SLG GF+ R  NLS+++EWV FSC+
Sbjct: 424  KWLHNSSLGGGFIVRTSNLSNDIEWVGFSCK 454


>ref|XP_019709892.1| PREDICTED: uncharacterized protein LOC105056234 isoform X1 [Elaeis
            guineensis]
          Length = 575

 Score =  333 bits (854), Expect = e-105
 Identities = 191/425 (44%), Positives = 246/425 (57%), Gaps = 10/425 (2%)
 Frame = +2

Query: 47   NRNQWRFTYETLAHIPTLRLYLFNSAVNPSLQCLNIRSDLQLARSLLVVSWIDSEKAXXX 226
            N   WRFT+E LAHIPTLRLYLF +  +PS +C  +R+ L+   S+L+VSW D E+    
Sbjct: 8    NPRPWRFTWEALAHIPTLRLYLFRAGFDPSARCSTLRASLRFDESVLLVSWTDEEEGNRD 67

Query: 227  XXXXXXXXX-LIDPGSGVEIKARDDHVEVRMVLVLPVDHPVAVNLRGAIGSGGDL----- 388
                      LIDPG  V+  A+ DH+ +++VLVLPVDHPV  N RG I S G+      
Sbjct: 68   VSLRVPVPRVLIDPGCPVDCTAKSDHIAIKLVLVLPVDHPVVANCRGVIDSAGEEVGDRS 127

Query: 389  -ERGRPLELDSDLKNLSSGGVHLFCKVCSARLTKQPLRRFMEMPSVNWREVADNWFGACC 565
             +R  PL LDSD+  LS+ GVH FCK C  +LT+QPLR F+EMPSVNWREVADNWFGACC
Sbjct: 128  PDRLLPLSLDSDIGKLSAEGVHFFCKACLTKLTRQPLRCFVEMPSVNWREVADNWFGACC 187

Query: 566  CSFGGISEKLVLQYIKTYDCAEGTCLLDSASVIICKDDLEGYTFQLYLEEQSDNSSKIDV 745
            CSFGGISEKLV QYI +Y+CAEGTCL+D+ASVIICKDDL+ YTFQ   EE S+      +
Sbjct: 188  CSFGGISEKLVSQYINSYNCAEGTCLVDAASVIICKDDLQDYTFQQCSEENSEWKKSDLM 247

Query: 746  EPSGNRTDTMKEG-CKNMSDNIPFVCATSSVANDVERNVLESDSFKCEKHTNPLACQLTS 922
            E S    D +KEG C N+ ++IP                       C  H   +A     
Sbjct: 248  ENS--VADALKEGTCGNLCEDIP----------------------HCAAHKPAVA----- 278

Query: 923  SADCNGLVNRHKSLFMESSRLAQLSLNEILSTVTVDSLSLN--NDSENHEKLIQLSVDEL 1096
                        S   E++ LAQ SL+E  S   VD L LN    S+ H   +   V E 
Sbjct: 279  ----------PTSFSKETTYLAQPSLDETHSARDVDLLKLNLKQCSDIHGNPLPAFV-EG 327

Query: 1097 SSIDHCHCCPDEAQYVVDANSQKLAENSRPEKIQKWLHDCSLGSGFMNRIPNLSDNVEWV 1276
            +S DHCHCC D+  ++++ +  + A        +  + +  LG GFM R  NL +++EW+
Sbjct: 328  TSKDHCHCCEDKTNFLLNFHHGRSALGVSIAPTKSQIDNSHLGGGFMTRTSNLLNDMEWI 387

Query: 1277 TFSCR 1291
             FSCR
Sbjct: 388  EFSCR 392


>gb|PKA56714.1| hypothetical protein AXF42_Ash012844 [Apostasia shenzhenica]
          Length = 579

 Score =  278 bits (710), Expect = 4e-84
 Identities = 167/426 (39%), Positives = 240/426 (56%), Gaps = 10/426 (2%)
 Frame = +2

Query: 44   RNRNQWRFTYETLAHIPTLRLYLFNSAVNP-SLQCLNIRSDLQLARSLLVVSWIDSEKAX 220
            ++  +WRFT+E LAHIPTLRLYLFN  +   S +C  I + L L  SLL+VSW   E   
Sbjct: 9    KSTRRWRFTWEALAHIPTLRLYLFNPELRDLSDKCHAIGASLSLHESLLLVSWSLDEDGS 68

Query: 221  XXXXXXXXXXX----LIDPGSGVEIKARDDHVEVRMVLVLPVDHPVAVNLRGAIGSGGDL 388
                           LIDPG  V+++A  DH+E ++ L+LPVDHPV   L G + S GD 
Sbjct: 69   GGGVAVALRVPIPRVLIDPGCPVDVRATGDHIEAKLALLLPVDHPVVTELCGVLYSDGD- 127

Query: 389  ERGRPLELDSDLKNLSSGGVHLFCKVCSARLTKQPLRRFMEMPSVNWREVADNWFGACCC 568
             R  PL L SD++ LSS GV++FCK CS +LT+QP+R F+EMPS+NWREVADNWFGACCC
Sbjct: 128  -RRLPLSLGSDIEKLSSEGVNIFCKSCSTKLTRQPIRNFVEMPSMNWREVADNWFGACCC 186

Query: 569  SFGGISEKLVLQYIKTYDCAEGTCLLDSASVIICKDDLEGYTFQLYLEEQSDNSSKIDVE 748
            SFGG SEKLVL+Y++T++C +GTCLLD AS+I+ KDD++G  FQ    + + NSS     
Sbjct: 187  SFGGASEKLVLKYLETFNCCKGTCLLDRASIIVHKDDIDGSLFQ---PDANGNSS----- 238

Query: 749  PSGNRTDTMKEGCKNMSDNIPFVCATSSVANDVERNVLESDSFKCEKHTNPLACQLTSSA 928
                        C +  ++  + C  S  + ++     ++ S   E   +   C +T   
Sbjct: 239  ------------CSSRHNS--WACDDSLGSANLGSCQSDTSSLCHELSLDEYLCNVTHEV 284

Query: 929  DCNGLVNRHKSLFMESSRLAQLSLNEILSTVTVDSLSLNND-SENHEKLIQLSVDELSSI 1105
            D + + N         + LAQ   +E+   V+++ L+ N      H +  + +  E++S 
Sbjct: 285  DPSIVAN---------TNLAQHGSDELQGNVSMNGLNPNLPYFRYHSRSFRDAAVEMNSE 335

Query: 1106 DHCHCCPDEAQ----YVVDANSQKLAENSRPEKIQKWLHDCSLGSGFMNRIPNLSDNVEW 1273
            D   CC DE +    YV   +  K++     +   K ++    GSGFMN   NLS+++EW
Sbjct: 336  DSFQCCVDETRPIVNYVEGRSESKVSVLPNNDLQFKKVY----GSGFMNTTTNLSNDIEW 391

Query: 1274 VTFSCR 1291
            + F CR
Sbjct: 392  IEFCCR 397


>ref|XP_010254726.1| PREDICTED: uncharacterized protein LOC104595621 [Nelumbo nucifera]
          Length = 577

 Score =  259 bits (662), Expect = 5e-77
 Identities = 157/430 (36%), Positives = 224/430 (52%), Gaps = 16/430 (3%)
 Frame = +2

Query: 47   NRNQWRFTYETLAHIPTLRLYLFNSAVNPSLQCLNIRSDLQLARSLLVVSWID--SEKAX 220
            N  +WRFT+ET AHIP LRL+LFN  + P  QC N++  L     +L V WI+  ++   
Sbjct: 10   NPRKWRFTWETQAHIPILRLFLFNPNIKPVSQCQNLKVSLNFEDFMLQVGWIEGINDGIL 69

Query: 221  XXXXXXXXXXXLIDPGSGVEIKARDDHVEVRMVLVLPVDHPVAVNL--------RGAIGS 376
                       L+D  S V+ +A +DH+EV++ L+LPVDHP+  +L         G +  
Sbjct: 70   NLSLTVPVPRVLVDLDSSVDFRATEDHIEVKLALLLPVDHPIVTSLISAQDFPDEGFVNE 129

Query: 377  GGDLERGRPLELDSDLKNLSS-GGVHLFCKVCSARLTKQPLRRFMEMPSVNWREVADNWF 553
             G L+R +PL +DSD+KNLSS GGV+ +CK CS +LT +PLR F+EMPSVNWREVADNWF
Sbjct: 130  SGLLDRLQPLSVDSDIKNLSSCGGVYFYCKNCSTQLTVRPLRFFVEMPSVNWREVADNWF 189

Query: 554  GACCCSFGGISEKLVLQYIKTYDCAEGTCLLDSASVIICKDDLEGYTFQLYLEEQSDNSS 733
            GACCCSFGG SEK+V +Y  +YDCAEGTCL+D+ASV++C+ DL G  F      +  + S
Sbjct: 190  GACCCSFGGASEKMVTKYANSYDCAEGTCLVDAASVVVCESDLAGLPF-----HKLKHGS 244

Query: 734  KIDVEPSGNRTDTMKEGCKNMSDNIPFVCATSSVANDVERNVLESDSFKCEKHTNPLACQ 913
            + D+    +R + + + C         +   S   +D+   +                  
Sbjct: 245  EPDLIGEVDRAEALTDSCS--------IDCQSDQTSDINERLF----------------- 279

Query: 914  LTSSADCNGLVNRHKSLFMESSRLAQLSLNEILSTVTVDSLSLNNDSENHEKLIQLSVDE 1093
                  C  LVN++ S  +E     +            DS +L   S         S DE
Sbjct: 280  ------CMSLVNKNCSANIEGGHAEKKK----------DSSALLGTSPTMSDGYVTSADE 323

Query: 1094 LSS-----IDHCHCCPDEAQYVVDANSQKLAENSRPEKIQKWLHDCSLGSGFMNRIPNLS 1258
            LS+      +H  C  + +       +Q+  E+      Q  L +  LGSGFM R  N+S
Sbjct: 324  LSAKNLGCYEHEGCSHNASVICSKFEAQESREHIGHLVNQMPLLNGYLGSGFMVRSSNIS 383

Query: 1259 DNVEWVTFSC 1288
             +V+W+ F C
Sbjct: 384  KDVKWIEFLC 393


>gb|OVA08733.1| Ubiquitin-conjugating enzyme E2-binding protein [Macleaya cordata]
          Length = 594

 Score =  258 bits (659), Expect = 2e-76
 Identities = 164/444 (36%), Positives = 233/444 (52%), Gaps = 27/444 (6%)
 Frame = +2

Query: 38   NPRNRNQWRFTYETLAHIPTLRLYLFNSAVNPSLQCLNIRSDLQLARSLLVVSWID---- 205
            NPRN   WR T+ETL+H+PTLRL+LF+  + P+ QC N++  L+   SLL+VSW +    
Sbjct: 6    NPRN---WRCTWETLSHVPTLRLFLFSPDIKPATQCKNLKIHLKFEESLLLVSWNEEIEH 62

Query: 206  -SEKAXXXXXXXXXXXXLIDPGSGVEIKARDDHVEVRMVLVLPVDHPVAVNLRGAIGSGG 382
             ++ +            L++ GS V+ KA +DH+EV++VL+LPVDHP+AVN    +    
Sbjct: 63   QNDGSVDFLLRVPVPRVLVELGSPVDFKATEDHIEVKLVLLLPVDHPLAVNFSSVLNFSD 122

Query: 383  D------LERGRPLELDSDLKNLSSG-GVHLFCKVCSARLTKQPLRRFMEMPSVNWREVA 541
            +       +R + L +DSD+K L SG GV  +CK CS +LTK+PLR F+EMPSV+WREVA
Sbjct: 123  EEDSTECSDRFQSLSIDSDIKRLLSGDGVDFYCKSCSNKLTKRPLRSFVEMPSVDWREVA 182

Query: 542  DNWFGACCCSFGGISEKLVLQYIKTYDCAEGTCLLDSASVIICKDDLEGYTFQLYLEEQS 721
            DNWFG+CCCSFGGISEKLV++Y  +Y  +EGTCL DS  V++ KDDL GYTF     +  
Sbjct: 183  DNWFGSCCCSFGGISEKLVIKYSNSYSSSEGTCLYDSTCVVVSKDDLVGYTF----PDCF 238

Query: 722  DNSSKIDVEPSGNRTDTMKEGCKNMSDNIPFVCATSSVANDVERNVLESDSFKCEKHTNP 901
            D S   D  P     D + E       N+P +   S   +D  R++       C    + 
Sbjct: 239  DGSKMHDCGP-----DPVGE------VNLPEIVVNSH--SDCGRDL-------CSGTKSD 278

Query: 902  LACQLTSSADCNGLVNRHKSLFMESSRLAQLSLNEILSTVTVDSLSLNNDSENHEKLIQL 1081
            +A        C          ++E     Q + +    T ++  +S  N    H      
Sbjct: 279  IASNGDDKLSCMSPKRGRIVAYLEDE--IQKNNDTSFCTSSISDVS-GNVMPGH------ 329

Query: 1082 SVDELSSIDHCHCCPDEAQYV---------------VDANSQKLAENSRPEKIQKWLHDC 1216
                 S  DH HCC D+   V                 +  Q+   + +  + +K L + 
Sbjct: 330  ---GFSEKDHSHCCSDKTSSVSRDYYHEVCSHGDSITSSEDQEQTNSIKLLQNRKSLLNG 386

Query: 1217 SLGSGFMNRIPNLSDNVEWVTFSC 1288
            SLG+GFM R  NLS +VEW+ F C
Sbjct: 387  SLGNGFMVRTSNLSKDVEWIEFRC 410


>ref|XP_010649104.1| PREDICTED: uncharacterized protein LOC100260906 [Vitis vinifera]
 ref|XP_010649105.1| PREDICTED: uncharacterized protein LOC100260906 [Vitis vinifera]
          Length = 608

 Score =  254 bits (648), Expect = 1e-74
 Identities = 165/455 (36%), Positives = 228/455 (50%), Gaps = 32/455 (7%)
 Frame = +2

Query: 20   MASPTLNPRNRNQWRFTYETLAHIPTLRLYLFNSAVNPSLQCLNIRSDLQLARSLLVVSW 199
            M+S      N  +WRFT+E  +HIPTLRL+LF+    P +QC N++ DL   RSLL+VSW
Sbjct: 1    MSSELGTSENPRKWRFTWEAQSHIPTLRLFLFDQGTKPCIQCKNLKVDLNFERSLLLVSW 60

Query: 200  IDSEKAXXXXXXXXXXXXLIDPGSGVEIKARDDHVEVRMVLVLPVDHPVAVNLRGAIGSG 379
             + E              L+D  S +  +A +DH+EV++VL+LPVDH +  N    +   
Sbjct: 61   FEEETEISFRVPVPRV--LVDIESPISFRAMEDHIEVKLVLLLPVDHHIVSNFNSILNMS 118

Query: 380  GDLERGRPLELDSDLKNLSS-GGVHLFCKVCSARLTKQPLRRFMEMPSVNWREVADNWFG 556
                  +   +DSD+K+LSS GGVH +CK CS  LTK+PL  F EMPS+NWREVADNWFG
Sbjct: 119  E--ATSQLFSMDSDIKSLSSRGGVHFYCKSCSTNLTKKPLSSFAEMPSINWREVADNWFG 176

Query: 557  ACCCSFGGISEKLVLQYIKTYDCAEGTCLLDSASVIICKDDLEGYTFQLYLEEQSDNSSK 736
            ACCCSFGGISEKLV +Y  +Y C E +CLLD+ SVI+CKDDL G+ F        D    
Sbjct: 177  ACCCSFGGISEKLVARYANSYSCGEESCLLDATSVILCKDDLVGFEF-----PDRDGDQN 231

Query: 737  IDVEPSGNRTDTMKEGCKNMSDNI-PFVCATSSVANDVERNVLESDSFKCEKHTNPLACQ 913
             + EP     D + E  ++   N    VC T      V++  +   S K     N L  Q
Sbjct: 232  YESEPDCTEDDCINEDMQDAGGNHGRCVCPT------VKKEKMSDLSGK----LNSLHIQ 281

Query: 914  LTSSADCNGLVNRHKSLFMESSRLAQLSLNEILSTVTVDSLSLNNDSE------------ 1057
                 D  G     K          ++++  ++ TV V   S N  S             
Sbjct: 282  KEPFVDSPGYKIIEK----------EITVPSLVGTVPVSYFSENVASAPGCCADNRIHVL 331

Query: 1058 NHEKLIQLSVDELSSIDH-----CHCCPDEAQYVVD-------------ANSQKLAENSR 1183
            NH+K +  + D +S           CC D   +V++             +  QK+ + S 
Sbjct: 332  NHDKEV-CTPDTVSYFSENVPSAPGCCADNRIHVLNHDKEVCMPDTSEISKEQKVTKASE 390

Query: 1184 PEKIQKWLHDCSLGSGFMNRIPNLSDNVEWVTFSC 1288
                +K   +  LG+ FM R  NLS +VEW+ F+C
Sbjct: 391  VLANKKSFLNGFLGNIFMARSYNLSKDVEWIKFAC 425


>ref|XP_007046828.2| PREDICTED: uncharacterized protein LOC18610859 [Theobroma cacao]
          Length = 564

 Score =  252 bits (644), Expect = 2e-74
 Identities = 166/436 (38%), Positives = 225/436 (51%), Gaps = 16/436 (3%)
 Frame = +2

Query: 29   PTLNPRNRNQWRFTYETLAHIPTLRLYLFNSAVNPSLQCLNIRSDLQLARSLLVVSWIDS 208
            P  NP N  +WRFT+E  +H P LRL+LF+S   PS+QC  ++  L L +S L+VSW   
Sbjct: 2    PMGNPENPRKWRFTWEAQSHSPNLRLFLFDSQTKPSVQCKKLKVHLNLFQSQLLVSWPKE 61

Query: 209  EKAXXXXXXXXXXXXLIDPGSGVEIKARDDHVEVRMVLVLPVDHPVAVNLRGAIGS---G 379
            EK             LID  S V  +A DDH+EV++VL+LPV HP+       + S   G
Sbjct: 62   EKEEEVTVRVPIPRVLIDSESPVSFRALDDHIEVKLVLLLPVGHPIVSRFDSVLNSSENG 121

Query: 380  GDL---ERGRPLELDSDLKNLSS--GGVHLFCKVCSARLTKQPLRRFMEMPSVNWREVAD 544
             D    +   PL +D+DLK+LSS   GVH +C+ CS RLT+ PLR F+EMPS++WREVAD
Sbjct: 122  DDALAPDAATPLVMDTDLKSLSSIEEGVHFYCRNCSIRLTENPLRNFVEMPSIDWREVAD 181

Query: 545  NWFGACCCSFGGISEKLVLQYIKTYDCAEGTCLLDSASVIICKDDLEGYTFQLYLEEQSD 724
            NWFGACCCSFGGISEK+V ++  +Y CA+G CLL   +V++ KDDL     +LY      
Sbjct: 182  NWFGACCCSFGGISEKMVTRFANSYKCAKGVCLLSFTAVVLSKDDL--VACKLY------ 233

Query: 725  NSSKIDVEPSGNRTDTMKEGCKNMSDNIPFVCATSSVANDVERNVLESDSFKCEKHTNPL 904
                       NRT   + G    SD     C            VL  D     + TN L
Sbjct: 234  -----------NRTQEHQPGSDFSSD-----C------------VLSEDMLSSRESTNDL 265

Query: 905  ACQLTSSADCNGLVNRHKSLFMESSRLAQLSLNEILSTVTVDSLSLNNDS--------EN 1060
              +L+S    N  V ++  +  E     + + +++ S + V  +S N  S        EN
Sbjct: 266  CGKLSSMHLKNDSVTKNVLVAKE-----EANGHKLFSALPVPDVSENETSVLGCCVHTEN 320

Query: 1061 HEKLIQLSVDELSSIDHCHCCPDEAQYVVDANSQKLAENSRPEKIQKWLHDCSLGSGFMN 1240
            H   I+  VDE    D    CP      VD N+      S+P   QK   + SLG+ FM 
Sbjct: 321  H---IRNHVDEGGQHDVSETCP------VDQNT------SKPLANQKLFLNGSLGNAFMA 365

Query: 1241 RIPNLSDNVEWVTFSC 1288
            +  NLS ++EW+ F C
Sbjct: 366  KSYNLSMDIEWMEFVC 381


>gb|EOX90985.1| Uncharacterized protein TCM_000303 isoform 1 [Theobroma cacao]
          Length = 564

 Score =  252 bits (644), Expect = 2e-74
 Identities = 164/436 (37%), Positives = 226/436 (51%), Gaps = 16/436 (3%)
 Frame = +2

Query: 29   PTLNPRNRNQWRFTYETLAHIPTLRLYLFNSAVNPSLQCLNIRSDLQLARSLLVVSWIDS 208
            P  NP N  +WRFT+E  +H P LRL+LF+S   PS+QC  ++  L L +S ++VSW+  
Sbjct: 2    PMENPENPRKWRFTWEAQSHSPNLRLFLFDSQTKPSVQCKKLKVHLNLFQSQVLVSWLKE 61

Query: 209  EKAXXXXXXXXXXXXLIDPGSGVEIKARDDHVEVRMVLVLPVDHPVAVNLRGAIGS---G 379
            EK             LID  S V  +A DDH+EV++VL+LPV HP+       + S   G
Sbjct: 62   EKEEEVTVRVPIPRVLIDSESPVSFRALDDHIEVKLVLLLPVGHPIVSRFDSVLNSSENG 121

Query: 380  GDL---ERGRPLELDSDLKNLSS--GGVHLFCKVCSARLTKQPLRRFMEMPSVNWREVAD 544
             D    +   PL +D+DLK+LSS   GVH +C+ CS RLT+ PLR F+EMPS++WREVAD
Sbjct: 122  DDALAPDAATPLVMDTDLKSLSSIEEGVHFYCRNCSIRLTENPLRNFVEMPSIDWREVAD 181

Query: 545  NWFGACCCSFGGISEKLVLQYIKTYDCAEGTCLLDSASVIICKDDLEGYTFQLYLEEQSD 724
            NWFGACCCSFGGISEK+V ++  +Y CA+G CLL   +V++ KDDL     +LY      
Sbjct: 182  NWFGACCCSFGGISEKMVTRFANSYKCAKGVCLLSFTAVVLSKDDL--VACKLY------ 233

Query: 725  NSSKIDVEPSGNRTDTMKEGCKNMSDNIPFVCATSSVANDVERNVLESDSFKCEKHTNPL 904
                       NRT   + G    SD     C            VL  +     + TN L
Sbjct: 234  -----------NRTQEHQPGSDFSSD-----C------------VLSEEMLSSRESTNDL 265

Query: 905  ACQLTSSADCNGLVNRHKSLFMESSRLAQLSLNEILSTVTVDSLSLNNDS--------EN 1060
              +L+S    N  V ++  +  E     + + +++ S + V  +S N  S        EN
Sbjct: 266  CGKLSSMHLKNDSVTKNVLVAKE-----EANGHKLFSALPVPDVSENETSVLGCCVHTEN 320

Query: 1061 HEKLIQLSVDELSSIDHCHCCPDEAQYVVDANSQKLAENSRPEKIQKWLHDCSLGSGFMN 1240
            H   I+  VDE    D    C      +VD N+ KL  N      QK   + SLG+ FM 
Sbjct: 321  H---IRNHVDEGGQHDVSETC------LVDQNTSKLLAN------QKLFLNGSLGNAFMA 365

Query: 1241 RIPNLSDNVEWVTFSC 1288
            +  NLS ++EW+ F C
Sbjct: 366  KSYNLSMDIEWMEFVC 381


>gb|EOX90986.1| Uncharacterized protein TCM_000303 isoform 2 [Theobroma cacao]
          Length = 565

 Score =  252 bits (644), Expect = 2e-74
 Identities = 164/436 (37%), Positives = 226/436 (51%), Gaps = 16/436 (3%)
 Frame = +2

Query: 29   PTLNPRNRNQWRFTYETLAHIPTLRLYLFNSAVNPSLQCLNIRSDLQLARSLLVVSWIDS 208
            P  NP N  +WRFT+E  +H P LRL+LF+S   PS+QC  ++  L L +S ++VSW+  
Sbjct: 2    PMENPENPRKWRFTWEAQSHSPNLRLFLFDSQTKPSVQCKKLKVHLNLFQSQVLVSWLKE 61

Query: 209  EKAXXXXXXXXXXXXLIDPGSGVEIKARDDHVEVRMVLVLPVDHPVAVNLRGAIGS---G 379
            EK             LID  S V  +A DDH+EV++VL+LPV HP+       + S   G
Sbjct: 62   EKEEEVTVRVPIPRVLIDSESPVSFRALDDHIEVKLVLLLPVGHPIVSRFDSVLNSSENG 121

Query: 380  GDL---ERGRPLELDSDLKNLSS--GGVHLFCKVCSARLTKQPLRRFMEMPSVNWREVAD 544
             D    +   PL +D+DLK+LSS   GVH +C+ CS RLT+ PLR F+EMPS++WREVAD
Sbjct: 122  DDALAPDAATPLVMDTDLKSLSSIEEGVHFYCRNCSIRLTENPLRNFVEMPSIDWREVAD 181

Query: 545  NWFGACCCSFGGISEKLVLQYIKTYDCAEGTCLLDSASVIICKDDLEGYTFQLYLEEQSD 724
            NWFGACCCSFGGISEK+V ++  +Y CA+G CLL   +V++ KDDL     +LY      
Sbjct: 182  NWFGACCCSFGGISEKMVTRFANSYKCAKGVCLLSFTAVVLSKDDL--VACKLY------ 233

Query: 725  NSSKIDVEPSGNRTDTMKEGCKNMSDNIPFVCATSSVANDVERNVLESDSFKCEKHTNPL 904
                       NRT   + G    SD     C            VL  +     + TN L
Sbjct: 234  -----------NRTQEHQPGSDFSSD-----C------------VLSEEMLSSRESTNDL 265

Query: 905  ACQLTSSADCNGLVNRHKSLFMESSRLAQLSLNEILSTVTVDSLSLNNDS--------EN 1060
              +L+S    N  V ++  +  E     + + +++ S + V  +S N  S        EN
Sbjct: 266  CGKLSSMHLKNDSVTKNVLVAKE-----EANGHKLFSALPVPDVSENETSVLGCCVHTEN 320

Query: 1061 HEKLIQLSVDELSSIDHCHCCPDEAQYVVDANSQKLAENSRPEKIQKWLHDCSLGSGFMN 1240
            H   I+  VDE    D    C      +VD N+ KL  N      QK   + SLG+ FM 
Sbjct: 321  H---IRNHVDEGGQHDVSETC------LVDQNTSKLLAN------QKLFLNGSLGNAFMA 365

Query: 1241 RIPNLSDNVEWVTFSC 1288
            +  NLS ++EW+ F C
Sbjct: 366  KSYNLSMDIEWMEFVC 381


>ref|XP_020703622.1| uncharacterized protein LOC110114919 [Dendrobium catenatum]
          Length = 558

 Score =  250 bits (639), Expect = 8e-74
 Identities = 168/422 (39%), Positives = 232/422 (54%), Gaps = 5/422 (1%)
 Frame = +2

Query: 41   PRNRNQWRFTYETLAHIPTLRLYLFNSAVNP-SLQCLNIRSDLQLARSLLVVSW-IDSEK 214
            P++   WRFT+E+LAHIP LRLYLF+  V   S  C  + + L+L RSLL+VSW I  E 
Sbjct: 12   PKSSRPWRFTWESLAHIPILRLYLFSPDVGDISALCRFVEASLRLDRSLLLVSWHIGDEG 71

Query: 215  AXXXXXXXXXXXX---LIDPGSGVEIKARDDHVEVRMVLVLPVDHPVAVNLRGAIGSGGD 385
            A               LI+P   V++KA  DH+EV++VL+LPVDHPV   + G+ G   D
Sbjct: 72   AVSNGRSFIWIPVPRVLINPECSVDVKAMADHIEVKLVLLLPVDHPVESEIWGSPGM--D 129

Query: 386  LERGRPLELDSDLKNLSSGGVHLFCKVCSARLTKQPLRRFMEMPSVNWREVADNWFGACC 565
             +R  PL L SD++  SS GV++FCK CSA+LT+QPLR+F+E+PS+NW+E+ADNWFG  C
Sbjct: 130  PDRCLPLSLGSDIEKFSSEGVNIFCKSCSAKLTRQPLRKFVEIPSMNWQEIADNWFGG-C 188

Query: 566  CSFGGISEKLVLQYIKTYDCAEGTCLLDSASVIICKDDLEGYTFQLYLEEQSDNSSKIDV 745
            CSFG +SEKLV +Y+ T+DC EGTCLLD ASV +C+ DLEG   +  + +    SS+   
Sbjct: 189  CSFGSVSEKLVSKYLDTFDCKEGTCLLDRASVAVCEHDLEGAVLERTVSDFGMKSSE--- 245

Query: 746  EPSGNRTDTMKEGCKNMSDNIPFVCATSSVANDVERNVLESDSFKCEKHTNPLACQLTSS 925
                N T +M+  C  +S         S   +D   + ++S S     ++N L       
Sbjct: 246  ---SNDTSSMR--CDLVS---------SRAWSDTSAHEVDSSS-----NSNQL------- 279

Query: 926  ADCNGLVNRHKSLFMESSRLAQLSLNEILSTVTVDSLSLNNDSENHEKLIQLSVDELSSI 1105
                GLV     + M+  +                  +LNN S N+EK +  S+ E +S 
Sbjct: 280  ----GLVGTQNDINMDCLK-----------------ANLNNFSLNNEKSLSESIVERTSK 318

Query: 1106 DHCHCCPDEAQYVVDANSQKLAENSRPEKIQKWLHDCSLGSGFMNRIPNLSDNVEWVTFS 1285
               HCC   A+ V  A   +   +      Q+ L     GSGFM    NLS++VE V FS
Sbjct: 319  CDFHCC--IAEKVQHAGGTEYEASIAACNAQQSLK--VYGSGFMVTSANLSNDVECVEFS 374

Query: 1286 CR 1291
            CR
Sbjct: 375  CR 376


>ref|XP_021273814.1| uncharacterized protein LOC110408968 [Herrania umbratica]
          Length = 565

 Score =  246 bits (627), Expect = 5e-72
 Identities = 165/428 (38%), Positives = 230/428 (53%), Gaps = 11/428 (2%)
 Frame = +2

Query: 38   NPRNRNQWRFTYETLAHIPTLRLYLFNSAVNPSLQCLNIRSDLQLARSLLVVSWI-DSEK 214
            NP+N  +WRFT+E  +H P LRL+LF+S + PS+QC N++  L L +S L+VSW+ + E+
Sbjct: 5    NPKNPRKWRFTWEAQSHSPILRLFLFDSQIKPSVQCKNLKVQLNLFQSQLLVSWLKEEEE 64

Query: 215  AXXXXXXXXXXXXLIDPGSGVEIKARDDHVEVRMVLVLPVDHPVAVNLRGAIGS---GGD 385
                         LID  S V  +A DDH+EV++VL+LPV HP+  +    + S   G D
Sbjct: 65   EEEVTVRVPIPRVLIDSESPVSFRALDDHIEVKLVLLLPVGHPIVSSFDSVLNSSENGDD 124

Query: 386  L---ERGRPLELDSDLKNLSS--GGVHLFCKVCSARLTKQPLRRFMEMPSVNWREVADNW 550
                +   PL +D+DLK+LSS   GVH +C+ CS RLTK PLR F++MPS++WREVADNW
Sbjct: 125  ALAPDAATPLVMDTDLKSLSSIEEGVHFYCRNCSIRLTKNPLRNFVDMPSIDWREVADNW 184

Query: 551  FGACCCSFGGISEKLVLQYIKTYDCAEGTCLLDSASVIICKDDLEGYTFQLYLEEQSDNS 730
            FGACCCSFGGISEK+V ++  +Y CA+G  LL   +V++ KDDL     +LY        
Sbjct: 185  FGACCCSFGGISEKMVTRFANSYTCAKGVGLLSFTTVVLSKDDL--VACKLY-------- 234

Query: 731  SKIDVEPSGNRTDTMKEGCKNMSDNIPF--VCATSSVANDVERNVLESDSFKCEKHTNPL 904
                     NRT   + G    SD +    + ++    ND+  N L S   K +  T  +
Sbjct: 235  ---------NRTQEHQPGSDFSSDCVSSDEMLSSRESTNDLCGN-LSSMHLKNDSVTTNV 284

Query: 905  ACQLTSSADCNGLVNRHKSLFMESSRLAQLSLNEILSTVTVDSLSLNNDSENHEKLIQLS 1084
               L +  + NG    HK LF   S L  L ++E  ++V    +   N   NH       
Sbjct: 285  ---LVTKEEANG----HK-LF---SALPVLDVSENETSVLGCCVHTENHIRNH------- 326

Query: 1085 VDELSSIDHCHCCPDEAQYVVDANSQKLAENSRPEKIQKWLHDCSLGSGFMNRIPNLSDN 1264
            VDE    +    CP      VD N+ KL  N      QK   + SLG+ FM +  NLS +
Sbjct: 327  VDEGGPHNVSETCP------VDENASKLLAN------QKLFLNGSLGNAFMAKSYNLSMD 374

Query: 1265 VEWVTFSC 1288
            +EW+ F C
Sbjct: 375  IEWMEFVC 382


>ref|XP_022751404.1| uncharacterized protein LOC111300062 isoform X1 [Durio zibethinus]
          Length = 561

 Score =  244 bits (624), Expect = 1e-71
 Identities = 158/426 (37%), Positives = 221/426 (51%), Gaps = 9/426 (2%)
 Frame = +2

Query: 38   NPRNRNQWRFTYETLAHIPTLRLYLFNSAVNPSLQCLNIRSDLQLARSLLVVSWIDSEKA 217
            N  N  +WRFT+E   H P LRL+LF+S   PSLQC N+   L L++S L VSW+  E  
Sbjct: 3    NLENPRKWRFTWEAQTHSPNLRLFLFDSQTKPSLQCKNLEVQLNLSQSQLFVSWLKVEGK 62

Query: 218  XXXXXXXXXXXXLIDPGSGVEIKARDDHVEVRMVLVLPVDHPVAVNLRGAIGSG--GD-- 385
                        LID  S V ++A DDH+EV++VL+LPVDHP+  +    + +   GD  
Sbjct: 63   EEVSVRVPIPRVLIDSESPVSLRALDDHIEVKLVLLLPVDHPIVSSFDSVLNTSENGDDA 122

Query: 386  --LERGRPLELDSDLKNLSS--GGVHLFCKVCSARLTKQPLRRFMEMPSVNWREVADNWF 553
              L+  +PL + +DLK+LSS   GVH +C+ CS RLTK PLR F+EMPS++WREVADNWF
Sbjct: 123  VELDAAKPLVMGTDLKSLSSMEEGVHFYCRNCSTRLTKSPLRNFLEMPSIDWREVADNWF 182

Query: 554  GACCCSFGGISEKLVLQYIKTYDCAEGTCLLDSASVIICKDDLEGYTFQLYLEEQSDNSS 733
            GACCCSFGGISEKLV ++ K++ C +G CLL+  +V++CKDDLE        +E      
Sbjct: 183  GACCCSFGGISEKLVTRFAKSFTCTKGLCLLNFTTVLLCKDDLEACKLYNGTQEYQPGP- 241

Query: 734  KIDVEPSGNRTDTMKEGCKNMSDNIPFVCATSSVANDVERNVLESDSFKCEKHTNPLACQ 913
              D       ++ M    + M+D                          CEK  +     
Sbjct: 242  --DFASGYGLSEDMLTSQERMND-------------------------LCEKFGS--VHI 272

Query: 914  LTSSADCNGLVNRHKSLFMESSRLAQLSLNEILSTVT-VDSLSLNNDSENHEKLIQLSVD 1090
               S++ N LV   ++   ES   + L +++I  T T V    ++ D       IQ  VD
Sbjct: 273  KNDSSNTNVLVTTEEANVQES--FSALPVSDISETETSVPGCCVHTDH------IQKHVD 324

Query: 1091 ELSSIDHCHCCPDEAQYVVDANSQKLAENSRPEKIQKWLHDCSLGSGFMNRIPNLSDNVE 1270
            + S  D          Y VD N+ +L  N      QK   +  LG+ FM +  NLS +++
Sbjct: 325  DGSHHD------VSETYPVDQNTSRLLAN------QKLFLNGFLGNAFMAKSYNLSMDID 372

Query: 1271 WVTFSC 1288
            W+ F C
Sbjct: 373  WMEFVC 378


>gb|OMO84492.1| hypothetical protein COLO4_22012 [Corchorus olitorius]
          Length = 552

 Score =  242 bits (617), Expect = 1e-70
 Identities = 155/427 (36%), Positives = 218/427 (51%), Gaps = 10/427 (2%)
 Frame = +2

Query: 38   NPRNRNQWRFTYETLAHIPTLRLYLFNSAVNPSLQCLNIRSDLQLARSLLVVSWIDSEKA 217
            NP N  +WRFT+E+ +H P LRL+LF S   PS+QC N++  L L++S L+VSW++ E+ 
Sbjct: 5    NPENPKKWRFTWESQSHSPNLRLFLFVSQTKPSIQCKNLKVQLNLSQSQLLVSWLEEEEE 64

Query: 218  XXXXXXXXXXXX-LIDPGSGVEIKARDDHVEVRMVLVLPVDHPVAVNLRGAI----GSGG 382
                         LID  S V  +A DDH+EV++VL+LPVDHP+  +    +     +G 
Sbjct: 65   KGVVSVRVPLPRVLIDSESPVSFRALDDHIEVKLVLLLPVDHPIVSSFDSVLLNLSENGN 124

Query: 383  D---LERGRPLELDSDLKNLSS--GGVHLFCKVCSARLTKQPLRRFMEMPSVNWREVADN 547
            D    +  +PL +D+DLK+LSS   GVH +C+ CS RLTK PLR F+EMPS++WRE ADN
Sbjct: 125  DAVAFDAAKPLIMDTDLKSLSSIEEGVHFYCRNCSNRLTKVPLRNFVEMPSIDWREAADN 184

Query: 548  WFGACCCSFGGISEKLVLQYIKTYDCAEGTCLLDSASVIICKDDLEGYTFQLYLEEQSDN 727
            WFG CCCSFGGISEKLV ++  +Y C +G CLL+  +V++CKDDL    F+LY       
Sbjct: 185  WFGNCCCSFGGISEKLVTKFANSYSCIKGVCLLNCTTVVLCKDDL--VAFELY------- 235

Query: 728  SSKIDVEPSGNRTDTMKEGCKNMSDNIPFVCATSSVANDVERNVLESDSFKCEKHTNPLA 907
               +  +P  N       G                            D    +K  N L+
Sbjct: 236  DGTLVYQPGPNFASEYGSG---------------------------EDMSSPQKRMNDLS 268

Query: 908  CQLTSSADCNGLVNRHKSLFMESSRLAQLSLNEILSTVTVDSLSLNNDSENHEKLIQLSV 1087
             +L S    N +V+    +  E              T  V +LS+   SEN   +    V
Sbjct: 269  GKLRSMHLKNDIVSTTGPVVEE-------------DTNGVCALSVTGLSENETLVPSCCV 315

Query: 1088 DELSSIDHCHCCPDEAQYVVDANSQKLAENSRPEKIQKWLHDCSLGSGFMNRIPNLSDNV 1267
             + S  +    CP      VD ++ KL  N      QK   +  LG+ FM +  NLS ++
Sbjct: 316  HKGSQNNVSETCP------VDQDTSKLLAN------QKLFLNGFLGNAFMAKSYNLSMDI 363

Query: 1268 EWVTFSC 1288
            EW  F+C
Sbjct: 364  EWREFTC 370


>ref|XP_008379646.1| PREDICTED: uncharacterized protein LOC103442619 [Malus domestica]
          Length = 557

 Score =  241 bits (614), Expect = 4e-70
 Identities = 154/431 (35%), Positives = 220/431 (51%), Gaps = 17/431 (3%)
 Frame = +2

Query: 47   NRNQWRFTYETLAHIPTLRLYLFNSAVNPSLQCLNIRSDLQLARSLLVVSWIDSEKAXXX 226
            N  +WRFT+E  +HIP LRL+LF+S   PS QC  +   +  A SL++VSW  +E A   
Sbjct: 10   NPRKWRFTWEAQSHIPILRLFLFDSCTKPSTQCRKLTVHITPAESLVLVSW--AEDAQEV 67

Query: 227  XXXXXXXXXLIDPGSGVEIKARDDHVEVRMVLVLPVDHPVAVNLRGAIGSGGDLERG--- 397
                     L+D  S V   A DDH+EV++VL+LPVDHP+ ++    +   G  E+    
Sbjct: 68   SLRVPMPRVLVDAESPVSFSALDDHIEVKLVLLLPVDHPIVLSFDSLLSLDGGEEKALEG 127

Query: 398  --RPLELDSDLKNLSSGGVHLFCKVCSARLTKQPLRRFMEMPSVNWREVADNWFGACCCS 571
              +PL L S++K+LSS GVH +C+ CS +LT  PL +F+EMPSVNWREVADNWFGACCCS
Sbjct: 128  ELKPLSLASEVKSLSSSGVHFYCRNCSFKLTASPLSQFVEMPSVNWREVADNWFGACCCS 187

Query: 572  FGGISEKLVLQYIKTYDCAEGTCLLDSASVIICKDDLEGYTFQLYLEEQSDNSSKIDVEP 751
            FGGISEKLV++Y  +Y CA+G CL++S ++ +CKDDL G+ F    E Q     + D E 
Sbjct: 188  FGGISEKLVVRYANSYMCAKGVCLVNSTNITLCKDDLVGFEFPDLGECQ-----RYDSES 242

Query: 752  SGNRTDTMKEGCKNMSDNIPFVC-------ATSSVANDVERNVLESDSFKCEKHTNPLAC 910
             G+  +   E   N+  N+   C       + S VAND      ES+S       +   C
Sbjct: 243  DGSGDNGFTESELNLGSNL--TCNEDFAAESKSEVAND------ESNSEDAPHLCSGSVC 294

Query: 911  QLTSSAD---CNGLVNRHKSLFMESSRL--AQLSLNEILSTVTVDSLSLNNDSENHEKLI 1075
             + +S+    CN +    ++   +S RL  +++SL +   T + + L      +NHE  +
Sbjct: 295  SIKNSSTPGCCNHMGCHVQNYDGDSCRLCFSEISLEDQKPTKSTEIL------KNHESFL 348

Query: 1076 QLSVDELSSIDHCHCCPDEAQYVVDANSQKLAENSRPEKIQKWLHDCSLGSGFMNRIPNL 1255
               ++ +                                             FM R  NL
Sbjct: 349  NGYLENI---------------------------------------------FMVRSSNL 363

Query: 1256 SDNVEWVTFSC 1288
            S +VEWV F C
Sbjct: 364  SIDVEWVEFFC 374


>ref|XP_021604018.1| uncharacterized protein LOC110609020 [Manihot esculenta]
 gb|OAY57451.1| hypothetical protein MANES_02G098000 [Manihot esculenta]
          Length = 573

 Score =  241 bits (614), Expect = 5e-70
 Identities = 159/424 (37%), Positives = 229/424 (54%), Gaps = 10/424 (2%)
 Frame = +2

Query: 47   NRNQWRFTYETLAHIPTLRLYLFNSAVNPSLQCLNIRSDLQLARSLLVVSWIDSEKAXXX 226
            N  +WRFT+ET +H P L+L++F+S    S+ C+++   L L +S L+VSWI  E     
Sbjct: 5    NPRKWRFTWETQSHSPNLKLFIFDSRTKSSIHCISLEVRLHLPQSQLLVSWI-GEDTEKI 63

Query: 227  XXXXXXXXXLIDPGSGVEIKARDDHVEVRMVLVLPVDHPVAVNLRGAIGSGGD----LER 394
                     LIDP S V  +A DDH+EV++VL+LPVDHP+  NL  ++   G+    L+ 
Sbjct: 64   SIRVPIPKLLIDPDSPVSFRALDDHIEVKLVLLLPVDHPIFSNL--SLSDDGENNEALDS 121

Query: 395  GRPLELDSDLKNLSS-GGVHLFCKVCSARLTKQPLRRFMEMPSVNWREVADNWFGACCCS 571
             +PL++DSDLKNLS+  GVH +C+ CS RLT+  +R+F+EMPSV+WRE+ADNWFGACCCS
Sbjct: 122  VKPLKMDSDLKNLSTMEGVHFYCQSCSTRLTRSCIRQFVEMPSVDWREMADNWFGACCCS 181

Query: 572  FGGISEKLVLQYIKTYDCAEGTCLLDSASVIICKDDLEGYTFQLYLEEQSDNSSKIDVEP 751
            FGGISEKLV ++   Y CA G CLL+S SVIICKDDL    F       +D +     EP
Sbjct: 182  FGGISEKLVNRFADAYTCARGLCLLNSTSVIICKDDLVASNF-------ADWNGIQRFEP 234

Query: 752  SGN---RTDTMKEGCKNMSDNI--PFVCATSSVANDVERNVLESDSFKCEKHTNPLACQL 916
              N   R    +E   +   N+     C   +   DV R  L S       H   + C++
Sbjct: 235  RENFAGRNSLSEEANLDFGSNLRSDASCDNHNEKADVNR-TLRSSHLNFYNHGEDIKCKV 293

Query: 917  TSSADCNGLVNRHKSLFMESSRLAQLSLNEILSTVTVDSLSLNNDSENHEKLIQLSVDEL 1096
                + N +      LF  +   + LS N          L   N +   ++ +++S  E+
Sbjct: 294  REE-EPNAI-----GLFY-AKPASDLSEN------VASELGCCNSTHYEQEYVEMSTHEV 340

Query: 1097 SSIDHCHCCPDEAQYVVDANSQKLAENSRPEKIQKWLHDCSLGSGFMNRIPNLSDNVEWV 1276
            S +            +VD N+ + A N+   + + +L+   LG+ FM R  NLS ++EW 
Sbjct: 341  SKLS-----------LVDQNNSE-AVNAMVNR-RSFLNG-FLGNVFMARSYNLSMDIEWK 386

Query: 1277 TFSC 1288
             F C
Sbjct: 387  QFVC 390


>ref|XP_017631359.1| PREDICTED: uncharacterized protein LOC108474012 [Gossypium arboreum]
 gb|KHG27086.1| isoleucine--trna ligase [Gossypium arboreum]
          Length = 561

 Score =  240 bits (613), Expect = 6e-70
 Identities = 159/430 (36%), Positives = 222/430 (51%), Gaps = 13/430 (3%)
 Frame = +2

Query: 38   NPRNRNQWRFTYETLAHIPTLRLYLFNSAVNPSLQCLNIRSDLQLARSLLVVSWIDSEKA 217
            NPRN   WRFT+E  +H P LRL+LF+S  NPS+QC N+   L L++S L+VSW+   + 
Sbjct: 6    NPRN---WRFTWEAQSHSPNLRLFLFDSQANPSVQCRNLEVQLNLSQSHLLVSWLKEGEK 62

Query: 218  XXXXXXXXXXXXLIDPGSGVEIKARDDHVEVRMVLVLPVDHPVAVNLRGAIGSGGD---- 385
                        LID  + V  +A DDH+EV++VL+LPVDHP+  +    + S  +    
Sbjct: 63   EEVSLRVPIPRVLIDSEAPVSFRALDDHIEVKLVLLLPVDHPIVSSFDLMLDSSENGYND 122

Query: 386  --LERGRPLELDSDLKNLSS-GGVHLFCKVCSARLTKQPLRRFMEMPSVNWREVADNWFG 556
              L+  +PL +D+DLK+LSS  GVH +C+ CS RLTK PLR F+EMPS++WREVADNWFG
Sbjct: 123  PSLDAAKPLVMDTDLKSLSSMEGVHFYCRKCSTRLTKSPLRNFVEMPSIDWREVADNWFG 182

Query: 557  ACCCSFGGISEKLVLQYIKTYDCAEGTCLLDSASVIICKDDLEGYTFQLYLEEQSDNSSK 736
             CCCSFGGISEKLV ++  +Y CA+G CLL+  ++++ KDDL    F+LY          
Sbjct: 183  GCCCSFGGISEKLVTRFANSYRCAKGVCLLNFTTILLFKDDL--VAFKLY---------- 230

Query: 737  IDVEPSGNRTDTMKEGCKNMSDNIPFVCATSSVANDVERNVLESDSFKCEKHTNPLACQL 916
                   N T   +      SD                 + L  D    ++ TN L  +L
Sbjct: 231  -------NGTHEYQPRPDFSSD-----------------SGLSEDMLSSQERTNDLCEKL 266

Query: 917  TS------SADCNGLVNRHKSLFMESSRLAQLSLNEILSTVTVDSLSLNNDSENHEKLIQ 1078
            +S      S   + LV + K+   E    + L L++   T T    S      +    IQ
Sbjct: 267  SSTHLKDNSVSTSALVTKEKASGNEF--FSALPLSDFSETET----SARGCCVHTADHIQ 320

Query: 1079 LSVDELSSIDHCHCCPDEAQYVVDANSQKLAENSRPEKIQKWLHDCSLGSGFMNRIPNLS 1258
               DE S       CP      VD N+ +L  N      QK   +  LG+ FM +  NLS
Sbjct: 321  NHFDEGSQHSVPETCP------VDQNTSQLLAN------QKLFLNGFLGNVFMAKSYNLS 368

Query: 1259 DNVEWVTFSC 1288
             +++W+ F C
Sbjct: 369  MDIDWMEFVC 378


>ref|XP_015889972.1| PREDICTED: uncharacterized protein LOC107424643, partial [Ziziphus
            jujuba]
          Length = 539

 Score =  239 bits (609), Expect = 1e-69
 Identities = 153/421 (36%), Positives = 212/421 (50%), Gaps = 7/421 (1%)
 Frame = +2

Query: 47   NRNQWRFTYETLAHIPTLRLYLFNSAVNPSLQCLNIRSDLQLARSLLVVSWIDSEKAXXX 226
            N  +WRFT+E  +HIPTLRL+LF+S  NPS QC N++  + L++S ++VSW         
Sbjct: 10   NPRKWRFTWEAQSHIPTLRLFLFDSHTNPSTQCQNLKVRVSLSQSSVLVSWCQETHISLR 69

Query: 227  XXXXXXXXXLIDPGSGVEIKARDDHVEVRMVLVLPVDHPVAVNLRGAIGSGGDLERG--- 397
                     L+D  S V  KA DDH+EV++VL+LPVDHP+  +    +    D+E     
Sbjct: 70   VPIPRV---LVDSESPVSFKALDDHIEVKLVLLLPVDHPIISSFDSILNLTDDVENASSD 126

Query: 398  ---RPLELDSDLKNLSS-GGVHLFCKVCSARLTKQPLRRFMEMPSVNWREVADNWFGACC 565
               + + +DSD+K+LSS GGV  +C+ CS +LT+ PLR F+EMPSVNWREVADNWFGACC
Sbjct: 127  ASSKRMSMDSDIKSLSSCGGVDFYCRSCSVKLTRSPLRNFVEMPSVNWREVADNWFGACC 186

Query: 566  CSFGGISEKLVLQYIKTYDCAEGTCLLDSASVIICKDDLEGYTFQLYLEEQSDNSSKIDV 745
            CSFGGISEKLV +Y+ +Y CA+G CLL+S ++ +CKDD+ G  F        D       
Sbjct: 187  CSFGGISEKLVTRYVNSYTCAKGVCLLNSTTIALCKDDIVGCNF-------PDLGGCQSY 239

Query: 746  EPSGNRTDTMKEGCKNMSDNIPFVCATSSVANDVERNVLESDSFKCEKHTNPLACQLTSS 925
            E  G+  D  +           F  +T +  ++  RN  E   F   KH          +
Sbjct: 240  EDEGDSIDDHE-----------FGGSTLNSGSNHSRN--EKSRFTQFKHEE-------FA 279

Query: 926  ADCNGLVNRHKSLFMESSRLAQLSLNEILSTVTVDSLSLNNDSENHEKLIQLSVDELSSI 1105
            A+C G  N    L   SS  +   +N  L+                              
Sbjct: 280  ANCEGKENNGDHLSHPSSE-SDFPVNLTLAQ----------------------------- 309

Query: 1106 DHCHCCPDEAQYVVDANSQKLAENSRPEKIQKWLHDCSLGSGFMNRIPNLSDNVEWVTFS 1285
                CC       +   SQK ++N      +K   +  LG+ FM R  NLS +VEW+ F 
Sbjct: 310  ---GCCMHHTSEAL-IESQKPSKNMEILDNKKSFLNGFLGNVFMVRSSNLSMDVEWIEFV 365

Query: 1286 C 1288
            C
Sbjct: 366  C 366


>ref|XP_015889905.1| PREDICTED: uncharacterized protein LOC107424576 [Ziziphus jujuba]
          Length = 554

 Score =  239 bits (609), Expect = 2e-69
 Identities = 153/421 (36%), Positives = 212/421 (50%), Gaps = 7/421 (1%)
 Frame = +2

Query: 47   NRNQWRFTYETLAHIPTLRLYLFNSAVNPSLQCLNIRSDLQLARSLLVVSWIDSEKAXXX 226
            N  +WRFT+E  +HIPTLRL+LF+S  NPS QC N++  + L++S ++VSW         
Sbjct: 10   NPRKWRFTWEAQSHIPTLRLFLFDSHTNPSTQCQNLKVRVSLSQSSVLVSWCQETHISLR 69

Query: 227  XXXXXXXXXLIDPGSGVEIKARDDHVEVRMVLVLPVDHPVAVNLRGAIGSGGDLERG--- 397
                     L+D  S V  KA DDH+EV++VL+LPVDHP+  +    +    D+E     
Sbjct: 70   VPIPRV---LVDSESPVSFKALDDHIEVKLVLLLPVDHPIISSFDSILNLTDDVENASSD 126

Query: 398  ---RPLELDSDLKNLSS-GGVHLFCKVCSARLTKQPLRRFMEMPSVNWREVADNWFGACC 565
               + + +DSD+K+LSS GGV  +C+ CS +LT+ PLR F+EMPSVNWREVADNWFGACC
Sbjct: 127  ASSKRMSMDSDIKSLSSCGGVDFYCRSCSVKLTRSPLRNFVEMPSVNWREVADNWFGACC 186

Query: 566  CSFGGISEKLVLQYIKTYDCAEGTCLLDSASVIICKDDLEGYTFQLYLEEQSDNSSKIDV 745
            CSFGGISEKLV +Y+ +Y CA+G CLL+S ++ +CKDD+ G  F        D       
Sbjct: 187  CSFGGISEKLVTRYVNSYTCAKGVCLLNSTTIALCKDDIVGCNF-------PDLGGCQSY 239

Query: 746  EPSGNRTDTMKEGCKNMSDNIPFVCATSSVANDVERNVLESDSFKCEKHTNPLACQLTSS 925
            E  G+  D  +           F  +T +  ++  RN  E   F   KH          +
Sbjct: 240  EDEGDSIDDHE-----------FGGSTLNSGSNHSRN--EKSRFTQFKHEE-------FA 279

Query: 926  ADCNGLVNRHKSLFMESSRLAQLSLNEILSTVTVDSLSLNNDSENHEKLIQLSVDELSSI 1105
            A+C G  N    L   SS  +   +N  L+                              
Sbjct: 280  ANCEGKENNGDHLSHPSSE-SDFPVNLTLAQ----------------------------- 309

Query: 1106 DHCHCCPDEAQYVVDANSQKLAENSRPEKIQKWLHDCSLGSGFMNRIPNLSDNVEWVTFS 1285
                CC       +   SQK ++N      +K   +  LG+ FM R  NLS +VEW+ F 
Sbjct: 310  ---GCCMHHTSEAL-IESQKPSKNMEILDNKKSFLNGFLGNVFMVRSSNLSMDVEWIEFV 365

Query: 1286 C 1288
            C
Sbjct: 366  C 366


Top