BLASTX nr result

ID: Cheilocostus21_contig00003566 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cheilocostus21_contig00003566
         (1886 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_009391774.1| PREDICTED: uncharacterized protein LOC103977...   309   6e-96
ref|XP_009407241.1| PREDICTED: uncharacterized protein LOC103989...   160   1e-39
ref|XP_009391295.1| PREDICTED: uncharacterized protein LOC103977...   156   7e-38
ref|XP_010928102.1| PREDICTED: uncharacterized protein LOC105049...   141   7e-32
ref|XP_019708160.1| PREDICTED: uncharacterized protein LOC105049...   141   9e-32
ref|XP_010928100.1| PREDICTED: uncharacterized protein LOC105049...   141   1e-31
ref|XP_019708159.1| PREDICTED: uncharacterized protein LOC105049...   141   1e-31
ref|XP_019708158.1| PREDICTED: uncharacterized protein LOC105049...   141   1e-31
ref|XP_010928105.1| PREDICTED: uncharacterized protein LOC105049...   138   5e-31
ref|XP_008788912.1| PREDICTED: uncharacterized protein LOC103706...   128   2e-27
dbj|GAV79538.1| hypothetical protein CFOL_v3_23003 [Cephalotus f...   117   2e-24
gb|OAY67143.1| hypothetical protein ACMD2_15842 [Ananas comosus]      114   2e-23
ref|XP_010257928.1| PREDICTED: uncharacterized protein LOC104597...   114   2e-23
ref|XP_010257925.1| PREDICTED: uncharacterized protein LOC104597...   114   3e-23
ref|XP_020111007.1| uncharacterized protein LOC109725996 [Ananas...   108   5e-22
gb|OMO72168.1| hypothetical protein COLO4_27800 [Corchorus olito...   110   7e-22
ref|XP_010943305.1| PREDICTED: uncharacterized protein LOC105061...   109   2e-21
ref|XP_010943304.1| PREDICTED: uncharacterized protein LOC105061...   109   2e-21
ref|XP_010943301.1| PREDICTED: uncharacterized protein LOC105061...   109   2e-21
ref|XP_022149065.1| uncharacterized protein LOC111017570 [Momord...   108   3e-21

>ref|XP_009391774.1| PREDICTED: uncharacterized protein LOC103977857 [Musa acuminata
            subsp. malaccensis]
 ref|XP_009391776.1| PREDICTED: uncharacterized protein LOC103977857 [Musa acuminata
            subsp. malaccensis]
 ref|XP_009391777.1| PREDICTED: uncharacterized protein LOC103977857 [Musa acuminata
            subsp. malaccensis]
          Length = 393

 Score =  309 bits (791), Expect = 6e-96
 Identities = 181/393 (46%), Positives = 225/393 (57%), Gaps = 14/393 (3%)
 Frame = +3

Query: 489  MDYEASTNQNQCGLRCTDVL---PYSSSDIFGEETNAYTDNAVAQIKIPDTIIYAKADCY 659
            MD EAS N+NQ     TD L   P  S+    EE   Y + +V ++K  + II++K D  
Sbjct: 1    MDNEASINRNQFDPEFTDALLRNPAGSTGSVEEEEKLYPEKSVTKVKPIEIIIFSKDDTC 60

Query: 660  PIVKDICVDEGLYCSDKVSLENAVSEKKSVFATSMNKDLNVQMTDGSPPDVQDSKFISTV 839
             +VKD+C++EG    +KV LEN    + S    +MN D+N QMTD + P  QDSKFIS V
Sbjct: 61   SVVKDMCIEEGSSSLEKVLLENKEVSEMSFSMINMNSDVNGQMTDNAAPATQDSKFISAV 120

Query: 840  DKSNGDHTCSWNLPETGEKPNTVDHVQSISYFPKISSELLVSPGDFDRDYPHA-----DS 1004
            DK   +   SW+L   GEKPNTVD V+S S   K+SSELL+S G+ + D+ H      D 
Sbjct: 121  DKVVEEQNYSWSLSVAGEKPNTVDQVKSNSSVLKLSSELLLSAGESETDHNHVVPTSFDF 180

Query: 1005 SSDQRQCIAE-----EKNENTCSTQNALPLGATESDTSIGAKENLTDHTSDILVDGYSTA 1169
            SSD+R CI E      K E+TCST   LP    E D S   KEN T   S I+ DG S  
Sbjct: 181  SSDRRHCIDEANIEQNKIEDTCSTMTTLPFSDVEPDKSNVVKENSTSDMSKIMGDGCSIT 240

Query: 1170 ADSVSDGGRNNYSPIEENPTTVEREVGRMDSRLGLASATA-RDVKRENIDGQEILQAQHC 1346
               +SD   +N+S  E NP   E +V   D      +AT  ++ +R + D QE +Q Q+ 
Sbjct: 241  THMISDVTVSNHSAAEGNPANSEMDVRVTDPTFESGAATTVKENRRGDRDSQEFVQVQNR 300

Query: 1347 SIPEMIVPDMTVGRTQSSLYHDNLGDFKFSGPKSSSGHSAFSGNIXXXXXXXXXXXXXFA 1526
            S+ +M VPD T    QS LYH   GD  FSGPK+SSGH A+SGNI             FA
Sbjct: 301  SVSDMTVPDATTAPAQSLLYHSTHGDLNFSGPKASSGHIAYSGNISMRSDSSTTSTRSFA 360

Query: 1527 FPILQAEWNTSPVKMAKARKHRSWRMGLICCKF 1625
            FPILQ EWNTSPVKMAKARK R WRM LICCKF
Sbjct: 361  FPILQTEWNTSPVKMAKARKPRRWRMSLICCKF 393


>ref|XP_009407241.1| PREDICTED: uncharacterized protein LOC103989979 [Musa acuminata
            subsp. malaccensis]
 ref|XP_018682784.1| PREDICTED: uncharacterized protein LOC103989979 [Musa acuminata
            subsp. malaccensis]
 ref|XP_018682785.1| PREDICTED: uncharacterized protein LOC103989979 [Musa acuminata
            subsp. malaccensis]
 ref|XP_018682786.1| PREDICTED: uncharacterized protein LOC103989979 [Musa acuminata
            subsp. malaccensis]
 ref|XP_018682787.1| PREDICTED: uncharacterized protein LOC103989979 [Musa acuminata
            subsp. malaccensis]
 ref|XP_018682788.1| PREDICTED: uncharacterized protein LOC103989979 [Musa acuminata
            subsp. malaccensis]
          Length = 404

 Score =  160 bits (404), Expect = 1e-39
 Identities = 139/424 (32%), Positives = 192/424 (45%), Gaps = 23/424 (5%)
 Frame = +3

Query: 423  NDLASTSFKNELEDIDSSPPSFMDYEASTNQNQCGLRCTDVLPYSSSDIFGEETNA--YT 596
            NDLA    +   +DI+SS P  +  +AS  QNQ G +    +P +S D+  E  N   Y 
Sbjct: 14   NDLAELIDRKPYQDIESSLPYHISKDASVIQNQNGNK---PMPSASDDLRAEIQNIELYA 70

Query: 597  DNAVAQIKIPDTIIYAKADCYPIVKDICVDEGLYCSDKVSLENAVSEKKSVF-----ATS 761
            D A   +                VKDIC+DE L    K+ LE++ + ++S+      AT+
Sbjct: 71   DKAATGV-------------CQTVKDICMDESLP-HKKILLESSDTTEESLAGIKPSATN 116

Query: 762  MNKDLNVQMTDGSPPDVQDSKFISTVDKSNGDHTCSWNLPETGEKPNTVDHVQSISYFPK 941
             + + + ++T+     +QD    S V K   D     +L E  +K     H+        
Sbjct: 117  TDDNPSGRLTECVTLIMQDLHIASVVAKDAADQYSLCSLVELEDKQKADVHITKHLSDHN 176

Query: 942  ISSELLVSPGDFDRDYPHADSSSDQRQCIAEE-----KNENTCSTQNALPLGATESDTSI 1106
            IS + L+S GDFD      D++   R  I +E     K++  CST  A     T+S  + 
Sbjct: 177  ISFQPLLSTGDFDMVPRQLDTNKFNRLHIFQENTGQVKHDEVCSTTLASSSITTDSKETS 236

Query: 1107 GAKENLTDHTS-DILVDGYSTAADSVSD-----GGRNNYSPIEENPTTVEREVGRMDSRL 1268
            G  +N T  TS   L  G STA    SD      G  N SP      T  +E+       
Sbjct: 237  GLVKNYTSVTSLGSLKGGCSTAEVEPSDVGGEKEGGGNVSPSFNPGATTNKEI------- 289

Query: 1269 GLASATARDVKRENIDGQEILQAQHCSIPEMIVPDMTVGRTQSSLYHDNLGDFKFSGPKS 1448
                    +    N D +  + AQ+C   E +V D     ++ S  H N GD   SGPKS
Sbjct: 290  --------EENSGNTDSESFIDAQNCFSGEEMVFDGVTSSSRCSCCHKNAGDPSSSGPKS 341

Query: 1449 SSGHSAFSGNIXXXXXXXXXXXXXFAFPILQAEWNTSPVKMAKA-----RKHRSWRMGLI 1613
            SSGH  FSGNI             FAFPILQ +WN+SPVKMAKA     +KHR WR G++
Sbjct: 342  SSGHIVFSGNI-SLRSDSTTSTRSFAFPILQPDWNSSPVKMAKADGRHLKKHRFWRSGIL 400

Query: 1614 CCKF 1625
            CCKF
Sbjct: 401  CCKF 404


>ref|XP_009391295.1| PREDICTED: uncharacterized protein LOC103977488 [Musa acuminata
            subsp. malaccensis]
 ref|XP_009391296.1| PREDICTED: uncharacterized protein LOC103977488 [Musa acuminata
            subsp. malaccensis]
 ref|XP_009391298.1| PREDICTED: uncharacterized protein LOC103977488 [Musa acuminata
            subsp. malaccensis]
 ref|XP_018678289.1| PREDICTED: uncharacterized protein LOC103977488 [Musa acuminata
            subsp. malaccensis]
 ref|XP_018678290.1| PREDICTED: uncharacterized protein LOC103977488 [Musa acuminata
            subsp. malaccensis]
 ref|XP_018678291.1| PREDICTED: uncharacterized protein LOC103977488 [Musa acuminata
            subsp. malaccensis]
          Length = 443

 Score =  156 bits (394), Expect = 7e-38
 Identities = 135/400 (33%), Positives = 194/400 (48%), Gaps = 35/400 (8%)
 Frame = +3

Query: 531  RCTDVLPYSSSDIFGE---ETNAYTDNAVAQIKIPDTIIYAKADCYPIVKDICVDEGLYC 701
            +CT+  PY+   + G    +T + +D  V QIK+ D       DC  ++KDICVD+GL+ 
Sbjct: 56   KCTEA-PYNVLKMIGTVETDTLSISDKTVTQIKVQDN------DCQ-VIKDICVDDGLHA 107

Query: 702  SDKVSLENAVSEKK-----SVFATSMNKDLNVQMTDGSPPDVQDSKFISTVDKSNGDHTC 866
             +K+ L+NA   +        +AT+ N + N QMT G+   V+  K+    D    D   
Sbjct: 108  FEKILLKNAAVSENIPSGFKTYATNANDNWNQQMTHGTALVVEGLKYTDAADDDK-DQKS 166

Query: 867  SWNLPETGEKPNTVDHVQSISYFPKISSELLVSPGDFDRDYPHA-----DSSSDQRQCIA 1031
            SW L E+ +K +  + + S +   KI+ + L S G+ D    H       S S  +Q I 
Sbjct: 167  SWGLFESKQKLDEGNQLASKTS-EKITLQHLFSLGELDTLAHHMVLTSYGSISSTKQSIG 225

Query: 1032 EEKNEN-TCSTQNALPLGATESDT--SIGAKENLTDHTSDILVDGYSTAADS----VSDG 1190
            +   E  T   +++  L    S T  SI   ENL D  S+  ++   +AA +    V+  
Sbjct: 226  QMNIEQVTLEEEHSENLQCDSSMTYESIMILENLNDDLSEGNLETKCSAATNDPLDVTFS 285

Query: 1191 GRNNYSPIEENPTTVEREVGRMDSRLGLASATAR----DVKRENIDGQEILQAQHCSIPE 1358
             RN  S +E+ P      V   D     A+A A     +   +N D ++++ A +    E
Sbjct: 286  DRNGDS-MEKPPNAQLEGVCNSDFNSVAAAAAAAISGTEKNNKNADDKQLVNALYHRSNE 344

Query: 1359 MIVPDMTVGRTQSSLYHDNLGDFKFSGP------KSSSGHSAFSGNIXXXXXXXXXXXXX 1520
             +V D T    QSS Y +N GD  FSGP      K SSGH A+SG+I             
Sbjct: 345  EMVFDATTASAQSSFYPNNHGDLHFSGPSYLSGPKVSSGHIAYSGSISLRSESSTTSTHS 404

Query: 1521 FAFPILQAEWNTSPVKMAKA-----RKHRSWRMGLICCKF 1625
            FAFPILQAEWNTSP++M KA     RK+R WR GL CCKF
Sbjct: 405  FAFPILQAEWNTSPIRMVKADRRHLRKYR-WRTGLFCCKF 443


>ref|XP_010928102.1| PREDICTED: uncharacterized protein LOC105049967 isoform X5 [Elaeis
            guineensis]
 ref|XP_010928103.1| PREDICTED: uncharacterized protein LOC105049967 isoform X5 [Elaeis
            guineensis]
 ref|XP_019708165.1| PREDICTED: uncharacterized protein LOC105049967 isoform X5 [Elaeis
            guineensis]
          Length = 569

 Score =  141 bits (355), Expect = 7e-32
 Identities = 160/560 (28%), Positives = 226/560 (40%), Gaps = 159/560 (28%)
 Frame = +3

Query: 423  NDLASTSFKNELEDIDSSPPSFMDYEASTNQNQCGLRCTDV---LPYSSSDIFGEETNAY 593
            +D+ S+  K   E ID S  S+M  + S N  +   RCT+    +P    ++   ET   
Sbjct: 13   DDVMSSIQKKGCEYIDPSKQSYMGIKESFNGKKNNDRCTEAPFTIPPDEINLGENETEFC 72

Query: 594  TDNAVAQIKIPDTIIYAKADCYPIVKDICVDEGLYCSDKVSLENAVSEKK--SVF---AT 758
            TD  V +I+ P+ IIY    C+ I KDICVDEG+   +K+ +EN  S ++  S F     
Sbjct: 73   TDKTVTEIQPPE-IIYKDGACHNI-KDICVDEGICSLEKILVENDESSQRFPSSFDDTVE 130

Query: 759  SMNKDLNVQMTDGSPPDVQDSKFIS-----TVDKSNGDHTCSWNLPETGEKPNTV----- 908
            S N  L+ +M DGS   V DS+  S     +V+K+  +H  S +L + G   +T      
Sbjct: 131  SGNDALSKEMADGSTTSVHDSRSTSYEHLISVEKNAMEH-FSGSLSKVGANFDTTNPFIS 189

Query: 909  ----DHVQSISYFPKISSEL---LVSPGDFD--RDYPH---------------------- 995
                +++     FP    E     V P +FD   D+ H                      
Sbjct: 190  DISDENISLKQCFPLYEFETDSQQVKPTNFDCGNDHKHYSESLSKVGGKFRATNPFLTDI 249

Query: 996  ADSSSDQRQCI-----------AEEKN-------------------ENTCSTQNALPLGA 1085
            +D +   +Q +           AE  N                   E   ST +A+P  A
Sbjct: 250  SDETISFKQFLSLQELETDSLQAEPTNFDCSNEQLADHQKFYQGTLEEEYSTMSAVPADA 309

Query: 1086 TESDTSIGAKENLT-DHTSDILVDGYSTAADSVSDGGRNNYSP-IEENP----------- 1226
             +S  S G+KEN +       L DGY TAA S+SD G +  S  I+E+P           
Sbjct: 310  RQSTQSNGSKENASISLAQGTLEDGYLTAASSLSDVGESEQSSGIKESPAKSSSEDTLEE 369

Query: 1227 -----TTVEREVGRMDSRLGLAS-------------------ATARDVKREN-------- 1310
                 T+   +V + D   G+                     A   D+K  N        
Sbjct: 370  GCSVTTSALYDVSKSDESSGIKEKPDEGLSQGLLKDVCSQDMALPSDIKESNESSRTMGN 429

Query: 1311 ------------------------IDGQEILQAQHCSIPEMIVPDMTVGRTQSSLYHDNL 1418
                                    ID Q   QA++  + E  V D      +SSL H+  
Sbjct: 430  PANAELESGVSGLASIGGQESKEKIDRQRDGQAKNDPVTEEAVSDNATTSARSSLVHNYH 489

Query: 1419 GDFKF------SGPKSSSGHSAFSGNIXXXXXXXXXXXXXFAFPILQAEWNTSPVKMAKA 1580
             D  F      SGP +SSGH  +SG+I             FAFPILQ+EWN+SPVKMAKA
Sbjct: 490  LDADFSEPVYMSGPIASSGHIPYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVKMAKA 549

Query: 1581 -----RKHRSWRMGLICCKF 1625
                 RKHR W +GL+CC+F
Sbjct: 550  DRRHLRKHRGWWLGLLCCRF 569


>ref|XP_019708160.1| PREDICTED: uncharacterized protein LOC105049967 isoform X4 [Elaeis
            guineensis]
 ref|XP_019708161.1| PREDICTED: uncharacterized protein LOC105049967 isoform X4 [Elaeis
            guineensis]
 ref|XP_019708162.1| PREDICTED: uncharacterized protein LOC105049967 isoform X4 [Elaeis
            guineensis]
 ref|XP_019708163.1| PREDICTED: uncharacterized protein LOC105049967 isoform X4 [Elaeis
            guineensis]
 ref|XP_019708164.1| PREDICTED: uncharacterized protein LOC105049967 isoform X4 [Elaeis
            guineensis]
          Length = 602

 Score =  141 bits (355), Expect = 9e-32
 Identities = 160/560 (28%), Positives = 226/560 (40%), Gaps = 159/560 (28%)
 Frame = +3

Query: 423  NDLASTSFKNELEDIDSSPPSFMDYEASTNQNQCGLRCTDV---LPYSSSDIFGEETNAY 593
            +D+ S+  K   E ID S  S+M  + S N  +   RCT+    +P    ++   ET   
Sbjct: 46   DDVMSSIQKKGCEYIDPSKQSYMGIKESFNGKKNNDRCTEAPFTIPPDEINLGENETEFC 105

Query: 594  TDNAVAQIKIPDTIIYAKADCYPIVKDICVDEGLYCSDKVSLENAVSEKK--SVF---AT 758
            TD  V +I+ P+ IIY    C+ I KDICVDEG+   +K+ +EN  S ++  S F     
Sbjct: 106  TDKTVTEIQPPE-IIYKDGACHNI-KDICVDEGICSLEKILVENDESSQRFPSSFDDTVE 163

Query: 759  SMNKDLNVQMTDGSPPDVQDSKFIS-----TVDKSNGDHTCSWNLPETGEKPNTV----- 908
            S N  L+ +M DGS   V DS+  S     +V+K+  +H  S +L + G   +T      
Sbjct: 164  SGNDALSKEMADGSTTSVHDSRSTSYEHLISVEKNAMEH-FSGSLSKVGANFDTTNPFIS 222

Query: 909  ----DHVQSISYFPKISSEL---LVSPGDFD--RDYPH---------------------- 995
                +++     FP    E     V P +FD   D+ H                      
Sbjct: 223  DISDENISLKQCFPLYEFETDSQQVKPTNFDCGNDHKHYSESLSKVGGKFRATNPFLTDI 282

Query: 996  ADSSSDQRQCI-----------AEEKN-------------------ENTCSTQNALPLGA 1085
            +D +   +Q +           AE  N                   E   ST +A+P  A
Sbjct: 283  SDETISFKQFLSLQELETDSLQAEPTNFDCSNEQLADHQKFYQGTLEEEYSTMSAVPADA 342

Query: 1086 TESDTSIGAKENLT-DHTSDILVDGYSTAADSVSDGGRNNYSP-IEENP----------- 1226
             +S  S G+KEN +       L DGY TAA S+SD G +  S  I+E+P           
Sbjct: 343  RQSTQSNGSKENASISLAQGTLEDGYLTAASSLSDVGESEQSSGIKESPAKSSSEDTLEE 402

Query: 1227 -----TTVEREVGRMDSRLGLAS-------------------ATARDVKREN-------- 1310
                 T+   +V + D   G+                     A   D+K  N        
Sbjct: 403  GCSVTTSALYDVSKSDESSGIKEKPDEGLSQGLLKDVCSQDMALPSDIKESNESSRTMGN 462

Query: 1311 ------------------------IDGQEILQAQHCSIPEMIVPDMTVGRTQSSLYHDNL 1418
                                    ID Q   QA++  + E  V D      +SSL H+  
Sbjct: 463  PANAELESGVSGLASIGGQESKEKIDRQRDGQAKNDPVTEEAVSDNATTSARSSLVHNYH 522

Query: 1419 GDFKF------SGPKSSSGHSAFSGNIXXXXXXXXXXXXXFAFPILQAEWNTSPVKMAKA 1580
             D  F      SGP +SSGH  +SG+I             FAFPILQ+EWN+SPVKMAKA
Sbjct: 523  LDADFSEPVYMSGPIASSGHIPYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVKMAKA 582

Query: 1581 -----RKHRSWRMGLICCKF 1625
                 RKHR W +GL+CC+F
Sbjct: 583  DRRHLRKHRGWWLGLLCCRF 602


>ref|XP_010928100.1| PREDICTED: uncharacterized protein LOC105049967 isoform X3 [Elaeis
            guineensis]
          Length = 605

 Score =  141 bits (355), Expect = 1e-31
 Identities = 160/560 (28%), Positives = 226/560 (40%), Gaps = 159/560 (28%)
 Frame = +3

Query: 423  NDLASTSFKNELEDIDSSPPSFMDYEASTNQNQCGLRCTDV---LPYSSSDIFGEETNAY 593
            +D+ S+  K   E ID S  S+M  + S N  +   RCT+    +P    ++   ET   
Sbjct: 49   DDVMSSIQKKGCEYIDPSKQSYMGIKESFNGKKNNDRCTEAPFTIPPDEINLGENETEFC 108

Query: 594  TDNAVAQIKIPDTIIYAKADCYPIVKDICVDEGLYCSDKVSLENAVSEKK--SVF---AT 758
            TD  V +I+ P+ IIY    C+ I KDICVDEG+   +K+ +EN  S ++  S F     
Sbjct: 109  TDKTVTEIQPPE-IIYKDGACHNI-KDICVDEGICSLEKILVENDESSQRFPSSFDDTVE 166

Query: 759  SMNKDLNVQMTDGSPPDVQDSKFIS-----TVDKSNGDHTCSWNLPETGEKPNTV----- 908
            S N  L+ +M DGS   V DS+  S     +V+K+  +H  S +L + G   +T      
Sbjct: 167  SGNDALSKEMADGSTTSVHDSRSTSYEHLISVEKNAMEH-FSGSLSKVGANFDTTNPFIS 225

Query: 909  ----DHVQSISYFPKISSEL---LVSPGDFD--RDYPH---------------------- 995
                +++     FP    E     V P +FD   D+ H                      
Sbjct: 226  DISDENISLKQCFPLYEFETDSQQVKPTNFDCGNDHKHYSESLSKVGGKFRATNPFLTDI 285

Query: 996  ADSSSDQRQCI-----------AEEKN-------------------ENTCSTQNALPLGA 1085
            +D +   +Q +           AE  N                   E   ST +A+P  A
Sbjct: 286  SDETISFKQFLSLQELETDSLQAEPTNFDCSNEQLADHQKFYQGTLEEEYSTMSAVPADA 345

Query: 1086 TESDTSIGAKENLT-DHTSDILVDGYSTAADSVSDGGRNNYSP-IEENP----------- 1226
             +S  S G+KEN +       L DGY TAA S+SD G +  S  I+E+P           
Sbjct: 346  RQSTQSNGSKENASISLAQGTLEDGYLTAASSLSDVGESEQSSGIKESPAKSSSEDTLEE 405

Query: 1227 -----TTVEREVGRMDSRLGLAS-------------------ATARDVKREN-------- 1310
                 T+   +V + D   G+                     A   D+K  N        
Sbjct: 406  GCSVTTSALYDVSKSDESSGIKEKPDEGLSQGLLKDVCSQDMALPSDIKESNESSRTMGN 465

Query: 1311 ------------------------IDGQEILQAQHCSIPEMIVPDMTVGRTQSSLYHDNL 1418
                                    ID Q   QA++  + E  V D      +SSL H+  
Sbjct: 466  PANAELESGVSGLASIGGQESKEKIDRQRDGQAKNDPVTEEAVSDNATTSARSSLVHNYH 525

Query: 1419 GDFKF------SGPKSSSGHSAFSGNIXXXXXXXXXXXXXFAFPILQAEWNTSPVKMAKA 1580
             D  F      SGP +SSGH  +SG+I             FAFPILQ+EWN+SPVKMAKA
Sbjct: 526  LDADFSEPVYMSGPIASSGHIPYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVKMAKA 585

Query: 1581 -----RKHRSWRMGLICCKF 1625
                 RKHR W +GL+CC+F
Sbjct: 586  DRRHLRKHRGWWLGLLCCRF 605


>ref|XP_019708159.1| PREDICTED: uncharacterized protein LOC105049967 isoform X2 [Elaeis
            guineensis]
          Length = 606

 Score =  141 bits (355), Expect = 1e-31
 Identities = 160/560 (28%), Positives = 226/560 (40%), Gaps = 159/560 (28%)
 Frame = +3

Query: 423  NDLASTSFKNELEDIDSSPPSFMDYEASTNQNQCGLRCTDV---LPYSSSDIFGEETNAY 593
            +D+ S+  K   E ID S  S+M  + S N  +   RCT+    +P    ++   ET   
Sbjct: 50   DDVMSSIQKKGCEYIDPSKQSYMGIKESFNGKKNNDRCTEAPFTIPPDEINLGENETEFC 109

Query: 594  TDNAVAQIKIPDTIIYAKADCYPIVKDICVDEGLYCSDKVSLENAVSEKK--SVF---AT 758
            TD  V +I+ P+ IIY    C+ I KDICVDEG+   +K+ +EN  S ++  S F     
Sbjct: 110  TDKTVTEIQPPE-IIYKDGACHNI-KDICVDEGICSLEKILVENDESSQRFPSSFDDTVE 167

Query: 759  SMNKDLNVQMTDGSPPDVQDSKFIS-----TVDKSNGDHTCSWNLPETGEKPNTV----- 908
            S N  L+ +M DGS   V DS+  S     +V+K+  +H  S +L + G   +T      
Sbjct: 168  SGNDALSKEMADGSTTSVHDSRSTSYEHLISVEKNAMEH-FSGSLSKVGANFDTTNPFIS 226

Query: 909  ----DHVQSISYFPKISSEL---LVSPGDFD--RDYPH---------------------- 995
                +++     FP    E     V P +FD   D+ H                      
Sbjct: 227  DISDENISLKQCFPLYEFETDSQQVKPTNFDCGNDHKHYSESLSKVGGKFRATNPFLTDI 286

Query: 996  ADSSSDQRQCI-----------AEEKN-------------------ENTCSTQNALPLGA 1085
            +D +   +Q +           AE  N                   E   ST +A+P  A
Sbjct: 287  SDETISFKQFLSLQELETDSLQAEPTNFDCSNEQLADHQKFYQGTLEEEYSTMSAVPADA 346

Query: 1086 TESDTSIGAKENLT-DHTSDILVDGYSTAADSVSDGGRNNYSP-IEENP----------- 1226
             +S  S G+KEN +       L DGY TAA S+SD G +  S  I+E+P           
Sbjct: 347  RQSTQSNGSKENASISLAQGTLEDGYLTAASSLSDVGESEQSSGIKESPAKSSSEDTLEE 406

Query: 1227 -----TTVEREVGRMDSRLGLAS-------------------ATARDVKREN-------- 1310
                 T+   +V + D   G+                     A   D+K  N        
Sbjct: 407  GCSVTTSALYDVSKSDESSGIKEKPDEGLSQGLLKDVCSQDMALPSDIKESNESSRTMGN 466

Query: 1311 ------------------------IDGQEILQAQHCSIPEMIVPDMTVGRTQSSLYHDNL 1418
                                    ID Q   QA++  + E  V D      +SSL H+  
Sbjct: 467  PANAELESGVSGLASIGGQESKEKIDRQRDGQAKNDPVTEEAVSDNATTSARSSLVHNYH 526

Query: 1419 GDFKF------SGPKSSSGHSAFSGNIXXXXXXXXXXXXXFAFPILQAEWNTSPVKMAKA 1580
             D  F      SGP +SSGH  +SG+I             FAFPILQ+EWN+SPVKMAKA
Sbjct: 527  LDADFSEPVYMSGPIASSGHIPYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVKMAKA 586

Query: 1581 -----RKHRSWRMGLICCKF 1625
                 RKHR W +GL+CC+F
Sbjct: 587  DRRHLRKHRGWWLGLLCCRF 606


>ref|XP_019708158.1| PREDICTED: uncharacterized protein LOC105049967 isoform X1 [Elaeis
            guineensis]
          Length = 638

 Score =  141 bits (355), Expect = 1e-31
 Identities = 160/560 (28%), Positives = 226/560 (40%), Gaps = 159/560 (28%)
 Frame = +3

Query: 423  NDLASTSFKNELEDIDSSPPSFMDYEASTNQNQCGLRCTDV---LPYSSSDIFGEETNAY 593
            +D+ S+  K   E ID S  S+M  + S N  +   RCT+    +P    ++   ET   
Sbjct: 82   DDVMSSIQKKGCEYIDPSKQSYMGIKESFNGKKNNDRCTEAPFTIPPDEINLGENETEFC 141

Query: 594  TDNAVAQIKIPDTIIYAKADCYPIVKDICVDEGLYCSDKVSLENAVSEKK--SVF---AT 758
            TD  V +I+ P+ IIY    C+ I KDICVDEG+   +K+ +EN  S ++  S F     
Sbjct: 142  TDKTVTEIQPPE-IIYKDGACHNI-KDICVDEGICSLEKILVENDESSQRFPSSFDDTVE 199

Query: 759  SMNKDLNVQMTDGSPPDVQDSKFIS-----TVDKSNGDHTCSWNLPETGEKPNTV----- 908
            S N  L+ +M DGS   V DS+  S     +V+K+  +H  S +L + G   +T      
Sbjct: 200  SGNDALSKEMADGSTTSVHDSRSTSYEHLISVEKNAMEH-FSGSLSKVGANFDTTNPFIS 258

Query: 909  ----DHVQSISYFPKISSEL---LVSPGDFD--RDYPH---------------------- 995
                +++     FP    E     V P +FD   D+ H                      
Sbjct: 259  DISDENISLKQCFPLYEFETDSQQVKPTNFDCGNDHKHYSESLSKVGGKFRATNPFLTDI 318

Query: 996  ADSSSDQRQCI-----------AEEKN-------------------ENTCSTQNALPLGA 1085
            +D +   +Q +           AE  N                   E   ST +A+P  A
Sbjct: 319  SDETISFKQFLSLQELETDSLQAEPTNFDCSNEQLADHQKFYQGTLEEEYSTMSAVPADA 378

Query: 1086 TESDTSIGAKENLT-DHTSDILVDGYSTAADSVSDGGRNNYSP-IEENP----------- 1226
             +S  S G+KEN +       L DGY TAA S+SD G +  S  I+E+P           
Sbjct: 379  RQSTQSNGSKENASISLAQGTLEDGYLTAASSLSDVGESEQSSGIKESPAKSSSEDTLEE 438

Query: 1227 -----TTVEREVGRMDSRLGLAS-------------------ATARDVKREN-------- 1310
                 T+   +V + D   G+                     A   D+K  N        
Sbjct: 439  GCSVTTSALYDVSKSDESSGIKEKPDEGLSQGLLKDVCSQDMALPSDIKESNESSRTMGN 498

Query: 1311 ------------------------IDGQEILQAQHCSIPEMIVPDMTVGRTQSSLYHDNL 1418
                                    ID Q   QA++  + E  V D      +SSL H+  
Sbjct: 499  PANAELESGVSGLASIGGQESKEKIDRQRDGQAKNDPVTEEAVSDNATTSARSSLVHNYH 558

Query: 1419 GDFKF------SGPKSSSGHSAFSGNIXXXXXXXXXXXXXFAFPILQAEWNTSPVKMAKA 1580
             D  F      SGP +SSGH  +SG+I             FAFPILQ+EWN+SPVKMAKA
Sbjct: 559  LDADFSEPVYMSGPIASSGHIPYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVKMAKA 618

Query: 1581 -----RKHRSWRMGLICCKF 1625
                 RKHR W +GL+CC+F
Sbjct: 619  DRRHLRKHRGWWLGLLCCRF 638


>ref|XP_010928105.1| PREDICTED: uncharacterized protein LOC105049967 isoform X6 [Elaeis
            guineensis]
          Length = 554

 Score =  138 bits (348), Expect = 5e-31
 Identities = 158/552 (28%), Positives = 221/552 (40%), Gaps = 159/552 (28%)
 Frame = +3

Query: 447  KNELEDIDSSPPSFMDYEASTNQNQCGLRCTDV---LPYSSSDIFGEETNAYTDNAVAQI 617
            K   E ID S  S+M  + S N  +   RCT+    +P    ++   ET   TD  V +I
Sbjct: 6    KKGCEYIDPSKQSYMGIKESFNGKKNNDRCTEAPFTIPPDEINLGENETEFCTDKTVTEI 65

Query: 618  KIPDTIIYAKADCYPIVKDICVDEGLYCSDKVSLENAVSEKK--SVF---ATSMNKDLNV 782
            + P+ IIY    C+ I KDICVDEG+   +K+ +EN  S ++  S F     S N  L+ 
Sbjct: 66   QPPE-IIYKDGACHNI-KDICVDEGICSLEKILVENDESSQRFPSSFDDTVESGNDALSK 123

Query: 783  QMTDGSPPDVQDSKFIS-----TVDKSNGDHTCSWNLPETGEKPNTV---------DHVQ 920
            +M DGS   V DS+  S     +V+K+  +H  S +L + G   +T          +++ 
Sbjct: 124  EMADGSTTSVHDSRSTSYEHLISVEKNAMEH-FSGSLSKVGANFDTTNPFISDISDENIS 182

Query: 921  SISYFPKISSEL---LVSPGDFD--RDYPH----------------------ADSSSDQR 1019
                FP    E     V P +FD   D+ H                      +D +   +
Sbjct: 183  LKQCFPLYEFETDSQQVKPTNFDCGNDHKHYSESLSKVGGKFRATNPFLTDISDETISFK 242

Query: 1020 QCI-----------AEEKN-------------------ENTCSTQNALPLGATESDTSIG 1109
            Q +           AE  N                   E   ST +A+P  A +S  S G
Sbjct: 243  QFLSLQELETDSLQAEPTNFDCSNEQLADHQKFYQGTLEEEYSTMSAVPADARQSTQSNG 302

Query: 1110 AKENLT-DHTSDILVDGYSTAADSVSDGGRNNYSP-IEENP----------------TTV 1235
            +KEN +       L DGY TAA S+SD G +  S  I+E+P                T+ 
Sbjct: 303  SKENASISLAQGTLEDGYLTAASSLSDVGESEQSSGIKESPAKSSSEDTLEEGCSVTTSA 362

Query: 1236 EREVGRMDSRLGLAS-------------------ATARDVKREN---------------- 1310
              +V + D   G+                     A   D+K  N                
Sbjct: 363  LYDVSKSDESSGIKEKPDEGLSQGLLKDVCSQDMALPSDIKESNESSRTMGNPANAELES 422

Query: 1311 ----------------IDGQEILQAQHCSIPEMIVPDMTVGRTQSSLYHDNLGDFKF--- 1433
                            ID Q   QA++  + E  V D      +SSL H+   D  F   
Sbjct: 423  GVSGLASIGGQESKEKIDRQRDGQAKNDPVTEEAVSDNATTSARSSLVHNYHLDADFSEP 482

Query: 1434 ---SGPKSSSGHSAFSGNIXXXXXXXXXXXXXFAFPILQAEWNTSPVKMAKA-----RKH 1589
               SGP +SSGH  +SG+I             FAFPILQ+EWN+SPVKMAKA     RKH
Sbjct: 483  VYMSGPIASSGHIPYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVKMAKADRRHLRKH 542

Query: 1590 RSWRMGLICCKF 1625
            R W +GL+CC+F
Sbjct: 543  RGWWLGLLCCRF 554


>ref|XP_008788912.1| PREDICTED: uncharacterized protein LOC103706556 [Phoenix dactylifera]
 ref|XP_008788914.1| PREDICTED: uncharacterized protein LOC103706556 [Phoenix dactylifera]
 ref|XP_008788915.1| PREDICTED: uncharacterized protein LOC103706556 [Phoenix dactylifera]
 ref|XP_017698139.1| PREDICTED: uncharacterized protein LOC103706556 [Phoenix dactylifera]
 ref|XP_017698140.1| PREDICTED: uncharacterized protein LOC103706556 [Phoenix dactylifera]
 ref|XP_017698141.1| PREDICTED: uncharacterized protein LOC103706556 [Phoenix dactylifera]
 ref|XP_017698142.1| PREDICTED: uncharacterized protein LOC103706556 [Phoenix dactylifera]
          Length = 575

 Score =  128 bits (321), Expect = 2e-27
 Identities = 142/561 (25%), Positives = 211/561 (37%), Gaps = 160/561 (28%)
 Frame = +3

Query: 423  NDLASTSFKNELEDIDSSPPSFMDYEASTNQNQCGLRCTDV---LPYSSSDIFGEETNAY 593
            +DL S+  K   E  D S   F+  E S +      +C +    +P     +   ET   
Sbjct: 17   DDLMSSIQKKAREYSDPSKQIFVGTEESFSGQMNDGKCMEYPFNIPADEIKLGENETVFC 76

Query: 594  TDNAVAQIKIPDTIIYAKADCYPIVKDICVDEGLYCSDKVSLENAVSEKKSVFATSM--- 764
            T+  V +I++P+ I   K   +  +KDICVDEG+   +K+ +EN   E   +F +S    
Sbjct: 77   TEKTVTEIELPEMIGCYKDGAFNNIKDICVDEGICSLEKILVEN--DESSQLFPSSFNYP 134

Query: 765  ----NKDLNVQMTDGSPPDVQDSK-----FISTVDKSNGDHTCSWNLPETGEKPNTV--- 908
                N  L+ +M D +     D K      +++VDK   +   S +L + G K +T    
Sbjct: 135  VASGNSALSKEMADAAATTADDFKSTSREHVTSVDKDGEEQHASGSLSKVGAKFDTANPF 194

Query: 909  ------DHVQSISYFP----------------------KISSELLVSPGD-------FDR 983
                  +++    Y P                      K SSE L   GD       F  
Sbjct: 195  LSDISDENISLKQYLPLHEFEIDPKQVEPTNIDCSDDHKHSSESLSEVGDEFRSTNPFLC 254

Query: 984  DYP-------------------------HADSSSDQ---RQCIAEEKNENTCSTQNALPL 1079
            D P                         + D +++Q   RQ   +   E  C    A+P 
Sbjct: 255  DIPDETISFKHFLSLQEIETDPQQVEPTNFDCNTEQLADRQKFNQGALEEECFITTAVPS 314

Query: 1080 GATESDTSIGAKENLT-------------------------------------DHTSDIL 1148
             A +S+ S G+KEN+T                                       + D L
Sbjct: 315  DARQSNNSCGSKENVTICLSQGTVEEGCLISTSALSDVRESGKSTGVKESPAKSLSEDTL 374

Query: 1149 VDGYSTAADS----------VSDGGRNNYSP------------IEENPTTVEREVGRMDS 1262
             +G  T+A S            +   NN SP            +  +     +  G M +
Sbjct: 375  QEGCFTSASSDVAESEESRETKENPSNNLSPGLLKDVCSPDTAVPSDAKEANQSSGTMGN 434

Query: 1263 RL---------GLASATARDVKRENIDGQEILQAQHCSIPEMIVPDMTVGRTQSSLYHDN 1415
                       G AS + ++   + ID Q+  QA+   +      D      +S   H+N
Sbjct: 435  PANAELESGVSGAASISGQEESSQKIDRQKDGQAKSDPVTGEAFSDNATASARSLFVHNN 494

Query: 1416 LGDFKF------SGPKSSSGHSAFSGNIXXXXXXXXXXXXXFAFPILQAEWNTSPVKMAK 1577
             GD  F      SGP +SSGH  +SG+I             FAFPILQ+EWN+SPVKMAK
Sbjct: 495  HGDLNFSEPVCLSGPIASSGHIPYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVKMAK 554

Query: 1578 A-----RKHRSWRMGLICCKF 1625
            A     RKHR WR+GL+CC+F
Sbjct: 555  ADHSHLRKHRGWRVGLLCCRF 575


>dbj|GAV79538.1| hypothetical protein CFOL_v3_23003 [Cephalotus follicularis]
          Length = 475

 Score =  117 bits (294), Expect = 2e-24
 Identities = 111/415 (26%), Positives = 181/415 (43%), Gaps = 44/415 (10%)
 Frame = +3

Query: 513  QNQCGLRCTDVLPYSSSD--IFGEETNAYTDNAVAQIKIPDTIIYAKADCYPIVKDICVD 686
            +N+  L C  V  YS++D   F + +  Y + +V + ++P+ I+  K   Y +VKDIC++
Sbjct: 67   RNESKLDCPVVANYSTNDNESFEKHSVFYFNRSVMKCELPELILCYKESPYHVVKDICIN 126

Query: 687  EGLYCSDKVSLENAVSEKKSVFATSMNKDLNVQMTDGSPPD--VQDSKFISTVDKSNGDH 860
            E +   DK     +  ++KSV     + D N++ T+G P D  +  +   S  + S+ D 
Sbjct: 127  EDVPSKDKNLFFESGVDEKSVCTFPPDMDQNIESTEGKPFDMPIPVAMKASAENDSDKDI 186

Query: 861  TCSWNLPE-----------TGEKPNTVDHVQSISYFPKISSELLVSPGDFDRDYPHADSS 1007
               +++P+           T +  N +   Q IS    +S E L S   F +       +
Sbjct: 187  NDKYDIPDLMPIGEVQDDATDKNANDIPK-QKISLGDMLSMEKLHSENTFSKSCDVVSKN 245

Query: 1008 SDQRQCIAEEKNENTCSTQNALPLGATESDTSIGAKENLTDHTSDI-LVDGYSTAADSVS 1184
            ++Q     +  +E T ++  A    + ES+ S    E   + + D+ L      +A   S
Sbjct: 246  AEQLS--VQSSSEKTVASSLASLSTSDESNNSGNRTEESNNDSEDLTLASPTLVSATKES 303

Query: 1185 DGGRNN--------YSPIEENPTT-----------VEREVGRMDSRLGLASATARDVKRE 1307
            D GR+          S  EE+  +           VE      D   G  +A+ R    +
Sbjct: 304  DSGRDEMVFVSPAIVSASEESANSSFSNDLSYNSKVETGSITFDFNSGAPAASDRKECPQ 363

Query: 1308 NIDGQEILQAQHCSIPEMIVPDMTVGRTQSSLYHDNLGDFKFSGPKSS----SGHSAFSG 1475
              + + +   Q  S  E     +   +TQ S        F  +GP S     SG  A+SG
Sbjct: 364  ITESECLDDTQSSSRLEDADIQLVTSQTQHS---HGESSFSTAGPISGSIIYSGPIAYSG 420

Query: 1476 NIXXXXXXXXXXXXXFAFPILQAEWNTSPVKMAKA-----RKHRSWRMGLICCKF 1625
            ++             FAFP+LQ+EWN+SPV+MAKA     RKHR WR GL+CC+F
Sbjct: 421  SVSLRSDSSTTSTRSFAFPVLQSEWNSSPVRMAKADRRHYRKHRGWRQGLLCCRF 475


>gb|OAY67143.1| hypothetical protein ACMD2_15842 [Ananas comosus]
          Length = 436

 Score =  114 bits (286), Expect = 2e-23
 Identities = 107/399 (26%), Positives = 165/399 (41%), Gaps = 49/399 (12%)
 Frame = +3

Query: 576  EETNAYTDNAVAQIKIPDTIIYAKADCYPIVKDICVDEGLYCSDKVSLENAVSEKKSVFA 755
            EE     + A  + ++ + ++        +VKDICVDEGL+  +     +A        A
Sbjct: 57   EEKRFSIEKAAVETELSNMVVSLDNSASEVVKDICVDEGLHSLNSNFDHSA--------A 108

Query: 756  TSMNKDLNVQMTDGSPP----DVQDSKFISTVDKSNGDHTCSWNLPETGEKPNTVDHVQS 923
               ++D+  ++TDG+      D++       +D ++G      +   +G+KP   + +  
Sbjct: 109  NGNHRDIMKEITDGAAAGITGDMESRSVNGPLDSNSGAEHFLSDSFSSGDKPKAAEEITG 168

Query: 924  ISYFPKISSELLVSPGDFDRDYPHAD------SSSDQRQCIAEEKNEN---------TCS 1058
                 KIS + L+S  D   +   +       SSSD+++       E          T  
Sbjct: 169  DICSGKISLKDLLSANDCASEPQKSSPISLNFSSSDRKRSFNSRTVEQDTVKEYYAVTSV 228

Query: 1059 TQNALPLGAT----ESDTSIGAKENLTDHTSDILV------DGYSTAADSVSDGGRNNYS 1208
               A  L  T    ES T+   K +L D +  I V         S+A  +    G   + 
Sbjct: 229  ASEAATLNQTSKIKESPTNGITKGSLEDRSGTITVFLNKNESSQSSAPSAHIANGSTLFE 288

Query: 1209 PIEENP----------TTVEREVGRMDSRLGLASATARDVKRENIDGQEILQAQHCSIPE 1358
            P+E N               RE G MD             +R   D +++ +A+     +
Sbjct: 289  PVEANSYDDGVDIDDFNPRARETGGMDGS-----------ERTTKDSEQLNRAEKGPAVD 337

Query: 1359 MIVPDMTVGRTQSSLYHDN-----LGDFKFSGPKSSSGHSAFSGNIXXXXXXXXXXXXXF 1523
                D      ++S++H +      G    SGPK+SSGH  +SG+I             F
Sbjct: 338  GSKVDGATNSVRNSVHHYDGDMIMSGPIISSGPKASSGHIPYSGSISHRSDSSTASTRSF 397

Query: 1524 AFPILQAEWNTSPVKMAKA-----RKHRSWRMGLICCKF 1625
            AFPILQ EWNTSPVKMAKA     RKH SWR+GL CC+F
Sbjct: 398  AFPILQTEWNTSPVKMAKADRGRFRKHHSWRVGLFCCRF 436


>ref|XP_010257928.1| PREDICTED: uncharacterized protein LOC104597869 isoform X2 [Nelumbo
            nucifera]
          Length = 415

 Score =  114 bits (284), Expect = 2e-23
 Identities = 109/388 (28%), Positives = 164/388 (42%), Gaps = 25/388 (6%)
 Frame = +3

Query: 537  TDVLPYSSSDIFGEETNAYTDNAVAQIKIPDTIIYAKADCYPIVKDICVDEGLYCSDKVS 716
            T VLP     +  + T  YTD +V + ++P+ I+  K   Y +VKDICVDEG+   DK+ 
Sbjct: 62   TYVLPSGEIKLSEKVTKFYTDKSVMECEVPELIVCFKEGPYHVVKDICVDEGVPSQDKIL 121

Query: 717  LENAVSEKKSVFATS---MNKDLNVQMTDG--SPPDVQDSKFISTVDKSNGDHTCSWNLP 881
             EN   + K     S   +N DL  QM        DV  S   S  +K+      S +L 
Sbjct: 122  TENGQVDCKPCSMHSDLDVNSDLTKQMVGSVTLDSDVMKSLVQSDCEKNTDSQCNSKDLF 181

Query: 882  ETGEKPNTVDHVQSISYF--PKISSELLVSPGDFDR-----DYPHADSSSDQRQCIAEEK 1040
            +  EK   V+   + ++    K+ SE ++S G         +  + DS+ +Q+    +  
Sbjct: 182  QKDEKNADVEDEIAHAHILDKKVMSENMLSVGKLKTEKSCPELTNFDSNGEQQAHNQDMS 241

Query: 1041 NENTCSTQNALPLGATESDTSIGAKENLTDHTSDILVDGYSTAADSVSDGGRNNYSPIEE 1220
             E T +  +A+P  A ESD+S                               +N  P+  
Sbjct: 242  REGTLA-NSAVPSPAAESDSS-----------------------------NPDNKVPLN- 270

Query: 1221 NPTTVEREVGRMDSRLGLASATARDVKRENIDGQEILQA--QHCSIPEMIVPDMTVGRTQ 1394
              + VE      DS    ++ + R   ++  D  + L        + +  V  +T     
Sbjct: 271  --SKVENRSITFDSNPSTSATSGRVESKQKADSPQPLHTLLNTSRLEDGPVESLTASSRS 328

Query: 1395 SSLYHDNLGDFKFS--GPKSS----SGHSAFSGNIXXXXXXXXXXXXXFAFPILQAEWNT 1556
              + H + G+  FS  GP S     SG   +SG+I             FAFPIL +EWN+
Sbjct: 329  FFIQHGH-GESSFSAVGPMSGSITYSGPIPYSGSISLRSDSSTTSNRSFAFPILHSEWNS 387

Query: 1557 SPVKMAKA-----RKHRSWRMGLICCKF 1625
            SPVKMAKA     RKHR W+M  +CC F
Sbjct: 388  SPVKMAKADQRHFRKHRRWKMNFLCCSF 415


>ref|XP_010257925.1| PREDICTED: uncharacterized protein LOC104597869 isoform X1 [Nelumbo
            nucifera]
 ref|XP_010257926.1| PREDICTED: uncharacterized protein LOC104597869 isoform X1 [Nelumbo
            nucifera]
 ref|XP_010257927.1| PREDICTED: uncharacterized protein LOC104597869 isoform X1 [Nelumbo
            nucifera]
          Length = 453

 Score =  114 bits (284), Expect = 3e-23
 Identities = 109/388 (28%), Positives = 164/388 (42%), Gaps = 25/388 (6%)
 Frame = +3

Query: 537  TDVLPYSSSDIFGEETNAYTDNAVAQIKIPDTIIYAKADCYPIVKDICVDEGLYCSDKVS 716
            T VLP     +  + T  YTD +V + ++P+ I+  K   Y +VKDICVDEG+   DK+ 
Sbjct: 100  TYVLPSGEIKLSEKVTKFYTDKSVMECEVPELIVCFKEGPYHVVKDICVDEGVPSQDKIL 159

Query: 717  LENAVSEKKSVFATS---MNKDLNVQMTDG--SPPDVQDSKFISTVDKSNGDHTCSWNLP 881
             EN   + K     S   +N DL  QM        DV  S   S  +K+      S +L 
Sbjct: 160  TENGQVDCKPCSMHSDLDVNSDLTKQMVGSVTLDSDVMKSLVQSDCEKNTDSQCNSKDLF 219

Query: 882  ETGEKPNTVDHVQSISYF--PKISSELLVSPGDFDR-----DYPHADSSSDQRQCIAEEK 1040
            +  EK   V+   + ++    K+ SE ++S G         +  + DS+ +Q+    +  
Sbjct: 220  QKDEKNADVEDEIAHAHILDKKVMSENMLSVGKLKTEKSCPELTNFDSNGEQQAHNQDMS 279

Query: 1041 NENTCSTQNALPLGATESDTSIGAKENLTDHTSDILVDGYSTAADSVSDGGRNNYSPIEE 1220
             E T +  +A+P  A ESD+S                               +N  P+  
Sbjct: 280  REGTLA-NSAVPSPAAESDSS-----------------------------NPDNKVPLN- 308

Query: 1221 NPTTVEREVGRMDSRLGLASATARDVKRENIDGQEILQA--QHCSIPEMIVPDMTVGRTQ 1394
              + VE      DS    ++ + R   ++  D  + L        + +  V  +T     
Sbjct: 309  --SKVENRSITFDSNPSTSATSGRVESKQKADSPQPLHTLLNTSRLEDGPVESLTASSRS 366

Query: 1395 SSLYHDNLGDFKFS--GPKSS----SGHSAFSGNIXXXXXXXXXXXXXFAFPILQAEWNT 1556
              + H + G+  FS  GP S     SG   +SG+I             FAFPIL +EWN+
Sbjct: 367  FFIQHGH-GESSFSAVGPMSGSITYSGPIPYSGSISLRSDSSTTSNRSFAFPILHSEWNS 425

Query: 1557 SPVKMAKA-----RKHRSWRMGLICCKF 1625
            SPVKMAKA     RKHR W+M  +CC F
Sbjct: 426  SPVKMAKADQRHFRKHRRWKMNFLCCSF 453


>ref|XP_020111007.1| uncharacterized protein LOC109725996 [Ananas comosus]
          Length = 362

 Score =  108 bits (271), Expect = 5e-22
 Identities = 103/370 (27%), Positives = 154/370 (41%), Gaps = 49/370 (13%)
 Frame = +3

Query: 663  IVKDICVDEGLYCSDKVSLENAVSEKKSVFATSMNKDLNVQMTDGSPP----DVQDSKFI 830
            +VKDICVDEGL+  +     +A        A   ++D+  ++TDG+      D++     
Sbjct: 12   VVKDICVDEGLHSLNSNFDHSA--------ANGNHRDIMKEITDGAAAGITGDMESRSVN 63

Query: 831  STVDKSNGDHTCSWNLPETGEKPNTVDHVQSISYFPKISSELLVSPGDFDRDYPHAD--- 1001
              +D ++G      +   +G+KP   + +       KIS + L+S  D   +   +    
Sbjct: 64   GPLDSNSGAEHFLSDSFSSGDKPKAAEEITGDICSGKISLKDLLSANDCASEPQKSSPIS 123

Query: 1002 ---SSSDQRQCIAEEKNEN---------TCSTQNALPLGAT----ESDTSIGAKENLTDH 1133
               SSSD+++       E          T     A  L  T    ES T+   K +L D 
Sbjct: 124  LNFSSSDRKRSFNSRTVEQDTVKEYYAVTSVASEAATLNQTSKIKESPTNGITKGSLEDR 183

Query: 1134 TSDILV------DGYSTAADSVSDGGRNNYSPIEENP----------TTVEREVGRMDSR 1265
            +  I V         S+A  +    G   + P+E N               RE G MD  
Sbjct: 184  SGTITVFLNKNESSQSSAPSAHIANGSTLFEPVEANSYDDGVDIDDFNPRARETGGMDGG 243

Query: 1266 LGLASATARDVKRENIDGQEILQAQHCSIPEMIVPDMTVGRTQSSLYHDN-----LGDFK 1430
                       +R   D +++ +A+     +    D      ++S++H +      G   
Sbjct: 244  -----------ERTTKDSEQLNRAEKGPAVDGSKVDGATNSVRNSVHHYDGDMIMSGPII 292

Query: 1431 FSGPKSSSGHSAFSGNIXXXXXXXXXXXXXFAFPILQAEWNTSPVKMAKA-----RKHRS 1595
             SGPK+SSGH  +SG+I             FAFPILQ EWNTSPVKMAKA     RKH  
Sbjct: 293  SSGPKASSGHIPYSGSISHRSDSSTTSTRSFAFPILQTEWNTSPVKMAKADRGRFRKHHG 352

Query: 1596 WRMGLICCKF 1625
            WR+GL CC+F
Sbjct: 353  WRVGLFCCRF 362


>gb|OMO72168.1| hypothetical protein COLO4_27800 [Corchorus olitorius]
          Length = 503

 Score =  110 bits (275), Expect = 7e-22
 Identities = 108/402 (26%), Positives = 172/402 (42%), Gaps = 57/402 (14%)
 Frame = +3

Query: 591  YTDNAVAQIKIPDTIIYAKADCYPIVKDICVDEGLYCSDKVSLENAVSEKKSV-FATSMN 767
            Y D +V +  +P+ ++  K + Y +VKDIC+DEG+   DK   E+ ++EK +  F  S  
Sbjct: 125  YLDKSVMECDLPELVVCYKENTYHVVKDICIDEGVPTQDKFLFESDMNEKNNCNFLPSCK 184

Query: 768  KDLNVQMTDGSPPDVQDSKFIST--------------VDKSNGDHTCSW----------- 872
                 Q    S P+ Q  K I                 D+SN  + C +           
Sbjct: 185  LVEEKQDIPISSPEDQSGKNIDNGCDFNEKLDADACRQDESNKGNQCDFEDFMMKRKVKD 244

Query: 873  ----NLPETGEKP-NTVDHVQSISYFPKISSELLVSPGDFDRDYPHADSSSDQRQC---- 1025
                 +P+   K   T+  + S++    ++S+ + S    D     +  SS +++     
Sbjct: 245  EEMKTIPDDLSKELFTLGELLSMTELSTVTSKAMSSECKSDGIEQQSIQSSSEKEVNVNP 304

Query: 1026 ----IAEEKNENTCSTQNALPLGATESDTSIGAKENLTDHTSDILVDGYS---TAADSVS 1184
                +AEE N NT +  +A  L +   ++  G ++ +   TS + V   S   T ++ VS
Sbjct: 305  PSVFVAEESNNNTEAMLDAPGLISAAGESDNGKEDAIPISTSQVSVSEESTNNTLSNEVS 364

Query: 1185 DGGRNNYSPIEENPTTVEREVGRMDSRLGLASAT-ARDVKRENIDGQEILQAQHCSIPEM 1361
            D  R     I  N               G ++ T ++D  R N+         +C +PE 
Sbjct: 365  DDNRLETESITFN--------------FGSSAPTNSKDECRPNL---------NCELPET 401

Query: 1362 -IVPDM--TVGRTQSSLYHDNLGDFKFS------GPKSSSGHSAFSGNIXXXXXXXXXXX 1514
               P +  T  +  S++     G+  FS      G  S SG  A+SG++           
Sbjct: 402  GTTPKLEDTADQPISNILQRGTGETSFSASGPVTGLISYSGPIAYSGSLSLRSDSSTTST 461

Query: 1515 XXFAFPILQAEWNTSPVKMAKA-----RKHRSWRMGLICCKF 1625
              FAFP+LQ+EWN+SPV+MAKA     RKHR WR GL CC+F
Sbjct: 462  RSFAFPVLQSEWNSSPVRMAKADRRHYRKHRGWRHGLFCCRF 503


>ref|XP_010943305.1| PREDICTED: uncharacterized protein LOC105061062 isoform X7 [Elaeis
            guineensis]
          Length = 535

 Score =  109 bits (272), Expect = 2e-21
 Identities = 80/245 (32%), Positives = 113/245 (46%), Gaps = 44/245 (17%)
 Frame = +3

Query: 1023 CIAEEKNENTCSTQNALPLGATESDTSIGAKENLTDHTS-DILVDGYSTAADSVSDGGRN 1199
            C+++   E  CS   +      ES  S G KE+     S D L +G S +A S   G   
Sbjct: 292  CLSQGTAEEGCSMATSASSDVRESGQSNGVKESPAKSLSEDTLQEGCSASASSDVCGSEE 351

Query: 1200 NYSPIEENPTT-----VEREVGRMDSRL---------------------------GLASA 1283
            + + I+ENP+      + ++V   D+ +                           G AS 
Sbjct: 352  S-NEIQENPSNNLSPGLLKDVCSPDTAVPSDAKEANQSGGTVGNPGNAELENGVSGAASI 410

Query: 1284 TARDVKRENIDGQEILQAQHCSIPEMIVPDMTVGRTQSSLYHDNLGDFKFS------GPK 1445
            + ++   + ID Q+  QA++  + E  V D      +SS  H+N GD  +S      GP 
Sbjct: 411  SGQEESNQKIDRQKDGQAKNDPVTEEAVSDNATASARSSFVHNNHGDLNYSEPVYLSGPI 470

Query: 1446 SSSGHSAFSGNIXXXXXXXXXXXXXFAFPILQAEWNTSPVKMAKA-----RKHRSWRMGL 1610
             SSGH  +SG+I             FAFPILQ EWN+SPVKMAKA     RKHR WR+GL
Sbjct: 471  VSSGHIPYSGSISLRSDSSTTSTRSFAFPILQPEWNSSPVKMAKADHSHLRKHRGWRVGL 530

Query: 1611 ICCKF 1625
            +CC+F
Sbjct: 531  LCCRF 535


>ref|XP_010943304.1| PREDICTED: uncharacterized protein LOC105061062 isoform X6 [Elaeis
            guineensis]
          Length = 545

 Score =  109 bits (272), Expect = 2e-21
 Identities = 80/245 (32%), Positives = 113/245 (46%), Gaps = 44/245 (17%)
 Frame = +3

Query: 1023 CIAEEKNENTCSTQNALPLGATESDTSIGAKENLTDHTS-DILVDGYSTAADSVSDGGRN 1199
            C+++   E  CS   +      ES  S G KE+     S D L +G S +A S   G   
Sbjct: 302  CLSQGTAEEGCSMATSASSDVRESGQSNGVKESPAKSLSEDTLQEGCSASASSDVCGSEE 361

Query: 1200 NYSPIEENPTT-----VEREVGRMDSRL---------------------------GLASA 1283
            + + I+ENP+      + ++V   D+ +                           G AS 
Sbjct: 362  S-NEIQENPSNNLSPGLLKDVCSPDTAVPSDAKEANQSGGTVGNPGNAELENGVSGAASI 420

Query: 1284 TARDVKRENIDGQEILQAQHCSIPEMIVPDMTVGRTQSSLYHDNLGDFKFS------GPK 1445
            + ++   + ID Q+  QA++  + E  V D      +SS  H+N GD  +S      GP 
Sbjct: 421  SGQEESNQKIDRQKDGQAKNDPVTEEAVSDNATASARSSFVHNNHGDLNYSEPVYLSGPI 480

Query: 1446 SSSGHSAFSGNIXXXXXXXXXXXXXFAFPILQAEWNTSPVKMAKA-----RKHRSWRMGL 1610
             SSGH  +SG+I             FAFPILQ EWN+SPVKMAKA     RKHR WR+GL
Sbjct: 481  VSSGHIPYSGSISLRSDSSTTSTRSFAFPILQPEWNSSPVKMAKADHSHLRKHRGWRVGL 540

Query: 1611 ICCKF 1625
            +CC+F
Sbjct: 541  LCCRF 545


>ref|XP_010943301.1| PREDICTED: uncharacterized protein LOC105061062 isoform X4 [Elaeis
            guineensis]
 ref|XP_010943302.1| PREDICTED: uncharacterized protein LOC105061062 isoform X4 [Elaeis
            guineensis]
          Length = 556

 Score =  109 bits (272), Expect = 2e-21
 Identities = 80/245 (32%), Positives = 113/245 (46%), Gaps = 44/245 (17%)
 Frame = +3

Query: 1023 CIAEEKNENTCSTQNALPLGATESDTSIGAKENLTDHTS-DILVDGYSTAADSVSDGGRN 1199
            C+++   E  CS   +      ES  S G KE+     S D L +G S +A S   G   
Sbjct: 313  CLSQGTAEEGCSMATSASSDVRESGQSNGVKESPAKSLSEDTLQEGCSASASSDVCGSEE 372

Query: 1200 NYSPIEENPTT-----VEREVGRMDSRL---------------------------GLASA 1283
            + + I+ENP+      + ++V   D+ +                           G AS 
Sbjct: 373  S-NEIQENPSNNLSPGLLKDVCSPDTAVPSDAKEANQSGGTVGNPGNAELENGVSGAASI 431

Query: 1284 TARDVKRENIDGQEILQAQHCSIPEMIVPDMTVGRTQSSLYHDNLGDFKFS------GPK 1445
            + ++   + ID Q+  QA++  + E  V D      +SS  H+N GD  +S      GP 
Sbjct: 432  SGQEESNQKIDRQKDGQAKNDPVTEEAVSDNATASARSSFVHNNHGDLNYSEPVYLSGPI 491

Query: 1446 SSSGHSAFSGNIXXXXXXXXXXXXXFAFPILQAEWNTSPVKMAKA-----RKHRSWRMGL 1610
             SSGH  +SG+I             FAFPILQ EWN+SPVKMAKA     RKHR WR+GL
Sbjct: 492  VSSGHIPYSGSISLRSDSSTTSTRSFAFPILQPEWNSSPVKMAKADHSHLRKHRGWRVGL 551

Query: 1611 ICCKF 1625
            +CC+F
Sbjct: 552  LCCRF 556


>ref|XP_022149065.1| uncharacterized protein LOC111017570 [Momordica charantia]
 ref|XP_022149066.1| uncharacterized protein LOC111017570 [Momordica charantia]
          Length = 475

 Score =  108 bits (270), Expect = 3e-21
 Identities = 111/404 (27%), Positives = 176/404 (43%), Gaps = 43/404 (10%)
 Frame = +3

Query: 543  VLPYSSS---DIFGEETNAYTDNAVAQIKIPDTIIYAKADCYPIVKDICVDEGLYCSDKV 713
            V P+++S   D+F E++  Y + ++ + ++P+ I+  K +   IVKDIC+DEG+   D +
Sbjct: 90   VSPFTNSSKVDLFEEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDML 149

Query: 714  SLENAVSEKKSVFATSMNKDLNVQMTDGSPPDVQDSKFISTVDK---SNGDHTCSWNLPE 884
               N++ EK         KD      D    +++  K  S+      SN D     +L +
Sbjct: 150  LCGNSLDEKAVCAIAPSEKD----WKDELEGELEKRKMFSSEHAESFSNKDSPKQCDLKD 205

Query: 885  TGEKPNTVDHVQSISYF-----PKISSELLV----SPGDFDRDYPHADS------SSDQR 1019
             G  P   +    ++YF     P +S + LV     P    +D  H  S      S+   
Sbjct: 206  LGRIP---EAEYDVAYFTDNDIPNLSMKDLVVESLKPLINHKDESHPQSEQVFIESASLE 262

Query: 1020 QCIAEEKNENTCSTQNALPLGATESDTSIGAKE---NLTDHTSDILVDGYSTAADSVSDG 1190
              ++    E++ ST       +TE   S    E   N      +I  D ++++A   SDG
Sbjct: 263  VPVSVSAVEDSYSTTGEAIAASTEPKNSSSVNEISYNSKVDNGNITFD-FNSSASIASDG 321

Query: 1191 GR---NNYSPIEENPTTVEREVGRMDSRLGLASATARDVKRENIDGQEILQAQHCSIPEM 1361
                 N YS    +  T    V   D        T+     E+ D  ++      S P+ 
Sbjct: 322  MEHHDNGYS--NSSAPTTSASVDCQD--------TSSPDPSESADKSQVQCHHTSSNPKC 371

Query: 1362 I----VPDMTVG----RTQSSLYHDNLGDFKFS--GP----KSSSGHSAFSGNIXXXXXX 1499
            +    +P   VG     + S+     +G+  FS  GP     S+SG   +SG+I      
Sbjct: 372  VEYEDLPKAEVGISXSXSVSTQVQHGIGETSFSSMGPLGSLVSNSGRIGYSGSISLRSDS 431

Query: 1500 XXXXXXXFAFPILQAEWNTSPVKMAKA--RKHRSWRMGLICCKF 1625
                   FAFPI+Q+EWN+SPV+MAKA  RKHR W+ GL+CC+F
Sbjct: 432  STTSTRSFAFPIIQSEWNSSPVRMAKADRRKHRGWKHGLLCCRF 475


Top