BLASTX nr result

ID: Alisma22_contig00007536

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Alisma22_contig00007536
         (1758 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_010922571.1 PREDICTED: uncharacterized protein LOC105045848 i...   167   2e-43
XP_008805185.1 PREDICTED: uncharacterized protein LOC103718240 [...   166   6e-43
XP_010922572.1 PREDICTED: uncharacterized protein LOC105045848 i...   154   6e-39
XP_009420007.1 PREDICTED: neural Wiskott-Aldrich syndrome protei...   140   1e-33
JAT48794.1 hypothetical protein g.39421, partial [Anthurium amni...   140   2e-33
XP_010907408.1 PREDICTED: uncharacterized protein LOC105034083 [...   138   1e-32
XP_009420008.1 PREDICTED: neural Wiskott-Aldrich syndrome protei...   133   2e-31
XP_017700949.1 PREDICTED: uncharacterized protein LOC103718115 [...   131   6e-31
XP_020100425.1 circumsporozoite protein isoform X1 [Ananas comosus]   127   1e-28
XP_010267380.1 PREDICTED: uncharacterized protein LOC104604644 i...   124   1e-27
XP_020100426.1 circumsporozoite protein isoform X2 [Ananas comosus]   123   1e-27
XP_020100428.1 circumsporozoite protein isoform X3 [Ananas comosus]   123   2e-27
XP_010267379.1 PREDICTED: uncharacterized protein LOC104604644 i...   119   6e-26
OMO95080.1 hypothetical protein CCACVL1_05581 [Corchorus capsula...   114   3e-24
OAY59588.1 hypothetical protein MANES_01G043100 [Manihot esculenta]   111   4e-23
XP_017974077.1 PREDICTED: uncharacterized protein LOC18605858 is...   108   4e-22
EOY23701.1 Uncharacterized protein TCM_015509 isoform 1 [Theobro...   108   4e-22
XP_017974076.1 PREDICTED: uncharacterized protein LOC18605858 is...   108   4e-22
EOY23702.1 Uncharacterized protein TCM_015509 isoform 2 [Theobro...   108   4e-22
XP_004307917.1 PREDICTED: uncharacterized protein LOC101313650 [...   101   7e-20

>XP_010922571.1 PREDICTED: uncharacterized protein LOC105045848 isoform X1 [Elaeis
            guineensis]
          Length = 332

 Score =  167 bits (424), Expect = 2e-43
 Identities = 131/335 (39%), Positives = 176/335 (52%), Gaps = 12/335 (3%)
 Frame = +2

Query: 74   MLCPFSTGKSAANWLDRLRSSKGFTLPSPDLSLDQFLLNSTPNP-PTDDASDPPSEPTSS 250
            MLC  ST +S++NWLDRL +SKGF++P+ DL LD FL +S PNP P  ++  PP  P  +
Sbjct: 1    MLCSISTSRSSSNWLDRLHTSKGFSIPA-DLDLDHFL-SSNPNPDPNTNSCFPPLPPPET 58

Query: 251  PFRDSNDRSA--SPPMXXXXXXXXXX---LANQMRAALADLFHMDSPGRCNRPHTFR-DR 412
               D+  R    SPP+             + + M +ALA+LF M   G  + P T R  +
Sbjct: 59   RPSDAWRRQPHPSPPVSAAGNKTAGGKEQIFDLMGSALAELFIM---GDGSAPATLRASK 115

Query: 413  RSARKQGHPRFCVASYPGSAGGSCPSGMPAM--VTPAAMSPSIANNGGVKLKRKRTADGX 586
            +SARKQ +P+ CV S   S GG+  +G PA   VTPA  SPS A N   + K+ RT    
Sbjct: 116  KSARKQPNPKACVPSISASIGGNFLAGAPAACRVTPAT-SPSSAENSVAEAKKSRTK--- 171

Query: 587  XXXXXXXXXXXHATYLLSAAG---ENKREMSSRTDVTVIDTSCSANWKSAKIIFRKGTAW 757
                        A      AG   E+     S+T+VTVIDTS S  WKS K+IFRKG  W
Sbjct: 172  ------------ARRKRGTAGSPVESDLSTYSKTEVTVIDTS-SPGWKSEKLIFRKGIVW 218

Query: 758  KVRDKKKWCLMRKERKFGLAQRTTTLKSSHGSQQVFLAPTTVGNSNTFTRLAEGSKCDPK 937
            KVRDKK W + RK+RK GL +R   L S    +Q  +             + EG     K
Sbjct: 219  KVRDKKLWNVCRKKRKLGLVER---LISEKEKEQPLIDMKVPAGKEHSRSVDEGGAHAEK 275

Query: 938  GDLANETIDDRISFPEIRLQFSKSFRRARSKGHTV 1042
             D +NE+ DD+I  P+ + +FS+S R   +K  +V
Sbjct: 276  RDASNES-DDQIQIPKRKPKFSRSPRVPAAKDSSV 309


>XP_008805185.1 PREDICTED: uncharacterized protein LOC103718240 [Phoenix dactylifera]
          Length = 330

 Score =  166 bits (420), Expect = 6e-43
 Identities = 132/352 (37%), Positives = 180/352 (51%), Gaps = 12/352 (3%)
 Frame = +2

Query: 74   MLCPFSTGKSAANWLDRLRSSKGFTLPSPDLSLDQFLLNSTPNP-PTDDASDPPSEPTSS 250
            MLC  ST +S++NWLDRL +SKGF++P+ DL LD FL +S PNP P  ++  PP  P+ +
Sbjct: 1    MLCSISTSRSSSNWLDRLHTSKGFSIPA-DLDLDHFL-SSNPNPDPNSNSCFPPPPPSET 58

Query: 251  PFRDSNDRSASPPMXXXXXXXXXXLANQ----MRAALADLFHM-DSPGRCNRPHTFR-DR 412
                +  +   PP              Q    M +ALA+LF M D P     P T R  +
Sbjct: 59   RPSCARRKQHPPPPVSASGSKTAGEKEQIFDLMSSALAELFVMGDRPA----PGTLRASK 114

Query: 413  RSARKQGHPRFCVASYPGSAGGSCPSGMPAM--VTPAAMSPSIANNGGVKLKRKRTADGX 586
            +S+RKQ +P+ CV S   S GG+  +G PA   VTPA  SPS A N   + K+ RT    
Sbjct: 115  KSSRKQPNPKACVPSVSASIGGNFLAGAPAACHVTPAT-SPSSAENSVAEAKKSRTK--- 170

Query: 587  XXXXXXXXXXXHATYLLSAAG---ENKREMSSRTDVTVIDTSCSANWKSAKIIFRKGTAW 757
                        A      AG   E+     S+T+VTVIDTS S  WKS K+IFRKG  W
Sbjct: 171  ------------ARRKRGTAGSPVESDLSTYSKTEVTVIDTS-SPGWKSEKLIFRKGIVW 217

Query: 758  KVRDKKKWCLMRKERKFGLAQRTTTLKSSHGSQQVFLAPTTVGNSNTFTRLAEGSKCDPK 937
            KVRDKK W + RK+RK GL +R   L S    +Q  +             + EG     K
Sbjct: 218  KVRDKKLWNVCRKKRKLGLVER---LISEKEKEQPLIDMKVPAGKERSRSVDEGGAHAEK 274

Query: 938  GDLANETIDDRISFPEIRLQFSKSFRRARSKGHTVSHPLSLASESSNSTLPP 1093
             D +NE+ DD+I  P  + +FS+S R   +K  +V + L +++   N +  P
Sbjct: 275  IDASNES-DDQIQIPMRKPKFSRSPRVPAAKDSSV-YCLQVSTSRKNGSACP 324


>XP_010922572.1 PREDICTED: uncharacterized protein LOC105045848 isoform X2 [Elaeis
           guineensis]
          Length = 285

 Score =  154 bits (388), Expect = 6e-39
 Identities = 113/267 (42%), Positives = 147/267 (55%), Gaps = 12/267 (4%)
 Frame = +2

Query: 74  MLCPFSTGKSAANWLDRLRSSKGFTLPSPDLSLDQFLLNSTPNP-PTDDASDPPSEPTSS 250
           MLC  ST +S++NWLDRL +SKGF++P+ DL LD FL +S PNP P  ++  PP  P  +
Sbjct: 1   MLCSISTSRSSSNWLDRLHTSKGFSIPA-DLDLDHFL-SSNPNPDPNTNSCFPPLPPPET 58

Query: 251 PFRDSNDRSA--SPPMXXXXXXXXXX---LANQMRAALADLFHMDSPGRCNRPHTFR-DR 412
              D+  R    SPP+             + + M +ALA+LF M   G  + P T R  +
Sbjct: 59  RPSDAWRRQPHPSPPVSAAGNKTAGGKEQIFDLMGSALAELFIM---GDGSAPATLRASK 115

Query: 413 RSARKQGHPRFCVASYPGSAGGSCPSGMPAM--VTPAAMSPSIANNGGVKLKRKRTADGX 586
           +SARKQ +P+ CV S   S GG+  +G PA   VTPA  SPS A N   + K+ RT    
Sbjct: 116 KSARKQPNPKACVPSISASIGGNFLAGAPAACRVTPAT-SPSSAENSVAEAKKSRTK--- 171

Query: 587 XXXXXXXXXXXHATYLLSAAG---ENKREMSSRTDVTVIDTSCSANWKSAKIIFRKGTAW 757
                       A      AG   E+     S+T+VTVIDTS S  WKS K+IFRKG  W
Sbjct: 172 ------------ARRKRGTAGSPVESDLSTYSKTEVTVIDTS-SPGWKSEKLIFRKGIVW 218

Query: 758 KVRDKKKWCLMRKERKFGLAQRTTTLK 838
           KVRDKK W + RK+RK GL +R  + K
Sbjct: 219 KVRDKKLWNVCRKKRKLGLVERLISEK 245


>XP_009420007.1 PREDICTED: neural Wiskott-Aldrich syndrome protein isoform X1 [Musa
            acuminata subsp. malaccensis]
          Length = 335

 Score =  140 bits (354), Expect = 1e-33
 Identities = 122/351 (34%), Positives = 162/351 (46%), Gaps = 17/351 (4%)
 Frame = +2

Query: 89   STGKSAANWLDRLRSSKGFTLPSPDLSLDQFLL-----NSTPN---------PPTDDASD 226
            S  KS +NWL+RL SS+GF++P+  L LD FL      N +PN         PP +  SD
Sbjct: 10   SNTKSTSNWLERLHSSRGFSVPA-HLHLDHFLSPDSASNPSPNSPPPPPPPPPPEEVLSD 68

Query: 227  PPS-EPTSSPFRDSNDRSASPPMXXXXXXXXXXLANQMRAALADLFHMDSPGRCNRPHTF 403
            PP  EP ++P R        PP           L + +   LA+LF M  P         
Sbjct: 69   PPPPEPLANPRRRKKHLQPPPP-PGASTDGKQRLFDLVGGVLAELFVMGGPPVVR---AL 124

Query: 404  RDRRSARKQGHPRFCVASYPGSAGGSCPSGMPAMVTPAAMSPSIAN--NGGVKLKRKRTA 577
            + ++S+RKQ +P+ CV S   S  G C S +PA   P++   S+A       KL+RKR  
Sbjct: 125  KAKKSSRKQPNPKVCVPSASASIDG-CRS-LPATSPPSSADNSVAEAKKSRSKLRRKRGT 182

Query: 578  DGXXXXXXXXXXXXHATYLLSAAGENKREMSSRTDVTVIDTSCSANWKSAKIIFRKGTAW 757
             G                 LSA         SRTDVTVIDTSC   WKS K+IFRKG  W
Sbjct: 183  AGSPVDLD-----------LSAY--------SRTDVTVIDTSCPG-WKSEKVIFRKGIMW 222

Query: 758  KVRDKKKWCLMRKERKFGLAQRTTTLKSSHGSQQVFLAPTTVGNSNTFTRLAEGSKCDPK 937
            KVRDKK W L RK+RK GL  R   L +    +Q    P    +        EG     K
Sbjct: 223  KVRDKKVWTLSRKKRKMGLVGR---LINEKDKEQPLAEPKVQADEGILASFVEGGDPVDK 279

Query: 938  GDLANETIDDRISFPEIRLQFSKSFRRARSKGHTVSHPLSLASESSNSTLP 1090
             D A+  I D++     R +FS+S  R R+   +   P + +S  +  + P
Sbjct: 280  RD-ASGKIGDQVPISIRRQKFSRS-PRTRTAEDSAFQPNATSSRKNGVSCP 328


>JAT48794.1 hypothetical protein g.39421, partial [Anthurium amnicola]
          Length = 347

 Score =  140 bits (353), Expect = 2e-33
 Identities = 119/333 (35%), Positives = 160/333 (48%), Gaps = 10/333 (3%)
 Frame = +2

Query: 74   MLCPFSTGKSAANWLDRLRSSKGFTLPSPDLSLDQFLLNSTPNPPTDDASDP-PSEPTSS 250
            M CP    +   NWLDRLRSSKGF      L LD FLL+  P+P  D   +P PS PT  
Sbjct: 34   MACPAPAAEPGPNWLDRLRSSKGFPPHHAGLDLDHFLLHH-PDPDPDPEPNPNPSPPTPQ 92

Query: 251  PFRDSNDR--SASPPMXXXXXXXXXXLANQ----MRAALADLFHMDSPGRCNRPHTFRDR 412
                      SA+PP                   M +ALA+LFHM  P     P     +
Sbjct: 93   DHHHHQPSFGSAAPPQHREEAASPAGEGKAWFQLMSSALAELFHMGDPR--GLPALRGGK 150

Query: 413  RSARKQGHPRFCVASYPGSA-GGSCPSGMPAMVTPAAMSPSIANNGGVKLKRKRTADGXX 589
            ++ RKQ +PR CVAS  G   GG+   G+PA   P+A + SIA  G  K   +RT     
Sbjct: 151  KNPRKQPNPRICVASSRGEEQGGAGGGGLPATSPPSAEN-SIA--GAKKWPGRRTK---- 203

Query: 590  XXXXXXXXXXHATYLLSAAGEN-KREMSSRTDVTVIDTSCSANWKSAKIIFRKGTAWKVR 766
                          L + A E+      S T+VTVIDTS S +WKS KIIFRKG  WKVR
Sbjct: 204  --------ARRKRALRTGAAESLDLSAYSCTEVTVIDTS-SPSWKSQKIIFRKGLVWKVR 254

Query: 767  DKKKWCLMRKERKFGLAQRTTTLKSSHGSQQVFLAP-TTVGNSNTFTRLAEGSKCDPKGD 943
            DKK W + RK+R+ G+A+R   L +    +++  A    V          +G   + + D
Sbjct: 255  DKKLWSVCRKKRRLGVAKR---LANEQEQEELLSAERREVLIKEHLASSDDGYVHNDRRD 311

Query: 944  LANETIDDRISFPEIRLQFSKSFRRARSKGHTV 1042
            ++ +T  ++   P  R+QF +S RR  +K  +V
Sbjct: 312  VSKDTSYNQNQIPGKRVQFPRSSRRPIAKDPSV 344


>XP_010907408.1 PREDICTED: uncharacterized protein LOC105034083 [Elaeis guineensis]
          Length = 342

 Score =  138 bits (347), Expect = 1e-32
 Identities = 120/352 (34%), Positives = 162/352 (46%), Gaps = 27/352 (7%)
 Frame = +2

Query: 74   MLCPFSTGKSAANWLDRLRSSKGFTLPSPDLSLDQFLLN------------STPNPPTDD 217
            MLC     +S++NWLDRL +SKG ++P+ DL LDQFL +             +P PP   
Sbjct: 1    MLC----SRSSSNWLDRLHTSKGLSIPA-DLDLDQFLSSIPNPNPNSNPKSCSPRPPEAR 55

Query: 218  ASDPP-SEPTSSPFRDSNDR-SASPPMXXXXXXXXXXLANQ-------MRAALADLFHMD 370
             SD P S+PT      S  R    PP              +       M +ALA+LF M 
Sbjct: 56   PSDAPLSQPTGDKPAASRRRWKQQPPPPEEVAAGNKIFVGEKEQLFDLMSSALAELFIM- 114

Query: 371  SPGRCNRPHTFR-----DRRSARKQGHPRFCVASYPGSAGGSCPSGMPAMV-TPAAMSPS 532
                  R H+        ++SARKQ +P+ CV S   S  GS  +G  A    P A SPS
Sbjct: 115  ------RDHSATGILGPSKKSARKQPNPKACVPSASASIDGSFLAGAAAACHVPPATSPS 168

Query: 533  IANNGGVKLKRKRTADGXXXXXXXXXXXXHATYLLSAAGENKREMSSRTDVTVIDTSCSA 712
             A+N   + K+ RT                      +  E+     S+T+VTVIDTS S 
Sbjct: 169  SADNSVAEAKKSRTK------------ARRKRGTTGSPVESDLSTYSKTEVTVIDTS-SP 215

Query: 713  NWKSAKIIFRKGTAWKVRDKKKWCLMRKERKFGLAQRTTTLKSSHGSQQVFLAPTTVGNS 892
             WKS K+IFRKG  WKVRDKK W + RK+RK GL +R    K           P+   +S
Sbjct: 216  GWKSEKLIFRKGMVWKVRDKKLWNVCRKKRKVGLVERLIGEKEKEQPLIDMKEPSPKEHS 275

Query: 893  NTFTRLAEGSKCDPKGDLANETIDDRISFPEIRLQFSKSFRRARSKGHTVSH 1048
             +   + EG       D + E +DD+I  P+ R +FS+S R   +K  +  H
Sbjct: 276  GS---VDEGGAHAENRDASRE-MDDQIQIPKRRPRFSRSPRVRAAKDSSAFH 323


>XP_009420008.1 PREDICTED: neural Wiskott-Aldrich syndrome protein isoform X2 [Musa
           acuminata subsp. malaccensis]
          Length = 303

 Score =  133 bits (335), Expect = 2e-31
 Identities = 113/312 (36%), Positives = 145/312 (46%), Gaps = 17/312 (5%)
 Frame = +2

Query: 89  STGKSAANWLDRLRSSKGFTLPSPDLSLDQFLL-----NSTPN---------PPTDDASD 226
           S  KS +NWL+RL SS+GF++P+  L LD FL      N +PN         PP +  SD
Sbjct: 10  SNTKSTSNWLERLHSSRGFSVPA-HLHLDHFLSPDSASNPSPNSPPPPPPPPPPEEVLSD 68

Query: 227 PPS-EPTSSPFRDSNDRSASPPMXXXXXXXXXXLANQMRAALADLFHMDSPGRCNRPHTF 403
           PP  EP ++P R        PP           L + +   LA+LF M  P         
Sbjct: 69  PPPPEPLANPRRRKKHLQPPPP-PGASTDGKQRLFDLVGGVLAELFVMGGPPVVR---AL 124

Query: 404 RDRRSARKQGHPRFCVASYPGSAGGSCPSGMPAMVTPAAMSPSIAN--NGGVKLKRKRTA 577
           + ++S+RKQ +P+ CV S   S  G C S +PA   P++   S+A       KL+RKR  
Sbjct: 125 KAKKSSRKQPNPKVCVPSASASIDG-CRS-LPATSPPSSADNSVAEAKKSRSKLRRKRGT 182

Query: 578 DGXXXXXXXXXXXXHATYLLSAAGENKREMSSRTDVTVIDTSCSANWKSAKIIFRKGTAW 757
            G                 LSA         SRTDVTVIDTSC   WKS K+IFRKG  W
Sbjct: 183 AGSPVDLD-----------LSAY--------SRTDVTVIDTSCPG-WKSEKVIFRKGIMW 222

Query: 758 KVRDKKKWCLMRKERKFGLAQRTTTLKSSHGSQQVFLAPTTVGNSNTFTRLAEGSKCDPK 937
           KVRDKK W L RK+RK GL  R   L +    +Q    P    +        EG     K
Sbjct: 223 KVRDKKVWTLSRKKRKMGLVGR---LINEKDKEQPLAEPKVQADEGILASFVEGGDPVDK 279

Query: 938 GDLANETIDDRI 973
            D A+  I D++
Sbjct: 280 RD-ASGKIGDQV 290


>XP_017700949.1 PREDICTED: uncharacterized protein LOC103718115 [Phoenix
           dactylifera] XP_008805010.2 PREDICTED: uncharacterized
           protein LOC103718115 [Phoenix dactylifera]
          Length = 258

 Score =  131 bits (329), Expect = 6e-31
 Identities = 102/276 (36%), Positives = 129/276 (46%), Gaps = 26/276 (9%)
 Frame = +2

Query: 74  MLCPFSTGKSAANWLDRLRSSKGFTLPSPDLSLDQFLLNSTPNPP--------------- 208
           MLC     +S++NWLDRL +SKGF +P+ D  LD FL +S PNP                
Sbjct: 1   MLC----SRSSSNWLDRLHTSKGFCIPAADHDLDHFL-SSIPNPNPNTNPKSCSPPRPET 55

Query: 209 -TDDA--SDPPSEPTSSPFRDSNDRSASPPMXXXXXXXXXXLANQ----MRAALADLFHM 367
            T DA  S PP+E  ++P R    +    P              Q    M +ALA+LF M
Sbjct: 56  WTSDAPLSQPPAEKPAAPRRRRKQQQQQQPQYAAGNKTFAGEKEQLFDLMSSALAELFIM 115

Query: 368 DSPGRCNRPHTFRDRRSARKQGHPRFCVASYPGSAGGSCPSGMPAMV-TPAAMSPSIANN 544
               R         ++SARKQ +P+ CV S   S  GS  +G  A    P   SPS A+N
Sbjct: 116 GD--RSATGILRASKKSARKQANPKACVPSASASIDGSFLAGAAAACHVPPVTSPSSADN 173

Query: 545 GGVKLKRKRTADGXXXXXXXXXXXXHATYLLSAAG---ENKREMSSRTDVTVIDTSCSAN 715
              + K  RT                A +     G   E+     S+TD TVIDTS S  
Sbjct: 174 SVAEAKNSRTK---------------ARWKRGTTGSPVESDLSTYSKTDATVIDTS-SPG 217

Query: 716 WKSAKIIFRKGTAWKVRDKKKWCLMRKERKFGLAQR 823
           WKS K+IFRKG  WKVRDK  W + RK+RK GL +R
Sbjct: 218 WKSEKLIFRKGMVWKVRDKNLWNVCRKKRKLGLVER 253


>XP_020100425.1 circumsporozoite protein isoform X1 [Ananas comosus]
          Length = 334

 Score =  127 bits (318), Expect = 1e-28
 Identities = 110/337 (32%), Positives = 150/337 (44%), Gaps = 18/337 (5%)
 Frame = +2

Query: 74   MLCPFSTGKSAANWLDRLRSSKGFTLPSPDLSLDQFLLNST----PNPPTDDASDPPSEP 241
            M C  S  KS++NWLDRL +SKGF++ S DL LD+FL +S+    PNP  +   +P   P
Sbjct: 1    MQCSLSPPKSSSNWLDRLHASKGFSI-SADLDLDRFLASSSSDPDPNPNPNPNPNPNPNP 59

Query: 242  TSSPFRDSNDRSASPPMXXXXXXXXXXLANQ-----MRAALADLFHMDSPGRCNRPHTFR 406
              +P    N     PP            AN      M + LA+LF M  P       T  
Sbjct: 60   NPNPPSPRNATLPDPPTKRRRRRRPAPAANPPLFDLMSSVLAELFVMAGPSPSQAIGTPG 119

Query: 407  DRR-----SARKQGHPRFCVASYPGSAGGSCPSGMPAMVTPAAMSPSIANNGGVKLKRKR 571
            +RR     S+RKQ +P+ C    P ++  +   G      P++   S+A      LK+KR
Sbjct: 120  ERRKKKKKSSRKQANPKACP---PSASASAAADGAACGGAPSSADNSVAEEATKGLKKKR 176

Query: 572  TADGXXXXXXXXXXXXHATYLLSAAGENKREMSS---RTDVTVIDTSCSANWKSAKIIFR 742
             A                     A G +K    +   RTDVTVIDTS S  WKS K+I+R
Sbjct: 177  AA---------------------AEGPSKDSDLAGYRRTDVTVIDTS-SPGWKSVKLIYR 214

Query: 743  KGTAWKVRDKKKW-CLMRKERKFGLAQRTTTLKSSHGSQQVFLAPTTVGNSNTFTRLAEG 919
            KG  WKVR KK W    +K+R  GL       +S  GS+ + L       S +  +L + 
Sbjct: 215  KGKEWKVRVKKHWNACQKKKRTVGLVGEKGKEQSKLGSKVLDLKEF----SASLDQLRDQ 270

Query: 920  SKCDPKGDLANETIDDRISFPEIRLQFSKSFRRARSK 1030
                 K     +  DDR   P  R +FS+S R +  K
Sbjct: 271  ENVRAKDGDTLKVSDDRTRIPVKRPKFSRSPRLSAVK 307


>XP_010267380.1 PREDICTED: uncharacterized protein LOC104604644 isoform X2 [Nelumbo
            nucifera]
          Length = 358

 Score =  124 bits (312), Expect = 1e-27
 Identities = 118/372 (31%), Positives = 159/372 (42%), Gaps = 28/372 (7%)
 Frame = +2

Query: 74   MLCPFSTGKSAANWLDRLRSSKGFTLPSPDLSLDQFLLNSTPNPPTDDA----------- 220
            MLC  S+GKSA NWLDRLRSSKGF + +  L L+ FL N  PN  T  +           
Sbjct: 1    MLCSISSGKSAPNWLDRLRSSKGFPV-ADGLDLEHFL-NPNPNQTTLSSETNASYATQEI 58

Query: 221  --SDPPSEPTS---SPFRDSNDRSASPPMXXXXXXXXXXLANQMRAALADLFHMDSPGRC 385
              S P  E TS    P  D     A P                M   LA+LF+M   G  
Sbjct: 59   GYSKPHPESTSLDEKPVADRKKSMAGP--GDRKNQGKEDWFGIMGNVLAELFNMGDSGEF 116

Query: 386  NRPHTFRDRRSARKQGHPRFCVASYPGSAGGSCPSGMPAMVTPAAMSPSIANNGGVKLKR 565
             +   F ++RS RKQ +P+ CV S   S   S  +  P + +  +MSP   +N   ++K 
Sbjct: 117  QKIRGFDEKRSCRKQPNPKICVFSASASVNDSFLAAAPRLESVPSMSPPSGDNSVTEMKE 176

Query: 566  KRTADGXXXXXXXXXXXXHATYLLSAAGENKREMS----SRTDVTVIDTSCSANWKSAKI 733
               +                  +  A  E+K +      SR +VT+IDTSC   WKS K+
Sbjct: 177  TVNS---------LKPKKQGKVVSIAHDEDKLQTDLSTYSRVEVTIIDTSCPV-WKSEKL 226

Query: 734  IFRKGTAWKVRDKKKW-----CLMRKERKFGLAQRTTTLKSSHGSQQVFLAPTT--VGNS 892
            +FRKG+ WKVRD KKW        RK+RK   + +        G   + L   T   G  
Sbjct: 227  LFRKGSVWKVRD-KKWKSRNASSFRKKRKANHSDKEAGGGKKKGKFFLPLVNITREAGPE 285

Query: 893  NTFTRLAEGSKCDPKGDLANETIDDRISFPEIRLQFSKSFRRARSKGHTVSHPLSL-ASE 1069
                 L EG   D K     E+ D+ I   + R  FS+S R+   +   V H  ++  S 
Sbjct: 286  ENKVPLDEGPPQDEKKAPCKESADNAIVVAK-RRSFSRSPRKPAHRDSPVFHVQAVPTSR 344

Query: 1070 SSNSTLPPPELQ 1105
             S   LP   L+
Sbjct: 345  KSGVHLPRSRLK 356


>XP_020100426.1 circumsporozoite protein isoform X2 [Ananas comosus]
          Length = 319

 Score =  123 bits (309), Expect = 1e-27
 Identities = 109/336 (32%), Positives = 146/336 (43%), Gaps = 17/336 (5%)
 Frame = +2

Query: 74   MLCPFSTGKSAANWLDRLRSSKGFTLPSPDLSLDQFLLNST----PNPPTDDASDPPSEP 241
            M C  S  KS++NWLDRL +SKGF++ S DL LD+FL +S+    PNP  +   +P   P
Sbjct: 1    MQCSLSPPKSSSNWLDRLHASKGFSI-SADLDLDRFLASSSSDPDPNPNPNPNPNPNPNP 59

Query: 242  TSSPFRDSNDRSASPPMXXXXXXXXXXLANQ-----MRAALADLFHMDSPGRCNRPHTFR 406
              +P    N     PP            AN      M + LA+LF M  P       T  
Sbjct: 60   NPNPPSPRNATLPDPPTKRRRRRRPAPAANPPLFDLMSSVLAELFVMAGPSPSQAIGTPG 119

Query: 407  DRR-----SARKQGHPRFCVASYPGSAGGSCPSGMPAMVTPAAMSPSIANNGGVKLKRKR 571
            +RR     S+RKQ +P+ C    P ++  +   G      P++   S+A      LK+KR
Sbjct: 120  ERRKKKKKSSRKQANPKACP---PSASASAAADGAACGGAPSSADNSVAEEATKGLKKKR 176

Query: 572  TADGXXXXXXXXXXXXHATYLLSAAGENKREMSS---RTDVTVIDTSCSANWKSAKIIFR 742
             A                     A G +K    +   RTDVTVIDTS S  WKS K+I+R
Sbjct: 177  AA---------------------AEGPSKDSDLAGYRRTDVTVIDTS-SPGWKSVKLIYR 214

Query: 743  KGTAWKVRDKKKWCLMRKERKFGLAQRTTTLKSSHGSQQVFLAPTTVGNSNTFTRLAEGS 922
            KG  WKVR KK W   +K++      RT  L    G +Q  L      N     R  +G 
Sbjct: 215  KGKEWKVRVKKHWNACQKKK------RTVGLVGEKGKEQSKLGSKDQEN----VRAKDGD 264

Query: 923  KCDPKGDLANETIDDRISFPEIRLQFSKSFRRARSK 1030
                      +  DDR   P  R +FS+S R +  K
Sbjct: 265  TL--------KVSDDRTRIPVKRPKFSRSPRLSAVK 292


>XP_020100428.1 circumsporozoite protein isoform X3 [Ananas comosus]
          Length = 317

 Score =  123 bits (308), Expect = 2e-27
 Identities = 107/336 (31%), Positives = 145/336 (43%), Gaps = 17/336 (5%)
 Frame = +2

Query: 74   MLCPFSTGKSAANWLDRLRSSKGFTLPSPDLSLDQFLLNST----PNPPTDDASDPPSEP 241
            M C  S  KS++NWLDRL +SKGF++ S DL LD+FL +S+    PNP  +   +P   P
Sbjct: 1    MQCSLSPPKSSSNWLDRLHASKGFSI-SADLDLDRFLASSSSDPDPNPNPNPNPNPNPNP 59

Query: 242  TSSPFRDSNDRSASPPMXXXXXXXXXXLANQ-----MRAALADLFHMDSPGRCNRPHTFR 406
              +P    N     PP            AN      M + LA+LF M  P       T  
Sbjct: 60   NPNPPSPRNATLPDPPTKRRRRRRPAPAANPPLFDLMSSVLAELFVMAGPSPSQAIGTPG 119

Query: 407  DRR-----SARKQGHPRFCVASYPGSAGGSCPSGMPAMVTPAAMSPSIANNGGVKLKRKR 571
            +RR     S+RKQ +P+ C    P ++  +   G      P++   S+A      LK+KR
Sbjct: 120  ERRKKKKKSSRKQANPKACP---PSASASAAADGAACGGAPSSADNSVAEEATKGLKKKR 176

Query: 572  TADGXXXXXXXXXXXXHATYLLSAAGENKREMSS---RTDVTVIDTSCSANWKSAKIIFR 742
             A                     A G +K    +   RTDVTVIDTS S  WKS K+I+R
Sbjct: 177  AA---------------------AEGPSKDSDLAGYRRTDVTVIDTS-SPGWKSVKLIYR 214

Query: 743  KGTAWKVRDKKKWCLMRKERKFGLAQRTTTLKSSHGSQQVFLAPTTVGNSNTFTRLAEGS 922
            KG  WKVR KK W   +K++      RT  L    G +Q              ++L    
Sbjct: 215  KGKEWKVRVKKHWNACQKKK------RTVGLVGEKGKEQ--------------SKLGSKE 254

Query: 923  KCDPKGDLANETIDDRISFPEIRLQFSKSFRRARSK 1030
                K     +  DDR   P  R +FS+S R +  K
Sbjct: 255  NVRAKDGDTLKVSDDRTRIPVKRPKFSRSPRLSAVK 290


>XP_010267379.1 PREDICTED: uncharacterized protein LOC104604644 isoform X1 [Nelumbo
            nucifera]
          Length = 364

 Score =  119 bits (299), Expect = 6e-26
 Identities = 116/380 (30%), Positives = 161/380 (42%), Gaps = 36/380 (9%)
 Frame = +2

Query: 74   MLCPFSTGKSAANWLDRLRSSKGFTLPSPDLSLDQFLLNSTPNPPTDDA----------- 220
            MLC  S+GKSA NWLDRLRSSKGF + +  L L+ FL N  PN  T  +           
Sbjct: 1    MLCSISSGKSAPNWLDRLRSSKGFPV-ADGLDLEHFL-NPNPNQTTLSSETNASYATQEI 58

Query: 221  --SDPPSEPTS---SPFRDSNDRSASPPMXXXXXXXXXXLANQMRAALADLFHMDSPGRC 385
              S P  E TS    P  D     A P                M   LA+LF+M   G  
Sbjct: 59   GYSKPHPESTSLDEKPVADRKKSMAGP--GDRKNQGKEDWFGIMGNVLAELFNMGDSGEF 116

Query: 386  NRPHTFRDRRSARKQGHPRFCVASYPGSAGGSCPSGMPAMVTPAAMSPSIANNGGVKLKR 565
             +   F ++RS RKQ +P+ CV S   S   S  +  P + +  +MSP   +N   ++K 
Sbjct: 117  QKIRGFDEKRSCRKQPNPKICVFSASASVNDSFLAAAPRLESVPSMSPPSGDNSVTEMKE 176

Query: 566  KRTADGXXXXXXXXXXXXHATYLLSAAGENKREMS----SRTDVTVIDTSCSANWKSAKI 733
               +                  +  A  E+K +      SR +VT+IDTSC   WKS K+
Sbjct: 177  TVNS---------LKPKKQGKVVSIAHDEDKLQTDLSTYSRVEVTIIDTSCPV-WKSEKL 226

Query: 734  IFRKGTAWKVRDKKKW-----CLMRKERKFGLAQRTTTLKSSHGSQQVFLAPTTVGNS-- 892
            +FRKG+ WKVRD KKW        RK+RK   + +        G  + FL    +     
Sbjct: 227  LFRKGSVWKVRD-KKWKSRNASSFRKKRKANHSDKEAGGGKKKG--KFFLPLVNITREAG 283

Query: 893  --------NTFTRLAEGSKCDPKGDLANETIDDRISFPEIRLQFSKSFRRARSKGHTVSH 1048
                    +  + L +G   D K     E+ D+ I   + R  FS+S R+   +   V H
Sbjct: 284  PEENKVPLDELSYLEQGPPQDEKKAPCKESADNAIVVAK-RRSFSRSPRKPAHRDSPVFH 342

Query: 1049 PLSL-ASESSNSTLPPPELQ 1105
              ++  S  S   LP   L+
Sbjct: 343  VQAVPTSRKSGVHLPRSRLK 362


>OMO95080.1 hypothetical protein CCACVL1_05581 [Corchorus capsularis]
          Length = 354

 Score =  114 bits (286), Expect = 3e-24
 Identities = 103/334 (30%), Positives = 144/334 (43%), Gaps = 27/334 (8%)
 Frame = +2

Query: 74  MLCPFSTGKSAANWLDRLRSSKGFTLPSPD-LSLDQFLLNSTP--NPPTDDASDPPSEPT 244
           MLC   TGKS +NWLDRLRSSKGF  P+ D L LD FL NS P  +P T+ ++ P S   
Sbjct: 1   MLCSIPTGKSGSNWLDRLRSSKGF--PTGDNLDLDHFLTNSNPSDSPLTNASNSPNSNAE 58

Query: 245 SSPFRDSNDRSASPP---MXXXXXXXXXXLANQMRAALADLFHMDSPGRCNRPHTFRDRR 415
           S+   D   ++  PP   +              M   L++LF+M    + +R   F  ++
Sbjct: 59  STHSNDKQLQNPEPPPPEVISGEPAGDKEWFGIMSNVLSELFNMGDGAQSSR---FSKKK 115

Query: 416 SARKQGHPRFCVASYPGSAGGSCPSGMPAMVTPAAMSPSIAN--NGGVKLKRKRTADGXX 589
           ++RKQ +PR C+   P +            V      P+     N   + KR+   +G  
Sbjct: 116 TSRKQTNPRICIIKTPTANSSEEQRSSSGSVRRDKNVPASTTSLNSSQEAKRESKEEGDN 175

Query: 590 XXXXXXXXXXHATYLLSAAGENKREMSSRTDVTVIDTSCSANWKSAKIIFRKGTAWKVRD 769
                              GE +    SR++VTVIDTSC   WK+ K+IFR+   WKV+D
Sbjct: 176 SNVAEDEDEEEG----KEKGEKELLGFSRSEVTVIDTSCQV-WKADKLIFRRKNIWKVKD 230

Query: 770 KKKWCLMRKERKFGLAQRTTTLKSSHGSQQVFLAPTTVGNSNTFTRLAE--GSKC----- 928
           KK      K R FG  +R     +S  +   F       +S+    L E  G +C     
Sbjct: 231 KK-----GKSRSFGRKKRKVPPPTSDDNNGGFCNKKQKISSSELRSLTEPRGRECGSPMN 285

Query: 929 -------DPKGDLANETIDD-----RISFPEIRL 994
                  D +    NET +D     R  FP  RL
Sbjct: 286 HGQKAPGDKEEQACNETAEDLTQVLRKRFPVSRL 319


>OAY59588.1 hypothetical protein MANES_01G043100 [Manihot esculenta]
          Length = 349

 Score =  111 bits (277), Expect = 4e-23
 Identities = 102/327 (31%), Positives = 142/327 (43%), Gaps = 21/327 (6%)
 Frame = +2

Query: 74  MLCPFSTGKSAANWLDRLRSSKGFTLPSPDLSLDQFLLN------STPNPPTDDASDPPS 235
           MLC F TGKS + WLDRLRS+KGF   + D+ LD FL N       +P P   + S+  S
Sbjct: 1   MLCSFPTGKSGSKWLDRLRSNKGFP-AADDVDLDHFLTNHQNSFSDSPLPNPSNTSNSNS 59

Query: 236 EPTSS-PFRDSNDRSASPPMXXXXXXXXXXLANQMRAALADLFHMDSPGRCNRPHTFRDR 412
           E + S   R ++DRS +              A  M   L DLF+M      ++   F  +
Sbjct: 60  ESSQSHSKRVNSDRSHAAETSSESGDKEWLGA--MTNVLCDLFNMGE--LTDKNSRFSGK 115

Query: 413 RSARKQGHPRFCVASYPGSAGGSCPSGMPAMVTPAAMSPSIANNGGVKLKRKRTADGXXX 592
           +SARKQ +P+FC  S P SA      G    V  A +S    NN  +         G   
Sbjct: 116 KSARKQANPKFCDVSTPTSANDIDSIGKDESVQAATVSLHSDNNSNIGANANWDDHGEEE 175

Query: 593 XXXXXXXXXHATYLLSAAGENKREMS--SRTDVTVIDTSCSANWKSAKIIFRKGTAWKVR 766
                             G + RE+   SR++VTVIDTS    WK  K++FR+   WKVR
Sbjct: 176 KEKTSG---------GGGGGSDRELKGYSRSEVTVIDTSFEV-WKFDKLVFRRKNIWKVR 225

Query: 767 DK--KKWCLMRKERK----------FGLAQRTTTLKSSHGSQQVFLAPTTVGNSNTFTRL 910
           DK  K W +  K+RK           G  ++  T K+  G  +       V  SN   +L
Sbjct: 226 DKKGKSWTVGTKKRKGNHLESGNGDVGSKKKVKTSKTEFGLSKDSNGGDFVSPSNDDGKL 285

Query: 911 AEGSKCDPKGDLANETIDDRISFPEIR 991
               K     ++  ++ DD+   P+ R
Sbjct: 286 QGEEK-----EVCKDSPDDQFQVPKRR 307


>XP_017974077.1 PREDICTED: uncharacterized protein LOC18605858 isoform X2
           [Theobroma cacao]
          Length = 353

 Score =  108 bits (270), Expect = 4e-22
 Identities = 89/256 (34%), Positives = 127/256 (49%), Gaps = 12/256 (4%)
 Frame = +2

Query: 74  MLCPFSTGKSAANWLDRLRSSKGFTLPSPD-LSLDQFLLNSTP-NPPTDDASDPP---SE 238
           MLC  STGKS +NWLDRLRSSKGF  P+ D L LD FL N  P + P  DAS+ P   SE
Sbjct: 1   MLCSISTGKSGSNWLDRLRSSKGF--PTGDNLDLDHFLTNPNPSDSPITDASNSPNSNSE 58

Query: 239 PTSSPFRDSNDRSASPP-MXXXXXXXXXXLANQMRAALADLFHMDSPGRCNRPHTFRDRR 415
            T S  ++  +R A PP +              M   L++LF+M    + +R   F  ++
Sbjct: 59  STHSNDKELQNRKAPPPEVVSSEPAGDKEWFGIMSNVLSELFNMGDQAQTSR---FSRKK 115

Query: 416 SARKQGHPRFCV--ASYPGSAGGSCPSGMPAMVTPAAMSPSIANNGGVKLKRKRTADGXX 589
           ++RKQ +P+ C+   S   ++     S           + + + N   + KR+   +G  
Sbjct: 116 TSRKQTNPKICIIKTSNVNTSEEQKSSSDSVRKDENIPASTTSLNSKEEAKREWKEEGDD 175

Query: 590 XXXXXXXXXXHATYLLSAAGENKREM--SSRTDVTVIDTSCSANWKSAKIIFRKGTAWKV 763
                              G+ +RE+   SR++VTVIDTSC   WK  K+IFR+   WKV
Sbjct: 176 YNVEEEEQE-------EEKGKGERELLGYSRSEVTVIDTSCEV-WKVDKLIFRRKNIWKV 227

Query: 764 RDK--KKWCLMRKERK 805
           +DK  K   + RK+RK
Sbjct: 228 KDKKGKSRIVGRKKRK 243


>EOY23701.1 Uncharacterized protein TCM_015509 isoform 1 [Theobroma cacao]
          Length = 353

 Score =  108 bits (270), Expect = 4e-22
 Identities = 88/254 (34%), Positives = 125/254 (49%), Gaps = 10/254 (3%)
 Frame = +2

Query: 74  MLCPFSTGKSAANWLDRLRSSKGFTLPSPD-LSLDQFLLNSTP-NPPTDDASDPP---SE 238
           MLC  STGKS +NWLDRLRSSKGF  P+ D L LD FL N  P + P  DAS+ P   SE
Sbjct: 1   MLCSISTGKSGSNWLDRLRSSKGF--PTGDNLDLDHFLTNPNPSDSPITDASNSPNSNSE 58

Query: 239 PTSSPFRDSNDRSASPP-MXXXXXXXXXXLANQMRAALADLFHMDSPGRCNRPHTFRDRR 415
            T S  ++  +R A PP +              M   L++LF+M    + +R   F  ++
Sbjct: 59  STHSNDKELQNRKAPPPEVVSSEPAGDKEWFGIMSNVLSELFNMGDQAQTSR---FSRKK 115

Query: 416 SARKQGHPRFCV--ASYPGSAGGSCPSGMPAMVTPAAMSPSIANNGGVKLKRKRTADGXX 589
           ++RKQ +P+ C+   S   ++     S           + + + N   + KR+   +G  
Sbjct: 116 TSRKQTNPKICIIKTSNVNTSEEQKSSSDSVRKDENIPASTTSLNSKEEAKREWKEEGDD 175

Query: 590 XXXXXXXXXXHATYLLSAAGENKREMSSRTDVTVIDTSCSANWKSAKIIFRKGTAWKVRD 769
                           +  GE +    SR++VTVIDTSC   WK  K+IFR+   WKV+D
Sbjct: 176 YNVEEEEQEEE-----NGKGERELLGYSRSEVTVIDTSCEV-WKVDKLIFRRKNIWKVKD 229

Query: 770 K--KKWCLMRKERK 805
           K  K   + RK+RK
Sbjct: 230 KKGKSRIVGRKKRK 243


>XP_017974076.1 PREDICTED: uncharacterized protein LOC18605858 isoform X1
           [Theobroma cacao]
          Length = 355

 Score =  108 bits (270), Expect = 4e-22
 Identities = 89/256 (34%), Positives = 127/256 (49%), Gaps = 12/256 (4%)
 Frame = +2

Query: 74  MLCPFSTGKSAANWLDRLRSSKGFTLPSPD-LSLDQFLLNSTP-NPPTDDASDPP---SE 238
           MLC  STGKS +NWLDRLRSSKGF  P+ D L LD FL N  P + P  DAS+ P   SE
Sbjct: 1   MLCSISTGKSGSNWLDRLRSSKGF--PTGDNLDLDHFLTNPNPSDSPITDASNSPNSNSE 58

Query: 239 PTSSPFRDSNDRSASPP-MXXXXXXXXXXLANQMRAALADLFHMDSPGRCNRPHTFRDRR 415
            T S  ++  +R A PP +              M   L++LF+M    + +R   F  ++
Sbjct: 59  STHSNDKELQNRKAPPPEVVSSEPAGDKEWFGIMSNVLSELFNMGDQAQTSR---FSRKK 115

Query: 416 SARKQGHPRFCV--ASYPGSAGGSCPSGMPAMVTPAAMSPSIANNGGVKLKRKRTADGXX 589
           ++RKQ +P+ C+   S   ++     S           + + + N   + KR+   +G  
Sbjct: 116 TSRKQTNPKICIIKTSNVNTSEEQKSSSDSVRKDENIPASTTSLNSKEEAKREWKEEGDD 175

Query: 590 XXXXXXXXXXHATYLLSAAGENKREM--SSRTDVTVIDTSCSANWKSAKIIFRKGTAWKV 763
                              G+ +RE+   SR++VTVIDTSC   WK  K+IFR+   WKV
Sbjct: 176 YNVEEEEQE-------EEKGKGERELLGYSRSEVTVIDTSCEV-WKVDKLIFRRKNIWKV 227

Query: 764 RDK--KKWCLMRKERK 805
           +DK  K   + RK+RK
Sbjct: 228 KDKKGKSRIVGRKKRK 243


>EOY23702.1 Uncharacterized protein TCM_015509 isoform 2 [Theobroma cacao]
          Length = 355

 Score =  108 bits (270), Expect = 4e-22
 Identities = 88/254 (34%), Positives = 125/254 (49%), Gaps = 10/254 (3%)
 Frame = +2

Query: 74  MLCPFSTGKSAANWLDRLRSSKGFTLPSPD-LSLDQFLLNSTP-NPPTDDASDPP---SE 238
           MLC  STGKS +NWLDRLRSSKGF  P+ D L LD FL N  P + P  DAS+ P   SE
Sbjct: 1   MLCSISTGKSGSNWLDRLRSSKGF--PTGDNLDLDHFLTNPNPSDSPITDASNSPNSNSE 58

Query: 239 PTSSPFRDSNDRSASPP-MXXXXXXXXXXLANQMRAALADLFHMDSPGRCNRPHTFRDRR 415
            T S  ++  +R A PP +              M   L++LF+M    + +R   F  ++
Sbjct: 59  STHSNDKELQNRKAPPPEVVSSEPAGDKEWFGIMSNVLSELFNMGDQAQTSR---FSRKK 115

Query: 416 SARKQGHPRFCV--ASYPGSAGGSCPSGMPAMVTPAAMSPSIANNGGVKLKRKRTADGXX 589
           ++RKQ +P+ C+   S   ++     S           + + + N   + KR+   +G  
Sbjct: 116 TSRKQTNPKICIIKTSNVNTSEEQKSSSDSVRKDENIPASTTSLNSKEEAKREWKEEGDD 175

Query: 590 XXXXXXXXXXHATYLLSAAGENKREMSSRTDVTVIDTSCSANWKSAKIIFRKGTAWKVRD 769
                           +  GE +    SR++VTVIDTSC   WK  K+IFR+   WKV+D
Sbjct: 176 YNVEEEEQEEE-----NGKGERELLGYSRSEVTVIDTSCEV-WKVDKLIFRRKNIWKVKD 229

Query: 770 K--KKWCLMRKERK 805
           K  K   + RK+RK
Sbjct: 230 KKGKSRIVGRKKRK 243


>XP_004307917.1 PREDICTED: uncharacterized protein LOC101313650 [Fragaria vesca
           subsp. vesca]
          Length = 323

 Score =  101 bits (251), Expect = 7e-20
 Identities = 81/264 (30%), Positives = 116/264 (43%), Gaps = 1/264 (0%)
 Frame = +2

Query: 74  MLCPFSTGKSAANWLDRLRSSKGFTLPSPD-LSLDQFLLNSTPNPPTDDASDPPSEPTSS 250
           MLC     KS  NWLDRLRS+KGF  P+ D L LD FL ++    PT  +  P     S+
Sbjct: 1   MLCSVRATKSGPNWLDRLRSNKGF--PACDNLDLDHFLKHN----PTSSSESPNPNADST 54

Query: 251 PFRDSNDRSASPPMXXXXXXXXXXLANQMRAALADLFHMDSPGRCNRPHTFRDRRSARKQ 430
           P   +   S+ P            L   M  A+++LF +D     +R      ++  RKQ
Sbjct: 55  PLVSNRPESSGP---TRDAKKGEALLGLMSTAISELFFIDGSEESSR---LSGKKVPRKQ 108

Query: 431 GHPRFCVASYPGSAGGSCPSGMPAMVTPAAMSPSIANNGGVKLKRKRTADGXXXXXXXXX 610
            HPR CV S   S+G      +   V      PS+ +   V+L+ +              
Sbjct: 109 THPRLCVTSKLKSSG-----SIGNDVNDLRTVPSLNSKNEVELEER-------------- 149

Query: 611 XXXHATYLLSAAGENKREMSSRTDVTVIDTSCSANWKSAKIIFRKGTAWKVRDKKKWCLM 790
                       GE + +  S+++VTVIDTSC   WK+ K++FR+ + WKVR+KK     
Sbjct: 150 ------------GERELKGYSKSEVTVIDTSCEV-WKTEKLVFRRKSVWKVREKKS---- 192

Query: 791 RKERKFGLAQRTTTLKSSHGSQQV 862
            K R FG  +R        G   +
Sbjct: 193 -KVRSFGRNKRKVVSGDEEGDDGI 215

