BLASTX nr result

ID: Lithospermum23_contig00006814 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum23_contig00006814
         (2737 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_002265987.2 PREDICTED: uncharacterized protein LOC100241871 [...   369   e-112
XP_011087795.1 PREDICTED: uncharacterized protein LOC105169167 [...   366   e-112
CDO97516.1 unnamed protein product [Coffea canephora]                 347   e-105
XP_011092382.1 PREDICTED: uncharacterized protein LOC105172576 [...   342   e-102
KVI01779.1 hypothetical protein Ccrd_019964, partial [Cynara car...   318   2e-94
XP_017259125.1 PREDICTED: uncharacterized protein LOC108228142 [...   316   4e-93
KVI10980.1 hypothetical protein Ccrd_010614 [Cynara cardunculus ...   314   3e-92
KZM90816.1 hypothetical protein DCAR_021819 [Daucus carota subsp...   313   4e-92
KVH93465.1 hypothetical protein Ccrd_004483 [Cynara cardunculus ...   310   9e-91
XP_010245092.1 PREDICTED: uncharacterized protein LOC104588732 i...   310   1e-90
XP_010277689.1 PREDICTED: uncharacterized protein YMR317W-like [...   310   1e-90
XP_010245093.1 PREDICTED: uncharacterized protein LOC104588732 i...   303   3e-88
KZV20788.1 hypothetical protein F511_30430 [Dorcoceras hygrometr...   301   1e-87
XP_012828376.1 PREDICTED: uncharacterized protein LOC105949617 i...   294   3e-85
GAV67368.1 hypothetical protein CFOL_v3_10874 [Cephalotus follic...   288   2e-82
XP_017258903.1 PREDICTED: uncharacterized protein LOC108227982 [...   282   8e-81
XP_012828377.1 PREDICTED: uncharacterized protein LOC105949617 i...   280   1e-80
EOX97398.1 Uncharacterized protein TCM_006425 isoform 1 [Theobro...   276   5e-78
EOX97399.1 Uncharacterized protein TCM_006425 isoform 2 [Theobro...   275   1e-77
KZN09159.1 hypothetical protein DCAR_001815 [Daucus carota subsp...   273   2e-77
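
Note on reading the summary table above: each line ends with two whitespace-separated columns, the bit score and the E value, and this legacy BLAST version prints E values with an implied mantissa of 1 (e-112 means 1e-112). A minimal Python sketch for turning one such line into typed fields follows. The function names parse_evalue and parse_hit_line are hypothetical helpers introduced here for illustration and are not part of BLAST or of any library. The sketch assumes the line layout shown in this report.

```python
def parse_evalue(text):
    """Convert a BLAST E-value string to float.

    Legacy BLAST prints 1e-112 as 'e-112', so a leading 'e' gets the
    missing mantissa '1' prepended before conversion.
    """
    return float("1" + text if text.startswith("e") else text)


def parse_hit_line(line):
    """Split one summary line into (accession, description, bit_score, e_value).

    Assumes the last two whitespace-separated fields are the bit score and
    the E value, and the first token is the accession (hypothetical helper).
    """
    head, score, evalue = line.rstrip().rsplit(None, 2)
    accession, _, description = head.partition(" ")
    return accession, description.strip(), float(score), parse_evalue(evalue)


if __name__ == "__main__":
    example = ("KVI01779.1 hypothetical protein Ccrd_019964, partial "
               "[Cynara car...   318   2e-94")
    print(parse_hit_line(example))
```

Descriptions in the table are truncated with "..." by BLAST itself; the full titles appear on the ">" lines of the alignment section below.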

>XP_002265987.2 PREDICTED: uncharacterized protein LOC100241871 [Vitis vinifera]
          Length = 665

 Score =  369 bits (946), Expect = e-112
 Identities = 253/663 (38%), Positives = 350/663 (52%), Gaps = 58/663 (8%)
 Frame = -1

Query: 2206 MDKSEPALVPQWLRSSESVSGNGISINPIASSSLRPDDHLKPKVARKNSLSNNDQNIKRS 2027
            MDK+EPALVP+WL+SS SV+G G + +  A S L+ DD    K ARK  +++ND +  RS
Sbjct: 1    MDKTEPALVPEWLKSSGSVTGGGSTNHHFAPSLLQSDDGAALKPARKLMVNSNDHDTGRS 60

Query: 2026 SVSDRITSSYFXXXXXXXXXXXXXXXXXXXXXXRDRDWDGDICESGSKDRSILGDFRHLD 1847
            S  +R TSSYF                      R+R+W+ DI +   KD+S+L D RH D
Sbjct: 61   SNLERTTSSYFRRSSSSNGSGHPRSFSSFGRTNREREWEKDIHDYRDKDKSVLSDHRHRD 120

Query: 1846 YSDPLGNSILSRVDKKILRRSQSMVSDQRGEAWPRKVEIGSKGAYKARSTNADRMPANGL 1667
            YSDPLGN +  R+++ +LRRSQSM++ +RG+ WPRKV        K   +N D   A+G+
Sbjct: 121  YSDPLGNILPGRLERDMLRRSQSMITGKRGDMWPRKVAADVSTVNKTIHSNGDGQLASGI 180

Query: 1666 ----LHKASFERDFPSLGVDDKSSATEVKRVPSPGLSPAIHSFPVVASAMIGSDKWTSAL 1499
                + KA+F+R+FPSLG +DK  A ++ RV SPGL+ AI S P+  + +IG D WTSAL
Sbjct: 181  VTSSVQKAAFDRNFPSLGAEDKQGAPDIGRVTSPGLTSAIQSLPIGNTVVIGGDGWTSAL 240

Query: 1498 AEVPGVVRSSEGGSLSEKXXXXXXXXXXXSALT-GPSMAETVAQGPPHAK--TTPQLSTE 1328
            AEVP ++ S+  G  S +            + T G +MAET+ QGP  A+   TPQLS  
Sbjct: 241  AEVPVIIGSNTTGVSSVQQSVSASSVSVAPSTTSGLNMAETLVQGPARARANATPQLSVG 300

Query: 1327 TQR-QELAIKQSRQLIPVTPSLPKALVPNLPEKQKGKTGLS---LLVHSPRGGSVKSDPS 1160
            TQR +ELA+KQSRQLIP+TPS+PK LVP+  +K K K GL    L+ HS RGG  +SD +
Sbjct: 301  TQRLEELALKQSRQLIPMTPSMPKTLVPSPSDKPKSKIGLQPLHLVNHSQRGGPARSDVT 360

Query: 1159 KTSSIGKLQVLKPVRERNGGPLTAKDXXXXXXXXXXXSAVPVGRQSVAASSSIRAPSINS 980
            KTS++GKL VLKP RERNG   TAKD           ++      S A S+S+R+P  N 
Sbjct: 361  KTSNVGKLHVLKPSRERNGVSPTAKDSLSPTMGSRVANSPLAVTPSAAGSASLRSPRNNP 420

Query: 979  GHPAAERKHGLPLLEKXXXXXXXXXXXSDFFSLMRKKXXXXXXXXXXXXXXXXXSDKFSE 800
               +AER+  + L              +DFF+LMRKK                 S    +
Sbjct: 421  TLASAERRPSVVLTSVEKRPTSQAQSRNDFFNLMRKKSSTNPPSAVPESGPAVSSSVSEK 480

Query: 799  SDNK-TFLFT---HQRGEDMVLNDINGID-SSENR---------------------NVET 698
            SD   T + T     +G D++ +D +G+D S+ENR                      ++ 
Sbjct: 481  SDELITEVVTAPVTPKGRDILSSDNSGLDWSNENRGDKTENGNNEACGVSQNDRDDEIDN 540

Query: 697  SEADSCS--------------------RHKYLNGGKNGSTYXXXXXXXXXXXXFLRSLGW 578
               D+C                       K+L+ G+  S+             FLRSLGW
Sbjct: 541  VNGDACDVSQRDQGDEVHDGNGDACDVSQKFLDNGEKHSSPDEVLYPDEEEAAFLRSLGW 600

Query: 577  XXXXXXXGLTDEEIKAFYRDATNKFVNAKPSSK-IS*MMARIHLALESRIGCVSAIFSGL 401
                   GLT+EEI AFY++     +  KPSS  +  M+ +I   L+S++G V+   SGL
Sbjct: 601  EENGEDEGLTEEEINAFYKEC----MKLKPSSNLLQRMLPKISPLLDSQMGSVAGAVSGL 656

Query: 400  KSS 392
             SS
Sbjct: 657  SSS 659


>XP_011087795.1 PREDICTED: uncharacterized protein LOC105169167 [Sesamum indicum]
          Length = 624

 Score =  366 bits (940), Expect = e-112
 Identities = 253/634 (39%), Positives = 333/634 (52%), Gaps = 29/634 (4%)
 Frame = -1

Query: 2206 MDKSEPALVPQWLRSSESVSGNGISINPIASSSLRPDDHLKPKVARKNSLSN-NDQNIKR 2030
            M++SEP LVP+WL+++ +++G G        S    DDH   +VAR  S  N N     R
Sbjct: 1    MERSEPTLVPEWLKNTGNLTGAG--------SISHSDDHAASRVARNKSFVNSNGHEFGR 52

Query: 2029 SSVSDRITSSYFXXXXXXXXXXXXXXXXXXXXXXRDRDWDGDICESGSKDRSILGDFRHL 1850
            SS S+R TSSYF                      RDRDW+ D+ +S  +D+S+L D  H 
Sbjct: 53   SSSSERTTSSYFRRSSSSNSSGNFRSYSSFGRSQRDRDWEKDVYDSRDQDKSVLADHWHW 112

Query: 1849 DYSDPLGNSILSRVDKKILRRSQSMVSDQRGEAWPRKV--EIGSKGAYKARSTNADRMPA 1676
            D+SDPLGNS+LS+ ++  LRRSQSMVS +RG+ WP+KV  ++ S     A        P 
Sbjct: 113  DFSDPLGNSLLSKYERDGLRRSQSMVSGKRGDTWPKKVVTDLSSASGKNANGLLYRGSPV 172

Query: 1675 NGLLHKASFERDFPSLGVDDKSSATEVKRVPSPGLSPAIHSFPVVASAMIGSDKWTSALA 1496
             G   KA+FE+DFPSLG D+++   EV RVPSPGLS AI S PV  S +I  +KWTSALA
Sbjct: 173  GGRAKKATFEKDFPSLGADERAVVPEVGRVPSPGLSTAIQSLPVGTSGLIVGEKWTSALA 232

Query: 1495 EVPGVVRSSEGGSLS--EKXXXXXXXXXXXSALTGPSMAETVAQGPPHAKTTPQLSTETQ 1322
            EVP V+  S G +LS  ++            + T  +MAE VAQGP  A+TTPQLS  TQ
Sbjct: 233  EVP-VLVGSNGTALSSVQQAAPSSSASVALGSTTSLNMAEAVAQGPSRAQTTPQLSVGTQ 291

Query: 1321 R-QELAIKQSRQLIPVTPSLPKALVPNLPEKQKGKTGLSL--------LVHSPRGGSVKS 1169
            R +ELAIKQSRQLIPVTPS+PKALV    +K KGK G           L HSPRGG+VK 
Sbjct: 292  RLEELAIKQSRQLIPVTPSMPKALVLTSSDKPKGKVGQQQHSISSSLPLNHSPRGGAVKG 351

Query: 1168 DPSKTSSIGKLQVLKPVRERNGGPLTAKDXXXXXXXXXXXSAVPVGRQSVAASSSIRAPS 989
            D +K S++GKLQVLKPVRE+NG     KD           ++      SV+ S++ R   
Sbjct: 352  DVAKASNVGKLQVLKPVREKNGVTPVVKDNLSPTSSSKVVTSTLAVSPSVSGSAATRGLP 411

Query: 988  INSGHPAAERKHGLPLLEKXXXXXXXXXXXSDFFSLMRKKXXXXXXXXXXXXXXXXXSD- 812
             N  H   +RK  L +LEK           +DFF+L+RKK                 S  
Sbjct: 412  NNGVH---DRKPSLTVLEK--RPTSQAQSRNDFFNLVRKKSMPNSSSAVADSAMANCSSV 466

Query: 811  ---------KFSESDNKTFLF----THQRGEDMVLNDINGIDSSENRNVETSEADSCSRH 671
                      FS+ D +  +     T +  +  + N ++    SE +   TS  D+C   
Sbjct: 467  LDTGTAISPSFSDKDVEIDILPSSNTPKAADVPLSNSLSADRLSEEKGDLTSNGDACDAQ 526

Query: 670  KYLNGGKNGSTYXXXXXXXXXXXXFLRSLGWXXXXXXXGLTDEEIKAFYRDATNKFVNAK 491
             Y+  GK   +             FLRSLGW        LTDEEI AFYRD T K++++ 
Sbjct: 527  NYVRNGKKYPS-SDPIISEEEEAAFLRSLGWDENSDEGALTDEEINAFYRDLT-KYIDSN 584

Query: 490  PSSKI-S*MMARIHLALESRIGCVSAIFSGLKSS 392
            PS +I   +  +  L   S +G +  I SGL SS
Sbjct: 585  PSFRILQGVQLKFLLPFGSELGGIGGISSGLSSS 618


>CDO97516.1 unnamed protein product [Coffea canephora]
          Length = 599

 Score =  347 bits (889), Expect = e-105
 Identities = 242/622 (38%), Positives = 330/622 (53%), Gaps = 18/622 (2%)
 Frame = -1

Query: 2206 MDKSEPALVPQWLRSSESVSGNGISINPIASSSLRPDDHLKPKVAR-KNSLSNNDQNIKR 2030
            M++SEP+LVP+WL+SS S +G+G + +P++ S    DDH   K+AR K+S+++ND  I R
Sbjct: 1    MERSEPSLVPEWLKSSGSATGSGTTSHPLSPS----DDHAVSKLARNKSSVNHNDHEIGR 56

Query: 2029 SSVSDRITSSYFXXXXXXXXXXXXXXXXXXXXXXRDRDWDGDICESGSKDRSILGDFRHL 1850
            SSVSDR ++SYF                      R RDWD D+ E   +D  ++G  +H 
Sbjct: 57   SSVSDRTSASYFRRSSSSNGSGQMQSYSSFGRNHRGRDWDKDLYEPRDRDNLVVGGHKHR 116

Query: 1849 DYSDPLGNSILSRVDKKILRRSQSMVSDQRGEAWPRKVEIGSKGAYKARSTNA----DRM 1682
            DY DP  N+     +K  LRRSQSMVS +R E WP++    S  A + +ST+     D+ 
Sbjct: 117  DYLDPPVNNFPGNFEKDGLRRSQSMVSRKRNEIWPKRSIADSNSASRNKSTDGNSLLDKG 176

Query: 1681 PANGLLHKASFERDFPSLGVDDKSSATEVKRVPSPGLSPAIHSFPVVASAMIGSDKWTSA 1502
             + G +HK  FERDFPSLG +++ + +EV RVPSPGL+ AIH  P+ ASA+I  DKWTSA
Sbjct: 177  DSVGTVHKVVFERDFPSLGSEERQATSEVGRVPSPGLNTAIHGLPISASAIIAGDKWTSA 236

Query: 1501 LAEVPGVVRSSEGGSLSEKXXXXXXXXXXXSALT--GPSMAETVAQGPPHAKTTPQLSTE 1328
            LAEVP +V     G    +            + T  G +MAETVAQG P  +  P++++ 
Sbjct: 237  LAEVPAIVGGGGTGLSPGRQASLPSSPASLPSSTSAGLNMAETVAQG-PRVQAAPKITSG 295

Query: 1327 TQR-QELAIKQSRQLIPVTPSLPKALVPNLPEKQKGKTG-------LSLLVHSPRGGSVK 1172
            TQR +ELAI+QSRQLIP+TPS+PK  + N  +K K K G         LL  S RGG VK
Sbjct: 296  TQRLEELAIRQSRQLIPMTPSMPKPSILNSSDKGKAKAGQPQHPVSSPLLSPSLRGGPVK 355

Query: 1171 SDPSKTSSIGKLQVLKPVRERNGGPLTAKDXXXXXXXXXXXSAVPVGRQSVAASSSIRAP 992
            +D SKTS+ GKL VLKP RERNG    +KD           ++      SV   ++ R P
Sbjct: 356  TDASKTSNAGKLLVLKPPRERNGVSTASKDTLSPTSSTRAATSGIAVATSVTGLATSRGP 415

Query: 991  SINSGHPAAERKHGLPLLEKXXXXXXXXXXXSDFFSLMRKK--XXXXXXXXXXXXXXXXX 818
            +IN   P AERKH LP+LEK           +DFF+LMRKK                   
Sbjct: 416  AINPVSPGAERKHALPMLEK--KPSSQAQSRNDFFNLMRKKSMPSSSSVADAGSAVSAST 473

Query: 817  SDKFSESDNKTFLFTHQRGEDMVLNDINGIDSSENRNVETSEADSCSRHKYLNGGKNGST 638
             D+  E +       H+  +   L+ +NG   +EN           SR   L   +  + 
Sbjct: 474  LDEPGELEVIPAPVIHEDEDVPSLDRLNGCQHTENDLFGIQ-----SRSLPLFSEEEEAA 528

Query: 637  YXXXXXXXXXXXXFLRSLGWXXXXXXXGLTDEEIKAFYRDATNKFVNAKPSSK-IS*MMA 461
            +             L  LGW       GLT+EEI AF+RD  +K++N+KPSSK +  +  
Sbjct: 529  F-------------LHQLGWQENADEDGLTEEEINAFFRD-LSKYMNSKPSSKSLQGVQP 574

Query: 460  RIHLALESRIGCVSAIFSGLKS 395
            +  L L S  G + AI SG  S
Sbjct: 575  KFPLLLSSH-GAIGAISSGSDS 595


>XP_011092382.1 PREDICTED: uncharacterized protein LOC105172576 [Sesamum indicum]
          Length = 616

 Score =  342 bits (876), Expect = e-102
 Identities = 244/632 (38%), Positives = 332/632 (52%), Gaps = 28/632 (4%)
 Frame = -1

Query: 2206 MDKSEPALVPQWLRSSESVSGNGISINPIASSSLRPDDHLKPKVARKNSLSN-NDQNIKR 2030
            M++SEP L+P+WLRS+ S++G G        S    D+    K+AR  SL N N  +  R
Sbjct: 1    MERSEPTLIPEWLRSAGSLNGGG--------SISHSDEQTTTKLARNKSLVNSNGHDSAR 52

Query: 2029 SSVSDRITSSYFXXXXXXXXXXXXXXXXXXXXXXRDRDWDGDICESGSKDRSILGDFRHL 1850
            S  SDR TSSYF                       DRDW+ D C+S  KD+S+LGD  H 
Sbjct: 53   SFSSDRTTSSYFRRSSSSNGSGHLRSHSSFGRNHHDRDWEKDACDSRDKDKSVLGDRWHR 112

Query: 1849 DYSDPLGNSILSRVDKKILRRSQSMVSDQRGEAWPRKVEIGSKGAYKARSTNADRMPANG 1670
            D+SD +GN++LS+ ++  LRRSQSM+S +RG+ W +KV         A   N + +P+ G
Sbjct: 113  DFSDAMGNTLLSKFERDGLRRSQSMISGKRGDTWHKKV---GTDLNIASGNNTNGLPSKG 169

Query: 1669 L----LHKASFERDFPSLGVDDKSSATEVKRVPSPGLSPAIHSFPVVASAMIGSDKWTSA 1502
                 ++K +FERDFPSLG +++++  EV RVPSPG+S A+ S P+    +I  +KW SA
Sbjct: 170  SPIGGVNKTTFERDFPSLGAEERAAIPEVGRVPSPGVSSALQSLPIGTPTIIRGEKWRSA 229

Query: 1501 LAEVPGVVRSSEGG-SLSEKXXXXXXXXXXXSALTGPSMAETVAQGPPHAKTTPQLSTET 1325
            LAEVP +V ++  G S  ++            + T  +MAE VAQGP  A+TTPQLS  T
Sbjct: 230  LAEVPVLVGNNVTGISSVQQAAPSSSASVALGSTTSLNMAEAVAQGPSRAQTTPQLSIGT 289

Query: 1324 QR-QELAIKQSRQLIPVTPSLPKALVPNLPEKQKGKTGLSLLV--------HSPRGGSVK 1172
            QR +ELAIKQSRQLIPVTPS+PK L     +KQK K G    V         SPRGG VK
Sbjct: 290  QRLEELAIKQSRQLIPVTPSMPKPLAACSADKQKTKVGQQQHVVTSSLAANQSPRGGPVK 349

Query: 1171 SDPSKTSSIGKLQVLKPVRERNGGPLTAKDXXXXXXXXXXXSAVPVGRQSVAASSSIRAP 992
            +D SKTS++GKL VLKPVRE+NG     K+           S+ P+   S++ S++ R  
Sbjct: 350  ADVSKTSNVGKLHVLKPVREKNGTTPVVKENLSPTSGSKLVSS-PLAAPSLSGSAATR-- 406

Query: 991  SINSGHPAAERKHGLPLLEKXXXXXXXXXXXSDFFSLMRKK------------XXXXXXX 848
             +   +P A+RK    +LEK           +DFF+ +RKK                   
Sbjct: 407  -VLPNNPVADRKPVWTVLEK--RPTSQAQSRNDFFNSVRKKSMANSTSVADAAIANSSPV 463

Query: 847  XXXXXXXXXXSDKFSESDNKTFLFTHQRGEDMVLNDINGIDSSENRNVETSEADSCSRHK 668
                      SDK +E++      T  R     +N ++G + S  R+      D C    
Sbjct: 464  DTAPAASPSFSDKLTETEIVVAPNTQDRNASSGVN-LSGENLSGTRSDTACNGDVCDAQN 522

Query: 667  YLNGGKNGSTYXXXXXXXXXXXXFLRSLGWXXXXXXXGLTDEEIKAFYRDATNKFVNAKP 488
            Y++ GK   T             FLRSLGW       GLTDEEI AF+RD T K+V++KP
Sbjct: 523  YVSNGKKNHT-SDPIFSEEEEAAFLRSLGWEENADEGGLTDEEISAFFRDVT-KYVDSKP 580

Query: 487  SSKI-S*MMARIHLALESRIGCVSAIFSGLKS 395
            S KI   +  +I L  +S IG +S   SGL S
Sbjct: 581  SLKILQAVQPKILLPFDSHIGGIS---SGLNS 609


>KVI01779.1 hypothetical protein Ccrd_019964, partial [Cynara cardunculus var.
            scolymus]
          Length = 551

 Score =  318 bits (814), Expect = 2e-94
 Identities = 239/578 (41%), Positives = 307/578 (53%), Gaps = 15/578 (2%)
 Frame = -1

Query: 2206 MDKSEPALVPQWLRSSESVSGNGISINPIASSSLRPDDHLKPKVARKNSLSNN-DQNIKR 2030
            M+++EP  VP+WL+SS S+S      +   SSSL PDD    K  R  SL N+ D ++ R
Sbjct: 1    MERTEPTFVPEWLKSSGSLSTIS---HQFTSSSLHPDDQGVSKSLRTKSLVNSGDNDLGR 57

Query: 2029 SSVSDRITSSYFXXXXXXXXXXXXXXXXXXXXXXRDRDWDGDICESGSKDRSILGDFRHL 1850
            +SVSDR TSSYF                      RDRDWD DI E   K++S   D RH 
Sbjct: 58   TSVSDRTTSSYFRRTSSSNGAAHLRSYNSFSRNHRDRDWDKDIYEFRDKEKS---DNRHR 114

Query: 1849 DYSDPLGNSILSRVDKKILRRSQSMVSDQRGEAWPRKVEIGSKGAYKARSTNADRMPANG 1670
            DYSD L N + SR +K  LRRS S +S +RGE+WPRKV  G K  +     N   +P+ G
Sbjct: 115  DYSDHLANILPSRFEKDGLRRSHSSLSAKRGESWPRKVA-GDKNGHN----NGSALPSVG 169

Query: 1669 LLH---KASFERDFPSLGVDDKSSATEVKRVPSPGLSPAIHSFPVVASAMIGSDKWTSAL 1499
                  KA+FERDFPSLG ++K + TE+ RVPSPGL+ AI S P+ +SA+I  D WTSAL
Sbjct: 170  TSSSSGKAAFERDFPSLGAEEKQADTEIGRVPSPGLTTAIQSLPIGSSAVICGDMWTSAL 229

Query: 1498 AEVPGVVRSSEGGSLSEKXXXXXXXXXXXSALTGPSMAETVAQGPPHAKTTPQLSTETQR 1319
            AEVP +V S+      ++           S  TG +MAET+AQGP  A+TTPQLS  TQR
Sbjct: 230  AEVPMIVGSNGSNISVQQPIQPTSVSATTSMTTGRNMAETLAQGPSRARTTPQLSVGTQR 289

Query: 1318 -QELAIKQSRQLIPVTPSLPKALVPNLPEKQKGKTGLSLLVHSP--------RGGSVKSD 1166
             +ELA+KQSRQLIP+TPS+PKAL  N  +K K K G S L +S         R  SVKSD
Sbjct: 290  LEELAVKQSRQLIPMTPSMPKALALNSSDKPKLKVGQSQLQNSHIVNHPPSLRPVSVKSD 349

Query: 1165 PSKTSSIGKLQVLKPVRERNGGPLTAKDXXXXXXXXXXXSAVPVGRQSVAASSSIRAPSI 986
             +K S++GKL +LK  RERNG   TAK+           ++ P+    V  S+S+R  + 
Sbjct: 350  VTKVSTVGKLHILKSSRERNGTTSTAKESLSPTGGSKLPNS-PLAVPVVVGSASLR--NT 406

Query: 985  NSGHPAAERKHGLPLLEKXXXXXXXXXXXSDFFSLMRKKXXXXXXXXXXXXXXXXXSDKF 806
                  A+RK   P +EK           +DFF+LMRKK                     
Sbjct: 407  GGSTIVADRK---PCVEK--RPSPQAQSRNDFFNLMRKKSMATNSSSPGASEAGS----- 456

Query: 805  SESDNKTFLFTHQRGEDMVLNDIN-GIDS-SENRNVETSEADSCSRHKYLNGGKNGSTYX 632
            SES N         G D V+ D + G+ + SEN+   +   D+  R    N  KN S+  
Sbjct: 457  SESTNDKPGEPQVGGYDPVVVDRSCGVQTLSENKVDFSCNGDATERS---NNEKNHSSSD 513

Query: 631  XXXXXXXXXXXFLRSLGWXXXXXXXGLTDEEIKAFYRD 518
                       FLRSLGW       GLT+EEI +FYRD
Sbjct: 514  AILYSEEEEARFLRSLGWEETTEEEGLTEEEINSFYRD 551


>XP_017259125.1 PREDICTED: uncharacterized protein LOC108228142 [Daucus carota subsp.
            sativus]
          Length = 620

 Score =  316 bits (810), Expect = 4e-93
 Identities = 236/627 (37%), Positives = 317/627 (50%), Gaps = 22/627 (3%)
 Frame = -1

Query: 2206 MDKSEPALVPQWLRSSESVSGNGISINPIASSSLRPDDHLKPKVARKNSLSNNDQNIKRS 2027
            M+KSEP LVP+WL+SS SV+G G+S N + + SL  D+    K AR  SL N    I   
Sbjct: 1    MEKSEPTLVPEWLKSSGSVTG-GVSTNHL-NPSLHQDNQATLKAARNKSLVN----IGDH 54

Query: 2026 SVSDRITSSYFXXXXXXXXXXXXXXXXXXXXXXRDRDWDGDICESGSKDRSILGDFRHLD 1847
             +  R TSSYF                       DRDWD DI +   K++S LGD ++  
Sbjct: 55   DIGHRTTSSYFRRSSSNGTSHLRSYGSFGRNNR-DRDWDRDIHDIRDKEKSNLGDRKYRQ 113

Query: 1846 YSDPLGNSILSRVDKKILRRSQSMVSDQRGEAWPRKVEIGSKGAYKARSTNADRMPANGL 1667
            +SD   ++ LSR +K  LRR+QS +S    E WPR+V    K   K+   N +   A   
Sbjct: 114  FSDSFESNSLSRFEKDGLRRTQSTISRTGVEPWPRRVPSDLKNIDKSNHNNGNSRLAVSS 173

Query: 1666 ----LHKASFERDFPSLGVDDKSSATEVKRVPSPGLSPAIHSFPVVASAMIGSDKWTSAL 1499
                +HKASF+RDFPSLG +++    E+ RVPSPGL  AI + P  +SA I    WTSAL
Sbjct: 174  PISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIADGGWTSAL 233

Query: 1498 AEVPGVVRSSEGGSLSEKXXXXXXXXXXXSALTGPSMAETVAQGPPHAKTTPQLSTETQR 1319
            AEVP ++ S+   + S             S +TG +MAET+ QGPP  +  PQLS ETQR
Sbjct: 234  AEVPAMIGSNGTTASSVPHSVSSSASVVPSMMTGLNMAETLVQGPPRVQADPQLSVETQR 293

Query: 1318 -QELAIKQSRQLIPVTPSLPKALVPNLPEKQKGKTGL-------SLLVHSPRGGSVKSDP 1163
             +ELAIKQSRQLIPVTPSLPKALV N  +K KGK GL       +L+ HSPRG   K++ 
Sbjct: 294  LEELAIKQSRQLIPVTPSLPKALVLNSSDKAKGKVGLQQQSASTNLVHHSPRGAPTKNEI 353

Query: 1162 SKTSSIGKLQVLKPVRERNGGPLTAKDXXXXXXXXXXXSAVPVGRQSVAASSSIRAPSIN 983
             KTSS+GKLQVLKP RERNG   T+KD           +       +   S+ +R+   +
Sbjct: 354  IKTSSLGKLQVLKPARERNGVSNTSKDTLSPTSSSKLANNPLAPALATVGSAPLRSSMNH 413

Query: 982  SGHPAAERKHG-----LPLLEKXXXXXXXXXXXSDFFSLMRKK-XXXXXXXXXXXXXXXX 821
            S   +AERK        P+LEK           +DFF+ MRKK                 
Sbjct: 414  SILVSAERKSAPPVMVTPMLEK--RPSPQAKSRNDFFNSMRKKSMTNSSSAVSNTVSAVS 471

Query: 820  XSDKFSESDNKTFLFTHQRGEDMVL---NDINGIDSSENRNVETSEADSCSRHKYLNGGK 650
             SD    S+ +       +G D+ +   +D   I+   + +++ S     S    L+ G 
Sbjct: 472  PSDLGKNSEGEASASLDSQGRDVPVVESSDEGKINECRDGSIQNSHGPQNS----LDNGV 527

Query: 649  NGSTYXXXXXXXXXXXXFLRSLGW-XXXXXXXGLTDEEIKAFYRDATNKFVNAKPSSKIS 473
            N S+             FLRSLGW        GLT+EEI AFYRD +    +A PS  + 
Sbjct: 528  NHSSTDVILSSEEEEAAFLRSLGWEENAGEDEGLTEEEINAFYRDVSKYINSAPPSKTLL 587

Query: 472  *MMARIHLALESRIGCVSAIFSGLKSS 392
                ++   +  ++G    + SG+ SS
Sbjct: 588  GTKQKLFGPINFQMGSNGGVSSGVSSS 614


>KVI10980.1 hypothetical protein Ccrd_010614 [Cynara cardunculus var. scolymus]
          Length = 629

 Score =  314 bits (805), Expect = 3e-92
 Identities = 246/637 (38%), Positives = 323/637 (50%), Gaps = 60/637 (9%)
 Frame = -1

Query: 2206 MDKSEPALVPQWLRSSESVSGNGISINPIASSSLRP------------------------ 2099
            M+++EP  VP+WL+SS    G+  + +   SSSL P                        
Sbjct: 1    MERTEPTFVPEWLKSS---GGSSTTSHQFTSSSLHPGNSYIYVCCFNKYGVNDHNICFDY 57

Query: 2098 ---------DDHLKPKVAR-KNSLSNNDQNIKRSSVSDRITSSYFXXXXXXXXXXXXXXX 1949
                     D+    K  R K+S++++D ++ R+SVSDR TSSYF               
Sbjct: 58   PSDGIFLAVDEQGSSKSGRNKSSVNSSDNDLGRTSVSDRTTSSYFRRTSLGNGSTHLRSY 117

Query: 1948 XXXXXXXRDRDWDGDICESGSKDRSILGDFRHLDYSDPLGNSILSRVDKKILRRSQSMVS 1769
                   RDRDWD DI E  SK++S   D RH DYSDPL N + SR +K  LRRS S VS
Sbjct: 118  SSFGRNHRDRDWDKDIYEFWSKEKS---DNRHRDYSDPLDNILPSRFEKDGLRRSHSSVS 174

Query: 1768 DQRGEAWPRKVEIGSKGAYKARSTNADRMPANGLLH---KASFERDFPSLGVDDKSSATE 1598
             +RGE+WPRKV      A K+  +N   + + G      K SFERDFPSLG D+K +  +
Sbjct: 175  GKRGESWPRKVVSDLSIANKSSHSNGTALLSGGSSLSNVKTSFERDFPSLGADEKQADPD 234

Query: 1597 VKRVPSPGLSPAIHSFPVVASAMIGSDKWTSALAEVPGVVRSSEGGSLSEKXXXXXXXXX 1418
            + RVPSPGLS AI S P+  SA+IG D WTSALAEVP V+  S G S S           
Sbjct: 235  IGRVPSPGLSSAIQSLPIGNSAVIGGDGWTSALAEVP-VIVGSNGNSTSVSQPVQPTSIT 293

Query: 1417 XXSALT-GPSMAETVAQGPPHAKTTPQ-----------LSTETQR-QELAIKQSRQLIPV 1277
              +++T G +MAET+A GPP  +T PQ           L+  TQR +ELA+KQSRQLIP+
Sbjct: 294  ATTSMTGGRNMAETLAHGPPRTQTAPQVAQMLLMGSTILTVGTQRLEELAVKQSRQLIPM 353

Query: 1276 TPSLPKALVPNLPEKQKGKTGLSLLV---HSPRGGSVKSDPSKTSSIGKLQVLKPVRERN 1106
            TPS+PKAL  +  +K K K G S LV   H+PR  SVKSD SKTS++GKL VLKP RERN
Sbjct: 354  TPSMPKALALSSSDKPKLKIGQSQLVNHPHTPRPLSVKSDVSKTSTVGKLLVLKPSRERN 413

Query: 1105 GGPLTAKDXXXXXXXXXXXSAVPVGRQSVAASSSIRAPSINSGHPAAERKHGLPLLEKXX 926
            G   TAK+           ++ P+   S   S+ +R    N G  A ERK  +  LEK  
Sbjct: 414  GISPTAKESLSPTGGSKLPNS-PLAVPSAIGSAPLRNMGNNPGVTAVERKPSVATLEK-- 470

Query: 925  XXXXXXXXXSDFFSLMRKKXXXXXXXXXXXXXXXXXSDKFSESDNKTFLFTHQRGEDMVL 746
                     ++FF+LMRKK                 S+     D  + + + ++    V 
Sbjct: 471  RPSSQAQSRNNFFNLMRKK--------------SMISNSSVAPDTGSSVSSSEKPGAPVA 516

Query: 745  NDINGIDSSENRNVETS-----EADSC-SRHKYLNGGKNGSTYXXXXXXXXXXXXFLRSL 584
               +   S  N  VET      + D+C +  +  N GKN S              FLRSL
Sbjct: 517  PPAHLGGSESNTTVETKVDLTCKGDACVATVRSTNNGKNHSGPDAVLCSEEEEARFLRSL 576

Query: 583  GW-XXXXXXXGLTDEEIKAFYRDATNKFVNAKPSSKI 476
            GW        GLT+EEI +FYR+    ++N KP+SKI
Sbjct: 577  GWDETAEEEEGLTEEEISSFYRN----YLNLKPTSKI 609


>KZM90816.1 hypothetical protein DCAR_021819 [Daucus carota subsp. sativus]
          Length = 617

 Score =  313 bits (803), Expect = 4e-92
 Identities = 237/627 (37%), Positives = 317/627 (50%), Gaps = 22/627 (3%)
 Frame = -1

Query: 2206 MDKSEPALVPQWLRSSESVSGNGISINPIASSSLRPDDHLKPKVARKNSLSNNDQNIKRS 2027
            M+KSEP LVP+WL+SS SV+G G+S N + + SL  D+    K AR  SL N    I   
Sbjct: 1    MEKSEPTLVPEWLKSSGSVTG-GVSTNHL-NPSLHQDNQATLKAARNKSLVN----IGDH 54

Query: 2026 SVSDRITSSYFXXXXXXXXXXXXXXXXXXXXXXRDRDWDGDICESGSKDRSILGDFRHLD 1847
             +  R TSSYF                       DRDWD DI +   K++S LGD ++  
Sbjct: 55   DIGHRTTSSYFRRSSSNGTSHLRSYGSFGRNNR-DRDWDRDIHDIRDKEKSNLGDRKYRQ 113

Query: 1846 YSDPLGNSILSRVDKKILRRSQSMVSDQRGEAWPRKVEIGSKGAYKARSTNADRMPANGL 1667
            +SD   ++ LSR +K  LRR+QS +S    E WPR+V    K   K+   N +   A   
Sbjct: 114  FSDSFESNSLSRFEKDGLRRTQSTISRTGVEPWPRRVPSDLKNIDKSNHNNGNSRLAVSS 173

Query: 1666 ----LHKASFERDFPSLGVDDKSSATEVKRVPSPGLSPAIHSFPVVASAMIGSDKWTSAL 1499
                +HKASF+RDFPSLG +++    E+ RVPSPGL  AI + P  +SA I    WTSAL
Sbjct: 174  PISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIADGGWTSAL 233

Query: 1498 AEVPGVVRSSEGGSLSEKXXXXXXXXXXXSALTGPSMAETVAQGPPHAKTTPQLSTETQR 1319
            AEVP ++ S+   + S             S +TG +MAET+ QGPP  +  PQLS ETQR
Sbjct: 234  AEVPAMIGSNGTTASSVPHSVSSSASVVPSMMTGLNMAETLVQGPPRVQADPQLSVETQR 293

Query: 1318 -QELAIKQSRQLIPVTPSLPKALVPNLPEKQKGKTGL-------SLLVHSPRGGSVKSDP 1163
             +ELAIKQSRQLIPVTPSLPKALV N  +K KGK GL       +L+ HSPRG   K++ 
Sbjct: 294  LEELAIKQSRQLIPVTPSLPKALVLNSSDKAKGKVGLQQQSASTNLVHHSPRGAPTKNEI 353

Query: 1162 SKTSSIGKLQVLKPVRERNGGPLTAKDXXXXXXXXXXXSAVPVGRQSVAASSSIRAPSIN 983
             KTSS+GKLQVLKP RERNG   T+KD           +       +   S+ +R+   +
Sbjct: 354  IKTSSLGKLQVLKPARERNGVSNTSKDTLSPTSSSKLANNPLAPALATVGSAPLRSSMNH 413

Query: 982  SGHPAAERKHG-----LPLLEKXXXXXXXXXXXSDFFSLMRKK-XXXXXXXXXXXXXXXX 821
            S   +AERK        P+LEK           +DFF+ MRKK                 
Sbjct: 414  SILVSAERKSAPPVMVTPMLEK--RPSPQAKSRNDFFNSMRKKSMTNSSSAVSNTVSAVS 471

Query: 820  XSDKFSESDNKTFLFTHQRGEDMVL---NDINGIDSSENRNVETSEADSCSRHKYLNGGK 650
             SD    S+ +       +G D+ +   +D   I+   + +++ S     S    L+ G 
Sbjct: 472  PSDLGKNSEGEASASLDSQGRDVPVVESSDEGKINECRDGSIQNSHGPQNS----LDNGV 527

Query: 649  NGSTYXXXXXXXXXXXXFLRSLGW-XXXXXXXGLTDEEIKAFYRDATNKFVNAKPSSKIS 473
            N S+             FLRSLGW        GLT+EEI AFYRD  N   +A PS  + 
Sbjct: 528  NHSSTDVILSSEEEEAAFLRSLGWEENAGEDEGLTEEEINAFYRDYIN---SAPPSKTLL 584

Query: 472  *MMARIHLALESRIGCVSAIFSGLKSS 392
                ++   +  ++G    + SG+ SS
Sbjct: 585  GTKQKLFGPINFQMGSNGGVSSGVSSS 611


>KVH93465.1 hypothetical protein Ccrd_004483 [Cynara cardunculus var. scolymus]
          Length = 636

 Score =  310 bits (795), Expect = 9e-91
 Identities = 241/625 (38%), Positives = 305/625 (48%), Gaps = 44/625 (7%)
 Frame = -1

Query: 2218 LSLIMDKSEPALVPQWLRSSESVSGNGISINPIASSSLRP-------------------- 2099
            L+L M++SEP  VP+WL+SS  +S      + + SSSL                      
Sbjct: 6    LALTMERSEPTFVPEWLKSSGGLSTTS---HQLQSSSLHSGNSIHFISQQYMLFGISFQF 62

Query: 2098 ----------DDHLKPKVARKNSLSN-NDQNIKRSSVSDRITSSYFXXXXXXXXXXXXXX 1952
                      D+    K  R  S  N +D  + R SVSDR TSSYF              
Sbjct: 63   CYLPDNVVLLDEQGVSKATRNKSFVNISDNELGRPSVSDRTTSSYFRRTSSNGSSHLRSY 122

Query: 1951 XXXXXXXXRDRDWDGDICESGSKDRSILGDFRHLDYSDPLGNSILSRVDKKILRRSQSMV 1772
                     DRDWD DI E   K++    D R  DYSDPLGN + SR +K+ LRRS S V
Sbjct: 123  SSFGRNHR-DRDWDKDIHEFREKEKP---DGRLRDYSDPLGNILPSRFEKEGLRRSHSSV 178

Query: 1771 SDQRGEAWPRKVEIGSKGAYKARSTNADRMPAN-GLLH--KASFERDFPSLGVDDKSSAT 1601
            S +RGE+WPRKV + S  A K    N   + +  G +   K +FERDFPSLG ++K    
Sbjct: 179  SAKRGESWPRKVVVDSSSANKNSHNNGSALRSGAGAIGSVKTAFERDFPSLGAEEKQIDP 238

Query: 1600 EVKRVPSPGLSPAIHSFPVVASAMIGSDKWTSALAEVPGVVRSSEGGSLSEKXXXXXXXX 1421
            E+ RVPSPGL+ AI S P+  SA+IG D WTSALAEVP +V S+   +            
Sbjct: 239  EIGRVPSPGLTTAIQSLPIGNSAVIGGDGWTSALAEVPVIVGSNGSNTSVPPPLQSTSIS 298

Query: 1420 XXXSALTGPSMAETVAQGPPHAKTTPQLSTETQR-QELAIKQSRQLIPVTPSLPKALVPN 1244
               S  TG +MAET+AQGPP A+T PQLS  TQR +ELA+KQSRQLIP+TPSLPKAL  N
Sbjct: 299  ATASMATGRNMAETLAQGPPRAQTAPQLSVGTQRLEELAVKQSRQLIPMTPSLPKALALN 358

Query: 1243 LPEKQKGKTGLSLL--------VHSPRGGSVKSDPSKTSSIGKLQVLKPVRERNGGPLTA 1088
              +K K K G   L         HSPR  S K D SKTSS+GKL VLKP RERNG    A
Sbjct: 359  SSDKPKSKVGQLQLQSSHLVNHTHSPRPVSTKFDVSKTSSVGKLHVLKPSRERNGITPIA 418

Query: 1087 KDXXXXXXXXXXXSAVPVGRQSVAASSSIRAPSINSGHPAAERKHGLPLLEKXXXXXXXX 908
            KD           ++ P+   SV  S+ +R    N     A +      LEK        
Sbjct: 419  KDNLSPTGASKLPNS-PLAVTSVVGSAPLRNLGNNPAVAVAVKPGVAATLEK--RPSSQA 475

Query: 907  XXXSDFFSLMRKKXXXXXXXXXXXXXXXXXSDKFSESDNKTFLFTHQRGEDMVLNDINGI 728
               +DFF+LMRKK                     S  D  T   T    +  V++   G+
Sbjct: 476  QSRNDFFNLMRKK----SMTNNSSPVTPDTGSSISAGDKPT--ATEGGIDPAVVDGSGGV 529

Query: 727  DSSENRNVETSEADSCSRHKYLNGGKNGSTYXXXXXXXXXXXXFLRSLGW-XXXXXXXGL 551
              S    V+ S  +  +  +  NG  N S+             FLRSLGW        GL
Sbjct: 530  QVSSGNKVDLSSCNGEATER-SNGKNNSSSDAIILYSEEEEARFLRSLGWEETGEEEEGL 588

Query: 550  TDEEIKAFYRDATNKFVNAKPSSKI 476
            T+EEI +FYRD  +K++N + +SKI
Sbjct: 589  TEEEISSFYRD-VSKYLNLQAASKI 612


>XP_010245092.1 PREDICTED: uncharacterized protein LOC104588732 isoform X1 [Nelumbo
            nucifera]
          Length = 645

 Score =  310 bits (795), Expect = 1e-90
 Identities = 241/654 (36%), Positives = 331/654 (50%), Gaps = 49/654 (7%)
 Frame = -1

Query: 2206 MDKSEPALVPQWLRSSESVSGNGISINPIASSSLRPDDHLKPKVAR-KNSLSNNDQNIKR 2030
            M KSEP LVP+WL+ +  ++G G + +  ASSSL+ DD+      R ++SLS  D +  R
Sbjct: 1    MAKSEPTLVPEWLKGTGGITGAGSTTHHFASSSLQSDDNAVALPTRNRSSLSIGDYDTPR 60

Query: 2029 SSV-SDRITSSYFXXXXXXXXXXXXXXXXXXXXXXR--------DRDWDGDICESGSKDR 1877
            SS  SDR +S+Y                                DRDW+ DI +   K+R
Sbjct: 61   SSAFSDRTSSAYSRRSSSSNGSIVHDKEIPSYTRSYSAFARSHRDRDWEKDILDFRDKER 120

Query: 1876 SILGDFRHLDYSDPLGNSILSRVDKKILRRSQSMVSDQRGEAWPRKVEIGSKGAYKARST 1697
            S+ GD R LD+SDPL + + SR++K  LRRSQSMVS +RGE WPRKV      A    + 
Sbjct: 121  SVPGDHRDLDFSDPLVSILTSRIEKDTLRRSQSMVSGKRGEVWPRKV------AADLNNG 174

Query: 1696 NADRMPANGLL---------HKASFERDFPSLGVDDKSSATEVKRVPSPGLSPAIHSFPV 1544
            N ++  +NGLL          KA+FERDFPSLG ++K    ++ RV SPGLS A+ S P+
Sbjct: 175  NINQNTSNGLLVGGSIVSSIQKAAFERDFPSLGAEEKPGTPDIGRVSSPGLSSAVQSLPM 234

Query: 1543 VASAMIGSDKWTSALAEVPGVV-RSSEGGSLSEKXXXXXXXXXXXSALTGPSMAETVAQG 1367
             +SA+IG D WTSALAEVP ++  +  G S  ++           ++ TG +MAET+AQ 
Sbjct: 235  GSSALIGGDGWTSALAEVPMIIGNNGTGISSVQQATLGSSASGATNSSTGLNMAETLAQA 294

Query: 1366 PPHAKTTPQLSTETQR-QELAIKQSRQLIPVTPSLPKALVPNLPEKQKGKTGL------- 1211
            P  A+ +PQLS ETQR +ELAIKQSRQLIP+TPS+PK  V N  EK K K  +       
Sbjct: 295  PSRARISPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSVLNSLEKAKPKISVRTGEMNA 354

Query: 1210 -----SLLVHSPRGGSVKSDPSKTSSIGKLQVLKPVRERNGGPLTAKDXXXXXXXXXXXS 1046
                    + S RG  ++SD SKTS  GKL VLK  RE+NG    AKD           +
Sbjct: 355  TKTIQQQQLSSLRGAPMRSDVSKTSHGGKLLVLKAPREKNGISPIAKDGQSPTNVSKVAN 414

Query: 1045 AVPVGRQSVAASSSIRAPSINSGHPAAERK-------HGLPLLEKXXXXXXXXXXXSDFF 887
              P+     AA + +++P  N+   + ERK       HG  + ++           +DFF
Sbjct: 415  N-PLALAPSAAFTPLKSP--NNSKLSNERKSAAASLMHGSSVEKR--PTTSQVQSRNDFF 469

Query: 886  SLMRKKXXXXXXXXXXXXXXXXXSDKFSESDNKTFL---FTHQRGEDMVLNDINGIDSSE 716
            +LMRKK                 S    +S  +T L       +  D    D + +D S 
Sbjct: 470  NLMRKKTSGNLSSAAPDPSPVVSSSLLDKSTEQTALPAAPVSPQSSDAPSPDPSCLDWST 529

Query: 715  NRNVETSEADSCSR--HKYLNGGKNGSTYXXXXXXXXXXXXFLRSLGW-XXXXXXXGLTD 545
                ET    + S    ++LN G+  S+             FLRSLGW        GLT+
Sbjct: 530  ENGSETISNGNASEESQRFLNNGEKHSSPDAFVYPDEEEAAFLRSLGWDENAGEEEGLTE 589

Query: 544  EEIKAFYRDATNKFVNAKPSSKI---S*MMARIHLALESRIGCVSAIFSGLKSS 392
            EEI AFY++    ++  +PSSK+   S    ++ + LESR+G      SGL SS
Sbjct: 590  EEISAFYKE----YMKLRPSSKLCRGSQQQVKLPMPLESRVGSFGGASSGLSSS 639


>XP_010277689.1 PREDICTED: uncharacterized protein YMR317W-like [Nelumbo nucifera]
          Length = 655

 Score =  310 bits (795), Expect = 1e-90
 Identities = 241/662 (36%), Positives = 328/662 (49%), Gaps = 57/662 (8%)
 Frame = -1

Query: 2206 MDKSEPALVPQWLRSSESVSGNGISINPIASSSLRPDDHLKPKVARKN-SLSNNDQNIKR 2030
            M K EP LVP+WL+ + S++G G + +  ASSS   DDH      R   ++S  D +  R
Sbjct: 1    MAKGEPTLVPEWLKGTGSITGGGNTTHHFASSSTHSDDHAVALTTRNRLTMSTGDYDTPR 60

Query: 2029 SSVS-DRITSSYFXXXXXXXXXXXXXXXXXXXXXXR--------DRDWDGDICESGSKDR 1877
            SS   DR +S+YF                               DRDW+ D  +   K++
Sbjct: 61   SSAFLDRTSSAYFRRSSSSNGSMMHDKETSTYSRSYSSFTRSHRDRDWEKDTLDYRDKEK 120

Query: 1876 SILGDFRHLDYSDPLGNSILSRVDKKILRRSQSMVSDQRGEAWPRKVEIGSKGAYKARST 1697
            SILGD R  DYSDPL + + SR +K  LRRSQSM+S +RGE W R+V      A    + 
Sbjct: 121  SILGDHRDRDYSDPLASILTSRXEKDTLRRSQSMISGKRGEGWSRRV------AADTNNG 174

Query: 1696 NADRMPANGLL---------HKASFERDFPSLGVDDKSSATEVKRVPSPGLSPAIHSFPV 1544
            N +    NGLL          KA+FERDFPSLG ++K  A ++ RV SPGLS ++ S P+
Sbjct: 175  NNNHNNGNGLLVGGSIVSSIQKAAFERDFPSLGAEEKQGALDIGRVSSPGLSSSVQSLPI 234

Query: 1543 VASAMIGSDKWTSALAEVPGVV-RSSEGGSLSEKXXXXXXXXXXXSALTGPSMAETVAQG 1367
             +SA+IG D WTSALAEVP ++  +S G S  ++           ++ TG +MAET+AQ 
Sbjct: 235  GSSAVIGGDGWTSALAEVPVIIGNNSIGPSSVQQATPASSTSGAPNSSTGLNMAETLAQA 294

Query: 1366 PPHAKTTPQLSTETQR-QELAIKQSRQLIPVTPSLPKALVPNLPEKQK-------GKTGL 1211
            P   + +PQLS ETQR +ELAIKQSRQLIP+TPS+PK    N  EK K       G+ G+
Sbjct: 295  PSRTRISPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSALNSSEKAKPKAVVRTGEMGI 354

Query: 1210 S-------------LLVHSPRGGSVKSDPSKTSSIGKLQVLKPVRERNGGPLTAKDXXXX 1070
            S             L+ HS RGG V+SD  KTS  GKL VLK  RE+NG   +AKD    
Sbjct: 355  SAKTSQQQQLPSSHLVNHSLRGGPVRSDVPKTSHGGKLLVLKAPREKNGISPSAKDGLSP 414

Query: 1069 XXXXXXXSAVPVGRQSVAASSSIRAPSINSGHP------AAERKHGLPLLEKXXXXXXXX 908
                   +   V     A +  +R+P+ NS  P      A+   HG  + ++        
Sbjct: 415  TNASKVVNNSLVLAPLAAYAPPMRSPN-NSKLPNERKSVASSLTHGSAVEKR--PTTSQV 471

Query: 907  XXXSDFFSLMRKKXXXXXXXXXXXXXXXXXSDKFSESDNKTFLF----THQRGEDMVLND 740
               +DFF+LMRKK                 S    +S   T +        +  D   ++
Sbjct: 472  QSRNDFFNLMRKKTSGNLASAVPDPSPTASSSLLEKSSEPTEVVPTAPVSPQSSDAPSSE 531

Query: 739  INGID-SSENRNVETSEAD-SCSRHKYLNGGKNGSTYXXXXXXXXXXXXFLRSLGW-XXX 569
             +G+D S+EN     S  D S    ++ N G+  ST             FLRSLGW    
Sbjct: 532  PSGLDWSTENGGDLVSNGDVSEESQRFSNNGEKRSTADAFVYPDEEEAAFLRSLGWDENA 591

Query: 568  XXXXGLTDEEIKAFYRDATNKFVNAKPSSKI---S*MMARIHLALESRIGCVSAIFSGLK 398
                GLT+EEI AFYR+    ++  +PSS++   +    ++ L LES +G  S   SGL 
Sbjct: 592  GEEEGLTEEEISAFYRE----YMKVRPSSRLCQGAQQQTKVPLPLESHVGSFSGAASGLS 647

Query: 397  SS 392
            SS
Sbjct: 648  SS 649


>XP_010245093.1 PREDICTED: uncharacterized protein LOC104588732 isoform X2 [Nelumbo
            nucifera]
          Length = 616

 Score =  303 bits (776), Expect = 3e-88
 Identities = 237/644 (36%), Positives = 325/644 (50%), Gaps = 39/644 (6%)
 Frame = -1

Query: 2206 MDKSEPALVPQWLRSSESVSGNGISINPIASSSLRPDDHLKPKVARKNSLSNNDQNIKRS 2027
            M KSEP LVP+WL+ +  ++G G + +  ASSSL+  D      +R++S SN       S
Sbjct: 1    MAKSEPTLVPEWLKGTGGITGAGSTTHHFASSSLQ-SDRTSSAYSRRSSSSNG------S 53

Query: 2026 SVSDRITSSYFXXXXXXXXXXXXXXXXXXXXXXRDRDWDGDICESGSKDRSILGDFRHLD 1847
             V D+   SY                       RDRDW+ DI +   K+RS+ GD R LD
Sbjct: 54   IVHDKEIPSY------------TRSYSAFARSHRDRDWEKDILDFRDKERSVPGDHRDLD 101

Query: 1846 YSDPLGNSILSRVDKKILRRSQSMVSDQRGEAWPRKVEIGSKGAYKARSTNADRMPANGL 1667
            +SDPL + + SR++K  LRRSQSMVS +RGE WPRKV      A    + N ++  +NGL
Sbjct: 102  FSDPLVSILTSRIEKDTLRRSQSMVSGKRGEVWPRKV------AADLNNGNINQNTSNGL 155

Query: 1666 L---------HKASFERDFPSLGVDDKSSATEVKRVPSPGLSPAIHSFPVVASAMIGSDK 1514
            L          KA+FERDFPSLG ++K    ++ RV SPGLS A+ S P+ +SA+IG D 
Sbjct: 156  LVGGSIVSSIQKAAFERDFPSLGAEEKPGTPDIGRVSSPGLSSAVQSLPMGSSALIGGDG 215

Query: 1513 WTSALAEVPGVV-RSSEGGSLSEKXXXXXXXXXXXSALTGPSMAETVAQGPPHAKTTPQL 1337
            WTSALAEVP ++  +  G S  ++           ++ TG +MAET+AQ P  A+ +PQL
Sbjct: 216  WTSALAEVPMIIGNNGTGISSVQQATLGSSASGATNSSTGLNMAETLAQAPSRARISPQL 275

Query: 1336 STETQR-QELAIKQSRQLIPVTPSLPKALVPNLPEKQKGKTGL------------SLLVH 1196
            S ETQR +ELAIKQSRQLIP+TPS+PK  V N  EK K K  +               + 
Sbjct: 276  SVETQRLEELAIKQSRQLIPMTPSMPKTSVLNSLEKAKPKISVRTGEMNATKTIQQQQLS 335

Query: 1195 SPRGGSVKSDPSKTSSIGKLQVLKPVRERNGGPLTAKDXXXXXXXXXXXSAVPVGRQSVA 1016
            S RG  ++SD SKTS  GKL VLK  RE+NG    AKD           +  P+     A
Sbjct: 336  SLRGAPMRSDVSKTSHGGKLLVLKAPREKNGISPIAKDGQSPTNVSKVANN-PLALAPSA 394

Query: 1015 ASSSIRAPSINSGHPAAERK-------HGLPLLEKXXXXXXXXXXXSDFFSLMRKKXXXX 857
            A + +++P  N+   + ERK       HG  + ++           +DFF+LMRKK    
Sbjct: 395  AFTPLKSP--NNSKLSNERKSAAASLMHGSSVEKR--PTTSQVQSRNDFFNLMRKKTSGN 450

Query: 856  XXXXXXXXXXXXXSDKFSESDNKTFL---FTHQRGEDMVLNDINGIDSSENRNVETSEAD 686
                         S    +S  +T L       +  D    D + +D S     ET    
Sbjct: 451  LSSAAPDPSPVVSSSLLDKSTEQTALPAAPVSPQSSDAPSPDPSCLDWSTENGSETISNG 510

Query: 685  SCSR--HKYLNGGKNGSTYXXXXXXXXXXXXFLRSLGW-XXXXXXXGLTDEEIKAFYRDA 515
            + S    ++LN G+  S+             FLRSLGW        GLT+EEI AFY++ 
Sbjct: 511  NASEESQRFLNNGEKHSSPDAFVYPDEEEAAFLRSLGWDENAGEEEGLTEEEISAFYKE- 569

Query: 514  TNKFVNAKPSSKI---S*MMARIHLALESRIGCVSAIFSGLKSS 392
               ++  +PSSK+   S    ++ + LESR+G      SGL SS
Sbjct: 570  ---YMKLRPSSKLCRGSQQQVKLPMPLESRVGSFGGASSGLSSS 610


>KZV20788.1 hypothetical protein F511_30430 [Dorcoceras hygrometricum]
          Length = 601

 Score =  301 bits (770), Expect = 1e-87
 Identities = 234/627 (37%), Positives = 308/627 (49%), Gaps = 22/627 (3%)
 Frame = -1

Query: 2206 MDKSEPALVPQWLRSSESVSGNGISINPIASSSLRPDDHLKPKVARKNS-LSNNDQNIKR 2030
            MDKSEP LVP+WL++S + SG G        S+L  DD   PK++R NS +S+N  +  R
Sbjct: 1    MDKSEPTLVPEWLKNSGNQSGGG--------STLHSDDKSAPKLSRNNSFMSSNGHDFGR 52

Query: 2029 SSVSDRITSSYFXXXXXXXXXXXXXXXXXXXXXXRDRDWDGDICESGSKDRSILGDFRHL 1850
            SS S++ TSSYF                      RDRDW+ D  +S  KD+S+ GD  H 
Sbjct: 53   SSSSEKTTSSYFHRSSSSNGSGNLRSYNSFGRNRRDRDWEKDRYDSQDKDKSVSGDRWHR 112

Query: 1849 DYSDPLGNSILSRVDKKILRRSQSMVSDQRGEAWPRKV--EIGSKGAYKARSTNADRMPA 1676
             +SD  GNS   + +   LRRSQS  S   G+ W +KV  +  S G     +      P 
Sbjct: 113  VFSDSSGNSFSGKFEWDGLRRSQSATSGPHGDTWTKKVVTDSSSAGGNNTNTLLTKGAPG 172

Query: 1675 NGLLHKASFERDFPSLGVDDKSSATEVKRVPSPGLSPAIHSFPVVASAMIGSDKWTSALA 1496
             G+  K  FER+FPSLG ++++   EV RVPSPGLS AI S P+  +A +G +KWTSALA
Sbjct: 173  GGVT-KTRFERNFPSLGSEERAVIPEVGRVPSPGLSSAIQSLPIGTAAAVGGEKWTSALA 231

Query: 1495 EVPGVVRSSEGGSLSEKXXXXXXXXXXXSALTGPSMAETVAQGPPHAKTTPQLSTETQR- 1319
            EVP +V S+  G  S             S  T  +MAE VAQGP  +   PQ+S  TQR 
Sbjct: 232  EVPVIVGSNGIGVSS--VTQSASTQLASSTTTTLNMAEAVAQGPSRSPAMPQISVGTQRL 289

Query: 1318 QELAIKQSRQLIPVTPSLPKALVPNLPEKQKGKTGLSL-LVHSPRGGSVKSDPSKTSS-I 1145
            +ELAIKQSRQLIPVTPS+PK LV N  +KQK K G     ++S      KSD SK+SS +
Sbjct: 290  EELAIKQSRQLIPVTPSMPKTLVSNSSDKQKTKLGQQQHSINSLPINHSKSDMSKSSSNV 349

Query: 1144 GKLQVLKPVRERNGGPLTAKDXXXXXXXXXXXSAVPVGRQSVAASSSIRAPSINSGHPAA 965
            GKL VLK  RE+NG     KD           S+  +   SV  + + + P      P  
Sbjct: 350  GKLHVLKLTREKNGVAPVVKDNLSPTTAVNAVSSTLLTSPSVTGAVASKGP---PNMPVL 406

Query: 964  ERKHGLPLLEKXXXXXXXXXXXSDFFSLMRKKXXXXXXXXXXXXXXXXXSDKFSESDNKT 785
             RK  L +LEK            +FF+L+RKK                 ++ FS  D+  
Sbjct: 407  NRKPSLAVLEKRNTSQAQAQSRKEFFNLVRKK-------SMAISTSATDAENFSSVDSGH 459

Query: 784  FL----FTHQRGEDMVLNDINGIDSS------------ENRNVETSEADSCSRHKYLNGG 653
             +          ED+   + + ID +            E R+  T   D+CS  KYL  G
Sbjct: 460  AVSPPPSETSEKEDVPAPNTSQIDDAQSSASLSDDLFPEKRDDVTCPDDTCSMPKYLGNG 519

Query: 652  KNGSTYXXXXXXXXXXXXFLRSLGWXXXXXXXGLTDEEIKAFYRDATNKFVNAKPSSKIS 473
             N S              FLRSLGW       GLT+EEI +F++DAT    N+KP+    
Sbjct: 520  MNAS--MDPLFSEEEEAAFLRSLGWEENSDEGGLTEEEISSFFKDATK--YNSKPA---- 571

Query: 472  *MMARIHLALESRIGCVSAIFSGLKSS 392
                RI   ++ +    S I SGL SS
Sbjct: 572  ---LRILEVVQPKFIASSGISSGLSSS 595


>XP_012828376.1 PREDICTED: uncharacterized protein LOC105949617 isoform X1
            [Erythranthe guttata]
          Length = 575

 Score =  294 bits (752), Expect = 3e-85
 Identities = 241/628 (38%), Positives = 318/628 (50%), Gaps = 23/628 (3%)
 Frame = -1

Query: 2206 MDKSEPALVPQWLRSSESVSGNGISINPIASSSLRPDDHLKPKVARKNSLSN-NDQNIKR 2030
            MD+SEP+LVPQWL++S S +G G             D+H   +VAR  S  N N  +  R
Sbjct: 1    MDRSEPSLVPQWLKNSGSSTGGG-------------DNHPASRVARNKSFVNTNGNDFGR 47

Query: 2029 SSVSDRITSSYFXXXXXXXXXXXXXXXXXXXXXXRDRDWDGDICESGSKDRSILGDFRH- 1853
            +S S + TSSYF                      RDRDW+ D   S  K+R +LG  RH 
Sbjct: 48   ASGSAKTTSSYFRRSSSSNSSGSSKSYSSFGRNQRDRDWEKDTYNSRDKERLVLGGDRHR 107

Query: 1852 LDYSDPLGNSILSRVDKKILRRSQSMVSDQRGEAWPRKVEIGSKGAYKARSTNADRMPAN 1673
             + S+ LGN  LS+ ++  LRRS SM+S + GE WP+KV   S       + N      +
Sbjct: 108  YESSELLGNPSLSKYERDGLRRSHSMISGKHGETWPKKVVTESSSGSGKNNGNGFLAKGS 167

Query: 1672 --GLLHKASFERDFPSLGVDDKSSATEVKRVPSPGLSPAIHSFPVVASAMIGSDKWTSAL 1499
              G+ +KA+FERDFPSLG DD++   EV RV SPGLS A+ S P+ +SA IG ++WTSAL
Sbjct: 168  PVGVANKATFERDFPSLGTDDRAVVPEVGRVASPGLSSALQSLPIGSSASIGGERWTSAL 227

Query: 1498 AEVPGVVRSSEGGSLS--EKXXXXXXXXXXXSALTGPSMAETVAQGPPHAKTTPQLSTET 1325
            AEVP +V S+   SLS  +            S+ T  +MAE VAQGP  A+T PQLS  T
Sbjct: 228  AEVPMLVVSNGTASLSVQQAAPSSTTASVVVSSTTSLNMAEAVAQGPTRAQTAPQLSLGT 287

Query: 1324 QR-QELAIKQSRQLIPVTPSLPKALVPNLPEKQKGKTGL--------SLLVH-SPRGG-S 1178
            QR +ELAIKQSRQLIPVTP++PK LV +  +KQK K GL        SL ++ SPRG   
Sbjct: 288  QRLEELAIKQSRQLIPVTPTMPKTLVLSSSDKQKSKVGLIQQHPTPSSLPINQSPRGAPP 347

Query: 1177 VKSDPSKTSSIGKLQVLKPVRERNGGPLTAKDXXXXXXXXXXXSAVPVGRQSVAASSSIR 998
             K D SK S++GKL VLKPVRE+NG   + KD              P G    A +S++ 
Sbjct: 348  SKPDFSKASNVGKLHVLKPVREKNGVTPSVKDKLS-----------PTG-SGKAVNSTLP 395

Query: 997  APSINSGHPAAERKHGLPLLEKXXXXXXXXXXXSDFFSLMRKKXXXXXXXXXXXXXXXXX 818
            A       P+A +      LEK           +DFF  MR+K                 
Sbjct: 396  A------SPSAVKPLLTTALEK--RPTTQAQSRNDFFKRMREK---------------SV 432

Query: 817  SDKFSESDNKTFLFTHQRGEDMVLNDINGIDSSENRNVETSEADSCSRHKYLNGG----K 650
            S+  S S+  T +   +  +  V      + ++    VE    +   R    NGG     
Sbjct: 433  SNSSSASETGTAISPEKHAKVAV------VPAAITGAVEPLPEEKAVR-TTCNGGVQHIS 485

Query: 649  NGSTY-XXXXXXXXXXXXFLRSLGWXXXXXXXGLTDEEIKAFYRDATNKFVNAKPSSKI- 476
            NG  Y             FLRS+GW       GLT+EEI AFYRD T K++N+KPS +I 
Sbjct: 486  NGKKYNSEPIISEEEEAKFLRSMGWDENDDEGGLTEEEISAFYRDFT-KYINSKPSLRIL 544

Query: 475  S*MMARIHLALESRIGCVSAIFSGLKSS 392
              +  +  L  +S+IG +S    GL SS
Sbjct: 545  QGVRLKFLLPFDSQIGGIS---PGLSSS 569


>GAV67368.1 hypothetical protein CFOL_v3_10874 [Cephalotus follicularis]
          Length = 625

 Score =  288 bits (736), Expect = 2e-82
 Identities = 211/594 (35%), Positives = 298/594 (50%), Gaps = 16/594 (2%)
 Frame = -1

Query: 2212 LIMDKSEPALVPQWLRSSESVSGNGISINPIASSSLRPDDHLKPKVARKNSLSNNDQNIK 2033
            ++M++ EP LVP+WL+SS  V  +  + +  +S S + DD+   K+AR NS  ++D +I 
Sbjct: 1    MVMERIEPVLVPEWLKSSGGVISSASTNHQNSSLSSQSDDNCVSKLARNNSPVSSDHDIG 60

Query: 2032 RSSVSDRITSSYFXXXXXXXXXXXXXXXXXXXXXXRDRDWDGDICESGSKDRSILGDFRH 1853
             +S  DR TSSYF                       D+ W+ DI +   KD+ + G+  H
Sbjct: 61   CASALDRTTSSYFRRSSSSKVSALSRTHSSFGRGHHDKGWEKDIKDYHDKDKPVFGEHSH 120

Query: 1852 LDYSDPLGNSILSRVDKKILRRSQSMVSDQRGEAWPRKVEIGSKGAYKA-RSTNADRMP- 1679
             D+ DPL   +LSR +K +L RSQSM S +RG+ W RKV      A K+ RS    R+  
Sbjct: 121  DDHYDPLSTILLSRFEKDMLHRSQSMTSGKRGDTWSRKVAGDLTHAKKSNRSDGITRLAG 180

Query: 1678 --ANGLLHKASFERDFPSLGVDDKSSATEVKRVPSPGLSPAIHSFPVVASAMIGSDKWTS 1505
              A   +H ++FERDFPSLG ++     E+ RV SPGLS +I SFPV  S++IGSD WTS
Sbjct: 181  VSAVSSVHNSAFERDFPSLGAEESQGGPEISRVSSPGLSTSIQSFPVGTSSVIGSDGWTS 240

Query: 1504 ALAEVPGVVRSSEGGSLS-EKXXXXXXXXXXXSALTGPSMAETVAQGPPHAKTTPQLSTE 1328
            ALAEVP V+ +S  G  S ++           S ++G +MAET+ QGP  A+T P  +  
Sbjct: 241  ALAEVPVVMGTSTTGVASAQQSVSASSAPLSPSVMSGLNMAETLVQGPSRARTPPLSTVG 300

Query: 1327 TQR-QELAIKQSRQLIPVTPSLPKALVPNLPEKQKGKTG----LSLLVHSPRGGSVKSDP 1163
            TQR +ELAI+QSRQLIP+TPS+PK LV +  EK K K G    L   V+  RGG  + D 
Sbjct: 301  TQRLEELAIRQSRQLIPMTPSMPKPLVVSPSEKSKPKIGPQQHLLQTVNHTRGGPARPDS 360

Query: 1162 SKTSSIGKLQVLKPVRERNGGPLTAKDXXXXXXXXXXXSAVPVGRQSVAASSSIRAPSIN 983
             KTS+ G+LQ+LK  R+ NG     KD           ++  V   S   S+ +R+ S N
Sbjct: 361  PKTSNDGRLQILKSSRDLNGASSAPKDSSSPTSGNKAVNSPRVVTSSATGSTPLRSSS-N 419

Query: 982  SGHPAAERKHGLPLLEKXXXXXXXXXXXSDFFSLMRKKXXXXXXXXXXXXXXXXXSDKFS 803
            S + + +R      +             +DFFSL++KK                      
Sbjct: 420  SPNFSIDRNPAPFRVSAEKRPISQAQSRNDFFSLLKKKSSTSFPSTVLDPGSVVSPSASE 479

Query: 802  ESD----NKTFLFTHQRGEDMVLNDINGID-SSENRNVETSEADSCSRHKYLNGGKNGST 638
            +SD      T         D   ++I+  D +++N+      A   S+    NG K+ S 
Sbjct: 480  KSDKLVREVTIASCSLHCGDSTSSEISAADFATDNKGELNGIAYDVSQECLSNGEKHSS- 538

Query: 637  YXXXXXXXXXXXXFLRSLGW-XXXXXXXGLTDEEIKAFYRDATNKFVNAKPSSK 479
                         FLRSLGW        GLT+EEI AF ++    +   KPSSK
Sbjct: 539  -PGVILYPDEEEAFLRSLGWEENGGEDEGLTEEEISAFLKE----YTKLKPSSK 587


>XP_017258903.1 PREDICTED: uncharacterized protein LOC108227982 [Daucus carota subsp.
            sativus]
          Length = 591

 Score =  282 bits (722), Expect = 8e-81
 Identities = 226/602 (37%), Positives = 297/602 (49%), Gaps = 24/602 (3%)
 Frame = -1

Query: 2206 MDKSEPALVPQWLRSSESVSGNGISIN--PIASSSLRPDDHLKPKVAR-KNSLSN-NDQN 2039
            M+K+EP  VP+WL+SS SV+ + +S N   IASSSL  DD    K  R K+S+ + +  N
Sbjct: 1    MEKNEPTFVPEWLKSSGSVT-SAVSTNHHQIASSSLLSDDRATLKSTRNKSSIDDISAHN 59

Query: 2038 IKRSSVSDRITSSYFXXXXXXXXXXXXXXXXXXXXXXRDRDWDGDICESGSKDRSILGDF 1859
               S VSDR TSSYF                       D+ WD D  E    D+  +GD 
Sbjct: 60   SGSSPVSDRTTSSYFRRSSTSNGSQLRSYGSFGRTNR-DKGWDKDTNEYHDSDKLRIGDH 118

Query: 1858 RHLDYSDPLGNSILSRVDKKILRRSQSMVSDQRGEAWPRKVEIGSKGAYKARSTNADRMP 1679
            RH ++SDPLG++  +R +K  L+R+QS +S +  E W RKV        K+   N   + 
Sbjct: 119  RHRNFSDPLGSNFSNRFEKDGLKRTQSSISGKYNEPWSRKVSADMNSFDKSNYNNGSSLL 178

Query: 1678 ANG----LLHKASFERDFPSLGVDDKSSATEVKRVPSPGLSPAIHSFPVVASAMIGSDKW 1511
            A       + KA+F+RDFPSLG D++ +  E++RVPSPGLS  + + P+  SA+ G   W
Sbjct: 179  AGSSAISTVRKAAFDRDFPSLGADERQTDYELRRVPSPGLSTNMQNLPIGYSAVTGEIGW 238

Query: 1510 TSALAEVPGVVRSSEGGSLSEKXXXXXXXXXXXSALT-GPSMAETVAQGPPHAKTTPQLS 1334
            TSALAEV   V ++     S             S++T G +MAET+AQGPPH   T Q S
Sbjct: 239  TSALAEVQVKVGANGINKSSVAQAALPSSASVASSMTSGLNMAETLAQGPPHVHAT-QFS 297

Query: 1333 TETQR-QELAIKQSRQLIPVTPSLPKALVPNLPEKQKGKTGLSL--------LVHSPRGG 1181
              TQR +E+AIKQS+QLIPVTPS+PKALV N  EK K K               HSPRG 
Sbjct: 298  VGTQRLEEIAIKQSKQLIPVTPSMPKALVLNSSEKSKTKAAQQQHQTSSTHHFNHSPRGT 357

Query: 1180 SVKSDPSKTSSIGKLQVLKPVRERNGGPLTAKDXXXXXXXXXXXSAVPVGRQSVAASSSI 1001
             +KSD SKTSS+GKLQVLKP RERN      KD           +       SV    S+
Sbjct: 358  PMKSDMSKTSSLGKLQVLKPARERNDISYQTKDTLSPTNASKVPNNPLTAASSVGVPPSL 417

Query: 1000 RAPSINSGHPAAERKHGLPLLEKXXXXXXXXXXXSDFFSLMRKKXXXXXXXXXXXXXXXX 821
            R+P  N   P         +LEK            DFF+L+RKK                
Sbjct: 418  RSPIKN---PIVASGVVPTVLEKKPSAQLRSRN--DFFNLVRKKSLTNHSSPVVDSVSTV 472

Query: 820  XSDKFSESDNKTFLFTHQRGEDMVLNDINGIDSSENR-NVETSEADSCS-RHKYLNGGKN 647
                  E  ++        GED +L   N  D+ + + N   S  D+C    K  + G+N
Sbjct: 473  S-QSILEQPSEHKAGAPPPGEDSLL--ANQSDTVQYKMNGLISNRDACDGTPKSPDNGEN 529

Query: 646  GSTYXXXXXXXXXXXXF---LRSLGWXXXXXXXG-LTDEEIKAFYRDATNKFVNAKPSSK 479
            G T                 LRSLGW         LT+EEI+ FYRDA +K++  +PSSK
Sbjct: 530  GETRSSSDVILCSEEEEAAFLRSLGWDENAGEDEGLTEEEIREFYRDA-SKYIKPRPSSK 588

Query: 478  IS 473
             S
Sbjct: 589  TS 590


>XP_012828377.1 PREDICTED: uncharacterized protein LOC105949617 isoform X2
            [Erythranthe guttata]
          Length = 550

 Score =  280 bits (717), Expect = 1e-80
 Identities = 226/587 (38%), Positives = 294/587 (50%), Gaps = 22/587 (3%)
 Frame = -1

Query: 2206 MDKSEPALVPQWLRSSESVSGNGISINPIASSSLRPDDHLKPKVARKNSLSN-NDQNIKR 2030
            MD+SEP+LVPQWL++S S +G G             D+H   +VAR  S  N N  +  R
Sbjct: 1    MDRSEPSLVPQWLKNSGSSTGGG-------------DNHPASRVARNKSFVNTNGNDFGR 47

Query: 2029 SSVSDRITSSYFXXXXXXXXXXXXXXXXXXXXXXRDRDWDGDICESGSKDRSILGDFRH- 1853
            +S S + TSSYF                      RDRDW+ D   S  K+R +LG  RH 
Sbjct: 48   ASGSAKTTSSYFRRSSSSNSSGSSKSYSSFGRNQRDRDWEKDTYNSRDKERLVLGGDRHR 107

Query: 1852 LDYSDPLGNSILSRVDKKILRRSQSMVSDQRGEAWPRKVEIGSKGAYKARSTNADRMPAN 1673
             + S+ LGN  LS+ ++  LRRS SM+S + GE WP+KV   S       + N      +
Sbjct: 108  YESSELLGNPSLSKYERDGLRRSHSMISGKHGETWPKKVVTESSSGSGKNNGNGFLAKGS 167

Query: 1672 --GLLHKASFERDFPSLGVDDKSSATEVKRVPSPGLSPAIHSFPVVASAMIGSDKWTSAL 1499
              G+ +KA+FERDFPSLG DD++   EV RV SPGLS A+ S P+ +SA IG ++WTSAL
Sbjct: 168  PVGVANKATFERDFPSLGTDDRAVVPEVGRVASPGLSSALQSLPIGSSASIGGERWTSAL 227

Query: 1498 AEVPGVVRSSEGGSLS--EKXXXXXXXXXXXSALTGPSMAETVAQGPPHAKTTPQLSTET 1325
            AEVP +V S+   SLS  +            S+ T  +MAE VAQGP  A+T PQLS  T
Sbjct: 228  AEVPMLVVSNGTASLSVQQAAPSSTTASVVVSSTTSLNMAEAVAQGPTRAQTAPQLSLGT 287

Query: 1324 QR-QELAIKQSRQLIPVTPSLPKALVPNLPEKQKGKTGL--------SLLVH-SPRGG-S 1178
            QR +ELAIKQSRQLIPVTP++PK LV +  +KQK K GL        SL ++ SPRG   
Sbjct: 288  QRLEELAIKQSRQLIPVTPTMPKTLVLSSSDKQKSKVGLIQQHPTPSSLPINQSPRGAPP 347

Query: 1177 VKSDPSKTSSIGKLQVLKPVRERNGGPLTAKDXXXXXXXXXXXSAVPVGRQSVAASSSIR 998
             K D SK S++GKL VLKPVRE+NG   + KD              P G    A +S++ 
Sbjct: 348  SKPDFSKASNVGKLHVLKPVREKNGVTPSVKDKLS-----------PTG-SGKAVNSTLP 395

Query: 997  APSINSGHPAAERKHGLPLLEKXXXXXXXXXXXSDFFSLMRKKXXXXXXXXXXXXXXXXX 818
            A       P+A +      LEK           +DFF  MR+K                 
Sbjct: 396  A------SPSAVKPLLTTALEK--RPTTQAQSRNDFFKRMREK---------------SV 432

Query: 817  SDKFSESDNKTFLFTHQRGEDMVLNDINGIDSSENRNVETSEADSCSRHKYLNGG----K 650
            S+  S S+  T +   +  +  V      + ++    VE    +   R    NGG     
Sbjct: 433  SNSSSASETGTAISPEKHAKVAV------VPAAITGAVEPLPEEKAVR-TTCNGGVQHIS 485

Query: 649  NGSTY-XXXXXXXXXXXXFLRSLGWXXXXXXXGLTDEEIKAFYRDAT 512
            NG  Y             FLRS+GW       GLT+EEI AFYRD T
Sbjct: 486  NGKKYNSEPIISEEEEAKFLRSMGWDENDDEGGLTEEEISAFYRDFT 532


>EOX97398.1 Uncharacterized protein TCM_006425 isoform 1 [Theobroma cacao]
          Length = 625

 Score =  276 bits (705), Expect = 5e-78
 Identities = 221/635 (34%), Positives = 313/635 (49%), Gaps = 30/635 (4%)
 Frame = -1

Query: 2209 IMDKSEPALVPQWLRSSESVSGNGISINPIASSSLRPDDHLKPKVARKNSLSNNDQNIKR 2030
            +M++SEP+LVP+WL+S  SV+G+G S +   SSSL  D+H   +  R       D ++  
Sbjct: 5    VMERSEPSLVPEWLKSGGSVTGSGNSNHQFTSSSLHSDNHSALRPTRNKLSVAGDHDVGG 64

Query: 2029 SSVSDRITSSYFXXXXXXXXXXXXXXXXXXXXXXRDRDWDGDICESGSKDRSILGDFRHL 1850
            +SV DR TS+YF                      RDRDWD DI     +++S++ D R+ 
Sbjct: 65   TSVLDRTTSAYFRRSSSSNGSAHLRSYSSFTKGHRDRDWDKDINGYHDREKSVISDHRNR 124

Query: 1849 DYSDPLGNSILSRVDKKILRRSQSMVSDQRGEAWPRKVEIGSKGAYKARSTNADRMPANG 1670
            ++SD L N + S  +K +L RSQS ++ +R + WP+KV   S  + K+  +++     NG
Sbjct: 125  NFSDSLDNMLPSVFEKDVLWRSQS-ITGKRSDTWPKKVTSDSSTSNKSNHSSS-----NG 178

Query: 1669 LL--------HKASFERDFPSLGVDDKSSATEVKRVPSPGLSPAIHSFPVVASAMIGSDK 1514
            LL        +K+ FER+FP LG +++  A+E+ RV SPGLS A  S PV  SA+ GSD 
Sbjct: 179  LLSGVSTTVGNKSVFEREFPVLGAEERQVASEIGRVSSPGLSTAGQSLPVGTSAISGSDG 238

Query: 1513 WTSALAEVP-GVVRSSEGGSLSEKXXXXXXXXXXXSALTGPSMAETVAQGPPHAKTTPQL 1337
            WTSALA++P GV  S  G +++ +           + +TG +MAET+ QGP  A+T P L
Sbjct: 239  WTSALADMPAGVGSSGTGVAVASQNVSASSASMASTTMTGLNMAETLVQGPSRARTPPLL 298

Query: 1336 STETQR-QELAIKQSRQLIP-VTPSLPKALVPNLPEKQKGKTG----LSLLVHSPRGGSV 1175
            +  TQR +ELAIKQSRQL+P VT S PK LV +  EK K K G     SL ++  RGG+ 
Sbjct: 299  NVGTQRLEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPKVGQQQHASLSLNYTRGGTS 358

Query: 1174 KSDPSKTSSIGKLQVLKPVRERNGGPLTAKDXXXXXXXXXXXSAVPVGRQSVAASSSIRA 995
            +SD  K S+ G+L++LKP RE NG  L  KD              P+   SV  S+S  A
Sbjct: 359  RSDSLKVSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSSKLVNSPL---SVTPSASASA 415

Query: 994  PSINSGH----PAAERKHGLPLLEKXXXXXXXXXXXSDFFSLMRKKXXXXXXXXXXXXXX 827
            P  +SG+      AER      +             +DFF+L++KK              
Sbjct: 416  PFRSSGNSPSFATAERNQTPFRINIEKRPTAQAQSRNDFFNLLKKKSTTNSPSSVADRGP 475

Query: 826  XXXSDKFSESDNKTFLFTHQRGEDMVLNDINGIDSSE---------NRNVETSEADSCS- 677
                    +SD    L T      + L     + SSE         NR+  T   D+ S 
Sbjct: 476  AASPSVSEKSDE---LGTEDASTSVTLQG-GSVPSSEISIADLPTDNRSEITHNGDAYSG 531

Query: 676  RHKYLNGGKNGSTYXXXXXXXXXXXXFLRSLGW-XXXXXXXGLTDEEIKAFYRDATNKFV 500
              +  + G   +              FLRSLGW        GLT+EEI AF+ +     +
Sbjct: 532  SQQCSSNGDRHARPDAFLYPDEEEAAFLRSLGWEENAGDDEGLTEEEISAFFEE----HM 587

Query: 499  NAKPSSKIS*MMARIHLALESRIGCVSAIFSGLKS 395
              KPS+K+   M  I + L S  G      SGL S
Sbjct: 588  KLKPSAKLFHRMQSI-VPLNSHNGTHDGASSGLSS 621


>EOX97399.1 Uncharacterized protein TCM_006425 isoform 2 [Theobroma cacao]
          Length = 620

 Score =  275 bits (702), Expect = 1e-77
 Identities = 221/634 (34%), Positives = 312/634 (49%), Gaps = 30/634 (4%)
 Frame = -1

Query: 2206 MDKSEPALVPQWLRSSESVSGNGISINPIASSSLRPDDHLKPKVARKNSLSNNDQNIKRS 2027
            M++SEP+LVP+WL+S  SV+G+G S +   SSSL  D+H   +  R       D ++  +
Sbjct: 1    MERSEPSLVPEWLKSGGSVTGSGNSNHQFTSSSLHSDNHSALRPTRNKLSVAGDHDVGGT 60

Query: 2026 SVSDRITSSYFXXXXXXXXXXXXXXXXXXXXXXRDRDWDGDICESGSKDRSILGDFRHLD 1847
            SV DR TS+YF                      RDRDWD DI     +++S++ D R+ +
Sbjct: 61   SVLDRTTSAYFRRSSSSNGSAHLRSYSSFTKGHRDRDWDKDINGYHDREKSVISDHRNRN 120

Query: 1846 YSDPLGNSILSRVDKKILRRSQSMVSDQRGEAWPRKVEIGSKGAYKARSTNADRMPANGL 1667
            +SD L N + S  +K +L RSQS ++ +R + WP+KV   S  + K+  +++     NGL
Sbjct: 121  FSDSLDNMLPSVFEKDVLWRSQS-ITGKRSDTWPKKVTSDSSTSNKSNHSSS-----NGL 174

Query: 1666 L--------HKASFERDFPSLGVDDKSSATEVKRVPSPGLSPAIHSFPVVASAMIGSDKW 1511
            L        +K+ FER+FP LG +++  A+E+ RV SPGLS A  S PV  SA+ GSD W
Sbjct: 175  LSGVSTTVGNKSVFEREFPVLGAEERQVASEIGRVSSPGLSTAGQSLPVGTSAISGSDGW 234

Query: 1510 TSALAEVP-GVVRSSEGGSLSEKXXXXXXXXXXXSALTGPSMAETVAQGPPHAKTTPQLS 1334
            TSALA++P GV  S  G +++ +           + +TG +MAET+ QGP  A+T P L+
Sbjct: 235  TSALADMPAGVGSSGTGVAVASQNVSASSASMASTTMTGLNMAETLVQGPSRARTPPLLN 294

Query: 1333 TETQR-QELAIKQSRQLIP-VTPSLPKALVPNLPEKQKGKTG----LSLLVHSPRGGSVK 1172
              TQR +ELAIKQSRQL+P VT S PK LV +  EK K K G     SL ++  RGG+ +
Sbjct: 295  VGTQRLEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPKVGQQQHASLSLNYTRGGTSR 354

Query: 1171 SDPSKTSSIGKLQVLKPVRERNGGPLTAKDXXXXXXXXXXXSAVPVGRQSVAASSSIRAP 992
            SD  K S+ G+L++LKP RE NG  L  KD              P+   SV  S+S  AP
Sbjct: 355  SDSLKVSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSSKLVNSPL---SVTPSASASAP 411

Query: 991  SINSGH----PAAERKHGLPLLEKXXXXXXXXXXXSDFFSLMRKKXXXXXXXXXXXXXXX 824
              +SG+      AER      +             +DFF+L++KK               
Sbjct: 412  FRSSGNSPSFATAERNQTPFRINIEKRPTAQAQSRNDFFNLLKKKSTTNSPSSVADRGPA 471

Query: 823  XXSDKFSESDNKTFLFTHQRGEDMVLNDINGIDSSE---------NRNVETSEADSCS-R 674
                   +SD    L T      + L     + SSE         NR+  T   D+ S  
Sbjct: 472  ASPSVSEKSDE---LGTEDASTSVTLQG-GSVPSSEISIADLPTDNRSEITHNGDAYSGS 527

Query: 673  HKYLNGGKNGSTYXXXXXXXXXXXXFLRSLGW-XXXXXXXGLTDEEIKAFYRDATNKFVN 497
             +  + G   +              FLRSLGW        GLT+EEI AF+ +     + 
Sbjct: 528  QQCSSNGDRHARPDAFLYPDEEEAAFLRSLGWEENAGDDEGLTEEEISAFFEE----HMK 583

Query: 496  AKPSSKIS*MMARIHLALESRIGCVSAIFSGLKS 395
             KPS+K+   M  I + L S  G      SGL S
Sbjct: 584  LKPSAKLFHRMQSI-VPLNSHNGTHDGASSGLSS 616


>KZN09159.1 hypothetical protein DCAR_001815 [Daucus carota subsp. sativus]
          Length = 593

 Score =  273 bits (699), Expect = 2e-77
 Identities = 220/589 (37%), Positives = 288/589 (48%), Gaps = 24/589 (4%)
 Frame = -1

Query: 2206 MDKSEPALVPQWLRSSESVSGNGISIN--PIASSSLRPDDHLKPKVAR-KNSLSN-NDQN 2039
            M+K+EP  VP+WL+SS SV+ + +S N   IASSSL  DD    K  R K+S+ + +  N
Sbjct: 1    MEKNEPTFVPEWLKSSGSVT-SAVSTNHHQIASSSLLSDDRATLKSTRNKSSIDDISAHN 59

Query: 2038 IKRSSVSDRITSSYFXXXXXXXXXXXXXXXXXXXXXXRDRDWDGDICESGSKDRSILGDF 1859
               S VSDR TSSYF                       D+ WD D  E    D+  +GD 
Sbjct: 60   SGSSPVSDRTTSSYFRRSSTSNGSQLRSYGSFGRTNR-DKGWDKDTNEYHDSDKLRIGDH 118

Query: 1858 RHLDYSDPLGNSILSRVDKKILRRSQSMVSDQRGEAWPRKVEIGSKGAYKARSTNADRMP 1679
            RH ++SDPLG++  +R +K  L+R+QS +S +  E W RKV        K+   N   + 
Sbjct: 119  RHRNFSDPLGSNFSNRFEKDGLKRTQSSISGKYNEPWSRKVSADMNSFDKSNYNNGSSLL 178

Query: 1678 ANG----LLHKASFERDFPSLGVDDKSSATEVKRVPSPGLSPAIHSFPVVASAMIGSDKW 1511
            A       + KA+F+RDFPSLG D++ +  E++RVPSPGLS  + + P+  SA+ G   W
Sbjct: 179  AGSSAISTVRKAAFDRDFPSLGADERQTDYELRRVPSPGLSTNMQNLPIGYSAVTGEIGW 238

Query: 1510 TSALAEVPGVVRSSEGGSLSEKXXXXXXXXXXXSALT-GPSMAETVAQGPPHAKTTPQLS 1334
            TSALAEV   V ++     S             S++T G +MAET+AQGPPH   T Q S
Sbjct: 239  TSALAEVQVKVGANGINKSSVAQAALPSSASVASSMTSGLNMAETLAQGPPHVHAT-QFS 297

Query: 1333 TETQR-QELAIKQSRQLIPVTPSLPKALVPNLPEKQKGKTGLSL--------LVHSPRGG 1181
              TQR +E+AIKQS+QLIPVTPS+PKALV N  EK K K               HSPRG 
Sbjct: 298  VGTQRLEEIAIKQSKQLIPVTPSMPKALVLNSSEKSKTKAAQQQHQTSSTHHFNHSPRGT 357

Query: 1180 SVKSDPSKTSSIGKLQVLKPVRERNGGPLTAKDXXXXXXXXXXXSAVPVGRQSVAASSSI 1001
             +KSD SKTSS+GKLQVLKP RERN      KD           +       SV    S+
Sbjct: 358  PMKSDMSKTSSLGKLQVLKPARERNDISYQTKDTLSPTNASKVPNNPLTAASSVGVPPSL 417

Query: 1000 RAPSINSGHPAAERKHGLPLLEKXXXXXXXXXXXSDFFSLMRKKXXXXXXXXXXXXXXXX 821
            R+P  N   P         +LEK            DFF+L+RKK                
Sbjct: 418  RSPIKN---PIVASGVVPTVLEKKPSAQLRSRN--DFFNLVRKKSLTNHSSPVVDSVSTV 472

Query: 820  XSDKFSESDNKTFLFTHQRGEDMVLNDINGIDSSENR-NVETSEADSCS-RHKYLNGGKN 647
                  E  ++        GED +L   N  D+ + + N   S  D+C    K  + G+N
Sbjct: 473  S-QSILEQPSEHKAGAPPPGEDSLL--ANQSDTVQYKMNGLISNRDACDGTPKSPDNGEN 529

Query: 646  GSTYXXXXXXXXXXXXF---LRSLGWXXXXXXXG-LTDEEIKAFYRDAT 512
            G T                 LRSLGW         LT+EEI+ FYRDA+
Sbjct: 530  GETRSSSDVILCSEEEEAAFLRSLGWDENAGEDEGLTEEEIREFYRDAS 578

