BLASTX nr result

ID: Lithospermum23_contig00006824

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum23_contig00006824
         (2776 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

CDO97516.1 unnamed protein product [Coffea canephora]                 384   e-119
XP_011087795.1 PREDICTED: uncharacterized protein LOC105169167 [...   385   e-119
XP_011092382.1 PREDICTED: uncharacterized protein LOC105172576 [...   370   e-113
XP_002265987.2 PREDICTED: uncharacterized protein LOC100241871 [...   362   e-110
XP_017259125.1 PREDICTED: uncharacterized protein LOC108228142 [...   354   e-107
KZM90816.1 hypothetical protein DCAR_021819 [Daucus carota subsp...   346   e-104
KVI01779.1 hypothetical protein Ccrd_019964, partial [Cynara car...   332   e-100
KVH93465.1 hypothetical protein Ccrd_004483 [Cynara cardunculus ...   332   1e-98
KZV20788.1 hypothetical protein F511_30430 [Dorcoceras hygrometr...   329   5e-98
XP_012828376.1 PREDICTED: uncharacterized protein LOC105949617 i...   327   2e-97
XP_017258903.1 PREDICTED: uncharacterized protein LOC108227982 [...   322   1e-95
XP_010245092.1 PREDICTED: uncharacterized protein LOC104588732 i...   317   7e-93
KZN09159.1 hypothetical protein DCAR_001815 [Daucus carota subsp...   314   2e-92
OMO92072.1 hypothetical protein COLO4_17899 [Corchorus olitorius]     313   9e-92
EOX97398.1 Uncharacterized protein TCM_006425 isoform 1 [Theobro...   312   2e-91
EOX97399.1 Uncharacterized protein TCM_006425 isoform 2 [Theobro...   312   2e-91
KVI10980.1 hypothetical protein Ccrd_010614 [Cynara cardunculus ...   311   3e-91
XP_007041568.2 PREDICTED: mucin-17 [Theobroma cacao]                  309   2e-90
XP_010277689.1 PREDICTED: uncharacterized protein YMR317W-like [...   304   5e-88
XP_012828377.1 PREDICTED: uncharacterized protein LOC105949617 i...   299   2e-87

>CDO97516.1 unnamed protein product [Coffea canephora]
          Length = 599

 Score =  384 bits (986), Expect = e-119
 Identities = 254/609 (41%), Positives = 340/609 (55%), Gaps = 20/609 (3%)
 Frame = +2

Query: 578  MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSS-STSDQNLGR 754
            M++SEP+ VP+WL+S+GS TG+GT+ + LSPS    DD+  S + R  SS + +D  +GR
Sbjct: 1    MERSEPSLVPEWLKSSGSATGSGTTSHPLSPS----DDHAVSKLARNKSSVNHNDHEIGR 56

Query: 755  SSASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCX 934
            SS S+R  +S F RSSS+NG   +++ SSF            +YE  ++D  ++G  +  
Sbjct: 57   SSVSDRTSASYFRRSSSSNGSGQMQSYSSFGRNHRGRDWDKDLYEPRDRDNLVVGGHKHR 116

Query: 935  XXXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNPAPDNG 1114
                          E++ LRRS SM++ +R E WP++      +A ++K+ + N   D G
Sbjct: 117  DYLDPPVNNFPGNFEKDGLRRSQSMVSRKRNEIWPKRSIADSNSASRNKSTDGNSLLDKG 176

Query: 1115 H----VHKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSA 1282
                 VHK  FE++FPSLG EE+ A  E+ RVPSPG + AIH   +SASA+     WTSA
Sbjct: 177  DSVGTVHKVVFERDFPSLGSEERQATSEVGRVPSPGLNTAIHGLPISASAIIAGDKWTSA 236

Query: 1283 LAEVPGIVKSNETGVSSAKADALGLPTTA--SGISTGLSMAETVAQGXXXXXXXXXXSTE 1456
            LAEVP IV    TG+S  +  +L     +  S  S GL+MAETVAQG          S  
Sbjct: 237  LAEVPAIVGGGGTGLSPGRQASLPSSPASLPSSTSAGLNMAETVAQGPRVQAAPKITSGT 296

Query: 1457 SQRQELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIGYPH-------LILSQRGVAVKP 1615
             + +ELAI+QSRQLIP+TPS+PK  + N  +K KAK G P        L  S RG  VK 
Sbjct: 297  QRLEELAIRQSRQLIPMTPSMPKPSILNSSDKGKAKAGQPQHPVSSPLLSPSLRGGPVKT 356

Query: 1616 DISKTSNIGKLQVLKPIRERNDSSLQPIDNLNPTCEK--AVSSVPDATHSVSASSYVRTQ 1789
            D SKTSN GKL VLKP RERN  S    D L+PT     A S +  AT SV+  +  R  
Sbjct: 357  DASKTSNAGKLLVLKPPRERNGVSTASKDTLSPTSSTRAATSGIAVAT-SVTGLATSRGP 415

Query: 1790 VNNYGHPVAERKHVPPLLEKRPISQSQTQSRNDFFSLMRKKSVPNSSTESFPAISASHKK 1969
              N   P AERKH  P+LEK+P   SQ QSRNDFF+LMRKKS+P+SS+ +    + S   
Sbjct: 416  AINPVSPGAERKHALPMLEKKP--SSQAQSRNDFFNLMRKKSMPSSSSVADAGSAVSAST 473

Query: 1970 LGEDEGEGEVATSPV--QSEEVPVLANIHVVNSSKDRIMKTSNGYSCNGCESLHSKRNGS 2143
            L E  GE EV  +PV  + E+VP L          DR+         NGC+   +   G 
Sbjct: 474  LDEP-GELEVIPAPVIHEDEDVPSL----------DRL---------NGCQHTENDLFGI 513

Query: 2144 LC-NXXXXXXXXXXXXXXXXGWEENTEEGGLTDEEISAFYKDLNKYINSKPPCKIQQGM- 2317
               +                GW+EN +E GLT+EEI+AF++DL+KY+NSKP  K  QG+ 
Sbjct: 514  QSRSLPLFSEEEEAAFLHQLGWQENADEDGLTEEEINAFFRDLSKYMNSKPSSKSLQGVQ 573

Query: 2318 PRFLLALES 2344
            P+F L L S
Sbjct: 574  PKFPLLLSS 582


>XP_011087795.1 PREDICTED: uncharacterized protein LOC105169167 [Sesamum indicum]
          Length = 624

 Score =  385 bits (988), Expect = e-119
 Identities = 256/624 (41%), Positives = 347/624 (55%), Gaps = 27/624 (4%)
 Frame = +2

Query: 578  MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSSSTSD-QNLGR 754
            M++SEPT VP+WL++ G++TG G        S  HSDD+  S V R  S   S+    GR
Sbjct: 1    MERSEPTLVPEWLKNTGNLTGAG--------SISHSDDHAASRVARNKSFVNSNGHEFGR 52

Query: 755  SSASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCX 934
            SS+SER  SS F RSSS+N   N R+ SSF            +Y+  ++D+++L D    
Sbjct: 53   SSSSERTTSSYFRRSSSSNSSGNFRSYSSFGRSQRDRDWEKDVYDSRDQDKSVLADHWHW 112

Query: 935  XXXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNV--NPAPD 1108
                          ER+ LRRS SM++ +RG+TWP+KV T   +A     N +    +P 
Sbjct: 113  DFSDPLGNSLLSKYERDGLRRSQSMVSGKRGDTWPKKVVTDLSSASGKNANGLLYRGSPV 172

Query: 1109 NGHVHKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSALA 1288
             G   KA+FE++FPSLG +E+    E+ RVPSPG S AI S  V  S +     WTSALA
Sbjct: 173  GGRAKKATFEKDFPSLGADERAVVPEVGRVPSPGLSTAIQSLPVGTSGLIVGEKWTSALA 232

Query: 1289 EVPGIVKSNETGVSSA-KADALGLPTTASGISTGLSMAETVAQGXXXXXXXXXXSTESQR 1465
            EVP +V SN T +SS  +A      + A G +T L+MAE VAQG          S  +QR
Sbjct: 233  EVPVLVGSNGTALSSVQQAAPSSSASVALGSTTSLNMAEAVAQGPSRAQTTPQLSVGTQR 292

Query: 1466 -QELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIGYPH--------LILSQRGVAVKPD 1618
             +ELAIKQSRQLIPVTPS+PKALV    +K K K+G           L  S RG AVK D
Sbjct: 293  LEELAIKQSRQLIPVTPSMPKALVLTSSDKPKGKVGQQQHSISSSLPLNHSPRGGAVKGD 352

Query: 1619 ISKTSNIGKLQVLKPIRERNDSSLQPIDNLNPT-CEKAVSSVPDATHSVSASSYVRTQVN 1795
            ++K SN+GKLQVLKP+RE+N  +    DNL+PT   K V+S    + SVS S+  R   N
Sbjct: 353  VAKASNVGKLQVLKPVREKNGVTPVVKDNLSPTSSSKVVTSTLAVSPSVSGSAATRGLPN 412

Query: 1796 NYGHPVAERKHVPPLLEKRPISQSQTQSRNDFFSLMRKKSVPNSSTESFPAISASHKK-- 1969
            N  H   +RK    +LEKRP SQ+  QSRNDFF+L+RKKS+PNSS+    +  A+     
Sbjct: 413  NGVH---DRKPSLTVLEKRPTSQA--QSRNDFFNLVRKKSMPNSSSAVADSAMANCSSVL 467

Query: 1970 -----LGEDEGEGEVATSPVQSEEVPVLANIHVVNS-SKDRIMKTSNGYSCNG--CESLH 2125
                 +     + +V    + S   P  A++ + NS S DR+ +     + NG  C++ +
Sbjct: 468  DTGTAISPSFSDKDVEIDILPSSNTPKAADVPLSNSLSADRLSEEKGDLTSNGDACDAQN 527

Query: 2126 SKRNGSL--CNXXXXXXXXXXXXXXXXGWEENTEEGGLTDEEISAFYKDLNKYINSKPPC 2299
              RNG     +                GW+EN++EG LTDEEI+AFY+DL KYI+S P  
Sbjct: 528  YVRNGKKYPSSDPIISEEEEAAFLRSLGWDENSDEGALTDEEINAFYRDLTKYIDSNPSF 587

Query: 2300 KIQQGMP-RFLLALESQIGRVAGI 2368
            +I QG+  +FLL   S++G + GI
Sbjct: 588  RILQGVQLKFLLPFGSELGGIGGI 611


>XP_011092382.1 PREDICTED: uncharacterized protein LOC105172576 [Sesamum indicum]
          Length = 616

 Score =  370 bits (951), Expect = e-113
 Identities = 249/624 (39%), Positives = 344/624 (55%), Gaps = 29/624 (4%)
 Frame = +2

Query: 578  MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSSSTSD-QNLGR 754
            M++SEPT +P+WLRS GS+ G G        S  HSD+  T+ + R  S   S+  +  R
Sbjct: 1    MERSEPTLIPEWLRSAGSLNGGG--------SISHSDEQTTTKLARNKSLVNSNGHDSAR 52

Query: 755  SSASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCX 934
            S +S+R  SS F RSSS+NG  +LR+ SSF              +  +KD+++LGD    
Sbjct: 53   SFSSDRTTSSYFRRSSSSNGSGHLRSHSSFGRNHHDRDWEKDACDSRDKDKSVLGDRWHR 112

Query: 935  XXXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNPAPDNG 1114
                          ER+ LRRS SMI+ +RG+TW +KV T    A     NN N  P  G
Sbjct: 113  DFSDAMGNTLLSKFERDGLRRSQSMISGKRGDTWHKKVGTDLNIA---SGNNTNGLPSKG 169

Query: 1115 H----VHKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSA 1282
                 V+K +FE++FPSLG EE+ A  E+ RVPSPG S A+ S  +    +     W SA
Sbjct: 170  SPIGGVNKTTFERDFPSLGAEERAAIPEVGRVPSPGVSSALQSLPIGTPTIIRGEKWRSA 229

Query: 1283 LAEVPGIVKSNETGVSSA-KADALGLPTTASGISTGLSMAETVAQGXXXXXXXXXXSTES 1459
            LAEVP +V +N TG+SS  +A      + A G +T L+MAE VAQG          S  +
Sbjct: 230  LAEVPVLVGNNVTGISSVQQAAPSSSASVALGSTTSLNMAEAVAQGPSRAQTTPQLSIGT 289

Query: 1460 QR-QELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIGYPHLIL--------SQRGVAVK 1612
            QR +ELAIKQSRQLIPVTPS+PK L +   +K K K+G    ++        S RG  VK
Sbjct: 290  QRLEELAIKQSRQLIPVTPSMPKPLAACSADKQKTKVGQQQHVVTSSLAANQSPRGGPVK 349

Query: 1613 PDISKTSNIGKLQVLKPIRERNDSSLQPIDNLNPTCEKAVSSVPDATHSVSASSYVRTQV 1792
             D+SKTSN+GKL VLKP+RE+N ++    +NL+PT    + S P A  S+S S+  R   
Sbjct: 350  ADVSKTSNVGKLHVLKPVREKNGTTPVVKENLSPTSGSKLVSSPLAAPSLSGSAATRVLP 409

Query: 1793 NNYGHPVAERKHVPPLLEKRPISQSQTQSRNDFFSLMRKKSVPNSSTESFPAIS------ 1954
            NN   PVA+RK V  +LEKRP SQ+  QSRNDFF+ +RKKS+ NS++ +  AI+      
Sbjct: 410  NN---PVADRKPVWTVLEKRPTSQA--QSRNDFFNSVRKKSMANSTSVADAAIANSSPVD 464

Query: 1955 ---ASHKKLGEDEGEGEVATSPVQSEEVPVLANIHVVNSSKDRIMKTSNGYSCNG--CES 2119
               A+     +   E E+  +P   +          VN S + +  T +  +CNG  C++
Sbjct: 465  TAPAASPSFSDKLTETEIVVAPNTQDRNASSG----VNLSGENLSGTRSDTACNGDVCDA 520

Query: 2120 LHSKRNG--SLCNXXXXXXXXXXXXXXXXGWEENTEEGGLTDEEISAFYKDLNKYINSKP 2293
             +   NG  +  +                GWEEN +EGGLTDEEISAF++D+ KY++SKP
Sbjct: 521  QNYVSNGKKNHTSDPIFSEEEEAAFLRSLGWEENADEGGLTDEEISAFFRDVTKYVDSKP 580

Query: 2294 PCKIQQGM-PRFLLALESQIGRVA 2362
              KI Q + P+ LL  +S IG ++
Sbjct: 581  SLKILQAVQPKILLPFDSHIGGIS 604


>XP_002265987.2 PREDICTED: uncharacterized protein LOC100241871 [Vitis vinifera]
          Length = 665

 Score =  362 bits (929), Expect = e-110
 Identities = 264/657 (40%), Positives = 359/657 (54%), Gaps = 61/657 (9%)
 Frame = +2

Query: 578  MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSSSTSDQNLGRS 757
            MDK+EP  VP+WL+S+GSVTG G++ +  +PS L SDD       RK   +++D + GRS
Sbjct: 1    MDKTEPALVPEWLKSSGSVTGGGSTNHHFAPSLLQSDDGAALKPARKLMVNSNDHDTGRS 60

Query: 758  SASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCXX 937
            S  ER  SS F RSSS+NG  + R+ SSF            I++Y +KD+++L D R   
Sbjct: 61   SNLERTTSSYFRRSSSSNGSGHPRSFSSFGRTNREREWEKDIHDYRDKDKSVLSDHRHRD 120

Query: 938  XXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNPAPDNG- 1114
                        +ER+ LRRS SMI  +RG+ WPRKV     T  K+ ++N +    +G 
Sbjct: 121  YSDPLGNILPGRLERDMLRRSQSMITGKRGDMWPRKVAADVSTVNKTIHSNGDGQLASGI 180

Query: 1115 ---HVHKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSAL 1285
                V KA+F++NFPSLG E+K  A +I RV SPG + AI S  +  + + G   WTSAL
Sbjct: 181  VTSSVQKAAFDRNFPSLGAEDKQGAPDIGRVTSPGLTSAIQSLPIGNTVVIGGDGWTSAL 240

Query: 1286 AEVPGIVKSNETGVSSAKAD-ALGLPTTASGISTGLSMAETVAQG--XXXXXXXXXXSTE 1456
            AEVP I+ SN TGVSS +   +    + A   ++GL+MAET+ QG            S  
Sbjct: 241  AEVPVIIGSNTTGVSSVQQSVSASSVSVAPSTTSGLNMAETLVQGPARARANATPQLSVG 300

Query: 1457 SQR-QELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIGYPHLIL---SQRGVAVKPDIS 1624
            +QR +ELA+KQSRQLIP+TPS+PK LV +P +K K+KIG   L L   SQRG   + D++
Sbjct: 301  TQRLEELALKQSRQLIPMTPSMPKTLVPSPSDKPKSKIGLQPLHLVNHSQRGGPARSDVT 360

Query: 1625 KTSNIGKLQVLKPIRERNDSSLQPIDNLNPTCEKAVSSVPDA-THSVSASSYVRTQVNNY 1801
            KTSN+GKL VLKP RERN  S    D+L+PT    V++ P A T S + S+ +R+  NN 
Sbjct: 361  KTSNVGKLHVLKPSRERNGVSPTAKDSLSPTMGSRVANSPLAVTPSAAGSASLRSPRNNP 420

Query: 1802 GHPVAERKH--VPPLLEKRPISQSQTQSRNDFFSLMRKKS---VPNSSTESFPAISASHK 1966
                AER+   V   +EKRP SQ+  QSRNDFF+LMRKKS    P++  ES PA+S+S  
Sbjct: 421  TLASAERRPSVVLTSVEKRPTSQA--QSRNDFFNLMRKKSSTNPPSAVPESGPAVSSSVS 478

Query: 1967 KLGEDEGEGEVATSPVQSEEVPVLA--NIHVVNSSKDRIMKTSNGY-------------- 2098
            +   DE   EV T+PV  +   +L+  N  +  S+++R  KT NG               
Sbjct: 479  E-KSDELITEVVTAPVTPKGRDILSSDNSGLDWSNENRGDKTENGNNEACGVSQNDRDDE 537

Query: 2099 --SCNGCESLHSKR---------NGSLC----------------NXXXXXXXXXXXXXXX 2197
              + NG     S+R         NG  C                +               
Sbjct: 538  IDNVNGDACDVSQRDQGDEVHDGNGDACDVSQKFLDNGEKHSSPDEVLYPDEEEAAFLRS 597

Query: 2198 XGWEENTEEGGLTDEEISAFYKDLNKYINSKPPCKIQQGM-PRFLLALESQIGRVAG 2365
             GWEEN E+ GLT+EEI+AFYK+  K    KP   + Q M P+    L+SQ+G VAG
Sbjct: 598  LGWEENGEDEGLTEEEINAFYKECMKL---KPSSNLLQRMLPKISPLLDSQMGSVAG 651


>XP_017259125.1 PREDICTED: uncharacterized protein LOC108228142 [Daucus carota subsp.
            sativus]
          Length = 620

 Score =  354 bits (908), Expect = e-107
 Identities = 253/621 (40%), Positives = 338/621 (54%), Gaps = 24/621 (3%)
 Frame = +2

Query: 578  MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSSSTSDQNLGRS 757
            M+KSEPT VP+WL+S+GSVTG G S N L+PS LH D+  T    R  S      N+G  
Sbjct: 1    MEKSEPTLVPEWLKSSGSVTG-GVSTNHLNPS-LHQDNQATLKAARNKSLV----NIGDH 54

Query: 758  SASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCXX 937
                R  SS F RSSSN G S+LR+  SF            I++  +K+++ LGD +   
Sbjct: 55   DIGHRTTSSYFRRSSSN-GTSHLRSYGSFGRNNRDRDWDRDIHDIRDKEKSNLGDRKYRQ 113

Query: 938  XXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNP----AP 1105
                         E++ LRR+ S I+    E WPR+V +  K   KS +NN N     + 
Sbjct: 114  FSDSFESNSLSRFEKDGLRRTQSTISRTGVEPWPRRVPSDLKNIDKSNHNNGNSRLAVSS 173

Query: 1106 DNGHVHKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSAL 1285
                VHKASF+++FPSLG EE+    EI RVPSPG   AI +    +SA    G WTSAL
Sbjct: 174  PISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIADGGWTSAL 233

Query: 1286 AEVPGIVKSNETGVSSAKADALGLPTTASGISTGLSMAETVAQGXXXXXXXXXXSTESQR 1465
            AEVP ++ SN T  SS         +    + TGL+MAET+ QG          S E+QR
Sbjct: 234  AEVPAMIGSNGTTASSVPHSVSSSASVVPSMMTGLNMAETLVQGPPRVQADPQLSVETQR 293

Query: 1466 -QELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIGYPH-------LILSQRGVAVKPDI 1621
             +ELAIKQSRQLIPVTPSLPKALV N  +K+K K+G          +  S RG   K +I
Sbjct: 294  LEELAIKQSRQLIPVTPSLPKALVLNSSDKAKGKVGLQQQSASTNLVHHSPRGAPTKNEI 353

Query: 1622 SKTSNIGKLQVLKPIRERNDSSLQPIDNLNPTCEKAVSSVPDA-THSVSASSYVRTQVNN 1798
             KTS++GKLQVLKP RERN  S    D L+PT    +++ P A   +   S+ +R+ +N+
Sbjct: 354  IKTSSLGKLQVLKPARERNGVSNTSKDTLSPTSSSKLANNPLAPALATVGSAPLRSSMNH 413

Query: 1799 YGHPVAERKHVP-----PLLEKRPISQSQTQSRNDFFSLMRKKSVPNSSTESFPAISA-S 1960
                 AERK  P     P+LEKRP    Q +SRNDFF+ MRKKS+ NSS+     +SA S
Sbjct: 414  SILVSAERKSAPPVMVTPMLEKRP--SPQAKSRNDFFNSMRKKSMTNSSSAVSNTVSAVS 471

Query: 1961 HKKLGEDEGEGEVATS-PVQSEEVPVL--ANIHVVNSSKDRIMKTSNGYSCNGCESLHSK 2131
               LG++  EGE + S   Q  +VPV+  ++   +N  +D  ++ S+G       SL + 
Sbjct: 472  PSDLGKN-SEGEASASLDSQGRDVPVVESSDEGKINECRDGSIQNSHGPQ----NSLDNG 526

Query: 2132 RNGSLCNXXXXXXXXXXXXXXXXGWEENT-EEGGLTDEEISAFYKDLNKYINSKPPCKIQ 2308
             N S  +                GWEEN  E+ GLT+EEI+AFY+D++KYINS PP K  
Sbjct: 527  VNHSSTDVILSSEEEEAAFLRSLGWEENAGEDEGLTEEEINAFYRDVSKYINSAPPSKTL 586

Query: 2309 QGMPRFLLA-LESQIGRVAGI 2368
             G  + L   +  Q+G   G+
Sbjct: 587  LGTKQKLFGPINFQMGSNGGV 607


>KZM90816.1 hypothetical protein DCAR_021819 [Daucus carota subsp. sativus]
          Length = 617

 Score =  346 bits (887), Expect = e-104
 Identities = 252/621 (40%), Positives = 335/621 (53%), Gaps = 24/621 (3%)
 Frame = +2

Query: 578  MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSSSTSDQNLGRS 757
            M+KSEPT VP+WL+S+GSVTG G S N L+PS LH D+  T    R  S      N+G  
Sbjct: 1    MEKSEPTLVPEWLKSSGSVTG-GVSTNHLNPS-LHQDNQATLKAARNKSLV----NIGDH 54

Query: 758  SASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCXX 937
                R  SS F RSSSN G S+LR+  SF            I++  +K+++ LGD +   
Sbjct: 55   DIGHRTTSSYFRRSSSN-GTSHLRSYGSFGRNNRDRDWDRDIHDIRDKEKSNLGDRKYRQ 113

Query: 938  XXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNP----AP 1105
                         E++ LRR+ S I+    E WPR+V +  K   KS +NN N     + 
Sbjct: 114  FSDSFESNSLSRFEKDGLRRTQSTISRTGVEPWPRRVPSDLKNIDKSNHNNGNSRLAVSS 173

Query: 1106 DNGHVHKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSAL 1285
                VHKASF+++FPSLG EE+    EI RVPSPG   AI +    +SA    G WTSAL
Sbjct: 174  PISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIADGGWTSAL 233

Query: 1286 AEVPGIVKSNETGVSSAKADALGLPTTASGISTGLSMAETVAQGXXXXXXXXXXSTESQR 1465
            AEVP ++ SN T  SS         +    + TGL+MAET+ QG          S E+QR
Sbjct: 234  AEVPAMIGSNGTTASSVPHSVSSSASVVPSMMTGLNMAETLVQGPPRVQADPQLSVETQR 293

Query: 1466 -QELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIGYPH-------LILSQRGVAVKPDI 1621
             +ELAIKQSRQLIPVTPSLPKALV N  +K+K K+G          +  S RG   K +I
Sbjct: 294  LEELAIKQSRQLIPVTPSLPKALVLNSSDKAKGKVGLQQQSASTNLVHHSPRGAPTKNEI 353

Query: 1622 SKTSNIGKLQVLKPIRERNDSSLQPIDNLNPTCEKAVSSVPDA-THSVSASSYVRTQVNN 1798
             KTS++GKLQVLKP RERN  S    D L+PT    +++ P A   +   S+ +R+ +N+
Sbjct: 354  IKTSSLGKLQVLKPARERNGVSNTSKDTLSPTSSSKLANNPLAPALATVGSAPLRSSMNH 413

Query: 1799 YGHPVAERKHVP-----PLLEKRPISQSQTQSRNDFFSLMRKKSVPNSSTESFPAISA-S 1960
                 AERK  P     P+LEKRP    Q +SRNDFF+ MRKKS+ NSS+     +SA S
Sbjct: 414  SILVSAERKSAPPVMVTPMLEKRP--SPQAKSRNDFFNSMRKKSMTNSSSAVSNTVSAVS 471

Query: 1961 HKKLGEDEGEGEVATS-PVQSEEVPVL--ANIHVVNSSKDRIMKTSNGYSCNGCESLHSK 2131
               LG++  EGE + S   Q  +VPV+  ++   +N  +D  ++ S+G       SL + 
Sbjct: 472  PSDLGKN-SEGEASASLDSQGRDVPVVESSDEGKINECRDGSIQNSHGPQ----NSLDNG 526

Query: 2132 RNGSLCNXXXXXXXXXXXXXXXXGWEENT-EEGGLTDEEISAFYKDLNKYINSKPPCKIQ 2308
             N S  +                GWEEN  E+ GLT+EEI+AFY+D   YINS PP K  
Sbjct: 527  VNHSSTDVILSSEEEEAAFLRSLGWEENAGEDEGLTEEEINAFYRD---YINSAPPSKTL 583

Query: 2309 QGMPRFLLA-LESQIGRVAGI 2368
             G  + L   +  Q+G   G+
Sbjct: 584  LGTKQKLFGPINFQMGSNGGV 604


>KVI01779.1 hypothetical protein Ccrd_019964, partial [Cynara cardunculus var.
            scolymus]
          Length = 551

 Score =  332 bits (851), Expect = e-100
 Identities = 227/579 (39%), Positives = 328/579 (56%), Gaps = 16/579 (2%)
 Frame = +2

Query: 578  MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVR-KNSSSTSDQNLGR 754
            M+++EPTFVP+WL+S+GS++   T  +  + SSLH DD G S  +R K+  ++ D +LGR
Sbjct: 1    MERTEPTFVPEWLKSSGSLS---TISHQFTSSSLHPDDQGVSKSLRTKSLVNSGDNDLGR 57

Query: 755  SSASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCX 934
            +S S+R  SS F R+SS+NG ++LR+ +SF            IYE+ +K+++   D R  
Sbjct: 58   TSVSDRTTSSYFRRTSSSNGAAHLRSYNSFSRNHRDRDWDKDIYEFRDKEKS---DNRHR 114

Query: 935  XXXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNPAPDNG 1114
                          E++ LRRSHS ++ +RGE+WPRKV        K+ +NN +  P  G
Sbjct: 115  DYSDHLANILPSRFEKDGLRRSHSSLSAKRGESWPRKVAGD-----KNGHNNGSALPSVG 169

Query: 1115 HVH---KASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSAL 1285
                  KA+FE++FPSLG EEK A  EI RVPSPG + AI S  + +SA+    MWTSAL
Sbjct: 170  TSSSSGKAAFERDFPSLGAEEKQADTEIGRVPSPGLTTAIQSLPIGSSAVICGDMWTSAL 229

Query: 1286 AEVPGIVKSNETGVSSAKADALGLPTTASGISTGLSMAETVAQGXXXXXXXXXXSTESQR 1465
            AEVP IV SN + +S  +       +  + ++TG +MAET+AQG          S  +QR
Sbjct: 230  AEVPMIVGSNGSNISVQQPIQPTSVSATTSMTTGRNMAETLAQGPSRARTTPQLSVGTQR 289

Query: 1466 -QELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIGYPHLILSQ--------RGVAVKPD 1618
             +ELA+KQSRQLIP+TPS+PKAL  N  +K K K+G   L  S         R V+VK D
Sbjct: 290  LEELAVKQSRQLIPMTPSMPKALALNSSDKPKLKVGQSQLQNSHIVNHPPSLRPVSVKSD 349

Query: 1619 ISKTSNIGKLQVLKPIRERNDSSLQPIDNLNPTCEKAVSSVPDATHSVSASSYVRTQVNN 1798
            ++K S +GKL +LK  RERN ++    ++L+PT    + + P A   V  S+ +R   N 
Sbjct: 350  VTKVSTVGKLHILKSSRERNGTTSTAKESLSPTGGSKLPNSPLAVPVVVGSASLR---NT 406

Query: 1799 YGHP-VAERKHVPPLLEKRPISQSQTQSRNDFFSLMRKKSVPNSSTESFPAISASHKKLG 1975
             G   VA+RK   P +EKRP    Q QSRNDFF+LMRKKS+  +S+    + + S +   
Sbjct: 407  GGSTIVADRK---PCVEKRP--SPQAQSRNDFFNLMRKKSMATNSSSPGASEAGSSESTN 461

Query: 1976 EDEGEGEVATSPVQSEEVPVLANIHVVNSSKDRIMKTSNGYSCNG--CESLHSKRNGSLC 2149
            +  GE +V       + V V  +  V   S++++      +SCNG   E  ++++N S  
Sbjct: 462  DKPGEPQVG----GYDPVVVDRSCGVQTLSENKV-----DFSCNGDATERSNNEKNHSSS 512

Query: 2150 NXXXXXXXXXXXXXXXXGWEENTEEGGLTDEEISAFYKD 2266
            +                GWEE TEE GLT+EEI++FY+D
Sbjct: 513  DAILYSEEEEARFLRSLGWEETTEEEGLTEEEINSFYRD 551


>KVH93465.1 hypothetical protein Ccrd_004483 [Cynara cardunculus var. scolymus]
          Length = 636

 Score =  332 bits (850), Expect = 1e-98
 Identities = 250/630 (39%), Positives = 327/630 (51%), Gaps = 52/630 (8%)
 Frame = +2

Query: 572  LMMDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHS---------------------- 685
            L M++SEPTFVP+WL+S+G   G  T+ + L  SSLHS                      
Sbjct: 8    LTMERSEPTFVPEWLKSSG---GLSTTSHQLQSSSLHSGNSIHFISQQYMLFGISFQFCY 64

Query: 686  --------DDYGTSNVVRKNSS-STSDQNLGRSSASERIKSSCFWRSSSNNGPSNLRTTS 838
                    D+ G S   R  S  + SD  LGR S S+R  SS F R+SSN G S+LR+ S
Sbjct: 65   LPDNVVLLDEQGVSKATRNKSFVNISDNELGRPSVSDRTTSSYFRRTSSN-GSSHLRSYS 123

Query: 839  SFXXXXXXXXXXXXIYEYSNKDRTILGDFRCXXXXXXXXXXXXXXVERNSLRRSHSMIAD 1018
            SF            I+E+  K++    D R                E+  LRRSHS ++ 
Sbjct: 124  SFGRNHRDRDWDKDIHEFREKEKP---DGRLRDYSDPLGNILPSRFEKEGLRRSHSSVSA 180

Query: 1019 QRGETWPRKVETSPKTAYKSKNNNVNPAPDN----GHVHKASFEQNFPSLGVEEKLAAFE 1186
            +RGE+WPRKV     +A K+ +NN +         G V K +FE++FPSLG EEK    E
Sbjct: 181  KRGESWPRKVVVDSSSANKNSHNNGSALRSGAGAIGSV-KTAFERDFPSLGAEEKQIDPE 239

Query: 1187 IRRVPSPGPSLAIHSSQVSASAMRGSGMWTSALAEVPGIVKSNETGVS-SAKADALGLPT 1363
            I RVPSPG + AI S  +  SA+ G   WTSALAEVP IV SN +  S      +  +  
Sbjct: 240  IGRVPSPGLTTAIQSLPIGNSAVIGGDGWTSALAEVPVIVGSNGSNTSVPPPLQSTSISA 299

Query: 1364 TASGISTGLSMAETVAQGXXXXXXXXXXSTESQR-QELAIKQSRQLIPVTPSLPKALVSN 1540
            TAS ++TG +MAET+AQG          S  +QR +ELA+KQSRQLIP+TPSLPKAL  N
Sbjct: 300  TAS-MATGRNMAETLAQGPPRAQTAPQLSVGTQRLEELAVKQSRQLIPMTPSLPKALALN 358

Query: 1541 PPEKSKAKIGY-----PHLIL---SQRGVAVKPDISKTSNIGKLQVLKPIRERNDSSLQP 1696
              +K K+K+G       HL+    S R V+ K D+SKTS++GKL VLKP RERN  +   
Sbjct: 359  SSDKPKSKVGQLQLQSSHLVNHTHSPRPVSTKFDVSKTSSVGKLHVLKPSRERNGITPIA 418

Query: 1697 IDNLNPTCEKAVSSVPDATHSVSASSYVRTQVNNYGHPVAERKHVPPLLEKRPISQSQTQ 1876
             DNL+PT    + + P A  SV  S+ +R   NN    VA +  V   LEKRP   SQ Q
Sbjct: 419  KDNLSPTGASKLPNSPLAVTSVVGSAPLRNLGNNPAVAVAVKPGVAATLEKRP--SSQAQ 476

Query: 1877 SRNDFFSLMRKKSVPNSSTESFP----AISASHKKLGEDEGEGEVATSPVQSEEVPVLAN 2044
            SRNDFF+LMRKKS+ N+S+   P    +ISA  K    + G        +    V     
Sbjct: 477  SRNDFFNLMRKKSMTNNSSPVTPDTGSSISAGDKPTATEGG--------IDPAVVDGSGG 528

Query: 2045 IHVVNSSKDRIMKTSNGYSCNG--CESLHSKRNGSLCNXXXXXXXXXXXXXXXXGWEE-N 2215
            + V + +K  +       SCNG   E  + K N S                   GWEE  
Sbjct: 529  VQVSSGNKVDLS------SCNGEATERSNGKNNSSSDAIILYSEEEEARFLRSLGWEETG 582

Query: 2216 TEEGGLTDEEISAFYKDLNKYINSKPPCKI 2305
             EE GLT+EEIS+FY+D++KY+N +   KI
Sbjct: 583  EEEEGLTEEEISSFYRDVSKYLNLQAASKI 612


>KZV20788.1 hypothetical protein F511_30430 [Dorcoceras hygrometricum]
          Length = 601

 Score =  329 bits (843), Expect = 5e-98
 Identities = 240/602 (39%), Positives = 324/602 (53%), Gaps = 18/602 (2%)
 Frame = +2

Query: 578  MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNS-SSTSDQNLGR 754
            MDKSEPT VP+WL+++G+ +G G        S+LHSDD     + R NS  S++  + GR
Sbjct: 1    MDKSEPTLVPEWLKNSGNQSGGG--------STLHSDDKSAPKLSRNNSFMSSNGHDFGR 52

Query: 755  SSASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCX 934
            SS+SE+  SS F RSSS+NG  NLR+ +SF             Y+  +KD+++ GD    
Sbjct: 53   SSSSEKTTSSYFHRSSSSNGSGNLRSYNSFGRNRRDRDWEKDRYDSQDKDKSVSGDRWHR 112

Query: 935  XXXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNV--NPAPD 1108
                          E + LRRS S  +   G+TW +KV T   +A  +  N +    AP 
Sbjct: 113  VFSDSSGNSFSGKFEWDGLRRSQSATSGPHGDTWTKKVVTDSSSAGGNNTNTLLTKGAPG 172

Query: 1109 NGHVHKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSALA 1288
             G V K  FE+NFPSLG EE+    E+ RVPSPG S AI S  +  +A  G   WTSALA
Sbjct: 173  GG-VTKTRFERNFPSLGSEERAVIPEVGRVPSPGLSSAIQSLPIGTAAAVGGEKWTSALA 231

Query: 1289 EVPGIVKSNETGVSSAKADALGLPTTASGISTGLSMAETVAQGXXXXXXXXXXSTESQR- 1465
            EVP IV SN  GVSS    A      AS  +T L+MAE VAQG          S  +QR 
Sbjct: 232  EVPVIVGSNGIGVSSVTQSA--STQLASSTTTTLNMAEAVAQGPSRSPAMPQISVGTQRL 289

Query: 1466 QELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIG-YPHLILSQRGVAVKPDISK-TSNI 1639
            +ELAIKQSRQLIPVTPS+PK LVSN  +K K K+G   H I S      K D+SK +SN+
Sbjct: 290  EELAIKQSRQLIPVTPSMPKTLVSNSSDKQKTKLGQQQHSINSLPINHSKSDMSKSSSNV 349

Query: 1640 GKLQVLKPIRERNDSSLQPIDNLNP-TCEKAVSSVPDATHSVSASSYVRTQVNNYGHPVA 1816
            GKL VLK  RE+N  +    DNL+P T   AVSS    + SV+ +   +   N    PV 
Sbjct: 350  GKLHVLKLTREKNGVAPVVKDNLSPTTAVNAVSSTLLTSPSVTGAVASKGPPN---MPVL 406

Query: 1817 ERKHVPPLLEKRPISQSQTQSRNDFFSLMRKKSVPNSST----ESFPAISASH------K 1966
             RK    +LEKR  SQ+Q QSR +FF+L+RKKS+  S++    E+F ++ + H       
Sbjct: 407  NRKPSLAVLEKRNTSQAQAQSRKEFFNLVRKKSMAISTSATDAENFSSVDSGHAVSPPPS 466

Query: 1967 KLGEDEGEGEVATSPVQSEEVPVLANIHVVNSSKDRIMKTSNGYSCNGCESLHSKRNGSL 2146
            +  E E      TS +   +     +  +    +D +  T    +C+  + L +  N S+
Sbjct: 467  ETSEKEDVPAPNTSQIDDAQSSASLSDDLFPEKRDDV--TCPDDTCSMPKYLGNGMNASM 524

Query: 2147 CNXXXXXXXXXXXXXXXXGWEENTEEGGLTDEEISAFYKDLNKYINSKPPCKIQQGM-PR 2323
                              GWEEN++EGGLT+EEIS+F+KD  KY NSKP  +I + + P+
Sbjct: 525  --DPLFSEEEEAAFLRSLGWEENSDEGGLTEEEISSFFKDATKY-NSKPALRILEVVQPK 581

Query: 2324 FL 2329
            F+
Sbjct: 582  FI 583


>XP_012828376.1 PREDICTED: uncharacterized protein LOC105949617 isoform X1
            [Erythranthe guttata]
          Length = 575

 Score =  327 bits (837), Expect = 2e-97
 Identities = 236/615 (38%), Positives = 320/615 (52%), Gaps = 20/615 (3%)
 Frame = +2

Query: 578  MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSS-STSDQNLGR 754
            MD+SEP+ VPQWL+++GS TG G             D++  S V R  S  +T+  + GR
Sbjct: 1    MDRSEPSLVPQWLKNSGSSTGGG-------------DNHPASRVARNKSFVNTNGNDFGR 47

Query: 755  SSASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILG-DFRC 931
            +S S +  SS F RSSS+N   + ++ SSF             Y   +K+R +LG D   
Sbjct: 48   ASGSAKTTSSYFRRSSSSNSSGSSKSYSSFGRNQRDRDWEKDTYNSRDKERLVLGGDRHR 107

Query: 932  XXXXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNPAPDN 1111
                           ER+ LRRSHSMI+ + GETWP+KV T   +     N N   A  +
Sbjct: 108  YESSELLGNPSLSKYERDGLRRSHSMISGKHGETWPKKVVTESSSGSGKNNGNGFLAKGS 167

Query: 1112 --GHVHKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSAL 1285
              G  +KA+FE++FPSLG +++    E+ RV SPG S A+ S  + +SA  G   WTSAL
Sbjct: 168  PVGVANKATFERDFPSLGTDDRAVVPEVGRVASPGLSSALQSLPIGSSASIGGERWTSAL 227

Query: 1286 AEVPGIVKSNETGVSSAKADALGLPTTASGI---STGLSMAETVAQGXXXXXXXXXXSTE 1456
            AEVP +V SN T   S +  A    TTAS +   +T L+MAE VAQG          S  
Sbjct: 228  AEVPMLVVSNGTASLSVQ-QAAPSSTTASVVVSSTTSLNMAEAVAQGPTRAQTAPQLSLG 286

Query: 1457 SQR-QELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIG----YP-----HLILSQRGV- 1603
            +QR +ELAIKQSRQLIPVTP++PK LV +  +K K+K+G    +P      +  S RG  
Sbjct: 287  TQRLEELAIKQSRQLIPVTPTMPKTLVLSSSDKQKSKVGLIQQHPTPSSLPINQSPRGAP 346

Query: 1604 AVKPDISKTSNIGKLQVLKPIRERNDSSLQPIDNLNPT-CEKAVSSVPDATHSVSASSYV 1780
              KPD SK SN+GKL VLKP+RE+N  +    D L+PT   KAV+S   A+         
Sbjct: 347  PSKPDFSKASNVGKLHVLKPVREKNGVTPSVKDKLSPTGSGKAVNSTLPAS--------- 397

Query: 1781 RTQVNNYGHPVAERKHVPPLLEKRPISQSQTQSRNDFFSLMRKKSVPNSSTESFPAISAS 1960
                     P A +  +   LEKRP +Q+  QSRNDFF  MR+KSV NSS+ S    + S
Sbjct: 398  ---------PSAVKPLLTTALEKRPTTQA--QSRNDFFKRMREKSVSNSSSASETGTAIS 446

Query: 1961 HKKLGEDEGEGEVATSPVQSEEVPVLANIHVVNSSKDRIMKTSNGYSCNGCESLHSKRNG 2140
             +K      +  V  + +     P+     V  +    +   SNG   N    +  +   
Sbjct: 447  PEK----HAKVAVVPAAITGAVEPLPEEKAVRTTCNGGVQHISNGKKYNSEPIISEEEEA 502

Query: 2141 SLCNXXXXXXXXXXXXXXXXGWEENTEEGGLTDEEISAFYKDLNKYINSKPPCKIQQGMP 2320
                                GW+EN +EGGLT+EEISAFY+D  KYINSKP  +I QG+ 
Sbjct: 503  KFLR--------------SMGWDENDDEGGLTEEEISAFYRDFTKYINSKPSLRILQGVR 548

Query: 2321 -RFLLALESQIGRVA 2362
             +FLL  +SQIG ++
Sbjct: 549  LKFLLPFDSQIGGIS 563


>XP_017258903.1 PREDICTED: uncharacterized protein LOC108227982 [Daucus carota subsp.
            sativus]
          Length = 591

 Score =  322 bits (826), Expect = 1e-95
 Identities = 229/604 (37%), Positives = 316/604 (52%), Gaps = 29/604 (4%)
 Frame = +2

Query: 578  MDKSEPTFVPQWLRSNGSVTGN-GTSINALSPSSLHSDDYGTSNVVRKNSS--STSDQNL 748
            M+K+EPTFVP+WL+S+GSVT    T+ + ++ SSL SDD  T    R  SS    S  N 
Sbjct: 1    MEKNEPTFVPEWLKSSGSVTSAVSTNHHQIASSSLLSDDRATLKSTRNKSSIDDISAHNS 60

Query: 749  GRSSASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFR 928
            G S  S+R  SS F RSS++NG S LR+  SF              EY + D+  +GD R
Sbjct: 61   GSSPVSDRTTSSYFRRSSTSNG-SQLRSYGSFGRTNRDKGWDKDTNEYHDSDKLRIGDHR 119

Query: 929  CXXXXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNPAPD 1108
                            E++ L+R+ S I+ +  E W RKV     +  KS  NN +    
Sbjct: 120  HRNFSDPLGSNFSNRFEKDGLKRTQSSISGKYNEPWSRKVSADMNSFDKSNYNNGSSLLA 179

Query: 1109 NGH----VHKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWT 1276
                   V KA+F+++FPSLG +E+   +E+RRVPSPG S  + +  +  SA+ G   WT
Sbjct: 180  GSSAISTVRKAAFDRDFPSLGADERQTDYELRRVPSPGLSTNMQNLPIGYSAVTGEIGWT 239

Query: 1277 SALAEVPGIVKSNETGVSSAKADALGLPTT---ASGISTGLSMAETVAQGXXXXXXXXXX 1447
            SALAEV   VK    G++ +      LP++   AS +++GL+MAET+AQG          
Sbjct: 240  SALAEVQ--VKVGANGINKSSVAQAALPSSASVASSMTSGLNMAETLAQGPPHVHATQFS 297

Query: 1448 STESQRQELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIGYP--------HLILSQRGV 1603
                + +E+AIKQS+QLIPVTPS+PKALV N  EKSK K            H   S RG 
Sbjct: 298  VGTQRLEEIAIKQSKQLIPVTPSMPKALVLNSSEKSKTKAAQQQHQTSSTHHFNHSPRGT 357

Query: 1604 AVKPDISKTSNIGKLQVLKPIRERNDSSLQPIDNLNPTCEKAVSSVP-DATHSVSASSYV 1780
             +K D+SKTS++GKLQVLKP RERND S Q  D L+PT    V + P  A  SV     +
Sbjct: 358  PMKSDMSKTSSLGKLQVLKPARERNDISYQTKDTLSPTNASKVPNNPLTAASSVGVPPSL 417

Query: 1781 RTQVNNYGHPVAERKHVPPLLEKRPISQSQTQSRNDFFSLMRKKSVPNSSTESFPAISAS 1960
            R+ + N   P+     VP +LEK+P   +Q +SRNDFF+L+RKKS+ N S+    ++S  
Sbjct: 418  RSPIKN---PIVASGVVPTVLEKKP--SAQLRSRNDFFNLVRKKSLTNHSSPVVDSVSTV 472

Query: 1961 HKKLGEDEGEGEVATSPVQSEEVPVLANIHVVNSSKDRIMKTSNGY-----SCNGC-ESL 2122
             + + E   E +    P    E  +LAN        D +    NG      +C+G  +S 
Sbjct: 473  SQSILEQPSEHKAGAPP--PGEDSLLAN------QSDTVQYKMNGLISNRDACDGTPKSP 524

Query: 2123 HSKRNG---SLCNXXXXXXXXXXXXXXXXGWEENT-EEGGLTDEEISAFYKDLNKYINSK 2290
             +  NG   S  +                GW+EN  E+ GLT+EEI  FY+D +KYI  +
Sbjct: 525  DNGENGETRSSSDVILCSEEEEAAFLRSLGWDENAGEDEGLTEEEIREFYRDASKYIKPR 584

Query: 2291 PPCK 2302
            P  K
Sbjct: 585  PSSK 588


>XP_010245092.1 PREDICTED: uncharacterized protein LOC104588732 isoform X1 [Nelumbo
            nucifera]
          Length = 645

 Score =  317 bits (811), Expect = 7e-93
 Identities = 238/638 (37%), Positives = 332/638 (52%), Gaps = 42/638 (6%)
 Frame = +2

Query: 578  MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSS-STSDQNLGR 754
            M KSEPT VP+WL+  G +TG G++ +  + SSL SDD   +   R  SS S  D +  R
Sbjct: 1    MAKSEPTLVPEWLKGTGGITGAGSTTHHFASSSLQSDDNAVALPTRNRSSLSIGDYDTPR 60

Query: 755  SSA-SERIKSSCFWRSSSNNG--------PSNLRTTSSFXXXXXXXXXXXXIYEYSNKDR 907
            SSA S+R  S+   RSSS+NG        PS  R+ S+F            I ++ +K+R
Sbjct: 61   SSAFSDRTSSAYSRRSSSSNGSIVHDKEIPSYTRSYSAFARSHRDRDWEKDILDFRDKER 120

Query: 908  TILGDFRCXXXXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNN 1087
            ++ GD R               +E+++LRRS SM++ +RGE WPRKV      A    N 
Sbjct: 121  SVPGDHRDLDFSDPLVSILTSRIEKDTLRRSQSMVSGKRGEVWPRKV------AADLNNG 174

Query: 1088 NVNPAPDNG---------HVHKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQV 1240
            N+N    NG          + KA+FE++FPSLG EEK    +I RV SPG S A+ S  +
Sbjct: 175  NINQNTSNGLLVGGSIVSSIQKAAFERDFPSLGAEEKPGTPDIGRVSSPGLSSAVQSLPM 234

Query: 1241 SASAMRGSGMWTSALAEVPGIVKSNETGVSSAKADALGLPTT-ASGISTGLSMAETVAQG 1417
             +SA+ G   WTSALAEVP I+ +N TG+SS +   LG   + A+  STGL+MAET+AQ 
Sbjct: 235  GSSALIGGDGWTSALAEVPMIIGNNGTGISSVQQATLGSSASGATNSSTGLNMAETLAQA 294

Query: 1418 XXXXXXXXXXSTESQR-QELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIG-------- 1570
                      S E+QR +ELAIKQSRQLIP+TPS+PK  V N  EK+K KI         
Sbjct: 295  PSRARISPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSVLNSLEKAKPKISVRTGEMNA 354

Query: 1571 ----YPHLILSQRGVAVKPDISKTSNIGKLQVLKPIRERNDSSLQPIDNLNPTCEKAVSS 1738
                    + S RG  ++ D+SKTS+ GKL VLK  RE+N  S    D  +PT    V++
Sbjct: 355  TKTIQQQQLSSLRGAPMRSDVSKTSHGGKLLVLKAPREKNGISPIAKDGQSPTNVSKVAN 414

Query: 1739 VPDATHSVSASSYVR----TQVNNYGHPVAERKHVPPLLEKRPISQSQTQSRNDFFSLMR 1906
             P A    +A + ++    ++++N     A        +EKRP + SQ QSRNDFF+LMR
Sbjct: 415  NPLALAPSAAFTPLKSPNNSKLSNERKSAAASLMHGSSVEKRP-TTSQVQSRNDFFNLMR 473

Query: 1907 KKSVPN-SSTESFPAISASHKKLGEDEGEGEVATSPV--QSEEVPVLANIHVVNSSKDRI 2077
            KK+  N SS    P+   S   L +   +  +  +PV  QS + P      +  S+++  
Sbjct: 474  KKTSGNLSSAAPDPSPVVSSSLLDKSTEQTALPAAPVSPQSSDAPSPDPSCLDWSTENGS 533

Query: 2078 MKTSNGYSCNGCES-LHSKRNGSLCNXXXXXXXXXXXXXXXXGWEENT-EEGGLTDEEIS 2251
               SNG +    +  L++    S  +                GW+EN  EE GLT+EEIS
Sbjct: 534  ETISNGNASEESQRFLNNGEKHSSPDAFVYPDEEEAAFLRSLGWDENAGEEEGLTEEEIS 593

Query: 2252 AFYKDLNKYINSKPPCKIQQGMPRFLLALESQIGRVAG 2365
            AFYK+  K   S   C+  Q   +  + LES++G   G
Sbjct: 594  AFYKEYMKLRPSSKLCRGSQQQVKLPMPLESRVGSFGG 631


>KZN09159.1 hypothetical protein DCAR_001815 [Daucus carota subsp. sativus]
          Length = 593

 Score =  314 bits (804), Expect = 2e-92
 Identities = 225/595 (37%), Positives = 311/595 (52%), Gaps = 29/595 (4%)
 Frame = +2

Query: 578  MDKSEPTFVPQWLRSNGSVTGN-GTSINALSPSSLHSDDYGTSNVVRKNSS--STSDQNL 748
            M+K+EPTFVP+WL+S+GSVT    T+ + ++ SSL SDD  T    R  SS    S  N 
Sbjct: 1    MEKNEPTFVPEWLKSSGSVTSAVSTNHHQIASSSLLSDDRATLKSTRNKSSIDDISAHNS 60

Query: 749  GRSSASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFR 928
            G S  S+R  SS F RSS++NG S LR+  SF              EY + D+  +GD R
Sbjct: 61   GSSPVSDRTTSSYFRRSSTSNG-SQLRSYGSFGRTNRDKGWDKDTNEYHDSDKLRIGDHR 119

Query: 929  CXXXXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNPAPD 1108
                            E++ L+R+ S I+ +  E W RKV     +  KS  NN +    
Sbjct: 120  HRNFSDPLGSNFSNRFEKDGLKRTQSSISGKYNEPWSRKVSADMNSFDKSNYNNGSSLLA 179

Query: 1109 NGH----VHKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWT 1276
                   V KA+F+++FPSLG +E+   +E+RRVPSPG S  + +  +  SA+ G   WT
Sbjct: 180  GSSAISTVRKAAFDRDFPSLGADERQTDYELRRVPSPGLSTNMQNLPIGYSAVTGEIGWT 239

Query: 1277 SALAEVPGIVKSNETGVSSAKADALGLPTT---ASGISTGLSMAETVAQGXXXXXXXXXX 1447
            SALAEV   VK    G++ +      LP++   AS +++GL+MAET+AQG          
Sbjct: 240  SALAEVQ--VKVGANGINKSSVAQAALPSSASVASSMTSGLNMAETLAQGPPHVHATQFS 297

Query: 1448 STESQRQELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIGYP--------HLILSQRGV 1603
                + +E+AIKQS+QLIPVTPS+PKALV N  EKSK K            H   S RG 
Sbjct: 298  VGTQRLEEIAIKQSKQLIPVTPSMPKALVLNSSEKSKTKAAQQQHQTSSTHHFNHSPRGT 357

Query: 1604 AVKPDISKTSNIGKLQVLKPIRERNDSSLQPIDNLNPTCEKAVSSVP-DATHSVSASSYV 1780
             +K D+SKTS++GKLQVLKP RERND S Q  D L+PT    V + P  A  SV     +
Sbjct: 358  PMKSDMSKTSSLGKLQVLKPARERNDISYQTKDTLSPTNASKVPNNPLTAASSVGVPPSL 417

Query: 1781 RTQVNNYGHPVAERKHVPPLLEKRPISQSQTQSRNDFFSLMRKKSVPNSSTESFPAISAS 1960
            R+ + N   P+     VP +LEK+P   +Q +SRNDFF+L+RKKS+ N S+    ++S  
Sbjct: 418  RSPIKN---PIVASGVVPTVLEKKP--SAQLRSRNDFFNLVRKKSLTNHSSPVVDSVSTV 472

Query: 1961 HKKLGEDEGEGEVATSPVQSEEVPVLANIHVVNSSKDRIMKTSNGY-----SCNGC-ESL 2122
             + + E   E +    P    E  +LAN        D +    NG      +C+G  +S 
Sbjct: 473  SQSILEQPSEHKAGAPP--PGEDSLLAN------QSDTVQYKMNGLISNRDACDGTPKSP 524

Query: 2123 HSKRNG---SLCNXXXXXXXXXXXXXXXXGWEENT-EEGGLTDEEISAFYKDLNK 2275
             +  NG   S  +                GW+EN  E+ GLT+EEI  FY+D +K
Sbjct: 525  DNGENGETRSSSDVILCSEEEEAAFLRSLGWDENAGEDEGLTEEEIREFYRDASK 579


>OMO92072.1 hypothetical protein COLO4_17899 [Corchorus olitorius]
          Length = 617

 Score =  313 bits (801), Expect = 9e-92
 Identities = 231/614 (37%), Positives = 326/614 (53%), Gaps = 22/614 (3%)
 Frame = +2

Query: 578  MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSSSTSDQNLGRS 757
            M++SEP+ VP+WL++ GS+TG+  S +  + SSLHSD++      R   S  S +++GR+
Sbjct: 1    MERSEPSLVPEWLKNGGSITGSSNSNHQFTSSSLHSDNHSALRQARNKLSGGSGRHIGRT 60

Query: 758  SASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCXX 937
            SA ER  S+ F RSSS+NG ++ R  S+F            I  Y ++++++L D R   
Sbjct: 61   SALERTSSAYFRRSSSSNGSAHSRPYSNFTKGHRERDREKDINGYHDREKSVLTDHRNRD 120

Query: 938  XXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNPAPDNGH 1117
                          ++ L+R+ SMI  + G+TWPRKV ++P    KS ++N N       
Sbjct: 121  YSDSLDNMLPSMFAKDVLKRTQSMITGKHGDTWPRKVTSNPSANNKSNHSNGNGLLSGVS 180

Query: 1118 V--HKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMW-TSALA 1288
                K++FE++FP LG EEK    EI RVPSPG   A+    V  SA+ GS  W TSALA
Sbjct: 181  TVGTKSAFERDFPVLGAEEKQVGSEIGRVPSPGLGTAV--LPVGTSAVSGSNGWRTSALA 238

Query: 1289 EVPGIVKSNETGVSSAKADALGLPTTASGISTGLSMAETVAQGXXXXXXXXXXSTESQR- 1465
            ++P  V S+ TGV+ A         + +  +TGL+MAET+AQG          + E+QR 
Sbjct: 239  DMPVGVGSSGTGVAVASQSVSASSASMAPPTTGLNMAETLAQGPSRARTPPLLNVETQRL 298

Query: 1466 QELAIKQSRQLIP-VTPSLPKALVSNPPEKSKAKIG-YPHLILS---QRGVAVKPDISKT 1630
            +ELAIKQSRQLIP VT + PK +V +P EKSK K+G   HL LS    RG   + D  K 
Sbjct: 299  EELAIKQSRQLIPLVTTTTPKTMVVSPSEKSKPKVGQQQHLSLSLNYTRGGTSRSDSLKV 358

Query: 1631 SNIGKLQVLKPIRERNDSSLQPIDNLNPT--CEKAVSSVPDATHSVSASSYVRTQVNNYG 1804
            SN  +LQ+LKP RE    SL   DNL+PT    K VSS    T   +AS+  R+  N+  
Sbjct: 359  SNESRLQILKPSRELIGVSLTTKDNLSPTNGSSKPVSSPVSVTPLAAASAPFRSSGNSPN 418

Query: 1805 HPVAERKHVP--PLLEKRPISQSQTQSRNDFFSLMRKKSVPNSSTESFP--AISASHKKL 1972
               AER   P    +EKRP +Q+  QSRNDFF+L++KKS  NSS+   P  A+S S    
Sbjct: 419  FATAERNQNPFRIAIEKRPTAQA--QSRNDFFNLLKKKSTTNSSSVPDPGHAMSPSVPDK 476

Query: 1973 GEDEGEGEVATSPVQSEEVPVLANIHVVNSSKDRIMKTSNG------YSCNGCESLHSKR 2134
             ++    +  TS        +L+    V  + +R   T NG        C+    +HS  
Sbjct: 477  SDELSREDTGTSDALQGGSVLLSESTGVLQTDNRSEVTHNGDALAGSQQCSTNGDMHSSP 536

Query: 2135 NGSLCNXXXXXXXXXXXXXXXXGWEENT-EEGGLTDEEISAFYKDLNKYINSKPPCKIQQ 2311
            +  L                  GWEENT ++ GLT+EEISAF+++   Y+  KP  K+  
Sbjct: 537  DAFL-----YPDEKEAAFLRSLGWEENTGDDEGLTEEEISAFFEE---YMKLKPSAKLFD 588

Query: 2312 GMPRFLLALESQIG 2353
             M + L+ L S  G
Sbjct: 589  RM-QSLVPLNSPNG 601


>EOX97398.1 Uncharacterized protein TCM_006425 isoform 1 [Theobroma cacao]
          Length = 625

 Score =  312 bits (800), Expect = 2e-91
 Identities = 228/598 (38%), Positives = 326/598 (54%), Gaps = 21/598 (3%)
 Frame = +2

Query: 575  MMDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSSSTSDQNLGR 754
            +M++SEP+ VP+WL+S GSVTG+G S +  + SSLHSD++      R   S   D ++G 
Sbjct: 5    VMERSEPSLVPEWLKSGGSVTGSGNSNHQFTSSSLHSDNHSALRPTRNKLSVAGDHDVGG 64

Query: 755  SSASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCX 934
            +S  +R  S+ F RSSS+NG ++LR+ SSF            I  Y +++++++ D R  
Sbjct: 65   TSVLDRTTSAYFRRSSSSNGSAHLRSYSSFTKGHRDRDWDKDINGYHDREKSVISDHRNR 124

Query: 935  XXXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNPAPDNG 1114
                          E++ L RS S I  +R +TWP+KV +   T+ KS +++ N      
Sbjct: 125  NFSDSLDNMLPSVFEKDVLWRSQS-ITGKRSDTWPKKVTSDSSTSNKSNHSSSNGLLSGV 183

Query: 1115 HV---HKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSAL 1285
                 +K+ FE+ FP LG EE+  A EI RV SPG S A  S  V  SA+ GS  WTSAL
Sbjct: 184  STTVGNKSVFEREFPVLGAEERQVASEIGRVSSPGLSTAGQSLPVGTSAISGSDGWTSAL 243

Query: 1286 AEVPGIVKSNETGVSSAKAD-ALGLPTTASGISTGLSMAETVAQGXXXXXXXXXXSTESQ 1462
            A++P  V S+ TGV+ A  + +    + AS   TGL+MAET+ QG          +  +Q
Sbjct: 244  ADMPAGVGSSGTGVAVASQNVSASSASMASTTMTGLNMAETLVQGPSRARTPPLLNVGTQ 303

Query: 1463 R-QELAIKQSRQLIP-VTPSLPKALVSNPPEKSKAKIG-YPHLILS---QRGVAVKPDIS 1624
            R +ELAIKQSRQL+P VT S PK LV +P EKSK K+G   H  LS    RG   + D  
Sbjct: 304  RLEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPKVGQQQHASLSLNYTRGGTSRSDSL 363

Query: 1625 KTSNIGKLQVLKPIRERNDSSLQPIDNLNPT--CEKAVSSVPDATHSVSASSYVRTQVNN 1798
            K SN G+L++LKP RE N  SL   DNL+PT    K V+S    T S SAS+  R+  N+
Sbjct: 364  KVSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSSKLVNSPLSVTPSASASAPFRSSGNS 423

Query: 1799 YGHPVAERKHVP--PLLEKRPISQSQTQSRNDFFSLMRKKSV---PNSSTESFPAISASH 1963
                 AER   P    +EKRP +Q+  QSRNDFF+L++KKS    P+S  +  PA S S 
Sbjct: 424  PSFATAERNQTPFRINIEKRPTAQA--QSRNDFFNLLKKKSTTNSPSSVADRGPAASPSV 481

Query: 1964 KKLGEDEGEGEVATS-PVQSEEVPVLANIHVVNSSKD-RIMKTSNGYSCNGCESLHSKRN 2137
             +  ++ G  + +TS  +Q   VP  + I + +   D R   T NG + +G +   S  +
Sbjct: 482  SEKSDELGTEDASTSVTLQGGSVP-SSEISIADLPTDNRSEITHNGDAYSGSQQCSSNGD 540

Query: 2138 -GSLCNXXXXXXXXXXXXXXXXGWEENT-EEGGLTDEEISAFYKDLNKYINSKPPCKI 2305
              +  +                GWEEN  ++ GLT+EEISAF+++   ++  KP  K+
Sbjct: 541  RHARPDAFLYPDEEEAAFLRSLGWEENAGDDEGLTEEEISAFFEE---HMKLKPSAKL 595


>EOX97399.1 Uncharacterized protein TCM_006425 isoform 2 [Theobroma cacao]
          Length = 620

 Score =  312 bits (799), Expect = 2e-91
 Identities = 228/597 (38%), Positives = 325/597 (54%), Gaps = 21/597 (3%)
 Frame = +2

Query: 578  MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSSSTSDQNLGRS 757
            M++SEP+ VP+WL+S GSVTG+G S +  + SSLHSD++      R   S   D ++G +
Sbjct: 1    MERSEPSLVPEWLKSGGSVTGSGNSNHQFTSSSLHSDNHSALRPTRNKLSVAGDHDVGGT 60

Query: 758  SASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCXX 937
            S  +R  S+ F RSSS+NG ++LR+ SSF            I  Y +++++++ D R   
Sbjct: 61   SVLDRTTSAYFRRSSSSNGSAHLRSYSSFTKGHRDRDWDKDINGYHDREKSVISDHRNRN 120

Query: 938  XXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNPAPDNGH 1117
                         E++ L RS S I  +R +TWP+KV +   T+ KS +++ N       
Sbjct: 121  FSDSLDNMLPSVFEKDVLWRSQS-ITGKRSDTWPKKVTSDSSTSNKSNHSSSNGLLSGVS 179

Query: 1118 V---HKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSALA 1288
                +K+ FE+ FP LG EE+  A EI RV SPG S A  S  V  SA+ GS  WTSALA
Sbjct: 180  TTVGNKSVFEREFPVLGAEERQVASEIGRVSSPGLSTAGQSLPVGTSAISGSDGWTSALA 239

Query: 1289 EVPGIVKSNETGVSSAKAD-ALGLPTTASGISTGLSMAETVAQGXXXXXXXXXXSTESQR 1465
            ++P  V S+ TGV+ A  + +    + AS   TGL+MAET+ QG          +  +QR
Sbjct: 240  DMPAGVGSSGTGVAVASQNVSASSASMASTTMTGLNMAETLVQGPSRARTPPLLNVGTQR 299

Query: 1466 -QELAIKQSRQLIP-VTPSLPKALVSNPPEKSKAKIG-YPHLILS---QRGVAVKPDISK 1627
             +ELAIKQSRQL+P VT S PK LV +P EKSK K+G   H  LS    RG   + D  K
Sbjct: 300  LEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPKVGQQQHASLSLNYTRGGTSRSDSLK 359

Query: 1628 TSNIGKLQVLKPIRERNDSSLQPIDNLNPT--CEKAVSSVPDATHSVSASSYVRTQVNNY 1801
             SN G+L++LKP RE N  SL   DNL+PT    K V+S    T S SAS+  R+  N+ 
Sbjct: 360  VSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSSKLVNSPLSVTPSASASAPFRSSGNSP 419

Query: 1802 GHPVAERKHVP--PLLEKRPISQSQTQSRNDFFSLMRKKSV---PNSSTESFPAISASHK 1966
                AER   P    +EKRP +Q+  QSRNDFF+L++KKS    P+S  +  PA S S  
Sbjct: 420  SFATAERNQTPFRINIEKRPTAQA--QSRNDFFNLLKKKSTTNSPSSVADRGPAASPSVS 477

Query: 1967 KLGEDEGEGEVATS-PVQSEEVPVLANIHVVNSSKD-RIMKTSNGYSCNGCESLHSKRN- 2137
            +  ++ G  + +TS  +Q   VP  + I + +   D R   T NG + +G +   S  + 
Sbjct: 478  EKSDELGTEDASTSVTLQGGSVP-SSEISIADLPTDNRSEITHNGDAYSGSQQCSSNGDR 536

Query: 2138 GSLCNXXXXXXXXXXXXXXXXGWEENT-EEGGLTDEEISAFYKDLNKYINSKPPCKI 2305
             +  +                GWEEN  ++ GLT+EEISAF+++   ++  KP  K+
Sbjct: 537  HARPDAFLYPDEEEAAFLRSLGWEENAGDDEGLTEEEISAFFEE---HMKLKPSAKL 590


>KVI10980.1 hypothetical protein Ccrd_010614 [Cynara cardunculus var. scolymus]
          Length = 629

 Score =  311 bits (798), Expect = 3e-91
 Identities = 234/649 (36%), Positives = 330/649 (50%), Gaps = 60/649 (9%)
 Frame = +2

Query: 578  MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHS------------------------ 685
            M+++EPTFVP+WL+S+G   G+ T+ +  + SSLH                         
Sbjct: 1    MERTEPTFVPEWLKSSG---GSSTTSHQFTSSSLHPGNSYIYVCCFNKYGVNDHNICFDY 57

Query: 686  ---------DDYGTSNVVR-KNSSSTSDQNLGRSSASERIKSSCFWRSSSNNGPSNLRTT 835
                     D+ G+S   R K+S ++SD +LGR+S S+R  SS F R+S  NG ++LR+ 
Sbjct: 58   PSDGIFLAVDEQGSSKSGRNKSSVNSSDNDLGRTSVSDRTTSSYFRRTSLGNGSTHLRSY 117

Query: 836  SSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCXXXXXXXXXXXXXXVERNSLRRSHSMIA 1015
            SSF            IYE+ +K+++   D R                E++ LRRSHS ++
Sbjct: 118  SSFGRNHRDRDWDKDIYEFWSKEKS---DNRHRDYSDPLDNILPSRFEKDGLRRSHSSVS 174

Query: 1016 DQRGETWPRKVETSPKTAYKSKNNNVNPAPDNGHV---HKASFEQNFPSLGVEEKLAAFE 1186
             +RGE+WPRKV +    A KS ++N       G      K SFE++FPSLG +EK A  +
Sbjct: 175  GKRGESWPRKVVSDLSIANKSSHSNGTALLSGGSSLSNVKTSFERDFPSLGADEKQADPD 234

Query: 1187 IRRVPSPGPSLAIHSSQVSASAMRGSGMWTSALAEVPGIVKSNETGVSSAKADALGLPTT 1366
            I RVPSPG S AI S  +  SA+ G   WTSALAEVP IV SN    S ++       T 
Sbjct: 235  IGRVPSPGLSSAIQSLPIGNSAVIGGDGWTSALAEVPVIVGSNGNSTSVSQPVQPTSITA 294

Query: 1367 ASGISTGLSMAETVAQG-----------XXXXXXXXXXSTESQR-QELAIKQSRQLIPVT 1510
             + ++ G +MAET+A G                     +  +QR +ELA+KQSRQLIP+T
Sbjct: 295  TTSMTGGRNMAETLAHGPPRTQTAPQVAQMLLMGSTILTVGTQRLEELAVKQSRQLIPMT 354

Query: 1511 PSLPKALVSNPPEKSKAKIGYPHLI---LSQRGVAVKPDISKTSNIGKLQVLKPIRERND 1681
            PS+PKAL  +  +K K KIG   L+    + R ++VK D+SKTS +GKL VLKP RERN 
Sbjct: 355  PSMPKALALSSSDKPKLKIGQSQLVNHPHTPRPLSVKSDVSKTSTVGKLLVLKPSRERNG 414

Query: 1682 SSLQPIDNLNPTCEKAVSSVPDATHSVSASSYVRTQVNNYGHPVAERKHVPPLLEKRPIS 1861
             S    ++L+PT    + + P A  S   S+ +R   NN G    ERK     LEKRP  
Sbjct: 415  ISPTAKESLSPTGGSKLPNSPLAVPSAIGSAPLRNMGNNPGVTAVERKPSVATLEKRP-- 472

Query: 1862 QSQTQSRNDFFSLMRKKSVPNSSTESFPAISASHKKLGEDEGEGEVATSPVQSEEVPVLA 2041
             SQ QSRN+FF+LMRKKS+             S+  +  D G    + S  +    PV  
Sbjct: 473  SSQAQSRNNFFNLMRKKSM------------ISNSSVAPDTGS---SVSSSEKPGAPVAP 517

Query: 2042 NIHVVNSSKDRIMKTSNGYSCNG------CESLHSKRNGSLCNXXXXXXXXXXXXXXXXG 2203
              H+  S  +  ++T    +C G        S ++ +N S  +                G
Sbjct: 518  PAHLGGSESNTTVETKVDLTCKGDACVATVRSTNNGKNHSGPDAVLCSEEEEARFLRSLG 577

Query: 2204 WEENT-EEGGLTDEEISAFYKDLNKYINSKPPCKIQQG-MPRFLLALES 2344
            W+E   EE GLT+EEIS+FY++   Y+N KP  KI +G  P+ L+ + S
Sbjct: 578  WDETAEEEEGLTEEEISSFYRN---YLNLKPTSKILKGTKPKPLMEISS 623


>XP_007041568.2 PREDICTED: mucin-17 [Theobroma cacao]
          Length = 620

 Score =  309 bits (792), Expect = 2e-90
 Identities = 227/597 (38%), Positives = 324/597 (54%), Gaps = 21/597 (3%)
 Frame = +2

Query: 578  MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSSSTSDQNLGRS 757
            M++SEP+ VP+WL+S GSVTG+G S +  + SSLHSD++      R   S   D ++G +
Sbjct: 1    MERSEPSLVPEWLKSGGSVTGSGNSNHQFTSSSLHSDNHSALRPARNKLSVAGDHDVGGT 60

Query: 758  SASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCXX 937
            S  +R  S+ F RSSS+NG  +LR+ SSF            I  Y +++++++ D R   
Sbjct: 61   SVLDRTTSAYFRRSSSSNGSVHLRSYSSFTKGHRDRDWDKDINGYHDREKSVISDHRNRN 120

Query: 938  XXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNPAPDNGH 1117
                         E++ L RS S I  +R +TWP+KV +   T+ KS +++ N       
Sbjct: 121  FSDSLDNMLPSVFEKDVLWRSQS-ITGKRSDTWPKKVTSDSSTSNKSNHSSGNGLLSGVS 179

Query: 1118 V---HKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSALA 1288
                +K++FE+ FP LG EE+    EI RV SPG S A  S  V  SA+ GS  WTSALA
Sbjct: 180  TTVGNKSAFEREFPVLGAEERQVGSEIGRVSSPGLSTAGQSLPVGTSAISGSDGWTSALA 239

Query: 1289 EVPGIVKSNETGVSSAKAD-ALGLPTTASGISTGLSMAETVAQGXXXXXXXXXXSTESQR 1465
            ++P  V S+ TGV+ A  + +    + AS   TGL+MAET+ QG          +  +QR
Sbjct: 240  DMPAGVGSSGTGVAVASQNVSASSASMASTTMTGLNMAETLVQGPSRARTPPLLNVGTQR 299

Query: 1466 -QELAIKQSRQLIP-VTPSLPKALVSNPPEKSKAKIG-YPHLILS---QRGVAVKPDISK 1627
             +ELAIKQSRQL+P VT S PK LV +P EKSK K+G   H  LS    RG   + D  K
Sbjct: 300  LEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPKVGQQQHASLSLNYTRGGTSRSDSLK 359

Query: 1628 TSNIGKLQVLKPIRERNDSSLQPIDNLNPT--CEKAVSSVPDATHSVSASSYVRTQVNNY 1801
             SN G+L++LKP RE N  SL   DNL+PT    K V+S  + T S SAS+  R+  N+ 
Sbjct: 360  VSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSSKLVNSPLNVTPSASASAPFRSSGNSP 419

Query: 1802 GHPVAERKHVP--PLLEKRPISQSQTQSRNDFFSLMRKKSV---PNSSTESFPAISASHK 1966
                AER   P    +EKRP +Q+  QSRNDFF+L++KKS    P+S  +  PA S S  
Sbjct: 420  SFATAERNQTPFRINIEKRPTAQA--QSRNDFFNLLKKKSTTNSPSSVADRGPAASPSVS 477

Query: 1967 KLGEDEGEGEVATS-PVQSEEVPVLANIHVVNSSKD-RIMKTSNGYSCNGCESLHSKRN- 2137
            +  ++ G  + +TS  +Q   VP  + I + +   D R   T NG +  G +   S  + 
Sbjct: 478  EKSDELGTEDASTSVTLQGGSVP-SSEISIADLPTDNRSEITHNGDAYAGSQQCSSNGDR 536

Query: 2138 GSLCNXXXXXXXXXXXXXXXXGWEENT-EEGGLTDEEISAFYKDLNKYINSKPPCKI 2305
             +  +                GWEEN  ++ GLT+EEISAF+++   ++  KP  K+
Sbjct: 537  HARPDAFLYPDEEEAAFLRSLGWEENAGDDEGLTEEEISAFFEE---HMKLKPSAKL 590


>XP_010277689.1 PREDICTED: uncharacterized protein YMR317W-like [Nelumbo nucifera]
          Length = 655

 Score =  304 bits (778), Expect = 5e-88
 Identities = 241/645 (37%), Positives = 327/645 (50%), Gaps = 49/645 (7%)
 Frame = +2

Query: 578  MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKN-SSSTSDQNLGR 754
            M K EPT VP+WL+  GS+TG G + +  + SS HSDD+  +   R   + ST D +  R
Sbjct: 1    MAKGEPTLVPEWLKGTGSITGGGNTTHHFASSSTHSDDHAVALTTRNRLTMSTGDYDTPR 60

Query: 755  SSAS-ERIKSSCFWRSSSNNGP--------SNLRTTSSFXXXXXXXXXXXXIYEYSNKDR 907
            SSA  +R  S+ F RSSS+NG         +  R+ SSF              +Y +K++
Sbjct: 61   SSAFLDRTSSAYFRRSSSSNGSMMHDKETSTYSRSYSSFTRSHRDRDWEKDTLDYRDKEK 120

Query: 908  TILGDFRCXXXXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNN 1087
            +ILGD R                E+++LRRS SMI+ +RGE W R+V         + +N
Sbjct: 121  SILGDHRDRDYSDPLASILTSRXEKDTLRRSQSMISGKRGEGWSRRVAADTNNG-NNNHN 179

Query: 1088 NVNPAPDNGHV----HKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAM 1255
            N N     G +     KA+FE++FPSLG EEK  A +I RV SPG S ++ S  + +SA+
Sbjct: 180  NGNGLLVGGSIVSSIQKAAFERDFPSLGAEEKQGALDIGRVSSPGLSSSVQSLPIGSSAV 239

Query: 1256 RGSGMWTSALAEVPGIVKSNETGVSSA-KADALGLPTTASGISTGLSMAETVAQGXXXXX 1432
             G   WTSALAEVP I+ +N  G SS  +A      + A   STGL+MAET+AQ      
Sbjct: 240  IGGDGWTSALAEVPVIIGNNSIGPSSVQQATPASSTSGAPNSSTGLNMAETLAQAPSRTR 299

Query: 1433 XXXXXSTESQR-QELAIKQSRQLIPVTPSLPKALVSNPPEKSKAK--------------- 1564
                 S E+QR +ELAIKQSRQLIP+TPS+PK    N  EK+K K               
Sbjct: 300  ISPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSALNSSEKAKPKAVVRTGEMGISAKTS 359

Query: 1565 ----IGYPHLI-LSQRGVAVKPDISKTSNIGKLQVLKPIRERNDSSLQPIDNLNPT-CEK 1726
                +   HL+  S RG  V+ D+ KTS+ GKL VLK  RE+N  S    D L+PT   K
Sbjct: 360  QQQQLPSSHLVNHSLRGGPVRSDVPKTSHGGKLLVLKAPREKNGISPSAKDGLSPTNASK 419

Query: 1727 AVSS----VPDATHSVSASSYVRTQVNNYGHPVAERKHVPPLLEKRPISQSQTQSRNDFF 1894
             V++     P A ++    S   +++ N    VA        +EKRP + SQ QSRNDFF
Sbjct: 420  VVNNSLVLAPLAAYAPPMRSPNNSKLPNERKSVASSLTHGSAVEKRP-TTSQVQSRNDFF 478

Query: 1895 SLMRKKSVPN-SSTESFPAISASHKKLGEDEGEGEVA-TSPV--QSEEVPVLANIHVVNS 2062
            +LMRKK+  N +S    P+ +AS   L +     EV  T+PV  QS + P      +  S
Sbjct: 479  NLMRKKTSGNLASAVPDPSPTASSSLLEKSSEPTEVVPTAPVSPQSSDAPSSEPSGLDWS 538

Query: 2063 SKDRIMKTSNGYSCNGCESLHSKRNG---SLCNXXXXXXXXXXXXXXXXGWEENT-EEGG 2230
            +++     SNG      ES     NG   S  +                GW+EN  EE G
Sbjct: 539  TENGGDLVSNGDVSE--ESQRFSNNGEKRSTADAFVYPDEEEAAFLRSLGWDENAGEEEG 596

Query: 2231 LTDEEISAFYKDLNKYINSKPPCKIQQGMPRFLLALESQIGRVAG 2365
            LT+EEISAFY++  K   S   C+  Q   +  L LES +G  +G
Sbjct: 597  LTEEEISAFYREYMKVRPSSRLCQGAQQQTKVPLPLESHVGSFSG 641


>XP_012828377.1 PREDICTED: uncharacterized protein LOC105949617 isoform X2
            [Erythranthe guttata]
          Length = 550

 Score =  299 bits (766), Expect = 2e-87
 Identities = 220/585 (37%), Positives = 298/585 (50%), Gaps = 19/585 (3%)
 Frame = +2

Query: 578  MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSS-STSDQNLGR 754
            MD+SEP+ VPQWL+++GS TG G             D++  S V R  S  +T+  + GR
Sbjct: 1    MDRSEPSLVPQWLKNSGSSTGGG-------------DNHPASRVARNKSFVNTNGNDFGR 47

Query: 755  SSASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILG-DFRC 931
            +S S +  SS F RSSS+N   + ++ SSF             Y   +K+R +LG D   
Sbjct: 48   ASGSAKTTSSYFRRSSSSNSSGSSKSYSSFGRNQRDRDWEKDTYNSRDKERLVLGGDRHR 107

Query: 932  XXXXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNPAPDN 1111
                           ER+ LRRSHSMI+ + GETWP+KV T   +     N N   A  +
Sbjct: 108  YESSELLGNPSLSKYERDGLRRSHSMISGKHGETWPKKVVTESSSGSGKNNGNGFLAKGS 167

Query: 1112 --GHVHKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSAL 1285
              G  +KA+FE++FPSLG +++    E+ RV SPG S A+ S  + +SA  G   WTSAL
Sbjct: 168  PVGVANKATFERDFPSLGTDDRAVVPEVGRVASPGLSSALQSLPIGSSASIGGERWTSAL 227

Query: 1286 AEVPGIVKSNETGVSSAKADALGLPTTASGI---STGLSMAETVAQGXXXXXXXXXXSTE 1456
            AEVP +V SN T   S +  A    TTAS +   +T L+MAE VAQG          S  
Sbjct: 228  AEVPMLVVSNGTASLSVQ-QAAPSSTTASVVVSSTTSLNMAEAVAQGPTRAQTAPQLSLG 286

Query: 1457 SQR-QELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIG----YP-----HLILSQRGV- 1603
            +QR +ELAIKQSRQLIPVTP++PK LV +  +K K+K+G    +P      +  S RG  
Sbjct: 287  TQRLEELAIKQSRQLIPVTPTMPKTLVLSSSDKQKSKVGLIQQHPTPSSLPINQSPRGAP 346

Query: 1604 AVKPDISKTSNIGKLQVLKPIRERNDSSLQPIDNLNPT-CEKAVSSVPDATHSVSASSYV 1780
              KPD SK SN+GKL VLKP+RE+N  +    D L+PT   KAV+S   A+         
Sbjct: 347  PSKPDFSKASNVGKLHVLKPVREKNGVTPSVKDKLSPTGSGKAVNSTLPAS--------- 397

Query: 1781 RTQVNNYGHPVAERKHVPPLLEKRPISQSQTQSRNDFFSLMRKKSVPNSSTESFPAISAS 1960
                     P A +  +   LEKRP +Q+  QSRNDFF  MR+KSV NSS+ S    + S
Sbjct: 398  ---------PSAVKPLLTTALEKRPTTQA--QSRNDFFKRMREKSVSNSSSASETGTAIS 446

Query: 1961 HKKLGEDEGEGEVATSPVQSEEVPVLANIHVVNSSKDRIMKTSNGYSCNGCESLHSKRNG 2140
             +K      +  V  + +     P+     V  +    +   SNG   N    +  +   
Sbjct: 447  PEK----HAKVAVVPAAITGAVEPLPEEKAVRTTCNGGVQHISNGKKYNSEPIISEEEEA 502

Query: 2141 SLCNXXXXXXXXXXXXXXXXGWEENTEEGGLTDEEISAFYKDLNK 2275
                                GW+EN +EGGLT+EEISAFY+D  K
Sbjct: 503  KFLR--------------SMGWDENDDEGGLTEEEISAFYRDFTK 533

