BLASTX nr result
ID: Lithospermum23_contig00006824
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum23_contig00006824 (2776 letters)

Database: ./nr
115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                     Score  E
Sequences producing significant alignments:                          (bits) Value

CDO97516.1 unnamed protein product [Coffea canephora]                384  e-119
XP_011087795.1 PREDICTED: uncharacterized protein LOC105169167 [...  385  e-119
XP_011092382.1 PREDICTED: uncharacterized protein LOC105172576 [...  370  e-113
XP_002265987.2 PREDICTED: uncharacterized protein LOC100241871 [...  362  e-110
XP_017259125.1 PREDICTED: uncharacterized protein LOC108228142 [...  354  e-107
KZM90816.1 hypothetical protein DCAR_021819 [Daucus carota subsp...  346  e-104
KVI01779.1 hypothetical protein Ccrd_019964, partial [Cynara car...  332  e-100
KVH93465.1 hypothetical protein Ccrd_004483 [Cynara cardunculus ...  332  1e-98
KZV20788.1 hypothetical protein F511_30430 [Dorcoceras hygrometr...  329  5e-98
XP_012828376.1 PREDICTED: uncharacterized protein LOC105949617 i...  327  2e-97
XP_017258903.1 PREDICTED: uncharacterized protein LOC108227982 [...  322  1e-95
XP_010245092.1 PREDICTED: uncharacterized protein LOC104588732 i...  317  7e-93
KZN09159.1 hypothetical protein DCAR_001815 [Daucus carota subsp...  314  2e-92
OMO92072.1 hypothetical protein COLO4_17899 [Corchorus olitorius]    313  9e-92
EOX97398.1 Uncharacterized protein TCM_006425 isoform 1 [Theobro...  312  2e-91
EOX97399.1 Uncharacterized protein TCM_006425 isoform 2 [Theobro...  312  2e-91
KVI10980.1 hypothetical protein Ccrd_010614 [Cynara cardunculus ...  311  3e-91
XP_007041568.2 PREDICTED: mucin-17 [Theobroma cacao]                 309  2e-90
XP_010277689.1 PREDICTED: uncharacterized protein YMR317W-like [...
304 5e-88 XP_012828377.1 PREDICTED: uncharacterized protein LOC105949617 i... 299 2e-87 >CDO97516.1 unnamed protein product [Coffea canephora] Length = 599 Score = 384 bits (986), Expect = e-119 Identities = 254/609 (41%), Positives = 340/609 (55%), Gaps = 20/609 (3%) Frame = +2 Query: 578 MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSS-STSDQNLGR 754 M++SEP+ VP+WL+S+GS TG+GT+ + LSPS DD+ S + R SS + +D +GR Sbjct: 1 MERSEPSLVPEWLKSSGSATGSGTTSHPLSPS----DDHAVSKLARNKSSVNHNDHEIGR 56 Query: 755 SSASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCX 934 SS S+R +S F RSSS+NG +++ SSF +YE ++D ++G + Sbjct: 57 SSVSDRTSASYFRRSSSSNGSGQMQSYSSFGRNHRGRDWDKDLYEPRDRDNLVVGGHKHR 116 Query: 935 XXXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNPAPDNG 1114 E++ LRRS SM++ +R E WP++ +A ++K+ + N D G Sbjct: 117 DYLDPPVNNFPGNFEKDGLRRSQSMVSRKRNEIWPKRSIADSNSASRNKSTDGNSLLDKG 176 Query: 1115 H----VHKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSA 1282 VHK FE++FPSLG EE+ A E+ RVPSPG + AIH +SASA+ WTSA Sbjct: 177 DSVGTVHKVVFERDFPSLGSEERQATSEVGRVPSPGLNTAIHGLPISASAIIAGDKWTSA 236 Query: 1283 LAEVPGIVKSNETGVSSAKADALGLPTTA--SGISTGLSMAETVAQGXXXXXXXXXXSTE 1456 LAEVP IV TG+S + +L + S S GL+MAETVAQG S Sbjct: 237 LAEVPAIVGGGGTGLSPGRQASLPSSPASLPSSTSAGLNMAETVAQGPRVQAAPKITSGT 296 Query: 1457 SQRQELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIGYPH-------LILSQRGVAVKP 1615 + +ELAI+QSRQLIP+TPS+PK + N +K KAK G P L S RG VK Sbjct: 297 QRLEELAIRQSRQLIPMTPSMPKPSILNSSDKGKAKAGQPQHPVSSPLLSPSLRGGPVKT 356 Query: 1616 DISKTSNIGKLQVLKPIRERNDSSLQPIDNLNPTCEK--AVSSVPDATHSVSASSYVRTQ 1789 D SKTSN GKL VLKP RERN S D L+PT A S + AT SV+ + R Sbjct: 357 DASKTSNAGKLLVLKPPRERNGVSTASKDTLSPTSSTRAATSGIAVAT-SVTGLATSRGP 415 Query: 1790 VNNYGHPVAERKHVPPLLEKRPISQSQTQSRNDFFSLMRKKSVPNSSTESFPAISASHKK 1969 N P AERKH P+LEK+P SQ QSRNDFF+LMRKKS+P+SS+ + + S Sbjct: 416 AINPVSPGAERKHALPMLEKKP--SSQAQSRNDFFNLMRKKSMPSSSSVADAGSAVSAST 473 Query: 1970 LGEDEGEGEVATSPV--QSEEVPVLANIHVVNSSKDRIMKTSNGYSCNGCESLHSKRNGS 2143 L E GE EV +PV + E+VP L DR+ NGC+ + G Sbjct: 474 
LDEP-GELEVIPAPVIHEDEDVPSL----------DRL---------NGCQHTENDLFGI 513 Query: 2144 LC-NXXXXXXXXXXXXXXXXGWEENTEEGGLTDEEISAFYKDLNKYINSKPPCKIQQGM- 2317 + GW+EN +E GLT+EEI+AF++DL+KY+NSKP K QG+ Sbjct: 514 QSRSLPLFSEEEEAAFLHQLGWQENADEDGLTEEEINAFFRDLSKYMNSKPSSKSLQGVQ 573 Query: 2318 PRFLLALES 2344 P+F L L S Sbjct: 574 PKFPLLLSS 582 >XP_011087795.1 PREDICTED: uncharacterized protein LOC105169167 [Sesamum indicum] Length = 624 Score = 385 bits (988), Expect = e-119 Identities = 256/624 (41%), Positives = 347/624 (55%), Gaps = 27/624 (4%) Frame = +2 Query: 578 MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSSSTSD-QNLGR 754 M++SEPT VP+WL++ G++TG G S HSDD+ S V R S S+ GR Sbjct: 1 MERSEPTLVPEWLKNTGNLTGAG--------SISHSDDHAASRVARNKSFVNSNGHEFGR 52 Query: 755 SSASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCX 934 SS+SER SS F RSSS+N N R+ SSF +Y+ ++D+++L D Sbjct: 53 SSSSERTTSSYFRRSSSSNSSGNFRSYSSFGRSQRDRDWEKDVYDSRDQDKSVLADHWHW 112 Query: 935 XXXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNV--NPAPD 1108 ER+ LRRS SM++ +RG+TWP+KV T +A N + +P Sbjct: 113 DFSDPLGNSLLSKYERDGLRRSQSMVSGKRGDTWPKKVVTDLSSASGKNANGLLYRGSPV 172 Query: 1109 NGHVHKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSALA 1288 G KA+FE++FPSLG +E+ E+ RVPSPG S AI S V S + WTSALA Sbjct: 173 GGRAKKATFEKDFPSLGADERAVVPEVGRVPSPGLSTAIQSLPVGTSGLIVGEKWTSALA 232 Query: 1289 EVPGIVKSNETGVSSA-KADALGLPTTASGISTGLSMAETVAQGXXXXXXXXXXSTESQR 1465 EVP +V SN T +SS +A + A G +T L+MAE VAQG S +QR Sbjct: 233 EVPVLVGSNGTALSSVQQAAPSSSASVALGSTTSLNMAEAVAQGPSRAQTTPQLSVGTQR 292 Query: 1466 -QELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIGYPH--------LILSQRGVAVKPD 1618 +ELAIKQSRQLIPVTPS+PKALV +K K K+G L S RG AVK D Sbjct: 293 LEELAIKQSRQLIPVTPSMPKALVLTSSDKPKGKVGQQQHSISSSLPLNHSPRGGAVKGD 352 Query: 1619 ISKTSNIGKLQVLKPIRERNDSSLQPIDNLNPT-CEKAVSSVPDATHSVSASSYVRTQVN 1795 ++K SN+GKLQVLKP+RE+N + DNL+PT K V+S + SVS S+ R N Sbjct: 353 VAKASNVGKLQVLKPVREKNGVTPVVKDNLSPTSSSKVVTSTLAVSPSVSGSAATRGLPN 412 Query: 1796 NYGHPVAERKHVPPLLEKRPISQSQTQSRNDFFSLMRKKSVPNSSTESFPAISASHKK-- 
1969 N H +RK +LEKRP SQ+ QSRNDFF+L+RKKS+PNSS+ + A+ Sbjct: 413 NGVH---DRKPSLTVLEKRPTSQA--QSRNDFFNLVRKKSMPNSSSAVADSAMANCSSVL 467 Query: 1970 -----LGEDEGEGEVATSPVQSEEVPVLANIHVVNS-SKDRIMKTSNGYSCNG--CESLH 2125 + + +V + S P A++ + NS S DR+ + + NG C++ + Sbjct: 468 DTGTAISPSFSDKDVEIDILPSSNTPKAADVPLSNSLSADRLSEEKGDLTSNGDACDAQN 527 Query: 2126 SKRNGSL--CNXXXXXXXXXXXXXXXXGWEENTEEGGLTDEEISAFYKDLNKYINSKPPC 2299 RNG + GW+EN++EG LTDEEI+AFY+DL KYI+S P Sbjct: 528 YVRNGKKYPSSDPIISEEEEAAFLRSLGWDENSDEGALTDEEINAFYRDLTKYIDSNPSF 587 Query: 2300 KIQQGMP-RFLLALESQIGRVAGI 2368 +I QG+ +FLL S++G + GI Sbjct: 588 RILQGVQLKFLLPFGSELGGIGGI 611 >XP_011092382.1 PREDICTED: uncharacterized protein LOC105172576 [Sesamum indicum] Length = 616 Score = 370 bits (951), Expect = e-113 Identities = 249/624 (39%), Positives = 344/624 (55%), Gaps = 29/624 (4%) Frame = +2 Query: 578 MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSSSTSD-QNLGR 754 M++SEPT +P+WLRS GS+ G G S HSD+ T+ + R S S+ + R Sbjct: 1 MERSEPTLIPEWLRSAGSLNGGG--------SISHSDEQTTTKLARNKSLVNSNGHDSAR 52 Query: 755 SSASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCX 934 S +S+R SS F RSSS+NG +LR+ SSF + +KD+++LGD Sbjct: 53 SFSSDRTTSSYFRRSSSSNGSGHLRSHSSFGRNHHDRDWEKDACDSRDKDKSVLGDRWHR 112 Query: 935 XXXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNPAPDNG 1114 ER+ LRRS SMI+ +RG+TW +KV T A NN N P G Sbjct: 113 DFSDAMGNTLLSKFERDGLRRSQSMISGKRGDTWHKKVGTDLNIA---SGNNTNGLPSKG 169 Query: 1115 H----VHKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSA 1282 V+K +FE++FPSLG EE+ A E+ RVPSPG S A+ S + + W SA Sbjct: 170 SPIGGVNKTTFERDFPSLGAEERAAIPEVGRVPSPGVSSALQSLPIGTPTIIRGEKWRSA 229 Query: 1283 LAEVPGIVKSNETGVSSA-KADALGLPTTASGISTGLSMAETVAQGXXXXXXXXXXSTES 1459 LAEVP +V +N TG+SS +A + A G +T L+MAE VAQG S + Sbjct: 230 LAEVPVLVGNNVTGISSVQQAAPSSSASVALGSTTSLNMAEAVAQGPSRAQTTPQLSIGT 289 Query: 1460 QR-QELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIGYPHLIL--------SQRGVAVK 1612 QR +ELAIKQSRQLIPVTPS+PK L + +K K K+G ++ S RG VK Sbjct: 290 
QRLEELAIKQSRQLIPVTPSMPKPLAACSADKQKTKVGQQQHVVTSSLAANQSPRGGPVK 349 Query: 1613 PDISKTSNIGKLQVLKPIRERNDSSLQPIDNLNPTCEKAVSSVPDATHSVSASSYVRTQV 1792 D+SKTSN+GKL VLKP+RE+N ++ +NL+PT + S P A S+S S+ R Sbjct: 350 ADVSKTSNVGKLHVLKPVREKNGTTPVVKENLSPTSGSKLVSSPLAAPSLSGSAATRVLP 409 Query: 1793 NNYGHPVAERKHVPPLLEKRPISQSQTQSRNDFFSLMRKKSVPNSSTESFPAIS------ 1954 NN PVA+RK V +LEKRP SQ+ QSRNDFF+ +RKKS+ NS++ + AI+ Sbjct: 410 NN---PVADRKPVWTVLEKRPTSQA--QSRNDFFNSVRKKSMANSTSVADAAIANSSPVD 464 Query: 1955 ---ASHKKLGEDEGEGEVATSPVQSEEVPVLANIHVVNSSKDRIMKTSNGYSCNG--CES 2119 A+ + E E+ +P + VN S + + T + +CNG C++ Sbjct: 465 TAPAASPSFSDKLTETEIVVAPNTQDRNASSG----VNLSGENLSGTRSDTACNGDVCDA 520 Query: 2120 LHSKRNG--SLCNXXXXXXXXXXXXXXXXGWEENTEEGGLTDEEISAFYKDLNKYINSKP 2293 + NG + + GWEEN +EGGLTDEEISAF++D+ KY++SKP Sbjct: 521 QNYVSNGKKNHTSDPIFSEEEEAAFLRSLGWEENADEGGLTDEEISAFFRDVTKYVDSKP 580 Query: 2294 PCKIQQGM-PRFLLALESQIGRVA 2362 KI Q + P+ LL +S IG ++ Sbjct: 581 SLKILQAVQPKILLPFDSHIGGIS 604 >XP_002265987.2 PREDICTED: uncharacterized protein LOC100241871 [Vitis vinifera] Length = 665 Score = 362 bits (929), Expect = e-110 Identities = 264/657 (40%), Positives = 359/657 (54%), Gaps = 61/657 (9%) Frame = +2 Query: 578 MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSSSTSDQNLGRS 757 MDK+EP VP+WL+S+GSVTG G++ + +PS L SDD RK +++D + GRS Sbjct: 1 MDKTEPALVPEWLKSSGSVTGGGSTNHHFAPSLLQSDDGAALKPARKLMVNSNDHDTGRS 60 Query: 758 SASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCXX 937 S ER SS F RSSS+NG + R+ SSF I++Y +KD+++L D R Sbjct: 61 SNLERTTSSYFRRSSSSNGSGHPRSFSSFGRTNREREWEKDIHDYRDKDKSVLSDHRHRD 120 Query: 938 XXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNPAPDNG- 1114 +ER+ LRRS SMI +RG+ WPRKV T K+ ++N + +G Sbjct: 121 YSDPLGNILPGRLERDMLRRSQSMITGKRGDMWPRKVAADVSTVNKTIHSNGDGQLASGI 180 Query: 1115 ---HVHKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSAL 1285 V KA+F++NFPSLG E+K A +I RV SPG + AI S + + + G WTSAL Sbjct: 181 VTSSVQKAAFDRNFPSLGAEDKQGAPDIGRVTSPGLTSAIQSLPIGNTVVIGGDGWTSAL 240 Query: 1286 
AEVPGIVKSNETGVSSAKAD-ALGLPTTASGISTGLSMAETVAQG--XXXXXXXXXXSTE 1456 AEVP I+ SN TGVSS + + + A ++GL+MAET+ QG S Sbjct: 241 AEVPVIIGSNTTGVSSVQQSVSASSVSVAPSTTSGLNMAETLVQGPARARANATPQLSVG 300 Query: 1457 SQR-QELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIGYPHLIL---SQRGVAVKPDIS 1624 +QR +ELA+KQSRQLIP+TPS+PK LV +P +K K+KIG L L SQRG + D++ Sbjct: 301 TQRLEELALKQSRQLIPMTPSMPKTLVPSPSDKPKSKIGLQPLHLVNHSQRGGPARSDVT 360 Query: 1625 KTSNIGKLQVLKPIRERNDSSLQPIDNLNPTCEKAVSSVPDA-THSVSASSYVRTQVNNY 1801 KTSN+GKL VLKP RERN S D+L+PT V++ P A T S + S+ +R+ NN Sbjct: 361 KTSNVGKLHVLKPSRERNGVSPTAKDSLSPTMGSRVANSPLAVTPSAAGSASLRSPRNNP 420 Query: 1802 GHPVAERKH--VPPLLEKRPISQSQTQSRNDFFSLMRKKS---VPNSSTESFPAISASHK 1966 AER+ V +EKRP SQ+ QSRNDFF+LMRKKS P++ ES PA+S+S Sbjct: 421 TLASAERRPSVVLTSVEKRPTSQA--QSRNDFFNLMRKKSSTNPPSAVPESGPAVSSSVS 478 Query: 1967 KLGEDEGEGEVATSPVQSEEVPVLA--NIHVVNSSKDRIMKTSNGY-------------- 2098 + DE EV T+PV + +L+ N + S+++R KT NG Sbjct: 479 E-KSDELITEVVTAPVTPKGRDILSSDNSGLDWSNENRGDKTENGNNEACGVSQNDRDDE 537 Query: 2099 --SCNGCESLHSKR---------NGSLC----------------NXXXXXXXXXXXXXXX 2197 + NG S+R NG C + Sbjct: 538 IDNVNGDACDVSQRDQGDEVHDGNGDACDVSQKFLDNGEKHSSPDEVLYPDEEEAAFLRS 597 Query: 2198 XGWEENTEEGGLTDEEISAFYKDLNKYINSKPPCKIQQGM-PRFLLALESQIGRVAG 2365 GWEEN E+ GLT+EEI+AFYK+ K KP + Q M P+ L+SQ+G VAG Sbjct: 598 LGWEENGEDEGLTEEEINAFYKECMKL---KPSSNLLQRMLPKISPLLDSQMGSVAG 651 >XP_017259125.1 PREDICTED: uncharacterized protein LOC108228142 [Daucus carota subsp. 
sativus] Length = 620 Score = 354 bits (908), Expect = e-107 Identities = 253/621 (40%), Positives = 338/621 (54%), Gaps = 24/621 (3%) Frame = +2 Query: 578 MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSSSTSDQNLGRS 757 M+KSEPT VP+WL+S+GSVTG G S N L+PS LH D+ T R S N+G Sbjct: 1 MEKSEPTLVPEWLKSSGSVTG-GVSTNHLNPS-LHQDNQATLKAARNKSLV----NIGDH 54 Query: 758 SASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCXX 937 R SS F RSSSN G S+LR+ SF I++ +K+++ LGD + Sbjct: 55 DIGHRTTSSYFRRSSSN-GTSHLRSYGSFGRNNRDRDWDRDIHDIRDKEKSNLGDRKYRQ 113 Query: 938 XXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNP----AP 1105 E++ LRR+ S I+ E WPR+V + K KS +NN N + Sbjct: 114 FSDSFESNSLSRFEKDGLRRTQSTISRTGVEPWPRRVPSDLKNIDKSNHNNGNSRLAVSS 173 Query: 1106 DNGHVHKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSAL 1285 VHKASF+++FPSLG EE+ EI RVPSPG AI + +SA G WTSAL Sbjct: 174 PISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIADGGWTSAL 233 Query: 1286 AEVPGIVKSNETGVSSAKADALGLPTTASGISTGLSMAETVAQGXXXXXXXXXXSTESQR 1465 AEVP ++ SN T SS + + TGL+MAET+ QG S E+QR Sbjct: 234 AEVPAMIGSNGTTASSVPHSVSSSASVVPSMMTGLNMAETLVQGPPRVQADPQLSVETQR 293 Query: 1466 -QELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIGYPH-------LILSQRGVAVKPDI 1621 +ELAIKQSRQLIPVTPSLPKALV N +K+K K+G + S RG K +I Sbjct: 294 LEELAIKQSRQLIPVTPSLPKALVLNSSDKAKGKVGLQQQSASTNLVHHSPRGAPTKNEI 353 Query: 1622 SKTSNIGKLQVLKPIRERNDSSLQPIDNLNPTCEKAVSSVPDA-THSVSASSYVRTQVNN 1798 KTS++GKLQVLKP RERN S D L+PT +++ P A + S+ +R+ +N+ Sbjct: 354 IKTSSLGKLQVLKPARERNGVSNTSKDTLSPTSSSKLANNPLAPALATVGSAPLRSSMNH 413 Query: 1799 YGHPVAERKHVP-----PLLEKRPISQSQTQSRNDFFSLMRKKSVPNSSTESFPAISA-S 1960 AERK P P+LEKRP Q +SRNDFF+ MRKKS+ NSS+ +SA S Sbjct: 414 SILVSAERKSAPPVMVTPMLEKRP--SPQAKSRNDFFNSMRKKSMTNSSSAVSNTVSAVS 471 Query: 1961 HKKLGEDEGEGEVATS-PVQSEEVPVL--ANIHVVNSSKDRIMKTSNGYSCNGCESLHSK 2131 LG++ EGE + S Q +VPV+ ++ +N +D ++ S+G SL + Sbjct: 472 PSDLGKN-SEGEASASLDSQGRDVPVVESSDEGKINECRDGSIQNSHGPQ----NSLDNG 526 Query: 2132 RNGSLCNXXXXXXXXXXXXXXXXGWEENT-EEGGLTDEEISAFYKDLNKYINSKPPCKIQ 2308 N S 
+ GWEEN E+ GLT+EEI+AFY+D++KYINS PP K Sbjct: 527 VNHSSTDVILSSEEEEAAFLRSLGWEENAGEDEGLTEEEINAFYRDVSKYINSAPPSKTL 586 Query: 2309 QGMPRFLLA-LESQIGRVAGI 2368 G + L + Q+G G+ Sbjct: 587 LGTKQKLFGPINFQMGSNGGV 607 >KZM90816.1 hypothetical protein DCAR_021819 [Daucus carota subsp. sativus] Length = 617 Score = 346 bits (887), Expect = e-104 Identities = 252/621 (40%), Positives = 335/621 (53%), Gaps = 24/621 (3%) Frame = +2 Query: 578 MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSSSTSDQNLGRS 757 M+KSEPT VP+WL+S+GSVTG G S N L+PS LH D+ T R S N+G Sbjct: 1 MEKSEPTLVPEWLKSSGSVTG-GVSTNHLNPS-LHQDNQATLKAARNKSLV----NIGDH 54 Query: 758 SASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCXX 937 R SS F RSSSN G S+LR+ SF I++ +K+++ LGD + Sbjct: 55 DIGHRTTSSYFRRSSSN-GTSHLRSYGSFGRNNRDRDWDRDIHDIRDKEKSNLGDRKYRQ 113 Query: 938 XXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNP----AP 1105 E++ LRR+ S I+ E WPR+V + K KS +NN N + Sbjct: 114 FSDSFESNSLSRFEKDGLRRTQSTISRTGVEPWPRRVPSDLKNIDKSNHNNGNSRLAVSS 173 Query: 1106 DNGHVHKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSAL 1285 VHKASF+++FPSLG EE+ EI RVPSPG AI + +SA G WTSAL Sbjct: 174 PISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIADGGWTSAL 233 Query: 1286 AEVPGIVKSNETGVSSAKADALGLPTTASGISTGLSMAETVAQGXXXXXXXXXXSTESQR 1465 AEVP ++ SN T SS + + TGL+MAET+ QG S E+QR Sbjct: 234 AEVPAMIGSNGTTASSVPHSVSSSASVVPSMMTGLNMAETLVQGPPRVQADPQLSVETQR 293 Query: 1466 -QELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIGYPH-------LILSQRGVAVKPDI 1621 +ELAIKQSRQLIPVTPSLPKALV N +K+K K+G + S RG K +I Sbjct: 294 LEELAIKQSRQLIPVTPSLPKALVLNSSDKAKGKVGLQQQSASTNLVHHSPRGAPTKNEI 353 Query: 1622 SKTSNIGKLQVLKPIRERNDSSLQPIDNLNPTCEKAVSSVPDA-THSVSASSYVRTQVNN 1798 KTS++GKLQVLKP RERN S D L+PT +++ P A + S+ +R+ +N+ Sbjct: 354 IKTSSLGKLQVLKPARERNGVSNTSKDTLSPTSSSKLANNPLAPALATVGSAPLRSSMNH 413 Query: 1799 YGHPVAERKHVP-----PLLEKRPISQSQTQSRNDFFSLMRKKSVPNSSTESFPAISA-S 1960 AERK P P+LEKRP Q +SRNDFF+ MRKKS+ NSS+ +SA S Sbjct: 414 SILVSAERKSAPPVMVTPMLEKRP--SPQAKSRNDFFNSMRKKSMTNSSSAVSNTVSAVS 471 Query: 
1961 HKKLGEDEGEGEVATS-PVQSEEVPVL--ANIHVVNSSKDRIMKTSNGYSCNGCESLHSK 2131 LG++ EGE + S Q +VPV+ ++ +N +D ++ S+G SL + Sbjct: 472 PSDLGKN-SEGEASASLDSQGRDVPVVESSDEGKINECRDGSIQNSHGPQ----NSLDNG 526 Query: 2132 RNGSLCNXXXXXXXXXXXXXXXXGWEENT-EEGGLTDEEISAFYKDLNKYINSKPPCKIQ 2308 N S + GWEEN E+ GLT+EEI+AFY+D YINS PP K Sbjct: 527 VNHSSTDVILSSEEEEAAFLRSLGWEENAGEDEGLTEEEINAFYRD---YINSAPPSKTL 583 Query: 2309 QGMPRFLLA-LESQIGRVAGI 2368 G + L + Q+G G+ Sbjct: 584 LGTKQKLFGPINFQMGSNGGV 604 >KVI01779.1 hypothetical protein Ccrd_019964, partial [Cynara cardunculus var. scolymus] Length = 551 Score = 332 bits (851), Expect = e-100 Identities = 227/579 (39%), Positives = 328/579 (56%), Gaps = 16/579 (2%) Frame = +2 Query: 578 MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVR-KNSSSTSDQNLGR 754 M+++EPTFVP+WL+S+GS++ T + + SSLH DD G S +R K+ ++ D +LGR Sbjct: 1 MERTEPTFVPEWLKSSGSLS---TISHQFTSSSLHPDDQGVSKSLRTKSLVNSGDNDLGR 57 Query: 755 SSASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCX 934 +S S+R SS F R+SS+NG ++LR+ +SF IYE+ +K+++ D R Sbjct: 58 TSVSDRTTSSYFRRTSSSNGAAHLRSYNSFSRNHRDRDWDKDIYEFRDKEKS---DNRHR 114 Query: 935 XXXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNPAPDNG 1114 E++ LRRSHS ++ +RGE+WPRKV K+ +NN + P G Sbjct: 115 DYSDHLANILPSRFEKDGLRRSHSSLSAKRGESWPRKVAGD-----KNGHNNGSALPSVG 169 Query: 1115 HVH---KASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSAL 1285 KA+FE++FPSLG EEK A EI RVPSPG + AI S + +SA+ MWTSAL Sbjct: 170 TSSSSGKAAFERDFPSLGAEEKQADTEIGRVPSPGLTTAIQSLPIGSSAVICGDMWTSAL 229 Query: 1286 AEVPGIVKSNETGVSSAKADALGLPTTASGISTGLSMAETVAQGXXXXXXXXXXSTESQR 1465 AEVP IV SN + +S + + + ++TG +MAET+AQG S +QR Sbjct: 230 AEVPMIVGSNGSNISVQQPIQPTSVSATTSMTTGRNMAETLAQGPSRARTTPQLSVGTQR 289 Query: 1466 -QELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIGYPHLILSQ--------RGVAVKPD 1618 +ELA+KQSRQLIP+TPS+PKAL N +K K K+G L S R V+VK D Sbjct: 290 LEELAVKQSRQLIPMTPSMPKALALNSSDKPKLKVGQSQLQNSHIVNHPPSLRPVSVKSD 349 Query: 1619 ISKTSNIGKLQVLKPIRERNDSSLQPIDNLNPTCEKAVSSVPDATHSVSASSYVRTQVNN 1798 ++K S +GKL +LK RERN ++ ++L+PT 
+ + P A V S+ +R N Sbjct: 350 VTKVSTVGKLHILKSSRERNGTTSTAKESLSPTGGSKLPNSPLAVPVVVGSASLR---NT 406 Query: 1799 YGHP-VAERKHVPPLLEKRPISQSQTQSRNDFFSLMRKKSVPNSSTESFPAISASHKKLG 1975 G VA+RK P +EKRP Q QSRNDFF+LMRKKS+ +S+ + + S + Sbjct: 407 GGSTIVADRK---PCVEKRP--SPQAQSRNDFFNLMRKKSMATNSSSPGASEAGSSESTN 461 Query: 1976 EDEGEGEVATSPVQSEEVPVLANIHVVNSSKDRIMKTSNGYSCNG--CESLHSKRNGSLC 2149 + GE +V + V V + V S++++ +SCNG E ++++N S Sbjct: 462 DKPGEPQVG----GYDPVVVDRSCGVQTLSENKV-----DFSCNGDATERSNNEKNHSSS 512 Query: 2150 NXXXXXXXXXXXXXXXXGWEENTEEGGLTDEEISAFYKD 2266 + GWEE TEE GLT+EEI++FY+D Sbjct: 513 DAILYSEEEEARFLRSLGWEETTEEEGLTEEEINSFYRD 551 >KVH93465.1 hypothetical protein Ccrd_004483 [Cynara cardunculus var. scolymus] Length = 636 Score = 332 bits (850), Expect = 1e-98 Identities = 250/630 (39%), Positives = 327/630 (51%), Gaps = 52/630 (8%) Frame = +2 Query: 572 LMMDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHS---------------------- 685 L M++SEPTFVP+WL+S+G G T+ + L SSLHS Sbjct: 8 LTMERSEPTFVPEWLKSSG---GLSTTSHQLQSSSLHSGNSIHFISQQYMLFGISFQFCY 64 Query: 686 --------DDYGTSNVVRKNSS-STSDQNLGRSSASERIKSSCFWRSSSNNGPSNLRTTS 838 D+ G S R S + SD LGR S S+R SS F R+SSN G S+LR+ S Sbjct: 65 LPDNVVLLDEQGVSKATRNKSFVNISDNELGRPSVSDRTTSSYFRRTSSN-GSSHLRSYS 123 Query: 839 SFXXXXXXXXXXXXIYEYSNKDRTILGDFRCXXXXXXXXXXXXXXVERNSLRRSHSMIAD 1018 SF I+E+ K++ D R E+ LRRSHS ++ Sbjct: 124 SFGRNHRDRDWDKDIHEFREKEKP---DGRLRDYSDPLGNILPSRFEKEGLRRSHSSVSA 180 Query: 1019 QRGETWPRKVETSPKTAYKSKNNNVNPAPDN----GHVHKASFEQNFPSLGVEEKLAAFE 1186 +RGE+WPRKV +A K+ +NN + G V K +FE++FPSLG EEK E Sbjct: 181 KRGESWPRKVVVDSSSANKNSHNNGSALRSGAGAIGSV-KTAFERDFPSLGAEEKQIDPE 239 Query: 1187 IRRVPSPGPSLAIHSSQVSASAMRGSGMWTSALAEVPGIVKSNETGVS-SAKADALGLPT 1363 I RVPSPG + AI S + SA+ G WTSALAEVP IV SN + S + + Sbjct: 240 IGRVPSPGLTTAIQSLPIGNSAVIGGDGWTSALAEVPVIVGSNGSNTSVPPPLQSTSISA 299 Query: 1364 TASGISTGLSMAETVAQGXXXXXXXXXXSTESQR-QELAIKQSRQLIPVTPSLPKALVSN 1540 TAS ++TG +MAET+AQG S +QR +ELA+KQSRQLIP+TPSLPKAL N Sbjct: 300 
TAS-MATGRNMAETLAQGPPRAQTAPQLSVGTQRLEELAVKQSRQLIPMTPSLPKALALN 358 Query: 1541 PPEKSKAKIGY-----PHLIL---SQRGVAVKPDISKTSNIGKLQVLKPIRERNDSSLQP 1696 +K K+K+G HL+ S R V+ K D+SKTS++GKL VLKP RERN + Sbjct: 359 SSDKPKSKVGQLQLQSSHLVNHTHSPRPVSTKFDVSKTSSVGKLHVLKPSRERNGITPIA 418 Query: 1697 IDNLNPTCEKAVSSVPDATHSVSASSYVRTQVNNYGHPVAERKHVPPLLEKRPISQSQTQ 1876 DNL+PT + + P A SV S+ +R NN VA + V LEKRP SQ Q Sbjct: 419 KDNLSPTGASKLPNSPLAVTSVVGSAPLRNLGNNPAVAVAVKPGVAATLEKRP--SSQAQ 476 Query: 1877 SRNDFFSLMRKKSVPNSSTESFP----AISASHKKLGEDEGEGEVATSPVQSEEVPVLAN 2044 SRNDFF+LMRKKS+ N+S+ P +ISA K + G + V Sbjct: 477 SRNDFFNLMRKKSMTNNSSPVTPDTGSSISAGDKPTATEGG--------IDPAVVDGSGG 528 Query: 2045 IHVVNSSKDRIMKTSNGYSCNG--CESLHSKRNGSLCNXXXXXXXXXXXXXXXXGWEE-N 2215 + V + +K + SCNG E + K N S GWEE Sbjct: 529 VQVSSGNKVDLS------SCNGEATERSNGKNNSSSDAIILYSEEEEARFLRSLGWEETG 582 Query: 2216 TEEGGLTDEEISAFYKDLNKYINSKPPCKI 2305 EE GLT+EEIS+FY+D++KY+N + KI Sbjct: 583 EEEEGLTEEEISSFYRDVSKYLNLQAASKI 612 >KZV20788.1 hypothetical protein F511_30430 [Dorcoceras hygrometricum] Length = 601 Score = 329 bits (843), Expect = 5e-98 Identities = 240/602 (39%), Positives = 324/602 (53%), Gaps = 18/602 (2%) Frame = +2 Query: 578 MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNS-SSTSDQNLGR 754 MDKSEPT VP+WL+++G+ +G G S+LHSDD + R NS S++ + GR Sbjct: 1 MDKSEPTLVPEWLKNSGNQSGGG--------STLHSDDKSAPKLSRNNSFMSSNGHDFGR 52 Query: 755 SSASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCX 934 SS+SE+ SS F RSSS+NG NLR+ +SF Y+ +KD+++ GD Sbjct: 53 SSSSEKTTSSYFHRSSSSNGSGNLRSYNSFGRNRRDRDWEKDRYDSQDKDKSVSGDRWHR 112 Query: 935 XXXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNV--NPAPD 1108 E + LRRS S + G+TW +KV T +A + N + AP Sbjct: 113 VFSDSSGNSFSGKFEWDGLRRSQSATSGPHGDTWTKKVVTDSSSAGGNNTNTLLTKGAPG 172 Query: 1109 NGHVHKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSALA 1288 G V K FE+NFPSLG EE+ E+ RVPSPG S AI S + +A G WTSALA Sbjct: 173 GG-VTKTRFERNFPSLGSEERAVIPEVGRVPSPGLSSAIQSLPIGTAAAVGGEKWTSALA 231 Query: 1289 
EVPGIVKSNETGVSSAKADALGLPTTASGISTGLSMAETVAQGXXXXXXXXXXSTESQR- 1465 EVP IV SN GVSS A AS +T L+MAE VAQG S +QR Sbjct: 232 EVPVIVGSNGIGVSSVTQSA--STQLASSTTTTLNMAEAVAQGPSRSPAMPQISVGTQRL 289 Query: 1466 QELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIG-YPHLILSQRGVAVKPDISK-TSNI 1639 +ELAIKQSRQLIPVTPS+PK LVSN +K K K+G H I S K D+SK +SN+ Sbjct: 290 EELAIKQSRQLIPVTPSMPKTLVSNSSDKQKTKLGQQQHSINSLPINHSKSDMSKSSSNV 349 Query: 1640 GKLQVLKPIRERNDSSLQPIDNLNP-TCEKAVSSVPDATHSVSASSYVRTQVNNYGHPVA 1816 GKL VLK RE+N + DNL+P T AVSS + SV+ + + N PV Sbjct: 350 GKLHVLKLTREKNGVAPVVKDNLSPTTAVNAVSSTLLTSPSVTGAVASKGPPN---MPVL 406 Query: 1817 ERKHVPPLLEKRPISQSQTQSRNDFFSLMRKKSVPNSST----ESFPAISASH------K 1966 RK +LEKR SQ+Q QSR +FF+L+RKKS+ S++ E+F ++ + H Sbjct: 407 NRKPSLAVLEKRNTSQAQAQSRKEFFNLVRKKSMAISTSATDAENFSSVDSGHAVSPPPS 466 Query: 1967 KLGEDEGEGEVATSPVQSEEVPVLANIHVVNSSKDRIMKTSNGYSCNGCESLHSKRNGSL 2146 + E E TS + + + + +D + T +C+ + L + N S+ Sbjct: 467 ETSEKEDVPAPNTSQIDDAQSSASLSDDLFPEKRDDV--TCPDDTCSMPKYLGNGMNASM 524 Query: 2147 CNXXXXXXXXXXXXXXXXGWEENTEEGGLTDEEISAFYKDLNKYINSKPPCKIQQGM-PR 2323 GWEEN++EGGLT+EEIS+F+KD KY NSKP +I + + P+ Sbjct: 525 --DPLFSEEEEAAFLRSLGWEENSDEGGLTEEEISSFFKDATKY-NSKPALRILEVVQPK 581 Query: 2324 FL 2329 F+ Sbjct: 582 FI 583 >XP_012828376.1 PREDICTED: uncharacterized protein LOC105949617 isoform X1 [Erythranthe guttata] Length = 575 Score = 327 bits (837), Expect = 2e-97 Identities = 236/615 (38%), Positives = 320/615 (52%), Gaps = 20/615 (3%) Frame = +2 Query: 578 MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSS-STSDQNLGR 754 MD+SEP+ VPQWL+++GS TG G D++ S V R S +T+ + GR Sbjct: 1 MDRSEPSLVPQWLKNSGSSTGGG-------------DNHPASRVARNKSFVNTNGNDFGR 47 Query: 755 SSASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILG-DFRC 931 +S S + SS F RSSS+N + ++ SSF Y +K+R +LG D Sbjct: 48 ASGSAKTTSSYFRRSSSSNSSGSSKSYSSFGRNQRDRDWEKDTYNSRDKERLVLGGDRHR 107 Query: 932 XXXXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNPAPDN 1111 ER+ LRRSHSMI+ + GETWP+KV T + N N A + Sbjct: 108 
YESSELLGNPSLSKYERDGLRRSHSMISGKHGETWPKKVVTESSSGSGKNNGNGFLAKGS 167 Query: 1112 --GHVHKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSAL 1285 G +KA+FE++FPSLG +++ E+ RV SPG S A+ S + +SA G WTSAL Sbjct: 168 PVGVANKATFERDFPSLGTDDRAVVPEVGRVASPGLSSALQSLPIGSSASIGGERWTSAL 227 Query: 1286 AEVPGIVKSNETGVSSAKADALGLPTTASGI---STGLSMAETVAQGXXXXXXXXXXSTE 1456 AEVP +V SN T S + A TTAS + +T L+MAE VAQG S Sbjct: 228 AEVPMLVVSNGTASLSVQ-QAAPSSTTASVVVSSTTSLNMAEAVAQGPTRAQTAPQLSLG 286 Query: 1457 SQR-QELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIG----YP-----HLILSQRGV- 1603 +QR +ELAIKQSRQLIPVTP++PK LV + +K K+K+G +P + S RG Sbjct: 287 TQRLEELAIKQSRQLIPVTPTMPKTLVLSSSDKQKSKVGLIQQHPTPSSLPINQSPRGAP 346 Query: 1604 AVKPDISKTSNIGKLQVLKPIRERNDSSLQPIDNLNPT-CEKAVSSVPDATHSVSASSYV 1780 KPD SK SN+GKL VLKP+RE+N + D L+PT KAV+S A+ Sbjct: 347 PSKPDFSKASNVGKLHVLKPVREKNGVTPSVKDKLSPTGSGKAVNSTLPAS--------- 397 Query: 1781 RTQVNNYGHPVAERKHVPPLLEKRPISQSQTQSRNDFFSLMRKKSVPNSSTESFPAISAS 1960 P A + + LEKRP +Q+ QSRNDFF MR+KSV NSS+ S + S Sbjct: 398 ---------PSAVKPLLTTALEKRPTTQA--QSRNDFFKRMREKSVSNSSSASETGTAIS 446 Query: 1961 HKKLGEDEGEGEVATSPVQSEEVPVLANIHVVNSSKDRIMKTSNGYSCNGCESLHSKRNG 2140 +K + V + + P+ V + + SNG N + + Sbjct: 447 PEK----HAKVAVVPAAITGAVEPLPEEKAVRTTCNGGVQHISNGKKYNSEPIISEEEEA 502 Query: 2141 SLCNXXXXXXXXXXXXXXXXGWEENTEEGGLTDEEISAFYKDLNKYINSKPPCKIQQGMP 2320 GW+EN +EGGLT+EEISAFY+D KYINSKP +I QG+ Sbjct: 503 KFLR--------------SMGWDENDDEGGLTEEEISAFYRDFTKYINSKPSLRILQGVR 548 Query: 2321 -RFLLALESQIGRVA 2362 +FLL +SQIG ++ Sbjct: 549 LKFLLPFDSQIGGIS 563 >XP_017258903.1 PREDICTED: uncharacterized protein LOC108227982 [Daucus carota subsp. 
sativus] Length = 591 Score = 322 bits (826), Expect = 1e-95 Identities = 229/604 (37%), Positives = 316/604 (52%), Gaps = 29/604 (4%) Frame = +2 Query: 578 MDKSEPTFVPQWLRSNGSVTGN-GTSINALSPSSLHSDDYGTSNVVRKNSS--STSDQNL 748 M+K+EPTFVP+WL+S+GSVT T+ + ++ SSL SDD T R SS S N Sbjct: 1 MEKNEPTFVPEWLKSSGSVTSAVSTNHHQIASSSLLSDDRATLKSTRNKSSIDDISAHNS 60 Query: 749 GRSSASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFR 928 G S S+R SS F RSS++NG S LR+ SF EY + D+ +GD R Sbjct: 61 GSSPVSDRTTSSYFRRSSTSNG-SQLRSYGSFGRTNRDKGWDKDTNEYHDSDKLRIGDHR 119 Query: 929 CXXXXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNPAPD 1108 E++ L+R+ S I+ + E W RKV + KS NN + Sbjct: 120 HRNFSDPLGSNFSNRFEKDGLKRTQSSISGKYNEPWSRKVSADMNSFDKSNYNNGSSLLA 179 Query: 1109 NGH----VHKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWT 1276 V KA+F+++FPSLG +E+ +E+RRVPSPG S + + + SA+ G WT Sbjct: 180 GSSAISTVRKAAFDRDFPSLGADERQTDYELRRVPSPGLSTNMQNLPIGYSAVTGEIGWT 239 Query: 1277 SALAEVPGIVKSNETGVSSAKADALGLPTT---ASGISTGLSMAETVAQGXXXXXXXXXX 1447 SALAEV VK G++ + LP++ AS +++GL+MAET+AQG Sbjct: 240 SALAEVQ--VKVGANGINKSSVAQAALPSSASVASSMTSGLNMAETLAQGPPHVHATQFS 297 Query: 1448 STESQRQELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIGYP--------HLILSQRGV 1603 + +E+AIKQS+QLIPVTPS+PKALV N EKSK K H S RG Sbjct: 298 VGTQRLEEIAIKQSKQLIPVTPSMPKALVLNSSEKSKTKAAQQQHQTSSTHHFNHSPRGT 357 Query: 1604 AVKPDISKTSNIGKLQVLKPIRERNDSSLQPIDNLNPTCEKAVSSVP-DATHSVSASSYV 1780 +K D+SKTS++GKLQVLKP RERND S Q D L+PT V + P A SV + Sbjct: 358 PMKSDMSKTSSLGKLQVLKPARERNDISYQTKDTLSPTNASKVPNNPLTAASSVGVPPSL 417 Query: 1781 RTQVNNYGHPVAERKHVPPLLEKRPISQSQTQSRNDFFSLMRKKSVPNSSTESFPAISAS 1960 R+ + N P+ VP +LEK+P +Q +SRNDFF+L+RKKS+ N S+ ++S Sbjct: 418 RSPIKN---PIVASGVVPTVLEKKP--SAQLRSRNDFFNLVRKKSLTNHSSPVVDSVSTV 472 Query: 1961 HKKLGEDEGEGEVATSPVQSEEVPVLANIHVVNSSKDRIMKTSNGY-----SCNGC-ESL 2122 + + E E + P E +LAN D + NG +C+G +S Sbjct: 473 SQSILEQPSEHKAGAPP--PGEDSLLAN------QSDTVQYKMNGLISNRDACDGTPKSP 524 Query: 2123 HSKRNG---SLCNXXXXXXXXXXXXXXXXGWEENT-EEGGLTDEEISAFYKDLNKYINSK 2290 + NG S + 
GW+EN E+ GLT+EEI FY+D +KYI + Sbjct: 525 DNGENGETRSSSDVILCSEEEEAAFLRSLGWDENAGEDEGLTEEEIREFYRDASKYIKPR 584 Query: 2291 PPCK 2302 P K Sbjct: 585 PSSK 588 >XP_010245092.1 PREDICTED: uncharacterized protein LOC104588732 isoform X1 [Nelumbo nucifera] Length = 645 Score = 317 bits (811), Expect = 7e-93 Identities = 238/638 (37%), Positives = 332/638 (52%), Gaps = 42/638 (6%) Frame = +2 Query: 578 MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSS-STSDQNLGR 754 M KSEPT VP+WL+ G +TG G++ + + SSL SDD + R SS S D + R Sbjct: 1 MAKSEPTLVPEWLKGTGGITGAGSTTHHFASSSLQSDDNAVALPTRNRSSLSIGDYDTPR 60 Query: 755 SSA-SERIKSSCFWRSSSNNG--------PSNLRTTSSFXXXXXXXXXXXXIYEYSNKDR 907 SSA S+R S+ RSSS+NG PS R+ S+F I ++ +K+R Sbjct: 61 SSAFSDRTSSAYSRRSSSSNGSIVHDKEIPSYTRSYSAFARSHRDRDWEKDILDFRDKER 120 Query: 908 TILGDFRCXXXXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNN 1087 ++ GD R +E+++LRRS SM++ +RGE WPRKV A N Sbjct: 121 SVPGDHRDLDFSDPLVSILTSRIEKDTLRRSQSMVSGKRGEVWPRKV------AADLNNG 174 Query: 1088 NVNPAPDNG---------HVHKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQV 1240 N+N NG + KA+FE++FPSLG EEK +I RV SPG S A+ S + Sbjct: 175 NINQNTSNGLLVGGSIVSSIQKAAFERDFPSLGAEEKPGTPDIGRVSSPGLSSAVQSLPM 234 Query: 1241 SASAMRGSGMWTSALAEVPGIVKSNETGVSSAKADALGLPTT-ASGISTGLSMAETVAQG 1417 +SA+ G WTSALAEVP I+ +N TG+SS + LG + A+ STGL+MAET+AQ Sbjct: 235 GSSALIGGDGWTSALAEVPMIIGNNGTGISSVQQATLGSSASGATNSSTGLNMAETLAQA 294 Query: 1418 XXXXXXXXXXSTESQR-QELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIG-------- 1570 S E+QR +ELAIKQSRQLIP+TPS+PK V N EK+K KI Sbjct: 295 PSRARISPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSVLNSLEKAKPKISVRTGEMNA 354 Query: 1571 ----YPHLILSQRGVAVKPDISKTSNIGKLQVLKPIRERNDSSLQPIDNLNPTCEKAVSS 1738 + S RG ++ D+SKTS+ GKL VLK RE+N S D +PT V++ Sbjct: 355 TKTIQQQQLSSLRGAPMRSDVSKTSHGGKLLVLKAPREKNGISPIAKDGQSPTNVSKVAN 414 Query: 1739 VPDATHSVSASSYVR----TQVNNYGHPVAERKHVPPLLEKRPISQSQTQSRNDFFSLMR 1906 P A +A + ++ ++++N A +EKRP + SQ QSRNDFF+LMR Sbjct: 415 NPLALAPSAAFTPLKSPNNSKLSNERKSAAASLMHGSSVEKRP-TTSQVQSRNDFFNLMR 473 Query: 1907 
KKSVPN-SSTESFPAISASHKKLGEDEGEGEVATSPV--QSEEVPVLANIHVVNSSKDRI 2077 KK+ N SS P+ S L + + + +PV QS + P + S+++ Sbjct: 474 KKTSGNLSSAAPDPSPVVSSSLLDKSTEQTALPAAPVSPQSSDAPSPDPSCLDWSTENGS 533 Query: 2078 MKTSNGYSCNGCES-LHSKRNGSLCNXXXXXXXXXXXXXXXXGWEENT-EEGGLTDEEIS 2251 SNG + + L++ S + GW+EN EE GLT+EEIS Sbjct: 534 ETISNGNASEESQRFLNNGEKHSSPDAFVYPDEEEAAFLRSLGWDENAGEEEGLTEEEIS 593 Query: 2252 AFYKDLNKYINSKPPCKIQQGMPRFLLALESQIGRVAG 2365 AFYK+ K S C+ Q + + LES++G G Sbjct: 594 AFYKEYMKLRPSSKLCRGSQQQVKLPMPLESRVGSFGG 631 >KZN09159.1 hypothetical protein DCAR_001815 [Daucus carota subsp. sativus] Length = 593 Score = 314 bits (804), Expect = 2e-92 Identities = 225/595 (37%), Positives = 311/595 (52%), Gaps = 29/595 (4%) Frame = +2 Query: 578 MDKSEPTFVPQWLRSNGSVTGN-GTSINALSPSSLHSDDYGTSNVVRKNSS--STSDQNL 748 M+K+EPTFVP+WL+S+GSVT T+ + ++ SSL SDD T R SS S N Sbjct: 1 MEKNEPTFVPEWLKSSGSVTSAVSTNHHQIASSSLLSDDRATLKSTRNKSSIDDISAHNS 60 Query: 749 GRSSASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFR 928 G S S+R SS F RSS++NG S LR+ SF EY + D+ +GD R Sbjct: 61 GSSPVSDRTTSSYFRRSSTSNG-SQLRSYGSFGRTNRDKGWDKDTNEYHDSDKLRIGDHR 119 Query: 929 CXXXXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNPAPD 1108 E++ L+R+ S I+ + E W RKV + KS NN + Sbjct: 120 HRNFSDPLGSNFSNRFEKDGLKRTQSSISGKYNEPWSRKVSADMNSFDKSNYNNGSSLLA 179 Query: 1109 NGH----VHKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWT 1276 V KA+F+++FPSLG +E+ +E+RRVPSPG S + + + SA+ G WT Sbjct: 180 GSSAISTVRKAAFDRDFPSLGADERQTDYELRRVPSPGLSTNMQNLPIGYSAVTGEIGWT 239 Query: 1277 SALAEVPGIVKSNETGVSSAKADALGLPTT---ASGISTGLSMAETVAQGXXXXXXXXXX 1447 SALAEV VK G++ + LP++ AS +++GL+MAET+AQG Sbjct: 240 SALAEVQ--VKVGANGINKSSVAQAALPSSASVASSMTSGLNMAETLAQGPPHVHATQFS 297 Query: 1448 STESQRQELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIGYP--------HLILSQRGV 1603 + +E+AIKQS+QLIPVTPS+PKALV N EKSK K H S RG Sbjct: 298 VGTQRLEEIAIKQSKQLIPVTPSMPKALVLNSSEKSKTKAAQQQHQTSSTHHFNHSPRGT 357 Query: 1604 AVKPDISKTSNIGKLQVLKPIRERNDSSLQPIDNLNPTCEKAVSSVP-DATHSVSASSYV 1780 +K D+SKTS++GKLQVLKP RERND S Q D 
L+PT V + P A SV + Sbjct: 358 PMKSDMSKTSSLGKLQVLKPARERNDISYQTKDTLSPTNASKVPNNPLTAASSVGVPPSL 417 Query: 1781 RTQVNNYGHPVAERKHVPPLLEKRPISQSQTQSRNDFFSLMRKKSVPNSSTESFPAISAS 1960 R+ + N P+ VP +LEK+P +Q +SRNDFF+L+RKKS+ N S+ ++S Sbjct: 418 RSPIKN---PIVASGVVPTVLEKKP--SAQLRSRNDFFNLVRKKSLTNHSSPVVDSVSTV 472 Query: 1961 HKKLGEDEGEGEVATSPVQSEEVPVLANIHVVNSSKDRIMKTSNGY-----SCNGC-ESL 2122 + + E E + P E +LAN D + NG +C+G +S Sbjct: 473 SQSILEQPSEHKAGAPP--PGEDSLLAN------QSDTVQYKMNGLISNRDACDGTPKSP 524 Query: 2123 HSKRNG---SLCNXXXXXXXXXXXXXXXXGWEENT-EEGGLTDEEISAFYKDLNK 2275 + NG S + GW+EN E+ GLT+EEI FY+D +K Sbjct: 525 DNGENGETRSSSDVILCSEEEEAAFLRSLGWDENAGEDEGLTEEEIREFYRDASK 579 >OMO92072.1 hypothetical protein COLO4_17899 [Corchorus olitorius] Length = 617 Score = 313 bits (801), Expect = 9e-92 Identities = 231/614 (37%), Positives = 326/614 (53%), Gaps = 22/614 (3%) Frame = +2 Query: 578 MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSSSTSDQNLGRS 757 M++SEP+ VP+WL++ GS+TG+ S + + SSLHSD++ R S S +++GR+ Sbjct: 1 MERSEPSLVPEWLKNGGSITGSSNSNHQFTSSSLHSDNHSALRQARNKLSGGSGRHIGRT 60 Query: 758 SASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCXX 937 SA ER S+ F RSSS+NG ++ R S+F I Y ++++++L D R Sbjct: 61 SALERTSSAYFRRSSSSNGSAHSRPYSNFTKGHRERDREKDINGYHDREKSVLTDHRNRD 120 Query: 938 XXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNPAPDNGH 1117 ++ L+R+ SMI + G+TWPRKV ++P KS ++N N Sbjct: 121 YSDSLDNMLPSMFAKDVLKRTQSMITGKHGDTWPRKVTSNPSANNKSNHSNGNGLLSGVS 180 Query: 1118 V--HKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMW-TSALA 1288 K++FE++FP LG EEK EI RVPSPG A+ V SA+ GS W TSALA Sbjct: 181 TVGTKSAFERDFPVLGAEEKQVGSEIGRVPSPGLGTAV--LPVGTSAVSGSNGWRTSALA 238 Query: 1289 EVPGIVKSNETGVSSAKADALGLPTTASGISTGLSMAETVAQGXXXXXXXXXXSTESQR- 1465 ++P V S+ TGV+ A + + +TGL+MAET+AQG + E+QR Sbjct: 239 DMPVGVGSSGTGVAVASQSVSASSASMAPPTTGLNMAETLAQGPSRARTPPLLNVETQRL 298 Query: 1466 QELAIKQSRQLIP-VTPSLPKALVSNPPEKSKAKIG-YPHLILS---QRGVAVKPDISKT 1630 +ELAIKQSRQLIP VT + PK +V +P EKSK K+G HL LS RG + D K Sbjct: 299 
EELAIKQSRQLIPLVTTTTPKTMVVSPSEKSKPKVGQQQHLSLSLNYTRGGTSRSDSLKV 358 Query: 1631 SNIGKLQVLKPIRERNDSSLQPIDNLNPT--CEKAVSSVPDATHSVSASSYVRTQVNNYG 1804 SN +LQ+LKP RE SL DNL+PT K VSS T +AS+ R+ N+ Sbjct: 359 SNESRLQILKPSRELIGVSLTTKDNLSPTNGSSKPVSSPVSVTPLAAASAPFRSSGNSPN 418 Query: 1805 HPVAERKHVP--PLLEKRPISQSQTQSRNDFFSLMRKKSVPNSSTESFP--AISASHKKL 1972 AER P +EKRP +Q+ QSRNDFF+L++KKS NSS+ P A+S S Sbjct: 419 FATAERNQNPFRIAIEKRPTAQA--QSRNDFFNLLKKKSTTNSSSVPDPGHAMSPSVPDK 476 Query: 1973 GEDEGEGEVATSPVQSEEVPVLANIHVVNSSKDRIMKTSNG------YSCNGCESLHSKR 2134 ++ + TS +L+ V + +R T NG C+ +HS Sbjct: 477 SDELSREDTGTSDALQGGSVLLSESTGVLQTDNRSEVTHNGDALAGSQQCSTNGDMHSSP 536 Query: 2135 NGSLCNXXXXXXXXXXXXXXXXGWEENT-EEGGLTDEEISAFYKDLNKYINSKPPCKIQQ 2311 + L GWEENT ++ GLT+EEISAF+++ Y+ KP K+ Sbjct: 537 DAFL-----YPDEKEAAFLRSLGWEENTGDDEGLTEEEISAFFEE---YMKLKPSAKLFD 588 Query: 2312 GMPRFLLALESQIG 2353 M + L+ L S G Sbjct: 589 RM-QSLVPLNSPNG 601 >EOX97398.1 Uncharacterized protein TCM_006425 isoform 1 [Theobroma cacao] Length = 625 Score = 312 bits (800), Expect = 2e-91 Identities = 228/598 (38%), Positives = 326/598 (54%), Gaps = 21/598 (3%) Frame = +2 Query: 575 MMDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSSSTSDQNLGR 754 +M++SEP+ VP+WL+S GSVTG+G S + + SSLHSD++ R S D ++G Sbjct: 5 VMERSEPSLVPEWLKSGGSVTGSGNSNHQFTSSSLHSDNHSALRPTRNKLSVAGDHDVGG 64 Query: 755 SSASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCX 934 +S +R S+ F RSSS+NG ++LR+ SSF I Y +++++++ D R Sbjct: 65 TSVLDRTTSAYFRRSSSSNGSAHLRSYSSFTKGHRDRDWDKDINGYHDREKSVISDHRNR 124 Query: 935 XXXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNPAPDNG 1114 E++ L RS S I +R +TWP+KV + T+ KS +++ N Sbjct: 125 NFSDSLDNMLPSVFEKDVLWRSQS-ITGKRSDTWPKKVTSDSSTSNKSNHSSSNGLLSGV 183 Query: 1115 HV---HKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSAL 1285 +K+ FE+ FP LG EE+ A EI RV SPG S A S V SA+ GS WTSAL Sbjct: 184 STTVGNKSVFEREFPVLGAEERQVASEIGRVSSPGLSTAGQSLPVGTSAISGSDGWTSAL 243 Query: 1286 AEVPGIVKSNETGVSSAKAD-ALGLPTTASGISTGLSMAETVAQGXXXXXXXXXXSTESQ 1462 A++P V 
S+ TGV+ A + + + AS TGL+MAET+ QG + +Q Sbjct: 244 ADMPAGVGSSGTGVAVASQNVSASSASMASTTMTGLNMAETLVQGPSRARTPPLLNVGTQ 303 Query: 1463 R-QELAIKQSRQLIP-VTPSLPKALVSNPPEKSKAKIG-YPHLILS---QRGVAVKPDIS 1624 R +ELAIKQSRQL+P VT S PK LV +P EKSK K+G H LS RG + D Sbjct: 304 RLEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPKVGQQQHASLSLNYTRGGTSRSDSL 363 Query: 1625 KTSNIGKLQVLKPIRERNDSSLQPIDNLNPT--CEKAVSSVPDATHSVSASSYVRTQVNN 1798 K SN G+L++LKP RE N SL DNL+PT K V+S T S SAS+ R+ N+ Sbjct: 364 KVSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSSKLVNSPLSVTPSASASAPFRSSGNS 423 Query: 1799 YGHPVAERKHVP--PLLEKRPISQSQTQSRNDFFSLMRKKSV---PNSSTESFPAISASH 1963 AER P +EKRP +Q+ QSRNDFF+L++KKS P+S + PA S S Sbjct: 424 PSFATAERNQTPFRINIEKRPTAQA--QSRNDFFNLLKKKSTTNSPSSVADRGPAASPSV 481 Query: 1964 KKLGEDEGEGEVATS-PVQSEEVPVLANIHVVNSSKD-RIMKTSNGYSCNGCESLHSKRN 2137 + ++ G + +TS +Q VP + I + + D R T NG + +G + S + Sbjct: 482 SEKSDELGTEDASTSVTLQGGSVP-SSEISIADLPTDNRSEITHNGDAYSGSQQCSSNGD 540 Query: 2138 -GSLCNXXXXXXXXXXXXXXXXGWEENT-EEGGLTDEEISAFYKDLNKYINSKPPCKI 2305 + + GWEEN ++ GLT+EEISAF+++ ++ KP K+ Sbjct: 541 RHARPDAFLYPDEEEAAFLRSLGWEENAGDDEGLTEEEISAFFEE---HMKLKPSAKL 595 >EOX97399.1 Uncharacterized protein TCM_006425 isoform 2 [Theobroma cacao] Length = 620 Score = 312 bits (799), Expect = 2e-91 Identities = 228/597 (38%), Positives = 325/597 (54%), Gaps = 21/597 (3%) Frame = +2 Query: 578 MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSSSTSDQNLGRS 757 M++SEP+ VP+WL+S GSVTG+G S + + SSLHSD++ R S D ++G + Sbjct: 1 MERSEPSLVPEWLKSGGSVTGSGNSNHQFTSSSLHSDNHSALRPTRNKLSVAGDHDVGGT 60 Query: 758 SASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCXX 937 S +R S+ F RSSS+NG ++LR+ SSF I Y +++++++ D R Sbjct: 61 SVLDRTTSAYFRRSSSSNGSAHLRSYSSFTKGHRDRDWDKDINGYHDREKSVISDHRNRN 120 Query: 938 XXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNPAPDNGH 1117 E++ L RS S I +R +TWP+KV + T+ KS +++ N Sbjct: 121 FSDSLDNMLPSVFEKDVLWRSQS-ITGKRSDTWPKKVTSDSSTSNKSNHSSSNGLLSGVS 179 Query: 1118 V---HKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSALA 1288 +K+ FE+ FP LG 
EE+ A EI RV SPG S A S V SA+ GS WTSALA Sbjct: 180 TTVGNKSVFEREFPVLGAEERQVASEIGRVSSPGLSTAGQSLPVGTSAISGSDGWTSALA 239 Query: 1289 EVPGIVKSNETGVSSAKAD-ALGLPTTASGISTGLSMAETVAQGXXXXXXXXXXSTESQR 1465 ++P V S+ TGV+ A + + + AS TGL+MAET+ QG + +QR Sbjct: 240 DMPAGVGSSGTGVAVASQNVSASSASMASTTMTGLNMAETLVQGPSRARTPPLLNVGTQR 299 Query: 1466 -QELAIKQSRQLIP-VTPSLPKALVSNPPEKSKAKIG-YPHLILS---QRGVAVKPDISK 1627 +ELAIKQSRQL+P VT S PK LV +P EKSK K+G H LS RG + D K Sbjct: 300 LEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPKVGQQQHASLSLNYTRGGTSRSDSLK 359 Query: 1628 TSNIGKLQVLKPIRERNDSSLQPIDNLNPT--CEKAVSSVPDATHSVSASSYVRTQVNNY 1801 SN G+L++LKP RE N SL DNL+PT K V+S T S SAS+ R+ N+ Sbjct: 360 VSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSSKLVNSPLSVTPSASASAPFRSSGNSP 419 Query: 1802 GHPVAERKHVP--PLLEKRPISQSQTQSRNDFFSLMRKKSV---PNSSTESFPAISASHK 1966 AER P +EKRP +Q+ QSRNDFF+L++KKS P+S + PA S S Sbjct: 420 SFATAERNQTPFRINIEKRPTAQA--QSRNDFFNLLKKKSTTNSPSSVADRGPAASPSVS 477 Query: 1967 KLGEDEGEGEVATS-PVQSEEVPVLANIHVVNSSKD-RIMKTSNGYSCNGCESLHSKRN- 2137 + ++ G + +TS +Q VP + I + + D R T NG + +G + S + Sbjct: 478 EKSDELGTEDASTSVTLQGGSVP-SSEISIADLPTDNRSEITHNGDAYSGSQQCSSNGDR 536 Query: 2138 GSLCNXXXXXXXXXXXXXXXXGWEENT-EEGGLTDEEISAFYKDLNKYINSKPPCKI 2305 + + GWEEN ++ GLT+EEISAF+++ ++ KP K+ Sbjct: 537 HARPDAFLYPDEEEAAFLRSLGWEENAGDDEGLTEEEISAFFEE---HMKLKPSAKL 590 >KVI10980.1 hypothetical protein Ccrd_010614 [Cynara cardunculus var. 
scolymus] Length = 629 Score = 311 bits (798), Expect = 3e-91 Identities = 234/649 (36%), Positives = 330/649 (50%), Gaps = 60/649 (9%) Frame = +2 Query: 578 MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHS------------------------ 685 M+++EPTFVP+WL+S+G G+ T+ + + SSLH Sbjct: 1 MERTEPTFVPEWLKSSG---GSSTTSHQFTSSSLHPGNSYIYVCCFNKYGVNDHNICFDY 57 Query: 686 ---------DDYGTSNVVR-KNSSSTSDQNLGRSSASERIKSSCFWRSSSNNGPSNLRTT 835 D+ G+S R K+S ++SD +LGR+S S+R SS F R+S NG ++LR+ Sbjct: 58 PSDGIFLAVDEQGSSKSGRNKSSVNSSDNDLGRTSVSDRTTSSYFRRTSLGNGSTHLRSY 117 Query: 836 SSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCXXXXXXXXXXXXXXVERNSLRRSHSMIA 1015 SSF IYE+ +K+++ D R E++ LRRSHS ++ Sbjct: 118 SSFGRNHRDRDWDKDIYEFWSKEKS---DNRHRDYSDPLDNILPSRFEKDGLRRSHSSVS 174 Query: 1016 DQRGETWPRKVETSPKTAYKSKNNNVNPAPDNGHV---HKASFEQNFPSLGVEEKLAAFE 1186 +RGE+WPRKV + A KS ++N G K SFE++FPSLG +EK A + Sbjct: 175 GKRGESWPRKVVSDLSIANKSSHSNGTALLSGGSSLSNVKTSFERDFPSLGADEKQADPD 234 Query: 1187 IRRVPSPGPSLAIHSSQVSASAMRGSGMWTSALAEVPGIVKSNETGVSSAKADALGLPTT 1366 I RVPSPG S AI S + SA+ G WTSALAEVP IV SN S ++ T Sbjct: 235 IGRVPSPGLSSAIQSLPIGNSAVIGGDGWTSALAEVPVIVGSNGNSTSVSQPVQPTSITA 294 Query: 1367 ASGISTGLSMAETVAQG-----------XXXXXXXXXXSTESQR-QELAIKQSRQLIPVT 1510 + ++ G +MAET+A G + +QR +ELA+KQSRQLIP+T Sbjct: 295 TTSMTGGRNMAETLAHGPPRTQTAPQVAQMLLMGSTILTVGTQRLEELAVKQSRQLIPMT 354 Query: 1511 PSLPKALVSNPPEKSKAKIGYPHLI---LSQRGVAVKPDISKTSNIGKLQVLKPIRERND 1681 PS+PKAL + +K K KIG L+ + R ++VK D+SKTS +GKL VLKP RERN Sbjct: 355 PSMPKALALSSSDKPKLKIGQSQLVNHPHTPRPLSVKSDVSKTSTVGKLLVLKPSRERNG 414 Query: 1682 SSLQPIDNLNPTCEKAVSSVPDATHSVSASSYVRTQVNNYGHPVAERKHVPPLLEKRPIS 1861 S ++L+PT + + P A S S+ +R NN G ERK LEKRP Sbjct: 415 ISPTAKESLSPTGGSKLPNSPLAVPSAIGSAPLRNMGNNPGVTAVERKPSVATLEKRP-- 472 Query: 1862 QSQTQSRNDFFSLMRKKSVPNSSTESFPAISASHKKLGEDEGEGEVATSPVQSEEVPVLA 2041 SQ QSRN+FF+LMRKKS+ S+ + D G + S + PV Sbjct: 473 SSQAQSRNNFFNLMRKKSM------------ISNSSVAPDTGS---SVSSSEKPGAPVAP 517 Query: 2042 NIHVVNSSKDRIMKTSNGYSCNG------CESLHSKRNGSLCNXXXXXXXXXXXXXXXXG 2203 H+ S + ++T +C G S ++ +N 
S + G Sbjct: 518 PAHLGGSESNTTVETKVDLTCKGDACVATVRSTNNGKNHSGPDAVLCSEEEEARFLRSLG 577 Query: 2204 WEENT-EEGGLTDEEISAFYKDLNKYINSKPPCKIQQG-MPRFLLALES 2344 W+E EE GLT+EEIS+FY++ Y+N KP KI +G P+ L+ + S Sbjct: 578 WDETAEEEEGLTEEEISSFYRN---YLNLKPTSKILKGTKPKPLMEISS 623 >XP_007041568.2 PREDICTED: mucin-17 [Theobroma cacao] Length = 620 Score = 309 bits (792), Expect = 2e-90 Identities = 227/597 (38%), Positives = 324/597 (54%), Gaps = 21/597 (3%) Frame = +2 Query: 578 MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSSSTSDQNLGRS 757 M++SEP+ VP+WL+S GSVTG+G S + + SSLHSD++ R S D ++G + Sbjct: 1 MERSEPSLVPEWLKSGGSVTGSGNSNHQFTSSSLHSDNHSALRPARNKLSVAGDHDVGGT 60 Query: 758 SASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILGDFRCXX 937 S +R S+ F RSSS+NG +LR+ SSF I Y +++++++ D R Sbjct: 61 SVLDRTTSAYFRRSSSSNGSVHLRSYSSFTKGHRDRDWDKDINGYHDREKSVISDHRNRN 120 Query: 938 XXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNPAPDNGH 1117 E++ L RS S I +R +TWP+KV + T+ KS +++ N Sbjct: 121 FSDSLDNMLPSVFEKDVLWRSQS-ITGKRSDTWPKKVTSDSSTSNKSNHSSGNGLLSGVS 179 Query: 1118 V---HKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSALA 1288 +K++FE+ FP LG EE+ EI RV SPG S A S V SA+ GS WTSALA Sbjct: 180 TTVGNKSAFEREFPVLGAEERQVGSEIGRVSSPGLSTAGQSLPVGTSAISGSDGWTSALA 239 Query: 1289 EVPGIVKSNETGVSSAKAD-ALGLPTTASGISTGLSMAETVAQGXXXXXXXXXXSTESQR 1465 ++P V S+ TGV+ A + + + AS TGL+MAET+ QG + +QR Sbjct: 240 DMPAGVGSSGTGVAVASQNVSASSASMASTTMTGLNMAETLVQGPSRARTPPLLNVGTQR 299 Query: 1466 -QELAIKQSRQLIP-VTPSLPKALVSNPPEKSKAKIG-YPHLILS---QRGVAVKPDISK 1627 +ELAIKQSRQL+P VT S PK LV +P EKSK K+G H LS RG + D K Sbjct: 300 LEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPKVGQQQHASLSLNYTRGGTSRSDSLK 359 Query: 1628 TSNIGKLQVLKPIRERNDSSLQPIDNLNPT--CEKAVSSVPDATHSVSASSYVRTQVNNY 1801 SN G+L++LKP RE N SL DNL+PT K V+S + T S SAS+ R+ N+ Sbjct: 360 VSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSSKLVNSPLNVTPSASASAPFRSSGNSP 419 Query: 1802 GHPVAERKHVP--PLLEKRPISQSQTQSRNDFFSLMRKKSV---PNSSTESFPAISASHK 1966 AER P +EKRP +Q+ QSRNDFF+L++KKS P+S + PA S S Sbjct: 420 
SFATAERNQTPFRINIEKRPTAQA--QSRNDFFNLLKKKSTTNSPSSVADRGPAASPSVS 477 Query: 1967 KLGEDEGEGEVATS-PVQSEEVPVLANIHVVNSSKD-RIMKTSNGYSCNGCESLHSKRN- 2137 + ++ G + +TS +Q VP + I + + D R T NG + G + S + Sbjct: 478 EKSDELGTEDASTSVTLQGGSVP-SSEISIADLPTDNRSEITHNGDAYAGSQQCSSNGDR 536 Query: 2138 GSLCNXXXXXXXXXXXXXXXXGWEENT-EEGGLTDEEISAFYKDLNKYINSKPPCKI 2305 + + GWEEN ++ GLT+EEISAF+++ ++ KP K+ Sbjct: 537 HARPDAFLYPDEEEAAFLRSLGWEENAGDDEGLTEEEISAFFEE---HMKLKPSAKL 590 >XP_010277689.1 PREDICTED: uncharacterized protein YMR317W-like [Nelumbo nucifera] Length = 655 Score = 304 bits (778), Expect = 5e-88 Identities = 241/645 (37%), Positives = 327/645 (50%), Gaps = 49/645 (7%) Frame = +2 Query: 578 MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKN-SSSTSDQNLGR 754 M K EPT VP+WL+ GS+TG G + + + SS HSDD+ + R + ST D + R Sbjct: 1 MAKGEPTLVPEWLKGTGSITGGGNTTHHFASSSTHSDDHAVALTTRNRLTMSTGDYDTPR 60 Query: 755 SSAS-ERIKSSCFWRSSSNNGP--------SNLRTTSSFXXXXXXXXXXXXIYEYSNKDR 907 SSA +R S+ F RSSS+NG + R+ SSF +Y +K++ Sbjct: 61 SSAFLDRTSSAYFRRSSSSNGSMMHDKETSTYSRSYSSFTRSHRDRDWEKDTLDYRDKEK 120 Query: 908 TILGDFRCXXXXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNN 1087 +ILGD R E+++LRRS SMI+ +RGE W R+V + +N Sbjct: 121 SILGDHRDRDYSDPLASILTSRXEKDTLRRSQSMISGKRGEGWSRRVAADTNNG-NNNHN 179 Query: 1088 NVNPAPDNGHV----HKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAM 1255 N N G + KA+FE++FPSLG EEK A +I RV SPG S ++ S + +SA+ Sbjct: 180 NGNGLLVGGSIVSSIQKAAFERDFPSLGAEEKQGALDIGRVSSPGLSSSVQSLPIGSSAV 239 Query: 1256 RGSGMWTSALAEVPGIVKSNETGVSSA-KADALGLPTTASGISTGLSMAETVAQGXXXXX 1432 G WTSALAEVP I+ +N G SS +A + A STGL+MAET+AQ Sbjct: 240 IGGDGWTSALAEVPVIIGNNSIGPSSVQQATPASSTSGAPNSSTGLNMAETLAQAPSRTR 299 Query: 1433 XXXXXSTESQR-QELAIKQSRQLIPVTPSLPKALVSNPPEKSKAK--------------- 1564 S E+QR +ELAIKQSRQLIP+TPS+PK N EK+K K Sbjct: 300 ISPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSALNSSEKAKPKAVVRTGEMGISAKTS 359 Query: 1565 ----IGYPHLI-LSQRGVAVKPDISKTSNIGKLQVLKPIRERNDSSLQPIDNLNPT-CEK 1726 + HL+ S RG V+ D+ KTS+ GKL VLK RE+N S D L+PT K Sbjct: 360 
QQQQLPSSHLVNHSLRGGPVRSDVPKTSHGGKLLVLKAPREKNGISPSAKDGLSPTNASK 419 Query: 1727 AVSS----VPDATHSVSASSYVRTQVNNYGHPVAERKHVPPLLEKRPISQSQTQSRNDFF 1894 V++ P A ++ S +++ N VA +EKRP + SQ QSRNDFF Sbjct: 420 VVNNSLVLAPLAAYAPPMRSPNNSKLPNERKSVASSLTHGSAVEKRP-TTSQVQSRNDFF 478 Query: 1895 SLMRKKSVPN-SSTESFPAISASHKKLGEDEGEGEVA-TSPV--QSEEVPVLANIHVVNS 2062 +LMRKK+ N +S P+ +AS L + EV T+PV QS + P + S Sbjct: 479 NLMRKKTSGNLASAVPDPSPTASSSLLEKSSEPTEVVPTAPVSPQSSDAPSSEPSGLDWS 538 Query: 2063 SKDRIMKTSNGYSCNGCESLHSKRNG---SLCNXXXXXXXXXXXXXXXXGWEENT-EEGG 2230 +++ SNG ES NG S + GW+EN EE G Sbjct: 539 TENGGDLVSNGDVSE--ESQRFSNNGEKRSTADAFVYPDEEEAAFLRSLGWDENAGEEEG 596 Query: 2231 LTDEEISAFYKDLNKYINSKPPCKIQQGMPRFLLALESQIGRVAG 2365 LT+EEISAFY++ K S C+ Q + L LES +G +G Sbjct: 597 LTEEEISAFYREYMKVRPSSRLCQGAQQQTKVPLPLESHVGSFSG 641 >XP_012828377.1 PREDICTED: uncharacterized protein LOC105949617 isoform X2 [Erythranthe guttata] Length = 550 Score = 299 bits (766), Expect = 2e-87 Identities = 220/585 (37%), Positives = 298/585 (50%), Gaps = 19/585 (3%) Frame = +2 Query: 578 MDKSEPTFVPQWLRSNGSVTGNGTSINALSPSSLHSDDYGTSNVVRKNSS-STSDQNLGR 754 MD+SEP+ VPQWL+++GS TG G D++ S V R S +T+ + GR Sbjct: 1 MDRSEPSLVPQWLKNSGSSTGGG-------------DNHPASRVARNKSFVNTNGNDFGR 47 Query: 755 SSASERIKSSCFWRSSSNNGPSNLRTTSSFXXXXXXXXXXXXIYEYSNKDRTILG-DFRC 931 +S S + SS F RSSS+N + ++ SSF Y +K+R +LG D Sbjct: 48 ASGSAKTTSSYFRRSSSSNSSGSSKSYSSFGRNQRDRDWEKDTYNSRDKERLVLGGDRHR 107 Query: 932 XXXXXXXXXXXXXXVERNSLRRSHSMIADQRGETWPRKVETSPKTAYKSKNNNVNPAPDN 1111 ER+ LRRSHSMI+ + GETWP+KV T + N N A + Sbjct: 108 YESSELLGNPSLSKYERDGLRRSHSMISGKHGETWPKKVVTESSSGSGKNNGNGFLAKGS 167 Query: 1112 --GHVHKASFEQNFPSLGVEEKLAAFEIRRVPSPGPSLAIHSSQVSASAMRGSGMWTSAL 1285 G +KA+FE++FPSLG +++ E+ RV SPG S A+ S + +SA G WTSAL Sbjct: 168 PVGVANKATFERDFPSLGTDDRAVVPEVGRVASPGLSSALQSLPIGSSASIGGERWTSAL 227 Query: 1286 AEVPGIVKSNETGVSSAKADALGLPTTASGI---STGLSMAETVAQGXXXXXXXXXXSTE 1456 AEVP +V SN T S + A TTAS + +T L+MAE VAQG S Sbjct: 228 
AEVPMLVVSNGTASLSVQ-QAAPSSTTASVVVSSTTSLNMAEAVAQGPTRAQTAPQLSLG 286 Query: 1457 SQR-QELAIKQSRQLIPVTPSLPKALVSNPPEKSKAKIG----YP-----HLILSQRGV- 1603 +QR +ELAIKQSRQLIPVTP++PK LV + +K K+K+G +P + S RG Sbjct: 287 TQRLEELAIKQSRQLIPVTPTMPKTLVLSSSDKQKSKVGLIQQHPTPSSLPINQSPRGAP 346 Query: 1604 AVKPDISKTSNIGKLQVLKPIRERNDSSLQPIDNLNPT-CEKAVSSVPDATHSVSASSYV 1780 KPD SK SN+GKL VLKP+RE+N + D L+PT KAV+S A+ Sbjct: 347 PSKPDFSKASNVGKLHVLKPVREKNGVTPSVKDKLSPTGSGKAVNSTLPAS--------- 397 Query: 1781 RTQVNNYGHPVAERKHVPPLLEKRPISQSQTQSRNDFFSLMRKKSVPNSSTESFPAISAS 1960 P A + + LEKRP +Q+ QSRNDFF MR+KSV NSS+ S + S Sbjct: 398 ---------PSAVKPLLTTALEKRPTTQA--QSRNDFFKRMREKSVSNSSSASETGTAIS 446 Query: 1961 HKKLGEDEGEGEVATSPVQSEEVPVLANIHVVNSSKDRIMKTSNGYSCNGCESLHSKRNG 2140 +K + V + + P+ V + + SNG N + + Sbjct: 447 PEK----HAKVAVVPAAITGAVEPLPEEKAVRTTCNGGVQHISNGKKYNSEPIISEEEEA 502 Query: 2141 SLCNXXXXXXXXXXXXXXXXGWEENTEEGGLTDEEISAFYKDLNK 2275 GW+EN +EGGLT+EEISAFY+D K Sbjct: 503 KFLR--------------SMGWDENDDEGGLTEEEISAFYRDFTK 533