BLASTX nr result
ID: Angelica27_contig00000987
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica27_contig00000987
         (3149 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

XP_017259125.1 PREDICTED: uncharacterized protein LOC108228142 [...  1048   0.0
KZM90816.1 hypothetical protein DCAR_021819 [Daucus carota subsp...  1038   0.0
XP_017258903.1 PREDICTED: uncharacterized protein LOC108227982 [...   611   0.0
KZN09159.1 hypothetical protein DCAR_001815 [Daucus carota subsp...   597   0.0
XP_017224884.1 PREDICTED: cell wall protein RBR3-like [Daucus ca...   578   0.0
KVH93465.1 hypothetical protein Ccrd_004483 [Cynara cardunculus ...   573   0.0
XP_002265987.2 PREDICTED: uncharacterized protein LOC100241871 [...   566   0.0
KZM81625.1 hypothetical protein DCAR_029238 [Daucus carota subsp...   571   0.0
XP_011087795.1 PREDICTED: uncharacterized protein LOC105169167 [...   550   0.0
KVI01779.1 hypothetical protein Ccrd_019964, partial [Cynara car...   542   e-179
XP_011092382.1 PREDICTED: uncharacterized protein LOC105172576 [...   542   e-178
XP_010245092.1 PREDICTED: uncharacterized protein LOC104588732 i...   533   e-174
KVI10980.1 hypothetical protein Ccrd_010614 [Cynara cardunculus ...   531   e-174
XP_010277689.1 PREDICTED: uncharacterized protein YMR317W-like [...   528   e-172
CDO97516.1 unnamed protein product [Coffea canephora]                 523   e-171
XP_007041568.2 PREDICTED: mucin-17 [Theobroma cacao]                  522   e-170
EOX97398.1 Uncharacterized protein TCM_006425 isoform 1 [Theobro...   520   e-169
EOX97399.1 Uncharacterized protein TCM_006425 isoform 2 [Theobro...   518   e-169
XP_010245093.1 PREDICTED: uncharacterized protein LOC104588732 i...
514 e-167 OMO92072.1 hypothetical protein COLO4_17899 [Corchorus olitorius] 492 e-159 >XP_017259125.1 PREDICTED: uncharacterized protein LOC108228142 [Daucus carota subsp. sativus] Length = 620 Score = 1048 bits (2711), Expect = 0.0 Identities = 536/620 (86%), Positives = 562/620 (90%), Gaps = 1/620 (0%) Frame = +3 Query: 891 MEKSEPTLVPEWLKSTGSVTGVVSTNHHNTSLHQDNHTTLKAARNKSLVNITDHDLGHRT 1070 MEKSEPTLVPEWLKS+GSVTG VSTNH N SLHQDN TLKAARNKSLVNI DHD+GHRT Sbjct: 1 MEKSEPTLVPEWLKSSGSVTGGVSTNHLNPSLHQDNQATLKAARNKSLVNIGDHDIGHRT 60 Query: 1071 TSAYFRRSSSNGTSHLRSYGSFGRNNRDRDWDKDFYDNRDKEKSNSGDRRHRQFSNSFDS 1250 TS+YFRRSSSNGTSHLRSYGSFGRNNRDRDWD+D +D RDKEKSN GDR++RQFS+SF+S Sbjct: 61 TSSYFRRSSSNGTSHLRSYGSFGRNNRDRDWDRDIHDIRDKEKSNLGDRKYRQFSDSFES 120 Query: 1251 NLSSRFEKDGLRRTQSTITRTGTEPWPRKVPSDLKNIGKSNHNNGNSRLAVSSPISSVHK 1430 N SRFEKDGLRRTQSTI+RTG EPWPR+VPSDLKNI KSNHNNGNSRLAVSSPISSVHK Sbjct: 121 NSLSRFEKDGLRRTQSTISRTGVEPWPRRVPSDLKNIDKSNHNNGNSRLAVSSPISSVHK 180 Query: 1431 ASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIVDGGWTSALAEVPAMI 1610 ASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGI DGGWTSALAEVPAMI Sbjct: 181 ASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIADGGWTSALAEVPAMI 240 Query: 1611 GSNGTTVSSVPHAVQCSASAAPSMTTGLNMAETLVQGPPRVQTDPQLSVETQRLEELAIK 1790 GSNGTT SSVPH+V SAS PSM TGLNMAETLVQGPPRVQ DPQLSVETQRLEELAIK Sbjct: 241 GSNGTTASSVPHSVSSSASVVPSMMTGLNMAETLVQGPPRVQADPQLSVETQRLEELAIK 300 Query: 1791 QSRQLIPVTPSMPKALVLNPSDKTKGKVGLQQQSTSTNLVHHSPRGAPTKSDTSKTSSLG 1970 QSRQLIPVTPS+PKALVLN SDK KGKVGLQQQS STNLVHHSPRGAPTK++ KTSSLG Sbjct: 301 QSRQLIPVTPSLPKALVLNSSDKAKGKVGLQQQSASTNLVHHSPRGAPTKNEIIKTSSLG 360 Query: 1971 KLQVLKPARERNGVSNAAKDTLSPT-SSKLANNPLTPALATVGSAPLRSSVNNSILVSAE 2147 KLQVLKPARERNGVSN +KDTLSPT SSKLANNPL PALATVGSAPLRSS+N+SILVSAE Sbjct: 361 KLQVLKPARERNGVSNTSKDTLSPTSSSKLANNPLAPALATVGSAPLRSSMNHSILVSAE 420 Query: 2148 RKSAPPVLVSPILEKRPSPQAKSRNDFFNSMRKKSMANSSSTVSNPVPAVSPSDLVKHSE 2327 RKSAPPV+V+P+LEKRPSPQAKSRNDFFNSMRKKSM NSSS VSN V AVSPSDL K+SE Sbjct: 421 
RKSAPPVMVTPMLEKRPSPQAKSRNDFFNSMRKKSMTNSSSAVSNTVSAVSPSDLGKNSE 480 Query: 2328 AEAAASLDSQERDAPVVESSNGGKINECRDESIRNSYGPQKSLHNGVNHSSTDVILSSEE 2507 EA+ASLDSQ RD PVVESS+ GKINECRD SI+NS+GPQ SL NGVNHSSTDVILSSEE Sbjct: 481 GEASASLDSQGRDVPVVESSDEGKINECRDGSIQNSHGPQNSLDNGVNHSSTDVILSSEE 540 Query: 2508 EEAAFLRSLGWEENTGEDEGLTEEEINAFYRDVSKYINSMPSSKTFLGTKQKLFGPINFQ 2687 EEAAFLRSLGWEEN GEDEGLTEEEINAFYRDVSKYINS P SKT LGTKQKLFGPINFQ Sbjct: 541 EEAAFLRSLGWEENAGEDEGLTEEEINAFYRDVSKYINSAPPSKTLLGTKQKLFGPINFQ 600 Query: 2688 MXXXXXXXXXXXXXDSKLDS 2747 M DSKLDS Sbjct: 601 MGSNGGVSSGVSSSDSKLDS 620 >KZM90816.1 hypothetical protein DCAR_021819 [Daucus carota subsp. sativus] Length = 617 Score = 1038 bits (2684), Expect = 0.0 Identities = 533/620 (85%), Positives = 559/620 (90%), Gaps = 1/620 (0%) Frame = +3 Query: 891 MEKSEPTLVPEWLKSTGSVTGVVSTNHHNTSLHQDNHTTLKAARNKSLVNITDHDLGHRT 1070 MEKSEPTLVPEWLKS+GSVTG VSTNH N SLHQDN TLKAARNKSLVNI DHD+GHRT Sbjct: 1 MEKSEPTLVPEWLKSSGSVTGGVSTNHLNPSLHQDNQATLKAARNKSLVNIGDHDIGHRT 60 Query: 1071 TSAYFRRSSSNGTSHLRSYGSFGRNNRDRDWDKDFYDNRDKEKSNSGDRRHRQFSNSFDS 1250 TS+YFRRSSSNGTSHLRSYGSFGRNNRDRDWD+D +D RDKEKSN GDR++RQFS+SF+S Sbjct: 61 TSSYFRRSSSNGTSHLRSYGSFGRNNRDRDWDRDIHDIRDKEKSNLGDRKYRQFSDSFES 120 Query: 1251 NLSSRFEKDGLRRTQSTITRTGTEPWPRKVPSDLKNIGKSNHNNGNSRLAVSSPISSVHK 1430 N SRFEKDGLRRTQSTI+RTG EPWPR+VPSDLKNI KSNHNNGNSRLAVSSPISSVHK Sbjct: 121 NSLSRFEKDGLRRTQSTISRTGVEPWPRRVPSDLKNIDKSNHNNGNSRLAVSSPISSVHK 180 Query: 1431 ASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIVDGGWTSALAEVPAMI 1610 ASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGI DGGWTSALAEVPAMI Sbjct: 181 ASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIADGGWTSALAEVPAMI 240 Query: 1611 GSNGTTVSSVPHAVQCSASAAPSMTTGLNMAETLVQGPPRVQTDPQLSVETQRLEELAIK 1790 GSNGTT SSVPH+V SAS PSM TGLNMAETLVQGPPRVQ DPQLSVETQRLEELAIK Sbjct: 241 GSNGTTASSVPHSVSSSASVVPSMMTGLNMAETLVQGPPRVQADPQLSVETQRLEELAIK 300 Query: 1791 QSRQLIPVTPSMPKALVLNPSDKTKGKVGLQQQSTSTNLVHHSPRGAPTKSDTSKTSSLG 1970 QSRQLIPVTPS+PKALVLN SDK KGKVGLQQQS STNLVHHSPRGAPTK++ KTSSLG Sbjct: 
301 QSRQLIPVTPSLPKALVLNSSDKAKGKVGLQQQSASTNLVHHSPRGAPTKNEIIKTSSLG 360 Query: 1971 KLQVLKPARERNGVSNAAKDTLSPT-SSKLANNPLTPALATVGSAPLRSSVNNSILVSAE 2147 KLQVLKPARERNGVSN +KDTLSPT SSKLANNPL PALATVGSAPLRSS+N+SILVSAE Sbjct: 361 KLQVLKPARERNGVSNTSKDTLSPTSSSKLANNPLAPALATVGSAPLRSSMNHSILVSAE 420 Query: 2148 RKSAPPVLVSPILEKRPSPQAKSRNDFFNSMRKKSMANSSSTVSNPVPAVSPSDLVKHSE 2327 RKSAPPV+V+P+LEKRPSPQAKSRNDFFNSMRKKSM NSSS VSN V AVSPSDL K+SE Sbjct: 421 RKSAPPVMVTPMLEKRPSPQAKSRNDFFNSMRKKSMTNSSSAVSNTVSAVSPSDLGKNSE 480 Query: 2328 AEAAASLDSQERDAPVVESSNGGKINECRDESIRNSYGPQKSLHNGVNHSSTDVILSSEE 2507 EA+ASLDSQ RD PVVESS+ GKINECRD SI+NS+GPQ SL NGVNHSSTDVILSSEE Sbjct: 481 GEASASLDSQGRDVPVVESSDEGKINECRDGSIQNSHGPQNSLDNGVNHSSTDVILSSEE 540 Query: 2508 EEAAFLRSLGWEENTGEDEGLTEEEINAFYRDVSKYINSMPSSKTFLGTKQKLFGPINFQ 2687 EEAAFLRSLGWEEN GEDEGLTEEEINAFYRD YINS P SKT LGTKQKLFGPINFQ Sbjct: 541 EEAAFLRSLGWEENAGEDEGLTEEEINAFYRD---YINSAPPSKTLLGTKQKLFGPINFQ 597 Query: 2688 MXXXXXXXXXXXXXDSKLDS 2747 M DSKLDS Sbjct: 598 MGSNGGVSSGVSSSDSKLDS 617 >XP_017258903.1 PREDICTED: uncharacterized protein LOC108227982 [Daucus carota subsp. 
sativus] Length = 591 Score = 611 bits (1576), Expect = 0.0 Identities = 354/603 (58%), Positives = 421/603 (69%), Gaps = 17/603 (2%) Frame = +3 Query: 891 MEKSEPTLVPEWLKSTGSVTGVVSTNHH---NTSLHQDNHTTLKAARNKSLVN-ITDHDL 1058 MEK+EPT VPEWLKS+GSVT VSTNHH ++SL D+ TLK+ RNKS ++ I+ H+ Sbjct: 1 MEKNEPTFVPEWLKSSGSVTSAVSTNHHQIASSSLLSDDRATLKSTRNKSSIDDISAHNS 60 Query: 1059 GH-----RTTSAYFRRSSSNGTSHLRSYGSFGRNNRDRDWDKDFYDNRDKEKSNSGDRRH 1223 G RTTS+YFRRSS++ S LRSYGSFGR NRD+ WDKD + D +K GD RH Sbjct: 61 GSSPVSDRTTSSYFRRSSTSNGSQLRSYGSFGRTNRDKGWDKDTNEYHDSDKLRIGDHRH 120 Query: 1224 RQFSNSFDSNLSSRFEKDGLRRTQSTITRTGTEPWPRKVPSDLKNIGKSNHNNGNSRLAV 1403 R FS+ SN S+RFEKDGL+RTQS+I+ EPW RKV +D+ + KSN+NNG+S LA Sbjct: 121 RNFSDPLGSNFSNRFEKDGLKRTQSSISGKYNEPWSRKVSADMNSFDKSNYNNGSSLLAG 180 Query: 1404 SSPISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIVDGGWTS 1583 SS IS+V KA+FDRDFPSLGA+ERQ D E+ RVPSPGL T +QNLP G SA + GWTS Sbjct: 181 SSAISTVRKAAFDRDFPSLGADERQTDYELRRVPSPGLSTNMQNLPIGYSAVTGEIGWTS 240 Query: 1584 ALAEVPAMIGSNGTTVSSVPH-AVQCSASAAPSMTTGLNMAETLVQGPPRVQTDPQLSVE 1760 ALAEV +G+NG SSV A+ SAS A SMT+GLNMAETL QGPP V Q SV Sbjct: 241 ALAEVQVKVGANGINKSSVAQAALPSSASVASSMTSGLNMAETLAQGPPHVHA-TQFSVG 299 Query: 1761 TQRLEELAIKQSRQLIPVTPSMPKALVLNPSDKTKGKVGLQQ-QSTSTNLVHHSPRGAPT 1937 TQRLEE+AIKQS+QLIPVTPSMPKALVLN S+K+K K QQ Q++ST+ +HSPRG P Sbjct: 300 TQRLEEIAIKQSKQLIPVTPSMPKALVLNSSEKSKTKAAQQQHQTSSTHHFNHSPRGTPM 359 Query: 1938 KSDTSKTSSLGKLQVLKPARERNGVSNAAKDTLSPT-SSKLANNPLTPALATVGSAP-LR 2111 KSD SKTSSLGKLQVLKPARERN +S KDTLSPT +SK+ NNPLT A ++VG P LR Sbjct: 360 KSDMSKTSSLGKLQVLKPARERNDISYQTKDTLSPTNASKVPNNPLT-AASSVGVPPSLR 418 Query: 2112 SSVNNSILVSAERKSAPPVLVSPILEKRPSPQAKSRNDFFNSMRKKSMANSSSTVSNPVP 2291 S + N I+ S +V +LEK+PS Q +SRNDFFN +RKKS+ N SS V + V Sbjct: 419 SPIKNPIVASG--------VVPTVLEKKPSAQLRSRNDFFNLVRKKSLTNHSSPVVDSVS 470 Query: 2292 AVSPSDLVKHSEAEAAASLDSQE----RDAPVVESSNGGKINECRDESIRNSYGPQKSLH 2459 VS S L + SE +A A ++ + V+ G I+ RD P Sbjct: 471 TVSQSILEQPSEHKAGAPPPGEDSLLANQSDTVQYKMNGLISN-RDACDGTPKSPDNG-E 528 
Query: 2460 NGVNHSSTDVILSSEEEEAAFLRSLGWEENTGEDEGLTEEEINAFYRDVSKYINSMPSSK 2639 NG SS+DVIL SEEEEAAFLRSLGW+EN GEDEGLTEEEI FYRD SKYI PSSK Sbjct: 529 NGETRSSSDVILCSEEEEAAFLRSLGWDENAGEDEGLTEEEIREFYRDASKYIKPRPSSK 588 Query: 2640 TFL 2648 T L Sbjct: 589 TSL 591 >KZN09159.1 hypothetical protein DCAR_001815 [Daucus carota subsp. sativus] Length = 593 Score = 597 bits (1540), Expect = 0.0 Identities = 346/591 (58%), Positives = 413/591 (69%), Gaps = 17/591 (2%) Frame = +3 Query: 891 MEKSEPTLVPEWLKSTGSVTGVVSTNHH---NTSLHQDNHTTLKAARNKSLVN-ITDHDL 1058 MEK+EPT VPEWLKS+GSVT VSTNHH ++SL D+ TLK+ RNKS ++ I+ H+ Sbjct: 1 MEKNEPTFVPEWLKSSGSVTSAVSTNHHQIASSSLLSDDRATLKSTRNKSSIDDISAHNS 60 Query: 1059 GH-----RTTSAYFRRSSSNGTSHLRSYGSFGRNNRDRDWDKDFYDNRDKEKSNSGDRRH 1223 G RTTS+YFRRSS++ S LRSYGSFGR NRD+ WDKD + D +K GD RH Sbjct: 61 GSSPVSDRTTSSYFRRSSTSNGSQLRSYGSFGRTNRDKGWDKDTNEYHDSDKLRIGDHRH 120 Query: 1224 RQFSNSFDSNLSSRFEKDGLRRTQSTITRTGTEPWPRKVPSDLKNIGKSNHNNGNSRLAV 1403 R FS+ SN S+RFEKDGL+RTQS+I+ EPW RKV +D+ + KSN+NNG+S LA Sbjct: 121 RNFSDPLGSNFSNRFEKDGLKRTQSSISGKYNEPWSRKVSADMNSFDKSNYNNGSSLLAG 180 Query: 1404 SSPISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIVDGGWTS 1583 SS IS+V KA+FDRDFPSLGA+ERQ D E+ RVPSPGL T +QNLP G SA + GWTS Sbjct: 181 SSAISTVRKAAFDRDFPSLGADERQTDYELRRVPSPGLSTNMQNLPIGYSAVTGEIGWTS 240 Query: 1584 ALAEVPAMIGSNGTTVSSVPH-AVQCSASAAPSMTTGLNMAETLVQGPPRVQTDPQLSVE 1760 ALAEV +G+NG SSV A+ SAS A SMT+GLNMAETL QGPP V Q SV Sbjct: 241 ALAEVQVKVGANGINKSSVAQAALPSSASVASSMTSGLNMAETLAQGPPHVHA-TQFSVG 299 Query: 1761 TQRLEELAIKQSRQLIPVTPSMPKALVLNPSDKTKGKVGLQQ-QSTSTNLVHHSPRGAPT 1937 TQRLEE+AIKQS+QLIPVTPSMPKALVLN S+K+K K QQ Q++ST+ +HSPRG P Sbjct: 300 TQRLEEIAIKQSKQLIPVTPSMPKALVLNSSEKSKTKAAQQQHQTSSTHHFNHSPRGTPM 359 Query: 1938 KSDTSKTSSLGKLQVLKPARERNGVSNAAKDTLSPT-SSKLANNPLTPALATVGSAP-LR 2111 KSD SKTSSLGKLQVLKPARERN +S KDTLSPT +SK+ NNPLT A ++VG P LR Sbjct: 360 KSDMSKTSSLGKLQVLKPARERNDISYQTKDTLSPTNASKVPNNPLT-AASSVGVPPSLR 418 Query: 2112 SSVNNSILVSAERKSAPPVLVSPILEKRPSPQAKSRNDFFNSMRKKSMANSSSTVSNPVP 
2291 S + N I+ S +V +LEK+PS Q +SRNDFFN +RKKS+ N SS V + V Sbjct: 419 SPIKNPIVASG--------VVPTVLEKKPSAQLRSRNDFFNLVRKKSLTNHSSPVVDSVS 470 Query: 2292 AVSPSDLVKHSEAEAAASLDSQE----RDAPVVESSNGGKINECRDESIRNSYGPQKSLH 2459 VS S L + SE +A A ++ + V+ G I+ RD P Sbjct: 471 TVSQSILEQPSEHKAGAPPPGEDSLLANQSDTVQYKMNGLISN-RDACDGTPKSPDNG-E 528 Query: 2460 NGVNHSSTDVILSSEEEEAAFLRSLGWEENTGEDEGLTEEEINAFYRDVSK 2612 NG SS+DVIL SEEEEAAFLRSLGW+EN GEDEGLTEEEI FYRD SK Sbjct: 529 NGETRSSSDVILCSEEEEAAFLRSLGWDENAGEDEGLTEEEIREFYRDASK 579 >XP_017224884.1 PREDICTED: cell wall protein RBR3-like [Daucus carota subsp. sativus] Length = 585 Score = 578 bits (1491), Expect = 0.0 Identities = 340/597 (56%), Positives = 408/597 (68%), Gaps = 11/597 (1%) Frame = +3 Query: 891 MEKSEPTLVPEWLKSTGSVTGVVSTNHHNTSLHQDNHTTLKAARNKSLVNITDHDLGH-- 1064 MEKSEP+ VPEWLKS+GSVT VSTNH Q++H TLK RNK +++ HD G Sbjct: 1 MEKSEPSFVPEWLKSSGSVTVAVSTNHR-----QNDHMTLKPTRNKLSADVSAHDSGRSP 55 Query: 1065 ---RTTSAYFRR-SSSNGTSHLRSYGSFGRNNRDRDWDKDFYDNRDKEKSNSGDRRHRQF 1232 RTTS+YFRR SSSNG+ + RSYGSFGRNNRDR WD+D + RD ++ GDRRH+ + Sbjct: 56 VSDRTTSSYFRRTSSSNGSGNSRSYGSFGRNNRDRGWDRDKNEYRDHDRLRLGDRRHQNY 115 Query: 1233 SNSFDSNLSSRFEKDGLRRTQSTITRTGTEPWPRKVPSDLKNIGKSNHNNGNSRLAVSSP 1412 S S S+ S RFEK+GLRRTQS++ +EP R+V +DL + KSN+NN +SRL SS Sbjct: 116 SGSLGSDFSDRFEKNGLRRTQSSVAGKHSEPLSRRVSADLNSSNKSNYNNSSSRLLGSSG 175 Query: 1413 ISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIVDGGWTSALA 1592 ISSV K SFDRDFPSLGA+ERQ D I +PSPGL T +Q+L TG S + GWTSALA Sbjct: 176 ISSVRKTSFDRDFPSLGADERQTDHGIRNIPSPGLSTNMQSLSTGYSTVANEVGWTSALA 235 Query: 1593 EVPAMIGSNGTTVSSVPHAVQCSASAAPSMT-TGLNMAETLVQGPPRVQTDPQLSVETQR 1769 EVP M+G+NG SSV A S+++ PS T LNMAETL QGP RV T PQ+SVETQR Sbjct: 236 EVPVMVGANGPITSSVLQAALPSSTSVPSSTAASLNMAETLAQGPLRVDTAPQVSVETQR 295 Query: 1770 LEELAIKQSRQLIPVTPSMPKALVLNPSDKTKGKVGLQQQSTSTNLVHHSPRGAPTKSDT 1949 LEELAIKQSRQLIP+TPSMPK+LVLN S+K+K KV QQ TS+ HS RG KSD Sbjct: 296 LEELAIKQSRQLIPMTPSMPKSLVLNSSEKSKVKVSQQQHQTSS---IHSLRGTLEKSDV 352 Query: 1950 
SKTSSLGKLQVLKPARERNGVSNAAKDTLSPTS-SKLANNPLTPALATVGSAPLRSSVNN 2126 KT SLGKLQVLKPARERNGVS D LS T+ S +ANNPLT L V P R+ + N Sbjct: 353 PKTLSLGKLQVLKPARERNGVSYPEIDNLSLTNDSTVANNPLT-TLPAVVPPPSRTQIKN 411 Query: 2127 SILVSAERKSAPPVLVSPILEKRPSPQAKSRNDFFNSMRKKSMANSSSTVSNPVPAVSPS 2306 ++ RK A ++V LEK+PS Q +SRN+FFN +RKKS+ SSS V++ V VS Sbjct: 412 PNPLNVNRKPA-AIMVPATLEKKPSAQLQSRNEFFNLVRKKSLTKSSS-VADSVSTVSQF 469 Query: 2307 DLVKHSEAEAAASLDSQERDAPVVESSNGGKINE---CRDESIRNSYGPQKSLHNGVNHS 2477 + + SE + A+ L SQ +D+ SN E +I N G Q+S NG S Sbjct: 470 VVEQPSETQTASPL-SQGKDSLSANQSNMDHYKENVNALISNINNGNGHQQSCGNGETRS 528 Query: 2478 STDVILSSEEEEAAFLRSLGWEENTGEDEGLTEEEINAFYRDVSKYINSMPSSKTFL 2648 +D+IL SEEEEAAFLRSLGW+EN GEDEGLTEEEIN FYRD SKYI SSKT L Sbjct: 529 RSDMILCSEEEEAAFLRSLGWDENAGEDEGLTEEEINEFYRDASKYIKPGSSSKTSL 585 >KVH93465.1 hypothetical protein Ccrd_004483 [Cynara cardunculus var. scolymus] Length = 636 Score = 573 bits (1478), Expect = 0.0 Identities = 345/640 (53%), Positives = 420/640 (65%), Gaps = 41/640 (6%) Frame = +3 Query: 879 LILVMEKSEPTLVPEWLKSTGSVTGVVSTNHH--NTSLHQ-------------------- 992 L L ME+SEPT VPEWLKS+G G+ +T+H ++SLH Sbjct: 6 LALTMERSEPTFVPEWLKSSG---GLSTTSHQLQSSSLHSGNSIHFISQQYMLFGISFQF 62 Query: 993 ----------DNHTTLKAARNKSLVNITDHDLGH-----RTTSAYFRRSSSNGTSHLRSY 1127 D KA RNKS VNI+D++LG RTTS+YFRR+SSNG+SHLRSY Sbjct: 63 CYLPDNVVLLDEQGVSKATRNKSFVNISDNELGRPSVSDRTTSSYFRRTSSNGSSHLRSY 122 Query: 1128 GSFGRNNRDRDWDKDFYDNRDKEKSNSGDRRHRQFSNSFDSNLSSRFEKDGLRRTQSTIT 1307 SFGRN+RDRDWDKD ++ R+KEK D R R +S+ + L SRFEK+GLRR+ S+++ Sbjct: 123 SSFGRNHRDRDWDKDIHEFREKEKP---DGRLRDYSDPLGNILPSRFEKEGLRRSHSSVS 179 Query: 1308 RTGTEPWPRKVPSDLKNIGKSNHNNGNSRLAVSSPISSVHKASFDRDFPSLGAEERQKDP 1487 E WPRKV D + K++HNNG++ + + I SV K +F+RDFPSLGAEE+Q DP Sbjct: 180 AKRGESWPRKVVVDSSSANKNSHNNGSALRSGAGAIGSV-KTAFERDFPSLGAEEKQIDP 238 Query: 1488 EIGRVPSPGLGTAIQNLPTGSSAGIVDGGWTSALAEVPAMIGSNGTTVSSVPHAVQCSAS 1667 EIGRVPSPGL TAIQ+LP G+SA I GWTSALAEVP ++GSNG+ S P S S Sbjct: 239 
EIGRVPSPGLTTAIQSLPIGNSAVIGGDGWTSALAEVPVIVGSNGSNTSVPPPLQSTSIS 298 Query: 1668 AAPSMTTGLNMAETLVQGPPRVQTDPQLSVETQRLEELAIKQSRQLIPVTPSMPKALVLN 1847 A SM TG NMAETL QGPPR QT PQLSV TQRLEELA+KQSRQLIP+TPS+PKAL LN Sbjct: 299 ATASMATGRNMAETLAQGPPRAQTAPQLSVGTQRLEELAVKQSRQLIPMTPSLPKALALN 358 Query: 1848 PSDKTKGKVGLQQQSTSTNLVH--HSPRGAPTKSDTSKTSSLGKLQVLKPARERNGVSNA 2021 SDK K KVG Q Q S++LV+ HSPR TK D SKTSS+GKL VLKP+RERNG++ Sbjct: 359 SSDKPKSKVG-QLQLQSSHLVNHTHSPRPVSTKFDVSKTSSVGKLHVLKPSRERNGITPI 417 Query: 2022 AKDTLSPT-SSKLANNPLTPALATVGSAPLRSSVNNSILVSAERKSAPPVLVSPILEKRP 2198 AKD LSPT +SKL N+PL + VGSAPLR+ NN + A + V+ LEKRP Sbjct: 418 AKDNLSPTGASKLPNSPLA-VTSVVGSAPLRNLGNNPAVAVAVKPG-----VAATLEKRP 471 Query: 2199 SPQAKSRNDFFNSMRKKSMANSSSTVSNPVPAVSPSDLVKHSEAEAAASLDSQERDAPVV 2378 S QA+SRNDFFN MRKKSM N+SS V+ P S S K + E D VV Sbjct: 472 SSQAQSRNDFFNLMRKKSMTNNSSPVT-PDTGSSISAGDKPTATEGGI-------DPAVV 523 Query: 2379 ESSNGGKINECRDESIRNSYGPQKSLHNGVNHSSTD-VILSSEEEEAAFLRSLGWEENTG 2555 + S G +++ + + G NG N+SS+D +IL SEEEEA FLRSLGWEE Sbjct: 524 DGSGGVQVSSGNKVDLSSCNGEATERSNGKNNSSSDAIILYSEEEEARFLRSLGWEETGE 583 Query: 2556 EDEGLTEEEINAFYRDVSKYINSMPSSKTFLGTKQKLFGP 2675 E+EGLTEEEI++FYRDVSKY+N +SK F K KL P Sbjct: 584 EEEGLTEEEISSFYRDVSKYLNLQAASKIF---KPKLLMP 620 >XP_002265987.2 PREDICTED: uncharacterized protein LOC100241871 [Vitis vinifera] Length = 665 Score = 566 bits (1458), Expect = 0.0 Identities = 348/677 (51%), Positives = 423/677 (62%), Gaps = 58/677 (8%) Frame = +3 Query: 891 MEKSEPTLVPEWLKSTGSVTGVVSTNHH--NTSLHQDNHTTLKAARNKSLVNITDHDLGH 1064 M+K+EP LVPEWLKS+GSVTG STNHH + L D+ LK AR K +VN DHD G Sbjct: 1 MDKTEPALVPEWLKSSGSVTGGGSTNHHFAPSLLQSDDGAALKPAR-KLMVNSNDHDTGR 59 Query: 1065 -----RTTSAYFRRSSS-NGTSHLRSYGSFGRNNRDRDWDKDFYDNRDKEKSNSGDRRHR 1226 RTTS+YFRRSSS NG+ H RS+ SFGR NR+R+W+KD +D RDK+KS D RHR Sbjct: 60 SSNLERTTSSYFRRSSSSNGSGHPRSFSSFGRTNREREWEKDIHDYRDKDKSVLSDHRHR 119 Query: 1227 QFSNSFDSNLSSRFEKDGLRRTQSTITRTGTEPWPRKVPSDLKNIGKSNHNNGNSRLAVS 1406 +S+ + L R E+D LRR+QS IT + WPRKV +D+ + K+ H+NG+ +LA 
Sbjct: 120 DYSDPLGNILPGRLERDMLRRSQSMITGKRGDMWPRKVAADVSTVNKTIHSNGDGQLASG 179 Query: 1407 SPISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIVDGGWTSA 1586 SSV KA+FDR+FPSLGAE++Q P+IGRV SPGL +AIQ+LP G++ I GWTSA Sbjct: 180 IVTSSVQKAAFDRNFPSLGAEDKQGAPDIGRVTSPGLTSAIQSLPIGNTVVIGGDGWTSA 239 Query: 1587 LAEVPAMIGSNGTTVSSVPHAVQCSA-SAAPSMTTGLNMAETLVQGPPRVQTD--PQLSV 1757 LAEVP +IGSN T VSSV +V S+ S APS T+GLNMAETLVQGP R + + PQLSV Sbjct: 240 LAEVPVIIGSNTTGVSSVQQSVSASSVSVAPSTTSGLNMAETLVQGPARARANATPQLSV 299 Query: 1758 ETQRLEELAIKQSRQLIPVTPSMPKALVLNPSDKTKGKVGLQQQSTSTNLVHHSPRGAPT 1937 TQRLEELA+KQSRQLIP+TPSMPK LV +PSDK K K+GLQ +LV+HS RG P Sbjct: 300 GTQRLEELALKQSRQLIPMTPSMPKTLVPSPSDKPKSKIGLQ----PLHLVNHSQRGGPA 355 Query: 1938 KSDTSKTSSLGKLQVLKPARERNGVSNAAKDTLSPT-SSKLANNPLTPALATVGSAPLRS 2114 +SD +KTS++GKL VLKP+RERNGVS AKD+LSPT S++AN+PL + GSA LRS Sbjct: 356 RSDVTKTSNVGKLHVLKPSRERNGVSPTAKDSLSPTMGSRVANSPLAVTPSAAGSASLRS 415 Query: 2115 SVNNSILVSAERKSAPPVLVSPILEKRPSPQAKSRNDFFNSMRKKSMANSSSTVSNPVPA 2294 NN L SAER+ P +V +EKRP+ QA+SRNDFFN MRKKS N S V PA Sbjct: 416 PRNNPTLASAERR---PSVVLTSVEKRPTSQAQSRNDFFNLMRKKSSTNPPSAVPESGPA 472 Query: 2295 VSPSDLVKHSE------------------AEAAASLD-SQERDAPVVESSNG---GKINE 2408 VS S K E + + LD S E E+ N G Sbjct: 473 VSSSVSEKSDELITEVVTAPVTPKGRDILSSDNSGLDWSNENRGDKTENGNNEACGVSQN 532 Query: 2409 CRDESIRNSYG------------------------PQKSLHNGVNHSSTDVILSSEEEEA 2516 RD+ I N G QK L NG HSS D +L +EEEA Sbjct: 533 DRDDEIDNVNGDACDVSQRDQGDEVHDGNGDACDVSQKFLDNGEKHSSPDEVLYPDEEEA 592 Query: 2517 AFLRSLGWEENTGEDEGLTEEEINAFYRDVSKYINSMPSSKTFLGTKQKLFGPINFQMXX 2696 AFLRSLGWEEN GEDEGLTEEEINAFY++ K PSS K+ ++ QM Sbjct: 593 AFLRSLGWEEN-GEDEGLTEEEINAFYKECMKL---KPSSNLLQRMLPKISPLLDSQMGS 648 Query: 2697 XXXXXXXXXXXDSKLDS 2747 DS+L S Sbjct: 649 VAGAVSGLSSSDSELKS 665 >KZM81625.1 hypothetical protein DCAR_029238 [Daucus carota subsp. 
sativus] Length = 993 Score = 571 bits (1472), Expect = 0.0 Identities = 334/586 (56%), Positives = 402/586 (68%), Gaps = 11/586 (1%) Frame = +3 Query: 891 MEKSEPTLVPEWLKSTGSVTGVVSTNHHNTSLHQDNHTTLKAARNKSLVNITDHDLGH-- 1064 MEKSEP+ VPEWLKS+GSVT VSTNH Q++H TLK RNK +++ HD G Sbjct: 1 MEKSEPSFVPEWLKSSGSVTVAVSTNHR-----QNDHMTLKPTRNKLSADVSAHDSGRSP 55 Query: 1065 ---RTTSAYFRR-SSSNGTSHLRSYGSFGRNNRDRDWDKDFYDNRDKEKSNSGDRRHRQF 1232 RTTS+YFRR SSSNG+ + RSYGSFGRNNRDR WD+D + RD ++ GDRRH+ + Sbjct: 56 VSDRTTSSYFRRTSSSNGSGNSRSYGSFGRNNRDRGWDRDKNEYRDHDRLRLGDRRHQNY 115 Query: 1233 SNSFDSNLSSRFEKDGLRRTQSTITRTGTEPWPRKVPSDLKNIGKSNHNNGNSRLAVSSP 1412 S S S+ S RFEK+GLRRTQS++ +EP R+V +DL + KSN+NN +SRL SS Sbjct: 116 SGSLGSDFSDRFEKNGLRRTQSSVAGKHSEPLSRRVSADLNSSNKSNYNNSSSRLLGSSG 175 Query: 1413 ISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIVDGGWTSALA 1592 ISSV K SFDRDFPSLGA+ERQ D I +PSPGL T +Q+L TG S + GWTSALA Sbjct: 176 ISSVRKTSFDRDFPSLGADERQTDHGIRNIPSPGLSTNMQSLSTGYSTVANEVGWTSALA 235 Query: 1593 EVPAMIGSNGTTVSSVPHAVQCSASAAPSMT-TGLNMAETLVQGPPRVQTDPQLSVETQR 1769 EVP M+G+NG SSV A S+++ PS T LNMAETL QGP RV T PQ+SVETQR Sbjct: 236 EVPVMVGANGPITSSVLQAALPSSTSVPSSTAASLNMAETLAQGPLRVDTAPQVSVETQR 295 Query: 1770 LEELAIKQSRQLIPVTPSMPKALVLNPSDKTKGKVGLQQQSTSTNLVHHSPRGAPTKSDT 1949 LEELAIKQSRQLIP+TPSMPK+LVLN S+K+K KV QQ TS+ HS RG KSD Sbjct: 296 LEELAIKQSRQLIPMTPSMPKSLVLNSSEKSKVKVSQQQHQTSS---IHSLRGTLEKSDV 352 Query: 1950 SKTSSLGKLQVLKPARERNGVSNAAKDTLSPTS-SKLANNPLTPALATVGSAPLRSSVNN 2126 KT SLGKLQVLKPARERNGVS D LS T+ S +ANNPLT L V P R+ + N Sbjct: 353 PKTLSLGKLQVLKPARERNGVSYPEIDNLSLTNDSTVANNPLT-TLPAVVPPPSRTQIKN 411 Query: 2127 SILVSAERKSAPPVLVSPILEKRPSPQAKSRNDFFNSMRKKSMANSSSTVSNPVPAVSPS 2306 ++ RK A ++V LEK+PS Q +SRN+FFN +RKKS+ SSS V++ V VS Sbjct: 412 PNPLNVNRKPA-AIMVPATLEKKPSAQLQSRNEFFNLVRKKSLTKSSS-VADSVSTVSQF 469 Query: 2307 DLVKHSEAEAAASLDSQERDAPVVESSNGGKINE---CRDESIRNSYGPQKSLHNGVNHS 2477 + + SE + A+ L SQ +D+ SN E +I N G Q+S NG S Sbjct: 470 VVEQPSETQTASPL-SQGKDSLSANQSNMDHYKENVNALISNINNGNGHQQSCGNGETRS 528 Query: 
2478 STDVILSSEEEEAAFLRSLGWEENTGEDEGLTEEEINAFYRDVSKY 2615 +D+IL SEEEEAAFLRSLGW+EN GEDEGLTEEEIN FYRD SKY Sbjct: 529 RSDMILCSEEEEAAFLRSLGWDENAGEDEGLTEEEINEFYRDASKY 574 >XP_011087795.1 PREDICTED: uncharacterized protein LOC105169167 [Sesamum indicum] Length = 624 Score = 550 bits (1418), Expect = 0.0 Identities = 327/644 (50%), Positives = 420/644 (65%), Gaps = 25/644 (3%) Frame = +3 Query: 891 MEKSEPTLVPEWLKSTGSVTGVVSTNHHNTSLHQDNHTTLKAARNKSLVNITDHDLGH-- 1064 ME+SEPTLVPEWLK+TG++TG S +H D+H + ARNKS VN H+ G Sbjct: 1 MERSEPTLVPEWLKNTGNLTGAGSISH------SDDHAASRVARNKSFVNSNGHEFGRSS 54 Query: 1065 ---RTTSAYFRRSSS-NGTSHLRSYGSFGRNNRDRDWDKDFYDNRDKEKSNSGDRRHRQF 1232 RTTS+YFRRSSS N + + RSY SFGR+ RDRDW+KD YD+RD++KS D H F Sbjct: 55 SSERTTSSYFRRSSSSNSSGNFRSYSSFGRSQRDRDWEKDVYDSRDQDKSVLADHWHWDF 114 Query: 1233 SNSFDSNLSSRFEKDGLRRTQSTITRTGTEPWPRKVPSDLKNIGKSNHNNGNSRLAVSSP 1412 S+ ++L S++E+DGLRR+QS ++ + WP+KV +DL + ++ N N L SP Sbjct: 115 SDPLGNSLLSKYERDGLRRSQSMVSGKRGDTWPKKVVTDLSS---ASGKNANGLLYRGSP 171 Query: 1413 ISS-VHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIVDGGWTSAL 1589 + KA+F++DFPSLGA+ER PE+GRVPSPGL TAIQ+LP G+S IV WTSAL Sbjct: 172 VGGRAKKATFEKDFPSLGADERAVVPEVGRVPSPGLSTAIQSLPVGTSGLIVGEKWTSAL 231 Query: 1590 AEVPAMIGSNGTTVSSVPHAV-QCSASAAPSMTTGLNMAETLVQGPPRVQTDPQLSVETQ 1766 AEVP ++GSNGT +SSV A SAS A TT LNMAE + QGP R QT PQLSV TQ Sbjct: 232 AEVPVLVGSNGTALSSVQQAAPSSSASVALGSTTSLNMAEAVAQGPSRAQTTPQLSVGTQ 291 Query: 1767 RLEELAIKQSRQLIPVTPSMPKALVLNPSDKTKGKVGLQQQSTSTNL-VHHSPRGAPTKS 1943 RLEELAIKQSRQLIPVTPSMPKALVL SDK KGKVG QQ S S++L ++HSPRG K Sbjct: 292 RLEELAIKQSRQLIPVTPSMPKALVLTSSDKPKGKVGQQQHSISSSLPLNHSPRGGAVKG 351 Query: 1944 DTSKTSSLGKLQVLKPARERNGVSNAAKDTLSPT-SSKLANNPLTPALATVGSAPLRSSV 2120 D +K S++GKLQVLKP RE+NGV+ KD LSPT SSK+ + L + + GSA R Sbjct: 352 DVAKASNVGKLQVLKPVREKNGVTPVVKDNLSPTSSSKVVTSTLAVSPSVSGSAATRGLP 411 Query: 2121 NNSILVSAERKSAPPVLVSPILEKRPSPQAKSRNDFFNSMRKKSMANSSSTVSNPV---- 2288 NN + +RK P L +LEKRP+ QA+SRNDFFN +RKKSM NSSS V++ Sbjct: 412 
NNGV---HDRK---PSLT--VLEKRPTSQAQSRNDFFNLVRKKSMPNSSSAVADSAMANC 463 Query: 2289 -------PAVSPSDLVKHSEAEAAASLDS-QERDAPVVESSNGGKINECRDESIRN--SY 2438 A+SPS K E + S ++ + D P+ S + +++E + + N + Sbjct: 464 SSVLDTGTAISPSFSDKDVEIDILPSSNTPKAADVPLSNSLSADRLSEEKGDLTSNGDAC 523 Query: 2439 GPQKSLHNGVNHSSTDVILSSEEEEAAFLRSLGWEENTGEDEG-LTEEEINAFYRDVSKY 2615 Q + NG + S+D I+ SEEEEAAFLRSLGW+EN+ DEG LT+EEINAFYRD++KY Sbjct: 524 DAQNYVRNGKKYPSSDPII-SEEEEAAFLRSLGWDENS--DEGALTDEEINAFYRDLTKY 580 Query: 2616 INSMPSSKTFLGTKQKLFGPINFQMXXXXXXXXXXXXXDSKLDS 2747 I+S PS + G + K P ++ D+KL+S Sbjct: 581 IDSNPSFRILQGVQLKFLLPFGSELGGIGGISSGLSSSDAKLES 624 >KVI01779.1 hypothetical protein Ccrd_019964, partial [Cynara cardunculus var. scolymus] Length = 551 Score = 542 bits (1396), Expect = e-179 Identities = 324/581 (55%), Positives = 401/581 (69%), Gaps = 10/581 (1%) Frame = +3 Query: 891 MEKSEPTLVPEWLKSTGSVTGVVSTNHHNTSLHQDNHTTLKAARNKSLVNITDHDLGH-- 1064 ME++EPT VPEWLKS+GS++ +S ++SLH D+ K+ R KSLVN D+DLG Sbjct: 1 MERTEPTFVPEWLKSSGSLS-TISHQFTSSSLHPDDQGVSKSLRTKSLVNSGDNDLGRTS 59 Query: 1065 ---RTTSAYFRR-SSSNGTSHLRSYGSFGRNNRDRDWDKDFYDNRDKEKSNSGDRRHRQF 1232 RTTS+YFRR SSSNG +HLRSY SF RN+RDRDWDKD Y+ RDKEKS D RHR + Sbjct: 60 VSDRTTSSYFRRTSSSNGAAHLRSYNSFSRNHRDRDWDKDIYEFRDKEKS---DNRHRDY 116 Query: 1233 SNSFDSNLSSRFEKDGLRRTQSTITRTGTEPWPRKVPSDLKNIGKSNHNNGNSRLAVSSP 1412 S+ + L SRFEKDGLRR+ S+++ E WPRKV D K+ HNNG++ +V + Sbjct: 117 SDHLANILPSRFEKDGLRRSHSSLSAKRGESWPRKVAGD-----KNGHNNGSALPSVGTS 171 Query: 1413 ISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIVDGGWTSALA 1592 SS KA+F+RDFPSLGAEE+Q D EIGRVPSPGL TAIQ+LP GSSA I WTSALA Sbjct: 172 SSS-GKAAFERDFPSLGAEEKQADTEIGRVPSPGLTTAIQSLPIGSSAVICGDMWTSALA 230 Query: 1593 EVPAMIGSNGTTVSSVPHAVQCSASAAPSMTTGLNMAETLVQGPPRVQTDPQLSVETQRL 1772 EVP ++GSNG+ +S S SA SMTTG NMAETL QGP R +T PQLSV TQRL Sbjct: 231 EVPMIVGSNGSNISVQQPIQPTSVSATTSMTTGRNMAETLAQGPSRARTTPQLSVGTQRL 290 Query: 1773 EELAIKQSRQLIPVTPSMPKALVLNPSDKTKGKVGLQQQSTSTNLVHHSP--RGAPTKSD 1946 EELA+KQSRQLIP+TPSMPKAL LN SDK K KVG Q Q 
++++V+H P R KSD Sbjct: 291 EELAVKQSRQLIPMTPSMPKALALNSSDKPKLKVG-QSQLQNSHIVNHPPSLRPVSVKSD 349 Query: 1947 TSKTSSLGKLQVLKPARERNGVSNAAKDTLSPT-SSKLANNPLTPALATVGSAPLRSSVN 2123 +K S++GKL +LK +RERNG ++ AK++LSPT SKL N+PL + VGSA LR++ Sbjct: 350 VTKVSTVGKLHILKSSRERNGTTSTAKESLSPTGGSKLPNSPLAVPV-VVGSASLRNTGG 408 Query: 2124 NSILVSAERKSAPPVLVSPILEKRPSPQAKSRNDFFNSMRKKSMANSSSTVSNPVPAVSP 2303 ++I+ A+RK P +EKRPSPQA+SRNDFFN MRKKSMA +SS+ S Sbjct: 409 STIV--ADRK--------PCVEKRPSPQAQSRNDFFNLMRKKSMATNSSSPGASEAGSSE 458 Query: 2304 SDLVKHSEAEAAASLDSQERDAPVVESSNGGK-INECRDESIRNSYGPQKSLHNGVNHSS 2480 S K E + D VV+ S G + ++E + + N ++S +N NHSS Sbjct: 459 STNDKPGEPQVGG------YDPVVVDRSCGVQTLSENKVDFSCNGDATERS-NNEKNHSS 511 Query: 2481 TDVILSSEEEEAAFLRSLGWEENTGEDEGLTEEEINAFYRD 2603 +D IL SEEEEA FLRSLGWEE T E+EGLTEEEIN+FYRD Sbjct: 512 SDAILYSEEEEARFLRSLGWEETT-EEEGLTEEEINSFYRD 551 >XP_011092382.1 PREDICTED: uncharacterized protein LOC105172576 [Sesamum indicum] Length = 616 Score = 542 bits (1397), Expect = e-178 Identities = 324/618 (52%), Positives = 411/618 (66%), Gaps = 21/618 (3%) Frame = +3 Query: 891 MEKSEPTLVPEWLKSTGSVTGVVSTNHHNTSLHQDNHTTLKAARNKSLVNITDHDLGH-- 1064 ME+SEPTL+PEWL+S GS+ G S +H D TT K ARNKSLVN HD Sbjct: 1 MERSEPTLIPEWLRSAGSLNGGGSISH------SDEQTTTKLARNKSLVNSNGHDSARSF 54 Query: 1065 ---RTTSAYFRRSSS-NGTSHLRSYGSFGRNNRDRDWDKDFYDNRDKEKSNSGDRRHRQF 1232 RTTS+YFRRSSS NG+ HLRS+ SFGRN+ DRDW+KD D+RDK+KS GDR HR F Sbjct: 55 SSDRTTSSYFRRSSSSNGSGHLRSHSSFGRNHHDRDWEKDACDSRDKDKSVLGDRWHRDF 114 Query: 1233 SNSFDSNLSSRFEKDGLRRTQSTITRTGTEPWPRKVPSDLKNIGKSNHNNGNSRLAVSSP 1412 S++ + L S+FE+DGLRR+QS I+ + W +KV +DL NI N+ NG + SP Sbjct: 115 SDAMGNTLLSKFERDGLRRSQSMISGKRGDTWHKKVGTDL-NIASGNNTNGLP--SKGSP 171 Query: 1413 ISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIVDGGWTSALA 1592 I V+K +F+RDFPSLGAEER PE+GRVPSPG+ +A+Q+LP G+ I W SALA Sbjct: 172 IGGVNKTTFERDFPSLGAEERAAIPEVGRVPSPGVSSALQSLPIGTPTIIRGEKWRSALA 231 Query: 1593 EVPAMIGSNGTTVSSVPHAV-QCSASAAPSMTTGLNMAETLVQGPPRVQTDPQLSVETQR 1769 EVP ++G+N T +SSV A SAS A TT 
LNMAE + QGP R QT PQLS+ TQR Sbjct: 232 EVPVLVGNNVTGISSVQQAAPSSSASVALGSTTSLNMAEAVAQGPSRAQTTPQLSIGTQR 291 Query: 1770 LEELAIKQSRQLIPVTPSMPKALVLNPSDKTKGKVGLQQQSTSTNL-VHHSPRGAPTKSD 1946 LEELAIKQSRQLIPVTPSMPK L +DK K KVG QQ +++L + SPRG P K+D Sbjct: 292 LEELAIKQSRQLIPVTPSMPKPLAACSADKQKTKVGQQQHVVTSSLAANQSPRGGPVKAD 351 Query: 1947 TSKTSSLGKLQVLKPARERNGVSNAAKDTLSPTS-SKLANNPL-TPALATVGSAPLRSSV 2120 SKTS++GKL VLKP RE+NG + K+ LSPTS SKL ++PL P+L+ GSA R Sbjct: 352 VSKTSNVGKLHVLKPVREKNGTTPVVKENLSPTSGSKLVSSPLAAPSLS--GSAATRVLP 409 Query: 2121 NNSILVSAERKSAPPVLVSPILEKRPSPQAKSRNDFFNSMRKKSMANSSSTV------SN 2282 NN + A+RK V +LEKRP+ QA+SRNDFFNS+RKKSMANS+S S+ Sbjct: 410 NNPV---ADRKP-----VWTVLEKRPTSQAQSRNDFFNSVRKKSMANSTSVADAAIANSS 461 Query: 2283 PV---PAVSPSDLVKHSEAEAAASLDSQERDAPVVESSNGGKINECRDESIRNS--YGPQ 2447 PV PA SPS K +E E + ++Q+R+A + +G ++ R ++ N Q Sbjct: 462 PVDTAPAASPSFSDKLTETEIVVAPNTQDRNASSGVNLSGENLSGTRSDTACNGDVCDAQ 521 Query: 2448 KSLHNGVNHSSTDVILSSEEEEAAFLRSLGWEENTGEDEGLTEEEINAFYRDVSKYINSM 2627 + NG + ++D I SEEEEAAFLRSLGWEEN E GLT+EEI+AF+RDV+KY++S Sbjct: 522 NYVSNGKKNHTSDPIF-SEEEEAAFLRSLGWEENADEG-GLTDEEISAFFRDVTKYVDSK 579 Query: 2628 PSSKTFLGTKQKLFGPIN 2681 PS K + K+ P + Sbjct: 580 PSLKILQAVQPKILLPFD 597 >XP_010245092.1 PREDICTED: uncharacterized protein LOC104588732 isoform X1 [Nelumbo nucifera] Length = 645 Score = 533 bits (1372), Expect = e-174 Identities = 324/626 (51%), Positives = 406/626 (64%), Gaps = 30/626 (4%) Frame = +3 Query: 891 MEKSEPTLVPEWLKSTGSVTGVVSTNHH--NTSLHQDNHTTLKAARNKSLVNITDHD--- 1055 M KSEPTLVPEWLK TG +TG ST HH ++SL D++ RN+S ++I D+D Sbjct: 1 MAKSEPTLVPEWLKGTGGITGAGSTTHHFASSSLQSDDNAVALPTRNRSSLSIGDYDTPR 60 Query: 1056 ---LGHRTTSAYFRRSSS-NGT--------SHLRSYGSFGRNNRDRDWDKDFYDNRDKEK 1199 RT+SAY RRSSS NG+ S+ RSY +F R++RDRDW+KD D RDKE+ Sbjct: 61 SSAFSDRTSSAYSRRSSSSNGSIVHDKEIPSYTRSYSAFARSHRDRDWEKDILDFRDKER 120 Query: 1200 SNSGDRRHRQFSNSFDSNLSSRFEKDGLRRTQSTITRTGTEPWPRKVPSDLKNIGKSNHN 1379 S GD R FS+ S L+SR EKD LRR+QS ++ E WPRKV +DL N G N N Sbjct: 121 
SVPGDHRDLDFSDPLVSILTSRIEKDTLRRSQSMVSGKRGEVWPRKVAADLNN-GNINQN 179 Query: 1380 NGNSRLAVSSPISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAG 1559 N L S +SS+ KA+F+RDFPSLGAEE+ P+IGRV SPGL +A+Q+LP GSSA Sbjct: 180 TSNGLLVGGSIVSSIQKAAFERDFPSLGAEEKPGTPDIGRVSSPGLSSAVQSLPMGSSAL 239 Query: 1560 IVDGGWTSALAEVPAMIGSNGTTVSSVPHA-VQCSASAAPSMTTGLNMAETLVQGPPRVQ 1736 I GWTSALAEVP +IG+NGT +SSV A + SAS A + +TGLNMAETL Q P R + Sbjct: 240 IGGDGWTSALAEVPMIIGNNGTGISSVQQATLGSSASGATNSSTGLNMAETLAQAPSRAR 299 Query: 1737 TDPQLSVETQRLEELAIKQSRQLIPVTPSMPKALVLNPSDKTKGKVGLQQ-QSTSTNLVH 1913 PQLSVETQRLEELAIKQSRQLIP+TPSMPK VLN +K K K+ ++ + +T + Sbjct: 300 ISPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSVLNSLEKAKPKISVRTGEMNATKTIQ 359 Query: 1914 H----SPRGAPTKSDTSKTSSLGKLQVLKPARERNGVSNAAKDTLSPTS-SKLANNPLTP 2078 S RGAP +SD SKTS GKL VLK RE+NG+S AKD SPT+ SK+ANNPL Sbjct: 360 QQQLSSLRGAPMRSDVSKTSHGGKLLVLKAPREKNGISPIAKDGQSPTNVSKVANNPL-- 417 Query: 2079 ALATVGSAPLRSSVNNSILVSAERKSAPPVLVSPILEKRP-SPQAKSRNDFFNSMRKKSM 2255 ALA + S NNS L + + +A ++ +EKRP + Q +SRNDFFN MRKK+ Sbjct: 418 ALAPSAAFTPLKSPNNSKLSNERKSAAASLMHGSSVEKRPTTSQVQSRNDFFNLMRKKTS 477 Query: 2256 ANSSSTVSNPVPAVSPSDLVKHSEAEA--AASLDSQERDAPVVESSNGGKINECRDESIR 2429 N SS +P P VS S L K +E A AA + Q DAP + S E E+I Sbjct: 478 GNLSSAAPDPSPVVSSSLLDKSTEQTALPAAPVSPQSSDAPSPDPSCLDWSTENGSETIS 537 Query: 2430 N---SYGPQKSLHNGVNHSSTDVILSSEEEEAAFLRSLGWEENTGEDEGLTEEEINAFYR 2600 N S Q+ L+NG HSS D + +EEEAAFLRSLGW+EN GE+EGLTEEEI+AFY+ Sbjct: 538 NGNASEESQRFLNNGEKHSSPDAFVYPDEEEAAFLRSLGWDENAGEEEGLTEEEISAFYK 597 Query: 2601 DVSKYINSMPSSKTFLGTKQKLFGPI 2678 + Y+ PSSK G++Q++ P+ Sbjct: 598 E---YMKLRPSSKLCRGSQQQVKLPM 620 >KVI10980.1 hypothetical protein Ccrd_010614 [Cynara cardunculus var. 
scolymus] Length = 629 Score = 531 bits (1369), Expect = e-174 Identities = 325/644 (50%), Positives = 404/644 (62%), Gaps = 53/644 (8%) Frame = +3 Query: 891 MEKSEPTLVPEWLKSTGSVTGVVSTNHHNTS----------------------------- 983 ME++EPT VPEWLKS+G G +T+H TS Sbjct: 1 MERTEPTFVPEWLKSSG---GSSTTSHQFTSSSLHPGNSYIYVCCFNKYGVNDHNICFDY 57 Query: 984 ------LHQDNHTTLKAARNKSLVNITDHDLGH-----RTTSAYFRRSS-SNGTSHLRSY 1127 L D + K+ RNKS VN +D+DLG RTTS+YFRR+S NG++HLRSY Sbjct: 58 PSDGIFLAVDEQGSSKSGRNKSSVNSSDNDLGRTSVSDRTTSSYFRRTSLGNGSTHLRSY 117 Query: 1128 GSFGRNNRDRDWDKDFYDNRDKEKSNSGDRRHRQFSNSFDSNLSSRFEKDGLRRTQSTIT 1307 SFGRN+RDRDWDKD Y+ KEKS D RHR +S+ D+ L SRFEKDGLRR+ S+++ Sbjct: 118 SSFGRNHRDRDWDKDIYEFWSKEKS---DNRHRDYSDPLDNILPSRFEKDGLRRSHSSVS 174 Query: 1308 RTGTEPWPRKVPSDLKNIGKSNHNNGNSRLAVSSPISSVHKASFDRDFPSLGAEERQKDP 1487 E WPRKV SDL KS+H+NG + L+ S +S+V K SF+RDFPSLGA+E+Q DP Sbjct: 175 GKRGESWPRKVVSDLSIANKSSHSNGTALLSGGSSLSNV-KTSFERDFPSLGADEKQADP 233 Query: 1488 EIGRVPSPGLGTAIQNLPTGSSAGIVDGGWTSALAEVPAMIGSNGTTVSSVPHAVQCSAS 1667 +IGRVPSPGL +AIQ+LP G+SA I GWTSALAEVP ++GSNG + S S + Sbjct: 234 DIGRVPSPGLSSAIQSLPIGNSAVIGGDGWTSALAEVPVIVGSNGNSTSVSQPVQPTSIT 293 Query: 1668 AAPSMTTGLNMAETLVQGPPRVQTDPQ-----------LSVETQRLEELAIKQSRQLIPV 1814 A SMT G NMAETL GPPR QT PQ L+V TQRLEELA+KQSRQLIP+ Sbjct: 294 ATTSMTGGRNMAETLAHGPPRTQTAPQVAQMLLMGSTILTVGTQRLEELAVKQSRQLIPM 353 Query: 1815 TPSMPKALVLNPSDKTKGKVGLQQQSTSTNLVHHSPRGAPTKSDTSKTSSLGKLQVLKPA 1994 TPSMPKAL L+ SDK K K+G Q H+PR KSD SKTS++GKL VLKP+ Sbjct: 354 TPSMPKALALSSSDKPKLKIGQSQLVNHP----HTPRPLSVKSDVSKTSTVGKLLVLKPS 409 Query: 1995 RERNGVSNAAKDTLSPT-SSKLANNPLTPALATVGSAPLRSSVNNSILVSAERKSAPPVL 2171 RERNG+S AK++LSPT SKL N+PL A +GSAPLR+ NN + + ERK P V Sbjct: 410 RERNGISPTAKESLSPTGGSKLPNSPLAVPSA-IGSAPLRNMGNNPGVTAVERK--PSVA 466 Query: 2172 VSPILEKRPSPQAKSRNDFFNSMRKKSMANSSSTVSNPVPAVSPSDLVKHSEAEAAASLD 2351 LEKRPS QA+SRN+FFN MRKKSM ++SS + +VS S+ + A L Sbjct: 467 T---LEKRPSSQAQSRNNFFNLMRKKSMISNSSVAPDTGSSVSSSE-KPGAPVAPPAHLG 522 Query: 2352 
SQERDAPVVESSNGGKINECRDESIRNSYGPQKSLHNGVNHSSTDVILSSEEEEAAFLRS 2531 E + V + C+ ++ +S +NG NHS D +L SEEEEA FLRS Sbjct: 523 GSESNTTVETKVD----LTCKGDA---CVATVRSTNNGKNHSGPDAVLCSEEEEARFLRS 575 Query: 2532 LGWEENTGEDEGLTEEEINAFYRDVSKYINSMPSSKTFLGTKQK 2663 LGW+E E+EGLTEEEI++FYR+ Y+N P+SK GTK K Sbjct: 576 LGWDETAEEEEGLTEEEISSFYRN---YLNLKPTSKILKGTKPK 616 >XP_010277689.1 PREDICTED: uncharacterized protein YMR317W-like [Nelumbo nucifera] Length = 655 Score = 528 bits (1360), Expect = e-172 Identities = 321/639 (50%), Positives = 402/639 (62%), Gaps = 43/639 (6%) Frame = +3 Query: 891 MEKSEPTLVPEWLKSTGSVTGVVSTNHH--NTSLHQDNHTTLKAARNKSLVNITDHDLGH 1064 M K EPTLVPEWLK TGS+TG +T HH ++S H D+H RN+ ++ D+D Sbjct: 1 MAKGEPTLVPEWLKGTGSITGGGNTTHHFASSSTHSDDHAVALTTRNRLTMSTGDYDTPR 60 Query: 1065 ------RTTSAYFRRSSSNGTSHL---------RSYGSFGRNNRDRDWDKDFYDNRDKEK 1199 RT+SAYFRRSSS+ S + RSY SF R++RDRDW+KD D RDKEK Sbjct: 61 SSAFLDRTSSAYFRRSSSSNGSMMHDKETSTYSRSYSSFTRSHRDRDWEKDTLDYRDKEK 120 Query: 1200 SNSGDRRHRQFSNSFDSNLSSRFEKDGLRRTQSTITRTGTEPWPRKVPSDLKNIGKSNHN 1379 S GD R R +S+ S L+SR EKD LRR+QS I+ E W R+V +D N G +NHN Sbjct: 121 SILGDHRDRDYSDPLASILTSRXEKDTLRRSQSMISGKRGEGWSRRVAADTNN-GNNNHN 179 Query: 1380 NGNSRLAVSSPISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAG 1559 NGN L S +SS+ KA+F+RDFPSLGAEE+Q +IGRV SPGL +++Q+LP GSSA Sbjct: 180 NGNGLLVGGSIVSSIQKAAFERDFPSLGAEEKQGALDIGRVSSPGLSSSVQSLPIGSSAV 239 Query: 1560 IVDGGWTSALAEVPAMIGSNGTTVSSVPHAVQCSA-SAAPSMTTGLNMAETLVQGPPRVQ 1736 I GWTSALAEVP +IG+N SSV A S+ S AP+ +TGLNMAETL Q P R + Sbjct: 240 IGGDGWTSALAEVPVIIGNNSIGPSSVQQATPASSTSGAPNSSTGLNMAETLAQAPSRTR 299 Query: 1737 TDPQLSVETQRLEELAIKQSRQLIPVTPSMPKALVLNPSDKTK-------GKVGL----- 1880 PQLSVETQRLEELAIKQSRQLIP+TPSMPK LN S+K K G++G+ Sbjct: 300 ISPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSALNSSEKAKPKAVVRTGEMGISAKTS 359 Query: 1881 -QQQSTSTNLVHHSPRGAPTKSDTSKTSSLGKLQVLKPARERNGVSNAAKDTLSPT-SSK 2054 QQQ S++LV+HS RG P +SD KTS GKL VLK RE+NG+S +AKD LSPT +SK Sbjct: 360 QQQQLPSSHLVNHSLRGGPVRSDVPKTSHGGKLLVLKAPREKNGISPSAKDGLSPTNASK 419 
Query: 2055 LANNPLTPALATVGSAPLRSSVNNSILVSAERKSAPPVLVSPILEKRP-SPQAKSRNDFF 2231 + NN L A + P+RS NNS L + + A + +EKRP + Q +SRNDFF Sbjct: 420 VVNNSLVLAPLAAYAPPMRSP-NNSKLPNERKSVASSLTHGSAVEKRPTTSQVQSRNDFF 478 Query: 2232 NSMRKKSMANSSSTVSNPVPAVSPSDLVKHS---EAEAAASLDSQERDAPVVESS----- 2387 N MRKK+ N +S V +P P S S L K S E A + Q DAP E S Sbjct: 479 NLMRKKTSGNLASAVPDPSPTASSSLLEKSSEPTEVVPTAPVSPQSSDAPSSEPSGLDWS 538 Query: 2388 --NGGKINECRDESIRNSYGPQKSLHNGVNHSSTDVILSSEEEEAAFLRSLGWEENTGED 2561 NGG + D S + Q+ +NG S+ D + +EEEAAFLRSLGW+EN GE+ Sbjct: 539 TENGGDLVSNGDVSEES----QRFSNNGEKRSTADAFVYPDEEEAAFLRSLGWDENAGEE 594 Query: 2562 EGLTEEEINAFYRDVSKYINSMPSSKTFLGTKQKLFGPI 2678 EGLTEEEI+AFYR+ Y+ PSS+ G +Q+ P+ Sbjct: 595 EGLTEEEISAFYRE---YMKVRPSSRLCQGAQQQTKVPL 630 >CDO97516.1 unnamed protein product [Coffea canephora] Length = 599 Score = 523 bits (1347), Expect = e-171 Identities = 307/601 (51%), Positives = 392/601 (65%), Gaps = 10/601 (1%) Frame = +3 Query: 891 MEKSEPTLVPEWLKSTGSVTGVVSTNHHNTSLHQDNHTTLKAARNKSLVNITDHDLGH-- 1064 ME+SEP+LVPEWLKS+GS TG +T+H + D+H K ARNKS VN DH++G Sbjct: 1 MERSEPSLVPEWLKSSGSATGSGTTSHPLSP--SDDHAVSKLARNKSSVNHNDHEIGRSS 58 Query: 1065 ---RTTSAYFRRSSS-NGTSHLRSYGSFGRNNRDRDWDKDFYDNRDKEKSNSGDRRHRQF 1232 RT+++YFRRSSS NG+ ++SY SFGRN+R RDWDKD Y+ RD++ G +HR + Sbjct: 59 VSDRTSASYFRRSSSSNGSGQMQSYSSFGRNHRGRDWDKDLYEPRDRDNLVVGGHKHRDY 118 Query: 1233 SNSFDSNLSSRFEKDGLRRTQSTITRTGTEPWPRKVPSDLKNIGKSNHNNGNSRLAVSSP 1412 + +N FEKDGLRR+QS ++R E WP++ +D + ++ +GNS L Sbjct: 119 LDPPVNNFPGNFEKDGLRRSQSMVSRKRNEIWPKRSIADSNSASRNKSTDGNSLLDKGDS 178 Query: 1413 ISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIVDGGWTSALA 1592 + +VHK F+RDFPSLG+EERQ E+GRVPSPGL TAI LP +SA I WTSALA Sbjct: 179 VGTVHKVVFERDFPSLGSEERQATSEVGRVPSPGLNTAIHGLPISASAIIAGDKWTSALA 238 Query: 1593 EVPAMIGSNGTTVSSVPHA-VQCSASAAPSMTT-GLNMAETLVQGPPRVQTDPQLSVETQ 1766 EVPA++G GT +S A + S ++ PS T+ GLNMAET+ QG PRVQ P+++ TQ Sbjct: 239 EVPAIVGGGGTGLSPGRQASLPSSPASLPSSTSAGLNMAETVAQG-PRVQAAPKITSGTQ 297 Query: 1767 
RLEELAIKQSRQLIPVTPSMPKALVLNPSDKTKGKVGLQQQSTSTNLVHHSPRGAPTKSD 1946 RLEELAI+QSRQLIP+TPSMPK +LN SDK K K G Q S+ L+ S RG P K+D Sbjct: 298 RLEELAIRQSRQLIPMTPSMPKPSILNSSDKGKAKAGQPQHPVSSPLLSPSLRGGPVKTD 357 Query: 1947 TSKTSSLGKLQVLKPARERNGVSNAAKDTLSPTSS-KLANNPLTPALATVGSAPLRSSVN 2123 SKTS+ GKL VLKP RERNGVS A+KDTLSPTSS + A + + A + G A R Sbjct: 358 ASKTSNAGKLLVLKPPRERNGVSTASKDTLSPTSSTRAATSGIAVATSVTGLATSRGPAI 417 Query: 2124 NSILVSAERKSAPPVLVSPILEKRPSPQAKSRNDFFNSMRKKSMANSSSTVSNPVPAVSP 2303 N + AERK A P+LEK+PS QA+SRNDFFN MRKKSM SSS+V++ AVS Sbjct: 418 NPVSPGAERKHA-----LPMLEKKPSSQAQSRNDFFNLMRKKSMP-SSSSVADAGSAVSA 471 Query: 2304 SDLVKHSEAEA-AASLDSQERDAPVVESSNGGKINECRDESIRNSYGPQKSLHNGVNHSS 2480 S L + E E A + ++ D P ++ NG + E + +G Q S Sbjct: 472 STLDEPGELEVIPAPVIHEDEDVPSLDRLNGCQHTE------NDLFGIQ----------S 515 Query: 2481 TDVILSSEEEEAAFLRSLGWEENTGEDEGLTEEEINAFYRDVSKYINSMPSSKTFLGTKQ 2660 + L SEEEEAAFL LGW+EN ED GLTEEEINAF+RD+SKY+NS PSSK+ G + Sbjct: 516 RSLPLFSEEEEAAFLHQLGWQENADED-GLTEEEINAFFRDLSKYMNSKPSSKSLQGVQP 574 Query: 2661 K 2663 K Sbjct: 575 K 575 >XP_007041568.2 PREDICTED: mucin-17 [Theobroma cacao] Length = 620 Score = 522 bits (1344), Expect = e-170 Identities = 320/604 (52%), Positives = 399/604 (66%), Gaps = 19/604 (3%) Frame = +3 Query: 891 MEKSEPTLVPEWLKSTGSVTGVVSTNHHNTS--LHQDNHTTLKAARNKSLVNITDHDLG- 1061 ME+SEP+LVPEWLKS GSVTG ++NH TS LH DNH+ L+ ARNK V DHD+G Sbjct: 1 MERSEPSLVPEWLKSGGSVTGSGNSNHQFTSSSLHSDNHSALRPARNKLSV-AGDHDVGG 59 Query: 1062 ----HRTTSAYFRRSSS-NGTSHLRSYGSFGRNNRDRDWDKDFYDNRDKEKSNSGDRRHR 1226 RTTSAYFRRSSS NG+ HLRSY SF + +RDRDWDKD D+EKS D R+R Sbjct: 60 TSVLDRTTSAYFRRSSSSNGSVHLRSYSSFTKGHRDRDWDKDINGYHDREKSVISDHRNR 119 Query: 1227 QFSNSFDSNLSSRFEKDGLRRTQSTITRTGTEPWPRKVPSDLKNIGKSNHNNGNSRLA-V 1403 FS+S D+ L S FEKD L R+QS IT ++ WP+KV SD KSNH++GN L+ V Sbjct: 120 NFSDSLDNMLPSVFEKDVLWRSQS-ITGKRSDTWPKKVTSDSSTSNKSNHSSGNGLLSGV 178 Query: 1404 SSPISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIVDGGWTS 1583 S+ + + K++F+R+FP LGAEERQ EIGRV SPGL TA Q+LP G+SA GWTS Sbjct: 179 
STTVGN--KSAFEREFPVLGAEERQVGSEIGRVSSPGLSTAGQSLPVGTSAISGSDGWTS 236 Query: 1584 ALAEVPAMIGSNGTTVSSVPHAVQC-SASAAPSMTTGLNMAETLVQGPPRVQTDPQLSVE 1760 ALA++PA +GS+GT V+ V SAS A + TGLNMAETLVQGP R +T P L+V Sbjct: 237 ALADMPAGVGSSGTGVAVASQNVSASSASMASTTMTGLNMAETLVQGPSRARTPPLLNVG 296 Query: 1761 TQRLEELAIKQSRQLIP-VTPSMPKALVLNPSDKTKGKVGLQQQSTSTNLVHHSPRGAPT 1937 TQRLEELAIKQSRQL+P VT S PK LV++PS+K+K KVG QQQ S +L + RG + Sbjct: 297 TQRLEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPKVG-QQQHASLSL--NYTRGGTS 353 Query: 1938 KSDTSKTSSLGKLQVLKPARERNGVSNAAKDTLSPT--SSKLANNPLTPALATVGSAPLR 2111 +SD+ K S+ G+L++LKP+RE NGVS KD LSPT SSKL N+PL + SAP R Sbjct: 354 RSDSLKVSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSSKLVNSPLNVTPSASASAPFR 413 Query: 2112 SSVNNSILVSAERKSAPPVLVSPILEKRPSPQAKSRNDFFNSMRKKSMANSSSTVSNPVP 2291 SS N+ +AER P +EKRP+ QA+SRNDFFN ++KKS NS S+V++ P Sbjct: 414 SSGNSPSFATAERNQTP---FRINIEKRPTAQAQSRNDFFNLLKKKSTTNSPSSVADRGP 470 Query: 2292 AVSPSDLVKHSE---AEAAASLDSQERDAPVVESSNGGKINECRDESIRNS---YGPQKS 2453 A SPS K E +A+ S+ Q P E S + R E N G Q+ Sbjct: 471 AASPSVSEKSDELGTEDASTSVTLQGGSVPSSEISIADLPTDNRSEITHNGDAYAGSQQC 530 Query: 2454 LHNGVNHSSTDVILSSEEEEAAFLRSLGWEENTGEDEGLTEEEINAFYRDVSKYINSMPS 2633 NG H+ D L +EEEAAFLRSLGWEEN G+DEGLTEEEI+AF+ + ++ PS Sbjct: 531 SSNGDRHARPDAFLYPDEEEAAFLRSLGWEENAGDDEGLTEEEISAFFEE---HMKLKPS 587 Query: 2634 SKTF 2645 +K F Sbjct: 588 AKLF 591 >EOX97398.1 Uncharacterized protein TCM_006425 isoform 1 [Theobroma cacao] Length = 625 Score = 520 bits (1338), Expect = e-169 Identities = 319/605 (52%), Positives = 399/605 (65%), Gaps = 19/605 (3%) Frame = +3 Query: 888 VMEKSEPTLVPEWLKSTGSVTGVVSTNHHNTS--LHQDNHTTLKAARNKSLVNITDHDLG 1061 VME+SEP+LVPEWLKS GSVTG ++NH TS LH DNH+ L+ RNK V DHD+G Sbjct: 5 VMERSEPSLVPEWLKSGGSVTGSGNSNHQFTSSSLHSDNHSALRPTRNKLSV-AGDHDVG 63 Query: 1062 -----HRTTSAYFRRSSS-NGTSHLRSYGSFGRNNRDRDWDKDFYDNRDKEKSNSGDRRH 1223 RTTSAYFRRSSS NG++HLRSY SF + +RDRDWDKD D+EKS D R+ Sbjct: 64 GTSVLDRTTSAYFRRSSSSNGSAHLRSYSSFTKGHRDRDWDKDINGYHDREKSVISDHRN 123 Query: 1224 
RQFSNSFDSNLSSRFEKDGLRRTQSTITRTGTEPWPRKVPSDLKNIGKSNHNNGNSRLA- 1400 R FS+S D+ L S FEKD L R+QS IT ++ WP+KV SD KSNH++ N L+ Sbjct: 124 RNFSDSLDNMLPSVFEKDVLWRSQS-ITGKRSDTWPKKVTSDSSTSNKSNHSSSNGLLSG 182 Query: 1401 VSSPISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIVDGGWT 1580 VS+ + + K+ F+R+FP LGAEERQ EIGRV SPGL TA Q+LP G+SA GWT Sbjct: 183 VSTTVGN--KSVFEREFPVLGAEERQVASEIGRVSSPGLSTAGQSLPVGTSAISGSDGWT 240 Query: 1581 SALAEVPAMIGSNGTTVSSVPHAVQC-SASAAPSMTTGLNMAETLVQGPPRVQTDPQLSV 1757 SALA++PA +GS+GT V+ V SAS A + TGLNMAETLVQGP R +T P L+V Sbjct: 241 SALADMPAGVGSSGTGVAVASQNVSASSASMASTTMTGLNMAETLVQGPSRARTPPLLNV 300 Query: 1758 ETQRLEELAIKQSRQLIP-VTPSMPKALVLNPSDKTKGKVGLQQQSTSTNLVHHSPRGAP 1934 TQRLEELAIKQSRQL+P VT S PK LV++PS+K+K KVG QQQ S +L + RG Sbjct: 301 GTQRLEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPKVG-QQQHASLSL--NYTRGGT 357 Query: 1935 TKSDTSKTSSLGKLQVLKPARERNGVSNAAKDTLSPT--SSKLANNPLTPALATVGSAPL 2108 ++SD+ K S+ G+L++LKP+RE NGVS KD LSPT SSKL N+PL+ + SAP Sbjct: 358 SRSDSLKVSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSSKLVNSPLSVTPSASASAPF 417 Query: 2109 RSSVNNSILVSAERKSAPPVLVSPILEKRPSPQAKSRNDFFNSMRKKSMANSSSTVSNPV 2288 RSS N+ +AER P +EKRP+ QA+SRNDFFN ++KKS NS S+V++ Sbjct: 418 RSSGNSPSFATAERNQTP---FRINIEKRPTAQAQSRNDFFNLLKKKSTTNSPSSVADRG 474 Query: 2289 PAVSPSDLVKHSE---AEAAASLDSQERDAPVVESSNGGKINECRDESIRNS---YGPQK 2450 PA SPS K E +A+ S+ Q P E S + R E N G Q+ Sbjct: 475 PAASPSVSEKSDELGTEDASTSVTLQGGSVPSSEISIADLPTDNRSEITHNGDAYSGSQQ 534 Query: 2451 SLHNGVNHSSTDVILSSEEEEAAFLRSLGWEENTGEDEGLTEEEINAFYRDVSKYINSMP 2630 NG H+ D L +EEEAAFLRSLGWEEN G+DEGLTEEEI+AF+ + ++ P Sbjct: 535 CSSNGDRHARPDAFLYPDEEEAAFLRSLGWEENAGDDEGLTEEEISAFFEE---HMKLKP 591 Query: 2631 SSKTF 2645 S+K F Sbjct: 592 SAKLF 596 >EOX97399.1 Uncharacterized protein TCM_006425 isoform 2 [Theobroma cacao] Length = 620 Score = 518 bits (1334), Expect = e-169 Identities = 318/604 (52%), Positives = 398/604 (65%), Gaps = 19/604 (3%) Frame = +3 Query: 891 MEKSEPTLVPEWLKSTGSVTGVVSTNHHNTS--LHQDNHTTLKAARNKSLVNITDHDLG- 1061 ME+SEP+LVPEWLKS GSVTG ++NH TS LH 
DNH+ L+ RNK V DHD+G Sbjct: 1 MERSEPSLVPEWLKSGGSVTGSGNSNHQFTSSSLHSDNHSALRPTRNKLSV-AGDHDVGG 59 Query: 1062 ----HRTTSAYFRRSSS-NGTSHLRSYGSFGRNNRDRDWDKDFYDNRDKEKSNSGDRRHR 1226 RTTSAYFRRSSS NG++HLRSY SF + +RDRDWDKD D+EKS D R+R Sbjct: 60 TSVLDRTTSAYFRRSSSSNGSAHLRSYSSFTKGHRDRDWDKDINGYHDREKSVISDHRNR 119 Query: 1227 QFSNSFDSNLSSRFEKDGLRRTQSTITRTGTEPWPRKVPSDLKNIGKSNHNNGNSRLA-V 1403 FS+S D+ L S FEKD L R+QS IT ++ WP+KV SD KSNH++ N L+ V Sbjct: 120 NFSDSLDNMLPSVFEKDVLWRSQS-ITGKRSDTWPKKVTSDSSTSNKSNHSSSNGLLSGV 178 Query: 1404 SSPISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIVDGGWTS 1583 S+ + + K+ F+R+FP LGAEERQ EIGRV SPGL TA Q+LP G+SA GWTS Sbjct: 179 STTVGN--KSVFEREFPVLGAEERQVASEIGRVSSPGLSTAGQSLPVGTSAISGSDGWTS 236 Query: 1584 ALAEVPAMIGSNGTTVSSVPHAVQC-SASAAPSMTTGLNMAETLVQGPPRVQTDPQLSVE 1760 ALA++PA +GS+GT V+ V SAS A + TGLNMAETLVQGP R +T P L+V Sbjct: 237 ALADMPAGVGSSGTGVAVASQNVSASSASMASTTMTGLNMAETLVQGPSRARTPPLLNVG 296 Query: 1761 TQRLEELAIKQSRQLIP-VTPSMPKALVLNPSDKTKGKVGLQQQSTSTNLVHHSPRGAPT 1937 TQRLEELAIKQSRQL+P VT S PK LV++PS+K+K KVG QQQ S +L + RG + Sbjct: 297 TQRLEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPKVG-QQQHASLSL--NYTRGGTS 353 Query: 1938 KSDTSKTSSLGKLQVLKPARERNGVSNAAKDTLSPT--SSKLANNPLTPALATVGSAPLR 2111 +SD+ K S+ G+L++LKP+RE NGVS KD LSPT SSKL N+PL+ + SAP R Sbjct: 354 RSDSLKVSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSSKLVNSPLSVTPSASASAPFR 413 Query: 2112 SSVNNSILVSAERKSAPPVLVSPILEKRPSPQAKSRNDFFNSMRKKSMANSSSTVSNPVP 2291 SS N+ +AER P +EKRP+ QA+SRNDFFN ++KKS NS S+V++ P Sbjct: 414 SSGNSPSFATAERNQTP---FRINIEKRPTAQAQSRNDFFNLLKKKSTTNSPSSVADRGP 470 Query: 2292 AVSPSDLVKHSE---AEAAASLDSQERDAPVVESSNGGKINECRDESIRNS---YGPQKS 2453 A SPS K E +A+ S+ Q P E S + R E N G Q+ Sbjct: 471 AASPSVSEKSDELGTEDASTSVTLQGGSVPSSEISIADLPTDNRSEITHNGDAYSGSQQC 530 Query: 2454 LHNGVNHSSTDVILSSEEEEAAFLRSLGWEENTGEDEGLTEEEINAFYRDVSKYINSMPS 2633 NG H+ D L +EEEAAFLRSLGWEEN G+DEGLTEEEI+AF+ + ++ PS Sbjct: 531 SSNGDRHARPDAFLYPDEEEAAFLRSLGWEENAGDDEGLTEEEISAFFEE---HMKLKPS 587 Query: 2634 SKTF 2645 +K F Sbjct: 588 AKLF 591 >XP_010245093.1 
PREDICTED: uncharacterized protein LOC104588732 isoform X2 [Nelumbo nucifera] Length = 616 Score = 514 bits (1325), Expect = e-167 Identities = 318/620 (51%), Positives = 394/620 (63%), Gaps = 24/620 (3%) Frame = +3 Query: 891 MEKSEPTLVPEWLKSTGSVTGVVSTNHH--NTSLHQDNHTTLKAARNKSLVNITDHDLGH 1064 M KSEPTLVPEWLK TG +TG ST HH ++SL D Sbjct: 1 MAKSEPTLVPEWLKGTGGITGAGSTTHHFASSSLQSD----------------------- 37 Query: 1065 RTTSAYFRRSSS-NGT--------SHLRSYGSFGRNNRDRDWDKDFYDNRDKEKSNSGDR 1217 RT+SAY RRSSS NG+ S+ RSY +F R++RDRDW+KD D RDKE+S GD Sbjct: 38 RTSSAYSRRSSSSNGSIVHDKEIPSYTRSYSAFARSHRDRDWEKDILDFRDKERSVPGDH 97 Query: 1218 RHRQFSNSFDSNLSSRFEKDGLRRTQSTITRTGTEPWPRKVPSDLKNIGKSNHNNGNSRL 1397 R FS+ S L+SR EKD LRR+QS ++ E WPRKV +DL N G N N N L Sbjct: 98 RDLDFSDPLVSILTSRIEKDTLRRSQSMVSGKRGEVWPRKVAADLNN-GNINQNTSNGLL 156 Query: 1398 AVSSPISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIVDGGW 1577 S +SS+ KA+F+RDFPSLGAEE+ P+IGRV SPGL +A+Q+LP GSSA I GW Sbjct: 157 VGGSIVSSIQKAAFERDFPSLGAEEKPGTPDIGRVSSPGLSSAVQSLPMGSSALIGGDGW 216 Query: 1578 TSALAEVPAMIGSNGTTVSSVPHA-VQCSASAAPSMTTGLNMAETLVQGPPRVQTDPQLS 1754 TSALAEVP +IG+NGT +SSV A + SAS A + +TGLNMAETL Q P R + PQLS Sbjct: 217 TSALAEVPMIIGNNGTGISSVQQATLGSSASGATNSSTGLNMAETLAQAPSRARISPQLS 276 Query: 1755 VETQRLEELAIKQSRQLIPVTPSMPKALVLNPSDKTKGKVGLQQ-QSTSTNLVHH----S 1919 VETQRLEELAIKQSRQLIP+TPSMPK VLN +K K K+ ++ + +T + S Sbjct: 277 VETQRLEELAIKQSRQLIPMTPSMPKTSVLNSLEKAKPKISVRTGEMNATKTIQQQQLSS 336 Query: 1920 PRGAPTKSDTSKTSSLGKLQVLKPARERNGVSNAAKDTLSPTS-SKLANNPLTPALATVG 2096 RGAP +SD SKTS GKL VLK RE+NG+S AKD SPT+ SK+ANNPL ALA Sbjct: 337 LRGAPMRSDVSKTSHGGKLLVLKAPREKNGISPIAKDGQSPTNVSKVANNPL--ALAPSA 394 Query: 2097 SAPLRSSVNNSILVSAERKSAPPVLVSPILEKRP-SPQAKSRNDFFNSMRKKSMANSSST 2273 + S NNS L + + +A ++ +EKRP + Q +SRNDFFN MRKK+ N SS Sbjct: 395 AFTPLKSPNNSKLSNERKSAAASLMHGSSVEKRPTTSQVQSRNDFFNLMRKKTSGNLSSA 454 Query: 2274 VSNPVPAVSPSDLVKHSEAEA--AASLDSQERDAPVVESSNGGKINECRDESIRN---SY 2438 +P P VS S L K +E A AA + Q DAP + S E E+I N S Sbjct: 455 
APDPSPVVSSSLLDKSTEQTALPAAPVSPQSSDAPSPDPSCLDWSTENGSETISNGNASE 514 Query: 2439 GPQKSLHNGVNHSSTDVILSSEEEEAAFLRSLGWEENTGEDEGLTEEEINAFYRDVSKYI 2618 Q+ L+NG HSS D + +EEEAAFLRSLGW+EN GE+EGLTEEEI+AFY++ Y+ Sbjct: 515 ESQRFLNNGEKHSSPDAFVYPDEEEAAFLRSLGWDENAGEEEGLTEEEISAFYKE---YM 571 Query: 2619 NSMPSSKTFLGTKQKLFGPI 2678 PSSK G++Q++ P+ Sbjct: 572 KLRPSSKLCRGSQQQVKLPM 591 >OMO92072.1 hypothetical protein COLO4_17899 [Corchorus olitorius] Length = 617 Score = 492 bits (1267), Expect = e-159 Identities = 302/602 (50%), Positives = 394/602 (65%), Gaps = 17/602 (2%) Frame = +3 Query: 891 MEKSEPTLVPEWLKSTGSVTGVVSTNHHNTS--LHQDNHTTLKAARNKSLVNITDHDLGH 1064 ME+SEP+LVPEWLK+ GS+TG ++NH TS LH DNH+ L+ ARNK L + +G Sbjct: 1 MERSEPSLVPEWLKNGGSITGSSNSNHQFTSSSLHSDNHSALRQARNK-LSGGSGRHIGR 59 Query: 1065 -----RTTSAYFRRSSS-NGTSHLRSYGSFGRNNRDRDWDKDFYDNRDKEKSNSGDRRHR 1226 RT+SAYFRRSSS NG++H R Y +F + +R+RD +KD D+EKS D R+R Sbjct: 60 TSALERTSSAYFRRSSSSNGSAHSRPYSNFTKGHRERDREKDINGYHDREKSVLTDHRNR 119 Query: 1227 QFSNSFDSNLSSRFEKDGLRRTQSTITRTGTEPWPRKVPSDLKNIGKSNHNNGNSRLAVS 1406 +S+S D+ L S F KD L+RTQS IT + WPRKV S+ KSNH+NGN L+ Sbjct: 120 DYSDSLDNMLPSMFAKDVLKRTQSMITGKHGDTWPRKVTSNPSANNKSNHSNGNGLLSGV 179 Query: 1407 SPISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIVDGGW-TS 1583 S + + K++F+RDFP LGAEE+Q EIGRVPSPGLGTA+ LP G+SA GW TS Sbjct: 180 STVGT--KSAFERDFPVLGAEEKQVGSEIGRVPSPGLGTAV--LPVGTSAVSGSNGWRTS 235 Query: 1584 ALAEVPAMIGSNGTTVSSVPHAVQCSASAAPSMTTGLNMAETLVQGPPRVQTDPQLSVET 1763 ALA++P +GS+GT V+ +V S+++ TTGLNMAETL QGP R +T P L+VET Sbjct: 236 ALADMPVGVGSSGTGVAVASQSVSASSASMAPPTTGLNMAETLAQGPSRARTPPLLNVET 295 Query: 1764 QRLEELAIKQSRQLIP-VTPSMPKALVLNPSDKTKGKVGLQQQSTSTNLVHHSPRGAPTK 1940 QRLEELAIKQSRQLIP VT + PK +V++PS+K+K KVG QQQ S +L + RG ++ Sbjct: 296 QRLEELAIKQSRQLIPLVTTTTPKTMVVSPSEKSKPKVG-QQQHLSLSL--NYTRGGTSR 352 Query: 1941 SDTSKTSSLGKLQVLKPARERNGVSNAAKDTLSPT--SSKLANNPLTPALATVGSAPLRS 2114 SD+ K S+ +LQ+LKP+RE GVS KD LSPT SSK ++P++ SAP RS Sbjct: 353 
SDSLKVSNESRLQILKPSRELIGVSLTTKDNLSPTNGSSKPVSSPVSVTPLAAASAPFRS 412 Query: 2115 SVNNSILVSAERKSAPPVLVSPILEKRPSPQAKSRNDFFNSMRKKSMANSSSTVSNPVPA 2294 S N+ +AER P + +EKRP+ QA+SRNDFFN ++KKS NSSS V +P A Sbjct: 413 SGNSPNFATAERNQNPFRIA---IEKRPTAQAQSRNDFFNLLKKKSTTNSSS-VPDPGHA 468 Query: 2295 VSPSDLVKHSE--AEAAASLDSQERDAPVVESSNGGKINECRDESIRNS---YGPQKSLH 2459 +SPS K E E + D+ + + ++ S G + R E N G Q+ Sbjct: 469 MSPSVPDKSDELSREDTGTSDALQGGSVLLSESTGVLQTDNRSEVTHNGDALAGSQQCST 528 Query: 2460 NGVNHSSTDVILSSEEEEAAFLRSLGWEENTGEDEGLTEEEINAFYRDVSKYINSMPSSK 2639 NG HSS D L +E+EAAFLRSLGWEENTG+DEGLTEEEI+AF+ + Y+ PS+K Sbjct: 529 NGDMHSSPDAFLYPDEKEAAFLRSLGWEENTGDDEGLTEEEISAFFEE---YMKLKPSAK 585 Query: 2640 TF 2645 F Sbjct: 586 LF 587