BLASTX nr result
ID: Lithospermum23_contig00010586
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum23_contig00010586
         (2719 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits)  Value

XP_011087795.1 PREDICTED: uncharacterized protein LOC105169167 [...   409   e-128
CDO97516.1 unnamed protein product [Coffea canephora]                 394   e-123
XP_011092382.1 PREDICTED: uncharacterized protein LOC105172576 [...   394   e-122
KVH93465.1 hypothetical protein Ccrd_004483 [Cynara cardunculus ...   376   e-115
XP_002265987.2 PREDICTED: uncharacterized protein LOC100241871 [...   375   e-115
KVI10980.1 hypothetical protein Ccrd_010614 [Cynara cardunculus ...   364   e-111
XP_017259125.1 PREDICTED: uncharacterized protein LOC108228142 [...   362   e-110
KVI01779.1 hypothetical protein Ccrd_019964, partial [Cynara car...   356   e-109
XP_012828376.1 PREDICTED: uncharacterized protein LOC105949617 i...   353   e-107
KZV20788.1 hypothetical protein F511_30430 [Dorcoceras hygrometr...   353   e-107
KZM90816.1 hypothetical protein DCAR_021819 [Daucus carota subsp...   352   e-107
XP_012828377.1 PREDICTED: uncharacterized protein LOC105949617 i...   327   3e-98
XP_010245092.1 PREDICTED: uncharacterized protein LOC104588732 i...   325   3e-96
XP_017258903.1 PREDICTED: uncharacterized protein LOC108227982 [...   320   4e-95
XP_017224884.1 PREDICTED: cell wall protein RBR3-like [Daucus ca...   318   3e-94
XP_010277689.1 PREDICTED: uncharacterized protein YMR317W-like [...   318   2e-93
EOX97398.1 Uncharacterized protein TCM_006425 isoform 1 [Theobro...   315   8e-93
EOX97399.1 Uncharacterized protein TCM_006425 isoform 2 [Theobro...   315   1e-92
XP_010245093.1 PREDICTED: uncharacterized protein LOC104588732 i...
314 2e-92 XP_007041568.2 PREDICTED: mucin-17 [Theobroma cacao] 314 3e-92 >XP_011087795.1 PREDICTED: uncharacterized protein LOC105169167 [Sesamum indicum] Length = 624 Score = 409 bits (1050), Expect = e-128 Identities = 274/642 (42%), Positives = 374/642 (58%), Gaps = 31/642 (4%) Frame = -2 Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSS-STNDQNLGR 2104 M+++EPTLVP+WL+ DD+ +V R S ++N GR Sbjct: 1 MERSEPTLVPEWLKNTGNLTGAGSISHS--------DDHAASRVARNKSFVNSNGHEFGR 52 Query: 2103 SSASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRR 1924 SS+SER S +FRRS S+N S N R+ + F ++ DRD VY+S ++D+++L D Sbjct: 53 SSSSERTTSSYFRRSSSSNSSGNFRSYSSFGRSQRDRDWEKDVYDSRDQDKSVLADHWHW 112 Query: 1923 DFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNAN----RA 1756 DFSD LGN+ + ER LRRS S++S +R +T P+ V SA S NAN R Sbjct: 113 DFSDPLGNSLLSKYERDGLRRSQSMVSGKRGDTWPKKVVTDLSSA---SGKNANGLLYRG 169 Query: 1755 SD-NVLLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTS 1585 S KA+FE+DFPSL +E+ E+ RVPSP L+ +I + PV S +I +KWTS Sbjct: 170 SPVGGRAKKATFEKDFPSLGADERAVVPEVGRVPSPGLSTAIQSLPVGTSGLIVGEKWTS 229 Query: 1584 ALAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTMDSSISTALSMA 1405 ALAEVP LV +NGT +S+ A S+ + A GS +T+L+MA Sbjct: 230 ALAEVPVLVGSNGTALSSVQQAAPS------------SSASVALGS-------TTSLNMA 270 Query: 1404 ETVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEKPKAKTGQP 1228 E VA P R +TTPQLS +QR +ELAIKQSRQLIPVTPS PKALV +KPK K GQ Sbjct: 271 EAVAQGPSRAQTTPQLSVGTQRLEELAIKQSRQLIPVTPSMPKALVLTSSDKPKGKVGQQ 330 Query: 1227 H--------LIHSPRGGSAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKRDSRT 1072 L HSPRGG+ K DV+K S++GKLQVLKPVRE+NG + DNLSP S+ Sbjct: 331 QHSISSSLPLNHSPRGGAVKGDVAKASNVGKLQVLKPVREKNGVTPVVKDNLSPTSSSKV 390 Query: 1071 VCSVPAATHSVAGSSSVRAQVNNSGHPAAERKHLLPLLEKRAASQSQTQSRNDFFSLMRK 892 V S A + SV+GS++ R NN H +RK L +LEKR SQ+ QSRNDFF+L+RK Sbjct: 391 VTSTLAVSPSVSGSAATRGLPNNGVH---DRKPSLTVLEKRPTSQA--QSRNDFFNLVRK 445 Query: 891 KSISNSSPAPESGPVISASDKKLGEGEDAALPVTDHSGKVGVLANIHSIKSLE--YTNAK 718 KS+ NSS A + + S L G + +D ++ +L 
+ ++ K+ + +N+ Sbjct: 446 KSMPNSSSAVADSAMANCS-SVLDTGTAISPSFSDKDVEIDILPSSNTPKAADVPLSNSL 504 Query: 717 VSDGHSYNEG---------ESLNSKKNGS--TCNXXXXXXXXXXXFLRSLGWEENTEEGG 571 +D S +G ++ N +NG + FLRSLGW+EN++EG Sbjct: 505 SADRLSEEKGDLTSNGDACDAQNYVRNGKKYPSSDPIISEEEEAAFLRSLGWDENSDEGA 564 Query: 570 LTDEEINAFYKDVTKYINSKPSWKILQGMP-RFLLALESQIG 448 LTDEEINAFY+D+TKYI+S PS++ILQG+ +FLL S++G Sbjct: 565 LTDEEINAFYRDLTKYIDSNPSFRILQGVQLKFLLPFGSELG 606 >CDO97516.1 unnamed protein product [Coffea canephora] Length = 599 Score = 394 bits (1012), Expect = e-123 Identities = 262/628 (41%), Positives = 355/628 (56%), Gaps = 20/628 (3%) Frame = -2 Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSS-STNDQNLGR 2104 M+++EP+LVP+WL+ DD+ K+ R SS + ND +GR Sbjct: 1 MERSEPSLVPEWLKSSGSATGSGTTSHPLSPS----DDHAVSKLARNKSSVNHNDHEIGR 56 Query: 2103 SSASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRR 1924 SS S+R + +FRRS S+NGS +++ + F +NH RD + +YE ++D ++ + R Sbjct: 57 SSVSDRTSASYFRRSSSSNGSGQMQSYSSFGRNHRGRDWDKDLYEPRDRDNLVVGGHKHR 116 Query: 1923 DFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNANRASDNV 1744 D+ D NN E+ LRRS S++S +R+E P+ A S SA + + N D Sbjct: 117 DYLDPPVNNFPGNFEKDGLRRSQSMVSRKRNEIWPKRSIADSNSASRNKSTDGNSLLDKG 176 Query: 1743 ----LLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTSA 1582 +HK FE+DFPSL EE+ A SE+ RVPSP L +IH P+SASA+I DKWTSA Sbjct: 177 DSVGTVHKVVFERDFPSLGSEERQATSEVGRVPSPGLNTAIHGLPISASAIIAGDKWTSA 236 Query: 1581 LAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSA--KAHAPGSP-TMDSSISTALS 1411 LAEVP +V G G GTG+S +A P SP ++ SS S L+ Sbjct: 237 LAEVPAIV---GGG------------------GTGLSPGRQASLPSSPASLPSSTSAGLN 275 Query: 1410 MAETVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEKPKAKTG 1234 MAETVA PRV+ P++++ +QR +ELAI+QSRQLIP+TPS PK + N +K KAK G Sbjct: 276 MAETVAQ-GPRVQAAPKITSGTQRLEELAIRQSRQLIPMTPSMPKPSILNSSDKGKAKAG 334 Query: 1233 QP-HLIHSP------RGGSAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKRDSR 1075 QP H + SP RGG K D SKTS+ GKL VLKP RERNG S D LSP +R Sbjct: 335 
QPQHPVSSPLLSPSLRGGPVKTDASKTSNAGKLLVLKPPRERNGVSTASKDTLSPTSSTR 394 Query: 1074 TVCSVPAATHSVAGSSSVRAQVNNSGHPAAERKHLLPLLEKRAASQSQTQSRNDFFSLMR 895 S A SV G ++ R N P AERKH LP+LEK+ +SQ+ QSRNDFF+LMR Sbjct: 395 AATSGIAVATSVTGLATSRGPAINPVSPGAERKHALPMLEKKPSSQA--QSRNDFFNLMR 452 Query: 894 KKSISNSSPAPESGPVISASD-KKLGEGEDAALPVTDHSGKVGVLANIHSIKSLEYTNAK 718 KKS+ +SS ++G +SAS + GE E PV V L ++ + Sbjct: 453 KKSMPSSSSVADAGSAVSASTLDEPGELEVIPAPVIHEDEDVPSLDRLNGCQ-------- 504 Query: 717 VSDGHSYNEGESLNSKKNGSTCNXXXXXXXXXXXFLRSLGWEENTEEGGLTDEEINAFYK 538 H+ N+ + S+ + FL LGW+EN +E GLT+EEINAF++ Sbjct: 505 ----HTENDLFGIQSR------SLPLFSEEEEAAFLHQLGWQENADEDGLTEEEINAFFR 554 Query: 537 DVTKYINSKPSWKILQGM-PRFLLALES 457 D++KY+NSKPS K LQG+ P+F L L S Sbjct: 555 DLSKYMNSKPSSKSLQGVQPKFPLLLSS 582 >XP_011092382.1 PREDICTED: uncharacterized protein LOC105172576 [Sesamum indicum] Length = 616 Score = 394 bits (1012), Expect = e-122 Identities = 272/643 (42%), Positives = 364/643 (56%), Gaps = 32/643 (4%) Frame = -2 Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSS-STNDQNLGR 2104 M+++EPTL+P+WLR D+ T K+ R S ++N + R Sbjct: 1 MERSEPTLIPEWLRSAGSLNGGGSISHS--------DEQTTTKLARNKSLVNSNGHDSAR 52 Query: 2103 SSASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRR 1924 S +S+R S +FRRS S+NGS +LR+ + F +NHHDRD +S +KD+++L D R Sbjct: 53 SFSSDRTTSSYFRRSSSSNGSGHLRSHSSFGRNHHDRDWEKDACDSRDKDKSVLGDRWHR 112 Query: 1923 DFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNAN----RA 1756 DFSD++GN + ER LRRS S+IS +R +T + V A S NN N + Sbjct: 113 DFSDAMGNTLLSKFERDGLRRSQSMISGKRGDTWHKKVGTDLNIA---SGNNTNGLPSKG 169 Query: 1755 SDNVLLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTSA 1582 S ++K +FE+DFPSL EE+ A E+ RVPSP ++ ++ + P+ +I +KW SA Sbjct: 170 SPIGGVNKTTFERDFPSLGAEERAAIPEVGRVPSPGVSSALQSLPIGTPTIIRGEKWRSA 229 Query: 1581 LAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTMDSSISTALSMAE 1402 LAEVP LV N TG+S+ A S+ + A GS +T+L+MAE Sbjct: 230 LAEVPVLVGNNVTGISSVQQAAPS------------SSASVALGS-------TTSLNMAE 270 Query: 1401 
TVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEKPKAKTG-QP 1228 VA P R +TTPQLS +QR +ELAIKQSRQLIPVTPS PK L +K K K G Q Sbjct: 271 AVAQGPSRAQTTPQLSIGTQRLEELAIKQSRQLIPVTPSMPKPLAACSADKQKTKVGQQQ 330 Query: 1227 HLI-------HSPRGGSAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKRDSRTV 1069 H++ SPRGG K DVSKTS++GKL VLKPVRE+NG + +NLSP S+ V Sbjct: 331 HVVTSSLAANQSPRGGPVKADVSKTSNVGKLHVLKPVREKNGTTPVVKENLSPTSGSKLV 390 Query: 1068 CSVPAATHSVAGSSSVRAQVNNSGHPAAERKHLLPLLEKRAASQSQTQSRNDFFSLMRKK 889 S P A S++GS++ R NN P A+RK + +LEKR SQ+ QSRNDFF+ +RKK Sbjct: 391 SS-PLAAPSLSGSAATRVLPNN---PVADRKPVWTVLEKRPTSQA--QSRNDFFNSVRKK 444 Query: 888 S-----------ISNSSPAPESGPVISASDKKLGEGEDAALPVT-DHSGKVGVL---ANI 754 S I+NSSP + + KL E E P T D + GV N+ Sbjct: 445 SMANSTSVADAAIANSSPVDTAPAASPSFSDKLTETEIVVAPNTQDRNASSGVNLSGENL 504 Query: 753 HSIKSLEYTNAKVSDGHSYNEGESLNSKKNGSTCNXXXXXXXXXXXFLRSLGWEENTEEG 574 +S N V D +Y N KKN + + FLRSLGWEEN +EG Sbjct: 505 SGTRSDTACNGDVCDAQNYVS----NGKKNHT--SDPIFSEEEEAAFLRSLGWEENADEG 558 Query: 573 GLTDEEINAFYKDVTKYINSKPSWKILQGM-PRFLLALESQIG 448 GLTDEEI+AF++DVTKY++SKPS KILQ + P+ LL +S IG Sbjct: 559 GLTDEEISAFFRDVTKYVDSKPSLKILQAVQPKILLPFDSHIG 601 >KVH93465.1 hypothetical protein Ccrd_004483 [Cynara cardunculus var. 
scolymus] Length = 636 Score = 376 bits (966), Expect = e-115 Identities = 262/586 (44%), Positives = 342/586 (58%), Gaps = 18/586 (3%) Frame = -2 Query: 2172 DDYGTPKVVRKNSS-STNDQNLGRSSASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHD 1996 D+ G K R S + +D LGR S S+R S +FRR+ S+NGS +LR+ + F +NH D Sbjct: 73 DEQGVSKATRNKSFVNISDNELGRPSVSDRTTSSYFRRT-SSNGSSHLRSYSSFGRNHRD 131 Query: 1995 RDSNDSVYESGNKDRTILEDFRRRDFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPR 1816 RD + ++E K++ D R RD+SD LGN R E++ LRRSHS +S +R E+ PR Sbjct: 132 RDWDKDIHEFREKEKP---DGRLRDYSDPLGNILPSRFEKEGLRRSHSSVSAKRGESWPR 188 Query: 1815 NVEAGSKSAYKISHNNANRASDN---VLLHKASFEQDFPSL--EEKLAASEIRRVPSPCL 1651 V S SA K SHNN + + K +FE+DFPSL EEK EI RVPSP L Sbjct: 189 KVVVDSSSANKNSHNNGSALRSGAGAIGSVKTAFERDFPSLGAEEKQIDPEIGRVPSPGL 248 Query: 1650 TPSIHTFPVSASAMIGSDKWTSALAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVS 1471 T +I + P+ SA+IG D WTSALAEVP +V +NG+ S VP +++ T +S Sbjct: 249 TTAIQSLPIGNSAVIGGDGWTSALAEVPVIVGSNGSNTS--------VPPPLQS--TSIS 298 Query: 1470 AKAHAPGSPTMDSSISTALSMAETVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVT 1294 A A S++T +MAET+A PPR +T PQLS +QR +ELA+KQSRQLIP+T Sbjct: 299 ATA----------SMATGRNMAETLAQGPPRAQTAPQLSVGTQRLEELAVKQSRQLIPMT 348 Query: 1293 PSAPKALVPNPPEKPKAKTGQ-----PHLI---HSPRGGSAKPDVSKTSSIGKLQVLKPV 1138 PS PKAL N +KPK+K GQ HL+ HSPR S K DVSKTSS+GKL VLKP Sbjct: 349 PSLPKALALNSSDKPKSKVGQLQLQSSHLVNHTHSPRPVSTKFDVSKTSSVGKLHVLKPS 408 Query: 1137 RERNGDSLPPMDNLSPKRDSRTVCSVPAATHSVAGSSSVRAQVNNSGHPAAERKHLLPLL 958 RERNG + DNLSP S+ S P A SV GS+ +R NN A + + L Sbjct: 409 RERNGITPIAKDNLSPTGASKLPNS-PLAVTSVVGSAPLRNLGNNPAVAVAVKPGVAATL 467 Query: 957 EKRAASQSQTQSRNDFFSLMRKKSISNSSP--APESGPVISASDKKLGEGEDAALPVTDH 784 EKR +SQ+ QSRNDFF+LMRKKS++N+S P++G ISA DK V D Sbjct: 468 EKRPSSQA--QSRNDFFNLMRKKSMTNNSSPVTPDTGSSISAGDKPTATEGGIDPAVVDG 525 Query: 783 SGKVGVLANIHSIKSLEYTNAKVSDGHSYNEGESLNSKKNGSTCNXXXXXXXXXXXFLRS 604 SG GV + + L N + + E N K N S+ FLRS Sbjct: 526 SG--GVQVSSGNKVDLSSCNGEAT--------ERSNGKNNSSSDAIILYSEEEEARFLRS 575 Query: 603 
LGWEE-NTEEGGLTDEEINAFYKDVTKYINSKPSWKILQGMPRFLL 469 LGWEE EE GLT+EEI++FY+DV+KY+N + + KI + P+ L+ Sbjct: 576 LGWEETGEEEEGLTEEEISSFYRDVSKYLNLQAASKIFK--PKLLM 619 >XP_002265987.2 PREDICTED: uncharacterized protein LOC100241871 [Vitis vinifera] Length = 665 Score = 375 bits (963), Expect = e-115 Identities = 266/671 (39%), Positives = 362/671 (53%), Gaps = 60/671 (8%) Frame = -2 Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSSSTNDQNLGRS 2101 MDK EP LVP+WL+ SDD K RK ++ND + GRS Sbjct: 1 MDKTEPALVPEWLKSSGSVTGGGSTNHHFAPSLLQSDDGAALKPARKLMVNSNDHDTGRS 60 Query: 2100 SASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRRD 1921 S ER S +FRRS S+NGS + R+ + F + + +R+ +++ +KD+++L D R RD Sbjct: 61 SNLERTTSSYFRRSSSSNGSGHPRSFSSFGRTNREREWEKDIHDYRDKDKSVLSDHRHRD 120 Query: 1920 FSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNANR--ASDN 1747 +SD LGN R+ER LRRS S+I+ +R + PR V A + K H+N + AS Sbjct: 121 YSDPLGNILPGRLERDMLRRSQSMITGKRGDMWPRKVAADVSTVNKTIHSNGDGQLASGI 180 Query: 1746 VL--LHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTSAL 1579 V + KA+F+++FPSL E+K A +I RV SP LT +I + P+ + +IG D WTSAL Sbjct: 181 VTSSVQKAAFDRNFPSLGAEDKQGAPDIGRVTSPGLTSAIQSLPIGNTVVIGGDGWTSAL 240 Query: 1578 AEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTMDSSISTALSMAET 1399 AEVP ++ +N T G ++ + S ++ S ++ L+MAET Sbjct: 241 AEVPVIIGSN-------------------TTGVSSVQQSVSASSVSVAPSTTSGLNMAET 281 Query: 1398 VAHLPPRVKT--TPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEKPKAKTG-Q 1231 + P R + TPQLS +QR +ELA+KQSRQLIP+TPS PK LVP+P +KPK+K G Q Sbjct: 282 LVQGPARARANATPQLSVGTQRLEELALKQSRQLIPMTPSMPKTLVPSPSDKPKSKIGLQ 341 Query: 1230 P-HLI-HSPRGGSAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKRDSRTVCSVP 1057 P HL+ HS RGG A+ DV+KTS++GKL VLKP RERNG S D+LSP SR S Sbjct: 342 PLHLVNHSQRGGPARSDVTKTSNVGKLHVLKPSRERNGVSPTAKDSLSPTMGSRVANSPL 401 Query: 1056 AATHSVAGSSSVRAQVNNSGHPAAERKHLLPLLEKRAASQSQTQSRNDFFSLMRKKSISN 877 A T S AGS+S+R+ NN +AER+ + L SQ QSRNDFF+LMRKKS +N Sbjct: 402 AVTPSAAGSASLRSPRNNPTLASAERRPSVVLTSVEKRPTSQAQSRNDFFNLMRKKSSTN 461 Query: 
876 -SSPAPESGPVISASDKKLGE---GEDAALPVT---------DHSG-------------- 778 S PESGP +S+S + + E PVT D+SG Sbjct: 462 PPSAVPESGPAVSSSVSEKSDELITEVVTAPVTPKGRDILSSDNSGLDWSNENRGDKTEN 521 Query: 777 ----KVGVLANIHSIKSLEYTNAKVSDGHSYNEGESLNSKKNGSTC-------------- 652 GV N ++ N D ++G+ ++ NG C Sbjct: 522 GNNEACGVSQNDRD-DEIDNVNGDACDVSQRDQGDEVHD-GNGDACDVSQKFLDNGEKHS 579 Query: 651 --NXXXXXXXXXXXFLRSLGWEENTEEGGLTDEEINAFYKDVTKYINSKPSWKILQGM-P 481 + FLRSLGWEEN E+ GLT+EEINAFYK+ K KPS +LQ M P Sbjct: 580 SPDEVLYPDEEEAAFLRSLGWEENGEDEGLTEEEINAFYKECMKL---KPSSNLLQRMLP 636 Query: 480 RFLLALESQIG 448 + L+SQ+G Sbjct: 637 KISPLLDSQMG 647 >KVI10980.1 hypothetical protein Ccrd_010614 [Cynara cardunculus var. scolymus] Length = 629 Score = 364 bits (934), Expect = e-111 Identities = 260/599 (43%), Positives = 349/599 (58%), Gaps = 27/599 (4%) Frame = -2 Query: 2172 DDYGTPKVVRKNSS-STNDQNLGRSSASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHD 1996 D+ G+ K R SS +++D +LGR+S S+R S +FRR+ NGS +LR+ + F +NH D Sbjct: 67 DEQGSSKSGRNKSSVNSSDNDLGRTSVSDRTTSSYFRRTSLGNGSTHLRSYSSFGRNHRD 126 Query: 1995 RDSNDSVYESGNKDRTILEDFRRRDFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPR 1816 RD + +YE +K+++ D R RD+SD L N R E+ LRRSHS +S +R E+ PR Sbjct: 127 RDWDKDIYEFWSKEKS---DNRHRDYSDPLDNILPSRFEKDGLRRSHSSVSGKRGESWPR 183 Query: 1815 NVEAGSKSAYKISHNNANR---ASDNVLLHKASFEQDFPSL--EEKLAASEIRRVPSPCL 1651 V + A K SH+N ++ K SFE+DFPSL +EK A +I RVPSP L Sbjct: 184 KVVSDLSIANKSSHSNGTALLSGGSSLSNVKTSFERDFPSLGADEKQADPDIGRVPSPGL 243 Query: 1650 TPSIHTFPVSASAMIGSDKWTSALAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVS 1471 + +I + P+ SA+IG D WTSALAEVP +V +NG N T VS Sbjct: 244 SSAIQSLPIGNSAVIGGDGWTSALAEVPVIVGSNG-------------------NSTSVS 284 Query: 1470 AKAHAPGSPTMDSSISTALSMAETVAHLPPRVKTTPQ-----------LSTESQR-QELA 1327 P S T +S++ +MAET+AH PPR +T PQ L+ +QR +ELA Sbjct: 285 QPVQ-PTSITATTSMTGGRNMAETLAHGPPRTQTAPQVAQMLLMGSTILTVGTQRLEELA 343 Query: 1326 IKQSRQLIPVTPSAPKALVPNPPEKPKAKTGQPHLI---HSPRGGSAKPDVSKTSSIGKL 1156 +KQSRQLIP+TPS PKAL + +KPK K GQ L+ H+PR S K DVSKTS++GKL Sbjct: 344 
VKQSRQLIPMTPSMPKALALSSSDKPKLKIGQSQLVNHPHTPRPLSVKSDVSKTSTVGKL 403 Query: 1155 QVLKPVRERNGDSLPPMDNLSPKRDSRTVCSVPAATHSVAGSSSVRAQVNNSGHPAAERK 976 VLKP RERNG S ++LSP S+ S P A S GS+ +R NN G A ERK Sbjct: 404 LVLKPSRERNGISPTAKESLSPTGGSKLPNS-PLAVPSAIGSAPLRNMGNNPGVTAVERK 462 Query: 975 HLLPLLEKRAASQSQTQSRNDFFSLMRKKS-ISNSSPAPESGPVISASDKKLGEGEDAAL 799 + LEKR +SQ+ QSRN+FF+LMRKKS ISNSS AP++G +S+S+K G A Sbjct: 463 PSVATLEKRPSSQA--QSRNNFFNLMRKKSMISNSSVAPDTGSSVSSSEK---PGAPVAP 517 Query: 798 PVTDHSGKVGVLANIHSIKSLEYT---NAKVSDGHSYNEGESLNSKKNGSTCNXXXXXXX 628 P H G G +N ++ T +A V+ S N G KN S + Sbjct: 518 PA--HLG--GSESNTTVETKVDLTCKGDACVATVRSTNNG------KNHSGPDAVLCSEE 567 Query: 627 XXXXFLRSLGWEENT-EEGGLTDEEINAFYKDVTKYINSKPSWKILQG-MPRFLLALES 457 FLRSLGW+E EE GLT+EEI++FY++ Y+N KP+ KIL+G P+ L+ + S Sbjct: 568 EEARFLRSLGWDETAEEEEGLTEEEISSFYRN---YLNLKPTSKILKGTKPKPLMEISS 623 >XP_017259125.1 PREDICTED: uncharacterized protein LOC108228142 [Daucus carota subsp. sativus] Length = 620 Score = 362 bits (928), Expect = e-110 Identities = 257/621 (41%), Positives = 344/621 (55%), Gaps = 23/621 (3%) Frame = -2 Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSSSTNDQNLGRS 2101 M+K+EPTLVP+WL+ D+ T K R S N+G Sbjct: 1 MEKSEPTLVPEWLKSSGSVTGGVSTNHLNPSLHQ--DNQATLKAARNKSLV----NIGDH 54 Query: 2100 SASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRRD 1921 R S +FRRS S+NG+ +LR+ F +N+ DRD + +++ +K+++ L D + R Sbjct: 55 DIGHRTTSSYFRRS-SSNGTSHLRSYGSFGRNNRDRDWDRDIHDIRDKEKSNLGDRKYRQ 113 Query: 1920 FSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNANR----AS 1753 FSDS +NS R E+ LRR+ S IS E PR V + K+ K +HNN N +S Sbjct: 114 FSDSFESNSLSRFEKDGLRRTQSTISRTGVEPWPRRVPSDLKNIDKSNHNNGNSRLAVSS 173 Query: 1752 DNVLLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTSAL 1579 +HKASF++DFPSL EE+ EI RVPSP L +I P +SA I WTSAL Sbjct: 174 PISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIADGGWTSAL 233 Query: 1578 AEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHA-PGSPTMDSSISTALSMAE 1402 AEVP + + +NGT S+ H+ S ++ S+ T L+MAE Sbjct: 234 
AEVPAM---------------------IGSNGTTASSVPHSVSSSASVVPSMMTGLNMAE 272 Query: 1401 TVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEKPKAKTG--- 1234 T+ PPRV+ PQLS E+QR +ELAIKQSRQLIPVTPS PKALV N +K K K G Sbjct: 273 TLVQGPPRVQADPQLSVETQRLEELAIKQSRQLIPVTPSLPKALVLNSSDKAKGKVGLQQ 332 Query: 1233 ---QPHLIH-SPRGGSAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKRDSRTVC 1066 +L+H SPRG K ++ KTSS+GKLQVLKP RERNG S D LSP S+ Sbjct: 333 QSASTNLVHHSPRGAPTKNEIIKTSSLGKLQVLKPARERNGVSNTSKDTLSPTSSSKLAN 392 Query: 1065 SVPAATHSVAGSSSVRAQVNNSGHPAAERKH-----LLPLLEKRAASQSQTQSRNDFFSL 901 + A + GS+ +R+ +N+S +AERK + P+LEKR + Q+ +SRNDFF+ Sbjct: 393 NPLAPALATVGSAPLRSSMNHSILVSAERKSAPPVMVTPMLEKRPSPQA--KSRNDFFNS 450 Query: 900 MRKKSISNSSPAPESGPVISASDKKLGE-GEDAALPVTDHSGK-VGVLANIHSIKSLEYT 727 MRKKS++NSS A S V + S LG+ E A D G+ V V+ + K E Sbjct: 451 MRKKSMTNSSSA-VSNTVSAVSPSDLGKNSEGEASASLDSQGRDVPVVESSDEGKINECR 509 Query: 726 NAKVSDGHSYNEGESLNSKKNGSTCNXXXXXXXXXXXFLRSLGWEENT-EEGGLTDEEIN 550 + + + H SL++ N S+ + FLRSLGWEEN E+ GLT+EEIN Sbjct: 510 DGSIQNSH--GPQNSLDNGVNHSSTDVILSSEEEEAAFLRSLGWEENAGEDEGLTEEEIN 567 Query: 549 AFYKDVTKYINSKPSWKILQG 487 AFY+DV+KYINS P K L G Sbjct: 568 AFYRDVSKYINSAPPSKTLLG 588 >KVI01779.1 hypothetical protein Ccrd_019964, partial [Cynara cardunculus var. 
scolymus] Length = 551 Score = 356 bits (914), Expect = e-109 Identities = 250/603 (41%), Positives = 343/603 (56%), Gaps = 21/603 (3%) Frame = -2 Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVR-KNSSSTNDQNLGR 2104 M++ EPT VP+WL+ DD G K +R K+ ++ D +LGR Sbjct: 1 MERTEPTFVPEWLKSSGSLSTISHQFTSSSLHP---DDQGVSKSLRTKSLVNSGDNDLGR 57 Query: 2103 SSASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRR 1924 +S S+R S +FRR+ S+NG+ +LR+ N F +NH DRD + +YE +K+++ D R R Sbjct: 58 TSVSDRTTSSYFRRTSSSNGAAHLRSYNSFSRNHRDRDWDKDIYEFRDKEKS---DNRHR 114 Query: 1923 DFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNANR---AS 1753 D+SD L N R E+ LRRSHS +S +R E+ PR V AG K+ HNN + Sbjct: 115 DYSDHLANILPSRFEKDGLRRSHSSLSAKRGESWPRKV-AGDKNG----HNNGSALPSVG 169 Query: 1752 DNVLLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTSAL 1579 + KA+FE+DFPSL EEK A +EI RVPSP LT +I + P+ +SA+I D WTSAL Sbjct: 170 TSSSSGKAAFERDFPSLGAEEKQADTEIGRVPSPGLTTAIQSLPIGSSAVICGDMWTSAL 229 Query: 1578 AEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTMDSSISTALSMAET 1399 AEVP +V +NG+ +S + + P S + +S++T +MAET Sbjct: 230 AEVPMIVGSNGSNISVQ--------------------QPIQPTSVSATTSMTTGRNMAET 269 Query: 1398 VAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEKPKAKTGQ--- 1231 +A P R +TTPQLS +QR +ELA+KQSRQLIP+TPS PKAL N +KPK K GQ Sbjct: 270 LAQGPSRARTTPQLSVGTQRLEELAVKQSRQLIPMTPSMPKALALNSSDKPKLKVGQSQL 329 Query: 1230 --PHLIHSP---RGGSAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKRDSRTVC 1066 H+++ P R S K DV+K S++GKL +LK RERNG + ++LSP S+ Sbjct: 330 QNSHIVNHPPSLRPVSVKSDVTKVSTVGKLHILKSSRERNGTTSTAKESLSPTGGSKLPN 389 Query: 1065 SVPAATHSVAGSSSVRAQVNNSGHP-AAERKHLLPLLEKRAASQSQTQSRNDFFSLMRKK 889 S P A V GS+S+R N G A+RK P +EKR + Q+ QSRNDFF+LMRKK Sbjct: 390 S-PLAVPVVVGSASLR---NTGGSTIVADRK---PCVEKRPSPQA--QSRNDFFNLMRKK 440 Query: 888 SISNSSPAP---ESGPVISASDKKLGEGEDAALP--VTDHSGKVGVLANIHSIKSLEYTN 724 S++ +S +P E+G S +DK GE + V D S V L + Sbjct: 441 SMATNSSSPGASEAGSSESTNDKP-GEPQVGGYDPVVVDRSCGVQTL-----------SE 488 Query: 723 
AKVSDGHSYNEGESLNSKKNGSTCNXXXXXXXXXXXFLRSLGWEENTEEGGLTDEEINAF 544 KV + + E N++KN S+ + FLRSLGWEE TEE GLT+EEIN+F Sbjct: 489 NKVDFSCNGDATERSNNEKNHSSSDAILYSEEEEARFLRSLGWEETTEEEGLTEEEINSF 548 Query: 543 YKD 535 Y+D Sbjct: 549 YRD 551 >XP_012828376.1 PREDICTED: uncharacterized protein LOC105949617 isoform X1 [Erythranthe guttata] Length = 575 Score = 353 bits (905), Expect = e-107 Identities = 252/635 (39%), Positives = 347/635 (54%), Gaps = 24/635 (3%) Frame = -2 Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSS-STNDQNLGR 2104 MD++EP+LVPQWL+ D++ +V R S +TN + GR Sbjct: 1 MDRSEPSLVPQWLKNSGSSTGGG-------------DNHPASRVARNKSFVNTNGNDFGR 47 Query: 2103 SSASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRR 1924 +S S + S +FRRS S+N S + ++ + F +N DRD Y S +K+R +L R R Sbjct: 48 ASGSAKTTSSYFRRSSSSNSSGSSKSYSSFGRNQRDRDWEKDTYNSRDKERLVLGGDRHR 107 Query: 1923 -DFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNAN----R 1759 + S+ LGN S + ER LRRSHS+IS + ET P+ V S S NN N + Sbjct: 108 YESSELLGNPSLSKYERDGLRRSHSMISGKHGETWPKKVVTESSSGS--GKNNGNGFLAK 165 Query: 1758 ASDNVLLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTS 1585 S + +KA+FE+DFPSL +++ E+ RV SP L+ ++ + P+ +SA IG ++WTS Sbjct: 166 GSPVGVANKATFERDFPSLGTDDRAVVPEVGRVASPGLSSALQSLPIGSSASIGGERWTS 225 Query: 1584 ALAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTG-VSAKAHAPGSPTMDSSIS--TAL 1414 ALAEVP L V +NGT +S + AP S T +S T+L Sbjct: 226 ALAEVPML---------------------VVSNGTASLSVQQAAPSSTTASVVVSSTTSL 264 Query: 1413 SMAETVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEKPKAKT 1237 +MAE VA P R +T PQLS +QR +ELAIKQSRQLIPVTP+ PK LV + +K K+K Sbjct: 265 NMAEAVAQGPTRAQTAPQLSLGTQRLEELAIKQSRQLIPVTPTMPKTLVLSSSDKQKSKV 324 Query: 1236 G--QPH-------LIHSPRGGS-AKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPK 1087 G Q H + SPRG +KPD SK S++GKL VLKPVRE+NG + D LSP Sbjct: 325 GLIQQHPTPSSLPINQSPRGAPPSKPDFSKASNVGKLHVLKPVREKNGVTPSVKDKLSPT 384 Query: 1086 RDSRTVCSVPAATHSVAGSSSVRAQVNNSGHPAAERKHLLPLLEKRAASQSQTQSRNDFF 907 + V S A+ P+A + L LEKR +Q+Q SRNDFF Sbjct: 385 
GSGKAVNSTLPAS------------------PSAVKPLLTTALEKRPTTQAQ--SRNDFF 424 Query: 906 SLMRKKSISNSSPAPESGPVISASDKKLGEGEDAALPVTDHSGKVGVLANIHSIKSLEYT 727 MR+KS+SNSS A E+G IS AA+ +G V L ++++ Sbjct: 425 KRMREKSVSNSSSASETGTAISPEKHAKVAVVPAAI-----TGAVEPLPEEKAVRTTCNG 479 Query: 726 NAK-VSDGHSYNEGESLNSKKNGSTCNXXXXXXXXXXXFLRSLGWEENTEEGGLTDEEIN 550 + +S+G YN ++ ++ FLRS+GW+EN +EGGLT+EEI+ Sbjct: 480 GVQHISNGKKYNSEPIISEEEEAK--------------FLRSMGWDENDDEGGLTEEEIS 525 Query: 549 AFYKDVTKYINSKPSWKILQGMP-RFLLALESQIG 448 AFY+D TKYINSKPS +ILQG+ +FLL +SQIG Sbjct: 526 AFYRDFTKYINSKPSLRILQGVRLKFLLPFDSQIG 560 >KZV20788.1 hypothetical protein F511_30430 [Dorcoceras hygrometricum] Length = 601 Score = 353 bits (906), Expect = e-107 Identities = 248/622 (39%), Positives = 335/622 (53%), Gaps = 19/622 (3%) Frame = -2 Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNS-SSTNDQNLGR 2104 MDK+EPTLVP+WL+ DD PK+ R NS S+N + GR Sbjct: 1 MDKSEPTLVPEWLKNSGNQSGGGSTLHS--------DDKSAPKLSRNNSFMSSNGHDFGR 52 Query: 2103 SSASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRR 1924 SS+SE+ S +F RS S+NGS NLR+ N F +N DRD Y+S +KD+++ D R Sbjct: 53 SSSSEKTTSSYFHRSSSSNGSGNLRSYNSFGRNRRDRDWEKDRYDSQDKDKSVSGDRWHR 112 Query: 1923 DFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNAN----RA 1756 FSDS GN+ S + E LRRS S S +T + V S SA NN N + Sbjct: 113 VFSDSSGNSFSGKFEWDGLRRSQSATSGPHGDTWTKKVVTDSSSA---GGNNTNTLLTKG 169 Query: 1755 SDNVLLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTSA 1582 + + K FE++FPSL EE+ E+ RVPSP L+ +I + P+ +A +G +KWTSA Sbjct: 170 APGGGVTKTRFERNFPSLGSEERAVIPEVGRVPSPGLSSAIQSLPIGTAAAVGGEKWTSA 229 Query: 1581 LAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTMDSSISTALSMAE 1402 LAEVP +V +NG GVS+ + S + SS +T L+MAE Sbjct: 230 LAEVPVIVGSNGIGVSSVTQS----------------------ASTQLASSTTTTLNMAE 267 Query: 1401 TVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEKPKAKTG-QP 1228 VA P R PQ+S +QR +ELAIKQSRQLIPVTPS PK LV N +K K K G Q Sbjct: 268 AVAQGPSRSPAMPQISVGTQRLEELAIKQSRQLIPVTPSMPKTLVSNSSDKQKTKLGQQQ 327 Query: 1227 
HLIHSPRGGSAKPDVSKTSS-IGKLQVLKPVRERNGDSLPPMDNLSPKRDSRTVCSVPAA 1051 H I+S +K D+SK+SS +GKL VLK RE+NG + DNLSP V S Sbjct: 328 HSINSLPINHSKSDMSKSSSNVGKLHVLKLTREKNGVAPVVKDNLSPTTAVNAVSSTLLT 387 Query: 1050 THSVAGSSSVRAQVNNSGHPAAERKHLLPLLEKRAASQSQTQSRNDFFSLMRKKSISNSS 871 + SV G+ + + N P RK L +LEKR SQ+Q QSR +FF+L+RKKS++ S+ Sbjct: 388 SPSVTGAVASKGPPN---MPVLNRKPSLAVLEKRNTSQAQAQSRKEFFNLVRKKSMAIST 444 Query: 870 PAP--------ESGPVISASDKKLGEGEDAALPVTDHSGKVGVLANIHSIKSLEYTNAKV 715 A +SG +S + E ED P T A++ E + Sbjct: 445 SATDAENFSSVDSGHAVSPPPSETSEKEDVPAPNTSQIDDAQSSASLSDDLFPEKRDDVT 504 Query: 714 SDGHSYNEGESLNSKKNGSTCNXXXXXXXXXXXFLRSLGWEENTEEGGLTDEEINAFYKD 535 + + + L + N S FLRSLGWEEN++EGGLT+EEI++F+KD Sbjct: 505 CPDDTCSMPKYLGNGMNASM--DPLFSEEEEAAFLRSLGWEENSDEGGLTEEEISSFFKD 562 Query: 534 VTKYINSKPSWKILQGM-PRFL 472 TKY NSKP+ +IL+ + P+F+ Sbjct: 563 ATKY-NSKPALRILEVVQPKFI 583 >KZM90816.1 hypothetical protein DCAR_021819 [Daucus carota subsp. sativus] Length = 617 Score = 352 bits (904), Expect = e-107 Identities = 255/621 (41%), Positives = 341/621 (54%), Gaps = 23/621 (3%) Frame = -2 Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSSSTNDQNLGRS 2101 M+K+EPTLVP+WL+ D+ T K R S N+G Sbjct: 1 MEKSEPTLVPEWLKSSGSVTGGVSTNHLNPSLHQ--DNQATLKAARNKSLV----NIGDH 54 Query: 2100 SASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRRD 1921 R S +FRRS S+NG+ +LR+ F +N+ DRD + +++ +K+++ L D + R Sbjct: 55 DIGHRTTSSYFRRS-SSNGTSHLRSYGSFGRNNRDRDWDRDIHDIRDKEKSNLGDRKYRQ 113 Query: 1920 FSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNANR----AS 1753 FSDS +NS R E+ LRR+ S IS E PR V + K+ K +HNN N +S Sbjct: 114 FSDSFESNSLSRFEKDGLRRTQSTISRTGVEPWPRRVPSDLKNIDKSNHNNGNSRLAVSS 173 Query: 1752 DNVLLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTSAL 1579 +HKASF++DFPSL EE+ EI RVPSP L +I P +SA I WTSAL Sbjct: 174 PISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIADGGWTSAL 233 Query: 1578 AEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHA-PGSPTMDSSISTALSMAE 1402 AEVP + + +NGT S+ H+ S ++ S+ T L+MAE Sbjct: 234 
AEVPAM---------------------IGSNGTTASSVPHSVSSSASVVPSMMTGLNMAE 272 Query: 1401 TVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEKPKAKTG--- 1234 T+ PPRV+ PQLS E+QR +ELAIKQSRQLIPVTPS PKALV N +K K K G Sbjct: 273 TLVQGPPRVQADPQLSVETQRLEELAIKQSRQLIPVTPSLPKALVLNSSDKAKGKVGLQQ 332 Query: 1233 ---QPHLIH-SPRGGSAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKRDSRTVC 1066 +L+H SPRG K ++ KTSS+GKLQVLKP RERNG S D LSP S+ Sbjct: 333 QSASTNLVHHSPRGAPTKNEIIKTSSLGKLQVLKPARERNGVSNTSKDTLSPTSSSKLAN 392 Query: 1065 SVPAATHSVAGSSSVRAQVNNSGHPAAERKH-----LLPLLEKRAASQSQTQSRNDFFSL 901 + A + GS+ +R+ +N+S +AERK + P+LEKR + Q+ +SRNDFF+ Sbjct: 393 NPLAPALATVGSAPLRSSMNHSILVSAERKSAPPVMVTPMLEKRPSPQA--KSRNDFFNS 450 Query: 900 MRKKSISNSSPAPESGPVISASDKKLGE-GEDAALPVTDHSGK-VGVLANIHSIKSLEYT 727 MRKKS++NSS A S V + S LG+ E A D G+ V V+ + K E Sbjct: 451 MRKKSMTNSSSA-VSNTVSAVSPSDLGKNSEGEASASLDSQGRDVPVVESSDEGKINECR 509 Query: 726 NAKVSDGHSYNEGESLNSKKNGSTCNXXXXXXXXXXXFLRSLGWEENT-EEGGLTDEEIN 550 + + + H SL++ N S+ + FLRSLGWEEN E+ GLT+EEIN Sbjct: 510 DGSIQNSH--GPQNSLDNGVNHSSTDVILSSEEEEAAFLRSLGWEENAGEDEGLTEEEIN 567 Query: 549 AFYKDVTKYINSKPSWKILQG 487 AFY+D YINS P K L G Sbjct: 568 AFYRD---YINSAPPSKTLLG 585 >XP_012828377.1 PREDICTED: uncharacterized protein LOC105949617 isoform X2 [Erythranthe guttata] Length = 550 Score = 327 bits (839), Expect = 3e-98 Identities = 234/607 (38%), Positives = 326/607 (53%), Gaps = 22/607 (3%) Frame = -2 Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSS-STNDQNLGR 2104 MD++EP+LVPQWL+ D++ +V R S +TN + GR Sbjct: 1 MDRSEPSLVPQWLKNSGSSTGGG-------------DNHPASRVARNKSFVNTNGNDFGR 47 Query: 2103 SSASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRR 1924 +S S + S +FRRS S+N S + ++ + F +N DRD Y S +K+R +L R R Sbjct: 48 ASGSAKTTSSYFRRSSSSNSSGSSKSYSSFGRNQRDRDWEKDTYNSRDKERLVLGGDRHR 107 Query: 1923 -DFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNAN----R 1759 + S+ LGN S + ER LRRSHS+IS + ET P+ V S S NN N + Sbjct: 108 YESSELLGNPSLSKYERDGLRRSHSMISGKHGETWPKKVVTESSSGS--GKNNGNGFLAK 165 Query: 1758 
ASDNVLLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTS 1585 S + +KA+FE+DFPSL +++ E+ RV SP L+ ++ + P+ +SA IG ++WTS Sbjct: 166 GSPVGVANKATFERDFPSLGTDDRAVVPEVGRVASPGLSSALQSLPIGSSASIGGERWTS 225 Query: 1584 ALAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTMDSSIS--TALS 1411 ALAEVP LV +NGT +S + AP S T +S T+L+ Sbjct: 226 ALAEVPMLVVSNGT--------------------ASLSVQQAAPSSTTASVVVSSTTSLN 265 Query: 1410 MAETVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEKPKAKTG 1234 MAE VA P R +T PQLS +QR +ELAIKQSRQLIPVTP+ PK LV + +K K+K G Sbjct: 266 MAEAVAQGPTRAQTAPQLSLGTQRLEELAIKQSRQLIPVTPTMPKTLVLSSSDKQKSKVG 325 Query: 1233 --QPH-------LIHSPRGG-SAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKR 1084 Q H + SPRG +KPD SK S++GKL VLKPVRE+NG + D LSP Sbjct: 326 LIQQHPTPSSLPINQSPRGAPPSKPDFSKASNVGKLHVLKPVREKNGVTPSVKDKLSPTG 385 Query: 1083 DSRTVCSVPAATHSVAGSSSVRAQVNNSGHPAAERKHLLPLLEKRAASQSQTQSRNDFFS 904 + V S A+ P+A + L LEKR +Q+ QSRNDFF Sbjct: 386 SGKAVNSTLPAS------------------PSAVKPLLTTALEKRPTTQA--QSRNDFFK 425 Query: 903 LMRKKSISNSSPAPESGPVISASDKKLGEGEDAALPVTDHSGKVGVLANIHSIKSLEYTN 724 MR+KS+SNSS A E+G IS + A +P +G V L ++++ Sbjct: 426 RMREKSVSNSSSASETGTAISPEK----HAKVAVVPAA-ITGAVEPLPEEKAVRTTCNGG 480 Query: 723 AK-VSDGHSYNEGESLNSKKNGSTCNXXXXXXXXXXXFLRSLGWEENTEEGGLTDEEINA 547 + +S+G YN ++ ++ FLRS+GW+EN +EGGLT+EEI+A Sbjct: 481 VQHISNGKKYNSEPIISEEEEAK--------------FLRSMGWDENDDEGGLTEEEISA 526 Query: 546 FYKDVTK 526 FY+D TK Sbjct: 527 FYRDFTK 533 >XP_010245092.1 PREDICTED: uncharacterized protein LOC104588732 isoform X1 [Nelumbo nucifera] Length = 645 Score = 325 bits (833), Expect = 3e-96 Identities = 251/656 (38%), Positives = 347/656 (52%), Gaps = 45/656 (6%) Frame = -2 Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSS-STNDQNLGR 2104 M K+EPTLVP+WL+ SDD R SS S D + R Sbjct: 1 MAKSEPTLVPEWLKGTGGITGAGSTTHHFASSSLQSDDNAVALPTRNRSSLSIGDYDTPR 60 Query: 2103 SSA-SERVKSPHFRRSFSNNGSV--------NLRTSNIFCKNHHDRDSNDSVYESGNKDR 1951 SSA S+R S + RRS S+NGS+ R+ + F ++H DRD + + +K+R Sbjct: 61 
SSAFSDRTSSAYSRRSSSSNGSIVHDKEIPSYTRSYSAFARSHRDRDWEKDILDFRDKER 120 Query: 1950 TILEDFRRRDFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHN 1771 ++ D R DFSD L + + R+E+ +LRRS S++S +R E PR V A +++ Sbjct: 121 SVPGDHRDLDFSDPLVSILTSRIEKDTLRRSQSMVSGKRGEVWPRKVAAD------LNNG 174 Query: 1770 NANRASDNVLL---------HKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPV 1624 N N+ + N LL KA+FE+DFPSL EEK +I RV SP L+ ++ + P+ Sbjct: 175 NINQNTSNGLLVGGSIVSSIQKAAFERDFPSLGAEEKPGTPDIGRVSSPGLSSAVQSLPM 234 Query: 1623 SASAMIGSDKWTSALAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSP 1444 +SA+IG D WTSALAEVP ++ NGTG+S+ A T G+ S ++ Sbjct: 235 GSSALIGGDGWTSALAEVPMIIGNNGTGISSVQQA---------TLGSSASGATNS---- 281 Query: 1443 TMDSSISTALSMAETVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVP 1267 ST L+MAET+A P R + +PQLS E+QR +ELAIKQSRQLIP+TPS PK V Sbjct: 282 ------STGLNMAETLAQAPSRARISPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSVL 335 Query: 1266 NPPEKPK------------AKTGQPHLIHSPRGGSAKPDVSKTSSIGKLQVLKPVRERNG 1123 N EK K KT Q + S RG + DVSKTS GKL VLK RE+NG Sbjct: 336 NSLEKAKPKISVRTGEMNATKTIQQQQLSSLRGAPMRSDVSKTSHGGKLLVLKAPREKNG 395 Query: 1122 DSLPPMDNLSPKRDSRTVCSVPAATHSVAGS---SSVRAQVNNSGHPAAERKHLLPLLEK 952 S D SP S+ + A S A + S ++++N AA +EK Sbjct: 396 ISPIAKDGQSPTNVSKVANNPLALAPSAAFTPLKSPNNSKLSNERKSAAASLMHGSSVEK 455 Query: 951 RAASQSQTQSRNDFFSLMRKKSISN-SSPAPESGPVISAS--DKKLGEGEDAALPVTDHS 781 R + SQ QSRNDFF+LMRKK+ N SS AP+ PV+S+S DK + A PV+ S Sbjct: 456 RPTT-SQVQSRNDFFNLMRKKTSGNLSSAAPDPSPVVSSSLLDKSTEQTALPAAPVSPQS 514 Query: 780 GKVGVLANIHSIKSLEYTNAKVSDGHSYNEGES-LNSKKNGSTCNXXXXXXXXXXXFLRS 604 S E + +S+G++ E + LN+ + S+ + FLRS Sbjct: 515 SDAPSPDPSCLDWSTENGSETISNGNASEESQRFLNNGEKHSSPDAFVYPDEEEAAFLRS 574 Query: 603 LGWEENT-EEGGLTDEEINAFYKDVTKYINSKPSWKILQG---MPRFLLALESQIG 448 LGW+EN EE GLT+EEI+AFYK+ Y+ +PS K+ +G + + LES++G Sbjct: 575 LGWDENAGEEEGLTEEEISAFYKE---YMKLRPSSKLCRGSQQQVKLPMPLESRVG 627 >XP_017258903.1 PREDICTED: uncharacterized protein LOC108227982 [Daucus carota subsp. 
sativus] Length = 591 Score = 320 bits (821), Expect = 4e-95 Identities = 239/621 (38%), Positives = 326/621 (52%), Gaps = 27/621 (4%) Frame = -2 Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXS-DDYGTPKVVRKNSSSTND---QN 2113 M+KNEPT VP+WL+ DD T K R N SS +D N Sbjct: 1 MEKNEPTFVPEWLKSSGSVTSAVSTNHHQIASSSLLSDDRATLKSTR-NKSSIDDISAHN 59 Query: 2112 LGRSSASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDF 1933 G S S+R S +FRRS ++NGS LR+ F + + D+ + E + D+ + D Sbjct: 60 SGSSPVSDRTTSSYFRRSSTSNGS-QLRSYGSFGRTNRDKGWDKDTNEYHDSDKLRIGDH 118 Query: 1932 RRRDFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNANR-- 1759 R R+FSD LG+N S R E+ L+R+ S IS + +E R V A S K ++NN + Sbjct: 119 RHRNFSDPLGSNFSNRFEKDGLKRTQSSISGKYNEPWSRKVSADMNSFDKSNYNNGSSLL 178 Query: 1758 --ASDNVLLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKW 1591 +S + KA+F++DFPSL +E+ E+RRVPSP L+ ++ P+ SA+ G W Sbjct: 179 AGSSAISTVRKAAFDRDFPSLGADERQTDYELRRVPSPGLSTNMQNLPIGYSAVTGEIGW 238 Query: 1590 TSALAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTMDSSISTALS 1411 TSALAEV V NG S+ A A P S ++ SS+++ L+ Sbjct: 239 TSALAEVQVKVGANGINKSSVAQAAL-------------------PSSASVASSMTSGLN 279 Query: 1410 MAETVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEKPKAKTG 1234 MAET+A PP V T Q S +QR +E+AIKQS+QLIPVTPS PKALV N EK K K Sbjct: 280 MAETLAQGPPHVHAT-QFSVGTQRLEEIAIKQSKQLIPVTPSMPKALVLNSSEKSKTKAA 338 Query: 1233 QP--------HLIHSPRGGSAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKRDS 1078 Q H HSPRG K D+SKTSS+GKLQVLKP RERN S D LSP S Sbjct: 339 QQQHQTSSTHHFNHSPRGTPMKSDMSKTSSLGKLQVLKPARERNDISYQTKDTLSPTNAS 398 Query: 1077 RTVCSVPAATHSVAGSSSVRAQVNNSGHPAAERKHLLPLLEKRAASQSQTQSRNDFFSLM 898 + + A SV S+R+ + N P + +LEK+ + +Q +SRNDFF+L+ Sbjct: 399 KVPNNPLTAASSVGVPPSLRSPIKN---PIVASGVVPTVLEKKPS--AQLRSRNDFFNLV 453 Query: 897 RKKSISN-SSPAPESGPVISASDKKLGEGEDAALPVTDHSGKVGVLANIHSIKSLEY--- 730 RKKS++N SSP +S +S S + A P G+ +LAN +++Y Sbjct: 454 RKKSLTNHSSPVVDSVSTVSQSILEQPSEHKAGAP---PPGEDSLLAN--QSDTVQYKMN 508 Query: 729 ---TNAKVSDGHSYNEGESLNSKKNGSTCNXXXXXXXXXXXFLRSLGWEENT-EEGGLTD 562 
+N DG + N + S+ + FLRSLGW+EN E+ GLT+ Sbjct: 509 GLISNRDACDGTPKSPDNGENGETRSSS-DVILCSEEEEAAFLRSLGWDENAGEDEGLTE 567 Query: 561 EEINAFYKDVTKYINSKPSWK 499 EEI FY+D +KYI +PS K Sbjct: 568 EEIREFYRDASKYIKPRPSSK 588 >XP_017224884.1 PREDICTED: cell wall protein RBR3-like [Daucus carota subsp. sativus] Length = 585 Score = 318 bits (815), Expect = 3e-94 Identities = 234/613 (38%), Positives = 329/613 (53%), Gaps = 19/613 (3%) Frame = -2 Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVR-KNSSSTNDQNLGR 2104 M+K+EP+ VP+WL+ +D+ T K R K S+ + + GR Sbjct: 1 MEKSEPSFVPEWLKSSGSVTVAVSTNHRQ-------NDHMTLKPTRNKLSADVSAHDSGR 53 Query: 2103 SSASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRR 1924 S S+R S +FRR+ S+NGS N R+ F +N+ DR + E + DR L D R + Sbjct: 54 SPVSDRTTSSYFRRTSSSNGSGNSRSYGSFGRNNRDRGWDRDKNEYRDHDRLRLGDRRHQ 113 Query: 1923 DFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNANR----A 1756 ++S SLG++ S R E+ LRR+ S ++ + E L R V A S+ K ++NN++ + Sbjct: 114 NYSGSLGSDFSDRFEKNGLRRTQSSVAGKHSEPLSRRVSADLNSSNKSNYNNSSSRLLGS 173 Query: 1755 SDNVLLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTSA 1582 S + K SF++DFPSL +E+ IR +PSP L+ ++ + S + WTSA Sbjct: 174 SGISSVRKTSFDRDFPSLGADERQTDHGIRNIPSPGLSTNMQSLSTGYSTVANEVGWTSA 233 Query: 1581 LAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTMDSSISTALSMAE 1402 LAEVP +V NG S+ +A P S ++ SS + +L+MAE Sbjct: 234 LAEVPVMVGANGPITSS-------------------VLQAALPSSTSVPSSTAASLNMAE 274 Query: 1401 TVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEKPKAKTGQPH 1225 T+A P RV T PQ+S E+QR +ELAIKQSRQLIP+TPS PK+LV N EK K K Q Sbjct: 275 TLAQGPLRVDTAPQVSVETQRLEELAIKQSRQLIPMTPSMPKSLVLNSSEKSKVKVSQQQ 334 Query: 1224 ----LIHSPRGGSAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKRDSRTVCSVP 1057 IHS RG K DV KT S+GKLQVLKP RERNG S P +DNLS DS TV + P Sbjct: 335 HQTSSIHSLRGTLEKSDVPKTLSLGKLQVLKPARERNGVSYPEIDNLSLTNDS-TVANNP 393 Query: 1056 AATHSVAGSSSVRAQVNNSGHPAAERKH---LLP-LLEKRAASQSQTQSRNDFFSLMRKK 889 T R Q+ N RK ++P LEK+ + +Q QSRN+FF+L+RKK Sbjct: 394 
LTTLPAVVPPPSRTQIKNPNPLNVNRKPAAIMVPATLEKKPS--AQLQSRNEFFNLVRKK 451 Query: 888 SISNSSPAPESGPVISASDKKLGEGEDAALPVTDHSGKVGVLANIHSIKSL-EYTNAKVS 712 S++ SS +S +S + A P++ GK + AN ++ E NA +S Sbjct: 452 SLTKSSSVADSVSTVSQFVVEQPSETQTASPLS--QGKDSLSANQSNMDHYKENVNALIS 509 Query: 711 DGHSYN-EGESLNSKKNGSTCNXXXXXXXXXXXFLRSLGWEENT-EEGGLTDEEINAFYK 538 + ++ N +S + + S + FLRSLGW+EN E+ GLT+EEIN FY+ Sbjct: 510 NINNGNGHQQSCGNGETRSRSDMILCSEEEEAAFLRSLGWDENAGEDEGLTEEEINEFYR 569 Query: 537 DVTKYINSKPSWK 499 D +KYI S K Sbjct: 570 DASKYIKPGSSSK 582 >XP_010277689.1 PREDICTED: uncharacterized protein YMR317W-like [Nelumbo nucifera] Length = 655 Score = 318 bits (815), Expect = 2e-93 Identities = 258/663 (38%), Positives = 342/663 (51%), Gaps = 52/663 (7%) Frame = -2 Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKN-SSSTNDQNLGR 2104 M K EPTLVP+WL+ SDD+ R + ST D + R Sbjct: 1 MAKGEPTLVPEWLKGTGSITGGGNTTHHFASSSTHSDDHAVALTTRNRLTMSTGDYDTPR 60 Query: 2103 SSAS-ERVKSPHFRRSFSNNGSVN--------LRTSNIFCKNHHDRDSNDSVYESGNKDR 1951 SSA +R S +FRRS S+NGS+ R+ + F ++H DRD + +K++ Sbjct: 61 SSAFLDRTSSAYFRRSSSSNGSMMHDKETSTYSRSYSSFTRSHRDRDWEKDTLDYRDKEK 120 Query: 1950 TILEDFRRRDFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHN 1771 +IL D R RD+SD L + + R E+ +LRRS S+IS +R E R V A + + +HN Sbjct: 121 SILGDHRDRDYSDPLASILTSRXEKDTLRRSQSMISGKRGEGWSRRVAADTNNGNN-NHN 179 Query: 1770 NANR----ASDNVLLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAM 1609 N N S + KA+FE+DFPSL EEK A +I RV SP L+ S+ + P+ +SA+ Sbjct: 180 NGNGLLVGGSIVSSIQKAAFERDFPSLGAEEKQGALDIGRVSSPGLSSSVQSLPIGSSAV 239 Query: 1608 IGSDKWTSALAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTMDSS 1429 IG D WTSALAEVP ++ N G S+ A T S+ + AP S Sbjct: 240 IGGDGWTSALAEVPVIIGNNSIGPSSVQQA------------TPASSTSGAPNS------ 281 Query: 1428 ISTALSMAETVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEK 1252 ST L+MAET+A P R + +PQLS E+QR +ELAIKQSRQLIP+TPS PK N EK Sbjct: 282 -STGLNMAETLAQAPSRTRISPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSALNSSEK 340 Query: 1251 
PK-------------AKTGQ------PHLI-HSPRGGSAKPDVSKTSSIGKLQVLKPVRE 1132 K AKT Q HL+ HS RGG + DV KTS GKL VLK RE Sbjct: 341 AKPKAVVRTGEMGISAKTSQQQQLPSSHLVNHSLRGGPVRSDVPKTSHGGKLLVLKAPRE 400 Query: 1131 RNGDSLPPMDNLSPKRDSRTVCSVPAATHSVAGSSSVRAQVNNSGHPAAERKHLLPLL-- 958 +NG S D LSP S+ V + A + +R+ NNS P ERK + L Sbjct: 401 KNGISPSAKDGLSPTNASKVVNNSLVLAPLAAYAPPMRSP-NNSKLP-NERKSVASSLTH 458 Query: 957 ----EKRAASQSQTQSRNDFFSLMRKKSISN-SSPAPESGPVISAS-DKKLGEGEDA--A 802 EKR + SQ QSRNDFF+LMRKK+ N +S P+ P S+S +K E + Sbjct: 459 GSAVEKRPTT-SQVQSRNDFFNLMRKKTSGNLASAVPDPSPTASSSLLEKSSEPTEVVPT 517 Query: 801 LPVTDHSGKVGVLANIHSIKSLEYTNAKVSDGHSYNEGESL-NSKKNGSTCNXXXXXXXX 625 PV+ S S E VS+G E + N+ + ST + Sbjct: 518 APVSPQSSDAPSSEPSGLDWSTENGGDLVSNGDVSEESQRFSNNGEKRSTADAFVYPDEE 577 Query: 624 XXXFLRSLGWEENT-EEGGLTDEEINAFYKDVTKYINSKPSWKILQG---MPRFLLALES 457 FLRSLGW+EN EE GLT+EEI+AFY++ Y+ +PS ++ QG + L LES Sbjct: 578 EAAFLRSLGWDENAGEEEGLTEEEISAFYRE---YMKVRPSSRLCQGAQQQTKVPLPLES 634 Query: 456 QIG 448 +G Sbjct: 635 HVG 637 >EOX97398.1 Uncharacterized protein TCM_006425 isoform 1 [Theobroma cacao] Length = 625 Score = 315 bits (808), Expect = 8e-93 Identities = 223/621 (35%), Positives = 339/621 (54%), Gaps = 21/621 (3%) Frame = -2 Query: 2283 MMDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSSSTNDQNLGR 2104 +M+++EP+LVP+WL+ SD++ + R S D ++G Sbjct: 5 VMERSEPSLVPEWLKSGGSVTGSGNSNHQFTSSSLHSDNHSALRPTRNKLSVAGDHDVGG 64 Query: 2103 SSASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRR 1924 +S +R S +FRRS S+NGS +LR+ + F K H DRD + + +++++++ D R R Sbjct: 65 TSVLDRTTSAYFRRSSSSNGSAHLRSYSSFTKGHRDRDWDKDINGYHDREKSVISDHRNR 124 Query: 1923 DFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNANRASDNV 1744 +FSDSL N E+ L RS S I+ +R +T P+ V + S ++ K +H+++N V Sbjct: 125 NFSDSLDNMLPSVFEKDVLWRSQS-ITGKRSDTWPKKVTSDSSTSNKSNHSSSNGLLSGV 183 Query: 1743 ---LLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTSAL 1579 + +K+ FE++FP L EE+ ASEI RV SP L+ + + PV SA+ GSD WTSAL Sbjct: 184 
STTVGNKSVFEREFPVLGAEERQVASEIGRVSSPGLSTAGQSLPVGTSAISGSDGWTSAL 243 Query: 1578 AEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTMDSSISTALSMAET 1399 A++P V ++GTGV+ V+++ + S +M S+ T L+MAET Sbjct: 244 ADMPAGVGSSGTGVA-------------------VASQNVSASSASMASTTMTGLNMAET 284 Query: 1398 VAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIP-VTPSAPKALVPNPPEKPKAKTGQPH 1225 + P R +T P L+ +QR +ELAIKQSRQL+P VT S PK LV +P EK K K GQ Sbjct: 285 LVQGPSRARTPPLLNVGTQRLEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPKVGQQQ 344 Query: 1224 ----LIHSPRGGSAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKRDSRTVCSVP 1057 ++ RGG+++ D K S+ G+L++LKP RE NG SL DNLSP S + + P Sbjct: 345 HASLSLNYTRGGTSRSDSLKVSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSSKLVNSP 404 Query: 1056 -AATHSVAGSSSVRAQVNNSGHPAAERKHLLPLLEKRAASQSQTQSRNDFFSLMRKKSIS 880 + T S + S+ R+ N+ AER + +Q QSRNDFF+L++KKS + Sbjct: 405 LSVTPSASASAPFRSSGNSPSFATAERNQTPFRINIEKRPTAQAQSRNDFFNLLKKKSTT 464 Query: 879 NSSPA-----PESGPVISASDKKLGEGEDAALPVTDHSGKVGVLANIHSIKSLEYTNAK- 718 NS + P + P +S +LG EDA+ VT G V ++ SI L N Sbjct: 465 NSPSSVADRGPAASPSVSEKSDELGT-EDASTSVTLQGGSVP--SSEISIADLPTDNRSE 521 Query: 717 -VSDGHSYNEGESLNSK-KNGSTCNXXXXXXXXXXXFLRSLGWEENT-EEGGLTDEEINA 547 +G +Y+ + +S + + FLRSLGWEEN ++ GLT+EEI+A Sbjct: 522 ITHNGDAYSGSQQCSSNGDRHARPDAFLYPDEEEAAFLRSLGWEENAGDDEGLTEEEISA 581 Query: 546 FYKDVTKYINSKPSWKILQGM 484 F+++ ++ KPS K+ M Sbjct: 582 FFEE---HMKLKPSAKLFHRM 599 >EOX97399.1 Uncharacterized protein TCM_006425 isoform 2 [Theobroma cacao] Length = 620 Score = 315 bits (807), Expect = 1e-92 Identities = 223/620 (35%), Positives = 338/620 (54%), Gaps = 21/620 (3%) Frame = -2 Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSSSTNDQNLGRS 2101 M+++EP+LVP+WL+ SD++ + R S D ++G + Sbjct: 1 MERSEPSLVPEWLKSGGSVTGSGNSNHQFTSSSLHSDNHSALRPTRNKLSVAGDHDVGGT 60 Query: 2100 SASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRRD 1921 S +R S +FRRS S+NGS +LR+ + F K H DRD + + +++++++ D R R+ Sbjct: 61 SVLDRTTSAYFRRSSSSNGSAHLRSYSSFTKGHRDRDWDKDINGYHDREKSVISDHRNRN 120 Query: 1920 
FSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNANRASDNV- 1744 FSDSL N E+ L RS S I+ +R +T P+ V + S ++ K +H+++N V Sbjct: 121 FSDSLDNMLPSVFEKDVLWRSQS-ITGKRSDTWPKKVTSDSSTSNKSNHSSSNGLLSGVS 179 Query: 1743 --LLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTSALA 1576 + +K+ FE++FP L EE+ ASEI RV SP L+ + + PV SA+ GSD WTSALA Sbjct: 180 TTVGNKSVFEREFPVLGAEERQVASEIGRVSSPGLSTAGQSLPVGTSAISGSDGWTSALA 239 Query: 1575 EVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTMDSSISTALSMAETV 1396 ++P V ++GTGV+ V+++ + S +M S+ T L+MAET+ Sbjct: 240 DMPAGVGSSGTGVA-------------------VASQNVSASSASMASTTMTGLNMAETL 280 Query: 1395 AHLPPRVKTTPQLSTESQR-QELAIKQSRQLIP-VTPSAPKALVPNPPEKPKAKTGQPH- 1225 P R +T P L+ +QR +ELAIKQSRQL+P VT S PK LV +P EK K K GQ Sbjct: 281 VQGPSRARTPPLLNVGTQRLEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPKVGQQQH 340 Query: 1224 ---LIHSPRGGSAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKRDSRTVCSVP- 1057 ++ RGG+++ D K S+ G+L++LKP RE NG SL DNLSP S + + P Sbjct: 341 ASLSLNYTRGGTSRSDSLKVSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSSKLVNSPL 400 Query: 1056 AATHSVAGSSSVRAQVNNSGHPAAERKHLLPLLEKRAASQSQTQSRNDFFSLMRKKSISN 877 + T S + S+ R+ N+ AER + +Q QSRNDFF+L++KKS +N Sbjct: 401 SVTPSASASAPFRSSGNSPSFATAERNQTPFRINIEKRPTAQAQSRNDFFNLLKKKSTTN 460 Query: 876 SSPA-----PESGPVISASDKKLGEGEDAALPVTDHSGKVGVLANIHSIKSLEYTNAK-- 718 S + P + P +S +LG EDA+ VT G V ++ SI L N Sbjct: 461 SPSSVADRGPAASPSVSEKSDELGT-EDASTSVTLQGGSVP--SSEISIADLPTDNRSEI 517 Query: 717 VSDGHSYNEGESLNSK-KNGSTCNXXXXXXXXXXXFLRSLGWEENT-EEGGLTDEEINAF 544 +G +Y+ + +S + + FLRSLGWEEN ++ GLT+EEI+AF Sbjct: 518 THNGDAYSGSQQCSSNGDRHARPDAFLYPDEEEAAFLRSLGWEENAGDDEGLTEEEISAF 577 Query: 543 YKDVTKYINSKPSWKILQGM 484 +++ ++ KPS K+ M Sbjct: 578 FEE---HMKLKPSAKLFHRM 594 >XP_010245093.1 PREDICTED: uncharacterized protein LOC104588732 isoform X2 [Nelumbo nucifera] Length = 616 Score = 314 bits (804), Expect = 2e-92 Identities = 244/654 (37%), Positives = 341/654 (52%), Gaps = 43/654 (6%) Frame = -2 Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSSSTNDQNLGRS 2101 M K+EPTLVP+WL+ 
GT + + ST S Sbjct: 1 MAKSEPTLVPEWLK-------------------------GTGGIT--GAGSTTHHFASSS 33 Query: 2100 SASERVKSPHFRRSFSNNGSV--------NLRTSNIFCKNHHDRDSNDSVYESGNKDRTI 1945 S+R S + RRS S+NGS+ R+ + F ++H DRD + + +K+R++ Sbjct: 34 LQSDRTSSAYSRRSSSSNGSIVHDKEIPSYTRSYSAFARSHRDRDWEKDILDFRDKERSV 93 Query: 1944 LEDFRRRDFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNA 1765 D R DFSD L + + R+E+ +LRRS S++S +R E PR V A +++ N Sbjct: 94 PGDHRDLDFSDPLVSILTSRIEKDTLRRSQSMVSGKRGEVWPRKVAAD------LNNGNI 147 Query: 1764 NRASDNVLL---------HKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSA 1618 N+ + N LL KA+FE+DFPSL EEK +I RV SP L+ ++ + P+ + Sbjct: 148 NQNTSNGLLVGGSIVSSIQKAAFERDFPSLGAEEKPGTPDIGRVSSPGLSSAVQSLPMGS 207 Query: 1617 SAMIGSDKWTSALAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTM 1438 SA+IG D WTSALAEVP ++ NGTG+S+ A T G+ S ++ Sbjct: 208 SALIGGDGWTSALAEVPMIIGNNGTGISSVQQA---------TLGSSASGATNS------ 252 Query: 1437 DSSISTALSMAETVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNP 1261 ST L+MAET+A P R + +PQLS E+QR +ELAIKQSRQLIP+TPS PK V N Sbjct: 253 ----STGLNMAETLAQAPSRARISPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSVLNS 308 Query: 1260 PEKPK------------AKTGQPHLIHSPRGGSAKPDVSKTSSIGKLQVLKPVRERNGDS 1117 EK K KT Q + S RG + DVSKTS GKL VLK RE+NG S Sbjct: 309 LEKAKPKISVRTGEMNATKTIQQQQLSSLRGAPMRSDVSKTSHGGKLLVLKAPREKNGIS 368 Query: 1116 LPPMDNLSPKRDSRTVCSVPAATHSVAGS---SSVRAQVNNSGHPAAERKHLLPLLEKRA 946 D SP S+ + A S A + S ++++N AA +EKR Sbjct: 369 PIAKDGQSPTNVSKVANNPLALAPSAAFTPLKSPNNSKLSNERKSAAASLMHGSSVEKRP 428 Query: 945 ASQSQTQSRNDFFSLMRKKSISN-SSPAPESGPVISAS--DKKLGEGEDAALPVTDHSGK 775 + SQ QSRNDFF+LMRKK+ N SS AP+ PV+S+S DK + A PV+ S Sbjct: 429 TT-SQVQSRNDFFNLMRKKTSGNLSSAAPDPSPVVSSSLLDKSTEQTALPAAPVSPQSSD 487 Query: 774 VGVLANIHSIKSLEYTNAKVSDGHSYNEGES-LNSKKNGSTCNXXXXXXXXXXXFLRSLG 598 S E + +S+G++ E + LN+ + S+ + FLRSLG Sbjct: 488 APSPDPSCLDWSTENGSETISNGNASEESQRFLNNGEKHSSPDAFVYPDEEEAAFLRSLG 547 Query: 597 WEENT-EEGGLTDEEINAFYKDVTKYINSKPSWKILQG---MPRFLLALESQIG 448 W+EN EE GLT+EEI+AFYK+ Y+ +PS K+ +G + + LES++G Sbjct: 548 
WDENAGEEEGLTEEEISAFYKE---YMKLRPSSKLCRGSQQQVKLPMPLESRVG 598 >XP_007041568.2 PREDICTED: mucin-17 [Theobroma cacao] Length = 620 Score = 314 bits (804), Expect = 3e-92 Identities = 223/620 (35%), Positives = 336/620 (54%), Gaps = 21/620 (3%) Frame = -2 Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSSSTNDQNLGRS 2101 M+++EP+LVP+WL+ SD++ + R S D ++G + Sbjct: 1 MERSEPSLVPEWLKSGGSVTGSGNSNHQFTSSSLHSDNHSALRPARNKLSVAGDHDVGGT 60 Query: 2100 SASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRRD 1921 S +R S +FRRS S+NGSV+LR+ + F K H DRD + + +++++++ D R R+ Sbjct: 61 SVLDRTTSAYFRRSSSSNGSVHLRSYSSFTKGHRDRDWDKDINGYHDREKSVISDHRNRN 120 Query: 1920 FSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNANRASDNV- 1744 FSDSL N E+ L RS S I+ +R +T P+ V + S ++ K +H++ N V Sbjct: 121 FSDSLDNMLPSVFEKDVLWRSQS-ITGKRSDTWPKKVTSDSSTSNKSNHSSGNGLLSGVS 179 Query: 1743 --LLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTSALA 1576 + +K++FE++FP L EE+ SEI RV SP L+ + + PV SA+ GSD WTSALA Sbjct: 180 TTVGNKSAFEREFPVLGAEERQVGSEIGRVSSPGLSTAGQSLPVGTSAISGSDGWTSALA 239 Query: 1575 EVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTMDSSISTALSMAETV 1396 ++P V ++GTGV+ V+++ + S +M S+ T L+MAET+ Sbjct: 240 DMPAGVGSSGTGVA-------------------VASQNVSASSASMASTTMTGLNMAETL 280 Query: 1395 AHLPPRVKTTPQLSTESQR-QELAIKQSRQLIP-VTPSAPKALVPNPPEKPKAKTGQPH- 1225 P R +T P L+ +QR +ELAIKQSRQL+P VT S PK LV +P EK K K GQ Sbjct: 281 VQGPSRARTPPLLNVGTQRLEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPKVGQQQH 340 Query: 1224 ---LIHSPRGGSAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKRDSRTVCSVPA 1054 ++ RGG+++ D K S+ G+L++LKP RE NG SL DNLSP S + + P Sbjct: 341 ASLSLNYTRGGTSRSDSLKVSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSSKLVNSPL 400 Query: 1053 -ATHSVAGSSSVRAQVNNSGHPAAERKHLLPLLEKRAASQSQTQSRNDFFSLMRKKSISN 877 T S + S+ R+ N+ AER + +Q QSRNDFF+L++KKS +N Sbjct: 401 NVTPSASASAPFRSSGNSPSFATAERNQTPFRINIEKRPTAQAQSRNDFFNLLKKKSTTN 460 Query: 876 SSPA-----PESGPVISASDKKLGEGEDAALPVTDHSGKVGVLANIHSIKSLEYTNAK-- 718 S + P + P +S +LG EDA+ VT G V ++ SI L N Sbjct: 461 
SPSSVADRGPAASPSVSEKSDELGT-EDASTSVTLQGGSVP--SSEISIADLPTDNRSEI 517

Query: 717 VSDGHSYNEGESLNSK-KNGSTCNXXXXXXXXXXXFLRSLGWEENT-EEGGLTDEEINAF 544
             +G +Y   +  +S     +  +           FLRSLGWEEN  ++ GLT+EEI+AF
Sbjct: 518 THNGDAYAGSQQCSSNGDRHARPDAFLYPDEEEAAFLRSLGWEENAGDDEGLTEEEISAF 577

Query: 543 YKDVTKYINSKPSWKILQGM 484
           +++   ++  KPS K+   M
Sbjct: 578 FEE---HMKLKPSAKLFHRM 594