BLASTX nr result
ID: Chrysanthemum21_contig00016919
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00016919
         (1035 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

gb|KVH88963.1| hypothetical protein Ccrd_024216 [Cynara carduncu...   317   e-100
ref|XP_022018640.1| dentin sialophosphoprotein-like [Helianthus ...   188   5e-51
ref|XP_023763540.1| uncharacterized protein LOC111912012 [Lactuc...   145   1e-35
ref|XP_021300736.1| LOW QUALITY PROTEIN: uncharacterized protein...    90   7e-16
ref|XP_007049103.2| PREDICTED: uncharacterized protein LOC186123...    83   1e-13
gb|EOX93260.1| Uncharacterized protein TCM_002115 isoform 1 [The...    81   5e-13
ref|XP_017257895.1| PREDICTED: probable cyclin-dependent serine/...    79   2e-12
ref|XP_024183551.1| uncharacterized threonine-rich GPI-anchored ...    79   3e-12
gb|EOX93261.1| Uncharacterized protein TCM_002115 isoform 2, par...    79   4e-12
gb|KZM91478.1| hypothetical protein DCAR_021157 [Daucus carota s...    74   8e-11
ref|XP_003553208.1| PREDICTED: uncharacterized protein LOC100811...    74   9e-11
gb|OMO84821.1| hypothetical protein COLO4_21830 [Corchorus olito...    74   9e-11
gb|KHN15602.1| hypothetical protein glysoja_034995 [Glycine soja]      74   1e-10
ref|XP_019162949.1| PREDICTED: uncharacterized protein LOC109159...    74   2e-10
ref|XP_019162948.1| PREDICTED: uncharacterized protein LOC109159...    74   2e-10
ref|XP_006585997.1| PREDICTED: mucin-21-like isoform X2 [Glycine...    72   6e-10
ref|XP_003530674.1| PREDICTED: mucin-21-like isoform X1 [Glycine...    72   7e-10
gb|KHN13472.1| hypothetical protein glysoja_029573 [Glycine soja]      70   2e-09
ref|XP_009347148.2| PREDICTED: uncharacterized protein LOC103938...    70   3e-09
ref|XP_022759061.1| uncharacterized protein LOC111305625 [Durio ...    69   4e-09

>gb|KVH88963.1| hypothetical protein Ccrd_024216 [Cynara cardunculus var. scolymus] Length = 622 Score = 317 bits (813), Expect = e-100 Identities = 177/314 (56%), Positives = 206/314 (65%), Gaps = 28/314 (8%) Frame = -1 Query: 900 LDNKKTNVDSDPFPSTGEDWAQDDFFANTTSATFQQAEPLDPVVQANDGFSGHSNDHDSK 721 L+N K N DS PSTG+DW DD FAN +SATF QAE L+ V +A DG SGH ND S+ Sbjct: 278 LNNTKLNDDSVTAPSTGKDWIPDDLFANMSSATFPQAEQLESVAEAKDGLSGHQNDISSE 337 Query: 720 GVDEEDWFNDGNWQKNSASNTAVSLQADKLDMFDKHDDGSLQDNLNDKSAEGVDVDWFEN 541 G+D DWFNDG WQ +SA+N AV+ QAD LD+ KH++GS QD ND EGV +DWFEN Sbjct: 338 GIDG-DWFNDGIWQTSSANNAAVAQQADLLDLVAKHNEGSSQDKSNDSFTEGVSIDWFEN 396 Query: 540 TNWQKSATNNTTINKDDNLFDMKPQNDAVSSSSLFNDLIQND--------------SSDP 403 TNW KS NNT +KD+N FD+KPQ DAVSS +L NDLIQND S+ Sbjct: 397 TNWLKSTANNTATDKDENSFDIKPQVDAVSSPTLVNDLIQNDLLYNASSQVSSHTEKSEF 456 Query: 402 DNSKKQQSDATDWFQDSQWAIGASSSTTNVXXXXXXXXXXXXXXXTSSTG---------- 253 DNS K SD TDWFQDSQW GASS+TT + TSSTG Sbjct: 457 DNSNKHYSDTTDWFQDSQWPFGASSATT-MAASKDDDKFDEWNDFTSSTGNQGSFPDSWK 515 Query: 252 ---NERVAAGDKMSELNLFSSTADTRAQEVDFGNFSQSDLFSGSFSNKNPTDTQAGFDIF 82 NE V A +K+SELNLF+ST D + EVDFGNFSQSDLFSGS SNKNP DTQ ++IF Sbjct: 516 QNSNENVIASEKISELNLFTSTTDPK--EVDFGNFSQSDLFSGSSSNKNPNDTQEVYNIF 573 Query: 81 SQVSTASR-NANVE 43 S+VSTASR N+N E Sbjct: 574 SEVSTASRKNSNGE 587
>ref|XP_022018640.1| dentin sialophosphoprotein-like [Helianthus annuus] gb|OTF91069.1| hypothetical protein HannXRQ_Chr16g0506711 [Helianthus annuus] Length = 591 Score = 188 bits (477), Expect = 5e-51 Identities = 143/363 (39%), Positives = 178/363 (49%), Gaps = 24/363 (6%) Frame = -1 Query: 1017 DDLFANTTSATFQQ----AEPLDSVVQANDGFPVVQATFKNEGLDNKKTNVDSDPFPSTG 850 D +F T S F++ ++P + AN P V + E D K N SDPF Sbjct: 248
DAVFVQTESFDFEKPNSASDPFQDDLFAN--MPDVDLG-QTESFDFDKPNNASDPF---- 300 Query: 849 EDWAQDDFFANTTSATFQQAEPLDPVVQANDGFSGHSNDHDSKGVDEEDWFNDGNWQKNS 670 QDD FAN +S TFQQ + LD V+QA D G ND K D +DWF+D NWQK+S Sbjct: 301 ----QDDLFANVSSKTFQQNDQLDSVLQAKDDLPGDRNDSSLKRAD-DDWFSDDNWQKSS 355 Query: 669 ASNTAVSLQADKLDMFDKHDDGSLQDNLNDKSAEGVDVDWFENTNWQKSATNNTTINKDD 490 +T +D S+QDN N S DWFEN TNNT K+D Sbjct: 356 VKSTL--------------NDVSVQDNPNVSS-----TDWFEN-------TNNTATIKED 389 Query: 489 NLFDMKPQNDAVSSSSLFNDLIQNDSSDPDNSKKQQSDATDWFQDSQWAIGASSSTTNV- 313 +LFD+KP + D ++SD DWFQDSQWAIG SSSTT Sbjct: 390 SLFDIKPHAN-------------------DTIPFEKSDVNDWFQDSQWAIGGSSSTTTTN 430 Query: 312 ----XXXXXXXXXXXXXXXTSSTGN-------------ERVAAG--DKMSELNLFSSTAD 190 TSSTGN E+V G +KMSEL+LF S D Sbjct: 431 VVVSNVDDDNDGFGEWNDFTSSTGNQDSVQDSWKESGTEKVDYGSSEKMSELDLFQSAVD 490 Query: 189 TRAQEVDFGNFSQSDLFSGSFSNKNPTDTQAGFDIFSQVSTASRNANVEAENSAERPKND 10 +Q+VDFGNF QSD+FSG +KN T TQ +DIFS++ T +R AN EA N+AE D Sbjct: 491 --SQDVDFGNFMQSDMFSG---DKNTTVTQTVYDIFSELPTGNRIANTEAGNNAEGLNKD 545 Query: 9 EFS 1 E + Sbjct: 546 EIT 548 Score = 67.0 bits (162), Expect = 2e-08 Identities = 62/236 (26%), Positives = 100/236 (42%), Gaps = 14/236 (5%) Frame = -1 Query: 1020 QDDLFANTTSATFQQAEPLDSVVQANDGFPVVQATFKNEGLDN--------KKTNVDS-- 871 QDDLFAN +S TFQQ + LDSV+QA D P + + D+ +K++V S Sbjct: 301 QDDLFANVSSKTFQQNDQLDSVLQAKDDLPGDRNDSSLKRADDDWFSDDNWQKSSVKSTL 360 Query: 870 -DPFPSTGEDWAQDDFFANT-TSATFQQAEPLDPVVQANDGFSGHSNDHDSKGVDEEDWF 697 D + + D+F NT +AT ++ D AND + D DWF Sbjct: 361 NDVSVQDNPNVSSTDWFENTNNTATIKEDSLFDIKPHANDTIPFEKS-------DVNDWF 413 Query: 696 NDGNWQ-KNSASNTAVSLQADKLDMFDKHDDGSLQDNLNDKSAEGVDVDWFENTNWQKSA 520 D W S+S T ++ +D DD ND ++ + D ++ +W++S Sbjct: 414 QDSQWAIGGSSSTTTTNVVVSNVD-----DDNDGFGEWNDFTSSTGNQDSVQD-SWKESG 467 Query: 519 TNNTTINKDDNLFDMKPQNDAVSSSSL-FNDLIQNDSSDPDNSKKQQSDATDWFQD 355 T + + ++ AV S + F + +Q+D D + D F + Sbjct: 468 TEKVDYGSSEKMSELDLFQSAVDSQDVDFGNFMQSDMFSGDKNTTVTQTVYDIFSE 523 >ref|XP_023763540.1| uncharacterized protein LOC111912012 
[Lactuca sativa] gb|PLY85688.1| hypothetical protein LSAT_7X93260 [Lactuca sativa] Length = 540 Score = 145 bits (367), Expect = 1e-35 Identities = 102/283 (36%), Positives = 132/283 (46%), Gaps = 6/283 (2%) Frame = -1 Query: 861 PSTGEDWAQDDFFANTTSATFQQAEPLDPVVQANDGFSGHSNDHDSKGVDEEDWFNDGNW 682 P+ +DW QDD F N TFQQAE LD VV+ ND F H N+ SK VD +DWF+D NW Sbjct: 266 PNEDKDWIQDDLFTNMGPTTFQQAEQLDAVVKPNDEFPAHLNNPSSKDVD-QDWFSDNNW 324 Query: 681 QKNSASNTAVSLQADKLDMFDKHDDGSLQDNLNDKSAEGVDVDWFENTNWQKSA-----T 517 QK+S +N+ QD ND E V VDWFEN NWQKS+ Sbjct: 325 QKSSVNNS--------------------QDKPNDSFTESVSVDWFENANWQKSSGFKKGD 364 Query: 516 NNTTINKDDNLFDMKPQNDAVSSSSLFNDLIQNDSSDPDNSKKQQSD-ATDWFQDSQWAI 340 N IN+ ++ PQ +SS +N S D DN KKQ D +TDWFQ+SQW+ Sbjct: 365 FNPQINESGQDHNVAPQ---ISS--------ENKSLDFDNIKKQALDTSTDWFQESQWST 413 Query: 339 GASSSTTNVXXXXXXXXXXXXXXXTSSTGNERVAAGDKMSELNLFSSTADTRAQEVDFGN 160 G SS+T V D + N F+S+ + + Sbjct: 414 GPSSATNIV----------------------NTKEDDDFDDWNDFTSST---PNQDSYKQ 448 Query: 159 FSQSDLFSGSFSNKNPTDTQAGFDIFSQVSTASRNANVEAENS 31 S D F S+ K ++ D FS+ ST +RN N E N+ Sbjct: 449 SSNQDSFPDSW--KQSSEKIPELDFFSESSTTNRNVNGEGGNN 489 Score = 77.8 bits (190), Expect = 5e-12 Identities = 77/252 (30%), Positives = 108/252 (42%), Gaps = 14/252 (5%) Frame = -1 Query: 1029 DWAQDDLFANTTSATFQQAEPLDSVVQANDGFPVVQATFKNEGLDNKKTNVDSDPFPSTG 850 DW QDDLF N TFQQAE LD+VV+ ND FP N ++ D D Sbjct: 271 DWIQDDLFTNMGPTTFQQAEQLDAVVKPNDEFPAHL---------NNPSSKDVD------ 315 Query: 849 EDWAQDDFFANTTSATFQQAEPLDPVVQANDGFSGHSNDHDSKGVDEEDWFNDGNWQKNS 670 +DW D+ + +S Q +P ND F+ +S V DWF + NWQK+S Sbjct: 316 QDWFSDNNW-QKSSVNNSQDKP-------NDSFT------ESVSV---DWFENANWQKSS 358 Query: 669 ASNTA-------VSLQADKLDMFDKHDDGSLQDNLNDKSAEGVDVDWFENTNWQ---KSA 520 S Q + ++ SL + K A DWF+ + W SA Sbjct: 359 GFKKGDFNPQINESGQDHNVAPQISSENKSLDFDNIKKQALDTSTDWFQESQWSTGPSSA 418 Query: 519 TNNTTINKDDNLFDMKPQNDAVSSSSLFNDLIQNDSSD--PDNSKK--QQSDATDWFQDS 352 TN +DD+ D ND SS+ + Q+ + D PD+ K+ ++ D+F + Sbjct: 419 
TNIVNTKEDDDFDDW---NDFTSSTPNQDSYKQSSNQDSFPDSWKQSSEKIPELDFFSE- 474 Query: 351 QWAIGASSSTTN 316 SSTTN Sbjct: 475 -------SSTTN 479 >ref|XP_021300736.1| LOW QUALITY PROTEIN: uncharacterized protein LOC110429163, partial [Herrania umbratica] Length = 768 Score = 89.7 bits (221), Expect = 7e-16 Identities = 101/380 (26%), Positives = 159/380 (41%), Gaps = 37/380 (9%) Frame = -1 Query: 1029 DWAQDDLFANTTSATFQQAEPLD-SVVQANDGF------PVVQ--------ATFKNEGLD 895 +W QDDL++N+TS T AE D +V +DG PV T N+ D Sbjct: 382 NWFQDDLWSNSTSGTVHHAEQSDLNVGNKDDGMLGNTKSPVSVNGIEDDQWPTSSNKAAD 441 Query: 894 NKKTNVDSDPFPS----TGEDWAQDDFFANTTSATFQQAEPLDPVVQANDGFSGHSNDHD 727 ++ + D D F + G W DF + ++ + ++ DP+V ++ S H + Sbjct: 442 DRTNDEDDDSFGAWNDFKGSRW-DTDFQSASSKNHHEGSKSFDPLVGSSVDLSDHMDTVF 500 Query: 726 SKGVDEED--------------WFNDGNWQKNSASNTAVSLQADKLDM-FDKHDDGSLQD 592 + G D D WF D W S S + V+ QA+ D D D G+ Q Sbjct: 501 ASGKDFVDGKVKDGSNVSNTNNWFQDDLW---SNSTSKVTRQAENFDATIDVMDSGTAQS 557 Query: 591 NLNDKSAEGVDVDWFENTNW---QKSATNNTTINKDDNLFDMKPQNDAVSSSSLFNDLIQ 421 N S ++VDWF + W A + ++K DN F ND SS+++ D Sbjct: 558 MHNSPS---MNVDWFPDDQWLTGNNKAPDRKAVDKSDNSFG--DWNDFKSSTTM-QDAFS 611 Query: 420 NDSSDPDNSKKQQSDATDWFQDSQWAIGASSSTTNVXXXXXXXXXXXXXXXTSSTGNERV 241 + S K D D + W SS + N + +E+ Sbjct: 612 DPSKQAARPDKMTIDDDDDLSGA-WNDFTSSISAN---------DPSSMSFKHTVNHEKP 661 Query: 240 AAGDKMSELNLFSSTADTRAQEVDFGNFSQSDLFSGSFSNKNPTDTQAGFDIFSQVSTAS 61 + G SE+++F D+ + + + GN SQ DLFS SF N+N + + + VS+ Sbjct: 662 SIGT--SEMHVFGM--DSNSHDNNSGNLSQPDLFSRSFGNQNGS-------VEAPVSSRM 710 Query: 60 RNANVEAENSAERPKNDEFS 1 +A+V ++ E KN FS Sbjct: 711 ADASVRGGSNTEVAKNGGFS 730 >ref|XP_007049103.2| PREDICTED: uncharacterized protein LOC18612309 [Theobroma cacao] Length = 864 Score = 83.2 bits (204), Expect = 1e-13 Identities = 108/411 (26%), Positives = 167/411 (40%), Gaps = 68/411 (16%) Frame = -1 Query: 1029 DWAQDDLFANTTSATFQQAEPLD-SVVQANDGF------PV--------VQATFKNEGLD 895 +W QDDL++N+TS T AE D +V +DG PV T N+ +D Sbjct: 448 
NWFQDDLWSNSTSGTVHHAEQSDLNVGNKDDGMLGNTKSPVSVNGIEDDQWPTSSNKAVD 507 Query: 894 NKKTNVDSDPF-------------------------PSTGEDWAQD-------DFFANTT 811 + + D D F S+ E+ + D DF + ++ Sbjct: 508 DGTNDEDDDSFGAWNDFKGSSAWGSSISSWKEPANCSSSTEEKSSDPFSGWDTDFQSASS 567 Query: 810 SATFQQAEPLDPVVQANDGFSGHSNDHDSKGVDEED--------------WFNDGNWQKN 673 + ++ DP+V ++ S H + + G D D WF D W Sbjct: 568 TNHNDSSKSFDPLVGSSIDLSDHMDTVFASGKDFVDGKAKDGSNVSSTNNWFQDDLW--- 624 Query: 672 SASNTAVSLQADKLD-MFDKHDDGSLQDNLNDKSAEGVDVDWFENTNW---QKSATNNTT 505 S S + V+ QA+ D D D G+ Q N S ++VDWF + W A + Sbjct: 625 SNSTSKVTCQAENFDATIDVMDSGAAQSMHNSPS---MNVDWFPDDQWLTGNNKAPDRKN 681 Query: 504 INKDDNLFDMKPQNDAVSSSSLFNDLIQNDSSDPDNSKKQQSDATDWFQD---SQWAIGA 334 ++K DN F + ND SS+++ Q+ SDP + T D + W Sbjct: 682 VDKSDNSF--REWNDFKSSTTM-----QDAFSDPSKQAARPDKITIDDNDDLSAAWNDFT 734 Query: 333 SSSTTNVXXXXXXXXXXXXXXXTSSTGNERVAAGDKMSELNLFSSTADTRAQEVDFGNFS 154 SS + N + +E+ + G SE++ FS D+ + + + GN S Sbjct: 735 SSISAN---------DPSSISFKHTVNHEKPSIG--TSEIHFFS--MDSNSHDNNSGNLS 781 Query: 153 QSDLFSGSFSNKNPTDTQAGFDIFSQVSTASRNANVEAENSAERPKNDEFS 1 Q DLFS SFSN+N + T+A VS +A+V ++AE KN FS Sbjct: 782 QPDLFSRSFSNQNGS-TEA------PVSNRMADASVRGGSNAEVAKNGGFS 825 >gb|EOX93260.1| Uncharacterized protein TCM_002115 isoform 1 [Theobroma cacao] Length = 864 Score = 81.3 bits (199), Expect = 5e-13 Identities = 107/411 (26%), Positives = 166/411 (40%), Gaps = 68/411 (16%) Frame = -1 Query: 1029 DWAQDDLFANTTSATFQQAEPLD-SVVQANDGF------PV--------VQATFKNEGLD 895 +W QDDL++N+TS T AE D +V +DG PV T N+ +D Sbjct: 448 NWFQDDLWSNSTSGTVHHAEQSDLNVGNKDDGMLGNTKSPVSVNGIEDDQWPTSSNKAVD 507 Query: 894 NKKTNVDSDPF-------------------------PSTGEDWAQD-------DFFANTT 811 + + D D F S+ E+ + D DF + ++ Sbjct: 508 DGTNDEDDDSFGAWNDFKGSSAWGSSISSWKEPANCSSSTEEKSSDPFSGWDTDFQSASS 567 Query: 810 SATFQQAEPLDPVVQANDGFSGHSNDHDSKGVDEED--------------WFNDGNWQKN 673 + ++ DP+V ++ S H + + G D D WF D W Sbjct: 568 TNHNDSSKSFDPLVGSSIDLSDHMDTVFASGKDFVDGKAKDGSNVSSTNNWFQDDLW--- 624 Query: 672 
SASNTAVSLQADKLD-MFDKHDDGSLQDNLNDKSAEGVDVDWFENTNW---QKSATNNTT 505 S S + V+ QA+ D D D G+ Q N S ++VDWF + W A + Sbjct: 625 SNSTSKVTCQAENFDATIDVMDSGAAQSMHNSPS---MNVDWFPDDQWLTGNNKAPDRKN 681 Query: 504 INKDDNLFDMKPQNDAVSSSSLFNDLIQNDSSDPDNSKKQQSDATDWFQD---SQWAIGA 334 ++K DN F + ND SS+++ Q+ SDP + T D + W Sbjct: 682 VDKSDNSF--REWNDFKSSTTM-----QDAFSDPSKQAARPDKITIDDNDDLSAAWNDFT 734 Query: 333 SSSTTNVXXXXXXXXXXXXXXXTSSTGNERVAAGDKMSELNLFSSTADTRAQEVDFGNFS 154 SS + N + +E+ + G SE++ FS D+ + + + GN S Sbjct: 735 SSISAN---------DPSSISFKHTVNHEKPSIG--TSEIHFFS--MDSNSHDNNSGNLS 781 Query: 153 QSDLFSGSFSNKNPTDTQAGFDIFSQVSTASRNANVEAENSAERPKNDEFS 1 Q DLF SFSN+N + T+A VS +A+V ++AE KN FS Sbjct: 782 QPDLFPRSFSNQNGS-TEA------PVSNRMADASVRGGSNAEVAKNGGFS 825 >ref|XP_017257895.1| PREDICTED: probable cyclin-dependent serine/threonine-protein kinase DDB_G0292550 [Daucus carota subsp. sativus] Length = 1105 Score = 79.3 bits (194), Expect = 2e-12 Identities = 100/389 (25%), Positives = 151/389 (38%), Gaps = 53/389 (13%) Frame = -1 Query: 1026 WAQDDLFANTTSATFQQAEPLDSVVQAN-DGFPVVQATF-KNEGLDNKKTNVDSDPFPST 853 WA D AN + ++ DS V N D + + F + ++ +K + D P S Sbjct: 245 WAADFQSANKE----ESSKSYDSFVAPNIDLSSHIDSVFGAGKDVNRRKLSDDLQPAQSA 300 Query: 852 GEDWAQDDFFANTTSATFQQAEPLDPVVQANDGFSGHSNDHDSKGVDEEDWFNDGNWQKN 673 DW QDD + N S QQA + S +++ + + DWF D QKN Sbjct: 301 SHDWMQDDIWKNVDSKVSQQASHFSSTAETTIAVS--PDNYKNTASEGADWFEDDQRQKN 358 Query: 672 SASNTAVSLQADKLDMFDKHDDGSLQDN------------LNDKSAEGVDVDWFE----- 544 S + + D FD +D + N + DK E D DW + Sbjct: 359 ITSEPGNKIIDNPDDSFDDWNDFASSSNAVNLSGDEPSSKVIDKLDESFD-DWNDFASSS 417 Query: 543 -NTNWQKSATNNTTINKDDNLFDMKPQNDAVSSSSLFNDLIQNDSSD----PDNS----- 394 N +N I+K D+ FD ND SSS N S+ PD+S Sbjct: 418 NIANLSGGEPSNKVIDKPDDAFD--DWNDFASSSDAVNISSDEPSNKIIDRPDDSFNGWN 475 Query: 393 -------------KKQQSDATDWFQDS--QWAIGASSSTTNVXXXXXXXXXXXXXXXTSS 259 + +S D F DS W AS+S + Sbjct: 476 EFAASSNDVNHSGNEPRSKTIDKFDDSFDDWNDFASTSN----------YLDLSGNALRN 525 Query: 258 
TGNERVAAGDKMSELNLFSSTADTRAQEVDFGNFSQSDLFS---GSFSNKNPTDTQAGFD 88 +++ + ++ SE NLFSST +T Q+ + G+FSQ +LFS G + TD + Sbjct: 526 NNHQKAVSSEQTSEKNLFSSTGNT--QDDELGSFSQPNLFSALPGGYDGDAVTD-----N 578 Query: 87 IFSQVSTASRNANVEA------ENSAERP 19 I +V + RN N ++ E++AE P Sbjct: 579 IHKEVYASERNNNEQSHVKELPEDTAEPP 607 Score = 66.6 bits (161), Expect = 3e-08 Identities = 73/278 (26%), Positives = 110/278 (39%), Gaps = 5/278 (1%) Frame = -1 Query: 849 EDWAQDDFFANTTSATFQQAEPLDPVVQANDGFS-GHSNDHDSKGVDEEDWFNDGNWQKN 673 +DW + D + N S Q + PL +A DG S ++ + S+GVD WF D W K+ Sbjct: 794 DDWMRGDLWKNLDSDVSQHSVPLGVTAEATDGISQNNAKNPASQGVD---WFIDNQWHKD 850 Query: 672 SASNTAVSLQADKLDMFDKHDDGSLQDNLNDKSAEGVDVDWFENTNWQKSATNNTTINKD 493 S + + DKHDD S D+ D ++ + V N +N I K Sbjct: 851 ITSEPSNKI-------IDKHDDSS--DDWTDFASSSIVV------NHSGEVPSNKIIGKP 895 Query: 492 DNLFDMKPQNDAVSSSSLFN---DLIQNDSSD-PDNSKKQQSDATDWFQDSQWAIGASSS 325 + FD ND SSS+ N D+ N SD PDNS +D F S A+ S Sbjct: 896 EGSFD--DWNDFASSSNAVNPRGDIPSNKISDEPDNSFDDWND----FASSNNAVKLSGY 949 Query: 324 TTNVXXXXXXXXXXXXXXXTSSTGNERVAAGDKMSELNLFSSTADTRAQEVDFGNFSQSD 145 + +S+ N +NLF + + +F + Sbjct: 950 EPSNKVIDKQNDSSDDWNDFASSSNA----------INLFEDKYSNKMIDEPDDSFDDWN 999 Query: 144 LFSGSFSNKNPTDTQAGFDIFSQVSTASRNANVEAENS 31 F+ S NP+ Q G I ++ +S + N A +S Sbjct: 1000 EFASSSIAANPSGDQPGSKIINKPDNSSDDWNDFASSS 1037 >ref|XP_024183551.1| uncharacterized threonine-rich GPI-anchored glycoprotein PJ4664.02-like [Rosa chinensis] ref|XP_024183552.1| uncharacterized threonine-rich GPI-anchored glycoprotein PJ4664.02-like [Rosa chinensis] ref|XP_024183553.1| uncharacterized threonine-rich GPI-anchored glycoprotein PJ4664.02-like [Rosa chinensis] gb|PRQ52732.1| hypothetical protein RchiOBHm_Chr2g0158651 [Rosa chinensis] Length = 649 Score = 78.6 bits (192), Expect = 3e-12 Identities = 70/246 (28%), Positives = 110/246 (44%), Gaps = 18/246 (7%) Frame = -1 Query: 798 QQAEPLDPVVQANDGFSGHSND-----HDSKGVDEE-------DWFNDGNWQKNSASNTA 655 Q+++ LDP V + S H + DS V DWF+D S SN+ Sbjct: 337 
QESKSLDPFVGSTVDLSAHIDTVFGSVGDSTNVKSNHSTSTSNDWFSD---DLLSISNSG 393 Query: 654 VSLQADKLDMFDKHDDGSLQDNLNDKSAEGVDVDWFENTNWQ---KSATNNTTINKDDNL 484 ++ Q +L+ D + +N N+ + GVD W E+T WQ K A +NTT ++DD+ Sbjct: 394 LAGQPQQLESLSTVKDDRIAENANNLLSTGVD--WVEDTQWQTTSKEAPDNTTADEDDDS 451 Query: 483 FDMKPQNDAVSSSSLFNDLIQNDSSDPDNSKKQQSDATDWFQDSQWAIGASSSTTNVXXX 304 F ND SSSS N + + + ++ T+ F + +SS + Sbjct: 452 FGA--WNDFTSSSSAQNPSSSSKQTVDQTTAPDKNSVTNLFSTA-----SSSQDDDSFGA 504 Query: 303 XXXXXXXXXXXXTSSTGNERV---AAGDKMSELNLFSSTADTRAQEVDFGNFSQSDLFSG 133 SS+ + V D+ S NLFS+ ++ Q++DFGNFSQ DL +G Sbjct: 505 WNDFTSLSSAQNGSSSSKQTVDQMTPADETSVTNLFSTACNS--QDLDFGNFSQPDLSAG 562 Query: 132 SFSNKN 115 S+ + Sbjct: 563 EISSSH 568 >gb|EOX93261.1| Uncharacterized protein TCM_002115 isoform 2, partial [Theobroma cacao] Length = 826 Score = 78.6 bits (192), Expect = 4e-12 Identities = 105/411 (25%), Positives = 166/411 (40%), Gaps = 68/411 (16%) Frame = -1 Query: 1029 DWAQDDLFANTTSATFQQAEPLD-SVVQANDGF------PV--------VQATFKNEGLD 895 +W QDDL++N+TS T AE D +V +DG PV T N+ +D Sbjct: 409 NWFQDDLWSNSTSGTVHHAEQSDLNVGNKDDGMLGNTKSPVSVNGIEDDQWPTSSNKAVD 468 Query: 894 NKKTNVDSDPF-------------------------PSTGEDWAQD-------DFFANTT 811 + + D D F S+ E+ + D DF + ++ Sbjct: 469 DGTNDEDDDSFGAWNDFKGSSAWGSSISSWKEPANCSSSTEEKSSDPFSGWDTDFQSASS 528 Query: 810 SATFQQAEPLDPVVQANDGFSGHSNDHDSKGVDEED--------------WFNDGNWQKN 673 + ++ DP+V ++ S H + + G D D WF D W Sbjct: 529 TNHNDSSKSFDPLVGSSIDLSDHMDTVFASGKDFVDGKAKDGSNVSSTNNWFQDDLW--- 585 Query: 672 SASNTAVSLQADKLD-MFDKHDDGSLQDNLNDKSAEGVDVDWFENTNW---QKSATNNTT 505 S S + V+ QA+ D D D G+ Q N S ++VDWF + W A + Sbjct: 586 SNSTSKVTCQAENFDATIDVMDSGAAQSMHNSPS---MNVDWFPDDQWLTGNNKAPDRKN 642 Query: 504 INKDDNLFDMKPQNDAVSSSSLFNDLIQNDSSDPDNSKKQQSDATDWFQD---SQWAIGA 334 ++K DN F + ND SS+++ Q+ SDP + T D + W Sbjct: 643 VDKSDNSF--REWNDFKSSTTM-----QDAFSDPSKQAARPDKITIDDNDDLSAAWNDFT 695 Query: 333 SSSTTNVXXXXXXXXXXXXXXXTSSTGNERVAAGDKMSELNLFSSTADTRAQEVDFGNFS 154 SS + N + +E+ + G SE++ FS D+ + + + GN S Sbjct: 696 
SSISAN---------DPSSISFKHTVNHEKPSIG--TSEIHFFS--MDSNSHDNNSGNLS 742 Query: 153 QSDLFSGSFSNKNPTDTQAGFDIFSQVSTASRNANVEAENSAERPKNDEFS 1 Q DLF SFSN+N + T+A ++ +A+V ++AE KN FS Sbjct: 743 QPDLFPRSFSNQNGS-TEAPVS-----NSRMADASVRGGSNAEVAKNGGFS 787 >gb|KZM91478.1| hypothetical protein DCAR_021157 [Daucus carota subsp. sativus] Length = 601 Score = 74.3 bits (181), Expect = 8e-11 Identities = 91/354 (25%), Positives = 135/354 (38%), Gaps = 47/354 (13%) Frame = -1 Query: 1026 WAQDDLFANTTSATFQQAEPLDSVVQAN-DGFPVVQATF-KNEGLDNKKTNVDSDPFPST 853 WA D AN + ++ DS V N D + + F + ++ +K + D P S Sbjct: 245 WAADFQSANKE----ESSKSYDSFVAPNIDLSSHIDSVFGAGKDVNRRKLSDDLQPAQSA 300 Query: 852 GEDWAQDDFFANTTSATFQQAEPLDPVVQANDGFSGHSNDHDSKGVDEEDWFNDGNWQKN 673 DW QDD + N S QQA + S +++ + + DWF D QKN Sbjct: 301 SHDWMQDDIWKNVDSKVSQQASHFSSTAETTIAVS--PDNYKNTASEGADWFEDDQRQKN 358 Query: 672 SASNTAVSLQADKLDMFDKHDDGSLQDN------------LNDKSAEGVDVDWFE----- 544 S + + D FD +D + N + DK E D DW + Sbjct: 359 ITSEPGNKIIDNPDDSFDDWNDFASSSNAVNLSGDEPSSKVIDKLDESFD-DWNDFASSS 417 Query: 543 -NTNWQKSATNNTTINKDDNLFDMKPQNDAVSSSSLFNDLIQNDSSD----PDNS----- 394 N +N I+K D+ FD ND SSS N S+ PD+S Sbjct: 418 NIANLSGGEPSNKVIDKPDDAFD--DWNDFASSSDAVNISSDEPSNKIIDRPDDSFNGWN 475 Query: 393 -------------KKQQSDATDWFQDS--QWAIGASSSTTNVXXXXXXXXXXXXXXXTSS 259 + +S D F DS W AS+S + Sbjct: 476 EFAASSNDVNHSGNEPRSKTIDKFDDSFDDWNDFASTSN----------YLDLSGNALRN 525 Query: 258 TGNERVAAGDKMSELNLFSSTADTRAQEVDFGNFSQSDLFS---GSFSNKNPTD 106 +++ + ++ SE NLFSST +T Q+ + G+FSQ +LFS G + TD Sbjct: 526 NNHQKAVSSEQTSEKNLFSSTGNT--QDDELGSFSQPNLFSALPGGYDGDAVTD 577 >ref|XP_003553208.1| PREDICTED: uncharacterized protein LOC100811805 [Glycine max] gb|KRG99194.1| hypothetical protein GLYMA_18G128300 [Glycine max] Length = 729 Score = 74.3 bits (181), Expect = 9e-11 Identities = 93/354 (26%), Positives = 132/354 (37%), Gaps = 51/354 (14%) Frame = -1 Query: 1029 DWAQDDLFA---NTTSATFQQAEPLDSVVQANDGFPVVQATFKNEGLDNKKTNV------ 877 DW QDDL+ N T+ T AE DS + ND + + N KTN Sbjct: 338 
DWMQDDLWQGSDNKTTDTVATAEDKDSFDEWNDFTGSGSTQDPSSTISNSKTNAQTGNVG 397 Query: 876 ------------DSDPFPSTGEDWAQDDFFANT--TSATFQQAEPLDPVVQANDGFSGHS 739 D++ + DW QD + N T+ T E D ND F+G + Sbjct: 398 YSVDFNVTKTLKDANSSSNKDFDWMQDQWQDNNNKTTNTISANEAADSFDAWND-FTGSA 456 Query: 738 NDHDSKGVDEEDWFNDGNWQKNSASNTAVSLQADKLDMFDKHDDGSLQDNLNDKSAEGVD 559 N S +F SN+ ++ QA K H+D ++ S+ + Sbjct: 457 NTQHS-------YFG--------LSNSEITGQAGKCQFTQDHNDMKTTESATGSSS---N 498 Query: 558 VDWFENTNWQKS---ATNNTTINKDDNLFDMKPQ--NDAVS---SSSLFNDLIQ------ 421 DW ++ Q S AT T N+ + FD A+S SS + N I Sbjct: 499 FDWMQDNQLQGSDNKATGIATTNEVADEFDAWNDFTGSAISQNPSSGVSNSAITAQIGKS 558 Query: 420 ------NDSSDPDNSKKQQSDATDWFQDSQWAIGASSS----TTNVXXXXXXXXXXXXXX 271 ND + + + DW QD QW + S + TTN Sbjct: 559 EITADLNDMKTEEGTNASSHRSFDWMQDDQWQVSNSKTNDTRTTNDIDSFDLWNDFTSLA 618 Query: 270 XTSSTGN----ERVAAGDKMSELNLFSSTADTRAQEVDFGNFSQSDLFSGSFSN 121 T N + V + +K SE NL SS+ + + DF FSQ DLFSG F + Sbjct: 619 STQDHSNNVLKQTVNSAEKTSETNLLSSS--NSSHDKDFSGFSQHDLFSGQFGS 670 >gb|OMO84821.1| hypothetical protein COLO4_21830 [Corchorus olitorius] Length = 852 Score = 74.3 bits (181), Expect = 9e-11 Identities = 94/375 (25%), Positives = 144/375 (38%), Gaps = 72/375 (19%) Frame = -1 Query: 1026 WAQDDLFANTTSATFQQAEPLDSVVQANDG--------FPV--------VQATFKNEGLD 895 W QDDL++N +S T E D + DG F V T N D Sbjct: 437 WFQDDLWSNFSSGTAHHVEQSDVDLVDKDGGMLGNLNNFSVSVNRNKDDQWPTSSNRAAD 496 Query: 894 NKKTNVDSDPF-------------------------PSTGEDWAQD-------DFFANTT 811 N + D D F S+ E+ + D DF + T Sbjct: 497 NGTNDEDDDSFGAWNDFKTSSAADSSISSWKEPANHTSSTEEKSSDPFSRWGTDFQSANT 556 Query: 810 SATFQQAEPLDPVVQANDGFSGH-------------SNDHDSKGVDEEDWFNDGNWQKNS 670 + ++P DP V + S H ++D V +WF D W S Sbjct: 557 KNHHENSKPSDPFVSTSIDLSDHLDTVFTSGKDLVDGKENDGSKVSNSNWFQDDLW---S 613 Query: 669 ASNTAVSLQADKLDMFDKH-DDGSLQDNLNDKSAEGVDVDWFENTNWQKS---ATNNTTI 502 S + V+ Q + LD D G+ Q N S ++V+WF + W S A + T+ Sbjct: 614 HSTSKVTQQPENLDATSNDVDSGTAQSVQNSPS---MNVNWFPDDQWLTSNHKAPDKRTV 670 Query: 501 
NKDDNLFDMKPQNDAVSSSSLFNDLIQN---DSSDPDNSKKQQSDATDWFQDSQWAIGAS 331 ++ D+ FD ND SS+++ D N ++ PD ++D + W S Sbjct: 671 DELDDSFD--DWNDFTSSTTM-QDASSNSWKQATIPDKKTIPENDEL----SAAWNDFTS 723 Query: 330 SSTTNVXXXXXXXXXXXXXXXTSSTGNERVAAGDK----MSELNLFSSTADTRAQEVDFG 163 S++T SS+ +++ +K SE+NLF DT+ FG Sbjct: 724 STSTK---------------DASSSFSKQTVNHEKPSLETSEINLFG--LDTKLNNNSFG 766 Query: 162 NFSQSDLFSGSFSNK 118 + SQ+D FSG+FSN+ Sbjct: 767 SLSQADFFSGAFSNQ 781 >gb|KHN15602.1| hypothetical protein glysoja_034995 [Glycine soja] Length = 729 Score = 73.9 bits (180), Expect = 1e-10 Identities = 93/354 (26%), Positives = 132/354 (37%), Gaps = 51/354 (14%) Frame = -1 Query: 1029 DWAQDDLFA---NTTSATFQQAEPLDSVVQANDGFPVVQATFKNEGLDNKKTNV------ 877 DW QDDL+ N T+ T AE DS + ND + + N KTN Sbjct: 338 DWMQDDLWQGSDNKTTDTVATAEDKDSFDEWNDFTGSGSTQDPSSTISNSKTNAQTGNVG 397 Query: 876 ------------DSDPFPSTGEDWAQDDFFANT--TSATFQQAEPLDPVVQANDGFSGHS 739 D++ + DW QD + N T+ T E D ND F+G + Sbjct: 398 YSVDFNVTKTLKDANSSSNKDFDWMQDQWQDNNNKTTNTISANEAADSFDAWND-FTGSA 456 Query: 738 NDHDSKGVDEEDWFNDGNWQKNSASNTAVSLQADKLDMFDKHDDGSLQDNLNDKSAEGVD 559 N S +F SN+ ++ QA K H+D ++ S+ + Sbjct: 457 NTQHS-------YFG--------LSNSEITGQAGKCQFTQDHNDMKTTESATGSSS---N 498 Query: 558 VDWFENTNWQKS---ATNNTTINKDDNLFDMKPQ--NDAVS---SSSLFNDLIQ------ 421 DW ++ Q S AT T N+ + FD A+S SS + N I Sbjct: 499 FDWMQDNQLQGSDNKATGIATTNEVADEFDAWNDFTGSAISQNPSSGVSNSAITAQTGKS 558 Query: 420 ------NDSSDPDNSKKQQSDATDWFQDSQWAIGASSS----TTNVXXXXXXXXXXXXXX 271 ND + + + DW QD QW + S + TTN Sbjct: 559 EITADLNDMKTEEGTNASSHISFDWMQDDQWQVSNSKTNDTRTTNDIDSFDLWNDFTSLA 618 Query: 270 XTSSTGN----ERVAAGDKMSELNLFSSTADTRAQEVDFGNFSQSDLFSGSFSN 121 T N + V + +K SE NL SS+ + + DF FSQ DLFSG F + Sbjct: 619 STQDHSNNVLKQTVNSAEKTSETNLLSSS--NSSHDKDFSGFSQHDLFSGQFGS 670 >ref|XP_019162949.1| PREDICTED: uncharacterized protein LOC109159297 isoform X2 [Ipomoea nil] Length = 712 Score = 73.6 bits (179), Expect = 2e-10 Identities = 95/343 (27%), Positives = 145/343 (42%), Gaps = 36/343 (10%) Frame = -1 Query: 1017 
DDLFANTTSATFQQAEPLDSVVQAND-------GFPVVQATFKNEGLDNK-KTNVDSDPF 862 DDL N T+ F+Q D V+ ND P++ F+ DN+ TNV S P Sbjct: 323 DDLGNNATTEAFEQKGTFDRKVEVNDSQQQNSVNTPIIDDWFQ----DNQWPTNVVSAPN 378 Query: 861 PSTG--EDWAQDDFFANTTSATFQQAEPLDPVVQANDG-------FSGHSNDHDSKGVDE 709 P+ ++ + DD+ T+S+T + +PL + ND F + ++SK DE Sbjct: 379 PNATNIDEDSFDDWNDFTSSSTVK--DPLGKAITQNDVSTDMDSIFGSGKDLNESKKGDE 436 Query: 708 EDWFNDGNWQKNSASNTAVSLQADKLDMFDKHDDGSLQDNLNDKSAEGVDVDWFENTNWQ 529 F++ + N +NT Q D +D Q+NLN + VDWF++ W Sbjct: 437 SVAFHEVSKWNN--ANTEAFEQKGSFDPMVGVNDSQQQNNLN----APITVDWFQDNQWP 490 Query: 528 ---KSATNNTTINKDDNLFDMKPQNDAVSSSS----LFNDLIQNDSSD--------PDNS 394 SA N N D++ FD ND SSS+ L + QND+ DN+ Sbjct: 491 TNVASAPNPDATNIDEDSFD--DWNDFTSSSTVKDPLGKGITQNDNQADVPFDILVSDNA 548 Query: 393 KKQQSDATD----WFQDSQWAIGASSSTTNVXXXXXXXXXXXXXXXTSSTGNERVAAGDK 226 DA + F D W +S+ N + N V A + Sbjct: 549 TAPNYDAMNVGGGIFDD--WNDFTASTAIN----------DSQAKAGTQNDNHVVDALEN 596 Query: 225 MSELNLFSSTADTRAQEVDFGNFSQSDLFSGSFSNKNPTDTQA 97 SELNLF ++ ++DF +FS S+LFS S ++ ++ A Sbjct: 597 TSELNLF---CPSKFDDMDFSSFSHSNLFSSSHHTESISEAGA 636 >ref|XP_019162948.1| PREDICTED: uncharacterized protein LOC109159297 isoform X1 [Ipomoea nil] Length = 721 Score = 73.6 bits (179), Expect = 2e-10 Identities = 95/343 (27%), Positives = 145/343 (42%), Gaps = 36/343 (10%) Frame = -1 Query: 1017 DDLFANTTSATFQQAEPLDSVVQAND-------GFPVVQATFKNEGLDNK-KTNVDSDPF 862 DDL N T+ F+Q D V+ ND P++ F+ DN+ TNV S P Sbjct: 323 DDLGNNATTEAFEQKGTFDRKVEVNDSQQQNSVNTPIIDDWFQ----DNQWPTNVVSAPN 378 Query: 861 PSTG--EDWAQDDFFANTTSATFQQAEPLDPVVQANDG-------FSGHSNDHDSKGVDE 709 P+ ++ + DD+ T+S+T + +PL + ND F + ++SK DE Sbjct: 379 PNATNIDEDSFDDWNDFTSSSTVK--DPLGKAITQNDVSTDMDSIFGSGKDLNESKKGDE 436 Query: 708 EDWFNDGNWQKNSASNTAVSLQADKLDMFDKHDDGSLQDNLNDKSAEGVDVDWFENTNWQ 529 F++ + N +NT Q D +D Q+NLN + VDWF++ W Sbjct: 437 SVAFHEVSKWNN--ANTEAFEQKGSFDPMVGVNDSQQQNNLN----APITVDWFQDNQWP 490 Query: 528 ---KSATNNTTINKDDNLFDMKPQNDAVSSSS----LFNDLIQNDSSD--------PDNS 394 SA N N D++ FD ND SSS+ L + QND+ DN+ 
Sbjct: 491 TNVASAPNPDATNIDEDSFD--DWNDFTSSSTVKDPLGKGITQNDNQADVPFDILVSDNA 548 Query: 393 KKQQSDATD----WFQDSQWAIGASSSTTNVXXXXXXXXXXXXXXXTSSTGNERVAAGDK 226 DA + F D W +S+ N + N V A + Sbjct: 549 TAPNYDAMNVGGGIFDD--WNDFTASTAIN----------DSQAKAGTQNDNHVVDALEN 596 Query: 225 MSELNLFSSTADTRAQEVDFGNFSQSDLFSGSFSNKNPTDTQA 97 SELNLF ++ ++DF +FS S+LFS S ++ ++ A Sbjct: 597 TSELNLF---CPSKFDDMDFSSFSHSNLFSSSHHTESISEAGA 636 >ref|XP_006585997.1| PREDICTED: mucin-21-like isoform X2 [Glycine max] ref|XP_006585998.1| PREDICTED: mucin-21-like isoform X2 [Glycine max] gb|KRH45824.1| hypothetical protein GLYMA_08G294700 [Glycine max] gb|KRH45825.1| hypothetical protein GLYMA_08G294700 [Glycine max] Length = 615 Score = 71.6 bits (174), Expect = 6e-10 Identities = 89/368 (24%), Positives = 145/368 (39%), Gaps = 54/368 (14%) Frame = -1 Query: 1029 DWAQDDLFA---NTTSATFQQAEPLDSVVQAND--GFPVVQ---ATFKNE---------- 904 DW QDDL+ N T+ T AE DS + ND G Q +T N Sbjct: 225 DWMQDDLWQGSDNKTTDTVPTAEDKDSFDEWNDFTGSGSTQDPSSTISNSKTTAQTGNVG 284 Query: 903 ---GLDNKKTNVDSDPFPSTGEDWAQDDFFANTTSAT--FQQAEPLDPVVQANDGFSGHS 739 ++ KT+ D++ + DW QD + N T E D A + F+G + Sbjct: 285 YSVDFNDTKTSQDANSSSNKDFDWMQDQWQDNNNKTTNAISGNEAAD-AFDAWNNFTGSA 343 Query: 738 N-DHDSKGVDEEDWFNDGNWQKNSASNTAVSLQADKLDMFDKHDDGSLQDNLNDKSAEGV 562 N H S G+ SN+ ++ QA K ++ H+D ++ S+ Sbjct: 344 NTQHSSFGL----------------SNSEITGQAGKFELSQDHNDTKTAESATGSSS--- 384 Query: 561 DVDWFENTNWQKS---ATNNTTINKDDNLFDM----------KPQNDAVSSSSLFNDLIQ 421 + DW ++ WQ S AT T N+ ++FD + + VS S++ + Sbjct: 385 NFDWMQDNQWQGSDDKATGIVTTNEASDVFDTWNDFTGSAISQNPSSGVSDSAITAQTRK 444 Query: 420 ND-SSDPDNSKKQQSD------ATDWFQDSQWAIGASSST-TNVXXXXXXXXXXXXXXXT 265 ++ ++D D+ K ++ + D QD W + + +T T Sbjct: 445 SEVTADLDDMKTEEGTNASSCRSFDRMQDDLWQVSNNKTTVTRTTNDIDSFDVWNDFTSL 504 Query: 264 SSTGNE---------RVAAGDKMSELNLFSSTADTRAQEVDFGNFSQSDLFSGSFSNKNP 112 +ST + + + + SE NL SS+ + + DF FSQ DLFSG F + P Sbjct: 505 ASTQDHSSNVWKQTVNLTSAEMTSETNLLSSS--NSSHDKDFSGFSQHDLFSGQFGSSLP 562 Query: 111 TDTQAGFD 88 + D Sbjct: 563 
VTSSNRVD 570 >ref|XP_003530674.1| PREDICTED: mucin-21-like isoform X1 [Glycine max] gb|KRH45823.1| hypothetical protein GLYMA_08G294700 [Glycine max] Length = 726 Score = 71.6 bits (174), Expect = 7e-10 Identities = 89/368 (24%), Positives = 145/368 (39%), Gaps = 54/368 (14%) Frame = -1 Query: 1029 DWAQDDLFA---NTTSATFQQAEPLDSVVQAND--GFPVVQ---ATFKNE---------- 904 DW QDDL+ N T+ T AE DS + ND G Q +T N Sbjct: 336 DWMQDDLWQGSDNKTTDTVPTAEDKDSFDEWNDFTGSGSTQDPSSTISNSKTTAQTGNVG 395 Query: 903 ---GLDNKKTNVDSDPFPSTGEDWAQDDFFANTTSAT--FQQAEPLDPVVQANDGFSGHS 739 ++ KT+ D++ + DW QD + N T E D A + F+G + Sbjct: 396 YSVDFNDTKTSQDANSSSNKDFDWMQDQWQDNNNKTTNAISGNEAAD-AFDAWNNFTGSA 454 Query: 738 N-DHDSKGVDEEDWFNDGNWQKNSASNTAVSLQADKLDMFDKHDDGSLQDNLNDKSAEGV 562 N H S G+ SN+ ++ QA K ++ H+D ++ S+ Sbjct: 455 NTQHSSFGL----------------SNSEITGQAGKFELSQDHNDTKTAESATGSSS--- 495 Query: 561 DVDWFENTNWQKS---ATNNTTINKDDNLFDM----------KPQNDAVSSSSLFNDLIQ 421 + DW ++ WQ S AT T N+ ++FD + + VS S++ + Sbjct: 496 NFDWMQDNQWQGSDDKATGIVTTNEASDVFDTWNDFTGSAISQNPSSGVSDSAITAQTRK 555 Query: 420 ND-SSDPDNSKKQQSD------ATDWFQDSQWAIGASSST-TNVXXXXXXXXXXXXXXXT 265 ++ ++D D+ K ++ + D QD W + + +T T Sbjct: 556 SEVTADLDDMKTEEGTNASSCRSFDRMQDDLWQVSNNKTTVTRTTNDIDSFDVWNDFTSL 615 Query: 264 SSTGNE---------RVAAGDKMSELNLFSSTADTRAQEVDFGNFSQSDLFSGSFSNKNP 112 +ST + + + + SE NL SS+ + + DF FSQ DLFSG F + P Sbjct: 616 ASTQDHSSNVWKQTVNLTSAEMTSETNLLSSS--NSSHDKDFSGFSQHDLFSGQFGSSLP 673 Query: 111 TDTQAGFD 88 + D Sbjct: 674 VTSSNRVD 681 >gb|KHN13472.1| hypothetical protein glysoja_029573 [Glycine soja] Length = 726 Score = 70.1 bits (170), Expect = 2e-09 Identities = 89/368 (24%), Positives = 137/368 (37%), Gaps = 54/368 (14%) Frame = -1 Query: 1029 DWAQDDLFA---NTTSATFQQAEPLDSVVQAND--GFPVVQ---ATFKNE---------- 904 DW QDDL+ N T+ T AE DS + ND G Q +T N Sbjct: 336 DWMQDDLWQGSDNKTTDTVPTAEDKDSFDEWNDFTGSGSTQDPSSTISNSKTTAQTGNVG 395 Query: 903 ---GLDNKKTNVDSDPFPSTGEDWAQDDFFANTTSAT--FQQAEPLDPVVQANDGFSGHS 739 ++ KT+ D++ + DW QD + N T E D A + F+G + Sbjct: 396 
YSVDFNDTKTSQDANSSSNKDFDWMQDQWQDNNNKTTNAISGNEAAD-AFDAWNNFTGSA 454 Query: 738 N-DHDSKGVDEEDWFNDGNWQKNSASNTAVSLQADKLDMFDKHDDGSLQDNLNDKSAEGV 562 N H S G+ SN+ ++ QA K + H+D ++ S+ Sbjct: 455 NTQHSSFGL----------------SNSEITGQAGKFEFSQDHNDTKTAESATGSSS--- 495 Query: 561 DVDWFENTNWQKS---ATNNTTINKDDNLFDM----------KPQNDAVSSSSLFNDLIQ 421 + DW ++ WQ S AT T N+ + FD + + VS S++ + Sbjct: 496 NFDWMQDNQWQGSDDKATGIVTTNEASDEFDAWNDFTGSAISQNPSSGVSDSAITAQTRK 555 Query: 420 -------NDSSDPDNSKKQQSDATDWFQDSQWAIGASSST-TNVXXXXXXXXXXXXXXXT 265 ND + + + D QD QW + + +T T Sbjct: 556 SEVTADLNDMKTEEGTNASSCRSFDRMQDDQWQVSNNKTTVTRTTNDIDSFDVWNDFTSL 615 Query: 264 SSTGNE---------RVAAGDKMSELNLFSSTADTRAQEVDFGNFSQSDLFSGSFSNKNP 112 +ST + + + SE NL SS+ + + DF FSQ DLFSG F + P Sbjct: 616 ASTQDHSSNVWKQTVNPTSAEMTSETNLLSSS--NSSHDKDFSGFSQHDLFSGQFGSSLP 673 Query: 111 TDTQAGFD 88 + D Sbjct: 674 VTSSNRVD 681 >ref|XP_009347148.2| PREDICTED: uncharacterized protein LOC103938828 [Pyrus x bretschneideri] Length = 714 Score = 69.7 bits (169), Expect = 3e-09 Identities = 77/315 (24%), Positives = 124/315 (39%), Gaps = 25/315 (7%) Frame = -1 Query: 894 NKKTNVDSDPFPSTGEDWAQDDFFANTTSATFQQAEPLDPVVQANDGFSGH------SND 733 N+K+N S DW QDD + S E L+ + + G + S Sbjct: 386 NEKSNHSLTGSTSMSTDWFQDDLVGVSNSVFSGGPEQLETLAEVKGGVQDNQLRTTSSKA 445 Query: 732 HDSKGVDEEDWFNDGNWQKNSASNTAVSLQADKLDMFDKHDDGSLQDNLNDKSAEGVDVD 553 D+K D++D D D ++ S NL D S + VD Sbjct: 446 SDNKTTDKDD---------------------DSFDAWNDFASLSSAPNLVDGSVKQNGVD 484 Query: 552 WFENTNWQKS---ATNNTTINKDDNLFDMKPQNDAVSSSSLFNDLIQNDSSDPDNSKKQQ 382 W ++ Q + A +N T ++DD+ FD ND SSS N D+S KQ Sbjct: 485 WVQDNQLQTTSSKAPDNKTTDEDDDSFDA--WNDFASSS--------NAPKVVDSSVKQS 534 Query: 381 SDATDWFQDSQWAI----GASSSTTNVXXXXXXXXXXXXXXXTSST--------GNERVA 238 DW QD+Q A + TT+ +S N + Sbjct: 535 G--VDWVQDNQLQTTVNKAADNKTTDADDDSFDAWNDFTGSNNASNLADSSVQQSNNQTT 592 Query: 237 AGDKMSELNLFSSTADTRAQEVDFGNFSQSDLFSGSFSNKNPTDTQAGF----DIFSQVS 70 D SE+NLF + +++ ++DFG+F Q D +G+F++ + + G + +++ Sbjct: 593 
PVDHTSEINLFGAASNSG--DLDFGSFLQPDFSAGAFNSSSGSTVVDGAQPEPSVLDRLA 650 Query: 69 TASRNANVEAENSAE 25 AS N ++E+ AE Sbjct: 651 DASTNNGKKSEDVAE 665 >ref|XP_022759061.1| uncharacterized protein LOC111305625 [Durio zibethinus] Length = 852 Score = 69.3 bits (168), Expect = 4e-09 Identities = 100/411 (24%), Positives = 153/411 (37%), Gaps = 68/411 (16%) Frame = -1 Query: 1029 DWAQDDLFANTTSATFQQAEPLDSVVQANDGFPVVQA----------------TFKNEGL 898 +W QDDL++N+TS Q E D+ V DG + T N+ Sbjct: 433 NWLQDDLWSNSTSGILQHLEQSDANVGDKDGGMLRDLSNYSMSINRIQDDQWQTTSNKEA 492 Query: 897 DNKKTNVDSDPFPSTGE------------DWAQDDFFANTTS-----------ATFQQA- 790 DN + D D F + + W + AN+ FQ A Sbjct: 493 DNGTNDEDDDSFGAWNDFKSSSVVHSSISSWKEPAIHANSMEEKSSDPFSGWDTDFQSAN 552 Query: 789 --------EPLDPVVQANDGFSGHSNDHDSKGVDEED--------------WFNDGNWQK 676 + DP+V ++ H + + G D D WF D W Sbjct: 553 SKNHHDGSKSSDPLVGSSIDLFDHMDAVFASGKDLVDGKAKDGSSASNANSWFQDDRWS- 611 Query: 675 NSASNTAVSLQADKLDMFDKHDDGSLQDNLNDKSAEGVDVDWFENTNW---QKSATNNTT 505 NS SN ++ QA+ D + G+ Q N S + VDWF + W K A + T Sbjct: 612 NSTSN--LTRQAENFDTAVMNS-GADQTVHNSSSMK---VDWFPDDQWLTGNKKAPDRKT 665 Query: 504 INKDDNLFDMKPQNDAVSSSSLFNDLIQNDSSD---PDNSKKQQSDATDWFQDSQWAIGA 334 +++ DN F ND +S+++ D N + PDN D+ D A Sbjct: 666 VDESDNSFG--DWNDFKTSTTM-QDAFSNSWKEVAIPDNK------TIDYNDDLSAAWND 716 Query: 333 SSSTTNVXXXXXXXXXXXXXXXTSSTGNERVAAGDKMSELNLFSSTADTRAQEVDFGNFS 154 +S+T+ S G SE +LF T D + +FG+ S Sbjct: 717 FTSSTSAKDPSSISFEQAVHHEKPSVGT---------SETHLF--TMDRNSYNNNFGSLS 765 Query: 153 QSDLFSGSFSNKNPTDTQAGFDIFSQVSTASRNANVEAENSAERPKNDEFS 1 + D FSG+FS+++ + + S +ANV N+AE K+ +FS Sbjct: 766 EPDFFSGAFSSQSVSTEINIMLPEAPDSDRMADANVRGGNNAEVAKDGDFS 816