BLASTX nr result
ID: Dioscorea21_contig00002066
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Dioscorea21_contig00002066 (4297 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|AAX19872.1| unknown, partial [Doryanthes excelsa] 89 8e-15 ref|XP_004079613.1| PREDICTED: uncharacterized protein LOC101155... 88 2e-14 ref|XP_002490879.1| Mucin-like protein [Komagataella pastoris GS... 81 2e-12 emb|CCA37660.1| Cell surface glycoprotein 1 [Komagataella pastor... 75 2e-10 ref|XP_002259727.1| hypothetical protein, conserved in Plasmodiu... 73 6e-10 >gb|AAX19872.1| unknown, partial [Doryanthes excelsa] Length = 252 Score = 89.4 bits (220), Expect = 8e-15 Identities = 73/248 (29%), Positives = 122/248 (49%), Gaps = 19/248 (7%) Frame = +3 Query: 2838 IKVAEKEAIAEKGSTSANDTEDKDSPVSSSFTQATQVALPRDANEVVEDNAKRSHGVEES 3017 + + E + EK S+SA TEDKD ++ V L + N+ ED + ++ + Sbjct: 5 LNLVSNEQVNEKESSSAELTEDKD------YSHKLSVVLEPEVNQPAEDTDR----LQLA 54 Query: 3018 DNSQHEESLQSDPADLPVSRFLMDHIL-HQGDVPDEVGNDESKNNIEEETTSDKLMTISI 3194 + + HEE Q+D DLPVSRFLMDHIL +GD ++ + ESK+ I EE + + I Sbjct: 55 EGASHEEKQQTDLHDLPVSRFLMDHILREEGDSLNKACDLESKDKIHEENDGGEQNEVEI 114 Query: 3195 QEQEIFMDRLSAELVPSENIV--ENKGMEKLEDTI-LQQDNMPADVCD-----VEADNEI 3350 +QE + LSAE V E + ++ G +KLE++ L Q+ DV + V D+ Sbjct: 115 PKQEGILVSLSAEQVTEETSISEDSVGHKKLENSSHLVQEQQSHDVKENTEIPVVKDDVT 174 Query: 3351 KQKHTTDEQ----MGTDIQEKE-AIMDSVSAQQMPEEKSTSEDRDQEKPG-----DHFLE 3500 Q D +G DI+++ ++ VS +P + +Q PG + ++ Sbjct: 175 PQSSLHDSSPTVAVGEDIKDEVIRLLSEVSTHVVPNDSMELTSTEQNTPGIKVDTEECMD 234 Query: 3501 EGIAPEDV 3524 G+ +++ Sbjct: 235 PGLVQDEI 242 Score = 59.3 bits (142), Expect = 9e-06 Identities = 64/252 
(25%), Positives = 104/252 (41%), Gaps = 32/252 (12%) Frame = +3 Query: 3315 ADVCDVEADNEIKQKHTTDEQMGTDIQEKEAIMDSVSAQQMPEEKSTSEDRD-------- 3470 A+V ++ ++ ++ +K E ++ E + +S PE +ED D Sbjct: 2 AEVLNLVSNEQVNEK----ESSSAELTEDKDYSHKLSVVLEPEVNQPAEDTDRLQLAEGA 57 Query: 3471 --QEKPG-------------DHFL-EEGIAPEDVCDLKAXXXXXXXXXXXXXXXXXXXXQ 3602 +EK DH L EEG + CDL++ Q Sbjct: 58 SHEEKQQTDLHDLPVSRFLMDHILREEGDSLNKACDLESKDKIHEENDGGEQNEVEIPKQ 117 Query: 3603 EVNMSPLSAKEIPKDEYTD-------KVEVVSDVTNEEQNHETKADDEKPIAKDDESLHS 3761 E + LSA+++ ++ K+E S + E+Q+H+ K + E P+ KDD + S Sbjct: 118 EGILVSLSAEQVTEETSISEDSVGHKKLENSSHLVQEQQSHDVKENTEIPVVKDDVTPQS 177 Query: 3762 NICLSSTTTA-SEPVEIEVPGSLAGESASVVMDDYKGVTPMDQGDLGSDAGAEELTGRTS 3938 ++ SS T A E ++ EV L+ S VV +D +T +Q G EE Sbjct: 178 SLHDSSPTVAVGEDIKDEVIRLLSEVSTHVVPNDSMELTSTEQNTPGIKVDTEECMDPGL 237 Query: 3939 VQDEKIIASPTE 3974 VQDE I A+ E Sbjct: 238 VQDEIIQATVRE 249 >ref|XP_004079613.1| PREDICTED: uncharacterized protein LOC101155885 [Oryzias latipes] Length = 1066 Score = 88.2 bits (217), Expect = 2e-14 Identities = 94/478 (19%), Positives = 203/478 (42%), Gaps = 9/478 (1%) Frame = +3 Query: 2742 NVADEGIANGGIPDIPELAQASAEDTIDTEIKIKVAEKEAIAEK---GSTSANDTEDKDS 2912 NV +E + G + + L + E ++ I+ + E+E + EK G+ + E+++ Sbjct: 200 NVEEENVEEGNVEE-ENLKENVEEGNVEKNIEEENVEEENVGEKMQEGNVKEENVEEEN- 257 Query: 2913 PVSSSFTQATQVALPRDANEVVEDNAKRSHGVEES---DNSQHEESLQSDPADLPVSRFL 3083 V + V V E+N + + EE+ +N + E + + + V + + Sbjct: 258 -VKEENVEEENVEEENLKENVKEENVEEENVEEENVEEENVEEENVEEENVEEENVEKNI 316 Query: 3084 MDHILHQGDVPDEVGNDESKNNIEEETTSDKLMTISIQEQEIFMDRLSAELVPSENIVEN 3263 + + + +V +E +E N+EEE ++ + +QE + + + E V EN+ EN Sbjct: 317 EEENVEEENVEEENVKEE---NVEEENVKEENVGEKMQEGNVKEENVEEENVEEENLKEN 373 Query: 3264 KGMEKLEDTILQQDNMPADVCDVEADNEIKQKHTTDEQMGTDIQEKEAIMDSVSAQQMPE 3443 E +E+ ++++N+ +V + + +++++ +E + + E+E + ++V + + E Sbjct: 374 VKEENVEEENVEEENLKENVKENVEEENVEEENVKEENVEEENVEEENLKENVKEENVEE 433 Query: 3444 EKSTSEDRDQEKPGDHFLEEGIAPEDVCDLKAXXXXXXXXXXXXXXXXXXXXQEVNMSPL 3623 E E+ 
++E + +EE E+ K +E N+ Sbjct: 434 ENVEEENVEEENVKEENVEEENVKEENVGEKMQEGNVKEENVEEENVKEENVEEENVEEE 493 Query: 3624 SAKEIPKDEYTDKVEVVSDVTNEEQNHETKADDEKPIAK---DDESLHSNICLSSTTTAS 3794 + KE K+E ++ E V + EE+N E + +E+ + + ++E+L N+ + Sbjct: 494 NLKENVKEENVEE-ENVEEENVEEENVEEENVEEENVEEGNVEEENLKENVEEGNVEKNI 552 Query: 3795 EPVEIEVPGSLAGESASVVMDDYKGVTPMDQGDLGSDAGAEELTGRTSVQDEKIIASPTE 3974 E +E GE M +G++ + EE +V++E + E Sbjct: 553 EEENVEEEN--VGEK-------------MQEGNVKEENVEEENVKEENVEEENV----EE 593 Query: 3975 TEIRENLTQMDNYSPATNASNVSIPRDGENTRTFDNVEQIEKEAHSISEEQITSAISE 4148 ++EN+ + NV E +NVE+ E ++ EE + I E Sbjct: 594 ENLKENVKE----------ENVEEENVEEENVEEENVEEENVEEENVEEENVEKNIEE 641 Score = 71.2 bits (173), Expect = 2e-09 Identities = 75/387 (19%), Positives = 166/387 (42%), Gaps = 1/387 (0%) Frame = +3 Query: 2973 VVEDNAKRSHGVEESDNSQH-EESLQSDPADLPVSRFLMDHILHQGDVPDEVGNDESKNN 3149 V E+N + + EE+ ++ EE++Q + V ++ + + +V +E +E++ N Sbjct: 107 VEEENVEEGNVEEENVEEENVEENMQEGNVEENVQEGNVEKNVEEENVEEENVEEENEEN 166 Query: 3150 IEEETTSDKLMTISIQEQEIFMDRLSAELVPSENIVENKGMEKLEDTILQQDNMPADVCD 3329 +EEE + + +++E+ + + + E V EN+ E E +E+ ++++N+ +V + Sbjct: 167 VEEENLKENVKEENVEEENVEEENVEEENVEEENVEE----ENVEEGNVEEENLKENVEE 222 Query: 3330 VEADNEIKQKHTTDEQMGTDIQEKEAIMDSVSAQQMPEEKSTSEDRDQEKPGDHFLEEGI 3509 + I++++ +E +G +QE ++V + + EE E+ ++E ++ EE + Sbjct: 223 GNVEKNIEEENVEEENVGEKMQEGNVKEENVEEENVKEENVEEENVEEENLKENVKEENV 282 Query: 3510 APEDVCDLKAXXXXXXXXXXXXXXXXXXXXQEVNMSPLSAKEIPKDEYTDKVEVVSDVTN 3689 E+V +E N+ + +E E V + Sbjct: 283 EEENV-------------------------EEENVEEENVEE----------ENVEEENV 307 Query: 3690 EEQNHETKADDEKPIAKDDESLHSNICLSSTTTASEPVEIEVPGSLAGESASVVMDDYKG 3869 EE+N E ++E +++ N+ E V+ E G E V ++ Sbjct: 308 EEENVEKNIEEEN--VEEENVEEENV--KEENVEEENVKEENVGEKMQEGN--VKEENVE 361 Query: 3870 VTPMDQGDLGSDAGAEELTGRTSVQDEKIIASPTETEIRENLTQMDNYSPATNASNVSIP 4049 +++ +L + EE +V++E + + E EN+ + + NV Sbjct: 362 EENVEEENLKENV-KEENVEEENVEEENLKENVKENVEEENVEEENVKEENVEEENVEEE 420 Query: 4050 RDGENTRTFDNVEQIEKEAHSISEEQI 4130 EN + +NVE+ E 
++ EE + Sbjct: 421 NLKENVKE-ENVEEENVEEENVEEENV 446 Score = 69.7 bits (169), Expect = 7e-09 Identities = 53/266 (19%), Positives = 126/266 (47%), Gaps = 6/266 (2%) Frame = +3 Query: 2742 NVADEGIANGGIPDIPELAQASAEDTIDTEIKIKVAEKEAIAEK---GSTSANDTEDKDS 2912 NV +E + G + + L + E ++ I+ + E+E + EK G+ + E+++ Sbjct: 523 NVEEENVEEGNVEE-ENLKENVEEGNVEKNIEEENVEEENVGEKMQEGNVKEENVEEEN- 580 Query: 2913 PVSSSFTQATQVALPRDANEVVEDNAKRSHGVEES---DNSQHEESLQSDPADLPVSRFL 3083 V + V V E+N + + EE+ +N + E + + + V + + Sbjct: 581 -VKEENVEEENVEEENLKENVKEENVEEENVEEENVEEENVEEENVEEENVEEENVEKNI 639 Query: 3084 MDHILHQGDVPDEVGNDESKNNIEEETTSDKLMTISIQEQEIFMDRLSAELVPSENIVEN 3263 + + + +V +E +E N+EEE ++ + +QE + + + E V EN+ EN Sbjct: 640 EEENVEEENVEEENVKEE---NVEEENVKEENVGEKMQEGNVKEENVEEENVEEENLKEN 696 Query: 3264 KGMEKLEDTILQQDNMPADVCDVEADNEIKQKHTTDEQMGTDIQEKEAIMDSVSAQQMPE 3443 E +E+ ++++N+ +V + + +++++ +E + + E+E + ++V + + E Sbjct: 697 VKEENVEEENVEEENLKENVKENVEEENVEEENVKEENVEEENVEEENLKENVKEENVEE 756 Query: 3444 EKSTSEDRDQEKPGDHFLEEGIAPED 3521 E E+ ++E + +EE E+ Sbjct: 757 ENVEEENVEEENVKEENVEEENVKEE 782 Score = 65.5 bits (158), Expect = 1e-07 Identities = 76/437 (17%), Positives = 171/437 (39%), Gaps = 14/437 (3%) Frame = +3 Query: 2742 NVADEGIANGGIPDIPELAQASAEDTIDTEIKIKVAEKEAIAEKGSTSANDTEDK----- 2906 NV +E + + + + E+ + ++ + E+E + E+ N E+ Sbjct: 484 NVEEENVEEENLKENVKEENVEEENVEEENVEEENVEEENVEEENVEEGNVEEENLKENV 543 Query: 2907 -----DSPVSSSFTQATQVALPRDANEVVEDNAKRSHGVEES--DNSQHEESLQSDPADL 3065 + + + V V E+N + + EE+ + + EE+L+ + + Sbjct: 544 EEGNVEKNIEEENVEEENVGEKMQEGNVKEENVEEENVKEENVEEENVEEENLKENVKEE 603 Query: 3066 PVSRFLMDHILHQGDVPDEVGNDES--KNNIEEETTSDKLMTISIQEQEIFMDRLSAELV 3239 V + + + +V +E +E+ + N+EEE + +++E+ + + + E V Sbjct: 604 NVE----EENVEEENVEEENVEEENVEEENVEEENVEKNIEEENVEEENVEEENVKEENV 659 Query: 3240 PSENIVENKGMEKLEDTILQQDNMPADVCDVEADNEIKQKHTTDEQMGTDIQEKEAIMDS 3419 EN+ E EK+++ ++++N+ + + E E ++ +E+ + KE + ++ Sbjct: 660 EEENVKEENVGEKMQEGNVKEENVEEENVEEENLKENVKEENVEEENVEEENLKENVKEN 719 Query: 3420 
VSAQQMPEEKSTSEDRDQEKPGDHFLEEGIAPEDVCDLKAXXXXXXXXXXXXXXXXXXXX 3599 V + + EE E+ ++E + L+E + E+V Sbjct: 720 VEEENVEEENVKEENVEEENVEEENLKENVKEENV--------------------EEENV 759 Query: 3600 QEVNMSPLSAKEIPKDEYTDKVEVVSDVTNEEQNHETKADDEKPIAKDDESLHSNICLSS 3779 +E N+ + KE +E K E V + E E ++E K++ N+ Sbjct: 760 EEENVEEENVKEENVEEENVKEENVGEKMQEGNVKEENVEEENENVKEENVEEENV--EE 817 Query: 3780 TTTASEPVEIEVPGSLAGESASVVMDDYKGVTPMDQGDLGSDAGAEELTGRTSVQDEKII 3959 E VE E E +V ++ + +G++ + EE +V++ + Sbjct: 818 ENVEEENVEEENVEEENVEEGNVEEEN------LQEGNVEEENVEEENVEEENVEEGNMQ 871 Query: 3960 ASPTETEIRENLTQMDN 4010 E + E Q N Sbjct: 872 EGNVEKNVEEENVQEGN 888 >ref|XP_002490879.1| Mucin-like protein [Komagataella pastoris GS115] gi|238030675|emb|CAY68599.1| Mucin-like protein [Komagataella pastoris GS115] Length = 1416 Score = 81.3 bits (199), Expect = 2e-12 Identities = 110/506 (21%), Positives = 203/506 (40%), Gaps = 20/506 (3%) Frame = +3 Query: 2730 TDGNNVADEGIANGGIPDIPELAQASAEDTIDTEIKIKVAEKEAIAEK--GSTSANDTED 2903 TD +V++ A P E A+ SAE++ TE + E A E+ STS D E+ Sbjct: 438 TDSQSVSESSAAEDSTPT--EEAEESAEESTSTEDAEESTEDFATTEEVEESTSTEDAEE 495 Query: 2904 KDSPVSSSFTQATQVALPRDANEVVEDNAKRSHG---VEESDNSQH-----EESLQSDPA 3059 S + + +T+ A + E E++ + S VEES +++ EES ++ A Sbjct: 496 STSTEEAEESTSTEDAEESTSTEEAEESTEESTSTDEVEESTSTEEVEESTEESTSTEDA 555 Query: 3060 DLPVSRFLMDHILHQGDVPDEVGNDESKNNIE---EETTSDKLMTISIQEQEIFMDRLSA 3230 + S + + DEV S +E EE+TS + S +E+ + + Sbjct: 556 EESTSTEEAEESTEESTSTDEVEESTSTEEVEESTEESTSTDEVEESTSTEEV--EESTE 613 Query: 3231 ELVPSENIVENKGMEKLEDTILQQDNMPADVCDVEADNEIKQKHTTDEQMGTDIQEKEAI 3410 E ++ + E+ E++E++ ++ D + + E ++ +T+E TD E + Sbjct: 614 ESTSTDEVEESTSTEEVEES-TEESTSTEDAEESTSTEEAEE--STEESTSTD--EVDES 668 Query: 3411 MDSVSAQQMPEEKSTSEDRDQE---KPGDHFLEEGIAPEDVCDLKAXXXXXXXXXXXXXX 3581 + A++ EE +++ED ++ + + EE + E+ Sbjct: 669 TSTEEAEESTEESTSTEDAEESTSTEEAEESTEESTSTEET------------EESTEEL 716 Query: 3582 XXXXXXQEVNMSPLSAKEIPKDEYTDKVEVVSDVTNEEQNHETKADDEKP----IAKDDE 3749 +E P S E+ + TD V+ + EQ T +P ++ E Sbjct: 717 
TSTEEAEESTEEPTSTDEVDESTSTDDVDESTSTEGTEQFSSTDVPQGRPGFENPTEEVE 776 Query: 3750 SLHSNICLSSTTTASEPVEIEVPGSLAGESASVVMDDYKGVTPMDQGDLGSDAGAEELTG 3929 S + T+T E S S+ DD + T +++ ++ EE T Sbjct: 777 SSSTEEFEEPTSTDETDESTEEATSTEEAEESISTDDVEQSTSVEE----AEESTEESTS 832 Query: 3930 RTSVQDEKIIASPTETEIRENLTQMDNYSPATNASNVSIPRDGENTRTFDNVEQIEKEAH 4109 ++++ T T EN++ +D + + S E+T T D E E Sbjct: 833 TEALEES------TSTGDFENISAVDEELEESTEESTSTEEVEESTSTEDAEESTSTEEA 886 Query: 4110 SISEEQITSAISELLGSTTTTVRSEE 4187 S E+ TS +E +T+T +EE Sbjct: 887 EESTEESTS--TEDAEESTSTEEAEE 910 >emb|CCA37660.1| Cell surface glycoprotein 1 [Komagataella pastoris CBS 7435] Length = 1618 Score = 75.1 bits (183), Expect = 2e-10 Identities = 112/505 (22%), Positives = 190/505 (37%), Gaps = 19/505 (3%) Frame = +3 Query: 2730 TDGNNVADEGIANGGIPDIPELAQASAEDTIDTEIKIKVAEKEAIAEKGSTSANDTEDKD 2909 TD +V++ A P E A+ SAE++ TE + E A E+ S T +D Sbjct: 438 TDSQSVSESSAAEDSTPT--EEAEESAEESTSTEDAEESTEDFATTEEVEES---TSTED 492 Query: 2910 SPVSSSFTQATQVALPRDANEVVEDNAKRSHGVEESDNSQHEESLQSDPADLPVSRFLMD 3089 + S+S +A + DA E S EE++ S EES +D + S ++ Sbjct: 493 AEESTSTEEAEESTSTEDAEE--------STSTEEAEEST-EESTSTDEVEESTSTEEVE 543 Query: 3090 HILHQGDVPDEVGNDESKNNIEEET-----TSDKLMTISIQEQEIFMDRLSAELVPSENI 3254 + DEV S +EE T T D + S +E E + E ++ + Sbjct: 544 ESTEESTSTDEVEESTSTEEVEESTEESTSTEDAEESTSTEEAE----ESTEESTSTDEV 599 Query: 3255 VENKGMEKLEDTILQQ---DNMPADVCDVEADNEIKQKHTTD--EQMGTDIQEKEAIMDS 3419 E+ E+ E++ + D + E + I++ +T+ E+ ++ E +S Sbjct: 600 DESTSTEEAEESTEESTSTDEVEESTSTEEVEESIEESTSTEEPEESTEELTSTEEAEES 659 Query: 3420 VSAQQMPE--EKSTS----EDRDQEKPGDHFLEEGIAPEDVCDLKAXXXXXXXXXXXXXX 3581 S ++ E E+STS E+ +E +EE + E+V + Sbjct: 660 TSTDEVDESTEESTSTEEAEESTEESTSTDEVEESTSTEEV---EESTEESTSTEDAEES 716 Query: 3582 XXXXXXQEVNMSPLSAKEIPKDEYTDKVEVVSDVTNEEQNHETKADDEKPIAKDDESLHS 3761 +E S E+ + T++VE E T +D + +E+ S Sbjct: 717 TSTEEAEESTEESTSTDEVEESTSTEEVE-------ESTEESTSTEDAEESTSTEEAEES 769 Query: 3762 NICLSSTTTASEPVEIEVPGSLAGESASVVMDDYKGVTPMDQGDLGSDAGAEELTGRTSV 3941 +ST E E ES S + 
T + +E+ TS Sbjct: 770 TEESTSTDEVEESTSTEEVEESTEESTSTDEVEESTSTEEVEESTEESTSTDEVEESTST 829 Query: 3942 QD-EKIIASPTETEIRENLTQMDNYSPATNASNVSIPRDGENTRTFDNVEQIEKEAHSIS 4118 ++ E+ T TE E T + +T S S E+T T + E E+ + Sbjct: 830 EEVEESTEESTSTEDAEESTSTEEAEESTEES-TSTDEVDESTSTEEAEESTEESTSTED 888 Query: 4119 EEQITSA--ISELLGSTTTTVRSEE 4187 E+ TS E +T+T +EE Sbjct: 889 AEESTSTEEAEESTEESTSTEETEE 913 >ref|XP_002259727.1| hypothetical protein, conserved in Plasmodium species [Plasmodium knowlesi strain H] gi|193809799|emb|CAQ40503.1| hypothetical protein, conserved in Plasmodium species [Plasmodium knowlesi strain H] Length = 2758 Score = 73.2 bits (178), Expect = 6e-10 Identities = 73/287 (25%), Positives = 116/287 (40%), Gaps = 22/287 (7%) Frame = +1 Query: 364 DKMQKNESPMLDVDEEHDNESSEVNDILENKLSDDD-------GAISGPSSEEKVASSVF 522 D ++K ES DV+EE+DN+ E D +N DDD G + EE+V S V Sbjct: 2238 DGVEKEES---DVEEENDNDDDEEEDEEDNDDDDDDDDGNEDEGEVDSGVDEEEVESDVH 2294 Query: 523 EVVGSTKLGICLNQSEVETETISLDDNKDEIIPKDEETLYQTSDSSTVAENLGTPINSES 702 EV ++ DDN D+ DE+ ++ + V G E Sbjct: 2295 EVESGEEVD---------------DDNNDDEYDDDEDDEEESVEVDQVESLEGEEDEDEG 2339 Query: 703 RKLTIN-EGGGSDATDIVNVEPKMVDEGCEDNEEKKFEEVATKSDDRDQYSEISKATVTT 879 ++ E + D + E + +EG ED+EE +EE D D+ E +A Sbjct: 2340 EGSAVDVEQDEDEEEDDADEEEEDEEEGGEDDEEGDYEE-----GDDDEEEEEEEAVEEE 2394 Query: 880 KDADSHEQIKDSNIPNDRRDAERKADTET--EKISSAEAETEPALEEDKIVITPQG---- 1041 + + E+ +D D D E D E E+ S E E E E+ I I QG Sbjct: 2395 EQEEEEEEEEDEEEEED-EDEEEDEDEEVVEEEESGIEVEEEDETTEEDIEIEGQGESDV 2453 Query: 1042 --------DLEDTTINAKEETETKLDTIHPSSIMVDQEDFRNQKDNQ 1158 D+E+ I +EE + + + V+ +D ++D++ Sbjct: 2454 DVEGEEEEDVEEEEIEVEEEVGSSEEEVEEDEADVEDDDEDEEEDDE 2500