BLASTX nr result
ID: Dioscorea21_contig00013389
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00013389 (1688 letters)

Database: ./nr
23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

emb|CAN81514.1| hypothetical protein VITISV_012030 [Vitis vinifera]   333   1e-88
ref|XP_002267519.1| PREDICTED: uncharacterized protein LOC100241...   328   3e-87
ref|XP_002521158.1| conserved hypothetical protein [Ricinus comm...   307   6e-81
ref|XP_002303096.1| predicted protein [Populus trichocarpa] gi|2...   290   1e-75
ref|XP_003520621.1| PREDICTED: uncharacterized protein LOC100793...   289   1e-75

>emb|CAN81514.1| hypothetical protein VITISV_012030 [Vitis vinifera] Length = 1081 Score = 333 bits (853), Expect = 1e-88 Identities = 236/609 (38%), Positives = 311/609 (51%), Gaps = 51/609 (8%) Frame = +3 Query: 3 FYKHQNEVQTIPQPSQTRRITVLKPSKAIEE------GGMNQARIQQ-CPSGEERIWDKN 161 F +H E+Q+IP P T+RITVLKPSK ++ G + +I++ G+ W+KN Sbjct: 262 FTQHLYELQSIPAPPDTKRITVLKPSKVMDNNKFAASGKKIEKQIRKPVQIGQANCWEKN 321 Query: 162 KHRRCSSLENLKVENLS-QPTRIVVLKPSPGKPCEFKALATNRISSPKSLDERDLSRNIE 338 N K + QPTRIVVLKPSP K E K + + SSP+ L + D + Sbjct: 322 NPGYSPPFSNQKADEYPPQPTRIVVLKPSPSKAHEIKVVVSPPSSSPRVLCDEDFHGEPD 381 Query: 339 GDEAIGSRDIAKEITRQMRESMXXXXXXXXXXXXXXXXGYVGDESSFNRSENYVEEAGGN 518 DEA SR++AKEITRQMRE++ GY+GDESSF +SEN E A GN Sbjct: 382 DDEACESREVAKEITRQMRENLSAHRRDETLLSSVFSNGYIGDESSFTKSEN--EFAVGN 439 Query: 519 LSDSEIVTPTSRHSWDYMNRYGXXXXXXXXXXXXXXXESSVIREAKKRLSERWALVASNG 698 LSDSE+++PT RHSWDY+N G ESSV REAKKRLSERWA++ASNG Sbjct: 440 LSDSEVMSPTLRHSWDYINGCGSPYSSSSFSRASYSPESSVCREAKKRLSERWAMMASNG 499 Query: 699 VGQEQTHVRRSSSTLGEMLAISEVKKEENVKELDFTSNGSCGGGDLKASTPCLSIGIASD 878 QEQ 
HVRRSSSTLGEMLA+S++K+ ++E+D + D + ST C++ + D Sbjct: 500 SCQEQKHVRRSSSTLGEMLALSDIKRSVRLEEVDISKE-----QDPRGSTSCVTSNLVKD 554 Query: 879 AANGERSTMGLXXXXXXXXXXXTYENIELNAEVSDTPKSKPDTRTEEGKTKNGKLSFKGK 1058 + S L Y LN EVS K E K K+ K SFKGK Sbjct: 555 -EEADNSPRNLLRSKSVPVSSXVY-GARLNVEVSHPEVGKTHVPKELTKAKSTKSSFKGK 612 Query: 1059 VSSLFFSRNKKSSREKSIPS-------SLVASDARLHPRNADIAVRDDISKPT------- 1196 VSSLFFSR+KKSS+EKS S S A +H DD+S+ Sbjct: 613 VSSLFFSRSKKSSKEKSGVSLCRDESPSATAETLPVHMTAGKFC--DDVSQCANDSGTEE 670 Query: 1197 -----VERSLSPADNP-------------SKATVSLEKXXXXXXXXXXXXXXXXXXVLEA 1322 + RS S +P ++A +S+ K VLE Sbjct: 671 GISHGLRRSSSKPSSPDLIGMVPTQSIISNEAGLSVAKLVTPGNPSESQGQPSPISVLEP 730 Query: 1323 KFEDDSNANVPPCSESS----------HVGHLQALSRSPPIESLARSLSWDDSCMNTSTI 1472 FE+D N N+ H + +SP IES+AR+LSWDDSC T+T Sbjct: 731 PFEEDDNTNLEFAGNIKTDQQGTQVLVHPLKSNLIDKSPRIESIARTLSWDDSCTETAT- 789 Query: 1473 NNPSKLPTRASFKADXXXXXRIDFIRKLLSSTGL-DNKNSKTVFSRWHSLDSPLDQMLLD 1649 P K P+ AS +A+ + F++ LLS+ G DN + T FSRWHS ++PLD L D Sbjct: 790 PYPLK-PSLASSRAEEDEQDWLFFVQTLLSAAGFDDNVQTDTFFSRWHSPETPLDPALRD 848 Query: 1650 GFLDEKDEE 1676 + + D+E Sbjct: 849 KYAELNDKE 857 >ref|XP_002267519.1| PREDICTED: uncharacterized protein LOC100241277 [Vitis vinifera] Length = 991 Score = 328 bits (840), Expect = 3e-87 Identities = 234/609 (38%), Positives = 310/609 (50%), Gaps = 51/609 (8%) Frame = +3 Query: 3 FYKHQNEVQTIPQPSQTRRITVLKPSKAIEE------GGMNQARIQQ-CPSGEERIWDKN 161 F +H E+Q+IP P T+RITVLKPSK ++ G + +I++ G+ W+KN Sbjct: 262 FTQHLYELQSIPAPPDTKRITVLKPSKVMDNNKFAASGKKIEKQIRKPVQIGQANCWEKN 321 Query: 162 KHRRCSSLENLKVENLS-QPTRIVVLKPSPGKPCEFKALATNRISSPKSLDERDLSRNIE 338 N K + QPTRIVVLKPSP K E K + + SSP+ L + D + Sbjct: 322 NPGYSPPFSNQKADEYPPQPTRIVVLKPSPSKAHEIKVVVSPPSSSPRVLCDEDFHGEPD 381 Query: 339 GDEAIGSRDIAKEITRQMRESMXXXXXXXXXXXXXXXXGYVGDESSFNRSENYVEEAGGN 518 DEA SR++AKEITRQMRE++ GY+GDESSF +SEN E A GN Sbjct: 382 DDEACESREVAKEITRQMRENLSAHRRDETLLSSVFSNGYIGDESSFTKSEN--EFAVGN 439 Query: 519 
LSDSEIVTPTSRHSWDYMNRYGXXXXXXXXXXXXXXXESSVIREAKKRLSERWALVASNG 698 LSDSE+++PT RHSWDY+N ESSV REAKKRLSERWA++ASNG Sbjct: 440 LSDSEVMSPTLRHSWDYINS---PYSSSSFSRASYSPESSVCREAKKRLSERWAMMASNG 496 Query: 699 VGQEQTHVRRSSSTLGEMLAISEVKKEENVKELDFTSNGSCGGGDLKASTPCLSIGIASD 878 QEQ HVRRSSSTLGEMLA+S++K+ ++E+D + D + ST C++ + D Sbjct: 497 SCQEQKHVRRSSSTLGEMLALSDIKRSVRLEEVDISKEQ-----DPRGSTSCVTSNLVKD 551 Query: 879 AANGERSTMGLXXXXXXXXXXXTYENIELNAEVSDTPKSKPDTRTEEGKTKNGKLSFKGK 1058 + S L Y LN EVS K E K K+ K SFKGK Sbjct: 552 E-EADNSPRNLLRSKSVPVSSTVY-GARLNVEVSHPEVGKTHVPKELTKAKSTKSSFKGK 609 Query: 1059 VSSLFFSRNKKSSREKSIPS-------SLVASDARLHPRNADIAVRDDISKPT------- 1196 VSSLFFSR+KKSS+EKS S S A +H + DD+S+ Sbjct: 610 VSSLFFSRSKKSSKEKSGVSLCRDESPSATAETLPVHMTAGKVC--DDVSQCANDSGTEE 667 Query: 1197 -----VERSLSPADNP-------------SKATVSLEKXXXXXXXXXXXXXXXXXXVLEA 1322 + RS S +P ++A +S+ K VLE Sbjct: 668 GISHGLRRSSSKPSSPDLIGMVPTQSIISNEAGLSVAKPVTPGNPSESQGQPSPISVLEP 727 Query: 1323 KFEDDSNANVPPCSESS----------HVGHLQALSRSPPIESLARSLSWDDSCMNTSTI 1472 FE+D N N+ H + +SP IES+AR+LSWDDSC T+T Sbjct: 728 PFEEDDNTNLEFAGNIKTDQQGTQVLVHPLKSNLIDKSPRIESIARTLSWDDSCTETAT- 786 Query: 1473 NNPSKLPTRASFKADXXXXXRIDFIRKLLSSTGL-DNKNSKTVFSRWHSLDSPLDQMLLD 1649 P K P+ AS +A+ + F++ LLS+ G DN + T FSRWHS ++PLD L D Sbjct: 787 PYPLK-PSLASSRAEEDEQDWLFFVQTLLSAAGFDDNVQTDTFFSRWHSPETPLDPALRD 845 Query: 1650 GFLDEKDEE 1676 + + D+E Sbjct: 846 KYAELNDKE 854 >ref|XP_002521158.1| conserved hypothetical protein [Ricinus communis] gi|223539727|gb|EEF41309.1| conserved hypothetical protein [Ricinus communis] Length = 990 Score = 307 bits (786), Expect = 6e-81 Identities = 225/613 (36%), Positives = 309/613 (50%), Gaps = 55/613 (8%) Frame = +3 Query: 3 FYKHQNEVQTIPQPSQTRRITVLKPSKAIEE----GGMNQARIQQ---CPSGEERIWDKN 161 F H ++Q+ P +T+RITVL+PSK I+ G M + Q P+G+ +W+KN Sbjct: 263 FSPHLYDMQST-SPPETKRITVLRPSKVIDNDKFPGSMKKGDKQSTKAAPTGQNNVWNKN 321 Query: 162 KHRRCSSLENLKVENLS-QPTRIVVLKPSPGKPCEFKALATNRISSPKSLDERDLSRNIE 338 N + E QPTRIVVLKPSPGK + KA+ + SSP++L + E Sbjct: 
322 NSGYSPIYANQRFEEYPPQPTRIVVLKPSPGKTHDVKAVVSPPSSSPRTLQGEEFYGEAE 381 Query: 339 GDEAIGSRDIAKEITRQMRESMXXXXXXXXXXXXXXXXGYVGDESSFNRSENYVEEAGGN 518 DEA R++AK+IT QM E+ GY+GD+SSFN+SEN E A GN Sbjct: 382 DDEAQKPREMAKDITEQMHENRMGHRRDETLLSSVFSNGYIGDDSSFNKSEN--EFAVGN 439 Query: 519 LSDSEIVTPTSRHSWDYMNRYGXXXXXXXXXXXXXXXESSVIREAKKRLSERWALVASNG 698 LSDSEI++P SRHSWDY+NR+G ESSV REAKKRLSERWA++ASNG Sbjct: 440 LSDSEIMSPNSRHSWDYVNRFGSPYSSSSFSRASCSPESSVCREAKKRLSERWAMMASNG 499 Query: 699 VGQEQTHVRRSSSTLGEMLAISEVKKEENVKELDFTSNGSCGGGDLKASTPCLSIGIASD 878 QEQ + RRSSSTLGEMLA+S++KK E++ + + + ST CL+ + + Sbjct: 500 SSQEQKNARRSSSTLGEMLALSDIKKSAR-SEVETINKEQ----EPRGSTSCLTNNLNKE 554 Query: 879 A-ANGERSTMGLXXXXXXXXXXXTYENIELNAEVSDTPKSKPDTRTEEGKTKNGKLSFKG 1055 A+ +S + T L EVSD+ K + E K K+ K S +G Sbjct: 555 GLADSPKSLL----RSRSVPVSSTVYGAGLRVEVSDSEAGKTEVSQELRKAKSTKSSLRG 610 Query: 1056 KVSSLFFSRNKKSSREK------------SIP----------------SSLVASDA---- 1139 KVSSLFFSRNKK ++EK +IP +S+ A+D Sbjct: 611 KVSSLFFSRNKKPNKEKYGVSQSNDECQSAIPETPGSPIPPPGKIGDDASICANDGGLDY 670 Query: 1140 ----RLHPRNADIAVRDDISKPTVERSLSPADNPSKATVSLEKXXXXXXXXXXXXXXXXX 1307 LH ++ D I T + LS + +S+ K Sbjct: 671 CLSPGLHESSSKTTYPDLIGVATKQGLLS-----QEGVLSVPKPAMPGNMGGNQDQPSPI 725 Query: 1308 XVLEAKFEDDSNANVPP-------CSESSHVGHLQALSRSPPIESLARSLSWDDSCMNTS 1466 VLE F++D NA P C + + +SPPIES+AR+LSWDDSC+ T+ Sbjct: 726 SVLEPPFDEDDNAVPEPSGNFRLNCGGAEVPLKSNLIDKSPPIESIARTLSWDDSCVETA 785 Query: 1467 TINN--PSKLPTRASFKADXXXXXRIDFIRKLLSSTGLD-NKNSKTVFSRWHSLDSPLDQ 1637 T + PS + T + FIR LLS+ GLD N + + SRWHS +SPLD Sbjct: 786 TPYSLKPSSISTCPQDEEQDWPF----FIRTLLSAAGLDVNMHLDSFSSRWHSPESPLDP 841 Query: 1638 MLLDGFLDEKDEE 1676 L + +++ D+E Sbjct: 842 ALRNKYVNLNDKE 854 >ref|XP_002303096.1| predicted protein [Populus trichocarpa] gi|222844822|gb|EEE82369.1| predicted protein [Populus trichocarpa] Length = 935 Score = 290 bits (741), Expect = 1e-75 Identities = 206/585 (35%), Positives = 291/585 (49%), Gaps = 27/585 (4%) Frame = +3 Query: 3 
FYKHQNEVQTIPQPSQTRRITVLKPSKAIEEGGM-------NQARIQQCPSGEERIWDKN 161 F +H +++Q++P +T+ ITVL+PSK ++ ++ QQ +G+ W+ N Sbjct: 250 FSQHLHDMQSMPPSPETKHITVLRPSKVVDNERFAGSGKKSDKPTKQQAHTGQATGWESN 309 Query: 162 KHRRCSSLENLKVENL--SQPTRIVVLKPSPGKPCEFKALATNRISSPKSLDERDLSRNI 335 + N K+ +QPTRIVVLKPSPGK + KAL + S P+ L D Sbjct: 310 LGYS-PAFPNEKIVEYPPAQPTRIVVLKPSPGKIHDIKALVSPPSSPPRMLHGEDFYDEP 368 Query: 336 EGDEAIGSRDIAKEITRQMRESMXXXXXXXXXXXXXXXXGYVGDESSFNRSENYVEEAGG 515 E E R++AK ITR MRE++ GY GD+SSFN+S N + A Sbjct: 369 EDVEGQEPREVAKLITRNMRENLMGHRRDETLLSSVYSNGYTGDDSSFNKSVN--DYAVE 426 Query: 516 NLSDSEIVTPTSRHSWDYMNRYGXXXXXXXXXXXXXXXESSVIREAKKRLSERWALVASN 695 NLSD+EI++PTSRHSWDY+NR+ ESSV REAKKRLSERWA++ASN Sbjct: 427 NLSDTEIMSPTSRHSWDYINRFDSPYSTSSFSRASCSPESSVCREAKKRLSERWAMMASN 486 Query: 696 GVGQEQTHVRRSSSTLGEMLAISEVKK------EENVKELDFTSNGSCGGGDLKASTPCL 857 G EQ + RRSSSTLGEMLA+S+ KK E+++KEL + ST C+ Sbjct: 487 GRALEQKNARRSSSTLGEMLALSDTKKFMRAEEEDSIKEL-----------QPRGSTSCI 535 Query: 858 SIGIASDAANGERSTMGLXXXXXXXXXXXTYENIELNAEVSDTPKSKPDTRTEEGKTKNG 1037 + + + +G + T N EVS K + + + K+ Sbjct: 536 TSHLNKE--DGTADSPRTLLRSKSLPVSTTVHGARPNVEVSPPDAGKTEVPKDLTRAKSV 593 Query: 1038 KLSFKGKVSSLFFSRNKKSSREKSI--PSSLVASDARLHPRNADIAVRDDISKPTVE-RS 1208 K S KGKVSSLFFSRNKK S++KS+ S A + I + + +S + + Sbjct: 594 KSSLKGKVSSLFFSRNKKPSKDKSVACQSKDEFQSAIPETPSLPIPLTEKVSDGAAQCTN 653 Query: 1209 LSPADNPSKATVSLEKXXXXXXXXXXXXXXXXXXVLEAKFEDDSNA--------NVPPCS 1364 S +N S +S+ K VLE FE+D NA P C Sbjct: 654 NSGHENCSSHGLSVTKPVVPGNMNENQDQPSPISVLEPPFEEDDNAILEASGLIQKPDCR 713 Query: 1365 ESSHVGHLQALSRSPPIESLARSLSWDDSCMNTSTINNPSKLPTRASFKADXXXXXRIDF 1544 + +SPPIES+AR+L+WD+SC T++ P+ S A+ F Sbjct: 714 GIEVPLKSNLIGKSPPIESVARTLTWDNSCAETASSYPLKPTPSPVSLGAEEDEKYWFSF 773 Query: 1545 IRKLLSSTGLD-NKNSKTVFSRWHSLDSPLDQMLLDGFLDEKDEE 1676 ++ LL++ GLD + FSRWHS +SPLD L D + + D+E Sbjct: 774 VQALLTAAGLDCEVQLDSFFSRWHSPESPLDPSLRDKYANPNDKE 818 >ref|XP_003520621.1| PREDICTED: uncharacterized protein LOC100793360 [Glycine max] Length = 1025 Score = 289 bits (740), 
Expect = 1e-75 Identities = 222/602 (36%), Positives = 310/602 (51%), Gaps = 50/602 (8%) Frame = +3 Query: 21 EVQTIPQPSQTRRITVLKPSKAIE------EGGMNQARIQQCPSGEERIWDKNKHRRCSS 182 E+Q+ P ++T+RITVLKPSK ++ +G N +I++ P+ W+K + S Sbjct: 308 ELQSTPV-AETKRITVLKPSKMVDNENSGGKGKKNDKQIKK-PANVGAGWEK--YSPAYS 363 Query: 183 LENLKVENLS-QPTRIVVLKPSPGKPCEFKALATNRISSPKSLDERDLSRNIEGDE-AIG 356 + K++ + QPTRIVVLKPSPGK E KA+++ +SSP++L + + E D+ + Sbjct: 364 PASQKIDEFAVQPTRIVVLKPSPGKAHEIKAVSSPTMSSPRNLQSGNFYQEPEDDDDVLE 423 Query: 357 SRDIAKEITRQMRESMXXXXXXXXXXXXXXXXGYVGDESSFNRSENYVEEAGGNLSDSEI 536 SR + +IT+QM E++ GY GDESSFN+S++ E GN SD E+ Sbjct: 424 SRKVPSQITQQMHENLRSHQRDEILYSSVFSNGYTGDESSFNKSDH--EYTAGNFSDLEV 481 Query: 537 VTPTSRHSWDYMNRYGXXXXXXXXXXXXXXXESSVIREAKKRLSERWALVASNGVGQEQT 716 ++P+ RHSWDY+NR G ESSV REAKKRLSERWA++++ G QEQ Sbjct: 482 MSPSPRHSWDYINRSGSPFSSSSFSRASCSPESSVCREAKKRLSERWAMMSNKG-SQEQR 540 Query: 717 HVRRSSSTLGEMLAISEVKK------EENVKELDFTSNGSCGGGDLKASTPCLSIGIASD 878 H+RR SSTLGEMLA+S++KK E KE + + + SC + KA T C+ Sbjct: 541 HMRR-SSTLGEMLALSDIKKSVISELEGIHKEQEPSESVSC-SRNFKAET-CM------- 590 Query: 879 AANGERSTMGLXXXXXXXXXXXTYENIELNAEVSDTPKSKPDTRTEEGKTKNGKLSFKGK 1058 + S L YEN LN EV D K E K+K+ K SFKGK Sbjct: 591 ----DGSPRNLSRSKSVPTSSTVYEN-GLNVEVCDNDAGKAHGSGELTKSKSMKSSFKGK 645 Query: 1059 VSSLFFSRNKKSSREKSIPSSLV------ASDARLHPRNADIAVRDDISKPTVERSLSPA 1220 V+S FFSRNKK SREKS S V A + P N+ +RDD+S+ S+ Sbjct: 646 VTSFFFSRNKKPSREKSCLSQSVDESQSTAIETSDSPVNSSRVLRDDVSQSFDSGSIGEC 705 Query: 1221 DNPS----------------------KATVSLEKXXXXXXXXXXXXXXXXXXVLEAKFED 1334 P+ +A ++L K VLE FED Sbjct: 706 SLPAPYESSGKILSDSISNGQGAVPLEAGLTLSKSMVPGISSENQDQPSPISVLEPPFED 765 Query: 1335 DSNANVPP--CSESSHVGHLQAL-----SRSPPIESLARSLSWDDSCMNTSTINNPSKLP 1493 D NA V C +G +L +SPPIES+AR+LSWDDSC + S P Sbjct: 766 D-NAVVESLGCVRGGQLGSRVSLKSNLIDKSPPIESIARTLSWDDSCAEVA-----SPYP 819 Query: 1494 TRASFKADXXXXXRIDFIRKLLSSTGLDNK-NSKTVFSRWHSLDSPLDQMLLDGFLDEKD 1670 R S + + F++KLLS+ G+D++ + +SRWHSL+SPLD L D + + D Sbjct: 820 
LRPSSASLDTKQDWLVFVKKLLSAAGIDDQVQPGSFYSRWHSLESPLDPSLRDKYANLND 879 Query: 1671 EE 1676 +E Sbjct: 880 KE 881