BLASTX nr result
ID: Dioscorea21_contig00025842
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00025842
         (1585 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

emb|CBI35005.3| unnamed protein product [Vitis vinifera]              214   7e-53
ref|XP_002276178.2| PREDICTED: uncharacterized protein LOC100256...   211   3e-52
ref|XP_002521170.1| conserved hypothetical protein [Ricinus comm...   180   1e-42
ref|XP_003529901.1| PREDICTED: uncharacterized protein LOC100800...   178   3e-42
ref|XP_003548428.1| PREDICTED: uncharacterized protein LOC100803...   172   3e-40

>emb|CBI35005.3| unnamed protein product [Vitis vinifera]
          Length = 937

 Score = 214 bits (544), Expect = 7e-53
 Identities = 172/454 (37%), Positives = 234/454 (51%), Gaps = 24/454 (5%)
 Frame = -1

Query: 1393 GMALYLTQEVISNAFADLNDHPV---CGLPSSNTNSAKVVSEVSGQSTSRKRKHGHRAP- 1226
            G+A++L +EVI+NAFADLN P+ G SS+ NS K + Q+ RKRKH A
Sbjct: 442  GIAVHLAEEVINNAFADLN--PIDQGTGDVSSSANS-KASTGALLQTRHRKRKHATTATG 498

Query: 1225 --MERPNGVDLEAEAASRKLAAPHAVKIAALRALEALLTVGGSLRSEFWRSDVDLLLITV 1052
            E+ + V+ E E + VKIAAL ALEALLTVGG+LRSE WR VDLLLIT+
Sbjct: 499  SSEEQLDRVNFEKEVP-KGYTTFIPVKIAALEALEALLTVGGALRSEHWRLKVDLLLITI 557

Query: 1051 ATNACDAGWASECK-LTLPTDEPASSRTDFQXXXXXXXXXXXXXXAHVRPPYLSQGLELF 875
            ATNAC GWA + + ++LP+D S++ DFQ A VRPPYL+QGLELF
Sbjct: 558  ATNACKGGWADDERVISLPSDA-TSTQADFQLAALRALLASLLSPARVRPPYLAQGLELF 616

Query: 874  RRGKQETGTELAVFCTHALLALEVLIHPRALPLVDFP-VAKTSINDGVNSIHQKSTSLSN 698
            RRGKQETGT LA FCTHALLALEVLIHPRALPL DFP V + S ++G N + +S
Sbjct: 617  RRGKQETGTRLAEFCTHALLALEVLIHPRALPLEDFPTVNRKSFDNGANHKYPESMYSGG 676

Query: 697  QKLNMPPFLRCNIEAISDL--EDDELYSSWLRSEEEAPVGDDQLGADIMDTEQLVKHPIV 524
            Q LN PF R + + D +LY WL S++E + P+
Sbjct: 677  QDLN-TPFSRGPLGMALGVPNPDYDLYDKWLGSDDEIDI------------------PVT 717

Query: 523  DSERNSESVDRAPVPPHTIDVE-IREVISTNTSKSQEPSHNNSADHAWD------DASCL 365
            D +N +VD A E + V ++ K + + SA D + +
Sbjct: 718  DPSKNRNNVDDASEAFRDHQTEKLPSVDGASSPKVAKKIDHRSAATGADMREGGTEEEIM 777

Query: 364  VPAAALPNTSELEAGNRDAPISNTANLMKDVESTPTGSG-------PVLNRNEMVTSAPS 206
            V + P + E A IS + + ++ + SG + N+++ +
Sbjct: 778  VESHQFPESISQEESTFPAVISASTSTKIEIGKVASDSGALDPGDSEIATGNDVLVAKGD 837

Query: 205  NVTNHLEDGSSAPEKMSISSKGKGSVPLYNLDSE 104
            + E+ S+A +S S + KG V LD+E
Sbjct: 838  SFAIQGENASTA---VSNSERSKGLVS--ELDNE 866

>ref|XP_002276178.2| PREDICTED: uncharacterized protein LOC100256091 [Vitis vinifera]
          Length = 911

 Score = 211 bits (538), Expect = 3e-52
 Identities = 171/453 (37%), Positives = 233/453 (51%), Gaps = 24/453 (5%)
 Frame = -1

Query: 1390 MALYLTQEVISNAFADLNDHPV---CGLPSSNTNSAKVVSEVSGQSTSRKRKHGHRAP-- 1226
            +A++L +EVI+NAFADLN P+ G SS+ NS K + Q+ RKRKH A
Sbjct: 469  IAVHLAEEVINNAFADLN--PIDQGTGDVSSSANS-KASTGALLQTRHRKRKHATTATGS 525

Query: 1225 -MERPNGVDLEAEAASRKLAAPHAVKIAALRALEALLTVGGSLRSEFWRSDVDLLLITVA 1049
            E+ + V+ E E + VKIAAL ALEALLTVGG+LRSE WR VDLLLIT+A
Sbjct: 526  SEEQLDRVNFEKEVP-KGYTTFIPVKIAALEALEALLTVGGALRSEHWRLKVDLLLITIA 584

Query: 1048 TNACDAGWASECK-LTLPTDEPASSRTDFQXXXXXXXXXXXXXXAHVRPPYLSQGLELFR 872
            TNAC GWA + + ++LP+D S++ DFQ A VRPPYL+QGLELFR
Sbjct: 585  TNACKGGWADDERVISLPSDA-TSTQADFQLAALRALLASLLSPARVRPPYLAQGLELFR 643

Query: 871  RGKQETGTELAVFCTHALLALEVLIHPRALPLVDFP-VAKTSINDGVNSIHQKSTSLSNQ 695
            RGKQETGT LA FCTHALLALEVLIHPRALPL DFP V + S ++G N + +S Q
Sbjct: 644  RGKQETGTRLAEFCTHALLALEVLIHPRALPLEDFPTVNRKSFDNGANHKYPESMYSGGQ 703

Query: 694  KLNMPPFLRCNIEAISDL--EDDELYSSWLRSEEEAPVGDDQLGADIMDTEQLVKHPIVD 521
            LN PF R + + D +LY WL S++E + P+ D
Sbjct: 704  DLN-TPFSRGPLGMALGVPNPDYDLYDKWLGSDDEIDI------------------PVTD 744

Query: 520  SERNSESVDRAPVPPHTIDVE-IREVISTNTSKSQEPSHNNSADHAWD------DASCLV 362
            +N +VD A E + V ++ K + + SA D + +V
Sbjct: 745  PSKNRNNVDDASEAFRDHQTEKLPSVDGASSPKVAKKIDHRSAATGADMREGGTEEEIMV 804

Query: 361  PAAALPNTSELEAGNRDAPISNTANLMKDVESTPTGSG-------PVLNRNEMVTSAPSN 203
            + P + E A IS + + ++ + SG + N+++ + +
Sbjct: 805  ESHQFPESISQEESTFPAVISASTSTKIEIGKVASDSGALDPGDSEIATGNDVLVAKGDS 864

Query: 202  VTNHLEDGSSAPEKMSISSKGKGSVPLYNLDSE 104
            E+ S+A +S S + KG V LD+E
Sbjct: 865  FAIQGENASTA---VSNSERSKGLVS--ELDNE 892

>ref|XP_002521170.1| conserved hypothetical protein [Ricinus communis]
 gi|223539617|gb|EEF41201.1| conserved hypothetical protein [Ricinus communis]
          Length = 863

 Score = 180 bits (456), Expect = 1e-42
 Identities = 116/274 (42%), Positives = 155/274 (56%), Gaps = 3/274 (1%)
 Frame = -1

Query: 1393 GMALYLTQEVISNAFADLNDHPVCGLPSSNTNSAKVVSEVSGQSTSRKRKHGHRAPMERP 1214
            G+A+YL QEV++N+ DL+ C S+ + K Q +RKRKHG A +
Sbjct: 443  GIAIYLAQEVVNNSLLDLDPSVGCIFSSAYS---KASFGALLQPCNRKRKHG--ASEQNY 497

Query: 1213 NGVDLEAEAASRKLAAPHAVKIAALRALEALLTVGGSLRSEFWRSDVDLLLITVATNACD 1034
            + + LE EA A+ +VKIAAL AL LLTVGG+L+SE WRS V+ LLIT+A ++C
Sbjct: 498  DQLSLEMEAPKSCPASTISVKIAALEALRTLLTVGGALKSESWRSKVEKLLITLAADSCK 557

Query: 1033 AGWASECKLTLPTDEPASSRTDFQXXXXXXXXXXXXXXAHVRPPYLSQGLELFRRGKQET 854
            GW+SE + + AS+ D Q + VRPP+L+Q LELF RGKQET
Sbjct: 558  GGWSSEERTAFLPNGVASTYADLQLAVLRALLASLLSPSRVRPPHLAQSLELFHRGKQET 617

Query: 853  GTELAVFCTHALLALEVLIHPRALPLVDFPVAKTSINDGVNSIHQKSTSLSNQKLNMPPF 674
            GTE++ FC++AL ALEVLIHPRALPL D P A +S +N ++ QK N P
Sbjct: 618  GTEISEFCSYALSALEVLIHPRALPLADLPSANSS--HEINYGFPETLYSGGQKHNTP-- 673

Query: 673  LRCNIEAI---SDLEDDELYSSWLRSEEEAPVGD 581
            + + I S DD+L SWL +E D
Sbjct: 674  ISSGMRGIGHGSPDSDDDLCDSWLDGNKETDTPD 707

>ref|XP_003529901.1| PREDICTED: uncharacterized protein LOC100800871 [Glycine max]
          Length = 883

 Score = 178 bits (452), Expect = 3e-42
 Identities = 124/343 (36%), Positives = 175/343 (51%), Gaps = 13/343 (3%)
 Frame = -1

Query: 1393 GMALYLTQEVISNAFADLN--DHPVCGLPSSNTNSAKVVSEVSGQSTSRKRKHGHRAPME 1220
            GMALYL QEVI+NAFADL+ +H G+ + + ++A + + RKRKH
Sbjct: 442  GMALYLAQEVINNAFADLSIIEHKNSGILNGSNSNASAGALLL--PIHRKRKHSSTTGSL 499

Query: 1219 RPNGVD-LEAEAASRKLAAPHAVKIAALRALEALLTVGGSLRSEFWRSDVDLLLITVATN 1043
            + +G L E + P +++IAAL LE+L+TV G+L+SE WRS VD LL+ A +
Sbjct: 500  QEHGEGGLSVEVPKNRPLTPVSLRIAALETLESLITVAGALKSEPWRSKVDSLLLVTAMD 559

Query: 1042 ACDAGWASECKLTLPTDEPASSRTDFQXXXXXXXXXXXXXXAHVRPPYLSQGLELFRRGK 863
            + G SE + EPA++ T+ Q A VRPPYL+QGLELFRRG+
Sbjct: 560  SFKEGSVSEERSVFQQKEPAATTTELQLAALRALLVSLLSFARVRPPYLAQGLELFRRGR 619

Query: 862  QETGTELAVFCTHALLALEVLIHPRALPLVDFPVAKTSINDGVNSIHQKSTSLSNQKLNM 683
            Q+TGT+LA FC HALL LEVLIHPRALP+VD+ A S S + ++L +
Sbjct: 620  QQTGTKLAEFCAHALLTLEVLIHPRALPMVDYAYANNS------SFGEAHSNLQHGYFGW 673

Query: 682  PPFLRCNIEAISDLEDDELYSSWLRSEEEAPVGDDQ---------LGADIMDTEQLVKHP 530
            + + DD+L + WL ++ E D+ D E L H
Sbjct: 674  SHNTPYGLPQVPPDYDDDLCARWLENDNEVGESLDKNTKYTQEPSEACRASDPEVLFVH- 732

Query: 529  IVDSERN-SESVDRAPVPPHTIDVEIREVISTNTSKSQEPSHN 404
            V S+ N E ++ DVE++ V KS +P +
Sbjct: 733  -VSSDTNIQERIEMVSETATCADVEMKTVEDETNFKSDQPGES 774

>ref|XP_003548428.1| PREDICTED: uncharacterized protein LOC100803198 [Glycine max]
          Length = 934

 Score = 172 bits (435), Expect = 3e-40
 Identities = 111/286 (38%), Positives = 158/286 (55%), Gaps = 3/286 (1%)
 Frame = -1

Query: 1393 GMALYLTQEVISNAFADLN--DHPVCGLPSSNTNSAKVVSEVSGQSTSRKRKHGHRAPME 1220
            G+ALYL QEVI+NAFADL+ +H G+ + + ++A + + + RKRKH
Sbjct: 491  GLALYLAQEVINNAFADLSSIEHKNGGILNGSYSNASAGTLLP--PSHRKRKHSSTTGSL 548

Query: 1219 RPNGVD-LEAEAASRKLAAPHAVKIAALRALEALLTVGGSLRSEFWRSDVDLLLITVATN 1043
            + +G L E + P +++IAAL LE+L+TV G+L+SE WRS VD LLI A +
Sbjct: 549  QEHGEGGLSVEVPKNRPLIPMSLRIAALETLESLITVAGALKSEPWRSKVDSLLIVTAMD 608

Query: 1042 ACDAGWASECKLTLPTDEPASSRTDFQXXXXXXXXXXXXXXAHVRPPYLSQGLELFRRGK 863
            + G E + EPA++ TD Q A VRPPYL+QGLELFR+G+
Sbjct: 609  SFKEGSVGEERSVFQQKEPAATTTDLQLAALRALLVSFLSFARVRPPYLAQGLELFRKGR 668

Query: 862  QETGTELAVFCTHALLALEVLIHPRALPLVDFPVAKTSINDGVNSIHQKSTSLSNQKLNM 683
            Q+TGT+LA FC HALL LEVLIHPRALP+VD+ A S S + ++L ++
Sbjct: 669  QQTGTKLAEFCAHALLTLEVLIHPRALPMVDYAYANNS------SFGEAHSNLQHEYFGW 722

Query: 682  PPFLRCNIEAISDLEDDELYSSWLRSEEEAPVGDDQLGADIMDTEQ 545
            + DD+L + WL + EA D+ L + T++
Sbjct: 723  SNSTPYGLPQDPPDYDDDLCARWLENGNEA---DESLDKNTKYTQE 765