BLASTX nr result
ID: Dioscorea21_contig00004124
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Dioscorea21_contig00004124 (2085 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002272725.1| PREDICTED: uncharacterized protein LOC100241... 491 e-136 ref|XP_002512618.1| conserved hypothetical protein [Ricinus comm... 475 e-131 ref|XP_003552431.1| PREDICTED: uncharacterized protein LOC100781... 472 e-130 ref|NP_001041736.1| Os01g0100500 [Oryza sativa Japonica Group] g... 470 e-130 ref|XP_002329006.1| predicted protein [Populus trichocarpa] gi|2... 468 e-129 >ref|XP_002272725.1| PREDICTED: uncharacterized protein LOC100241127 [Vitis vinifera] Length = 640 Score = 491 bits (1265), Expect = e-136 Identities = 249/489 (50%), Positives = 323/489 (66%), Gaps = 8/489 (1%) Frame = +1 Query: 58 KGCPANAFLYNGTLCACDPGRYFLN---GSCALF-NASAGTWGISSNVESTSTFLSTVLP 225 + CP+NAF YNGTLC+C+PG Y LN G+C+LF +A W ++S V + +F +T+ Sbjct: 5 RACPSNAFRYNGTLCSCNPG-YVLNATTGACSLFWETAADGWLVNSGVGYSISFPTTIFD 63 Query: 226 LDTIKRVTQSQXXXXXXXXXXXXXXXIFCVVIRFVKVKDGKSIWFRIRWWISRLDFFYAT 405 D IK+ TQSQ FC +RF + DG++IWFRIRWWISRLD +AT Sbjct: 64 FDKIKKFTQSQAMFLEATVVMLISWLFFCFFVRFGSLGDGRTIWFRIRWWISRLDISFAT 123 Query: 406 KHWMDDGMVVRKRKTELGGTFSVASWXXXXXXXXXXXXXXXXKRSIEVHRMRPANAPDLL 585 +HW++D VV+KRKTELGGTFS+ASW KRSIEVH ++ N PDL Sbjct: 124 RHWLEDQKVVKKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKATNGPDLA 183 Query: 586 EFVNDLEFNITTISSMSCSHLRGLETLVTGIPGFIDYRVFPLSTYLNYHCYNTSRGPTIS 765 FVND+EFNITTISSM+CS+LRGL TLVTG PGF+D+RV PLST ++Y C NTSRGPT+ Sbjct: 184 SFVNDMEFNITTISSMTCSNLRGLGTLVTGNPGFLDHRVLPLSTIVSYFCQNTSRGPTVI 243 Query: 766 
LKCNNCQVPRRDHFISWDFVDLPNEPATAVGFEFSLTAKDHNDDRHMSYVSGTLE--SSS 939 L+C+ CQ+ + + +ISW FVDLPN PA AVGF+FSL AK H +H+S V G L+ SS Sbjct: 244 LRCSKCQIIQDNLYISWHFVDLPNSPAAAVGFQFSLLAKSHASKKHVSSVRGLLKNGSSL 303 Query: 940 NGTLKTFRGPDLNILKIHLFPRAYNNLHNLKLIQPLFYDFVPGXXXXXXXXXXXXXXXXX 1119 + T TFRG D N+LK +LFPR Y N HNL+LIQPLF +F+ G Sbjct: 304 DDTPVTFRGVDANVLKFNLFPRIYRNFHNLRLIQPLFREFLRGPSFHEINKLRASLESPN 363 Query: 1120 XGIVNTTLYMSYLADYIVEIDKESVNGPVTFLADVGGLYSFSVVIFLYLLLQFEARFKKF 1299 G++N T+Y++ L+ YIVEID +++ GPV+FLAD+GG+Y S+ IF Y L+Q E R KK Sbjct: 364 DGLLNMTVYVNLLSSYIVEIDNQNIMGPVSFLADLGGIYCISIGIFFYFLVQCEYRIKKL 423 Query: 1300 RYEDSAMRNIKNRSRAQRNWDKLRKYVAYTWAPTSLDCSNMSSKGQSKST--VDSSHGIG 1473 R EDS MR I+NR +AQ +WDKLRKYV YTW +LD + ++K ++ T HG G Sbjct: 424 RNEDSIMRKIRNRRKAQEHWDKLRKYVMYTWGCKTLDDNYNNNKKEATCTDIFRPFHGNG 483 Query: 1474 PLHRTEQNS 1500 + Q S Sbjct: 484 SSRKQRQRS 492 >ref|XP_002512618.1| conserved hypothetical protein [Ricinus communis] gi|223548579|gb|EEF50070.1| conserved hypothetical protein [Ricinus communis] Length = 642 Score = 475 bits (1223), Expect = e-131 Identities = 257/544 (47%), Positives = 331/544 (60%), Gaps = 35/544 (6%) Frame = +1 Query: 64 CPANAFLYNGTLCACDPGRYFLN---GSCALFNASAGTWGISSNVESTSTFLSTVLPLDT 234 CP YN TLCAC P +FLN SC LF AS+ + + +T +F T+ D+ Sbjct: 4 CPIGFITYNTTLCAC-PAGWFLNRTTNSCTLFAASSDIQ-TDTGLANTLSFPETIFSFDS 61 Query: 235 IKRVTQSQXXXXXXXXXXXXXXXIFCVVIRFVKVKDGKSIWFRIRWWISRLDFFYATKHW 414 IK+ TQSQ +FC +RF+K DG +IWF+IRWW+SRLD +AT+HW Sbjct: 62 IKKFTQSQAVFLEATSVMLISWLLFCFFLRFMKFGDGSNIWFKIRWWLSRLDICFATRHW 121 Query: 415 MDDGMVVRKRKTELGGTFSVASWXXXXXXXXXXXXXXXXKRSIEVHRMRPANAPDLLEFV 594 +DD VV KRKTELGGTFS+ASW KRSIEVH ++ NAPDL F Sbjct: 122 LDDQQVVVKRKTELGGTFSIASWILFIGLFAALLYQIILKRSIEVHNVKAINAPDLASFA 181 Query: 595 NDLEFNITTISSMSCSHLRGLETLVTGIPGFIDYRVFPLSTYLNYHCYNTSRGPTISLKC 774 ND+EFNITT+SSMSCS+LR L TLVTG PGFID+RV L NY C+NTS GPT++ KC Sbjct: 182 NDIEFNITTVSSMSCSNLRDLGTLVTGSPGFIDHRVVNLIELANYTCWNTSTGPTLNFKC 241 Query: 775 
NNCQVPRRDHFISWDFVDLPNEPATAVGFEFSLTAKDHNDDRHMSYVSGTLE--SSSNGT 948 NC++ + +ISW FVDLPN PA+AVG+ F+LT ++H D RH S+VSGTL+ SS + Sbjct: 242 TNCRLNKDYMYISWTFVDLPNAPASAVGYLFNLTTRNHADKRHFSFVSGTLKNGSSFDDR 301 Query: 949 LKTFRGPDLNILKIHLFPRAYNNLHNLKLIQPLFYDFVPGXXXXXXXXXXXXXXXXXXGI 1128 TFRG D NILK +LFPR Y NLH+LKLIQPLF++F+PG G+ Sbjct: 302 PVTFRGRDPNILKFNLFPRIYRNLHDLKLIQPLFHEFIPGSFFHDTTQLKASLATSSGGL 361 Query: 1129 VNTTLYMSYLADYIVEIDKESVNGPVTFLADVGGLYSFSVVIFLYLLLQFEARFKKFRYE 1308 +NTT+Y+ YL+ Y+VE++ +++ GPV+FLAD+GGLY S+ IF Y L+Q E R KK R E Sbjct: 362 INTTIYVQYLSAYVVEVENQNIMGPVSFLADLGGLYCISICIFFYFLVQCEYRIKKLRNE 421 Query: 1309 DSAMRNIKNRSRAQRNWDKLRKYVAYTWAPTSLDCSNMSSKGQSK--------------- 1443 DS MRNI+NR +AQ++W+KLRKYV Y W ++LD +SK S Sbjct: 422 DSTMRNIRNRLKAQKHWEKLRKYVRYRWGCSTLDDDLKNSKQGSSCNCIMAPSVRGNGSI 481 Query: 1444 STVDSSHGIGPLHRTE------QNSVP---------EATEKDRSKLAGSTARYNGKHGNS 1578 S SS G RT+ + S+P T+ LAG T G + +S Sbjct: 482 SGSGSSWRRGSRTRTDSIRLNKRVSIPSDKNAIEGHSDTQVVEHVLAGCTLNTEGINSDS 541 Query: 1579 ADGH 1590 A GH Sbjct: 542 AQGH 545 >ref|XP_003552431.1| PREDICTED: uncharacterized protein LOC100781305 [Glycine max] Length = 617 Score = 472 bits (1214), Expect = e-130 Identities = 238/462 (51%), Positives = 308/462 (66%), Gaps = 4/462 (0%) Frame = +1 Query: 64 CPANAFLYNGTLCACDPGRYF--LNGSCALFNASAGTWGISSNVESTSTFLSTVLPLDTI 237 CPAN+F +N TLCAC PG + +C L+ ++ S +F T+ DTI Sbjct: 4 CPANSFNFNTTLCACTPGHVLQHVRNTCVLYEGNSTILTDSGVDYYALSFPETLFAFDTI 63 Query: 238 KRVTQSQXXXXXXXXXXXXXXXIFCVVIRFVKVKDGKSIWFRIRWWISRLDFFYATKHWM 417 K+ TQSQ +FCV +R K+ DG+++WF+IRWWISRLD +AT+HW+ Sbjct: 64 KKFTQSQAVFLEVTLVMLLSWLLFCVFLRCTKLGDGRNVWFKIRWWISRLDICFATRHWL 123 Query: 418 DDGMVVRKRKTELGGTFSVASWXXXXXXXXXXXXXXXXKRSIEVHRMRPANAPDLLEFVN 597 DD VV KRKTELGG FS+ASW KRSIEVH +R N P+L F+N Sbjct: 124 DDQKVVTKRKTELGGAFSMASWILFIGLFAVLLYQIISKRSIEVHNVRATNGPELASFIN 183 Query: 598 DLEFNITTISSMSCSHLRGLETLVTGIPGFIDYRVFPLSTYLNYHCYNTSRGPTISLKCN 777 D+EFNITT+SSMSC++LR L +VTG PGFID RV PLST NY CYN+S GPTI+LKC Sbjct: 184 
DMEFNITTVSSMSCANLRNLGNIVTGNPGFIDQRVVPLSTLANYSCYNSSMGPTIALKCK 243 Query: 778 NCQVPRRDHFISWDFVDLPNEPATAVGFEFSLTAKDHNDDRHMSYVSGTLESSS--NGTL 951 NC+V +ISW FVDLPN PA AVGFEF LT+ D +H+S+VSGTL++ S + + Sbjct: 244 NCKVIYDHMYISWQFVDLPNSPAIAVGFEFRLTSMD-IAKKHVSFVSGTLKNGSDFDDSP 302 Query: 952 KTFRGPDLNILKIHLFPRAYNNLHNLKLIQPLFYDFVPGXXXXXXXXXXXXXXXXXXGIV 1131 TFRG NI+K +LFPR Y+NLH LKLIQPLF++F+PG G+V Sbjct: 303 VTFRGNQSNIVKFNLFPRIYHNLHELKLIQPLFHEFIPGSVLRDTNELRASLESSTDGLV 362 Query: 1132 NTTLYMSYLADYIVEIDKESVNGPVTFLADVGGLYSFSVVIFLYLLLQFEARFKKFRYED 1311 NTTL++++L+DY+VEIDK+++ GPV+FLAD+GGLY S+ IF YLL+Q E R KK R ED Sbjct: 363 NTTLFINFLSDYVVEIDKQNILGPVSFLADLGGLYCISIGIFFYLLIQCEYRIKKLRNED 422 Query: 1312 SAMRNIKNRSRAQRNWDKLRKYVAYTWAPTSLDCSNMSSKGQ 1437 S MR I+NR +AQ +WDKLRKYV YT+ ++D +N +S G+ Sbjct: 423 SIMRRIRNRRKAQEHWDKLRKYVMYTYGCPTID-NNYNSTGK 463 >ref|NP_001041736.1| Os01g0100500 [Oryza sativa Japonica Group] gi|15128436|dbj|BAB62620.1| P0402A09.1 [Oryza sativa Japonica Group] gi|15408844|dbj|BAB64233.1| unknown protein [Oryza sativa Japonica Group] gi|88193759|dbj|BAE79749.1| unknown protein [Oryza sativa Japonica Group] gi|113531267|dbj|BAF03650.1| Os01g0100500 [Oryza sativa Japonica Group] gi|125524044|gb|EAY72158.1| hypothetical protein OsI_00006 [Oryza sativa Indica Group] gi|125568664|gb|EAZ10179.1| hypothetical protein OsJ_00005 [Oryza sativa Japonica Group] Length = 524 Score = 470 bits (1209), Expect = e-130 Identities = 245/475 (51%), Positives = 304/475 (64%), Gaps = 9/475 (1%) Frame = +1 Query: 70 ANAFLYNGTLCACDPGRYF---LNGSCALFNASAGTWGISSNVESTST---FLSTVLPLD 231 A F N TLCAC+PG Y +NG+C G W + S S + FL+ VL LD Sbjct: 6 ARMFASNATLCACEPGFYLSAAINGTC--LGLPDGGWQVGSVGASRNQSFYFLTPVLSLD 63 Query: 232 TIKRVTQSQXXXXXXXXXXXXXXXIFCVVIRFV-KVKDGKSIWFRIRWWISRLDFFYATK 408 ++R+TQSQ FC RF G FR R+W+SRLD Y T Sbjct: 64 VVRRLTQSQALLLEATIAALLSWLAFCAFARFTGHDPTGNKRLFRARFWVSRLDCIYDTT 123 Query: 409 HWMDDGMVVRKRKTELGGTFSVASWXXXXXXXXXXXXXXXXKRSIEVHRMRPANAPDLLE 588 HW DD V+RKRKTELGGT SVAS +R+IEVHR++PANAPDLL 
Sbjct: 124 HWADDQQVLRKRKTELGGTCSVASLILFVGLVTVLLYQAIQRRNIEVHRVKPANAPDLLS 183 Query: 589 FVNDLEFNITTISSMSCSHLRGLETLVTGIPGFIDYRVFPLSTYLNYHCYNTSRGPTISL 768 FVND+EF+ITTISSMSCS L T+ G PG +D+R+ PLST L Y+C NTS+GP++SL Sbjct: 184 FVNDIEFHITTISSMSCSQLVAPSTIAMGTPGSMDFRLLPLSTLLTYNCQNTSQGPSVSL 243 Query: 769 KCNNCQVPRRDHFISWDFVDLPNEPATAVGFEFSLTAKDHNDDRHMSYVSGTLESS--SN 942 KCN C++P RDH++SW F+DLP +PA AVGF+F+LTAK H DD+H+S VSGT+ S ++ Sbjct: 244 KCNGCRIPPRDHYVSWQFIDLPRQPAAAVGFQFNLTAKQHGDDKHVSSVSGTINSDNFTD 303 Query: 943 GTLKTFRGPDLNILKIHLFPRAYNNLHNLKLIQPLFYDFVPGXXXXXXXXXXXXXXXXXX 1122 LKTFRG D N+LKI LFP+ Y N HNLKL+QPL DF G Sbjct: 304 DKLKTFRGRDSNVLKIQLFPQTYINHHNLKLLQPLVQDFTQGSTFSNVRNLNASLQNPMD 363 Query: 1123 GIVNTTLYMSYLADYIVEIDKESVNGPVTFLADVGGLYSFSVVIFLYLLLQFEARFKKFR 1302 GI+NTTLY+SYL++YIVEI E+V GPV+ LA +GGLY+FSV IFL L+ Q EAR KK R Sbjct: 364 GIINTTLYISYLSNYIVEISNENVLGPVSILASIGGLYAFSVAIFLCLMAQCEARIKKLR 423 Query: 1303 YEDSAMRNIKNRSRAQRNWDKLRKYVAYTWAPTSLDCSNMSSKGQSKSTVDSSHG 1467 EDS M I + RAQ+NWDK+RK+V YTW P++LD S+ S K S +DS HG Sbjct: 424 DEDSRMLKILRKRRAQQNWDKVRKFVMYTWGPSNLDPSDRSGKWPESSVMDSLHG 478 >ref|XP_002329006.1| predicted protein [Populus trichocarpa] gi|222839240|gb|EEE77591.1| predicted protein [Populus trichocarpa] Length = 614 Score = 468 bits (1204), Expect = e-129 Identities = 243/486 (50%), Positives = 317/486 (65%), Gaps = 13/486 (2%) Frame = +1 Query: 64 CPANAFLYNGTLCACDPGRYFLN---GSCALFNASAGTW---GISSNVESTSTFLSTVLP 225 CP+ + +YN + CAC G+ FLN SC+ F + + G+ + +F T+ Sbjct: 8 CPSKSIIYNTSRCACPTGQ-FLNITTNSCSYFWGRSAIYTDTGVEVDNSFGFSFPETIFS 66 Query: 226 LDTIKRVTQSQXXXXXXXXXXXXXXXIFCVVIRFVKVKDG--KSIWFRIRWWISRLDFFY 399 D+IK+ TQSQ +FC +RF K++ +IWFRIRWWISRLD + Sbjct: 67 FDSIKKFTQSQAVFLEATLVMVASWLLFCFFLRFTKLEHHGTHNIWFRIRWWISRLDICF 126 Query: 400 ATKHWMDDGMVVRKRKTELGGTFSVASWXXXXXXXXXXXXXXXXKRSIEVHRMRPANAPD 579 AT+HW+DD VV KRKTELGG FS+ASW KR+IEVH +R NAPD Sbjct: 127 ATRHWLDDRKVVVKRKTELGGAFSIASWILFIGLFATLLYQIISKRTIEVHNVRATNAPD 186 Query: 580 
LLEFVNDLEFNITTISSMSCSHLRGLETLVTGIPGFIDYRVFPLSTYLNYHCYNTSRGPT 759 L FVND+EFNITT+SSMSCS+L+GL TLVTG PGFID+RV PL+ ++NY C NTS GPT Sbjct: 187 LAAFVNDMEFNITTVSSMSCSNLQGLGTLVTGNPGFIDHRVAPLTDFVNYTCRNTSMGPT 246 Query: 760 ISLKCNNCQVPRRDHFISWDFVDLPNEPATAVGFEFSLTAKDHNDDRHMSYVSGTLESSS 939 ++ KC+ C + R +ISW FVDLP+ PATAVGF+F+L+AKDH D +H+S+VSGTL+S S Sbjct: 247 LTFKCSKCHLNRDYMYISWQFVDLPSTPATAVGFQFNLSAKDHADKKHVSFVSGTLKSGS 306 Query: 940 --NGTLKTFRGPDLNILKIHLFPRAYNNLHNLKLIQPLFYDFVPGXXXXXXXXXXXXXXX 1113 + TFRG D NILK +LFPR Y+NLH+L+LIQPLF++F+PG Sbjct: 307 TFDDRPVTFRGKDSNILKFNLFPRIYHNLHDLRLIQPLFHEFLPGSFFGETSQLQASLET 366 Query: 1114 XXXGIVNTTLYMSYLADYIVEIDKESVNGPVTFLADVGGLYSFSVVIFLYLLLQFEARFK 1293 G++NTTL +SYL+ YIVEI+ +++ GPV+FLAD+GGLY S+ IF Y L+Q E R K Sbjct: 367 SSDGLINTTLSISYLSSYIVEIESQNIMGPVSFLADLGGLYCISIGIFFYFLVQCEYRVK 426 Query: 1294 KFRYEDSAMRNIKNRSRAQRNWDKLRKYVAYTWAPTSLDCSNMSSKGQSKST---VDSSH 1464 + R ED MR I+NR +A+ +WDKLRKYV YTW +LD S+K S T + S Sbjct: 427 RLRNEDVTMRQIRNRLKAREHWDKLRKYVMYTWGCKTLDNDYESTKQGSSCTGFMIPSIR 486 Query: 1465 GIGPLH 1482 G G LH Sbjct: 487 GNGSLH 492
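The per-hit statistics in the report above (bit score, E-value, identities) can be pulled out with a small script. What follows is only a minimal sketch, assuming the report is available as a single flattened text string in the legacy BLAST 2.2.x layout shown here. The names `HIT_STATS`, `parse_evalue`, `summarize_hits` and `sample` are hypothetical helpers introduced for illustration and are not part of BLAST. The `sample` string is a shortened excerpt of two hit headers from this report.

```python
import re

# Hypothetical pattern for a hit header followed by its statistics,
# written against the legacy BLAST text layout seen in this report.
HIT_STATS = re.compile(
    r">(?P<hit>\S+).*?"
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>\S+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<alen>\d+)",
    re.DOTALL,
)


def parse_evalue(text):
    """Convert a BLAST E-value string to float.

    Legacy BLAST prints 1e-136 as "e-136", which float() rejects,
    so a leading "1" is added when the string starts with "e".
    """
    text = text.rstrip(",")
    return float("1" + text if text.startswith("e") else text)


def summarize_hits(report):
    """Return one dict per hit with score, E-value and percent identity."""
    hits = []
    for m in HIT_STATS.finditer(report):
        ident, alen = int(m["ident"]), int(m["alen"])
        hits.append({
            "hit": m["hit"],
            "bits": float(m["bits"]),
            "raw_score": int(m["raw"]),
            "evalue": parse_evalue(m["evalue"]),
            "identical": ident,
            "aligned_length": alen,
            # Unrounded ratio; BLAST itself prints a truncated integer percent.
            "pct_identity": round(100.0 * ident / alen, 1),
        })
    return hits


# Shortened excerpt of two hit headers from the report above.
sample = (
    ">ref|XP_002272725.1| PREDICTED: uncharacterized protein LOC100241127 "
    "[Vitis vinifera] Length = 640 Score = 491 bits (1265), Expect = e-136 "
    "Identities = 249/489 (50%), Positives = 323/489 (66%), Gaps = 8/489 (1%) "
    "Frame = +1 "
    ">ref|NP_001041736.1| Os01g0100500 [Oryza sativa Japonica Group] "
    "Length = 524 Score = 470 bits (1209), Expect = e-130 "
    "Identities = 245/475 (51%), Positives = 304/475 (64%), Gaps = 9/475 (1%) "
    "Frame = +1"
)

if __name__ == "__main__":
    for hit in summarize_hits(sample):
        print(hit["hit"], hit["bits"], hit["evalue"], hit["pct_identity"])
```

The regular expression is tied to this particular text layout. For other BLAST versions or output formats a dedicated parser is likely the safer choice.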