BLASTX nr result
ID: Dioscorea21_contig00022946
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00022946
         (1024 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002285773.2| PREDICTED: uncharacterized protein LOC100267...   243   5e-62
ref|XP_002297610.1| predicted protein [Populus trichocarpa] gi|2...   201   2e-49
ref|XP_002523387.1| conserved hypothetical protein [Ricinus comm...   196   8e-48
tpg|DAA54020.1| TPA: hypothetical protein ZEAMMB73_527273 [Zea m...   147   4e-33
emb|CAA18228.1| putative protein [Arabidopsis thaliana] gi|72694...   146   7e-33

>ref|XP_002285773.2| PREDICTED: uncharacterized protein LOC100267326 [Vitis vinifera]
          Length = 557

 Score = 243 bits (620), Expect = 5e-62
 Identities = 144/289 (49%), Positives = 182/289 (62%), Gaps = 22/289 (7%)
 Frame = -2

Query: 882 SLHISSHNSSMKDLSLFLLKN-------SLASKMKRGIRSFCNGVGSTSTLDQKKN--DC 730
           S HI S + DL + + S ++ M+RGIRSFCNG STSTL+Q K D
Sbjct: 28  SSHIQSVTAQSLDLPIVTTSSLSIYFFFSSSNMMRRGIRSFCNGDASTSTLNQHKTTPDH 87

Query: 729 NVSCIANDSYVDDNPLTLEEMILQLDLEEEAARRAKIDDYSELNRRMSCVNNSDILRSAR 550
           S + + + + + P TLEEMILQL+LEEE AR+AK+ +Y E+ RRMSCVNNSDILRSAR
Sbjct: 88  GDSSLISSTTLVEIPPTLEEMILQLELEEEIARKAKLQEYGEMQRRMSCVNNSDILRSAR 147

Query: 549 NAAMNQYPRFSLDGRDAMYRSSFRNYGCVKPGRRSVCCSSTGGGFLNNDYKVNFDGNI-- 376
           N A+NQYPRFSLDG+DAMYRSSFRN + PGR+S+CC+ G Y FD +
Sbjct: 148 N-ALNQYPRFSLDGKDAMYRSSFRN---LAPGRKSICCNR--GLVRGRCYTDEFDSKLEK 201

Query: 375 ----AYPPTVAGESVVWCKPGVVAKLMGLDAXXXXXXXXXXPGRSRVKAGALNSRKENLR 208
           P T+AGESV+WCKPGVVAKLMGL+ RS K ++ +R+ R
Sbjct: 202 KTSSCLPSTLAGESVIWCKPGVVAKLMGLEV----MPVPVSCNRSTEKLNSIVNRQNLRR 257

Query: 207 RMGRHELEKERLLMNMNGCKG-------SSNYKTGRYCVMKPINVEPMN 82
           R RHE+E+ R +M+MNGC +S KTGRYCVM+P+ VEP N
Sbjct: 258 RAQRHEMERRRFVMDMNGCGATQRQGTMASCSKTGRYCVMRPLAVEPAN 306

>ref|XP_002297610.1| predicted protein [Populus trichocarpa]
 gi|222844868|gb|EEE82415.1| predicted protein [Populus trichocarpa]
          Length = 272

 Score = 201 bits (512), Expect = 2e-49
 Identities = 135/283 (47%), Positives = 166/283 (58%), Gaps = 31/283 (10%)
 Frame = -2

Query: 804 MKRGIRSFCNGVGSTSTLDQKKNDCNVS-----CIANDSYVDDNPL--------TLEEMI 664
           MKRGIR+FCNG STSTLDQ N N + C Y N TLE+MI
Sbjct: 1   MKRGIRNFCNGDASTSTLDQH-NKANYTADDHHCFVTSPYTHMNHADTAQQGSPTLEQMI 59

Query: 663 LQLDLEEEAARRAKIDDYSELNRR---MSCVNNSDILRSARNAAMNQYPRFSLDGRDAMY 493
           LQL+LEEE AR+AK+++Y ++ R MSCVNNSDILRSARNA ++QYPRFSLDG+DAMY
Sbjct: 60  LQLELEEEFARKAKLNNYVDVGLRAGRMSCVNNSDILRSARNA-LSQYPRFSLDGKDAMY 118

Query: 492 RSSFRNYGCVKP---GRRSVCCSSTGGGFLN-NDYKVNFDGNIAYPPTVAGESVVWCKPG 325
           RSSFRN V GR+SVCC +N N+ F+ ++ PPT+AGE VVWCKPG
Sbjct: 119 RSSFRNLDSVSKAAAGRKSVCCDHGLRERMNRNNLGAKFERKLSLPPTLAGERVVWCKPG 178

Query: 324 VVAKLMGLDAXXXXXXXXXXPGRSRVKAGALNSRKENLRRMG-RHELEKERLLMNMNGCK 148
           VVAKLMGL+A R + A +++NLRR RHE+E+ RL +++
Sbjct: 179 VVAKLMGLEA-----MPVPINSREDKETLASIIKRQNLRRRAERHEIER-RLAGDVSAFD 232

Query: 147 GSSNYKTGR----------YCVMKPINVEPMNGPLNWNLRHAR 49
           G K GR YCV KP+ VEP N W R R
Sbjct: 233 G---IKRGRSSMPSCSKPGYCVTKPVAVEPANDGGGWPTRRNR 272

>ref|XP_002523387.1| conserved hypothetical protein [Ricinus communis]
 gi|223537337|gb|EEF38966.1| conserved hypothetical protein [Ricinus communis]
          Length = 241

 Score = 196 bits (498), Expect = 8e-48
 Identities = 121/239 (50%), Positives = 151/239 (63%), Gaps = 15/239 (6%)
 Frame = -2

Query: 852 MKDLSLFLLKNSLASKMKRGIRSFCNGVGSTSTLDQKK-NDCNVSCIANDSYVDDNPL-- 682
           MKDLS F LKNS KMK+GIR+FCNG GSTSTL+Q CN + +VDD+ +
Sbjct: 1   MKDLSFFFLKNSFGGKMKKGIRNFCNGDGSTSTLNQHHLKPCN-----DPIHVDDDDIAS 55

Query: 681 --------TLEEMILQLDLEEEAARRAKIDDYSELN-RRMSCVNNSDILRSARNAAMNQY 529
           TLEEMILQL+LEEE +R++K+++ + RRMSCVNNSDILRSARNA +NQY
Sbjct: 56  VDSQRKQPTLEEMILQLELEEEISRKSKLNELVAMRGRRMSCVNNSDILRSARNA-LNQY 114

Query: 528 PRFSLDGRDAMYRSSFRNYGCVK-PGRRSVCCSSTGGGFLNNDYKVNF--DGNIAYPPTV 358
           PRFSLDG+DAMYRSSFRN + GR+SVCC G G L + F N P ++
Sbjct: 115 PRFSLDGKDAMYRSSFRNLDHHQVAGRKSVCCCD-GRGVLMRERNDGFLDRRNSCLPTSL 173

Query: 357 AGESVVWCKPGVVAKLMGLDAXXXXXXXXXXPGRSRVKAGALNSRKENLRRMGRHELEK 181
           GE+VVWCKPGV+ KLMGLDA + + R+ RR+ RHE+E+
Sbjct: 174 RGENVVWCKPGVIGKLMGLDAMPVPVH------NRKETISPIIKRQSLRRRVERHEMER 226

>tpg|DAA54020.1| TPA: hypothetical protein ZEAMMB73_527273 [Zea mays]
          Length = 317

 Score = 147 bits (371), Expect = 4e-33
 Identities = 110/281 (39%), Positives = 144/281 (51%), Gaps = 65/281 (23%)
 Frame = -2

Query: 801 KRGIRSFCNGVGSTSTLDQ------KKNDCNVSCIANDSYVDDNP--------------- 685
           +RG+ SFC+GV STST+ Q + A+ S+V P
Sbjct: 3   RRGLPSFCHGVASTSTVQQLHGKELAAGSAAGADAASSSFVAVPPSVVGSCVAETEVSGT 62

Query: 684 -------LTLEEMILQLDLEEEAARRAK---------IDDYSELNRRMSCVNNSD---IL 562
           +TLE+MILQLDLEEEAAR+A+ ++ RRMSCV+ +L
Sbjct: 63  GGDGGSAVTLEQMILQLDLEEEAARKARRAATGEGTSAEEQGWCPRRMSCVDGGPADHVL 122

Query: 561 RSARNAAMNQYPRFSLDGRDAMYRSSFRNY---------GCVKPGRRSVCCSSTGG---- 421
           RSAR+A + QYPRFSLDGRDAMYR+SF + G +P R SVCC++ G
Sbjct: 123 RSARDA-LTQYPRFSLDGRDAMYRASFSGFYQGMGRDGDGANRPARASVCCAAGAGCAAL 181

Query: 420 GFLNNDYKVNFDGNIAYPPTVAGESVVWCKPGVVAKLMGLDAXXXXXXXXXXPGRSRVKA 241
           Y+++ + + P TVAGESVVWCKPGVVAKLMGL+A G R KA
Sbjct: 182 ACSVGGYEMDLERTLRLPATVAGESVVWCKPGVVAKLMGLEA----VPVPLRGGLRRRKA 237

Query: 240 GAL----------NSRKENLRRMGRHE--LEKERLLMNMNG 154
           G RK+ RR G+ E L +E+L M ++G
Sbjct: 238 GGHPVAACGGVGGGVRKQKPRRTGQDELALHREKLFMALHG 278

>emb|CAA18228.1| putative protein [Arabidopsis thaliana]
 gi|7269494|emb|CAB79497.1| putative protein [Arabidopsis thaliana]
          Length = 619

 Score = 146 bits (369), Expect = 7e-33
 Identities = 117/283 (41%), Positives = 157/283 (55%), Gaps = 34/283 (12%)
 Frame = -2

Query: 846 DLSLFLLKNSLASKMKRGIRSFCNGVGSTSTLDQ-KKNDCNVSCIA-NDSYVDDNPLTLE 673
           DL+ L K + G S C G GST TL+Q +KND S N + +P TLE
Sbjct: 341 DLTHKLFKRMRGRGPRSGFASSCGGDGSTLTLNQHQKNDVGPSVTPENTPFGGGSPRTLE 400

Query: 672 EMILQLDLEEEAARRAKI-----------DDYSELN--------RRMSCVNNSDILRSAR 550
           EMILQL++EE+ RRA++ DD+++++ RMSCVN+SDILRSAR
Sbjct: 401 EMILQLEVEEDIVRRARLRESYYGTYDNCDDHNDVDDDKLYHQPARMSCVNSSDILRSAR 460

Query: 549 NAAMNQYPRFSLDGRDAMYRSSFRNY------GCVKPGRRSVC---CSSTGGGFLNNDYK 397
           NA +NQYPRFSLDG+DAMYRSSFR + ++ GRRS C +S ++ + K
Sbjct: 461 NA-LNQYPRFSLDGKDAMYRSSFRRHLGTSADMTIQGGRRSHCGDQRTSKRSSQMSLETK 519

Query: 396 VNFDGNIAYPPTVAGESVVWCKPGVVAKLMGLDAXXXXXXXXXXPGRS-RVKAGALNSRK 220
           P TVAGESVVWCK GVVAKLMGL+ G+S + K G L ++
Sbjct: 520 -------RLPRTVAGESVVWCKTGVVAKLMGLE-----MIPVPDKGKSGKDKLGTL-LKR 566

Query: 219 ENLRRMGRHELEKERLLMNMNGCKG---SSNYKTGRYCVMKPI 100
           E LRR +ER L ++NG G ++ +G + + +PI
Sbjct: 567 ERLRR-------RERTL-DVNGRTGPTTEASCSSGGFNITRPI 601