BLASTX nr result
ID: Dioscorea21_contig00018600
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Dioscorea21_contig00018600 (1426 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EEC82605.1| hypothetical protein OsI_27177 [Oryza sativa Indi... 387 e-105 gb|EEE67737.1| hypothetical protein OsJ_25428 [Oryza sativa Japo... 367 5e-99 dbj|BAC15471.1| hypothetical protein [Oryza sativa Japonica Grou... 362 2e-97 ref|XP_003534756.1| PREDICTED: uncharacterized protein LOC100781... 341 3e-91 ref|XP_002519384.1| conserved hypothetical protein [Ricinus comm... 330 4e-88 >gb|EEC82605.1| hypothetical protein OsI_27177 [Oryza sativa Indica Group] Length = 463 Score = 387 bits (993), Expect = e-105 Identities = 220/460 (47%), Positives = 285/460 (61%), Gaps = 29/460 (6%) Frame = -3 Query: 1415 MGSHRDGNPSGDEREGCNFTLELPVTDPT-------FNLESTVCSHGLFMMSPNHWNPST 1257 M S G+P+ R LELP+ F+LE+ VCSHGLFMM+PN W+P++ Sbjct: 1 MPSQERGDPAAARRVAVELELELPLGGAPPYPGAAPFDLEAAVCSHGLFMMAPNRWDPAS 60 Query: 1256 KXXXXXXXXXXXXXXXXSVIIS-HPS-PPNPLLISVSGVA--SLSSQDQLCLLAQVRRML 1089 + +V +S HP+ P + LL+SV G +LS DQ +L QVRRML Sbjct: 61 RALVRPLRLASDRAASVAVRVSRHPARPSDALLVSVLGAPGDALSPPDQTSILEQVRRML 120 Query: 1088 RISQENDRVVREFQEMHGPSKERGFGRVFRSPTLFEDMVKCILLCNCQWSRTLSMARALC 909 R+ +E+ R EFQ MH ++E GFGR+FRSPTLFEDMVKCILLCNCQW+RTLSM+ ALC Sbjct: 121 RLDEEDGRAAAEFQAMHAVAREAGFGRIFRSPTLFEDMVKCILLCNCQWTRTLSMSTALC 180 Query: 908 ELQRELMGCLVAETFQPKTPQVVEXXXXXXXXXKVAIKLETKFVKNYTRCAENEKSPVNQ 729 ELQ EL E FQ +TP + E V +KLETKF ++ C E+ + Sbjct: 181 ELQLELRSSSSTENFQSRTPPIRECKRKRSNKRNVRVKLETKFNEDKLVCLEDPN--LAT 238 Query: 728 DTRPQSVQQCQF------SNTDSCSEELQNCSTLVVKDEPLQI---GNFPSPEELATLDE 576 DT + F S T + SE + S L +++EP G+FP+PEELA LDE Sbjct: 239 DTANLQTYENSFNLPSAASGTGNTSEVSLDHSELKLRNEPCLEDCGGDFPTPEELANLDE 298 Query: 575 NFLAQRCKLGYRAQRILSLVKDIVAGKINIKKLEE---------NCNGQLSGDYNELDMQ 423 +FLA+RC LGYRA+RI+ L + IV GKI ++KLEE Y+ L+ + Sbjct: 299 DFLAKRCNLGYRARRIVMLARSIVEGKICLQKLEEIRKMSVPTVEGLSTTPSTYDRLNEE 358 Query: 422 LSGIHGFGPFTRANVLMCMGFYHKIPTDTETIRHLKQFHGRSNCTIRSVQDDVEKIYGAY 243 LS I GFGPFTRANVLMCMGF+H IP DTETIRHLKQFH R++ TI SVQ +++ IYG Y Sbjct: 359 LSTISGFGPFTRANVLMCMGFFHMIPADTETIRHLKQFHKRAS-TISSVQKELDNIYGKY 417 Query: 242 APYQFLAYWSELWDDYEKRFGKLSEMPHSDYQVITANNMK 123 AP+QFLAYW ELW Y K+FGK+S+M +Y++ TA+ +K Sbjct: 418 APFQFLAYWCELWGFYNKQFGKISDMEPINYRLFTASKLK 457 >gb|EEE67737.1| hypothetical protein OsJ_25428 [Oryza sativa Japonica Group] Length = 442 Score = 367 bits (941), Expect = 5e-99 Identities = 211/449 (46%), Positives = 273/449 (60%), Gaps = 15/449 (3%) Frame = -3 Query: 1415 MGSHRDGNPSGDEREGCNFTLELPVTDPT-------FNLESTVCSHGLFMMSPNHWNPST 1257 M S G+P+ R LELP+ F+LE+ VCSHGLFMM+PN W+P++ Sbjct: 1 MPSPEGGDPAAARRVAVELELELPLGGAPPYPGAAPFDLEAAVCSHGLFMMAPNRWDPAS 60 Query: 1256 KXXXXXXXXXXXXXXXXSVIIS-HPS-PPNPLLISVSGVA---SLSSQDQLCLLAQVRRM 1092 + +V +S HP+ P + LL+SV G +LS DQ +L QVRRM Sbjct: 61 RALVRPLRLASDRAASVAVRVSRHPARPSDALLVSVLGAPDDDALSPLDQTSILEQVRRM 120 Query: 1091 LRISQENDRVVREFQEMHGPSKERGFGRVFRSPTLFEDMVKCILLCNCQWSRTLSMARAL 912 LR+ +E+ R V EFQ MH ++E GFGR+FRSPTLFEDM+KCILLCNCQW+RTLSM+ AL Sbjct: 121 LRLDEEDGRAVAEFQAMHAVAREVGFGRIFRSPTLFEDMIKCILLCNCQWTRTLSMSTAL 180 Query: 911 CELQRELMGCLVAETFQPKTPQVVEXXXXXXXXXKVAIKLETKFVKNYTRCAENEKSPVN 732 CELQ EL E FQ +TP + E V +KLETKF ++ C E+ N Sbjct: 181 CELQLELRSSSSTENFQSRTPPIRECKRKRSNKRNVRVKLETKFNEDKMVCLEDPNLATN 240 Query: 731 QDTRPQSVQQCQFSNTDSCSEELQNCSTLVVKDEPLQI---GNFPSPEELATLDENFLAQ 561 + T + SE + S L ++ E G+FP+PEELA LDE+FLA+ Sbjct: 241 TANENLFSLPSSANETGNTSEVSLDHSELKLRYELCLEDCGGDFPTPEELANLDEDFLAK 300 Query: 560 RCKLGYRAQRILSLVKDIVAGKINIKKLEENCNGQLSGDYNELDMQLSGIHGFGPFTRAN 381 RC LGYRA+RI+ L + IV GKI ++KLEE L +LS I G PF N Sbjct: 301 RCNLGYRARRIVMLARSIVEGKICLQKLEE--------IRKILIEELSTISGIWPFHSCN 352 Query: 380 VLMCMGFYHKIPTDTETIRHLKQFHGRSNCTIRSVQDDVEKIYGAYAPYQFLAYWSELWD 201 VLMCMGF+H IP DTETIRHLKQFH R++ TI SVQ +++ IYG YAP+QFLAYW ELW Sbjct: 353 VLMCMGFFHMIPADTETIRHLKQFHKRAS-TISSVQKELDNIYGKYAPFQFLAYWCELWG 411 Query: 200 DYEKRFGKLSEMPHSDYQVITANNMKCSN 114 Y K+FG +S+M +Y++ TA+ +K SN Sbjct: 412 FYNKQFGIISDMEPINYRLFTASKLKKSN 440 >dbj|BAC15471.1| hypothetical protein [Oryza sativa Japonica Group] gi|50510134|dbj|BAD31099.1| hypothetical protein [Oryza sativa Japonica Group] Length = 501 Score = 362 bits (928), Expect = 2e-97 Identities = 217/500 (43%), Positives = 282/500 (56%), Gaps = 66/500 (13%) Frame = -3 Query: 1415 MGSHRDGNPSGDEREGCNFTLELPVTDPT-------FNLESTVCSHGLFMMSPNHWNPST 1257 M S G+P+ R LELP+ F+LE+ VCSHGLFMM+PN W+P++ Sbjct: 1 MPSPEGGDPAAARRVAVELELELPLGGAPPYPGAAPFDLEAAVCSHGLFMMAPNRWDPAS 60 Query: 1256 KXXXXXXXXXXXXXXXXSVIIS-HPS-PPNPLLISVSGVA---SLSSQDQLCLLAQVRRM 1092 + +V +S HP+ P + LL+SV G +LS DQ +L QVRRM Sbjct: 61 RALVRPLRLASDRAASVAVRVSRHPARPSDALLVSVLGAPDDDALSPLDQTSILEQVRRM 120 Query: 1091 LRISQENDRVVREFQEMHGPSKERGFGRVFRSPTLFEDMVKCILLCNCQ----------- 945 LR+ +E+ R V EFQ MH ++E GFGR+FRSPTLFEDM+KCILLCNCQ Sbjct: 121 LRLDEEDGRAVAEFQAMHAVAREVGFGRIFRSPTLFEDMIKCILLCNCQFSLPLPLPSLA 180 Query: 944 -------------------------------WSRTLSMARALCELQRELMGCLVAETFQP 858 W+RTLSM+ ALCELQ EL E FQ Sbjct: 181 STSMRNSDTNMSRYLGIAIFHLHSTVLFNCRWTRTLSMSTALCELQLELRSSSSTENFQS 240 Query: 857 KTPQVVEXXXXXXXXXKVAIKLETKFVKNYTRCAENEKSPVNQDTRPQSVQQCQFSNTDS 678 +TP + E V +KLETKF ++ C E+ N + T + Sbjct: 241 RTPPIRECKRKRSNKRNVRVKLETKFNEDKMVCLEDPNLATNTANENLFSLPSSANETGN 300 Query: 677 CSEELQNCSTLVVKDEPLQI---GNFPSPEELATLDENFLAQRCKLGYRAQRILSLVKDI 507 SE + S L ++ E G+FP+PEELA LDE+FLA+RC LGYRA+RI+ L + I Sbjct: 301 TSEVSLDHSELKLRYELCLEDCGGDFPTPEELANLDEDFLAKRCNLGYRARRIVMLARSI 360 Query: 506 VAGKINIKKLEENCNGQLS---------GDYNELDMQLSGIHGFGPFTRANVLMCMGFYH 354 V GKI ++KLEE + Y+ L+ +LS I GFGPFTRANVLMCMGF+H Sbjct: 361 VEGKICLQKLEEIRKMSVPTVEGLSTTPSTYDRLNEELSTISGFGPFTRANVLMCMGFFH 420 Query: 353 KIPTDTETIRHLKQFHGRSNCTIRSVQDDVEKIYGAYAPYQFLAYWSELWDDYEKRFGKL 174 IP DTETIRHLKQFH R++ TI SVQ +++ IYG YAP+QFLAYW ELW Y K+FG + Sbjct: 421 MIPADTETIRHLKQFHKRAS-TISSVQKELDNIYGKYAPFQFLAYWCELWGFYNKQFGII 479 Query: 173 SEMPHSDYQVITANNMKCSN 114 S+M +Y++ TA+ +K SN Sbjct: 480 SDMEPINYRLFTASKLKKSN 499 >ref|XP_003534756.1| PREDICTED: uncharacterized protein LOC100781827 [Glycine max] Length = 426 Score = 341 bits (874), Expect = 3e-91 Identities = 202/436 (46%), Positives = 256/436 (58%), Gaps = 18/436 (4%) Frame = -3 Query: 1355 LELPVTDPTFNLESTVCSHGLFMMSPNHWNPSTKXXXXXXXXXXXXXXXXSVIISHPSPP 1176 +ELP F LE VCSHGLFMM PNHW+P +K ++S Sbjct: 1 MELP---SPFQLEQAVCSHGLFMMPPNHWDPLSKTLIRPLRSSPSSF-----LVSLSQHS 52 Query: 1175 NPLLISVSGVASLSSQDQLCLLAQVRRMLRISQENDRVVREFQEMHGPSK-ERGF-GRVF 1002 L + V +LS Q Q + AQV RMLR S+ ++ VREF+ +H R F GRVF Sbjct: 53 QSLAVRVHATHALSPQQQNHITAQVSRMLRFSEAEEKAVREFRSLHVVDHPNRSFSGRVF 112 Query: 1001 RSPTLFEDMVKCILLCNCQWSRTLSMARALCELQRELMG---CLVA---------ETFQP 858 RSPTLFEDMVKCILLCNCQW RTLSMA+ALCELQ EL C +A E F P Sbjct: 113 RSPTLFEDMVKCILLCNCQWPRTLSMAQALCELQLELQNGSPCTIAVSGNSKGESEGFIP 172 Query: 857 KTPQVVEXXXXXXXXXKVAIKLETKFVKNYTRCAENEKSPVNQDTRPQSVQQCQFSNTDS 678 KTP E K+ TK + + + ++ S + + Sbjct: 173 KTPASKETRRN---------KVSTKGMFCKKKLELDGNLQIDHVVASSSTATTLLTTDNG 223 Query: 677 CSEELQN---CSTLVVKDEPL-QIGNFPSPEELATLDENFLAQRCKLGYRAQRILSLVKD 510 SEEL++ C +E + GNFPSP ELA LDE+FLA+RC LGYRA I+ L + Sbjct: 224 DSEELRSHDSCHEFSNGNEYFSRTGNFPSPSELANLDESFLAKRCGLGYRAGYIIELARA 283 Query: 509 IVAGKINIKKLEENCNGQLSGDYNELDMQLSGIHGFGPFTRANVLMCMGFYHKIPTDTET 330 IV GKI + +LEE +Y +LD QL I G+GPFTRANVLMC+G+YH IPTD+ET Sbjct: 284 IVEGKIQLGQLEELSKDASLSNYKQLDDQLKQIRGYGPFTRANVLMCLGYYHVIPTDSET 343 Query: 329 IRHLKQFHGRSNCTIRSVQDDVEKIYGAYAPYQFLAYWSELWDDYEKRFGKLSEMPHSDY 150 +RHLKQ H R T ++++ ++E+IYG Y PYQFLA+WSE+WD YE RFGKL+EM SDY Sbjct: 344 VRHLKQVHSRYT-TSKTIERELEEIYGKYEPYQFLAFWSEVWDFYETRFGKLNEMHSSDY 402 Query: 149 QVITANNMKCSNGCRK 102 ++ITA NM+ + RK Sbjct: 403 KLITACNMRSTTNKRK 418 >ref|XP_002519384.1| conserved hypothetical protein [Ricinus communis] gi|223541451|gb|EEF43001.1| conserved hypothetical protein [Ricinus communis] Length = 458 Score = 330 bits (847), Expect = 4e-88 Identities = 194/427 (45%), Positives = 246/427 (57%), Gaps = 24/427 (5%) Frame = -3 Query: 1331 TFNLESTVCSHGLFMMSPNHWNPSTKXXXXXXXXXXXXXXXXSVIISHPSPPNPLLISVS 1152 TF+LE TVCSHGLFM+SPNHW+P ++ V IS + LL+ V Sbjct: 21 TFDLEKTVCSHGLFMLSPNHWDPLSRTFSRPLRLNDDTDNSLMVSISQHLSKS-LLVRVY 79 Query: 1151 GVASLSSQDQLCLLAQVRRMLRISQENDRVVREFQEMHGPSKERGF-------GRVFRSP 993 G SLS + Q LL Q+ RMLR+S ++ REF+++ + GRV RSP Sbjct: 80 GNRSLSPKHQESLLVQIVRMLRLSDMDEFNAREFRKIVSAFEGEECPLIGDFGGRVLRSP 139 Query: 992 TLFEDMVKCILLCNCQWSRTLSMARALCELQRELMGCLVAET-----FQPKTPQVVEXXX 828 TLFEDMVKCILLCNCQWSRTLSMA ALC+ Q EL + F P TP E Sbjct: 140 TLFEDMVKCILLCNCQWSRTLSMADALCKFQIELHSQSPQQKHAFNHFIPNTPVKKEPKR 199 Query: 827 XXXXXXKVAIKLETKFVKNYTRCAENEKSPVNQDTRPQSVQQCQFSNTDSCSEELQNCST 648 + E+ ++ C + S + V F N SC ST Sbjct: 200 KIRLSK---VPTESMDLEAADTCLTTDDSQMKISNSLNCVDDGSFDNLKSCQGSNTFYST 256 Query: 647 LVVKDEPLQ------------IGNFPSPEELATLDENFLAQRCKLGYRAQRILSLVKDIV 504 +Q GNFPSP ELA LDE FLA+RC LGYRA RI+ L + IV Sbjct: 257 GPYATSDIQSHLVTQHCAKKTTGNFPSPRELANLDERFLAKRCGLGYRAGRIIKLAQGIV 316 Query: 503 AGKINIKKLEENCNGQLSGDYNELDMQLSGIHGFGPFTRANVLMCMGFYHKIPTDTETIR 324 G+I +++ E+ NG Y++L QL I GFGPFTRANVLMCMGFYH IPTD+ET+R Sbjct: 317 EGRIPLREFEQVSNGGSLSTYSKLTDQLREIEGFGPFTRANVLMCMGFYHVIPTDSETVR 376 Query: 323 HLKQFHGRSNCTIRSVQDDVEKIYGAYAPYQFLAYWSELWDDYEKRFGKLSEMPHSDYQV 144 H KQ H + N TI++VQ + E+IY +AP+QFL YW+ELW YE+RFGKLSEMP S+Y++ Sbjct: 377 HFKQVHAK-NSTIKTVQSEAEEIYRKFAPFQFLVYWAELWHFYEQRFGKLSEMPCSNYKL 435 Query: 143 ITANNMK 123 ITA+N++ Sbjct: 436 ITASNLR 442