BLASTX nr result
ID: Dioscorea21_contig00011100
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00011100
         (1236 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits)  Value

gb|AAO23078.1| polyprotein [Glycine max]                              389   e-106
ref|XP_003524238.1| PREDICTED: uncharacterized protein LOC100782...   305   2e-80
ref|XP_003555357.1| PREDICTED: uncharacterized protein LOC100813...   300   7e-79
ref|XP_003553022.1| PREDICTED: uncharacterized protein LOC100788...   295   2e-77
emb|CAN75225.1| hypothetical protein VITISV_035856 [Vitis vinifera]   294   4e-77

>gb|AAO23078.1| polyprotein [Glycine max]
          Length = 1552

 Score = 389 bits (999), Expect = e-106
 Identities = 194/405 (47%), Positives = 266/405 (65%), Gaps = 1/405 (0%)
 Frame = +2

Query: 23   RRIGFQEMQIRKSKGLCFNCDEKYSPNHKCPNKKLLLLQWDDNDTEIYDSEFFIDPHPPN 202
            ++I E+Q+R+ K LC+ CDEK+SP HKCPN++++LLQ ++ D + D + +
Sbjct: 311  KKISPAEIQLRREKNLCYFCDEKFSPAHKCPNRQVMLLQLEETDEDQTDEQVMV------ 364

Query: 203  AVQDTGQDSSTK-TSLNAMSSTTLSGTMRFTGIVGGQKITILLDGGSDDTFIQPRVVKFL 379
            ++ D T SLNAM + GT+RFTG VGG + IL+DGGS D FIQPRV + L
Sbjct: 365  -TEEANMDDDTHHLSLNAMRGSNGVGTIRFTGQVGGIAVKILVDGGSSDNFIQPRVAQVL 423

Query: 380  HMDMLPTLPSKVLVGNGQTLQVEGKIPELAVQVQGYTLLVPAYVXXXXXXXXXXXXSWLA 559
            + + P +VLVGNGQ L EG + +L + +QG + VP Y+ +WLA
Sbjct: 424  KLPVEPAPNLRVLVGNGQILSAEGIVQQLPLHIQGQEVKVPVYLLQISGADVILGSTWLA 483

Query: 560  TLGPHVIDYEKKIIKFYHNNQFMVLKGEPMKRPAYTTVPQLNRLCSTQAVRECYSLQIIQ 739
            TLGPHV DY +KF+ N++F+ L+GE + RL +T+++ EC+++Q+IQ
Sbjct: 484  TLGPHVADYAALTLKFFQNDKFITLQGEGNSEATQAQLHHFRRLQNTKSIEECFAIQLIQ 543

Query: 740  EDSQLNTLASTSPNLASEIQSIVSTDAPSELLELLLKFQTVFHVPAGLPPSRSCDHRIPL 919
            ++ +TL N+ E+ ++ T + VF VPA LPP R DH IPL
Sbjct: 544  KEVPEDTLKDLPTNIDPELAILLHT------------YAQVFAVPASLPPQREQDHAIPL 591

Query: 920  LPASTPVKVKPYRYPHSQKAEIEKMVNQMLIEGLIEHSTSPFSSPIILVKKKDGTWRFCT 1099
            S PVKV+PYRYPH+QK +IEKM+ +ML++G+I+ S SPFS PI+LVKKKDG+WRFCT
Sbjct: 592  KQGSGPVKVRPYRYPHTQKDQIEKMIQEMLVQGIIQPSNSPFSLPILLVKKKDGSWRFCT 651

Query: 1100 DYRALNAITIKDAYPIPTVDELLDELYGASYFSKLDLRSGYHQIL 1234
            DYRALNAIT+KD++P+PTVDELLDEL+GA YFSKLDLRSGYHQIL
Sbjct: 652  DYRALNAITVKDSFPMPTVDELLDELHGAQYFSKLDLRSGYHQIL 696

>ref|XP_003524238.1| PREDICTED: uncharacterized protein LOC100782971 [Glycine max]
          Length = 1863

 Score = 305 bits (781), Expect = 2e-80
 Identities = 164/402 (40%), Positives = 233/402 (57%), Gaps = 3/402 (0%)
 Frame = +2

Query: 38   QEMQIRKSKGLCFNCDEKYSPNHKCPNKKLLLLQWDDNDTEIYDSEFFIDPHPPNAVQDT 217
            +EM R+ KGLC+NC+EK+S +H+C + LL + D ++ D+ DP PP
Sbjct: 223  EEMAYRREKGLCYNCEEKWSSSHRCKGRVLLFIA-DSDEASSMDNPSMEDPAPPTQATLP 281

Query: 218  GQDSST---KTSLNAMSSTTLSGTMRFTGIVGGQKITILLDGGSDDTFIQPRVVKFLHMD 388
            D + SL+AM+ + T R G++ ++TIL+D GS F+QPR+ KFL +
Sbjct: 282  PFDPTPLLPHISLHAMAGVPATDTFRLYGVINHTRVTILVDSGSTHNFVQPRIAKFLGLP 341

Query: 389  MLPTLPSKVLVGNGQTLQVEGKIPELAVQVQGYTLLVPAYVXXXXXXXXXXXXSWLATLG 568
            M T +V+VGNG L+ + P + +Q ++ V V WL TLG
Sbjct: 342  MEDTTSLQVMVGNGSVLECKQSCPATTLLLQQHSFTVTLRVLPISGADVVLGVEWLRTLG 401

Query: 569  PHVIDYEKKIIKFYHNNQFMVLKGEPMKRPAYTTVPQLNRLCSTQAVRECYSLQIIQEDS 748
            P + DY ++F H Q ++L+ + + Q+ RL T ++ + L ++
Sbjct: 402  PIITDYTSFTMQFTHLGQPIILRADVTTCTDTASAHQVKRLLHTHSLSGLFHLSLLPTH- 460

Query: 749  QLNTLASTSPNLASEIQSIVSTDAPSELLELLLKFQTVFHVPAGLPPSRSCDHRIPLLPA 928
            + T+P+ I +I ELLL+F T+F P+ LPP R DH I L+P+
Sbjct: 461  ----IPETAPDPPHPISAIN---------ELLLRFHTIFQQPSSLPPPRQHDHYINLIPS 507

Query: 929  STPVKVKPYRYPHSQKAEIEKMVNQMLIEGLIEHSTSPFSSPIILVKKKDGTWRFCTDYR 1108
            + PV V+PY+YPH QK EIEK V+ +L G I+ S SPFSSP++LVKKKDGTWR C DYR
Sbjct: 508  AHPVNVRPYKYPHFQKNEIEKQVSALLESGFIQPSRSPFSSPVLLVKKKDGTWRMCVDYR 567

Query: 1109 ALNAITIKDAYPIPTVDELLDELYGASYFSKLDLRSGYHQIL 1234
            ALN+ITI+D +PIPT+DELLDEL AS+FSKLDLR G+HQIL
Sbjct: 568  ALNSITIRDRFPIPTIDELLDELGHASWFSKLDLRQGFHQIL 609

>ref|XP_003555357.1| PREDICTED: uncharacterized protein LOC100813803 [Glycine max]
          Length = 2140

 Score = 300 bits (767), Expect = 7e-79
 Identities = 167/422 (39%), Positives = 239/422 (56%), Gaps = 13/422 (3%)
 Frame = +2

Query: 8    PPTAFRRIGFQEMQIRKSKGLCFNCDEKYSPNHKCPNKKLLLLQWDDNDTEIYDSEFFID 187
            P F + ++M R+ KGLC+NCDEK++ +H+C + L + D + S D
Sbjct: 121  PKAPFVQRTQEDMAYRREKGLCYNCDEKWNSSHRCKGRVLFFIANSDETSSPESSPS--D 178

Query: 188  PHPP-NAVQDTGQDSSTKT----------SLNAMSSTTLSGTMRFTGIVGGQKITILLDG 334
            P P + D +T+ SL+AM+ + T R G++ ++TIL+D
Sbjct: 179  PSSPLKSEHDHTLLEATQAFDLTPLQPHISLHAMAGVPATDTFRLYGLINKTRVTILVDS 238

Query: 335  GSDDTFIQPRVVKFLHMDMLPTLPSKVLVGNGQTLQVEGKIPELAVQVQGYTLLVPAYVX 514
            GS F+QPRV KFL++ + T P +V+VGNG L + IP+ + +Q + +V +
Sbjct: 239  GSTHNFVQPRVAKFLNLPLHDTQPLRVMVGNGSVLDCQQMIPDTTILIQEHRFVVTLRLL 298

Query: 515  XXXXXXXXXXXSWLATLGPHVIDYEKKIIKFYHNNQFMVLKGEPMKRPAYTTVPQLNRLC 694
            WL TLGP + DY +KF + + L+ + + + Q+ RL
Sbjct: 299  PLSGADVVLGVEWLRTLGPVITDYTDFTMKFTLFGRPIHLRADVQVNTSPVSAHQVRRLI 358

Query: 695  STQAVRECY--SLQIIQEDSQLNTLASTSPNLASEIQSIVSTDAPSELLELLLKFQTVFH 868
            ST++ + SLQ I LNT P + +LL K+Q++F
Sbjct: 359  STKSTSGLFHLSLQPIPSSEMLNTTPHPVPAID----------------KLLNKYQSLFE 402

Query: 869  VPAGLPPSRSCDHRIPLLPASTPVKVKPYRYPHSQKAEIEKMVNQMLIEGLIEHSTSPFS 1048
            P GLPP R DH+I LLP++ P+ V+PYRYP+SQK EIEK V+ +L GLI+ S SPFS
Sbjct: 403  APTGLPPPRQHDHQINLLPSAHPINVRPYRYPYSQKTEIEKQVSALLDSGLIQPSRSPFS 462

Query: 1049 SPIILVKKKDGTWRFCTDYRALNAITIKDAYPIPTVDELLDELYGASYFSKLDLRSGYHQ 1228
            SP++LVKKKDGTWR C DYRALN+IT++D +P+PT+DELLDEL AS+FSKLDLR G+HQ
Sbjct: 463  SPVLLVKKKDGTWRMCVDYRALNSITVRDRFPLPTIDELLDELGQASWFSKLDLRQGFHQ 522

Query: 1229 IL 1234
            IL
Sbjct: 523  IL 524

>ref|XP_003553022.1| PREDICTED: uncharacterized protein LOC100788433 [Glycine max]
          Length = 1433

 Score = 295 bits (754), Expect = 2e-77
 Identities = 169/421 (40%), Positives = 233/421 (55%), Gaps = 12/421 (2%)
 Frame = +2

Query: 8    PPTAFRRIGFQEMQIRKSKGLCFNCDEKYSPNHKCPNKKLLLLQWDDNDTEIYDSEF-FI 184
            P F + E+ R+ +GLC+NCD+K+S +H C + LLL+ D +SE F
Sbjct: 247  PKPPFTQRTPSEIAYRRERGLCYNCDDKWSASHHCKGRVLLLIADPDTPDNPDNSEPPFN 306

Query: 185  DPHP--PNAVQDTGQDS---------STKTSLNAMSSTTLSGTMRFTGIVGGQKITILLD 331
            P P P + T D + SLNA+S T R G + +IT+L+D
Sbjct: 307  SPAPSLPASTPPTDLDPIPDPDLPFPTPHISLNALSGLPTPETFRLFGYINHTRITVLID 366

Query: 332  GGSDDTFIQPRVVKFLHMDMLPTLPSKVLVGNGQTLQVEGKIPELAVQVQGYTLLVPAYV 511
            GS F+QPR+ FLH+ +PT P +VLVGNG L P+ + +Q + + ++
Sbjct: 367  SGSTHNFLQPRLATFLHLPTVPTNPLRVLVGNGAVLTCTHLCPDTTISLQSHHFTLTFHL 426

Query: 512  XXXXXXXXXXXXSWLATLGPHVIDYEKKIIKFYHNNQFMVLKGEPMKRPAYTTVPQLNRL 691
            WL LGP DY I+KF+H Q + L + P + Q+ R+
Sbjct: 427  LPISGADVILGIQWLKLLGPITTDYTSLIMKFHHLGQPVELHVDADHGPHPISATQIKRM 486

Query: 692  CSTQAVRECYSLQIIQEDSQLNTLASTSPNLASEIQSIVSTDAPSELLELLLKFQTVFHV 871
            T A + L ++ T+ P+ S SI + DA L+ K+Q++F
Sbjct: 487  IQTNATSALFHLCVLPASD--TTIPQHPPS--STPSSIPAIDA------LIHKYQSLFQT 536

Query: 872  PAGLPPSRSCDHRIPLLPASTPVKVKPYRYPHSQKAEIEKMVNQMLIEGLIEHSTSPFSS 1051
            P LPPSRS DH I L P + P+ V+PYRYPH QKAEIEK V +L GLI+ S SPFSS
Sbjct: 537  PTALPPSRSIDHHIHLRPNTEPINVRPYRYPHFQKAEIEKQVADLLSAGLIQVSRSPFSS 596

Query: 1052 PIILVKKKDGTWRFCTDYRALNAITIKDAYPIPTVDELLDELYGASYFSKLDLRSGYHQI 1231
            P++LVKKKD +WR C DYRALNA+TI+D +P+PTVDELLD+L AS++SKLDL+ G+HQI
Sbjct: 597  PVLLVKKKDDSWRMCVDYRALNAVTIRDRFPMPTVDELLDDLGHASWYSKLDLQQGFHQI 656

Query: 1232 L 1234
            L
Sbjct: 657  L 657

>emb|CAN75225.1| hypothetical protein VITISV_035856 [Vitis vinifera]
          Length = 793

 Score = 294 bits (752), Expect = 4e-77
 Identities = 165/410 (40%), Positives = 238/410 (58%)
 Frame = +2

Query: 2    STPPTAFRRIGFQEMQIRKSKGLCFNCDEKYSPNHKCPNKKLLLLQWDDNDTEIYDSEFF 181
            S P +R+ ++EMQ R+++GLCFNCD+K++ HKC +LLLL+ + + + D +
Sbjct: 217  SKPTPTMKRLTWEEMQKRRAQGLCFNCDDKFTVGHKCRGLQLLLLEENSSPNKEDDIDEE 276

Query: 182  IDPHPPNAVQDTGQDSSTKTSLNAMSSTTLSGTMRFTGIVGGQKITILLDGGSDDTFIQP 361
            I+ N + + S +A++ + TMR T +G ++ +L+D GS FI
Sbjct: 277  IEEPAIN------EQIEPEISFHALTGWSTPKTMRITAKIGQHEVVVLIDSGSTHNFISE 330

Query: 362  RVVKFLHMDMLPTLPSKVLVGNGQTLQVEGKIPELAVQVQGYTLLVPAYVXXXXXXXXXX 541
            +V LH+ ++PT P V V NG L+ +G+ + V +QG + Y
Sbjct: 331  KVADMLHLPVVPTKPFTVKVVNGTPLKCQGRFEHVHVILQGIPFSLTLYSLPLTGLDLVL 390

Query: 542  XXSWLATLGPHVIDYEKKIIKFYHNNQFMVLKGEPMKRPAYTTVPQLNRLCSTQAVRECY 721
            WL LG V +++K ++F NQ L+G T Q ++ S +AV
Sbjct: 391  GVQWLEQLGTVVCNWKKLTMEFQWENQTHKLQG---------TNTQTIQVASLKAV---- 437

Query: 722  SLQIIQEDSQLNTLASTSPNLASEIQSIVSTDAPSELLELLLKFQTVFHVPAGLPPSRSC 901
            S ++ Q S ++ N E+Q + D + +L+ F+ +F P LPP+R
Sbjct: 438  SKELRQGSSMFAICLQSTSN---EVQQAIHLD----MQQLIKAFEDIFQEPNQLPPAREV 490

Query: 902  DHRIPLLPASTPVKVKPYRYPHSQKAEIEKMVNQMLIEGLIEHSTSPFSSPIILVKKKDG 1081
            DHRI L + PV V+PYRY + QKAEIEK V ML GLI STSPFSSP++LVKKKDG
Sbjct: 491  DHRITLKEGTEPVNVRPYRYAYFQKAEIEKQVRDMLQLGLIRASTSPFSSPVLLVKKKDG 550

Query: 1082 TWRFCTDYRALNAITIKDAYPIPTVDELLDELYGASYFSKLDLRSGYHQI 1231
            TWRFCTDYRALNA+TIKD +PIPTVD++LDEL+GA+YF+KLDLR+GYHQ+
Sbjct: 551  TWRFCTDYRALNAVTIKDRFPIPTVDDMLDELHGATYFTKLDLRAGYHQV 600