BLASTX nr result
ID: Dioscorea21_contig00005928
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00005928
         (1560 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                         (bits)   Value

ref|XP_002275497.1| PREDICTED: WD repeat-containing protein 44-l...   476   e-132
emb|CBI37573.3| unnamed protein product [Vitis vinifera]              464   e-128
ref|XP_002517227.1| WD-repeat protein, putative [Ricinus communi...   452   e-124
ref|XP_002328477.1| predicted protein [Populus trichocarpa] gi|2...   445   e-122
ref|XP_003532860.1| PREDICTED: uncharacterized protein LOC100806...   424   e-116

>ref|XP_002275497.1| PREDICTED: WD repeat-containing protein 44-like [Vitis vinifera] Length = 674 Score = 476 bits (1226), Expect = e-132 Identities = 252/504 (50%), Positives = 314/504 (62%), Gaps = 30/504 (5%) Frame = +1 Query: 4 KSWWKSFVWKRYAMGMCK-NDVSTKSSKLPKTTRMKVRQQKKKCMEVTALYKGQEIKGHK 180 +SWWK F+ KR + VS ++S+ PK +MKVRQ KK+CME TAL GQEI+ HK Sbjct: 172 RSWWKRFISKRKGRDAASASQVSKQTSETPKINQMKVRQNKKRCMEFTALCIGQEIQAHK 231 Query: 181 GAIRILRFSSSGQYLASGGEDTVVRVWQIREVETSSKCFASNGSS----KTGKMMIGRQS 348 G I ++FS GQYLASGGED VVR+W + + S K + G+ K GK GR+ Sbjct: 232 GFIWTMKFSPDGQYLASGGEDGVVRIWCVTSTDASCKYLTTEGNFGCEVKDGKSSFGRKK 291 Query: 349 CNSAPIIIPKKVFKIEETPLHEFHGHTGGILDLSWSNSDCLLTSSEDGTVRLWKVGCDSC 528 + AP++IP K+F+IEE+PL EFHGH +LDL+WS S+ LL+SS D TVRLW+VG D C Sbjct: 292 PSYAPVVIPDKIFQIEESPLQEFHGHASDVLDLAWSKSNSLLSSSMDKTVRLWQVGHDEC 351 Query: 529 LKVFPHNDFVTCVQFNPIDERCFISGSIDGKVRIWGVSENXXXXXXXXXXXXTAVCYQPD 708 L VF HN++VTC+QFNP+D+ FISGSIDGKVRIWGVSE TA+CYQPD Sbjct: 352 LNVFRHNNYVTCIQFNPVDDNYFISGSIDGKVRIWGVSERRVVDWADVRDVITAICYQPD 411 Query: 709
GKGFVVGSVAGICQFYSYSGSGIHLDRRFNVKGRKKSIGRPITSLQFSPTDSEKVMISSA 888 GKGF+VGSV G C FY+ SG+ + LD + N GRKK+ G IT +QFS +S+KVMI+S Sbjct: 412 GKGFIVGSVTGTCCFYNASGNHLQLDAKVNFHGRKKTSGNKITGIQFSQEESQKVMITSE 471 Query: 889 DPNVRIFDKVDGI---------------PFTSDGRYIVSVDEDSHVYVWNYNESRLSSLK 1023 D +RI D +D + FTS GR+IVSV EDS VYVWNY+ S K Sbjct: 472 DSKLRILDGIDVVYKYKGLRKSGSQMSASFTSTGRHIVSVGEDSRVYVWNYDGFCSQSSK 531 Query: 1024 GAKTTRSCELFSSEGVSVAVPWPGIDHRGAGLGFNSVLASSSPQRILEPSTWLRDPDCFS 1203 K+ +SCE F EGVSVA+PW GLG +S+ S Q LE ++W+RD + FS Sbjct: 532 QMKSVQSCEHFFFEGVSVAIPWSDTGAEKKGLGSDSLRQSPPTQDHLEATSWIRDSERFS 591 Query: 1204 RGVCFFTDSPSRGSAATWPEEKLIQPSTTTEDHRHSHHY----------HCLTTISATWS 1353 G F D RG ATWPEEKL DH H H H +S W Sbjct: 592 LGNWFSMDGSCRG-GATWPEEKLPLWGAPEHDHHHHDHQQQQQHNDVQNHNHLALSEAWG 650 Query: 1354 LVIVTASHDGSIRSFHNFGLPVRL 1425 LVI TA DG+IR+FHN+GLPVRL Sbjct: 651 LVIATAGWDGTIRTFHNYGLPVRL 674 >emb|CBI37573.3| unnamed protein product [Vitis vinifera] Length = 668 Score = 464 bits (1195), Expect = e-128 Identities = 248/500 (49%), Positives = 310/500 (62%), Gaps = 26/500 (5%) Frame = +1 Query: 4 KSWWKSFVWKRYAMGMCK-NDVSTKSSKLPKTTRMKVRQQKKKCMEVTALYKGQEIKGHK 180 +SWWK F+ KR + VS ++S+ PK +MKVRQ KK+CME TAL GQEI+ HK Sbjct: 178 RSWWKRFISKRKGRDAASASQVSKQTSETPKINQMKVRQNKKRCMEFTALCIGQEIQAHK 237 Query: 181 GAIRILRFSSSGQYLASGGEDTVVRVWQIREVETSSKCFASNGSSKTGKMMIGRQSCNSA 360 G I ++FS GQYLASGGED VVR+W + + S K + G+ G + + A Sbjct: 238 GFIWTMKFSPDGQYLASGGEDGVVRIWCVTSTDASCKYLTTEGN-------FGCEP-SYA 289 Query: 361 PIIIPKKVFKIEETPLHEFHGHTGGILDLSWSNSDCLLTSSEDGTVRLWKVGCDSCLKVF 540 P++IP K+F+IEE+PL EFHGH +LDL+WS S+ LL+SS D TVRLW+VG D CL VF Sbjct: 290 PVVIPDKIFQIEESPLQEFHGHASDVLDLAWSKSNSLLSSSMDKTVRLWQVGHDECLNVF 349 Query: 541 PHNDFVTCVQFNPIDERCFISGSIDGKVRIWGVSENXXXXXXXXXXXXTAVCYQPDGKGF 720 HN++VTC+QFNP+D+ FISGSIDGKVRIWGVSE TA+CYQPDGKGF Sbjct: 350 RHNNYVTCIQFNPVDDNYFISGSIDGKVRIWGVSERRVVDWADVRDVITAICYQPDGKGF 409 Query: 721 VVGSVAGICQFYSYSGSGIHLDRRFNVKGRKKSIGRPITSLQFSPTDSEKVMISSADPNV 900 +VGSV G C FY+ SG+ + LD + N 
GRKK+ G IT +QFS +S+KVMI+S D + Sbjct: 410 IVGSVTGTCCFYNASGNHLQLDAKVNFHGRKKTSGNKITGIQFSQEESQKVMITSEDSKL 469 Query: 901 RIFDKVDGI---------------PFTSDGRYIVSVDEDSHVYVWNYNESRLSSLKGAKT 1035 RI D +D + FTS GR+IVSV EDS VYVWNY+ S K K+ Sbjct: 470 RILDGIDVVYKYKGLRKSGSQMSASFTSTGRHIVSVGEDSRVYVWNYDGFCSQSSKQMKS 529 Query: 1036 TRSCELFSSEGVSVAVPWPGIDHRGAGLGFNSVLASSSPQRILEPSTWLRDPDCFSRGVC 1215 +SCE F EGVSVA+PW GLG +S+ S Q LE ++W+RD + FS G Sbjct: 530 VQSCEHFFFEGVSVAIPWSDTGAEKKGLGSDSLRQSPPTQDHLEATSWIRDSERFSLGNW 589 Query: 1216 FFTDSPSRGSAATWPEEKLIQPSTTTEDHRHSHHY----------HCLTTISATWSLVIV 1365 F D RG ATWPEEKL DH H H H +S W LVI Sbjct: 590 FSMDGSCRG-GATWPEEKLPLWGAPEHDHHHHDHQQQQQHNDVQNHNHLALSEAWGLVIA 648 Query: 1366 TASHDGSIRSFHNFGLPVRL 1425 TA DG+IR+FHN+GLPVRL Sbjct: 649 TAGWDGTIRTFHNYGLPVRL 668 >ref|XP_002517227.1| WD-repeat protein, putative [Ricinus communis] gi|223543598|gb|EEF45127.1| WD-repeat protein, putative [Ricinus communis] Length = 654 Score = 452 bits (1162), Expect = e-124 Identities = 240/505 (47%), Positives = 320/505 (63%), Gaps = 31/505 (6%) Frame = +1 Query: 4 KSWWKSFVWKRYAMGM-CKNDVSTKSSKLP--KTTRMKVRQQKKKCMEVTALYKGQEIKG 174 KSWWK FV KR C + +S +S P +T R+KV+Q KK+CME T +Y QE++ Sbjct: 156 KSWWKFFVQKRKQKECTCVSKISELNSDTPTIETNRIKVKQNKKRCMEFTGVYMQQELRA 215 Query: 175 HKGAIRILRFSSSGQYLASGGEDTVVRVWQIREVETSSKCFASNGS--SKTGKMMIGRQS 348 HKG I ++FS GQYLA+GGED +VR+WQ+ V K FAS S K GK ++ Sbjct: 216 HKGFIWTMKFSPDGQYLATGGEDGIVRIWQVTSVNGCQKSFASEDSFDMKEGK---SKRK 272 Query: 349 CNSAPIIIPKKVFKIEETPLHEFHGHTGGILDLSWSNSDCLLTSSEDGTVRLWKVGCDSC 528 + A ++IP+++F++EE+P+HEF+GH+ +LDL+WSNS+CLL+SS+D TVRLW+VG D C Sbjct: 273 MSHASVVIPERIFQLEESPVHEFYGHSSDVLDLAWSNSNCLLSSSKDKTVRLWQVGSDHC 332 Query: 529 LKVFPHNDFVTCVQFNPIDERCFISGSIDGKVRIWGVSENXXXXXXXXXXXXTAVCYQPD 708 L +F H +VTC+QFNP++E FISGSIDGKVRIWGV E +A+CYQPD Sbjct: 333 LNIFHHISYVTCIQFNPVNENYFISGSIDGKVRIWGVCEQRVVDWVDAHDVTSAICYQPD 392 Query: 709 GKGFVVGSVAGICQFYSYSGSGIHLDRRFNVKGRKKSIGRPITSLQFSPTDSEKVMISSA 888 GKGFVVGS+ G C+FY SG+ + L +V+GRK++ G IT +QFS 
++VMISS Sbjct: 393 GKGFVVGSITGTCRFYEASGNDLQLGAEIHVQGRKRTAGNRITGIQFSQERPQRVMISSE 452 Query: 889 DPNVRIFDKVD------GIP---------FTSDGRYIVSVDEDSHVYVWNYNESRLSSLK 1023 D +RI D VD G+P FTS G++I+SV ED VYVW+Y+E S K Sbjct: 453 DSKLRILDGVDIVHKYKGLPKSGSQMSASFTSGGKHIISVGEDCRVYVWDYDELCTPSSK 512 Query: 1024 GAKTTRSCELFSSEGVSVAVPWPGIDHRGAGLGFNSVLASSSPQRILEPSTWLRDPDCFS 1203 K+ RSCE F SEGVS+AVPW G+ + S+ + + E ++ RD + FS Sbjct: 513 HKKSVRSCEHFFSEGVSIAVPWSGMRKELKDVDSGSIHSRMPSE---EGTSRRRDSERFS 569 Query: 1204 RGVCFFTDSPSRGSAATWPEEKL--IQPSTTTEDHRHSHH---------YHCLTTISATW 1350 G FF D P RGS+ATWPEEKL + E+ +H H+ H TT+ W Sbjct: 570 LGNWFFIDGPCRGSSATWPEEKLPHWDIAMAEEECQHQHYENFQQQLIKTHGHTTLPNAW 629 Query: 1351 SLVIVTASHDGSIRSFHNFGLPVRL 1425 LVI TA DG+IR+FHN+GLPVRL Sbjct: 630 GLVIATAGWDGTIRTFHNYGLPVRL 654 >ref|XP_002328477.1| predicted protein [Populus trichocarpa] gi|222838192|gb|EEE76557.1| predicted protein [Populus trichocarpa] Length = 655 Score = 445 bits (1144), Expect = e-122 Identities = 236/510 (46%), Positives = 316/510 (61%), Gaps = 36/510 (7%) Frame = +1 Query: 4 KSWWKSFVWK--RYAMGMCKNDVSTKSSKLPKTTRMKVRQQKKKCMEVTALYKGQEIKGH 177 KSWWK F+ K +G C + VS ++ PKT R KV+Q KK CME T +Y QEI+ H Sbjct: 154 KSWWKHFLKKSKEERVGRCVSGVSKLDTEAPKTNRTKVKQNKKGCMEFTGVYMRQEIQAH 213 Query: 178 KGAIRILRFSSSGQYLASGGEDTVVRVWQIREVETSSKCFASNG-------SSKTGKMMI 336 KG I ++FS GQYLA+GGED ++RVW++ V++S K F S G +K+ + Sbjct: 214 KGFIWTMKFSPDGQYLATGGEDRIIRVWRVTLVDSSCKSFPSEGHCDSNLKEAKSNNLST 273 Query: 337 GRQSCNSAPIIIPKKVFKIEETPLHEFHGHTGGILDLSWSNSDCLLTSSEDGTVRLWKVG 516 ++ +S ++IP+KVF+IEETPL EFHGH ILDL+WS+S+ LL+SS D TVRLW++G Sbjct: 274 KKRMYSS--VVIPEKVFQIEETPLQEFHGHASEILDLAWSDSNHLLSSSMDKTVRLWRLG 331 Query: 517 CDSCLKVFPHNDFVTCVQFNPIDERCFISGSIDGKVRIWGVSENXXXXXXXXXXXXTAVC 696 C+ L +F H+++VTC+QFNP+D+ FISGSIDGKVRIWGVSE +A+ Sbjct: 332 CNHSLNIFRHSNYVTCIQFNPVDKNYFISGSIDGKVRIWGVSEKRVVHWTDVRDVISAIS 391 Query: 697 YQPDGKGFVVGSVAGICQFYSYSGSGIHLDRRFNVKGRKKSIGRPITSLQFSPTDSEKVM 876 YQPDG+GFVVG++ G C+FY SG+ + L+ +++GR+++ G ITS+QFS +VM Sbjct: 392 
YQPDGRGFVVGTIKGTCRFYEVSGTDLQLEAEVHIQGRRRTSGNRITSIQFSQEICPRVM 451 Query: 877 ISSADPNVRIFDKVD------GIP---------FTSDGRYIVSVDEDSHVYVWNYNESRL 1011 I+S D VR+FD VD G+P FTS+GR+I+SV ED VYVWNY+ Sbjct: 452 ITSEDSKVRVFDGVDIVNKFKGLPKSGSQMSASFTSNGRHIISVGEDCRVYVWNYDGLCT 511 Query: 1012 SSLKGAKTTRSCELFSSEGVSVAVPWPGIDHRGAGLGFNSVLASSSPQRILEPSTWLRDP 1191 S K K+ +SCE F SEGVSVAVPW G GLG L E ++W +D Sbjct: 512 SWSKHIKSVKSCEFFFSEGVSVAVPWSGTGTEVRGLGSRRSLTE-------ETASWRKDS 564 Query: 1192 DCFSRGVCFFTDSPSRGSAATWPEEKL-------IQPSTTTEDHRHSHHYHCLT-----T 1335 + FS G FF D RGS ATWPEEKL + + + H C+ + Sbjct: 565 ERFSLGSWFFMDGRCRGSYATWPEEKLPMWDVPVLDAEYQNQQIKSLHQQQCVNSNDHIS 624 Query: 1336 ISATWSLVIVTASHDGSIRSFHNFGLPVRL 1425 +S W LVIVTA DG IR+FHN+GLP+ L Sbjct: 625 LSEAWGLVIVTAGWDGKIRAFHNYGLPIML 654 >ref|XP_003532860.1| PREDICTED: uncharacterized protein LOC100806747 [Glycine max] Length = 617 Score = 424 bits (1091), Expect = e-116 Identities = 237/501 (47%), Positives = 311/501 (62%), Gaps = 27/501 (5%) Frame = +1 Query: 4 KSWWKSFVWKRYAMGMCKNDVSTKSSKLPKTTRMKVRQQKKKCMEVTALYKGQEIKGHKG 183 K+WWK FV R ++ KT R+KVRQ KK+ +E + LY GQE++ HKG Sbjct: 129 KNWWKRFVNIRKG--------GEGNAGTNKTRRIKVRQNKKRWLEFSGLYLGQEVRAHKG 180 Query: 184 AIRILRFSSSGQYLASGGEDTVVRVWQIREVETSSKCFASNGSSKTGKMMIG----RQSC 351 I ++FS GQYLASGGED VV +W++ ++ SS C + S+ K+ R Sbjct: 181 LIWKMKFSPCGQYLASGGEDGVVCIWRVTSLDKSSICSTTEDSTSNSKVECDNSSPRNKH 240 Query: 352 NSAPII-IPKKVFKIEETPLHEFHGHTGGILDLSWSNSDCLLTSSEDGTVRLWKVGCDSC 528 +S P I +P +F+IEE+PL EF GH+ +LDL+WSNSD LL+SS D TVRLW++GC+ C Sbjct: 241 SSQPFIFLPNSIFQIEESPLQEFFGHSSDVLDLAWSNSDILLSSSMDKTVRLWQIGCNQC 300 Query: 529 LKVFPHNDFVTCVQFNPIDERCFISGSIDGKVRIWGVSENXXXXXXXXXXXXTAVCYQPD 708 L VF HND+VTC+QFNP+DE FISGSIDGKVRIWG+ E +A+ YQ D Sbjct: 301 LNVFHHNDYVTCIQFNPVDENYFISGSIDGKVRIWGIREERVIDWADIRDVISAISYQQD 360 Query: 709 GKGFVVGSVAGICQFYSYSGSGIHLDRRFNVKGRKKSIGRPITSLQFSPTDSEKVMISSA 888 GKGFVVGSV G C FY SG+ L+ + +V G+KK G IT +QFS +S+++MI+S Sbjct: 361 GKGFVVGSVTGTCCFYVASGTYFQLEAQIDVHGKKKVSGNKITGIQFSQKNSQRIMITSE 
420 Query: 889 DPNVRIFD------KVDGIP---------FTSDGRYIVSVDEDSHVYVWNYNESRLSSLK 1023 D + IFD K G+P FTS G+ I+SV EDSHVY+WN+++ +S K Sbjct: 421 DSKICIFDGTELVQKYKGLPKSGSQMSGSFTSSGKNIISVGEDSHVYIWNFDDMGNASSK 480 Query: 1024 GAKTTRSCELFSSEGVSVAVPWPGI--DHRGAGLGFNSVLASSSPQRILEPSTWLRDPDC 1197 K+ RSCE F S+GV+VA+PW G+ D R + G S +S P + LE + RD + Sbjct: 481 QTKSERSCEYFFSKGVTVAIPWSGMKADQRDSS-GNYSHRSSEMPTQQLEVAPETRDHEL 539 Query: 1198 FSRGVCFFTDSPSRGSAATWPEEKLIQPS----TTTEDHRHSHHYHCL-TTISATWSLVI 1362 FS G F TD RGS TWPEEKL PS ED + SH +C ++S TW L I Sbjct: 540 FSLGNWFTTDGSCRGS-MTWPEEKL--PSWDLPIAEEDQQLSHKDNCHDRSVSETWGLSI 596 Query: 1363 VTASHDGSIRSFHNFGLPVRL 1425 V A DG+I++FHNFGLPVRL Sbjct: 597 VAAGCDGTIKTFHNFGLPVRL 617
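The per-hit statistics lines in the report above (Length, Score, Expect, Identities, Positives, Gaps, Frame) can be pulled out with a regular expression. The following is a minimal sketch, assuming the legacy BLAST 2.2.x text layout shown in this record; the names HIT_RE, parse_evalue, and parse_hits are hypothetical helpers introduced here for illustration and are not part of BLAST or of any library. It also normalizes the legacy shorthand in which an E-value such as e-132 is printed without its leading 1.

```python
import re

# Assumption: legacy BLAST 2.2.x layout, with whitespace (spaces or newlines)
# between fields. The Gaps field is optional because ungapped hits omit it.
HIT_RE = re.compile(
    r">(?P<id>\S+)\s+(?P<desc>.*?)\s+Length = (?P<length>\d+)\s+"
    r"Score = (?P<bits>[\d.]+) bits \((?P<raw>\d+)\), "
    r"Expect = (?P<evalue>[\de.+-]+)\s+"
    r"Identities = (?P<ident>\d+)/(?P<aln>\d+) \(\d+%\), "
    r"Positives = (?P<pos>\d+)/\d+ \(\d+%\)"
    r"(?:, Gaps = (?P<gaps>\d+)/\d+ \(\d+%\))?"
    r"\s+Frame = (?P<frame>[+-]\d)",
    re.S,
)


def parse_evalue(text):
    """Convert a legacy E-value string to float; 'e-132' is read as 1e-132."""
    return float("1" + text if text.startswith("e") else text)


def parse_hits(report):
    """Return one dict of summary statistics per hit found in the report text."""
    hits = []
    for m in HIT_RE.finditer(report):
        ident, aln = int(m["ident"]), int(m["aln"])
        hits.append({
            "id": m["id"],
            "length": int(m["length"]),
            "bits": float(m["bits"]),
            "evalue": parse_evalue(m["evalue"]),
            # Recomputed from the raw counts instead of the rounded percentage.
            "identity_pct": round(100.0 * ident / aln, 1),
            "gaps": int(m["gaps"]) if m["gaps"] else 0,
            "frame": m["frame"],
        })
    return hits


# Statistics line of the first hit, copied from the report above.
sample = (
    ">ref|XP_002275497.1| PREDICTED: WD repeat-containing protein 44-like "
    "[Vitis vinifera] Length = 674 Score = 476 bits (1226), Expect = e-132 "
    "Identities = 252/504 (50%), Positives = 314/504 (62%), "
    "Gaps = 30/504 (5%) Frame = +1 Query: 4 KSWWK"
)
hits = parse_hits(sample)
print(hits[0]["id"], hits[0]["bits"], hits[0]["evalue"], hits[0]["identity_pct"])
```

Passing the full report text to parse_hits should yield one entry per hit header. The alignment rows themselves are not parsed, since the column spacing of the match lines is not preserved in this record.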