BLASTX nr result
ID: Dioscorea21_contig00007319
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00007319 (2749 letters)

Database: ./nr
23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits)  Value

gb|EEE54640.1| hypothetical protein OsJ_01911 [Oryza sativa Japo...   421   e-115
gb|EEC70706.1| hypothetical protein OsI_02076 [Oryza sativa Indi...   379   e-102
ref|XP_002269821.2| PREDICTED: uncharacterized protein LOC100248...   290   1e-75
ref|XP_003522950.1| PREDICTED: uncharacterized protein LOC100812...   261   5e-67
ref|XP_002513079.1| conserved hypothetical protein [Ricinus comm...   253   1e-64

>gb|EEE54640.1| hypothetical protein OsJ_01911 [Oryza sativa Japonica Group]
Length = 803

Score = 421 bits (1082), Expect = e-115
Identities = 252/650 (38%), Positives = 370/650 (56%), Gaps = 11/650 (1%)
Frame = +1

Query: 223 RVGDTIWGEFSESEDHIVPYPKEREEDTLLICDDFGKKQKNEGSTTDIKS-AQQTSEDKR 399
+VGD IW EF+E+EDHIVPYPK+ E+ L+ G ++KN+ T +I +++S +
Sbjct: 176 KVGDAIWAEFNENEDHIVPYPKDTEDSALV---SVGDQKKNDEETDNIPGLTERSSSGQT 232

Query: 400 DLPGCDFENSSSCNANKEFSAPRLDLGSWPDLPSISAALSKGYNDENNESSLQAGLMDFP 579
+ P E + A++ +SA +LD+ SWPDLPS++A L + Y+D+N S+ +DF
Sbjct: 233 EFPV--LEKQPASQASEHYSATQLDVESWPDLPSLNATLDRNYSDDNIASTY----LDFS 286

Query: 580 EASNLNSVR----VQLEGNRELFNNEHDDREDDSFLDCDWANIGDLDDLDKIFRSNDSIF 747
A +L V VQL+G E+F N+H+++ + SFLDCDW NIGD DD D++F + DSIF
Sbjct: 287 SAPSLEKVTGNTTVQLDGETEVFGNDHEEKSN-SFLDCDWGNIGDFDDFDRLFSNGDSIF 345

Query: 748 GHDIGSNADEFLSSSADVISGTAQSIPMPDIPMSRDQPSDQEFPYPFSEHSDGIRKPEEK 927
G+++ ++ FLS+S+D++ T QSIP P IP+++ SD +E S G + E K
Sbjct: 346 GNEMVADGSNFLSASSDLVDTTVQSIPFPHIPLNKQLSSDHGSSLLINETSGGTTEQESK 405

Query: 928 TSDITVKTEEQTVSPSNLTYHSSGIQSQPSDKVDKQSKLVKPRKKAEDRSKNNFSQNLNG 1107
D K+ EQ ++L SG +Q + D Q K V+ R++ E+R K+ S + +G
Sbjct: 406 VVDANAKSGEQAEHKNHLNNEYSGKPNQFPKEGDVQKKSVRSRRRTEERGKSKMSNSTSG 465

Query: 1108 AWSNRISQSQQFPSPKAHPSLLTSVQTFQHPAISQQRQGESEHMGYPISSSQFMFSGYGF 1287
N Q Q P+ +H Q Q P ++++MG ++QF+F YG+
Sbjct: 466 FSQN---QGQHRPA-SSHSLAKAPAQPLQTPQYLLLH--DNKNMGQLQQANQFIFPDYGY 519

Query: 1288 PAYSFPAIPVLPCAPAERNQMAPVVVGYKSNVDQSKGNSS-DKLPD-SSKPLTMTPQEKI 1461
P+Y FP IP++ AE +Q P Y++++D K +SS +K D S+PL MTPQEKI
Sbjct: 520 PSYQFPGIPLMSNVQAESHQTKPATTNYRTSIDSPKQSSSAEKSQDIPSRPLMMTPQEKI 579

Query: 1462 EKXXXXXXXXXXXXXXXXXXXXXXXXXXXDSLASQACSQKNQNQDASTSTTGVEESANKI 1641
EK D++ QA S K++N D+ S+ ++++ANK+
Sbjct: 580 EKLRRRQQMQALIAIQQQQQQFGQDGSGSDTMVPQAYSPKSKNPDSLGSSVVIDDNANKV 639

Query: 1642 LSSEMSMLGELDESNRISLSIDDQSLEETIYYQLQDALRKLDTRVKLSIRDSLFRLAKSA 1821
S E+ G +E + S DD +EE IYYQLQDAL KLDTR + IRDSL RLA S
Sbjct: 640 FSLELIPTGH-EEIQKSSGIPDDPFIEEKIYYQLQDALAKLDTRTRRCIRDSLLRLAHSV 698

Query: 1822 MERQSASDRSSTNKINREDDEVAANIDGNGQDRSARFSDTETITNPIDRMVAHLLFHKPS 2001
ERQ SDRSS NK N++DDEV+ + R + S+ ET TNPIDR+VAHLLFHKP
Sbjct: 699 SERQITSDRSSANKSNKDDDEVSEDT----SKRRSPASEAETNTNPIDRIVAHLLFHKPC 754

Query: 2002 ESTTKPVTDELPRSPTL----CSTVSLSTPGSISEDQSENAEEMEVEPSL 2139
+ P +E+ S L S + PG SE+ +N +EM ++PSL
Sbjct: 755 SKVSTPAKEEIKSSTPLPTEPDSKIPTDAPGGPSENH-QNGQEMTLQPSL 803

>gb|EEC70706.1| hypothetical protein OsI_02076 [Oryza sativa Indica Group]
Length = 673

Score = 379 bits (972), Expect = e-102
Identities = 238/650 (36%), Positives = 351/650 (54%), Gaps = 11/650 (1%)
Frame = +1

Query: 223 RVGDTIWGEFSESEDHIVPYPKEREEDTLLICDDFGKKQKNEGSTTDIKS-AQQTSEDKR 399
+VGD IW EF+E+EDHIVPYPK+ E+ L+ G ++KN+ T +I +++S +
Sbjct: 88 KVGDAIWAEFNENEDHIVPYPKDTEDSALV---SVGDQKKNDEETDNIPGLTERSSSGQT 144

Query: 400 DLPGCDFENSSSCNANKEFSAPRLDLGSWPDLPSISAALSKGYNDENNESSLQAGLMDFP 579
+ P E + A++ +SA +LD+ SWPDLPS++A L + Y+D+N S+ +DF
Sbjct: 145 EFPV--LEKQPASQASEHYSATQLDVESWPDLPSLNATLDRNYSDDNIASTY----LDFS 198

Query: 580 EASNLNSVR----VQLEGNRELFNNEHDDREDDSFLDCDWANIGDLDDLDKIFRSNDSIF 747
A +L V VQL+G E+F N+H+++ + SFLDCDW NIGD DD D++F + DSIF
Sbjct: 199 SAPSLEKVTGNTTVQLDGETEVFGNDHEEKSN-SFLDCDWGNIGDFDDFDRLFSNGDSIF 257

Query: 748 GHDIGSNADEFLSSSADVISGTAQSIPMPDIPMSRDQPSDQEFPYPFSEHSDGIRKPEEK 927
G+++ ++ FLS+S+D++
Sbjct: 258 GNEMVADGSNFLSASSDLVV---------------------------------------- 277

Query: 928 TSDITVKTEEQTVSPSNLTYHSSGIQSQPSDKVDKQSKLVKPRKKAEDRSKNNFSQNLNG 1107
D K+ EQ ++L SG +Q + D Q K V+ R++ E+R K+ S + +G
Sbjct: 278 --DANAKSGEQAEHKNHLNNEYSGKPNQFPKEGDVQKKSVRSRRRTEERGKSKMSNSTSG 335

Query: 1108 AWSNRISQSQQFPSPKAHPSLLTSVQTFQHPAISQQRQGESEHMGYPISSSQFMFSGYGF 1287
N Q Q P+ +H Q Q P ++++MG ++QF+F GYG+
Sbjct: 336 FSQN---QGQHRPA-SSHSLAKAPAQPLQTPQYLLLH--DNKNMGQLQQANQFIFPGYGY 389

Query: 1288 PAYSFPAIPVLPCAPAERNQMAPVVVGYKSNVDQSKGNSS-DKLPD-SSKPLTMTPQEKI 1461
P+Y FP IP++ AE +Q P Y++++D K +SS +K D S+PL MTPQEKI
Sbjct: 390 PSYQFPGIPLMSNVQAESHQTKPATTNYRTSIDSPKQSSSAEKSQDIPSRPLMMTPQEKI 449

Query: 1462 EKXXXXXXXXXXXXXXXXXXXXXXXXXXXDSLASQACSQKNQNQDASTSTTGVEESANKI 1641
EK D++ QA S K++N D+ S+ ++++ANK+
Sbjct: 450 EKLRRRQQMQALIAIQQQQQQFGQDGSGSDTMVPQAYSPKSKNPDSLGSSVVIDDNANKV 509

Query: 1642 LSSEMSMLGELDESNRISLSIDDQSLEETIYYQLQDALRKLDTRVKLSIRDSLFRLAKSA 1821
S E+ G +E + S DD +EE IYYQLQDAL KLDTR + IRDSL RLA S
Sbjct: 510 FSLELIPTGH-EEIQKSSGIPDDPFIEEKIYYQLQDALAKLDTRTRRCIRDSLLRLAHSV 568

Query: 1822 MERQSASDRSSTNKINREDDEVAANIDGNGQDRSARFSDTETITNPIDRMVAHLLFHKPS 2001
ERQ SDRSS NK N++DDEV+ + R + S+ ET TNPIDR+VAHLLFHKP
Sbjct: 569 SERQITSDRSSANKSNKDDDEVSEDT----SKRRSPASEAETNTNPIDRIVAHLLFHKPC 624

Query: 2002 ESTTKPVTDELPRSPTL----CSTVSLSTPGSISEDQSENAEEMEVEPSL 2139
+ P +E+ S L S + PG SE+ +N +EM ++PSL
Sbjct: 625 SKVSTPAKEEIKSSTPLPTEPDSKIPTDAPGGPSENH-QNGQEMTLQPSL 673

>ref|XP_002269821.2| PREDICTED: uncharacterized protein LOC100248068 [Vitis vinifera]
 gi|297742697|emb|CBI35150.3| unnamed protein product [Vitis vinifera]
Length = 704

Score = 290 bits (743), Expect = 1e-75
Identities = 226/694 (32%), Positives = 330/694 (47%), Gaps = 56/694 (8%)
Frame = +1

Query: 202 MFDWNEDRVGDTIWGEFSESEDHIVPYPKEREEDTLLICDDFGKKQKN-EGSTTDIKSAQ 378
MFDWN++ + + IWG+ ES+DH VPYP E E+ FG K+ TD+K +
Sbjct: 1 MFDWNDEELANIIWGDAGESDDHTVPYPNENEKKPPAT---FGVNNKDWNQEVTDVKPTE 57

Query: 379 QTSED-KRDLPGCDFENSSSCNANKEFSAPRLDLGSWPDLPSISAALSKGYNDENNESSL 555
QT+ K G E+S++ + N+ +GSW DL S +AA + N+ S+
Sbjct: 58 QTASGAKIQFHGNKQEHSTNLDINEGLPGTGFSMGSWSDLSSSNAA-------KTNQDSM 110

Query: 556 QAGLMDFPEASNLNSVRVQLEGNRELFNNEHDDREDDSFLDCDWANIGDLDDLDKIFRSN 735
+ QL+ + E+F N+HD+ E F+D WANIG DDLD+IF ++
Sbjct: 111 --------------AETTQLDKDPEIFRNQHDENEQGDFVDYGWANIGSFDDLDRIFSND 156

Query: 736 DSIFGHDIGSNADEFLSSSADVISGTAQSIPMPDIPMSRDQPS----------------- 864
+FG+ NADE SSS + P+ P+S D PS
Sbjct: 157 APVFGNASLGNADELWSSST--------NSPVKSFPLSVDSPSLGLGALRNTSEHFEIKT 208

Query: 865 ------DQE-------FPYPFS-------------EHSDGIRKP---EEKTSDITVKTEE 957
DQ +P S E+ G KP ++ DI KT
Sbjct: 209 EHVEHEDQSSTPAYGIMNHPSSHGQQNTCATMDQVEYGGGKSKPIMKDQIAFDIVGKT-- 266

Query: 958 QTVSPSNLTYHSSGIQSQPSDKVDKQSKLVKPRKKAEDRSKNNFSQNLNGAWSNRISQSQ 1137
T S ++ ++ ++K + Q KL+K +KK E++++ QNL G W +Q Q
Sbjct: 267 -TTLNSQYAAENAATPNKFANKANGQKKLLKSQKKLEEKNEGKLLQNLYGTWCPPGNQFQ 325

Query: 1138 QFPSPKAHPSLLTSVQTFQHPAISQQRQGES-EHMGYPISSSQF-MFSGYGFPAYSFPAI 1311
Q+ A TSVQT +SQQRQ + E + Y SS F S Y + P +
Sbjct: 326 QYEIQFAP----TSVQTCPSSVLSQQRQLQGHESLHYQHISSPFTASSAYTDLSNKTPVM 381

Query: 1312 PVLPCAPAERNQMAPVVVGYKSNVDQSKGNSSDKLPDSS-KPLTMTPQEKIEKXXXXXXX 1488
P LP + +++ ++ Y+ V N +K D+ KPLTMTPQEK+EK
Sbjct: 382 PALPHTHSGQDKHQQLLSSYE--VPHDNANPLNKSLDAPVKPLTMTPQEKVEKLRRRQQI 439

Query: 1489 XXXXXXXXXXXXXXXXXXXXDSLASQACSQKNQNQDASTSTTGVEESANKILSSEMSMLG 1668
+ + CSQ+NQN ++ +EE+ + + S + +
Sbjct: 440 RAMLAIQKQQQQFNHQVSCTNPSITHRCSQENQNMHMESADGEIEENLSALSSLDPNSPM 499

Query: 1669 ELDESNRISLSIDDQSLEETIYYQLQDALRKLDTRVKLSIRDSLFRLAKSAMERQSASDR 1848
D+S IS+ ID S E+TI +LQD + KLD R++L IRDSLFRL++SAM+R + D
Sbjct: 500 GQDDSITISMKIDAYSAEDTILSRLQDIVLKLDIRIRLCIRDSLFRLSQSAMQRHFSCDT 559

Query: 1849 SSTNKINREDDEVAANIDGNGQDRSARFSDTETITNPIDRMVAHLLFHKPSESTTK-PVT 2025
SSTN+ +R+D E A + N +R R D ET TNPIDR VAHLLFH+P E + P
Sbjct: 560 SSTNQNSRDDHEFVAKEEINSHNRYVRMGDMETETNPIDRTVAHLLFHRPLELPGRHPEA 619

Query: 2026 DELP---RSPTLCSTVSL-STPGSISEDQSENAE 2115
E P R P T L ++P S + S+N +
Sbjct: 620 PESPFSSRLPCEHKTAGLVNSPMGCSPEGSKNKQ 653

>ref|XP_003522950.1| PREDICTED: uncharacterized protein LOC100812174 [Glycine max]
Length = 656

Score = 261 bits (668), Expect = 5e-67
Identities = 214/660 (32%), Positives = 327/660 (49%), Gaps = 20/660 (3%)
Frame = +1

Query: 202 MFDWNEDRVGDTIWGEFSESEDHIVPYPKEREEDTLLICDDFGKKQKNEGSTTDIKSAQQ 381
MFDWN++ + + IWGE ES+DHIVPYP+ E D KK+ N+ + + +
Sbjct: 1 MFDWNDEELANIIWGEGGESDDHIVPYPEVNE-------DVSNKKEWNQEAAATKLTELK 53

Query: 382 TSEDKRDLPGCDFENSSSCNANKEFSAPRLDLGSWPDLPSISAAL----SKGYNDENNES 549
E K D +SS+ + + E +WPDL S+A S G NN S
Sbjct: 54 RPEAKTDFHERKLGSSSNLDNSGELPTSGYGTNAWPDLALSSSAKIDHGSLGTEVSNNLS 113

Query: 550 SLQAGLMDFPEASNLNSVRVQLEGNRELFNNEHDDREDDSFLDCDWANIGDLDDLDKIFR 729
L + S+ Q E + E+F N H+ +E F+D WANIG DDLD+IF
Sbjct: 114 ELS-------KLSSSREETTQHEKHAEIFQNAHEGKEQGHFVDYGWANIGSFDDLDRIFS 166

Query: 730 SNDSIFGHDIGSNADEFLSSSADVISGTAQSIPMP-DIPMS----RDQPSDQEFPYPFSE 894
++D IFGH +++E L SS DV + A P+P D P S R++ E + +
Sbjct: 167 NDDPIFGHASLDSSNE-LWSSKDVSNNVA---PLPLDTPSSSGALRNRTESLEIKEEYVQ 222

Query: 895 HSD-GIRKPEEKTSDITVKTEEQTVSPSNLTYHSSGIQSQPSDK---VDKQSKLVKPRKK 1062
SD + EK + E + + + + G++S+P+ K V +Q L+K KK
Sbjct: 223 CSDESLDLSNEKIGGPGSQVIENSCT-TTANVGNGGVRSKPTGKEQQVFRQKNLLKTWKK 281

Query: 1063 AEDRSKNNFSQNLNGAWSNRISQSQQFPSPKAHPSLLTSVQTFQHPAISQQRQGESEHMG 1242
+ + + N Q+ WS + ++QF + A PS +Q+ + Q +Q +
Sbjct: 282 SLVKQEENTLQDFYDNWSPSAAPAKQFQNQLA-PS---GIQSSPSSILGQPKQIQGAETL 337

Query: 1243 YPISSSQFMFSG-YGFPAYSFPAIPVLPCAPAERNQMAPVVVGYKSNVDQSKGNSSDKLP 1419
Y + F S YG ++PA+P+L +Q P + GY+ V N + L
Sbjct: 338 YQNIINPFAASSVYGNLTNTYPAMPML-------SQTQPALSGYE--VSPGIVNPVNNLV 388

Query: 1420 DSSKPLTMTPQEKIEKXXXXXXXXXXXXXXXXXXXXXXXXXXXDSLASQACSQKNQNQDA 1599
DS KP MTPQEKIEK ++Q C + Q
Sbjct: 389 DSVKPQIMTPQEKIEKLRRRQQMQAMIAIQKQQQQLGHQVPSTSKSSTQKCPPEIQ---- 444

Query: 1600 STSTTGVEESANKILSSEMSMLGELDESNRISLSIDDQSLEETIYYQLQDALRKLDTRVK 1779
S + G ++ + + + + E D+SN +S+++ + +E+T+ Y+LQD + KLD +++
Sbjct: 445 SHLSDGTDDDLRTLPALDPPI--EQDDSNTMSVAVGNDFVEDTVLYRLQDIISKLDIKIR 502

Query: 1780 LSIRDSLFRLAKSAMERQSASDRSSTNKINREDDEVAANIDGNGQDRSARFSDTETITNP 1959
L IRDSLFRLA +R SD SSTNK +RE+ EVAA + Q+R AR D ET TNP
Sbjct: 503 LCIRDSLFRLA----QRHYTSDTSSTNKSSREELEVAAREESISQNRYARMPDVETETNP 558

Query: 1960 IDRMVAHLLFHKPSESTTKPVTDELPRSP--TLCSTVS---LSTPGS-ISEDQSENAEEM 2121
IDR VAHLLFH+P E T+ +D+L SP T C + + L+ P S + ++ S+N +++
Sbjct: 559 IDRTVAHLLFHRPME-LTQNYSDKL-ESPISTKCESKAANPLNFPVSCLPDEDSKNNQQL 616

>ref|XP_002513079.1| conserved hypothetical protein [Ricinus communis]
 gi|223548090|gb|EEF49582.1| conserved hypothetical protein [Ricinus communis]
Length = 735

Score = 253 bits (647), Expect = 1e-64
Identities = 212/639 (33%), Positives = 311/639 (48%), Gaps = 27/639 (4%)
Frame = +1

Query: 223 RVGDTIWGEFSESEDHIVPYPKEREEDTLLICDDFGKKQKNEGSTTDIKSAQQTSED-KR 399
++ + IW E ES+DHIVPYP E D K+++ T +IKS +Q + K
Sbjct: 61 KLTNIIWDEAGESDDHIVPYPGAVE--------DHSKEKEWSQETNNIKSEEQKAPGPKV 112

Query: 400 DLPGCDFENSSSCNANKEFSAPRLDLGSWPDLPSISAALSKGYNDENNESSLQAGLMDFP 579
D+ G E+SS+ N+++ SA + SWP+L +AA + + ++ ++S+ L +
Sbjct: 113 DIHGRKLESSSNFNSSEGASASGFGIDSWPNLSLSTAAKT---DQDSLDASVSNNLTEIT 169

Query: 580 EASNLNSVR-VQLEGNRELFNNEHDDREDDSFLDCDWANIGDLDDLDKIFRSNDSIFGHD 756
+ + VQL+ + E+F +E F+D WA+IG DDLD++F ++D IFG
Sbjct: 170 KLESSGGAETVQLDKDSEIFQK---GKEQGDFVDYGWASIGSFDDLDRMFSNDDPIFGTV 226

Query: 757 IGSNADEFLSSSADVISGTAQSIPM----PDIPMSRDQPSDQEFP----------YPFSE 894
SN DE SSS DV + S + P + + + + + F +PF+
Sbjct: 227 SLSNPDELWSSSKDVTNSPGNSFRIYSDSPTLGLGPLRNTSERFEIKTEYVHDDNHPFTL 286

Query: 895 HSDGIRKPEEKTSDITVKTEEQTVSPSNLTYHSSGIQSQPS--DKVDKQSKLVKPRKKAE 1068
+ P Q SP +G +S+ + +++ KQ K +K RKK E
Sbjct: 287 GYGKVNDPASHGM--------QNASPVLNQVDFAGGKSKATLKEQICKQKKTMKGRKKLE 338

Query: 1069 DRSKNNFSQNLNGAWSNRISQSQQFPSPKAH------PSLLTSVQTFQHPAISQQRQGES 1230
++S+ +L G WS+ S QF + A PS+L Q P Q +Q
Sbjct: 339 EQSELALYHDLYGNWSSAGSLPGQFKNQCAPNIVCSPPSILNQPSRLQGPESLQYQQ--- 395

Query: 1231 EHMGYPISSSQFMFSGYGFPAYSFPAIPVLP-CAPAERNQMAPVVVGYKSNVDQSKGNSS 1407
IS+S S YG + A+PVL E NQ V+ GY+ V NS
Sbjct: 396 ------ISTSLVASSAYGTVTNPYSAMPVLSQIQSGEFNQS--VLSGYE--VSSGNANSV 445

Query: 1408 DKLPDSS-KPLTMTPQEKIEKXXXXXXXXXXXXXXXXXXXXXXXXXXXDSLASQACSQKN 1584
+K DS K TMTPQEKIEK + S +N
Sbjct: 446 NKSADSLVKTQTMTPQEKIEKLRKRQQMQAMLAIQKQQQQFGHQVSCTGQSIAPRGSLEN 505

Query: 1585 QNQDASTSTTGVEESANKILSSEMSMLGELDESNRISLSIDDQSLEETIYYQLQDALRKL 1764
QNQ + VE+ + S L E D+S+ ISL+++D S E+++ Y+LQD + KL
Sbjct: 506 QNQHFEGTDLEVEDLSAFPAFDPNSPL-EQDDSSTISLAVNDYSAEDSVLYRLQDIIAKL 564

Query: 1765 DTRVKLSIRDSLFRLAKSAMERQSASDRSSTNKINREDDEVAANIDGNGQDRSARFSDTE 1944
D RV+L IRDSLFRLA+SAM+R ASD SSTN +R +++ A + +R+A S+ E
Sbjct: 565 DVRVRLCIRDSLFRLAQSAMQRHYASDTSSTNNSSR-NEQAATKDSTSAHNRNANMSEVE 623

Query: 1945 TITNPIDRMVAHLLFHKPSESTTK-PVTDELPRSPTLCS 2058
T TNPIDR VAHLLFH+P E + K P T E P S S
Sbjct: 624 TETNPIDRTVAHLLFHRPLELSGKHPDTPESPASTKFSS 662