BLASTX nr result
ID: Dioscorea21_contig00015828
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Dioscorea21_contig00015828 (1492 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002275748.1| PREDICTED: uncharacterized protein LOC100265... 305 3e-80 ref|XP_003546785.1| PREDICTED: uncharacterized protein LOC100782... 294 5e-77 ref|XP_003542065.1| PREDICTED: uncharacterized protein LOC100812... 285 3e-74 ref|XP_003528437.1| PREDICTED: uncharacterized protein LOC100796... 262 2e-67 ref|XP_002521351.1| conserved hypothetical protein [Ricinus comm... 260 7e-67 >ref|XP_002275748.1| PREDICTED: uncharacterized protein LOC100265239 [Vitis vinifera] gi|297743028|emb|CBI35895.3| unnamed protein product [Vitis vinifera] Length = 631 Score = 305 bits (780), Expect = 3e-80 Identities = 201/476 (42%), Positives = 257/476 (53%), Gaps = 23/476 (4%) Frame = +1 Query: 133 MNPQQCALPRSSTNGIGRRREMNPRVESKMH-GKSASSSF---GLANGDKGVSFASPSHD 300 MN QQ A PR NG GRRREM R ++K+ GKS S G+ G KG + S S D Sbjct: 1 MNLQQVAQPRPFANGFGRRREMGSRQDNKLQSGKSNPSRLPNAGVFTGTKGGGYESSSRD 60 Query: 301 RLIFVMTCLIGQTVEVHLRNGSIISGIFYTSNTEKDFGIILKMAQVIKDGSSKGQRPFPD 480 RL+++ TC IG VEV ++NGSIISGIF+ +N +KDFGI+LKMA++ KDG +GQ+ D Sbjct: 61 RLVYLTTCFIGLPVEVQVKNGSIISGIFHATNADKDFGIVLKMARLTKDGPVRGQKAISD 120 Query: 481 IVKKPQT--MIIPARELVQVLAKDVPLSVDAFANGNAXXXXXXXXXXXXXXXXXXXXXXX 654 V K + +IIPA+ELVQV+AKDV ++ D F+N Sbjct: 121 SVSKAPSKILIIPAKELVQVIAKDVSVTRDGFSN---------------ELQQDKLQDIM 165 Query: 655 XXXXXXXXXXXXXXXXXXXWTPDKDDPECPELEDIFDGTWNRNWNQFETNEALFGVKSTF 834 W PD+D P+CPELE FDG W R W+QFE N+ LFGV STF Sbjct: 166 LDSIISQSRHIEMERELERWVPDEDIPQCPELEKTFDGPWKRGWDQFEINKKLFGVNSTF 225 Query: 835 DEELYTTKLERGPQMRQLXXXXXXXXXXXXXXDTKDLHLAEERGIHCHGDHDLDEETRFS 1014 DEE+YTTKL+RGPQ R+L +T DLHLAEERG+H H + D+DEE RFS Sbjct: 226 DEEIYTTKLDRGPQTRELEKEALRLAREIEGEETHDLHLAEERGLHLHANFDIDEEARFS 285 Query: 1015 AVRRDIDSRKFVESENAKLDTYNPNKTLKS---AIGRSYSDIAKRGIIDEAWALSTCSSV 1185 +V R +D + ++E+ LD++N S AIGR ++D+ D A S+ SSV Sbjct: 286 SVLRRVDISE--DNEDGMLDSHNDETFGGSSGLAIGRHFADLTTGKSSDVAQVSSSSSSV 343 Query: 1186 DEETGSQIPADRVLNPSGSGDHLNDCIARNSQSMDDGRLDEVQASDQDKEKGSANRCAER 1365 DE SQ L SGS DH +A +SQS R+ E Q S+Q A E+ Sbjct: 344 DEAQSSQSGTGLDLYHSGSHDHARH-LALDSQS----RVQENQFSEQQVGNNHAKEFVEK 398 Query: 1366 IS-HEEAQTVSFNDGQSLNIVE-------DLSANAAASAP------GQENEHSSSE 1491 + EEAQT D QSL + LS NA A AP QEN S +E Sbjct: 399 QTLAEEAQTSKSEDLQSLLDAKKDGSDKGGLSPNATAYAPSHGSSKSQENMGSPAE 454 >ref|XP_003546785.1| PREDICTED: uncharacterized protein LOC100782491 [Glycine max] Length = 639 Score = 294 bits (752), Expect = 5e-77 Identities = 202/483 (41%), Positives = 260/483 (53%), Gaps = 30/483 (6%) Frame = +1 Query: 133 MNPQQCALPRSSTNGIGRR---REMNPRVESK-MHGK---SASSSFGLANGDKGVSFASP 291 MN QQ +SS NG GRR RE + E+K + GK + ++ G G KG S+ SP Sbjct: 1 MNLQQAGQSKSS-NGYGRRKSEREGATKSENKILSGKLNANRLANAGAVTGSKGGSYESP 59 Query: 292 SHDRLIFVMTCLIGQTVEVHLRNGSIISGIFYTSNTEKDFGIILKMAQVIKDGSSKGQRP 471 SHDRL++V TCLIG VEV ++NGSI SGIF+ +NT+KDFGIILKMA + KDGS +GQ+ Sbjct: 60 SHDRLVYVTTCLIGHQVEVQVKNGSIYSGIFHATNTDKDFGIILKMACLTKDGSLRGQKS 119 Query: 472 FPDIVKKP--QTMIIPARELVQVLAKDVPLSVDAFANGNAXXXXXXXXXXXXXXXXXXXX 645 + V KP + +IIPA++LVQV A+DV ++ D AN Sbjct: 120 GTEFVSKPLSKILIIPAKDLVQVTAQDVAITRDGLAN---------------EYHHDMHQ 164 Query: 646 XXXXXXXXXXXXXXXXXXXXXXWTPDKDDPECPELEDIFDGTWNRNWNQFETNEALFGVK 825 W PD+DDP+CPELE+IFDG WNR W+QFETNEALFGVK Sbjct: 165 EIMVDSLISQSRHVDLGRELKPWVPDEDDPQCPELENIFDGHWNRGWDQFETNEALFGVK 224 Query: 826 STFDEELYTTKLERGPQMRQLXXXXXXXXXXXXXXDTKDLHLAEERGIHCHGDHDLDEET 1005 STF+E+LYTTKLE+GPQ R+L +T+DLHLAEERG+H H D D+DEET Sbjct: 225 STFNEDLYTTKLEKGPQTRELERQALRIAREIEGEETQDLHLAEERGLHLHEDFDIDEET 284 Query: 1006 RFSAVRRD--IDSRKFVESENAKLDTYNPNKTLKSAIGRSYSDIAKR-GII------DEA 1158 RFS+V R +D F E D++N G + + KR G I D A Sbjct: 285 RFSSVYRGKRVDDSGF--DEGVLFDSHNSETFGGETFGGVFGSVVKRPGEISGGKGNDGA 342 Query: 1159 WALSTCSSVDEETGSQIPADRVLNPSGSGDH---LNDCIARNSQSMDDG--RLDEVQASD 1323 L+ SSVD SQ L+ SGS DH L + S S DG R+ E S+ Sbjct: 343 QTLANSSSVDHTLSSQSNTGVDLSRSGSSDHAKQLASELPAKSYSTSDGESRIQENSNSN 402 Query: 1324 QDKEKGSANRCAERISHEEAQTVSFNDGQS-LNIVED------LSANAAASAPGQENEHS 1482 Q + G I E+ Q D Q L + +D LS NA++ AP + H+ Sbjct: 403 QHGDNGITKE-ENLIQAEDVQLSKSEDSQGPLYMNKDGSDKGVLSPNASSYAP---SSHT 458 Query: 1483 SSE 1491 SS+ Sbjct: 459 SSK 461 >ref|XP_003542065.1| PREDICTED: uncharacterized protein LOC100812754 [Glycine max] Length = 640 Score = 285 bits (728), Expect = 3e-74 Identities = 197/483 (40%), Positives = 259/483 (53%), Gaps = 30/483 (6%) Frame = +1 Query: 133 MNPQQCALPRSSTNGIGRR---REMNPRVESK-MHGK---SASSSFGLANGDKGVSFASP 291 MN QQ P+SS NG G R RE + E+K + GK + ++ G G KG S+ SP Sbjct: 1 MNLQQAGQPKSS-NGYGHRKSEREGATKSENKILSGKLNANRLANAGAVTGSKGGSYESP 59 Query: 292 SHDRLIFVMTCLIGQTVEVHLRNGSIISGIFYTSNTEKDFGIILKMAQVIKDGSSKGQRP 471 SHDRL++V TCLIG VEV ++NGSI SGIF+ +NT+KDFGIILKMA++ KDGS +GQ+ Sbjct: 60 SHDRLVYVTTCLIGHQVEVQVKNGSIYSGIFHATNTDKDFGIILKMARLTKDGSLRGQKS 119 Query: 472 FPDIVKKP--QTMIIPARELVQVLAKDVPLSVDAFANGNAXXXXXXXXXXXXXXXXXXXX 645 + V KP + +IIPA++LVQV A+DV ++ D AN + Sbjct: 120 GTEFVSKPPLKILIIPAKDLVQVTAQDVAITRDGLANES---------------HHDMHQ 164 Query: 646 XXXXXXXXXXXXXXXXXXXXXXWTPDKDDPECPELEDIFDGTWNRNWNQFETNEALFGVK 825 W PD++DP+CPELE+IFDG WNR W+QFETNEALFGVK Sbjct: 165 EIMVDSLISQSRHVDLGRELKPWVPDEEDPQCPELENIFDGHWNRGWDQFETNEALFGVK 224 Query: 826 STFDEELYTTKLERGPQMRQLXXXXXXXXXXXXXXDTKDLHLAEERGIHCHGDHDLDEET 1005 STF+EELYTTKLE+GPQ R+L +T+DLHLAEERG+H H D+DEET Sbjct: 225 STFNEELYTTKLEKGPQTRELEKQALRIAREIEGEETQDLHLAEERGLHLHEAFDIDEET 284 Query: 1006 RFSAVRR--DIDSRKFVESENAKLDTYNPNKTLKSAIGRSYSDIAKR-GII------DEA 1158 RFS+V R +D F E+ D++N G + + KR G I D A Sbjct: 285 RFSSVYRGKHVDDSGF--DEDILFDSHNSETFGDETFGGVFGSVVKRPGEISGGKGNDGA 342 Query: 1159 WALSTCSSVDEETGSQIPADRVLNPSGSGDH---LNDCIARNSQSMDDG--RLDEVQASD 1323 L+ SS+D Q L+ SGS DH L + S S DG R+ E S+ Sbjct: 343 RTLANSSSMDHTQSCQSNTCVDLSRSGSYDHAKQLASELPAKSYSTSDGESRIQENLNSN 402 Query: 1324 QDKEKGSANRCAERISHEEAQTVSFNDGQS-LNIVED------LSANAAASAPGQENEHS 1482 Q + + E+ Q D Q L +D LS NA++ AP + H+ Sbjct: 403 QHGDNAITKEENPIQAEEDVQLSRSEDSQGPLYSKKDGSDKGVLSPNASSYAP---SSHT 459 Query: 1483 SSE 1491 SS+ Sbjct: 460 SSK 462 >ref|XP_003528437.1| PREDICTED: uncharacterized protein LOC100796073 [Glycine max] Length = 621 Score = 262 bits (669), Expect = 2e-67 Identities = 187/473 (39%), Positives = 252/473 (53%), Gaps = 30/473 (6%) Frame = +1 Query: 133 MNPQQCALPRSSTNGIG---RRREMNPRVESKM-HGKSASSSFGLANGDKGVSFASPSHD 300 MN QQ P+SS NG G +E + ++K+ GKS +SS G+KG S+ SPSHD Sbjct: 1 MNLQQVGQPKSS-NGYGCWKSEKEGATKSDNKIPSGKSNASSRLAMTGNKGGSYGSPSHD 59 Query: 301 RLIFVMTCLIGQTVEVHLRNGSIISGIFYTSNTEKDFGIILKMAQVIKDGSSKGQRPFPD 480 RL+++ TCLIGQ VEV ++NGSI SGIF+ +N+ KDFGIILKMA + KD + +G+ + Sbjct: 60 RLVYLKTCLIGQHVEVQVKNGSIYSGIFHATNSGKDFGIILKMAHLTKDAALQGKESGVE 119 Query: 481 IVKKP--QTMIIPARELVQVLAKDVPLSVDAFANGNAXXXXXXXXXXXXXXXXXXXXXXX 654 V K +T+IIPA +LVQV+AKDV +S D + + Sbjct: 120 FVSKAPFKTLIIPANDLVQVIAKDVAVSRDGLPSES---------------HYDMHQEIM 164 Query: 655 XXXXXXXXXXXXXXXXXXXWTPDKDDPECPELEDIFDGTWNRNWNQFETNEALFGVKSTF 834 W PD+DDP+CPELE+IFDG WNR W+QFETNE LFGVKSTF Sbjct: 165 VDSVISQSCHVETGRELQRWVPDEDDPQCPELENIFDGPWNRGWDQFETNEMLFGVKSTF 224 Query: 835 DEELYTTKLERGPQMRQLXXXXXXXXXXXXXXDTKDLHLAEERGIHCHGDHDLDEETRFS 1014 +E+ YTTKLE+GP+ R+L +T+DLHLAEERG+ + + D+DEETRFS Sbjct: 225 NEDFYTTKLEKGPKTRELEKQALRIAREIEGEETQDLHLAEERGL--YHNFDIDEETRFS 282 Query: 1015 AVRR--DIDSRKFVESENAKLDTYNP----------NKTLKSAIGRSYSDIAKRGIIDEA 1158 +V R +D ++ E+E+ LD++N NK A G+ S+ A + Sbjct: 283 SVYRGKGVDDSEYDENEDKLLDSHNSETFDNIYDLVNKRPVEARGQKGSNGA------QT 336 Query: 1159 WALSTCSSVDEETGSQIPADRVLNPSGSGDH---LNDCIARNSQSMDDGRLDEVQASDQD 1329 W S SSVD SQ L SGS H L + S S DG+ +Q + + Sbjct: 337 W--SNFSSVDHSKLSQSSTGVDLCRSGSNYHAKQLASELPAQSCSFSDGK-SRIQQNSVN 393 Query: 1330 KEKGSANRCAER--ISHEEAQTVSFNDGQ-SLNIVED------LSANAAASAP 1461 G + E I E+ Q D Q SL + +D LS N A+ AP Sbjct: 394 NLHGVNDNTVEENWIQTEDVQLSKSEDLQSSLKLKKDGSDEGGLSTNVASCAP 446 >ref|XP_002521351.1| conserved hypothetical protein [Ricinus communis] gi|223539429|gb|EEF41019.1| conserved hypothetical protein [Ricinus communis] Length = 634 Score = 260 bits (664), Expect = 7e-67 Identities = 186/475 (39%), Positives = 248/475 (52%), Gaps = 22/475 (4%) Frame = +1 Query: 133 MNPQQCALPRSSTNGIGRRR---EMNPRVESKMH-GKS-ASSSFGLANGDKGVSFASPSH 297 M+ QQ P+S NG GRRR E R+++K+ GKS + S A G K + SPS Sbjct: 1 MSLQQPTQPKSYANGFGRRRAEREGGARLDNKLQSGKSNPNRSSSSAIGGKVGVYESPSR 60 Query: 298 DRLIFVMTCLIGQTVEVHLRNGSIISGIFYTSNTEKDFGIILKMAQVIKDGSSKGQRPFP 477 DRL+++ TCLIG VEVHL+NGSI SG +T+N EK+F IILKMA++ KD +GQ+ Sbjct: 61 DRLVYLSTCLIGHPVEVHLKNGSIYSGTCHTTNVEKEFAIILKMARLTKD-VFRGQKT-E 118 Query: 478 DIVKKP-QTMIIPARELVQVLAKDVPLSVDAFANGNAXXXXXXXXXXXXXXXXXXXXXXX 654 + K P +T IIP +E+VQV+AKDV +++D + Sbjct: 119 SLSKAPSKTFIIPGKEVVQVIAKDVSITMDGMTHD---------------LQHEKHQEIM 163 Query: 655 XXXXXXXXXXXXXXXXXXXWTPDKDDPECPELEDIFDGTWNRNWNQFETNEALFGVKSTF 834 W PD+DDP+CPELE+IFDG WNR W+QFETNE LFGVKSTF Sbjct: 164 IDSIISQSRHVEAGRELAPWVPDEDDPQCPELENIFDGPWNRGWDQFETNELLFGVKSTF 223 Query: 835 DEELYTTKLERGPQMRQLXXXXXXXXXXXXXXDTKDLHLAEERGIHCHGDHDLDEETRFS 1014 DEELYTTKLE+GPQMR+L +T+DLHLAEERG H H + D+DEETRFS Sbjct: 224 DEELYTTKLEKGPQMRELEKEAMRMAREIEGEETQDLHLAEERGNHFHENFDIDEETRFS 283 Query: 1015 AVRRD--IDSRKFVESENAKLDTYNPNK--TLKSAIGRSYSDIAKRGIIDEAWALSTCSS 1182 +V R +D + ESE+ LD+ N ++ + D+ D A LS C Sbjct: 284 SVYRGMALDDSGYEESEDILLDSRNAETFGDTPASFTKKSGDLTNGKSNDGARVLSKC-- 341 Query: 1183 VDEETGSQIPADRVLNPSGSGDHLNDCIAR-NSQSMDDGRLDEVQASDQDKEKGSANRCA 1359 SQ A L SGS +H + S+S+ + + E G+++ Sbjct: 342 ------SQSSAGVDLYHSGSYEHPRQLGSELPSKSLSTSETETRTHENLHGEHGASDCIK 395 Query: 1360 ERISHE----EAQTVSFNDGQ-SLNIVED------LSANAAASAPGQENEHSSSE 1491 E I + +A + D Q SL+ +D LS NA A AP S+E Sbjct: 396 EFIEEQTRTGDAPLPTCEDSQSSLDGKKDGSDKGVLSPNATAYAPSSNVSSKSNE 450