BLASTX nr result
ID: Dioscorea21_contig00002312
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Dioscorea21_contig00002312 (3147 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003633429.1| PREDICTED: uncharacterized protein LOC100852... 150 2e-33 ref|XP_003555628.1| PREDICTED: uncharacterized protein LOC100803... 145 7e-32 ref|XP_003535782.1| PREDICTED: uncharacterized protein LOC100817... 135 5e-29 ref|XP_002525074.1| hypothetical protein RCOM_0745050 [Ricinus c... 134 2e-28 ref|XP_002311130.1| predicted protein [Populus trichocarpa] gi|2... 131 1e-27 >ref|XP_003633429.1| PREDICTED: uncharacterized protein LOC100852618 [Vitis vinifera] Length = 324 Score = 150 bits (380), Expect = 2e-33 Identities = 118/321 (36%), Positives = 154/321 (47%), Gaps = 18/321 (5%) Frame = +2 Query: 1664 ISPGRCIARDISDSLPLNADDKRLRGLPDVMGDLPLRAQHRLQFEREENAYXXXXXXXXX 1843 ISP RCI D SD + L +K +RGL D + + P+ + + FE E + Sbjct: 9 ISPNRCIGEDASDLVGLRHSEKFIRGLRDDIVE-PVFTRQQPPFEGVEGHFVQGNRNFSS 67 Query: 1844 XXXXXXXXXXXXX--NECSPHQWSPPRR-SPNIYEGHPELGHIRSPPILTHDRMRSPRQR 2014 SP WS PRR SP+ + GHPEL H RSP + DRMRSP R Sbjct: 68 IQRRGPPRIHSKSPMRSGSPGPWSSPRRRSPDGFNGHPELTHRRSPAVYRMDRMRSP-DR 126 Query: 2015 PCFPEDVIVRRPGSPQSFMTRMSDDGREIHLRR----------NDCMRPDRVLERNMQRF 2164 PCFPE+++ RR GSP F+ R S+D R++ R N R+L RN +RF Sbjct: 127 PCFPEEIVARRHGSP-PFLPRPSNDLRDMDSARDHGPPRSVIPNRRSPSGRILLRNSRRF 185 Query: 2165 DMIDPRXXXXXXXXXXXXXN--QFCELPGGGDERVDVRRICEERRSFVRPIRQRFNASED 2338 D+I+PR + +F EL GGD + RR ERR VR R +N + Sbjct: 186 DIIEPRERTDSDEFFGPPMHSGRFHEL--GGDGSGEERRRIGERRGPVRSFRPPYNGAGA 243 Query: 2339 EGSLHNHAGDGPPTFRFRPEAMEGFSERGG--SRDFNGHIQSRLGNVHDRLHGIDEREQY 2512 EG N DGP +RF PEA F ERG R+F+ +++R GN R E Sbjct: 244 EGFRFN-IEDGPRPYRFCPEADSEFLERGNLREREFDRRVKNRPGNAPRRSIEDQEGNYR 302 Query: 2513 HGRQGWSEADFAGV-RPKRRR 2572 HG Q W + F + R KRRR Sbjct: 303 HGEQVWHDQGFDDISRLKRRR 323 >ref|XP_003555628.1| PREDICTED: uncharacterized protein LOC100803295 [Glycine max] Length = 1378 Score = 145 bits (366), Expect = 7e-32 Identities = 194/664 (29%), Positives = 282/664 (42%), Gaps = 55/664 (8%) Frame = +2 Query: 746 LKMPLDEGVSEIKVENPEES-KIVDDSLCADERDSCISTADNQQEVSAVGCLXXXXXXXX 922 ++ ++ V ++ +E E S K++D S+C E S D + ++A G Sbjct: 757 IQSEINNEVVDMDIEMHERSGKVIDKSVCVQE-----SLDDEKSNIAAHGANVLQMKALD 811 Query: 923 XXXXXXXXXXXXXXXEKGSETKSNPKRDALPRTDKGGNDKLSSTPENSKGLDM---IPEK 1093 +E+ SN + D D++ T + K D+ E Sbjct: 812 LLDGKNVCEALV------AESPSNQATNGSHGVDFQCADEVVKTADIVKQTDLDFETMEV 865 Query: 1094 HLPIKDASLNVNLGKQESGKEKQSRIIKLTSATDRSSYGKVKSNDNKSPLMQSEEEKMTD 1273 DA+ +VN G RII L+ AT SS GK + +S ++ + ++D Sbjct: 866 SANADDAAKDVNNGGNPG------RIIVLSRATSSSSPGKTRPISGRSLSSRAGRDVLSD 919 Query: 1274 RPFMRIKPYFRETRDGFCKDRYQKFTSDRNQDQYSGKHRSDLIRMTERGNTHPDG----- 1438 RD D KF+ +R+QD R + +R R N+ D Sbjct: 920 S---LDGDKLHRGRDEVFIDGPHKFSRERHQDISPRNSRFNFVRGRGRLNSRLDSVRSEW 976 Query: 1439 ------SHVFH----RFK----------KSTGMRYPRHNNSSEIAYIPPGCRLVRKHGED 1558 S F+ +F+ T M Y +N + + +Y+ G RL RK D Sbjct: 977 ESDREFSGEFYNGPSQFRGPRPKYAPAFADTDMEY--NNVAPDGSYVGNG-RLGRKPLND 1033 Query: 1559 ESSNLTRLPSRRRSPGPVDREGHPVMGVQVAGRS--FISPGRCIARDISDSLPLNADDKR 1732 S + RRRSPG D G+Q+ R+ ISP RCI D SD + + +DK Sbjct: 1034 GSY----IAPRRRSPGGRD-------GIQIGHRNPRNISPNRCIG-DGSDLVGVRHNDKF 1081 Query: 1733 LRGLPDVMGDLPLRAQHRLQ-----FEREENAYXXXXXXXXXXXXXXXXXXXXXXNE--- 1888 +RGLP+ D + F R + + Sbjct: 1082 MRGLPEDNMDAMFTRSQTFEGMDGRFTRGSRNFSSMQRRGPPRIRSKSPIRSRSRSPGPW 1141 Query: 1889 CSPHQWSPPRRSPNIYEGHPELGHIRSPPILTHDRMRSPRQRPCFPEDVIVRRPGSPQSF 2068 SP + SP RRSP+ + GHPEL H RS P DRMRSP RP FP + +VRR GSP SF Sbjct: 1142 SSPRRRSPRRRSPDGFGGHPELSHRRS-PFYRVDRMRSP-DRPVFPAERVVRRHGSP-SF 1198 Query: 2069 MTRMSDDGREI-HLRRNDCMRPDRVLERNMQRFDMIDPR---XXXXXXXXXXXXXNQFCE 2236 M+R S+D R+I R + R R+L RN +RFD++DPR + E Sbjct: 1199 MSRPSNDMRDIDSARDHGHPRSGRILIRN-RRFDVVDPRDRAENDDEYFGGPMHSGRLLE 1257 Query: 2237 LPGGGDERVDVRRICEERRSFVRPIRQRFNASEDEGSLHNHAGDGPPTFRFRPEAMEGFS 2416 L G G+ + RR ERR VR R +N + E + H +A DGP +RF + + F Sbjct: 1258 LSGEGNG--EDRRRFGERRGPVRSFRPPYNNNVGE-NFHLNAEDGPRHYRFCSDDSD-FH 1313 Query: 2417 ERGGS----RDFNGHIQSRLGNVHD-RLHGIDEREQYH------GRQGWSEADFAGV-RP 2560 ERGG+ RDF+ I+ R NV R +DE+E+ G Q WS+ F + R Sbjct: 1314 ERGGNNIRERDFDRRIKGRPANVPPRRTRNMDEQEENFRHGGGGGGQVWSDDSFDDISRV 1373 Query: 2561 KRRR 2572 KR+R Sbjct: 1374 KRKR 1377 >ref|XP_003535782.1| PREDICTED: uncharacterized protein LOC100817471 [Glycine max] Length = 1396 Score = 135 bits (341), Expect = 5e-29 Identities = 202/781 (25%), Positives = 313/781 (40%), Gaps = 53/781 (6%) Frame = +2 Query: 326 TSSLPVEHVGSDDKGSITTNDLSSVIGTTGSNDEKLATHETSV-IENNEKKNSSYQFDKS 502 TSS+ E + D+ + ++ + N E A+ E + + + ++ SY D Sbjct: 615 TSSVSTEEENAADRDACRLKLMNEPPPASRGNGEGCASDEEKITLSTDMLEDDSYDSDSE 674 Query: 503 S--HHAITETDEHEHKTHKSGDVDG-IKKPA-PKXXXXXXXXXXXXXXXPNLRAEGGDRV 670 S +HA+T + E DG +++P P N + ++ Sbjct: 675 SDENHAVTIAVDTECYVEDDDYEDGEVREPLDPSTAEDVCEVREVEHPDSNFVNKQMEKG 734 Query: 671 LPGEDSSEKREANQLDGSTAVLDVPLKMPLDEGVSEIKVENPEES-KIVDDSLCADER-- 841 + D + + + TA+ + ++ V ++ +E E S K+VD ++C E Sbjct: 735 MVSGDCPTSYQVVEKNNMTAI-----QSEINNEVVDMDIEMHERSGKVVDKNVCVQESLD 789 Query: 842 -DSCISTADNQQEVSAVGCLXXXXXXXXXXXXXXXXXXXXXXXEKGSETKSNPKRDALPR 1018 + C + V+ + GS D + + Sbjct: 790 DEKCNIATHGNKPVNVLQMKALDLLEGKNVCEALVTESPSNQATNGSHGVDVQCADEVVK 849 Query: 1019 TDKGGNDKLSSTPENSKGLDMIPEKHLPIKDASLNVNLGKQESGKEKQSRIIKLTSATDR 1198 T D + T + + +++ KD + NLG RII L+ AT Sbjct: 850 T----TDIVKQTDLDFETMEVSANADDAAKDVNNGGNLG----------RIIDLSRATSS 895 Query: 1199 SSYGKVKSNDNKSPLMQSEEEKMTDRPFMRIKPYFRETRDGFCKDRYQKFTSDRNQDQYS 1378 SS GK + +S ++ + ++D RD D KF+ +R+QD Sbjct: 896 SSPGKTRPMSGRSLSSRAGRDVLSDT---LDGDKLHRGRDEVYIDGPHKFSRERHQDISP 952 Query: 1379 GKHRSDLIRMTERGNTHPDG-----------SHVFH----RFK----------KSTGMRY 1483 K R + +R R N D S F+ +F+ T M Y Sbjct: 953 RKTRMNFVRGRGRLNNRLDSVRNDWESDREFSGEFYNGPSQFRGPRPKYASAFADTDMEY 1012 Query: 1484 PRHNNSSEIAYIPPGCRLVRKHGEDESSNLTRLPSRRRSPGPVDREGHPVMGVQVAGRS- 1660 +N + + +Y+ G RL RK D S + RRRS G D G+Q+ R+ Sbjct: 1013 --NNVAPDGSYVGNG-RLGRKPLNDGSY----IAPRRRSSGGRD-------GIQIGHRNP 1058 Query: 1661 -FISPGRCIARDISDSLPLNADDKRLRGLPDVMGDLPLRAQHRLQ-----FEREENAYXX 1822 ISP RCI D SD + + ++K +R LP+ D + F R + Sbjct: 1059 RNISPNRCIG-DGSDLVGVRHNEKFMRSLPEDNMDAMFTRPQTFEGMDGRFTRGSRNFSS 1117 Query: 1823 XXXXXXXXXXXXXXXXXXXXNE---CSPHQWSPPRRSPNIYEGHPELGHIRSPPILTHDR 1993 + SP + SP RRSP+ + GHPEL H RS P DR Sbjct: 1118 MQRRGPPQIRSKSPIRSRSRSPGPWSSPRRRSPRRRSPDGFGGHPELTHRRS-PFYRVDR 1176 Query: 1994 MRSPRQRPCFPEDVIVRRPGSPQSFMTRMSDDGREI-HLRRNDCMRPDRVLERNMQRFDM 2170 MRSP RP FP + +VRR GSP SFM+R S+D R++ R + R R+L RN +RFD+ Sbjct: 1177 MRSP-DRPVFPAERVVRRHGSP-SFMSRPSNDMRDMDSARDHGHPRSGRILIRN-RRFDV 1233 Query: 2171 IDPR---XXXXXXXXXXXXXNQFCELPGGGDERVDVRRICEERRSFVRPIRQRFNASEDE 2341 +DPR + EL G G+ + RR ERR VR R +N + Sbjct: 1234 VDPRDRVDNDDEYFGGPMHSGRLLELSGEGNG--EDRRRFGERRGPVRSFRPPYNNNNVG 1291 Query: 2342 GSLHNHAGDGPPTFRFRPEAMEGFSERGGS----RDFNGHIQSRLGNVHD-RLHGIDERE 2506 S H +A DGP +RF + + F ERGG+ RDF I+ R NV R +DE+E Sbjct: 1292 ESFHLNAEDGPRHYRFCSDDSD-FHERGGNNLRERDFERRIKGRPANVPPRRTRNMDEQE 1350 Query: 2507 Q 2509 + Sbjct: 1351 E 1351 >ref|XP_002525074.1| hypothetical protein RCOM_0745050 [Ricinus communis] gi|223535655|gb|EEF37321.1| hypothetical protein RCOM_0745050 [Ricinus communis] Length = 1517 Score = 134 bits (337), Expect = 2e-28 Identities = 156/525 (29%), Positives = 229/525 (43%), Gaps = 30/525 (5%) Frame = +2 Query: 1088 EKHLPIKDASLNV-NLGKQESGKEKQSRIIKLTSATDRSSYGKVKSNDNKSPLMQSEEEK 1264 E LP + +N N K + QSRII L+ A++ SS+GK +S +K ++S E+ Sbjct: 961 ESALPKMETLINGDNAPKDANSGGNQSRIINLSIASNMSSFGKTRSISSKPLSLRSGRER 1020 Query: 1265 MTDRPFM--RIKPYFRETRDGFCKDRYQKFTSDRNQDQYSGK-----------HRSDLIR 1405 + D P R+ P RD D QKFT +R Q+ + + R D +R Sbjct: 1021 L-DVPLEGDRLHP---RGRDEAYNDGSQKFTRERYQESRNSRWNFIHGRGRLASRIDSLR 1076 Query: 1406 MTERGNTHPDGSHVFHRFKKSTGMRYPRHNNSSEIAYIPPGCRLVRKHGEDESSNLTRLP 1585 H + + + +N S+ + G R RK +D++ Sbjct: 1077 NDRDSERDCIPRHKYATAVAGSDTEFVNYNMGSDGVFAG-GVRGGRKLVDDDTPIFRHFS 1135 Query: 1586 SRRRSPGPVDREGHPVMGVQVAGRSFISPGRCIARDISDSLPLNADDKRLRGLPDVMGDL 1765 SRRRSPG R+G G+Q+ R R I D S+ + L +K +RG PD G+ Sbjct: 1136 SRRRSPGR--RDGPASRGLQMVRRV----PRSIDEDNSEVVGLRHTEKIMRGFPDD-GEE 1188 Query: 1766 PLRAQHRLQFEREENAYXXXXXXXXXXXXXXXXXXXXXX-NECSPHQWSPPRRSPNIYEG 1942 + + +E + + SP WS RRSP+ + G Sbjct: 1189 HSYSHTQPPYEGLDGPFVQGTRSFSVQRRGLPQMHSKSPIRSRSPGPWSSRRRSPDGFVG 1248 Query: 1943 HPELGHIRSPPILTHDRMRSPRQRPCFPEDVIVRRPGSPQSFMTRMSDDGREIHLRRNDC 2122 PEL H RS P+ +RMRSP P FP D + RR SP S+++R +D RE+ R D Sbjct: 1249 PPELPHRRS-PLYRMERMRSP-DNPGFPADRVGRRHSSP-SYLSR-PNDLREMDPSR-DH 1303 Query: 2123 MRPDRVLE-----------RNMQRFDMIDPRXXXXXXXXXXXXXN--QFCELPGGGDERV 2263 P ++ R +RF + DPR + +F EL G G+E Sbjct: 1304 GHPRSIISNRSPTGRGGLLRGSRRFGIGDPRERPENEEFFAGPVHSGRFHELGGDGNEE- 1362 Query: 2264 DVRRICEERRSFVRPIRQRFNASEDEGSLHNHAGDGPPTFRFRPEAMEGFSERGG--SRD 2437 RR ERR+ VR R FN ++ E + + + DGP +FRF PE F ER R+ Sbjct: 1363 --RRRFGERRAPVRSFRPPFNGTDGE-NFNFNTEDGPRSFRFYPEVDPDFHERPNLRERE 1419 Query: 2438 FNGHIQSRLGNVHDRLHGIDEREQYHGRQGWSEADFAGVRPKRRR 2572 F+ I++R GN R I+E+E + R G A+ V P R Sbjct: 1420 FDRRIKNRPGNAPRRPRSIEEQEGNY-RHGGQMAELVVVIPAEYR 1463 >ref|XP_002311130.1| predicted protein [Populus trichocarpa] gi|222850950|gb|EEE88497.1| predicted protein [Populus trichocarpa] Length = 1370 Score = 131 bits (330), Expect = 1e-27 Identities = 161/545 (29%), Positives = 235/545 (43%), Gaps = 45/545 (8%) Frame = +2 Query: 1058 ENSKGLDMIPEKHLPIKDASLNV-NLGKQESGKEKQSRIIKLTSATDRSSYGKVKSNDNK 1234 EN K + + + LP +ASLN ++ K S +SRII L A++ SS GK +S + Sbjct: 796 ENIK-TNYMEKNELPELEASLNGGDMAKDVSSS--RSRIINLPRASNSSSPGKTRSISGR 852 Query: 1235 SPLMQSEEEKMTDRPFMRIKPYFRETRDGFCKDRYQKFTSDRNQDQYSGKHRSDLIRM-- 1408 S +E++ D P K + + RD D ++F+ DR+Q+ + R + +R Sbjct: 853 P--FSSYQERLPDGPLEGGKLH-PQGRDEIYIDGPRRFSRDRHQEHFPRNSRMNFVRGRG 909 Query: 1409 -------TERGNTHPDGSHVFHRFKKSTGMRYPRHNNSSEIA----------YIPPG--- 1528 T RG+ + ++ + S+ RH +S A P G Sbjct: 910 RISSRIDTLRGDRDSERNYASEFYNGSSDFAVRRHKYASAAAEADSESINYNIAPDGSFV 969 Query: 1529 --CRLVRKHGEDESSNLTRLPSRRRSPGPVDREGHPVMGVQVAGRSFISPGRCIARDISD 1702 R RK +DE+ +PSRRRSP R+ G+Q+ R R I + S+ Sbjct: 970 GTARGGRKLLDDETPVFRNVPSRRRSPE--GRDVPAARGIQMVHRV----PRNIGEEGSE 1023 Query: 1703 SLPLNADDKRLRGLPDVMGDLPLRAQ---------HRLQFEREENAYXXXXXXXXXXXXX 1855 + + +RG PD + R H +Q R ++ Sbjct: 1024 VIGARHTEN-MRGFPDDGTEQAFRRPQPSYEGLDGHFVQGTRNYSSVHRRALPQFRSKSP 1082 Query: 1856 XXXXXXXXXNECSPHQWSPPRR-SPNIYEGHPELGHIRSPPILTHDRMRSPRQRPCFPED 2032 SP WS RR SP+ + G EL + RSP I + R+RSP P FP + Sbjct: 1083 IRSR--------SPGPWSSARRRSPDGFGGTSELSNRRSP-IYSMGRIRSP-DHPGFPRE 1132 Query: 2033 VIVRRPGSPQSFM----TRMSDDGREIHLRRNDCMRPDRVLERNMQRFDMIDPRXXXXXX 2200 ++VRR GSP TR +D G + N + RV RN +RF + DPR Sbjct: 1133 MVVRRHGSPPFLSRPPDTRETDPGHSRSIISNRG-QTGRVFLRNSRRFGITDPRERADSD 1191 Query: 2201 XXXXXXXN--QFCELPGGGDERVDVRRICEERRSFVRPIRQRFNASEDEGSLHNHAGDGP 2374 + +F +L GGD V+ RR ERR VR + FN + E + H + DGP Sbjct: 1192 EFFGGPIHSGRFHDL--GGDGNVEDRRRFSERRGPVRSFKPPFNGAGSE-NFHLNPEDGP 1248 Query: 2375 PTFRFRPEAMEGFSERGG--SRDFNGHIQSRLGNVHDRLHGIDERE--QYHGRQGWSEAD 2542 FRF PE F ER R+F+G I++R GN R GI+E+E HGRQ + Sbjct: 1249 RPFRFFPEDNPEFHERTNLREREFDGRIRNRPGNAPRRPRGIEEQEGNYRHGRQATYDCC 1308 Query: 2543 FAGVR 2557 G R Sbjct: 1309 CVGWR 1313