BLASTX nr result
ID: Dioscorea21_contig00010474
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Dioscorea21_contig00010474 (1919 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002285818.1| PREDICTED: uncharacterized protein LOC100260... 613 e-173 ref|XP_002302270.1| predicted protein [Populus trichocarpa] gi|2... 588 e-165 ref|XP_003544778.1| PREDICTED: uncharacterized protein LOC100776... 582 e-164 ref|XP_003544777.1| PREDICTED: uncharacterized protein LOC100776... 578 e-162 ref|XP_002533875.1| DNA-directed RNA polymerase II, putative [Ri... 577 e-162 >ref|XP_002285818.1| PREDICTED: uncharacterized protein LOC100260778 [Vitis vinifera] gi|302141899|emb|CBI19102.3| unnamed protein product [Vitis vinifera] Length = 478 Score = 613 bits (1580), Expect = e-173 Identities = 302/478 (63%), Positives = 366/478 (76%) Frame = +2 Query: 116 MARKPSTCALCEQSNLACICPVCVNHRLSDYNARLKPSRSKRDSLHKRLAAELVAGRKAA 295 M RK S+C++CE+SNLA IC VCVN+RL++YN LK S+ +RDSL+ RL+ LVA KA Sbjct: 1 MTRKTSSCSICEKSNLASICAVCVNYRLNEYNTSLKSSKGRRDSLYLRLSEVLVAKGKAD 60 Query: 296 DQVNWRIVQAEKLSKMNQRLHFXXXXXXXXXXXXXXVSNDLKIKNDVLDSAFAMLEQKRL 475 DQ+NWR++Q EKL+++ ++L +SNDLK+K +L+SA +MLE+ R+ Sbjct: 61 DQINWRVLQNEKLARLREKLRHRKEQYLDGKAKVEKMSNDLKLKYGLLESAMSMLEKNRV 120 Query: 476 EVLEKYYPNFICTQNLGLMAITSERLHKQSVVVKHICRLFPMHRVNADGEKNDGYGGLYD 655 E LEK+YPN ICTQNLGLMAITSER HKQSVV+K IC+LFP RVN DGEK DG YD Sbjct: 121 EQLEKFYPNLICTQNLGLMAITSERFHKQSVVIKQICKLFPQRRVNIDGEKKDGSSRPYD 180 Query: 656 QICYARLPRGLDPHSVPSEELAASLGYMVQXXXXXXXXXXXXXXXXXGFAGSCSRIWQRD 835 QIC RLPR LDPHSVPS+ELAASLGYMVQ GFAGSCSRIWQR+ Sbjct: 181 QICNVRLPRVLDPHSVPSDELAASLGYMVQLLNLVVYNLAAPALHNSGFAGSCSRIWQRE 240 Query: 836 SYWDARPSSPSKEYPLFIPRQNFCSSGGDNSWSDRGCSNFGVASVESERRPYLDSSGANS 1015 SYW+ RPSS S EYPLFIPRQN CS+ G+NSWS+R SNFG+AS+ES+R+P L+SSG++S Sbjct: 241 SYWNPRPSSRSNEYPLFIPRQNLCSTNGENSWSERSSSNFGIASMESDRKPRLESSGSSS 300 Query: 1016 FNYSSASPHTIETHKELQKGISLLKKSVACITAYCYNSLGLDIPVEATTFEAFAKLLATL 1195 FNYSSAS H++ETHK+LQKGISLLKKSVAC+T YCY+SL LD+P EA+TFEAFAKLLA L Sbjct: 301 FNYSSASLHSVETHKDLQKGISLLKKSVACLTTYCYSSLCLDVPTEASTFEAFAKLLAIL 360 Query: 1196 SSSKERHSVFSLKHAHTRSDKRTQRLNKSVCHANSVVSSSLAGSTHIIIPIRPNAIDNNL 1375 SSSKE SVFSLK A +RS K+ Q+LNKS+ + NS +SSS + +P+ N DNNL Sbjct: 361 SSSKEVRSVFSLKMACSRSCKQVQQLNKSIWNMNSAISSSTLLESAHTLPMTRNIFDNNL 420 Query: 1376 SNSTASFLYSADMAELGKIESLVEEWDLVEHPTLPPPPSQVEDTEHWTRAMFTDATKK 1549 NS ASFLY+ +M+++GK ESL+EEWDLVEH PPPPSQ ED EHWTRAM DATKK Sbjct: 421 PNSAASFLYTTEMSDIGKNESLIEEWDLVEHANFPPPPSQTEDIEHWTRAMIIDATKK 478 >ref|XP_002302270.1| predicted protein [Populus trichocarpa] gi|222843996|gb|EEE81543.1| predicted protein [Populus trichocarpa] Length = 475 Score = 588 bits (1517), Expect = e-165 Identities = 295/478 (61%), Positives = 357/478 (74%) Frame = +2 Query: 116 MARKPSTCALCEQSNLACICPVCVNHRLSDYNARLKPSRSKRDSLHKRLAAELVAGRKAA 295 M +K S CA+CE SN A ICP+CVN+RL++Y LK S+RDSL+ +L+ L+A KA Sbjct: 1 MNKKSSCCAICENSNRASICPICVNYRLNEYGTLLKSLNSRRDSLYSKLSVVLIAKGKAD 60 Query: 296 DQVNWRIVQAEKLSKMNQRLHFXXXXXXXXXXXXXXVSNDLKIKNDVLDSAFAMLEQKRL 475 DQ NWR+ Q EKL+ ++LH +S DLK KN +L+SA +LE+ R+ Sbjct: 61 DQFNWRVQQNEKLASSREKLHRNKEQLAQGKAKVEKLSQDLKKKNGMLESARNVLEKNRM 120 Query: 476 EVLEKYYPNFICTQNLGLMAITSERLHKQSVVVKHICRLFPMHRVNADGEKNDGYGGLYD 655 E LEK+YPN ICTQ+LG MAITSE LHKQSVV+K IC+LFP RVN DGE+N + G YD Sbjct: 121 EQLEKFYPNLICTQSLGHMAITSELLHKQSVVIKQICKLFPQRRVNVDGERN--FSGQYD 178 Query: 656 QICYARLPRGLDPHSVPSEELAASLGYMVQXXXXXXXXXXXXXXXXXGFAGSCSRIWQRD 835 QIC ARLPRGLDPHSV SEELAASLGYMVQ GFAGSCSRIWQRD Sbjct: 179 QICNARLPRGLDPHSVSSEELAASLGYMVQLLNLVAHNLAAPTLHNAGFAGSCSRIWQRD 238 Query: 836 SYWDARPSSPSKEYPLFIPRQNFCSSGGDNSWSDRGCSNFGVASVESERRPYLDSSGANS 1015 SYW+A PSS S EYPLFIPRQN+CS+ +NSW+D+ SNFGVAS+ESERRP+LDS+ +NS Sbjct: 239 SYWNACPSSRSNEYPLFIPRQNYCSTSSENSWTDKSSSNFGVASMESERRPHLDSTRSNS 298 Query: 1016 FNYSSASPHTIETHKELQKGISLLKKSVACITAYCYNSLGLDIPVEATTFEAFAKLLATL 1195 FNYSS SPH++ETHK+LQKG+SLLKKSVAC+TAYCYN L LD+P + +TFEAFAKLL+TL Sbjct: 299 FNYSSVSPHSVETHKDLQKGVSLLKKSVACVTAYCYNLLCLDVPSDTSTFEAFAKLLSTL 358 Query: 1196 SSSKERHSVFSLKHAHTRSDKRTQRLNKSVCHANSVVSSSLAGSTHIIIPIRPNAIDNNL 1375 SSSKE SVF+LK A +RS K+ Q+LNKSV + NS +SSS + + + N DNNL Sbjct: 359 SSSKEVRSVFNLKMACSRSCKQVQKLNKSVWNVNSAISSSALLESAHALQLMKNTSDNNL 418 Query: 1376 SNSTASFLYSADMAELGKIESLVEEWDLVEHPTLPPPPSQVEDTEHWTRAMFTDATKK 1549 NS ASFL++ +++ GK ES ++ WDLVEHPT PPPPSQVED EHWTRAMF DATKK Sbjct: 419 PNSAASFLFATGISD-GKNESFIDGWDLVEHPTFPPPPSQVEDIEHWTRAMFIDATKK 475 >ref|XP_003544778.1| PREDICTED: uncharacterized protein LOC100776426 isoform 2 [Glycine max] Length = 477 Score = 582 bits (1501), Expect = e-164 Identities = 293/478 (61%), Positives = 355/478 (74%) Frame = +2 Query: 116 MARKPSTCALCEQSNLACICPVCVNHRLSDYNARLKPSRSKRDSLHKRLAAELVAGRKAA 295 MARK S CA+CE SN A IC +CVN+RL++YN LK + +RDSL+ +L+ LV K Sbjct: 1 MARKTSNCAICENSNQASICSICVNYRLNEYNTSLKLLKDRRDSLYLKLSEVLVRKGKGD 60 Query: 296 DQVNWRIVQAEKLSKMNQRLHFXXXXXXXXXXXXXXVSNDLKIKNDVLDSAFAMLEQKRL 475 DQ NWR++Q EKL+++ ++L +S DLK+K +L+SA + LE+ R+ Sbjct: 61 DQANWRVLQHEKLARLKEKLRQSKEQVTQGRAKIETMSADLKLKYGLLESALSTLEKNRV 120 Query: 476 EVLEKYYPNFICTQNLGLMAITSERLHKQSVVVKHICRLFPMHRVNADGEKNDGYGGLYD 655 E LEK+YPN ICTQ+LG +AITSE LHK+SVV+K IC+LFP RV +GE+ DG G YD Sbjct: 121 EQLEKFYPNLICTQSLGHVAITSELLHKESVVIKQICKLFPQRRVVIEGERRDGCSGQYD 180 Query: 656 QICYARLPRGLDPHSVPSEELAASLGYMVQXXXXXXXXXXXXXXXXXGFAGSCSRIWQRD 835 QIC ARLPR LDPHSVPSEEL+ SLGYMVQ GFAGSCSRIWQRD Sbjct: 181 QICNARLPRALDPHSVPSEELSTSLGYMVQLLNLVIHNLAAPALHNSGFAGSCSRIWQRD 240 Query: 836 SYWDARPSSPSKEYPLFIPRQNFCSSGGDNSWSDRGCSNFGVASVESERRPYLDSSGANS 1015 SYWDARPSS S EYPLFIPRQN+CS+ G+NSWS+R SNFGVASVESERR LDSSG+ S Sbjct: 241 SYWDARPSSRSNEYPLFIPRQNYCSTDGENSWSERSSSNFGVASVESERRHRLDSSGSTS 300 Query: 1016 FNYSSASPHTIETHKELQKGISLLKKSVACITAYCYNSLGLDIPVEATTFEAFAKLLATL 1195 FNYS AS H+++THK+LQKGISLLKKSV CITAYCYNSL LD+P EA+TFEAFAKLLATL Sbjct: 301 FNYSLASSHSVQTHKDLQKGISLLKKSVVCITAYCYNSLCLDVPSEASTFEAFAKLLATL 360 Query: 1196 SSSKERHSVFSLKHAHTRSDKRTQRLNKSVCHANSVVSSSLAGSTHIIIPIRPNAIDNNL 1375 +SSKE SVFSLK A +R+ K+ Q+LNKSV + NS +SS+ + +P N I+N L Sbjct: 361 ASSKEVRSVFSLKMARSRTCKQVQQLNKSVWNMNSAISSTTLLESAHSVPTTKNLIENYL 420 Query: 1376 SNSTASFLYSADMAELGKIESLVEEWDLVEHPTLPPPPSQVEDTEHWTRAMFTDATKK 1549 +ST SFLY+AD+++ GK E L+E WD+VEHPT PPPPSQ ED EHWTRAMF DA K Sbjct: 421 PSSTGSFLYAADLSD-GKNECLIEGWDIVEHPTFPPPPSQSEDVEHWTRAMFIDAKGK 477 >ref|XP_003544777.1| PREDICTED: uncharacterized protein LOC100776426 isoform 1 [Glycine max] Length = 475 Score = 578 bits (1489), Expect = e-162 Identities = 295/479 (61%), Positives = 356/479 (74%), Gaps = 1/479 (0%) Frame = +2 Query: 116 MARKPSTCALCEQSNLACICPVCVNHRLSDYNARLKPSRSKRDSLHKRLAAELVAGRKAA 295 MARK S CA+CE SN A IC +CVN+RL++YN LK + +RDSL+ +L+ LV K Sbjct: 1 MARKTSNCAICENSNQASICSICVNYRLNEYNTSLKLLKDRRDSLYLKLSEVLVRKGKGD 60 Query: 296 DQVNWRIVQAEKLSKMNQRLHFXXXXXXXXXXXXXXVSNDLKIKNDVLDSAFAMLEQKRL 475 DQ NWR++Q EKL+++ ++L +S DLK+K +L+SA + LE+ R+ Sbjct: 61 DQANWRVLQHEKLARLKEKLRQSKEQVTQGRAKIETMSADLKLKYGLLESALSTLEKNRV 120 Query: 476 EVLEKYYPNFICTQNLGLMAITSERLHKQSVVVKHICRLFPMHRVNADGEKNDGYGGLYD 655 E LEK+YPN ICTQ+LG +AITSE LHK+SVV+K IC+LFP RV +GE+ DG G YD Sbjct: 121 EQLEKFYPNLICTQSLGHVAITSELLHKESVVIKQICKLFPQRRVVIEGERRDGCSGQYD 180 Query: 656 QICYARLPRGLDPHSVPSEELAASLGYMVQXXXXXXXXXXXXXXXXXGFAGSCSRIWQRD 835 QIC ARLPR LDPHSVPSEEL+ SLGYMVQ GFAGSCSRIWQRD Sbjct: 181 QICNARLPRALDPHSVPSEELSTSLGYMVQLLNLVIHNLAAPALHNSGFAGSCSRIWQRD 240 Query: 836 SYWDARPSSPSKEYPLFIPRQNFCSSGGDNSWSDRGCSNFGVASVESERRPYLDSSGANS 1015 SYWDARPSS S EYPLFIPRQN+CS+ G+NSWS+R SNFGVASVESERR LDSSG+ S Sbjct: 241 SYWDARPSSRSNEYPLFIPRQNYCSTDGENSWSERSSSNFGVASVESERRHRLDSSGSTS 300 Query: 1016 FNYSSASPHTIETHKELQKGISLLKKSVACITAYCYNSLGLDIPVEATTFEAFAKLLATL 1195 FNYS AS H+++THK+LQKGISLLKKSV CITAYCYNSL LD+P EA+TFEAFAKLLATL Sbjct: 301 FNYSLASSHSVQTHKDLQKGISLLKKSVVCITAYCYNSLCLDVPSEASTFEAFAKLLATL 360 Query: 1196 SSSKERHSVFSLKHAHTRSDKRTQRLNKSVCHANSVVSS-SLAGSTHIIIPIRPNAIDNN 1372 +SSKE SVFSLK A +R+ K+ Q+LNKSV + NS +SS +L S H + R I+N Sbjct: 361 ASSKEVRSVFSLKMARSRTCKQVQQLNKSVWNMNSAISSTTLLESAHSVPTTR---IENY 417 Query: 1373 LSNSTASFLYSADMAELGKIESLVEEWDLVEHPTLPPPPSQVEDTEHWTRAMFTDATKK 1549 L +ST SFLY+AD+++ GK E L+E WD+VEHPT PPPPSQ ED EHWTRAMF DA K Sbjct: 418 LPSSTGSFLYAADLSD-GKNECLIEGWDIVEHPTFPPPPSQSEDVEHWTRAMFIDAKGK 475 >ref|XP_002533875.1| DNA-directed RNA polymerase II, putative [Ricinus communis] gi|223526176|gb|EEF28506.1| DNA-directed RNA polymerase II, putative [Ricinus communis] Length = 478 Score = 577 bits (1488), Expect = e-162 Identities = 292/478 (61%), Positives = 354/478 (74%) Frame = +2 Query: 116 MARKPSTCALCEQSNLACICPVCVNHRLSDYNARLKPSRSKRDSLHKRLAAELVAGRKAA 295 M +K S CA+CE SN A IC VCVN+RL++Y+ LK +S+RD L+ RL+ LVA KA Sbjct: 1 MNKKSSCCAICENSNRASICTVCVNYRLNEYSTLLKSLKSRRDLLYSRLSEVLVAKGKAD 60 Query: 296 DQVNWRIVQAEKLSKMNQRLHFXXXXXXXXXXXXXXVSNDLKIKNDVLDSAFAMLEQKRL 475 DQ+NWR+ Q EKL+ + ++L +S+DL K +L+S+ + LE+ R+ Sbjct: 61 DQLNWRVHQNEKLANLREKLLRSKEQLIQAKAKTEKMSSDLNAKYGLLESSRSALEKNRV 120 Query: 476 EVLEKYYPNFICTQNLGLMAITSERLHKQSVVVKHICRLFPMHRVNADGEKNDGYGGLYD 655 + LEKY+PN ICTQ+LG MAITSE LH SV VK IC+LFP RV +GEK DG G YD Sbjct: 121 DQLEKYFPNLICTQSLGHMAITSELLHNLSVTVKQICKLFPQRRVIVEGEKKDGSSGQYD 180 Query: 656 QICYARLPRGLDPHSVPSEELAASLGYMVQXXXXXXXXXXXXXXXXXGFAGSCSRIWQRD 835 QIC ARLPRGLDPHS+PSEELAASLGYMVQ GFAGSCSRIWQRD Sbjct: 181 QICNARLPRGLDPHSIPSEELAASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRD 240 Query: 836 SYWDARPSSPSKEYPLFIPRQNFCSSGGDNSWSDRGCSNFGVASVESERRPYLDSSGANS 1015 SYW+ARPSS S EYPLFIPRQ +CS+ G+NSW+DR SNFGVAS+ESERR LDSS ++S Sbjct: 241 SYWNARPSSRSNEYPLFIPRQRYCSTSGENSWTDRSSSNFGVASMESERRARLDSSRSSS 300 Query: 1016 FNYSSASPHTIETHKELQKGISLLKKSVACITAYCYNSLGLDIPVEATTFEAFAKLLATL 1195 FNY+SASPH++ETHK+LQKGISL+KKSVAC+TAY YN L LD+P EA+TFEAFAKLLATL Sbjct: 301 FNYNSASPHSVETHKDLQKGISLMKKSVACVTAYGYNLLCLDVPAEASTFEAFAKLLATL 360 Query: 1196 SSSKERHSVFSLKHAHTRSDKRTQRLNKSVCHANSVVSSSLAGSTHIIIPIRPNAIDNNL 1375 SSSKE SVFSLK A +RS K+ Q+LNKSV + NS++SSS + + N DNNL Sbjct: 361 SSSKEVRSVFSLKMACSRSCKQVQKLNKSVWNVNSIISSSTLMESAHAPHLTKNINDNNL 420 Query: 1376 SNSTASFLYSADMAELGKIESLVEEWDLVEHPTLPPPPSQVEDTEHWTRAMFTDATKK 1549 NS SFL++ ++++ GK ESL++ WDLVEHPT PPPPSQ ED EHWTRAMF DATKK Sbjct: 421 RNSATSFLFANEISDAGKNESLIDGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 478