BLASTX nr result
ID: Dioscorea21_contig00000873
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Dioscorea21_contig00000873 (1819 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera] 345 2e-92 ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258... 343 7e-92 ref|XP_002521936.1| conserved hypothetical protein [Ricinus comm... 339 2e-90 sp|A2Y040.1|RPAP2_ORYSI RecName: Full=Putative RNA polymerase II... 333 9e-89 sp|Q6AVZ9.1|RPAP2_ORYSJ RecName: Full=Putative RNA polymerase II... 332 2e-88 >emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera] Length = 659 Score = 345 bits (886), Expect = 2e-92 Identities = 212/507 (41%), Positives = 293/507 (57%), Gaps = 21/507 (4%) Frame = +3 Query: 18 GDVSMEEWLGPSDAIEGYVPLRDRKQGAKYMSEDE-----STEMVDDAGN---GEMGFTS 173 G+VSME+W+GPS+AIEGYVP RDR K + + S +D N EM F S Sbjct: 161 GEVSMEDWIGPSNAIEGYVPQRDRNLKPKNIKNHKEGSKSSNSKMDSGKNFVIDEMDFVS 220 Query: 174 CLIMENELSVPQSDKSSVHQQDISNMIAKQ-LENLAIEEKNSPRERTSGKTRNRKTLKKV 350 +I ++E S+ +S K + S+ +K+ E +I ++ S E+++ +N K Sbjct: 221 TIITKDEYSISKSSKGL--KDTTSHAKSKEPKEKASIGDQLSMLEKSAPPIQNDSESKL- 277 Query: 351 NACKTEEKIKRAAIVCEQSKATPPNKHSVVLEDLSEQXXXXXXXXXXXXXXTVSNEAVLK 530 E K +R+ ++ + +T SV + SE K Sbjct: 278 ----RESKGRRSRVIFKDEFSTA-EVPSVPSQSGSELNGVKGKEEYHTENAAQLGPTKPK 332 Query: 531 SSLK-SRGSKDGKTVTWADETYKAPEKKD-----------ADHGGSSNAQASHDDADEDL 674 SSLK S G K ++VTWADE + + +D D G + DD L Sbjct: 333 SSLKPSGGKKVIRSVTWADEKMDSADSRDFCKVRELEVKKEDPNGLGDIDVGDDD--NAL 390 Query: 675 RLXXXXXXXXXXXXXXETVXXXXXXXXXXXXXXXIIILPQPQNNEKGGIEEDEEIFELDR 854 R E V IIILP P++ ++G +D ++ E + Sbjct: 
391 RFASAEACAVALSQAAEAVASGETDMTDAVSEAGIIILPHPRDMDEGESLKDADLLEPEP 450 Query: 855 GRVKWPTKPVLLDTDMFEVEDSWHDTPPEGFKLTLSSFATMWMALFGWITSSSIAYIYGY 1034 +KWP KP + +D+F+ +DSW+DTPPEGF LTLS FATMWMALF WITSSSIAYIYG Sbjct: 451 VPLKWPIKPGISHSDIFDSDDSWYDTPPEGFSLTLSPFATMWMALFAWITSSSIAYIYGR 510 Query: 1035 NESSHEDFLTVNGREYPRKIVLQDGKSSEIRQTLDICVGRAIPALVMDLRLSLPVSSLEK 1214 +ES HE++L+VNGREYP+KIVL DG+SSEI+QTL C+ RA+P LV DLRL +PVS+LE+ Sbjct: 511 DESFHEEYLSVNGREYPKKIVLTDGRSSEIKQTLAGCLSRALPGLVADLRLPIPVSNLEQ 570 Query: 1215 AVGQLLDTMSLTEAVPAFRTKQWHVIVLLFLEALSLHRLPALAQRMANRNTLLHKVLNPA 1394 VG+LLDTMS +A+P+FR KQW VIVLLF++ALS+ R+PAL M +R L KV + A Sbjct: 571 GVGRLLDTMSFVDALPSFRMKQWQVIVLLFIDALSVCRIPALTPHMTSRRMLFPKVFDAA 630 Query: 1395 KITSEEYKTMVDLIIPLGRVPQTNSQT 1475 ++++EEY+ M DLIIPLGRVPQ ++Q+ Sbjct: 631 QVSAEEYEVMKDLIIPLGRVPQFSAQS 657 >ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258021 [Vitis vinifera] gi|296089830|emb|CBI39649.3| unnamed protein product [Vitis vinifera] Length = 659 Score = 343 bits (881), Expect = 7e-92 Identities = 210/507 (41%), Positives = 293/507 (57%), Gaps = 21/507 (4%) Frame = +3 Query: 18 GDVSMEEWLGPSDAIEGYVPLRDRKQGAKYMSE----DESTEMVDDAGNG----EMGFTS 173 G+VSME+W+GPS+AIEGYVP RDR K + +S+ D+G EM F Sbjct: 161 GEVSMEDWIGPSNAIEGYVPQRDRNLKPKNIKNRKEGSKSSNSKMDSGKNFVIDEMDFVR 220 Query: 174 CLIMENELSVPQSDKSSVHQQDISNMIAKQ-LENLAIEEKNSPRERTSGKTRNRKTLKKV 350 +I E+E S+ +S K + S+ +K+ E +I ++ S E+++ +N K Sbjct: 221 TIITEDEYSISKSSKGL--KDTTSHAKSKEPKEKASIGDQLSMLEKSAPPIQNDSESKL- 277 Query: 351 NACKTEEKIKRAAIVCEQSKATPPNKHSVVLEDLSEQXXXXXXXXXXXXXXTVSNEAVLK 530 E K +R+ ++ + +T SV + SE LK Sbjct: 278 ----RESKGRRSRVIFKDEFSTA-EVPSVPSQSGSELNGVKGKEEYHTENAAQLGPTKLK 332 Query: 531 SSLK-SRGSKDGKTVTWADETYKAPEKKD-----------ADHGGSSNAQASHDDADEDL 674 S LK S G K ++VTWADE + + +D D G + DD L Sbjct: 333 SCLKPSGGKKVTRSVTWADEKMDSADSRDFCKVRELEVKKEDPNGLGDIDVGDDD--NAL 390 Query: 675 RLXXXXXXXXXXXXXXETVXXXXXXXXXXXXXXXIIILPQPQNNEKGGIEEDEEIFELDR 854 R E V IIILP P++ ++G +D ++ E + Sbjct: 391 
RFASAEACAIALSQAAEAVASGETDMTDAVSEARIIILPHPRDMDEGESLKDADLLEPEP 450 Query: 855 GRVKWPTKPVLLDTDMFEVEDSWHDTPPEGFKLTLSSFATMWMALFGWITSSSIAYIYGY 1034 +KWP KP + +D+F+ +DSW+DTPPEGF LTLS FATMWMALF WITSSSIAYIYG Sbjct: 451 VPLKWPIKPGISHSDIFDSDDSWYDTPPEGFSLTLSPFATMWMALFAWITSSSIAYIYGR 510 Query: 1035 NESSHEDFLTVNGREYPRKIVLQDGKSSEIRQTLDICVGRAIPALVMDLRLSLPVSSLEK 1214 +ES HE++L+VNGREYP+KIVL DG+SSEI+QTL C+ RA+P LV DLRL +PVS+LE+ Sbjct: 511 DESFHEEYLSVNGREYPKKIVLTDGRSSEIKQTLAGCLARALPGLVADLRLPIPVSNLEQ 570 Query: 1215 AVGQLLDTMSLTEAVPAFRTKQWHVIVLLFLEALSLHRLPALAQRMANRNTLLHKVLNPA 1394 VG+LLDTMS +A+P+FR KQW VIVLLF++ALS+ ++PAL M ++ L KV + A Sbjct: 571 GVGRLLDTMSFVDALPSFRMKQWQVIVLLFIDALSVCQIPALTPHMISKRMLFPKVFDAA 630 Query: 1395 KITSEEYKTMVDLIIPLGRVPQTNSQT 1475 ++++EEY+ M DLIIPLGRVPQ ++Q+ Sbjct: 631 QVSAEEYEVMKDLIIPLGRVPQFSAQS 657 >ref|XP_002521936.1| conserved hypothetical protein [Ricinus communis] gi|223538861|gb|EEF40460.1| conserved hypothetical protein [Ricinus communis] Length = 645 Score = 339 bits (869), Expect = 2e-90 Identities = 200/504 (39%), Positives = 280/504 (55%), Gaps = 20/504 (3%) Frame = +3 Query: 18 GDVSMEEWLGPSDAIEGYVPLRDRKQGAKYMSEDESTEMV-------DDAGNGEMGFTSC 176 G VS+EEW+GPS+AIEGYVP DR + E + + D + FTS Sbjct: 160 GKVSLEEWIGPSNAIEGYVPQGDRDPNPSLKNHKEGLKAICKKPVSKQDCFFSDTDFTST 219 Query: 177 LIMENELSV---PQSDKSSVHQQDISNMIAKQLENLAIEEKNSPRERTSGKTRNRKTLKK 347 +I +E S+ P S+ + K E L + + ++ + +R K +K Sbjct: 220 IITNDEYSISKGPSGLTSTASDIKLQAQTGKGHEGLNAQLSSLRKQDSIKASRKSKGRRK 279 Query: 348 VNACKTEEKIKRAAIVCEQSKATPPNKHSVVLEDLSEQXXXXXXXXXXXXXXTVSNEAVL 527 EK+ + + + ++ + ++ ED+S+ NE+VL Sbjct: 280 -------EKVIKEQLNFQDLPSS--SYYTAEAEDISQATGAANL-----------NESVL 319 Query: 528 KSSLKSRGSK-DGKTVTWADETY---------KAPEKKDADHGGSSNAQASHDDADEDLR 677 K SLKS G+K ++VTWADE + E + + + A+ D LR Sbjct: 320 KPSLKSSGAKRSNRSVTWADERVDNAGSRNLCEVQEMEQTNESHEISESANKGDDGHMLR 379 Query: 678 LXXXXXXXXXXXXXXETVXXXXXXXXXXXXXXXIIILPQPQNNEKGGIEEDEEIFELDRG 857 E V II+LP Q+ +GG E ++ E + Sbjct: 380 
FESAEACAVALSQAAEAVASGDADVNKAMSEAGIIVLPPSQDLGQGGNVEKNDMIEQESA 439 Query: 858 RVKWPTKPVLLDTDMFEVEDSWHDTPPEGFKLTLSSFATMWMALFGWITSSSIAYIYGYN 1037 +KWPTKP + +D+F+ EDSW+D PPEGF LTLS FATMWMALF W+TSSS+AYIYG + Sbjct: 440 SLKWPTKPGIPQSDLFDPEDSWYDAPPEGFSLTLSPFATMWMALFAWVTSSSLAYIYGRD 499 Query: 1038 ESSHEDFLTVNGREYPRKIVLQDGKSSEIRQTLDICVGRAIPALVMDLRLSLPVSSLEKA 1217 ES+HED+L+VNGREYPRKIVL+DG+SSEIR T + C+ R P LV +LRL +PVS+LE+ Sbjct: 500 ESAHEDYLSVNGREYPRKIVLRDGRSSEIRLTAESCLARTFPGLVANLRLPIPVSTLEQG 559 Query: 1218 VGQLLDTMSLTEAVPAFRTKQWHVIVLLFLEALSLHRLPALAQRMANRNTLLHKVLNPAK 1397 G+LL+TMS +A+PAFRTKQW VI LLF+EALS+ R+PAL M +R +LH+VL+ A Sbjct: 560 AGRLLETMSFVDALPAFRTKQWQVIALLFIEALSVCRIPALTSYMTSRRMVLHQVLDGAH 619 Query: 1398 ITSEEYKTMVDLIIPLGRVPQTNS 1469 I++EEY M D ++PLGR PQ S Sbjct: 620 ISAEEYDIMKDFMVPLGRDPQARS 643 >sp|A2Y040.1|RPAP2_ORYSI RecName: Full=Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog; AltName: Full=RNA polymerase II-associated protein 2 homolog gi|125550741|gb|EAY96450.1| hypothetical protein OsI_18345 [Oryza sativa Indica Group] Length = 726 Score = 333 bits (854), Expect = 9e-89 Identities = 210/546 (38%), Positives = 295/546 (54%), Gaps = 59/546 (10%) Frame = +3 Query: 9 AGNGDVSMEEWLGPSDAIEGYVPLRDR-----KQGAKY---MSEDESTEMVDDAGNGEMG 164 AG G+V+++EW+GPSDAIEGYVP RDR K+ AK S ++S+ + D+ N G Sbjct: 181 AGTGEVTLQEWIGPSDAIEGYVPRRDRVVGGPKKEAKQNDACSAEQSSNINVDSRNASSG 240 Query: 165 FTSCLIMENELS------------VPQSDKSSVHQQDISNMIAKQLENLAIEEK-----N 293 + ++ EN + Q + + + IS+ I KQLE++ +EEK N Sbjct: 241 ESGMVLTENTKAKKKEATKTPLKMFKQDEDNDMLSSCISDSIVKQLEDVVLEEKKDKKKN 300 Query: 294 SPRERTSGKTRNRKTLKKVNACKTEEKIKRAAIVCEQ--------------------SKA 413 + TS +++ + V E I+ + + Sbjct: 301 KAAKGTSRVGKSKPAKRPVGRDGHEVDFTSTIIMGDHGSEMMDHGALGQYNFSSSILANE 360 Query: 414 TPPNKHSVVLEDLS------EQXXXXXXXXXXXXXXTVSNEAVLKSSLKSRGSKD-GKTV 572 P + ++ + ++ S L+SSLK+ GSK+ G++V Sbjct: 361 QPSSSQYAAIDSVQAYTEELDELFSNAVNIAKDETSDDSGRCTLRSSLKAVGSKNAGRSV 420 Query: 573 
TWADETYKAPEKKDADHGGSSNAQASHDDADEDLRLXXXXXXXXXXXXXXETVXXXXXXX 752 WADE E A SS +Q S D + +R E + Sbjct: 421 KWADENGSVLETSRAFVSHSSKSQESMDSS---VRRESAEACAAALIEAAEAISSGTSEV 477 Query: 753 XXXXXXXXIIILPQPQNNEKGGIEEDE-------EIFELDRGRVKWPTKPVLLDTDMFEV 911 IIILP N ++ + D EIFE+DRG VKWP K VLLDTDMF+V Sbjct: 478 EDAVSKAGIIILPDMVNQQQYNNDYDNDKDAGENEIFEIDRGVVKWPKKTVLLDTDMFDV 537 Query: 912 EDSWHDTPPEGFKLTLSSFATMWMALFGWITSSSIAYIYGYNESSHEDFLTVNGREYPRK 1091 +DSWHDTPPEGF LTLSSFATMW ALFGW++ SS+AY+YG +ESS ED L GRE P+K Sbjct: 538 DDSWHDTPPEGFSLTLSSFATMWAALFGWVSRSSLAYVYGLDESSMEDLLIAGGRECPQK 597 Query: 1092 IVLQDGKSSEIRQTLDICVGRAIPALVMDLRLSLPVSSLEKAVGQLLDTMSLTEAVPAFR 1271 VL DG SSEIR+ LD CV A+P LV +LR+ +PVS LE +G LLDTMS +A+P+ R Sbjct: 598 RVLNDGHSSEIRRALDTCVCNALPVLVSNLRMQIPVSKLEITLGYLLDTMSFVDALPSLR 657 Query: 1272 TKQWHVIVLLFLEALSLHRLPALAQRMANRNTLLHKVLNPAKITSEEYKTMVDLIIPLGR 1451 ++QW ++VL+ L+ALSLHRLPALA M++ + LL K+LN A+++ EEY +M+DL++P GR Sbjct: 658 SRQWQLMVLVLLDALSLHRLPALAPIMSD-SKLLQKLLNSAQVSREEYDSMIDLLLPFGR 716 Query: 1452 VPQTNS 1469 Q+ + Sbjct: 717 STQSQA 722 >sp|Q6AVZ9.1|RPAP2_ORYSJ RecName: Full=Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog; AltName: Full=RNA polymerase II-associated protein 2 homolog gi|51038243|gb|AAT94046.1| unknown protein [Oryza sativa Japonica Group] gi|222630100|gb|EEE62232.1| hypothetical protein OsJ_17019 [Oryza sativa Japonica Group] Length = 726 Score = 332 bits (852), Expect = 2e-88 Identities = 210/546 (38%), Positives = 295/546 (54%), Gaps = 59/546 (10%) Frame = +3 Query: 9 AGNGDVSMEEWLGPSDAIEGYVPLRDR-----KQGAKY---MSEDESTEMVDDAGNGEMG 164 AG G+V+++EW+GPSDAIEGYVP RDR K+ AK S ++S+ + D+ N G Sbjct: 181 AGTGEVTLQEWIGPSDAIEGYVPRRDRVVGGPKKEAKQNDACSAEQSSNINVDSRNASSG 240 Query: 165 FTSCLIMENELS------------VPQSDKSSVHQQDISNMIAKQLENLAIEEK-----N 293 + ++ EN + Q + + + IS+ I KQLE++ +EEK N Sbjct: 241 ESGMVLTENTKAKKKEATKTPLKMFKQDEDNDMLSSCISDSIVKQLEDVVLEEKKDKKKN 300 Query: 294 SPRERTSGKTRNRKTLKKVNACKTEEKIKRAAIVCEQ--------------------SKA 413 + TS +++ + V 
E I+ ++ + Sbjct: 301 KAAKGTSRVGKSKPAKRPVGRDGHEVDFTSTIIMGDRGSEMMDHGALGQYNFSSSILANE 360 Query: 414 TPPNKHSVVLEDLS------EQXXXXXXXXXXXXXXTVSNEAVLKSSLKSRGSKD-GKTV 572 P + ++ + ++ S L+SSLK+ GSK+ G +V Sbjct: 361 QPSSSQYAAIDSVQAYTEELDELFSNAVNIAKDETSDDSGRCTLRSSLKAVGSKNAGHSV 420 Query: 573 TWADETYKAPEKKDADHGGSSNAQASHDDADEDLRLXXXXXXXXXXXXXXETVXXXXXXX 752 WADE E A SS +Q S D + +R E + Sbjct: 421 KWADENGSVLETSRAFVSHSSKSQESMDSS---VRRESAEACAAALIEAAEAISSGTSEV 477 Query: 753 XXXXXXXXIIILPQPQNNEKGGIEEDE-------EIFELDRGRVKWPTKPVLLDTDMFEV 911 IIILP N ++ + D EIFE+DRG VKWP K VLLDTDMF+V Sbjct: 478 EDAVSKAGIIILPDMVNQQQYNNDYDNDKDAGENEIFEIDRGVVKWPKKTVLLDTDMFDV 537 Query: 912 EDSWHDTPPEGFKLTLSSFATMWMALFGWITSSSIAYIYGYNESSHEDFLTVNGREYPRK 1091 +DSWHDTPPEGF LTLSSFATMW ALFGW++ SS+AY+YG +ESS ED L GRE P+K Sbjct: 538 DDSWHDTPPEGFSLTLSSFATMWAALFGWVSRSSLAYVYGLDESSMEDLLIAGGRECPQK 597 Query: 1092 IVLQDGKSSEIRQTLDICVGRAIPALVMDLRLSLPVSSLEKAVGQLLDTMSLTEAVPAFR 1271 VL DG SSEIR+ LD CV A+P LV +LR+ +PVS LE +G LLDTMS +A+P+ R Sbjct: 598 RVLNDGHSSEIRRALDTCVCNALPVLVSNLRMQIPVSKLEITLGYLLDTMSFVDALPSLR 657 Query: 1272 TKQWHVIVLLFLEALSLHRLPALAQRMANRNTLLHKVLNPAKITSEEYKTMVDLIIPLGR 1451 ++QW ++VL+ L+ALSLHRLPALA M++ + LL K+LN A+++ EEY +M+DL++P GR Sbjct: 658 SRQWQLMVLVLLDALSLHRLPALAPIMSD-SKLLQKLLNSAQVSREEYDSMIDLLLPFGR 716 Query: 1452 VPQTNS 1469 Q+ + Sbjct: 717 STQSQA 722
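Note on reading the per-hit statistics: each alignment in this report carries a line of the form "Score = ... bits (...), Expect = ... Identities = a/b (..%), Positives = ... Gaps = ... Frame = ...". The sketch below is a minimal, illustrative way to pull those fields out of the text and recompute percent identity. It is not part of BLAST itself. The names `HSP_STATS` and `parse_hsp_stats` are hypothetical, and the pattern is assumed to cover only the legacy layout seen in this record.

```python
import re

# Hypothetical pattern for the per-hit statistics line of a legacy BLASTX
# text report. Whitespace is matched loosely so it works on both the
# original multi-line layout and a flattened single-line copy.
HSP_STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),"
    r"\s+Expect\s*=\s*(?P<evalue>[\de.+-]+)"
    r"\s+Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s+\(\d+%\),"
    r"\s+Positives\s*=\s*(?P<pos>\d+)/\d+\s+\(\d+%\)"
    r"(?:,\s+Gaps\s*=\s*(?P<gaps>\d+)/\d+\s+\(\d+%\))?"
    r"\s+Frame\s*=\s*(?P<frame>[+-]\d)"
)


def parse_hsp_stats(text):
    """Return one dict per statistics line found in `text`."""
    hits = []
    for m in HSP_STATS.finditer(text):
        evalue = m["evalue"]
        # Legacy BLAST may print very small E-values as "e-100";
        # float() needs a leading digit.
        if evalue.startswith("e"):
            evalue = "1" + evalue
        ident = int(m["ident"])
        length = int(m["length"])
        hits.append({
            "bits": float(m["bits"]),
            "raw_score": int(m["raw"]),
            "evalue": float(evalue),
            "identities": ident,
            "positives": int(m["pos"]),
            "gaps": int(m["gaps"] or 0),  # the Gaps field is absent for ungapped hits
            "align_len": length,
            "frame": int(m["frame"]),
            # Recomputed from the counts; the report itself shows a whole-number percentage.
            "pct_identity": round(100 * ident / length, 1),
        })
    return hits


# Statistics line of the first hit in this record (emb|CAN62034.1|).
SAMPLE = (
    "Score = 345 bits (886), Expect = 2e-92 "
    "Identities = 212/507 (41%), Positives = 293/507 (57%), "
    "Gaps = 21/507 (4%) Frame = +3"
)

if __name__ == "__main__":
    for hit in parse_hsp_stats(SAMPLE):
        print(hit)
```

Run over the full report text, the same function would be expected to return one entry per alignment, five for this record. For anything beyond a quick look, a maintained BLAST parser is the safer choice.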