BLASTX nr result
ID: Cephaelis21_contig00012036
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00012036 (1961 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative ... 466 e-128 ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal doma... 461 e-127 ref|XP_003550691.1| PREDICTED: RNA polymerase II C-terminal doma... 446 e-122 ref|XP_003518215.1| PREDICTED: RNA polymerase II C-terminal doma... 435 e-119 dbj|BAB08870.1| unnamed protein product [Arabidopsis thaliana] 424 e-116 >ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis] gi|223534449|gb|EEF36151.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis] Length = 478 Score = 466 bits (1199), Expect = e-128 Identities = 251/477 (52%), Positives = 322/477 (67%), Gaps = 17/477 (3%) Frame = -2 Query: 1783 MSLVADSPLHSS-STSGDDYAAFLELELDS---DSDTSPKXXXXXXXXXXXXXXXXXXXX 1616 MSLV DSPLHSS S+S DD+AA L+ ELDS SD+SPK Sbjct: 1 MSLVTDSPLHSSHSSSSDDFAALLDAELDSKSSSSDSSPKAIKHDDASDANDDVNEEEEE 60 Query: 1615 G-------------RNKRRKVEILESSLDLQSSTSVGVEAQTSGSSSEKDICEHPAVWNK 1475 R KR +VE LE+ + + ST V ++ QT +SS K C HP + Sbjct: 61 EESDSDDDSDIATNRIKRSRVETLENGENPKESTRVSLD-QTLVASSSKVACTHPGSFGD 119 Query: 1474 LCVRCGQKTDDESGVAFGYIHKNLRLSNEELARLREXXXXXXXXXXXXXXXXXXXXXXLN 1295 +C+ CG++ +E+GV FGYIHK LRL+N+E+ RLR LN Sbjct: 120 MCILCGERLIEETGVTFGYIHKGLRLANDEIVRLRNTDMKNLLRHRKLYLVLDLDHTLLN 179 Query: 1294 SSLLDDINGDEEYLKGPKEGLPDTVKNSLHRLDKMRMMTKLRPFVHSFLEEASKLFEMYI 1115 S+ L + +EEYLK + + D SL +D M MMTKLRPF+ +FL+EAS++FEMYI Sbjct: 180 
STQLMHLTAEEEYLKSQIDSMQDVSNGSLFMVDFMHMMTKLRPFIRTFLKEASQMFEMYI 239 Query: 1114 YTMGDRNYALEMANLLDPKGIYFNSRVIANVNSTKRYQKGLDVVLGEESAVLILDDTEKV 935 YTMGDR YALEMA LDP YFN+RVI+ + T+R+QKGLD+VLG+ESAVLILDDTE Sbjct: 240 YTMGDRAYALEMAKFLDPGREYFNARVISRDDGTQRHQKGLDIVLGQESAVLILDDTENA 299 Query: 934 WEGHLENLILMDRYHFFASSLRQFNFPSQKSLSELRSDESEVDGALATVLRVLQKIHSNF 755 W H +NLILM+RYHFFASS RQF F KSLS+L+SDE+E DGALA+VL+VL++IH F Sbjct: 300 WTKHKDNLILMERYHFFASSCRQFGFEC-KSLSQLKSDENESDGALASVLKVLRRIHHIF 358 Query: 754 FDSEQKESLEDQDVRQGLGTVRKEVLKGCKIVFSRVFSRDSLAENQQYWKLAEQLGATCT 575 FD E +++++ +DVRQ L TVRK+VLKGCKIVFSRVF A+N WK+AEQLGATC+ Sbjct: 359 FD-ELEDAIDGRDVRQVLSTVRKDVLKGCKIVFSRVFPTQFQADNHHLWKMAEQLGATCS 417 Query: 574 KELDPSITHVVSEDSGTEKSRWAVQEKKFLVNPRWIDAARYLWQKQPEENYPVKHSR 404 +E+DPS+THVVS ++GTEKSRWA++ KFLV+PRWI+A Y+WQ+QPEEN+ V + Sbjct: 418 REVDPSVTHVVSAEAGTEKSRWALKNDKFLVHPRWIEATNYMWQRQPEENFSVNQPK 474 >ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Cucumis sativus] Length = 452 Score = 461 bits (1185), Expect = e-127 Identities = 249/461 (54%), Positives = 316/461 (68%), Gaps = 1/461 (0%) Frame = -2 Query: 1783 MSLVADSPLHSSSTSGDDYAAFLELELDS-DSDTSPKXXXXXXXXXXXXXXXXXXXXGRN 1607 MSL +SP HSSS+ DD+AAFL ++LDS SD+SP R Sbjct: 1 MSLATNSPAHSSSS--DDFAAFLAVDLDSHSSDSSPDEETEGDNNAESV---------RI 49 Query: 1606 KRRKVEILESSLDLQSSTSVGVEAQTSGSSSEKDICEHPAVWNKLCVRCGQKTDDESGVA 1427 KRRKVE LE+S + VE Q+ S++ +C HP + +C+ CGQ+ D+ESGV Sbjct: 50 KRRKVEKLENS---EEDIMHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVT 106 Query: 1426 FGYIHKNLRLSNEELARLREXXXXXXXXXXXXXXXXXXXXXXLNSSLLDDINGDEEYLKG 1247 FGYIHK LRL+N+E+ R+R LNS+ L + +EEYL+ Sbjct: 107 FGYIHKELRLNNDEINRMRNKEMKELLQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRS 166 Query: 1246 PKEGLPDTVKNSLHRLDKMRMMTKLRPFVHSFLEEASKLFEMYIYTMGDRNYALEMANLL 1067 + L D K SL L+ + MTKLRPFVHSFL+EASKLFEMYIYTMG+R YA EMA LL Sbjct: 167 QTDSLDDVTKGSLFLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLL 226 Query: 1066 DPKGIYFNSRVIANVNSTKRYQKGLDVVLGEESAVLILDDTEKVWEGHLENLILMDRYHF 
887 DPK YF+S+VI+ + T+++QKGLDVVLG+ESAVLILDDTE W H ENLILM+RYHF Sbjct: 227 DPKKEYFSSKVISRDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHF 286 Query: 886 FASSLRQFNFPSQKSLSELRSDESEVDGALATVLRVLQKIHSNFFDSEQKESLEDQDVRQ 707 FASS RQF F + KSLSEL++DESE DGAL T+L+VL+++H FF +E L D+DVRQ Sbjct: 287 FASSCRQFGF-NCKSLSELKNDESETDGALTTILKVLKQVHHMFF-NEVSGDLVDRDVRQ 344 Query: 706 GLGTVRKEVLKGCKIVFSRVFSRDSLAENQQYWKLAEQLGATCTKELDPSITHVVSEDSG 527 L TVR EVL+GCK+VFSRVF AEN Q WK+ EQLG TC+ ELD S+THVV+ D+G Sbjct: 345 VLKTVRAEVLEGCKVVFSRVFPTKFQAENHQLWKMVEQLGGTCSTELDQSVTHVVATDAG 404 Query: 526 TEKSRWAVQEKKFLVNPRWIDAARYLWQKQPEENYPVKHSR 404 TEKSRWA++EKKFLV+PRWI+A+ Y W++Q EEN+ V+ ++ Sbjct: 405 TEKSRWALKEKKFLVHPRWIEASNYFWKRQMEENFTVEQTK 445 >ref|XP_003550691.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Glycine max] Length = 442 Score = 446 bits (1146), Expect = e-122 Identities = 241/459 (52%), Positives = 312/459 (67%), Gaps = 2/459 (0%) Frame = -2 Query: 1783 MSLVADSPLHSSSTSGDDYAAFLELELDSDS-DTSPKXXXXXXXXXXXXXXXXXXXXGRN 1607 MS+V DSP+HSSS+ DD+ AFL+ ELD+ S D+SP R Sbjct: 1 MSVVTDSPVHSSSS--DDFIAFLDAELDASSPDSSPDKEVVKQDDELQSV--------RT 50 Query: 1606 KRRKVEILESSLDLQSSTSVGVEAQTSGSSSEKDIC-EHPAVWNKLCVRCGQKTDDESGV 1430 KRRK E +E + + STS G+ ++ +SSE D+C HP + +C+RCGQK D ESGV Sbjct: 51 KRRKFESIEET---EGSTSEGIVKRSLEASSEVDVCCTHPGSFGNMCIRCGQKLDGESGV 107 Query: 1429 AFGYIHKNLRLSNEELARLREXXXXXXXXXXXXXXXXXXXXXXLNSSLLDDINGDEEYLK 1250 FGYIHK LRL +EE++RLR LNS+ L + +E +L Sbjct: 108 TFGYIHKGLRLHDEEISRLRNTDMKSLLGRKKLYLVLDLDHTLLNSTHLAQLTSEELHLL 167 Query: 1249 GPKEGLPDTVKNSLHRLDKMRMMTKLRPFVHSFLEEASKLFEMYIYTMGDRNYALEMANL 1070 + L + K SL +L+ M MMTKLRPFV FL+EAS++FEMYIYTMGDR YALEMA L Sbjct: 168 NQTDSLTNVSKGSLFKLEHMNMMTKLRPFVRPFLKEASEMFEMYIYTMGDRPYALEMAKL 227 Query: 1069 LDPKGIYFNSRVIANVNSTKRYQKGLDVVLGEESAVLILDDTEKVWEGHLENLILMDRYH 890 LDP+G YFN++VI+ + T+++QKGLDVVLG+ESAV+ILDDTE W H +NLILM+RYH Sbjct: 228 LDPQGEYFNAKVISRDDGTQKHQKGLDVVLGQESAVIILDDTEHAWMKHKDNLILMERYH 287 Query: 889 
FFASSLRQFNFPSQKSLSELRSDESEVDGALATVLRVLQKIHSNFFDSEQKESLEDQDVR 710 FF SS RQF F + KSL+EL+SDE E DGALA +L+VL+++H FFD ++E +DQDVR Sbjct: 288 FFGSSCRQFGF-NCKSLAELKSDEDETDGALAKILKVLKQVHCMFFD--KQEDFDDQDVR 344 Query: 709 QGLGTVRKEVLKGCKIVFSRVFSRDSLAENQQYWKLAEQLGATCTKELDPSITHVVSEDS 530 Q L +VR+EVL GC I+FSR+ K+AEQ+GATC E+DPS+THVV+ D+ Sbjct: 345 QVLSSVRREVLSGCVIIFSRIVH----GAIPSLRKMAEQMGATCLTEIDPSVTHVVATDA 400 Query: 529 GTEKSRWAVQEKKFLVNPRWIDAARYLWQKQPEENYPVK 413 GTEK RWAV+EKKF+V+P WI+AA Y WQKQPEEN+ +K Sbjct: 401 GTEKCRWAVKEKKFVVHPLWIEAANYFWQKQPEENFSLK 439 >ref|XP_003518215.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Glycine max] Length = 428 Score = 435 bits (1118), Expect = e-119 Identities = 238/458 (51%), Positives = 304/458 (66%), Gaps = 1/458 (0%) Frame = -2 Query: 1783 MSLVADSPLHSSSTSGDDYAAFLELELDSDS-DTSPKXXXXXXXXXXXXXXXXXXXXGRN 1607 MS+V DSP+HSSS+ DD+ AFL+ ELD+ S D+SP Sbjct: 1 MSVVTDSPVHSSSS--DDFIAFLDAELDASSPDSSPDKEVEKQDDDELESGI-------- 50 Query: 1606 KRRKVEILESSLDLQSSTSVGVEAQTSGSSSEKDICEHPAVWNKLCVRCGQKTDDESGVA 1427 KRRK E +E +T GS+SE +C HP + +C+RCGQK D ESGV Sbjct: 51 KRRKFESIE---------------ETEGSTSE-GVCTHPGSFGNMCIRCGQKLDGESGVT 94 Query: 1426 FGYIHKNLRLSNEELARLREXXXXXXXXXXXXXXXXXXXXXXLNSSLLDDINGDEEYLKG 1247 FGYIHK LRL +EE++RLR LNS+ L + +E +L Sbjct: 95 FGYIHKGLRLHDEEISRLRNTDMKSLLCRKKLYLVLDLDHTLLNSTHLAHLTSEESHLLN 154 Query: 1246 PKEGLPDTVKNSLHRLDKMRMMTKLRPFVHSFLEEASKLFEMYIYTMGDRNYALEMANLL 1067 + L D K SL +L+ M MMTKLRPFV FL+EAS++FEMYIYTMGDR YALEMA LL Sbjct: 155 QTDSLRDVSKGSLFKLEHMNMMTKLRPFVRPFLKEASEMFEMYIYTMGDRPYALEMAKLL 214 Query: 1066 DPKGIYFNSRVIANVNSTKRYQKGLDVVLGEESAVLILDDTEKVWEGHLENLILMDRYHF 887 DP+G YFN++VI+ + T+++QKGLDVVLG+ESAVLILDDTE W H +NLILM+RYHF Sbjct: 215 DPQGEYFNAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTEHAWMKHKDNLILMERYHF 274 Query: 886 FASSLRQFNFPSQKSLSELRSDESEVDGALATVLRVLQKIHSNFFDSEQKESLEDQDVRQ 707 F SS RQF F + KSL+EL+SDE+E DGALA +L+VL+++H FFD ++E +D+DVRQ Sbjct: 275 FGSSCRQFGF-NCKSLAELKSDENETDGALAKILKVLKQVHCMFFD--KQEDFDDRDVRQ 331 
Query: 706 GLGTVRKEVLKGCKIVFSRVFSRDSLAENQQYWKLAEQLGATCTKELDPSITHVVSEDSG 527 L VR+EVL GC I+FSR+ K+AEQ+GATC E+DPS+THVV+ D+G Sbjct: 332 MLSLVRREVLSGCVIIFSRIVH----GAIPSLRKMAEQMGATCLTEIDPSVTHVVATDAG 387 Query: 526 TEKSRWAVQEKKFLVNPRWIDAARYLWQKQPEENYPVK 413 TEK RWAV+EKKF+V+P WI+AA Y WQKQPEEN+ +K Sbjct: 388 TEKCRWAVKEKKFVVHPLWIEAANYFWQKQPEENFILK 425 >dbj|BAB08870.1| unnamed protein product [Arabidopsis thaliana] Length = 1065 Score = 424 bits (1090), Expect = e-116 Identities = 236/459 (51%), Positives = 303/459 (66%), Gaps = 3/459 (0%) Frame = -2 Query: 1789 YEMSLVADSPLHSSSTSGDDYAAFLELELDSDSDTSPKXXXXXXXXXXXXXXXXXXXXGR 1610 ++MS+ +DSP+HSSS+S DD AAFL+ ELDS SD S Sbjct: 624 FKMSVASDSPVHSSSSS-DDLAAFLDAELDSASDASSGPSEEEEAEDDVESGL------- 675 Query: 1609 NKRRKVEILESSLDLQSSTSVGVEAQTSGSSSEKDICEHPAVWNKLCVRCGQKTDDESGV 1430 KR+K+E LE + SS K CEHP + +C CGQK + E+GV Sbjct: 676 -KRQKLEHLEEA------------------SSSKGECEHPGSFGNMCFVCGQKLE-ETGV 715 Query: 1429 AFGYIHKNLRLSNEELARLREXXXXXXXXXXXXXXXXXXXXXXLNSSLLDDINGDEEYLK 1250 +F YIHK +RL+ +E++RLR+ LN+++L D+ +EEYLK Sbjct: 716 SFRYIHKEMRLNEDEISRLRDSDSRFLQRQRKLYLVLDLDHTLLNTTILRDLKPEEEYLK 775 Query: 1249 GPKEGLPDTVK---NSLHRLDKMRMMTKLRPFVHSFLEEASKLFEMYIYTMGDRNYALEM 1079 L D SL L+ M+MMTKLRPFVHSFL+EAS++F MYIYTMGDRNYA +M Sbjct: 776 SHTHSLQDGCNVSGGSLFLLEFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQM 835 Query: 1078 ANLLDPKGIYFNSRVIANVNSTKRYQKGLDVVLGEESAVLILDDTEKVWEGHLENLILMD 899 A LLDPKG YF RVI+ + T R++K LDVVLG+ESAVLILDDTE W H +NLI+++ Sbjct: 836 AKLLDPKGEYFGDRVISRDDGTVRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIE 895 Query: 898 RYHFFASSLRQFNFPSQKSLSELRSDESEVDGALATVLRVLQKIHSNFFDSEQKESLEDQ 719 RYHFF+SS RQF+ KSLSEL+SDESE DGALATVL+VL++ H+ FF++ E + ++ Sbjct: 896 RYHFFSSSCRQFDH-RYKSLSELKSDESEPDGALATVLKVLKQAHALFFENVD-EGISNR 953 Query: 718 DVRQGLGTVRKEVLKGCKIVFSRVFSRDSLAENQQYWKLAEQLGATCTKELDPSITHVVS 539 DVR L VRKE+LKGCKIVFSRVF + E+ WK+AE+LGATC E+D S+THVV+ Sbjct: 954 DVRLMLKQVRKEILKGCKIVFSRVFPTKAKPEDHPLWKMAEELGATCATEVDASVTHVVA 1013 Query: 538 
EDSGTEKSRWAVQEKKFLVNPRWIDAARYLWQKQPEENY 422 D GTEK+RWAV+EKK++V+ WIDAA YLW KQPEEN+ Sbjct: 1014 MDVGTEKARWAVREKKYVVHRGWIDAANYLWMKQPEENF 1052