BLASTX nr result
ID: Cephaelis21_contig00012555
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00012555 (2413 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal doma... 445 e-122 ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative ... 436 e-119 ref|XP_003550691.1| PREDICTED: RNA polymerase II C-terminal doma... 419 e-114 ref|XP_002437361.1| hypothetical protein SORBIDRAFT_10g025580 [S... 416 e-113 ref|NP_001078764.1| RNA polymerase II C-terminal domain phosphat... 414 e-113 >ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Cucumis sativus] Length = 452 Score = 445 bits (1145), Expect = e-122 Identities = 235/416 (56%), Positives = 294/416 (70%), Gaps = 1/416 (0%) Frame = -3 Query: 2333 NEDTK-DYDLDSEGNKRRKVDIVESSLDLQSSTSLGVEARTSGASLATGICTHPGVIGGL 2157 +E+T+ D + +S KRRKV+ +E+S + VE ++ +C+HPG G + Sbjct: 35 DEETEGDNNAESVRIKRRKVEKLENS---EEDIMHEVEEQSLEVLSKQQLCSHPGSFGNM 91 Query: 2156 CIRCGQKMDDESGVALGYIHKXXXXXXXXXXXXXXXXXXXXXXXXXLYLVLDLDHTLVNS 1977 CI CGQ++D+ESGV GYIHK L LVLDLDHTL+NS Sbjct: 92 CIICGQRLDEESGVTFGYIHKELRLNNDEINRMRNKEMKELLQRKKLILVLDLDHTLLNS 151 Query: 1976 SLLDDINGSEAYLKGPRDDLPDALKSSIYRLESMRMMTKLRPFVHSFLKEASKMFEMYIY 1797 + L + E YL+ D L D K S++ L S+ MTKLRPFVHSFLKEASK+FEMYIY Sbjct: 152 TELRYLTVEEEYLRSQTDSLDDVTKGSLFLLNSVHTMTKLRPFVHSFLKEASKLFEMYIY 211 Query: 1796 TMGDRNYALEMAKLLDPEDVYFSSRVIANVNSTQRFQKGLDVVLGKESAVLILDDTEKVW 1617 TMG+R YA EMAKLLDP+ YFSS+VI+ + TQ+ QKGLDVVLGKESAVLILDDTE W Sbjct: 212 TMGERRYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGKESAVLILDDTENAW 271 Query: 1616 
EAHPENLILMERYHFFASSLRQFHHPSLQSLSELRSDESEVDGALATVLKVLQKIHSTFF 1437 H ENLILMERYHFFASS RQF + +SLSEL++DESE DGAL T+LKVL+++H FF Sbjct: 272 TKHKENLILMERYHFFASSCRQFGF-NCKSLSELKNDESETDGALTTILKVLKQVHHMFF 330 Query: 1436 NTEHMVGLVNGDVRQVLENVRKEVLKGCTLVFSRVFPIESRAENQQYWKLAEQLGATCST 1257 N E LV+ DVRQVL+ VR EVL+GC +VFSRVFP + +AEN Q WK+ EQLG TCST Sbjct: 331 N-EVSGDLVDRDVRQVLKTVRAEVLEGCKVVFSRVFPTKFQAENHQLWKMVEQLGGTCST 389 Query: 1256 KLDPSITHVVSEDSGTEKSRWAIQEKKFLVNPRWIDAARYLWQKQPEEKYPVDHLK 1089 +LD S+THVV+ D+GTEKSRWA++EKKFLV+PRWI+A+ Y W++Q EE + V+ K Sbjct: 390 ELDQSVTHVVATDAGTEKSRWALKEKKFLVHPRWIEASNYFWKRQMEENFTVEQTK 445 >ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis] gi|223534449|gb|EEF36151.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis] Length = 478 Score = 436 bits (1120), Expect = e-119 Identities = 224/416 (53%), Positives = 293/416 (70%), Gaps = 1/416 (0%) Frame = -3 Query: 2333 NEDTKDYDLDSEGNKRRKVDIVESSLDLQSSTSLGVEARTSGASLATGICTHPGVIGGLC 2154 ++ D D+ + KR +V+ +E+ + + ST + ++ +T AS + CTHPG G +C Sbjct: 63 SDSDDDSDIATNRIKRSRVETLENGENPKESTRVSLD-QTLVASSSKVACTHPGSFGDMC 121 Query: 2153 IRCGQKMDDESGVALGYIHKXXXXXXXXXXXXXXXXXXXXXXXXXLYLVLDLDHTLVNSS 1974 I CG+++ +E+GV GYIHK LYLVLDLDHTL+NS+ Sbjct: 122 ILCGERLIEETGVTFGYIHKGLRLANDEIVRLRNTDMKNLLRHRKLYLVLDLDHTLLNST 181 Query: 1973 LLDDINGSEAYLKGPRDDLPDALKSSIYRLESMRMMTKLRPFVHSFLKEASKMFEMYIYT 1794 L + E YLK D + D S++ ++ M MMTKLRPF+ +FLKEAS+MFEMYIYT Sbjct: 182 QLMHLTAEEEYLKSQIDSMQDVSNGSLFMVDFMHMMTKLRPFIRTFLKEASQMFEMYIYT 241 Query: 1793 MGDRNYALEMAKLLDPEDVYFSSRVIANVNSTQRFQKGLDVVLGKESAVLILDDTEKVWE 1614 MGDR YALEMAK LDP YF++RVI+ + TQR QKGLD+VLG+ESAVLILDDTE W Sbjct: 242 MGDRAYALEMAKFLDPGREYFNARVISRDDGTQRHQKGLDIVLGQESAVLILDDTENAWT 301 Query: 1613 AHPENLILMERYHFFASSLRQFHHPSLQSLSELRSDESEVDGALATVLKVLQKIHSTFFN 1434 H +NLILMERYHFFASS RQF +SLS+L+SDE+E DGALA+VLKVL++IH FF Sbjct: 302 KHKDNLILMERYHFFASSCRQFGF-ECKSLSQLKSDENESDGALASVLKVLRRIHHIFF- 359 Query: 1433 
TEHMVGLVNG-DVRQVLENVRKEVLKGCTLVFSRVFPIESRAENQQYWKLAEQLGATCST 1257 + + ++G DVRQVL VRK+VLKGC +VFSRVFP + +A+N WK+AEQLGATCS Sbjct: 360 -DELEDAIDGRDVRQVLSTVRKDVLKGCKIVFSRVFPTQFQADNHHLWKMAEQLGATCSR 418 Query: 1256 KLDPSITHVVSEDSGTEKSRWAIQEKKFLVNPRWIDAARYLWQKQPEEKYPVDHLK 1089 ++DPS+THVVS ++GTEKSRWA++ KFLV+PRWI+A Y+WQ+QPEE + V+ K Sbjct: 419 EVDPSVTHVVSAEAGTEKSRWALKNDKFLVHPRWIEATNYMWQRQPEENFSVNQPK 474 >ref|XP_003550691.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Glycine max] Length = 442 Score = 419 bits (1078), Expect = e-114 Identities = 220/406 (54%), Positives = 279/406 (68%), Gaps = 1/406 (0%) Frame = -3 Query: 2321 KDYDLDSEGNKRRKVDIVESSLDLQSSTSLGVEARTSGASLATGIC-THPGVIGGLCIRC 2145 +D +L S KRRK + +E + + STS G+ R+ AS +C THPG G +CIRC Sbjct: 41 QDDELQSVRTKRRKFESIEET---EGSTSEGIVKRSLEASSEVDVCCTHPGSFGNMCIRC 97 Query: 2144 GQKMDDESGVALGYIHKXXXXXXXXXXXXXXXXXXXXXXXXXLYLVLDLDHTLVNSSLLD 1965 GQK+D ESGV GYIHK LYLVLDLDHTL+NS+ L Sbjct: 98 GQKLDGESGVTFGYIHKGLRLHDEEISRLRNTDMKSLLGRKKLYLVLDLDHTLLNSTHLA 157 Query: 1964 DINGSEAYLKGPRDDLPDALKSSIYRLESMRMMTKLRPFVHSFLKEASKMFEMYIYTMGD 1785 + E +L D L + K S+++LE M MMTKLRPFV FLKEAS+MFEMYIYTMGD Sbjct: 158 QLTSEELHLLNQTDSLTNVSKGSLFKLEHMNMMTKLRPFVRPFLKEASEMFEMYIYTMGD 217 Query: 1784 RNYALEMAKLLDPEDVYFSSRVIANVNSTQRFQKGLDVVLGKESAVLILDDTEKVWEAHP 1605 R YALEMAKLLDP+ YF+++VI+ + TQ+ QKGLDVVLG+ESAV+ILDDTE W H Sbjct: 218 RPYALEMAKLLDPQGEYFNAKVISRDDGTQKHQKGLDVVLGQESAVIILDDTEHAWMKHK 277 Query: 1604 ENLILMERYHFFASSLRQFHHPSLQSLSELRSDESEVDGALATVLKVLQKIHSTFFNTEH 1425 +NLILMERYHFF SS RQF + +SL+EL+SDE E DGALA +LKVL+++H FF+ + Sbjct: 278 DNLILMERYHFFGSSCRQFGF-NCKSLAELKSDEDETDGALAKILKVLKQVHCMFFDKQE 336 Query: 1424 MVGLVNGDVRQVLENVRKEVLKGCTLVFSRVFPIESRAENQQYWKLAEQLGATCSTKLDP 1245 + DVRQVL +VR+EVL GC ++FSR+ K+AEQ+GATC T++DP Sbjct: 337 --DFDDQDVRQVLSSVRREVLSGCVIIFSRIV----HGAIPSLRKMAEQMGATCLTEIDP 390 Query: 1244 SITHVVSEDSGTEKSRWAIQEKKFLVNPRWIDAARYLWQKQPEEKY 1107 S+THVV+ D+GTEK RWA++EKKF+V+P WI+AA Y WQKQPEE + Sbjct: 391 
SVTHVVATDAGTEKCRWAVKEKKFVVHPLWIEAANYFWQKQPEENF 436 >ref|XP_002437361.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor] gi|241915584|gb|EER88728.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor] Length = 558 Score = 416 bits (1068), Expect = e-113 Identities = 224/415 (53%), Positives = 279/415 (67%), Gaps = 4/415 (0%) Frame = -3 Query: 2333 NEDTKDYDLDSEGNKRRKVDIVESSLDLQSSTSLGVEARTSGASLATGI--CTHPGVIGG 2160 +ED + ++ G KRR+V E L Q TS+ + +GAS + C HPG GG Sbjct: 60 DEDPEVEAVEQNGTKRRRV---EEQLQDQG-TSVRPDKIPTGASKNVQVEACPHPGYFGG 115 Query: 2159 LCIRCGQKMDDE--SGVALGYIHKXXXXXXXXXXXXXXXXXXXXXXXXXLYLVLDLDHTL 1986 LC RCG+ D+E SGVA GYIHK L L+LDLDHTL Sbjct: 116 LCFRCGKPQDEENVSGVAFGYIHKGLRLGTSEIDRLRGADLKNLLRERKLVLILDLDHTL 175 Query: 1985 VNSSLLDDINGSEAYLKGPRDDLPDALKSSIYRLESMRMMTKLRPFVHSFLKEASKMFEM 1806 +NS+ L DI+ +E L D SI+ L+SM+M+TKLRPFV FLKEAS MFEM Sbjct: 176 INSTKLQDISSAEKDLGIQTAASKDDPNRSIFSLDSMQMLTKLRPFVREFLKEASNMFEM 235 Query: 1805 YIYTMGDRNYALEMAKLLDPEDVYFSSRVIANVNSTQRFQKGLDVVLGKESAVLILDDTE 1626 YIYTMGD+ YA+E+AKLLDP ++YF S+VI+N + TQR QKGLDV+LG ES +ILDDTE Sbjct: 236 YIYTMGDKAYAIEIAKLLDPSNIYFPSKVISNSDCTQRHQKGLDVILGAESVAVILDDTE 295 Query: 1625 KVWEAHPENLILMERYHFFASSLRQFHHPSLQSLSELRSDESEVDGALATVLKVLQKIHS 1446 VW+ H ENLILMERYHFFASS RQF ++SLSE DE E DGALATVL VL++IHS Sbjct: 296 YVWQKHKENLILMERYHFFASSCRQFGF-GVRSLSESMQDERESDGALATVLDVLKRIHS 354 Query: 1445 TFFNTEHMVGLVNGDVRQVLENVRKEVLKGCTLVFSRVFPIESRAENQQYWKLAEQLGAT 1266 FF+ L + DVRQV++ VRKE+L+GC +VFSRVFP +R + Q WK+AE LGA Sbjct: 355 IFFDLAVETDLSSQDVRQVIKAVRKEILQGCKIVFSRVFPNNTRPQEQMLWKMAEHLGAV 414 Query: 1265 CSTKLDPSITHVVSEDSGTEKSRWAIQEKKFLVNPRWIDAARYLWQKQPEEKYPV 1101 CST +D S+THVV+ D GTEK+RW + KKFLV+PRWI+AA + W +QPEE +PV Sbjct: 415 CSTDVDSSVTHVVTVDLGTEKARWGVANKKFLVHPRWIEAANFRWHRQPEEDFPV 469 >ref|NP_001078764.1| RNA polymerase II C-terminal domain phosphatase-like 4 [Arabidopsis thaliana] gi|122154038|sp|Q00IB6.1|CPL4_ARATH RecName: Full=RNA polymerase II C-terminal domain phosphatase-like 4; Short=FCP-like 4; 
AltName: Full=Carboxyl-terminal phosphatase-like 4; Short=AtCPL4; Short=CTD phosphatase-like 4 gi|95115186|gb|ABF55959.1| carboxyl-terminal phosphatase-like 4 [Arabidopsis thaliana] gi|332009601|gb|AED96984.1| RNA polymerase II C-terminal domain phosphatase-like 4 [Arabidopsis thaliana] Length = 440 Score = 414 bits (1063), Expect = e-113 Identities = 220/399 (55%), Positives = 275/399 (68%), Gaps = 3/399 (0%) Frame = -3 Query: 2276 DIVESSLDLQSSTSLGVEARTSGASLATGICTHPGVIGGLCIRCGQKMDDESGVALGYIH 2097 D VES L Q L AS + G C HPG G +C CGQK++ E+GV+ YIH Sbjct: 44 DDVESGLKRQKLEHL------EEASSSKGECEHPGSFGNMCFVCGQKLE-ETGVSFRYIH 96 Query: 2096 KXXXXXXXXXXXXXXXXXXXXXXXXXLYLVLDLDHTLVNSSLLDDINGSEAYLKGPRDDL 1917 K LYLVLDLDHTL+N+++L D+ E YLK L Sbjct: 97 KEMRLNEDEISRLRDSDSRFLQRQRKLYLVLDLDHTLLNTTILRDLKPEEEYLKSHTHSL 156 Query: 1916 PDALK---SSIYRLESMRMMTKLRPFVHSFLKEASKMFEMYIYTMGDRNYALEMAKLLDP 1746 D S++ LE M+MMTKLRPFVHSFLKEAS+MF MYIYTMGDRNYA +MAKLLDP Sbjct: 157 QDGCNVSGGSLFLLEFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDP 216 Query: 1745 EDVYFSSRVIANVNSTQRFQKGLDVVLGKESAVLILDDTEKVWEAHPENLILMERYHFFA 1566 + YF RVI+ + T R +K LDVVLG+ESAVLILDDTE W H +NLI++ERYHFF+ Sbjct: 217 KGEYFGDRVISRDDGTVRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFS 276 Query: 1565 SSLRQFHHPSLQSLSELRSDESEVDGALATVLKVLQKIHSTFFNTEHMVGLVNGDVRQVL 1386 SS RQF H +SLSEL+SDESE DGALATVLKVL++ H+ FF G+ N DVR +L Sbjct: 277 SSCRQFDH-RYKSLSELKSDESEPDGALATVLKVLKQAHALFFENVD-EGISNRDVRLML 334 Query: 1385 ENVRKEVLKGCTLVFSRVFPIESRAENQQYWKLAEQLGATCSTKLDPSITHVVSEDSGTE 1206 + VRKE+LKGC +VFSRVFP +++ E+ WK+AE+LGATC+T++D S+THVV+ D GTE Sbjct: 335 KQVRKEILKGCKIVFSRVFPTKAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTE 394 Query: 1205 KSRWAIQEKKFLVNPRWIDAARYLWQKQPEEKYPVDHLK 1089 K+RWA++EKK++V+ WIDAA YLW KQPEE + ++ LK Sbjct: 395 KARWAVREKKYVVHRGWIDAANYLWMKQPEENFGLEQLK 433
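
The percentages on the statistics lines of this report appear to be truncated rather than rounded: for the first hit, 235/416 is about 56.5% but is printed as 56%, and the other four hits follow the same pattern. The sketch below recomputes the percentages from the printed fractions under that assumption. The regular expression and the helper name `check_stats` are illustrative and not part of BLAST; the pattern only assumes the `Name = n/m (p%)` layout seen above.

```python
import re

# Matches fields such as "Identities = 235/416 (56%)" as printed in this report.
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def check_stats(line):
    """Return {field: (count, length, reported_pct, recomputed_pct)}.

    The recomputed value uses integer division, i.e. it assumes the
    report truncates percentages instead of rounding them.
    """
    out = {}
    for name, num, den, pct in STAT.findall(line):
        num, den, pct = int(num), int(den), int(pct)
        out[name] = (num, den, pct, num * 100 // den)
    return out


# Statistics line of the first hit (XP_004141638.1), copied from the report.
line = "Identities = 235/416 (56%), Positives = 294/416 (70%), Gaps = 1/416 (0%)"
stats = check_stats(line)
for name, (num, den, reported, recomputed) in stats.items():
    print(f"{name}: {num}/{den} reported={reported}% recomputed={recomputed}%")
```

Running the same check on the remaining hits is a quick way to spot transcription damage in a flattened report like this one: a mismatch between the reported and recomputed percentage would suggest a corrupted fraction.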