BLASTX nr result

ID: Cephaelis21_contig00012555 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00012555
         (2413 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal doma...   445   e-122
ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative ...   436   e-119
ref|XP_003550691.1| PREDICTED: RNA polymerase II C-terminal doma...   419   e-114
ref|XP_002437361.1| hypothetical protein SORBIDRAFT_10g025580 [S...   416   e-113
ref|NP_001078764.1| RNA polymerase II C-terminal domain phosphat...   414   e-113

>ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Cucumis sativus]
          Length = 452

 Score =  445 bits (1145), Expect = e-122
 Identities = 235/416 (56%), Positives = 294/416 (70%), Gaps = 1/416 (0%)
 Frame = -3

Query: 2333 NEDTK-DYDLDSEGNKRRKVDIVESSLDLQSSTSLGVEARTSGASLATGICTHPGVIGGL 2157
            +E+T+ D + +S   KRRKV+ +E+S   +      VE ++        +C+HPG  G +
Sbjct: 35   DEETEGDNNAESVRIKRRKVEKLENS---EEDIMHEVEEQSLEVLSKQQLCSHPGSFGNM 91

Query: 2156 CIRCGQKMDDESGVALGYIHKXXXXXXXXXXXXXXXXXXXXXXXXXLYLVLDLDHTLVNS 1977
            CI CGQ++D+ESGV  GYIHK                         L LVLDLDHTL+NS
Sbjct: 92   CIICGQRLDEESGVTFGYIHKELRLNNDEINRMRNKEMKELLQRKKLILVLDLDHTLLNS 151

Query: 1976 SLLDDINGSEAYLKGPRDDLPDALKSSIYRLESMRMMTKLRPFVHSFLKEASKMFEMYIY 1797
            + L  +   E YL+   D L D  K S++ L S+  MTKLRPFVHSFLKEASK+FEMYIY
Sbjct: 152  TELRYLTVEEEYLRSQTDSLDDVTKGSLFLLNSVHTMTKLRPFVHSFLKEASKLFEMYIY 211

Query: 1796 TMGDRNYALEMAKLLDPEDVYFSSRVIANVNSTQRFQKGLDVVLGKESAVLILDDTEKVW 1617
            TMG+R YA EMAKLLDP+  YFSS+VI+  + TQ+ QKGLDVVLGKESAVLILDDTE  W
Sbjct: 212  TMGERRYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGKESAVLILDDTENAW 271

Query: 1616 EAHPENLILMERYHFFASSLRQFHHPSLQSLSELRSDESEVDGALATVLKVLQKIHSTFF 1437
              H ENLILMERYHFFASS RQF   + +SLSEL++DESE DGAL T+LKVL+++H  FF
Sbjct: 272  TKHKENLILMERYHFFASSCRQFGF-NCKSLSELKNDESETDGALTTILKVLKQVHHMFF 330

Query: 1436 NTEHMVGLVNGDVRQVLENVRKEVLKGCTLVFSRVFPIESRAENQQYWKLAEQLGATCST 1257
            N E    LV+ DVRQVL+ VR EVL+GC +VFSRVFP + +AEN Q WK+ EQLG TCST
Sbjct: 331  N-EVSGDLVDRDVRQVLKTVRAEVLEGCKVVFSRVFPTKFQAENHQLWKMVEQLGGTCST 389

Query: 1256 KLDPSITHVVSEDSGTEKSRWAIQEKKFLVNPRWIDAARYLWQKQPEEKYPVDHLK 1089
            +LD S+THVV+ D+GTEKSRWA++EKKFLV+PRWI+A+ Y W++Q EE + V+  K
Sbjct: 390  ELDQSVTHVVATDAGTEKSRWALKEKKFLVHPRWIEASNYFWKRQMEENFTVEQTK 445


>ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
            gi|223534449|gb|EEF36151.1| RNA polymerase II ctd
            phosphatase, putative [Ricinus communis]
          Length = 478

 Score =  436 bits (1120), Expect = e-119
 Identities = 224/416 (53%), Positives = 293/416 (70%), Gaps = 1/416 (0%)
 Frame = -3

Query: 2333 NEDTKDYDLDSEGNKRRKVDIVESSLDLQSSTSLGVEARTSGASLATGICTHPGVIGGLC 2154
            ++   D D+ +   KR +V+ +E+  + + ST + ++ +T  AS +   CTHPG  G +C
Sbjct: 63   SDSDDDSDIATNRIKRSRVETLENGENPKESTRVSLD-QTLVASSSKVACTHPGSFGDMC 121

Query: 2153 IRCGQKMDDESGVALGYIHKXXXXXXXXXXXXXXXXXXXXXXXXXLYLVLDLDHTLVNSS 1974
            I CG+++ +E+GV  GYIHK                         LYLVLDLDHTL+NS+
Sbjct: 122  ILCGERLIEETGVTFGYIHKGLRLANDEIVRLRNTDMKNLLRHRKLYLVLDLDHTLLNST 181

Query: 1973 LLDDINGSEAYLKGPRDDLPDALKSSIYRLESMRMMTKLRPFVHSFLKEASKMFEMYIYT 1794
             L  +   E YLK   D + D    S++ ++ M MMTKLRPF+ +FLKEAS+MFEMYIYT
Sbjct: 182  QLMHLTAEEEYLKSQIDSMQDVSNGSLFMVDFMHMMTKLRPFIRTFLKEASQMFEMYIYT 241

Query: 1793 MGDRNYALEMAKLLDPEDVYFSSRVIANVNSTQRFQKGLDVVLGKESAVLILDDTEKVWE 1614
            MGDR YALEMAK LDP   YF++RVI+  + TQR QKGLD+VLG+ESAVLILDDTE  W 
Sbjct: 242  MGDRAYALEMAKFLDPGREYFNARVISRDDGTQRHQKGLDIVLGQESAVLILDDTENAWT 301

Query: 1613 AHPENLILMERYHFFASSLRQFHHPSLQSLSELRSDESEVDGALATVLKVLQKIHSTFFN 1434
             H +NLILMERYHFFASS RQF     +SLS+L+SDE+E DGALA+VLKVL++IH  FF 
Sbjct: 302  KHKDNLILMERYHFFASSCRQFGF-ECKSLSQLKSDENESDGALASVLKVLRRIHHIFF- 359

Query: 1433 TEHMVGLVNG-DVRQVLENVRKEVLKGCTLVFSRVFPIESRAENQQYWKLAEQLGATCST 1257
             + +   ++G DVRQVL  VRK+VLKGC +VFSRVFP + +A+N   WK+AEQLGATCS 
Sbjct: 360  -DELEDAIDGRDVRQVLSTVRKDVLKGCKIVFSRVFPTQFQADNHHLWKMAEQLGATCSR 418

Query: 1256 KLDPSITHVVSEDSGTEKSRWAIQEKKFLVNPRWIDAARYLWQKQPEEKYPVDHLK 1089
            ++DPS+THVVS ++GTEKSRWA++  KFLV+PRWI+A  Y+WQ+QPEE + V+  K
Sbjct: 419  EVDPSVTHVVSAEAGTEKSRWALKNDKFLVHPRWIEATNYMWQRQPEENFSVNQPK 474


>ref|XP_003550691.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Glycine max]
          Length = 442

 Score =  419 bits (1078), Expect = e-114
 Identities = 220/406 (54%), Positives = 279/406 (68%), Gaps = 1/406 (0%)
 Frame = -3

Query: 2321 KDYDLDSEGNKRRKVDIVESSLDLQSSTSLGVEARTSGASLATGIC-THPGVIGGLCIRC 2145
            +D +L S   KRRK + +E +   + STS G+  R+  AS    +C THPG  G +CIRC
Sbjct: 41   QDDELQSVRTKRRKFESIEET---EGSTSEGIVKRSLEASSEVDVCCTHPGSFGNMCIRC 97

Query: 2144 GQKMDDESGVALGYIHKXXXXXXXXXXXXXXXXXXXXXXXXXLYLVLDLDHTLVNSSLLD 1965
            GQK+D ESGV  GYIHK                         LYLVLDLDHTL+NS+ L 
Sbjct: 98   GQKLDGESGVTFGYIHKGLRLHDEEISRLRNTDMKSLLGRKKLYLVLDLDHTLLNSTHLA 157

Query: 1964 DINGSEAYLKGPRDDLPDALKSSIYRLESMRMMTKLRPFVHSFLKEASKMFEMYIYTMGD 1785
             +   E +L    D L +  K S+++LE M MMTKLRPFV  FLKEAS+MFEMYIYTMGD
Sbjct: 158  QLTSEELHLLNQTDSLTNVSKGSLFKLEHMNMMTKLRPFVRPFLKEASEMFEMYIYTMGD 217

Query: 1784 RNYALEMAKLLDPEDVYFSSRVIANVNSTQRFQKGLDVVLGKESAVLILDDTEKVWEAHP 1605
            R YALEMAKLLDP+  YF+++VI+  + TQ+ QKGLDVVLG+ESAV+ILDDTE  W  H 
Sbjct: 218  RPYALEMAKLLDPQGEYFNAKVISRDDGTQKHQKGLDVVLGQESAVIILDDTEHAWMKHK 277

Query: 1604 ENLILMERYHFFASSLRQFHHPSLQSLSELRSDESEVDGALATVLKVLQKIHSTFFNTEH 1425
            +NLILMERYHFF SS RQF   + +SL+EL+SDE E DGALA +LKVL+++H  FF+ + 
Sbjct: 278  DNLILMERYHFFGSSCRQFGF-NCKSLAELKSDEDETDGALAKILKVLKQVHCMFFDKQE 336

Query: 1424 MVGLVNGDVRQVLENVRKEVLKGCTLVFSRVFPIESRAENQQYWKLAEQLGATCSTKLDP 1245
                 + DVRQVL +VR+EVL GC ++FSR+             K+AEQ+GATC T++DP
Sbjct: 337  --DFDDQDVRQVLSSVRREVLSGCVIIFSRIV----HGAIPSLRKMAEQMGATCLTEIDP 390

Query: 1244 SITHVVSEDSGTEKSRWAIQEKKFLVNPRWIDAARYLWQKQPEEKY 1107
            S+THVV+ D+GTEK RWA++EKKF+V+P WI+AA Y WQKQPEE +
Sbjct: 391  SVTHVVATDAGTEKCRWAVKEKKFVVHPLWIEAANYFWQKQPEENF 436


>ref|XP_002437361.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor]
            gi|241915584|gb|EER88728.1| hypothetical protein
            SORBIDRAFT_10g025580 [Sorghum bicolor]
          Length = 558

 Score =  416 bits (1068), Expect = e-113
 Identities = 224/415 (53%), Positives = 279/415 (67%), Gaps = 4/415 (0%)
 Frame = -3

Query: 2333 NEDTKDYDLDSEGNKRRKVDIVESSLDLQSSTSLGVEARTSGASLATGI--CTHPGVIGG 2160
            +ED +   ++  G KRR+V   E  L  Q  TS+  +   +GAS    +  C HPG  GG
Sbjct: 60   DEDPEVEAVEQNGTKRRRV---EEQLQDQG-TSVRPDKIPTGASKNVQVEACPHPGYFGG 115

Query: 2159 LCIRCGQKMDDE--SGVALGYIHKXXXXXXXXXXXXXXXXXXXXXXXXXLYLVLDLDHTL 1986
            LC RCG+  D+E  SGVA GYIHK                         L L+LDLDHTL
Sbjct: 116  LCFRCGKPQDEENVSGVAFGYIHKGLRLGTSEIDRLRGADLKNLLRERKLVLILDLDHTL 175

Query: 1985 VNSSLLDDINGSEAYLKGPRDDLPDALKSSIYRLESMRMMTKLRPFVHSFLKEASKMFEM 1806
            +NS+ L DI+ +E  L        D    SI+ L+SM+M+TKLRPFV  FLKEAS MFEM
Sbjct: 176  INSTKLQDISSAEKDLGIQTAASKDDPNRSIFSLDSMQMLTKLRPFVREFLKEASNMFEM 235

Query: 1805 YIYTMGDRNYALEMAKLLDPEDVYFSSRVIANVNSTQRFQKGLDVVLGKESAVLILDDTE 1626
            YIYTMGD+ YA+E+AKLLDP ++YF S+VI+N + TQR QKGLDV+LG ES  +ILDDTE
Sbjct: 236  YIYTMGDKAYAIEIAKLLDPSNIYFPSKVISNSDCTQRHQKGLDVILGAESVAVILDDTE 295

Query: 1625 KVWEAHPENLILMERYHFFASSLRQFHHPSLQSLSELRSDESEVDGALATVLKVLQKIHS 1446
             VW+ H ENLILMERYHFFASS RQF    ++SLSE   DE E DGALATVL VL++IHS
Sbjct: 296  YVWQKHKENLILMERYHFFASSCRQFGF-GVRSLSESMQDERESDGALATVLDVLKRIHS 354

Query: 1445 TFFNTEHMVGLVNGDVRQVLENVRKEVLKGCTLVFSRVFPIESRAENQQYWKLAEQLGAT 1266
             FF+      L + DVRQV++ VRKE+L+GC +VFSRVFP  +R + Q  WK+AE LGA 
Sbjct: 355  IFFDLAVETDLSSQDVRQVIKAVRKEILQGCKIVFSRVFPNNTRPQEQMLWKMAEHLGAV 414

Query: 1265 CSTKLDPSITHVVSEDSGTEKSRWAIQEKKFLVNPRWIDAARYLWQKQPEEKYPV 1101
            CST +D S+THVV+ D GTEK+RW +  KKFLV+PRWI+AA + W +QPEE +PV
Sbjct: 415  CSTDVDSSVTHVVTVDLGTEKARWGVANKKFLVHPRWIEAANFRWHRQPEEDFPV 469


>ref|NP_001078764.1| RNA polymerase II C-terminal domain phosphatase-like 4 [Arabidopsis
            thaliana] gi|122154038|sp|Q00IB6.1|CPL4_ARATH RecName:
            Full=RNA polymerase II C-terminal domain phosphatase-like
            4; Short=FCP-like 4; AltName: Full=Carboxyl-terminal
            phosphatase-like 4; Short=AtCPL4; Short=CTD
            phosphatase-like 4 gi|95115186|gb|ABF55959.1|
            carboxyl-terminal phosphatase-like 4 [Arabidopsis
            thaliana] gi|332009601|gb|AED96984.1| RNA polymerase II
            C-terminal domain phosphatase-like 4 [Arabidopsis
            thaliana]
          Length = 440

 Score =  414 bits (1063), Expect = e-113
 Identities = 220/399 (55%), Positives = 275/399 (68%), Gaps = 3/399 (0%)
 Frame = -3

Query: 2276 DIVESSLDLQSSTSLGVEARTSGASLATGICTHPGVIGGLCIRCGQKMDDESGVALGYIH 2097
            D VES L  Q    L        AS + G C HPG  G +C  CGQK++ E+GV+  YIH
Sbjct: 44   DDVESGLKRQKLEHL------EEASSSKGECEHPGSFGNMCFVCGQKLE-ETGVSFRYIH 96

Query: 2096 KXXXXXXXXXXXXXXXXXXXXXXXXXLYLVLDLDHTLVNSSLLDDINGSEAYLKGPRDDL 1917
            K                         LYLVLDLDHTL+N+++L D+   E YLK     L
Sbjct: 97   KEMRLNEDEISRLRDSDSRFLQRQRKLYLVLDLDHTLLNTTILRDLKPEEEYLKSHTHSL 156

Query: 1916 PDALK---SSIYRLESMRMMTKLRPFVHSFLKEASKMFEMYIYTMGDRNYALEMAKLLDP 1746
             D       S++ LE M+MMTKLRPFVHSFLKEAS+MF MYIYTMGDRNYA +MAKLLDP
Sbjct: 157  QDGCNVSGGSLFLLEFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDP 216

Query: 1745 EDVYFSSRVIANVNSTQRFQKGLDVVLGKESAVLILDDTEKVWEAHPENLILMERYHFFA 1566
            +  YF  RVI+  + T R +K LDVVLG+ESAVLILDDTE  W  H +NLI++ERYHFF+
Sbjct: 217  KGEYFGDRVISRDDGTVRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFS 276

Query: 1565 SSLRQFHHPSLQSLSELRSDESEVDGALATVLKVLQKIHSTFFNTEHMVGLVNGDVRQVL 1386
            SS RQF H   +SLSEL+SDESE DGALATVLKVL++ H+ FF      G+ N DVR +L
Sbjct: 277  SSCRQFDH-RYKSLSELKSDESEPDGALATVLKVLKQAHALFFENVD-EGISNRDVRLML 334

Query: 1385 ENVRKEVLKGCTLVFSRVFPIESRAENQQYWKLAEQLGATCSTKLDPSITHVVSEDSGTE 1206
            + VRKE+LKGC +VFSRVFP +++ E+   WK+AE+LGATC+T++D S+THVV+ D GTE
Sbjct: 335  KQVRKEILKGCKIVFSRVFPTKAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTE 394

Query: 1205 KSRWAIQEKKFLVNPRWIDAARYLWQKQPEEKYPVDHLK 1089
            K+RWA++EKK++V+  WIDAA YLW KQPEE + ++ LK
Sbjct: 395  KARWAVREKKYVVHRGWIDAANYLWMKQPEENFGLEQLK 433


Top