BLASTX nr result

ID: Cephaelis21_contig00012036

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00012036
         (1961 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative ...   466   e-128
ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal doma...   461   e-127
ref|XP_003550691.1| PREDICTED: RNA polymerase II C-terminal doma...   446   e-122
ref|XP_003518215.1| PREDICTED: RNA polymerase II C-terminal doma...   435   e-119
dbj|BAB08870.1| unnamed protein product [Arabidopsis thaliana]        424   e-116

>ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
            gi|223534449|gb|EEF36151.1| RNA polymerase II ctd
            phosphatase, putative [Ricinus communis]
          Length = 478

 Score =  466 bits (1199), Expect = e-128
 Identities = 251/477 (52%), Positives = 322/477 (67%), Gaps = 17/477 (3%)
 Frame = -2

Query: 1783 MSLVADSPLHSS-STSGDDYAAFLELELDS---DSDTSPKXXXXXXXXXXXXXXXXXXXX 1616
            MSLV DSPLHSS S+S DD+AA L+ ELDS    SD+SPK                    
Sbjct: 1    MSLVTDSPLHSSHSSSSDDFAALLDAELDSKSSSSDSSPKAIKHDDASDANDDVNEEEEE 60

Query: 1615 G-------------RNKRRKVEILESSLDLQSSTSVGVEAQTSGSSSEKDICEHPAVWNK 1475
                          R KR +VE LE+  + + ST V ++ QT  +SS K  C HP  +  
Sbjct: 61   EESDSDDDSDIATNRIKRSRVETLENGENPKESTRVSLD-QTLVASSSKVACTHPGSFGD 119

Query: 1474 LCVRCGQKTDDESGVAFGYIHKNLRLSNEELARLREXXXXXXXXXXXXXXXXXXXXXXLN 1295
            +C+ CG++  +E+GV FGYIHK LRL+N+E+ RLR                       LN
Sbjct: 120  MCILCGERLIEETGVTFGYIHKGLRLANDEIVRLRNTDMKNLLRHRKLYLVLDLDHTLLN 179

Query: 1294 SSLLDDINGDEEYLKGPKEGLPDTVKNSLHRLDKMRMMTKLRPFVHSFLEEASKLFEMYI 1115
            S+ L  +  +EEYLK   + + D    SL  +D M MMTKLRPF+ +FL+EAS++FEMYI
Sbjct: 180  STQLMHLTAEEEYLKSQIDSMQDVSNGSLFMVDFMHMMTKLRPFIRTFLKEASQMFEMYI 239

Query: 1114 YTMGDRNYALEMANLLDPKGIYFNSRVIANVNSTKRYQKGLDVVLGEESAVLILDDTEKV 935
            YTMGDR YALEMA  LDP   YFN+RVI+  + T+R+QKGLD+VLG+ESAVLILDDTE  
Sbjct: 240  YTMGDRAYALEMAKFLDPGREYFNARVISRDDGTQRHQKGLDIVLGQESAVLILDDTENA 299

Query: 934  WEGHLENLILMDRYHFFASSLRQFNFPSQKSLSELRSDESEVDGALATVLRVLQKIHSNF 755
            W  H +NLILM+RYHFFASS RQF F   KSLS+L+SDE+E DGALA+VL+VL++IH  F
Sbjct: 300  WTKHKDNLILMERYHFFASSCRQFGFEC-KSLSQLKSDENESDGALASVLKVLRRIHHIF 358

Query: 754  FDSEQKESLEDQDVRQGLGTVRKEVLKGCKIVFSRVFSRDSLAENQQYWKLAEQLGATCT 575
            FD E +++++ +DVRQ L TVRK+VLKGCKIVFSRVF     A+N   WK+AEQLGATC+
Sbjct: 359  FD-ELEDAIDGRDVRQVLSTVRKDVLKGCKIVFSRVFPTQFQADNHHLWKMAEQLGATCS 417

Query: 574  KELDPSITHVVSEDSGTEKSRWAVQEKKFLVNPRWIDAARYLWQKQPEENYPVKHSR 404
            +E+DPS+THVVS ++GTEKSRWA++  KFLV+PRWI+A  Y+WQ+QPEEN+ V   +
Sbjct: 418  REVDPSVTHVVSAEAGTEKSRWALKNDKFLVHPRWIEATNYMWQRQPEENFSVNQPK 474


>ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Cucumis sativus]
          Length = 452

 Score =  461 bits (1185), Expect = e-127
 Identities = 249/461 (54%), Positives = 316/461 (68%), Gaps = 1/461 (0%)
 Frame = -2

Query: 1783 MSLVADSPLHSSSTSGDDYAAFLELELDS-DSDTSPKXXXXXXXXXXXXXXXXXXXXGRN 1607
            MSL  +SP HSSS+  DD+AAFL ++LDS  SD+SP                      R 
Sbjct: 1    MSLATNSPAHSSSS--DDFAAFLAVDLDSHSSDSSPDEETEGDNNAESV---------RI 49

Query: 1606 KRRKVEILESSLDLQSSTSVGVEAQTSGSSSEKDICEHPAVWNKLCVRCGQKTDDESGVA 1427
            KRRKVE LE+S   +      VE Q+    S++ +C HP  +  +C+ CGQ+ D+ESGV 
Sbjct: 50   KRRKVEKLENS---EEDIMHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVT 106

Query: 1426 FGYIHKNLRLSNEELARLREXXXXXXXXXXXXXXXXXXXXXXLNSSLLDDINGDEEYLKG 1247
            FGYIHK LRL+N+E+ R+R                       LNS+ L  +  +EEYL+ 
Sbjct: 107  FGYIHKELRLNNDEINRMRNKEMKELLQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRS 166

Query: 1246 PKEGLPDTVKNSLHRLDKMRMMTKLRPFVHSFLEEASKLFEMYIYTMGDRNYALEMANLL 1067
              + L D  K SL  L+ +  MTKLRPFVHSFL+EASKLFEMYIYTMG+R YA EMA LL
Sbjct: 167  QTDSLDDVTKGSLFLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLL 226

Query: 1066 DPKGIYFNSRVIANVNSTKRYQKGLDVVLGEESAVLILDDTEKVWEGHLENLILMDRYHF 887
            DPK  YF+S+VI+  + T+++QKGLDVVLG+ESAVLILDDTE  W  H ENLILM+RYHF
Sbjct: 227  DPKKEYFSSKVISRDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHF 286

Query: 886  FASSLRQFNFPSQKSLSELRSDESEVDGALATVLRVLQKIHSNFFDSEQKESLEDQDVRQ 707
            FASS RQF F + KSLSEL++DESE DGAL T+L+VL+++H  FF +E    L D+DVRQ
Sbjct: 287  FASSCRQFGF-NCKSLSELKNDESETDGALTTILKVLKQVHHMFF-NEVSGDLVDRDVRQ 344

Query: 706  GLGTVRKEVLKGCKIVFSRVFSRDSLAENQQYWKLAEQLGATCTKELDPSITHVVSEDSG 527
             L TVR EVL+GCK+VFSRVF     AEN Q WK+ EQLG TC+ ELD S+THVV+ D+G
Sbjct: 345  VLKTVRAEVLEGCKVVFSRVFPTKFQAENHQLWKMVEQLGGTCSTELDQSVTHVVATDAG 404

Query: 526  TEKSRWAVQEKKFLVNPRWIDAARYLWQKQPEENYPVKHSR 404
            TEKSRWA++EKKFLV+PRWI+A+ Y W++Q EEN+ V+ ++
Sbjct: 405  TEKSRWALKEKKFLVHPRWIEASNYFWKRQMEENFTVEQTK 445


>ref|XP_003550691.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Glycine max]
          Length = 442

 Score =  446 bits (1146), Expect = e-122
 Identities = 241/459 (52%), Positives = 312/459 (67%), Gaps = 2/459 (0%)
 Frame = -2

Query: 1783 MSLVADSPLHSSSTSGDDYAAFLELELDSDS-DTSPKXXXXXXXXXXXXXXXXXXXXGRN 1607
            MS+V DSP+HSSS+  DD+ AFL+ ELD+ S D+SP                      R 
Sbjct: 1    MSVVTDSPVHSSSS--DDFIAFLDAELDASSPDSSPDKEVVKQDDELQSV--------RT 50

Query: 1606 KRRKVEILESSLDLQSSTSVGVEAQTSGSSSEKDIC-EHPAVWNKLCVRCGQKTDDESGV 1430
            KRRK E +E +   + STS G+  ++  +SSE D+C  HP  +  +C+RCGQK D ESGV
Sbjct: 51   KRRKFESIEET---EGSTSEGIVKRSLEASSEVDVCCTHPGSFGNMCIRCGQKLDGESGV 107

Query: 1429 AFGYIHKNLRLSNEELARLREXXXXXXXXXXXXXXXXXXXXXXLNSSLLDDINGDEEYLK 1250
             FGYIHK LRL +EE++RLR                       LNS+ L  +  +E +L 
Sbjct: 108  TFGYIHKGLRLHDEEISRLRNTDMKSLLGRKKLYLVLDLDHTLLNSTHLAQLTSEELHLL 167

Query: 1249 GPKEGLPDTVKNSLHRLDKMRMMTKLRPFVHSFLEEASKLFEMYIYTMGDRNYALEMANL 1070
               + L +  K SL +L+ M MMTKLRPFV  FL+EAS++FEMYIYTMGDR YALEMA L
Sbjct: 168  NQTDSLTNVSKGSLFKLEHMNMMTKLRPFVRPFLKEASEMFEMYIYTMGDRPYALEMAKL 227

Query: 1069 LDPKGIYFNSRVIANVNSTKRYQKGLDVVLGEESAVLILDDTEKVWEGHLENLILMDRYH 890
            LDP+G YFN++VI+  + T+++QKGLDVVLG+ESAV+ILDDTE  W  H +NLILM+RYH
Sbjct: 228  LDPQGEYFNAKVISRDDGTQKHQKGLDVVLGQESAVIILDDTEHAWMKHKDNLILMERYH 287

Query: 889  FFASSLRQFNFPSQKSLSELRSDESEVDGALATVLRVLQKIHSNFFDSEQKESLEDQDVR 710
            FF SS RQF F + KSL+EL+SDE E DGALA +L+VL+++H  FFD  ++E  +DQDVR
Sbjct: 288  FFGSSCRQFGF-NCKSLAELKSDEDETDGALAKILKVLKQVHCMFFD--KQEDFDDQDVR 344

Query: 709  QGLGTVRKEVLKGCKIVFSRVFSRDSLAENQQYWKLAEQLGATCTKELDPSITHVVSEDS 530
            Q L +VR+EVL GC I+FSR+             K+AEQ+GATC  E+DPS+THVV+ D+
Sbjct: 345  QVLSSVRREVLSGCVIIFSRIVH----GAIPSLRKMAEQMGATCLTEIDPSVTHVVATDA 400

Query: 529  GTEKSRWAVQEKKFLVNPRWIDAARYLWQKQPEENYPVK 413
            GTEK RWAV+EKKF+V+P WI+AA Y WQKQPEEN+ +K
Sbjct: 401  GTEKCRWAVKEKKFVVHPLWIEAANYFWQKQPEENFSLK 439


>ref|XP_003518215.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Glycine max]
          Length = 428

 Score =  435 bits (1118), Expect = e-119
 Identities = 238/458 (51%), Positives = 304/458 (66%), Gaps = 1/458 (0%)
 Frame = -2

Query: 1783 MSLVADSPLHSSSTSGDDYAAFLELELDSDS-DTSPKXXXXXXXXXXXXXXXXXXXXGRN 1607
            MS+V DSP+HSSS+  DD+ AFL+ ELD+ S D+SP                        
Sbjct: 1    MSVVTDSPVHSSSS--DDFIAFLDAELDASSPDSSPDKEVEKQDDDELESGI-------- 50

Query: 1606 KRRKVEILESSLDLQSSTSVGVEAQTSGSSSEKDICEHPAVWNKLCVRCGQKTDDESGVA 1427
            KRRK E +E               +T GS+SE  +C HP  +  +C+RCGQK D ESGV 
Sbjct: 51   KRRKFESIE---------------ETEGSTSE-GVCTHPGSFGNMCIRCGQKLDGESGVT 94

Query: 1426 FGYIHKNLRLSNEELARLREXXXXXXXXXXXXXXXXXXXXXXLNSSLLDDINGDEEYLKG 1247
            FGYIHK LRL +EE++RLR                       LNS+ L  +  +E +L  
Sbjct: 95   FGYIHKGLRLHDEEISRLRNTDMKSLLCRKKLYLVLDLDHTLLNSTHLAHLTSEESHLLN 154

Query: 1246 PKEGLPDTVKNSLHRLDKMRMMTKLRPFVHSFLEEASKLFEMYIYTMGDRNYALEMANLL 1067
              + L D  K SL +L+ M MMTKLRPFV  FL+EAS++FEMYIYTMGDR YALEMA LL
Sbjct: 155  QTDSLRDVSKGSLFKLEHMNMMTKLRPFVRPFLKEASEMFEMYIYTMGDRPYALEMAKLL 214

Query: 1066 DPKGIYFNSRVIANVNSTKRYQKGLDVVLGEESAVLILDDTEKVWEGHLENLILMDRYHF 887
            DP+G YFN++VI+  + T+++QKGLDVVLG+ESAVLILDDTE  W  H +NLILM+RYHF
Sbjct: 215  DPQGEYFNAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTEHAWMKHKDNLILMERYHF 274

Query: 886  FASSLRQFNFPSQKSLSELRSDESEVDGALATVLRVLQKIHSNFFDSEQKESLEDQDVRQ 707
            F SS RQF F + KSL+EL+SDE+E DGALA +L+VL+++H  FFD  ++E  +D+DVRQ
Sbjct: 275  FGSSCRQFGF-NCKSLAELKSDENETDGALAKILKVLKQVHCMFFD--KQEDFDDRDVRQ 331

Query: 706  GLGTVRKEVLKGCKIVFSRVFSRDSLAENQQYWKLAEQLGATCTKELDPSITHVVSEDSG 527
             L  VR+EVL GC I+FSR+             K+AEQ+GATC  E+DPS+THVV+ D+G
Sbjct: 332  MLSLVRREVLSGCVIIFSRIVH----GAIPSLRKMAEQMGATCLTEIDPSVTHVVATDAG 387

Query: 526  TEKSRWAVQEKKFLVNPRWIDAARYLWQKQPEENYPVK 413
            TEK RWAV+EKKF+V+P WI+AA Y WQKQPEEN+ +K
Sbjct: 388  TEKCRWAVKEKKFVVHPLWIEAANYFWQKQPEENFILK 425


>dbj|BAB08870.1| unnamed protein product [Arabidopsis thaliana]
          Length = 1065

 Score =  424 bits (1090), Expect = e-116
 Identities = 236/459 (51%), Positives = 303/459 (66%), Gaps = 3/459 (0%)
 Frame = -2

Query: 1789 YEMSLVADSPLHSSSTSGDDYAAFLELELDSDSDTSPKXXXXXXXXXXXXXXXXXXXXGR 1610
            ++MS+ +DSP+HSSS+S DD AAFL+ ELDS SD S                        
Sbjct: 624  FKMSVASDSPVHSSSSS-DDLAAFLDAELDSASDASSGPSEEEEAEDDVESGL------- 675

Query: 1609 NKRRKVEILESSLDLQSSTSVGVEAQTSGSSSEKDICEHPAVWNKLCVRCGQKTDDESGV 1430
             KR+K+E LE +                  SS K  CEHP  +  +C  CGQK + E+GV
Sbjct: 676  -KRQKLEHLEEA------------------SSSKGECEHPGSFGNMCFVCGQKLE-ETGV 715

Query: 1429 AFGYIHKNLRLSNEELARLREXXXXXXXXXXXXXXXXXXXXXXLNSSLLDDINGDEEYLK 1250
            +F YIHK +RL+ +E++RLR+                      LN+++L D+  +EEYLK
Sbjct: 716  SFRYIHKEMRLNEDEISRLRDSDSRFLQRQRKLYLVLDLDHTLLNTTILRDLKPEEEYLK 775

Query: 1249 GPKEGLPDTVK---NSLHRLDKMRMMTKLRPFVHSFLEEASKLFEMYIYTMGDRNYALEM 1079
                 L D       SL  L+ M+MMTKLRPFVHSFL+EAS++F MYIYTMGDRNYA +M
Sbjct: 776  SHTHSLQDGCNVSGGSLFLLEFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQM 835

Query: 1078 ANLLDPKGIYFNSRVIANVNSTKRYQKGLDVVLGEESAVLILDDTEKVWEGHLENLILMD 899
            A LLDPKG YF  RVI+  + T R++K LDVVLG+ESAVLILDDTE  W  H +NLI+++
Sbjct: 836  AKLLDPKGEYFGDRVISRDDGTVRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIE 895

Query: 898  RYHFFASSLRQFNFPSQKSLSELRSDESEVDGALATVLRVLQKIHSNFFDSEQKESLEDQ 719
            RYHFF+SS RQF+    KSLSEL+SDESE DGALATVL+VL++ H+ FF++   E + ++
Sbjct: 896  RYHFFSSSCRQFDH-RYKSLSELKSDESEPDGALATVLKVLKQAHALFFENVD-EGISNR 953

Query: 718  DVRQGLGTVRKEVLKGCKIVFSRVFSRDSLAENQQYWKLAEQLGATCTKELDPSITHVVS 539
            DVR  L  VRKE+LKGCKIVFSRVF   +  E+   WK+AE+LGATC  E+D S+THVV+
Sbjct: 954  DVRLMLKQVRKEILKGCKIVFSRVFPTKAKPEDHPLWKMAEELGATCATEVDASVTHVVA 1013

Query: 538  EDSGTEKSRWAVQEKKFLVNPRWIDAARYLWQKQPEENY 422
             D GTEK+RWAV+EKK++V+  WIDAA YLW KQPEEN+
Sbjct: 1014 MDVGTEKARWAVREKKYVVHRGWIDAANYLWMKQPEENF 1052

