BLASTX nr result

ID: Cocculus23_contig00017646

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00017646
         (1220 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal doma...   127   1e-26
ref|XP_007043830.1| RNA polymerase II C-terminal domain phosphat...   121   5e-25
ref|XP_002304648.2| hypothetical protein POPTR_0003s16280g [Popu...   116   2e-23
ref|XP_006438860.1| hypothetical protein CICLE_v10030535mg [Citr...   114   8e-23
ref|XP_006438858.1| hypothetical protein CICLE_v10030535mg [Citr...   114   8e-23
ref|XP_006438857.1| hypothetical protein CICLE_v10030535mg [Citr...   114   8e-23
ref|XP_004157633.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymera...   108   6e-21
ref|XP_004140651.1| PREDICTED: RNA polymerase II C-terminal doma...   108   6e-21
ref|XP_006341905.1| PREDICTED: RNA polymerase II C-terminal doma...   102   4e-19
ref|XP_004252660.1| PREDICTED: RNA polymerase II C-terminal doma...   100   1e-18
gb|AAV92930.1| putative transcription regulator CPL1 [Solanum ly...   100   1e-18
gb|EXB81217.1| RNA polymerase II C-terminal domain phosphatase-l...    99   3e-18
ref|XP_004310239.1| PREDICTED: RNA polymerase II C-terminal doma...    99   3e-18
gb|EYU42076.1| hypothetical protein MIMGU_mgv1a000356mg [Mimulus...    97   1e-17
gb|ABA93957.1| NLI interacting factor-like phosphatase family pr...    96   2e-17
ref|XP_002449554.1| hypothetical protein SORBIDRAFT_05g019010 [S...    94   9e-17
ref|XP_006603006.1| PREDICTED: RNA polymerase II C-terminal doma...    94   2e-16
ref|XP_003530482.2| PREDICTED: RNA polymerase II C-terminal doma...    92   6e-16
emb|CBI35661.3| unnamed protein product [Vitis vinifera]               86   2e-14
ref|XP_004492028.1| PREDICTED: RNA polymerase II C-terminal doma...    86   3e-14

>ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Vitis vinifera]
          Length = 1238

 Score =  127 bits (319), Expect = 1e-26
 Identities = 100/276 (36%), Positives = 140/276 (50%), Gaps = 11/276 (3%)
 Frame = +2

Query: 419  SVEEISAEDF-KQEAKVSSKPKTTSDSGVW----MGDLLKY-RVSSNYGHGFHNLAWAQA 580
            SVEEIS EDF KQE +V  + K  +D+ VW    + DL KY +  S Y    +NLAWAQA
Sbjct: 25   SVEEISEEDFNKQEVRVLREAKPKADTRVWTMRDLQDLYKYHQACSGYTPRLYNLAWAQA 84

Query: 581  VQNKPLDEILVXXXXXXXXXXXXXXXXXXXXXXXDNNNNKSISSAKEVCNXXXXXXXXXX 760
            VQNKPL++I V                        N +    SSAKEV            
Sbjct: 85   VQNKPLNDIFVMDDEESKRSSSS-----------SNTSRDDSSSAKEVAKVIIDDSGDEM 133

Query: 761  XXXXXXNAVQXXXXXXXXXXXXIDMDSEMIGEEKEGTCGELDSSGRNLKDEEASEKKIQS 940
                   + +            ID+DSE   +++ G   +++    +LK+ E  E+ ++S
Sbjct: 134  DVKMDDVSEKEEGELEEGE---IDLDSEPDVKDEGGVL-DVNEPEIDLKERELVER-VKS 188

Query: 941  IQEVLQTVTVKDAEKSLHGVCSQLCACLDSMQEM-----VSDNCLSAIDALTQQAFTGFR 1105
            IQE L++VTV +AEKS  GVCS+L   L S+Q++     V ++ +   DAL QQ     R
Sbjct: 189  IQEDLESVTVIEAEKSFSGVCSRLQNTLGSLQKVFGEKVVGESSVPTKDALAQQLINAIR 248

Query: 1106 AVNTVFVSMNSKQQEQNKEAFSRLLHHIKSKAPALF 1213
            A+N VF SMNS Q+E NK+ FSRLL  ++     +F
Sbjct: 249  ALNHVFCSMNSNQKELNKDVFSRLLSCVECGDSPIF 284


>ref|XP_007043830.1| RNA polymerase II C-terminal domain phosphatase-like 3, putative
            [Theobroma cacao] gi|508707765|gb|EOX99661.1| RNA
            polymerase II C-terminal domain phosphatase-like 3,
            putative [Theobroma cacao]
          Length = 1290

 Score =  121 bits (304), Expect = 5e-25
 Identities = 96/283 (33%), Positives = 138/283 (48%), Gaps = 18/283 (6%)
 Frame = +2

Query: 419  SVEEISAEDF-KQEAKVSSKPKTT------SDSGVW-MGDLLKY-RVSSNYGHGFHNLAW 571
            S+EEIS EDF KQ+ K+  + K++      S+S VW M DL KY  V   Y  G +N AW
Sbjct: 44   SIEEISEEDFNKQDVKILKESKSSKGGEANSNSRVWTMQDLCKYPSVIRGYASGLYNFAW 103

Query: 572  AQAVQNKPLDEILVXXXXXXXXXXXXXXXXXXXXXXXDNNNNKSISSAKEVCNXXXXXXX 751
            AQAVQNKPL+EI V                       +N N+K  S +  V +       
Sbjct: 104  AQAVQNKPLNEIFV--------------KDFEQPQQDENKNSKRSSPSSSVASVNSKEEK 149

Query: 752  XXXXXXXXXNAVQXXXXXXXXXXXXIDMDSEMIGEEKEGTCGELDSSGR---------NL 904
                       +             +++D E  GE +EG   +LDS  +         N+
Sbjct: 150  GSSGNLAVKVVIDDDSEDEMEEDKVVNLDKEE-GELEEGEI-DLDSEPKEKVLSSEDGNV 207

Query: 905  KDEEASEKKIQSIQEVLQTVTVKDAEKSLHGVCSQLCACLDSMQEMVSDNCLSAIDALTQ 1084
             + +  EK+   I+ VL+ VTV +AEKS  GVCS+L   L+S++ ++ +  + A DAL Q
Sbjct: 208  GNSDELEKRANLIRGVLEGVTVIEAEKSFEGVCSRLHNALESLRALILECSVPAKDALIQ 267

Query: 1085 QAFTGFRAVNTVFVSMNSKQQEQNKEAFSRLLHHIKSKAPALF 1213
             A   F A+N+ FV++N   +EQN    SRLL  +K   P+LF
Sbjct: 268  LA---FGAINSAFVALNCNSKEQNVAILSRLLSIVKGHDPSLF 307


>ref|XP_002304648.2| hypothetical protein POPTR_0003s16280g [Populus trichocarpa]
            gi|550343308|gb|EEE79627.2| hypothetical protein
            POPTR_0003s16280g [Populus trichocarpa]
          Length = 1247

 Score =  116 bits (291), Expect = 2e-23
 Identities = 93/280 (33%), Positives = 138/280 (49%), Gaps = 13/280 (4%)
 Frame = +2

Query: 413  TQSVEEISAEDFKQEAKVSSK--PKTTSDSG------VW-MGDLLKYRVSSNYGHGFHNL 565
            T SVEEIS +DF ++  V  K  P +T+++       VW + DL KY+V   Y  G +NL
Sbjct: 28   TASVEEISEDDFNKQEVVVVKETPSSTTNNNSSSKQKVWTVRDLYKYQVGGGYMSGLYNL 87

Query: 566  AWAQAVQNKPLDEILVXXXXXXXXXXXXXXXXXXXXXXXDNNNNKSISSAKEVCNXXXXX 745
            AWAQAVQNKPL+E+ V                       D++   S+SS           
Sbjct: 88   AWAQAVQNKPLNELFVEVEVD------------------DSSQKSSVSSVNS-------- 121

Query: 746  XXXXXXXXXXXNAVQXXXXXXXXXXXXIDMDSEMIGEEKEGTCGELDSSGRNLKDEEA-- 919
                         V             ID++ E  GE +EG   +LDS G++     +  
Sbjct: 122  -----SKEDKRTVVIDDSGDEMDVVKVIDIEKEE-GELEEGEI-DLDSEGKSEGGMVSVD 174

Query: 920  SEKKIQSIQEVLQTVTVKDAEKSLHGVCSQLCACLDSMQEMV--SDNCLSAIDALTQQAF 1093
            +EK+++SI+E L++V+V   +KS   VC +L   L+S++E+V  ++N   + D+L +  F
Sbjct: 175  TEKRVKSIREDLESVSVIKDDKSFEAVCLKLHNALESLKELVRVNENGFPSKDSLVRLLF 234

Query: 1094 TGFRAVNTVFVSMNSKQQEQNKEAFSRLLHHIKSKAPALF 1213
            T   AVN+ F SMN K +EQNK  F R L  + S  P+ F
Sbjct: 235  TAIGAVNSFFSSMNQKLKEQNKGVFMRFLSLVNSHDPSFF 274


>ref|XP_006438860.1| hypothetical protein CICLE_v10030535mg [Citrus clementina]
            gi|568858958|ref|XP_006483010.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 3-like
            [Citrus sinensis] gi|557541056|gb|ESR52100.1|
            hypothetical protein CICLE_v10030535mg [Citrus
            clementina]
          Length = 1234

 Score =  114 bits (285), Expect = 8e-23
 Identities = 101/288 (35%), Positives = 135/288 (46%), Gaps = 21/288 (7%)
 Frame = +2

Query: 413  TQSVEEISAEDFK----QEAKVSSKPKTTSDSG------VW-MGDLL-KY-RVSSNYGHG 553
            T SVEEIS EDFK    +  KV  + K     G      VW M DL  KY  +   YG G
Sbjct: 13   TASVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNKYPAICRGYGPG 72

Query: 554  FHNLAWAQAVQNKPLDEILVXXXXXXXXXXXXXXXXXXXXXXXDNNNNKSISSAKEVCNX 733
             HNLAWAQAVQNKPL+EI V                        ++ +K  S A  V + 
Sbjct: 73   LHNLAWAQAVQNKPLNEIFVMEAE-------------------QDDVSKRSSPASSVASV 113

Query: 734  XXXXXXXXXXXXXXXNAVQXXXXXXXXXXXXIDMDSEMIGEEKEGTCGELDSSGRNLKDE 913
                             V              D   E+  EE E   GE++    +  +E
Sbjct: 114  NSGAAAGKDDKKVVEKVVID------------DSGDEIEKEEGELEEGEIELDLESESNE 161

Query: 914  EASEK--------KIQSIQEVLQTVTVKDAEKSLHGVCSQLCACLDSMQEMVSDNCLSAI 1069
            + SE+         ++SI+E L++V   D   S  GVCS+L   L+S++E+V++N +   
Sbjct: 162  KVSEQVKEEMKLINVESIREALESVLRGDI--SFEGVCSKLEFTLESLRELVNENNVPTK 219

Query: 1070 DALTQQAFTGFRAVNTVFVSMNSKQQEQNKEAFSRLLHHIKSKAPALF 1213
            DAL Q AF+  ++V++VF SMN   +EQNKE  SRLL  IKS  P LF
Sbjct: 220  DALIQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSVIKSHEPPLF 267


>ref|XP_006438858.1| hypothetical protein CICLE_v10030535mg [Citrus clementina]
            gi|557541054|gb|ESR52098.1| hypothetical protein
            CICLE_v10030535mg [Citrus clementina]
          Length = 1208

 Score =  114 bits (285), Expect = 8e-23
 Identities = 101/288 (35%), Positives = 135/288 (46%), Gaps = 21/288 (7%)
 Frame = +2

Query: 413  TQSVEEISAEDFK----QEAKVSSKPKTTSDSG------VW-MGDLL-KY-RVSSNYGHG 553
            T SVEEIS EDFK    +  KV  + K     G      VW M DL  KY  +   YG G
Sbjct: 13   TASVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNKYPAICRGYGPG 72

Query: 554  FHNLAWAQAVQNKPLDEILVXXXXXXXXXXXXXXXXXXXXXXXDNNNNKSISSAKEVCNX 733
             HNLAWAQAVQNKPL+EI V                        ++ +K  S A  V + 
Sbjct: 73   LHNLAWAQAVQNKPLNEIFVMEAE-------------------QDDVSKRSSPASSVASV 113

Query: 734  XXXXXXXXXXXXXXXNAVQXXXXXXXXXXXXIDMDSEMIGEEKEGTCGELDSSGRNLKDE 913
                             V              D   E+  EE E   GE++    +  +E
Sbjct: 114  NSGAAAGKDDKKVVEKVVID------------DSGDEIEKEEGELEEGEIELDLESESNE 161

Query: 914  EASEK--------KIQSIQEVLQTVTVKDAEKSLHGVCSQLCACLDSMQEMVSDNCLSAI 1069
            + SE+         ++SI+E L++V   D   S  GVCS+L   L+S++E+V++N +   
Sbjct: 162  KVSEQVKEEMKLINVESIREALESVLRGDI--SFEGVCSKLEFTLESLRELVNENNVPTK 219

Query: 1070 DALTQQAFTGFRAVNTVFVSMNSKQQEQNKEAFSRLLHHIKSKAPALF 1213
            DAL Q AF+  ++V++VF SMN   +EQNKE  SRLL  IKS  P LF
Sbjct: 220  DALIQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSVIKSHEPPLF 267


>ref|XP_006438857.1| hypothetical protein CICLE_v10030535mg [Citrus clementina]
            gi|567892677|ref|XP_006438859.1| hypothetical protein
            CICLE_v10030535mg [Citrus clementina]
            gi|557541053|gb|ESR52097.1| hypothetical protein
            CICLE_v10030535mg [Citrus clementina]
            gi|557541055|gb|ESR52099.1| hypothetical protein
            CICLE_v10030535mg [Citrus clementina]
          Length = 1118

 Score =  114 bits (285), Expect = 8e-23
 Identities = 101/288 (35%), Positives = 135/288 (46%), Gaps = 21/288 (7%)
 Frame = +2

Query: 413  TQSVEEISAEDFK----QEAKVSSKPKTTSDSG------VW-MGDLL-KY-RVSSNYGHG 553
            T SVEEIS EDFK    +  KV  + K     G      VW M DL  KY  +   YG G
Sbjct: 13   TASVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNKYPAICRGYGPG 72

Query: 554  FHNLAWAQAVQNKPLDEILVXXXXXXXXXXXXXXXXXXXXXXXDNNNNKSISSAKEVCNX 733
             HNLAWAQAVQNKPL+EI V                        ++ +K  S A  V + 
Sbjct: 73   LHNLAWAQAVQNKPLNEIFVMEAE-------------------QDDVSKRSSPASSVASV 113

Query: 734  XXXXXXXXXXXXXXXNAVQXXXXXXXXXXXXIDMDSEMIGEEKEGTCGELDSSGRNLKDE 913
                             V              D   E+  EE E   GE++    +  +E
Sbjct: 114  NSGAAAGKDDKKVVEKVVID------------DSGDEIEKEEGELEEGEIELDLESESNE 161

Query: 914  EASEK--------KIQSIQEVLQTVTVKDAEKSLHGVCSQLCACLDSMQEMVSDNCLSAI 1069
            + SE+         ++SI+E L++V   D   S  GVCS+L   L+S++E+V++N +   
Sbjct: 162  KVSEQVKEEMKLINVESIREALESVLRGDI--SFEGVCSKLEFTLESLRELVNENNVPTK 219

Query: 1070 DALTQQAFTGFRAVNTVFVSMNSKQQEQNKEAFSRLLHHIKSKAPALF 1213
            DAL Q AF+  ++V++VF SMN   +EQNKE  SRLL  IKS  P LF
Sbjct: 220  DALIQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSVIKSHEPPLF 267


>ref|XP_004157633.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain
            phosphatase-like 3-like [Cucumis sativus]
          Length = 1249

 Score =  108 bits (269), Expect = 6e-21
 Identities = 85/287 (29%), Positives = 131/287 (45%), Gaps = 20/287 (6%)
 Frame = +2

Query: 413  TQSVEEISAEDF-KQEAKVSSKPKTTS-----DSGVW-MGDLLKYRVSSNYGH--GFHNL 565
            T SVEEIS EDF K ++  S K    S     ++ VW M DL K   +  +G+  G +NL
Sbjct: 20   TASVEEISEEDFNKLDSSASPKVVVPSKDSNRETRVWTMSDLYKNYPAMRHGYASGLYNL 79

Query: 566  AWAQAVQNKPLDEILVXXXXXXXXXXXXXXXXXXXXXXXDNNNNKS-----ISSAKEVCN 730
            AWAQAVQNKPL++I V                        +N  K      I  + +  N
Sbjct: 80   AWAQAVQNKPLNDIFVMEADLDEKSKHSSSTPFGNAKDDGSNTTKEEDRVVIDDSGDEMN 139

Query: 731  XXXXXXXXXXXXXXXXNAVQXXXXXXXXXXXXIDMDSEMIGEEKEGTCGELDSSGRNLKD 910
                            +               IDMD+E + E  +      DS   ++  
Sbjct: 140  C---------------DNANGEKEEGELEEGEIDMDTEFVEEVADSKAMLSDSRDMDING 184

Query: 911  EEAS------EKKIQSIQEVLQTVTVKDAEKSLHGVCSQLCACLDSMQEMVSDNCLSAID 1072
            +E        ++ ++ IQ+ L  VT+  A+KS   VCSQ+ + +++  E++    +   D
Sbjct: 185  QEFDLETKELDELLKFIQKTLDGVTIDAAQKSFQEVCSQIHSSIETFVELLQGKVVPRKD 244

Query: 1073 ALTQQAFTGFRAVNTVFVSMNSKQQEQNKEAFSRLLHHIKSKAPALF 1213
            AL Q+ +   R +N+VF SMN  ++E++KE  SRLL ++K+  P LF
Sbjct: 245  ALIQRLYAALRLINSVFCSMNLSEKEEHKEHLSRLLSYVKNCDPPLF 291


>ref|XP_004140651.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Cucumis sativus]
          Length = 1249

 Score =  108 bits (269), Expect = 6e-21
 Identities = 85/287 (29%), Positives = 131/287 (45%), Gaps = 20/287 (6%)
 Frame = +2

Query: 413  TQSVEEISAEDF-KQEAKVSSKPKTTS-----DSGVW-MGDLLKYRVSSNYGH--GFHNL 565
            T SVEEIS EDF K ++  S K    S     ++ VW M DL K   +  +G+  G +NL
Sbjct: 20   TASVEEISEEDFNKLDSSASPKVVVPSKDSNRETRVWTMSDLYKNYPAMRHGYASGLYNL 79

Query: 566  AWAQAVQNKPLDEILVXXXXXXXXXXXXXXXXXXXXXXXDNNNNKS-----ISSAKEVCN 730
            AWAQAVQNKPL++I V                        +N  K      I  + +  N
Sbjct: 80   AWAQAVQNKPLNDIFVMEADLDEKSKHSSSTPFGNAKDDGSNTTKEEDRVVIDDSGDEMN 139

Query: 731  XXXXXXXXXXXXXXXXNAVQXXXXXXXXXXXXIDMDSEMIGEEKEGTCGELDSSGRNLKD 910
                            +               IDMD+E + E  +      DS   ++  
Sbjct: 140  C---------------DNANGEKEEGELEEGEIDMDTEFVEEVADSKAMLSDSRDMDING 184

Query: 911  EEAS------EKKIQSIQEVLQTVTVKDAEKSLHGVCSQLCACLDSMQEMVSDNCLSAID 1072
            +E        ++ ++ IQ+ L  VT+  A+KS   VCSQ+ + +++  E++    +   D
Sbjct: 185  QEFDLETKELDELLKFIQKTLDGVTIDAAQKSFQEVCSQIHSSIETFVELLQGKVVPRKD 244

Query: 1073 ALTQQAFTGFRAVNTVFVSMNSKQQEQNKEAFSRLLHHIKSKAPALF 1213
            AL Q+ +   R +N+VF SMN  ++E++KE  SRLL ++K+  P LF
Sbjct: 245  ALIQRLYAALRLINSVFCSMNLSEKEEHKEHLSRLLSYVKNCDPPLF 291


>ref|XP_006341905.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Solanum tuberosum]
          Length = 1218

 Score =  102 bits (253), Expect = 4e-19
 Identities = 86/276 (31%), Positives = 125/276 (45%), Gaps = 12/276 (4%)
 Frame = +2

Query: 419  SVEEISAEDF-KQEAKVSSKPK----------TTSDSGVW-MGDLLKYRVSSNYGHGFHN 562
            SVEEIS + F +Q+   ++K K          +T+ + VW M D  KY +S +Y  G +N
Sbjct: 20   SVEEISEDAFNRQDPPTTTKIKIASNENQNQNSTTTTRVWTMRDAYKYPISRDYARGLYN 79

Query: 563  LAWAQAVQNKPLDEILVXXXXXXXXXXXXXXXXXXXXXXXDNNNNKSISSAKEVCNXXXX 742
            LAWAQAVQNKPLDE+ V                       DN+N  + ++A         
Sbjct: 80   LAWAQAVQNKPLDELFVMTS--------------------DNSNQCANANAN-------- 111

Query: 743  XXXXXXXXXXXXNAVQXXXXXXXXXXXXIDMDSEMIGEEKEGTCGELDSSGRNLKDEEAS 922
                                        +D D++  GE +EG   E+D    +L      
Sbjct: 112  ------------------VESKVIIDVDVDDDAKEEGELEEG---EIDLDAADLVLNFGK 150

Query: 923  EKKIQSIQEVLQTVTVKDAEKSLHGVCSQLCACLDSMQEMVSDNCLSAIDALTQQAFTGF 1102
            E     ++E LQ+VT+ +  KS   VCS+L   L ++ E+      +  D L Q   T  
Sbjct: 151  EANF--VREQLQSVTLDETHKSFSMVCSKLQTSLLALGELALSQDKN--DILIQLFMTAL 206

Query: 1103 RAVNTVFVSMNSKQQEQNKEAFSRLLHHIKSKAPAL 1210
            R +N+VF SMN  Q++QN +  SRLL H K++ PAL
Sbjct: 207  RTINSVFYSMNQDQKQQNTDILSRLLFHAKTQLPAL 242


>ref|XP_004252660.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Solanum lycopersicum]
          Length = 1211

 Score =  100 bits (250), Expect = 1e-18
 Identities = 86/279 (30%), Positives = 124/279 (44%), Gaps = 15/279 (5%)
 Frame = +2

Query: 419  SVEEISAEDFKQE----AKVSSKPKTTSDSG----------VW-MGDLLKYRVSSNYGHG 553
            SVEEIS + F ++       +SK K  S+            VW M D+ KY +S +Y  G
Sbjct: 20   SVEEISEDAFNRQDPPTTSTTSKIKIASNENQNQNSTTATRVWTMRDVYKYPISRDYARG 79

Query: 554  FHNLAWAQAVQNKPLDEILVXXXXXXXXXXXXXXXXXXXXXXXDNNNNKSISSAKEVCNX 733
             +NLAWAQAVQNKPLDE+ V                       DN+N  +   +K + + 
Sbjct: 80   LYNLAWAQAVQNKPLDELFVMTS--------------------DNSNQCANGESKVIIDV 119

Query: 734  XXXXXXXXXXXXXXXNAVQXXXXXXXXXXXXIDMDSEMIGEEKEGTCGELDSSGRNLKDE 913
                                           +D D++  GE +EG   E+D    +L   
Sbjct: 120  D------------------------------VDDDAKEEGELEEG---EIDLDSADLVVN 146

Query: 914  EASEKKIQSIQEVLQTVTVKDAEKSLHGVCSQLCACLDSMQEMVSDNCLSAIDALTQQAF 1093
               E     I+E LQ+VT+ +  KS   VCS+L   L ++ E+      +  D L Q   
Sbjct: 147  FGKEANF--IREQLQSVTLDETHKSFSMVCSKLQTSLLALGELALSQDKN--DILIQLFM 202

Query: 1094 TGFRAVNTVFVSMNSKQQEQNKEAFSRLLHHIKSKAPAL 1210
            T  R +N+VF SMN  Q++QN +  SRLL + K++ PAL
Sbjct: 203  TALRTINSVFYSMNDHQKQQNTDILSRLLFNAKTQLPAL 241


>gb|AAV92930.1| putative transcription regulator CPL1 [Solanum lycopersicum]
          Length = 1227

 Score =  100 bits (250), Expect = 1e-18
 Identities = 86/279 (30%), Positives = 124/279 (44%), Gaps = 15/279 (5%)
 Frame = +2

Query: 419  SVEEISAEDFKQE----AKVSSKPKTTSDSG----------VW-MGDLLKYRVSSNYGHG 553
            SVEEIS + F ++       +SK K  S+            VW M D+ KY +S +Y  G
Sbjct: 20   SVEEISEDAFNRQDPPTTSTTSKIKIASNENQNQNSTTATRVWTMRDVYKYPISRDYARG 79

Query: 554  FHNLAWAQAVQNKPLDEILVXXXXXXXXXXXXXXXXXXXXXXXDNNNNKSISSAKEVCNX 733
             +NLAWAQAVQNKPLDE+ V                       DN+N  +   +K + + 
Sbjct: 80   LYNLAWAQAVQNKPLDELFVMTS--------------------DNSNQCANGESKVIIDV 119

Query: 734  XXXXXXXXXXXXXXXNAVQXXXXXXXXXXXXIDMDSEMIGEEKEGTCGELDSSGRNLKDE 913
                                           +D D++  GE +EG   E+D    +L   
Sbjct: 120  D------------------------------VDDDAKEEGELEEG---EIDLDSADLVVN 146

Query: 914  EASEKKIQSIQEVLQTVTVKDAEKSLHGVCSQLCACLDSMQEMVSDNCLSAIDALTQQAF 1093
               E     I+E LQ+VT+ +  KS   VCS+L   L ++ E+      +  D L Q   
Sbjct: 147  FGKEANF--IREQLQSVTLDETHKSFSMVCSKLQTSLLALGELALSQDKN--DILIQLFM 202

Query: 1094 TGFRAVNTVFVSMNSKQQEQNKEAFSRLLHHIKSKAPAL 1210
            T  R +N+VF SMN  Q++QN +  SRLL + K++ PAL
Sbjct: 203  TALRTINSVFYSMNDHQKQQNTDILSRLLFNAKTQLPAL 241


>gb|EXB81217.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Morus
            notabilis]
          Length = 1301

 Score = 99.4 bits (246), Expect = 3e-18
 Identities = 94/295 (31%), Positives = 127/295 (43%), Gaps = 30/295 (10%)
 Frame = +2

Query: 419  SVEEISAEDF-KQEAKVSSKPKTTS------------DSGVW-MGDLL-KYRVSSNYGHG 553
            SVEEIS EDF KQE   +   K  S            DS VW M DL   Y     Y  G
Sbjct: 23   SVEEISEEDFNKQEGNGTGSGKVMSVSDSNSKESKFGDSRVWTMRDLYANYPGFRGYTTG 82

Query: 554  FHNLAWAQAVQNKPLDEILVXXXXXXXXXXXXXXXXXXXXXXXDNNNNKSISSAKEVCNX 733
             +NLAWAQAVQNKPL+EI V                       D+++   +SSA    N 
Sbjct: 83   LYNLAWAQAVQNKPLNEIFVMDVDA------------------DDSSRVVLSSASPAVNS 124

Query: 734  XXXXXXXXXXXXXXXNAV-----QXXXXXXXXXXXXIDMDSEMIGEE--KEGTCGELDSS 892
                             V                  ID++SE   +   +E   G+L+  
Sbjct: 125  GRREGKNGVKEVEKVEKVVIDDSADEMEEGELEEGEIDLESEPTQKPAGEEAKDGDLNCE 184

Query: 893  GRNLKDEEAS------EKKIQSIQEVLQTVTVKDAEKSLHGVCSQLCACLDSMQEMVSDN 1054
              N+   E        EK++  I E L +V V +AEKS   VCS+L   L+S++ ++S+ 
Sbjct: 185  AENVGGLEVDSRRDELEKRVDLIWETLGSVNVVNAEKSFEEVCSRLQRTLESLRGVLSEK 244

Query: 1055 CLS--AIDALTQQAFTGFRAVNTVFVSMNSKQQEQNKEAFSRLLHHIKSKAPALF 1213
              S    D + Q + T  + VN+VF SM+  Q+EQ KE  SRL   +K+    LF
Sbjct: 245  EFSFPTKDVVIQMSITAIQVVNSVFCSMSVNQKEQKKETLSRLFCSVKNCGTPLF 299


>ref|XP_004310239.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Fragaria vesca subsp. vesca]
          Length = 1230

 Score = 99.4 bits (246), Expect = 3e-18
 Identities = 77/267 (28%), Positives = 123/267 (46%), Gaps = 6/267 (2%)
 Frame = +2

Query: 413  TQSVEEISAEDFKQEAKVSSKPKTTSDSGVWMGDLLKYRVSSNY------GHGFHNLAWA 574
            + SVEEIS EDF ++   + +PK+   SG        + V ++       G G  NLAWA
Sbjct: 21   SNSVEEISEEDFVKQESKAVEPKSNGGSGDGARFWTFHEVLAHPHFRGIGGGGLANLAWA 80

Query: 575  QAVQNKPLDEILVXXXXXXXXXXXXXXXXXXXXXXXDNNNNKSISSAKEVCNXXXXXXXX 754
            QAVQNKP +++LV                             S+SS  E           
Sbjct: 81   QAVQNKPFNDLLVKLDSDEKSKQ-------------QQQQRSSVSSGNE----------- 116

Query: 755  XXXXXXXXNAVQXXXXXXXXXXXXIDMDSEMIGEEKEGTCGELDSSGRNLKDEEASEKKI 934
                    + +             I  DSE         CG+ D +  ++ +    EK++
Sbjct: 117  KVVIIDSGDEMDVEKEEEELEEGEIGFDSE---------CGDNDKAAGSVGNG-VWEKRV 166

Query: 935  QSIQEVLQTVTVKDAEKSLHGVCSQLCACLDSMQEMVSDNCLSAIDALTQQAFTGFRAVN 1114
              ++E L+++T+ +AEKS   VC +    L+S++ ++S+  +S  +AL QQ F   RA++
Sbjct: 167  NLLREALESLTITEAEKSFGDVCHRFLDSLESLRGVLSEINVSTKEALVQQLFNAVRAIS 226

Query: 1115 TVFVSMNSKQQEQNKEAFSRLLHHIKS 1195
            +VF SM++ Q+EQNK+  SR+L   KS
Sbjct: 227  SVFRSMSADQKEQNKDVLSRILSSAKS 253


>gb|EYU42076.1| hypothetical protein MIMGU_mgv1a000356mg [Mimulus guttatus]
          Length = 1220

 Score = 97.1 bits (240), Expect = 1e-17
 Identities = 83/296 (28%), Positives = 126/296 (42%), Gaps = 31/296 (10%)
 Frame = +2

Query: 419  SVEEISAEDFKQEAKVSSKPK----------------TTSDSG--------------VW- 505
            S+EEIS EDF  +  +   P                  TS++               VW 
Sbjct: 28   SIEEISEEDFNAKQALQPSPPPAPPLKSSLNSSHINVVTSNNNNNNSNNSAGGGGARVWT 87

Query: 506  MGDLLKYRVSSNYGHGFHNLAWAQAVQNKPLDEILVXXXXXXXXXXXXXXXXXXXXXXXD 685
            M DL +Y+V+S +  G +NLAWAQAV NK LDE+L+                       D
Sbjct: 88   MKDLYEYQVASKHYPGLYNLAWAQAVNNKSLDEVLMMKE--------------------D 127

Query: 686  NNNNKSISSAKEVCNXXXXXXXXXXXXXXXXNAVQXXXXXXXXXXXXIDMDSEMIGEEKE 865
             NN++S     +  +                  V+            ID+DSE++     
Sbjct: 128  GNNDRSNGGISDTSSSKSSKTNDSKVVIDVE--VEGGMEEGELEEGEIDLDSELVVRN-- 183

Query: 866  GTCGELDSSGRNLKDEEASEKKIQSIQEVLQTVTVKDAEKSLHGVCSQLCACLDSMQEMV 1045
                 +D +     +E++  +++ SI+  L+++ V DA  S H +CS L   + S+QEMV
Sbjct: 184  -----MDFNVETNSNEKS--RRVDSIKRELESLNVADAIISYHRLCSSLKNTIVSLQEMV 236

Query: 1046 SDNCLSAIDALTQQAFTGFRAVNTVFVSMNSKQQEQNKEAFSRLLHHIKSKAPALF 1213
             +   +  D L Q   T  + + +VF SM+ K +EQNK   SRLL  + S  P LF
Sbjct: 237  LEGSFAEKDTLVQLLLTAIQTLYSVFSSMSPKLKEQNKPILSRLLARVTSLKPPLF 292


>gb|ABA93957.1| NLI interacting factor-like phosphatase family protein, expressed
            [Oryza sativa Japonica Group]
          Length = 1272

 Score = 96.3 bits (238), Expect = 2e-17
 Identities = 73/270 (27%), Positives = 118/270 (43%), Gaps = 10/270 (3%)
 Frame = +2

Query: 419  SVEEISAEDFKQEAKVSSKPKTTSD----SGVWMGDLLKYRVSSNYGHGFHNLAWAQAVQ 586
            S+EEISA+DFK+E+  +      +     S VWMG    Y +  +Y   FH+ AWAQAVQ
Sbjct: 56   SLEEISADDFKKESSAAGGAAAAAAAQQRSRVWMG----YNIPRSYAPAFHSFAWAQAVQ 111

Query: 587  NKPLDEILVXXXXXXXXXXXXXXXXXXXXXXXDNNNNKSISSAKEVCNXXXXXXXXXXXX 766
            NKPL                            D  +   +    +  +            
Sbjct: 112  NKPL-----------------------VPRAADAADEDEVEHVVDTSDEEKEEGEIEEGE 148

Query: 767  XXXXNAVQXXXXXXXXXXXXIDMDSEMIGEEKEGTCGELDSSGR---NLKDEEAS-EKKI 934
                                ID+DS+    EK  +   +   G      ++EE   ++++
Sbjct: 149  AVQTTTTSSSSPPCAQPPETIDLDSD--APEKSESMVAMYGGGAAPAGAEEEEVDFDQRV 206

Query: 935  QSIQEVLQTVTVKDAEKSLHGVCSQLCACLDSMQEMV--SDNCLSAIDALTQQAFTGFRA 1108
             SI E L+ V++++AEKS  G C++L  C ++++ +   S + +  +DAL QQAF G   
Sbjct: 207  GSILEELEMVSIEEAEKSFEGACTRLRTCFENLKPLFPESGSPMPMLDALVQQAFVGIDT 266

Query: 1109 VNTVFVSMNSKQQEQNKEAFSRLLHHIKSK 1198
            + TV  S +  ++EQ K    +LL HIK++
Sbjct: 267  ITTVANSYDMPKREQTKNMLLKLLFHIKNR 296


>ref|XP_002449554.1| hypothetical protein SORBIDRAFT_05g019010 [Sorghum bicolor]
            gi|241935397|gb|EES08542.1| hypothetical protein
            SORBIDRAFT_05g019010 [Sorghum bicolor]
          Length = 1197

 Score = 94.4 bits (233), Expect = 9e-17
 Identities = 70/270 (25%), Positives = 126/270 (46%), Gaps = 10/270 (3%)
 Frame = +2

Query: 419  SVEEISAEDFKQEAKVS-SKPKTTSDSGV----WMGDLLKYRVSSNYGHGFHNLAWAQAV 583
            S+EEISA+DF++++  +   P   + +G     W+G      ++ N+GH F++ AW+QAV
Sbjct: 34   SIEEISADDFRKDSSSALGGPAAAAAAGQRSRSWVGPPAVGYMARNFGHAFNSFAWSQAV 93

Query: 584  QNKPLDEILVXXXXXXXXXXXXXXXXXXXXXXXDNNNNKSISSAKEVCNXXXXXXXXXXX 763
            +NKPL                              ++   +  A +  +           
Sbjct: 94   RNKPLG-----------------------LQPPPASDEDEVEHAVDASDGEKEEGEIEEG 130

Query: 764  XXXXXNAVQXXXXXXXXXXXXIDMDSEMIGEEK-EGTCGELDSSGRNLKDEEASEKKIQS 940
                  AV+            ID+D++    EK E   G + +S    ++E   ++++ S
Sbjct: 131  -----EAVEAEASPARAQPETIDLDADADALEKSESLAGAVPASAAE-EEEVNLDQRVGS 184

Query: 941  IQEVLQTVTVKDAEKSLHGVCSQLCACLDSM----QEMVSDNCLSAIDALTQQAFTGFRA 1108
            I E L+ V++++AEKS  G C +L  C +++    QE+ + + ++ ++ L QQAF G   
Sbjct: 185  ILEELEMVSIEEAEKSFEGACGRLHTCFENLKPLFQELENGSPMAILEPLMQQAFIGIDT 244

Query: 1109 VNTVFVSMNSKQQEQNKEAFSRLLHHIKSK 1198
            + TV +S N  + EQNK    + L HIK++
Sbjct: 245  LTTVAISYNLPRSEQNKTTLLKSLFHIKNR 274


>ref|XP_006603006.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Glycine max]
          Length = 1257

 Score = 93.6 bits (231), Expect = 2e-16
 Identities = 88/274 (32%), Positives = 126/274 (45%), Gaps = 9/274 (3%)
 Frame = +2

Query: 413  TQSVEEISAEDF-KQEAKV---SSKPKTTSDSGVW-MGDLL-KY-RVSSNYGHGFHNLAW 571
            T SVEEISAEDF KQ+ KV   ++KP   SD+ VW + DL  KY  +   Y  G +NLAW
Sbjct: 33   TASVEEISAEDFNKQDVKVLNNNNKPNG-SDARVWAVHDLYSKYPTICRGYASGLYNLAW 91

Query: 572  AQAVQNKPLDEILVXXXXXXXXXXXXXXXXXXXXXXXDNNNNKSISSAKEVCNXXXXXXX 751
            AQAVQNKPL++I V                       ++N+N S   A    N       
Sbjct: 92   AQAVQNKPLNDIFV--------------MEVDSDANANSNSNNSNRLASVAVNPKDVV-- 135

Query: 752  XXXXXXXXXNAVQXXXXXXXXXXXXIDMDSEMIGEEKEGTCGELDSSGRNLKD--EEASE 925
                       V             ID D+E  GE +      + S    L D   + S 
Sbjct: 136  ----------VVDVDKEEGELEEGEIDADAEPEGEAESVVAVPVVSDSEKLDDVKRDVSN 185

Query: 926  KKIQSIQEVLQTVTVKDAEKSLHGVCSQLCACLDSMQEMVSDNCLSAIDALTQQAFTGFR 1105
             +   ++ VL+ VTV +  +S    CS+L    +++ E++S    S  D L + +F    
Sbjct: 186  SEQLGVRGVLEGVTVANVAESFAQTCSKL---QNALPEVLSRPADSERDDLVRLSFNATE 242

Query: 1106 AVNTVFVSMNSKQQEQNKEAFSRLLHHIKSKAPA 1207
             V +VF SM+S ++EQNK++  RLL  +K +  A
Sbjct: 243  VVYSVFCSMDSLKKEQNKDSILRLLSFVKDQQQA 276


>ref|XP_003530482.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Glycine max]
          Length = 1261

 Score = 91.7 bits (226), Expect = 6e-16
 Identities = 86/272 (31%), Positives = 124/272 (45%), Gaps = 7/272 (2%)
 Frame = +2

Query: 413  TQSVEEISAEDF-KQEAKV---SSKPKTTSDSGVW-MGDLL-KY-RVSSNYGHGFHNLAW 571
            T SVEEISAEDF KQ+ K+   ++KP   SD+ VW + DL  KY  +   Y  G +NLAW
Sbjct: 33   TASVEEISAEDFNKQDVKLLNNNNKPNG-SDARVWAVHDLYSKYPTICRGYASGLYNLAW 91

Query: 572  AQAVQNKPLDEILVXXXXXXXXXXXXXXXXXXXXXXXDNNNNKSISSAKEVCNXXXXXXX 751
            AQAVQNKPL++I V                       ++N N S   A    N       
Sbjct: 92   AQAVQNKPLNDIFV--------------MEVDSDANANSNRNSSHRLASVAVNPKDVV-- 135

Query: 752  XXXXXXXXXNAVQXXXXXXXXXXXXIDMDSEMIGEEKEGTCGELDSSGRNLKDEEASEKK 931
                       V             ID D+E  GE +       DS   +    + S+ +
Sbjct: 136  ----------VVDVDKEEGELEEGEIDADAEPEGEAESVVVAVSDSEKLDDVKMDVSDSE 185

Query: 932  IQSIQEVLQTVTVKDAEKSLHGVCSQLCACLDSMQEMVSDNCLSAIDALTQQAFTGFRAV 1111
                + VL+ VTV +  +S    CS+L    +++ E++S    S  D L + +F     V
Sbjct: 186  QLGARGVLEGVTVANVVESFAQTCSKL---QNTLPEVLSRPAGSEKDDLVRLSFNATEVV 242

Query: 1112 NTVFVSMNSKQQEQNKEAFSRLLHHIKSKAPA 1207
             +VF SM+S ++EQNK++  RLL  +K +  A
Sbjct: 243  YSVFCSMDSSEKEQNKDSILRLLSFVKDQQQA 274


>emb|CBI35661.3| unnamed protein product [Vitis vinifera]
          Length = 1184

 Score = 86.3 bits (212), Expect = 2e-14
 Identities = 55/134 (41%), Positives = 83/134 (61%), Gaps = 5/134 (3%)
 Frame = +2

Query: 827  IDMDSEMIGEEKEGTCGELDSSGRNLKDEEASEKKIQSIQEVLQTVTVKDAEKSLHGVCS 1006
            ID+DSE   +++ G   +++    +LK+ E  E+ ++SIQE L++VTV +AEKS  GVCS
Sbjct: 164  IDLDSEPDVKDEGGVL-DVNEPEIDLKERELVER-VKSIQEDLESVTVIEAEKSFSGVCS 221

Query: 1007 QLCACLDSMQEM-----VSDNCLSAIDALTQQAFTGFRAVNTVFVSMNSKQQEQNKEAFS 1171
            +L   L S+Q++     V ++ +   DAL QQ     RA+N VF SMNS Q+E NK+ FS
Sbjct: 222  RLQNTLGSLQKVFGEKVVGESSVPTKDALAQQLINAIRALNHVFCSMNSNQKELNKDVFS 281

Query: 1172 RLLHHIKSKAPALF 1213
            RLL  ++     +F
Sbjct: 282  RLLSCVECGDSPIF 295



 Score = 63.9 bits (154), Expect = 1e-07
 Identities = 43/113 (38%), Positives = 60/113 (53%), Gaps = 6/113 (5%)
 Frame = +2

Query: 293 MRAVGFNRFYLSPNEEKTLVFMAKXXXXXXXXXXXXXXXXTQSVEEISAEDF-KQEAKVS 469
           ++ + + RF L+P  +  L   +                 + SVEEIS EDF KQE +V 
Sbjct: 23  IQLLSYPRFGLTPLYKTPLSIASVSHRMGIEDVEEGEISDSASVEEISEEDFNKQEVRVL 82

Query: 470 SKPKTTSDSGVW----MGDLLKY-RVSSNYGHGFHNLAWAQAVQNKPLDEILV 613
            + K  +D+ VW    + DL KY +  S Y    +NLAWAQAVQNKPL++I V
Sbjct: 83  REAKPKADTRVWTMRDLQDLYKYHQACSGYTPRLYNLAWAQAVQNKPLNDIFV 135


>ref|XP_004492028.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like isoform X1 [Cicer arietinum]
          Length = 1247

 Score = 85.9 bits (211), Expect = 3e-14
 Identities = 81/278 (29%), Positives = 124/278 (44%), Gaps = 11/278 (3%)
 Frame = +2

Query: 413  TQSVEEISAEDFKQEAKV-------SSKPKTTSDSGVW-MGDLL-KY-RVSSNYGHGFHN 562
            T SV EIS EDF ++  V       S K KT  D+ VW + DL  KY  +   Y  G +N
Sbjct: 33   TASVVEISEEDFNKQDVVKVNNNSDSDKAKTGGDARVWAVHDLYSKYPTICRGYASGLYN 92

Query: 563  LAWAQAVQNKPLDEILVXXXXXXXXXXXXXXXXXXXXXXXDNNNNKSISSAKEVCNXXXX 742
            LAWAQAVQNKPL++I V                       D+++N + +S  +       
Sbjct: 93   LAWAQAVQNKPLNDIFV--------------------MELDSDSNANANSNND------- 125

Query: 743  XXXXXXXXXXXXNAVQXXXXXXXXXXXXIDMDSEMIGEEKEGTCGELDSSGRNLKDEEAS 922
                        N               +D D    GE +EG     D +G  +   + S
Sbjct: 126  -----------SNNGNGDLNMPLKEVVMVDDDEREEGELEEGEIDGDDDTGGVMVGGDGS 174

Query: 923  EKKIQS-IQEVLQTVTVKDAEKSLHGVCSQLCACLDSMQEMVSDNCLSAIDALTQQAFTG 1099
            E   +S I++ L+ VTV +  +S     S+L   L S  +++S   +S  D + +  +  
Sbjct: 175  ETVSESDIRDFLEGVTVANVAESFAETISRLLRVLQS--KLLSGPAVSEKDYVIRLLYNA 232

Query: 1100 FRAVNTVFVSMNSKQQEQNKEAFSRLLHHIKSKAPALF 1213
               V++VF SM++ Q+E NK+   RLL+ +K++   LF
Sbjct: 233  IEIVHSVFCSMDNLQKEDNKDNIIRLLYFLKNEHTQLF 270

