BLASTX nr result

ID: Zingiber23_contig00022597 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber23_contig00022597
         (1445 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004962192.1| PREDICTED: RNA polymerase II C-terminal doma...   484   e-134
ref|XP_002437361.1| hypothetical protein SORBIDRAFT_10g025580 [S...   482   e-133
ref|XP_006654357.1| PREDICTED: RNA polymerase II C-terminal doma...   481   e-133
ref|XP_003566293.1| PREDICTED: RNA polymerase II C-terminal doma...   480   e-133
ref|NP_001055440.1| Os05g0390500 [Oryza sativa Japonica Group] g...   476   e-132
ref|XP_003579679.1| PREDICTED: RNA polymerase II C-terminal doma...   476   e-131
gb|EEC79156.1| hypothetical protein OsI_19829 [Oryza sativa Indi...   476   e-131
gb|EMS57931.1| RNA polymerase II C-terminal domain phosphatase-l...   475   e-131
gb|AFW77884.1| CPL3 [Zea mays]                                        475   e-131
ref|NP_001152445.1| CPL3 [Zea mays] gi|195656359|gb|ACG47647.1| ...   474   e-131
dbj|BAK07377.1| predicted protein [Hordeum vulgare subsp. vulgare]    474   e-131
gb|EMT13574.1| RNA polymerase II C-terminal domain phosphatase-l...   462   e-127
ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Popu...   448   e-123
ref|XP_002439741.1| hypothetical protein SORBIDRAFT_09g019310 [S...   445   e-122
gb|EOY32064.1| RNA polymerase II ctd phosphatase, putative isofo...   444   e-122
ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative ...   443   e-121
ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal doma...   438   e-120
ref|XP_002324546.2| hypothetical protein POPTR_0018s11760g [Popu...   436   e-119
gb|EOY32065.1| RNA polymerase II ctd phosphatase, putative isofo...   436   e-119
ref|XP_006838087.1| hypothetical protein AMTR_s00106p00017820 [A...   429   e-117

>ref|XP_004962192.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Setaria italica]
          Length = 543

 Score =  484 bits (1245), Expect = e-134
 Identities = 240/371 (64%), Positives = 287/371 (77%), Gaps = 5/371 (1%)
 Frame = +3

Query: 18   KNV---ICPPHPGFFKGLCIRCG--QIEEDESGVAFGYIHKDLRLGNVEMARLRGADXXX 182
            KNV   +CP HPG+F GLC RCG  Q EED SGVAFGYIHK LRLG  E+ RLRGAD   
Sbjct: 98   KNVQVEVCP-HPGYFGGLCFRCGKPQDEEDASGVAFGYIHKGLRLGTSEIDRLRGADLKN 156

Query: 183  XXXXXXXXXXXXXXXXXXNSTRLVDVISDEEYLLRQIDGMKDDPDRSLFRLDSMHMLTKL 362
                              NST+L D+ S E  L  +   +KDDPDRS+F LDSM MLTKL
Sbjct: 157  LLRERKLVLILDLDHTLINSTKLQDISSAENELGIRTAALKDDPDRSIFSLDSMQMLTKL 216

Query: 363  RPFVNNFLKEASSLFEMYIYTMAERSYAIEIAKLLDPGKVYFDSKVITQADCTQRHQKGL 542
            RPFV NFLKEAS++FEMYIYTM +++YAIEIAKLLDP  VYF SKVI+ +DCTQRHQKGL
Sbjct: 217  RPFVRNFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPSNVYFPSKVISNSDCTQRHQKGL 276

Query: 543  DVVLGAESLVVILDDTEVVWQRHKDNLIQMERYHFFASSCRQFGFNAKSLSELMKDERES 722
            DV+LGAES+ VILDDTE VWQ+HK+NLI MERYH+FASSCRQFGF  KSLSE M+DERES
Sbjct: 277  DVILGAESVAVILDDTEYVWQKHKENLILMERYHYFASSCRQFGFGVKSLSESMQDERES 336

Query: 723  DGALATVLKILKRTHQMFFDPALGSDVSTRDVRPLLKGIRREILQGCKIVFSRVFPSNSL 902
            DGALATVL +LKR H +FFD A+ + +S+RDVR ++K +R+E+L+GCK+VFSRVFP+ S 
Sbjct: 337  DGALATVLDVLKRIHTIFFDTAVETALSSRDVRQVIKTVRKEVLEGCKLVFSRVFPNTSR 396

Query: 903  AQDQPIWKMAEQLGATCCTEVDPSVTHVISTDTGTQKAHWATQNKKFLVNPHWIEASNFL 1082
             Q+Q +WKMAE LGA C T+VD +VTHV++ D GT+KA WA +NKKFLV+P WIEA+NF 
Sbjct: 397  PQEQMMWKMAEHLGAVCSTDVDSTVTHVVAVDLGTEKARWAVKNKKFLVHPRWIEAANFR 456

Query: 1083 WQRQKEENFSI 1115
            W RQ EE+F +
Sbjct: 457  WHRQPEEDFPV 467


>ref|XP_002437361.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor]
            gi|241915584|gb|EER88728.1| hypothetical protein
            SORBIDRAFT_10g025580 [Sorghum bicolor]
          Length = 558

 Score =  482 bits (1240), Expect = e-133
 Identities = 236/363 (65%), Positives = 282/363 (77%), Gaps = 2/363 (0%)
 Frame = +3

Query: 36   PHPGFFKGLCIRCG--QIEEDESGVAFGYIHKDLRLGNVEMARLRGADXXXXXXXXXXXX 209
            PHPG+F GLC RCG  Q EE+ SGVAFGYIHK LRLG  E+ RLRGAD            
Sbjct: 108  PHPGYFGGLCFRCGKPQDEENVSGVAFGYIHKGLRLGTSEIDRLRGADLKNLLRERKLVL 167

Query: 210  XXXXXXXXXNSTRLVDVISDEEYLLRQIDGMKDDPDRSLFRLDSMHMLTKLRPFVNNFLK 389
                     NST+L D+ S E+ L  Q    KDDP+RS+F LDSM MLTKLRPFV  FLK
Sbjct: 168  ILDLDHTLINSTKLQDISSAEKDLGIQTAASKDDPNRSIFSLDSMQMLTKLRPFVREFLK 227

Query: 390  EASSLFEMYIYTMAERSYAIEIAKLLDPGKVYFDSKVITQADCTQRHQKGLDVVLGAESL 569
            EAS++FEMYIYTM +++YAIEIAKLLDP  +YF SKVI+ +DCTQRHQKGLDV+LGAES+
Sbjct: 228  EASNMFEMYIYTMGDKAYAIEIAKLLDPSNIYFPSKVISNSDCTQRHQKGLDVILGAESV 287

Query: 570  VVILDDTEVVWQRHKDNLIQMERYHFFASSCRQFGFNAKSLSELMKDERESDGALATVLK 749
             VILDDTE VWQ+HK+NLI MERYHFFASSCRQFGF  +SLSE M+DERESDGALATVL 
Sbjct: 288  AVILDDTEYVWQKHKENLILMERYHFFASSCRQFGFGVRSLSESMQDERESDGALATVLD 347

Query: 750  ILKRTHQMFFDPALGSDVSTRDVRPLLKGIRREILQGCKIVFSRVFPSNSLAQDQPIWKM 929
            +LKR H +FFD A+ +D+S++DVR ++K +R+EILQGCKIVFSRVFP+N+  Q+Q +WKM
Sbjct: 348  VLKRIHSIFFDLAVETDLSSQDVRQVIKAVRKEILQGCKIVFSRVFPNNTRPQEQMLWKM 407

Query: 930  AEQLGATCCTEVDPSVTHVISTDTGTQKAHWATQNKKFLVNPHWIEASNFLWQRQKEENF 1109
            AE LGA C T+VD SVTHV++ D GT+KA W   NKKFLV+P WIEA+NF W RQ EE+F
Sbjct: 408  AEHLGAVCSTDVDSSVTHVVTVDLGTEKARWGVANKKFLVHPRWIEAANFRWHRQPEEDF 467

Query: 1110 SIS 1118
             ++
Sbjct: 468  PVT 470


>ref|XP_006654357.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Oryza brachyantha]
          Length = 557

 Score =  481 bits (1238), Expect = e-133
 Identities = 235/366 (64%), Positives = 282/366 (77%), Gaps = 2/366 (0%)
 Frame = +3

Query: 27   ICPPHPGFFKGLCIRCG--QIEEDESGVAFGYIHKDLRLGNVEMARLRGADXXXXXXXXX 200
            ICPPHPGFF GLC +CG  Q EED  GVAFGYIHK L LG  E+ RLRGAD         
Sbjct: 110  ICPPHPGFFGGLCFKCGKKQDEEDVPGVAFGYIHKGLTLGTSEIDRLRGADLKNLLRERR 169

Query: 201  XXXXXXXXXXXXNSTRLVDVISDEEYLLRQIDGMKDDPDRSLFRLDSMHMLTKLRPFVNN 380
                        NST+L+D+ + E  L  Q    KDDP+RSLFRLD+M MLTKLRPFV  
Sbjct: 170  LVLILDLDHTLINSTKLLDLSAAENELGIQSAASKDDPNRSLFRLDAMQMLTKLRPFVRE 229

Query: 381  FLKEASSLFEMYIYTMAERSYAIEIAKLLDPGKVYFDSKVITQADCTQRHQKGLDVVLGA 560
            FLKEAS++FEMYIYTM +++YAIEIAKLLDP  VYF S VI+ +DCTQRHQKGLDV+LGA
Sbjct: 230  FLKEASNMFEMYIYTMGDKAYAIEIAKLLDPENVYFGSNVISNSDCTQRHQKGLDVILGA 289

Query: 561  ESLVVILDDTEVVWQRHKDNLIQMERYHFFASSCRQFGFNAKSLSELMKDERESDGALAT 740
            ESL VILDDTE VWQ+HK+NLI MERYH+FASSCRQFGF+A+SLSE M+DERE DGALAT
Sbjct: 290  ESLAVILDDTEYVWQKHKENLILMERYHYFASSCRQFGFSARSLSESMQDEREGDGALAT 349

Query: 741  VLKILKRTHQMFFDPALGSDVSTRDVRPLLKGIRREILQGCKIVFSRVFPSNSLAQDQPI 920
            +L IL+R H +FFD A+ + + +RDVR ++K +R+EIL GCK+VF+RVFP +   QDQ +
Sbjct: 350  ILDILRRIHSIFFDSAVQNPLPSRDVRQVIKRVRQEILDGCKLVFTRVFPLHQRPQDQML 409

Query: 921  WKMAEQLGATCCTEVDPSVTHVISTDTGTQKAHWATQNKKFLVNPHWIEASNFLWQRQKE 1100
            WKMAEQLGA CCT+VD  VTHV++ D GT+KA WA  NKKFLV+P WIEA+NF W RQ+E
Sbjct: 410  WKMAEQLGAVCCTDVDSMVTHVVALDLGTEKARWAVGNKKFLVHPRWIEAANFRWHRQQE 469

Query: 1101 ENFSIS 1118
            E+F ++
Sbjct: 470  EDFPVA 475


>ref|XP_003566293.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Brachypodium distachyon]
          Length = 492

 Score =  480 bits (1236), Expect = e-133
 Identities = 233/366 (63%), Positives = 287/366 (78%), Gaps = 2/366 (0%)
 Frame = +3

Query: 27   ICPPHPGFFKGLCIRCGQI--EEDESGVAFGYIHKDLRLGNVEMARLRGADXXXXXXXXX 200
            ICPPHPGF +GLCI+CG+I  EED  GVA GYIH+ LRLG  E+ RLRG+D         
Sbjct: 105  ICPPHPGFLRGLCIKCGKIQDEEDVPGVACGYIHEGLRLGTSEIERLRGSDLKKLLRERK 164

Query: 201  XXXXXXXXXXXXNSTRLVDVISDEEYLLRQIDGMKDDPDRSLFRLDSMHMLTKLRPFVNN 380
                        NSTRL D+ + E  L  Q   +KDDPDRSLF L+ MHMLTKLRPFV  
Sbjct: 165  LVLILDLDHTLINSTRLHDISAAEMDLGIQTAALKDDPDRSLFTLERMHMLTKLRPFVRR 224

Query: 381  FLKEASSLFEMYIYTMAERSYAIEIAKLLDPGKVYFDSKVITQADCTQRHQKGLDVVLGA 560
            FLKEAS++FEMYIYTM +++Y+IE+AKLLDPG VYF SKVI+ +DCTQRHQKGLDVVLGA
Sbjct: 225  FLKEASNMFEMYIYTMGDKAYSIEVAKLLDPGNVYFGSKVISNSDCTQRHQKGLDVVLGA 284

Query: 561  ESLVVILDDTEVVWQRHKDNLIQMERYHFFASSCRQFGFNAKSLSELMKDERESDGALAT 740
            ES+ VILDDTE VWQ+HK+NLI MERYH+FASSCRQFGF+ +SLSELM DERESDGAL+T
Sbjct: 285  ESIAVILDDTEDVWQKHKENLILMERYHYFASSCRQFGFSVRSLSELMVDERESDGALST 344

Query: 741  VLKILKRTHQMFFDPALGSDVSTRDVRPLLKGIRREILQGCKIVFSRVFPSNSLAQDQPI 920
            +L +LKR H +FFD  + + +S+R +  ++K +R+E+LQGCK+VFSRVFPSNS  QDQ I
Sbjct: 345  ILDVLKRIHTIFFDSGVETALSSRTLM-VIKRVRQEVLQGCKLVFSRVFPSNSCPQDQII 403

Query: 921  WKMAEQLGATCCTEVDPSVTHVISTDTGTQKAHWATQNKKFLVNPHWIEASNFLWQRQKE 1100
            WKMAE+LGA+CC  VD +VTHV++ D GT+KA WA +NKKFL++P WIEASN+ W+RQ E
Sbjct: 404  WKMAEKLGASCCAHVDSTVTHVVAVDVGTEKARWAVENKKFLLHPRWIEASNYRWRRQPE 463

Query: 1101 ENFSIS 1118
            E+F ++
Sbjct: 464  EDFPVA 469


>ref|NP_001055440.1| Os05g0390500 [Oryza sativa Japonica Group] gi|57863785|gb|AAS86390.2|
            unknown protein [Oryza sativa Japonica Group]
            gi|113578991|dbj|BAF17354.1| Os05g0390500 [Oryza sativa
            Japonica Group] gi|215695102|dbj|BAG90293.1| unnamed
            protein product [Oryza sativa Japonica Group]
            gi|222631469|gb|EEE63601.1| hypothetical protein
            OsJ_18418 [Oryza sativa Japonica Group]
          Length = 536

 Score =  476 bits (1226), Expect = e-132
 Identities = 234/365 (64%), Positives = 279/365 (76%), Gaps = 2/365 (0%)
 Frame = +3

Query: 30   CPPHPGFFKGLCIRCG--QIEEDESGVAFGYIHKDLRLGNVEMARLRGADXXXXXXXXXX 203
            CPPHPGFF GLC RCG  Q EED  GVAFGYIHK LRLG  E+ RLRGAD          
Sbjct: 112  CPPHPGFFGGLCYRCGKRQDEEDVPGVAFGYIHKGLRLGTTEIDRLRGADLKNLLRERKL 171

Query: 204  XXXXXXXXXXXNSTRLVDVISDEEYLLRQIDGMKDDPDRSLFRLDSMHMLTKLRPFVNNF 383
                       NST+L D+ + E  L  Q    +  PDRSLF L++M MLTKLRPFV  F
Sbjct: 172  VLILDLDHTLINSTKLFDLSAAENELGIQSAAKEVVPDRSLFTLETMQMLTKLRPFVRRF 231

Query: 384  LKEASSLFEMYIYTMAERSYAIEIAKLLDPGKVYFDSKVITQADCTQRHQKGLDVVLGAE 563
            LKEAS +FEMYIYTM +++YAIEIAKLLDP  VYF SKVI+ +DCTQRHQKGLDVVLG E
Sbjct: 232  LKEASDMFEMYIYTMGDKAYAIEIAKLLDPDNVYFGSKVISNSDCTQRHQKGLDVVLGDE 291

Query: 564  SLVVILDDTEVVWQRHKDNLIQMERYHFFASSCRQFGFNAKSLSELMKDERESDGALATV 743
            S+ VILDDTE VWQ+HK+NLI MERYH+FASSCRQFGF A+SLSE M+DERE+DGALAT+
Sbjct: 292  SVAVILDDTEYVWQKHKENLILMERYHYFASSCRQFGFGARSLSETMQDERENDGALATI 351

Query: 744  LKILKRTHQMFFDPALGSDVSTRDVRPLLKGIRREILQGCKIVFSRVFPSNSLAQDQPIW 923
            L +L+R H +FFDP     +S+RDVR ++K +R+E+LQGCK+VF+RVFP +   QDQ IW
Sbjct: 352  LDVLERIHTIFFDPDDQKPLSSRDVRQVIKRVRQEVLQGCKLVFTRVFPLHQRQQDQMIW 411

Query: 924  KMAEQLGATCCTEVDPSVTHVISTDTGTQKAHWATQNKKFLVNPHWIEASNFLWQRQKEE 1103
            KMAEQLGA CCT+VD +VTHV++ D GT+KA WA  NKKFLV+P WIEA+NF WQRQ+EE
Sbjct: 412  KMAEQLGAVCCTDVDSTVTHVVALDLGTEKARWAVSNKKFLVHPRWIEAANFRWQRQQEE 471

Query: 1104 NFSIS 1118
            +F ++
Sbjct: 472  DFPVA 476


>ref|XP_003579679.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Brachypodium distachyon]
          Length = 493

 Score =  476 bits (1225), Expect = e-131
 Identities = 232/365 (63%), Positives = 283/365 (77%), Gaps = 2/365 (0%)
 Frame = +3

Query: 27   ICPPHPGFFKGLCIRCG--QIEEDESGVAFGYIHKDLRLGNVEMARLRGADXXXXXXXXX 200
            ICPPHPGFF GLC RCG  Q EED  GVAFGYIHK LRLG  E+ RLRG++         
Sbjct: 104  ICPPHPGFFGGLCFRCGKRQDEEDVPGVAFGYIHKGLRLGTSEIDRLRGSNVKSLLRERK 163

Query: 201  XXXXXXXXXXXXNSTRLVDVISDEEYLLRQIDGMKDDPDRSLFRLDSMHMLTKLRPFVNN 380
                        NST+L D+ + E  L  Q    +D P++SLF L++M MLTKLRPFV  
Sbjct: 164  LVLILDLDHTLINSTKLHDISAAERDLGIQTFASEDAPEKSLFTLEAMQMLTKLRPFVCK 223

Query: 381  FLKEASSLFEMYIYTMAERSYAIEIAKLLDPGKVYFDSKVITQADCTQRHQKGLDVVLGA 560
            FLKEAS++FEMYIYTM +++YAIEIAKLLDPG VYF SKVI+ +DCTQRHQKGLDVVLGA
Sbjct: 224  FLKEASNMFEMYIYTMGDKAYAIEIAKLLDPGNVYFGSKVISNSDCTQRHQKGLDVVLGA 283

Query: 561  ESLVVILDDTEVVWQRHKDNLIQMERYHFFASSCRQFGFNAKSLSELMKDERESDGALAT 740
            E++ +ILDDTE VWQ+HK+NLI MERYH+FASSCRQFGF+ K+LSE M+DERESDGALAT
Sbjct: 284  ENVAIILDDTEYVWQKHKENLILMERYHYFASSCRQFGFSVKALSESMQDERESDGALAT 343

Query: 741  VLKILKRTHQMFFDPALGSDVSTRDVRPLLKGIRREILQGCKIVFSRVFPSNSLAQDQPI 920
             L +LKR H +FFD A+ + +S+RDVR ++K +R+E+LQGCK+VFSRVFPS+S  QDQ I
Sbjct: 344  TLDVLKRIHTLFFDSAVETALSSRDVRQVIKKVRQEVLQGCKVVFSRVFPSSSRPQDQII 403

Query: 921  WKMAEQLGATCCTEVDPSVTHVISTDTGTQKAHWATQNKKFLVNPHWIEASNFLWQRQKE 1100
            WKMAEQLGA CC ++D +VTHV++ D+GT+KA WA  N K LV+P WIEASNF W RQ+E
Sbjct: 404  WKMAEQLGAICCADMDSTVTHVVAVDSGTEKARWAVGNNKILVHPRWIEASNFRWHRQQE 463

Query: 1101 ENFSI 1115
            E+F +
Sbjct: 464  EDFPV 468


>gb|EEC79156.1| hypothetical protein OsI_19829 [Oryza sativa Indica Group]
          Length = 574

 Score =  476 bits (1224), Expect = e-131
 Identities = 233/365 (63%), Positives = 279/365 (76%), Gaps = 2/365 (0%)
 Frame = +3

Query: 30   CPPHPGFFKGLCIRCG--QIEEDESGVAFGYIHKDLRLGNVEMARLRGADXXXXXXXXXX 203
            CPPHPGFF GLC RCG  Q EED  GVAFGYIHK LRLG  E+ RLRGAD          
Sbjct: 138  CPPHPGFFGGLCYRCGKRQDEEDVPGVAFGYIHKGLRLGTTEIDRLRGADLKNLLRERKL 197

Query: 204  XXXXXXXXXXXNSTRLVDVISDEEYLLRQIDGMKDDPDRSLFRLDSMHMLTKLRPFVNNF 383
                       NST+L D+ + E  L  Q    +  PDRSLF L++M MLTKLRPFV  F
Sbjct: 198  VLILDLDHTLINSTKLFDLSAAENELGIQSAAKEVVPDRSLFTLETMQMLTKLRPFVRRF 257

Query: 384  LKEASSLFEMYIYTMAERSYAIEIAKLLDPGKVYFDSKVITQADCTQRHQKGLDVVLGAE 563
            LKEAS +FEMYIYTM +++YAIEIAKLLDP  VYF SKVI+ +DCTQRHQKGLDVVLG E
Sbjct: 258  LKEASDMFEMYIYTMGDKAYAIEIAKLLDPDNVYFGSKVISNSDCTQRHQKGLDVVLGDE 317

Query: 564  SLVVILDDTEVVWQRHKDNLIQMERYHFFASSCRQFGFNAKSLSELMKDERESDGALATV 743
            S+ VILDDTE VWQ+HK+NLI MERYH+FASSCRQFGF A+SLSE M+DERE+DGALAT+
Sbjct: 318  SVAVILDDTEYVWQKHKENLILMERYHYFASSCRQFGFGARSLSETMQDERENDGALATI 377

Query: 744  LKILKRTHQMFFDPALGSDVSTRDVRPLLKGIRREILQGCKIVFSRVFPSNSLAQDQPIW 923
            L +L+R H +FFDP     +S+RDVR ++K +R+E+LQGCK+VF+RVFP +   QDQ +W
Sbjct: 378  LDVLERIHTIFFDPDDQKPLSSRDVRQVIKRVRQEVLQGCKLVFTRVFPLHQRQQDQMLW 437

Query: 924  KMAEQLGATCCTEVDPSVTHVISTDTGTQKAHWATQNKKFLVNPHWIEASNFLWQRQKEE 1103
            KMAEQLGA CCT+VD +VTHV++ D GT+KA WA  NKKFLV+P WIEA+NF WQRQ+EE
Sbjct: 438  KMAEQLGAVCCTDVDSTVTHVVALDLGTEKARWAVSNKKFLVHPRWIEAANFRWQRQQEE 497

Query: 1104 NFSIS 1118
            +F ++
Sbjct: 498  DFPVA 502


>gb|EMS57931.1| RNA polymerase II C-terminal domain phosphatase-like 4 [Triticum
            urartu]
          Length = 589

 Score =  475 bits (1223), Expect = e-131
 Identities = 232/365 (63%), Positives = 279/365 (76%), Gaps = 2/365 (0%)
 Frame = +3

Query: 27   ICPPHPGFFKGLCIRCG--QIEEDESGVAFGYIHKDLRLGNVEMARLRGADXXXXXXXXX 200
            ICPPHPG+F GLC RCG  Q EED  GVAFGY+HK LRLG  E+ RLRG+D         
Sbjct: 154  ICPPHPGYFGGLCFRCGKRQDEEDVPGVAFGYVHKGLRLGTTEIDRLRGSDLKNLLRERK 213

Query: 201  XXXXXXXXXXXXNSTRLVDVISDEEYLLRQIDGMKDDPDRSLFRLDSMHMLTKLRPFVNN 380
                        NST+L D+ + E  L  Q    KDDP+ SLF L+ M MLTKLRPFV  
Sbjct: 214  LILILDLDHTLINSTKLHDISAAENNLGIQTAASKDDPNGSLFTLEGMQMLTKLRPFVRK 273

Query: 381  FLKEASSLFEMYIYTMAERSYAIEIAKLLDPGKVYFDSKVITQADCTQRHQKGLDVVLGA 560
            FLKEAS++FEMYIYTM +++YAIEIAKLLDP  VYF+SKVI+ +DCTQRHQKGLD+VLGA
Sbjct: 274  FLKEASNMFEMYIYTMGDKAYAIEIAKLLDPRNVYFNSKVISNSDCTQRHQKGLDMVLGA 333

Query: 561  ESLVVILDDTEVVWQRHKDNLIQMERYHFFASSCRQFGFNAKSLSELMKDERESDGALAT 740
            ES+ VILDDTE VWQ+HK+NLI MERYH+FASSCRQFGF+ KSLSE M+DER SDGALAT
Sbjct: 334  ESVAVILDDTEYVWQKHKENLILMERYHYFASSCRQFGFSVKSLSEFMQDERGSDGALAT 393

Query: 741  VLKILKRTHQMFFDPALGSDVSTRDVRPLLKGIRREILQGCKIVFSRVFPSNSLAQDQPI 920
            +L +LKR H +FFD A+ + +S+RDVR ++K +R+E+LQGCK+VFSRVFPS+S  QDQ I
Sbjct: 394  ILDVLKRIHTIFFDSAVETALSSRDVRQVIKRVRQEVLQGCKLVFSRVFPSSSRPQDQFI 453

Query: 921  WKMAEQLGATCCTEVDPSVTHVISTDTGTQKAHWATQNKKFLVNPHWIEASNFLWQRQKE 1100
            WKMAEQLGA C  +VD ++THV++ D GT KA WA  N K LV+P WIEASNF W RQ+E
Sbjct: 454  WKMAEQLGAICSADVDSTITHVVAVDVGTDKARWAVNNNKILVHPRWIEASNFRWHRQQE 513

Query: 1101 ENFSI 1115
            E+F +
Sbjct: 514  EDFPV 518


>gb|AFW77884.1| CPL3 [Zea mays]
          Length = 533

 Score =  475 bits (1223), Expect = e-131
 Identities = 235/363 (64%), Positives = 278/363 (76%), Gaps = 2/363 (0%)
 Frame = +3

Query: 36   PHPGFFKGLCIRCG--QIEEDESGVAFGYIHKDLRLGNVEMARLRGADXXXXXXXXXXXX 209
            PHPG F GLCI CG  Q EED SGVAFGYIHK LRLG  E+ RLRGAD            
Sbjct: 106  PHPGHFGGLCIICGKPQDEEDVSGVAFGYIHKGLRLGTSEIDRLRGADLKNLLRERKLVL 165

Query: 210  XXXXXXXXXNSTRLVDVISDEEYLLRQIDGMKDDPDRSLFRLDSMHMLTKLRPFVNNFLK 389
                     NST+L D+ S E+ L  Q    KDDP+RS+F LD M MLTKLRPFV  FLK
Sbjct: 166  ILDLDHTLINSTKLQDISSAEKDLGIQSAASKDDPNRSIFALDLMPMLTKLRPFVREFLK 225

Query: 390  EASSLFEMYIYTMAERSYAIEIAKLLDPGKVYFDSKVITQADCTQRHQKGLDVVLGAESL 569
            EAS++FEMYIYTM +++YAIEIAKLLDP  +YF SKVI+ +DCTQRHQKGLDV+LGAES+
Sbjct: 226  EASNMFEMYIYTMGDKAYAIEIAKLLDPSNIYFPSKVISNSDCTQRHQKGLDVILGAESV 285

Query: 570  VVILDDTEVVWQRHKDNLIQMERYHFFASSCRQFGFNAKSLSELMKDERESDGALATVLK 749
             VILDDTE VWQ+HK+NLI MERYHFFASSCRQFGF  +SLSE ++DERESDGALATVL 
Sbjct: 286  AVILDDTEYVWQKHKENLILMERYHFFASSCRQFGFGVRSLSESLQDERESDGALATVLD 345

Query: 750  ILKRTHQMFFDPALGSDVSTRDVRPLLKGIRREILQGCKIVFSRVFPSNSLAQDQPIWKM 929
            +LKR H  FFD A  +D+S+RD+R ++K +R+EILQGCKIVFSRVFP+N+  Q+Q +WKM
Sbjct: 346  VLKRIHATFFDMAAETDLSSRDIRQVIKTLRKEILQGCKIVFSRVFPNNTRPQEQMVWKM 405

Query: 930  AEQLGATCCTEVDPSVTHVISTDTGTQKAHWATQNKKFLVNPHWIEASNFLWQRQKEENF 1109
            AE LGA C  +VDPSVTHV++ D GT+KA W   NKKFLV+P WIEA+NF W RQ EE+F
Sbjct: 406  AEYLGAVCVKDVDPSVTHVVTVDLGTEKARWGLNNKKFLVHPRWIEAANFRWHRQPEEDF 465

Query: 1110 SIS 1118
             ++
Sbjct: 466  PVT 468


>ref|NP_001152445.1| CPL3 [Zea mays] gi|195656359|gb|ACG47647.1| CPL3 [Zea mays]
          Length = 531

 Score =  474 bits (1220), Expect = e-131
 Identities = 234/363 (64%), Positives = 278/363 (76%), Gaps = 2/363 (0%)
 Frame = +3

Query: 36   PHPGFFKGLCIRCG--QIEEDESGVAFGYIHKDLRLGNVEMARLRGADXXXXXXXXXXXX 209
            PHPG F GLCI CG  Q EED SGVAFGYIHK LRLG  E+ RLRGAD            
Sbjct: 104  PHPGHFGGLCIICGKPQDEEDVSGVAFGYIHKGLRLGTSEIDRLRGADLKNLLRERKLVL 163

Query: 210  XXXXXXXXXNSTRLVDVISDEEYLLRQIDGMKDDPDRSLFRLDSMHMLTKLRPFVNNFLK 389
                     NST+L D+ S E+ L  Q    KDDP+RS+F LD M MLTKLRPFV  FLK
Sbjct: 164  ILDLDHTLINSTKLQDISSAEKDLGIQSAASKDDPNRSIFALDLMPMLTKLRPFVREFLK 223

Query: 390  EASSLFEMYIYTMAERSYAIEIAKLLDPGKVYFDSKVITQADCTQRHQKGLDVVLGAESL 569
            EAS++FEMYIYTM +++YAIEIAKLLDP  +YF SKVI+ +DCTQRHQKGLDV+LGAES+
Sbjct: 224  EASNMFEMYIYTMGDKAYAIEIAKLLDPSNIYFPSKVISNSDCTQRHQKGLDVILGAESV 283

Query: 570  VVILDDTEVVWQRHKDNLIQMERYHFFASSCRQFGFNAKSLSELMKDERESDGALATVLK 749
             VILDDTE VWQ+HK+NLI MERYHFFASSCRQFGF  +SLSE ++DERESDGALATVL 
Sbjct: 284  AVILDDTEYVWQKHKENLILMERYHFFASSCRQFGFGVRSLSESLQDERESDGALATVLD 343

Query: 750  ILKRTHQMFFDPALGSDVSTRDVRPLLKGIRREILQGCKIVFSRVFPSNSLAQDQPIWKM 929
            +LKR H  FFD A  +D+S+RD+R ++K +R+EILQGCKIVFSRVFP+N+  Q+Q +WKM
Sbjct: 344  VLKRIHATFFDMAAETDLSSRDIRQVIKTLRKEILQGCKIVFSRVFPNNTRPQEQMVWKM 403

Query: 930  AEQLGATCCTEVDPSVTHVISTDTGTQKAHWATQNKKFLVNPHWIEASNFLWQRQKEENF 1109
            AE LGA C  +VDPSVTHV++ D GT+K+ W   NKKFLV+P WIEA+NF W RQ EE+F
Sbjct: 404  AEYLGAVCVKDVDPSVTHVVTVDLGTEKSRWGLNNKKFLVHPRWIEAANFRWHRQPEEDF 463

Query: 1110 SIS 1118
             ++
Sbjct: 464  PVT 466


>dbj|BAK07377.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 488

 Score =  474 bits (1219), Expect = e-131
 Identities = 233/369 (63%), Positives = 282/369 (76%), Gaps = 4/369 (1%)
 Frame = +3

Query: 30   CPPHPGFFKGLCIRCG--QIEEDESGVAFGYIHKDLRLGNVEMARLRGADXXXXXXXXXX 203
            CPPHPGFF GLCI CG  Q EED  GVAFGYIHK LRLG  EM RLR ++          
Sbjct: 104  CPPHPGFFGGLCINCGKSQDEEDVPGVAFGYIHKGLRLGTSEMDRLRESEVKNLLRERKL 163

Query: 204  XXXXXXXXXXXNSTRLVDVISDEEYLLRQIDGMK--DDPDRSLFRLDSMHMLTKLRPFVN 377
                       NSTRL D+ + E  L  Q    K  DDP+RSLF L  MHMLTKLRPFV 
Sbjct: 164  VLILDLDHTLINSTRLHDISAAEMDLGIQTAASKNADDPERSLFTLQGMHMLTKLRPFVR 223

Query: 378  NFLKEASSLFEMYIYTMAERSYAIEIAKLLDPGKVYFDSKVITQADCTQRHQKGLDVVLG 557
             FL+EAS++F+MYIYTM +++YAIEIAKLLDPG VYFDSKVI+ +DCTQRHQKGLDVVLG
Sbjct: 224  KFLEEASNMFDMYIYTMGDKAYAIEIAKLLDPGNVYFDSKVISNSDCTQRHQKGLDVVLG 283

Query: 558  AESLVVILDDTEVVWQRHKDNLIQMERYHFFASSCRQFGFNAKSLSELMKDERESDGALA 737
             + + VI+DDTE VWQ+HK+NLI MERYH+FA+SCRQFGF+ +SLSELM+DERESDGALA
Sbjct: 284  DDKVAVIIDDTEHVWQKHKENLILMERYHYFAASCRQFGFSDQSLSELMQDERESDGALA 343

Query: 738  TVLKILKRTHQMFFDPALGSDVSTRDVRPLLKGIRREILQGCKIVFSRVFPSNSLAQDQP 917
            T+L +LKR H +FFD  + + +S+RDVR ++K +R+E+LQGCK+VFSRVFPS+  +QDQ 
Sbjct: 344  TILDVLKRIHTIFFDSGVETALSSRDVRQVIKRVRQEVLQGCKLVFSRVFPSDCRSQDQI 403

Query: 918  IWKMAEQLGATCCTEVDPSVTHVISTDTGTQKAHWATQNKKFLVNPHWIEASNFLWQRQK 1097
            +WKMAEQLGA CC+EVDPSVTHV++   GT+KA WA  NKKFL++P WIEA N+ W RQ 
Sbjct: 404  MWKMAEQLGAVCCSEVDPSVTHVVAVHAGTEKARWAAGNKKFLLHPRWIEACNYRWHRQP 463

Query: 1098 EENFSISNL 1124
            EE+F +  L
Sbjct: 464  EEDFPVPGL 472


>gb|EMT13574.1| RNA polymerase II C-terminal domain phosphatase-like protein 4
            [Aegilops tauschii]
          Length = 632

 Score =  462 bits (1189), Expect = e-127
 Identities = 235/401 (58%), Positives = 283/401 (70%), Gaps = 38/401 (9%)
 Frame = +3

Query: 27   ICPPHPGFFKGLCIRCG--QIEEDESGVAFGYIHKDLRLGNVEMARLRGADXXXXXXXXX 200
            ICPPHPG+F GLC RCG  Q EED  GVAFGY+HK LRLG  E+ RLRG+D         
Sbjct: 159  ICPPHPGYFGGLCFRCGKRQDEEDVPGVAFGYVHKGLRLGTTEIDRLRGSDLKNLLREKK 218

Query: 201  XXXXXXXXXXXXNSTRLVDVISDEEYLLRQIDGMK------------------------- 305
                        NST+L D+ + E  L  QI   K                         
Sbjct: 219  LILILDLDHTLINSTKLHDISAAENNLGIQIAASKGCRISYSSPETSVVHGYFLPPSVSR 278

Query: 306  -----------DDPDRSLFRLDSMHMLTKLRPFVNNFLKEASSLFEMYIYTMAERSYAIE 452
                       DDP+ SLF L+ M MLTKLRPFV  FLKEAS++FEMYIYTM +++YAIE
Sbjct: 279  LLQLTTNMPYADDPNGSLFTLEGMQMLTKLRPFVRKFLKEASNMFEMYIYTMGDKAYAIE 338

Query: 453  IAKLLDPGKVYFDSKVITQADCTQRHQKGLDVVLGAESLVVILDDTEVVWQRHKDNLIQM 632
            IAKLLDP  VYF+SKVI+ +DCTQRHQKGLD+VLGAES+ VILDDTE VWQ+HK+NLI M
Sbjct: 339  IAKLLDPRNVYFNSKVISNSDCTQRHQKGLDMVLGAESVAVILDDTEYVWQKHKENLILM 398

Query: 633  ERYHFFASSCRQFGFNAKSLSELMKDERESDGALATVLKILKRTHQMFFDPALGSDVSTR 812
            ERYH+FASSCRQFGF+ KSLSELM+DER SDGALAT+L +LKR H +FFD A+ + +S+R
Sbjct: 399  ERYHYFASSCRQFGFSVKSLSELMQDERGSDGALATILDVLKRIHTIFFDLAVETALSSR 458

Query: 813  DVRPLLKGIRREILQGCKIVFSRVFPSNSLAQDQPIWKMAEQLGATCCTEVDPSVTHVIS 992
            DVR ++K +R+E+LQGCK+VFSRVFPS+S  QDQ IWKMAEQLGA C  +VD ++THV++
Sbjct: 459  DVRQVIKRVRQEVLQGCKLVFSRVFPSSSRPQDQFIWKMAEQLGAICSADVDSTITHVVA 518

Query: 993  TDTGTQKAHWATQNKKFLVNPHWIEASNFLWQRQKEENFSI 1115
             D GT KA WA +NKK LV+P WIEASNF W RQ+EE+F +
Sbjct: 519  VDVGTDKARWAVKNKKILVHPRWIEASNFRWHRQQEEDFPV 559


>ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa]
            gi|550318538|gb|EEF03112.2| hypothetical protein
            POPTR_0018s11760g [Populus trichocarpa]
          Length = 472

 Score =  448 bits (1153), Expect = e-123
 Identities = 219/362 (60%), Positives = 272/362 (75%)
 Frame = +3

Query: 39   HPGFFKGLCIRCGQIEEDESGVAFGYIHKDLRLGNVEMARLRGADXXXXXXXXXXXXXXX 218
            HPG F  +CI CGQ+ + ESGV FGYIHK LRLGN E+ RLR  D               
Sbjct: 110  HPGSFGTMCIVCGQLLDGESGVTFGYIHKGLRLGNDEIVRLRNTDMKNLLRHKKLYLILD 169

Query: 219  XXXXXXNSTRLVDVISDEEYLLRQIDGMKDDPDRSLFRLDSMHMLTKLRPFVNNFLKEAS 398
                  NST+L+ +  DEEYL  Q D ++D    SLF L SM M+TKLRPFV  FLKEAS
Sbjct: 170  LDHTLLNSTQLMHMTLDEEYLNGQTDSLQDVSKGSLFMLSSMQMMTKLRPFVRTFLKEAS 229

Query: 399  SLFEMYIYTMAERSYAIEIAKLLDPGKVYFDSKVITQADCTQRHQKGLDVVLGAESLVVI 578
             +FEMYIYTM +R+YA+E+AKLLDPG+ YF++KVI++ D TQRHQKGLDVVLG ES V+I
Sbjct: 230  QMFEMYIYTMGDRAYALEMAKLLDPGREYFNAKVISRDDGTQRHQKGLDVVLGQESAVLI 289

Query: 579  LDDTEVVWQRHKDNLIQMERYHFFASSCRQFGFNAKSLSELMKDERESDGALATVLKILK 758
            LDDTE  W +HKDNLI MERYHFFASSC QFGFN KSLSE   DE ES+GALA++LK+L+
Sbjct: 290  LDDTENAWMKHKDNLILMERYHFFASSCHQFGFNCKSLSEQKTDESESEGALASILKVLR 349

Query: 759  RTHQMFFDPALGSDVSTRDVRPLLKGIRREILQGCKIVFSRVFPSNSLAQDQPIWKMAEQ 938
            + HQ+FF+  L  ++  RDVR +LK +R+++L+GCKIVFSRVFP+ S A +  +W+MAEQ
Sbjct: 350  KIHQIFFE-ELEENMDGRDVRQVLKTVRKDVLKGCKIVFSRVFPTQSQADNHHLWRMAEQ 408

Query: 939  LGATCCTEVDPSVTHVISTDTGTQKAHWATQNKKFLVNPHWIEASNFLWQRQKEENFSIS 1118
            LGATC TE+DPSVTHV+S D+GT+K+HWA ++ KFLV P WIEA+N+ WQRQ EENFS +
Sbjct: 409  LGATCSTELDPSVTHVVSKDSGTEKSHWALKHNKFLVQPGWIEAANYFWQRQPEENFSFN 468

Query: 1119 NL 1124
             +
Sbjct: 469  QI 470


>ref|XP_002439741.1| hypothetical protein SORBIDRAFT_09g019310 [Sorghum bicolor]
            gi|241945026|gb|EES18171.1| hypothetical protein
            SORBIDRAFT_09g019310 [Sorghum bicolor]
          Length = 547

 Score =  445 bits (1145), Expect = e-122
 Identities = 221/364 (60%), Positives = 274/364 (75%), Gaps = 3/364 (0%)
 Frame = +3

Query: 36   PHPGFFKGLCIRCGQIEEDE--SGVAFGYIHKDLRLGNVEMARLRGADXXXXXXXXXXXX 209
            PHPG+ +GLC  CG  +++E  SGVA  YI K LRL   E+ RLR AD            
Sbjct: 110  PHPGYIRGLCYICGNPQDEEYISGVALDYIDKGLRLRTSEIDRLRCADLKNLLRERKLVL 169

Query: 210  XXXXXXXXXNSTRLVDVISDEEYLLRQIDGMKDDPDRSLFRLDSMHMLTKLRPFVNNFLK 389
                     NST+L ++ S E+ L  Q    KDDP+RS+F L+SM +LTKLRPFV  FLK
Sbjct: 170  ILDLDHTLINSTKLQNISSAEKDLGIQTAASKDDPNRSIFALESMQLLTKLRPFVREFLK 229

Query: 390  EASSLFEMYIYTMAERSYAIEIAKLLDPGKVYFDSKVITQADCTQRHQKGLDVVLGAESL 569
            EAS++FEMYIYTM +++YAIEIAKLLDP  +YF  KVI+ +DCT+RHQKGLDV+LGA S+
Sbjct: 230  EASNMFEMYIYTMGDKAYAIEIAKLLDPSNIYFPLKVISNSDCTKRHQKGLDVILGAASV 289

Query: 570  VVILDDTEVVWQRHKDNLIQMERYHFFASSCRQFGFNAKSLSELMKDERESDGALATVLK 749
             VILDDTE VW++HK+NLI MERYHFFASSCR+FGF  +SLSELM+DERESDGALATVL 
Sbjct: 290  AVILDDTEFVWKKHKENLILMERYHFFASSCREFGFAVRSLSELMQDERESDGALATVLD 349

Query: 750  ILKRTHQMFFDPALGS-DVSTRDVRPLLKGIRREILQGCKIVFSRVFPSNSLAQDQPIWK 926
            +LKR H +FFD A+ + D+S+RDVR ++K +R+EILQGCKIVFSRVFP+N+  Q Q +WK
Sbjct: 350  VLKRIHAIFFDMAVETDDLSSRDVRQVIKAVRKEILQGCKIVFSRVFPNNTRPQKQMVWK 409

Query: 927  MAEQLGATCCTEVDPSVTHVISTDTGTQKAHWATQNKKFLVNPHWIEASNFLWQRQKEEN 1106
            MAE LGA C T+VD SVTHV++ D GT+KA W   NKKFLV+P WIEA+NF W RQ EE+
Sbjct: 410  MAEYLGAVCSTDVDSSVTHVVTVDLGTEKARWGVANKKFLVHPRWIEAANFRWHRQPEED 469

Query: 1107 FSIS 1118
            F ++
Sbjct: 470  FPVT 473


>gb|EOY32064.1| RNA polymerase II ctd phosphatase, putative isoform 1 [Theobroma
            cacao]
          Length = 469

 Score =  444 bits (1143), Expect = e-122
 Identities = 222/367 (60%), Positives = 268/367 (73%)
 Frame = +3

Query: 18   KNVICPPHPGFFKGLCIRCGQIEEDESGVAFGYIHKDLRLGNVEMARLRGADXXXXXXXX 197
            K  IC  HPG F  +CI CGQ  +DESGV FGYIHK LRLGN E+ RLR  D        
Sbjct: 100  KKDICT-HPGSFGQMCILCGQRLDDESGVTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHK 158

Query: 198  XXXXXXXXXXXXXNSTRLVDVISDEEYLLRQIDGMKDDPDRSLFRLDSMHMLTKLRPFVN 377
                         NST+L+ +  DEEYL  Q D ++D    SLF LD MHM+TKLRPFV 
Sbjct: 159  KLYLVLDLDHTLLNSTQLMHLTPDEEYLKGQSDSLQDVSRGSLFMLDFMHMMTKLRPFVR 218

Query: 378  NFLKEASSLFEMYIYTMAERSYAIEIAKLLDPGKVYFDSKVITQADCTQRHQKGLDVVLG 557
             FLKEAS +FEMYIYTM +R YA+E+AKLLDP + YF  +VI++ D TQ+HQKGLDVVLG
Sbjct: 219  TFLKEASEMFEMYIYTMGDRPYALEMAKLLDPRREYFSDRVISRDDGTQKHQKGLDVVLG 278

Query: 558  AESLVVILDDTEVVWQRHKDNLIQMERYHFFASSCRQFGFNAKSLSELMKDERESDGALA 737
             ES VVILDDTE  W +HKDNLI MERYH+FASSC QFG+  KSLS+L  DE E DGALA
Sbjct: 279  QESAVVILDDTENAWMKHKDNLILMERYHYFASSCHQFGYKCKSLSQLKSDESEPDGALA 338

Query: 738  TVLKILKRTHQMFFDPALGSDVSTRDVRPLLKGIRREILQGCKIVFSRVFPSNSLAQDQP 917
            +VLK L++ H MFFD  L  ++++RDVR +LK ++ E+L+GCKIVFS VFP+N  A+  P
Sbjct: 339  SVLKALRQIHHMFFD-ELDCNLASRDVRQVLKTVQEEVLKGCKIVFSHVFPTNFPAESHP 397

Query: 918  IWKMAEQLGATCCTEVDPSVTHVISTDTGTQKAHWATQNKKFLVNPHWIEASNFLWQRQK 1097
            +WKMAEQLGATC TE D SVTHV+STD GT+K+ WA + KKFLV+P WIEA+N+LWQ+Q 
Sbjct: 398  LWKMAEQLGATCSTETDLSVTHVVSTDAGTEKSRWAVKEKKFLVHPRWIEATNYLWQKQP 457

Query: 1098 EENFSIS 1118
            EENF +S
Sbjct: 458  EENFPVS 464


>ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
            gi|223534449|gb|EEF36151.1| RNA polymerase II ctd
            phosphatase, putative [Ricinus communis]
          Length = 478

 Score =  443 bits (1139), Expect = e-121
 Identities = 214/360 (59%), Positives = 269/360 (74%)
 Frame = +3

Query: 39   HPGFFKGLCIRCGQIEEDESGVAFGYIHKDLRLGNVEMARLRGADXXXXXXXXXXXXXXX 218
            HPG F  +CI CG+   +E+GV FGYIHK LRL N E+ RLR  D               
Sbjct: 113  HPGSFGDMCILCGERLIEETGVTFGYIHKGLRLANDEIVRLRNTDMKNLLRHRKLYLVLD 172

Query: 219  XXXXXXNSTRLVDVISDEEYLLRQIDGMKDDPDRSLFRLDSMHMLTKLRPFVNNFLKEAS 398
                  NST+L+ + ++EEYL  QID M+D  + SLF +D MHM+TKLRPF+  FLKEAS
Sbjct: 173  LDHTLLNSTQLMHLTAEEEYLKSQIDSMQDVSNGSLFMVDFMHMMTKLRPFIRTFLKEAS 232

Query: 399  SLFEMYIYTMAERSYAIEIAKLLDPGKVYFDSKVITQADCTQRHQKGLDVVLGAESLVVI 578
             +FEMYIYTM +R+YA+E+AK LDPG+ YF+++VI++ D TQRHQKGLD+VLG ES V+I
Sbjct: 233  QMFEMYIYTMGDRAYALEMAKFLDPGREYFNARVISRDDGTQRHQKGLDIVLGQESAVLI 292

Query: 579  LDDTEVVWQRHKDNLIQMERYHFFASSCRQFGFNAKSLSELMKDERESDGALATVLKILK 758
            LDDTE  W +HKDNLI MERYHFFASSCRQFGF  KSLS+L  DE ESDGALA+VLK+L+
Sbjct: 293  LDDTENAWTKHKDNLILMERYHFFASSCRQFGFECKSLSQLKSDENESDGALASVLKVLR 352

Query: 759  RTHQMFFDPALGSDVSTRDVRPLLKGIRREILQGCKIVFSRVFPSNSLAQDQPIWKMAEQ 938
            R H +FFD  L   +  RDVR +L  +R+++L+GCKIVFSRVFP+   A +  +WKMAEQ
Sbjct: 353  RIHHIFFD-ELEDAIDGRDVRQVLSTVRKDVLKGCKIVFSRVFPTQFQADNHHLWKMAEQ 411

Query: 939  LGATCCTEVDPSVTHVISTDTGTQKAHWATQNKKFLVNPHWIEASNFLWQRQKEENFSIS 1118
            LGATC  EVDPSVTHV+S + GT+K+ WA +N KFLV+P WIEA+N++WQRQ EENFS++
Sbjct: 412  LGATCSREVDPSVTHVVSAEAGTEKSRWALKNDKFLVHPRWIEATNYMWQRQPEENFSVN 471


>ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Cucumis sativus]
          Length = 452

 Score =  438 bits (1127), Expect = e-120
 Identities = 222/382 (58%), Positives = 272/382 (71%)
 Frame = +3

Query: 12   LGKNVICPPHPGFFKGLCIRCGQIEEDESGVAFGYIHKDLRLGNVEMARLRGADXXXXXX 191
            L K  +C  HPG F  +CI CGQ  ++ESGV FGYIHK+LRL N E+ R+R  +      
Sbjct: 76   LSKQQLCS-HPGSFGNMCIICGQRLDEESGVTFGYIHKELRLNNDEINRMRNKEMKELLQ 134

Query: 192  XXXXXXXXXXXXXXXNSTRLVDVISDEEYLLRQIDGMKDDPDRSLFRLDSMHMLTKLRPF 371
                           NST L  +  +EEYL  Q D + D    SLF L+S+H +TKLRPF
Sbjct: 135  RKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLDDVTKGSLFLLNSVHTMTKLRPF 194

Query: 372  VNNFLKEASSLFEMYIYTMAERSYAIEIAKLLDPGKVYFDSKVITQADCTQRHQKGLDVV 551
            V++FLKEAS LFEMYIYTM ER YA E+AKLLDP K YF SKVI++ D TQ+HQKGLDVV
Sbjct: 195  VHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVV 254

Query: 552  LGAESLVVILDDTEVVWQRHKDNLIQMERYHFFASSCRQFGFNAKSLSELMKDERESDGA 731
            LG ES V+ILDDTE  W +HK+NLI MERYHFFASSCRQFGFN KSLSEL  DE E+DGA
Sbjct: 255  LGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCKSLSELKNDESETDGA 314

Query: 732  LATVLKILKRTHQMFFDPALGSDVSTRDVRPLLKGIRREILQGCKIVFSRVFPSNSLAQD 911
            L T+LK+LK+ H MFF+   G D+  RDVR +LK +R E+L+GCK+VFSRVFP+   A++
Sbjct: 315  LTTILKVLKQVHHMFFNEVSG-DLVDRDVRQVLKTVRAEVLEGCKVVFSRVFPTKFQAEN 373

Query: 912  QPIWKMAEQLGATCCTEVDPSVTHVISTDTGTQKAHWATQNKKFLVNPHWIEASNFLWQR 1091
              +WKM EQLG TC TE+D SVTHV++TD GT+K+ WA + KKFLV+P WIEASN+ W+R
Sbjct: 374  HQLWKMVEQLGGTCSTELDQSVTHVVATDAGTEKSRWALKEKKFLVHPRWIEASNYFWKR 433

Query: 1092 QKEENFSISNLPSHTTVDCTAK 1157
            Q EENF++      T V+ T K
Sbjct: 434  QMEENFTV----EQTKVEQTKK 451


>ref|XP_002324546.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa]
            gi|550318537|gb|EEF03111.2| hypothetical protein
            POPTR_0018s11760g [Populus trichocarpa]
          Length = 468

 Score =  436 bits (1121), Expect = e-119
 Identities = 215/362 (59%), Positives = 266/362 (73%)
 Frame = +3

Query: 39   HPGFFKGLCIRCGQIEEDESGVAFGYIHKDLRLGNVEMARLRGADXXXXXXXXXXXXXXX 218
            HPG F  +CI CGQ+ + ESGV FGYIHK LRLGN E+ RLR  D               
Sbjct: 110  HPGSFGTMCIVCGQLLDGESGVTFGYIHKGLRLGNDEIVRLRNTDMKNLLRHKKLYLILD 169

Query: 219  XXXXXXNSTRLVDVISDEEYLLRQIDGMKDDPDRSLFRLDSMHMLTKLRPFVNNFLKEAS 398
                  NST+L+ +  DEEYL  Q D ++D    SLF L SM M+TKLRPFV  FLKEAS
Sbjct: 170  LDHTLLNSTQLMHMTLDEEYLNGQTDSLQDVSKGSLFMLSSMQMMTKLRPFVRTFLKEAS 229

Query: 399  SLFEMYIYTMAERSYAIEIAKLLDPGKVYFDSKVITQADCTQRHQKGLDVVLGAESLVVI 578
             +FEMYIYTM +R+YA+E+AKLLDPG+ YF++KVI++ D TQRHQKGLDVVLG ES V+I
Sbjct: 230  QMFEMYIYTMGDRAYALEMAKLLDPGREYFNAKVISRDDGTQRHQKGLDVVLGQESAVLI 289

Query: 579  LDDTEVVWQRHKDNLIQMERYHFFASSCRQFGFNAKSLSELMKDERESDGALATVLKILK 758
            LDDTE  W +HKDNLI MERYHFFASSC QFGFN KSLSE   DE ES+GALA++LK+L+
Sbjct: 290  LDDTENAWMKHKDNLILMERYHFFASSCHQFGFNCKSLSEQKTDESESEGALASILKVLR 349

Query: 759  RTHQMFFDPALGSDVSTRDVRPLLKGIRREILQGCKIVFSRVFPSNSLAQDQPIWKMAEQ 938
            + HQ+FF+     D        +LK +R+++L+GCKIVFSRVFP+ S A +  +W+MAEQ
Sbjct: 350  KIHQIFFE-----DHILSLALQVLKTVRKDVLKGCKIVFSRVFPTQSQADNHHLWRMAEQ 404

Query: 939  LGATCCTEVDPSVTHVISTDTGTQKAHWATQNKKFLVNPHWIEASNFLWQRQKEENFSIS 1118
            LGATC TE+DPSVTHV+S D+GT+K+HWA ++ KFLV P WIEA+N+ WQRQ EENFS +
Sbjct: 405  LGATCSTELDPSVTHVVSKDSGTEKSHWALKHNKFLVQPGWIEAANYFWQRQPEENFSFN 464

Query: 1119 NL 1124
             +
Sbjct: 465  QI 466


>gb|EOY32065.1| RNA polymerase II ctd phosphatase, putative isoform 2 [Theobroma
            cacao]
          Length = 357

 Score =  436 bits (1120), Expect = e-119
 Identities = 215/353 (60%), Positives = 261/353 (73%)
 Frame = +3

Query: 60   LCIRCGQIEEDESGVAFGYIHKDLRLGNVEMARLRGADXXXXXXXXXXXXXXXXXXXXXN 239
            +CI CGQ  +DESGV FGYIHK LRLGN E+ RLR  D                     N
Sbjct: 1    MCILCGQRLDDESGVTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHKKLYLVLDLDHTLLN 60

Query: 240  STRLVDVISDEEYLLRQIDGMKDDPDRSLFRLDSMHMLTKLRPFVNNFLKEASSLFEMYI 419
            ST+L+ +  DEEYL  Q D ++D    SLF LD MHM+TKLRPFV  FLKEAS +FEMYI
Sbjct: 61   STQLMHLTPDEEYLKGQSDSLQDVSRGSLFMLDFMHMMTKLRPFVRTFLKEASEMFEMYI 120

Query: 420  YTMAERSYAIEIAKLLDPGKVYFDSKVITQADCTQRHQKGLDVVLGAESLVVILDDTEVV 599
            YTM +R YA+E+AKLLDP + YF  +VI++ D TQ+HQKGLDVVLG ES VVILDDTE  
Sbjct: 121  YTMGDRPYALEMAKLLDPRREYFSDRVISRDDGTQKHQKGLDVVLGQESAVVILDDTENA 180

Query: 600  WQRHKDNLIQMERYHFFASSCRQFGFNAKSLSELMKDERESDGALATVLKILKRTHQMFF 779
            W +HKDNLI MERYH+FASSC QFG+  KSLS+L  DE E DGALA+VLK L++ H MFF
Sbjct: 181  WMKHKDNLILMERYHYFASSCHQFGYKCKSLSQLKSDESEPDGALASVLKALRQIHHMFF 240

Query: 780  DPALGSDVSTRDVRPLLKGIRREILQGCKIVFSRVFPSNSLAQDQPIWKMAEQLGATCCT 959
            D  L  ++++RDVR +LK ++ E+L+GCKIVFS VFP+N  A+  P+WKMAEQLGATC T
Sbjct: 241  D-ELDCNLASRDVRQVLKTVQEEVLKGCKIVFSHVFPTNFPAESHPLWKMAEQLGATCST 299

Query: 960  EVDPSVTHVISTDTGTQKAHWATQNKKFLVNPHWIEASNFLWQRQKEENFSIS 1118
            E D SVTHV+STD GT+K+ WA + KKFLV+P WIEA+N+LWQ+Q EENF +S
Sbjct: 300  ETDLSVTHVVSTDAGTEKSRWAVKEKKFLVHPRWIEATNYLWQKQPEENFPVS 352


>ref|XP_006838087.1| hypothetical protein AMTR_s00106p00017820 [Amborella trichopoda]
            gi|548840545|gb|ERN00656.1| hypothetical protein
            AMTR_s00106p00017820 [Amborella trichopoda]
          Length = 486

 Score =  429 bits (1103), Expect = e-117
 Identities = 208/375 (55%), Positives = 270/375 (72%), Gaps = 12/375 (3%)
 Frame = +3

Query: 27   ICPPHPGFFKGLCIRCGQIEEDES------GVAFGYIHKDLRLGNVEMARLRGADXXXXX 188
            +CPPHPGF+K +CIRCG+ ++DE+       VAF YIHKDL+LG  E+ARLR  D     
Sbjct: 104  VCPPHPGFYKDMCIRCGEQKDDETVARKETAVAFNYIHKDLKLGAEEVARLRATDLKNLY 163

Query: 189  XXXXXXXXXXXXXXXXNSTRLVDVISDEE------YLLRQIDGMKDDPDRSLFRLDSMHM 350
                            NSTRLVDV  +EE      YL ++      D   +LF+L+ +HM
Sbjct: 164  RRRKLYLVLDLDHTLLNSTRLVDVSPEEEAYLNATYLNKETSSSNGDTSGTLFKLEPLHM 223

Query: 351  LTKLRPFVNNFLKEASSLFEMYIYTMAERSYAIEIAKLLDPGKVYFDSKVITQADCTQRH 530
            LTKLRPFV  FLKEA+++FEMY+YTM ER+YA+E+AKLLDP  VYF S+VI+Q D T RH
Sbjct: 224  LTKLRPFVRTFLKEANTMFEMYVYTMGERAYALEMAKLLDPSGVYFGSRVISQGDSTVRH 283

Query: 531  QKGLDVVLGAESLVVILDDTEVVWQRHKDNLIQMERYHFFASSCRQFGFNAKSLSELMKD 710
            QKGLDVVLG+E  VVILDDTE VW +HK+NL+ MERYHFF+SSCRQF  + KSLSEL +D
Sbjct: 284  QKGLDVVLGSECAVVILDDTEHVWHKHKENLVLMERYHFFSSSCRQFNVHYKSLSELKRD 343

Query: 711  ERESDGALATVLKILKRTHQMFFDPALGSDVSTRDVRPLLKGIRREILQGCKIVFSRVFP 890
            E ESDG LA++L +LK  HQMF+   + +D +  DVR +LK I+ E+L+GC++VFSR+FP
Sbjct: 344  ESESDGMLASILNVLKHIHQMFYYQEVETDFNGSDVRKVLKTIQSEVLKGCRLVFSRIFP 403

Query: 891  SNSLAQDQPIWKMAEQLGATCCTEVDPSVTHVISTDTGTQKAHWATQNKKFLVNPHWIEA 1070
            +N   ++Q +W++AEQLGA+C  E+D +VTHV+S D GT+KA WA Q KK LVNP W+EA
Sbjct: 404  TNYPVENQTLWRIAEQLGASCSKELDEAVTHVVSLDLGTEKARWAIQRKKHLVNPGWLEA 463

Query: 1071 SNFLWQRQKEENFSI 1115
            +N+ W+RQ E+ F I
Sbjct: 464  TNYFWKRQPEDQFPI 478


Top