BLASTX nr result

ID: Ziziphus21_contig00021493

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ziziphus21_contig00021493
         (1819 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative ...   570   e-159
ref|XP_012481530.1| PREDICTED: RNA polymerase II C-terminal doma...   566   e-158
gb|KJB27893.1| hypothetical protein B456_005G016300 [Gossypium r...   566   e-158
ref|XP_012481529.1| PREDICTED: RNA polymerase II C-terminal doma...   566   e-158
ref|XP_012078975.1| PREDICTED: RNA polymerase II C-terminal doma...   566   e-158
ref|XP_007014445.1| RNA polymerase II ctd phosphatase, putative ...   565   e-158
ref|XP_010110375.1| RNA polymerase II C-terminal domain phosphat...   562   e-157
ref|XP_008242970.1| PREDICTED: RNA polymerase II C-terminal doma...   562   e-157
gb|KHG05109.1| RNA polymerase II C-terminal domain phosphatase-l...   560   e-156
ref|XP_007204345.1| hypothetical protein PRUPE_ppa005647mg [Prun...   555   e-155
ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Popu...   552   e-154
ref|XP_011656094.1| PREDICTED: RNA polymerase II C-terminal doma...   548   e-153
gb|KGN52677.1| hypothetical protein Csa_5G650420 [Cucumis sativus]    548   e-153
ref|XP_011656095.1| PREDICTED: RNA polymerase II C-terminal doma...   548   e-153
ref|XP_011018018.1| PREDICTED: RNA polymerase II C-terminal doma...   546   e-152
ref|XP_010048820.1| PREDICTED: RNA polymerase II C-terminal doma...   545   e-152
ref|XP_010048821.1| PREDICTED: RNA polymerase II C-terminal doma...   545   e-152
ref|XP_006486243.1| PREDICTED: RNA polymerase II C-terminal doma...   544   e-151
ref|XP_007014446.1| RNA polymerase II ctd phosphatase, putative ...   542   e-151
gb|KDO68848.1| hypothetical protein CISIN_1g041302mg [Citrus sin...   541   e-151

>ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
            gi|223534449|gb|EEF36151.1| RNA polymerase II ctd
            phosphatase, putative [Ricinus communis]
          Length = 478

 Score =  570 bits (1468), Expect = e-159
 Identities = 278/398 (69%), Positives = 324/398 (81%)
 Frame = -1

Query: 1549 IKRRKVENVESMEEPHASIXXXXXXXXXXXEPTSQNPCTHPGSFGDMCIRCGQRLEQESG 1370
            IKR +VE +E+ E P  S              +S+  CTHPGSFGDMCI CG+RL +E+G
Sbjct: 76   IKRSRVETLENGENPKESTRVSLDQTLVAS--SSKVACTHPGSFGDMCILCGERLIEETG 133

Query: 1369 VTFGYIHKGLRLNNDEIVRLRGKDMKNXXXXXXXXXXXXXXXXXLNSTLLVHLTSDEEYL 1190
            VTFGYIHKGLRL NDEIVRLR  DMKN                 LNST L+HLT++EEYL
Sbjct: 134  VTFGYIHKGLRLANDEIVRLRNTDMKNLLRHRKLYLVLDLDHTLLNSTQLMHLTAEEEYL 193

Query: 1189 KSQTDSLQDISNGSLFMLEAMHMMTKLRPFVRTFLKEASKMYELHIYTMGDRAYALAMAN 1010
            KSQ DS+QD+SNGSLFM++ MHMMTKLRPF+RTFLKEAS+M+E++IYTMGDRAYAL MA 
Sbjct: 194  KSQIDSMQDVSNGSLFMVDFMHMMTKLRPFIRTFLKEASQMFEMYIYTMGDRAYALEMAK 253

Query: 1009 LLDPRREYFGERVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTQHKGNLILMERY 830
             LDP REYF  RVISRDDGTQ+HQKGLD+VLGQESAVLILDDTENAWT+HK NLILMERY
Sbjct: 254  FLDPGREYFNARVISRDDGTQRHQKGLDIVLGQESAVLILDDTENAWTKHKDNLILMERY 313

Query: 829  HFFRSSCQQFGFHCKSLSEMKSDESELDGALATVLKVLKQTHNMFFDETSDGCTGKDVRQ 650
            HFF SSC+QFGF CKSLS++KSDE+E DGALA+VLKVL++ H++FFDE  D   G+DVRQ
Sbjct: 314  HFFASSCRQFGFECKSLSQLKSDENESDGALASVLKVLRRIHHIFFDELEDAIDGRDVRQ 373

Query: 649  VLKSIRKEVLKDCKIVFSRIFPTKFQAENHQLWKMAEQLGATCLTELDPSVTHVVATDAG 470
            VL ++RK+VLK CKIVFSR+FPT+FQA+NH LWKMAEQLGATC  E+DPSVTHVV+ +AG
Sbjct: 374  VLSTVRKDVLKGCKIVFSRVFPTQFQADNHHLWKMAEQLGATCSREVDPSVTHVVSAEAG 433

Query: 469  TDKSRWAVKEMKFLVHPRWIEAAYYLWQKQPEENFCVN 356
            T+KSRWA+K  KFLVHPRWIEA  Y+WQ+QPEENF VN
Sbjct: 434  TEKSRWALKNDKFLVHPRWIEATNYMWQRQPEENFSVN 471


>ref|XP_012481530.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X2 [Gossypium raimondii]
          Length = 404

 Score =  567 bits (1460), Expect = e-158
 Identities = 280/402 (69%), Positives = 320/402 (79%)
 Frame = -1

Query: 1546 KRRKVENVESMEEPHASIXXXXXXXXXXXEPTSQNPCTHPGSFGDMCIRCGQRLEQESGV 1367
            KR K E ++ +E P  S               +++ CTHPGSFG MCI CGQR++ ESGV
Sbjct: 4    KRCKTEKLDDLEGPQGSTSQGLIEEKLEVS-LNKDTCTHPGSFGQMCILCGQRVDDESGV 62

Query: 1366 TFGYIHKGLRLNNDEIVRLRGKDMKNXXXXXXXXXXXXXXXXXLNSTLLVHLTSDEEYLK 1187
            TFGYIHKGLRL NDEIVRLR  DMKN                 LNST L HLT++EEYLK
Sbjct: 63   TFGYIHKGLRLGNDEIVRLRSTDMKNLLRHKKLYLVLDLDHTLLNSTQLNHLTAEEEYLK 122

Query: 1186 SQTDSLQDISNGSLFMLEAMHMMTKLRPFVRTFLKEASKMYELHIYTMGDRAYALAMANL 1007
             Q+DS+QD+S GSLFMLE MHMMTKLRPFVRTFLKEAS+M+E++IYTMGDR YAL MA L
Sbjct: 123  GQSDSMQDVSKGSLFMLEFMHMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEMAKL 182

Query: 1006 LDPRREYFGERVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTQHKGNLILMERYH 827
            LDP++EYF  RVISRDDGTQKHQKGLDVVLGQ+SAV+ILDDTENAWT+HK NLILMERYH
Sbjct: 183  LDPKKEYFNGRVISRDDGTQKHQKGLDVVLGQDSAVVILDDTENAWTKHKDNLILMERYH 242

Query: 826  FFRSSCQQFGFHCKSLSEMKSDESELDGALATVLKVLKQTHNMFFDETSDGCTGKDVRQV 647
            FF SSC+QFGF C+SLS++KSDESE DGALA++LK+L+Q H++FFDE       +DVRQV
Sbjct: 243  FFASSCRQFGFDCRSLSQLKSDESEPDGALASILKILRQIHHIFFDELDSDLASRDVRQV 302

Query: 646  LKSIRKEVLKDCKIVFSRIFPTKFQAENHQLWKMAEQLGATCLTELDPSVTHVVATDAGT 467
            LK++RKEVLKDCKIVFSR+FPTKFQ ENH LWKMAEQLGATC TE D SVTHVV+ DAGT
Sbjct: 303  LKTVRKEVLKDCKIVFSRVFPTKFQPENHLLWKMAEQLGATCSTETDSSVTHVVSMDAGT 362

Query: 466  DKSRWAVKEMKFLVHPRWIEAAYYLWQKQPEENFCVNIVKNQ 341
            +KSRWAVKE KFLVHPRWIEAA + W KQPEE F V+  KNQ
Sbjct: 363  EKSRWAVKENKFLVHPRWIEAANFFWLKQPEEKFPVSQTKNQ 404


>gb|KJB27893.1| hypothetical protein B456_005G016300 [Gossypium raimondii]
          Length = 469

 Score =  567 bits (1460), Expect = e-158
 Identities = 280/402 (69%), Positives = 320/402 (79%)
 Frame = -1

Query: 1546 KRRKVENVESMEEPHASIXXXXXXXXXXXEPTSQNPCTHPGSFGDMCIRCGQRLEQESGV 1367
            KR K E ++ +E P  S               +++ CTHPGSFG MCI CGQR++ ESGV
Sbjct: 70   KRCKTEKLDDLEGPQGSTSQGLIEEKLVSL--NKDTCTHPGSFGQMCILCGQRVDDESGV 127

Query: 1366 TFGYIHKGLRLNNDEIVRLRGKDMKNXXXXXXXXXXXXXXXXXLNSTLLVHLTSDEEYLK 1187
            TFGYIHKGLRL NDEIVRLR  DMKN                 LNST L HLT++EEYLK
Sbjct: 128  TFGYIHKGLRLGNDEIVRLRSTDMKNLLRHKKLYLVLDLDHTLLNSTQLNHLTAEEEYLK 187

Query: 1186 SQTDSLQDISNGSLFMLEAMHMMTKLRPFVRTFLKEASKMYELHIYTMGDRAYALAMANL 1007
             Q+DS+QD+S GSLFMLE MHMMTKLRPFVRTFLKEAS+M+E++IYTMGDR YAL MA L
Sbjct: 188  GQSDSMQDVSKGSLFMLEFMHMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEMAKL 247

Query: 1006 LDPRREYFGERVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTQHKGNLILMERYH 827
            LDP++EYF  RVISRDDGTQKHQKGLDVVLGQ+SAV+ILDDTENAWT+HK NLILMERYH
Sbjct: 248  LDPKKEYFNGRVISRDDGTQKHQKGLDVVLGQDSAVVILDDTENAWTKHKDNLILMERYH 307

Query: 826  FFRSSCQQFGFHCKSLSEMKSDESELDGALATVLKVLKQTHNMFFDETSDGCTGKDVRQV 647
            FF SSC+QFGF C+SLS++KSDESE DGALA++LK+L+Q H++FFDE       +DVRQV
Sbjct: 308  FFASSCRQFGFDCRSLSQLKSDESEPDGALASILKILRQIHHIFFDELDSDLASRDVRQV 367

Query: 646  LKSIRKEVLKDCKIVFSRIFPTKFQAENHQLWKMAEQLGATCLTELDPSVTHVVATDAGT 467
            LK++RKEVLKDCKIVFSR+FPTKFQ ENH LWKMAEQLGATC TE D SVTHVV+ DAGT
Sbjct: 368  LKTVRKEVLKDCKIVFSRVFPTKFQPENHLLWKMAEQLGATCSTETDSSVTHVVSMDAGT 427

Query: 466  DKSRWAVKEMKFLVHPRWIEAAYYLWQKQPEENFCVNIVKNQ 341
            +KSRWAVKE KFLVHPRWIEAA + W KQPEE F V+  KNQ
Sbjct: 428  EKSRWAVKENKFLVHPRWIEAANFFWLKQPEEKFPVSQTKNQ 469


>ref|XP_012481529.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X1 [Gossypium raimondii]
            gi|763760638|gb|KJB27892.1| hypothetical protein
            B456_005G016300 [Gossypium raimondii]
          Length = 470

 Score =  567 bits (1460), Expect = e-158
 Identities = 280/402 (69%), Positives = 320/402 (79%)
 Frame = -1

Query: 1546 KRRKVENVESMEEPHASIXXXXXXXXXXXEPTSQNPCTHPGSFGDMCIRCGQRLEQESGV 1367
            KR K E ++ +E P  S               +++ CTHPGSFG MCI CGQR++ ESGV
Sbjct: 70   KRCKTEKLDDLEGPQGSTSQGLIEEKLEVS-LNKDTCTHPGSFGQMCILCGQRVDDESGV 128

Query: 1366 TFGYIHKGLRLNNDEIVRLRGKDMKNXXXXXXXXXXXXXXXXXLNSTLLVHLTSDEEYLK 1187
            TFGYIHKGLRL NDEIVRLR  DMKN                 LNST L HLT++EEYLK
Sbjct: 129  TFGYIHKGLRLGNDEIVRLRSTDMKNLLRHKKLYLVLDLDHTLLNSTQLNHLTAEEEYLK 188

Query: 1186 SQTDSLQDISNGSLFMLEAMHMMTKLRPFVRTFLKEASKMYELHIYTMGDRAYALAMANL 1007
             Q+DS+QD+S GSLFMLE MHMMTKLRPFVRTFLKEAS+M+E++IYTMGDR YAL MA L
Sbjct: 189  GQSDSMQDVSKGSLFMLEFMHMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEMAKL 248

Query: 1006 LDPRREYFGERVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTQHKGNLILMERYH 827
            LDP++EYF  RVISRDDGTQKHQKGLDVVLGQ+SAV+ILDDTENAWT+HK NLILMERYH
Sbjct: 249  LDPKKEYFNGRVISRDDGTQKHQKGLDVVLGQDSAVVILDDTENAWTKHKDNLILMERYH 308

Query: 826  FFRSSCQQFGFHCKSLSEMKSDESELDGALATVLKVLKQTHNMFFDETSDGCTGKDVRQV 647
            FF SSC+QFGF C+SLS++KSDESE DGALA++LK+L+Q H++FFDE       +DVRQV
Sbjct: 309  FFASSCRQFGFDCRSLSQLKSDESEPDGALASILKILRQIHHIFFDELDSDLASRDVRQV 368

Query: 646  LKSIRKEVLKDCKIVFSRIFPTKFQAENHQLWKMAEQLGATCLTELDPSVTHVVATDAGT 467
            LK++RKEVLKDCKIVFSR+FPTKFQ ENH LWKMAEQLGATC TE D SVTHVV+ DAGT
Sbjct: 369  LKTVRKEVLKDCKIVFSRVFPTKFQPENHLLWKMAEQLGATCSTETDSSVTHVVSMDAGT 428

Query: 466  DKSRWAVKEMKFLVHPRWIEAAYYLWQKQPEENFCVNIVKNQ 341
            +KSRWAVKE KFLVHPRWIEAA + W KQPEE F V+  KNQ
Sbjct: 429  EKSRWAVKENKFLVHPRWIEAANFFWLKQPEEKFPVSQTKNQ 470


>ref|XP_012078975.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            [Jatropha curcas] gi|802640739|ref|XP_012078976.1|
            PREDICTED: RNA polymerase II C-terminal domain
            phosphatase-like 4 [Jatropha curcas]
            gi|643722394|gb|KDP32215.1| hypothetical protein
            JCGZ_13822 [Jatropha curcas]
          Length = 470

 Score =  566 bits (1459), Expect = e-158
 Identities = 281/404 (69%), Positives = 325/404 (80%), Gaps = 1/404 (0%)
 Frame = -1

Query: 1549 IKRRKVENVESMEEPHASIXXXXXXXXXXXEPTSQNPCTHPGSFGDMCIRCGQRLEQESG 1370
            IKR +VE +E++E+P  S              +S+  CTHPGSFGDMCI CGQRL +E+G
Sbjct: 68   IKRSRVETLENVEDPKGSTFHGSLDLNLGAS-SSKVACTHPGSFGDMCIICGQRLNEETG 126

Query: 1369 VTFGYIHKGLRLNNDEIVRLRGKDMKNXXXXXXXXXXXXXXXXXLNSTLLVHLTSDEEYL 1190
            VT  YIHKGLRL NDEIVRLR  D KN                 LNST L+H+T++EEYL
Sbjct: 127  VTLAYIHKGLRLGNDEIVRLRNSDTKNLLRHKKLYLVLDLDHTLLNSTQLMHMTAEEEYL 186

Query: 1189 KSQTDSLQDISNGSLFMLEAMHMMTKLRPFVRTFLKEASKMYELHIYTMGDRAYALAMAN 1010
            KSQ DSLQD+SNGSLF L+ MHMMTKLRP+V TFLKEAS+M+E++IYTMGDRAYAL MA 
Sbjct: 187  KSQLDSLQDVSNGSLFKLDFMHMMTKLRPYVHTFLKEASQMFEMYIYTMGDRAYALEMAK 246

Query: 1009 LLDPRREYFGERVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTQHKGNLILMERY 830
            LLDPRREYF  RVISRDDGTQ+HQKGLD+VLGQESAVLILDDTE AWT+HK NLILMERY
Sbjct: 247  LLDPRREYFNARVISRDDGTQRHQKGLDIVLGQESAVLILDDTETAWTKHKDNLILMERY 306

Query: 829  HFFRSSCQQFGFHCKSLSEMKSDESELDGALATVLKVLKQTHNMFFDETSD-GCTGKDVR 653
            HFF SSC QFGF CKSLSE+KSDES+ DGALA+VLKVL++ H++FFDE  D     +DVR
Sbjct: 307  HFFASSCHQFGFSCKSLSELKSDESDSDGALASVLKVLRRIHHIFFDELMDVNLDSRDVR 366

Query: 652  QVLKSIRKEVLKDCKIVFSRIFPTKFQAENHQLWKMAEQLGATCLTELDPSVTHVVATDA 473
            QVLK++RK+VL+ CKIVFSR+FPT+FQA NHQLWKMAEQLGA C TELD S+THVV+T+A
Sbjct: 367  QVLKTVRKDVLEGCKIVFSRVFPTQFQANNHQLWKMAEQLGAICSTELDSSITHVVSTEA 426

Query: 472  GTDKSRWAVKEMKFLVHPRWIEAAYYLWQKQPEENFCVNIVKNQ 341
            GT+KSRWA+K  KFLVHPRWIEAA YLWQ+QPEENF VN  K+Q
Sbjct: 427  GTEKSRWAMKNKKFLVHPRWIEAANYLWQRQPEENFSVNQPKHQ 470


>ref|XP_007014445.1| RNA polymerase II ctd phosphatase, putative isoform 1 [Theobroma
            cacao] gi|508784808|gb|EOY32064.1| RNA polymerase II ctd
            phosphatase, putative isoform 1 [Theobroma cacao]
          Length = 469

 Score =  565 bits (1456), Expect = e-158
 Identities = 284/405 (70%), Positives = 317/405 (78%), Gaps = 3/405 (0%)
 Frame = -1

Query: 1546 KRRKVENVESMEEPHASIXXXXXXXXXXXEPT---SQNPCTHPGSFGDMCIRCGQRLEQE 1376
            KR K E +E +EE   S                   ++ CTHPGSFG MCI CGQRL+ E
Sbjct: 65   KRCKTEKLEDLEESRGSTSQGLIEDKIVIHAELSLKKDICTHPGSFGQMCILCGQRLDDE 124

Query: 1375 SGVTFGYIHKGLRLNNDEIVRLRGKDMKNXXXXXXXXXXXXXXXXXLNSTLLVHLTSDEE 1196
            SGVTFGYIHKGLRL NDEIVRLR  DMKN                 LNST L+HLT DEE
Sbjct: 125  SGVTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHKKLYLVLDLDHTLLNSTQLMHLTPDEE 184

Query: 1195 YLKSQTDSLQDISNGSLFMLEAMHMMTKLRPFVRTFLKEASKMYELHIYTMGDRAYALAM 1016
            YLK Q+DSLQD+S GSLFML+ MHMMTKLRPFVRTFLKEAS+M+E++IYTMGDR YAL M
Sbjct: 185  YLKGQSDSLQDVSRGSLFMLDFMHMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEM 244

Query: 1015 ANLLDPRREYFGERVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTQHKGNLILME 836
            A LLDPRREYF +RVISRDDGTQKHQKGLDVVLGQESAV+ILDDTENAW +HK NLILME
Sbjct: 245  AKLLDPRREYFSDRVISRDDGTQKHQKGLDVVLGQESAVVILDDTENAWMKHKDNLILME 304

Query: 835  RYHFFRSSCQQFGFHCKSLSEMKSDESELDGALATVLKVLKQTHNMFFDETSDGCTGKDV 656
            RYH+F SSC QFG+ CKSLS++KSDESE DGALA+VLK L+Q H+MFFDE       +DV
Sbjct: 305  RYHYFASSCHQFGYKCKSLSQLKSDESEPDGALASVLKALRQIHHMFFDELDCNLASRDV 364

Query: 655  RQVLKSIRKEVLKDCKIVFSRIFPTKFQAENHQLWKMAEQLGATCLTELDPSVTHVVATD 476
            RQVLK++++EVLK CKIVFS +FPT F AE+H LWKMAEQLGATC TE D SVTHVV+TD
Sbjct: 365  RQVLKTVQEEVLKGCKIVFSHVFPTNFPAESHPLWKMAEQLGATCSTETDLSVTHVVSTD 424

Query: 475  AGTDKSRWAVKEMKFLVHPRWIEAAYYLWQKQPEENFCVNIVKNQ 341
            AGT+KSRWAVKE KFLVHPRWIEA  YLWQKQPEENF V+  KNQ
Sbjct: 425  AGTEKSRWAVKEKKFLVHPRWIEATNYLWQKQPEENFPVSQGKNQ 469


>ref|XP_010110375.1| RNA polymerase II C-terminal domain phosphatase-like 4 [Morus
            notabilis] gi|587939514|gb|EXC26161.1| RNA polymerase II
            C-terminal domain phosphatase-like 4 [Morus notabilis]
          Length = 512

 Score =  562 bits (1448), Expect = e-157
 Identities = 280/372 (75%), Positives = 311/372 (83%), Gaps = 1/372 (0%)
 Frame = -1

Query: 1453 TSQNPCTHPGSFGDMCIRCGQRLEQESGVTFGYIHKGLRLNNDEIVRLRGKDMKNXXXXX 1274
            T ++ CTHPGSFGDMCI CGQRLE+E+GVTFGYIHKGLRLNNDEIVRLR  DMKN     
Sbjct: 141  TKKDACTHPGSFGDMCILCGQRLEEETGVTFGYIHKGLRLNNDEIVRLRSTDMKNLIRHK 200

Query: 1273 XXXXXXXXXXXXLNSTLLVHLTSDEEYLKSQTDSLQDISNGSLFMLEAMHMMTKLRPFVR 1094
                        LNST LV L+S+E+YLKSQ  S QD S GSLF+LEAMHMMTKLRPFVR
Sbjct: 201  KLCLVLDLDHTLLNSTRLVDLSSEEQYLKSQAFSPQDASEGSLFVLEAMHMMTKLRPFVR 260

Query: 1093 TFLKEASKMYELHIYTMGDRAYALAMANLLDPRREYFGERVISRDDGTQKHQKGLDVVLG 914
             FLKE   ++EL++YTMGDR YALAMA LLDPRREYFG+R+ISRDDGT KHQKGLDVVLG
Sbjct: 261  NFLKEVYNLFELYVYTMGDRPYALAMAKLLDPRREYFGDRIISRDDGTLKHQKGLDVVLG 320

Query: 913  QESAVLILDDTENAWTQH-KGNLILMERYHFFRSSCQQFGFHCKSLSEMKSDESELDGAL 737
            QESAVLILDDTENAW +H K NLILMERYHFFRSS  QFG++CKSLSE+KSDESE +GAL
Sbjct: 321  QESAVLILDDTENAWIKHHKENLILMERYHFFRSSTHQFGYNCKSLSELKSDESETEGAL 380

Query: 736  ATVLKVLKQTHNMFFDETSDGCTGKDVRQVLKSIRKEVLKDCKIVFSRIFPTKFQAENHQ 557
             TVL VLKQ H+MFFDE       +DVRQVLK++RKEVLK CKIVFSR+FPT+FQAENHQ
Sbjct: 381  VTVLNVLKQVHSMFFDERGIDHIIRDVRQVLKTLRKEVLKGCKIVFSRVFPTEFQAENHQ 440

Query: 556  LWKMAEQLGATCLTELDPSVTHVVATDAGTDKSRWAVKEMKFLVHPRWIEAAYYLWQKQP 377
            LWKMAEQLGATC  ELDPSVTHVV+ D GT+KSRWAVKE KFLVHPRWIEAA Y+W++QP
Sbjct: 441  LWKMAEQLGATCGIELDPSVTHVVSLDVGTEKSRWAVKENKFLVHPRWIEAANYMWKRQP 500

Query: 376  EENFCVNIVKNQ 341
            E+NF VN VKNQ
Sbjct: 501  EDNFSVNQVKNQ 512


>ref|XP_008242970.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X1 [Prunus mume]
          Length = 449

 Score =  562 bits (1448), Expect = e-157
 Identities = 278/400 (69%), Positives = 322/400 (80%)
 Frame = -1

Query: 1546 KRRKVENVESMEEPHASIXXXXXXXXXXXEPTSQNPCTHPGSFGDMCIRCGQRLEQESGV 1367
            KRRKVEN+ S+++ H S             P + + CTHPGS  D+CI CGQR++++SGV
Sbjct: 50   KRRKVENLGSIDKTHGSTSQVFVEENSEASPKT-DICTHPGSVKDLCIVCGQRVDEKSGV 108

Query: 1366 TFGYIHKGLRLNNDEIVRLRGKDMKNXXXXXXXXXXXXXXXXXLNSTLLVHLTSDEEYLK 1187
              GYIHK   LNNDEI R+R  D+K                  LNST L H+T++EEYL 
Sbjct: 109  PLGYIHKDFWLNNDEIDRVRSTDIKKSLHLKKLYLVLDLDHTLLNSTHLNHMTAEEEYLH 168

Query: 1186 SQTDSLQDISNGSLFMLEAMHMMTKLRPFVRTFLKEASKMYELHIYTMGDRAYALAMANL 1007
            SQTDSLQD+SNGSLF ++ MHMMTKLRPFVR FLKEAS+M+E++IYTMG+RAYAL MA L
Sbjct: 169  SQTDSLQDVSNGSLFRVDVMHMMTKLRPFVRKFLKEASEMFEMYIYTMGERAYALEMAKL 228

Query: 1006 LDPRREYFGERVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTQHKGNLILMERYH 827
            LDPR+EYFG+RVISRDDGTQKHQKGLDVVLGQESA LILDDTENAWT+HK NLILMERYH
Sbjct: 229  LDPRKEYFGDRVISRDDGTQKHQKGLDVVLGQESAALILDDTENAWTKHKDNLILMERYH 288

Query: 826  FFRSSCQQFGFHCKSLSEMKSDESELDGALATVLKVLKQTHNMFFDETSDGCTGKDVRQV 647
            FFRSSC QFGFHCKSLSE+KSDESE +GALATVL+VLK+THNMFF E+ D    +DVRQV
Sbjct: 289  FFRSSCHQFGFHCKSLSELKSDESEPEGALATVLEVLKRTHNMFFYESKDNLIDRDVRQV 348

Query: 646  LKSIRKEVLKDCKIVFSRIFPTKFQAENHQLWKMAEQLGATCLTELDPSVTHVVATDAGT 467
            LK++RKE+LK CKIVFSR+FP+KFQAENHQLWKMAEQLGA C TELDPSVTHVV+TDAGT
Sbjct: 349  LKTLRKEILKGCKIVFSRVFPSKFQAENHQLWKMAEQLGAACSTELDPSVTHVVSTDAGT 408

Query: 466  DKSRWAVKEMKFLVHPRWIEAAYYLWQKQPEENFCVNIVK 347
            +KSRWAVKE KFLVHP+WIEA+ Y+W KQ E+ F V   K
Sbjct: 409  EKSRWAVKEKKFLVHPQWIEASNYMWLKQAEDKFPVKQTK 448


>gb|KHG05109.1| RNA polymerase II C-terminal domain phosphatase-like 4 [Gossypium
            arboreum]
          Length = 404

 Score =  560 bits (1444), Expect = e-156
 Identities = 272/370 (73%), Positives = 310/370 (83%)
 Frame = -1

Query: 1450 SQNPCTHPGSFGDMCIRCGQRLEQESGVTFGYIHKGLRLNNDEIVRLRGKDMKNXXXXXX 1271
            +++ C+HPGSFG MCI CGQR++ ES VTFGYIHKGLRL NDEIVRLR  DMKN      
Sbjct: 35   NKDTCSHPGSFGQMCILCGQRVDDESSVTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHKK 94

Query: 1270 XXXXXXXXXXXLNSTLLVHLTSDEEYLKSQTDSLQDISNGSLFMLEAMHMMTKLRPFVRT 1091
                       LNST L HLT++EEYLK Q+DSLQD+S GSLFMLE M MMTKLRPFVRT
Sbjct: 95   LYLVLDLDHTLLNSTQLNHLTAEEEYLKGQSDSLQDVSKGSLFMLEFMQMMTKLRPFVRT 154

Query: 1090 FLKEASKMYELHIYTMGDRAYALAMANLLDPRREYFGERVISRDDGTQKHQKGLDVVLGQ 911
            FLKEAS+M+E++IYTMGDR YAL MA LLDP++EYF  RVISRDDGTQKHQKGLDVVLGQ
Sbjct: 155  FLKEASEMFEMYIYTMGDRPYALEMAKLLDPKKEYFNGRVISRDDGTQKHQKGLDVVLGQ 214

Query: 910  ESAVLILDDTENAWTQHKGNLILMERYHFFRSSCQQFGFHCKSLSEMKSDESELDGALAT 731
            +SAV+ILDDTENAWT+HK NLILMERYHFF SSC+QFGF CKSLS++KSDESE DGALA+
Sbjct: 215  DSAVVILDDTENAWTKHKDNLILMERYHFFASSCRQFGFDCKSLSQLKSDESEPDGALAS 274

Query: 730  VLKVLKQTHNMFFDETSDGCTGKDVRQVLKSIRKEVLKDCKIVFSRIFPTKFQAENHQLW 551
            +LK+L+Q H++FFDE       +DVRQVLK++RKEVLK+CKIVFSR+FPTKFQ ENH LW
Sbjct: 275  ILKILRQIHHIFFDELDSDLASRDVRQVLKTVRKEVLKNCKIVFSRVFPTKFQPENHLLW 334

Query: 550  KMAEQLGATCLTELDPSVTHVVATDAGTDKSRWAVKEMKFLVHPRWIEAAYYLWQKQPEE 371
            KMAEQLGATC TE D SVTH+V+ DAGT+KSRWAVKE KFLVHPRWIEAA + WQKQPEE
Sbjct: 335  KMAEQLGATCSTETDSSVTHIVSMDAGTEKSRWAVKENKFLVHPRWIEAANFFWQKQPEE 394

Query: 370  NFCVNIVKNQ 341
            NF V+  KNQ
Sbjct: 395  NFPVSQTKNQ 404


>ref|XP_007204345.1| hypothetical protein PRUPE_ppa005647mg [Prunus persica]
            gi|462399876|gb|EMJ05544.1| hypothetical protein
            PRUPE_ppa005647mg [Prunus persica]
          Length = 449

 Score =  555 bits (1430), Expect = e-155
 Identities = 276/400 (69%), Positives = 320/400 (80%)
 Frame = -1

Query: 1546 KRRKVENVESMEEPHASIXXXXXXXXXXXEPTSQNPCTHPGSFGDMCIRCGQRLEQESGV 1367
            KRRKVEN+ S++E   S             P  ++ CTHPGS  D+CI CGQR++++SGV
Sbjct: 50   KRRKVENLGSIDETQGSTSQIFVEENSEASP-KKDICTHPGSVKDLCIVCGQRVDEKSGV 108

Query: 1366 TFGYIHKGLRLNNDEIVRLRGKDMKNXXXXXXXXXXXXXXXXXLNSTLLVHLTSDEEYLK 1187
              GYIHK   LNNDEI R+R  D+K                  LNST L H+T++EEYL 
Sbjct: 109  PLGYIHKDFWLNNDEIDRVRSTDIKKSLHLKKLYLVLDLDHTLLNSTHLNHMTAEEEYLH 168

Query: 1186 SQTDSLQDISNGSLFMLEAMHMMTKLRPFVRTFLKEASKMYELHIYTMGDRAYALAMANL 1007
            SQTDSLQD+S+GSLF ++ MHMMTKLRPFVR FLKEAS+M+E++IYTMG+RAYAL MA L
Sbjct: 169  SQTDSLQDVSDGSLFRVDVMHMMTKLRPFVRKFLKEASEMFEMYIYTMGERAYALEMAKL 228

Query: 1006 LDPRREYFGERVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTQHKGNLILMERYH 827
            LDPR+EYFG+RVISRDDGTQKHQKGLDVVLG ESA LILDDTENAWT+HK NLILMERYH
Sbjct: 229  LDPRKEYFGDRVISRDDGTQKHQKGLDVVLGHESAALILDDTENAWTKHKDNLILMERYH 288

Query: 826  FFRSSCQQFGFHCKSLSEMKSDESELDGALATVLKVLKQTHNMFFDETSDGCTGKDVRQV 647
            FFRSSC QFGFHCKSLSE+KSDESE +GALATVL+VLK+ HNMFF E+ D    +DVRQV
Sbjct: 289  FFRSSCHQFGFHCKSLSELKSDESEPEGALATVLEVLKRIHNMFFYESKDNLIDRDVRQV 348

Query: 646  LKSIRKEVLKDCKIVFSRIFPTKFQAENHQLWKMAEQLGATCLTELDPSVTHVVATDAGT 467
            LK++RKE+LK CKIVFSR+FP+KFQAENHQLWKMAEQLGATC TELD SVTHVV+TDAGT
Sbjct: 349  LKTLRKEILKGCKIVFSRVFPSKFQAENHQLWKMAEQLGATCSTELDLSVTHVVSTDAGT 408

Query: 466  DKSRWAVKEMKFLVHPRWIEAAYYLWQKQPEENFCVNIVK 347
            +KSRWAVKE KFLVHP+WIEA+ Y+W KQ E+ F VN  K
Sbjct: 409  EKSRWAVKEKKFLVHPQWIEASNYMWLKQAEDKFPVNQTK 448


>ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa]
            gi|550318538|gb|EEF03112.2| hypothetical protein
            POPTR_0018s11760g [Populus trichocarpa]
          Length = 472

 Score =  552 bits (1422), Expect = e-154
 Identities = 274/402 (68%), Positives = 315/402 (78%)
 Frame = -1

Query: 1549 IKRRKVENVESMEEPHASIXXXXXXXXXXXEPTSQNPCTHPGSFGDMCIRCGQRLEQESG 1370
            +KR KVE VE +E+   +               S+  CTHPGSFG MCI CGQ L+ ESG
Sbjct: 72   VKRSKVETVEIVEDDGGTTSFASLKHNSEAS-ISKEICTHPGSFGTMCIVCGQLLDGESG 130

Query: 1369 VTFGYIHKGLRLNNDEIVRLRGKDMKNXXXXXXXXXXXXXXXXXLNSTLLVHLTSDEEYL 1190
            VTFGYIHKGLRL NDEIVRLR  DMKN                 LNST L+H+T DEEYL
Sbjct: 131  VTFGYIHKGLRLGNDEIVRLRNTDMKNLLRHKKLYLILDLDHTLLNSTQLMHMTLDEEYL 190

Query: 1189 KSQTDSLQDISNGSLFMLEAMHMMTKLRPFVRTFLKEASKMYELHIYTMGDRAYALAMAN 1010
              QTDSLQD+S GSLFML +M MMTKLRPFVRTFLKEAS+M+E++IYTMGDRAYAL MA 
Sbjct: 191  NGQTDSLQDVSKGSLFMLSSMQMMTKLRPFVRTFLKEASQMFEMYIYTMGDRAYALEMAK 250

Query: 1009 LLDPRREYFGERVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTQHKGNLILMERY 830
            LLDP REYF  +VISRDDGTQ+HQKGLDVVLGQESAVLILDDTENAW +HK NLILMERY
Sbjct: 251  LLDPGREYFNAKVISRDDGTQRHQKGLDVVLGQESAVLILDDTENAWMKHKDNLILMERY 310

Query: 829  HFFRSSCQQFGFHCKSLSEMKSDESELDGALATVLKVLKQTHNMFFDETSDGCTGKDVRQ 650
            HFF SSC QFGF+CKSLSE K+DESE +GALA++LKVL++ H +FF+E  +   G+DVRQ
Sbjct: 311  HFFASSCHQFGFNCKSLSEQKTDESESEGALASILKVLRKIHQIFFEELEENMDGRDVRQ 370

Query: 649  VLKSIRKEVLKDCKIVFSRIFPTKFQAENHQLWKMAEQLGATCLTELDPSVTHVVATDAG 470
            VLK++RK+VLK CKIVFSR+FPT+ QA+NH LW+MAEQLGATC TELDPSVTHVV+ D+G
Sbjct: 371  VLKTVRKDVLKGCKIVFSRVFPTQSQADNHHLWRMAEQLGATCSTELDPSVTHVVSKDSG 430

Query: 469  TDKSRWAVKEMKFLVHPRWIEAAYYLWQKQPEENFCVNIVKN 344
            T+KS WA+K  KFLV P WIEAA Y WQ+QPEENF  N +KN
Sbjct: 431  TEKSHWALKHNKFLVQPGWIEAANYFWQRQPEENFSFNQIKN 472


>ref|XP_011656094.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X1 [Cucumis sativus]
          Length = 486

 Score =  548 bits (1413), Expect = e-153
 Identities = 271/401 (67%), Positives = 317/401 (79%)
 Frame = -1

Query: 1549 IKRRKVENVESMEEPHASIXXXXXXXXXXXEPTSQNPCTHPGSFGDMCIRCGQRLEQESG 1370
            IKRRKVE +E+ EE                  + Q  C+HPGSFG+MCI CGQRL++ESG
Sbjct: 83   IKRRKVEKLENSEED----IMHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESG 138

Query: 1369 VTFGYIHKGLRLNNDEIVRLRGKDMKNXXXXXXXXXXXXXXXXXLNSTLLVHLTSDEEYL 1190
            VTFGYIHK LRLNNDEI R+R K+MK                  LNST L +LT +EEYL
Sbjct: 139  VTFGYIHKELRLNNDEINRMRNKEMKELLQRKKLILVLDLDHTLLNSTELRYLTVEEEYL 198

Query: 1189 KSQTDSLQDISNGSLFMLEAMHMMTKLRPFVRTFLKEASKMYELHIYTMGDRAYALAMAN 1010
            +SQTDSL D++ GSLF+L ++H MTKLRPFV +FLKEASK++E++IYTMG+R YA  MA 
Sbjct: 199  RSQTDSLDDVTKGSLFLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAK 258

Query: 1009 LLDPRREYFGERVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTQHKGNLILMERY 830
            LLDP++EYF  +VISRDDGTQKHQKGLDVVLG+ESAVLILDDTENAWT+HK NLILMERY
Sbjct: 259  LLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERY 318

Query: 829  HFFRSSCQQFGFHCKSLSEMKSDESELDGALATVLKVLKQTHNMFFDETSDGCTGKDVRQ 650
            HFF SSC+QFGF+CKSLSE+K+DESE DGAL T+LKVLKQ H+MFF+E S     +DVRQ
Sbjct: 319  HFFASSCRQFGFNCKSLSELKNDESETDGALTTILKVLKQVHHMFFNEVSGDLVDRDVRQ 378

Query: 649  VLKSIRKEVLKDCKIVFSRIFPTKFQAENHQLWKMAEQLGATCLTELDPSVTHVVATDAG 470
            VLK++R EVL+ CK+VFSR+FPTKFQAENHQLWKM EQLG TC TELD SVTHVVATDAG
Sbjct: 379  VLKTVRAEVLEGCKVVFSRVFPTKFQAENHQLWKMVEQLGGTCSTELDQSVTHVVATDAG 438

Query: 469  TDKSRWAVKEMKFLVHPRWIEAAYYLWQKQPEENFCVNIVK 347
            T+KSRWA+KE KFLVHPRWIEA+ Y W++Q EENF V   K
Sbjct: 439  TEKSRWALKEKKFLVHPRWIEASNYFWKRQMEENFTVEQTK 479


>gb|KGN52677.1| hypothetical protein Csa_5G650420 [Cucumis sativus]
          Length = 543

 Score =  548 bits (1413), Expect = e-153
 Identities = 271/401 (67%), Positives = 317/401 (79%)
 Frame = -1

Query: 1549 IKRRKVENVESMEEPHASIXXXXXXXXXXXEPTSQNPCTHPGSFGDMCIRCGQRLEQESG 1370
            IKRRKVE +E+ EE                  + Q  C+HPGSFG+MCI CGQRL++ESG
Sbjct: 140  IKRRKVEKLENSEED----IMHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESG 195

Query: 1369 VTFGYIHKGLRLNNDEIVRLRGKDMKNXXXXXXXXXXXXXXXXXLNSTLLVHLTSDEEYL 1190
            VTFGYIHK LRLNNDEI R+R K+MK                  LNST L +LT +EEYL
Sbjct: 196  VTFGYIHKELRLNNDEINRMRNKEMKELLQRKKLILVLDLDHTLLNSTELRYLTVEEEYL 255

Query: 1189 KSQTDSLQDISNGSLFMLEAMHMMTKLRPFVRTFLKEASKMYELHIYTMGDRAYALAMAN 1010
            +SQTDSL D++ GSLF+L ++H MTKLRPFV +FLKEASK++E++IYTMG+R YA  MA 
Sbjct: 256  RSQTDSLDDVTKGSLFLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAK 315

Query: 1009 LLDPRREYFGERVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTQHKGNLILMERY 830
            LLDP++EYF  +VISRDDGTQKHQKGLDVVLG+ESAVLILDDTENAWT+HK NLILMERY
Sbjct: 316  LLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERY 375

Query: 829  HFFRSSCQQFGFHCKSLSEMKSDESELDGALATVLKVLKQTHNMFFDETSDGCTGKDVRQ 650
            HFF SSC+QFGF+CKSLSE+K+DESE DGAL T+LKVLKQ H+MFF+E S     +DVRQ
Sbjct: 376  HFFASSCRQFGFNCKSLSELKNDESETDGALTTILKVLKQVHHMFFNEVSGDLVDRDVRQ 435

Query: 649  VLKSIRKEVLKDCKIVFSRIFPTKFQAENHQLWKMAEQLGATCLTELDPSVTHVVATDAG 470
            VLK++R EVL+ CK+VFSR+FPTKFQAENHQLWKM EQLG TC TELD SVTHVVATDAG
Sbjct: 436  VLKTVRAEVLEGCKVVFSRVFPTKFQAENHQLWKMVEQLGGTCSTELDQSVTHVVATDAG 495

Query: 469  TDKSRWAVKEMKFLVHPRWIEAAYYLWQKQPEENFCVNIVK 347
            T+KSRWA+KE KFLVHPRWIEA+ Y W++Q EENF V   K
Sbjct: 496  TEKSRWALKEKKFLVHPRWIEASNYFWKRQMEENFTVEQTK 536


>ref|XP_011656095.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X2 [Cucumis sativus]
            gi|778707965|ref|XP_011656096.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 4
            isoform X2 [Cucumis sativus]
          Length = 452

 Score =  548 bits (1413), Expect = e-153
 Identities = 271/401 (67%), Positives = 317/401 (79%)
 Frame = -1

Query: 1549 IKRRKVENVESMEEPHASIXXXXXXXXXXXEPTSQNPCTHPGSFGDMCIRCGQRLEQESG 1370
            IKRRKVE +E+ EE                  + Q  C+HPGSFG+MCI CGQRL++ESG
Sbjct: 49   IKRRKVEKLENSEED----IMHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESG 104

Query: 1369 VTFGYIHKGLRLNNDEIVRLRGKDMKNXXXXXXXXXXXXXXXXXLNSTLLVHLTSDEEYL 1190
            VTFGYIHK LRLNNDEI R+R K+MK                  LNST L +LT +EEYL
Sbjct: 105  VTFGYIHKELRLNNDEINRMRNKEMKELLQRKKLILVLDLDHTLLNSTELRYLTVEEEYL 164

Query: 1189 KSQTDSLQDISNGSLFMLEAMHMMTKLRPFVRTFLKEASKMYELHIYTMGDRAYALAMAN 1010
            +SQTDSL D++ GSLF+L ++H MTKLRPFV +FLKEASK++E++IYTMG+R YA  MA 
Sbjct: 165  RSQTDSLDDVTKGSLFLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAK 224

Query: 1009 LLDPRREYFGERVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTQHKGNLILMERY 830
            LLDP++EYF  +VISRDDGTQKHQKGLDVVLG+ESAVLILDDTENAWT+HK NLILMERY
Sbjct: 225  LLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERY 284

Query: 829  HFFRSSCQQFGFHCKSLSEMKSDESELDGALATVLKVLKQTHNMFFDETSDGCTGKDVRQ 650
            HFF SSC+QFGF+CKSLSE+K+DESE DGAL T+LKVLKQ H+MFF+E S     +DVRQ
Sbjct: 285  HFFASSCRQFGFNCKSLSELKNDESETDGALTTILKVLKQVHHMFFNEVSGDLVDRDVRQ 344

Query: 649  VLKSIRKEVLKDCKIVFSRIFPTKFQAENHQLWKMAEQLGATCLTELDPSVTHVVATDAG 470
            VLK++R EVL+ CK+VFSR+FPTKFQAENHQLWKM EQLG TC TELD SVTHVVATDAG
Sbjct: 345  VLKTVRAEVLEGCKVVFSRVFPTKFQAENHQLWKMVEQLGGTCSTELDQSVTHVVATDAG 404

Query: 469  TDKSRWAVKEMKFLVHPRWIEAAYYLWQKQPEENFCVNIVK 347
            T+KSRWA+KE KFLVHPRWIEA+ Y W++Q EENF V   K
Sbjct: 405  TEKSRWALKEKKFLVHPRWIEASNYFWKRQMEENFTVEQTK 445


>ref|XP_011018018.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            [Populus euphratica]
          Length = 472

 Score =  546 bits (1406), Expect = e-152
 Identities = 272/402 (67%), Positives = 312/402 (77%)
 Frame = -1

Query: 1549 IKRRKVENVESMEEPHASIXXXXXXXXXXXEPTSQNPCTHPGSFGDMCIRCGQRLEQESG 1370
            +KR KVE +E +E+   +               S+  CTHPGSFG MCI CGQ L+ ESG
Sbjct: 72   VKRSKVETLEIVEDDGGAASLASLKHNSEVS-ISKEICTHPGSFGTMCIVCGQLLDGESG 130

Query: 1369 VTFGYIHKGLRLNNDEIVRLRGKDMKNXXXXXXXXXXXXXXXXXLNSTLLVHLTSDEEYL 1190
            VTFGYIHKGLRL NDEIVRLR  DMKN                 LNST L+H+T DEEYL
Sbjct: 131  VTFGYIHKGLRLGNDEIVRLRNTDMKNLLRHKKLYLILDLDHTLLNSTQLMHMTLDEEYL 190

Query: 1189 KSQTDSLQDISNGSLFMLEAMHMMTKLRPFVRTFLKEASKMYELHIYTMGDRAYALAMAN 1010
              QT SLQD+S GSLFML +M MMTKLRPFVRTFLKEAS+M+E++IYTMGDRAYAL MA 
Sbjct: 191  NGQTASLQDVSKGSLFMLSSMQMMTKLRPFVRTFLKEASQMFEMYIYTMGDRAYALEMAK 250

Query: 1009 LLDPRREYFGERVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTQHKGNLILMERY 830
            LLDP REYF  +VISRDDGTQ+HQKGLDVVLGQESAVLILDDTENAW +HK NLILMERY
Sbjct: 251  LLDPGREYFNAKVISRDDGTQRHQKGLDVVLGQESAVLILDDTENAWMKHKDNLILMERY 310

Query: 829  HFFRSSCQQFGFHCKSLSEMKSDESELDGALATVLKVLKQTHNMFFDETSDGCTGKDVRQ 650
            HFF SSC QFGF+CKSLSE  +DESE +GALA++LKVL++ H +FF+E  +   G+DVRQ
Sbjct: 311  HFFASSCHQFGFNCKSLSEQNTDESESEGALASILKVLRKIHQIFFEELEENMDGRDVRQ 370

Query: 649  VLKSIRKEVLKDCKIVFSRIFPTKFQAENHQLWKMAEQLGATCLTELDPSVTHVVATDAG 470
            VLK++RK+VLK CKIVFSR+FPT+ QA NH LW+MAEQLGATC TELDPSVTHVV+ D+G
Sbjct: 371  VLKTVRKDVLKGCKIVFSRVFPTQSQANNHHLWRMAEQLGATCSTELDPSVTHVVSKDSG 430

Query: 469  TDKSRWAVKEMKFLVHPRWIEAAYYLWQKQPEENFCVNIVKN 344
            T+KS WA K  KFLV P WIEAA Y WQ+QPEENF VN +KN
Sbjct: 431  TEKSHWASKHNKFLVQPGWIEAANYFWQRQPEENFSVNQIKN 472


>ref|XP_010048820.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X1 [Eucalyptus grandis]
          Length = 460

 Score =  545 bits (1403), Expect = e-152
 Identities = 270/403 (66%), Positives = 315/403 (78%), Gaps = 5/403 (1%)
 Frame = -1

Query: 1549 IKRRKVENVESMEEPHASIXXXXXXXXXXXEPT-----SQNPCTHPGSFGDMCIRCGQRL 1385
            IKRRK E +E +EEP  S            E        Q+ C+HPGSFG MC+RCG+ L
Sbjct: 53   IKRRKAEKLEILEEPEGSTSHEYSEQILDSEQIVETTIKQDSCSHPGSFGGMCMRCGKSL 112

Query: 1384 EQESGVTFGYIHKGLRLNNDEIVRLRGKDMKNXXXXXXXXXXXXXXXXXLNSTLLVHLTS 1205
            E++SGVTFGYIHKGL L NDEI RLR  DMKN                 LNST L H++S
Sbjct: 113  EEKSGVTFGYIHKGLWLANDEIARLRKTDMKNLLRYKKLYLILDLDHTLLNSTSLAHISS 172

Query: 1204 DEEYLKSQTDSLQDISNGSLFMLEAMHMMTKLRPFVRTFLKEASKMYELHIYTMGDRAYA 1025
            DEE+L+ + DS +D+S GSLF+LE MH MTKLRPFVRTFLKEAS+M+E++IYTMGDR+YA
Sbjct: 173  DEEHLRGKVDSREDVSKGSLFILEHMHTMTKLRPFVRTFLKEASEMFEMYIYTMGDRSYA 232

Query: 1024 LAMANLLDPRREYFGERVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTQHKGNLI 845
            L MA LLDP+REYF ERVISRDDGTQ+HQKGLDVVLGQES VLILDDTE AWT+HK NL+
Sbjct: 233  LEMAKLLDPKREYFHERVISRDDGTQRHQKGLDVVLGQESYVLILDDTEQAWTKHKDNLL 292

Query: 844  LMERYHFFRSSCQQFGFHCKSLSEMKSDESELDGALATVLKVLKQTHNMFFDETSDGCTG 665
            LMERYH+F SSCQQFGF CKSLSEM++DE+E DGALATV+ VLK+ H +FF E  D   G
Sbjct: 293  LMERYHYFASSCQQFGFSCKSLSEMETDENEGDGALATVIGVLKRVHTIFFHELEDELAG 352

Query: 664  KDVRQVLKSIRKEVLKDCKIVFSRIFPTKFQAENHQLWKMAEQLGATCLTELDPSVTHVV 485
            +DVRQVLK++RKEVL+ CKI+FSR+FPT F A  HQLWKMAEQLGATC  +LD SVTHVV
Sbjct: 353  RDVRQVLKTLRKEVLEGCKIIFSRVFPTHFPARQHQLWKMAEQLGATCTVDLDDSVTHVV 412

Query: 484  ATDAGTDKSRWAVKEMKFLVHPRWIEAAYYLWQKQPEENFCVN 356
            ATDAGT+KSRWAVKE K LVHPRWIEA+YY W++QPEE+F V+
Sbjct: 413  ATDAGTEKSRWAVKEKKSLVHPRWIEASYYFWKRQPEESFSVD 455


>ref|XP_010048821.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X2 [Eucalyptus grandis]
            gi|629116538|gb|KCW81213.1| hypothetical protein
            EUGRSUZ_C02587 [Eucalyptus grandis]
          Length = 459

 Score =  545 bits (1403), Expect = e-152
 Identities = 270/403 (66%), Positives = 315/403 (78%), Gaps = 5/403 (1%)
 Frame = -1

Query: 1549 IKRRKVENVESMEEPHASIXXXXXXXXXXXEPT-----SQNPCTHPGSFGDMCIRCGQRL 1385
            IKRRK E +E +EEP  S            E        Q+ C+HPGSFG MC+RCG+ L
Sbjct: 52   IKRRKAEKLEILEEPEGSTSHEYSEQILDSEQIVETTIKQDSCSHPGSFGGMCMRCGKSL 111

Query: 1384 EQESGVTFGYIHKGLRLNNDEIVRLRGKDMKNXXXXXXXXXXXXXXXXXLNSTLLVHLTS 1205
            E++SGVTFGYIHKGL L NDEI RLR  DMKN                 LNST L H++S
Sbjct: 112  EEKSGVTFGYIHKGLWLANDEIARLRKTDMKNLLRYKKLYLILDLDHTLLNSTSLAHISS 171

Query: 1204 DEEYLKSQTDSLQDISNGSLFMLEAMHMMTKLRPFVRTFLKEASKMYELHIYTMGDRAYA 1025
            DEE+L+ + DS +D+S GSLF+LE MH MTKLRPFVRTFLKEAS+M+E++IYTMGDR+YA
Sbjct: 172  DEEHLRGKVDSREDVSKGSLFILEHMHTMTKLRPFVRTFLKEASEMFEMYIYTMGDRSYA 231

Query: 1024 LAMANLLDPRREYFGERVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTQHKGNLI 845
            L MA LLDP+REYF ERVISRDDGTQ+HQKGLDVVLGQES VLILDDTE AWT+HK NL+
Sbjct: 232  LEMAKLLDPKREYFHERVISRDDGTQRHQKGLDVVLGQESYVLILDDTEQAWTKHKDNLL 291

Query: 844  LMERYHFFRSSCQQFGFHCKSLSEMKSDESELDGALATVLKVLKQTHNMFFDETSDGCTG 665
            LMERYH+F SSCQQFGF CKSLSEM++DE+E DGALATV+ VLK+ H +FF E  D   G
Sbjct: 292  LMERYHYFASSCQQFGFSCKSLSEMETDENEGDGALATVIGVLKRVHTIFFHELEDELAG 351

Query: 664  KDVRQVLKSIRKEVLKDCKIVFSRIFPTKFQAENHQLWKMAEQLGATCLTELDPSVTHVV 485
            +DVRQVLK++RKEVL+ CKI+FSR+FPT F A  HQLWKMAEQLGATC  +LD SVTHVV
Sbjct: 352  RDVRQVLKTLRKEVLEGCKIIFSRVFPTHFPARQHQLWKMAEQLGATCTVDLDDSVTHVV 411

Query: 484  ATDAGTDKSRWAVKEMKFLVHPRWIEAAYYLWQKQPEENFCVN 356
            ATDAGT+KSRWAVKE K LVHPRWIEA+YY W++QPEE+F V+
Sbjct: 412  ATDAGTEKSRWAVKEKKSLVHPRWIEASYYFWKRQPEESFSVD 454


>ref|XP_006486243.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like isoform X1 [Citrus sinensis]
            gi|568865772|ref|XP_006486244.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 4-like
            isoform X2 [Citrus sinensis]
            gi|568865774|ref|XP_006486245.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 4-like
            isoform X3 [Citrus sinensis]
          Length = 478

 Score =  544 bits (1401), Expect = e-151
 Identities = 274/406 (67%), Positives = 316/406 (77%)
 Frame = -1

Query: 1549 IKRRKVENVESMEEPHASIXXXXXXXXXXXEPTSQNPCTHPGSFGDMCIRCGQRLEQESG 1370
            IKRRK + VE+++E                     N C HPGS G MC RCG+RLE+ESG
Sbjct: 66   IKRRKTQIVETIQERPGPTLLGNLEEKTEVSLEMDN-CPHPGSLGGMCYRCGKRLEEESG 124

Query: 1369 VTFGYIHKGLRLNNDEIVRLRGKDMKNXXXXXXXXXXXXXXXXXLNSTLLVHLTSDEEYL 1190
            VTF YI KGLRL NDEI RLR  DMK+                 LNSTLL+HLT +E+YL
Sbjct: 125  VTFSYICKGLRLGNDEIDRLRNTDMKHLLRHRKLYLILDLDHTLLNSTLLLHLTPEEDYL 184

Query: 1189 KSQTDSLQDISNGSLFMLEAMHMMTKLRPFVRTFLKEASKMYELHIYTMGDRAYALAMAN 1010
            KSQ DSLQD+S GSLFML  M+MMTKLRPFV TFLKEAS+M+E++IYTMGDR YAL MA 
Sbjct: 185  KSQADSLQDVSKGSLFMLAFMNMMTKLRPFVHTFLKEASEMFEMYIYTMGDRPYALEMAK 244

Query: 1009 LLDPRREYFGERVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTQHKGNLILMERY 830
            LLDP REYF  RVISRDDGTQ+HQKGLDVVLGQESAVLILDDTENAWT+H+ NLILMERY
Sbjct: 245  LLDPSREYFNARVISRDDGTQRHQKGLDVVLGQESAVLILDDTENAWTKHRDNLILMERY 304

Query: 829  HFFRSSCQQFGFHCKSLSEMKSDESELDGALATVLKVLKQTHNMFFDETSDGCTGKDVRQ 650
            HFF SSC+QFG+HC+SLS+++SDESEL+GALA+VLKVLK+ HN+FFDE ++   G+DVRQ
Sbjct: 305  HFFASSCRQFGYHCQSLSQLRSDESELEGALASVLKVLKRIHNIFFDELANDLAGRDVRQ 364

Query: 649  VLKSIRKEVLKDCKIVFSRIFPTKFQAENHQLWKMAEQLGATCLTELDPSVTHVVATDAG 470
            VLK +R EVLK CK+VFS +FPTKF A+ H LWKMAEQLGATCL ELDPSVTHVV+TDA 
Sbjct: 365  VLKMVRGEVLKGCKLVFSHVFPTKFPADTHYLWKMAEQLGATCLIELDPSVTHVVSTDAR 424

Query: 469  TDKSRWAVKEMKFLVHPRWIEAAYYLWQKQPEENFCVNIVKNQ*SF 332
            T+KSRWA KE KFLV PRWIE A +LWQ+QPEENF V   K + +F
Sbjct: 425  TEKSRWAAKEAKFLVDPRWIETANFLWQRQPEENFPVKQNKPEENF 470


>ref|XP_007014446.1| RNA polymerase II ctd phosphatase, putative isoform 2 [Theobroma
            cacao] gi|508784809|gb|EOY32065.1| RNA polymerase II ctd
            phosphatase, putative isoform 2 [Theobroma cacao]
          Length = 357

 Score =  542 bits (1397), Expect = e-151
 Identities = 268/357 (75%), Positives = 297/357 (83%)
 Frame = -1

Query: 1411 MCIRCGQRLEQESGVTFGYIHKGLRLNNDEIVRLRGKDMKNXXXXXXXXXXXXXXXXXLN 1232
            MCI CGQRL+ ESGVTFGYIHKGLRL NDEIVRLR  DMKN                 LN
Sbjct: 1    MCILCGQRLDDESGVTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHKKLYLVLDLDHTLLN 60

Query: 1231 STLLVHLTSDEEYLKSQTDSLQDISNGSLFMLEAMHMMTKLRPFVRTFLKEASKMYELHI 1052
            ST L+HLT DEEYLK Q+DSLQD+S GSLFML+ MHMMTKLRPFVRTFLKEAS+M+E++I
Sbjct: 61   STQLMHLTPDEEYLKGQSDSLQDVSRGSLFMLDFMHMMTKLRPFVRTFLKEASEMFEMYI 120

Query: 1051 YTMGDRAYALAMANLLDPRREYFGERVISRDDGTQKHQKGLDVVLGQESAVLILDDTENA 872
            YTMGDR YAL MA LLDPRREYF +RVISRDDGTQKHQKGLDVVLGQESAV+ILDDTENA
Sbjct: 121  YTMGDRPYALEMAKLLDPRREYFSDRVISRDDGTQKHQKGLDVVLGQESAVVILDDTENA 180

Query: 871  WTQHKGNLILMERYHFFRSSCQQFGFHCKSLSEMKSDESELDGALATVLKVLKQTHNMFF 692
            W +HK NLILMERYH+F SSC QFG+ CKSLS++KSDESE DGALA+VLK L+Q H+MFF
Sbjct: 181  WMKHKDNLILMERYHYFASSCHQFGYKCKSLSQLKSDESEPDGALASVLKALRQIHHMFF 240

Query: 691  DETSDGCTGKDVRQVLKSIRKEVLKDCKIVFSRIFPTKFQAENHQLWKMAEQLGATCLTE 512
            DE       +DVRQVLK++++EVLK CKIVFS +FPT F AE+H LWKMAEQLGATC TE
Sbjct: 241  DELDCNLASRDVRQVLKTVQEEVLKGCKIVFSHVFPTNFPAESHPLWKMAEQLGATCSTE 300

Query: 511  LDPSVTHVVATDAGTDKSRWAVKEMKFLVHPRWIEAAYYLWQKQPEENFCVNIVKNQ 341
             D SVTHVV+TDAGT+KSRWAVKE KFLVHPRWIEA  YLWQKQPEENF V+  KNQ
Sbjct: 301  TDLSVTHVVSTDAGTEKSRWAVKEKKFLVHPRWIEATNYLWQKQPEENFPVSQGKNQ 357


>gb|KDO68848.1| hypothetical protein CISIN_1g041302mg [Citrus sinensis]
          Length = 484

 Score =  541 bits (1395), Expect = e-151
 Identities = 272/411 (66%), Positives = 317/411 (77%), Gaps = 5/411 (1%)
 Frame = -1

Query: 1549 IKRRKVENVESMEEPHA-----SIXXXXXXXXXXXEPTSQNPCTHPGSFGDMCIRCGQRL 1385
            IKRRK + VE+++E        ++                + C HPGS G MC RCG+RL
Sbjct: 66   IKRRKTQIVETIQERPGPTLLGNLEEKTDMLYCAEVSLEMDNCPHPGSLGGMCYRCGKRL 125

Query: 1384 EQESGVTFGYIHKGLRLNNDEIVRLRGKDMKNXXXXXXXXXXXXXXXXXLNSTLLVHLTS 1205
            E+ESGVTF YI KGLRL NDEI RLR  DMK+                 LNSTLL+HLT 
Sbjct: 126  EEESGVTFSYICKGLRLGNDEIDRLRNTDMKHLLRHRKLYLILDLDHTLLNSTLLLHLTP 185

Query: 1204 DEEYLKSQTDSLQDISNGSLFMLEAMHMMTKLRPFVRTFLKEASKMYELHIYTMGDRAYA 1025
            +E+YLKSQ DSLQD+S GSLFML  M+MMTKLRPFV TFLKEAS+M+E++IYTMGDR YA
Sbjct: 186  EEDYLKSQADSLQDVSKGSLFMLAFMNMMTKLRPFVHTFLKEASEMFEMYIYTMGDRPYA 245

Query: 1024 LAMANLLDPRREYFGERVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTQHKGNLI 845
            L MA LLDP REYF  RVISRDDGTQ+HQKGLDVVLGQESAVLILDDTENAWT+H+ NLI
Sbjct: 246  LEMAKLLDPSREYFNARVISRDDGTQRHQKGLDVVLGQESAVLILDDTENAWTKHRDNLI 305

Query: 844  LMERYHFFRSSCQQFGFHCKSLSEMKSDESELDGALATVLKVLKQTHNMFFDETSDGCTG 665
            LMERYHFF SSC+QFG+HC+SLS+++SDESEL+GALA+VLKVLK+ HN+FFDE ++   G
Sbjct: 306  LMERYHFFASSCRQFGYHCQSLSQLRSDESELEGALASVLKVLKRIHNIFFDELANDLAG 365

Query: 664  KDVRQVLKSIRKEVLKDCKIVFSRIFPTKFQAENHQLWKMAEQLGATCLTELDPSVTHVV 485
            +DVRQVLK +R EVLK CK+VFS +FPTKF A+ H LWKMAEQLGATC  ELDPSVTHVV
Sbjct: 366  RDVRQVLKMVRGEVLKGCKLVFSHVFPTKFPADTHYLWKMAEQLGATCSIELDPSVTHVV 425

Query: 484  ATDAGTDKSRWAVKEMKFLVHPRWIEAAYYLWQKQPEENFCVNIVKNQ*SF 332
            +TDA T+KSRWA KE KFLV PRWIE A +LWQ+QPEENF V   K + +F
Sbjct: 426  STDARTEKSRWAAKEAKFLVDPRWIETANFLWQRQPEENFPVQQTKPEENF 476

