BLASTX nr result

ID: Angelica27_contig00003880 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica27_contig00003880
         (1662 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_017225547.1 PREDICTED: RNA polymerase II C-terminal domain ph...   739   0.0  
XP_010645384.1 PREDICTED: RNA polymerase II C-terminal domain ph...   575   0.0  
XP_010647279.1 PREDICTED: RNA polymerase II C-terminal domain ph...   574   0.0  
CDP10217.1 unnamed protein product [Coffea canephora]                 556   0.0  
XP_019234536.1 PREDICTED: RNA polymerase II C-terminal domain ph...   555   0.0  
OIT26683.1 rna polymerase ii c-terminal domain phosphatase-like ...   555   0.0  
XP_009776171.1 PREDICTED: RNA polymerase II C-terminal domain ph...   549   0.0  
KVH97632.1 BRCT domain-containing protein [Cynara cardunculus va...   547   0.0  
XP_011079425.1 PREDICTED: RNA polymerase II C-terminal domain ph...   547   0.0  
XP_012481530.1 PREDICTED: RNA polymerase II C-terminal domain ph...   543   0.0  
XP_011078409.1 PREDICTED: RNA polymerase II C-terminal domain ph...   545   0.0  
XP_016727412.1 PREDICTED: RNA polymerase II C-terminal domain ph...   544   0.0  
XP_012481529.1 PREDICTED: RNA polymerase II C-terminal domain ph...   543   0.0  
XP_016468745.1 PREDICTED: RNA polymerase II C-terminal domain ph...   543   0.0  
KHG05109.1 RNA polymerase II C-terminal domain phosphatase-like ...   539   0.0  
XP_007014446.2 PREDICTED: RNA polymerase II C-terminal domain ph...   540   0.0  
XP_019163218.1 PREDICTED: RNA polymerase II C-terminal domain ph...   540   0.0  
XP_017631987.1 PREDICTED: RNA polymerase II C-terminal domain ph...   539   0.0  
XP_016714083.1 PREDICTED: RNA polymerase II C-terminal domain ph...   538   0.0  
KJB27893.1 hypothetical protein B456_005G016300 [Gossypium raimo...   537   0.0  

>XP_017225547.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            [Daucus carota subsp. sativus]
          Length = 462

 Score =  739 bits (1908), Expect = 0.0
 Identities = 374/445 (84%), Positives = 383/445 (86%), Gaps = 3/445 (0%)
 Frame = -2

Query: 1535 FASFLDAELDSTSDTSPXXXXXXXXXXXXXXXXXE---LFSTXXXXXXXXXXXVDPYGST 1365
            FASFLDAELDS SDTSP                     LFST           VD YGST
Sbjct: 18   FASFLDAELDSASDTSPEPGDEDDENENDENENDYDSELFSTKKQKVELSDKAVDSYGST 77

Query: 1364 SRGVEKKLEASIKEDICTHPGVIGGMCIRCGQKTDEGQSGVAFGYIHKDLRLANDEIARL 1185
            S G   KLE SI+EDICTHPGVIGGMCIRCGQKTD  QSGVAFGYIHKDLRLANDEIARL
Sbjct: 78   SSGTGTKLEVSIEEDICTHPGVIGGMCIRCGQKTDGEQSGVAFGYIHKDLRLANDEIARL 137

Query: 1184 RNNDLKNLFRHKKXXXXXXXXXXXXNSTQFRHITPEEEYLRMPPDSLPDALKGNLFRLDF 1005
            RNNDLKNLFRHKK            NSTQFRHI PEEEYL++PPDSLPDALKGNLFRLDF
Sbjct: 138  RNNDLKNLFRHKKLNLVLDLDHTLLNSTQFRHIMPEEEYLKVPPDSLPDALKGNLFRLDF 197

Query: 1004 MHMMTKLRPFVRTFLKEASKLFEMYIYTMGERAYALEMAKLLDPENIYFNSKVIAQGDCT 825
            MHMMTKLRPFVRTFLKEASKLFEMYIYTMGERAYA+EMAKLLDPENIYFNSKVIAQGDCT
Sbjct: 198  MHMMTKLRPFVRTFLKEASKLFEMYIYTMGERAYAVEMAKLLDPENIYFNSKVIAQGDCT 257

Query: 824  QRHQKGLDVVVGQDSAVLILDDTEQVWSKHKENLILMERYHYFVSSYRQFGFNCKSRSEL 645
            QRHQKGLDVVVGQDSAVLILDDTEQVW+KHKENLILMERYHYFVSSYRQFGFNCKSRSEL
Sbjct: 258  QRHQKGLDVVVGQDSAVLILDDTEQVWAKHKENLILMERYHYFVSSYRQFGFNCKSRSEL 317

Query: 644  KCDESEDDGALATVLEVLKRVHSTFFDTEQGADLMKKDVRQVLKIVRNKVLKGCKLVFTR 465
            KCDESE+DGALATVLEVLKRVHS FFD EQGAD+ KKDVRQVLK VR +VLKGCKLVFTR
Sbjct: 318  KCDESEEDGALATVLEVLKRVHSIFFDPEQGADITKKDVRQVLKTVRKEVLKGCKLVFTR 377

Query: 464  VFPAKFPAESHHLWKMAEQLGATCSTEMDPSVTHVVSMDKGTEKSRWAVRENKFLVHPGW 285
            VFPAKFPAESHHLWKMAEQLGATCS E+DPSVTHVVSMDKGTEKSRWAVRENKFLVHPGW
Sbjct: 378  VFPAKFPAESHHLWKMAEQLGATCSREVDPSVTHVVSMDKGTEKSRWAVRENKFLVHPGW 437

Query: 284  IEAANYLWRKQPEENFPVDEVKQTK 210
            IEAANYLWRKQ EENFPVDE KQTK
Sbjct: 438  IEAANYLWRKQAEENFPVDEAKQTK 462


>XP_010645384.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            [Vitis vinifera]
          Length = 458

 Score =  575 bits (1482), Expect = 0.0
 Identities = 294/441 (66%), Positives = 338/441 (76%), Gaps = 2/441 (0%)
 Frame = -2

Query: 1535 FASFLDAELDS-TSDTSPXXXXXXXXXXXXXXXXXELFSTXXXXXXXXXXXVDPYGSTSR 1359
            FA++LDAELDS +SD SP                 E                +  GSTS 
Sbjct: 17   FAAYLDAELDSDSSDVSPEQEAEDDEQEAEDESDSEYKRVKRQKVEEFESIEEHPGSTSD 76

Query: 1358 G-VEKKLEASIKEDICTHPGVIGGMCIRCGQKTDEGQSGVAFGYIHKDLRLANDEIARLR 1182
            G +E+ LE +I +D CTHPGV   +CIRCGQK  EG SGVAFGYIHKDLRL +DEIARLR
Sbjct: 77   GSLEQNLEVTITKDTCTHPGVFRELCIRCGQKM-EGGSGVAFGYIHKDLRLGSDEIARLR 135

Query: 1181 NNDLKNLFRHKKXXXXXXXXXXXXNSTQFRHITPEEEYLRMPPDSLPDALKGNLFRLDFM 1002
            + DLKNL RHKK            NST+   ITPEE YL+   D L   LKGNLF L+ M
Sbjct: 136  DTDLKNLLRHKKLYLVLDLDHTLLNSTRLLDITPEELYLKNQTDPLQGGLKGNLFMLNTM 195

Query: 1001 HMMTKLRPFVRTFLKEASKLFEMYIYTMGERAYALEMAKLLDPENIYFNSKVIAQGDCTQ 822
            HM+TKLRP+V TFLKEASK+FEMYIYTMGER+YALEMAKLLDPE +YF+S+VI+Q DCTQ
Sbjct: 196  HMLTKLRPYVHTFLKEASKMFEMYIYTMGERSYALEMAKLLDPERVYFSSRVISQADCTQ 255

Query: 821  RHQKGLDVVVGQDSAVLILDDTEQVWSKHKENLILMERYHYFVSSYRQFGFNCKSRSELK 642
            RHQKGLDVV+GQ+SAVLILDDTE VW KHK+NLILMERYH+F SS RQFGFNCKS SELK
Sbjct: 256  RHQKGLDVVLGQESAVLILDDTESVWQKHKDNLILMERYHFFASSCRQFGFNCKSLSELK 315

Query: 641  CDESEDDGALATVLEVLKRVHSTFFDTEQGADLMKKDVRQVLKIVRNKVLKGCKLVFTRV 462
             DESE DGALATVL+VL+R+HS FFD E G D   +DVRQV+K VR +VLKGCK+VF+RV
Sbjct: 316  SDESEPDGALATVLKVLQRIHSMFFDPELGDDFSGRDVRQVVKRVRKEVLKGCKIVFSRV 375

Query: 461  FPAKFPAESHHLWKMAEQLGATCSTEMDPSVTHVVSMDKGTEKSRWAVRENKFLVHPGWI 282
            FP +F AE+HHLW+MAEQLGATC+TE+DPSVTHVVS D GTEKSRWA++E KFLVHPGWI
Sbjct: 376  FPTRFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEKSRWALQEKKFLVHPGWI 435

Query: 281  EAANYLWRKQPEENFPVDEVK 219
            EAANY W+KQPEENFPV++ K
Sbjct: 436  EAANYFWQKQPEENFPVNQKK 456


>XP_010647279.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X1 [Vitis vinifera]
          Length = 466

 Score =  574 bits (1480), Expect = 0.0
 Identities = 294/441 (66%), Positives = 337/441 (76%), Gaps = 2/441 (0%)
 Frame = -2

Query: 1535 FASFLDAELDS-TSDTSPXXXXXXXXXXXXXXXXXELFSTXXXXXXXXXXXVDPYGSTSR 1359
            FA++LDAELDS +SD SP                 E                +  GSTS 
Sbjct: 25   FAAYLDAELDSDSSDVSPEQEAEDDEQEAEDESDSEYKRVKRQKVEEFESIEEHPGSTSD 84

Query: 1358 G-VEKKLEASIKEDICTHPGVIGGMCIRCGQKTDEGQSGVAFGYIHKDLRLANDEIARLR 1182
            G +E+ LE +I +D CTHPGV   +CIRCGQK  EG SGVAFGYIHKDLRL +DEIARLR
Sbjct: 85   GSLEQNLEVTITKDTCTHPGVFRELCIRCGQKM-EGGSGVAFGYIHKDLRLGSDEIARLR 143

Query: 1181 NNDLKNLFRHKKXXXXXXXXXXXXNSTQFRHITPEEEYLRMPPDSLPDALKGNLFRLDFM 1002
            + DLKNL RHKK            NST+   ITPEE YL+   D L   LKGNLF L+ M
Sbjct: 144  DTDLKNLLRHKKLYLVLDLDHTLLNSTRLLDITPEELYLKNQTDPLQGGLKGNLFMLNTM 203

Query: 1001 HMMTKLRPFVRTFLKEASKLFEMYIYTMGERAYALEMAKLLDPENIYFNSKVIAQGDCTQ 822
            HM+TKLRP+V TFLKEASK+FEMYIYTMGER+YALEMAKLLDPE +YF+S+VI+Q DCTQ
Sbjct: 204  HMLTKLRPYVHTFLKEASKMFEMYIYTMGERSYALEMAKLLDPERVYFSSRVISQADCTQ 263

Query: 821  RHQKGLDVVVGQDSAVLILDDTEQVWSKHKENLILMERYHYFVSSYRQFGFNCKSRSELK 642
            RHQKGLDVV+GQ+SAVLILDDTE VW KHK+NLILMERYH+F SS RQFGFNCKS SELK
Sbjct: 264  RHQKGLDVVLGQESAVLILDDTESVWQKHKDNLILMERYHFFASSCRQFGFNCKSLSELK 323

Query: 641  CDESEDDGALATVLEVLKRVHSTFFDTEQGADLMKKDVRQVLKIVRNKVLKGCKLVFTRV 462
             DESE DGALATVL+VL+R+HS FFD E G D   +DVRQV+K VR  VLKGCK+VF+RV
Sbjct: 324  SDESEPDGALATVLKVLQRIHSMFFDPELGDDFSGRDVRQVVKRVRKDVLKGCKIVFSRV 383

Query: 461  FPAKFPAESHHLWKMAEQLGATCSTEMDPSVTHVVSMDKGTEKSRWAVRENKFLVHPGWI 282
            FP +F AE+HHLW+MAEQLGATC+TE+DPSVTHVVS D GTEKSRWA++E KFLVHPGWI
Sbjct: 384  FPTRFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEKSRWALQEKKFLVHPGWI 443

Query: 281  EAANYLWRKQPEENFPVDEVK 219
            EAANY W+KQPEENFPV++ K
Sbjct: 444  EAANYFWQKQPEENFPVNQKK 464


>CDP10217.1 unnamed protein product [Coffea canephora]
          Length = 469

 Score =  556 bits (1432), Expect = 0.0
 Identities = 287/450 (63%), Positives = 336/450 (74%), Gaps = 11/450 (2%)
 Frame = -2

Query: 1535 FASFLDAELDSTSDTSPXXXXXXXXXXXXXXXXXELFSTXXXXXXXXXXXV--------- 1383
            FA+FLDAELDS SD SP                 +   T                     
Sbjct: 19   FAAFLDAELDSASDASPHPEEAEEEVVEEEEAENKGGDTDDYDLDSEKIKRRKVEILESS 78

Query: 1382 -DPYGSTSRGVEKKLE-ASIKEDICTHPGVIGGMCIRCGQKTDEGQSGVAFGYIHKDLRL 1209
             D    TS+ VE +   AS  +D+C+HPGVIGG+CIRCGQK D+ +SGVAF YIHK+LRL
Sbjct: 79   LDVEAMTSQEVEIQTSGASSDKDVCSHPGVIGGLCIRCGQKMDD-ESGVAFSYIHKNLRL 137

Query: 1208 ANDEIARLRNNDLKNLFRHKKXXXXXXXXXXXXNSTQFRHITPEEEYLRMPPDSLPDALK 1029
            ANDEIARLR+ DLKNL R KK            NS++F  +T +E YL+   D L DALK
Sbjct: 138  ANDEIARLRDKDLKNLLRKKKLYLVLDLDHTLLNSSRFLDLTVDEGYLKGSRDDLSDALK 197

Query: 1028 GNLFRLDFMHMMTKLRPFVRTFLKEASKLFEMYIYTMGERAYALEMAKLLDPENIYFNSK 849
             +L++LD+MHMMTKLRPFV +FLKEAS LFEMYIYTMGERAYAL+MAKLLDPE++YFNS+
Sbjct: 198  NSLYKLDYMHMMTKLRPFVHSFLKEASDLFEMYIYTMGERAYALQMAKLLDPEDVYFNSR 257

Query: 848  VIAQGDCTQRHQKGLDVVVGQDSAVLILDDTEQVWSKHKENLILMERYHYFVSSYRQFGF 669
            VIAQGDCTQRHQKGLD+V+GQ+SAVLILDDTE VW KHKENLILMERYH+F SS RQFGF
Sbjct: 258  VIAQGDCTQRHQKGLDIVLGQESAVLILDDTEAVWGKHKENLILMERYHFFASSCRQFGF 317

Query: 668  NCKSRSELKCDESEDDGALATVLEVLKRVHSTFFDTEQGADLMKKDVRQVLKIVRNKVLK 489
              KS SE K DESE +GALATVL VL+++HSTFFDTE  A L+ +DVRQVL  VR +VLK
Sbjct: 318  GSKSLSERKTDESESEGALATVLRVLQQIHSTFFDTEHSASLVDRDVRQVLITVRKEVLK 377

Query: 488  GCKLVFTRVFPAKFPAESHHLWKMAEQLGATCSTEMDPSVTHVVSMDKGTEKSRWAVREN 309
            GCK+VFTRVFP +F  E+HHLWKMAE+LGA CS+E+DPSVTHVVS+D GTEKS WAV+E 
Sbjct: 378  GCKVVFTRVFPTQFQGENHHLWKMAERLGAICSSEVDPSVTHVVSLDPGTEKSIWAVQEG 437

Query: 308  KFLVHPGWIEAANYLWRKQPEENFPVDEVK 219
            K+LVHP WIEAANYLW+KQPEE++PV   K
Sbjct: 438  KYLVHPRWIEAANYLWKKQPEESYPVSNPK 467


>XP_019234536.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            [Nicotiana attenuata]
          Length = 473

 Score =  555 bits (1429), Expect = 0.0
 Identities = 276/384 (71%), Positives = 317/384 (82%), Gaps = 1/384 (0%)
 Frame = -2

Query: 1379 PYGSTSRGVEKKLE-ASIKEDICTHPGVIGGMCIRCGQKTDEGQSGVAFGYIHKDLRLAN 1203
            P  S SRG   +   AS+  DIC+HPGV+GGMCIRCGQK  E +SGVAFGYIHK+LRLA+
Sbjct: 89   PQSSASRGEPAETSGASLALDICSHPGVMGGMCIRCGQKV-ENESGVAFGYIHKNLRLAD 147

Query: 1202 DEIARLRNNDLKNLFRHKKXXXXXXXXXXXXNSTQFRHITPEEEYLRMPPDSLPDALKGN 1023
            DEIARLR+ DLKNL RHKK            NST+   I+ EE YL+   + LPDAL+ N
Sbjct: 148  DEIARLRDKDLKNLLRHKKLYLVLDLDHTLLNSTRLADISAEELYLKDQREVLPDALRSN 207

Query: 1022 LFRLDFMHMMTKLRPFVRTFLKEASKLFEMYIYTMGERAYALEMAKLLDPENIYFNSKVI 843
            LF+LD++HMMTKLRPFV TFLKEAS LFEMYIYTMGER YALEMA LLDP  IYF+S+VI
Sbjct: 208  LFKLDWIHMMTKLRPFVHTFLKEASSLFEMYIYTMGERPYALEMADLLDPGGIYFHSRVI 267

Query: 842  AQGDCTQRHQKGLDVVVGQDSAVLILDDTEQVWSKHKENLILMERYHYFVSSYRQFGFNC 663
            AQGDCTQRHQKGLDVVVGQ+SAVLILDDTE VW KHKENLILMERYH+F SS RQFG  C
Sbjct: 268  AQGDCTQRHQKGLDVVVGQESAVLILDDTEAVWGKHKENLILMERYHFFTSSCRQFGLKC 327

Query: 662  KSRSELKCDESEDDGALATVLEVLKRVHSTFFDTEQGADLMKKDVRQVLKIVRNKVLKGC 483
            KS SE K DE+E +GALA+VL+VL+++HS FFD E+  ++M++DVRQVLK VR ++LKGC
Sbjct: 328  KSLSETKSDENEAEGALASVLKVLQQIHSLFFDPERRDNIMERDVRQVLKQVRKEILKGC 387

Query: 482  KLVFTRVFPAKFPAESHHLWKMAEQLGATCSTEMDPSVTHVVSMDKGTEKSRWAVRENKF 303
            K+VFTRVFP +F AE+HHLWK+AEQLGATCSTE+D SVTHVVSMD GT+KSRWAV+E KF
Sbjct: 388  KIVFTRVFPTQFQAENHHLWKLAEQLGATCSTEVDQSVTHVVSMDAGTDKSRWAVKEKKF 447

Query: 302  LVHPGWIEAANYLWRKQPEENFPV 231
            LVHP WIEAANYLWRK PEENFPV
Sbjct: 448  LVHPRWIEAANYLWRKPPEENFPV 471


>OIT26683.1 rna polymerase ii c-terminal domain phosphatase-like 4 [Nicotiana
            attenuata]
          Length = 478

 Score =  555 bits (1429), Expect = 0.0
 Identities = 276/384 (71%), Positives = 317/384 (82%), Gaps = 1/384 (0%)
 Frame = -2

Query: 1379 PYGSTSRGVEKKLE-ASIKEDICTHPGVIGGMCIRCGQKTDEGQSGVAFGYIHKDLRLAN 1203
            P  S SRG   +   AS+  DIC+HPGV+GGMCIRCGQK  E +SGVAFGYIHK+LRLA+
Sbjct: 89   PQSSASRGEPAETSGASLALDICSHPGVMGGMCIRCGQKV-ENESGVAFGYIHKNLRLAD 147

Query: 1202 DEIARLRNNDLKNLFRHKKXXXXXXXXXXXXNSTQFRHITPEEEYLRMPPDSLPDALKGN 1023
            DEIARLR+ DLKNL RHKK            NST+   I+ EE YL+   + LPDAL+ N
Sbjct: 148  DEIARLRDKDLKNLLRHKKLYLVLDLDHTLLNSTRLADISAEELYLKDQREVLPDALRSN 207

Query: 1022 LFRLDFMHMMTKLRPFVRTFLKEASKLFEMYIYTMGERAYALEMAKLLDPENIYFNSKVI 843
            LF+LD++HMMTKLRPFV TFLKEAS LFEMYIYTMGER YALEMA LLDP  IYF+S+VI
Sbjct: 208  LFKLDWIHMMTKLRPFVHTFLKEASSLFEMYIYTMGERPYALEMADLLDPGGIYFHSRVI 267

Query: 842  AQGDCTQRHQKGLDVVVGQDSAVLILDDTEQVWSKHKENLILMERYHYFVSSYRQFGFNC 663
            AQGDCTQRHQKGLDVVVGQ+SAVLILDDTE VW KHKENLILMERYH+F SS RQFG  C
Sbjct: 268  AQGDCTQRHQKGLDVVVGQESAVLILDDTEAVWGKHKENLILMERYHFFTSSCRQFGLKC 327

Query: 662  KSRSELKCDESEDDGALATVLEVLKRVHSTFFDTEQGADLMKKDVRQVLKIVRNKVLKGC 483
            KS SE K DE+E +GALA+VL+VL+++HS FFD E+  ++M++DVRQVLK VR ++LKGC
Sbjct: 328  KSLSETKSDENEAEGALASVLKVLQQIHSLFFDPERRDNIMERDVRQVLKQVRKEILKGC 387

Query: 482  KLVFTRVFPAKFPAESHHLWKMAEQLGATCSTEMDPSVTHVVSMDKGTEKSRWAVRENKF 303
            K+VFTRVFP +F AE+HHLWK+AEQLGATCSTE+D SVTHVVSMD GT+KSRWAV+E KF
Sbjct: 388  KIVFTRVFPTQFQAENHHLWKLAEQLGATCSTEVDQSVTHVVSMDAGTDKSRWAVKEKKF 447

Query: 302  LVHPGWIEAANYLWRKQPEENFPV 231
            LVHP WIEAANYLWRK PEENFPV
Sbjct: 448  LVHPRWIEAANYLWRKPPEENFPV 471


>XP_009776171.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            [Nicotiana sylvestris]
          Length = 473

 Score =  549 bits (1414), Expect = 0.0
 Identities = 274/384 (71%), Positives = 315/384 (82%), Gaps = 1/384 (0%)
 Frame = -2

Query: 1379 PYGSTSRGVEKKLE-ASIKEDICTHPGVIGGMCIRCGQKTDEGQSGVAFGYIHKDLRLAN 1203
            P  S SRG   +   AS+  DIC+HPGV+GGMCIRCGQK  E +SGVAFGYIHK+LRLA+
Sbjct: 89   PQSSASRGEPAETSGASLALDICSHPGVMGGMCIRCGQKV-ENESGVAFGYIHKNLRLAD 147

Query: 1202 DEIARLRNNDLKNLFRHKKXXXXXXXXXXXXNSTQFRHITPEEEYLRMPPDSLPDALKGN 1023
            DEIARLR+ DLKNL RHKK            NST+   I+ EE YL+   + LPDAL+ N
Sbjct: 148  DEIARLRDKDLKNLLRHKKLYLVLDLDHTLLNSTRLADISAEELYLKDQREVLPDALRSN 207

Query: 1022 LFRLDFMHMMTKLRPFVRTFLKEASKLFEMYIYTMGERAYALEMAKLLDPENIYFNSKVI 843
            LF+LD++HMMTKLRPFV TFLKEAS LFEMYIYTMGER YALEMA LLDP  IYF+S+VI
Sbjct: 208  LFKLDWIHMMTKLRPFVHTFLKEASSLFEMYIYTMGERPYALEMASLLDPGGIYFHSRVI 267

Query: 842  AQGDCTQRHQKGLDVVVGQDSAVLILDDTEQVWSKHKENLILMERYHYFVSSYRQFGFNC 663
            AQGDCTQRHQKGLDVVVGQ+SAVLILDDTE VW KHKENLILMERYH+F SS RQFG  C
Sbjct: 268  AQGDCTQRHQKGLDVVVGQESAVLILDDTEAVWGKHKENLILMERYHFFTSSCRQFGLKC 327

Query: 662  KSRSELKCDESEDDGALATVLEVLKRVHSTFFDTEQGADLMKKDVRQVLKIVRNKVLKGC 483
            KS S  K DE+E +GALA+VL+VL+++HS FFD E+  ++M++DVRQVLK VR ++LKGC
Sbjct: 328  KSLSATKSDENEAEGALASVLKVLQQIHSLFFDPERRDNIMERDVRQVLKQVRKEILKGC 387

Query: 482  KLVFTRVFPAKFPAESHHLWKMAEQLGATCSTEMDPSVTHVVSMDKGTEKSRWAVRENKF 303
            K+VFTRVFP +F AE+HHLWK+AEQLGATCSTE+D SVTHVVSMD GT+KSRWAV+E KF
Sbjct: 388  KIVFTRVFPTQFQAENHHLWKLAEQLGATCSTEVDQSVTHVVSMDAGTDKSRWAVKEKKF 447

Query: 302  LVHPGWIEAANYLWRKQPEENFPV 231
            LVHP WIEAANYLWRK  EENFPV
Sbjct: 448  LVHPRWIEAANYLWRKPLEENFPV 471


>KVH97632.1 BRCT domain-containing protein [Cynara cardunculus var. scolymus]
          Length = 439

 Score =  547 bits (1409), Expect = 0.0
 Identities = 286/442 (64%), Positives = 324/442 (73%)
 Frame = -2

Query: 1535 FASFLDAELDSTSDTSPXXXXXXXXXXXXXXXXXELFSTXXXXXXXXXXXVDPYGSTSRG 1356
            FASFLD ELDSTSDTSP                 +                 P   T+  
Sbjct: 17   FASFLDTELDSTSDTSPEPEEEANETYHSDGNRTKRQKIEVLESVTDANDSTPQHETT-- 74

Query: 1355 VEKKLEASIKEDICTHPGVIGGMCIRCGQKTDEGQSGVAFGYIHKDLRLANDEIARLRNN 1176
              K LEAS+K DICTHPGVIGGMCI+CG+K D  QSGVAFGYIHKDLRLANDEI RLR+ 
Sbjct: 75   --KTLEASMK-DICTHPGVIGGMCIKCGEKMD-NQSGVAFGYIHKDLRLANDEIVRLRDR 130

Query: 1175 DLKNLFRHKKXXXXXXXXXXXXNSTQFRHITPEEEYLRMPPDSLPDALKGNLFRLDFMHM 996
            DLKNLF  KK            NST+F  +T EE YL    D + D L+G LF+LD M M
Sbjct: 131  DLKNLFNQKKLCLVLDLDHTLLNSTRFMDVTQEEGYLMNQSDPMQDVLRGTLFKLDSMRM 190

Query: 995  MTKLRPFVRTFLKEASKLFEMYIYTMGERAYALEMAKLLDPENIYFNSKVIAQGDCTQRH 816
            +TKLRPFV TFLKEASKLFEMYIYTMGERAYALEMA LLDP  IYF+S+VIAQ DCTQRH
Sbjct: 191  LTKLRPFVHTFLKEASKLFEMYIYTMGERAYALEMATLLDPGKIYFDSRVIAQSDCTQRH 250

Query: 815  QKGLDVVVGQDSAVLILDDTEQVWSKHKENLILMERYHYFVSSYRQFGFNCKSRSELKCD 636
            QKGLDVV+GQ+SAVLILDDTE VW KHK NLILMERYH+F SS +QFG+ CKS SELK D
Sbjct: 251  QKGLDVVLGQESAVLILDDTEAVWVKHKGNLILMERYHFFASSCKQFGYRCKSLSELKND 310

Query: 635  ESEDDGALATVLEVLKRVHSTFFDTEQGADLMKKDVRQVLKIVRNKVLKGCKLVFTRVFP 456
            ESEDDGALATVL+VLKR+HS FFD              VL  VR+++LKGCK+VF+RVFP
Sbjct: 311  ESEDDGALATVLQVLKRIHSMFFD-------------PVLGTVRSEILKGCKIVFSRVFP 357

Query: 455  AKFPAESHHLWKMAEQLGATCSTEMDPSVTHVVSMDKGTEKSRWAVRENKFLVHPGWIEA 276
             KF AE+HHLWKMAE+LGATC+TE+DPSVTHV+S D GTEKSRWAV + KFLV P W+EA
Sbjct: 358  TKFQAENHHLWKMAERLGATCATEVDPSVTHVISTDIGTEKSRWAVDQKKFLVEPRWLEA 417

Query: 275  ANYLWRKQPEENFPVDEVKQTK 210
            ANYLW++QPEE FPV+E+K  +
Sbjct: 418  ANYLWQRQPEELFPVNEIKNNR 439


>XP_011079425.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            [Sesamum indicum] XP_011079426.1 PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 4
            [Sesamum indicum]
          Length = 461

 Score =  547 bits (1410), Expect = 0.0
 Identities = 282/444 (63%), Positives = 329/444 (74%), Gaps = 6/444 (1%)
 Frame = -2

Query: 1532 ASFLDAELDSTSDTSPXXXXXXXXXXXXXXXXXELFSTXXXXXXXXXXXV----DPYGST 1365
            A+FLD ELD+ SD S                    +             +    +P  S+
Sbjct: 18   AAFLDVELDTVSDASADPEEVAEEEEESDDGDGGNYDMDLKRVKRRKVELSEGINPQSSS 77

Query: 1364 SRGVEKKLEASI--KEDICTHPGVIGGMCIRCGQKTDEGQSGVAFGYIHKDLRLANDEIA 1191
            S+G   K+   +  K+++C HPGV  GMC+RCGQK D+ +SGVAFGYIHK+LRLANDEIA
Sbjct: 78   SQGEPAKVVGGLLPKKNMCPHPGVYAGMCMRCGQKMDD-ESGVAFGYIHKNLRLANDEIA 136

Query: 1190 RLRNNDLKNLFRHKKXXXXXXXXXXXXNSTQFRHITPEEEYLRMPPDSLPDALKGNLFRL 1011
            RLR+ DLKNL RHKK            NS +   IT EE YL    D+LPDALK +LFRL
Sbjct: 137  RLRDKDLKNLLRHKKLCLVLDLDHTLLNSARLPDITVEEGYLSQR-DALPDALKSSLFRL 195

Query: 1010 DFMHMMTKLRPFVRTFLKEASKLFEMYIYTMGERAYALEMAKLLDPENIYFNSKVIAQGD 831
            D M MMTKLRPFV  FLKEAS LFEMYIYTMGER YALEMAKLLDP ++YFNS++IAQGD
Sbjct: 196  DRMQMMTKLRPFVHAFLKEASNLFEMYIYTMGERPYALEMAKLLDPGDVYFNSRIIAQGD 255

Query: 830  CTQRHQKGLDVVVGQDSAVLILDDTEQVWSKHKENLILMERYHYFVSSYRQFGFNCKSRS 651
            CTQR+QKGLDVV+GQ+SAVLILDDTE VW KHKENLILMERYH+F SS + FGFNCKS S
Sbjct: 256  CTQRYQKGLDVVLGQESAVLILDDTEAVWGKHKENLILMERYHFFASSCKHFGFNCKSLS 315

Query: 650  ELKCDESEDDGALATVLEVLKRVHSTFFDTEQGADLMKKDVRQVLKIVRNKVLKGCKLVF 471
            EL+ DESE DGALATVL+VL+RVHS FFD      L  +DVRQVLK VR ++L+GCK+VF
Sbjct: 316  ELRSDESETDGALATVLKVLQRVHSLFFDPGHKDRLEDRDVRQVLKTVRKEILEGCKVVF 375

Query: 470  TRVFPAKFPAESHHLWKMAEQLGATCSTEMDPSVTHVVSMDKGTEKSRWAVRENKFLVHP 291
            +RVFP  FPAE HHLWKMAEQLGATCS E+DPSVTHVVSMD GT+KSRWAV+E KFLVHP
Sbjct: 376  SRVFPTNFPAEEHHLWKMAEQLGATCSLELDPSVTHVVSMDAGTDKSRWAVQEKKFLVHP 435

Query: 290  GWIEAANYLWRKQPEENFPVDEVK 219
             WIEA+NY+W+KQPE++FPV + K
Sbjct: 436  RWIEASNYMWQKQPEDSFPVSQAK 459


>XP_012481530.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X2 [Gossypium raimondii]
          Length = 404

 Score =  543 bits (1399), Expect = 0.0
 Identities = 266/388 (68%), Positives = 313/388 (80%), Gaps = 1/388 (0%)
 Frame = -2

Query: 1379 PYGSTSRG-VEKKLEASIKEDICTHPGVIGGMCIRCGQKTDEGQSGVAFGYIHKDLRLAN 1203
            P GSTS+G +E+KLE S+ +D CTHPG  G MCI CGQ+ D+ +SGV FGYIHK LRL N
Sbjct: 17   PQGSTSQGLIEEKLEVSLNKDTCTHPGSFGQMCILCGQRVDD-ESGVTFGYIHKGLRLGN 75

Query: 1202 DEIARLRNNDLKNLFRHKKXXXXXXXXXXXXNSTQFRHITPEEEYLRMPPDSLPDALKGN 1023
            DEI RLR+ D+KNL RHKK            NSTQ  H+T EEEYL+   DS+ D  KG+
Sbjct: 76   DEIVRLRSTDMKNLLRHKKLYLVLDLDHTLLNSTQLNHLTAEEEYLKGQSDSMQDVSKGS 135

Query: 1022 LFRLDFMHMMTKLRPFVRTFLKEASKLFEMYIYTMGERAYALEMAKLLDPENIYFNSKVI 843
            LF L+FMHMMTKLRPFVRTFLKEAS++FEMYIYTMG+R YALEMAKLLDP+  YFN +VI
Sbjct: 136  LFMLEFMHMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEMAKLLDPKKEYFNGRVI 195

Query: 842  AQGDCTQRHQKGLDVVVGQDSAVLILDDTEQVWSKHKENLILMERYHYFVSSYRQFGFNC 663
            ++ D TQ+HQKGLDVV+GQDSAV+ILDDTE  W+KHK+NLILMERYH+F SS RQFGF+C
Sbjct: 196  SRDDGTQKHQKGLDVVLGQDSAVVILDDTENAWTKHKDNLILMERYHFFASSCRQFGFDC 255

Query: 662  KSRSELKCDESEDDGALATVLEVLKRVHSTFFDTEQGADLMKKDVRQVLKIVRNKVLKGC 483
            +S S+LK DESE DGALA++L++L+++H  FFD E  +DL  +DVRQVLK VR +VLK C
Sbjct: 256  RSLSQLKSDESEPDGALASILKILRQIHHIFFD-ELDSDLASRDVRQVLKTVRKEVLKDC 314

Query: 482  KLVFTRVFPAKFPAESHHLWKMAEQLGATCSTEMDPSVTHVVSMDKGTEKSRWAVRENKF 303
            K+VF+RVFP KF  E+H LWKMAEQLGATCSTE D SVTHVVSMD GTEKSRWAV+ENKF
Sbjct: 315  KIVFSRVFPTKFQPENHLLWKMAEQLGATCSTETDSSVTHVVSMDAGTEKSRWAVKENKF 374

Query: 302  LVHPGWIEAANYLWRKQPEENFPVDEVK 219
            LVHP WIEAAN+ W KQPEE FPV + K
Sbjct: 375  LVHPRWIEAANFFWLKQPEEKFPVSQTK 402


>XP_011078409.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X1 [Sesamum indicum]
          Length = 464

 Score =  545 bits (1403), Expect = 0.0
 Identities = 279/442 (63%), Positives = 326/442 (73%), Gaps = 4/442 (0%)
 Frame = -2

Query: 1532 ASFLDAELDSTSDTSPXXXXXXXXXXXXXXXXXEL----FSTXXXXXXXXXXXVDPYGST 1365
            A+FLDAELD+ SD S                        F             ++P  S+
Sbjct: 23   AAFLDAELDTVSDASADPEEVAEGEEESDDGDEGNYDLDFKRVKRRKVELSEGINPQSSS 82

Query: 1364 SRGVEKKLEASIKEDICTHPGVIGGMCIRCGQKTDEGQSGVAFGYIHKDLRLANDEIARL 1185
            S+G   ++   +  ++C HPGV  GMC+RCGQK D+ +SGVAFGYIHK+LRLA+DEIARL
Sbjct: 83   SQGEPAQVVGGLLPNMCPHPGVYAGMCMRCGQKMDD-ESGVAFGYIHKNLRLADDEIARL 141

Query: 1184 RNNDLKNLFRHKKXXXXXXXXXXXXNSTQFRHITPEEEYLRMPPDSLPDALKGNLFRLDF 1005
            R+ DLKNL RHKK            NS +   IT EE YL    D+LPDALK +LFRLD 
Sbjct: 142  RDKDLKNLLRHKKLCLVLDLDHTLLNSARLPDITVEEGYLSQR-DALPDALKSSLFRLDR 200

Query: 1004 MHMMTKLRPFVRTFLKEASKLFEMYIYTMGERAYALEMAKLLDPENIYFNSKVIAQGDCT 825
            M MMTKLRPFV  FLKEAS LFEMYIYTMGER YALEMAKLLDP ++YFNS++IAQGDCT
Sbjct: 201  MQMMTKLRPFVHVFLKEASNLFEMYIYTMGERPYALEMAKLLDPGDVYFNSRIIAQGDCT 260

Query: 824  QRHQKGLDVVVGQDSAVLILDDTEQVWSKHKENLILMERYHYFVSSYRQFGFNCKSRSEL 645
            QR+QKGLDVV+GQ+SAVLILDDTE VW KHKENLILMERYH+F SS + FGFNCKS SEL
Sbjct: 261  QRYQKGLDVVLGQESAVLILDDTEAVWGKHKENLILMERYHFFASSCKHFGFNCKSLSEL 320

Query: 644  KCDESEDDGALATVLEVLKRVHSTFFDTEQGADLMKKDVRQVLKIVRNKVLKGCKLVFTR 465
            + DESE DGALATVL+VL+ VH  FFD      L  +DVRQVLK VR ++L+GCK+VF+R
Sbjct: 321  RSDESETDGALATVLKVLQHVHGLFFDPGYKDHLEDRDVRQVLKTVRKEILEGCKVVFSR 380

Query: 464  VFPAKFPAESHHLWKMAEQLGATCSTEMDPSVTHVVSMDKGTEKSRWAVRENKFLVHPGW 285
            VFP  FPAE HHLWKMAEQLGATCS E+DPSVTHVVSMD GT+KSRWAV+E KFLVHP W
Sbjct: 381  VFPTNFPAEEHHLWKMAEQLGATCSLELDPSVTHVVSMDAGTDKSRWAVQEKKFLVHPRW 440

Query: 284  IEAANYLWRKQPEENFPVDEVK 219
            IEA+NY+W+KQPE++FPV + K
Sbjct: 441  IEASNYMWQKQPEDSFPVSQAK 462


>XP_016727412.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            [Gossypium hirsutum]
          Length = 470

 Score =  544 bits (1401), Expect = 0.0
 Identities = 266/388 (68%), Positives = 314/388 (80%), Gaps = 1/388 (0%)
 Frame = -2

Query: 1379 PYGSTSRG-VEKKLEASIKEDICTHPGVIGGMCIRCGQKTDEGQSGVAFGYIHKDLRLAN 1203
            P GSTS+G +E+KLE S+ +D CTHPG  G MCI CGQ+ D+ +SGV FGYIHK LRL N
Sbjct: 83   PQGSTSQGLIEEKLEVSLNKDTCTHPGSFGQMCILCGQRVDD-ESGVTFGYIHKGLRLGN 141

Query: 1202 DEIARLRNNDLKNLFRHKKXXXXXXXXXXXXNSTQFRHITPEEEYLRMPPDSLPDALKGN 1023
            DEI RLR+ D+KNL RHKK            NSTQ  H+T EEEYL+   DSL D  KG+
Sbjct: 142  DEIVRLRSTDMKNLLRHKKLYLVLDLDHTLLNSTQLNHLTAEEEYLKGQSDSLQDVSKGS 201

Query: 1022 LFRLDFMHMMTKLRPFVRTFLKEASKLFEMYIYTMGERAYALEMAKLLDPENIYFNSKVI 843
            LF L+FMHMMTKLRPFVRTFLKEAS++FEMYIYTMG+R YALEMAKLLDP+  YFN +VI
Sbjct: 202  LFMLEFMHMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEMAKLLDPKKEYFNGRVI 261

Query: 842  AQGDCTQRHQKGLDVVVGQDSAVLILDDTEQVWSKHKENLILMERYHYFVSSYRQFGFNC 663
            ++ D TQ+HQKGLDVV+GQDSAV+ILDDTE  W+KHK+NLILMERYH+F SS RQFGF+C
Sbjct: 262  SRDDGTQKHQKGLDVVLGQDSAVVILDDTENAWTKHKDNLILMERYHFFASSCRQFGFDC 321

Query: 662  KSRSELKCDESEDDGALATVLEVLKRVHSTFFDTEQGADLMKKDVRQVLKIVRNKVLKGC 483
            +S S+LK DESE DGALA++L++L+++H  FFD E  +DL  +DVRQVLK VR ++LK C
Sbjct: 322  RSLSQLKSDESEPDGALASILKILRQIHHIFFD-ELDSDLASRDVRQVLKTVRKELLKDC 380

Query: 482  KLVFTRVFPAKFPAESHHLWKMAEQLGATCSTEMDPSVTHVVSMDKGTEKSRWAVRENKF 303
            K+VF+RVFP KF  E+H LWKMAEQLGATCSTE D SVTHVVSMD GTEKSRWAV+ENKF
Sbjct: 381  KIVFSRVFPTKFQPENHLLWKMAEQLGATCSTETDSSVTHVVSMDAGTEKSRWAVKENKF 440

Query: 302  LVHPGWIEAANYLWRKQPEENFPVDEVK 219
            LVHP WIEAAN+ W+KQPEE FPV + K
Sbjct: 441  LVHPRWIEAANFFWQKQPEEKFPVSQTK 468


>XP_012481529.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X1 [Gossypium raimondii] KJB27892.1 hypothetical
            protein B456_005G016300 [Gossypium raimondii]
          Length = 470

 Score =  543 bits (1399), Expect = 0.0
 Identities = 266/388 (68%), Positives = 313/388 (80%), Gaps = 1/388 (0%)
 Frame = -2

Query: 1379 PYGSTSRG-VEKKLEASIKEDICTHPGVIGGMCIRCGQKTDEGQSGVAFGYIHKDLRLAN 1203
            P GSTS+G +E+KLE S+ +D CTHPG  G MCI CGQ+ D+ +SGV FGYIHK LRL N
Sbjct: 83   PQGSTSQGLIEEKLEVSLNKDTCTHPGSFGQMCILCGQRVDD-ESGVTFGYIHKGLRLGN 141

Query: 1202 DEIARLRNNDLKNLFRHKKXXXXXXXXXXXXNSTQFRHITPEEEYLRMPPDSLPDALKGN 1023
            DEI RLR+ D+KNL RHKK            NSTQ  H+T EEEYL+   DS+ D  KG+
Sbjct: 142  DEIVRLRSTDMKNLLRHKKLYLVLDLDHTLLNSTQLNHLTAEEEYLKGQSDSMQDVSKGS 201

Query: 1022 LFRLDFMHMMTKLRPFVRTFLKEASKLFEMYIYTMGERAYALEMAKLLDPENIYFNSKVI 843
            LF L+FMHMMTKLRPFVRTFLKEAS++FEMYIYTMG+R YALEMAKLLDP+  YFN +VI
Sbjct: 202  LFMLEFMHMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEMAKLLDPKKEYFNGRVI 261

Query: 842  AQGDCTQRHQKGLDVVVGQDSAVLILDDTEQVWSKHKENLILMERYHYFVSSYRQFGFNC 663
            ++ D TQ+HQKGLDVV+GQDSAV+ILDDTE  W+KHK+NLILMERYH+F SS RQFGF+C
Sbjct: 262  SRDDGTQKHQKGLDVVLGQDSAVVILDDTENAWTKHKDNLILMERYHFFASSCRQFGFDC 321

Query: 662  KSRSELKCDESEDDGALATVLEVLKRVHSTFFDTEQGADLMKKDVRQVLKIVRNKVLKGC 483
            +S S+LK DESE DGALA++L++L+++H  FFD E  +DL  +DVRQVLK VR +VLK C
Sbjct: 322  RSLSQLKSDESEPDGALASILKILRQIHHIFFD-ELDSDLASRDVRQVLKTVRKEVLKDC 380

Query: 482  KLVFTRVFPAKFPAESHHLWKMAEQLGATCSTEMDPSVTHVVSMDKGTEKSRWAVRENKF 303
            K+VF+RVFP KF  E+H LWKMAEQLGATCSTE D SVTHVVSMD GTEKSRWAV+ENKF
Sbjct: 381  KIVFSRVFPTKFQPENHLLWKMAEQLGATCSTETDSSVTHVVSMDAGTEKSRWAVKENKF 440

Query: 302  LVHPGWIEAANYLWRKQPEENFPVDEVK 219
            LVHP WIEAAN+ W KQPEE FPV + K
Sbjct: 441  LVHPRWIEAANFFWLKQPEEKFPVSQTK 468


>XP_016468745.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            [Nicotiana tabacum]
          Length = 473

 Score =  543 bits (1399), Expect = 0.0
 Identities = 272/384 (70%), Positives = 313/384 (81%), Gaps = 1/384 (0%)
 Frame = -2

Query: 1379 PYGSTSRGVEKKLE-ASIKEDICTHPGVIGGMCIRCGQKTDEGQSGVAFGYIHKDLRLAN 1203
            P  S SRG   +   AS+  DIC+HPGV+GGMCIRCGQK  E +SGVAFGYIHK+LRLA+
Sbjct: 89   PQSSASRGEPAETSGASLALDICSHPGVMGGMCIRCGQKV-ENESGVAFGYIHKNLRLAD 147

Query: 1202 DEIARLRNNDLKNLFRHKKXXXXXXXXXXXXNSTQFRHITPEEEYLRMPPDSLPDALKGN 1023
            DEIARLR+ DLKNL RHKK            NS +   I+ EE YL+   + LPDAL+ N
Sbjct: 148  DEIARLRDKDLKNLLRHKKLYLVLDLDHTLLNSARLADISAEELYLKDQREVLPDALRSN 207

Query: 1022 LFRLDFMHMMTKLRPFVRTFLKEASKLFEMYIYTMGERAYALEMAKLLDPENIYFNSKVI 843
            LF+LD++HMMTKLRPFV TFLKEAS LFEMYIYTMGER YALEMA LLDP  IYF+S+VI
Sbjct: 208  LFKLDWIHMMTKLRPFVHTFLKEASSLFEMYIYTMGERPYALEMASLLDPGGIYFHSRVI 267

Query: 842  AQGDCTQRHQKGLDVVVGQDSAVLILDDTEQVWSKHKENLILMERYHYFVSSYRQFGFNC 663
            AQGDCTQRHQKGLDVVVGQ+SAVLILDDTE VW KHKENLILMERYH+F SS RQFG  C
Sbjct: 268  AQGDCTQRHQKGLDVVVGQESAVLILDDTEAVWGKHKENLILMERYHFFTSSCRQFGLKC 327

Query: 662  KSRSELKCDESEDDGALATVLEVLKRVHSTFFDTEQGADLMKKDVRQVLKIVRNKVLKGC 483
            KS S  K DE+E +GALA+VL+VL+++HS FFD E+  ++M++DVRQVLK VR ++LKGC
Sbjct: 328  KSLSATKSDENEAEGALASVLKVLQQIHSLFFDPERRDNIMERDVRQVLKQVRKEILKGC 387

Query: 482  KLVFTRVFPAKFPAESHHLWKMAEQLGATCSTEMDPSVTHVVSMDKGTEKSRWAVRENKF 303
            K+VFTRVFP +F AE+HHLWK+AEQLGATCSTE+D SVTHVVSMD GT+KSRWAV+E KF
Sbjct: 388  KIVFTRVFPTQFQAENHHLWKLAEQLGATCSTEVDQSVTHVVSMDAGTDKSRWAVKEKKF 447

Query: 302  LVHPGWIEAANYLWRKQPEENFPV 231
            LVHP WIEAANYLWRK  EENF V
Sbjct: 448  LVHPRWIEAANYLWRKPLEENFLV 471


>KHG05109.1 RNA polymerase II C-terminal domain phosphatase-like 4 [Gossypium
            arboreum]
          Length = 404

 Score =  539 bits (1389), Expect = 0.0
 Identities = 264/386 (68%), Positives = 312/386 (80%), Gaps = 1/386 (0%)
 Frame = -2

Query: 1373 GSTSRG-VEKKLEASIKEDICTHPGVIGGMCIRCGQKTDEGQSGVAFGYIHKDLRLANDE 1197
            GSTS+G +E+KLE S+ +D C+HPG  G MCI CGQ+ D+ +S V FGYIHK LRL NDE
Sbjct: 19   GSTSQGLIEEKLEVSLNKDTCSHPGSFGQMCILCGQRVDD-ESSVTFGYIHKGLRLGNDE 77

Query: 1196 IARLRNNDLKNLFRHKKXXXXXXXXXXXXNSTQFRHITPEEEYLRMPPDSLPDALKGNLF 1017
            I RLR+ D+KNL RHKK            NSTQ  H+T EEEYL+   DSL D  KG+LF
Sbjct: 78   IVRLRSTDMKNLLRHKKLYLVLDLDHTLLNSTQLNHLTAEEEYLKGQSDSLQDVSKGSLF 137

Query: 1016 RLDFMHMMTKLRPFVRTFLKEASKLFEMYIYTMGERAYALEMAKLLDPENIYFNSKVIAQ 837
             L+FM MMTKLRPFVRTFLKEAS++FEMYIYTMG+R YALEMAKLLDP+  YFN +VI++
Sbjct: 138  MLEFMQMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEMAKLLDPKKEYFNGRVISR 197

Query: 836  GDCTQRHQKGLDVVVGQDSAVLILDDTEQVWSKHKENLILMERYHYFVSSYRQFGFNCKS 657
             D TQ+HQKGLDVV+GQDSAV+ILDDTE  W+KHK+NLILMERYH+F SS RQFGF+CKS
Sbjct: 198  DDGTQKHQKGLDVVLGQDSAVVILDDTENAWTKHKDNLILMERYHFFASSCRQFGFDCKS 257

Query: 656  RSELKCDESEDDGALATVLEVLKRVHSTFFDTEQGADLMKKDVRQVLKIVRNKVLKGCKL 477
             S+LK DESE DGALA++L++L+++H  FFD E  +DL  +DVRQVLK VR +VLK CK+
Sbjct: 258  LSQLKSDESEPDGALASILKILRQIHHIFFD-ELDSDLASRDVRQVLKTVRKEVLKNCKI 316

Query: 476  VFTRVFPAKFPAESHHLWKMAEQLGATCSTEMDPSVTHVVSMDKGTEKSRWAVRENKFLV 297
            VF+RVFP KF  E+H LWKMAEQLGATCSTE D SVTH+VSMD GTEKSRWAV+ENKFLV
Sbjct: 317  VFSRVFPTKFQPENHLLWKMAEQLGATCSTETDSSVTHIVSMDAGTEKSRWAVKENKFLV 376

Query: 296  HPGWIEAANYLWRKQPEENFPVDEVK 219
            HP WIEAAN+ W+KQPEENFPV + K
Sbjct: 377  HPRWIEAANFFWQKQPEENFPVSQTK 402


>XP_007014446.2 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X2 [Theobroma cacao]
          Length = 466

 Score =  540 bits (1392), Expect = 0.0
 Identities = 267/386 (69%), Positives = 309/386 (80%), Gaps = 1/386 (0%)
 Frame = -2

Query: 1373 GSTSRG-VEKKLEASIKEDICTHPGVIGGMCIRCGQKTDEGQSGVAFGYIHKDLRLANDE 1197
            GSTS+G +E K+E S+K+DICTHPG  G MCI CGQ+ D+ +SGV FGYIHK LRL NDE
Sbjct: 81   GSTSQGLIEDKIELSLKKDICTHPGSFGQMCILCGQRLDD-ESGVTFGYIHKGLRLGNDE 139

Query: 1196 IARLRNNDLKNLFRHKKXXXXXXXXXXXXNSTQFRHITPEEEYLRMPPDSLPDALKGNLF 1017
            I RLR+ D+KNL RHKK            NSTQ  H+TP+EEYL+   DSL D  +G+LF
Sbjct: 140  IVRLRSTDMKNLLRHKKLYLVLDLDHTLLNSTQLMHLTPDEEYLKGQSDSLQDVSRGSLF 199

Query: 1016 RLDFMHMMTKLRPFVRTFLKEASKLFEMYIYTMGERAYALEMAKLLDPENIYFNSKVIAQ 837
             LDFMHMMTKLRPFVRTFLKEAS++FEMYIYTMG+R YALEMAKLLDP   YF+ +VI++
Sbjct: 200  MLDFMHMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEMAKLLDPRREYFSDRVISR 259

Query: 836  GDCTQRHQKGLDVVVGQDSAVLILDDTEQVWSKHKENLILMERYHYFVSSYRQFGFNCKS 657
             D TQ+HQKGLDVV+GQ+SAV+ILDDTE  W KHK+NLILMERYHYF SS  QFG+ CKS
Sbjct: 260  DDGTQKHQKGLDVVLGQESAVVILDDTENAWMKHKDNLILMERYHYFASSCHQFGYKCKS 319

Query: 656  RSELKCDESEDDGALATVLEVLKRVHSTFFDTEQGADLMKKDVRQVLKIVRNKVLKGCKL 477
             S+LK DESE DGALA+VL+ L+++H  FFD E   +L  +DVRQVLK VR +VLKGCK+
Sbjct: 320  LSQLKSDESEPDGALASVLKALRQIHHMFFD-ELDCNLASRDVRQVLKTVREEVLKGCKI 378

Query: 476  VFTRVFPAKFPAESHHLWKMAEQLGATCSTEMDPSVTHVVSMDKGTEKSRWAVRENKFLV 297
            VF+ VFP  FPAESH LWKMAEQLGATCSTE D SVTHVVS D GTEKSRWAV+E KFLV
Sbjct: 379  VFSHVFPTNFPAESHPLWKMAEQLGATCSTETDLSVTHVVSTDAGTEKSRWAVKEKKFLV 438

Query: 296  HPGWIEAANYLWRKQPEENFPVDEVK 219
            HP WIEA NYLW+KQPEENFPV + K
Sbjct: 439  HPRWIEATNYLWQKQPEENFPVSQGK 464


>XP_019163218.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X2 [Ipomoea nil]
          Length = 478

 Score =  540 bits (1391), Expect = 0.0
 Identities = 268/384 (69%), Positives = 316/384 (82%)
 Frame = -2

Query: 1382 DPYGSTSRGVEKKLEASIKEDICTHPGVIGGMCIRCGQKTDEGQSGVAFGYIHKDLRLAN 1203
            D   S SRG  +  E S+K + CTHPGVIGGMCIRCGQ  D+ +SGV+FGYIHK+L+L  
Sbjct: 91   DTESSKSRG--EPAETSVKMNTCTHPGVIGGMCIRCGQLVDD-ESGVSFGYIHKNLKLTY 147

Query: 1202 DEIARLRNNDLKNLFRHKKXXXXXXXXXXXXNSTQFRHITPEEEYLRMPPDSLPDALKGN 1023
            DE+ARLR  DLKNL +HKK            NST+   I+ EEEYL+   D+LPDALK +
Sbjct: 148  DEVARLREKDLKNLLQHKKLYLVLDLDHTVLNSTRISDISAEEEYLK---DTLPDALKSS 204

Query: 1022 LFRLDFMHMMTKLRPFVRTFLKEASKLFEMYIYTMGERAYALEMAKLLDPENIYFNSKVI 843
            LFRLD +HMMTKLRPFV  FLKEAS LFEMYIYTMGER YALEMAKLLDP ++YF+S+VI
Sbjct: 205  LFRLDRIHMMTKLRPFVNNFLKEASDLFEMYIYTMGERPYALEMAKLLDPRDVYFHSRVI 264

Query: 842  AQGDCTQRHQKGLDVVVGQDSAVLILDDTEQVWSKHKENLILMERYHYFVSSYRQFGFNC 663
            AQGD TQRHQKGLD+V+GQ+S+VLILDDTE VW KHKENLILM+RYH+F SS +QFGF+ 
Sbjct: 265  AQGDSTQRHQKGLDIVLGQESSVLILDDTEVVWGKHKENLILMDRYHFFASSCQQFGFDS 324

Query: 662  KSRSELKCDESEDDGALATVLEVLKRVHSTFFDTEQGADLMKKDVRQVLKIVRNKVLKGC 483
            KS S+LK DESE++GALATVL VLKR+H  FFD ++G +L+ +DVR+VLK VR +VL+GC
Sbjct: 325  KSLSQLKSDESEENGALATVLAVLKRIHGIFFDQKRGDNLLDRDVREVLKGVRKEVLEGC 384

Query: 482  KLVFTRVFPAKFPAESHHLWKMAEQLGATCSTEMDPSVTHVVSMDKGTEKSRWAVRENKF 303
            K+VF+RVFP KF AE+HHLW+MAEQLGATC+TE+D SVTHVVSMD GTEKSRWA +ENKF
Sbjct: 385  KIVFSRVFPTKFHAENHHLWRMAEQLGATCTTELDQSVTHVVSMDAGTEKSRWAQKENKF 444

Query: 302  LVHPGWIEAANYLWRKQPEENFPV 231
            LVHP WIEAANYLW+KQ EENFPV
Sbjct: 445  LVHPKWIEAANYLWKKQAEENFPV 468


>XP_017631987.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            [Gossypium arboreum]
          Length = 470

 Score =  539 bits (1389), Expect = 0.0
 Identities = 264/386 (68%), Positives = 312/386 (80%), Gaps = 1/386 (0%)
 Frame = -2

Query: 1373 GSTSRG-VEKKLEASIKEDICTHPGVIGGMCIRCGQKTDEGQSGVAFGYIHKDLRLANDE 1197
            GSTS+G +E+KLE S+ +D C+HPG  G MCI CGQ+ D+ +S V FGYIHK LRL NDE
Sbjct: 85   GSTSQGLIEEKLEVSLNKDTCSHPGSFGQMCILCGQRVDD-ESSVTFGYIHKGLRLGNDE 143

Query: 1196 IARLRNNDLKNLFRHKKXXXXXXXXXXXXNSTQFRHITPEEEYLRMPPDSLPDALKGNLF 1017
            I RLR+ D+KNL RHKK            NSTQ  H+T EEEYL+   DSL D  KG+LF
Sbjct: 144  IVRLRSTDMKNLLRHKKLYLVLDLDHTLLNSTQLNHLTAEEEYLKGQSDSLQDVSKGSLF 203

Query: 1016 RLDFMHMMTKLRPFVRTFLKEASKLFEMYIYTMGERAYALEMAKLLDPENIYFNSKVIAQ 837
             L+FM MMTKLRPFVRTFLKEAS++FEMYIYTMG+R YALEMAKLLDP+  YFN +VI++
Sbjct: 204  MLEFMQMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEMAKLLDPKKEYFNGRVISR 263

Query: 836  GDCTQRHQKGLDVVVGQDSAVLILDDTEQVWSKHKENLILMERYHYFVSSYRQFGFNCKS 657
             D TQ+HQKGLDVV+GQDSAV+ILDDTE  W+KHK+NLILMERYH+F SS RQFGF+CKS
Sbjct: 264  DDGTQKHQKGLDVVLGQDSAVVILDDTENAWTKHKDNLILMERYHFFASSCRQFGFDCKS 323

Query: 656  RSELKCDESEDDGALATVLEVLKRVHSTFFDTEQGADLMKKDVRQVLKIVRNKVLKGCKL 477
             S+LK DESE DGALA++L++L+++H  FFD E  +DL  +DVRQVLK VR +VLK CK+
Sbjct: 324  LSQLKSDESEPDGALASILKILRQIHHIFFD-ELDSDLASRDVRQVLKTVRKEVLKNCKI 382

Query: 476  VFTRVFPAKFPAESHHLWKMAEQLGATCSTEMDPSVTHVVSMDKGTEKSRWAVRENKFLV 297
            VF+RVFP KF  E+H LWKMAEQLGATCSTE D SVTH+VSMD GTEKSRWAV+ENKFLV
Sbjct: 383  VFSRVFPTKFQPENHLLWKMAEQLGATCSTETDSSVTHIVSMDAGTEKSRWAVKENKFLV 442

Query: 296  HPGWIEAANYLWRKQPEENFPVDEVK 219
            HP WIEAAN+ W+KQPEENFPV + K
Sbjct: 443  HPRWIEAANFFWQKQPEENFPVSQTK 468


>XP_016714083.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            [Gossypium hirsutum]
          Length = 470

 Score =  538 bits (1387), Expect = 0.0
 Identities = 264/388 (68%), Positives = 312/388 (80%), Gaps = 1/388 (0%)
 Frame = -2

Query: 1379 PYGSTSRG-VEKKLEASIKEDICTHPGVIGGMCIRCGQKTDEGQSGVAFGYIHKDLRLAN 1203
            P GSTS+G +E+KLE S+ +D C+HPG  G MCI CGQ+ D+ +S V FGYIHK LRL N
Sbjct: 83   PQGSTSQGLIEEKLEVSLNKDTCSHPGSFGQMCILCGQRVDD-ESSVTFGYIHKGLRLGN 141

Query: 1202 DEIARLRNNDLKNLFRHKKXXXXXXXXXXXXNSTQFRHITPEEEYLRMPPDSLPDALKGN 1023
            DEI RLR+ D+KNL  HKK            NSTQ  H+T EEEYL+   DSL D  KG+
Sbjct: 142  DEIVRLRSTDMKNLLCHKKLYLVLDLDHTLLNSTQLNHLTAEEEYLKGQSDSLQDVSKGS 201

Query: 1022 LFRLDFMHMMTKLRPFVRTFLKEASKLFEMYIYTMGERAYALEMAKLLDPENIYFNSKVI 843
            LF L+FM MMTKLRPFVRTFLKEAS++FEMYIYTMG+R YALEMAKLLDP+  YFN +VI
Sbjct: 202  LFMLEFMQMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEMAKLLDPKKEYFNGRVI 261

Query: 842  AQGDCTQRHQKGLDVVVGQDSAVLILDDTEQVWSKHKENLILMERYHYFVSSYRQFGFNC 663
            ++ D TQ+HQKGLDVV+GQDSAV+ILDDTE  W+KHK+NLILMERYH+F SS RQFGF+C
Sbjct: 262  SRDDGTQKHQKGLDVVLGQDSAVVILDDTENAWTKHKDNLILMERYHFFASSCRQFGFDC 321

Query: 662  KSRSELKCDESEDDGALATVLEVLKRVHSTFFDTEQGADLMKKDVRQVLKIVRNKVLKGC 483
            KS S+LK DESE DGALA++L++L+++H  FFD E  +DL  +DVRQVLK VR +VLK C
Sbjct: 322  KSLSQLKSDESEPDGALASILKILRQIHHIFFD-ELDSDLASRDVRQVLKTVRKEVLKNC 380

Query: 482  KLVFTRVFPAKFPAESHHLWKMAEQLGATCSTEMDPSVTHVVSMDKGTEKSRWAVRENKF 303
            K+VF+RVFP KF  E+H LWKMAEQLGATCSTE D SVTH+VSMD GTEKSRWAV+ENKF
Sbjct: 381  KIVFSRVFPTKFQPENHLLWKMAEQLGATCSTETDSSVTHIVSMDAGTEKSRWAVKENKF 440

Query: 302  LVHPGWIEAANYLWRKQPEENFPVDEVK 219
            LVHP WIEAAN+ W+KQPEENFPV + K
Sbjct: 441  LVHPRWIEAANFFWQKQPEENFPVSQTK 468


>KJB27893.1 hypothetical protein B456_005G016300 [Gossypium raimondii]
          Length = 469

 Score =  537 bits (1383), Expect = 0.0
 Identities = 262/387 (67%), Positives = 310/387 (80%)
 Frame = -2

Query: 1379 PYGSTSRGVEKKLEASIKEDICTHPGVIGGMCIRCGQKTDEGQSGVAFGYIHKDLRLAND 1200
            P GSTS+G+ ++   S+ +D CTHPG  G MCI CGQ+ D+ +SGV FGYIHK LRL ND
Sbjct: 83   PQGSTSQGLIEEKLVSLNKDTCTHPGSFGQMCILCGQRVDD-ESGVTFGYIHKGLRLGND 141

Query: 1199 EIARLRNNDLKNLFRHKKXXXXXXXXXXXXNSTQFRHITPEEEYLRMPPDSLPDALKGNL 1020
            EI RLR+ D+KNL RHKK            NSTQ  H+T EEEYL+   DS+ D  KG+L
Sbjct: 142  EIVRLRSTDMKNLLRHKKLYLVLDLDHTLLNSTQLNHLTAEEEYLKGQSDSMQDVSKGSL 201

Query: 1019 FRLDFMHMMTKLRPFVRTFLKEASKLFEMYIYTMGERAYALEMAKLLDPENIYFNSKVIA 840
            F L+FMHMMTKLRPFVRTFLKEAS++FEMYIYTMG+R YALEMAKLLDP+  YFN +VI+
Sbjct: 202  FMLEFMHMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEMAKLLDPKKEYFNGRVIS 261

Query: 839  QGDCTQRHQKGLDVVVGQDSAVLILDDTEQVWSKHKENLILMERYHYFVSSYRQFGFNCK 660
            + D TQ+HQKGLDVV+GQDSAV+ILDDTE  W+KHK+NLILMERYH+F SS RQFGF+C+
Sbjct: 262  RDDGTQKHQKGLDVVLGQDSAVVILDDTENAWTKHKDNLILMERYHFFASSCRQFGFDCR 321

Query: 659  SRSELKCDESEDDGALATVLEVLKRVHSTFFDTEQGADLMKKDVRQVLKIVRNKVLKGCK 480
            S S+LK DESE DGALA++L++L+++H  FFD E  +DL  +DVRQVLK VR +VLK CK
Sbjct: 322  SLSQLKSDESEPDGALASILKILRQIHHIFFD-ELDSDLASRDVRQVLKTVRKEVLKDCK 380

Query: 479  LVFTRVFPAKFPAESHHLWKMAEQLGATCSTEMDPSVTHVVSMDKGTEKSRWAVRENKFL 300
            +VF+RVFP KF  E+H LWKMAEQLGATCSTE D SVTHVVSMD GTEKSRWAV+ENKFL
Sbjct: 381  IVFSRVFPTKFQPENHLLWKMAEQLGATCSTETDSSVTHVVSMDAGTEKSRWAVKENKFL 440

Query: 299  VHPGWIEAANYLWRKQPEENFPVDEVK 219
            VHP WIEAAN+ W KQPEE FPV + K
Sbjct: 441  VHPRWIEAANFFWLKQPEEKFPVSQTK 467


Top