BLASTX nr result

ID: Forsythia22_contig00019467 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00019467
         (1759 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011079425.1| PREDICTED: RNA polymerase II C-terminal doma...   677   0.0  
ref|XP_011078409.1| PREDICTED: RNA polymerase II C-terminal doma...   669   0.0  
ref|XP_009776171.1| PREDICTED: RNA polymerase II C-terminal doma...   660   0.0  
ref|XP_012846745.1| PREDICTED: RNA polymerase II C-terminal doma...   658   0.0  
ref|XP_011078410.1| PREDICTED: RNA polymerase II C-terminal doma...   645   0.0  
ref|XP_012837700.1| PREDICTED: RNA polymerase II C-terminal doma...   640   e-180
emb|CDP10217.1| unnamed protein product [Coffea canephora]            638   e-180
ref|XP_010323180.1| PREDICTED: RNA polymerase II C-terminal doma...   626   e-176
ref|XP_004242582.1| PREDICTED: RNA polymerase II C-terminal doma...   626   e-176
ref|XP_010323182.1| PREDICTED: RNA polymerase II C-terminal doma...   624   e-175
ref|XP_006343662.1| PREDICTED: RNA polymerase II C-terminal doma...   616   e-173
ref|XP_010645384.1| PREDICTED: RNA polymerase II C-terminal doma...   615   e-173
ref|XP_010647279.1| PREDICTED: RNA polymerase II C-terminal doma...   614   e-173
ref|XP_012837702.1| PREDICTED: RNA polymerase II C-terminal doma...   605   e-170
ref|XP_012078975.1| PREDICTED: RNA polymerase II C-terminal doma...   572   e-160
ref|XP_007014445.1| RNA polymerase II ctd phosphatase, putative ...   571   e-160
ref|XP_012481529.1| PREDICTED: RNA polymerase II C-terminal doma...   566   e-158
gb|KJB27893.1| hypothetical protein B456_005G016300 [Gossypium r...   563   e-157
ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Popu...   561   e-157
ref|XP_010693335.1| PREDICTED: RNA polymerase II C-terminal doma...   557   e-155

>ref|XP_011079425.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            [Sesamum indicum] gi|747065569|ref|XP_011079426.1|
            PREDICTED: RNA polymerase II C-terminal domain
            phosphatase-like 4 [Sesamum indicum]
          Length = 461

 Score =  677 bits (1747), Expect = 0.0
 Identities = 342/462 (74%), Positives = 381/462 (82%), Gaps = 1/462 (0%)
 Frame = -2

Query: 1578 MSLTADSPVHSSSSDDFAAILDAELDSASDASRXXXXXXXXXXXXXXXXXXXDYNSDHQR 1399
            MSL ADSPVHSSSS+D AA LD ELD+ SDAS                     Y+ D +R
Sbjct: 1    MSLAADSPVHSSSSEDLAAFLDVELDTVSDASADPEEVAEEEEESDDGDGGN-YDMDLKR 59

Query: 1398 IKRRKVEVYEEMVDSQSSKSEGEAPQNLGSS-PKNNMCTHPGIIGGMCIRCGQTMDDESG 1222
            +KRRKVE+  E ++ QSS S+GE  + +G   PK NMC HPG+  GMC+RCGQ MDDESG
Sbjct: 60   VKRRKVEL-SEGINPQSSSSQGEPAKVVGGLLPKKNMCPHPGVYAGMCMRCGQKMDDESG 118

Query: 1221 VAFGYIHKNLRLANDEIARLRDKDFKNLLRHKKXXXXXXXXXXXLNSTRIADITIEERYL 1042
            VAFGYIHKNLRLANDEIARLRDKD KNLLRHKK           LNS R+ DIT+EE YL
Sbjct: 119  VAFGYIHKNLRLANDEIARLRDKDLKNLLRHKKLCLVLDLDHTLLNSARLPDITVEEGYL 178

Query: 1041 EGQRDTLPDTLKSSLYRLDLMHMMTKLRPFVNTFLKEASNLFEMYIYTMGERAYALEMAK 862
              QRD LPD LKSSL+RLD M MMTKLRPFV+ FLKEASNLFEMYIYTMGER YALEMAK
Sbjct: 179  S-QRDALPDALKSSLFRLDRMQMMTKLRPFVHAFLKEASNLFEMYIYTMGERPYALEMAK 237

Query: 861  LLDPGGVYFHSRVIAQGDCTHKHQKGLDIVLGQESAVLILDDTEAVWGKHKENLILMERY 682
            LLDPG VYF+SR+IAQGDCT ++QKGLD+VLGQESAVLILDDTEAVWGKHKENLILMERY
Sbjct: 238  LLDPGDVYFNSRIIAQGDCTQRYQKGLDVVLGQESAVLILDDTEAVWGKHKENLILMERY 297

Query: 681  HFFASSCRQFGFNCKSLSELRSDESETDGALATVIKILQQIHSLFFDPEHVDNLEHRDVR 502
            HFFASSC+ FGFNCKSLSELRSDESETDGALATV+K+LQ++HSLFFDP H D LE RDVR
Sbjct: 298  HFFASSCKHFGFNCKSLSELRSDESETDGALATVLKVLQRVHSLFFDPGHKDRLEDRDVR 357

Query: 501  QVLKSVRKEILKGCKVVFTRVFPTNSQAEHQQIWKMAEQLGATCSTELDPHVTHVVSMDA 322
            QVLK+VRKEIL+GCKVVF+RVFPTN  AE   +WKMAEQLGATCS ELDP VTHVVSMDA
Sbjct: 358  QVLKTVRKEILEGCKVVFSRVFPTNFPAEEHHLWKMAEQLGATCSLELDPSVTHVVSMDA 417

Query: 321  GTDKSRWAIQEKKFLVHPRWIEASNYLWKKQPEENFPVSKSQ 196
            GTDKSRWA+QEKKFLVHPRWIEASNY+W+KQPE++FPVS+++
Sbjct: 418  GTDKSRWAVQEKKFLVHPRWIEASNYMWQKQPEDSFPVSQAK 459


>ref|XP_011078409.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X1 [Sesamum indicum]
          Length = 464

 Score =  669 bits (1725), Expect = 0.0
 Identities = 339/461 (73%), Positives = 379/461 (82%)
 Frame = -2

Query: 1578 MSLTADSPVHSSSSDDFAAILDAELDSASDASRXXXXXXXXXXXXXXXXXXXDYNSDHQR 1399
            MSL ADSPVHSSSS+D AA LDAELD+ SDAS                     Y+ D +R
Sbjct: 6    MSLAADSPVHSSSSEDLAAFLDAELDTVSDASADPEEVAEGEEESDDGDEGN-YDLDFKR 64

Query: 1398 IKRRKVEVYEEMVDSQSSKSEGEAPQNLGSSPKNNMCTHPGIIGGMCIRCGQTMDDESGV 1219
            +KRRKVE+  E ++ QSS S+GE  Q +G    N MC HPG+  GMC+RCGQ MDDESGV
Sbjct: 65   VKRRKVEL-SEGINPQSSSSQGEPAQVVGGLLPN-MCPHPGVYAGMCMRCGQKMDDESGV 122

Query: 1218 AFGYIHKNLRLANDEIARLRDKDFKNLLRHKKXXXXXXXXXXXLNSTRIADITIEERYLE 1039
            AFGYIHKNLRLA+DEIARLRDKD KNLLRHKK           LNS R+ DIT+EE YL 
Sbjct: 123  AFGYIHKNLRLADDEIARLRDKDLKNLLRHKKLCLVLDLDHTLLNSARLPDITVEEGYLS 182

Query: 1038 GQRDTLPDTLKSSLYRLDLMHMMTKLRPFVNTFLKEASNLFEMYIYTMGERAYALEMAKL 859
             QRD LPD LKSSL+RLD M MMTKLRPFV+ FLKEASNLFEMYIYTMGER YALEMAKL
Sbjct: 183  -QRDALPDALKSSLFRLDRMQMMTKLRPFVHVFLKEASNLFEMYIYTMGERPYALEMAKL 241

Query: 858  LDPGGVYFHSRVIAQGDCTHKHQKGLDIVLGQESAVLILDDTEAVWGKHKENLILMERYH 679
            LDPG VYF+SR+IAQGDCT ++QKGLD+VLGQESAVLILDDTEAVWGKHKENLILMERYH
Sbjct: 242  LDPGDVYFNSRIIAQGDCTQRYQKGLDVVLGQESAVLILDDTEAVWGKHKENLILMERYH 301

Query: 678  FFASSCRQFGFNCKSLSELRSDESETDGALATVIKILQQIHSLFFDPEHVDNLEHRDVRQ 499
            FFASSC+ FGFNCKSLSELRSDESETDGALATV+K+LQ +H LFFDP + D+LE RDVRQ
Sbjct: 302  FFASSCKHFGFNCKSLSELRSDESETDGALATVLKVLQHVHGLFFDPGYKDHLEDRDVRQ 361

Query: 498  VLKSVRKEILKGCKVVFTRVFPTNSQAEHQQIWKMAEQLGATCSTELDPHVTHVVSMDAG 319
            VLK+VRKEIL+GCKVVF+RVFPTN  AE   +WKMAEQLGATCS ELDP VTHVVSMDAG
Sbjct: 362  VLKTVRKEILEGCKVVFSRVFPTNFPAEEHHLWKMAEQLGATCSLELDPSVTHVVSMDAG 421

Query: 318  TDKSRWAIQEKKFLVHPRWIEASNYLWKKQPEENFPVSKSQ 196
            TDKSRWA+QEKKFLVHPRWIEASNY+W+KQPE++FPVS+++
Sbjct: 422  TDKSRWAVQEKKFLVHPRWIEASNYMWQKQPEDSFPVSQAK 462


>ref|XP_009776171.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            [Nicotiana sylvestris]
          Length = 473

 Score =  660 bits (1702), Expect = 0.0
 Identities = 331/472 (70%), Positives = 376/472 (79%), Gaps = 14/472 (2%)
 Frame = -2

Query: 1578 MSLTADSPVHSSSSDDFAAILDAELDSASDASRXXXXXXXXXXXXXXXXXXXDYN----- 1414
            MSLTADSPVHSSSSDDFAA LDAELDSASD S                    +       
Sbjct: 1    MSLTADSPVHSSSSDDFAAFLDAELDSASDVSPDQHEVENEEAEGEEEVEDEEGQDEDGG 60

Query: 1413 ---------SDHQRIKRRKVEVYEEMVDSQSSKSEGEAPQNLGSSPKNNMCTHPGIIGGM 1261
                     SD  RIK+RK E  E+ V  QSS S GE  +  G+S   ++C+HPG++GGM
Sbjct: 61   DGDDLDDGASDSSRIKKRKAEALEDAVYPQSSASRGEPAETSGASLALDICSHPGVMGGM 120

Query: 1260 CIRCGQTMDDESGVAFGYIHKNLRLANDEIARLRDKDFKNLLRHKKXXXXXXXXXXXLNS 1081
            CIRCGQ +++ESGVAFGYIHKNLRLA+DEIARLRDKD KNLLRHKK           LNS
Sbjct: 121  CIRCGQKVENESGVAFGYIHKNLRLADDEIARLRDKDLKNLLRHKKLYLVLDLDHTLLNS 180

Query: 1080 TRIADITIEERYLEGQRDTLPDTLKSSLYRLDLMHMMTKLRPFVNTFLKEASNLFEMYIY 901
            TR+ADI+ EE YL+ QR+ LPD L+S+L++LD +HMMTKLRPFV+TFLKEAS+LFEMYIY
Sbjct: 181  TRLADISAEELYLKDQREVLPDALRSNLFKLDWIHMMTKLRPFVHTFLKEASSLFEMYIY 240

Query: 900  TMGERAYALEMAKLLDPGGVYFHSRVIAQGDCTHKHQKGLDIVLGQESAVLILDDTEAVW 721
            TMGER YALEMA LLDPGG+YFHSRVIAQGDCT +HQKGLD+V+GQESAVLILDDTEAVW
Sbjct: 241  TMGERPYALEMASLLDPGGIYFHSRVIAQGDCTQRHQKGLDVVVGQESAVLILDDTEAVW 300

Query: 720  GKHKENLILMERYHFFASSCRQFGFNCKSLSELRSDESETDGALATVIKILQQIHSLFFD 541
            GKHKENLILMERYHFF SSCRQFG  CKSLS  +SDE+E +GALA+V+K+LQQIHSLFFD
Sbjct: 301  GKHKENLILMERYHFFTSSCRQFGLKCKSLSATKSDENEAEGALASVLKVLQQIHSLFFD 360

Query: 540  PEHVDNLEHRDVRQVLKSVRKEILKGCKVVFTRVFPTNSQAEHQQIWKMAEQLGATCSTE 361
            PE  DN+  RDVRQVLK VRKEILKGCK+VFTRVFPT  QAE+  +WK+AEQLGATCSTE
Sbjct: 361  PERRDNIMERDVRQVLKQVRKEILKGCKIVFTRVFPTQFQAENHHLWKLAEQLGATCSTE 420

Query: 360  LDPHVTHVVSMDAGTDKSRWAIQEKKFLVHPRWIEASNYLWKKQPEENFPVS 205
            +D  VTHVVSMDAGTDKSRWA++EKKFLVHPRWIEA+NYLW+K  EENFPVS
Sbjct: 421  VDQSVTHVVSMDAGTDKSRWAVKEKKFLVHPRWIEAANYLWRKPLEENFPVS 472


>ref|XP_012846745.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            [Erythranthe guttatus] gi|848893409|ref|XP_012846746.1|
            PREDICTED: RNA polymerase II C-terminal domain
            phosphatase-like 4 [Erythranthe guttatus]
            gi|848893411|ref|XP_012846747.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 4
            [Erythranthe guttatus] gi|848893413|ref|XP_012846748.1|
            PREDICTED: RNA polymerase II C-terminal domain
            phosphatase-like 4 [Erythranthe guttatus]
            gi|604317771|gb|EYU29592.1| hypothetical protein
            MIMGU_mgv1a017809mg [Erythranthe guttata]
          Length = 466

 Score =  658 bits (1698), Expect = 0.0
 Identities = 331/469 (70%), Positives = 378/469 (80%), Gaps = 7/469 (1%)
 Frame = -2

Query: 1578 MSLTADSPVHSSSSD--DFAAILDAELDSASDASRXXXXXXXXXXXXXXXXXXXDYNSDH 1405
            MSL  DSP HSSSSD  D  A LDAELD ASD                        + D 
Sbjct: 1    MSLAEDSPAHSSSSDGDDLVAFLDAELDIASDGEADSEEVADDEDSDNGDEDN---DLDL 57

Query: 1404 QRIKRRKVEVYEEM----VDSQSSKSEGEAPQNL-GSSPKNNMCTHPGIIGGMCIRCGQT 1240
            +R+KRRK+E+ E++    ++SQSS S GE+ Q L GSSPK N C HPG+  GMC+RCGQ 
Sbjct: 58   KRVKRRKIELSEDVNFDVINSQSSSSVGESVQLLSGSSPKKNTCLHPGVYAGMCMRCGQK 117

Query: 1239 MDDESGVAFGYIHKNLRLANDEIARLRDKDFKNLLRHKKXXXXXXXXXXXLNSTRIADIT 1060
            MDDESGVAFGYIHKNLRLANDE+ RLRD+D KN+LRH+K           LNS R+ DIT
Sbjct: 118  MDDESGVAFGYIHKNLRLANDEMDRLRDRDLKNMLRHRKLCLVLDLDHTLLNSARLHDIT 177

Query: 1059 IEERYLEGQRDTLPDTLKSSLYRLDLMHMMTKLRPFVNTFLKEASNLFEMYIYTMGERAY 880
             EE YL GQRD LPDTLKSSL+RLD ++MMTKLRPFV+TFLKEAS LFEMYIYTMGER Y
Sbjct: 178  EEEGYLNGQRDALPDTLKSSLFRLDWIYMMTKLRPFVHTFLKEASKLFEMYIYTMGERPY 237

Query: 879  ALEMAKLLDPGGVYFHSRVIAQGDCTHKHQKGLDIVLGQESAVLILDDTEAVWGKHKENL 700
            ALEMAKLLDPG +YF+SR+IAQGDCTHKHQKGLD+VLGQESAV+ILDDTE VW KHK+NL
Sbjct: 238  ALEMAKLLDPGDIYFNSRIIAQGDCTHKHQKGLDVVLGQESAVVILDDTEVVWSKHKDNL 297

Query: 699  ILMERYHFFASSCRQFGFNCKSLSELRSDESETDGALATVIKILQQIHSLFFDPEHVDNL 520
            ILMERYHFFASSC+QFGFNCKSLSELRSDES+T+GAL TV+K LQQIHSLFFD E  D+L
Sbjct: 298  ILMERYHFFASSCKQFGFNCKSLSELRSDESDTEGALPTVLKRLQQIHSLFFDVERKDSL 357

Query: 519  EHRDVRQVLKSVRKEILKGCKVVFTRVFPTNSQAEHQQIWKMAEQLGATCSTELDPHVTH 340
            E RDVR V+K++RKE+LKGCKVVFTRVFPTN  AEH  +WKMAE+LGATC  E+DP +TH
Sbjct: 358  EDRDVRLVMKTLRKEVLKGCKVVFTRVFPTNFPAEHHSLWKMAEKLGATCCNEIDPCITH 417

Query: 339  VVSMDAGTDKSRWAIQEKKFLVHPRWIEASNYLWKKQPEENFPVSKSQE 193
            VVSMDAGTDKSRWA++EKKFLVHPRWIEASNY+W+KQPEENFPVS++ +
Sbjct: 418  VVSMDAGTDKSRWALKEKKFLVHPRWIEASNYMWQKQPEENFPVSQANK 466


>ref|XP_011078410.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X2 [Sesamum indicum]
          Length = 451

 Score =  645 bits (1663), Expect = 0.0
 Identities = 331/461 (71%), Positives = 369/461 (80%)
 Frame = -2

Query: 1578 MSLTADSPVHSSSSDDFAAILDAELDSASDASRXXXXXXXXXXXXXXXXXXXDYNSDHQR 1399
            MSL ADSPVHSSSS+D AA LDAELD+ SDAS                     Y+ D +R
Sbjct: 6    MSLAADSPVHSSSSEDLAAFLDAELDTVSDASADPEEVAEGEEESDDGDEGN-YDLDFKR 64

Query: 1398 IKRRKVEVYEEMVDSQSSKSEGEAPQNLGSSPKNNMCTHPGIIGGMCIRCGQTMDDESGV 1219
            +KRRKVE+  E ++ QSS S+GE  Q +G    N MC HPG+  GMC+RCGQ MDDESGV
Sbjct: 65   VKRRKVEL-SEGINPQSSSSQGEPAQVVGGLLPN-MCPHPGVYAGMCMRCGQKMDDESGV 122

Query: 1218 AFGYIHKNLRLANDEIARLRDKDFKNLLRHKKXXXXXXXXXXXLNSTRIADITIEERYLE 1039
            AFGYIHKNLRLA+DEIARLRDKD KNLLRHKK           LNS R+ DIT+EE YL 
Sbjct: 123  AFGYIHKNLRLADDEIARLRDKDLKNLLRHKKLCLVLDLDHTLLNSARLPDITVEEGYLS 182

Query: 1038 GQRDTLPDTLKSSLYRLDLMHMMTKLRPFVNTFLKEASNLFEMYIYTMGERAYALEMAKL 859
             QRD LPD LKSSL+RLD M MMTKLRPFV+ FLKEASNLFEMYIYTMGER YALEMAKL
Sbjct: 183  -QRDALPDALKSSLFRLDRMQMMTKLRPFVHVFLKEASNLFEMYIYTMGERPYALEMAKL 241

Query: 858  LDPGGVYFHSRVIAQGDCTHKHQKGLDIVLGQESAVLILDDTEAVWGKHKENLILMERYH 679
            LDPG VYF+SR+IAQGDCT ++QKGLD+VLGQESAVLILDDTEAVWGKHKENLILMERYH
Sbjct: 242  LDPGDVYFNSRIIAQGDCTQRYQKGLDVVLGQESAVLILDDTEAVWGKHKENLILMERYH 301

Query: 678  FFASSCRQFGFNCKSLSELRSDESETDGALATVIKILQQIHSLFFDPEHVDNLEHRDVRQ 499
            FFASSC+ FGFNCKSLSELRSDESETDGALATV+K+LQ +H LFFDP             
Sbjct: 302  FFASSCKHFGFNCKSLSELRSDESETDGALATVLKVLQHVHGLFFDP------------- 348

Query: 498  VLKSVRKEILKGCKVVFTRVFPTNSQAEHQQIWKMAEQLGATCSTELDPHVTHVVSMDAG 319
            VLK+VRKEIL+GCKVVF+RVFPTN  AE   +WKMAEQLGATCS ELDP VTHVVSMDAG
Sbjct: 349  VLKTVRKEILEGCKVVFSRVFPTNFPAEEHHLWKMAEQLGATCSLELDPSVTHVVSMDAG 408

Query: 318  TDKSRWAIQEKKFLVHPRWIEASNYLWKKQPEENFPVSKSQ 196
            TDKSRWA+QEKKFLVHPRWIEASNY+W+KQPE++FPVS+++
Sbjct: 409  TDKSRWAVQEKKFLVHPRWIEASNYMWQKQPEDSFPVSQAK 449


>ref|XP_012837700.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X1 [Erythranthe guttatus]
            gi|848874314|ref|XP_012837701.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 4
            isoform X1 [Erythranthe guttatus]
            gi|604332682|gb|EYU37264.1| hypothetical protein
            MIMGU_mgv1a005925mg [Erythranthe guttata]
          Length = 464

 Score =  640 bits (1650), Expect = e-180
 Identities = 321/468 (68%), Positives = 374/468 (79%), Gaps = 6/468 (1%)
 Frame = -2

Query: 1578 MSLTADSPVHSSSSD--DFAAILDAELDSASDASRXXXXXXXXXXXXXXXXXXXDYNSDH 1405
            MSL  DSP HSSSSD  D  A LDAELD ASD                        + D 
Sbjct: 1    MSLAEDSPAHSSSSDGDDLVAFLDAELDIASDGEADSEEVADDEDSDNGDEDD---DLDL 57

Query: 1404 QRIKRRKVEVYEEM----VDSQSSKSEGEAPQNLGSSPKNNMCTHPGIIGGMCIRCGQTM 1237
            +R+KRRK+E+ E++    ++SQSS S  E   + GSSPK N C HPG+  GMC++CGQ M
Sbjct: 58   KRVKRRKMELSEDVNFDVINSQSSSS-AEQILSAGSSPKKNTCLHPGVYAGMCMKCGQKM 116

Query: 1236 DDESGVAFGYIHKNLRLANDEIARLRDKDFKNLLRHKKXXXXXXXXXXXLNSTRIADITI 1057
            DDESGVAFGYIHKNLRLANDEI RLRD+D KN+LRH+K           LNS R+ DIT 
Sbjct: 117  DDESGVAFGYIHKNLRLANDEIDRLRDRDLKNMLRHRKLCLVLDLDHTLLNSARLHDITE 176

Query: 1056 EERYLEGQRDTLPDTLKSSLYRLDLMHMMTKLRPFVNTFLKEASNLFEMYIYTMGERAYA 877
            +E YL GQR+ LPD LK+SL+RLD ++MMTKLRP+V+TFLKEAS LFEMYIYTMGER YA
Sbjct: 177  QEGYLNGQREALPDNLKNSLFRLDWIYMMTKLRPYVHTFLKEASKLFEMYIYTMGERPYA 236

Query: 876  LEMAKLLDPGGVYFHSRVIAQGDCTHKHQKGLDIVLGQESAVLILDDTEAVWGKHKENLI 697
            LEMAKLLDPG +YF+SR+IAQGDCT KHQKGLD+VLGQESAV+ILDDTEAVW KHK+NLI
Sbjct: 237  LEMAKLLDPGDIYFNSRIIAQGDCTQKHQKGLDVVLGQESAVVILDDTEAVWSKHKDNLI 296

Query: 696  LMERYHFFASSCRQFGFNCKSLSELRSDESETDGALATVIKILQQIHSLFFDPEHVDNLE 517
            LMERYHFFASSC+QFGFNCKSLSEL+SDES+T GALA+V+K LQQIH+LFFD E  D+LE
Sbjct: 297  LMERYHFFASSCKQFGFNCKSLSELQSDESDTQGALASVLKRLQQIHTLFFDAERKDSLE 356

Query: 516  HRDVRQVLKSVRKEILKGCKVVFTRVFPTNSQAEHQQIWKMAEQLGATCSTELDPHVTHV 337
             RDVR V+K++RKE+LKGCKVVFTRVFPTN  +EH  +WKMAE+LGATC  E+DP VTHV
Sbjct: 357  DRDVRLVMKTLRKEVLKGCKVVFTRVFPTNFPSEHHSLWKMAEKLGATCCNEIDPSVTHV 416

Query: 336  VSMDAGTDKSRWAIQEKKFLVHPRWIEASNYLWKKQPEENFPVSKSQE 193
            VSMDAGTDKSRWA+QEKKFLVHPRWIEASNY+W+KQ EENFPVS++++
Sbjct: 417  VSMDAGTDKSRWAVQEKKFLVHPRWIEASNYMWQKQTEENFPVSQAKK 464


>emb|CDP10217.1| unnamed protein product [Coffea canephora]
          Length = 469

 Score =  638 bits (1645), Expect = e-180
 Identities = 322/464 (69%), Positives = 373/464 (80%), Gaps = 6/464 (1%)
 Frame = -2

Query: 1578 MSLTADSPVHS--SSSDDFAAILDAELDSASDAS----RXXXXXXXXXXXXXXXXXXXDY 1417
            MSLTADSPVHS  +S +DFAA LDAELDSASDAS                        DY
Sbjct: 1    MSLTADSPVHSPSTSGEDFAAFLDAELDSASDASPHPEEAEEEVVEEEEAENKGGDTDDY 60

Query: 1416 NSDHQRIKRRKVEVYEEMVDSQSSKSEGEAPQNLGSSPKNNMCTHPGIIGGMCIRCGQTM 1237
            + D ++IKRRKVE+ E  +D ++  S+    Q  G+S   ++C+HPG+IGG+CIRCGQ M
Sbjct: 61   DLDSEKIKRRKVEILESSLDVEAMTSQEVEIQTSGASSDKDVCSHPGVIGGLCIRCGQKM 120

Query: 1236 DDESGVAFGYIHKNLRLANDEIARLRDKDFKNLLRHKKXXXXXXXXXXXLNSTRIADITI 1057
            DDESGVAF YIHKNLRLANDEIARLRDKD KNLLR KK           LNS+R  D+T+
Sbjct: 121  DDESGVAFSYIHKNLRLANDEIARLRDKDLKNLLRKKKLYLVLDLDHTLLNSSRFLDLTV 180

Query: 1056 EERYLEGQRDTLPDTLKSSLYRLDLMHMMTKLRPFVNTFLKEASNLFEMYIYTMGERAYA 877
            +E YL+G RD L D LK+SLY+LD MHMMTKLRPFV++FLKEAS+LFEMYIYTMGERAYA
Sbjct: 181  DEGYLKGSRDDLSDALKNSLYKLDYMHMMTKLRPFVHSFLKEASDLFEMYIYTMGERAYA 240

Query: 876  LEMAKLLDPGGVYFHSRVIAQGDCTHKHQKGLDIVLGQESAVLILDDTEAVWGKHKENLI 697
            L+MAKLLDP  VYF+SRVIAQGDCT +HQKGLDIVLGQESAVLILDDTEAVWGKHKENLI
Sbjct: 241  LQMAKLLDPEDVYFNSRVIAQGDCTQRHQKGLDIVLGQESAVLILDDTEAVWGKHKENLI 300

Query: 696  LMERYHFFASSCRQFGFNCKSLSELRSDESETDGALATVIKILQQIHSLFFDPEHVDNLE 517
            LMERYHFFASSCRQFGF  KSLSE ++DESE++GALATV+++LQQIHS FFD EH  +L 
Sbjct: 301  LMERYHFFASSCRQFGFGSKSLSERKTDESESEGALATVLRVLQQIHSTFFDTEHSASLV 360

Query: 516  HRDVRQVLKSVRKEILKGCKVVFTRVFPTNSQAEHQQIWKMAEQLGATCSTELDPHVTHV 337
             RDVRQVL +VRKE+LKGCKVVFTRVFPT  Q E+  +WKMAE+LGA CS+E+DP VTHV
Sbjct: 361  DRDVRQVLITVRKEVLKGCKVVFTRVFPTQFQGENHHLWKMAERLGAICSSEVDPSVTHV 420

Query: 336  VSMDAGTDKSRWAIQEKKFLVHPRWIEASNYLWKKQPEENFPVS 205
            VS+D GT+KS WA+QE K+LVHPRWIEA+NYLWKKQPEE++PVS
Sbjct: 421  VSLDPGTEKSIWAVQEGKYLVHPRWIEAANYLWKKQPEESYPVS 464


>ref|XP_010323180.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X2 [Solanum lycopersicum]
            gi|723712089|ref|XP_010323181.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 4
            isoform X2 [Solanum lycopersicum]
          Length = 462

 Score =  626 bits (1615), Expect = e-176
 Identities = 312/461 (67%), Positives = 365/461 (79%), Gaps = 3/461 (0%)
 Frame = -2

Query: 1578 MSLTADSPVHSSSSDDFAAILDAELDSASDASRXXXXXXXXXXXXXXXXXXXDYNS---D 1408
            MSLTADSPVHSSSSD+FAA LDAELDSASD                             D
Sbjct: 1    MSLTADSPVHSSSSDEFAAFLDAELDSASDVDEVESGEAEGEEEVEDEDNDTGDGDGSID 60

Query: 1407 HQRIKRRKVEVYEEMVDSQSSKSEGEAPQNLGSSPKNNMCTHPGIIGGMCIRCGQTMDDE 1228
              R K+RK+E+ E  VD QSS S GE  +  G+S   ++CTHPG++GGMCIRCGQ ++DE
Sbjct: 61   SSRSKKRKIELIEGAVDPQSSVSRGEPAETSGASMALDVCTHPGVMGGMCIRCGQKVEDE 120

Query: 1227 SGVAFGYIHKNLRLANDEIARLRDKDFKNLLRHKKXXXXXXXXXXXLNSTRIADITIEER 1048
            SGVAFGYIHKNLRLA+DE+ARLR+KD KNLLRH+K           LNSTR+ADI+ EE 
Sbjct: 121  SGVAFGYIHKNLRLADDEVARLREKDLKNLLRHRKLILVLDLDHTLLNSTRLADISAEES 180

Query: 1047 YLEGQRDTLPDTLKSSLYRLDLMHMMTKLRPFVNTFLKEASNLFEMYIYTMGERAYALEM 868
            YL+ QR+ LPD L+S+L++LD +HMMTKLRPFV+TFLKEAS+LFEMYIYTMGER YALEM
Sbjct: 181  YLKDQREVLPDALRSNLFKLDWIHMMTKLRPFVHTFLKEASSLFEMYIYTMGERPYALEM 240

Query: 867  AKLLDPGGVYFHSRVIAQGDCTHKHQKGLDIVLGQESAVLILDDTEAVWGKHKENLILME 688
            AKLLDPGG+YFHSRVIAQ D T +HQKGLD+VLGQESAVLILDDTE VWGKH+ENLILM+
Sbjct: 241  AKLLDPGGIYFHSRVIAQSDSTRRHQKGLDVVLGQESAVLILDDTEVVWGKHRENLILMD 300

Query: 687  RYHFFASSCRQFGFNCKSLSELRSDESETDGALATVIKILQQIHSLFFDPEHVDNLEHRD 508
            RYHFF SSCRQFG  CKSLSE +SDE+E +GALA+V+++LQ+IH LFFDPE  DN+  RD
Sbjct: 301  RYHFFTSSCRQFGLKCKSLSEQKSDENEAEGALASVLEVLQRIHRLFFDPERGDNIMERD 360

Query: 507  VRQVLKSVRKEILKGCKVVFTRVFPTNSQAEHQQIWKMAEQLGATCSTELDPHVTHVVSM 328
            VRQVLK+VRKEILKGCK+VFT V P   Q E+   WK+AE+LGAT STE+D  VTHVVSM
Sbjct: 361  VRQVLKTVRKEILKGCKIVFTGVIPIQCQPENHYYWKLAEKLGATFSTEVDESVTHVVSM 420

Query: 327  DAGTDKSRWAIQEKKFLVHPRWIEASNYLWKKQPEENFPVS 205
            +  T+KSR A++EKKFLVHPRWIEA+NYLW+K PEENFPVS
Sbjct: 421  NDKTEKSRQAVREKKFLVHPRWIEAANYLWRKPPEENFPVS 461


>ref|XP_004242582.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X1 [Solanum lycopersicum]
          Length = 512

 Score =  626 bits (1615), Expect = e-176
 Identities = 312/461 (67%), Positives = 365/461 (79%), Gaps = 3/461 (0%)
 Frame = -2

Query: 1578 MSLTADSPVHSSSSDDFAAILDAELDSASDASRXXXXXXXXXXXXXXXXXXXDYNS---D 1408
            MSLTADSPVHSSSSD+FAA LDAELDSASD                             D
Sbjct: 51   MSLTADSPVHSSSSDEFAAFLDAELDSASDVDEVESGEAEGEEEVEDEDNDTGDGDGSID 110

Query: 1407 HQRIKRRKVEVYEEMVDSQSSKSEGEAPQNLGSSPKNNMCTHPGIIGGMCIRCGQTMDDE 1228
              R K+RK+E+ E  VD QSS S GE  +  G+S   ++CTHPG++GGMCIRCGQ ++DE
Sbjct: 111  SSRSKKRKIELIEGAVDPQSSVSRGEPAETSGASMALDVCTHPGVMGGMCIRCGQKVEDE 170

Query: 1227 SGVAFGYIHKNLRLANDEIARLRDKDFKNLLRHKKXXXXXXXXXXXLNSTRIADITIEER 1048
            SGVAFGYIHKNLRLA+DE+ARLR+KD KNLLRH+K           LNSTR+ADI+ EE 
Sbjct: 171  SGVAFGYIHKNLRLADDEVARLREKDLKNLLRHRKLILVLDLDHTLLNSTRLADISAEES 230

Query: 1047 YLEGQRDTLPDTLKSSLYRLDLMHMMTKLRPFVNTFLKEASNLFEMYIYTMGERAYALEM 868
            YL+ QR+ LPD L+S+L++LD +HMMTKLRPFV+TFLKEAS+LFEMYIYTMGER YALEM
Sbjct: 231  YLKDQREVLPDALRSNLFKLDWIHMMTKLRPFVHTFLKEASSLFEMYIYTMGERPYALEM 290

Query: 867  AKLLDPGGVYFHSRVIAQGDCTHKHQKGLDIVLGQESAVLILDDTEAVWGKHKENLILME 688
            AKLLDPGG+YFHSRVIAQ D T +HQKGLD+VLGQESAVLILDDTE VWGKH+ENLILM+
Sbjct: 291  AKLLDPGGIYFHSRVIAQSDSTRRHQKGLDVVLGQESAVLILDDTEVVWGKHRENLILMD 350

Query: 687  RYHFFASSCRQFGFNCKSLSELRSDESETDGALATVIKILQQIHSLFFDPEHVDNLEHRD 508
            RYHFF SSCRQFG  CKSLSE +SDE+E +GALA+V+++LQ+IH LFFDPE  DN+  RD
Sbjct: 351  RYHFFTSSCRQFGLKCKSLSEQKSDENEAEGALASVLEVLQRIHRLFFDPERGDNIMERD 410

Query: 507  VRQVLKSVRKEILKGCKVVFTRVFPTNSQAEHQQIWKMAEQLGATCSTELDPHVTHVVSM 328
            VRQVLK+VRKEILKGCK+VFT V P   Q E+   WK+AE+LGAT STE+D  VTHVVSM
Sbjct: 411  VRQVLKTVRKEILKGCKIVFTGVIPIQCQPENHYYWKLAEKLGATFSTEVDESVTHVVSM 470

Query: 327  DAGTDKSRWAIQEKKFLVHPRWIEASNYLWKKQPEENFPVS 205
            +  T+KSR A++EKKFLVHPRWIEA+NYLW+K PEENFPVS
Sbjct: 471  NDKTEKSRQAVREKKFLVHPRWIEAANYLWRKPPEENFPVS 511


>ref|XP_010323182.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            [Solanum lycopersicum]
          Length = 473

 Score =  624 bits (1608), Expect = e-175
 Identities = 314/471 (66%), Positives = 368/471 (78%), Gaps = 13/471 (2%)
 Frame = -2

Query: 1578 MSLTADSPVHSSSSDDFAAILDAELDSASDAS-------------RXXXXXXXXXXXXXX 1438
            MSL ADSPVHSSSSDDFAA LDAELDSASD S                            
Sbjct: 2    MSLMADSPVHSSSSDDFAAFLDAELDSASDVSPELDEVENGEAEVEVELEDEKGKDEDND 61

Query: 1437 XXXXXDYNSDHQRIKRRKVEVYEEMVDSQSSKSEGEAPQNLGSSPKNNMCTHPGIIGGMC 1258
                 D N D +R K+RK+E+ E  VD QS  S GE+ +  G+S   ++CTHPG++GGMC
Sbjct: 62   TGDGDDGNIDSRRSKKRKIELIEAAVDPQSLVSRGESAETSGASLALDVCTHPGVMGGMC 121

Query: 1257 IRCGQTMDDESGVAFGYIHKNLRLANDEIARLRDKDFKNLLRHKKXXXXXXXXXXXLNST 1078
            IRCGQ ++DESGVAFGYIHKNLRLA+DE+ARLR+KD KNLLRH+K           LNST
Sbjct: 122  IRCGQKVEDESGVAFGYIHKNLRLADDEVARLREKDLKNLLRHRKLILVLDLDHTLLNST 181

Query: 1077 RIADITIEERYLEGQRDTLPDTLKSSLYRLDLMHMMTKLRPFVNTFLKEASNLFEMYIYT 898
            R+ADI+ EE YL+ QR+ LPD L+S+L++LD +HMMTKLRPFV+TFLKEAS+LFEMYIYT
Sbjct: 182  RLADISAEESYLKDQREVLPDALRSNLFKLDWIHMMTKLRPFVHTFLKEASSLFEMYIYT 241

Query: 897  MGERAYALEMAKLLDPGGVYFHSRVIAQGDCTHKHQKGLDIVLGQESAVLILDDTEAVWG 718
            MGER YALEMAKLLDPGG+YFHSRVIAQ D T +HQKGLD+VLGQESAVLILDDTE VWG
Sbjct: 242  MGERPYALEMAKLLDPGGIYFHSRVIAQSDSTRRHQKGLDVVLGQESAVLILDDTEVVWG 301

Query: 717  KHKENLILMERYHFFASSCRQFGFNCKSLSELRSDESETDGALATVIKILQQIHSLFFDP 538
            KH+ENLILM+RYHFF SSCRQFG  CKSLSE +SDE+E +GALA+V+++LQ+IH LFFDP
Sbjct: 302  KHRENLILMDRYHFFTSSCRQFGLKCKSLSEQKSDENEAEGALASVLEVLQRIHRLFFDP 361

Query: 537  EHVDNLEHRDVRQVLKSVRKEILKGCKVVFTRVFPTNSQAEHQQIWKMAEQLGATCSTEL 358
            E  DN+  RDVRQVLK+VRKEILKGCK+VFT V P   Q E+   WK+AE+LGAT STE+
Sbjct: 362  ERGDNIMERDVRQVLKTVRKEILKGCKIVFTGVIPIQCQPENHYYWKLAEKLGATFSTEV 421

Query: 357  DPHVTHVVSMDAGTDKSRWAIQEKKFLVHPRWIEASNYLWKKQPEENFPVS 205
            D  VTHVVSM+  T+KSR A++EKKFLVHPRWIEA+NYLW+K PEENFPVS
Sbjct: 422  DESVTHVVSMNDKTEKSRQAVREKKFLVHPRWIEAANYLWRKPPEENFPVS 472


>ref|XP_006343662.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Solanum tuberosum]
          Length = 478

 Score =  616 bits (1589), Expect = e-173
 Identities = 312/476 (65%), Positives = 363/476 (76%), Gaps = 18/476 (3%)
 Frame = -2

Query: 1578 MSLTADSPVHSSSSDDFAAILDAELDSASDASRXXXXXXXXXXXXXXXXXXXDYNS---- 1411
            MSLTADSPVHSSSSDDFAA LDAELDSASD S                            
Sbjct: 2    MSLTADSPVHSSSSDDFAAFLDAELDSASDVSPELDEVENGEAEGEEEVEDEKGQDEGND 61

Query: 1410 --------------DHQRIKRRKVEVYEEMVDSQSSKSEGEAPQNLGSSPKNNMCTHPGI 1273
                          D  R K+RK+E+ E  VD QSS S GE  +  G+S   ++CTHPG+
Sbjct: 62   TGDGDDDDDDDGSIDSSRSKKRKIELIEAAVDPQSSVSRGEPAETSGASLALDVCTHPGV 121

Query: 1272 IGGMCIRCGQTMDDESGVAFGYIHKNLRLANDEIARLRDKDFKNLLRHKKXXXXXXXXXX 1093
            +GGMCIRCGQ ++DESGVAFGYIHKNLRLA+DE+ARLRDKD KNLLRHKK          
Sbjct: 122  MGGMCIRCGQKVEDESGVAFGYIHKNLRLADDEVARLRDKDLKNLLRHKKLILVLDLDHT 181

Query: 1092 XLNSTRIADITIEERYLEGQRDTLPDTLKSSLYRLDLMHMMTKLRPFVNTFLKEASNLFE 913
             LNSTR+ADI+ EE YL+ QR+ LPD L+++L++LD +HMMTKLRPFV+TFLKEAS+LFE
Sbjct: 182  LLNSTRLADISAEESYLKDQREVLPDALRNNLFKLDWIHMMTKLRPFVHTFLKEASSLFE 241

Query: 912  MYIYTMGERAYALEMAKLLDPGGVYFHSRVIAQGDCTHKHQKGLDIVLGQESAVLILDDT 733
            MYIYTMGER YALEMA LLDPGG+YFHSRVIAQ D T +HQKGLD+VLGQESAVLILDDT
Sbjct: 242  MYIYTMGERPYALEMASLLDPGGIYFHSRVIAQSDSTRRHQKGLDVVLGQESAVLILDDT 301

Query: 732  EAVWGKHKENLILMERYHFFASSCRQFGFNCKSLSELRSDESETDGALATVIKILQQIHS 553
            E VWGKH+ENLILM+RYHFF SSCRQFG  CKSLSE +SDE+E +GALA+V+++LQ+IH 
Sbjct: 302  EVVWGKHRENLILMDRYHFFTSSCRQFGLKCKSLSEQKSDENEAEGALASVLEVLQRIHR 361

Query: 552  LFFDPEHVDNLEHRDVRQVLKSVRKEILKGCKVVFTRVFPTNSQAEHQQIWKMAEQLGAT 373
            LFFD E  DN+  RDVRQVLK+VRKEILKGCK+VFT V P   Q E+   WK+AE+LGAT
Sbjct: 362  LFFDLERGDNIMERDVRQVLKTVRKEILKGCKIVFTGVIPIQCQPENHHYWKLAEKLGAT 421

Query: 372  CSTELDPHVTHVVSMDAGTDKSRWAIQEKKFLVHPRWIEASNYLWKKQPEENFPVS 205
             STE+D  VTHVVSM+  T+KSR A++EKKFLVHP WIEA+NYLW+K PEENFPVS
Sbjct: 422  FSTEVDESVTHVVSMNDKTEKSRQALREKKFLVHPSWIEAANYLWRKPPEENFPVS 477


>ref|XP_010645384.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            [Vitis vinifera]
          Length = 458

 Score =  615 bits (1587), Expect = e-173
 Identities = 309/462 (66%), Positives = 363/462 (78%), Gaps = 1/462 (0%)
 Frame = -2

Query: 1578 MSLTADSPVHSSSSDDFAAILDAELDS-ASDASRXXXXXXXXXXXXXXXXXXXDYNSDHQ 1402
            MSL  DSPVHSSSSD FAA LDAELDS +SD S                      +S+++
Sbjct: 1    MSLVTDSPVHSSSSDGFAAYLDAELDSDSSDVSPEQEAEDDEQEAEDES------DSEYK 54

Query: 1401 RIKRRKVEVYEEMVDSQSSKSEGEAPQNLGSSPKNNMCTHPGIIGGMCIRCGQTMDDESG 1222
            R+KR+KVE +E + +   S S+G   QNL  +   + CTHPG+   +CIRCGQ M+  SG
Sbjct: 55   RVKRQKVEEFESIEEHPGSTSDGSLEQNLEVTITKDTCTHPGVFRELCIRCGQKMEGGSG 114

Query: 1221 VAFGYIHKNLRLANDEIARLRDKDFKNLLRHKKXXXXXXXXXXXLNSTRIADITIEERYL 1042
            VAFGYIHK+LRL +DEIARLRD D KNLLRHKK           LNSTR+ DIT EE YL
Sbjct: 115  VAFGYIHKDLRLGSDEIARLRDTDLKNLLRHKKLYLVLDLDHTLLNSTRLLDITPEELYL 174

Query: 1041 EGQRDTLPDTLKSSLYRLDLMHMMTKLRPFVNTFLKEASNLFEMYIYTMGERAYALEMAK 862
            + Q D L   LK +L+ L+ MHM+TKLRP+V+TFLKEAS +FEMYIYTMGER+YALEMAK
Sbjct: 175  KNQTDPLQGGLKGNLFMLNTMHMLTKLRPYVHTFLKEASKMFEMYIYTMGERSYALEMAK 234

Query: 861  LLDPGGVYFHSRVIAQGDCTHKHQKGLDIVLGQESAVLILDDTEAVWGKHKENLILMERY 682
            LLDP  VYF SRVI+Q DCT +HQKGLD+VLGQESAVLILDDTE+VW KHK+NLILMERY
Sbjct: 235  LLDPERVYFSSRVISQADCTQRHQKGLDVVLGQESAVLILDDTESVWQKHKDNLILMERY 294

Query: 681  HFFASSCRQFGFNCKSLSELRSDESETDGALATVIKILQQIHSLFFDPEHVDNLEHRDVR 502
            HFFASSCRQFGFNCKSLSEL+SDESE DGALATV+K+LQ+IHS+FFDPE  D+   RDVR
Sbjct: 295  HFFASSCRQFGFNCKSLSELKSDESEPDGALATVLKVLQRIHSMFFDPELGDDFSGRDVR 354

Query: 501  QVLKSVRKEILKGCKVVFTRVFPTNSQAEHQQIWKMAEQLGATCSTELDPHVTHVVSMDA 322
            QV+K VRKE+LKGCK+VF+RVFPT  QAE+  +W+MAEQLGATC+TELDP VTHVVS DA
Sbjct: 355  QVVKRVRKEVLKGCKIVFSRVFPTRFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDA 414

Query: 321  GTDKSRWAIQEKKFLVHPRWIEASNYLWKKQPEENFPVSKSQ 196
            GT+KSRWA+QEKKFLVHP WIEA+NY W+KQPEENFPV++ +
Sbjct: 415  GTEKSRWALQEKKFLVHPGWIEAANYFWQKQPEENFPVNQKK 456


>ref|XP_010647279.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            [Vitis vinifera]
          Length = 466

 Score =  614 bits (1584), Expect = e-173
 Identities = 308/462 (66%), Positives = 363/462 (78%), Gaps = 1/462 (0%)
 Frame = -2

Query: 1578 MSLTADSPVHSSSSDDFAAILDAELDS-ASDASRXXXXXXXXXXXXXXXXXXXDYNSDHQ 1402
            MSL  DSPVHSSSSD FAA LDAELDS +SD S                      +S+++
Sbjct: 9    MSLVTDSPVHSSSSDGFAAYLDAELDSDSSDVSPEQEAEDDEQEAEDES------DSEYK 62

Query: 1401 RIKRRKVEVYEEMVDSQSSKSEGEAPQNLGSSPKNNMCTHPGIIGGMCIRCGQTMDDESG 1222
            R+KR+KVE +E + +   S S+G   QNL  +   + CTHPG+   +CIRCGQ M+  SG
Sbjct: 63   RVKRQKVEEFESIEEHPGSTSDGSLEQNLEVTITKDTCTHPGVFRELCIRCGQKMEGGSG 122

Query: 1221 VAFGYIHKNLRLANDEIARLRDKDFKNLLRHKKXXXXXXXXXXXLNSTRIADITIEERYL 1042
            VAFGYIHK+LRL +DEIARLRD D KNLLRHKK           LNSTR+ DIT EE YL
Sbjct: 123  VAFGYIHKDLRLGSDEIARLRDTDLKNLLRHKKLYLVLDLDHTLLNSTRLLDITPEELYL 182

Query: 1041 EGQRDTLPDTLKSSLYRLDLMHMMTKLRPFVNTFLKEASNLFEMYIYTMGERAYALEMAK 862
            + Q D L   LK +L+ L+ MHM+TKLRP+V+TFLKEAS +FEMYIYTMGER+YALEMAK
Sbjct: 183  KNQTDPLQGGLKGNLFMLNTMHMLTKLRPYVHTFLKEASKMFEMYIYTMGERSYALEMAK 242

Query: 861  LLDPGGVYFHSRVIAQGDCTHKHQKGLDIVLGQESAVLILDDTEAVWGKHKENLILMERY 682
            LLDP  VYF SRVI+Q DCT +HQKGLD+VLGQESAVLILDDTE+VW KHK+NLILMERY
Sbjct: 243  LLDPERVYFSSRVISQADCTQRHQKGLDVVLGQESAVLILDDTESVWQKHKDNLILMERY 302

Query: 681  HFFASSCRQFGFNCKSLSELRSDESETDGALATVIKILQQIHSLFFDPEHVDNLEHRDVR 502
            HFFASSCRQFGFNCKSLSEL+SDESE DGALATV+K+LQ+IHS+FFDPE  D+   RDVR
Sbjct: 303  HFFASSCRQFGFNCKSLSELKSDESEPDGALATVLKVLQRIHSMFFDPELGDDFSGRDVR 362

Query: 501  QVLKSVRKEILKGCKVVFTRVFPTNSQAEHQQIWKMAEQLGATCSTELDPHVTHVVSMDA 322
            QV+K VRK++LKGCK+VF+RVFPT  QAE+  +W+MAEQLGATC+TELDP VTHVVS DA
Sbjct: 363  QVVKRVRKDVLKGCKIVFSRVFPTRFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDA 422

Query: 321  GTDKSRWAIQEKKFLVHPRWIEASNYLWKKQPEENFPVSKSQ 196
            GT+KSRWA+QEKKFLVHP WIEA+NY W+KQPEENFPV++ +
Sbjct: 423  GTEKSRWALQEKKFLVHPGWIEAANYFWQKQPEENFPVNQKK 464


>ref|XP_012837702.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X2 [Erythranthe guttatus]
          Length = 400

 Score =  605 bits (1561), Expect = e-170
 Identities = 291/396 (73%), Positives = 340/396 (85%)
 Frame = -2

Query: 1380 EVYEEMVDSQSSKSEGEAPQNLGSSPKNNMCTHPGIIGGMCIRCGQTMDDESGVAFGYIH 1201
            +V  ++++SQSS S  E   + GSSPK N C HPG+  GMC++CGQ MDDESGVAFGYIH
Sbjct: 6    DVNFDVINSQSSSS-AEQILSAGSSPKKNTCLHPGVYAGMCMKCGQKMDDESGVAFGYIH 64

Query: 1200 KNLRLANDEIARLRDKDFKNLLRHKKXXXXXXXXXXXLNSTRIADITIEERYLEGQRDTL 1021
            KNLRLANDEI RLRD+D KN+LRH+K           LNS R+ DIT +E YL GQR+ L
Sbjct: 65   KNLRLANDEIDRLRDRDLKNMLRHRKLCLVLDLDHTLLNSARLHDITEQEGYLNGQREAL 124

Query: 1020 PDTLKSSLYRLDLMHMMTKLRPFVNTFLKEASNLFEMYIYTMGERAYALEMAKLLDPGGV 841
            PD LK+SL+RLD ++MMTKLRP+V+TFLKEAS LFEMYIYTMGER YALEMAKLLDPG +
Sbjct: 125  PDNLKNSLFRLDWIYMMTKLRPYVHTFLKEASKLFEMYIYTMGERPYALEMAKLLDPGDI 184

Query: 840  YFHSRVIAQGDCTHKHQKGLDIVLGQESAVLILDDTEAVWGKHKENLILMERYHFFASSC 661
            YF+SR+IAQGDCT KHQKGLD+VLGQESAV+ILDDTEAVW KHK+NLILMERYHFFASSC
Sbjct: 185  YFNSRIIAQGDCTQKHQKGLDVVLGQESAVVILDDTEAVWSKHKDNLILMERYHFFASSC 244

Query: 660  RQFGFNCKSLSELRSDESETDGALATVIKILQQIHSLFFDPEHVDNLEHRDVRQVLKSVR 481
            +QFGFNCKSLSEL+SDES+T GALA+V+K LQQIH+LFFD E  D+LE RDVR V+K++R
Sbjct: 245  KQFGFNCKSLSELQSDESDTQGALASVLKRLQQIHTLFFDAERKDSLEDRDVRLVMKTLR 304

Query: 480  KEILKGCKVVFTRVFPTNSQAEHQQIWKMAEQLGATCSTELDPHVTHVVSMDAGTDKSRW 301
            KE+LKGCKVVFTRVFPTN  +EH  +WKMAE+LGATC  E+DP VTHVVSMDAGTDKSRW
Sbjct: 305  KEVLKGCKVVFTRVFPTNFPSEHHSLWKMAEKLGATCCNEIDPSVTHVVSMDAGTDKSRW 364

Query: 300  AIQEKKFLVHPRWIEASNYLWKKQPEENFPVSKSQE 193
            A+QEKKFLVHPRWIEASNY+W+KQ EENFPVS++++
Sbjct: 365  AVQEKKFLVHPRWIEASNYMWQKQTEENFPVSQAKK 400


>ref|XP_012078975.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            [Jatropha curcas] gi|802640739|ref|XP_012078976.1|
            PREDICTED: RNA polymerase II C-terminal domain
            phosphatase-like 4 [Jatropha curcas]
            gi|643722394|gb|KDP32215.1| hypothetical protein
            JCGZ_13822 [Jatropha curcas]
          Length = 470

 Score =  572 bits (1473), Expect = e-160
 Identities = 287/466 (61%), Positives = 347/466 (74%), Gaps = 7/466 (1%)
 Frame = -2

Query: 1578 MSLTADSPVHSSSSDDFAAILDAELDSASDASRXXXXXXXXXXXXXXXXXXXDYNS---- 1411
            MSL  DSPVHSSSS+DFAA+LDAELDS S  S                      +     
Sbjct: 1    MSLVTDSPVHSSSSEDFAALLDAELDSKSSDSSPNDDDEEEEEEEEEEEEEEAKDEPEDD 60

Query: 1410 ---DHQRIKRRKVEVYEEMVDSQSSKSEGEAPQNLGSSPKNNMCTHPGIIGGMCIRCGQT 1240
               + +RIKR +VE  E + D + S   G    NLG+S     CTHPG  G MCI CGQ 
Sbjct: 61   PDIESKRIKRSRVETLENVEDPKGSTFHGSLDLNLGASSSKVACTHPGSFGDMCIICGQR 120

Query: 1239 MDDESGVAFGYIHKNLRLANDEIARLRDKDFKNLLRHKKXXXXXXXXXXXLNSTRIADIT 1060
            +++E+GV   YIHK LRL NDEI RLR+ D KNLLRHKK           LNST++  +T
Sbjct: 121  LNEETGVTLAYIHKGLRLGNDEIVRLRNSDTKNLLRHKKLYLVLDLDHTLLNSTQLMHMT 180

Query: 1059 IEERYLEGQRDTLPDTLKSSLYRLDLMHMMTKLRPFVNTFLKEASNLFEMYIYTMGERAY 880
             EE YL+ Q D+L D    SL++LD MHMMTKLRP+V+TFLKEAS +FEMYIYTMG+RAY
Sbjct: 181  AEEEYLKSQLDSLQDVSNGSLFKLDFMHMMTKLRPYVHTFLKEASQMFEMYIYTMGDRAY 240

Query: 879  ALEMAKLLDPGGVYFHSRVIAQGDCTHKHQKGLDIVLGQESAVLILDDTEAVWGKHKENL 700
            ALEMAKLLDP   YF++RVI++ D T +HQKGLDIVLGQESAVLILDDTE  W KHK+NL
Sbjct: 241  ALEMAKLLDPRREYFNARVISRDDGTQRHQKGLDIVLGQESAVLILDDTETAWTKHKDNL 300

Query: 699  ILMERYHFFASSCRQFGFNCKSLSELRSDESETDGALATVIKILQQIHSLFFDPEHVDNL 520
            ILMERYHFFASSC QFGF+CKSLSEL+SDES++DGALA+V+K+L++IH +FFD     NL
Sbjct: 301  ILMERYHFFASSCHQFGFSCKSLSELKSDESDSDGALASVLKVLRRIHHIFFDELMDVNL 360

Query: 519  EHRDVRQVLKSVRKEILKGCKVVFTRVFPTNSQAEHQQIWKMAEQLGATCSTELDPHVTH 340
            + RDVRQVLK+VRK++L+GCK+VF+RVFPT  QA + Q+WKMAEQLGA CSTELD  +TH
Sbjct: 361  DSRDVRQVLKTVRKDVLEGCKIVFSRVFPTQFQANNHQLWKMAEQLGAICSTELDSSITH 420

Query: 339  VVSMDAGTDKSRWAIQEKKFLVHPRWIEASNYLWKKQPEENFPVSK 202
            VVS +AGT+KSRWA++ KKFLVHPRWIEA+NYLW++QPEENF V++
Sbjct: 421  VVSTEAGTEKSRWAMKNKKFLVHPRWIEAANYLWQRQPEENFSVNQ 466


>ref|XP_007014445.1| RNA polymerase II ctd phosphatase, putative isoform 1 [Theobroma
            cacao] gi|508784808|gb|EOY32064.1| RNA polymerase II ctd
            phosphatase, putative isoform 1 [Theobroma cacao]
          Length = 469

 Score =  571 bits (1472), Expect = e-160
 Identities = 295/468 (63%), Positives = 344/468 (73%), Gaps = 7/468 (1%)
 Frame = -2

Query: 1578 MSLTADSPVHSSSSDDFAAILDAELDSASDASRXXXXXXXXXXXXXXXXXXXDYNSDH-- 1405
            MSL  DSPVHSSSSDDFAA+LDAEL+  S  S                      + D   
Sbjct: 1    MSLVTDSPVHSSSSDDFAALLDAELEVGSSGSSPDEEDVEADGDNNNDNNDDHDDDDDLD 60

Query: 1404 -QRIKRRKVEVYEEMVDSQSSKSEGEAPQNL----GSSPKNNMCTHPGIIGGMCIRCGQT 1240
             QR KR K E  E++ +S+ S S+G     +      S K ++CTHPG  G MCI CGQ 
Sbjct: 61   SQRNKRCKTEKLEDLEESRGSTSQGLIEDKIVIHAELSLKKDICTHPGSFGQMCILCGQR 120

Query: 1239 MDDESGVAFGYIHKNLRLANDEIARLRDKDFKNLLRHKKXXXXXXXXXXXLNSTRIADIT 1060
            +DDESGV FGYIHK LRL NDEI RLR  D KNLLRHKK           LNST++  +T
Sbjct: 121  LDDESGVTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHKKLYLVLDLDHTLLNSTQLMHLT 180

Query: 1059 IEERYLEGQRDTLPDTLKSSLYRLDLMHMMTKLRPFVNTFLKEASNLFEMYIYTMGERAY 880
             +E YL+GQ D+L D  + SL+ LD MHMMTKLRPFV TFLKEAS +FEMYIYTMG+R Y
Sbjct: 181  PDEEYLKGQSDSLQDVSRGSLFMLDFMHMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPY 240

Query: 879  ALEMAKLLDPGGVYFHSRVIAQGDCTHKHQKGLDIVLGQESAVLILDDTEAVWGKHKENL 700
            ALEMAKLLDP   YF  RVI++ D T KHQKGLD+VLGQESAV+ILDDTE  W KHK+NL
Sbjct: 241  ALEMAKLLDPRREYFSDRVISRDDGTQKHQKGLDVVLGQESAVVILDDTENAWMKHKDNL 300

Query: 699  ILMERYHFFASSCRQFGFNCKSLSELRSDESETDGALATVIKILQQIHSLFFDPEHVDNL 520
            ILMERYH+FASSC QFG+ CKSLS+L+SDESE DGALA+V+K L+QIH +FFD E   NL
Sbjct: 301  ILMERYHYFASSCHQFGYKCKSLSQLKSDESEPDGALASVLKALRQIHHMFFD-ELDCNL 359

Query: 519  EHRDVRQVLKSVRKEILKGCKVVFTRVFPTNSQAEHQQIWKMAEQLGATCSTELDPHVTH 340
              RDVRQVLK+V++E+LKGCK+VF+ VFPTN  AE   +WKMAEQLGATCSTE D  VTH
Sbjct: 360  ASRDVRQVLKTVQEEVLKGCKIVFSHVFPTNFPAESHPLWKMAEQLGATCSTETDLSVTH 419

Query: 339  VVSMDAGTDKSRWAIQEKKFLVHPRWIEASNYLWKKQPEENFPVSKSQ 196
            VVS DAGT+KSRWA++EKKFLVHPRWIEA+NYLW+KQPEENFPVS+ +
Sbjct: 420  VVSTDAGTEKSRWAVKEKKFLVHPRWIEATNYLWQKQPEENFPVSQGK 467


>ref|XP_012481529.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X1 [Gossypium raimondii]
            gi|763760638|gb|KJB27892.1| hypothetical protein
            B456_005G016300 [Gossypium raimondii]
          Length = 470

 Score =  566 bits (1458), Expect = e-158
 Identities = 290/470 (61%), Positives = 343/470 (72%), Gaps = 9/470 (1%)
 Frame = -2

Query: 1578 MSLTADSPVHSSSSDDFAAILDAELDSASDASRXXXXXXXXXXXXXXXXXXXDYN----- 1414
            MS   DSPVHSSSSDDFAA++DAEL+  S  S                      +     
Sbjct: 1    MSFATDSPVHSSSSDDFAALIDAELEVGSSGSSPDEQDNEEEEVDADSDDDDSDDEEDDS 60

Query: 1413 ----SDHQRIKRRKVEVYEEMVDSQSSKSEGEAPQNLGSSPKNNMCTHPGIIGGMCIRCG 1246
                +DH R KR K E  +++   Q S S+G   + L  S   + CTHPG  G MCI CG
Sbjct: 61   NDDLNDH-RNKRCKTEKLDDLEGPQGSTSQGLIEEKLEVSLNKDTCTHPGSFGQMCILCG 119

Query: 1245 QTMDDESGVAFGYIHKNLRLANDEIARLRDKDFKNLLRHKKXXXXXXXXXXXLNSTRIAD 1066
            Q +DDESGV FGYIHK LRL NDEI RLR  D KNLLRHKK           LNST++  
Sbjct: 120  QRVDDESGVTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHKKLYLVLDLDHTLLNSTQLNH 179

Query: 1065 ITIEERYLEGQRDTLPDTLKSSLYRLDLMHMMTKLRPFVNTFLKEASNLFEMYIYTMGER 886
            +T EE YL+GQ D++ D  K SL+ L+ MHMMTKLRPFV TFLKEAS +FEMYIYTMG+R
Sbjct: 180  LTAEEEYLKGQSDSMQDVSKGSLFMLEFMHMMTKLRPFVRTFLKEASEMFEMYIYTMGDR 239

Query: 885  AYALEMAKLLDPGGVYFHSRVIAQGDCTHKHQKGLDIVLGQESAVLILDDTEAVWGKHKE 706
             YALEMAKLLDP   YF+ RVI++ D T KHQKGLD+VLGQ+SAV+ILDDTE  W KHK+
Sbjct: 240  PYALEMAKLLDPKKEYFNGRVISRDDGTQKHQKGLDVVLGQDSAVVILDDTENAWTKHKD 299

Query: 705  NLILMERYHFFASSCRQFGFNCKSLSELRSDESETDGALATVIKILQQIHSLFFDPEHVD 526
            NLILMERYHFFASSCRQFGF+C+SLS+L+SDESE DGALA+++KIL+QIH +FFD E   
Sbjct: 300  NLILMERYHFFASSCRQFGFDCRSLSQLKSDESEPDGALASILKILRQIHHIFFD-ELDS 358

Query: 525  NLEHRDVRQVLKSVRKEILKGCKVVFTRVFPTNSQAEHQQIWKMAEQLGATCSTELDPHV 346
            +L  RDVRQVLK+VRKE+LK CK+VF+RVFPT  Q E+  +WKMAEQLGATCSTE D  V
Sbjct: 359  DLASRDVRQVLKTVRKEVLKDCKIVFSRVFPTKFQPENHLLWKMAEQLGATCSTETDSSV 418

Query: 345  THVVSMDAGTDKSRWAIQEKKFLVHPRWIEASNYLWKKQPEENFPVSKSQ 196
            THVVSMDAGT+KSRWA++E KFLVHPRWIEA+N+ W KQPEE FPVS+++
Sbjct: 419  THVVSMDAGTEKSRWAVKENKFLVHPRWIEAANFFWLKQPEEKFPVSQTK 468


>gb|KJB27893.1| hypothetical protein B456_005G016300 [Gossypium raimondii]
          Length = 469

 Score =  563 bits (1452), Expect = e-157
 Identities = 291/470 (61%), Positives = 344/470 (73%), Gaps = 9/470 (1%)
 Frame = -2

Query: 1578 MSLTADSPVHSSSSDDFAAILDAELDSASDASRXXXXXXXXXXXXXXXXXXXDYN----- 1414
            MS   DSPVHSSSSDDFAA++DAEL+  S  S                      +     
Sbjct: 1    MSFATDSPVHSSSSDDFAALIDAELEVGSSGSSPDEQDNEEEEVDADSDDDDSDDEEDDS 60

Query: 1413 ----SDHQRIKRRKVEVYEEMVDSQSSKSEGEAPQNLGSSPKNNMCTHPGIIGGMCIRCG 1246
                +DH R KR K E  +++   Q S S+G   + L S  K+  CTHPG  G MCI CG
Sbjct: 61   NDDLNDH-RNKRCKTEKLDDLEGPQGSTSQGLIEEKLVSLNKDT-CTHPGSFGQMCILCG 118

Query: 1245 QTMDDESGVAFGYIHKNLRLANDEIARLRDKDFKNLLRHKKXXXXXXXXXXXLNSTRIAD 1066
            Q +DDESGV FGYIHK LRL NDEI RLR  D KNLLRHKK           LNST++  
Sbjct: 119  QRVDDESGVTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHKKLYLVLDLDHTLLNSTQLNH 178

Query: 1065 ITIEERYLEGQRDTLPDTLKSSLYRLDLMHMMTKLRPFVNTFLKEASNLFEMYIYTMGER 886
            +T EE YL+GQ D++ D  K SL+ L+ MHMMTKLRPFV TFLKEAS +FEMYIYTMG+R
Sbjct: 179  LTAEEEYLKGQSDSMQDVSKGSLFMLEFMHMMTKLRPFVRTFLKEASEMFEMYIYTMGDR 238

Query: 885  AYALEMAKLLDPGGVYFHSRVIAQGDCTHKHQKGLDIVLGQESAVLILDDTEAVWGKHKE 706
             YALEMAKLLDP   YF+ RVI++ D T KHQKGLD+VLGQ+SAV+ILDDTE  W KHK+
Sbjct: 239  PYALEMAKLLDPKKEYFNGRVISRDDGTQKHQKGLDVVLGQDSAVVILDDTENAWTKHKD 298

Query: 705  NLILMERYHFFASSCRQFGFNCKSLSELRSDESETDGALATVIKILQQIHSLFFDPEHVD 526
            NLILMERYHFFASSCRQFGF+C+SLS+L+SDESE DGALA+++KIL+QIH +FFD E   
Sbjct: 299  NLILMERYHFFASSCRQFGFDCRSLSQLKSDESEPDGALASILKILRQIHHIFFD-ELDS 357

Query: 525  NLEHRDVRQVLKSVRKEILKGCKVVFTRVFPTNSQAEHQQIWKMAEQLGATCSTELDPHV 346
            +L  RDVRQVLK+VRKE+LK CK+VF+RVFPT  Q E+  +WKMAEQLGATCSTE D  V
Sbjct: 358  DLASRDVRQVLKTVRKEVLKDCKIVFSRVFPTKFQPENHLLWKMAEQLGATCSTETDSSV 417

Query: 345  THVVSMDAGTDKSRWAIQEKKFLVHPRWIEASNYLWKKQPEENFPVSKSQ 196
            THVVSMDAGT+KSRWA++E KFLVHPRWIEA+N+ W KQPEE FPVS+++
Sbjct: 418  THVVSMDAGTEKSRWAVKENKFLVHPRWIEAANFFWLKQPEEKFPVSQTK 467


>ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa]
            gi|550318538|gb|EEF03112.2| hypothetical protein
            POPTR_0018s11760g [Populus trichocarpa]
          Length = 472

 Score =  561 bits (1446), Expect = e-157
 Identities = 286/466 (61%), Positives = 343/466 (73%), Gaps = 11/466 (2%)
 Frame = -2

Query: 1578 MSLTADSPVHSSSSDDFAAILDAELDSASDAS---------RXXXXXXXXXXXXXXXXXX 1426
            MSL  DSPVHSSSSDDFAA LD ELDS S AS         +                  
Sbjct: 1    MSLVTDSPVHSSSSDDFAAFLDTELDSKSSASSASDDEAPNQRHSDSAASSSPDQDKEAE 60

Query: 1425 XDYNSDHQR--IKRRKVEVYEEMVDSQSSKSEGEAPQNLGSSPKNNMCTHPGIIGGMCIR 1252
             D +SD QR  +KR KVE  E + D   + S      N  +S    +CTHPG  G MCI 
Sbjct: 61   EDDDSDFQRKRVKRSKVETVEIVEDDGGTTSFASLKHNSEASISKEICTHPGSFGTMCIV 120

Query: 1251 CGQTMDDESGVAFGYIHKNLRLANDEIARLRDKDFKNLLRHKKXXXXXXXXXXXLNSTRI 1072
            CGQ +D ESGV FGYIHK LRL NDEI RLR+ D KNLLRHKK           LNST++
Sbjct: 121  CGQLLDGESGVTFGYIHKGLRLGNDEIVRLRNTDMKNLLRHKKLYLILDLDHTLLNSTQL 180

Query: 1071 ADITIEERYLEGQRDTLPDTLKSSLYRLDLMHMMTKLRPFVNTFLKEASNLFEMYIYTMG 892
              +T++E YL GQ D+L D  K SL+ L  M MMTKLRPFV TFLKEAS +FEMYIYTMG
Sbjct: 181  MHMTLDEEYLNGQTDSLQDVSKGSLFMLSSMQMMTKLRPFVRTFLKEASQMFEMYIYTMG 240

Query: 891  ERAYALEMAKLLDPGGVYFHSRVIAQGDCTHKHQKGLDIVLGQESAVLILDDTEAVWGKH 712
            +RAYALEMAKLLDPG  YF+++VI++ D T +HQKGLD+VLGQESAVLILDDTE  W KH
Sbjct: 241  DRAYALEMAKLLDPGREYFNAKVISRDDGTQRHQKGLDVVLGQESAVLILDDTENAWMKH 300

Query: 711  KENLILMERYHFFASSCRQFGFNCKSLSELRSDESETDGALATVIKILQQIHSLFFDPEH 532
            K+NLILMERYHFFASSC QFGFNCKSLSE ++DESE++GALA+++K+L++IH +FF+ E 
Sbjct: 301  KDNLILMERYHFFASSCHQFGFNCKSLSEQKTDESESEGALASILKVLRKIHQIFFE-EL 359

Query: 531  VDNLEHRDVRQVLKSVRKEILKGCKVVFTRVFPTNSQAEHQQIWKMAEQLGATCSTELDP 352
             +N++ RDVRQVLK+VRK++LKGCK+VF+RVFPT SQA++  +W+MAEQLGATCSTELDP
Sbjct: 360  EENMDGRDVRQVLKTVRKDVLKGCKIVFSRVFPTQSQADNHHLWRMAEQLGATCSTELDP 419

Query: 351  HVTHVVSMDAGTDKSRWAIQEKKFLVHPRWIEASNYLWKKQPEENF 214
             VTHVVS D+GT+KS WA++  KFLV P WIEA+NY W++QPEENF
Sbjct: 420  SVTHVVSKDSGTEKSHWALKHNKFLVQPGWIEAANYFWQRQPEENF 465


>ref|XP_010693335.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            [Beta vulgaris subsp. vulgaris]
            gi|870846670|gb|KMS99186.1| hypothetical protein
            BVRB_2g047230 [Beta vulgaris subsp. vulgaris]
          Length = 434

 Score =  557 bits (1435), Expect = e-155
 Identities = 280/460 (60%), Positives = 339/460 (73%), Gaps = 3/460 (0%)
 Frame = -2

Query: 1578 MSLTADSPVHSSSSDDFAAILDAELDSASDASRXXXXXXXXXXXXXXXXXXXDYNSDHQR 1399
            MS+  DSPV SSSSDDFAA+LDAELDS S  +                      N +  R
Sbjct: 1    MSVATDSPVSSSSSDDFAALLDAELDSGSSDTSPDQDEDN--------------NVEGAR 46

Query: 1398 IKRRKVEVYEEMVDSQSSKSEGEAPQNLGSSPKNNMCTHPGIIGGMCIRCGQTMDDESGV 1219
            +KRRKV   +  V+ + S                  CTHPG +  +CI CG+ MDD +GV
Sbjct: 47   MKRRKVLEIDSKVEVEGS------------------CTHPGFLRDLCIGCGKRMDDGAGV 88

Query: 1218 AFGYIHKNLRLANDEIARLRDKDFKNLLRHKKXXXXXXXXXXXLNSTRIADITIEERYLE 1039
            AFGYIHK+LRL NDEI+RLR+ D ++LLRHKK           LNSTR+ DI  EE YL+
Sbjct: 89   AFGYIHKDLRLGNDEISRLRNADVRSLLRHKKLYLVLDLDHTLLNSTRLEDINSEEEYLK 148

Query: 1038 GQRDTLPDTLKSSLYRLDLMHMMTKLRPFVNTFLKEASNLFEMYIYTMGERAYALEMAKL 859
             Q D+  D  K SL+RLD+M MMTKLRP+V TFL+EAS++FEMYIYTMGER YA+EMAKL
Sbjct: 149  SQTDSFQDIAKGSLFRLDMMRMMTKLRPYVRTFLEEASSMFEMYIYTMGERPYAIEMAKL 208

Query: 858  LDPGGVYFHSRVIAQGDCTHKHQKGLDIVLGQESAVLILDDTEAVWGKHKENLILMERYH 679
            LDPG +YF+SRVI+Q DCT +HQKGLD+VLGQESAVLILDDTE VW +HK+NLILMERYH
Sbjct: 209  LDPGNLYFNSRVISQADCTQRHQKGLDVVLGQESAVLILDDTEGVWRRHKDNLILMERYH 268

Query: 678  FFASSCRQFGFNCKSLSELRSDESETDGALATVIKILQQIHSLFFDPEH---VDNLEHRD 508
            +F+SSCRQFG++CKSLSEL+ DE+E DGALATV+ +L++IHS FFDPEH    D+   RD
Sbjct: 269  YFSSSCRQFGYSCKSLSELKGDENEADGALATVLGVLKKIHSKFFDPEHGDESDDFAARD 328

Query: 507  VRQVLKSVRKEILKGCKVVFTRVFPTNSQAEHQQIWKMAEQLGATCSTELDPHVTHVVSM 328
            VRQVLK  RKE+LK CK+VF+RVFPT  QA++  +WKMAE+LGATCS ELD  VTHVVS 
Sbjct: 329  VRQVLKQFRKEVLKDCKLVFSRVFPTKFQADNHHLWKMAEKLGATCSMELDSSVTHVVST 388

Query: 327  DAGTDKSRWAIQEKKFLVHPRWIEASNYLWKKQPEENFPV 208
            D+GT+KSRWA+Q  KFLVHPRW+EA+NYLW +QPE+ FPV
Sbjct: 389  DSGTEKSRWAVQNGKFLVHPRWLEAANYLWNRQPEDQFPV 428

