BLASTX nr result

ID: Papaver31_contig00043238 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver31_contig00043238
         (1621 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010645384.1| PREDICTED: RNA polymerase II C-terminal doma...   495   e-137
ref|XP_010647279.1| PREDICTED: RNA polymerase II C-terminal doma...   494   e-137
ref|XP_010265619.1| PREDICTED: RNA polymerase II C-terminal doma...   490   e-135
ref|XP_010265618.1| PREDICTED: RNA polymerase II C-terminal doma...   486   e-134
ref|XP_008242970.1| PREDICTED: RNA polymerase II C-terminal doma...   477   e-131
ref|XP_007204345.1| hypothetical protein PRUPE_ppa005647mg [Prun...   476   e-131
gb|KNA16741.1| hypothetical protein SOVF_085710 isoform B [Spina...   476   e-131
ref|XP_012481529.1| PREDICTED: RNA polymerase II C-terminal doma...   476   e-131
ref|XP_010693335.1| PREDICTED: RNA polymerase II C-terminal doma...   476   e-131
gb|KHG05109.1| RNA polymerase II C-terminal domain phosphatase-l...   476   e-131
ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Popu...   475   e-131
ref|XP_007014445.1| RNA polymerase II ctd phosphatase, putative ...   475   e-131
gb|KNA16740.1| hypothetical protein SOVF_085710 isoform A [Spina...   475   e-131
ref|XP_011018018.1| PREDICTED: RNA polymerase II C-terminal doma...   473   e-130
ref|XP_012078975.1| PREDICTED: RNA polymerase II C-terminal doma...   472   e-130
gb|KNA16742.1| hypothetical protein SOVF_085710 isoform C [Spina...   471   e-130
gb|KJB27893.1| hypothetical protein B456_005G016300 [Gossypium r...   471   e-130
ref|XP_012481530.1| PREDICTED: RNA polymerase II C-terminal doma...   470   e-129
gb|KNA25939.1| hypothetical protein SOVF_002000 isoform B [Spina...   468   e-129
ref|XP_002324546.2| hypothetical protein POPTR_0018s11760g [Popu...   468   e-129

>ref|XP_010645384.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            [Vitis vinifera]
          Length = 458

 Score =  495 bits (1275), Expect = e-137
 Identities = 248/424 (58%), Positives = 312/424 (73%), Gaps = 1/424 (0%)
 Frame = -2

Query: 1473 DHDGEEEEVNVED-ADFAAKRSKRQRLDTLDDDEGIHCSTSLEPVPQSSDVSVQLEICAH 1297
            + + E++E   ED +D   KR KRQ+++  +  E    STS   + Q+ +V++  + C H
Sbjct: 35   EQEAEDDEQEAEDESDSEYKRVKRQKVEEFESIEEHPGSTSDGSLEQNLEVTITKDTCTH 94

Query: 1296 PSLIKGMCVKCGRMVDDVSAVALGYIHKDLKVGSEELARLRNKDLKSLFCKKKXXXXXXX 1117
            P + + +C++CG+ ++  S VA GYIHKDL++GS+E+ARLR+ DLK+L   KK       
Sbjct: 95   PGVFRELCIRCGQKMEGGSGVAFGYIHKDLRLGSDEIARLRDTDLKNLLRHKKLYLVLDL 154

Query: 1116 XXXXLNSTHFQHISPDEEYLKSQTDSLQDLPHGNLFQLDSMRMFTKLRPFVRTFLKEASN 937
                LNST    I+P+E YLK+QTD LQ    GNLF L++M M TKLRP+V TFLKEAS 
Sbjct: 155  DHTLLNSTRLLDITPEELYLKNQTDPLQGGLKGNLFMLNTMHMLTKLRPYVHTFLKEASK 214

Query: 936  MFEMYVYTMGERPYAMEMARLLDPGKIYFDSRVISQADCTQKHQKGLDVVLGAESAVIIL 757
            MFEMY+YTMGER YA+EMA+LLDP ++YF SRVISQADCTQ+HQKGLDVVLG ESAV+IL
Sbjct: 215  MFEMYIYTMGERSYALEMAKLLDPERVYFSSRVISQADCTQRHQKGLDVVLGQESAVLIL 274

Query: 756  DDTEYVWRQHKENLILMDRYHYFSSSGRSFNLNNRSLSELKRDESEPDGALATILEVLKR 577
            DDTE VW++HK+NLILM+RYH+F+SS R F  N +SLSELK DESEPDGALAT+L+VL+R
Sbjct: 275  DDTESVWQKHKDNLILMERYHFFASSCRQFGFNCKSLSELKSDESEPDGALATVLKVLQR 334

Query: 576  IHXXXXXXXXXXXLMRRDVRQVLKTVRSEVLKGCKLVFSRVWKIGEQVENQRLWEVAEEL 397
            IH              RDVRQV+K VR EVLKGCK+VFSRV+    Q EN  LW +AE+L
Sbjct: 335  IHSMFFDPELGDDFSGRDVRQVVKRVRKEVLKGCKIVFSRVFPTRFQAENHHLWRMAEQL 394

Query: 396  GAICCKELDASVTHVVSTDMGTEKSRWAVKYKKFLVHPSWIEAANFLWQRQPEDSFAVND 217
            GA C  ELD SVTHVVSTD GTEKSRWA++ KKFLVHP WIEAAN+ WQ+QPE++F VN 
Sbjct: 395  GATCATELDPSVTHVVSTDAGTEKSRWALQEKKFLVHPGWIEAANYFWQKQPEENFPVNQ 454

Query: 216  SKKE 205
             K +
Sbjct: 455  KKNQ 458


>ref|XP_010647279.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            [Vitis vinifera]
          Length = 466

 Score =  494 bits (1272), Expect = e-137
 Identities = 247/424 (58%), Positives = 312/424 (73%), Gaps = 1/424 (0%)
 Frame = -2

Query: 1473 DHDGEEEEVNVED-ADFAAKRSKRQRLDTLDDDEGIHCSTSLEPVPQSSDVSVQLEICAH 1297
            + + E++E   ED +D   KR KRQ+++  +  E    STS   + Q+ +V++  + C H
Sbjct: 43   EQEAEDDEQEAEDESDSEYKRVKRQKVEEFESIEEHPGSTSDGSLEQNLEVTITKDTCTH 102

Query: 1296 PSLIKGMCVKCGRMVDDVSAVALGYIHKDLKVGSEELARLRNKDLKSLFCKKKXXXXXXX 1117
            P + + +C++CG+ ++  S VA GYIHKDL++GS+E+ARLR+ DLK+L   KK       
Sbjct: 103  PGVFRELCIRCGQKMEGGSGVAFGYIHKDLRLGSDEIARLRDTDLKNLLRHKKLYLVLDL 162

Query: 1116 XXXXLNSTHFQHISPDEEYLKSQTDSLQDLPHGNLFQLDSMRMFTKLRPFVRTFLKEASN 937
                LNST    I+P+E YLK+QTD LQ    GNLF L++M M TKLRP+V TFLKEAS 
Sbjct: 163  DHTLLNSTRLLDITPEELYLKNQTDPLQGGLKGNLFMLNTMHMLTKLRPYVHTFLKEASK 222

Query: 936  MFEMYVYTMGERPYAMEMARLLDPGKIYFDSRVISQADCTQKHQKGLDVVLGAESAVIIL 757
            MFEMY+YTMGER YA+EMA+LLDP ++YF SRVISQADCTQ+HQKGLDVVLG ESAV+IL
Sbjct: 223  MFEMYIYTMGERSYALEMAKLLDPERVYFSSRVISQADCTQRHQKGLDVVLGQESAVLIL 282

Query: 756  DDTEYVWRQHKENLILMDRYHYFSSSGRSFNLNNRSLSELKRDESEPDGALATILEVLKR 577
            DDTE VW++HK+NLILM+RYH+F+SS R F  N +SLSELK DESEPDGALAT+L+VL+R
Sbjct: 283  DDTESVWQKHKDNLILMERYHFFASSCRQFGFNCKSLSELKSDESEPDGALATVLKVLQR 342

Query: 576  IHXXXXXXXXXXXLMRRDVRQVLKTVRSEVLKGCKLVFSRVWKIGEQVENQRLWEVAEEL 397
            IH              RDVRQV+K VR +VLKGCK+VFSRV+    Q EN  LW +AE+L
Sbjct: 343  IHSMFFDPELGDDFSGRDVRQVVKRVRKDVLKGCKIVFSRVFPTRFQAENHHLWRMAEQL 402

Query: 396  GAICCKELDASVTHVVSTDMGTEKSRWAVKYKKFLVHPSWIEAANFLWQRQPEDSFAVND 217
            GA C  ELD SVTHVVSTD GTEKSRWA++ KKFLVHP WIEAAN+ WQ+QPE++F VN 
Sbjct: 403  GATCATELDPSVTHVVSTDAGTEKSRWALQEKKFLVHPGWIEAANYFWQKQPEENFPVNQ 462

Query: 216  SKKE 205
             K +
Sbjct: 463  KKNQ 466


>ref|XP_010265619.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X2 [Nelumbo nucifera]
          Length = 449

 Score =  490 bits (1261), Expect = e-135
 Identities = 256/450 (56%), Positives = 323/450 (71%), Gaps = 2/450 (0%)
 Frame = -2

Query: 1554 FAALLNEELAGIXXXXXXXXXXXXXDQDHDGEEEEVNVEDADFAAKRSKRQRLDTLDDDE 1375
            FAALL+ EL  +               D  G +E+   +D DF  +R K++++D L++ E
Sbjct: 17   FAALLDAELDTVS-------------SDASGGQED---DDEDFNIERIKKRKVDELENVE 60

Query: 1374 GIHCSTSLEPVPQSSDVSVQLEICA-HPSLIKGMCVKCGRMVDDVSAVALGYIHKDLKVG 1198
             +  STS+  + Q  D S + +IC  HP  I+ MC++CG+  +D S VA GYIHKDLK+G
Sbjct: 61   DLQGSTSVGALQQELDTSKE-DICPPHPGFIREMCIRCGQRQEDGSGVAFGYIHKDLKLG 119

Query: 1197 SEELARLRNKDLKSLFCKKKXXXXXXXXXXXLNSTHFQHISPDEEYLKSQTDSLQDLPHG 1018
             EE+ARLR  D K L   +K           LNST    +SP+EEYLK QTDSL D+ +G
Sbjct: 120  MEEIARLRGADHKKLLHGRKLYLVLDLDHTLLNSTRLIDLSPEEEYLKGQTDSLNDVLNG 179

Query: 1017 NLFQLDSMRMFTKLRPFVRTFLKEASNMFEMYVYTMGERPYAMEMARLLDPGKIYFDSRV 838
            +LF+LDSM M TKLRPFV TFLKEAS+MFEMYVYTM ER YA+E+A+LLDPG +YF SRV
Sbjct: 180  SLFRLDSMTMLTKLRPFVHTFLKEASSMFEMYVYTMAERSYALEIAKLLDPGGVYFSSRV 239

Query: 837  ISQADCTQKHQKGLDVVLGAESAVIILDDTEYVWRQHKENLILMDRYHYFSSSGRSFNLN 658
            ISQ +CTQ+HQKGLDVVLGAESAV+ILDDTE VW++H+ENLILM+RYHYFSSS R F  +
Sbjct: 240  ISQDNCTQRHQKGLDVVLGAESAVVILDDTEIVWQKHRENLILMERYHYFSSSCRQFGFS 299

Query: 657  NRSLSELKRDESEPDGALATILEVLKRIHXXXXXXXXXXXLMR-RDVRQVLKTVRSEVLK 481
             +SLSELKRDE E +GALAT+L+VLKRIH            +  RDVRQV+K +R +VLK
Sbjct: 300  AKSLSELKRDECESEGALATVLKVLKRIHEMFFNELVFGADLESRDVRQVMKAIRQDVLK 359

Query: 480  GCKLVFSRVWKIGEQVENQRLWEVAEELGAICCKELDASVTHVVSTDMGTEKSRWAVKYK 301
            GCK+VFSRV+      EN +LW++AE+LGA C  ELD+SVTHVVSTD GTEK+RWAV++K
Sbjct: 360  GCKIVFSRVFPTKFHAENHQLWKIAEQLGATCSTELDSSVTHVVSTDTGTEKARWAVQHK 419

Query: 300  KFLVHPSWIEAANFLWQRQPEDSFAVNDSK 211
            K LVHP WIEA N+ W+RQ E++FAV  +K
Sbjct: 420  KHLVHPQWIEATNYFWERQSEENFAVKKNK 449


>ref|XP_010265618.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X1 [Nelumbo nucifera]
          Length = 451

 Score =  486 bits (1251), Expect = e-134
 Identities = 255/450 (56%), Positives = 323/450 (71%), Gaps = 2/450 (0%)
 Frame = -2

Query: 1554 FAALLNEELAGIXXXXXXXXXXXXXDQDHDGEEEEVNVEDADFAAKRSKRQRLDTLDDDE 1375
            FAALL+ EL  +               D  G +E+ + ED +  + R K++++D L++ E
Sbjct: 17   FAALLDAELDTVS-------------SDASGGQEDDD-EDFNIESFRIKKRKVDELENVE 62

Query: 1374 GIHCSTSLEPVPQSSDVSVQLEICA-HPSLIKGMCVKCGRMVDDVSAVALGYIHKDLKVG 1198
             +  STS+  + Q  D S + +IC  HP  I+ MC++CG+  +D S VA GYIHKDLK+G
Sbjct: 63   DLQGSTSVGALQQELDTSKE-DICPPHPGFIREMCIRCGQRQEDGSGVAFGYIHKDLKLG 121

Query: 1197 SEELARLRNKDLKSLFCKKKXXXXXXXXXXXLNSTHFQHISPDEEYLKSQTDSLQDLPHG 1018
             EE+ARLR  D K L   +K           LNST    +SP+EEYLK QTDSL D+ +G
Sbjct: 122  MEEIARLRGADHKKLLHGRKLYLVLDLDHTLLNSTRLIDLSPEEEYLKGQTDSLNDVLNG 181

Query: 1017 NLFQLDSMRMFTKLRPFVRTFLKEASNMFEMYVYTMGERPYAMEMARLLDPGKIYFDSRV 838
            +LF+LDSM M TKLRPFV TFLKEAS+MFEMYVYTM ER YA+E+A+LLDPG +YF SRV
Sbjct: 182  SLFRLDSMTMLTKLRPFVHTFLKEASSMFEMYVYTMAERSYALEIAKLLDPGGVYFSSRV 241

Query: 837  ISQADCTQKHQKGLDVVLGAESAVIILDDTEYVWRQHKENLILMDRYHYFSSSGRSFNLN 658
            ISQ +CTQ+HQKGLDVVLGAESAV+ILDDTE VW++H+ENLILM+RYHYFSSS R F  +
Sbjct: 242  ISQDNCTQRHQKGLDVVLGAESAVVILDDTEIVWQKHRENLILMERYHYFSSSCRQFGFS 301

Query: 657  NRSLSELKRDESEPDGALATILEVLKRIHXXXXXXXXXXXLMR-RDVRQVLKTVRSEVLK 481
             +SLSELKRDE E +GALAT+L+VLKRIH            +  RDVRQV+K +R +VLK
Sbjct: 302  AKSLSELKRDECESEGALATVLKVLKRIHEMFFNELVFGADLESRDVRQVMKAIRQDVLK 361

Query: 480  GCKLVFSRVWKIGEQVENQRLWEVAEELGAICCKELDASVTHVVSTDMGTEKSRWAVKYK 301
            GCK+VFSRV+      EN +LW++AE+LGA C  ELD+SVTHVVSTD GTEK+RWAV++K
Sbjct: 362  GCKIVFSRVFPTKFHAENHQLWKIAEQLGATCSTELDSSVTHVVSTDTGTEKARWAVQHK 421

Query: 300  KFLVHPSWIEAANFLWQRQPEDSFAVNDSK 211
            K LVHP WIEA N+ W+RQ E++FAV  +K
Sbjct: 422  KHLVHPQWIEATNYFWERQSEENFAVKKNK 451


>ref|XP_008242970.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X1 [Prunus mume]
          Length = 449

 Score =  477 bits (1228), Expect = e-131
 Identities = 241/420 (57%), Positives = 303/420 (72%)
 Frame = -2

Query: 1470 HDGEEEEVNVEDADFAAKRSKRQRLDTLDDDEGIHCSTSLEPVPQSSDVSVQLEICAHPS 1291
            H   +EE + E  D + + +KR++++ L   +  H STS   V ++S+ S + +IC HP 
Sbjct: 30   HSSPDEEADYESDDGSERSTKRRKVENLGSIDKTHGSTSQVFVEENSEASPKTDICTHPG 89

Query: 1290 LIKGMCVKCGRMVDDVSAVALGYIHKDLKVGSEELARLRNKDLKSLFCKKKXXXXXXXXX 1111
             +K +C+ CG+ VD+ S V LGYIHKD  + ++E+ R+R+ D+K     KK         
Sbjct: 90   SVKDLCIVCGQRVDEKSGVPLGYIHKDFWLNNDEIDRVRSTDIKKSLHLKKLYLVLDLDH 149

Query: 1110 XXLNSTHFQHISPDEEYLKSQTDSLQDLPHGNLFQLDSMRMFTKLRPFVRTFLKEASNMF 931
              LNSTH  H++ +EEYL SQTDSLQD+ +G+LF++D M M TKLRPFVR FLKEAS MF
Sbjct: 150  TLLNSTHLNHMTAEEEYLHSQTDSLQDVSNGSLFRVDVMHMMTKLRPFVRKFLKEASEMF 209

Query: 930  EMYVYTMGERPYAMEMARLLDPGKIYFDSRVISQADCTQKHQKGLDVVLGAESAVIILDD 751
            EMY+YTMGER YA+EMA+LLDP K YF  RVIS+ D TQKHQKGLDVVLG ESA +ILDD
Sbjct: 210  EMYIYTMGERAYALEMAKLLDPRKEYFGDRVISRDDGTQKHQKGLDVVLGQESAALILDD 269

Query: 750  TEYVWRQHKENLILMDRYHYFSSSGRSFNLNNRSLSELKRDESEPDGALATILEVLKRIH 571
            TE  W +HK+NLILM+RYH+F SS   F  + +SLSELK DESEP+GALAT+LEVLKR H
Sbjct: 270  TENAWTKHKDNLILMERYHFFRSSCHQFGFHCKSLSELKSDESEPEGALATVLEVLKRTH 329

Query: 570  XXXXXXXXXXXLMRRDVRQVLKTVRSEVLKGCKLVFSRVWKIGEQVENQRLWEVAEELGA 391
                       L+ RDVRQVLKT+R E+LKGCK+VFSRV+    Q EN +LW++AE+LGA
Sbjct: 330  -NMFFYESKDNLIDRDVRQVLKTLRKEILKGCKIVFSRVFPSKFQAENHQLWKMAEQLGA 388

Query: 390  ICCKELDASVTHVVSTDMGTEKSRWAVKYKKFLVHPSWIEAANFLWQRQPEDSFAVNDSK 211
             C  ELD SVTHVVSTD GTEKSRWAVK KKFLVHP WIEA+N++W +Q ED F V  +K
Sbjct: 389  ACSTELDPSVTHVVSTDAGTEKSRWAVKEKKFLVHPQWIEASNYMWLKQAEDKFPVKQTK 448


>ref|XP_007204345.1| hypothetical protein PRUPE_ppa005647mg [Prunus persica]
            gi|462399876|gb|EMJ05544.1| hypothetical protein
            PRUPE_ppa005647mg [Prunus persica]
          Length = 449

 Score =  476 bits (1226), Expect = e-131
 Identities = 242/420 (57%), Positives = 303/420 (72%)
 Frame = -2

Query: 1470 HDGEEEEVNVEDADFAAKRSKRQRLDTLDDDEGIHCSTSLEPVPQSSDVSVQLEICAHPS 1291
            H   +EE + E  D + + +KR++++ L   +    STS   V ++S+ S + +IC HP 
Sbjct: 30   HSSPDEEADYESDDGSERSTKRRKVENLGSIDETQGSTSQIFVEENSEASPKKDICTHPG 89

Query: 1290 LIKGMCVKCGRMVDDVSAVALGYIHKDLKVGSEELARLRNKDLKSLFCKKKXXXXXXXXX 1111
             +K +C+ CG+ VD+ S V LGYIHKD  + ++E+ R+R+ D+K     KK         
Sbjct: 90   SVKDLCIVCGQRVDEKSGVPLGYIHKDFWLNNDEIDRVRSTDIKKSLHLKKLYLVLDLDH 149

Query: 1110 XXLNSTHFQHISPDEEYLKSQTDSLQDLPHGNLFQLDSMRMFTKLRPFVRTFLKEASNMF 931
              LNSTH  H++ +EEYL SQTDSLQD+  G+LF++D M M TKLRPFVR FLKEAS MF
Sbjct: 150  TLLNSTHLNHMTAEEEYLHSQTDSLQDVSDGSLFRVDVMHMMTKLRPFVRKFLKEASEMF 209

Query: 930  EMYVYTMGERPYAMEMARLLDPGKIYFDSRVISQADCTQKHQKGLDVVLGAESAVIILDD 751
            EMY+YTMGER YA+EMA+LLDP K YF  RVIS+ D TQKHQKGLDVVLG ESA +ILDD
Sbjct: 210  EMYIYTMGERAYALEMAKLLDPRKEYFGDRVISRDDGTQKHQKGLDVVLGHESAALILDD 269

Query: 750  TEYVWRQHKENLILMDRYHYFSSSGRSFNLNNRSLSELKRDESEPDGALATILEVLKRIH 571
            TE  W +HK+NLILM+RYH+F SS   F  + +SLSELK DESEP+GALAT+LEVLKRIH
Sbjct: 270  TENAWTKHKDNLILMERYHFFRSSCHQFGFHCKSLSELKSDESEPEGALATVLEVLKRIH 329

Query: 570  XXXXXXXXXXXLMRRDVRQVLKTVRSEVLKGCKLVFSRVWKIGEQVENQRLWEVAEELGA 391
                       L+ RDVRQVLKT+R E+LKGCK+VFSRV+    Q EN +LW++AE+LGA
Sbjct: 330  -NMFFYESKDNLIDRDVRQVLKTLRKEILKGCKIVFSRVFPSKFQAENHQLWKMAEQLGA 388

Query: 390  ICCKELDASVTHVVSTDMGTEKSRWAVKYKKFLVHPSWIEAANFLWQRQPEDSFAVNDSK 211
             C  ELD SVTHVVSTD GTEKSRWAVK KKFLVHP WIEA+N++W +Q ED F VN +K
Sbjct: 389  TCSTELDLSVTHVVSTDAGTEKSRWAVKEKKFLVHPQWIEASNYMWLKQAEDKFPVNQTK 448


>gb|KNA16741.1| hypothetical protein SOVF_085710 isoform B [Spinacia oleracea]
          Length = 452

 Score =  476 bits (1225), Expect = e-131
 Identities = 243/449 (54%), Positives = 311/449 (69%)
 Frame = -2

Query: 1554 FAALLNEELAGIXXXXXXXXXXXXXDQDHDGEEEEVNVEDADFAAKRSKRQRLDTLDDDE 1375
            FAALL+ EL                    DG  ++   +D D  A+  + +R   L+ D 
Sbjct: 17   FAALLDAELDS---------------DSSDGSPDQDCSDDEDNNAEGERTKRCKVLEMDS 61

Query: 1374 GIHCSTSLEPVPQSSDVSVQLEICAHPSLIKGMCVKCGRMVDDVSAVALGYIHKDLKVGS 1195
             +    S         +    + C HP  ++ MC+ CG+ +DD + VA GYIHKDL++GS
Sbjct: 62   RVEVQGSNSNGFTEQTIEAITDSCTHPGFLRDMCICCGKRMDDGAGVAFGYIHKDLRLGS 121

Query: 1194 EELARLRNKDLKSLFCKKKXXXXXXXXXXXLNSTHFQHISPDEEYLKSQTDSLQDLPHGN 1015
            +E++RLR+ D+++L   KK           LNST  + I+ +E+YLKSQTDS +D+  G+
Sbjct: 122  DEVSRLRDADVRTLLRNKKLYLVLDLDHTLLNSTRLEDINSEEDYLKSQTDSFEDISKGS 181

Query: 1014 LFQLDSMRMFTKLRPFVRTFLKEASNMFEMYVYTMGERPYAMEMARLLDPGKIYFDSRVI 835
            LF+LD MRM TKLRP+VRTFL+EASNMFEMY+YTMGER YA+EMA+LLDPG +YF+SRVI
Sbjct: 182  LFRLDKMRMMTKLRPYVRTFLQEASNMFEMYIYTMGERAYAIEMAKLLDPGSLYFNSRVI 241

Query: 834  SQADCTQKHQKGLDVVLGAESAVIILDDTEYVWRQHKENLILMDRYHYFSSSGRSFNLNN 655
            SQADCTQ+HQKGLDVVLG +SAV+ILDDTE VW++HK+NLILM+RYHYFSSS R F  N 
Sbjct: 242  SQADCTQRHQKGLDVVLGKDSAVLILDDTEAVWQRHKDNLILMERYHYFSSSCRQFGFNC 301

Query: 654  RSLSELKRDESEPDGALATILEVLKRIHXXXXXXXXXXXLMRRDVRQVLKTVRSEVLKGC 475
            +SLSELK DE+E DGALAT+L VLK+IH              RDVRQVLK +R+EVL  C
Sbjct: 302  KSLSELKGDENEADGALATVLGVLKKIHSNFFDAEHGNDFAARDVRQVLKKIRNEVLGDC 361

Query: 474  KLVFSRVWKIGEQVENQRLWEVAEELGAICCKELDASVTHVVSTDMGTEKSRWAVKYKKF 295
            K+VFSRV+    Q EN  LW++AE+LGA C  E+D++VTHVVSTD GTEKSRWAV+  K+
Sbjct: 362  KIVFSRVFPTKFQAENHHLWKMAEQLGARCAVEVDSTVTHVVSTDAGTEKSRWAVENGKY 421

Query: 294  LVHPSWIEAANFLWQRQPEDSFAVNDSKK 208
            LVHP W+EAAN+LW ++PE  F V  SKK
Sbjct: 422  LVHPKWLEAANYLWSKKPEQEFPVVLSKK 450


>ref|XP_012481529.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X1 [Gossypium raimondii]
            gi|763760638|gb|KJB27892.1| hypothetical protein
            B456_005G016300 [Gossypium raimondii]
          Length = 470

 Score =  476 bits (1225), Expect = e-131
 Identities = 244/423 (57%), Positives = 304/423 (71%)
 Frame = -2

Query: 1473 DHDGEEEEVNVEDADFAAKRSKRQRLDTLDDDEGIHCSTSLEPVPQSSDVSVQLEICAHP 1294
            D D ++EE +  D D    R+KR + + LDD EG   STS   + +  +VS+  + C HP
Sbjct: 50   DDDSDDEEDDSND-DLNDHRNKRCKTEKLDDLEGPQGSTSQGLIEEKLEVSLNKDTCTHP 108

Query: 1293 SLIKGMCVKCGRMVDDVSAVALGYIHKDLKVGSEELARLRNKDLKSLFCKKKXXXXXXXX 1114
                 MC+ CG+ VDD S V  GYIHK L++G++E+ RLR+ D+K+L   KK        
Sbjct: 109  GSFGQMCILCGQRVDDESGVTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHKKLYLVLDLD 168

Query: 1113 XXXLNSTHFQHISPDEEYLKSQTDSLQDLPHGNLFQLDSMRMFTKLRPFVRTFLKEASNM 934
               LNST   H++ +EEYLK Q+DS+QD+  G+LF L+ M M TKLRPFVRTFLKEAS M
Sbjct: 169  HTLLNSTQLNHLTAEEEYLKGQSDSMQDVSKGSLFMLEFMHMMTKLRPFVRTFLKEASEM 228

Query: 933  FEMYVYTMGERPYAMEMARLLDPGKIYFDSRVISQADCTQKHQKGLDVVLGAESAVIILD 754
            FEMY+YTMG+RPYA+EMA+LLDP K YF+ RVIS+ D TQKHQKGLDVVLG +SAV+ILD
Sbjct: 229  FEMYIYTMGDRPYALEMAKLLDPKKEYFNGRVISRDDGTQKHQKGLDVVLGQDSAVVILD 288

Query: 753  DTEYVWRQHKENLILMDRYHYFSSSGRSFNLNNRSLSELKRDESEPDGALATILEVLKRI 574
            DTE  W +HK+NLILM+RYH+F+SS R F  + RSLS+LK DESEPDGALA+IL++L++I
Sbjct: 289  DTENAWTKHKDNLILMERYHFFASSCRQFGFDCRSLSQLKSDESEPDGALASILKILRQI 348

Query: 573  HXXXXXXXXXXXLMRRDVRQVLKTVRSEVLKGCKLVFSRVWKIGEQVENQRLWEVAEELG 394
            H           L  RDVRQVLKTVR EVLK CK+VFSRV+    Q EN  LW++AE+LG
Sbjct: 349  H-HIFFDELDSDLASRDVRQVLKTVRKEVLKDCKIVFSRVFPTKFQPENHLLWKMAEQLG 407

Query: 393  AICCKELDASVTHVVSTDMGTEKSRWAVKYKKFLVHPSWIEAANFLWQRQPEDSFAVNDS 214
            A C  E D+SVTHVVS D GTEKSRWAVK  KFLVHP WIEAANF W +QPE+ F V+ +
Sbjct: 408  ATCSTETDSSVTHVVSMDAGTEKSRWAVKENKFLVHPRWIEAANFFWLKQPEEKFPVSQT 467

Query: 213  KKE 205
            K +
Sbjct: 468  KNQ 470


>ref|XP_010693335.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            [Beta vulgaris subsp. vulgaris]
            gi|870846670|gb|KMS99186.1| hypothetical protein
            BVRB_2g047230 [Beta vulgaris subsp. vulgaris]
          Length = 434

 Score =  476 bits (1224), Expect = e-131
 Identities = 240/398 (60%), Positives = 296/398 (74%), Gaps = 4/398 (1%)
 Frame = -2

Query: 1386 DDDEGIHCSTSLEPVPQSSDVSVQLE-ICAHPSLIKGMCVKCGRMVDDVSAVALGYIHKD 1210
            D+D  +  +          D  V++E  C HP  ++ +C+ CG+ +DD + VA GYIHKD
Sbjct: 37   DEDNNVEGARMKRRKVLEIDSKVEVEGSCTHPGFLRDLCIGCGKRMDDGAGVAFGYIHKD 96

Query: 1209 LKVGSEELARLRNKDLKSLFCKKKXXXXXXXXXXXLNSTHFQHISPDEEYLKSQTDSLQD 1030
            L++G++E++RLRN D++SL   KK           LNST  + I+ +EEYLKSQTDS QD
Sbjct: 97   LRLGNDEISRLRNADVRSLLRHKKLYLVLDLDHTLLNSTRLEDINSEEEYLKSQTDSFQD 156

Query: 1029 LPHGNLFQLDSMRMFTKLRPFVRTFLKEASNMFEMYVYTMGERPYAMEMARLLDPGKIYF 850
            +  G+LF+LD MRM TKLRP+VRTFL+EAS+MFEMY+YTMGERPYA+EMA+LLDPG +YF
Sbjct: 157  IAKGSLFRLDMMRMMTKLRPYVRTFLEEASSMFEMYIYTMGERPYAIEMAKLLDPGNLYF 216

Query: 849  DSRVISQADCTQKHQKGLDVVLGAESAVIILDDTEYVWRQHKENLILMDRYHYFSSSGRS 670
            +SRVISQADCTQ+HQKGLDVVLG ESAV+ILDDTE VWR+HK+NLILM+RYHYFSSS R 
Sbjct: 217  NSRVISQADCTQRHQKGLDVVLGQESAVLILDDTEGVWRRHKDNLILMERYHYFSSSCRQ 276

Query: 669  FNLNNRSLSELKRDESEPDGALATILEVLKRIH---XXXXXXXXXXXLMRRDVRQVLKTV 499
            F  + +SLSELK DE+E DGALAT+L VLK+IH                 RDVRQVLK  
Sbjct: 277  FGYSCKSLSELKGDENEADGALATVLGVLKKIHSKFFDPEHGDESDDFAARDVRQVLKQF 336

Query: 498  RSEVLKGCKLVFSRVWKIGEQVENQRLWEVAEELGAICCKELDASVTHVVSTDMGTEKSR 319
            R EVLK CKLVFSRV+    Q +N  LW++AE+LGA C  ELD+SVTHVVSTD GTEKSR
Sbjct: 337  RKEVLKDCKLVFSRVFPTKFQADNHHLWKMAEKLGATCSMELDSSVTHVVSTDSGTEKSR 396

Query: 318  WAVKYKKFLVHPSWIEAANFLWQRQPEDSFAVNDSKKE 205
            WAV+  KFLVHP W+EAAN+LW RQPED F V+ +K +
Sbjct: 397  WAVQNGKFLVHPRWLEAANYLWNRQPEDQFPVHLTKSK 434


>gb|KHG05109.1| RNA polymerase II C-terminal domain phosphatase-like 4 [Gossypium
            arboreum]
          Length = 404

 Score =  476 bits (1224), Expect = e-131
 Identities = 238/404 (58%), Positives = 301/404 (74%)
 Frame = -2

Query: 1416 RSKRQRLDTLDDDEGIHCSTSLEPVPQSSDVSVQLEICAHPSLIKGMCVKCGRMVDDVSA 1237
            R+KR + + LDD EG+  STS   + +  +VS+  + C+HP     MC+ CG+ VDD S+
Sbjct: 2    RNKRCKTEKLDDLEGLQGSTSQGLIEEKLEVSLNKDTCSHPGSFGQMCILCGQRVDDESS 61

Query: 1236 VALGYIHKDLKVGSEELARLRNKDLKSLFCKKKXXXXXXXXXXXLNSTHFQHISPDEEYL 1057
            V  GYIHK L++G++E+ RLR+ D+K+L   KK           LNST   H++ +EEYL
Sbjct: 62   VTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHKKLYLVLDLDHTLLNSTQLNHLTAEEEYL 121

Query: 1056 KSQTDSLQDLPHGNLFQLDSMRMFTKLRPFVRTFLKEASNMFEMYVYTMGERPYAMEMAR 877
            K Q+DSLQD+  G+LF L+ M+M TKLRPFVRTFLKEAS MFEMY+YTMG+RPYA+EMA+
Sbjct: 122  KGQSDSLQDVSKGSLFMLEFMQMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEMAK 181

Query: 876  LLDPGKIYFDSRVISQADCTQKHQKGLDVVLGAESAVIILDDTEYVWRQHKENLILMDRY 697
            LLDP K YF+ RVIS+ D TQKHQKGLDVVLG +SAV+ILDDTE  W +HK+NLILM+RY
Sbjct: 182  LLDPKKEYFNGRVISRDDGTQKHQKGLDVVLGQDSAVVILDDTENAWTKHKDNLILMERY 241

Query: 696  HYFSSSGRSFNLNNRSLSELKRDESEPDGALATILEVLKRIHXXXXXXXXXXXLMRRDVR 517
            H+F+SS R F  + +SLS+LK DESEPDGALA+IL++L++IH           L  RDVR
Sbjct: 242  HFFASSCRQFGFDCKSLSQLKSDESEPDGALASILKILRQIH-HIFFDELDSDLASRDVR 300

Query: 516  QVLKTVRSEVLKGCKLVFSRVWKIGEQVENQRLWEVAEELGAICCKELDASVTHVVSTDM 337
            QVLKTVR EVLK CK+VFSRV+    Q EN  LW++AE+LGA C  E D+SVTH+VS D 
Sbjct: 301  QVLKTVRKEVLKNCKIVFSRVFPTKFQPENHLLWKMAEQLGATCSTETDSSVTHIVSMDA 360

Query: 336  GTEKSRWAVKYKKFLVHPSWIEAANFLWQRQPEDSFAVNDSKKE 205
            GTEKSRWAVK  KFLVHP WIEAANF WQ+QPE++F V+ +K +
Sbjct: 361  GTEKSRWAVKENKFLVHPRWIEAANFFWQKQPEENFPVSQTKNQ 404


>ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa]
            gi|550318538|gb|EEF03112.2| hypothetical protein
            POPTR_0018s11760g [Populus trichocarpa]
          Length = 472

 Score =  475 bits (1223), Expect = e-131
 Identities = 239/422 (56%), Positives = 303/422 (71%)
 Frame = -2

Query: 1476 QDHDGEEEEVNVEDADFAAKRSKRQRLDTLDDDEGIHCSTSLEPVPQSSDVSVQLEICAH 1297
            QD + EE++    D+DF  KR KR +++T++  E    +TS   +  +S+ S+  EIC H
Sbjct: 55   QDKEAEEDD----DSDFQRKRVKRSKVETVEIVEDDGGTTSFASLKHNSEASISKEICTH 110

Query: 1296 PSLIKGMCVKCGRMVDDVSAVALGYIHKDLKVGSEELARLRNKDLKSLFCKKKXXXXXXX 1117
            P     MC+ CG+++D  S V  GYIHK L++G++E+ RLRN D+K+L   KK       
Sbjct: 111  PGSFGTMCIVCGQLLDGESGVTFGYIHKGLRLGNDEIVRLRNTDMKNLLRHKKLYLILDL 170

Query: 1116 XXXXLNSTHFQHISPDEEYLKSQTDSLQDLPHGNLFQLDSMRMFTKLRPFVRTFLKEASN 937
                LNST   H++ DEEYL  QTDSLQD+  G+LF L SM+M TKLRPFVRTFLKEAS 
Sbjct: 171  DHTLLNSTQLMHMTLDEEYLNGQTDSLQDVSKGSLFMLSSMQMMTKLRPFVRTFLKEASQ 230

Query: 936  MFEMYVYTMGERPYAMEMARLLDPGKIYFDSRVISQADCTQKHQKGLDVVLGAESAVIIL 757
            MFEMY+YTMG+R YA+EMA+LLDPG+ YF+++VIS+ D TQ+HQKGLDVVLG ESAV+IL
Sbjct: 231  MFEMYIYTMGDRAYALEMAKLLDPGREYFNAKVISRDDGTQRHQKGLDVVLGQESAVLIL 290

Query: 756  DDTEYVWRQHKENLILMDRYHYFSSSGRSFNLNNRSLSELKRDESEPDGALATILEVLKR 577
            DDTE  W +HK+NLILM+RYH+F+SS   F  N +SLSE K DESE +GALA+IL+VL++
Sbjct: 291  DDTENAWMKHKDNLILMERYHFFASSCHQFGFNCKSLSEQKTDESESEGALASILKVLRK 350

Query: 576  IHXXXXXXXXXXXLMRRDVRQVLKTVRSEVLKGCKLVFSRVWKIGEQVENQRLWEVAEEL 397
            IH              RDVRQVLKTVR +VLKGCK+VFSRV+    Q +N  LW +AE+L
Sbjct: 351  IHQIFFEELEENMD-GRDVRQVLKTVRKDVLKGCKIVFSRVFPTQSQADNHHLWRMAEQL 409

Query: 396  GAICCKELDASVTHVVSTDMGTEKSRWAVKYKKFLVHPSWIEAANFLWQRQPEDSFAVND 217
            GA C  ELD SVTHVVS D GTEKS WA+K+ KFLV P WIEAAN+ WQRQPE++F+ N 
Sbjct: 410  GATCSTELDPSVTHVVSKDSGTEKSHWALKHNKFLVQPGWIEAANYFWQRQPEENFSFNQ 469

Query: 216  SK 211
             K
Sbjct: 470  IK 471


>ref|XP_007014445.1| RNA polymerase II ctd phosphatase, putative isoform 1 [Theobroma
            cacao] gi|508784808|gb|EOY32064.1| RNA polymerase II ctd
            phosphatase, putative isoform 1 [Theobroma cacao]
          Length = 469

 Score =  475 bits (1223), Expect = e-131
 Identities = 247/454 (54%), Positives = 315/454 (69%), Gaps = 4/454 (0%)
 Frame = -2

Query: 1554 FAALLNEELAGIXXXXXXXXXXXXXDQDHDGEEEEVNVEDADFAAKRSKRQRLDTLDDDE 1375
            FAALL+ EL                D D++ +  + + +D D  ++R+KR + + L+D E
Sbjct: 17   FAALLDAELEVGSSGSSPDEEDVEADGDNNNDNNDDHDDDDDLDSQRNKRCKTEKLEDLE 76

Query: 1374 GIHCSTSL----EPVPQSSDVSVQLEICAHPSLIKGMCVKCGRMVDDVSAVALGYIHKDL 1207
                STS     + +   +++S++ +IC HP     MC+ CG+ +DD S V  GYIHK L
Sbjct: 77   ESRGSTSQGLIEDKIVIHAELSLKKDICTHPGSFGQMCILCGQRLDDESGVTFGYIHKGL 136

Query: 1206 KVGSEELARLRNKDLKSLFCKKKXXXXXXXXXXXLNSTHFQHISPDEEYLKSQTDSLQDL 1027
            ++G++E+ RLR+ D+K+L   KK           LNST   H++PDEEYLK Q+DSLQD+
Sbjct: 137  RLGNDEIVRLRSTDMKNLLRHKKLYLVLDLDHTLLNSTQLMHLTPDEEYLKGQSDSLQDV 196

Query: 1026 PHGNLFQLDSMRMFTKLRPFVRTFLKEASNMFEMYVYTMGERPYAMEMARLLDPGKIYFD 847
              G+LF LD M M TKLRPFVRTFLKEAS MFEMY+YTMG+RPYA+EMA+LLDP + YF 
Sbjct: 197  SRGSLFMLDFMHMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEMAKLLDPRREYFS 256

Query: 846  SRVISQADCTQKHQKGLDVVLGAESAVIILDDTEYVWRQHKENLILMDRYHYFSSSGRSF 667
             RVIS+ D TQKHQKGLDVVLG ESAV+ILDDTE  W +HK+NLILM+RYHYF+SS   F
Sbjct: 257  DRVISRDDGTQKHQKGLDVVLGQESAVVILDDTENAWMKHKDNLILMERYHYFASSCHQF 316

Query: 666  NLNNRSLSELKRDESEPDGALATILEVLKRIHXXXXXXXXXXXLMRRDVRQVLKTVRSEV 487
                +SLS+LK DESEPDGALA++L+ L++IH           L  RDVRQVLKTV+ EV
Sbjct: 317  GYKCKSLSQLKSDESEPDGALASVLKALRQIH-HMFFDELDCNLASRDVRQVLKTVQEEV 375

Query: 486  LKGCKLVFSRVWKIGEQVENQRLWEVAEELGAICCKELDASVTHVVSTDMGTEKSRWAVK 307
            LKGCK+VFS V+      E+  LW++AE+LGA C  E D SVTHVVSTD GTEKSRWAVK
Sbjct: 376  LKGCKIVFSHVFPTNFPAESHPLWKMAEQLGATCSTETDLSVTHVVSTDAGTEKSRWAVK 435

Query: 306  YKKFLVHPSWIEAANFLWQRQPEDSFAVNDSKKE 205
             KKFLVHP WIEA N+LWQ+QPE++F V+  K +
Sbjct: 436  EKKFLVHPRWIEATNYLWQKQPEENFPVSQGKNQ 469


>gb|KNA16740.1| hypothetical protein SOVF_085710 isoform A [Spinacia oleracea]
          Length = 453

 Score =  475 bits (1222), Expect = e-131
 Identities = 244/450 (54%), Positives = 313/450 (69%), Gaps = 1/450 (0%)
 Frame = -2

Query: 1554 FAALLNEELAGIXXXXXXXXXXXXXDQDHDGEEEEVNVEDADFAAKRSKRQRLDTLDDDE 1375
            FAALL+ EL                    DG  ++   +D D  A+  + +R   L+ D 
Sbjct: 17   FAALLDAELDS---------------DSSDGSPDQDCSDDEDNNAEGERTKRCKVLEMDS 61

Query: 1374 GIHCSTS-LEPVPQSSDVSVQLEICAHPSLIKGMCVKCGRMVDDVSAVALGYIHKDLKVG 1198
             +    S      + + V    + C HP  ++ MC+ CG+ +DD + VA GYIHKDL++G
Sbjct: 62   RVEVQGSNSNGFTEQTIVEAITDSCTHPGFLRDMCICCGKRMDDGAGVAFGYIHKDLRLG 121

Query: 1197 SEELARLRNKDLKSLFCKKKXXXXXXXXXXXLNSTHFQHISPDEEYLKSQTDSLQDLPHG 1018
            S+E++RLR+ D+++L   KK           LNST  + I+ +E+YLKSQTDS +D+  G
Sbjct: 122  SDEVSRLRDADVRTLLRNKKLYLVLDLDHTLLNSTRLEDINSEEDYLKSQTDSFEDISKG 181

Query: 1017 NLFQLDSMRMFTKLRPFVRTFLKEASNMFEMYVYTMGERPYAMEMARLLDPGKIYFDSRV 838
            +LF+LD MRM TKLRP+VRTFL+EASNMFEMY+YTMGER YA+EMA+LLDPG +YF+SRV
Sbjct: 182  SLFRLDKMRMMTKLRPYVRTFLQEASNMFEMYIYTMGERAYAIEMAKLLDPGSLYFNSRV 241

Query: 837  ISQADCTQKHQKGLDVVLGAESAVIILDDTEYVWRQHKENLILMDRYHYFSSSGRSFNLN 658
            ISQADCTQ+HQKGLDVVLG +SAV+ILDDTE VW++HK+NLILM+RYHYFSSS R F  N
Sbjct: 242  ISQADCTQRHQKGLDVVLGKDSAVLILDDTEAVWQRHKDNLILMERYHYFSSSCRQFGFN 301

Query: 657  NRSLSELKRDESEPDGALATILEVLKRIHXXXXXXXXXXXLMRRDVRQVLKTVRSEVLKG 478
             +SLSELK DE+E DGALAT+L VLK+IH              RDVRQVLK +R+EVL  
Sbjct: 302  CKSLSELKGDENEADGALATVLGVLKKIHSNFFDAEHGNDFAARDVRQVLKKIRNEVLGD 361

Query: 477  CKLVFSRVWKIGEQVENQRLWEVAEELGAICCKELDASVTHVVSTDMGTEKSRWAVKYKK 298
            CK+VFSRV+    Q EN  LW++AE+LGA C  E+D++VTHVVSTD GTEKSRWAV+  K
Sbjct: 362  CKIVFSRVFPTKFQAENHHLWKMAEQLGARCAVEVDSTVTHVVSTDAGTEKSRWAVENGK 421

Query: 297  FLVHPSWIEAANFLWQRQPEDSFAVNDSKK 208
            +LVHP W+EAAN+LW ++PE  F V  SKK
Sbjct: 422  YLVHPKWLEAANYLWSKKPEQEFPVVLSKK 451


>ref|XP_011018018.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            [Populus euphratica]
          Length = 472

 Score =  473 bits (1218), Expect = e-130
 Identities = 240/422 (56%), Positives = 302/422 (71%)
 Frame = -2

Query: 1476 QDHDGEEEEVNVEDADFAAKRSKRQRLDTLDDDEGIHCSTSLEPVPQSSDVSVQLEICAH 1297
            QD + EE++    D+DF +KR KR +++TL+  E    + SL  +  +S+VS+  EIC H
Sbjct: 55   QDKEAEEDD----DSDFQSKRVKRSKVETLEIVEDDGGAASLASLKHNSEVSISKEICTH 110

Query: 1296 PSLIKGMCVKCGRMVDDVSAVALGYIHKDLKVGSEELARLRNKDLKSLFCKKKXXXXXXX 1117
            P     MC+ CG+++D  S V  GYIHK L++G++E+ RLRN D+K+L   KK       
Sbjct: 111  PGSFGTMCIVCGQLLDGESGVTFGYIHKGLRLGNDEIVRLRNTDMKNLLRHKKLYLILDL 170

Query: 1116 XXXXLNSTHFQHISPDEEYLKSQTDSLQDLPHGNLFQLDSMRMFTKLRPFVRTFLKEASN 937
                LNST   H++ DEEYL  QT SLQD+  G+LF L SM+M TKLRPFVRTFLKEAS 
Sbjct: 171  DHTLLNSTQLMHMTLDEEYLNGQTASLQDVSKGSLFMLSSMQMMTKLRPFVRTFLKEASQ 230

Query: 936  MFEMYVYTMGERPYAMEMARLLDPGKIYFDSRVISQADCTQKHQKGLDVVLGAESAVIIL 757
            MFEMY+YTMG+R YA+EMA+LLDPG+ YF+++VIS+ D TQ+HQKGLDVVLG ESAV+IL
Sbjct: 231  MFEMYIYTMGDRAYALEMAKLLDPGREYFNAKVISRDDGTQRHQKGLDVVLGQESAVLIL 290

Query: 756  DDTEYVWRQHKENLILMDRYHYFSSSGRSFNLNNRSLSELKRDESEPDGALATILEVLKR 577
            DDTE  W +HK+NLILM+RYH+F+SS   F  N +SLSE   DESE +GALA+IL+VL++
Sbjct: 291  DDTENAWMKHKDNLILMERYHFFASSCHQFGFNCKSLSEQNTDESESEGALASILKVLRK 350

Query: 576  IHXXXXXXXXXXXLMRRDVRQVLKTVRSEVLKGCKLVFSRVWKIGEQVENQRLWEVAEEL 397
            IH              RDVRQVLKTVR +VLKGCK+VFSRV+    Q  N  LW +AE+L
Sbjct: 351  IHQIFFEELEENMD-GRDVRQVLKTVRKDVLKGCKIVFSRVFPTQSQANNHHLWRMAEQL 409

Query: 396  GAICCKELDASVTHVVSTDMGTEKSRWAVKYKKFLVHPSWIEAANFLWQRQPEDSFAVND 217
            GA C  ELD SVTHVVS D GTEKS WA K+ KFLV P WIEAAN+ WQRQPE++F+VN 
Sbjct: 410  GATCSTELDPSVTHVVSKDSGTEKSHWASKHNKFLVQPGWIEAANYFWQRQPEENFSVNQ 469

Query: 216  SK 211
             K
Sbjct: 470  IK 471


>ref|XP_012078975.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            [Jatropha curcas] gi|802640739|ref|XP_012078976.1|
            PREDICTED: RNA polymerase II C-terminal domain
            phosphatase-like 4 [Jatropha curcas]
            gi|643722394|gb|KDP32215.1| hypothetical protein
            JCGZ_13822 [Jatropha curcas]
          Length = 470

 Score =  472 bits (1215), Expect = e-130
 Identities = 244/454 (53%), Positives = 317/454 (69%), Gaps = 4/454 (0%)
 Frame = -2

Query: 1554 FAALLNEELAGIXXXXXXXXXXXXXDQDHDGEEEEVNVE----DADFAAKRSKRQRLDTL 1387
            FAALL+ EL                +++ + EEEE   +    D D  +KR KR R++TL
Sbjct: 17   FAALLDAELDSKSSDSSPNDDDEEEEEEEEEEEEEEAKDEPEDDPDIESKRIKRSRVETL 76

Query: 1386 DDDEGIHCSTSLEPVPQSSDVSVQLEICAHPSLIKGMCVKCGRMVDDVSAVALGYIHKDL 1207
            ++ E    ST    +  +   S     C HP     MC+ CG+ +++ + V L YIHK L
Sbjct: 77   ENVEDPKGSTFHGSLDLNLGASSSKVACTHPGSFGDMCIICGQRLNEETGVTLAYIHKGL 136

Query: 1206 KVGSEELARLRNKDLKSLFCKKKXXXXXXXXXXXLNSTHFQHISPDEEYLKSQTDSLQDL 1027
            ++G++E+ RLRN D K+L   KK           LNST   H++ +EEYLKSQ DSLQD+
Sbjct: 137  RLGNDEIVRLRNSDTKNLLRHKKLYLVLDLDHTLLNSTQLMHMTAEEEYLKSQLDSLQDV 196

Query: 1026 PHGNLFQLDSMRMFTKLRPFVRTFLKEASNMFEMYVYTMGERPYAMEMARLLDPGKIYFD 847
             +G+LF+LD M M TKLRP+V TFLKEAS MFEMY+YTMG+R YA+EMA+LLDP + YF+
Sbjct: 197  SNGSLFKLDFMHMMTKLRPYVHTFLKEASQMFEMYIYTMGDRAYALEMAKLLDPRREYFN 256

Query: 846  SRVISQADCTQKHQKGLDVVLGAESAVIILDDTEYVWRQHKENLILMDRYHYFSSSGRSF 667
            +RVIS+ D TQ+HQKGLD+VLG ESAV+ILDDTE  W +HK+NLILM+RYH+F+SS   F
Sbjct: 257  ARVISRDDGTQRHQKGLDIVLGQESAVLILDDTETAWTKHKDNLILMERYHFFASSCHQF 316

Query: 666  NLNNRSLSELKRDESEPDGALATILEVLKRIHXXXXXXXXXXXLMRRDVRQVLKTVRSEV 487
              + +SLSELK DES+ DGALA++L+VL+RIH           L  RDVRQVLKTVR +V
Sbjct: 317  GFSCKSLSELKSDESDSDGALASVLKVLRRIHHIFFDELMDVNLDSRDVRQVLKTVRKDV 376

Query: 486  LKGCKLVFSRVWKIGEQVENQRLWEVAEELGAICCKELDASVTHVVSTDMGTEKSRWAVK 307
            L+GCK+VFSRV+    Q  N +LW++AE+LGAIC  ELD+S+THVVST+ GTEKSRWA+K
Sbjct: 377  LEGCKIVFSRVFPTQFQANNHQLWKMAEQLGAICSTELDSSITHVVSTEAGTEKSRWAMK 436

Query: 306  YKKFLVHPSWIEAANFLWQRQPEDSFAVNDSKKE 205
             KKFLVHP WIEAAN+LWQRQPE++F+VN  K +
Sbjct: 437  NKKFLVHPRWIEAANYLWQRQPEENFSVNQPKHQ 470


>gb|KNA16742.1| hypothetical protein SOVF_085710 isoform C [Spinacia oleracea]
          Length = 450

 Score =  471 bits (1212), Expect = e-130
 Identities = 243/449 (54%), Positives = 315/449 (70%)
 Frame = -2

Query: 1554 FAALLNEELAGIXXXXXXXXXXXXXDQDHDGEEEEVNVEDADFAAKRSKRQRLDTLDDDE 1375
            FAALL+ EL                  D   +++  + ED +   +R K   +D+  + +
Sbjct: 17   FAALLDAELDS-------------DSSDGSPDQDCSDDEDNNAEGERCKVLEMDSRVEVQ 63

Query: 1374 GIHCSTSLEPVPQSSDVSVQLEICAHPSLIKGMCVKCGRMVDDVSAVALGYIHKDLKVGS 1195
            G + +   E     + V    + C HP  ++ MC+ CG+ +DD + VA GYIHKDL++GS
Sbjct: 64   GSNSNGFTE----QTIVEAITDSCTHPGFLRDMCICCGKRMDDGAGVAFGYIHKDLRLGS 119

Query: 1194 EELARLRNKDLKSLFCKKKXXXXXXXXXXXLNSTHFQHISPDEEYLKSQTDSLQDLPHGN 1015
            +E++RLR+ D+++L   KK           LNST  + I+ +E+YLKSQTDS +D+  G+
Sbjct: 120  DEVSRLRDADVRTLLRNKKLYLVLDLDHTLLNSTRLEDINSEEDYLKSQTDSFEDISKGS 179

Query: 1014 LFQLDSMRMFTKLRPFVRTFLKEASNMFEMYVYTMGERPYAMEMARLLDPGKIYFDSRVI 835
            LF+LD MRM TKLRP+VRTFL+EASNMFEMY+YTMGER YA+EMA+LLDPG +YF+SRVI
Sbjct: 180  LFRLDKMRMMTKLRPYVRTFLQEASNMFEMYIYTMGERAYAIEMAKLLDPGSLYFNSRVI 239

Query: 834  SQADCTQKHQKGLDVVLGAESAVIILDDTEYVWRQHKENLILMDRYHYFSSSGRSFNLNN 655
            SQADCTQ+HQKGLDVVLG +SAV+ILDDTE VW++HK+NLILM+RYHYFSSS R F  N 
Sbjct: 240  SQADCTQRHQKGLDVVLGKDSAVLILDDTEAVWQRHKDNLILMERYHYFSSSCRQFGFNC 299

Query: 654  RSLSELKRDESEPDGALATILEVLKRIHXXXXXXXXXXXLMRRDVRQVLKTVRSEVLKGC 475
            +SLSELK DE+E DGALAT+L VLK+IH              RDVRQVLK +R+EVL  C
Sbjct: 300  KSLSELKGDENEADGALATVLGVLKKIHSNFFDAEHGNDFAARDVRQVLKKIRNEVLGDC 359

Query: 474  KLVFSRVWKIGEQVENQRLWEVAEELGAICCKELDASVTHVVSTDMGTEKSRWAVKYKKF 295
            K+VFSRV+    Q EN  LW++AE+LGA C  E+D++VTHVVSTD GTEKSRWAV+  K+
Sbjct: 360  KIVFSRVFPTKFQAENHHLWKMAEQLGARCAVEVDSTVTHVVSTDAGTEKSRWAVENGKY 419

Query: 294  LVHPSWIEAANFLWQRQPEDSFAVNDSKK 208
            LVHP W+EAAN+LW ++PE  F V  SKK
Sbjct: 420  LVHPKWLEAANYLWSKKPEQEFPVVLSKK 448


>gb|KJB27893.1| hypothetical protein B456_005G016300 [Gossypium raimondii]
          Length = 469

 Score =  471 bits (1212), Expect = e-130
 Identities = 244/423 (57%), Positives = 304/423 (71%)
 Frame = -2

Query: 1473 DHDGEEEEVNVEDADFAAKRSKRQRLDTLDDDEGIHCSTSLEPVPQSSDVSVQLEICAHP 1294
            D D ++EE +  D D    R+KR + + LDD EG   STS + + +   VS+  + C HP
Sbjct: 50   DDDSDDEEDDSND-DLNDHRNKRCKTEKLDDLEGPQGSTS-QGLIEEKLVSLNKDTCTHP 107

Query: 1293 SLIKGMCVKCGRMVDDVSAVALGYIHKDLKVGSEELARLRNKDLKSLFCKKKXXXXXXXX 1114
                 MC+ CG+ VDD S V  GYIHK L++G++E+ RLR+ D+K+L   KK        
Sbjct: 108  GSFGQMCILCGQRVDDESGVTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHKKLYLVLDLD 167

Query: 1113 XXXLNSTHFQHISPDEEYLKSQTDSLQDLPHGNLFQLDSMRMFTKLRPFVRTFLKEASNM 934
               LNST   H++ +EEYLK Q+DS+QD+  G+LF L+ M M TKLRPFVRTFLKEAS M
Sbjct: 168  HTLLNSTQLNHLTAEEEYLKGQSDSMQDVSKGSLFMLEFMHMMTKLRPFVRTFLKEASEM 227

Query: 933  FEMYVYTMGERPYAMEMARLLDPGKIYFDSRVISQADCTQKHQKGLDVVLGAESAVIILD 754
            FEMY+YTMG+RPYA+EMA+LLDP K YF+ RVIS+ D TQKHQKGLDVVLG +SAV+ILD
Sbjct: 228  FEMYIYTMGDRPYALEMAKLLDPKKEYFNGRVISRDDGTQKHQKGLDVVLGQDSAVVILD 287

Query: 753  DTEYVWRQHKENLILMDRYHYFSSSGRSFNLNNRSLSELKRDESEPDGALATILEVLKRI 574
            DTE  W +HK+NLILM+RYH+F+SS R F  + RSLS+LK DESEPDGALA+IL++L++I
Sbjct: 288  DTENAWTKHKDNLILMERYHFFASSCRQFGFDCRSLSQLKSDESEPDGALASILKILRQI 347

Query: 573  HXXXXXXXXXXXLMRRDVRQVLKTVRSEVLKGCKLVFSRVWKIGEQVENQRLWEVAEELG 394
            H           L  RDVRQVLKTVR EVLK CK+VFSRV+    Q EN  LW++AE+LG
Sbjct: 348  H-HIFFDELDSDLASRDVRQVLKTVRKEVLKDCKIVFSRVFPTKFQPENHLLWKMAEQLG 406

Query: 393  AICCKELDASVTHVVSTDMGTEKSRWAVKYKKFLVHPSWIEAANFLWQRQPEDSFAVNDS 214
            A C  E D+SVTHVVS D GTEKSRWAVK  KFLVHP WIEAANF W +QPE+ F V+ +
Sbjct: 407  ATCSTETDSSVTHVVSMDAGTEKSRWAVKENKFLVHPRWIEAANFFWLKQPEEKFPVSQT 466

Query: 213  KKE 205
            K +
Sbjct: 467  KNQ 469


>ref|XP_012481530.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4
            isoform X2 [Gossypium raimondii]
          Length = 404

 Score =  470 bits (1209), Expect = e-129
 Identities = 238/404 (58%), Positives = 295/404 (73%)
 Frame = -2

Query: 1416 RSKRQRLDTLDDDEGIHCSTSLEPVPQSSDVSVQLEICAHPSLIKGMCVKCGRMVDDVSA 1237
            R+KR + + LDD EG   STS   + +  +VS+  + C HP     MC+ CG+ VDD S 
Sbjct: 2    RNKRCKTEKLDDLEGPQGSTSQGLIEEKLEVSLNKDTCTHPGSFGQMCILCGQRVDDESG 61

Query: 1236 VALGYIHKDLKVGSEELARLRNKDLKSLFCKKKXXXXXXXXXXXLNSTHFQHISPDEEYL 1057
            V  GYIHK L++G++E+ RLR+ D+K+L   KK           LNST   H++ +EEYL
Sbjct: 62   VTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHKKLYLVLDLDHTLLNSTQLNHLTAEEEYL 121

Query: 1056 KSQTDSLQDLPHGNLFQLDSMRMFTKLRPFVRTFLKEASNMFEMYVYTMGERPYAMEMAR 877
            K Q+DS+QD+  G+LF L+ M M TKLRPFVRTFLKEAS MFEMY+YTMG+RPYA+EMA+
Sbjct: 122  KGQSDSMQDVSKGSLFMLEFMHMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEMAK 181

Query: 876  LLDPGKIYFDSRVISQADCTQKHQKGLDVVLGAESAVIILDDTEYVWRQHKENLILMDRY 697
            LLDP K YF+ RVIS+ D TQKHQKGLDVVLG +SAV+ILDDTE  W +HK+NLILM+RY
Sbjct: 182  LLDPKKEYFNGRVISRDDGTQKHQKGLDVVLGQDSAVVILDDTENAWTKHKDNLILMERY 241

Query: 696  HYFSSSGRSFNLNNRSLSELKRDESEPDGALATILEVLKRIHXXXXXXXXXXXLMRRDVR 517
            H+F+SS R F  + RSLS+LK DESEPDGALA+IL++L++IH           L  RDVR
Sbjct: 242  HFFASSCRQFGFDCRSLSQLKSDESEPDGALASILKILRQIH-HIFFDELDSDLASRDVR 300

Query: 516  QVLKTVRSEVLKGCKLVFSRVWKIGEQVENQRLWEVAEELGAICCKELDASVTHVVSTDM 337
            QVLKTVR EVLK CK+VFSRV+    Q EN  LW++AE+LGA C  E D+SVTHVVS D 
Sbjct: 301  QVLKTVRKEVLKDCKIVFSRVFPTKFQPENHLLWKMAEQLGATCSTETDSSVTHVVSMDA 360

Query: 336  GTEKSRWAVKYKKFLVHPSWIEAANFLWQRQPEDSFAVNDSKKE 205
            GTEKSRWAVK  KFLVHP WIEAANF W +QPE+ F V+ +K +
Sbjct: 361  GTEKSRWAVKENKFLVHPRWIEAANFFWLKQPEEKFPVSQTKNQ 404


>gb|KNA25939.1| hypothetical protein SOVF_002000 isoform B [Spinacia oleracea]
          Length = 452

 Score =  468 bits (1203), Expect = e-129
 Identities = 240/451 (53%), Positives = 315/451 (69%), Gaps = 1/451 (0%)
 Frame = -2

Query: 1554 FAALLNEELAGIXXXXXXXXXXXXXDQDHD-GEEEEVNVEDADFAAKRSKRQRLDTLDDD 1378
            FAALL+ EL                  D D  ++E+ N E A    KR K   +DT  + 
Sbjct: 17   FAALLDAELDS---------DSSDGSPDQDCSDDEDNNAEGARI--KRRKVLEMDTRVEV 65

Query: 1377 EGIHCSTSLEPVPQSSDVSVQLEICAHPSLIKGMCVKCGRMVDDVSAVALGYIHKDLKVG 1198
            +G + +   E   + +      + C HP  ++ MC+ CG+ +DD + VA GYIHKDL++G
Sbjct: 66   QGSNSNGFTEKAIEEATT----DSCTHPGFLRDMCICCGKRMDDGAGVAFGYIHKDLRLG 121

Query: 1197 SEELARLRNKDLKSLFCKKKXXXXXXXXXXXLNSTHFQHISPDEEYLKSQTDSLQDLPHG 1018
            S+E++RLR+ D+++L   KK           LNST  + I+ +E+YLKSQTDS +D+  G
Sbjct: 122  SDEVSRLRDADVRTLLRNKKLYLVLDLDHTLLNSTRLEDINSEEDYLKSQTDSFEDISKG 181

Query: 1017 NLFQLDSMRMFTKLRPFVRTFLKEASNMFEMYVYTMGERPYAMEMARLLDPGKIYFDSRV 838
            +LF+LD +RM TKLRP+VRTFL+EASNMFEMY+YTMGER YA+EMA+LLDPG +YF+SRV
Sbjct: 182  SLFRLDKIRMMTKLRPYVRTFLQEASNMFEMYIYTMGERAYAIEMAKLLDPGNVYFNSRV 241

Query: 837  ISQADCTQKHQKGLDVVLGAESAVIILDDTEYVWRQHKENLILMDRYHYFSSSGRSFNLN 658
            IS+ADCT++HQKGLDVVLG +SAV+ILDDTE VW++HK+NLILM+RYHYFSSS R F  N
Sbjct: 242  ISKADCTRRHQKGLDVVLGKDSAVLILDDTEAVWQRHKDNLILMERYHYFSSSCRQFGFN 301

Query: 657  NRSLSELKRDESEPDGALATILEVLKRIHXXXXXXXXXXXLMRRDVRQVLKTVRSEVLKG 478
             +SLSELK DE+E DGALAT+L VLK+IH              RDVRQVL+ +R+E+L+G
Sbjct: 302  CKSLSELKGDENEADGALATVLGVLKKIHSNFFNPEHGDDFAARDVRQVLRKIRNEILRG 361

Query: 477  CKLVFSRVWKIGEQVENQRLWEVAEELGAICCKELDASVTHVVSTDMGTEKSRWAVKYKK 298
            CK+VFSRV+    Q EN  LW++AE+LGA C  E+D+SVTHVVS   GTEKSRWAV+   
Sbjct: 362  CKIVFSRVFSTESQAENHHLWKMAEQLGATCAVEVDSSVTHVVSEYAGTEKSRWAVQNGN 421

Query: 297  FLVHPSWIEAANFLWQRQPEDSFAVNDSKKE 205
            FLVHP W+EAAN+LW ++PE  F V  +K +
Sbjct: 422  FLVHPKWLEAANYLWSKKPEQQFPVELTKSK 452


>ref|XP_002324546.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa]
            gi|550318537|gb|EEF03111.2| hypothetical protein
            POPTR_0018s11760g [Populus trichocarpa]
          Length = 468

 Score =  468 bits (1203), Expect = e-129
 Identities = 235/422 (55%), Positives = 300/422 (71%)
 Frame = -2

Query: 1476 QDHDGEEEEVNVEDADFAAKRSKRQRLDTLDDDEGIHCSTSLEPVPQSSDVSVQLEICAH 1297
            QD + EE++    D+DF  KR KR +++T++  E    +TS   +  +S+ S+  EIC H
Sbjct: 55   QDKEAEEDD----DSDFQRKRVKRSKVETVEIVEDDGGTTSFASLKHNSEASISKEICTH 110

Query: 1296 PSLIKGMCVKCGRMVDDVSAVALGYIHKDLKVGSEELARLRNKDLKSLFCKKKXXXXXXX 1117
            P     MC+ CG+++D  S V  GYIHK L++G++E+ RLRN D+K+L   KK       
Sbjct: 111  PGSFGTMCIVCGQLLDGESGVTFGYIHKGLRLGNDEIVRLRNTDMKNLLRHKKLYLILDL 170

Query: 1116 XXXXLNSTHFQHISPDEEYLKSQTDSLQDLPHGNLFQLDSMRMFTKLRPFVRTFLKEASN 937
                LNST   H++ DEEYL  QTDSLQD+  G+LF L SM+M TKLRPFVRTFLKEAS 
Sbjct: 171  DHTLLNSTQLMHMTLDEEYLNGQTDSLQDVSKGSLFMLSSMQMMTKLRPFVRTFLKEASQ 230

Query: 936  MFEMYVYTMGERPYAMEMARLLDPGKIYFDSRVISQADCTQKHQKGLDVVLGAESAVIIL 757
            MFEMY+YTMG+R YA+EMA+LLDPG+ YF+++VIS+ D TQ+HQKGLDVVLG ESAV+IL
Sbjct: 231  MFEMYIYTMGDRAYALEMAKLLDPGREYFNAKVISRDDGTQRHQKGLDVVLGQESAVLIL 290

Query: 756  DDTEYVWRQHKENLILMDRYHYFSSSGRSFNLNNRSLSELKRDESEPDGALATILEVLKR 577
            DDTE  W +HK+NLILM+RYH+F+SS   F  N +SLSE K DESE +GALA+IL+VL++
Sbjct: 291  DDTENAWMKHKDNLILMERYHFFASSCHQFGFNCKSLSEQKTDESESEGALASILKVLRK 350

Query: 576  IHXXXXXXXXXXXLMRRDVRQVLKTVRSEVLKGCKLVFSRVWKIGEQVENQRLWEVAEEL 397
            IH            +     QVLKTVR +VLKGCK+VFSRV+    Q +N  LW +AE+L
Sbjct: 351  IHQIFFEDHILSLAL-----QVLKTVRKDVLKGCKIVFSRVFPTQSQADNHHLWRMAEQL 405

Query: 396  GAICCKELDASVTHVVSTDMGTEKSRWAVKYKKFLVHPSWIEAANFLWQRQPEDSFAVND 217
            GA C  ELD SVTHVVS D GTEKS WA+K+ KFLV P WIEAAN+ WQRQPE++F+ N 
Sbjct: 406  GATCSTELDPSVTHVVSKDSGTEKSHWALKHNKFLVQPGWIEAANYFWQRQPEENFSFNQ 465

Query: 216  SK 211
             K
Sbjct: 466  IK 467

