BLASTX nr result

ID: Mentha26_contig00005567 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00005567
         (1369 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU37264.1| hypothetical protein MIMGU_mgv1a005925mg [Mimulus...   541   e-151
gb|EYU29592.1| hypothetical protein MIMGU_mgv1a017809mg [Mimulus...   538   e-150
ref|XP_006343662.1| PREDICTED: RNA polymerase II C-terminal doma...   493   e-137
ref|XP_004242582.1| PREDICTED: RNA polymerase II C-terminal doma...   491   e-136
ref|XP_004242583.1| PREDICTED: RNA polymerase II C-terminal doma...   488   e-135
gb|EPS63533.1| hypothetical protein M569_11249, partial [Genlise...   478   e-132
ref|XP_007014445.1| RNA polymerase II ctd phosphatase, putative ...   476   e-132
ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative ...   469   e-129
ref|XP_007204345.1| hypothetical protein PRUPE_ppa005647mg [Prun...   468   e-129
ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Popu...   466   e-128
ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal doma...   463   e-128
ref|XP_006486243.1| PREDICTED: RNA polymerase II C-terminal doma...   462   e-127
ref|XP_007014446.1| RNA polymerase II ctd phosphatase, putative ...   457   e-126
ref|XP_002324546.2| hypothetical protein POPTR_0018s11760g [Popu...   449   e-123
gb|EXC26161.1| RNA polymerase II C-terminal domain phosphatase-l...   443   e-122
ref|XP_007154891.1| hypothetical protein PHAVU_003G156800g [Phas...   439   e-120
ref|XP_007154892.1| hypothetical protein PHAVU_003G156800g [Phas...   438   e-120
ref|XP_004287124.1| PREDICTED: RNA polymerase II C-terminal doma...   437   e-120
ref|XP_007149092.1| hypothetical protein PHAVU_005G040600g [Phas...   437   e-120
ref|XP_006575309.1| PREDICTED: RNA polymerase II C-terminal doma...   434   e-119

>gb|EYU37264.1| hypothetical protein MIMGU_mgv1a005925mg [Mimulus guttatus]
          Length = 464

 Score =  541 bits (1393), Expect = e-151
 Identities = 265/378 (70%), Positives = 317/378 (83%)
 Frame = +3

Query: 39   SDGSLSKKEMCTHPGFFAGMCMKCGQRADDDSAVPLKYIDKNLRLANDEMARLRDKDFKD 218
            S GS  KK  C HPG +AGMCMKCGQ+ DD+S V   YI KNLRLANDE+ RLRD+D K+
Sbjct: 89   SAGSSPKKNTCLHPGVYAGMCMKCGQKMDDESGVAFGYIHKNLRLANDEIDRLRDRDLKN 148

Query: 219  LLVHRRKXXXXXXXXXXXXNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMMT 398
            +L HR K            NSARL DI ++E  YLN QR++LPDN+KNSLFRLD ++MMT
Sbjct: 149  MLRHR-KLCLVLDLDHTLLNSARLHDITEQEG-YLNGQREALPDNLKNSLFRLDWIYMMT 206

Query: 399  KLRPFVRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQK 578
            KLRP+V TFL+EASK+FEMYIYTMGERPYALEMA LLDPG+ YF+SRIIAQGDCTQK+QK
Sbjct: 207  KLRPYVHTFLKEASKLFEMYIYTMGERPYALEMAKLLDPGDIYFNSRIIAQGDCTQKHQK 266

Query: 579  GLDIVLGRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYSSLSQLKTDES 758
            GLD+VLG+ESAV+ILDDTEAVW ++K+NLILM+RYHFFASSCK FGF+  SLS+L++DES
Sbjct: 267  GLDVVLGQESAVVILDDTEAVWSKHKDNLILMERYHFFASSCKQFGFNCKSLSELQSDES 326

Query: 759  ETEGALATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPIS 938
            +T+GALA+VLK LQ+IH LFFD  RKD LE+RDVR V+K +RKEVLKGC++VFTRV P +
Sbjct: 327  DTQGALASVLKRLQQIHTLFFDAERKDSLEDRDVRLVMKTLRKEVLKGCKVVFTRVFPTN 386

Query: 939  FPAENHHLWRMAEQLGAICLTEVDESVTHVVSGDAGTDKSRWAVKEKKFLVNPGWIEASN 1118
            FP+E+H LW+MAE+LGA C  E+D SVTHVVS DAGTDKSRWAV+EKKFLV+P WIEASN
Sbjct: 387  FPSEHHSLWKMAEKLGATCCNEIDPSVTHVVSMDAGTDKSRWAVQEKKFLVHPRWIEASN 446

Query: 1119 FLWRKQPEEKFPVVQAKQ 1172
            ++W+KQ EE FPV QAK+
Sbjct: 447  YMWQKQTEENFPVSQAKK 464


>gb|EYU29592.1| hypothetical protein MIMGU_mgv1a017809mg [Mimulus guttatus]
          Length = 466

 Score =  538 bits (1387), Expect = e-150
 Identities = 263/376 (69%), Positives = 311/376 (82%)
 Frame = +3

Query: 45   GSLSKKEMCTHPGFFAGMCMKCGQRADDDSAVPLKYIDKNLRLANDEMARLRDKDFKDLL 224
            GS  KK  C HPG +AGMCM+CGQ+ DD+S V   YI KNLRLANDEM RLRD+D K++L
Sbjct: 93   GSSPKKNTCLHPGVYAGMCMRCGQKMDDESGVAFGYIHKNLRLANDEMDRLRDRDLKNML 152

Query: 225  VHRRKXXXXXXXXXXXXNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMMTKL 404
             HR K            NSARL DI +EE  YLN QRD+LPD +K+SLFRLD ++MMTKL
Sbjct: 153  RHR-KLCLVLDLDHTLLNSARLHDITEEEG-YLNGQRDALPDTLKSSLFRLDWIYMMTKL 210

Query: 405  RPFVRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQKGL 584
            RPFV TFL+EASK+FEMYIYTMGERPYALEMA LLDPG+ YF+SRIIAQGDCT K+QKGL
Sbjct: 211  RPFVHTFLKEASKLFEMYIYTMGERPYALEMAKLLDPGDIYFNSRIIAQGDCTHKHQKGL 270

Query: 585  DIVLGRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYSSLSQLKTDESET 764
            D+VLG+ESAV+ILDDTE VW ++K+NLILM+RYHFFASSCK FGF+  SLS+L++DES+T
Sbjct: 271  DVVLGQESAVVILDDTEVVWSKHKDNLILMERYHFFASSCKQFGFNCKSLSELRSDESDT 330

Query: 765  EGALATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPISFP 944
            EGAL TVLK LQ+IH LFFD  RKD LE+RDVR V+K +RKEVLKGC++VFTRV P +FP
Sbjct: 331  EGALPTVLKRLQQIHSLFFDVERKDSLEDRDVRLVMKTLRKEVLKGCKVVFTRVFPTNFP 390

Query: 945  AENHHLWRMAEQLGAICLTEVDESVTHVVSGDAGTDKSRWAVKEKKFLVNPGWIEASNFL 1124
            AE+H LW+MAE+LGA C  E+D  +THVVS DAGTDKSRWA+KEKKFLV+P WIEASN++
Sbjct: 391  AEHHSLWKMAEKLGATCCNEIDPCITHVVSMDAGTDKSRWALKEKKFLVHPRWIEASNYM 450

Query: 1125 WRKQPEEKFPVVQAKQ 1172
            W+KQPEE FPV QA +
Sbjct: 451  WQKQPEENFPVSQANK 466


>ref|XP_006343662.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Solanum tuberosum]
          Length = 478

 Score =  493 bits (1269), Expect = e-137
 Identities = 245/383 (63%), Positives = 299/383 (78%)
 Frame = +3

Query: 9    SGITDPEASSSDGSLSKKEMCTHPGFFAGMCMKCGQRADDDSAVPLKYIDKNLRLANDEM 188
            S ++  E + + G+    ++CTHPG   GMC++CGQ+ +D+S V   YI KNLRLA+DE+
Sbjct: 96   SSVSRGEPAETSGASLALDVCTHPGVMGGMCIRCGQKVEDESGVAFGYIHKNLRLADDEV 155

Query: 189  ARLRDKDFKDLLVHRRKXXXXXXXXXXXXNSARLSDIKDEEDIYLNSQRDSLPDNIKNSL 368
            ARLRDKD K+LL H+ K            NS RL+DI  EE  YL  QR+ LPD ++N+L
Sbjct: 156  ARLRDKDLKNLLRHK-KLILVLDLDHTLLNSTRLADISAEES-YLKDQREVLPDALRNNL 213

Query: 369  FRLDKMHMMTKLRPFVRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIA 548
            F+LD +HMMTKLRPFV TFL+EAS +FEMYIYTMGERPYALEMA LLDPG  YFHSR+IA
Sbjct: 214  FKLDWIHMMTKLRPFVHTFLKEASSLFEMYIYTMGERPYALEMASLLDPGGIYFHSRVIA 273

Query: 549  QGDCTQKYQKGLDIVLGRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYS 728
            Q D T+++QKGLD+VLG+ESAVLILDDTE VW +++ENLILMDRYHFF SSC+ FG    
Sbjct: 274  QSDSTRRHQKGLDVVLGQESAVLILDDTEVVWGKHRENLILMDRYHFFTSSCRQFGLKCK 333

Query: 729  SLSQLKTDESETEGALATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCR 908
            SLS+ K+DE+E EGALA+VL+VLQRIH LFFD  R D++  RDVRQVLK VRKE+LKGC+
Sbjct: 334  SLSEQKSDENEAEGALASVLEVLQRIHRLFFDLERGDNIMERDVRQVLKTVRKEILKGCK 393

Query: 909  IVFTRVIPISFPAENHHLWRMAEQLGAICLTEVDESVTHVVSGDAGTDKSRWAVKEKKFL 1088
            IVFT VIPI    ENHH W++AE+LGA   TEVDESVTHVVS +  T+KSR A++EKKFL
Sbjct: 394  IVFTGVIPIQCQPENHHYWKLAEKLGATFSTEVDESVTHVVSMNDKTEKSRQALREKKFL 453

Query: 1089 VNPGWIEASNFLWRKQPEEKFPV 1157
            V+P WIEA+N+LWRK PEE FPV
Sbjct: 454  VHPSWIEAANYLWRKPPEENFPV 476


>ref|XP_004242582.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Solanum lycopersicum]
          Length = 512

 Score =  491 bits (1263), Expect = e-136
 Identities = 248/392 (63%), Positives = 302/392 (77%), Gaps = 7/392 (1%)
 Frame = +3

Query: 3    LCSGITDPEASSSDGSLSKK-------EMCTHPGFFAGMCMKCGQRADDDSAVPLKYIDK 161
            L  G  DP++S S G  ++        ++CTHPG   GMC++CGQ+ +D+S V   YI K
Sbjct: 121  LIEGAVDPQSSVSRGEPAETSGASMALDVCTHPGVMGGMCIRCGQKVEDESGVAFGYIHK 180

Query: 162  NLRLANDEMARLRDKDFKDLLVHRRKXXXXXXXXXXXXNSARLSDIKDEEDIYLNSQRDS 341
            NLRLA+DE+ARLR+KD K+LL HR K            NS RL+DI  EE  YL  QR+ 
Sbjct: 181  NLRLADDEVARLREKDLKNLLRHR-KLILVLDLDHTLLNSTRLADISAEES-YLKDQREV 238

Query: 342  LPDNIKNSLFRLDKMHMMTKLRPFVRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGN 521
            LPD ++++LF+LD +HMMTKLRPFV TFL+EAS +FEMYIYTMGERPYALEMA LLDPG 
Sbjct: 239  LPDALRSNLFKLDWIHMMTKLRPFVHTFLKEASSLFEMYIYTMGERPYALEMAKLLDPGG 298

Query: 522  KYFHSRIIAQGDCTQKYQKGLDIVLGRESAVLILDDTEAVWKENKENLILMDRYHFFASS 701
             YFHSR+IAQ D T+++QKGLD+VLG+ESAVLILDDTE VW +++ENLILMDRYHFF SS
Sbjct: 299  IYFHSRVIAQSDSTRRHQKGLDVVLGQESAVLILDDTEVVWGKHRENLILMDRYHFFTSS 358

Query: 702  CKHFGFSYSSLSQLKTDESETEGALATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAV 881
            C+ FG    SLS+ K+DE+E EGALA+VL+VLQRIH LFFD  R D++  RDVRQVLK V
Sbjct: 359  CRQFGLKCKSLSEQKSDENEAEGALASVLEVLQRIHRLFFDPERGDNIMERDVRQVLKTV 418

Query: 882  RKEVLKGCRIVFTRVIPISFPAENHHLWRMAEQLGAICLTEVDESVTHVVSGDAGTDKSR 1061
            RKE+LKGC+IVFT VIPI    ENH+ W++AE+LGA   TEVDESVTHVVS +  T+KSR
Sbjct: 419  RKEILKGCKIVFTGVIPIQCQPENHYYWKLAEKLGATFSTEVDESVTHVVSMNDKTEKSR 478

Query: 1062 WAVKEKKFLVNPGWIEASNFLWRKQPEEKFPV 1157
             AV+EKKFLV+P WIEA+N+LWRK PEE FPV
Sbjct: 479  QAVREKKFLVHPRWIEAANYLWRKPPEENFPV 510


>ref|XP_004242583.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Solanum lycopersicum]
          Length = 472

 Score =  488 bits (1257), Expect = e-135
 Identities = 243/377 (64%), Positives = 297/377 (78%)
 Frame = +3

Query: 27   EASSSDGSLSKKEMCTHPGFFAGMCMKCGQRADDDSAVPLKYIDKNLRLANDEMARLRDK 206
            E++ + G+    ++CTHPG   GMC++CGQ+ +D+S V   YI KNLRLA+DE+ARLR+K
Sbjct: 96   ESAETSGASLALDVCTHPGVMGGMCIRCGQKVEDESGVAFGYIHKNLRLADDEVARLREK 155

Query: 207  DFKDLLVHRRKXXXXXXXXXXXXNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKM 386
            D K+LL HR K            NS RL+DI  EE  YL  QR+ LPD ++++LF+LD +
Sbjct: 156  DLKNLLRHR-KLILVLDLDHTLLNSTRLADISAEES-YLKDQREVLPDALRSNLFKLDWI 213

Query: 387  HMMTKLRPFVRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQ 566
            HMMTKLRPFV TFL+EAS +FEMYIYTMGERPYALEMA LLDPG  YFHSR+IAQ D T+
Sbjct: 214  HMMTKLRPFVHTFLKEASSLFEMYIYTMGERPYALEMAKLLDPGGIYFHSRVIAQSDSTR 273

Query: 567  KYQKGLDIVLGRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYSSLSQLK 746
            ++QKGLD+VLG+ESAVLILDDTE VW +++ENLILMDRYHFF SSC+ FG    SLS+ K
Sbjct: 274  RHQKGLDVVLGQESAVLILDDTEVVWGKHRENLILMDRYHFFTSSCRQFGLKCKSLSEQK 333

Query: 747  TDESETEGALATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRV 926
            +DE+E EGALA+VL+VLQRIH LFFD  R D++  RDVRQVLK VRKE+LKGC+IVFT V
Sbjct: 334  SDENEAEGALASVLEVLQRIHRLFFDPERGDNIMERDVRQVLKTVRKEILKGCKIVFTGV 393

Query: 927  IPISFPAENHHLWRMAEQLGAICLTEVDESVTHVVSGDAGTDKSRWAVKEKKFLVNPGWI 1106
            IPI    ENH+ W++AE+LGA   TEVDESVTHVVS +  T+KSR AV+EKKFLV+P WI
Sbjct: 394  IPIQCQPENHYYWKLAEKLGATFSTEVDESVTHVVSMNDKTEKSRQAVREKKFLVHPRWI 453

Query: 1107 EASNFLWRKQPEEKFPV 1157
            EA+N+LWRK PEE FPV
Sbjct: 454  EAANYLWRKPPEENFPV 470


>gb|EPS63533.1| hypothetical protein M569_11249, partial [Genlisea aurea]
          Length = 386

 Score =  478 bits (1231), Expect = e-132
 Identities = 236/378 (62%), Positives = 298/378 (78%), Gaps = 1/378 (0%)
 Frame = +3

Query: 27   EASSSDGSLSKKEMCTHPGFFAGMCMKCGQRADDDSAVPLKYIDKNLRLANDEMARLRDK 206
            ++++   S+S+  +C HPG + GMC+ CG   +++S +P  YI KNLRLA+DE+ARLR K
Sbjct: 8    DSTAESRSISESSVCPHPGIYGGMCIMCGGIMEEESGIPFGYIHKNLRLADDEVARLRYK 67

Query: 207  DFKDLLVHRRKXXXXXXXXXXXXNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKM 386
            D K LL  RRK            NS+RLSD+  EE  +LN     LPD+++NSLFRL+ +
Sbjct: 68   DLKALL-GRRKLHLVLDLDHTLLNSSRLSDLTGEE-CHLNVHSSDLPDSMRNSLFRLEHI 125

Query: 387  HMMTKLRPFVRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQ 566
             MMTKLRPFVRTFL+EAS++FEM+IYTMGERPYALEMA LLDPG+ YFHSRIIAQGDCTQ
Sbjct: 126  QMMTKLRPFVRTFLKEASEIFEMHIYTMGERPYALEMAKLLDPGDTYFHSRIIAQGDCTQ 185

Query: 567  KYQKGLDIVLGRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYSSLSQLK 746
            K+QKGLD+VLG+ES VLILDDTE VW ++KENLILM+RY FF SSCK FGF+  SL++L+
Sbjct: 186  KHQKGLDVVLGQESTVLILDDTEGVWGKHKENLILMERYLFFGSSCKQFGFTCKSLAELR 245

Query: 747  TDESETEGALATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRV 926
            +DESE+EGAL+T L  L+RIH LFFD    D+LE RDVR+VL +VRKE+L+GC+IVF+RV
Sbjct: 246  SDESESEGALSTALATLKRIHSLFFDGEHDDELEARDVRKVLHSVRKEILEGCKIVFSRV 305

Query: 927  IPIS-FPAENHHLWRMAEQLGAICLTEVDESVTHVVSGDAGTDKSRWAVKEKKFLVNPGW 1103
             P S F AENH LW+M  +LGA C  EVD +VTHVV+ DAGTDKSRWA+++ K LV+P W
Sbjct: 306  FPSSFFQAENHQLWKMGVRLGATCSREVDSTVTHVVAVDAGTDKSRWALRQGKHLVHPRW 365

Query: 1104 IEASNFLWRKQPEEKFPV 1157
            +EAS ++W++QPEEKFPV
Sbjct: 366  LEASYYMWKRQPEEKFPV 383


>ref|XP_007014445.1| RNA polymerase II ctd phosphatase, putative isoform 1 [Theobroma
            cacao] gi|508784808|gb|EOY32064.1| RNA polymerase II ctd
            phosphatase, putative isoform 1 [Theobroma cacao]
          Length = 469

 Score =  476 bits (1226), Expect = e-132
 Identities = 240/371 (64%), Positives = 289/371 (77%)
 Frame = +3

Query: 57   KKEMCTHPGFFAGMCMKCGQRADDDSAVPLKYIDKNLRLANDEMARLRDKDFKDLLVHRR 236
            KK++CTHPG F  MC+ CGQR DD+S V   YI K LRL NDE+ RLR  D K+LL H+ 
Sbjct: 100  KKDICTHPGSFGQMCILCGQRLDDESGVTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHK- 158

Query: 237  KXXXXXXXXXXXXNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMMTKLRPFV 416
            K            NS +L  +  +E+ YL  Q DSL D  + SLF LD MHMMTKLRPFV
Sbjct: 159  KLYLVLDLDHTLLNSTQLMHLTPDEE-YLKGQSDSLQDVSRGSLFMLDFMHMMTKLRPFV 217

Query: 417  RTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQKGLDIVL 596
            RTFL+EAS+MFEMYIYTMG+RPYALEMA LLDP  +YF  R+I++ D TQK+QKGLD+VL
Sbjct: 218  RTFLKEASEMFEMYIYTMGDRPYALEMAKLLDPRREYFSDRVISRDDGTQKHQKGLDVVL 277

Query: 597  GRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYSSLSQLKTDESETEGAL 776
            G+ESAV+ILDDTE  W ++K+NLILM+RYH+FASSC  FG+   SLSQLK+DESE +GAL
Sbjct: 278  GQESAVVILDDTENAWMKHKDNLILMERYHYFASSCHQFGYKCKSLSQLKSDESEPDGAL 337

Query: 777  ATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPISFPAENH 956
            A+VLK L++IH +FFDE    +L +RDVRQVLK V++EVLKGC+IVF+ V P +FPAE+H
Sbjct: 338  ASVLKALRQIHHMFFDE-LDCNLASRDVRQVLKTVQEEVLKGCKIVFSHVFPTNFPAESH 396

Query: 957  HLWRMAEQLGAICLTEVDESVTHVVSGDAGTDKSRWAVKEKKFLVNPGWIEASNFLWRKQ 1136
             LW+MAEQLGA C TE D SVTHVVS DAGT+KSRWAVKEKKFLV+P WIEA+N+LW+KQ
Sbjct: 397  PLWKMAEQLGATCSTETDLSVTHVVSTDAGTEKSRWAVKEKKFLVHPRWIEATNYLWQKQ 456

Query: 1137 PEEKFPVVQAK 1169
            PEE FPV Q K
Sbjct: 457  PEENFPVSQGK 467


>ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
            gi|223534449|gb|EEF36151.1| RNA polymerase II ctd
            phosphatase, putative [Ricinus communis]
          Length = 478

 Score =  469 bits (1206), Expect = e-129
 Identities = 234/372 (62%), Positives = 289/372 (77%)
 Frame = +3

Query: 54   SKKEMCTHPGFFAGMCMKCGQRADDDSAVPLKYIDKNLRLANDEMARLRDKDFKDLLVHR 233
            S K  CTHPG F  MC+ CG+R  +++ V   YI K LRLANDE+ RLR+ D K+LL HR
Sbjct: 106  SSKVACTHPGSFGDMCILCGERLIEETGVTFGYIHKGLRLANDEIVRLRNTDMKNLLRHR 165

Query: 234  RKXXXXXXXXXXXXNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMMTKLRPF 413
             K            NS +L  +  EE+ YL SQ DS+ D    SLF +D MHMMTKLRPF
Sbjct: 166  -KLYLVLDLDHTLLNSTQLMHLTAEEE-YLKSQIDSMQDVSNGSLFMVDFMHMMTKLRPF 223

Query: 414  VRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQKGLDIV 593
            +RTFL+EAS+MFEMYIYTMG+R YALEMA  LDPG +YF++R+I++ D TQ++QKGLDIV
Sbjct: 224  IRTFLKEASQMFEMYIYTMGDRAYALEMAKFLDPGREYFNARVISRDDGTQRHQKGLDIV 283

Query: 594  LGRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYSSLSQLKTDESETEGA 773
            LG+ESAVLILDDTE  W ++K+NLILM+RYHFFASSC+ FGF   SLSQLK+DE+E++GA
Sbjct: 284  LGQESAVLILDDTENAWTKHKDNLILMERYHFFASSCRQFGFECKSLSQLKSDENESDGA 343

Query: 774  LATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPISFPAEN 953
            LA+VLKVL+RIH +FFDE  +D ++ RDVRQVL  VRK+VLKGC+IVF+RV P  F A+N
Sbjct: 344  LASVLKVLRRIHHIFFDE-LEDAIDGRDVRQVLSTVRKDVLKGCKIVFSRVFPTQFQADN 402

Query: 954  HHLWRMAEQLGAICLTEVDESVTHVVSGDAGTDKSRWAVKEKKFLVNPGWIEASNFLWRK 1133
            HHLW+MAEQLGA C  EVD SVTHVVS +AGT+KSRWA+K  KFLV+P WIEA+N++W++
Sbjct: 403  HHLWKMAEQLGATCSREVDPSVTHVVSAEAGTEKSRWALKNDKFLVHPRWIEATNYMWQR 462

Query: 1134 QPEEKFPVVQAK 1169
            QPEE F V Q K
Sbjct: 463  QPEENFSVNQPK 474


>ref|XP_007204345.1| hypothetical protein PRUPE_ppa005647mg [Prunus persica]
            gi|462399876|gb|EMJ05544.1| hypothetical protein
            PRUPE_ppa005647mg [Prunus persica]
          Length = 449

 Score =  468 bits (1205), Expect = e-129
 Identities = 240/371 (64%), Positives = 284/371 (76%)
 Frame = +3

Query: 57   KKEMCTHPGFFAGMCMKCGQRADDDSAVPLKYIDKNLRLANDEMARLRDKDFKDLLVHRR 236
            KK++CTHPG    +C+ CGQR D+ S VPL YI K+  L NDE+ R+R  D K  L H +
Sbjct: 81   KKDICTHPGSVKDLCIVCGQRVDEKSGVPLGYIHKDFWLNNDEIDRVRSTDIKKSL-HLK 139

Query: 237  KXXXXXXXXXXXXNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMMTKLRPFV 416
            K            NS  L+ +  EE+ YL+SQ DSL D    SLFR+D MHMMTKLRPFV
Sbjct: 140  KLYLVLDLDHTLLNSTHLNHMTAEEE-YLHSQTDSLQDVSDGSLFRVDVMHMMTKLRPFV 198

Query: 417  RTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQKGLDIVL 596
            R FL+EAS+MFEMYIYTMGER YALEMA LLDP  +YF  R+I++ D TQK+QKGLD+VL
Sbjct: 199  RKFLKEASEMFEMYIYTMGERAYALEMAKLLDPRKEYFGDRVISRDDGTQKHQKGLDVVL 258

Query: 597  GRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYSSLSQLKTDESETEGAL 776
            G ESA LILDDTE  W ++K+NLILM+RYHFF SSC  FGF   SLS+LK+DESE EGAL
Sbjct: 259  GHESAALILDDTENAWTKHKDNLILMERYHFFRSSCHQFGFHCKSLSELKSDESEPEGAL 318

Query: 777  ATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPISFPAENH 956
            ATVL+VL+RIH +FF E  KD+L +RDVRQVLK +RKE+LKGC+IVF+RV P  F AENH
Sbjct: 319  ATVLEVLKRIHNMFFYES-KDNLIDRDVRQVLKTLRKEILKGCKIVFSRVFPSKFQAENH 377

Query: 957  HLWRMAEQLGAICLTEVDESVTHVVSGDAGTDKSRWAVKEKKFLVNPGWIEASNFLWRKQ 1136
             LW+MAEQLGA C TE+D SVTHVVS DAGT+KSRWAVKEKKFLV+P WIEASN++W KQ
Sbjct: 378  QLWKMAEQLGATCSTELDLSVTHVVSTDAGTEKSRWAVKEKKFLVHPQWIEASNYMWLKQ 437

Query: 1137 PEEKFPVVQAK 1169
             E+KFPV Q K
Sbjct: 438  AEDKFPVNQTK 448


>ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa]
            gi|550318538|gb|EEF03112.2| hypothetical protein
            POPTR_0018s11760g [Populus trichocarpa]
          Length = 472

 Score =  466 bits (1199), Expect = e-128
 Identities = 234/378 (61%), Positives = 292/378 (77%)
 Frame = +3

Query: 36   SSDGSLSKKEMCTHPGFFAGMCMKCGQRADDDSAVPLKYIDKNLRLANDEMARLRDKDFK 215
            +S+ S+SK E+CTHPG F  MC+ CGQ  D +S V   YI K LRL NDE+ RLR+ D K
Sbjct: 98   NSEASISK-EICTHPGSFGTMCIVCGQLLDGESGVTFGYIHKGLRLGNDEIVRLRNTDMK 156

Query: 216  DLLVHRRKXXXXXXXXXXXXNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMM 395
            +LL H+ K            NS +L  +  +E+ YLN Q DSL D  K SLF L  M MM
Sbjct: 157  NLLRHK-KLYLILDLDHTLLNSTQLMHMTLDEE-YLNGQTDSLQDVSKGSLFMLSSMQMM 214

Query: 396  TKLRPFVRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQ 575
            TKLRPFVRTFL+EAS+MFEMYIYTMG+R YALEMA LLDPG +YF++++I++ D TQ++Q
Sbjct: 215  TKLRPFVRTFLKEASQMFEMYIYTMGDRAYALEMAKLLDPGREYFNAKVISRDDGTQRHQ 274

Query: 576  KGLDIVLGRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYSSLSQLKTDE 755
            KGLD+VLG+ESAVLILDDTE  W ++K+NLILM+RYHFFASSC  FGF+  SLS+ KTDE
Sbjct: 275  KGLDVVLGQESAVLILDDTENAWMKHKDNLILMERYHFFASSCHQFGFNCKSLSEQKTDE 334

Query: 756  SETEGALATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPI 935
            SE+EGALA++LKVL++IH +FF+E  +++++ RDVRQVLK VRK+VLKGC+IVF+RV P 
Sbjct: 335  SESEGALASILKVLRKIHQIFFEE-LEENMDGRDVRQVLKTVRKDVLKGCKIVFSRVFPT 393

Query: 936  SFPAENHHLWRMAEQLGAICLTEVDESVTHVVSGDAGTDKSRWAVKEKKFLVNPGWIEAS 1115
               A+NHHLWRMAEQLGA C TE+D SVTHVVS D+GT+KS WA+K  KFLV PGWIEA+
Sbjct: 394  QSQADNHHLWRMAEQLGATCSTELDPSVTHVVSKDSGTEKSHWALKHNKFLVQPGWIEAA 453

Query: 1116 NFLWRKQPEEKFPVVQAK 1169
            N+ W++QPEE F   Q K
Sbjct: 454  NYFWQRQPEENFSFNQIK 471


>ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Cucumis sativus]
          Length = 452

 Score =  463 bits (1192), Expect = e-128
 Identities = 231/373 (61%), Positives = 287/373 (76%)
 Frame = +3

Query: 51   LSKKEMCTHPGFFAGMCMKCGQRADDDSAVPLKYIDKNLRLANDEMARLRDKDFKDLLVH 230
            LSK+++C+HPG F  MC+ CGQR D++S V   YI K LRL NDE+ R+R+K+ K+LL  
Sbjct: 76   LSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKELRLNNDEINRMRNKEMKELL-Q 134

Query: 231  RRKXXXXXXXXXXXXNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMMTKLRP 410
            R+K            NS  L  +  EE+ YL SQ DSL D  K SLF L+ +H MTKLRP
Sbjct: 135  RKKLILVLDLDHTLLNSTELRYLTVEEE-YLRSQTDSLDDVTKGSLFLLNSVHTMTKLRP 193

Query: 411  FVRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQKGLDI 590
            FV +FL+EASK+FEMYIYTMGER YA EMA LLDP  +YF S++I++ D TQK+QKGLD+
Sbjct: 194  FVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDV 253

Query: 591  VLGRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYSSLSQLKTDESETEG 770
            VLG+ESAVLILDDTE  W ++KENLILM+RYHFFASSC+ FGF+  SLS+LK DESET+G
Sbjct: 254  VLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCKSLSELKNDESETDG 313

Query: 771  ALATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPISFPAE 950
            AL T+LKVL+++H +FF+E    DL +RDVRQVLK VR EVL+GC++VF+RV P  F AE
Sbjct: 314  ALTTILKVLKQVHHMFFNE-VSGDLVDRDVRQVLKTVRAEVLEGCKVVFSRVFPTKFQAE 372

Query: 951  NHHLWRMAEQLGAICLTEVDESVTHVVSGDAGTDKSRWAVKEKKFLVNPGWIEASNFLWR 1130
            NH LW+M EQLG  C TE+D+SVTHVV+ DAGT+KSRWA+KEKKFLV+P WIEASN+ W+
Sbjct: 373  NHQLWKMVEQLGGTCSTELDQSVTHVVATDAGTEKSRWALKEKKFLVHPRWIEASNYFWK 432

Query: 1131 KQPEEKFPVVQAK 1169
            +Q EE F V Q K
Sbjct: 433  RQMEENFTVEQTK 445


>ref|XP_006486243.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like isoform X1 [Citrus sinensis]
            gi|568865772|ref|XP_006486244.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 4-like
            isoform X2 [Citrus sinensis]
            gi|568865774|ref|XP_006486245.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 4-like
            isoform X3 [Citrus sinensis]
          Length = 478

 Score =  462 bits (1190), Expect = e-127
 Identities = 234/367 (63%), Positives = 281/367 (76%)
 Frame = +3

Query: 69   CTHPGFFAGMCMKCGQRADDDSAVPLKYIDKNLRLANDEMARLRDKDFKDLLVHRRKXXX 248
            C HPG   GMC +CG+R +++S V   YI K LRL NDE+ RLR+ D K LL HR K   
Sbjct: 102  CPHPGSLGGMCYRCGKRLEEESGVTFSYICKGLRLGNDEIDRLRNTDMKHLLRHR-KLYL 160

Query: 249  XXXXXXXXXNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMMTKLRPFVRTFL 428
                     NS  L  +  EED YL SQ DSL D  K SLF L  M+MMTKLRPFV TFL
Sbjct: 161  ILDLDHTLLNSTLLLHLTPEED-YLKSQADSLQDVSKGSLFMLAFMNMMTKLRPFVHTFL 219

Query: 429  EEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQKGLDIVLGRES 608
            +EAS+MFEMYIYTMG+RPYALEMA LLDP  +YF++R+I++ D TQ++QKGLD+VLG+ES
Sbjct: 220  KEASEMFEMYIYTMGDRPYALEMAKLLDPSREYFNARVISRDDGTQRHQKGLDVVLGQES 279

Query: 609  AVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYSSLSQLKTDESETEGALATVL 788
            AVLILDDTE  W ++++NLILM+RYHFFASSC+ FG+   SLSQL++DESE EGALA+VL
Sbjct: 280  AVLILDDTENAWTKHRDNLILMERYHFFASSCRQFGYHCQSLSQLRSDESELEGALASVL 339

Query: 789  KVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPISFPAENHHLWR 968
            KVL+RIH +FFDE   +DL  RDVRQVLK VR EVLKGC++VF+ V P  FPA+ H+LW+
Sbjct: 340  KVLKRIHNIFFDE-LANDLAGRDVRQVLKMVRGEVLKGCKLVFSHVFPTKFPADTHYLWK 398

Query: 969  MAEQLGAICLTEVDESVTHVVSGDAGTDKSRWAVKEKKFLVNPGWIEASNFLWRKQPEEK 1148
            MAEQLGA CL E+D SVTHVVS DA T+KSRWA KE KFLV+P WIE +NFLW++QPEE 
Sbjct: 399  MAEQLGATCLIELDPSVTHVVSTDARTEKSRWAAKEAKFLVDPRWIETANFLWQRQPEEN 458

Query: 1149 FPVVQAK 1169
            FPV Q K
Sbjct: 459  FPVKQNK 465


>ref|XP_007014446.1| RNA polymerase II ctd phosphatase, putative isoform 2 [Theobroma
            cacao] gi|508784809|gb|EOY32065.1| RNA polymerase II ctd
            phosphatase, putative isoform 2 [Theobroma cacao]
          Length = 357

 Score =  457 bits (1176), Expect = e-126
 Identities = 232/358 (64%), Positives = 279/358 (77%)
 Frame = +3

Query: 96   MCMKCGQRADDDSAVPLKYIDKNLRLANDEMARLRDKDFKDLLVHRRKXXXXXXXXXXXX 275
            MC+ CGQR DD+S V   YI K LRL NDE+ RLR  D K+LL H+ K            
Sbjct: 1    MCILCGQRLDDESGVTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHK-KLYLVLDLDHTLL 59

Query: 276  NSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMMTKLRPFVRTFLEEASKMFEM 455
            NS +L  +  +E+ YL  Q DSL D  + SLF LD MHMMTKLRPFVRTFL+EAS+MFEM
Sbjct: 60   NSTQLMHLTPDEE-YLKGQSDSLQDVSRGSLFMLDFMHMMTKLRPFVRTFLKEASEMFEM 118

Query: 456  YIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQKGLDIVLGRESAVLILDDTE 635
            YIYTMG+RPYALEMA LLDP  +YF  R+I++ D TQK+QKGLD+VLG+ESAV+ILDDTE
Sbjct: 119  YIYTMGDRPYALEMAKLLDPRREYFSDRVISRDDGTQKHQKGLDVVLGQESAVVILDDTE 178

Query: 636  AVWKENKENLILMDRYHFFASSCKHFGFSYSSLSQLKTDESETEGALATVLKVLQRIHGL 815
              W ++K+NLILM+RYH+FASSC  FG+   SLSQLK+DESE +GALA+VLK L++IH +
Sbjct: 179  NAWMKHKDNLILMERYHYFASSCHQFGYKCKSLSQLKSDESEPDGALASVLKALRQIHHM 238

Query: 816  FFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPISFPAENHHLWRMAEQLGAIC 995
            FFDE    +L +RDVRQVLK V++EVLKGC+IVF+ V P +FPAE+H LW+MAEQLGA C
Sbjct: 239  FFDE-LDCNLASRDVRQVLKTVQEEVLKGCKIVFSHVFPTNFPAESHPLWKMAEQLGATC 297

Query: 996  LTEVDESVTHVVSGDAGTDKSRWAVKEKKFLVNPGWIEASNFLWRKQPEEKFPVVQAK 1169
             TE D SVTHVVS DAGT+KSRWAVKEKKFLV+P WIEA+N+LW+KQPEE FPV Q K
Sbjct: 298  STETDLSVTHVVSTDAGTEKSRWAVKEKKFLVHPRWIEATNYLWQKQPEENFPVSQGK 355


>ref|XP_002324546.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa]
            gi|550318537|gb|EEF03111.2| hypothetical protein
            POPTR_0018s11760g [Populus trichocarpa]
          Length = 468

 Score =  449 bits (1155), Expect = e-123
 Identities = 230/378 (60%), Positives = 286/378 (75%)
 Frame = +3

Query: 36   SSDGSLSKKEMCTHPGFFAGMCMKCGQRADDDSAVPLKYIDKNLRLANDEMARLRDKDFK 215
            +S+ S+SK E+CTHPG F  MC+ CGQ  D +S V   YI K LRL NDE+ RLR+ D K
Sbjct: 98   NSEASISK-EICTHPGSFGTMCIVCGQLLDGESGVTFGYIHKGLRLGNDEIVRLRNTDMK 156

Query: 216  DLLVHRRKXXXXXXXXXXXXNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMM 395
            +LL H+ K            NS +L  +  +E+ YLN Q DSL D  K SLF L  M MM
Sbjct: 157  NLLRHK-KLYLILDLDHTLLNSTQLMHMTLDEE-YLNGQTDSLQDVSKGSLFMLSSMQMM 214

Query: 396  TKLRPFVRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQ 575
            TKLRPFVRTFL+EAS+MFEMYIYTMG+R YALEMA LLDPG +YF++++I++ D TQ++Q
Sbjct: 215  TKLRPFVRTFLKEASQMFEMYIYTMGDRAYALEMAKLLDPGREYFNAKVISRDDGTQRHQ 274

Query: 576  KGLDIVLGRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYSSLSQLKTDE 755
            KGLD+VLG+ESAVLILDDTE  W ++K+NLILM+RYHFFASSC  FGF+  SLS+ KTDE
Sbjct: 275  KGLDVVLGQESAVLILDDTENAWMKHKDNLILMERYHFFASSCHQFGFNCKSLSEQKTDE 334

Query: 756  SETEGALATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPI 935
            SE+EGALA++LKVL++IH +FF+    D + +  + QVLK VRK+VLKGC+IVF+RV P 
Sbjct: 335  SESEGALASILKVLRKIHQIFFE----DHILSLAL-QVLKTVRKDVLKGCKIVFSRVFPT 389

Query: 936  SFPAENHHLWRMAEQLGAICLTEVDESVTHVVSGDAGTDKSRWAVKEKKFLVNPGWIEAS 1115
               A+NHHLWRMAEQLGA C TE+D SVTHVVS D+GT+KS WA+K  KFLV PGWIEA+
Sbjct: 390  QSQADNHHLWRMAEQLGATCSTELDPSVTHVVSKDSGTEKSHWALKHNKFLVQPGWIEAA 449

Query: 1116 NFLWRKQPEEKFPVVQAK 1169
            N+ W++QPEE F   Q K
Sbjct: 450  NYFWQRQPEENFSFNQIK 467


>gb|EXC26161.1| RNA polymerase II C-terminal domain phosphatase-like 4 [Morus
            notabilis]
          Length = 512

 Score =  443 bits (1140), Expect = e-122
 Identities = 227/373 (60%), Positives = 276/373 (73%), Gaps = 1/373 (0%)
 Frame = +3

Query: 54   SKKEMCTHPGFFAGMCMKCGQRADDDSAVPLKYIDKNLRLANDEMARLRDKDFKDLLVHR 233
            +KK+ CTHPG F  MC+ CGQR ++++ V   YI K LRL NDE+ RLR  D K+L+ H+
Sbjct: 141  TKKDACTHPGSFGDMCILCGQRLEEETGVTFGYIHKGLRLNNDEIVRLRSTDMKNLIRHK 200

Query: 234  RKXXXXXXXXXXXXNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMMTKLRPF 413
             K            NS RL D+  EE  YL SQ  S  D  + SLF L+ MHMMTKLRPF
Sbjct: 201  -KLCLVLDLDHTLLNSTRLVDLSSEEQ-YLKSQAFSPQDASEGSLFVLEAMHMMTKLRPF 258

Query: 414  VRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQKGLDIV 593
            VR FL+E   +FE+Y+YTMG+RPYAL MA LLDP  +YF  RII++ D T K+QKGLD+V
Sbjct: 259  VRNFLKEVYNLFELYVYTMGDRPYALAMAKLLDPRREYFGDRIISRDDGTLKHQKGLDVV 318

Query: 594  LGRESAVLILDDTEAVW-KENKENLILMDRYHFFASSCKHFGFSYSSLSQLKTDESETEG 770
            LG+ESAVLILDDTE  W K +KENLILM+RYHFF SS   FG++  SLS+LK+DESETEG
Sbjct: 319  LGQESAVLILDDTENAWIKHHKENLILMERYHFFRSSTHQFGYNCKSLSELKSDESETEG 378

Query: 771  ALATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPISFPAE 950
            AL TVL VL+++H +FFDE   D +  RDVRQVLK +RKEVLKGC+IVF+RV P  F AE
Sbjct: 379  ALVTVLNVLKQVHSMFFDERGIDHII-RDVRQVLKTLRKEVLKGCKIVFSRVFPTEFQAE 437

Query: 951  NHHLWRMAEQLGAICLTEVDESVTHVVSGDAGTDKSRWAVKEKKFLVNPGWIEASNFLWR 1130
            NH LW+MAEQLGA C  E+D SVTHVVS D GT+KSRWAVKE KFLV+P WIEA+N++W+
Sbjct: 438  NHQLWKMAEQLGATCGIELDPSVTHVVSLDVGTEKSRWAVKENKFLVHPRWIEAANYMWK 497

Query: 1131 KQPEEKFPVVQAK 1169
            +QPE+ F V Q K
Sbjct: 498  RQPEDNFSVNQVK 510


>ref|XP_007154891.1| hypothetical protein PHAVU_003G156800g [Phaseolus vulgaris]
            gi|561028245|gb|ESW26885.1| hypothetical protein
            PHAVU_003G156800g [Phaseolus vulgaris]
          Length = 441

 Score =  439 bits (1128), Expect = e-120
 Identities = 227/386 (58%), Positives = 288/386 (74%), Gaps = 7/386 (1%)
 Frame = +3

Query: 15   ITDPEASSSDGSLS-------KKEMCTHPGFFAGMCMKCGQRADDDSAVPLKYIDKNLRL 173
            I + E S+ +G +        K ++C+HPG F  MC++CGQ+ D +S V   YI K LRL
Sbjct: 57   IEETEGSTLEGIIKQNLEVSVKVDVCSHPGSFGSMCIRCGQKLDGESGVTFGYIHKGLRL 116

Query: 174  ANDEMARLRDKDFKDLLVHRRKXXXXXXXXXXXXNSARLSDIKDEEDIYLNSQRDSLPDN 353
             +DE++RLR+ D K LL  R+K            NS  LSD+  EE   L+ Q DSL D 
Sbjct: 117  HDDEISRLRNTDMKSLLC-RKKLYFVLDLDHTLLNSTHLSDLSSEESSLLD-QTDSLEDV 174

Query: 354  IKNSLFRLDKMHMMTKLRPFVRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFH 533
             K SLF+LD MHMMTKLRPFVR+FL+EAS+MFEMYIYTMG+RPYALEMA LLDP   YF+
Sbjct: 175  SKGSLFKLDHMHMMTKLRPFVRSFLKEASEMFEMYIYTMGDRPYALEMAKLLDPRGVYFN 234

Query: 534  SRIIAQGDCTQKYQKGLDIVLGRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHF 713
            +++I++ D TQK+QKGLD+VLG+ESAVLILDDTE  W ++K+NLILM+RYHFFASSC+ F
Sbjct: 235  AKVISRDDGTQKHQKGLDVVLGQESAVLILDDTEHAWMKHKDNLILMERYHFFASSCRQF 294

Query: 714  GFSYSSLSQLKTDESETEGALATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEV 893
            GF+  SL++L+ DE ET+GALA +LKVL+++H  FFD+  ++DL +RDVRQVL +VR EV
Sbjct: 295  GFNCKSLAELRNDEDETDGALAKILKVLRQVHCTFFDK-HQEDLVDRDVRQVLASVRSEV 353

Query: 894  LKGCRIVFTRVIPISFPAENHHLWRMAEQLGAICLTEVDESVTHVVSGDAGTDKSRWAVK 1073
            L GC IVF+R+   + P+    L +MAEQ+GA CLTEVD SVTHVV+ DAGT+KSRWAVK
Sbjct: 354  LGGCVIVFSRIFHGALPS----LRKMAEQMGATCLTEVDLSVTHVVATDAGTEKSRWAVK 409

Query: 1074 EKKFLVNPGWIEASNFLWRKQPEEKF 1151
            E KFLV+P WIEA+NF W KQPEE F
Sbjct: 410  EHKFLVHPRWIEAANFFWEKQPEENF 435


>ref|XP_007154892.1| hypothetical protein PHAVU_003G156800g [Phaseolus vulgaris]
            gi|561028246|gb|ESW26886.1| hypothetical protein
            PHAVU_003G156800g [Phaseolus vulgaris]
          Length = 401

 Score =  438 bits (1126), Expect = e-120
 Identities = 223/365 (61%), Positives = 280/365 (76%)
 Frame = +3

Query: 57   KKEMCTHPGFFAGMCMKCGQRADDDSAVPLKYIDKNLRLANDEMARLRDKDFKDLLVHRR 236
            K ++C+HPG F  MC++CGQ+ D +S V   YI K LRL +DE++RLR+ D K LL  R+
Sbjct: 38   KVDVCSHPGSFGSMCIRCGQKLDGESGVTFGYIHKGLRLHDDEISRLRNTDMKSLLC-RK 96

Query: 237  KXXXXXXXXXXXXNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMMTKLRPFV 416
            K            NS  LSD+  EE   L+ Q DSL D  K SLF+LD MHMMTKLRPFV
Sbjct: 97   KLYFVLDLDHTLLNSTHLSDLSSEESSLLD-QTDSLEDVSKGSLFKLDHMHMMTKLRPFV 155

Query: 417  RTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQKGLDIVL 596
            R+FL+EAS+MFEMYIYTMG+RPYALEMA LLDP   YF++++I++ D TQK+QKGLD+VL
Sbjct: 156  RSFLKEASEMFEMYIYTMGDRPYALEMAKLLDPRGVYFNAKVISRDDGTQKHQKGLDVVL 215

Query: 597  GRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYSSLSQLKTDESETEGAL 776
            G+ESAVLILDDTE  W ++K+NLILM+RYHFFASSC+ FGF+  SL++L+ DE ET+GAL
Sbjct: 216  GQESAVLILDDTEHAWMKHKDNLILMERYHFFASSCRQFGFNCKSLAELRNDEDETDGAL 275

Query: 777  ATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPISFPAENH 956
            A +LKVL+++H  FFD+  ++DL +RDVRQVL +VR EVL GC IVF+R+   + P+   
Sbjct: 276  AKILKVLRQVHCTFFDK-HQEDLVDRDVRQVLASVRSEVLGGCVIVFSRIFHGALPS--- 331

Query: 957  HLWRMAEQLGAICLTEVDESVTHVVSGDAGTDKSRWAVKEKKFLVNPGWIEASNFLWRKQ 1136
             L +MAEQ+GA CLTEVD SVTHVV+ DAGT+KSRWAVKE KFLV+P WIEA+NF W KQ
Sbjct: 332  -LRKMAEQMGATCLTEVDLSVTHVVATDAGTEKSRWAVKEHKFLVHPRWIEAANFFWEKQ 390

Query: 1137 PEEKF 1151
            PEE F
Sbjct: 391  PEENF 395


>ref|XP_004287124.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Fragaria vesca subsp. vesca]
          Length = 464

 Score =  437 bits (1125), Expect = e-120
 Identities = 229/390 (58%), Positives = 285/390 (73%), Gaps = 1/390 (0%)
 Frame = +3

Query: 3    LCSGITDPEASSSDGSLSKKEMCTHPGFFAGMCMKCGQRADDDSAVPLKYIDKNLRLAND 182
            L S     E S + G     ++C HPG F  MC  CGQR  + S V   YI K LRL + 
Sbjct: 76   LTSQAVSEEISEASGV---DDLCAHPGSFGDMCFLCGQRLIEQSGVTFGYIHKGLRLNDG 132

Query: 183  EMARLRDKDFKDLLVHRRKXXXXXXXXXXXXNSARLSDIKDEEDIYLNSQRDSLPDNIKN 362
            E+ RLR+ D K  L + +K            N+  L+ +  +E+ YL    DSLPD +K+
Sbjct: 133  EIDRLRNTDIKKSL-NNKKLYLVLDLDHTLLNTTLLNHVTAKEE-YLMCPPDSLPDVLKD 190

Query: 363  SLFRLDKMHMMTKLRPFVRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRI 542
            SLFRLD M MMTKLRPF+RTFL+EAS++FEMYIYTMG+R YALEMA LLDP  +YF  R+
Sbjct: 191  SLFRLDFMRMMTKLRPFIRTFLKEASEIFEMYIYTMGDRAYALEMAKLLDPKKEYFGDRV 250

Query: 543  IAQGDCTQKYQKGLDIVLGRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFS 722
            I++ D TQ++QKGLDIVLG+ESAVLILDDTE  W ++K+NLILM+RYHFF SSC  FGF+
Sbjct: 251  ISRDDGTQRHQKGLDIVLGQESAVLILDDTENAWIKHKDNLILMERYHFFRSSCAQFGFT 310

Query: 723  YSSLSQLKTDESETEGALATVLKVLQRIHGLFF-DEGRKDDLENRDVRQVLKAVRKEVLK 899
              SLS+LK+DESE EGALA VL +L+RIH +FF D G   +L +RDVRQVLK VRKEVL 
Sbjct: 311  CESLSELKSDESEPEGALANVLDLLKRIHKMFFYDLG--GNLVDRDVRQVLKIVRKEVLN 368

Query: 900  GCRIVFTRVIPISFPAENHHLWRMAEQLGAICLTEVDESVTHVVSGDAGTDKSRWAVKEK 1079
            GC++VF+R+IP    A +HHLW+MAEQLGAIC TEVD +VTHVV+ DAGT+KSRWAVK  
Sbjct: 369  GCKVVFSRIIPSKVLASSHHLWKMAEQLGAICSTEVDSTVTHVVALDAGTEKSRWAVKHN 428

Query: 1080 KFLVNPGWIEASNFLWRKQPEEKFPVVQAK 1169
            KFLV+P W+EA+N++W+KQ EEKFPV + K
Sbjct: 429  KFLVHPRWLEAANYMWQKQAEEKFPVTETK 458


>ref|XP_007149092.1| hypothetical protein PHAVU_005G040600g [Phaseolus vulgaris]
            gi|593697222|ref|XP_007149093.1| hypothetical protein
            PHAVU_005G040600g [Phaseolus vulgaris]
            gi|561022356|gb|ESW21086.1| hypothetical protein
            PHAVU_005G040600g [Phaseolus vulgaris]
            gi|561022357|gb|ESW21087.1| hypothetical protein
            PHAVU_005G040600g [Phaseolus vulgaris]
          Length = 443

 Score =  437 bits (1124), Expect = e-120
 Identities = 225/392 (57%), Positives = 292/392 (74%), Gaps = 8/392 (2%)
 Frame = +3

Query: 21   DPEASSSDGSLSKK-------EMCTHPGFFAGMCMKCGQRADDDSAVPLKYIDKNLRLAN 179
            + E S+S+G L +        ++CTHPG F  MC++CGQ+ D  S V   YI K LRL +
Sbjct: 59   ETEGSTSEGILKQNLETSVEVDVCTHPGSFGSMCIRCGQKLDGKSGVTFGYIHKGLRLHD 118

Query: 180  DEMARLRDKDFKDLLVHRRKXXXXXXXXXXXXNSARLSDIKDEEDIYLNSQRDSLPDNIK 359
            +E++RLR+ D K LL  R+K            NS  L+ +  EE   LN Q DSL D  K
Sbjct: 119  EEISRLRNTDMKSLLC-RKKLYLVLDLDHTLLNSTLLAHLSSEESHLLN-QTDSLQDVSK 176

Query: 360  NSLFRLDKMHMMTKLRPFVRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSR 539
             SLF+L+ MHMMTKLRPFVR+FL+EA++MFEMYIYTMG+RPYALEMA LLDP  +YF++R
Sbjct: 177  GSLFKLEHMHMMTKLRPFVRSFLKEATEMFEMYIYTMGDRPYALEMAKLLDPQGEYFNAR 236

Query: 540  IIAQGDCTQKYQKGLDIVLGRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGF 719
            +I++ D TQK+QKGLD+VLG+ESAVLILDDTE  W ++K+NLILM+RYHFFASSC+ FGF
Sbjct: 237  VISRDDGTQKHQKGLDVVLGQESAVLILDDTEHAWMKHKDNLILMERYHFFASSCRQFGF 296

Query: 720  SYSSLSQLKTDESETEGALATVLKVLQRIHGLFFDEGRK-DDLENRDVRQVLKAVRKEVL 896
            +  S ++L+ DE ET+GALA +LKVL+++H  FFD+ ++ DDL NRDVRQVL +VR EVL
Sbjct: 297  NCKSPAELRNDEDETDGALAKILKVLKQVHCTFFDKHQEDDDLVNRDVRQVLSSVRSEVL 356

Query: 897  KGCRIVFTRVIPISFPAENHHLWRMAEQLGAICLTEVDESVTHVVSGDAGTDKSRWAVKE 1076
             GC IVF+R+   + P+    L +MAEQ+GA CL EVD SVTH+V+ DAGT+KSRWA+KE
Sbjct: 357  SGCVIVFSRIFHGALPS----LQKMAEQMGATCLAEVDPSVTHIVATDAGTEKSRWALKE 412

Query: 1077 KKFLVNPGWIEASNFLWRKQPEEKFPVVQAKQ 1172
            KKFLV+P WIEA+N+ W KQPEE F +++ KQ
Sbjct: 413  KKFLVHPRWIEAANYFWEKQPEENF-IIKKKQ 443


>ref|XP_006575309.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Glycine max]
          Length = 442

 Score =  434 bits (1117), Expect = e-119
 Identities = 220/393 (55%), Positives = 297/393 (75%), Gaps = 7/393 (1%)
 Frame = +3

Query: 15   ITDPEASSSDGSLSKK-------EMCTHPGFFAGMCMKCGQRADDDSAVPLKYIDKNLRL 173
            I + E S+S+G + +        ++CTHPG F  MC++CGQ+ D +S V   YI K LRL
Sbjct: 59   IEETEGSTSEGIIKQSLEASMEVDVCTHPGSFGNMCIRCGQKLDGESGVTFGYIHKGLRL 118

Query: 174  ANDEMARLRDKDFKDLLVHRRKXXXXXXXXXXXXNSARLSDIKDEEDIYLNSQRDSLPDN 353
             ++E++RLR+ D K LL  R+K            NS  L+ +  EE   LN Q DSL D 
Sbjct: 119  HDEEISRLRNTDMKSLLC-RKKLYLVLDLDHTLLNSTHLAHLTSEESHLLN-QTDSLRDV 176

Query: 354  IKNSLFRLDKMHMMTKLRPFVRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFH 533
             K SLF+L+ M+MMTKLRPFVR FL+EAS+MFEMYIYTMG+RPYALEMA LLDP  +YF+
Sbjct: 177  SKGSLFKLEHMNMMTKLRPFVRPFLKEASEMFEMYIYTMGDRPYALEMAKLLDPQGEYFN 236

Query: 534  SRIIAQGDCTQKYQKGLDIVLGRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHF 713
            +++I++ D TQK+QKGLD+VLG+ESAVLILDDTE  W ++K+NLILM+RYHFF SSC+ F
Sbjct: 237  AKVISRDDGTQKHQKGLDVVLGQESAVLILDDTEHAWMKHKDNLILMERYHFFGSSCRQF 296

Query: 714  GFSYSSLSQLKTDESETEGALATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEV 893
            GF+  SL++LK+DE+ET+GALA +LKVL+++H +FFD  +++D ++RDVRQ+L  VR+EV
Sbjct: 297  GFNCKSLAELKSDENETDGALAKILKVLKQVHCMFFD--KQEDFDDRDVRQMLSLVRREV 354

Query: 894  LKGCRIVFTRVIPISFPAENHHLWRMAEQLGAICLTEVDESVTHVVSGDAGTDKSRWAVK 1073
            L GC I+F+R++  + P+    L +MAEQ+GA CLTE+D SVTHVV+ DAGT+K RWAVK
Sbjct: 355  LSGCVIIFSRIVHGAIPS----LRKMAEQMGATCLTEIDPSVTHVVATDAGTEKCRWAVK 410

Query: 1074 EKKFLVNPGWIEASNFLWRKQPEEKFPVVQAKQ 1172
            EKKF+V+P WIEA+N+ W+KQPEE F +++ KQ
Sbjct: 411  EKKFVVHPLWIEAANYFWQKQPEENF-ILKKKQ 442

