BLASTX nr result

ID: Mentha28_contig00010358 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00010358
         (1425 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU29592.1| hypothetical protein MIMGU_mgv1a017809mg [Mimulus...   499   e-139
gb|EYU37264.1| hypothetical protein MIMGU_mgv1a005925mg [Mimulus...   499   e-138
ref|XP_006343662.1| PREDICTED: RNA polymerase II C-terminal doma...   459   e-126
ref|XP_004242582.1| PREDICTED: RNA polymerase II C-terminal doma...   455   e-125
ref|XP_004242583.1| PREDICTED: RNA polymerase II C-terminal doma...   455   e-125
ref|XP_007014445.1| RNA polymerase II ctd phosphatase, putative ...   434   e-119
gb|EPS63533.1| hypothetical protein M569_11249, partial [Genlise...   434   e-119
ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative ...   432   e-118
ref|XP_007204345.1| hypothetical protein PRUPE_ppa005647mg [Prun...   424   e-116
ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Popu...   423   e-115
ref|XP_006486243.1| PREDICTED: RNA polymerase II C-terminal doma...   421   e-115
ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal doma...   420   e-115
ref|XP_007014446.1| RNA polymerase II ctd phosphatase, putative ...   414   e-113
gb|EXC26161.1| RNA polymerase II C-terminal domain phosphatase-l...   406   e-110
ref|XP_002324546.2| hypothetical protein POPTR_0018s11760g [Popu...   406   e-110
ref|XP_004287124.1| PREDICTED: RNA polymerase II C-terminal doma...   395   e-107
ref|XP_007154892.1| hypothetical protein PHAVU_003G156800g [Phas...   394   e-107
ref|XP_007154891.1| hypothetical protein PHAVU_003G156800g [Phas...   394   e-107
ref|XP_007149092.1| hypothetical protein PHAVU_005G040600g [Phas...   392   e-106
ref|XP_006838087.1| hypothetical protein AMTR_s00106p00017820 [A...   389   e-105

>gb|EYU29592.1| hypothetical protein MIMGU_mgv1a017809mg [Mimulus guttatus]
          Length = 466

 Score =  499 bits (1286), Expect = e-139
 Identities = 252/376 (67%), Positives = 297/376 (78%), Gaps = 21/376 (5%)
 Frame = -1

Query: 1407 GSLSKKEICTHPGFFAGMCMKCGQRADDDSAVALKYIDKNLRLANDEMARLRDKDFKDLL 1228
            GS  KK  C HPG +AGMCM+CGQ+ DD+S VA  YI KNLRLANDEM RLRD+D K++L
Sbjct: 93   GSSPKKNTCLHPGVYAGMCMRCGQKMDDESGVAFGYIHKNLRLANDEMDRLRDRDLKNML 152

Query: 1227 VHRRKXXXXXXXXXXXLNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMMTKL 1048
             HR K           LNSARL DI +EE  YLN QRD+LPD +K+SLFRLD ++MMTKL
Sbjct: 153  RHR-KLCLVLDLDHTLLNSARLHDITEEEG-YLNGQRDALPDTLKSSLFRLDWIYMMTKL 210

Query: 1047 RPFVRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQKGL 868
            RPFV TFL+EASK+FEMYIYTMGERPYALEMA LLDPG+ YF+SRIIAQGDCT K+QKGL
Sbjct: 211  RPFVHTFLKEASKLFEMYIYTMGERPYALEMAKLLDPGDIYFNSRIIAQGDCTHKHQKGL 270

Query: 867  DIVLGRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYNSLSQLKTDESET 688
            D+VLG+ESAV+ILDDTE VW ++K+NLILM+RYHFFASSCK FGF+  SLS+L++DES+T
Sbjct: 271  DVVLGQESAVVILDDTEVVWSKHKDNLILMERYHFFASSCKQFGFNCKSLSELRSDESDT 330

Query: 687  EGALATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPISFP 508
            EGAL TVLK LQ+IH LFFD  RKD LE+RDVR V+K +RKEVLKGC++VFTRV P +FP
Sbjct: 331  EGALPTVLKRLQQIHSLFFDVERKDSLEDRDVRLVMKTLRKEVLKGCKVVFTRVFPTNFP 390

Query: 507  AENHHLWRMAEQLGA---------------------XSRWAVKEKKFLVNPGWIEASNFM 391
            AE+H LW+MAE+LGA                      SRWA+KEKKFLV+P WIEASN+M
Sbjct: 391  AEHHSLWKMAEKLGATCCNEIDPCITHVVSMDAGTDKSRWALKEKKFLVHPRWIEASNYM 450

Query: 390  WRKQPEEKFPVVQAKQ 343
            W+KQPEE FPV QA +
Sbjct: 451  WQKQPEENFPVSQANK 466


>gb|EYU37264.1| hypothetical protein MIMGU_mgv1a005925mg [Mimulus guttatus]
          Length = 464

 Score =  499 bits (1284), Expect = e-138
 Identities = 251/376 (66%), Positives = 301/376 (80%), Gaps = 21/376 (5%)
 Frame = -1

Query: 1407 GSLSKKEICTHPGFFAGMCMKCGQRADDDSAVALKYIDKNLRLANDEMARLRDKDFKDLL 1228
            GS  KK  C HPG +AGMCMKCGQ+ DD+S VA  YI KNLRLANDE+ RLRD+D K++L
Sbjct: 91   GSSPKKNTCLHPGVYAGMCMKCGQKMDDESGVAFGYIHKNLRLANDEIDRLRDRDLKNML 150

Query: 1227 VHRRKXXXXXXXXXXXLNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMMTKL 1048
             HR K           LNSARL DI ++E  YLN QR++LPDN+KNSLFRLD ++MMTKL
Sbjct: 151  RHR-KLCLVLDLDHTLLNSARLHDITEQEG-YLNGQREALPDNLKNSLFRLDWIYMMTKL 208

Query: 1047 RPFVRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQKGL 868
            RP+V TFL+EASK+FEMYIYTMGERPYALEMA LLDPG+ YF+SRIIAQGDCTQK+QKGL
Sbjct: 209  RPYVHTFLKEASKLFEMYIYTMGERPYALEMAKLLDPGDIYFNSRIIAQGDCTQKHQKGL 268

Query: 867  DIVLGRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYNSLSQLKTDESET 688
            D+VLG+ESAV+ILDDTEAVW ++K+NLILM+RYHFFASSCK FGF+  SLS+L++DES+T
Sbjct: 269  DVVLGQESAVVILDDTEAVWSKHKDNLILMERYHFFASSCKQFGFNCKSLSELQSDESDT 328

Query: 687  EGALATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPISFP 508
            +GALA+VLK LQ+IH LFFD  RKD LE+RDVR V+K +RKEVLKGC++VFTRV P +FP
Sbjct: 329  QGALASVLKRLQQIHTLFFDAERKDSLEDRDVRLVMKTLRKEVLKGCKVVFTRVFPTNFP 388

Query: 507  AENHHLWRMAEQLGA---------------------XSRWAVKEKKFLVNPGWIEASNFM 391
            +E+H LW+MAE+LGA                      SRWAV+EKKFLV+P WIEASN+M
Sbjct: 389  SEHHSLWKMAEKLGATCCNEIDPSVTHVVSMDAGTDKSRWAVQEKKFLVHPRWIEASNYM 448

Query: 390  WRKQPEEKFPVVQAKQ 343
            W+KQ EE FPV QAK+
Sbjct: 449  WQKQTEENFPVSQAKK 464


>ref|XP_006343662.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Solanum tuberosum]
          Length = 478

 Score =  459 bits (1180), Expect = e-126
 Identities = 232/374 (62%), Positives = 281/374 (75%), Gaps = 21/374 (5%)
 Frame = -1

Query: 1416 ETIGSLSKKEICTHPGFFAGMCMKCGQRADDDSAVALKYIDKNLRLANDEMARLRDKDFK 1237
            ET G+    ++CTHPG   GMC++CGQ+ +D+S VA  YI KNLRLA+DE+ARLRDKD K
Sbjct: 105  ETSGASLALDVCTHPGVMGGMCIRCGQKVEDESGVAFGYIHKNLRLADDEVARLRDKDLK 164

Query: 1236 DLLVHRRKXXXXXXXXXXXLNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMM 1057
            +LL H+ K           LNS RL+DI  EE  YL  QR+ LPD ++N+LF+LD +HMM
Sbjct: 165  NLLRHK-KLILVLDLDHTLLNSTRLADISAEES-YLKDQREVLPDALRNNLFKLDWIHMM 222

Query: 1056 TKLRPFVRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQ 877
            TKLRPFV TFL+EAS +FEMYIYTMGERPYALEMA LLDPG  YFHSR+IAQ D T+++Q
Sbjct: 223  TKLRPFVHTFLKEASSLFEMYIYTMGERPYALEMASLLDPGGIYFHSRVIAQSDSTRRHQ 282

Query: 876  KGLDIVLGRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYNSLSQLKTDE 697
            KGLD+VLG+ESAVLILDDTE VW +++ENLILMDRYHFF SSC+ FG    SLS+ K+DE
Sbjct: 283  KGLDVVLGQESAVLILDDTEVVWGKHRENLILMDRYHFFTSSCRQFGLKCKSLSEQKSDE 342

Query: 696  SETEGALATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPI 517
            +E EGALA+VL+VLQRIH LFFD  R D++  RDVRQVLK VRKE+LKGC+IVFT VIPI
Sbjct: 343  NEAEGALASVLEVLQRIHRLFFDLERGDNIMERDVRQVLKTVRKEILKGCKIVFTGVIPI 402

Query: 516  SFPAENHHLWRMAEQLGA---------------------XSRWAVKEKKFLVNPGWIEAS 400
                ENHH W++AE+LGA                      SR A++EKKFLV+P WIEA+
Sbjct: 403  QCQPENHHYWKLAEKLGATFSTEVDESVTHVVSMNDKTEKSRQALREKKFLVHPSWIEAA 462

Query: 399  NFMWRKQPEEKFPV 358
            N++WRK PEE FPV
Sbjct: 463  NYLWRKPPEENFPV 476


>ref|XP_004242582.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Solanum lycopersicum]
          Length = 512

 Score =  455 bits (1171), Expect = e-125
 Identities = 231/374 (61%), Positives = 281/374 (75%), Gaps = 21/374 (5%)
 Frame = -1

Query: 1416 ETIGSLSKKEICTHPGFFAGMCMKCGQRADDDSAVALKYIDKNLRLANDEMARLRDKDFK 1237
            ET G+    ++CTHPG   GMC++CGQ+ +D+S VA  YI KNLRLA+DE+ARLR+KD K
Sbjct: 139  ETSGASMALDVCTHPGVMGGMCIRCGQKVEDESGVAFGYIHKNLRLADDEVARLREKDLK 198

Query: 1236 DLLVHRRKXXXXXXXXXXXLNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMM 1057
            +LL HR K           LNS RL+DI  EE  YL  QR+ LPD ++++LF+LD +HMM
Sbjct: 199  NLLRHR-KLILVLDLDHTLLNSTRLADISAEES-YLKDQREVLPDALRSNLFKLDWIHMM 256

Query: 1056 TKLRPFVRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQ 877
            TKLRPFV TFL+EAS +FEMYIYTMGERPYALEMA LLDPG  YFHSR+IAQ D T+++Q
Sbjct: 257  TKLRPFVHTFLKEASSLFEMYIYTMGERPYALEMAKLLDPGGIYFHSRVIAQSDSTRRHQ 316

Query: 876  KGLDIVLGRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYNSLSQLKTDE 697
            KGLD+VLG+ESAVLILDDTE VW +++ENLILMDRYHFF SSC+ FG    SLS+ K+DE
Sbjct: 317  KGLDVVLGQESAVLILDDTEVVWGKHRENLILMDRYHFFTSSCRQFGLKCKSLSEQKSDE 376

Query: 696  SETEGALATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPI 517
            +E EGALA+VL+VLQRIH LFFD  R D++  RDVRQVLK VRKE+LKGC+IVFT VIPI
Sbjct: 377  NEAEGALASVLEVLQRIHRLFFDPERGDNIMERDVRQVLKTVRKEILKGCKIVFTGVIPI 436

Query: 516  SFPAENHHLWRMAEQLGA---------------------XSRWAVKEKKFLVNPGWIEAS 400
                ENH+ W++AE+LGA                      SR AV+EKKFLV+P WIEA+
Sbjct: 437  QCQPENHYYWKLAEKLGATFSTEVDESVTHVVSMNDKTEKSRQAVREKKFLVHPRWIEAA 496

Query: 399  NFMWRKQPEEKFPV 358
            N++WRK PEE FPV
Sbjct: 497  NYLWRKPPEENFPV 510


>ref|XP_004242583.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Solanum lycopersicum]
          Length = 472

 Score =  455 bits (1170), Expect = e-125
 Identities = 231/374 (61%), Positives = 281/374 (75%), Gaps = 21/374 (5%)
 Frame = -1

Query: 1416 ETIGSLSKKEICTHPGFFAGMCMKCGQRADDDSAVALKYIDKNLRLANDEMARLRDKDFK 1237
            ET G+    ++CTHPG   GMC++CGQ+ +D+S VA  YI KNLRLA+DE+ARLR+KD K
Sbjct: 99   ETSGASLALDVCTHPGVMGGMCIRCGQKVEDESGVAFGYIHKNLRLADDEVARLREKDLK 158

Query: 1236 DLLVHRRKXXXXXXXXXXXLNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMM 1057
            +LL HR K           LNS RL+DI  EE  YL  QR+ LPD ++++LF+LD +HMM
Sbjct: 159  NLLRHR-KLILVLDLDHTLLNSTRLADISAEES-YLKDQREVLPDALRSNLFKLDWIHMM 216

Query: 1056 TKLRPFVRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQ 877
            TKLRPFV TFL+EAS +FEMYIYTMGERPYALEMA LLDPG  YFHSR+IAQ D T+++Q
Sbjct: 217  TKLRPFVHTFLKEASSLFEMYIYTMGERPYALEMAKLLDPGGIYFHSRVIAQSDSTRRHQ 276

Query: 876  KGLDIVLGRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYNSLSQLKTDE 697
            KGLD+VLG+ESAVLILDDTE VW +++ENLILMDRYHFF SSC+ FG    SLS+ K+DE
Sbjct: 277  KGLDVVLGQESAVLILDDTEVVWGKHRENLILMDRYHFFTSSCRQFGLKCKSLSEQKSDE 336

Query: 696  SETEGALATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPI 517
            +E EGALA+VL+VLQRIH LFFD  R D++  RDVRQVLK VRKE+LKGC+IVFT VIPI
Sbjct: 337  NEAEGALASVLEVLQRIHRLFFDPERGDNIMERDVRQVLKTVRKEILKGCKIVFTGVIPI 396

Query: 516  SFPAENHHLWRMAEQLGA---------------------XSRWAVKEKKFLVNPGWIEAS 400
                ENH+ W++AE+LGA                      SR AV+EKKFLV+P WIEA+
Sbjct: 397  QCQPENHYYWKLAEKLGATFSTEVDESVTHVVSMNDKTEKSRQAVREKKFLVHPRWIEAA 456

Query: 399  NFMWRKQPEEKFPV 358
            N++WRK PEE FPV
Sbjct: 457  NYLWRKPPEENFPV 470


>ref|XP_007014445.1| RNA polymerase II ctd phosphatase, putative isoform 1 [Theobroma
            cacao] gi|508784808|gb|EOY32064.1| RNA polymerase II ctd
            phosphatase, putative isoform 1 [Theobroma cacao]
          Length = 469

 Score =  434 bits (1117), Expect = e-119
 Identities = 225/371 (60%), Positives = 273/371 (73%), Gaps = 21/371 (5%)
 Frame = -1

Query: 1395 KKEICTHPGFFAGMCMKCGQRADDDSAVALKYIDKNLRLANDEMARLRDKDFKDLLVHRR 1216
            KK+ICTHPG F  MC+ CGQR DD+S V   YI K LRL NDE+ RLR  D K+LL H+ 
Sbjct: 100  KKDICTHPGSFGQMCILCGQRLDDESGVTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHK- 158

Query: 1215 KXXXXXXXXXXXLNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMMTKLRPFV 1036
            K           LNS +L  +  +E+ YL  Q DSL D  + SLF LD MHMMTKLRPFV
Sbjct: 159  KLYLVLDLDHTLLNSTQLMHLTPDEE-YLKGQSDSLQDVSRGSLFMLDFMHMMTKLRPFV 217

Query: 1035 RTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQKGLDIVL 856
            RTFL+EAS+MFEMYIYTMG+RPYALEMA LLDP  +YF  R+I++ D TQK+QKGLD+VL
Sbjct: 218  RTFLKEASEMFEMYIYTMGDRPYALEMAKLLDPRREYFSDRVISRDDGTQKHQKGLDVVL 277

Query: 855  GRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYNSLSQLKTDESETEGAL 676
            G+ESAV+ILDDTE  W ++K+NLILM+RYH+FASSC  FG+   SLSQLK+DESE +GAL
Sbjct: 278  GQESAVVILDDTENAWMKHKDNLILMERYHYFASSCHQFGYKCKSLSQLKSDESEPDGAL 337

Query: 675  ATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPISFPAENH 496
            A+VLK L++IH +FFDE    +L +RDVRQVLK V++EVLKGC+IVF+ V P +FPAE+H
Sbjct: 338  ASVLKALRQIHHMFFDE-LDCNLASRDVRQVLKTVQEEVLKGCKIVFSHVFPTNFPAESH 396

Query: 495  HLWRMAEQLGA---------------------XSRWAVKEKKFLVNPGWIEASNFMWRKQ 379
             LW+MAEQLGA                      SRWAVKEKKFLV+P WIEA+N++W+KQ
Sbjct: 397  PLWKMAEQLGATCSTETDLSVTHVVSTDAGTEKSRWAVKEKKFLVHPRWIEATNYLWQKQ 456

Query: 378  PEEKFPVVQAK 346
            PEE FPV Q K
Sbjct: 457  PEENFPVSQGK 467


>gb|EPS63533.1| hypothetical protein M569_11249, partial [Genlisea aurea]
          Length = 386

 Score =  434 bits (1116), Expect = e-119
 Identities = 222/371 (59%), Positives = 277/371 (74%), Gaps = 22/371 (5%)
 Frame = -1

Query: 1404 SLSKKEICTHPGFFAGMCMKCGQRADDDSAVALKYIDKNLRLANDEMARLRDKDFKDLLV 1225
            S+S+  +C HPG + GMC+ CG   +++S +   YI KNLRLA+DE+ARLR KD K LL 
Sbjct: 15   SISESSVCPHPGIYGGMCIMCGGIMEEESGIPFGYIHKNLRLADDEVARLRYKDLKALL- 73

Query: 1224 HRRKXXXXXXXXXXXLNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMMTKLR 1045
             RRK           LNS+RLSD+  EE  +LN     LPD+++NSLFRL+ + MMTKLR
Sbjct: 74   GRRKLHLVLDLDHTLLNSSRLSDLTGEE-CHLNVHSSDLPDSMRNSLFRLEHIQMMTKLR 132

Query: 1044 PFVRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQKGLD 865
            PFVRTFL+EAS++FEM+IYTMGERPYALEMA LLDPG+ YFHSRIIAQGDCTQK+QKGLD
Sbjct: 133  PFVRTFLKEASEIFEMHIYTMGERPYALEMAKLLDPGDTYFHSRIIAQGDCTQKHQKGLD 192

Query: 864  IVLGRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYNSLSQLKTDESETE 685
            +VLG+ES VLILDDTE VW ++KENLILM+RY FF SSCK FGF+  SL++L++DESE+E
Sbjct: 193  VVLGQESTVLILDDTEGVWGKHKENLILMERYLFFGSSCKQFGFTCKSLAELRSDESESE 252

Query: 684  GALATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPIS-FP 508
            GAL+T L  L+RIH LFFD    D+LE RDVR+VL +VRKE+L+GC+IVF+RV P S F 
Sbjct: 253  GALSTALATLKRIHSLFFDGEHDDELEARDVRKVLHSVRKEILEGCKIVFSRVFPSSFFQ 312

Query: 507  AENHHLWRMAEQLGA---------------------XSRWAVKEKKFLVNPGWIEASNFM 391
            AENH LW+M  +LGA                      SRWA+++ K LV+P W+EAS +M
Sbjct: 313  AENHQLWKMGVRLGATCSREVDSTVTHVVAVDAGTDKSRWALRQGKHLVHPRWLEASYYM 372

Query: 390  WRKQPEEKFPV 358
            W++QPEEKFPV
Sbjct: 373  WKRQPEEKFPV 383


>ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
            gi|223534449|gb|EEF36151.1| RNA polymerase II ctd
            phosphatase, putative [Ricinus communis]
          Length = 478

 Score =  432 bits (1110), Expect = e-118
 Identities = 223/381 (58%), Positives = 279/381 (73%), Gaps = 21/381 (5%)
 Frame = -1

Query: 1425 ALTETIGSLSKKEICTHPGFFAGMCMKCGQRADDDSAVALKYIDKNLRLANDEMARLRDK 1246
            +L +T+ + S K  CTHPG F  MC+ CG+R  +++ V   YI K LRLANDE+ RLR+ 
Sbjct: 97   SLDQTLVASSSKVACTHPGSFGDMCILCGERLIEETGVTFGYIHKGLRLANDEIVRLRNT 156

Query: 1245 DFKDLLVHRRKXXXXXXXXXXXLNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKM 1066
            D K+LL HR K           LNS +L  +  EE+ YL SQ DS+ D    SLF +D M
Sbjct: 157  DMKNLLRHR-KLYLVLDLDHTLLNSTQLMHLTAEEE-YLKSQIDSMQDVSNGSLFMVDFM 214

Query: 1065 HMMTKLRPFVRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQ 886
            HMMTKLRPF+RTFL+EAS+MFEMYIYTMG+R YALEMA  LDPG +YF++R+I++ D TQ
Sbjct: 215  HMMTKLRPFIRTFLKEASQMFEMYIYTMGDRAYALEMAKFLDPGREYFNARVISRDDGTQ 274

Query: 885  KYQKGLDIVLGRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYNSLSQLK 706
            ++QKGLDIVLG+ESAVLILDDTE  W ++K+NLILM+RYHFFASSC+ FGF   SLSQLK
Sbjct: 275  RHQKGLDIVLGQESAVLILDDTENAWTKHKDNLILMERYHFFASSCRQFGFECKSLSQLK 334

Query: 705  TDESETEGALATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRV 526
            +DE+E++GALA+VLKVL+RIH +FFDE  +D ++ RDVRQVL  VRK+VLKGC+IVF+RV
Sbjct: 335  SDENESDGALASVLKVLRRIHHIFFDE-LEDAIDGRDVRQVLSTVRKDVLKGCKIVFSRV 393

Query: 525  IPISFPAENHHLWRMAEQLGA---------------------XSRWAVKEKKFLVNPGWI 409
             P  F A+NHHLW+MAEQLGA                      SRWA+K  KFLV+P WI
Sbjct: 394  FPTQFQADNHHLWKMAEQLGATCSREVDPSVTHVVSAEAGTEKSRWALKNDKFLVHPRWI 453

Query: 408  EASNFMWRKQPEEKFPVVQAK 346
            EA+N+MW++QPEE F V Q K
Sbjct: 454  EATNYMWQRQPEENFSVNQPK 474


>ref|XP_007204345.1| hypothetical protein PRUPE_ppa005647mg [Prunus persica]
            gi|462399876|gb|EMJ05544.1| hypothetical protein
            PRUPE_ppa005647mg [Prunus persica]
          Length = 449

 Score =  424 bits (1091), Expect = e-116
 Identities = 226/371 (60%), Positives = 266/371 (71%), Gaps = 21/371 (5%)
 Frame = -1

Query: 1395 KKEICTHPGFFAGMCMKCGQRADDDSAVALKYIDKNLRLANDEMARLRDKDFKDLLVHRR 1216
            KK+ICTHPG    +C+ CGQR D+ S V L YI K+  L NDE+ R+R  D K  L H +
Sbjct: 81   KKDICTHPGSVKDLCIVCGQRVDEKSGVPLGYIHKDFWLNNDEIDRVRSTDIKKSL-HLK 139

Query: 1215 KXXXXXXXXXXXLNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMMTKLRPFV 1036
            K           LNS  L+ +  EE+ YL+SQ DSL D    SLFR+D MHMMTKLRPFV
Sbjct: 140  KLYLVLDLDHTLLNSTHLNHMTAEEE-YLHSQTDSLQDVSDGSLFRVDVMHMMTKLRPFV 198

Query: 1035 RTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQKGLDIVL 856
            R FL+EAS+MFEMYIYTMGER YALEMA LLDP  +YF  R+I++ D TQK+QKGLD+VL
Sbjct: 199  RKFLKEASEMFEMYIYTMGERAYALEMAKLLDPRKEYFGDRVISRDDGTQKHQKGLDVVL 258

Query: 855  GRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYNSLSQLKTDESETEGAL 676
            G ESA LILDDTE  W ++K+NLILM+RYHFF SSC  FGF   SLS+LK+DESE EGAL
Sbjct: 259  GHESAALILDDTENAWTKHKDNLILMERYHFFRSSCHQFGFHCKSLSELKSDESEPEGAL 318

Query: 675  ATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPISFPAENH 496
            ATVL+VL+RIH +FF E  KD+L +RDVRQVLK +RKE+LKGC+IVF+RV P  F AENH
Sbjct: 319  ATVLEVLKRIHNMFFYES-KDNLIDRDVRQVLKTLRKEILKGCKIVFSRVFPSKFQAENH 377

Query: 495  HLWRMAEQLGA---------------------XSRWAVKEKKFLVNPGWIEASNFMWRKQ 379
             LW+MAEQLGA                      SRWAVKEKKFLV+P WIEASN+MW KQ
Sbjct: 378  QLWKMAEQLGATCSTELDLSVTHVVSTDAGTEKSRWAVKEKKFLVHPQWIEASNYMWLKQ 437

Query: 378  PEEKFPVVQAK 346
             E+KFPV Q K
Sbjct: 438  AEDKFPVNQTK 448


>ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa]
            gi|550318538|gb|EEF03112.2| hypothetical protein
            POPTR_0018s11760g [Populus trichocarpa]
          Length = 472

 Score =  423 bits (1087), Expect = e-115
 Identities = 218/370 (58%), Positives = 269/370 (72%), Gaps = 21/370 (5%)
 Frame = -1

Query: 1392 KEICTHPGFFAGMCMKCGQRADDDSAVALKYIDKNLRLANDEMARLRDKDFKDLLVHRRK 1213
            KEICTHPG F  MC+ CGQ  D +S V   YI K LRL NDE+ RLR+ D K+LL H+ K
Sbjct: 105  KEICTHPGSFGTMCIVCGQLLDGESGVTFGYIHKGLRLGNDEIVRLRNTDMKNLLRHK-K 163

Query: 1212 XXXXXXXXXXXLNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMMTKLRPFVR 1033
                       LNS +L  +  +E+ YLN Q DSL D  K SLF L  M MMTKLRPFVR
Sbjct: 164  LYLILDLDHTLLNSTQLMHMTLDEE-YLNGQTDSLQDVSKGSLFMLSSMQMMTKLRPFVR 222

Query: 1032 TFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQKGLDIVLG 853
            TFL+EAS+MFEMYIYTMG+R YALEMA LLDPG +YF++++I++ D TQ++QKGLD+VLG
Sbjct: 223  TFLKEASQMFEMYIYTMGDRAYALEMAKLLDPGREYFNAKVISRDDGTQRHQKGLDVVLG 282

Query: 852  RESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYNSLSQLKTDESETEGALA 673
            +ESAVLILDDTE  W ++K+NLILM+RYHFFASSC  FGF+  SLS+ KTDESE+EGALA
Sbjct: 283  QESAVLILDDTENAWMKHKDNLILMERYHFFASSCHQFGFNCKSLSEQKTDESESEGALA 342

Query: 672  TVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPISFPAENHH 493
            ++LKVL++IH +FF+E  +++++ RDVRQVLK VRK+VLKGC+IVF+RV P    A+NHH
Sbjct: 343  SILKVLRKIHQIFFEE-LEENMDGRDVRQVLKTVRKDVLKGCKIVFSRVFPTQSQADNHH 401

Query: 492  LWRMAEQLGA---------------------XSRWAVKEKKFLVNPGWIEASNFMWRKQP 376
            LWRMAEQLGA                      S WA+K  KFLV PGWIEA+N+ W++QP
Sbjct: 402  LWRMAEQLGATCSTELDPSVTHVVSKDSGTEKSHWALKHNKFLVQPGWIEAANYFWQRQP 461

Query: 375  EEKFPVVQAK 346
            EE F   Q K
Sbjct: 462  EENFSFNQIK 471


>ref|XP_006486243.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like isoform X1 [Citrus sinensis]
            gi|568865772|ref|XP_006486244.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 4-like
            isoform X2 [Citrus sinensis]
            gi|568865774|ref|XP_006486245.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 4-like
            isoform X3 [Citrus sinensis]
          Length = 478

 Score =  421 bits (1083), Expect = e-115
 Identities = 219/367 (59%), Positives = 265/367 (72%), Gaps = 21/367 (5%)
 Frame = -1

Query: 1383 CTHPGFFAGMCMKCGQRADDDSAVALKYIDKNLRLANDEMARLRDKDFKDLLVHRRKXXX 1204
            C HPG   GMC +CG+R +++S V   YI K LRL NDE+ RLR+ D K LL HR K   
Sbjct: 102  CPHPGSLGGMCYRCGKRLEEESGVTFSYICKGLRLGNDEIDRLRNTDMKHLLRHR-KLYL 160

Query: 1203 XXXXXXXXLNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMMTKLRPFVRTFL 1024
                    LNS  L  +  EED YL SQ DSL D  K SLF L  M+MMTKLRPFV TFL
Sbjct: 161  ILDLDHTLLNSTLLLHLTPEED-YLKSQADSLQDVSKGSLFMLAFMNMMTKLRPFVHTFL 219

Query: 1023 EEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQKGLDIVLGRES 844
            +EAS+MFEMYIYTMG+RPYALEMA LLDP  +YF++R+I++ D TQ++QKGLD+VLG+ES
Sbjct: 220  KEASEMFEMYIYTMGDRPYALEMAKLLDPSREYFNARVISRDDGTQRHQKGLDVVLGQES 279

Query: 843  AVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYNSLSQLKTDESETEGALATVL 664
            AVLILDDTE  W ++++NLILM+RYHFFASSC+ FG+   SLSQL++DESE EGALA+VL
Sbjct: 280  AVLILDDTENAWTKHRDNLILMERYHFFASSCRQFGYHCQSLSQLRSDESELEGALASVL 339

Query: 663  KVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPISFPAENHHLWR 484
            KVL+RIH +FFDE   +DL  RDVRQVLK VR EVLKGC++VF+ V P  FPA+ H+LW+
Sbjct: 340  KVLKRIHNIFFDE-LANDLAGRDVRQVLKMVRGEVLKGCKLVFSHVFPTKFPADTHYLWK 398

Query: 483  MAEQLGA---------------------XSRWAVKEKKFLVNPGWIEASNFMWRKQPEEK 367
            MAEQLGA                      SRWA KE KFLV+P WIE +NF+W++QPEE 
Sbjct: 399  MAEQLGATCLIELDPSVTHVVSTDARTEKSRWAAKEAKFLVDPRWIETANFLWQRQPEEN 458

Query: 366  FPVVQAK 346
            FPV Q K
Sbjct: 459  FPVKQNK 465


>ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Cucumis sativus]
          Length = 452

 Score =  420 bits (1080), Expect = e-115
 Identities = 217/378 (57%), Positives = 272/378 (71%), Gaps = 21/378 (5%)
 Frame = -1

Query: 1416 ETIGSLSKKEICTHPGFFAGMCMKCGQRADDDSAVALKYIDKNLRLANDEMARLRDKDFK 1237
            +++  LSK+++C+HPG F  MC+ CGQR D++S V   YI K LRL NDE+ R+R+K+ K
Sbjct: 71   QSLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKELRLNNDEINRMRNKEMK 130

Query: 1236 DLLVHRRKXXXXXXXXXXXLNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMM 1057
            +LL  R+K           LNS  L  +  EE+ YL SQ DSL D  K SLF L+ +H M
Sbjct: 131  ELL-QRKKLILVLDLDHTLLNSTELRYLTVEEE-YLRSQTDSLDDVTKGSLFLLNSVHTM 188

Query: 1056 TKLRPFVRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQ 877
            TKLRPFV +FL+EASK+FEMYIYTMGER YA EMA LLDP  +YF S++I++ D TQK+Q
Sbjct: 189  TKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQ 248

Query: 876  KGLDIVLGRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYNSLSQLKTDE 697
            KGLD+VLG+ESAVLILDDTE  W ++KENLILM+RYHFFASSC+ FGF+  SLS+LK DE
Sbjct: 249  KGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCKSLSELKNDE 308

Query: 696  SETEGALATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPI 517
            SET+GAL T+LKVL+++H +FF+E    DL +RDVRQVLK VR EVL+GC++VF+RV P 
Sbjct: 309  SETDGALTTILKVLKQVHHMFFNE-VSGDLVDRDVRQVLKTVRAEVLEGCKVVFSRVFPT 367

Query: 516  SFPAENHHLWRMAEQLGA---------------------XSRWAVKEKKFLVNPGWIEAS 400
             F AENH LW+M EQLG                       SRWA+KEKKFLV+P WIEAS
Sbjct: 368  KFQAENHQLWKMVEQLGGTCSTELDQSVTHVVATDAGTEKSRWALKEKKFLVHPRWIEAS 427

Query: 399  NFMWRKQPEEKFPVVQAK 346
            N+ W++Q EE F V Q K
Sbjct: 428  NYFWKRQMEENFTVEQTK 445


>ref|XP_007014446.1| RNA polymerase II ctd phosphatase, putative isoform 2 [Theobroma
            cacao] gi|508784809|gb|EOY32065.1| RNA polymerase II ctd
            phosphatase, putative isoform 2 [Theobroma cacao]
          Length = 357

 Score =  414 bits (1064), Expect = e-113
 Identities = 216/358 (60%), Positives = 263/358 (73%), Gaps = 21/358 (5%)
 Frame = -1

Query: 1356 MCMKCGQRADDDSAVALKYIDKNLRLANDEMARLRDKDFKDLLVHRRKXXXXXXXXXXXL 1177
            MC+ CGQR DD+S V   YI K LRL NDE+ RLR  D K+LL H+ K           L
Sbjct: 1    MCILCGQRLDDESGVTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHK-KLYLVLDLDHTLL 59

Query: 1176 NSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMMTKLRPFVRTFLEEASKMFEM 997
            NS +L  +  +E+ YL  Q DSL D  + SLF LD MHMMTKLRPFVRTFL+EAS+MFEM
Sbjct: 60   NSTQLMHLTPDEE-YLKGQSDSLQDVSRGSLFMLDFMHMMTKLRPFVRTFLKEASEMFEM 118

Query: 996  YIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQKGLDIVLGRESAVLILDDTE 817
            YIYTMG+RPYALEMA LLDP  +YF  R+I++ D TQK+QKGLD+VLG+ESAV+ILDDTE
Sbjct: 119  YIYTMGDRPYALEMAKLLDPRREYFSDRVISRDDGTQKHQKGLDVVLGQESAVVILDDTE 178

Query: 816  AVWKENKENLILMDRYHFFASSCKHFGFSYNSLSQLKTDESETEGALATVLKVLQRIHGL 637
              W ++K+NLILM+RYH+FASSC  FG+   SLSQLK+DESE +GALA+VLK L++IH +
Sbjct: 179  NAWMKHKDNLILMERYHYFASSCHQFGYKCKSLSQLKSDESEPDGALASVLKALRQIHHM 238

Query: 636  FFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPISFPAENHHLWRMAEQLGA-- 463
            FFDE    +L +RDVRQVLK V++EVLKGC+IVF+ V P +FPAE+H LW+MAEQLGA  
Sbjct: 239  FFDE-LDCNLASRDVRQVLKTVQEEVLKGCKIVFSHVFPTNFPAESHPLWKMAEQLGATC 297

Query: 462  -------------------XSRWAVKEKKFLVNPGWIEASNFMWRKQPEEKFPVVQAK 346
                                SRWAVKEKKFLV+P WIEA+N++W+KQPEE FPV Q K
Sbjct: 298  STETDLSVTHVVSTDAGTEKSRWAVKEKKFLVHPRWIEATNYLWQKQPEENFPVSQGK 355


>gb|EXC26161.1| RNA polymerase II C-terminal domain phosphatase-like 4 [Morus
            notabilis]
          Length = 512

 Score =  406 bits (1044), Expect = e-110
 Identities = 216/373 (57%), Positives = 261/373 (69%), Gaps = 22/373 (5%)
 Frame = -1

Query: 1398 SKKEICTHPGFFAGMCMKCGQRADDDSAVALKYIDKNLRLANDEMARLRDKDFKDLLVHR 1219
            +KK+ CTHPG F  MC+ CGQR ++++ V   YI K LRL NDE+ RLR  D K+L+ H+
Sbjct: 141  TKKDACTHPGSFGDMCILCGQRLEEETGVTFGYIHKGLRLNNDEIVRLRSTDMKNLIRHK 200

Query: 1218 RKXXXXXXXXXXXLNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMMTKLRPF 1039
             K           LNS RL D+  EE  YL SQ  S  D  + SLF L+ MHMMTKLRPF
Sbjct: 201  -KLCLVLDLDHTLLNSTRLVDLSSEEQ-YLKSQAFSPQDASEGSLFVLEAMHMMTKLRPF 258

Query: 1038 VRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQKGLDIV 859
            VR FL+E   +FE+Y+YTMG+RPYAL MA LLDP  +YF  RII++ D T K+QKGLD+V
Sbjct: 259  VRNFLKEVYNLFELYVYTMGDRPYALAMAKLLDPRREYFGDRIISRDDGTLKHQKGLDVV 318

Query: 858  LGRESAVLILDDTEAVW-KENKENLILMDRYHFFASSCKHFGFSYNSLSQLKTDESETEG 682
            LG+ESAVLILDDTE  W K +KENLILM+RYHFF SS   FG++  SLS+LK+DESETEG
Sbjct: 319  LGQESAVLILDDTENAWIKHHKENLILMERYHFFRSSTHQFGYNCKSLSELKSDESETEG 378

Query: 681  ALATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPISFPAE 502
            AL TVL VL+++H +FFDE R  D   RDVRQVLK +RKEVLKGC+IVF+RV P  F AE
Sbjct: 379  ALVTVLNVLKQVHSMFFDE-RGIDHIIRDVRQVLKTLRKEVLKGCKIVFSRVFPTEFQAE 437

Query: 501  NHHLWRMAEQLGA---------------------XSRWAVKEKKFLVNPGWIEASNFMWR 385
            NH LW+MAEQLGA                      SRWAVKE KFLV+P WIEA+N+MW+
Sbjct: 438  NHQLWKMAEQLGATCGIELDPSVTHVVSLDVGTEKSRWAVKENKFLVHPRWIEAANYMWK 497

Query: 384  KQPEEKFPVVQAK 346
            +QPE+ F V Q K
Sbjct: 498  RQPEDNFSVNQVK 510


>ref|XP_002324546.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa]
            gi|550318537|gb|EEF03111.2| hypothetical protein
            POPTR_0018s11760g [Populus trichocarpa]
          Length = 468

 Score =  406 bits (1043), Expect = e-110
 Identities = 214/370 (57%), Positives = 263/370 (71%), Gaps = 21/370 (5%)
 Frame = -1

Query: 1392 KEICTHPGFFAGMCMKCGQRADDDSAVALKYIDKNLRLANDEMARLRDKDFKDLLVHRRK 1213
            KEICTHPG F  MC+ CGQ  D +S V   YI K LRL NDE+ RLR+ D K+LL H+ K
Sbjct: 105  KEICTHPGSFGTMCIVCGQLLDGESGVTFGYIHKGLRLGNDEIVRLRNTDMKNLLRHK-K 163

Query: 1212 XXXXXXXXXXXLNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMMTKLRPFVR 1033
                       LNS +L  +  +E+ YLN Q DSL D  K SLF L  M MMTKLRPFVR
Sbjct: 164  LYLILDLDHTLLNSTQLMHMTLDEE-YLNGQTDSLQDVSKGSLFMLSSMQMMTKLRPFVR 222

Query: 1032 TFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQKGLDIVLG 853
            TFL+EAS+MFEMYIYTMG+R YALEMA LLDPG +YF++++I++ D TQ++QKGLD+VLG
Sbjct: 223  TFLKEASQMFEMYIYTMGDRAYALEMAKLLDPGREYFNAKVISRDDGTQRHQKGLDVVLG 282

Query: 852  RESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYNSLSQLKTDESETEGALA 673
            +ESAVLILDDTE  W ++K+NLILM+RYHFFASSC  FGF+  SLS+ KTDESE+EGALA
Sbjct: 283  QESAVLILDDTENAWMKHKDNLILMERYHFFASSCHQFGFNCKSLSEQKTDESESEGALA 342

Query: 672  TVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPISFPAENHH 493
            ++LKVL++IH +FF+    D + +  + QVLK VRK+VLKGC+IVF+RV P    A+NHH
Sbjct: 343  SILKVLRKIHQIFFE----DHILSLAL-QVLKTVRKDVLKGCKIVFSRVFPTQSQADNHH 397

Query: 492  LWRMAEQLGA---------------------XSRWAVKEKKFLVNPGWIEASNFMWRKQP 376
            LWRMAEQLGA                      S WA+K  KFLV PGWIEA+N+ W++QP
Sbjct: 398  LWRMAEQLGATCSTELDPSVTHVVSKDSGTEKSHWALKHNKFLVQPGWIEAANYFWQRQP 457

Query: 375  EEKFPVVQAK 346
            EE F   Q K
Sbjct: 458  EENFSFNQIK 467


>ref|XP_004287124.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Fragaria vesca subsp. vesca]
          Length = 464

 Score =  395 bits (1015), Expect = e-107
 Identities = 214/383 (55%), Positives = 267/383 (69%), Gaps = 23/383 (6%)
 Frame = -1

Query: 1425 ALTETIGSLSK-KEICTHPGFFAGMCMKCGQRADDDSAVALKYIDKNLRLANDEMARLRD 1249
            A++E I   S   ++C HPG F  MC  CGQR  + S V   YI K LRL + E+ RLR+
Sbjct: 80   AVSEEISEASGVDDLCAHPGSFGDMCFLCGQRLIEQSGVTFGYIHKGLRLNDGEIDRLRN 139

Query: 1248 KDFKDLLVHRRKXXXXXXXXXXXLNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDK 1069
             D K  L + +K           LN+  L+ +  +E+ YL    DSLPD +K+SLFRLD 
Sbjct: 140  TDIKKSL-NNKKLYLVLDLDHTLLNTTLLNHVTAKEE-YLMCPPDSLPDVLKDSLFRLDF 197

Query: 1068 MHMMTKLRPFVRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCT 889
            M MMTKLRPF+RTFL+EAS++FEMYIYTMG+R YALEMA LLDP  +YF  R+I++ D T
Sbjct: 198  MRMMTKLRPFIRTFLKEASEIFEMYIYTMGDRAYALEMAKLLDPKKEYFGDRVISRDDGT 257

Query: 888  QKYQKGLDIVLGRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYNSLSQL 709
            Q++QKGLDIVLG+ESAVLILDDTE  W ++K+NLILM+RYHFF SSC  FGF+  SLS+L
Sbjct: 258  QRHQKGLDIVLGQESAVLILDDTENAWIKHKDNLILMERYHFFRSSCAQFGFTCESLSEL 317

Query: 708  KTDESETEGALATVLKVLQRIHGLFF-DEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFT 532
            K+DESE EGALA VL +L+RIH +FF D G   +L +RDVRQVLK VRKEVL GC++VF+
Sbjct: 318  KSDESEPEGALANVLDLLKRIHKMFFYDLG--GNLVDRDVRQVLKIVRKEVLNGCKVVFS 375

Query: 531  RVIPISFPAENHHLWRMAEQLGA---------------------XSRWAVKEKKFLVNPG 415
            R+IP    A +HHLW+MAEQLGA                      SRWAVK  KFLV+P 
Sbjct: 376  RIIPSKVLASSHHLWKMAEQLGAICSTEVDSTVTHVVALDAGTEKSRWAVKHNKFLVHPR 435

Query: 414  WIEASNFMWRKQPEEKFPVVQAK 346
            W+EA+N+MW+KQ EEKFPV + K
Sbjct: 436  WLEAANYMWQKQAEEKFPVTETK 458


>ref|XP_007154892.1| hypothetical protein PHAVU_003G156800g [Phaseolus vulgaris]
            gi|561028246|gb|ESW26886.1| hypothetical protein
            PHAVU_003G156800g [Phaseolus vulgaris]
          Length = 401

 Score =  394 bits (1011), Expect = e-107
 Identities = 207/365 (56%), Positives = 262/365 (71%), Gaps = 21/365 (5%)
 Frame = -1

Query: 1395 KKEICTHPGFFAGMCMKCGQRADDDSAVALKYIDKNLRLANDEMARLRDKDFKDLLVHRR 1216
            K ++C+HPG F  MC++CGQ+ D +S V   YI K LRL +DE++RLR+ D K LL  R+
Sbjct: 38   KVDVCSHPGSFGSMCIRCGQKLDGESGVTFGYIHKGLRLHDDEISRLRNTDMKSLLC-RK 96

Query: 1215 KXXXXXXXXXXXLNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMMTKLRPFV 1036
            K           LNS  LSD+  EE   L+ Q DSL D  K SLF+LD MHMMTKLRPFV
Sbjct: 97   KLYFVLDLDHTLLNSTHLSDLSSEESSLLD-QTDSLEDVSKGSLFKLDHMHMMTKLRPFV 155

Query: 1035 RTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQKGLDIVL 856
            R+FL+EAS+MFEMYIYTMG+RPYALEMA LLDP   YF++++I++ D TQK+QKGLD+VL
Sbjct: 156  RSFLKEASEMFEMYIYTMGDRPYALEMAKLLDPRGVYFNAKVISRDDGTQKHQKGLDVVL 215

Query: 855  GRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYNSLSQLKTDESETEGAL 676
            G+ESAVLILDDTE  W ++K+NLILM+RYHFFASSC+ FGF+  SL++L+ DE ET+GAL
Sbjct: 216  GQESAVLILDDTEHAWMKHKDNLILMERYHFFASSCRQFGFNCKSLAELRNDEDETDGAL 275

Query: 675  ATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPISFPAENH 496
            A +LKVL+++H  FFD+  ++DL +RDVRQVL +VR EVL GC IVF+R+   + P+   
Sbjct: 276  AKILKVLRQVHCTFFDK-HQEDLVDRDVRQVLASVRSEVLGGCVIVFSRIFHGALPS--- 331

Query: 495  HLWRMAEQLGA---------------------XSRWAVKEKKFLVNPGWIEASNFMWRKQ 379
             L +MAEQ+GA                      SRWAVKE KFLV+P WIEA+NF W KQ
Sbjct: 332  -LRKMAEQMGATCLTEVDLSVTHVVATDAGTEKSRWAVKEHKFLVHPRWIEAANFFWEKQ 390

Query: 378  PEEKF 364
            PEE F
Sbjct: 391  PEENF 395


>ref|XP_007154891.1| hypothetical protein PHAVU_003G156800g [Phaseolus vulgaris]
            gi|561028245|gb|ESW26885.1| hypothetical protein
            PHAVU_003G156800g [Phaseolus vulgaris]
          Length = 441

 Score =  394 bits (1011), Expect = e-107
 Identities = 207/365 (56%), Positives = 262/365 (71%), Gaps = 21/365 (5%)
 Frame = -1

Query: 1395 KKEICTHPGFFAGMCMKCGQRADDDSAVALKYIDKNLRLANDEMARLRDKDFKDLLVHRR 1216
            K ++C+HPG F  MC++CGQ+ D +S V   YI K LRL +DE++RLR+ D K LL  R+
Sbjct: 78   KVDVCSHPGSFGSMCIRCGQKLDGESGVTFGYIHKGLRLHDDEISRLRNTDMKSLLC-RK 136

Query: 1215 KXXXXXXXXXXXLNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMHMMTKLRPFV 1036
            K           LNS  LSD+  EE   L+ Q DSL D  K SLF+LD MHMMTKLRPFV
Sbjct: 137  KLYFVLDLDHTLLNSTHLSDLSSEESSLLD-QTDSLEDVSKGSLFKLDHMHMMTKLRPFV 195

Query: 1035 RTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQKYQKGLDIVL 856
            R+FL+EAS+MFEMYIYTMG+RPYALEMA LLDP   YF++++I++ D TQK+QKGLD+VL
Sbjct: 196  RSFLKEASEMFEMYIYTMGDRPYALEMAKLLDPRGVYFNAKVISRDDGTQKHQKGLDVVL 255

Query: 855  GRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYNSLSQLKTDESETEGAL 676
            G+ESAVLILDDTE  W ++K+NLILM+RYHFFASSC+ FGF+  SL++L+ DE ET+GAL
Sbjct: 256  GQESAVLILDDTEHAWMKHKDNLILMERYHFFASSCRQFGFNCKSLAELRNDEDETDGAL 315

Query: 675  ATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIVFTRVIPISFPAENH 496
            A +LKVL+++H  FFD+  ++DL +RDVRQVL +VR EVL GC IVF+R+   + P+   
Sbjct: 316  AKILKVLRQVHCTFFDK-HQEDLVDRDVRQVLASVRSEVLGGCVIVFSRIFHGALPS--- 371

Query: 495  HLWRMAEQLGA---------------------XSRWAVKEKKFLVNPGWIEASNFMWRKQ 379
             L +MAEQ+GA                      SRWAVKE KFLV+P WIEA+NF W KQ
Sbjct: 372  -LRKMAEQMGATCLTEVDLSVTHVVATDAGTEKSRWAVKEHKFLVHPRWIEAANFFWEKQ 430

Query: 378  PEEKF 364
            PEE F
Sbjct: 431  PEENF 435


>ref|XP_007149092.1| hypothetical protein PHAVU_005G040600g [Phaseolus vulgaris]
            gi|593697222|ref|XP_007149093.1| hypothetical protein
            PHAVU_005G040600g [Phaseolus vulgaris]
            gi|561022356|gb|ESW21086.1| hypothetical protein
            PHAVU_005G040600g [Phaseolus vulgaris]
            gi|561022357|gb|ESW21087.1| hypothetical protein
            PHAVU_005G040600g [Phaseolus vulgaris]
          Length = 443

 Score =  392 bits (1006), Expect = e-106
 Identities = 207/382 (54%), Positives = 271/382 (70%), Gaps = 22/382 (5%)
 Frame = -1

Query: 1422 LTETIGSLSKKEICTHPGFFAGMCMKCGQRADDDSAVALKYIDKNLRLANDEMARLRDKD 1243
            L + + +  + ++CTHPG F  MC++CGQ+ D  S V   YI K LRL ++E++RLR+ D
Sbjct: 69   LKQNLETSVEVDVCTHPGSFGSMCIRCGQKLDGKSGVTFGYIHKGLRLHDEEISRLRNTD 128

Query: 1242 FKDLLVHRRKXXXXXXXXXXXLNSARLSDIKDEEDIYLNSQRDSLPDNIKNSLFRLDKMH 1063
             K LL  R+K           LNS  L+ +  EE   LN Q DSL D  K SLF+L+ MH
Sbjct: 129  MKSLLC-RKKLYLVLDLDHTLLNSTLLAHLSSEESHLLN-QTDSLQDVSKGSLFKLEHMH 186

Query: 1062 MMTKLRPFVRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQGDCTQK 883
            MMTKLRPFVR+FL+EA++MFEMYIYTMG+RPYALEMA LLDP  +YF++R+I++ D TQK
Sbjct: 187  MMTKLRPFVRSFLKEATEMFEMYIYTMGDRPYALEMAKLLDPQGEYFNARVISRDDGTQK 246

Query: 882  YQKGLDIVLGRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYNSLSQLKT 703
            +QKGLD+VLG+ESAVLILDDTE  W ++K+NLILM+RYHFFASSC+ FGF+  S ++L+ 
Sbjct: 247  HQKGLDVVLGQESAVLILDDTEHAWMKHKDNLILMERYHFFASSCRQFGFNCKSPAELRN 306

Query: 702  DESETEGALATVLKVLQRIHGLFFDEGRK-DDLENRDVRQVLKAVRKEVLKGCRIVFTRV 526
            DE ET+GALA +LKVL+++H  FFD+ ++ DDL NRDVRQVL +VR EVL GC IVF+R+
Sbjct: 307  DEDETDGALAKILKVLKQVHCTFFDKHQEDDDLVNRDVRQVLSSVRSEVLSGCVIVFSRI 366

Query: 525  IPISFPAENHHLWRMAEQLGA---------------------XSRWAVKEKKFLVNPGWI 409
               + P+    L +MAEQ+GA                      SRWA+KEKKFLV+P WI
Sbjct: 367  FHGALPS----LQKMAEQMGATCLAEVDPSVTHIVATDAGTEKSRWALKEKKFLVHPRWI 422

Query: 408  EASNFMWRKQPEEKFPVVQAKQ 343
            EA+N+ W KQPEE F +++ KQ
Sbjct: 423  EAANYFWEKQPEENF-IIKKKQ 443


>ref|XP_006838087.1| hypothetical protein AMTR_s00106p00017820 [Amborella trichopoda]
            gi|548840545|gb|ERN00656.1| hypothetical protein
            AMTR_s00106p00017820 [Amborella trichopoda]
          Length = 486

 Score =  389 bits (1000), Expect = e-105
 Identities = 195/381 (51%), Positives = 263/381 (69%), Gaps = 32/381 (8%)
 Frame = -1

Query: 1404 SLSKKEICTHPGFFAGMCMKCGQRADDDS------AVALKYIDKNLRLANDEMARLRDKD 1243
            S S+K    HPGF+  MC++CG++ DD++      AVA  YI K+L+L  +E+ARLR  D
Sbjct: 99   STSEKVCPPHPGFYKDMCIRCGEQKDDETVARKETAVAFNYIHKDLKLGAEEVARLRATD 158

Query: 1242 FKDLLVHRRKXXXXXXXXXXXLNSARLSDIKDEEDIYLNS-----QRDSLPDNIKNSLFR 1078
             K+L   RRK           LNS RL D+  EE+ YLN+     +  S   +   +LF+
Sbjct: 159  LKNLY-RRRKLYLVLDLDHTLLNSTRLVDVSPEEEAYLNATYLNKETSSSNGDTSGTLFK 217

Query: 1077 LDKMHMMTKLRPFVRTFLEEASKMFEMYIYTMGERPYALEMADLLDPGNKYFHSRIIAQG 898
            L+ +HM+TKLRPFVRTFL+EA+ MFEMY+YTMGER YALEMA LLDP   YF SR+I+QG
Sbjct: 218  LEPLHMLTKLRPFVRTFLKEANTMFEMYVYTMGERAYALEMAKLLDPSGVYFGSRVISQG 277

Query: 897  DCTQKYQKGLDIVLGRESAVLILDDTEAVWKENKENLILMDRYHFFASSCKHFGFSYNSL 718
            D T ++QKGLD+VLG E AV+ILDDTE VW ++KENL+LM+RYHFF+SSC+ F   Y SL
Sbjct: 278  DSTVRHQKGLDVVLGSECAVVILDDTEHVWHKHKENLVLMERYHFFSSSCRQFNVHYKSL 337

Query: 717  SQLKTDESETEGALATVLKVLQRIHGLFFDEGRKDDLENRDVRQVLKAVRKEVLKGCRIV 538
            S+LK DESE++G LA++L VL+ IH +F+ +  + D    DVR+VLK ++ EVLKGCR+V
Sbjct: 338  SELKRDESESDGMLASILNVLKHIHQMFYYQEVETDFNGSDVRKVLKTIQSEVLKGCRLV 397

Query: 537  FTRVIPISFPAENHHLWRMAEQLGA---------------------XSRWAVKEKKFLVN 421
            F+R+ P ++P EN  LWR+AEQLGA                      +RWA++ KK LVN
Sbjct: 398  FSRIFPTNYPVENQTLWRIAEQLGASCSKELDEAVTHVVSLDLGTEKARWAIQRKKHLVN 457

Query: 420  PGWIEASNFMWRKQPEEKFPV 358
            PGW+EA+N+ W++QPE++FP+
Sbjct: 458  PGWLEATNYFWKRQPEDQFPI 478

