BLASTX nr result

ID: Sinomenium21_contig00018669 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00018669
         (1423 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007014445.1| RNA polymerase II ctd phosphatase, putative ...   484   e-134
ref|XP_006838087.1| hypothetical protein AMTR_s00106p00017820 [A...   479   e-132
ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Popu...   471   e-130
ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal doma...   466   e-128
ref|XP_007204345.1| hypothetical protein PRUPE_ppa005647mg [Prun...   465   e-128
ref|XP_007014446.1| RNA polymerase II ctd phosphatase, putative ...   463   e-128
ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative ...   461   e-127
gb|EYU29592.1| hypothetical protein MIMGU_mgv1a017809mg [Mimulus...   456   e-126
ref|XP_002324546.2| hypothetical protein POPTR_0018s11760g [Popu...   455   e-125
ref|XP_004962192.1| PREDICTED: RNA polymerase II C-terminal doma...   450   e-124
ref|XP_002437361.1| hypothetical protein SORBIDRAFT_10g025580 [S...   450   e-124
ref|XP_004242582.1| PREDICTED: RNA polymerase II C-terminal doma...   449   e-123
ref|XP_004242583.1| PREDICTED: RNA polymerase II C-terminal doma...   449   e-123
ref|XP_006343662.1| PREDICTED: RNA polymerase II C-terminal doma...   447   e-123
gb|EYU37264.1| hypothetical protein MIMGU_mgv1a005925mg [Mimulus...   446   e-123
ref|XP_006486243.1| PREDICTED: RNA polymerase II C-terminal doma...   446   e-122
gb|EXC26161.1| RNA polymerase II C-terminal domain phosphatase-l...   444   e-122
ref|XP_007154891.1| hypothetical protein PHAVU_003G156800g [Phas...   441   e-121
ref|XP_007149092.1| hypothetical protein PHAVU_005G040600g [Phas...   441   e-121
gb|AFW77884.1| CPL3 [Zea mays]                                        435   e-119

>ref|XP_007014445.1| RNA polymerase II ctd phosphatase, putative isoform 1 [Theobroma
            cacao] gi|508784808|gb|EOY32064.1| RNA polymerase II ctd
            phosphatase, putative isoform 1 [Theobroma cacao]
          Length = 469

 Score =  484 bits (1247), Expect = e-134
 Identities = 242/408 (59%), Positives = 308/408 (75%), Gaps = 4/408 (0%)
 Frame = +3

Query: 66   RIKRHKIDELDDTGESQGATSLSAMQREPDEVANV----ETCPHPAFFREMCVRCGQYMN 233
            R KR K ++L+D  ES+G+TS   ++ +    A +    + C HP  F +MC+ CGQ ++
Sbjct: 63   RNKRCKTEKLEDLEESRGSTSQGLIEDKIVIHAELSLKKDICTHPGSFGQMCILCGQRLD 122

Query: 234  DDSAVAFGYIHKDLKIGTEEMDRLRGSNLNTVLRNRKXXXXXXXXXXXXNSTRVIDVSPE 413
            D+S V FGYIHK L++G +E+ RLR +++  +LR++K            NST+++ ++P+
Sbjct: 123  DESGVTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHKKLYLVLDLDHTLLNSTQLMHLTPD 182

Query: 414  EEYLINHTDSLQDIPNGGLFKVDTIHMLTKLRPFVRTFLKEASSMYEMYIYTMGERSYAL 593
            EEYL   +DSLQD+  G LF +D +HM+TKLRPFVRTFLKEAS M+EMYIYTMG+R YAL
Sbjct: 183  EEYLKGQSDSLQDVSRGSLFMLDFMHMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYAL 242

Query: 594  EMAQLLDPERVYFNSRVISQADSTQKHQKGLDVVLGVDSAVVILDDTEIVWNRHRENLIL 773
            EMA+LLDP R YF+ RVIS+ D TQKHQKGLDVVLG +SAVVILDDTE  W +H++NLIL
Sbjct: 243  EMAKLLDPRREYFSDRVISRDDGTQKHQKGLDVVLGQESAVVILDDTENAWMKHKDNLIL 302

Query: 774  MERYHFFASSSRPFGLSGKSLSELKRDESEKDGALATVLTVLKHVHEMFFNLELDANLTS 953
            MERYH+FASS   FG   KSLS+LK DESE DGALA+VL  L+ +H MFF+ ELD NL S
Sbjct: 303  MERYHYFASSCHQFGYKCKSLSQLKSDESEPDGALASVLKALRQIHHMFFD-ELDCNLAS 361

Query: 954  GDVRKVLKTVRSEVLKGCKIVFSRVWKVGERAENQKLWEIAQLLGAECSTELNQSVTHVV 1133
             DVR+VLKTV+ EVLKGCKIVFS V+     AE+  LW++A+ LGA CSTE + SVTHVV
Sbjct: 362  RDVRQVLKTVQEEVLKGCKIVFSHVFPTNFPAESHPLWKMAEQLGATCSTETDLSVTHVV 421

Query: 1134 STDTGTEKARWAVQQKKFLVHPRWIEAANYLWKRQPEEQFSINHIKNQ 1277
            STD GTEK+RWAV++KKFLVHPRWIEA NYLW++QPEE F ++  KNQ
Sbjct: 422  STDAGTEKSRWAVKEKKFLVHPRWIEATNYLWQKQPEENFPVSQGKNQ 469


>ref|XP_006838087.1| hypothetical protein AMTR_s00106p00017820 [Amborella trichopoda]
            gi|548840545|gb|ERN00656.1| hypothetical protein
            AMTR_s00106p00017820 [Amborella trichopoda]
          Length = 486

 Score =  479 bits (1234), Expect = e-132
 Identities = 241/411 (58%), Positives = 303/411 (73%), Gaps = 13/411 (3%)
 Frame = +3

Query: 66   RIKRHKIDELDDTGESQGATSLSAMQREPDEVANVETCP-HPAFFREMCVRCGQYMNDDS 242
            RIKR KI E ++  ESQ + +         E  + + CP HP F+++MC+RCG+  +D++
Sbjct: 68   RIKRPKICEDEEIKESQSSNANQGELDNFKESTSEKVCPPHPGFYKDMCIRCGEQKDDET 127

Query: 243  ------AVAFGYIHKDLKIGTEEMDRLRGSNLNTVLRNRKXXXXXXXXXXXXNSTRVIDV 404
                  AVAF YIHKDLK+G EE+ RLR ++L  + R RK            NSTR++DV
Sbjct: 128  VARKETAVAFNYIHKDLKLGAEEVARLRATDLKNLYRRRKLYLVLDLDHTLLNSTRLVDV 187

Query: 405  SPEEE------YLINHTDSLQDIPNGGLFKVDTIHMLTKLRPFVRTFLKEASSMYEMYIY 566
            SPEEE      YL   T S     +G LFK++ +HMLTKLRPFVRTFLKEA++M+EMY+Y
Sbjct: 188  SPEEEAYLNATYLNKETSSSNGDTSGTLFKLEPLHMLTKLRPFVRTFLKEANTMFEMYVY 247

Query: 567  TMGERSYALEMAQLLDPERVYFNSRVISQADSTQKHQKGLDVVLGVDSAVVILDDTEIVW 746
            TMGER+YALEMA+LLDP  VYF SRVISQ DST +HQKGLDVVLG + AVVILDDTE VW
Sbjct: 248  TMGERAYALEMAKLLDPSGVYFGSRVISQGDSTVRHQKGLDVVLGSECAVVILDDTEHVW 307

Query: 747  NRHRENLILMERYHFFASSSRPFGLSGKSLSELKRDESEKDGALATVLTVLKHVHEMFFN 926
            ++H+ENL+LMERYHFF+SS R F +  KSLSELKRDESE DG LA++L VLKH+H+MF+ 
Sbjct: 308  HKHKENLVLMERYHFFSSSCRQFNVHYKSLSELKRDESESDGMLASILNVLKHIHQMFYY 367

Query: 927  LELDANLTSGDVRKVLKTVRSEVLKGCKIVFSRVWKVGERAENQKLWEIAQLLGAECSTE 1106
             E++ +    DVRKVLKT++SEVLKGC++VFSR++      ENQ LW IA+ LGA CS E
Sbjct: 368  QEVETDFNGSDVRKVLKTIQSEVLKGCRLVFSRIFPTNYPVENQTLWRIAEQLGASCSKE 427

Query: 1107 LNQSVTHVVSTDTGTEKARWAVQQKKFLVHPRWIEAANYLWKRQPEEQFSI 1259
            L+++VTHVVS D GTEKARWA+Q+KK LV+P W+EA NY WKRQPE+QF I
Sbjct: 428  LDEAVTHVVSLDLGTEKARWAIQRKKHLVNPGWLEATNYFWKRQPEDQFPI 478


>ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa]
            gi|550318538|gb|EEF03112.2| hypothetical protein
            POPTR_0018s11760g [Populus trichocarpa]
          Length = 472

 Score =  471 bits (1213), Expect = e-130
 Identities = 228/403 (56%), Positives = 301/403 (74%)
 Frame = +3

Query: 66   RIKRHKIDELDDTGESQGATSLSAMQREPDEVANVETCPHPAFFREMCVRCGQYMNDDSA 245
            R+KR K++ ++   +  G TS ++++   +   + E C HP  F  MC+ CGQ ++ +S 
Sbjct: 71   RVKRSKVETVEIVEDDGGTTSFASLKHNSEASISKEICTHPGSFGTMCIVCGQLLDGESG 130

Query: 246  VAFGYIHKDLKIGTEEMDRLRGSNLNTVLRNRKXXXXXXXXXXXXNSTRVIDVSPEEEYL 425
            V FGYIHK L++G +E+ RLR +++  +LR++K            NST+++ ++ +EEYL
Sbjct: 131  VTFGYIHKGLRLGNDEIVRLRNTDMKNLLRHKKLYLILDLDHTLLNSTQLMHMTLDEEYL 190

Query: 426  INHTDSLQDIPNGGLFKVDTIHMLTKLRPFVRTFLKEASSMYEMYIYTMGERSYALEMAQ 605
               TDSLQD+  G LF + ++ M+TKLRPFVRTFLKEAS M+EMYIYTMG+R+YALEMA+
Sbjct: 191  NGQTDSLQDVSKGSLFMLSSMQMMTKLRPFVRTFLKEASQMFEMYIYTMGDRAYALEMAK 250

Query: 606  LLDPERVYFNSRVISQADSTQKHQKGLDVVLGVDSAVVILDDTEIVWNRHRENLILMERY 785
            LLDP R YFN++VIS+ D TQ+HQKGLDVVLG +SAV+ILDDTE  W +H++NLILMERY
Sbjct: 251  LLDPGREYFNAKVISRDDGTQRHQKGLDVVLGQESAVLILDDTENAWMKHKDNLILMERY 310

Query: 786  HFFASSSRPFGLSGKSLSELKRDESEKDGALATVLTVLKHVHEMFFNLELDANLTSGDVR 965
            HFFASS   FG + KSLSE K DESE +GALA++L VL+ +H++FF  EL+ N+   DVR
Sbjct: 311  HFFASSCHQFGFNCKSLSEQKTDESESEGALASILKVLRKIHQIFFE-ELEENMDGRDVR 369

Query: 966  KVLKTVRSEVLKGCKIVFSRVWKVGERAENQKLWEIAQLLGAECSTELNQSVTHVVSTDT 1145
            +VLKTVR +VLKGCKIVFSRV+    +A+N  LW +A+ LGA CSTEL+ SVTHVVS D+
Sbjct: 370  QVLKTVRKDVLKGCKIVFSRVFPTQSQADNHHLWRMAEQLGATCSTELDPSVTHVVSKDS 429

Query: 1146 GTEKARWAVQQKKFLVHPRWIEAANYLWKRQPEEQFSINHIKN 1274
            GTEK+ WA++  KFLV P WIEAANY W+RQPEE FS N IKN
Sbjct: 430  GTEKSHWALKHNKFLVQPGWIEAANYFWQRQPEENFSFNQIKN 472


>ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Cucumis sativus]
          Length = 452

 Score =  466 bits (1198), Expect = e-128
 Identities = 228/403 (56%), Positives = 302/403 (74%), Gaps = 1/403 (0%)
 Frame = +3

Query: 66   RIKRHKIDELDDTGESQGATSLSAMQREPDEV-ANVETCPHPAFFREMCVRCGQYMNDDS 242
            RIKR K+++L+++ E      +  ++ +  EV +  + C HP  F  MC+ CGQ ++++S
Sbjct: 48   RIKRRKVEKLENSEED----IMHEVEEQSLEVLSKQQLCSHPGSFGNMCIICGQRLDEES 103

Query: 243  AVAFGYIHKDLKIGTEEMDRLRGSNLNTVLRNRKXXXXXXXXXXXXNSTRVIDVSPEEEY 422
             V FGYIHK+L++  +E++R+R   +  +L+ +K            NST +  ++ EEEY
Sbjct: 104  GVTFGYIHKELRLNNDEINRMRNKEMKELLQRKKLILVLDLDHTLLNSTELRYLTVEEEY 163

Query: 423  LINHTDSLQDIPNGGLFKVDTIHMLTKLRPFVRTFLKEASSMYEMYIYTMGERSYALEMA 602
            L + TDSL D+  G LF ++++H +TKLRPFV +FLKEAS ++EMYIYTMGER YA EMA
Sbjct: 164  LRSQTDSLDDVTKGSLFLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMA 223

Query: 603  QLLDPERVYFNSRVISQADSTQKHQKGLDVVLGVDSAVVILDDTEIVWNRHRENLILMER 782
            +LLDP++ YF+S+VIS+ D TQKHQKGLDVVLG +SAV+ILDDTE  W +H+ENLILMER
Sbjct: 224  KLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMER 283

Query: 783  YHFFASSSRPFGLSGKSLSELKRDESEKDGALATVLTVLKHVHEMFFNLELDANLTSGDV 962
            YHFFASS R FG + KSLSELK DESE DGAL T+L VLK VH MFFN E+  +L   DV
Sbjct: 284  YHFFASSCRQFGFNCKSLSELKNDESETDGALTTILKVLKQVHHMFFN-EVSGDLVDRDV 342

Query: 963  RKVLKTVRSEVLKGCKIVFSRVWKVGERAENQKLWEIAQLLGAECSTELNQSVTHVVSTD 1142
            R+VLKTVR+EVL+GCK+VFSRV+    +AEN +LW++ + LG  CSTEL+QSVTHVV+TD
Sbjct: 343  RQVLKTVRAEVLEGCKVVFSRVFPTKFQAENHQLWKMVEQLGGTCSTELDQSVTHVVATD 402

Query: 1143 TGTEKARWAVQQKKFLVHPRWIEAANYLWKRQPEEQFSINHIK 1271
             GTEK+RWA+++KKFLVHPRWIEA+NY WKRQ EE F++   K
Sbjct: 403  AGTEKSRWALKEKKFLVHPRWIEASNYFWKRQMEENFTVEQTK 445


>ref|XP_007204345.1| hypothetical protein PRUPE_ppa005647mg [Prunus persica]
            gi|462399876|gb|EMJ05544.1| hypothetical protein
            PRUPE_ppa005647mg [Prunus persica]
          Length = 449

 Score =  465 bits (1196), Expect = e-128
 Identities = 229/400 (57%), Positives = 293/400 (73%)
 Frame = +3

Query: 72   KRHKIDELDDTGESQGATSLSAMQREPDEVANVETCPHPAFFREMCVRCGQYMNDDSAVA 251
            KR K++ L    E+QG+TS   ++   +     + C HP   +++C+ CGQ +++ S V 
Sbjct: 50   KRRKVENLGSIDETQGSTSQIFVEENSEASPKKDICTHPGSVKDLCIVCGQRVDEKSGVP 109

Query: 252  FGYIHKDLKIGTEEMDRLRGSNLNTVLRNRKXXXXXXXXXXXXNSTRVIDVSPEEEYLIN 431
             GYIHKD  +  +E+DR+R +++   L  +K            NST +  ++ EEEYL +
Sbjct: 110  LGYIHKDFWLNNDEIDRVRSTDIKKSLHLKKLYLVLDLDHTLLNSTHLNHMTAEEEYLHS 169

Query: 432  HTDSLQDIPNGGLFKVDTIHMLTKLRPFVRTFLKEASSMYEMYIYTMGERSYALEMAQLL 611
             TDSLQD+ +G LF+VD +HM+TKLRPFVR FLKEAS M+EMYIYTMGER+YALEMA+LL
Sbjct: 170  QTDSLQDVSDGSLFRVDVMHMMTKLRPFVRKFLKEASEMFEMYIYTMGERAYALEMAKLL 229

Query: 612  DPERVYFNSRVISQADSTQKHQKGLDVVLGVDSAVVILDDTEIVWNRHRENLILMERYHF 791
            DP + YF  RVIS+ D TQKHQKGLDVVLG +SA +ILDDTE  W +H++NLILMERYHF
Sbjct: 230  DPRKEYFGDRVISRDDGTQKHQKGLDVVLGHESAALILDDTENAWTKHKDNLILMERYHF 289

Query: 792  FASSSRPFGLSGKSLSELKRDESEKDGALATVLTVLKHVHEMFFNLELDANLTSGDVRKV 971
            F SS   FG   KSLSELK DESE +GALATVL VLK +H MFF  E   NL   DVR+V
Sbjct: 290  FRSSCHQFGFHCKSLSELKSDESEPEGALATVLEVLKRIHNMFF-YESKDNLIDRDVRQV 348

Query: 972  LKTVRSEVLKGCKIVFSRVWKVGERAENQKLWEIAQLLGAECSTELNQSVTHVVSTDTGT 1151
            LKT+R E+LKGCKIVFSRV+    +AEN +LW++A+ LGA CSTEL+ SVTHVVSTD GT
Sbjct: 349  LKTLRKEILKGCKIVFSRVFPSKFQAENHQLWKMAEQLGATCSTELDLSVTHVVSTDAGT 408

Query: 1152 EKARWAVQQKKFLVHPRWIEAANYLWKRQPEEQFSINHIK 1271
            EK+RWAV++KKFLVHP+WIEA+NY+W +Q E++F +N  K
Sbjct: 409  EKSRWAVKEKKFLVHPQWIEASNYMWLKQAEDKFPVNQTK 448


>ref|XP_007014446.1| RNA polymerase II ctd phosphatase, putative isoform 2 [Theobroma
            cacao] gi|508784809|gb|EOY32065.1| RNA polymerase II ctd
            phosphatase, putative isoform 2 [Theobroma cacao]
          Length = 357

 Score =  463 bits (1191), Expect = e-128
 Identities = 226/358 (63%), Positives = 281/358 (78%)
 Frame = +3

Query: 204  MCVRCGQYMNDDSAVAFGYIHKDLKIGTEEMDRLRGSNLNTVLRNRKXXXXXXXXXXXXN 383
            MC+ CGQ ++D+S V FGYIHK L++G +E+ RLR +++  +LR++K            N
Sbjct: 1    MCILCGQRLDDESGVTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHKKLYLVLDLDHTLLN 60

Query: 384  STRVIDVSPEEEYLINHTDSLQDIPNGGLFKVDTIHMLTKLRPFVRTFLKEASSMYEMYI 563
            ST+++ ++P+EEYL   +DSLQD+  G LF +D +HM+TKLRPFVRTFLKEAS M+EMYI
Sbjct: 61   STQLMHLTPDEEYLKGQSDSLQDVSRGSLFMLDFMHMMTKLRPFVRTFLKEASEMFEMYI 120

Query: 564  YTMGERSYALEMAQLLDPERVYFNSRVISQADSTQKHQKGLDVVLGVDSAVVILDDTEIV 743
            YTMG+R YALEMA+LLDP R YF+ RVIS+ D TQKHQKGLDVVLG +SAVVILDDTE  
Sbjct: 121  YTMGDRPYALEMAKLLDPRREYFSDRVISRDDGTQKHQKGLDVVLGQESAVVILDDTENA 180

Query: 744  WNRHRENLILMERYHFFASSSRPFGLSGKSLSELKRDESEKDGALATVLTVLKHVHEMFF 923
            W +H++NLILMERYH+FASS   FG   KSLS+LK DESE DGALA+VL  L+ +H MFF
Sbjct: 181  WMKHKDNLILMERYHYFASSCHQFGYKCKSLSQLKSDESEPDGALASVLKALRQIHHMFF 240

Query: 924  NLELDANLTSGDVRKVLKTVRSEVLKGCKIVFSRVWKVGERAENQKLWEIAQLLGAECST 1103
            + ELD NL S DVR+VLKTV+ EVLKGCKIVFS V+     AE+  LW++A+ LGA CST
Sbjct: 241  D-ELDCNLASRDVRQVLKTVQEEVLKGCKIVFSHVFPTNFPAESHPLWKMAEQLGATCST 299

Query: 1104 ELNQSVTHVVSTDTGTEKARWAVQQKKFLVHPRWIEAANYLWKRQPEEQFSINHIKNQ 1277
            E + SVTHVVSTD GTEK+RWAV++KKFLVHPRWIEA NYLW++QPEE F ++  KNQ
Sbjct: 300  ETDLSVTHVVSTDAGTEKSRWAVKEKKFLVHPRWIEATNYLWQKQPEENFPVSQGKNQ 357


>ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
            gi|223534449|gb|EEF36151.1| RNA polymerase II ctd
            phosphatase, putative [Ricinus communis]
          Length = 478

 Score =  461 bits (1185), Expect = e-127
 Identities = 225/402 (55%), Positives = 300/402 (74%)
 Frame = +3

Query: 66   RIKRHKIDELDDTGESQGATSLSAMQREPDEVANVETCPHPAFFREMCVRCGQYMNDDSA 245
            RIKR +++ L++    + +T +S  Q      + V  C HP  F +MC+ CG+ + +++ 
Sbjct: 75   RIKRSRVETLENGENPKESTRVSLDQTLVASSSKV-ACTHPGSFGDMCILCGERLIEETG 133

Query: 246  VAFGYIHKDLKIGTEEMDRLRGSNLNTVLRNRKXXXXXXXXXXXXNSTRVIDVSPEEEYL 425
            V FGYIHK L++  +E+ RLR +++  +LR+RK            NST+++ ++ EEEYL
Sbjct: 134  VTFGYIHKGLRLANDEIVRLRNTDMKNLLRHRKLYLVLDLDHTLLNSTQLMHLTAEEEYL 193

Query: 426  INHTDSLQDIPNGGLFKVDTIHMLTKLRPFVRTFLKEASSMYEMYIYTMGERSYALEMAQ 605
             +  DS+QD+ NG LF VD +HM+TKLRPF+RTFLKEAS M+EMYIYTMG+R+YALEMA+
Sbjct: 194  KSQIDSMQDVSNGSLFMVDFMHMMTKLRPFIRTFLKEASQMFEMYIYTMGDRAYALEMAK 253

Query: 606  LLDPERVYFNSRVISQADSTQKHQKGLDVVLGVDSAVVILDDTEIVWNRHRENLILMERY 785
             LDP R YFN+RVIS+ D TQ+HQKGLD+VLG +SAV+ILDDTE  W +H++NLILMERY
Sbjct: 254  FLDPGREYFNARVISRDDGTQRHQKGLDIVLGQESAVLILDDTENAWTKHKDNLILMERY 313

Query: 786  HFFASSSRPFGLSGKSLSELKRDESEKDGALATVLTVLKHVHEMFFNLELDANLTSGDVR 965
            HFFASS R FG   KSLS+LK DE+E DGALA+VL VL+ +H +FF+ EL+  +   DVR
Sbjct: 314  HFFASSCRQFGFECKSLSQLKSDENESDGALASVLKVLRRIHHIFFD-ELEDAIDGRDVR 372

Query: 966  KVLKTVRSEVLKGCKIVFSRVWKVGERAENQKLWEIAQLLGAECSTELNQSVTHVVSTDT 1145
            +VL TVR +VLKGCKIVFSRV+    +A+N  LW++A+ LGA CS E++ SVTHVVS + 
Sbjct: 373  QVLSTVRKDVLKGCKIVFSRVFPTQFQADNHHLWKMAEQLGATCSREVDPSVTHVVSAEA 432

Query: 1146 GTEKARWAVQQKKFLVHPRWIEAANYLWKRQPEEQFSINHIK 1271
            GTEK+RWA++  KFLVHPRWIEA NY+W+RQPEE FS+N  K
Sbjct: 433  GTEKSRWALKNDKFLVHPRWIEATNYMWQRQPEENFSVNQPK 474


>gb|EYU29592.1| hypothetical protein MIMGU_mgv1a017809mg [Mimulus guttatus]
          Length = 466

 Score =  456 bits (1174), Expect = e-126
 Identities = 223/404 (55%), Positives = 296/404 (73%), Gaps = 5/404 (1%)
 Frame = +3

Query: 66   RIKRHKIDELDDTG----ESQGATSLS-AMQREPDEVANVETCPHPAFFREMCVRCGQYM 230
            R+KR KI+  +D       SQ ++S+  ++Q          TC HP  +  MC+RCGQ M
Sbjct: 59   RVKRRKIELSEDVNFDVINSQSSSSVGESVQLLSGSSPKKNTCLHPGVYAGMCMRCGQKM 118

Query: 231  NDDSAVAFGYIHKDLKIGTEEMDRLRGSNLNTVLRNRKXXXXXXXXXXXXNSTRVIDVSP 410
            +D+S VAFGYIHK+L++  +EMDRLR  +L  +LR+RK            NS R+ D++ 
Sbjct: 119  DDESGVAFGYIHKNLRLANDEMDRLRDRDLKNMLRHRKLCLVLDLDHTLLNSARLHDITE 178

Query: 411  EEEYLINHTDSLQDIPNGGLFKVDTIHMLTKLRPFVRTFLKEASSMYEMYIYTMGERSYA 590
            EE YL    D+L D     LF++D I+M+TKLRPFV TFLKEAS ++EMYIYTMGER YA
Sbjct: 179  EEGYLNGQRDALPDTLKSSLFRLDWIYMMTKLRPFVHTFLKEASKLFEMYIYTMGERPYA 238

Query: 591  LEMAQLLDPERVYFNSRVISQADSTQKHQKGLDVVLGVDSAVVILDDTEIVWNRHRENLI 770
            LEMA+LLDP  +YFNSR+I+Q D T KHQKGLDVVLG +SAVVILDDTE+VW++H++NLI
Sbjct: 239  LEMAKLLDPGDIYFNSRIIAQGDCTHKHQKGLDVVLGQESAVVILDDTEVVWSKHKDNLI 298

Query: 771  LMERYHFFASSSRPFGLSGKSLSELKRDESEKDGALATVLTVLKHVHEMFFNLELDANLT 950
            LMERYHFFASS + FG + KSLSEL+ DES+ +GAL TVL  L+ +H +FF++E   +L 
Sbjct: 299  LMERYHFFASSCKQFGFNCKSLSELRSDESDTEGALPTVLKRLQQIHSLFFDVERKDSLE 358

Query: 951  SGDVRKVLKTVRSEVLKGCKIVFSRVWKVGERAENQKLWEIAQLLGAECSTELNQSVTHV 1130
              DVR V+KT+R EVLKGCK+VF+RV+     AE+  LW++A+ LGA C  E++  +THV
Sbjct: 359  DRDVRLVMKTLRKEVLKGCKVVFTRVFPTNFPAEHHSLWKMAEKLGATCCNEIDPCITHV 418

Query: 1131 VSTDTGTEKARWAVQQKKFLVHPRWIEAANYLWKRQPEEQFSIN 1262
            VS D GT+K+RWA+++KKFLVHPRWIEA+NY+W++QPEE F ++
Sbjct: 419  VSMDAGTDKSRWALKEKKFLVHPRWIEASNYMWQKQPEENFPVS 462


>ref|XP_002324546.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa]
            gi|550318537|gb|EEF03111.2| hypothetical protein
            POPTR_0018s11760g [Populus trichocarpa]
          Length = 468

 Score =  455 bits (1170), Expect = e-125
 Identities = 224/403 (55%), Positives = 296/403 (73%)
 Frame = +3

Query: 66   RIKRHKIDELDDTGESQGATSLSAMQREPDEVANVETCPHPAFFREMCVRCGQYMNDDSA 245
            R+KR K++ ++   +  G TS ++++   +   + E C HP  F  MC+ CGQ ++ +S 
Sbjct: 71   RVKRSKVETVEIVEDDGGTTSFASLKHNSEASISKEICTHPGSFGTMCIVCGQLLDGESG 130

Query: 246  VAFGYIHKDLKIGTEEMDRLRGSNLNTVLRNRKXXXXXXXXXXXXNSTRVIDVSPEEEYL 425
            V FGYIHK L++G +E+ RLR +++  +LR++K            NST+++ ++ +EEYL
Sbjct: 131  VTFGYIHKGLRLGNDEIVRLRNTDMKNLLRHKKLYLILDLDHTLLNSTQLMHMTLDEEYL 190

Query: 426  INHTDSLQDIPNGGLFKVDTIHMLTKLRPFVRTFLKEASSMYEMYIYTMGERSYALEMAQ 605
               TDSLQD+  G LF + ++ M+TKLRPFVRTFLKEAS M+EMYIYTMG+R+YALEMA+
Sbjct: 191  NGQTDSLQDVSKGSLFMLSSMQMMTKLRPFVRTFLKEASQMFEMYIYTMGDRAYALEMAK 250

Query: 606  LLDPERVYFNSRVISQADSTQKHQKGLDVVLGVDSAVVILDDTEIVWNRHRENLILMERY 785
            LLDP R YFN++VIS+ D TQ+HQKGLDVVLG +SAV+ILDDTE  W +H++NLILMERY
Sbjct: 251  LLDPGREYFNAKVISRDDGTQRHQKGLDVVLGQESAVLILDDTENAWMKHKDNLILMERY 310

Query: 786  HFFASSSRPFGLSGKSLSELKRDESEKDGALATVLTVLKHVHEMFFNLELDANLTSGDVR 965
            HFFASS   FG + KSLSE K DESE +GALA++L VL+ +H++FF    D  L+     
Sbjct: 311  HFFASSCHQFGFNCKSLSEQKTDESESEGALASILKVLRKIHQIFFE---DHILSL--AL 365

Query: 966  KVLKTVRSEVLKGCKIVFSRVWKVGERAENQKLWEIAQLLGAECSTELNQSVTHVVSTDT 1145
            +VLKTVR +VLKGCKIVFSRV+    +A+N  LW +A+ LGA CSTEL+ SVTHVVS D+
Sbjct: 366  QVLKTVRKDVLKGCKIVFSRVFPTQSQADNHHLWRMAEQLGATCSTELDPSVTHVVSKDS 425

Query: 1146 GTEKARWAVQQKKFLVHPRWIEAANYLWKRQPEEQFSINHIKN 1274
            GTEK+ WA++  KFLV P WIEAANY W+RQPEE FS N IKN
Sbjct: 426  GTEKSHWALKHNKFLVQPGWIEAANYFWQRQPEENFSFNQIKN 468


>ref|XP_004962192.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Setaria italica]
          Length = 543

 Score =  450 bits (1158), Expect = e-124
 Identities = 225/399 (56%), Positives = 290/399 (72%), Gaps = 3/399 (0%)
 Frame = +3

Query: 72   KRHKIDELD-DTGESQGATSLSAMQREPDEVANVETCPHPAFFREMCVRCG--QYMNDDS 242
            KR +++E   D G S     ++     P +   VE CPHP +F  +C RCG  Q   D S
Sbjct: 72   KRRRVEEQSQDQGTSIRPDKIAT---GPSKNVQVEVCPHPGYFGGLCFRCGKPQDEEDAS 128

Query: 243  AVAFGYIHKDLKIGTEEMDRLRGSNLNTVLRNRKXXXXXXXXXXXXNSTRVIDVSPEEEY 422
             VAFGYIHK L++GT E+DRLRG++L  +LR RK            NST++ D+S  E  
Sbjct: 129  GVAFGYIHKGLRLGTSEIDRLRGADLKNLLRERKLVLILDLDHTLINSTKLQDISSAENE 188

Query: 423  LINHTDSLQDIPNGGLFKVDTIHMLTKLRPFVRTFLKEASSMYEMYIYTMGERSYALEMA 602
            L   T +L+D P+  +F +D++ MLTKLRPFVR FLKEAS+M+EMYIYTMG+++YA+E+A
Sbjct: 189  LGIRTAALKDDPDRSIFSLDSMQMLTKLRPFVRNFLKEASNMFEMYIYTMGDKAYAIEIA 248

Query: 603  QLLDPERVYFNSRVISQADSTQKHQKGLDVVLGVDSAVVILDDTEIVWNRHRENLILMER 782
            +LLDP  VYF S+VIS +D TQ+HQKGLDV+LG +S  VILDDTE VW +H+ENLILMER
Sbjct: 249  KLLDPSNVYFPSKVISNSDCTQRHQKGLDVILGAESVAVILDDTEYVWQKHKENLILMER 308

Query: 783  YHFFASSSRPFGLSGKSLSELKRDESEKDGALATVLTVLKHVHEMFFNLELDANLTSGDV 962
            YH+FASS R FG   KSLSE  +DE E DGALATVL VLK +H +FF+  ++  L+S DV
Sbjct: 309  YHYFASSCRQFGFGVKSLSESMQDERESDGALATVLDVLKRIHTIFFDTAVETALSSRDV 368

Query: 963  RKVLKTVRSEVLKGCKIVFSRVWKVGERAENQKLWEIAQLLGAECSTELNQSVTHVVSTD 1142
            R+V+KTVR EVL+GCK+VFSRV+    R + Q +W++A+ LGA CST+++ +VTHVV+ D
Sbjct: 369  RQVIKTVRKEVLEGCKLVFSRVFPNTSRPQEQMMWKMAEHLGAVCSTDVDSTVTHVVAVD 428

Query: 1143 TGTEKARWAVQQKKFLVHPRWIEAANYLWKRQPEEQFSI 1259
             GTEKARWAV+ KKFLVHPRWIEAAN+ W RQPEE F +
Sbjct: 429  LGTEKARWAVKNKKFLVHPRWIEAANFRWHRQPEEDFPV 467


>ref|XP_002437361.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor]
            gi|241915584|gb|EER88728.1| hypothetical protein
            SORBIDRAFT_10g025580 [Sorghum bicolor]
          Length = 558

 Score =  450 bits (1157), Expect = e-124
 Identities = 224/405 (55%), Positives = 292/405 (72%), Gaps = 3/405 (0%)
 Frame = +3

Query: 72   KRHKIDE-LDDTGESQGATSLSAMQREPDEVANVETCPHPAFFREMCVRCGQYMNDD--S 242
            KR +++E L D G S     +        +   VE CPHP +F  +C RCG+  +++  S
Sbjct: 74   KRRRVEEQLQDQGTSVRPDKIPT---GASKNVQVEACPHPGYFGGLCFRCGKPQDEENVS 130

Query: 243  AVAFGYIHKDLKIGTEEMDRLRGSNLNTVLRNRKXXXXXXXXXXXXNSTRVIDVSPEEEY 422
             VAFGYIHK L++GT E+DRLRG++L  +LR RK            NST++ D+S  E+ 
Sbjct: 131  GVAFGYIHKGLRLGTSEIDRLRGADLKNLLRERKLVLILDLDHTLINSTKLQDISSAEKD 190

Query: 423  LINHTDSLQDIPNGGLFKVDTIHMLTKLRPFVRTFLKEASSMYEMYIYTMGERSYALEMA 602
            L   T + +D PN  +F +D++ MLTKLRPFVR FLKEAS+M+EMYIYTMG+++YA+E+A
Sbjct: 191  LGIQTAASKDDPNRSIFSLDSMQMLTKLRPFVREFLKEASNMFEMYIYTMGDKAYAIEIA 250

Query: 603  QLLDPERVYFNSRVISQADSTQKHQKGLDVVLGVDSAVVILDDTEIVWNRHRENLILMER 782
            +LLDP  +YF S+VIS +D TQ+HQKGLDV+LG +S  VILDDTE VW +H+ENLILMER
Sbjct: 251  KLLDPSNIYFPSKVISNSDCTQRHQKGLDVILGAESVAVILDDTEYVWQKHKENLILMER 310

Query: 783  YHFFASSSRPFGLSGKSLSELKRDESEKDGALATVLTVLKHVHEMFFNLELDANLTSGDV 962
            YHFFASS R FG   +SLSE  +DE E DGALATVL VLK +H +FF+L ++ +L+S DV
Sbjct: 311  YHFFASSCRQFGFGVRSLSESMQDERESDGALATVLDVLKRIHSIFFDLAVETDLSSQDV 370

Query: 963  RKVLKTVRSEVLKGCKIVFSRVWKVGERAENQKLWEIAQLLGAECSTELNQSVTHVVSTD 1142
            R+V+K VR E+L+GCKIVFSRV+    R + Q LW++A+ LGA CST+++ SVTHVV+ D
Sbjct: 371  RQVIKAVRKEILQGCKIVFSRVFPNNTRPQEQMLWKMAEHLGAVCSTDVDSSVTHVVTVD 430

Query: 1143 TGTEKARWAVQQKKFLVHPRWIEAANYLWKRQPEEQFSINHIKNQ 1277
             GTEKARW V  KKFLVHPRWIEAAN+ W RQPEE F +   K +
Sbjct: 431  LGTEKARWGVANKKFLVHPRWIEAANFRWHRQPEEDFPVTAPKEK 475


>ref|XP_004242582.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Solanum lycopersicum]
          Length = 512

 Score =  449 bits (1156), Expect = e-123
 Identities = 224/399 (56%), Positives = 290/399 (72%)
 Frame = +3

Query: 66   RIKRHKIDELDDTGESQGATSLSAMQREPDEVANVETCPHPAFFREMCVRCGQYMNDDSA 245
            R K+ KI+ ++   + Q + S             ++ C HP     MC+RCGQ + D+S 
Sbjct: 113  RSKKRKIELIEGAVDPQSSVSRGEPAETSGASMALDVCTHPGVMGGMCIRCGQKVEDESG 172

Query: 246  VAFGYIHKDLKIGTEEMDRLRGSNLNTVLRNRKXXXXXXXXXXXXNSTRVIDVSPEEEYL 425
            VAFGYIHK+L++  +E+ RLR  +L  +LR+RK            NSTR+ D+S EE YL
Sbjct: 173  VAFGYIHKNLRLADDEVARLREKDLKNLLRHRKLILVLDLDHTLLNSTRLADISAEESYL 232

Query: 426  INHTDSLQDIPNGGLFKVDTIHMLTKLRPFVRTFLKEASSMYEMYIYTMGERSYALEMAQ 605
             +  + L D     LFK+D IHM+TKLRPFV TFLKEASS++EMYIYTMGER YALEMA+
Sbjct: 233  KDQREVLPDALRSNLFKLDWIHMMTKLRPFVHTFLKEASSLFEMYIYTMGERPYALEMAK 292

Query: 606  LLDPERVYFNSRVISQADSTQKHQKGLDVVLGVDSAVVILDDTEIVWNRHRENLILMERY 785
            LLDP  +YF+SRVI+Q+DST++HQKGLDVVLG +SAV+ILDDTE+VW +HRENLILM+RY
Sbjct: 293  LLDPGGIYFHSRVIAQSDSTRRHQKGLDVVLGQESAVLILDDTEVVWGKHRENLILMDRY 352

Query: 786  HFFASSSRPFGLSGKSLSELKRDESEKDGALATVLTVLKHVHEMFFNLELDANLTSGDVR 965
            HFF SS R FGL  KSLSE K DE+E +GALA+VL VL+ +H +FF+ E   N+   DVR
Sbjct: 353  HFFTSSCRQFGLKCKSLSEQKSDENEAEGALASVLEVLQRIHRLFFDPERGDNIMERDVR 412

Query: 966  KVLKTVRSEVLKGCKIVFSRVWKVGERAENQKLWEIAQLLGAECSTELNQSVTHVVSTDT 1145
            +VLKTVR E+LKGCKIVF+ V  +  + EN   W++A+ LGA  STE+++SVTHVVS + 
Sbjct: 413  QVLKTVRKEILKGCKIVFTGVIPIQCQPENHYYWKLAEKLGATFSTEVDESVTHVVSMND 472

Query: 1146 GTEKARWAVQQKKFLVHPRWIEAANYLWKRQPEEQFSIN 1262
             TEK+R AV++KKFLVHPRWIEAANYLW++ PEE F ++
Sbjct: 473  KTEKSRQAVREKKFLVHPRWIEAANYLWRKPPEENFPVS 511


>ref|XP_004242583.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Solanum lycopersicum]
          Length = 472

 Score =  449 bits (1154), Expect = e-123
 Identities = 224/399 (56%), Positives = 289/399 (72%)
 Frame = +3

Query: 66   RIKRHKIDELDDTGESQGATSLSAMQREPDEVANVETCPHPAFFREMCVRCGQYMNDDSA 245
            R K+ KI+ ++   + Q   S             ++ C HP     MC+RCGQ + D+S 
Sbjct: 73   RSKKRKIELIEAAVDPQSLVSRGESAETSGASLALDVCTHPGVMGGMCIRCGQKVEDESG 132

Query: 246  VAFGYIHKDLKIGTEEMDRLRGSNLNTVLRNRKXXXXXXXXXXXXNSTRVIDVSPEEEYL 425
            VAFGYIHK+L++  +E+ RLR  +L  +LR+RK            NSTR+ D+S EE YL
Sbjct: 133  VAFGYIHKNLRLADDEVARLREKDLKNLLRHRKLILVLDLDHTLLNSTRLADISAEESYL 192

Query: 426  INHTDSLQDIPNGGLFKVDTIHMLTKLRPFVRTFLKEASSMYEMYIYTMGERSYALEMAQ 605
             +  + L D     LFK+D IHM+TKLRPFV TFLKEASS++EMYIYTMGER YALEMA+
Sbjct: 193  KDQREVLPDALRSNLFKLDWIHMMTKLRPFVHTFLKEASSLFEMYIYTMGERPYALEMAK 252

Query: 606  LLDPERVYFNSRVISQADSTQKHQKGLDVVLGVDSAVVILDDTEIVWNRHRENLILMERY 785
            LLDP  +YF+SRVI+Q+DST++HQKGLDVVLG +SAV+ILDDTE+VW +HRENLILM+RY
Sbjct: 253  LLDPGGIYFHSRVIAQSDSTRRHQKGLDVVLGQESAVLILDDTEVVWGKHRENLILMDRY 312

Query: 786  HFFASSSRPFGLSGKSLSELKRDESEKDGALATVLTVLKHVHEMFFNLELDANLTSGDVR 965
            HFF SS R FGL  KSLSE K DE+E +GALA+VL VL+ +H +FF+ E   N+   DVR
Sbjct: 313  HFFTSSCRQFGLKCKSLSEQKSDENEAEGALASVLEVLQRIHRLFFDPERGDNIMERDVR 372

Query: 966  KVLKTVRSEVLKGCKIVFSRVWKVGERAENQKLWEIAQLLGAECSTELNQSVTHVVSTDT 1145
            +VLKTVR E+LKGCKIVF+ V  +  + EN   W++A+ LGA  STE+++SVTHVVS + 
Sbjct: 373  QVLKTVRKEILKGCKIVFTGVIPIQCQPENHYYWKLAEKLGATFSTEVDESVTHVVSMND 432

Query: 1146 GTEKARWAVQQKKFLVHPRWIEAANYLWKRQPEEQFSIN 1262
             TEK+R AV++KKFLVHPRWIEAANYLW++ PEE F ++
Sbjct: 433  KTEKSRQAVREKKFLVHPRWIEAANYLWRKPPEENFPVS 471


>ref|XP_006343662.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Solanum tuberosum]
          Length = 478

 Score =  447 bits (1151), Expect = e-123
 Identities = 222/399 (55%), Positives = 289/399 (72%)
 Frame = +3

Query: 66   RIKRHKIDELDDTGESQGATSLSAMQREPDEVANVETCPHPAFFREMCVRCGQYMNDDSA 245
            R K+ KI+ ++   + Q + S             ++ C HP     MC+RCGQ + D+S 
Sbjct: 79   RSKKRKIELIEAAVDPQSSVSRGEPAETSGASLALDVCTHPGVMGGMCIRCGQKVEDESG 138

Query: 246  VAFGYIHKDLKIGTEEMDRLRGSNLNTVLRNRKXXXXXXXXXXXXNSTRVIDVSPEEEYL 425
            VAFGYIHK+L++  +E+ RLR  +L  +LR++K            NSTR+ D+S EE YL
Sbjct: 139  VAFGYIHKNLRLADDEVARLRDKDLKNLLRHKKLILVLDLDHTLLNSTRLADISAEESYL 198

Query: 426  INHTDSLQDIPNGGLFKVDTIHMLTKLRPFVRTFLKEASSMYEMYIYTMGERSYALEMAQ 605
             +  + L D     LFK+D IHM+TKLRPFV TFLKEASS++EMYIYTMGER YALEMA 
Sbjct: 199  KDQREVLPDALRNNLFKLDWIHMMTKLRPFVHTFLKEASSLFEMYIYTMGERPYALEMAS 258

Query: 606  LLDPERVYFNSRVISQADSTQKHQKGLDVVLGVDSAVVILDDTEIVWNRHRENLILMERY 785
            LLDP  +YF+SRVI+Q+DST++HQKGLDVVLG +SAV+ILDDTE+VW +HRENLILM+RY
Sbjct: 259  LLDPGGIYFHSRVIAQSDSTRRHQKGLDVVLGQESAVLILDDTEVVWGKHRENLILMDRY 318

Query: 786  HFFASSSRPFGLSGKSLSELKRDESEKDGALATVLTVLKHVHEMFFNLELDANLTSGDVR 965
            HFF SS R FGL  KSLSE K DE+E +GALA+VL VL+ +H +FF+LE   N+   DVR
Sbjct: 319  HFFTSSCRQFGLKCKSLSEQKSDENEAEGALASVLEVLQRIHRLFFDLERGDNIMERDVR 378

Query: 966  KVLKTVRSEVLKGCKIVFSRVWKVGERAENQKLWEIAQLLGAECSTELNQSVTHVVSTDT 1145
            +VLKTVR E+LKGCKIVF+ V  +  + EN   W++A+ LGA  STE+++SVTHVVS + 
Sbjct: 379  QVLKTVRKEILKGCKIVFTGVIPIQCQPENHHYWKLAEKLGATFSTEVDESVTHVVSMND 438

Query: 1146 GTEKARWAVQQKKFLVHPRWIEAANYLWKRQPEEQFSIN 1262
             TEK+R A+++KKFLVHP WIEAANYLW++ PEE F ++
Sbjct: 439  KTEKSRQALREKKFLVHPSWIEAANYLWRKPPEENFPVS 477


>gb|EYU37264.1| hypothetical protein MIMGU_mgv1a005925mg [Mimulus guttatus]
          Length = 464

 Score =  446 bits (1148), Expect = e-123
 Identities = 220/409 (53%), Positives = 293/409 (71%), Gaps = 7/409 (1%)
 Frame = +3

Query: 66   RIKRHKIDELDDTG-------ESQGATSLSAMQREPDEVANVETCPHPAFFREMCVRCGQ 224
            R+KR K++  +D          S  A  + +    P +     TC HP  +  MC++CGQ
Sbjct: 59   RVKRRKMELSEDVNFDVINSQSSSSAEQILSAGSSPKK----NTCLHPGVYAGMCMKCGQ 114

Query: 225  YMNDDSAVAFGYIHKDLKIGTEEMDRLRGSNLNTVLRNRKXXXXXXXXXXXXNSTRVIDV 404
             M+D+S VAFGYIHK+L++  +E+DRLR  +L  +LR+RK            NS R+ D+
Sbjct: 115  KMDDESGVAFGYIHKNLRLANDEIDRLRDRDLKNMLRHRKLCLVLDLDHTLLNSARLHDI 174

Query: 405  SPEEEYLINHTDSLQDIPNGGLFKVDTIHMLTKLRPFVRTFLKEASSMYEMYIYTMGERS 584
            + +E YL    ++L D     LF++D I+M+TKLRP+V TFLKEAS ++EMYIYTMGER 
Sbjct: 175  TEQEGYLNGQREALPDNLKNSLFRLDWIYMMTKLRPYVHTFLKEASKLFEMYIYTMGERP 234

Query: 585  YALEMAQLLDPERVYFNSRVISQADSTQKHQKGLDVVLGVDSAVVILDDTEIVWNRHREN 764
            YALEMA+LLDP  +YFNSR+I+Q D TQKHQKGLDVVLG +SAVVILDDTE VW++H++N
Sbjct: 235  YALEMAKLLDPGDIYFNSRIIAQGDCTQKHQKGLDVVLGQESAVVILDDTEAVWSKHKDN 294

Query: 765  LILMERYHFFASSSRPFGLSGKSLSELKRDESEKDGALATVLTVLKHVHEMFFNLELDAN 944
            LILMERYHFFASS + FG + KSLSEL+ DES+  GALA+VL  L+ +H +FF+ E   +
Sbjct: 295  LILMERYHFFASSCKQFGFNCKSLSELQSDESDTQGALASVLKRLQQIHTLFFDAERKDS 354

Query: 945  LTSGDVRKVLKTVRSEVLKGCKIVFSRVWKVGERAENQKLWEIAQLLGAECSTELNQSVT 1124
            L   DVR V+KT+R EVLKGCK+VF+RV+     +E+  LW++A+ LGA C  E++ SVT
Sbjct: 355  LEDRDVRLVMKTLRKEVLKGCKVVFTRVFPTNFPSEHHSLWKMAEKLGATCCNEIDPSVT 414

Query: 1125 HVVSTDTGTEKARWAVQQKKFLVHPRWIEAANYLWKRQPEEQFSINHIK 1271
            HVVS D GT+K+RWAVQ+KKFLVHPRWIEA+NY+W++Q EE F ++  K
Sbjct: 415  HVVSMDAGTDKSRWAVQEKKFLVHPRWIEASNYMWQKQTEENFPVSQAK 463


>ref|XP_006486243.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like isoform X1 [Citrus sinensis]
            gi|568865772|ref|XP_006486244.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 4-like
            isoform X2 [Citrus sinensis]
            gi|568865774|ref|XP_006486245.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 4-like
            isoform X3 [Citrus sinensis]
          Length = 478

 Score =  446 bits (1147), Expect = e-122
 Identities = 225/409 (55%), Positives = 291/409 (71%)
 Frame = +3

Query: 66   RIKRHKIDELDDTGESQGATSLSAMQREPDEVANVETCPHPAFFREMCVRCGQYMNDDSA 245
            RIKR K   ++   E  G T L  ++ + +    ++ CPHP     MC RCG+ + ++S 
Sbjct: 65   RIKRRKTQIVETIQERPGPTLLGNLEEKTEVSLEMDNCPHPGSLGGMCYRCGKRLEEESG 124

Query: 246  VAFGYIHKDLKIGTEEMDRLRGSNLNTVLRNRKXXXXXXXXXXXXNSTRVIDVSPEEEYL 425
            V F YI K L++G +E+DRLR +++  +LR+RK            NST ++ ++PEE+YL
Sbjct: 125  VTFSYICKGLRLGNDEIDRLRNTDMKHLLRHRKLYLILDLDHTLLNSTLLLHLTPEEDYL 184

Query: 426  INHTDSLQDIPNGGLFKVDTIHMLTKLRPFVRTFLKEASSMYEMYIYTMGERSYALEMAQ 605
             +  DSLQD+  G LF +  ++M+TKLRPFV TFLKEAS M+EMYIYTMG+R YALEMA+
Sbjct: 185  KSQADSLQDVSKGSLFMLAFMNMMTKLRPFVHTFLKEASEMFEMYIYTMGDRPYALEMAK 244

Query: 606  LLDPERVYFNSRVISQADSTQKHQKGLDVVLGVDSAVVILDDTEIVWNRHRENLILMERY 785
            LLDP R YFN+RVIS+ D TQ+HQKGLDVVLG +SAV+ILDDTE  W +HR+NLILMERY
Sbjct: 245  LLDPSREYFNARVISRDDGTQRHQKGLDVVLGQESAVLILDDTENAWTKHRDNLILMERY 304

Query: 786  HFFASSSRPFGLSGKSLSELKRDESEKDGALATVLTVLKHVHEMFFNLELDANLTSGDVR 965
            HFFASS R FG   +SLS+L+ DESE +GALA+VL VLK +H +FF+ EL  +L   DVR
Sbjct: 305  HFFASSCRQFGYHCQSLSQLRSDESELEGALASVLKVLKRIHNIFFD-ELANDLAGRDVR 363

Query: 966  KVLKTVRSEVLKGCKIVFSRVWKVGERAENQKLWEIAQLLGAECSTELNQSVTHVVSTDT 1145
            +VLK VR EVLKGCK+VFS V+     A+   LW++A+ LGA C  EL+ SVTHVVSTD 
Sbjct: 364  QVLKMVRGEVLKGCKLVFSHVFPTKFPADTHYLWKMAEQLGATCLIELDPSVTHVVSTDA 423

Query: 1146 GTEKARWAVQQKKFLVHPRWIEAANYLWKRQPEEQFSINHIKNQ*YFPA 1292
             TEK+RWA ++ KFLV PRWIE AN+LW+RQPEE F +   K +  F A
Sbjct: 424  RTEKSRWAAKEAKFLVDPRWIETANFLWQRQPEENFPVKQNKPEENFHA 472


>gb|EXC26161.1| RNA polymerase II C-terminal domain phosphatase-like 4 [Morus
            notabilis]
          Length = 512

 Score =  444 bits (1142), Expect = e-122
 Identities = 217/376 (57%), Positives = 278/376 (73%), Gaps = 1/376 (0%)
 Frame = +3

Query: 153  DEVANVETCPHPAFFREMCVRCGQYMNDDSAVAFGYIHKDLKIGTEEMDRLRGSNLNTVL 332
            +E    + C HP  F +MC+ CGQ + +++ V FGYIHK L++  +E+ RLR +++  ++
Sbjct: 138  EESTKKDACTHPGSFGDMCILCGQRLEEETGVTFGYIHKGLRLNNDEIVRLRSTDMKNLI 197

Query: 333  RNRKXXXXXXXXXXXXNSTRVIDVSPEEEYLINHTDSLQDIPNGGLFKVDTIHMLTKLRP 512
            R++K            NSTR++D+S EE+YL +   S QD   G LF ++ +HM+TKLRP
Sbjct: 198  RHKKLCLVLDLDHTLLNSTRLVDLSSEEQYLKSQAFSPQDASEGSLFVLEAMHMMTKLRP 257

Query: 513  FVRTFLKEASSMYEMYIYTMGERSYALEMAQLLDPERVYFNSRVISQADSTQKHQKGLDV 692
            FVR FLKE  +++E+Y+YTMG+R YAL MA+LLDP R YF  R+IS+ D T KHQKGLDV
Sbjct: 258  FVRNFLKEVYNLFELYVYTMGDRPYALAMAKLLDPRREYFGDRIISRDDGTLKHQKGLDV 317

Query: 693  VLGVDSAVVILDDTEIVW-NRHRENLILMERYHFFASSSRPFGLSGKSLSELKRDESEKD 869
            VLG +SAV+ILDDTE  W   H+ENLILMERYHFF SS+  FG + KSLSELK DESE +
Sbjct: 318  VLGQESAVLILDDTENAWIKHHKENLILMERYHFFRSSTHQFGYNCKSLSELKSDESETE 377

Query: 870  GALATVLTVLKHVHEMFFNLELDANLTSGDVRKVLKTVRSEVLKGCKIVFSRVWKVGERA 1049
            GAL TVL VLK VH MFF+ E   +    DVR+VLKT+R EVLKGCKIVFSRV+    +A
Sbjct: 378  GALVTVLNVLKQVHSMFFD-ERGIDHIIRDVRQVLKTLRKEVLKGCKIVFSRVFPTEFQA 436

Query: 1050 ENQKLWEIAQLLGAECSTELNQSVTHVVSTDTGTEKARWAVQQKKFLVHPRWIEAANYLW 1229
            EN +LW++A+ LGA C  EL+ SVTHVVS D GTEK+RWAV++ KFLVHPRWIEAANY+W
Sbjct: 437  ENHQLWKMAEQLGATCGIELDPSVTHVVSLDVGTEKSRWAVKENKFLVHPRWIEAANYMW 496

Query: 1230 KRQPEEQFSINHIKNQ 1277
            KRQPE+ FS+N +KNQ
Sbjct: 497  KRQPEDNFSVNQVKNQ 512


>ref|XP_007154891.1| hypothetical protein PHAVU_003G156800g [Phaseolus vulgaris]
            gi|561028245|gb|ESW26885.1| hypothetical protein
            PHAVU_003G156800g [Phaseolus vulgaris]
          Length = 441

 Score =  441 bits (1133), Expect = e-121
 Identities = 221/398 (55%), Positives = 292/398 (73%)
 Frame = +3

Query: 66   RIKRHKIDELDDTGESQGATSLSAMQREPDEVANVETCPHPAFFREMCVRCGQYMNDDSA 245
            RIKRHKI+ +++T   +G+T    +++  +    V+ C HP  F  MC+RCGQ ++ +S 
Sbjct: 48   RIKRHKIESIEET---EGSTLEGIIKQNLEVSVKVDVCSHPGSFGSMCIRCGQKLDGESG 104

Query: 246  VAFGYIHKDLKIGTEEMDRLRGSNLNTVLRNRKXXXXXXXXXXXXNSTRVIDVSPEEEYL 425
            V FGYIHK L++  +E+ RLR +++ ++L  +K            NST + D+S EE  L
Sbjct: 105  VTFGYIHKGLRLHDDEISRLRNTDMKSLLCRKKLYFVLDLDHTLLNSTHLSDLSSEESSL 164

Query: 426  INHTDSLQDIPNGGLFKVDTIHMLTKLRPFVRTFLKEASSMYEMYIYTMGERSYALEMAQ 605
            ++ TDSL+D+  G LFK+D +HM+TKLRPFVR+FLKEAS M+EMYIYTMG+R YALEMA+
Sbjct: 165  LDQTDSLEDVSKGSLFKLDHMHMMTKLRPFVRSFLKEASEMFEMYIYTMGDRPYALEMAK 224

Query: 606  LLDPERVYFNSRVISQADSTQKHQKGLDVVLGVDSAVVILDDTEIVWNRHRENLILMERY 785
            LLDP  VYFN++VIS+ D TQKHQKGLDVVLG +SAV+ILDDTE  W +H++NLILMERY
Sbjct: 225  LLDPRGVYFNAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTEHAWMKHKDNLILMERY 284

Query: 786  HFFASSSRPFGLSGKSLSELKRDESEKDGALATVLTVLKHVHEMFFNLELDANLTSGDVR 965
            HFFASS R FG + KSL+EL+ DE E DGALA +L VL+ VH  FF+   + +L   DVR
Sbjct: 285  HFFASSCRQFGFNCKSLAELRNDEDETDGALAKILKVLRQVHCTFFDKHQE-DLVDRDVR 343

Query: 966  KVLKTVRSEVLKGCKIVFSRVWKVGERAENQKLWEIAQLLGAECSTELNQSVTHVVSTDT 1145
            +VL +VRSEVL GC IVFSR++          L ++A+ +GA C TE++ SVTHVV+TD 
Sbjct: 344  QVLASVRSEVLGGCVIVFSRIF----HGALPSLRKMAEQMGATCLTEVDLSVTHVVATDA 399

Query: 1146 GTEKARWAVQQKKFLVHPRWIEAANYLWKRQPEEQFSI 1259
            GTEK+RWAV++ KFLVHPRWIEAAN+ W++QPEE F I
Sbjct: 400  GTEKSRWAVKEHKFLVHPRWIEAANFFWEKQPEENFFI 437


>ref|XP_007149092.1| hypothetical protein PHAVU_005G040600g [Phaseolus vulgaris]
            gi|593697222|ref|XP_007149093.1| hypothetical protein
            PHAVU_005G040600g [Phaseolus vulgaris]
            gi|561022356|gb|ESW21086.1| hypothetical protein
            PHAVU_005G040600g [Phaseolus vulgaris]
            gi|561022357|gb|ESW21087.1| hypothetical protein
            PHAVU_005G040600g [Phaseolus vulgaris]
          Length = 443

 Score =  441 bits (1133), Expect = e-121
 Identities = 222/399 (55%), Positives = 291/399 (72%), Gaps = 1/399 (0%)
 Frame = +3

Query: 66   RIKRHKIDELDDTGESQGATSLSAMQREPDEVANVETCPHPAFFREMCVRCGQYMNDDSA 245
            RIKR KI+    T E++G+TS   +++  +    V+ C HP  F  MC+RCGQ ++  S 
Sbjct: 48   RIKRRKIES---TEETEGSTSEGILKQNLETSVEVDVCTHPGSFGSMCIRCGQKLDGKSG 104

Query: 246  VAFGYIHKDLKIGTEEMDRLRGSNLNTVLRNRKXXXXXXXXXXXXNSTRVIDVSPEEEYL 425
            V FGYIHK L++  EE+ RLR +++ ++L  +K            NST +  +S EE +L
Sbjct: 105  VTFGYIHKGLRLHDEEISRLRNTDMKSLLCRKKLYLVLDLDHTLLNSTLLAHLSSEESHL 164

Query: 426  INHTDSLQDIPNGGLFKVDTIHMLTKLRPFVRTFLKEASSMYEMYIYTMGERSYALEMAQ 605
            +N TDSLQD+  G LFK++ +HM+TKLRPFVR+FLKEA+ M+EMYIYTMG+R YALEMA+
Sbjct: 165  LNQTDSLQDVSKGSLFKLEHMHMMTKLRPFVRSFLKEATEMFEMYIYTMGDRPYALEMAK 224

Query: 606  LLDPERVYFNSRVISQADSTQKHQKGLDVVLGVDSAVVILDDTEIVWNRHRENLILMERY 785
            LLDP+  YFN+RVIS+ D TQKHQKGLDVVLG +SAV+ILDDTE  W +H++NLILMERY
Sbjct: 225  LLDPQGEYFNARVISRDDGTQKHQKGLDVVLGQESAVLILDDTEHAWMKHKDNLILMERY 284

Query: 786  HFFASSSRPFGLSGKSLSELKRDESEKDGALATVLTVLKHVHEMFFNL-ELDANLTSGDV 962
            HFFASS R FG + KS +EL+ DE E DGALA +L VLK VH  FF+  + D +L + DV
Sbjct: 285  HFFASSCRQFGFNCKSPAELRNDEDETDGALAKILKVLKQVHCTFFDKHQEDDDLVNRDV 344

Query: 963  RKVLKTVRSEVLKGCKIVFSRVWKVGERAENQKLWEIAQLLGAECSTELNQSVTHVVSTD 1142
            R+VL +VRSEVL GC IVFSR++          L ++A+ +GA C  E++ SVTH+V+TD
Sbjct: 345  RQVLSSVRSEVLSGCVIVFSRIF----HGALPSLQKMAEQMGATCLAEVDPSVTHIVATD 400

Query: 1143 TGTEKARWAVQQKKFLVHPRWIEAANYLWKRQPEEQFSI 1259
             GTEK+RWA+++KKFLVHPRWIEAANY W++QPEE F I
Sbjct: 401  AGTEKSRWALKEKKFLVHPRWIEAANYFWEKQPEENFII 439


>gb|AFW77884.1| CPL3 [Zea mays]
          Length = 533

 Score =  435 bits (1118), Expect = e-119
 Identities = 215/405 (53%), Positives = 287/405 (70%), Gaps = 3/405 (0%)
 Frame = +3

Query: 72   KRHKIDE-LDDTGESQGATSLSAMQREPDEVANVETCPHPAFFREMCVRCGQYMNDD--S 242
            KR +++E   D G S     +        ++  VE CPHP  F  +C+ CG+  +++  S
Sbjct: 72   KRRRVEEQCQDQGTSVRPDKIPT---GASKIVQVEACPHPGHFGGLCIICGKPQDEEDVS 128

Query: 243  AVAFGYIHKDLKIGTEEMDRLRGSNLNTVLRNRKXXXXXXXXXXXXNSTRVIDVSPEEEY 422
             VAFGYIHK L++GT E+DRLRG++L  +LR RK            NST++ D+S  E+ 
Sbjct: 129  GVAFGYIHKGLRLGTSEIDRLRGADLKNLLRERKLVLILDLDHTLINSTKLQDISSAEKD 188

Query: 423  LINHTDSLQDIPNGGLFKVDTIHMLTKLRPFVRTFLKEASSMYEMYIYTMGERSYALEMA 602
            L   + + +D PN  +F +D + MLTKLRPFVR FLKEAS+M+EMYIYTMG+++YA+E+A
Sbjct: 189  LGIQSAASKDDPNRSIFALDLMPMLTKLRPFVREFLKEASNMFEMYIYTMGDKAYAIEIA 248

Query: 603  QLLDPERVYFNSRVISQADSTQKHQKGLDVVLGVDSAVVILDDTEIVWNRHRENLILMER 782
            +LLDP  +YF S+VIS +D TQ+HQKGLDV+LG +S  VILDDTE VW +H+ENLILMER
Sbjct: 249  KLLDPSNIYFPSKVISNSDCTQRHQKGLDVILGAESVAVILDDTEYVWQKHKENLILMER 308

Query: 783  YHFFASSSRPFGLSGKSLSELKRDESEKDGALATVLTVLKHVHEMFFNLELDANLTSGDV 962
            YHFFASS R FG   +SLSE  +DE E DGALATVL VLK +H  FF++  + +L+S D+
Sbjct: 309  YHFFASSCRQFGFGVRSLSESLQDERESDGALATVLDVLKRIHATFFDMAAETDLSSRDI 368

Query: 963  RKVLKTVRSEVLKGCKIVFSRVWKVGERAENQKLWEIAQLLGAECSTELNQSVTHVVSTD 1142
            R+V+KT+R E+L+GCKIVFSRV+    R + Q +W++A+ LGA C  +++ SVTHVV+ D
Sbjct: 369  RQVIKTLRKEILQGCKIVFSRVFPNNTRPQEQMVWKMAEYLGAVCVKDVDPSVTHVVTVD 428

Query: 1143 TGTEKARWAVQQKKFLVHPRWIEAANYLWKRQPEEQFSINHIKNQ 1277
             GTEKARW +  KKFLVHPRWIEAAN+ W RQPEE F +   K +
Sbjct: 429  LGTEKARWGLNNKKFLVHPRWIEAANFRWHRQPEEDFPVTAPKEK 473

