BLASTX nr result
ID: Ephedra25_contig00017285
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Ephedra25_contig00017285
         (1840 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006838087.1| hypothetical protein AMTR_s00106p00017820 [A...   424   e-116
gb|EOY32064.1| RNA polymerase II ctd phosphatase, putative isofo...   395   e-107
ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative ...   393   e-106
ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal doma...   389   e-105
gb|EMJ05544.1| hypothetical protein PRUPE_ppa005647mg [Prunus pe...   387   e-105
ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Popu...   387   e-105
ref|XP_001781984.1| predicted protein [Physcomitrella patens] gi...   386   e-104
ref|XP_004962192.1| PREDICTED: RNA polymerase II C-terminal doma...   384   e-103
ref|XP_002437361.1| hypothetical protein SORBIDRAFT_10g025580 [S...   384   e-103
ref|XP_004242583.1| PREDICTED: RNA polymerase II C-terminal doma...   380   e-103
ref|XP_006343662.1| PREDICTED: RNA polymerase II C-terminal doma...   380   e-102
ref|XP_004242582.1| PREDICTED: RNA polymerase II C-terminal doma...   379   e-102
dbj|BAB08870.1| unnamed protein product [Arabidopsis thaliana]        377   e-102
ref|NP_001078764.1| C-terminal domain phosphatase-like 4 [Arabid...   377   e-102
ref|XP_004287124.1| PREDICTED: RNA polymerase II C-terminal doma...   376   e-101
ref|XP_006486243.1| PREDICTED: RNA polymerase II C-terminal doma...   375   e-101
ref|XP_002965594.1| hypothetical protein SELMODRAFT_167775 [Sela...   374   e-101
ref|XP_002324546.2| hypothetical protein POPTR_0018s11760g [Popu...   373   e-100
ref|XP_002864543.1| hypothetical protein ARALYDRAFT_332090 [Arab...
372 e-100 gb|EOY32065.1| RNA polymerase II ctd phosphatase, putative isofo... 369 2e-99 >ref|XP_006838087.1| hypothetical protein AMTR_s00106p00017820 [Amborella trichopoda] gi|548840545|gb|ERN00656.1| hypothetical protein AMTR_s00106p00017820 [Amborella trichopoda] Length = 486 Score = 424 bits (1090), Expect = e-116 Identities = 222/472 (47%), Positives = 309/472 (65%), Gaps = 18/472 (3%) Frame = +2 Query: 149 ELDSISNAS-ENHQDDDDQEFESSPD-----------IDFDSEDRPRIKRRKLYDNEESE 292 E D +SN+S E+ + D+D E + + D++ + RIKR K+ ++EE + Sbjct: 22 ETDLLSNSSGESPERDEDILDEITSESVSERSAVWDSTDYEEIELERIKRPKICEDEEIK 81 Query: 293 EQLPI-----KIEDKPESSSARGCL-HPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIH 454 E ++++ ES+S + C HPG+ +CI+CG+ K D +++ V YIH Sbjct: 82 ESQSSNANQGELDNFKESTSEKVCPPHPGFYKDMCIRCGEQKDDETVARKETAVAFNYIH 141 Query: 455 KEFELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVDPEEESYISSTYLV 634 K+ +L E+ARLR DLK++ +K NS R +DV PEEE+Y+++TYL Sbjct: 142 KDLKLGAEEVARLRATDLKNLYRRRKLYLVLDLDHTLLNSTRLVDVSPEEEAYLNATYL- 200 Query: 635 GQRNVESESGDIGRTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMA 814 + S +GD TL+KL +HM TKLRPFV FL+EAN M+EMY+YTMGERAYA +MA Sbjct: 201 -NKETSSSNGDTSGTLFKLEPLHMLTKLRPFVRTFLKEANTMFEMYVYTMGERAYALEMA 259 Query: 815 KLLDPKGKFFADRIISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIVVER 994 KLLDP G +F R+IS DST R QK LDVVLG+E AVVILDDTE VW KHK NL+++ER Sbjct: 260 KLLDPSGVYFGSRVISQGDSTVRHQKGLDVVLGSECAVVILDDTEHVWHKHKENLVLMER 319 Query: 995 YHFFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCF 1174 YHFF +SC QF+ KSL++ KRDE+ES+G L+ +L VL+ +HQ F+ + + F Sbjct: 320 YHFFSSSCRQFNVHYKSLSELKRDESESDGMLASILNVLKHIHQMFY-----YQEVETDF 374 Query: 1175 DSRDVRLVLKEIRARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLREDVTH 1354 + DVR VLK I++ +LK C++VFSR+FPT + E LWR+AE+LGA CS L E VTH Sbjct: 375 NGSDVRKVLKTIQSEVLKGCRLVFSRIFPTNYPVENQTLWRIAEQLGASCSKELDEAVTH 434 Query: 1355 VVSLDAGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQSS 1510 VVSLD GT+K+RWA+Q LV+P W+EA+++ W+RQ ED+FP+ K+ S Sbjct: 435 
VVSLDLGTEKARWAIQRKKHLVNPGWLEATNYFWKRQPEDQFPIPSKNGGGS 486 >gb|EOY32064.1| RNA polymerase II ctd phosphatase, putative isoform 1 [Theobroma cacao] Length = 469 Score = 395 bits (1014), Expect = e-107 Identities = 224/461 (48%), Positives = 283/461 (61%), Gaps = 7/461 (1%) Frame = +2 Query: 143 DFELDSISNASENHQDDDDQEFESSPDIDFDSEDRPRIKRRKLYDNEESEEQLPIK-IED 319 D E D +N N DDD D DS+ R K KL D EES IED Sbjct: 38 DVEADGDNNNDNNDDHDDDD--------DLDSQRNKRCKTEKLEDLEESRGSTSQGLIED 89 Query: 320 K----PESSSARG-CLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEI 484 K E S + C HPG +CI CGQ D +S V YIHK L + EI Sbjct: 90 KIVIHAELSLKKDICTHPGSFGQMCILCGQRLDD------ESGVTFGYIHKGLRLGNDEI 143 Query: 485 ARLRKDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVDPEEESYISSTYLVGQRNVESESG 664 RLR D+K++L KK NS + + + P+EE YL GQ + Sbjct: 144 VRLRSTDMKNLLRHKKLYLVLDLDHTLLNSTQLMHLTPDEE------YLKGQSD---SLQ 194 Query: 665 DIGR-TLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKF 841 D+ R +L+ L MHM TKLRPFV FL+EA++M+EMYIYTMG+R YA +MAKLLDP+ ++ Sbjct: 195 DVSRGSLFMLDFMHMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEMAKLLDPRREY 254 Query: 842 FADRIISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCL 1021 F+DR+IS D T + QK LDVVLG ESAVVILDDTE W KHK NLI++ERYH+F +SC Sbjct: 255 FSDRVISRDDGTQKHQKGLDVVLGQESAVVILDDTENAWMKHKDNLILMERYHYFASSCH 314 Query: 1022 QFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVL 1201 QF KSL+Q K DE+E +G L+ +L+ LR +H FFD + SRDVR VL Sbjct: 315 QFGYKCKSLSQLKSDESEPDGALASVLKALRQIHHMFFDELDCN------LASRDVRQVL 368 Query: 1202 KEIRARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLREDVTHVVSLDAGTD 1381 K ++ +LK CK+VFS VFPT F AE+H LW++AE+LGA CST VTHVVS DAGT+ Sbjct: 369 KTVQEEVLKGCKIVFSHVFPTNFPAESHPLWKMAEQLGATCSTETDLSVTHVVSTDAGTE 428 Query: 1382 KSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQ 1504 KSRWAV+ FLVHP WIEA+++LWQ+Q E+ FPV++ +Q Sbjct: 429 KSRWAVKEKKFLVHPRWIEATNYLWQKQPEENFPVSQGKNQ 469 >ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis] gi|223534449|gb|EEF36151.1| RNA polymerase II ctd phosphatase, putative 
[Ricinus communis] Length = 478 Score = 393 bits (1010), Expect = e-106 Identities = 217/470 (46%), Positives = 295/470 (62%), Gaps = 18/470 (3%) Frame = +2 Query: 137 LIDFELDSISNASEN-------------HQDDDDQEFESSPDIDFDSEDRP-RIKRRK-- 268 L+D ELDS S++S++ + D +++E E D D DS+ RIKR + Sbjct: 23 LLDAELDSKSSSSDSSPKAIKHDDASDANDDVNEEEEEEESDSDDDSDIATNRIKRSRVE 82 Query: 269 -LYDNEESEEQLPIKIEDKPESSSAR-GCLHPGYMWGVCIKCGQNKPDSEENEQQSRVCL 442 L + E +E + ++ +SS++ C HPG +CI CG E +++ V Sbjct: 83 TLENGENPKESTRVSLDQTLVASSSKVACTHPGSFGDMCILCG------ERLIEETGVTF 136 Query: 443 KYIHKEFELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVDPEEESYISS 622 YIHK L++ EI RLR D+K++L +K NS + + + EEE Sbjct: 137 GYIHKGLRLANDEIVRLRNTDMKNLLRHRKLYLVLDLDHTLLNSTQLMHLTAEEE----- 191 Query: 623 TYLVGQRNVESESGDIGRTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYA 802 YL Q ++S +L+ + MHM TKLRPF+ FL+EA+QM+EMYIYTMG+RAYA Sbjct: 192 -YLKSQ--IDSMQDVSNGSLFMVDFMHMMTKLRPFIRTFLKEASQMFEMYIYTMGDRAYA 248 Query: 803 EKMAKLLDPKGKFFADRIISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLI 982 +MAK LDP ++F R+IS D T R QK LD+VLG ESAV+ILDDTE W KHK NLI Sbjct: 249 LEMAKFLDPGREYFNARVISRDDGTQRHQKGLDIVLGQESAVLILDDTENAWTKHKDNLI 308 Query: 983 VVERYHFFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMS 1162 ++ERYHFF +SC QF KSL+Q K DE ES+G L+ +L+VLR +H FFD + + Sbjct: 309 LMERYHFFASSCRQFGFECKSLSQLKSDENESDGALASVLKVLRRIHHIFFDELED---- 364 Query: 1163 QFCFDSRDVRLVLKEIRARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLRE 1342 D RDVR VL +R +LK CK+VFSRVFPTQF+A+ HHLW++AE+LGA CS + Sbjct: 365 --AIDGRDVRQVLSTVRKDVLKGCKIVFSRVFPTQFQADNHHLWKMAEQLGATCSREVDP 422 Query: 1343 DVTHVVSLDAGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVTK 1492 VTHVVS +AGT+KSRWA++ND FLVHP WIEA++++WQRQ E+ F V + Sbjct: 423 SVTHVVSAEAGTEKSRWALKNDKFLVHPRWIEATNYMWQRQPEENFSVNQ 472 >ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Cucumis sativus] Length = 452 Score = 389 bits (999), Expect = e-105 Identities = 208/438 (47%), Positives = 280/438 (63%), Gaps = 5/438 (1%) Frame = +2 Query: 188 
DDDDQEFESSPDIDFDSEDRP---RIKRRKLYDNEESEEQLPIKIEDKPES--SSARGCL 352 D D +SSPD + + ++ RIKRRK+ E SEE + ++E++ S + C Sbjct: 24 DLDSHSSDSSPDEETEGDNNAESVRIKRRKVEKLENSEEDIMHEVEEQSLEVLSKQQLCS 83 Query: 353 HPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEIARLRKDDLKSVLSAKK 532 HPG +CI CGQ +++S V YIHKE L++ EI R+R ++K +L KK Sbjct: 84 HPGSFGNMCIICGQRL------DEESGVTFGYIHKELRLNNDEINRMRNKEMKELLQRKK 137 Query: 533 XXXXXXXXXXXXNSARFIDVDPEEESYISSTYLVGQRNVESESGDIGRTLYKLPSMHMWT 712 NS + EEE S T +S +L+ L S+H T Sbjct: 138 LILVLDLDHTLLNSTELRYLTVEEEYLRSQT--------DSLDDVTKGSLFLLNSVHTMT 189 Query: 713 KLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADRIISSSDSTNRRQK 892 KLRPFVH FL+EA++++EMYIYTMGER YA +MAKLLDPK ++F+ ++IS D T + QK Sbjct: 190 KLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQK 249 Query: 893 DLDVVLGAESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHCGTKSLTQSKRDEA 1072 LDVVLG ESAV+ILDDTE W KHK NLI++ERYHFF +SC QF KSL++ K DE+ Sbjct: 250 GLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCKSLSELKNDES 309 Query: 1073 ESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVLKEIRARILKDCKVVFSR 1252 E++G L+ +L+VL+ VH FF+ +S + RDVR VLK +RA +L+ CKVVFSR Sbjct: 310 ETDGALTTILKVLKQVHHMFFNEVSGDLV------DRDVRQVLKTVRAEVLEGCKVVFSR 363 Query: 1253 VFPTQFRAETHHLWRLAEELGAICSTSLREDVTHVVSLDAGTDKSRWAVQNDCFLVHPSW 1432 VFPT+F+AE H LW++ E+LG CST L + VTHVV+ DAGT+KSRWA++ FLVHP W Sbjct: 364 VFPTKFQAENHQLWKMVEQLGGTCSTELDQSVTHVVATDAGTEKSRWALKEKKFLVHPRW 423 Query: 1433 IEASSFLWQRQSEDRFPV 1486 IEAS++ W+RQ E+ F V Sbjct: 424 IEASNYFWKRQMEENFTV 441 >gb|EMJ05544.1| hypothetical protein PRUPE_ppa005647mg [Prunus persica] Length = 449 Score = 387 bits (995), Expect = e-105 Identities = 214/436 (49%), Positives = 282/436 (64%), Gaps = 9/436 (2%) Frame = +2 Query: 212 SSPD--IDFDSEDRPR--IKRRKLYD----NEESEEQLPIKIEDKPESSSARG-CLHPGY 364 SSPD D++S+D KRRK+ + +E I +E+ E+S + C HPG Sbjct: 31 SSPDEEADYESDDGSERSTKRRKVENLGSIDETQGSTSQIFVEENSEASPKKDICTHPGS 90 Query: 365 MWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEIARLRKDDLKSVLSAKKXXXX 544 + +CI CGQ +++S V L YIHK+F L++ EI 
R+R D+K L KK Sbjct: 91 VKDLCIVCGQRV------DEKSGVPLGYIHKDFWLNNDEIDRVRSTDIKKSLHLKKLYLV 144 Query: 545 XXXXXXXXNSARFIDVDPEEESYISSTYLVGQRNVESESGDIGRTLYKLPSMHMWTKLRP 724 NS + EEE YL Q + + D +L+++ MHM TKLRP Sbjct: 145 LDLDHTLLNSTHLNHMTAEEE------YLHSQTDSLQDVSD--GSLFRVDVMHMMTKLRP 196 Query: 725 FVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADRIISSSDSTNRRQKDLDV 904 FV KFL+EA++M+EMYIYTMGERAYA +MAKLLDP+ ++F DR+IS D T + QK LDV Sbjct: 197 FVRKFLKEASEMFEMYIYTMGERAYALEMAKLLDPRKEYFGDRVISRDDGTQKHQKGLDV 256 Query: 905 VLGAESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHCGTKSLTQSKRDEAESEG 1084 VLG ESA +ILDDTE W KHK NLI++ERYHFF +SC QF KSL++ K DE+E EG Sbjct: 257 VLGHESAALILDDTENAWTKHKDNLILMERYHFFRSSCHQFGFHCKSLSELKSDESEPEG 316 Query: 1085 TLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVLKEIRARILKDCKVVFSRVFPT 1264 L+ +L+VL+ +H FF + + RDVR VLK +R ILK CK+VFSRVFP+ Sbjct: 317 ALATVLEVLKRIHNMFFYESKDNLI------DRDVRQVLKTLRKEILKGCKIVFSRVFPS 370 Query: 1265 QFRAETHHLWRLAEELGAICSTSLREDVTHVVSLDAGTDKSRWAVQNDCFLVHPSWIEAS 1444 +F+AE H LW++AE+LGA CST L VTHVVS DAGT+KSRWAV+ FLVHP WIEAS Sbjct: 371 KFQAENHQLWKMAEQLGATCSTELDLSVTHVVSTDAGTEKSRWAVKEKKFLVHPQWIEAS 430 Query: 1445 SFLWQRQSEDRFPVTK 1492 +++W +Q+ED+FPV + Sbjct: 431 NYMWLKQAEDKFPVNQ 446 >ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa] gi|550318538|gb|EEF03112.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa] Length = 472 Score = 387 bits (994), Expect = e-105 Identities = 221/466 (47%), Positives = 290/466 (62%), Gaps = 19/466 (4%) Frame = +2 Query: 140 IDFELDSISNASENHQDD------DDQEFESSPDIDFDSED-------RPRIKRRKLYDN 280 +D ELDS S+AS D+ D SSPD D ++E+ R R+KR K+ Sbjct: 21 LDTELDSKSSASSASDDEAPNQRHSDSAASSSPDQDKEAEEDDDSDFQRKRVKRSKVETV 80 Query: 281 EESEEQLPI----KIEDKPESSSARG-CLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLK 445 E E+ ++ E+S ++ C HPG +CI CGQ + +S V Sbjct: 81 EIVEDDGGTTSFASLKHNSEASISKEICTHPGSFGTMCIVCGQLL------DGESGVTFG 134 Query: 446 YIHKEFELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVDPEEESYISST 625 YIHK L + EI RLR D+K++L KK NS + + + +EE 
Sbjct: 135 YIHKGLRLGNDEIVRLRNTDMKNLLRHKKLYLILDLDHTLLNSTQLMHMTLDEE------ 188 Query: 626 YLVGQRNVESESGDIGR-TLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYA 802 YL GQ + D+ + +L+ L SM M TKLRPFV FL+EA+QM+EMYIYTMG+RAYA Sbjct: 189 YLNGQTD---SLQDVSKGSLFMLSSMQMMTKLRPFVRTFLKEASQMFEMYIYTMGDRAYA 245 Query: 803 EKMAKLLDPKGKFFADRIISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLI 982 +MAKLLDP ++F ++IS D T R QK LDVVLG ESAV+ILDDTE W KHK NLI Sbjct: 246 LEMAKLLDPGREYFNAKVISRDDGTQRHQKGLDVVLGQESAVLILDDTENAWMKHKDNLI 305 Query: 983 VVERYHFFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMS 1162 ++ERYHFF +SC QF KSL++ K DE+ESEG L+ +L+VLR +HQ FF+ + Sbjct: 306 LMERYHFFASSCHQFGFNCKSLSEQKTDESESEGALASILKVLRKIHQIFFEELEEN--- 362 Query: 1163 QFCFDSRDVRLVLKEIRARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLRE 1342 D RDVR VLK +R +LK CK+VFSRVFPTQ +A+ HHLWR+AE+LGA CST L Sbjct: 363 ---MDGRDVRQVLKTVRKDVLKGCKIVFSRVFPTQSQADNHHLWRMAEQLGATCSTELDP 419 Query: 1343 DVTHVVSLDAGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRF 1480 VTHVVS D+GT+KS WA++++ FLV P WIEA+++ WQRQ E+ F Sbjct: 420 SVTHVVSKDSGTEKSHWALKHNKFLVQPGWIEAANYFWQRQPEENF 465 >ref|XP_001781984.1| predicted protein [Physcomitrella patens] gi|162666557|gb|EDQ53208.1| predicted protein [Physcomitrella patens] Length = 563 Score = 386 bits (992), Expect = e-104 Identities = 213/471 (45%), Positives = 293/471 (62%), Gaps = 6/471 (1%) Frame = +2 Query: 149 ELDSISNASENHQDDDDQEFESSPDIDFDSEDRPRIKRRKLYDNEESEEQLPIKIEDKPE 328 +L+ ++ + + ++D D ES D + D ++ E S++ PI + E Sbjct: 63 DLEEGTSENGSVEEDADSVVESEDHADDNHLDVEKVA-------ESSDDITPICVNYSGE 115 Query: 329 SSSARGCL-HPGYMWGVCIKCGQNKPDSEENEQQ-SRVCLKYIHKEFELSDSEIARLRKD 502 ++ C HPG++W VCI+CG+ K + N+ RV L+YIH+ E+S+ E AR+R Sbjct: 116 MVNSNKCPPHPGFIWDVCIRCGKRKSTAPSNDPVIDRVGLRYIHEGLEVSELEAARVRNA 175 Query: 503 DLKSVLSAKKXXXXXXXXXXXXNSARFIDVDPEEESYISSTYLVGQRNVESESGDIGRTL 682 +L+ V +K NSARF +V EE Y+ T+ GQ++ S L Sbjct: 176 ELRRVTGKQKLLLVVDLDHTMLNSARFSEVPAEERIYL--TWTAGQQHGRVSS------L 227 Query: 683 YKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADRIIS 
862 ++L + MWTKLRPF HKFLEEA+++YEMY+YTMGE+ YA+ MA+LLDP G+ F RIIS Sbjct: 228 HQLTKLGMWTKLRPFAHKFLEEASKLYEMYVYTMGEKIYAQAMAELLDPTGQLFGGRIIS 287 Query: 863 SSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHCGTK 1042 +DST R KDLDVVLGAESAVVILDDTE VWP H+SNLI++ERYHFF +SC QF Sbjct: 288 QTDSTKRHTKDLDVVLGAESAVVILDDTEAVWPNHRSNLILMERYHFFTSSCHQFRVRAP 347 Query: 1043 SLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQF-CFDSRDVRLVLKEIRAR 1219 SL Q RDE E +GTL+ L+ L+++H FF+ K M + + DVR V++ IR + Sbjct: 348 SLAQMHRDECEIDGTLATTLKTLQAIHHEFFNGHKGKSMKRRPPLELPDVRDVIRSIRGK 407 Query: 1220 ILKDCKVVFSRVFPTQFR-AETHHLWRLAEELGAICSTSLREDVTHVVSLDAGTDKSRWA 1396 +L C +VFSR+FPT + E H W+LA ELGA CST THVV+LD GTDK+RWA Sbjct: 408 LLSGCHIVFSRIFPTGLQNPEFHPFWQLAVELGARCSTVCDHTTTHVVALDRGTDKARWA 467 Query: 1397 VQNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQ--SSGFVESVSQLPL 1543 Q+ LVHP W+EA+S+LW+R E FPVT S S+ F +++S P+ Sbjct: 468 KQHGISLVHPRWVEAASYLWKRPREKDFPVTDDASALISTTFSKNISVEPI 518 >ref|XP_004962192.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Setaria italica] Length = 543 Score = 384 bits (985), Expect = e-103 Identities = 214/472 (45%), Positives = 289/472 (61%), Gaps = 14/472 (2%) Frame = +2 Query: 137 LIDFELDSISNA---------SENHQDDDDQEFESSPDIDFDSEDRPRIKRRKLYDNEES 289 L+D EL+ S A S + DD+ E E S +++ + ++ KRR++ E+S Sbjct: 23 LLDSELELASGADSAFPGDPSSASPDTDDEGEDEDSEEVEVELLEQNSAKRRRV--EEQS 80 Query: 290 EEQ----LPIKIEDKPESS-SARGCLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIH 454 ++Q P KI P + C HPGY G+C +CG KP EE+ S V YIH Sbjct: 81 QDQGTSIRPDKIATGPSKNVQVEVCPHPGYFGGLCFRCG--KPQDEEDA--SGVAFGYIH 136 Query: 455 KEFELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVDPEEESYISSTYLV 634 K L SEI RLR DLK++L +K NS + D+ E + Sbjct: 137 KGLRLGTSEIDRLRGADLKNLLRERKLVLILDLDHTLINSTKLQDISSAENE-------L 189 Query: 635 GQRNVESESGDIGRTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMA 814 G R + D R+++ L SM M TKLRPFV FL+EA+ M+EMYIYTMG++AYA ++A Sbjct: 190 GIRTAALKD-DPDRSIFSLDSMQMLTKLRPFVRNFLKEASNMFEMYIYTMGDKAYAIEIA 248 Query: 815 
KLLDPKGKFFADRIISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIVVER 994 KLLDP +F ++IS+SD T R QK LDV+LGAES VILDDTE VW KHK NLI++ER Sbjct: 249 KLLDPSNVYFPSKVISNSDCTQRHQKGLDVILGAESVAVILDDTEYVWQKHKENLILMER 308 Query: 995 YHFFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCF 1174 YH+F +SC QF G KSL++S +DE ES+G L+ +L VL+ +H FFD +S Sbjct: 309 YHYFASSCRQFGFGVKSLSESMQDERESDGALATVLDVLKRIHTIFFDTAVETALS---- 364 Query: 1175 DSRDVRLVLKEIRARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLREDVTH 1354 SRDVR V+K +R +L+ CK+VFSRVFP R + +W++AE LGA+CST + VTH Sbjct: 365 -SRDVRQVIKTVRKEVLEGCKLVFSRVFPNTSRPQEQMMWKMAEHLGAVCSTDVDSTVTH 423 Query: 1355 VVSLDAGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQSS 1510 VV++D GT+K+RWAV+N FLVHP WIEA++F W RQ E+ FPV +S+ Sbjct: 424 VVAVDLGTEKARWAVKNKKFLVHPRWIEAANFRWHRQPEEDFPVIPPKEKST 475 >ref|XP_002437361.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor] gi|241915584|gb|EER88728.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor] Length = 558 Score = 384 bits (985), Expect = e-103 Identities = 206/460 (44%), Positives = 287/460 (62%), Gaps = 4/460 (0%) Frame = +2 Query: 164 SNASENHQDDDDQEFESSPDIDFDSEDRPRIKRRKLYDNEESEEQLPIKIEDKPESSS-- 337 S+A DD+D++ + D + ++ ++ KRR++ + + ++ ++ + P +S Sbjct: 43 SSAFPAATDDEDEDEDEDEDPEVEAVEQNGTKRRRV-EEQLQDQGTSVRPDKIPTGASKN 101 Query: 338 --ARGCLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEIARLRKDDLK 511 C HPGY G+C +CG KP EEN S V YIHK L SEI RLR DLK Sbjct: 102 VQVEACPHPGYFGGLCFRCG--KPQDEENV--SGVAFGYIHKGLRLGTSEIDRLRGADLK 157 Query: 512 SVLSAKKXXXXXXXXXXXXNSARFIDVDPEEESYISSTYLVGQRNVESESGDIGRTLYKL 691 ++L +K NS + D+ E+ +G + S+ D R+++ L Sbjct: 158 NLLRERKLVLILDLDHTLINSTKLQDISSAEKD-------LGIQTAASKD-DPNRSIFSL 209 Query: 692 PSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADRIISSSD 871 SM M TKLRPFV +FL+EA+ M+EMYIYTMG++AYA ++AKLLDP +F ++IS+SD Sbjct: 210 DSMQMLTKLRPFVREFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPSNIYFPSKVISNSD 269 Query: 872 STNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHCGTKSLT 1051 T R QK LDV+LGAES VILDDTE VW KHK 
NLI++ERYHFF +SC QF G +SL+ Sbjct: 270 CTQRHQKGLDVILGAESVAVILDDTEYVWQKHKENLILMERYHFFASSCRQFGFGVRSLS 329 Query: 1052 QSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVLKEIRARILKD 1231 +S +DE ES+G L+ +L VL+ +H FFD ++S S+DVR V+K +R IL+ Sbjct: 330 ESMQDERESDGALATVLDVLKRIHSIFFDLAVETDLS-----SQDVRQVIKAVRKEILQG 384 Query: 1232 CKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLREDVTHVVSLDAGTDKSRWAVQNDC 1411 CK+VFSRVFP R + LW++AE LGA+CST + VTHVV++D GT+K+RW V N Sbjct: 385 CKIVFSRVFPNNTRPQEQMLWKMAEHLGAVCSTDVDSSVTHVVTVDLGTEKARWGVANKK 444 Query: 1412 FLVHPSWIEASSFLWQRQSEDRFPVTKKDSQSSGFVESVS 1531 FLVHP WIEA++F W RQ E+ FPVT +S +V+ Sbjct: 445 FLVHPRWIEAANFRWHRQPEEDFPVTAPKEKSRDIDNAVA 484 >ref|XP_004242583.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Solanum lycopersicum] Length = 472 Score = 380 bits (977), Expect = e-103 Identities = 217/472 (45%), Positives = 291/472 (61%), Gaps = 22/472 (4%) Frame = +2 Query: 140 IDFELDSISNAS------ENHQDDDDQEFESSPDIDFDSE---------DRPRIKRRKLY 274 +D ELDS S+ S EN + + + E E D D++ D R K+RK+ Sbjct: 21 LDAELDSASDVSPELDEVENGEAEVEVELEDEKGKDEDNDTGDGDDGNIDSRRSKKRKIE 80 Query: 275 DNEESEEQLPIKIEDKPESSSARG-------CLHPGYMWGVCIKCGQNKPDSEENEQQSR 433 E + + P + + ES+ G C HPG M G+CI+CGQ D +S Sbjct: 81 LIEAAVD--PQSLVSRGESAETSGASLALDVCTHPGVMGGMCIRCGQKVED------ESG 132 Query: 434 VCLKYIHKEFELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVDPEEESY 613 V YIHK L+D E+ARLR+ DLK++L +K NS R D+ EE Sbjct: 133 VAFGYIHKNLRLADDEVARLREKDLKNLLRHRKLILVLDLDHTLLNSTRLADISAEE--- 189 Query: 614 ISSTYLVGQRNVESESGDIGRTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGER 793 +YL QR V ++ + L+KL +HM TKLRPFVH FL+EA+ ++EMYIYTMGER Sbjct: 190 ---SYLKDQREVLPDA--LRSNLFKLDWIHMMTKLRPFVHTFLKEASSLFEMYIYTMGER 244 Query: 794 AYAEKMAKLLDPKGKFFADRIISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKS 973 YA +MAKLLDP G +F R+I+ SDST R QK LDVVLG ESAV+ILDDTE VW KH+ Sbjct: 245 PYALEMAKLLDPGGIYFHSRVIAQSDSTRRHQKGLDVVLGQESAVLILDDTEVVWGKHRE 304 Query: 974 NLIVVERYHFFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNK 1153 NLI+++RYHFF +SC QF 
KSL++ K DE E+EG L+ +L+VL+ +H+ FFD Sbjct: 305 NLILMDRYHFFTSSCRQFGLKCKSLSEQKSDENEAEGALASVLEVLQRIHRLFFDPERGD 364 Query: 1154 EMSQFCFDSRDVRLVLKEIRARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTS 1333 + + RDVR VLK +R ILK CK+VF+ V P Q + E H+ W+LAE+LGA ST Sbjct: 365 NIME-----RDVRQVLKTVRKEILKGCKIVFTGVIPIQCQPENHYYWKLAEKLGATFSTE 419 Query: 1334 LREDVTHVVSLDAGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVT 1489 + E VTHVVS++ T+KSR AV+ FLVHP WIEA+++LW++ E+ FPV+ Sbjct: 420 VDESVTHVVSMNDKTEKSRQAVREKKFLVHPRWIEAANYLWRKPPEENFPVS 471 >ref|XP_006343662.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Solanum tuberosum] Length = 478 Score = 380 bits (976), Expect = e-102 Identities = 217/475 (45%), Positives = 287/475 (60%), Gaps = 25/475 (5%) Frame = +2 Query: 140 IDFELDSISNAS------ENHQDDDDQEFESSP--------------DIDFDSEDRPRIK 259 +D ELDS S+ S EN + + ++E E D D S D R K Sbjct: 22 LDAELDSASDVSPELDEVENGEAEGEEEVEDEKGQDEGNDTGDGDDDDDDDGSIDSSRSK 81 Query: 260 RRKLYDNEESEEQLPIKIEDKPESSSARG-----CLHPGYMWGVCIKCGQNKPDSEENEQ 424 +RK+ E + + +P +S C HPG M G+CI+CGQ D Sbjct: 82 KRKIELIEAAVDPQSSVSRGEPAETSGASLALDVCTHPGVMGGMCIRCGQKVED------ 135 Query: 425 QSRVCLKYIHKEFELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVDPEE 604 +S V YIHK L+D E+ARLR DLK++L KK NS R D+ EE Sbjct: 136 ESGVAFGYIHKNLRLADDEVARLRDKDLKNLLRHKKLILVLDLDHTLLNSTRLADISAEE 195 Query: 605 ESYISSTYLVGQRNVESESGDIGRTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTM 784 +YL QR V ++ + L+KL +HM TKLRPFVH FL+EA+ ++EMYIYTM Sbjct: 196 ------SYLKDQREVLPDA--LRNNLFKLDWIHMMTKLRPFVHTFLKEASSLFEMYIYTM 247 Query: 785 GERAYAEKMAKLLDPKGKFFADRIISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPK 964 GER YA +MA LLDP G +F R+I+ SDST R QK LDVVLG ESAV+ILDDTE VW K Sbjct: 248 GERPYALEMASLLDPGGIYFHSRVIAQSDSTRRHQKGLDVVLGQESAVLILDDTEVVWGK 307 Query: 965 HKSNLIVVERYHFFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNI 1144 H+ NLI+++RYHFF +SC QF KSL++ K DE E+EG L+ +L+VL+ +H+ FFD Sbjct: 308 HRENLILMDRYHFFTSSCRQFGLKCKSLSEQKSDENEAEGALASVLEVLQRIHRLFFDLE 367 Query: 1145 
SNKEMSQFCFDSRDVRLVLKEIRARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAIC 1324 + + RDVR VLK +R ILK CK+VF+ V P Q + E HH W+LAE+LGA Sbjct: 368 RGDNIME-----RDVRQVLKTVRKEILKGCKIVFTGVIPIQCQPENHHYWKLAEKLGATF 422 Query: 1325 STSLREDVTHVVSLDAGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVT 1489 ST + E VTHVVS++ T+KSR A++ FLVHPSWIEA+++LW++ E+ FPV+ Sbjct: 423 STEVDESVTHVVSMNDKTEKSRQALREKKFLVHPSWIEAANYLWRKPPEENFPVS 477 >ref|XP_004242582.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Solanum lycopersicum] Length = 512 Score = 379 bits (974), Expect = e-102 Identities = 214/460 (46%), Positives = 287/460 (62%), Gaps = 10/460 (2%) Frame = +2 Query: 140 IDFELDSISNASE----NHQDDDDQEFESSPDIDFD-SEDRPRIKRRKLYDNEESEEQLP 304 +D ELDS S+ E + +++ E E + D D S D R K+RK+ E + + Sbjct: 71 LDAELDSASDVDEVESGEAEGEEEVEDEDNDTGDGDGSIDSSRSKKRKIELIEGAVDPQS 130 Query: 305 IKIEDKPESSSARG-----CLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFEL 469 +P +S C HPG M G+CI+CGQ D +S V YIHK L Sbjct: 131 SVSRGEPAETSGASMALDVCTHPGVMGGMCIRCGQKVED------ESGVAFGYIHKNLRL 184 Query: 470 SDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVDPEEESYISSTYLVGQRNV 649 +D E+ARLR+ DLK++L +K NS R D+ EE +YL QR V Sbjct: 185 ADDEVARLREKDLKNLLRHRKLILVLDLDHTLLNSTRLADISAEE------SYLKDQREV 238 Query: 650 ESESGDIGRTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDP 829 ++ + L+KL +HM TKLRPFVH FL+EA+ ++EMYIYTMGER YA +MAKLLDP Sbjct: 239 LPDA--LRSNLFKLDWIHMMTKLRPFVHTFLKEASSLFEMYIYTMGERPYALEMAKLLDP 296 Query: 830 KGKFFADRIISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIVVERYHFFH 1009 G +F R+I+ SDST R QK LDVVLG ESAV+ILDDTE VW KH+ NLI+++RYHFF Sbjct: 297 GGIYFHSRVIAQSDSTRRHQKGLDVVLGQESAVLILDDTEVVWGKHRENLILMDRYHFFT 356 Query: 1010 ASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDV 1189 +SC QF KSL++ K DE E+EG L+ +L+VL+ +H+ FFD + + RDV Sbjct: 357 SSCRQFGLKCKSLSEQKSDENEAEGALASVLEVLQRIHRLFFDPERGDNIME-----RDV 411 Query: 1190 RLVLKEIRARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLREDVTHVVSLD 1369 R VLK +R ILK CK+VF+ V P Q + E H+ W+LAE+LGA ST + E VTHVVS++ Sbjct: 412 
RQVLKTVRKEILKGCKIVFTGVIPIQCQPENHYYWKLAEKLGATFSTEVDESVTHVVSMN 471 Query: 1370 AGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVT 1489 T+KSR AV+ FLVHP WIEA+++LW++ E+ FPV+ Sbjct: 472 DKTEKSRQAVREKKFLVHPRWIEAANYLWRKPPEENFPVS 511 >dbj|BAB08870.1| unnamed protein product [Arabidopsis thaliana] Length = 1065 Score = 377 bits (969), Expect = e-102 Identities = 202/455 (44%), Positives = 280/455 (61%) Frame = +2 Query: 140 IDFELDSISNASENHQDDDDQEFESSPDIDFDSEDRPRIKRRKLYDNEESEEQLPIKIED 319 +D ELDS S+AS ++++ E + +KR+KL EE+ Sbjct: 647 LDAELDSASDASSGPSEEEEAE----------DDVESGLKRQKLEHLEEA---------- 686 Query: 320 KPESSSARGCLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEIARLRK 499 SSS C HPG +C CGQ +++ V +YIHKE L++ EI+RLR Sbjct: 687 ---SSSKGECEHPGSFGNMCFVCGQKL-------EETGVSFRYIHKEMRLNEDEISRLRD 736 Query: 500 DDLKSVLSAKKXXXXXXXXXXXXNSARFIDVDPEEESYISSTYLVGQRNVESESGDIGRT 679 D + + +K N+ D+ PEEE S T+ +++ G + Sbjct: 737 SDSRFLQRQRKLYLVLDLDHTLLNTTILRDLKPEEEYLKSHTH-----SLQDGCNVSGGS 791 Query: 680 LYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADRII 859 L+ L M M TKLRPFVH FL+EA++M+ MYIYTMG+R YA +MAKLLDPKG++F DR+I Sbjct: 792 LFLLEFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGDRVI 851 Query: 860 SSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHCGT 1039 S D T R +K LDVVLG ESAV+ILDDTE WPKHK NLIV+ERYHFF +SC QF Sbjct: 852 SRDDGTVRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQFDHRY 911 Query: 1040 KSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVLKEIRAR 1219 KSL++ K DE+E +G L+ +L+VL+ H FF+N+ +RDVRL+LK++R Sbjct: 912 KSLSELKSDESEPDGALATVLKVLKQAHALFFENVDEG------ISNRDVRLMLKQVRKE 965 Query: 1220 ILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLREDVTHVVSLDAGTDKSRWAV 1399 ILK CK+VFSRVFPT+ + E H LW++AEELGA C+T + VTHVV++D GT+K+RWAV Sbjct: 966 ILKGCKIVFSRVFPTKAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEKARWAV 1025 Query: 1400 QNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQ 1504 + ++VH WI+A+++LW +Q E+ F + + Q Sbjct: 1026 REKKYVVHRGWIDAANYLWMKQPEENFGLEQLKKQ 1060 >ref|NP_001078764.1| C-terminal domain phosphatase-like 4 
[Arabidopsis thaliana] gi|122154038|sp|Q00IB6.1|CPL4_ARATH RecName: Full=RNA polymerase II C-terminal domain phosphatase-like 4; Short=FCP-like 4; AltName: Full=Carboxyl-terminal phosphatase-like 4; Short=AtCPL4; Short=CTD phosphatase-like 4 gi|95115186|gb|ABF55959.1| carboxyl-terminal phosphatase-like 4 [Arabidopsis thaliana] gi|332009601|gb|AED96984.1| C-terminal domain phosphatase-like 4 [Arabidopsis thaliana] Length = 440 Score = 377 bits (969), Expect = e-102 Identities = 202/455 (44%), Positives = 280/455 (61%) Frame = +2 Query: 140 IDFELDSISNASENHQDDDDQEFESSPDIDFDSEDRPRIKRRKLYDNEESEEQLPIKIED 319 +D ELDS S+AS ++++ E + +KR+KL EE+ Sbjct: 22 LDAELDSASDASSGPSEEEEAE----------DDVESGLKRQKLEHLEEA---------- 61 Query: 320 KPESSSARGCLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEIARLRK 499 SSS C HPG +C CGQ +++ V +YIHKE L++ EI+RLR Sbjct: 62 ---SSSKGECEHPGSFGNMCFVCGQKL-------EETGVSFRYIHKEMRLNEDEISRLRD 111 Query: 500 DDLKSVLSAKKXXXXXXXXXXXXNSARFIDVDPEEESYISSTYLVGQRNVESESGDIGRT 679 D + + +K N+ D+ PEEE S T+ +++ G + Sbjct: 112 SDSRFLQRQRKLYLVLDLDHTLLNTTILRDLKPEEEYLKSHTH-----SLQDGCNVSGGS 166 Query: 680 LYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADRII 859 L+ L M M TKLRPFVH FL+EA++M+ MYIYTMG+R YA +MAKLLDPKG++F DR+I Sbjct: 167 LFLLEFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGDRVI 226 Query: 860 SSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHCGT 1039 S D T R +K LDVVLG ESAV+ILDDTE WPKHK NLIV+ERYHFF +SC QF Sbjct: 227 SRDDGTVRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQFDHRY 286 Query: 1040 KSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVLKEIRAR 1219 KSL++ K DE+E +G L+ +L+VL+ H FF+N+ +RDVRL+LK++R Sbjct: 287 KSLSELKSDESEPDGALATVLKVLKQAHALFFENVDEG------ISNRDVRLMLKQVRKE 340 Query: 1220 ILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLREDVTHVVSLDAGTDKSRWAV 1399 ILK CK+VFSRVFPT+ + E H LW++AEELGA C+T + VTHVV++D GT+K+RWAV Sbjct: 341 ILKGCKIVFSRVFPTKAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEKARWAV 400 Query: 1400 QNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQ 1504 + ++VH WI+A+++LW 
+Q E+ F + + Q Sbjct: 401 REKKYVVHRGWIDAANYLWMKQPEENFGLEQLKKQ 435 >ref|XP_004287124.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Fragaria vesca subsp. vesca] Length = 464 Score = 376 bits (966), Expect = e-101 Identities = 210/462 (45%), Positives = 288/462 (62%), Gaps = 11/462 (2%) Frame = +2 Query: 140 IDFELDSISNASENHQD------DDDQEFESSPDIDFDSEDRPRIKRRKLYDNEESEEQL 301 ++ EL+S S+ S ++ DDD ESS D E R+KRRK+ + E EE Sbjct: 20 LETELESGSSESSPDEECKAAVGDDDGGSESS-----DVESESRVKRRKVENVEILEEAN 74 Query: 302 PIKIEDKPES-SSARG----CLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFE 466 + + E S A G C HPG +C CGQ + QS V YIHK Sbjct: 75 ALTSQAVSEEISEASGVDDLCAHPGSFGDMCFLCGQRLIE------QSGVTFGYIHKGLR 128 Query: 467 LSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVDPEEESYISSTYLVGQRN 646 L+D EI RLR D+K L+ KK N+ V +EE + Sbjct: 129 LNDGEIDRLRNTDIKKSLNNKKLYLVLDLDHTLLNTTLLNHVTAKEEYLMCPP------- 181 Query: 647 VESESGDIGRTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLD 826 +S + +L++L M M TKLRPF+ FL+EA++++EMYIYTMG+RAYA +MAKLLD Sbjct: 182 -DSLPDVLKDSLFRLDFMRMMTKLRPFIRTFLKEASEIFEMYIYTMGDRAYALEMAKLLD 240 Query: 827 PKGKFFADRIISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIVVERYHFF 1006 PK ++F DR+IS D T R QK LD+VLG ESAV+ILDDTE W KHK NLI++ERYHFF Sbjct: 241 PKKEYFGDRVISRDDGTQRHQKGLDIVLGQESAVLILDDTENAWIKHKDNLILMERYHFF 300 Query: 1007 HASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRD 1186 +SC QF +SL++ K DE+E EG L+ +L +L+ +H+ FF ++ + RD Sbjct: 301 RSSCAQFGFTCESLSELKSDESEPEGALANVLDLLKRIHKMFFYDLGGNLV------DRD 354 Query: 1187 VRLVLKEIRARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLREDVTHVVSL 1366 VR VLK +R +L CKVVFSR+ P++ A +HHLW++AE+LGAICST + VTHVV+L Sbjct: 355 VRQVLKIVRKEVLNGCKVVFSRIIPSKVLASSHHLWKMAEQLGAICSTEVDSTVTHVVAL 414 Query: 1367 DAGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVTK 1492 DAGT+KSRWAV+++ FLVHP W+EA++++WQ+Q+E++FPVT+ Sbjct: 415 DAGTEKSRWAVKHNKFLVHPRWLEAANYMWQKQAEEKFPVTE 456 >ref|XP_006486243.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like isoform X1 [Citrus 
sinensis] gi|568865772|ref|XP_006486244.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like isoform X2 [Citrus sinensis] gi|568865774|ref|XP_006486245.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like isoform X3 [Citrus sinensis] Length = 478 Score = 375 bits (964), Expect = e-101 Identities = 213/470 (45%), Positives = 287/470 (61%), Gaps = 13/470 (2%) Frame = +2 Query: 140 IDFELDSIS-------NASENHQDDDDQEFESSPDIDFDSEDRPRIKRRKLYDNEESEEQ 298 +D ELDS S A + +D++ + E+ + D +D RIKRRK E +E+ Sbjct: 21 LDAELDSNSLGSSPEKEAEDKDEDEESIDEEAENEEARDDKDLERIKRRKTQIVETIQER 80 Query: 299 ----LPIKIEDKPESS-SARGCLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEF 463 L +E+K E S C HPG + G+C +CG+ E++S V YI K Sbjct: 81 PGPTLLGNLEEKTEVSLEMDNCPHPGSLGGMCYRCGKRL------EEESGVTFSYICKGL 134 Query: 464 ELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVDPEEESYISSTYLVGQR 643 L + EI RLR D+K +L +K NS + + PEE+ YL Q Sbjct: 135 RLGNDEIDRLRNTDMKHLLRHRKLYLILDLDHTLLNSTLLLHLTPEED------YLKSQA 188 Query: 644 NVESESGDIGR-TLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKL 820 + D+ + +L+ L M+M TKLRPFVH FL+EA++M+EMYIYTMG+R YA +MAKL Sbjct: 189 D---SLQDVSKGSLFMLAFMNMMTKLRPFVHTFLKEASEMFEMYIYTMGDRPYALEMAKL 245 Query: 821 LDPKGKFFADRIISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIVVERYH 1000 LDP ++F R+IS D T R QK LDVVLG ESAV+ILDDTE W KH+ NLI++ERYH Sbjct: 246 LDPSREYFNARVISRDDGTQRHQKGLDVVLGQESAVLILDDTENAWTKHRDNLILMERYH 305 Query: 1001 FFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDS 1180 FF +SC QF +SL+Q + DE+E EG L+ +L+VL+ +H FFD ++N Sbjct: 306 FFASSCRQFGYHCQSLSQLRSDESELEGALASVLKVLKRIHNIFFDELAND------LAG 359 Query: 1181 RDVRLVLKEIRARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLREDVTHVV 1360 RDVR VLK +R +LK CK+VFS VFPT+F A+TH+LW++AE+LGA C L VTHVV Sbjct: 360 RDVRQVLKMVRGEVLKGCKLVFSHVFPTKFPADTHYLWKMAEQLGATCLIELDPSVTHVV 419 Query: 1361 SLDAGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQSS 1510 S DA T+KSRWA + FLV P WIE ++FLWQRQ E+ FPV + + + Sbjct: 420 STDARTEKSRWAAKEAKFLVDPRWIETANFLWQRQPEENFPVKQNKPEEN 469 
>ref|XP_002965594.1| hypothetical protein SELMODRAFT_167775 [Selaginella moellendorffii] gi|300166408|gb|EFJ33014.1| hypothetical protein SELMODRAFT_167775 [Selaginella moellendorffii] Length = 411 Score = 374 bits (961), Expect = e-101 Identities = 193/396 (48%), Positives = 266/396 (67%), Gaps = 12/396 (3%) Frame = +2 Query: 365 MWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEIARLRKDDLKSVLSAKKXXXX 544 MWGVCI+CG KP+SE S V LKYIH+EFEL+ +AR+R+D+L+ VL +K Sbjct: 1 MWGVCIRCGVLKPNSEPGGSASNVALKYIHEEFELAGDVLARVREDELRQVLGKRKLFLV 60 Query: 545 XXXXXXXXNSARFIDVDPEEESYISSTYL-VGQRNVESESGDI----------GRTLYKL 691 NSAR+++V P+E +Y+ TY+ V + + + S G L+++ Sbjct: 61 LDLDHTLLNSARWMEVFPDETAYLEHTYMNVPEDKIPALSNGAPAVAGVIQPGGGGLHRI 120 Query: 692 PSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADRIISSSD 871 M +WTKLRPF HKFLEEA++++EMY+YTMGER YA MA LLDP GKFF R+IS D Sbjct: 121 HGMQLWTKLRPFAHKFLEEASKLFEMYVYTMGERMYAVTMAHLLDPTGKFFKGRVISQRD 180 Query: 872 STNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHCGTKSLT 1051 ST R+ KDLD+VLGA+SAV+ILDDTE VWPKH++NLIV+ERYHFF +SC QF SLT Sbjct: 181 STCRQTKDLDIVLGADSAVLILDDTEAVWPKHRANLIVMERYHFFQSSCRQFGLENPSLT 240 Query: 1052 QSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVLKEIRARILKD 1231 +++RDE++ EG L+ +L+VL+ +H FF + S++ D RD+ V +R+ IL Sbjct: 241 KAERDESKDEGALANVLKVLQRIHSDFF---MESDDSRYTCDVRDITSV---VRSEILSG 294 Query: 1232 CKVVFSRVFPTQ-FRAETHHLWRLAEELGAICSTSLREDVTHVVSLDAGTDKSRWAVQND 1408 CK+VFSR+FPT E LWRL +LGA C + + VTHVV+LD TDK++WA ++ Sbjct: 295 CKLVFSRIFPTDCLEPELTPLWRLCVDLGAECVLAHDDSVTHVVALDRFTDKAKWAKEHR 354 Query: 1409 CFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQSSGF 1516 FLVHP+W+EA+ LW+R +E FPV + +++ F Sbjct: 355 KFLVHPAWVEAAHSLWRRPNELEFPVREGQTRAPVF 390 >ref|XP_002324546.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa] gi|550318537|gb|EEF03111.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa] Length = 468 Score = 373 bits (958), Expect = e-100 Identities = 218/467 (46%), Positives = 288/467 (61%), Gaps = 20/467 (4%) Frame = +2 Query: 140 
IDFELDSISNASENHQDD------DDQEFESSPDIDFDSED-------RPRIKRRKLYDN 280 +D ELDS S+AS D+ D SSPD D ++E+ R R+KR K+ Sbjct: 21 LDTELDSKSSASSASDDEAPNQRHSDSAASSSPDQDKEAEEDDDSDFQRKRVKRSKVETV 80 Query: 281 EESEEQLPI----KIEDKPESSSARG-CLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLK 445 E E+ ++ E+S ++ C HPG +CI CGQ + +S V Sbjct: 81 EIVEDDGGTTSFASLKHNSEASISKEICTHPGSFGTMCIVCGQLL------DGESGVTFG 134 Query: 446 YIHKEFELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVDPEEESYISST 625 YIHK L + EI RLR D+K++L KK NS + + + +EE Sbjct: 135 YIHKGLRLGNDEIVRLRNTDMKNLLRHKKLYLILDLDHTLLNSTQLMHMTLDEE------ 188 Query: 626 YLVGQRNVESESGDIGR-TLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYA 802 YL GQ + D+ + +L+ L SM M TKLRPFV FL+EA+QM+EMYIYTMG+RAYA Sbjct: 189 YLNGQTD---SLQDVSKGSLFMLSSMQMMTKLRPFVRTFLKEASQMFEMYIYTMGDRAYA 245 Query: 803 EKMAKLLDPKGKFFADRIISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLI 982 +MAKLLDP ++F ++IS D T R QK LDVVLG ESAV+ILDDTE W KHK NLI Sbjct: 246 LEMAKLLDPGREYFNAKVISRDDGTQRHQKGLDVVLGQESAVLILDDTENAWMKHKDNLI 305 Query: 983 VVERYHFFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFF-DNISNKEM 1159 ++ERYHFF +SC QF KSL++ K DE+ESEG L+ +L+VLR +HQ FF D+I + + Sbjct: 306 LMERYHFFASSCHQFGFNCKSLSEQKTDESESEGALASILKVLRKIHQIFFEDHILSLAL 365 Query: 1160 SQFCFDSRDVRLVLKEIRARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLR 1339 VLK +R +LK CK+VFSRVFPTQ +A+ HHLWR+AE+LGA CST L Sbjct: 366 Q-----------VLKTVRKDVLKGCKIVFSRVFPTQSQADNHHLWRMAEQLGATCSTELD 414 Query: 1340 EDVTHVVSLDAGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRF 1480 VTHVVS D+GT+KS WA++++ FLV P WIEA+++ WQRQ E+ F Sbjct: 415 PSVTHVVSKDSGTEKSHWALKHNKFLVQPGWIEAANYFWQRQPEENF 461 >ref|XP_002864543.1| hypothetical protein ARALYDRAFT_332090 [Arabidopsis lyrata subsp. lyrata] gi|297310378|gb|EFH40802.1| hypothetical protein ARALYDRAFT_332090 [Arabidopsis lyrata subsp. 
lyrata] Length = 1006 Score = 372 bits (955), Expect = e-100 Identities = 204/457 (44%), Positives = 287/457 (62%), Gaps = 2/457 (0%) Frame = +2 Query: 140 IDFELDSISNASENHQDDDDQEFESSPDIDFDSEDRPRIKRRKLYDNEESEEQLPIKIED 319 +D ELDS S+AS +++++ + ++ +KRRKL E +E E+ Sbjct: 583 LDAELDSASDASSGPSEEEEEA---------EDDEESGLKRRKLEHLETVDE------EE 627 Query: 320 KPESSSARG-CLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEIARLR 496 E+SS++G C HPG +C CGQ +++ V +YIHKE L++ EI+RLR Sbjct: 628 IEEASSSKGECQHPGSFGNMCFVCGQKL-------EETGVSFRYIHKEMRLNEDEISRLR 680 Query: 497 KDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVDPEEESYISSTYLVGQRNVESESGDI-G 673 D + + +K NS D+ PEEE S T+ + + D+ G Sbjct: 681 DSDSRFLQRQRKLYLVLDLDHTLLNSTVLRDLKPEEEYLKSHTHSLQEPFDFLLISDVSG 740 Query: 674 RTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADR 853 +L+ L MHM TKLRPFVH FL+EA++M+ MYIYTMG+RAYA +MAKLLDP+G++F DR Sbjct: 741 GSLFMLEFMHMMTKLRPFVHSFLKEASEMFVMYIYTMGDRAYARQMAKLLDPRGEYFGDR 800 Query: 854 IISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHC 1033 IIS D T R QK LDVVLG ESAV+ILDDTE WP HK NLIV+ERYHFF +SC QF Sbjct: 801 IISRDDGTVRHQKSLDVVLGQESAVLILDDTENAWPNHKDNLIVIERYHFFASSCRQFDH 860 Query: 1034 GTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVLKEIR 1213 KSL++ K DE+E +G L+ +L ++ ++ISN RDVR +LK++R Sbjct: 861 KYKSLSELKSDESEPDGALATVL-------KNVDEDISN----------RDVRSMLKQVR 903 Query: 1214 ARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLREDVTHVVSLDAGTDKSRW 1393 +LK CKVVFSRVFPT+ + E H LW++AEELGA C+T + VTHVV++D GT+K+RW Sbjct: 904 KEVLKGCKVVFSRVFPTKAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEKARW 963 Query: 1394 AVQNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQ 1504 AV+ ++VH WI+A+++LW++Q E++F + + Q Sbjct: 964 AVREKKYVVHRGWIDAANYLWKKQPEEKFSLEQLKKQ 1000 >gb|EOY32065.1| RNA polymerase II ctd phosphatase, putative isoform 2 [Theobroma cacao] Length = 357 Score = 369 bits (948), Expect = 2e-99 Identities = 195/378 (51%), Positives = 251/378 (66%), Gaps = 1/378 (0%) Frame = +2 Query: 374 VCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEIARLRKDDLKSVLSAKKXXXXXXX 553 +CI CGQ D 
+S V YIHK L + EI RLR D+K++L KK Sbjct: 1 MCILCGQRLDD------ESGVTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHKKLYLVLDL 54 Query: 554 XXXXXNSARFIDVDPEEESYISSTYLVGQRNVESESGDIGR-TLYKLPSMHMWTKLRPFV 730 NS + + + P+EE YL GQ + D+ R +L+ L MHM TKLRPFV Sbjct: 55 DHTLLNSTQLMHLTPDEE------YLKGQSD---SLQDVSRGSLFMLDFMHMMTKLRPFV 105 Query: 731 HKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADRIISSSDSTNRRQKDLDVVL 910 FL+EA++M+EMYIYTMG+R YA +MAKLLDP+ ++F+DR+IS D T + QK LDVVL Sbjct: 106 RTFLKEASEMFEMYIYTMGDRPYALEMAKLLDPRREYFSDRVISRDDGTQKHQKGLDVVL 165 Query: 911 GAESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHCGTKSLTQSKRDEAESEGTL 1090 G ESAVVILDDTE W KHK NLI++ERYH+F +SC QF KSL+Q K DE+E +G L Sbjct: 166 GQESAVVILDDTENAWMKHKDNLILMERYHYFASSCHQFGYKCKSLSQLKSDESEPDGAL 225 Query: 1091 SILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVLKEIRARILKDCKVVFSRVFPTQF 1270 + +L+ LR +H FFD + SRDVR VLK ++ +LK CK+VFS VFPT F Sbjct: 226 ASVLKALRQIHHMFFDELDCN------LASRDVRQVLKTVQEEVLKGCKIVFSHVFPTNF 279 Query: 1271 RAETHHLWRLAEELGAICSTSLREDVTHVVSLDAGTDKSRWAVQNDCFLVHPSWIEASSF 1450 AE+H LW++AE+LGA CST VTHVVS DAGT+KSRWAV+ FLVHP WIEA+++ Sbjct: 280 PAESHPLWKMAEQLGATCSTETDLSVTHVVSTDAGTEKSRWAVKEKKFLVHPRWIEATNY 339 Query: 1451 LWQRQSEDRFPVTKKDSQ 1504 LWQ+Q E+ FPV++ +Q Sbjct: 340 LWQKQPEENFPVSQGKNQ 357