BLASTX nr result
ID: Ephedra26_contig00010648
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Ephedra26_contig00010648
         (1875 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_006838087.1| hypothetical protein AMTR_s00106p00017820 [A...  423   e-115
gb|EOY32064.1| RNA polymerase II ctd phosphatase, putative isofo...  393   e-106
ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative ...  391   e-106
ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal doma...  389   e-105
ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Popu...  389   e-105
ref|XP_001781984.1| predicted protein [Physcomitrella patens] gi...  386   e-104
gb|EMJ05544.1| hypothetical protein PRUPE_ppa005647mg [Prunus pe...  385   e-104
ref|XP_004962192.1| PREDICTED: RNA polymerase II C-terminal doma...  384   e-103
ref|XP_002437361.1| hypothetical protein SORBIDRAFT_10g025580 [S...  384   e-103
ref|XP_004242583.1| PREDICTED: RNA polymerase II C-terminal doma...  381   e-103
ref|XP_006343662.1| PREDICTED: RNA polymerase II C-terminal doma...  381   e-103
ref|XP_004242582.1| PREDICTED: RNA polymerase II C-terminal doma...  380   e-102
dbj|BAB08870.1| unnamed protein product [Arabidopsis thaliana]       377   e-101
ref|NP_001078764.1| C-terminal domain phosphatase-like 4 [Arabid...  377   e-101
ref|XP_004287124.1| PREDICTED: RNA polymerase II C-terminal doma...  375   e-101
ref|XP_002965594.1| hypothetical protein SELMODRAFT_167775 [Sela...  375   e-101
ref|XP_006486243.1| PREDICTED: RNA polymerase II C-terminal doma...  375   e-101
ref|XP_002324546.2| hypothetical protein POPTR_0018s11760g [Popu...  375   e-101
ref|XP_002864543.1| hypothetical protein ARALYDRAFT_332090 [Arab...  371   e-100
ref|XP_006654357.1| PREDICTED: RNA polymerase II C-terminal doma...  369   3e-99

>ref|XP_006838087.1| hypothetical protein AMTR_s00106p00017820 [Amborella trichopoda] gi|548840545|gb|ERN00656.1| hypothetical protein AMTR_s00106p00017820 [Amborella trichopoda] Length = 486 Score = 423 bits (1087), Expect = e-115 Identities = 222/472 (47%), Positives = 309/472 (65%), Gaps = 18/472 (3%) Frame = +3 Query: 141 ELDSISNAS-ENHQDDDDQEFESSPD-----------IDFDSEDRPRIKRRKLYDNEESE 284 E D +SN+S E+ + D+D E + + D++ + RIKR K+ ++EE + Sbjct: 22 ETDLLSNSSGESPERDEDILDEITSESVSERSAVWDSTDYEEIELERIKRPKICEDEEIK 81 Query: 285 EQLPI-----KIEDKPESSSAQGCL-HPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIH 446 ++++ ES+S + C HPG+ +CI+CG+ K D +++ V YIH Sbjct: 82 ESQSSNANQGELDNFKESTSEKVCPPHPGFYKDMCIRCGEQKDDETVARKETAVAFNYIH 141 Query: 447 KEFELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVHPEEESYISSTYLV 626 K+ +L E+ARLR DLK++ +K NS R +DV PEEE+Y+++TYL Sbjct: 142 KDLKLGAEEVARLRATDLKNLYRRRKLYLVLDLDHTLLNSTRLVDVSPEEEAYLNATYL- 200 Query: 627 GQRNVESESGDIGRTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMA 806 + S +GD TL+KL +HM TKLRPFV FL+EAN M+EMY+YTMGERAYA +MA Sbjct: 201 -NKETSSSNGDTSGTLFKLEPLHMLTKLRPFVRTFLKEANTMFEMYVYTMGERAYALEMA 259 Query: 807 KLLDPKGKFFADRIISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIVVER 986 KLLDP G +F R+IS DST R QK LDVVLG+E AVVILDDTE VW KHK NL+++ER Sbjct: 260 KLLDPSGVYFGSRVISQGDSTVRHQKGLDVVLGSECAVVILDDTEHVWHKHKENLVLMER 319 Query: 987 YHFFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCF 1166 YHFF +SC QF+ KSL++ KRDE+ES+G L+ +L VL+ +HQ F+ + + F Sbjct: 320 YHFFSSSCRQFNVHYKSLSELKRDESESDGMLASILNVLKHIHQMFY-----YQEVETDF 374 Query: 1167 DSRDVRLVLKEIRARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLREDVTH 1346 + DVR VLK I++ +LK C++VFSR+FPT + E LWR+AE+LGA CS L E VTH Sbjct: 375 NGSDVRKVLKTIQSEVLKGCRLVFSRIFPTNYPVENQTLWRIAEQLGASCSKELDEAVTH 434 Query: 1347 VVSLDSGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQSS 1502 VVSLD GT+K+RWA+Q LV+P W+EA+++ W+RQ ED+FP+ K+ S Sbjct: 435
VVSLDLGTEKARWAIQRKKHLVNPGWLEATNYFWKRQPEDQFPIPSKNGGGS 486 >gb|EOY32064.1| RNA polymerase II ctd phosphatase, putative isoform 1 [Theobroma cacao] Length = 469 Score = 393 bits (1010), Expect = e-106 Identities = 222/461 (48%), Positives = 281/461 (60%), Gaps = 7/461 (1%) Frame = +3 Query: 135 DFELDSISNASENHQDDDDQEFESSPDIDFDSEDRPRIKRRKLYDNEESEEQLPIK-IED 311 D E D +N N DDD D DS+ R K KL D EES IED Sbjct: 38 DVEADGDNNNDNNDDHDDDD--------DLDSQRNKRCKTEKLEDLEESRGSTSQGLIED 89 Query: 312 K-----PESSSAQGCLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEI 476 K S C HPG +CI CGQ D +S V YIHK L + EI Sbjct: 90 KIVIHAELSLKKDICTHPGSFGQMCILCGQRLDD------ESGVTFGYIHKGLRLGNDEI 143 Query: 477 ARLRKDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVHPEEESYISSTYLVGQRNVESESG 656 RLR D+K++L KK NS + + + P+EE YL GQ + Sbjct: 144 VRLRSTDMKNLLRHKKLYLVLDLDHTLLNSTQLMHLTPDEE------YLKGQSD---SLQ 194 Query: 657 DIGR-TLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKF 833 D+ R +L+ L MHM TKLRPFV FL+EA++M+EMYIYTMG+R YA +MAKLLDP+ ++ Sbjct: 195 DVSRGSLFMLDFMHMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEMAKLLDPRREY 254 Query: 834 FADRIISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCL 1013 F+DR+IS D T + QK LDVVLG ESAVVILDDTE W KHK NLI++ERYH+F +SC Sbjct: 255 FSDRVISRDDGTQKHQKGLDVVLGQESAVVILDDTENAWMKHKDNLILMERYHYFASSCH 314 Query: 1014 QFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVL 1193 QF KSL+Q K DE+E +G L+ +L+ LR +H FFD + SRDVR VL Sbjct: 315 QFGYKCKSLSQLKSDESEPDGALASVLKALRQIHHMFFDELDCN------LASRDVRQVL 368 Query: 1194 KEIRARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLREDVTHVVSLDSGTD 1373 K ++ +LK CK+VFS VFPT F AE+H LW++AE+LGA CST VTHVVS D+GT+ Sbjct: 369 KTVQEEVLKGCKIVFSHVFPTNFPAESHPLWKMAEQLGATCSTETDLSVTHVVSTDAGTE 428 Query: 1374 KSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQ 1496 KSRWAV+ FLVHP WIEA+++LWQ+Q E+ FPV++ +Q Sbjct: 429 KSRWAVKEKKFLVHPRWIEATNYLWQKQPEENFPVSQGKNQ 469 >ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis] gi|223534449|gb|EEF36151.1| RNA polymerase II ctd phosphatase, putative [Ricinus 
communis] Length = 478 Score = 391 bits (1005), Expect = e-106 Identities = 216/470 (45%), Positives = 295/470 (62%), Gaps = 18/470 (3%) Frame = +3 Query: 129 LIDFELDSISNASEN-------------HQDDDDQEFESSPDIDFDSEDRP-RIKRRK-- 260 L+D ELDS S++S++ + D +++E E D D DS+ RIKR + Sbjct: 23 LLDAELDSKSSSSDSSPKAIKHDDASDANDDVNEEEEEEESDSDDDSDIATNRIKRSRVE 82 Query: 261 -LYDNEESEEQLPIKIEDKPESSSAQ-GCLHPGYMWGVCIKCGQNKPDSEENEQQSRVCL 434 L + E +E + ++ +SS++ C HPG +CI CG E +++ V Sbjct: 83 TLENGENPKESTRVSLDQTLVASSSKVACTHPGSFGDMCILCG------ERLIEETGVTF 136 Query: 435 KYIHKEFELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVHPEEESYISS 614 YIHK L++ EI RLR D+K++L +K NS + + + EEE Sbjct: 137 GYIHKGLRLANDEIVRLRNTDMKNLLRHRKLYLVLDLDHTLLNSTQLMHLTAEEE----- 191 Query: 615 TYLVGQRNVESESGDIGRTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYA 794 YL Q ++S +L+ + MHM TKLRPF+ FL+EA+QM+EMYIYTMG+RAYA Sbjct: 192 -YLKSQ--IDSMQDVSNGSLFMVDFMHMMTKLRPFIRTFLKEASQMFEMYIYTMGDRAYA 248 Query: 795 EKMAKLLDPKGKFFADRIISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLI 974 +MAK LDP ++F R+IS D T R QK LD+VLG ESAV+ILDDTE W KHK NLI Sbjct: 249 LEMAKFLDPGREYFNARVISRDDGTQRHQKGLDIVLGQESAVLILDDTENAWTKHKDNLI 308 Query: 975 VVERYHFFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMS 1154 ++ERYHFF +SC QF KSL+Q K DE ES+G L+ +L+VLR +H FFD + + Sbjct: 309 LMERYHFFASSCRQFGFECKSLSQLKSDENESDGALASVLKVLRRIHHIFFDELED---- 364 Query: 1155 QFCFDSRDVRLVLKEIRARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLRE 1334 D RDVR VL +R +LK CK+VFSRVFPTQF+A+ HHLW++AE+LGA CS + Sbjct: 365 --AIDGRDVRQVLSTVRKDVLKGCKIVFSRVFPTQFQADNHHLWKMAEQLGATCSREVDP 422 Query: 1335 DVTHVVSLDSGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVTK 1484 VTHVVS ++GT+KSRWA++ND FLVHP WIEA++++WQRQ E+ F V + Sbjct: 423 SVTHVVSAEAGTEKSRWALKNDKFLVHPRWIEATNYMWQRQPEENFSVNQ 472 >ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Cucumis sativus] Length = 452 Score = 389 bits (999), Expect = e-105 Identities = 208/438 (47%), Positives = 280/438 (63%), Gaps = 5/438 (1%) Frame = +3 Query: 180 
DDDDQEFESSPDIDFDSEDRP---RIKRRKLYDNEESEEQLPIKIEDKPES--SSAQGCL 344 D D +SSPD + + ++ RIKRRK+ E SEE + ++E++ S Q C Sbjct: 24 DLDSHSSDSSPDEETEGDNNAESVRIKRRKVEKLENSEEDIMHEVEEQSLEVLSKQQLCS 83 Query: 345 HPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEIARLRKDDLKSVLSAKK 524 HPG +CI CGQ +++S V YIHKE L++ EI R+R ++K +L KK Sbjct: 84 HPGSFGNMCIICGQRL------DEESGVTFGYIHKELRLNNDEINRMRNKEMKELLQRKK 137 Query: 525 XXXXXXXXXXXXNSARFIDVHPEEESYISSTYLVGQRNVESESGDIGRTLYKLPSMHMWT 704 NS + EEE S T +S +L+ L S+H T Sbjct: 138 LILVLDLDHTLLNSTELRYLTVEEEYLRSQT--------DSLDDVTKGSLFLLNSVHTMT 189 Query: 705 KLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADRIISSSDSTNRRQK 884 KLRPFVH FL+EA++++EMYIYTMGER YA +MAKLLDPK ++F+ ++IS D T + QK Sbjct: 190 KLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQK 249 Query: 885 DLDVVLGAESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHCGTKSLTQSKRDEA 1064 LDVVLG ESAV+ILDDTE W KHK NLI++ERYHFF +SC QF KSL++ K DE+ Sbjct: 250 GLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCKSLSELKNDES 309 Query: 1065 ESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVLKEIRARILKDCKVVFSR 1244 E++G L+ +L+VL+ VH FF+ +S + RDVR VLK +RA +L+ CKVVFSR Sbjct: 310 ETDGALTTILKVLKQVHHMFFNEVSGDLV------DRDVRQVLKTVRAEVLEGCKVVFSR 363 Query: 1245 VFPTQFRAETHHLWRLAEELGAICSTSLREDVTHVVSLDSGTDKSRWAVQNDCFLVHPSW 1424 VFPT+F+AE H LW++ E+LG CST L + VTHVV+ D+GT+KSRWA++ FLVHP W Sbjct: 364 VFPTKFQAENHQLWKMVEQLGGTCSTELDQSVTHVVATDAGTEKSRWALKEKKFLVHPRW 423 Query: 1425 IEASSFLWQRQSEDRFPV 1478 IEAS++ W+RQ E+ F V Sbjct: 424 IEASNYFWKRQMEENFTV 441 >ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa] gi|550318538|gb|EEF03112.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa] Length = 472 Score = 389 bits (998), Expect = e-105 Identities = 223/466 (47%), Positives = 290/466 (62%), Gaps = 19/466 (4%) Frame = +3 Query: 132 IDFELDSISNASENHQDD------DDQEFESSPDIDFDSED-------RPRIKRRKLYDN 272 +D ELDS S+AS D+ D SSPD D ++E+ R R+KR K+ Sbjct: 21 LDTELDSKSSASSASDDEAPNQRHSDSAASSSPDQDKEAEEDDDSDFQRKRVKRSKVETV 80 Query: 273 
EESEEQLPI----KIEDKPESS-SAQGCLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLK 437 E E+ ++ E+S S + C HPG +CI CGQ + +S V Sbjct: 81 EIVEDDGGTTSFASLKHNSEASISKEICTHPGSFGTMCIVCGQLL------DGESGVTFG 134 Query: 438 YIHKEFELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVHPEEESYISST 617 YIHK L + EI RLR D+K++L KK NS + + + +EE Sbjct: 135 YIHKGLRLGNDEIVRLRNTDMKNLLRHKKLYLILDLDHTLLNSTQLMHMTLDEE------ 188 Query: 618 YLVGQRNVESESGDIGR-TLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYA 794 YL GQ + D+ + +L+ L SM M TKLRPFV FL+EA+QM+EMYIYTMG+RAYA Sbjct: 189 YLNGQTD---SLQDVSKGSLFMLSSMQMMTKLRPFVRTFLKEASQMFEMYIYTMGDRAYA 245 Query: 795 EKMAKLLDPKGKFFADRIISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLI 974 +MAKLLDP ++F ++IS D T R QK LDVVLG ESAV+ILDDTE W KHK NLI Sbjct: 246 LEMAKLLDPGREYFNAKVISRDDGTQRHQKGLDVVLGQESAVLILDDTENAWMKHKDNLI 305 Query: 975 VVERYHFFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMS 1154 ++ERYHFF +SC QF KSL++ K DE+ESEG L+ +L+VLR +HQ FF+ + Sbjct: 306 LMERYHFFASSCHQFGFNCKSLSEQKTDESESEGALASILKVLRKIHQIFFEELEEN--- 362 Query: 1155 QFCFDSRDVRLVLKEIRARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLRE 1334 D RDVR VLK +R +LK CK+VFSRVFPTQ +A+ HHLWR+AE+LGA CST L Sbjct: 363 ---MDGRDVRQVLKTVRKDVLKGCKIVFSRVFPTQSQADNHHLWRMAEQLGATCSTELDP 419 Query: 1335 DVTHVVSLDSGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRF 1472 VTHVVS DSGT+KS WA++++ FLV P WIEA+++ WQRQ E+ F Sbjct: 420 SVTHVVSKDSGTEKSHWALKHNKFLVQPGWIEAANYFWQRQPEENF 465 >ref|XP_001781984.1| predicted protein [Physcomitrella patens] gi|162666557|gb|EDQ53208.1| predicted protein [Physcomitrella patens] Length = 563 Score = 386 bits (991), Expect = e-104 Identities = 213/471 (45%), Positives = 293/471 (62%), Gaps = 6/471 (1%) Frame = +3 Query: 141 ELDSISNASENHQDDDDQEFESSPDIDFDSEDRPRIKRRKLYDNEESEEQLPIKIEDKPE 320 +L+ ++ + + ++D D ES D + D ++ E S++ PI + E Sbjct: 63 DLEEGTSENGSVEEDADSVVESEDHADDNHLDVEKVA-------ESSDDITPICVNYSGE 115 Query: 321 SSSAQGCL-HPGYMWGVCIKCGQNKPDSEENEQQ-SRVCLKYIHKEFELSDSEIARLRKD 494 ++ C HPG++W VCI+CG+ K + N+ RV L+YIH+ E+S+ E AR+R Sbjct: 116 
MVNSNKCPPHPGFIWDVCIRCGKRKSTAPSNDPVIDRVGLRYIHEGLEVSELEAARVRNA 175 Query: 495 DLKSVLSAKKXXXXXXXXXXXXNSARFIDVHPEEESYISSTYLVGQRNVESESGDIGRTL 674 +L+ V +K NSARF +V EE Y+ T+ GQ++ S L Sbjct: 176 ELRRVTGKQKLLLVVDLDHTMLNSARFSEVPAEERIYL--TWTAGQQHGRVSS------L 227 Query: 675 YKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADRIIS 854 ++L + MWTKLRPF HKFLEEA+++YEMY+YTMGE+ YA+ MA+LLDP G+ F RIIS Sbjct: 228 HQLTKLGMWTKLRPFAHKFLEEASKLYEMYVYTMGEKIYAQAMAELLDPTGQLFGGRIIS 287 Query: 855 SSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHCGTK 1034 +DST R KDLDVVLGAESAVVILDDTE VWP H+SNLI++ERYHFF +SC QF Sbjct: 288 QTDSTKRHTKDLDVVLGAESAVVILDDTEAVWPNHRSNLILMERYHFFTSSCHQFRVRAP 347 Query: 1035 SLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQF-CFDSRDVRLVLKEIRAR 1211 SL Q RDE E +GTL+ L+ L+++H FF+ K M + + DVR V++ IR + Sbjct: 348 SLAQMHRDECEIDGTLATTLKTLQAIHHEFFNGHKGKSMKRRPPLELPDVRDVIRSIRGK 407 Query: 1212 ILKDCKVVFSRVFPTQFR-AETHHLWRLAEELGAICSTSLREDVTHVVSLDSGTDKSRWA 1388 +L C +VFSR+FPT + E H W+LA ELGA CST THVV+LD GTDK+RWA Sbjct: 408 LLSGCHIVFSRIFPTGLQNPEFHPFWQLAVELGARCSTVCDHTTTHVVALDRGTDKARWA 467 Query: 1389 VQNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQ--SSGFVESVSQLPL 1535 Q+ LVHP W+EA+S+LW+R E FPVT S S+ F +++S P+ Sbjct: 468 KQHGISLVHPRWVEAASYLWKRPREKDFPVTDDASALISTTFSKNISVEPI 518 >gb|EMJ05544.1| hypothetical protein PRUPE_ppa005647mg [Prunus persica] Length = 449 Score = 385 bits (990), Expect = e-104 Identities = 213/436 (48%), Positives = 282/436 (64%), Gaps = 9/436 (2%) Frame = +3 Query: 204 SSPD--IDFDSEDRPR--IKRRKLYD----NEESEEQLPIKIEDKPESSSAQG-CLHPGY 356 SSPD D++S+D KRRK+ + +E I +E+ E+S + C HPG Sbjct: 31 SSPDEEADYESDDGSERSTKRRKVENLGSIDETQGSTSQIFVEENSEASPKKDICTHPGS 90 Query: 357 MWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEIARLRKDDLKSVLSAKKXXXX 536 + +CI CGQ +++S V L YIHK+F L++ EI R+R D+K L KK Sbjct: 91 VKDLCIVCGQRV------DEKSGVPLGYIHKDFWLNNDEIDRVRSTDIKKSLHLKKLYLV 144 Query: 537 XXXXXXXXNSARFIDVHPEEESYISSTYLVGQRNVESESGDIGRTLYKLPSMHMWTKLRP 716 NS + EEE YL Q + + D +L+++ MHM TKLRP Sbjct: 145 
LDLDHTLLNSTHLNHMTAEEE------YLHSQTDSLQDVSD--GSLFRVDVMHMMTKLRP 196 Query: 717 FVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADRIISSSDSTNRRQKDLDV 896 FV KFL+EA++M+EMYIYTMGERAYA +MAKLLDP+ ++F DR+IS D T + QK LDV Sbjct: 197 FVRKFLKEASEMFEMYIYTMGERAYALEMAKLLDPRKEYFGDRVISRDDGTQKHQKGLDV 256 Query: 897 VLGAESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHCGTKSLTQSKRDEAESEG 1076 VLG ESA +ILDDTE W KHK NLI++ERYHFF +SC QF KSL++ K DE+E EG Sbjct: 257 VLGHESAALILDDTENAWTKHKDNLILMERYHFFRSSCHQFGFHCKSLSELKSDESEPEG 316 Query: 1077 TLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVLKEIRARILKDCKVVFSRVFPT 1256 L+ +L+VL+ +H FF + + RDVR VLK +R ILK CK+VFSRVFP+ Sbjct: 317 ALATVLEVLKRIHNMFFYESKDNLI------DRDVRQVLKTLRKEILKGCKIVFSRVFPS 370 Query: 1257 QFRAETHHLWRLAEELGAICSTSLREDVTHVVSLDSGTDKSRWAVQNDCFLVHPSWIEAS 1436 +F+AE H LW++AE+LGA CST L VTHVVS D+GT+KSRWAV+ FLVHP WIEAS Sbjct: 371 KFQAENHQLWKMAEQLGATCSTELDLSVTHVVSTDAGTEKSRWAVKEKKFLVHPQWIEAS 430 Query: 1437 SFLWQRQSEDRFPVTK 1484 +++W +Q+ED+FPV + Sbjct: 431 NYMWLKQAEDKFPVNQ 446 >ref|XP_004962192.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Setaria italica] Length = 543 Score = 384 bits (985), Expect = e-103 Identities = 214/472 (45%), Positives = 290/472 (61%), Gaps = 14/472 (2%) Frame = +3 Query: 129 LIDFELDSISNA---------SENHQDDDDQEFESSPDIDFDSEDRPRIKRRKLYDNEES 281 L+D EL+ S A S + DD+ E E S +++ + ++ KRR++ E+S Sbjct: 23 LLDSELELASGADSAFPGDPSSASPDTDDEGEDEDSEEVEVELLEQNSAKRRRV--EEQS 80 Query: 282 EEQ----LPIKIEDKPESS-SAQGCLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIH 446 ++Q P KI P + + C HPGY G+C +CG KP EE+ S V YIH Sbjct: 81 QDQGTSIRPDKIATGPSKNVQVEVCPHPGYFGGLCFRCG--KPQDEEDA--SGVAFGYIH 136 Query: 447 KEFELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVHPEEESYISSTYLV 626 K L SEI RLR DLK++L +K NS + D+ E + Sbjct: 137 KGLRLGTSEIDRLRGADLKNLLRERKLVLILDLDHTLINSTKLQDISSAENE-------L 189 Query: 627 GQRNVESESGDIGRTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMA 806 G R + D R+++ L SM M TKLRPFV FL+EA+ M+EMYIYTMG++AYA ++A Sbjct: 190 
GIRTAALKD-DPDRSIFSLDSMQMLTKLRPFVRNFLKEASNMFEMYIYTMGDKAYAIEIA 248 Query: 807 KLLDPKGKFFADRIISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIVVER 986 KLLDP +F ++IS+SD T R QK LDV+LGAES VILDDTE VW KHK NLI++ER Sbjct: 249 KLLDPSNVYFPSKVISNSDCTQRHQKGLDVILGAESVAVILDDTEYVWQKHKENLILMER 308 Query: 987 YHFFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCF 1166 YH+F +SC QF G KSL++S +DE ES+G L+ +L VL+ +H FFD +S Sbjct: 309 YHYFASSCRQFGFGVKSLSESMQDERESDGALATVLDVLKRIHTIFFDTAVETALS---- 364 Query: 1167 DSRDVRLVLKEIRARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLREDVTH 1346 SRDVR V+K +R +L+ CK+VFSRVFP R + +W++AE LGA+CST + VTH Sbjct: 365 -SRDVRQVIKTVRKEVLEGCKLVFSRVFPNTSRPQEQMMWKMAEHLGAVCSTDVDSTVTH 423 Query: 1347 VVSLDSGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQSS 1502 VV++D GT+K+RWAV+N FLVHP WIEA++F W RQ E+ FPV +S+ Sbjct: 424 VVAVDLGTEKARWAVKNKKFLVHPRWIEAANFRWHRQPEEDFPVIPPKEKST 475 >ref|XP_002437361.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor] gi|241915584|gb|EER88728.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor] Length = 558 Score = 384 bits (985), Expect = e-103 Identities = 206/460 (44%), Positives = 288/460 (62%), Gaps = 4/460 (0%) Frame = +3 Query: 156 SNASENHQDDDDQEFESSPDIDFDSEDRPRIKRRKLYDNEESEEQLPIKIEDKPESSS-- 329 S+A DD+D++ + D + ++ ++ KRR++ + + ++ ++ + P +S Sbjct: 43 SSAFPAATDDEDEDEDEDEDPEVEAVEQNGTKRRRV-EEQLQDQGTSVRPDKIPTGASKN 101 Query: 330 --AQGCLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEIARLRKDDLK 503 + C HPGY G+C +CG KP EEN S V YIHK L SEI RLR DLK Sbjct: 102 VQVEACPHPGYFGGLCFRCG--KPQDEENV--SGVAFGYIHKGLRLGTSEIDRLRGADLK 157 Query: 504 SVLSAKKXXXXXXXXXXXXNSARFIDVHPEEESYISSTYLVGQRNVESESGDIGRTLYKL 683 ++L +K NS + D+ E+ +G + S+ D R+++ L Sbjct: 158 NLLRERKLVLILDLDHTLINSTKLQDISSAEKD-------LGIQTAASKD-DPNRSIFSL 209 Query: 684 PSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADRIISSSD 863 SM M TKLRPFV +FL+EA+ M+EMYIYTMG++AYA ++AKLLDP +F ++IS+SD Sbjct: 210 DSMQMLTKLRPFVREFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPSNIYFPSKVISNSD 269 Query: 864 
STNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHCGTKSLT 1043 T R QK LDV+LGAES VILDDTE VW KHK NLI++ERYHFF +SC QF G +SL+ Sbjct: 270 CTQRHQKGLDVILGAESVAVILDDTEYVWQKHKENLILMERYHFFASSCRQFGFGVRSLS 329 Query: 1044 QSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVLKEIRARILKD 1223 +S +DE ES+G L+ +L VL+ +H FFD ++S S+DVR V+K +R IL+ Sbjct: 330 ESMQDERESDGALATVLDVLKRIHSIFFDLAVETDLS-----SQDVRQVIKAVRKEILQG 384 Query: 1224 CKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLREDVTHVVSLDSGTDKSRWAVQNDC 1403 CK+VFSRVFP R + LW++AE LGA+CST + VTHVV++D GT+K+RW V N Sbjct: 385 CKIVFSRVFPNNTRPQEQMLWKMAEHLGAVCSTDVDSSVTHVVTVDLGTEKARWGVANKK 444 Query: 1404 FLVHPSWIEASSFLWQRQSEDRFPVTKKDSQSSGFVESVS 1523 FLVHP WIEA++F W RQ E+ FPVT +S +V+ Sbjct: 445 FLVHPRWIEAANFRWHRQPEEDFPVTAPKEKSRDIDNAVA 484 >ref|XP_004242583.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Solanum lycopersicum] Length = 472 Score = 381 bits (979), Expect = e-103 Identities = 217/472 (45%), Positives = 291/472 (61%), Gaps = 22/472 (4%) Frame = +3 Query: 132 IDFELDSISNAS------ENHQDDDDQEFESSPDIDFDSE---------DRPRIKRRKLY 266 +D ELDS S+ S EN + + + E E D D++ D R K+RK+ Sbjct: 21 LDAELDSASDVSPELDEVENGEAEVEVELEDEKGKDEDNDTGDGDDGNIDSRRSKKRKIE 80 Query: 267 DNEESEEQLPIKIEDKPESSSAQG-------CLHPGYMWGVCIKCGQNKPDSEENEQQSR 425 E + + P + + ES+ G C HPG M G+CI+CGQ D +S Sbjct: 81 LIEAAVD--PQSLVSRGESAETSGASLALDVCTHPGVMGGMCIRCGQKVED------ESG 132 Query: 426 VCLKYIHKEFELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVHPEEESY 605 V YIHK L+D E+ARLR+ DLK++L +K NS R D+ EE Sbjct: 133 VAFGYIHKNLRLADDEVARLREKDLKNLLRHRKLILVLDLDHTLLNSTRLADISAEE--- 189 Query: 606 ISSTYLVGQRNVESESGDIGRTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGER 785 +YL QR V ++ + L+KL +HM TKLRPFVH FL+EA+ ++EMYIYTMGER Sbjct: 190 ---SYLKDQREVLPDA--LRSNLFKLDWIHMMTKLRPFVHTFLKEASSLFEMYIYTMGER 244 Query: 786 AYAEKMAKLLDPKGKFFADRIISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKS 965 YA +MAKLLDP G +F R+I+ SDST R QK LDVVLG ESAV+ILDDTE VW KH+ Sbjct: 245 PYALEMAKLLDPGGIYFHSRVIAQSDSTRRHQKGLDVVLGQESAVLILDDTEVVWGKHRE 304 
Query: 966 NLIVVERYHFFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNK 1145 NLI+++RYHFF +SC QF KSL++ K DE E+EG L+ +L+VL+ +H+ FFD Sbjct: 305 NLILMDRYHFFTSSCRQFGLKCKSLSEQKSDENEAEGALASVLEVLQRIHRLFFDPERGD 364 Query: 1146 EMSQFCFDSRDVRLVLKEIRARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTS 1325 + + RDVR VLK +R ILK CK+VF+ V P Q + E H+ W+LAE+LGA ST Sbjct: 365 NIME-----RDVRQVLKTVRKEILKGCKIVFTGVIPIQCQPENHYYWKLAEKLGATFSTE 419 Query: 1326 LREDVTHVVSLDSGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVT 1481 + E VTHVVS++ T+KSR AV+ FLVHP WIEA+++LW++ E+ FPV+ Sbjct: 420 VDESVTHVVSMNDKTEKSRQAVREKKFLVHPRWIEAANYLWRKPPEENFPVS 471 >ref|XP_006343662.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Solanum tuberosum] Length = 478 Score = 381 bits (978), Expect = e-103 Identities = 219/475 (46%), Positives = 290/475 (61%), Gaps = 25/475 (5%) Frame = +3 Query: 132 IDFELDSISNAS------ENHQDDDDQEFESSP--------------DIDFDSEDRPRIK 251 +D ELDS S+ S EN + + ++E E D D S D R K Sbjct: 22 LDAELDSASDVSPELDEVENGEAEGEEEVEDEKGQDEGNDTGDGDDDDDDDGSIDSSRSK 81 Query: 252 RRKLYDNEES-EEQLPIKIEDKPESSSAQ----GCLHPGYMWGVCIKCGQNKPDSEENEQ 416 +RK+ E + + Q + + E+S A C HPG M G+CI+CGQ D Sbjct: 82 KRKIELIEAAVDPQSSVSRGEPAETSGASLALDVCTHPGVMGGMCIRCGQKVED------ 135 Query: 417 QSRVCLKYIHKEFELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVHPEE 596 +S V YIHK L+D E+ARLR DLK++L KK NS R D+ EE Sbjct: 136 ESGVAFGYIHKNLRLADDEVARLRDKDLKNLLRHKKLILVLDLDHTLLNSTRLADISAEE 195 Query: 597 ESYISSTYLVGQRNVESESGDIGRTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTM 776 +YL QR V ++ + L+KL +HM TKLRPFVH FL+EA+ ++EMYIYTM Sbjct: 196 ------SYLKDQREVLPDA--LRNNLFKLDWIHMMTKLRPFVHTFLKEASSLFEMYIYTM 247 Query: 777 GERAYAEKMAKLLDPKGKFFADRIISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPK 956 GER YA +MA LLDP G +F R+I+ SDST R QK LDVVLG ESAV+ILDDTE VW K Sbjct: 248 GERPYALEMASLLDPGGIYFHSRVIAQSDSTRRHQKGLDVVLGQESAVLILDDTEVVWGK 307 Query: 957 HKSNLIVVERYHFFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNI 1136 H+ NLI+++RYHFF +SC QF KSL++ K DE E+EG L+ +L+VL+ +H+ FFD Sbjct: 308 
HRENLILMDRYHFFTSSCRQFGLKCKSLSEQKSDENEAEGALASVLEVLQRIHRLFFDLE 367 Query: 1137 SNKEMSQFCFDSRDVRLVLKEIRARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAIC 1316 + + RDVR VLK +R ILK CK+VF+ V P Q + E HH W+LAE+LGA Sbjct: 368 RGDNIME-----RDVRQVLKTVRKEILKGCKIVFTGVIPIQCQPENHHYWKLAEKLGATF 422 Query: 1317 STSLREDVTHVVSLDSGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVT 1481 ST + E VTHVVS++ T+KSR A++ FLVHPSWIEA+++LW++ E+ FPV+ Sbjct: 423 STEVDESVTHVVSMNDKTEKSRQALREKKFLVHPSWIEAANYLWRKPPEENFPVS 477 >ref|XP_004242582.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Solanum lycopersicum] Length = 512 Score = 380 bits (976), Expect = e-102 Identities = 216/460 (46%), Positives = 290/460 (63%), Gaps = 10/460 (2%) Frame = +3 Query: 132 IDFELDSISNASE----NHQDDDDQEFESSPDIDFD-SEDRPRIKRRKLYDNEES-EEQL 293 +D ELDS S+ E + +++ E E + D D S D R K+RK+ E + + Q Sbjct: 71 LDAELDSASDVDEVESGEAEGEEEVEDEDNDTGDGDGSIDSSRSKKRKIELIEGAVDPQS 130 Query: 294 PIKIEDKPESSSAQG----CLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFEL 461 + + E+S A C HPG M G+CI+CGQ D +S V YIHK L Sbjct: 131 SVSRGEPAETSGASMALDVCTHPGVMGGMCIRCGQKVED------ESGVAFGYIHKNLRL 184 Query: 462 SDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVHPEEESYISSTYLVGQRNV 641 +D E+ARLR+ DLK++L +K NS R D+ EE +YL QR V Sbjct: 185 ADDEVARLREKDLKNLLRHRKLILVLDLDHTLLNSTRLADISAEE------SYLKDQREV 238 Query: 642 ESESGDIGRTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDP 821 ++ + L+KL +HM TKLRPFVH FL+EA+ ++EMYIYTMGER YA +MAKLLDP Sbjct: 239 LPDA--LRSNLFKLDWIHMMTKLRPFVHTFLKEASSLFEMYIYTMGERPYALEMAKLLDP 296 Query: 822 KGKFFADRIISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIVVERYHFFH 1001 G +F R+I+ SDST R QK LDVVLG ESAV+ILDDTE VW KH+ NLI+++RYHFF Sbjct: 297 GGIYFHSRVIAQSDSTRRHQKGLDVVLGQESAVLILDDTEVVWGKHRENLILMDRYHFFT 356 Query: 1002 ASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDV 1181 +SC QF KSL++ K DE E+EG L+ +L+VL+ +H+ FFD + + RDV Sbjct: 357 SSCRQFGLKCKSLSEQKSDENEAEGALASVLEVLQRIHRLFFDPERGDNIME-----RDV 411 Query: 1182 RLVLKEIRARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLREDVTHVVSLD 
1361 R VLK +R ILK CK+VF+ V P Q + E H+ W+LAE+LGA ST + E VTHVVS++ Sbjct: 412 RQVLKTVRKEILKGCKIVFTGVIPIQCQPENHYYWKLAEKLGATFSTEVDESVTHVVSMN 471 Query: 1362 SGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVT 1481 T+KSR AV+ FLVHP WIEA+++LW++ E+ FPV+ Sbjct: 472 DKTEKSRQAVREKKFLVHPRWIEAANYLWRKPPEENFPVS 511 >dbj|BAB08870.1| unnamed protein product [Arabidopsis thaliana] Length = 1065 Score = 377 bits (967), Expect = e-101 Identities = 202/455 (44%), Positives = 280/455 (61%) Frame = +3 Query: 132 IDFELDSISNASENHQDDDDQEFESSPDIDFDSEDRPRIKRRKLYDNEESEEQLPIKIED 311 +D ELDS S+AS ++++ E + +KR+KL EE+ Sbjct: 647 LDAELDSASDASSGPSEEEEAE----------DDVESGLKRQKLEHLEEA---------- 686 Query: 312 KPESSSAQGCLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEIARLRK 491 SSS C HPG +C CGQ +++ V +YIHKE L++ EI+RLR Sbjct: 687 ---SSSKGECEHPGSFGNMCFVCGQKL-------EETGVSFRYIHKEMRLNEDEISRLRD 736 Query: 492 DDLKSVLSAKKXXXXXXXXXXXXNSARFIDVHPEEESYISSTYLVGQRNVESESGDIGRT 671 D + + +K N+ D+ PEEE S T+ +++ G + Sbjct: 737 SDSRFLQRQRKLYLVLDLDHTLLNTTILRDLKPEEEYLKSHTH-----SLQDGCNVSGGS 791 Query: 672 LYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADRII 851 L+ L M M TKLRPFVH FL+EA++M+ MYIYTMG+R YA +MAKLLDPKG++F DR+I Sbjct: 792 LFLLEFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGDRVI 851 Query: 852 SSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHCGT 1031 S D T R +K LDVVLG ESAV+ILDDTE WPKHK NLIV+ERYHFF +SC QF Sbjct: 852 SRDDGTVRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQFDHRY 911 Query: 1032 KSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVLKEIRAR 1211 KSL++ K DE+E +G L+ +L+VL+ H FF+N+ +RDVRL+LK++R Sbjct: 912 KSLSELKSDESEPDGALATVLKVLKQAHALFFENVDEG------ISNRDVRLMLKQVRKE 965 Query: 1212 ILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLREDVTHVVSLDSGTDKSRWAV 1391 ILK CK+VFSRVFPT+ + E H LW++AEELGA C+T + VTHVV++D GT+K+RWAV Sbjct: 966 ILKGCKIVFSRVFPTKAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEKARWAV 1025 Query: 1392 QNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQ 1496 + ++VH WI+A+++LW +Q E+ F + + Q Sbjct: 1026 
REKKYVVHRGWIDAANYLWMKQPEENFGLEQLKKQ 1060 >ref|NP_001078764.1| C-terminal domain phosphatase-like 4 [Arabidopsis thaliana] gi|122154038|sp|Q00IB6.1|CPL4_ARATH RecName: Full=RNA polymerase II C-terminal domain phosphatase-like 4; Short=FCP-like 4; AltName: Full=Carboxyl-terminal phosphatase-like 4; Short=AtCPL4; Short=CTD phosphatase-like 4 gi|95115186|gb|ABF55959.1| carboxyl-terminal phosphatase-like 4 [Arabidopsis thaliana] gi|332009601|gb|AED96984.1| C-terminal domain phosphatase-like 4 [Arabidopsis thaliana] Length = 440 Score = 377 bits (967), Expect = e-101 Identities = 202/455 (44%), Positives = 280/455 (61%) Frame = +3 Query: 132 IDFELDSISNASENHQDDDDQEFESSPDIDFDSEDRPRIKRRKLYDNEESEEQLPIKIED 311 +D ELDS S+AS ++++ E + +KR+KL EE+ Sbjct: 22 LDAELDSASDASSGPSEEEEAE----------DDVESGLKRQKLEHLEEA---------- 61 Query: 312 KPESSSAQGCLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEIARLRK 491 SSS C HPG +C CGQ +++ V +YIHKE L++ EI+RLR Sbjct: 62 ---SSSKGECEHPGSFGNMCFVCGQKL-------EETGVSFRYIHKEMRLNEDEISRLRD 111 Query: 492 DDLKSVLSAKKXXXXXXXXXXXXNSARFIDVHPEEESYISSTYLVGQRNVESESGDIGRT 671 D + + +K N+ D+ PEEE S T+ +++ G + Sbjct: 112 SDSRFLQRQRKLYLVLDLDHTLLNTTILRDLKPEEEYLKSHTH-----SLQDGCNVSGGS 166 Query: 672 LYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADRII 851 L+ L M M TKLRPFVH FL+EA++M+ MYIYTMG+R YA +MAKLLDPKG++F DR+I Sbjct: 167 LFLLEFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGDRVI 226 Query: 852 SSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHCGT 1031 S D T R +K LDVVLG ESAV+ILDDTE WPKHK NLIV+ERYHFF +SC QF Sbjct: 227 SRDDGTVRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQFDHRY 286 Query: 1032 KSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVLKEIRAR 1211 KSL++ K DE+E +G L+ +L+VL+ H FF+N+ +RDVRL+LK++R Sbjct: 287 KSLSELKSDESEPDGALATVLKVLKQAHALFFENVDEG------ISNRDVRLMLKQVRKE 340 Query: 1212 ILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLREDVTHVVSLDSGTDKSRWAV 1391 ILK CK+VFSRVFPT+ + E H LW++AEELGA C+T + VTHVV++D GT+K+RWAV Sbjct: 341 
ILKGCKIVFSRVFPTKAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEKARWAV 400 Query: 1392 QNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQ 1496 + ++VH WI+A+++LW +Q E+ F + + Q Sbjct: 401 REKKYVVHRGWIDAANYLWMKQPEENFGLEQLKKQ 435 >ref|XP_004287124.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Fragaria vesca subsp. vesca] Length = 464 Score = 375 bits (963), Expect = e-101 Identities = 209/462 (45%), Positives = 288/462 (62%), Gaps = 11/462 (2%) Frame = +3 Query: 132 IDFELDSISNASENHQD------DDDQEFESSPDIDFDSEDRPRIKRRKLYDNEESEEQL 293 ++ EL+S S+ S ++ DDD ESS D E R+KRRK+ + E EE Sbjct: 20 LETELESGSSESSPDEECKAAVGDDDGGSESS-----DVESESRVKRRKVENVEILEEAN 74 Query: 294 PIKIEDKPES-SSAQG----CLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFE 458 + + E S A G C HPG +C CGQ + QS V YIHK Sbjct: 75 ALTSQAVSEEISEASGVDDLCAHPGSFGDMCFLCGQRLIE------QSGVTFGYIHKGLR 128 Query: 459 LSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVHPEEESYISSTYLVGQRN 638 L+D EI RLR D+K L+ KK N+ V +EE + Sbjct: 129 LNDGEIDRLRNTDIKKSLNNKKLYLVLDLDHTLLNTTLLNHVTAKEEYLMCPP------- 181 Query: 639 VESESGDIGRTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLD 818 +S + +L++L M M TKLRPF+ FL+EA++++EMYIYTMG+RAYA +MAKLLD Sbjct: 182 -DSLPDVLKDSLFRLDFMRMMTKLRPFIRTFLKEASEIFEMYIYTMGDRAYALEMAKLLD 240 Query: 819 PKGKFFADRIISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIVVERYHFF 998 PK ++F DR+IS D T R QK LD+VLG ESAV+ILDDTE W KHK NLI++ERYHFF Sbjct: 241 PKKEYFGDRVISRDDGTQRHQKGLDIVLGQESAVLILDDTENAWIKHKDNLILMERYHFF 300 Query: 999 HASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRD 1178 +SC QF +SL++ K DE+E EG L+ +L +L+ +H+ FF ++ + RD Sbjct: 301 RSSCAQFGFTCESLSELKSDESEPEGALANVLDLLKRIHKMFFYDLGGNLV------DRD 354 Query: 1179 VRLVLKEIRARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLREDVTHVVSL 1358 VR VLK +R +L CKVVFSR+ P++ A +HHLW++AE+LGAICST + VTHVV+L Sbjct: 355 VRQVLKIVRKEVLNGCKVVFSRIIPSKVLASSHHLWKMAEQLGAICSTEVDSTVTHVVAL 414 Query: 1359 DSGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVTK 1484 D+GT+KSRWAV+++ FLVHP W+EA++++WQ+Q+E++FPVT+ Sbjct: 415 
DAGTEKSRWAVKHNKFLVHPRWLEAANYMWQKQAEEKFPVTE 456 >ref|XP_002965594.1| hypothetical protein SELMODRAFT_167775 [Selaginella moellendorffii] gi|300166408|gb|EFJ33014.1| hypothetical protein SELMODRAFT_167775 [Selaginella moellendorffii] Length = 411 Score = 375 bits (963), Expect = e-101 Identities = 193/396 (48%), Positives = 266/396 (67%), Gaps = 12/396 (3%) Frame = +3 Query: 357 MWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEIARLRKDDLKSVLSAKKXXXX 536 MWGVCI+CG KP+SE S V LKYIH+EFEL+ +AR+R+D+L+ VL +K Sbjct: 1 MWGVCIRCGVLKPNSEPGGSASNVALKYIHEEFELAGDVLARVREDELRQVLGKRKLFLV 60 Query: 537 XXXXXXXXNSARFIDVHPEEESYISSTYL-VGQRNVESESGDI----------GRTLYKL 683 NSAR+++V P+E +Y+ TY+ V + + + S G L+++ Sbjct: 61 LDLDHTLLNSARWMEVFPDETAYLEHTYMNVPEDKIPALSNGAPAVAGVIQPGGGGLHRI 120 Query: 684 PSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADRIISSSD 863 M +WTKLRPF HKFLEEA++++EMY+YTMGER YA MA LLDP GKFF R+IS D Sbjct: 121 HGMQLWTKLRPFAHKFLEEASKLFEMYVYTMGERMYAVTMAHLLDPTGKFFKGRVISQRD 180 Query: 864 STNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHCGTKSLT 1043 ST R+ KDLD+VLGA+SAV+ILDDTE VWPKH++NLIV+ERYHFF +SC QF SLT Sbjct: 181 STCRQTKDLDIVLGADSAVLILDDTEAVWPKHRANLIVMERYHFFQSSCRQFGLENPSLT 240 Query: 1044 QSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVLKEIRARILKD 1223 +++RDE++ EG L+ +L+VL+ +H FF + S++ D RD+ V +R+ IL Sbjct: 241 KAERDESKDEGALANVLKVLQRIHSDFF---MESDDSRYTCDVRDITSV---VRSEILSG 294 Query: 1224 CKVVFSRVFPTQ-FRAETHHLWRLAEELGAICSTSLREDVTHVVSLDSGTDKSRWAVQND 1400 CK+VFSR+FPT E LWRL +LGA C + + VTHVV+LD TDK++WA ++ Sbjct: 295 CKLVFSRIFPTDCLEPELTPLWRLCVDLGAECVLAHDDSVTHVVALDRFTDKAKWAKEHR 354 Query: 1401 CFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQSSGF 1508 FLVHP+W+EA+ LW+R +E FPV + +++ F Sbjct: 355 KFLVHPAWVEAAHSLWRRPNELEFPVREGQTRAPVF 390 >ref|XP_006486243.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like isoform X1 [Citrus sinensis] gi|568865772|ref|XP_006486244.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like isoform X2 [Citrus sinensis] 
gi|568865774|ref|XP_006486245.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like isoform X3 [Citrus sinensis] Length = 478 Score = 375 bits (962), Expect = e-101 Identities = 212/470 (45%), Positives = 287/470 (61%), Gaps = 13/470 (2%) Frame = +3 Query: 132 IDFELDSIS-------NASENHQDDDDQEFESSPDIDFDSEDRPRIKRRKLYDNEESEEQ 290 +D ELDS S A + +D++ + E+ + D +D RIKRRK E +E+ Sbjct: 21 LDAELDSNSLGSSPEKEAEDKDEDEESIDEEAENEEARDDKDLERIKRRKTQIVETIQER 80 Query: 291 ----LPIKIEDKPESS-SAQGCLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEF 455 L +E+K E S C HPG + G+C +CG+ E++S V YI K Sbjct: 81 PGPTLLGNLEEKTEVSLEMDNCPHPGSLGGMCYRCGKRL------EEESGVTFSYICKGL 134 Query: 456 ELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVHPEEESYISSTYLVGQR 635 L + EI RLR D+K +L +K NS + + PEE+ YL Q Sbjct: 135 RLGNDEIDRLRNTDMKHLLRHRKLYLILDLDHTLLNSTLLLHLTPEED------YLKSQA 188 Query: 636 NVESESGDIGR-TLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKL 812 + D+ + +L+ L M+M TKLRPFVH FL+EA++M+EMYIYTMG+R YA +MAKL Sbjct: 189 D---SLQDVSKGSLFMLAFMNMMTKLRPFVHTFLKEASEMFEMYIYTMGDRPYALEMAKL 245 Query: 813 LDPKGKFFADRIISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIVVERYH 992 LDP ++F R+IS D T R QK LDVVLG ESAV+ILDDTE W KH+ NLI++ERYH Sbjct: 246 LDPSREYFNARVISRDDGTQRHQKGLDVVLGQESAVLILDDTENAWTKHRDNLILMERYH 305 Query: 993 FFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDS 1172 FF +SC QF +SL+Q + DE+E EG L+ +L+VL+ +H FFD ++N Sbjct: 306 FFASSCRQFGYHCQSLSQLRSDESELEGALASVLKVLKRIHNIFFDELAND------LAG 359 Query: 1173 RDVRLVLKEIRARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLREDVTHVV 1352 RDVR VLK +R +LK CK+VFS VFPT+F A+TH+LW++AE+LGA C L VTHVV Sbjct: 360 RDVRQVLKMVRGEVLKGCKLVFSHVFPTKFPADTHYLWKMAEQLGATCLIELDPSVTHVV 419 Query: 1353 SLDSGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQSS 1502 S D+ T+KSRWA + FLV P WIE ++FLWQRQ E+ FPV + + + Sbjct: 420 STDARTEKSRWAAKEAKFLVDPRWIETANFLWQRQPEENFPVKQNKPEEN 469 >ref|XP_002324546.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa] gi|550318537|gb|EEF03111.2| hypothetical protein 
POPTR_0018s11760g [Populus trichocarpa] Length = 468 Score = 375 bits (962), Expect = e-101 Identities = 220/467 (47%), Positives = 288/467 (61%), Gaps = 20/467 (4%) Frame = +3 Query: 132 IDFELDSISNASENHQDD------DDQEFESSPDIDFDSED-------RPRIKRRKLYDN 272 +D ELDS S+AS D+ D SSPD D ++E+ R R+KR K+ Sbjct: 21 LDTELDSKSSASSASDDEAPNQRHSDSAASSSPDQDKEAEEDDDSDFQRKRVKRSKVETV 80 Query: 273 EESEEQLPI----KIEDKPESS-SAQGCLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLK 437 E E+ ++ E+S S + C HPG +CI CGQ + +S V Sbjct: 81 EIVEDDGGTTSFASLKHNSEASISKEICTHPGSFGTMCIVCGQLL------DGESGVTFG 134 Query: 438 YIHKEFELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVHPEEESYISST 617 YIHK L + EI RLR D+K++L KK NS + + + +EE Sbjct: 135 YIHKGLRLGNDEIVRLRNTDMKNLLRHKKLYLILDLDHTLLNSTQLMHMTLDEE------ 188 Query: 618 YLVGQRNVESESGDIGR-TLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYA 794 YL GQ + D+ + +L+ L SM M TKLRPFV FL+EA+QM+EMYIYTMG+RAYA Sbjct: 189 YLNGQTD---SLQDVSKGSLFMLSSMQMMTKLRPFVRTFLKEASQMFEMYIYTMGDRAYA 245 Query: 795 EKMAKLLDPKGKFFADRIISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLI 974 +MAKLLDP ++F ++IS D T R QK LDVVLG ESAV+ILDDTE W KHK NLI Sbjct: 246 LEMAKLLDPGREYFNAKVISRDDGTQRHQKGLDVVLGQESAVLILDDTENAWMKHKDNLI 305 Query: 975 VVERYHFFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFF-DNISNKEM 1151 ++ERYHFF +SC QF KSL++ K DE+ESEG L+ +L+VLR +HQ FF D+I + + Sbjct: 306 LMERYHFFASSCHQFGFNCKSLSEQKTDESESEGALASILKVLRKIHQIFFEDHILSLAL 365 Query: 1152 SQFCFDSRDVRLVLKEIRARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLR 1331 VLK +R +LK CK+VFSRVFPTQ +A+ HHLWR+AE+LGA CST L Sbjct: 366 Q-----------VLKTVRKDVLKGCKIVFSRVFPTQSQADNHHLWRMAEQLGATCSTELD 414 Query: 1332 EDVTHVVSLDSGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRF 1472 VTHVVS DSGT+KS WA++++ FLV P WIEA+++ WQRQ E+ F Sbjct: 415 PSVTHVVSKDSGTEKSHWALKHNKFLVQPGWIEAANYFWQRQPEENF 461 >ref|XP_002864543.1| hypothetical protein ARALYDRAFT_332090 [Arabidopsis lyrata subsp. lyrata] gi|297310378|gb|EFH40802.1| hypothetical protein ARALYDRAFT_332090 [Arabidopsis lyrata subsp. 
lyrata] Length = 1006 Score = 371 bits (952), Expect = e-100 Identities = 204/457 (44%), Positives = 287/457 (62%), Gaps = 2/457 (0%) Frame = +3 Query: 132 IDFELDSISNASENHQDDDDQEFESSPDIDFDSEDRPRIKRRKLYDNEESEEQLPIKIED 311 +D ELDS S+AS +++++ + ++ +KRRKL E +E E+ Sbjct: 583 LDAELDSASDASSGPSEEEEEA---------EDDEESGLKRRKLEHLETVDE------EE 627 Query: 312 KPESSSAQG-CLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEIARLR 488 E+SS++G C HPG +C CGQ +++ V +YIHKE L++ EI+RLR Sbjct: 628 IEEASSSKGECQHPGSFGNMCFVCGQKL-------EETGVSFRYIHKEMRLNEDEISRLR 680 Query: 489 KDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVHPEEESYISSTYLVGQRNVESESGDI-G 665 D + + +K NS D+ PEEE S T+ + + D+ G Sbjct: 681 DSDSRFLQRQRKLYLVLDLDHTLLNSTVLRDLKPEEEYLKSHTHSLQEPFDFLLISDVSG 740 Query: 666 RTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADR 845 +L+ L MHM TKLRPFVH FL+EA++M+ MYIYTMG+RAYA +MAKLLDP+G++F DR Sbjct: 741 GSLFMLEFMHMMTKLRPFVHSFLKEASEMFVMYIYTMGDRAYARQMAKLLDPRGEYFGDR 800 Query: 846 IISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHC 1025 IIS D T R QK LDVVLG ESAV+ILDDTE WP HK NLIV+ERYHFF +SC QF Sbjct: 801 IISRDDGTVRHQKSLDVVLGQESAVLILDDTENAWPNHKDNLIVIERYHFFASSCRQFDH 860 Query: 1026 GTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVLKEIR 1205 KSL++ K DE+E +G L+ +L ++ ++ISN RDVR +LK++R Sbjct: 861 KYKSLSELKSDESEPDGALATVL-------KNVDEDISN----------RDVRSMLKQVR 903 Query: 1206 ARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLREDVTHVVSLDSGTDKSRW 1385 +LK CKVVFSRVFPT+ + E H LW++AEELGA C+T + VTHVV++D GT+K+RW Sbjct: 904 KEVLKGCKVVFSRVFPTKAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEKARW 963 Query: 1386 AVQNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQ 1496 AV+ ++VH WI+A+++LW++Q E++F + + Q Sbjct: 964 AVREKKYVVHRGWIDAANYLWKKQPEEKFSLEQLKKQ 1000 >ref|XP_006654357.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Oryza brachyantha] Length = 557 Score = 369 bits (947), Expect = 3e-99 Identities = 204/474 (43%), Positives = 288/474 (60%), Gaps = 17/474 (3%) Frame = +3 Query: 129 
LIDFELD--------SISNASENHQDDDDQEFESSPDIDFDSEDRPRI-----KRRKLYD 269 L+D ELD S S AS++ +D D++E + + ++ ++ KRR++ D Sbjct: 25 LLDAELDFDSAADPSSASAASDDEEDGDEEEGKEEDVVMVVEQEEAKVEQSSSKRRRVED 84 Query: 270 NEESEEQLPIKIEDKPESSS---AQGCL-HPGYMWGVCIKCGQNKPDSEENEQQSRVCLK 437 + E + +D SS + C HPG+ G+C KCG+ ++ E V Sbjct: 85 QHQDEGKAMRPNDDTVGSSKDVKIEICPPHPGFFGGLCFKCGKK----QDEEDVPGVAFG 140 Query: 438 YIHKEFELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXXNSARFIDVHPEEESYISST 617 YIHK L SEI RLR DLK++L ++ NS + +D+ E Sbjct: 141 YIHKGLTLGTSEIDRLRGADLKNLLRERRLVLILDLDHTLINSTKLLDLSAAENE----- 195 Query: 618 YLVGQRNVESESGDIGRTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAE 797 +G ++ S+ D R+L++L +M M TKLRPFV +FL+EA+ M+EMYIYTMG++AYA Sbjct: 196 --LGIQSAASKD-DPNRSLFRLDAMQMLTKLRPFVREFLKEASNMFEMYIYTMGDKAYAI 252 Query: 798 KMAKLLDPKGKFFADRIISSSDSTNRRQKDLDVVLGAESAVVILDDTEPVWPKHKSNLIV 977 ++AKLLDP+ +F +IS+SD T R QK LDV+LGAES VILDDTE VW KHK NLI+ Sbjct: 253 EIAKLLDPENVYFGSNVISNSDCTQRHQKGLDVILGAESLAVILDDTEYVWQKHKENLIL 312 Query: 978 VERYHFFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQ 1157 +ERYH+F +SC QF +SL++S +DE E +G L+ +L +LR +H FFD+ + Sbjct: 313 MERYHYFASSCRQFGFSARSLSESMQDEREGDGALATILDILRRIHSIFFDSAVQNPL-- 370 Query: 1158 FCFDSRDVRLVLKEIRARILKDCKVVFSRVFPTQFRAETHHLWRLAEELGAICSTSLRED 1337 SRDVR V+K +R IL CK+VF+RVFP R + LW++AE+LGA+C T + Sbjct: 371 ---PSRDVRQVIKRVRQEILDGCKLVFTRVFPLHQRPQDQMLWKMAEQLGAVCCTDVDSM 427 Query: 1338 VTHVVSLDSGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQS 1499 VTHVV+LD GT+K+RWAV N FLVHP WIEA++F W RQ E+ FPV + +S Sbjct: 428 VTHVVALDLGTEKARWAVGNKKFLVHPRWIEAANFRWHRQQEEDFPVARPREKS 481