BLASTX nr result
ID: Ephedra27_contig00001001
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Ephedra27_contig00001001 (1740 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006838087.1| hypothetical protein AMTR_s00106p00017820 [A... 422 e-115 gb|EOY32064.1| RNA polymerase II ctd phosphatase, putative isofo... 393 e-106 ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative ... 392 e-106 gb|EMJ05544.1| hypothetical protein PRUPE_ppa005647mg [Prunus pe... 389 e-105 ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal doma... 387 e-105 ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Popu... 385 e-104 ref|XP_001781984.1| predicted protein [Physcomitrella patens] gi... 383 e-103 ref|XP_004962192.1| PREDICTED: RNA polymerase II C-terminal doma... 382 e-103 ref|XP_002437361.1| hypothetical protein SORBIDRAFT_10g025580 [S... 382 e-103 ref|XP_004242583.1| PREDICTED: RNA polymerase II C-terminal doma... 380 e-102 ref|XP_006343662.1| PREDICTED: RNA polymerase II C-terminal doma... 380 e-102 ref|XP_004242582.1| PREDICTED: RNA polymerase II C-terminal doma... 379 e-102 ref|XP_004287124.1| PREDICTED: RNA polymerase II C-terminal doma... 377 e-102 dbj|BAB08870.1| unnamed protein product [Arabidopsis thaliana] 376 e-101 ref|NP_001078764.1| C-terminal domain phosphatase-like 4 [Arabid... 376 e-101 ref|XP_006486243.1| PREDICTED: RNA polymerase II C-terminal doma... 374 e-101 ref|XP_002324546.2| hypothetical protein POPTR_0018s11760g [Popu... 372 e-100 ref|XP_002965594.1| hypothetical protein SELMODRAFT_167775 [Sela... 371 e-100 ref|XP_002864543.1| hypothetical protein ARALYDRAFT_332090 [Arab... 370 e-100 gb|EOY32065.1| RNA polymerase II ctd phosphatase, putative isofo... 368 5e-99 >ref|XP_006838087.1| hypothetical protein AMTR_s00106p00017820 [Amborella trichopoda] gi|548840545|gb|ERN00656.1| hypothetical protein AMTR_s00106p00017820 [Amborella trichopoda] Length = 486 Score = 422 bits (1086), Expect = e-115 Identities = 222/472 (47%), Positives = 310/472 (65%), Gaps = 18/472 (3%) Frame = -1 Query: 1584 ELDSISNAS-ENHQDDDDQEFESSPD-----------IDFDSEDRPRIKRRKLYDNEESE 1441 E D +SN+S E+ + D+D E + + D++ + RIKR K+ ++EE + Sbjct: 22 ETDLLSNSSGESPERDEDILDEITSESVSERSAVWDSTDYEEIELERIKRPKICEDEEIK 81 Query: 1440 EQLPI-----KIEDKPESSSARGCL-HPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIH 1279 E ++++ ES+S + C HPG+ +CI+CG+ K D +++ V YIH Sbjct: 82 ESQSSNANQGELDNFKESTSEKVCPPHPGFYKDMCIRCGEQKDDETVARKETAVAFNYIH 141 Query: 1278 KEFELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXLNSARFIDVDPEEESYISSTYLV 1099 K+ +L E+ARLR DLK++ +K LNS R +DV PEEE+Y+++TYL Sbjct: 142 KDLKLGAEEVARLRATDLKNLYRRRKLYLVLDLDHTLLNSTRLVDVSPEEEAYLNATYL- 200 Query: 1098 GQRNVESESGDIGRTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMA 919 + S +GD TL+KL +HM TKLRPFV FL+EAN M+EMY+YTMGERAYA +MA Sbjct: 201 -NKETSSSNGDTSGTLFKLEPLHMLTKLRPFVRTFLKEANTMFEMYVYTMGERAYALEMA 259 Query: 918 KLLDPKGKFFADRIISSSDSTNRRQKDLDVVLGTESAVVILDDTEPVWPKHKSNLIVVER 739 KLLDP G +F R+IS DST R QK LDVVLG+E AVVILDDTE VW KHK NL+++ER Sbjct: 260 KLLDPSGVYFGSRVISQGDSTVRHQKGLDVVLGSECAVVILDDTEHVWHKHKENLVLMER 319 Query: 738 YHFFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCF 559 YHFF +SC QF+ KSL++ KRDE+ES+G L+ +L VL+ +HQ F+ + + F Sbjct: 320 YHFFSSSCRQFNVHYKSLSELKRDESESDGMLASILNVLKHIHQMFY-----YQEVETDF 374 Query: 558 DSRDVRLVLKEIRARILKDCKVVFSRVFPSQFRAETHHLWRLAEELGAICSTSLREDVTH 379 + DVR VLK I++ +LK C++VFSR+FP+ + E LWR+AE+LGA CS L E VTH Sbjct: 375 NGSDVRKVLKTIQSEVLKGCRLVFSRIFPTNYPVENQTLWRIAEQLGASCSKELDEAVTH 434 Query: 378 VVSLDAGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQSS 223 VVSLD GT+K+RWA+Q LV+P W+EA+++ W+RQ ED+FP+ K+ S Sbjct: 435 VVSLDLGTEKARWAIQRKKHLVNPGWLEATNYFWKRQPEDQFPIPSKNGGGS 486 >gb|EOY32064.1| RNA polymerase II ctd phosphatase, putative isoform 1 [Theobroma cacao] Length = 469 Score = 393 bits (1010), Expect = e-106 Identities = 224/461 (48%), Positives = 284/461 (61%), Gaps = 7/461 (1%) Frame = -1 Query: 1590 DFELDSISNASENHQDDDDQEFESSPDIDFDSEDRPRIKRRKLYDNEESEEQLPIK-IED 1414 D E D +N N DDD D DS+ R K KL D EES IED Sbjct: 38 DVEADGDNNNDNNDDHDDDD--------DLDSQRNKRCKTEKLEDLEESRGSTSQGLIED 89 Query: 1413 K----PESSSARG-CLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEI 1249 K E S + C HPG +CI CGQ D +S V YIHK L + EI Sbjct: 90 KIVIHAELSLKKDICTHPGSFGQMCILCGQRLDD------ESGVTFGYIHKGLRLGNDEI 143 Query: 1248 ARLRKDDLKSVLSAKKXXXXXXXXXXXLNSARFIDVDPEEESYISSTYLVGQRNVESESG 1069 RLR D+K++L KK LNS + + + P+EE YL GQ + Sbjct: 144 VRLRSTDMKNLLRHKKLYLVLDLDHTLLNSTQLMHLTPDEE------YLKGQSD---SLQ 194 Query: 1068 DIGR-TLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKF 892 D+ R +L+ L MHM TKLRPFV FL+EA++M+EMYIYTMG+R YA +MAKLLDP+ ++ Sbjct: 195 DVSRGSLFMLDFMHMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEMAKLLDPRREY 254 Query: 891 FADRIISSSDSTNRRQKDLDVVLGTESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCL 712 F+DR+IS D T + QK LDVVLG ESAVVILDDTE W KHK NLI++ERYH+F +SC Sbjct: 255 FSDRVISRDDGTQKHQKGLDVVLGQESAVVILDDTENAWMKHKDNLILMERYHYFASSCH 314 Query: 711 QFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVL 532 QF KSL+Q K DE+E +G L+ +L+ LR +H FFD + SRDVR VL Sbjct: 315 QFGYKCKSLSQLKSDESEPDGALASVLKALRQIHHMFFDELDCN------LASRDVRQVL 368 Query: 531 KEIRARILKDCKVVFSRVFPSQFRAETHHLWRLAEELGAICSTSLREDVTHVVSLDAGTD 352 K ++ +LK CK+VFS VFP+ F AE+H LW++AE+LGA CST VTHVVS DAGT+ Sbjct: 369 KTVQEEVLKGCKIVFSHVFPTNFPAESHPLWKMAEQLGATCSTETDLSVTHVVSTDAGTE 428 Query: 351 KSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQ 229 KSRWAV+ FLVHP WIEA+++LWQ+Q E+ FPV++ +Q Sbjct: 429 KSRWAVKEKKFLVHPRWIEATNYLWQKQPEENFPVSQGKNQ 469 >ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis] gi|223534449|gb|EEF36151.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis] Length = 478 Score = 392 bits (1006), Expect = e-106 Identities = 217/470 (46%), Positives = 296/470 (62%), Gaps = 18/470 (3%) Frame = -1 Query: 1596 LIDFELDSISNASEN-------------HQDDDDQEFESSPDIDFDSEDRP-RIKRRK-- 1465 L+D ELDS S++S++ + D +++E E D D DS+ RIKR + Sbjct: 23 LLDAELDSKSSSSDSSPKAIKHDDASDANDDVNEEEEEEESDSDDDSDIATNRIKRSRVE 82 Query: 1464 -LYDNEESEEQLPIKIEDKPESSSAR-GCLHPGYMWGVCIKCGQNKPDSEENEQQSRVCL 1291 L + E +E + ++ +SS++ C HPG +CI CG E +++ V Sbjct: 83 TLENGENPKESTRVSLDQTLVASSSKVACTHPGSFGDMCILCG------ERLIEETGVTF 136 Query: 1290 KYIHKEFELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXLNSARFIDVDPEEESYISS 1111 YIHK L++ EI RLR D+K++L +K LNS + + + EEE Sbjct: 137 GYIHKGLRLANDEIVRLRNTDMKNLLRHRKLYLVLDLDHTLLNSTQLMHLTAEEE----- 191 Query: 1110 TYLVGQRNVESESGDIGRTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYA 931 YL Q ++S +L+ + MHM TKLRPF+ FL+EA+QM+EMYIYTMG+RAYA Sbjct: 192 -YLKSQ--IDSMQDVSNGSLFMVDFMHMMTKLRPFIRTFLKEASQMFEMYIYTMGDRAYA 248 Query: 930 EKMAKLLDPKGKFFADRIISSSDSTNRRQKDLDVVLGTESAVVILDDTEPVWPKHKSNLI 751 +MAK LDP ++F R+IS D T R QK LD+VLG ESAV+ILDDTE W KHK NLI Sbjct: 249 LEMAKFLDPGREYFNARVISRDDGTQRHQKGLDIVLGQESAVLILDDTENAWTKHKDNLI 308 Query: 750 VVERYHFFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMS 571 ++ERYHFF +SC QF KSL+Q K DE ES+G L+ +L+VLR +H FFD + + Sbjct: 309 LMERYHFFASSCRQFGFECKSLSQLKSDENESDGALASVLKVLRRIHHIFFDELED---- 364 Query: 570 QFCFDSRDVRLVLKEIRARILKDCKVVFSRVFPSQFRAETHHLWRLAEELGAICSTSLRE 391 D RDVR VL +R +LK CK+VFSRVFP+QF+A+ HHLW++AE+LGA CS + Sbjct: 365 --AIDGRDVRQVLSTVRKDVLKGCKIVFSRVFPTQFQADNHHLWKMAEQLGATCSREVDP 422 Query: 390 DVTHVVSLDAGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVTK 241 VTHVVS +AGT+KSRWA++ND FLVHP WIEA++++WQRQ E+ F V + Sbjct: 423 SVTHVVSAEAGTEKSRWALKNDKFLVHPRWIEATNYMWQRQPEENFSVNQ 472 >gb|EMJ05544.1| hypothetical protein PRUPE_ppa005647mg [Prunus persica] Length = 449 Score = 389 bits (998), Expect = e-105 Identities = 216/436 (49%), Positives = 283/436 (64%), Gaps = 9/436 (2%) Frame = -1 Query: 1521 SSPD--IDFDSEDRPR--IKRRKLYD----NEESEEQLPIKIEDKPESSSARG-CLHPGY 1369 SSPD D++S+D KRRK+ + +E I +E+ E+S + C HPG Sbjct: 31 SSPDEEADYESDDGSERSTKRRKVENLGSIDETQGSTSQIFVEENSEASPKKDICTHPGS 90 Query: 1368 MWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEIARLRKDDLKSVLSAKKXXXX 1189 + +CI CGQ +++S V L YIHK+F L++ EI R+R D+K L KK Sbjct: 91 VKDLCIVCGQRV------DEKSGVPLGYIHKDFWLNNDEIDRVRSTDIKKSLHLKKLYLV 144 Query: 1188 XXXXXXXLNSARFIDVDPEEESYISSTYLVGQRNVESESGDIGRTLYKLPSMHMWTKLRP 1009 LNS + EEE YL Q + + D +L+++ MHM TKLRP Sbjct: 145 LDLDHTLLNSTHLNHMTAEEE------YLHSQTDSLQDVSD--GSLFRVDVMHMMTKLRP 196 Query: 1008 FVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADRIISSSDSTNRRQKDLDV 829 FV KFL+EA++M+EMYIYTMGERAYA +MAKLLDP+ ++F DR+IS D T + QK LDV Sbjct: 197 FVRKFLKEASEMFEMYIYTMGERAYALEMAKLLDPRKEYFGDRVISRDDGTQKHQKGLDV 256 Query: 828 VLGTESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHCGTKSLTQSKRDEAESEG 649 VLG ESA +ILDDTE W KHK NLI++ERYHFF +SC QF KSL++ K DE+E EG Sbjct: 257 VLGHESAALILDDTENAWTKHKDNLILMERYHFFRSSCHQFGFHCKSLSELKSDESEPEG 316 Query: 648 TLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVLKEIRARILKDCKVVFSRVFPS 469 L+ +L+VL+ +H FF + + RDVR VLK +R ILK CK+VFSRVFPS Sbjct: 317 ALATVLEVLKRIHNMFFYESKDNLI------DRDVRQVLKTLRKEILKGCKIVFSRVFPS 370 Query: 468 QFRAETHHLWRLAEELGAICSTSLREDVTHVVSLDAGTDKSRWAVQNDCFLVHPSWIEAS 289 +F+AE H LW++AE+LGA CST L VTHVVS DAGT+KSRWAV+ FLVHP WIEAS Sbjct: 371 KFQAENHQLWKMAEQLGATCSTELDLSVTHVVSTDAGTEKSRWAVKEKKFLVHPQWIEAS 430 Query: 288 SFLWQRQSEDRFPVTK 241 +++W +Q+ED+FPV + Sbjct: 431 NYMWLKQAEDKFPVNQ 446 >ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Cucumis sativus] Length = 452 Score = 387 bits (995), Expect = e-105 Identities = 208/438 (47%), Positives = 281/438 (64%), Gaps = 5/438 (1%) Frame = -1 Query: 1545 DDDDQEFESSPDIDFDSEDRP---RIKRRKLYDNEESEEQLPIKIEDKPES--SSARGCL 1381 D D +SSPD + + ++ RIKRRK+ E SEE + ++E++ S + C Sbjct: 24 DLDSHSSDSSPDEETEGDNNAESVRIKRRKVEKLENSEEDIMHEVEEQSLEVLSKQQLCS 83 Query: 1380 HPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEIARLRKDDLKSVLSAKK 1201 HPG +CI CGQ +++S V YIHKE L++ EI R+R ++K +L KK Sbjct: 84 HPGSFGNMCIICGQRL------DEESGVTFGYIHKELRLNNDEINRMRNKEMKELLQRKK 137 Query: 1200 XXXXXXXXXXXLNSARFIDVDPEEESYISSTYLVGQRNVESESGDIGRTLYKLPSMHMWT 1021 LNS + EEE S T +S +L+ L S+H T Sbjct: 138 LILVLDLDHTLLNSTELRYLTVEEEYLRSQT--------DSLDDVTKGSLFLLNSVHTMT 189 Query: 1020 KLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADRIISSSDSTNRRQK 841 KLRPFVH FL+EA++++EMYIYTMGER YA +MAKLLDPK ++F+ ++IS D T + QK Sbjct: 190 KLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQK 249 Query: 840 DLDVVLGTESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHCGTKSLTQSKRDEA 661 LDVVLG ESAV+ILDDTE W KHK NLI++ERYHFF +SC QF KSL++ K DE+ Sbjct: 250 GLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCKSLSELKNDES 309 Query: 660 ESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVLKEIRARILKDCKVVFSR 481 E++G L+ +L+VL+ VH FF+ +S + RDVR VLK +RA +L+ CKVVFSR Sbjct: 310 ETDGALTTILKVLKQVHHMFFNEVSGDLV------DRDVRQVLKTVRAEVLEGCKVVFSR 363 Query: 480 VFPSQFRAETHHLWRLAEELGAICSTSLREDVTHVVSLDAGTDKSRWAVQNDCFLVHPSW 301 VFP++F+AE H LW++ E+LG CST L + VTHVV+ DAGT+KSRWA++ FLVHP W Sbjct: 364 VFPTKFQAENHQLWKMVEQLGGTCSTELDQSVTHVVATDAGTEKSRWALKEKKFLVHPRW 423 Query: 300 IEASSFLWQRQSEDRFPV 247 IEAS++ W+RQ E+ F V Sbjct: 424 IEASNYFWKRQMEENFTV 441 >ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa] gi|550318538|gb|EEF03112.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa] Length = 472 Score = 385 bits (990), Expect = e-104 Identities = 221/466 (47%), Positives = 291/466 (62%), Gaps = 19/466 (4%) Frame = -1 Query: 1593 IDFELDSISNASENHQDD------DDQEFESSPDIDFDSED-------RPRIKRRKLYDN 1453 +D ELDS S+AS D+ D SSPD D ++E+ R R+KR K+ Sbjct: 21 LDTELDSKSSASSASDDEAPNQRHSDSAASSSPDQDKEAEEDDDSDFQRKRVKRSKVETV 80 Query: 1452 EESEEQLPI----KIEDKPESSSARG-CLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLK 1288 E E+ ++ E+S ++ C HPG +CI CGQ + +S V Sbjct: 81 EIVEDDGGTTSFASLKHNSEASISKEICTHPGSFGTMCIVCGQLL------DGESGVTFG 134 Query: 1287 YIHKEFELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXLNSARFIDVDPEEESYISST 1108 YIHK L + EI RLR D+K++L KK LNS + + + +EE Sbjct: 135 YIHKGLRLGNDEIVRLRNTDMKNLLRHKKLYLILDLDHTLLNSTQLMHMTLDEE------ 188 Query: 1107 YLVGQRNVESESGDIGR-TLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYA 931 YL GQ + D+ + +L+ L SM M TKLRPFV FL+EA+QM+EMYIYTMG+RAYA Sbjct: 189 YLNGQTD---SLQDVSKGSLFMLSSMQMMTKLRPFVRTFLKEASQMFEMYIYTMGDRAYA 245 Query: 930 EKMAKLLDPKGKFFADRIISSSDSTNRRQKDLDVVLGTESAVVILDDTEPVWPKHKSNLI 751 +MAKLLDP ++F ++IS D T R QK LDVVLG ESAV+ILDDTE W KHK NLI Sbjct: 246 LEMAKLLDPGREYFNAKVISRDDGTQRHQKGLDVVLGQESAVLILDDTENAWMKHKDNLI 305 Query: 750 VVERYHFFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMS 571 ++ERYHFF +SC QF KSL++ K DE+ESEG L+ +L+VLR +HQ FF+ + Sbjct: 306 LMERYHFFASSCHQFGFNCKSLSEQKTDESESEGALASILKVLRKIHQIFFEELEEN--- 362 Query: 570 QFCFDSRDVRLVLKEIRARILKDCKVVFSRVFPSQFRAETHHLWRLAEELGAICSTSLRE 391 D RDVR VLK +R +LK CK+VFSRVFP+Q +A+ HHLWR+AE+LGA CST L Sbjct: 363 ---MDGRDVRQVLKTVRKDVLKGCKIVFSRVFPTQSQADNHHLWRMAEQLGATCSTELDP 419 Query: 390 DVTHVVSLDAGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRF 253 VTHVVS D+GT+KS WA++++ FLV P WIEA+++ WQRQ E+ F Sbjct: 420 SVTHVVSKDSGTEKSHWALKHNKFLVQPGWIEAANYFWQRQPEENF 465 >ref|XP_001781984.1| predicted protein [Physcomitrella patens] gi|162666557|gb|EDQ53208.1| predicted protein [Physcomitrella patens] Length = 563 Score = 383 bits (984), Expect = e-103 Identities = 212/471 (45%), Positives = 293/471 (62%), Gaps = 6/471 (1%) Frame = -1 Query: 1584 ELDSISNASENHQDDDDQEFESSPDIDFDSEDRPRIKRRKLYDNEESEEQLPIKIEDKPE 1405 +L+ ++ + + ++D D ES D + D ++ E S++ PI + E Sbjct: 63 DLEEGTSENGSVEEDADSVVESEDHADDNHLDVEKVA-------ESSDDITPICVNYSGE 115 Query: 1404 SSSARGCL-HPGYMWGVCIKCGQNKPDSEENEQQ-SRVCLKYIHKEFELSDSEIARLRKD 1231 ++ C HPG++W VCI+CG+ K + N+ RV L+YIH+ E+S+ E AR+R Sbjct: 116 MVNSNKCPPHPGFIWDVCIRCGKRKSTAPSNDPVIDRVGLRYIHEGLEVSELEAARVRNA 175 Query: 1230 DLKSVLSAKKXXXXXXXXXXXLNSARFIDVDPEEESYISSTYLVGQRNVESESGDIGRTL 1051 +L+ V +K LNSARF +V EE Y+ T+ GQ++ S L Sbjct: 176 ELRRVTGKQKLLLVVDLDHTMLNSARFSEVPAEERIYL--TWTAGQQHGRVSS------L 227 Query: 1050 YKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADRIIS 871 ++L + MWTKLRPF HKFLEEA+++YEMY+YTMGE+ YA+ MA+LLDP G+ F RIIS Sbjct: 228 HQLTKLGMWTKLRPFAHKFLEEASKLYEMYVYTMGEKIYAQAMAELLDPTGQLFGGRIIS 287 Query: 870 SSDSTNRRQKDLDVVLGTESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHCGTK 691 +DST R KDLDVVLG ESAVVILDDTE VWP H+SNLI++ERYHFF +SC QF Sbjct: 288 QTDSTKRHTKDLDVVLGAESAVVILDDTEAVWPNHRSNLILMERYHFFTSSCHQFRVRAP 347 Query: 690 SLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQF-CFDSRDVRLVLKEIRAR 514 SL Q RDE E +GTL+ L+ L+++H FF+ K M + + DVR V++ IR + Sbjct: 348 SLAQMHRDECEIDGTLATTLKTLQAIHHEFFNGHKGKSMKRRPPLELPDVRDVIRSIRGK 407 Query: 513 ILKDCKVVFSRVFPSQFR-AETHHLWRLAEELGAICSTSLREDVTHVVSLDAGTDKSRWA 337 +L C +VFSR+FP+ + E H W+LA ELGA CST THVV+LD GTDK+RWA Sbjct: 408 LLSGCHIVFSRIFPTGLQNPEFHPFWQLAVELGARCSTVCDHTTTHVVALDRGTDKARWA 467 Query: 336 VQNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQ--SSGFVESVSQLPL 190 Q+ LVHP W+EA+S+LW+R E FPVT S S+ F +++S P+ Sbjct: 468 KQHGISLVHPRWVEAASYLWKRPREKDFPVTDDASALISTTFSKNISVEPI 518 >ref|XP_004962192.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Setaria italica] Length = 543 Score = 382 bits (982), Expect = e-103 Identities = 213/472 (45%), Positives = 290/472 (61%), Gaps = 14/472 (2%) Frame = -1 Query: 1596 LIDFELDSISNA---------SENHQDDDDQEFESSPDIDFDSEDRPRIKRRKLYDNEES 1444 L+D EL+ S A S + DD+ E E S +++ + ++ KRR++ E+S Sbjct: 23 LLDSELELASGADSAFPGDPSSASPDTDDEGEDEDSEEVEVELLEQNSAKRRRV--EEQS 80 Query: 1443 EEQ----LPIKIEDKPESS-SARGCLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIH 1279 ++Q P KI P + C HPGY G+C +CG KP EE+ S V YIH Sbjct: 81 QDQGTSIRPDKIATGPSKNVQVEVCPHPGYFGGLCFRCG--KPQDEEDA--SGVAFGYIH 136 Query: 1278 KEFELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXLNSARFIDVDPEEESYISSTYLV 1099 K L SEI RLR DLK++L +K +NS + D+ E + Sbjct: 137 KGLRLGTSEIDRLRGADLKNLLRERKLVLILDLDHTLINSTKLQDISSAENE-------L 189 Query: 1098 GQRNVESESGDIGRTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMA 919 G R + D R+++ L SM M TKLRPFV FL+EA+ M+EMYIYTMG++AYA ++A Sbjct: 190 GIRTAALKD-DPDRSIFSLDSMQMLTKLRPFVRNFLKEASNMFEMYIYTMGDKAYAIEIA 248 Query: 918 KLLDPKGKFFADRIISSSDSTNRRQKDLDVVLGTESAVVILDDTEPVWPKHKSNLIVVER 739 KLLDP +F ++IS+SD T R QK LDV+LG ES VILDDTE VW KHK NLI++ER Sbjct: 249 KLLDPSNVYFPSKVISNSDCTQRHQKGLDVILGAESVAVILDDTEYVWQKHKENLILMER 308 Query: 738 YHFFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCF 559 YH+F +SC QF G KSL++S +DE ES+G L+ +L VL+ +H FFD +S Sbjct: 309 YHYFASSCRQFGFGVKSLSESMQDERESDGALATVLDVLKRIHTIFFDTAVETALS---- 364 Query: 558 DSRDVRLVLKEIRARILKDCKVVFSRVFPSQFRAETHHLWRLAEELGAICSTSLREDVTH 379 SRDVR V+K +R +L+ CK+VFSRVFP+ R + +W++AE LGA+CST + VTH Sbjct: 365 -SRDVRQVIKTVRKEVLEGCKLVFSRVFPNTSRPQEQMMWKMAEHLGAVCSTDVDSTVTH 423 Query: 378 VVSLDAGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQSS 223 VV++D GT+K+RWAV+N FLVHP WIEA++F W RQ E+ FPV +S+ Sbjct: 424 VVAVDLGTEKARWAVKNKKFLVHPRWIEAANFRWHRQPEEDFPVIPPKEKST 475 >ref|XP_002437361.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor] gi|241915584|gb|EER88728.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor] Length = 558 Score = 382 bits (982), Expect = e-103 Identities = 205/460 (44%), Positives = 288/460 (62%), Gaps = 4/460 (0%) Frame = -1 Query: 1569 SNASENHQDDDDQEFESSPDIDFDSEDRPRIKRRKLYDNEESEEQLPIKIEDKPESSS-- 1396 S+A DD+D++ + D + ++ ++ KRR++ + + ++ ++ + P +S Sbjct: 43 SSAFPAATDDEDEDEDEDEDPEVEAVEQNGTKRRRV-EEQLQDQGTSVRPDKIPTGASKN 101 Query: 1395 --ARGCLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEIARLRKDDLK 1222 C HPGY G+C +CG KP EEN S V YIHK L SEI RLR DLK Sbjct: 102 VQVEACPHPGYFGGLCFRCG--KPQDEENV--SGVAFGYIHKGLRLGTSEIDRLRGADLK 157 Query: 1221 SVLSAKKXXXXXXXXXXXLNSARFIDVDPEEESYISSTYLVGQRNVESESGDIGRTLYKL 1042 ++L +K +NS + D+ E+ +G + S+ D R+++ L Sbjct: 158 NLLRERKLVLILDLDHTLINSTKLQDISSAEKD-------LGIQTAASKD-DPNRSIFSL 209 Query: 1041 PSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADRIISSSD 862 SM M TKLRPFV +FL+EA+ M+EMYIYTMG++AYA ++AKLLDP +F ++IS+SD Sbjct: 210 DSMQMLTKLRPFVREFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPSNIYFPSKVISNSD 269 Query: 861 STNRRQKDLDVVLGTESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHCGTKSLT 682 T R QK LDV+LG ES VILDDTE VW KHK NLI++ERYHFF +SC QF G +SL+ Sbjct: 270 CTQRHQKGLDVILGAESVAVILDDTEYVWQKHKENLILMERYHFFASSCRQFGFGVRSLS 329 Query: 681 QSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVLKEIRARILKD 502 +S +DE ES+G L+ +L VL+ +H FFD ++S S+DVR V+K +R IL+ Sbjct: 330 ESMQDERESDGALATVLDVLKRIHSIFFDLAVETDLS-----SQDVRQVIKAVRKEILQG 384 Query: 501 CKVVFSRVFPSQFRAETHHLWRLAEELGAICSTSLREDVTHVVSLDAGTDKSRWAVQNDC 322 CK+VFSRVFP+ R + LW++AE LGA+CST + VTHVV++D GT+K+RW V N Sbjct: 385 CKIVFSRVFPNNTRPQEQMLWKMAEHLGAVCSTDVDSSVTHVVTVDLGTEKARWGVANKK 444 Query: 321 FLVHPSWIEASSFLWQRQSEDRFPVTKKDSQSSGFVESVS 202 FLVHP WIEA++F W RQ E+ FPVT +S +V+ Sbjct: 445 FLVHPRWIEAANFRWHRQPEEDFPVTAPKEKSRDIDNAVA 484 >ref|XP_004242583.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Solanum lycopersicum] Length = 472 Score = 380 bits (976), Expect = e-102 Identities = 218/472 (46%), Positives = 292/472 (61%), Gaps = 22/472 (4%) Frame = -1 Query: 1593 IDFELDSISNAS------ENHQDDDDQEFESSPDIDFDSE---------DRPRIKRRKLY 1459 +D ELDS S+ S EN + + + E E D D++ D R K+RK+ Sbjct: 21 LDAELDSASDVSPELDEVENGEAEVEVELEDEKGKDEDNDTGDGDDGNIDSRRSKKRKIE 80 Query: 1458 DNEESEEQLPIKIEDKPESSSARG-------CLHPGYMWGVCIKCGQNKPDSEENEQQSR 1300 E + + P + + ES+ G C HPG M G+CI+CGQ D +S Sbjct: 81 LIEAAVD--PQSLVSRGESAETSGASLALDVCTHPGVMGGMCIRCGQKVED------ESG 132 Query: 1299 VCLKYIHKEFELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXLNSARFIDVDPEEESY 1120 V YIHK L+D E+ARLR+ DLK++L +K LNS R D+ EE Sbjct: 133 VAFGYIHKNLRLADDEVARLREKDLKNLLRHRKLILVLDLDHTLLNSTRLADISAEE--- 189 Query: 1119 ISSTYLVGQRNVESESGDIGRTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGER 940 +YL QR V ++ + L+KL +HM TKLRPFVH FL+EA+ ++EMYIYTMGER Sbjct: 190 ---SYLKDQREVLPDA--LRSNLFKLDWIHMMTKLRPFVHTFLKEASSLFEMYIYTMGER 244 Query: 939 AYAEKMAKLLDPKGKFFADRIISSSDSTNRRQKDLDVVLGTESAVVILDDTEPVWPKHKS 760 YA +MAKLLDP G +F R+I+ SDST R QK LDVVLG ESAV+ILDDTE VW KH+ Sbjct: 245 PYALEMAKLLDPGGIYFHSRVIAQSDSTRRHQKGLDVVLGQESAVLILDDTEVVWGKHRE 304 Query: 759 NLIVVERYHFFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNK 580 NLI+++RYHFF +SC QF KSL++ K DE E+EG L+ +L+VL+ +H+ FFD Sbjct: 305 NLILMDRYHFFTSSCRQFGLKCKSLSEQKSDENEAEGALASVLEVLQRIHRLFFDPERGD 364 Query: 579 EMSQFCFDSRDVRLVLKEIRARILKDCKVVFSRVFPSQFRAETHHLWRLAEELGAICSTS 400 + + RDVR VLK +R ILK CK+VF+ V P Q + E H+ W+LAE+LGA ST Sbjct: 365 NIME-----RDVRQVLKTVRKEILKGCKIVFTGVIPIQCQPENHYYWKLAEKLGATFSTE 419 Query: 399 LREDVTHVVSLDAGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVT 244 + E VTHVVS++ T+KSR AV+ FLVHP WIEA+++LW++ E+ FPV+ Sbjct: 420 VDESVTHVVSMNDKTEKSRQAVREKKFLVHPRWIEAANYLWRKPPEENFPVS 471 >ref|XP_006343662.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Solanum tuberosum] Length = 478 Score = 380 bits (975), Expect = e-102 Identities = 218/475 (45%), Positives = 288/475 (60%), Gaps = 25/475 (5%) Frame = -1 Query: 1593 IDFELDSISNAS------ENHQDDDDQEFESSP--------------DIDFDSEDRPRIK 1474 +D ELDS S+ S EN + + ++E E D D S D R K Sbjct: 22 LDAELDSASDVSPELDEVENGEAEGEEEVEDEKGQDEGNDTGDGDDDDDDDGSIDSSRSK 81 Query: 1473 RRKLYDNEESEEQLPIKIEDKPESSSARG-----CLHPGYMWGVCIKCGQNKPDSEENEQ 1309 +RK+ E + + +P +S C HPG M G+CI+CGQ D Sbjct: 82 KRKIELIEAAVDPQSSVSRGEPAETSGASLALDVCTHPGVMGGMCIRCGQKVED------ 135 Query: 1308 QSRVCLKYIHKEFELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXLNSARFIDVDPEE 1129 +S V YIHK L+D E+ARLR DLK++L KK LNS R D+ EE Sbjct: 136 ESGVAFGYIHKNLRLADDEVARLRDKDLKNLLRHKKLILVLDLDHTLLNSTRLADISAEE 195 Query: 1128 ESYISSTYLVGQRNVESESGDIGRTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTM 949 +YL QR V ++ + L+KL +HM TKLRPFVH FL+EA+ ++EMYIYTM Sbjct: 196 ------SYLKDQREVLPDA--LRNNLFKLDWIHMMTKLRPFVHTFLKEASSLFEMYIYTM 247 Query: 948 GERAYAEKMAKLLDPKGKFFADRIISSSDSTNRRQKDLDVVLGTESAVVILDDTEPVWPK 769 GER YA +MA LLDP G +F R+I+ SDST R QK LDVVLG ESAV+ILDDTE VW K Sbjct: 248 GERPYALEMASLLDPGGIYFHSRVIAQSDSTRRHQKGLDVVLGQESAVLILDDTEVVWGK 307 Query: 768 HKSNLIVVERYHFFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNI 589 H+ NLI+++RYHFF +SC QF KSL++ K DE E+EG L+ +L+VL+ +H+ FFD Sbjct: 308 HRENLILMDRYHFFTSSCRQFGLKCKSLSEQKSDENEAEGALASVLEVLQRIHRLFFDLE 367 Query: 588 SNKEMSQFCFDSRDVRLVLKEIRARILKDCKVVFSRVFPSQFRAETHHLWRLAEELGAIC 409 + + RDVR VLK +R ILK CK+VF+ V P Q + E HH W+LAE+LGA Sbjct: 368 RGDNIME-----RDVRQVLKTVRKEILKGCKIVFTGVIPIQCQPENHHYWKLAEKLGATF 422 Query: 408 STSLREDVTHVVSLDAGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVT 244 ST + E VTHVVS++ T+KSR A++ FLVHPSWIEA+++LW++ E+ FPV+ Sbjct: 423 STEVDESVTHVVSMNDKTEKSRQALREKKFLVHPSWIEAANYLWRKPPEENFPVS 477 >ref|XP_004242582.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Solanum lycopersicum] Length = 512 Score = 379 bits (973), Expect = e-102 Identities = 215/460 (46%), Positives = 288/460 (62%), Gaps = 10/460 (2%) Frame = -1 Query: 1593 IDFELDSISNASE----NHQDDDDQEFESSPDIDFD-SEDRPRIKRRKLYDNEESEEQLP 1429 +D ELDS S+ E + +++ E E + D D S D R K+RK+ E + + Sbjct: 71 LDAELDSASDVDEVESGEAEGEEEVEDEDNDTGDGDGSIDSSRSKKRKIELIEGAVDPQS 130 Query: 1428 IKIEDKPESSSARG-----CLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFEL 1264 +P +S C HPG M G+CI+CGQ D +S V YIHK L Sbjct: 131 SVSRGEPAETSGASMALDVCTHPGVMGGMCIRCGQKVED------ESGVAFGYIHKNLRL 184 Query: 1263 SDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXLNSARFIDVDPEEESYISSTYLVGQRNV 1084 +D E+ARLR+ DLK++L +K LNS R D+ EE +YL QR V Sbjct: 185 ADDEVARLREKDLKNLLRHRKLILVLDLDHTLLNSTRLADISAEE------SYLKDQREV 238 Query: 1083 ESESGDIGRTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDP 904 ++ + L+KL +HM TKLRPFVH FL+EA+ ++EMYIYTMGER YA +MAKLLDP Sbjct: 239 LPDA--LRSNLFKLDWIHMMTKLRPFVHTFLKEASSLFEMYIYTMGERPYALEMAKLLDP 296 Query: 903 KGKFFADRIISSSDSTNRRQKDLDVVLGTESAVVILDDTEPVWPKHKSNLIVVERYHFFH 724 G +F R+I+ SDST R QK LDVVLG ESAV+ILDDTE VW KH+ NLI+++RYHFF Sbjct: 297 GGIYFHSRVIAQSDSTRRHQKGLDVVLGQESAVLILDDTEVVWGKHRENLILMDRYHFFT 356 Query: 723 ASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDV 544 +SC QF KSL++ K DE E+EG L+ +L+VL+ +H+ FFD + + RDV Sbjct: 357 SSCRQFGLKCKSLSEQKSDENEAEGALASVLEVLQRIHRLFFDPERGDNIME-----RDV 411 Query: 543 RLVLKEIRARILKDCKVVFSRVFPSQFRAETHHLWRLAEELGAICSTSLREDVTHVVSLD 364 R VLK +R ILK CK+VF+ V P Q + E H+ W+LAE+LGA ST + E VTHVVS++ Sbjct: 412 RQVLKTVRKEILKGCKIVFTGVIPIQCQPENHYYWKLAEKLGATFSTEVDESVTHVVSMN 471 Query: 363 AGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVT 244 T+KSR AV+ FLVHP WIEA+++LW++ E+ FPV+ Sbjct: 472 DKTEKSRQAVREKKFLVHPRWIEAANYLWRKPPEENFPVS 511 >ref|XP_004287124.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Fragaria vesca subsp. vesca] Length = 464 Score = 377 bits (969), Expect = e-102 Identities = 212/462 (45%), Positives = 289/462 (62%), Gaps = 11/462 (2%) Frame = -1 Query: 1593 IDFELDSISNASENHQD------DDDQEFESSPDIDFDSEDRPRIKRRKLYDNEESEEQL 1432 ++ EL+S S+ S ++ DDD ESS D E R+KRRK+ + E EE Sbjct: 20 LETELESGSSESSPDEECKAAVGDDDGGSESS-----DVESESRVKRRKVENVEILEEAN 74 Query: 1431 PIKIEDKPES-SSARG----CLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFE 1267 + + E S A G C HPG +C CGQ + QS V YIHK Sbjct: 75 ALTSQAVSEEISEASGVDDLCAHPGSFGDMCFLCGQRLIE------QSGVTFGYIHKGLR 128 Query: 1266 LSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXLNSARFIDVDPEEESYISSTYLVGQRN 1087 L+D EI RLR D+K L+ KK LN+ V +EE + Sbjct: 129 LNDGEIDRLRNTDIKKSLNNKKLYLVLDLDHTLLNTTLLNHVTAKEEYLMCPP------- 181 Query: 1086 VESESGDIGRTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLD 907 +S + +L++L M M TKLRPF+ FL+EA++++EMYIYTMG+RAYA +MAKLLD Sbjct: 182 -DSLPDVLKDSLFRLDFMRMMTKLRPFIRTFLKEASEIFEMYIYTMGDRAYALEMAKLLD 240 Query: 906 PKGKFFADRIISSSDSTNRRQKDLDVVLGTESAVVILDDTEPVWPKHKSNLIVVERYHFF 727 PK ++F DR+IS D T R QK LD+VLG ESAV+ILDDTE W KHK NLI++ERYHFF Sbjct: 241 PKKEYFGDRVISRDDGTQRHQKGLDIVLGQESAVLILDDTENAWIKHKDNLILMERYHFF 300 Query: 726 HASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRD 547 +SC QF +SL++ K DE+E EG L+ +L +L+ +H+ FF ++ + RD Sbjct: 301 RSSCAQFGFTCESLSELKSDESEPEGALANVLDLLKRIHKMFFYDLGGNLV------DRD 354 Query: 546 VRLVLKEIRARILKDCKVVFSRVFPSQFRAETHHLWRLAEELGAICSTSLREDVTHVVSL 367 VR VLK +R +L CKVVFSR+ PS+ A +HHLW++AE+LGAICST + VTHVV+L Sbjct: 355 VRQVLKIVRKEVLNGCKVVFSRIIPSKVLASSHHLWKMAEQLGAICSTEVDSTVTHVVAL 414 Query: 366 DAGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVTK 241 DAGT+KSRWAV+++ FLVHP W+EA++++WQ+Q+E++FPVT+ Sbjct: 415 DAGTEKSRWAVKHNKFLVHPRWLEAANYMWQKQAEEKFPVTE 456 >dbj|BAB08870.1| unnamed protein product [Arabidopsis thaliana] Length = 1065 Score = 376 bits (965), Expect = e-101 Identities = 202/455 (44%), Positives = 281/455 (61%) Frame = -1 Query: 1593 IDFELDSISNASENHQDDDDQEFESSPDIDFDSEDRPRIKRRKLYDNEESEEQLPIKIED 1414 +D ELDS S+AS ++++ E + +KR+KL EE+ Sbjct: 647 LDAELDSASDASSGPSEEEEAE----------DDVESGLKRQKLEHLEEA---------- 686 Query: 1413 KPESSSARGCLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEIARLRK 1234 SSS C HPG +C CGQ +++ V +YIHKE L++ EI+RLR Sbjct: 687 ---SSSKGECEHPGSFGNMCFVCGQKL-------EETGVSFRYIHKEMRLNEDEISRLRD 736 Query: 1233 DDLKSVLSAKKXXXXXXXXXXXLNSARFIDVDPEEESYISSTYLVGQRNVESESGDIGRT 1054 D + + +K LN+ D+ PEEE S T+ +++ G + Sbjct: 737 SDSRFLQRQRKLYLVLDLDHTLLNTTILRDLKPEEEYLKSHTH-----SLQDGCNVSGGS 791 Query: 1053 LYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADRII 874 L+ L M M TKLRPFVH FL+EA++M+ MYIYTMG+R YA +MAKLLDPKG++F DR+I Sbjct: 792 LFLLEFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGDRVI 851 Query: 873 SSSDSTNRRQKDLDVVLGTESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHCGT 694 S D T R +K LDVVLG ESAV+ILDDTE WPKHK NLIV+ERYHFF +SC QF Sbjct: 852 SRDDGTVRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQFDHRY 911 Query: 693 KSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVLKEIRAR 514 KSL++ K DE+E +G L+ +L+VL+ H FF+N+ +RDVRL+LK++R Sbjct: 912 KSLSELKSDESEPDGALATVLKVLKQAHALFFENVDEG------ISNRDVRLMLKQVRKE 965 Query: 513 ILKDCKVVFSRVFPSQFRAETHHLWRLAEELGAICSTSLREDVTHVVSLDAGTDKSRWAV 334 ILK CK+VFSRVFP++ + E H LW++AEELGA C+T + VTHVV++D GT+K+RWAV Sbjct: 966 ILKGCKIVFSRVFPTKAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEKARWAV 1025 Query: 333 QNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQ 229 + ++VH WI+A+++LW +Q E+ F + + Q Sbjct: 1026 REKKYVVHRGWIDAANYLWMKQPEENFGLEQLKKQ 1060 >ref|NP_001078764.1| C-terminal domain phosphatase-like 4 [Arabidopsis thaliana] gi|122154038|sp|Q00IB6.1|CPL4_ARATH RecName: Full=RNA polymerase II C-terminal domain phosphatase-like 4; Short=FCP-like 4; AltName: Full=Carboxyl-terminal phosphatase-like 4; Short=AtCPL4; Short=CTD phosphatase-like 4 gi|95115186|gb|ABF55959.1| carboxyl-terminal phosphatase-like 4 [Arabidopsis thaliana] gi|332009601|gb|AED96984.1| C-terminal domain phosphatase-like 4 [Arabidopsis thaliana] Length = 440 Score = 376 bits (965), Expect = e-101 Identities = 202/455 (44%), Positives = 281/455 (61%) Frame = -1 Query: 1593 IDFELDSISNASENHQDDDDQEFESSPDIDFDSEDRPRIKRRKLYDNEESEEQLPIKIED 1414 +D ELDS S+AS ++++ E + +KR+KL EE+ Sbjct: 22 LDAELDSASDASSGPSEEEEAE----------DDVESGLKRQKLEHLEEA---------- 61 Query: 1413 KPESSSARGCLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEIARLRK 1234 SSS C HPG +C CGQ +++ V +YIHKE L++ EI+RLR Sbjct: 62 ---SSSKGECEHPGSFGNMCFVCGQKL-------EETGVSFRYIHKEMRLNEDEISRLRD 111 Query: 1233 DDLKSVLSAKKXXXXXXXXXXXLNSARFIDVDPEEESYISSTYLVGQRNVESESGDIGRT 1054 D + + +K LN+ D+ PEEE S T+ +++ G + Sbjct: 112 SDSRFLQRQRKLYLVLDLDHTLLNTTILRDLKPEEEYLKSHTH-----SLQDGCNVSGGS 166 Query: 1053 LYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADRII 874 L+ L M M TKLRPFVH FL+EA++M+ MYIYTMG+R YA +MAKLLDPKG++F DR+I Sbjct: 167 LFLLEFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGDRVI 226 Query: 873 SSSDSTNRRQKDLDVVLGTESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHCGT 694 S D T R +K LDVVLG ESAV+ILDDTE WPKHK NLIV+ERYHFF +SC QF Sbjct: 227 SRDDGTVRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQFDHRY 286 Query: 693 KSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVLKEIRAR 514 KSL++ K DE+E +G L+ +L+VL+ H FF+N+ +RDVRL+LK++R Sbjct: 287 KSLSELKSDESEPDGALATVLKVLKQAHALFFENVDEG------ISNRDVRLMLKQVRKE 340 Query: 513 ILKDCKVVFSRVFPSQFRAETHHLWRLAEELGAICSTSLREDVTHVVSLDAGTDKSRWAV 334 ILK CK+VFSRVFP++ + E H LW++AEELGA C+T + VTHVV++D GT+K+RWAV Sbjct: 341 ILKGCKIVFSRVFPTKAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEKARWAV 400 Query: 333 QNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQ 229 + ++VH WI+A+++LW +Q E+ F + + Q Sbjct: 401 REKKYVVHRGWIDAANYLWMKQPEENFGLEQLKKQ 435 >ref|XP_006486243.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like isoform X1 [Citrus sinensis] gi|568865772|ref|XP_006486244.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like isoform X2 [Citrus sinensis] gi|568865774|ref|XP_006486245.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like isoform X3 [Citrus sinensis] Length = 478 Score = 374 bits (960), Expect = e-101 Identities = 213/470 (45%), Positives = 288/470 (61%), Gaps = 13/470 (2%) Frame = -1 Query: 1593 IDFELDSIS-------NASENHQDDDDQEFESSPDIDFDSEDRPRIKRRKLYDNEESEEQ 1435 +D ELDS S A + +D++ + E+ + D +D RIKRRK E +E+ Sbjct: 21 LDAELDSNSLGSSPEKEAEDKDEDEESIDEEAENEEARDDKDLERIKRRKTQIVETIQER 80 Query: 1434 ----LPIKIEDKPESS-SARGCLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEF 1270 L +E+K E S C HPG + G+C +CG+ E++S V YI K Sbjct: 81 PGPTLLGNLEEKTEVSLEMDNCPHPGSLGGMCYRCGKRL------EEESGVTFSYICKGL 134 Query: 1269 ELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXLNSARFIDVDPEEESYISSTYLVGQR 1090 L + EI RLR D+K +L +K LNS + + PEE+ YL Q Sbjct: 135 RLGNDEIDRLRNTDMKHLLRHRKLYLILDLDHTLLNSTLLLHLTPEED------YLKSQA 188 Query: 1089 NVESESGDIGR-TLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKL 913 + D+ + +L+ L M+M TKLRPFVH FL+EA++M+EMYIYTMG+R YA +MAKL Sbjct: 189 D---SLQDVSKGSLFMLAFMNMMTKLRPFVHTFLKEASEMFEMYIYTMGDRPYALEMAKL 245 Query: 912 LDPKGKFFADRIISSSDSTNRRQKDLDVVLGTESAVVILDDTEPVWPKHKSNLIVVERYH 733 LDP ++F R+IS D T R QK LDVVLG ESAV+ILDDTE W KH+ NLI++ERYH Sbjct: 246 LDPSREYFNARVISRDDGTQRHQKGLDVVLGQESAVLILDDTENAWTKHRDNLILMERYH 305 Query: 732 FFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDS 553 FF +SC QF +SL+Q + DE+E EG L+ +L+VL+ +H FFD ++N Sbjct: 306 FFASSCRQFGYHCQSLSQLRSDESELEGALASVLKVLKRIHNIFFDELAND------LAG 359 Query: 552 RDVRLVLKEIRARILKDCKVVFSRVFPSQFRAETHHLWRLAEELGAICSTSLREDVTHVV 373 RDVR VLK +R +LK CK+VFS VFP++F A+TH+LW++AE+LGA C L VTHVV Sbjct: 360 RDVRQVLKMVRGEVLKGCKLVFSHVFPTKFPADTHYLWKMAEQLGATCLIELDPSVTHVV 419 Query: 372 SLDAGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQSS 223 S DA T+KSRWA + FLV P WIE ++FLWQRQ E+ FPV + + + Sbjct: 420 STDARTEKSRWAAKEAKFLVDPRWIETANFLWQRQPEENFPVKQNKPEEN 469 >ref|XP_002324546.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa] gi|550318537|gb|EEF03111.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa] Length = 468 Score = 372 bits (954), Expect = e-100 Identities = 218/467 (46%), Positives = 289/467 (61%), Gaps = 20/467 (4%) Frame = -1 Query: 1593 IDFELDSISNASENHQDD------DDQEFESSPDIDFDSED-------RPRIKRRKLYDN 1453 +D ELDS S+AS D+ D SSPD D ++E+ R R+KR K+ Sbjct: 21 LDTELDSKSSASSASDDEAPNQRHSDSAASSSPDQDKEAEEDDDSDFQRKRVKRSKVETV 80 Query: 1452 EESEEQLPI----KIEDKPESSSARG-CLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLK 1288 E E+ ++ E+S ++ C HPG +CI CGQ + +S V Sbjct: 81 EIVEDDGGTTSFASLKHNSEASISKEICTHPGSFGTMCIVCGQLL------DGESGVTFG 134 Query: 1287 YIHKEFELSDSEIARLRKDDLKSVLSAKKXXXXXXXXXXXLNSARFIDVDPEEESYISST 1108 YIHK L + EI RLR D+K++L KK LNS + + + +EE Sbjct: 135 YIHKGLRLGNDEIVRLRNTDMKNLLRHKKLYLILDLDHTLLNSTQLMHMTLDEE------ 188 Query: 1107 YLVGQRNVESESGDIGR-TLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYA 931 YL GQ + D+ + +L+ L SM M TKLRPFV FL+EA+QM+EMYIYTMG+RAYA Sbjct: 189 YLNGQTD---SLQDVSKGSLFMLSSMQMMTKLRPFVRTFLKEASQMFEMYIYTMGDRAYA 245 Query: 930 EKMAKLLDPKGKFFADRIISSSDSTNRRQKDLDVVLGTESAVVILDDTEPVWPKHKSNLI 751 +MAKLLDP ++F ++IS D T R QK LDVVLG ESAV+ILDDTE W KHK NLI Sbjct: 246 LEMAKLLDPGREYFNAKVISRDDGTQRHQKGLDVVLGQESAVLILDDTENAWMKHKDNLI 305 Query: 750 VVERYHFFHASCLQFHCGTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFF-DNISNKEM 574 ++ERYHFF +SC QF KSL++ K DE+ESEG L+ +L+VLR +HQ FF D+I + + Sbjct: 306 LMERYHFFASSCHQFGFNCKSLSEQKTDESESEGALASILKVLRKIHQIFFEDHILSLAL 365 Query: 573 SQFCFDSRDVRLVLKEIRARILKDCKVVFSRVFPSQFRAETHHLWRLAEELGAICSTSLR 394 VLK +R +LK CK+VFSRVFP+Q +A+ HHLWR+AE+LGA CST L Sbjct: 366 Q-----------VLKTVRKDVLKGCKIVFSRVFPTQSQADNHHLWRMAEQLGATCSTELD 414 Query: 393 EDVTHVVSLDAGTDKSRWAVQNDCFLVHPSWIEASSFLWQRQSEDRF 253 VTHVVS D+GT+KS WA++++ FLV P WIEA+++ WQRQ E+ F Sbjct: 415 PSVTHVVSKDSGTEKSHWALKHNKFLVQPGWIEAANYFWQRQPEENF 461 >ref|XP_002965594.1| hypothetical protein SELMODRAFT_167775 [Selaginella moellendorffii] gi|300166408|gb|EFJ33014.1| hypothetical protein SELMODRAFT_167775 [Selaginella moellendorffii] Length = 411 Score = 371 bits (953), Expect = e-100 Identities = 192/396 (48%), Positives = 266/396 (67%), Gaps = 12/396 (3%) Frame = -1 Query: 1368 MWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEIARLRKDDLKSVLSAKKXXXX 1189 MWGVCI+CG KP+SE S V LKYIH+EFEL+ +AR+R+D+L+ VL +K Sbjct: 1 MWGVCIRCGVLKPNSEPGGSASNVALKYIHEEFELAGDVLARVREDELRQVLGKRKLFLV 60 Query: 1188 XXXXXXXLNSARFIDVDPEEESYISSTYL-VGQRNVESESGDI----------GRTLYKL 1042 LNSAR+++V P+E +Y+ TY+ V + + + S G L+++ Sbjct: 61 LDLDHTLLNSARWMEVFPDETAYLEHTYMNVPEDKIPALSNGAPAVAGVIQPGGGGLHRI 120 Query: 1041 PSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADRIISSSD 862 M +WTKLRPF HKFLEEA++++EMY+YTMGER YA MA LLDP GKFF R+IS D Sbjct: 121 HGMQLWTKLRPFAHKFLEEASKLFEMYVYTMGERMYAVTMAHLLDPTGKFFKGRVISQRD 180 Query: 861 STNRRQKDLDVVLGTESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHCGTKSLT 682 ST R+ KDLD+VLG +SAV+ILDDTE VWPKH++NLIV+ERYHFF +SC QF SLT Sbjct: 181 STCRQTKDLDIVLGADSAVLILDDTEAVWPKHRANLIVMERYHFFQSSCRQFGLENPSLT 240 Query: 681 QSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVLKEIRARILKD 502 +++RDE++ EG L+ +L+VL+ +H FF + S++ D RD+ V +R+ IL Sbjct: 241 KAERDESKDEGALANVLKVLQRIHSDFF---MESDDSRYTCDVRDITSV---VRSEILSG 294 Query: 501 CKVVFSRVFPSQ-FRAETHHLWRLAEELGAICSTSLREDVTHVVSLDAGTDKSRWAVQND 325 CK+VFSR+FP+ E LWRL +LGA C + + VTHVV+LD TDK++WA ++ Sbjct: 295 CKLVFSRIFPTDCLEPELTPLWRLCVDLGAECVLAHDDSVTHVVALDRFTDKAKWAKEHR 354 Query: 324 CFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQSSGF 217 FLVHP+W+EA+ LW+R +E FPV + +++ F Sbjct: 355 KFLVHPAWVEAAHSLWRRPNELEFPVREGQTRAPVF 390 >ref|XP_002864543.1| hypothetical protein ARALYDRAFT_332090 [Arabidopsis lyrata subsp. lyrata] gi|297310378|gb|EFH40802.1| hypothetical protein ARALYDRAFT_332090 [Arabidopsis lyrata subsp. lyrata] Length = 1006 Score = 370 bits (951), Expect = e-100 Identities = 204/457 (44%), Positives = 288/457 (63%), Gaps = 2/457 (0%) Frame = -1 Query: 1593 IDFELDSISNASENHQDDDDQEFESSPDIDFDSEDRPRIKRRKLYDNEESEEQLPIKIED 1414 +D ELDS S+AS +++++ + ++ +KRRKL E +E E+ Sbjct: 583 LDAELDSASDASSGPSEEEEEA---------EDDEESGLKRRKLEHLETVDE------EE 627 Query: 1413 KPESSSARG-CLHPGYMWGVCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEIARLR 1237 E+SS++G C HPG +C CGQ +++ V +YIHKE L++ EI+RLR Sbjct: 628 IEEASSSKGECQHPGSFGNMCFVCGQKL-------EETGVSFRYIHKEMRLNEDEISRLR 680 Query: 1236 KDDLKSVLSAKKXXXXXXXXXXXLNSARFIDVDPEEESYISSTYLVGQRNVESESGDI-G 1060 D + + +K LNS D+ PEEE S T+ + + D+ G Sbjct: 681 DSDSRFLQRQRKLYLVLDLDHTLLNSTVLRDLKPEEEYLKSHTHSLQEPFDFLLISDVSG 740 Query: 1059 RTLYKLPSMHMWTKLRPFVHKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADR 880 +L+ L MHM TKLRPFVH FL+EA++M+ MYIYTMG+RAYA +MAKLLDP+G++F DR Sbjct: 741 GSLFMLEFMHMMTKLRPFVHSFLKEASEMFVMYIYTMGDRAYARQMAKLLDPRGEYFGDR 800 Query: 879 IISSSDSTNRRQKDLDVVLGTESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHC 700 IIS D T R QK LDVVLG ESAV+ILDDTE WP HK NLIV+ERYHFF +SC QF Sbjct: 801 IISRDDGTVRHQKSLDVVLGQESAVLILDDTENAWPNHKDNLIVIERYHFFASSCRQFDH 860 Query: 699 GTKSLTQSKRDEAESEGTLSILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVLKEIR 520 KSL++ K DE+E +G L+ +L ++ ++ISN RDVR +LK++R Sbjct: 861 KYKSLSELKSDESEPDGALATVL-------KNVDEDISN----------RDVRSMLKQVR 903 Query: 519 ARILKDCKVVFSRVFPSQFRAETHHLWRLAEELGAICSTSLREDVTHVVSLDAGTDKSRW 340 +LK CKVVFSRVFP++ + E H LW++AEELGA C+T + VTHVV++D GT+K+RW Sbjct: 904 KEVLKGCKVVFSRVFPTKAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEKARW 963 Query: 339 AVQNDCFLVHPSWIEASSFLWQRQSEDRFPVTKKDSQ 229 AV+ ++VH WI+A+++LW++Q E++F + + Q Sbjct: 964 AVREKKYVVHRGWIDAANYLWKKQPEEKFSLEQLKKQ 1000 >gb|EOY32065.1| RNA polymerase II ctd phosphatase, putative isoform 2 [Theobroma cacao] Length = 357 Score = 368 bits (944), Expect = 5e-99 Identities = 195/378 (51%), Positives = 252/378 (66%), Gaps = 1/378 (0%) Frame = -1 Query: 1359 VCIKCGQNKPDSEENEQQSRVCLKYIHKEFELSDSEIARLRKDDLKSVLSAKKXXXXXXX 1180 +CI CGQ D +S V YIHK L + EI RLR D+K++L KK Sbjct: 1 MCILCGQRLDD------ESGVTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHKKLYLVLDL 54 Query: 1179 XXXXLNSARFIDVDPEEESYISSTYLVGQRNVESESGDIGR-TLYKLPSMHMWTKLRPFV 1003 LNS + + + P+EE YL GQ + D+ R +L+ L MHM TKLRPFV Sbjct: 55 DHTLLNSTQLMHLTPDEE------YLKGQSD---SLQDVSRGSLFMLDFMHMMTKLRPFV 105 Query: 1002 HKFLEEANQMYEMYIYTMGERAYAEKMAKLLDPKGKFFADRIISSSDSTNRRQKDLDVVL 823 FL+EA++M+EMYIYTMG+R YA +MAKLLDP+ ++F+DR+IS D T + QK LDVVL Sbjct: 106 RTFLKEASEMFEMYIYTMGDRPYALEMAKLLDPRREYFSDRVISRDDGTQKHQKGLDVVL 165 Query: 822 GTESAVVILDDTEPVWPKHKSNLIVVERYHFFHASCLQFHCGTKSLTQSKRDEAESEGTL 643 G ESAVVILDDTE W KHK NLI++ERYH+F +SC QF KSL+Q K DE+E +G L Sbjct: 166 GQESAVVILDDTENAWMKHKDNLILMERYHYFASSCHQFGYKCKSLSQLKSDESEPDGAL 225 Query: 642 SILLQVLRSVHQSFFDNISNKEMSQFCFDSRDVRLVLKEIRARILKDCKVVFSRVFPSQF 463 + +L+ LR +H FFD + SRDVR VLK ++ +LK CK+VFS VFP+ F Sbjct: 226 ASVLKALRQIHHMFFDELDCN------LASRDVRQVLKTVQEEVLKGCKIVFSHVFPTNF 279 Query: 462 RAETHHLWRLAEELGAICSTSLREDVTHVVSLDAGTDKSRWAVQNDCFLVHPSWIEASSF 283 AE+H LW++AE+LGA CST VTHVVS DAGT+KSRWAV+ FLVHP WIEA+++ Sbjct: 280 PAESHPLWKMAEQLGATCSTETDLSVTHVVSTDAGTEKSRWAVKEKKFLVHPRWIEATNY 339 Query: 282 LWQRQSEDRFPVTKKDSQ 229 LWQ+Q E+ FPV++ +Q Sbjct: 340 LWQKQPEENFPVSQGKNQ 357