BLASTX nr result
ID: Ephedra25_contig00012587
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Ephedra25_contig00012587
         (1548 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score   E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_006838087.1| hypothetical protein AMTR_s00106p00017820 [A...  360   9e-97
ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Popu...  336   1e-89
dbj|BAB08870.1| unnamed protein product [Arabidopsis thaliana]       333   9e-89
ref|NP_001078764.1| C-terminal domain phosphatase-like 4 [Arabid...  333   9e-89
ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative ...  331   6e-88
ref|XP_004242582.1| PREDICTED: RNA polymerase II C-terminal doma...  330   1e-87
ref|XP_002864543.1| hypothetical protein ARALYDRAFT_332090 [Arab...  328   4e-87
ref|XP_004242583.1| PREDICTED: RNA polymerase II C-terminal doma...  328   5e-87
ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal doma...  327   8e-87
ref|XP_006343662.1| PREDICTED: RNA polymerase II C-terminal doma...  325   4e-86
ref|XP_002324546.2| hypothetical protein POPTR_0018s11760g [Popu...  323   9e-86
ref|XP_001781984.1| predicted protein [Physcomitrella patens] gi...  323   9e-86
gb|EXC26161.1| RNA polymerase II C-terminal domain phosphatase-l...  322   3e-85
gb|EOY32064.1| RNA polymerase II ctd phosphatase, putative isofo...  322   3e-85
gb|EMJ05544.1| hypothetical protein PRUPE_ppa005647mg [Prunus pe...  322   3e-85
dbj|BAK07377.1| predicted protein [Hordeum vulgare subsp. vulgare]   321   6e-85
ref|XP_006401141.1| hypothetical protein EUTSA_v10013455mg [Eutr...  320   1e-84
ref|XP_006486243.1| PREDICTED: RNA polymerase II C-terminal doma...  318   3e-84
ref|XP_006600548.1| PREDICTED: RNA polymerase II C-terminal doma...
318 5e-84 gb|EPS63533.1| hypothetical protein M569_11249, partial [Genlise... 318 5e-84 >ref|XP_006838087.1| hypothetical protein AMTR_s00106p00017820 [Amborella trichopoda] gi|548840545|gb|ERN00656.1| hypothetical protein AMTR_s00106p00017820 [Amborella trichopoda] Length = 486 Score = 360 bits (924), Expect = 9e-97 Identities = 205/426 (48%), Positives = 264/426 (61%), Gaps = 7/426 (1%) Frame = +3 Query: 90 EDQNQRIKRRRLFEGTEEILES----ANVQEEEQLS-SISEK-CPPHPGYMWGVCILCGQ 251 E + +RIKR ++ E EEI ES AN E + S SEK CPPHPG+ +CI CG+ Sbjct: 63 EIELERIKRPKICED-EEIKESQSSNANQGELDNFKESTSEKVCPPHPGFYKDMCIRCGE 121 Query: 252 IKKDMENEEK-SGVSLKYIHKDLELADSEITRFREDGLASLISHQQKXXXXXXXXXXXXN 428 K D K + V+ YIHKDL+L E+ R R L +L ++K N Sbjct: 122 QKDDETVARKETAVAFNYIHKDLKLGAEEVARLRATDLKNLY-RRRKLYLVLDLDHTLLN 180 Query: 429 SARFADVLLEEESYITSRYLAGETNDGSASDKSKSTNGDLLKLEALKIWTKLRPFVHTFL 608 S R DV EEE+Y+ + YL ET S + T+G L KLE L + TKLRPFV TFL Sbjct: 181 STRLVDVSPEEEAYLNATYLNKET-----SSSNGDTSGTLFKLEPLHMLTKLRPFVRTFL 235 Query: 609 EEVSEMYEMCINTMGERFYALKMAKLLDPSGKYFGTRIISATDSTHRHQKNLDVVLGAES 788 +E + M+EM + TMGER YAL+MAKLLDPSG YFG+R+IS DST RHQK LDVVLG+E Sbjct: 236 KEANTMFEMYVYTMGERAYALEMAKLLDPSGVYFGSRVISQGDSTVRHQKGLDVVLGSEC 295 Query: 789 AVVILDDTENVWPKHRSNLIVVERYHFFRSSCQQFNIQKQSLTEAMTDELETQGTXXXXX 968 AVVILDDTE+VW KH+ NL+++ERYHFF SSC+QFN+ +SL+E DE E+ G Sbjct: 296 AVVILDDTEHVWHKHKENLVLMERYHFFSSSCRQFNVHYKSLSELKRDESESDGMLASIL 355 Query: 969 XXXXXXHQMFFDKRNVEDFSQFGLNSRDVREILKEICSKVLEGCKLVFSGVFPTQCXXXX 1148 HQMF+ + DF N DVR++LK I S+VL+GC+LVFS +FPT Sbjct: 356 NVLKHIHQMFYYQEVETDF-----NGSDVRKVLKTIQSEVLKGCRLVFSRIFPTNYPVEN 410 Query: 1149 XXXXXXXXXXXXKCATTVREDVTHVIALDSGTDASRWAIQNNRFLVHPHWLEASKYLWQR 1328 C+ + E VTHV++LD GT+ +RWAIQ + LV+P WLEA+ Y W+R Sbjct: 411 QTLWRIAEQLGASCSKELDEAVTHVVSLDLGTEKARWAIQRKKHLVNPGWLEATNYFWKR 470 Query: 1329 QPEEKF 1346 QPE++F Sbjct: 471 QPEDQF 476 >ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa] gi|550318538|gb|EEF03112.2| hypothetical protein 
POPTR_0018s11760g [Populus trichocarpa] Length = 472 Score = 336 bits (862), Expect = 1e-89 Identities = 191/450 (42%), Positives = 266/450 (59%), Gaps = 10/450 (2%) Frame = +3 Query: 48 SLLESELDSNPQPSEDQNQRIKRRRLFEG---TEEILES-------ANVQEEEQLSSISE 197 S S D + + ED + +R+R+ T EI+E A+++ + +SIS+ Sbjct: 47 SAASSSPDQDKEAEEDDDSDFQRKRVKRSKVETVEIVEDDGGTTSFASLKHNSE-ASISK 105 Query: 198 KCPPHPGYMWGVCILCGQIKKDMENEEKSGVSLKYIHKDLELADSEITRFREDGLASLIS 377 + HPG +CI+CGQ+ + +SGV+ YIHK L L + EI R R + +L+ Sbjct: 106 EICTHPGSFGTMCIVCGQLL-----DGESGVTFGYIHKGLRLGNDEIVRLRNTDMKNLLR 160 Query: 378 HQQKXXXXXXXXXXXXNSARFADVLLEEESYITSRYLAGETNDGSASDKSKSTNGDLLKL 557 H+ K NS + + L+EE YL G+T+ S D SK G L L Sbjct: 161 HK-KLYLILDLDHTLLNSTQLMHMTLDEE------YLNGQTD--SLQDVSK---GSLFML 208 Query: 558 EALKIWTKLRPFVHTFLEEVSEMYEMCINTMGERFYALKMAKLLDPSGKYFGTRIISATD 737 ++++ TKLRPFV TFL+E S+M+EM I TMG+R YAL+MAKLLDP +YF ++IS D Sbjct: 209 SSMQMMTKLRPFVRTFLKEASQMFEMYIYTMGDRAYALEMAKLLDPGREYFNAKVISRDD 268 Query: 738 STHRHQKNLDVVLGAESAVVILDDTENVWPKHRSNLIVVERYHFFRSSCQQFNIQKQSLT 917 T RHQK LDVVLG ESAV+ILDDTEN W KH+ NLI++ERYHFF SSC QF +SL+ Sbjct: 269 GTQRHQKGLDVVLGQESAVLILDDTENAWMKHKDNLILMERYHFFASSCHQFGFNCKSLS 328 Query: 918 EAMTDELETQGTXXXXXXXXXXXHQMFFDKRNVEDFSQFGLNSRDVREILKEICSKVLEG 1097 E TDE E++G HQ+FF++ + ++ RDVR++LK + VL+G Sbjct: 329 EQKTDESESEGALASILKVLRKIHQIFFEE------LEENMDGRDVRQVLKTVRKDVLKG 382 Query: 1098 CKLVFSGVFPTQCXXXXXXXXXXXXXXXXKCATTVREDVTHVIALDSGTDASRWAIQNNR 1277 CK+VFS VFPTQ C+T + VTHV++ DSGT+ S WA+++N+ Sbjct: 383 CKIVFSRVFPTQSQADNHHLWRMAEQLGATCSTELDPSVTHVVSKDSGTEKSHWALKHNK 442 Query: 1278 FLVHPHWLEASKYLWQRQPEEKFIANILQS 1367 FLV P W+EA+ Y WQRQPEE F N +++ Sbjct: 443 FLVQPGWIEAANYFWQRQPEENFSFNQIKN 472 >dbj|BAB08870.1| unnamed protein product [Arabidopsis thaliana] Length = 1065 Score = 333 bits (855), Expect = 9e-89 Identities = 184/434 (42%), Positives = 261/434 (60%) Frame = +3 Query: 45 ASLLESELDSNPQPSEDQNQRIKRRRLFEGTEEILESANVQEEEQLSSISEKCPPHPGYM 224 A+ L++ELDS S ++ + + E L+ ++ E+ SS +C HPG Sbjct: 644 
AAFLDAELDSASDASSGPSEEEEAE---DDVESGLKRQKLEHLEEASSSKGECE-HPGSF 699 Query: 225 WGVCILCGQIKKDMENEEKSGVSLKYIHKDLELADSEITRFREDGLASLISHQQKXXXXX 404 +C +CGQ E++GVS +YIHK++ L + EI+R R D + + Q+K Sbjct: 700 GNMCFVCGQ------KLEETGVSFRYIHKEMRLNEDEISRLR-DSDSRFLQRQRKLYLVL 752 Query: 405 XXXXXXXNSARFADVLLEEESYITSRYLAGETNDGSASDKSKSTNGDLLKLEALKIWTKL 584 N+ D+ EEE YL T+ S D + G L LE +++ TKL Sbjct: 753 DLDHTLLNTTILRDLKPEEE------YLKSHTH--SLQDGCNVSGGSLFLLEFMQMMTKL 804 Query: 585 RPFVHTFLEEVSEMYEMCINTMGERFYALKMAKLLDPSGKYFGTRIISATDSTHRHQKNL 764 RPFVH+FL+E SEM+ M I TMG+R YA +MAKLLDP G+YFG R+IS D T RH+K+L Sbjct: 805 RPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGDRVISRDDGTVRHEKSL 864 Query: 765 DVVLGAESAVVILDDTENVWPKHRSNLIVVERYHFFRSSCQQFNIQKQSLTEAMTDELET 944 DVVLG ESAV+ILDDTEN WPKH+ NLIV+ERYHFF SSC+QF+ + +SL+E +DE E Sbjct: 865 DVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQFDHRYKSLSELKSDESEP 924 Query: 945 QGTXXXXXXXXXXXHQMFFDKRNVEDFSQFGLNSRDVREILKEICSKVLEGCKLVFSGVF 1124 G H +FF+ NV++ G+++RDVR +LK++ ++L+GCK+VFS VF Sbjct: 925 DGALATVLKVLKQAHALFFE--NVDE----GISNRDVRLMLKQVRKEILKGCKIVFSRVF 978 Query: 1125 PTQCXXXXXXXXXXXXXXXXKCATTVREDVTHVIALDSGTDASRWAIQNNRFLVHPHWLE 1304 PT+ CAT V VTHV+A+D GT+ +RWA++ +++VH W++ Sbjct: 979 PTKAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEKARWAVREKKYVVHRGWID 1038 Query: 1305 ASKYLWQRQPEEKF 1346 A+ YLW +QPEE F Sbjct: 1039 AANYLWMKQPEENF 1052 >ref|NP_001078764.1| C-terminal domain phosphatase-like 4 [Arabidopsis thaliana] gi|122154038|sp|Q00IB6.1|CPL4_ARATH RecName: Full=RNA polymerase II C-terminal domain phosphatase-like 4; Short=FCP-like 4; AltName: Full=Carboxyl-terminal phosphatase-like 4; Short=AtCPL4; Short=CTD phosphatase-like 4 gi|95115186|gb|ABF55959.1| carboxyl-terminal phosphatase-like 4 [Arabidopsis thaliana] gi|332009601|gb|AED96984.1| C-terminal domain phosphatase-like 4 [Arabidopsis thaliana] Length = 440 Score = 333 bits (855), Expect = 9e-89 Identities = 184/434 (42%), Positives = 261/434 (60%) Frame = +3 Query: 45 
ASLLESELDSNPQPSEDQNQRIKRRRLFEGTEEILESANVQEEEQLSSISEKCPPHPGYM 224 A+ L++ELDS S ++ + + E L+ ++ E+ SS +C HPG Sbjct: 19 AAFLDAELDSASDASSGPSEEEEAE---DDVESGLKRQKLEHLEEASSSKGECE-HPGSF 74 Query: 225 WGVCILCGQIKKDMENEEKSGVSLKYIHKDLELADSEITRFREDGLASLISHQQKXXXXX 404 +C +CGQ E++GVS +YIHK++ L + EI+R R D + + Q+K Sbjct: 75 GNMCFVCGQ------KLEETGVSFRYIHKEMRLNEDEISRLR-DSDSRFLQRQRKLYLVL 127 Query: 405 XXXXXXXNSARFADVLLEEESYITSRYLAGETNDGSASDKSKSTNGDLLKLEALKIWTKL 584 N+ D+ EEE YL T+ S D + G L LE +++ TKL Sbjct: 128 DLDHTLLNTTILRDLKPEEE------YLKSHTH--SLQDGCNVSGGSLFLLEFMQMMTKL 179 Query: 585 RPFVHTFLEEVSEMYEMCINTMGERFYALKMAKLLDPSGKYFGTRIISATDSTHRHQKNL 764 RPFVH+FL+E SEM+ M I TMG+R YA +MAKLLDP G+YFG R+IS D T RH+K+L Sbjct: 180 RPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGDRVISRDDGTVRHEKSL 239 Query: 765 DVVLGAESAVVILDDTENVWPKHRSNLIVVERYHFFRSSCQQFNIQKQSLTEAMTDELET 944 DVVLG ESAV+ILDDTEN WPKH+ NLIV+ERYHFF SSC+QF+ + +SL+E +DE E Sbjct: 240 DVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQFDHRYKSLSELKSDESEP 299 Query: 945 QGTXXXXXXXXXXXHQMFFDKRNVEDFSQFGLNSRDVREILKEICSKVLEGCKLVFSGVF 1124 G H +FF+ NV++ G+++RDVR +LK++ ++L+GCK+VFS VF Sbjct: 300 DGALATVLKVLKQAHALFFE--NVDE----GISNRDVRLMLKQVRKEILKGCKIVFSRVF 353 Query: 1125 PTQCXXXXXXXXXXXXXXXXKCATTVREDVTHVIALDSGTDASRWAIQNNRFLVHPHWLE 1304 PT+ CAT V VTHV+A+D GT+ +RWA++ +++VH W++ Sbjct: 354 PTKAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEKARWAVREKKYVVHRGWID 413 Query: 1305 ASKYLWQRQPEEKF 1346 A+ YLW +QPEE F Sbjct: 414 AANYLWMKQPEENF 427 >ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis] gi|223534449|gb|EEF36151.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis] Length = 478 Score = 331 bits (848), Expect = 6e-88 Identities = 189/436 (43%), Positives = 256/436 (58%), Gaps = 3/436 (0%) Frame = +3 Query: 57 ESELDSNPQPSEDQNQRIKRRRL--FEGTEEILESANVQEEEQLSSISEKCP-PHPGYMW 227 E E DS+ S+ RIKR R+ E E ES V ++ L + S K HPG Sbjct: 60 EEESDSDDD-SDIATNRIKRSRVETLENGENPKESTRVSLDQTLVASSSKVACTHPGSFG 118 Query: 228 
GVCILCGQIKKDMENEEKSGVSLKYIHKDLELADSEITRFREDGLASLISHQQKXXXXXX 407 +CILCG+ E++GV+ YIHK L LA+ EI R R + +L+ H+ K Sbjct: 119 DMCILCGE-----RLIEETGVTFGYIHKGLRLANDEIVRLRNTDMKNLLRHR-KLYLVLD 172 Query: 408 XXXXXXNSARFADVLLEEESYITSRYLAGETNDGSASDKSKSTNGDLLKLEALKIWTKLR 587 NS + + EEE YL + + S D S NG L ++ + + TKLR Sbjct: 173 LDHTLLNSTQLMHLTAEEE------YLKSQID--SMQDVS---NGSLFMVDFMHMMTKLR 221 Query: 588 PFVHTFLEEVSEMYEMCINTMGERFYALKMAKLLDPSGKYFGTRIISATDSTHRHQKNLD 767 PF+ TFL+E S+M+EM I TMG+R YAL+MAK LDP +YF R+IS D T RHQK LD Sbjct: 222 PFIRTFLKEASQMFEMYIYTMGDRAYALEMAKFLDPGREYFNARVISRDDGTQRHQKGLD 281 Query: 768 VVLGAESAVVILDDTENVWPKHRSNLIVVERYHFFRSSCQQFNIQKQSLTEAMTDELETQ 947 +VLG ESAV+ILDDTEN W KH+ NLI++ERYHFF SSC+QF + +SL++ +DE E+ Sbjct: 282 IVLGQESAVLILDDTENAWTKHKDNLILMERYHFFASSCRQFGFECKSLSQLKSDENESD 341 Query: 948 GTXXXXXXXXXXXHQMFFDKRNVEDFSQFGLNSRDVREILKEICSKVLEGCKLVFSGVFP 1127 G H +FFD+ +ED ++ RDVR++L + VL+GCK+VFS VFP Sbjct: 342 GALASVLKVLRRIHHIFFDE--LED----AIDGRDVRQVLSTVRKDVLKGCKIVFSRVFP 395 Query: 1128 TQCXXXXXXXXXXXXXXXXKCATTVREDVTHVIALDSGTDASRWAIQNNRFLVHPHWLEA 1307 TQ C+ V VTHV++ ++GT+ SRWA++N++FLVHP W+EA Sbjct: 396 TQFQADNHHLWKMAEQLGATCSREVDPSVTHVVSAEAGTEKSRWALKNDKFLVHPRWIEA 455 Query: 1308 SKYLWQRQPEEKFIAN 1355 + Y+WQRQPEE F N Sbjct: 456 TNYMWQRQPEENFSVN 471 >ref|XP_004242582.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Solanum lycopersicum] Length = 512 Score = 330 bits (846), Expect = 1e-87 Identities = 184/435 (42%), Positives = 257/435 (59%), Gaps = 5/435 (1%) Frame = +3 Query: 57 ESELDSNPQPSEDQNQRIKRRR--LFEGTEEILESANVQEEEQLSSIS---EKCPPHPGY 221 + + D+ + R K+R+ L EG + S + E + S S + C HPG Sbjct: 97 DEDNDTGDGDGSIDSSRSKKRKIELIEGAVDPQSSVSRGEPAETSGASMALDVCT-HPGV 155 Query: 222 MWGVCILCGQIKKDMENEEKSGVSLKYIHKDLELADSEITRFREDGLASLISHQQKXXXX 401 M G+CI CGQ + E++SGV+ YIHK+L LAD E+ R RE L +L+ H+ K Sbjct: 156 MGGMCIRCGQ-----KVEDESGVAFGYIHKNLRLADDEVARLREKDLKNLLRHR-KLILV 209 Query: 402 XXXXXXXXNSARFADVLLEEESYITSRYLAGETNDGSASDKSKSTNGDLLKLEALKIWTK 581 NS R 
AD+ EE R + + + +L KL+ + + TK Sbjct: 210 LDLDHTLLNSTRLADISAEESYLKDQREVLPD-----------ALRSNLFKLDWIHMMTK 258 Query: 582 LRPFVHTFLEEVSEMYEMCINTMGERFYALKMAKLLDPSGKYFGTRIISATDSTHRHQKN 761 LRPFVHTFL+E S ++EM I TMGER YAL+MAKLLDP G YF +R+I+ +DST RHQK Sbjct: 259 LRPFVHTFLKEASSLFEMYIYTMGERPYALEMAKLLDPGGIYFHSRVIAQSDSTRRHQKG 318 Query: 762 LDVVLGAESAVVILDDTENVWPKHRSNLIVVERYHFFRSSCQQFNIQKQSLTEAMTDELE 941 LDVVLG ESAV+ILDDTE VW KHR NLI+++RYHFF SSC+QF ++ +SL+E +DE E Sbjct: 319 LDVVLGQESAVLILDDTEVVWGKHRENLILMDRYHFFTSSCRQFGLKCKSLSEQKSDENE 378 Query: 942 TQGTXXXXXXXXXXXHQMFFDKRNVEDFSQFGLNSRDVREILKEICSKVLEGCKLVFSGV 1121 +G H++FFD ++ + RDVR++LK + ++L+GCK+VF+GV Sbjct: 379 AEGALASVLEVLQRIHRLFFDPERGDNIME-----RDVRQVLKTVRKEILKGCKIVFTGV 433 Query: 1122 FPTQCXXXXXXXXXXXXXXXXKCATTVREDVTHVIALDSGTDASRWAIQNNRFLVHPHWL 1301 P QC +T V E VTHV++++ T+ SR A++ +FLVHP W+ Sbjct: 434 IPIQCQPENHYYWKLAEKLGATFSTEVDESVTHVVSMNDKTEKSRQAVREKKFLVHPRWI 493 Query: 1302 EASKYLWQRQPEEKF 1346 EA+ YLW++ PEE F Sbjct: 494 EAANYLWRKPPEENF 508 >ref|XP_002864543.1| hypothetical protein ARALYDRAFT_332090 [Arabidopsis lyrata subsp. lyrata] gi|297310378|gb|EFH40802.1| hypothetical protein ARALYDRAFT_332090 [Arabidopsis lyrata subsp. 
lyrata] Length = 1006 Score = 328 bits (841), Expect = 4e-87 Identities = 192/454 (42%), Positives = 268/454 (59%), Gaps = 11/454 (2%) Frame = +3 Query: 45 ASLLESELDS----NPQPSE-------DQNQRIKRRRLFEGTEEILESANVQEEEQLSSI 191 A+ L++ELDS + PSE D+ +KRR+L E LE+ + +E E+ SS Sbjct: 580 AAFLDAELDSASDASSGPSEEEEEAEDDEESGLKRRKL-----EHLETVDEEEIEEASSS 634 Query: 192 SEKCPPHPGYMWGVCILCGQIKKDMENEEKSGVSLKYIHKDLELADSEITRFREDGLASL 371 +C HPG +C +CGQ E++GVS +YIHK++ L + EI+R R D + Sbjct: 635 KGECQ-HPGSFGNMCFVCGQ------KLEETGVSFRYIHKEMRLNEDEISRLR-DSDSRF 686 Query: 372 ISHQQKXXXXXXXXXXXXNSARFADVLLEEESYITSRYLAGETNDGSASDKSKSTNGDLL 551 + Q+K NS D+ EEE + + E D S + G L Sbjct: 687 LQRQRKLYLVLDLDHTLLNSTVLRDLKPEEEYLKSHTHSLQEPFDFLLI--SDVSGGSLF 744 Query: 552 KLEALKIWTKLRPFVHTFLEEVSEMYEMCINTMGERFYALKMAKLLDPSGKYFGTRIISA 731 LE + + TKLRPFVH+FL+E SEM+ M I TMG+R YA +MAKLLDP G+YFG RIIS Sbjct: 745 MLEFMHMMTKLRPFVHSFLKEASEMFVMYIYTMGDRAYARQMAKLLDPRGEYFGDRIISR 804 Query: 732 TDSTHRHQKNLDVVLGAESAVVILDDTENVWPKHRSNLIVVERYHFFRSSCQQFNIQKQS 911 D T RHQK+LDVVLG ESAV+ILDDTEN WP H+ NLIV+ERYHFF SSC+QF+ + +S Sbjct: 805 DDGTVRHQKSLDVVLGQESAVLILDDTENAWPNHKDNLIVIERYHFFASSCRQFDHKYKS 864 Query: 912 LTEAMTDELETQGTXXXXXXXXXXXHQMFFDKRNVEDFSQFGLNSRDVREILKEICSKVL 1091 L+E +DE E G +NV++ +++RDVR +LK++ +VL Sbjct: 865 LSELKSDESEPDGALATVL-------------KNVDE----DISNRDVRSMLKQVRKEVL 907 Query: 1092 EGCKLVFSGVFPTQCXXXXXXXXXXXXXXXXKCATTVREDVTHVIALDSGTDASRWAIQN 1271 +GCK+VFS VFPT+ CAT V VTHV+A+D GT+ +RWA++ Sbjct: 908 KGCKVVFSRVFPTKAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEKARWAVRE 967 Query: 1272 NRFLVHPHWLEASKYLWQRQPEEKFIANILQSRE 1373 +++VH W++A+ YLW++QPEEKF L+ ++ Sbjct: 968 KKYVVHRGWIDAANYLWKKQPEEKFSLEQLKKQQ 1001 >ref|XP_004242583.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Solanum lycopersicum] Length = 472 Score = 328 bits (840), Expect = 5e-87 Identities = 184/433 (42%), Positives = 257/433 (59%), Gaps = 7/433 (1%) Frame = +3 Query: 69 DSNPQPSEDQN---QRIKRRR--LFEGT--EEILESANVQEEEQLSSISEKCPPHPGYMW 227 D++ +D N +R 
K+R+ L E + L S E +S++ HPG M Sbjct: 58 DNDTGDGDDGNIDSRRSKKRKIELIEAAVDPQSLVSRGESAETSGASLALDVCTHPGVMG 117 Query: 228 GVCILCGQIKKDMENEEKSGVSLKYIHKDLELADSEITRFREDGLASLISHQQKXXXXXX 407 G+CI CGQ + E++SGV+ YIHK+L LAD E+ R RE L +L+ H+ K Sbjct: 118 GMCIRCGQ-----KVEDESGVAFGYIHKNLRLADDEVARLREKDLKNLLRHR-KLILVLD 171 Query: 408 XXXXXXNSARFADVLLEEESYITSRYLAGETNDGSASDKSKSTNGDLLKLEALKIWTKLR 587 NS R AD+ EE R + + + +L KL+ + + TKLR Sbjct: 172 LDHTLLNSTRLADISAEESYLKDQREVLPD-----------ALRSNLFKLDWIHMMTKLR 220 Query: 588 PFVHTFLEEVSEMYEMCINTMGERFYALKMAKLLDPSGKYFGTRIISATDSTHRHQKNLD 767 PFVHTFL+E S ++EM I TMGER YAL+MAKLLDP G YF +R+I+ +DST RHQK LD Sbjct: 221 PFVHTFLKEASSLFEMYIYTMGERPYALEMAKLLDPGGIYFHSRVIAQSDSTRRHQKGLD 280 Query: 768 VVLGAESAVVILDDTENVWPKHRSNLIVVERYHFFRSSCQQFNIQKQSLTEAMTDELETQ 947 VVLG ESAV+ILDDTE VW KHR NLI+++RYHFF SSC+QF ++ +SL+E +DE E + Sbjct: 281 VVLGQESAVLILDDTEVVWGKHRENLILMDRYHFFTSSCRQFGLKCKSLSEQKSDENEAE 340 Query: 948 GTXXXXXXXXXXXHQMFFDKRNVEDFSQFGLNSRDVREILKEICSKVLEGCKLVFSGVFP 1127 G H++FFD ++ + RDVR++LK + ++L+GCK+VF+GV P Sbjct: 341 GALASVLEVLQRIHRLFFDPERGDNIME-----RDVRQVLKTVRKEILKGCKIVFTGVIP 395 Query: 1128 TQCXXXXXXXXXXXXXXXXKCATTVREDVTHVIALDSGTDASRWAIQNNRFLVHPHWLEA 1307 QC +T V E VTHV++++ T+ SR A++ +FLVHP W+EA Sbjct: 396 IQCQPENHYYWKLAEKLGATFSTEVDESVTHVVSMNDKTEKSRQAVREKKFLVHPRWIEA 455 Query: 1308 SKYLWQRQPEEKF 1346 + YLW++ PEE F Sbjct: 456 ANYLWRKPPEENF 468 >ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Cucumis sativus] Length = 452 Score = 327 bits (838), Expect = 8e-87 Identities = 190/447 (42%), Positives = 257/447 (57%), Gaps = 13/447 (2%) Frame = +3 Query: 45 ASLLESELDSNPQPSEDQNQ----------RIKRRR---LFEGTEEILESANVQEEEQLS 185 A+ L +LDS+ S + RIKRR+ L E+I+ Q E LS Sbjct: 18 AAFLAVDLDSHSSDSSPDEETEGDNNAESVRIKRRKVEKLENSEEDIMHEVEEQSLEVLS 77 Query: 186 SISEKCPPHPGYMWGVCILCGQIKKDMENEEKSGVSLKYIHKDLELADSEITRFREDGLA 365 ++ HPG +CI+CGQ +E+SGV+ YIHK+L L + EI R R + Sbjct: 78 
K--QQLCSHPGSFGNMCIICGQ-----RLDEESGVTFGYIHKELRLNNDEINRMRNKEMK 130 Query: 366 SLISHQQKXXXXXXXXXXXXNSARFADVLLEEESYITSRYLAGETNDGSASDKSKSTNGD 545 L+ ++K NS + +EEE YL +T+ S D +K G Sbjct: 131 ELLQ-RKKLILVLDLDHTLLNSTELRYLTVEEE------YLRSQTD--SLDDVTK---GS 178 Query: 546 LLKLEALKIWTKLRPFVHTFLEEVSEMYEMCINTMGERFYALKMAKLLDPSGKYFGTRII 725 L L ++ TKLRPFVH+FL+E S+++EM I TMGER YA +MAKLLDP +YF +++I Sbjct: 179 LFLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVI 238 Query: 726 SATDSTHRHQKNLDVVLGAESAVVILDDTENVWPKHRSNLIVVERYHFFRSSCQQFNIQK 905 S D T +HQK LDVVLG ESAV+ILDDTEN W KH+ NLI++ERYHFF SSC+QF Sbjct: 239 SRDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNC 298 Query: 906 QSLTEAMTDELETQGTXXXXXXXXXXXHQMFFDKRNVEDFSQFGLNSRDVREILKEICSK 1085 +SL+E DE ET G H MFF++ + + L RDVR++LK + ++ Sbjct: 299 KSLSELKNDESETDGALTTILKVLKQVHHMFFNEVSGD------LVDRDVRQVLKTVRAE 352 Query: 1086 VLEGCKLVFSGVFPTQCXXXXXXXXXXXXXXXXKCATTVREDVTHVIALDSGTDASRWAI 1265 VLEGCK+VFS VFPT+ C+T + + VTHV+A D+GT+ SRWA+ Sbjct: 353 VLEGCKVVFSRVFPTKFQAENHQLWKMVEQLGGTCSTELDQSVTHVVATDAGTEKSRWAL 412 Query: 1266 QNNRFLVHPHWLEASKYLWQRQPEEKF 1346 + +FLVHP W+EAS Y W+RQ EE F Sbjct: 413 KEKKFLVHPRWIEASNYFWKRQMEENF 439 >ref|XP_006343662.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Solanum tuberosum] Length = 478 Score = 325 bits (832), Expect = 4e-86 Identities = 182/430 (42%), Positives = 254/430 (59%), Gaps = 4/430 (0%) Frame = +3 Query: 69 DSNPQPSEDQNQRIKRR-RLFEGTEEILESANVQEEEQLSSIS---EKCPPHPGYMWGVC 236 D + S D ++ KR+ L E + S + E + S S + C HPG M G+C Sbjct: 68 DDDDDGSIDSSRSKKRKIELIEAAVDPQSSVSRGEPAETSGASLALDVCT-HPGVMGGMC 126 Query: 237 ILCGQIKKDMENEEKSGVSLKYIHKDLELADSEITRFREDGLASLISHQQKXXXXXXXXX 416 I CGQ + E++SGV+ YIHK+L LAD E+ R R+ L +L+ H+ K Sbjct: 127 IRCGQ-----KVEDESGVAFGYIHKNLRLADDEVARLRDKDLKNLLRHK-KLILVLDLDH 180 Query: 417 XXXNSARFADVLLEEESYITSRYLAGETNDGSASDKSKSTNGDLLKLEALKIWTKLRPFV 596 NS R AD+ EE R + + + +L KL+ + + TKLRPFV Sbjct: 181 
TLLNSTRLADISAEESYLKDQREVLPD-----------ALRNNLFKLDWIHMMTKLRPFV 229 Query: 597 HTFLEEVSEMYEMCINTMGERFYALKMAKLLDPSGKYFGTRIISATDSTHRHQKNLDVVL 776 HTFL+E S ++EM I TMGER YAL+MA LLDP G YF +R+I+ +DST RHQK LDVVL Sbjct: 230 HTFLKEASSLFEMYIYTMGERPYALEMASLLDPGGIYFHSRVIAQSDSTRRHQKGLDVVL 289 Query: 777 GAESAVVILDDTENVWPKHRSNLIVVERYHFFRSSCQQFNIQKQSLTEAMTDELETQGTX 956 G ESAV+ILDDTE VW KHR NLI+++RYHFF SSC+QF ++ +SL+E +DE E +G Sbjct: 290 GQESAVLILDDTEVVWGKHRENLILMDRYHFFTSSCRQFGLKCKSLSEQKSDENEAEGAL 349 Query: 957 XXXXXXXXXXHQMFFDKRNVEDFSQFGLNSRDVREILKEICSKVLEGCKLVFSGVFPTQC 1136 H++FFD ++ + RDVR++LK + ++L+GCK+VF+GV P QC Sbjct: 350 ASVLEVLQRIHRLFFDLERGDNIME-----RDVRQVLKTVRKEILKGCKIVFTGVIPIQC 404 Query: 1137 XXXXXXXXXXXXXXXXKCATTVREDVTHVIALDSGTDASRWAIQNNRFLVHPHWLEASKY 1316 +T V E VTHV++++ T+ SR A++ +FLVHP W+EA+ Y Sbjct: 405 QPENHHYWKLAEKLGATFSTEVDESVTHVVSMNDKTEKSRQALREKKFLVHPSWIEAANY 464 Query: 1317 LWQRQPEEKF 1346 LW++ PEE F Sbjct: 465 LWRKPPEENF 474 >ref|XP_002324546.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa] gi|550318537|gb|EEF03111.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa] Length = 468 Score = 323 bits (829), Expect = 9e-86 Identities = 188/450 (41%), Positives = 260/450 (57%), Gaps = 10/450 (2%) Frame = +3 Query: 48 SLLESELDSNPQPSEDQNQRIKRRRLFEG---TEEILES-------ANVQEEEQLSSISE 197 S S D + + ED + +R+R+ T EI+E A+++ + +SIS+ Sbjct: 47 SAASSSPDQDKEAEEDDDSDFQRKRVKRSKVETVEIVEDDGGTTSFASLKHNSE-ASISK 105 Query: 198 KCPPHPGYMWGVCILCGQIKKDMENEEKSGVSLKYIHKDLELADSEITRFREDGLASLIS 377 + HPG +CI+CGQ+ + +SGV+ YIHK L L + EI R R + +L+ Sbjct: 106 EICTHPGSFGTMCIVCGQLL-----DGESGVTFGYIHKGLRLGNDEIVRLRNTDMKNLLR 160 Query: 378 HQQKXXXXXXXXXXXXNSARFADVLLEEESYITSRYLAGETNDGSASDKSKSTNGDLLKL 557 H+ K NS + + L+EE YL G+T+ S D SK G L L Sbjct: 161 HK-KLYLILDLDHTLLNSTQLMHMTLDEE------YLNGQTD--SLQDVSK---GSLFML 208 Query: 558 EALKIWTKLRPFVHTFLEEVSEMYEMCINTMGERFYALKMAKLLDPSGKYFGTRIISATD 737 ++++ TKLRPFV TFL+E S+M+EM I TMG+R YAL+MAKLLDP +YF ++IS D Sbjct: 209 
SSMQMMTKLRPFVRTFLKEASQMFEMYIYTMGDRAYALEMAKLLDPGREYFNAKVISRDD 268 Query: 738 STHRHQKNLDVVLGAESAVVILDDTENVWPKHRSNLIVVERYHFFRSSCQQFNIQKQSLT 917 T RHQK LDVVLG ESAV+ILDDTEN W KH+ NLI++ERYHFF SSC QF +SL+ Sbjct: 269 GTQRHQKGLDVVLGQESAVLILDDTENAWMKHKDNLILMERYHFFASSCHQFGFNCKSLS 328 Query: 918 EAMTDELETQGTXXXXXXXXXXXHQMFFDKRNVEDFSQFGLNSRDVREILKEICSKVLEG 1097 E TDE E++G HQ+FF+ + L ++LK + VL+G Sbjct: 329 EQKTDESESEGALASILKVLRKIHQIFFE----DHILSLAL------QVLKTVRKDVLKG 378 Query: 1098 CKLVFSGVFPTQCXXXXXXXXXXXXXXXXKCATTVREDVTHVIALDSGTDASRWAIQNNR 1277 CK+VFS VFPTQ C+T + VTHV++ DSGT+ S WA+++N+ Sbjct: 379 CKIVFSRVFPTQSQADNHHLWRMAEQLGATCSTELDPSVTHVVSKDSGTEKSHWALKHNK 438 Query: 1278 FLVHPHWLEASKYLWQRQPEEKFIANILQS 1367 FLV P W+EA+ Y WQRQPEE F N +++ Sbjct: 439 FLVQPGWIEAANYFWQRQPEENFSFNQIKN 468 >ref|XP_001781984.1| predicted protein [Physcomitrella patens] gi|162666557|gb|EDQ53208.1| predicted protein [Physcomitrella patens] Length = 563 Score = 323 bits (829), Expect = 9e-86 Identities = 182/417 (43%), Positives = 244/417 (58%), Gaps = 5/417 (1%) Frame = +3 Query: 192 SEKCPPHPGYMWGVCILCGQIKKDMENEEK--SGVSLKYIHKDLELADSEITRFREDGLA 365 S KCPPHPG++W VCI CG+ K + + V L+YIH+ LE+++ E R R L Sbjct: 119 SNKCPPHPGFIWDVCIRCGKRKSTAPSNDPVIDRVGLRYIHEGLEVSELEAARVRNAELR 178 Query: 366 SLISHQQKXXXXXXXXXXXXNSARFADVLLEEESYITSRYLAGETNDGSASDKSKSTNGD 545 ++ +QK NSARF++V EE Y+T + AG+ + +S Sbjct: 179 R-VTGKQKLLLVVDLDHTMLNSARFSEVPAEERIYLT--WTAGQQHGRVSS--------- 226 Query: 546 LLKLEALKIWTKLRPFVHTFLEEVSEMYEMCINTMGERFYALKMAKLLDPSGKYFGTRII 725 L +L L +WTKLRPF H FLEE S++YEM + TMGE+ YA MA+LLDP+G+ FG RII Sbjct: 227 LHQLTKLGMWTKLRPFAHKFLEEASKLYEMYVYTMGEKIYAQAMAELLDPTGQLFGGRII 286 Query: 726 SATDSTHRHQKNLDVVLGAESAVVILDDTENVWPKHRSNLIVVERYHFFRSSCQQFNIQK 905 S TDST RH K+LDVVLGAESAVVILDDTE VWP HRSNLI++ERYHFF SSC QF ++ Sbjct: 287 SQTDSTKRHTKDLDVVLGAESAVVILDDTEAVWPNHRSNLILMERYHFFTSSCHQFRVRA 346 Query: 906 QSLTEAMTDELETQGTXXXXXXXXXXXHQMFFDKRNVEDFSQF-GLNSRDVREILKEICS 1082 SL + DE E GT H FF+ + + L DVR++++ I Sbjct: 347 
PSLAQMHRDECEIDGTLATTLKTLQAIHHEFFNGHKGKSMKRRPPLELPDVRDVIRSIRG 406 Query: 1083 KVLEGCKLVFSGVFPTQC-XXXXXXXXXXXXXXXXKCATTVREDVTHVIALDSGTDASRW 1259 K+L GC +VFS +FPT +C+T THV+ALD GTD +RW Sbjct: 407 KLLSGCHIVFSRIFPTGLQNPEFHPFWQLAVELGARCSTVCDHTTTHVVALDRGTDKARW 466 Query: 1260 AIQNNRFLVHPHWLEASKYLWQRQPEEKF-IANILQSRESSGFPESVSMFPCQ*HGN 1427 A Q+ LVHP W+EA+ YLW+R E+ F + + + S+ F +++S+ P N Sbjct: 467 AKQHGISLVHPRWVEAASYLWKRPREKDFPVTDDASALISTTFSKNISVEPISIEAN 523 >gb|EXC26161.1| RNA polymerase II C-terminal domain phosphatase-like 4 [Morus notabilis] Length = 512 Score = 322 bits (825), Expect = 3e-85 Identities = 179/400 (44%), Positives = 238/400 (59%), Gaps = 1/400 (0%) Frame = +3 Query: 174 EQLSSISEKCPPHPGYMWGVCILCGQIKKDMENEEKSGVSLKYIHKDLELADSEITRFRE 353 E+ S+ + C HPG +CILCGQ EE++GV+ YIHK L L + EI R R Sbjct: 137 EEESTKKDACT-HPGSFGDMCILCGQ-----RLEEETGVTFGYIHKGLRLNNDEIVRLRS 190 Query: 354 DGLASLISHQQKXXXXXXXXXXXXNSARFADVLLEEESYITSRYLAGETNDGSASDKSKS 533 + +LI H+ K NS R D L EE Y+ S+ A + Sbjct: 191 TDMKNLIRHK-KLCLVLDLDHTLLNSTRLVD-LSSEEQYLKSQ----------AFSPQDA 238 Query: 534 TNGDLLKLEALKIWTKLRPFVHTFLEEVSEMYEMCINTMGERFYALKMAKLLDPSGKYFG 713 + G L LEA+ + TKLRPFV FL+EV ++E+ + TMG+R YAL MAKLLDP +YFG Sbjct: 239 SEGSLFVLEAMHMMTKLRPFVRNFLKEVYNLFELYVYTMGDRPYALAMAKLLDPRREYFG 298 Query: 714 TRIISATDSTHRHQKNLDVVLGAESAVVILDDTENVWPK-HRSNLIVVERYHFFRSSCQQ 890 RIIS D T +HQK LDVVLG ESAV+ILDDTEN W K H+ NLI++ERYHFFRSS Q Sbjct: 299 DRIISRDDGTLKHQKGLDVVLGQESAVLILDDTENAWIKHHKENLILMERYHFFRSSTHQ 358 Query: 891 FNIQKQSLTEAMTDELETQGTXXXXXXXXXXXHQMFFDKRNVEDFSQFGLNSRDVREILK 1070 F +SL+E +DE ET+G H MFFD+R ++ RDVR++LK Sbjct: 359 FGYNCKSLSELKSDESETEGALVTVLNVLKQVHSMFFDERGIDHI------IRDVRQVLK 412 Query: 1071 EICSKVLEGCKLVFSGVFPTQCXXXXXXXXXXXXXXXXKCATTVREDVTHVIALDSGTDA 1250 + +VL+GCK+VFS VFPT+ C + VTHV++LD GT+ Sbjct: 413 TLRKEVLKGCKIVFSRVFPTEFQAENHQLWKMAEQLGATCGIELDPSVTHVVSLDVGTEK 472 Query: 1251 SRWAIQNNRFLVHPHWLEASKYLWQRQPEEKFIANILQSR 1370 SRWA++ N+FLVHP W+EA+ Y+W+RQPE+ F N ++++ Sbjct: 473 
SRWAVKENKFLVHPRWIEAANYMWKRQPEDNFSVNQVKNQ 512 >gb|EOY32064.1| RNA polymerase II ctd phosphatase, putative isoform 1 [Theobroma cacao] Length = 469 Score = 322 bits (825), Expect = 3e-85 Identities = 184/432 (42%), Positives = 248/432 (57%), Gaps = 6/432 (1%) Frame = +3 Query: 69 DSNPQPSEDQNQRIKRRRLFE------GTEEILESANVQEEEQLSSISEKCPPHPGYMWG 230 D + +N+R K +L + T + L + +LS + C HPG Sbjct: 54 DDDDDLDSQRNKRCKTEKLEDLEESRGSTSQGLIEDKIVIHAELSLKKDICT-HPGSFGQ 112 Query: 231 VCILCGQIKKDMENEEKSGVSLKYIHKDLELADSEITRFREDGLASLISHQQKXXXXXXX 410 +CILCGQ +++SGV+ YIHK L L + EI R R + +L+ H+ K Sbjct: 113 MCILCGQ-----RLDDESGVTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHK-KLYLVLDL 166 Query: 411 XXXXXNSARFADVLLEEESYITSRYLAGETNDGSASDKSKSTNGDLLKLEALKIWTKLRP 590 NS + + +EE YL G+++ S D S+ G L L+ + + TKLRP Sbjct: 167 DHTLLNSTQLMHLTPDEE------YLKGQSD--SLQDVSR---GSLFMLDFMHMMTKLRP 215 Query: 591 FVHTFLEEVSEMYEMCINTMGERFYALKMAKLLDPSGKYFGTRIISATDSTHRHQKNLDV 770 FV TFL+E SEM+EM I TMG+R YAL+MAKLLDP +YF R+IS D T +HQK LDV Sbjct: 216 FVRTFLKEASEMFEMYIYTMGDRPYALEMAKLLDPRREYFSDRVISRDDGTQKHQKGLDV 275 Query: 771 VLGAESAVVILDDTENVWPKHRSNLIVVERYHFFRSSCQQFNIQKQSLTEAMTDELETQG 950 VLG ESAVVILDDTEN W KH+ NLI++ERYH+F SSC QF + +SL++ +DE E G Sbjct: 276 VLGQESAVVILDDTENAWMKHKDNLILMERYHYFASSCHQFGYKCKSLSQLKSDESEPDG 335 Query: 951 TXXXXXXXXXXXHQMFFDKRNVEDFSQFGLNSRDVREILKEICSKVLEGCKLVFSGVFPT 1130 H MFFD+ + L SRDVR++LK + +VL+GCK+VFS VFPT Sbjct: 336 ALASVLKALRQIHHMFFDELDC------NLASRDVRQVLKTVQEEVLKGCKIVFSHVFPT 389 Query: 1131 QCXXXXXXXXXXXXXXXXKCATTVREDVTHVIALDSGTDASRWAIQNNRFLVHPHWLEAS 1310 C+T VTHV++ D+GT+ SRWA++ +FLVHP W+EA+ Sbjct: 390 NFPAESHPLWKMAEQLGATCSTETDLSVTHVVSTDAGTEKSRWAVKEKKFLVHPRWIEAT 449 Query: 1311 KYLWQRQPEEKF 1346 YLWQ+QPEE F Sbjct: 450 NYLWQKQPEENF 461 >gb|EMJ05544.1| hypothetical protein PRUPE_ppa005647mg [Prunus persica] Length = 449 Score = 322 bits (825), Expect = 3e-85 Identities = 185/429 (43%), Positives = 248/429 (57%), Gaps = 6/429 (1%) Frame = +3 Query: 87 
SEDQNQRIKRRRLFEGTEEILESAN------VQEEEQLSSISEKCPPHPGYMWGVCILCG 248 S+D ++R +RR E I E+ V+E + S + C HPG + +CI+CG Sbjct: 41 SDDGSERSTKRRKVENLGSIDETQGSTSQIFVEENSEASPKKDICT-HPGSVKDLCIVCG 99 Query: 249 QIKKDMENEEKSGVSLKYIHKDLELADSEITRFREDGLASLISHQQKXXXXXXXXXXXXN 428 Q +EKSGV L YIHKD L + EI R R + + H +K N Sbjct: 100 Q-----RVDEKSGVPLGYIHKDFWLNNDEIDRVRSTDIKKSL-HLKKLYLVLDLDHTLLN 153 Query: 429 SARFADVLLEEESYITSRYLAGETNDGSASDKSKSTNGDLLKLEALKIWTKLRPFVHTFL 608 S + EEE YL +T+ S D S +G L +++ + + TKLRPFV FL Sbjct: 154 STHLNHMTAEEE------YLHSQTD--SLQDVS---DGSLFRVDVMHMMTKLRPFVRKFL 202 Query: 609 EEVSEMYEMCINTMGERFYALKMAKLLDPSGKYFGTRIISATDSTHRHQKNLDVVLGAES 788 +E SEM+EM I TMGER YAL+MAKLLDP +YFG R+IS D T +HQK LDVVLG ES Sbjct: 203 KEASEMFEMYIYTMGERAYALEMAKLLDPRKEYFGDRVISRDDGTQKHQKGLDVVLGHES 262 Query: 789 AVVILDDTENVWPKHRSNLIVVERYHFFRSSCQQFNIQKQSLTEAMTDELETQGTXXXXX 968 A +ILDDTEN W KH+ NLI++ERYHFFRSSC QF +SL+E +DE E +G Sbjct: 263 AALILDDTENAWTKHKDNLILMERYHFFRSSCHQFGFHCKSLSELKSDESEPEGALATVL 322 Query: 969 XXXXXXHQMFFDKRNVEDFSQFGLNSRDVREILKEICSKVLEGCKLVFSGVFPTQCXXXX 1148 H MFF + S+ L RDVR++LK + ++L+GCK+VFS VFP++ Sbjct: 323 EVLKRIHNMFFYE------SKDNLIDRDVRQVLKTLRKEILKGCKIVFSRVFPSKFQAEN 376 Query: 1149 XXXXXXXXXXXXKCATTVREDVTHVIALDSGTDASRWAIQNNRFLVHPHWLEASKYLWQR 1328 C+T + VTHV++ D+GT+ SRWA++ +FLVHP W+EAS Y+W + Sbjct: 377 HQLWKMAEQLGATCSTELDLSVTHVVSTDAGTEKSRWAVKEKKFLVHPQWIEASNYMWLK 436 Query: 1329 QPEEKFIAN 1355 Q E+KF N Sbjct: 437 QAEDKFPVN 445 >dbj|BAK07377.1| predicted protein [Hordeum vulgare subsp. 
vulgare] Length = 488 Score = 321 bits (822), Expect = 6e-85 Identities = 176/416 (42%), Positives = 242/416 (58%), Gaps = 4/416 (0%) Frame = +3 Query: 111 KRRRLFEGTEEILESANVQEEEQLSSISE----KCPPHPGYMWGVCILCGQIKKDMENEE 278 KRRR+ E ++ +A EE+ + S+ + KCPPHPG+ G+CI CG K + E+ Sbjct: 71 KRRRVEEHRQD-QGTATRPEEDVIGSVKDAQIKKCPPHPGFFGGLCINCG---KSQDEED 126 Query: 279 KSGVSLKYIHKDLELADSEITRFREDGLASLISHQQKXXXXXXXXXXXXNSARFADVLLE 458 GV+ YIHK L L SE+ R RE + +L+ ++K NS R D+ Sbjct: 127 VPGVAFGYIHKGLRLGTSEMDRLRESEVKNLL-RERKLVLILDLDHTLINSTRLHDISAA 185 Query: 459 EESYITSRYLAGETNDGSASDKSKSTNGDLLKLEALKIWTKLRPFVHTFLEEVSEMYEMC 638 E L +T +AS + L L+ + + TKLRPFV FLEE S M++M Sbjct: 186 EMD------LGIQT---AASKNADDPERSLFTLQGMHMLTKLRPFVRKFLEEASNMFDMY 236 Query: 639 INTMGERFYALKMAKLLDPSGKYFGTRIISATDSTHRHQKNLDVVLGAESAVVILDDTEN 818 I TMG++ YA+++AKLLDP YF +++IS +D T RHQK LDVVLG + VI+DDTE+ Sbjct: 237 IYTMGDKAYAIEIAKLLDPGNVYFDSKVISNSDCTQRHQKGLDVVLGDDKVAVIIDDTEH 296 Query: 819 VWPKHRSNLIVVERYHFFRSSCQQFNIQKQSLTEAMTDELETQGTXXXXXXXXXXXHQMF 998 VW KH+ NLI++ERYH+F +SC+QF QSL+E M DE E+ G H +F Sbjct: 297 VWQKHKENLILMERYHYFAASCRQFGFSDQSLSELMQDERESDGALATILDVLKRIHTIF 356 Query: 999 FDKRNVEDFSQFGLNSRDVREILKEICSKVLEGCKLVFSGVFPTQCXXXXXXXXXXXXXX 1178 FD VE L+SRDVR+++K + +VL+GCKLVFS VFP+ C Sbjct: 357 FDS-GVET----ALSSRDVRQVIKRVRQEVLQGCKLVFSRVFPSDCRSQDQIMWKMAEQL 411 Query: 1179 XXKCATTVREDVTHVIALDSGTDASRWAIQNNRFLVHPHWLEASKYLWQRQPEEKF 1346 C + V VTHV+A+ +GT+ +RWA N +FL+HP W+EA Y W RQPEE F Sbjct: 412 GAVCCSEVDPSVTHVVAVHAGTEKARWAAGNKKFLLHPRWIEACNYRWHRQPEEDF 467 >ref|XP_006401141.1| hypothetical protein EUTSA_v10013455mg [Eutrema salsugineum] gi|557102231|gb|ESQ42594.1| hypothetical protein EUTSA_v10013455mg [Eutrema salsugineum] Length = 467 Score = 320 bits (820), Expect = 1e-84 Identities = 193/473 (40%), Positives = 268/473 (56%), Gaps = 20/473 (4%) Frame = +3 Query: 45 ASLLESELDSNPQ------PSEDQ-------NQRIKRRRLF-------EGTEEILESANV 164 A+ LE+EL+S+ PSE+ N R+K+R+L EG E + Sbjct: 18 
AAFLETELESDSDSSSESFPSEEAEDDTEVANHRLKKRKLEHLETVEEEGVENVASVTFS 77 Query: 165 QEEEQLSSISEKCPPHPGYMWGVCILCGQIKKDMENEEKSGVSLKYIHKDLELADSEITR 344 +E + SS C HPG + +CILCG E E++GV L+Y+H+D+ + EI+R Sbjct: 78 EEISEASSSKRPCD-HPGSIKQICILCG------EPVEQTGVPLRYMHQDMWIHQEEISR 130 Query: 345 FREDGLASLISHQQKXXXXXXXXXXXXNSARFADVLLEEESYITSRYLAGETNDGSASDK 524 R+ + + Q+K N+ D+ EE+ YL T+ S D Sbjct: 131 IRDSDI-KFLQRQRKLCLVLDLDHTLLNTTVLRDLKPEED------YLKSHTH--SLQDV 181 Query: 525 SKSTNGDLLKLEALKIWTKLRPFVHTFLEEVSEMYEMCINTMGERFYALKMAKLLDPSGK 704 S GDL L+ + + TKLRPFV +FL+E SEM+ M I TMG+R YA KMA+LLDP G+ Sbjct: 182 S---GGDLFMLDFMNMMTKLRPFVRSFLKEASEMFVMYIYTMGDRDYARKMAELLDPKGE 238 Query: 705 YFGTRIISATDSTHRHQKNLDVVLGAESAVVILDDTENVWPKHRSNLIVVERYHFFRSSC 884 YF RIIS D T +HQK+LDVVLG ES+V+ILDDTEN WP H+ NLIV+ERYHFF SSC Sbjct: 239 YFSGRIISRDDGTVKHQKSLDVVLGQESSVLILDDTENAWPSHKDNLIVIERYHFFASSC 298 Query: 885 QQFNIQKQSLTEAMTDELETQGTXXXXXXXXXXXHQMFFDKRNVEDFSQFGLNSRDVREI 1064 +QF + QSL++ +DE E G H +FF ED + + RDVR + Sbjct: 299 RQFEHKYQSLSQLKSDESEPDGVLATVLKVLKQTHSLFF-----EDGGGY-TSGRDVRTL 352 Query: 1065 LKEICSKVLEGCKLVFSGVFPTQCXXXXXXXXXXXXXXXXKCATTVREDVTHVIALDSGT 1244 LK++ +VLEGCK+VFS VFPT+ CAT V VTHV+A+D GT Sbjct: 353 LKQVRKQVLEGCKVVFSRVFPTKSEPKDHPLWRIAEGLGATCATEVDASVTHVVAMDVGT 412 Query: 1245 DASRWAIQNNRFLVHPHWLEASKYLWQRQPEEKFIANILQSRESSGFPESVSM 1403 + RWAI+ +F+V+ W++A+ YLW++QPEE F L+ E+ + V++ Sbjct: 413 EKVRWAIREKKFVVNRGWIDAAHYLWKKQPEENFGLEQLKKTETEVKNDDVTL 465 >ref|XP_006486243.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like isoform X1 [Citrus sinensis] gi|568865772|ref|XP_006486244.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like isoform X2 [Citrus sinensis] gi|568865774|ref|XP_006486245.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like isoform X3 [Citrus sinensis] Length = 478 Score = 318 bits (816), Expect = 3e-84 Identities = 187/437 (42%), Positives = 249/437 (56%), Gaps = 6/437 (1%) Frame = +3 Query: 54 
LESELDSNPQPSEDQNQRIKRRRLFEGTEEILES------ANVQEEEQLSSISEKCPPHP 215 ++ E ++ + +RIKRR+ + E I E N++E+ ++S + CP HP Sbjct: 48 IDEEAENEEARDDKDLERIKRRKT-QIVETIQERPGPTLLGNLEEKTEVSLEMDNCP-HP 105 Query: 216 GYMWGVCILCGQIKKDMENEEKSGVSLKYIHKDLELADSEITRFREDGLASLISHQQKXX 395 G + G+C CG+ EE+SGV+ YI K L L + EI R R + L+ H+ K Sbjct: 106 GSLGGMCYRCGK-----RLEEESGVTFSYICKGLRLGNDEIDRLRNTDMKHLLRHR-KLY 159 Query: 396 XXXXXXXXXXNSARFADVLLEEESYITSRYLAGETNDGSASDKSKSTNGDLLKLEALKIW 575 NS + EE+ YL + + S D SK G L L + + Sbjct: 160 LILDLDHTLLNSTLLLHLTPEED------YLKSQAD--SLQDVSK---GSLFMLAFMNMM 208 Query: 576 TKLRPFVHTFLEEVSEMYEMCINTMGERFYALKMAKLLDPSGKYFGTRIISATDSTHRHQ 755 TKLRPFVHTFL+E SEM+EM I TMG+R YAL+MAKLLDPS +YF R+IS D T RHQ Sbjct: 209 TKLRPFVHTFLKEASEMFEMYIYTMGDRPYALEMAKLLDPSREYFNARVISRDDGTQRHQ 268 Query: 756 KNLDVVLGAESAVVILDDTENVWPKHRSNLIVVERYHFFRSSCQQFNIQKQSLTEAMTDE 935 K LDVVLG ESAV+ILDDTEN W KHR NLI++ERYHFF SSC+QF QSL++ +DE Sbjct: 269 KGLDVVLGQESAVLILDDTENAWTKHRDNLILMERYHFFASSCRQFGYHCQSLSQLRSDE 328 Query: 936 LETQGTXXXXXXXXXXXHQMFFDKRNVEDFSQFGLNSRDVREILKEICSKVLEGCKLVFS 1115 E +G H +FFD+ + L RDVR++LK + +VL+GCKLVFS Sbjct: 329 SELEGALASVLKVLKRIHNIFFDELAND------LAGRDVRQVLKMVRGEVLKGCKLVFS 382 Query: 1116 GVFPTQCXXXXXXXXXXXXXXXXKCATTVREDVTHVIALDSGTDASRWAIQNNRFLVHPH 1295 VFPT+ C + VTHV++ D+ T+ SRWA + +FLV P Sbjct: 383 HVFPTKFPADTHYLWKMAEQLGATCLIELDPSVTHVVSTDARTEKSRWAAKEAKFLVDPR 442 Query: 1296 WLEASKYLWQRQPEEKF 1346 W+E + +LWQRQPEE F Sbjct: 443 WIETANFLWQRQPEENF 459 >ref|XP_006600548.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like isoform X2 [Glycine max] Length = 444 Score = 318 bits (814), Expect = 5e-84 Identities = 195/451 (43%), Positives = 256/451 (56%), Gaps = 18/451 (3%) Frame = +3 Query: 48 SLLESELD-SNPQPSED----------QNQRIKRRRLFEGTEEILESAN---VQEEEQLS 185 + L++ELD S+P S D Q+ R KRR+ FE EE S + V+ + S Sbjct: 19 AFLDAELDASSPDSSPDKEVVKQDDELQSVRTKRRK-FESIEETEGSTSEGIVKRSLEAS 77 Query: 186 SISEKCPPHPGYMWGVCILCGQIKKDMENEEKSGVSLKYIHKDLELADSEITRFREDGLA 365 S + C HPG +CI CGQ 
K D E SGV+ YIHK L L D EI+R R + Sbjct: 78 SEVDVCCTHPGSFGNMCIRCGQ-KLDGE----SGVTFGYIHKGLRLHDEEISRLRNTDMK 132 Query: 366 SLISHQQKXXXXXXXXXXXXNSARFADVLLEEESYITSRYLAGETNDGSASDKSKSTNGD 545 SL+ ++K NS A + EE +L +T+ + D SK G Sbjct: 133 SLLG-RKKLYLVLDLDHTLLNSTHLAQLTSEE------LHLLNQTDSLTMIDVSK---GS 182 Query: 546 LLKLEALKIWTKLRPFVHTFLEEVSEMYEMCINTMGERFYALKMAKLLDPSGKYFGTRII 725 L KLE + + TKLRPFV FL+E SEM+EM I TMG+R YAL+MAKLLDP G+YF ++I Sbjct: 183 LFKLEHMNMMTKLRPFVRPFLKEASEMFEMYIYTMGDRPYALEMAKLLDPQGEYFNAKVI 242 Query: 726 SATDSTHRHQKNLDVVLGAESAVVILDDTENVWPKHRSNLIVVERYHFFRSSCQQFNIQK 905 S D T +HQK LDVVLG ESAV+ILDDTE+ W KH+ NLI++ERYHFF SSC+QF Sbjct: 243 SRDDGTQKHQKGLDVVLGQESAVIILDDTEHAWMKHKDNLILMERYHFFGSSCRQFGFNC 302 Query: 906 QSLTEAMTDELETQGTXXXXXXXXXXXHQMFFDKRNVEDFSQFGLNSRDVREILKEICSK 1085 +SL E +DE ET G H MFFDK+ EDF + +DVR++L + + Sbjct: 303 KSLAELKSDEDETDGALAKILKVLKQVHCMFFDKQ--EDF-----DDQDVRQVLSSVRRE 355 Query: 1086 VLEGCKLVFS----GVFPTQCXXXXXXXXXXXXXXXXKCATTVREDVTHVIALDSGTDAS 1253 VL GC ++FS G P+ C T + VTHV+A D+GT+ Sbjct: 356 VLSGCVIIFSRIVHGAIPS--------LRKMAEQMGATCLTEIDPSVTHVVATDAGTEKC 407 Query: 1254 RWAIQNNRFLVHPHWLEASKYLWQRQPEEKF 1346 RWA++ +F+VHP W+EA+ Y WQ+QPEE F Sbjct: 408 RWAVKEKKFVVHPLWIEAANYFWQKQPEENF 438 >gb|EPS63533.1| hypothetical protein M569_11249, partial [Genlisea aurea] Length = 386 Score = 318 bits (814), Expect = 5e-84 Identities = 179/389 (46%), Positives = 235/389 (60%), Gaps = 2/389 (0%) Frame = +3 Query: 186 SISEKCP-PHPGYMWGVCILCGQIKKDMENEEKSGVSLKYIHKDLELADSEITRFREDGL 362 SISE PHPG G+CI+CG I EE+SG+ YIHK+L LAD E+ R R L Sbjct: 15 SISESSVCPHPGIYGGMCIMCGGIM-----EEESGIPFGYIHKNLRLADDEVARLRYKDL 69 Query: 363 ASLISHQQKXXXXXXXXXXXXNSARFADVLLEEESYITSRYLAGETNDGSASDKSKSTNG 542 +L+ ++K NS+R +D L EE ++ +SD S Sbjct: 70 KALLG-RRKLHLVLDLDHTLLNSSRLSD-LTGEECHLNVH----------SSDLPDSMRN 117 Query: 543 DLLKLEALKIWTKLRPFVHTFLEEVSEMYEMCINTMGERFYALKMAKLLDPSGKYFGTRI 722 L +LE +++ TKLRPFV TFL+E SE++EM I TMGER YAL+MAKLLDP YF +RI Sbjct: 118 
SLFRLEHIQMMTKLRPFVRTFLKEASEIFEMHIYTMGERPYALEMAKLLDPGDTYFHSRI 177 Query: 723 ISATDSTHRHQKNLDVVLGAESAVVILDDTENVWPKHRSNLIVVERYHFFRSSCQQFNIQ 902 I+ D T +HQK LDVVLG ES V+ILDDTE VW KH+ NLI++ERY FF SSC+QF Sbjct: 178 IAQGDCTQKHQKGLDVVLGQESTVLILDDTEGVWGKHKENLILMERYLFFGSSCKQFGFT 237 Query: 903 KQSLTEAMTDELETQGTXXXXXXXXXXXHQMFFDKRNVEDFSQFGLNSRDVREILKEICS 1082 +SL E +DE E++G H +FFD + ++ L +RDVR++L + Sbjct: 238 CKSLAELRSDESESEGALSTALATLKRIHSLFFDGEHDDE-----LEARDVRKVLHSVRK 292 Query: 1083 KVLEGCKLVFSGVFPTQ-CXXXXXXXXXXXXXXXXKCATTVREDVTHVIALDSGTDASRW 1259 ++LEGCK+VFS VFP+ C+ V VTHV+A+D+GTD SRW Sbjct: 293 EILEGCKIVFSRVFPSSFFQAENHQLWKMGVRLGATCSREVDSTVTHVVAVDAGTDKSRW 352 Query: 1260 AIQNNRFLVHPHWLEASKYLWQRQPEEKF 1346 A++ + LVHP WLEAS Y+W+RQPEEKF Sbjct: 353 ALRQGKHLVHPRWLEASYYMWKRQPEEKF 381