BLASTX nr result
ID: Mentha25_contig00025564
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00025564
         (3530 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EYU42076.1| hypothetical protein MIMGU_mgv1a000356mg [Mimulus...  1304   0.0
ref|XP_006341905.1| PREDICTED: RNA polymerase II C-terminal doma...   987   0.0
ref|XP_004252660.1| PREDICTED: RNA polymerase II C-terminal doma...   970   0.0
ref|XP_007043830.1| RNA polymerase II C-terminal domain phosphat...   949   0.0
gb|AAV92930.1| putative transcription regulator CPL1 [Solanum ly...   937   0.0
ref|XP_006438860.1| hypothetical protein CICLE_v10030535mg [Citr...   934   0.0
ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal doma...   924   0.0
emb|CBI35661.3| unnamed protein product [Vitis vinifera]              921   0.0
ref|XP_002304648.2| hypothetical protein POPTR_0003s16280g [Popu...   904   0.0
gb|EXB81217.1| RNA polymerase II C-terminal domain phosphatase-l...   891   0.0
ref|XP_003530482.2| PREDICTED: RNA polymerase II C-terminal doma...   875   0.0
ref|XP_006603006.1| PREDICTED: RNA polymerase II C-terminal doma...   874   0.0
ref|XP_002297869.2| CTD phosphatase-like protein 3 [Populus tric...   868   0.0
ref|XP_006438858.1| hypothetical protein CICLE_v10030535mg [Citr...   868   0.0
ref|XP_007139315.1| hypothetical protein PHAVU_008G019000g [Phas...   868   0.0
ref|XP_004492028.1| PREDICTED: RNA polymerase II C-terminal doma...   862   0.0
ref|XP_002304714.2| hypothetical protein POPTR_0003s16280g [Popu...   847   0.0
ref|XP_004310239.1| PREDICTED: RNA polymerase II C-terminal doma...   847   0.0
ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative ... 
843 0.0 ref|XP_004157633.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymera... 842 0.0 >gb|EYU42076.1| hypothetical protein MIMGU_mgv1a000356mg [Mimulus guttatus] Length = 1220 Score = 1304 bits (3375), Expect = 0.0 Identities = 717/1184 (60%), Positives = 835/1184 (70%), Gaps = 11/1184 (0%) Frame = +2 Query: 2 SNSGGSEARVWMMRDLYKYQIPSKPSYLGLYNLAWAQAVNNKPLGDVLVMMEGGSNDNGN 181 +++GG ARVW M+DLY+YQ+ SK Y GLYNLAWAQAVNNK L +VL+M E G+ND N Sbjct: 76 NSAGGGGARVWTMKDLYEYQVASK-HYPGLYNLAWAQAVNNKSLDEVLMMKEDGNNDRSN 134 Query: 182 PD-TDSSSAKTSDGDENXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNSNK 358 +D+SS+K+S +++ NSN+ Sbjct: 135 GGISDTSSSKSSKTNDSKVVIDVEVEGGMEEGELEEGEIDLDSELVVRNMDFNVETNSNE 194 Query: 359 ETECQXXXXXXXXXXXXXXMEELVNLNAADAQKSFDALCSRLETTAIKLQKIVLEGPFAE 538 ++ EL +LN ADA S+ LCS L+ T + LQ++VLEG FAE Sbjct: 195 KSR-----------RVDSIKRELESLNVADAIISYHRLCSSLKNTIVSLQEMVLEGSFAE 243 Query: 539 RDTLVQLFIAAIQTLHKVFSSMTLRLKQQNGAILLRLLAQVTSLRPPLFSSLQLKEMEAI 718 +DTLVQL + AIQTL+ VFSSM+ +LK+QN IL RLLA+VTSL+PPLFS LQL++ EAI Sbjct: 244 KDTLVQLLLTAIQTLYSVFSSMSPKLKEQNKPILSRLLARVTSLKPPLFSPLQLEKAEAI 303 Query: 719 RLCXXXXXXXXXXXXATGRKEVRQGFNTNDLHVLLENTDSKAAYLKSRGKEPGSAGSVDQ 898 R GR+ V T DLHVLLE ++ + L+ E G +GS DQ Sbjct: 304 RFSMESSVESFRNDSNNGRERV----GTADLHVLLETANTDSIDLRKCEIESGPSGSPDQ 359 Query: 899 SGHTFPIQC-SAPGLVNMRLKGLVLPLLDLHKDHDADSLPSPTRDLSATFPFDKGLILEQ 1075 + +C S GLV R KG+ PL+DLHKDHDADSLPSPTRDLSA PFDKG I+ Sbjct: 360 T------ECRSNLGLVISRHKGVTRPLIDLHKDHDADSLPSPTRDLSAPLPFDKGFIMGH 413 Query: 1076 GLLKPEWPVLRQTLPRENPVLHPYETDAVKAVSSYQQKFGGGSFFVNDELPSPTPSEEGD 1255 GLLKPEWPV + + R+N ++HPYETDAV AVSSYQQKFG SFFVND+LPSPTPSE+G Sbjct: 414 GLLKPEWPVPGRNIERDNILMHPYETDAVIAVSSYQQKFGRSSFFVNDKLPSPTPSEDGQ 473 Query: 1256 NNGDXXXXXXXXXXXXHNVNRALNSLISGQPXXXXXXXXXXXXXXXESSNVRTVDTENNS 1435 +GD H+VN A+N L S QP S+++R Sbjct: 474 TSGDGEINGEVSSSIIHHVNPAVNILTSVQPVVSSSVAMDTSATPEISNSLRN------- 526 Query: 1436 RSVLKTSYAKSRDPRLRLANSDAGPRNLSPSLPLIGSDESKSKFAGVMSSRKHKVVQEQV 1615 VLK++ AKSRDPRLRL+NSDAG +N + SL +GS+ESK + +G++SSRK K 
+E V Sbjct: 527 -PVLKSTSAKSRDPRLRLSNSDAGAKNPNKSLSAVGSEESKWESSGMVSSRKQKTNEELV 585 Query: 1616 LDGPALKRQKNEXXXXXXXXXXXXXISTSQVAIPSPNLPVSSLIKSPLSFQTEIMPVKXX 1795 L+GPALKRQ+NE STSQ+ LPVS+ I S L+ Q+E P K Sbjct: 586 LNGPALKRQRNELSGPSTAMPLVSATSTSQMT-----LPVSAPIMSLLTSQSEKFPSKNS 640 Query: 1796 XXXXXXXXXXRDIVGNPSMWMSILKMEHQKSSGDNNSAVQMPNSNST-GVAPSTSGVLPF 1972 +DI +PS+WM+ILKME+ KSS D S Q+ NSNS G PS GV+P Sbjct: 641 NATSSLHSLLKDIAVDPSIWMNILKMENLKSSDDIKSMTQISNSNSVLGAVPSPVGVMPL 700 Query: 1973 SSMLGQKPAGIV--PCQAVSAEEPGKVRMKPRDPRRILHNNTPHKGSTAVSDLPKTNASS 2146 SS +GQ AG V P QAVS EE GKVRMKPRDPRR+LHNN P K T+V+D PK +AS Sbjct: 701 SSTIGQISAGTVQIPSQAVSVEESGKVRMKPRDPRRVLHNNAPQKDVTSVADQPKADASF 760 Query: 2147 LSVIMGSLSAKEQEDQMEKVVSSGTVKPPDITMQFTNNLRNIADIMSVSQASMPSTILPL 2326 S +++ +QEDQ+E +SS ++KPPDITMQFTNNLRNIAD++SVSQ S +L Sbjct: 761 GS----AMNTPKQEDQLENKMSSSSMKPPDITMQFTNNLRNIADLLSVSQICTTSPVLAQ 816 Query: 2327 SVSSEQQ-----AGTDTKIVVNESVNFRSGSNLT-SEAATSIPPRPLNANAWSDVEHLFD 2488 S + AG +T+ + E N R+ +++T SEAATS PPRPLNANAWSDVEHLF+ Sbjct: 817 IPSLQPAQGDLIAGKETRGPIAEYGNIRNVTDITTSEAATSSPPRPLNANAWSDVEHLFE 876 Query: 2489 GFDDQQKAAIQRERARRLEEQNKMFAAGKXXXXXXXXXXXXNSAKFVEVDPLHDEMLRKK 2668 GFDDQQK AIQRERARRLEEQNK+FA K NSAKFVEVDP HDEMLRKK Sbjct: 877 GFDDQQKVAIQRERARRLEEQNKLFAVRKLCLVLDLDHTLLNSAKFVEVDPQHDEMLRKK 936 Query: 2669 EEQDREKPYRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKYYATEMAKLLD 2848 EEQDREKP+RHLFRFPHMGMWTKLRPG+WNFLEKASKLYELHLYTMGNKYYATEMAKLLD Sbjct: 937 EEQDREKPHRHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKYYATEMAKLLD 996 Query: 2849 PKGELFSGRVISRGDDGEPFDSDDRVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLI 3028 PKGELFSGRVISRGDDGEPFDSDDR PKSKDLEGVLGMES VVIIDDS+RVWPHNKLNLI Sbjct: 997 PKGELFSGRVISRGDDGEPFDSDDRAPKSKDLEGVLGMESGVVIIDDSIRVWPHNKLNLI 1056 Query: 3029 VVERYIYFPCSRRQFGLPGPSLLEIDHDERPEEGTLASSLAVIERIHEIFFGHDSLDEAD 3208 VVERYIYFPCSRRQFGLPGPSLLEIDHDERPE+GTLAS VIERIHE FFGH+SL+EAD Sbjct: 1057 VVERYIYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASCSTVIERIHENFFGHESLNEAD 1116 Query: 3209 
VRNILACEQQKILAGCRIVFSRVFPVGEANPHMHPLWQTAEQFGAVCTNQIDEQVTHVVA 3388 VRNILA EQ+KILAGCRIVFSRVFPVGEA PHMHPLWQTAEQFGAVC NQIDE VTHVVA Sbjct: 1117 VRNILASEQRKILAGCRIVFSRVFPVGEAKPHMHPLWQTAEQFGAVCINQIDEHVTHVVA 1176 Query: 3389 NSLGTDKVNWALSRGRFVVHPGWVEASALLYRRANEHDFAIKQQ 3520 NSLGTDKVNWALS G+FVVHPGWVEASALLYRRANEHDFAIKQQ Sbjct: 1177 NSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEHDFAIKQQ 1220 >ref|XP_006341905.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Solanum tuberosum] Length = 1218 Score = 987 bits (2552), Expect = 0.0 Identities = 589/1215 (48%), Positives = 718/1215 (59%), Gaps = 52/1215 (4%) Frame = +2 Query: 26 RVWMMRDLYKYQIPSKPSYLGLYNLAWAQAVNNKPLGDVLVMMEGGSND--NGNPDTDSS 199 RVW MRD YKY I S+ GLYNLAWAQAV NKPL ++ VM SN N N + +S Sbjct: 57 RVWTMRDAYKYPI-SRDYARGLYNLAWAQAVQNKPLDELFVMTSDNSNQCANANANVESK 115 Query: 200 SAKTSDGDENXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNSNKETECQXX 379 D D++ N KE Sbjct: 116 VIIDVDVDDDAKEEGELEEGEIDLDAADLVL------------------NFGKEAN---- 153 Query: 380 XXXXXXXXXXXXMEELVNLNAADAQKSFDALCSRLETTAIKLQKIVLEGPFAERDTLVQL 559 E+L ++ + KSF +CS+L+T+ + L ++ L + D L+QL Sbjct: 154 ----------FVREQLQSVTLDETHKSFSMVCSKLQTSLLALGELALSQD--KNDILIQL 201 Query: 560 FIAAIQTLHKVFSSMTLRLKQQNGAILLRLLAQVTSLRPPLFSSLQLKEMEAIRLCXXXX 739 F+ A++T++ VF SM KQQN IL RLL + P L SS QLKE++A+ L Sbjct: 202 FMTALRTINSVFYSMNQDQKQQNTDILSRLLFHAKTQLPALLSSEQLKEVDAVILSINQS 261 Query: 740 XXXXXXXX---ATGRKEV---------RQGFNTNDLHVLLENTDSKAAYLKSRGKEPGSA 883 G K V + N N + D A +KS G + S Sbjct: 262 AVFSNTQDNDKVNGIKVVELLDKKVSHKSSENANQDFTAVNKYDLGAVSIKSSGLKEQSV 321 Query: 884 GSVDQSGHTFPIQCSAPGLVNMRLKGLVLPLLDLHKDHDADSLPSPTRDLSATFPFDKGL 1063 + PGL N + KGL +PLLDLHKDHD D+LPSPTR++ FP K Sbjct: 322 S----------FESVKPGLANSKAKGLSIPLLDLHKDHDEDTLPSPTREIGPQFPVAKAT 371 Query: 1064 ILEQGLLKPEWPVLRQTLPRENPVLHPYETDAVKAVSSYQQKFGGGSFFVNDELPSPTPS 1243 G++K + P+ +L + N +LHPYETDA+KAVSSYQQKFG S FV++ LPSPTPS Sbjct: 372 -QAHGMVKLDLPIFAGSLEKGNSLLHPYETDALKAVSSYQQKFGRSSLFVSENLPSPTPS 430 Query: 1244 
EEGDNNGDXXXXXXXXXXXXHNVNRALNSLISGQPXXXXXXXXXXXXXXXESSNVRTVDT 1423 EEGD+ HN + LN GQP + RT D Sbjct: 431 EEGDSGKGDIGGEVTSLDVVHNASH-LNESSMGQPILSSVPQTNILDGQGLGT-ARTADP 488 Query: 1424 ENN-SRSVLKTSYAKSRDPRLRLANSDAGPRNLSPS-LPLIGSDESKSKFAGVMSSRKHK 1597 + L++S AKSRDPRLRLA SDA +N + + LP+ D ++ S+K K Sbjct: 489 LSFLPNPSLRSSTAKSRDPRLRLATSDAVAQNTNKNILPIPDIDLKLEASLEMIGSKKQK 548 Query: 1598 VVQEQVLDGPALKRQKNEXXXXXXXXXXXXX----------------ISTSQVAIPSPNL 1729 V V P KRQ++E I++S A S + Sbjct: 549 TVDLPVFGAPLPKRQRSEQTDSIIVSDVRPSTGNGGWLEDRGTAGLPITSSNCATDSSDN 608 Query: 1730 PVSSL---------IKSPLSFQTEIMPVKXXXXXXXXXXXXRDIVGNPSMWMSILKMEHQ 1882 + L I S + E PV +DI NPS+WM+I+KME Q Sbjct: 609 DIRKLEQVTATIATIPSVIVNAAENFPVTGISTSTTLHSLLKDIAINPSIWMNIIKMEQQ 668 Query: 1883 KSSGDNNSAVQMPNSNST--GVAPSTSGVLPFSSMLGQKPAGIV--PCQAVSAEEPGKVR 2050 KS+ + + +S+ + G PST + P SS +GQ+ GI+ P SA+E VR Sbjct: 669 KSADASRTTTAQASSSKSILGAVPSTDAIAPRSSAIGQRSVGILQTPTHTASADEVAIVR 728 Query: 2051 MKPRDPRRILHNNTPHKGSTAVSDLPKTNASSLSVIMGSLSAKEQEDQME-KVVSSGTVK 2227 MKPRDPRR+LHN KG SD KT + + +L + QEDQ++ K + + Sbjct: 729 MKPRDPRRVLHNTAVLKGGNVGSDQCKTGVAGTHATISNLGFQSQEDQLDRKSAVTLSTT 788 Query: 2228 PPDITMQFTNNLRNIADIMSVSQASMPSTILPLSVSSE------QQAGTDTKIVVNESVN 2389 PPDI QFT NL+NIAD++SVS PST L + ++ Q+ ++ K V+E Sbjct: 789 PPDIARQFTKNLKNIADMISVS----PSTSLSAASQTQTQCLQSHQSRSEGKEAVSEPSE 844 Query: 2390 FRSGSNLTSEAATSIPPRPLNANAWSDVEHLFDGFDDQQKAAIQRERARRLEEQNKMFAA 2569 + + L SE + +P +W DVEHLF+G+ DQQ+A IQRERARRLEEQ KMF+ Sbjct: 845 RVNDAGLASEKGSPGSLQP--QISWGDVEHLFEGYSDQQRADIQRERARRLEEQKKMFSV 902 Query: 2570 GKXXXXXXXXXXXXNSAKFVEVDPLHDEMLRKKEEQDREKPYRHLFRFPHMGMWTKLRPG 2749 K NSAKFVE+DP+H+E+LRKKEEQDREKP RHLFRFPHMGMWTKLRPG Sbjct: 903 RKLCLVLDLDHTLLNSAKFVEIDPVHEEILRKKEEQDREKPCRHLFRFPHMGMWTKLRPG 962 Query: 2750 IWNFLEKASKLYELHLYTMGNKYYATEMAKLLDPKGELFSGRVISRGDDGEPFDSDDRVP 2929 IWNFLEKAS L+ELHLYTMGNK YATEMAKLLDPKG+LF+GRVISRGDDG+PFD D+RVP Sbjct: 963 IWNFLEKASNLFELHLYTMGNKLYATEMAKLLDPKGDLFAGRVISRGDDGDPFDGDERVP 1022 Query: 2930 
KSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDH 3109 KSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDH Sbjct: 1023 KSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDH 1082 Query: 3110 DERPEEGTLASSLAVIERIHEIFFGHDSLDEADVRNILACEQQKILAGCRIVFSRVFPVG 3289 DERPE+GTLAS L VI+RIH+ FF H S+DEADVRNILA EQ+KILAGCRIVFSRVFPVG Sbjct: 1083 DERPEDGTLASCLGVIQRIHQNFFAHRSIDEADVRNILATEQKKILAGCRIVFSRVFPVG 1142 Query: 3290 EANPHMHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSRGRFVVHPGWVEAS 3469 EANPH+HPLWQTAEQFGAVCT+QID+QVTHVVANSLGTDKVNWALS GRFVVHPGWVEAS Sbjct: 1143 EANPHLHPLWQTAEQFGAVCTSQIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEAS 1202 Query: 3470 ALLYRRANEHDFAIK 3514 ALLYRRANEHDFAIK Sbjct: 1203 ALLYRRANEHDFAIK 1217 >ref|XP_004252660.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Solanum lycopersicum] Length = 1211 Score = 970 bits (2507), Expect = 0.0 Identities = 578/1198 (48%), Positives = 706/1198 (58%), Gaps = 35/1198 (2%) Frame = +2 Query: 26 RVWMMRDLYKYQIPSKPSYLGLYNLAWAQAVNNKPLGDVLVMMEGGSNDNGNPDTDSSSA 205 RVW MRD+YKY I S+ GLYNLAWAQAV NKPL ++ VM N N + +S Sbjct: 60 RVWTMRDVYKYPI-SRDYARGLYNLAWAQAVQNKPLDELFVMTS--DNSNQCANGESKVI 116 Query: 206 KTSDGDENXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNSNKETECQXXXX 385 D D++ N KE Sbjct: 117 IDVDVDDDAKEEGELEEGEIDLDSADLVV------------------NFGKEAN------ 152 Query: 386 XXXXXXXXXXMEELVNLNAADAQKSFDALCSRLETTAIKLQKIVLEGPFAERDTLVQLFI 565 E+L ++ + KSF +CS+L+T+ + L ++ L + D L+QLF+ Sbjct: 153 --------FIREQLQSVTLDETHKSFSMVCSKLQTSLLALGELALSQD--KNDILIQLFM 202 Query: 566 AAIQTLHKVFSSMTLRLKQQNGAILLRLLAQVTSLRPPLFSSLQLKEMEAIRLCXXXXXX 745 A++T++ VF SM KQQN IL RLL + P L SS QLKE++A+ L Sbjct: 203 TALRTINSVFYSMNDHQKQQNTDILSRLLFNAKTQLPALLSSEQLKELDALILSINHSLV 262 Query: 746 XXXXXX--ATGRKEVRQGFNTNDLHVLLENTDSKAAYLKSRGKEPGSAGSVDQSGHTFPI 919 V Q + D H EN + + S S + Sbjct: 263 SSNTQDNDTVNGINVVQLLDMKDSHKSSENANQDFTSVNKYDLGDVSIKSSGLKEQSVSS 322 Query: 920 QCSAPGLVNMRLKGLVLPLLDLHKDHDADSLPSPTRDLSATFPFDKGLILEQGLLKPEWP 1099 + PGL N + KGL 
PLLDLHKDHD D+LPSPTR + FP + G++K + P Sbjct: 323 ESVKPGLDNSKAKGLSFPLLDLHKDHDEDTLPSPTRQIGPQFPATQ----THGMVKLDLP 378 Query: 1100 VLRQTLPRENPVLHPYETDAVKAVSSYQQKFGGGSFFVNDELPSPTPSEEGDNNGDXXXX 1279 + +L + N +LHPYETDA+KAVSSYQQKFG S FV++ LPSPTPSEE D+ Sbjct: 379 IFPASLDKGNSLLHPYETDALKAVSSYQQKFGRSSLFVSENLPSPTPSEEDDSGKGDTGG 438 Query: 1280 XXXXXXXXHNVNRALNSLISGQPXXXXXXXXXXXXXXXESSNVRTVDTENN-SRSVLKTS 1456 HN + LN GQP + RT D + L++S Sbjct: 439 EVTSFDVVHNASH-LNESSMGQPILSSVPQTNILDGQGLGTT-RTADPLSFLPNPSLRSS 496 Query: 1457 YAKSRDPRLRLANSDAGPRNLSPSLPLIGSDESKSKFAGVMSSRKHKVVQEQVLDGPALK 1636 AKSRDPRLRLA SD +N LP+ D ++ S+K K V D P K Sbjct: 497 TAKSRDPRLRLATSDTVAQNTI--LPIPDIDLKLEASLEMIVSKKQKTVDLSAFDAPLPK 554 Query: 1637 RQKNEXXXXXXXXXXXXXIS---------TSQVAIPSPNLPVSS---------------- 1741 RQ++E I T+++ I S N + Sbjct: 555 RQRSEQTDSIIVSDVRPSIGNGGWLEDRGTAELPITSSNCATYNSDNDIRKLEQVTATIA 614 Query: 1742 LIKSPLSFQTEIMPVKXXXXXXXXXXXXRDIVGNPSMWMSILKMEHQKSSGDN--NSAVQ 1915 I S + E PV +DI NPS+WM+I+K E QKS+ + N+A Sbjct: 615 TIPSVIVNAAENFPVTGISTSTTLHSLLKDIAINPSIWMNIIKTEQQKSADASRTNTAQA 674 Query: 1916 MPNSNSTGVAPSTSGVLPFSSMLGQKPAGIV--PCQAVSAEEPGKVRMKPRDPRRILHNN 2089 + + G PST V P SS +GQ+ GI+ P SA+E VRMKPRDPRR+LH+ Sbjct: 675 SSSKSILGAVPSTVAVAPRSSAIGQRSVGILQTPTHTASADEVAIVRMKPRDPRRVLHST 734 Query: 2090 TPHKGSTAVSDLPKTNASSLSVIMGSLSAKEQEDQME-KVVSSGTVKPPDITMQFTNNLR 2266 KG + D KT + + +LS + QEDQ++ K + + PPDI QFT NL+ Sbjct: 735 AVLKGGSVGLDQCKTGVAGTHATISNLSFQSQEDQLDRKSAVTLSTTPPDIACQFTKNLK 794 Query: 2267 NIADIMSVSQASMPSTILPLSVSSEQ--QAGTDTKIVVNESVNFRSGSNLTSEAATSIPP 2440 NIAD++SVS ++ PS Q Q+ ++ K V+E + + + L SE + Sbjct: 795 NIADMISVSPSTSPSVASQTQTLCIQAYQSRSEVKGAVSEPSEWVNDAGLASEKGSPGSL 854 Query: 2441 RPLNANAWSDVEHLFDGFDDQQKAAIQRERARRLEEQNKMFAAGKXXXXXXXXXXXXNSA 2620 +P +W DVEHLF+G+ DQQ+A IQRER RRLEEQ KMF+ K NSA Sbjct: 855 QP--QISWGDVEHLFEGYSDQQRADIQRERTRRLEEQKKMFSVRKLCLVLDLDHTLLNSA 912 Query: 2621 KFVEVDPLHDEMLRKKEEQDREKPYRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLY 2800 KFVE+DP+H+E+LRKKEEQDREKPYRHLFRFPHMGMWTKLRPGIWNFLEKAS L+ELHLY 
Sbjct: 913 KFVEIDPVHEEILRKKEEQDREKPYRHLFRFPHMGMWTKLRPGIWNFLEKASNLFELHLY 972 Query: 2801 TMGNKYYATEMAKLLDPKGELFSGRVISRGDDGEPFDSDDRVPKSKDLEGVLGMESAVVI 2980 TMGNK YATEMAKLLDPKG+LF+GRVISRGDDG+PFD D+RVPKSKDLEGVLGMESAVVI Sbjct: 973 TMGNKLYATEMAKLLDPKGDLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVI 1032 Query: 2981 IDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEEGTLASSLAVIE 3160 IDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPE+GTLAS L VI+ Sbjct: 1033 IDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASCLGVIQ 1092 Query: 3161 RIHEIFFGHDSLDEADVRNILACEQQKILAGCRIVFSRVFPVGEANPHMHPLWQTAEQFG 3340 RIH+ FF H S+DEADVRNILA EQ+KILAGCRIVFSRVFPVGEA+PH+HPLWQTAEQFG Sbjct: 1093 RIHQNFFTHRSIDEADVRNILATEQKKILAGCRIVFSRVFPVGEASPHLHPLWQTAEQFG 1152 Query: 3341 AVCTNQIDEQVTHVVANSLGTDKVNWALSRGRFVVHPGWVEASALLYRRANEHDFAIK 3514 AVCT+QID+QVTHVVANSLGTDKVNWALS GR VVHPGWVEASALLYRRANEHDFAIK Sbjct: 1153 AVCTSQIDDQVTHVVANSLGTDKVNWALSTGRSVVHPGWVEASALLYRRANEHDFAIK 1210 >ref|XP_007043830.1| RNA polymerase II C-terminal domain phosphatase-like 3, putative [Theobroma cacao] gi|508707765|gb|EOX99661.1| RNA polymerase II C-terminal domain phosphatase-like 3, putative [Theobroma cacao] Length = 1290 Score = 949 bits (2453), Expect = 0.0 Identities = 588/1245 (47%), Positives = 720/1245 (57%), Gaps = 74/1245 (5%) Frame = +2 Query: 2 SNSGG---SEARVWMMRDLYKYQIPSKPSYLGLYNLAWAQAVNNKPLGDVLV------MM 154 S+ GG S +RVW M+DL KY + GLYN AWAQAV NKPL ++ V Sbjct: 66 SSKGGEANSNSRVWTMQDLCKYPSVIRGYASGLYNFAWAQAVQNKPLNEIFVKDFEQPQQ 125 Query: 155 EGGSNDNGNPDTDSSSAKTSDGDENXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 334 + N + + S ++ S ++ Sbjct: 126 DENKNSKRSSPSSSVASVNSKEEKGSSGNLAVKVVIDDDSEDEMEEDKVVNLDKEEGELE 185 Query: 335 XXXXNSNKETECQXXXXXXXXXXXXXXMEELVNL--------NAADAQKSFDALCSRLET 490 + + E + + +E+ NL +A+KSF+ +CSRL Sbjct: 186 EGEIDLDSEPKEKVLSSEDGNVGNSDELEKRANLIRGVLEGVTVIEAEKSFEGVCSRLHN 245 Query: 491 TAIKLQKIVLEGPFAERDTLVQLFIAAIQTLHKVFSSMTLRLKQQNGAILLRLLAQVTSL 670 L+ ++LE +D L+QL AI + F ++ K+QN AIL RLL+ V Sbjct: 246 
ALESLRALILECSVPAKDALIQLAFGAINS---AFVALNCNSKEQNVAILSRLLSIVKGH 302 Query: 671 RPPLFSSLQLKEMEAIRLCXXXXXXXXXXXXATGRKEVRQGFNTNDLHVLLENTDSKAAY 850 P LF ++KE++ + + +V G N D L EN Sbjct: 303 DPSLFPPDKMKEIDVMLISLNSPARAIDTEKDM---KVVDGVNKKDPDALPENICHD--- 356 Query: 851 LKSRGKEPGSAGSVDQSGHTFPIQCSAPGLVNMRLKGLVLPLLDLHKDHDADSLPSPTRD 1030 L K P SA V + + PG+ N R +G+ LPLLDLHKDHDADSLPSPTR+ Sbjct: 357 LTVTNKLPSSAKFVINNKPNALTETLKPGVPNFRNRGISLPLLDLHKDHDADSLPSPTRE 416 Query: 1031 LSATFPFDKGLILEQGLLKPEWPVLRQTLPRENPVLHPYETDAVKAVSSYQQKFGGGSFF 1210 + P +K L ++K + + + E LHPYETDA+KA S+YQQKFG GSFF Sbjct: 417 TTPCLPVNKPLTSGDVMVKSGFMTGKGSHDAEGDKLHPYETDALKAFSTYQQKFGQGSFF 476 Query: 1211 VNDELPSPTPSEE-GDNNGDXXXXXXXXXXXXHNVNRALNSLISGQPXXXXXXXXXXXXX 1387 +D LPSPTPSEE GD GD N N I G P Sbjct: 477 SSDRLPSPTPSEESGDEGGDNGGEVSSSSSIG---NFKPNLPILGHPIVSSAPLVDSASS 533 Query: 1388 XXESS-NVRTVDTENNSRSVLKTSYAKSRDPRLRLANSDAGPRNLSPSLPLIGSDESKSK 1564 + R ++ +++ S AKSRDPRL ANS+A +L+ L + + + Sbjct: 534 SLQGQITTRNATPMSSVSNIVSKSLAKSRDPRLWFANSNASALDLNERL--LHNASKVAP 591 Query: 1565 FAGVMSSRKHKVVQEQVLDGPALKRQKN---------------------EXXXXXXXXXX 1681 G+M SRK K V+E +LD PALKRQ+N E Sbjct: 592 VGGIMDSRKKKSVEEPILDSPALKRQRNELENLGVARDVQTVSGIGGWLEDTDAIGSQIT 651 Query: 1682 XXXISTSQVAIPSPNLPVSSLIKSPLSFQTEI-------MPVKXXXXXXXXXXXXRDIVG 1840 + + S + S LS +T I +PV +DI Sbjct: 652 NRNQTAENLESNSRKMDNGVTSSSTLSGKTNITVGTNEQVPVTSTSTPSLPALL-KDIAV 710 Query: 1841 NPSMWMSILKM---------EHQKSSGDNNSAVQMPNSNST-GV--------APSTSGVL 1966 NP+M ++ILKM QKS S P+SNS GV +PS + V Sbjct: 711 NPTMLINILKMGQQQRLGAEAQQKSPDPVKSTFHQPSSNSLLGVVSSTNVIPSPSVNNVP 770 Query: 1967 PFSSMLGQKPAGIVPCQAVSAEEPGKVRMKPRDPRRILHNNTPHKGSTAVSDLPKTNASS 2146 SS + KPAG + Q S +E GK+RMKPRDPRR+LH N+ + + D KTN + Sbjct: 771 SISSGISSKPAGNL--QVPSPDESGKIRMKPRDPRRVLHGNSLQRSGSMGLDQLKTNGAL 828 Query: 2147 LSVIMGS---LSAKEQEDQME-KVVSSGTVKPPDITMQFTNNLRNIADIMSVSQA--SMP 2308 S GS L+A++ + Q E K + S V PPDIT QFTNNL+NIADIMSVSQA S+P Sbjct: 829 TSSTQGSKDNLNAQKLDSQTESKPMQSQLVPPPDITQQFTNNLKNIADIMSVSQALTSLP 888 Query: 
2309 ST---ILPLSVSSEQQAGTDTKIVVNESVNFRSGSNLTSEAATSIPPRPLNANAWSDVEH 2479 ++P V + + D K +V+ S + ++G+ L EA + P + NAW DVEH Sbjct: 889 PVSHNLVPQPVLIKSDS-MDMKALVSNSEDQQTGAGLAPEAGAT---GPRSQNAWGDVEH 944 Query: 2480 LFDGFDDQQKAAIQRERARRLEEQNKMFAAGKXXXXXXXXXXXXNSAKFVEVDPLHDEML 2659 LF+ +DDQQKAAIQRERARR+EEQ KMF+A K NSAKF+EVDP+H+E+L Sbjct: 945 LFERYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFIEVDPVHEEIL 1004 Query: 2660 RKKEEQDREKPYRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKYYATEMAK 2839 RKKEEQDREKP RHLFRF HMGMWTKLRPGIWNFLEKASKLYELHLYTMGNK YATEMAK Sbjct: 1005 RKKEEQDREKPERHLFRFHHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAK 1064 Query: 2840 LLDPKGELFSGRVISRGDDGEPFDSDDRVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKL 3019 +LDPKG LF+GRVISRGDDG+PFD D+RVP+SKDLEGVLGMESAVVIIDDSVRVWPHNKL Sbjct: 1065 VLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESAVVIIDDSVRVWPHNKL 1124 Query: 3020 NLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEEGTLASSLAVIERIHEIFFGHDSLD 3199 NLIVVERY YFPCSRRQFGL GPSLLEIDHDERPE+GTLASSLAVIERIH+ FF H +LD Sbjct: 1125 NLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIERIHQDFFSHQNLD 1184 Query: 3200 EADVRNILACEQQKILAGCRIVFSRVFPVGEANPHMHPLWQTAEQFGAVCTNQIDEQVTH 3379 + DVRNILA EQ+KILAGCRIVFSRVFPVGEANPH+HPLWQTAEQFGAVCTNQIDE VTH Sbjct: 1185 DVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEHVTH 1244 Query: 3380 VVANSLGTDKVNWALSRGRFVVHPGWVEASALLYRRANEHDFAIK 3514 VVANSLGTDKVNWALS G+FVVHPGWVEASALLYRRANE DFAIK Sbjct: 1245 VVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEVDFAIK 1289 >gb|AAV92930.1| putative transcription regulator CPL1 [Solanum lycopersicum] Length = 1227 Score = 937 bits (2421), Expect = 0.0 Identities = 573/1233 (46%), Positives = 701/1233 (56%), Gaps = 70/1233 (5%) Frame = +2 Query: 26 RVWMMRDLYKYQIPSKPSYLGLYNLAWAQAVNNKPLGDVLVMMEGGSNDNGNPDTDSSSA 205 RVW MRD+YKY I S+ GLYNLAWAQAV NKPL ++ VM N N + +S Sbjct: 60 RVWTMRDVYKYPI-SRDYARGLYNLAWAQAVQNKPLDELFVMTS--DNSNQCANGESKVI 116 Query: 206 KTSDGDENXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNSNKETECQXXXX 385 D D++ N KE Sbjct: 117 
IDVDVDDDAKEEGELEEGEIDLDSADLVV------------------NFGKEAN------ 152 Query: 386 XXXXXXXXXXMEELVNLNAADAQKSFDALCSRLETTAIKLQKIVLEGPFAERDTLVQLFI 565 E+L ++ + KSF +CS+L+T+ + L ++ L + D L+QLF+ Sbjct: 153 --------FIREQLQSVTLDETHKSFSMVCSKLQTSLLALGELALSQD--KNDILIQLFM 202 Query: 566 AAIQTLHKVFSSMTLRLKQQNGAILLRLLAQVTSLRPPLFSSLQLKEMEAIRLCXXXXXX 745 A++T++ VF SM KQQN IL RLL + P L SS QLKE++A+ L Sbjct: 203 TALRTINSVFYSMNDHQKQQNTDILSRLLFNAKTQLPALLSSEQLKELDALILSINHSLV 262 Query: 746 XXXXXX--ATGRKEVRQGFNTNDLHVLLENTDSKAAYLKSRGKEPGSAGSVDQSGHTFPI 919 V Q + D H EN + + S S + Sbjct: 263 SSNTQDNDTVNGINVVQLLDMKDSHKSSENANQDFTSVNKYDLGDVSIKSSGLKEQSVSS 322 Query: 920 QCSAPGLVNMRLKGLVLPLLDLHKDHDADSLPSPTRDLSATFPFDKGLILEQGLLKPEWP 1099 + PGL N + KGL PLLDLHKDHD D+LPSPTR + FP + G++K + P Sbjct: 323 ESVKPGLDNSKAKGLSFPLLDLHKDHDEDTLPSPTRQIGPQFPATQ----THGMVKLDLP 378 Query: 1100 VLRQTLPRENPVLHPYETDAVKAVSSYQQKFGGGSFFVNDELPSPTPSEEGDNNGDXXXX 1279 + +L + N +LHPYETDA+KAVSSYQQKFG S FV++ LPSPTPSEE D+ Sbjct: 379 IFPASLDKGNSLLHPYETDALKAVSSYQQKFGRSSLFVSENLPSPTPSEEDDSGKGDTGG 438 Query: 1280 XXXXXXXXHNVNRALNSLISGQPXXXXXXXXXXXXXXXESSNVRTVDTENN-SRSVLKTS 1456 HN + LN GQP + RT D + L++S Sbjct: 439 EVTSFDVVHNASH-LNESSMGQPILSSVPQTNILDGQGLGTT-RTADPLSFLPNPSLRSS 496 Query: 1457 YAKSRDPRLRLANSDAGPRNLSPSLPLIGSDESKSKFAGVMSSRKHKVVQEQVLDGPALK 1636 AKSRDPRLRLA SD +N LP+ D ++ S+K K V D P K Sbjct: 497 TAKSRDPRLRLATSDTVAQNTI--LPIPDIDLKLEASLEMIVSKKQKTVDLSAFDAPLPK 554 Query: 1637 RQKNEXXXXXXXXXXXXXIS---------TSQVAIPSPNLPVSS---------------- 1741 RQ++E I T+++ I S N + Sbjct: 555 RQRSEQTDSIIVSDVRPSIGNGGWLEDRGTAELPITSSNCATYNSDNDIRKLEQVTATIA 614 Query: 1742 LIKSPLSFQTEIMPVKXXXXXXXXXXXXRDIVGNPSMWMSILKMEHQKSSGDN--NSAVQ 1915 I S + E PV +DI NPS+WM+I+K E QKS+ + N+A Sbjct: 615 TIPSVIVNAAENFPVTGISTSTTLHSLLKDIAINPSIWMNIIKTEQQKSADASRTNTAQA 674 Query: 1916 MPNSNSTGVAPSTSGVLPFSSMLGQKPAGIV--PCQAVSA-------------------- 2029 + + G PST V P SS +GQ+ GI+ P SA Sbjct: 675 SSSKSILGAVPSTVAVAPRSSAIGQRSVGILQTPTHTASAASSIYNLLMNDFIYSVIFTA 734 Query: 2030 
---------------EEPGKVRMKPRDPRRILHNNTPHKGSTAVSDLPKTNASSLSVIMG 2164 +E VRMKPRDPRR+LH+ KG + D KT + + Sbjct: 735 SIAQFPFYFFLTFSRDEVAIVRMKPRDPRRVLHSTAVLKGGSVGLDQCKTGVAGTHATIS 794 Query: 2165 SLSAKEQEDQME-KVVSSGTVKPPDITMQFTNNLRNIADIMSVSQASMPSTILPLSVSSE 2341 +LS + QEDQ++ K + + PPDI QFT NL+NIAD++SVS ++ PS Sbjct: 795 NLSFQSQEDQLDRKSAVTLSTTPPDIACQFTKNLKNIADMISVSPSTSPSVASQTQTLCI 854 Query: 2342 Q--QAGTDTKIVVNESVNFRSGSNLTSEAATSIPPRPLNANAWSDVEHLFDGFDDQQKAA 2515 Q Q+ ++ K V+E + + + L SE + +P +W DVEHLF+G+ DQQ+A Sbjct: 855 QAYQSRSEVKGAVSEPSEWVNDAGLASEKGSPGSLQP--QISWGDVEHLFEGYSDQQRAD 912 Query: 2516 IQRERARRLEEQNKMFAAGKXXXXXXXXXXXXNSAKFVEVDPLHDEMLRKKEEQDREKPY 2695 IQRER RRLEEQ KMF+ FVE+DP+H+E+LRKKEEQDREKPY Sbjct: 913 IQRERTRRLEEQKKMFS-------------------FVEIDPVHEEILRKKEEQDREKPY 953 Query: 2696 RHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKYYATEMAKLLDPKGELFSGR 2875 RHLFRFPHMGMWTKLRPGIWNFLEKAS L+ELHLYTMGNK YATEMAKLLDPKG+LF+GR Sbjct: 954 RHLFRFPHMGMWTKLRPGIWNFLEKASNLFELHLYTMGNKLYATEMAKLLDPKGDLFAGR 1013 Query: 2876 VISRGDDGEPFDSDDRVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFP 3055 VISRGDDG+PFD D+RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFP Sbjct: 1014 VISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFP 1073 Query: 3056 CSRRQFGLPGPSLLEIDHDERPEEGTLASSLAVIERIHEIFFGHDSLDEADVRNILACEQ 3235 CSRRQFGLPGPSLLEIDHDERPE+GTLAS L VI+RIH+ FF H S+DEADVRNILA EQ Sbjct: 1074 CSRRQFGLPGPSLLEIDHDERPEDGTLASCLGVIQRIHQNFFTHRSIDEADVRNILATEQ 1133 Query: 3236 QKILAGCRIVFSRVFPVGEANPHMHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVN 3415 +KILAGCRIVFSRVFPVGEA+PH+HPLWQTAEQFGAVCT+QID+QVTHVVANSLGTDKVN Sbjct: 1134 KKILAGCRIVFSRVFPVGEASPHLHPLWQTAEQFGAVCTSQIDDQVTHVVANSLGTDKVN 1193 Query: 3416 WALSRGRFVVHPGWVEASALLYRRANEHDFAIK 3514 WALS GR VVHPGWVEASALLYRRANEHDFAIK Sbjct: 1194 WALSTGRSVVHPGWVEASALLYRRANEHDFAIK 1226 >ref|XP_006438860.1| hypothetical protein CICLE_v10030535mg [Citrus clementina] gi|568858958|ref|XP_006483010.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Citrus sinensis] 
gi|557541056|gb|ESR52100.1| hypothetical protein CICLE_v10030535mg [Citrus clementina] Length = 1234 Score = 934 bits (2414), Expect = 0.0 Identities = 584/1229 (47%), Positives = 715/1229 (58%), Gaps = 61/1229 (4%) Frame = +2 Query: 11 GGSEARVWMMRDLY-KYQIPSKPSYLGLYNLAWAQAVNNKPLGDVLVMMEGGSNDNGNPD 187 G + ARVW MRDLY KY + GL+NLAWAQAV NKPL ++ VM E +D Sbjct: 46 GEAAARVWTMRDLYNKYPAICRGYGPGLHNLAWAQAVQNKPLNEIFVM-EAEQDDVSKRS 104 Query: 188 TDSSSAKTSDGDENXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNSNKETE 367 + +SS + + SN++ Sbjct: 105 SPASSVASVNSGAAAGKDDKKVVEKVVIDDSGDEIEKEEGELEEGEIELDLESESNEKVS 164 Query: 368 CQXXXXXXXXXXXXXXMEELVNLNAADAQKSFDALCSRLETTAIKLQKIVLEGPFAERDT 547 Q E L ++ D SF+ +CS+LE T L+++V E +D Sbjct: 165 EQVKEEMKLINVESIR-EALESVLRGDI--SFEGVCSKLEFTLESLRELVNENNVPTKDA 221 Query: 548 LVQLFIAAIQTLHKVFSSMTLRLKQQNGAILLRLLAQVTSLRPPLFSSLQLKEMEAIRLC 727 L+QL +A+Q++H VF SM LK+QN IL RLL+ + S PPLFSS Q+KEMEA+ Sbjct: 222 LIQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSVIKSHEPPLFSSNQIKEMEAM--- 278 Query: 728 XXXXXXXXXXXXATGRKE---VRQGFNTNDLHVLLENTDSKAAYLKSRGKEPGSAGSVDQ 898 A +++ G N D +++ EN + L + K P S+ Q Sbjct: 279 -----LSSLVTRANDKEKDMLAMHGVNGKDSNIVTENAVND---LNFKEKVPLPVDSLMQ 330 Query: 899 SGHTFPIQCSAPGLVNMRLKGLVLPLLDLHKDHDADSLPSPTRDLSATFPFDKGLILEQG 1078 + P++ S PG R +G++LPLLD HK HD DSLPSPTR+ + + P + L++ G Sbjct: 331 NK---PLEASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDG 387 Query: 1079 LLKPEWPVLRQTLPRENPVLHPYETDAVKAVSSYQQKFGGGSFFVNDELPSPTPSEE--- 1249 ++K + + E YETDA++A SSYQQKFG SFF+N ELPSPTPSEE Sbjct: 388 VVKSWAAAAKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGD 447 Query: 1250 --GDNNGDXXXXXXXXXXXXHNVNRALNSLISGQPXXXXXXXXXXXXXXXESSNVRTVDT 1423 GD G+ N+ +S QP + S+V+ + T Sbjct: 448 GDGDTGGEISSATAVDQPKPVNMPTLGQQPVSSQPMDISQPM--------DISSVQALTT 499 Query: 1424 ENNS-------RSVLKTSYA-----KSRDPRLRLANSDAGPRNLSPSLPLIGSDESKSKF 1567 NNS V+K + KSRDPRLR A+S+A N P+ P++ + Sbjct: 500 ANNSAPASSGYNPVVKPNPVVKAPIKSRDPRLRFASSNALNLNHQPA-PILHNAPKVEPV 558 Query: 1568 
AGVMSSRKHKVVQEQVLDGPALKRQKNEXXXXXXXXXXXXXISTS---------QVAIPS 1720 VMSSRK K V+E VLDGPALKRQ+N + + I + Sbjct: 559 GRVMSSRKQKTVEEPVLDGPALKRQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMN 618 Query: 1721 PNLPVSSL----------IKSPLSFQT--------EIMPVKXXXXXXXXXXXXRDIVGNP 1846 NL V S SP++ T E P +DI NP Sbjct: 619 RNLLVDSAESNSRKLDNGATSPITSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNP 678 Query: 1847 SMWMSILKM-EHQKSSGDNNSAVQMPNSNSTGVA-PSTSGVLPFSSMLGQKPAGIVPCQA 2020 +M ++ILKM + QK + D A Q N +S P +P S+ P+GI+ + Sbjct: 679 TMLLNILKMGQQQKLAAD---AQQKSNDSSMNTMHPPIPSSIPPVSVTCSIPSGIL---S 732 Query: 2021 VSAEEPGKVRMKPRDPRRILHNNTPHKGSTAVSDLPKTNASSLSVIMGSLSAKEQEDQM- 2197 +E GKVRMKPRDPRR+LH N + + + KT+ S GS + Q+ Sbjct: 733 KPMDELGKVRMKPRDPRRVLHGNALQRSGSLGPEF-KTDGPSAPCTQGSKENLNFQKQLG 791 Query: 2198 ---EKVVSSGTVKPPDITMQFTNNLRNIADIMSVSQ-------ASMPSTILPLSVSSEQQ 2347 K V S +V PDIT QFT NL++IAD MSVSQ S S I P + S Sbjct: 792 APEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKS--- 848 Query: 2348 AGTDTKIVVNESVNFRSGSNLTSEAATSIPPRPLNANAWSDVEHLFDGFDDQQKAAIQRE 2527 G D K VV + ++G+ EA P +AW DVEHLF+G+DDQQKAAIQ+E Sbjct: 849 -GADMKAVVTNHDDKQTGTGSGPEAG---PVGAHPQSAWGDVEHLFEGYDDQQKAAIQKE 904 Query: 2528 RARRLEEQNKMFAAGKXXXXXXXXXXXXNSAKFVEVDPLHDEMLRKKEEQDREKPYRHLF 2707 R RRLEEQ KMF+A K NSAKF EVDP+HDE+LRKKEEQDREKP+RHLF Sbjct: 905 RTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF 964 Query: 2708 RFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKYYATEMAKLLDPKGELFSGRVISR 2887 RFPHMGMWTKLRPGIW FLE+ASKL+E+HLYTMGNK YATEMAK+LDPKG LF+GRVISR Sbjct: 965 RFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 1024 Query: 2888 GDDGEPFDSDDRVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRR 3067 GDDG+PFD D+RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY YFPCSRR Sbjct: 1025 GDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRR 1084 Query: 3068 QFGLPGPSLLEIDHDERPEEGTLASSLAVIERIHEIFFGHDSLDEADVRNILACEQQKIL 3247 QFGL GPSLLEIDHDER E+GTLASSL VIER+H+IFF H SLD+ DVRNILA EQ+KIL Sbjct: 1085 QFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKIL 1144 
Query: 3248 AGCRIVFSRVFPVGEANPHMHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALS 3427 AGCRIVFSRVFPVGEANPH+HPLWQTAEQFGAVCT ID+QVTHVVANSLGTDKVNWALS Sbjct: 1145 AGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALS 1204 Query: 3428 RGRFVVHPGWVEASALLYRRANEHDFAIK 3514 GRFVVHPGWVEASALLYRRANE DFAIK Sbjct: 1205 TGRFVVHPGWVEASALLYRRANEQDFAIK 1233 >ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Vitis vinifera] Length = 1238 Score = 924 bits (2388), Expect = 0.0 Identities = 551/1079 (51%), Positives = 664/1079 (61%), Gaps = 47/1079 (4%) Frame = +2 Query: 419 EELVNLNAADAQKSFDALCSRLETTAIKLQKI-----VLEGPFAERDTLVQLFIAAIQTL 583 E+L ++ +A+KSF +CSRL+ T LQK+ V E +D L Q I AI+ L Sbjct: 191 EDLESVTVIEAEKSFSGVCSRLQNTLGSLQKVFGEKVVGESSVPTKDALAQQLINAIRAL 250 Query: 584 HKVFSSMTLRLKQQNGAILLRLLAQVTSLRPPLFSSLQLKEMEAIR--LCXXXXXXXXXX 757 + VF SM K+ N + RLL+ V P+FS +KE+E + L Sbjct: 251 NHVFCSMNSNQKELNKDVFSRLLSCVECGDSPIFSIQHIKEVEVMMSFLDTPAAQSSAEA 310 Query: 758 XXATGRKEVRQGFNTNDLHVLLENTDSKAAYLKSRGKEPGSAGSVDQSGHTFPIQCSAPG 937 +V G N N L +E++ A K + S S +Q+ PG Sbjct: 311 SDKVNDVQVTDGMNRNILDSSVESSGRAFASAKKLSLDSISVESYNQNNP----DALKPG 366 Query: 938 LVNMRLKGLVLPLLDLHKDHDADSLPSPTRDLSATFPFDKGLILEQGLLKPEWPVLRQTL 1117 L + R + + PLLDLHKDHD DSLPSPT FP +K ++ V +T Sbjct: 367 LSSSRGRFIFGPLLDLHKDHDEDSLPSPTGKAPQCFPVNKSELVTA-------KVAHET- 418 Query: 1118 PRENPVLHPYETDAVKAVSSYQQKFGGGSFFVNDELPSPTPSEE-----GDNNGDXXXXX 1282 ++ ++HPYETDA+KAVS+YQQKFG SF D+LPSPTPSEE GD +G+ Sbjct: 419 --QDSIMHPYETDALKAVSTYQQKFGLTSFLPIDKLPSPTPSEESGDTYGDISGEVSSSS 476 Query: 1283 XXXXXXXHNVNRALNSLISGQPXXXXXXXXXXXXXXXESSNVRTVDTENNSRSVLKTSYA 1462 N + ++S P N V + + S + S A Sbjct: 477 TISAPITANAPALGHPIVSSAPQMDSSIVQGPTV----GRNTSLVSSGPHLDSSVVAS-A 531 Query: 1463 KSRDPRLRLANSDAGPRNLSPS-LPLIGSDESKSKFAGVMSSRKHKVVQEQVLDGPALKR 1639 KSRDPRLRLA+SDAG +L+ LP + + ++SSRK K +E +LDGP KR Sbjct: 532 KSRDPRLRLASSDAGSLDLNERPLPAVSNSPKVDPLGEIVSSRKQKSAEEPLLDGPVTKR 591 Query: 1640 
QKNEXXXXXXXXXXXXXIST------SQVAIP---SPNLPVSSLIKSPLSFQTEI----- 1777 Q+N +++ S IP + N + + P ++++ Sbjct: 592 QRNGLTSPATVRDAQTVVASGGWLEDSNTVIPQMMNRNQLIENTGTDPKKLESKVTVTGI 651 Query: 1778 --------------MPVKXXXXXXXXXXXXRDIVGNPSMWMSIL-KMEHQKSSGDNNSAV 1912 +PV +DI NP++WM+I K+E QKS + V Sbjct: 652 GCDKPYVTVNGNEHLPVVATSTTASLQSLLKDIAVNPAVWMNIFNKVEQQKSGDPAKNTV 711 Query: 1913 QMPNSNST-GVAPSTSGVLPFSSMLGQKPAGIVPC-QAVSAEEPGKVRMKPRDPRRILHN 2086 P SNS GV P S S LGQKPAG + Q +E GKVRMKPRDPRRILH Sbjct: 712 LPPTSNSILGVVPPASVAPLKPSALGQKPAGALQVPQTGPMDESGKVRMKPRDPRRILHA 771 Query: 2087 NTPHKGSTAVSDLPKTNASSLSVIMGSLSAKEQEDQME-KVVSSGTVKPPDITMQFTNNL 2263 N+ + ++ S+ KTNA ++QEDQ E K V S +V PPDI+ QFT NL Sbjct: 772 NSFQRSGSSGSEQFKTNA------------QKQEDQTETKSVPSHSVNPPDISQQFTKNL 819 Query: 2264 RNIADIMSVSQASMPSTILPLSVSSEQQAGTDTKIVVNESVNFRSGSNLTSEAAT--SIP 2437 +NIAD+MS SQAS + P +SS+ ++ V +V+ SG LT+ + S Sbjct: 820 KNIADLMSASQASSMTPTFPQILSSQSVQVNTDRMDVKATVS-DSGDQLTANGSKPESAA 878 Query: 2438 PRPLNANAWSDVEHLFDGFDDQQKAAIQRERARRLEEQNKMFAAGKXXXXXXXXXXXXNS 2617 P + N W DVEHLFDG+DDQQKAAIQRERARR+EEQ KMF+A K NS Sbjct: 879 GPPQSKNTWGDVEHLFDGYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTLLNS 938 Query: 2618 AKFVEVDPLHDEMLRKKEEQDREKPYRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHL 2797 AKFVEVDP+HDE+LRKKEEQDREK RHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHL Sbjct: 939 AKFVEVDPVHDEILRKKEEQDREKSQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHL 998 Query: 2798 YTMGNKYYATEMAKLLDPKGELFSGRVISRGDDGEPFDSDDRVPKSKDLEGVLGMESAVV 2977 YTMGNK YATEMAK+LDPKG LF+GRVIS+GDDG+ D D+RVPKSKDLEGVLGMESAVV Sbjct: 999 YTMGNKLYATEMAKVLDPKGVLFAGRVISKGDDGDVLDGDERVPKSKDLEGVLGMESAVV 1058 Query: 2978 IIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEEGTLASSLAVI 3157 IIDDSVRVWPHNKLNLIVVERY YFPCSRRQFGLPGPSLLEIDHDERPE+GTLASSLAVI Sbjct: 1059 IIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASSLAVI 1118 Query: 3158 ERIHEIFFGHDSLDEADVRNILACEQQKILAGCRIVFSRVFPVGEANPHMHPLWQTAEQF 3337 ERIH+ FF + +LDE DVRNILA EQ+KILAGCRIVFSRVFPVGEANPH+HPLWQTAE F Sbjct: 1119 
ERIHQSFFSNRALDEVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAESF 1178 Query: 3338 GAVCTNQIDEQVTHVVANSLGTDKVNWALSRGRFVVHPGWVEASALLYRRANEHDFAIK 3514 GAVCTNQIDEQVTHVVANSLGTDKVNWALS GRFVVHPGWVEASALLYRRANE DFAIK Sbjct: 1179 GAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1237 >emb|CBI35661.3| unnamed protein product [Vitis vinifera] Length = 1184 Score = 921 bits (2380), Expect = 0.0 Identities = 546/1048 (52%), Positives = 649/1048 (61%), Gaps = 16/1048 (1%) Frame = +2 Query: 419 EELVNLNAADAQKSFDALCSRLETTAIKLQKI-----VLEGPFAERDTLVQLFIAAIQTL 583 E+L ++ +A+KSF +CSRL+ T LQK+ V E +D L Q I AI+ L Sbjct: 202 EDLESVTVIEAEKSFSGVCSRLQNTLGSLQKVFGEKVVGESSVPTKDALAQQLINAIRAL 261 Query: 584 HKVFSSMTLRLKQQNGAILLRLLAQVTSLRPPLFSSLQLKEMEAIRLCXXXXXXXXXXXX 763 + VF SM K+ N + RLL+ V P+FS +KE+E + Sbjct: 262 NHVFCSMNSNQKELNKDVFSRLLSCVECGDSPIFSIQHIKEVEVMM---------SFLDT 312 Query: 764 ATGRKEVRQGFNTNDLHVLLENTDSKAAYLKSRGKEPGSAGSVDQSGHTFPIQCSAPGLV 943 + ND+ V TD + SV+ SG F Sbjct: 313 PAAQSSAEASDKVNDVQV----TDGMNRNILD--------SSVESSGRAFA------SAK 354 Query: 944 NMRLKGLVLPLLDLHKDHDADSLPSPTRDLSATFPFDKGLILEQGLLKPEWPVLRQTLPR 1123 R + + PLLDLHKDHD DSLPSPT FP +K ++ V +T Sbjct: 355 KFRGRFIFGPLLDLHKDHDEDSLPSPTGKAPQCFPVNKSELVTA-------KVAHET--- 404 Query: 1124 ENPVLHPYETDAVKAVSSYQQKFGGGSFFVNDELPSPTPSEE-GDNNGDXXXXXXXXXXX 1300 ++ ++HPYETDA+KAVS+YQQKFG SF D+LPSPTPSEE GD GD Sbjct: 405 QDSIMHPYETDALKAVSTYQQKFGLTSFLPIDKLPSPTPSEESGDTYGDISGEVSSSSTI 464 Query: 1301 XHNVNRALNSLISGQPXXXXXXXXXXXXXXXESSNVRTVDTENNSRSVLKTSYAKSRDPR 1480 + N+ G P N V++ NS +L+ S AKSRDPR Sbjct: 465 SAPITA--NAPALGHPIVSSAPQMDIVQGLVVPRNTGAVNSRFNS--ILRAS-AKSRDPR 519 Query: 1481 LRLANSDAGPRNLSPS-LPLIGSDESKSKFAGVMSSRKHKVVQEQVLDGPALKRQKNEXX 1657 LRLA+SDAG +L+ LP + + ++SSRK K +E +LDGP KRQ+N Sbjct: 520 LRLASSDAGSLDLNERPLPAVSNSPKVDPLGEIVSSRKQKSAEEPLLDGPVTKRQRN--G 577 Query: 1658 XXXXXXXXXXXISTSQVAIPSPNLPVSSLIKSPLSFQTEIMPVKXXXXXXXXXXXXRDIV 1837 ++ + + P + V+ E +PV +DI Sbjct: 578 LTSPATKLESKVTVTGIGCDKPYVTVNG---------NEHLPVVATSTTASLQSLLKDIA 628 Query: 
1838 GNPSMWMSIL-KMEHQKSSGDNNSAVQMPNSNST-GVAPSTSGVLPFSSMLGQKPAGIVP 2011 NP++WM+I K+E QKS + V P SNS GV P S S LGQKPAG + Sbjct: 629 VNPAVWMNIFNKVEQQKSGDPAKNTVLPPTSNSILGVVPPASVAPLKPSALGQKPAGALQ 688 Query: 2012 CQAVSA----EEPGKVRMKPRDPRRILHNNTPHKGSTAVSDLPKTNASSLSVIMGSLSAK 2179 +E GKVRMKPRDPRRILH N+ + ++ S+ KTNA + Sbjct: 689 VPQTGPMNPQDESGKVRMKPRDPRRILHANSFQRSGSSGSEQFKTNA------------Q 736 Query: 2180 EQEDQME-KVVSSGTVKPPDITMQFTNNLRNIADIMSVSQASMPSTILPLSVSSEQQAGT 2356 +QEDQ E K V S +V PPDI+ QFT NL+NIAD+MS SQAS + P +SS+ Sbjct: 737 KQEDQTETKSVPSHSVNPPDISQQFTKNLKNIADLMSASQASSMTPTFPQILSSQSVQVN 796 Query: 2357 DTKIVVNESVNFRSGSNLTSEAAT--SIPPRPLNANAWSDVEHLFDGFDDQQKAAIQRER 2530 ++ V +V+ SG LT+ + S P + N W DVEHLFDG+DDQQKAAIQRER Sbjct: 797 TDRMDVKATVS-DSGDQLTANGSKPESAAGPPQSKNTWGDVEHLFDGYDDQQKAAIQRER 855 Query: 2531 ARRLEEQNKMFAAGKXXXXXXXXXXXXNSAKFVEVDPLHDEMLRKKEEQDREKPYRHLFR 2710 ARR+EEQ KMF+A K NSAKFVEVDP+HDE+LRKKEEQDREK RHLFR Sbjct: 856 ARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKSQRHLFR 915 Query: 2711 FPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKYYATEMAKLLDPKGELFSGRVISRG 2890 FPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNK YATEMAK+LDPKG LF+GRVIS+G Sbjct: 916 FPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKG 975 Query: 2891 DDGEPFDSDDRVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQ 3070 DDG+ D D+RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY YFPCSRRQ Sbjct: 976 DDGDVLDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1035 Query: 3071 FGLPGPSLLEIDHDERPEEGTLASSLAVIERIHEIFFGHDSLDEADVRNILACEQQKILA 3250 FGLPGPSLLEIDHDERPE+GTLASSLAVIERIH+ FF + +LDE DVRNILA EQ+KILA Sbjct: 1036 FGLPGPSLLEIDHDERPEDGTLASSLAVIERIHQSFFSNRALDEVDVRNILASEQRKILA 1095 Query: 3251 GCRIVFSRVFPVGEANPHMHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSR 3430 GCRIVFSRVFPVGEANPH+HPLWQTAE FGAVCTNQIDEQVTHVVANSLGTDKVNWALS Sbjct: 1096 GCRIVFSRVFPVGEANPHLHPLWQTAESFGAVCTNQIDEQVTHVVANSLGTDKVNWALST 1155 Query: 3431 GRFVVHPGWVEASALLYRRANEHDFAIK 3514 GRFVVHPGWVEASALLYRRANE DFAIK Sbjct: 1156 GRFVVHPGWVEASALLYRRANEQDFAIK 1183 
>ref|XP_002304648.2| hypothetical protein POPTR_0003s16280g [Populus trichocarpa] gi|550343308|gb|EEE79627.2| hypothetical protein POPTR_0003s16280g [Populus trichocarpa] Length = 1247 Score = 904 bits (2337), Expect = 0.0 Identities = 552/1239 (44%), Positives = 699/1239 (56%), Gaps = 68/1239 (5%) Frame = +2 Query: 2 SNSGGSEARVWMMRDLYKYQIPSKPSYL-GLYNLAWAQAVNNKPLGDVLVMMEGGSNDNG 178 +N+ S+ +VW +RDLYKYQ+ Y+ GLYNLAWAQAV NKPL ++ V +E Sbjct: 56 NNNSSSKQKVWTVRDLYKYQVGG--GYMSGLYNLAWAQAVQNKPLNELFVEVE------- 106 Query: 179 NPDTDSSSAKTSDGDENXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNSNK 358 D SS K+S N ++ Sbjct: 107 ---VDDSSQKSSVSSVNSSKEDKRTVVIDDSGDEMDVVKVIDIEKEEGELEEGEIDLDSE 163 Query: 359 ETECQXXXXXXXXXXXXXXMEELVNLNAADAQKSFDALCSRLETTAIKLQKIVL--EGPF 532 E+L +++ KSF+A+C +L L+++V E F Sbjct: 164 GKSEGGMVSVDTEKRVKSIREDLESVSVIKDDKSFEAVCLKLHNALESLKELVRVNENGF 223 Query: 533 AERDTLVQLFIAAIQTLHKVFSSMTLRLKQQNGAILLRLLAQVTSLRPPLFSSLQLKEME 712 +D+LV+L AI ++ FSSM +LK+QN + +R L+ V S P FS KE+ Sbjct: 224 PSKDSLVRLLFTAIGAVNSFFSSMNQKLKEQNKGVFMRFLSLVNSHDPSFFSPEHTKEV- 282 Query: 713 AIRLCXXXXXXXXXXXXATGRKEVRQGFNTNDLHVLLENTDSKAAYLKSRGKEPGSAGSV 892 C F D ++ S L + + P +A S Sbjct: 283 ----CD---------------------FCNFDFRIV-----SLCYDLTTMNRLPSAAESF 312 Query: 893 DQSGHTFPIQCSAPGLVNMRLKGLVLPLLDLHKDHDADSLPSPTRDLSATFPFDKGLILE 1072 + F I+ PG+ + + +G++LPLLDL K HD DSLPSPTR+ + +FP + L + Sbjct: 313 VHNKPNFSIEPPKPGVPSFKSRGVLLPLLDLKKFHDEDSLPSPTRETAPSFPVQRLLPIG 372 Query: 1073 QGLLKPEWPVLRQTLPRENPVLHPYETDAVKAVSSYQQKFGGGSFFVNDELPSPTPSEE- 1249 G++ PV + E P +HPYETDA+KAVSSYQ+KF SFF N ELPSPTPSEE Sbjct: 373 DGMISSGLPVPKVASITEEPRVHPYETDALKAVSSYQKKFNLNSFFTN-ELPSPTPSEES 431 Query: 1250 GDNNGDXXXXXXXXXXXXH-NVNRALNSLISGQP---XXXXXXXXXXXXXXXESSNVRTV 1417 G+ +GD + VN ++ S P +S++R V Sbjct: 432 GNGDGDTAGEVSSSSTVNYRTVNPPVSDRKSASPSPSPPPPPPPPPPPPPHLNNSSIRVV 491 Query: 1418 DTENNS-------RSVLKTSYAKSRDPRLRLANSDAGPRNLSPSLPLIGSDESKSKFAGV 1576 NS S +K S AKSRDPRLR N+DA + + L+ ++ +++ +G Sbjct: 492 
IPTRNSAPVSSGTSSTVKAS-AKSRDPRLRYVNTDASALDQNQRTLLMVNNPPRAEPSGA 550 Query: 1577 MSSRKHKVVQEQVLDGPALKRQKNEXXXXXXXXXXXXXIST------------------- 1699 ++ + + ++E VLDG +LKRQ+N T Sbjct: 551 IAGSRKQKIEEDVLDGTSLKRQRNSFDNFGVVRDIRSMTGTGGWLEDTDMAEPQTVNKNQ 610 Query: 1700 ------------SQVAIPSPNLPVSSLIKSPLSFQTEIMPVKXXXXXXXXXXXXRDIVGN 1843 + V PS +SS+ S + Q +M + Sbjct: 611 WAENAEPGQRINNGVVCPSTGSVMSSVSCSG-NVQVPVMGINTIAGSEQAPVTSTTTASL 669 Query: 1844 PSMWMSI----------LKMEHQKSSGDNNSAVQMPNSNSTGVAPSTS---GVLPFSSML 1984 P + I LKM Q+ + + ST PS++ G +P + + Sbjct: 670 PDLLKDITVNPTMLINILKMGQQQRLALDGQQKLADPAKSTSHPPSSNTVLGAIPEVNAV 729 Query: 1985 GQKPAGIV---------PCQAVSAEEPGKVRMKPRDPRRILHNNTPHKGSTAVSDLPKTN 2137 P+GI+ P Q + +E GK+RMKPRDPRR+LHNN + + S+ KT Sbjct: 730 SSLPSGILPRSAGKAQGPSQIATTDESGKIRMKPRDPRRVLHNNALQRAGSLGSEQFKTT 789 Query: 2138 ASSLSVIMGSLSAKEQEDQMEKVVSSGTVKPPDITMQFTNNLRNIADIMSVSQASMPSTI 2317 + S G+ + + Q E + V PPDI+ FT +L+NIADI+SVSQ Sbjct: 790 TLT-STTQGTKDNQNLQKQ-EGLAELKPVVPPDISSPFTKSLKNIADIVSVSQTCTTPPF 847 Query: 2318 LPLSVSSEQQAGTDTKIVVNESVNFRSGSNLTSEAATSIPPRPLNANAWSDVEHLFDGFD 2497 + +V+S+ ++ ++ + + + L+ N W DVEHLF+G+D Sbjct: 848 VSQNVASQPVQIKSDRVDGKTGISNSDQKMGPASSPEVVAASSLSQNTWEDVEHLFEGYD 907 Query: 2498 DQQKAAIQRERARRLEEQNKMFAAGKXXXXXXXXXXXXNSAKFVEVDPLHDEMLRKKEEQ 2677 DQQKAAIQRERARR+EEQ K+FAA K NSAKFVEVDP+HDE+LRKKEEQ Sbjct: 908 DQQKAAIQRERARRIEEQKKLFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQ 967 Query: 2678 DREKPYRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKYYATEMAKLLDPKG 2857 DREKPYRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNK YATEMAK+LDPKG Sbjct: 968 DREKPYRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKG 1027 Query: 2858 ELFSGRVISRGDDGEPFDSDDRVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVE 3037 LF+GRV+SRGDDG+ D D+RVPKSKDLEGVLGMES VVIIDDS+RVWPHNKLNLIVVE Sbjct: 1028 VLFAGRVVSRGDDGDLLDGDERVPKSKDLEGVLGMESGVVIIDDSLRVWPHNKLNLIVVE 1087 Query: 3038 RYIYFPCSRRQFGLPGPSLLEIDHDERPEEGTLASSLAVIERIHEIFFGHDSLDEADVRN 3217 RYIYFPCSRRQFGLPGPSLLEIDHDERPE+GTLA SLAVIERIH+ FF H SLDEADVRN Sbjct: 1088 
RYIYFPCSRRQFGLPGPSLLEIDHDERPEDGTLACSLAVIERIHQNFFTHHSLDEADVRN 1147 Query: 3218 ILACEQQKILAGCRIVFSRVFPVGEANPHMHPLWQTAEQFGAVCTNQIDEQVTHVVANSL 3397 ILA EQ+KILAGCRIVFSRVFPVGE NPH+HPLWQ+AEQFGAVCTNQIDEQVTHVVANSL Sbjct: 1148 ILASEQRKILAGCRIVFSRVFPVGEVNPHLHPLWQSAEQFGAVCTNQIDEQVTHVVANSL 1207 Query: 3398 GTDKVNWALSRGRFVVHPGWVEASALLYRRANEHDFAIK 3514 GTDKVNWALS GRFVVHPGWVEASALLYRRANE DFAIK Sbjct: 1208 GTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1246 >gb|EXB81217.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Morus notabilis] Length = 1301 Score = 891 bits (2302), Expect = 0.0 Identities = 527/1063 (49%), Positives = 653/1063 (61%), Gaps = 50/1063 (4%) Frame = +2 Query: 419 EELVNLNAADAQKSFDALCSRLETTAIKLQKIVLEGPFA--ERDTLVQLFIAAIQTLHKV 592 E L ++N +A+KSF+ +CSRL+ T L+ ++ E F+ +D ++Q+ I AIQ ++ V Sbjct: 209 ETLGSVNVVNAEKSFEEVCSRLQRTLESLRGVLSEKEFSFPTKDVVIQMSITAIQVVNSV 268 Query: 593 FSSMTLRLKQQNGAILLRLLAQVTSLRPPLFSSLQLKEMEAIRLCXXXXXXXXXXXXATG 772 F SM++ K+Q L RL V + PLFS Q KE+E + + Sbjct: 269 FCSMSVNQKEQKKETLSRLFCSVKNCGTPLFSPEQTKEIELMISSLNPLNVLPSSGASDK 328 Query: 773 RKEVRQGFNTNDLHVLLENTDSKAAYLKSRG-KEPGS--AGSVDQSGHTFPIQCSAPGLV 943 KE + +++ L N +++ A ++ K P A V + T P + PG + Sbjct: 329 EKETQIIERLHEMDSNLTNANAENASIERTSVKLPQDCVASVVHSNPITLP-ELLRPGTL 387 Query: 944 NMRLKGLVLPLLDLHKDHDADSLPSPTRDLSATFPFDKGLILEQGLLKPEWPVLRQTLPR 1123 + +GL+LPLLDLHKDHDADSLPSPTR+ + FP K L + G++KP + Sbjct: 388 AFKGRGLLLPLLDLHKDHDADSLPSPTREAPSCFPVYKPLGVADGIIKPVSTTAKVAPGA 447 Query: 1124 ENPVLHPYETDAVKAVSSYQQKFGGGSFFVNDELPSPTPSEEGDNNGDXXXXXXXXXXXX 1303 E LH YETDA+KAVS+YQQKFG GSF ++D LPSPTPSEE D D Sbjct: 448 EESRLHRYETDALKAVSTYQQKFGRGSFLMSDRLPSPTPSEECDEEDDINQEVSSSLTSG 507 Query: 1304 HNVNRALNSLISGQPXXXXXXXXXXXXXXXESSNVRTVDTENNSRSVLKTSYAKSRDPRL 1483 + A+ L + N V + +NS +K S A+SRDPRL Sbjct: 508 NLRTPAIPILRPSVVTSSVPVSSPTMQGPIAAKNAAPVGSGSNS--TMKAS-ARSRDPRL 564 Query: 1484 RLANSDAGPRNLSPSLPLIGSDESKSKFAGVMSSRKHKVVQEQVLDGPALKRQKNEXXXX 1663 R ANSDAG +L+ + K + SSRK ++V+E LDGPALKRQ++ Sbjct: 565 
RFANSDAGALDLNQRPLTAVHNGPKVEPGDPTSSRKQRIVEEPNLDGPALKRQRHAFVSA 624 Query: 1664 XXXXXXXXXIS-------TSQVAIPSPNLPVSS----------LIKSPL-----SFQTEI 1777 + T+ I + N V + L+ P+ + E Sbjct: 625 KIDVKTASGVGGWLEDNGTTGPQIMNKNQLVENAEADPRKSIHLVNGPIMNNGPNIGKEQ 684 Query: 1778 MPVKXXXXXXXXXXXXRDIVGNPSMWMSILKMEHQKSSGDNNSAVQMPNSNSTGVAPSTS 1957 +PV +DI NP+++M IL Q+ ++ + +S +T P T+ Sbjct: 685 VPVTGTSTPDALPAILKDIAVNPTIFMDILNKLGQQQLLAADAQQKSDSSKNTTHPPGTN 744 Query: 1958 GVL---PFSSMLGQKPAGIVPCQAVSA------------EEPGKVRMKPRDPRRILHNNT 2092 +L P ++ K +GI+ AVS +E GK+RMKPRDPRR+LH N Sbjct: 745 SILGAAPLVNVAPSKASGILQTPAVSLPTTSQVATASMQDELGKIRMKPRDPRRVLHGNM 804 Query: 2093 PHKGSTAVSDLPKTNASSLSVIMGS---LSAKEQEDQMEKV-VSSGTVKPPDITMQFTNN 2260 K + + K SS+S G+ L+ QE Q +K V S V PDI QFT N Sbjct: 805 LQKSWSLGHEQFKPIVSSVSCTPGNKDNLNGPVQEGQADKKQVPSQLVVQPDIARQFTKN 864 Query: 2261 LRNIADIMSVSQASMPSTILPLSVSSE----QQAGTDTKIVVNESVNFRSGSNLTSEAAT 2428 LRNIAD+MSVSQAS + ++SS+ + D K VV S + SG+N T E Sbjct: 865 LRNIADLMSVSQASTSPATVSQNLSSQPLPVKPDRGDVKAVVPNSEDQHSGTNSTPETTL 924 Query: 2429 SIPPRPLNANAWSDVEHLFDGFDDQQKAAIQRERARRLEEQNKMFAAGKXXXXXXXXXXX 2608 ++P R NAW DVEHLF+G+DD+QKAAIQRERARRLEEQ KMF A K Sbjct: 925 AVPSR--TPNAWGDVEHLFEGYDDEQKAAIQRERARRLEEQKKMFDAHKLCLVLDLDHTL 982 Query: 2609 XNSAKFVEVDPLHDEMLRKKEEQDREKPYRHLFRFPHMGMWTKLRPGIWNFLEKASKLYE 2788 NSAKFVEVD +HDE+LRKKEEQDREKP RHLFRFPHMGMWTKLRPG+WNFLEKASKLYE Sbjct: 983 LNSAKFVEVDSVHDEILRKKEEQDREKPQRHLFRFPHMGMWTKLRPGVWNFLEKASKLYE 1042 Query: 2789 LHLYTMGNKYYATEMAKLLDPKGELFSGRVISRGDDGEPFDSDDRVPKSKDLEGVLGMES 2968 LHLYTMGNK YATEMAK+LDP G LFSGRVISRGDDG+PFD D+RVPKSKDLEGVLGMES Sbjct: 1043 LHLYTMGNKLYATEMAKVLDPMGTLFSGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES 1102 Query: 2969 AVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEEGTLASSL 3148 +VVIIDDSVRVWPHNKLNLIVVERY YFPCSRRQFGLPGPSLLEIDHDERPE+GTLASSL Sbjct: 1103 SVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEQGTLASSL 1162 Query: 3149 AVIERIHEIFFGHDSLDEADVRNILACEQQKILAGCRIVFSRVFPVGEANPHMHPLWQTA 3328 AVIE+IH+ FF H SLDE DVRNILA EQ+KILAGCRIVFSRVFPV 
E NPH+HPLWQTA Sbjct: 1163 AVIEKIHQNFFSHHSLDEVDVRNILASEQRKILAGCRIVFSRVFPVSEVNPHLHPLWQTA 1222 Query: 3329 EQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSRGRFVVHPGW 3457 EQFGAVCT QID+QVTHVVANS GTDKVNWAL+ G+F VHPGW Sbjct: 1223 EQFGAVCTTQIDDQVTHVVANSPGTDKVNWALANGKFAVHPGW 1265 >ref|XP_003530482.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Glycine max] Length = 1261 Score = 875 bits (2260), Expect = 0.0 Identities = 553/1225 (45%), Positives = 704/1225 (57%), Gaps = 58/1225 (4%) Frame = +2 Query: 14 GSEARVWMMRDLY-KYQIPSKPSYLGLYNLAWAQAVNNKPLGDVLVMMEGGSNDNGNPDT 190 GS+ARVW + DLY KY + GLYNLAWAQAV NKPL D+ VM E S+ N N + Sbjct: 60 GSDARVWAVHDLYSKYPTICRGYASGLYNLAWAQAVQNKPLNDIFVM-EVDSDANANSNR 118 Query: 191 DSSSAKTSDGDENXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNSNKETEC 370 +S S + + N +S K + Sbjct: 119 NS-SHRLASVAVNPKDVVVVDVDKEEGELEEGEIDADAEPEGEAESVVVAVSDSEKLDDV 177 Query: 371 QXXXXXXXXXXXXXXMEELVNLNAADAQKSFDALCSRLETTAIKLQKIVLEGPFAERDTL 550 + +E + A+ +SF CS+L+ T L +++ +E+D L Sbjct: 178 KMDVSDSEQLGARGVLE---GVTVANVVESFAQTCSKLQNT---LPEVLSRPAGSEKDDL 231 Query: 551 VQLFIAAIQTLHKVFSSMTLRLKQQNGAILLRLLAQVTSLRPP-LFSSLQLKEMEAIRLC 727 V+L A + ++ VF SM K+QN +LRLL+ V + LFS +KE++ + Sbjct: 232 VRLSFNATEVVYSVFCSMDSSEKEQNKDSILRLLSFVKDQQQAQLFSPEHVKEIQGMMTA 291 Query: 728 XXXXXXXXXXXXATGRKEVRQGFNTNDLHVLLENTDSKAAYLKSRGKEPGSAGSVDQSGH 907 KE++ T ++ EN+ + + + +E + + + + Sbjct: 292 IDSVGALVNSEAIGKEKELQ----TTEIKTQ-ENSAVEVQIHEIKTQENQAVEAAELISY 346 Query: 908 TFPI--------QCSAPGLVNMRLKGLVLPLLDLHKDHDADSLPSPTRDLSATFPFDKGL 1063 + P+ Q G +++ +G++LPLLDLHKDHDADSLPSPTR+ + FP +K L Sbjct: 347 SKPLHRDITGTSQALKFGQNSIKGRGVLLPLLDLHKDHDADSLPSPTREAPSCFPVNKLL 406 Query: 1064 ILEQGLLKPEWPVLRQTLPRENPVLHPYETDAVKAVSSYQQKFGGGSFFVNDELPSPTPS 1243 + + +++ + L E H YETDA+KAVS+YQQKFG S F ND+ PSPTPS Sbjct: 407 SVGESMVRSGSASAKMELDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPS 466 Query: 1244 EEGDNNGDXXXXXXXXXXXXHNVNRALNSLISGQPXXXXXXXXXXXXXXXESSNVRTVDT 1423 + ++ + +L+ P S VD Sbjct: 467 
GDCEDEVVDTNEEVSSASTGDFLTSTKPTLLDQPPVSATSMDRSSMHGFISSR----VDA 522 Query: 1424 ENNSRSVLKTSYAKSRDPRLRLANSDAGPRNLSPSLPLIGSDESKSKFAGVMSSRKHKVV 1603 +K+S AK+RDPRLR NSDA + +L + ++ SK +++G SRK K Sbjct: 523 TGPGSFPVKSS-AKNRDPRLRFINSDASAVD---NLSTLINNMSKVEYSGTTISRKQKAA 578 Query: 1604 QEQVLDGPALKRQKNEXXXXXXXXXXXXXISTSQV-----------------------AI 1714 +E LD KR K+ S + A Sbjct: 579 EEPSLDVTVSKRLKSSLENTEHNMSEVRTGSGGWLEENTGPGAQLIERNHLMDKFGPEAK 638 Query: 1715 PSPNLPVSSLIKSP----LSFQTEIMPVKXXXXXXXXXXXXRDIVGNPSMWMSILKM-EH 1879 + N SS S S + E P+ ++ NP M ++IL++ E Sbjct: 639 KTLNTVSSSCTGSDNFNATSIRNEQAPITASNVLASLPALLKEASVNPIMLVNILRLAEA 698 Query: 1880 QKSSGDNNSAVQMPNSNSTGVAPSTSGVLPFSSMLG----QKPAGIVPCQAVSA------ 2029 QK S D+ +A+ + + S+ A T S + Q G++P + S Sbjct: 699 QKKSADS-AAIMLLHPTSSNPAMGTDSTASIGSSMATGLLQSSVGMLPVSSQSTSTAQTL 757 Query: 2030 -EEPGKVRMKPRDPRRILH-NNTPHKGSTAVSDLPKTNASSLSVIM---GSLSAKEQEDQ 2194 ++ GK+RMKPRDPRRILH NNT K ++ K S +S +++A + E + Sbjct: 758 QDDSGKIRMKPRDPRRILHTNNTIQKSGDLGNEQFKAIVSPVSNNQRTGDNVNAPKLEGR 817 Query: 2195 ME-KVVSSGTVKPPDITMQFTNNLRNIADIMSVSQASMPSTILPLSVSSEQQAGT----D 2359 ++ K+V + + PDI QFT NL+NIADIMSVSQ S T + + SS T + Sbjct: 818 VDNKLVPTQSSAQPDIARQFTRNLKNIADIMSVSQESSTHTPVSQNFSSASVPLTSDRGE 877 Query: 2360 TKIVVNESVNFRSGSNLTSEAATSIPPRPLNANAWSDVEHLFDGFDDQQKAAIQRERARR 2539 K VV+ S N ++ E A S+ R + + W DVEHLF+G+D+QQKAAIQRERARR Sbjct: 878 QKSVVSSSQNLQADMASAHETAASVTSR--SQSTWGDVEHLFEGYDEQQKAAIQRERARR 935 Query: 2540 LEEQNKMFAAGKXXXXXXXXXXXXNSAKFVEVDPLHDEMLRKKEEQDREKPYRHLFRFPH 2719 +EEQNKMFAA K NSAKFVEVDPLHDE+LRKKEEQDREKP+RHLFRFPH Sbjct: 936 IEEQNKMFAARKLCLVLDLDHTLLNSAKFVEVDPLHDEILRKKEEQDREKPHRHLFRFPH 995 Query: 2720 MGMWTKLRPGIWNFLEKASKLYELHLYTMGNKYYATEMAKLLDPKGELFSGRVISRGDDG 2899 MGMWTKLRPGIWNFLEKASKLYELHLYTMGNK YATEMAK+LDPKG LF+GRVISRGDD Sbjct: 996 MGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDT 1055 Query: 2900 EPFDSDDRVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGL 3079 + D ++RVPKSKDLEGVLGMES+VVIIDDSVRVWPHNKLNLIVVERY YFPCSRRQFGL Sbjct: 1056 
DSVDGEERVPKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 1115 Query: 3080 PGPSLLEIDHDERPEEGTLASSLAVIERIHEIFFGHDSLDEADVRNILACEQQKILAGCR 3259 PGPSLLEIDHDERPE GTLASSLAVIE+IH+IFF SL+E DVRNILA EQ+KILAGCR Sbjct: 1116 PGPSLLEIDHDERPEAGTLASSLAVIEKIHQIFFASQSLEEVDVRNILASEQRKILAGCR 1175 Query: 3260 IVFSRVFPVGEANPHMHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSRGRF 3439 IVFSRVFPVGEANPH+HPLWQTAEQFGAVCTNQIDEQVTHVVANS GTDKVNWAL+ GRF Sbjct: 1176 IVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSPGTDKVNWALNNGRF 1235 Query: 3440 VVHPGWVEASALLYRRANEHDFAIK 3514 VVHPGWVEASALLYRRANE DFAIK Sbjct: 1236 VVHPGWVEASALLYRRANEQDFAIK 1260 >ref|XP_006603006.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Glycine max] Length = 1257 Score = 874 bits (2257), Expect = 0.0 Identities = 566/1243 (45%), Positives = 708/1243 (56%), Gaps = 76/1243 (6%) Frame = +2 Query: 14 GSEARVWMMRDLY-KYQIPSKPSYLGLYNLAWAQAVNNKPLGDVLVMMEGGSNDNGNPDT 190 GS+ARVW + DLY KY + GLYNLAWAQAV NKPL D+ V ME S+ N N ++ Sbjct: 60 GSDARVWAVHDLYSKYPTICRGYASGLYNLAWAQAVQNKPLNDIFV-MEVDSDANANSNS 118 Query: 191 DSSSAKTSDG-DENXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNSNKETE 367 ++S+ S + +S K + Sbjct: 119 NNSNRLASVAVNPKDVVVVDVDKEEGELEEGEIDADAEPEGEAESVVAVPVVSDSEKLDD 178 Query: 368 CQXXXXXXXXXXXXXXMEELVNLNAADAQKSFDALCSRLETTAIKLQKIVLEGPFAERDT 547 + +E + N A+ SF CS+L+ L +++ +ERD Sbjct: 179 VKRDVSNSEQLGVRGVLEGVTVANVAE---SFAQTCSKLQNA---LPEVLSRPADSERDD 232 Query: 548 LVQLFIAAIQTLHKVFSSMTLRLKQQNGAILLRLLAQV-TSLRPPLFSSLQLKEMEAIRL 724 LV+L A + ++ VF SM K+QN +LRLL+ V + LFS +KE++ + Sbjct: 233 LVRLSFNATEVVYSVFCSMDSLKKEQNKDSILRLLSFVKDQQQAQLFSPEHIKEIQGMMT 292 Query: 725 CXXXXXXXXXXXXATGRKEVRQGFNTNDLHVLLENTDSKAAYLKSRGKEPGSAGSVDQSG 904 KE++ T+++ EN +AA L S K S + + Sbjct: 293 AIDYFGALVNSEAIGKEKELQTTVQTHEIKT-QENQAVEAAELISYNKPLHS--DIIGAS 349 Query: 905 HTFPIQCSAPGLVNMRLKGLVLPLLDLHKDHDADSLPSPTRDLSATFPFDKGLILEQGLL 1084 H G +++ +G++LPLLDLHKDHDADSLPSPTR+ + FP +K L + + ++ Sbjct: 350 HALKF-----GQNSIKGRGVLLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGEPMV 404 
Query: 1085 -------KPEWPVLRQTLPRENPVLHPYETDAVKAVSSYQQKFGGGSFFVNDELPSPTPS 1243 KPE + L E H YETDA+KAVS+YQQKFG S F ND+ PSPTPS Sbjct: 405 SSGSAAAKPESG--KMELDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPS 462 Query: 1244 EEGDNNGDXXXXXXXXXXXXHNVNRALNSLISGQPXXXXXXXXXXXXXXXESSNVRT--- 1414 GD + + N ++S +G +S R+ Sbjct: 463 --GDCEDEIV-----------DTNEEVSSASTGDFLTSTKPTLLDLPPVSATSTDRSSLH 509 Query: 1415 ------VDTENNSRSVLKTSYAKSRDPRLRLANSDA----GPRNLSPSLPLIGSDESKSK 1564 VD +K+S AK+RDPRLR NSDA P L ++P K + Sbjct: 510 GFISSRVDAAGPGSLPVKSS-AKNRDPRLRFVNSDASAVDNPSTLIHNMP-------KVE 561 Query: 1565 FAGVMSSRKHKVVQEQVLDGPALKRQKNEXXXXXXXXXXXXXISTSQVAI---------- 1714 +AG SRK K +E LD KRQK+ +S + I Sbjct: 562 YAGTTISRKQKAAEEPSLDVTVSKRQKS------PLENTEHNMSEVRTGIGGWLEEHTGP 615 Query: 1715 ---------------PSPNLPVSSLIKS--------PLSFQTEIMPVKXXXXXXXXXXXX 1825 P P ++++ S S + E P+ Sbjct: 616 GAQFIERNHLMDKFGPEPQKTLNTVSSSCTGSDNFNATSIRNEQAPITSSNVLASLPALL 675 Query: 1826 RDIVGNPSMWMSILKM-EHQKSSGDN--NSAVQMPNSNSTGVAPSTSGV-LPFSSMLGQK 1993 + NP+M +++L++ E QK S D+ N + +SNS ST+ + ++ L Q Sbjct: 676 KGAAVNPTMLVNLLRIAEAQKKSADSATNMLLHPTSSNSAMGTDSTASIGSSMATGLLQS 735 Query: 1994 PAGIVPCQAVSA-------EEPGKVRMKPRDPRRILH-NNTPHKGSTAVSDLPKTNASSL 2149 G++P + S ++ GK+RMKPRDPRRILH NNT K ++ K S + Sbjct: 736 SVGMLPVSSQSTSMTQTLQDDSGKIRMKPRDPRRILHTNNTIQKSGNLGNEQFKAIVSPV 795 Query: 2150 SVIMG---SLSAKEQEDQME-KVVSSGTVKPPDITMQFTNNLRNIADIMSVSQASMPSTI 2317 S G +++A++ E +++ K+V + PDI QF NL+NIADIMSVSQ S T Sbjct: 796 SNNQGTGDNVNAQKLEGRVDSKLVPTQPSAQPDIARQFARNLKNIADIMSVSQESSTHTP 855 Query: 2318 LPLSVSSEQQAGT----DTKIVVNESVNFRSGSNLTSEAATSIPPRPLNANAWSDVEHLF 2485 + SS T + K VV+ S N +G E A S R + N W DVEHLF Sbjct: 856 VAQIFSSASVPLTSDRGEQKSVVSNSQNLEAGMVSAHETAASGTCR--SQNTWGDVEHLF 913 Query: 2486 DGFDDQQKAAIQRERARRLEEQNKMFAAGKXXXXXXXXXXXXNSAKFVEVDPLHDEMLRK 2665 +G+D+QQKAAIQRERARR+EEQNKMFAA K NSAKFVEVDP+HDE+LRK Sbjct: 914 EGYDEQQKAAIQRERARRIEEQNKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRK 973 Query: 2666 KEEQDREKPYRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKYYATEMAKLL 2845 
KEEQDREKP+RHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNK YATEMAK+L Sbjct: 974 KEEQDREKPHRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVL 1033 Query: 2846 DPKGELFSGRVISRGDDGEPFDSDDRVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNL 3025 DPKG LF+GRVISRGDD + D ++R PKSKDLEGVLGMES+VVIIDDSVRVWPHNKLNL Sbjct: 1034 DPKGLLFAGRVISRGDDTDSVDGEERAPKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNL 1093 Query: 3026 IVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEEGTLASSLAVIERIHEIFFGHDSLDEA 3205 IVVERY YFPCSRRQFGLPGPSLLEIDHDERPE GTLASSLAVIE+IH+IFF SL+E Sbjct: 1094 IVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEAGTLASSLAVIEKIHQIFFASRSLEEV 1153 Query: 3206 DVRNILACEQQKILAGCRIVFSRVFPVGEANPHMHPLWQTAEQFGAVCTNQIDEQVTHVV 3385 DVRNILA EQ+KILAGCRIVFSRVFPVGEANPH+HPLWQTAEQFGA CTNQIDEQVTHVV Sbjct: 1154 DVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAFCTNQIDEQVTHVV 1213 Query: 3386 ANSLGTDKVNWALSRGRFVVHPGWVEASALLYRRANEHDFAIK 3514 ANS GTDKVNWAL+ GRFVVHPGWVEASALLYRRANE DFAIK Sbjct: 1214 ANSPGTDKVNWALNNGRFVVHPGWVEASALLYRRANEQDFAIK 1256 >ref|XP_002297869.2| CTD phosphatase-like protein 3 [Populus trichocarpa] gi|550347145|gb|EEE82674.2| CTD phosphatase-like protein 3 [Populus trichocarpa] Length = 1190 Score = 868 bits (2244), Expect = 0.0 Identities = 512/1074 (47%), Positives = 641/1074 (59%), Gaps = 42/1074 (3%) Frame = +2 Query: 419 EELVNLNAADAQKSFDALCSRLETTAIKLQKIV--LEGPFAERDTLVQLFIAAIQTLHKV 592 ++L +++ + +KSF+A+C +L L+++V + F +D LVQL AI+ ++ V Sbjct: 153 KDLESVSVIETEKSFEAVCLKLHKVLESLKELVGGNDNSFPSKDGLVQLLFMAIRVVNSV 212 Query: 593 FSSMTLRLKQQNGAILLRLLAQVTSLRPPLFSSLQLKEMEAIRLCXXXXXXXXXXXXATG 772 F SM +LK+QN + R + + S PP FS Q KE+ Sbjct: 213 FCSMNKKLKEQNKGVFSRFFSLLNSHYPPFFSPGQNKEV--------------------- 251 Query: 773 RKEVRQGFNTNDLHVLLENTDSKAAYLKSRGKEPGSAGSVDQSGHTFPIQCSAPGLVNMR 952 N N L + + K P + V + PG+ + + Sbjct: 252 -------LNENHNDSLAKTAGYDLTTMSE--KLPAAETFVQNKPNKSIEAPKPPGVPSFK 302 Query: 953 LKGLVLPLLDLHKDHDADSLPSPTRDLSATFPFDKGLILEQGLLKPEWPVLRQTLPRENP 1132 +G++LPLLDL K HD DSLPSPT++ + FP + L + G++ PV + T E P Sbjct: 303 
SRGVLLPLLDLKKYHDEDSLPSPTQE-TTPFPVQRLLAIGDGMVSSGLPVPKVTPVAEEP 361 Query: 1133 VLHPYETDAVKAVSSYQQKFGGGSFFVNDELPSPTPSEEGDNNGDXXXXXXXXXXXXHNV 1312 +HPYETDA+KAVSSYQQKF SFF N ELPSPTPSEE NGD Sbjct: 362 RMHPYETDALKAVSSYQQKFNRNSFFTN-ELPSPTPSEES-GNGDGDTAGEVSSSSTVVN 419 Query: 1313 NRALNSLISGQPXXXXXXXXXXXXXXX-ESSNVRTVDTENNSR-------SVLKTSYAKS 1468 R +N +S Q +SSN+R V NS S +K S AKS Sbjct: 420 YRTVNPPVSDQKNAPPSPPPLPPPPPHPDSSNIRGVVPTRNSAPVSSGPSSTIKAS-AKS 478 Query: 1469 RDPRLRLANSDAGPRNLSP-SLPLIGSDESKSKFAGVMSSRKHKVVQEQVLDGPALKRQK 1645 RDPRLR N DA + + +LP++ + ++ S+KHK+ +E VLD P+LKRQ+ Sbjct: 479 RDPRLRYVNIDACALDHNQRALPMVNNLPRVEPAGAIVGSKKHKI-EEDVLDDPSLKRQR 537 Query: 1646 NEXXXXXXXXXXXXXIST------SQVAIP----------SPNLPVSSLIKSPLSFQTEI 1777 N T + +A P + N+ S +SP + I Sbjct: 538 NSFDNYGAVRDIESMTGTGGWLEDTDMAEPQTVNKNQWAENSNVNGSGNAQSPFMGISNI 597 Query: 1778 MPVKXXXXXXXXXXXX----RDIVGNPSMWMSILKMEHQKSSGDNNSAVQMPNSNSTGVA 1945 + +DI NP+M ++ILKM Q+ + + ST Sbjct: 598 TGSEQAQVTSTATTSLPDLLKDIAVNPTMLINILKMGQQQRLALDGQQTLSDPAKSTSHP 657 Query: 1946 PSTS---GVLPFSSMLGQKPAGI--------VPCQAVSAEEPGKVRMKPRDPRRILHNNT 2092 P ++ G +P ++ +P+GI VP Q +++E GK+RMKPRDPRR LHNN+ Sbjct: 658 PISNTVLGAIPTVNVASSQPSGIFPRPAGTPVPSQIATSDESGKIRMKPRDPRRFLHNNS 717 Query: 2093 PHKGSTAVSDLPKTNASSLSVIMGSLSAKEQEDQMEKVVSSGTVKPPDITMQFTNNLRNI 2272 + + S+ KT ++L+ + + E + PPDI+ FT +L NI Sbjct: 718 LQRAGSMGSEQFKT--TTLTPTTQGTKDDQNVQKQEGLAELKPTVPPDISFPFTKSLENI 775 Query: 2273 ADIMSVSQASMPSTILPLSVSSEQQAGTDTKIVVNESVNFRSGSNLTSEAATSIPPRPLN 2452 ADI+SVSQAS + +V+S+ ++ ++ + + + + Sbjct: 776 ADILSVSQASTTPPFISQNVASQPMQTKSERVDGKTGISISDQKTGPASSPEVVAASSHS 835 Query: 2453 ANAWSDVEHLFDGFDDQQKAAIQRERARRLEEQNKMFAAGKXXXXXXXXXXXXNSAKFVE 2632 N W DVEHLF+G+DDQQKAAIQRERARRLEEQ KMFAA K NSAK + Sbjct: 836 QNTWKDVEHLFEGYDDQQKAAIQRERARRLEEQKKMFAARKLCLVLDLDHTLLNSAKAIL 895 Query: 2633 VDPLHDEMLRKKEEQDREKPYRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGN 2812 LHDE+LRKKEEQDREKPYRH+FR PHMGMWTKLRPGIWNFLEKASKL+ELHLYTMGN Sbjct: 896 
SSSLHDEILRKKEEQDREKPYRHIFRIPHMGMWTKLRPGIWNFLEKASKLFELHLYTMGN 955 Query: 2813 KYYATEMAKLLDPKGELFSGRVISRGDDGEPFDSDDRVPKSKDLEGVLGMESAVVIIDDS 2992 K YATEMAK+LDPKG LF+GRVISRGDDG+PFD D+RVPKSKDLEGVLGMES VVIIDDS Sbjct: 956 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESGVVIIDDS 1015 Query: 2993 VRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEEGTLASSLAVIERIHE 3172 VRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPE+GTLA S AVIE+IH+ Sbjct: 1016 VRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEDGTLACSFAVIEKIHQ 1075 Query: 3173 IFFGHDSLDEADVRNILACEQQKILAGCRIVFSRVFPVGEANPHMHPLWQTAEQFGAVCT 3352 FF H SLDEADVRNILA EQ+KIL GCRI+FSRVFPVGE NPH+HPLWQ AEQFGAVCT Sbjct: 1076 NFFTHRSLDEADVRNILASEQRKILGGCRILFSRVFPVGEVNPHLHPLWQMAEQFGAVCT 1135 Query: 3353 NQIDEQVTHVVANSLGTDKVNWALSRGRFVVHPGWVEASALLYRRANEHDFAIK 3514 NQIDEQVTHVVANSLGTDKVNWALS GR VVHPGWVEASALLYRRANE DF+IK Sbjct: 1136 NQIDEQVTHVVANSLGTDKVNWALSTGRIVVHPGWVEASALLYRRANEQDFSIK 1189 >ref|XP_006438858.1| hypothetical protein CICLE_v10030535mg [Citrus clementina] gi|557541054|gb|ESR52098.1| hypothetical protein CICLE_v10030535mg [Citrus clementina] Length = 1208 Score = 868 bits (2243), Expect = 0.0 Identities = 552/1195 (46%), Positives = 683/1195 (57%), Gaps = 61/1195 (5%) Frame = +2 Query: 11 GGSEARVWMMRDLY-KYQIPSKPSYLGLYNLAWAQAVNNKPLGDVLVMMEGGSNDNGNPD 187 G + ARVW MRDLY KY + GL+NLAWAQAV NKPL ++ VM E +D Sbjct: 46 GEAAARVWTMRDLYNKYPAICRGYGPGLHNLAWAQAVQNKPLNEIFVM-EAEQDDVSKRS 104 Query: 188 TDSSSAKTSDGDENXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNSNKETE 367 + +SS + + SN++ Sbjct: 105 SPASSVASVNSGAAAGKDDKKVVEKVVIDDSGDEIEKEEGELEEGEIELDLESESNEKVS 164 Query: 368 CQXXXXXXXXXXXXXXMEELVNLNAADAQKSFDALCSRLETTAIKLQKIVLEGPFAERDT 547 Q E L ++ D SF+ +CS+LE T L+++V E +D Sbjct: 165 EQVKEEMKLINVESIR-EALESVLRGDI--SFEGVCSKLEFTLESLRELVNENNVPTKDA 221 Query: 548 LVQLFIAAIQTLHKVFSSMTLRLKQQNGAILLRLLAQVTSLRPPLFSSLQLKEMEAIRLC 727 L+QL +A+Q++H VF SM LK+QN IL RLL+ + S PPLFSS Q+KEMEA+ Sbjct: 222 LIQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSVIKSHEPPLFSSNQIKEMEAM--- 278 Query: 728 
XXXXXXXXXXXXATGRKE---VRQGFNTNDLHVLLENTDSKAAYLKSRGKEPGSAGSVDQ 898 A +++ G N D +++ EN + L + K P S+ Q Sbjct: 279 -----LSSLVTRANDKEKDMLAMHGVNGKDSNIVTENAVND---LNFKEKVPLPVDSLMQ 330 Query: 899 SGHTFPIQCSAPGLVNMRLKGLVLPLLDLHKDHDADSLPSPTRDLSATFPFDKGLILEQG 1078 + P++ S PG R +G++LPLLD HK HD DSLPSPTR+ + + P + L++ G Sbjct: 331 NK---PLEASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDG 387 Query: 1079 LLKPEWPVLRQTLPRENPVLHPYETDAVKAVSSYQQKFGGGSFFVNDELPSPTPSEE--- 1249 ++K + + E YETDA++A SSYQQKFG SFF+N ELPSPTPSEE Sbjct: 388 VVKSWAAAAKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGD 447 Query: 1250 --GDNNGDXXXXXXXXXXXXHNVNRALNSLISGQPXXXXXXXXXXXXXXXESSNVRTVDT 1423 GD G+ N+ +S QP + S+V+ + T Sbjct: 448 GDGDTGGEISSATAVDQPKPVNMPTLGQQPVSSQPMDISQPM--------DISSVQALTT 499 Query: 1424 ENNS-------RSVLKTSYA-----KSRDPRLRLANSDAGPRNLSPSLPLIGSDESKSKF 1567 NNS V+K + KSRDPRLR A+S+A N P+ P++ + Sbjct: 500 ANNSAPASSGYNPVVKPNPVVKAPIKSRDPRLRFASSNALNLNHQPA-PILHNAPKVEPV 558 Query: 1568 AGVMSSRKHKVVQEQVLDGPALKRQKNEXXXXXXXXXXXXXISTS---------QVAIPS 1720 VMSSRK K V+E VLDGPALKRQ+N + + I + Sbjct: 559 GRVMSSRKQKTVEEPVLDGPALKRQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMN 618 Query: 1721 PNLPVSSL----------IKSPLSFQT--------EIMPVKXXXXXXXXXXXXRDIVGNP 1846 NL V S SP++ T E P +DI NP Sbjct: 619 RNLLVDSAESNSRKLDNGATSPITSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNP 678 Query: 1847 SMWMSILKM-EHQKSSGDNNSAVQMPNSNSTGVA-PSTSGVLPFSSMLGQKPAGIVPCQA 2020 +M ++ILKM + QK + D A Q N +S P +P S+ P+GI+ + Sbjct: 679 TMLLNILKMGQQQKLAAD---AQQKSNDSSMNTMHPPIPSSIPPVSVTCSIPSGIL---S 732 Query: 2021 VSAEEPGKVRMKPRDPRRILHNNTPHKGSTAVSDLPKTNASSLSVIMGSLSAKEQEDQM- 2197 +E GKVRMKPRDPRR+LH N + + + KT+ S GS + Q+ Sbjct: 733 KPMDELGKVRMKPRDPRRVLHGNALQRSGSLGPEF-KTDGPSAPCTQGSKENLNFQKQLG 791 Query: 2198 ---EKVVSSGTVKPPDITMQFTNNLRNIADIMSVSQ-------ASMPSTILPLSVSSEQQ 2347 K V S +V PDIT QFT NL++IAD MSVSQ S S I P + S Sbjct: 792 APEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKS--- 848 Query: 2348 AGTDTKIVVNESVNFRSGSNLTSEAATSIPPRPLNANAWSDVEHLFDGFDDQQKAAIQRE 2527 G D K VV + 
++G+ EA P +AW DVEHLF+G+DDQQKAAIQ+E Sbjct: 849 -GADMKAVVTNHDDKQTGTGSGPEAG---PVGAHPQSAWGDVEHLFEGYDDQQKAAIQKE 904 Query: 2528 RARRLEEQNKMFAAGKXXXXXXXXXXXXNSAKFVEVDPLHDEMLRKKEEQDREKPYRHLF 2707 R RRLEEQ KMF+A K NSAKF EVDP+HDE+LRKKEEQDREKP+RHLF Sbjct: 905 RTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF 964 Query: 2708 RFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKYYATEMAKLLDPKGELFSGRVISR 2887 RFPHMGMWTKLRPGIW FLE+ASKL+E+HLYTMGNK YATEMAK+LDPKG LF+GRVISR Sbjct: 965 RFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 1024 Query: 2888 GDDGEPFDSDDRVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRR 3067 GDDG+PFD D+RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY YFPCSRR Sbjct: 1025 GDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRR 1084 Query: 3068 QFGLPGPSLLEIDHDERPEEGTLASSLAVIERIHEIFFGHDSLDEADVRNILACEQQKIL 3247 QFGL GPSLLEIDHDER E+GTLASSL VIER+H+IFF H SLD+ DVRNILA EQ+KIL Sbjct: 1085 QFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKIL 1144 Query: 3248 AGCRIVFSRVFPVGEANPHMHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKV 3412 AGCRIVFSRVFPVGEANPH+HPLWQTAEQFGAVCT ID+QVTHVVANSLGTDKV Sbjct: 1145 AGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKV 1199 >ref|XP_007139315.1| hypothetical protein PHAVU_008G019000g [Phaseolus vulgaris] gi|561012448|gb|ESW11309.1| hypothetical protein PHAVU_008G019000g [Phaseolus vulgaris] Length = 1272 Score = 868 bits (2242), Expect = 0.0 Identities = 548/1239 (44%), Positives = 697/1239 (56%), Gaps = 72/1239 (5%) Frame = +2 Query: 14 GSEARVWMMRDLY-KYQIPSKPSYLGLYNLAWAQAVNNKPLGDVLVMMEGGSNDNGNPDT 190 GS+ARVW +RD+Y KY + GLYNLAWAQAV NKPL D+ VM + + Sbjct: 59 GSDARVWSVRDIYTKYPTICRGYASGLYNLAWAQAVQNKPLNDIFVMELDSEANANSNSN 118 Query: 191 DSSSAKTSDGDENXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNSNKETEC 370 +S+ + + + ++E Sbjct: 119 NSNRPSSVSVNPKEVMVVDVDREEGELEEGEIDADADPEAEAESVVAASVVSETVSDSEQ 178 Query: 371 QXXXXXXXXXXXXXXMEELVNLNAADAQKSFDALCSRLETTAIKLQKIVLEGPFAERDTL 550 + L + A+ +SF SRL L ++ +E+D L Sbjct: 179 
FGVKKGVSDSEQLGVRDVLEGVTVANVAESFAQTSSRLLNA---LPQVFSRPADSEKDDL 235 Query: 551 VQLFIAAIQTLHKVFSSMTLRLKQQNGAILLRLLAQVTSLRPP-LFSSLQLKEMEAIRLC 727 ++L AI+ ++ VF SM K+QN +LRLL+ + LFS +KE++ + Sbjct: 236 IRLSFNAIEVVYSVFRSMDSSDKEQNKNSILRLLSSAKDKKQAQLFSPEHIKEIQDMMTA 295 Query: 728 XXXXXXXXXXXXATGRKEVRQGFNTNDLHVLLENTDSKAAYLKSRG---KEPGSAGSVDQ 898 A G E T +++ ++ A +++RG +E + + + Sbjct: 296 IDSVG-------ALGSNEAIY-METELQTPEIKSQENSALEVQTRGIKIQENQAVVATEL 347 Query: 899 SGHTFPIQCSAPGLV--------NMRLKGLVLPLLDLHKDHDADSLPSPTRDLSATFPFD 1054 P+ G +++ +G++LPLLDLHKDHDADSLPSPTR+ + FP + Sbjct: 348 VSSIKPLHSDIIGASRALKFGQNSIKGRGVLLPLLDLHKDHDADSLPSPTREAPSCFPVN 407 Query: 1055 KGLILEQGLLKPEWPVLRQT-----LPRENPVLHPYETDAVKAVSSYQQKFGGGSFFVND 1219 K L + + ++K + + E H YETDA+KAVS+YQQKFG S F ND Sbjct: 408 KLLSVGEVMVKSGSAAAKMQPGKLEVDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTND 467 Query: 1220 ELPSPTPSEEGDNNGDXXXXXXXXXXXXHNVNRALNSLISGQPXXXXXXXXXXXXXXXES 1399 +LPSPTPS + D+ + +L+ P S Sbjct: 468 KLPSPTPSGDCDDMAVDTNEEVSSASTSGFLTSTKPTLLDQPPVSATSVDKSRLLGLISS 527 Query: 1400 SNVRTVDTENNSRSVLKTSYAKSRDPRLRLANSDAGPRNLSPSLPLIGSDESKSKFAGVM 1579 VD + +K+S AKSRDPR RL NS+A + + + + K ++AG Sbjct: 528 R----VDAAGSGSFPVKSS-AKSRDPRRRLINSEASAVDNQFT---VTHNMPKVEYAGST 579 Query: 1580 SSRKHKVVQEQVLDGPALKRQKNEXXXXXXXXXXXXXISTSQVAIPSPNLPVSSLIK--- 1750 SRK K V+E D KR K+ I+ S + P + LI+ Sbjct: 580 ISRKQKAVEEPSFDLTVSKRLKSSLENIEHNTSEVRTIAGSGGWLEDITGPGTQLIEKNH 639 Query: 1751 ------------------------SPLSFQTEIMPVKXXXXXXXXXXXXRDIVGNPSMWM 1858 + S + E P+ +DIV NP+M + Sbjct: 640 LIDKFAPEPKRTLNTVSSSGSVNFNATSIRNEQAPITSNNVPSSLPAIFKDIVVNPTMLL 699 Query: 1859 SILKMEHQKSSGDNNSAVQMPN------SNSTGVAPSTSGVLPFSSMLGQKPAGIVPCQA 2020 S+L + + NNSA N SNS ST+ ++ + Q G++P + Sbjct: 700 SLLMEQKRLVDAQNNSADSATNMLHPTSSNSAMGTDSTASIVSSMATGLQTSVGMLPVSS 759 Query: 2021 VSA-------EEPGKVRMKPRDPRRILH-NNTPHKGSTAVSDLPKTNASSLSVIM---GS 2167 S + GK+RMKPRDPRRILH NN+ K V++L K S +S I+ S Sbjct: 760 QSTSTAQLQDDYSGKIRMKPRDPRRILHTNNSVQKSGNIVNELHKAIVSPVSNILVTGDS 819 Query: 2168 
LSAKEQEDQME-KVVSSGTVKPPDITMQFTNNLRNIADIMSVSQAS---------MPSTI 2317 ++A++ E +M+ K+V + + PDIT QFT NL+NIADIMSVSQ S S Sbjct: 820 VNAQKLEGRMDTKLVPTQSGAAPDITRQFTRNLKNIADIMSVSQESSTHSPAAQGFSSAS 879 Query: 2318 LPLSVSSEQQAGTDTKIVVNESVNFRSGSNLTSEAATSIPPRPLNANAWSDVEHLFDGFD 2497 +PL+V +Q K V++ S N +G+ E P + + W DVEHLF+G+D Sbjct: 880 VPLNVDRGEQ-----KSVLSNSQNLHAGTGSAPEICA--PGTSRSQSTWGDVEHLFEGYD 932 Query: 2498 DQQKAAIQRERARRLEEQNKMFAAGKXXXXXXXXXXXXNSAKFVEVDPLHDEMLRKKEEQ 2677 +QQKAAIQRERARR+EEQNKMFAA K NSAKFVEVDP+H+E+LRKKEE Sbjct: 933 EQQKAAIQRERARRIEEQNKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHEEILRKKEEL 992 Query: 2678 DREKPYRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKYYATEMAKLLDPKG 2857 DREKP+RHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNK YATEMAK+LDPKG Sbjct: 993 DREKPHRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKG 1052 Query: 2858 ELFSGRVISRGDDGEPFDSDDRVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVE 3037 LF+GRVISRGDD + D ++R PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVE Sbjct: 1053 VLFAGRVISRGDDTDSVDGEERAPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVE 1112 Query: 3038 RYIYFPCSRRQFGLPGPSLLEIDHDERPEEGTLASSLAVIERIHEIFFGHDSLDEADVRN 3217 RY YFPCSRRQFGLPGPSLLEIDHDERPE GTLASSLAVIER+H+ FF SL+E DVRN Sbjct: 1113 RYTYFPCSRRQFGLPGPSLLEIDHDERPEAGTLASSLAVIERLHQNFFSSQSLEEVDVRN 1172 Query: 3218 ILACEQQKILAGCRIVFSRVFPVGEANPHMHPLWQTAEQFGAVCTNQIDEQVTHVVANSL 3397 ILA EQ+KIL+GCRIVFSRVFPVGEANPH+HPLWQTAEQFGAVCTNQID+QVTHVVANSL Sbjct: 1173 ILASEQRKILSGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDDQVTHVVANSL 1232 Query: 3398 GTDKVNWALSRGRFVVHPGWVEASALLYRRANEHDFAIK 3514 GTDKVNWALS GRFVVHPGWVEASALLYRRANE DFAIK Sbjct: 1233 GTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1271 >ref|XP_004492028.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like isoform X1 [Cicer arietinum] Length = 1247 Score = 862 bits (2227), Expect = 0.0 Identities = 550/1231 (44%), Positives = 694/1231 (56%), Gaps = 62/1231 (5%) Frame = +2 Query: 14 GSEARVWMMRDLY-KYQIPSKPSYLGLYNLAWAQAVNNKPLGDVLVMMEGGSNDNGNPDT 190 G +ARVW + DLY KY + GLYNLAWAQAV NKPL D+ V ME S+ N Sbjct: 
64 GGDARVWAVHDLYSKYPTICRGYASGLYNLAWAQAVQNKPLNDIFV-MELDSDSNA---- 118 Query: 191 DSSSAKTSDGDENXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNSNKETEC 370 +++S S+ ET Sbjct: 119 NANSNNDSNNGNGDLNMPLKEVVMVDDDEREEGELEEGEIDGDDDTGGVMVGGDGSETVS 178 Query: 371 QXXXXXXXXXXXXXXMEELVNLNAADAQKSFDALCSRLETTAIKLQKIVLEGP-FAERDT 547 + + L + A+ +SF SRL LQ +L GP +E+D Sbjct: 179 ESDIR-----------DFLEGVTVANVAESFAETISRLLRV---LQSKLLSGPAVSEKDY 224 Query: 548 LVQLFIAAIQTLHKVFSSMTLRLKQQNGAILLRLLAQVTSLRPPLFSSLQLKEMEA-IRL 724 +++L AI+ +H VF SM K+ N ++RLL + + LFS +KE++ I Sbjct: 225 VIRLLYNAIEIVHSVFCSMDNLQKEDNKDNIIRLLYFLKNEHTQLFSPEHMKEIQVMITA 284 Query: 725 CXXXXXXXXXXXXATGRKEVRQGFNTNDLHVLLENTDSKAAYLKS-RGKEPGSAGSVDQS 901 G K L+ D K ++ + E S+ + S Sbjct: 285 IDTVDALGNSVVVGNGEK--------------LDTLDIKTRQIQGLKASELISSSKLVHS 330 Query: 902 GHTFPIQCSAPGLVNMRLKGLVLPLLDLHKDHDADSLPSPTRDLSATFPFDKGLILEQGL 1081 T + G N++ +G++LPL DLHK HD DSLPSPTR+ + FP +K + G+ Sbjct: 331 NLTEASEALLSGQSNIKGRGVMLPLFDLHKVHDLDSLPSPTREAPSFFPVNKLFSVGDGM 390 Query: 1082 LKPEWP------VLRQTLPRENPVLHPYETDAVKAVSSYQQKFGGGSFFVNDELPSPTPS 1243 +P P ++ L EN H YETDA+KAVS+YQQKFG S+F +D+ PSPTPS Sbjct: 391 DRPGLPSAGKTEAVKMELDTENSKNHLYETDALKAVSTYQQKFGRSSYFTDDKFPSPTPS 450 Query: 1244 EEGDNNGDXXXXXXXXXXXXHNVNRALNSLISGQPXXXXXXXXXXXXXXXESSNV--RTV 1417 GD + + A+ SL S +P + + Sbjct: 451 ------GDCEEGVADANEEVSSASIAV-SLTSSKPLLDQMPVSSTSVDRSSMHGLINSRI 503 Query: 1418 DTENNSRSVLKTSYAKSRDPRLRLANSDAGPRNLSPSLPLIGSDESKSKFAGVMSSRKHK 1597 + ++ +KTS A+SRDPRLR NSDA +L+ SL ++ K + AG + SRK K Sbjct: 504 EAASSVTYPVKTS-ARSRDPRLRFINSDASALDLNQSLGT--NNMPKVENAGRVISRKQK 560 Query: 1598 VVQEQVLDGPALKRQKNEXXXXXXXXXXXXXISTSQVAIPSPNLPVSSLIKSPLSFQ--- 1768 +E LD A KR ++ ++ + + + S LI+ Q Sbjct: 561 TTEELSLDATAPKRLRSSLENSRHNTREERTMAGNGGWLEENRVAGSHLIERNHLMQKGE 620 Query: 1769 --------------------TEIMPVKXXXXXXXXXXXXRDIVGNPSMWMSILKMEHQKS 1888 E PV ++I NP+M ++IL + Q+ Sbjct: 621 TELKKTMSTSSGYSTVTSNGNEQAPVTVSNTAAALPGLLKNIAVNPTMLLNILLEQQQRL 680 Query: 1889 
SGDNN--------SAVQMPNSN-----STGVAPSTSGVLPFSSMLGQKPAGIVPCQAVSA 2029 + + N S + + NS + P+ + LP SS+ G PA Sbjct: 681 AAEANKKPVDSATSTMHLTNSARGPDATVNTGPAMTAGLPQSSV-GMLPASTQAASMAHT 739 Query: 2030 --EEPGKVRMKPRDPRRILHNNTP-HKGSTAVSDLPKTNASSLSVIMGS---LSAKEQED 2191 E+ GK+RMKPRDPRRILH ++ K + S+ K+ S S G+ ++A++ + Sbjct: 740 LLEDSGKIRMKPRDPRRILHGSSSLQKSGSTGSEQSKSVVSPTSNNQGNGGNVNAQKLDV 799 Query: 2192 QME-KVVSSGTVKPPDITMQFTNNLRNIADIMSVSQASMPSTILPLSVSSEQQAGT---- 2356 ++E K+ + + PDIT QFT NL+NIADIMSVSQ PST LP + + A Sbjct: 800 RVETKLAPTQSSAQPDITRQFTKNLKNIADIMSVSQE--PSTQLPATTQNVSSASVPFTL 857 Query: 2357 ---DTKIVVNESVNFRSGSNLTSEAATSIPPRPLNANAWSDVEHLFDGFDDQQKAAIQRE 2527 + K V S N + G E T P + + W+DVEHLF+G+D++QKAAIQRE Sbjct: 858 DKAELKSGVPNSQNLQDGVGSAPE--TCAPGSSRSQSTWADVEHLFEGYDEKQKAAIQRE 915 Query: 2528 RARRLEEQNKMFAAGKXXXXXXXXXXXXNSAKFVEVDPLHDEMLRKKEEQDREKPYRHLF 2707 RARRLEEQNKMFA+ K NSAKFVEVDP+HDE+LRKKEEQDREKP+RHLF Sbjct: 916 RARRLEEQNKMFASKKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHRHLF 975 Query: 2708 RFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKYYATEMAKLLDPKGELFSGRVISR 2887 RFPHMGMWTKLRPG+WNFLEKASKLYELHLYTMGNK YATEMAK+LDPKG LF+GRVISR Sbjct: 976 RFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 1035 Query: 2888 GDDGEPFDSDDRVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRR 3067 GDD E D D+R PKSKDLEGV+GMES+VVI+DDSVRVWPHNKLNLIVVERY YFPCSRR Sbjct: 1036 GDDTESVDGDERAPKSKDLEGVMGMESSVVIVDDSVRVWPHNKLNLIVVERYTYFPCSRR 1095 Query: 3068 QFGLPGPSLLEIDHDERPEEGTLASSLAVIERIHEIFFGHDSLDEADVRNILACEQQKIL 3247 QFGLPGPSLLEIDHDERPE GTLASSLAVIERIH+ FF SL+E DVRNILA EQ+KIL Sbjct: 1096 QFGLPGPSLLEIDHDERPEAGTLASSLAVIERIHQNFFASQSLEEVDVRNILASEQRKIL 1155 Query: 3248 AGCRIVFSRVFPVGEANPHMHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALS 3427 AGCRIVFSRVFPVGEANPH+HPLWQTAEQFGAVC NQID+QVTHVVANSLGTDKVNWA+S Sbjct: 1156 AGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCINQIDDQVTHVVANSLGTDKVNWAIS 1215 Query: 3428 RGRFVVHPGWVEASALLYRRANEHDFAIKQQ 3520 GRFVVHPGWVEASALLYRRANE DFAIK + Sbjct: 1216 TGRFVVHPGWVEASALLYRRANEQDFAIKPE 1246 >ref|XP_002304714.2| 
hypothetical protein POPTR_0003s16280g [Populus trichocarpa] gi|550343307|gb|EEE79693.2| hypothetical protein POPTR_0003s16280g [Populus trichocarpa] Length = 1030 Score = 847 bits (2189), Expect = 0.0 Identities = 498/1037 (48%), Positives = 622/1037 (59%), Gaps = 66/1037 (6%) Frame = +2 Query: 602 MTLRLKQQNGAILLRLLAQVTSLRPPLFSSLQLKEMEAIRLCXXXXXXXXXXXXATGRKE 781 M +LK+QN + +R L+ V S P FS KE+E + + A +E Sbjct: 1 MNQKLKEQNKGVFMRFLSLVNSHDPSFFSPEHTKEIE-LMVSSLDSHDILSSSRAGEERE 59 Query: 782 VRQGFNTNDLHVLLENTDSKAAY-LKSRGKEPGSAGSVDQSGHTFPIQCSAPGLVNMRLK 958 + N+ ++ A Y L + + P +A S + F I+ PG+ + + + Sbjct: 60 TQVSGKVNERDN--DSLSKTAGYDLTTMNRLPSAAESFVHNKPNFSIEPPKPGVPSFKSR 117 Query: 959 GLVLPLLDLHKDHDADSLPSPTRDLSATFPFDKGLILEQGLLKPEWPVLRQTLPRENPVL 1138 G++LPLLDL K HD DSLPSPTR+ + +FP + L + G++ PV + E P + Sbjct: 118 GVLLPLLDLKKFHDEDSLPSPTRETAPSFPVQRLLPIGDGMISSGLPVPKVASITEEPRV 177 Query: 1139 HPYETDAVKAVSSYQQKFGGGSFFVNDELPSPTPSEE-GDNNGDXXXXXXXXXXXXH-NV 1312 HPYETDA+KAVSSYQ+KF SFF N ELPSPTPSEE G+ +GD + V Sbjct: 178 HPYETDALKAVSSYQKKFNLNSFFTN-ELPSPTPSEESGNGDGDTAGEVSSSSTVNYRTV 236 Query: 1313 NRALNSLISGQP---XXXXXXXXXXXXXXXESSNVRTVDTENNS-------RSVLKTSYA 1462 N ++ S P +S++R V NS S +K S A Sbjct: 237 NPPVSDRKSASPSPSPPPPPPPPPPPPPHLNNSSIRVVIPTRNSAPVSSGTSSTVKAS-A 295 Query: 1463 KSRDPRLRLANSDAGPRNLSPSLPLIGSDESKSKFAGVMSSRKHKVVQEQVLDGPALKRQ 1642 KSRDPRLR N+DA + + L+ ++ +++ +G ++ + + ++E VLDG +LKRQ Sbjct: 296 KSRDPRLRYVNTDASALDQNQRTLLMVNNPPRAEPSGAIAGSRKQKIEEDVLDGTSLKRQ 355 Query: 1643 KNEXXXXXXXXXXXXXIST-------------------------------SQVAIPSPNL 1729 +N T + V PS Sbjct: 356 RNSFDNFGVVRDIRSMTGTGGWLEDTDMAEPQTVNKNQWAENAEPGQRINNGVVCPSTGS 415 Query: 1730 PVSSLIKSPLSFQTEIMPVKXXXXXXXXXXXXRDIVGNPSMWMSI----------LKMEH 1879 +SS+ S + Q +M + P + I LKM Sbjct: 416 VMSSVSCSG-NVQVPVMGINTIAGSEQAPVTSTTTASLPDLLKDITVNPTMLINILKMGQ 474 Query: 1880 QKSSGDNNSAVQMPNSNSTGVAPSTS---GVLPFSSMLGQKPAGIV---------PCQAV 2023 Q+ + + ST PS++ G +P + + P+GI+ P Q Sbjct: 475 QQRLALDGQQKLADPAKSTSHPPSSNTVLGAIPEVNAVSSLPSGILPRSAGKAQGPSQIA 
534 Query: 2024 SAEEPGKVRMKPRDPRRILHNNTPHKGSTAVSDLPKTNASSLSVIMGSLSAKEQEDQMEK 2203 + +E GK+RMKPRDPRR+LHNN + + S+ KT + S G+ + + Q E Sbjct: 535 TTDESGKIRMKPRDPRRVLHNNALQRAGSLGSEQFKTTTLT-STTQGTKDNQNLQKQ-EG 592 Query: 2204 VVSSGTVKPPDITMQFTNNLRNIADIMSVSQASMPSTILPLSVSSEQQAGTDTKIVVNES 2383 + V PPDI+ FT +L+NIADI+SVSQ + +V+S+ ++ Sbjct: 593 LAELKPVVPPDISSPFTKSLKNIADIVSVSQTCTTPPFVSQNVASQPVQIKSDRVDGKTG 652 Query: 2384 VNFRSGSNLTSEAATSIPPRPLNANAWSDVEHLFDGFDDQQKAAIQRERARRLEEQNKMF 2563 ++ + + + L+ N W DVEHLF+G+DDQQKAAIQRERARR+EEQ K+F Sbjct: 653 ISNSDQKMGPASSPEVVAASSLSQNTWEDVEHLFEGYDDQQKAAIQRERARRIEEQKKLF 712 Query: 2564 AAGKXXXXXXXXXXXXNSAKFVEVDPLHDEMLRKKEEQDREKPYRHLFRFPHMGMWTKLR 2743 AA K NSAKFVEVDP+HDE+LRKKEEQDREKPYRHLFRFPHMGMWTKLR Sbjct: 713 AARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPYRHLFRFPHMGMWTKLR 772 Query: 2744 PGIWNFLEKASKLYELHLYTMGNKYYATEMAKLLDPKGELFSGRVISRGDDGEPFDSDDR 2923 PGIWNFLEKASKLYELHLYTMGNK YATEMAK+LDPKG LF+GRV+SRGDDG+ D D+R Sbjct: 773 PGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVVSRGDDGDLLDGDER 832 Query: 2924 VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEI 3103 VPKSKDLEGVLGMES VVIIDDS+RVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEI Sbjct: 833 VPKSKDLEGVLGMESGVVIIDDSLRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEI 892 Query: 3104 DHDERPEEGTLASSLAVIERIHEIFFGHDSLDEADVRNILACEQQKILAGCRIVFSRVFP 3283 DHDERPE+GTLA SLAVIERIH+ FF H SLDEADVRNILA EQ+KILAGCRIVFSRVFP Sbjct: 893 DHDERPEDGTLACSLAVIERIHQNFFTHHSLDEADVRNILASEQRKILAGCRIVFSRVFP 952 Query: 3284 VGEANPHMHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSRGRFVVHPGWVE 3463 VGE NPH+HPLWQ+AEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALS GRFVVHPGWVE Sbjct: 953 VGEVNPHLHPLWQSAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVE 1012 Query: 3464 ASALLYRRANEHDFAIK 3514 ASALLYRRANE DFAIK Sbjct: 1013 ASALLYRRANEQDFAIK 1029 >ref|XP_004310239.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Fragaria vesca subsp. 
vesca] Length = 1230 Score = 847 bits (2189), Expect = 0.0 Identities = 512/1091 (46%), Positives = 645/1091 (59%), Gaps = 59/1091 (5%) Frame = +2 Query: 419 EELVNLNAADAQKSFDALCSRLETTAIKLQKIVLEGPFAERDTLVQLFIAAIQTLHKVFS 598 E L +L +A+KSF +C R + L+ ++ E + ++ LVQ A++ + VF Sbjct: 171 EALESLTITEAEKSFGDVCHRFLDSLESLRGVLSEINVSTKEALVQQLFNAVRAISSVFR 230 Query: 599 SMTLRLKQQNGAILLRLLAQVTSLRPPLFSSLQLKEMEAIRLCXXXXXXXXXXXXATGRK 778 SM+ K+QN +L R+L+ S P F + QLKE+E + + Sbjct: 231 SMSADQKEQNKDVLSRILSSAKS-DPSPFPAEQLKEIEVMS-------------SSMDSP 276 Query: 779 EVRQGFNTNDLHVL--LENTDS-----KAAYLKSRGKEPGSAG--SVDQSGHTFPIQCSA 931 + + G N + + + TDS A+++ + GS SV S + Sbjct: 277 QTKAGTKENGIQCINGVYKTDSDTSGANASHVFTYAANTGSDTQVSVVHSNPNISSEVPR 336 Query: 932 PGLVNMRLKGLVLPLLDLHKDHDADSLPSPTRDLSATFPFDKGLILEQGLLKPE-WPVLR 1108 G + + +GL+LPLLDLH DHD DSLPSPTR+ A FP K +++E G++K W R Sbjct: 337 SGSSSFKGRGLMLPLLDLHMDHDEDSLPSPTREPPACFPAQKPVVVENGMVKKSGWETAR 396 Query: 1109 QTLPRENPVLHPYETDAVKAVSSYQQKFGGGSFFVNDELPSPTPSEEGDNNGDXXXXXXX 1288 L E +H YET+A+KAVSSYQQKF SF + ELPSPTPSEE +NGD Sbjct: 397 AALDVEGSKMHVYETEALKAVSSYQQKFSRNSFLTS-ELPSPTPSEEEGDNGDDAAVGEV 455 Query: 1289 XXXXXHNVNRALNSLISGQPXXXXXXXXXXXXXXXESSNV--RTVDTENNSRSVLKTSYA 1462 N R +SG+ + +T + ++ S A Sbjct: 456 SSSSASNNVRTPQPPVSGRQVVSSVPATTLPGSSGMHGLITAKTASPVSLGSNMPNKSSA 515 Query: 1463 KSRDPRLRLANSDAGPRNLSPSLPLIGSDESKSKFAGVMSSRKHKVVQEQVLDGPALKRQ 1642 KSRDPRLR ANSDAG L+ + + K +SSRKHK ++ DGP KRQ Sbjct: 516 KSRDPRLRFANSDAGALTLNQQSSIQVHNAPKVDSVITLSSRKHKSPEDSNFDGPESKRQ 575 Query: 1643 K---------------NEXXXXXXXXXXXXXISTSQV-----AIPSPNLPVSSLIKSPLS 1762 + N I+ +Q A P + VSS SP + Sbjct: 576 RGANSVVGWGAKTSFGNGVWLEDGSSVGPHLINRNQTVEKKEADPRKMVNVSS---SPGT 632 Query: 1763 FQ---------TEIMPVKXXXXXXXXXXXXRDIVGNPSMWMSILKMEHQKSSGDNNSAVQ 1915 + E +P+ +DI NP+M ++ILK+ + N+A Sbjct: 633 VEGNSNGQNTANEKVPL-VAPSLVSLPAIFKDIAVNPTMLVNILKL----AEAQQNAAA- 686 Query: 1916 MPNSNSTGVAPSTSGVLPFSSMLGQKPAGI--------VPCQAVSAEEPGKVRMKPRDPR 2071 P + P +S +P ++ L P+ + Q +E GK+RMK RDPR Sbjct: 687 
-PARKESLTYPPSSSSIPGTAALVNDPSKTSGALLTPTICSQKTPTDEAGKIRMKLRDPR 745 Query: 2072 RILHNNTPHKGSTAVSDLPKTNASSLSVIMGS---LSAKEQEDQMEK---VVSSGTVKPP 2233 R+LH N + + + LS + ++ K+Q+ Q + SG + P Sbjct: 746 RLLHGNALQNSGSVGHEQSRNIVPPLSSSQANNDDMNGKKQDSQADNNSVTSQSGALGAP 805 Query: 2234 DITMQFTNNLRNIADIMSVSQASM-PSTILPLSVSSEQQAGTDTKIVVNESVNFRSGSNL 2410 DI QFT NL+NIADI+SVSQ S P+T Q T+ + ++V+ ++ Sbjct: 806 DIASQFTKNLKNIADIISVSQVSTSPAT-------PSQNLSTELISINPDNVDLKAEEQH 858 Query: 2411 TSEAATSIPPRPLNANA---WSDVEHLFDGFDDQQKAAIQRERARRLEEQNKMFAAGKXX 2581 T + S+P + + W DVEHLF+G+DD+QKAAIQRERARR+EEQ KMFAA K Sbjct: 859 TGSISASVPTAAGASRSPATWGDVEHLFEGYDDKQKAAIQRERARRIEEQKKMFAAHKLC 918 Query: 2582 XXXXXXXXXXNSAKFVEVDPLHDEMLRKKEEQDREKPYRHLFRFPHMGMWTKLRPGIWNF 2761 NSAKFVEVDP+HDE+LRKKEEQDR++P RHLFRF HMGMWTKLRPG+W F Sbjct: 919 LVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDRKEPQRHLFRFQHMGMWTKLRPGVWKF 978 Query: 2762 LEKASKLYELHLYTMGNKYYATEMAKLLDPKGELFSGRVISRGDDGEPFDSDDRVPKSKD 2941 LEKAS L+E+HLYTMGNK YATEMAK+LDP G LF+GRVISRGDDG+P+D D+RVPKSKD Sbjct: 979 LEKASHLFEMHLYTMGNKLYATEMAKVLDPTGALFAGRVISRGDDGDPYDGDERVPKSKD 1038 Query: 2942 LEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERP 3121 LEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY YFPCSRRQFGL GPSLLEIDHDER Sbjct: 1039 LEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERH 1098 Query: 3122 EEGTLASSLAVIERIHEIFFGHDSLDEADVRNILACEQQKILAGCRIVFSRVFPVGEANP 3301 E+GTLASSLAVIE+IH+IFF H SLDEADVRNILA EQQKIL GCRIVFSRVFPVGE NP Sbjct: 1099 EDGTLASSLAVIEKIHQIFFSHPSLDEADVRNILASEQQKILGGCRIVFSRVFPVGEVNP 1158 Query: 3302 HMHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSRGRFVVHPGWVEASALLY 3481 H+HPLWQTAEQFGAVCTNQID+QVTHVVANSLGTDKVNWALS G++VVHPGWVEASALLY Sbjct: 1159 HLHPLWQTAEQFGAVCTNQIDDQVTHVVANSLGTDKVNWALSSGKYVVHPGWVEASALLY 1218 Query: 3482 RRANEHDFAIK 3514 RRANE DFAIK Sbjct: 1219 RRANEQDFAIK 1229 >ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis] gi|223548611|gb|EEF50102.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis] Length = 1195 Score = 843 
bits (2178), Expect = 0.0 Identities = 488/922 (52%), Positives = 590/922 (63%), Gaps = 62/922 (6%) Frame = +2 Query: 935 GLVNMRLKGLVLPLLDLHKDHDADSLPSPTRDLSATFPFDKGLILEQGLLKPEWPVLRQT 1114 G+ + + + +LPLLDLHKDHDADSLPSPTR+ + P + +L P + Sbjct: 302 GVSSFKSRAALLPLLDLHKDHDADSLPSPTRESALPLPAYR-------VLTP-----KMV 349 Query: 1115 LPRENPVLHPYETDAVKAVSSYQQKFGGGSFFVNDELPSPTPSEEGDNNGDXXXXXXXXX 1294 L N +HPYETDA+KAVSSYQQKF SF + D LPSPTPSEE NGD Sbjct: 350 LDTGNSRMHPYETDALKAVSSYQQKFSKSSFALTDRLPSPTPSEES-GNGDGDTGGEVSS 408 Query: 1295 XXXHNVNRALNSLISGQPXXXXXXXXXXXXXXXESSNVRTVDTENNSRSVLKTSYAKSRD 1474 + R N L SGQ ++++ +++ S+ + AKSRD Sbjct: 409 SLSVSSFRPANPLTSGQSNASISLPRMDGSSLPGVISIKSAVRASSAPSLTVKASAKSRD 468 Query: 1475 PRLRLANSDAGPRNLSPSLPLIGSDESKSKFAGVMSSRKHKVVQEQVLDGPALKRQKN-- 1648 PRLR NSD+ + + + + G M+ ++ K+V + + DG +LKRQKN Sbjct: 469 PRLRFVNSDSNALDQNHRAVPVVNTLKVEPIGGTMNKKRQKIVDDPIPDGHSLKRQKNAL 528 Query: 1649 ------------------------------------EXXXXXXXXXXXXXISTSQVAIPS 1720 + + TS I S Sbjct: 529 ENSGVVRDVKTMVGSGGWLEDTDMVGPQTMNKNQLVDNAESDPRRKDGGGVCTSSSCISS 588 Query: 1721 PNLPVSSLIK---SPLSFQTEIMPVKXXXXXXXXXXXXRDIVGNPSMWMSILKM------ 1873 N+ + I + + E++PVK ++I NP+M ++ILKM Sbjct: 589 VNISGTEQIPVTGTSVPIGGELVPVKGSTAAIPDLL--KNIAVNPTMLINILKMGQQQRL 646 Query: 1874 --EHQKSSGDNNSAVQMP-NSNST-GVAP----STSGVLPFSSMLGQKPAGIVPC--QAV 2023 E Q+ D + P NSNS G P + SG+LP +PAG V Q Sbjct: 647 ALEAQQKPVDPAKSTTYPLNSNSMLGTVPVVGAAHSGILP-------RPAGTVQVSPQLG 699 Query: 2024 SAEEPGKVRMKPRDPRRILHNNTPHKGSTAVSDLPKTNASSLSV---IMGSLSAKEQEDQ 2194 +A++ GK+RMKPRDPRR+LHNN + + S+ KTN +S+ + + + ++QE Q Sbjct: 700 TADDLGKIRMKPRDPRRVLHNNALQRNGSMGSEHLKTNLTSIPINQETKDNQNLQKQEGQ 759 Query: 2195 MEKV-VSSGTVKPPDITMQFTNNLRNIADIMSVSQASMPSTILPLSVSSEQQAGTDTKIV 2371 +EK V ++ PDI+M FT NL+NIADI+SVS AS ++P + +S+ T Sbjct: 760 VEKKPVPLQSLALPDISMPFTKNLKNIADIVSVSHASTSQPLVPQNPASQPMRTT----- 814 Query: 2372 VNESVNFRS-GSNLTSEAATSIPPRPLNANAWSDVEHLFDGFDDQQKAAIQRERARRLEE 2548 ++ S F GS + AA + PR NAW DVEHLF+G++DQQKAAIQRERARR+EE Sbjct: 815 
ISSSDQFLGIGSAPGAAAAAAAGPR--TQNAWGDVEHLFEGYNDQQKAAIQRERARRIEE 872 Query: 2549 QNKMFAAGKXXXXXXXXXXXXNSAKFVEVDPLHDEMLRKKEEQDREKPYRHLFRFPHMGM 2728 Q K+F+A K NSAKFVEVDP+HDE+LRKKEEQDREK +RHLFRFPHMGM Sbjct: 873 QKKLFSARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAHRHLFRFPHMGM 932 Query: 2729 WTKLRPGIWNFLEKASKLYELHLYTMGNKYYATEMAKLLDPKGELFSGRVISRGDDGEPF 2908 WTKLRPGIWNFLEKASKLYELHLYTMGNK YATEMAK+LDP G LF+GRVISRGDDGEPF Sbjct: 933 WTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGVLFNGRVISRGDDGEPF 992 Query: 2909 DSDDRVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGP 3088 D D+R+PKSKDLEGVLGMES VVI+DDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGP Sbjct: 993 DGDERIPKSKDLEGVLGMESGVVIMDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGP 1052 Query: 3089 SLLEIDHDERPEEGTLASSLAVIERIHEIFFGHDSLDEADVRNILACEQQKILAGCRIVF 3268 SLLEIDHDERPE+GTLA SLAVIERIH+ FF H SLDEADVRNILA EQ+KILAGCRIVF Sbjct: 1053 SLLEIDHDERPEDGTLACSLAVIERIHQNFFTHPSLDEADVRNILASEQRKILAGCRIVF 1112 Query: 3269 SRVFPVGEANPHMHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSRGRFVVH 3448 SRVFPVGEANPH+HPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALS GRFVV+ Sbjct: 1113 SRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVY 1172 Query: 3449 PGWVEASALLYRRANEHDFAIK 3514 PGWVEASALLYRRANE DFAIK Sbjct: 1173 PGWVEASALLYRRANEQDFAIK 1194 >ref|XP_004157633.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain phosphatase-like 3-like [Cucumis sativus] Length = 1249 Score = 842 bits (2175), Expect = 0.0 Identities = 509/1067 (47%), Positives = 621/1067 (58%), Gaps = 45/1067 (4%) Frame = +2 Query: 449 AQKSFDALCSRLETTAIKLQKIVLEGPFAERDTLVQLFIAAIQTLHKVFSSMTLRLKQQN 628 AQKSF +CS++ ++ +++ +D L+Q AA++ ++ VF SM L K+++ Sbjct: 213 AQKSFQEVCSQIHSSIETFVELLQGKVVPRKDALIQRLYAALRLINSVFCSMNLSEKEEH 272 Query: 629 GAILLRLLAQVTSLRPPLFSSLQLKEMEAIRLCXXXXXXXXXXXXATGRKEVR--QGFNT 802 L RLL+ V + PPLFS Q+K +E + E+ G Sbjct: 273 KEHLSRLLSYVKNCDPPLFSPEQIKSVEVKMPSTDSLDHLPSMRGSAKEVEIHIPNGVKD 332 Query: 803 NDLHVLLENTDSKAAYLKSRGKEPGSAGSVDQSGHTFPIQCSAPGLVNMRLKGLVLPLLD 982 D + +T S+ + G ++ + G+ +++ +G 
+LPLLD Sbjct: 333 MDFYSAYTSTSSQLTPSNKLASDSIPFGVKGKNNLNILSEGLQSGVSSIKGRGPLLPLLD 392 Query: 983 LHKDHDADSLPSPTRDLSATFPFDKGLILEQGLLKPEWPVLRQTLPRENPVLHPYETDAV 1162 LHKDHDADSLPSPTR+ F K P + P + HPYETDA+ Sbjct: 393 LHKDHDADSLPSPTREAPTIFSVQKS---------GNAPT-KMAFPVDGSRSHPYETDAL 442 Query: 1163 KAVSSYQQKFGGGSFFVNDELPSPTPSEEGDNNGDXXXXXXXXXXXXHNVNRAL---NSL 1333 KAVS+YQQKFG SF + D LPSPTPSEE D GD ++ R+L N Sbjct: 443 KAVSTYQQKFGRSSFSMADRLPSPTPSEEHDGGGD-----IGGEVSSSSIIRSLKSSNVS 497 Query: 1334 ISGQPXXXXXXXXXXXXXXXESSNVRTVDTENNSRSVLKTS------YAKSRDPRLRLAN 1495 GQ +SS+ R + + N S AKSRDPRLR+ N Sbjct: 498 KPGQKSNSASNVSTGLFPNMDSSSTRVLISPLNVAPPSSVSNPTVKPLAKSRDPRLRIVN 557 Query: 1496 SDAGPRNLSPSLPLIGSDESKSKFAGVMSSRKHKVVQEQVLDGPALKRQKNEXXXXXXXX 1675 SDA +L+P S + A + RK K+ E DGP +KR + Sbjct: 558 SDASGMDLNPRTMASVQSSSILESAATLHLRKQKMDGEPNTDGPEVKRLRIGSQNLAVAA 617 Query: 1676 XXXXXISTS------------------QVAIPSPNLPVSSLIKSPLSFQTEIMPVKXXXX 1801 +S S Q+ I N S + + E P Sbjct: 618 SDVRAVSGSGGWLEDTMPAGPRLFNRNQMEIAEANATEKSNVTNNSGSGNECTPTVNNSN 677 Query: 1802 XXXXXXXXRDIVGNPSMWMSILKMEHQ---------KSSGDNNSAVQMPNSN-STGVAPS 1951 +DIV NP+M +++LKM Q KSS +A+ + N G +P Sbjct: 678 DASLPSLLKDIVVNPTMLLNLLKMSQQQQLAAELKLKSSEPEKNAICPTSLNPCQGSSPL 737 Query: 1952 TSGVLPFSSMLGQKPAGIVPCQAV--SAEEPGKVRMKPRDPRRILHNNTPHKGSTAVSDL 2125 + + S +L Q+ AG V ++ GKVRMKPRDPRR+LH N+ K + +D Sbjct: 738 INAPVATSGIL-QQSAGTPSASPVVGRQDDLGKVRMKPRDPRRVLHGNSLQKVGSLGNDQ 796 Query: 2126 PKTNASSLSVIMGSL---SAKEQEDQMEKVVSSGTVKPPDITMQFTNNLRNIADIMSVSQ 2296 K + S GS + +QE Q + ++S PDI QFTNNL+NIADIMSV Sbjct: 797 LKGVVPTASNTEGSRDIPNGHKQEGQGDSKLASSQTILPDIGRQFTNNLKNIADIMSVPS 856 Query: 2297 ASMPSTILPLSVSSE-QQAGTDTKIVVNESVNFRSGSNLTSEAATSIPPRPLNANAWSDV 2473 P T P S S + D+K V T+ A + + AW D+ Sbjct: 857 ---PPTSSPNSSSKPVGSSSMDSKPVT------------TAFQAVDMAASSRSQGAWGDL 901 Query: 2474 EHLFDGFDDQQKAAIQRERARRLEEQNKMFAAGKXXXXXXXXXXXXNSAKFVEVDPLHDE 2653 EHLFD +DD+QKAAIQRERARR+EEQ KMFAA K NSAKFVEVDP+HDE Sbjct: 902 EHLFDSYDDKQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDE 961 
Query: 2654 MLRKKEEQDREKPYRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKYYATEM 2833 +LRKKEEQDREK RHLFRFPHMGMWTKLRPG+WNFLEKAS+LYELHLYTMGNK YATEM Sbjct: 962 ILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHLYTMGNKLYATEM 1021 Query: 2834 AKLLDPKGELFSGRVISRGDDGEPFDSDDRVPKSKDLEGVLGMESAVVIIDDSVRVWPHN 3013 AK+LDPKG LF+GRVISRGDDG+P D DDRVPKSKDLEGVLGMES VVIIDDS+RVWPHN Sbjct: 1022 AKVLDPKGVLFAGRVISRGDDGDPLDGDDRVPKSKDLEGVLGMESGVVIIDDSIRVWPHN 1081 Query: 3014 KLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEEGTLASSLAVIERIHEIFFGHDS 3193 K+NLIVVERY YFPCSRRQFGL GPSLLEIDHDERPE+GTLASSL VI+RIH+ FF + Sbjct: 1082 KMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLGVIQRIHQXFFSNPE 1141 Query: 3194 LDEADVRNILACEQQKILAGCRIVFSRVFPVGEANPHMHPLWQTAEQFGAVCTNQIDEQV 3373 LD+ DVR IL+ EQQKILAGCRIVFSRVFPVGEANPH+HPLWQTAEQFGA CTNQIDEQV Sbjct: 1142 LDQVDVRTILSAEQQKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAQCTNQIDEQV 1201 Query: 3374 THVVANSLGTDKVNWALSRGRFVVHPGWVEASALLYRRANEHDFAIK 3514 THVVANSLGTDKVNWALS GRFVVHPGWVEASALLYRRA E DFAIK Sbjct: 1202 THVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRATEQDFAIK 1248