BLASTX nr result
ID: Akebia23_contig00009264
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00009264
         (1981 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal doma...   356   2e-95
emb|CBI35661.3| unnamed protein product [Vitis vinifera]              330   2e-87
gb|EXB81217.1| RNA polymerase II C-terminal domain phosphatase-l...   328   7e-87
ref|XP_007043830.1| RNA polymerase II C-terminal domain phosphat...   317   1e-83
ref|XP_006438860.1| hypothetical protein CICLE_v10030535mg [Citr...   307   1e-80
ref|XP_006438858.1| hypothetical protein CICLE_v10030535mg [Citr...   307   1e-80
ref|XP_006438857.1| hypothetical protein CICLE_v10030535mg [Citr...   307   1e-80
ref|XP_004310239.1| PREDICTED: RNA polymerase II C-terminal doma...   281   8e-73
ref|XP_002297869.2| CTD phosphatase-like protein 3 [Populus tric...   274   1e-70
ref|XP_004157633.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymera...   265   4e-68
ref|XP_004140651.1| PREDICTED: RNA polymerase II C-terminal doma...   265   4e-68
ref|XP_002304648.2| hypothetical protein POPTR_0003s16280g [Popu...   260   1e-66
gb|EYU42076.1| hypothetical protein MIMGU_mgv1a000356mg [Mimulus...   249   4e-63
ref|XP_007139315.1| hypothetical protein PHAVU_008G019000g [Phas...   240   2e-60
ref|XP_006603006.1| PREDICTED: RNA polymerase II C-terminal doma...   237   1e-59
ref|XP_006341905.1| PREDICTED: RNA polymerase II C-terminal doma...   237   2e-59
ref|XP_004492029.1| PREDICTED: RNA polymerase II C-terminal doma...   233   3e-58
ref|XP_003530482.2| PREDICTED: RNA polymerase II C-terminal doma...   231   1e-57
ref|XP_004492028.1| PREDICTED: RNA polymerase II C-terminal doma...
227 2e-56 ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative ... 223 2e-55 >ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Vitis vinifera] Length = 1238 Score = 356 bits (914), Expect = 2e-95 Identities = 270/693 (38%), Positives = 358/693 (51%), Gaps = 88/693 (12%) Frame = -3 Query: 1820 ERKILIWMAHDRIEEGEISDNSQSIEAIAEEDF-KQESKVSNRG-----SRVW----MDD 1671 E++ I M + +EEGEISD S S+E I+EEDF KQE +V +RVW + D Sbjct: 3 EKENNIMMGIEDVEEGEISD-SASVEEISEEDFNKQEVRVLREAKPKADTRVWTMRDLQD 61 Query: 1670 MLKY-PISSNYGSGLYNFAWAQAVQNKPLSEILMRDFGSIEKSKRS-------------- 1536 + KY S Y LYN AWAQAVQNKPL++I + D E+SKRS Sbjct: 62 LYKYHQACSGYTPRLYNLAWAQAVQNKPLNDIFVMDD---EESKRSSSSSNTSRDDSSSA 118 Query: 1535 ------IIDDSRADXXXXXXXXXXXXXXXXXXXXXXXXXXXE----GGLSNNSNLARNLE 1386 IIDDS + + GG+ + + +L+ Sbjct: 119 KEVAKVIIDDSGDEMDVKMDDVSEKEEGELEEGEIDLDSEPDVKDEGGVLDVNEPEIDLK 178 Query: 1385 EREFENRIKSIREALGTVTVKDAEISFHGVCXXXXXXXXXXXLM-----IMENGTLDVDD 1221 ERE R+KSI+E L +VTV +AE SF GVC + + E+ D Sbjct: 179 ERELVERVKSIQEDLESVTVIEAEKSFSGVCSRLQNTLGSLQKVFGEKVVGESSVPTKDA 238 Query: 1220 LIQQSFDGIQAINSVFCSMNPKQLEQNKDIFSSLLSHVMSQDTTLFSPKQMKEIEAMMPF 1041 L QQ + I+A+N VFCSMN Q E NKD+FS LLS V D+ +FS + +KE+E MM F Sbjct: 239 LAQQLINAIRALNHVFCSMNSNQKELNKDVFSRLLSCVECGDSPIFSIQHIKEVEVMMSF 298 Query: 1040 MDLQAVVPSVKAAEKEKEIQVNNGVNPNELGILGENP-----SSKKFLLEPIPVIA---N 885 +D A S +A++K ++QV +G+N N L E+ S+KK L+ I V + N Sbjct: 299 LDTPAAQSSAEASDKVNDVQVTDGMNRNILDSSVESSGRAFASAKKLSLDSISVESYNQN 358 Query: 884 MGFEIKPXXXXXXXXXXXXXXXLDLHRKHDVDSLPSPTRETPLPSPLVKSELATPNVTDE 705 +KP DLH+ HD DSLPSPT + P P+ KSEL T V E Sbjct: 359 NPDALKPGLSSSRGRFIFGPLL-DLHKDHDEDSLPSPTGKAPQCFPVNKSELVTAKVAHE 417 Query: 704 SEDAMMYRYETDALKAFSTYQQKFGRTSNFLTDQLPSPTPSEECDEVDFDLSGEVSSSST 525 ++D++M+ YETDALKA STYQQKFG TS D+LPSPTPSEE + D+SGEVSSSST Sbjct: 418 TQDSIMHPYETDALKAVSTYQQKFGLTSFLPIDKLPSPTPSEESGDTYGDISGEVSSSST 477 Query: 524 VGNVRTVNPSVSLQPVSSPTAHMDSSSGQ----------------TGSNLVLKAKSRDPR 393 + T N P+ S MDSS Q 
S++V AKSRDPR Sbjct: 478 ISAPITANAPALGHPIVSSAPQMDSSIVQGPTVGRNTSLVSSGPHLDSSVVASAKSRDPR 537 Query: 392 LRFTNSEGDASVLNQYPL--LEDAPKSETLGGSISSRKHTIIVESVSDGQSQNFKRQRNG 219 LR +S+ + LN+ PL + ++PK + LG +SSRK E + DG KRQRNG Sbjct: 538 LRLASSDAGSLDLNERPLPAVSNSPKVDPLGEIVSSRKQKSAEEPLLDGPVT--KRQRNG 595 Query: 218 LTD--SLITGYVPMVSG----DRSTVGTQVTDKNILAKNMGTDPRESEK----------- 90 LT ++ + SG D +TV Q+ ++N L +N GTDP++ E Sbjct: 596 LTSPATVRDAQTVVASGGWLEDSNTVIPQMMNRNQLIENTGTDPKKLESKVTVTGIGCDK 655 Query: 89 -----GENERLPMIGPSTMASLPSLLRDIAVNP 6 NE LP++ ST ASL SLL+DIAVNP Sbjct: 656 PYVTVNGNEHLPVVATSTTASLQSLLKDIAVNP 688 >emb|CBI35661.3| unnamed protein product [Vitis vinifera] Length = 1184 Score = 330 bits (845), Expect = 2e-87 Identities = 248/637 (38%), Positives = 326/637 (51%), Gaps = 39/637 (6%) Frame = -3 Query: 1799 MAHDRIEEGEISDNSQSIEAIAEEDF-KQESKVSNRG-----SRVW----MDDMLKY-PI 1653 M + +EEGEISD S S+E I+EEDF KQE +V +RVW + D+ KY Sbjct: 50 MGIEDVEEGEISD-SASVEEISEEDFNKQEVRVLREAKPKADTRVWTMRDLQDLYKYHQA 108 Query: 1652 SSNYGSGLYNFAWAQAVQNKPLSEILMRDFGSIEKSKRSIIDDSRADXXXXXXXXXXXXX 1473 S Y LYN AWAQAVQNKPL++I + IIDDS + Sbjct: 109 CSGYTPRLYNLAWAQAVQNKPLNDIFV------------IIDDSGDEMDVKMDDVSEKEE 156 Query: 1472 XXXXXXXXXXXXXXE----GGLSNNSNLARNLEEREFENRIKSIREALGTVTVKDAEISF 1305 + GG+ + + +L+ERE R+KSI+E L +VTV +AE SF Sbjct: 157 GELEEGEIDLDSEPDVKDEGGVLDVNEPEIDLKERELVERVKSIQEDLESVTVIEAEKSF 216 Query: 1304 HGVCXXXXXXXXXXXLM-----IMENGTLDVDDLIQQSFDGIQAINSVFCSMNPKQLEQN 1140 GVC + + E+ D L QQ + I+A+N VFCSMN Q E N Sbjct: 217 SGVCSRLQNTLGSLQKVFGEKVVGESSVPTKDALAQQLINAIRALNHVFCSMNSNQKELN 276 Query: 1139 KDIFSSLLSHVMSQDTTLFSPKQMKEIEAMMPFMDLQAVVPSVKAAEKEKEIQVNNGVNP 960 KD+FS LLS V D+ +FS + +KE+E MM F+D A S +A++K ++QV +G+N Sbjct: 277 KDVFSRLLSCVECGDSPIFSIQHIKEVEVMMSFLDTPAAQSSAEASDKVNDVQVTDGMNR 336 Query: 959 NELGILGENP-----SSKKFLLEPIPVIANMGFEIKPXXXXXXXXXXXXXXXLDLHRKHD 795 N L E+ S+KKF LDLH+ HD Sbjct: 337 NILDSSVESSGRAFASAKKF-----------------------RGRFIFGPLLDLHKDHD 373 Query: 794 
VDSLPSPTRETPLPSPLVKSELATPNVTDESEDAMMYRYETDALKAFSTYQQKFGRTSNF 615 DSLPSPT + P P+ KSEL T V E++D++M+ YETDALKA STYQQKFG TS Sbjct: 374 EDSLPSPTGKAPQCFPVNKSELVTAKVAHETQDSIMHPYETDALKAVSTYQQKFGLTSFL 433 Query: 614 LTDQLPSPTPSEECDEVDFDLSGEVSSSSTVGNVRTVNPSVSLQPVSSPTAHMDSSSG-- 441 D+LPSPTPSEE + D+SGEVSSSST+ T N P+ S MD G Sbjct: 434 PIDKLPSPTPSEESGDTYGDISGEVSSSSTISAPITANAPALGHPIVSSAPQMDIVQGLV 493 Query: 440 ---QTGS-----NLVLK--AKSRDPRLRFTNSEGDASVLNQYPL--LEDAPKSETLGGSI 297 TG+ N +L+ AKSRDPRLR +S+ + LN+ PL + ++PK + LG + Sbjct: 494 VPRNTGAVNSRFNSILRASAKSRDPRLRLASSDAGSLDLNERPLPAVSNSPKVDPLGEIV 553 Query: 296 SSRKHTIIVESVSDGQSQNFKRQRNGLTDSLITGYVPMVSGDRSTVGTQVTDKNILAKNM 117 SSRK E + DG KRQRNGLT + T++ K + + Sbjct: 554 SSRKQKSAEEPLLDGPVT--KRQRNGLT----------------SPATKLESK-VTVTGI 594 Query: 116 GTDPRESEKGENERLPMIGPSTMASLPSLLRDIAVNP 6 G D NE LP++ ST ASL SLL+DIAVNP Sbjct: 595 GCDKPYVTVNGNEHLPVVATSTTASLQSLLKDIAVNP 631 >gb|EXB81217.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Morus notabilis] Length = 1301 Score = 328 bits (840), Expect = 7e-87 Identities = 265/705 (37%), Positives = 350/705 (49%), Gaps = 111/705 (15%) Frame = -3 Query: 1784 IEEGEISDNSQSIEAIAEEDF-KQESKVSNRG-----------------SRVWM--DDML 1665 +EEGEISD S S+E I+EEDF KQE + G SRVW D Sbjct: 13 VEEGEISD-SASVEEISEEDFNKQEGNGTGSGKVMSVSDSNSKESKFGDSRVWTMRDLYA 71 Query: 1664 KYPISSNYGSGLYNFAWAQAVQNKPLSEILMRDFGS------------------------ 1557 YP Y +GLYN AWAQAVQNKPL+EI + D + Sbjct: 72 NYPGFRGYTTGLYNLAWAQAVQNKPLNEIFVMDVDADDSSRVVLSSASPAVNSGRREGKN 131 Query: 1556 ----IEKSKRSIIDDSRADXXXXXXXXXXXXXXXXXXXXXXXXXXXEGGLSNNSNLARNL 1389 +EK ++ +IDDS + +G L+ + L Sbjct: 132 GVKEVEKVEKVVIDDSADEMEEGELEEGEIDLESEPTQKPAGEEAKDGDLNCEAENVGGL 191 Query: 1388 E----EREFENRIKSIREALGTVTVKDAEISFHGVCXXXXXXXXXXXLMIMENG----TL 1233 E E E R+ I E LG+V V +AE SF VC ++ E T Sbjct: 192 EVDSRRDELEKRVDLIWETLGSVNVVNAEKSFEEVCSRLQRTLESLRGVLSEKEFSFPTK 251 Query: 1232 DVDDLIQQSFDGIQAINSVFCSMNPKQLEQNKDIFSSLLSHVMSQDTTLFSPKQMKEIEA 1053 DV +IQ S IQ +NSVFCSM+ Q EQ K+ S L V + T 
LFSP+Q KEIE Sbjct: 252 DV--VIQMSITAIQVVNSVFCSMSVNQKEQKKETLSRLFCSVKNCGTPLFSPEQTKEIEL 309 Query: 1052 MMPFMDLQAVVPSVKAAEKEKEIQVNNGVNPNELGILG---ENPSSKKFLLE-PIPVIAN 885 M+ ++ V+PS A++KEKE Q+ ++ + + EN S ++ ++ P +A+ Sbjct: 310 MISSLNPLNVLPSSGASDKEKETQIIERLHEMDSNLTNANAENASIERTSVKLPQDCVAS 369 Query: 884 MGF-------EIKPXXXXXXXXXXXXXXXLDLHRKHDVDSLPSPTRETPLPSP------- 747 + E+ LDLH+ HD DSLPSPTRE P P Sbjct: 370 VVHSNPITLPELLRPGTLAFKGRGLLLPLLDLHKDHDADSLPSPTREAPSCFPVYKPLGV 429 Query: 746 ---LVKSELATPNVTDESEDAMMYRYETDALKAFSTYQQKFGRTSNFLTDQLPSPTPSEE 576 ++K T V +E++ ++RYETDALKA STYQQKFGR S ++D+LPSPTPSEE Sbjct: 430 ADGIIKPVSTTAKVAPGAEESRLHRYETDALKAVSTYQQKFGRGSFLMSDRLPSPTPSEE 489 Query: 575 CDEVDFDLSGEVSSSSTVGNVRT-----VNPSV--SLQPVSSPT-----AHMDSSSGQTG 432 CDE D D++ EVSSS T GN+RT + PSV S PVSSPT A +++ +G Sbjct: 490 CDEED-DINQEVSSSLTSGNLRTPAIPILRPSVVTSSVPVSSPTMQGPIAAKNAAPVGSG 548 Query: 431 SNLVLK--AKSRDPRLRFTNSEGDASVLNQYPL--LEDAPKSETLGGSISSRKHTIIVES 264 SN +K A+SRDPRLRF NS+ A LNQ PL + + PK E G SSRK I+ E Sbjct: 549 SNSTMKASARSRDPRLRFANSDAGALDLNQRPLTAVHNGPKVEP-GDPTSSRKQRIVEEP 607 Query: 263 VSDGQSQNFKRQRNGLTDSLITGYVPMVSG------DRSTVGTQVTDKNILAKNMGTDPR 102 DG + KRQR+ + I V SG D T G Q+ +KN L +N DPR Sbjct: 608 NLDGPA--LKRQRHAFVSAKID--VKTASGVGGWLEDNGTTGPQIMNKNQLVENAEADPR 663 Query: 101 ES---------EKGEN---ERLPMIGPSTMASLPSLLRDIAVNPT 3 +S G N E++P+ G ST +LP++L+DIAVNPT Sbjct: 664 KSIHLVNGPIMNNGPNIGKEQVPVTGTSTPDALPAILKDIAVNPT 708 >ref|XP_007043830.1| RNA polymerase II C-terminal domain phosphatase-like 3, putative [Theobroma cacao] gi|508707765|gb|EOX99661.1| RNA polymerase II C-terminal domain phosphatase-like 3, putative [Theobroma cacao] Length = 1290 Score = 317 bits (813), Expect = 1e-83 Identities = 261/697 (37%), Positives = 340/697 (48%), Gaps = 103/697 (14%) Frame = -3 Query: 1784 IEEGEISDNSQSIEAIAEEDF-KQESKV-----------SNRGSRVW-MDDMLKYP-ISS 1647 +EEGEISD S SIE I+EEDF KQ+ K+ +N SRVW M D+ KYP + Sbjct: 34 VEEGEISD-SASIEEISEEDFNKQDVKILKESKSSKGGEANSNSRVWTMQDLCKYPSVIR 92 Query: 1646 
NYGSGLYNFAWAQAVQNKPLSEILMRDFGSIEK-----SKRS------------------ 1536 Y SGLYNFAWAQAVQNKPL+EI ++DF ++ SKRS Sbjct: 93 GYASGLYNFAWAQAVQNKPLNEIFVKDFEQPQQDENKNSKRSSPSSSVASVNSKEEKGSS 152 Query: 1535 -------IIDDSRADXXXXXXXXXXXXXXXXXXXXXXXXXXXEGGLSNNSNLARNLEERE 1377 +IDD D +S E Sbjct: 153 GNLAVKVVIDDDSEDEMEEDKVVNLDKEEGELEEGEIDLDSEPKEKVLSSEDGNVGNSDE 212 Query: 1376 FENRIKSIREALGTVTVKDAEISFHGVCXXXXXXXXXXXLMIMENGTLDVDDLIQQSFDG 1197 E R IR L VTV +AE SF GVC +I+E D LIQ +F Sbjct: 213 LEKRANLIRGVLEGVTVIEAEKSFEGVCSRLHNALESLRALILECSVPAKDALIQLAFG- 271 Query: 1196 IQAINSVFCSMNPKQLEQNKDIFSSLLSHVMSQDTTLFSPKQMKEIEAMMPFMDLQAVVP 1017 AINS F ++N EQN I S LLS V D +LF P +MKEI+ M+ ++ A Sbjct: 272 --AINSAFVALNCNSKEQNVAILSRLLSIVKGHDPSLFPPDKMKEIDVMLISLNSPA--- 326 Query: 1016 SVKAAEKEKEIQVNNGVNPNELGILGEN-----------PSSKKFLLEPIPVIANMGFEI 870 +A + EK+++V +GVN + L EN PSS KF++ P N E Sbjct: 327 --RAIDTEKDMKVVDGVNKKDPDALPENICHDLTVTNKLPSSAKFVINNKP---NALTET 381 Query: 869 KPXXXXXXXXXXXXXXXLDLHRKHDVDSLPSPTRET----PLPSPL------VKSELATP 720 LDLH+ HD DSLPSPTRET P+ PL VKS T Sbjct: 382 LKPGVPNFRNRGISLPLLDLHKDHDADSLPSPTRETTPCLPVNKPLTSGDVMVKSGFMTG 441 Query: 719 NVTDESEDAMMYRYETDALKAFSTYQQKFGRTSNFLTDQLPSPTPSEECDEVDFDLSGEV 540 + ++E ++ YETDALKAFSTYQQKFG+ S F +D+LPSPTPSEE + D GEV Sbjct: 442 KGSHDAEGDKLHPYETDALKAFSTYQQKFGQGSFFSSDRLPSPTPSEESGDEGGDNGGEV 501 Query: 539 SSSSTVGNVRTVNPSVSLQPVSSPTAHMDSSSG--------------QTGSNLVLK--AK 408 SSSS++GN + P + P+ S +DS+S + SN+V K AK Sbjct: 502 SSSSSIGNFKPNLPILG-HPIVSSAPLVDSASSSLQGQITTRNATPMSSVSNIVSKSLAK 560 Query: 407 SRDPRLRFTNSEGDASVLNQYPLLEDAPKSETLGGSISSRKHTIIVESVSDGQSQNFKRQ 228 SRDPRL F NS A LN+ LL +A K +GG + SRK + E + D S KRQ Sbjct: 561 SRDPRLWFANSNASALDLNE-RLLHNASKVAPVGGIMDSRKKKSVEEPILD--SPALKRQ 617 Query: 227 RNGLTDSLITGYVPMVSG------DRSTVGTQVTDKNILAKNMGTDPRESEK-------- 90 RN L + + V VSG D +G+Q+T++N A+N+ ++ R+ + Sbjct: 618 RNELENLGVARDVQTVSGIGGWLEDTDAIGSQITNRNQTAENLESNSRKMDNGVTSSSTL 677 Query: 89 --------GENERLPMIGPSTMASLPSLLRDIAVNPT 3 G NE++P+ ST SLP+LL+DIAVNPT Sbjct: 678 
SGKTNITVGTNEQVPVTSTST-PSLPALLKDIAVNPT 713 >ref|XP_006438860.1| hypothetical protein CICLE_v10030535mg [Citrus clementina] gi|568858958|ref|XP_006483010.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Citrus sinensis] gi|557541056|gb|ESR52100.1| hypothetical protein CICLE_v10030535mg [Citrus clementina] Length = 1234 Score = 307 bits (786), Expect = 1e-80 Identities = 243/687 (35%), Positives = 333/687 (48%), Gaps = 93/687 (13%) Frame = -3 Query: 1784 IEEGEISDNSQSIEAIAEEDFK----------QESKVSNRG-----SRVW-MDDML-KYP 1656 +EEGEISD + S+E I+EEDFK +E+K G +RVW M D+ KYP Sbjct: 5 VEEGEISDTA-SVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNKYP 63 Query: 1655 -ISSNYGSGLYNFAWAQAVQNKPLSEILMRDFGSIEKSKRS--------IIDDSRADXXX 1503 I YG GL+N AWAQAVQNKPL+EI + + + SKRS + + A Sbjct: 64 AICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVASVNSGAAAGKDD 123 Query: 1502 XXXXXXXXXXXXXXXXXXXXXXXXEGGLS------NNSNLARNLEEREFENRIKSIREAL 1341 EG + +N ++ ++E ++SIREAL Sbjct: 124 KKVVEKVVIDDSGDEIEKEEGELEEGEIELDLESESNEKVSEQVKEEMKLINVESIREAL 183 Query: 1340 GTVTVKDAEISFHGVCXXXXXXXXXXXLMIMENGTLDVDDLIQQSFDGIQAINSVFCSMN 1161 +V D ISF GVC ++ EN D LIQ +F +Q+++SVFCSMN Sbjct: 184 ESVLRGD--ISFEGVCSKLEFTLESLRELVNENNVPTKDALIQLAFSAVQSVHSVFCSMN 241 Query: 1160 PKQLEQNKDIFSSLLSHVMSQDTTLFSPKQMKEIEAMMPFMDLQAVVPSVKAAEKEKEIQ 981 EQNK+I S LLS + S + LFS Q+KE+EAM+ + +A +KEK++ Sbjct: 242 HVLKEQNKEILSRLLSVIKSHEPPLFSSNQIKEMEAMLSSL-------VTRANDKEKDML 294 Query: 980 VNNGVNPNELGILGENPSSKKFLLEPIPV-----IANMGFEIKPXXXXXXXXXXXXXXXL 816 +GVN + I+ EN + E +P+ + N E L Sbjct: 295 AMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNKPLEASKPGPPGYRSRGVLLPLL 354 Query: 815 DLHRKHDVDSLPSPTRETPLPSPL----------VKSELATPNVTDESEDAMMYRYETDA 666 D H+ HDVDSLPSPTRET P+ VKS A ++ +E YETDA Sbjct: 355 DPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGVVKSWAAAAKLSHNAEVHKTPHYETDA 414 Query: 665 LKAFSTYQQKFGRTSNFLTDQLPSPTPSEECDEVDFDLSGEVSSSSTVGNVRTVN-PSVS 489 L+AFS+YQQKFGR S F+ +LPSPTPSEE + D D GE+SS++ V + VN P++ Sbjct: 415 
LRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAVDQPKPVNMPTLG 474 Query: 488 LQPVSSP----------------TAHMDSSSGQTGSNLVLK--------AKSRDPRLRFT 381 QPVSS T +S+ +G N V+K KSRDPRLRF Sbjct: 475 QQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVVKAPIKSRDPRLRFA 534 Query: 380 NSEGDASVLNQYPLLEDAPKSETLGGSISSRKHTIIVESVSDGQSQNFKRQRNGLTDSLI 201 +S P+L +APK E +G +SSRK + E V DG + KRQRNG +S + Sbjct: 535 SSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPA--LKRQRNGFENSGV 592 Query: 200 TGYVPMVSG------DRSTVGTQVTDKNILAKNMGTDPRESEKGE--------------- 84 + G D Q+ ++N+L + ++ R+ + G Sbjct: 593 VRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGATSPITSGTPNVVVSG 652 Query: 83 NERLPMIGPSTMASLPSLLRDIAVNPT 3 NE P PST SLP+LL+DIAVNPT Sbjct: 653 NEPAPATTPSTTVSLPALLKDIAVNPT 679 >ref|XP_006438858.1| hypothetical protein CICLE_v10030535mg [Citrus clementina] gi|557541054|gb|ESR52098.1| hypothetical protein CICLE_v10030535mg [Citrus clementina] Length = 1208 Score = 307 bits (786), Expect = 1e-80 Identities = 243/687 (35%), Positives = 333/687 (48%), Gaps = 93/687 (13%) Frame = -3 Query: 1784 IEEGEISDNSQSIEAIAEEDFK----------QESKVSNRG-----SRVW-MDDML-KYP 1656 +EEGEISD + S+E I+EEDFK +E+K G +RVW M D+ KYP Sbjct: 5 VEEGEISDTA-SVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNKYP 63 Query: 1655 -ISSNYGSGLYNFAWAQAVQNKPLSEILMRDFGSIEKSKRS--------IIDDSRADXXX 1503 I YG GL+N AWAQAVQNKPL+EI + + + SKRS + + A Sbjct: 64 AICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVASVNSGAAAGKDD 123 Query: 1502 XXXXXXXXXXXXXXXXXXXXXXXXEGGLS------NNSNLARNLEEREFENRIKSIREAL 1341 EG + +N ++ ++E ++SIREAL Sbjct: 124 KKVVEKVVIDDSGDEIEKEEGELEEGEIELDLESESNEKVSEQVKEEMKLINVESIREAL 183 Query: 1340 GTVTVKDAEISFHGVCXXXXXXXXXXXLMIMENGTLDVDDLIQQSFDGIQAINSVFCSMN 1161 +V D ISF GVC ++ EN D LIQ +F +Q+++SVFCSMN Sbjct: 184 ESVLRGD--ISFEGVCSKLEFTLESLRELVNENNVPTKDALIQLAFSAVQSVHSVFCSMN 241 Query: 1160 PKQLEQNKDIFSSLLSHVMSQDTTLFSPKQMKEIEAMMPFMDLQAVVPSVKAAEKEKEIQ 981 EQNK+I S LLS + S + LFS Q+KE+EAM+ + +A +KEK++ Sbjct: 242 HVLKEQNKEILSRLLSVIKSHEPPLFSSNQIKEMEAMLSSL-------VTRANDKEKDML 294 
Query: 980 VNNGVNPNELGILGENPSSKKFLLEPIPV-----IANMGFEIKPXXXXXXXXXXXXXXXL 816 +GVN + I+ EN + E +P+ + N E L Sbjct: 295 AMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNKPLEASKPGPPGYRSRGVLLPLL 354 Query: 815 DLHRKHDVDSLPSPTRETPLPSPL----------VKSELATPNVTDESEDAMMYRYETDA 666 D H+ HDVDSLPSPTRET P+ VKS A ++ +E YETDA Sbjct: 355 DPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGVVKSWAAAAKLSHNAEVHKTPHYETDA 414 Query: 665 LKAFSTYQQKFGRTSNFLTDQLPSPTPSEECDEVDFDLSGEVSSSSTVGNVRTVN-PSVS 489 L+AFS+YQQKFGR S F+ +LPSPTPSEE + D D GE+SS++ V + VN P++ Sbjct: 415 LRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAVDQPKPVNMPTLG 474 Query: 488 LQPVSSP----------------TAHMDSSSGQTGSNLVLK--------AKSRDPRLRFT 381 QPVSS T +S+ +G N V+K KSRDPRLRF Sbjct: 475 QQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVVKAPIKSRDPRLRFA 534 Query: 380 NSEGDASVLNQYPLLEDAPKSETLGGSISSRKHTIIVESVSDGQSQNFKRQRNGLTDSLI 201 +S P+L +APK E +G +SSRK + E V DG + KRQRNG +S + Sbjct: 535 SSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPA--LKRQRNGFENSGV 592 Query: 200 TGYVPMVSG------DRSTVGTQVTDKNILAKNMGTDPRESEKGE--------------- 84 + G D Q+ ++N+L + ++ R+ + G Sbjct: 593 VRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGATSPITSGTPNVVVSG 652 Query: 83 NERLPMIGPSTMASLPSLLRDIAVNPT 3 NE P PST SLP+LL+DIAVNPT Sbjct: 653 NEPAPATTPSTTVSLPALLKDIAVNPT 679 >ref|XP_006438857.1| hypothetical protein CICLE_v10030535mg [Citrus clementina] gi|567892677|ref|XP_006438859.1| hypothetical protein CICLE_v10030535mg [Citrus clementina] gi|557541053|gb|ESR52097.1| hypothetical protein CICLE_v10030535mg [Citrus clementina] gi|557541055|gb|ESR52099.1| hypothetical protein CICLE_v10030535mg [Citrus clementina] Length = 1118 Score = 307 bits (786), Expect = 1e-80 Identities = 243/687 (35%), Positives = 333/687 (48%), Gaps = 93/687 (13%) Frame = -3 Query: 1784 IEEGEISDNSQSIEAIAEEDFK----------QESKVSNRG-----SRVW-MDDML-KYP 1656 +EEGEISD + S+E I+EEDFK +E+K G +RVW M D+ KYP Sbjct: 5 VEEGEISDTA-SVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNKYP 63 Query: 1655 -ISSNYGSGLYNFAWAQAVQNKPLSEILMRDFGSIEKSKRS--------IIDDSRADXXX 
1503 I YG GL+N AWAQAVQNKPL+EI + + + SKRS + + A Sbjct: 64 AICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVASVNSGAAAGKDD 123 Query: 1502 XXXXXXXXXXXXXXXXXXXXXXXXEGGLS------NNSNLARNLEEREFENRIKSIREAL 1341 EG + +N ++ ++E ++SIREAL Sbjct: 124 KKVVEKVVIDDSGDEIEKEEGELEEGEIELDLESESNEKVSEQVKEEMKLINVESIREAL 183 Query: 1340 GTVTVKDAEISFHGVCXXXXXXXXXXXLMIMENGTLDVDDLIQQSFDGIQAINSVFCSMN 1161 +V D ISF GVC ++ EN D LIQ +F +Q+++SVFCSMN Sbjct: 184 ESVLRGD--ISFEGVCSKLEFTLESLRELVNENNVPTKDALIQLAFSAVQSVHSVFCSMN 241 Query: 1160 PKQLEQNKDIFSSLLSHVMSQDTTLFSPKQMKEIEAMMPFMDLQAVVPSVKAAEKEKEIQ 981 EQNK+I S LLS + S + LFS Q+KE+EAM+ + +A +KEK++ Sbjct: 242 HVLKEQNKEILSRLLSVIKSHEPPLFSSNQIKEMEAMLSSL-------VTRANDKEKDML 294 Query: 980 VNNGVNPNELGILGENPSSKKFLLEPIPV-----IANMGFEIKPXXXXXXXXXXXXXXXL 816 +GVN + I+ EN + E +P+ + N E L Sbjct: 295 AMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNKPLEASKPGPPGYRSRGVLLPLL 354 Query: 815 DLHRKHDVDSLPSPTRETPLPSPL----------VKSELATPNVTDESEDAMMYRYETDA 666 D H+ HDVDSLPSPTRET P+ VKS A ++ +E YETDA Sbjct: 355 DPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGVVKSWAAAAKLSHNAEVHKTPHYETDA 414 Query: 665 LKAFSTYQQKFGRTSNFLTDQLPSPTPSEECDEVDFDLSGEVSSSSTVGNVRTVN-PSVS 489 L+AFS+YQQKFGR S F+ +LPSPTPSEE + D D GE+SS++ V + VN P++ Sbjct: 415 LRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAVDQPKPVNMPTLG 474 Query: 488 LQPVSSP----------------TAHMDSSSGQTGSNLVLK--------AKSRDPRLRFT 381 QPVSS T +S+ +G N V+K KSRDPRLRF Sbjct: 475 QQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVVKAPIKSRDPRLRFA 534 Query: 380 NSEGDASVLNQYPLLEDAPKSETLGGSISSRKHTIIVESVSDGQSQNFKRQRNGLTDSLI 201 +S P+L +APK E +G +SSRK + E V DG + KRQRNG +S + Sbjct: 535 SSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPA--LKRQRNGFENSGV 592 Query: 200 TGYVPMVSG------DRSTVGTQVTDKNILAKNMGTDPRESEKGE--------------- 84 + G D Q+ ++N+L + ++ R+ + G Sbjct: 593 VRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGATSPITSGTPNVVVSG 652 Query: 83 NERLPMIGPSTMASLPSLLRDIAVNPT 3 NE P PST SLP+LL+DIAVNPT Sbjct: 653 NEPAPATTPSTTVSLPALLKDIAVNPT 679 >ref|XP_004310239.1| PREDICTED: RNA polymerase II C-terminal 
domain phosphatase-like 3-like [Fragaria vesca subsp. vesca] Length = 1230 Score = 281 bits (719), Expect = 8e-73 Identities = 241/672 (35%), Positives = 321/672 (47%), Gaps = 78/672 (11%) Frame = -3 Query: 1784 IEEGEISDNSQSIEAIAEEDF-KQESKV--------SNRGSRVW-MDDMLKYPISSNYGS 1635 +EEGEI D S S+E I+EEDF KQESK S G+R W ++L +P G Sbjct: 13 VEEGEIPD-SNSVEEISEEDFVKQESKAVEPKSNGGSGDGARFWTFHEVLAHPHFRGIGG 71 Query: 1634 G-LYNFAWAQAVQNKPLSEILMRDFGSIEKSKRSIIDDSRADXXXXXXXXXXXXXXXXXX 1458 G L N AWAQAVQNKP +++L++ S EKSK+ S Sbjct: 72 GGLANLAWAQAVQNKPFNDLLVK-LDSDEKSKQQQQQRSSVSSGNEKVVIIDSGDEMDVE 130 Query: 1457 XXXXXXXXXEGGLSN----NSNLARNLEEREFENRIKSIREALGTVTVKDAEISFHGVCX 1290 E G + N A ++ +E R+ +REAL ++T+ +AE SF VC Sbjct: 131 KEEEELEEGEIGFDSECGDNDKAAGSVGNGVWEKRVNLLREALESLTITEAEKSFGDVCH 190 Query: 1289 XXXXXXXXXXLMIMENGTLDVDDLIQQSFDGIQAINSVFCSMNPKQLEQNKDIFSSLLSH 1110 ++ E + L+QQ F+ ++AI+SVF SM+ Q EQNKD+ S +LS Sbjct: 191 RFLDSLESLRGVLSEINVSTKEALVQQLFNAVRAISSVFRSMSADQKEQNKDVLSRILSS 250 Query: 1109 VMSQDTTLFSPKQMKEIEAMMPFMDLQAVVPSVKAAEKEKEIQVNNGVNPNELGILGENP 930 S D + F +Q+KEIE M MD P KA KE IQ NGV + G N Sbjct: 251 AKS-DPSPFPAEQLKEIEVMSSSMD----SPQTKAGTKENGIQCINGVYKTDSDTSGANA 305 Query: 929 S---------SKKFLLEPIPVIANMGFEIKPXXXXXXXXXXXXXXXLDLHRKHDVDSLPS 777 S + + N+ E+ LDLH HD DSLPS Sbjct: 306 SHVFTYAANTGSDTQVSVVHSNPNISSEVPRSGSSSFKGRGLMLPLLDLHMDHDEDSLPS 365 Query: 776 PTRETPLPSP-----------LVKSELATPNVTDESEDAMMYRYETDALKAFSTYQQKFG 630 PTRE P P + KS T + E + M+ YET+ALKA S+YQQKF Sbjct: 366 PTREPPACFPAQKPVVVENGMVKKSGWETARAALDVEGSKMHVYETEALKAVSSYQQKFS 425 Query: 629 RTSNFLTDQLPSPTPS-EECDEVDFDLSGEVSSSSTVGNVRTVNPSVSLQPV--SSPTAH 459 R S FLT +LPSPTPS EE D D GEVSSSS NVRT P VS + V S P Sbjct: 426 RNS-FLTSELPSPTPSEEEGDNGDDAAVGEVSSSSASNNVRTPQPPVSGRQVVSSVPATT 484 Query: 458 MDSSSG-------------QTGSNLVLK--AKSRDPRLRFTNSEGDASVLNQYPLLE--D 330 + SSG GSN+ K AKSRDPRLRF NS+ A LNQ ++ + Sbjct: 485 LPGSSGMHGLITAKTASPVSLGSNMPNKSSAKSRDPRLRFANSDAGALTLNQQSSIQVHN 544 Query: 329 
APKSETLGGSISSRKHTIIVESVSDGQSQNFKRQRNGLTDSLITGYVPMVS-------GD 171 APK +++ ++SSRKH +S DG KRQR + + G+ S D Sbjct: 545 APKVDSV-ITLSSRKHKSPEDSNFDGPES--KRQRGA---NSVVGWGAKTSFGNGVWLED 598 Query: 170 RSTVGTQVTDKNILAKNMGTDPRE----------------SEKGENERLPMIGPSTMASL 39 S+VG + ++N + DPR+ + NE++P++ PS + SL Sbjct: 599 GSSVGPHLINRNQTVEKKEADPRKMVNVSSSPGTVEGNSNGQNTANEKVPLVAPS-LVSL 657 Query: 38 PSLLRDIAVNPT 3 P++ +DIAVNPT Sbjct: 658 PAIFKDIAVNPT 669 >ref|XP_002297869.2| CTD phosphatase-like protein 3 [Populus trichocarpa] gi|550347145|gb|EEE82674.2| CTD phosphatase-like protein 3 [Populus trichocarpa] Length = 1190 Score = 274 bits (700), Expect = 1e-70 Identities = 226/663 (34%), Positives = 307/663 (46%), Gaps = 69/663 (10%) Frame = -3 Query: 1784 IEEGEISDNSQSIEAIAEEDFKQESKV--------SNRGSRVW-MDDMLKYPISSNYGSG 1632 +EEGEISD + S+E I+EEDF ++ V +N +VW + D+ KY + Y SG Sbjct: 18 VEEGEISDTA-SVEEISEEDFNKQEVVIVKETPSSNNSSQKVWTVRDLYKYQVGGGYMSG 76 Query: 1631 LYNFAWAQAVQNKPLSEI-LMRDFGSIEKSKRSIIDDSRADXXXXXXXXXXXXXXXXXXX 1455 LYN AWA+AVQNKPL+E+ ++ D E +ID + + Sbjct: 77 LYNLAWARAVQNKPLNELTVVIDDSGDEMDVVKVIDIEKEEGELEEGEIDLDSEPVVVQ- 135 Query: 1454 XXXXXXXXEGGLSNNSNLARNLEEREFENRIKSIREALGTVTVKDAEISFHGVCXXXXXX 1275 + + + ENR+KSIR+ L +V+V + E SF VC Sbjct: 136 ------------------SEGMVSVDVENRVKSIRKDLESVSVIETEKSFEAVCLKLHKV 177 Query: 1274 XXXXXLMI--MENGTLDVDDLIQQSFDGIQAINSVFCSMNPKQLEQNKDIFSSLLSHVMS 1101 ++ +N D L+Q F I+ +NSVFCSMN K EQNK +FS S + S Sbjct: 178 LESLKELVGGNDNSFPSKDGLVQLLFMAIRVVNSVFCSMNKKLKEQNKGVFSRFFSLLNS 237 Query: 1100 QDTTLFSPKQMKEIE--------AMMPFMDLQAVVPSVKAAEKEKEIQVNNGVN-PNELG 948 FSP Q KE+ A DL + + AAE + + N + P G Sbjct: 238 HYPPFFSPGQNKEVLNENHNDSLAKTAGYDLTTMSEKLPAAETFVQNKPNKSIEAPKPPG 297 Query: 947 ILGENPSSK-KFLLEPIPVIANMGFEIKPXXXXXXXXXXXXXXXLDLHRKHDVDSLPSPT 771 + PS K + +L P+ LDL + HD DSLPSPT Sbjct: 298 V----PSFKSRGVLLPL---------------------------LDLKKYHDEDSLPSPT 326 Query: 770 RET-PLP--------SPLVKSELATPNVTDESEDAMMYRYETDALKAFSTYQQKFGRTSN 618 +ET P P +V S L P VT +E+ M+ YETDALKA S+YQQKF R S 
Sbjct: 327 QETTPFPVQRLLAIGDGMVSSGLPVPKVTPVAEEPRMHPYETDALKAVSSYQQKFNRNS- 385 Query: 617 FLTDQLPSPTPSEECDEVDFDLSGEVSSSSTVGNVRTVNPSVSLQ--------PVSSPTA 462 F T++LPSPTPSEE D D +GEVSSSSTV N RTVNP VS Q P+ P Sbjct: 386 FFTNELPSPTPSEESGNGDGDTAGEVSSSSTVVNYRTVNPPVSDQKNAPPSPPPLPPPPP 445 Query: 461 HMDSS--------------SGQTGSNLVLKAKSRDPRLRFTNSEGDASVLNQ--YPLLED 330 H DSS S S + AKSRDPRLR+ N + A NQ P++ + Sbjct: 446 HPDSSNIRGVVPTRNSAPVSSGPSSTIKASAKSRDPRLRYVNIDACALDHNQRALPMVNN 505 Query: 329 APKSETLGGSISSRKHTIIVESVSDGQSQNFKRQRNGLTDSLITGYVPMVSGDRSTV-GT 153 P+ E G + S+KH I + + D + KRQRN + + ++G + T Sbjct: 506 LPRVEPAGAIVGSKKHKIEEDVLDD---PSLKRQRNSFDNYGAVRDIESMTGTGGWLEDT 562 Query: 152 QVTDKNILAKNMGTDPRESEKGENERLPMIGPSTM-------------ASLPSLLRDIAV 12 + + + KN + N + P +G S + SLP LL+DIAV Sbjct: 563 DMAEPQTVNKNQWAENSNVNGSGNAQSPFMGISNITGSEQAQVTSTATTSLPDLLKDIAV 622 Query: 11 NPT 3 NPT Sbjct: 623 NPT 625 >ref|XP_004157633.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain phosphatase-like 3-like [Cucumis sativus] Length = 1249 Score = 265 bits (678), Expect = 4e-68 Identities = 238/689 (34%), Positives = 329/689 (47%), Gaps = 95/689 (13%) Frame = -3 Query: 1784 IEEGEISDNSQSIEAIAEEDFKQ-----------ESKVSNRGSRVW-MDDMLK-YP-ISS 1647 +EEGEISD + S+E I+EEDF + SK SNR +RVW M D+ K YP + Sbjct: 12 VEEGEISDTA-SVEEISEEDFNKLDSSASPKVVVPSKDSNRETRVWTMSDLYKNYPAMRH 70 Query: 1646 NYGSGLYNFAWAQAVQNKPLSEILMRDFGSIEKSKRS----------------------I 1533 Y SGLYN AWAQAVQNKPL++I + + EKSK S + Sbjct: 71 GYASGLYNLAWAQAVQNKPLNDIFVMEADLDEKSKHSSSTPFGNAKDDGSNTTKEEDRVV 130 Query: 1532 IDDSRADXXXXXXXXXXXXXXXXXXXXXXXXXXXE------GGLSNNSNLARN-----LE 1386 IDDS + E LS++ ++ N LE Sbjct: 131 IDDSGDEMNCDNANGEKEEGELEEGEIDMDTEFVEEVADSKAMLSDSRDMDINGQEFDLE 190 Query: 1385 EREFENRIKSIREALGTVTVKDAEISFHGVCXXXXXXXXXXXLMIMENGTLDVDDLIQQS 1206 +E + +K I++ L VT+ A+ SF VC ++ D LIQ+ Sbjct: 191 TKELDELLKFIQKTLDGVTIDAAQKSFQEVCSQIHSSIETFVELLQGKVVPRKDALIQRL 250 Query: 1205 FDGIQAINSVFCSMNPKQLEQNKDIFSSLLSHVMSQDTTLFSPKQMKEIEAMMPFMDLQA 1026 + ++ 
INSVFCSMN + E++K+ S LLS+V + D LFSP+Q+K +E MP D Sbjct: 251 YAALRLINSVFCSMNLSEKEEHKEHLSRLLSYVKNCDPPLFSPEQIKSVEVKMPSTDSLD 310 Query: 1025 VVPSVKAAEKEKEIQVNNGVNPNELGILGENPSSK-----KFLLEPIPVIA------NMG 879 +PS++ + KE EI + NGV + + SS+ K + IP N+ Sbjct: 311 HLPSMRGSAKEVEIHIPNGVKDMDFYSAYTSTSSQLTPSNKLASDSIPFGVKGKNNLNIL 370 Query: 878 FEIKPXXXXXXXXXXXXXXXLDLHRKHDVDSLPSPTRETPLPSPLVKSELATPNVTDESE 699 E LDLH+ HD DSLPSPTRE P + KS A + + Sbjct: 371 SEGLQSGVSSIKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQKSGNAPTKMAFPVD 430 Query: 698 DAMMYRYETDALKAFSTYQQKFGRTSNFLTDQLPSPTPSEECDEVDFDLSGEVSSSSTVG 519 + + YETDALKA STYQQKFGR+S + D+LPSPTPSEE D D+ GEVSSSS + Sbjct: 431 GSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEEHDG-GGDIGGEVSSSSIIR 489 Query: 518 NVRTVNPSVSLQPVSSPT-------AHMDSSSGQ------------TGSNLVLK--AKSR 402 ++++ N S Q +S + +MDSSS + + SN +K AKSR Sbjct: 490 SLKSSNVSKPGQKSNSASNVSTGLFPNMDSSSTRVLISPLNVAPPSSVSNPTVKPLAKSR 549 Query: 401 DPRLRFTNSEGDASVLNQYPLLEDAPKSETL---GGSISSRKHTIIVESVSDGQSQNFKR 231 DPRLR NS DAS ++ P + +S ++ ++ RK + E +DG KR Sbjct: 550 DPRLRIVNS--DASGMDLNPRTMASVQSSSILESAATLHLRKQKMDGEPNTDG--PEVKR 605 Query: 230 QRNGLTD-SLITGYVPMVSG------DRSTVGTQVTDKNIL------AKNMGTDPRESEK 90 R G + ++ V VSG D G ++ ++N + A S Sbjct: 606 LRIGSQNLAVAASDVRAVSGSGGWLEDTMPAGPRLFNRNQMEIAEANATEKSNVTNNSGS 665 Query: 89 GENERLPMIGPSTMASLPSLLRDIAVNPT 3 G NE P + S ASLPSLL+DI VNPT Sbjct: 666 G-NECTPTVNNSNDASLPSLLKDIVVNPT 693 >ref|XP_004140651.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Cucumis sativus] Length = 1249 Score = 265 bits (678), Expect = 4e-68 Identities = 238/689 (34%), Positives = 329/689 (47%), Gaps = 95/689 (13%) Frame = -3 Query: 1784 IEEGEISDNSQSIEAIAEEDFKQ-----------ESKVSNRGSRVW-MDDMLK-YP-ISS 1647 +EEGEISD + S+E I+EEDF + SK SNR +RVW M D+ K YP + Sbjct: 12 VEEGEISDTA-SVEEISEEDFNKLDSSASPKVVVPSKDSNRETRVWTMSDLYKNYPAMRH 70 Query: 1646 NYGSGLYNFAWAQAVQNKPLSEILMRDFGSIEKSKRS----------------------I 1533 Y SGLYN AWAQAVQNKPL++I + + EKSK S + Sbjct: 71 
GYASGLYNLAWAQAVQNKPLNDIFVMEADLDEKSKHSSSTPFGNAKDDGSNTTKEEDRVV 130 Query: 1532 IDDSRADXXXXXXXXXXXXXXXXXXXXXXXXXXXE------GGLSNNSNLARN-----LE 1386 IDDS + E LS++ ++ N LE Sbjct: 131 IDDSGDEMNCDNANGEKEEGELEEGEIDMDTEFVEEVADSKAMLSDSRDMDINGQEFDLE 190 Query: 1385 EREFENRIKSIREALGTVTVKDAEISFHGVCXXXXXXXXXXXLMIMENGTLDVDDLIQQS 1206 +E + +K I++ L VT+ A+ SF VC ++ D LIQ+ Sbjct: 191 TKELDELLKFIQKTLDGVTIDAAQKSFQEVCSQIHSSIETFVELLQGKVVPRKDALIQRL 250 Query: 1205 FDGIQAINSVFCSMNPKQLEQNKDIFSSLLSHVMSQDTTLFSPKQMKEIEAMMPFMDLQA 1026 + ++ INSVFCSMN + E++K+ S LLS+V + D LFSP+Q+K +E MP D Sbjct: 251 YAALRLINSVFCSMNLSEKEEHKEHLSRLLSYVKNCDPPLFSPEQIKSVEVKMPSTDSLD 310 Query: 1025 VVPSVKAAEKEKEIQVNNGVNPNELGILGENPSSK-----KFLLEPIPVIA------NMG 879 +PS++ + KE EI + NGV + + SS+ K + IP N+ Sbjct: 311 HLPSMRGSAKEVEIHIPNGVKDMDFYSAYTSTSSQLTPSNKLASDSIPFGVKGKNNLNIL 370 Query: 878 FEIKPXXXXXXXXXXXXXXXLDLHRKHDVDSLPSPTRETPLPSPLVKSELATPNVTDESE 699 E LDLH+ HD DSLPSPTRE P + KS A + + Sbjct: 371 SEGLQSGVSSIKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQKSGNAPTKMAFPVD 430 Query: 698 DAMMYRYETDALKAFSTYQQKFGRTSNFLTDQLPSPTPSEECDEVDFDLSGEVSSSSTVG 519 + + YETDALKA STYQQKFGR+S + D+LPSPTPSEE D D+ GEVSSSS + Sbjct: 431 GSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEEHDG-GGDIGGEVSSSSIIR 489 Query: 518 NVRTVNPSVSLQPVSSPT-------AHMDSSSGQ------------TGSNLVLK--AKSR 402 ++++ N S Q +S + +MDSSS + + SN +K AKSR Sbjct: 490 SLKSSNVSKPGQKSNSASNVSTGLFPNMDSSSTRVLISPLNVAPPSSVSNPTVKPLAKSR 549 Query: 401 DPRLRFTNSEGDASVLNQYPLLEDAPKSETL---GGSISSRKHTIIVESVSDGQSQNFKR 231 DPRLR NS DAS ++ P + +S ++ ++ RK + E +DG KR Sbjct: 550 DPRLRIVNS--DASGMDLNPRTMASVQSSSILESAATLHLRKQKMDGEPNTDG--PEVKR 605 Query: 230 QRNGLTD-SLITGYVPMVSG------DRSTVGTQVTDKNIL------AKNMGTDPRESEK 90 R G + ++ V VSG D G ++ ++N + A S Sbjct: 606 LRIGSQNLAVAASDVRAVSGSGGWLEDTMPAGPRLFNRNQMEIAEANATEKSNVTNNSGS 665 Query: 89 GENERLPMIGPSTMASLPSLLRDIAVNPT 3 G NE P + S ASLPSLL+DI VNPT Sbjct: 666 G-NECTPTVNNSNDASLPSLLKDIVVNPT 693 >ref|XP_002304648.2| hypothetical protein POPTR_0003s16280g [Populus trichocarpa] 
gi|550343308|gb|EEE79627.2| hypothetical protein POPTR_0003s16280g [Populus trichocarpa] Length = 1247 Score = 260 bits (665), Expect = 1e-66 Identities = 234/706 (33%), Positives = 316/706 (44%), Gaps = 112/706 (15%) Frame = -3 Query: 1784 IEEGEISDNSQSIEAIAEEDF-KQE--------SKVSNRGS----RVW-MDDMLKYPISS 1647 +EEGEISD + S+E I+E+DF KQE S +N S +VW + D+ KY + Sbjct: 20 VEEGEISDTA-SVEEISEDDFNKQEVVVVKETPSSTTNNNSSSKQKVWTVRDLYKYQVGG 78 Query: 1646 NYGSGLYNFAWAQAVQNKPLSEILMR-------------DFGSIEKSKRSIIDDSRADXX 1506 Y SGLYN AWAQAVQNKPL+E+ + S ++ KR+++ D D Sbjct: 79 GYMSGLYNLAWAQAVQNKPLNELFVEVEVDDSSQKSSVSSVNSSKEDKRTVVIDDSGDEM 138 Query: 1505 XXXXXXXXXXXXXXXXXXXXXXXXXEGGLSNNSNLARNLEEREFENRIKSIREALGTVTV 1326 L + + + E R+KSIRE L +V+V Sbjct: 139 DVVKVIDIEKEEGELEEGEID-------LDSEGKSEGGMVSVDTEKRVKSIREDLESVSV 191 Query: 1325 KDAEISFHGVCXXXXXXXXXXXLMIM--ENGTLDVDDLIQQSFDGIQAINSVFCSMNPKQ 1152 + SF VC ++ ENG D L++ F I A+NS F SMN K Sbjct: 192 IKDDKSFEAVCLKLHNALESLKELVRVNENGFPSKDSLVRLLFTAIGAVNSFFSSMNQKL 251 Query: 1151 LEQNKDIFSSLLSHVMSQDTTLFSPKQMKEIEAMMPF--------MDLQAVVPSVKAAEK 996 EQNK +F LS V S D + FSP+ KE+ F DL + AAE Sbjct: 252 KEQNKGVFMRFLSLVNSHDPSFFSPEHTKEVCDFCNFDFRIVSLCYDLTTMNRLPSAAES 311 Query: 995 EKEIQVNNGVNPNELGILGENPSSK-KFLLEPIPVIANMGFEIKPXXXXXXXXXXXXXXX 819 + N + P + G+ PS K + +L P+ Sbjct: 312 FVHNKPNFSIEPPKPGV----PSFKSRGVLLPL--------------------------- 340 Query: 818 LDLHRKHDVDSLPSPTRET----------PLPSPLVKSELATPNVTDESEDAMMYRYETD 669 LDL + HD DSLPSPTRET P+ ++ S L P V +E+ ++ YETD Sbjct: 341 LDLKKFHDEDSLPSPTRETAPSFPVQRLLPIGDGMISSGLPVPKVASITEEPRVHPYETD 400 Query: 668 ALKAFSTYQQKFGRTSNFLTDQLPSPTPSEECDEVDFDLSGEVSSSSTVGNVRTVNPSVS 489 ALKA S+YQ+KF S F T++LPSPTPSEE D D +GEVSSSSTV N RTVNP VS Sbjct: 401 ALKAVSSYQKKFNLNS-FFTNELPSPTPSEESGNGDGDTAGEVSSSSTV-NYRTVNPPVS 458 Query: 488 LQPVSSPT--------------AHMDSS--------------SGQTGSNLVLKAKSRDPR 393 + +SP+ H+++S S T S + AKSRDPR Sbjct: 459 DRKSASPSPSPPPPPPPPPPPPPHLNNSSIRVVIPTRNSAPVSSGTSSTVKASAKSRDPR 518 Query: 392 
LRFTNSEGDASVLNQYPLL--EDAPKSETLGGSISSRKHTIIVESVSDGQSQNFKRQRNG 219 LR+ N++ A NQ LL + P++E G SRK I E V DG S KRQRN Sbjct: 519 LRYVNTDASALDQNQRTLLMVNNPPRAEPSGAIAGSRKQK-IEEDVLDGTS--LKRQRNS 575 Query: 218 LTDSLITGYVPMVSG------DRSTVGTQVTDKNILAKN---------------MGTDPR 102 + + + ++G D Q +KN A+N G+ Sbjct: 576 FDNFGVVRDIRSMTGTGGWLEDTDMAEPQTVNKNQWAENAEPGQRINNGVVCPSTGSVMS 635 Query: 101 ESEKGENERLPMIG-------------PSTMASLPSLLRDIAVNPT 3 N ++P++G +T ASLP LL+DI VNPT Sbjct: 636 SVSCSGNVQVPVMGINTIAGSEQAPVTSTTTASLPDLLKDITVNPT 681 >gb|EYU42076.1| hypothetical protein MIMGU_mgv1a000356mg [Mimulus guttatus] Length = 1220 Score = 249 bits (635), Expect = 4e-63 Identities = 226/678 (33%), Positives = 322/678 (47%), Gaps = 81/678 (11%) Frame = -3 Query: 1793 HDRIEEGEISDNSQSIEAIAEEDFKQESKV------------------------------ 1704 HD +EEGEISD S SIE I+EEDF + + Sbjct: 16 HD-VEEGEISD-SASIEEISEEDFNAKQALQPSPPPAPPLKSSLNSSHINVVTSNNNNNN 73 Query: 1703 SNR-----GSRVW-MDDMLKYPISSNYGSGLYNFAWAQAVQNKPLSEILM-RDFGSIEKS 1545 SN G+RVW M D+ +Y ++S + GLYN AWAQAV NK L E+LM ++ G+ ++S Sbjct: 74 SNNSAGGGGARVWTMKDLYEYQVASKHYPGLYNLAWAQAVNNKSLDEVLMMKEDGNNDRS 133 Query: 1544 KRSIIDDSRADXXXXXXXXXXXXXXXXXXXXXXXXXXXEGGLSNN---SNLARNLE--ER 1380 I D S + E L + N+ N+E Sbjct: 134 NGGISDTSSSKSSKTNDSKVVIDVEVEGGMEEGELEEGEIDLDSELVVRNMDFNVETNSN 193 Query: 1379 EFENRIKSIREALGTVTVKDAEISFHGVCXXXXXXXXXXXLMIMENGTLDVDDLIQQSFD 1200 E R+ SI+ L ++ V DA IS+H +C M++E + D L+Q Sbjct: 194 EKSRRVDSIKRELESLNVADAIISYHRLCSSLKNTIVSLQEMVLEGSFAEKDTLVQLLLT 253 Query: 1199 GIQAINSVFCSMNPKQLEQNKDIFSSLLSHVMSQDTTLFSPKQMKEIEAMMPFMDLQAVV 1020 IQ + SVF SM+PK EQNK I S LL+ V S LFSP Q+++ EA+ M+ Sbjct: 254 AIQTLYSVFSSMSPKLKEQNKPILSRLLARVTSLKPPLFSPLQLEKAEAIRFSME----- 308 Query: 1019 PSVKAAEKEKEIQVNNG---VNPNELGILGENPSSKKFLLEPIPVIA------------- 888 SV++ + NNG V +L +L E ++ L + + Sbjct: 309 SSVESFRNDS----NNGRERVGTADLHVLLETANTDSIDLRKCEIESGPSGSPDQTECRS 364 Query: 887 NMGFEIKPXXXXXXXXXXXXXXXLDLHRKHDVDSLPSPTRETPLPSP----------LVK 738 N+G I +DLH+ HD DSLPSPTR+ P P L+K Sbjct: 365 
NLGLVIS-------RHKGVTRPLIDLHKDHDADSLPSPTRDLSAPLPFDKGFIMGHGLLK 417 Query: 737 SELATPNVTDESEDAMMYRYETDALKAFSTYQQKFGRTSNFLTDQLPSPTPSEECD-EVD 561 E P E ++ +M+ YETDA+ A S+YQQKFGR+S F+ D+LPSPTPSE+ D Sbjct: 418 PEWPVPGRNIERDNILMHPYETDAVIAVSSYQQKFGRSSFFVNDKLPSPTPSEDGQTSGD 477 Query: 560 FDLSGEVSSSSTVGNVRTVNPSV----SLQPVSSPTAHMDSSSGQTGSNL----VLK--- 414 +++GEVSSS + VNP+V S+QPV S + MD+S+ SN VLK Sbjct: 478 GEINGEVSSSI----IHHVNPAVNILTSVQPVVSSSVAMDTSATPEISNSLRNPVLKSTS 533 Query: 413 AKSRDPRLRFTNSEGDASVLNQYPLLEDAPKSE-TLGGSISSRKHTIIVESVSDGQSQNF 237 AKSRDPRLR +NS+ A N+ + +S+ G +SSRK E V +G + Sbjct: 534 AKSRDPRLRLSNSDAGAKNPNKSLSAVGSEESKWESSGMVSSRKQKTNEELVLNGPA--L 591 Query: 236 KRQRNGLTDSLITGYVPMVSGDRSTVGTQVTDKNILAKNMGTDPRESEKGENERLPMIGP 57 KRQRN L+ + +P+VS ++ T I++ ++E+ P Sbjct: 592 KRQRNELSGP--STAMPLVSATSTSQMTLPVSAPIMS---------LLTSQSEKFPSKNS 640 Query: 56 STMASLPSLLRDIAVNPT 3 + +SL SLL+DIAV+P+ Sbjct: 641 NATSSLHSLLKDIAVDPS 658 >ref|XP_007139315.1| hypothetical protein PHAVU_008G019000g [Phaseolus vulgaris] gi|561012448|gb|ESW11309.1| hypothetical protein PHAVU_008G019000g [Phaseolus vulgaris] Length = 1272 Score = 240 bits (612), Expect = 2e-60 Identities = 211/682 (30%), Positives = 301/682 (44%), Gaps = 88/682 (12%) Frame = -3 Query: 1784 IEEGEISDNSQSIEAIAEEDF-KQESKVSNRG------SRVWM--DDMLKYP-ISSNYGS 1635 +EEGEISD + S+E I+E DF KQ+ KV+N +RVW D KYP I Y S Sbjct: 25 VEEGEISDTA-SVEEISEADFNKQDVKVNNNNKPNGSDARVWSVRDIYTKYPTICRGYAS 83 Query: 1634 GLYNFAWAQAVQNKPLSEILMRDFGS------------------IEKSKRSIIDDSRADX 1509 GLYN AWAQAVQNKPL++I + + S + + ++D R + Sbjct: 84 GLYNLAWAQAVQNKPLNDIFVMELDSEANANSNSNNSNRPSSVSVNPKEVMVVDVDREEG 143 Query: 1508 XXXXXXXXXXXXXXXXXXXXXXXXXXEGGLSNNSNLARNLEEREFENRIKSIREALGTVT 1329 +S++ +++ ++ +R+ L VT Sbjct: 144 ELEEGEIDADADPEAEAESVVAASVVSETVSDSEQFG--VKKGVSDSEQLGVRDVLEGVT 201 Query: 1328 VKDAEISFHGVCXXXXXXXXXXXLMIMENGTLDVDDLIQQSFDGIQAINSVFCSMNPKQL 1149 V + SF + + DDLI+ SF+ I+ + SVF SM+ Sbjct: 202 VANVAESF---AQTSSRLLNALPQVFSRPADSEKDDLIRLSFNAIEVVYSVFRSMDSSDK 258 Query: 1148 
EQNKDIFSSLLSHVMSQ-DTTLFSPKQMKEIEAMMPFMDLQAVVPSVKAAEKEKEIQVNN 972 EQNK+ LLS + LFSP+ +KEI+ MM +D + S +A E E+Q Sbjct: 259 EQNKNSILRLLSSAKDKKQAQLFSPEHIKEIQDMMTAIDSVGALGSNEAIYMETELQTPE 318 Query: 971 GVNPNELGILGENPSSKKFLLEPIPVIANMGFEIKPXXXXXXXXXXXXXXXL-------- 816 + E L K V + IKP Sbjct: 319 -IKSQENSALEVQTRGIKIQENQAVVATELVSSIKPLHSDIIGASRALKFGQNSIKGRGV 377 Query: 815 -----DLHRKHDVDSLPSPTRETPLPSP----------LVKSELATPNVTD-----ESED 696 DLH+ HD DSLPSPTRE P P +VKS A + +SE Sbjct: 378 LLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGEVMVKSGSAAAKMQPGKLEVDSEG 437 Query: 695 AMMYRYETDALKAFSTYQQKFGRTSNFLTDQLPSPTPSEECDEVDFDLSGEVSSSSTVGN 516 + + YETDALKA STYQQKFGR+S F D+LPSPTPS +CD++ D + EVSS+ST G Sbjct: 438 SKFHLYETDALKAVSTYQQKFGRSSLFTNDKLPSPTPSGDCDDMAVDTNEEVSSASTSGF 497 Query: 515 VRTVNPSVSLQPVSSPTAHMDS-----------SSGQTGSNLVLKAKSRDPRLRFTNSEG 369 + + P++ QP S T+ S ++G + AKSRDPR R NSE Sbjct: 498 LTSTKPTLLDQPPVSATSVDKSRLLGLISSRVDAAGSGSFPVKSSAKSRDPRRRLINSEA 557 Query: 368 DASVLNQYPLLEDAPKSETLGGSISSRKHTIIVESVSDGQSQNFKRQRNGLTDSLITGYV 189 A V NQ+ + + PK E G +IS ++ + S S+ K + + T V Sbjct: 558 SA-VDNQFTVTHNMPKVEYAGSTISRKQKAVEEPSFDLTVSKRLKSSLENIEHN--TSEV 614 Query: 188 PMVSG------DRSTVGTQVTDKNILAKNMGTDPRE-----SEKG---------ENERLP 69 ++G D + GTQ+ +KN L +P+ S G NE+ P Sbjct: 615 RTIAGSGGWLEDITGPGTQLIEKNHLIDKFAPEPKRTLNTVSSSGSVNFNATSIRNEQAP 674 Query: 68 MIGPSTMASLPSLLRDIAVNPT 3 + + +SLP++ +DI VNPT Sbjct: 675 ITSNNVPSSLPAIFKDIVVNPT 696 >ref|XP_006603006.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Glycine max] Length = 1257 Score = 237 bits (605), Expect = 1e-59 Identities = 212/669 (31%), Positives = 307/669 (45%), Gaps = 75/669 (11%) Frame = -3 Query: 1784 IEEGEISDNSQSIEAIAEEDF-KQESKVSNRGS-------RVWM--DDMLKYP-ISSNYG 1638 +EEGEISD + S+E I+ EDF KQ+ KV N + RVW D KYP I Y Sbjct: 25 VEEGEISDTA-SVEEISAEDFNKQDVKVLNNNNKPNGSDARVWAVHDLYSKYPTICRGYA 83 Query: 1637 SGLYNFAWAQAVQNKPLSEILMRDFGSIEKSKRSIIDDSR-------------ADXXXXX 1497 SGLYN AWAQAVQNKPL++I + + S + + + +R D Sbjct: 84 
SGLYNLAWAQAVQNKPLNDIFVMEVDSDANANSNSNNSNRLASVAVNPKDVVVVDVDKEE 143 Query: 1496 XXXXXXXXXXXXXXXXXXXXXXEGGLSNNSNLARNLEEREFENRIKSIREALGTVTVKDA 1317 + ++S +++ + +R L VTV + Sbjct: 144 GELEEGEIDADAEPEGEAESVVAVPVVSDSEKLDDVKRDVSNSEQLGVRGVLEGVTVANV 203 Query: 1316 EISFHGVCXXXXXXXXXXXLMIMENGTLDVDDLIQQSFDGIQAINSVFCSMNPKQLEQNK 1137 SF C ++ + DDL++ SF+ + + SVFCSM+ + EQNK Sbjct: 204 AESFAQTCSKLQNALPE---VLSRPADSERDDLVRLSFNATEVVYSVFCSMDSLKKEQNK 260 Query: 1136 DIFSSLLSHVMSQDTT-LFSPKQMKEIEAMMPFMDLQAVVPSVKAAEKEKEIQVNNGVNP 960 D LLS V Q LFSP+ +KEI+ MM +D + + +A KEKE+Q V Sbjct: 261 DSILRLLSFVKDQQQAQLFSPEHIKEIQGMMTAIDYFGALVNSEAIGKEKELQTT--VQT 318 Query: 959 NELGILGENPSSKKFLL---EPIPVIANMGFEIKPXXXXXXXXXXXXXXXLDLHRKHDVD 789 +E+ L+ +P+ LDLH+ HD D Sbjct: 319 HEIKTQENQAVEAAELISYNKPLHSDIIGASHALKFGQNSIKGRGVLLPLLDLHKDHDAD 378 Query: 788 SLPSPTRETP----------LPSPLVKSELATPNVTD-----ESEDAMMYRYETDALKAF 654 SLPSPTRE P + P+V S A +SE + + YETDALKA Sbjct: 379 SLPSPTREAPSCFPVNKLLSVGEPMVSSGSAAAKPESGKMELDSEGSKFHLYETDALKAV 438 Query: 653 STYQQKFGRTSNFLTDQLPSPTPSEECDEVDFDLSGEVSSSSTVGNVRTVNPS-VSLQPV 477 STYQQKFGR+S F D+ PSPTPS +C++ D + EVSS+ST + + P+ + L PV Sbjct: 439 STYQQKFGRSSLFTNDKFPSPTPSGDCEDEIVDTNEEVSSASTGDFLTSTKPTLLDLPPV 498 Query: 476 SSPTAHMDSSSGQTGS--------NLVLK--AKSRDPRLRFTNSEGDASVLNQYPLLEDA 327 S+ + S G S +L +K AK+RDPRLRF NS+ A V N L+ + Sbjct: 499 SATSTDRSSLHGFISSRVDAAGPGSLPVKSSAKNRDPRLRFVNSDASA-VDNPSTLIHNM 557 Query: 326 PKSETLGGSISSRKHTIIVESVSDGQSQNFKRQRNGL--TDSLITGYVPMVSG---DRST 162 PK E G +IS ++ S+ S KRQ++ L T+ ++ + G + + Sbjct: 558 PKVEYAGTTISRKQKAAEEPSLDVTVS---KRQKSPLENTEHNMSEVRTGIGGWLEEHTG 614 Query: 161 VGTQVTDKNILAKNMGTDPRE----------------SEKGENERLPMIGPSTMASLPSL 30 G Q ++N L G +P++ + NE+ P+ + +ASLP+L Sbjct: 615 PGAQFIERNHLMDKFGPEPQKTLNTVSSSCTGSDNFNATSIRNEQAPITSSNVLASLPAL 674 Query: 29 LRDIAVNPT 3 L+ AVNPT Sbjct: 675 LKGAAVNPT 683 >ref|XP_006341905.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Solanum tuberosum] Length = 1218 Score = 237 bits (604), Expect = 2e-59 
Identities = 208/667 (31%), Positives = 296/667 (44%), Gaps = 73/667 (10%) Frame = -3 Query: 1784 IEEGEISDNSQSIEAIAEEDFKQE----------------SKVSNRGSRVW-MDDMLKYP 1656 +EEGEISD S S+E I+E+ F ++ ++ S +RVW M D KYP Sbjct: 10 VEEGEISD-SASVEEISEDAFNRQDPPTTTKIKIASNENQNQNSTTTTRVWTMRDAYKYP 68 Query: 1655 ISSNYGSGLYNFAWAQAVQNKPLSEILMRDFGSIEKSKRSIIDDSRADXXXXXXXXXXXX 1476 IS +Y GLYN AWAQAVQNKPL E+ + + + S + ++ + Sbjct: 69 ISRDYARGLYNLAWAQAVQNKPLDELFVM---TSDNSNQCANANANVESKVIIDVDVDDD 125 Query: 1475 XXXXXXXXXXXXXXXEGGLSNNSNLARNLEEREFENRIKSIREALGTVTVKDAEISFHGV 1296 L N F +RE L +VT+ + SF V Sbjct: 126 AKEEGELEEGEIDLDAADLVLN-----------FGKEANFVREQLQSVTLDETHKSFSMV 174 Query: 1295 CXXXXXXXXXXXLMIMENGTLDVDDLIQQSFDGIQAINSVFCSMNPKQLEQNKDIFSSLL 1116 C + + D+ LIQ ++ INSVF SMN Q +QN DI S LL Sbjct: 175 CSKLQTSLLALGELALSQDKNDI--LIQLFMTALRTINSVFYSMNQDQKQQNTDILSRLL 232 Query: 1115 SHVMSQDTTLFSPKQMKEIEAMMPFMDLQAVVPSVKAAEKEKEIQVNNGVNPNELGILGE 936 H +Q L S +Q+KE++A++ ++ AV + + +K I+V ++ E Sbjct: 233 FHAKTQLPALLSSEQLKEVDAVILSINQSAVFSNTQDNDKVNGIKVVELLDKKVSHKSSE 292 Query: 935 NPSS-----KKFLLEPIPVIAN------MGFEIKPXXXXXXXXXXXXXXXLDLHRKHDVD 789 N + K+ L + + ++ + FE LDLH+ HD D Sbjct: 293 NANQDFTAVNKYDLGAVSIKSSGLKEQSVSFESVKPGLANSKAKGLSIPLLDLHKDHDED 352 Query: 788 SLPSPTRETPLPSPLVKSELATPNV---------TDESEDAMMYRYETDALKAFSTYQQK 636 +LPSPTRE P+ K+ A V + E +++++ YETDALKA S+YQQK Sbjct: 353 TLPSPTREIGPQFPVAKATQAHGMVKLDLPIFAGSLEKGNSLLHPYETDALKAVSSYQQK 412 Query: 635 FGRTSNFLTDQLPSPTPSEECDEVDFDLSGEVSSSSTVGNVRTVNPSVSLQPVSSPTAHM 456 FGR+S F+++ LPSPTPSEE D D+ GEV+S V N +N S QP+ S Sbjct: 413 FGRSSLFVSENLPSPTPSEEGDSGKGDIGGEVTSLDVVHNASHLNESSMGQPILSSVPQT 472 Query: 455 DSSSGQ------TGSNLVL---------KAKSRDPRLRFTNSEGDASVLNQ--YPLLEDA 327 + GQ T L AKSRDPRLR S+ A N+ P+ + Sbjct: 473 NILDGQGLGTARTADPLSFLPNPSLRSSTAKSRDPRLRLATSDAVAQNTNKNILPIPDID 532 Query: 326 PKSETLGGSISSRKHTIIVESVSDGQSQNFKRQRNGLTDSLITGYVPMVSG------DRS 165 K E I S+K + V KRQR+ TDS+I V +G DR Sbjct: 533 LKLEASLEMIGSKKQKTVDLPVFGAPLP--KRQRSEQTDSIIVSDVRPSTGNGGWLEDRG 590 Query: 
164 TVGTQVTDKNILAKNMGTDPRESEK-------------GENERLPMIGPSTMASLPSLLR 24 T G +T N + D R+ E+ E P+ G ST +L SLL+ Sbjct: 591 TAGLPITSSNCATDSSDNDIRKLEQVTATIATIPSVIVNAAENFPVTGISTSTTLHSLLK 650 Query: 23 DIAVNPT 3 DIA+NP+ Sbjct: 651 DIAINPS 657 >ref|XP_004492029.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like isoform X2 [Cicer arietinum] Length = 1227 Score = 233 bits (593), Expect = 3e-58 Identities = 207/660 (31%), Positives = 305/660 (46%), Gaps = 66/660 (10%) Frame = -3 Query: 1784 IEEGEISDNSQSIEAIAEEDFKQES--KVSNRG----------SRVWM--DDMLKYP-IS 1650 +EEGEISD + +E I+EEDF ++ KV+N +RVW D KYP I Sbjct: 25 VEEGEISDTASVVE-ISEEDFNKQDVVKVNNNSDSDKAKTGGDARVWAVHDLYSKYPTIC 83 Query: 1649 SNYGSGLYNFAWAQAVQNKPLSEILMRDFGSIEKSKRSIIDDSRADXXXXXXXXXXXXXX 1470 Y SGLYN AWAQAVQNKPL++I + + S + ++DD + Sbjct: 84 RGYASGLYNLAWAQAVQNKPLNDIFVMELDSDSNANVVMVDDDEREEGELEEGEIDGDDD 143 Query: 1469 XXXXXXXXXXXXXEGGLSNNSNLARNLEEREFENRIKSIREALGTVTVKDAEISFHGVCX 1290 GG+ + + + E + IR+ L VTV + SF Sbjct: 144 T-------------GGVMVGGDGSETVSESD-------IRDFLEGVTVANVAESFAETIS 183 Query: 1289 XXXXXXXXXXLMIMENGTLDVDDLIQQSFDGIQAINSVFCSMNPKQLEQNKDIFSSLLSH 1110 ++ + D +I+ ++ I+ ++SVFCSM+ Q E NKD LL Sbjct: 184 RLLRVLQSK--LLSGPAVSEKDYVIRLLYNAIEIVHSVFCSMDNLQKEDNKDNIIRLLYF 241 Query: 1109 VMSQDTTLFSPKQMKEIEAMMPFMD-LQAVVPSVKAAEKEK------EIQVNNGVNPNEL 951 + ++ T LFSP+ MKEI+ M+ +D + A+ SV EK + + G+ +EL Sbjct: 242 LKNEHTQLFSPEHMKEIQVMITAIDTVDALGNSVVVGNGEKLDTLDIKTRQIQGLKASEL 301 Query: 950 GILGENPSSKKFLLEPIPVIANMGFEIKPXXXXXXXXXXXXXXXLDLHRKHDVDSLPSPT 771 + + L E + + IK DLH+ HD+DSLPSPT Sbjct: 302 --ISSSKLVHSNLTEASEALLSGQSNIK--------GRGVMLPLFDLHKVHDLDSLPSPT 351 Query: 770 RETPLPSPLVK----------SELATPNVTD------ESEDAMMYRYETDALKAFSTYQQ 639 RE P P+ K L + T+ ++E++ + YETDALKA STYQQ Sbjct: 352 REAPSFFPVNKLFSVGDGMDRPGLPSAGKTEAVKMELDTENSKNHLYETDALKAVSTYQQ 411 Query: 638 KFGRTSNFLTDQLPSPTPSEECDEVDFDLSGEVSSSSTVGNVRTVNPSVSLQPVSSPTAH 459 KFGR+S F D+ PSPTPS +C+E D + EVSS+S ++ + P + PVSS + Sbjct: 412 
KFGRSSYFTDDKFPSPTPSGDCEEGVADANEEVSSASIAVSLTSSKPLLDQMPVSSTSVD 471 Query: 458 MDSSSGQTGSNL----------VLKAKSRDPRLRFTNSEGDASVLNQYPLLEDAPKSETL 309 S G S + A+SRDPRLRF NS+ A LNQ + PK E Sbjct: 472 RSSMHGLINSRIEAASSVTYPVKTSARSRDPRLRFINSDASALDLNQSLGTNNMPKVEN- 530 Query: 308 GGSISSRKHTIIVESVSDGQSQNFKRQRNGLTDS--------LITGYVPMVSGDRSTVGT 153 G + SRK E D + KR R+ L +S + G + +R G+ Sbjct: 531 AGRVISRKQKTTEELSLDATAP--KRLRSSLENSRHNTREERTMAGNGGWLEENR-VAGS 587 Query: 152 QVTDKNILAKNMGTDPRES----------EKGENERLPMIGPSTMASLPSLLRDIAVNPT 3 + ++N L + T+ +++ NE+ P+ +T A+LP LL++IAVNPT Sbjct: 588 HLIERNHLMQKGETELKKTMSTSSGYSTVTSNGNEQAPVTVSNTAAALPGLLKNIAVNPT 647 >ref|XP_003530482.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Glycine max] Length = 1261 Score = 231 bits (588), Expect = 1e-57 Identities = 213/688 (30%), Positives = 302/688 (43%), Gaps = 95/688 (13%) Frame = -3 Query: 1784 IEEGEISDNSQSIEAIAEEDF-KQESKVSNRGS-------RVWM--DDMLKYP-ISSNYG 1638 +EEGEISD + S+E I+ EDF KQ+ K+ N + RVW D KYP I Y Sbjct: 25 VEEGEISDTA-SVEEISAEDFNKQDVKLLNNNNKPNGSDARVWAVHDLYSKYPTICRGYA 83 Query: 1637 SGLYNFAWAQAVQNKPLSEILMRDFGSIEKSKRSIIDDSRADXXXXXXXXXXXXXXXXXX 1458 SGLYN AWAQAVQNKPL++I + + S D+ A+ Sbjct: 84 SGLYNLAWAQAVQNKPLNDIFVMEVDS----------DANANSNRNSSHRLASVAVNPKD 133 Query: 1457 XXXXXXXXXEGGLSNNSNLARNLEEREFENRIKSI---------------------REAL 1341 EG L A E E E+ + ++ R L Sbjct: 134 VVVVDVDKEEGELEEGEIDADAEPEGEAESVVVAVSDSEKLDDVKMDVSDSEQLGARGVL 193 Query: 1340 GTVTVKDAEISFHGVCXXXXXXXXXXXLMIMENGTLDVDDLIQQSFDGIQAINSVFCSMN 1161 VTV + SF C ++ + DDL++ SF+ + + SVFCSM+ Sbjct: 194 EGVTVANVVESFAQTCSKLQNTLPE---VLSRPAGSEKDDLVRLSFNATEVVYSVFCSMD 250 Query: 1160 PKQLEQNKDIFSSLLSHVMSQDTT-LFSPKQMKEIEAMMPFMDLQAVVPSVKAAEKEKEI 984 + EQNKD LLS V Q LFSP+ +KEI+ MM +D + + +A KEKE+ Sbjct: 251 SSEKEQNKDSILRLLSFVKDQQQAQLFSPEHVKEIQGMMTAIDSVGALVNSEAIGKEKEL 310 Query: 983 QVNNGVNPNELGILGENPSSKKFLLEPIPVIANMGFEI-------KPXXXXXXXXXXXXX 825 Q I + S+ + + I N E KP Sbjct: 311 
QTTE--------IKTQENSAVEVQIHEIKTQENQAVEAAELISYSKPLHRDITGTSQALK 362 Query: 824 XXL-------------DLHRKHDVDSLPSPTRETPLPSP----------LVKSELATPNV 714 DLH+ HD DSLPSPTRE P P +V+S A+ + Sbjct: 363 FGQNSIKGRGVLLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGESMVRSGSASAKM 422 Query: 713 TDESEDAMMYRYETDALKAFSTYQQKFGRTSNFLTDQLPSPTPSEECDEVDFDLSGEVSS 534 +SE + + YETDALKA STYQQKFGR+S F D+ PSPTPS +C++ D + EVSS Sbjct: 423 ELDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPSGDCEDEVVDTNEEVSS 482 Query: 533 SSTVGNVRTVNPSVSLQPVSSPTAHMDSSS------------GQTGSNLVLKAKSRDPRL 390 +ST + + P++ QP S T+ MD SS G + AK+RDPRL Sbjct: 483 ASTGDFLTSTKPTLLDQPPVSATS-MDRSSMHGFISSRVDATGPGSFPVKSSAKNRDPRL 541 Query: 389 RFTNSEGDASVLNQYPLLEDAPKSETLGGSISSRKHTIIVESVSDGQSQNFKRQRNGLTD 210 RF NS+ A V N L+ + K E G +IS ++ S+ S+ K Sbjct: 542 RFINSDASA-VDNLSTLINNMSKVEYSGTTISRKQKAAEEPSLDVTVSKRLKSSLENTEH 600 Query: 209 SLITGYVPMVSG----DRSTVGTQVTDKNILAKNMGTDPRE----------------SEK 90 ++ V SG + + G Q+ ++N L G + ++ + Sbjct: 601 NM--SEVRTGSGGWLEENTGPGAQLIERNHLMDKFGPEAKKTLNTVSSSCTGSDNFNATS 658 Query: 89 GENERLPMIGPSTMASLPSLLRDIAVNP 6 NE+ P+ + +ASLP+LL++ +VNP Sbjct: 659 IRNEQAPITASNVLASLPALLKEASVNP 686 >ref|XP_004492028.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like isoform X1 [Cicer arietinum] Length = 1247 Score = 227 bits (578), Expect = 2e-56 Identities = 207/667 (31%), Positives = 304/667 (45%), Gaps = 73/667 (10%) Frame = -3 Query: 1784 IEEGEISDNSQSIEAIAEEDFKQES--KVSNRG----------SRVWM--DDMLKYP-IS 1650 +EEGEISD + +E I+EEDF ++ KV+N +RVW D KYP I Sbjct: 25 VEEGEISDTASVVE-ISEEDFNKQDVVKVNNNSDSDKAKTGGDARVWAVHDLYSKYPTIC 83 Query: 1649 SNYGSGLYNFAWAQAVQNKPLSEILMRDFGSIEKSKRSIIDDSR-------ADXXXXXXX 1491 Y SGLYN AWAQAVQNKPL++I + + S + + +DS Sbjct: 84 RGYASGLYNLAWAQAVQNKPLNDIFVMELDSDSNANANSNNDSNNGNGDLNMPLKEVVMV 143 Query: 1490 XXXXXXXXXXXXXXXXXXXXEGGLSNNSNLARNLEEREFENRIKSIREALGTVTVKDAEI 1311 GG+ + + + E + IR+ L VTV + Sbjct: 144 DDDEREEGELEEGEIDGDDDTGGVMVGGDGSETVSESD-------IRDFLEGVTVANVAE 196 Query: 1310 
SFHGVCXXXXXXXXXXXLMIMENGTLDVDDLIQQSFDGIQAINSVFCSMNPKQLEQNKDI 1131 SF ++ + D +I+ ++ I+ ++SVFCSM+ Q E NKD Sbjct: 197 SFAETISRLLRVLQSK--LLSGPAVSEKDYVIRLLYNAIEIVHSVFCSMDNLQKEDNKDN 254 Query: 1130 FSSLLSHVMSQDTTLFSPKQMKEIEAMMPFMD-LQAVVPSVKAAEKEK------EIQVNN 972 LL + ++ T LFSP+ MKEI+ M+ +D + A+ SV EK + + Sbjct: 255 IIRLLYFLKNEHTQLFSPEHMKEIQVMITAIDTVDALGNSVVVGNGEKLDTLDIKTRQIQ 314 Query: 971 GVNPNELGILGENPSSKKFLLEPIPVIANMGFEIKPXXXXXXXXXXXXXXXLDLHRKHDV 792 G+ +EL + + L E + + IK DLH+ HD+ Sbjct: 315 GLKASEL--ISSSKLVHSNLTEASEALLSGQSNIK--------GRGVMLPLFDLHKVHDL 364 Query: 791 DSLPSPTRETPLPSPLVK----------SELATPNVTD------ESEDAMMYRYETDALK 660 DSLPSPTRE P P+ K L + T+ ++E++ + YETDALK Sbjct: 365 DSLPSPTREAPSFFPVNKLFSVGDGMDRPGLPSAGKTEAVKMELDTENSKNHLYETDALK 424 Query: 659 AFSTYQQKFGRTSNFLTDQLPSPTPSEECDEVDFDLSGEVSSSSTVGNVRTVNPSVSLQP 480 A STYQQKFGR+S F D+ PSPTPS +C+E D + EVSS+S ++ + P + P Sbjct: 425 AVSTYQQKFGRSSYFTDDKFPSPTPSGDCEEGVADANEEVSSASIAVSLTSSKPLLDQMP 484 Query: 479 VSSPTAHMDSSSGQTGSNL----------VLKAKSRDPRLRFTNSEGDASVLNQYPLLED 330 VSS + S G S + A+SRDPRLRF NS+ A LNQ + Sbjct: 485 VSSTSVDRSSMHGLINSRIEAASSVTYPVKTSARSRDPRLRFINSDASALDLNQSLGTNN 544 Query: 329 APKSETLGGSISSRKHTIIVESVSDGQSQNFKRQRNGLTDS--------LITGYVPMVSG 174 PK E G + SRK E D + KR R+ L +S + G + Sbjct: 545 MPKVEN-AGRVISRKQKTTEELSLDATAP--KRLRSSLENSRHNTREERTMAGNGGWLEE 601 Query: 173 DRSTVGTQVTDKNILAKNMGTDPRES----------EKGENERLPMIGPSTMASLPSLLR 24 +R G+ + ++N L + T+ +++ NE+ P+ +T A+LP LL+ Sbjct: 602 NR-VAGSHLIERNHLMQKGETELKKTMSTSSGYSTVTSNGNEQAPVTVSNTAAALPGLLK 660 Query: 23 DIAVNPT 3 +IAVNPT Sbjct: 661 NIAVNPT 667 >ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis] gi|223548611|gb|EEF50102.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis] Length = 1195 Score = 223 bits (569), Expect = 2e-55 Identities = 203/682 (29%), Positives = 301/682 (44%), Gaps = 88/682 (12%) Frame = -3 Query: 1784 IEEGEISDNSQSIEAIAEEDFKQESKV-----------------SNRGSRVW-MDDMLKY 1659 +EEGEISD + SIE I+EEDF ++ V N 
RVW + D+ +Y Sbjct: 15 VEEGEISDTA-SIEEISEEDFNKQDVVVVKPPSSNNETTKQKEQGNGNGRVWTISDLYRY 73 Query: 1658 PISSNYGSGLYNFAWAQAVQ------NKPLSEILMRDFGSI-EKSKRSIIDDSRADXXXX 1500 + + SGLYN AWAQAVQ NKPL+E+ + E SKRS S A Sbjct: 74 QMVGGHVSGLYNLAWAQAVQSKPGKSNKPLNELFADVVEELDESSKRSSPSSSAA----- 128 Query: 1499 XXXXXXXXXXXXXXXXXXXXXXXEGGLSNNSNLARNLEEREFENRIKSIREALGTVTVKD 1320 S NSN EE+ ++ + V + D Sbjct: 129 ---------------------------SVNSNNKDGDEEK---------KKVVEKVVIDD 152 Query: 1319 AEISFHGVCXXXXXXXXXXXLMIMENGTLDVDDLIQQSFDGIQAINSVFCSMNPKQLEQN 1140 M+ +N + D++++ +G + M P + N Sbjct: 153 -----------------NGDEMMDDNNRNKIVDVVEKE-EGELEEGEIDLDMEPGEKANN 194 Query: 1139 KDIFSSLLSHVMSQDTTLFSPKQMKEIEAMMPFMDLQAVVPSVKAA--------EKEKEI 984 D+ + + + + K+M I + + ++ V+ ++ EKEKE Sbjct: 195 GDVLNMNIDGLEVESGEKGFEKKMNSIRDALESVTIEFVLACTDSSGVSFSSFSEKEKEP 254 Query: 983 QVNNGVNPNELGILGENPSSKKFLLEPIPVI------ANMGFEIKPXXXXXXXXXXXXXX 822 ++ VN + + G++ + +P AN+ E Sbjct: 255 LISTVVNKKDNDVNGKSSGHDMSAVNKLPTDSFVNNKANLSIEGPKTGVSSFKSRAALLP 314 Query: 821 XLDLHRKHDVDSLPSPTRETPLPSPLVKSELATPNVTDESEDAMMYRYETDALKAFSTYQ 642 LDLH+ HD DSLPSPTRE+ LP P + + TP + ++ ++ M+ YETDALKA S+YQ Sbjct: 315 LLDLHKDHDADSLPSPTRESALPLPAYR--VLTPKMVLDTGNSRMHPYETDALKAVSSYQ 372 Query: 641 QKFGRTSNFLTDQLPSPTPSEECDEVDFDLSGEVSSSSTVGNVRTVNPSVSLQP-VSSPT 465 QKF ++S LTD+LPSPTPSEE D D GEVSSS +V + R NP S Q S Sbjct: 373 QKFSKSSFALTDRLPSPTPSEESGNGDGDTGGEVSSSLSVSSFRPANPLTSGQSNASISL 432 Query: 464 AHMDSSS------------GQTGSNLVLK--AKSRDPRLRFTNSEGDASVLNQYPL-LED 330 MD SS + +L +K AKSRDPRLRF NS+ +A N + + + Sbjct: 433 PRMDGSSLPGVISIKSAVRASSAPSLTVKASAKSRDPRLRFVNSDSNALDQNHRAVPVVN 492 Query: 329 APKSETLGGSISSRKHTIIVESVSDGQSQNFKRQRNGLTDSLITGYVPMVSG------DR 168 K E +GG+++ ++ I+ + + DG S KRQ+N L +S + V + G D Sbjct: 493 TLKVEPIGGTMNKKRQKIVDDPIPDGHS--LKRQKNALENSGVVRDVKTMVGSGGWLEDT 550 Query: 167 STVGTQVTDKNILAKNMGTDPRESEKG---------------ENERLPMIGPS------- 54 VG Q +KN L N +DPR + G E++P+ G S Sbjct: 551 DMVGPQTMNKNQLVDNAESDPRRKDGGGVCTSSSCISSVNISGTEQIPVTGTSVPIGGEL 610 Query: 53 
-----TMASLPSLLRDIAVNPT 3 + A++P LL++IAVNPT Sbjct: 611 VPVKGSTAAIPDLLKNIAVNPT 632