BLASTX nr result
ID: Akebia25_contig00012464
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00012464
         (4591 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_007043830.1| RNA polymerase II C-terminal domain phosphat...  1109   0.0
ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal doma...  1094   0.0
gb|EXB81217.1| RNA polymerase II C-terminal domain phosphatase-l...  1061   0.0
ref|XP_006438860.1| hypothetical protein CICLE_v10030535mg [Citr...  1048   0.0
ref|XP_006438858.1| hypothetical protein CICLE_v10030535mg [Citr...  1028   0.0
emb|CBI35661.3| unnamed protein product [Vitis vinifera]             1025   0.0
ref|XP_002304648.2| hypothetical protein POPTR_0003s16280g [Popu...  1017   0.0
ref|XP_002297869.2| CTD phosphatase-like protein 3 [Populus tric...   975   0.0
ref|XP_004140651.1| PREDICTED: RNA polymerase II C-terminal doma...   953   0.0
ref|XP_004157633.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymera...   952   0.0
ref|XP_003530482.2| PREDICTED: RNA polymerase II C-terminal doma...   937   0.0
ref|XP_006603006.1| PREDICTED: RNA polymerase II C-terminal doma...   936   0.0
ref|XP_004310239.1| PREDICTED: RNA polymerase II C-terminal doma...   932   0.0
ref|XP_002304714.2| hypothetical protein POPTR_0003s16280g [Popu...   924   0.0
ref|XP_004492028.1| PREDICTED: RNA polymerase II C-terminal doma...   919   0.0
ref|XP_007139315.1| hypothetical protein PHAVU_008G019000g [Phas...   917   0.0
ref|XP_004492029.1| PREDICTED: RNA polymerase II C-terminal doma...   911   0.0
ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative ...   900   0.0
ref|XP_006341905.1| PREDICTED: RNA polymerase II C-terminal doma... 
899 0.0 ref|XP_004252660.1| PREDICTED: RNA polymerase II C-terminal doma... 887 0.0 >ref|XP_007043830.1| RNA polymerase II C-terminal domain phosphatase-like 3, putative [Theobroma cacao] gi|508707765|gb|EOX99661.1| RNA polymerase II C-terminal domain phosphatase-like 3, putative [Theobroma cacao] Length = 1290 Score = 1109 bits (2869), Expect = 0.0 Identities = 653/1258 (51%), Positives = 801/1258 (63%), Gaps = 32/1258 (2%) Frame = +3 Query: 285 SVEEISEEDF-KQEAKVL----NPKGGD----SRVW-MGDLLNYP-VSSNYGSGLYNFAW 431 S+EEISEEDF KQ+ K+L + KGG+ SRVW M DL YP V Y SGLYNFAW Sbjct: 44 SIEEISEEDFNKQDVKILKESKSSKGGEANSNSRVWTMQDLCKYPSVIRGYASGLYNFAW 103 Query: 432 AQAVQNKPLTEILMRDFESEEK-----SKRSGSNLLXXXXXXXXXXXXXXMKEVCNVIID 596 AQAVQNKPL EI ++DFE ++ SKRS + V VI D Sbjct: 104 AQAVQNKPLNEIFVKDFEQPQQDENKNSKRSSPSSSVASVNSKEEKGSSGNLAVKVVIDD 163 Query: 597 DSSEEIDSKAQDVXXXXXXXXXXXXXXXXLDTEMVEETEGGWSNANDSLPSDSGR-NSEG 773 DS +E++ V LD+E E+ L S+ G + Sbjct: 164 DSEDEMEE--DKVVNLDKEEGELEEGEIDLDSEPKEKV----------LSSEDGNVGNSD 211 Query: 774 EFEKQIKSIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGAPDVDDLIQQSFT 953 E EK+ IRG LE VTV AEKSF GVC L +L+SL+ +I+E P D LIQ +F Sbjct: 212 ELEKRANLIRGVLEGVTVIEAEKSFEGVCSRLHNALESLRALILECSVPAKDALIQLAFG 271 Query: 954 GIQAINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFSPERMKEIEAMMYGLDSEA-V 1130 AINS F ++N +EQN + RLL+ VK D +LF P++MKEI+ M+ L+S A Sbjct: 272 ---AINSAFVALNCNSKEQNVAILSRLLSIVKGHDPSLFPPDKMKEIDVMLISLNSPARA 328 Query: 1131 VSHVKAMKKDNGTNPNEFGILGENPGQVLNSSNKILLEPIPVKSGDQNIANMGSETXXXX 1310 + K MK +G N + L EN L +NK+ P K N N +ET Sbjct: 329 IDTEKDMKVVDGVNKKDPDALPENICHDLTVTNKL---PSSAKFVINNKPNALTETLKPG 385 Query: 1311 XXXXXXXXXXXXXXXDLHRDHDADSLPSPTRETPPFLPVQKLKVVGDGLSKSELATPKIA 1490 DLH+DHDADSLPSPTRET P LPV K GD + KS T K + Sbjct: 386 VPNFRNRGISLPLL-DLHKDHDADSLPSPTRETTPCLPVNKPLTSGDVMVKSGFMTGKGS 444 Query: 1491 DESEDSTLHHYETDALKALSSYQQKFGRTSNILTSRLPSPTPSEECDDGDGDSTGEVSSF 1670 ++E LH YETDALKA S+YQQKFG+ S + RLPSPTPSEE D GD+ GEVSS Sbjct: 445 
HDAEGDKLHPYETDALKAFSTYQQKFGQGSFFSSDRLPSPTPSEESGDEGGDNGGEVSSS 504 Query: 1671 STVGDVRNVNLPLPLRSVGSPTPHMDSSMG--QRQMPAKTAGHLACVSNPVLRAPAKNRD 1844 S++G+ + NLP+ + S P +DS+ Q Q+ + A ++ VSN V ++ AK+RD Sbjct: 505 SSIGNFKP-NLPILGHPIVSSAPLVDSASSSLQGQITTRNATPMSSVSNIVSKSLAKSRD 563 Query: 1845 PRLRFANSEGDALDLNQRPLLEGATKSDTLGGIISSRKHNIVVESVLDGQTLKRQRNGLT 2024 PRL FANS ALDLN+R LL A+K +GGI+ SRK V E +LD LKRQRN L Sbjct: 564 PRLWFANSNASALDLNER-LLHNASKVAPVGGIMDSRKKKSVEEPILDSPALKRQRNELE 622 Query: 2025 DFAVSKDVQMVSGSCGWLEESSTVGTQATDVNRLAKNMGTDLRKSENGEIVSSERRDIDA 2204 + V++DVQ VSG GWLE++ +G+Q T+ N+ A+N+ ++ RK +NG SS Sbjct: 623 NLGVARDVQTVSGIGGWLEDTDAIGSQITNRNQTAENLESNSRKMDNGVTSSST-----L 677 Query: 2205 SSNLNVSVGGNEPLPMICTGTTASLPSLLRDIAVNPTMLMQLIM--EQQRLAAGAQKKSS 2378 S N++VG NE +P+ T +T SLP+LL+DIAVNPTML+ ++ +QQRL A AQ+KS Sbjct: 678 SGKTNITVGTNEQVPVTST-STPSLPALLKDIAVNPTMLINILKMGQQQRLGAEAQQKSP 736 Query: 2379 DSAQN----------MKPASSSSVIPGIAPLVNSASSKSSEIEQKPAVRHKVPAQTTSMN 2528 D ++ + SS++VIP +P VN+ S SS I KPA +VP+ S Sbjct: 737 DPVKSTFHQPSSNSLLGVVSSTNVIP--SPSVNNVPSISSGISSKPAGNLQVPSPDES-- 792 Query: 2529 PQGEWGQIRMKPRDPRRILHSSTFQKNESLGSDKFKTNGAPXXXXXXXXXXXXXXXXREQ 2708 G+IRMKPRDPRR+LH ++ Q++ S+G D+ KTNGA Q Sbjct: 793 -----GKIRMKPRDPRRVLHGNSLQRSGSMGLDQLKTNGALTSSTQGSKDNLNAQKLDSQ 847 Query: 2709 AQTTSLXXXXXXXXXXXXXFAEKLKNLAHMLSTSQATNTPPTVSQSISSQPEPVKTEKAG 2888 ++ + F LKN+A ++S SQA + P VS ++ QP +K++ Sbjct: 848 TESKPMQSQLVPPPDITQQFTNNLKNIADIMSVSQALTSLPPVSHNLVPQPVLIKSDSMD 907 Query: 2889 VGAVVTELSDQQIGIGAKPEESIAGPARLQNPWGDVEQLFEGYDDXXXXXXXXXXXXXXX 3068 + A+V+ DQQ G G PE GP R QN WGDVE LFE YDD Sbjct: 908 MKALVSNSEDQQTGAGLAPEAGATGP-RSQNAWGDVEHLFERYDDQQKAAIQRERARRIE 966 Query: 3069 XXNKMFAARKXXXXXXXXXXXXNSAKFVEVDPLHDEILRKKEEQDREKPHRHLFRFPHMG 3248 KMF+ARK NSAKF+EVDP+H+EILRKKEEQDREKP RHLFRF HMG Sbjct: 967 EQKKMFSARKLCLVLDLDHTLLNSAKFIEVDPVHEEILRKKEEQDREKPERHLFRFHHMG 1026 Query: 3249 MWTKLRPGIWNFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGALFAGRVISKGDDGDP 3428 MWTKLRPGIWNFLEKASKLYE+HLYTMGNKLYATEMAKVLDP G 
LFAGRVIS+GDDGDP Sbjct: 1027 MWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1086 Query: 3429 FDGEEKLPKNKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 3608 FDG+E++P++KDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG Sbjct: 1087 FDGDERVPRSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 1146 Query: 3609 PSLLEIDHDERPEEGTLASSLAVIERLHQTFFSHRSLHDVDVRNVLASEQQKILAGCRVV 3788 PSLLEIDHDERPE+GTLASSLAVIER+HQ FFSH++L DVDVRN+LASEQ+KILAGCR+V Sbjct: 1147 PSLLEIDHDERPEDGTLASSLAVIERIHQDFFSHQNLDDVDVRNILASEQRKILAGCRIV 1206 Query: 3789 FSRIFPVGEANPHLHPLWQTAEQFGAVCTIQIDEQVTHVVANSLGTDKVNWALSTGRF 3962 FSR+FPVGEANPHLHPLWQTAEQFGAVCT QIDE VTHVVANSLGTDKVNWALSTG+F Sbjct: 1207 FSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKF 1264 >ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Vitis vinifera] Length = 1238 Score = 1094 bits (2829), Expect = 0.0 Identities = 650/1252 (51%), Positives = 792/1252 (63%), Gaps = 26/1252 (2%) Frame = +3 Query: 285 SVEEISEEDF-KQEAKVLN---PKGGDSRVW----MGDLLNY-PVSSNYGSGLYNFAWAQ 437 SVEEISEEDF KQE +VL PK D+RVW + DL Y S Y LYN AWAQ Sbjct: 25 SVEEISEEDFNKQEVRVLREAKPKA-DTRVWTMRDLQDLYKYHQACSGYTPRLYNLAWAQ 83 Query: 438 AVQNKPLTEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXXMKEVCNVIIDDSSEEID 617 AVQNKPL +I + D +E+SKRS S+ KEV VIIDDS +E+D Sbjct: 84 AVQNKPLNDIFVMD---DEESKRSSSS------SNTSRDDSSSAKEVAKVIIDDSGDEMD 134 Query: 618 SKAQDVXXXXXXXXXXXXXXXXLDTEMVEETEGGWSNANDSLPSDSGRNSEGEFEKQIKS 797 K DV LD+E + EGG + N+ P + E E +++KS Sbjct: 135 VKMDDVSEKEEGELEEGEID--LDSEPDVKDEGGVLDVNE--PEIDLK--ERELVERVKS 188 Query: 798 IRGALETVTVKYAEKSFHGVCLELQASLDSLKLM-----IMENGAPDVDDLIQQSFTGIQ 962 I+ LE+VTV AEKSF GVC LQ +L SL+ + + E+ P D L QQ I+ Sbjct: 189 IQEDLESVTVIEAEKSFSGVCSRLQNTLGSLQKVFGEKVVGESSVPTKDALAQQLINAIR 248 Query: 963 AINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFSPERMKEIEAMMYGLDSEAVVSHV 1142 A+N VFCSMN Q+E NKD+F RLL+ V+ D+ +FS + +KE+E MM LD+ A S Sbjct: 249 ALNHVFCSMNSNQKELNKDVFSRLLSCVECGDSPIFSIQHIKEVEVMMSFLDTPAAQSSA 308 Query: 1143 
KAMKKDN------GTNPNEFGILGENPGQVLNSSNKILLEPIPVKSGDQNIANMGSETXX 1304 +A K N G N N E+ G+ S+ K+ L+ I V+S +QN + Sbjct: 309 EASDKVNDVQVTDGMNRNILDSSVESSGRAFASAKKLSLDSISVESYNQN-----NPDAL 363 Query: 1305 XXXXXXXXXXXXXXXXXDLHRDHDADSLPSPTRETPPFLPVQKLKVVGDGLSKSELATPK 1484 DLH+DHD DSLPSPT + P PV +KSEL T K Sbjct: 364 KPGLSSSRGRFIFGPLLDLHKDHDEDSLPSPTGKAPQCFPV----------NKSELVTAK 413 Query: 1485 IADESEDSTLHHYETDALKALSSYQQKFGRTSNILTSRLPSPTPSEECDDGDGDSTGEVS 1664 +A E++DS +H YETDALKA+S+YQQKFG TS + +LPSPTPSEE D GD +GEVS Sbjct: 414 VAHETQDSIMHPYETDALKAVSTYQQKFGLTSFLPIDKLPSPTPSEESGDTYGDISGEVS 473 Query: 1665 SFSTVGDVRNVNLPLPLRSVGSPTPHMDSSMGQRQMPAKTAGHLACVSNPVLR----APA 1832 S ST+ N P + S P MDSS+ Q + ++ S P L A A Sbjct: 474 SSSTISAPITANAPALGHPIVSSAPQMDSSIVQGPTVGRNTSLVS--SGPHLDSSVVASA 531 Query: 1833 KNRDPRLRFANSEGDALDLNQRPL--LEGATKSDTLGGIISSRKHNIVVESVLDGQTLKR 2006 K+RDPRLR A+S+ +LDLN+RPL + + K D LG I+SSRK E +LDG KR Sbjct: 532 KSRDPRLRLASSDAGSLDLNERPLPAVSNSPKVDPLGEIVSSRKQKSAEEPLLDGPVTKR 591 Query: 2007 QRNGLTDFAVSKDVQMVSGSCGWLEESSTVGTQATDVNRLAKNMGTDLRKSENGEIVSSE 2186 QRNGLT A +D Q V S GWLE+S+TV Q + N+L +N GTD +K E+ V+ Sbjct: 592 QRNGLTSPATVRDAQTVVASGGWLEDSNTVIPQMMNRNQLIENTGTDPKKLESKVTVTGI 651 Query: 2187 RRDIDASSNLNVSVGGNEPLPMICTGTTASLPSLLRDIAVNPTMLMQLIMEQQRLAAGAQ 2366 D V+V GNE LP++ T TTASL SLL+DIAVNP + M + + + Q Sbjct: 652 GCD-----KPYVTVNGNEHLPVVATSTTASLQSLLKDIAVNPAVWMNIFNKVE------Q 700 Query: 2367 KKSSDSAQNMKPASSSSVIPGIAPLVNSASSKSSEIEQKPAVRHKVPAQTTSMNPQGEWG 2546 +KS D A+N +S+ I G+ P + A K S + QKPA +VP P E G Sbjct: 701 QKSGDPAKNTVLPPTSNSILGVVPPASVAPLKPSALGQKPAGALQVP----QTGPMDESG 756 Query: 2547 QIRMKPRDPRRILHSSTFQKNESLGSDKFKTNGAPXXXXXXXXXXXXXXXXREQAQTTSL 2726 ++RMKPRDPRRILH+++FQ++ S GS++FKTN +Q +T S+ Sbjct: 757 KVRMKPRDPRRILHANSFQRSGSSGSEQFKTNAQKQ---------------EDQTETKSV 801 Query: 2727 XXXXXXXXXXXXXFAEKLKNLAHMLSTSQATNTPPTVSQSISSQPEPVKTEKAGVGAVVT 2906 F + LKN+A ++S SQA++ PT Q +SSQ V T++ V A V+ Sbjct: 802 PSHSVNPPDISQQFTKNLKNIADLMSASQASSMTPTFPQILSSQSVQVNTDRMDVKATVS 861 Query: 2907 
ELSDQQIGIGAKPEESIAGPARLQNPWGDVEQLFEGYDDXXXXXXXXXXXXXXXXXNKMF 3086 + DQ G+KPE S AGP + +N WGDVE LF+GYDD KMF Sbjct: 862 DSGDQLTANGSKPE-SAAGPPQSKNTWGDVEHLFDGYDDQQKAAIQRERARRIEEQKKMF 920 Query: 3087 AARKXXXXXXXXXXXXNSAKFVEVDPLHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 3266 +ARK NSAKFVEVDP+HDEILRKKEEQDREK RHLFRFPHMGMWTKLR Sbjct: 921 SARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKSQRHLFRFPHMGMWTKLR 980 Query: 3267 PGIWNFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGALFAGRVISKGDDGDPFDGEEK 3446 PGIWNFLEKASKLYE+HLYTMGNKLYATEMAKVLDP G LFAGRVISKGDDGD DG+E+ Sbjct: 981 PGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKGDDGDVLDGDER 1040 Query: 3447 LPKNKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEI 3626 +PK+KDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL GPSLLEI Sbjct: 1041 VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEI 1100 Query: 3627 DHDERPEEGTLASSLAVIERLHQTFFSHRSLHDVDVRNVLASEQQKILAGCRVVFSRIFP 3806 DHDERPE+GTLASSLAVIER+HQ+FFS+R+L +VDVRN+LASEQ+KILAGCR+VFSR+FP Sbjct: 1101 DHDERPEDGTLASSLAVIERIHQSFFSNRALDEVDVRNILASEQRKILAGCRIVFSRVFP 1160 Query: 3807 VGEANPHLHPLWQTAEQFGAVCTIQIDEQVTHVVANSLGTDKVNWALSTGRF 3962 VGEANPHLHPLWQTAE FGAVCT QIDEQVTHVVANSLGTDKVNWALSTGRF Sbjct: 1161 VGEANPHLHPLWQTAESFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRF 1212 >gb|EXB81217.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Morus notabilis] Length = 1301 Score = 1061 bits (2744), Expect = 0.0 Identities = 630/1271 (49%), Positives = 784/1271 (61%), Gaps = 45/1271 (3%) Frame = +3 Query: 285 SVEEISEEDF-KQEA------KVLN--------PKGGDSRVW-MGDLL-NYPVSSNYGSG 413 SVEEISEEDF KQE KV++ K GDSRVW M DL NYP Y +G Sbjct: 23 SVEEISEEDFNKQEGNGTGSGKVMSVSDSNSKESKFGDSRVWTMRDLYANYPGFRGYTTG 82 Query: 414 LYNFAWAQAVQNKPLTEILMRDFESEEKSK--RSGSNLLXXXXXXXXXXXXXXMKEVCNV 587 LYN AWAQAVQNKPL EI + D ++++ S+ S ++ +++V V Sbjct: 83 LYNLAWAQAVQNKPLNEIFVMDVDADDSSRVVLSSASPAVNSGRREGKNGVKEVEKVEKV 142 Query: 588 IIDDSSEEIDSKAQDVXXXXXXXXXXXXXXXXLDTEMVEETEGGWSNAND---------S 740 +IDDS++E++ + L++E ++ G + D Sbjct: 143 
VIDDSADEMEEGELE------------EGEIDLESEPTQKPAGEEAKDGDLNCEAENVGG 190 Query: 741 LPSDSGRNSEGEFEKQIKSIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMEN--G 914 L DS R+ E EK++ I L +V V AEKSF VC LQ +L+SL+ ++ E Sbjct: 191 LEVDSRRD---ELEKRVDLIWETLGSVNVVNAEKSFEEVCSRLQRTLESLRGVLSEKEFS 247 Query: 915 APDVDDLIQQSFTGIQAINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFSPERMKEI 1094 P D +IQ S T IQ +NSVFCSM+ Q+EQ K+ RL VK+ T LFSPE+ KEI Sbjct: 248 FPTKDVVIQMSITAIQVVNSVFCSMSVNQKEQKKETLSRLFCSVKNCGTPLFSPEQTKEI 307 Query: 1095 EAMMYGLDSEAVVSHVKAMKKDNGTNPNEFGILGENPGQVLNSS-NKILLEPIPVKSGDQ 1271 E M+ L+ V+ A K+ T E L E + N++ +E VK Sbjct: 308 ELMISSLNPLNVLPSSGASDKEKETQIIER--LHEMDSNLTNANAENASIERTSVKLPQD 365 Query: 1272 NIANMGSETXXXXXXXXXXXXXXXXXXX------DLHRDHDADSLPSPTRETPPFLPVQK 1433 +A++ DLH+DHDADSLPSPTRE P PV K Sbjct: 366 CVASVVHSNPITLPELLRPGTLAFKGRGLLLPLLDLHKDHDADSLPSPTREAPSCFPVYK 425 Query: 1434 LKVVGDGLSKSELATPKIADESEDSTLHHYETDALKALSSYQQKFGRTSNILTSRLPSPT 1613 V DG+ K T K+A +E+S LH YETDALKA+S+YQQKFGR S +++ RLPSPT Sbjct: 426 PLGVADGIIKPVSTTAKVAPGAEESRLHRYETDALKAVSTYQQKFGRGSFLMSDRLPSPT 485 Query: 1614 PSEECDDGDGDSTGEVSSFSTVGDVRNVNLPLPLRSVGSPTPHMDSSMGQRQMPAKTAGH 1793 PSEECD+ D D EVSS T G++R +P+ SV + + + S Q + AK A Sbjct: 486 PSEECDEED-DINQEVSSSLTSGNLRTPAIPILRPSVVTSSVPVSSPTMQGPIAAKNAAP 544 Query: 1794 LACVSNPVLRAPAKNRDPRLRFANSEGDALDLNQRPL--LEGATKSDTLGGIISSRKHNI 1967 + SN ++A A++RDPRLRFANS+ ALDLNQRPL + K + G SSRK I Sbjct: 545 VGSGSNSTMKASARSRDPRLRFANSDAGALDLNQRPLTAVHNGPKVEP-GDPTSSRKQRI 603 Query: 1968 VVESVLDGQTLKRQRNGLTDFAVSKDVQMVSGSCGWLEESSTVGTQATDVNRLAKNMGTD 2147 V E LDG LKRQR+ + DV+ SG GWLE++ T G Q + N+L +N D Sbjct: 604 VEEPNLDGPALKRQRHAFVSAKI--DVKTASGVGGWLEDNGTTGPQIMNKNQLVENAEAD 661 Query: 2148 LRKS---ENGEIVSSERRDIDASSNLNVSVGGNEPLPMICTGTTASLPSLLRDIAVNPTM 2318 RKS NG I+++ N+ G E +P+ T T +LP++L+DIAVNPT+ Sbjct: 662 PRKSIHLVNGPIMNN-------GPNI-----GKEQVPVTGTSTPDALPAILKDIAVNPTI 709 Query: 2319 LMQLIM---EQQRLAAGAQKKSSDSAQNMKPASSSSVIPGIAPLVNSASSKSSEIEQKPA 2489 M ++ +QQ LAA AQ+KS S P ++S++ G APLVN A SK+S I Q PA Sbjct: 710 
FMDILNKLGQQQLLAADAQQKSDSSKNTTHPPGTNSIL-GAAPLVNVAPSKASGILQTPA 768 Query: 2490 VRHKVPAQTTSMNPQGEWGQIRMKPRDPRRILHSSTFQKNESLGSDKFKTNGAPXXXXXX 2669 V +Q + + Q E G+IRMKPRDPRR+LH + QK+ SLG ++FK + Sbjct: 769 VSLPTTSQVATASMQDELGKIRMKPRDPRRVLHGNMLQKSWSLGHEQFKPIVSSVSCTPG 828 Query: 2670 XXXXXXXXXXREQAQTTSLXXXXXXXXXXXXXFAEKLKNLAHMLSTSQATNTPPTVSQSI 2849 QA + F + L+N+A ++S SQA+ +P TVSQ++ Sbjct: 829 NKDNLNGPVQEGQADKKQVPSQLVVQPDIARQFTKNLRNIADLMSVSQASTSPATVSQNL 888 Query: 2850 SSQPEPVKTEKAGVGAVVTELSDQQIGIGAKPEESIAGPARLQNPWGDVEQLFEGYDDXX 3029 SSQP PVK ++ V AVV DQ G + PE ++A P+R N WGDVE LFEGYDD Sbjct: 889 SSQPLPVKPDRGDVKAVVPNSEDQHSGTNSTPETTLAVPSRTPNAWGDVEHLFEGYDDEQ 948 Query: 3030 XXXXXXXXXXXXXXXNKMFAARKXXXXXXXXXXXXNSAKFVEVDPLHDEILRKKEEQDRE 3209 KMF A K NSAKFVEVD +HDEILRKKEEQDRE Sbjct: 949 KAAIQRERARRLEEQKKMFDAHKLCLVLDLDHTLLNSAKFVEVDSVHDEILRKKEEQDRE 1008 Query: 3210 KPHRHLFRFPHMGMWTKLRPGIWNFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGALF 3389 KP RHLFRFPHMGMWTKLRPG+WNFLEKASKLYE+HLYTMGNKLYATEMAKVLDP G LF Sbjct: 1009 KPQRHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPMGTLF 1068 Query: 3390 AGRVISKGDDGDPFDGEEKLPKNKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYT 3569 +GRVIS+GDDGDPFDG+E++PK+KDLEGVLGMES+VVIIDDSVRVWPHNKLNLIVVERYT Sbjct: 1069 SGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYT 1128 Query: 3570 YFPCSRRQFGLLGPSLLEIDHDERPEEGTLASSLAVIERLHQTFFSHRSLHDVDVRNVLA 3749 YFPCSRRQFGL GPSLLEIDHDERPE+GTLASSLAVIE++HQ FFSH SL +VDVRN+LA Sbjct: 1129 YFPCSRRQFGLPGPSLLEIDHDERPEQGTLASSLAVIEKIHQNFFSHHSLDEVDVRNILA 1188 Query: 3750 SEQQKILAGCRVVFSRIFPVGEANPHLHPLWQTAEQFGAVCTIQIDEQVTHVVANSLGTD 3929 SEQ+KILAGCR+VFSR+FPV E NPHLHPLWQTAEQFGAVCT QID+QVTHVVANS GTD Sbjct: 1189 SEQRKILAGCRIVFSRVFPVSEVNPHLHPLWQTAEQFGAVCTTQIDDQVTHVVANSPGTD 1248 Query: 3930 KVNWALSTGRF 3962 KVNWAL+ G+F Sbjct: 1249 KVNWALANGKF 1259 >ref|XP_006438860.1| hypothetical protein CICLE_v10030535mg [Citrus clementina] gi|568858958|ref|XP_006483010.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Citrus sinensis] 
gi|557541056|gb|ESR52100.1| hypothetical protein CICLE_v10030535mg [Citrus clementina] Length = 1234 Score = 1048 bits (2709), Expect = 0.0 Identities = 625/1257 (49%), Positives = 772/1257 (61%), Gaps = 31/1257 (2%) Frame = +3 Query: 285 SVEEISEEDFK----------QEAKVLNPKGGDS--RVW-MGDLLN-YP-VSSNYGSGLY 419 SVEEISEEDFK +E K + GG++ RVW M DL N YP + YG GL+ Sbjct: 15 SVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNKYPAICRGYGPGLH 74 Query: 420 NFAWAQAVQNKPLTEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXXMKEVCNVIIDD 599 N AWAQAVQNKPL EI + + E ++ SKRS K V V+IDD Sbjct: 75 NLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVASVNSGAAAGKDDKKVVEKVVIDD 134 Query: 600 SSEEIDSKAQDVXXXXXXXXXXXXXXXXLDTEMVEETEGGWSNANDSLPSDSGRNSEGEF 779 S +EI+ + ++ EE E ++S S + E Sbjct: 135 SGDEIEKEEGEL----------------------EEGEIELDLESESNEKVSEQVKEEMK 172 Query: 780 EKQIKSIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGAPDVDDLIQQSFTGI 959 ++SIR ALE+V + SF GVC +L+ +L+SL+ ++ EN P D LIQ +F+ + Sbjct: 173 LINVESIREALESVL--RGDISFEGVCSKLEFTLESLRELVNENNVPTKDALIQLAFSAV 230 Query: 960 QAINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFSPERMKEIEAMMYGLDSEAVVSH 1139 Q+++SVFCSMN +EQNK++ RLL+ +KS + LFS ++KE+EAM+ L + A Sbjct: 231 QSVHSVFCSMNHVLKEQNKEILSRLLSVIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKE 290 Query: 1140 VKAMKKDNGTNPNEFGILGENPGQVLNSSNKILLEPIPVKSGDQNIANMGSETXXXXXXX 1319 K M +G N + I+ EN LN K+ P+PV S QN S+ Sbjct: 291 -KDMLAMHGVNGKDSNIVTENAVNDLNFKEKV---PLPVDSLMQNKPLEASKPGPPGYRS 346 Query: 1320 XXXXXXXXXXXXDLHRDHDADSLPSPTRETPPFLPVQKLKVVGDGLSKSELATPKIADES 1499 D H+ HD DSLPSPTRET P +PVQ+ VVGDG+ KS A K++ + Sbjct: 347 RGVLLPLL----DPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGVVKSWAAAAKLSHNA 402 Query: 1500 EDSTLHHYETDALKALSSYQQKFGRTSNILTSRLPSPTPSEECDDGDGDSTGEVSSFSTV 1679 E HYETDAL+A SSYQQKFGR S + S LPSPTPSEE DGDGD+ GE+SS + V Sbjct: 403 EVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAV 462 Query: 1680 GDVRNVNLPLPLRSVGSPTPH-----MDSSMGQ------RQMPAKTAGHLACVSNPVLRA 1826 + VN+P + S P MD S Q PA + + NPV++A Sbjct: 463 DQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVVKA 522 Query: 
1827 PAKNRDPRLRFANSEGDALDLNQRP--LLEGATKSDTLGGIISSRKHNIVVESVLDGQTL 2000 P K+RDPRLRFA+S +AL+LN +P +L A K + +G ++SSRK V E VLDG L Sbjct: 523 PIKSRDPRLRFASS--NALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPAL 580 Query: 2001 KRQRNGLTDFAVSKDVQMVSGSCGWLEESSTVGTQATDVNRLAKNMGTDLRKSENGEIVS 2180 KRQRNG + V +D + + GS GWLE++ Q + N L + ++ RK +NG Sbjct: 581 KRQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGATSP 640 Query: 2181 SERRDIDASSNLNVSVGGNEPLPMICTGTTASLPSLLRDIAVNPTMLMQLIM--EQQRLA 2354 S NV V GNEP P TT SLP+LL+DIAVNPTML+ ++ +QQ+LA Sbjct: 641 I------TSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLA 694 Query: 2355 AGAQKKSSDSAQN-MKPASSSSVIPGIAPLVNSASSKSSEIEQKPAVRHKVPAQTTSMNP 2531 A AQ+KS+DS+ N M P SS+ P +V +P+ S P Sbjct: 695 ADAQQKSNDSSMNTMHPPIPSSIPP-------------------VSVTCSIPSGILS-KP 734 Query: 2532 QGEWGQIRMKPRDPRRILHSSTFQKNESLGSDKFKTNGAPXXXXXXXXXXXXXXXXREQA 2711 E G++RMKPRDPRR+LH + Q++ SLG + FKT+G Sbjct: 735 MDELGKVRMKPRDPRRVLHGNALQRSGSLGPE-FKTDGPSAPCTQGSKENLNFQKQLGAP 793 Query: 2712 QTTSLXXXXXXXXXXXXXFAEKLKNLAHMLSTSQATNTPPTVSQSISSQPEPVKTEKAGV 2891 + + F + LK++A +S SQ + P VSQ+ QP +K+ A + Sbjct: 794 EAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSG-ADM 852 Query: 2892 GAVVTELSDQQIGIGAKPEESIAGPARLQNPWGDVEQLFEGYDDXXXXXXXXXXXXXXXX 3071 AVVT D+Q G G+ PE G A Q+ WGDVE LFEGYDD Sbjct: 853 KAVVTNHDDKQTGTGSGPEAGPVG-AHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEE 911 Query: 3072 XNKMFAARKXXXXXXXXXXXXNSAKFVEVDPLHDEILRKKEEQDREKPHRHLFRFPHMGM 3251 KMF+ARK NSAKF EVDP+HDEILRKKEEQDREKPHRHLFRFPHMGM Sbjct: 912 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 971 Query: 3252 WTKLRPGIWNFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGALFAGRVISKGDDGDPF 3431 WTKLRPGIW FLE+ASKL+EMHLYTMGNKLYATEMAKVLDP G LFAGRVIS+GDDGDPF Sbjct: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031 Query: 3432 DGEEKLPKNKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGP 3611 DG+E++PK+KDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGP Sbjct: 1032 DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGP 
1091 Query: 3612 SLLEIDHDERPEEGTLASSLAVIERLHQTFFSHRSLHDVDVRNVLASEQQKILAGCRVVF 3791 SLLEIDHDER E+GTLASSL VIERLH+ FFSH+SL DVDVRN+LA+EQ+KILAGCR+VF Sbjct: 1092 SLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVF 1151 Query: 3792 SRIFPVGEANPHLHPLWQTAEQFGAVCTIQIDEQVTHVVANSLGTDKVNWALSTGRF 3962 SR+FPVGEANPHLHPLWQTAEQFGAVCT ID+QVTHVVANSLGTDKVNWALSTGRF Sbjct: 1152 SRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF 1208 >ref|XP_006438858.1| hypothetical protein CICLE_v10030535mg [Citrus clementina] gi|557541054|gb|ESR52098.1| hypothetical protein CICLE_v10030535mg [Citrus clementina] Length = 1208 Score = 1028 bits (2658), Expect = 0.0 Identities = 616/1248 (49%), Positives = 763/1248 (61%), Gaps = 31/1248 (2%) Frame = +3 Query: 285 SVEEISEEDFK----------QEAKVLNPKGGDS--RVW-MGDLLN-YP-VSSNYGSGLY 419 SVEEISEEDFK +E K + GG++ RVW M DL N YP + YG GL+ Sbjct: 15 SVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNKYPAICRGYGPGLH 74 Query: 420 NFAWAQAVQNKPLTEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXXMKEVCNVIIDD 599 N AWAQAVQNKPL EI + + E ++ SKRS K V V+IDD Sbjct: 75 NLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVASVNSGAAAGKDDKKVVEKVVIDD 134 Query: 600 SSEEIDSKAQDVXXXXXXXXXXXXXXXXLDTEMVEETEGGWSNANDSLPSDSGRNSEGEF 779 S +EI+ + ++ EE E ++S S + E Sbjct: 135 SGDEIEKEEGEL----------------------EEGEIELDLESESNEKVSEQVKEEMK 172 Query: 780 EKQIKSIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGAPDVDDLIQQSFTGI 959 ++SIR ALE+V + SF GVC +L+ +L+SL+ ++ EN P D LIQ +F+ + Sbjct: 173 LINVESIREALESVL--RGDISFEGVCSKLEFTLESLRELVNENNVPTKDALIQLAFSAV 230 Query: 960 QAINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFSPERMKEIEAMMYGLDSEAVVSH 1139 Q+++SVFCSMN +EQNK++ RLL+ +KS + LFS ++KE+EAM+ L + A Sbjct: 231 QSVHSVFCSMNHVLKEQNKEILSRLLSVIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKE 290 Query: 1140 VKAMKKDNGTNPNEFGILGENPGQVLNSSNKILLEPIPVKSGDQNIANMGSETXXXXXXX 1319 K M +G N + I+ EN LN K+ P+PV S QN S+ Sbjct: 291 -KDMLAMHGVNGKDSNIVTENAVNDLNFKEKV---PLPVDSLMQNKPLEASKPGPPGYRS 346 Query: 1320 XXXXXXXXXXXXDLHRDHDADSLPSPTRETPPFLPVQKLKVVGDGLSKSELATPKIADES 1499 D H+ HD DSLPSPTRET 
P +PVQ+ VVGDG+ KS A K++ + Sbjct: 347 RGVLLPLL----DPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGVVKSWAAAAKLSHNA 402 Query: 1500 EDSTLHHYETDALKALSSYQQKFGRTSNILTSRLPSPTPSEECDDGDGDSTGEVSSFSTV 1679 E HYETDAL+A SSYQQKFGR S + S LPSPTPSEE DGDGD+ GE+SS + V Sbjct: 403 EVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAV 462 Query: 1680 GDVRNVNLPLPLRSVGSPTPH-----MDSSMGQ------RQMPAKTAGHLACVSNPVLRA 1826 + VN+P + S P MD S Q PA + + NPV++A Sbjct: 463 DQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVVKA 522 Query: 1827 PAKNRDPRLRFANSEGDALDLNQRP--LLEGATKSDTLGGIISSRKHNIVVESVLDGQTL 2000 P K+RDPRLRFA+S +AL+LN +P +L A K + +G ++SSRK V E VLDG L Sbjct: 523 PIKSRDPRLRFASS--NALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPAL 580 Query: 2001 KRQRNGLTDFAVSKDVQMVSGSCGWLEESSTVGTQATDVNRLAKNMGTDLRKSENGEIVS 2180 KRQRNG + V +D + + GS GWLE++ Q + N L + ++ RK +NG Sbjct: 581 KRQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGATSP 640 Query: 2181 SERRDIDASSNLNVSVGGNEPLPMICTGTTASLPSLLRDIAVNPTMLMQLIM--EQQRLA 2354 S NV V GNEP P TT SLP+LL+DIAVNPTML+ ++ +QQ+LA Sbjct: 641 I------TSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLA 694 Query: 2355 AGAQKKSSDSAQN-MKPASSSSVIPGIAPLVNSASSKSSEIEQKPAVRHKVPAQTTSMNP 2531 A AQ+KS+DS+ N M P SS+ P +V +P+ S P Sbjct: 695 ADAQQKSNDSSMNTMHPPIPSSIPP-------------------VSVTCSIPSGILS-KP 734 Query: 2532 QGEWGQIRMKPRDPRRILHSSTFQKNESLGSDKFKTNGAPXXXXXXXXXXXXXXXXREQA 2711 E G++RMKPRDPRR+LH + Q++ SLG + FKT+G Sbjct: 735 MDELGKVRMKPRDPRRVLHGNALQRSGSLGPE-FKTDGPSAPCTQGSKENLNFQKQLGAP 793 Query: 2712 QTTSLXXXXXXXXXXXXXFAEKLKNLAHMLSTSQATNTPPTVSQSISSQPEPVKTEKAGV 2891 + + F + LK++A +S SQ + P VSQ+ QP +K+ A + Sbjct: 794 EAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSG-ADM 852 Query: 2892 GAVVTELSDQQIGIGAKPEESIAGPARLQNPWGDVEQLFEGYDDXXXXXXXXXXXXXXXX 3071 AVVT D+Q G G+ PE G A Q+ WGDVE LFEGYDD Sbjct: 853 KAVVTNHDDKQTGTGSGPEAGPVG-AHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEE 911 Query: 3072 XNKMFAARKXXXXXXXXXXXXNSAKFVEVDPLHDEILRKKEEQDREKPHRHLFRFPHMGM 3251 KMF+ARK NSAKF EVDP+HDEILRKKEEQDREKPHRHLFRFPHMGM 
Sbjct: 912 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 971 Query: 3252 WTKLRPGIWNFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGALFAGRVISKGDDGDPF 3431 WTKLRPGIW FLE+ASKL+EMHLYTMGNKLYATEMAKVLDP G LFAGRVIS+GDDGDPF Sbjct: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031 Query: 3432 DGEEKLPKNKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGP 3611 DG+E++PK+KDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGP Sbjct: 1032 DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGP 1091 Query: 3612 SLLEIDHDERPEEGTLASSLAVIERLHQTFFSHRSLHDVDVRNVLASEQQKILAGCRVVF 3791 SLLEIDHDER E+GTLASSL VIERLH+ FFSH+SL DVDVRN+LA+EQ+KILAGCR+VF Sbjct: 1092 SLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVF 1151 Query: 3792 SRIFPVGEANPHLHPLWQTAEQFGAVCTIQIDEQVTHVVANSLGTDKV 3935 SR+FPVGEANPHLHPLWQTAEQFGAVCT ID+QVTHVVANSLGTDKV Sbjct: 1152 SRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKV 1199 >emb|CBI35661.3| unnamed protein product [Vitis vinifera] Length = 1184 Score = 1025 bits (2649), Expect = 0.0 Identities = 622/1242 (50%), Positives = 751/1242 (60%), Gaps = 16/1242 (1%) Frame = +3 Query: 285 SVEEISEEDF-KQEAKVLN---PKGGDSRVW----MGDLLNY-PVSSNYGSGLYNFAWAQ 437 SVEEISEEDF KQE +VL PK D+RVW + DL Y S Y LYN AWAQ Sbjct: 65 SVEEISEEDFNKQEVRVLREAKPKA-DTRVWTMRDLQDLYKYHQACSGYTPRLYNLAWAQ 123 Query: 438 AVQNKPLTEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXXMKEVCNVIIDDSSEEID 617 AVQNKPL +I VIIDDS +E+D Sbjct: 124 AVQNKPLNDIF--------------------------------------VIIDDSGDEMD 145 Query: 618 SKAQDVXXXXXXXXXXXXXXXXLDTEMVEETEGGWSNANDSLPSDSGRNSEGEFEKQIKS 797 K DV LD+E + EGG + N+ P + E E +++KS Sbjct: 146 VKMDDVSEKEEGELEEGEID--LDSEPDVKDEGGVLDVNE--PEIDLK--ERELVERVKS 199 Query: 798 IRGALETVTVKYAEKSFHGVCLELQASLDSLKLM-----IMENGAPDVDDLIQQSFTGIQ 962 I+ LE+VTV AEKSF GVC LQ +L SL+ + + E+ P D L QQ I+ Sbjct: 200 IQEDLESVTVIEAEKSFSGVCSRLQNTLGSLQKVFGEKVVGESSVPTKDALAQQLINAIR 259 Query: 963 AINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFSPERMKEIEAMMYGLDSEAVVSHV 1142 A+N VFCSMN Q+E NKD+F RLL+ V+ D+ +FS + +KE+E MM LD+ A S 
Sbjct: 260 ALNHVFCSMNSNQKELNKDVFSRLLSCVECGDSPIFSIQHIKEVEVMMSFLDTPAAQSSA 319 Query: 1143 KAMKKDNGTNPNEFGILGENPGQVLNSSNKILLEPIPVKSGDQNIANMGSETXXXXXXXX 1322 +A K N QV + N+ +L+ V+S + A+ Sbjct: 320 EASDKVNDV-------------QVTDGMNRNILDS-SVESSGRAFASAKK---------- 355 Query: 1323 XXXXXXXXXXXDLHRDHDADSLPSPTRETPPFLPVQKLKVVGDGLSKSELATPKIADESE 1502 DLH+DHD DSLPSPT + P PV +KSEL T K+A E++ Sbjct: 356 FRGRFIFGPLLDLHKDHDEDSLPSPTGKAPQCFPV----------NKSELVTAKVAHETQ 405 Query: 1503 DSTLHHYETDALKALSSYQQKFGRTSNILTSRLPSPTPSEECDDGDGDSTGEVSSFSTVG 1682 DS +H YETDALKA+S+YQQKFG TS + +LPSPTPSEE D GD +GEVSS ST+ Sbjct: 406 DSIMHPYETDALKAVSTYQQKFGLTSFLPIDKLPSPTPSEESGDTYGDISGEVSSSSTIS 465 Query: 1683 DVRNVNLPLPLRSVGSPTPHMDSSMGQRQMPAKTAGHLACVSNPVLRAPAKNRDPRLRFA 1862 N P + S P MD G + + G + N +LRA AK+RDPRLR A Sbjct: 466 APITANAPALGHPIVSSAPQMDIVQGL--VVPRNTGAVNSRFNSILRASAKSRDPRLRLA 523 Query: 1863 NSEGDALDLNQRPL--LEGATKSDTLGGIISSRKHNIVVESVLDGQTLKRQRNGLTDFAV 2036 +S+ +LDLN+RPL + + K D LG I+SSRK E +LDG KRQRNGLT A Sbjct: 524 SSDAGSLDLNERPLPAVSNSPKVDPLGEIVSSRKQKSAEEPLLDGPVTKRQRNGLTSPAT 583 Query: 2037 SKDVQMVSGSCGWLEESSTVGTQATDVNRLAKNMGTDLRKSENGEIVSSERRDIDASSNL 2216 LE TV +G D Sbjct: 584 K------------LESKVTV-----------TGIGCD---------------------KP 599 Query: 2217 NVSVGGNEPLPMICTGTTASLPSLLRDIAVNPTMLMQLIMEQQRLAAGAQKKSSDSAQNM 2396 V+V GNE LP++ T TTASL SLL+DIAVNP + M + + + Q+KS D A+N Sbjct: 600 YVTVNGNEHLPVVATSTTASLQSLLKDIAVNPAVWMNIFNKVE------QQKSGDPAKNT 653 Query: 2397 KPASSSSVIPGIAPLVNSASSKSSEIEQKPAVRHKVPAQTTSMNPQGEWGQIRMKPRDPR 2576 +S+ I G+ P + A K S + QKPA +VP QT MNPQ E G++RMKPRDPR Sbjct: 654 VLPPTSNSILGVVPPASVAPLKPSALGQKPAGALQVP-QTGPMNPQDESGKVRMKPRDPR 712 Query: 2577 RILHSSTFQKNESLGSDKFKTNGAPXXXXXXXXXXXXXXXXREQAQTTSLXXXXXXXXXX 2756 RILH+++FQ++ S GS++FKTN +Q +T S+ Sbjct: 713 RILHANSFQRSGSSGSEQFKTNAQKQ---------------EDQTETKSVPSHSVNPPDI 757 Query: 2757 XXXFAEKLKNLAHMLSTSQATNTPPTVSQSISSQPEPVKTEKAGVGAVVTELSDQQIGIG 2936 F + LKN+A ++S SQA++ PT Q +SSQ V T++ V A V++ DQ G Sbjct: 758 
SQQFTKNLKNIADLMSASQASSMTPTFPQILSSQSVQVNTDRMDVKATVSDSGDQLTANG 817 Query: 2937 AKPEESIAGPARLQNPWGDVEQLFEGYDDXXXXXXXXXXXXXXXXXNKMFAARKXXXXXX 3116 +KPE S AGP + +N WGDVE LF+GYDD KMF+ARK Sbjct: 818 SKPE-SAAGPPQSKNTWGDVEHLFDGYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLD 876 Query: 3117 XXXXXXNSAKFVEVDPLHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWNFLEKA 3296 NSAKFVEVDP+HDEILRKKEEQDREK RHLFRFPHMGMWTKLRPGIWNFLEKA Sbjct: 877 LDHTLLNSAKFVEVDPVHDEILRKKEEQDREKSQRHLFRFPHMGMWTKLRPGIWNFLEKA 936 Query: 3297 SKLYEMHLYTMGNKLYATEMAKVLDPTGALFAGRVISKGDDGDPFDGEEKLPKNKDLEGV 3476 SKLYE+HLYTMGNKLYATEMAKVLDP G LFAGRVISKGDDGD DG+E++PK+KDLEGV Sbjct: 937 SKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKGDDGDVLDGDERVPKSKDLEGV 996 Query: 3477 LGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEEGT 3656 LGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL GPSLLEIDHDERPE+GT Sbjct: 997 LGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEDGT 1056 Query: 3657 LASSLAVIERLHQTFFSHRSLHDVDVRNVLASEQQKILAGCRVVFSRIFPVGEANPHLHP 3836 LASSLAVIER+HQ+FFS+R+L +VDVRN+LASEQ+KILAGCR+VFSR+FPVGEANPHLHP Sbjct: 1057 LASSLAVIERIHQSFFSNRALDEVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHP 1116 Query: 3837 LWQTAEQFGAVCTIQIDEQVTHVVANSLGTDKVNWALSTGRF 3962 LWQTAE FGAVCT QIDEQVTHVVANSLGTDKVNWALSTGRF Sbjct: 1117 LWQTAESFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRF 1158 >ref|XP_002304648.2| hypothetical protein POPTR_0003s16280g [Populus trichocarpa] gi|550343308|gb|EEE79627.2| hypothetical protein POPTR_0003s16280g [Populus trichocarpa] Length = 1247 Score = 1017 bits (2629), Expect = 0.0 Identities = 607/1277 (47%), Positives = 776/1277 (60%), Gaps = 51/1277 (3%) Frame = +3 Query: 285 SVEEISEEDF-KQEAKVL---------NPKGGDSRVW-MGDLLNYPVSSNYGSGLYNFAW 431 SVEEISE+DF KQE V+ N +VW + DL Y V Y SGLYN AW Sbjct: 30 SVEEISEDDFNKQEVVVVKETPSSTTNNNSSSKQKVWTVRDLYKYQVGGGYMSGLYNLAW 89 Query: 432 AQAVQNKPLTEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXXMKEVCNVIIDDSSEE 611 AQAVQNKPL E+ + + E ++ S++S + + ++ V+IDDS +E Sbjct: 90 AQAVQNKPLNELFV-EVEVDDSSQKSSVSSVNSSK-----------EDKRTVVIDDSGDE 137 Query: 612 
IDSKAQDVXXXXXXXXXXXXXXXXLDTEMVE-ETEGGWSNANDSLPSDSGRNSEGEFEKQ 788 +D +D E E E E G + + S+ G S + EK+ Sbjct: 138 MD------------------VVKVIDIEKEEGELEEGEIDLDSEGKSEGGMVSV-DTEKR 178 Query: 789 IKSIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIM--ENGAPDVDDLIQQSFTGIQ 962 +KSIR LE+V+V +KSF VCL+L +L+SLK ++ ENG P D L++ FT I Sbjct: 179 VKSIREDLESVSVIKDDKSFEAVCLKLHNALESLKELVRVNENGFPSKDSLVRLLFTAIG 238 Query: 963 AINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFSPERMKE----------IEAMMYG 1112 A+NS F SMN K +EQNK +F+R L+ V S D + FSPE KE I ++ Y Sbjct: 239 AVNSFFSSMNQKLKEQNKGVFMRFLSLVNSHDPSFFSPEHTKEVCDFCNFDFRIVSLCYD 298 Query: 1113 LDSEAVVSHVKAMKKDNGTNPNEFGILGENPGQVLNSSNKILLEPIPVKSGDQNIANMGS 1292 L + ++ + + + N F I PG S +LL + Sbjct: 299 LTT---MNRLPSAAESFVHNKPNFSIEPPKPGVPSFKSRGVLLPLL-------------- 341 Query: 1293 ETXXXXXXXXXXXXXXXXXXXDLHRDHDADSLPSPTRETPPFLPVQKLKVVGDGLSKSEL 1472 DL + HD DSLPSPTRET P PVQ+L +GDG+ S L Sbjct: 342 ---------------------DLKKFHDEDSLPSPTRETAPSFPVQRLLPIGDGMISSGL 380 Query: 1473 ATPKIADESEDSTLHHYETDALKALSSYQQKFGRTSNILTSRLPSPTPSEECDDGDGDST 1652 PK+A +E+ +H YETDALKA+SSYQ+KF S T+ LPSPTPSEE +GDGD+ Sbjct: 381 PVPKVASITEEPRVHPYETDALKAVSSYQKKFNLNS-FFTNELPSPTPSEESGNGDGDTA 439 Query: 1653 GEVSSFSTVGDVRNVNLPLPLRSVGSPTP--------------HMDSSMGQRQMPAKTAG 1790 GEVSS STV + R VN P+ R SP+P H+++S + +P + + Sbjct: 440 GEVSSSSTV-NYRTVNPPVSDRKSASPSPSPPPPPPPPPPPPPHLNNSSIRVVIPTRNSA 498 Query: 1791 HLACVSNPVLRAPAKNRDPRLRFANSEGDALDLNQRPLL--EGATKSDTLGGIISSRKHN 1964 ++ ++ ++A AK+RDPRLR+ N++ ALD NQR LL +++ G I SRK Sbjct: 499 PVSSGTSSTVKASAKSRDPRLRYVNTDASALDQNQRTLLMVNNPPRAEPSGAIAGSRKQK 558 Query: 1965 IVVESVLDGQTLKRQRNGLTDFAVSKDVQMVSGSCGWLEESSTVGTQATDVNRLAKNMGT 2144 I E VLDG +LKRQRN +F V +D++ ++G+ GWLE++ Q + N+ A+N Sbjct: 559 IE-EDVLDGTSLKRQRNSFDNFGVVRDIRSMTGTGGWLEDTDMAEPQTVNKNQWAENAEP 617 Query: 2145 DLRKSENGEI---VSSERRDIDASSNLNV------SVGGNEPLPMICTGTTASLPSLLRD 2297 R + NG + S + S N+ V ++ G+E P+ T TTASLP LL+D Sbjct: 618 GQRIN-NGVVCPSTGSVMSSVSCSGNVQVPVMGINTIAGSEQAPVTST-TTASLPDLLKD 675 Query: 2298 
IAVNPTMLMQLIM--EQQRLAAGAQKKSSDSAQNMKPASSSSVIPGIAPLVNSASSKSSE 2471 I VNPTML+ ++ +QQRLA Q+K +D A++ SS+ + G P VN+ SS S Sbjct: 676 ITVNPTMLINILKMGQQQRLALDGQQKLADPAKSTSHPPSSNTVLGAIPEVNAVSSLPSG 735 Query: 2472 IEQKPAVRHKVPAQTTSMNPQGEWGQIRMKPRDPRRILHSSTFQKNESLGSDKFKTNGAP 2651 I + A + + P+Q + + E G+IRMKPRDPRR+LH++ Q+ SLGS++FKT Sbjct: 736 ILPRSAGKAQGPSQIATTD---ESGKIRMKPRDPRRVLHNNALQRAGSLGSEQFKTTTLT 792 Query: 2652 XXXXXXXXXXXXXXXXREQAQTTSLXXXXXXXXXXXXXFAEKLKNLAHMLSTSQATNTPP 2831 Q Q F + LKN+A ++S SQ TPP Sbjct: 793 STTQGTKDNQNL------QKQEGLAELKPVVPPDISSPFTKSLKNIADIVSVSQTCTTPP 846 Query: 2832 TVSQSISSQPEPVKTEKAGVGAVVTELSDQQIGIGAKPEESIAGPARLQNPWGDVEQLFE 3011 VSQ+++SQP +K+++ G SDQ++G + PE +A + QN W DVE LFE Sbjct: 847 FVSQNVASQPVQIKSDRVD-GKTGISNSDQKMGPASSPEV-VAASSLSQNTWEDVEHLFE 904 Query: 3012 GYDDXXXXXXXXXXXXXXXXXNKMFAARKXXXXXXXXXXXXNSAKFVEVDPLHDEILRKK 3191 GYDD K+FAARK NSAKFVEVDP+HDEILRKK Sbjct: 905 GYDDQQKAAIQRERARRIEEQKKLFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKK 964 Query: 3192 EEQDREKPHRHLFRFPHMGMWTKLRPGIWNFLEKASKLYEMHLYTMGNKLYATEMAKVLD 3371 EEQDREKP+RHLFRFPHMGMWTKLRPGIWNFLEKASKLYE+HLYTMGNKLYATEMAKVLD Sbjct: 965 EEQDREKPYRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLD 1024 Query: 3372 PTGALFAGRVISKGDDGDPFDGEEKLPKNKDLEGVLGMESAVVIIDDSVRVWPHNKLNLI 3551 P G LFAGRV+S+GDDGD DG+E++PK+KDLEGVLGMES VVIIDDS+RVWPHNKLNLI Sbjct: 1025 PKGVLFAGRVVSRGDDGDLLDGDERVPKSKDLEGVLGMESGVVIIDDSLRVWPHNKLNLI 1084 Query: 3552 VVERYTYFPCSRRQFGLLGPSLLEIDHDERPEEGTLASSLAVIERLHQTFFSHRSLHDVD 3731 VVERY YFPCSRRQFGL GPSLLEIDHDERPE+GTLA SLAVIER+HQ FF+H SL + D Sbjct: 1085 VVERYIYFPCSRRQFGLPGPSLLEIDHDERPEDGTLACSLAVIERIHQNFFTHHSLDEAD 1144 Query: 3732 VRNVLASEQQKILAGCRVVFSRIFPVGEANPHLHPLWQTAEQFGAVCTIQIDEQVTHVVA 3911 VRN+LASEQ+KILAGCR+VFSR+FPVGE NPHLHPLWQ+AEQFGAVCT QIDEQVTHVVA Sbjct: 1145 VRNILASEQRKILAGCRIVFSRVFPVGEVNPHLHPLWQSAEQFGAVCTNQIDEQVTHVVA 1204 Query: 3912 NSLGTDKVNWALSTGRF 3962 NSLGTDKVNWALSTGRF Sbjct: 1205 NSLGTDKVNWALSTGRF 1221 >ref|XP_002297869.2| CTD phosphatase-like protein 3 [Populus trichocarpa] 
gi|550347145|gb|EEE82674.2| CTD phosphatase-like protein 3 [Populus trichocarpa] Length = 1190 Score = 975 bits (2520), Expect = 0.0 Identities = 583/1248 (46%), Positives = 741/1248 (59%), Gaps = 23/1248 (1%) Frame = +3 Query: 285 SVEEISEEDFKQEAKVL---NPKGGDS--RVW-MGDLLNYPVSSNYGSGLYNFAWAQAVQ 446 SVEEISEEDF ++ V+ P +S +VW + DL Y V Y SGLYN AWA+AVQ Sbjct: 28 SVEEISEEDFNKQEVVIVKETPSSNNSSQKVWTVRDLYKYQVGGGYMSGLYNLAWARAVQ 87 Query: 447 NKPLTEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXXMKEVCNVIIDDSSEEIDS-K 623 NKPL E+ V+IDDS +E+D K Sbjct: 88 NKPLNEL--------------------------------------TVVIDDSGDEMDVVK 109 Query: 624 AQDVXXXXXXXXXXXXXXXXLDTEMVEETEGGWSNANDSLPSDSGRNSEGEFEKQIKSIR 803 D+ + E E EG ++ + S + E ++KSIR Sbjct: 110 VIDI-----------------EKEEGELEEGEIDLDSEPVVVQSEGMVSVDVENRVKSIR 152 Query: 804 GALETVTVKYAEKSFHGVCLELQASLDSLKLMI--MENGAPDVDDLIQQSFTGIQAINSV 977 LE+V+V EKSF VCL+L L+SLK ++ +N P D L+Q F I+ +NSV Sbjct: 153 KDLESVSVIETEKSFEAVCLKLHKVLESLKELVGGNDNSFPSKDGLVQLLFMAIRVVNSV 212 Query: 978 FCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFSPERMKEIEAMMYGLDSEAVVSHVKAMKK 1157 FCSMN K +EQNK +F R + + S FSP + KE+ Sbjct: 213 FCSMNKKLKEQNKGVFSRFFSLLNSHYPPFFSPGQNKEV--------------------- 251 Query: 1158 DNGTNPNEFGILGENPGQVLNSSNKILLEPIPV-KSGDQNIANMGSETXXXXXXXXXXXX 1334 N N L + G L + + E +P ++ QN N E Sbjct: 252 ---LNENHNDSLAKTAGYDLTTMS----EKLPAAETFVQNKPNKSIEAPKPPGVPSFKSR 304 Query: 1335 XXXXXXXDLHRDHDADSLPSPTRETPPFLPVQKLKVVGDGLSKSELATPKIADESEDSTL 1514 DL + HD DSLPSPT+ET PF PVQ+L +GDG+ S L PK+ +E+ + Sbjct: 305 GVLLPLLDLKKYHDEDSLPSPTQETTPF-PVQRLLAIGDGMVSSGLPVPKVTPVAEEPRM 363 Query: 1515 HHYETDALKALSSYQQKFGRTSNILTSRLPSPTPSEECDDGDGDSTGEVSSFSTVGDVRN 1694 H YETDALKA+SSYQQKF R S T+ LPSPTPSEE +GDGD+ GEVSS STV + R Sbjct: 364 HPYETDALKAVSSYQQKFNRNS-FFTNELPSPTPSEESGNGDGDTAGEVSSSSTVVNYRT 422 Query: 1695 VNLPLPLRSVGSPTP--------HMDSSMGQRQMPAKTAGHLACVSNPVLRAPAKNRDPR 1850 VN P+ + P+P H DSS + +P + + ++ + ++A AK+RDPR Sbjct: 423 VNPPVSDQKNAPPSPPPLPPPPPHPDSSNIRGVVPTRNSAPVSSGPSSTIKASAKSRDPR 482 Query: 1851 
LRFANSEGDALDLNQR--PLLEGATKSDTLGGIISSRKHNIVVESVLDGQTLKRQRNGLT 2024 LR+ N + ALD NQR P++ + + G I+ S+KH I E VLD +LKRQRN Sbjct: 483 LRYVNIDACALDHNQRALPMVNNLPRVEPAGAIVGSKKHKIE-EDVLDDPSLKRQRNSFD 541 Query: 2025 DFAVSKDVQMVSGSCGWLEESSTVGTQATDVNRLAKNMGTDLRKSENGEIVSSERRDIDA 2204 ++ +D++ ++G+ GWLE++ Q + N+ A+N +++ S N + Sbjct: 542 NYGAVRDIESMTGTGGWLEDTDMAEPQTVNKNQWAEN--SNVNGSGNAQ----------- 588 Query: 2205 SSNLNVS-VGGNEPLPMICTGTTASLPSLLRDIAVNPTMLMQLIM--EQQRLAAGAQKKS 2375 S + +S + G+E + T TT SLP LL+DIAVNPTML+ ++ +QQRLA Q+ Sbjct: 589 SPFMGISNITGSEQAQVTSTATT-SLPDLLKDIAVNPTMLINILKMGQQQRLALDGQQTL 647 Query: 2376 SDSAQNMKPASSSSVIPGIAPLVNSASSKSSEIEQKPAVRHKVPAQTTSMNPQGEWGQIR 2555 SD A++ S+ + G P VN ASS+ S I +PA VP+Q + + E G+IR Sbjct: 648 SDPAKSTSHPPISNTVLGAIPTVNVASSQPSGIFPRPAGT-PVPSQIATSD---ESGKIR 703 Query: 2556 MKPRDPRRILHSSTFQKNESLGSDKFKTNGAPXXXXXXXXXXXXXXXXREQAQTTSLXXX 2735 MKPRDPRR LH+++ Q+ S+GS++FKT Q Q Sbjct: 704 MKPRDPRRFLHNNSLQRAGSMGSEQFKTT------TLTPTTQGTKDDQNVQKQEGLAELK 757 Query: 2736 XXXXXXXXXXFAEKLKNLAHMLSTSQATNTPPTVSQSISSQPEPVKTEKAGVGAVVTELS 2915 F + L+N+A +LS SQA+ TPP +SQ+++SQP K+E+ G +S Sbjct: 758 PTVPPDISFPFTKSLENIADILSVSQASTTPPFISQNVASQPMQTKSERVD-GKTGISIS 816 Query: 2916 DQQIGIGAKPEESIAGPARLQNPWGDVEQLFEGYDDXXXXXXXXXXXXXXXXXNKMFAAR 3095 DQ+ G + PE +A + QN W DVE LFEGYDD KMFAAR Sbjct: 817 DQKTGPASSPEV-VAASSHSQNTWKDVEHLFEGYDDQQKAAIQRERARRLEEQKKMFAAR 875 Query: 3096 KXXXXXXXXXXXXNSAKFVEVDPLHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 3275 K NSAK + LHDEILRKKEEQDREKP+RH+FR PHMGMWTKLRPGI Sbjct: 876 KLCLVLDLDHTLLNSAKAILSSSLHDEILRKKEEQDREKPYRHIFRIPHMGMWTKLRPGI 935 Query: 3276 WNFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGALFAGRVISKGDDGDPFDGEEKLPK 3455 WNFLEKASKL+E+HLYTMGNKLYATEMAKVLDP G LFAGRVIS+GDDGDPFDG+E++PK Sbjct: 936 WNFLEKASKLFELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPK 995 Query: 3456 NKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHD 3635 +KDLEGVLGMES VVIIDDSVRVWPHNKLNLIVVERY YFPCSRRQFGL GPSLLEIDHD Sbjct: 996 SKDLEGVLGMESGVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHD 
1055 Query: 3636 ERPEEGTLASSLAVIERLHQTFFSHRSLHDVDVRNVLASEQQKILAGCRVVFSRIFPVGE 3815 ERPE+GTLA S AVIE++HQ FF+HRSL + DVRN+LASEQ+KIL GCR++FSR+FPVGE Sbjct: 1056 ERPEDGTLACSFAVIEKIHQNFFTHRSLDEADVRNILASEQRKILGGCRILFSRVFPVGE 1115 Query: 3816 ANPHLHPLWQTAEQFGAVCTIQIDEQVTHVVANSLGTDKVNWALSTGR 3959 NPHLHPLWQ AEQFGAVCT QIDEQVTHVVANSLGTDKVNWALSTGR Sbjct: 1116 VNPHLHPLWQMAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGR 1163 >ref|XP_004140651.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Cucumis sativus] Length = 1249 Score = 953 bits (2464), Expect = 0.0 Identities = 576/1258 (45%), Positives = 742/1258 (58%), Gaps = 32/1258 (2%) Frame = +3 Query: 285 SVEEISEEDFKQEAKVLNPK--------GGDSRVW-MGDLL-NYPVSSN-YGSGLYNFAW 431 SVEEISEEDF + +PK ++RVW M DL NYP + Y SGLYN AW Sbjct: 22 SVEEISEEDFNKLDSSASPKVVVPSKDSNRETRVWTMSDLYKNYPAMRHGYASGLYNLAW 81 Query: 432 AQAVQNKPLTEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXXMKEVCNVIIDDSSEE 611 AQAVQNKPL +I + + + +EKSK S S KE V+IDDS +E Sbjct: 82 AQAVQNKPLNDIFVMEADLDEKSKHSSST----PFGNAKDDGSNTTKEEDRVVIDDSGDE 137 Query: 612 IDSKAQDVXXXXXXXXXXXXXXXXLDTEMVEETEGGWSNANDSLPSD-SGRNSE---GEF 779 ++ D +DTE VEE + +DS D +G+ + E Sbjct: 138 MNC---DNANGEKEEGELEEGEIDMDTEFVEEVADSKAMLSDSRDMDINGQEFDLETKEL 194 Query: 780 EKQIKSIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGAPDVDDLIQQSFTGI 959 ++ +K I+ L+ VT+ A+KSF VC ++ +S+++ ++ P D LIQ+ + + Sbjct: 195 DELLKFIQKTLDGVTIDAAQKSFQEVCSQIHSSIETFVELLQGKVVPRKDALIQRLYAAL 254 Query: 960 QAINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFSPERMKEIEAMMYGLDSEAVVSH 1139 + INSVFCSMN ++E++K+ RLL++VK+ D LFSPE++K +E M DS + Sbjct: 255 RLINSVFCSMNLSEKEEHKEHLSRLLSYVKNCDPPLFSPEQIKSVEVKMPSTDSLDHLPS 314 Query: 1140 VKAMKKD------NGTNPNEFGILGENPGQVLNSSNKILLEPIPVKSGDQNIANMGSETX 1301 ++ K+ NG +F + L SNK+ + IP +N N+ SE Sbjct: 315 MRGSAKEVEIHIPNGVKDMDFYSAYTSTSSQLTPSNKLASDSIPFGVKGKNNLNILSE-G 373 Query: 1302 XXXXXXXXXXXXXXXXXXDLHRDHDADSLPSPTRETPPFLPVQKLKVVGDGLSKSELATP 1481 DLH+DHDADSLPSPTRE P VQK S A Sbjct: 374 LQSGVSSIKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQK----------SGNAPT 423 
Query: 1482 KIADESEDSTLHHYETDALKALSSYQQKFGRTSNILTSRLPSPTPSEECDDGDGDSTGEV 1661 K+A + S H YETDALKA+S+YQQKFGR+S + RLPSPTPSEE DG GD GEV Sbjct: 424 KMAFPVDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEE-HDGGGDIGGEV 482 Query: 1662 SSFSTVGDVRNVNLPLPLRSVGSPT-------PHMDSSMGQRQMPAKTAGHLACVSNPVL 1820 SS S + +++ N+ P + S + P+MDSS + + + VSNP + Sbjct: 483 SSSSIIRSLKSSNVSKPGQKSNSASNVSTGLFPNMDSSSTRVLISPLNVAPPSSVSNPTV 542 Query: 1821 RAPAKNRDPRLRFANSEGDALDLNQRPLLEGATKSDT-LGGIISSRKHNIVVESVLDGQT 1997 + AK+RDPRLR NS+ +DLN R + + S + RK + E DG Sbjct: 543 KPLAKSRDPRLRIVNSDASGMDLNPRTMASVQSSSILESAATLHLRKQKMDGEPNTDGPE 602 Query: 1998 LKRQRNGLTDFAVS-KDVQMVSGSCGWLEESSTVGTQATDVNRLAKNMGTDLRKSENGEI 2174 +KR R G + AV+ DV+ VSGS GWLE++ G + + N++ E E Sbjct: 603 VKRLRIGSQNLAVAASDVRAVSGSGGWLEDTMPAGPRLFNRNQM-----------EIAEA 651 Query: 2175 VSSERRDIDASSNLNVSVGGNEPLPMICTGTTASLPSLLRDIAVNPTMLMQLIM--EQQR 2348 ++E+ ++ N S GNE P + ASLPSLL+DI VNPTML+ L+ +QQ+ Sbjct: 652 NATEKSNVT-----NNSGSGNECTPTVNNSNDASLPSLLKDIVVNPTMLLNLLKMSQQQQ 706 Query: 2349 LAAGAQKKSSDSAQNMKPASSSSVIPGIAPLVNSASSKSSEIEQKPAVRHKVPAQTTSMN 2528 LAA + KSS+ +N +S + G +PL+N+ + S ++Q P+ + + Sbjct: 707 LAAELKLKSSEPEKNAICPTSLNPCQGSSPLINAPVATSGILQQSAGT----PSASPVVG 762 Query: 2529 PQGEWGQIRMKPRDPRRILHSSTFQKNESLGSDKFKTNGAPXXXXXXXXXXXXXXXXREQ 2708 Q + G++RMKPRDPRR+LH ++ QK SLG+D+ K P +E Sbjct: 763 RQDDLGKVRMKPRDPRRVLHGNSLQKVGSLGNDQLK-GVVPTASNTEGSRDIPNGHKQEG 821 Query: 2709 AQTTSLXXXXXXXXXXXXXFAEKLKNLAHMLSTSQATNTPPTVSQSISSQPEPVKTEKAG 2888 + L F LKN+A ++S +PPT S + SS+P Sbjct: 822 QGDSKLASSQTILPDIGRQFTNNLKNIADIMSVP----SPPTSSPNSSSKP--------- 868 Query: 2889 VGAVVTELSDQQIGIGAKPEESIAGPARLQNPWGDVEQLFEGYDDXXXXXXXXXXXXXXX 3068 V + D + A +A +R Q WGD+E LF+ YDD Sbjct: 869 ---VGSSSMDSKPVTTAFQAVDMAASSRSQGAWGDLEHLFDSYDDKQKAAIQRERARRIE 925 Query: 3069 XXNKMFAARKXXXXXXXXXXXXNSAKFVEVDPLHDEILRKKEEQDREKPHRHLFRFPHMG 3248 KMFAARK NSAKFVEVDP+HDEILRKKEEQDREK RHLFRFPHMG Sbjct: 926 EQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMG 985 Query: 3249 
MWTKLRPGIWNFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGALFAGRVISKGDDGDP 3428 MWTKLRPG+WNFLEKAS+LYE+HLYTMGNKLYATEMAKVLDP G LFAGRVIS+GDDGDP Sbjct: 986 MWTKLRPGVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1045 Query: 3429 FDGEEKLPKNKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 3608 DG++++PK+KDLEGVLGMES VVIIDDS+RVWPHNK+NLIVVERYTYFPCSRRQFGLLG Sbjct: 1046 LDGDDRVPKSKDLEGVLGMESGVVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLG 1105 Query: 3609 PSLLEIDHDERPEEGTLASSLAVIERLHQTFFSHRSLHDVDVRNVLASEQQKILAGCRVV 3788 PSLLEIDHDERPE+GTLASSL VI+R+HQ+FFS+ L VDVR +L++EQQKILAGCR+V Sbjct: 1106 PSLLEIDHDERPEDGTLASSLGVIQRIHQSFFSNPELDQVDVRTILSAEQQKILAGCRIV 1165 Query: 3789 FSRIFPVGEANPHLHPLWQTAEQFGAVCTIQIDEQVTHVVANSLGTDKVNWALSTGRF 3962 FSR+FPVGEANPHLHPLWQTAEQFGA CT QIDEQVTHVVANSLGTDKVNWALSTGRF Sbjct: 1166 FSRVFPVGEANPHLHPLWQTAEQFGAQCTNQIDEQVTHVVANSLGTDKVNWALSTGRF 1223 >ref|XP_004157633.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain phosphatase-like 3-like [Cucumis sativus] Length = 1249 Score = 952 bits (2462), Expect = 0.0 Identities = 576/1258 (45%), Positives = 741/1258 (58%), Gaps = 32/1258 (2%) Frame = +3 Query: 285 SVEEISEEDFKQEAKVLNPK--------GGDSRVW-MGDLL-NYPVSSN-YGSGLYNFAW 431 SVEEISEEDF + +PK ++RVW M DL NYP + Y SGLYN AW Sbjct: 22 SVEEISEEDFNKLDSSASPKVVVPSKDSNRETRVWTMSDLYKNYPAMRHGYASGLYNLAW 81 Query: 432 AQAVQNKPLTEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXXMKEVCNVIIDDSSEE 611 AQAVQNKPL +I + + + +EKSK S S KE V+IDDS +E Sbjct: 82 AQAVQNKPLNDIFVMEADLDEKSKHSSST----PFGNAKDDGSNTTKEEDRVVIDDSGDE 137 Query: 612 IDSKAQDVXXXXXXXXXXXXXXXXLDTEMVEETEGGWSNANDSLPSD-SGRNSE---GEF 779 ++ D +DTE VEE + +DS D +G+ + E Sbjct: 138 MNC---DNANGEKEEGELEEGEIDMDTEFVEEVADSKAMLSDSRDMDINGQEFDLETKEL 194 Query: 780 EKQIKSIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGAPDVDDLIQQSFTGI 959 ++ +K I+ L+ VT+ A+KSF VC ++ +S+++ ++ P D LIQ+ + + Sbjct: 195 DELLKFIQKTLDGVTIDAAQKSFQEVCSQIHSSIETFVELLQGKVVPRKDALIQRLYAAL 254 Query: 960 QAINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFSPERMKEIEAMMYGLDSEAVVSH 1139 + INSVFCSMN ++E++K+ RLL++VK+ D 
LFSPE++K +E M DS + Sbjct: 255 RLINSVFCSMNLSEKEEHKEHLSRLLSYVKNCDPPLFSPEQIKSVEVKMPSTDSLDHLPS 314 Query: 1140 VKAMKKD------NGTNPNEFGILGENPGQVLNSSNKILLEPIPVKSGDQNIANMGSETX 1301 ++ K+ NG +F + L SNK+ + IP +N N+ SE Sbjct: 315 MRGSAKEVEIHIPNGVKDMDFYSAYTSTSSQLTPSNKLASDSIPFGVKGKNNLNILSE-G 373 Query: 1302 XXXXXXXXXXXXXXXXXXDLHRDHDADSLPSPTRETPPFLPVQKLKVVGDGLSKSELATP 1481 DLH+DHDADSLPSPTRE P VQK S A Sbjct: 374 LQSGVSSIKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQK----------SGNAPT 423 Query: 1482 KIADESEDSTLHHYETDALKALSSYQQKFGRTSNILTSRLPSPTPSEECDDGDGDSTGEV 1661 K+A + S H YETDALKA+S+YQQKFGR+S + RLPSPTPSEE DG GD GEV Sbjct: 424 KMAFPVDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEE-HDGGGDIGGEV 482 Query: 1662 SSFSTVGDVRNVNLPLPLRSVGSPT-------PHMDSSMGQRQMPAKTAGHLACVSNPVL 1820 SS S + +++ N+ P + S + P+MDSS + + + VSNP + Sbjct: 483 SSSSIIRSLKSSNVSKPGQKSNSASNVSTGLFPNMDSSSTRVLISPLNVAPPSSVSNPTV 542 Query: 1821 RAPAKNRDPRLRFANSEGDALDLNQRPLLEGATKSDT-LGGIISSRKHNIVVESVLDGQT 1997 + AK+RDPRLR NS+ +DLN R + + S + RK + E DG Sbjct: 543 KPLAKSRDPRLRIVNSDASGMDLNPRTMASVQSSSILESAATLHLRKQKMDGEPNTDGPE 602 Query: 1998 LKRQRNGLTDFAVS-KDVQMVSGSCGWLEESSTVGTQATDVNRLAKNMGTDLRKSENGEI 2174 +KR R G + AV+ DV+ VSGS GWLE++ G + + N++ E E Sbjct: 603 VKRLRIGSQNLAVAASDVRAVSGSGGWLEDTMPAGPRLFNRNQM-----------EIAEA 651 Query: 2175 VSSERRDIDASSNLNVSVGGNEPLPMICTGTTASLPSLLRDIAVNPTMLMQLIM--EQQR 2348 ++E+ ++ N S GNE P + ASLPSLL+DI VNPTML+ L+ +QQ+ Sbjct: 652 NATEKSNVT-----NNSGSGNECTPTVNNSNDASLPSLLKDIVVNPTMLLNLLKMSQQQQ 706 Query: 2349 LAAGAQKKSSDSAQNMKPASSSSVIPGIAPLVNSASSKSSEIEQKPAVRHKVPAQTTSMN 2528 LAA + KSS+ +N +S + G +PL+N+ + S ++Q P+ + + Sbjct: 707 LAAELKLKSSEPEKNAICPTSLNPCQGSSPLINAPVATSGILQQSAGT----PSASPVVG 762 Query: 2529 PQGEWGQIRMKPRDPRRILHSSTFQKNESLGSDKFKTNGAPXXXXXXXXXXXXXXXXREQ 2708 Q + G++RMKPRDPRR+LH ++ QK SLG+D+ K P +E Sbjct: 763 RQDDLGKVRMKPRDPRRVLHGNSLQKVGSLGNDQLK-GVVPTASNTEGSRDIPNGHKQEG 821 Query: 2709 AQTTSLXXXXXXXXXXXXXFAEKLKNLAHMLSTSQATNTPPTVSQSISSQPEPVKTEKAG 2888 + L F LKN+A ++S +PPT S + SS+P Sbjct: 822 
QGDSKLASSQTILPDIGRQFTNNLKNIADIMSVP----SPPTSSPNSSSKP--------- 868 Query: 2889 VGAVVTELSDQQIGIGAKPEESIAGPARLQNPWGDVEQLFEGYDDXXXXXXXXXXXXXXX 3068 V + D + A +A +R Q WGD+E LF+ YDD Sbjct: 869 ---VGSSSMDSKPVTTAFQAVDMAASSRSQGAWGDLEHLFDSYDDKQKAAIQRERARRIE 925 Query: 3069 XXNKMFAARKXXXXXXXXXXXXNSAKFVEVDPLHDEILRKKEEQDREKPHRHLFRFPHMG 3248 KMFAARK NSAKFVEVDP+HDEILRKKEEQDREK RHLFRFPHMG Sbjct: 926 EQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMG 985 Query: 3249 MWTKLRPGIWNFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGALFAGRVISKGDDGDP 3428 MWTKLRPG+WNFLEKAS+LYE+HLYTMGNKLYATEMAKVLDP G LFAGRVIS+GDDGDP Sbjct: 986 MWTKLRPGVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1045 Query: 3429 FDGEEKLPKNKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 3608 DG++++PK+KDLEGVLGMES VVIIDDS+RVWPHNK+NLIVVERYTYFPCSRRQFGLLG Sbjct: 1046 LDGDDRVPKSKDLEGVLGMESGVVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLG 1105 Query: 3609 PSLLEIDHDERPEEGTLASSLAVIERLHQTFFSHRSLHDVDVRNVLASEQQKILAGCRVV 3788 PSLLEIDHDERPE+GTLASSL VI+R+HQ FFS+ L VDVR +L++EQQKILAGCR+V Sbjct: 1106 PSLLEIDHDERPEDGTLASSLGVIQRIHQXFFSNPELDQVDVRTILSAEQQKILAGCRIV 1165 Query: 3789 FSRIFPVGEANPHLHPLWQTAEQFGAVCTIQIDEQVTHVVANSLGTDKVNWALSTGRF 3962 FSR+FPVGEANPHLHPLWQTAEQFGA CT QIDEQVTHVVANSLGTDKVNWALSTGRF Sbjct: 1166 FSRVFPVGEANPHLHPLWQTAEQFGAQCTNQIDEQVTHVVANSLGTDKVNWALSTGRF 1223 >ref|XP_003530482.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Glycine max] Length = 1261 Score = 937 bits (2423), Expect = 0.0 Identities = 584/1251 (46%), Positives = 740/1251 (59%), Gaps = 25/1251 (1%) Frame = +3 Query: 285 SVEEISEEDF-KQEAKVLN----PKGGDSRVW-MGDLLN-YP-VSSNYGSGLYNFAWAQA 440 SVEEIS EDF KQ+ K+LN P G D+RVW + DL + YP + Y SGLYN AWAQA Sbjct: 35 SVEEISAEDFNKQDVKLLNNNNKPNGSDARVWAVHDLYSKYPTICRGYASGLYNLAWAQA 94 Query: 441 VQNKPLTEILMRDFESEEK--SKRSGSNLLXXXXXXXXXXXXXXMKEVCNVIIDDSSEEI 614 VQNKPL +I + + +S+ S R+ S+ L K+V V +D E+ Sbjct: 95 VQNKPLNDIFVMEVDSDANANSNRNSSHRLASVAVNP--------KDVVVVDVDKEEGEL 146 Query: 615 
DSKAQDVXXXXXXXXXXXXXXXXLDTEMVEETEGGWSNANDSLPSDSGRNSEGEFEKQIK 794 + D D E E E +DS D + + E+ Sbjct: 147 EEGEIDA-----------------DAEPEGEAESVVVAVSDSEKLDDVKMDVSDSEQL-- 187 Query: 795 SIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGAPDVDDLIQQSFTGIQAINS 974 RG LE VTV +SF C +LQ +L + + + DDL++ SF + + S Sbjct: 188 GARGVLEGVTVANVVESFAQTCSKLQNTLPEV---LSRPAGSEKDDLVRLSFNATEVVYS 244 Query: 975 VFCSMNPKQQEQNKDLFLRLLTHVKSQDTT-LFSPERMKEIEAMMYGLDSEAVVSHVKAM 1151 VFCSM+ ++EQNKD LRLL+ VK Q LFSPE +KEI+ MM +DS + + +A+ Sbjct: 245 VFCSMDSSEKEQNKDSILRLLSFVKDQQQAQLFSPEHVKEIQGMMTAIDSVGALVNSEAI 304 Query: 1152 KKDNGTNPNEFGILGENPGQV----LNSSNKILLEPIPVKSGDQNIAN--MGSETXXXXX 1313 K+ E + +V + + +E + S + + G+ Sbjct: 305 GKEKELQTTEIKTQENSAVEVQIHEIKTQENQAVEAAELISYSKPLHRDITGTSQALKFG 364 Query: 1314 XXXXXXXXXXXXXXDLHRDHDADSLPSPTRETPPFLPVQKLKVVGDGLSKSELATPKIAD 1493 DLH+DHDADSLPSPTRE P PV KL VG+ + +S A+ K+ Sbjct: 365 QNSIKGRGVLLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGESMVRSGSASAKMEL 424 Query: 1494 ESEDSTLHHYETDALKALSSYQQKFGRTSNILTSRLPSPTPSEECDDGDGDSTGEVSSFS 1673 +SE S H YETDALKA+S+YQQKFGR+S + PSPTPS +C+D D+ EVSS S Sbjct: 425 DSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPSGDCEDEVVDTNEEVSSAS 484 Query: 1674 TVGDVRNVNLPLPLRSVGSPTPHMDSSMGQRQMPAKTAGHLACV---SNPVLRAPAKNRD 1844 T GD P L P +SM + M + + S PV ++ AKNRD Sbjct: 485 T-GDFLTSTKPTLL----DQPPVSATSMDRSSMHGFISSRVDATGPGSFPV-KSSAKNRD 538 Query: 1845 PRLRFANSEGDALDLNQRPLLEGATKSDTLGGIISSRKHNIVVESVLDGQTLKRQRNGLT 2024 PRLRF NS+ A+D N L+ +K + G IS RK E LD KR ++ L Sbjct: 539 PRLRFINSDASAVD-NLSTLINNMSKVEYSGTTIS-RKQKAAEEPSLDVTVSKRLKSSLE 596 Query: 2025 DFAVSKDVQMVSGSCGWLEESSTVGTQATDVNRLAKNMGTDLRKSENGEIVSSERRDIDA 2204 + + ++ +GS GWLEE++ G Q + N L G + +K+ N VSS Sbjct: 597 NTEHNMS-EVRTGSGGWLEENTGPGAQLIERNHLMDKFGPEAKKTLN--TVSSS---CTG 650 Query: 2205 SSNLNVSVGGNEPLPMICTGTTASLPSLLRDIAVNPTMLMQLIMEQQRLAAGAQKKSSDS 2384 S N N + NE P+ + ASLP+LL++ +VNP ML+ ++ RLA AQKKS+DS Sbjct: 651 SDNFNATSIRNEQAPITASNVLASLPALLKEASVNPIMLVNIL----RLAE-AQKKSADS 705 Query: 2385 
AQNM--KPASSSSVIPGIAPLVNSASSKSSEIEQKPAVRHKVPAQTTS--MNPQGEWGQI 2552 A M P SS+ + G + SS ++ + Q V +Q+TS Q + G+I Sbjct: 706 AAIMLLHPTSSNPAM-GTDSTASIGSSMATGLLQSSVGMLPVSSQSTSTAQTLQDDSGKI 764 Query: 2553 RMKPRDPRRILHSS-TFQKNESLGSDKFKTNGAPXXXXXXXXXXXXXXXXREQAQTTSLX 2729 RMKPRDPRRILH++ T QK+ LG+++FK +P + + Sbjct: 765 RMKPRDPRRILHTNNTIQKSGDLGNEQFKAIVSPVSNNQRTGDNVNAPKLEGRVDNKLVP 824 Query: 2730 XXXXXXXXXXXXFAEKLKNLAHMLSTSQATNTPPTVSQSISSQPEPVKTEKAGVGAVVTE 2909 F LKN+A ++S SQ ++T VSQ+ SS P+ +++ +VV+ Sbjct: 825 TQSSAQPDIARQFTRNLKNIADIMSVSQESSTHTPVSQNFSSASVPLTSDRGEQKSVVSS 884 Query: 2910 LSDQQIGIGAKPEESIAGPARLQNPWGDVEQLFEGYDDXXXXXXXXXXXXXXXXXNKMFA 3089 + Q + + E + + +R Q+ WGDVE LFEGYD+ NKMFA Sbjct: 885 SQNLQADMASAHETAASVTSRSQSTWGDVEHLFEGYDEQQKAAIQRERARRIEEQNKMFA 944 Query: 3090 ARKXXXXXXXXXXXXNSAKFVEVDPLHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 3269 ARK NSAKFVEVDPLHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP Sbjct: 945 ARKLCLVLDLDHTLLNSAKFVEVDPLHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 1004 Query: 3270 GIWNFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGALFAGRVISKGDDGDPFDGEEKL 3449 GIWNFLEKASKLYE+HLYTMGNKLYATEMAKVLDP G LFAGRVIS+GDD D DGEE++ Sbjct: 1005 GIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDTDSVDGEERV 1064 Query: 3450 PKNKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEID 3629 PK+KDLEGVLGMES+VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL GPSLLEID Sbjct: 1065 PKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEID 1124 Query: 3630 HDERPEEGTLASSLAVIERLHQTFFSHRSLHDVDVRNVLASEQQKILAGCRVVFSRIFPV 3809 HDERPE GTLASSLAVIE++HQ FF+ +SL +VDVRN+LASEQ+KILAGCR+VFSR+FPV Sbjct: 1125 HDERPEAGTLASSLAVIEKIHQIFFASQSLEEVDVRNILASEQRKILAGCRIVFSRVFPV 1184 Query: 3810 GEANPHLHPLWQTAEQFGAVCTIQIDEQVTHVVANSLGTDKVNWALSTGRF 3962 GEANPHLHPLWQTAEQFGAVCT QIDEQVTHVVANS GTDKVNWAL+ GRF Sbjct: 1185 GEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSPGTDKVNWALNNGRF 1235 >ref|XP_006603006.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Glycine max] Length = 1257 Score = 936 bits (2419), Expect = 0.0 Identities = 594/1259 (47%), 
Positives = 739/1259 (58%), Gaps = 33/1259 (2%) Frame = +3 Query: 285 SVEEISEEDF-KQEAKVLN----PKGGDSRVW-MGDLLN-YP-VSSNYGSGLYNFAWAQA 440 SVEEIS EDF KQ+ KVLN P G D+RVW + DL + YP + Y SGLYN AWAQA Sbjct: 35 SVEEISAEDFNKQDVKVLNNNNKPNGSDARVWAVHDLYSKYPTICRGYASGLYNLAWAQA 94 Query: 441 VQNKPLTEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXXMKEVCNVIIDDSSEEIDS 620 VQNKPL +I + + +S+ + + +N K+V V +D E++ Sbjct: 95 VQNKPLNDIFVMEVDSDANANSNSNN------SNRLASVAVNPKDVVVVDVDKEEGELEE 148 Query: 621 KAQDVXXXXXXXXXXXXXXXXL-DTEMVEETEGGWSNANDSLPSDSGRNSEGEFEKQIKS 797 D + D+E +++ + SN+ Sbjct: 149 GEIDADAEPEGEAESVVAVPVVSDSEKLDDVKRDVSNSEQL------------------G 190 Query: 798 IRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGAPDVDDLIQQSFTGIQAINSV 977 +RG LE VTV +SF C +LQ +L + + + DDL++ SF + + SV Sbjct: 191 VRGVLEGVTVANVAESFAQTCSKLQNALPEV---LSRPADSERDDLVRLSFNATEVVYSV 247 Query: 978 FCSMNPKQQEQNKDLFLRLLTHVKSQDTT-LFSPERMKEIEAMMYGLDSEAVVSHVKAMK 1154 FCSM+ ++EQNKD LRLL+ VK Q LFSPE +KEI+ MM +D + + +A+ Sbjct: 248 FCSMDSLKKEQNKDSILRLLSFVKDQQQAQLFSPEHIKEIQGMMTAIDYFGALVNSEAIG 307 Query: 1155 KDNG--TNPNEFGILGENPGQV----LNSSNKILLEPI-----PVKSGDQNIANMGSETX 1301 K+ T I + V L S NK L I +K G +I G Sbjct: 308 KEKELQTTVQTHEIKTQENQAVEAAELISYNKPLHSDIIGASHALKFGQNSIKGRG---- 363 Query: 1302 XXXXXXXXXXXXXXXXXXDLHRDHDADSLPSPTRETPPFLPVQKLKVVGDGL--SKSELA 1475 DLH+DHDADSLPSPTRE P PV KL VG+ + S S A Sbjct: 364 ------------VLLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGEPMVSSGSAAA 411 Query: 1476 TP---KIADESEDSTLHHYETDALKALSSYQQKFGRTSNILTSRLPSPTPSEECDDGDGD 1646 P K+ +SE S H YETDALKA+S+YQQKFGR+S + PSPTPS +C+D D Sbjct: 412 KPESGKMELDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPSGDCEDEIVD 471 Query: 1647 STGEVSSFSTVGDVRNVNLP--LPLRSVGSPTPHMDSSMGQRQMPAKTAGHLACVSNPVL 1820 + EVSS ST GD P L L V + + S G AG S PV Sbjct: 472 TNEEVSSAST-GDFLTSTKPTLLDLPPVSATSTDRSSLHGFISSRVDAAGP---GSLPV- 526 Query: 1821 RAPAKNRDPRLRFANSEGDALDLNQRPLLEGATKSDTLGGIISSRKHNIVVESVLDGQTL 2000 ++ AKNRDPRLRF NS+ A+D N L+ K + G IS RK E LD Sbjct: 527 KSSAKNRDPRLRFVNSDASAVD-NPSTLIHNMPKVEYAGTTIS-RKQKAAEEPSLDVTVS 584 
Query: 2001 KRQRNGLTDFAVSKDVQMVSGSCGWLEESSTVGTQATDVNRLAKNMGTDLRKSENGEIVS 2180 KRQ++ L + + ++ +G GWLEE + G Q + N L G + +K+ N VS Sbjct: 585 KRQKSPLENTEHNMS-EVRTGIGGWLEEHTGPGAQFIERNHLMDKFGPEPQKTLN--TVS 641 Query: 2181 SERRDIDASSNLNVSVGGNEPLPMICTGTTASLPSLLRDIAVNPTMLMQLIMEQQRLAAG 2360 S S N N + NE P+ + ASLP+LL+ AVNPTML+ L+ A Sbjct: 642 SS---CTGSDNFNATSIRNEQAPITSSNVLASLPALLKGAAVNPTMLVNLLR-----IAE 693 Query: 2361 AQKKSSDSAQNM--KPASSSSVIPGIAPLVNSASSKSSEIEQKPAVRHKVPAQTTSMNP- 2531 AQKKS+DSA NM P SS+S + G + SS ++ + Q V +Q+TSM Sbjct: 694 AQKKSADSATNMLLHPTSSNSAM-GTDSTASIGSSMATGLLQSSVGMLPVSSQSTSMTQT 752 Query: 2532 -QGEWGQIRMKPRDPRRILHSS-TFQKNESLGSDKFKTNGAPXXXXXXXXXXXXXXXXRE 2705 Q + G+IRMKPRDPRRILH++ T QK+ +LG+++FK +P Sbjct: 753 LQDDSGKIRMKPRDPRRILHTNNTIQKSGNLGNEQFKAIVSPVSNNQGTGDNVNAQKLEG 812 Query: 2706 QAQTTSLXXXXXXXXXXXXXFAEKLKNLAHMLSTSQATNTPPTVSQSISSQPEPVKTEKA 2885 + + + FA LKN+A ++S SQ ++T V+Q SS P+ +++ Sbjct: 813 RVDSKLVPTQPSAQPDIARQFARNLKNIADIMSVSQESSTHTPVAQIFSSASVPLTSDRG 872 Query: 2886 GVGAVVTELSDQQIGIGAKPEESIAGPARLQNPWGDVEQLFEGYDDXXXXXXXXXXXXXX 3065 +VV+ + + G+ + E + +G R QN WGDVE LFEGYD+ Sbjct: 873 EQKSVVSNSQNLEAGMVSAHETAASGTCRSQNTWGDVEHLFEGYDEQQKAAIQRERARRI 932 Query: 3066 XXXNKMFAARKXXXXXXXXXXXXNSAKFVEVDPLHDEILRKKEEQDREKPHRHLFRFPHM 3245 NKMFAARK NSAKFVEVDP+HDEILRKKEEQDREKPHRHLFRFPHM Sbjct: 933 EEQNKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 992 Query: 3246 GMWTKLRPGIWNFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGALFAGRVISKGDDGD 3425 GMWTKLRPGIWNFLEKASKLYE+HLYTMGNKLYATEMAKVLDP G LFAGRVIS+GDD D Sbjct: 993 GMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGLLFAGRVISRGDDTD 1052 Query: 3426 PFDGEEKLPKNKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 3605 DGEE+ PK+KDLEGVLGMES+VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL Sbjct: 1053 SVDGEERAPKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLP 1112 Query: 3606 GPSLLEIDHDERPEEGTLASSLAVIERLHQTFFSHRSLHDVDVRNVLASEQQKILAGCRV 3785 GPSLLEIDHDERPE GTLASSLAVIE++HQ FF+ RSL +VDVRN+LASEQ+KILAGCR+ Sbjct: 1113 
GPSLLEIDHDERPEAGTLASSLAVIEKIHQIFFASRSLEEVDVRNILASEQRKILAGCRI 1172 Query: 3786 VFSRIFPVGEANPHLHPLWQTAEQFGAVCTIQIDEQVTHVVANSLGTDKVNWALSTGRF 3962 VFSR+FPVGEANPHLHPLWQTAEQFGA CT QIDEQVTHVVANS GTDKVNWAL+ GRF Sbjct: 1173 VFSRVFPVGEANPHLHPLWQTAEQFGAFCTNQIDEQVTHVVANSPGTDKVNWALNNGRF 1231 >ref|XP_004310239.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Fragaria vesca subsp. vesca] Length = 1230 Score = 932 bits (2409), Expect = 0.0 Identities = 583/1249 (46%), Positives = 739/1249 (59%), Gaps = 23/1249 (1%) Frame = +3 Query: 285 SVEEISEEDF-KQEAKVLNPK-----GGDSRVW-MGDLLNYPVSSNYGSG-LYNFAWAQA 440 SVEEISEEDF KQE+K + PK G +R W ++L +P G G L N AWAQA Sbjct: 23 SVEEISEEDFVKQESKAVEPKSNGGSGDGARFWTFHEVLAHPHFRGIGGGGLANLAWAQA 82 Query: 441 VQNKPLTEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXXMKEVCNVIIDDSSEEIDS 620 VQNKP ++L++ +S+EKSK+ V+I DS +E+D Sbjct: 83 VQNKPFNDLLVK-LDSDEKSKQQQQQRSSVSSGNE------------KVVIIDSGDEMDV 129 Query: 621 KAQDVXXXXXXXXXXXXXXXXLDTEMVEETEGGWSNANDSLPSDSGRNSEGEFEKQIKSI 800 + ++ E +EE E G+ + +G G +EK++ + Sbjct: 130 EKEE--------------------EELEEGEIGFDSECGDNDKAAGSVGNGVWEKRVNLL 169 Query: 801 RGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGAPDVDDLIQQSFTGIQAINSVF 980 R ALE++T+ AEKSF VC SL+SL+ ++ E + L+QQ F ++AI+SVF Sbjct: 170 REALESLTITEAEKSFGDVCHRFLDSLESLRGVLSEINVSTKEALVQQLFNAVRAISSVF 229 Query: 981 CSMNPKQQEQNKDLFLRLLTHVKSQDTTLFSPERMKEIEAMMYGLDSEAVVSHVK--AMK 1154 SM+ Q+EQNKD+ R+L+ KS D + F E++KEIE M +DS + K ++ Sbjct: 230 RSMSADQKEQNKDVLSRILSSAKS-DPSPFPAEQLKEIEVMSSSMDSPQTKAGTKENGIQ 288 Query: 1155 KDNGTNPNEFGILGENPGQVLN-SSNKILLEPIPVKSGDQNIANMGSETXXXXXXXXXXX 1331 NG + G N V ++N + V + NI S Sbjct: 289 CINGVYKTDSDTSGANASHVFTYAANTGSDTQVSVVHSNPNI----SSEVPRSGSSSFKG 344 Query: 1332 XXXXXXXXDLHRDHDADSLPSPTRETPPFLPVQKLKVVGDGL-SKSELATPKIADESEDS 1508 DLH DHD DSLPSPTRE P P QK VV +G+ KS T + A + E S Sbjct: 345 RGLMLPLLDLHMDHDEDSLPSPTREPPACFPAQKPVVVENGMVKKSGWETARAALDVEGS 404 Query: 1509 TLHHYETDALKALSSYQQKFGRTSNILTSRLPSPTPSEE-CDDGDGDSTGEVSSFSTVGD 1685 +H YET+ALKA+SSYQQKF R S LTS 
LPSPTPSEE D+GD + GEVSS S + Sbjct: 405 KMHVYETEALKAVSSYQQKFSRNS-FLTSELPSPTPSEEEGDNGDDAAVGEVSSSSASNN 463 Query: 1686 VRNVNLPLPLRSVGSPTPH--MDSSMGQRQM-PAKTAGHLACVSNPVLRAPAKNRDPRLR 1856 VR P+ R V S P + S G + AKTA ++ SN ++ AK+RDPRLR Sbjct: 464 VRTPQPPVSGRQVVSSVPATTLPGSSGMHGLITAKTASPVSLGSNMPNKSSAKSRDPRLR 523 Query: 1857 FANSEGDALDLNQRPLLE--GATKSDTLGGIISSRKHNIVVESVLDGQTLKRQRNGLTDF 2030 FANS+ AL LNQ+ ++ A K D++ +SSRKH +S DG KRQR + Sbjct: 524 FANSDAGALTLNQQSSIQVHNAPKVDSVI-TLSSRKHKSPEDSNFDGPESKRQRGA--NS 580 Query: 2031 AVSKDVQMVSGSCGWLEESSTVGTQATDVNRLAKNMGTDLRKSENGEIVSSERRDIDASS 2210 V + G+ WLE+ S+VG + N+ + D RK N VSS ++ +S Sbjct: 581 VVGWGAKTSFGNGVWLEDGSSVGPHLINRNQTVEKKEADPRKMVN---VSSSPGTVEGNS 637 Query: 2211 NLNVSVGGNEPLPMICTGTTASLPSLLRDIAVNPTMLMQLIM---EQQRLAAGAQKKSSD 2381 N + NE +P++ + SLP++ +DIAVNPTML+ ++ QQ AA A+K+S Sbjct: 638 NGQNTA--NEKVPLVAP-SLVSLPAIFKDIAVNPTMLVNILKLAEAQQNAAAPARKESLT 694 Query: 2382 SAQNMKPASSSSVIPGIAPLVNSASSKSSEIEQKPAVRHKVPAQTTSMNPQGEWGQIRMK 2561 P SSSS IPG A LVN S S + P + P E G+IRMK Sbjct: 695 Y-----PPSSSS-IPGTAALVNDPSKTSGAL--------LTPTICSQKTPTDEAGKIRMK 740 Query: 2562 PRDPRRILHSSTFQKNESLGSDKFKTNGAPXXXXXXXXXXXXXXXXREQAQTTSLXXXXX 2741 RDPRR+LH + Q + S+G ++ + P QA S+ Sbjct: 741 LRDPRRLLHGNALQNSGSVGHEQSRNIVPPLSSSQANNDDMNGKKQDSQADNNSVTSQSG 800 Query: 2742 XXXXXXXX--FAEKLKNLAHMLSTSQATNTPPTVSQSISSQPEPVKTEKAGVGAVVTELS 2915 F + LKN+A ++S SQ + +P T SQ++S++ + + + A Sbjct: 801 ALGAPDIASQFTKNLKNIADIISVSQVSTSPATPSQNLSTELISINPDNVDLKA-----E 855 Query: 2916 DQQIGIGAKPEESIAGPARLQNPWGDVEQLFEGYDDXXXXXXXXXXXXXXXXXNKMFAAR 3095 +Q G + + AG +R WGDVE LFEGYDD KMFAA Sbjct: 856 EQHTGSISASVPTAAGASRSPATWGDVEHLFEGYDDKQKAAIQRERARRIEEQKKMFAAH 915 Query: 3096 KXXXXXXXXXXXXNSAKFVEVDPLHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 3275 K NSAKFVEVDP+HDEILRKKEEQDR++P RHLFRF HMGMWTKLRPG+ Sbjct: 916 KLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDRKEPQRHLFRFQHMGMWTKLRPGV 975 Query: 3276 WNFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGALFAGRVISKGDDGDPFDGEEKLPK 3455 W FLEKAS 
L+EMHLYTMGNKLYATEMAKVLDPTGALFAGRVIS+GDDGDP+DG+E++PK Sbjct: 976 WKFLEKASHLFEMHLYTMGNKLYATEMAKVLDPTGALFAGRVISRGDDGDPYDGDERVPK 1035 Query: 3456 NKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHD 3635 +KDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHD Sbjct: 1036 SKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHD 1095 Query: 3636 ERPEEGTLASSLAVIERLHQTFFSHRSLHDVDVRNVLASEQQKILAGCRVVFSRIFPVGE 3815 ER E+GTLASSLAVIE++HQ FFSH SL + DVRN+LASEQQKIL GCR+VFSR+FPVGE Sbjct: 1096 ERHEDGTLASSLAVIEKIHQIFFSHPSLDEADVRNILASEQQKILGGCRIVFSRVFPVGE 1155 Query: 3816 ANPHLHPLWQTAEQFGAVCTIQIDEQVTHVVANSLGTDKVNWALSTGRF 3962 NPHLHPLWQTAEQFGAVCT QID+QVTHVVANSLGTDKVNWALS+G++ Sbjct: 1156 VNPHLHPLWQTAEQFGAVCTNQIDDQVTHVVANSLGTDKVNWALSSGKY 1204 >ref|XP_002304714.2| hypothetical protein POPTR_0003s16280g [Populus trichocarpa] gi|550343307|gb|EEE79693.2| hypothetical protein POPTR_0003s16280g [Populus trichocarpa] Length = 1030 Score = 924 bits (2389), Expect = 0.0 Identities = 522/1024 (50%), Positives = 657/1024 (64%), Gaps = 32/1024 (3%) Frame = +3 Query: 987 MNPKQQEQNKDLFLRLLTHVKSQDTTLFSPERMKEIEAMMYGLDSEAVVSHVKAMKK--- 1157 MN K +EQNK +F+R L+ V S D + FSPE KEIE M+ LDS ++S +A ++ Sbjct: 1 MNQKLKEQNKGVFMRFLSLVNSHDPSFFSPEHTKEIELMVSSLDSHDILSSSRAGEERET 60 Query: 1158 --DNGTNPNEFGILGENPGQVLNSSNKILLEPIPVKSGDQNIANMGSETXXXXXXXXXXX 1331 N + L + G L + N++ P +S N N E Sbjct: 61 QVSGKVNERDNDSLSKTAGYDLTTMNRL---PSAAESFVHNKPNFSIEPPKPGVPSFKSR 117 Query: 1332 XXXXXXXXDLHRDHDADSLPSPTRETPPFLPVQKLKVVGDGLSKSELATPKIADESEDST 1511 DL + HD DSLPSPTRET P PVQ+L +GDG+ S L PK+A +E+ Sbjct: 118 GVLLPLL-DLKKFHDEDSLPSPTRETAPSFPVQRLLPIGDGMISSGLPVPKVASITEEPR 176 Query: 1512 LHHYETDALKALSSYQQKFGRTSNILTSRLPSPTPSEECDDGDGDSTGEVSSFSTVGDVR 1691 +H YETDALKA+SSYQ+KF S T+ LPSPTPSEE +GDGD+ GEVSS STV + R Sbjct: 177 VHPYETDALKAVSSYQKKFNLNS-FFTNELPSPTPSEESGNGDGDTAGEVSSSSTV-NYR 234 Query: 1692 NVNLPLPLRSVGSPTP--------------HMDSSMGQRQMPAKTAGHLACVSNPVLRAP 1829 VN P+ R SP+P H+++S + +P + + ++ ++ ++A Sbjct: 235 
TVNPPVSDRKSASPSPSPPPPPPPPPPPPPHLNNSSIRVVIPTRNSAPVSSGTSSTVKAS 294 Query: 1830 AKNRDPRLRFANSEGDALDLNQRPLL--EGATKSDTLGGIISSRKHNIVVESVLDGQTLK 2003 AK+RDPRLR+ N++ ALD NQR LL +++ G I SRK I E VLDG +LK Sbjct: 295 AKSRDPRLRYVNTDASALDQNQRTLLMVNNPPRAEPSGAIAGSRKQKIE-EDVLDGTSLK 353 Query: 2004 RQRNGLTDFAVSKDVQMVSGSCGWLEESSTVGTQATDVNRLAKNMGTDLRKSENGEI--- 2174 RQRN +F V +D++ ++G+ GWLE++ Q + N+ A+N R + NG + Sbjct: 354 RQRNSFDNFGVVRDIRSMTGTGGWLEDTDMAEPQTVNKNQWAENAEPGQRIN-NGVVCPS 412 Query: 2175 VSSERRDIDASSNLNV------SVGGNEPLPMICTGTTASLPSLLRDIAVNPTMLMQLIM 2336 S + S N+ V ++ G+E P+ T TTASLP LL+DI VNPTML+ ++ Sbjct: 413 TGSVMSSVSCSGNVQVPVMGINTIAGSEQAPVTST-TTASLPDLLKDITVNPTMLINILK 471 Query: 2337 --EQQRLAAGAQKKSSDSAQNMKPASSSSVIPGIAPLVNSASSKSSEIEQKPAVRHKVPA 2510 +QQRLA Q+K +D A++ SS+ + G P VN+ SS S I + A + + P+ Sbjct: 472 MGQQQRLALDGQQKLADPAKSTSHPPSSNTVLGAIPEVNAVSSLPSGILPRSAGKAQGPS 531 Query: 2511 QTTSMNPQGEWGQIRMKPRDPRRILHSSTFQKNESLGSDKFKTNGAPXXXXXXXXXXXXX 2690 Q + + E G+IRMKPRDPRR+LH++ Q+ SLGS++FKT Sbjct: 532 QIATTD---ESGKIRMKPRDPRRVLHNNALQRAGSLGSEQFKTTTLTSTTQGTKDNQNL- 587 Query: 2691 XXXREQAQTTSLXXXXXXXXXXXXXFAEKLKNLAHMLSTSQATNTPPTVSQSISSQPEPV 2870 Q Q F + LKN+A ++S SQ TPP VSQ+++SQP + Sbjct: 588 -----QKQEGLAELKPVVPPDISSPFTKSLKNIADIVSVSQTCTTPPFVSQNVASQPVQI 642 Query: 2871 KTEKAGVGAVVTELSDQQIGIGAKPEESIAGPARLQNPWGDVEQLFEGYDDXXXXXXXXX 3050 K+++ G SDQ++G + PE +A + QN W DVE LFEGYDD Sbjct: 643 KSDRVD-GKTGISNSDQKMGPASSPEV-VAASSLSQNTWEDVEHLFEGYDDQQKAAIQRE 700 Query: 3051 XXXXXXXXNKMFAARKXXXXXXXXXXXXNSAKFVEVDPLHDEILRKKEEQDREKPHRHLF 3230 K+FAARK NSAKFVEVDP+HDEILRKKEEQDREKP+RHLF Sbjct: 701 RARRIEEQKKLFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPYRHLF 760 Query: 3231 RFPHMGMWTKLRPGIWNFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGALFAGRVISK 3410 RFPHMGMWTKLRPGIWNFLEKASKLYE+HLYTMGNKLYATEMAKVLDP G LFAGRV+S+ Sbjct: 761 RFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVVSR 820 Query: 3411 GDDGDPFDGEEKLPKNKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRR 3590 GDDGD DG+E++PK+KDLEGVLGMES VVIIDDS+RVWPHNKLNLIVVERY 
YFPCSRR Sbjct: 821 GDDGDLLDGDERVPKSKDLEGVLGMESGVVIIDDSLRVWPHNKLNLIVVERYIYFPCSRR 880 Query: 3591 QFGLLGPSLLEIDHDERPEEGTLASSLAVIERLHQTFFSHRSLHDVDVRNVLASEQQKIL 3770 QFGL GPSLLEIDHDERPE+GTLA SLAVIER+HQ FF+H SL + DVRN+LASEQ+KIL Sbjct: 881 QFGLPGPSLLEIDHDERPEDGTLACSLAVIERIHQNFFTHHSLDEADVRNILASEQRKIL 940 Query: 3771 AGCRVVFSRIFPVGEANPHLHPLWQTAEQFGAVCTIQIDEQVTHVVANSLGTDKVNWALS 3950 AGCR+VFSR+FPVGE NPHLHPLWQ+AEQFGAVCT QIDEQVTHVVANSLGTDKVNWALS Sbjct: 941 AGCRIVFSRVFPVGEVNPHLHPLWQSAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALS 1000 Query: 3951 TGRF 3962 TGRF Sbjct: 1001 TGRF 1004 >ref|XP_004492028.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like isoform X1 [Cicer arietinum] Length = 1247 Score = 919 bits (2375), Expect = 0.0 Identities = 567/1267 (44%), Positives = 734/1267 (57%), Gaps = 41/1267 (3%) Frame = +3 Query: 285 SVEEISEEDFKQE--AKVLNPK-------GGDSRVW-MGDLLN-YP-VSSNYGSGLYNFA 428 SV EISEEDF ++ KV N GGD+RVW + DL + YP + Y SGLYN A Sbjct: 35 SVVEISEEDFNKQDVVKVNNNSDSDKAKTGGDARVWAVHDLYSKYPTICRGYASGLYNLA 94 Query: 429 WAQAVQNKPLTEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXXMKEVCNVIIDDSSE 608 WAQAVQNKPL +I + + +S+ + + +N +KEV V++DD Sbjct: 95 WAQAVQNKPLNDIFVMELDSDSNANANSNN----DSNNGNGDLNMPLKEV--VMVDDDER 148 Query: 609 EIDSKAQDVXXXXXXXXXXXXXXXXLDTEMVEETEGGWSNANDSLPSDSGRNSEGEFEKQ 788 E + E+ E G + + G + E + Sbjct: 149 E-------------------------EGELEEGEIDGDDDTGGVMVGGDGSETVSESD-- 181 Query: 789 IKSIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGAPDVDDLIQQSFTGIQAI 968 IR LE VTV +SF L L S ++ + D +I+ + I+ + Sbjct: 182 ---IRDFLEGVTVANVAESFAETISRLLRVLQSK--LLSGPAVSEKDYVIRLLYNAIEIV 236 Query: 969 NSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFSPERMKEIEAMMYGLDSEAVVSHVKA 1148 +SVFCSM+ Q+E NKD +RLL +K++ T LFSPE MKEI+ M+ +D+ + + Sbjct: 237 HSVFCSMDNLQKEDNKDNIIRLLYFLKNEHTQLFSPEHMKEIQVMITAIDTVDALGNSVV 296 Query: 1149 M---KKDNGTNPNEFGILGENPGQVLNSSNKI---LLEPIP-VKSGDQNIANMGSETXXX 1307 + +K + + I G ++++SS + L E + SG NI G Sbjct: 297 VGNGEKLDTLDIKTRQIQGLKASELISSSKLVHSNLTEASEALLSGQSNIKGRG------ 350 Query: 1308 
XXXXXXXXXXXXXXXXDLHRDHDADSLPSPTRETPPFLPVQKLKVVGDGLSKSELATP-- 1481 DLH+ HD DSLPSPTRE P F PV KL VGDG+ + L + Sbjct: 351 ----------VMLPLFDLHKVHDLDSLPSPTREAPSFFPVNKLFSVGDGMDRPGLPSAGK 400 Query: 1482 ----KIADESEDSTLHHYETDALKALSSYQQKFGRTSNILTSRLPSPTPSEECDDGDGDS 1649 K+ ++E+S H YETDALKA+S+YQQKFGR+S + PSPTPS +C++G D+ Sbjct: 401 TEAVKMELDTENSKNHLYETDALKAVSTYQQKFGRSSYFTDDKFPSPTPSGDCEEGVADA 460 Query: 1650 TGEVSSFSTVGDVRNVNLPLPLRSVGSPTPHMDSSMGQRQMPAKTAGHLACVSNPVLRAP 1829 EVSS S + + L V S + S G + A + V+ PV + Sbjct: 461 NEEVSSASIAVSLTSSKPLLDQMPVSSTSVDRSSMHGLINSRIEAA---SSVTYPV-KTS 516 Query: 1830 AKNRDPRLRFANSEGDALDLNQRPLLEGATKSDTLGGIISSRKHNIVVESVLDGQTLKRQ 2009 A++RDPRLRF NS+ ALDLNQ K + G +IS RK E LD KR Sbjct: 517 ARSRDPRLRFINSDASALDLNQSLGTNNMPKVENAGRVIS-RKQKTTEELSLDATAPKRL 575 Query: 2010 RNGLTDFAVS-KDVQMVSGSCGWLEESSTVGTQATDVNRLAKNMGTDLRKSENGEIVSSE 2186 R+ L + + ++ + ++G+ GWLEE+ G+ + N L + T+L+K+ + Sbjct: 576 RSSLENSRHNTREERTMAGNGGWLEENRVAGSHLIERNHLMQKGETELKKTMS------- 628 Query: 2187 RRDIDASSNLNVSVGGNEPLPMICTGTTASLPSLLRDIAVNPTMLMQLIMEQQ-RLAAGA 2363 +S V+ GNE P+ + T A+LP LL++IAVNPTML+ +++EQQ RLAA A Sbjct: 629 ----TSSGYSTVTSNGNEQAPVTVSNTAAALPGLLKNIAVNPTMLLNILLEQQQRLAAEA 684 Query: 2364 QKKSSDSAQNMKPASSSSVIPGIAPLVNSASSKSSEIEQKPAVRHKVPAQTTSMNPQG-- 2537 KK DSA + L NSA + + PA+ +P + M P Sbjct: 685 NKKPVDSATSTMH------------LTNSARGPDATVNTGPAMTAGLPQSSVGMLPASTQ 732 Query: 2538 ----------EWGQIRMKPRDPRRILH-SSTFQKNESLGSDKFKTNGAPXXXXXXXXXXX 2684 + G+IRMKPRDPRRILH SS+ QK+ S GS++ K+ +P Sbjct: 733 AASMAHTLLEDSGKIRMKPRDPRRILHGSSSLQKSGSTGSEQSKSVVSPTSNNQGNGGNV 792 Query: 2685 XXXXXREQAQTTSLXXXXXXXXXXXXXFAEKLKNLAHMLSTSQATNTP-PTVSQSISSQP 2861 + +T F + LKN+A ++S SQ +T P +Q++SS Sbjct: 793 NAQKLDVRVETKLAPTQSSAQPDITRQFTKNLKNIADIMSVSQEPSTQLPATTQNVSSAS 852 Query: 2862 EPVKTEKAGVGAVVTELSDQQIGIGAKPEESIAGPARLQNPWGDVEQLFEGYDDXXXXXX 3041 P +KA + + V + Q G+G+ PE G +R Q+ W DVE LFEGYD+ Sbjct: 853 VPFTLDKAELKSGVPNSQNLQDGVGSAPETCAPGSSRSQSTWADVEHLFEGYDEKQKAAI 912 Query: 3042 
XXXXXXXXXXXNKMFAARKXXXXXXXXXXXXNSAKFVEVDPLHDEILRKKEEQDREKPHR 3221 NKMFA++K NSAKFVEVDP+HDEILRKKEEQDREKPHR Sbjct: 913 QRERARRLEEQNKMFASKKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHR 972 Query: 3222 HLFRFPHMGMWTKLRPGIWNFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGALFAGRV 3401 HLFRFPHMGMWTKLRPG+WNFLEKASKLYE+HLYTMGNKLYATEMAKVLDP G LFAGRV Sbjct: 973 HLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1032 Query: 3402 ISKGDDGDPFDGEEKLPKNKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPC 3581 IS+GDD + DG+E+ PK+KDLEGV+GMES+VVI+DDSVRVWPHNKLNLIVVERYTYFPC Sbjct: 1033 ISRGDDTESVDGDERAPKSKDLEGVMGMESSVVIVDDSVRVWPHNKLNLIVVERYTYFPC 1092 Query: 3582 SRRQFGLLGPSLLEIDHDERPEEGTLASSLAVIERLHQTFFSHRSLHDVDVRNVLASEQQ 3761 SRRQFGL GPSLLEIDHDERPE GTLASSLAVIER+HQ FF+ +SL +VDVRN+LASEQ+ Sbjct: 1093 SRRQFGLPGPSLLEIDHDERPEAGTLASSLAVIERIHQNFFASQSLEEVDVRNILASEQR 1152 Query: 3762 KILAGCRVVFSRIFPVGEANPHLHPLWQTAEQFGAVCTIQIDEQVTHVVANSLGTDKVNW 3941 KILAGCR+VFSR+FPVGEANPHLHPLWQTAEQFGAVC QID+QVTHVVANSLGTDKVNW Sbjct: 1153 KILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCINQIDDQVTHVVANSLGTDKVNW 1212 Query: 3942 ALSTGRF 3962 A+STGRF Sbjct: 1213 AISTGRF 1219 >ref|XP_007139315.1| hypothetical protein PHAVU_008G019000g [Phaseolus vulgaris] gi|561012448|gb|ESW11309.1| hypothetical protein PHAVU_008G019000g [Phaseolus vulgaris] Length = 1272 Score = 917 bits (2370), Expect = 0.0 Identities = 572/1258 (45%), Positives = 726/1258 (57%), Gaps = 32/1258 (2%) Frame = +3 Query: 285 SVEEISEEDF-KQEAKVLN---PKGGDSRVW-MGDLLN-YP-VSSNYGSGLYNFAWAQAV 443 SVEEISE DF KQ+ KV N P G D+RVW + D+ YP + Y SGLYN AWAQAV Sbjct: 35 SVEEISEADFNKQDVKVNNNNKPNGSDARVWSVRDIYTKYPTICRGYASGLYNLAWAQAV 94 Query: 444 QNKPLTEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXXMKEVCNVIIDDSSEEIDSK 623 QNKPL +I + + +SE + + +N KEV V +D E++ Sbjct: 95 QNKPLNDIFVMELDSEANANSNSNN------SNRPSSVSVNPKEVMVVDVDREEGELEEG 148 Query: 624 AQDVXXXXXXXXXXXXXXXXLDTEMVEETEGGWSNANDSLP-SDSGRNSEGEFEKQIKSI 800 D D E E+ S ++++ S+ +G + + + Sbjct: 149 EIDADA---------------DPEAEAESVVAASVVSETVSDSEQFGVKKGVSDSEQLGV 193 Query: 801 
RGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGAPDVDDLIQQSFTGIQAINSVF 980 R LE VTV +SF L L++L + + DDLI+ SF I+ + SVF Sbjct: 194 RDVLEGVTVANVAESFAQTSSRL---LNALPQVFSRPADSEKDDLIRLSFNAIEVVYSVF 250 Query: 981 CSMNPKQQEQNKDLFLRLLTHVKSQ-DTTLFSPERMKEIEAMMYGLDSEAVVSHVKAMKK 1157 SM+ +EQNK+ LRLL+ K + LFSPE +KEI+ MM +DS + +A+ Sbjct: 251 RSMDSSDKEQNKNSILRLLSSAKDKKQAQLFSPEHIKEIQDMMTAIDSVGALGSNEAIYM 310 Query: 1158 DNGTNPNEFGILGENPGQVLNSSNKILLEPIPVKSGDQNIAN-------------MGSET 1298 + E NS+ ++ I ++ +A +G+ Sbjct: 311 ETELQTPEIK-------SQENSALEVQTRGIKIQENQAVVATELVSSIKPLHSDIIGASR 363 Query: 1299 XXXXXXXXXXXXXXXXXXXDLHRDHDADSLPSPTRETPPFLPVQKLKVVGDGLSKSELAT 1478 DLH+DHDADSLPSPTRE P PV KL VG+ + KS A Sbjct: 364 ALKFGQNSIKGRGVLLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGEVMVKSGSAA 423 Query: 1479 PKIAD-----ESEDSTLHHYETDALKALSSYQQKFGRTSNILTSRLPSPTPSEECDDGDG 1643 K+ +SE S H YETDALKA+S+YQQKFGR+S +LPSPTPS +CDD Sbjct: 424 AKMQPGKLEVDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKLPSPTPSGDCDDMAV 483 Query: 1644 DSTGEVSSFSTVGDVRNVNLPLPLRSVGSPTPHMDSS--MGQRQMPAKTAGHLACVSNPV 1817 D+ EVSS ST G + + P L +D S +G AG S PV Sbjct: 484 DTNEEVSSASTSGFLTSTK-PTLLDQPPVSATSVDKSRLLGLISSRVDAAGS---GSFPV 539 Query: 1818 LRAPAKNRDPRLRFANSEGDALDLNQRPLLEGATKSDTLGGIISSRKHNIVVESVLDGQT 1997 ++ AK+RDPR R NSE A+D NQ + K + G IS RK V E D Sbjct: 540 -KSSAKSRDPRRRLINSEASAVD-NQFTVTHNMPKVEYAGSTIS-RKQKAVEEPSFDLTV 596 Query: 1998 LKRQRNGLTDFAVS-KDVQMVSGSCGWLEESSTVGTQATDVNRLAKNMGTDLRKSENGEI 2174 KR ++ L + + +V+ ++GS GWLE+ + GTQ + N L + +++ N Sbjct: 597 SKRLKSSLENIEHNTSEVRTIAGSGGWLEDITGPGTQLIEKNHLIDKFAPEPKRTLN--T 654 Query: 2175 VSSERRDIDASSNLNVSVGGNEPLPMICTGTTASLPSLLRDIAVNPTMLMQLIMEQQRLA 2354 VSS S N N + NE P+ +SLP++ +DI VNPTML+ L+MEQ+RL Sbjct: 655 VSSS-----GSVNFNATSIRNEQAPITSNNVPSSLPAIFKDIVVNPTMLLSLLMEQKRLV 709 Query: 2355 AGAQKKSSDSAQNMKPASSSSVIPGIAPLVNSASSKSSEIEQKPAVRHKVPAQTTSMNPQ 2534 AQ S+DSA NM +SS+ G + SS ++ ++ + T++ Q Sbjct: 710 -DAQNNSADSATNMLHPTSSNSAMGTDSTASIVSSMATGLQTSVGMLPVSSQSTSTAQLQ 768 Query: 2535 
GEW-GQIRMKPRDPRRILHSS-TFQKNESLGSDKFKTNGAPXXXXXXXXXXXXXXXXREQ 2708 ++ G+IRMKPRDPRRILH++ + QK+ ++ ++ K +P + Sbjct: 769 DDYSGKIRMKPRDPRRILHTNNSVQKSGNIVNELHKAIVSPVSNILVTGDSVNAQKLEGR 828 Query: 2709 AQTTSLXXXXXXXXXXXXXFAEKLKNLAHMLSTSQATNTPPTVSQSISSQPEPVKTEKAG 2888 T + F LKN+A ++S SQ ++T +Q SS P+ ++ Sbjct: 829 MDTKLVPTQSGAAPDITRQFTRNLKNIADIMSVSQESSTHSPAAQGFSSASVPLNVDRGE 888 Query: 2889 VGAVVTELSDQQIGIGAKPEESIAGPARLQNPWGDVEQLFEGYDDXXXXXXXXXXXXXXX 3068 +V++ + G G+ PE G +R Q+ WGDVE LFEGYD+ Sbjct: 889 QKSVLSNSQNLHAGTGSAPEICAPGTSRSQSTWGDVEHLFEGYDEQQKAAIQRERARRIE 948 Query: 3069 XXNKMFAARKXXXXXXXXXXXXNSAKFVEVDPLHDEILRKKEEQDREKPHRHLFRFPHMG 3248 NKMFAARK NSAKFVEVDP+H+EILRKKEE DREKPHRHLFRFPHMG Sbjct: 949 EQNKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHEEILRKKEELDREKPHRHLFRFPHMG 1008 Query: 3249 MWTKLRPGIWNFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGALFAGRVISKGDDGDP 3428 MWTKLRPGIWNFLEKASKLYE+HLYTMGNKLYATEMAKVLDP G LFAGRVIS+GDD D Sbjct: 1009 MWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDTDS 1068 Query: 3429 FDGEEKLPKNKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 3608 DGEE+ PK+KDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL G Sbjct: 1069 VDGEERAPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPG 1128 Query: 3609 PSLLEIDHDERPEEGTLASSLAVIERLHQTFFSHRSLHDVDVRNVLASEQQKILAGCRVV 3788 PSLLEIDHDERPE GTLASSLAVIERLHQ FFS +SL +VDVRN+LASEQ+KIL+GCR+V Sbjct: 1129 PSLLEIDHDERPEAGTLASSLAVIERLHQNFFSSQSLEEVDVRNILASEQRKILSGCRIV 1188 Query: 3789 FSRIFPVGEANPHLHPLWQTAEQFGAVCTIQIDEQVTHVVANSLGTDKVNWALSTGRF 3962 FSR+FPVGEANPHLHPLWQTAEQFGAVCT QID+QVTHVVANSLGTDKVNWALSTGRF Sbjct: 1189 FSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDDQVTHVVANSLGTDKVNWALSTGRF 1246 >ref|XP_004492029.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like isoform X2 [Cicer arietinum] Length = 1227 Score = 911 bits (2355), Expect = 0.0 Identities = 565/1267 (44%), Positives = 730/1267 (57%), Gaps = 41/1267 (3%) Frame = +3 Query: 285 SVEEISEEDFKQE--AKVLNPK-------GGDSRVW-MGDLLN-YP-VSSNYGSGLYNFA 428 SV EISEEDF ++ KV N GGD+RVW + DL + YP 
+ Y SGLYN A Sbjct: 35 SVVEISEEDFNKQDVVKVNNNSDSDKAKTGGDARVWAVHDLYSKYPTICRGYASGLYNLA 94 Query: 429 WAQAVQNKPLTEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXXMKEVCNVIIDDSSE 608 WAQAVQNKPL +I + + +S+ S +N+ V++DD Sbjct: 95 WAQAVQNKPLNDIFVMELDSD-----SNANV---------------------VMVDDDER 128 Query: 609 EIDSKAQDVXXXXXXXXXXXXXXXXLDTEMVEETEGGWSNANDSLPSDSGRNSEGEFEKQ 788 E + E+ E G + + G + E + Sbjct: 129 E-------------------------EGELEEGEIDGDDDTGGVMVGGDGSETVSESD-- 161 Query: 789 IKSIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGAPDVDDLIQQSFTGIQAI 968 IR LE VTV +SF L L S ++ + D +I+ + I+ + Sbjct: 162 ---IRDFLEGVTVANVAESFAETISRLLRVLQSK--LLSGPAVSEKDYVIRLLYNAIEIV 216 Query: 969 NSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFSPERMKEIEAMMYGLDSEAVVSHVKA 1148 +SVFCSM+ Q+E NKD +RLL +K++ T LFSPE MKEI+ M+ +D+ + + Sbjct: 217 HSVFCSMDNLQKEDNKDNIIRLLYFLKNEHTQLFSPEHMKEIQVMITAIDTVDALGNSVV 276 Query: 1149 M---KKDNGTNPNEFGILGENPGQVLNSSNKI---LLEPIP-VKSGDQNIANMGSETXXX 1307 + +K + + I G ++++SS + L E + SG NI G Sbjct: 277 VGNGEKLDTLDIKTRQIQGLKASELISSSKLVHSNLTEASEALLSGQSNIKGRG------ 330 Query: 1308 XXXXXXXXXXXXXXXXDLHRDHDADSLPSPTRETPPFLPVQKLKVVGDGLSKSELATP-- 1481 DLH+ HD DSLPSPTRE P F PV KL VGDG+ + L + Sbjct: 331 ----------VMLPLFDLHKVHDLDSLPSPTREAPSFFPVNKLFSVGDGMDRPGLPSAGK 380 Query: 1482 ----KIADESEDSTLHHYETDALKALSSYQQKFGRTSNILTSRLPSPTPSEECDDGDGDS 1649 K+ ++E+S H YETDALKA+S+YQQKFGR+S + PSPTPS +C++G D+ Sbjct: 381 TEAVKMELDTENSKNHLYETDALKAVSTYQQKFGRSSYFTDDKFPSPTPSGDCEEGVADA 440 Query: 1650 TGEVSSFSTVGDVRNVNLPLPLRSVGSPTPHMDSSMGQRQMPAKTAGHLACVSNPVLRAP 1829 EVSS S + + L V S + S G + A + V+ PV + Sbjct: 441 NEEVSSASIAVSLTSSKPLLDQMPVSSTSVDRSSMHGLINSRIEAA---SSVTYPV-KTS 496 Query: 1830 AKNRDPRLRFANSEGDALDLNQRPLLEGATKSDTLGGIISSRKHNIVVESVLDGQTLKRQ 2009 A++RDPRLRF NS+ ALDLNQ K + G +IS RK E LD KR Sbjct: 497 ARSRDPRLRFINSDASALDLNQSLGTNNMPKVENAGRVIS-RKQKTTEELSLDATAPKRL 555 Query: 2010 RNGLTDFAVS-KDVQMVSGSCGWLEESSTVGTQATDVNRLAKNMGTDLRKSENGEIVSSE 2186 R+ L + + ++ + ++G+ GWLEE+ G+ + N L + T+L+K+ + Sbjct: 556 
RSSLENSRHNTREERTMAGNGGWLEENRVAGSHLIERNHLMQKGETELKKTMS------- 608 Query: 2187 RRDIDASSNLNVSVGGNEPLPMICTGTTASLPSLLRDIAVNPTMLMQLIMEQQ-RLAAGA 2363 +S V+ GNE P+ + T A+LP LL++IAVNPTML+ +++EQQ RLAA A Sbjct: 609 ----TSSGYSTVTSNGNEQAPVTVSNTAAALPGLLKNIAVNPTMLLNILLEQQQRLAAEA 664 Query: 2364 QKKSSDSAQNMKPASSSSVIPGIAPLVNSASSKSSEIEQKPAVRHKVPAQTTSMNPQG-- 2537 KK DSA + L NSA + + PA+ +P + M P Sbjct: 665 NKKPVDSATSTMH------------LTNSARGPDATVNTGPAMTAGLPQSSVGMLPASTQ 712 Query: 2538 ----------EWGQIRMKPRDPRRILH-SSTFQKNESLGSDKFKTNGAPXXXXXXXXXXX 2684 + G+IRMKPRDPRRILH SS+ QK+ S GS++ K+ +P Sbjct: 713 AASMAHTLLEDSGKIRMKPRDPRRILHGSSSLQKSGSTGSEQSKSVVSPTSNNQGNGGNV 772 Query: 2685 XXXXXREQAQTTSLXXXXXXXXXXXXXFAEKLKNLAHMLSTSQATNTP-PTVSQSISSQP 2861 + +T F + LKN+A ++S SQ +T P +Q++SS Sbjct: 773 NAQKLDVRVETKLAPTQSSAQPDITRQFTKNLKNIADIMSVSQEPSTQLPATTQNVSSAS 832 Query: 2862 EPVKTEKAGVGAVVTELSDQQIGIGAKPEESIAGPARLQNPWGDVEQLFEGYDDXXXXXX 3041 P +KA + + V + Q G+G+ PE G +R Q+ W DVE LFEGYD+ Sbjct: 833 VPFTLDKAELKSGVPNSQNLQDGVGSAPETCAPGSSRSQSTWADVEHLFEGYDEKQKAAI 892 Query: 3042 XXXXXXXXXXXNKMFAARKXXXXXXXXXXXXNSAKFVEVDPLHDEILRKKEEQDREKPHR 3221 NKMFA++K NSAKFVEVDP+HDEILRKKEEQDREKPHR Sbjct: 893 QRERARRLEEQNKMFASKKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHR 952 Query: 3222 HLFRFPHMGMWTKLRPGIWNFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGALFAGRV 3401 HLFRFPHMGMWTKLRPG+WNFLEKASKLYE+HLYTMGNKLYATEMAKVLDP G LFAGRV Sbjct: 953 HLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1012 Query: 3402 ISKGDDGDPFDGEEKLPKNKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPC 3581 IS+GDD + DG+E+ PK+KDLEGV+GMES+VVI+DDSVRVWPHNKLNLIVVERYTYFPC Sbjct: 1013 ISRGDDTESVDGDERAPKSKDLEGVMGMESSVVIVDDSVRVWPHNKLNLIVVERYTYFPC 1072 Query: 3582 SRRQFGLLGPSLLEIDHDERPEEGTLASSLAVIERLHQTFFSHRSLHDVDVRNVLASEQQ 3761 SRRQFGL GPSLLEIDHDERPE GTLASSLAVIER+HQ FF+ +SL +VDVRN+LASEQ+ Sbjct: 1073 SRRQFGLPGPSLLEIDHDERPEAGTLASSLAVIERIHQNFFASQSLEEVDVRNILASEQR 1132 Query: 3762 KILAGCRVVFSRIFPVGEANPHLHPLWQTAEQFGAVCTIQIDEQVTHVVANSLGTDKVNW 3941 KILAGCR+VFSR+FPVGEANPHLHPLWQTAEQFGAVC 
QID+QVTHVVANSLGTDKVNW Sbjct: 1133 KILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCINQIDDQVTHVVANSLGTDKVNW 1192 Query: 3942 ALSTGRF 3962 A+STGRF Sbjct: 1193 AISTGRF 1199 >ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis] gi|223548611|gb|EEF50102.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis] Length = 1195 Score = 900 bits (2325), Expect = 0.0 Identities = 562/1270 (44%), Positives = 724/1270 (57%), Gaps = 44/1270 (3%) Frame = +3 Query: 285 SVEEISEEDF-KQEAKVLNPKG-------------GDSRVW-MGDLLNYPVSSNYGSGLY 419 S+EEISEEDF KQ+ V+ P G+ RVW + DL Y + + SGLY Sbjct: 25 SIEEISEEDFNKQDVVVVKPPSSNNETTKQKEQGNGNGRVWTISDLYRYQMVGGHVSGLY 84 Query: 420 NFAWAQAVQ------NKPLTEILMRDFES-EEKSKRSGSNLLXXXXXXXXXXXXXXMKEV 578 N AWAQAVQ NKPL E+ E +E SKRS + K+V Sbjct: 85 NLAWAQAVQSKPGKSNKPLNELFADVVEELDESSKRSSPSSSAASVNSNNKDGDEEKKKV 144 Query: 579 CN-VIIDDSSEEIDSKAQDVXXXXXXXXXXXXXXXXLDTEMVEETEGGWSNANDSLPSDS 755 V+IDD+ + EM+++ + D + + Sbjct: 145 VEKVVIDDNGD----------------------------EMMDDNNR--NKIVDVVEKEE 174 Query: 756 GRNSEGEFEKQIKSIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGAPDVDDL 935 G EGE + ++ EK+ +G L + ++D L++ E G + Sbjct: 175 GELEEGEIDLDMEP------------GEKANNGDVLNM--NIDGLEVESGEKGFEKKMNS 220 Query: 936 IQQSFTGIQAINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFSPERMKEIEAMMYGL 1115 I+ + + + C+ FS KE E ++ Sbjct: 221 IRDALESVTIEFVLACT---------------------DSSGVSFSSFSEKEKEPLI--- 256 Query: 1116 DSEAVVSHVKAMKKDNGTNPNEFGILGENPGQVLNSSNKILLEPIPVKSGDQNIANMGSE 1295 VV+ KKDN N G++ G +++ NK+ P S N AN+ E Sbjct: 257 --STVVN-----KKDNDVN-------GKSSGHDMSAVNKL-----PTDSFVNNKANLSIE 297 Query: 1296 TXXXXXXXXXXXXXXXXXXXDLHRDHDADSLPSPTRETPPFLPVQKLKVVGDGLSKSELA 1475 DLH+DHDADSLPSPTRE+ LP ++ Sbjct: 298 -GPKTGVSSFKSRAALLPLLDLHKDHDADSLPSPTRESALPLPAYRV------------L 344 Query: 1476 TPKIADESEDSTLHHYETDALKALSSYQQKFGRTSNILTSRLPSPTPSEECDDGDGDSTG 1655 TPK+ ++ +S +H YETDALKA+SSYQQKF ++S LT RLPSPTPSEE +GDGD+ G Sbjct: 345 TPKMVLDTGNSRMHPYETDALKAVSSYQQKFSKSSFALTDRLPSPTPSEESGNGDGDTGG 404 Query: 1656 
EVSSFSTVGDVRNVNLPLPLRSVGSPT----PHMDSSMGQRQMPAKTAGHLACVSNPVLR 1823 EVSS +V R N PL S S P MD S + K+A + + ++ Sbjct: 405 EVSSSLSVSSFRPAN---PLTSGQSNASISLPRMDGSSLPGVISIKSAVRASSAPSLTVK 461 Query: 1824 APAKNRDPRLRFANSEGDALDLNQRPL-LEGATKSDTLGGIISSRKHNIVVESVLDGQTL 2000 A AK+RDPRLRF NS+ +ALD N R + + K + +GG ++ ++ IV + + DG +L Sbjct: 462 ASAKSRDPRLRFVNSDSNALDQNHRAVPVVNTLKVEPIGGTMNKKRQKIVDDPIPDGHSL 521 Query: 2001 KRQRNGLTDFAVSKDVQMVSGSCGWLEESSTVGTQATDVNRLAKNMGTDLRKSENGEIVS 2180 KRQ+N L + V +DV+ + GS GWLE++ VG Q + N+L N +D R+ + G + + Sbjct: 522 KRQKNALENSGVVRDVKTMVGSGGWLEDTDMVGPQTMNKNQLVDNAESDPRRKDGGGVCT 581 Query: 2181 SERRDIDASSNLNVSVGGNEPLPMICT------------GTTASLPSLLRDIAVNPTMLM 2324 S +S +V++ G E +P+ T G+TA++P LL++IAVNPTML+ Sbjct: 582 S------SSCISSVNISGTEQIPVTGTSVPIGGELVPVKGSTAAIPDLLKNIAVNPTMLI 635 Query: 2325 QLIM--EQQRLAAGAQKKSSDSAQNMKPASSSSVIPGIAPLVNSASSKSSEIEQKPAVRH 2498 ++ +QQRLA AQ+K D A++ +S+ + G P+V +A S I +PA Sbjct: 636 NILKMGQQQRLALEAQQKPVDPAKSTTYPLNSNSMLGTVPVVGAAHSG---ILPRPAGTV 692 Query: 2499 KVPAQTTSMNPQGEWGQIRMKPRDPRRILHSSTFQKNESLGSDKFKTNGAPXXXXXXXXX 2678 +V Q + + G+IRMKPRDPRR+LH++ Q+N S+GS+ KTN Sbjct: 693 QVSPQ---LGTADDLGKIRMKPRDPRRVLHNNALQRNGSMGSEHLKTNLTSIPINQETKD 749 Query: 2679 XXXXXXXREQAQTTSLXXXXXXXXXXXXXFAEKLKNLAHMLSTSQATNTPPTVSQSISSQ 2858 Q + + F + LKN+A ++S S A+ + P V Q+ +SQ Sbjct: 750 NQNLQKQEGQVEKKPVPLQSLALPDISMPFTKNLKNIADIVSVSHASTSQPLVPQNPASQ 809 Query: 2859 PEPVKTEKAGVGAVVTELSDQQIGIGAKPEESIAGPA--RLQNPWGDVEQLFEGYDDXXX 3032 P SDQ +GIG+ P + A A R QN WGDVE LFEGY+D Sbjct: 810 PMRTTISS----------SDQFLGIGSAPGAAAAAAAGPRTQNAWGDVEHLFEGYNDQQK 859 Query: 3033 XXXXXXXXXXXXXXNKMFAARKXXXXXXXXXXXXNSAKFVEVDPLHDEILRKKEEQDREK 3212 K+F+ARK NSAKFVEVDP+HDEILRKKEEQDREK Sbjct: 860 AAIQRERARRIEEQKKLFSARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREK 919 Query: 3213 PHRHLFRFPHMGMWTKLRPGIWNFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGALFA 3392 HRHLFRFPHMGMWTKLRPGIWNFLEKASKLYE+HLYTMGNKLYATEMAKVLDPTG LF Sbjct: 920 AHRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGVLFN 979 Query: 3393 
GRVISKGDDGDPFDGEEKLPKNKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 3572 GRVIS+GDDG+PFDG+E++PK+KDLEGVLGMES VVI+DDSVRVWPHNKLNLIVVERY Y Sbjct: 980 GRVISRGDDGEPFDGDERIPKSKDLEGVLGMESGVVIMDDSVRVWPHNKLNLIVVERYIY 1039 Query: 3573 FPCSRRQFGLLGPSLLEIDHDERPEEGTLASSLAVIERLHQTFFSHRSLHDVDVRNVLAS 3752 FPCSRRQFGL GPSLLEIDHDERPE+GTLA SLAVIER+HQ FF+H SL + DVRN+LAS Sbjct: 1040 FPCSRRQFGLPGPSLLEIDHDERPEDGTLACSLAVIERIHQNFFTHPSLDEADVRNILAS 1099 Query: 3753 EQQKILAGCRVVFSRIFPVGEANPHLHPLWQTAEQFGAVCTIQIDEQVTHVVANSLGTDK 3932 EQ+KILAGCR+VFSR+FPVGEANPHLHPLWQTAEQFGAVCT QIDEQVTHVVANSLGTDK Sbjct: 1100 EQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDK 1159 Query: 3933 VNWALSTGRF 3962 VNWALSTGRF Sbjct: 1160 VNWALSTGRF 1169 >ref|XP_006341905.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Solanum tuberosum] Length = 1218 Score = 899 bits (2323), Expect = 0.0 Identities = 554/1252 (44%), Positives = 722/1252 (57%), Gaps = 26/1252 (2%) Frame = +3 Query: 285 SVEEISEEDFKQE-------AKVLNPKGGD------SRVW-MGDLLNYPVSSNYGSGLYN 422 SVEEISE+ F ++ K+ + + + +RVW M D YP+S +Y GLYN Sbjct: 20 SVEEISEDAFNRQDPPTTTKIKIASNENQNQNSTTTTRVWTMRDAYKYPISRDYARGLYN 79 Query: 423 FAWAQAVQNKPLTEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXXMKEVCNVIIDDS 602 AWAQAVQNKPL E+ + ++ + + +N+ K + +V +DD Sbjct: 80 LAWAQAVQNKPLDELFVMTSDNSNQCANANANV--------------ESKVIIDVDVDDD 125 Query: 603 SEEIDSKAQDVXXXXXXXXXXXXXXXXLDTEMVEETEGGWSNANDSLPSDSGRNSEGEFE 782 ++E +EE E A+ L F Sbjct: 126 AKEEGE--------------------------LEEGEIDLDAADLVL----------NFG 149 Query: 783 KQIKSIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGAPDVDDLIQQSFTGIQ 962 K+ +R L++VT+ KSF VC +LQ SL +L + + D+ LIQ T ++ Sbjct: 150 KEANFVREQLQSVTLDETHKSFSMVCSKLQTSLLALGELALSQDKNDI--LIQLFMTALR 207 Query: 963 AINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFSPERMKEIEAMMYGLDSEAVVSHV 1142 INSVF SMN Q++QN D+ RLL H K+Q L S E++KE++A++ ++ AV S+ Sbjct: 208 TINSVFYSMNQDQKQQNTDILSRLLFHAKTQLPALLSSEQLKEVDAVILSINQSAVFSNT 267 Query: 1143 KAMKKDNGTNPNEF------GILGENPGQVLNSSNKILLEPIPVKSGDQNIANMGSETXX 1304 + K NG 
E EN Q + NK L + +KS ++ E+ Sbjct: 268 QDNDKVNGIKVVELLDKKVSHKSSENANQDFTAVNKYDLGAVSIKSSGLKEQSVSFESVK 327 Query: 1305 XXXXXXXXXXXXXXXXXDLHRDHDADSLPSPTRETPPFLPVQKLKVVGDGLSKSELATPK 1484 DLH+DHD D+LPSPTRE P PV K G+ K +L Sbjct: 328 PGLANSKAKGLSIPLL-DLHKDHDEDTLPSPTREIGPQFPVAKA-TQAHGMVKLDLPIFA 385 Query: 1485 IADESEDSTLHHYETDALKALSSYQQKFGRTSNILTSRLPSPTPSEECDDGDGDSTGEVS 1664 + E +S LH YETDALKA+SSYQQKFGR+S ++ LPSPTPSEE D G GD GEV+ Sbjct: 386 GSLEKGNSLLHPYETDALKAVSSYQQKFGRSSLFVSENLPSPTPSEEGDSGKGDIGGEVT 445 Query: 1665 SFSTVGDVRNVNLPLPLRSVGSPTPHMDSSMGQRQMPAKTAGHLACVSNPVLRAP-AKNR 1841 S V + ++N + + S P + GQ A+TA L+ + NP LR+ AK+R Sbjct: 446 SLDVVHNASHLNESSMGQPILSSVPQTNILDGQGLGTARTADPLSFLPNPSLRSSTAKSR 505 Query: 1842 DPRLRFANSEGDALDLNQR--PLLEGATKSDTLGGIISSRKHNIVVESVLDGQTLKRQRN 2015 DPRLR A S+ A + N+ P+ + K + +I S+K V V KRQR+ Sbjct: 506 DPRLRLATSDAVAQNTNKNILPIPDIDLKLEASLEMIGSKKQKTVDLPVFGAPLPKRQRS 565 Query: 2016 GLTDFAVSKDVQMVSGSCGWLEESSTVGTQATDVNRLAKNMGTDLRKSENGEIVSSERRD 2195 TD + DV+ +G+ GWLE+ T G T N + D+RK E ++ ++ Sbjct: 566 EQTDSIIVSDVRPSTGNGGWLEDRGTAGLPITSSNCATDSSDNDIRKLE--QVTAT---- 619 Query: 2196 IDASSNLNVSVGGNEPLPMICTGTTASLPSLLRDIAVNPTMLMQLIMEQQRLAAGAQKKS 2375 I ++ V+ N P+ I T TT L SLL+DIA+NP++ M +I +Q+ +A A + + Sbjct: 620 IATIPSVIVNAAENFPVTGISTSTT--LHSLLKDIAINPSIWMNIIKMEQQKSADASRTT 677 Query: 2376 SDSAQNMKPASSSSVIPGIAPLVNSASSKSSEIEQKPAVRHKVPAQTTSMNPQGEWGQIR 2555 + A SSS I G P ++ + +SS I Q+ + P T S + E +R Sbjct: 678 TAQA------SSSKSILGAVPSTDAIAPRSSAIGQRSVGILQTPTHTASAD---EVAIVR 728 Query: 2556 MKPRDPRRILHSSTFQKNESLGSDKFKTNGAPXXXXXXXXXXXXXXXXREQAQTTSLXXX 2735 MKPRDPRR+LH++ K ++GSD+ KT A +Q S Sbjct: 729 MKPRDPRRVLHNTAVLKGGNVGSDQCKTGVA---GTHATISNLGFQSQEDQLDRKSAVTL 785 Query: 2736 XXXXXXXXXXFAEKLKNLAHMLSTSQATN---TPPTVSQSISSQPEPVKTEKAGVGAVVT 2906 F + LKN+A M+S S +T+ T +Q + S + ++A V+ Sbjct: 786 STTPPDIARQFTKNLKNIADMISVSPSTSLSAASQTQTQCLQSHQSRSEGKEA-----VS 840 Query: 2907 ELSDQQIGIGAKPEESIAGPARLQNPWGDVEQLFEGYDDXXXXXXXXXXXXXXXXXNKMF 3086 E S++ G E+ G + Q WGDVE LFEGY D KMF Sbjct: 841 
EPSERVNDAGLASEKGSPGSLQPQISWGDVEHLFEGYSDQQRADIQRERARRLEEQKKMF 900 Query: 3087 AARKXXXXXXXXXXXXNSAKFVEVDPLHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 3266 + RK NSAKFVE+DP+H+EILRKKEEQDREKP RHLFRFPHMGMWTKLR Sbjct: 901 SVRKLCLVLDLDHTLLNSAKFVEIDPVHEEILRKKEEQDREKPCRHLFRFPHMGMWTKLR 960 Query: 3267 PGIWNFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGALFAGRVISKGDDGDPFDGEEK 3446 PGIWNFLEKAS L+E+HLYTMGNKLYATEMAK+LDP G LFAGRVIS+GDDGDPFDG+E+ Sbjct: 961 PGIWNFLEKASNLFELHLYTMGNKLYATEMAKLLDPKGDLFAGRVISRGDDGDPFDGDER 1020 Query: 3447 LPKNKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEI 3626 +PK+KDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY YFPCSRRQFGL GPSLLEI Sbjct: 1021 VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEI 1080 Query: 3627 DHDERPEEGTLASSLAVIERLHQTFFSHRSLHDVDVRNVLASEQQKILAGCRVVFSRIFP 3806 DHDERPE+GTLAS L VI+R+HQ FF+HRS+ + DVRN+LA+EQ+KILAGCR+VFSR+FP Sbjct: 1081 DHDERPEDGTLASCLGVIQRIHQNFFAHRSIDEADVRNILATEQKKILAGCRIVFSRVFP 1140 Query: 3807 VGEANPHLHPLWQTAEQFGAVCTIQIDEQVTHVVANSLGTDKVNWALSTGRF 3962 VGEANPHLHPLWQTAEQFGAVCT QID+QVTHVVANSLGTDKVNWALSTGRF Sbjct: 1141 VGEANPHLHPLWQTAEQFGAVCTSQIDDQVTHVVANSLGTDKVNWALSTGRF 1192 >ref|XP_004252660.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Solanum lycopersicum] Length = 1211 Score = 887 bits (2293), Expect = 0.0 Identities = 551/1253 (43%), Positives = 716/1253 (57%), Gaps = 28/1253 (2%) Frame = +3 Query: 285 SVEEISEEDFKQE----------AKVLNPKGGD------SRVW-MGDLLNYPVSSNYGSG 413 SVEEISE+ F ++ K+ + + + +RVW M D+ YP+S +Y G Sbjct: 20 SVEEISEDAFNRQDPPTTSTTSKIKIASNENQNQNSTTATRVWTMRDVYKYPISRDYARG 79 Query: 414 LYNFAWAQAVQNKPLTEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXXMKEVCNVII 593 LYN AWAQAVQNKPL E+ + ++ + S K + +V + Sbjct: 80 LYNLAWAQAVQNKPLDELFVMTSDNSNQCANGES------------------KVIIDVDV 121 Query: 594 DDSSEEIDSKAQDVXXXXXXXXXXXXXXXXLDTEMVEETEGGWSNANDSLPSDSGRNSEG 773 DD ++E +EE E +A+ + Sbjct: 122 DDDAKEEGE--------------------------LEEGEIDLDSADLVV---------- 145 Query: 774 EFEKQIKSIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGAPDVDDLIQQSFT 953 F K+ 
IR L++VT+ KSF VC +LQ SL +L + + D+ LIQ T Sbjct: 146 NFGKEANFIREQLQSVTLDETHKSFSMVCSKLQTSLLALGELALSQDKNDI--LIQLFMT 203 Query: 954 GIQAINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFSPERMKEIEAMMYGLDSEAVV 1133 ++ INSVF SMN Q++QN D+ RLL + K+Q L S E++KE++A++ ++ V Sbjct: 204 ALRTINSVFYSMNDHQKQQNTDILSRLLFNAKTQLPALLSSEQLKELDALILSINHSLVS 263 Query: 1134 SHVKAMKKDNGTNPNEFGIL------GENPGQVLNSSNKILLEPIPVKSGDQNIANMGSE 1295 S+ + NG N + + EN Q S NK L + +KS ++ SE Sbjct: 264 SNTQDNDTVNGINVVQLLDMKDSHKSSENANQDFTSVNKYDLGDVSIKSSGLKEQSVSSE 323 Query: 1296 TXXXXXXXXXXXXXXXXXXXDLHRDHDADSLPSPTRETPPFLPVQKLKVVGDGLSKSELA 1475 + DLH+DHD D+LPSPTR+ P P + G+ K +L Sbjct: 324 SVKPGLDNSKAKGLSFPLL-DLHKDHDEDTLPSPTRQIGPQFPATQTH----GMVKLDLP 378 Query: 1476 TPKIADESEDSTLHHYETDALKALSSYQQKFGRTSNILTSRLPSPTPSEECDDGDGDSTG 1655 + + +S LH YETDALKA+SSYQQKFGR+S ++ LPSPTPSEE D G GD+ G Sbjct: 379 IFPASLDKGNSLLHPYETDALKAVSSYQQKFGRSSLFVSENLPSPTPSEEDDSGKGDTGG 438 Query: 1656 EVSSFSTVGDVRNVNLPLPLRSVGSPTPHMDSSMGQRQMPAKTAGHLACVSNPVLRAP-A 1832 EV+SF V + ++N + + S P + GQ +TA L+ + NP LR+ A Sbjct: 439 EVTSFDVVHNASHLNESSMGQPILSSVPQTNILDGQGLGTTRTADPLSFLPNPSLRSSTA 498 Query: 1833 KNRDPRLRFANSEGDALDLNQRPLLEGATKSDTLGGIISSRKHNIVVESVLDGQTLKRQR 2012 K+RDPRLR A S+ A + P+ + K + +I S+K V S D KRQR Sbjct: 499 KSRDPRLRLATSDTVAQN-TILPIPDIDLKLEASLEMIVSKKQKTVDLSAFDAPLPKRQR 557 Query: 2013 NGLTDFAVSKDVQMVSGSCGWLEESSTVGTQATDVNRLAKNMGTDLRKSENGEIVSSERR 2192 + TD + DV+ G+ GWLE+ T T N N D+RK E ++ ++ Sbjct: 558 SEQTDSIIVSDVRPSIGNGGWLEDRGTAELPITSSNCATYNSDNDIRKLE--QVTAT--- 612 Query: 2193 DIDASSNLNVSVGGNEPLPMICTGTTASLPSLLRDIAVNPTMLMQLIMEQQRLAAGAQKK 2372 I ++ V+ N P+ I T TT L SLL+DIA+NP++ M +I +Q QK Sbjct: 613 -IATIPSVIVNAAENFPVTGISTSTT--LHSLLKDIAINPSIWMNIIKTEQ------QKS 663 Query: 2373 SSDSAQNMKPASSSSVIPGIAPLVNSASSKSSEIEQKPAVRHKVPAQTTSMNPQGEWGQI 2552 + S N ASSS I G P + + +SS I Q+ + P T S + E + Sbjct: 664 ADASRTNTAQASSSKSILGAVPSTVAVAPRSSAIGQRSVGILQTPTHTASAD---EVAIV 720 Query: 2553 RMKPRDPRRILHSSTFQKNESLGSDKFKTNGAPXXXXXXXXXXXXXXXXREQAQTTSLXX 2732 RMKPRDPRR+LHS+ K S+G 
D+ KT A +Q S Sbjct: 721 RMKPRDPRRVLHSTAVLKGGSVGLDQCKTGVA---GTHATISNLSFQSQEDQLDRKSAVT 777 Query: 2733 XXXXXXXXXXXFAEKLKNLAHMLSTSQATNTPPTVSQSISSQPEPVKTE----KAGVGAV 2900 F + LKN+A M+S S P+ S S++SQ + + + ++ V Sbjct: 778 LSTTPPDIACQFTKNLKNIADMISVS------PSTSPSVASQTQTLCIQAYQSRSEVKGA 831 Query: 2901 VTELSDQQIGIGAKPEESIAGPARLQNPWGDVEQLFEGYDDXXXXXXXXXXXXXXXXXNK 3080 V+E S+ G E+ G + Q WGDVE LFEGY D K Sbjct: 832 VSEPSEWVNDAGLASEKGSPGSLQPQISWGDVEHLFEGYSDQQRADIQRERTRRLEEQKK 891 Query: 3081 MFAARKXXXXXXXXXXXXNSAKFVEVDPLHDEILRKKEEQDREKPHRHLFRFPHMGMWTK 3260 MF+ RK NSAKFVE+DP+H+EILRKKEEQDREKP+RHLFRFPHMGMWTK Sbjct: 892 MFSVRKLCLVLDLDHTLLNSAKFVEIDPVHEEILRKKEEQDREKPYRHLFRFPHMGMWTK 951 Query: 3261 LRPGIWNFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGALFAGRVISKGDDGDPFDGE 3440 LRPGIWNFLEKAS L+E+HLYTMGNKLYATEMAK+LDP G LFAGRVIS+GDDGDPFDG+ Sbjct: 952 LRPGIWNFLEKASNLFELHLYTMGNKLYATEMAKLLDPKGDLFAGRVISRGDDGDPFDGD 1011 Query: 3441 EKLPKNKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLL 3620 E++PK+KDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY YFPCSRRQFGL GPSLL Sbjct: 1012 ERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLL 1071 Query: 3621 EIDHDERPEEGTLASSLAVIERLHQTFFSHRSLHDVDVRNVLASEQQKILAGCRVVFSRI 3800 EIDHDERPE+GTLAS L VI+R+HQ FF+HRS+ + DVRN+LA+EQ+KILAGCR+VFSR+ Sbjct: 1072 EIDHDERPEDGTLASCLGVIQRIHQNFFTHRSIDEADVRNILATEQKKILAGCRIVFSRV 1131 Query: 3801 FPVGEANPHLHPLWQTAEQFGAVCTIQIDEQVTHVVANSLGTDKVNWALSTGR 3959 FPVGEA+PHLHPLWQTAEQFGAVCT QID+QVTHVVANSLGTDKVNWALSTGR Sbjct: 1132 FPVGEASPHLHPLWQTAEQFGAVCTSQIDDQVTHVVANSLGTDKVNWALSTGR 1184