BLASTX nr result
ID: Papaver32_contig00003964
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Papaver32_contig00003964
         (4446 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

XP_010249185.1 PREDICTED: RNA polymerase II C-terminal domain ph...  1024   0.0
XP_018833954.1 PREDICTED: RNA polymerase II C-terminal domain ph...   882   0.0
XP_018833953.1 PREDICTED: RNA polymerase II C-terminal domain ph...   877   0.0
XP_010656786.1 PREDICTED: RNA polymerase II C-terminal domain ph...   874   0.0
XP_010656789.1 PREDICTED: RNA polymerase II C-terminal domain ph...   874   0.0
EOX99661.1 RNA polymerase II C-terminal domain phosphatase-like ...   873   0.0
XP_007043830.2 PREDICTED: RNA polymerase II C-terminal domain ph...   868   0.0
XP_018840025.1 PREDICTED: RNA polymerase II C-terminal domain ph...   858   0.0
XP_018840024.1 PREDICTED: RNA polymerase II C-terminal domain ph...   853   0.0
GAV71470.1 BRCT domain-containing protein/NIF domain-containing ...   844   0.0
XP_012459417.1 PREDICTED: RNA polymerase II C-terminal domain ph...   843   0.0
XP_012459418.1 PREDICTED: RNA polymerase II C-terminal domain ph...   838   0.0
XP_017615720.1 PREDICTED: RNA polymerase II C-terminal domain ph...   837   0.0
XP_012088736.1 PREDICTED: RNA polymerase II C-terminal domain ph...   833   0.0
XP_016680068.1 PREDICTED: RNA polymerase II C-terminal domain ph...   832   0.0
XP_008791049.1 PREDICTED: RNA polymerase II C-terminal domain ph...   828   0.0
XP_018840026.1 PREDICTED: RNA polymerase II C-terminal domain ph...   821   0.0
OMP02331.1 hypothetical protein CCACVL1_02829 [Corchorus capsula...
827 0.0 OMP09626.1 hypothetical protein COLO4_05290 [Corchorus olitorius] 825 0.0 XP_010682659.1 PREDICTED: RNA polymerase II C-terminal domain ph... 823 0.0 >XP_010249185.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 [Nelumbo nucifera] Length = 1313 Score = 1024 bits (2648), Expect = 0.0 Identities = 635/1324 (47%), Positives = 777/1324 (58%), Gaps = 74/1324 (5%) Frame = -1 Query: 4305 YNQVFPRNFATPMYNYAWKQAVQNRPLRNV--------------------------NDDN 4204 Y + + +N AW QAVQNRPL V N++N Sbjct: 85 YQYPVTSGYGSGFHNLAWAQAVQNRPLDEVFVRDFGSDEKLVRSVSKPMINSREDNNNNN 144 Query: 4203 AAAATS---VVIEISDEGVVVND-----VDSXXXXXXXXXXXXGDNDTEMVE-GTVVESN 4051 + +S V ISD+ D V D D+EMVE G +E + Sbjct: 145 RSLNSSSKEVCNLISDDSSEEIDSKMAVVGEDEKEEGELEEGEIDLDSEMVESGHSIEIS 204 Query: 4050 LNGMPSSTTDDKIMNENEEIKSIRQVIQLVINAKNAGKPFGGACGELWTSLDKLQKFVLN 3871 +G ++ D K + + SIR+ ++ V K A K F C + TSL+ LQ + Sbjct: 205 SDGQSNAEKDLKEKEFEKRLNSIRECLETV-TVKEADKSFDAICFRMRTSLESLQAMISE 263 Query: 3870 NGTSSSVNSLIQQSFAALQAVKSVYCTMNFKEQTQYKDVFSRLLVHVKTQNASLFSSEQM 3691 N + ++ LI+QSF +Q + SVYC+M ++Q Q KD+FSRL+VH+K Q LFS ++M Sbjct: 264 NRVPA-MDDLIEQSFTGIQTINSVYCSMTPQQQEQNKDIFSRLIVHLKIQEPVLFSPDRM 322 Query: 3690 KELEDIIQSLEKQKEILEKNGANQNDREENPSLGMNRIESGIVSKNPLEEKNGASQNGHG 3511 KE+E +++SL+ + NQ E+ +G + I + + L EK G NG Sbjct: 323 KEIESMVRSLDCPSALSNIKVLNQ---EKEALVG---VRENIKNSSILSEKAG---NG-- 371 Query: 3510 ENPRLGTNPIEIGIVGENPPHVLNSSKKSLLEPISVKHNDQGNGKVGSGIFQSGPLSGLK 3331 ++ SKK LEP+ VK+ D N S ++G G + Sbjct: 372 ----------------------VDFSKKFQLEPMPVKYGDWDNLNTRSETSKAGLSFGSR 409 Query: 3330 SRGGFGPXXXXXXXXXXXXLPSPTRDAPKPFLIHKPQVQHRDHGMDRFPSPTRETLRPVQ 3151 SR GFGP HRDH D PSPTR+ P+ Sbjct: 410 SRIGFGPLLDL----------------------------HRDHDADSLPSPTRKAPPPLP 441 Query: 3150 ANKPQAVESRPVKSD-----------GTEMHPYETDAHKAVSTYQQKFGXXXXXXXXXXX 3004 KP ++ +SD T +HPYETDA KAVSTYQQKFG Sbjct: 442 MQKPLSISDGTPRSDLVTNIVEDKMDDTALHPYETDALKAVSTYQQKFGRTSLLLSDRLP 501 Query: 3003 XXXXSEEGNDDEDDSKGEXXXXXXXXXXXXXXXXP-LQAASVYAAFQNNGLCRQGTELEI 2827 
SEE +D + D GE L+ S ++ +N L QG + Sbjct: 502 SPTPSEECDDGDGDINGEVSSSTTVGGVATINSSTSLKTVSSATSYADN-LSGQGLVPAV 560 Query: 2826 NP---------VVRAQKQSRGRDPRRQNLGPEAGSGDLNLRSAYLEHNPPTSGTLEEIIN 2674 + V+R K RDPR + E G DLN R +H+ S L I+ Sbjct: 561 SVGQLGSMSSHVIRTAKN---RDPRLRYANSEVGPLDLNQRPPSGDHDIRKSEPLGGIMG 617 Query: 2673 MRKNKSVPQSVLDGHTLKRQRNGLTRSTVSGTGGWGEDTSVRPQHTLTNQVTESIGSRDP 2494 RK+K V +S+LD HT KRQRNGL S SG D V Sbjct: 618 SRKHKIVEESLLDDHTFKRQRNGLINSGASG------DVQVVS----------------- 654 Query: 2493 RNFGNGGWSEDSVTR-LQPTLTNQVGESIRSSDPRKFGNGEVVFSQRQDNGGRNLTAGAG 2317 G+GGW E+S + LQPT +++ E R SDPRK G+GE F +QD G G Sbjct: 655 ---GSGGWLEESSSMGLQPTDRSRLIEK-RESDPRKLGSGEASFGNKQDTGCSTYNVTTG 710 Query: 2316 GKEQLSLIGNGNMGSLPSTLKDIAVNP-MLINLL-LEHQRL-----QKSNNSPQNLVTGS 2158 G EQL+ G G+ SLPS LKDIAVNP ML++L+ +EHQRL QK N Q+ + S Sbjct: 711 GNEQLTASGIGSTVSLPSLLKDIAVNPTMLMHLIKMEHQRLAVEALQKCGNPAQSTMQSS 770 Query: 2157 SLHGFPGSVPLANIPSSKSLEIDQKHSVKPQVPGQVIST---GDSGKTRMKLRDPRLAAR 1987 S PG + NI S E ++K + Q+ Q S GD GK RMK RDPR Sbjct: 771 SSSVMPGKIASVNIASKTLSEPEKKSAGNSQISVQTASMIPHGDLGKIRMKPRDPRRILH 830 Query: 1986 MNTCQKNESLGPLEQLKTFGAPSSLTQDSRENLIVRHQSVQAQTNSVPSGA---PDISQQ 1816 NT QK++S GP E+ K G PS T R+NLIVR Q QAQTNS+ S + PDI+QQ Sbjct: 831 SNTFQKSDSSGP-ERFKANGTPSPNTPTCRDNLIVRQQGEQAQTNSLLSQSTAPPDIAQQ 889 Query: 1815 FTKELKSLADILSASQA---PSVVPLTVSSPIVPIKTDTTEMKTVVTEFKDQESGTVTAP 1645 FTK+LK++A+ILSASQA PSVVP T+SS VP K D +MK V T+ DQ S + P Sbjct: 890 FTKKLKNIANILSASQAINTPSVVPQTISSQPVPAKMDKVDMKVVATDSNDQRSWSALTP 949 Query: 1644 VERIVQPT-QNMWGDVEHLLEGYDDQERAAIHKERARRMEEQNKMFAARKXXXXXXXXXX 1468 ER P+ QN WGDVEHL EGYDDQ++AAI +ERARR+EEQN+MFAARK Sbjct: 950 EERAAGPSSQNAWGDVEHLFEGYDDQQKAAIQRERARRIEEQNQMFAARKLCLVLDLDHT 1009 Query: 1467 XLNSAKFVEVDPIHEEVLRKKEEQDREKPHRHLFRFPHMRMWTKLRPGVWNFLEKASKLY 1288 LNSAKFVEVDP+HEE+LRKKEEQDREKP RHLFRF HM MWTKLRPG+WNFLEKASKLY Sbjct: 1010 LLNSAKFVEVDPVHEEMLRKKEEQDREKPQRHLFRFTHMGMWTKLRPGIWNFLEKASKLY 1069 Query: 1287 ELHLYTMGNKLYATEMAKVLDPSGALFEGRVISKGDEGDPYDGDERLQKIKDLEGVLGME 1108 
ELHLYTMGNKLYATEMAKVLDP+G LF GRVIS+GD+GDP+DGDER K KDL+GVLGME Sbjct: 1070 ELHLYTMGNKLYATEMAKVLDPTGVLFAGRVISRGDDGDPFDGDERQPKSKDLDGVLGME 1129 Query: 1107 SNVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLMGPSLLEIDHDERPDEGTLALS 928 S VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ GL GPSLLEIDHDERP++GTLA S Sbjct: 1130 SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQLGLHGPSLLEIDHDERPEDGTLASS 1189 Query: 927 LAVIERVHQNFFSHKSLNDLDVRSILAAEQRKILAGCRIVFSRIFPVGEMNPQLHPLWQS 748 LAVIER+HQNFFSH++LND+DVR+ILAAEQ+KILAGCRIVFSR+FPVGE NP LHPLWQ+ Sbjct: 1190 LAVIERIHQNFFSHQNLNDVDVRNILAAEQQKILAGCRIVFSRVFPVGEANPHLHPLWQT 1249 Query: 747 AEQFGAVCSTQIDEHVTHVVANSLGTDKVNWALNTGRYVVHPGWVEASTLLYRRANEHEF 568 AEQFGAVC+ QIDE VTHVVA SLGTDKVNWAL+TGR+VVHPGWVEAS LLYRRANEH+F Sbjct: 1250 AEQFGAVCTNQIDEQVTHVVAISLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEHDF 1309 Query: 567 AVKI 556 A+K+ Sbjct: 1310 AIKL 1313 >XP_018833954.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 isoform X2 [Juglans regia] Length = 1299 Score = 882 bits (2278), Expect = 0.0 Identities = 565/1311 (43%), Positives = 730/1311 (55%), Gaps = 53/1311 (4%) Frame = -1 Query: 4332 VWTMEDLFKYNQVFPRNFATPMYNYAWKQAVQNRPLRN-------VNDDNAAAATSVVIE 4174 VWT++DL++Y R +A+ +YN AW QAVQN+PL V+ D + +S + Sbjct: 75 VWTVQDLYQYQ--VSRGYASSLYNLAWAQAVQNKPLNEIFVMEAEVDPDEKSKQSSALPN 132 Query: 4173 ISDEGVVVNDVDSXXXXXXXXXXXXGDNDT-EMVEGTVVESNLNGMP---SSTTD----- 4021 + +G+ +D D + E+ EG E +L+ P + TD Sbjct: 133 SNSKGIDEMVIDDDNGDDVDVKVVDVDKEEGELEEG---EIDLDSEPVDKGAETDVVKDE 189 Query: 4020 ----DKIMN-ENEEIKSIRQVIQLV-----INAKNAGKPFGGACGELWTSLDKLQKFVLN 3871 ++I+N EN EI S ++V ++ + A K FG C + +L+ L+K Sbjct: 190 AVLCNEIVNVENSEIVSDKRVTSILEALESVTVIEAEKSFGEVCSRMHKTLESLKKVFSE 249 Query: 3870 NGTSSSVNSLIQQSFAALQAVKSVYCTMNFKEQTQYKDVFSRLLVHVKTQNASLFSSEQM 3691 N ++L+Q SF A+QAV SV+C+MN ++ Q KD RL+ +VK N LFSSEQM Sbjct: 250 NHVPLK-DALVQLSFTAIQAVNSVFCSMNNDQKEQNKDNLLRLISYVKNFNPPLFSSEQM 308 Query: 3690 KELEDIIQSLEKQKEILEKNGANQNDREENPSLGMNRIESGIVSKNPLEEKNGASQNGHG 3511 KE+E + S++ +L + ++ N+ + + LE Sbjct: 309 
KEIEVMKPSVDSVDPLLSSTDSVKHYEMTAIDEANNKDSDALAKSDALE----------- 357 Query: 3510 ENPRLGTNPIEIGIVGENPPHVLNSSKKSLLEPISVKHNDQGNGKVGSGIFQSGPLSGLK 3331 L SS K + ++ N + S + + G +S K Sbjct: 358 ----------------------LTSSNKLSSDSVAAGSLVHSNPNILSEVLRPG-ISSFK 394 Query: 3330 SRGGFGPXXXXXXXXXXXXLPSPTRDAPKPFLIHKPQVQHRDHGMDRFPSPTRETLRPVQ 3151 SRG P LPSPTR+AP F + K V GM PT + + Sbjct: 395 SRGALLPLLDLHKDHDADSLPSPTREAPSCFPVLK--VMTVGEGMANPLLPTAKVAHDTE 452 Query: 3150 ANKPQAVESRPVKSDGTEMHPYETDAHKAVSTYQQKFGXXXXXXXXXXXXXXXSEEGNDD 2971 K + YETDA KA S YQQKFG SEE +D Sbjct: 453 EPK---------------LRIYETDALKAFSNYQQKFGRNSFFTSDRLPSPTPSEECDDG 497 Query: 2970 EDDSKGEXXXXXXXXXXXXXXXXPL----QAASVYAAFQNNGLCRQGTELEINPVVRAQK 2803 + D+ GE L ++ ++ Q + T + ++ Sbjct: 498 DGDTGGEVSSSSSSGNLRNVNPPILGQPVTPSTNSSSMQGLITTKNATTASSGSNIISKA 557 Query: 2802 QSRGRDPRRQNLGPEAGSGDLNLRSAYLEHNPPTSGTLEEIINMRKNKSVPQSVLDGHTL 2623 ++ RDPR + + + DLN R L HN P + I+ RK K+V + L+GH L Sbjct: 558 LAKSRDPRLRLANSDLSALDLNQRPLSLVHNTPKVEPV-GTISSRKQKTVEEPTLEGHAL 616 Query: 2622 KRQRNGLTRS-------TVSGTGGWGEDT-SVRPQHTLTNQVTESIGSRDPRNFGNGGWS 2467 KRQR GL S VSG+GGW +DT +V PQ NQ E DPR Sbjct: 617 KRQRIGLENSGVVKDVKNVSGSGGWLDDTGTVGPQLMNRNQFMEK-AEVDPR-------- 667 Query: 2466 EDSVTRLQPTLTNQVGESIRSSDPRKFGNGEVVFSQRQDNGGRNLTAGAGGKEQLSLIGN 2287 ++ E + S N E + R DN + + G Sbjct: 668 -------------KMAEVVSCSSSSCANNNETI--SRNDN--------------VLVTGT 698 Query: 2286 GNMGSLPSTLKDIAVNP-MLINLLLEHQRLQKSNNSPQNLVTGSSLHGFP-------GSV 2131 SLP+ LKDIAVNP ML+N+L + + + ++ QN + + P G+ Sbjct: 699 STTASLPALLKDIAVNPTMLLNILKMGGQQRLAVDALQNSADPAKITTLPACSTSILGAA 758 Query: 2130 PLANIPSSKSLEIDQKHSVKPQVPGQVISTGDSGKTRMKLRDPRLAARMNTCQKNESLGP 1951 PL N+ SK+ + QK + Q P V D+GK RMK RDPR N+ K+ S G Sbjct: 759 PLVNVAPSKASGLLQKPTGTLQNPSLVDPMEDTGKIRMKPRDPRRILHGNSLHKHPSSGH 818 Query: 1950 LEQLKTFGAPSSLTQDSRENLIVRHQSVQAQTNSVPS---GAPDISQQFTKELKSLADIL 1780 E +K P+S TQ S++NL + Q +A SV S PDI++QFTK LK++ADI+ Sbjct: 819 -EHIKIIVPPTSSTQGSKDNLNAQKQEGEADAKSVHSQSVAPPDIARQFTKNLKNIADII 877 Query: 1779 
SASQAPS--VVPLTVSSPIVPIKTDTTEMKTVVTEFKDQES--GTVTAPVERIVQPTQNM 1612 S SQA + ++ +SS V +K+D ++K V + +DQ S T I ++NM Sbjct: 878 SVSQASTTPIISQNMSSETVQVKSDKVDVKVVASNSEDQRSLISTALEVGVAIASRSENM 937 Query: 1611 WGDVEHLLEGYDDQERAAIHKERARRMEEQNKMFAARKXXXXXXXXXXXLNSAKFVEVDP 1432 WGDVEHL EGYDDQ++AAI +ERARR+EEQ KMFAA K LNSAKF EVD Sbjct: 938 WGDVEHLFEGYDDQQKAAIQRERARRIEEQKKMFAAHKLCLVLDLDHTLLNSAKFGEVDH 997 Query: 1431 IHEEVLRKKEEQDREKPHRHLFRFPHMRMWTKLRPGVWNFLEKASKLYELHLYTMGNKLY 1252 +H+E+LRKKEEQDREKP RHLFRFPHM MWTKLRPG+W FLEKASKL+ELHLYTMGNKLY Sbjct: 998 VHDEILRKKEEQDREKPQRHLFRFPHMGMWTKLRPGIWTFLEKASKLFELHLYTMGNKLY 1057 Query: 1251 ATEMAKVLDPSGALFEGRVISKGDEGDPYDGDERLQKIKDLEGVLGMESNVVIIDDSVRV 1072 ATEMAKVLDP G LF GRVIS+GD+GD DGDER+ K KDLEGVLGMES VVIIDDSVRV Sbjct: 1058 ATEMAKVLDPKGVLFAGRVISRGDDGDLIDGDERVPKSKDLEGVLGMESAVVIIDDSVRV 1117 Query: 1071 WPHNKLNLIVVERYTYFPCSRRQFGLMGPSLLEIDHDERPDEGTLALSLAVIERVHQNFF 892 WPHNKLNLIVVERYTYFPCSRRQFGL+GPSLLEIDHDERP+EGTLA SL VIER+HQNFF Sbjct: 1118 WPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEEGTLASSLGVIERIHQNFF 1177 Query: 891 SHKSLNDLDVRSILAAEQRKILAGCRIVFSRIFPVGEMNPQLHPLWQSAEQFGAVCSTQI 712 SH SL+++DVR+ILAAEQRKIL+GCRIVFSR+FPVGE NP LHPLWQ+AEQFGAVC+ QI Sbjct: 1178 SHHSLDEVDVRNILAAEQRKILSGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQI 1237 Query: 711 DEHVTHVVANSLGTDKVNWALNTGRYVVHPGWVEASTLLYRRANEHEFAVK 559 DE VTHVVANSLGTDKVNWAL+TGR+VV+PGWVEAS LLYRRANE +FA+K Sbjct: 1238 DEQVTHVVANSLGTDKVNWALSTGRFVVYPGWVEASALLYRRANERDFAIK 1288 >XP_018833953.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 isoform X1 [Juglans regia] Length = 1302 Score = 877 bits (2265), Expect = 0.0 Identities = 565/1314 (42%), Positives = 730/1314 (55%), Gaps = 56/1314 (4%) Frame = -1 Query: 4332 VWTMEDLFKYNQVFPRNFATPMYNYAWKQAVQNRPLRN-------VNDDNAAAATSVVIE 4174 VWT++DL++Y R +A+ +YN AW QAVQN+PL V+ D + +S + Sbjct: 75 VWTVQDLYQYQ--VSRGYASSLYNLAWAQAVQNKPLNEIFVMEAEVDPDEKSKQSSALPN 132 Query: 4173 ISDEGVVVNDVDSXXXXXXXXXXXXGDNDT-EMVEGTVVESNLNGMP---SSTTD----- 4021 + +G+ +D D + E+ EG E +L+ P + TD 
Sbjct: 133 SNSKGIDEMVIDDDNGDDVDVKVVDVDKEEGELEEG---EIDLDSEPVDKGAETDVVKDE 189 Query: 4020 ----DKIMN-ENEEIKSIRQVIQLV-----INAKNAGKPFGGACGELWTSLDKLQKFVLN 3871 ++I+N EN EI S ++V ++ + A K FG C + +L+ L+K Sbjct: 190 AVLCNEIVNVENSEIVSDKRVTSILEALESVTVIEAEKSFGEVCSRMHKTLESLKKVFSE 249 Query: 3870 NGTSSSVNSLIQQSFAALQAVKSVYCTMNFKEQTQYKDVFSRLLVHVKTQNASLFSSEQM 3691 N ++L+Q SF A+QAV SV+C+MN ++ Q KD RL+ +VK N LFSSEQM Sbjct: 250 NHVPLK-DALVQLSFTAIQAVNSVFCSMNNDQKEQNKDNLLRLISYVKNFNPPLFSSEQM 308 Query: 3690 KELEDIIQSLEKQKEILEKNGANQNDREENPSLGMNRIESGIVSKNPLEEKNGASQNGHG 3511 KE+E + S++ +L + ++ N+ + + LE Sbjct: 309 KEIEVMKPSVDSVDPLLSSTDSVKHYEMTAIDEANNKDSDALAKSDALE----------- 357 Query: 3510 ENPRLGTNPIEIGIVGENPPHVLNSSKKSLLEPISVKHNDQGNGKVGSGIFQSGPLSGLK 3331 L SS K + ++ N + S + + G +S K Sbjct: 358 ----------------------LTSSNKLSSDSVAAGSLVHSNPNILSEVLRPG-ISSFK 394 Query: 3330 SRGGFGPXXXXXXXXXXXXLPSPTRDAPKPFLIHKPQVQHRDHGMDRFPSPTRETLRPVQ 3151 SRG P LPSPTR+AP F + K V GM PT + + Sbjct: 395 SRGALLPLLDLHKDHDADSLPSPTREAPSCFPVLK--VMTVGEGMANPLLPTAKVAHDTE 452 Query: 3150 ANKPQAVESRPVKSDGTEMHPYETDAHKAVSTYQQKFGXXXXXXXXXXXXXXXSEEGNDD 2971 K + YETDA KA S YQQKFG SEE +D Sbjct: 453 EPK---------------LRIYETDALKAFSNYQQKFGRNSFFTSDRLPSPTPSEECDDG 497 Query: 2970 EDDSKGEXXXXXXXXXXXXXXXXPL----QAASVYAAFQNNGLCRQGTELEINPVVRAQK 2803 + D+ GE L ++ ++ Q + T + ++ Sbjct: 498 DGDTGGEVSSSSSSGNLRNVNPPILGQPVTPSTNSSSMQGLITTKNATTASSGSNIISKA 557 Query: 2802 QSRGRDPRRQNLGPEAGSGDLNLRSAYLEHNPPTSGTLEEIINMRKNKSVPQSVLDGHTL 2623 ++ RDPR + + + DLN R L HN P + I+ RK K+V + L+GH L Sbjct: 558 LAKSRDPRLRLANSDLSALDLNQRPLSLVHNTPKVEPV-GTISSRKQKTVEEPTLEGHAL 616 Query: 2622 KRQRNGLTRS-------TVSGTGGWGEDT-SVRPQHTLTNQVTESIGSRDPRNFGNGGWS 2467 KRQR GL S VSG+GGW +DT +V PQ NQ E DPR Sbjct: 617 KRQRIGLENSGVVKDVKNVSGSGGWLDDTGTVGPQLMNRNQFMEK-AEVDPR-------- 667 Query: 2466 EDSVTRLQPTLTNQVGESIRSSDPRKFGNGEVVFSQRQDNGGRNLTAGAGGKEQLSLIGN 2287 ++ E + S N E + R DN + + G Sbjct: 668 -------------KMAEVVSCSSSSCANNNETI--SRNDN--------------VLVTGT 698 Query: 2286 
GNMGSLPSTLKDIAVNP-MLINLLLEHQRLQKSNNSPQNLVTGSSLHGFP-------GSV 2131 SLP+ LKDIAVNP ML+N+L + + + ++ QN + + P G+ Sbjct: 699 STTASLPALLKDIAVNPTMLLNILKMGGQQRLAVDALQNSADPAKITTLPACSTSILGAA 758 Query: 2130 PLANIPSSKSLEIDQKHSVKPQVPGQV---ISTGDSGKTRMKLRDPRLAARMNTCQKNES 1960 PL N+ SK+ + QK + Q P V D+GK RMK RDPR N+ K+ S Sbjct: 759 PLVNVAPSKASGLLQKPTGTLQNPSLVDPMCLQEDTGKIRMKPRDPRRILHGNSLHKHPS 818 Query: 1959 LGPLEQLKTFGAPSSLTQDSRENLIVRHQSVQAQTNSVPS---GAPDISQQFTKELKSLA 1789 G E +K P+S TQ S++NL + Q +A SV S PDI++QFTK LK++A Sbjct: 819 SGH-EHIKIIVPPTSSTQGSKDNLNAQKQEGEADAKSVHSQSVAPPDIARQFTKNLKNIA 877 Query: 1788 DILSASQAPS--VVPLTVSSPIVPIKTDTTEMKTVVTEFKDQES--GTVTAPVERIVQPT 1621 DI+S SQA + ++ +SS V +K+D ++K V + +DQ S T I + Sbjct: 878 DIISVSQASTTPIISQNMSSETVQVKSDKVDVKVVASNSEDQRSLISTALEVGVAIASRS 937 Query: 1620 QNMWGDVEHLLEGYDDQERAAIHKERARRMEEQNKMFAARKXXXXXXXXXXXLNSAKFVE 1441 +NMWGDVEHL EGYDDQ++AAI +ERARR+EEQ KMFAA K LNSAKF E Sbjct: 938 ENMWGDVEHLFEGYDDQQKAAIQRERARRIEEQKKMFAAHKLCLVLDLDHTLLNSAKFGE 997 Query: 1440 VDPIHEEVLRKKEEQDREKPHRHLFRFPHMRMWTKLRPGVWNFLEKASKLYELHLYTMGN 1261 VD +H+E+LRKKEEQDREKP RHLFRFPHM MWTKLRPG+W FLEKASKL+ELHLYTMGN Sbjct: 998 VDHVHDEILRKKEEQDREKPQRHLFRFPHMGMWTKLRPGIWTFLEKASKLFELHLYTMGN 1057 Query: 1260 KLYATEMAKVLDPSGALFEGRVISKGDEGDPYDGDERLQKIKDLEGVLGMESNVVIIDDS 1081 KLYATEMAKVLDP G LF GRVIS+GD+GD DGDER+ K KDLEGVLGMES VVIIDDS Sbjct: 1058 KLYATEMAKVLDPKGVLFAGRVISRGDDGDLIDGDERVPKSKDLEGVLGMESAVVIIDDS 1117 Query: 1080 VRVWPHNKLNLIVVERYTYFPCSRRQFGLMGPSLLEIDHDERPDEGTLALSLAVIERVHQ 901 VRVWPHNKLNLIVVERYTYFPCSRRQFGL+GPSLLEIDHDERP+EGTLA SL VIER+HQ Sbjct: 1118 VRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEEGTLASSLGVIERIHQ 1177 Query: 900 NFFSHKSLNDLDVRSILAAEQRKILAGCRIVFSRIFPVGEMNPQLHPLWQSAEQFGAVCS 721 NFFSH SL+++DVR+ILAAEQRKIL+GCRIVFSR+FPVGE NP LHPLWQ+AEQFGAVC+ Sbjct: 1178 NFFSHHSLDEVDVRNILAAEQRKILSGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCT 1237 Query: 720 TQIDEHVTHVVANSLGTDKVNWALNTGRYVVHPGWVEASTLLYRRANEHEFAVK 559 QIDE VTHVVANSLGTDKVNWAL+TGR+VV+PGWVEAS LLYRRANE +FA+K Sbjct: 1238 
NQIDEQVTHVVANSLGTDKVNWALSTGRFVVYPGWVEASALLYRRANERDFAIK 1291 >XP_010656786.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 isoform X1 [Vitis vinifera] Length = 1276 Score = 874 bits (2258), Expect = 0.0 Identities = 571/1344 (42%), Positives = 732/1344 (54%), Gaps = 77/1344 (5%) Frame = -1 Query: 4359 VIREESKRM---VWTM---EDLFKYNQVFPRNFATPMYNYAWKQAVQNRPLRNV------ 4216 V+RE + VWTM +DL+KY+Q + +YN AW QAVQN+PL ++ Sbjct: 55 VLREAKPKADTRVWTMRDLQDLYKYHQACS-GYTPRLYNLAWAQAVQNKPLNDIFVMDDE 113 Query: 4215 ------------NDDNAAAATSVVIEISDEG----VVVNDVDSXXXXXXXXXXXXGDNDT 4084 DD+++A + I D G V ++DV D++ Sbjct: 114 ESKRSSSSSNTSRDDSSSAKEVAKVIIDDSGDEMDVKMDDVSEKEEGELEEGEIDLDSEP 173 Query: 4083 EMV-EGTVVESNLNGMPSSTTDDKIMNENEEIKSIRQVIQLVINAKNAGKPFGGACGELW 3907 ++ EG V++ N D K E +KSI++ ++ V A K F G C L Sbjct: 174 DVKDEGGVLDVN-----EPEIDLKERELVERVKSIQEDLESV-TVIEAEKSFSGVCSRLQ 227 Query: 3906 TSLDKLQKF----VLNNGTSSSVNSLIQQSFAALQAVKSVYCTMNFKEQTQYKDVFSRLL 3739 +L LQK V+ + + ++L QQ A++A+ V+C+MN ++ KDVFSRLL Sbjct: 228 NTLGSLQKVFGEKVVGESSVPTKDALAQQLINAIRALNHVFCSMNSNQKELNKDVFSRLL 287 Query: 3738 VHVKTQNASLFSSEQMKELEDIIQSLEKQKEILEKNGANQ-NDREENPSLGMNRIESGIV 3562 V+ ++ +FS + +KE+E ++ L+ +++ ND + + N ++S + Sbjct: 288 SCVECGDSPIFSIQHIKEVEVMMSFLDTPAAQSSAEASDKVNDVQVTDGMNRNILDSSV- 346 Query: 3561 SKNPLEEKNGASQNGHGENPRLGTNPIEIGIVGENPPHVLNSSKKSLLEPISVKHNDQGN 3382 E+ S+KK L+ ISV+ +Q N Sbjct: 347 ---------------------------------ESSGRAFASAKKLSLDSISVESYNQNN 373 Query: 3381 GKVGSGIFQSGPLSGLKSRGGFGPXXXXXXXXXXXXLPSPTRDAPKPFLIHKPQVQHRDH 3202 + G LS + R FGP H+DH Sbjct: 374 PDA----LKPG-LSSSRGRFIFGPLLDL----------------------------HKDH 400 Query: 3201 GMDRFPSPTRETLRPVQANKPQAVESRPV-KSDGTEMHPYETDAHKAVSTYQQKFGXXXX 3025 D PSPT + + NK + V ++ ++ + MHPYETDA KAVSTYQQKFG Sbjct: 401 DEDSLPSPTGKAPQCFPVNKSELVTAKVAHETQDSIMHPYETDALKAVSTYQQKFGLTSF 460 Query: 3024 XXXXXXXXXXXSEEGNDDEDDSKGEXXXXXXXXXXXXXXXXPLQAASVYAAFQ------- 2866 SEE D D GE L V +A Q Sbjct: 461 LPIDKLPSPTPSEESGDTYGDISGEVSSSSTISAPITANAPALGHPIVSSAPQMDSSIVQ 
520 Query: 2865 ------NNGLCRQGTELE-------------------INPVVRAQKQSRGRDPRRQNLGP 2761 N L G L+ N ++RA +SR DPR + Sbjct: 521 GPTVGRNTSLVSSGPHLDSSVVQGLVVPRNTGAVNSRFNSILRASAKSR--DPRLRLASS 578 Query: 2760 EAGSGDLNLRSAYLEHNPPTSGTLEEIINMRKNKSVPQSVLDGHTLKRQRNGLTRSTVSG 2581 +AGS DLN R N P L EI++ RK KS + +LDG KRQRNGLT Sbjct: 579 DAGSLDLNERPLPAVSNSPKVDPLGEIVSSRKQKSAEEPLLDGPVTKRQRNGLT------ 632 Query: 2580 TGGWGEDTSVRPQHTLTNQVTESIGSRDPRNFGNGGWSEDSVTRLQPTLT-NQVGESIRS 2404 +VR T+ +GGW EDS T + + NQ+ E+ Sbjct: 633 -----SPATVRDAQTVV---------------ASGGWLEDSNTVIPQMMNRNQLIENT-G 671 Query: 2403 SDPRKFGNGEVVFSQRQDNGGRNLTAGAGGKEQLSLIGNGNMGSLPSTLKDIAVNPMLIN 2224 +DP+K + V D G E L ++ SL S LKDIAVNP + Sbjct: 672 TDPKKLESKVTVTGIGCDKP----YVTVNGNEHLPVVATSTTASLQSLLKDIAVNPAVWM 727 Query: 2223 LLLEHQRLQKSNNSPQNLVTGSSLHGFPGSVPLANIPSSKSLEIDQKHSVKPQVP--GQV 2050 + QKS + +N V + + G VP A++ K + QK + QVP G + Sbjct: 728 NIFNKVEQQKSGDPAKNTVLPPTSNSILGVVPPASVAPLKPSALGQKPAGALQVPQTGPM 787 Query: 2049 ISTGDSGKTRMKLRDPRLAARMNTCQKNESLGPLEQLKTFGAPSSLTQDSRENLIVRHQS 1870 +SGK RMK RDPR N+ Q++ S G EQ KT + Q Sbjct: 788 NPQDESGKVRMKPRDPRRILHANSFQRSGSSGS-EQFKTNA---------------QKQE 831 Query: 1869 VQAQTNSVPSGA---PDISQQFTKELKSLADILSASQAPSVVPL---TVSSPIVPIKTDT 1708 Q +T SVPS + PDISQQFTK LK++AD++SASQA S+ P +SS V + TD Sbjct: 832 DQTETKSVPSHSVNPPDISQQFTKNLKNIADLMSASQASSMTPTFPQILSSQSVQVNTDR 891 Query: 1707 TEMKTVVTEFKDQESGTVTAPVERIVQP-TQNMWGDVEHLLEGYDDQERAAIHKERARRM 1531 ++K V++ DQ + + P P ++N WGDVEHL +GYDDQ++AAI +ERARR+ Sbjct: 892 MDVKATVSDSGDQLTANGSKPESAAGPPQSKNTWGDVEHLFDGYDDQQKAAIQRERARRI 951 Query: 1530 EEQNKMFAARKXXXXXXXXXXXLNSAKFVEVDPIHEEVLRKKEEQDREKPHRHLFRFPHM 1351 EEQ KMF+ARK LNSAKFVEVDP+H+E+LRKKEEQDREK RHLFRFPHM Sbjct: 952 EEQKKMFSARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKSQRHLFRFPHM 1011 Query: 1350 RMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPSGALFEGRVISKGDEGD 1171 MWTKLRPG+WNFLEKASKLYELHLYTMGNKLYATEMAKVLDP G LF GRVISKGD+GD Sbjct: 1012 GMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKGDDGD 1071 Query: 1170 
PYDGDERLQKIKDLEGVLGMESNVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLM 991 DGDER+ K KDLEGVLGMES VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL Sbjct: 1072 VLDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLP 1131 Query: 990 GPSLLEIDHDERPDEGTLALSLAVIERVHQNFFSHKSLNDLDVRSILAAEQRKILAGCRI 811 GPSLLEIDHDERP++GTLA SLAVIER+HQ+FFS+++L+++DVR+ILA+EQRKILAGCRI Sbjct: 1132 GPSLLEIDHDERPEDGTLASSLAVIERIHQSFFSNRALDEVDVRNILASEQRKILAGCRI 1191 Query: 810 VFSRIFPVGEMNPQLHPLWQSAEQFGAVCSTQIDEHVTHVVANSLGTDKVNWALNTGRYV 631 VFSR+FPVGE NP LHPLWQ+AE FGAVC+ QIDE VTHVVANSLGTDKVNWAL+TGR+V Sbjct: 1192 VFSRVFPVGEANPHLHPLWQTAESFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFV 1251 Query: 630 VHPGWVEASTLLYRRANEHEFAVK 559 VHPGWVEAS LLYRRANE +FA+K Sbjct: 1252 VHPGWVEASALLYRRANEQDFAIK 1275 >XP_010656789.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 isoform X2 [Vitis vinifera] Length = 1273 Score = 874 bits (2257), Expect = 0.0 Identities = 571/1342 (42%), Positives = 731/1342 (54%), Gaps = 75/1342 (5%) Frame = -1 Query: 4359 VIREESKRM---VWTM---EDLFKYNQVFPRNFATPMYNYAWKQAVQNRPLRNV------ 4216 V+RE + VWTM +DL+KY+Q + +YN AW QAVQN+PL ++ Sbjct: 55 VLREAKPKADTRVWTMRDLQDLYKYHQACS-GYTPRLYNLAWAQAVQNKPLNDIFVMDDE 113 Query: 4215 ------------NDDNAAAATSVVIEISDEG----VVVNDVDSXXXXXXXXXXXXGDNDT 4084 DD+++A + I D G V ++DV D++ Sbjct: 114 ESKRSSSSSNTSRDDSSSAKEVAKVIIDDSGDEMDVKMDDVSEKEEGELEEGEIDLDSEP 173 Query: 4083 EMV-EGTVVESNLNGMPSSTTDDKIMNENEEIKSIRQVIQLVINAKNAGKPFGGACGELW 3907 ++ EG V++ N D K E +KSI++ ++ V A K F G C L Sbjct: 174 DVKDEGGVLDVN-----EPEIDLKERELVERVKSIQEDLESV-TVIEAEKSFSGVCSRLQ 227 Query: 3906 TSLDKLQKF----VLNNGTSSSVNSLIQQSFAALQAVKSVYCTMNFKEQTQYKDVFSRLL 3739 +L LQK V+ + + ++L QQ A++A+ V+C+MN ++ KDVFSRLL Sbjct: 228 NTLGSLQKVFGEKVVGESSVPTKDALAQQLINAIRALNHVFCSMNSNQKELNKDVFSRLL 287 Query: 3738 VHVKTQNASLFSSEQMKELEDIIQSLEKQKEILEKNGANQ-NDREENPSLGMNRIESGIV 3562 V+ ++ +FS + +KE+E ++ L+ +++ ND + + N ++S + Sbjct: 288 SCVECGDSPIFSIQHIKEVEVMMSFLDTPAAQSSAEASDKVNDVQVTDGMNRNILDSSV- 346 Query: 3561 
SKNPLEEKNGASQNGHGENPRLGTNPIEIGIVGENPPHVLNSSKKSLLEPISVKHNDQGN 3382 E+ S+KK L+ ISV+ +Q N Sbjct: 347 ---------------------------------ESSGRAFASAKKLSLDSISVESYNQNN 373 Query: 3381 GKVGSGIFQSGPLSGLKSRGGFGPXXXXXXXXXXXXLPSPTRDAPKPFLIHKPQVQHRDH 3202 + G LS + R FGP H+DH Sbjct: 374 PDA----LKPG-LSSSRGRFIFGPLLDL----------------------------HKDH 400 Query: 3201 GMDRFPSPTRETLRPVQANKPQAVESRPV-KSDGTEMHPYETDAHKAVSTYQQKFGXXXX 3025 D PSPT + + NK + V ++ ++ + MHPYETDA KAVSTYQQKFG Sbjct: 401 DEDSLPSPTGKAPQCFPVNKSELVTAKVAHETQDSIMHPYETDALKAVSTYQQKFGLTSF 460 Query: 3024 XXXXXXXXXXXSEEGNDDEDDSKGEXXXXXXXXXXXXXXXXPLQAASVYAAFQ------- 2866 SEE D D GE L V +A Q Sbjct: 461 LPIDKLPSPTPSEESGDTYGDISGEVSSSSTISAPITANAPALGHPIVSSAPQMDSSIVQ 520 Query: 2865 ------NNGLCRQGTELE-------------------INPVVRAQKQSRGRDPRRQNLGP 2761 N L G L+ N ++RA +SR DPR + Sbjct: 521 GPTVGRNTSLVSSGPHLDSSVVQGLVVPRNTGAVNSRFNSILRASAKSR--DPRLRLASS 578 Query: 2760 EAGSGDLNLRSAYLEHNPPTSGTLEEIINMRKNKSVPQSVLDGHTLKRQRNGLTRSTVSG 2581 +AGS DLN R N P L EI++ RK KS + +LDG KRQRNGLT Sbjct: 579 DAGSLDLNERPLPAVSNSPKVDPLGEIVSSRKQKSAEEPLLDGPVTKRQRNGLT------ 632 Query: 2580 TGGWGEDTSVRPQHTLTNQVTESIGSRDPRNFGNGGWSEDSVTRLQPTLT-NQVGESIRS 2404 +VR T+ +GGW EDS T + + NQ+ E+ Sbjct: 633 -----SPATVRDAQTVV---------------ASGGWLEDSNTVIPQMMNRNQLIENT-G 671 Query: 2403 SDPRKFGNGEVVFSQRQDNGGRNLTAGAGGKEQLSLIGNGNMGSLPSTLKDIAVNPMLIN 2224 +DP+K + V D G E L ++ SL S LKDIAVNP + Sbjct: 672 TDPKKLESKVTVTGIGCDKP----YVTVNGNEHLPVVATSTTASLQSLLKDIAVNPAVWM 727 Query: 2223 LLLEHQRLQKSNNSPQNLVTGSSLHGFPGSVPLANIPSSKSLEIDQKHSVKPQVPGQVIS 2044 + QKS + +N V + + G VP A++ K + QK + QVP Q Sbjct: 728 NIFNKVEQQKSGDPAKNTVLPPTSNSILGVVPPASVAPLKPSALGQKPAGALQVP-QTGP 786 Query: 2043 TGDSGKTRMKLRDPRLAARMNTCQKNESLGPLEQLKTFGAPSSLTQDSRENLIVRHQSVQ 1864 +SGK RMK RDPR N+ Q++ S G EQ KT + Q Q Sbjct: 787 MDESGKVRMKPRDPRRILHANSFQRSGSSGS-EQFKTNA---------------QKQEDQ 830 Query: 1863 AQTNSVPSGA---PDISQQFTKELKSLADILSASQAPSVVPL---TVSSPIVPIKTDTTE 1702 +T SVPS + PDISQQFTK LK++AD++SASQA S+ P +SS V + TD + Sbjct: 831 
TETKSVPSHSVNPPDISQQFTKNLKNIADLMSASQASSMTPTFPQILSSQSVQVNTDRMD 890 Query: 1701 MKTVVTEFKDQESGTVTAPVERIVQP-TQNMWGDVEHLLEGYDDQERAAIHKERARRMEE 1525 +K V++ DQ + + P P ++N WGDVEHL +GYDDQ++AAI +ERARR+EE Sbjct: 891 VKATVSDSGDQLTANGSKPESAAGPPQSKNTWGDVEHLFDGYDDQQKAAIQRERARRIEE 950 Query: 1524 QNKMFAARKXXXXXXXXXXXLNSAKFVEVDPIHEEVLRKKEEQDREKPHRHLFRFPHMRM 1345 Q KMF+ARK LNSAKFVEVDP+H+E+LRKKEEQDREK RHLFRFPHM M Sbjct: 951 QKKMFSARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKSQRHLFRFPHMGM 1010 Query: 1344 WTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPSGALFEGRVISKGDEGDPY 1165 WTKLRPG+WNFLEKASKLYELHLYTMGNKLYATEMAKVLDP G LF GRVISKGD+GD Sbjct: 1011 WTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKGDDGDVL 1070 Query: 1164 DGDERLQKIKDLEGVLGMESNVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLMGP 985 DGDER+ K KDLEGVLGMES VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL GP Sbjct: 1071 DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGP 1130 Query: 984 SLLEIDHDERPDEGTLALSLAVIERVHQNFFSHKSLNDLDVRSILAAEQRKILAGCRIVF 805 SLLEIDHDERP++GTLA SLAVIER+HQ+FFS+++L+++DVR+ILA+EQRKILAGCRIVF Sbjct: 1131 SLLEIDHDERPEDGTLASSLAVIERIHQSFFSNRALDEVDVRNILASEQRKILAGCRIVF 1190 Query: 804 SRIFPVGEMNPQLHPLWQSAEQFGAVCSTQIDEHVTHVVANSLGTDKVNWALNTGRYVVH 625 SR+FPVGE NP LHPLWQ+AE FGAVC+ QIDE VTHVVANSLGTDKVNWAL+TGR+VVH Sbjct: 1191 SRVFPVGEANPHLHPLWQTAESFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVH 1250 Query: 624 PGWVEASTLLYRRANEHEFAVK 559 PGWVEAS LLYRRANE +FA+K Sbjct: 1251 PGWVEASALLYRRANEQDFAIK 1272 >EOX99661.1 RNA polymerase II C-terminal domain phosphatase-like 3, putative [Theobroma cacao] Length = 1290 Score = 873 bits (2256), Expect = 0.0 Identities = 582/1333 (43%), Positives = 742/1333 (55%), Gaps = 71/1333 (5%) Frame = -1 Query: 4344 SKRMVWTMEDLFKYNQVFPRNFATPMYNYAWKQAVQNRPL--------------RNVNDD 4207 S VWTM+DL KY V R +A+ +YN+AW QAVQN+PL N N Sbjct: 74 SNSRVWTMQDLCKYPSVI-RGYASGLYNFAWAQAVQNKPLNEIFVKDFEQPQQDENKNSK 132 Query: 4206 NAAAATSVVIEISDEG---------VVVNDVDSXXXXXXXXXXXXGDNDTEMVEGTV-VE 4057 ++ ++SV S E VV D DS + E+ EG + ++ Sbjct: 133 
RSSPSSSVASVNSKEEKGSSGNLAVKVVIDDDSEDEMEEDKVVNLDKEEGELEEGEIDLD 192 Query: 4056 SNLNGMPSSTTDDKIMNENEEIKS---IRQVIQLVINAKNAGKPFGGACGELWTSLDKLQ 3886 S S+ D + N +E K IR V++ + A K F G C L +L+ L+ Sbjct: 193 SEPKEKVLSSEDGNVGNSDELEKRANLIRGVLE-GVTVIEAEKSFEGVCSRLHNALESLR 251 Query: 3885 KFVLNNGTSSSVNSLIQQSFAALQAVKSVYCTMNFKEQTQYKDVFSRLLVHVKTQNASLF 3706 +L + ++LIQ +F A+ S + +N + Q + SRLL VK + SLF Sbjct: 252 ALILECSVPAK-DALIQLAFG---AINSAFVALNCNSKEQNVAILSRLLSIVKGHDPSLF 307 Query: 3705 SSEQMKELEDIIQSL-------EKQKEILEKNGANQNDREENPSLGMNRIESGIVSKNPL 3547 ++MKE++ ++ SL + +K++ +G N+ D + P I + N L Sbjct: 308 PPDKMKEIDVMLISLNSPARAIDTEKDMKVVDGVNKKDPDALP----ENICHDLTVTNKL 363 Query: 3546 EEKNGASQNGHGENPRLGTNPIEIGIVGENPPHVLNSSKKSLLEPISVKHNDQGNGKVGS 3367 P V N P+ L + K Sbjct: 364 --------------------PSSAKFVINNKPNALTETLKP------------------- 384 Query: 3366 GIFQSGPLSGLKSRGGFGPXXXXXXXXXXXXLPSPTRDAPKPFLIHKPQVQHRDHGMDRF 3187 + ++RG P LPSPTR+ ++KP Sbjct: 385 ------GVPNFRNRGISLPLLDLHKDHDADSLPSPTRETTPCLPVNKPL----------- 427 Query: 3186 PSPTRETLRPVQANKPQAVESRPVKSDGTEMHPYETDAHKAVSTYQQKFGXXXXXXXXXX 3007 T V ++G ++HPYETDA KA STYQQKFG Sbjct: 428 ------TSGDVMVKSGFMTGKGSHDAEGDKLHPYETDALKAFSTYQQKFGQGSFFSSDRL 481 Query: 3006 XXXXXSEEGNDDEDDSKGE----XXXXXXXXXXXXXXXXPLQAA----SVYAAFQNNGLC 2851 SEE D+ D+ GE + +A S ++ Q Sbjct: 482 PSPTPSEESGDEGGDNGGEVSSSSSIGNFKPNLPILGHPIVSSAPLVDSASSSLQGQITT 541 Query: 2850 RQGTELEINPVVRAQKQSRGRDPRRQNLGPEAGSGDLNLRSAYLEHNPPTSGTLEEIINM 2671 R T + + ++ ++ RDPR A + DLN R L HN + I++ Sbjct: 542 RNATPMSSVSNIVSKSLAKSRDPRLWFANSNASALDLNER---LLHNASKVAPVGGIMDS 598 Query: 2670 RKNKSVPQSVLDGHTLKRQRNGLTR-------STVSGTGGWGEDTSVRPQHTLTNQVTES 2512 RK KSV + +LD LKRQRN L TVSG GGW ED T++ Sbjct: 599 RKKKSVEEPILDSPALKRQRNELENLGVARDVQTVSGIGGWLED-------------TDA 645 Query: 2511 IGSRDPRNFGNGGWSEDSVTRLQPTLTNQVGESIRSSDPRKFGNGEVVFSQRQDNGGRNL 2332 IGS Q T NQ E++ S+ RK NG V S +G N+ Sbjct: 646 IGS-------------------QITNRNQTAENLESNS-RKMDNG--VTSSSTLSGKTNI 683 Query: 2331 
TAGAGGKEQLSLIGNGNMGSLPSTLKDIAVNP-MLINLLL--EHQRL-----QKSNNSPQ 2176 T G EQ+ + + + SLP+ LKDIAVNP MLIN+L + QRL QKS + + Sbjct: 684 TVGT--NEQVP-VTSTSTPSLPALLKDIAVNPTMLINILKMGQQQRLGAEAQQKSPDPVK 740 Query: 2175 NLVTGSSLHGFPGSVPLANIPSSKSL----EIDQKHSVKPQVPGQVISTGDSGKTRMKLR 2008 + S + G V N+ S S+ I S KP QV S +SGK RMK R Sbjct: 741 STFHQPSSNSLLGVVSSTNVIPSPSVNNVPSISSGISSKPAGNLQVPSPDESGKIRMKPR 800 Query: 2007 DPRLAARMNTCQKNESLGPLEQLKTFGAPSSLTQDSRENLIVRHQSVQAQTNSVPSGA-- 1834 DPR N+ Q++ S+G L+QLKT GA +S TQ S++NL Q + +QT S P + Sbjct: 801 DPRRVLHGNSLQRSGSMG-LDQLKTNGALTSSTQGSKDNL--NAQKLDSQTESKPMQSQL 857 Query: 1833 ---PDISQQFTKELKSLADILSASQAPSVVPLTVSSPIVP----IKTDTTEMKTVVTEFK 1675 PDI+QQFT LK++ADI+S SQA + +P VS +VP IK+D+ +MK +V+ + Sbjct: 858 VPPPDITQQFTNNLKNIADIMSVSQALTSLP-PVSHNLVPQPVLIKSDSMDMKALVSNSE 916 Query: 1674 DQESGTVTAPVERIVQP-TQNMWGDVEHLLEGYDDQERAAIHKERARRMEEQNKMFAARK 1498 DQ++G AP P +QN WGDVEHL E YDDQ++AAI +ERARR+EEQ KMF+ARK Sbjct: 917 DQQTGAGLAPEAGATGPRSQNAWGDVEHLFERYDDQQKAAIQRERARRIEEQKKMFSARK 976 Query: 1497 XXXXXXXXXXXLNSAKFVEVDPIHEEVLRKKEEQDREKPHRHLFRFPHMRMWTKLRPGVW 1318 LNSAKF+EVDP+HEE+LRKKEEQDREKP RHLFRF HM MWTKLRPG+W Sbjct: 977 LCLVLDLDHTLLNSAKFIEVDPVHEEILRKKEEQDREKPERHLFRFHHMGMWTKLRPGIW 1036 Query: 1317 NFLEKASKLYELHLYTMGNKLYATEMAKVLDPSGALFEGRVISKGDEGDPYDGDERLQKI 1138 NFLEKASKLYELHLYTMGNKLYATEMAKVLDP G LF GRVIS+GD+GDP+DGDER+ + Sbjct: 1037 NFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRS 1096 Query: 1137 KDLEGVLGMESNVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLMGPSLLEIDHDE 958 KDLEGVLGMES VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL+GPSLLEIDHDE Sbjct: 1097 KDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDE 1156 Query: 957 RPDEGTLALSLAVIERVHQNFFSHKSLNDLDVRSILAAEQRKILAGCRIVFSRIFPVGEM 778 RP++GTLA SLAVIER+HQ+FFSH++L+D+DVR+ILA+EQRKILAGCRIVFSR+FPVGE Sbjct: 1157 RPEDGTLASSLAVIERIHQDFFSHQNLDDVDVRNILASEQRKILAGCRIVFSRVFPVGEA 1216 Query: 777 NPQLHPLWQSAEQFGAVCSTQIDEHVTHVVANSLGTDKVNWALNTGRYVVHPGWVEASTL 598 NP LHPLWQ+AEQFGAVC+ QIDEHVTHVVANSLGTDKVNWAL+TG++VVHPGWVEAS L Sbjct: 
1217 NPHLHPLWQTAEQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASAL 1276 Query: 597 LYRRANEHEFAVK 559 LYRRANE +FA+K Sbjct: 1277 LYRRANEVDFAIK 1289 >XP_007043830.2 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 [Theobroma cacao] Length = 1290 Score = 868 bits (2244), Expect = 0.0 Identities = 580/1333 (43%), Positives = 742/1333 (55%), Gaps = 71/1333 (5%) Frame = -1 Query: 4344 SKRMVWTMEDLFKYNQVFPRNFATPMYNYAWKQAVQNRPL--------------RNVNDD 4207 S VWTM+DL KY V R +A+ +YN+AW QAVQN+PL N N Sbjct: 74 SNSRVWTMQDLCKYPSVI-RGYASGLYNFAWAQAVQNKPLNEIFVKDFEQPQQDENKNSK 132 Query: 4206 NAAAATSVVIEISDEG---------VVVNDVDSXXXXXXXXXXXXGDNDTEMVEGTV-VE 4057 ++ ++SV S E VV D DS + E+ EG + ++ Sbjct: 133 RSSPSSSVASVNSKEEKGSSGNLAVKVVIDDDSEDEMEEDKVVNLDKEEGELEEGEIDLD 192 Query: 4056 SNLNGMPSSTTDDKIMNENEEIKS---IRQVIQLVINAKNAGKPFGGACGELWTSLDKLQ 3886 S S+ D + N +E K IR V++ + A K F G C L +L+ L+ Sbjct: 193 SEPKEKVLSSEDGNVGNSDELEKRANLIRGVLE-GVTVIEAEKSFEGVCSRLQNALESLR 251 Query: 3885 KFVLNNGTSSSVNSLIQQSFAALQAVKSVYCTMNFKEQTQYKDVFSRLLVHVKTQNASLF 3706 +L + ++LIQ +F A+ S + +N + Q + SRLL VK + SLF Sbjct: 252 ALILECSVPAK-DALIQLAFG---AINSAFVALNCNSKEQNVAILSRLLSIVKGHDPSLF 307 Query: 3705 SSEQMKELEDIIQSL-------EKQKEILEKNGANQNDREENPSLGMNRIESGIVSKNPL 3547 ++MKE++ ++ SL + +K++ +G N+ D + P I + N L Sbjct: 308 PPDKMKEIDVMLISLNSPARAIDTEKDMKVVDGVNKKDPDALP----ENICHDLTVTNKL 363 Query: 3546 EEKNGASQNGHGENPRLGTNPIEIGIVGENPPHVLNSSKKSLLEPISVKHNDQGNGKVGS 3367 P V N P+ L + K Sbjct: 364 --------------------PSSAKFVINNKPNALTETLKP------------------- 384 Query: 3366 GIFQSGPLSGLKSRGGFGPXXXXXXXXXXXXLPSPTRDAPKPFLIHKPQVQHRDHGMDRF 3187 + ++RG P LPSPTR+ ++KP Sbjct: 385 ------GVPNFRNRGISLPLLDLHKDHDADSLPSPTRETTPCLPVNKPL----------- 427 Query: 3186 PSPTRETLRPVQANKPQAVESRPVKSDGTEMHPYETDAHKAVSTYQQKFGXXXXXXXXXX 3007 T V ++G ++HPYETDA KA STYQQKFG Sbjct: 428 ------TSGDVMVKSGFMTGKGSHDAEGDKLHPYETDALKAFSTYQQKFGQGSFFSSDRL 481 Query: 3006 XXXXXSEEGNDDEDDSKGE----XXXXXXXXXXXXXXXXPLQAA----SVYAAFQNNGLC 2851 SEE D+ D+ GE + +A S ++ Q Sbjct: 482 
PSPTPSEESGDEGGDNGGEVSSSSSIGNFKPNLPILGHPIVSSAPLVDSASSSLQGQITT 541 Query: 2850 RQGTELEINPVVRAQKQSRGRDPRRQNLGPEAGSGDLNLRSAYLEHNPPTSGTLEEIINM 2671 R T + + ++ ++ RDPR A + DLN R L HN + I++ Sbjct: 542 RNATPMSSVSNIVSKSLAKSRDPRLWFANSNASALDLNER---LLHNASKVAPVGGIMDS 598 Query: 2670 RKNKSVPQSVLDGHTLKRQRNGLTR-------STVSGTGGWGEDTSVRPQHTLTNQVTES 2512 RK KSV + +LD LKRQRN L TVSG GGW ED T++ Sbjct: 599 RKKKSVEEPILDSPALKRQRNELENLGVARDVQTVSGIGGWLED-------------TDA 645 Query: 2511 IGSRDPRNFGNGGWSEDSVTRLQPTLTNQVGESIRSSDPRKFGNGEVVFSQRQDNGGRNL 2332 IGS Q T NQ E++ S+ RK NG V S +G N+ Sbjct: 646 IGS-------------------QITNRNQTAENLESNS-RKMDNG--VTSSSTLSGKTNI 683 Query: 2331 TAGAGGKEQLSLIGNGNMGSLPSTLKDIAVNP-MLINLLL--EHQRL-----QKSNNSPQ 2176 T G EQ+ + + + SLP+ LKDIAVNP MLIN+L + QRL QKS + + Sbjct: 684 TVGT--NEQVP-VTSTSTPSLPALLKDIAVNPTMLINILKMGQQQRLGAEAQQKSPDPVK 740 Query: 2175 NLVTGSSLHGFPGSVPLANIPSSKSL----EIDQKHSVKPQVPGQVISTGDSGKTRMKLR 2008 + S + G V N+ S S+ I S KP QV S +SGK RMK R Sbjct: 741 STFHQPSSNSLLGVVSSTNVIPSPSVNNVPSISSGISSKPAGNLQVPSPDESGKIRMKPR 800 Query: 2007 DPRLAARMNTCQKNESLGPLEQLKTFGAPSSLTQDSRENLIVRHQSVQAQTNSVPSGA-- 1834 DPR N+ Q++ S+GP +QLKT GA +S TQ S++NL Q + +QT S P + Sbjct: 801 DPRRVLHGNSLQRSGSMGP-DQLKTNGALTSSTQGSKDNL--NAQKLDSQTESKPMQSQL 857 Query: 1833 ---PDISQQFTKELKSLADILSASQA-PSVVPLT---VSSPIVPIKTDTTEMKTVVTEFK 1675 PDI+QQFT LK++A I+S SQA S+ P++ V P++ IK+D+ +MK +V+ + Sbjct: 858 VPPPDITQQFTNNLKNIAGIVSVSQALTSLSPVSHNLVPQPVL-IKSDSMDMKALVSNSE 916 Query: 1674 DQESGTVTAPVERIVQP-TQNMWGDVEHLLEGYDDQERAAIHKERARRMEEQNKMFAARK 1498 DQ++G AP P +QN WGDVEHL E YDDQ++AAI +ERARR+EEQ KMF+ARK Sbjct: 917 DQQTGAGLAPEAGATGPHSQNAWGDVEHLFERYDDQQKAAIQRERARRIEEQKKMFSARK 976 Query: 1497 XXXXXXXXXXXLNSAKFVEVDPIHEEVLRKKEEQDREKPHRHLFRFPHMRMWTKLRPGVW 1318 LNSAKF+EVDP+HEE+LRKKEEQDREKP RHLFRF HM MWTKLRPG+W Sbjct: 977 LCLVLDLDHTLLNSAKFIEVDPVHEEILRKKEEQDREKPERHLFRFHHMGMWTKLRPGIW 1036 Query: 1317 NFLEKASKLYELHLYTMGNKLYATEMAKVLDPSGALFEGRVISKGDEGDPYDGDERLQKI 1138 NFLEKASKLYELHLYTMGNKLYATEMAKVLDP G LF GRVIS+GD+GDP+DGDER+ + 
Sbjct: 1037 NFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRS 1096 Query: 1137 KDLEGVLGMESNVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLMGPSLLEIDHDE 958 KDLEGVLGMES VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL+GPSLLEIDHDE Sbjct: 1097 KDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDE 1156 Query: 957 RPDEGTLALSLAVIERVHQNFFSHKSLNDLDVRSILAAEQRKILAGCRIVFSRIFPVGEM 778 RP++GTLA SLAVIER+HQ+FFSH++L+D+DVR+ILA+EQRKILAGCRIVFSR+FPVGE Sbjct: 1157 RPEDGTLASSLAVIERIHQDFFSHQNLDDVDVRNILASEQRKILAGCRIVFSRVFPVGEA 1216 Query: 777 NPQLHPLWQSAEQFGAVCSTQIDEHVTHVVANSLGTDKVNWALNTGRYVVHPGWVEASTL 598 NP LHPLWQ+AEQFGAVC+ QIDEHVTHVVANSLGTDKVNWAL+TG++VVHPGWVEAS L Sbjct: 1217 NPHLHPLWQTAEQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASAL 1276 Query: 597 LYRRANEHEFAVK 559 LYRRANE +FA+K Sbjct: 1277 LYRRANEVDFAIK 1289 >XP_018840025.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 isoform X2 [Juglans regia] Length = 1280 Score = 858 bits (2218), Expect = 0.0 Identities = 551/1317 (41%), Positives = 726/1317 (55%), Gaps = 47/1317 (3%) Frame = -1 Query: 4368 TKPVIREESKRMVWTMEDLFKYNQVFPRNFATPMYNYAWKQAVQNRPLRNV--------- 4216 TK + ++ VWTM+DL+KY R + + +YN AW QAVQN+PL + Sbjct: 63 TKVSSKPKAGARVWTMQDLYKYQ--VSRGYGSSLYNLAWAQAVQNKPLNEIFVMGAEVDL 120 Query: 4215 ----------NDDNAAAATSVVIEISDEGVVVNDVDSXXXXXXXXXXXXGDNDTEMVEGT 4066 + NA V+++ + + V D D+E +E Sbjct: 121 DEKSKRSSAPPNSNAKEVDEVMVDNDSKDEMDAKVVDVGKEEGELEEGEIDLDSEPIEKE 180 Query: 4065 VVESNLN-----GMPSSTTDDKIMNENEEIKSIRQVIQ--LVINAKNAGKPFGGACGELW 3907 V + G ++ + + + IR+ ++ VI A+ + FG C + Sbjct: 181 VESEEIKEEAVLGREGVNVENSEIVLEKRVTWIRETLESATVIEAETS---FGEVCSRVH 237 Query: 3906 TSLDKLQKFVLNNGTSSSVNSLIQQSFAALQAVKSVYCTMNFKEQTQYKDVFSRLLVHVK 3727 ++++ L++ VL+ + + ++L+Q F A++AV SV+ +MN + Q K+ R++ VK Sbjct: 238 STMESLRE-VLSESSVPTKDALVQLLFTAIKAVNSVFSSMNRNRKEQNKENVLRVISDVK 296 Query: 3726 TQNASLFSSEQMKELEDIIQSLEKQKEILEKNGANQNDREENPSLGMNRIESGIVSKNPL 3547 N LFSSEQMKE+E + S++ +L G+ R E + Sbjct: 297 
FGNPPLFSSEQMKEIEVMRSSVDSVDALLSTID------------GVKRKEMAAIDAANN 344 Query: 3546 EEKNGASQNGHGENPRLGTNPIEIGIVGENPPHVLNSSKKSLLEPISVKHNDQGNGKVGS 3367 ++ + ++ + E L S+K S + I+V N + Sbjct: 345 KDFDASTTSDGRE---------------------LTSNKLSS-DSIAVGSLVLSNANILP 382 Query: 3366 GIFQSGPLSGLKSRGGFGPXXXXXXXXXXXXLPSPTRDAPKPFLIHKPQVQHRDHGMDRF 3187 + + G +S KSR P LPSPTR+AP F +H + GM R Sbjct: 383 EVLKPG-VSSFKSRAILLPLLDLHKDHDIDSLPSPTREAPSSFPVHN--IMDIGDGMARP 439 Query: 3186 PSPTRETLRPVQANKPQAVESRPVKSDGTEMHPYETDAHKAVSTYQQKFGXXXXXXXXXX 3007 PT + + +K +H YETDA KA STYQQKFG Sbjct: 440 VLPTAKVAHDTENSK---------------LHIYETDALKAFSTYQQKFGQNSLFTSDLP 484 Query: 3006 XXXXXSEEGNDDEDDSKGEXXXXXXXXXXXXXXXXPLQAASVYAAFQNNGL-----CRQG 2842 EE +D + D+ GE L + ++ + + Sbjct: 485 SPTPS-EEFDDGDGDTSGEVSSSSTIGNIRNVNPPFLWGPPGTPSMDSSSMDGPITTKNS 543 Query: 2841 TELEI--NPVVRAQKQSRGRDPRRQNLGPEAGSGDLNLRSAYLEHNPPTSGTLEEIINMR 2668 T + N +V+A +SR DPR + ++ + N H+ P + I + + Sbjct: 544 TPITFGSNSIVKASAKSR--DPRLRLANYDSNALYFNQHPLSSVHDTPKVEPVGTI-SSK 600 Query: 2667 KNKSVPQSVLDGHTLKRQRNGLTRSTVSGTGGWGEDTSVRPQHTLTNQVTESIGSRDPRN 2488 K K++ + L+GH LKRQRNGL S V RD +N Sbjct: 601 KQKALEEPTLEGHALKRQRNGLENSGVV---------------------------RDMKN 633 Query: 2487 F-GNGGWSEDSVTRLQPTLTNQVGESIRSSDPRKFGNGEVVFSQRQDNGGRNLTAGAGGK 2311 G+GGW +D+ T + +DPRK E+V N T G Sbjct: 634 VSGSGGWLDDTKTVGSQLMNRNQLMETAETDPRKMA--EIVSCSGISCANANATIS--GN 689 Query: 2310 EQLSLIGNGNMGSLPSTLKDIAVNP-MLINLL-------LEHQRLQKSNNSPQNLVTGSS 2155 EQ+S+ G SLP+ LKDIAVNP +L+N+L LE QKS + ++ S Sbjct: 690 EQVSVTGTSAAASLPALLKDIAVNPTVLLNILKMGQQQSLEADVQQKSADPAKSTTQPPS 749 Query: 2154 LHGFPGSVPLANIPSSKSLEIDQKHSVKPQVPGQVISTGDSGKTRMKLRDPRLAARMNTC 1975 + G+ P+ N+ SK L + QK + +VP Q++ D GK RMK RDPR NT Sbjct: 750 SNSILGTAPMVNVAPSKVLGLLQKQAATLKVPSQIVPMEDLGKIRMKPRDPRRILHDNTL 809 Query: 1974 QKNESLGPLEQLKTFGAPSSLTQDSRENLIVRHQSVQAQTNSVPSGAPDISQQFTKELKS 1795 QKN SLG EQ K +S TQ + + Q+ T PDI++QFTK LK+ Sbjct: 810 QKNPSLG-YEQPKITVPLASSTQKQEGQVDTKSTPFQSVTQ------PDIARQFTKNLKN 862 Query: 1794 
LADILSASQAPSVVPL---TVSSPIVPIKTDTTEMKTVVTEFKDQESGTVTAPVERIVQP 1624 +AD +S S A + +P+ ++S V K + +MKTV + +DQ SGT AP + Sbjct: 863 IADFISVSLASTTLPIISHSISCGAVQGKPEKVDMKTVASNSEDQRSGTSPAPEIGVAMA 922 Query: 1623 T--QNMWGDVEHLLEGYDDQERAAIHKERARRMEEQNKMFAARKXXXXXXXXXXXLNSAK 1450 + +NMWGDVEHL EGYDDQ++AAI +ERARR+EEQ KMF+A K LNSAK Sbjct: 923 SRPENMWGDVEHLFEGYDDQQKAAIQRERARRIEEQKKMFSAHKLCLVLDLDHTLLNSAK 982 Query: 1449 FVEVDPIHEEVLRKKEEQDREKPHRHLFRFPHMRMWTKLRPGVWNFLEKASKLYELHLYT 1270 F EVDPIH+E+LRKKEEQDREK RHLFRFPHM MWTKLRPG+WNFLEKASKLYELHLYT Sbjct: 983 FGEVDPIHDEILRKKEEQDREKQQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYT 1042 Query: 1269 MGNKLYATEMAKVLDPSGALFEGRVISKGDEGDPYDGDERLQKIKDLEGVLGMESNVVII 1090 MGNKLYATEMAKVLDP G LF GRVIS+GD+GD +DGDER+ K KDLEGVLGMES VVII Sbjct: 1043 MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDLFDGDERVPKSKDLEGVLGMESAVVII 1102 Query: 1089 DDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLMGPSLLEIDHDERPDEGTLALSLAVIER 910 DDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL GPSLLEIDHDERP++GTLA S AVIER Sbjct: 1103 DDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASSSAVIER 1162 Query: 909 VHQNFFSHKSLNDLDVRSILAAEQRKILAGCRIVFSRIFPVGEMNPQLHPLWQSAEQFGA 730 +HQNFFSH+SL+++DVR+ILAAEQRKIL GC IVFSR+FPVGE NP LHPLWQ+AEQFGA Sbjct: 1163 LHQNFFSHQSLDEVDVRNILAAEQRKILGGCSIVFSRVFPVGEANPHLHPLWQTAEQFGA 1222 Query: 729 VCSTQIDEHVTHVVANSLGTDKVNWALNTGRYVVHPGWVEASTLLYRRANEHEFAVK 559 VC+ QIDE VTHVVANSLGTDKVNWAL+TGR+VV+PGWVEAS LLYRRANE +FA+K Sbjct: 1223 VCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVYPGWVEASALLYRRANERDFAIK 1279 >XP_018840024.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 isoform X1 [Juglans regia] Length = 1283 Score = 853 bits (2204), Expect = 0.0 Identities = 551/1320 (41%), Positives = 726/1320 (55%), Gaps = 50/1320 (3%) Frame = -1 Query: 4368 TKPVIREESKRMVWTMEDLFKYNQVFPRNFATPMYNYAWKQAVQNRPLRNV--------- 4216 TK + ++ VWTM+DL+KY R + + +YN AW QAVQN+PL + Sbjct: 63 TKVSSKPKAGARVWTMQDLYKYQ--VSRGYGSSLYNLAWAQAVQNKPLNEIFVMGAEVDL 120 Query: 4215 ----------NDDNAAAATSVVIEISDEGVVVNDVDSXXXXXXXXXXXXGDNDTEMVEGT 4066 + NA V+++ + + 
V D D+E +E Sbjct: 121 DEKSKRSSAPPNSNAKEVDEVMVDNDSKDEMDAKVVDVGKEEGELEEGEIDLDSEPIEKE 180 Query: 4065 VVESNLN-----GMPSSTTDDKIMNENEEIKSIRQVIQ--LVINAKNAGKPFGGACGELW 3907 V + G ++ + + + IR+ ++ VI A+ + FG C + Sbjct: 181 VESEEIKEEAVLGREGVNVENSEIVLEKRVTWIRETLESATVIEAETS---FGEVCSRVH 237 Query: 3906 TSLDKLQKFVLNNGTSSSVNSLIQQSFAALQAVKSVYCTMNFKEQTQYKDVFSRLLVHVK 3727 ++++ L++ VL+ + + ++L+Q F A++AV SV+ +MN + Q K+ R++ VK Sbjct: 238 STMESLRE-VLSESSVPTKDALVQLLFTAIKAVNSVFSSMNRNRKEQNKENVLRVISDVK 296 Query: 3726 TQNASLFSSEQMKELEDIIQSLEKQKEILEKNGANQNDREENPSLGMNRIESGIVSKNPL 3547 N LFSSEQMKE+E + S++ +L G+ R E + Sbjct: 297 FGNPPLFSSEQMKEIEVMRSSVDSVDALLSTID------------GVKRKEMAAIDAANN 344 Query: 3546 EEKNGASQNGHGENPRLGTNPIEIGIVGENPPHVLNSSKKSLLEPISVKHNDQGNGKVGS 3367 ++ + ++ + E L S+K S + I+V N + Sbjct: 345 KDFDASTTSDGRE---------------------LTSNKLSS-DSIAVGSLVLSNANILP 382 Query: 3366 GIFQSGPLSGLKSRGGFGPXXXXXXXXXXXXLPSPTRDAPKPFLIHKPQVQHRDHGMDRF 3187 + + G +S KSR P LPSPTR+AP F +H + GM R Sbjct: 383 EVLKPG-VSSFKSRAILLPLLDLHKDHDIDSLPSPTREAPSSFPVHN--IMDIGDGMARP 439 Query: 3186 PSPTRETLRPVQANKPQAVESRPVKSDGTEMHPYETDAHKAVSTYQQKFGXXXXXXXXXX 3007 PT + + +K +H YETDA KA STYQQKFG Sbjct: 440 VLPTAKVAHDTENSK---------------LHIYETDALKAFSTYQQKFGQNSLFTSDLP 484 Query: 3006 XXXXXSEEGNDDEDDSKGEXXXXXXXXXXXXXXXXPLQAASVYAAFQNNGL-----CRQG 2842 EE +D + D+ GE L + ++ + + Sbjct: 485 SPTPS-EEFDDGDGDTSGEVSSSSTIGNIRNVNPPFLWGPPGTPSMDSSSMDGPITTKNS 543 Query: 2841 TELEI--NPVVRAQKQSRGRDPRRQNLGPEAGSGDLNLRSAYLEHNPPTSGTLEEIINMR 2668 T + N +V+A +SR DPR + ++ + N H+ P + I + + Sbjct: 544 TPITFGSNSIVKASAKSR--DPRLRLANYDSNALYFNQHPLSSVHDTPKVEPVGTI-SSK 600 Query: 2667 KNKSVPQSVLDGHTLKRQRNGLTRSTVSGTGGWGEDTSVRPQHTLTNQVTESIGSRDPRN 2488 K K++ + L+GH LKRQRNGL S V RD +N Sbjct: 601 KQKALEEPTLEGHALKRQRNGLENSGVV---------------------------RDMKN 633 Query: 2487 F-GNGGWSEDSVTRLQPTLTNQVGESIRSSDPRKFGNGEVVFSQRQDNGGRNLTAGAGGK 2311 G+GGW +D+ T + +DPRK E+V N T G Sbjct: 634 VSGSGGWLDDTKTVGSQLMNRNQLMETAETDPRKMA--EIVSCSGISCANANATIS--GN 689 Query: 2310 
EQLSLIGNGNMGSLPSTLKDIAVNP-MLINLL-------LEHQRLQKSNNSPQNLVTGSS 2155 EQ+S+ G SLP+ LKDIAVNP +L+N+L LE QKS + ++ S Sbjct: 690 EQVSVTGTSAAASLPALLKDIAVNPTVLLNILKMGQQQSLEADVQQKSADPAKSTTQPPS 749 Query: 2154 LHGFPGSVPLANIPSSKSLEIDQKHSVKPQVPGQVISTG---DSGKTRMKLRDPRLAARM 1984 + G+ P+ N+ SK L + QK + +VP Q++ D GK RMK RDPR Sbjct: 750 SNSILGTAPMVNVAPSKVLGLLQKQAATLKVPSQIVPMHLQEDLGKIRMKPRDPRRILHD 809 Query: 1983 NTCQKNESLGPLEQLKTFGAPSSLTQDSRENLIVRHQSVQAQTNSVPSGAPDISQQFTKE 1804 NT QKN SLG EQ K +S TQ + + Q+ T PDI++QFTK Sbjct: 810 NTLQKNPSLG-YEQPKITVPLASSTQKQEGQVDTKSTPFQSVTQ------PDIARQFTKN 862 Query: 1803 LKSLADILSASQAPSVVPL---TVSSPIVPIKTDTTEMKTVVTEFKDQESGTVTAPVERI 1633 LK++AD +S S A + +P+ ++S V K + +MKTV + +DQ SGT AP + Sbjct: 863 LKNIADFISVSLASTTLPIISHSISCGAVQGKPEKVDMKTVASNSEDQRSGTSPAPEIGV 922 Query: 1632 VQPT--QNMWGDVEHLLEGYDDQERAAIHKERARRMEEQNKMFAARKXXXXXXXXXXXLN 1459 + +NMWGDVEHL EGYDDQ++AAI +ERARR+EEQ KMF+A K LN Sbjct: 923 AMASRPENMWGDVEHLFEGYDDQQKAAIQRERARRIEEQKKMFSAHKLCLVLDLDHTLLN 982 Query: 1458 SAKFVEVDPIHEEVLRKKEEQDREKPHRHLFRFPHMRMWTKLRPGVWNFLEKASKLYELH 1279 SAKF EVDPIH+E+LRKKEEQDREK RHLFRFPHM MWTKLRPG+WNFLEKASKLYELH Sbjct: 983 SAKFGEVDPIHDEILRKKEEQDREKQQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELH 1042 Query: 1278 LYTMGNKLYATEMAKVLDPSGALFEGRVISKGDEGDPYDGDERLQKIKDLEGVLGMESNV 1099 LYTMGNKLYATEMAKVLDP G LF GRVIS+GD+GD +DGDER+ K KDLEGVLGMES V Sbjct: 1043 LYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDLFDGDERVPKSKDLEGVLGMESAV 1102 Query: 1098 VIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLMGPSLLEIDHDERPDEGTLALSLAV 919 VIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL GPSLLEIDHDERP++GTLA S AV Sbjct: 1103 VIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASSSAV 1162 Query: 918 IERVHQNFFSHKSLNDLDVRSILAAEQRKILAGCRIVFSRIFPVGEMNPQLHPLWQSAEQ 739 IER+HQNFFSH+SL+++DVR+ILAAEQRKIL GC IVFSR+FPVGE NP LHPLWQ+AEQ Sbjct: 1163 IERLHQNFFSHQSLDEVDVRNILAAEQRKILGGCSIVFSRVFPVGEANPHLHPLWQTAEQ 1222 Query: 738 FGAVCSTQIDEHVTHVVANSLGTDKVNWALNTGRYVVHPGWVEASTLLYRRANEHEFAVK 559 FGAVC+ QIDE VTHVVANSLGTDKVNWAL+TGR+VV+PGWVEAS LLYRRANE +FA+K Sbjct: 1223 
FGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVYPGWVEASALLYRRANERDFAIK 1282 >GAV71470.1 BRCT domain-containing protein/NIF domain-containing protein [Cephalotus follicularis] Length = 1228 Score = 844 bits (2181), Expect = 0.0 Identities = 561/1326 (42%), Positives = 742/1326 (55%), Gaps = 58/1326 (4%) Frame = -1 Query: 4359 VIREESKRMVWTMEDLFKYNQVFPRNFATPMYNYAWKQAVQNRPLRNV----NDDNA--- 4201 V+++ + VWT++DL+K+ + R FA+ + N AW QAVQN+PL ++ DDN+ Sbjct: 41 VVKDSKPKGVWTVQDLYKFGPISGR-FASSLCNLAWAQAVQNKPLNDIFVAEQDDNSKRS 99 Query: 4200 --AAATSVVIEISDEGVVVNDVDSXXXXXXXXXXXXGD-NDTEMVEGTVVESNLNGMPSS 4030 +++ + V D+G V VD+ D + +M EG + E ++ Sbjct: 100 SPSSSVASVNSKEDKGKEVVVVDNHSKDKIYNNKVCIDVSGDDMEEGELEEGEID----L 155 Query: 4029 TTDDKIMNENEEIKSIRQVIQLVINAKNAGKPFGGACGELWTSLDKLQKFVLNNGTSSSV 3850 D ++ + + IRQ ++ V +A NA K F G C +L S + L++ V+++ + + Sbjct: 156 DVDSSEVDLEKRVCVIRQALESV-SAVNAEKSFEGVCLKLQRSFESLRE-VVSDISLVTK 213 Query: 3849 NSLIQQSFAALQAVKSVYCTMNFKEQTQYKDVFSRLLVHVKTQNASLFSSEQMKELEDII 3670 + +Q F A++ V SV+C+M + Q K + SRL+ VK+ + LFS EQ+KE++ + Sbjct: 214 EANVQLLFTAIENVHSVFCSMEDDLKEQNKGILSRLISLVKSHDPPLFSPEQLKEID--V 271 Query: 3669 QSLEKQKEILEKNGANQNDREENPSLGMNRIESGIVSKNPLEEKNGASQNGHGENPRLGT 3490 S++ ++ D+E + M + S ++K+ ++ AS+ Sbjct: 272 MSIK----------GSEKDQEVQINDAMKKKCSDTLAKSADDDLTSASKL---------- 311 Query: 3489 NPIEIGIVGENPPHVLNSSKKSLLEPISVKHNDQGNGKVGSGIFQSGPLSGLKSRGGFGP 3310 P + I+ ++ P++ KS L G + RG Sbjct: 312 -PSAVNILVDDKPNMSQEVVKS-------------------------GLYGFRGRG---- 341 Query: 3309 XXXXXXXXXXXXLPSPTRDAPKPFLIHKPQVQHRDHGMDRFPSPTRETLR-PVQANKPQA 3133 R P L H+DH D PSPTRET + +K A Sbjct: 342 -----------------RGVLVPLLD-----LHKDHDEDSLPSPTRETSHCSIPIHKALA 379 Query: 3132 VESRPVKS-----------DGTEMHPYETDAHKAVSTYQQKFGXXXXXXXXXXXXXXXSE 2986 V +KS + +++HPYETDA KAVSTYQQKFG SE Sbjct: 380 VGDGMIKSGLPTTMVAEDKEDSKLHPYETDAVKAVSTYQQKFGRSSFFMSTRLPSPTPSE 439 Query: 2985 EGNDDEDDSKGEXXXXXXXXXXXXXXXXPLQAASVYAAFQNNGLCRQG------------ 2842 E + + D GE + V + Q + +G Sbjct: 440 
ESGEGDGDIGGEVSSTSNLGGFKPVNHSVVGVPIVSGSPQMDASSMEGLTTTRSPAPVSS 499 Query: 2841 ---TELEINPVVRAQKQSRGRDPRRQNLGPEAGSGDLNLRSAYLEHNPPTSGTLEEIINM 2671 T NP ++ +SR DPR + + + DL R +L HN P + + Sbjct: 500 PAPTVSGSNPTMKPSAKSR--DPRLRYVNSDVSVLDLTQRPLHLVHNAP-----KVELGS 552 Query: 2670 RKNKSVPQSVLDGHTLKRQRNGLTRSTVSGTGGWGEDTSVRPQHTLTNQVTESIGSRDPR 2491 RK K+V +LDG LKRQ++G S SG G + TS Sbjct: 553 RKQKTVEDPILDGPALKRQKSG---SENSGLIGVLKTTS--------------------- 588 Query: 2490 NFGNGGWSEDSVTRLQPTLTNQVGESIRSS----DPRKFGNGEVVFSQRQDNGGRNLTAG 2323 GNGGW ED T+ VG + + DPRK G V S + N+ Sbjct: 589 --GNGGWLED---------TDMVGTQLLNKNVVLDPRKVDVG--VTSPSIVHCNTNV--- 632 Query: 2322 AGGKEQLSLIGNGNMGSLPSTLKDIAVNP-MLINLLL--EHQRL-----QKSNNSPQNLV 2167 G E L + + + SLP+ LKDIAVNP MLIN+L + QRL QKS +S Sbjct: 633 --GNEPLLVTSSSSTASLPALLKDIAVNPTMLINILKMGQQQRLPAEVQQKSTDSLHPPT 690 Query: 2166 TGSSLHGFPGSVPLANIPSSKSLEIDQKHSVKPQVPGQVISTGDSGKTRMKLRDPRLAAR 1987 + S L G+VP N SS I K + Q + D GK RMK RDPR Sbjct: 691 SNSLL----GAVPSVNFASSNPSRILPKPAGTLPTTPQTSAMDDPGKIRMKPRDPRRVLH 746 Query: 1986 MNTCQKNESLGPLEQLKTFGAPSSLTQDSRENLIVRHQSVQAQTNSVPSGA---PDISQQ 1816 N Q++ SLG E+LK PS+ + ++NL + QA+T +PS + PDI++ Sbjct: 747 GNALQRSGSLGS-EKLK-MNVPST-SSFQKDNLNAQKLEGQAETKPMPSLSIPQPDITRL 803 Query: 1815 FTKELKSLADILSASQ----APSVVPLTVSSPIVPIKTDTTEMKTVVTEFKDQESGTVTA 1648 FTK LK++ DI+S SQ +P+V S P IK D ++K +V+ +D +GTV+A Sbjct: 804 FTKNLKNINDIMSVSQPLIGSPNVTQNLESQP-AQIKADRVDVKAIVSNSEDPRTGTVSA 862 Query: 1647 PVERIVQPT--QNMWGDVEHLLEGYDDQERAAIHKERARRMEEQNKMFAARKXXXXXXXX 1474 P Q+ WGDVEHL EGYDDQ++AAI +ERARR+EEQNKMFAA K Sbjct: 863 SEVGAAGPARPQHAWGDVEHLFEGYDDQQKAAIQRERARRLEEQNKMFAAHKLCLVLDLD 922 Query: 1473 XXXLNSAKFVEVDPIHEEVLRKKEEQDREKPHRHLFRFPHMRMWTKLRPGVWNFLEKASK 1294 LNSAKFVEVDP+H+E+LRKKEEQDREK HRHLFRFPHM MWTKLRPG+WNFLE+ASK Sbjct: 923 HTLLNSAKFVEVDPVHDEILRKKEEQDREKLHRHLFRFPHMGMWTKLRPGIWNFLERASK 982 Query: 1293 LYELHLYTMGNKLYATEMAKVLDPSGALFEGRVISKGDEGDPYDGDERLQKIKDLEGVLG 1114 L+ELHLYTMGNKLYATEMAKVLDP G LF GRVIS+GD+GDP+DGDER+ K KDLEGVLG Sbjct: 983 
LFELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLG 1042 Query: 1113 MESNVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLMGPSLLEIDHDERPDEGTLA 934 MES VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL GPSLLEIDHDERP++GTLA Sbjct: 1043 MESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEDGTLA 1102 Query: 933 LSLAVIERVHQNFFSHKSLNDLDVRSILAAEQRKILAGCRIVFSRIFPVGEMNPQLHPLW 754 +L VIER+HQ FFS++ L D+DVR+ILA+EQ+KIL GCRI+FSR+FPVGE NP LHPLW Sbjct: 1103 SALTVIERIHQIFFSYQPLGDVDVRNILASEQQKILDGCRILFSRVFPVGEANPHLHPLW 1162 Query: 753 QSAEQFGAVCSTQIDEHVTHVVANSLGTDKVNWALNTGRYVVHPGWVEASTLLYRRANEH 574 Q+AEQFGAVC+ QIDE VTHVVANSLGTDKVNWAL+TGR+VV+PGWVEAS LLYRRANE Sbjct: 1163 QTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVYPGWVEASALLYRRANEQ 1222 Query: 573 EFAVKI 556 +F +K+ Sbjct: 1223 DFGIKL 1228 >XP_012459417.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 isoform X1 [Gossypium raimondii] KJB77191.1 hypothetical protein B456_012G125200 [Gossypium raimondii] Length = 1272 Score = 843 bits (2177), Expect = 0.0 Identities = 558/1337 (41%), Positives = 734/1337 (54%), Gaps = 75/1337 (5%) Frame = -1 Query: 4344 SKRMVWTMEDLFKYNQVFPRNFATPMYNYAWKQAVQNRPLRNV--------------NDD 4207 S VWTM+DL KY V R +A+ +YN+AW QAVQN+PL ++ N+ Sbjct: 51 SNSRVWTMQDLCKYPSVI-RGYASGLYNFAWAQAVQNKPLNDIFVKELEQQPQQDENNNS 109 Query: 4206 NAAAATSVVIEIS---DEGVVVNDVDSXXXXXXXXXXXXGDN--DTEMVEGTVVESNLNG 4042 ++ +S V ++ ++G N D D + + EG + E ++ Sbjct: 110 KRSSPSSSVASVNSKEEKGYSGNSADRVVIDDDTGDEMEEDKIVNLDKEEGELEEGEID- 168 Query: 4041 MPSSTTDDKIMNENE-----------EIKSIRQVIQLVINAKNAGKPFGGACGELWTSLD 3895 + S +++++ + + IR V++ I A K F C L +L+ Sbjct: 169 LDSEPVKERVLSSEDGNVGISDELEKRVNLIRGVLE-GITVIEAEKSFEVVCSRLQNALE 227 Query: 3894 KLQKFVLNNGTSSSVNSLIQQSFAALQAVKSVYCTMNFKEQTQYKDVFSRLLVHVKTQNA 3715 LQ V G + ++LI+ AL AV S + +N + Q + SRLL VK + Sbjct: 228 SLQGLVFEYGVPTK-DTLIE---LALGAVNSAFVALNSNLKEQNVSILSRLLSVVKGFDP 283 Query: 3714 SLFSSEQMKELEDIIQSLEKQKEILEKNGANQNDREENPSLGMNRIESGIVSKNPLEEKN 3535 LF ++MKE+E ++ SL ++ + +++P + + N L Sbjct: 284 
PLFPLDKMKEIEVMLLSLNSPARAIDSEKEIKIVNKKDPDALAENVGHDLTVTNKL---- 339 Query: 3534 GASQNGHGENPRLGTNPIEIGIVGENPPHVLNSSKKSLLEPISVKHNDQGNGKVGSGIFQ 3355 P+ + N P++L + K Sbjct: 340 ----------------PLSVDSEIHNMPNILTEALKP----------------------- 360 Query: 3354 SGPLSGLKSRGGFGPXXXXXXXXXXXXLPSPTRDAPKPFLIHKPQVQHRDHGMDRFPSPT 3175 + +++G P LPSPTR+ + +P GM R Sbjct: 361 --GVPNFRNKGLSLPLLDLHKDHDADSLPSPTRETTPCLPVLRPLT--TGDGMVRSGFMM 416 Query: 3174 RETLRPVQANKPQAVESRPVKSDGTEMHPYETDAHKAVSTYQQKFGXXXXXXXXXXXXXX 2995 + L + NK MHPYETDA KA S+YQ+KFG Sbjct: 417 AKGLPDAERNK---------------MHPYETDALKAFSSYQRKFGRGSFFSSDRLPSPT 461 Query: 2994 XSEEGNDDEDDSKGE----------XXXXXXXXXXXXXXXXPLQAASVYAAFQNNGLCRQ 2845 SEE D+ D+ GE + +AS ++ Q + Sbjct: 462 PSEESGDEGCDTGGEVSSSSSIGNFKPNLPVMGHPIVSSAPHIDSASSTSSMQGQFTTQN 521 Query: 2844 GTELEINPV--VRAQKQSRGRDPRRQNLGPEAGSGDLNLRSAY-LEHNPPTSGTLEEIIN 2674 T + ++ + ++ ++ RDPR + + DLN R + PP SG I++ Sbjct: 522 ATPVTVSSASNILSKASAKSRDPRLRFANSNVSALDLNQRPLHNASKVPPVSG----IMD 577 Query: 2673 MRKNKSVPQSVLDGHTLKRQRNGLTR------STVSGTGGWGEDT-SVRPQHTLTNQVTE 2515 RK KS + VLDG KRQ+N L VSG GGW EDT + Q T NQ E Sbjct: 578 PRKKKSTEEPVLDGPAPKRQKNELENFGVRDVQAVSGNGGWLEDTDNCESQITNRNQTME 637 Query: 2514 SIGSRDPRNFGNGGWSEDSVTRLQPTLTNQVGESIRSSDPRKFGNGEVVFSQRQDNGGRN 2335 ++ S N E VT TL+ + ++ + Sbjct: 638 TLDS-------NSRKMEHGVT-CSSTLSGKTNTTVNKN---------------------- 667 Query: 2334 LTAGAGGKEQLSLIGNGNMGSLPSTLKDIAVNP-MLINLLL--EHQRL-----QKSNNSP 2179 EQ+ L G N SLP+ LKDIAVNP MLIN+L + QRL QK+ + Sbjct: 668 --------EQVPLTGMSN-PSLPALLKDIAVNPTMLINILKMGQQQRLPSESQQKTPDPL 718 Query: 2178 QNLVTGSSLHGFPGSVPLANIPSSKSLEIDQKHS----VKPQVPGQVISTGDSGKTRMKL 2011 +N + S + G +P AN+ S S+ + S KP Q +S K RMK Sbjct: 719 KNTLYQPSSNPVLGVIPPANVIPSPSVNVVPSSSSGTLSKPAGNLQGPPLDESCKIRMKP 778 Query: 2010 RDPRLAARMNTCQKNESLGPLEQLKTFG-APSSLTQDSRENLIVRHQ---SVQA---QTN 1852 RDPR N QK+ S+GP +QLKT G +P+S TQ S++N+ + Q ++A Q Sbjct: 779 RDPRRVLHGNVLQKSGSVGP-DQLKTNGTSPASSTQGSKDNMNAQKQLENQIEAKPIQCQ 837 Query: 1851 
SVPSGAPDISQQFTKELKSLADILSASQA----PSVVPLTVSSPIVPIKTDTTEMKTVVT 1684 VP PDI+QQFT+ LK++A ++S Q+ P+V VS PI +K++T + T + Sbjct: 838 FVP--PPDIAQQFTQSLKNIAGMMSGPQSFAGLPAVSQNLVSQPI-QVKSETADKNTKGS 894 Query: 1683 EFKDQESGTVTAPVERIV--QPTQNMWGDVEHLLEGYDDQERAAIHKERARRMEEQNKMF 1510 +DQ++GT TAP + P+QN WGDVEHL E YDD+++AAI +ERARR+EEQ KMF Sbjct: 895 NSEDQQTGTGTAPEAGVTCPPPSQNAWGDVEHLFEKYDDRQKAAIQRERARRIEEQKKMF 954 Query: 1509 AARKXXXXXXXXXXXLNSAKFVEVDPIHEEVLRKKEEQDREKPHRHLFRFPHMRMWTKLR 1330 AARK LNSAKF+EVDP+HEE+LRKKEEQDREKP RHLFRF HM MWTKLR Sbjct: 955 AARKLCLVLDLDHTLLNSAKFIEVDPVHEEILRKKEEQDREKPQRHLFRFHHMGMWTKLR 1014 Query: 1329 PGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPSGALFEGRVISKGDEGDPYDGDER 1150 PG+WNFLEKASKLYELHLYTMGNKLYATEMAKVLDP G LF GRVIS+GD+GDP+DGDER Sbjct: 1015 PGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 1074 Query: 1149 LQKIKDLEGVLGMESNVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLMGPSLLEI 970 + + KDLEGVLGMES+VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL+GPSLLEI Sbjct: 1075 VPRSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEI 1134 Query: 969 DHDERPDEGTLALSLAVIERVHQNFFSHKSLNDLDVRSILAAEQRKILAGCRIVFSRIFP 790 DHDERP++GTLA SLAVIER+HQNFFSH++L+DLDVR+ILA EQRKIL+GCRIVFSR+FP Sbjct: 1135 DHDERPEDGTLASSLAVIERIHQNFFSHQNLDDLDVRNILATEQRKILSGCRIVFSRVFP 1194 Query: 789 VGEMNPQLHPLWQSAEQFGAVCSTQIDEHVTHVVANSLGTDKVNWALNTGRYVVHPGWVE 610 VGE NP LHPLWQ+AEQFGAVC+ QIDEHVTHVVANSLGTDKVNWAL+TG++VVHPGWVE Sbjct: 1195 VGEANPHLHPLWQTAEQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVE 1254 Query: 609 ASTLLYRRANEHEFAVK 559 AS LLYRRANEH+FA+K Sbjct: 1255 ASALLYRRANEHDFAIK 1271 >XP_012459418.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 isoform X2 [Gossypium raimondii] Length = 1251 Score = 838 bits (2164), Expect = 0.0 Identities = 563/1338 (42%), Positives = 733/1338 (54%), Gaps = 76/1338 (5%) Frame = -1 Query: 4344 SKRMVWTMEDLFKYNQVFPRNFATPMYNYAWKQAVQNRPLRNV--------------NDD 4207 S VWTM+DL KY V R +A+ +YN+AW QAVQN+PL ++ N+ Sbjct: 51 SNSRVWTMQDLCKYPSVI-RGYASGLYNFAWAQAVQNKPLNDIFVKELEQQPQQDENNNS 
109 Query: 4206 NAAAATSVVIEIS---DEGVVVNDVDSXXXXXXXXXXXXGDN--DTEMVEGTVVESNLNG 4042 ++ +S V ++ ++G N D D + + EG + E ++ Sbjct: 110 KRSSPSSSVASVNSKEEKGYSGNSADRVVIDDDTGDEMEEDKIVNLDKEEGELEEGEID- 168 Query: 4041 MPSSTTDDKIMNENE-----------EIKSIRQVIQLVINAKNAGKPFGGACGELWTSLD 3895 + S +++++ + + IR V++ I A K F C L +L+ Sbjct: 169 LDSEPVKERVLSSEDGNVGISDELEKRVNLIRGVLE-GITVIEAEKSFEVVCSRLQNALE 227 Query: 3894 KLQKFVLNNGTSSSVNSLIQQSFAALQAVKSVYCTMNFKEQTQYKDVFSRLLVHVKTQNA 3715 LQ V G + ++LI+ AL AV S + +N + Q + SRLL VK + Sbjct: 228 SLQGLVFEYGVPTK-DTLIE---LALGAVNSAFVALNSNLKEQNVSILSRLLSVVKGFDP 283 Query: 3714 SLFSSEQMKELEDIIQSLEKQKEILEKNGANQNDREENPSLGMNRIESGIVSKNPLEEKN 3535 LF ++MKE+E ++ SL ++ + E IV+K ++ + Sbjct: 284 PLFPLDKMKEIEVMLLSLNSPARAID-----------------SEKEIKIVNK---KDPD 323 Query: 3534 GASQN-GHGENPRLGTNPIEIGIVGENPPHVLNSSKKSLLEPISVKHNDQGNGKVGSGIF 3358 ++N GH L P V N K L P+ H D + Sbjct: 324 ALAENVGHDLTEAL-------------KPGVPNFRNKGLSLPLLDLHKDHDADSL----- 365 Query: 3357 QSGPLSGLKSRGGFGPXXXXXXXXXXXXLPSPTRDAPKPFLIHKPQVQHRDHGMDRFPSP 3178 PSPTR+ + +P GM R Sbjct: 366 -----------------------------PSPTRETTPCLPVLRPLT--TGDGMVRSGFM 394 Query: 3177 TRETLRPVQANKPQAVESRPVKSDGTEMHPYETDAHKAVSTYQQKFGXXXXXXXXXXXXX 2998 + L + NK MHPYETDA KA S+YQ+KFG Sbjct: 395 MAKGLPDAERNK---------------MHPYETDALKAFSSYQRKFGRGSFFSSDRLPSP 439 Query: 2997 XXSEEGNDDEDDSKGE----------XXXXXXXXXXXXXXXXPLQAASVYAAFQNNGLCR 2848 SEE D+ D+ GE + +AS ++ Q + Sbjct: 440 TPSEESGDEGCDTGGEVSSSSSIGNFKPNLPVMGHPIVSSAPHIDSASSTSSMQGQFTTQ 499 Query: 2847 QGTELEINPV--VRAQKQSRGRDPRRQNLGPEAGSGDLNLRSAY-LEHNPPTSGTLEEII 2677 T + ++ + ++ ++ RDPR + + DLN R + PP SG I+ Sbjct: 500 NATPVTVSSASNILSKASAKSRDPRLRFANSNVSALDLNQRPLHNASKVPPVSG----IM 555 Query: 2676 NMRKNKSVPQSVLDGHTLKRQRNGLTR------STVSGTGGWGEDT-SVRPQHTLTNQVT 2518 + RK KS + VLDG KRQ+N L VSG GGW EDT + Q T NQ Sbjct: 556 DPRKKKSTEEPVLDGPAPKRQKNELENFGVRDVQAVSGNGGWLEDTDNCESQITNRNQTM 615 Query: 2517 ESIGSRDPRNFGNGGWSEDSVTRLQPTLTNQVGESIRSSDPRKFGNGEVVFSQRQDNGGR 2338 E++ S N E VT TL+ + ++ + Sbjct: 616 
ETLDS-------NSRKMEHGVT-CSSTLSGKTNTTVNKN--------------------- 646 Query: 2337 NLTAGAGGKEQLSLIGNGNMGSLPSTLKDIAVNP-MLINLLL--EHQRL-----QKSNNS 2182 EQ+ L G N SLP+ LKDIAVNP MLIN+L + QRL QK+ + Sbjct: 647 ---------EQVPLTGMSN-PSLPALLKDIAVNPTMLINILKMGQQQRLPSESQQKTPDP 696 Query: 2181 PQNLVTGSSLHGFPGSVPLANIPSSKSLEIDQKHS----VKPQVPGQVISTGDSGKTRMK 2014 +N + S + G +P AN+ S S+ + S KP Q +S K RMK Sbjct: 697 LKNTLYQPSSNPVLGVIPPANVIPSPSVNVVPSSSSGTLSKPAGNLQGPPLDESCKIRMK 756 Query: 2013 LRDPRLAARMNTCQKNESLGPLEQLKTFG-APSSLTQDSRENLIVRHQ---SVQA---QT 1855 RDPR N QK+ S+GP +QLKT G +P+S TQ S++N+ + Q ++A Q Sbjct: 757 PRDPRRVLHGNVLQKSGSVGP-DQLKTNGTSPASSTQGSKDNMNAQKQLENQIEAKPIQC 815 Query: 1854 NSVPSGAPDISQQFTKELKSLADILSASQA----PSVVPLTVSSPIVPIKTDTTEMKTVV 1687 VP PDI+QQFT+ LK++A ++S Q+ P+V VS PI +K++T + T Sbjct: 816 QFVP--PPDIAQQFTQSLKNIAGMMSGPQSFAGLPAVSQNLVSQPI-QVKSETADKNTKG 872 Query: 1686 TEFKDQESGTVTAPVERIV--QPTQNMWGDVEHLLEGYDDQERAAIHKERARRMEEQNKM 1513 + +DQ++GT TAP + P+QN WGDVEHL E YDD+++AAI +ERARR+EEQ KM Sbjct: 873 SNSEDQQTGTGTAPEAGVTCPPPSQNAWGDVEHLFEKYDDRQKAAIQRERARRIEEQKKM 932 Query: 1512 FAARKXXXXXXXXXXXLNSAKFVEVDPIHEEVLRKKEEQDREKPHRHLFRFPHMRMWTKL 1333 FAARK LNSAKF+EVDP+HEE+LRKKEEQDREKP RHLFRF HM MWTKL Sbjct: 933 FAARKLCLVLDLDHTLLNSAKFIEVDPVHEEILRKKEEQDREKPQRHLFRFHHMGMWTKL 992 Query: 1332 RPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPSGALFEGRVISKGDEGDPYDGDE 1153 RPG+WNFLEKASKLYELHLYTMGNKLYATEMAKVLDP G LF GRVIS+GD+GDP+DGDE Sbjct: 993 RPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 1052 Query: 1152 RLQKIKDLEGVLGMESNVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLMGPSLLE 973 R+ + KDLEGVLGMES+VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL+GPSLLE Sbjct: 1053 RVPRSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLE 1112 Query: 972 IDHDERPDEGTLALSLAVIERVHQNFFSHKSLNDLDVRSILAAEQRKILAGCRIVFSRIF 793 IDHDERP++GTLA SLAVIER+HQNFFSH++L+DLDVR+ILA EQRKIL+GCRIVFSR+F Sbjct: 1113 IDHDERPEDGTLASSLAVIERIHQNFFSHQNLDDLDVRNILATEQRKILSGCRIVFSRVF 1172 Query: 792 PVGEMNPQLHPLWQSAEQFGAVCSTQIDEHVTHVVANSLGTDKVNWALNTGRYVVHPGWV 613 
PVGE NP LHPLWQ+AEQFGAVC+ QIDEHVTHVVANSLGTDKVNWAL+TG++VVHPGWV Sbjct: 1173 PVGEANPHLHPLWQTAEQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWV 1232 Query: 612 EASTLLYRRANEHEFAVK 559 EAS LLYRRANEH+FA+K Sbjct: 1233 EASALLYRRANEHDFAIK 1250 >XP_017615720.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 [Gossypium arboreum] Length = 1272 Score = 837 bits (2161), Expect = 0.0 Identities = 564/1342 (42%), Positives = 738/1342 (54%), Gaps = 80/1342 (5%) Frame = -1 Query: 4344 SKRMVWTMEDLFKYNQVFPRNFATPMYNYAWKQAVQNRPLRNV--------------NDD 4207 S VWTM+DL KY V R +A+ +YN+AW QAVQN+PL ++ N+ Sbjct: 51 SNSRVWTMQDLCKYPSVI-RGYASGLYNFAWAQAVQNKPLNDIFVKELEQQPQQDENNNS 109 Query: 4206 NAAAATSVVIEIS---DEGVVVNDVDSXXXXXXXXXXXXGDN--DTEMVEGTVVESNLNG 4042 ++ +S V ++ ++G N D D + + EG + E ++ Sbjct: 110 KRSSPSSSVASVNSKEEKGYSGNSADRVVIDDDTGDEMEEDKIVNLDKEEGELEEGEID- 168 Query: 4041 MPSSTTDDKIMNENE-----------EIKSIRQVIQLVINAKNAGKPFGGACGELWTSLD 3895 + S +++++ + + IR V++ I A K F C L +L+ Sbjct: 169 LDSEPVKERVLSSEDGNVGISDELEKRVNLIRGVLE-GITVIEAEKSFEVVCSRLQNALE 227 Query: 3894 KLQKFVLNNGTSSSVNSLIQQSFAALQAVKSVYCTMNFKEQTQYKDVFSRLLVHVKTQNA 3715 L+ V G + ++LI+ +F AV S + +N + Q + SRLL VK + Sbjct: 228 SLRGLVFEYGVPTK-DTLIELAFG---AVNSAFVALNSNLKEQNVSILSRLLSVVKGFDP 283 Query: 3714 SLFSSEQMKELEDIIQSL-------EKQKEILEKNGANQNDREENPSLGMNRIESGIVSK 3556 LF ++MKE+E ++ SL + +KEI N + + EN + + +K Sbjct: 284 PLFPLDKMKEIEVMLLSLNSPVRAIDSEKEIKIVNKKDPDALAENVGHDLT-----VTNK 338 Query: 3555 NPLEEKNGASQNGHGENPRLGTNPIEIGIVGENPPHVLNSSKKSLLEPISVKHNDQGNGK 3376 PL + P + T ++ P V N K L P+ H D Sbjct: 339 LPLSVDSEIH-----NMPSMLTEALK--------PGVPNFRNKGLSLPLLDLHKDHDADS 385 Query: 3375 VGSGIFQSGPLSGLKSRGGFGPXXXXXXXXXXXXLPSPTRDAPKPFLIHKPQVQHRDHGM 3196 + PSPTR+ + +P GM Sbjct: 386 L----------------------------------PSPTRETTPCLPVLRPLT--TGDGM 409 Query: 3195 DRFPSPTRETLRPVQANKPQAVESRPVKSDGTEMHPYETDAHKAVSTYQQKFGXXXXXXX 3016 R S + L + NK MHPYETDA KA S+YQ+KFG Sbjct: 410 VRSGSMMAKGLPDEERNK---------------MHPYETDALKAFSSYQRKFGRGSFFSS 454 Query: 3015 
XXXXXXXXSEEGNDDEDDSKGE----------XXXXXXXXXXXXXXXXPLQAASVYAAFQ 2866 SEE D+ D+ GE + +AS+ ++ Q Sbjct: 455 DRLPSPTPSEESGDEGCDTGGEVSSSSSIGNFKPNLPVMGHPIVSSPPHIDSASLTSSMQ 514 Query: 2865 NNGLCRQGTELEINPV--VRAQKQSRGRDPRRQNLGPEAGSGDLNLRSAYLEHNPPTSGT 2692 + T + ++ + ++ ++ RDPR + + DLN R HN Sbjct: 515 GQFTTQNATPVTVSSASSILSKASAKSRDPRLRFANSNVSALDLNQRPL---HNASKVPP 571 Query: 2691 LEEIINMRKNKSVPQSVLDGHTLKRQRNGLTR------STVSGTGGWGEDTSVRPQHTLT 2530 + I++ RK KS + VLDG KRQ+N L VSG GGW ED Sbjct: 572 VSVIMDPRKKKSTEEPVLDGPAPKRQKNELENFGVRDVQAVSGNGGWLED---------- 621 Query: 2529 NQVTESIGSRDPRNFGNGGWSEDSVTRLQPTLTNQVGESIRSSDPRKFGNGEVVFSQRQD 2350 T++ GS Q T NQ E++ S+ RK +G S Sbjct: 622 ---TDNCGS-------------------QITNRNQTMETL-DSNSRKMEHGVTCSSTL-- 656 Query: 2349 NGGRNLTAGAGGKEQLSLIGNGNMGSLPSTLKDIAVNP-MLINLLL--EHQRL-----QK 2194 +G N T EQ+ L G N SLP+ LKDIAVNP MLIN+L + QRL K Sbjct: 657 SGKTNTTVNK--NEQVPLTGMSN-PSLPALLKDIAVNPTMLINILKMGQQQRLPSESQHK 713 Query: 2193 SNNSPQNLVTGSSLHGFPGSVPLANIPSSKSLEIDQKHS----VKPQVPGQVISTGDSGK 2026 + ++ +N + S + G VP N+ S S+ + S KP Q +SGK Sbjct: 714 TPDALKNTLYQPSSNPVLGVVPPGNVIPSPSVNVVPSTSSGTLSKPAGNLQGPPLDESGK 773 Query: 2025 TRMKLRDPRLAARMNTCQKNESLGPLEQLKTFG-APSSLTQDSRENLIVRHQ---SVQA- 1861 RMK RDPR N QK S+GP +QLKT G +P+S T S++N+ + Q ++A Sbjct: 774 IRMKPRDPRRVLHGNVLQKTSSVGP-DQLKTNGTSPASSTLGSKDNMNAQKQLENQIEAK 832 Query: 1860 --QTNSVPSGAPDISQQFTKELKSLADILSASQA----PSVVPLTVSSPIVPIKTDTTEM 1699 Q VP PDI+QQFT+ LK++A ++S Q+ P+V VS PI +K++TT+ Sbjct: 833 PIQCQLVP--PPDITQQFTQSLKNIAGMMSGPQSFASLPAVSQNLVSQPI-QVKSETTDK 889 Query: 1698 KTVVTEFKDQESGTVTAPVERIV--QPTQNMWGDVEHLLEGYDDQERAAIHKERARRMEE 1525 T + +DQ++GT TAP + P+QN WGDVEHL E YDD+++AAI +ERARR+EE Sbjct: 890 NTKGSNCEDQQTGTGTAPEVGVTCPPPSQNAWGDVEHLFEKYDDRQKAAIQRERARRIEE 949 Query: 1524 QNKMFAARKXXXXXXXXXXXLNSAKFVEVDPIHEEVLRKKEEQDREKPHRHLFRFPHMRM 1345 Q KMFAARK LNSAKF+EVDP+HEE+LRKKEEQDREKP RHLFRF HM M Sbjct: 950 QKKMFAARKLCLVLDLDHTLLNSAKFIEVDPVHEEILRKKEEQDREKPQRHLFRFHHMGM 1009 Query: 1344 
WTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPSGALFEGRVISKGDEGDPY 1165 WTKLRPG+WNFLEKASKLYELHLYTMGNKLYATEMAKVLDP G LF GRVIS+GD+GDP+ Sbjct: 1010 WTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1069 Query: 1164 DGDERLQKIKDLEGVLGMESNVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLMGP 985 DGDER+ + KDLEGVLGMES+VVIIDDS+RVWPHNKLNLIVVERYTYFP SRRQFGL+GP Sbjct: 1070 DGDERVPRSKDLEGVLGMESSVVIIDDSMRVWPHNKLNLIVVERYTYFPFSRRQFGLLGP 1129 Query: 984 SLLEIDHDERPDEGTLALSLAVIERVHQNFFSHKSLNDLDVRSILAAEQRKILAGCRIVF 805 SLLEIDHDERP++GTLA SLAVIER+HQNFFSH++L+DLDVR+ILA EQRKIL+GCRIVF Sbjct: 1130 SLLEIDHDERPEDGTLASSLAVIERIHQNFFSHQNLDDLDVRNILATEQRKILSGCRIVF 1189 Query: 804 SRIFPVGEMNPQLHPLWQSAEQFGAVCSTQIDEHVTHVVANSLGTDKVNWALNTGRYVVH 625 SR+FPVGE NP LHPLWQ+AEQFGAVC+ QIDEHVTHVVANSLGTDKVNWAL+TG++VVH Sbjct: 1190 SRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVH 1249 Query: 624 PGWVEASTLLYRRANEHEFAVK 559 PGWVEAS LLYRRANEH+FA+K Sbjct: 1250 PGWVEASALLYRRANEHDFAIK 1271 >XP_012088736.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 [Jatropha curcas] KDP23276.1 hypothetical protein JCGZ_23109 [Jatropha curcas] Length = 1283 Score = 833 bits (2152), Expect = 0.0 Identities = 567/1338 (42%), Positives = 729/1338 (54%), Gaps = 68/1338 (5%) Frame = -1 Query: 4368 TKPVIREESKRMVWTMEDLFKYNQVFPRNFATPMYNYAWKQAVQNRPLRNV--------N 4213 +KP + S WTM+DL+KY + + +YN AW QAVQN+PL ++ N Sbjct: 66 SKPKENDGSSGRFWTMKDLYKYQM--GGGYVSGLYNLAWAQAVQNKPLNDLFVEVEPDEN 123 Query: 4212 DDNAAAATSVVIEISD------------EGVVVND--------VDSXXXXXXXXXXXXGD 4093 ++ ++SV S+ E VV++D + D Sbjct: 124 SKRSSPSSSVASVNSNSNSNKEEEKKKVEKVVIDDSGDEMDVKIVDFEKEEGELEEGEID 183 Query: 4092 NDTEMVEGTVVESN---LNG----MPSSTTDDKIMNENEEIKSIRQVIQLVINAKNAGKP 3934 D++ E + E LN + S T K + +++K IR+ ++ + + K Sbjct: 184 LDSDPAEKAIDEGKERFLNNDEMDIDVSETKSKDKDLEKKVKFIREALE-ALTVTESNKS 242 Query: 3933 FGGACGELWTSLDKLQKFVLNNGTSSSVNSLIQQSFAALQAVKSVYCTMNFKEQTQYKDV 3754 F AC L +L L++ + N + N L+Q S A+Q+V SV+ +MN K + Q KD Sbjct: 243 
FETACSMLGNTLKSLREVIGKNNIPTKDN-LLQLSSNAVQSVNSVFTSMNHKLREQNKDS 301 Query: 3753 FSRLLVHVKTQNASLFSSEQMKELEDIIQSLEKQKEILEKNGANQNDREENPSLGMNRIE 3574 FSR L V + SL S E L K+ E++ + ++ + +E SL + Sbjct: 302 FSRFLSVVNSHVPSLLSPE-----------LIKEIEVMTSSLSSISGEKEKESLIFS--- 347 Query: 3573 SGIVSKNPLEEKNGASQNGHGENPRLGTNPIEIGIVGENPPHVLNSSKKSLLEPISVKHN 3394 + ++ A +GH + G N P++ SL P Sbjct: 348 ----DEGNKKDDMSAKSSGHSLTTAKKLSSFA-GSFASNKPNM------SLEAP------ 390 Query: 3393 DQGNGKVGSGIFQSGPLSGLKSRGGFGPXXXXXXXXXXXXLPSPTRDAPKPFLIHKPQVQ 3214 K+G F KSR G P Sbjct: 391 -----KMGVSTF--------KSRAGLLPLLDL---------------------------- 409 Query: 3213 HRDHGMDRFPSPTRETLRPVQANKPQAVESRPVKSDGTEMHPYETDAHKAVSTYQQKFGX 3034 H+DH D PSPTRE P+ + + + ++ T+MHPYETDA KAVS+YQQKF Sbjct: 410 HKDHDADSLPSPTREAAPPLPVRRVSTPKVA-LDNEDTKMHPYETDALKAVSSYQQKFNR 468 Query: 3033 XXXXXXXXXXXXXXSEEGNDDEDDSKGEXXXXXXXXXXXXXXXXPLQAASVYAAFQNNGL 2854 SEE + + D GE + V + Sbjct: 469 SSFAVNDRLPSPTPSEESGNGDGDVGGEVSSSSAVGQFRPANPPNSGQSIVSTSPHPESS 528 Query: 2853 CRQGTELEIN--PV-----VRAQKQSRGRDPRRQNLGPEAGSGDLNLRSAYLEHNPPTSG 2695 QG N PV + + ++ RDPR + + +A + D N L +N P Sbjct: 529 NMQGVVPAKNAGPVSSGSSLTVKASAKSRDPRLRFVNSDANALDQN-HVLPLVNNTPKVE 587 Query: 2694 TLEEIINMRKNKSVPQSVLDGHTLKRQRNGLTRS-------TVSGTGGWGEDTS-VRPQH 2539 L +N++K KSV SVLDG +LKRQRN L S T+ +GGW EDT VRPQ Sbjct: 588 YLGGPMNLKKQKSVDDSVLDGPSLKRQRNVLEHSGGVGNVKTMIASGGWLEDTDMVRPQT 647 Query: 2538 TLTNQVTESIGSRDPRNFGNGGWSEDSVTRLQPTLTNQVGESIRSSDPRKFGNGEVVFSQ 2359 NQ+ E + DPR NG +V+ + + GN Q Sbjct: 648 MNRNQLVE---NSDPRRMDNGVACPSTVSGISSVSIS--------------GN-----EQ 685 Query: 2358 RQDNGGRNLTAGAGGKEQLSLIGNGNMGSLPSTLKDIAVNP-MLINLLLEHQRLQKSNNS 2182 + G +T G EQ+ + G SLP LK+IAVNP ML+NLL Q+ + + ++ Sbjct: 686 KPVIGTGAITEG----EQIQMTGTSE-ASLPDLLKNIAVNPTMLLNLLKMGQQQRSAIDA 740 Query: 2181 PQNLVTGSSLHGFP-------GSVPLANIPSSKSLEIDQKHSVKP------QVPGQVIST 2041 Q + P GSVP+ N+ + + SV P QVP Q + Sbjct: 741 QQKPSDPAKTSKHPLNANAILGSVPVVNV-------VPPQPSVMPRPAGTLQVPPQA-AV 792 Query: 2040 
GDSGKTRMKLRDPRLAARMNTCQKNESLGPLEQLKTFGAPSSLTQDSRENLIVRHQSVQA 1861 + GK RMK RDPR T QKN ++G EQ KT Q +++N IV+ Q QA Sbjct: 793 EELGKIRMKPRDPRRVLHYQTLQKNGNMG-YEQFKTNLTSPPTDQGTKDNQIVQKQDGQA 851 Query: 1860 QTNSVPSGA---PDISQQFTKELKSLADILSASQAPSVVPLTVSSPIVPIKTDTTEMKTV 1690 +T VP + PDIS FTK LK++ADI+S S A S P VS + T +T+ Sbjct: 852 ETEPVPLQSLVVPDISLPFTKSLKNIADIVSVSHA-STSPTVVSQNLASQPT-----RTI 905 Query: 1689 VTEFKDQESGTVTAPVERIVQP-TQNMWGDVEHLLEGYDDQERAAIHKERARRMEEQNKM 1513 V+ +Q +G +AP V P Q+ WGDVEHL EGY DQ++AAI +ERARR+EEQ KM Sbjct: 906 VSN-SEQPAGIGSAPCVAPVGPRPQDAWGDVEHLFEGYSDQQKAAIQRERARRIEEQKKM 964 Query: 1512 FAARKXXXXXXXXXXXLNSAKFVEVDPIHEEVLRKKEEQDREKPHRHLFRFPHMRMWTKL 1333 FAARK LNSAKFVEVDP+H+E+LRKKEEQDREKP+RHLFRFPHM MWTKL Sbjct: 965 FAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPYRHLFRFPHMGMWTKL 1024 Query: 1332 RPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPSGALFEGRVISKGDEGDPYDGDE 1153 RPG+WNFLEKASKLYELHLYTMGNKLYATEMAKVLDP+G LF GRVIS+GD+ D +D DE Sbjct: 1025 RPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGVLFNGRVISRGDDTDSFDSDE 1084 Query: 1152 RLQKIKDLEGVLGMESNVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLMGPSLLE 973 R+ K KDLEGVLGMES VVIIDDSVRVWPHNKLNLIVVERY YFPCSRRQFGL GPSLLE Sbjct: 1085 RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLE 1144 Query: 972 IDHDERPDEGTLALSLAVIERVHQNFFSHKSLNDLDVRSILAAEQRKILAGCRIVFSRIF 793 IDHDERP++GTLA SLAVIE++HQ+FF+H SL+D DVR+ILA+EQRKILAGCRIVFSR+F Sbjct: 1145 IDHDERPEDGTLACSLAVIEKIHQHFFTHPSLDDADVRNILASEQRKILAGCRIVFSRVF 1204 Query: 792 PVGEMNPQLHPLWQSAEQFGAVCSTQIDEHVTHVVANSLGTDKVNWALNTGRYVVHPGWV 613 PVGE NP LHPLWQ+AEQFGAVC+ QIDE VTHVVANSLGTDKVNWAL+TGR+VV+PGWV Sbjct: 1205 PVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVYPGWV 1264 Query: 612 EASTLLYRRANEHEFAVK 559 EAS LLYRRANE +FA+K Sbjct: 1265 EASALLYRRANEQDFAIK 1282 >XP_016680068.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 [Gossypium hirsutum] Length = 1272 Score = 832 bits (2149), Expect = 0.0 Identities = 550/1337 (41%), Positives = 728/1337 (54%), Gaps = 75/1337 (5%) Frame = -1 
Query: 4344 SKRMVWTMEDLFKYNQVFPRNFATPMYNYAWKQAVQNRPLRNV--------------NDD 4207 S VWTM+DL KY V R +A+ +YN+AW QAVQN+PL ++ N+ Sbjct: 51 SNSRVWTMQDLCKYPSVI-RGYASGLYNFAWAQAVQNKPLNDIFVKELEQQPQQDENNNS 109 Query: 4206 NAAAATSVVIEIS---DEGVVVNDVDSXXXXXXXXXXXXGDN--DTEMVEGTVVESNLNG 4042 ++ +S V ++ ++G N D D + + EG + E ++ Sbjct: 110 KRSSPSSSVASVNSKEEKGYSGNSADRVVIDDDTGDEMEEDKIVNLDKEEGELEEGEID- 168 Query: 4041 MPSSTTDDKIMNENE-----------EIKSIRQVIQLVINAKNAGKPFGGACGELWTSLD 3895 + S +++++ + + IR V++ I A K F C L +L+ Sbjct: 169 LDSEPVKERVLSSEDGNVGISDELEKRVNLIRGVLE-GITVIEAEKSFEVVCSRLQNALE 227 Query: 3894 KLQKFVLNNGTSSSVNSLIQQSFAALQAVKSVYCTMNFKEQTQYKDVFSRLLVHVKTQNA 3715 L+ V G + ++LI+ +F AV S + + + Q + SRLL VK + Sbjct: 228 SLRGLVFEYGVPTK-DTLIELAFG---AVNSAFVALKCNLKEQNVSILSRLLSVVKGFDP 283 Query: 3714 SLFSSEQMKELEDIIQSLEKQKEILEKNGANQNDREENPSLGMNRIESGIVSKNPLEEKN 3535 LF ++MKE+E ++ SL ++ + +++P + + N L Sbjct: 284 PLFPLDKMKEIEVMLLSLNSPARAIDSEKEIKIVNKKDPDALAENVGHDLTVTNKL---- 339 Query: 3534 GASQNGHGENPRLGTNPIEIGIVGENPPHVLNSSKKSLLEPISVKHNDQGNGKVGSGIFQ 3355 P+ + N P++L + K Sbjct: 340 ----------------PLSVDSEIHNMPNMLTEALKP----------------------- 360 Query: 3354 SGPLSGLKSRGGFGPXXXXXXXXXXXXLPSPTRDAPKPFLIHKPQVQHRDHGMDRFPSPT 3175 + +++G P LPSPTR+ + +P GM R Sbjct: 361 --GIPNFRNKGLSLPLLDLHKDHDADSLPSPTRETTPCLPVLRPLT--TGDGMVRSGFMM 416 Query: 3174 RETLRPVQANKPQAVESRPVKSDGTEMHPYETDAHKAVSTYQQKFGXXXXXXXXXXXXXX 2995 + L + NK MHPYETDA KA S+YQ+KFG Sbjct: 417 AKGLPDEEHNK---------------MHPYETDALKAFSSYQRKFGRGSFFSSDRLPSPT 461 Query: 2994 XSEEGNDDEDDSKGE----------XXXXXXXXXXXXXXXXPLQAASVYAAFQNNGLCRQ 2845 SEE D+ D+ GE + +AS ++ Q + Sbjct: 462 PSEESGDEGCDTGGEVSSSSSIGNFKPNLPVMGHPIVSSAPHIDSASSTSSMQGQFTTQN 521 Query: 2844 GTELEINPV--VRAQKQSRGRDPRRQNLGPEAGSGDLNLRSAY-LEHNPPTSGTLEEIIN 2674 T + ++ + ++ ++ RDPR + + DLN R + PP SG I++ Sbjct: 522 ATPVTVSSASNILSKASAKSRDPRLRFANSNVSALDLNQRPLHNASKVPPVSG----IMD 577 Query: 2673 MRKNKSVPQSVLDGHTLKRQRNGLTR------STVSGTGGWGEDT-SVRPQHTLTNQVTE 2515 RK KS + VLDG KRQ+N L VSG GGW EDT + Q T NQ E 
Sbjct: 578 PRKKKSTEEPVLDGPAPKRQKNELENLGVRDVQAVSGNGGWLEDTDNCESQITNRNQTME 637 Query: 2514 SIGSRDPRNFGNGGWSEDSVTRLQPTLTNQVGESIRSSDPRKFGNGEVVFSQRQDNGGRN 2335 ++ S W + TL+ + + + Sbjct: 638 TLDS--------NSWKMEHGVTCSSTLSGKANTIVNKN---------------------- 667 Query: 2334 LTAGAGGKEQLSLIGNGNMGSLPSTLKDIAVNP-MLINLLL--EHQRL-----QKSNNSP 2179 EQ+ L G N SLP+ LKDIAVNP MLIN+L + QRL QK+ + Sbjct: 668 --------EQVPLTGMSN-PSLPALLKDIAVNPTMLINILKMGQQQRLPSESQQKTPDPL 718 Query: 2178 QNLVTGSSLHGFPGSVPLANIPSSKSLEIDQKHS----VKPQVPGQVISTGDSGKTRMKL 2011 +N + S + G +P AN+ S S+ + S KP Q +S K RMK Sbjct: 719 KNTLYQPSSNPVLGVIPPANVIPSPSVNVVPSSSSGTLSKPAGNLQGPPLDESCKIRMKP 778 Query: 2010 RDPRLAARMNTCQKNESLGPLEQLKTFG-APSSLTQDSRENLIVRHQ---SVQA---QTN 1852 RDPR N QK+ S+GP +QLKT G +P+S TQ S++N+ + Q ++A Q Sbjct: 779 RDPRRVLHGNVLQKSGSVGP-DQLKTNGTSPASSTQGSKDNMNAQKQLENQIEAKPIQCQ 837 Query: 1851 SVPSGAPDISQQFTKELKSLADILSASQA----PSVVPLTVSSPIVPIKTDTTEMKTVVT 1684 VP PDI+QQFT+ LK++A ++S Q+ P+V VS PI +K++TT+ T + Sbjct: 838 FVP--PPDIAQQFTQSLKNIAGMMSGPQSFAGLPAVSQNLVSQPI-QVKSETTDKNTKGS 894 Query: 1683 EFKDQESGTVTAPVERIV--QPTQNMWGDVEHLLEGYDDQERAAIHKERARRMEEQNKMF 1510 +DQ++GT TAP + P+QN WGDVEHL E YDD+++AAI +ERARR+EEQ KM Sbjct: 895 NSEDQQTGTGTAPEAGVTCPPPSQNAWGDVEHLFEKYDDRQKAAIQRERARRIEEQKKMI 954 Query: 1509 AARKXXXXXXXXXXXLNSAKFVEVDPIHEEVLRKKEEQDREKPHRHLFRFPHMRMWTKLR 1330 AARK LNSAKF+EVDP+HEE+LRKKEEQDREKP RHLFRF HM MWTKLR Sbjct: 955 AARKLCLVLDLDHTLLNSAKFIEVDPVHEEILRKKEEQDREKPQRHLFRFHHMGMWTKLR 1014 Query: 1329 PGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPSGALFEGRVISKGDEGDPYDGDER 1150 PG+WNFLEKASKLYELHLYTMGNKLYATEMAKVLDP G LF GRVIS+GD+GDP+DGDER Sbjct: 1015 PGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 1074 Query: 1149 LQKIKDLEGVLGMESNVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLMGPSLLEI 970 + + KDLEGVLGMES+VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ GL+GPSLLEI Sbjct: 1075 VPRSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQCGLLGPSLLEI 1134 Query: 969 DHDERPDEGTLALSLAVIERVHQNFFSHKSLNDLDVRSILAAEQRKILAGCRIVFSRIFP 790 DHDERP++GTLA 
SLAVIER+HQNFFSH++L+DLDVR+ILA EQRKIL+GCRIVFSR+FP Sbjct: 1135 DHDERPEDGTLASSLAVIERIHQNFFSHQNLDDLDVRNILATEQRKILSGCRIVFSRVFP 1194 Query: 789 VGEMNPQLHPLWQSAEQFGAVCSTQIDEHVTHVVANSLGTDKVNWALNTGRYVVHPGWVE 610 VGE NP LHPLWQ+AEQFGAVC+ QIDEHVTHVVANSLGTDKVNWA +TG++VVHPGWVE Sbjct: 1195 VGEANPHLHPLWQTAEQFGAVCTNQIDEHVTHVVANSLGTDKVNWAQSTGKFVVHPGWVE 1254 Query: 609 ASTLLYRRANEHEFAVK 559 AS LLYRRANEH+FA+K Sbjct: 1255 ASALLYRRANEHDFAIK 1271 >XP_008791049.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 [Phoenix dactylifera] Length = 1269 Score = 828 bits (2140), Expect = 0.0 Identities = 547/1307 (41%), Positives = 715/1307 (54%), Gaps = 64/1307 (4%) Frame = -1 Query: 4287 RNFATPMYNYAWKQAVQNRPL----RNVNDDNAAAATSVVIEISDEGVVVNDVDSXXXXX 4120 RN+A +Y++AW QAVQN+PL + V + A ++ + +E V D Sbjct: 76 RNYAPNLYSFAWAQAVQNKPLGLDLKPVGSADPPAKSAGGKPVKEEAYNVVDSSEESGGG 135 Query: 4119 XXXXXXXGDND-----TEMVEGTVVE--SNLNGMPSSTTDDKIMN--ENEEIKSIRQVIQ 3967 + +E V G +++ S+ S + + K++ E EEI + + Sbjct: 136 TEKEEGELEEGEIGFGSEPVGGEIIDLSSDKQEDGSESEEKKLLGGKETEEIGEFDRRVS 195 Query: 3966 LV------INAKNAGKPFGGACGELWTSLDKLQK-FVLNNGTSSSVNSLIQQSFAALQAV 3808 L+ + + A K F G C L S + L+ F +++L+QQ+F ++ V Sbjct: 196 LILEELETVTEEEAEKSFDGVCLRLRQSFEMLKPMFAETESPVPVLDALVQQAFEGIKTV 255 Query: 3807 KSVYCTMNFKEQTQYKDVFSRLLVHVKTQNASLFSSEQMKELEDIIQSLEKQKEILEKNG 3628 SV + N K++ Q KD RLL+H+K Q +++ S EQ+KE++ +QSL + E + Sbjct: 256 HSVLRSENLKKKEQNKDFLLRLLIHIKNQYSNILSPEQVKEIDTRVQSL-----VFEDDS 310 Query: 3627 ANQNDREENPSLGMNRIESGIVSKNPLEEKNGASQNGHGENPRLGTNPIEIGIVGENPPH 3448 ++ N K L EK S G+V H Sbjct: 311 NKESKLYAGSGTNTN-------DKTHLPEKPDISS---------------FGLVSSGNSH 348 Query: 3447 VLNSSKKSLLEPISVKHNDQGNGKVGSGIFQSGPLSGLKSRGGFGPXXXXXXXXXXXXLP 3268 V++S I K+ G K+ + G + Sbjct: 349 VVSS--------IGSKNVQAGLPKLDTPTISRGRIV------------------------ 376 Query: 3267 SPTRDAPKPFLIHKPQVQHRDHGMDRFPSPTRETLRPVQANKP-----------QAVESR 3121 SP D H ++ + PSPTRE P+ +KP + + ++ Sbjct: 377 SPLLDL------------HAEYDEESLPSPTRENAPPLPIHKPIGFGTGTVVFTEPITTK 424 Query: 
3120 PVKSDGTEMHPYETDAHKAVSTYQQKFGXXXXXXXXXXXXXXXSEEGNDDEDDSKGEXXX 2941 V+++ HPY TDA KAVS+YQQK+ E DD+DD+ E Sbjct: 425 NVEAEDDTPHPYITDAFKAVSSYQQKY-----FFTSNKLPSPTPSEECDDKDDAHDEVSS 479 Query: 2940 XXXXXXXXXXXXXP-LQAASVYAAFQNNGLCRQGTELEI--------NPVVRAQKQSRGR 2788 +Q A+ AA ++ Q ++ NP +R +SR Sbjct: 480 SSANGNAGCVNTTSEIQVATNSAACTDSSSRHQPGPVKPVGQLGSAPNPAIRPALKSR-- 537 Query: 2787 DPRRQNLGPEAGSG-DLNLRSAYLEHNPPTSGTLEEIINMRKNKSVPQSVLDGHTLKRQR 2611 DPR + + E+G+ D N R+ L+ + P + + I N RK+K+V +S + HTLKRQ+ Sbjct: 538 DPRLRFVNSESGNASDPNRRAMSLDFSAPNNDLVGGITNPRKHKAVDESFPENHTLKRQK 597 Query: 2610 NGLTRS-----TVSGTGGWGEDTS-VRPQHTLTNQVTES--IGSRDPRNFGNGGWSEDSV 2455 NGLT S T GGW ED+S VR Q + ++ E+ I ++P N DS Sbjct: 598 NGLTNSSDVQMTPGRGGGWLEDSSSVRSQLSDKIRLNENMEIEIKNPGNVVMSDRRPDSN 657 Query: 2454 TRLQPTLTNQVGESIRSSDPRKFGNGEVVFSQRQDNGGRNLTAGAGGKEQLSLIGNGNMG 2275 +Q T T G + S + G ++ A Sbjct: 658 PNIQVTNT---------------GTCMIPSSTTAPSSGTAPSSSAAASV----------- 691 Query: 2274 SLPSTLKDIAVNP-MLINLL-LEHQRL-----QKSNNSPQNLVTGSSLHGFPGSVPLANI 2116 S PS LKDIAVNP ML+ L+ +E QRL QK+ N+ SSL+ PG+V AN+ Sbjct: 692 SFPSLLKDIAVNPTMLMQLIQIEQQRLSAEAQQKTVGLMHNMAHASSLNVLPGAVSSANV 751 Query: 2115 PSSKSLEIDQKHSVKPQVPGQVISTG---DSGKTRMKLRDPRLAARMNTCQKNESLGPLE 1945 S KS E+ S +PQV Q +ST D G+ RMK RDPR N QKNE++ E Sbjct: 752 ASMKSAEVGHNPSGRPQVTAQTVSTNSQSDVGRIRMKPRDPRRILH-NMVQKNETIVS-E 809 Query: 1944 QLKTFGAPSSLTQDSRENLIVRHQSVQAQTNSVPSGAPDISQQFTKELKSLADILSASQ- 1768 + K G SS Q S+++L + Q QAQ +P+ Q K K+L DI S Q Sbjct: 810 RAKPNGTLSSDPQSSKDHLAIGEQGEQAQATGLPT------LQLAKNPKNLGDISSPLQL 863 Query: 1767 --APSVVPLTVSSPIVPIKTDTTEMKTVVTEFKDQESGTVTAPVERIVQPTQ--NMWGDV 1600 P VP +S PI + +++ D ++ + A TQ N WGDV Sbjct: 864 TTTPLAVPQIISQPI-QFNINKVDLRPAAAVVNDPKTLSTVASEGSTTVATQSTNAWGDV 922 Query: 1599 EHLLEGYDDQERAAIHKERARRMEEQNKMFAARKXXXXXXXXXXXLNSAKFVEVDPIHEE 1420 +HLL+GYDDQ++AAI +ERARR+ EQNKMFAARK LNSAKFVEVDP+HEE Sbjct: 923 DHLLDGYDDQQKAAIQRERARRIAEQNKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHEE 982 Query: 1419 
VLRKKEEQDREKPHRHLFRFPHMRMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEM 1240 +LRKKEEQDREKP RHLFRF HM MWTKLRPG+WNFLEKASKLYE+HLYTMGNKLYATEM Sbjct: 983 ILRKKEEQDREKPQRHLFRFQHMGMWTKLRPGIWNFLEKASKLYEMHLYTMGNKLYATEM 1042 Query: 1239 AKVLDPSGALFEGRVISKGDEGDPYDGDERLQKIKDLEGVLGMESNVVIIDDSVRVWPHN 1060 AKVLDP+G LF GRVIS+GD+ +P+DGDER+ K KDL+GVLGMES VVIIDDSVRVWPHN Sbjct: 1043 AKVLDPTGTLFAGRVISRGDDSEPFDGDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHN 1102 Query: 1059 KLNLIVVERYTYFPCSRRQFGLMGPSLLEIDHDERPDEGTLALSLAVIERVHQNFFSHKS 880 KLNLIVVERYTYFPCSRRQFGL GPSLLEIDHDERP++GTLA SL VIER+H +FFSH+S Sbjct: 1103 KLNLIVVERYTYFPCSRRQFGLFGPSLLEIDHDERPEDGTLASSLTVIERIHDDFFSHRS 1162 Query: 879 LNDLDVRSILAAEQRKILAGCRIVFSRIFPVGEMNPQLHPLWQSAEQFGAVCSTQIDEHV 700 LND+DVR+ILAAEQRKILAGC+IVFSR+FPVGE NP LHPLWQ AEQFGA C+ QIDE V Sbjct: 1163 LNDVDVRNILAAEQRKILAGCKIVFSRVFPVGEANPHLHPLWQMAEQFGAACTNQIDEQV 1222 Query: 699 THVVANSLGTDKVNWALNTGRYVVHPGWVEASTLLYRRANEHEFAVK 559 THVVANSLGTDKVNWAL+TGR+VVHP WVEAS LLYRR NE +FAVK Sbjct: 1223 THVVANSLGTDKVNWALSTGRFVVHPSWVEASALLYRRVNEQDFAVK 1269 >XP_018840026.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 isoform X3 [Juglans regia] Length = 1065 Score = 821 bits (2120), Expect = 0.0 Identities = 514/1149 (44%), Positives = 660/1149 (57%), Gaps = 24/1149 (2%) Frame = -1 Query: 3933 FGGACGELWTSLDKLQKFVLNNGTSSSVNSLIQQSFAALQAVKSVYCTMNFKEQTQYKDV 3754 FG C + ++++ L++ VL+ + + ++L+Q F A++AV SV+ +MN + Q K+ Sbjct: 11 FGEVCSRVHSTMESLRE-VLSESSVPTKDALVQLLFTAIKAVNSVFSSMNRNRKEQNKEN 69 Query: 3753 FSRLLVHVKTQNASLFSSEQMKELEDIIQSLEKQKEILEKNGANQNDREENPSLGMNRIE 3574 R++ VK N LFSSEQMKE+E + S++ +L G+ R E Sbjct: 70 VLRVISDVKFGNPPLFSSEQMKEIEVMRSSVDSVDALLSTID------------GVKRKE 117 Query: 3573 SGIVSKNPLEEKNGASQNGHGENPRLGTNPIEIGIVGENPPHVLNSSKKSLLEPISVKHN 3394 + ++ + ++ + E L S+K S + I+V Sbjct: 118 MAAIDAANNKDFDASTTSDGRE---------------------LTSNKLSS-DSIAVGSL 155 Query: 3393 DQGNGKVGSGIFQSGPLSGLKSRGGFGPXXXXXXXXXXXXLPSPTRDAPKPFLIHKPQVQ 3214 N + + + G +S KSR P LPSPTR+AP F +H + Sbjct: 156 
VLSNANILPEVLKPG-VSSFKSRAILLPLLDLHKDHDIDSLPSPTREAPSSFPVHN--IM 212 Query: 3213 HRDHGMDRFPSPTRETLRPVQANKPQAVESRPVKSDGTEMHPYETDAHKAVSTYQQKFGX 3034 GM R PT + + +K +H YETDA KA STYQQKFG Sbjct: 213 DIGDGMARPVLPTAKVAHDTENSK---------------LHIYETDALKAFSTYQQKFGQ 257 Query: 3033 XXXXXXXXXXXXXXSEEGNDDEDDSKGEXXXXXXXXXXXXXXXXPLQAASVYAAFQNNGL 2854 EE +D + D+ GE L + ++ + Sbjct: 258 NSLFTSDLPSPTPS-EEFDDGDGDTSGEVSSSSTIGNIRNVNPPFLWGPPGTPSMDSSSM 316 Query: 2853 -----CRQGTELEI--NPVVRAQKQSRGRDPRRQNLGPEAGSGDLNLRSAYLEHNPPTSG 2695 + T + N +V+A +SR DPR + ++ + N H+ P Sbjct: 317 DGPITTKNSTPITFGSNSIVKASAKSR--DPRLRLANYDSNALYFNQHPLSSVHDTPKVE 374 Query: 2694 TLEEIINMRKNKSVPQSVLDGHTLKRQRNGLTRSTVSGTGGWGEDTSVRPQHTLTNQVTE 2515 + I + +K K++ + L+GH LKRQRNGL S V Sbjct: 375 PVGTI-SSKKQKALEEPTLEGHALKRQRNGLENSGVV----------------------- 410 Query: 2514 SIGSRDPRNF-GNGGWSEDSVTRLQPTLTNQVGESIRSSDPRKFGNGEVVFSQRQDNGGR 2338 RD +N G+GGW +D+ T + +DPRK E+V Sbjct: 411 ----RDMKNVSGSGGWLDDTKTVGSQLMNRNQLMETAETDPRKMA--EIVSCSGISCANA 464 Query: 2337 NLTAGAGGKEQLSLIGNGNMGSLPSTLKDIAVNP-MLINLL-------LEHQRLQKSNNS 2182 N T G EQ+S+ G SLP+ LKDIAVNP +L+N+L LE QKS + Sbjct: 465 NATIS--GNEQVSVTGTSAAASLPALLKDIAVNPTVLLNILKMGQQQSLEADVQQKSADP 522 Query: 2181 PQNLVTGSSLHGFPGSVPLANIPSSKSLEIDQKHSVKPQVPGQVISTG---DSGKTRMKL 2011 ++ S + G+ P+ N+ SK L + QK + +VP Q++ D GK RMK Sbjct: 523 AKSTTQPPSSNSILGTAPMVNVAPSKVLGLLQKQAATLKVPSQIVPMHLQEDLGKIRMKP 582 Query: 2010 RDPRLAARMNTCQKNESLGPLEQLKTFGAPSSLTQDSRENLIVRHQSVQAQTNSVPSGAP 1831 RDPR NT QKN SLG EQ K +S TQ + + Q+ T P Sbjct: 583 RDPRRILHDNTLQKNPSLG-YEQPKITVPLASSTQKQEGQVDTKSTPFQSVTQ------P 635 Query: 1830 DISQQFTKELKSLADILSASQAPSVVPL---TVSSPIVPIKTDTTEMKTVVTEFKDQESG 1660 DI++QFTK LK++AD +S S A + +P+ ++S V K + +MKTV + +DQ SG Sbjct: 636 DIARQFTKNLKNIADFISVSLASTTLPIISHSISCGAVQGKPEKVDMKTVASNSEDQRSG 695 Query: 1659 TVTAPVERIVQPT--QNMWGDVEHLLEGYDDQERAAIHKERARRMEEQNKMFAARKXXXX 1486 T AP + + +NMWGDVEHL EGYDDQ++AAI +ERARR+EEQ KMF+A K Sbjct: 696 TSPAPEIGVAMASRPENMWGDVEHLFEGYDDQQKAAIQRERARRIEEQKKMFSAHKLCLV 755 
Query: 1485 XXXXXXXLNSAKFVEVDPIHEEVLRKKEEQDREKPHRHLFRFPHMRMWTKLRPGVWNFLE 1306 LNSAKF EVDPIH+E+LRKKEEQDREK RHLFRFPHM MWTKLRPG+WNFLE Sbjct: 756 LDLDHTLLNSAKFGEVDPIHDEILRKKEEQDREKQQRHLFRFPHMGMWTKLRPGIWNFLE 815 Query: 1305 KASKLYELHLYTMGNKLYATEMAKVLDPSGALFEGRVISKGDEGDPYDGDERLQKIKDLE 1126 KASKLYELHLYTMGNKLYATEMAKVLDP G LF GRVIS+GD+GD +DGDER+ K KDLE Sbjct: 816 KASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDLFDGDERVPKSKDLE 875 Query: 1125 GVLGMESNVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLMGPSLLEIDHDERPDE 946 GVLGMES VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL GPSLLEIDHDERP++ Sbjct: 876 GVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPED 935 Query: 945 GTLALSLAVIERVHQNFFSHKSLNDLDVRSILAAEQRKILAGCRIVFSRIFPVGEMNPQL 766 GTLA S AVIER+HQNFFSH+SL+++DVR+ILAAEQRKIL GC IVFSR+FPVGE NP L Sbjct: 936 GTLASSSAVIERLHQNFFSHQSLDEVDVRNILAAEQRKILGGCSIVFSRVFPVGEANPHL 995 Query: 765 HPLWQSAEQFGAVCSTQIDEHVTHVVANSLGTDKVNWALNTGRYVVHPGWVEASTLLYRR 586 HPLWQ+AEQFGAVC+ QIDE VTHVVANSLGTDKVNWAL+TGR+VV+PGWVEAS LLYRR Sbjct: 996 HPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVYPGWVEASALLYRR 1055 Query: 585 ANEHEFAVK 559 ANE +FA+K Sbjct: 1056 ANERDFAIK 1064 >OMP02331.1 hypothetical protein CCACVL1_02829 [Corchorus capsularis] Length = 1290 Score = 827 bits (2137), Expect = 0.0 Identities = 552/1335 (41%), Positives = 720/1335 (53%), Gaps = 72/1335 (5%) Frame = -1 Query: 4344 SKRMVWTMEDLFKYNQVFPRNFATPMYNYAWKQAVQNRPLRNV------------NDDNA 4201 S VWTM+DL KY VF R + + +YN+AW QAVQN+PL ++ N+ Sbjct: 74 SNSRVWTMQDLCKYPSVF-RGYTSGLYNFAWAQAVQNKPLNDIFVKDFEQQQEENNNSKR 132 Query: 4200 AAATSVVIEISDE------GV----VVNDVDSXXXXXXXXXXXXGDNDTEMVEGTV---- 4063 ++ +S V ++ + G+ VV D DS + E+ EG + Sbjct: 133 SSPSSSVASVNSKEEKGSSGIPADKVVIDDDSEDELEDDKVVNLEKEEGELEEGEIDLDS 192 Query: 4062 ------VESNLNGMPSSTTDDKIMNENEEIKSIRQVIQLV--INAKNAGKPFGGACGELW 3907 V S+ +G SS+ D + + +E K + + +L+ + A K F C L Sbjct: 193 EPVKERVLSSEDGNVSSS-DGNVGSSDESEKRVNLIRELLEGVTVIEAEKSFEAVCSRLQ 251 Query: 3906 TSLDKLQKFVLNNGTSSSVNSLIQQSFAALQAVKSVYCTMNFKEQTQYKDVFSRLLVHVK 3727 +LD 
L+ + G + ++LIQ +F A + S + +N + Q ++ SRLL VK Sbjct: 252 NALDSLRGLIFEYGVPTK-DTLIQLAFGA---INSAFVALNNNLKEQNVEILSRLLSVVK 307 Query: 3726 TQNASLFSSEQMKELEDIIQSLEKQKEILEKNGANQNDREENPSLGMNRIESGIVSKNPL 3547 + +F +++MKE++ ++ SL ++ D++ G+N+ + Sbjct: 308 GHDPPIFPTDKMKEIQVMLLSLNSPARAID------TDKDTKVVDGINKDHDAVY----- 356 Query: 3546 EEKNGASQNGHGENPRLGTNPIEIGIVGENPPHVLNSSKKSLLEPISVKHNDQGNG---- 3379 EN H L + K L S+ HN Sbjct: 357 ----------------------------ENVGHDLTVTNKLPLPADSIIHNKPNTSTETL 388 Query: 3378 KVGSGIFQSGPLSGLKSRGGFGPXXXXXXXXXXXXLPSPTRDAPKPFLIHKPQVQHRDHG 3199 K+G+ F++ +S P D H+DH Sbjct: 389 KLGTPNFRNRGIS------------------------LPLLDL------------HKDHD 412 Query: 3198 MDRFPSPTRETLRPVQANKPQAVESRPVKS-----------DGTEMHPYETDAHKAVSTY 3052 D PSPTRET + KP KS +G ++HPYE + KA STY Sbjct: 413 ADSLPSPTRETTPCLPVKKPLNTGDVMAKSGFMTGKRSHDAEGNKLHPYEMEPLKAFSTY 472 Query: 3051 QQKFGXXXXXXXXXXXXXXXSEEGNDDEDDSKGEXXXXXXXXXXXXXXXXPLQAA----- 2887 QQKF SEE D+ D+ GE Q Sbjct: 473 QQKFCRGSFFTSDRLPSPTPSEESGDEGGDNGGEVSSSSGLANFKPNLPVLGQPIVSPPP 532 Query: 2886 ---SVYAAFQNNGLCRQGTELEINPVVRAQKQSRGRDPRRQNLGPEAGSGDLNLRSAYLE 2716 S ++ Q R T + + + ++ RDPR + A + DLN Sbjct: 533 QINSATSSMQEQITARNATSVASGSNILLKASAKSRDPRLRFANSNASALDLNEPL---- 588 Query: 2715 HNPPTSGTLEEIINMRKNKSVPQSVLDGHTLKRQRNGLTRSTVSGTGGWGEDTSVRPQHT 2536 HN P + II RK KSV + LDG +KRQRN S V VR T Sbjct: 589 HNAPKVAPVGGIIATRKQKSVEEPALDGPAVKRQRNEPENSGV-----------VRDMQT 637 Query: 2535 LTNQVTESIGSRDPRNFGNGGWSEDS-VTRLQPTLTNQVGESIRSSDPRKFGNGEVVFSQ 2359 ++ GNGGW ED+ V Q T N + S+ RK NG V S Sbjct: 638 VS---------------GNGGWLEDADVIGSQITNRNHTANNSESNS-RKINNG--VNSS 679 Query: 2358 RQDNGGRNLTAGAGGKEQLSLIGNGNMGSLPSTLKDIAVNP-MLINLLLEHQRLQKSNNS 2182 +G N+T G EQ+ + S P+ LKDIAVNP MLIN+L + +KS + Sbjct: 680 STLSGMPNMTVGRN--EQVPMTSTSTP-SFPALLKDIAVNPTMLINILKVAEAQRKSPDP 736 Query: 2181 PQNLVTGSSLHGFPGSVPLANIPSSKSLEIDQKHS--VKPQVPG--QVISTGDSGKTRMK 2014 ++ + PG VP ANI + SL +S V P++ G QV S +SGK RMK Sbjct: 737 VRSALPQPVSSSLPGVVPSANIVPTSSLNTVPSNSSVVMPKLAGNLQVPSLDESGKIRMK 796 Query: 
2013 LRDPRLAARMNTCQKNESLGPLEQLKTFGAPSSLTQDSRENLIVRHQSVQAQTNSVPSG- 1837 RDPR N+ Q++ S+GP +QLKT + +S TQ S++NL R Q ++ + S Sbjct: 797 PRDPRRVLHGNSLQRSGSMGP-DQLKTNVSLTSSTQGSKDNLDARKPEGQTESKPIQSQL 855 Query: 1836 --APDISQQFTKELKSLADILSASQ---APSVVPLTVSSPIVPIKTDTTEMKTVVTEFKD 1672 APDI+QQFTK L +ADI+S SQ +P V + S V IK+D + K V +D Sbjct: 856 VQAPDITQQFTKNLNYIADIMSVSQVMTSPLAVSQNLVSQPVEIKSDNLDTKVSVPNSED 915 Query: 1671 QESGTVTAP-VERIVQPT--QNMWGDVEHLLEGYDDQERAAIHKERARRMEEQNKMFAAR 1501 Q+SGT +AP V P QN WGD EHL E YDD+++A + ERARR+EEQ KMFA+R Sbjct: 916 QQSGTGSAPEVGATTGPPRPQNTWGDFEHLFERYDDRQKATLQLERARRIEEQKKMFASR 975 Query: 1500 KXXXXXXXXXXXLNSAKFVEVDPIHEEVLRKKEEQDREKPHRHLFRFPHMRMWTKLRPGV 1321 K LNSAKF EV+P HEE+LRKKEEQDREKP RHLF F HM MWTKLRPG+ Sbjct: 976 KLCLVLDIDHTLLNSAKFHEVEPKHEEILRKKEEQDREKPKRHLFHFHHMGMWTKLRPGI 1035 Query: 1320 WNFLEKASKLYELHLYTMGNKLYATEMAKVLDPSGALFEGRVISKGDEGDPYDGDERLQK 1141 WNFLEKASKL+ELHLYTMGNKLYATEMAKVLDP G LF GRVIS+GD+GDP+DGDER+ + Sbjct: 1036 WNFLEKASKLFELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPR 1095 Query: 1140 IKDLEGVLGMESNVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLMGPSLLEIDHD 961 KDL+GVLG+ES VVIIDDSVRVWPH+KLNLI VERYTYFP SRRQFGL GPSLLEIDHD Sbjct: 1096 SKDLDGVLGLESAVVIIDDSVRVWPHHKLNLIAVERYTYFPSSRRQFGLPGPSLLEIDHD 1155 Query: 960 ERPDEGTLALSLAVIERVHQNFFSHKSLNDLDVRSILAAEQRKILAGCRIVFSRIFPVGE 781 ERP++GTLA SLAVIER+HQ FFSH++L+D+DVR+ILA+E+RKIL GCRIVFSR+FPV E Sbjct: 1156 ERPEDGTLASSLAVIERIHQEFFSHQNLDDVDVRTILASEKRKILNGCRIVFSRVFPVDE 1215 Query: 780 MNPQLHPLWQSAEQFGAVCSTQIDEHVTHVVANSLGTDKVNWALNTGRYVVHPGWVEAST 601 NP LHPLWQ+AEQFGAVC+ QIDE VTHVVA S GT+KVNWAL+ G++VVHPGWVEAS Sbjct: 1216 ANPHLHPLWQTAEQFGAVCTYQIDERVTHVVAISPGTEKVNWALSNGKFVVHPGWVEASA 1275 Query: 600 LLYRRANEHEFAVKI 556 LLYRRANE +FA+K+ Sbjct: 1276 LLYRRANEVDFAIKL 1290 >OMP09626.1 hypothetical protein COLO4_05290 [Corchorus olitorius] Length = 1261 Score = 825 bits (2131), Expect = 0.0 Identities = 554/1337 (41%), Positives = 725/1337 (54%), Gaps = 74/1337 (5%) Frame = -1 Query: 4344 
SKRMVWTMEDLFKYNQVFPRNFATPMYNYAWKQAVQNRPLRNV------------NDDNA 4201 S VWTM+DL KY VF R + + +YN+AW QAVQN+PL ++ N+ Sbjct: 51 SNSRVWTMQDLCKYPSVF-RGYTSGLYNFAWAQAVQNKPLNDIFVKDFEQQQDENNNSKR 109 Query: 4200 AAATSVVIEIS---DEGV-------VVNDVDSXXXXXXXXXXXXGDNDTEMVEGTVVESN 4051 ++ +S V ++ D+G VV D DS + E+ EG E + Sbjct: 110 SSPSSSVASVNSKEDKGSSGIPADKVVIDDDSEDEMEDDKVVNLEKEEGELEEG---EID 166 Query: 4050 LNGMPSS----TTDDKIMNENEEIKS----IRQVIQLVINAKNAGKPFGGACGELWTSLD 3895 L+ P +++D + ++E++ IR+V++ I A K F C L +LD Sbjct: 167 LDSEPVKERVLSSEDGNVGSSDELEKRVNLIREVLEW-ITVIEAEKSFEAVCSRLQNALD 225 Query: 3894 KLQKFVLNNGTSSSVNSLIQQSFAALQAVKSVYCTMNFKEQTQYKDVFSRLLVHVKTQNA 3715 L+ + + ++LIQ +F A + S + +N + Q ++ SRLL VK + Sbjct: 226 SLRGLIFEYSVPTK-DTLIQLAFGA---INSAFVALNHNLKEQNVEILSRLLSVVKGHDP 281 Query: 3714 SLFSSEQMKELEDIIQSLEKQKEILEKNGANQNDREENPSLGMNRIESGIVSKNPLEEKN 3535 +F +++MKE++ ++ SL ++ D++ G+N+ Sbjct: 282 PMFPTDKMKEIQVMLLSLNSPARAID------TDKDTKVVDGINKDHDA----------- 324 Query: 3534 GASQNGHGENPRLGTNPIEIGIVGENPPHVLNSSKKSLLEPISVKHNDQGNG----KVGS 3367 V EN H L + K L S+ HN K+G+ Sbjct: 325 ----------------------VDENVGHDLTVTNKLPLSADSIIHNKPNTSTETLKLGT 362 Query: 3366 GIFQSGPLSGLKSRGGFGPXXXXXXXXXXXXLPSPTRDAPKPFLIHKPQVQHRDHGMDRF 3187 F++ +S P D H+DH D Sbjct: 363 PNFRNRGIS------------------------LPLLDL------------HKDHDADSL 386 Query: 3186 PSPTRETLRPVQANKPQAVESRPVKS-----------DGTEMHPYETDAHKAVSTYQQKF 3040 PSPTRET + KP KS +G ++HPYE + KA STYQQKF Sbjct: 387 PSPTRETTPCLPVQKPLNTGDVMAKSGFMTGKRSHDAEGNKLHPYEMEPLKAFSTYQQKF 446 Query: 3039 GXXXXXXXXXXXXXXXSEEGNDDEDDSKGEXXXXXXXXXXXXXXXXP--------LQAAS 2884 SEE D+ D+ GE Q S Sbjct: 447 CRGSFFTNDRLPSPTPSEESGDEGGDNGGEVSSSSGLANFKPNLPVLGQPIVSPPPQVNS 506 Query: 2883 VYAAFQNNGLCRQGTELEINPVVRAQKQSRGRDPRRQNLGPEAGSGDLNLRSAYLEHNPP 2704 ++ Q R T + + + ++ RDPR + A + DLN + H+ P Sbjct: 507 ATSSMQEQITARIATSMTSGSNILPKTSAKSRDPRLRFANSNASALDLN---EWPLHDAP 563 Query: 2703 TSGTLEEIINMRKNKSVPQSVLDGHTLKRQRN-----GLTRS--TVSGTGGWGEDTSVRP 2545 ++ II RK KSV + LDG +KRQRN G+ R TVSG GGW ED Sbjct: 564 
KVSSVGGIIATRKQKSVEEPALDGPAVKRQRNEPENSGVVRDMQTVSGNGGWLEDA---- 619 Query: 2544 QHTLTNQVTESIGSRDPRNFGNGGWSEDSVTRLQPTLTNQVGESIRSSDPRKFGNGEVVF 2365 + IGS+ +T T N S+ RK NG V Sbjct: 620 ---------DFIGSQ--------------ITNRNHTADNS------ESNSRKINNG--VN 648 Query: 2364 SQRQDNGGRNLTAGAGGKEQLSLIGNGNMGSLPSTLKDIAVNP-MLINLLLEHQRLQKSN 2188 S +G N+T G EQ+ + SLP+ LKDIAVNP MLIN+L + +KS Sbjct: 649 SSSTLSGMPNMTVGRN--EQVPMTSTSTP-SLPALLKDIAVNPTMLINILKVAEAQRKSP 705 Query: 2187 NSPQNLVTGSSLHGFPGSVPLANIPSSKSLE-IDQKHSV-KPQVPG--QVISTGDSGKTR 2020 + ++ + PG VP ANI + SL + K SV P++ G QV S + GK R Sbjct: 706 DPVRSALPQPVSSSLPGVVPSANIVPTSSLNTVPSKSSVVMPKLAGNLQVPSLDEPGKIR 765 Query: 2019 MKLRDPRLAARMNTCQKNESLGPLEQLKTFGAPSSLTQDSRENLIVRHQSVQAQTNSVPS 1840 MK RDPR ++ Q++ S+GP +QLKT G+ +S TQ S++NL R Q ++ + S Sbjct: 766 MKPRDPRRVLHGSSLQRSGSMGP-DQLKTNGSLTSSTQGSKDNLDARKPEGQTESKPIQS 824 Query: 1839 G---APDISQQFTKELKSLADILSASQ---APSVVPLTVSSPIVPIKTDTTEMKTVVTEF 1678 APDI+QQFTK L +ADI+S SQ +P V + S V IK+D + K V Sbjct: 825 QLVQAPDITQQFTKNLNYIADIMSVSQVMTSPLAVSQNLVSQPVEIKSDNLDTKVSVPNS 884 Query: 1677 KDQESGTVTAP-VERIVQPT--QNMWGDVEHLLEGYDDQERAAIHKERARRMEEQNKMFA 1507 + Q+SGT +AP V P QN WGD EHL E YDD+++A + ERARR+EEQ KMFA Sbjct: 885 EAQQSGTGSAPEVGATTGPPRPQNTWGDFEHLFERYDDRQKATLQLERARRIEEQKKMFA 944 Query: 1506 ARKXXXXXXXXXXXLNSAKFVEVDPIHEEVLRKKEEQDREKPHRHLFRFPHMRMWTKLRP 1327 +RK LNSAKF EV+P HEE+LRKKEEQDREKP RHLFRF HM MWTKLRP Sbjct: 945 SRKLCLVLDIDHTLLNSAKFHEVEPKHEEILRKKEEQDREKPKRHLFRFHHMGMWTKLRP 1004 Query: 1326 GVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPSGALFEGRVISKGDEGDPYDGDERL 1147 G+WNFLEKASKL+ELHLYTMGNKLYATEMAKVLDP G LF GRVIS+GD+GDP+DGDER+ Sbjct: 1005 GIWNFLEKASKLFELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERV 1064 Query: 1146 QKIKDLEGVLGMESNVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLMGPSLLEID 967 + KDL+GVLG+ES VVIIDDSVRVWPH+KLNLI VERYTYFP SRRQFGL GPSLLEID Sbjct: 1065 PRSKDLDGVLGLESAVVIIDDSVRVWPHHKLNLIAVERYTYFPSSRRQFGLPGPSLLEID 1124 Query: 966 HDERPDEGTLALSLAVIERVHQNFFSHKSLNDLDVRSILAAEQRKILAGCRIVFSRIFPV 787 HDERP++GTLA SLAVIER+HQ 
FFSH++L+D+DVR+ILA+E+RKIL GCRIVFSR+FPV Sbjct: 1125 HDERPEDGTLASSLAVIERIHQEFFSHQNLDDVDVRTILASEKRKILNGCRIVFSRVFPV 1184 Query: 786 GEMNPQLHPLWQSAEQFGAVCSTQIDEHVTHVVANSLGTDKVNWALNTGRYVVHPGWVEA 607 E NP LHPLWQ+AEQFGAVC+ QIDE VTHVVA S GT+KVNWAL+ G++VVHPGWVEA Sbjct: 1185 DEANPHLHPLWQTAEQFGAVCTYQIDERVTHVVAISPGTEKVNWALSNGKFVVHPGWVEA 1244 Query: 606 STLLYRRANEHEFAVKI 556 S LLYRRANE +FA+K+ Sbjct: 1245 SALLYRRANEVDFAIKL 1261 >XP_010682659.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 [Beta vulgaris subsp. vulgaris] KMT07356.1 hypothetical protein BVRB_6g149820 [Beta vulgaris subsp. vulgaris] Length = 1252 Score = 823 bits (2127), Expect = 0.0 Identities = 562/1319 (42%), Positives = 729/1319 (55%), Gaps = 64/1319 (4%) Frame = -1 Query: 4323 MEDLFKYNQVFPRNFATPMYNYAWKQAVQNRPLRNV--------NDDNAAAATSVVIEIS 4168 M DL+KY+ A+ +YN AW QAVQN+PL V N+ NA+ + V + Sbjct: 49 MRDLYKYSSYRGYGAASGLYNLAWAQAVQNKPLNEVLVELDDKKNNKNASTDDTSVNKEQ 108 Query: 4167 DEGVVVNDVDSXXXXXXXXXXXXGDNDTEMVEGTVVESNLNGMPSSTTDDKIMNENE--- 3997 E V + V+S D+E EG + E ++ T ++ N N+ Sbjct: 109 GE-VQQHCVESKEVFEVV--------DSEKEEGELEEGEIDFDSDDTGNNHNSNGNKVQD 159 Query: 3996 --------------EIKSIRQVIQLVINAKNAGKPFGGACGELWTSLDKLQKFVLNNGTS 3859 ++ SIR+V+ V A+ A K F C L TSL+ L++ VL+ Sbjct: 160 DFGGLEMDDGELENQVSSIRKVLHNVTVAE-AHKSFDIVCARLRTSLETLRELVLHTWFP 218 Query: 3858 SSVNSLIQQSFAALQAVKSVYCTMNFKEQTQYKDVFSRLLVHVKTQNASLFSSEQMKELE 3679 S ++LIQQ+FAA+Q V SVY +M+ + Q KD SRLL V ++ LF+ EQ KE+E Sbjct: 219 SK-DALIQQAFAAIQCVYSVYSSMSPTLRDQNKDRMSRLLTFVMDLSSVLFTPEQRKEVE 277 Query: 3678 DIIQSLEKQKEILEKNGANQNDREENPSLGMNRIESGIVSKNPLEEKNGASQNGHGENPR 3499 +I S+ I+ +++ +EE P + K L + N + N Sbjct: 278 GMITSV--NPPIVPVKPKSRDRQEELP----------VTEKAILTDSNTLTVN------- 318 Query: 3498 LGTNPIEIGIVGENPPHVLNSSKKSLLEPISVKHNDQGNGKVGSGIFQSGPLSGLKSRGG 3319 G+N +L K + +SV +++ N + S + P S LK R Sbjct: 319 ----------TGDNKSDLL----KKVGPELSVYQSEKKNTDILSEAMRHFP-SSLKVRSS 363 Query: 3318 FGPXXXXXXXXXXXXLPSPTRDAPKPFLIHKPQVQHRDHGMDRFPSPTRETL--RPVQAN 3145 FGP H+ H D 
PSPT +T+ P Sbjct: 364 FGPLLDL----------------------------HKVHDEDSLPSPTSKTMPSLPFFET 395 Query: 3144 KPQAVESRPVKSDGTEMHPYETDAHKAVSTYQQKFGXXXXXXXXXXXXXXXSEEGN---- 2977 P V KS +HPYET+A KAVS+YQQ+FG SE+GN Sbjct: 396 APPRVVHGLQKSG---VHPYETEAVKAVSSYQQRFGRSTFLATDMLPSPTPSEDGNEGGA 452 Query: 2976 DDEDDSKGEXXXXXXXXXXXXXXXXPLQAASVYAAFQN--------NGLCRQGTELEINP 2821 DD ++ Q AA+ + +G + + + +P Sbjct: 453 DDSNEEVSSSNAYTNVVSRTTNSSVVPQPVVSSAAYTSSSTMQGVISGTSAESSSVGSSP 512 Query: 2820 VVRAQKQSRGRDPRRQNLGPEAGSGDLNLRSAYLEHNPPTSGTLE---EIINMRKNKSVP 2650 +RA +S RDPR ++L P GS DL+ + + P ++ LE EI+ +K K++ Sbjct: 513 SLRASAKS--RDPRLRHLNPNFGSLDLSFCPSPMV--PSSASKLEPLGEIMKSKKTKALE 568 Query: 2649 QSVLDGHTLKRQRNGLTRSTVSGTGGWGEDTSVRPQHTLTNQVTESIGSRDPRNFGNGGW 2470 +LDG T KR RNGL ED S+ NQV GS Sbjct: 569 GRLLDGPTAKRPRNGLET----------EDMSMN-----ANQVKTLQGSTRMET------ 607 Query: 2469 SEDSVTRLQPTLTNQVGESIRSSDPRKFGNGEVVFSQRQDNGGRNLTAGAGGKEQLSLIG 2290 S S+ Q + +G +I DPRK G+G V S ++ K +++ G Sbjct: 608 SSSSILGPQSSSRGLLGPAI---DPRKPGSGTV--SSGITTNNPSMAVNKTAKPSMNVSG 662 Query: 2289 NGNMGSLPSTLKDIAVNP-MLINLLLEHQRLQKSNNSPQNLVTGSSLHGFPGSVPLANIP 2113 + SL S LKDIA NP +N++ E KS+ Q++ + + G+ P A Sbjct: 663 S---PSLQSLLKDIAGNPGAWMNIIKEQ---NKSSEPLQSVSHSMNSNSILGAAPSAIAV 716 Query: 2112 SSKSLEIDQKHSVKPQVPGQVISTG---DSGKTRMKLRDPRLAARMNTCQKNESLGPLEQ 1942 S + Q + QVP + T DS K RMK RDPR A N Q+ S P EQ Sbjct: 717 PPISSGVGQTSAGLLQVPSPKVVTSSQDDSAKLRMKPRDPRRALHANMAQRTGSSVP-EQ 775 Query: 1941 LKTFGAPSSLTQDSRENL----IVRHQSVQAQTNSVPSGAPDISQQFTKELKSLADILSA 1774 K G ++ TQ +EN+ V S A ++ P PDI++QFTK LK++ADI+S+ Sbjct: 776 PKVNGVHNTTTQGLQENINAQRYVNGTSPSAASSQTPI-LPDITKQFTKNLKNIADIISS 834 Query: 1773 SQAPSV-VPLTVSSPIVPIKTDTTEMKT-----------VVTEFKDQESGTVTAPVERIV 1630 Q S+ PL VSS +DTT + + V+T +Q + + P E + Sbjct: 835 PQTSSIQSPLAVSSLSAQANSDTTSISSGGQASCSSGGPVIT--GNQRTVSALRPEEVVS 892 Query: 1629 --QPTQNMWGDVEHLLEGYDDQERAAIHKERARRMEEQNKMFAARKXXXXXXXXXXXLNS 1456 +QN WGDVEHL +GYDDQ++AAI +ERARR++EQNKMFA RK LNS Sbjct: 893 
GRPQSQNNWGDVEHLFDGYDDQQKAAIQQERARRLDEQNKMFADRKLCLVLDLDHTLLNS 952 Query: 1455 AKFVEVDPIHEEVLRKKEEQDREKPHRHLFRFPHMRMWTKLRPGVWNFLEKASKLYELHL 1276 AKF EVDP+H+E+LRKKEEQDREKP RHLFRFPHM MWTKLRPG+WNFLEKASKL+ELHL Sbjct: 953 AKFSEVDPVHDEILRKKEEQDREKPRRHLFRFPHMAMWTKLRPGIWNFLEKASKLFELHL 1012 Query: 1275 YTMGNKLYATEMAKVLDPSGALFEGRVISKGDEGDPYDGDERLQKIKDLEGVLGMESNVV 1096 YTMGNKLYATEMAKVLDP G LF GRVIS+GD+GDP DGDER+ K KDLEGV+GMES+VV Sbjct: 1013 YTMGNKLYATEMAKVLDPKGTLFAGRVISRGDDGDPIDGDERVPKSKDLEGVMGMESSVV 1072 Query: 1095 IIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLMGPSLLEIDHDERPDEGTLALSLAVI 916 IIDDS RVWPHNKLNLIVVERYTYFPCSR+QFGL GPSLLEIDHDERP+EGTLA SLAVI Sbjct: 1073 IIDDSARVWPHNKLNLIVVERYTYFPCSRKQFGLPGPSLLEIDHDERPEEGTLASSLAVI 1132 Query: 915 ERVHQNFFSHKSLNDLDVRSILAAEQRKILAGCRIVFSRIFPVGEMNPQLHPLWQSAEQF 736 E++HQNFFSHKSL+D+DVR+IL AEQRKILAGCRI+FSR+FPVGE NP LHPLWQ+AEQF Sbjct: 1133 EKIHQNFFSHKSLDDVDVRNILGAEQRKILAGCRILFSRVFPVGEANPHLHPLWQTAEQF 1192 Query: 735 GAVCSTQIDEHVTHVVANSLGTDKVNWALNTGRYVVHPGWVEASTLLYRRANEHEFAVK 559 GAVC+ Q+DE VTHVVANSLGTDKVNWAL+T R+VVHP WVEAS LLYRR NE +FA+K Sbjct: 1193 GAVCTNQLDEQVTHVVANSLGTDKVNWALSTKRFVVHPSWVEASALLYRRVNEQDFAIK 1251