BLASTX nr result
ID: Ophiopogon21_contig00005270
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Ophiopogon21_contig00005270
         (2337 letters)

Database: ./nr
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

ref|XP_010929653.1| PREDICTED: RNA polymerase II C-terminal doma...    832   0.0
ref|XP_008791049.1| PREDICTED: RNA polymerase II C-terminal doma...    811   0.0
ref|XP_010249185.1| PREDICTED: RNA polymerase II C-terminal doma...    788   0.0
ref|XP_007043830.1| RNA polymerase II C-terminal domain phosphat...    782   0.0
ref|XP_009421039.1| PREDICTED: RNA polymerase II C-terminal doma...    777   0.0
ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative ...    770   0.0
ref|XP_009386584.1| PREDICTED: RNA polymerase II C-terminal doma...    768   0.0
dbj|BAT14211.1| Os11g0521900, partial [Oryza sativa Japonica Group]    756   0.0
gb|EEE52187.1| hypothetical protein OsJ_34058 [Oryza sativa Japo...    756   0.0
gb|ABA93957.1| NLI interacting factor-like phosphatase family pr...    756   0.0
ref|XP_010656789.1| PREDICTED: RNA polymerase II C-terminal doma...    755   0.0
ref|XP_010656786.1| PREDICTED: RNA polymerase II C-terminal doma...    755   0.0
ref|XP_012088736.1| PREDICTED: RNA polymerase II C-terminal doma...    753   0.0
ref|XP_012459418.1| PREDICTED: RNA polymerase II C-terminal doma...    750   0.0
gb|KJB77193.1| hypothetical protein B456_012G125200 [Gossypium r...    750   0.0
gb|KJB77192.1| hypothetical protein B456_012G125200 [Gossypium r...    750   0.0
ref|XP_012459417.1| PREDICTED: RNA polymerase II C-terminal doma...    750   0.0
ref|XP_010656784.1| PREDICTED: RNA polymerase II C-terminal doma...    750   0.0
ref|XP_006662962.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymera...
750 0.0 gb|KDO83166.1| hypothetical protein CISIN_1g000897mg [Citrus sin... 748 0.0 >ref|XP_010929653.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 [Elaeis guineensis] Length = 1268 Score = 832 bits (2150), Expect = 0.0 Identities = 464/793 (58%), Positives = 544/793 (68%), Gaps = 64/793 (8%) Frame = -2 Query: 2324 VNSSGGLQMSS-----VNSSGAFQ---MAPLNTSGGSQMAPVKTSAKSRDPRLRFMNSEV 2169 VN++ +Q+++ +SS + Q + P+ G + + + KSRDPRLRF++SE Sbjct: 488 VNTTSQIQVATSSAACTDSSSSHQPGTVKPVGQLGSAPNLATRPALKSRDPRLRFVSSES 547 Query: 2168 GGA--------------PQNGFAAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRSRDAQV 2031 G A P NG G N RKHKA+DE +P+ H LKRQRN T S D Q+ Sbjct: 548 GSASDPNTQVMSLDSSAPNNGPVGGITNPRKHKAVDESLPENHTLKRQRNGLTNSGDVQM 607 Query: 2030 MPGRGG-WIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXNG- 1857 +PGRGG W++D+ V SQ +++++ ++NM + +N V VG DRR G Sbjct: 608 IPGRGGGWLDDSSAVGSQPSDKIRLSENMEIETKNPVS-VVGSDRRPDSNPNIHVSNTGT 666 Query: 1856 ----------GSASNMSTSPAPVVSLPSLLKDIAVNPTMLVELLKMXXXXXXXXXXXXXX 1707 S++ S+S A VS PSLLKDIAVNPTML++L++M Sbjct: 667 CPIPSSTAAPASSTAPSSSAAASVSFPSLLKDIAVNPTMLMQLIQMEQQRLSAEAQQKTV 726 Query: 1706 A-------GPAVNGLSSAISP-------SPDVGQNPAAKPQM-------NGPNDMGKIRM 1590 ++N LS A+S S +VGQNP +PQ+ N +D+G+IRM Sbjct: 727 GLMQNMAHASSLNVLSGAVSSATVASMKSTEVGQNPGGRPQVPPQTVSTNSQSDVGRIRM 786 Query: 1589 KPRDPRRVLHSNMVQKSESLGSELAKSNGDLPSEVQSSKDLLTVQEQGEQAQTNTLPSQS 1410 KPRDPRRVLH NMVQK+E++ SE AK NG L S+ QSSKD + EQGEQAQ TLP+Q Sbjct: 787 KPRDPRRVLH-NMVQKNETVVSERAKPNGTLSSDPQSSKDQSAIGEQGEQAQATTLPTQ- 844 Query: 1409 ASLPDISQQFTKNLQNLADIVSSSQASALP-VGTQNSSQLIPSKI--------CNDTTEP 1257 QF KN +NL DI S+ Q++ P +Q SQ I KI ++P Sbjct: 845 --------QFAKNTKNLGDISSTLQSTTTPPAASQIISQPIQLKINKVDPRPAAAVVSDP 896 Query: 1256 KTVTGMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERK 1077 KT++ + ++G T +G NPWGDVDHLLDGYDDQQKAAIQ+ERARRI EQNKMFA RK Sbjct: 897 KTLSAVTSEGST-TGATPSTNPWGDVDHLLDGYDDQQKAAIQRERARRIAEQNKMFAARK 955 Query: 1076 XXXXXXXXXXXLNSAKFVEVDPIHEEILXXXXXXXXXXXXRHLFRFPHMGMWTKLRPGVW 897 
LNSAKFVEVDP+HEEIL RHLFRF HMGMWTKLRPG+W Sbjct: 956 LCLVLDLDHTLLNSAKFVEVDPVHEEILRKKEEQDREKPQRHLFRFQHMGMWTKLRPGIW 1015 Query: 896 NFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKS 717 FLEKASKLYE+HLYTMGNKLYATEMAKVLDP GTLF GRVIS+GD+GDPFD DERVPKS Sbjct: 1016 TFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGTLFAGRVISRGDDGDPFDGDERVPKS 1075 Query: 716 KDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDE 537 KDLDGVLGMESAVVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDE Sbjct: 1076 KDLDGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLFGPSLLEIDHDE 1135 Query: 536 RPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEA 357 RP+ GTLASSL VIER+HQ FFSH SLN++DVRNILAAEQRKIL GCKIVFSRVFPVGEA Sbjct: 1136 RPEDGTLASSLAVIERIHQNFFSHHSLNDIDVRNILAAEQRKILAGCKIVFSRVFPVGEA 1195 Query: 356 NPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASAL 177 NPHLHPLWQ AEQFGA CTNQIDE VTHVVANS GTDKVNWALSTGRFVVHPGWVEASAL Sbjct: 1196 NPHLHPLWQMAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASAL 1255 Query: 176 LYRRANEQDFAIK 138 LYRR +E DFA+K Sbjct: 1256 LYRRVSEHDFAVK 1268 >ref|XP_008791049.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 [Phoenix dactylifera] Length = 1269 Score = 811 bits (2094), Expect = 0.0 Identities = 457/796 (57%), Positives = 531/796 (66%), Gaps = 67/796 (8%) Frame = -2 Query: 2324 VNSSGGLQMSS-----VNSSGAFQMAPLNTSGGSQMAP---VKTSAKSRDPRLRFMNSEV 2169 VN++ +Q+++ +SS Q P+ G AP ++ + KSRDPRLRF+NSE Sbjct: 489 VNTTSEIQVATNSAACTDSSSRHQPGPVKPVGQLGSAPNPAIRPALKSRDPRLRFVNSES 548 Query: 2168 GGA--------------PQNGFAAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRSRDAQV 2031 G A P N G N RKHKA+DE P+ H LKRQ+N T S D Q+ Sbjct: 549 GNASDPNRRAMSLDFSAPNNDLVGGITNPRKHKAVDESFPENHTLKRQKNGLTNSSDVQM 608 Query: 2030 MPGRGG-WIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXNG- 1857 PGRGG W+ED+ V SQ++++++ N+NM + +N G V DRR G Sbjct: 609 TPGRGGGWLEDSSSVRSQLSDKIRLNENMEIEIKN-PGNVVMSDRRPDSNPNIQVTNTGT 667 Query: 1856 ----------GSASNMSTSPAPVVSLPSLLKDIAVNPTMLVELLKMXXXXXXXXXXXXXX 1707 S + S+S A VS PSLLKDIAVNPTML++L+++ 
Sbjct: 668 CMIPSSTTAPSSGTAPSSSAAASVSFPSLLKDIAVNPTMLMQLIQIEQQRLSAEAQQKTV 727 Query: 1706 A-------GPAVNGLSSAISP-------SPDVGQNPAAKPQM-------NGPNDMGKIRM 1590 ++N L A+S S +VG NP+ +PQ+ N +D+G+IRM Sbjct: 728 GLMHNMAHASSLNVLPGAVSSANVASMKSAEVGHNPSGRPQVTAQTVSTNSQSDVGRIRM 787 Query: 1589 KPRDPRRVLHSNMVQKSESLGSELAKSNGDLPSEVQSSKDLLTVQEQGEQAQTNTLPSQS 1410 KPRDPRR+LH NMVQK+E++ SE AK NG L S+ QSSKD L + EQGEQAQ LP+ Sbjct: 788 KPRDPRRILH-NMVQKNETIVSERAKPNGTLSSDPQSSKDHLAIGEQGEQAQATGLPTL- 845 Query: 1409 ASLPDISQQFTKNLQNLADIVSSSQASALPVGTQ-----------NSSQLIPSK-ICNDT 1266 Q KN +NL DI S Q + P+ N L P+ + ND Sbjct: 846 --------QLAKNPKNLGDISSPLQLTTTPLAVPQIISQPIQFNINKVDLRPAAAVVND- 896 Query: 1265 TEPKTVTGMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFA 1086 PKT++ + ++G T N WGDVDHLLDGYDDQQKAAIQ+ERARRI EQNKMFA Sbjct: 897 --PKTLSTVASEGSTTVAT-QSTNAWGDVDHLLDGYDDQQKAAIQRERARRIAEQNKMFA 953 Query: 1085 ERKXXXXXXXXXXXLNSAKFVEVDPIHEEILXXXXXXXXXXXXRHLFRFPHMGMWTKLRP 906 RK LNSAKFVEVDP+HEEIL RHLFRF HMGMWTKLRP Sbjct: 954 ARKLCLVLDLDHTLLNSAKFVEVDPVHEEILRKKEEQDREKPQRHLFRFQHMGMWTKLRP 1013 Query: 905 GVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERV 726 G+WNFLEKASKLYE+HLYTMGNKLYATEMAKVLDP GTLF GRVIS+GD+ +PFD DERV Sbjct: 1014 GIWNFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGTLFAGRVISRGDDSEPFDGDERV 1073 Query: 725 PKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEID 546 PKSKDLDGVLGMESAVVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGL GPSLLEID Sbjct: 1074 PKSKDLDGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLFGPSLLEID 1133 Query: 545 HDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPV 366 HDERP+ GTLASSL VIER+H FFSH+SLN+VDVRNILAAEQRKIL GCKIVFSRVFPV Sbjct: 1134 HDERPEDGTLASSLTVIERIHDDFFSHRSLNDVDVRNILAAEQRKILAGCKIVFSRVFPV 1193 Query: 365 GEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEA 186 GEANPHLHPLWQ AEQFGA CTNQIDE VTHVVANS GTDKVNWALSTGRFVVHP WVEA Sbjct: 1194 GEANPHLHPLWQMAEQFGAACTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPSWVEA 1253 Query: 185 SALLYRRANEQDFAIK 138 SALLYRR NEQDFA+K Sbjct: 1254 SALLYRRVNEQDFAVK 
1269 >ref|XP_010249185.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 [Nelumbo nucifera] Length = 1313 Score = 788 bits (2035), Expect = 0.0 Identities = 448/787 (56%), Positives = 528/787 (67%), Gaps = 55/787 (6%) Frame = -2 Query: 2333 MSSVNSSGGLQMSSVNSSGAFQMA-----PLNTSG--GSQMAPVKTSAKSRDPRLRFMNS 2175 ++++NSS L+ S +S A ++ P + G GS + V +AK+RDPRLR+ NS Sbjct: 529 VATINSSTSLKTVSSATSYADNLSGQGLVPAVSVGQLGSMSSHVIRTAKNRDPRLRYANS 588 Query: 2174 EVGGA-----PQNGF--------AAGSVNSRKHKAIDEPVPDEHNLKRQRN---ESTRSR 2043 EVG P +G G + SRKHK ++E + D+H KRQRN S S Sbjct: 589 EVGPLDLNQRPPSGDHDIRKSEPLGGIMGSRKHKIVEESLLDDHTFKRQRNGLINSGASG 648 Query: 2042 DAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXX 1863 D QV+ G GGW+E++ + Q +R + + R L GE + Sbjct: 649 DVQVVSGSGGWLEESSSMGLQPTDRSRLIEKRESDPRKLGSGEASFGNKQDTGCSTYNVT 708 Query: 1862 NGGSASNMSTSPAPVVSLPSLLKDIAVNPTMLVELLKMXXXXXXXXXXXXXXAGPAVNGL 1683 GG+ ++ VSLPSLLKDIAVNPTML+ L+KM PA + + Sbjct: 709 TGGNEQLTASGIGSTVSLPSLLKDIAVNPTMLMHLIKMEHQRLAVEALQKCG-NPAQSTM 767 Query: 1682 ---SSAISPSPDVGQNPAAKP-------------------QMNGPNDMGKIRMKPRDPRR 1569 SS++ P N A+K M D+GKIRMKPRDPRR Sbjct: 768 QSSSSSVMPGKIASVNIASKTLSEPEKKSAGNSQISVQTASMIPHGDLGKIRMKPRDPRR 827 Query: 1568 VLHSNMVQKSESLGSELAKSNGDLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDIS 1389 +LHSN QKS+S G E K+NG + +D L V++QGEQAQTN+L SQS + PDI+ Sbjct: 828 ILHSNTFQKSDSSGPERFKANGTPSPNTPTCRDNLIVRQQGEQAQTNSLLSQSTAPPDIA 887 Query: 1388 QQFTKNLQNLADIVSSSQASALP--VGTQNSSQLIPSK--------ICNDTTEPKTVTGM 1239 QQFTK L+N+A+I+S+SQA P V SSQ +P+K + D+ + ++ + + Sbjct: 888 QQFTKKLKNIANILSASQAINTPSVVPQTISSQPVPAKMDKVDMKVVATDSNDQRSWSAL 947 Query: 1238 CTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXX 1059 T E A+G N WGDV+HL +GYDDQQKAAIQ+ERARRIEEQN+MFA RK Sbjct: 948 -TPEERAAGPSS-QNAWGDVEHLFEGYDDQQKAAIQRERARRIEEQNQMFAARKLCLVLD 1005 Query: 1058 XXXXXLNSAKFVEVDPIHEEILXXXXXXXXXXXXRHLFRFPHMGMWTKLRPGVWNFLEKA 879 LNSAKFVEVDP+HEE+L RHLFRF HMGMWTKLRPG+WNFLEKA Sbjct: 1006 
LDHTLLNSAKFVEVDPVHEEMLRKKEEQDREKPQRHLFRFTHMGMWTKLRPGIWNFLEKA 1065 Query: 878 SKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGV 699 SKLYELHLYTMGNKLYATEMAKVLDP G LF GRVIS+GD+GDPFD DER PKSKDLDGV Sbjct: 1066 SKLYELHLYTMGNKLYATEMAKVLDPTGVLFAGRVISRGDDGDPFDGDERQPKSKDLDGV 1125 Query: 698 LGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGT 519 LGMESAVVIIDDS+RVWPHNK NLIVVERY YFP SRRQ GL GPSLLEIDHDERP+ GT Sbjct: 1126 LGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQLGLHGPSLLEIDHDERPEDGT 1185 Query: 518 LASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHP 339 LASSL VIER+HQ FFSHQ+LN+VDVRNILAAEQ+KIL GC+IVFSRVFPVGEANPHLHP Sbjct: 1186 LASSLAVIERIHQNFFSHQNLNDVDVRNILAAEQQKILAGCRIVFSRVFPVGEANPHLHP 1245 Query: 338 LWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRAN 159 LWQTAEQFGA CTNQIDE VTHVVA S GTDKVNWALSTGRFVVHPGWVEASALLYRRAN Sbjct: 1246 LWQTAEQFGAVCTNQIDEQVTHVVAISLGTDKVNWALSTGRFVVHPGWVEASALLYRRAN 1305 Query: 158 EQDFAIK 138 E DFAIK Sbjct: 1306 EHDFAIK 1312 >ref|XP_007043830.1| RNA polymerase II C-terminal domain phosphatase-like 3, putative [Theobroma cacao] gi|508707765|gb|EOX99661.1| RNA polymerase II C-terminal domain phosphatase-like 3, putative [Theobroma cacao] Length = 1290 Score = 782 bits (2020), Expect = 0.0 Identities = 432/770 (56%), Positives = 510/770 (66%), Gaps = 52/770 (6%) Frame = -2 Query: 2291 VNSSGAFQMAPLNTSGGSQMAPV-----KTSAKSRDPRLRFMNSEVGGAPQN-------- 2151 V+S+ + + T + M+ V K+ AKSRDPRL F NS N Sbjct: 528 VDSASSSLQGQITTRNATPMSSVSNIVSKSLAKSRDPRLWFANSNASALDLNERLLHNAS 587 Query: 2150 --GFAAGSVNSRKHKAIDEPVPDEHNLKRQRNESTR---SRDAQVMPGRGGWIEDNGMVA 1986 G ++SRK K+++EP+ D LKRQRNE +RD Q + G GGW+ED + Sbjct: 588 KVAPVGGIMDSRKKKSVEEPILDSPALKRQRNELENLGVARDVQTVSGIGGWLEDTDAIG 647 Query: 1985 SQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXNGGSASNMSTSPAPVVSLP 1806 SQ+ NR Q +N+ +R + G G+ + + SLP Sbjct: 648 SQITNRNQTAENLESNSRKMDNGVTSSST-----LSGKTNITVGTNEQVPVTSTSTPSLP 702 Query: 1805 SLLKDIAVNPTMLVELLKMXXXXXXXXXXXXXXAG--------PAVNGLSSAIS-----P 1665 
+LLKDIAVNPTML+ +LKM P+ N L +S P Sbjct: 703 ALLKDIAVNPTMLINILKMGQQQRLGAEAQQKSPDPVKSTFHQPSSNSLLGVVSSTNVIP 762 Query: 1664 SPDV----------GQNPAAKPQMNGPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELA 1515 SP V PA Q+ P++ GKIRMKPRDPRRVLH N +Q+S S+G + Sbjct: 763 SPSVNNVPSISSGISSKPAGNLQVPSPDESGKIRMKPRDPRRVLHGNSLQRSGSMGLDQL 822 Query: 1514 KSNGDLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQ 1335 K+NG L S Q SKD L Q+ Q ++ + SQ PDI+QQFT NL+N+ADI+S SQ Sbjct: 823 KTNGALTSSTQGSKDNLNAQKLDSQTESKPMQSQLVPPPDITQQFTNNLKNIADIMSVSQ 882 Query: 1334 A-SALPVGTQNSSQLIPSKIC--NDTTEPKTVTGMCTQGETASGVIDLA--------NPW 1188 A ++LP + N L+P + +D+ + K + +T +G+ A N W Sbjct: 883 ALTSLPPVSHN---LVPQPVLIKSDSMDMKALVSNSEDQQTGAGLAPEAGATGPRSQNAW 939 Query: 1187 GDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXLNSAKFVEVDPI 1008 GDV+HL + YDDQQKAAIQ+ERARRIEEQ KMF+ RK LNSAKF+EVDP+ Sbjct: 940 GDVEHLFERYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFIEVDPV 999 Query: 1007 HEEILXXXXXXXXXXXXRHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYA 828 HEEIL RHLFRF HMGMWTKLRPG+WNFLEKASKLYELHLYTMGNKLYA Sbjct: 1000 HEEILRKKEEQDREKPERHLFRFHHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYA 1059 Query: 827 TEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRVW 648 TEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMESAVVIIDDS+RVW Sbjct: 1060 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESAVVIIDDSVRVW 1119 Query: 647 PHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFFS 468 PHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDERP+ GTLASSL VIER+HQ FFS Sbjct: 1120 PHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIERIHQDFFS 1179 Query: 467 HQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQID 288 HQ+L++VDVRNILA+EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAEQFGA CTNQID Sbjct: 1180 HQNLDDVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQID 1239 Query: 287 EHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 138 EHVTHVVANS GTDKVNWALSTG+FVVHPGWVEASALLYRRANE DFAIK Sbjct: 1240 EHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEVDFAIK 1289 >ref|XP_009421039.1| PREDICTED: RNA polymerase II 
C-terminal domain phosphatase-like 3 [Musa acuminata subsp. malaccensis] Length = 1228 Score = 777 bits (2007), Expect = 0.0 Identities = 449/782 (57%), Positives = 514/782 (65%), Gaps = 50/782 (6%) Frame = -2 Query: 2333 MSSVNSSGGLQMSSVNSSGAFQMAPLNTSGGSQMAPVKT--------------SAKSRDP 2196 +S+ ++ +Q +V SS A N+S G Q PVK + K RDP Sbjct: 459 VSNAETACTIQNQAVKSSST--AACSNSSAGDQPYPVKLVGQVGSGSKSSAKPALKRRDP 516 Query: 2195 RLRFMNSEVGG-----------APQNGFAAGSVNSRKHKAIDEPVPDEHNLKRQRNESTR 2049 RL+ MN+EV G A N GS+N+RKHK++DEPV +H +KRQ+N T Sbjct: 517 RLKLMNNEVRGPSVGDKGIDSNALDNRLVGGSMNTRKHKSVDEPVTGDHKMKRQKNGFTG 576 Query: 2048 SRDAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXX 1869 SRD Q+ GRGGW+ED+ + Q ++R Q N+N V R GEVG ++ Sbjct: 577 SRDMQMTSGRGGWLEDSSI--PQPSDRNQINENFQVEVRKPGSGEVGSGKK--SDSNMNF 632 Query: 1868 XXNGGSASNMSTSPAPVVSLPSLLKDIAVNPTMLVELLKMXXXXXXXXXXXXXXAGPA-- 1695 G N S + +SLP LLK AVNPT+ V+LL+M A + Sbjct: 633 SMLNGLIPNPSGNLPNTLSLPPLLK--AVNPTIFVQLLQMEQHRLAAENHQIVTASTSDV 690 Query: 1694 -----VNGLSSAISP-------SPDVGQN-------PAAKPQMNGPNDMGKIRMKPRDPR 1572 VNGL A+S S +VGQN P+ ++ ND+G+IRMKPRDPR Sbjct: 691 TNVSKVNGLPGAVSSVNSTPLKSQEVGQNHLGMSQIPSQSASVSSQNDVGRIRMKPRDPR 750 Query: 1571 RVLHSNMVQKSESLGSELAKSNGDLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDI 1392 R LH+NMVQ + SE K N +P QSS T +E GEQAQ + L +Q P++ Sbjct: 751 RALHNNMVQMKNVIVSEQNKINEAIPGP-QSSMGHSTAREPGEQAQASVLATQFVPQPNM 809 Query: 1391 SQQFTKNLQNLADIVSSSQASALPVGTQNSSQLIPSKICNDTTEPKTV----TGMCTQGE 1224 S+Q TKNL N IVSSSQ +A +Q Q IPSK P + + Sbjct: 810 SRQLTKNLGN---IVSSSQLAAT---SQAVPQYIPSKANQVNVRPASAELNDSKTLVSEA 863 Query: 1223 TASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXX 1044 TA GV N WGDVDH LDGY+D+Q+AAIQKERARRI EQNKMFA RK Sbjct: 864 TAKGVSQSVNAWGDVDHFLDGYNDEQRAAIQKERARRIAEQNKMFAARKLCLVLDLDHTL 923 Query: 1043 LNSAKFVEVDPIHEEILXXXXXXXXXXXXRHLFRFPHMGMWTKLRPGVWNFLEKASKLYE 864 LNSAKFVEVDP+HEEIL RHLF F HMGMWTKLRPG+WNFL+KASKLYE Sbjct: 924 LNSAKFVEVDPVHEEILRRKEEQDREKPQRHLFCFHHMGMWTKLRPGIWNFLDKASKLYE 983 Query: 863 
LHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMES 684 LHLYTMGNKLYATEMAKVLDP GTLF GRVIS+GD+ D D DERVPKSKDLDGVLGMES Sbjct: 984 LHLYTMGNKLYATEMAKVLDPTGTLFSGRVISRGDDADTVDGDERVPKSKDLDGVLGMES 1043 Query: 683 AVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSL 504 AVVIIDDSLRVWP NK NLIVVERY YFPSSRRQFGL GPSLLEIDHDERP+ GTLASSL Sbjct: 1044 AVVIIDDSLRVWPLNKLNLIVVERYTYFPSSRRQFGLLGPSLLEIDHDERPEDGTLASSL 1103 Query: 503 GVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTA 324 VIER+HQ FFSH SL +VDVRNILAAEQRKIL GC+IVFSRVFPVGEANPHLHPLWQTA Sbjct: 1104 AVIERIHQNFFSHHSLKDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTA 1163 Query: 323 EQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 144 EQFGA CTNQIDE VTHVVANS GTDKVNWALSTGRFVVHPGWVEASALLYRRANE DFA Sbjct: 1164 EQFGAICTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEHDFA 1223 Query: 143 IK 138 +K Sbjct: 1224 VK 1225 >ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis] gi|223548611|gb|EEF50102.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis] Length = 1195 Score = 770 bits (1989), Expect = 0.0 Identities = 426/774 (55%), Positives = 513/774 (66%), Gaps = 42/774 (5%) Frame = -2 Query: 2333 MSSVNSSGGLQMSSVNSSGAFQMAPLNTSGGSQMAP---VKTSAKSRDPRLRFMNSEVGG 2163 ++S S+ + + ++ S + + ++ + AP VK SAKSRDPRLRF+NS+ Sbjct: 421 LTSGQSNASISLPRMDGSSLPGVISIKSAVRASSAPSLTVKASAKSRDPRLRFVNSDSNA 480 Query: 2162 APQNGFA------------AGSVNSRKHKAIDEPVPDEHNLKRQRNESTRS---RDAQVM 2028 QN A G++N ++ K +D+P+PD H+LKRQ+N S RD + M Sbjct: 481 LDQNHRAVPVVNTLKVEPIGGTMNKKRQKIVDDPIPDGHSLKRQKNALENSGVVRDVKTM 540 Query: 2027 PGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRL---VXXXXXXXXXNG 1857 G GGW+ED MV Q N+ Q N R GG V V Sbjct: 541 VGSGGWLEDTDMVGPQTMNKNQLVDNAESDPRRKDGGGVCTSSSCISSVNISGTEQIPVT 600 Query: 1856 GSASNMSTSPAPV----VSLPSLLKDIAVNPTMLVELLKMXXXXXXXXXXXXXXAGPAVN 1689 G++ + PV ++P LLK+IAVNPTML+ +LKM PA + Sbjct: 601 GTSVPIGGELVPVKGSTAAIPDLLKNIAVNPTMLINILKMGQQQRLALEAQQKPVDPAKS 660 Query: 1688 
-----GLSSAISPSPDVG-------QNPA----AKPQMNGPNDMGKIRMKPRDPRRVLHS 1557 +S + P VG PA PQ+ +D+GKIRMKPRDPRRVLH+ Sbjct: 661 TTYPLNSNSMLGTVPVVGAAHSGILPRPAGTVQVSPQLGTADDLGKIRMKPRDPRRVLHN 720 Query: 1556 NMVQKSESLGSELAKSNGDLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFT 1377 N +Q++ S+GSE K+N Q +KD +Q+Q Q + +P QS +LPDIS FT Sbjct: 721 NALQRNGSMGSEHLKTNLTSIPINQETKDNQNLQKQEGQVEKKPVPLQSLALPDISMPFT 780 Query: 1376 KNLQNLADIVSSSQAS-ALPVGTQNSSQLIPSKICNDTTEPKTVTGMCTQGETASGVIDL 1200 KNL+N+ADIVS S AS + P+ QN + + + + + A+ Sbjct: 781 KNLKNIADIVSVSHASTSQPLVPQNPASQPMRTTISSSDQFLGIGSAPGAAAAAAAGPRT 840 Query: 1199 ANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXLNSAKFVE 1020 N WGDV+HL +GY+DQQKAAIQ+ERARRIEEQ K+F+ RK LNSAKFVE Sbjct: 841 QNAWGDVEHLFEGYNDQQKAAIQRERARRIEEQKKLFSARKLCLVLDLDHTLLNSAKFVE 900 Query: 1019 VDPIHEEILXXXXXXXXXXXXRHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGN 840 VDP+H+EIL RHLFRFPHMGMWTKLRPG+WNFLEKASKLYELHLYTMGN Sbjct: 901 VDPVHDEILRKKEEQDREKAHRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGN 960 Query: 839 KLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAVVIIDDS 660 KLYATEMAKVLDP G LF+GRVIS+GD+G+PFD DER+PKSKDL+GVLGMES VVI+DDS Sbjct: 961 KLYATEMAKVLDPTGVLFNGRVISRGDDGEPFDGDERIPKSKDLEGVLGMESGVVIMDDS 1020 Query: 659 LRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGVIERLHQ 480 +RVWPHNK NLIVVERY+YFP SRRQFGLPGPSLLEIDHDERP+ GTLA SL VIER+HQ Sbjct: 1021 VRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEDGTLACSLAVIERIHQ 1080 Query: 479 IFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQFGAECT 300 FF+H SL+E DVRNILA+EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAEQFGA CT Sbjct: 1081 NFFTHPSLDEADVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCT 1140 Query: 299 NQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 138 NQIDE VTHVVANS GTDKVNWALSTGRFVV+PGWVEASALLYRRANEQDFAIK Sbjct: 1141 NQIDEQVTHVVANSLGTDKVNWALSTGRFVVYPGWVEASALLYRRANEQDFAIK 1194 >ref|XP_009386584.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 [Musa acuminata subsp. 
malaccensis] Length = 1251 Score = 768 bits (1982), Expect = 0.0 Identities = 431/741 (58%), Positives = 505/741 (68%), Gaps = 42/741 (5%) Frame = -2 Query: 2225 VKTSAKSRDPRLRFMNSEVGG-----------APQNGFAAGSVNSRKHKAIDEPVPD-EH 2082 VK + K RDPRLRFMN+EV G AP +GF G++N+RKHK DE + Sbjct: 522 VKPALKRRDPRLRFMNNEVRGPSEERSGIRCNAPDDGFLGGTINARKHKIADESAAVVDQ 581 Query: 2081 NLKRQRNESTRSRDAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPD 1902 +KRQRN S SR+ V+ G W+E + ++ Q + R Q N+N+ R GEVG D Sbjct: 582 TMKRQRNGSMSSRNMHVISGSSEWLEGDSIIP-QPSERSQVNENLHADIRKAGTGEVGFD 640 Query: 1901 RRLVXXXXXXXXXNGGSASNMSTSPAPVVSLPSLLKDIAVNPTMLVELLKMXXXXXXXXX 1722 + G N S++PA +SLPSLLK AVNPT+LV+LLKM Sbjct: 641 KE--PNSNANFSMLNGLKPNSSSNPAGPISLPSLLK--AVNPTILVQLLKMEQQRLAAEN 696 Query: 1721 XXXXXAGPA-------VNGLSSAISP-------SPDVGQNPAAKPQ-------MNGPNDM 1605 + V+GL A+S S + GQN Q M+ ND+ Sbjct: 697 QQNVTTSTSDITNVSSVSGLPGAVSSVISTPVRSNEPGQNQLGISQVSPQSASMSSQNDL 756 Query: 1604 GKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGDLPSEVQSSKDLLTVQEQGEQAQTNT 1425 G+IRMKPRDPRR+LH+N+VQK+E + SE NG Q + LT +E GEQAQ+N Sbjct: 757 GRIRMKPRDPRRILHNNIVQKNEVVASEQNNINGATAGP-QGTMGHLTAREAGEQAQSNI 815 Query: 1424 LPSQSASLPDISQQFTKNLQNLADIVSSSQASAL-PVGTQNSSQLIPSK--------ICN 1272 LP+Q + PD S++ TKNL IVSS Q + P +SQ I SK Sbjct: 816 LPTQFSPPPDRSEELTKNLPT---IVSSLQLTTTSPTIPHGNSQPISSKGNQMDVKLALA 872 Query: 1271 DTTEPKTVTGMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKM 1092 + +PKTV+ + + E ++GV + N WGDVDHLLDGY+D+QKAAIQ+ERARRI EQNKM Sbjct: 873 EVNDPKTVSDVLS--ERSAGVSESTNLWGDVDHLLDGYNDEQKAAIQRERARRIVEQNKM 930 Query: 1091 FAERKXXXXXXXXXXXLNSAKFVEVDPIHEEILXXXXXXXXXXXXRHLFRFPHMGMWTKL 912 FA RK LNSAKFVEVDP+HEE+L RH++ F HMGMWTKL Sbjct: 931 FAARKLCLVLDLDHTLLNSAKFVEVDPVHEEVLRRKEEQDREKPQRHIYCFQHMGMWTKL 990 Query: 911 RPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDE 732 RPG+WNFLEKASKLYELHLYTMGNKLYATEMAKVLDP G+LF GRVIS+GD+GDP + DE Sbjct: 991 RPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGSLFSGRVISRGDDGDPLNGDE 1050 Query: 731 
RVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLE 552 RVPKSKDLDGVLGMESAVVIIDDS+RVWPHNK NLIVVERY +FPSSRRQFGL GPSLLE Sbjct: 1051 RVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTFFPSSRRQFGLLGPSLLE 1110 Query: 551 IDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVF 372 IDHDERP+ GTLASSL VIER+HQ FFSH S+ + DVRNILA+EQRKIL GC+IVFSRVF Sbjct: 1111 IDHDERPEDGTLASSLAVIERIHQNFFSHHSIKDADVRNILASEQRKILTGCRIVFSRVF 1170 Query: 371 PVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWV 192 PVGEANPHLHPLWQTAEQFGA CT+QIDE VTHVVANS GTDKVNWALSTGRFVVHPGWV Sbjct: 1171 PVGEANPHLHPLWQTAEQFGAVCTSQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWV 1230 Query: 191 EASALLYRRANEQDFAIKG*T 129 EASALLYRR NE DFA+K T Sbjct: 1231 EASALLYRRVNEHDFAVKAVT 1251 >dbj|BAT14211.1| Os11g0521900, partial [Oryza sativa Japonica Group] Length = 1226 Score = 756 bits (1952), Expect = 0.0 Identities = 422/757 (55%), Positives = 504/757 (66%), Gaps = 51/757 (6%) Frame = -2 Query: 2255 NTSGGSQMAPVKTSAKSRDPRLRFMNSEVGGA-------------PQNGFAAG---SVNS 2124 + SG + + +K +AKSRDPRL+F+N + GG P G S+NS Sbjct: 477 SVSGSNHL--LKATAKSRDPRLKFLNRDTGGVADANRRVNFAEPNPSKDRTMGGGVSINS 534 Query: 2123 RKHKAIDEPVPDEHNLKRQRNESTRSRDAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMA 1944 RK+KA+DEP+ DE+ LKR R RD Q GRGGW +D G ++S ++ QPN+N Sbjct: 535 RKNKAVDEPMVDENALKRSRGVIGNLRDMQPT-GRGGWAKDGGNISSYSSDGFQPNQNTR 593 Query: 1943 VGNRNLVGGEVGPDRRLVXXXXXXXXXNGGSA---------SNMSTSPAPVVSLPSLLKD 1791 +GN + D L +G S S TS AP VSLP++LKD Sbjct: 594 LGNNTTGNHNIRTDSTLASNLNNTTNNSGTSPGIVQAPQTNSAPQTSSAPAVSLPAMLKD 653 Query: 1790 IAVNPTMLVELLKMXXXXXXXXXXXXXXAGPAVNGLSSAISP-----------SPDVGQN 1644 IAVNPTML++ ++M G++S ++P + +V Sbjct: 654 IAVNPTMLMQWIQMEQQKMSASEPQQKVTASV--GMTSNVTPGMVLPLGNAPKTTEVAAV 711 Query: 1643 PAAKPQ-------MNGPNDMGKIRMKPRDPRRVLHSNMVQKSESL---GSELAKSNGDLP 1494 P+ +PQ M+ ND G IRMKPRDPRR+LHSN+VQK++++ G E AKSNG P Sbjct: 712 PSVRPQVPMQSAPMHSQNDTGVIRMKPRDPRRILHSNIVQKNDTVPPVGVEQAKSNGTAP 771 Query: 1493 SEVQSSKDLLTVQEQ-GEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQASALPV 1317 + QSSKD L 
Q+Q EQ Q LPS + ++ T N +++ ++ A P Sbjct: 772 PDSQSSKDHLLNQDQKAEQLQAIALPSLPVT--SSARPVTMNANPVSNSQLAATALMPPH 829 Query: 1316 G----TQNSSQLIPSKICNDTTEPKTVTGMCTQGETASGVIDLANPWGDVDHLLDGYDDQ 1149 G T +S ++ E T TA + A+P+GDVDHLLDGYDDQ Sbjct: 830 GNTKQTSSSVNKADPRLAAGQNESNDDAATSTGPVTAPDAVPPASPYGDVDHLLDGYDDQ 889 Query: 1148 QKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXLNSAKFVEVDPIHEEILXXXXXXXX 969 QKA IQKERARRI+EQ+KMFA RK LNSAKF+EVD IH EIL Sbjct: 890 QKALIQKERARRIKEQHKMFAARKLCLVLDLDHTLLNSAKFIEVDHIHGEILRKKEEQDR 949 Query: 968 XXXXRHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTL 789 RHLF F HMGMWTKLRPG+WNFLEKASKLYELHLYTMGNK+YATEMAKVLDP GTL Sbjct: 950 ERAERHLFCFNHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKVYATEMAKVLDPTGTL 1009 Query: 788 FHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERY 609 F GRVIS+GD+GDPFDSDERVPKSKDLDGVLGMESAVVIIDDS+RVWPHNKHNLIVVERY Sbjct: 1010 FAGRVISRGDDGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKHNLIVVERY 1069 Query: 608 MYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNIL 429 YFP SRRQFGLPGPSLLEID DERP+ GTLASSL VIER+H+ FFSH +LN+ DVR+IL Sbjct: 1070 TYFPCSRRQFGLPGPSLLEIDRDERPEDGTLASSLAVIERIHKNFFSHPNLNDADVRSIL 1129 Query: 428 AAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGT 249 A+EQ++ILGGC+IVFSR+FPVGEANPH+HPLWQTAEQFGA CTNQID+ VTHVVANS GT Sbjct: 1130 ASEQQRILGGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGT 1189 Query: 248 DKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 138 DKVNWALSTGRFVVHPGWVEASALLYRRA+E DFA+K Sbjct: 1190 DKVNWALSTGRFVVHPGWVEASALLYRRASELDFAVK 1226 >gb|EEE52187.1| hypothetical protein OsJ_34058 [Oryza sativa Japonica Group] Length = 1267 Score = 756 bits (1952), Expect = 0.0 Identities = 422/757 (55%), Positives = 504/757 (66%), Gaps = 51/757 (6%) Frame = -2 Query: 2255 NTSGGSQMAPVKTSAKSRDPRLRFMNSEVGGA-------------PQNGFAAG---SVNS 2124 + SG + + +K +AKSRDPRL+F+N + GG P G S+NS Sbjct: 518 SVSGSNHL--LKATAKSRDPRLKFLNRDTGGVADANRRVNFAEPNPSKDRTMGGGVSINS 575 Query: 2123 RKHKAIDEPVPDEHNLKRQRNESTRSRDAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMA 1944 RK+KA+DEP+ DE+ 
LKR R RD Q GRGGW +D G ++S ++ QPN+N Sbjct: 576 RKNKAVDEPMVDENALKRSRGVIGNLRDMQPT-GRGGWAKDGGNISSYSSDGFQPNQNTR 634 Query: 1943 VGNRNLVGGEVGPDRRLVXXXXXXXXXNGGSA---------SNMSTSPAPVVSLPSLLKD 1791 +GN + D L +G S S TS AP VSLP++LKD Sbjct: 635 LGNNTTGNHNIRTDSTLASNLNNTTNNSGTSPGIVQAPQTNSAPQTSSAPAVSLPAMLKD 694 Query: 1790 IAVNPTMLVELLKMXXXXXXXXXXXXXXAGPAVNGLSSAISP-----------SPDVGQN 1644 IAVNPTML++ ++M G++S ++P + +V Sbjct: 695 IAVNPTMLMQWIQMEQQKMSASEPQQKVTASV--GMTSNVTPGMVLPLGNAPKTTEVAAV 752 Query: 1643 PAAKPQ-------MNGPNDMGKIRMKPRDPRRVLHSNMVQKSESL---GSELAKSNGDLP 1494 P+ +PQ M+ ND G IRMKPRDPRR+LHSN+VQK++++ G E AKSNG P Sbjct: 753 PSVRPQVPMQSAPMHSQNDTGVIRMKPRDPRRILHSNIVQKNDTVPPVGVEQAKSNGTAP 812 Query: 1493 SEVQSSKDLLTVQEQ-GEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQASALPV 1317 + QSSKD L Q+Q EQ Q LPS + ++ T N +++ ++ A P Sbjct: 813 PDSQSSKDHLLNQDQKAEQLQAIALPSLPVT--SSARPVTMNANPVSNSQLAATALMPPH 870 Query: 1316 G----TQNSSQLIPSKICNDTTEPKTVTGMCTQGETASGVIDLANPWGDVDHLLDGYDDQ 1149 G T +S ++ E T TA + A+P+GDVDHLLDGYDDQ Sbjct: 871 GNTKQTSSSVNKADPRLAAGQNESNDDAATSTGPVTAPDAVPPASPYGDVDHLLDGYDDQ 930 Query: 1148 QKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXLNSAKFVEVDPIHEEILXXXXXXXX 969 QKA IQKERARRI+EQ+KMFA RK LNSAKF+EVD IH EIL Sbjct: 931 QKALIQKERARRIKEQHKMFAARKLCLVLDLDHTLLNSAKFIEVDHIHGEILRKKEEQDR 990 Query: 968 XXXXRHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTL 789 RHLF F HMGMWTKLRPG+WNFLEKASKLYELHLYTMGNK+YATEMAKVLDP GTL Sbjct: 991 ERAERHLFCFNHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKVYATEMAKVLDPTGTL 1050 Query: 788 FHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERY 609 F GRVIS+GD+GDPFDSDERVPKSKDLDGVLGMESAVVIIDDS+RVWPHNKHNLIVVERY Sbjct: 1051 FAGRVISRGDDGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKHNLIVVERY 1110 Query: 608 MYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNIL 429 YFP SRRQFGLPGPSLLEID DERP+ GTLASSL VIER+H+ FFSH +LN+ DVR+IL Sbjct: 1111 TYFPCSRRQFGLPGPSLLEIDRDERPEDGTLASSLAVIERIHKNFFSHPNLNDADVRSIL 1170 Query: 428 AAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGT 249 
A+EQ++ILGGC+IVFSR+FPVGEANPH+HPLWQTAEQFGA CTNQID+ VTHVVANS GT Sbjct: 1171 ASEQQRILGGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGT 1230 Query: 248 DKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 138 DKVNWALSTGRFVVHPGWVEASALLYRRA+E DFA+K Sbjct: 1231 DKVNWALSTGRFVVHPGWVEASALLYRRASELDFAVK 1267 >gb|ABA93957.1| NLI interacting factor-like phosphatase family protein, expressed [Oryza sativa Japonica Group] Length = 1272 Score = 756 bits (1952), Expect = 0.0 Identities = 422/757 (55%), Positives = 504/757 (66%), Gaps = 51/757 (6%) Frame = -2 Query: 2255 NTSGGSQMAPVKTSAKSRDPRLRFMNSEVGGA-------------PQNGFAAG---SVNS 2124 + SG + + +K +AKSRDPRL+F+N + GG P G S+NS Sbjct: 523 SVSGSNHL--LKATAKSRDPRLKFLNRDTGGVADANRRVNFAEPNPSKDRTMGGGVSINS 580 Query: 2123 RKHKAIDEPVPDEHNLKRQRNESTRSRDAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMA 1944 RK+KA+DEP+ DE+ LKR R RD Q GRGGW +D G ++S ++ QPN+N Sbjct: 581 RKNKAVDEPMVDENALKRSRGVIGNLRDMQPT-GRGGWAKDGGNISSYSSDGFQPNQNTR 639 Query: 1943 VGNRNLVGGEVGPDRRLVXXXXXXXXXNGGSA---------SNMSTSPAPVVSLPSLLKD 1791 +GN + D L +G S S TS AP VSLP++LKD Sbjct: 640 LGNNTTGNHNIRTDSTLASNLNNTTNNSGTSPGIVQAPQTNSAPQTSSAPAVSLPAMLKD 699 Query: 1790 IAVNPTMLVELLKMXXXXXXXXXXXXXXAGPAVNGLSSAISP-----------SPDVGQN 1644 IAVNPTML++ ++M G++S ++P + +V Sbjct: 700 IAVNPTMLMQWIQMEQQKMSASEPQQKVTASV--GMTSNVTPGMVLPLGNAPKTTEVAAV 757 Query: 1643 PAAKPQ-------MNGPNDMGKIRMKPRDPRRVLHSNMVQKSESL---GSELAKSNGDLP 1494 P+ +PQ M+ ND G IRMKPRDPRR+LHSN+VQK++++ G E AKSNG P Sbjct: 758 PSVRPQVPMQSAPMHSQNDTGVIRMKPRDPRRILHSNIVQKNDTVPPVGVEQAKSNGTAP 817 Query: 1493 SEVQSSKDLLTVQEQ-GEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQASALPV 1317 + QSSKD L Q+Q EQ Q LPS + ++ T N +++ ++ A P Sbjct: 818 PDSQSSKDHLLNQDQKAEQLQAIALPSLPVT--SSARPVTMNANPVSNSQLAATALMPPH 875 Query: 1316 G----TQNSSQLIPSKICNDTTEPKTVTGMCTQGETASGVIDLANPWGDVDHLLDGYDDQ 1149 G T +S ++ E T TA + A+P+GDVDHLLDGYDDQ Sbjct: 876 GNTKQTSSSVNKADPRLAAGQNESNDDAATSTGPVTAPDAVPPASPYGDVDHLLDGYDDQ 935 Query: 1148 QKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXLNSAKFVEVDPIHEEILXXXXXXXX 969 QKA IQKERARRI+EQ+KMFA RK 
LNSAKF+EVD IH EIL Sbjct: 936 QKALIQKERARRIKEQHKMFAARKLCLVLDLDHTLLNSAKFIEVDHIHGEILRKKEEQDR 995 Query: 968 XXXXRHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTL 789 RHLF F HMGMWTKLRPG+WNFLEKASKLYELHLYTMGNK+YATEMAKVLDP GTL Sbjct: 996 ERAERHLFCFNHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKVYATEMAKVLDPTGTL 1055 Query: 788 FHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERY 609 F GRVIS+GD+GDPFDSDERVPKSKDLDGVLGMESAVVIIDDS+RVWPHNKHNLIVVERY Sbjct: 1056 FAGRVISRGDDGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKHNLIVVERY 1115 Query: 608 MYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNIL 429 YFP SRRQFGLPGPSLLEID DERP+ GTLASSL VIER+H+ FFSH +LN+ DVR+IL Sbjct: 1116 TYFPCSRRQFGLPGPSLLEIDRDERPEDGTLASSLAVIERIHKNFFSHPNLNDADVRSIL 1175 Query: 428 AAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGT 249 A+EQ++ILGGC+IVFSR+FPVGEANPH+HPLWQTAEQFGA CTNQID+ VTHVVANS GT Sbjct: 1176 ASEQQRILGGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGT 1235 Query: 248 DKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 138 DKVNWALSTGRFVVHPGWVEASALLYRRA+E DFA+K Sbjct: 1236 DKVNWALSTGRFVVHPGWVEASALLYRRASELDFAVK 1272 >ref|XP_010656789.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 isoform X3 [Vitis vinifera] Length = 1273 Score = 755 bits (1949), Expect = 0.0 Identities = 421/737 (57%), Positives = 495/737 (67%), Gaps = 41/737 (5%) Frame = -2 Query: 2225 VKTSAKSRDPRLRFMNSEVGG-------------APQNGFAAGSVNSRKHKAIDEPVPDE 2085 ++ SAKSRDPRLR +S+ G +P+ V+SRK K+ +EP+ D Sbjct: 562 LRASAKSRDPRLRLASSDAGSLDLNERPLPAVSNSPKVDPLGEIVSSRKQKSAEEPLLDG 621 Query: 2084 HNLKRQRNESTRS---RDAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNL---- 1926 KRQRN T RDAQ + GGW+ED+ V QM NR Q +N + L Sbjct: 622 PVTKRQRNGLTSPATVRDAQTVVASGGWLEDSNTVIPQMMNRNQLIENTGTDPKKLESKV 681 Query: 1925 -VGGEVGPDRRLVXXXXXXXXXNGGSASNMSTSPAPVVSLPSLLKDIAVNPTMLVELLKM 1749 V G +G D+ V G+ + + SL SLLKDIAVNP + + + Sbjct: 682 TVTG-IGCDKPYVTV--------NGNEHLPVVATSTTASLQSLLKDIAVNPAVWMNIFNK 732 Query: 1748 XXXXXXXXXXXXXXAGPAVNGLSSAISPSP-------DVGQNPAAK---PQMNGPNDMGK 
1599 P N + + P+ +GQ PA PQ ++ GK Sbjct: 733 VEQQKSGDPAKNTVLPPTSNSILGVVPPASVAPLKPSALGQKPAGALQVPQTGPMDESGK 792 Query: 1598 IRMKPRDPRRVLHSNMVQKSESLGSELAKSNGDLPSEVQSSKDLLTVQEQGEQAQTNTLP 1419 +RMKPRDPRR+LH+N Q+S S GSE K+N Q+Q +Q +T ++P Sbjct: 793 VRMKPRDPRRILHANSFQRSGSSGSEQFKTNA---------------QKQEDQTETKSVP 837 Query: 1418 SQSASLPDISQQFTKNLQNLADIVSSSQASAL-PVGTQ---------NSSQLIPSKICND 1269 S S + PDISQQFTKNL+N+AD++S+SQAS++ P Q N+ ++ +D Sbjct: 838 SHSVNPPDISQQFTKNLKNIADLMSASQASSMTPTFPQILSSQSVQVNTDRMDVKATVSD 897 Query: 1268 TTEPKTVTGMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMF 1089 + + T G ++ E+A+G N WGDV+HL DGYDDQQKAAIQ+ERARRIEEQ KMF Sbjct: 898 SGDQLTANG--SKPESAAGPPQSKNTWGDVEHLFDGYDDQQKAAIQRERARRIEEQKKMF 955 Query: 1088 AERKXXXXXXXXXXXLNSAKFVEVDPIHEEILXXXXXXXXXXXXRHLFRFPHMGMWTKLR 909 + RK LNSAKFVEVDP+H+EIL RHLFRFPHMGMWTKLR Sbjct: 956 SARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKSQRHLFRFPHMGMWTKLR 1015 Query: 908 PGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDER 729 PG+WNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKG LF GRVISKGD+GD D DER Sbjct: 1016 PGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKGDDGDVLDGDER 1075 Query: 728 VPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEI 549 VPKSKDL+GVLGMESAVVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGLPGPSLLEI Sbjct: 1076 VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEI 1135 Query: 548 DHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFP 369 DHDERP+ GTLASSL VIER+HQ FFS+++L+EVDVRNILA+EQRKIL GC+IVFSRVFP Sbjct: 1136 DHDERPEDGTLASSLAVIERIHQSFFSNRALDEVDVRNILASEQRKILAGCRIVFSRVFP 1195 Query: 368 VGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVE 189 VGEANPHLHPLWQTAE FGA CTNQIDE VTHVVANS GTDKVNWALSTGRFVVHPGWVE Sbjct: 1196 VGEANPHLHPLWQTAESFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVE 1255 Query: 188 ASALLYRRANEQDFAIK 138 ASALLYRRANEQDFAIK Sbjct: 1256 ASALLYRRANEQDFAIK 1272 >ref|XP_010656786.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 isoform X2 [Vitis vinifera] Length = 1276 Score = 755 bits 
(1949), Expect = 0.0 Identities = 422/740 (57%), Positives = 496/740 (67%), Gaps = 44/740 (5%) Frame = -2 Query: 2225 VKTSAKSRDPRLRFMNSEVGG-------------APQNGFAAGSVNSRKHKAIDEPVPDE 2085 ++ SAKSRDPRLR +S+ G +P+ V+SRK K+ +EP+ D Sbjct: 562 LRASAKSRDPRLRLASSDAGSLDLNERPLPAVSNSPKVDPLGEIVSSRKQKSAEEPLLDG 621 Query: 2084 HNLKRQRNESTRS---RDAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNL---- 1926 KRQRN T RDAQ + GGW+ED+ V QM NR Q +N + L Sbjct: 622 PVTKRQRNGLTSPATVRDAQTVVASGGWLEDSNTVIPQMMNRNQLIENTGTDPKKLESKV 681 Query: 1925 -VGGEVGPDRRLVXXXXXXXXXNGGSASNMSTSPAPVVSLPSLLKDIAVNPTMLVELLKM 1749 V G +G D+ V G+ + + SL SLLKDIAVNP + + + Sbjct: 682 TVTG-IGCDKPYVTV--------NGNEHLPVVATSTTASLQSLLKDIAVNPAVWMNIFNK 732 Query: 1748 XXXXXXXXXXXXXXAGPAVNGLSSAISPSP-------DVGQNPAAKPQ------MNGPND 1608 P N + + P+ +GQ PA Q MN ++ Sbjct: 733 VEQQKSGDPAKNTVLPPTSNSILGVVPPASVAPLKPSALGQKPAGALQVPQTGPMNPQDE 792 Query: 1607 MGKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGDLPSEVQSSKDLLTVQEQGEQAQTN 1428 GK+RMKPRDPRR+LH+N Q+S S GSE K+N Q+Q +Q +T Sbjct: 793 SGKVRMKPRDPRRILHANSFQRSGSSGSEQFKTNA---------------QKQEDQTETK 837 Query: 1427 TLPSQSASLPDISQQFTKNLQNLADIVSSSQASAL-PVGTQ---------NSSQLIPSKI 1278 ++PS S + PDISQQFTKNL+N+AD++S+SQAS++ P Q N+ ++ Sbjct: 838 SVPSHSVNPPDISQQFTKNLKNIADLMSASQASSMTPTFPQILSSQSVQVNTDRMDVKAT 897 Query: 1277 CNDTTEPKTVTGMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQN 1098 +D+ + T G ++ E+A+G N WGDV+HL DGYDDQQKAAIQ+ERARRIEEQ Sbjct: 898 VSDSGDQLTANG--SKPESAAGPPQSKNTWGDVEHLFDGYDDQQKAAIQRERARRIEEQK 955 Query: 1097 KMFAERKXXXXXXXXXXXLNSAKFVEVDPIHEEILXXXXXXXXXXXXRHLFRFPHMGMWT 918 KMF+ RK LNSAKFVEVDP+H+EIL RHLFRFPHMGMWT Sbjct: 956 KMFSARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKSQRHLFRFPHMGMWT 1015 Query: 917 KLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDS 738 KLRPG+WNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKG LF GRVISKGD+GD D Sbjct: 1016 KLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKGDDGDVLDG 1075 Query: 737 DERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSL 558 DERVPKSKDL+GVLGMESAVVIIDDS+RVWPHNK NLIVVERY YFP 
SRRQFGLPGPSL Sbjct: 1076 DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSL 1135 Query: 557 LEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSR 378 LEIDHDERP+ GTLASSL VIER+HQ FFS+++L+EVDVRNILA+EQRKIL GC+IVFSR Sbjct: 1136 LEIDHDERPEDGTLASSLAVIERIHQSFFSNRALDEVDVRNILASEQRKILAGCRIVFSR 1195 Query: 377 VFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPG 198 VFPVGEANPHLHPLWQTAE FGA CTNQIDE VTHVVANS GTDKVNWALSTGRFVVHPG Sbjct: 1196 VFPVGEANPHLHPLWQTAESFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPG 1255 Query: 197 WVEASALLYRRANEQDFAIK 138 WVEASALLYRRANEQDFAIK Sbjct: 1256 WVEASALLYRRANEQDFAIK 1275 >ref|XP_012088736.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 [Jatropha curcas] gi|643708360|gb|KDP23276.1| hypothetical protein JCGZ_23109 [Jatropha curcas] Length = 1283 Score = 753 bits (1943), Expect = 0.0 Identities = 425/753 (56%), Positives = 492/753 (65%), Gaps = 49/753 (6%) Frame = -2 Query: 2249 SGGSQMAPVKTSAKSRDPRLRFMNSEVGGAPQNG------------FAAGSVNSRKHKAI 2106 S GS + VK SAKSRDPRLRF+NS+ QN + G +N +K K++ Sbjct: 543 SSGSSLT-VKASAKSRDPRLRFVNSDANALDQNHVLPLVNNTPKVEYLGGPMNLKKQKSV 601 Query: 2105 DEPVPDEHNLKRQRNESTRSR---DAQVMPGRGGWIEDNGMVASQMNNRVQ--------- 1962 D+ V D +LKRQRN S + + M GGW+ED MV Q NR Q Sbjct: 602 DDSVLDGPSLKRQRNVLEHSGGVGNVKTMIASGGWLEDTDMVRPQTMNRNQLVENSDPRR 661 Query: 1961 -------PNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXNGGSASNMSTSPAPVVSLPS 1803 P+ + + ++ G E P G TS A SLP Sbjct: 662 MDNGVACPSTVSGISSVSISGNEQKP------VIGTGAITEGEQIQMTGTSEA---SLPD 712 Query: 1802 LLKDIAVNPTMLVELLKMXXXXXXXXXXXXXXAGPAVNG--------------LSSAISP 1665 LLK+IAVNPTML+ LLKM + PA + + + P Sbjct: 713 LLKNIAVNPTMLLNLLKMGQQQRSAIDAQQKPSDPAKTSKHPLNANAILGSVPVVNVVPP 772 Query: 1664 SPDVGQNPAAK---PQMNGPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGDLP 1494 P V PA P ++GKIRMKPRDPRRVLH +QK+ ++G E K+N P Sbjct: 773 QPSVMPRPAGTLQVPPQAAVEELGKIRMKPRDPRRVLHYQTLQKNGNMGYEQFKTNLTSP 832 Query: 1493 SEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQASALP-V 1317 Q +KD VQ+Q QA+T +P QS +PDIS 
FTK+L+N+ADIVS S AS P V Sbjct: 833 PTDQGTKDNQIVQKQDGQAETEPVPLQSLVVPDISLPFTKSLKNIADIVSVSHASTSPTV 892 Query: 1316 GTQNSSQLIPSKICNDTTEPKTVTGMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAA 1137 +QN + I +++ +P + D WGDV+HL +GY DQQKAA Sbjct: 893 VSQNLASQPTRTIVSNSEQPAGIGSAPCVAPVGPRPQDA---WGDVEHLFEGYSDQQKAA 949 Query: 1136 IQKERARRIEEQNKMFAERKXXXXXXXXXXXLNSAKFVEVDPIHEEILXXXXXXXXXXXX 957 IQ+ERARRIEEQ KMFA RK LNSAKFVEVDP+H+EIL Sbjct: 950 IQRERARRIEEQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPY 1009 Query: 956 RHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGR 777 RHLFRFPHMGMWTKLRPG+WNFLEKASKLYELHLYTMGNKLYATEMAKVLDP G LF+GR Sbjct: 1010 RHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGVLFNGR 1069 Query: 776 VISKGDEGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFP 597 VIS+GD+ D FDSDERVPKSKDL+GVLGMESAVVIIDDS+RVWPHNK NLIVVERY+YFP Sbjct: 1070 VISRGDDTDSFDSDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFP 1129 Query: 596 SSRRQFGLPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQ 417 SRRQFGLPGPSLLEIDHDERP+ GTLA SL VIE++HQ FF+H SL++ DVRNILA+EQ Sbjct: 1130 CSRRQFGLPGPSLLEIDHDERPEDGTLACSLAVIEKIHQHFFTHPSLDDADVRNILASEQ 1189 Query: 416 RKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVN 237 RKIL GC+IVFSRVFPVGEANPHLHPLWQTAEQFGA CTNQIDE VTHVVANS GTDKVN Sbjct: 1190 RKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVN 1249 Query: 236 WALSTGRFVVHPGWVEASALLYRRANEQDFAIK 138 WALSTGRFVV+PGWVEASALLYRRANEQDFAIK Sbjct: 1250 WALSTGRFVVYPGWVEASALLYRRANEQDFAIK 1282 >ref|XP_012459418.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 isoform X2 [Gossypium raimondii] Length = 1251 Score = 750 bits (1936), Expect = 0.0 Identities = 434/781 (55%), Positives = 507/781 (64%), Gaps = 52/781 (6%) Frame = -2 Query: 2324 VNSSGGLQMSSVNSS--GAF---QMAPLNTSGGSQMAPVKTSAKSRDPRLRFMNSEVGGA 2160 V+S+ + +S SS G F P+ S S + K SAKSRDPRLRF NS V Sbjct: 477 VSSAPHIDSASSTSSMQGQFTTQNATPVTVSSASNILS-KASAKSRDPRLRFANSNVSAL 535 Query: 2159 
PQNGF----------AAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRS--RDAQVMPGRG 2016 N +G ++ RK K+ +EPV D KRQ+NE RD Q + G G Sbjct: 536 DLNQRPLHNASKVPPVSGIMDPRKKKSTEEPVLDGPAPKRQKNELENFGVRDVQAVSGNG 595 Query: 2015 GWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXNGGSASNMS 1836 GW+ED SQ+ NR Q + + +R + G + MS Sbjct: 596 GWLEDTDNCESQITNRNQTMETLDSNSRKMEHGVTCSSTLSGKTNTTVNKNEQVPLTGMS 655 Query: 1835 TSPAPVVSLPSLLKDIAVNPTMLVELLKMXXXXXXXXXXXXXXAGPAVNGLSSAIS---- 1668 SLP+LLKDIAVNPTML+ +LKM P N L S Sbjct: 656 NP-----SLPALLKDIAVNPTMLINILKMGQQQRLPSESQQKTPDPLKNTLYQPSSNPVL 710 Query: 1667 ---------PSPDVGQNPAA------KPQMN--GP--NDMGKIRMKPRDPRRVLHSNMVQ 1545 PSP V P++ KP N GP ++ KIRMKPRDPRRVLH N++Q Sbjct: 711 GVIPPANVIPSPSVNVVPSSSSGTLSKPAGNLQGPPLDESCKIRMKPRDPRRVLHGNVLQ 770 Query: 1544 KSESLGSELAKSNGDLP-SEVQSSKDLLTVQEQGE-QAQTNTLPSQSASLPDISQQFTKN 1371 KS S+G + K+NG P S Q SKD + Q+Q E Q + + Q PDI+QQFT++ Sbjct: 771 KSGSVGPDQLKTNGTSPASSTQGSKDNMNAQKQLENQIEAKPIQCQFVPPPDIAQQFTQS 830 Query: 1370 LQNLADIVSSSQASA-LPVGTQNSSQLIPSKICNDTTEPKTVTGMCTQGETASGVIDLA- 1197 L+N+A ++S Q+ A LP +QN P ++ ++T + T +T +G A Sbjct: 831 LKNIAGMMSGPQSFAGLPAVSQNLVSQ-PIQVKSETADKNTKGSNSEDQQTGTGTAPEAG 889 Query: 1196 --------NPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXL 1041 N WGDV+HL + YDD+QKAAIQ+ERARRIEEQ KMFA RK L Sbjct: 890 VTCPPPSQNAWGDVEHLFEKYDDRQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLL 949 Query: 1040 NSAKFVEVDPIHEEILXXXXXXXXXXXXRHLFRFPHMGMWTKLRPGVWNFLEKASKLYEL 861 NSAKF+EVDP+HEEIL RHLFRF HMGMWTKLRPG+WNFLEKASKLYEL Sbjct: 950 NSAKFIEVDPVHEEILRKKEEQDREKPQRHLFRFHHMGMWTKLRPGIWNFLEKASKLYEL 1009 Query: 860 HLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESA 681 HLYTMGNKLYATEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMES+ Sbjct: 1010 HLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESS 1069 Query: 680 VVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLG 501 VVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDERP+ GTLASSL Sbjct: 1070 VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLA 1129 Query: 500 
VIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAE 321 VIER+HQ FFSHQ+L+++DVRNILA EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAE Sbjct: 1130 VIERIHQNFFSHQNLDDLDVRNILATEQRKILSGCRIVFSRVFPVGEANPHLHPLWQTAE 1189 Query: 320 QFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 141 QFGA CTNQIDEHVTHVVANS GTDKVNWALSTG+FVVHPGWVEASALLYRRANE DFAI Sbjct: 1190 QFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEHDFAI 1249 Query: 140 K 138 K Sbjct: 1250 K 1250 >gb|KJB77193.1| hypothetical protein B456_012G125200 [Gossypium raimondii] Length = 982 Score = 750 bits (1936), Expect = 0.0 Identities = 434/781 (55%), Positives = 507/781 (64%), Gaps = 52/781 (6%) Frame = -2 Query: 2324 VNSSGGLQMSSVNSS--GAF---QMAPLNTSGGSQMAPVKTSAKSRDPRLRFMNSEVGGA 2160 V+S+ + +S SS G F P+ S S + K SAKSRDPRLRF NS V Sbjct: 208 VSSAPHIDSASSTSSMQGQFTTQNATPVTVSSASNILS-KASAKSRDPRLRFANSNVSAL 266 Query: 2159 PQNGF----------AAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRS--RDAQVMPGRG 2016 N +G ++ RK K+ +EPV D KRQ+NE RD Q + G G Sbjct: 267 DLNQRPLHNASKVPPVSGIMDPRKKKSTEEPVLDGPAPKRQKNELENFGVRDVQAVSGNG 326 Query: 2015 GWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXNGGSASNMS 1836 GW+ED SQ+ NR Q + + +R + G + MS Sbjct: 327 GWLEDTDNCESQITNRNQTMETLDSNSRKMEHGVTCSSTLSGKTNTTVNKNEQVPLTGMS 386 Query: 1835 TSPAPVVSLPSLLKDIAVNPTMLVELLKMXXXXXXXXXXXXXXAGPAVNGLSSAIS---- 1668 SLP+LLKDIAVNPTML+ +LKM P N L S Sbjct: 387 NP-----SLPALLKDIAVNPTMLINILKMGQQQRLPSESQQKTPDPLKNTLYQPSSNPVL 441 Query: 1667 ---------PSPDVGQNPAA------KPQMN--GP--NDMGKIRMKPRDPRRVLHSNMVQ 1545 PSP V P++ KP N GP ++ KIRMKPRDPRRVLH N++Q Sbjct: 442 GVIPPANVIPSPSVNVVPSSSSGTLSKPAGNLQGPPLDESCKIRMKPRDPRRVLHGNVLQ 501 Query: 1544 KSESLGSELAKSNGDLP-SEVQSSKDLLTVQEQGE-QAQTNTLPSQSASLPDISQQFTKN 1371 KS S+G + K+NG P S Q SKD + Q+Q E Q + + Q PDI+QQFT++ Sbjct: 502 KSGSVGPDQLKTNGTSPASSTQGSKDNMNAQKQLENQIEAKPIQCQFVPPPDIAQQFTQS 561 Query: 1370 LQNLADIVSSSQASA-LPVGTQNSSQLIPSKICNDTTEPKTVTGMCTQGETASGVIDLA- 1197 L+N+A ++S Q+ A LP +QN P ++ ++T + T +T +G A Sbjct: 562 
LKNIAGMMSGPQSFAGLPAVSQNLVSQ-PIQVKSETADKNTKGSNSEDQQTGTGTAPEAG 620 Query: 1196 --------NPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXL 1041 N WGDV+HL + YDD+QKAAIQ+ERARRIEEQ KMFA RK L Sbjct: 621 VTCPPPSQNAWGDVEHLFEKYDDRQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLL 680 Query: 1040 NSAKFVEVDPIHEEILXXXXXXXXXXXXRHLFRFPHMGMWTKLRPGVWNFLEKASKLYEL 861 NSAKF+EVDP+HEEIL RHLFRF HMGMWTKLRPG+WNFLEKASKLYEL Sbjct: 681 NSAKFIEVDPVHEEILRKKEEQDREKPQRHLFRFHHMGMWTKLRPGIWNFLEKASKLYEL 740 Query: 860 HLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESA 681 HLYTMGNKLYATEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMES+ Sbjct: 741 HLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESS 800 Query: 680 VVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLG 501 VVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDERP+ GTLASSL Sbjct: 801 VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLA 860 Query: 500 VIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAE 321 VIER+HQ FFSHQ+L+++DVRNILA EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAE Sbjct: 861 VIERIHQNFFSHQNLDDLDVRNILATEQRKILSGCRIVFSRVFPVGEANPHLHPLWQTAE 920 Query: 320 QFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 141 QFGA CTNQIDEHVTHVVANS GTDKVNWALSTG+FVVHPGWVEASALLYRRANE DFAI Sbjct: 921 QFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEHDFAI 980 Query: 140 K 138 K Sbjct: 981 K 981 >gb|KJB77192.1| hypothetical protein B456_012G125200 [Gossypium raimondii] Length = 1033 Score = 750 bits (1936), Expect = 0.0 Identities = 434/781 (55%), Positives = 507/781 (64%), Gaps = 52/781 (6%) Frame = -2 Query: 2324 VNSSGGLQMSSVNSS--GAF---QMAPLNTSGGSQMAPVKTSAKSRDPRLRFMNSEVGGA 2160 V+S+ + +S SS G F P+ S S + K SAKSRDPRLRF NS V Sbjct: 259 VSSAPHIDSASSTSSMQGQFTTQNATPVTVSSASNILS-KASAKSRDPRLRFANSNVSAL 317 Query: 2159 PQNGF----------AAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRS--RDAQVMPGRG 2016 N +G ++ RK K+ +EPV D KRQ+NE RD Q + G G Sbjct: 318 DLNQRPLHNASKVPPVSGIMDPRKKKSTEEPVLDGPAPKRQKNELENFGVRDVQAVSGNG 377 Query: 2015 
GWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXNGGSASNMS 1836 GW+ED SQ+ NR Q + + +R + G + MS Sbjct: 378 GWLEDTDNCESQITNRNQTMETLDSNSRKMEHGVTCSSTLSGKTNTTVNKNEQVPLTGMS 437 Query: 1835 TSPAPVVSLPSLLKDIAVNPTMLVELLKMXXXXXXXXXXXXXXAGPAVNGLSSAIS---- 1668 SLP+LLKDIAVNPTML+ +LKM P N L S Sbjct: 438 NP-----SLPALLKDIAVNPTMLINILKMGQQQRLPSESQQKTPDPLKNTLYQPSSNPVL 492 Query: 1667 ---------PSPDVGQNPAA------KPQMN--GP--NDMGKIRMKPRDPRRVLHSNMVQ 1545 PSP V P++ KP N GP ++ KIRMKPRDPRRVLH N++Q Sbjct: 493 GVIPPANVIPSPSVNVVPSSSSGTLSKPAGNLQGPPLDESCKIRMKPRDPRRVLHGNVLQ 552 Query: 1544 KSESLGSELAKSNGDLP-SEVQSSKDLLTVQEQGE-QAQTNTLPSQSASLPDISQQFTKN 1371 KS S+G + K+NG P S Q SKD + Q+Q E Q + + Q PDI+QQFT++ Sbjct: 553 KSGSVGPDQLKTNGTSPASSTQGSKDNMNAQKQLENQIEAKPIQCQFVPPPDIAQQFTQS 612 Query: 1370 LQNLADIVSSSQASA-LPVGTQNSSQLIPSKICNDTTEPKTVTGMCTQGETASGVIDLA- 1197 L+N+A ++S Q+ A LP +QN P ++ ++T + T +T +G A Sbjct: 613 LKNIAGMMSGPQSFAGLPAVSQNLVSQ-PIQVKSETADKNTKGSNSEDQQTGTGTAPEAG 671 Query: 1196 --------NPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXL 1041 N WGDV+HL + YDD+QKAAIQ+ERARRIEEQ KMFA RK L Sbjct: 672 VTCPPPSQNAWGDVEHLFEKYDDRQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLL 731 Query: 1040 NSAKFVEVDPIHEEILXXXXXXXXXXXXRHLFRFPHMGMWTKLRPGVWNFLEKASKLYEL 861 NSAKF+EVDP+HEEIL RHLFRF HMGMWTKLRPG+WNFLEKASKLYEL Sbjct: 732 NSAKFIEVDPVHEEILRKKEEQDREKPQRHLFRFHHMGMWTKLRPGIWNFLEKASKLYEL 791 Query: 860 HLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESA 681 HLYTMGNKLYATEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMES+ Sbjct: 792 HLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESS 851 Query: 680 VVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLG 501 VVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDERP+ GTLASSL Sbjct: 852 VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLA 911 Query: 500 VIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAE 321 VIER+HQ FFSHQ+L+++DVRNILA EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAE Sbjct: 912 VIERIHQNFFSHQNLDDLDVRNILATEQRKILSGCRIVFSRVFPVGEANPHLHPLWQTAE 971 Query: 
320 QFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 141 QFGA CTNQIDEHVTHVVANS GTDKVNWALSTG+FVVHPGWVEASALLYRRANE DFAI Sbjct: 972 QFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEHDFAI 1031 Query: 140 K 138 K Sbjct: 1032 K 1032 >ref|XP_012459417.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 isoform X1 [Gossypium raimondii] gi|763810289|gb|KJB77191.1| hypothetical protein B456_012G125200 [Gossypium raimondii] Length = 1272 Score = 750 bits (1936), Expect = 0.0 Identities = 434/781 (55%), Positives = 507/781 (64%), Gaps = 52/781 (6%) Frame = -2 Query: 2324 VNSSGGLQMSSVNSS--GAF---QMAPLNTSGGSQMAPVKTSAKSRDPRLRFMNSEVGGA 2160 V+S+ + +S SS G F P+ S S + K SAKSRDPRLRF NS V Sbjct: 498 VSSAPHIDSASSTSSMQGQFTTQNATPVTVSSASNILS-KASAKSRDPRLRFANSNVSAL 556 Query: 2159 PQNGF----------AAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRS--RDAQVMPGRG 2016 N +G ++ RK K+ +EPV D KRQ+NE RD Q + G G Sbjct: 557 DLNQRPLHNASKVPPVSGIMDPRKKKSTEEPVLDGPAPKRQKNELENFGVRDVQAVSGNG 616 Query: 2015 GWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXNGGSASNMS 1836 GW+ED SQ+ NR Q + + +R + G + MS Sbjct: 617 GWLEDTDNCESQITNRNQTMETLDSNSRKMEHGVTCSSTLSGKTNTTVNKNEQVPLTGMS 676 Query: 1835 TSPAPVVSLPSLLKDIAVNPTMLVELLKMXXXXXXXXXXXXXXAGPAVNGLSSAIS---- 1668 SLP+LLKDIAVNPTML+ +LKM P N L S Sbjct: 677 NP-----SLPALLKDIAVNPTMLINILKMGQQQRLPSESQQKTPDPLKNTLYQPSSNPVL 731 Query: 1667 ---------PSPDVGQNPAA------KPQMN--GP--NDMGKIRMKPRDPRRVLHSNMVQ 1545 PSP V P++ KP N GP ++ KIRMKPRDPRRVLH N++Q Sbjct: 732 GVIPPANVIPSPSVNVVPSSSSGTLSKPAGNLQGPPLDESCKIRMKPRDPRRVLHGNVLQ 791 Query: 1544 KSESLGSELAKSNGDLP-SEVQSSKDLLTVQEQGE-QAQTNTLPSQSASLPDISQQFTKN 1371 KS S+G + K+NG P S Q SKD + Q+Q E Q + + Q PDI+QQFT++ Sbjct: 792 KSGSVGPDQLKTNGTSPASSTQGSKDNMNAQKQLENQIEAKPIQCQFVPPPDIAQQFTQS 851 Query: 1370 LQNLADIVSSSQASA-LPVGTQNSSQLIPSKICNDTTEPKTVTGMCTQGETASGVIDLA- 1197 L+N+A ++S Q+ A LP +QN P ++ ++T + T +T +G A Sbjct: 852 LKNIAGMMSGPQSFAGLPAVSQNLVSQ-PIQVKSETADKNTKGSNSEDQQTGTGTAPEAG 910 Query: 1196 
--------NPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXL 1041 N WGDV+HL + YDD+QKAAIQ+ERARRIEEQ KMFA RK L Sbjct: 911 VTCPPPSQNAWGDVEHLFEKYDDRQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLL 970 Query: 1040 NSAKFVEVDPIHEEILXXXXXXXXXXXXRHLFRFPHMGMWTKLRPGVWNFLEKASKLYEL 861 NSAKF+EVDP+HEEIL RHLFRF HMGMWTKLRPG+WNFLEKASKLYEL Sbjct: 971 NSAKFIEVDPVHEEILRKKEEQDREKPQRHLFRFHHMGMWTKLRPGIWNFLEKASKLYEL 1030 Query: 860 HLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESA 681 HLYTMGNKLYATEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMES+ Sbjct: 1031 HLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESS 1090 Query: 680 VVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLG 501 VVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDERP+ GTLASSL Sbjct: 1091 VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLA 1150 Query: 500 VIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAE 321 VIER+HQ FFSHQ+L+++DVRNILA EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAE Sbjct: 1151 VIERIHQNFFSHQNLDDLDVRNILATEQRKILSGCRIVFSRVFPVGEANPHLHPLWQTAE 1210 Query: 320 QFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 141 QFGA CTNQIDEHVTHVVANS GTDKVNWALSTG+FVVHPGWVEASALLYRRANE DFAI Sbjct: 1211 QFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEHDFAI 1270 Query: 140 K 138 K Sbjct: 1271 K 1271 >ref|XP_010656784.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 isoform X1 [Vitis vinifera] Length = 1285 Score = 750 bits (1936), Expect = 0.0 Identities = 421/749 (56%), Positives = 496/749 (66%), Gaps = 53/749 (7%) Frame = -2 Query: 2225 VKTSAKSRDPRLRFMNSEVGG-------------APQNGFAAGSVNSRKHKAIDEPVPDE 2085 ++ SAKSRDPRLR +S+ G +P+ V+SRK K+ +EP+ D Sbjct: 562 LRASAKSRDPRLRLASSDAGSLDLNERPLPAVSNSPKVDPLGEIVSSRKQKSAEEPLLDG 621 Query: 2084 HNLKRQRNESTRS---RDAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNL---- 1926 KRQRN T RDAQ + GGW+ED+ V QM NR Q +N + L Sbjct: 622 PVTKRQRNGLTSPATVRDAQTVVASGGWLEDSNTVIPQMMNRNQLIENTGTDPKKLESKV 681 Query: 1925 -VGGEVGPDRRLVXXXXXXXXXNGGSASNMSTSPAPVVSLPSLLKDIAVNPTMLVELLKM 
1749 V G +G D+ V G+ + + SL SLLKDIAVNP + + + Sbjct: 682 TVTG-IGCDKPYVTV--------NGNEHLPVVATSTTASLQSLLKDIAVNPAVWMNIFNK 732 Query: 1748 XXXXXXXXXXXXXXAGPAVNGLSSAISPSP-------DVGQNPAAKPQM----------- 1623 P N + + P+ +GQ PA Q+ Sbjct: 733 VEQQKSGDPAKNTVLPPTSNSILGVVPPASVAPLKPSALGQKPAGALQVPQTGPMLVTSC 792 Query: 1622 ----NGPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGDLPSEVQSSKDLLTVQ 1455 N ++ GK+RMKPRDPRR+LH+N Q+S S GSE K+N Q Sbjct: 793 NNAQNPQDESGKVRMKPRDPRRILHANSFQRSGSSGSEQFKTNA---------------Q 837 Query: 1454 EQGEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQASAL-PVGTQ---------N 1305 +Q +Q +T ++PS S + PDISQQFTKNL+N+AD++S+SQAS++ P Q N Sbjct: 838 KQEDQTETKSVPSHSVNPPDISQQFTKNLKNIADLMSASQASSMTPTFPQILSSQSVQVN 897 Query: 1304 SSQLIPSKICNDTTEPKTVTGMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKE 1125 + ++ +D+ + T G ++ E+A+G N WGDV+HL DGYDDQQKAAIQ+E Sbjct: 898 TDRMDVKATVSDSGDQLTANG--SKPESAAGPPQSKNTWGDVEHLFDGYDDQQKAAIQRE 955 Query: 1124 RARRIEEQNKMFAERKXXXXXXXXXXXLNSAKFVEVDPIHEEILXXXXXXXXXXXXRHLF 945 RARRIEEQ KMF+ RK LNSAKFVEVDP+H+EIL RHLF Sbjct: 956 RARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKSQRHLF 1015 Query: 944 RFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISK 765 RFPHMGMWTKLRPG+WNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKG LF GRVISK Sbjct: 1016 RFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISK 1075 Query: 764 GDEGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRR 585 GD+GD D DERVPKSKDL+GVLGMESAVVIIDDS+RVWPHNK NLIVVERY YFP SRR Sbjct: 1076 GDDGDVLDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRR 1135 Query: 584 QFGLPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKIL 405 QFGLPGPSLLEIDHDERP+ GTLASSL VIER+HQ FFS+++L+EVDVRNILA+EQRKIL Sbjct: 1136 QFGLPGPSLLEIDHDERPEDGTLASSLAVIERIHQSFFSNRALDEVDVRNILASEQRKIL 1195 Query: 404 GGCKIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALS 225 GC+IVFSRVFPVGEANPHLHPLWQTAE FGA CTNQIDE VTHVVANS GTDKVNWALS Sbjct: 1196 AGCRIVFSRVFPVGEANPHLHPLWQTAESFGAVCTNQIDEQVTHVVANSLGTDKVNWALS 1255 Query: 224 TGRFVVHPGWVEASALLYRRANEQDFAIK 138 
TGRFVVHPGWVEASALLYRRANEQDFAIK Sbjct: 1256 TGRFVVHPGWVEASALLYRRANEQDFAIK 1284 >ref|XP_006662962.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain phosphatase-like 3-like [Oryza brachyantha] Length = 1267 Score = 750 bits (1936), Expect = 0.0 Identities = 425/758 (56%), Positives = 500/758 (65%), Gaps = 54/758 (7%) Frame = -2 Query: 2249 SGGSQMAPVKTSAKSRDPRLRFMNSEVGGA-------------PQNGFAAG---SVNSRK 2118 SG + M +K +AKSRDPRLRF+N + G P G +NSRK Sbjct: 520 SGSNHM--LKATAKSRDPRLRFLNRDAGVVADANRRLNFAEPNPSKDRTMGVGVPINSRK 577 Query: 2117 HKAIDEPVPDEHNLKRQRNESTRSRDAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVG 1938 HK +DEP+ DE+ LKR R + RD GRGGW +D V+S ++ QPN+N +G Sbjct: 578 HKTVDEPLVDENMLKRSRGGNGNPRDVLTPAGRGGWAKDGVNVSSYSSDGFQPNQNTRLG 637 Query: 1937 NRNLVGGEVGPDRRLVXXXXXXXXXNG---------GSASNMSTSPAPVVSLPSLLKDIA 1785 N V D LV +G + S+ TS AP VSLP++LKDIA Sbjct: 638 NSTTGSHNVRTDSTLVSNTNNMTNSSGINTGVVQAPQTNSSPQTSSAPSVSLPAMLKDIA 697 Query: 1784 VNPTMLVELLKMXXXXXXXXXXXXXXAGPA----------VNGLSSA-----ISPSPDV- 1653 VNPTML++ ++M V LS A +P P V Sbjct: 698 VNPTMLMQWIQMEQQKMSATEPLQKVTASVGMTSNETAGMVLPLSCASKTTEAAPVPSVR 757 Query: 1652 GQNPAAKPQMNGPNDMGKIRMKPRDPRRVLHSNMVQKSES---LGSELAKSNGDLPSEVQ 1482 Q P ++ ND G IRMKPRDPRR+LHSN+ QK+++ +G E AK NG + Q Sbjct: 758 SQVPMQTAAVHSQNDAGVIRMKPRDPRRILHSNIAQKNDTVPPVGVEQAKINGTALPDSQ 817 Query: 1481 SSKD-LLTVQEQGEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQASALPVGTQN 1305 SKD LL ++Q EQ QT+ LPSQ + ++Q T N A+ VS+SQ +A + Sbjct: 818 GSKDHLLNHEQQAEQLQTSALPSQPVT--PSARQVTMN----ANPVSNSQLAATALMPHG 871 Query: 1304 SSQLIPSKICNDTTEPKTVTGM---------CTQGETASGVIDLANPWGDVDHLLDGYDD 1152 S+Q S + + +P+ G T TA + A+PWGDVDHLLDGYDD Sbjct: 872 STQQTSSSV--NKADPRLTAGQNETNDDAVTSTGPLTAPDAVLPASPWGDVDHLLDGYDD 929 Query: 1151 QQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXLNSAKFVEVDPIHEEILXXXXXXX 972 QQKA IQKERARRI EQ KMFA +K LNSAKF EV+PIHEEIL Sbjct: 930 QQKALIQKERARRIMEQQKMFAAQKLCLVLDLDHTLLNSAKFAEVEPIHEEILRKKEEQD 989 Query: 971 XXXXXRHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGT 792 RHLF F 
HMGMWTKLRPG+WNFLEKASKLYELHLYTMGNK+YATEMA+VLDP GT Sbjct: 990 RERADRHLFCFHHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKIYATEMARVLDPTGT 1049 Query: 791 LFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVER 612 LF GRVIS+GD+GD DSDERVPKSKDLDGVLGMESAVVIIDDS+RVWPHNKHNLIVVER Sbjct: 1050 LFAGRVISRGDDGDTLDSDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKHNLIVVER 1109 Query: 611 YMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNI 432 Y YFP SRRQFGLPGPSLLEID DERP+ GTLASSL VIER+HQ FF+H +LN+ DVR+I Sbjct: 1110 YTYFPCSRRQFGLPGPSLLEIDRDERPEDGTLASSLAVIERIHQNFFTHPNLNDADVRSI 1169 Query: 431 LAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFG 252 LA+EQ++ILGGC+IVFSR+FPVGEANPH+HPLWQTAEQFGA CTNQID+ VTHVVANS G Sbjct: 1170 LASEQQRILGGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQIDDRVTHVVANSLG 1229 Query: 251 TDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 138 TDKVNWALSTGRFVVHPGWVEASALLYRRA+E DFA+K Sbjct: 1230 TDKVNWALSTGRFVVHPGWVEASALLYRRASELDFAVK 1267 >gb|KDO83166.1| hypothetical protein CISIN_1g000897mg [Citrus sinensis] Length = 1218 Score = 748 bits (1932), Expect = 0.0 Identities = 425/750 (56%), Positives = 503/750 (67%), Gaps = 27/750 (3%) Frame = -2 Query: 2306 LQMSSVNS-SGAFQMAPLNTSGGSQMAP---VKTSAKSRDPRLRFMNSE----------- 2172 + +SSV + + A AP ++ + P VK KSRDPRLRF +S Sbjct: 489 MDISSVQALTTANNSAPASSGYNPVVKPNPVVKAPIKSRDPRLRFASSNALNLNHQPAPI 548 Query: 2171 VGGAPQNGFAAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRS---RDAQVMPGRGGWIED 2001 + AP+ ++SRK K ++EPV D LKRQRN S RD + + G GGW+ED Sbjct: 549 LHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKRQRNGFENSGVVRDEKNIYGSGGWLED 608 Query: 2000 NGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXNGGSASNMSTSPAP 1821 M Q+ NR + +R L G P G+ +T+P+ Sbjct: 609 TDMFEPQIMNRNLLVDSAESNSRKLDNGATSP-----ITSGTPNVVVSGNEPAPATTPST 663 Query: 1820 VVSLPSLLKDIAVNPTMLVELLKMXXXXXXXXXXXXXXAGPAVNGLSSAISPSPDVGQNP 1641 VSLP+LLKDIAVNPTML+ +LKM ++N + I P Sbjct: 664 TVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQKSNDSSMNTMHPPI---------P 714 Query: 1640 AAKPQMNGPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGDLPSEVQSSKDLLT 1461 ++ P P+++GK+RMKPRDPRRVLH N 
+Q+S SLG E K++G Q SK+ L Sbjct: 715 SSIP----PDELGKVRMKPRDPRRVLHGNALQRSGSLGPEF-KTDGPSAPCTQGSKENLN 769 Query: 1460 VQEQGEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQA-SALPVGTQNSSQLIPS 1284 Q+Q + + SQS PDI+QQFTKNL+++AD +S SQ ++ P+ +QNS + P Sbjct: 770 FQKQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSP-IQPG 828 Query: 1283 KICNDTTEPKTVTGMCTQGETASGVIDLANP--------WGDVDHLLDGYDDQQKAAIQK 1128 +I + K V +T +G A P WGDV+HL +GYDDQQKAAIQK Sbjct: 829 QI-KSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQK 887 Query: 1127 ERARRIEEQNKMFAERKXXXXXXXXXXXLNSAKFVEVDPIHEEILXXXXXXXXXXXXRHL 948 ER RR+EEQ KMF+ RK LNSAKF EVDP+H+EIL RHL Sbjct: 888 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 947 Query: 947 FRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVIS 768 FRFPHMGMWTKLRPG+W FLE+ASKL+E+HLYTMGNKLYATEMAKVLDPKG LF GRVIS Sbjct: 948 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1007 Query: 767 KGDEGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSR 588 +GD+GDPFD DERVPKSKDL+GVLGMESAVVIIDDS+RVWPHNK NLIVVERY YFP SR Sbjct: 1008 RGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSR 1067 Query: 587 RQFGLPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKI 408 RQFGL GPSLLEIDHDER + GTLASSLGVIERLH+IFFSHQSL++VDVRNILAAEQRKI Sbjct: 1068 RQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKI 1127 Query: 407 LGGCKIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWAL 228 L GC+IVFSRVFPVGEANPHLHPLWQTAEQFGA CT ID+ VTHVVANS GTDKVNWAL Sbjct: 1128 LAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWAL 1187 Query: 227 STGRFVVHPGWVEASALLYRRANEQDFAIK 138 STGRFVVHPGWVEASALLYRRANEQDFAIK Sbjct: 1188 STGRFVVHPGWVEASALLYRRANEQDFAIK 1217