BLASTX nr result
ID: Ophiopogon23_contig00022081
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Ophiopogon23_contig00022081
         (2279 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

gb|ONK58600.1| uncharacterized protein A4U43_C09F14730 [Asparagu...     863   0.0
ref|XP_020246902.1| RNA polymerase II C-terminal domain phosphat...     863   0.0
ref|XP_010929653.1| PREDICTED: RNA polymerase II C-terminal doma...     810   0.0
ref|XP_008791049.1| PREDICTED: RNA polymerase II C-terminal doma...     789   0.0
gb|OVA17386.1| BRCT domain [Macleaya cordata]                           781   0.0
gb|EOX99661.1| RNA polymerase II C-terminal domain phosphatase-l...     768   0.0
ref|XP_010249185.1| PREDICTED: RNA polymerase II C-terminal doma...     767   0.0
ref|XP_007043830.2| PREDICTED: RNA polymerase II C-terminal doma...     766   0.0
ref|XP_021275229.1| RNA polymerase II C-terminal domain phosphat...     766   0.0
gb|EEF50102.1| RNA polymerase II ctd phosphatase, putative [Rici...     754   0.0
ref|XP_021662955.1| LOW QUALITY PROTEIN: RNA polymerase II C-ter...     752   0.0
gb|KJB77193.1| hypothetical protein B456_012G125200 [Gossypium r...     739   0.0
ref|XP_022737741.1| RNA polymerase II C-terminal domain phosphat...     748   0.0
gb|KJB77192.1| hypothetical protein B456_012G125200 [Gossypium r...     739   0.0
ref|XP_010656786.1| PREDICTED: RNA polymerase II C-terminal doma...     744   0.0
ref|XP_010656789.1| PREDICTED: RNA polymerase II C-terminal doma...     743   0.0
dbj|GAV71470.1| BRCT domain-containing protein/NIF domain-contai...     742   0.0
ref|XP_012459418.1| PREDICTED: RNA polymerase II C-terminal doma...
739 0.0 gb|PON91807.1| FCP1-like phosphatase [Trema orientalis] 739 0.0 ref|XP_012459417.1| PREDICTED: RNA polymerase II C-terminal doma... 739 0.0 >gb|ONK58600.1| uncharacterized protein A4U43_C09F14730 [Asparagus officinalis] Length = 1100 Score = 863 bits (2229), Expect = 0.0 Identities = 473/740 (63%), Positives = 520/740 (70%), Gaps = 8/740 (1%) Frame = +1 Query: 22 MSSVNSSGGLQMSSVNSSGAFQMAPLNTSG--GSQMDP-VKTSAKSRDPRLRFMNSEVGG 192 M+ VNSS G QM V SS QMA + T G GS+ +P VK SAKSRDPRLRFM SE Sbjct: 256 MAPVNSSDGFQMQPVYSSAGPQMAQVRTIGQVGSEANPAVKASAKSRDPRLRFMKSETSV 315 Query: 193 APQNGFAAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRSRDAQVMPGRGGWIEDNGMVAS 372 P+NGFAAG VNS KHK+ DE V DEH+LKRQR +S SRD +V + G +D Sbjct: 316 VPKNGFAAGPVNSLKHKSDDELVLDEHSLKRQRKDSMSSRDVKVTGSQSGATKD------ 369 Query: 373 QMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXXXXSASNMXXXXXXXXXXXX 552 RNLVG EVG + L S S Sbjct: 370 ----------------RNLVGTEVGYEMGLDADNNKLTVSSVPSTS--ISTTGPIVSLPS 411 Query: 553 XXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXGPAVNGLSSAISPSPDVGQNPAAKSQM 732 K IA NP MLV+LL+M G AVNGLSSA S S +GQNP+ KSQM Sbjct: 412 LLKGIAANPQMLVQLLRMEQQKIAAGQAQEKPDGQAVNGLSSATSSSSGIGQNPSVKSQM 471 Query: 733 -----NGPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGNLPSEVQSSKDLLTV 897 + N+M KIRMKPRDPRRVLHSNM Q++E+ GS S V S KD L Sbjct: 472 PPQTKSTNNNMAKIRMKPRDPRRVLHSNMAQRTENSGSAT--------SNVHSIKDQLLH 523 Query: 898 QEQGEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQASALPVGTQNSSQLIPSKI 1077 ++QG+QAQ +P QSASL DISQQFTKNLQNLAD+VS SQ SK Sbjct: 524 RKQGDQAQKLAVPLQSASLTDISQQFTKNLQNLADMVSKSQ----------------SKT 567 Query: 1078 SNDTTEPKTVTEMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQN 1257 S+D T+PK V E+C+Q +T ANPWGDVDHLLDGYDD+QKAAIQKERARRIEEQN Sbjct: 568 SDDLTKPKNVPELCSQTDTKPA----ANPWGDVDHLLDGYDDKQKAAIQKERARRIEEQN 623 Query: 1258 KMFAERKXXXXXXXXXXXXNSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWT 1437 KMFA RK NSAKFVE+DPIHEEIL HLFR HMGMWT Sbjct: 624 KMFAARKLCLVLDLDHTLLNSAKFVEIDPIHEEILRKKEEQDRQTQERHLFRLQHMGMWT 683 Query: 1438 
KLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDS 1617 KLRPG+W FLEKAS+LYELH+YTMGNKLYATEMAK+LDPKG LF GRV+S+GD+GDPFD Sbjct: 684 KLRPGIWTFLEKASQLYELHVYTMGNKLYATEMAKLLDPKGNLFAGRVLSRGDDGDPFDG 743 Query: 1618 DERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSL 1797 D+RVPKSKDLDGVLGMESAV+IIDDSLRVWPHNKHNLIVVERY YFPSSRRQFGL GPSL Sbjct: 744 DDRVPKSKDLDGVLGMESAVLIIDDSLRVWPHNKHNLIVVERYTYFPSSRRQFGLIGPSL 803 Query: 1798 LEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSR 1977 LEIDHDERP+ GTLASSL VIER+H+IFFSH L EVDVRNIL AEQRKIL GCKIVFSR Sbjct: 804 LEIDHDERPEDGTLASSLAVIERIHEIFFSHSCLTEVDVRNILGAEQRKILAGCKIVFSR 863 Query: 1978 VFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPG 2157 +FPVGEANPHLHPLWQTAEQFGAECTNQIDE VTHVVANS GTDKVNWALSTGRFVVHPG Sbjct: 864 IFPVGEANPHLHPLWQTAEQFGAECTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPG 923 Query: 2158 WVEASALLYRRANEQDFAIK 2217 WVEASALLYRRANEQDFA+K Sbjct: 924 WVEASALLYRRANEQDFAVK 943 >ref|XP_020246902.1| RNA polymerase II C-terminal domain phosphatase-like 3 isoform X1 [Asparagus officinalis] ref|XP_020246903.1| RNA polymerase II C-terminal domain phosphatase-like 3 isoform X2 [Asparagus officinalis] Length = 1127 Score = 863 bits (2229), Expect = 0.0 Identities = 473/740 (63%), Positives = 520/740 (70%), Gaps = 8/740 (1%) Frame = +1 Query: 22 MSSVNSSGGLQMSSVNSSGAFQMAPLNTSG--GSQMDP-VKTSAKSRDPRLRFMNSEVGG 192 M+ VNSS G QM V SS QMA + T G GS+ +P VK SAKSRDPRLRFM SE Sbjct: 439 MAPVNSSDGFQMQPVYSSAGPQMAQVRTIGQVGSEANPAVKASAKSRDPRLRFMKSETSV 498 Query: 193 APQNGFAAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRSRDAQVMPGRGGWIEDNGMVAS 372 P+NGFAAG VNS KHK+ DE V DEH+LKRQR +S SRD +V + G +D Sbjct: 499 VPKNGFAAGPVNSLKHKSDDELVLDEHSLKRQRKDSMSSRDVKVTGSQSGATKD------ 552 Query: 373 QMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXXXXSASNMXXXXXXXXXXXX 552 RNLVG EVG + L S S Sbjct: 553 ----------------RNLVGTEVGYEMGLDADNNKLTVSSVPSTS--ISTTGPIVSLPS 594 Query: 553 XXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXGPAVNGLSSAISPSPDVGQNPAAKSQM 732 K IA NP MLV+LL+M G AVNGLSSA S S +GQNP+ KSQM 
Sbjct: 595 LLKGIAANPQMLVQLLRMEQQKIAAGQAQEKPDGQAVNGLSSATSSSSGIGQNPSVKSQM 654 Query: 733 -----NGPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGNLPSEVQSSKDLLTV 897 + N+M KIRMKPRDPRRVLHSNM Q++E+ GS S V S KD L Sbjct: 655 PPQTKSTNNNMAKIRMKPRDPRRVLHSNMAQRTENSGSAT--------SNVHSIKDQLLH 706 Query: 898 QEQGEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQASALPVGTQNSSQLIPSKI 1077 ++QG+QAQ +P QSASL DISQQFTKNLQNLAD+VS SQ SK Sbjct: 707 RKQGDQAQKLAVPLQSASLTDISQQFTKNLQNLADMVSKSQ----------------SKT 750 Query: 1078 SNDTTEPKTVTEMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQN 1257 S+D T+PK V E+C+Q +T ANPWGDVDHLLDGYDD+QKAAIQKERARRIEEQN Sbjct: 751 SDDLTKPKNVPELCSQTDTKPA----ANPWGDVDHLLDGYDDKQKAAIQKERARRIEEQN 806 Query: 1258 KMFAERKXXXXXXXXXXXXNSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWT 1437 KMFA RK NSAKFVE+DPIHEEIL HLFR HMGMWT Sbjct: 807 KMFAARKLCLVLDLDHTLLNSAKFVEIDPIHEEILRKKEEQDRQTQERHLFRLQHMGMWT 866 Query: 1438 KLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDS 1617 KLRPG+W FLEKAS+LYELH+YTMGNKLYATEMAK+LDPKG LF GRV+S+GD+GDPFD Sbjct: 867 KLRPGIWTFLEKASQLYELHVYTMGNKLYATEMAKLLDPKGNLFAGRVLSRGDDGDPFDG 926 Query: 1618 DERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSL 1797 D+RVPKSKDLDGVLGMESAV+IIDDSLRVWPHNKHNLIVVERY YFPSSRRQFGL GPSL Sbjct: 927 DDRVPKSKDLDGVLGMESAVLIIDDSLRVWPHNKHNLIVVERYTYFPSSRRQFGLIGPSL 986 Query: 1798 LEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSR 1977 LEIDHDERP+ GTLASSL VIER+H+IFFSH L EVDVRNIL AEQRKIL GCKIVFSR Sbjct: 987 LEIDHDERPEDGTLASSLAVIERIHEIFFSHSCLTEVDVRNILGAEQRKILAGCKIVFSR 1046 Query: 1978 VFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPG 2157 +FPVGEANPHLHPLWQTAEQFGAECTNQIDE VTHVVANS GTDKVNWALSTGRFVVHPG Sbjct: 1047 IFPVGEANPHLHPLWQTAEQFGAECTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPG 1106 Query: 2158 WVEASALLYRRANEQDFAIK 2217 WVEASALLYRRANEQDFA+K Sbjct: 1107 WVEASALLYRRANEQDFAVK 1126 >ref|XP_010929653.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 [Elaeis guineensis] Length = 1268 Score = 810 bits (2091), Expect = 0.0 Identities = 
453/803 (56%), Positives = 534/803 (66%), Gaps = 64/803 (7%) Frame = +1 Query: 1 NSSGGLQMSSVNSSGGLQMSS-----VNSSGAFQ---MAPLNTSGGSQMDPVKTSAKSRD 156 +SS VN++ +Q+++ +SS + Q + P+ G + + + KSRD Sbjct: 478 SSSANRNAGCVNTTSQIQVATSSAACTDSSSSHQPGTVKPVGQLGSAPNLATRPALKSRD 537 Query: 157 PRLRFMNSEVGGA--------------PQNGFAAGSVNSRKHKAIDEPVPDEHNLKRQRN 294 PRLRF++SE G A P NG G N RKHKA+DE +P+ H LKRQRN Sbjct: 538 PRLRFVSSESGSASDPNTQVMSLDSSAPNNGPVGGITNPRKHKAVDESLPENHTLKRQRN 597 Query: 295 ESTRSRDAQVMPGRGG-WIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRR---- 459 T S D Q++PGRGG W++D+ V SQ +++++ ++NM + +N V VG DRR Sbjct: 598 GLTNSGDVQMIPGRGGGWLDDSSAVGSQPSDKIRLSENMEIETKNPVS-VVGSDRRPDSN 656 Query: 460 -------LVXXXXXXXXXXXXSASNMXXXXXXXXXXXXXXKDIAVNPTMLVELLKMXXXX 618 S++ KDIAVNPTML++L++M Sbjct: 657 PNIHVSNTGTCPIPSSTAAPASSTAPSSSAAASVSFPSLLKDIAVNPTMLMQLIQMEQQR 716 Query: 619 XXXXXXXXXXX-------GPAVNGLSSAISP-------SPDVGQNPAAKSQM-------N 735 ++N LS A+S S +VGQNP + Q+ N Sbjct: 717 LSAEAQQKTVGLMQNMAHASSLNVLSGAVSSATVASMKSTEVGQNPGGRPQVPPQTVSTN 776 Query: 736 GPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGNLPSEVQSSKDLLTVQEQGEQ 915 +D+G+IRMKPRDPRRVLH NMVQK+E++ SE AK NG L S+ QSSKD + EQGEQ Sbjct: 777 SQSDVGRIRMKPRDPRRVLH-NMVQKNETVVSERAKPNGTLSSDPQSSKDQSAIGEQGEQ 835 Query: 916 AQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQASALP-VGTQNSSQLIPSKISND-- 1086 AQ TLP+Q QF KN +NL DI S+ Q++ P +Q SQ I KI+ Sbjct: 836 AQATTLPTQ---------QFAKNTKNLGDISSTLQSTTTPPAASQIISQPIQLKINKVDP 886 Query: 1087 ------TTEPKTVTEMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIE 1248 ++PKT++ + ++G T +G NPWGDVDHLLDGYDDQQKAAIQ+ERARRI Sbjct: 887 RPAAAVVSDPKTLSAVTSEGST-TGATPSTNPWGDVDHLLDGYDDQQKAAIQRERARRIA 945 Query: 1249 EQNKMFAERKXXXXXXXXXXXXNSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMG 1428 EQNKMFA RK NSAKFVEVDP+HEEIL HLFRF HMG Sbjct: 946 EQNKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHEEILRKKEEQDREKPQRHLFRFQHMG 1005 Query: 1429 MWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDP 1608 MWTKLRPG+W FLEKASKLYE+HLYTMGNKLYATEMAKVLDP GTLF GRVIS+GD+GDP Sbjct: 1006 
MWTKLRPGIWTFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGTLFAGRVISRGDDGDP 1065 Query: 1609 FDSDERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPG 1788 FD DERVPKSKDLDGVLGMESAVVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGL G Sbjct: 1066 FDGDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLFG 1125 Query: 1789 PSLLEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIV 1968 PSLLEIDHDERP+ GTLASSL VIER+HQ FFSH SLN++DVRNILAAEQRKIL GCKIV Sbjct: 1126 PSLLEIDHDERPEDGTLASSLAVIERIHQNFFSHHSLNDIDVRNILAAEQRKILAGCKIV 1185 Query: 1969 FSRVFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVV 2148 FSRVFPVGEANPHLHPLWQ AEQFGA CTNQIDE VTHVVANS GTDKVNWALSTGRFVV Sbjct: 1186 FSRVFPVGEANPHLHPLWQMAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVV 1245 Query: 2149 HPGWVEASALLYRRANEQDFAIK 2217 HPGWVEASALLYRR +E DFA+K Sbjct: 1246 HPGWVEASALLYRRVSEHDFAVK 1268 >ref|XP_008791049.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 [Phoenix dactylifera] Length = 1269 Score = 789 bits (2038), Expect = 0.0 Identities = 447/803 (55%), Positives = 522/803 (65%), Gaps = 64/803 (7%) Frame = +1 Query: 1 NSSGGLQMSSVNSSGGLQMSS-----VNSSGAFQMAPLNTSG--GSQMDP-VKTSAKSRD 156 +SS VN++ +Q+++ +SS Q P+ G GS +P ++ + KSRD Sbjct: 479 SSSANGNAGCVNTTSEIQVATNSAACTDSSSRHQPGPVKPVGQLGSAPNPAIRPALKSRD 538 Query: 157 PRLRFMNSEVGGA--------------PQNGFAAGSVNSRKHKAIDEPVPDEHNLKRQRN 294 PRLRF+NSE G A P N G N RKHKA+DE P+ H LKRQ+N Sbjct: 539 PRLRFVNSESGNASDPNRRAMSLDFSAPNNDLVGGITNPRKHKAVDESFPENHTLKRQKN 598 Query: 295 ESTRSRDAQVMPGRGG-WIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRR---- 459 T S D Q+ PGRGG W+ED+ V SQ++++++ N+NM + +N G V DRR Sbjct: 599 GLTNSSDVQMTPGRGGGWLEDSSSVRSQLSDKIRLNENMEIEIKN-PGNVVMSDRRPDSN 657 Query: 460 -------LVXXXXXXXXXXXXSASNMXXXXXXXXXXXXXXKDIAVNPTMLVELLKMXXXX 618 S + KDIAVNPTML++L+++ Sbjct: 658 PNIQVTNTGTCMIPSSTTAPSSGTAPSSSAAASVSFPSLLKDIAVNPTMLMQLIQIEQQR 717 Query: 619 XXXXXXXXXXX-------GPAVNGLSSAISP-------SPDVGQNPAAKSQM-------N 735 ++N L A+S S +VG NP+ + Q+ N Sbjct: 718 
LSAEAQQKTVGLMHNMAHASSLNVLPGAVSSANVASMKSAEVGHNPSGRPQVTAQTVSTN 777 Query: 736 GPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGNLPSEVQSSKDLLTVQEQGEQ 915 +D+G+IRMKPRDPRR+LH NMVQK+E++ SE AK NG L S+ QSSKD L + EQGEQ Sbjct: 778 SQSDVGRIRMKPRDPRRILH-NMVQKNETIVSERAKPNGTLSSDPQSSKDHLAIGEQGEQ 836 Query: 916 AQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQASALPVGT-QNSSQLIPSKISND-- 1086 AQ LP+ Q KN +NL DI S Q + P+ Q SQ I I+ Sbjct: 837 AQATGLPTL---------QLAKNPKNLGDISSPLQLTTTPLAVPQIISQPIQFNINKVDL 887 Query: 1087 ------TTEPKTVTEMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIE 1248 +PKT++ + ++G T N WGDVDHLLDGYDDQQKAAIQ+ERARRI Sbjct: 888 RPAAAVVNDPKTLSTVASEGSTTVAT-QSTNAWGDVDHLLDGYDDQQKAAIQRERARRIA 946 Query: 1249 EQNKMFAERKXXXXXXXXXXXXNSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMG 1428 EQNKMFA RK NSAKFVEVDP+HEEIL HLFRF HMG Sbjct: 947 EQNKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHEEILRKKEEQDREKPQRHLFRFQHMG 1006 Query: 1429 MWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDP 1608 MWTKLRPG+WNFLEKASKLYE+HLYTMGNKLYATEMAKVLDP GTLF GRVIS+GD+ +P Sbjct: 1007 MWTKLRPGIWNFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGTLFAGRVISRGDDSEP 1066 Query: 1609 FDSDERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPG 1788 FD DERVPKSKDLDGVLGMESAVVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGL G Sbjct: 1067 FDGDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLFG 1126 Query: 1789 PSLLEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIV 1968 PSLLEIDHDERP+ GTLASSL VIER+H FFSH+SLN+VDVRNILAAEQRKIL GCKIV Sbjct: 1127 PSLLEIDHDERPEDGTLASSLTVIERIHDDFFSHRSLNDVDVRNILAAEQRKILAGCKIV 1186 Query: 1969 FSRVFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVV 2148 FSRVFPVGEANPHLHPLWQ AEQFGA CTNQIDE VTHVVANS GTDKVNWALSTGRFVV Sbjct: 1187 FSRVFPVGEANPHLHPLWQMAEQFGAACTNQIDEQVTHVVANSLGTDKVNWALSTGRFVV 1246 Query: 2149 HPGWVEASALLYRRANEQDFAIK 2217 HP WVEASALLYRR NEQDFA+K Sbjct: 1247 HPSWVEASALLYRRVNEQDFAVK 1269 >gb|OVA17386.1| BRCT domain [Macleaya cordata] Length = 1214 Score = 781 bits (2018), Expect = 0.0 Identities = 436/746 (58%), Positives = 502/746 (67%), Gaps = 45/746 
(6%) Frame = +1 Query: 115 SQMDPV-KTSAKSRDPRLRFMNSEVGG-------------APQNGFAAGSVNSRKHKAID 252 S ++PV + KSRDPRLRF+NSEVG AP++ G ++SRK+K Sbjct: 472 SGINPVLRPQPKSRDPRLRFLNSEVGSVDLNQRSPYVEYNAPKSETLGGIISSRKNKTDP 531 Query: 253 EPVPDEHNLKRQRN---ESTRSRDAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNR 423 E V D H LKRQRN T S Q+ G GGW+ED V Q R+Q +++ R Sbjct: 532 ESVLDGHTLKRQRNGLTSPTVSGGVQMSSGSGGWLEDISTVRPQPTPRIQLAESVGSDPR 591 Query: 424 NLVGGEVGPDRRLVXXXXXXXXXXXXSASNMXXXXXXXXXXXXXXKDIAVNPTMLVELLK 603 + GEV R + KDIAVNPTML+ L++ Sbjct: 592 MIGNGEVLSGLRQDTSSSNINVRAGGNDQLPLTGIDTMGSLPSLLKDIAVNPTMLINLIR 651 Query: 604 -MXXXXXXXXXXXXXXXGPAVNGLSSAISP------------SPDVGQNPAAKSQMNGP- 741 + G SS + P S ++ Q PA K Q GP Sbjct: 652 EQQRLAAETQQKSSNPTQNKITGSSSNVLPRSVPLANVASSKSSEIEQKPAVKPQ--GPA 709 Query: 742 -----NDMGKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGNLPSEVQSSKDLLTVQEQ 906 + GKIRMKPRDPRR+LH++ QK+E LG E K+ G S +Q+SKD L V++Q Sbjct: 710 ETISTGEFGKIRMKPRDPRRILHNSTFQKNECLGLEQLKTIGASSSLIQASKDNLIVRQQ 769 Query: 907 GEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQASALP--VGTQNSSQLIPSKIS 1080 GE AQTN+LPS SA PDI+QQFTK L+NLADI+SSSQA+ +P V SSQ IP+KI Sbjct: 770 GELAQTNSLPSHSAPAPDIAQQFTKELKNLADILSSSQATNIPSVVPQTVSSQTIPTKI- 828 Query: 1081 NDTTEPKTVTEMCTQGETASG-------VIDLANPWGDVDHLLDGYDDQQKAAIQKERAR 1239 DTT+ +TV + ++ +G V+ N W DV+HL +GYDDQQ+AAI +ERAR Sbjct: 829 -DTTDMRTVVTVPKDQQSGTGTTPEEGTVLPSENKWEDVEHLFEGYDDQQRAAIHRERAR 887 Query: 1240 RIEEQNKMFAERKXXXXXXXXXXXXNSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFP 1419 RIEEQNKMFA RK NSAKF+EVDP+H+EIL HLFRFP Sbjct: 888 RIEEQNKMFAARKLCLVLDLDHTLLNSAKFIEVDPVHDEILRKKEEQDREKPHRHLFRFP 947 Query: 1420 HMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDE 1599 HMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDP G LF GRVISKGDE Sbjct: 948 HMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGVLFAGRVISKGDE 1007 Query: 1600 GDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFG 1779 GDPFD DERVPKSKDL+GVLGMES+VVIIDDS+RVWPHNK NLIVVERY YFP SRRQFG Sbjct: 1008 GDPFDGDERVPKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 
1067 Query: 1780 LPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGC 1959 L GPSLLEIDHDERP+ GTLASSL VIER+HQ FFSH SL++VDVRNILA+EQRKIL GC Sbjct: 1068 LLGPSLLEIDHDERPEEGTLASSLAVIERIHQNFFSHMSLHDVDVRNILASEQRKILAGC 1127 Query: 1960 KIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGR 2139 +IVFSRVFPVGEANPHLHPLWQ+AEQFGA CT QIDE VTHVVANS GTDKVNWALSTGR Sbjct: 1128 RIVFSRVFPVGEANPHLHPLWQSAEQFGAVCTTQIDEQVTHVVANSLGTDKVNWALSTGR 1187 Query: 2140 FVVHPGWVEASALLYRRANEQDFAIK 2217 FVVHP WVEAS LLYRRANE DFA+K Sbjct: 1188 FVVHPSWVEASTLLYRRANEHDFAVK 1213 >gb|EOX99661.1| RNA polymerase II C-terminal domain phosphatase-like 3, putative [Theobroma cacao] Length = 1290 Score = 768 bits (1983), Expect = 0.0 Identities = 425/770 (55%), Positives = 499/770 (64%), Gaps = 52/770 (6%) Frame = +1 Query: 64 VNSSGAFQMAPLNTSGGSQMDPV-----KTSAKSRDPRLRFMNSEVGGAPQN-------- 204 V+S+ + + T + M V K+ AKSRDPRL F NS N Sbjct: 528 VDSASSSLQGQITTRNATPMSSVSNIVSKSLAKSRDPRLWFANSNASALDLNERLLHNAS 587 Query: 205 --GFAAGSVNSRKHKAIDEPVPDEHNLKRQRNESTR---SRDAQVMPGRGGWIEDNGMVA 369 G ++SRK K+++EP+ D LKRQRNE +RD Q + G GGW+ED + Sbjct: 588 KVAPVGGIMDSRKKKSVEEPILDSPALKRQRNELENLGVARDVQTVSGIGGWLEDTDAIG 647 Query: 370 SQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXXXXSASNMXXXXXXXXXXX 549 SQ+ NR Q +N+ +R + G + + Sbjct: 648 SQITNRNQTAENLESNSRKMDNGVTSSST-----LSGKTNITVGTNEQVPVTSTSTPSLP 702 Query: 550 XXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXG--------PAVNGLSSAIS-----P 690 KDIAVNPTML+ +LKM P+ N L +S P Sbjct: 703 ALLKDIAVNPTMLINILKMGQQQRLGAEAQQKSPDPVKSTFHQPSSNSLLGVVSSTNVIP 762 Query: 691 SPDV----------GQNPAAKSQMNGPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELA 840 SP V PA Q+ P++ GKIRMKPRDPRRVLH N +Q+S S+G + Sbjct: 763 SPSVNNVPSISSGISSKPAGNLQVPSPDESGKIRMKPRDPRRVLHGNSLQRSGSMGLDQL 822 Query: 841 KSNGNLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQ 1020 K+NG L S Q SKD L Q+ Q ++ + SQ PDI+QQFT NL+N+ADI+S SQ Sbjct: 823 KTNGALTSSTQGSKDNLNAQKLDSQTESKPMQSQLVPPPDITQQFTNNLKNIADIMSVSQ 882 Query: 1021 
A-SALPVGTQNSSQLIPSK--ISNDTTEPKTVTEMCTQGETASGVIDLA--------NPW 1167 A ++LP + N L+P I +D+ + K + +T +G+ A N W Sbjct: 883 ALTSLPPVSHN---LVPQPVLIKSDSMDMKALVSNSEDQQTGAGLAPEAGATGPRSQNAW 939 Query: 1168 GDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXXNSAKFVEVDPI 1347 GDV+HL + YDDQQKAAIQ+ERARRIEEQ KMF+ RK NSAKF+EVDP+ Sbjct: 940 GDVEHLFERYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFIEVDPV 999 Query: 1348 HEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYA 1527 HEEIL HLFRF HMGMWTKLRPG+WNFLEKASKLYELHLYTMGNKLYA Sbjct: 1000 HEEILRKKEEQDREKPERHLFRFHHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYA 1059 Query: 1528 TEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRVW 1707 TEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMESAVVIIDDS+RVW Sbjct: 1060 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESAVVIIDDSVRVW 1119 Query: 1708 PHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFFS 1887 PHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDERP+ GTLASSL VIER+HQ FFS Sbjct: 1120 PHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIERIHQDFFS 1179 Query: 1888 HQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQID 2067 HQ+L++VDVRNILA+EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAEQFGA CTNQID Sbjct: 1180 HQNLDDVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQID 1239 Query: 2068 EHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 2217 EHVTHVVANS GTDKVNWALSTG+FVVHPGWVEASALLYRRANE DFAIK Sbjct: 1240 EHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEVDFAIK 1289 >ref|XP_010249185.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 [Nelumbo nucifera] Length = 1313 Score = 767 bits (1980), Expect = 0.0 Identities = 437/787 (55%), Positives = 516/787 (65%), Gaps = 55/787 (6%) Frame = +1 Query: 22 MSSVNSSGGLQMSSVNSSGAFQMA-----PLNTSG--GSQMDPVKTSAKSRDPRLRFMNS 180 ++++NSS L+ S +S A ++ P + G GS V +AK+RDPRLR+ NS Sbjct: 529 VATINSSTSLKTVSSATSYADNLSGQGLVPAVSVGQLGSMSSHVIRTAKNRDPRLRYANS 588 Query: 181 EVGGA-----PQNGF--------AAGSVNSRKHKAIDEPVPDEHNLKRQRN---ESTRSR 312 EVG P +G G + SRKHK ++E + D+H KRQRN S S Sbjct: 589 
EVGPLDLNQRPPSGDHDIRKSEPLGGIMGSRKHKIVEESLLDDHTFKRQRNGLINSGASG 648 Query: 313 DAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXX 492 D QV+ G GGW+E++ + Q +R + + R L GE + Sbjct: 649 DVQVVSGSGGWLEESSSMGLQPTDRSRLIEKRESDPRKLGSGEASFGNKQDTGCSTYNVT 708 Query: 493 XXXSASNMXXXXXXXXXXXXXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXGPAVNGL 672 + KDIAVNPTML+ L+KM PA + + Sbjct: 709 TGGNEQLTASGIGSTVSLPSLLKDIAVNPTMLMHLIKMEHQRLAVEALQKCG-NPAQSTM 767 Query: 673 ---SSAISPSPDVGQNPAAKS-------------------QMNGPNDMGKIRMKPRDPRR 786 SS++ P N A+K+ M D+GKIRMKPRDPRR Sbjct: 768 QSSSSSVMPGKIASVNIASKTLSEPEKKSAGNSQISVQTASMIPHGDLGKIRMKPRDPRR 827 Query: 787 VLHSNMVQKSESLGSELAKSNGNLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDIS 966 +LHSN QKS+S G E K+NG + +D L V++QGEQAQTN+L SQS + PDI+ Sbjct: 828 ILHSNTFQKSDSSGPERFKANGTPSPNTPTCRDNLIVRQQGEQAQTNSLLSQSTAPPDIA 887 Query: 967 QQFTKNLQNLADIVSSSQASALP--VGTQNSSQLIPSK--------ISNDTTEPKTVTEM 1116 QQFTK L+N+A+I+S+SQA P V SSQ +P+K ++ D+ + ++ + + Sbjct: 888 QQFTKKLKNIANILSASQAINTPSVVPQTISSQPVPAKMDKVDMKVVATDSNDQRSWSAL 947 Query: 1117 CTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXX 1296 T E A+G N WGDV+HL +GYDDQQKAAIQ+ERARRIEEQN+MFA RK Sbjct: 948 -TPEERAAGPSS-QNAWGDVEHLFEGYDDQQKAAIQRERARRIEEQNQMFAARKLCLVLD 1005 Query: 1297 XXXXXXNSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKA 1476 NSAKFVEVDP+HEE+L HLFRF HMGMWTKLRPG+WNFLEKA Sbjct: 1006 LDHTLLNSAKFVEVDPVHEEMLRKKEEQDREKPQRHLFRFTHMGMWTKLRPGIWNFLEKA 1065 Query: 1477 SKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGV 1656 SKLYELHLYTMGNKLYATEMAKVLDP G LF GRVIS+GD+GDPFD DER PKSKDLDGV Sbjct: 1066 SKLYELHLYTMGNKLYATEMAKVLDPTGVLFAGRVISRGDDGDPFDGDERQPKSKDLDGV 1125 Query: 1657 LGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGT 1836 LGMESAVVIIDDS+RVWPHNK NLIVVERY YFP SRRQ GL GPSLLEIDHDERP+ GT Sbjct: 1126 LGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQLGLHGPSLLEIDHDERPEDGT 1185 Query: 1837 LASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHP 2016 LASSL VIER+HQ FFSHQ+LN+VDVRNILAAEQ+KIL GC+IVFSRVFPVGEANPHLHP 
Sbjct: 1186 LASSLAVIERIHQNFFSHQNLNDVDVRNILAAEQQKILAGCRIVFSRVFPVGEANPHLHP 1245 Query: 2017 LWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRAN 2196 LWQTAEQFGA CTNQIDE VTHVVA S GTDKVNWALSTGRFVVHPGWVEASALLYRRAN Sbjct: 1246 LWQTAEQFGAVCTNQIDEQVTHVVAISLGTDKVNWALSTGRFVVHPGWVEASALLYRRAN 1305 Query: 2197 EQDFAIK 2217 E DFAIK Sbjct: 1306 EHDFAIK 1312 >ref|XP_007043830.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 [Theobroma cacao] Length = 1290 Score = 766 bits (1978), Expect = 0.0 Identities = 426/771 (55%), Positives = 496/771 (64%), Gaps = 53/771 (6%) Frame = +1 Query: 64 VNSSGAFQMAPLNTSGGSQMDPV-----KTSAKSRDPRLRFMNSEVGGAPQN-------- 204 V+S+ + + T + M V K+ AKSRDPRL F NS N Sbjct: 528 VDSASSSLQGQITTRNATPMSSVSNIVSKSLAKSRDPRLWFANSNASALDLNERLLHNAS 587 Query: 205 --GFAAGSVNSRKHKAIDEPVPDEHNLKRQRNESTR---SRDAQVMPGRGGWIEDNGMVA 369 G ++SRK K+++EP+ D LKRQRNE +RD Q + G GGW+ED + Sbjct: 588 KVAPVGGIMDSRKKKSVEEPILDSPALKRQRNELENLGVARDVQTVSGIGGWLEDTDAIG 647 Query: 370 SQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXXXXSASNMXXXXXXXXXXX 549 SQ+ NR Q +N+ +R + G + + Sbjct: 648 SQITNRNQTAENLESNSRKMDNGVTSSST-----LSGKTNITVGTNEQVPVTSTSTPSLP 702 Query: 550 XXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXG--------PAVNGLSSAIS-----P 690 KDIAVNPTML+ +LKM P+ N L +S P Sbjct: 703 ALLKDIAVNPTMLINILKMGQQQRLGAEAQQKSPDPVKSTFHQPSSNSLLGVVSSTNVIP 762 Query: 691 SPDV----------GQNPAAKSQMNGPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELA 840 SP V PA Q+ P++ GKIRMKPRDPRRVLH N +Q+S S+G + Sbjct: 763 SPSVNNVPSISSGISSKPAGNLQVPSPDESGKIRMKPRDPRRVLHGNSLQRSGSMGPDQL 822 Query: 841 KSNGNLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQ 1020 K+NG L S Q SKD L Q+ Q ++ + SQ PDI+QQFT NL+N+A IVS SQ Sbjct: 823 KTNGALTSSTQGSKDNLNAQKLDSQTESKPMQSQLVPPPDITQQFTNNLKNIAGIVSVSQ 882 Query: 1021 A--SALPVGTQNSSQLIPSK--ISNDTTEPKTVTEMCTQGETASGVIDLA--------NP 1164 A S PV S L+P I +D+ + K + +T +G+ A N Sbjct: 883 ALTSLSPV----SHNLVPQPVLIKSDSMDMKALVSNSEDQQTGAGLAPEAGATGPHSQNA 938 Query: 1165 
WGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXXNSAKFVEVDP 1344 WGDV+HL + YDDQQKAAIQ+ERARRIEEQ KMF+ RK NSAKF+EVDP Sbjct: 939 WGDVEHLFERYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFIEVDP 998 Query: 1345 IHEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLY 1524 +HEEIL HLFRF HMGMWTKLRPG+WNFLEKASKLYELHLYTMGNKLY Sbjct: 999 VHEEILRKKEEQDREKPERHLFRFHHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLY 1058 Query: 1525 ATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRV 1704 ATEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMESAVVIIDDS+RV Sbjct: 1059 ATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESAVVIIDDSVRV 1118 Query: 1705 WPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFF 1884 WPHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDERP+ GTLASSL VIER+HQ FF Sbjct: 1119 WPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIERIHQDFF 1178 Query: 1885 SHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQI 2064 SHQ+L++VDVRNILA+EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAEQFGA CTNQI Sbjct: 1179 SHQNLDDVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQI 1238 Query: 2065 DEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 2217 DEHVTHVVANS GTDKVNWALSTG+FVVHPGWVEASALLYRRANE DFAIK Sbjct: 1239 DEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEVDFAIK 1289 >ref|XP_021275229.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Herrania umbratica] Length = 1291 Score = 766 bits (1978), Expect = 0.0 Identities = 424/770 (55%), Positives = 499/770 (64%), Gaps = 49/770 (6%) Frame = +1 Query: 55 MSSVNSSGAFQMAPLNTSGGSQMDPV--KTSAKSRDPRLRFMNSEVGGAPQN-------- 204 + S NSS Q+ N + S + + K+ AKSRDPRL F N+ N Sbjct: 529 VDSANSSLQGQITTRNATPMSSVSNIVSKSLAKSRDPRLWFANTNASALDLNERPLHNAS 588 Query: 205 --GFAAGSVNSRKHKAIDEPVPDEHNLKRQRNESTR---SRDAQVMPGRGGWIEDNGMVA 369 G ++SRK K+++EP+ D LKRQRNE +RD Q + G GGW+ED ++ Sbjct: 589 KVAPVGGIMDSRKRKSVEEPILDGPALKRQRNELENLGVARDVQTVCGIGGWLEDTDVIG 648 Query: 370 SQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXXXXSASNMXXXXXXXXXXX 549 SQ+ NR Q +N+ +R + G + + Sbjct: 649 
SQITNRNQTAENLESNSRKMDNGVTSSST-----LSGKTNMTVGTNEQVPVTSTSTPSLP 703 Query: 550 XXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXGPAVNGL-------------SSAISP 690 KDIAVNPTML+ +LKM P + L S+ + Sbjct: 704 ALLKDIAVNPTMLISILKMGQQQRLGAEAQQKSPDPVKSTLHQPSSNSLLGVVSSTNVIS 763 Query: 691 SPDVG----------QNPAAKSQMNGPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELA 840 SP V PA Q+ P++ GKIRMKPRDPRRVLH N +Q+S S+G + Sbjct: 764 SPSVNNVPSISSGILSKPAGNLQVPSPDESGKIRMKPRDPRRVLHGNSLQRSGSMGPDQL 823 Query: 841 KSNGNLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQ 1020 K+NG L S Q SKD L Q Q ++ + SQ PDI+QQFTKNL+N+ADI+S SQ Sbjct: 824 KTNGALTSSTQGSKDNLNAQNLDSQTESKPMQSQLVPPPDITQQFTKNLKNIADIMSVSQ 883 Query: 1021 A-SALPVGTQNSSQLIPS--KISNDTTEPKTVTEMCTQGETASGVI--------DLANPW 1167 A ++LP QN L+P +I +D+ + K + +T +G+ N W Sbjct: 884 ALTSLPPVPQN---LVPQPVQIKSDSMDMKALVSNSEDQQTGAGLAPEVGATGPHSQNAW 940 Query: 1168 GDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXXNSAKFVEVDPI 1347 GDV+HL + YDDQQKAAIQ+ERARRIEEQ KMF+ K NSAKF+EVDP+ Sbjct: 941 GDVEHLFERYDDQQKAAIQRERARRIEEQKKMFSAHKLCLVLDLDHTLLNSAKFIEVDPV 1000 Query: 1348 HEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYA 1527 HEEIL HLFRF HMGMWTKLRPG+WNFLEKASKLYELHLYTMGNKLYA Sbjct: 1001 HEEILRKKEEQDREKPQRHLFRFHHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYA 1060 Query: 1528 TEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRVW 1707 TEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMES VVIIDDS+RVW Sbjct: 1061 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESGVVIIDDSVRVW 1120 Query: 1708 PHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFFS 1887 PHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDERP+ GTLASSL VIER+HQ FFS Sbjct: 1121 PHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIERIHQDFFS 1180 Query: 1888 HQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQID 2067 HQ+L++VDVRNILAAEQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAEQFGA CTNQID Sbjct: 1181 HQNLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQID 1240 Query: 2068 EHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 2217 EHVTHVVANS 
GTDKVNWALSTG+FVVHPGWVEASALLYRRANE DFAIK Sbjct: 1241 EHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEVDFAIK 1290 >gb|EEF50102.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis] Length = 1195 Score = 754 bits (1947), Expect = 0.0 Identities = 416/736 (56%), Positives = 488/736 (66%), Gaps = 40/736 (5%) Frame = +1 Query: 130 VKTSAKSRDPRLRFMNSEVGGAPQNGFAA------------GSVNSRKHKAIDEPVPDEH 273 VK SAKSRDPRLRF+NS+ QN A G++N ++ K +D+P+PD H Sbjct: 460 VKASAKSRDPRLRFVNSDSNALDQNHRAVPVVNTLKVEPIGGTMNKKRQKIVDDPIPDGH 519 Query: 274 NLKRQRNESTRS---RDAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEV 444 +LKRQ+N S RD + M G GGW+ED MV Q N+ Q N R GG V Sbjct: 520 SLKRQKNALENSGVVRDVKTMVGSGGWLEDTDMVGPQTMNKNQLVDNAESDPRRKDGGGV 579 Query: 445 GPDRRLVXXXXXXXXXXXXSASN-------MXXXXXXXXXXXXXXKDIAVNPTMLVELLK 603 + + K+IAVNPTML+ +LK Sbjct: 580 CTSSSCISSVNISGTEQIPVTGTSVPIGGELVPVKGSTAAIPDLLKNIAVNPTMLINILK 639 Query: 604 MXXXXXXXXXXXXXXXGPAVN-----GLSSAISPSPDVG-------QNPA----AKSQMN 735 M PA + +S + P VG PA Q+ Sbjct: 640 MGQQQRLALEAQQKPVDPAKSTTYPLNSNSMLGTVPVVGAAHSGILPRPAGTVQVSPQLG 699 Query: 736 GPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGNLPSEVQSSKDLLTVQEQGEQ 915 +D+GKIRMKPRDPRRVLH+N +Q++ S+GSE K+N Q +KD +Q+Q Q Sbjct: 700 TADDLGKIRMKPRDPRRVLHNNALQRNGSMGSEHLKTNLTSIPINQETKDNQNLQKQEGQ 759 Query: 916 AQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQAS-ALPVGTQN-SSQLIPSKISNDT 1089 + +P QS +LPDIS FTKNL+N+ADIVS S AS + P+ QN +SQ + + IS+ Sbjct: 760 VEKKPVPLQSLALPDISMPFTKNLKNIADIVSVSHASTSQPLVPQNPASQPMRTTISSSD 819 Query: 1090 TEPKTVTEMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFA 1269 + A+G N WGDV+HL +GY+DQQKAAIQ+ERARRIEEQ K+F+ Sbjct: 820 QFLGIGSAPGAAAAAAAGP-RTQNAWGDVEHLFEGYNDQQKAAIQRERARRIEEQKKLFS 878 Query: 1270 ERKXXXXXXXXXXXXNSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRP 1449 RK NSAKFVEVDP+H+EIL HLFRFPHMGMWTKLRP Sbjct: 879 ARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAHRHLFRFPHMGMWTKLRP 938 Query: 1450 GVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERV 1629 G+WNFLEKASKLYELHLYTMGNKLYATEMAKVLDP G LF+GRVIS+GD+G+PFD DER+ 
Sbjct: 939 GIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGVLFNGRVISRGDDGEPFDGDERI 998 Query: 1630 PKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEID 1809 PKSKDL+GVLGMES VVI+DDS+RVWPHNK NLIVVERY+YFP SRRQFGLPGPSLLEID Sbjct: 999 PKSKDLEGVLGMESGVVIMDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEID 1058 Query: 1810 HDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPV 1989 HDERP+ GTLA SL VIER+HQ FF+H SL+E DVRNILA+EQRKIL GC+IVFSRVFPV Sbjct: 1059 HDERPEDGTLACSLAVIERIHQNFFTHPSLDEADVRNILASEQRKILAGCRIVFSRVFPV 1118 Query: 1990 GEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEA 2169 GEANPHLHPLWQTAEQFGA CTNQIDE VTHVVANS GTDKVNWALSTGRFVV+PGWVEA Sbjct: 1119 GEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVYPGWVEA 1178 Query: 2170 SALLYRRANEQDFAIK 2217 SALLYRRANEQDFAIK Sbjct: 1179 SALLYRRANEQDFAIK 1194 >ref|XP_021662955.1| LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain phosphatase-like 3 [Hevea brasiliensis] Length = 1292 Score = 752 bits (1942), Expect = 0.0 Identities = 416/735 (56%), Positives = 480/735 (65%), Gaps = 40/735 (5%) Frame = +1 Query: 133 KTSAKSRDPRLRFMNSEVGGAPQNGFAA----------GSVNSRKHKAIDEPVPDEHNLK 282 K SAKSRDPRLRF+NS+ + QN A G++N +K K++DEP+PD LK Sbjct: 560 KASAKSRDPRLRFVNSDANVSDQNNRAVPVVNNTLKVGGTMNLKKQKSVDEPIPDGPPLK 619 Query: 283 RQRNESTRS---RDAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPD 453 RQ+ S S RD + M G GGW+ED +V Q NR Q +N R + G P Sbjct: 620 RQKIASEISGVGRDVKTMIGSGGWLEDTDVVGPQTLNRNQLVENAESDPRRIDNGVACPS 679 Query: 454 RRLVXXXXXXXXXXXXS---------ASNMXXXXXXXXXXXXXXKDIAVNPTMLVELLKM 606 A + K+IAVNPTML+ +LKM Sbjct: 680 TVSGISSVNISGNEQLQVTGASAVAGAEQVPVMGASATSLPDLLKNIAVNPTMLISILKM 739 Query: 607 XXXXXXXXXXXXXXXG--------PAVNGLSSA-----ISPSPDVGQNPAAKSQMNGP-- 741 P N + A ++P G P + P Sbjct: 740 GQQQRLAIEAQQKPVDLAKSTTHPPNTNSILGALPVVNVAPPQSTGILPRPAGALQVPQL 799 Query: 742 ---NDMGKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGNLPSEVQSSKDLLTVQEQGE 912 ++MGKIRMKPRDPRRVLH+N +Q++ SLGSE K+N S Q +K+ VQ Q Sbjct: 800 
AASDEMGKIRMKPRDPRRVLHNNTLQRNGSLGSEQFKTNLISTSTSQGTKENQNVQNQEG 859 Query: 913 QAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQASALPVGTQNSSQLIPSKISNDTT 1092 Q + +P+QS PDIS FTK+L+N+ADIVS S AS P+ +QN L+ + Sbjct: 860 QVEMKPVPTQSLVAPDISLPFTKSLKNIADIVSVSNASTPPLVSQN---LVSQHVRTVVL 916 Query: 1093 EPKTVTEMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAE 1272 + T + AS N WGD DH+ +GY+DQQKAAIQ+ERARRIEEQ KMFA Sbjct: 917 NSEQPTGIGLPPGVASVAPRSQNTWGDFDHIFEGYNDQQKAAIQRERARRIEEQKKMFAA 976 Query: 1273 RKXXXXXXXXXXXXNSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPG 1452 K NSAKFVE+DP+H+EIL HLFRFPHMGMWTKLRPG Sbjct: 977 NKLCLVLDLDHTLLNSAKFVEIDPVHDEILRKKEEQDHEKPQRHLFRFPHMGMWTKLRPG 1036 Query: 1453 VWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVP 1632 +WNFLEKASKLYELHLYTMGNKLYATEMAKVLDP G LF+GRVIS GD+GDPFDSDERVP Sbjct: 1037 IWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGVLFNGRVISXGDDGDPFDSDERVP 1096 Query: 1633 KSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDH 1812 KSKDL+GVLGMESAVVIIDDS+RVWPHNK NLIVVERY+YFP SRRQFGLPGPSLLEIDH Sbjct: 1097 KSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDH 1156 Query: 1813 DERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVG 1992 DERP+ GTLA SL VIER+HQ FF+H SL+E DVRNILA+EQRKIL GC+IVFSRVFPVG Sbjct: 1157 DERPEDGTLACSLAVIERIHQNFFTHPSLDEADVRNILASEQRKILAGCRIVFSRVFPVG 1216 Query: 1993 EANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEAS 2172 EANPHLHPLWQTAEQFGA CTNQIDE VTHVVANS GTDKVNWALSTGRFVV+PGWVEAS Sbjct: 1217 EANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVYPGWVEAS 1276 Query: 2173 ALLYRRANEQDFAIK 2217 ALLYRRANEQDFAIK Sbjct: 1277 ALLYRRANEQDFAIK 1291 >gb|KJB77193.1| hypothetical protein B456_012G125200 [Gossypium raimondii] Length = 982 Score = 739 bits (1907), Expect = 0.0 Identities = 424/781 (54%), Positives = 497/781 (63%), Gaps = 52/781 (6%) Frame = +1 Query: 31 VNSSGGLQMSSVNSS--GAF---QMAPLNTSGGSQMDPVKTSAKSRDPRLRFMNSEVGGA 195 V+S+ + +S SS G F P+ S S + K SAKSRDPRLRF NS V Sbjct: 208 
VSSAPHIDSASSTSSMQGQFTTQNATPVTVSSASNILS-KASAKSRDPRLRFANSNVSAL 266 Query: 196 PQNGF----------AAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRS--RDAQVMPGRG 339 N +G ++ RK K+ +EPV D KRQ+NE RD Q + G G Sbjct: 267 DLNQRPLHNASKVPPVSGIMDPRKKKSTEEPVLDGPAPKRQKNELENFGVRDVQAVSGNG 326 Query: 340 GWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXXXXSASNMX 519 GW+ED SQ+ NR Q + + +R + G + M Sbjct: 327 GWLEDTDNCESQITNRNQTMETLDSNSRKMEHGVTCSSTLSGKTNTTVNKNEQVPLTGMS 386 Query: 520 XXXXXXXXXXXXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXGPAVNGLSSAIS---- 687 KDIAVNPTML+ +LKM P N L S Sbjct: 387 NPSLPALL-----KDIAVNPTMLINILKMGQQQRLPSESQQKTPDPLKNTLYQPSSNPVL 441 Query: 688 ---------PSPDVGQNPAAKS--------QMNGP--NDMGKIRMKPRDPRRVLHSNMVQ 810 PSP V P++ S + GP ++ KIRMKPRDPRRVLH N++Q Sbjct: 442 GVIPPANVIPSPSVNVVPSSSSGTLSKPAGNLQGPPLDESCKIRMKPRDPRRVLHGNVLQ 501 Query: 811 KSESLGSELAKSNGNLP-SEVQSSKDLLTVQEQGE-QAQTNTLPSQSASLPDISQQFTKN 984 KS S+G + K+NG P S Q SKD + Q+Q E Q + + Q PDI+QQFT++ Sbjct: 502 KSGSVGPDQLKTNGTSPASSTQGSKDNMNAQKQLENQIEAKPIQCQFVPPPDIAQQFTQS 561 Query: 985 LQNLADIVSSSQASA-LPVGTQNSSQLIPSKISNDTTEPKTVTEMCTQGETASGVIDLA- 1158 L+N+A ++S Q+ A LP +QN P ++ ++T + T +T +G A Sbjct: 562 LKNIAGMMSGPQSFAGLPAVSQNLVSQ-PIQVKSETADKNTKGSNSEDQQTGTGTAPEAG 620 Query: 1159 --------NPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXX 1314 N WGDV+HL + YDD+QKAAIQ+ERARRIEEQ KMFA RK Sbjct: 621 VTCPPPSQNAWGDVEHLFEKYDDRQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLL 680 Query: 1315 NSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKASKLYEL 1494 NSAKF+EVDP+HEEIL HLFRF HMGMWTKLRPG+WNFLEKASKLYEL Sbjct: 681 NSAKFIEVDPVHEEILRKKEEQDREKPQRHLFRFHHMGMWTKLRPGIWNFLEKASKLYEL 740 Query: 1495 HLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESA 1674 HLYTMGNKLYATEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMES+ Sbjct: 741 HLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESS 800 Query: 1675 VVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLG 1854 VVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDERP+ GTLASSL Sbjct: 801 
VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLA 860 Query: 1855 VIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAE 2034 VIER+HQ FFSHQ+L+++DVRNILA EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAE Sbjct: 861 VIERIHQNFFSHQNLDDLDVRNILATEQRKILSGCRIVFSRVFPVGEANPHLHPLWQTAE 920 Query: 2035 QFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 2214 QFGA CTNQIDEHVTHVVANS GTDKVNWALSTG+FVVHPGWVEASALLYRRANE DFAI Sbjct: 921 QFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEHDFAI 980 Query: 2215 K 2217 K Sbjct: 981 K 981 >ref|XP_022737741.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Durio zibethinus] Length = 1274 Score = 748 bits (1932), Expect = 0.0 Identities = 421/764 (55%), Positives = 491/764 (64%), Gaps = 44/764 (5%) Frame = +1 Query: 58 SSVNSSGAFQMAPLNTSGGSQMDPVKTSAKSRDPRLRFMNSEVGGAPQNGF--------- 210 SS+ A Q A + T + +K+SAKSRDPRLRF NS N Sbjct: 521 SSMQGKIATQNATVVTVSSASNIALKSSAKSRDPRLRFANSNASALDLNQQPLHNASKAV 580 Query: 211 -AAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRS---RDAQVMPGRGGWIEDNGMVASQM 378 G ++SRK K+I+EPV D LKRQR E S +D Q + G GW+ED ++ SQ+ Sbjct: 581 PVGGIMDSRKQKSIEEPVLDGPALKRQRKELENSGVVKDVQTVSGNCGWLEDTDVIGSQV 640 Query: 379 NNRVQPNKNM-----AVGNRNLVGGEVGPDRRLVXXXXXXXXXXXXSASNMXXXXXXXXX 543 NR Q +N + NR + + S ++ Sbjct: 641 TNRNQIVENSDSNSWKMDNRVTCSSTLSGKTNMTVNRNEQVPMTGMSTPSLPALL----- 695 Query: 544 XXXXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXGP--------AVNGLSSAISP--- 690 KDIAVNPT+L+ +LKM P + N + ++P Sbjct: 696 -----KDIAVNPTVLINILKMGQQERLAAEILQKSPDPVKSTLHQPSSNSILGVVTPVNI 750 Query: 691 ----SPDVGQNPAAKSQMNGPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGNL 858 S + PA Q+ P++ G IRMKPRDPRRVLH N++Q+S +G + K+NG Sbjct: 751 VPSSSSGILSKPAGNLQVPPPDESGNIRMKPRDPRRVLHGNVLQRSGIMGPDQVKTNGTT 810 Query: 859 P-SEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQA-SAL 1032 P S SKD L VQ+ Q ++ + SQ PDI+QQFTKNL+N+ADI+S SQA ++L Sbjct: 811 PTSSTLGSKDNLNVQKLEAQTESKPMQSQLVPAPDITQQFTKNLKNIADIMSVSQALTSL 870 Query: 1033 PVGTQNSSQLIPSKISNDTTEPKTVTEMCTQGETASGVIDLA---------NPWGDVDHL 1185 P +Q+ P 
+ D+ + KTV +T +G A N W DV+HL Sbjct: 871 PAVSQSLVSQ-PVQHKPDSMDMKTVVSSSEDQQTGTGSAPEADARGPHCSQNTWDDVEHL 929 Query: 1186 LDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXXNSAKFVEVDPIHEEILX 1365 + YDDQQKAAIQKERARRIEEQ KMF K NSAKF EVDP+HEEIL Sbjct: 930 FERYDDQQKAAIQKERARRIEEQKKMFDANKLCLVLDLDHTLLNSAKFNEVDPVHEEILR 989 Query: 1366 XXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKV 1545 HLFRF HMGMWTKLRPG+WNFLEKASKLYELHLYTMGNKLYATEMAKV Sbjct: 990 KKEEQDREKPQRHLFRFQHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKV 1049 Query: 1546 LDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHN 1725 LDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMESAVVIIDDS+RVWPHNK N Sbjct: 1050 LDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESAVVIIDDSVRVWPHNKLN 1109 Query: 1726 LIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNE 1905 LIVVERY YFP SRRQFGL GPSLLEIDHDER D GTLASSL VIER+HQ FFSHQ+L++ Sbjct: 1110 LIVVERYTYFPCSRRQFGLLGPSLLEIDHDERLDDGTLASSLAVIERIHQDFFSHQNLDD 1169 Query: 1906 VDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHV 2085 VDVRNILAAEQRKIL GC +VFSRVFPVGEANPHLHPLWQTAEQFGA CTNQIDEHVTHV Sbjct: 1170 VDVRNILAAEQRKILAGCHVVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEHVTHV 1229 Query: 2086 VANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 2217 VANS GTDKVNWALSTG+FVVHPGWVEAS LLYRRANE DFAIK Sbjct: 1230 VANSLGTDKVNWALSTGKFVVHPGWVEASTLLYRRANELDFAIK 1273 >gb|KJB77192.1| hypothetical protein B456_012G125200 [Gossypium raimondii] Length = 1033 Score = 739 bits (1907), Expect = 0.0 Identities = 424/781 (54%), Positives = 497/781 (63%), Gaps = 52/781 (6%) Frame = +1 Query: 31 VNSSGGLQMSSVNSS--GAF---QMAPLNTSGGSQMDPVKTSAKSRDPRLRFMNSEVGGA 195 V+S+ + +S SS G F P+ S S + K SAKSRDPRLRF NS V Sbjct: 259 VSSAPHIDSASSTSSMQGQFTTQNATPVTVSSASNILS-KASAKSRDPRLRFANSNVSAL 317 Query: 196 PQNGF----------AAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRS--RDAQVMPGRG 339 N +G ++ RK K+ +EPV D KRQ+NE RD Q + G G Sbjct: 318 DLNQRPLHNASKVPPVSGIMDPRKKKSTEEPVLDGPAPKRQKNELENFGVRDVQAVSGNG 377 Query: 340 GWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXXXXSASNMX 
519 GW+ED SQ+ NR Q + + +R + G + M Sbjct: 378 GWLEDTDNCESQITNRNQTMETLDSNSRKMEHGVTCSSTLSGKTNTTVNKNEQVPLTGMS 437 Query: 520 XXXXXXXXXXXXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXGPAVNGLSSAIS---- 687 KDIAVNPTML+ +LKM P N L S Sbjct: 438 NPSLPALL-----KDIAVNPTMLINILKMGQQQRLPSESQQKTPDPLKNTLYQPSSNPVL 492 Query: 688 ---------PSPDVGQNPAAKS--------QMNGP--NDMGKIRMKPRDPRRVLHSNMVQ 810 PSP V P++ S + GP ++ KIRMKPRDPRRVLH N++Q Sbjct: 493 GVIPPANVIPSPSVNVVPSSSSGTLSKPAGNLQGPPLDESCKIRMKPRDPRRVLHGNVLQ 552 Query: 811 KSESLGSELAKSNGNLP-SEVQSSKDLLTVQEQGE-QAQTNTLPSQSASLPDISQQFTKN 984 KS S+G + K+NG P S Q SKD + Q+Q E Q + + Q PDI+QQFT++ Sbjct: 553 KSGSVGPDQLKTNGTSPASSTQGSKDNMNAQKQLENQIEAKPIQCQFVPPPDIAQQFTQS 612 Query: 985 LQNLADIVSSSQASA-LPVGTQNSSQLIPSKISNDTTEPKTVTEMCTQGETASGVIDLA- 1158 L+N+A ++S Q+ A LP +QN P ++ ++T + T +T +G A Sbjct: 613 LKNIAGMMSGPQSFAGLPAVSQNLVSQ-PIQVKSETADKNTKGSNSEDQQTGTGTAPEAG 671 Query: 1159 --------NPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXX 1314 N WGDV+HL + YDD+QKAAIQ+ERARRIEEQ KMFA RK Sbjct: 672 VTCPPPSQNAWGDVEHLFEKYDDRQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLL 731 Query: 1315 NSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKASKLYEL 1494 NSAKF+EVDP+HEEIL HLFRF HMGMWTKLRPG+WNFLEKASKLYEL Sbjct: 732 NSAKFIEVDPVHEEILRKKEEQDREKPQRHLFRFHHMGMWTKLRPGIWNFLEKASKLYEL 791 Query: 1495 HLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESA 1674 HLYTMGNKLYATEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMES+ Sbjct: 792 HLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESS 851 Query: 1675 VVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLG 1854 VVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDERP+ GTLASSL Sbjct: 852 VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLA 911 Query: 1855 VIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAE 2034 VIER+HQ FFSHQ+L+++DVRNILA EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAE Sbjct: 912 VIERIHQNFFSHQNLDDLDVRNILATEQRKILSGCRIVFSRVFPVGEANPHLHPLWQTAE 971 Query: 2035 QFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 2214 
QFGA CTNQIDEHVTHVVANS GTDKVNWALSTG+FVVHPGWVEASALLYRRANE DFAI Sbjct: 972 QFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEHDFAI 1031 Query: 2215 K 2217 K Sbjct: 1032 K 1032 >ref|XP_010656786.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 isoform X1 [Vitis vinifera] Length = 1276 Score = 744 bits (1920), Expect = 0.0 Identities = 425/782 (54%), Positives = 504/782 (64%), Gaps = 47/782 (6%) Frame = +1 Query: 13 GLQMSSVNSSGGLQMSSVNSSGAFQMAPLNTSG-GSQMDPV-KTSAKSRDPRLRFMNSEV 186 G S V+S L S V + P NT S+ + + + SAKSRDPRLR +S+ Sbjct: 525 GRNTSLVSSGPHLDSSVVQGL----VVPRNTGAVNSRFNSILRASAKSRDPRLRLASSDA 580 Query: 187 GG-------------APQNGFAAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRS---RDA 318 G +P+ V+SRK K+ +EP+ D KRQRN T RDA Sbjct: 581 GSLDLNERPLPAVSNSPKVDPLGEIVSSRKQKSAEEPLLDGPVTKRQRNGLTSPATVRDA 640 Query: 319 QVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNL-----VGGEVGPDRRLVXXXXXX 483 Q + GGW+ED+ V QM NR Q +N + L V G +G D+ V Sbjct: 641 QTVVASGGWLEDSNTVIPQMMNRNQLIENTGTDPKKLESKVTVTG-IGCDKPYVTVNGNE 699 Query: 484 XXXXXXSASNMXXXXXXXXXXXXXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXGPAV 663 +++ KDIAVNP + + + P Sbjct: 700 HLPVVATSTTASLQSLL--------KDIAVNPAVWMNIFNKVEQQKSGDPAKNTVLPPTS 751 Query: 664 NGLSSAISPSP-------DVGQNPAAKSQ------MNGPNDMGKIRMKPRDPRRVLHSNM 804 N + + P+ +GQ PA Q MN ++ GK+RMKPRDPRR+LH+N Sbjct: 752 NSILGVVPPASVAPLKPSALGQKPAGALQVPQTGPMNPQDESGKVRMKPRDPRRILHANS 811 Query: 805 VQKSESLGSELAKSNGNLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFTKN 984 Q+S S GSE K+N Q+Q +Q +T ++PS S + PDISQQFTKN Sbjct: 812 FQRSGSSGSEQFKTNA---------------QKQEDQTETKSVPSHSVNPPDISQQFTKN 856 Query: 985 LQNLADIVSSSQASALPVGTQNSSQLIPSK---ISNDTTEPKT--------VTEMCTQGE 1131 L+N+AD++S+SQAS++ T Q++ S+ ++ D + K +T ++ E Sbjct: 857 LKNIADLMSASQASSM---TPTFPQILSSQSVQVNTDRMDVKATVSDSGDQLTANGSKPE 913 Query: 1132 TASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXX 1311 +A+G N WGDV+HL DGYDDQQKAAIQ+ERARRIEEQ KMF+ RK Sbjct: 914 SAAGPPQSKNTWGDVEHLFDGYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTL 973 Query: 1312 
XNSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKASKLYE 1491 NSAKFVEVDP+H+EIL HLFRFPHMGMWTKLRPG+WNFLEKASKLYE Sbjct: 974 LNSAKFVEVDPVHDEILRKKEEQDREKSQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYE 1033 Query: 1492 LHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMES 1671 LHLYTMGNKLYATEMAKVLDPKG LF GRVISKGD+GD D DERVPKSKDL+GVLGMES Sbjct: 1034 LHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKGDDGDVLDGDERVPKSKDLEGVLGMES 1093 Query: 1672 AVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSL 1851 AVVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGLPGPSLLEIDHDERP+ GTLASSL Sbjct: 1094 AVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASSL 1153 Query: 1852 GVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTA 2031 VIER+HQ FFS+++L+EVDVRNILA+EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTA Sbjct: 1154 AVIERIHQSFFSNRALDEVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTA 1213 Query: 2032 EQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 2211 E FGA CTNQIDE VTHVVANS GTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA Sbjct: 1214 ESFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 1273 Query: 2212 IK 2217 IK Sbjct: 1274 IK 1275 >ref|XP_010656789.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 isoform X2 [Vitis vinifera] Length = 1273 Score = 743 bits (1919), Expect = 0.0 Identities = 426/779 (54%), Positives = 504/779 (64%), Gaps = 44/779 (5%) Frame = +1 Query: 13 GLQMSSVNSSGGLQMSSVNSSGAFQMAPLNTSG-GSQMDPV-KTSAKSRDPRLRFMNSEV 186 G S V+S L S V + P NT S+ + + + SAKSRDPRLR +S+ Sbjct: 525 GRNTSLVSSGPHLDSSVVQGL----VVPRNTGAVNSRFNSILRASAKSRDPRLRLASSDA 580 Query: 187 GG-------------APQNGFAAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRS---RDA 318 G +P+ V+SRK K+ +EP+ D KRQRN T RDA Sbjct: 581 GSLDLNERPLPAVSNSPKVDPLGEIVSSRKQKSAEEPLLDGPVTKRQRNGLTSPATVRDA 640 Query: 319 QVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNL-----VGGEVGPDRRLVXXXXXX 483 Q + GGW+ED+ V QM NR Q +N + L V G +G D+ V Sbjct: 641 QTVVASGGWLEDSNTVIPQMMNRNQLIENTGTDPKKLESKVTVTG-IGCDKPYVTVNGNE 699 Query: 484 XXXXXXSASNMXXXXXXXXXXXXXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXGPAV 
663 +++ KDIAVNP + + + P Sbjct: 700 HLPVVATSTTASLQSLL--------KDIAVNPAVWMNIFNKVEQQKSGDPAKNTVLPPTS 751 Query: 664 NGLSSAISPSP-------DVGQNPAAKSQM--NGPND-MGKIRMKPRDPRRVLHSNMVQK 813 N + + P+ +GQ PA Q+ GP D GK+RMKPRDPRR+LH+N Q+ Sbjct: 752 NSILGVVPPASVAPLKPSALGQKPAGALQVPQTGPMDESGKVRMKPRDPRRILHANSFQR 811 Query: 814 SESLGSELAKSNGNLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFTKNLQN 993 S S GSE K+N Q+Q +Q +T ++PS S + PDISQQFTKNL+N Sbjct: 812 SGSSGSEQFKTNA---------------QKQEDQTETKSVPSHSVNPPDISQQFTKNLKN 856 Query: 994 LADIVSSSQASALPVGTQNSSQLIPSK---ISNDTTEPKT--------VTEMCTQGETAS 1140 +AD++S+SQAS++ T Q++ S+ ++ D + K +T ++ E+A+ Sbjct: 857 IADLMSASQASSM---TPTFPQILSSQSVQVNTDRMDVKATVSDSGDQLTANGSKPESAA 913 Query: 1141 GVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXXNS 1320 G N WGDV+HL DGYDDQQKAAIQ+ERARRIEEQ KMF+ RK NS Sbjct: 914 GPPQSKNTWGDVEHLFDGYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTLLNS 973 Query: 1321 AKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHL 1500 AKFVEVDP+H+EIL HLFRFPHMGMWTKLRPG+WNFLEKASKLYELHL Sbjct: 974 AKFVEVDPVHDEILRKKEEQDREKSQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHL 1033 Query: 1501 YTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAVV 1680 YTMGNKLYATEMAKVLDPKG LF GRVISKGD+GD D DERVPKSKDL+GVLGMESAVV Sbjct: 1034 YTMGNKLYATEMAKVLDPKGVLFAGRVISKGDDGDVLDGDERVPKSKDLEGVLGMESAVV 1093 Query: 1681 IIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGVI 1860 IIDDS+RVWPHNK NLIVVERY YFP SRRQFGLPGPSLLEIDHDERP+ GTLASSL VI Sbjct: 1094 IIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASSLAVI 1153 Query: 1861 ERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQF 2040 ER+HQ FFS+++L+EVDVRNILA+EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAE F Sbjct: 1154 ERIHQSFFSNRALDEVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAESF 1213 Query: 2041 GAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 2217 GA CTNQIDE VTHVVANS GTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK Sbjct: 1214 GAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1272 >dbj|GAV71470.1| BRCT domain-containing 
protein/NIF domain-containing protein [Cephalotus follicularis] Length = 1228 Score = 742 bits (1915), Expect = 0.0 Identities = 425/780 (54%), Positives = 502/780 (64%), Gaps = 43/780 (5%) Frame = +1 Query: 7 SGGLQMSSVNSSGGLQMSSVNSSGAFQMAPLNTSGGSQMDPVKTSAKSRDPRLRFMNSEV 186 SG QM + + G + S A +P T GS +K SAKSRDPRLR++NS+V Sbjct: 475 SGSPQMDASSMEG----LTTTRSPAPVSSPAPTVSGSN-PTMKPSAKSRDPRLRYVNSDV 529 Query: 187 G-------------GAPQNGFAAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRSRDAQVM 327 AP+ + SRK K +++P+ D LKRQ++ S S V+ Sbjct: 530 SVLDLTQRPLHLVHNAPKV-----ELGSRKQKTVEDPILDGPALKRQKSGSENSGLIGVL 584 Query: 328 P---GRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXXX 498 G GGW+ED MV +Q+ N KN+ + R + G P Sbjct: 585 KTTSGNGGWLEDTDMVGTQLLN-----KNVVLDPRKVDVGVTSPSIVHCNTNVGNEPLLV 639 Query: 499 XSASNMXXXXXXXXXXXXXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXG----PAVN 666 S+S+ KDIAVNPTML+ +LKM P N Sbjct: 640 TSSSSTASLPALL-------KDIAVNPTMLINILKMGQQQRLPAEVQQKSTDSLHPPTSN 692 Query: 667 GLSSAISPSPDVGQNPA-----------AKSQMNGPNDMGKIRMKPRDPRRVLHSNMVQK 813 L A+ NP+ Q + +D GKIRMKPRDPRRVLH N +Q+ Sbjct: 693 SLLGAVPSVNFASSNPSRILPKPAGTLPTTPQTSAMDDPGKIRMKPRDPRRVLHGNALQR 752 Query: 814 SESLGSELAKSNGNLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFTKNLQN 993 S SLGSE K N +PS KD L Q+ QA+T +PS S PDI++ FTKNL+N Sbjct: 753 SGSLGSEKLKMN--VPSTSSFQKDNLNAQKLEGQAETKPMPSLSIPQPDITRLFTKNLKN 810 Query: 994 LADIVSSSQASALPVGTQNSSQLI---PSKISNDTTEPKTV---TEMCTQGETASGVIDL 1155 + DI+S SQ +G+ N +Q + P++I D + K + +E G ++ + Sbjct: 811 INDIMSVSQPL---IGSPNVTQNLESQPAQIKADRVDVKAIVSNSEDPRTGTVSASEVGA 867 Query: 1156 ANP------WGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXXN 1317 A P WGDV+HL +GYDDQQKAAIQ+ERARR+EEQNKMFA K N Sbjct: 868 AGPARPQHAWGDVEHLFEGYDDQQKAAIQRERARRLEEQNKMFAAHKLCLVLDLDHTLLN 927 Query: 1318 SAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKASKLYELH 1497 SAKFVEVDP+H+EIL HLFRFPHMGMWTKLRPG+WNFLE+ASKL+ELH Sbjct: 928 SAKFVEVDPVHDEILRKKEEQDREKLHRHLFRFPHMGMWTKLRPGIWNFLERASKLFELH 987 Query: 1498 
LYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAV 1677 LYTMGNKLYATEMAKVLDPKG LF GRVIS+GD+GDPFD DERVPKSKDL+GVLGMESAV Sbjct: 988 LYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAV 1047 Query: 1678 VIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGV 1857 VIIDDS+RVWPHNK NLIVVERY YFP SRRQFGLPGPSLLEIDHDERP+ GTLAS+L V Sbjct: 1048 VIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASALTV 1107 Query: 1858 IERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQ 2037 IER+HQIFFS+Q L +VDVRNILA+EQ+KIL GC+I+FSRVFPVGEANPHLHPLWQTAEQ Sbjct: 1108 IERIHQIFFSYQPLGDVDVRNILASEQQKILDGCRILFSRVFPVGEANPHLHPLWQTAEQ 1167 Query: 2038 FGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 2217 FGA CTNQIDE VTHVVANS GTDKVNWALSTGRFVV+PGWVEASALLYRRANEQDF IK Sbjct: 1168 FGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVYPGWVEASALLYRRANEQDFGIK 1227 >ref|XP_012459418.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 isoform X2 [Gossypium raimondii] Length = 1251 Score = 739 bits (1907), Expect = 0.0 Identities = 424/781 (54%), Positives = 497/781 (63%), Gaps = 52/781 (6%) Frame = +1 Query: 31 VNSSGGLQMSSVNSS--GAF---QMAPLNTSGGSQMDPVKTSAKSRDPRLRFMNSEVGGA 195 V+S+ + +S SS G F P+ S S + K SAKSRDPRLRF NS V Sbjct: 477 VSSAPHIDSASSTSSMQGQFTTQNATPVTVSSASNILS-KASAKSRDPRLRFANSNVSAL 535 Query: 196 PQNGF----------AAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRS--RDAQVMPGRG 339 N +G ++ RK K+ +EPV D KRQ+NE RD Q + G G Sbjct: 536 DLNQRPLHNASKVPPVSGIMDPRKKKSTEEPVLDGPAPKRQKNELENFGVRDVQAVSGNG 595 Query: 340 GWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXXXXSASNMX 519 GW+ED SQ+ NR Q + + +R + G + M Sbjct: 596 GWLEDTDNCESQITNRNQTMETLDSNSRKMEHGVTCSSTLSGKTNTTVNKNEQVPLTGMS 655 Query: 520 XXXXXXXXXXXXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXGPAVNGLSSAIS---- 687 KDIAVNPTML+ +LKM P N L S Sbjct: 656 NPSLPALL-----KDIAVNPTMLINILKMGQQQRLPSESQQKTPDPLKNTLYQPSSNPVL 710 Query: 688 ---------PSPDVGQNPAAKS--------QMNGP--NDMGKIRMKPRDPRRVLHSNMVQ 810 PSP V P++ S + GP ++ KIRMKPRDPRRVLH N++Q Sbjct: 711 
GVIPPANVIPSPSVNVVPSSSSGTLSKPAGNLQGPPLDESCKIRMKPRDPRRVLHGNVLQ 770 Query: 811 KSESLGSELAKSNGNLP-SEVQSSKDLLTVQEQGE-QAQTNTLPSQSASLPDISQQFTKN 984 KS S+G + K+NG P S Q SKD + Q+Q E Q + + Q PDI+QQFT++ Sbjct: 771 KSGSVGPDQLKTNGTSPASSTQGSKDNMNAQKQLENQIEAKPIQCQFVPPPDIAQQFTQS 830 Query: 985 LQNLADIVSSSQASA-LPVGTQNSSQLIPSKISNDTTEPKTVTEMCTQGETASGVIDLA- 1158 L+N+A ++S Q+ A LP +QN P ++ ++T + T +T +G A Sbjct: 831 LKNIAGMMSGPQSFAGLPAVSQNLVSQ-PIQVKSETADKNTKGSNSEDQQTGTGTAPEAG 889 Query: 1159 --------NPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXX 1314 N WGDV+HL + YDD+QKAAIQ+ERARRIEEQ KMFA RK Sbjct: 890 VTCPPPSQNAWGDVEHLFEKYDDRQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLL 949 Query: 1315 NSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKASKLYEL 1494 NSAKF+EVDP+HEEIL HLFRF HMGMWTKLRPG+WNFLEKASKLYEL Sbjct: 950 NSAKFIEVDPVHEEILRKKEEQDREKPQRHLFRFHHMGMWTKLRPGIWNFLEKASKLYEL 1009 Query: 1495 HLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESA 1674 HLYTMGNKLYATEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMES+ Sbjct: 1010 HLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESS 1069 Query: 1675 VVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLG 1854 VVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDERP+ GTLASSL Sbjct: 1070 VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLA 1129 Query: 1855 VIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAE 2034 VIER+HQ FFSHQ+L+++DVRNILA EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAE Sbjct: 1130 VIERIHQNFFSHQNLDDLDVRNILATEQRKILSGCRIVFSRVFPVGEANPHLHPLWQTAE 1189 Query: 2035 QFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 2214 QFGA CTNQIDEHVTHVVANS GTDKVNWALSTG+FVVHPGWVEASALLYRRANE DFAI Sbjct: 1190 QFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEHDFAI 1249 Query: 2215 K 2217 K Sbjct: 1250 K 1250 >gb|PON91807.1| FCP1-like phosphatase [Trema orientalis] Length = 1294 Score = 739 bits (1909), Expect = 0.0 Identities = 425/784 (54%), Positives = 494/784 (63%), Gaps = 50/784 (6%) Frame = +1 Query: 16 
LQMSSVNSSGGLQMSSVNSSGAFQMAPLNTSGGSQMDPVKTSAKSRDPRLRFMNSEVGGA 195 L+ SS++ S + SS+ G ++G + VK SAKSRDPRLRF NS++ Sbjct: 520 LRPSSISPSTPVSSSSMQ--GPITAKNAASAGSASNSTVKASAKSRDPRLRFANSDLAAL 577 Query: 196 PQNGFAAGSV------------NSRKHKAIDEPVPDEHNLKRQRNESTRSR---DAQVMP 330 N +V +SRK + DE D KRQRN +R D + + Sbjct: 578 DLNLRPVTAVQNAPKVEPGEPTSSRKQRITDESNLDGSPYKRQRNSFENARIVGDVKTVS 637 Query: 331 GRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXXXXSAS 510 G GGW+EDNG V Q+NN+ ++ R LV P SA+ Sbjct: 638 GSGGWLEDNGFVGPQLNNKNHSMASLEADPRKLVHMVNCPTNNGPNMAKEQVPVTSTSAT 697 Query: 511 NMXXXXXXXXXXXXXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXG----------PA 660 KDIAVNPT+L+ LLK+ P+ Sbjct: 698 ---------ASLPELLKDIAVNPTLLINLLKLGQQQQQQQLVAETQPKSDPVKDSIHPPS 748 Query: 661 VNGLSSA-----ISPSPDVG--QNPAAK-------SQMNGPNDMGKIRMKPRDPRRVLHS 798 N + A I+PS G Q P+A + M+ +++GKIRMKPRDPRRVLH Sbjct: 749 SNSILGAAPLVNIAPSKASGILQTPSASFPVTSQVAAMSSQDELGKIRMKPRDPRRVLHG 808 Query: 799 NMVQKSESLGSELAKSNGNLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFT 978 + +QKS SLG E K+ + S +KD L Q Q QA T+PSQS PDI +QFT Sbjct: 809 STLQKSGSLGHEQLKTVVSPLSSTTGNKDNLNGQMQEGQADQKTVPSQSVLPPDIGRQFT 868 Query: 979 KNLQNLADIVSSSQASALP-VGTQN-SSQLIPSK---------ISNDTTEPKTVTEMCTQ 1125 KNL+N+ADI+S S S P + +QN +SQ +P K +SN + + T Sbjct: 869 KNLRNIADIISVSNVSTSPAIVSQNVASQPVPVKPERGDVKAVVSNSEDQRNGIL---TP 925 Query: 1126 GETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXX 1305 +G N WGDV+HL +GYDDQQKAAIQ+ER RR+EEQNKMF RK Sbjct: 926 EVAVAGPSRAPNAWGDVEHLFEGYDDQQKAAIQRERTRRLEEQNKMFEARKLCLVLDLDH 985 Query: 1306 XXXNSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKASKL 1485 NSAKFVEVDP+H+EIL HLFRFPHMGMWTKLRPGVWNFLEKASKL Sbjct: 986 TLLNSAKFVEVDPLHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGVWNFLEKASKL 1045 Query: 1486 YELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGM 1665 YELHLYTMGNKLYATEMAKVLDP G LF GRVIS+GD+GDPFD DERVPKSKDL+GVLGM Sbjct: 1046 YELHLYTMGNKLYATEMAKVLDPTGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM 1105 Query: 1666 
ESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLAS 1845 ESAVVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGLPGPSLLEIDHDERP+ GTLAS Sbjct: 1106 ESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEDGTLAS 1165 Query: 1846 SLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQ 2025 SL VIER+HQ FF+HQSL E DVRNILA+EQRKIL GC+IVFSRVFPV E NPHLHPLWQ Sbjct: 1166 SLSVIERIHQNFFNHQSLEEADVRNILASEQRKILAGCRIVFSRVFPVSEVNPHLHPLWQ 1225 Query: 2026 TAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQD 2205 TAEQFGA C QID+ VTHVVANS GTDKVNWA+S GRF VHPGWVEASALLYRRANEQD Sbjct: 1226 TAEQFGAVCITQIDDQVTHVVANSPGTDKVNWAISNGRFAVHPGWVEASALLYRRANEQD 1285 Query: 2206 FAIK 2217 F IK Sbjct: 1286 FTIK 1289 >ref|XP_012459417.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3 isoform X1 [Gossypium raimondii] gb|KJB77191.1| hypothetical protein B456_012G125200 [Gossypium raimondii] Length = 1272 Score = 739 bits (1907), Expect = 0.0 Identities = 424/781 (54%), Positives = 497/781 (63%), Gaps = 52/781 (6%) Frame = +1 Query: 31 VNSSGGLQMSSVNSS--GAF---QMAPLNTSGGSQMDPVKTSAKSRDPRLRFMNSEVGGA 195 V+S+ + +S SS G F P+ S S + K SAKSRDPRLRF NS V Sbjct: 498 VSSAPHIDSASSTSSMQGQFTTQNATPVTVSSASNILS-KASAKSRDPRLRFANSNVSAL 556 Query: 196 PQNGF----------AAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRS--RDAQVMPGRG 339 N +G ++ RK K+ +EPV D KRQ+NE RD Q + G G Sbjct: 557 DLNQRPLHNASKVPPVSGIMDPRKKKSTEEPVLDGPAPKRQKNELENFGVRDVQAVSGNG 616 Query: 340 GWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXXXXSASNMX 519 GW+ED SQ+ NR Q + + +R + G + M Sbjct: 617 GWLEDTDNCESQITNRNQTMETLDSNSRKMEHGVTCSSTLSGKTNTTVNKNEQVPLTGMS 676 Query: 520 XXXXXXXXXXXXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXGPAVNGLSSAIS---- 687 KDIAVNPTML+ +LKM P N L S Sbjct: 677 NPSLPALL-----KDIAVNPTMLINILKMGQQQRLPSESQQKTPDPLKNTLYQPSSNPVL 731 Query: 688 ---------PSPDVGQNPAAKS--------QMNGP--NDMGKIRMKPRDPRRVLHSNMVQ 810 PSP V P++ S + GP ++ KIRMKPRDPRRVLH N++Q Sbjct: 732 GVIPPANVIPSPSVNVVPSSSSGTLSKPAGNLQGPPLDESCKIRMKPRDPRRVLHGNVLQ 791 Query: 811 
KSESLGSELAKSNGNLP-SEVQSSKDLLTVQEQGE-QAQTNTLPSQSASLPDISQQFTKN 984 KS S+G + K+NG P S Q SKD + Q+Q E Q + + Q PDI+QQFT++ Sbjct: 792 KSGSVGPDQLKTNGTSPASSTQGSKDNMNAQKQLENQIEAKPIQCQFVPPPDIAQQFTQS 851 Query: 985 LQNLADIVSSSQASA-LPVGTQNSSQLIPSKISNDTTEPKTVTEMCTQGETASGVIDLA- 1158 L+N+A ++S Q+ A LP +QN P ++ ++T + T +T +G A Sbjct: 852 LKNIAGMMSGPQSFAGLPAVSQNLVSQ-PIQVKSETADKNTKGSNSEDQQTGTGTAPEAG 910 Query: 1159 --------NPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXX 1314 N WGDV+HL + YDD+QKAAIQ+ERARRIEEQ KMFA RK Sbjct: 911 VTCPPPSQNAWGDVEHLFEKYDDRQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLL 970 Query: 1315 NSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKASKLYEL 1494 NSAKF+EVDP+HEEIL HLFRF HMGMWTKLRPG+WNFLEKASKLYEL Sbjct: 971 NSAKFIEVDPVHEEILRKKEEQDREKPQRHLFRFHHMGMWTKLRPGIWNFLEKASKLYEL 1030 Query: 1495 HLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESA 1674 HLYTMGNKLYATEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMES+ Sbjct: 1031 HLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESS 1090 Query: 1675 VVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLG 1854 VVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDERP+ GTLASSL Sbjct: 1091 VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLA 1150 Query: 1855 VIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAE 2034 VIER+HQ FFSHQ+L+++DVRNILA EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAE Sbjct: 1151 VIERIHQNFFSHQNLDDLDVRNILATEQRKILSGCRIVFSRVFPVGEANPHLHPLWQTAE 1210 Query: 2035 QFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 2214 QFGA CTNQIDEHVTHVVANS GTDKVNWALSTG+FVVHPGWVEASALLYRRANE DFAI Sbjct: 1211 QFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEHDFAI 1270 Query: 2215 K 2217 K Sbjct: 1271 K 1271