BLASTX nr result

ID: Ophiopogon23_contig00022081 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ophiopogon23_contig00022081
         (2279 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ONK58600.1| uncharacterized protein A4U43_C09F14730 [Asparagu...   863   0.0  
ref|XP_020246902.1| RNA polymerase II C-terminal domain phosphat...   863   0.0  
ref|XP_010929653.1| PREDICTED: RNA polymerase II C-terminal doma...   810   0.0  
ref|XP_008791049.1| PREDICTED: RNA polymerase II C-terminal doma...   789   0.0  
gb|OVA17386.1| BRCT domain [Macleaya cordata]                         781   0.0  
gb|EOX99661.1| RNA polymerase II C-terminal domain phosphatase-l...   768   0.0  
ref|XP_010249185.1| PREDICTED: RNA polymerase II C-terminal doma...   767   0.0  
ref|XP_007043830.2| PREDICTED: RNA polymerase II C-terminal doma...   766   0.0  
ref|XP_021275229.1| RNA polymerase II C-terminal domain phosphat...   766   0.0  
gb|EEF50102.1| RNA polymerase II ctd phosphatase, putative [Rici...   754   0.0  
ref|XP_021662955.1| LOW QUALITY PROTEIN: RNA polymerase II C-ter...   752   0.0  
gb|KJB77193.1| hypothetical protein B456_012G125200 [Gossypium r...   739   0.0  
ref|XP_022737741.1| RNA polymerase II C-terminal domain phosphat...   748   0.0  
gb|KJB77192.1| hypothetical protein B456_012G125200 [Gossypium r...   739   0.0  
ref|XP_010656786.1| PREDICTED: RNA polymerase II C-terminal doma...   744   0.0  
ref|XP_010656789.1| PREDICTED: RNA polymerase II C-terminal doma...   743   0.0  
dbj|GAV71470.1| BRCT domain-containing protein/NIF domain-contai...   742   0.0  
ref|XP_012459418.1| PREDICTED: RNA polymerase II C-terminal doma...   739   0.0  
gb|PON91807.1| FCP1-like phosphatase [Trema orientalis]               739   0.0  
ref|XP_012459417.1| PREDICTED: RNA polymerase II C-terminal doma...   739   0.0  

>gb|ONK58600.1| uncharacterized protein A4U43_C09F14730 [Asparagus officinalis]
          Length = 1100

 Score =  863 bits (2229), Expect = 0.0
 Identities = 473/740 (63%), Positives = 520/740 (70%), Gaps = 8/740 (1%)
 Frame = +1

Query: 22   MSSVNSSGGLQMSSVNSSGAFQMAPLNTSG--GSQMDP-VKTSAKSRDPRLRFMNSEVGG 192
            M+ VNSS G QM  V SS   QMA + T G  GS+ +P VK SAKSRDPRLRFM SE   
Sbjct: 256  MAPVNSSDGFQMQPVYSSAGPQMAQVRTIGQVGSEANPAVKASAKSRDPRLRFMKSETSV 315

Query: 193  APQNGFAAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRSRDAQVMPGRGGWIEDNGMVAS 372
             P+NGFAAG VNS KHK+ DE V DEH+LKRQR +S  SRD +V   + G  +D      
Sbjct: 316  VPKNGFAAGPVNSLKHKSDDELVLDEHSLKRQRKDSMSSRDVKVTGSQSGATKD------ 369

Query: 373  QMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXXXXSASNMXXXXXXXXXXXX 552
                            RNLVG EVG +  L             S S              
Sbjct: 370  ----------------RNLVGTEVGYEMGLDADNNKLTVSSVPSTS--ISTTGPIVSLPS 411

Query: 553  XXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXGPAVNGLSSAISPSPDVGQNPAAKSQM 732
              K IA NP MLV+LL+M               G AVNGLSSA S S  +GQNP+ KSQM
Sbjct: 412  LLKGIAANPQMLVQLLRMEQQKIAAGQAQEKPDGQAVNGLSSATSSSSGIGQNPSVKSQM 471

Query: 733  -----NGPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGNLPSEVQSSKDLLTV 897
                 +  N+M KIRMKPRDPRRVLHSNM Q++E+ GS          S V S KD L  
Sbjct: 472  PPQTKSTNNNMAKIRMKPRDPRRVLHSNMAQRTENSGSAT--------SNVHSIKDQLLH 523

Query: 898  QEQGEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQASALPVGTQNSSQLIPSKI 1077
            ++QG+QAQ   +P QSASL DISQQFTKNLQNLAD+VS SQ                SK 
Sbjct: 524  RKQGDQAQKLAVPLQSASLTDISQQFTKNLQNLADMVSKSQ----------------SKT 567

Query: 1078 SNDTTEPKTVTEMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQN 1257
            S+D T+PK V E+C+Q +T       ANPWGDVDHLLDGYDD+QKAAIQKERARRIEEQN
Sbjct: 568  SDDLTKPKNVPELCSQTDTKPA----ANPWGDVDHLLDGYDDKQKAAIQKERARRIEEQN 623

Query: 1258 KMFAERKXXXXXXXXXXXXNSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWT 1437
            KMFA RK            NSAKFVE+DPIHEEIL             HLFR  HMGMWT
Sbjct: 624  KMFAARKLCLVLDLDHTLLNSAKFVEIDPIHEEILRKKEEQDRQTQERHLFRLQHMGMWT 683

Query: 1438 KLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDS 1617
            KLRPG+W FLEKAS+LYELH+YTMGNKLYATEMAK+LDPKG LF GRV+S+GD+GDPFD 
Sbjct: 684  KLRPGIWTFLEKASQLYELHVYTMGNKLYATEMAKLLDPKGNLFAGRVLSRGDDGDPFDG 743

Query: 1618 DERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSL 1797
            D+RVPKSKDLDGVLGMESAV+IIDDSLRVWPHNKHNLIVVERY YFPSSRRQFGL GPSL
Sbjct: 744  DDRVPKSKDLDGVLGMESAVLIIDDSLRVWPHNKHNLIVVERYTYFPSSRRQFGLIGPSL 803

Query: 1798 LEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSR 1977
            LEIDHDERP+ GTLASSL VIER+H+IFFSH  L EVDVRNIL AEQRKIL GCKIVFSR
Sbjct: 804  LEIDHDERPEDGTLASSLAVIERIHEIFFSHSCLTEVDVRNILGAEQRKILAGCKIVFSR 863

Query: 1978 VFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPG 2157
            +FPVGEANPHLHPLWQTAEQFGAECTNQIDE VTHVVANS GTDKVNWALSTGRFVVHPG
Sbjct: 864  IFPVGEANPHLHPLWQTAEQFGAECTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPG 923

Query: 2158 WVEASALLYRRANEQDFAIK 2217
            WVEASALLYRRANEQDFA+K
Sbjct: 924  WVEASALLYRRANEQDFAVK 943


>ref|XP_020246902.1| RNA polymerase II C-terminal domain phosphatase-like 3 isoform X1
            [Asparagus officinalis]
 ref|XP_020246903.1| RNA polymerase II C-terminal domain phosphatase-like 3 isoform X2
            [Asparagus officinalis]
          Length = 1127

 Score =  863 bits (2229), Expect = 0.0
 Identities = 473/740 (63%), Positives = 520/740 (70%), Gaps = 8/740 (1%)
 Frame = +1

Query: 22   MSSVNSSGGLQMSSVNSSGAFQMAPLNTSG--GSQMDP-VKTSAKSRDPRLRFMNSEVGG 192
            M+ VNSS G QM  V SS   QMA + T G  GS+ +P VK SAKSRDPRLRFM SE   
Sbjct: 439  MAPVNSSDGFQMQPVYSSAGPQMAQVRTIGQVGSEANPAVKASAKSRDPRLRFMKSETSV 498

Query: 193  APQNGFAAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRSRDAQVMPGRGGWIEDNGMVAS 372
             P+NGFAAG VNS KHK+ DE V DEH+LKRQR +S  SRD +V   + G  +D      
Sbjct: 499  VPKNGFAAGPVNSLKHKSDDELVLDEHSLKRQRKDSMSSRDVKVTGSQSGATKD------ 552

Query: 373  QMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXXXXSASNMXXXXXXXXXXXX 552
                            RNLVG EVG +  L             S S              
Sbjct: 553  ----------------RNLVGTEVGYEMGLDADNNKLTVSSVPSTS--ISTTGPIVSLPS 594

Query: 553  XXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXGPAVNGLSSAISPSPDVGQNPAAKSQM 732
              K IA NP MLV+LL+M               G AVNGLSSA S S  +GQNP+ KSQM
Sbjct: 595  LLKGIAANPQMLVQLLRMEQQKIAAGQAQEKPDGQAVNGLSSATSSSSGIGQNPSVKSQM 654

Query: 733  -----NGPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGNLPSEVQSSKDLLTV 897
                 +  N+M KIRMKPRDPRRVLHSNM Q++E+ GS          S V S KD L  
Sbjct: 655  PPQTKSTNNNMAKIRMKPRDPRRVLHSNMAQRTENSGSAT--------SNVHSIKDQLLH 706

Query: 898  QEQGEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQASALPVGTQNSSQLIPSKI 1077
            ++QG+QAQ   +P QSASL DISQQFTKNLQNLAD+VS SQ                SK 
Sbjct: 707  RKQGDQAQKLAVPLQSASLTDISQQFTKNLQNLADMVSKSQ----------------SKT 750

Query: 1078 SNDTTEPKTVTEMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQN 1257
            S+D T+PK V E+C+Q +T       ANPWGDVDHLLDGYDD+QKAAIQKERARRIEEQN
Sbjct: 751  SDDLTKPKNVPELCSQTDTKPA----ANPWGDVDHLLDGYDDKQKAAIQKERARRIEEQN 806

Query: 1258 KMFAERKXXXXXXXXXXXXNSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWT 1437
            KMFA RK            NSAKFVE+DPIHEEIL             HLFR  HMGMWT
Sbjct: 807  KMFAARKLCLVLDLDHTLLNSAKFVEIDPIHEEILRKKEEQDRQTQERHLFRLQHMGMWT 866

Query: 1438 KLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDS 1617
            KLRPG+W FLEKAS+LYELH+YTMGNKLYATEMAK+LDPKG LF GRV+S+GD+GDPFD 
Sbjct: 867  KLRPGIWTFLEKASQLYELHVYTMGNKLYATEMAKLLDPKGNLFAGRVLSRGDDGDPFDG 926

Query: 1618 DERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSL 1797
            D+RVPKSKDLDGVLGMESAV+IIDDSLRVWPHNKHNLIVVERY YFPSSRRQFGL GPSL
Sbjct: 927  DDRVPKSKDLDGVLGMESAVLIIDDSLRVWPHNKHNLIVVERYTYFPSSRRQFGLIGPSL 986

Query: 1798 LEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSR 1977
            LEIDHDERP+ GTLASSL VIER+H+IFFSH  L EVDVRNIL AEQRKIL GCKIVFSR
Sbjct: 987  LEIDHDERPEDGTLASSLAVIERIHEIFFSHSCLTEVDVRNILGAEQRKILAGCKIVFSR 1046

Query: 1978 VFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPG 2157
            +FPVGEANPHLHPLWQTAEQFGAECTNQIDE VTHVVANS GTDKVNWALSTGRFVVHPG
Sbjct: 1047 IFPVGEANPHLHPLWQTAEQFGAECTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPG 1106

Query: 2158 WVEASALLYRRANEQDFAIK 2217
            WVEASALLYRRANEQDFA+K
Sbjct: 1107 WVEASALLYRRANEQDFAVK 1126


>ref|XP_010929653.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Elaeis guineensis]
          Length = 1268

 Score =  810 bits (2091), Expect = 0.0
 Identities = 453/803 (56%), Positives = 534/803 (66%), Gaps = 64/803 (7%)
 Frame = +1

Query: 1    NSSGGLQMSSVNSSGGLQMSS-----VNSSGAFQ---MAPLNTSGGSQMDPVKTSAKSRD 156
            +SS       VN++  +Q+++      +SS + Q   + P+   G +     + + KSRD
Sbjct: 478  SSSANRNAGCVNTTSQIQVATSSAACTDSSSSHQPGTVKPVGQLGSAPNLATRPALKSRD 537

Query: 157  PRLRFMNSEVGGA--------------PQNGFAAGSVNSRKHKAIDEPVPDEHNLKRQRN 294
            PRLRF++SE G A              P NG   G  N RKHKA+DE +P+ H LKRQRN
Sbjct: 538  PRLRFVSSESGSASDPNTQVMSLDSSAPNNGPVGGITNPRKHKAVDESLPENHTLKRQRN 597

Query: 295  ESTRSRDAQVMPGRGG-WIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRR---- 459
              T S D Q++PGRGG W++D+  V SQ +++++ ++NM +  +N V   VG DRR    
Sbjct: 598  GLTNSGDVQMIPGRGGGWLDDSSAVGSQPSDKIRLSENMEIETKNPVS-VVGSDRRPDSN 656

Query: 460  -------LVXXXXXXXXXXXXSASNMXXXXXXXXXXXXXXKDIAVNPTMLVELLKMXXXX 618
                                 S++                KDIAVNPTML++L++M    
Sbjct: 657  PNIHVSNTGTCPIPSSTAAPASSTAPSSSAAASVSFPSLLKDIAVNPTMLMQLIQMEQQR 716

Query: 619  XXXXXXXXXXX-------GPAVNGLSSAISP-------SPDVGQNPAAKSQM-------N 735
                                ++N LS A+S        S +VGQNP  + Q+       N
Sbjct: 717  LSAEAQQKTVGLMQNMAHASSLNVLSGAVSSATVASMKSTEVGQNPGGRPQVPPQTVSTN 776

Query: 736  GPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGNLPSEVQSSKDLLTVQEQGEQ 915
              +D+G+IRMKPRDPRRVLH NMVQK+E++ SE AK NG L S+ QSSKD   + EQGEQ
Sbjct: 777  SQSDVGRIRMKPRDPRRVLH-NMVQKNETVVSERAKPNGTLSSDPQSSKDQSAIGEQGEQ 835

Query: 916  AQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQASALP-VGTQNSSQLIPSKISND-- 1086
            AQ  TLP+Q         QF KN +NL DI S+ Q++  P   +Q  SQ I  KI+    
Sbjct: 836  AQATTLPTQ---------QFAKNTKNLGDISSTLQSTTTPPAASQIISQPIQLKINKVDP 886

Query: 1087 ------TTEPKTVTEMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIE 1248
                   ++PKT++ + ++G T +G     NPWGDVDHLLDGYDDQQKAAIQ+ERARRI 
Sbjct: 887  RPAAAVVSDPKTLSAVTSEGST-TGATPSTNPWGDVDHLLDGYDDQQKAAIQRERARRIA 945

Query: 1249 EQNKMFAERKXXXXXXXXXXXXNSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMG 1428
            EQNKMFA RK            NSAKFVEVDP+HEEIL             HLFRF HMG
Sbjct: 946  EQNKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHEEILRKKEEQDREKPQRHLFRFQHMG 1005

Query: 1429 MWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDP 1608
            MWTKLRPG+W FLEKASKLYE+HLYTMGNKLYATEMAKVLDP GTLF GRVIS+GD+GDP
Sbjct: 1006 MWTKLRPGIWTFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGTLFAGRVISRGDDGDP 1065

Query: 1609 FDSDERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPG 1788
            FD DERVPKSKDLDGVLGMESAVVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGL G
Sbjct: 1066 FDGDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLFG 1125

Query: 1789 PSLLEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIV 1968
            PSLLEIDHDERP+ GTLASSL VIER+HQ FFSH SLN++DVRNILAAEQRKIL GCKIV
Sbjct: 1126 PSLLEIDHDERPEDGTLASSLAVIERIHQNFFSHHSLNDIDVRNILAAEQRKILAGCKIV 1185

Query: 1969 FSRVFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVV 2148
            FSRVFPVGEANPHLHPLWQ AEQFGA CTNQIDE VTHVVANS GTDKVNWALSTGRFVV
Sbjct: 1186 FSRVFPVGEANPHLHPLWQMAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVV 1245

Query: 2149 HPGWVEASALLYRRANEQDFAIK 2217
            HPGWVEASALLYRR +E DFA+K
Sbjct: 1246 HPGWVEASALLYRRVSEHDFAVK 1268


>ref|XP_008791049.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Phoenix dactylifera]
          Length = 1269

 Score =  789 bits (2038), Expect = 0.0
 Identities = 447/803 (55%), Positives = 522/803 (65%), Gaps = 64/803 (7%)
 Frame = +1

Query: 1    NSSGGLQMSSVNSSGGLQMSS-----VNSSGAFQMAPLNTSG--GSQMDP-VKTSAKSRD 156
            +SS       VN++  +Q+++      +SS   Q  P+   G  GS  +P ++ + KSRD
Sbjct: 479  SSSANGNAGCVNTTSEIQVATNSAACTDSSSRHQPGPVKPVGQLGSAPNPAIRPALKSRD 538

Query: 157  PRLRFMNSEVGGA--------------PQNGFAAGSVNSRKHKAIDEPVPDEHNLKRQRN 294
            PRLRF+NSE G A              P N    G  N RKHKA+DE  P+ H LKRQ+N
Sbjct: 539  PRLRFVNSESGNASDPNRRAMSLDFSAPNNDLVGGITNPRKHKAVDESFPENHTLKRQKN 598

Query: 295  ESTRSRDAQVMPGRGG-WIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRR---- 459
              T S D Q+ PGRGG W+ED+  V SQ++++++ N+NM +  +N  G  V  DRR    
Sbjct: 599  GLTNSSDVQMTPGRGGGWLEDSSSVRSQLSDKIRLNENMEIEIKN-PGNVVMSDRRPDSN 657

Query: 460  -------LVXXXXXXXXXXXXSASNMXXXXXXXXXXXXXXKDIAVNPTMLVELLKMXXXX 618
                                 S +                KDIAVNPTML++L+++    
Sbjct: 658  PNIQVTNTGTCMIPSSTTAPSSGTAPSSSAAASVSFPSLLKDIAVNPTMLMQLIQIEQQR 717

Query: 619  XXXXXXXXXXX-------GPAVNGLSSAISP-------SPDVGQNPAAKSQM-------N 735
                                ++N L  A+S        S +VG NP+ + Q+       N
Sbjct: 718  LSAEAQQKTVGLMHNMAHASSLNVLPGAVSSANVASMKSAEVGHNPSGRPQVTAQTVSTN 777

Query: 736  GPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGNLPSEVQSSKDLLTVQEQGEQ 915
              +D+G+IRMKPRDPRR+LH NMVQK+E++ SE AK NG L S+ QSSKD L + EQGEQ
Sbjct: 778  SQSDVGRIRMKPRDPRRILH-NMVQKNETIVSERAKPNGTLSSDPQSSKDHLAIGEQGEQ 836

Query: 916  AQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQASALPVGT-QNSSQLIPSKISND-- 1086
            AQ   LP+          Q  KN +NL DI S  Q +  P+   Q  SQ I   I+    
Sbjct: 837  AQATGLPTL---------QLAKNPKNLGDISSPLQLTTTPLAVPQIISQPIQFNINKVDL 887

Query: 1087 ------TTEPKTVTEMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIE 1248
                    +PKT++ + ++G T        N WGDVDHLLDGYDDQQKAAIQ+ERARRI 
Sbjct: 888  RPAAAVVNDPKTLSTVASEGSTTVAT-QSTNAWGDVDHLLDGYDDQQKAAIQRERARRIA 946

Query: 1249 EQNKMFAERKXXXXXXXXXXXXNSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMG 1428
            EQNKMFA RK            NSAKFVEVDP+HEEIL             HLFRF HMG
Sbjct: 947  EQNKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHEEILRKKEEQDREKPQRHLFRFQHMG 1006

Query: 1429 MWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDP 1608
            MWTKLRPG+WNFLEKASKLYE+HLYTMGNKLYATEMAKVLDP GTLF GRVIS+GD+ +P
Sbjct: 1007 MWTKLRPGIWNFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGTLFAGRVISRGDDSEP 1066

Query: 1609 FDSDERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPG 1788
            FD DERVPKSKDLDGVLGMESAVVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGL G
Sbjct: 1067 FDGDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLFG 1126

Query: 1789 PSLLEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIV 1968
            PSLLEIDHDERP+ GTLASSL VIER+H  FFSH+SLN+VDVRNILAAEQRKIL GCKIV
Sbjct: 1127 PSLLEIDHDERPEDGTLASSLTVIERIHDDFFSHRSLNDVDVRNILAAEQRKILAGCKIV 1186

Query: 1969 FSRVFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVV 2148
            FSRVFPVGEANPHLHPLWQ AEQFGA CTNQIDE VTHVVANS GTDKVNWALSTGRFVV
Sbjct: 1187 FSRVFPVGEANPHLHPLWQMAEQFGAACTNQIDEQVTHVVANSLGTDKVNWALSTGRFVV 1246

Query: 2149 HPGWVEASALLYRRANEQDFAIK 2217
            HP WVEASALLYRR NEQDFA+K
Sbjct: 1247 HPSWVEASALLYRRVNEQDFAVK 1269


>gb|OVA17386.1| BRCT domain [Macleaya cordata]
          Length = 1214

 Score =  781 bits (2018), Expect = 0.0
 Identities = 436/746 (58%), Positives = 502/746 (67%), Gaps = 45/746 (6%)
 Frame = +1

Query: 115  SQMDPV-KTSAKSRDPRLRFMNSEVGG-------------APQNGFAAGSVNSRKHKAID 252
            S ++PV +   KSRDPRLRF+NSEVG              AP++    G ++SRK+K   
Sbjct: 472  SGINPVLRPQPKSRDPRLRFLNSEVGSVDLNQRSPYVEYNAPKSETLGGIISSRKNKTDP 531

Query: 253  EPVPDEHNLKRQRN---ESTRSRDAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNR 423
            E V D H LKRQRN     T S   Q+  G GGW+ED   V  Q   R+Q  +++    R
Sbjct: 532  ESVLDGHTLKRQRNGLTSPTVSGGVQMSSGSGGWLEDISTVRPQPTPRIQLAESVGSDPR 591

Query: 424  NLVGGEVGPDRRLVXXXXXXXXXXXXSASNMXXXXXXXXXXXXXXKDIAVNPTMLVELLK 603
             +  GEV    R              +                  KDIAVNPTML+ L++
Sbjct: 592  MIGNGEVLSGLRQDTSSSNINVRAGGNDQLPLTGIDTMGSLPSLLKDIAVNPTMLINLIR 651

Query: 604  -MXXXXXXXXXXXXXXXGPAVNGLSSAISP------------SPDVGQNPAAKSQMNGP- 741
                                + G SS + P            S ++ Q PA K Q  GP 
Sbjct: 652  EQQRLAAETQQKSSNPTQNKITGSSSNVLPRSVPLANVASSKSSEIEQKPAVKPQ--GPA 709

Query: 742  -----NDMGKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGNLPSEVQSSKDLLTVQEQ 906
                  + GKIRMKPRDPRR+LH++  QK+E LG E  K+ G   S +Q+SKD L V++Q
Sbjct: 710  ETISTGEFGKIRMKPRDPRRILHNSTFQKNECLGLEQLKTIGASSSLIQASKDNLIVRQQ 769

Query: 907  GEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQASALP--VGTQNSSQLIPSKIS 1080
            GE AQTN+LPS SA  PDI+QQFTK L+NLADI+SSSQA+ +P  V    SSQ IP+KI 
Sbjct: 770  GELAQTNSLPSHSAPAPDIAQQFTKELKNLADILSSSQATNIPSVVPQTVSSQTIPTKI- 828

Query: 1081 NDTTEPKTVTEMCTQGETASG-------VIDLANPWGDVDHLLDGYDDQQKAAIQKERAR 1239
             DTT+ +TV  +    ++ +G       V+   N W DV+HL +GYDDQQ+AAI +ERAR
Sbjct: 829  -DTTDMRTVVTVPKDQQSGTGTTPEEGTVLPSENKWEDVEHLFEGYDDQQRAAIHRERAR 887

Query: 1240 RIEEQNKMFAERKXXXXXXXXXXXXNSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFP 1419
            RIEEQNKMFA RK            NSAKF+EVDP+H+EIL             HLFRFP
Sbjct: 888  RIEEQNKMFAARKLCLVLDLDHTLLNSAKFIEVDPVHDEILRKKEEQDREKPHRHLFRFP 947

Query: 1420 HMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDE 1599
            HMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDP G LF GRVISKGDE
Sbjct: 948  HMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGVLFAGRVISKGDE 1007

Query: 1600 GDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFG 1779
            GDPFD DERVPKSKDL+GVLGMES+VVIIDDS+RVWPHNK NLIVVERY YFP SRRQFG
Sbjct: 1008 GDPFDGDERVPKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 1067

Query: 1780 LPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGC 1959
            L GPSLLEIDHDERP+ GTLASSL VIER+HQ FFSH SL++VDVRNILA+EQRKIL GC
Sbjct: 1068 LLGPSLLEIDHDERPEEGTLASSLAVIERIHQNFFSHMSLHDVDVRNILASEQRKILAGC 1127

Query: 1960 KIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGR 2139
            +IVFSRVFPVGEANPHLHPLWQ+AEQFGA CT QIDE VTHVVANS GTDKVNWALSTGR
Sbjct: 1128 RIVFSRVFPVGEANPHLHPLWQSAEQFGAVCTTQIDEQVTHVVANSLGTDKVNWALSTGR 1187

Query: 2140 FVVHPGWVEASALLYRRANEQDFAIK 2217
            FVVHP WVEAS LLYRRANE DFA+K
Sbjct: 1188 FVVHPSWVEASTLLYRRANEHDFAVK 1213


>gb|EOX99661.1| RNA polymerase II C-terminal domain phosphatase-like 3, putative
            [Theobroma cacao]
          Length = 1290

 Score =  768 bits (1983), Expect = 0.0
 Identities = 425/770 (55%), Positives = 499/770 (64%), Gaps = 52/770 (6%)
 Frame = +1

Query: 64   VNSSGAFQMAPLNTSGGSQMDPV-----KTSAKSRDPRLRFMNSEVGGAPQN-------- 204
            V+S+ +     + T   + M  V     K+ AKSRDPRL F NS       N        
Sbjct: 528  VDSASSSLQGQITTRNATPMSSVSNIVSKSLAKSRDPRLWFANSNASALDLNERLLHNAS 587

Query: 205  --GFAAGSVNSRKHKAIDEPVPDEHNLKRQRNESTR---SRDAQVMPGRGGWIEDNGMVA 369
                  G ++SRK K+++EP+ D   LKRQRNE      +RD Q + G GGW+ED   + 
Sbjct: 588  KVAPVGGIMDSRKKKSVEEPILDSPALKRQRNELENLGVARDVQTVSGIGGWLEDTDAIG 647

Query: 370  SQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXXXXSASNMXXXXXXXXXXX 549
            SQ+ NR Q  +N+   +R +  G                     +   +           
Sbjct: 648  SQITNRNQTAENLESNSRKMDNGVTSSST-----LSGKTNITVGTNEQVPVTSTSTPSLP 702

Query: 550  XXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXG--------PAVNGLSSAIS-----P 690
               KDIAVNPTML+ +LKM                        P+ N L   +S     P
Sbjct: 703  ALLKDIAVNPTMLINILKMGQQQRLGAEAQQKSPDPVKSTFHQPSSNSLLGVVSSTNVIP 762

Query: 691  SPDV----------GQNPAAKSQMNGPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELA 840
            SP V             PA   Q+  P++ GKIRMKPRDPRRVLH N +Q+S S+G +  
Sbjct: 763  SPSVNNVPSISSGISSKPAGNLQVPSPDESGKIRMKPRDPRRVLHGNSLQRSGSMGLDQL 822

Query: 841  KSNGNLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQ 1020
            K+NG L S  Q SKD L  Q+   Q ++  + SQ    PDI+QQFT NL+N+ADI+S SQ
Sbjct: 823  KTNGALTSSTQGSKDNLNAQKLDSQTESKPMQSQLVPPPDITQQFTNNLKNIADIMSVSQ 882

Query: 1021 A-SALPVGTQNSSQLIPSK--ISNDTTEPKTVTEMCTQGETASGVIDLA--------NPW 1167
            A ++LP  + N   L+P    I +D+ + K +       +T +G+   A        N W
Sbjct: 883  ALTSLPPVSHN---LVPQPVLIKSDSMDMKALVSNSEDQQTGAGLAPEAGATGPRSQNAW 939

Query: 1168 GDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXXNSAKFVEVDPI 1347
            GDV+HL + YDDQQKAAIQ+ERARRIEEQ KMF+ RK            NSAKF+EVDP+
Sbjct: 940  GDVEHLFERYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFIEVDPV 999

Query: 1348 HEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYA 1527
            HEEIL             HLFRF HMGMWTKLRPG+WNFLEKASKLYELHLYTMGNKLYA
Sbjct: 1000 HEEILRKKEEQDREKPERHLFRFHHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYA 1059

Query: 1528 TEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRVW 1707
            TEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMESAVVIIDDS+RVW
Sbjct: 1060 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESAVVIIDDSVRVW 1119

Query: 1708 PHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFFS 1887
            PHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDERP+ GTLASSL VIER+HQ FFS
Sbjct: 1120 PHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIERIHQDFFS 1179

Query: 1888 HQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQID 2067
            HQ+L++VDVRNILA+EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAEQFGA CTNQID
Sbjct: 1180 HQNLDDVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQID 1239

Query: 2068 EHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 2217
            EHVTHVVANS GTDKVNWALSTG+FVVHPGWVEASALLYRRANE DFAIK
Sbjct: 1240 EHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEVDFAIK 1289


>ref|XP_010249185.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Nelumbo nucifera]
          Length = 1313

 Score =  767 bits (1980), Expect = 0.0
 Identities = 437/787 (55%), Positives = 516/787 (65%), Gaps = 55/787 (6%)
 Frame = +1

Query: 22   MSSVNSSGGLQMSSVNSSGAFQMA-----PLNTSG--GSQMDPVKTSAKSRDPRLRFMNS 180
            ++++NSS  L+  S  +S A  ++     P  + G  GS    V  +AK+RDPRLR+ NS
Sbjct: 529  VATINSSTSLKTVSSATSYADNLSGQGLVPAVSVGQLGSMSSHVIRTAKNRDPRLRYANS 588

Query: 181  EVGGA-----PQNGF--------AAGSVNSRKHKAIDEPVPDEHNLKRQRN---ESTRSR 312
            EVG       P +G           G + SRKHK ++E + D+H  KRQRN    S  S 
Sbjct: 589  EVGPLDLNQRPPSGDHDIRKSEPLGGIMGSRKHKIVEESLLDDHTFKRQRNGLINSGASG 648

Query: 313  DAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXX 492
            D QV+ G GGW+E++  +  Q  +R +  +      R L  GE     +           
Sbjct: 649  DVQVVSGSGGWLEESSSMGLQPTDRSRLIEKRESDPRKLGSGEASFGNKQDTGCSTYNVT 708

Query: 493  XXXSASNMXXXXXXXXXXXXXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXGPAVNGL 672
               +                  KDIAVNPTML+ L+KM                PA + +
Sbjct: 709  TGGNEQLTASGIGSTVSLPSLLKDIAVNPTMLMHLIKMEHQRLAVEALQKCG-NPAQSTM 767

Query: 673  ---SSAISPSPDVGQNPAAKS-------------------QMNGPNDMGKIRMKPRDPRR 786
               SS++ P      N A+K+                    M    D+GKIRMKPRDPRR
Sbjct: 768  QSSSSSVMPGKIASVNIASKTLSEPEKKSAGNSQISVQTASMIPHGDLGKIRMKPRDPRR 827

Query: 787  VLHSNMVQKSESLGSELAKSNGNLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDIS 966
            +LHSN  QKS+S G E  K+NG       + +D L V++QGEQAQTN+L SQS + PDI+
Sbjct: 828  ILHSNTFQKSDSSGPERFKANGTPSPNTPTCRDNLIVRQQGEQAQTNSLLSQSTAPPDIA 887

Query: 967  QQFTKNLQNLADIVSSSQASALP--VGTQNSSQLIPSK--------ISNDTTEPKTVTEM 1116
            QQFTK L+N+A+I+S+SQA   P  V    SSQ +P+K        ++ D+ + ++ + +
Sbjct: 888  QQFTKKLKNIANILSASQAINTPSVVPQTISSQPVPAKMDKVDMKVVATDSNDQRSWSAL 947

Query: 1117 CTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXX 1296
             T  E A+G     N WGDV+HL +GYDDQQKAAIQ+ERARRIEEQN+MFA RK      
Sbjct: 948  -TPEERAAGPSS-QNAWGDVEHLFEGYDDQQKAAIQRERARRIEEQNQMFAARKLCLVLD 1005

Query: 1297 XXXXXXNSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKA 1476
                  NSAKFVEVDP+HEE+L             HLFRF HMGMWTKLRPG+WNFLEKA
Sbjct: 1006 LDHTLLNSAKFVEVDPVHEEMLRKKEEQDREKPQRHLFRFTHMGMWTKLRPGIWNFLEKA 1065

Query: 1477 SKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGV 1656
            SKLYELHLYTMGNKLYATEMAKVLDP G LF GRVIS+GD+GDPFD DER PKSKDLDGV
Sbjct: 1066 SKLYELHLYTMGNKLYATEMAKVLDPTGVLFAGRVISRGDDGDPFDGDERQPKSKDLDGV 1125

Query: 1657 LGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGT 1836
            LGMESAVVIIDDS+RVWPHNK NLIVVERY YFP SRRQ GL GPSLLEIDHDERP+ GT
Sbjct: 1126 LGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQLGLHGPSLLEIDHDERPEDGT 1185

Query: 1837 LASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHP 2016
            LASSL VIER+HQ FFSHQ+LN+VDVRNILAAEQ+KIL GC+IVFSRVFPVGEANPHLHP
Sbjct: 1186 LASSLAVIERIHQNFFSHQNLNDVDVRNILAAEQQKILAGCRIVFSRVFPVGEANPHLHP 1245

Query: 2017 LWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRAN 2196
            LWQTAEQFGA CTNQIDE VTHVVA S GTDKVNWALSTGRFVVHPGWVEASALLYRRAN
Sbjct: 1246 LWQTAEQFGAVCTNQIDEQVTHVVAISLGTDKVNWALSTGRFVVHPGWVEASALLYRRAN 1305

Query: 2197 EQDFAIK 2217
            E DFAIK
Sbjct: 1306 EHDFAIK 1312


>ref|XP_007043830.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Theobroma cacao]
          Length = 1290

 Score =  766 bits (1978), Expect = 0.0
 Identities = 426/771 (55%), Positives = 496/771 (64%), Gaps = 53/771 (6%)
 Frame = +1

Query: 64   VNSSGAFQMAPLNTSGGSQMDPV-----KTSAKSRDPRLRFMNSEVGGAPQN-------- 204
            V+S+ +     + T   + M  V     K+ AKSRDPRL F NS       N        
Sbjct: 528  VDSASSSLQGQITTRNATPMSSVSNIVSKSLAKSRDPRLWFANSNASALDLNERLLHNAS 587

Query: 205  --GFAAGSVNSRKHKAIDEPVPDEHNLKRQRNESTR---SRDAQVMPGRGGWIEDNGMVA 369
                  G ++SRK K+++EP+ D   LKRQRNE      +RD Q + G GGW+ED   + 
Sbjct: 588  KVAPVGGIMDSRKKKSVEEPILDSPALKRQRNELENLGVARDVQTVSGIGGWLEDTDAIG 647

Query: 370  SQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXXXXSASNMXXXXXXXXXXX 549
            SQ+ NR Q  +N+   +R +  G                     +   +           
Sbjct: 648  SQITNRNQTAENLESNSRKMDNGVTSSST-----LSGKTNITVGTNEQVPVTSTSTPSLP 702

Query: 550  XXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXG--------PAVNGLSSAIS-----P 690
               KDIAVNPTML+ +LKM                        P+ N L   +S     P
Sbjct: 703  ALLKDIAVNPTMLINILKMGQQQRLGAEAQQKSPDPVKSTFHQPSSNSLLGVVSSTNVIP 762

Query: 691  SPDV----------GQNPAAKSQMNGPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELA 840
            SP V             PA   Q+  P++ GKIRMKPRDPRRVLH N +Q+S S+G +  
Sbjct: 763  SPSVNNVPSISSGISSKPAGNLQVPSPDESGKIRMKPRDPRRVLHGNSLQRSGSMGPDQL 822

Query: 841  KSNGNLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQ 1020
            K+NG L S  Q SKD L  Q+   Q ++  + SQ    PDI+QQFT NL+N+A IVS SQ
Sbjct: 823  KTNGALTSSTQGSKDNLNAQKLDSQTESKPMQSQLVPPPDITQQFTNNLKNIAGIVSVSQ 882

Query: 1021 A--SALPVGTQNSSQLIPSK--ISNDTTEPKTVTEMCTQGETASGVIDLA--------NP 1164
            A  S  PV    S  L+P    I +D+ + K +       +T +G+   A        N 
Sbjct: 883  ALTSLSPV----SHNLVPQPVLIKSDSMDMKALVSNSEDQQTGAGLAPEAGATGPHSQNA 938

Query: 1165 WGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXXNSAKFVEVDP 1344
            WGDV+HL + YDDQQKAAIQ+ERARRIEEQ KMF+ RK            NSAKF+EVDP
Sbjct: 939  WGDVEHLFERYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFIEVDP 998

Query: 1345 IHEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLY 1524
            +HEEIL             HLFRF HMGMWTKLRPG+WNFLEKASKLYELHLYTMGNKLY
Sbjct: 999  VHEEILRKKEEQDREKPERHLFRFHHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLY 1058

Query: 1525 ATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRV 1704
            ATEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMESAVVIIDDS+RV
Sbjct: 1059 ATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESAVVIIDDSVRV 1118

Query: 1705 WPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFF 1884
            WPHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDERP+ GTLASSL VIER+HQ FF
Sbjct: 1119 WPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIERIHQDFF 1178

Query: 1885 SHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQI 2064
            SHQ+L++VDVRNILA+EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAEQFGA CTNQI
Sbjct: 1179 SHQNLDDVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQI 1238

Query: 2065 DEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 2217
            DEHVTHVVANS GTDKVNWALSTG+FVVHPGWVEASALLYRRANE DFAIK
Sbjct: 1239 DEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEVDFAIK 1289


>ref|XP_021275229.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Herrania
            umbratica]
          Length = 1291

 Score =  766 bits (1978), Expect = 0.0
 Identities = 424/770 (55%), Positives = 499/770 (64%), Gaps = 49/770 (6%)
 Frame = +1

Query: 55   MSSVNSSGAFQMAPLNTSGGSQMDPV--KTSAKSRDPRLRFMNSEVGGAPQN-------- 204
            + S NSS   Q+   N +  S +  +  K+ AKSRDPRL F N+       N        
Sbjct: 529  VDSANSSLQGQITTRNATPMSSVSNIVSKSLAKSRDPRLWFANTNASALDLNERPLHNAS 588

Query: 205  --GFAAGSVNSRKHKAIDEPVPDEHNLKRQRNESTR---SRDAQVMPGRGGWIEDNGMVA 369
                  G ++SRK K+++EP+ D   LKRQRNE      +RD Q + G GGW+ED  ++ 
Sbjct: 589  KVAPVGGIMDSRKRKSVEEPILDGPALKRQRNELENLGVARDVQTVCGIGGWLEDTDVIG 648

Query: 370  SQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXXXXSASNMXXXXXXXXXXX 549
            SQ+ NR Q  +N+   +R +  G                     +   +           
Sbjct: 649  SQITNRNQTAENLESNSRKMDNGVTSSST-----LSGKTNMTVGTNEQVPVTSTSTPSLP 703

Query: 550  XXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXGPAVNGL-------------SSAISP 690
               KDIAVNPTML+ +LKM                P  + L             S+ +  
Sbjct: 704  ALLKDIAVNPTMLISILKMGQQQRLGAEAQQKSPDPVKSTLHQPSSNSLLGVVSSTNVIS 763

Query: 691  SPDVG----------QNPAAKSQMNGPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELA 840
            SP V             PA   Q+  P++ GKIRMKPRDPRRVLH N +Q+S S+G +  
Sbjct: 764  SPSVNNVPSISSGILSKPAGNLQVPSPDESGKIRMKPRDPRRVLHGNSLQRSGSMGPDQL 823

Query: 841  KSNGNLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQ 1020
            K+NG L S  Q SKD L  Q    Q ++  + SQ    PDI+QQFTKNL+N+ADI+S SQ
Sbjct: 824  KTNGALTSSTQGSKDNLNAQNLDSQTESKPMQSQLVPPPDITQQFTKNLKNIADIMSVSQ 883

Query: 1021 A-SALPVGTQNSSQLIPS--KISNDTTEPKTVTEMCTQGETASGVI--------DLANPW 1167
            A ++LP   QN   L+P   +I +D+ + K +       +T +G+            N W
Sbjct: 884  ALTSLPPVPQN---LVPQPVQIKSDSMDMKALVSNSEDQQTGAGLAPEVGATGPHSQNAW 940

Query: 1168 GDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXXNSAKFVEVDPI 1347
            GDV+HL + YDDQQKAAIQ+ERARRIEEQ KMF+  K            NSAKF+EVDP+
Sbjct: 941  GDVEHLFERYDDQQKAAIQRERARRIEEQKKMFSAHKLCLVLDLDHTLLNSAKFIEVDPV 1000

Query: 1348 HEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYA 1527
            HEEIL             HLFRF HMGMWTKLRPG+WNFLEKASKLYELHLYTMGNKLYA
Sbjct: 1001 HEEILRKKEEQDREKPQRHLFRFHHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYA 1060

Query: 1528 TEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRVW 1707
            TEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMES VVIIDDS+RVW
Sbjct: 1061 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESGVVIIDDSVRVW 1120

Query: 1708 PHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFFS 1887
            PHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDERP+ GTLASSL VIER+HQ FFS
Sbjct: 1121 PHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIERIHQDFFS 1180

Query: 1888 HQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQID 2067
            HQ+L++VDVRNILAAEQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAEQFGA CTNQID
Sbjct: 1181 HQNLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQID 1240

Query: 2068 EHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 2217
            EHVTHVVANS GTDKVNWALSTG+FVVHPGWVEASALLYRRANE DFAIK
Sbjct: 1241 EHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEVDFAIK 1290


>gb|EEF50102.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
          Length = 1195

 Score =  754 bits (1947), Expect = 0.0
 Identities = 416/736 (56%), Positives = 488/736 (66%), Gaps = 40/736 (5%)
 Frame = +1

Query: 130  VKTSAKSRDPRLRFMNSEVGGAPQNGFAA------------GSVNSRKHKAIDEPVPDEH 273
            VK SAKSRDPRLRF+NS+     QN  A             G++N ++ K +D+P+PD H
Sbjct: 460  VKASAKSRDPRLRFVNSDSNALDQNHRAVPVVNTLKVEPIGGTMNKKRQKIVDDPIPDGH 519

Query: 274  NLKRQRNESTRS---RDAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEV 444
            +LKRQ+N    S   RD + M G GGW+ED  MV  Q  N+ Q   N     R   GG V
Sbjct: 520  SLKRQKNALENSGVVRDVKTMVGSGGWLEDTDMVGPQTMNKNQLVDNAESDPRRKDGGGV 579

Query: 445  GPDRRLVXXXXXXXXXXXXSASN-------MXXXXXXXXXXXXXXKDIAVNPTMLVELLK 603
                  +                       +              K+IAVNPTML+ +LK
Sbjct: 580  CTSSSCISSVNISGTEQIPVTGTSVPIGGELVPVKGSTAAIPDLLKNIAVNPTMLINILK 639

Query: 604  MXXXXXXXXXXXXXXXGPAVN-----GLSSAISPSPDVG-------QNPA----AKSQMN 735
            M                PA +       +S +   P VG         PA       Q+ 
Sbjct: 640  MGQQQRLALEAQQKPVDPAKSTTYPLNSNSMLGTVPVVGAAHSGILPRPAGTVQVSPQLG 699

Query: 736  GPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGNLPSEVQSSKDLLTVQEQGEQ 915
              +D+GKIRMKPRDPRRVLH+N +Q++ S+GSE  K+N       Q +KD   +Q+Q  Q
Sbjct: 700  TADDLGKIRMKPRDPRRVLHNNALQRNGSMGSEHLKTNLTSIPINQETKDNQNLQKQEGQ 759

Query: 916  AQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQAS-ALPVGTQN-SSQLIPSKISNDT 1089
             +   +P QS +LPDIS  FTKNL+N+ADIVS S AS + P+  QN +SQ + + IS+  
Sbjct: 760  VEKKPVPLQSLALPDISMPFTKNLKNIADIVSVSHASTSQPLVPQNPASQPMRTTISSSD 819

Query: 1090 TEPKTVTEMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFA 1269
                  +        A+G     N WGDV+HL +GY+DQQKAAIQ+ERARRIEEQ K+F+
Sbjct: 820  QFLGIGSAPGAAAAAAAGP-RTQNAWGDVEHLFEGYNDQQKAAIQRERARRIEEQKKLFS 878

Query: 1270 ERKXXXXXXXXXXXXNSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRP 1449
             RK            NSAKFVEVDP+H+EIL             HLFRFPHMGMWTKLRP
Sbjct: 879  ARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAHRHLFRFPHMGMWTKLRP 938

Query: 1450 GVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERV 1629
            G+WNFLEKASKLYELHLYTMGNKLYATEMAKVLDP G LF+GRVIS+GD+G+PFD DER+
Sbjct: 939  GIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGVLFNGRVISRGDDGEPFDGDERI 998

Query: 1630 PKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEID 1809
            PKSKDL+GVLGMES VVI+DDS+RVWPHNK NLIVVERY+YFP SRRQFGLPGPSLLEID
Sbjct: 999  PKSKDLEGVLGMESGVVIMDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEID 1058

Query: 1810 HDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPV 1989
            HDERP+ GTLA SL VIER+HQ FF+H SL+E DVRNILA+EQRKIL GC+IVFSRVFPV
Sbjct: 1059 HDERPEDGTLACSLAVIERIHQNFFTHPSLDEADVRNILASEQRKILAGCRIVFSRVFPV 1118

Query: 1990 GEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEA 2169
            GEANPHLHPLWQTAEQFGA CTNQIDE VTHVVANS GTDKVNWALSTGRFVV+PGWVEA
Sbjct: 1119 GEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVYPGWVEA 1178

Query: 2170 SALLYRRANEQDFAIK 2217
            SALLYRRANEQDFAIK
Sbjct: 1179 SALLYRRANEQDFAIK 1194


>ref|XP_021662955.1| LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain
            phosphatase-like 3 [Hevea brasiliensis]
          Length = 1292

 Score =  752 bits (1942), Expect = 0.0
 Identities = 416/735 (56%), Positives = 480/735 (65%), Gaps = 40/735 (5%)
 Frame = +1

Query: 133  KTSAKSRDPRLRFMNSEVGGAPQNGFAA----------GSVNSRKHKAIDEPVPDEHNLK 282
            K SAKSRDPRLRF+NS+   + QN  A           G++N +K K++DEP+PD   LK
Sbjct: 560  KASAKSRDPRLRFVNSDANVSDQNNRAVPVVNNTLKVGGTMNLKKQKSVDEPIPDGPPLK 619

Query: 283  RQRNESTRS---RDAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPD 453
            RQ+  S  S   RD + M G GGW+ED  +V  Q  NR Q  +N     R +  G   P 
Sbjct: 620  RQKIASEISGVGRDVKTMIGSGGWLEDTDVVGPQTLNRNQLVENAESDPRRIDNGVACPS 679

Query: 454  RRLVXXXXXXXXXXXXS---------ASNMXXXXXXXXXXXXXXKDIAVNPTMLVELLKM 606
                                      A  +              K+IAVNPTML+ +LKM
Sbjct: 680  TVSGISSVNISGNEQLQVTGASAVAGAEQVPVMGASATSLPDLLKNIAVNPTMLISILKM 739

Query: 607  XXXXXXXXXXXXXXXG--------PAVNGLSSA-----ISPSPDVGQNPAAKSQMNGP-- 741
                                    P  N +  A     ++P    G  P     +  P  
Sbjct: 740  GQQQRLAIEAQQKPVDLAKSTTHPPNTNSILGALPVVNVAPPQSTGILPRPAGALQVPQL 799

Query: 742  ---NDMGKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGNLPSEVQSSKDLLTVQEQGE 912
               ++MGKIRMKPRDPRRVLH+N +Q++ SLGSE  K+N    S  Q +K+   VQ Q  
Sbjct: 800  AASDEMGKIRMKPRDPRRVLHNNTLQRNGSLGSEQFKTNLISTSTSQGTKENQNVQNQEG 859

Query: 913  QAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQASALPVGTQNSSQLIPSKISNDTT 1092
            Q +   +P+QS   PDIS  FTK+L+N+ADIVS S AS  P+ +QN   L+   +     
Sbjct: 860  QVEMKPVPTQSLVAPDISLPFTKSLKNIADIVSVSNASTPPLVSQN---LVSQHVRTVVL 916

Query: 1093 EPKTVTEMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAE 1272
              +  T +      AS      N WGD DH+ +GY+DQQKAAIQ+ERARRIEEQ KMFA 
Sbjct: 917  NSEQPTGIGLPPGVASVAPRSQNTWGDFDHIFEGYNDQQKAAIQRERARRIEEQKKMFAA 976

Query: 1273 RKXXXXXXXXXXXXNSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPG 1452
             K            NSAKFVE+DP+H+EIL             HLFRFPHMGMWTKLRPG
Sbjct: 977  NKLCLVLDLDHTLLNSAKFVEIDPVHDEILRKKEEQDHEKPQRHLFRFPHMGMWTKLRPG 1036

Query: 1453 VWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVP 1632
            +WNFLEKASKLYELHLYTMGNKLYATEMAKVLDP G LF+GRVIS GD+GDPFDSDERVP
Sbjct: 1037 IWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGVLFNGRVISXGDDGDPFDSDERVP 1096

Query: 1633 KSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDH 1812
            KSKDL+GVLGMESAVVIIDDS+RVWPHNK NLIVVERY+YFP SRRQFGLPGPSLLEIDH
Sbjct: 1097 KSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDH 1156

Query: 1813 DERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVG 1992
            DERP+ GTLA SL VIER+HQ FF+H SL+E DVRNILA+EQRKIL GC+IVFSRVFPVG
Sbjct: 1157 DERPEDGTLACSLAVIERIHQNFFTHPSLDEADVRNILASEQRKILAGCRIVFSRVFPVG 1216

Query: 1993 EANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEAS 2172
            EANPHLHPLWQTAEQFGA CTNQIDE VTHVVANS GTDKVNWALSTGRFVV+PGWVEAS
Sbjct: 1217 EANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVYPGWVEAS 1276

Query: 2173 ALLYRRANEQDFAIK 2217
            ALLYRRANEQDFAIK
Sbjct: 1277 ALLYRRANEQDFAIK 1291


>gb|KJB77193.1| hypothetical protein B456_012G125200 [Gossypium raimondii]
          Length = 982

 Score =  739 bits (1907), Expect = 0.0
 Identities = 424/781 (54%), Positives = 497/781 (63%), Gaps = 52/781 (6%)
 Frame = +1

Query: 31   VNSSGGLQMSSVNSS--GAF---QMAPLNTSGGSQMDPVKTSAKSRDPRLRFMNSEVGGA 195
            V+S+  +  +S  SS  G F      P+  S  S +   K SAKSRDPRLRF NS V   
Sbjct: 208  VSSAPHIDSASSTSSMQGQFTTQNATPVTVSSASNILS-KASAKSRDPRLRFANSNVSAL 266

Query: 196  PQNGF----------AAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRS--RDAQVMPGRG 339
              N             +G ++ RK K+ +EPV D    KRQ+NE      RD Q + G G
Sbjct: 267  DLNQRPLHNASKVPPVSGIMDPRKKKSTEEPVLDGPAPKRQKNELENFGVRDVQAVSGNG 326

Query: 340  GWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXXXXSASNMX 519
            GW+ED     SQ+ NR Q  + +   +R +  G                       + M 
Sbjct: 327  GWLEDTDNCESQITNRNQTMETLDSNSRKMEHGVTCSSTLSGKTNTTVNKNEQVPLTGMS 386

Query: 520  XXXXXXXXXXXXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXGPAVNGLSSAIS---- 687
                         KDIAVNPTML+ +LKM                P  N L    S    
Sbjct: 387  NPSLPALL-----KDIAVNPTMLINILKMGQQQRLPSESQQKTPDPLKNTLYQPSSNPVL 441

Query: 688  ---------PSPDVGQNPAAKS--------QMNGP--NDMGKIRMKPRDPRRVLHSNMVQ 810
                     PSP V   P++ S         + GP  ++  KIRMKPRDPRRVLH N++Q
Sbjct: 442  GVIPPANVIPSPSVNVVPSSSSGTLSKPAGNLQGPPLDESCKIRMKPRDPRRVLHGNVLQ 501

Query: 811  KSESLGSELAKSNGNLP-SEVQSSKDLLTVQEQGE-QAQTNTLPSQSASLPDISQQFTKN 984
            KS S+G +  K+NG  P S  Q SKD +  Q+Q E Q +   +  Q    PDI+QQFT++
Sbjct: 502  KSGSVGPDQLKTNGTSPASSTQGSKDNMNAQKQLENQIEAKPIQCQFVPPPDIAQQFTQS 561

Query: 985  LQNLADIVSSSQASA-LPVGTQNSSQLIPSKISNDTTEPKTVTEMCTQGETASGVIDLA- 1158
            L+N+A ++S  Q+ A LP  +QN     P ++ ++T +  T        +T +G    A 
Sbjct: 562  LKNIAGMMSGPQSFAGLPAVSQNLVSQ-PIQVKSETADKNTKGSNSEDQQTGTGTAPEAG 620

Query: 1159 --------NPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXX 1314
                    N WGDV+HL + YDD+QKAAIQ+ERARRIEEQ KMFA RK            
Sbjct: 621  VTCPPPSQNAWGDVEHLFEKYDDRQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLL 680

Query: 1315 NSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKASKLYEL 1494
            NSAKF+EVDP+HEEIL             HLFRF HMGMWTKLRPG+WNFLEKASKLYEL
Sbjct: 681  NSAKFIEVDPVHEEILRKKEEQDREKPQRHLFRFHHMGMWTKLRPGIWNFLEKASKLYEL 740

Query: 1495 HLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESA 1674
            HLYTMGNKLYATEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMES+
Sbjct: 741  HLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESS 800

Query: 1675 VVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLG 1854
            VVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDERP+ GTLASSL 
Sbjct: 801  VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLA 860

Query: 1855 VIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAE 2034
            VIER+HQ FFSHQ+L+++DVRNILA EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAE
Sbjct: 861  VIERIHQNFFSHQNLDDLDVRNILATEQRKILSGCRIVFSRVFPVGEANPHLHPLWQTAE 920

Query: 2035 QFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 2214
            QFGA CTNQIDEHVTHVVANS GTDKVNWALSTG+FVVHPGWVEASALLYRRANE DFAI
Sbjct: 921  QFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEHDFAI 980

Query: 2215 K 2217
            K
Sbjct: 981  K 981


>ref|XP_022737741.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Durio
            zibethinus]
          Length = 1274

 Score =  748 bits (1932), Expect = 0.0
 Identities = 421/764 (55%), Positives = 491/764 (64%), Gaps = 44/764 (5%)
 Frame = +1

Query: 58   SSVNSSGAFQMAPLNTSGGSQMDPVKTSAKSRDPRLRFMNSEVGGAPQNGF--------- 210
            SS+    A Q A + T   +    +K+SAKSRDPRLRF NS       N           
Sbjct: 521  SSMQGKIATQNATVVTVSSASNIALKSSAKSRDPRLRFANSNASALDLNQQPLHNASKAV 580

Query: 211  -AAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRS---RDAQVMPGRGGWIEDNGMVASQM 378
               G ++SRK K+I+EPV D   LKRQR E   S   +D Q + G  GW+ED  ++ SQ+
Sbjct: 581  PVGGIMDSRKQKSIEEPVLDGPALKRQRKELENSGVVKDVQTVSGNCGWLEDTDVIGSQV 640

Query: 379  NNRVQPNKNM-----AVGNRNLVGGEVGPDRRLVXXXXXXXXXXXXSASNMXXXXXXXXX 543
             NR Q  +N       + NR      +     +             S  ++         
Sbjct: 641  TNRNQIVENSDSNSWKMDNRVTCSSTLSGKTNMTVNRNEQVPMTGMSTPSLPALL----- 695

Query: 544  XXXXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXGP--------AVNGLSSAISP--- 690
                 KDIAVNPT+L+ +LKM                P        + N +   ++P   
Sbjct: 696  -----KDIAVNPTVLINILKMGQQERLAAEILQKSPDPVKSTLHQPSSNSILGVVTPVNI 750

Query: 691  ----SPDVGQNPAAKSQMNGPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGNL 858
                S  +   PA   Q+  P++ G IRMKPRDPRRVLH N++Q+S  +G +  K+NG  
Sbjct: 751  VPSSSSGILSKPAGNLQVPPPDESGNIRMKPRDPRRVLHGNVLQRSGIMGPDQVKTNGTT 810

Query: 859  P-SEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQA-SAL 1032
            P S    SKD L VQ+   Q ++  + SQ    PDI+QQFTKNL+N+ADI+S SQA ++L
Sbjct: 811  PTSSTLGSKDNLNVQKLEAQTESKPMQSQLVPAPDITQQFTKNLKNIADIMSVSQALTSL 870

Query: 1033 PVGTQNSSQLIPSKISNDTTEPKTVTEMCTQGETASGVIDLA---------NPWGDVDHL 1185
            P  +Q+     P +   D+ + KTV       +T +G    A         N W DV+HL
Sbjct: 871  PAVSQSLVSQ-PVQHKPDSMDMKTVVSSSEDQQTGTGSAPEADARGPHCSQNTWDDVEHL 929

Query: 1186 LDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXXNSAKFVEVDPIHEEILX 1365
             + YDDQQKAAIQKERARRIEEQ KMF   K            NSAKF EVDP+HEEIL 
Sbjct: 930  FERYDDQQKAAIQKERARRIEEQKKMFDANKLCLVLDLDHTLLNSAKFNEVDPVHEEILR 989

Query: 1366 XXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKV 1545
                        HLFRF HMGMWTKLRPG+WNFLEKASKLYELHLYTMGNKLYATEMAKV
Sbjct: 990  KKEEQDREKPQRHLFRFQHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKV 1049

Query: 1546 LDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHN 1725
            LDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMESAVVIIDDS+RVWPHNK N
Sbjct: 1050 LDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESAVVIIDDSVRVWPHNKLN 1109

Query: 1726 LIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNE 1905
            LIVVERY YFP SRRQFGL GPSLLEIDHDER D GTLASSL VIER+HQ FFSHQ+L++
Sbjct: 1110 LIVVERYTYFPCSRRQFGLLGPSLLEIDHDERLDDGTLASSLAVIERIHQDFFSHQNLDD 1169

Query: 1906 VDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHV 2085
            VDVRNILAAEQRKIL GC +VFSRVFPVGEANPHLHPLWQTAEQFGA CTNQIDEHVTHV
Sbjct: 1170 VDVRNILAAEQRKILAGCHVVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEHVTHV 1229

Query: 2086 VANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 2217
            VANS GTDKVNWALSTG+FVVHPGWVEAS LLYRRANE DFAIK
Sbjct: 1230 VANSLGTDKVNWALSTGKFVVHPGWVEASTLLYRRANELDFAIK 1273


>gb|KJB77192.1| hypothetical protein B456_012G125200 [Gossypium raimondii]
          Length = 1033

 Score =  739 bits (1907), Expect = 0.0
 Identities = 424/781 (54%), Positives = 497/781 (63%), Gaps = 52/781 (6%)
 Frame = +1

Query: 31   VNSSGGLQMSSVNSS--GAF---QMAPLNTSGGSQMDPVKTSAKSRDPRLRFMNSEVGGA 195
            V+S+  +  +S  SS  G F      P+  S  S +   K SAKSRDPRLRF NS V   
Sbjct: 259  VSSAPHIDSASSTSSMQGQFTTQNATPVTVSSASNILS-KASAKSRDPRLRFANSNVSAL 317

Query: 196  PQNGF----------AAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRS--RDAQVMPGRG 339
              N             +G ++ RK K+ +EPV D    KRQ+NE      RD Q + G G
Sbjct: 318  DLNQRPLHNASKVPPVSGIMDPRKKKSTEEPVLDGPAPKRQKNELENFGVRDVQAVSGNG 377

Query: 340  GWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXXXXSASNMX 519
            GW+ED     SQ+ NR Q  + +   +R +  G                       + M 
Sbjct: 378  GWLEDTDNCESQITNRNQTMETLDSNSRKMEHGVTCSSTLSGKTNTTVNKNEQVPLTGMS 437

Query: 520  XXXXXXXXXXXXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXGPAVNGLSSAIS---- 687
                         KDIAVNPTML+ +LKM                P  N L    S    
Sbjct: 438  NPSLPALL-----KDIAVNPTMLINILKMGQQQRLPSESQQKTPDPLKNTLYQPSSNPVL 492

Query: 688  ---------PSPDVGQNPAAKS--------QMNGP--NDMGKIRMKPRDPRRVLHSNMVQ 810
                     PSP V   P++ S         + GP  ++  KIRMKPRDPRRVLH N++Q
Sbjct: 493  GVIPPANVIPSPSVNVVPSSSSGTLSKPAGNLQGPPLDESCKIRMKPRDPRRVLHGNVLQ 552

Query: 811  KSESLGSELAKSNGNLP-SEVQSSKDLLTVQEQGE-QAQTNTLPSQSASLPDISQQFTKN 984
            KS S+G +  K+NG  P S  Q SKD +  Q+Q E Q +   +  Q    PDI+QQFT++
Sbjct: 553  KSGSVGPDQLKTNGTSPASSTQGSKDNMNAQKQLENQIEAKPIQCQFVPPPDIAQQFTQS 612

Query: 985  LQNLADIVSSSQASA-LPVGTQNSSQLIPSKISNDTTEPKTVTEMCTQGETASGVIDLA- 1158
            L+N+A ++S  Q+ A LP  +QN     P ++ ++T +  T        +T +G    A 
Sbjct: 613  LKNIAGMMSGPQSFAGLPAVSQNLVSQ-PIQVKSETADKNTKGSNSEDQQTGTGTAPEAG 671

Query: 1159 --------NPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXX 1314
                    N WGDV+HL + YDD+QKAAIQ+ERARRIEEQ KMFA RK            
Sbjct: 672  VTCPPPSQNAWGDVEHLFEKYDDRQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLL 731

Query: 1315 NSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKASKLYEL 1494
            NSAKF+EVDP+HEEIL             HLFRF HMGMWTKLRPG+WNFLEKASKLYEL
Sbjct: 732  NSAKFIEVDPVHEEILRKKEEQDREKPQRHLFRFHHMGMWTKLRPGIWNFLEKASKLYEL 791

Query: 1495 HLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESA 1674
            HLYTMGNKLYATEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMES+
Sbjct: 792  HLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESS 851

Query: 1675 VVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLG 1854
            VVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDERP+ GTLASSL 
Sbjct: 852  VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLA 911

Query: 1855 VIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAE 2034
            VIER+HQ FFSHQ+L+++DVRNILA EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAE
Sbjct: 912  VIERIHQNFFSHQNLDDLDVRNILATEQRKILSGCRIVFSRVFPVGEANPHLHPLWQTAE 971

Query: 2035 QFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 2214
            QFGA CTNQIDEHVTHVVANS GTDKVNWALSTG+FVVHPGWVEASALLYRRANE DFAI
Sbjct: 972  QFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEHDFAI 1031

Query: 2215 K 2217
            K
Sbjct: 1032 K 1032


>ref|XP_010656786.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            isoform X1 [Vitis vinifera]
          Length = 1276

 Score =  744 bits (1920), Expect = 0.0
 Identities = 425/782 (54%), Positives = 504/782 (64%), Gaps = 47/782 (6%)
 Frame = +1

Query: 13   GLQMSSVNSSGGLQMSSVNSSGAFQMAPLNTSG-GSQMDPV-KTSAKSRDPRLRFMNSEV 186
            G   S V+S   L  S V       + P NT    S+ + + + SAKSRDPRLR  +S+ 
Sbjct: 525  GRNTSLVSSGPHLDSSVVQGL----VVPRNTGAVNSRFNSILRASAKSRDPRLRLASSDA 580

Query: 187  GG-------------APQNGFAAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRS---RDA 318
            G              +P+       V+SRK K+ +EP+ D    KRQRN  T     RDA
Sbjct: 581  GSLDLNERPLPAVSNSPKVDPLGEIVSSRKQKSAEEPLLDGPVTKRQRNGLTSPATVRDA 640

Query: 319  QVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNL-----VGGEVGPDRRLVXXXXXX 483
            Q +   GGW+ED+  V  QM NR Q  +N     + L     V G +G D+  V      
Sbjct: 641  QTVVASGGWLEDSNTVIPQMMNRNQLIENTGTDPKKLESKVTVTG-IGCDKPYVTVNGNE 699

Query: 484  XXXXXXSASNMXXXXXXXXXXXXXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXGPAV 663
                  +++                KDIAVNP + + +                   P  
Sbjct: 700  HLPVVATSTTASLQSLL--------KDIAVNPAVWMNIFNKVEQQKSGDPAKNTVLPPTS 751

Query: 664  NGLSSAISPSP-------DVGQNPAAKSQ------MNGPNDMGKIRMKPRDPRRVLHSNM 804
            N +   + P+         +GQ PA   Q      MN  ++ GK+RMKPRDPRR+LH+N 
Sbjct: 752  NSILGVVPPASVAPLKPSALGQKPAGALQVPQTGPMNPQDESGKVRMKPRDPRRILHANS 811

Query: 805  VQKSESLGSELAKSNGNLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFTKN 984
             Q+S S GSE  K+N                Q+Q +Q +T ++PS S + PDISQQFTKN
Sbjct: 812  FQRSGSSGSEQFKTNA---------------QKQEDQTETKSVPSHSVNPPDISQQFTKN 856

Query: 985  LQNLADIVSSSQASALPVGTQNSSQLIPSK---ISNDTTEPKT--------VTEMCTQGE 1131
            L+N+AD++S+SQAS++   T    Q++ S+   ++ D  + K         +T   ++ E
Sbjct: 857  LKNIADLMSASQASSM---TPTFPQILSSQSVQVNTDRMDVKATVSDSGDQLTANGSKPE 913

Query: 1132 TASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXX 1311
            +A+G     N WGDV+HL DGYDDQQKAAIQ+ERARRIEEQ KMF+ RK           
Sbjct: 914  SAAGPPQSKNTWGDVEHLFDGYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTL 973

Query: 1312 XNSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKASKLYE 1491
             NSAKFVEVDP+H+EIL             HLFRFPHMGMWTKLRPG+WNFLEKASKLYE
Sbjct: 974  LNSAKFVEVDPVHDEILRKKEEQDREKSQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYE 1033

Query: 1492 LHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMES 1671
            LHLYTMGNKLYATEMAKVLDPKG LF GRVISKGD+GD  D DERVPKSKDL+GVLGMES
Sbjct: 1034 LHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKGDDGDVLDGDERVPKSKDLEGVLGMES 1093

Query: 1672 AVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSL 1851
            AVVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGLPGPSLLEIDHDERP+ GTLASSL
Sbjct: 1094 AVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASSL 1153

Query: 1852 GVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTA 2031
             VIER+HQ FFS+++L+EVDVRNILA+EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTA
Sbjct: 1154 AVIERIHQSFFSNRALDEVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTA 1213

Query: 2032 EQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 2211
            E FGA CTNQIDE VTHVVANS GTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA
Sbjct: 1214 ESFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 1273

Query: 2212 IK 2217
            IK
Sbjct: 1274 IK 1275


>ref|XP_010656789.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            isoform X2 [Vitis vinifera]
          Length = 1273

 Score =  743 bits (1919), Expect = 0.0
 Identities = 426/779 (54%), Positives = 504/779 (64%), Gaps = 44/779 (5%)
 Frame = +1

Query: 13   GLQMSSVNSSGGLQMSSVNSSGAFQMAPLNTSG-GSQMDPV-KTSAKSRDPRLRFMNSEV 186
            G   S V+S   L  S V       + P NT    S+ + + + SAKSRDPRLR  +S+ 
Sbjct: 525  GRNTSLVSSGPHLDSSVVQGL----VVPRNTGAVNSRFNSILRASAKSRDPRLRLASSDA 580

Query: 187  GG-------------APQNGFAAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRS---RDA 318
            G              +P+       V+SRK K+ +EP+ D    KRQRN  T     RDA
Sbjct: 581  GSLDLNERPLPAVSNSPKVDPLGEIVSSRKQKSAEEPLLDGPVTKRQRNGLTSPATVRDA 640

Query: 319  QVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNL-----VGGEVGPDRRLVXXXXXX 483
            Q +   GGW+ED+  V  QM NR Q  +N     + L     V G +G D+  V      
Sbjct: 641  QTVVASGGWLEDSNTVIPQMMNRNQLIENTGTDPKKLESKVTVTG-IGCDKPYVTVNGNE 699

Query: 484  XXXXXXSASNMXXXXXXXXXXXXXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXGPAV 663
                  +++                KDIAVNP + + +                   P  
Sbjct: 700  HLPVVATSTTASLQSLL--------KDIAVNPAVWMNIFNKVEQQKSGDPAKNTVLPPTS 751

Query: 664  NGLSSAISPSP-------DVGQNPAAKSQM--NGPND-MGKIRMKPRDPRRVLHSNMVQK 813
            N +   + P+         +GQ PA   Q+   GP D  GK+RMKPRDPRR+LH+N  Q+
Sbjct: 752  NSILGVVPPASVAPLKPSALGQKPAGALQVPQTGPMDESGKVRMKPRDPRRILHANSFQR 811

Query: 814  SESLGSELAKSNGNLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFTKNLQN 993
            S S GSE  K+N                Q+Q +Q +T ++PS S + PDISQQFTKNL+N
Sbjct: 812  SGSSGSEQFKTNA---------------QKQEDQTETKSVPSHSVNPPDISQQFTKNLKN 856

Query: 994  LADIVSSSQASALPVGTQNSSQLIPSK---ISNDTTEPKT--------VTEMCTQGETAS 1140
            +AD++S+SQAS++   T    Q++ S+   ++ D  + K         +T   ++ E+A+
Sbjct: 857  IADLMSASQASSM---TPTFPQILSSQSVQVNTDRMDVKATVSDSGDQLTANGSKPESAA 913

Query: 1141 GVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXXNS 1320
            G     N WGDV+HL DGYDDQQKAAIQ+ERARRIEEQ KMF+ RK            NS
Sbjct: 914  GPPQSKNTWGDVEHLFDGYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTLLNS 973

Query: 1321 AKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHL 1500
            AKFVEVDP+H+EIL             HLFRFPHMGMWTKLRPG+WNFLEKASKLYELHL
Sbjct: 974  AKFVEVDPVHDEILRKKEEQDREKSQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHL 1033

Query: 1501 YTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAVV 1680
            YTMGNKLYATEMAKVLDPKG LF GRVISKGD+GD  D DERVPKSKDL+GVLGMESAVV
Sbjct: 1034 YTMGNKLYATEMAKVLDPKGVLFAGRVISKGDDGDVLDGDERVPKSKDLEGVLGMESAVV 1093

Query: 1681 IIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGVI 1860
            IIDDS+RVWPHNK NLIVVERY YFP SRRQFGLPGPSLLEIDHDERP+ GTLASSL VI
Sbjct: 1094 IIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASSLAVI 1153

Query: 1861 ERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQF 2040
            ER+HQ FFS+++L+EVDVRNILA+EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAE F
Sbjct: 1154 ERIHQSFFSNRALDEVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAESF 1213

Query: 2041 GAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 2217
            GA CTNQIDE VTHVVANS GTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK
Sbjct: 1214 GAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1272


>dbj|GAV71470.1| BRCT domain-containing protein/NIF domain-containing protein
            [Cephalotus follicularis]
          Length = 1228

 Score =  742 bits (1915), Expect = 0.0
 Identities = 425/780 (54%), Positives = 502/780 (64%), Gaps = 43/780 (5%)
 Frame = +1

Query: 7    SGGLQMSSVNSSGGLQMSSVNSSGAFQMAPLNTSGGSQMDPVKTSAKSRDPRLRFMNSEV 186
            SG  QM + +  G     +   S A   +P  T  GS    +K SAKSRDPRLR++NS+V
Sbjct: 475  SGSPQMDASSMEG----LTTTRSPAPVSSPAPTVSGSN-PTMKPSAKSRDPRLRYVNSDV 529

Query: 187  G-------------GAPQNGFAAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRSRDAQVM 327
                           AP+       + SRK K +++P+ D   LKRQ++ S  S    V+
Sbjct: 530  SVLDLTQRPLHLVHNAPKV-----ELGSRKQKTVEDPILDGPALKRQKSGSENSGLIGVL 584

Query: 328  P---GRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXXX 498
                G GGW+ED  MV +Q+ N     KN+ +  R +  G   P                
Sbjct: 585  KTTSGNGGWLEDTDMVGTQLLN-----KNVVLDPRKVDVGVTSPSIVHCNTNVGNEPLLV 639

Query: 499  XSASNMXXXXXXXXXXXXXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXG----PAVN 666
             S+S+               KDIAVNPTML+ +LKM                    P  N
Sbjct: 640  TSSSSTASLPALL-------KDIAVNPTMLINILKMGQQQRLPAEVQQKSTDSLHPPTSN 692

Query: 667  GLSSAISPSPDVGQNPA-----------AKSQMNGPNDMGKIRMKPRDPRRVLHSNMVQK 813
             L  A+        NP+              Q +  +D GKIRMKPRDPRRVLH N +Q+
Sbjct: 693  SLLGAVPSVNFASSNPSRILPKPAGTLPTTPQTSAMDDPGKIRMKPRDPRRVLHGNALQR 752

Query: 814  SESLGSELAKSNGNLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFTKNLQN 993
            S SLGSE  K N  +PS     KD L  Q+   QA+T  +PS S   PDI++ FTKNL+N
Sbjct: 753  SGSLGSEKLKMN--VPSTSSFQKDNLNAQKLEGQAETKPMPSLSIPQPDITRLFTKNLKN 810

Query: 994  LADIVSSSQASALPVGTQNSSQLI---PSKISNDTTEPKTV---TEMCTQGETASGVIDL 1155
            + DI+S SQ     +G+ N +Q +   P++I  D  + K +   +E    G  ++  +  
Sbjct: 811  INDIMSVSQPL---IGSPNVTQNLESQPAQIKADRVDVKAIVSNSEDPRTGTVSASEVGA 867

Query: 1156 ANP------WGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXXN 1317
            A P      WGDV+HL +GYDDQQKAAIQ+ERARR+EEQNKMFA  K            N
Sbjct: 868  AGPARPQHAWGDVEHLFEGYDDQQKAAIQRERARRLEEQNKMFAAHKLCLVLDLDHTLLN 927

Query: 1318 SAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKASKLYELH 1497
            SAKFVEVDP+H+EIL             HLFRFPHMGMWTKLRPG+WNFLE+ASKL+ELH
Sbjct: 928  SAKFVEVDPVHDEILRKKEEQDREKLHRHLFRFPHMGMWTKLRPGIWNFLERASKLFELH 987

Query: 1498 LYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAV 1677
            LYTMGNKLYATEMAKVLDPKG LF GRVIS+GD+GDPFD DERVPKSKDL+GVLGMESAV
Sbjct: 988  LYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAV 1047

Query: 1678 VIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGV 1857
            VIIDDS+RVWPHNK NLIVVERY YFP SRRQFGLPGPSLLEIDHDERP+ GTLAS+L V
Sbjct: 1048 VIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASALTV 1107

Query: 1858 IERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQ 2037
            IER+HQIFFS+Q L +VDVRNILA+EQ+KIL GC+I+FSRVFPVGEANPHLHPLWQTAEQ
Sbjct: 1108 IERIHQIFFSYQPLGDVDVRNILASEQQKILDGCRILFSRVFPVGEANPHLHPLWQTAEQ 1167

Query: 2038 FGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 2217
            FGA CTNQIDE VTHVVANS GTDKVNWALSTGRFVV+PGWVEASALLYRRANEQDF IK
Sbjct: 1168 FGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVYPGWVEASALLYRRANEQDFGIK 1227


>ref|XP_012459418.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            isoform X2 [Gossypium raimondii]
          Length = 1251

 Score =  739 bits (1907), Expect = 0.0
 Identities = 424/781 (54%), Positives = 497/781 (63%), Gaps = 52/781 (6%)
 Frame = +1

Query: 31   VNSSGGLQMSSVNSS--GAF---QMAPLNTSGGSQMDPVKTSAKSRDPRLRFMNSEVGGA 195
            V+S+  +  +S  SS  G F      P+  S  S +   K SAKSRDPRLRF NS V   
Sbjct: 477  VSSAPHIDSASSTSSMQGQFTTQNATPVTVSSASNILS-KASAKSRDPRLRFANSNVSAL 535

Query: 196  PQNGF----------AAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRS--RDAQVMPGRG 339
              N             +G ++ RK K+ +EPV D    KRQ+NE      RD Q + G G
Sbjct: 536  DLNQRPLHNASKVPPVSGIMDPRKKKSTEEPVLDGPAPKRQKNELENFGVRDVQAVSGNG 595

Query: 340  GWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXXXXSASNMX 519
            GW+ED     SQ+ NR Q  + +   +R +  G                       + M 
Sbjct: 596  GWLEDTDNCESQITNRNQTMETLDSNSRKMEHGVTCSSTLSGKTNTTVNKNEQVPLTGMS 655

Query: 520  XXXXXXXXXXXXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXGPAVNGLSSAIS---- 687
                         KDIAVNPTML+ +LKM                P  N L    S    
Sbjct: 656  NPSLPALL-----KDIAVNPTMLINILKMGQQQRLPSESQQKTPDPLKNTLYQPSSNPVL 710

Query: 688  ---------PSPDVGQNPAAKS--------QMNGP--NDMGKIRMKPRDPRRVLHSNMVQ 810
                     PSP V   P++ S         + GP  ++  KIRMKPRDPRRVLH N++Q
Sbjct: 711  GVIPPANVIPSPSVNVVPSSSSGTLSKPAGNLQGPPLDESCKIRMKPRDPRRVLHGNVLQ 770

Query: 811  KSESLGSELAKSNGNLP-SEVQSSKDLLTVQEQGE-QAQTNTLPSQSASLPDISQQFTKN 984
            KS S+G +  K+NG  P S  Q SKD +  Q+Q E Q +   +  Q    PDI+QQFT++
Sbjct: 771  KSGSVGPDQLKTNGTSPASSTQGSKDNMNAQKQLENQIEAKPIQCQFVPPPDIAQQFTQS 830

Query: 985  LQNLADIVSSSQASA-LPVGTQNSSQLIPSKISNDTTEPKTVTEMCTQGETASGVIDLA- 1158
            L+N+A ++S  Q+ A LP  +QN     P ++ ++T +  T        +T +G    A 
Sbjct: 831  LKNIAGMMSGPQSFAGLPAVSQNLVSQ-PIQVKSETADKNTKGSNSEDQQTGTGTAPEAG 889

Query: 1159 --------NPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXX 1314
                    N WGDV+HL + YDD+QKAAIQ+ERARRIEEQ KMFA RK            
Sbjct: 890  VTCPPPSQNAWGDVEHLFEKYDDRQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLL 949

Query: 1315 NSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKASKLYEL 1494
            NSAKF+EVDP+HEEIL             HLFRF HMGMWTKLRPG+WNFLEKASKLYEL
Sbjct: 950  NSAKFIEVDPVHEEILRKKEEQDREKPQRHLFRFHHMGMWTKLRPGIWNFLEKASKLYEL 1009

Query: 1495 HLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESA 1674
            HLYTMGNKLYATEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMES+
Sbjct: 1010 HLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESS 1069

Query: 1675 VVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLG 1854
            VVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDERP+ GTLASSL 
Sbjct: 1070 VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLA 1129

Query: 1855 VIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAE 2034
            VIER+HQ FFSHQ+L+++DVRNILA EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAE
Sbjct: 1130 VIERIHQNFFSHQNLDDLDVRNILATEQRKILSGCRIVFSRVFPVGEANPHLHPLWQTAE 1189

Query: 2035 QFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 2214
            QFGA CTNQIDEHVTHVVANS GTDKVNWALSTG+FVVHPGWVEASALLYRRANE DFAI
Sbjct: 1190 QFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEHDFAI 1249

Query: 2215 K 2217
            K
Sbjct: 1250 K 1250


>gb|PON91807.1| FCP1-like phosphatase [Trema orientalis]
          Length = 1294

 Score =  739 bits (1909), Expect = 0.0
 Identities = 425/784 (54%), Positives = 494/784 (63%), Gaps = 50/784 (6%)
 Frame = +1

Query: 16   LQMSSVNSSGGLQMSSVNSSGAFQMAPLNTSGGSQMDPVKTSAKSRDPRLRFMNSEVGGA 195
            L+ SS++ S  +  SS+   G        ++G +    VK SAKSRDPRLRF NS++   
Sbjct: 520  LRPSSISPSTPVSSSSMQ--GPITAKNAASAGSASNSTVKASAKSRDPRLRFANSDLAAL 577

Query: 196  PQNGFAAGSV------------NSRKHKAIDEPVPDEHNLKRQRNESTRSR---DAQVMP 330
              N     +V            +SRK +  DE   D    KRQRN    +R   D + + 
Sbjct: 578  DLNLRPVTAVQNAPKVEPGEPTSSRKQRITDESNLDGSPYKRQRNSFENARIVGDVKTVS 637

Query: 331  GRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXXXXSAS 510
            G GGW+EDNG V  Q+NN+     ++    R LV     P                 SA+
Sbjct: 638  GSGGWLEDNGFVGPQLNNKNHSMASLEADPRKLVHMVNCPTNNGPNMAKEQVPVTSTSAT 697

Query: 511  NMXXXXXXXXXXXXXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXG----------PA 660
                            KDIAVNPT+L+ LLK+                          P+
Sbjct: 698  ---------ASLPELLKDIAVNPTLLINLLKLGQQQQQQQLVAETQPKSDPVKDSIHPPS 748

Query: 661  VNGLSSA-----ISPSPDVG--QNPAAK-------SQMNGPNDMGKIRMKPRDPRRVLHS 798
             N +  A     I+PS   G  Q P+A        + M+  +++GKIRMKPRDPRRVLH 
Sbjct: 749  SNSILGAAPLVNIAPSKASGILQTPSASFPVTSQVAAMSSQDELGKIRMKPRDPRRVLHG 808

Query: 799  NMVQKSESLGSELAKSNGNLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFT 978
            + +QKS SLG E  K+  +  S    +KD L  Q Q  QA   T+PSQS   PDI +QFT
Sbjct: 809  STLQKSGSLGHEQLKTVVSPLSSTTGNKDNLNGQMQEGQADQKTVPSQSVLPPDIGRQFT 868

Query: 979  KNLQNLADIVSSSQASALP-VGTQN-SSQLIPSK---------ISNDTTEPKTVTEMCTQ 1125
            KNL+N+ADI+S S  S  P + +QN +SQ +P K         +SN   +   +    T 
Sbjct: 869  KNLRNIADIISVSNVSTSPAIVSQNVASQPVPVKPERGDVKAVVSNSEDQRNGIL---TP 925

Query: 1126 GETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXX 1305
                +G     N WGDV+HL +GYDDQQKAAIQ+ER RR+EEQNKMF  RK         
Sbjct: 926  EVAVAGPSRAPNAWGDVEHLFEGYDDQQKAAIQRERTRRLEEQNKMFEARKLCLVLDLDH 985

Query: 1306 XXXNSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKASKL 1485
               NSAKFVEVDP+H+EIL             HLFRFPHMGMWTKLRPGVWNFLEKASKL
Sbjct: 986  TLLNSAKFVEVDPLHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGVWNFLEKASKL 1045

Query: 1486 YELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGM 1665
            YELHLYTMGNKLYATEMAKVLDP G LF GRVIS+GD+GDPFD DERVPKSKDL+GVLGM
Sbjct: 1046 YELHLYTMGNKLYATEMAKVLDPTGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM 1105

Query: 1666 ESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLAS 1845
            ESAVVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGLPGPSLLEIDHDERP+ GTLAS
Sbjct: 1106 ESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEDGTLAS 1165

Query: 1846 SLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQ 2025
            SL VIER+HQ FF+HQSL E DVRNILA+EQRKIL GC+IVFSRVFPV E NPHLHPLWQ
Sbjct: 1166 SLSVIERIHQNFFNHQSLEEADVRNILASEQRKILAGCRIVFSRVFPVSEVNPHLHPLWQ 1225

Query: 2026 TAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQD 2205
            TAEQFGA C  QID+ VTHVVANS GTDKVNWA+S GRF VHPGWVEASALLYRRANEQD
Sbjct: 1226 TAEQFGAVCITQIDDQVTHVVANSPGTDKVNWAISNGRFAVHPGWVEASALLYRRANEQD 1285

Query: 2206 FAIK 2217
            F IK
Sbjct: 1286 FTIK 1289


>ref|XP_012459417.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            isoform X1 [Gossypium raimondii]
 gb|KJB77191.1| hypothetical protein B456_012G125200 [Gossypium raimondii]
          Length = 1272

 Score =  739 bits (1907), Expect = 0.0
 Identities = 424/781 (54%), Positives = 497/781 (63%), Gaps = 52/781 (6%)
 Frame = +1

Query: 31   VNSSGGLQMSSVNSS--GAF---QMAPLNTSGGSQMDPVKTSAKSRDPRLRFMNSEVGGA 195
            V+S+  +  +S  SS  G F      P+  S  S +   K SAKSRDPRLRF NS V   
Sbjct: 498  VSSAPHIDSASSTSSMQGQFTTQNATPVTVSSASNILS-KASAKSRDPRLRFANSNVSAL 556

Query: 196  PQNGF----------AAGSVNSRKHKAIDEPVPDEHNLKRQRNESTRS--RDAQVMPGRG 339
              N             +G ++ RK K+ +EPV D    KRQ+NE      RD Q + G G
Sbjct: 557  DLNQRPLHNASKVPPVSGIMDPRKKKSTEEPVLDGPAPKRQKNELENFGVRDVQAVSGNG 616

Query: 340  GWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXXXXSASNMX 519
            GW+ED     SQ+ NR Q  + +   +R +  G                       + M 
Sbjct: 617  GWLEDTDNCESQITNRNQTMETLDSNSRKMEHGVTCSSTLSGKTNTTVNKNEQVPLTGMS 676

Query: 520  XXXXXXXXXXXXXKDIAVNPTMLVELLKMXXXXXXXXXXXXXXXGPAVNGLSSAIS---- 687
                         KDIAVNPTML+ +LKM                P  N L    S    
Sbjct: 677  NPSLPALL-----KDIAVNPTMLINILKMGQQQRLPSESQQKTPDPLKNTLYQPSSNPVL 731

Query: 688  ---------PSPDVGQNPAAKS--------QMNGP--NDMGKIRMKPRDPRRVLHSNMVQ 810
                     PSP V   P++ S         + GP  ++  KIRMKPRDPRRVLH N++Q
Sbjct: 732  GVIPPANVIPSPSVNVVPSSSSGTLSKPAGNLQGPPLDESCKIRMKPRDPRRVLHGNVLQ 791

Query: 811  KSESLGSELAKSNGNLP-SEVQSSKDLLTVQEQGE-QAQTNTLPSQSASLPDISQQFTKN 984
            KS S+G +  K+NG  P S  Q SKD +  Q+Q E Q +   +  Q    PDI+QQFT++
Sbjct: 792  KSGSVGPDQLKTNGTSPASSTQGSKDNMNAQKQLENQIEAKPIQCQFVPPPDIAQQFTQS 851

Query: 985  LQNLADIVSSSQASA-LPVGTQNSSQLIPSKISNDTTEPKTVTEMCTQGETASGVIDLA- 1158
            L+N+A ++S  Q+ A LP  +QN     P ++ ++T +  T        +T +G    A 
Sbjct: 852  LKNIAGMMSGPQSFAGLPAVSQNLVSQ-PIQVKSETADKNTKGSNSEDQQTGTGTAPEAG 910

Query: 1159 --------NPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXX 1314
                    N WGDV+HL + YDD+QKAAIQ+ERARRIEEQ KMFA RK            
Sbjct: 911  VTCPPPSQNAWGDVEHLFEKYDDRQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLL 970

Query: 1315 NSAKFVEVDPIHEEILXXXXXXXXXXXXXHLFRFPHMGMWTKLRPGVWNFLEKASKLYEL 1494
            NSAKF+EVDP+HEEIL             HLFRF HMGMWTKLRPG+WNFLEKASKLYEL
Sbjct: 971  NSAKFIEVDPVHEEILRKKEEQDREKPQRHLFRFHHMGMWTKLRPGIWNFLEKASKLYEL 1030

Query: 1495 HLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESA 1674
            HLYTMGNKLYATEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMES+
Sbjct: 1031 HLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESS 1090

Query: 1675 VVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLG 1854
            VVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDERP+ GTLASSL 
Sbjct: 1091 VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLA 1150

Query: 1855 VIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAE 2034
            VIER+HQ FFSHQ+L+++DVRNILA EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAE
Sbjct: 1151 VIERIHQNFFSHQNLDDLDVRNILATEQRKILSGCRIVFSRVFPVGEANPHLHPLWQTAE 1210

Query: 2035 QFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 2214
            QFGA CTNQIDEHVTHVVANS GTDKVNWALSTG+FVVHPGWVEASALLYRRANE DFAI
Sbjct: 1211 QFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEHDFAI 1270

Query: 2215 K 2217
            K
Sbjct: 1271 K 1271