BLASTX nr result

ID: Ophiopogon25_contig00021835 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ophiopogon25_contig00021835
         (3118 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ONK58600.1| uncharacterized protein A4U43_C09F14730 [Asparagu...   880   0.0  
ref|XP_020246902.1| RNA polymerase II C-terminal domain phosphat...   880   0.0  
ref|XP_010929653.1| PREDICTED: RNA polymerase II C-terminal doma...   834   0.0  
ref|XP_008791049.1| PREDICTED: RNA polymerase II C-terminal doma...   812   0.0  
gb|OVA17386.1| BRCT domain [Macleaya cordata]                         801   0.0  
ref|XP_010249185.1| PREDICTED: RNA polymerase II C-terminal doma...   786   0.0  
gb|EOX99661.1| RNA polymerase II C-terminal domain phosphatase-l...   781   0.0  
ref|XP_007043830.2| PREDICTED: RNA polymerase II C-terminal doma...   780   0.0  
ref|XP_009421039.1| PREDICTED: RNA polymerase II C-terminal doma...   777   0.0  
ref|XP_021275229.1| RNA polymerase II C-terminal domain phosphat...   778   0.0  
ref|XP_021662955.1| LOW QUALITY PROTEIN: RNA polymerase II C-ter...   772   0.0  
gb|EEF50102.1| RNA polymerase II ctd phosphatase, putative [Rici...   768   0.0  
ref|XP_009386584.1| PREDICTED: RNA polymerase II C-terminal doma...   769   0.0  
ref|XP_015570573.1| PREDICTED: RNA polymerase II C-terminal doma...   768   0.0  
dbj|GAV71470.1| BRCT domain-containing protein/NIF domain-contai...   765   0.0  
gb|KJB77193.1| hypothetical protein B456_012G125200 [Gossypium r...   753   0.0  
gb|KJB77192.1| hypothetical protein B456_012G125200 [Gossypium r...   753   0.0  
ref|XP_022737741.1| RNA polymerase II C-terminal domain phosphat...   760   0.0  
dbj|BAT14211.1| Os11g0521900, partial [Oryza sativa Japonica Group]   756   0.0  
gb|EEE52187.1| hypothetical protein OsJ_34058 [Oryza sativa Japo...   756   0.0  

>gb|ONK58600.1| uncharacterized protein A4U43_C09F14730 [Asparagus officinalis]
          Length = 1100

 Score =  880 bits (2275), Expect = 0.0
 Identities = 485/760 (63%), Positives = 535/760 (70%), Gaps = 6/760 (0%)
 Frame = -1

Query: 3073 MAPVNSSVGFQMAPFNSSGGPQMSSVNSSGGLQMSSVNSSGAFQMAPLNTSGGSQMAP-V 2897
            MAPVNSS GFQM P  SS GPQM+ V + G +                    GS+  P V
Sbjct: 256  MAPVNSSDGFQMQPVYSSAGPQMAQVRTIGQV--------------------GSEANPAV 295

Query: 2896 KTSAKSRDPRLRFMNSEVGGAPQNGFAAGSVNSRKHKAMDEPVPDEHNLKRQRNESTRSR 2717
            K SAKSRDPRLRFM SE    P+NGFAAG VNS KHK+ DE V DEH+LKRQR +S  SR
Sbjct: 296  KASAKSRDPRLRFMKSETSVVPKNGFAAGPVNSLKHKSDDELVLDEHSLKRQRKDSMSSR 355

Query: 2716 DAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXX 2537
            D +V   + G                      A  +RNLVG EVG +  L          
Sbjct: 356  DVKVTGSQSG----------------------ATKDRNLVGTEVGYEMGLDADNNKLTVS 393

Query: 2536 NGGSASNMSTSPAPVVSLPSLLKDIAVNPTMLVELLKMXXXXXXXXXXXXXXAGPAVNGL 2357
            +  S S  +T   P+VSLPSLLK IA NP MLV+LL+M               G AVNGL
Sbjct: 394  SVPSTSISTT--GPIVSLPSLLKGIAANPQMLVQLLRMEQQKIAAGQAQEKPDGQAVNGL 451

Query: 2356 SSAISPSPDVGQNPAAKPQM-----NGPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSEL 2192
            SSA S S  +GQNP+ K QM     +  N+M KIRMKPRDPRRVLHSNM Q++E+ GS  
Sbjct: 452  SSATSSSSGIGQNPSVKSQMPPQTKSTNNNMAKIRMKPRDPRRVLHSNMAQRTENSGS-- 509

Query: 2191 AKSNGDLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSS 2012
                    S V S KD L  ++QG+QAQ   +P QSASL DISQQFTKNLQNLAD+VS S
Sbjct: 510  ------ATSNVHSIKDQLLHRKQGDQAQKLAVPLQSASLTDISQQFTKNLQNLADMVSKS 563

Query: 2011 QTSALPVGTQNSSQLIPSKICNDTTEPKTVTGMCTQGETASGVIDLANPWGDVDHLLDGY 1832
            Q                SK  +D T+PK V  +C+Q +T       ANPWGDVDHLLDGY
Sbjct: 564  Q----------------SKTSDDLTKPKNVPELCSQTDTKPA----ANPWGDVDHLLDGY 603

Query: 1831 DDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXLNSAKFVEVDPIHEEILXXXXX 1652
            DD+QKAAIQKERARRIEEQNKMFA RK           LNSAKFVE+DPIHEEIL     
Sbjct: 604  DDKQKAAIQKERARRIEEQNKMFAARKLCLVLDLDHTLLNSAKFVEIDPIHEEILRKKEE 663

Query: 1651 XXXXXXXRHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPK 1472
                   RHLFR  HMGMWTKLRPG+W FLEKAS+LYELH+YTMGNKLYATEMAK+LDPK
Sbjct: 664  QDRQTQERHLFRLQHMGMWTKLRPGIWTFLEKASQLYELHVYTMGNKLYATEMAKLLDPK 723

Query: 1471 GTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVV 1292
            G LF GRV+S+GD+GDPFD D+RVPKSKDLDGVLGMESAV+IIDDSLRVWPHNKHNLIVV
Sbjct: 724  GNLFAGRVLSRGDDGDPFDGDDRVPKSKDLDGVLGMESAVLIIDDSLRVWPHNKHNLIVV 783

Query: 1291 ERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVR 1112
            ERY YFPSSRRQFGL GPSLLEIDHDERP+ GTLASSL VIER+H+IFFSH  L EVDVR
Sbjct: 784  ERYTYFPSSRRQFGLIGPSLLEIDHDERPEDGTLASSLAVIERIHEIFFSHSCLTEVDVR 843

Query: 1111 NILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANS 932
            NIL AEQRKIL GCKIVFSR+FPVGEANPHLHPLWQTAEQFGAECTNQIDE VTHVVANS
Sbjct: 844  NILGAEQRKILAGCKIVFSRIFPVGEANPHLHPLWQTAEQFGAECTNQIDEQVTHVVANS 903

Query: 931  FGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 812
             GTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA+K
Sbjct: 904  LGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAVK 943


>ref|XP_020246902.1| RNA polymerase II C-terminal domain phosphatase-like 3 isoform X1
            [Asparagus officinalis]
 ref|XP_020246903.1| RNA polymerase II C-terminal domain phosphatase-like 3 isoform X2
            [Asparagus officinalis]
          Length = 1127

 Score =  880 bits (2275), Expect = 0.0
 Identities = 485/760 (63%), Positives = 535/760 (70%), Gaps = 6/760 (0%)
 Frame = -1

Query: 3073 MAPVNSSVGFQMAPFNSSGGPQMSSVNSSGGLQMSSVNSSGAFQMAPLNTSGGSQMAP-V 2897
            MAPVNSS GFQM P  SS GPQM+ V + G +                    GS+  P V
Sbjct: 439  MAPVNSSDGFQMQPVYSSAGPQMAQVRTIGQV--------------------GSEANPAV 478

Query: 2896 KTSAKSRDPRLRFMNSEVGGAPQNGFAAGSVNSRKHKAMDEPVPDEHNLKRQRNESTRSR 2717
            K SAKSRDPRLRFM SE    P+NGFAAG VNS KHK+ DE V DEH+LKRQR +S  SR
Sbjct: 479  KASAKSRDPRLRFMKSETSVVPKNGFAAGPVNSLKHKSDDELVLDEHSLKRQRKDSMSSR 538

Query: 2716 DAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXX 2537
            D +V   + G                      A  +RNLVG EVG +  L          
Sbjct: 539  DVKVTGSQSG----------------------ATKDRNLVGTEVGYEMGLDADNNKLTVS 576

Query: 2536 NGGSASNMSTSPAPVVSLPSLLKDIAVNPTMLVELLKMXXXXXXXXXXXXXXAGPAVNGL 2357
            +  S S  +T   P+VSLPSLLK IA NP MLV+LL+M               G AVNGL
Sbjct: 577  SVPSTSISTT--GPIVSLPSLLKGIAANPQMLVQLLRMEQQKIAAGQAQEKPDGQAVNGL 634

Query: 2356 SSAISPSPDVGQNPAAKPQM-----NGPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSEL 2192
            SSA S S  +GQNP+ K QM     +  N+M KIRMKPRDPRRVLHSNM Q++E+ GS  
Sbjct: 635  SSATSSSSGIGQNPSVKSQMPPQTKSTNNNMAKIRMKPRDPRRVLHSNMAQRTENSGS-- 692

Query: 2191 AKSNGDLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSS 2012
                    S V S KD L  ++QG+QAQ   +P QSASL DISQQFTKNLQNLAD+VS S
Sbjct: 693  ------ATSNVHSIKDQLLHRKQGDQAQKLAVPLQSASLTDISQQFTKNLQNLADMVSKS 746

Query: 2011 QTSALPVGTQNSSQLIPSKICNDTTEPKTVTGMCTQGETASGVIDLANPWGDVDHLLDGY 1832
            Q                SK  +D T+PK V  +C+Q +T       ANPWGDVDHLLDGY
Sbjct: 747  Q----------------SKTSDDLTKPKNVPELCSQTDTKPA----ANPWGDVDHLLDGY 786

Query: 1831 DDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXLNSAKFVEVDPIHEEILXXXXX 1652
            DD+QKAAIQKERARRIEEQNKMFA RK           LNSAKFVE+DPIHEEIL     
Sbjct: 787  DDKQKAAIQKERARRIEEQNKMFAARKLCLVLDLDHTLLNSAKFVEIDPIHEEILRKKEE 846

Query: 1651 XXXXXXXRHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPK 1472
                   RHLFR  HMGMWTKLRPG+W FLEKAS+LYELH+YTMGNKLYATEMAK+LDPK
Sbjct: 847  QDRQTQERHLFRLQHMGMWTKLRPGIWTFLEKASQLYELHVYTMGNKLYATEMAKLLDPK 906

Query: 1471 GTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVV 1292
            G LF GRV+S+GD+GDPFD D+RVPKSKDLDGVLGMESAV+IIDDSLRVWPHNKHNLIVV
Sbjct: 907  GNLFAGRVLSRGDDGDPFDGDDRVPKSKDLDGVLGMESAVLIIDDSLRVWPHNKHNLIVV 966

Query: 1291 ERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVR 1112
            ERY YFPSSRRQFGL GPSLLEIDHDERP+ GTLASSL VIER+H+IFFSH  L EVDVR
Sbjct: 967  ERYTYFPSSRRQFGLIGPSLLEIDHDERPEDGTLASSLAVIERIHEIFFSHSCLTEVDVR 1026

Query: 1111 NILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANS 932
            NIL AEQRKIL GCKIVFSR+FPVGEANPHLHPLWQTAEQFGAECTNQIDE VTHVVANS
Sbjct: 1027 NILGAEQRKILAGCKIVFSRIFPVGEANPHLHPLWQTAEQFGAECTNQIDEQVTHVVANS 1086

Query: 931  FGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 812
             GTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA+K
Sbjct: 1087 LGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAVK 1126


>ref|XP_010929653.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Elaeis guineensis]
          Length = 1268

 Score =  834 bits (2155), Expect = 0.0
 Identities = 467/803 (58%), Positives = 547/803 (68%), Gaps = 64/803 (7%)
 Frame = -1

Query: 3028 NSSGGPQMSSVNSSGGLQMSS-----VNSSGAFQ---MAPLNTSGGSQMAPVKTSAKSRD 2873
            +SS       VN++  +Q+++      +SS + Q   + P+   G +     + + KSRD
Sbjct: 478  SSSANRNAGCVNTTSQIQVATSSAACTDSSSSHQPGTVKPVGQLGSAPNLATRPALKSRD 537

Query: 2872 PRLRFMNSEVGGA--------------PQNGFAAGSVNSRKHKAMDEPVPDEHNLKRQRN 2735
            PRLRF++SE G A              P NG   G  N RKHKA+DE +P+ H LKRQRN
Sbjct: 538  PRLRFVSSESGSASDPNTQVMSLDSSAPNNGPVGGITNPRKHKAVDESLPENHTLKRQRN 597

Query: 2734 ESTRSRDAQVMPGRGG-WIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXX 2558
              T S D Q++PGRGG W++D+  V SQ +++++ ++NM +  +N V   VG DRR    
Sbjct: 598  GLTNSGDVQMIPGRGGGWLDDSSAVGSQPSDKIRLSENMEIETKNPVS-VVGSDRRPDSN 656

Query: 2557 XXXXXXXNG-----------GSASNMSTSPAPVVSLPSLLKDIAVNPTMLVELLKMXXXX 2411
                    G            S++  S+S A  VS PSLLKDIAVNPTML++L++M    
Sbjct: 657  PNIHVSNTGTCPIPSSTAAPASSTAPSSSAAASVSFPSLLKDIAVNPTMLMQLIQMEQQR 716

Query: 2410 XXXXXXXXXXA-------GPAVNGLSSAISP-------SPDVGQNPAAKPQM-------N 2294
                                ++N LS A+S        S +VGQNP  +PQ+       N
Sbjct: 717  LSAEAQQKTVGLMQNMAHASSLNVLSGAVSSATVASMKSTEVGQNPGGRPQVPPQTVSTN 776

Query: 2293 GPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGDLPSEVQSSKDLLTVQEQGEQ 2114
              +D+G+IRMKPRDPRRVLH NMVQK+E++ SE AK NG L S+ QSSKD   + EQGEQ
Sbjct: 777  SQSDVGRIRMKPRDPRRVLH-NMVQKNETVVSERAKPNGTLSSDPQSSKDQSAIGEQGEQ 835

Query: 2113 AQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQ-TSALPVGTQNSSQLIPSKI----- 1952
            AQ  TLP+Q         QF KN +NL DI S+ Q T+  P  +Q  SQ I  KI     
Sbjct: 836  AQATTLPTQ---------QFAKNTKNLGDISSTLQSTTTPPAASQIISQPIQLKINKVDP 886

Query: 1951 ---CNDTTEPKTVTGMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIE 1781
                   ++PKT++ + ++G T +G     NPWGDVDHLLDGYDDQQKAAIQ+ERARRI 
Sbjct: 887  RPAAAVVSDPKTLSAVTSEGST-TGATPSTNPWGDVDHLLDGYDDQQKAAIQRERARRIA 945

Query: 1780 EQNKMFAERKXXXXXXXXXXXLNSAKFVEVDPIHEEILXXXXXXXXXXXXRHLFRFPHMG 1601
            EQNKMFA RK           LNSAKFVEVDP+HEEIL            RHLFRF HMG
Sbjct: 946  EQNKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHEEILRKKEEQDREKPQRHLFRFQHMG 1005

Query: 1600 MWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDP 1421
            MWTKLRPG+W FLEKASKLYE+HLYTMGNKLYATEMAKVLDP GTLF GRVIS+GD+GDP
Sbjct: 1006 MWTKLRPGIWTFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGTLFAGRVISRGDDGDP 1065

Query: 1420 FDSDERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPG 1241
            FD DERVPKSKDLDGVLGMESAVVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGL G
Sbjct: 1066 FDGDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLFG 1125

Query: 1240 PSLLEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIV 1061
            PSLLEIDHDERP+ GTLASSL VIER+HQ FFSH SLN++DVRNILAAEQRKIL GCKIV
Sbjct: 1126 PSLLEIDHDERPEDGTLASSLAVIERIHQNFFSHHSLNDIDVRNILAAEQRKILAGCKIV 1185

Query: 1060 FSRVFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVV 881
            FSRVFPVGEANPHLHPLWQ AEQFGA CTNQIDE VTHVVANS GTDKVNWALSTGRFVV
Sbjct: 1186 FSRVFPVGEANPHLHPLWQMAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVV 1245

Query: 880  HPGWVEASALLYRRANEQDFAIK 812
            HPGWVEASALLYRR +E DFA+K
Sbjct: 1246 HPGWVEASALLYRRVSEHDFAVK 1268


>ref|XP_008791049.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Phoenix dactylifera]
          Length = 1269

 Score =  812 bits (2097), Expect = 0.0
 Identities = 459/806 (56%), Positives = 534/806 (66%), Gaps = 67/806 (8%)
 Frame = -1

Query: 3028 NSSGGPQMSSVNSSGGLQMSS-----VNSSGAFQMAPLNTSGGSQMAP---VKTSAKSRD 2873
            +SS       VN++  +Q+++      +SS   Q  P+   G    AP   ++ + KSRD
Sbjct: 479  SSSANGNAGCVNTTSEIQVATNSAACTDSSSRHQPGPVKPVGQLGSAPNPAIRPALKSRD 538

Query: 2872 PRLRFMNSEVGGA--------------PQNGFAAGSVNSRKHKAMDEPVPDEHNLKRQRN 2735
            PRLRF+NSE G A              P N    G  N RKHKA+DE  P+ H LKRQ+N
Sbjct: 539  PRLRFVNSESGNASDPNRRAMSLDFSAPNNDLVGGITNPRKHKAVDESFPENHTLKRQKN 598

Query: 2734 ESTRSRDAQVMPGRGG-WIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXX 2558
              T S D Q+ PGRGG W+ED+  V SQ++++++ N+NM +  +N  G  V  DRR    
Sbjct: 599  GLTNSSDVQMTPGRGGGWLEDSSSVRSQLSDKIRLNENMEIEIKN-PGNVVMSDRRPDSN 657

Query: 2557 XXXXXXXNG-----------GSASNMSTSPAPVVSLPSLLKDIAVNPTMLVELLKMXXXX 2411
                    G            S +  S+S A  VS PSLLKDIAVNPTML++L+++    
Sbjct: 658  PNIQVTNTGTCMIPSSTTAPSSGTAPSSSAAASVSFPSLLKDIAVNPTMLMQLIQIEQQR 717

Query: 2410 XXXXXXXXXXA-------GPAVNGLSSAISP-------SPDVGQNPAAKPQM-------N 2294
                                ++N L  A+S        S +VG NP+ +PQ+       N
Sbjct: 718  LSAEAQQKTVGLMHNMAHASSLNVLPGAVSSANVASMKSAEVGHNPSGRPQVTAQTVSTN 777

Query: 2293 GPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGDLPSEVQSSKDLLTVQEQGEQ 2114
              +D+G+IRMKPRDPRR+LH NMVQK+E++ SE AK NG L S+ QSSKD L + EQGEQ
Sbjct: 778  SQSDVGRIRMKPRDPRRILH-NMVQKNETIVSERAKPNGTLSSDPQSSKDHLAIGEQGEQ 836

Query: 2113 AQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQTSALPVGTQ-----------NSSQL 1967
            AQ   LP+          Q  KN +NL DI S  Q +  P+              N   L
Sbjct: 837  AQATGLPTL---------QLAKNPKNLGDISSPLQLTTTPLAVPQIISQPIQFNINKVDL 887

Query: 1966 IPSK-ICNDTTEPKTVTGMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERAR 1790
             P+  + ND   PKT++ + ++G T        N WGDVDHLLDGYDDQQKAAIQ+ERAR
Sbjct: 888  RPAAAVVND---PKTLSTVASEGSTTVAT-QSTNAWGDVDHLLDGYDDQQKAAIQRERAR 943

Query: 1789 RIEEQNKMFAERKXXXXXXXXXXXLNSAKFVEVDPIHEEILXXXXXXXXXXXXRHLFRFP 1610
            RI EQNKMFA RK           LNSAKFVEVDP+HEEIL            RHLFRF 
Sbjct: 944  RIAEQNKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHEEILRKKEEQDREKPQRHLFRFQ 1003

Query: 1609 HMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDE 1430
            HMGMWTKLRPG+WNFLEKASKLYE+HLYTMGNKLYATEMAKVLDP GTLF GRVIS+GD+
Sbjct: 1004 HMGMWTKLRPGIWNFLEKASKLYEMHLYTMGNKLYATEMAKVLDPTGTLFAGRVISRGDD 1063

Query: 1429 GDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFG 1250
             +PFD DERVPKSKDLDGVLGMESAVVIIDDS+RVWPHNK NLIVVERY YFP SRRQFG
Sbjct: 1064 SEPFDGDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 1123

Query: 1249 LPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGC 1070
            L GPSLLEIDHDERP+ GTLASSL VIER+H  FFSH+SLN+VDVRNILAAEQRKIL GC
Sbjct: 1124 LFGPSLLEIDHDERPEDGTLASSLTVIERIHDDFFSHRSLNDVDVRNILAAEQRKILAGC 1183

Query: 1069 KIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGR 890
            KIVFSRVFPVGEANPHLHPLWQ AEQFGA CTNQIDE VTHVVANS GTDKVNWALSTGR
Sbjct: 1184 KIVFSRVFPVGEANPHLHPLWQMAEQFGAACTNQIDEQVTHVVANSLGTDKVNWALSTGR 1243

Query: 889  FVVHPGWVEASALLYRRANEQDFAIK 812
            FVVHP WVEASALLYRR NEQDFA+K
Sbjct: 1244 FVVHPSWVEASALLYRRVNEQDFAVK 1269


>gb|OVA17386.1| BRCT domain [Macleaya cordata]
          Length = 1214

 Score =  801 bits (2069), Expect = 0.0
 Identities = 444/740 (60%), Positives = 510/740 (68%), Gaps = 44/740 (5%)
 Frame = -1

Query: 2899 VKTSAKSRDPRLRFMNSEVGG-------------APQNGFAAGSVNSRKHKAMDEPVPDE 2759
            ++   KSRDPRLRF+NSEVG              AP++    G ++SRK+K   E V D 
Sbjct: 478  LRPQPKSRDPRLRFLNSEVGSVDLNQRSPYVEYNAPKSETLGGIISSRKNKTDPESVLDG 537

Query: 2758 HNLKRQRN---ESTRSRDAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGE 2588
            H LKRQRN     T S   Q+  G GGW+ED   V  Q   R+Q  +++    R +  GE
Sbjct: 538  HTLKRQRNGLTSPTVSGGVQMSSGSGGWLEDISTVRPQPTPRIQLAESVGSDPRMIGNGE 597

Query: 2587 VGPDRRLVXXXXXXXXXNGGSASNMSTSPAPVVSLPSLLKDIAVNPTMLVELLK-MXXXX 2411
            V    R            GG+     T    + SLPSLLKDIAVNPTML+ L++      
Sbjct: 598  VLSGLRQDTSSSNINVRAGGNDQLPLTGIDTMGSLPSLLKDIAVNPTMLINLIREQQRLA 657

Query: 2410 XXXXXXXXXXAGPAVNGLSSAISP------------SPDVGQNPAAKPQMNGP------N 2285
                          + G SS + P            S ++ Q PA KPQ  GP       
Sbjct: 658  AETQQKSSNPTQNKITGSSSNVLPRSVPLANVASSKSSEIEQKPAVKPQ--GPAETISTG 715

Query: 2284 DMGKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGDLPSEVQSSKDLLTVQEQGEQAQT 2105
            + GKIRMKPRDPRR+LH++  QK+E LG E  K+ G   S +Q+SKD L V++QGE AQT
Sbjct: 716  EFGKIRMKPRDPRRILHNSTFQKNECLGLEQLKTIGASSSLIQASKDNLIVRQQGELAQT 775

Query: 2104 NTLPSQSASLPDISQQFTKNLQNLADIVSSSQTSALP--VGTQNSSQLIPSKICNDTTEP 1931
            N+LPS SA  PDI+QQFTK L+NLADI+SSSQ + +P  V    SSQ IP+KI  DTT+ 
Sbjct: 776  NSLPSHSAPAPDIAQQFTKELKNLADILSSSQATNIPSVVPQTVSSQTIPTKI--DTTDM 833

Query: 1930 KTVTGMCTQGETASG-------VIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQN 1772
            +TV  +    ++ +G       V+   N W DV+HL +GYDDQQ+AAI +ERARRIEEQN
Sbjct: 834  RTVVTVPKDQQSGTGTTPEEGTVLPSENKWEDVEHLFEGYDDQQRAAIHRERARRIEEQN 893

Query: 1771 KMFAERKXXXXXXXXXXXLNSAKFVEVDPIHEEILXXXXXXXXXXXXRHLFRFPHMGMWT 1592
            KMFA RK           LNSAKF+EVDP+H+EIL            RHLFRFPHMGMWT
Sbjct: 894  KMFAARKLCLVLDLDHTLLNSAKFIEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 953

Query: 1591 KLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDS 1412
            KLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDP G LF GRVISKGDEGDPFD 
Sbjct: 954  KLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGVLFAGRVISKGDEGDPFDG 1013

Query: 1411 DERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSL 1232
            DERVPKSKDL+GVLGMES+VVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGL GPSL
Sbjct: 1014 DERVPKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSL 1073

Query: 1231 LEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSR 1052
            LEIDHDERP+ GTLASSL VIER+HQ FFSH SL++VDVRNILA+EQRKIL GC+IVFSR
Sbjct: 1074 LEIDHDERPEEGTLASSLAVIERIHQNFFSHMSLHDVDVRNILASEQRKILAGCRIVFSR 1133

Query: 1051 VFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPG 872
            VFPVGEANPHLHPLWQ+AEQFGA CT QIDE VTHVVANS GTDKVNWALSTGRFVVHP 
Sbjct: 1134 VFPVGEANPHLHPLWQSAEQFGAVCTTQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPS 1193

Query: 871  WVEASALLYRRANEQDFAIK 812
            WVEAS LLYRRANE DFA+K
Sbjct: 1194 WVEASTLLYRRANEHDFAVK 1213


>ref|XP_010249185.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Nelumbo nucifera]
          Length = 1313

 Score =  786 bits (2029), Expect = 0.0
 Identities = 447/787 (56%), Positives = 527/787 (66%), Gaps = 55/787 (6%)
 Frame = -1

Query: 3007 MSSVNSSGGLQMSSVNSSGAFQMA-----PLNTSG--GSQMAPVKTSAKSRDPRLRFMNS 2849
            ++++NSS  L+  S  +S A  ++     P  + G  GS  + V  +AK+RDPRLR+ NS
Sbjct: 529  VATINSSTSLKTVSSATSYADNLSGQGLVPAVSVGQLGSMSSHVIRTAKNRDPRLRYANS 588

Query: 2848 EVGGA-----PQNGF--------AAGSVNSRKHKAMDEPVPDEHNLKRQRN---ESTRSR 2717
            EVG       P +G           G + SRKHK ++E + D+H  KRQRN    S  S 
Sbjct: 589  EVGPLDLNQRPPSGDHDIRKSEPLGGIMGSRKHKIVEESLLDDHTFKRQRNGLINSGASG 648

Query: 2716 DAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXX 2537
            D QV+ G GGW+E++  +  Q  +R +  +      R L  GE     +           
Sbjct: 649  DVQVVSGSGGWLEESSSMGLQPTDRSRLIEKRESDPRKLGSGEASFGNKQDTGCSTYNVT 708

Query: 2536 NGGSASNMSTSPAPVVSLPSLLKDIAVNPTMLVELLKMXXXXXXXXXXXXXXAGPAVNGL 2357
             GG+    ++     VSLPSLLKDIAVNPTML+ L+KM                PA + +
Sbjct: 709  TGGNEQLTASGIGSTVSLPSLLKDIAVNPTMLMHLIKMEHQRLAVEALQKCG-NPAQSTM 767

Query: 2356 ---SSAISPSPDVGQNPAAKP-------------------QMNGPNDMGKIRMKPRDPRR 2243
               SS++ P      N A+K                     M    D+GKIRMKPRDPRR
Sbjct: 768  QSSSSSVMPGKIASVNIASKTLSEPEKKSAGNSQISVQTASMIPHGDLGKIRMKPRDPRR 827

Query: 2242 VLHSNMVQKSESLGSELAKSNGDLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDIS 2063
            +LHSN  QKS+S G E  K+NG       + +D L V++QGEQAQTN+L SQS + PDI+
Sbjct: 828  ILHSNTFQKSDSSGPERFKANGTPSPNTPTCRDNLIVRQQGEQAQTNSLLSQSTAPPDIA 887

Query: 2062 QQFTKNLQNLADIVSSSQTSALP--VGTQNSSQLIPSK--------ICNDTTEPKTVTGM 1913
            QQFTK L+N+A+I+S+SQ    P  V    SSQ +P+K        +  D+ + ++ + +
Sbjct: 888  QQFTKKLKNIANILSASQAINTPSVVPQTISSQPVPAKMDKVDMKVVATDSNDQRSWSAL 947

Query: 1912 CTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXX 1733
             T  E A+G     N WGDV+HL +GYDDQQKAAIQ+ERARRIEEQN+MFA RK      
Sbjct: 948  -TPEERAAGPSS-QNAWGDVEHLFEGYDDQQKAAIQRERARRIEEQNQMFAARKLCLVLD 1005

Query: 1732 XXXXXLNSAKFVEVDPIHEEILXXXXXXXXXXXXRHLFRFPHMGMWTKLRPGVWNFLEKA 1553
                 LNSAKFVEVDP+HEE+L            RHLFRF HMGMWTKLRPG+WNFLEKA
Sbjct: 1006 LDHTLLNSAKFVEVDPVHEEMLRKKEEQDREKPQRHLFRFTHMGMWTKLRPGIWNFLEKA 1065

Query: 1552 SKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGV 1373
            SKLYELHLYTMGNKLYATEMAKVLDP G LF GRVIS+GD+GDPFD DER PKSKDLDGV
Sbjct: 1066 SKLYELHLYTMGNKLYATEMAKVLDPTGVLFAGRVISRGDDGDPFDGDERQPKSKDLDGV 1125

Query: 1372 LGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGT 1193
            LGMESAVVIIDDS+RVWPHNK NLIVVERY YFP SRRQ GL GPSLLEIDHDERP+ GT
Sbjct: 1126 LGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQLGLHGPSLLEIDHDERPEDGT 1185

Query: 1192 LASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHP 1013
            LASSL VIER+HQ FFSHQ+LN+VDVRNILAAEQ+KIL GC+IVFSRVFPVGEANPHLHP
Sbjct: 1186 LASSLAVIERIHQNFFSHQNLNDVDVRNILAAEQQKILAGCRIVFSRVFPVGEANPHLHP 1245

Query: 1012 LWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRAN 833
            LWQTAEQFGA CTNQIDE VTHVVA S GTDKVNWALSTGRFVVHPGWVEASALLYRRAN
Sbjct: 1246 LWQTAEQFGAVCTNQIDEQVTHVVAISLGTDKVNWALSTGRFVVHPGWVEASALLYRRAN 1305

Query: 832  EQDFAIK 812
            E DFAIK
Sbjct: 1306 EHDFAIK 1312


>gb|EOX99661.1| RNA polymerase II C-terminal domain phosphatase-like 3, putative
            [Theobroma cacao]
          Length = 1290

 Score =  781 bits (2016), Expect = 0.0
 Identities = 433/771 (56%), Positives = 508/771 (65%), Gaps = 53/771 (6%)
 Frame = -1

Query: 2965 VNSSGAFQMAPLNTSGGSQMAPV-----KTSAKSRDPRLRFMNSEVGGAPQN-------- 2825
            V+S+ +     + T   + M+ V     K+ AKSRDPRL F NS       N        
Sbjct: 528  VDSASSSLQGQITTRNATPMSSVSNIVSKSLAKSRDPRLWFANSNASALDLNERLLHNAS 587

Query: 2824 --GFAAGSVNSRKHKAMDEPVPDEHNLKRQRNESTR---SRDAQVMPGRGGWIEDNGMVA 2660
                  G ++SRK K+++EP+ D   LKRQRNE      +RD Q + G GGW+ED   + 
Sbjct: 588  KVAPVGGIMDSRKKKSVEEPILDSPALKRQRNELENLGVARDVQTVSGIGGWLEDTDAIG 647

Query: 2659 SQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXNGGSASNMSTSPAPVVSLP 2480
            SQ+ NR Q  +N+   +R +  G                    G+   +  +     SLP
Sbjct: 648  SQITNRNQTAENLESNSRKMDNGVTSSST-----LSGKTNITVGTNEQVPVTSTSTPSLP 702

Query: 2479 SLLKDIAVNPTMLVELLKMXXXXXXXXXXXXXXAG--------PAVNGLSSAIS-----P 2339
            +LLKDIAVNPTML+ +LKM                        P+ N L   +S     P
Sbjct: 703  ALLKDIAVNPTMLINILKMGQQQRLGAEAQQKSPDPVKSTFHQPSSNSLLGVVSSTNVIP 762

Query: 2338 SPDV----------GQNPAAKPQMNGPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELA 2189
            SP V             PA   Q+  P++ GKIRMKPRDPRRVLH N +Q+S S+G +  
Sbjct: 763  SPSVNNVPSISSGISSKPAGNLQVPSPDESGKIRMKPRDPRRVLHGNSLQRSGSMGLDQL 822

Query: 2188 KSNGDLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQ 2009
            K+NG L S  Q SKD L  Q+   Q ++  + SQ    PDI+QQFT NL+N+ADI+S SQ
Sbjct: 823  KTNGALTSSTQGSKDNLNAQKLDSQTESKPMQSQLVPPPDITQQFTNNLKNIADIMSVSQ 882

Query: 2008 --TSALPVGTQNSSQLIPSKIC--NDTTEPKTVTGMCTQGETASGVIDLA--------NP 1865
              TS  PV    S  L+P  +   +D+ + K +       +T +G+   A        N 
Sbjct: 883  ALTSLPPV----SHNLVPQPVLIKSDSMDMKALVSNSEDQQTGAGLAPEAGATGPRSQNA 938

Query: 1864 WGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXLNSAKFVEVDP 1685
            WGDV+HL + YDDQQKAAIQ+ERARRIEEQ KMF+ RK           LNSAKF+EVDP
Sbjct: 939  WGDVEHLFERYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFIEVDP 998

Query: 1684 IHEEILXXXXXXXXXXXXRHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLY 1505
            +HEEIL            RHLFRF HMGMWTKLRPG+WNFLEKASKLYELHLYTMGNKLY
Sbjct: 999  VHEEILRKKEEQDREKPERHLFRFHHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLY 1058

Query: 1504 ATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRV 1325
            ATEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMESAVVIIDDS+RV
Sbjct: 1059 ATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESAVVIIDDSVRV 1118

Query: 1324 WPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFF 1145
            WPHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDERP+ GTLASSL VIER+HQ FF
Sbjct: 1119 WPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIERIHQDFF 1178

Query: 1144 SHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQI 965
            SHQ+L++VDVRNILA+EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAEQFGA CTNQI
Sbjct: 1179 SHQNLDDVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQI 1238

Query: 964  DEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 812
            DEHVTHVVANS GTDKVNWALSTG+FVVHPGWVEASALLYRRANE DFAIK
Sbjct: 1239 DEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEVDFAIK 1289


>ref|XP_007043830.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Theobroma cacao]
          Length = 1290

 Score =  780 bits (2014), Expect = 0.0
 Identities = 433/771 (56%), Positives = 507/771 (65%), Gaps = 53/771 (6%)
 Frame = -1

Query: 2965 VNSSGAFQMAPLNTSGGSQMAPV-----KTSAKSRDPRLRFMNSEVGGAPQN-------- 2825
            V+S+ +     + T   + M+ V     K+ AKSRDPRL F NS       N        
Sbjct: 528  VDSASSSLQGQITTRNATPMSSVSNIVSKSLAKSRDPRLWFANSNASALDLNERLLHNAS 587

Query: 2824 --GFAAGSVNSRKHKAMDEPVPDEHNLKRQRNESTR---SRDAQVMPGRGGWIEDNGMVA 2660
                  G ++SRK K+++EP+ D   LKRQRNE      +RD Q + G GGW+ED   + 
Sbjct: 588  KVAPVGGIMDSRKKKSVEEPILDSPALKRQRNELENLGVARDVQTVSGIGGWLEDTDAIG 647

Query: 2659 SQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXNGGSASNMSTSPAPVVSLP 2480
            SQ+ NR Q  +N+   +R +  G                    G+   +  +     SLP
Sbjct: 648  SQITNRNQTAENLESNSRKMDNGVTSSST-----LSGKTNITVGTNEQVPVTSTSTPSLP 702

Query: 2479 SLLKDIAVNPTMLVELLKMXXXXXXXXXXXXXXAG--------PAVNGLSSAIS-----P 2339
            +LLKDIAVNPTML+ +LKM                        P+ N L   +S     P
Sbjct: 703  ALLKDIAVNPTMLINILKMGQQQRLGAEAQQKSPDPVKSTFHQPSSNSLLGVVSSTNVIP 762

Query: 2338 SPDV----------GQNPAAKPQMNGPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELA 2189
            SP V             PA   Q+  P++ GKIRMKPRDPRRVLH N +Q+S S+G +  
Sbjct: 763  SPSVNNVPSISSGISSKPAGNLQVPSPDESGKIRMKPRDPRRVLHGNSLQRSGSMGPDQL 822

Query: 2188 KSNGDLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQ 2009
            K+NG L S  Q SKD L  Q+   Q ++  + SQ    PDI+QQFT NL+N+A IVS SQ
Sbjct: 823  KTNGALTSSTQGSKDNLNAQKLDSQTESKPMQSQLVPPPDITQQFTNNLKNIAGIVSVSQ 882

Query: 2008 --TSALPVGTQNSSQLIPSKIC--NDTTEPKTVTGMCTQGETASGVIDLA--------NP 1865
              TS  PV    S  L+P  +   +D+ + K +       +T +G+   A        N 
Sbjct: 883  ALTSLSPV----SHNLVPQPVLIKSDSMDMKALVSNSEDQQTGAGLAPEAGATGPHSQNA 938

Query: 1864 WGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXLNSAKFVEVDP 1685
            WGDV+HL + YDDQQKAAIQ+ERARRIEEQ KMF+ RK           LNSAKF+EVDP
Sbjct: 939  WGDVEHLFERYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFIEVDP 998

Query: 1684 IHEEILXXXXXXXXXXXXRHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLY 1505
            +HEEIL            RHLFRF HMGMWTKLRPG+WNFLEKASKLYELHLYTMGNKLY
Sbjct: 999  VHEEILRKKEEQDREKPERHLFRFHHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLY 1058

Query: 1504 ATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRV 1325
            ATEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMESAVVIIDDS+RV
Sbjct: 1059 ATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESAVVIIDDSVRV 1118

Query: 1324 WPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFF 1145
            WPHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDERP+ GTLASSL VIER+HQ FF
Sbjct: 1119 WPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIERIHQDFF 1178

Query: 1144 SHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQI 965
            SHQ+L++VDVRNILA+EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAEQFGA CTNQI
Sbjct: 1179 SHQNLDDVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQI 1238

Query: 964  DEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 812
            DEHVTHVVANS GTDKVNWALSTG+FVVHPGWVEASALLYRRANE DFAIK
Sbjct: 1239 DEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEVDFAIK 1289


>ref|XP_009421039.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Musa acuminata subsp. malaccensis]
          Length = 1228

 Score =  777 bits (2007), Expect = 0.0
 Identities = 450/788 (57%), Positives = 516/788 (65%), Gaps = 50/788 (6%)
 Frame = -1

Query: 3025 SSGGPQMSSVNSSGGLQMSSVNSSGAFQMAPLNTSGGSQMAPVKT--------------S 2888
            +S    +S+  ++  +Q  +V SS     A  N+S G Q  PVK               +
Sbjct: 453  ASSSSVVSNAETACTIQNQAVKSSST--AACSNSSAGDQPYPVKLVGQVGSGSKSSAKPA 510

Query: 2887 AKSRDPRLRFMNSEVGG-----------APQNGFAAGSVNSRKHKAMDEPVPDEHNLKRQ 2741
             K RDPRL+ MN+EV G           A  N    GS+N+RKHK++DEPV  +H +KRQ
Sbjct: 511  LKRRDPRLKLMNNEVRGPSVGDKGIDSNALDNRLVGGSMNTRKHKSVDEPVTGDHKMKRQ 570

Query: 2740 RNESTRSRDAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVX 2561
            +N  T SRD Q+  GRGGW+ED+ +   Q ++R Q N+N  V  R    GEVG  ++   
Sbjct: 571  KNGFTGSRDMQMTSGRGGWLEDSSI--PQPSDRNQINENFQVEVRKPGSGEVGSGKK--S 626

Query: 2560 XXXXXXXXNGGSASNMSTSPAPVVSLPSLLKDIAVNPTMLVELLKMXXXXXXXXXXXXXX 2381
                      G   N S +    +SLP LLK  AVNPT+ V+LL+M              
Sbjct: 627  DSNMNFSMLNGLIPNPSGNLPNTLSLPPLLK--AVNPTIFVQLLQMEQHRLAAENHQIVT 684

Query: 2380 AGPA-------VNGLSSAISP-------SPDVGQN-------PAAKPQMNGPNDMGKIRM 2264
            A  +       VNGL  A+S        S +VGQN       P+    ++  ND+G+IRM
Sbjct: 685  ASTSDVTNVSKVNGLPGAVSSVNSTPLKSQEVGQNHLGMSQIPSQSASVSSQNDVGRIRM 744

Query: 2263 KPRDPRRVLHSNMVQKSESLGSELAKSNGDLPSEVQSSKDLLTVQEQGEQAQTNTLPSQS 2084
            KPRDPRR LH+NMVQ    + SE  K N  +P   QSS    T +E GEQAQ + L +Q 
Sbjct: 745  KPRDPRRALHNNMVQMKNVIVSEQNKINEAIPGP-QSSMGHSTAREPGEQAQASVLATQF 803

Query: 2083 ASLPDISQQFTKNLQNLADIVSSSQTSALPVGTQNSSQLIPSKICNDTTEPKTV----TG 1916
               P++S+Q TKNL N   IVSSSQ +A    +Q   Q IPSK       P +     + 
Sbjct: 804  VPQPNMSRQLTKNLGN---IVSSSQLAAT---SQAVPQYIPSKANQVNVRPASAELNDSK 857

Query: 1915 MCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXX 1736
                  TA GV    N WGDVDH LDGY+D+Q+AAIQKERARRI EQNKMFA RK     
Sbjct: 858  TLVSEATAKGVSQSVNAWGDVDHFLDGYNDEQRAAIQKERARRIAEQNKMFAARKLCLVL 917

Query: 1735 XXXXXXLNSAKFVEVDPIHEEILXXXXXXXXXXXXRHLFRFPHMGMWTKLRPGVWNFLEK 1556
                  LNSAKFVEVDP+HEEIL            RHLF F HMGMWTKLRPG+WNFL+K
Sbjct: 918  DLDHTLLNSAKFVEVDPVHEEILRRKEEQDREKPQRHLFCFHHMGMWTKLRPGIWNFLDK 977

Query: 1555 ASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDG 1376
            ASKLYELHLYTMGNKLYATEMAKVLDP GTLF GRVIS+GD+ D  D DERVPKSKDLDG
Sbjct: 978  ASKLYELHLYTMGNKLYATEMAKVLDPTGTLFSGRVISRGDDADTVDGDERVPKSKDLDG 1037

Query: 1375 VLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAG 1196
            VLGMESAVVIIDDSLRVWP NK NLIVVERY YFPSSRRQFGL GPSLLEIDHDERP+ G
Sbjct: 1038 VLGMESAVVIIDDSLRVWPLNKLNLIVVERYTYFPSSRRQFGLLGPSLLEIDHDERPEDG 1097

Query: 1195 TLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLH 1016
            TLASSL VIER+HQ FFSH SL +VDVRNILAAEQRKIL GC+IVFSRVFPVGEANPHLH
Sbjct: 1098 TLASSLAVIERIHQNFFSHHSLKDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH 1157

Query: 1015 PLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRA 836
            PLWQTAEQFGA CTNQIDE VTHVVANS GTDKVNWALSTGRFVVHPGWVEASALLYRRA
Sbjct: 1158 PLWQTAEQFGAICTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRA 1217

Query: 835  NEQDFAIK 812
            NE DFA+K
Sbjct: 1218 NEHDFAVK 1225


>ref|XP_021275229.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Herrania
            umbratica]
          Length = 1291

 Score =  778 bits (2008), Expect = 0.0
 Identities = 431/770 (55%), Positives = 509/770 (66%), Gaps = 49/770 (6%)
 Frame = -1

Query: 2974 MSSVNSSGAFQMAPLNTSGGSQMAPV--KTSAKSRDPRLRFMNSEVGGAPQN-------- 2825
            + S NSS   Q+   N +  S ++ +  K+ AKSRDPRL F N+       N        
Sbjct: 529  VDSANSSLQGQITTRNATPMSSVSNIVSKSLAKSRDPRLWFANTNASALDLNERPLHNAS 588

Query: 2824 --GFAAGSVNSRKHKAMDEPVPDEHNLKRQRNESTR---SRDAQVMPGRGGWIEDNGMVA 2660
                  G ++SRK K+++EP+ D   LKRQRNE      +RD Q + G GGW+ED  ++ 
Sbjct: 589  KVAPVGGIMDSRKRKSVEEPILDGPALKRQRNELENLGVARDVQTVCGIGGWLEDTDVIG 648

Query: 2659 SQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXNGGSASNMSTSPAPVVSLP 2480
            SQ+ NR Q  +N+   +R +  G                    G+   +  +     SLP
Sbjct: 649  SQITNRNQTAENLESNSRKMDNGVTSSST-----LSGKTNMTVGTNEQVPVTSTSTPSLP 703

Query: 2479 SLLKDIAVNPTMLVELLKMXXXXXXXXXXXXXXAGPAVNGL-------------SSAISP 2339
            +LLKDIAVNPTML+ +LKM                P  + L             S+ +  
Sbjct: 704  ALLKDIAVNPTMLISILKMGQQQRLGAEAQQKSPDPVKSTLHQPSSNSLLGVVSSTNVIS 763

Query: 2338 SPDVG----------QNPAAKPQMNGPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELA 2189
            SP V             PA   Q+  P++ GKIRMKPRDPRRVLH N +Q+S S+G +  
Sbjct: 764  SPSVNNVPSISSGILSKPAGNLQVPSPDESGKIRMKPRDPRRVLHGNSLQRSGSMGPDQL 823

Query: 2188 KSNGDLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQ 2009
            K+NG L S  Q SKD L  Q    Q ++  + SQ    PDI+QQFTKNL+N+ADI+S SQ
Sbjct: 824  KTNGALTSSTQGSKDNLNAQNLDSQTESKPMQSQLVPPPDITQQFTKNLKNIADIMSVSQ 883

Query: 2008 T-SALPVGTQNSSQLIPS--KICNDTTEPKTVTGMCTQGETASGVI--------DLANPW 1862
              ++LP   QN   L+P   +I +D+ + K +       +T +G+            N W
Sbjct: 884  ALTSLPPVPQN---LVPQPVQIKSDSMDMKALVSNSEDQQTGAGLAPEVGATGPHSQNAW 940

Query: 1861 GDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXLNSAKFVEVDPI 1682
            GDV+HL + YDDQQKAAIQ+ERARRIEEQ KMF+  K           LNSAKF+EVDP+
Sbjct: 941  GDVEHLFERYDDQQKAAIQRERARRIEEQKKMFSAHKLCLVLDLDHTLLNSAKFIEVDPV 1000

Query: 1681 HEEILXXXXXXXXXXXXRHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYA 1502
            HEEIL            RHLFRF HMGMWTKLRPG+WNFLEKASKLYELHLYTMGNKLYA
Sbjct: 1001 HEEILRKKEEQDREKPQRHLFRFHHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYA 1060

Query: 1501 TEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRVW 1322
            TEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMES VVIIDDS+RVW
Sbjct: 1061 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESGVVIIDDSVRVW 1120

Query: 1321 PHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFFS 1142
            PHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDERP+ GTLASSL VIER+HQ FFS
Sbjct: 1121 PHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIERIHQDFFS 1180

Query: 1141 HQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQID 962
            HQ+L++VDVRNILAAEQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAEQFGA CTNQID
Sbjct: 1181 HQNLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQID 1240

Query: 961  EHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 812
            EHVTHVVANS GTDKVNWALSTG+FVVHPGWVEASALLYRRANE DFAIK
Sbjct: 1241 EHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEVDFAIK 1290


>ref|XP_021662955.1| LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain
            phosphatase-like 3 [Hevea brasiliensis]
          Length = 1292

 Score =  772 bits (1994), Expect = 0.0
 Identities = 427/735 (58%), Positives = 494/735 (67%), Gaps = 40/735 (5%)
 Frame = -1

Query: 2896 KTSAKSRDPRLRFMNSEVGGAPQNGFAA----------GSVNSRKHKAMDEPVPDEHNLK 2747
            K SAKSRDPRLRF+NS+   + QN  A           G++N +K K++DEP+PD   LK
Sbjct: 560  KASAKSRDPRLRFVNSDANVSDQNNRAVPVVNNTLKVGGTMNLKKQKSVDEPIPDGPPLK 619

Query: 2746 RQRNESTRS---RDAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPD 2576
            RQ+  S  S   RD + M G GGW+ED  +V  Q  NR Q  +N     R +  G   P 
Sbjct: 620  RQKIASEISGVGRDVKTMIGSGGWLEDTDVVGPQTLNRNQLVENAESDPRRIDNGVACPS 679

Query: 2575 R----RLVXXXXXXXXXNGGSASNMSTSPAPVV-----SLPSLLKDIAVNPTMLVELLKM 2423
                   V           G+++       PV+     SLP LLK+IAVNPTML+ +LKM
Sbjct: 680  TVSGISSVNISGNEQLQVTGASAVAGAEQVPVMGASATSLPDLLKNIAVNPTMLISILKM 739

Query: 2422 XXXXXXXXXXXXXXAG--------PAVNGLSSA-----ISPSPDVGQNPAAK-----PQM 2297
                                    P  N +  A     ++P    G  P        PQ+
Sbjct: 740  GQQQRLAIEAQQKPVDLAKSTTHPPNTNSILGALPVVNVAPPQSTGILPRPAGALQVPQL 799

Query: 2296 NGPNDMGKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGDLPSEVQSSKDLLTVQEQGE 2117
               ++MGKIRMKPRDPRRVLH+N +Q++ SLGSE  K+N    S  Q +K+   VQ Q  
Sbjct: 800  AASDEMGKIRMKPRDPRRVLHNNTLQRNGSLGSEQFKTNLISTSTSQGTKENQNVQNQEG 859

Query: 2116 QAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQTSALPVGTQNSSQLIPSKICNDTT 1937
            Q +   +P+QS   PDIS  FTK+L+N+ADIVS S  S  P+ +QN   L+   +     
Sbjct: 860  QVEMKPVPTQSLVAPDISLPFTKSLKNIADIVSVSNASTPPLVSQN---LVSQHVRTVVL 916

Query: 1936 EPKTVTGMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAE 1757
              +  TG+      AS      N WGD DH+ +GY+DQQKAAIQ+ERARRIEEQ KMFA 
Sbjct: 917  NSEQPTGIGLPPGVASVAPRSQNTWGDFDHIFEGYNDQQKAAIQRERARRIEEQKKMFAA 976

Query: 1756 RKXXXXXXXXXXXLNSAKFVEVDPIHEEILXXXXXXXXXXXXRHLFRFPHMGMWTKLRPG 1577
             K           LNSAKFVE+DP+H+EIL            RHLFRFPHMGMWTKLRPG
Sbjct: 977  NKLCLVLDLDHTLLNSAKFVEIDPVHDEILRKKEEQDHEKPQRHLFRFPHMGMWTKLRPG 1036

Query: 1576 VWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVP 1397
            +WNFLEKASKLYELHLYTMGNKLYATEMAKVLDP G LF+GRVIS GD+GDPFDSDERVP
Sbjct: 1037 IWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGVLFNGRVISXGDDGDPFDSDERVP 1096

Query: 1396 KSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDH 1217
            KSKDL+GVLGMESAVVIIDDS+RVWPHNK NLIVVERY+YFP SRRQFGLPGPSLLEIDH
Sbjct: 1097 KSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDH 1156

Query: 1216 DERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVG 1037
            DERP+ GTLA SL VIER+HQ FF+H SL+E DVRNILA+EQRKIL GC+IVFSRVFPVG
Sbjct: 1157 DERPEDGTLACSLAVIERIHQNFFTHPSLDEADVRNILASEQRKILAGCRIVFSRVFPVG 1216

Query: 1036 EANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEAS 857
            EANPHLHPLWQTAEQFGA CTNQIDE VTHVVANS GTDKVNWALSTGRFVV+PGWVEAS
Sbjct: 1217 EANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVYPGWVEAS 1276

Query: 856  ALLYRRANEQDFAIK 812
            ALLYRRANEQDFAIK
Sbjct: 1277 ALLYRRANEQDFAIK 1291


>gb|EEF50102.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
          Length = 1195

 Score =  768 bits (1983), Expect = 0.0
 Identities = 425/774 (54%), Positives = 512/774 (66%), Gaps = 42/774 (5%)
 Frame = -1

Query: 3007 MSSVNSSGGLQMSSVNSSGAFQMAPLNTSGGSQMAP---VKTSAKSRDPRLRFMNSEVGG 2837
            ++S  S+  + +  ++ S    +  + ++  +  AP   VK SAKSRDPRLRF+NS+   
Sbjct: 421  LTSGQSNASISLPRMDGSSLPGVISIKSAVRASSAPSLTVKASAKSRDPRLRFVNSDSNA 480

Query: 2836 APQNGFA------------AGSVNSRKHKAMDEPVPDEHNLKRQRNESTRS---RDAQVM 2702
              QN  A             G++N ++ K +D+P+PD H+LKRQ+N    S   RD + M
Sbjct: 481  LDQNHRAVPVVNTLKVEPIGGTMNKKRQKIVDDPIPDGHSLKRQKNALENSGVVRDVKTM 540

Query: 2701 PGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRL---VXXXXXXXXXNG 2531
             G GGW+ED  MV  Q  N+ Q   N     R   GG V         V           
Sbjct: 541  VGSGGWLEDTDMVGPQTMNKNQLVDNAESDPRRKDGGGVCTSSSCISSVNISGTEQIPVT 600

Query: 2530 GSASNMSTSPAPV----VSLPSLLKDIAVNPTMLVELLKMXXXXXXXXXXXXXXAGPAVN 2363
            G++  +     PV     ++P LLK+IAVNPTML+ +LKM                PA +
Sbjct: 601  GTSVPIGGELVPVKGSTAAIPDLLKNIAVNPTMLINILKMGQQQRLALEAQQKPVDPAKS 660

Query: 2362 -----GLSSAISPSPDVG-------QNPA----AKPQMNGPNDMGKIRMKPRDPRRVLHS 2231
                   +S +   P VG         PA      PQ+   +D+GKIRMKPRDPRRVLH+
Sbjct: 661  TTYPLNSNSMLGTVPVVGAAHSGILPRPAGTVQVSPQLGTADDLGKIRMKPRDPRRVLHN 720

Query: 2230 NMVQKSESLGSELAKSNGDLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFT 2051
            N +Q++ S+GSE  K+N       Q +KD   +Q+Q  Q +   +P QS +LPDIS  FT
Sbjct: 721  NALQRNGSMGSEHLKTNLTSIPINQETKDNQNLQKQEGQVEKKPVPLQSLALPDISMPFT 780

Query: 2050 KNLQNLADIVSSSQTS-ALPVGTQNSSQLIPSKICNDTTEPKTVTGMCTQGETASGVIDL 1874
            KNL+N+ADIVS S  S + P+  QN +        + + +   +         A+     
Sbjct: 781  KNLKNIADIVSVSHASTSQPLVPQNPASQPMRTTISSSDQFLGIGSAPGAAAAAAAGPRT 840

Query: 1873 ANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXLNSAKFVE 1694
             N WGDV+HL +GY+DQQKAAIQ+ERARRIEEQ K+F+ RK           LNSAKFVE
Sbjct: 841  QNAWGDVEHLFEGYNDQQKAAIQRERARRIEEQKKLFSARKLCLVLDLDHTLLNSAKFVE 900

Query: 1693 VDPIHEEILXXXXXXXXXXXXRHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGN 1514
            VDP+H+EIL            RHLFRFPHMGMWTKLRPG+WNFLEKASKLYELHLYTMGN
Sbjct: 901  VDPVHDEILRKKEEQDREKAHRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGN 960

Query: 1513 KLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAVVIIDDS 1334
            KLYATEMAKVLDP G LF+GRVIS+GD+G+PFD DER+PKSKDL+GVLGMES VVI+DDS
Sbjct: 961  KLYATEMAKVLDPTGVLFNGRVISRGDDGEPFDGDERIPKSKDLEGVLGMESGVVIMDDS 1020

Query: 1333 LRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGVIERLHQ 1154
            +RVWPHNK NLIVVERY+YFP SRRQFGLPGPSLLEIDHDERP+ GTLA SL VIER+HQ
Sbjct: 1021 VRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEDGTLACSLAVIERIHQ 1080

Query: 1153 IFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQFGAECT 974
             FF+H SL+E DVRNILA+EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAEQFGA CT
Sbjct: 1081 NFFTHPSLDEADVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCT 1140

Query: 973  NQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 812
            NQIDE VTHVVANS GTDKVNWALSTGRFVV+PGWVEASALLYRRANEQDFAIK
Sbjct: 1141 NQIDEQVTHVVANSLGTDKVNWALSTGRFVVYPGWVEASALLYRRANEQDFAIK 1194


>ref|XP_009386584.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Musa acuminata subsp. malaccensis]
          Length = 1251

 Score =  770 bits (1987), Expect = 0.0
 Identities = 432/741 (58%), Positives = 506/741 (68%), Gaps = 42/741 (5%)
 Frame = -1

Query: 2899 VKTSAKSRDPRLRFMNSEVGG-----------APQNGFAAGSVNSRKHKAMDEPVPD-EH 2756
            VK + K RDPRLRFMN+EV G           AP +GF  G++N+RKHK  DE     + 
Sbjct: 522  VKPALKRRDPRLRFMNNEVRGPSEERSGIRCNAPDDGFLGGTINARKHKIADESAAVVDQ 581

Query: 2755 NLKRQRNESTRSRDAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPD 2576
             +KRQRN S  SR+  V+ G   W+E + ++  Q + R Q N+N+    R    GEVG D
Sbjct: 582  TMKRQRNGSMSSRNMHVISGSSEWLEGDSIIP-QPSERSQVNENLHADIRKAGTGEVGFD 640

Query: 2575 RRLVXXXXXXXXXNGGSASNMSTSPAPVVSLPSLLKDIAVNPTMLVELLKMXXXXXXXXX 2396
            +              G   N S++PA  +SLPSLLK  AVNPT+LV+LLKM         
Sbjct: 641  KE--PNSNANFSMLNGLKPNSSSNPAGPISLPSLLK--AVNPTILVQLLKMEQQRLAAEN 696

Query: 2395 XXXXXAGPA-------VNGLSSAISP-------SPDVGQNPAAKPQ-------MNGPNDM 2279
                    +       V+GL  A+S        S + GQN     Q       M+  ND+
Sbjct: 697  QQNVTTSTSDITNVSSVSGLPGAVSSVISTPVRSNEPGQNQLGISQVSPQSASMSSQNDL 756

Query: 2278 GKIRMKPRDPRRVLHSNMVQKSESLGSELAKSNGDLPSEVQSSKDLLTVQEQGEQAQTNT 2099
            G+IRMKPRDPRR+LH+N+VQK+E + SE    NG      Q +   LT +E GEQAQ+N 
Sbjct: 757  GRIRMKPRDPRRILHNNIVQKNEVVASEQNNINGATAGP-QGTMGHLTAREAGEQAQSNI 815

Query: 2098 LPSQSASLPDISQQFTKNLQNLADIVSSSQ-TSALPVGTQNSSQLIPSK--------ICN 1946
            LP+Q +  PD S++ TKNL     IVSS Q T+  P     +SQ I SK           
Sbjct: 816  LPTQFSPPPDRSEELTKNLPT---IVSSLQLTTTSPTIPHGNSQPISSKGNQMDVKLALA 872

Query: 1945 DTTEPKTVTGMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKM 1766
            +  +PKTV+ + +  E ++GV +  N WGDVDHLLDGY+D+QKAAIQ+ERARRI EQNKM
Sbjct: 873  EVNDPKTVSDVLS--ERSAGVSESTNLWGDVDHLLDGYNDEQKAAIQRERARRIVEQNKM 930

Query: 1765 FAERKXXXXXXXXXXXLNSAKFVEVDPIHEEILXXXXXXXXXXXXRHLFRFPHMGMWTKL 1586
            FA RK           LNSAKFVEVDP+HEE+L            RH++ F HMGMWTKL
Sbjct: 931  FAARKLCLVLDLDHTLLNSAKFVEVDPVHEEVLRRKEEQDREKPQRHIYCFQHMGMWTKL 990

Query: 1585 RPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDE 1406
            RPG+WNFLEKASKLYELHLYTMGNKLYATEMAKVLDP G+LF GRVIS+GD+GDP + DE
Sbjct: 991  RPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGSLFSGRVISRGDDGDPLNGDE 1050

Query: 1405 RVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLE 1226
            RVPKSKDLDGVLGMESAVVIIDDS+RVWPHNK NLIVVERY +FPSSRRQFGL GPSLLE
Sbjct: 1051 RVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTFFPSSRRQFGLLGPSLLE 1110

Query: 1225 IDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVF 1046
            IDHDERP+ GTLASSL VIER+HQ FFSH S+ + DVRNILA+EQRKIL GC+IVFSRVF
Sbjct: 1111 IDHDERPEDGTLASSLAVIERIHQNFFSHHSIKDADVRNILASEQRKILTGCRIVFSRVF 1170

Query: 1045 PVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWV 866
            PVGEANPHLHPLWQTAEQFGA CT+QIDE VTHVVANS GTDKVNWALSTGRFVVHPGWV
Sbjct: 1171 PVGEANPHLHPLWQTAEQFGAVCTSQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWV 1230

Query: 865  EASALLYRRANEQDFAIKG*T 803
            EASALLYRR NE DFA+K  T
Sbjct: 1231 EASALLYRRVNEHDFAVKAVT 1251


>ref|XP_015570573.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Ricinus communis]
          Length = 1315

 Score =  768 bits (1983), Expect = 0.0
 Identities = 425/774 (54%), Positives = 512/774 (66%), Gaps = 42/774 (5%)
 Frame = -1

Query: 3007 MSSVNSSGGLQMSSVNSSGAFQMAPLNTSGGSQMAP---VKTSAKSRDPRLRFMNSEVGG 2837
            ++S  S+  + +  ++ S    +  + ++  +  AP   VK SAKSRDPRLRF+NS+   
Sbjct: 541  LTSGQSNASISLPRMDGSSLPGVISIKSAVRASSAPSLTVKASAKSRDPRLRFVNSDSNA 600

Query: 2836 APQNGFA------------AGSVNSRKHKAMDEPVPDEHNLKRQRNESTRS---RDAQVM 2702
              QN  A             G++N ++ K +D+P+PD H+LKRQ+N    S   RD + M
Sbjct: 601  LDQNHRAVPVVNTLKVEPIGGTMNKKRQKIVDDPIPDGHSLKRQKNALENSGVVRDVKTM 660

Query: 2701 PGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRL---VXXXXXXXXXNG 2531
             G GGW+ED  MV  Q  N+ Q   N     R   GG V         V           
Sbjct: 661  VGSGGWLEDTDMVGPQTMNKNQLVDNAESDPRRKDGGGVCTSSSCISSVNISGTEQIPVT 720

Query: 2530 GSASNMSTSPAPV----VSLPSLLKDIAVNPTMLVELLKMXXXXXXXXXXXXXXAGPAVN 2363
            G++  +     PV     ++P LLK+IAVNPTML+ +LKM                PA +
Sbjct: 721  GTSVPIGGELVPVKGSTAAIPDLLKNIAVNPTMLINILKMGQQQRLALEAQQKPVDPAKS 780

Query: 2362 -----GLSSAISPSPDVG-------QNPA----AKPQMNGPNDMGKIRMKPRDPRRVLHS 2231
                   +S +   P VG         PA      PQ+   +D+GKIRMKPRDPRRVLH+
Sbjct: 781  TTYPLNSNSMLGTVPVVGAAHSGILPRPAGTVQVSPQLGTADDLGKIRMKPRDPRRVLHN 840

Query: 2230 NMVQKSESLGSELAKSNGDLPSEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFT 2051
            N +Q++ S+GSE  K+N       Q +KD   +Q+Q  Q +   +P QS +LPDIS  FT
Sbjct: 841  NALQRNGSMGSEHLKTNLTSIPINQETKDNQNLQKQEGQVEKKPVPLQSLALPDISMPFT 900

Query: 2050 KNLQNLADIVSSSQTS-ALPVGTQNSSQLIPSKICNDTTEPKTVTGMCTQGETASGVIDL 1874
            KNL+N+ADIVS S  S + P+  QN +        + + +   +         A+     
Sbjct: 901  KNLKNIADIVSVSHASTSQPLVPQNPASQPMRTTISSSDQFLGIGSAPGAAAAAAAGPRT 960

Query: 1873 ANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXXLNSAKFVE 1694
             N WGDV+HL +GY+DQQKAAIQ+ERARRIEEQ K+F+ RK           LNSAKFVE
Sbjct: 961  QNAWGDVEHLFEGYNDQQKAAIQRERARRIEEQKKLFSARKLCLVLDLDHTLLNSAKFVE 1020

Query: 1693 VDPIHEEILXXXXXXXXXXXXRHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGN 1514
            VDP+H+EIL            RHLFRFPHMGMWTKLRPG+WNFLEKASKLYELHLYTMGN
Sbjct: 1021 VDPVHDEILRKKEEQDREKAHRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGN 1080

Query: 1513 KLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMESAVVIIDDS 1334
            KLYATEMAKVLDP G LF+GRVIS+GD+G+PFD DER+PKSKDL+GVLGMES VVI+DDS
Sbjct: 1081 KLYATEMAKVLDPTGVLFNGRVISRGDDGEPFDGDERIPKSKDLEGVLGMESGVVIMDDS 1140

Query: 1333 LRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSLGVIERLHQ 1154
            +RVWPHNK NLIVVERY+YFP SRRQFGLPGPSLLEIDHDERP+ GTLA SL VIER+HQ
Sbjct: 1141 VRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEDGTLACSLAVIERIHQ 1200

Query: 1153 IFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTAEQFGAECT 974
             FF+H SL+E DVRNILA+EQRKIL GC+IVFSRVFPVGEANPHLHPLWQTAEQFGA CT
Sbjct: 1201 NFFTHPSLDEADVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCT 1260

Query: 973  NQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 812
            NQIDE VTHVVANS GTDKVNWALSTGRFVV+PGWVEASALLYRRANEQDFAIK
Sbjct: 1261 NQIDEQVTHVVANSLGTDKVNWALSTGRFVVYPGWVEASALLYRRANEQDFAIK 1314


>dbj|GAV71470.1| BRCT domain-containing protein/NIF domain-containing protein
            [Cephalotus follicularis]
          Length = 1228

 Score =  765 bits (1975), Expect = 0.0
 Identities = 443/796 (55%), Positives = 522/796 (65%), Gaps = 44/796 (5%)
 Frame = -1

Query: 3067 PVNSSV-GFQMAPFNSSGGPQMSSVNSSGGLQMSSVNSSGAFQMAPLNTSGGSQMAPVKT 2891
            PVN SV G  +     SG PQM + +  G     +   S A   +P  T  GS    +K 
Sbjct: 463  PVNHSVVGVPIV----SGSPQMDASSMEG----LTTTRSPAPVSSPAPTVSGSNPT-MKP 513

Query: 2890 SAKSRDPRLRFMNSEVG-------------GAPQNGFAAGSVNSRKHKAMDEPVPDEHNL 2750
            SAKSRDPRLR++NS+V               AP+       + SRK K +++P+ D   L
Sbjct: 514  SAKSRDPRLRYVNSDVSVLDLTQRPLHLVHNAPKV-----ELGSRKQKTVEDPILDGPAL 568

Query: 2749 KRQRNESTRSRDAQVMP---GRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGP 2579
            KRQ++ S  S    V+    G GGW+ED  MV +Q+ N     KN+ +  R +  G   P
Sbjct: 569  KRQKSGSENSGLIGVLKTTSGNGGWLEDTDMVGTQLLN-----KNVVLDPRKVDVGVTSP 623

Query: 2578 DRRLVXXXXXXXXXNGGSASNMSTSPAPVVSLPSLLKDIAVNPTMLVELLKMXXXXXXXX 2399
                          N G+   + TS +   SLP+LLKDIAVNPTML+ +LKM        
Sbjct: 624  S-------IVHCNTNVGNEPLLVTSSSSTASLPALLKDIAVNPTMLINILKMGQQQRLPA 676

Query: 2398 XXXXXXAG----PAVNGLSSAISPSPDVGQNPA-----------AKPQMNGPNDMGKIRM 2264
                        P  N L  A+        NP+             PQ +  +D GKIRM
Sbjct: 677  EVQQKSTDSLHPPTSNSLLGAVPSVNFASSNPSRILPKPAGTLPTTPQTSAMDDPGKIRM 736

Query: 2263 KPRDPRRVLHSNMVQKSESLGSELAKSNGDLPSEVQSSKDLLTVQEQGEQAQTNTLPSQS 2084
            KPRDPRRVLH N +Q+S SLGSE  K N  +PS     KD L  Q+   QA+T  +PS S
Sbjct: 737  KPRDPRRVLHGNALQRSGSLGSEKLKMN--VPSTSSFQKDNLNAQKLEGQAETKPMPSLS 794

Query: 2083 ASLPDISQQFTKNLQNLADIVSSSQTSALPVGTQNSSQLI---PSKICNDTTEPKTV--- 1922
               PDI++ FTKNL+N+ DI+S SQ     +G+ N +Q +   P++I  D  + K +   
Sbjct: 795  IPQPDITRLFTKNLKNINDIMSVSQPL---IGSPNVTQNLESQPAQIKADRVDVKAIVSN 851

Query: 1921 -----TGMCTQGET-ASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFA 1760
                 TG  +  E  A+G     + WGDV+HL +GYDDQQKAAIQ+ERARR+EEQNKMFA
Sbjct: 852  SEDPRTGTVSASEVGAAGPARPQHAWGDVEHLFEGYDDQQKAAIQRERARRLEEQNKMFA 911

Query: 1759 ERKXXXXXXXXXXXLNSAKFVEVDPIHEEILXXXXXXXXXXXXRHLFRFPHMGMWTKLRP 1580
              K           LNSAKFVEVDP+H+EIL            RHLFRFPHMGMWTKLRP
Sbjct: 912  AHKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKLHRHLFRFPHMGMWTKLRP 971

Query: 1579 GVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERV 1400
            G+WNFLE+ASKL+ELHLYTMGNKLYATEMAKVLDPKG LF GRVIS+GD+GDPFD DERV
Sbjct: 972  GIWNFLERASKLFELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERV 1031

Query: 1399 PKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEID 1220
            PKSKDL+GVLGMESAVVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGLPGPSLLEID
Sbjct: 1032 PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEID 1091

Query: 1219 HDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPV 1040
            HDERP+ GTLAS+L VIER+HQIFFS+Q L +VDVRNILA+EQ+KIL GC+I+FSRVFPV
Sbjct: 1092 HDERPEDGTLASALTVIERIHQIFFSYQPLGDVDVRNILASEQQKILDGCRILFSRVFPV 1151

Query: 1039 GEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEA 860
            GEANPHLHPLWQTAEQFGA CTNQIDE VTHVVANS GTDKVNWALSTGRFVV+PGWVEA
Sbjct: 1152 GEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVYPGWVEA 1211

Query: 859  SALLYRRANEQDFAIK 812
            SALLYRRANEQDF IK
Sbjct: 1212 SALLYRRANEQDFGIK 1227


>gb|KJB77193.1| hypothetical protein B456_012G125200 [Gossypium raimondii]
          Length = 982

 Score =  753 bits (1945), Expect = 0.0
 Identities = 432/784 (55%), Positives = 506/784 (64%), Gaps = 47/784 (5%)
 Frame = -1

Query: 3022 SGGPQMSSVNSSGGLQMSSVNSSGAFQMAPLNTSGGSQMAPVKTSAKSRDPRLRFMNSEV 2843
            S  P + S +S+  +Q      +      P+  S  S +   K SAKSRDPRLRF NS V
Sbjct: 209  SSAPHIDSASSTSSMQGQFTTQNAT----PVTVSSASNILS-KASAKSRDPRLRFANSNV 263

Query: 2842 GGAPQNGF----------AAGSVNSRKHKAMDEPVPDEHNLKRQRNESTRS--RDAQVMP 2699
                 N             +G ++ RK K+ +EPV D    KRQ+NE      RD Q + 
Sbjct: 264  SALDLNQRPLHNASKVPPVSGIMDPRKKKSTEEPVLDGPAPKRQKNELENFGVRDVQAVS 323

Query: 2698 GRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXNGGSAS 2519
            G GGW+ED     SQ+ NR Q  + +   +R +  G                       +
Sbjct: 324  GNGGWLEDTDNCESQITNRNQTMETLDSNSRKMEHGVTCSSTLSGKTNTTVNKNEQVPLT 383

Query: 2518 NMSTSPAPVVSLPSLLKDIAVNPTMLVELLKMXXXXXXXXXXXXXXAGPAVNGLSSAIS- 2342
             MS       SLP+LLKDIAVNPTML+ +LKM                P  N L    S 
Sbjct: 384  GMSNP-----SLPALLKDIAVNPTMLINILKMGQQQRLPSESQQKTPDPLKNTLYQPSSN 438

Query: 2341 ------------PSPDVGQNPAA------KPQMN--GP--NDMGKIRMKPRDPRRVLHSN 2228
                        PSP V   P++      KP  N  GP  ++  KIRMKPRDPRRVLH N
Sbjct: 439  PVLGVIPPANVIPSPSVNVVPSSSSGTLSKPAGNLQGPPLDESCKIRMKPRDPRRVLHGN 498

Query: 2227 MVQKSESLGSELAKSNGDLP-SEVQSSKDLLTVQEQGE-QAQTNTLPSQSASLPDISQQF 2054
            ++QKS S+G +  K+NG  P S  Q SKD +  Q+Q E Q +   +  Q    PDI+QQF
Sbjct: 499  VLQKSGSVGPDQLKTNGTSPASSTQGSKDNMNAQKQLENQIEAKPIQCQFVPPPDIAQQF 558

Query: 2053 TKNLQNLADIVSSSQTSA-LPVGTQNSSQLIPSKICNDTTEPKTVTGMCTQGETASGVID 1877
            T++L+N+A ++S  Q+ A LP  +QN     P ++ ++T +  T        +T +G   
Sbjct: 559  TQSLKNIAGMMSGPQSFAGLPAVSQNLVSQ-PIQVKSETADKNTKGSNSEDQQTGTGTAP 617

Query: 1876 LA---------NPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXX 1724
             A         N WGDV+HL + YDD+QKAAIQ+ERARRIEEQ KMFA RK         
Sbjct: 618  EAGVTCPPPSQNAWGDVEHLFEKYDDRQKAAIQRERARRIEEQKKMFAARKLCLVLDLDH 677

Query: 1723 XXLNSAKFVEVDPIHEEILXXXXXXXXXXXXRHLFRFPHMGMWTKLRPGVWNFLEKASKL 1544
              LNSAKF+EVDP+HEEIL            RHLFRF HMGMWTKLRPG+WNFLEKASKL
Sbjct: 678  TLLNSAKFIEVDPVHEEILRKKEEQDREKPQRHLFRFHHMGMWTKLRPGIWNFLEKASKL 737

Query: 1543 YELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGM 1364
            YELHLYTMGNKLYATEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGM
Sbjct: 738  YELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGM 797

Query: 1363 ESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLAS 1184
            ES+VVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDERP+ GTLAS
Sbjct: 798  ESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLAS 857

Query: 1183 SLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQ 1004
            SL VIER+HQ FFSHQ+L+++DVRNILA EQRKIL GC+IVFSRVFPVGEANPHLHPLWQ
Sbjct: 858  SLAVIERIHQNFFSHQNLDDLDVRNILATEQRKILSGCRIVFSRVFPVGEANPHLHPLWQ 917

Query: 1003 TAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQD 824
            TAEQFGA CTNQIDEHVTHVVANS GTDKVNWALSTG+FVVHPGWVEASALLYRRANE D
Sbjct: 918  TAEQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEHD 977

Query: 823  FAIK 812
            FAIK
Sbjct: 978  FAIK 981


>gb|KJB77192.1| hypothetical protein B456_012G125200 [Gossypium raimondii]
          Length = 1033

 Score =  753 bits (1945), Expect = 0.0
 Identities = 432/784 (55%), Positives = 506/784 (64%), Gaps = 47/784 (5%)
 Frame = -1

Query: 3022 SGGPQMSSVNSSGGLQMSSVNSSGAFQMAPLNTSGGSQMAPVKTSAKSRDPRLRFMNSEV 2843
            S  P + S +S+  +Q      +      P+  S  S +   K SAKSRDPRLRF NS V
Sbjct: 260  SSAPHIDSASSTSSMQGQFTTQNAT----PVTVSSASNILS-KASAKSRDPRLRFANSNV 314

Query: 2842 GGAPQNGF----------AAGSVNSRKHKAMDEPVPDEHNLKRQRNESTRS--RDAQVMP 2699
                 N             +G ++ RK K+ +EPV D    KRQ+NE      RD Q + 
Sbjct: 315  SALDLNQRPLHNASKVPPVSGIMDPRKKKSTEEPVLDGPAPKRQKNELENFGVRDVQAVS 374

Query: 2698 GRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPDRRLVXXXXXXXXXNGGSAS 2519
            G GGW+ED     SQ+ NR Q  + +   +R +  G                       +
Sbjct: 375  GNGGWLEDTDNCESQITNRNQTMETLDSNSRKMEHGVTCSSTLSGKTNTTVNKNEQVPLT 434

Query: 2518 NMSTSPAPVVSLPSLLKDIAVNPTMLVELLKMXXXXXXXXXXXXXXAGPAVNGLSSAIS- 2342
             MS       SLP+LLKDIAVNPTML+ +LKM                P  N L    S 
Sbjct: 435  GMSNP-----SLPALLKDIAVNPTMLINILKMGQQQRLPSESQQKTPDPLKNTLYQPSSN 489

Query: 2341 ------------PSPDVGQNPAA------KPQMN--GP--NDMGKIRMKPRDPRRVLHSN 2228
                        PSP V   P++      KP  N  GP  ++  KIRMKPRDPRRVLH N
Sbjct: 490  PVLGVIPPANVIPSPSVNVVPSSSSGTLSKPAGNLQGPPLDESCKIRMKPRDPRRVLHGN 549

Query: 2227 MVQKSESLGSELAKSNGDLP-SEVQSSKDLLTVQEQGE-QAQTNTLPSQSASLPDISQQF 2054
            ++QKS S+G +  K+NG  P S  Q SKD +  Q+Q E Q +   +  Q    PDI+QQF
Sbjct: 550  VLQKSGSVGPDQLKTNGTSPASSTQGSKDNMNAQKQLENQIEAKPIQCQFVPPPDIAQQF 609

Query: 2053 TKNLQNLADIVSSSQTSA-LPVGTQNSSQLIPSKICNDTTEPKTVTGMCTQGETASGVID 1877
            T++L+N+A ++S  Q+ A LP  +QN     P ++ ++T +  T        +T +G   
Sbjct: 610  TQSLKNIAGMMSGPQSFAGLPAVSQNLVSQ-PIQVKSETADKNTKGSNSEDQQTGTGTAP 668

Query: 1876 LA---------NPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXX 1724
             A         N WGDV+HL + YDD+QKAAIQ+ERARRIEEQ KMFA RK         
Sbjct: 669  EAGVTCPPPSQNAWGDVEHLFEKYDDRQKAAIQRERARRIEEQKKMFAARKLCLVLDLDH 728

Query: 1723 XXLNSAKFVEVDPIHEEILXXXXXXXXXXXXRHLFRFPHMGMWTKLRPGVWNFLEKASKL 1544
              LNSAKF+EVDP+HEEIL            RHLFRF HMGMWTKLRPG+WNFLEKASKL
Sbjct: 729  TLLNSAKFIEVDPVHEEILRKKEEQDREKPQRHLFRFHHMGMWTKLRPGIWNFLEKASKL 788

Query: 1543 YELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGM 1364
            YELHLYTMGNKLYATEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGM
Sbjct: 789  YELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGM 848

Query: 1363 ESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLAS 1184
            ES+VVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDERP+ GTLAS
Sbjct: 849  ESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLAS 908

Query: 1183 SLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQ 1004
            SL VIER+HQ FFSHQ+L+++DVRNILA EQRKIL GC+IVFSRVFPVGEANPHLHPLWQ
Sbjct: 909  SLAVIERIHQNFFSHQNLDDLDVRNILATEQRKILSGCRIVFSRVFPVGEANPHLHPLWQ 968

Query: 1003 TAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQD 824
            TAEQFGA CTNQIDEHVTHVVANS GTDKVNWALSTG+FVVHPGWVEASALLYRRANE D
Sbjct: 969  TAEQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEHD 1028

Query: 823  FAIK 812
            FAIK
Sbjct: 1029 FAIK 1032


>ref|XP_022737741.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Durio
            zibethinus]
          Length = 1274

 Score =  760 bits (1963), Expect = 0.0
 Identities = 436/782 (55%), Positives = 508/782 (64%), Gaps = 45/782 (5%)
 Frame = -1

Query: 3022 SGGPQMSSVNSSGGLQMSSVNSSGAFQMAPLNTSGGSQMAPVKTSAKSRDPRLRFMNSEV 2843
            S  PQ+ SV+SS       +    A Q A + T   +    +K+SAKSRDPRLRF NS  
Sbjct: 511  SSPPQVDSVSSS-------MQGKIATQNATVVTVSSASNIALKSSAKSRDPRLRFANSNA 563

Query: 2842 GGAPQNGF----------AAGSVNSRKHKAMDEPVPDEHNLKRQRNESTRS---RDAQVM 2702
                 N              G ++SRK K+++EPV D   LKRQR E   S   +D Q +
Sbjct: 564  SALDLNQQPLHNASKAVPVGGIMDSRKQKSIEEPVLDGPALKRQRKELENSGVVKDVQTV 623

Query: 2701 PGRGGWIEDNGMVASQMNNRVQPNKNM-----AVGNRNLVGGEVGPDRRLVXXXXXXXXX 2537
             G  GW+ED  ++ SQ+ NR Q  +N       + NR      +     +          
Sbjct: 624  SGNCGWLEDTDVIGSQVTNRNQIVENSDSNSWKMDNRVTCSSTLSGKTNMTVNRNEQVPM 683

Query: 2536 NGGSASNMSTSPAPVVSLPSLLKDIAVNPTMLVELLKMXXXXXXXXXXXXXXAGP----- 2372
             G     MST      SLP+LLKDIAVNPT+L+ +LKM                P     
Sbjct: 684  TG-----MSTP-----SLPALLKDIAVNPTVLINILKMGQQERLAAEILQKSPDPVKSTL 733

Query: 2371 ---AVNGLSSAISP-------SPDVGQNPAAKPQMNGPNDMGKIRMKPRDPRRVLHSNMV 2222
               + N +   ++P       S  +   PA   Q+  P++ G IRMKPRDPRRVLH N++
Sbjct: 734  HQPSSNSILGVVTPVNIVPSSSSGILSKPAGNLQVPPPDESGNIRMKPRDPRRVLHGNVL 793

Query: 2221 QKSESLGSELAKSNGDLP-SEVQSSKDLLTVQEQGEQAQTNTLPSQSASLPDISQQFTKN 2045
            Q+S  +G +  K+NG  P S    SKD L VQ+   Q ++  + SQ    PDI+QQFTKN
Sbjct: 794  QRSGIMGPDQVKTNGTTPTSSTLGSKDNLNVQKLEAQTESKPMQSQLVPAPDITQQFTKN 853

Query: 2044 LQNLADIVSSSQT-SALPVGTQNS-SQLIPSKICNDTTEPKTVTGMCTQGETASGVIDLA 1871
            L+N+ADI+S SQ  ++LP  +Q+  SQ +  K   D+ + KTV       +T +G    A
Sbjct: 854  LKNIADIMSVSQALTSLPAVSQSLVSQPVQHK--PDSMDMKTVVSSSEDQQTGTGSAPEA 911

Query: 1870 ---------NPWGDVDHLLDGYDDQQKAAIQKERARRIEEQNKMFAERKXXXXXXXXXXX 1718
                     N W DV+HL + YDDQQKAAIQKERARRIEEQ KMF   K           
Sbjct: 912  DARGPHCSQNTWDDVEHLFERYDDQQKAAIQKERARRIEEQKKMFDANKLCLVLDLDHTL 971

Query: 1717 LNSAKFVEVDPIHEEILXXXXXXXXXXXXRHLFRFPHMGMWTKLRPGVWNFLEKASKLYE 1538
            LNSAKF EVDP+HEEIL            RHLFRF HMGMWTKLRPG+WNFLEKASKLYE
Sbjct: 972  LNSAKFNEVDPVHEEILRKKEEQDREKPQRHLFRFQHMGMWTKLRPGIWNFLEKASKLYE 1031

Query: 1537 LHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGDEGDPFDSDERVPKSKDLDGVLGMES 1358
            LHLYTMGNKLYATEMAKVLDPKG LF GRVIS+GD+GDPFD DERVP+SKDL+GVLGMES
Sbjct: 1032 LHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMES 1091

Query: 1357 AVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQFGLPGPSLLEIDHDERPDAGTLASSL 1178
            AVVIIDDS+RVWPHNK NLIVVERY YFP SRRQFGL GPSLLEIDHDER D GTLASSL
Sbjct: 1092 AVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERLDDGTLASSL 1151

Query: 1177 GVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGGCKIVFSRVFPVGEANPHLHPLWQTA 998
             VIER+HQ FFSHQ+L++VDVRNILAAEQRKIL GC +VFSRVFPVGEANPHLHPLWQTA
Sbjct: 1152 AVIERIHQDFFSHQNLDDVDVRNILAAEQRKILAGCHVVFSRVFPVGEANPHLHPLWQTA 1211

Query: 997  EQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 818
            EQFGA CTNQIDEHVTHVVANS GTDKVNWALSTG+FVVHPGWVEAS LLYRRANE DFA
Sbjct: 1212 EQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASTLLYRRANELDFA 1271

Query: 817  IK 812
            IK
Sbjct: 1272 IK 1273


>dbj|BAT14211.1| Os11g0521900, partial [Oryza sativa Japonica Group]
          Length = 1226

 Score =  756 bits (1951), Expect = 0.0
 Identities = 434/807 (53%), Positives = 524/807 (64%), Gaps = 63/807 (7%)
 Frame = -1

Query: 3043 QMAPFNSSGGPQMSSVNSSGGLQMSSVNSSGAFQMAPL--------NTSGGSQMAPVKTS 2888
            +++ F++S    +  VN       +  ++S +F   P         + SG + +  +K +
Sbjct: 431  EVSSFSASNKIALPIVNQMPSRPSTVSSNSDSFAGGPPGYAKQIENSVSGSNHL--LKAT 488

Query: 2887 AKSRDPRLRFMNSEVGGA-------------PQNGFAAG---SVNSRKHKAMDEPVPDEH 2756
            AKSRDPRL+F+N + GG              P      G   S+NSRK+KA+DEP+ DE+
Sbjct: 489  AKSRDPRLKFLNRDTGGVADANRRVNFAEPNPSKDRTMGGGVSINSRKNKAVDEPMVDEN 548

Query: 2755 NLKRQRNESTRSRDAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPD 2576
             LKR R      RD Q   GRGGW +D G ++S  ++  QPN+N  +GN       +  D
Sbjct: 549  ALKRSRGVIGNLRDMQPT-GRGGWAKDGGNISSYSSDGFQPNQNTRLGNNTTGNHNIRTD 607

Query: 2575 RRLVXXXXXXXXXNGGSA---------SNMSTSPAPVVSLPSLLKDIAVNPTMLVELLKM 2423
              L          +G S          S   TS AP VSLP++LKDIAVNPTML++ ++M
Sbjct: 608  STLASNLNNTTNNSGTSPGIVQAPQTNSAPQTSSAPAVSLPAMLKDIAVNPTMLMQWIQM 667

Query: 2422 XXXXXXXXXXXXXXAGPAVNGLSSAISP-----------SPDVGQNPAAKPQ-------M 2297
                                G++S ++P           + +V   P+ +PQ       M
Sbjct: 668  EQQKMSASEPQQKVTASV--GMTSNVTPGMVLPLGNAPKTTEVAAVPSVRPQVPMQSAPM 725

Query: 2296 NGPNDMGKIRMKPRDPRRVLHSNMVQKSESL---GSELAKSNGDLPSEVQSSKDLLTVQE 2126
            +  ND G IRMKPRDPRR+LHSN+VQK++++   G E AKSNG  P + QSSKD L  Q+
Sbjct: 726  HSQNDTGVIRMKPRDPRRILHSNIVQKNDTVPPVGVEQAKSNGTAPPDSQSSKDHLLNQD 785

Query: 2125 Q-GEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQTSAL----PVG----TQNSS 1973
            Q  EQ Q   LPS   +    ++  T N    A+ VS+SQ +A     P G    T +S 
Sbjct: 786  QKAEQLQAIALPSLPVT--SSARPVTMN----ANPVSNSQLAATALMPPHGNTKQTSSSV 839

Query: 1972 QLIPSKICNDTTEPKTVTGMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERA 1793
                 ++     E        T   TA   +  A+P+GDVDHLLDGYDDQQKA IQKERA
Sbjct: 840  NKADPRLAAGQNESNDDAATSTGPVTAPDAVPPASPYGDVDHLLDGYDDQQKALIQKERA 899

Query: 1792 RRIEEQNKMFAERKXXXXXXXXXXXLNSAKFVEVDPIHEEILXXXXXXXXXXXXRHLFRF 1613
            RRI+EQ+KMFA RK           LNSAKF+EVD IH EIL            RHLF F
Sbjct: 900  RRIKEQHKMFAARKLCLVLDLDHTLLNSAKFIEVDHIHGEILRKKEEQDRERAERHLFCF 959

Query: 1612 PHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGD 1433
             HMGMWTKLRPG+WNFLEKASKLYELHLYTMGNK+YATEMAKVLDP GTLF GRVIS+GD
Sbjct: 960  NHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKVYATEMAKVLDPTGTLFAGRVISRGD 1019

Query: 1432 EGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQF 1253
            +GDPFDSDERVPKSKDLDGVLGMESAVVIIDDS+RVWPHNKHNLIVVERY YFP SRRQF
Sbjct: 1020 DGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKHNLIVVERYTYFPCSRRQF 1079

Query: 1252 GLPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGG 1073
            GLPGPSLLEID DERP+ GTLASSL VIER+H+ FFSH +LN+ DVR+ILA+EQ++ILGG
Sbjct: 1080 GLPGPSLLEIDRDERPEDGTLASSLAVIERIHKNFFSHPNLNDADVRSILASEQQRILGG 1139

Query: 1072 CKIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTG 893
            C+IVFSR+FPVGEANPH+HPLWQTAEQFGA CTNQID+ VTHVVANS GTDKVNWALSTG
Sbjct: 1140 CRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGTDKVNWALSTG 1199

Query: 892  RFVVHPGWVEASALLYRRANEQDFAIK 812
            RFVVHPGWVEASALLYRRA+E DFA+K
Sbjct: 1200 RFVVHPGWVEASALLYRRASELDFAVK 1226


>gb|EEE52187.1| hypothetical protein OsJ_34058 [Oryza sativa Japonica Group]
          Length = 1267

 Score =  756 bits (1951), Expect = 0.0
 Identities = 434/807 (53%), Positives = 524/807 (64%), Gaps = 63/807 (7%)
 Frame = -1

Query: 3043 QMAPFNSSGGPQMSSVNSSGGLQMSSVNSSGAFQMAPL--------NTSGGSQMAPVKTS 2888
            +++ F++S    +  VN       +  ++S +F   P         + SG + +  +K +
Sbjct: 472  EVSSFSASNKIALPIVNQMPSRPSTVSSNSDSFAGGPPGYAKQIENSVSGSNHL--LKAT 529

Query: 2887 AKSRDPRLRFMNSEVGGA-------------PQNGFAAG---SVNSRKHKAMDEPVPDEH 2756
            AKSRDPRL+F+N + GG              P      G   S+NSRK+KA+DEP+ DE+
Sbjct: 530  AKSRDPRLKFLNRDTGGVADANRRVNFAEPNPSKDRTMGGGVSINSRKNKAVDEPMVDEN 589

Query: 2755 NLKRQRNESTRSRDAQVMPGRGGWIEDNGMVASQMNNRVQPNKNMAVGNRNLVGGEVGPD 2576
             LKR R      RD Q   GRGGW +D G ++S  ++  QPN+N  +GN       +  D
Sbjct: 590  ALKRSRGVIGNLRDMQPT-GRGGWAKDGGNISSYSSDGFQPNQNTRLGNNTTGNHNIRTD 648

Query: 2575 RRLVXXXXXXXXXNGGSA---------SNMSTSPAPVVSLPSLLKDIAVNPTMLVELLKM 2423
              L          +G S          S   TS AP VSLP++LKDIAVNPTML++ ++M
Sbjct: 649  STLASNLNNTTNNSGTSPGIVQAPQTNSAPQTSSAPAVSLPAMLKDIAVNPTMLMQWIQM 708

Query: 2422 XXXXXXXXXXXXXXAGPAVNGLSSAISP-----------SPDVGQNPAAKPQ-------M 2297
                                G++S ++P           + +V   P+ +PQ       M
Sbjct: 709  EQQKMSASEPQQKVTASV--GMTSNVTPGMVLPLGNAPKTTEVAAVPSVRPQVPMQSAPM 766

Query: 2296 NGPNDMGKIRMKPRDPRRVLHSNMVQKSESL---GSELAKSNGDLPSEVQSSKDLLTVQE 2126
            +  ND G IRMKPRDPRR+LHSN+VQK++++   G E AKSNG  P + QSSKD L  Q+
Sbjct: 767  HSQNDTGVIRMKPRDPRRILHSNIVQKNDTVPPVGVEQAKSNGTAPPDSQSSKDHLLNQD 826

Query: 2125 Q-GEQAQTNTLPSQSASLPDISQQFTKNLQNLADIVSSSQTSAL----PVG----TQNSS 1973
            Q  EQ Q   LPS   +    ++  T N    A+ VS+SQ +A     P G    T +S 
Sbjct: 827  QKAEQLQAIALPSLPVT--SSARPVTMN----ANPVSNSQLAATALMPPHGNTKQTSSSV 880

Query: 1972 QLIPSKICNDTTEPKTVTGMCTQGETASGVIDLANPWGDVDHLLDGYDDQQKAAIQKERA 1793
                 ++     E        T   TA   +  A+P+GDVDHLLDGYDDQQKA IQKERA
Sbjct: 881  NKADPRLAAGQNESNDDAATSTGPVTAPDAVPPASPYGDVDHLLDGYDDQQKALIQKERA 940

Query: 1792 RRIEEQNKMFAERKXXXXXXXXXXXLNSAKFVEVDPIHEEILXXXXXXXXXXXXRHLFRF 1613
            RRI+EQ+KMFA RK           LNSAKF+EVD IH EIL            RHLF F
Sbjct: 941  RRIKEQHKMFAARKLCLVLDLDHTLLNSAKFIEVDHIHGEILRKKEEQDRERAERHLFCF 1000

Query: 1612 PHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGTLFHGRVISKGD 1433
             HMGMWTKLRPG+WNFLEKASKLYELHLYTMGNK+YATEMAKVLDP GTLF GRVIS+GD
Sbjct: 1001 NHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKVYATEMAKVLDPTGTLFAGRVISRGD 1060

Query: 1432 EGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSLRVWPHNKHNLIVVERYMYFPSSRRQF 1253
            +GDPFDSDERVPKSKDLDGVLGMESAVVIIDDS+RVWPHNKHNLIVVERY YFP SRRQF
Sbjct: 1061 DGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKHNLIVVERYTYFPCSRRQF 1120

Query: 1252 GLPGPSLLEIDHDERPDAGTLASSLGVIERLHQIFFSHQSLNEVDVRNILAAEQRKILGG 1073
            GLPGPSLLEID DERP+ GTLASSL VIER+H+ FFSH +LN+ DVR+ILA+EQ++ILGG
Sbjct: 1121 GLPGPSLLEIDRDERPEDGTLASSLAVIERIHKNFFSHPNLNDADVRSILASEQQRILGG 1180

Query: 1072 CKIVFSRVFPVGEANPHLHPLWQTAEQFGAECTNQIDEHVTHVVANSFGTDKVNWALSTG 893
            C+IVFSR+FPVGEANPH+HPLWQTAEQFGA CTNQID+ VTHVVANS GTDKVNWALSTG
Sbjct: 1181 CRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGTDKVNWALSTG 1240

Query: 892  RFVVHPGWVEASALLYRRANEQDFAIK 812
            RFVVHPGWVEASALLYRRA+E DFA+K
Sbjct: 1241 RFVVHPGWVEASALLYRRASELDFAVK 1267

