BLASTX nr result

ID: Cornus23_contig00007692 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00007692
         (2790 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010656786.1| PREDICTED: RNA polymerase II C-terminal doma...   953   0.0  
ref|XP_010656784.1| PREDICTED: RNA polymerase II C-terminal doma...   946   0.0  
ref|XP_010656789.1| PREDICTED: RNA polymerase II C-terminal doma...   944   0.0  
emb|CBI35661.3| unnamed protein product [Vitis vinifera]              898   0.0  
emb|CDP18969.1| unnamed protein product [Coffea canephora]            884   0.0  
ref|XP_007043830.1| RNA polymerase II C-terminal domain phosphat...   876   0.0  
ref|XP_002304648.2| hypothetical protein POPTR_0003s16280g [Popu...   873   0.0  
ref|XP_002304714.2| hypothetical protein POPTR_0003s16280g [Popu...   873   0.0  
ref|XP_011020855.1| PREDICTED: RNA polymerase II C-terminal doma...   864   0.0  
ref|XP_009803071.1| PREDICTED: RNA polymerase II C-terminal doma...   860   0.0  
ref|XP_009627456.1| PREDICTED: RNA polymerase II C-terminal doma...   854   0.0  
ref|XP_011036157.1| PREDICTED: RNA polymerase II C-terminal doma...   853   0.0  
ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative ...   852   0.0  
ref|XP_010249185.1| PREDICTED: RNA polymerase II C-terminal doma...   852   0.0  
ref|XP_012088736.1| PREDICTED: RNA polymerase II C-terminal doma...   851   0.0  
ref|XP_012459418.1| PREDICTED: RNA polymerase II C-terminal doma...   848   0.0  
gb|KJB77193.1| hypothetical protein B456_012G125200 [Gossypium r...   848   0.0  
gb|KJB77192.1| hypothetical protein B456_012G125200 [Gossypium r...   848   0.0  
ref|XP_012459417.1| PREDICTED: RNA polymerase II C-terminal doma...   848   0.0  
ref|XP_010100046.1| RNA polymerase II C-terminal domain phosphat...   843   0.0  

>ref|XP_010656786.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            isoform X2 [Vitis vinifera]
          Length = 1276

 Score =  953 bits (2463), Expect = 0.0
 Identities = 497/709 (70%), Positives = 549/709 (77%), Gaps = 2/709 (0%)
 Frame = -1

Query: 2790 LDLNQFPLLGVNTEPKVVNPLGGTIISKKQKIVEEPVLDGPEPKRQRTGLTDSGLARDVP 2611
            LDLN+ PL  V+  PKV +PLG  + S+KQK  EEP+LDGP  KRQR GLT     RD  
Sbjct: 583  LDLNERPLPAVSNSPKV-DPLGEIVSSRKQKSAEEPLLDGPVTKRQRNGLTSPATVRDAQ 641

Query: 2610 TSSGSGGWLEDKGTVRPLFINRNQGIEKMGSVTRKLGDEVTCSDATSSTPNVTTGGNEQL 2431
            T   SGGWLED  TV P  +NRNQ IE  G+  +KL  +VT +      P VT  GNE L
Sbjct: 642  TVVASGGWLEDSNTVIPQMMNRNQLIENTGTDPKKLESKVTVTGIGCDKPYVTVNGNEHL 701

Query: 2430 PVTVTSTTASLHSILKDITVNPSMLMNII-KMEQQKSVDPAKSAAQSLSSNSILGAVPLM 2254
            PV  TSTTASL S+LKDI VNP++ MNI  K+EQQKS DPAK+     +SNSILG VP  
Sbjct: 702  PVVATSTTASLQSLLKDIAVNPAVWMNIFNKVEQQKSGDPAKNTVLPPTSNSILGVVPPA 761

Query: 2253 NVAPSKPSELGQRSAGLLQTPQTASTNMQDELGKVRMKPRDPRRVLHNNPFLKPGSLGSD 2074
            +VAP KPS LGQ+ AG LQ PQT   N QDE GKVRMKPRDPRR+LH N F + GS GS+
Sbjct: 762  SVAPLKPSALGQKPAGALQVPQTGPMNPQDESGKVRMKPRDPRRILHANSFQRSGSSGSE 821

Query: 2073 QFKTNVAPIASSQGMKGNLYPQKQEDESDKNSVPSQSIAPPDIALQFTKNLKNIADIISV 1894
            QFKTN                QKQED+++  SVPS S+ PPDI+ QFTKNLKNIAD++S 
Sbjct: 822  QFKTNA---------------QKQEDQTETKSVPSHSVNPPDISQQFTKNLKNIADLMSA 866

Query: 1893 SLAP-MSPAISPNFPSQPVQVPPIRVDVKGVVPELGNLERTSSSTEEVAVDPSGSKNTWG 1717
            S A  M+P       SQ VQV   R+DVK  V + G+    + S  E A  P  SKNTWG
Sbjct: 867  SQASSMTPTFPQILSSQSVQVNTDRMDVKATVSDSGDQLTANGSKPESAAGPPQSKNTWG 926

Query: 1716 EVEHLFEGFDDQQKAAIQKERARRIAEQKKMFAARKXXXXXXXXXXXLNSAKFVEVDSVH 1537
            +VEHLF+G+DDQQKAAIQ+ERARRI EQKKMF+ARK           LNSAKFVEVD VH
Sbjct: 927  DVEHLFDGYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFVEVDPVH 986

Query: 1536 DEILRKKEEQDREKPQRHLFRFPYMGMWTKLRPGIWNFLEKASKLFELHLYTMGNKLYAT 1357
            DEILRKKEEQDREK QRHLFRFP+MGMWTKLRPGIWNFLEKASKL+ELHLYTMGNKLYAT
Sbjct: 987  DEILRKKEEQDREKSQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAT 1046

Query: 1356 EMAKLLDPKGVLFAGRVISRGDDGDLIDGDERVPKSKDLEGVLGMEXXXXXXXXXVKVWP 1177
            EMAK+LDPKGVLFAGRVIS+GDDGD++DGDERVPKSKDLEGVLGME         V+VWP
Sbjct: 1047 EMAKVLDPKGVLFAGRVISKGDDGDVLDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWP 1106

Query: 1176 HNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERSEDGTLASSLAVIERIHENFFSH 997
            HNKLNLIVVERY YFPCSRRQFGLPGPSLLEIDHDER EDGTLASSLAVIERIH++FFS+
Sbjct: 1107 HNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASSLAVIERIHQSFFSN 1166

Query: 996  RSLDEADVRNILSSEQRQILGGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCVNQIDE 817
            R+LDE DVRNIL+SEQR+IL GCRIVFSRVFPVGEANPHLHPLWQTAE FGAVC NQIDE
Sbjct: 1167 RALDEVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAESFGAVCTNQIDE 1226

Query: 816  QVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANELDFAIK 670
            QVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANE DFAIK
Sbjct: 1227 QVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1275


>ref|XP_010656784.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            isoform X1 [Vitis vinifera]
          Length = 1285

 Score =  946 bits (2445), Expect = 0.0
 Identities = 497/718 (69%), Positives = 550/718 (76%), Gaps = 11/718 (1%)
 Frame = -1

Query: 2790 LDLNQFPLLGVNTEPKVVNPLGGTIISKKQKIVEEPVLDGPEPKRQRTGLTDSGLARDVP 2611
            LDLN+ PL  V+  PKV +PLG  + S+KQK  EEP+LDGP  KRQR GLT     RD  
Sbjct: 583  LDLNERPLPAVSNSPKV-DPLGEIVSSRKQKSAEEPLLDGPVTKRQRNGLTSPATVRDAQ 641

Query: 2610 TSSGSGGWLEDKGTVRPLFINRNQGIEKMGSVTRKLGDEVTCSDATSSTPNVTTGGNEQL 2431
            T   SGGWLED  TV P  +NRNQ IE  G+  +KL  +VT +      P VT  GNE L
Sbjct: 642  TVVASGGWLEDSNTVIPQMMNRNQLIENTGTDPKKLESKVTVTGIGCDKPYVTVNGNEHL 701

Query: 2430 PVTVTSTTASLHSILKDITVNPSMLMNII-KMEQQKSVDPAKSAAQSLSSNSILGAVPLM 2254
            PV  TSTTASL S+LKDI VNP++ MNI  K+EQQKS DPAK+     +SNSILG VP  
Sbjct: 702  PVVATSTTASLQSLLKDIAVNPAVWMNIFNKVEQQKSGDPAKNTVLPPTSNSILGVVPPA 761

Query: 2253 NVAPSKPSELGQRSAGLLQTPQTA---------STNMQDELGKVRMKPRDPRRVLHNNPF 2101
            +VAP KPS LGQ+ AG LQ PQT          + N QDE GKVRMKPRDPRR+LH N F
Sbjct: 762  SVAPLKPSALGQKPAGALQVPQTGPMLVTSCNNAQNPQDESGKVRMKPRDPRRILHANSF 821

Query: 2100 LKPGSLGSDQFKTNVAPIASSQGMKGNLYPQKQEDESDKNSVPSQSIAPPDIALQFTKNL 1921
             + GS GS+QFKTN                QKQED+++  SVPS S+ PPDI+ QFTKNL
Sbjct: 822  QRSGSSGSEQFKTNA---------------QKQEDQTETKSVPSHSVNPPDISQQFTKNL 866

Query: 1920 KNIADIISVSLAP-MSPAISPNFPSQPVQVPPIRVDVKGVVPELGNLERTSSSTEEVAVD 1744
            KNIAD++S S A  M+P       SQ VQV   R+DVK  V + G+    + S  E A  
Sbjct: 867  KNIADLMSASQASSMTPTFPQILSSQSVQVNTDRMDVKATVSDSGDQLTANGSKPESAAG 926

Query: 1743 PSGSKNTWGEVEHLFEGFDDQQKAAIQKERARRIAEQKKMFAARKXXXXXXXXXXXLNSA 1564
            P  SKNTWG+VEHLF+G+DDQQKAAIQ+ERARRI EQKKMF+ARK           LNSA
Sbjct: 927  PPQSKNTWGDVEHLFDGYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTLLNSA 986

Query: 1563 KFVEVDSVHDEILRKKEEQDREKPQRHLFRFPYMGMWTKLRPGIWNFLEKASKLFELHLY 1384
            KFVEVD VHDEILRKKEEQDREK QRHLFRFP+MGMWTKLRPGIWNFLEKASKL+ELHLY
Sbjct: 987  KFVEVDPVHDEILRKKEEQDREKSQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLY 1046

Query: 1383 TMGNKLYATEMAKLLDPKGVLFAGRVISRGDDGDLIDGDERVPKSKDLEGVLGMEXXXXX 1204
            TMGNKLYATEMAK+LDPKGVLFAGRVIS+GDDGD++DGDERVPKSKDLEGVLGME     
Sbjct: 1047 TMGNKLYATEMAKVLDPKGVLFAGRVISKGDDGDVLDGDERVPKSKDLEGVLGMESAVVI 1106

Query: 1203 XXXXVKVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERSEDGTLASSLAVIE 1024
                V+VWPHNKLNLIVVERY YFPCSRRQFGLPGPSLLEIDHDER EDGTLASSLAVIE
Sbjct: 1107 IDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASSLAVIE 1166

Query: 1023 RIHENFFSHRSLDEADVRNILSSEQRQILGGCRIVFSRVFPVGEANPHLHPLWQTAEQFG 844
            RIH++FFS+R+LDE DVRNIL+SEQR+IL GCRIVFSRVFPVGEANPHLHPLWQTAE FG
Sbjct: 1167 RIHQSFFSNRALDEVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAESFG 1226

Query: 843  AVCVNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANELDFAIK 670
            AVC NQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANE DFAIK
Sbjct: 1227 AVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1284


>ref|XP_010656789.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            isoform X3 [Vitis vinifera]
          Length = 1273

 Score =  944 bits (2441), Expect = 0.0
 Identities = 495/709 (69%), Positives = 547/709 (77%), Gaps = 2/709 (0%)
 Frame = -1

Query: 2790 LDLNQFPLLGVNTEPKVVNPLGGTIISKKQKIVEEPVLDGPEPKRQRTGLTDSGLARDVP 2611
            LDLN+ PL  V+  PKV +PLG  + S+KQK  EEP+LDGP  KRQR GLT     RD  
Sbjct: 583  LDLNERPLPAVSNSPKV-DPLGEIVSSRKQKSAEEPLLDGPVTKRQRNGLTSPATVRDAQ 641

Query: 2610 TSSGSGGWLEDKGTVRPLFINRNQGIEKMGSVTRKLGDEVTCSDATSSTPNVTTGGNEQL 2431
            T   SGGWLED  TV P  +NRNQ IE  G+  +KL  +VT +      P VT  GNE L
Sbjct: 642  TVVASGGWLEDSNTVIPQMMNRNQLIENTGTDPKKLESKVTVTGIGCDKPYVTVNGNEHL 701

Query: 2430 PVTVTSTTASLHSILKDITVNPSMLMNII-KMEQQKSVDPAKSAAQSLSSNSILGAVPLM 2254
            PV  TSTTASL S+LKDI VNP++ MNI  K+EQQKS DPAK+     +SNSILG VP  
Sbjct: 702  PVVATSTTASLQSLLKDIAVNPAVWMNIFNKVEQQKSGDPAKNTVLPPTSNSILGVVPPA 761

Query: 2253 NVAPSKPSELGQRSAGLLQTPQTASTNMQDELGKVRMKPRDPRRVLHNNPFLKPGSLGSD 2074
            +VAP KPS LGQ+ AG LQ PQT      DE GKVRMKPRDPRR+LH N F + GS GS+
Sbjct: 762  SVAPLKPSALGQKPAGALQVPQTGP---MDESGKVRMKPRDPRRILHANSFQRSGSSGSE 818

Query: 2073 QFKTNVAPIASSQGMKGNLYPQKQEDESDKNSVPSQSIAPPDIALQFTKNLKNIADIISV 1894
            QFKTN                QKQED+++  SVPS S+ PPDI+ QFTKNLKNIAD++S 
Sbjct: 819  QFKTNA---------------QKQEDQTETKSVPSHSVNPPDISQQFTKNLKNIADLMSA 863

Query: 1893 SLAP-MSPAISPNFPSQPVQVPPIRVDVKGVVPELGNLERTSSSTEEVAVDPSGSKNTWG 1717
            S A  M+P       SQ VQV   R+DVK  V + G+    + S  E A  P  SKNTWG
Sbjct: 864  SQASSMTPTFPQILSSQSVQVNTDRMDVKATVSDSGDQLTANGSKPESAAGPPQSKNTWG 923

Query: 1716 EVEHLFEGFDDQQKAAIQKERARRIAEQKKMFAARKXXXXXXXXXXXLNSAKFVEVDSVH 1537
            +VEHLF+G+DDQQKAAIQ+ERARRI EQKKMF+ARK           LNSAKFVEVD VH
Sbjct: 924  DVEHLFDGYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFVEVDPVH 983

Query: 1536 DEILRKKEEQDREKPQRHLFRFPYMGMWTKLRPGIWNFLEKASKLFELHLYTMGNKLYAT 1357
            DEILRKKEEQDREK QRHLFRFP+MGMWTKLRPGIWNFLEKASKL+ELHLYTMGNKLYAT
Sbjct: 984  DEILRKKEEQDREKSQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAT 1043

Query: 1356 EMAKLLDPKGVLFAGRVISRGDDGDLIDGDERVPKSKDLEGVLGMEXXXXXXXXXVKVWP 1177
            EMAK+LDPKGVLFAGRVIS+GDDGD++DGDERVPKSKDLEGVLGME         V+VWP
Sbjct: 1044 EMAKVLDPKGVLFAGRVISKGDDGDVLDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWP 1103

Query: 1176 HNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERSEDGTLASSLAVIERIHENFFSH 997
            HNKLNLIVVERY YFPCSRRQFGLPGPSLLEIDHDER EDGTLASSLAVIERIH++FFS+
Sbjct: 1104 HNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASSLAVIERIHQSFFSN 1163

Query: 996  RSLDEADVRNILSSEQRQILGGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCVNQIDE 817
            R+LDE DVRNIL+SEQR+IL GCRIVFSRVFPVGEANPHLHPLWQTAE FGAVC NQIDE
Sbjct: 1164 RALDEVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAESFGAVCTNQIDE 1223

Query: 816  QVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANELDFAIK 670
            QVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANE DFAIK
Sbjct: 1224 QVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1272


>emb|CBI35661.3| unnamed protein product [Vitis vinifera]
          Length = 1184

 Score =  898 bits (2320), Expect = 0.0
 Identities = 478/709 (67%), Positives = 527/709 (74%), Gaps = 2/709 (0%)
 Frame = -1

Query: 2790 LDLNQFPLLGVNTEPKVVNPLGGTIISKKQKIVEEPVLDGPEPKRQRTGLTDSGLARDVP 2611
            LDLN+ PL  V+  PKV +PLG  + S+KQK  EEP+LDGP  KRQR GLT         
Sbjct: 530  LDLNERPLPAVSNSPKV-DPLGEIVSSRKQKSAEEPLLDGPVTKRQRNGLT--------- 579

Query: 2610 TSSGSGGWLEDKGTVRPLFINRNQGIEKMGSVTRKLGDEVTCSDATSSTPNVTTGGNEQL 2431
                                          S   KL  +VT +      P VT  GNE L
Sbjct: 580  ------------------------------SPATKLESKVTVTGIGCDKPYVTVNGNEHL 609

Query: 2430 PVTVTSTTASLHSILKDITVNPSMLMNII-KMEQQKSVDPAKSAAQSLSSNSILGAVPLM 2254
            PV  TSTTASL S+LKDI VNP++ MNI  K+EQQKS DPAK+     +SNSILG VP  
Sbjct: 610  PVVATSTTASLQSLLKDIAVNPAVWMNIFNKVEQQKSGDPAKNTVLPPTSNSILGVVPPA 669

Query: 2253 NVAPSKPSELGQRSAGLLQTPQTASTNMQDELGKVRMKPRDPRRVLHNNPFLKPGSLGSD 2074
            +VAP KPS LGQ+ AG LQ PQT   N QDE GKVRMKPRDPRR+LH N F + GS GS+
Sbjct: 670  SVAPLKPSALGQKPAGALQVPQTGPMNPQDESGKVRMKPRDPRRILHANSFQRSGSSGSE 729

Query: 2073 QFKTNVAPIASSQGMKGNLYPQKQEDESDKNSVPSQSIAPPDIALQFTKNLKNIADIISV 1894
            QFKTN                QKQED+++  SVPS S+ PPDI+ QFTKNLKNIAD++S 
Sbjct: 730  QFKTNA---------------QKQEDQTETKSVPSHSVNPPDISQQFTKNLKNIADLMSA 774

Query: 1893 SLAP-MSPAISPNFPSQPVQVPPIRVDVKGVVPELGNLERTSSSTEEVAVDPSGSKNTWG 1717
            S A  M+P       SQ VQV   R+DVK  V + G+    + S  E A  P  SKNTWG
Sbjct: 775  SQASSMTPTFPQILSSQSVQVNTDRMDVKATVSDSGDQLTANGSKPESAAGPPQSKNTWG 834

Query: 1716 EVEHLFEGFDDQQKAAIQKERARRIAEQKKMFAARKXXXXXXXXXXXLNSAKFVEVDSVH 1537
            +VEHLF+G+DDQQKAAIQ+ERARRI EQKKMF+ARK           LNSAKFVEVD VH
Sbjct: 835  DVEHLFDGYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFVEVDPVH 894

Query: 1536 DEILRKKEEQDREKPQRHLFRFPYMGMWTKLRPGIWNFLEKASKLFELHLYTMGNKLYAT 1357
            DEILRKKEEQDREK QRHLFRFP+MGMWTKLRPGIWNFLEKASKL+ELHLYTMGNKLYAT
Sbjct: 895  DEILRKKEEQDREKSQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAT 954

Query: 1356 EMAKLLDPKGVLFAGRVISRGDDGDLIDGDERVPKSKDLEGVLGMEXXXXXXXXXVKVWP 1177
            EMAK+LDPKGVLFAGRVIS+GDDGD++DGDERVPKSKDLEGVLGME         V+VWP
Sbjct: 955  EMAKVLDPKGVLFAGRVISKGDDGDVLDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWP 1014

Query: 1176 HNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERSEDGTLASSLAVIERIHENFFSH 997
            HNKLNLIVVERY YFPCSRRQFGLPGPSLLEIDHDER EDGTLASSLAVIERIH++FFS+
Sbjct: 1015 HNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASSLAVIERIHQSFFSN 1074

Query: 996  RSLDEADVRNILSSEQRQILGGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCVNQIDE 817
            R+LDE DVRNIL+SEQR+IL GCRIVFSRVFPVGEANPHLHPLWQTAE FGAVC NQIDE
Sbjct: 1075 RALDEVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAESFGAVCTNQIDE 1134

Query: 816  QVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANELDFAIK 670
            QVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANE DFAIK
Sbjct: 1135 QVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1183


>emb|CDP18969.1| unnamed protein product [Coffea canephora]
          Length = 1210

 Score =  884 bits (2283), Expect = 0.0
 Identities = 473/699 (67%), Positives = 531/699 (75%), Gaps = 2/699 (0%)
 Frame = -1

Query: 2760 VNTEPKVVNPLGGTIISKKQKIVEEPVLDGPEPKRQRTGLTDSGLARDVPTSSGSGGWLE 2581
            VN EPKV  P+GG I S+KQK +EE V+DGP  KRQR   TDS + + V T SG+GGWLE
Sbjct: 532  VNGEPKV-EPVGGMISSRKQKTIEEQVMDGPALKRQRNEQTDSSVVKSVQTVSGTGGWLE 590

Query: 2580 DKGTVRPLFINRNQGIEKMGSVTRKLGDEVTCSDATSSTPNVTTGGNEQLPVTVTSTTAS 2401
            D+GT      NR+  +   G+   +    VT   + SS  NVT  GN+ LP+T    TAS
Sbjct: 591  DRGTAGLGATNRSHALNSSGNDPMRPEYAVTPLSSGSSLANVTVNGNKNLPLTNPGATAS 650

Query: 2400 LHSILKDITVNPSMLMNIIKMEQQKSVDPAKSAAQSLSSNSILGAVPLMNVAPSKPSELG 2221
            LHS+LKDI VNPS+ MNIIKMEQQKS DP +S +Q   SNSI G+V   N   SKP +LG
Sbjct: 651  LHSLLKDIAVNPSIWMNIIKMEQQKSADPTRSTSQPTCSNSINGSV---NAVVSKPRDLG 707

Query: 2220 QRSAGLLQ-TPQTASTNMQDELGKVRMKPRDPRRVLHNNPFLKPGSLGSDQFKTNVAPIA 2044
            QR+AG  Q T QTAS     E GKVRMKPRDPRRVLHNN   K GS+  DQ +T  +  +
Sbjct: 708  QRAAGTFQVTSQTASVA---EPGKVRMKPRDPRRVLHNNTLQKGGSMEFDQSQTK-SSTS 763

Query: 2043 SSQGMKGNLYPQKQEDESDKNSVPSQSIAPPDIALQFTKNLKNIADIISVSLAPMS-PAI 1867
            S+  M GN+  Q Q+D+ D+  VPS SI  PDIA QFTKNLKNIADI+SVS A  S PA+
Sbjct: 764  SNPEMVGNINFQIQDDQLDRRVVPSNSIVQPDIAQQFTKNLKNIADIVSVSQATSSQPAL 823

Query: 1866 SPNFPSQPVQVPPIRVDVKGVVPELGNLERTSSSTEEVAVDPSGSKNTWGEVEHLFEGFD 1687
                 SQP Q    R +  G++           S++EV++  S  +N W +VEHLFEGFD
Sbjct: 824  PQISLSQPSQAYQGRTETIGMLESGKPQSGPGLSSKEVSMGSSRPQNNWDDVEHLFEGFD 883

Query: 1686 DQQKAAIQKERARRIAEQKKMFAARKXXXXXXXXXXXLNSAKFVEVDSVHDEILRKKEEQ 1507
            DQQKAAI +ERARR+ EQ+KMFA RK                FVEVD +HDEILRKKEEQ
Sbjct: 884  DQQKAAIHRERARRMQEQRKMFAGRKLCL-------------FVEVDPMHDEILRKKEEQ 930

Query: 1506 DREKPQRHLFRFPYMGMWTKLRPGIWNFLEKASKLFELHLYTMGNKLYATEMAKLLDPKG 1327
            DREKP RHLFRFP+MGMWTKLRPGIWNFLEKASKL+ELHLYTMGNKLYATEMAKLLDPKG
Sbjct: 931  DREKPHRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKG 990

Query: 1326 VLFAGRVISRGDDGDLIDGDERVPKSKDLEGVLGMEXXXXXXXXXVKVWPHNKLNLIVVE 1147
             LFAGRVISRGDDGDL+DGDERVPKSKDLEGV+GME         ++VWPHNKLNLIVVE
Sbjct: 991  ELFAGRVISRGDDGDLLDGDERVPKSKDLEGVMGMESSVVIIDDSLRVWPHNKLNLIVVE 1050

Query: 1146 RYIYFPCSRRQFGLPGPSLLEIDHDERSEDGTLASSLAVIERIHENFFSHRSLDEADVRN 967
            RYI+FPCSRRQFGLPGPSLLEIDHDERSEDGTLASSLAVIERIHE FF+H+SLDEADVRN
Sbjct: 1051 RYIFFPCSRRQFGLPGPSLLEIDHDERSEDGTLASSLAVIERIHEIFFAHQSLDEADVRN 1110

Query: 966  ILSSEQRQILGGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCVNQIDEQVTHVVANSL 787
            IL+SEQR+IL GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVC N IDEQVTHVVANSL
Sbjct: 1111 ILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNSIDEQVTHVVANSL 1170

Query: 786  GTDKVNWALSTGRFVVHPGWVEASALLYRRANELDFAIK 670
            GTDKVNWALS+GRFVVHPGWVEASALLYRRANE DFAIK
Sbjct: 1171 GTDKVNWALSSGRFVVHPGWVEASALLYRRANEKDFAIK 1209


>ref|XP_007043830.1| RNA polymerase II C-terminal domain phosphatase-like 3, putative
            [Theobroma cacao] gi|508707765|gb|EOX99661.1| RNA
            polymerase II C-terminal domain phosphatase-like 3,
            putative [Theobroma cacao]
          Length = 1290

 Score =  876 bits (2264), Expect = 0.0
 Identities = 467/725 (64%), Positives = 534/725 (73%), Gaps = 18/725 (2%)
 Frame = -1

Query: 2790 LDLNQFPLLGVNTEPKVVNPLGGTIISKKQKIVEEPVLDGPEPKRQRTGLTDSGLARDVP 2611
            LDLN+  L   +     V P+GG + S+K+K VEEP+LD P  KRQR  L + G+ARDV 
Sbjct: 576  LDLNERLLHNASK----VAPVGGIMDSRKKKSVEEPILDSPALKRQRNELENLGVARDVQ 631

Query: 2610 TSSGSGGWLEDKGTVRPLFINRNQGIEKMGSVTRKLGDEVTCSDATSSTPNVTTGGNEQL 2431
            T SG GGWLED   +     NRNQ  E + S +RK+ + VT S   S   N+T G NEQ+
Sbjct: 632  TVSGIGGWLEDTDAIGSQITNRNQTAENLESNSRKMDNGVTSSSTLSGKTNITVGTNEQV 691

Query: 2430 PVTVTSTTASLHSILKDITVNPSMLMNIIKM---------EQQKSVDPAKSAAQSLSSNS 2278
            PVT TST  SL ++LKDI VNP+ML+NI+KM          QQKS DP KS     SSNS
Sbjct: 692  PVTSTSTP-SLPALLKDIAVNPTMLINILKMGQQQRLGAEAQQKSPDPVKSTFHQPSSNS 750

Query: 2277 ILGAV--------PLMNVAPSKPSELGQRSAGLLQTPQTASTNMQDELGKVRMKPRDPRR 2122
            +LG V        P +N  PS  S +  + AG LQ P        DE GK+RMKPRDPRR
Sbjct: 751  LLGVVSSTNVIPSPSVNNVPSISSGISSKPAGNLQVPSP------DESGKIRMKPRDPRR 804

Query: 2121 VLHNNPFLKPGSLGSDQFKTNVAPIASSQGMKGNLYPQKQEDESDKNSVPSQSIAPPDIA 1942
            VLH N   + GS+G DQ KTN A  +S+QG K NL  QK + +++   + SQ + PPDI 
Sbjct: 805  VLHGNSLQRSGSMGLDQLKTNGALTSSTQGSKDNLNAQKLDSQTESKPMQSQLVPPPDIT 864

Query: 1941 LQFTKNLKNIADIISVSLAPMS-PAISPNFPSQPVQVPPIRVDVKGVVPELGNLERTSSS 1765
             QFT NLKNIADI+SVS A  S P +S N   QPV +    +D+K +V    + +  +  
Sbjct: 865  QQFTNNLKNIADIMSVSQALTSLPPVSHNLVPQPVLIKSDSMDMKALVSNSEDQQTGAGL 924

Query: 1764 TEEVAVDPSGSKNTWGEVEHLFEGFDDQQKAAIQKERARRIAEQKKMFAARKXXXXXXXX 1585
              E       S+N WG+VEHLFE +DDQQKAAIQ+ERARRI EQKKMF+ARK        
Sbjct: 925  APEAGATGPRSQNAWGDVEHLFERYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLD 984

Query: 1584 XXXLNSAKFVEVDSVHDEILRKKEEQDREKPQRHLFRFPYMGMWTKLRPGIWNFLEKASK 1405
               LNSAKF+EVD VH+EILRKKEEQDREKP+RHLFRF +MGMWTKLRPGIWNFLEKASK
Sbjct: 985  HTLLNSAKFIEVDPVHEEILRKKEEQDREKPERHLFRFHHMGMWTKLRPGIWNFLEKASK 1044

Query: 1404 LFELHLYTMGNKLYATEMAKLLDPKGVLFAGRVISRGDDGDLIDGDERVPKSKDLEGVLG 1225
            L+ELHLYTMGNKLYATEMAK+LDPKGVLFAGRVISRGDDGD  DGDERVP+SKDLEGVLG
Sbjct: 1045 LYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLG 1104

Query: 1224 MEXXXXXXXXXVKVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERSEDGTLA 1045
            ME         V+VWPHNKLNLIVVERY YFPCSRRQFGL GPSLLEIDHDER EDGTLA
Sbjct: 1105 MESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLA 1164

Query: 1044 SSLAVIERIHENFFSHRSLDEADVRNILSSEQRQILGGCRIVFSRVFPVGEANPHLHPLW 865
            SSLAVIERIH++FFSH++LD+ DVRNIL+SEQR+IL GCRIVFSRVFPVGEANPHLHPLW
Sbjct: 1165 SSLAVIERIHQDFFSHQNLDDVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLW 1224

Query: 864  QTAEQFGAVCVNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEL 685
            QTAEQFGAVC NQIDE VTHVVANSLGTDKVNWALSTG+FVVHPGWVEASALLYRRANE+
Sbjct: 1225 QTAEQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEV 1284

Query: 684  DFAIK 670
            DFAIK
Sbjct: 1285 DFAIK 1289


>ref|XP_002304648.2| hypothetical protein POPTR_0003s16280g [Populus trichocarpa]
            gi|550343308|gb|EEE79627.2| hypothetical protein
            POPTR_0003s16280g [Populus trichocarpa]
          Length = 1247

 Score =  873 bits (2256), Expect = 0.0
 Identities = 473/730 (64%), Positives = 536/730 (73%), Gaps = 23/730 (3%)
 Frame = -1

Query: 2790 LDLNQFPLLGVNTEPKVVNPLGGTIISKKQKIVEEPVLDGPEPKRQRTGLTDSGLARDVP 2611
            LD NQ  LL VN  P+   P G    S+KQKI EE VLDG   KRQR    + G+ RD+ 
Sbjct: 529  LDQNQRTLLMVNNPPRA-EPSGAIAGSRKQKI-EEDVLDGTSLKRQRNSFDNFGVVRDIR 586

Query: 2610 TSSGSGGWLEDKGTVRPLFINRNQGIEKMGSVTRKLGDEVTCSDATSSTPNVTTGGNEQL 2431
            + +G+GGWLED     P  +N+NQ  E      +++ + V C    S   +V+  GN Q+
Sbjct: 587  SMTGTGGWLEDTDMAEPQTVNKNQWAEN-AEPGQRINNGVVCPSTGSVMSSVSCSGNVQV 645

Query: 2430 PV-------------TVTSTTASLHSILKDITVNPSMLMNIIKMEQQ---------KSVD 2317
            PV               ++TTASL  +LKDITVNP+ML+NI+KM QQ         K  D
Sbjct: 646  PVMGINTIAGSEQAPVTSTTTASLPDLLKDITVNPTMLINILKMGQQQRLALDGQQKLAD 705

Query: 2316 PAKSAAQSLSSNSILGAVPLMNVAPSKPSELGQRSAGLLQTPQTASTNMQDELGKVRMKP 2137
            PAKS +   SSN++LGA+P +N   S PS +  RSAG  Q P   +T   DE GK+RMKP
Sbjct: 706  PAKSTSHPPSSNTVLGAIPEVNAVSSLPSGILPRSAGKAQGPSQIATT--DESGKIRMKP 763

Query: 2136 RDPRRVLHNNPFLKPGSLGSDQFKTNVAPIASSQGMKGNLYPQKQEDESDKNSVPSQSIA 1957
            RDPRRVLHNN   + GSLGS+QFKT     +++QG K N   QKQE  ++   V      
Sbjct: 764  RDPRRVLHNNALQRAGSLGSEQFKTTTLT-STTQGTKDNQNLQKQEGLAELKPV-----V 817

Query: 1956 PPDIALQFTKNLKNIADIISVSLAPMSPA-ISPNFPSQPVQVPPIRVDVKGVVPELGNLE 1780
            PPDI+  FTK+LKNIADI+SVS    +P  +S N  SQPVQ+   RVD K  +       
Sbjct: 818  PPDISSPFTKSLKNIADIVSVSQTCTTPPFVSQNVASQPVQIKSDRVDGKTGISNSDQKM 877

Query: 1779 RTSSSTEEVAVDPSGSKNTWGEVEHLFEGFDDQQKAAIQKERARRIAEQKKMFAARKXXX 1600
              +SS E VA   S S+NTW +VEHLFEG+DDQQKAAIQ+ERARRI EQKK+FAARK   
Sbjct: 878  GPASSPEVVAAS-SLSQNTWEDVEHLFEGYDDQQKAAIQRERARRIEEQKKLFAARKLCL 936

Query: 1599 XXXXXXXXLNSAKFVEVDSVHDEILRKKEEQDREKPQRHLFRFPYMGMWTKLRPGIWNFL 1420
                    LNSAKFVEVD VHDEILRKKEEQDREKP RHLFRFP+MGMWTKLRPGIWNFL
Sbjct: 937  VLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPYRHLFRFPHMGMWTKLRPGIWNFL 996

Query: 1419 EKASKLFELHLYTMGNKLYATEMAKLLDPKGVLFAGRVISRGDDGDLIDGDERVPKSKDL 1240
            EKASKL+ELHLYTMGNKLYATEMAK+LDPKGVLFAGRV+SRGDDGDL+DGDERVPKSKDL
Sbjct: 997  EKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVVSRGDDGDLLDGDERVPKSKDL 1056

Query: 1239 EGVLGMEXXXXXXXXXVKVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERSE 1060
            EGVLGME         ++VWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDER E
Sbjct: 1057 EGVLGMESGVVIIDDSLRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPE 1116

Query: 1059 DGTLASSLAVIERIHENFFSHRSLDEADVRNILSSEQRQILGGCRIVFSRVFPVGEANPH 880
            DGTLA SLAVIERIH+NFF+H SLDEADVRNIL+SEQR+IL GCRIVFSRVFPVGE NPH
Sbjct: 1117 DGTLACSLAVIERIHQNFFTHHSLDEADVRNILASEQRKILAGCRIVFSRVFPVGEVNPH 1176

Query: 879  LHPLWQTAEQFGAVCVNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYR 700
            LHPLWQ+AEQFGAVC NQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYR
Sbjct: 1177 LHPLWQSAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYR 1236

Query: 699  RANELDFAIK 670
            RANE DFAIK
Sbjct: 1237 RANEQDFAIK 1246


>ref|XP_002304714.2| hypothetical protein POPTR_0003s16280g [Populus trichocarpa]
            gi|550343307|gb|EEE79693.2| hypothetical protein
            POPTR_0003s16280g [Populus trichocarpa]
          Length = 1030

 Score =  873 bits (2256), Expect = 0.0
 Identities = 473/730 (64%), Positives = 536/730 (73%), Gaps = 23/730 (3%)
 Frame = -1

Query: 2790 LDLNQFPLLGVNTEPKVVNPLGGTIISKKQKIVEEPVLDGPEPKRQRTGLTDSGLARDVP 2611
            LD NQ  LL VN  P+   P G    S+KQKI EE VLDG   KRQR    + G+ RD+ 
Sbjct: 312  LDQNQRTLLMVNNPPRA-EPSGAIAGSRKQKI-EEDVLDGTSLKRQRNSFDNFGVVRDIR 369

Query: 2610 TSSGSGGWLEDKGTVRPLFINRNQGIEKMGSVTRKLGDEVTCSDATSSTPNVTTGGNEQL 2431
            + +G+GGWLED     P  +N+NQ  E      +++ + V C    S   +V+  GN Q+
Sbjct: 370  SMTGTGGWLEDTDMAEPQTVNKNQWAEN-AEPGQRINNGVVCPSTGSVMSSVSCSGNVQV 428

Query: 2430 PV-------------TVTSTTASLHSILKDITVNPSMLMNIIKMEQQ---------KSVD 2317
            PV               ++TTASL  +LKDITVNP+ML+NI+KM QQ         K  D
Sbjct: 429  PVMGINTIAGSEQAPVTSTTTASLPDLLKDITVNPTMLINILKMGQQQRLALDGQQKLAD 488

Query: 2316 PAKSAAQSLSSNSILGAVPLMNVAPSKPSELGQRSAGLLQTPQTASTNMQDELGKVRMKP 2137
            PAKS +   SSN++LGA+P +N   S PS +  RSAG  Q P   +T   DE GK+RMKP
Sbjct: 489  PAKSTSHPPSSNTVLGAIPEVNAVSSLPSGILPRSAGKAQGPSQIATT--DESGKIRMKP 546

Query: 2136 RDPRRVLHNNPFLKPGSLGSDQFKTNVAPIASSQGMKGNLYPQKQEDESDKNSVPSQSIA 1957
            RDPRRVLHNN   + GSLGS+QFKT     +++QG K N   QKQE  ++   V      
Sbjct: 547  RDPRRVLHNNALQRAGSLGSEQFKTTTLT-STTQGTKDNQNLQKQEGLAELKPV-----V 600

Query: 1956 PPDIALQFTKNLKNIADIISVSLAPMSPA-ISPNFPSQPVQVPPIRVDVKGVVPELGNLE 1780
            PPDI+  FTK+LKNIADI+SVS    +P  +S N  SQPVQ+   RVD K  +       
Sbjct: 601  PPDISSPFTKSLKNIADIVSVSQTCTTPPFVSQNVASQPVQIKSDRVDGKTGISNSDQKM 660

Query: 1779 RTSSSTEEVAVDPSGSKNTWGEVEHLFEGFDDQQKAAIQKERARRIAEQKKMFAARKXXX 1600
              +SS E VA   S S+NTW +VEHLFEG+DDQQKAAIQ+ERARRI EQKK+FAARK   
Sbjct: 661  GPASSPEVVAAS-SLSQNTWEDVEHLFEGYDDQQKAAIQRERARRIEEQKKLFAARKLCL 719

Query: 1599 XXXXXXXXLNSAKFVEVDSVHDEILRKKEEQDREKPQRHLFRFPYMGMWTKLRPGIWNFL 1420
                    LNSAKFVEVD VHDEILRKKEEQDREKP RHLFRFP+MGMWTKLRPGIWNFL
Sbjct: 720  VLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPYRHLFRFPHMGMWTKLRPGIWNFL 779

Query: 1419 EKASKLFELHLYTMGNKLYATEMAKLLDPKGVLFAGRVISRGDDGDLIDGDERVPKSKDL 1240
            EKASKL+ELHLYTMGNKLYATEMAK+LDPKGVLFAGRV+SRGDDGDL+DGDERVPKSKDL
Sbjct: 780  EKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVVSRGDDGDLLDGDERVPKSKDL 839

Query: 1239 EGVLGMEXXXXXXXXXVKVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERSE 1060
            EGVLGME         ++VWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDER E
Sbjct: 840  EGVLGMESGVVIIDDSLRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPE 899

Query: 1059 DGTLASSLAVIERIHENFFSHRSLDEADVRNILSSEQRQILGGCRIVFSRVFPVGEANPH 880
            DGTLA SLAVIERIH+NFF+H SLDEADVRNIL+SEQR+IL GCRIVFSRVFPVGE NPH
Sbjct: 900  DGTLACSLAVIERIHQNFFTHHSLDEADVRNILASEQRKILAGCRIVFSRVFPVGEVNPH 959

Query: 879  LHPLWQTAEQFGAVCVNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYR 700
            LHPLWQ+AEQFGAVC NQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYR
Sbjct: 960  LHPLWQSAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYR 1019

Query: 699  RANELDFAIK 670
            RANE DFAIK
Sbjct: 1020 RANEQDFAIK 1029


>ref|XP_011020855.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Populus euphratica]
          Length = 1271

 Score =  864 bits (2233), Expect = 0.0
 Identities = 470/730 (64%), Positives = 532/730 (72%), Gaps = 23/730 (3%)
 Frame = -1

Query: 2790 LDLNQFPLLGVNTEPKVVNPLGGTIISKKQKIVEEPVLDGPEPKRQRTGLTDSGLARDVP 2611
            LD NQ  LL VN  P+   P G    S+KQKI EE VLDG   KRQR    + G  RD+ 
Sbjct: 553  LDQNQRTLLMVNNPPRA-EPSGAIAGSRKQKI-EEDVLDGTSLKRQRNSFDNFGGVRDIR 610

Query: 2610 TSSGSGGWLEDKGTVRPLFINRNQGIEKMGSVTRKLGDEVTCSDATSSTPNVTTGGNEQL 2431
            + +G+GGWLED     P  +N+NQ  E      +++ + V      S   NV   GN Q+
Sbjct: 611  SMTGTGGWLEDTDMAEPQTVNKNQRAEN-AEPGQRINNGVVRPSTGSVMSNVNCSGNVQV 669

Query: 2430 PV-------------TVTSTTASLHSILKDITVNPSMLMNIIKMEQQ---------KSVD 2317
            PV               ++TTASL  +LKDITVNP++L+NI+KM QQ         K  D
Sbjct: 670  PVMGINTVAGSEQAPVTSTTTASLPDLLKDITVNPTLLINILKMGQQQRLALDGQQKLAD 729

Query: 2316 PAKSAAQSLSSNSILGAVPLMNVAPSKPSELGQRSAGLLQTPQTASTNMQDELGKVRMKP 2137
            PAKS +   SS+S+ GA P +N   S+PS +  RSAG  Q P   +T   DE GK+RMKP
Sbjct: 730  PAKSTSHPPSSSSVPGATPEVNAVSSQPSGILPRSAGKAQVPSQVATT--DESGKIRMKP 787

Query: 2136 RDPRRVLHNNPFLKPGSLGSDQFKTNVAPIASSQGMKGNLYPQKQEDESDKNSVPSQSIA 1957
            RDPRRVLHNN   + GSLGS+QFKT     +++QG K N   QKQE  ++ N V      
Sbjct: 788  RDPRRVLHNNALQRAGSLGSEQFKTTTLT-STTQGTKDNQNLQKQEGLAELNPV-----V 841

Query: 1956 PPDIALQFTKNLKNIADIISVSLAPMSPA-ISPNFPSQPVQVPPIRVDVKGVVPELGNLE 1780
            PPDI+  FTK+L+NIADI+SVS    +P  +S N  SQPVQ+   RVD K          
Sbjct: 842  PPDISSSFTKSLQNIADIVSVSQTCTTPPFVSQNVASQPVQIKSDRVDGKTGTSNSDQKM 901

Query: 1779 RTSSSTEEVAVDPSGSKNTWGEVEHLFEGFDDQQKAAIQKERARRIAEQKKMFAARKXXX 1600
              +SS E VA   S S+NTW +VEHLFEG+DDQQKAAIQ+ERARRI EQKK+FAARK   
Sbjct: 902  GPASSPEVVAAS-SLSQNTWEDVEHLFEGYDDQQKAAIQRERARRIEEQKKLFAARKLCL 960

Query: 1599 XXXXXXXXLNSAKFVEVDSVHDEILRKKEEQDREKPQRHLFRFPYMGMWTKLRPGIWNFL 1420
                    LNSAKFVEVD VHDEILRKKEEQDREKP RHLFRFP+MGMWTKLRPGIWNFL
Sbjct: 961  VLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPYRHLFRFPHMGMWTKLRPGIWNFL 1020

Query: 1419 EKASKLFELHLYTMGNKLYATEMAKLLDPKGVLFAGRVISRGDDGDLIDGDERVPKSKDL 1240
            EKASKL+ELHLYTMGNKLYATEMAK+LDPKGVLFAGRV+SRGDDGDL+DGDERVPKSKDL
Sbjct: 1021 EKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVVSRGDDGDLLDGDERVPKSKDL 1080

Query: 1239 EGVLGMEXXXXXXXXXVKVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERSE 1060
            EGVLGME         ++VWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHD+R E
Sbjct: 1081 EGVLGMESGVVIIDDSLRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDQRPE 1140

Query: 1059 DGTLASSLAVIERIHENFFSHRSLDEADVRNILSSEQRQILGGCRIVFSRVFPVGEANPH 880
            DGTLA SLAVIERIH+NFF+H SLDEADVRNILSSEQR+IL GCR+VFSRVFPVGE NPH
Sbjct: 1141 DGTLACSLAVIERIHQNFFTHHSLDEADVRNILSSEQRKILAGCRVVFSRVFPVGEVNPH 1200

Query: 879  LHPLWQTAEQFGAVCVNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYR 700
            LHPLWQTAEQFGAVC NQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYR
Sbjct: 1201 LHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYR 1260

Query: 699  RANELDFAIK 670
            RANE +FAIK
Sbjct: 1261 RANEQEFAIK 1270


>ref|XP_009803071.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Nicotiana sylvestris] gi|698516385|ref|XP_009803072.1|
            PREDICTED: RNA polymerase II C-terminal domain
            phosphatase-like 3 [Nicotiana sylvestris]
          Length = 1241

 Score =  860 bits (2222), Expect = 0.0
 Identities = 457/686 (66%), Positives = 524/686 (76%), Gaps = 5/686 (0%)
 Frame = -1

Query: 2712 SKKQKIVEEPVLDGPEPKRQRTGLTDSGLARDVPTSSGSGGWLEDKGTVRPLFINRNQGI 2533
            S+KQKIVE+P  D P  KRQR+  TDS +  DV  S+G+GGWLE +GTV     + N   
Sbjct: 565  SRKQKIVEQPAFDAPLLKRQRSEQTDSIIVSDVRPSTGNGGWLEHRGTVGLPITSSNYVT 624

Query: 2532 EKMGSVTRKLGDEVTCSDATSST-PNVTTGGNEQLPVTVTSTTASLHSILKDITVNPSML 2356
            +   + TRKL ++VT S +TS+T P+V    +  LP+T TS  A+LHS+LKDI +NPS+ 
Sbjct: 625  DSSDNDTRKL-EQVTSSVSTSNTIPSVIVNADVNLPLTGTS--ANLHSLLKDIAINPSIW 681

Query: 2355 MNIIKMEQQKSVDPAKSAAQSLSSNSILGAVPLMNVAPSKPSELGQRSAGLLQTPQTAST 2176
            MNIIK+EQQKS D +K+   + SS+SILGAVP  NVA  K S +GQRS G++QTP    T
Sbjct: 682  MNIIKLEQQKSADASKTTTVASSSSSILGAVPSTNVAAPKSSVIGQRSVGIIQTP--TQT 739

Query: 2175 NMQDELGKVRMKPRDPRRVLHNNPFLKPGSLGS-DQFKTNVAPIASSQGMKGNLYPQKQE 1999
               DE+ KVRMKPRDPRRVLHN    K G+ GS DQ KT VA    +Q M  +   Q+ E
Sbjct: 740  TAADEVAKVRMKPRDPRRVLHNTAVQKSGNSGSADQCKTGVA---GTQAMISSHCVQRPE 796

Query: 1998 DESDKNSVPSQSIAPPDIALQFTKNLKNIADIISVSLAPMSPAISPNFPSQPVQVPPIRV 1819
            D+ D+ S    S  PPDIA QFTKNLKNIAD+ISVS    SP+ +   P+Q +QV P R+
Sbjct: 797  DQLDRKSAVIPSTTPPDIARQFTKNLKNIADMISVSPTSTSPSAASQTPAQHMQVHPSRL 856

Query: 1818 DVKGVVPELGNLERTSSSTEEVAVDPSGS---KNTWGEVEHLFEGFDDQQKAAIQKERAR 1648
            +  G V E   L   +      A  P GS   +++WG VEHLFEG+ DQQ+A+IQ+ER R
Sbjct: 857  EGNGAVSESSELLTDAGLASGKA--PPGSLQLQSSWGNVEHLFEGYSDQQRASIQRERTR 914

Query: 1647 RIAEQKKMFAARKXXXXXXXXXXXLNSAKFVEVDSVHDEILRKKEEQDREKPQRHLFRFP 1468
            R+ EQKKMF+ RK           LNSAKFVE+D VH EILRKKEEQDREKP +HLFRFP
Sbjct: 915  RLEEQKKMFSVRKLCLVLDLDHTLLNSAKFVEIDPVHQEILRKKEEQDREKPYKHLFRFP 974

Query: 1467 YMGMWTKLRPGIWNFLEKASKLFELHLYTMGNKLYATEMAKLLDPKGVLFAGRVISRGDD 1288
            +MGMWTKLRPGIWNFLEKASKLFELHLYTMGNKLYATEMAKLLDPKG LFAGRVISRGDD
Sbjct: 975  HMGMWTKLRPGIWNFLEKASKLFELHLYTMGNKLYATEMAKLLDPKGDLFAGRVISRGDD 1034

Query: 1287 GDLIDGDERVPKSKDLEGVLGMEXXXXXXXXXVKVWPHNKLNLIVVERYIYFPCSRRQFG 1108
            GD +DGDER+PKSKDLEGVLGME         V+VWPHNKLNLIVVERYIYFPCSRRQFG
Sbjct: 1035 GDPLDGDERIPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFG 1094

Query: 1107 LPGPSLLEIDHDERSEDGTLASSLAVIERIHENFFSHRSLDEADVRNILSSEQRQILGGC 928
            LPGPSLLEIDHDER EDGTLAS L VI+RIH+NFF HRS+DEADVRNIL++EQ++IL GC
Sbjct: 1095 LPGPSLLEIDHDERPEDGTLASCLGVIQRIHQNFFEHRSIDEADVRNILATEQQKILAGC 1154

Query: 927  RIVFSRVFPVGEANPHLHPLWQTAEQFGAVCVNQIDEQVTHVVANSLGTDKVNWALSTGR 748
            RIVFSRVFPVGEANPH HPLWQTAEQFGAVC +QIDEQVTHVVANSLGTDKVNWALSTGR
Sbjct: 1155 RIVFSRVFPVGEANPHFHPLWQTAEQFGAVCSSQIDEQVTHVVANSLGTDKVNWALSTGR 1214

Query: 747  FVVHPGWVEASALLYRRANELDFAIK 670
            FVVHPGWVEASALLYRRANE DFAIK
Sbjct: 1215 FVVHPGWVEASALLYRRANEHDFAIK 1240


>ref|XP_009627456.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Nicotiana tomentosiformis]
            gi|697093792|ref|XP_009627526.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 3
            [Nicotiana tomentosiformis]
          Length = 1236

 Score =  854 bits (2206), Expect = 0.0
 Identities = 454/687 (66%), Positives = 526/687 (76%), Gaps = 4/687 (0%)
 Frame = -1

Query: 2718 IISKKQKIVEEPVLDGPEPKRQRTGLTDSGLARDVPTSSGSGGWLEDKGTVRPLFINRNQ 2539
            I S+KQKI E+P  D    KRQR+  TDS +  DV  S+G+GGWLE +GT      + N 
Sbjct: 558  IQSRKQKIAEQPAFDASLLKRQRSEQTDSIIVSDVRPSTGNGGWLEHRGTAGLPITSSNY 617

Query: 2538 GIEKMGSVTRKLGDEVTCSDATSST-PNVTTGGNEQLPVTVTSTTASLHSILKDITVNPS 2362
              +  G+ TRKL ++VT S +TS+T P+V    +  LP+T TS  A+LHS+LKDI +NPS
Sbjct: 618  VTDSSGNGTRKL-EQVTSSVSTSNTMPSVIVNADVNLPLTGTS--ANLHSLLKDIAINPS 674

Query: 2361 MLMNIIKMEQQKSVDPAKSAAQSLSSNSILGAVPLMNVAPSKPSELGQRSAGLLQTP-QT 2185
            + MNIIK+EQQKS D +K+   + SS+SILGAVP  NVA S+ S +GQRS G++Q P QT
Sbjct: 675  IWMNIIKLEQQKSADDSKTTTLASSSSSILGAVPSTNVAASRTSMIGQRSVGIIQAPTQT 734

Query: 2184 ASTNMQDELGKVRMKPRDPRRVLHNNPFLKPGSLGS-DQFKTNVAPIASSQGMKGNLYPQ 2008
            A+    DE+ KVRMKPRDPRRVLHN    K G++GS DQ KT VA    +Q M  +   Q
Sbjct: 735  AAA---DEVAKVRMKPRDPRRVLHNTAVQKSGNVGSADQCKTGVA---GTQAMTSSHCVQ 788

Query: 2007 KQEDESDKNSVPSQSIAPPDIALQFTKNLKNIADIISVSLAPMSPAISPNFPSQPVQVPP 1828
            + ED+ D+ S  + S  PPDIA QFTKNLKNIAD+ISVS    SPA +   P+Q +QV P
Sbjct: 789  RPEDQLDRKSAVTPSTTPPDIARQFTKNLKNIADMISVSPTSTSPAAASQTPTQHMQVHP 848

Query: 1827 IRVDVKGVVPELGNLERTSS-STEEVAVDPSGSKNTWGEVEHLFEGFDDQQKAAIQKERA 1651
             R++  G V E   L   +  ++ +   D    +++WG VEHLFEG+ DQQ+A+IQ+ER 
Sbjct: 849  SRLEGNGAVSESSELLTDAGLASGKAPPDSLQPQSSWGNVEHLFEGYSDQQRASIQRERT 908

Query: 1650 RRIAEQKKMFAARKXXXXXXXXXXXLNSAKFVEVDSVHDEILRKKEEQDREKPQRHLFRF 1471
            RR+ EQKKMF+ RK           LNSAKFVE+D VH EILRKKEEQDREKP RHLFRF
Sbjct: 909  RRLEEQKKMFSVRKLCLVLDLDHTLLNSAKFVEIDPVHQEILRKKEEQDREKPYRHLFRF 968

Query: 1470 PYMGMWTKLRPGIWNFLEKASKLFELHLYTMGNKLYATEMAKLLDPKGVLFAGRVISRGD 1291
             +MGMWTKLRPGIWNFLEKASKLFELHLYTMGNKLYATEMAKLLDPKG LFAGRVISRGD
Sbjct: 969  LHMGMWTKLRPGIWNFLEKASKLFELHLYTMGNKLYATEMAKLLDPKGDLFAGRVISRGD 1028

Query: 1290 DGDLIDGDERVPKSKDLEGVLGMEXXXXXXXXXVKVWPHNKLNLIVVERYIYFPCSRRQF 1111
            DGD +DGDER+PKSKDLEGVLGME         V+VWPHNKLNLIVVERYIYFPCSRRQF
Sbjct: 1029 DGDPLDGDERIPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQF 1088

Query: 1110 GLPGPSLLEIDHDERSEDGTLASSLAVIERIHENFFSHRSLDEADVRNILSSEQRQILGG 931
            GLPGPSLLEIDHDER EDGTLAS L VI+RIH+NFF HRS+DEADVRNIL++EQ++IL G
Sbjct: 1089 GLPGPSLLEIDHDERPEDGTLASCLGVIQRIHQNFFEHRSIDEADVRNILATEQQKILAG 1148

Query: 930  CRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCVNQIDEQVTHVVANSLGTDKVNWALSTG 751
            CRIVFSRVFPVGEANPHLHPLWQTAEQFGAVC +QIDE VTHVVANSLGTDKVNWALSTG
Sbjct: 1149 CRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCSSQIDELVTHVVANSLGTDKVNWALSTG 1208

Query: 750  RFVVHPGWVEASALLYRRANELDFAIK 670
            RFVVHPGWVEAS LLYRRANE DFAIK
Sbjct: 1209 RFVVHPGWVEASTLLYRRANEHDFAIK 1235


>ref|XP_011036157.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Populus euphratica]
          Length = 1100

 Score =  853 bits (2205), Expect = 0.0
 Identities = 467/730 (63%), Positives = 532/730 (72%), Gaps = 23/730 (3%)
 Frame = -1

Query: 2790 LDLNQFPLLGVNTEPKVVNPLGGTIISKKQKIVEEPVLDGPEPKRQRTGLTDSGLARDVP 2611
            LD NQ  L  VN  P+V  P G  + SKKQKI EE VLDGP  KRQR    + G  RD+ 
Sbjct: 383  LDHNQRALPMVNNLPRV-EPAGAIVGSKKQKI-EEDVLDGPSLKRQRNSFDNYGAVRDIE 440

Query: 2610 TSSGSGGWLEDKGTVRPLFINRNQGIEKMGSVTRKLGDEVTCSDATSSTPNVTTGGNEQL 2431
            + +G+GGWLED     P  +N+NQ  E +     ++ +   C  + S   NV   GN Q 
Sbjct: 441  SMTGTGGWLEDTDMAEPQTVNKNQWAENV-EPGHRINNGFVCPSSGSVKSNVNGSGNAQS 499

Query: 2430 P------------VTVTST-TASLHSILKDITVNPSMLMNIIKMEQQKSV---------D 2317
            P              VTST T SL  +LKDI VNP+ML+NI+KM QQ+ +         D
Sbjct: 500  PFMGISNITGSEQAQVTSTATTSLPDLLKDIAVNPTMLINILKMGQQQRLALDGQQTLSD 559

Query: 2316 PAKSAAQSLSSNSILGAVPLMNVAPSKPSELGQRSAGLLQTPQTASTNMQDELGKVRMKP 2137
            PAKS +    SNS+LGA+  +NVA S+PS +  R AG  Q P   +T+  DE GK+RMKP
Sbjct: 560  PAKSTSHPSISNSVLGAISTVNVASSQPSGILPRPAGT-QVPSQIATS--DESGKIRMKP 616

Query: 2136 RDPRRVLHNNPFLKPGSLGSDQFKTNVAPIASSQGMKGNLYPQKQEDESDKNSVPSQSIA 1957
            RDPRR LHNN   + GSLGS+QFKT      ++QG K +   Q+QE  ++  S       
Sbjct: 617  RDPRRFLHNNSLQRAGSLGSEQFKTTTLT-PTTQGTKDDQNVQEQEGLAELKST-----V 670

Query: 1956 PPDIALQFTKNLKNIADIISVSLAPMSPA-ISPNFPSQPVQVPPIRVDVKGVVPELGNLE 1780
            PPDI+  FTK+L+NIADI+SVS A  +P  IS N  SQP+Q    RVD K  +  + + +
Sbjct: 671  PPDISFPFTKSLENIADILSVSQASTTPPFISQNVASQPMQTKSERVDGKTGI-SISDQK 729

Query: 1779 RTSSSTEEVAVDPSGSKNTWGEVEHLFEGFDDQQKAAIQKERARRIAEQKKMFAARKXXX 1600
               +S+ EV    S  +NTW +VEHLFEG+DDQQKAAIQ+ERARR+ EQKKMFAARK   
Sbjct: 730  TGPASSAEVVAASSHLQNTWKDVEHLFEGYDDQQKAAIQRERARRMEEQKKMFAARKLCL 789

Query: 1599 XXXXXXXXLNSAKFVEVDSVHDEILRKKEEQDREKPQRHLFRFPYMGMWTKLRPGIWNFL 1420
                    LNSAKFVEVD VHDEILRKKEEQDREKP RH+FRFP+MGMWTKLRPGIWNFL
Sbjct: 790  VLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPYRHIFRFPHMGMWTKLRPGIWNFL 849

Query: 1419 EKASKLFELHLYTMGNKLYATEMAKLLDPKGVLFAGRVISRGDDGDLIDGDERVPKSKDL 1240
            EKASKLFELHLYTMGNKLYATEMAK+LDPKGVLFAGRVISRGDDGD  DGDERVPKSKDL
Sbjct: 850  EKASKLFELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDL 909

Query: 1239 EGVLGMEXXXXXXXXXVKVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERSE 1060
            EGVLGME         V+VWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDER E
Sbjct: 910  EGVLGMESGVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPE 969

Query: 1059 DGTLASSLAVIERIHENFFSHRSLDEADVRNILSSEQRQILGGCRIVFSRVFPVGEANPH 880
            DGTLA SLAVIE+IH+NFF+HRSLDEADVRNIL+SEQR+ILGGCRI+FSRVFPVGE  PH
Sbjct: 970  DGTLACSLAVIEKIHQNFFTHRSLDEADVRNILASEQRKILGGCRILFSRVFPVGEVKPH 1029

Query: 879  LHPLWQTAEQFGAVCVNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYR 700
            LHPLWQ AEQFGAVC+NQIDEQVTHVVANSLGTDKVNWALSTGR VVHPGWVEASALLYR
Sbjct: 1030 LHPLWQMAEQFGAVCINQIDEQVTHVVANSLGTDKVNWALSTGRIVVHPGWVEASALLYR 1089

Query: 699  RANELDFAIK 670
            RANE DFAIK
Sbjct: 1090 RANEQDFAIK 1099


>ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
            gi|223548611|gb|EEF50102.1| RNA polymerase II ctd
            phosphatase, putative [Ricinus communis]
          Length = 1195

 Score =  852 bits (2202), Expect = 0.0
 Identities = 460/728 (63%), Positives = 532/728 (73%), Gaps = 21/728 (2%)
 Frame = -1

Query: 2790 LDLNQFPLLGVNTEPKVVNPLGGTIISKKQKIVEEPVLDGPEPKRQRTGLTDSGLARDVP 2611
            LD N   +  VNT    V P+GGT+  K+QKIV++P+ DG   KRQ+  L +SG+ RDV 
Sbjct: 481  LDQNHRAVPVVNTLK--VEPIGGTMNKKRQKIVDDPIPDGHSLKRQKNALENSGVVRDVK 538

Query: 2610 TSSGSGGWLEDKGTVRPLFINRNQGIEKMGSVTRKLGDEVTCSDATSSTPNVTTGGNEQL 2431
            T  GSGGWLED   V P  +N+NQ ++   S  R+      C+ ++S   +V   G EQ+
Sbjct: 539  TMVGSGGWLEDTDMVGPQTMNKNQLVDNAESDPRRKDGGGVCT-SSSCISSVNISGTEQI 597

Query: 2430 PVTVTS------------TTASLHSILKDITVNPSMLMNIIKM---------EQQKSVDP 2314
            PVT TS            +TA++  +LK+I VNP+ML+NI+KM          QQK VDP
Sbjct: 598  PVTGTSVPIGGELVPVKGSTAAIPDLLKNIAVNPTMLINILKMGQQQRLALEAQQKPVDP 657

Query: 2313 AKSAAQSLSSNSILGAVPLMNVAPSKPSELGQRSAGLLQTPQTASTNMQDELGKVRMKPR 2134
            AKS    L+SNS+LG VP++  A S    +  R AG +Q      T   D+LGK+RMKPR
Sbjct: 658  AKSTTYPLNSNSMLGTVPVVGAAHSG---ILPRPAGTVQVSPQLGT--ADDLGKIRMKPR 712

Query: 2133 DPRRVLHNNPFLKPGSLGSDQFKTNVAPIASSQGMKGNLYPQKQEDESDKNSVPSQSIAP 1954
            DPRRVLHNN   + GS+GS+  KTN+  I  +Q  K N   QKQE + +K  VP QS+A 
Sbjct: 713  DPRRVLHNNALQRNGSMGSEHLKTNLTSIPINQETKDNQNLQKQEGQVEKKPVPLQSLAL 772

Query: 1953 PDIALQFTKNLKNIADIISVSLAPMSPAISPNFPSQPVQVPPIRVDVKGVVPELGNLERT 1774
            PDI++ FTKNLKNIADI+SVS A  S  + P  P+      P+R  +      LG +   
Sbjct: 773  PDISMPFTKNLKNIADIVSVSHASTSQPLVPQNPASQ----PMRTTISSSDQFLG-IGSA 827

Query: 1773 SSSTEEVAVDPSGSKNTWGEVEHLFEGFDDQQKAAIQKERARRIAEQKKMFAARKXXXXX 1594
              +    A  P  ++N WG+VEHLFEG++DQQKAAIQ+ERARRI EQKK+F+ARK     
Sbjct: 828  PGAAAAAAAGPR-TQNAWGDVEHLFEGYNDQQKAAIQRERARRIEEQKKLFSARKLCLVL 886

Query: 1593 XXXXXXLNSAKFVEVDSVHDEILRKKEEQDREKPQRHLFRFPYMGMWTKLRPGIWNFLEK 1414
                  LNSAKFVEVD VHDEILRKKEEQDREK  RHLFRFP+MGMWTKLRPGIWNFLEK
Sbjct: 887  DLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAHRHLFRFPHMGMWTKLRPGIWNFLEK 946

Query: 1413 ASKLFELHLYTMGNKLYATEMAKLLDPKGVLFAGRVISRGDDGDLIDGDERVPKSKDLEG 1234
            ASKL+ELHLYTMGNKLYATEMAK+LDP GVLF GRVISRGDDG+  DGDER+PKSKDLEG
Sbjct: 947  ASKLYELHLYTMGNKLYATEMAKVLDPTGVLFNGRVISRGDDGEPFDGDERIPKSKDLEG 1006

Query: 1233 VLGMEXXXXXXXXXVKVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERSEDG 1054
            VLGME         V+VWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDER EDG
Sbjct: 1007 VLGMESGVVIMDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEDG 1066

Query: 1053 TLASSLAVIERIHENFFSHRSLDEADVRNILSSEQRQILGGCRIVFSRVFPVGEANPHLH 874
            TLA SLAVIERIH+NFF+H SLDEADVRNIL+SEQR+IL GCRIVFSRVFPVGEANPHLH
Sbjct: 1067 TLACSLAVIERIHQNFFTHPSLDEADVRNILASEQRKILAGCRIVFSRVFPVGEANPHLH 1126

Query: 873  PLWQTAEQFGAVCVNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRA 694
            PLWQTAEQFGAVC NQIDEQVTHVVANSLGTDKVNWALSTGRFVV+PGWVEASALLYRRA
Sbjct: 1127 PLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVYPGWVEASALLYRRA 1186

Query: 693  NELDFAIK 670
            NE DFAIK
Sbjct: 1187 NEQDFAIK 1194


>ref|XP_010249185.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Nelumbo nucifera]
          Length = 1313

 Score =  852 bits (2200), Expect = 0.0
 Identities = 464/723 (64%), Positives = 540/723 (74%), Gaps = 15/723 (2%)
 Frame = -1

Query: 2790 LDLNQFPLLGVNTEPKVVNPLGGTIISKKQKIVEEPVLDGPEPKRQRTGLTDSGLARDVP 2611
            LDLNQ P  G + + +   PLGG + S+K KIVEE +LD    KRQR GL +SG + DV 
Sbjct: 593  LDLNQRPPSG-DHDIRKSEPLGGIMGSRKHKIVEESLLDDHTFKRQRNGLINSGASGDVQ 651

Query: 2610 TSSGSGGWLEDKGTVRPLFINRNQGIEKMGSVTRKLGDEVTC----SDATSSTPNVTTGG 2443
              SGSGGWLE+  ++     +R++ IEK  S  RKLG          D   ST NVTTGG
Sbjct: 652  VVSGSGGWLEESSSMGLQPTDRSRLIEKRESDPRKLGSGEASFGNKQDTGCSTYNVTTGG 711

Query: 2442 NEQLPVTVTSTTASLHSILKDITVNPSMLMNIIKMEQQ--------KSVDPAKSAAQSLS 2287
            NEQL  +   +T SL S+LKDI VNP+MLM++IKME Q        K  +PA+S  QS S
Sbjct: 712  NEQLTASGIGSTVSLPSLLKDIAVNPTMLMHLIKMEHQRLAVEALQKCGNPAQSTMQSSS 771

Query: 2286 SNSILGAVPLMNVAPSKPSELGQRSAGLLQ-TPQTASTNMQDELGKVRMKPRDPRRVLHN 2110
            S+ + G +  +N+A    SE  ++SAG  Q + QTAS     +LGK+RMKPRDPRR+LH+
Sbjct: 772  SSVMPGKIASVNIASKTLSEPEKKSAGNSQISVQTASMIPHGDLGKIRMKPRDPRRILHS 831

Query: 2109 NPFLKPGSLGSDQFKTNVAPIASSQGMKGNLYPQKQEDESDKNSVPSQSIAPPDIALQFT 1930
            N F K  S G ++FK N  P  ++   + NL  ++Q +++  NS+ SQS APPDIA QFT
Sbjct: 832  NTFQKSDSSGPERFKANGTPSPNTPTCRDNLIVRQQGEQAQTNSLLSQSTAPPDIAQQFT 891

Query: 1929 KNLKNIADIISVSLAPMSPAISPN-FPSQPVQVPPIRVDVKGVVPELGNLERTSSST-EE 1756
            K LKNIA+I+S S A  +P++ P    SQPV     +VD+K V  +  +    S+ T EE
Sbjct: 892  KKLKNIANILSASQAINTPSVVPQTISSQPVPAKMDKVDMKVVATDSNDQRSWSALTPEE 951

Query: 1755 VAVDPSGSKNTWGEVEHLFEGFDDQQKAAIQKERARRIAEQKKMFAARKXXXXXXXXXXX 1576
             A  PS S+N WG+VEHLFEG+DDQQKAAIQ+ERARRI EQ +MFAARK           
Sbjct: 952  RAAGPS-SQNAWGDVEHLFEGYDDQQKAAIQRERARRIEEQNQMFAARKLCLVLDLDHTL 1010

Query: 1575 LNSAKFVEVDSVHDEILRKKEEQDREKPQRHLFRFPYMGMWTKLRPGIWNFLEKASKLFE 1396
            LNSAKFVEVD VH+E+LRKKEEQDREKPQRHLFRF +MGMWTKLRPGIWNFLEKASKL+E
Sbjct: 1011 LNSAKFVEVDPVHEEMLRKKEEQDREKPQRHLFRFTHMGMWTKLRPGIWNFLEKASKLYE 1070

Query: 1395 LHLYTMGNKLYATEMAKLLDPKGVLFAGRVISRGDDGDLIDGDERVPKSKDLEGVLGMEX 1216
            LHLYTMGNKLYATEMAK+LDP GVLFAGRVISRGDDGD  DGDER PKSKDL+GVLGME 
Sbjct: 1071 LHLYTMGNKLYATEMAKVLDPTGVLFAGRVISRGDDGDPFDGDERQPKSKDLDGVLGMES 1130

Query: 1215 XXXXXXXXVKVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERSEDGTLASSL 1036
                    V+VWPHNKLNLIVVERY YFPCSRRQ GL GPSLLEIDHDER EDGTLASSL
Sbjct: 1131 AVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQLGLHGPSLLEIDHDERPEDGTLASSL 1190

Query: 1035 AVIERIHENFFSHRSLDEADVRNILSSEQRQILGGCRIVFSRVFPVGEANPHLHPLWQTA 856
            AVIERIH+NFFSH++L++ DVRNIL++EQ++IL GCRIVFSRVFPVGEANPHLHPLWQTA
Sbjct: 1191 AVIERIHQNFFSHQNLNDVDVRNILAAEQQKILAGCRIVFSRVFPVGEANPHLHPLWQTA 1250

Query: 855  EQFGAVCVNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANELDFA 676
            EQFGAVC NQIDEQVTHVVA SLGTDKVNWALSTGRFVVHPGWVEASALLYRRANE DFA
Sbjct: 1251 EQFGAVCTNQIDEQVTHVVAISLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEHDFA 1310

Query: 675  IKL 667
            IKL
Sbjct: 1311 IKL 1313


>ref|XP_012088736.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Jatropha curcas] gi|643708360|gb|KDP23276.1|
            hypothetical protein JCGZ_23109 [Jatropha curcas]
          Length = 1283

 Score =  851 bits (2198), Expect = 0.0
 Identities = 460/722 (63%), Positives = 527/722 (72%), Gaps = 25/722 (3%)
 Frame = -1

Query: 2760 VNTEPKVVNPLGGTIISKKQKIVEEPVLDGPEPKRQRTGLTDSGLARDVPTSSGSGGWLE 2581
            VN  PKV   LGG +  KKQK V++ VLDGP  KRQR  L  SG   +V T   SGGWLE
Sbjct: 580  VNNTPKV-EYLGGPMNLKKQKSVDDSVLDGPSLKRQRNVLEHSGGVGNVKTMIASGGWLE 638

Query: 2580 DKGTVRPLFINRNQGIEKMGSVTRKLGDEVTCSDATSSTPNVTTGGNEQLPVTVT----- 2416
            D   VRP  +NRNQ +E   S  R++ + V C    S   +V+  GNEQ PV  T     
Sbjct: 639  DTDMVRPQTMNRNQLVEN--SDPRRMDNGVACPSTVSGISSVSISGNEQKPVIGTGAITE 696

Query: 2415 --------STTASLHSILKDITVNPSMLMNIIKM---------EQQKSVDPAKSAAQSLS 2287
                    ++ ASL  +LK+I VNP+ML+N++KM          QQK  DPAK++   L+
Sbjct: 697  GEQIQMTGTSEASLPDLLKNIAVNPTMLLNLLKMGQQQRSAIDAQQKPSDPAKTSKHPLN 756

Query: 2286 SNSILGAVPLMNVAPSKPSELGQRSAGLLQTPQTASTNMQDELGKVRMKPRDPRRVLHNN 2107
            +N+ILG+VP++NV P +PS +  R AG LQ P  A+    +ELGK+RMKPRDPRRVLH  
Sbjct: 757  ANAILGSVPVVNVVPPQPSVM-PRPAGTLQVPPQAAV---EELGKIRMKPRDPRRVLHYQ 812

Query: 2106 PFLKPGSLGSDQFKTNVAPIASSQGMKGNLYPQKQEDESDKNSVPSQSIAPPDIALQFTK 1927
               K G++G +QFKTN+    + QG K N   QKQ+ +++   VP QS+  PDI+L FTK
Sbjct: 813  TLQKNGNMGYEQFKTNLTSPPTDQGTKDNQIVQKQDGQAETEPVPLQSLVVPDISLPFTK 872

Query: 1926 NLKNIADIISVSLAPMSPAI-SPNFPSQPVQVPPIRVDVKGVVPELGNLERTSSSTEEVA 1750
            +LKNIADI+SVS A  SP + S N  SQP +              + N E+ +       
Sbjct: 873  SLKNIADIVSVSHASTSPTVVSQNLASQPTRTI------------VSNSEQPAGIGSAPC 920

Query: 1749 VDPSGSK--NTWGEVEHLFEGFDDQQKAAIQKERARRIAEQKKMFAARKXXXXXXXXXXX 1576
            V P G +  + WG+VEHLFEG+ DQQKAAIQ+ERARRI EQKKMFAARK           
Sbjct: 921  VAPVGPRPQDAWGDVEHLFEGYSDQQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTL 980

Query: 1575 LNSAKFVEVDSVHDEILRKKEEQDREKPQRHLFRFPYMGMWTKLRPGIWNFLEKASKLFE 1396
            LNSAKFVEVD VHDEILRKKEEQDREKP RHLFRFP+MGMWTKLRPGIWNFLEKASKL+E
Sbjct: 981  LNSAKFVEVDPVHDEILRKKEEQDREKPYRHLFRFPHMGMWTKLRPGIWNFLEKASKLYE 1040

Query: 1395 LHLYTMGNKLYATEMAKLLDPKGVLFAGRVISRGDDGDLIDGDERVPKSKDLEGVLGMEX 1216
            LHLYTMGNKLYATEMAK+LDP GVLF GRVISRGDD D  D DERVPKSKDLEGVLGME 
Sbjct: 1041 LHLYTMGNKLYATEMAKVLDPTGVLFNGRVISRGDDTDSFDSDERVPKSKDLEGVLGMES 1100

Query: 1215 XXXXXXXXVKVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERSEDGTLASSL 1036
                    V+VWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDER EDGTLA SL
Sbjct: 1101 AVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEDGTLACSL 1160

Query: 1035 AVIERIHENFFSHRSLDEADVRNILSSEQRQILGGCRIVFSRVFPVGEANPHLHPLWQTA 856
            AVIE+IH++FF+H SLD+ADVRNIL+SEQR+IL GCRIVFSRVFPVGEANPHLHPLWQTA
Sbjct: 1161 AVIEKIHQHFFTHPSLDDADVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTA 1220

Query: 855  EQFGAVCVNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANELDFA 676
            EQFGAVC NQIDEQVTHVVANSLGTDKVNWALSTGRFVV+PGWVEASALLYRRANE DFA
Sbjct: 1221 EQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVYPGWVEASALLYRRANEQDFA 1280

Query: 675  IK 670
            IK
Sbjct: 1281 IK 1282


>ref|XP_012459418.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            isoform X2 [Gossypium raimondii]
          Length = 1251

 Score =  848 bits (2192), Expect = 0.0
 Identities = 463/729 (63%), Positives = 531/729 (72%), Gaps = 22/729 (3%)
 Frame = -1

Query: 2790 LDLNQFPLLGVNTEPKVVNPLGGTIISKKQKIVEEPVLDGPEPKRQRTGLTDSGLARDVP 2611
            LDLNQ PL   +  P    P+ G +  +K+K  EEPVLDGP PKRQ+  L + G+ RDV 
Sbjct: 535  LDLNQRPLHNASKVP----PVSGIMDPRKKKSTEEPVLDGPAPKRQKNELENFGV-RDVQ 589

Query: 2610 TSSGSGGWLEDKGTVRPLFINRNQGIEKMGSVTRKLGDEVTCSDATSSTPNVTTGGNEQL 2431
              SG+GGWLED         NRNQ +E + S +RK+   VTCS   S   N T   NEQ+
Sbjct: 590  AVSGNGGWLEDTDNCESQITNRNQTMETLDSNSRKMEHGVTCSSTLSGKTNTTVNKNEQV 649

Query: 2430 PVTVTSTTASLHSILKDITVNPSMLMNIIKM---------EQQKSVDPAKSAAQSLSSNS 2278
            P+T  S   SL ++LKDI VNP+ML+NI+KM          QQK+ DP K+     SSN 
Sbjct: 650  PLTGMSNP-SLPALLKDIAVNPTMLINILKMGQQQRLPSESQQKTPDPLKNTLYQPSSNP 708

Query: 2277 ILGAVPLMNVAPSKP-SELGQRSAGLLQTPQTASTNMQ----DELGKVRMKPRDPRRVLH 2113
            +LG +P  NV PS   + +   S+G L  P   + N+Q    DE  K+RMKPRDPRRVLH
Sbjct: 709  VLGVIPPANVIPSPSVNVVPSSSSGTLSKP---AGNLQGPPLDESCKIRMKPRDPRRVLH 765

Query: 2112 NNPFLKPGSLGSDQFKTN-VAPIASSQGMKGNLYPQKQ-EDESDKNSVPSQSIAPPDIAL 1939
             N   K GS+G DQ KTN  +P +S+QG K N+  QKQ E++ +   +  Q + PPDIA 
Sbjct: 766  GNVLQKSGSVGPDQLKTNGTSPASSTQGSKDNMNAQKQLENQIEAKPIQCQFVPPPDIAQ 825

Query: 1938 QFTKNLKNIADIISVSLAPMS----PAISPNFPSQPVQVPPIRVD--VKGVVPELGNLER 1777
            QFT++LKNIA ++S    P S    PA+S N  SQP+QV     D   KG   E      
Sbjct: 826  QFTQSLKNIAGMMS---GPQSFAGLPAVSQNLVSQPIQVKSETADKNTKGSNSE-DQQTG 881

Query: 1776 TSSSTEEVAVDPSGSKNTWGEVEHLFEGFDDQQKAAIQKERARRIAEQKKMFAARKXXXX 1597
            T ++ E     P  S+N WG+VEHLFE +DD+QKAAIQ+ERARRI EQKKMFAARK    
Sbjct: 882  TGTAPEAGVTCPPPSQNAWGDVEHLFEKYDDRQKAAIQRERARRIEEQKKMFAARKLCLV 941

Query: 1596 XXXXXXXLNSAKFVEVDSVHDEILRKKEEQDREKPQRHLFRFPYMGMWTKLRPGIWNFLE 1417
                   LNSAKF+EVD VH+EILRKKEEQDREKPQRHLFRF +MGMWTKLRPGIWNFLE
Sbjct: 942  LDLDHTLLNSAKFIEVDPVHEEILRKKEEQDREKPQRHLFRFHHMGMWTKLRPGIWNFLE 1001

Query: 1416 KASKLFELHLYTMGNKLYATEMAKLLDPKGVLFAGRVISRGDDGDLIDGDERVPKSKDLE 1237
            KASKL+ELHLYTMGNKLYATEMAK+LDPKGVLFAGRVISRGDDGD  DGDERVP+SKDLE
Sbjct: 1002 KASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLE 1061

Query: 1236 GVLGMEXXXXXXXXXVKVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERSED 1057
            GVLGME         V+VWPHNKLNLIVVERY YFPCSRRQFGL GPSLLEIDHDER ED
Sbjct: 1062 GVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPED 1121

Query: 1056 GTLASSLAVIERIHENFFSHRSLDEADVRNILSSEQRQILGGCRIVFSRVFPVGEANPHL 877
            GTLASSLAVIERIH+NFFSH++LD+ DVRNIL++EQR+IL GCRIVFSRVFPVGEANPHL
Sbjct: 1122 GTLASSLAVIERIHQNFFSHQNLDDLDVRNILATEQRKILSGCRIVFSRVFPVGEANPHL 1181

Query: 876  HPLWQTAEQFGAVCVNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRR 697
            HPLWQTAEQFGAVC NQIDE VTHVVANSLGTDKVNWALSTG+FVVHPGWVEASALLYRR
Sbjct: 1182 HPLWQTAEQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRR 1241

Query: 696  ANELDFAIK 670
            ANE DFAIK
Sbjct: 1242 ANEHDFAIK 1250


>gb|KJB77193.1| hypothetical protein B456_012G125200 [Gossypium raimondii]
          Length = 982

 Score =  848 bits (2192), Expect = 0.0
 Identities = 463/729 (63%), Positives = 531/729 (72%), Gaps = 22/729 (3%)
 Frame = -1

Query: 2790 LDLNQFPLLGVNTEPKVVNPLGGTIISKKQKIVEEPVLDGPEPKRQRTGLTDSGLARDVP 2611
            LDLNQ PL   +  P    P+ G +  +K+K  EEPVLDGP PKRQ+  L + G+ RDV 
Sbjct: 266  LDLNQRPLHNASKVP----PVSGIMDPRKKKSTEEPVLDGPAPKRQKNELENFGV-RDVQ 320

Query: 2610 TSSGSGGWLEDKGTVRPLFINRNQGIEKMGSVTRKLGDEVTCSDATSSTPNVTTGGNEQL 2431
              SG+GGWLED         NRNQ +E + S +RK+   VTCS   S   N T   NEQ+
Sbjct: 321  AVSGNGGWLEDTDNCESQITNRNQTMETLDSNSRKMEHGVTCSSTLSGKTNTTVNKNEQV 380

Query: 2430 PVTVTSTTASLHSILKDITVNPSMLMNIIKM---------EQQKSVDPAKSAAQSLSSNS 2278
            P+T  S   SL ++LKDI VNP+ML+NI+KM          QQK+ DP K+     SSN 
Sbjct: 381  PLTGMSNP-SLPALLKDIAVNPTMLINILKMGQQQRLPSESQQKTPDPLKNTLYQPSSNP 439

Query: 2277 ILGAVPLMNVAPSKP-SELGQRSAGLLQTPQTASTNMQ----DELGKVRMKPRDPRRVLH 2113
            +LG +P  NV PS   + +   S+G L  P   + N+Q    DE  K+RMKPRDPRRVLH
Sbjct: 440  VLGVIPPANVIPSPSVNVVPSSSSGTLSKP---AGNLQGPPLDESCKIRMKPRDPRRVLH 496

Query: 2112 NNPFLKPGSLGSDQFKTN-VAPIASSQGMKGNLYPQKQ-EDESDKNSVPSQSIAPPDIAL 1939
             N   K GS+G DQ KTN  +P +S+QG K N+  QKQ E++ +   +  Q + PPDIA 
Sbjct: 497  GNVLQKSGSVGPDQLKTNGTSPASSTQGSKDNMNAQKQLENQIEAKPIQCQFVPPPDIAQ 556

Query: 1938 QFTKNLKNIADIISVSLAPMS----PAISPNFPSQPVQVPPIRVD--VKGVVPELGNLER 1777
            QFT++LKNIA ++S    P S    PA+S N  SQP+QV     D   KG   E      
Sbjct: 557  QFTQSLKNIAGMMS---GPQSFAGLPAVSQNLVSQPIQVKSETADKNTKGSNSE-DQQTG 612

Query: 1776 TSSSTEEVAVDPSGSKNTWGEVEHLFEGFDDQQKAAIQKERARRIAEQKKMFAARKXXXX 1597
            T ++ E     P  S+N WG+VEHLFE +DD+QKAAIQ+ERARRI EQKKMFAARK    
Sbjct: 613  TGTAPEAGVTCPPPSQNAWGDVEHLFEKYDDRQKAAIQRERARRIEEQKKMFAARKLCLV 672

Query: 1596 XXXXXXXLNSAKFVEVDSVHDEILRKKEEQDREKPQRHLFRFPYMGMWTKLRPGIWNFLE 1417
                   LNSAKF+EVD VH+EILRKKEEQDREKPQRHLFRF +MGMWTKLRPGIWNFLE
Sbjct: 673  LDLDHTLLNSAKFIEVDPVHEEILRKKEEQDREKPQRHLFRFHHMGMWTKLRPGIWNFLE 732

Query: 1416 KASKLFELHLYTMGNKLYATEMAKLLDPKGVLFAGRVISRGDDGDLIDGDERVPKSKDLE 1237
            KASKL+ELHLYTMGNKLYATEMAK+LDPKGVLFAGRVISRGDDGD  DGDERVP+SKDLE
Sbjct: 733  KASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLE 792

Query: 1236 GVLGMEXXXXXXXXXVKVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERSED 1057
            GVLGME         V+VWPHNKLNLIVVERY YFPCSRRQFGL GPSLLEIDHDER ED
Sbjct: 793  GVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPED 852

Query: 1056 GTLASSLAVIERIHENFFSHRSLDEADVRNILSSEQRQILGGCRIVFSRVFPVGEANPHL 877
            GTLASSLAVIERIH+NFFSH++LD+ DVRNIL++EQR+IL GCRIVFSRVFPVGEANPHL
Sbjct: 853  GTLASSLAVIERIHQNFFSHQNLDDLDVRNILATEQRKILSGCRIVFSRVFPVGEANPHL 912

Query: 876  HPLWQTAEQFGAVCVNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRR 697
            HPLWQTAEQFGAVC NQIDE VTHVVANSLGTDKVNWALSTG+FVVHPGWVEASALLYRR
Sbjct: 913  HPLWQTAEQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRR 972

Query: 696  ANELDFAIK 670
            ANE DFAIK
Sbjct: 973  ANEHDFAIK 981


>gb|KJB77192.1| hypothetical protein B456_012G125200 [Gossypium raimondii]
          Length = 1033

 Score =  848 bits (2192), Expect = 0.0
 Identities = 463/729 (63%), Positives = 531/729 (72%), Gaps = 22/729 (3%)
 Frame = -1

Query: 2790 LDLNQFPLLGVNTEPKVVNPLGGTIISKKQKIVEEPVLDGPEPKRQRTGLTDSGLARDVP 2611
            LDLNQ PL   +  P    P+ G +  +K+K  EEPVLDGP PKRQ+  L + G+ RDV 
Sbjct: 317  LDLNQRPLHNASKVP----PVSGIMDPRKKKSTEEPVLDGPAPKRQKNELENFGV-RDVQ 371

Query: 2610 TSSGSGGWLEDKGTVRPLFINRNQGIEKMGSVTRKLGDEVTCSDATSSTPNVTTGGNEQL 2431
              SG+GGWLED         NRNQ +E + S +RK+   VTCS   S   N T   NEQ+
Sbjct: 372  AVSGNGGWLEDTDNCESQITNRNQTMETLDSNSRKMEHGVTCSSTLSGKTNTTVNKNEQV 431

Query: 2430 PVTVTSTTASLHSILKDITVNPSMLMNIIKM---------EQQKSVDPAKSAAQSLSSNS 2278
            P+T  S   SL ++LKDI VNP+ML+NI+KM          QQK+ DP K+     SSN 
Sbjct: 432  PLTGMSNP-SLPALLKDIAVNPTMLINILKMGQQQRLPSESQQKTPDPLKNTLYQPSSNP 490

Query: 2277 ILGAVPLMNVAPSKP-SELGQRSAGLLQTPQTASTNMQ----DELGKVRMKPRDPRRVLH 2113
            +LG +P  NV PS   + +   S+G L  P   + N+Q    DE  K+RMKPRDPRRVLH
Sbjct: 491  VLGVIPPANVIPSPSVNVVPSSSSGTLSKP---AGNLQGPPLDESCKIRMKPRDPRRVLH 547

Query: 2112 NNPFLKPGSLGSDQFKTN-VAPIASSQGMKGNLYPQKQ-EDESDKNSVPSQSIAPPDIAL 1939
             N   K GS+G DQ KTN  +P +S+QG K N+  QKQ E++ +   +  Q + PPDIA 
Sbjct: 548  GNVLQKSGSVGPDQLKTNGTSPASSTQGSKDNMNAQKQLENQIEAKPIQCQFVPPPDIAQ 607

Query: 1938 QFTKNLKNIADIISVSLAPMS----PAISPNFPSQPVQVPPIRVD--VKGVVPELGNLER 1777
            QFT++LKNIA ++S    P S    PA+S N  SQP+QV     D   KG   E      
Sbjct: 608  QFTQSLKNIAGMMS---GPQSFAGLPAVSQNLVSQPIQVKSETADKNTKGSNSE-DQQTG 663

Query: 1776 TSSSTEEVAVDPSGSKNTWGEVEHLFEGFDDQQKAAIQKERARRIAEQKKMFAARKXXXX 1597
            T ++ E     P  S+N WG+VEHLFE +DD+QKAAIQ+ERARRI EQKKMFAARK    
Sbjct: 664  TGTAPEAGVTCPPPSQNAWGDVEHLFEKYDDRQKAAIQRERARRIEEQKKMFAARKLCLV 723

Query: 1596 XXXXXXXLNSAKFVEVDSVHDEILRKKEEQDREKPQRHLFRFPYMGMWTKLRPGIWNFLE 1417
                   LNSAKF+EVD VH+EILRKKEEQDREKPQRHLFRF +MGMWTKLRPGIWNFLE
Sbjct: 724  LDLDHTLLNSAKFIEVDPVHEEILRKKEEQDREKPQRHLFRFHHMGMWTKLRPGIWNFLE 783

Query: 1416 KASKLFELHLYTMGNKLYATEMAKLLDPKGVLFAGRVISRGDDGDLIDGDERVPKSKDLE 1237
            KASKL+ELHLYTMGNKLYATEMAK+LDPKGVLFAGRVISRGDDGD  DGDERVP+SKDLE
Sbjct: 784  KASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLE 843

Query: 1236 GVLGMEXXXXXXXXXVKVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERSED 1057
            GVLGME         V+VWPHNKLNLIVVERY YFPCSRRQFGL GPSLLEIDHDER ED
Sbjct: 844  GVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPED 903

Query: 1056 GTLASSLAVIERIHENFFSHRSLDEADVRNILSSEQRQILGGCRIVFSRVFPVGEANPHL 877
            GTLASSLAVIERIH+NFFSH++LD+ DVRNIL++EQR+IL GCRIVFSRVFPVGEANPHL
Sbjct: 904  GTLASSLAVIERIHQNFFSHQNLDDLDVRNILATEQRKILSGCRIVFSRVFPVGEANPHL 963

Query: 876  HPLWQTAEQFGAVCVNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRR 697
            HPLWQTAEQFGAVC NQIDE VTHVVANSLGTDKVNWALSTG+FVVHPGWVEASALLYRR
Sbjct: 964  HPLWQTAEQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRR 1023

Query: 696  ANELDFAIK 670
            ANE DFAIK
Sbjct: 1024 ANEHDFAIK 1032


>ref|XP_012459417.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            isoform X1 [Gossypium raimondii]
            gi|763810289|gb|KJB77191.1| hypothetical protein
            B456_012G125200 [Gossypium raimondii]
          Length = 1272

 Score =  848 bits (2192), Expect = 0.0
 Identities = 463/729 (63%), Positives = 531/729 (72%), Gaps = 22/729 (3%)
 Frame = -1

Query: 2790 LDLNQFPLLGVNTEPKVVNPLGGTIISKKQKIVEEPVLDGPEPKRQRTGLTDSGLARDVP 2611
            LDLNQ PL   +  P    P+ G +  +K+K  EEPVLDGP PKRQ+  L + G+ RDV 
Sbjct: 556  LDLNQRPLHNASKVP----PVSGIMDPRKKKSTEEPVLDGPAPKRQKNELENFGV-RDVQ 610

Query: 2610 TSSGSGGWLEDKGTVRPLFINRNQGIEKMGSVTRKLGDEVTCSDATSSTPNVTTGGNEQL 2431
              SG+GGWLED         NRNQ +E + S +RK+   VTCS   S   N T   NEQ+
Sbjct: 611  AVSGNGGWLEDTDNCESQITNRNQTMETLDSNSRKMEHGVTCSSTLSGKTNTTVNKNEQV 670

Query: 2430 PVTVTSTTASLHSILKDITVNPSMLMNIIKM---------EQQKSVDPAKSAAQSLSSNS 2278
            P+T  S   SL ++LKDI VNP+ML+NI+KM          QQK+ DP K+     SSN 
Sbjct: 671  PLTGMSNP-SLPALLKDIAVNPTMLINILKMGQQQRLPSESQQKTPDPLKNTLYQPSSNP 729

Query: 2277 ILGAVPLMNVAPSKP-SELGQRSAGLLQTPQTASTNMQ----DELGKVRMKPRDPRRVLH 2113
            +LG +P  NV PS   + +   S+G L  P   + N+Q    DE  K+RMKPRDPRRVLH
Sbjct: 730  VLGVIPPANVIPSPSVNVVPSSSSGTLSKP---AGNLQGPPLDESCKIRMKPRDPRRVLH 786

Query: 2112 NNPFLKPGSLGSDQFKTN-VAPIASSQGMKGNLYPQKQ-EDESDKNSVPSQSIAPPDIAL 1939
             N   K GS+G DQ KTN  +P +S+QG K N+  QKQ E++ +   +  Q + PPDIA 
Sbjct: 787  GNVLQKSGSVGPDQLKTNGTSPASSTQGSKDNMNAQKQLENQIEAKPIQCQFVPPPDIAQ 846

Query: 1938 QFTKNLKNIADIISVSLAPMS----PAISPNFPSQPVQVPPIRVD--VKGVVPELGNLER 1777
            QFT++LKNIA ++S    P S    PA+S N  SQP+QV     D   KG   E      
Sbjct: 847  QFTQSLKNIAGMMS---GPQSFAGLPAVSQNLVSQPIQVKSETADKNTKGSNSE-DQQTG 902

Query: 1776 TSSSTEEVAVDPSGSKNTWGEVEHLFEGFDDQQKAAIQKERARRIAEQKKMFAARKXXXX 1597
            T ++ E     P  S+N WG+VEHLFE +DD+QKAAIQ+ERARRI EQKKMFAARK    
Sbjct: 903  TGTAPEAGVTCPPPSQNAWGDVEHLFEKYDDRQKAAIQRERARRIEEQKKMFAARKLCLV 962

Query: 1596 XXXXXXXLNSAKFVEVDSVHDEILRKKEEQDREKPQRHLFRFPYMGMWTKLRPGIWNFLE 1417
                   LNSAKF+EVD VH+EILRKKEEQDREKPQRHLFRF +MGMWTKLRPGIWNFLE
Sbjct: 963  LDLDHTLLNSAKFIEVDPVHEEILRKKEEQDREKPQRHLFRFHHMGMWTKLRPGIWNFLE 1022

Query: 1416 KASKLFELHLYTMGNKLYATEMAKLLDPKGVLFAGRVISRGDDGDLIDGDERVPKSKDLE 1237
            KASKL+ELHLYTMGNKLYATEMAK+LDPKGVLFAGRVISRGDDGD  DGDERVP+SKDLE
Sbjct: 1023 KASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLE 1082

Query: 1236 GVLGMEXXXXXXXXXVKVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERSED 1057
            GVLGME         V+VWPHNKLNLIVVERY YFPCSRRQFGL GPSLLEIDHDER ED
Sbjct: 1083 GVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPED 1142

Query: 1056 GTLASSLAVIERIHENFFSHRSLDEADVRNILSSEQRQILGGCRIVFSRVFPVGEANPHL 877
            GTLASSLAVIERIH+NFFSH++LD+ DVRNIL++EQR+IL GCRIVFSRVFPVGEANPHL
Sbjct: 1143 GTLASSLAVIERIHQNFFSHQNLDDLDVRNILATEQRKILSGCRIVFSRVFPVGEANPHL 1202

Query: 876  HPLWQTAEQFGAVCVNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRR 697
            HPLWQTAEQFGAVC NQIDE VTHVVANSLGTDKVNWALSTG+FVVHPGWVEASALLYRR
Sbjct: 1203 HPLWQTAEQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRR 1262

Query: 696  ANELDFAIK 670
            ANE DFAIK
Sbjct: 1263 ANEHDFAIK 1271


>ref|XP_010100046.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Morus
            notabilis] gi|587892642|gb|EXB81217.1| RNA polymerase II
            C-terminal domain phosphatase-like 3 [Morus notabilis]
          Length = 1301

 Score =  843 bits (2179), Expect = 0.0
 Identities = 452/701 (64%), Positives = 517/701 (73%), Gaps = 13/701 (1%)
 Frame = -1

Query: 2790 LDLNQFPLLGVNTEPKVVNPLGGTIISKKQKIVEEPVLDGPEPKRQRTGLTDSGLARDVP 2611
            LDLNQ PL  V+  PKV    G    S+KQ+IVEEP LDGP  KRQR     + +  DV 
Sbjct: 574  LDLNQRPLTAVHNGPKVEP--GDPTSSRKQRIVEEPNLDGPALKRQRHAFVSAKI--DVK 629

Query: 2610 TSSGSGGWLEDKGTVRPLFINRNQGIEKMGSVTRKLGDEVTCSDATSSTPNVTTGGNEQL 2431
            T+SG GGWLED GT  P  +N+NQ +E   +  RK    +      ++ PN+   G EQ+
Sbjct: 630  TASGVGGWLEDNGTTGPQIMNKNQLVENAEADPRK-SIHLVNGPIMNNGPNI---GKEQV 685

Query: 2430 PVTVTSTTASLHSILKDITVNPSMLMNIIKM----------EQQKSVDPAKSAAQSLSSN 2281
            PVT TST  +L +ILKDI VNP++ M+I+             QQKS D +K+      +N
Sbjct: 686  PVTGTSTPDALPAILKDIAVNPTIFMDILNKLGQQQLLAADAQQKS-DSSKNTTHPPGTN 744

Query: 2280 SILGAVPLMNVAPSKPSELGQRSA-GLLQTPQTASTNMQDELGKVRMKPRDPRRVLHNNP 2104
            SILGA PL+NVAPSK S + Q  A  L  T Q A+ +MQDELGK+RMKPRDPRRVLH N 
Sbjct: 745  SILGAAPLVNVAPSKASGILQTPAVSLPTTSQVATASMQDELGKIRMKPRDPRRVLHGNM 804

Query: 2103 FLKPGSLGSDQFKTNVAPIASSQGMKGNLYPQKQEDESDKNSVPSQSIAPPDIALQFTKN 1924
              K  SLG +QFK  V+ ++ + G K NL    QE ++DK  VPSQ +  PDIA QFTKN
Sbjct: 805  LQKSWSLGHEQFKPIVSSVSCTPGNKDNLNGPVQEGQADKKQVPSQLVVQPDIARQFTKN 864

Query: 1923 LKNIADIISVSLAPMSPA-ISPNFPSQPVQVPPIRVDVKGVVPELGNLERTSSSTEEVAV 1747
            L+NIAD++SVS A  SPA +S N  SQP+ V P R DVK VVP   +    ++ST E  +
Sbjct: 865  LRNIADLMSVSQASTSPATVSQNLSSQPLPVKPDRGDVKAVVPNSEDQHSGTNSTPETTL 924

Query: 1746 D-PSGSKNTWGEVEHLFEGFDDQQKAAIQKERARRIAEQKKMFAARKXXXXXXXXXXXLN 1570
              PS + N WG+VEHLFEG+DD+QKAAIQ+ERARR+ EQKKMF A K           LN
Sbjct: 925  AVPSRTPNAWGDVEHLFEGYDDEQKAAIQRERARRLEEQKKMFDAHKLCLVLDLDHTLLN 984

Query: 1569 SAKFVEVDSVHDEILRKKEEQDREKPQRHLFRFPYMGMWTKLRPGIWNFLEKASKLFELH 1390
            SAKFVEVDSVHDEILRKKEEQDREKPQRHLFRFP+MGMWTKLRPG+WNFLEKASKL+ELH
Sbjct: 985  SAKFVEVDSVHDEILRKKEEQDREKPQRHLFRFPHMGMWTKLRPGVWNFLEKASKLYELH 1044

Query: 1389 LYTMGNKLYATEMAKLLDPKGVLFAGRVISRGDDGDLIDGDERVPKSKDLEGVLGMEXXX 1210
            LYTMGNKLYATEMAK+LDP G LF+GRVISRGDDGD  DGDERVPKSKDLEGVLGME   
Sbjct: 1045 LYTMGNKLYATEMAKVLDPMGTLFSGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESSV 1104

Query: 1209 XXXXXXVKVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERSEDGTLASSLAV 1030
                  V+VWPHNKLNLIVVERY YFPCSRRQFGLPGPSLLEIDHDER E GTLASSLAV
Sbjct: 1105 VIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEQGTLASSLAV 1164

Query: 1029 IERIHENFFSHRSLDEADVRNILSSEQRQILGGCRIVFSRVFPVGEANPHLHPLWQTAEQ 850
            IE+IH+NFFSH SLDE DVRNIL+SEQR+IL GCRIVFSRVFPV E NPHLHPLWQTAEQ
Sbjct: 1165 IEKIHQNFFSHHSLDEVDVRNILASEQRKILAGCRIVFSRVFPVSEVNPHLHPLWQTAEQ 1224

Query: 849  FGAVCVNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGW 727
            FGAVC  QID+QVTHVVANS GTDKVNWAL+ G+F VHPGW
Sbjct: 1225 FGAVCTTQIDDQVTHVVANSPGTDKVNWALANGKFAVHPGW 1265

