BLASTX nr result

ID: Akebia27_contig00020491 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00020491
         (2165 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007043830.1| RNA polymerase II C-terminal domain phosphat...   669   0.0  
ref|XP_002304648.2| hypothetical protein POPTR_0003s16280g [Popu...   654   0.0  
ref|XP_002304714.2| hypothetical protein POPTR_0003s16280g [Popu...   654   0.0  
emb|CBI35661.3| unnamed protein product [Vitis vinifera]              648   0.0  
ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal doma...   646   0.0  
ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative ...   646   0.0  
ref|XP_003530482.2| PREDICTED: RNA polymerase II C-terminal doma...   636   e-179
ref|XP_006438860.1| hypothetical protein CICLE_v10030535mg [Citr...   635   e-179
ref|XP_007225412.1| hypothetical protein PRUPE_ppa000589mg [Prun...   635   e-179
ref|XP_006603006.1| PREDICTED: RNA polymerase II C-terminal doma...   635   e-179
ref|XP_007139315.1| hypothetical protein PHAVU_008G019000g [Phas...   632   e-178
ref|XP_004492029.1| PREDICTED: RNA polymerase II C-terminal doma...   632   e-178
ref|XP_004492028.1| PREDICTED: RNA polymerase II C-terminal doma...   632   e-178
gb|EXB81217.1| RNA polymerase II C-terminal domain phosphatase-l...   626   e-176
ref|XP_002297869.2| CTD phosphatase-like protein 3 [Populus tric...   623   e-175
gb|EYU42076.1| hypothetical protein MIMGU_mgv1a000356mg [Mimulus...   622   e-175
ref|XP_006341905.1| PREDICTED: RNA polymerase II C-terminal doma...   615   e-173
ref|XP_004140651.1| PREDICTED: RNA polymerase II C-terminal doma...   612   e-172
ref|XP_004310239.1| PREDICTED: RNA polymerase II C-terminal doma...   612   e-172
ref|XP_004157633.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymera...   612   e-172

>ref|XP_007043830.1| RNA polymerase II C-terminal domain phosphatase-like 3, putative
            [Theobroma cacao] gi|508707765|gb|EOX99661.1| RNA
            polymerase II C-terminal domain phosphatase-like 3,
            putative [Theobroma cacao]
          Length = 1290

 Score =  669 bits (1725), Expect = 0.0
 Identities = 344/515 (66%), Positives = 395/515 (76%), Gaps = 7/515 (1%)
 Frame = -2

Query: 2116 ISSVAIGTVPLDLRGDLGKTRMKPRDPRRVLLNNTFKQNGCIGTEQLKTKGVPSSIIQAS 1937
            ISS   G + +    + GK RMKPRDPRRVL  N+ +++G +G +QLKT G  +S  Q S
Sbjct: 776  ISSKPAGNLQVPSPDESGKIRMKPRDPRRVLHGNSLQRSGSMGLDQLKTNGALTSSTQGS 835

Query: 1936 KDSLTVRQ-----HAEXXXXXXXXXXXXXXXAKQN--NIPGTLSDSQSATSLTTVAQTVS 1778
            KD+L  ++      ++                  N  NI   +S SQ+ TSL  V+  + 
Sbjct: 836  KDNLNAQKLDSQTESKPMQSQLVPPPDITQQFTNNLKNIADIMSVSQALTSLPPVSHNLV 895

Query: 1777 SQPIPRKIYNADVRVVATDSNNQESWTNSTPKEGAVVSFQSRNAWEDVEHLFEGFDDQQK 1598
             QP+  K  + D++ + ++S +Q++     P+ GA    +S+NAW DVEHLFE +DDQQK
Sbjct: 896  PQPVLIKSDSMDMKALVSNSEDQQTGAGLAPEAGATGP-RSQNAWGDVEHLFERYDDQQK 954

Query: 1597 ATIRRERARRKEEQNKLFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQK 1418
            A I+RERARR EEQ K+F+ARK           LNSAKF+EVDPVHE ILRKKEEQDR+K
Sbjct: 955  AAIQRERARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFIEVDPVHEEILRKKEEQDREK 1014

Query: 1417 PQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFG 1238
            P+RHLFRF HMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYA EMAKVLDP G LF 
Sbjct: 1015 PERHLFRFHHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFA 1074

Query: 1237 ERVISRGDDEDPLDGDERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALERYIY 1058
             RVISRGDD DP DGDERVP+ KDL+GVLGMES+VVIIDDS RVWPHNKLNLI +ERY Y
Sbjct: 1075 GRVISRGDDGDPFDGDERVPRSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 1134

Query: 1057 FPCSRRHFGLSGPSLLEIGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRNILAK 878
            FPCSRR FGL GPSLLEI HDER +DG LASSLAVIERIH +FF HQ+L+DVDVRNILA 
Sbjct: 1135 FPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIERIHQDFFSHQNLDDVDVRNILAS 1194

Query: 877  EQKNVLAGCRIVFSRVFPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDK 698
            EQ+ +LAGCRIVFSRVFPVGEA PHLHPLWQTA+QFGAVCTNQIDEHVTHVVANSLGTDK
Sbjct: 1195 EQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEHVTHVVANSLGTDK 1254

Query: 697  VNWALSTGRFVVHPCWVEASALLYRRANEFDFAIK 593
            VNWALSTG+FVVHP WVEASALLYRRANE DFAIK
Sbjct: 1255 VNWALSTGKFVVHPGWVEASALLYRRANEVDFAIK 1289


>ref|XP_002304648.2| hypothetical protein POPTR_0003s16280g [Populus trichocarpa]
            gi|550343308|gb|EEE79627.2| hypothetical protein
            POPTR_0003s16280g [Populus trichocarpa]
          Length = 1247

 Score =  654 bits (1688), Expect = 0.0
 Identities = 344/513 (67%), Positives = 388/513 (75%), Gaps = 2/513 (0%)
 Frame = -2

Query: 2125 LPSISSVAIGTVPLDLRGDLGKTRMKPRDPRRVLLNNTFKQNGCIGTEQLKTKGVPSSII 1946
            LP  +  A G   +    + GK RMKPRDPRRVL NN  ++ G +G+EQ KT  + S+  
Sbjct: 737  LPRSAGKAQGPSQIATTDESGKIRMKPRDPRRVLHNNALQRAGSLGSEQFKTTTLTSTT- 795

Query: 1945 QASKDSLTVRQHAEXXXXXXXXXXXXXXXAKQN--NIPGTLSDSQSATSLTTVAQTVSSQ 1772
            Q +KD+  +++                    ++  NI   +S SQ+ T+   V+Q V+SQ
Sbjct: 796  QGTKDNQNLQKQEGLAELKPVVPPDISSPFTKSLKNIADIVSVSQTCTTPPFVSQNVASQ 855

Query: 1771 PIPRKIYNADVRVVATDSNNQESWTNSTPKEGAVVSFQSRNAWEDVEHLFEGFDDQQKAT 1592
            P+  K    D +   ++S+ Q+    S+P+  A  S  S+N WEDVEHLFEG+DDQQKA 
Sbjct: 856  PVQIKSDRVDGKTGISNSD-QKMGPASSPEVVAASSL-SQNTWEDVEHLFEGYDDQQKAA 913

Query: 1591 IRRERARRKEEQNKLFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQ 1412
            I+RERARR EEQ KLFAARK           LNSAKFVEVDPVH+ ILRKKEEQDR+KP 
Sbjct: 914  IQRERARRIEEQKKLFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPY 973

Query: 1411 RHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFGER 1232
            RHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYA EMAKVLDP G LF  R
Sbjct: 974  RHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGR 1033

Query: 1231 VISRGDDEDPLDGDERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALERYIYFP 1052
            V+SRGDD D LDGDERVPK KDL+GVLGMES VVIIDDS RVWPHNKLNLI +ERYIYFP
Sbjct: 1034 VVSRGDDGDLLDGDERVPKSKDLEGVLGMESGVVIIDDSLRVWPHNKLNLIVVERYIYFP 1093

Query: 1051 CSRRHFGLSGPSLLEIGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQ 872
            CSRR FGL GPSLLEI HDER +DG LA SLAVIERIH NFF H SL++ DVRNILA EQ
Sbjct: 1094 CSRRQFGLPGPSLLEIDHDERPEDGTLACSLAVIERIHQNFFTHHSLDEADVRNILASEQ 1153

Query: 871  KNVLAGCRIVFSRVFPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVN 692
            + +LAGCRIVFSRVFPVGE  PHLHPLWQ+A+QFGAVCTNQIDE VTHVVANSLGTDKVN
Sbjct: 1154 RKILAGCRIVFSRVFPVGEVNPHLHPLWQSAEQFGAVCTNQIDEQVTHVVANSLGTDKVN 1213

Query: 691  WALSTGRFVVHPCWVEASALLYRRANEFDFAIK 593
            WALSTGRFVVHP WVEASALLYRRANE DFAIK
Sbjct: 1214 WALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1246


>ref|XP_002304714.2| hypothetical protein POPTR_0003s16280g [Populus trichocarpa]
            gi|550343307|gb|EEE79693.2| hypothetical protein
            POPTR_0003s16280g [Populus trichocarpa]
          Length = 1030

 Score =  654 bits (1688), Expect = 0.0
 Identities = 344/513 (67%), Positives = 388/513 (75%), Gaps = 2/513 (0%)
 Frame = -2

Query: 2125 LPSISSVAIGTVPLDLRGDLGKTRMKPRDPRRVLLNNTFKQNGCIGTEQLKTKGVPSSII 1946
            LP  +  A G   +    + GK RMKPRDPRRVL NN  ++ G +G+EQ KT  + S+  
Sbjct: 520  LPRSAGKAQGPSQIATTDESGKIRMKPRDPRRVLHNNALQRAGSLGSEQFKTTTLTSTT- 578

Query: 1945 QASKDSLTVRQHAEXXXXXXXXXXXXXXXAKQN--NIPGTLSDSQSATSLTTVAQTVSSQ 1772
            Q +KD+  +++                    ++  NI   +S SQ+ T+   V+Q V+SQ
Sbjct: 579  QGTKDNQNLQKQEGLAELKPVVPPDISSPFTKSLKNIADIVSVSQTCTTPPFVSQNVASQ 638

Query: 1771 PIPRKIYNADVRVVATDSNNQESWTNSTPKEGAVVSFQSRNAWEDVEHLFEGFDDQQKAT 1592
            P+  K    D +   ++S+ Q+    S+P+  A  S  S+N WEDVEHLFEG+DDQQKA 
Sbjct: 639  PVQIKSDRVDGKTGISNSD-QKMGPASSPEVVAASSL-SQNTWEDVEHLFEGYDDQQKAA 696

Query: 1591 IRRERARRKEEQNKLFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQ 1412
            I+RERARR EEQ KLFAARK           LNSAKFVEVDPVH+ ILRKKEEQDR+KP 
Sbjct: 697  IQRERARRIEEQKKLFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPY 756

Query: 1411 RHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFGER 1232
            RHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYA EMAKVLDP G LF  R
Sbjct: 757  RHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGR 816

Query: 1231 VISRGDDEDPLDGDERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALERYIYFP 1052
            V+SRGDD D LDGDERVPK KDL+GVLGMES VVIIDDS RVWPHNKLNLI +ERYIYFP
Sbjct: 817  VVSRGDDGDLLDGDERVPKSKDLEGVLGMESGVVIIDDSLRVWPHNKLNLIVVERYIYFP 876

Query: 1051 CSRRHFGLSGPSLLEIGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQ 872
            CSRR FGL GPSLLEI HDER +DG LA SLAVIERIH NFF H SL++ DVRNILA EQ
Sbjct: 877  CSRRQFGLPGPSLLEIDHDERPEDGTLACSLAVIERIHQNFFTHHSLDEADVRNILASEQ 936

Query: 871  KNVLAGCRIVFSRVFPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVN 692
            + +LAGCRIVFSRVFPVGE  PHLHPLWQ+A+QFGAVCTNQIDE VTHVVANSLGTDKVN
Sbjct: 937  RKILAGCRIVFSRVFPVGEVNPHLHPLWQSAEQFGAVCTNQIDEQVTHVVANSLGTDKVN 996

Query: 691  WALSTGRFVVHPCWVEASALLYRRANEFDFAIK 593
            WALSTGRFVVHP WVEASALLYRRANE DFAIK
Sbjct: 997  WALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1029


>emb|CBI35661.3| unnamed protein product [Vitis vinifera]
          Length = 1184

 Score =  648 bits (1672), Expect = 0.0
 Identities = 339/519 (65%), Positives = 386/519 (74%)
 Frame = -2

Query: 2149 QKAAQSMTLPSISSVAIGTVPLDLRGDLGKTRMKPRDPRRVLLNNTFKQNGCIGTEQLKT 1970
            QK A ++ +P        T P++ + + GK RMKPRDPRR+L  N+F+++G  G+EQ KT
Sbjct: 681  QKPAGALQVPQ-------TGPMNPQDESGKVRMKPRDPRRILHANSFQRSGSSGSEQFKT 733

Query: 1969 KGVPSSIIQASKDSLTVRQHAEXXXXXXXXXXXXXXXAKQNNIPGTLSDSQSATSLTTVA 1790
                    Q  +D    +                    K  NI   +S SQ+++   T  
Sbjct: 734  NA------QKQEDQTETKSVPSHSVNPPDISQQFTKNLK--NIADLMSASQASSMTPTFP 785

Query: 1789 QTVSSQPIPRKIYNADVRVVATDSNNQESWTNSTPKEGAVVSFQSRNAWEDVEHLFEGFD 1610
            Q +SSQ +       DV+   +DS +Q +   S P E A    QS+N W DVEHLF+G+D
Sbjct: 786  QILSSQSVQVNTDRMDVKATVSDSGDQLTANGSKP-ESAAGPPQSKNTWGDVEHLFDGYD 844

Query: 1609 DQQKATIRRERARRKEEQNKLFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQ 1430
            DQQKA I+RERARR EEQ K+F+ARK           LNSAKFVEVDPVH+ ILRKKEEQ
Sbjct: 845  DQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQ 904

Query: 1429 DRQKPQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTG 1250
            DR+K QRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYA EMAKVLDP G
Sbjct: 905  DREKSQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKG 964

Query: 1249 SLFGERVISRGDDEDPLDGDERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALE 1070
             LF  RVIS+GDD D LDGDERVPK KDL+GVLGMES+VVIIDDS RVWPHNKLNLI +E
Sbjct: 965  VLFAGRVISKGDDGDVLDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVE 1024

Query: 1069 RYIYFPCSRRHFGLSGPSLLEIGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRN 890
            RY YFPCSRR FGL GPSLLEI HDER +DG LASSLAVIERIH +FF +++L++VDVRN
Sbjct: 1025 RYTYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASSLAVIERIHQSFFSNRALDEVDVRN 1084

Query: 889  ILAKEQKNVLAGCRIVFSRVFPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSL 710
            ILA EQ+ +LAGCRIVFSRVFPVGEA PHLHPLWQTA+ FGAVCTNQIDE VTHVVANSL
Sbjct: 1085 ILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAESFGAVCTNQIDEQVTHVVANSL 1144

Query: 709  GTDKVNWALSTGRFVVHPCWVEASALLYRRANEFDFAIK 593
            GTDKVNWALSTGRFVVHP WVEASALLYRRANE DFAIK
Sbjct: 1145 GTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1183


>ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Vitis vinifera]
          Length = 1238

 Score =  646 bits (1666), Expect = 0.0
 Identities = 340/519 (65%), Positives = 385/519 (74%)
 Frame = -2

Query: 2149 QKAAQSMTLPSISSVAIGTVPLDLRGDLGKTRMKPRDPRRVLLNNTFKQNGCIGTEQLKT 1970
            QK A ++ +P        T P+D   + GK RMKPRDPRR+L  N+F+++G  G+EQ KT
Sbjct: 738  QKPAGALQVPQ-------TGPMD---ESGKVRMKPRDPRRILHANSFQRSGSSGSEQFKT 787

Query: 1969 KGVPSSIIQASKDSLTVRQHAEXXXXXXXXXXXXXXXAKQNNIPGTLSDSQSATSLTTVA 1790
                    Q  +D    +                    K  NI   +S SQ+++   T  
Sbjct: 788  NA------QKQEDQTETKSVPSHSVNPPDISQQFTKNLK--NIADLMSASQASSMTPTFP 839

Query: 1789 QTVSSQPIPRKIYNADVRVVATDSNNQESWTNSTPKEGAVVSFQSRNAWEDVEHLFEGFD 1610
            Q +SSQ +       DV+   +DS +Q +   S P E A    QS+N W DVEHLF+G+D
Sbjct: 840  QILSSQSVQVNTDRMDVKATVSDSGDQLTANGSKP-ESAAGPPQSKNTWGDVEHLFDGYD 898

Query: 1609 DQQKATIRRERARRKEEQNKLFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQ 1430
            DQQKA I+RERARR EEQ K+F+ARK           LNSAKFVEVDPVH+ ILRKKEEQ
Sbjct: 899  DQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQ 958

Query: 1429 DRQKPQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTG 1250
            DR+K QRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYA EMAKVLDP G
Sbjct: 959  DREKSQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKG 1018

Query: 1249 SLFGERVISRGDDEDPLDGDERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALE 1070
             LF  RVIS+GDD D LDGDERVPK KDL+GVLGMES+VVIIDDS RVWPHNKLNLI +E
Sbjct: 1019 VLFAGRVISKGDDGDVLDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVE 1078

Query: 1069 RYIYFPCSRRHFGLSGPSLLEIGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRN 890
            RY YFPCSRR FGL GPSLLEI HDER +DG LASSLAVIERIH +FF +++L++VDVRN
Sbjct: 1079 RYTYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASSLAVIERIHQSFFSNRALDEVDVRN 1138

Query: 889  ILAKEQKNVLAGCRIVFSRVFPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSL 710
            ILA EQ+ +LAGCRIVFSRVFPVGEA PHLHPLWQTA+ FGAVCTNQIDE VTHVVANSL
Sbjct: 1139 ILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAESFGAVCTNQIDEQVTHVVANSL 1198

Query: 709  GTDKVNWALSTGRFVVHPCWVEASALLYRRANEFDFAIK 593
            GTDKVNWALSTGRFVVHP WVEASALLYRRANE DFAIK
Sbjct: 1199 GTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1237


>ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
            gi|223548611|gb|EEF50102.1| RNA polymerase II ctd
            phosphatase, putative [Ricinus communis]
          Length = 1195

 Score =  646 bits (1666), Expect = 0.0
 Identities = 350/556 (62%), Positives = 392/556 (70%), Gaps = 34/556 (6%)
 Frame = -2

Query: 2158 DTQQKA---AQSMTLPSISSVAIGTVP----------------------LDLRGDLGKTR 2054
            + QQK    A+S T P  S+  +GTVP                      L    DLGK R
Sbjct: 649  EAQQKPVDPAKSTTYPLNSNSMLGTVPVVGAAHSGILPRPAGTVQVSPQLGTADDLGKIR 708

Query: 2053 MKPRDPRRVLLNNTFKQNGCIGTEQLKTKGVPSSIIQASKDSLTV-RQHAEXXXXXXXXX 1877
            MKPRDPRRVL NN  ++NG +G+E LKT      I Q +KD+  + +Q  +         
Sbjct: 709  MKPRDPRRVLHNNALQRNGSMGSEHLKTNLTSIPINQETKDNQNLQKQEGQVEKKPVPLQ 768

Query: 1876 XXXXXXAKQ------NNIPGTLSDSQSATSLTTVAQTVSSQPIPRKIYNADVRVVATDSN 1715
                            NI   +S S ++TS   V Q  +SQP+   I ++D         
Sbjct: 769  SLALPDISMPFTKNLKNIADIVSVSHASTSQPLVPQNPASQPMRTTISSSD--------- 819

Query: 1714 NQESWTNSTPKEGAVVSF--QSRNAWEDVEHLFEGFDDQQKATIRRERARRKEEQNKLFA 1541
             Q     S P   A  +   +++NAW DVEHLFEG++DQQKA I+RERARR EEQ KLF+
Sbjct: 820  -QFLGIGSAPGAAAAAAAGPRTQNAWGDVEHLFEGYNDQQKAAIQRERARRIEEQKKLFS 878

Query: 1540 ARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRFPHMGMWTKLRP 1361
            ARK           LNSAKFVEVDPVH+ ILRKKEEQDR+K  RHLFRFPHMGMWTKLRP
Sbjct: 879  ARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAHRHLFRFPHMGMWTKLRP 938

Query: 1360 GIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRGDDEDPLDGDERV 1181
            GIWNFLEKASKLYELHLYTMGNKLYA EMAKVLDPTG LF  RVISRGDD +P DGDER+
Sbjct: 939  GIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGVLFNGRVISRGDDGEPFDGDERI 998

Query: 1180 PKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRHFGLSGPSLLEIG 1001
            PK KDL+GVLGMES VVI+DDS RVWPHNKLNLI +ERYIYFPCSRR FGL GPSLLEI 
Sbjct: 999  PKSKDLEGVLGMESGVVIMDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEID 1058

Query: 1000 HDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLAGCRIVFSRVFPV 821
            HDER +DG LA SLAVIERIH NFF H SL++ DVRNILA EQ+ +LAGCRIVFSRVFPV
Sbjct: 1059 HDERPEDGTLACSLAVIERIHQNFFTHPSLDEADVRNILASEQRKILAGCRIVFSRVFPV 1118

Query: 820  GEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGRFVVHPCWVEA 641
            GEA PHLHPLWQTA+QFGAVCTNQIDE VTHVVANSLGTDKVNWALSTGRFVV+P WVEA
Sbjct: 1119 GEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVYPGWVEA 1178

Query: 640  SALLYRRANEFDFAIK 593
            SALLYRRANE DFAIK
Sbjct: 1179 SALLYRRANEQDFAIK 1194


>ref|XP_003530482.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Glycine max]
          Length = 1261

 Score =  636 bits (1640), Expect = e-179
 Identities = 337/522 (64%), Positives = 386/522 (73%), Gaps = 8/522 (1%)
 Frame = -2

Query: 2134 SMTLPSISSVAIGTVPLDLRGDLGKTRMKPRDPRRVL-LNNTFKQNGCIGTEQLKTKGVP 1958
            S+ +  +SS +  T    L+ D GK RMKPRDPRR+L  NNT +++G +G EQ K    P
Sbjct: 740  SVGMLPVSSQSTSTAQT-LQDDSGKIRMKPRDPRRILHTNNTIQKSGDLGNEQFKAIVSP 798

Query: 1957 SSIIQASKDSLTV-----RQHAEXXXXXXXXXXXXXXXAKQN--NIPGTLSDSQSATSLT 1799
             S  Q + D++       R   +                 +N  NI   +S SQ +++ T
Sbjct: 799  VSNNQRTGDNVNAPKLEGRVDNKLVPTQSSAQPDIARQFTRNLKNIADIMSVSQESSTHT 858

Query: 1798 TVAQTVSSQPIPRKIYNADVRVVATDSNNQESWTNSTPKEGAVVSFQSRNAWEDVEHLFE 1619
             V+Q  SS  +P      + + V + S N ++   S  +  A V+ +S++ W DVEHLFE
Sbjct: 859  PVSQNFSSASVPLTSDRGEQKSVVSSSQNLQADMASAHETAASVTSRSQSTWGDVEHLFE 918

Query: 1618 GFDDQQKATIRRERARRKEEQNKLFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKK 1439
            G+D+QQKA I+RERARR EEQNK+FAARK           LNSAKFVEVDP+H+ ILRKK
Sbjct: 919  GYDEQQKAAIQRERARRIEEQNKMFAARKLCLVLDLDHTLLNSAKFVEVDPLHDEILRKK 978

Query: 1438 EEQDRQKPQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLD 1259
            EEQDR+KP RHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYA EMAKVLD
Sbjct: 979  EEQDREKPHRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLD 1038

Query: 1258 PTGSLFGERVISRGDDEDPLDGDERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLI 1079
            P G LF  RVISRGDD D +DG+ERVPK KDL+GVLGMESSVVIIDDS RVWPHNKLNLI
Sbjct: 1039 PKGVLFAGRVISRGDDTDSVDGEERVPKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLI 1098

Query: 1078 ALERYIYFPCSRRHFGLSGPSLLEIGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVD 899
             +ERY YFPCSRR FGL GPSLLEI HDER + G LASSLAVIE+IH  FF  QSL +VD
Sbjct: 1099 VVERYTYFPCSRRQFGLPGPSLLEIDHDERPEAGTLASSLAVIEKIHQIFFASQSLEEVD 1158

Query: 898  VRNILAKEQKNVLAGCRIVFSRVFPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVA 719
            VRNILA EQ+ +LAGCRIVFSRVFPVGEA PHLHPLWQTA+QFGAVCTNQIDE VTHVVA
Sbjct: 1159 VRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVA 1218

Query: 718  NSLGTDKVNWALSTGRFVVHPCWVEASALLYRRANEFDFAIK 593
            NS GTDKVNWAL+ GRFVVHP WVEASALLYRRANE DFAIK
Sbjct: 1219 NSPGTDKVNWALNNGRFVVHPGWVEASALLYRRANEQDFAIK 1260


>ref|XP_006438860.1| hypothetical protein CICLE_v10030535mg [Citrus clementina]
            gi|568858958|ref|XP_006483010.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 3-like
            [Citrus sinensis] gi|557541056|gb|ESR52100.1|
            hypothetical protein CICLE_v10030535mg [Citrus
            clementina]
          Length = 1234

 Score =  635 bits (1639), Expect = e-179
 Identities = 343/545 (62%), Positives = 397/545 (72%), Gaps = 22/545 (4%)
 Frame = -2

Query: 2161 ADTQQKAAQSM----------TLPSIS---SVAIGTV--PLDLRGDLGKTRMKPRDPRRV 2027
            AD QQK+  S           ++P +S   S+  G +  P+D   +LGK RMKPRDPRRV
Sbjct: 695  ADAQQKSNDSSMNTMHPPIPSSIPPVSVTCSIPSGILSKPMD---ELGKVRMKPRDPRRV 751

Query: 2026 LLNNTFKQNGCIGTEQLKTKGVPSSIIQASKDSLTVRQH-----AEXXXXXXXXXXXXXX 1862
            L  N  +++G +G E  KT G  +   Q SK++L  ++      A+              
Sbjct: 752  LHGNALQRSGSLGPE-FKTDGPSAPCTQGSKENLNFQKQLGAPEAKPVLSQSVLQPDITQ 810

Query: 1861 XAKQN--NIPGTLSDSQSATSLTTVAQTVSSQPIPRKIYNADVRVVATDSNNQESWTNST 1688
               +N  +I   +S SQ  TS   V+Q    QP   K   AD++ V T+ +++++ T S 
Sbjct: 811  QFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKS-GADMKAVVTNHDDKQTGTGSG 869

Query: 1687 PKEGAVVSFQSRNAWEDVEHLFEGFDDQQKATIRRERARRKEEQNKLFAARKXXXXXXXX 1508
            P+ G V +   ++AW DVEHLFEG+DDQQKA I++ER RR EEQ K+F+ARK        
Sbjct: 870  PEAGPVGA-HPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLD 928

Query: 1507 XXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRFPHMGMWTKLRPGIWNFLEKASK 1328
               LNSAKF EVDPVH+ ILRKKEEQDR+KP RHLFRFPHMGMWTKLRPGIW FLE+ASK
Sbjct: 929  HTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASK 988

Query: 1327 LYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRGDDEDPLDGDERVPKIKDLDGVLG 1148
            L+E+HLYTMGNKLYA EMAKVLDP G LF  RVISRGDD DP DGDERVPK KDL+GVLG
Sbjct: 989  LFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLG 1048

Query: 1147 MESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRHFGLSGPSLLEIGHDERLDDGALA 968
            MES+VVIIDDS RVWPHNKLNLI +ERY YFPCSRR FGL GPSLLEI HDER +DG LA
Sbjct: 1049 MESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLA 1108

Query: 967  SSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLAGCRIVFSRVFPVGEAKPHLHPLW 788
            SSL VIER+H  FF HQSL+DVDVRNILA EQ+ +LAGCRIVFSRVFPVGEA PHLHPLW
Sbjct: 1109 SSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLW 1168

Query: 787  QTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGRFVVHPCWVEASALLYRRANEF 608
            QTA+QFGAVCT  ID+ VTHVVANSLGTDKVNWALSTGRFVVHP WVEASALLYRRANE 
Sbjct: 1169 QTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQ 1228

Query: 607  DFAIK 593
            DFAIK
Sbjct: 1229 DFAIK 1233


>ref|XP_007225412.1| hypothetical protein PRUPE_ppa000589mg [Prunus persica]
            gi|462422348|gb|EMJ26611.1| hypothetical protein
            PRUPE_ppa000589mg [Prunus persica]
          Length = 1085

 Score =  635 bits (1639), Expect = e-179
 Identities = 331/508 (65%), Positives = 377/508 (74%), Gaps = 6/508 (1%)
 Frame = -2

Query: 2098 GTVPLD----LRGDLGKTRMKPRDPRRVLLNNTFKQNGCIGTEQLKTKGVPSSIIQASKD 1931
            GT+P+     L  + GK RMKPRDPRR L  N  +++G +G EQ +    P S IQ +KD
Sbjct: 582  GTLPVSSQKALMDESGKVRMKPRDPRRALHGNALQKSGSLGQEQFRNIIPPLSAIQGNKD 641

Query: 1930 SLTVRQHAEXXXXXXXXXXXXXXXAKQN--NIPGTLSDSQSATSLTTVAQTVSSQPIPRK 1757
            +L  +   +                 +N  NI   +S S  +TS    +Q+VSSQ +P K
Sbjct: 642  NLNGQADKKLVTSQSLDAPDITRQFTKNLKNIADIMSVSNVSTSPAIASQSVSSQLVPIK 701

Query: 1756 IYNADVRVVATDSNNQESWTNSTPKEGAVVSFQSRNAWEDVEHLFEGFDDQQKATIRRER 1577
                D++        Q   + S  +  A    +S   W DVEHLFEG+DDQQKA I+RER
Sbjct: 702  PERIDLK-----PEEQRPESISASEAAAAGPSRSPVMWGDVEHLFEGYDDQQKAAIQRER 756

Query: 1576 ARRKEEQNKLFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFR 1397
             RR EEQ K+FAA K           LNSAKFVEVDPVH+ ILRKKEEQDR+KPQRHLFR
Sbjct: 757  TRRIEEQKKMFAAHKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPQRHLFR 816

Query: 1396 FPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRG 1217
            F HMGMWTKLRPGIWNFLEKAS+L+ELHLYTMGNKLYA EMAKVLDPTG+LF  RVISRG
Sbjct: 817  FHHMGMWTKLRPGIWNFLEKASQLFELHLYTMGNKLYATEMAKVLDPTGALFAGRVISRG 876

Query: 1216 DDEDPLDGDERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRH 1037
            DD DP DGDER+PK KDL+GVLGMES+VVIIDDS RVWPHNKLNLI +ERY YFPCSRR 
Sbjct: 877  DDGDPEDGDERIPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 936

Query: 1036 FGLSGPSLLEIGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLA 857
            FGL GPSLLEI HDER +DG LASSLAVIE+IH  FF H SL++ DVRNILA EQ+ +LA
Sbjct: 937  FGLLGPSLLEIDHDERQEDGTLASSLAVIEKIHQLFFSHSSLDEADVRNILASEQRKILA 996

Query: 856  GCRIVFSRVFPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALST 677
            GCRIVFSRVFPVGE KPHLHPLWQTA+QFGAVCTNQID+ VTHVVANSLGTDKVNWALS+
Sbjct: 997  GCRIVFSRVFPVGEVKPHLHPLWQTAEQFGAVCTNQIDDQVTHVVANSLGTDKVNWALSS 1056

Query: 676  GRFVVHPCWVEASALLYRRANEFDFAIK 593
            G++VVHP WVEASALLYRRANE DFAIK
Sbjct: 1057 GKYVVHPGWVEASALLYRRANEQDFAIK 1084


>ref|XP_006603006.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Glycine max]
          Length = 1257

 Score =  635 bits (1638), Expect = e-179
 Identities = 333/504 (66%), Positives = 377/504 (74%), Gaps = 8/504 (1%)
 Frame = -2

Query: 2080 LRGDLGKTRMKPRDPRRVL-LNNTFKQNGCIGTEQLKTKGVPSSIIQASKDSLTV----- 1919
            L+ D GK RMKPRDPRR+L  NNT +++G +G EQ K    P S  Q + D++       
Sbjct: 753  LQDDSGKIRMKPRDPRRILHTNNTIQKSGNLGNEQFKAIVSPVSNNQGTGDNVNAQKLEG 812

Query: 1918 RQHAEXXXXXXXXXXXXXXXAKQN--NIPGTLSDSQSATSLTTVAQTVSSQPIPRKIYNA 1745
            R  ++                 +N  NI   +S SQ +++ T VAQ  SS  +P      
Sbjct: 813  RVDSKLVPTQPSAQPDIARQFARNLKNIADIMSVSQESSTHTPVAQIFSSASVPLTSDRG 872

Query: 1744 DVRVVATDSNNQESWTNSTPKEGAVVSFQSRNAWEDVEHLFEGFDDQQKATIRRERARRK 1565
            + + V ++S N E+   S  +  A  + +S+N W DVEHLFEG+D+QQKA I+RERARR 
Sbjct: 873  EQKSVVSNSQNLEAGMVSAHETAASGTCRSQNTWGDVEHLFEGYDEQQKAAIQRERARRI 932

Query: 1564 EEQNKLFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRFPHM 1385
            EEQNK+FAARK           LNSAKFVEVDPVH+ ILRKKEEQDR+KP RHLFRFPHM
Sbjct: 933  EEQNKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 992

Query: 1384 GMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRGDDED 1205
            GMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYA EMAKVLDP G LF  RVISRGDD D
Sbjct: 993  GMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGLLFAGRVISRGDDTD 1052

Query: 1204 PLDGDERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRHFGLS 1025
             +DG+ER PK KDL+GVLGMESSVVIIDDS RVWPHNKLNLI +ERY YFPCSRR FGL 
Sbjct: 1053 SVDGEERAPKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLP 1112

Query: 1024 GPSLLEIGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLAGCRI 845
            GPSLLEI HDER + G LASSLAVIE+IH  FF  +SL +VDVRNILA EQ+ +LAGCRI
Sbjct: 1113 GPSLLEIDHDERPEAGTLASSLAVIEKIHQIFFASRSLEEVDVRNILASEQRKILAGCRI 1172

Query: 844  VFSRVFPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGRFV 665
            VFSRVFPVGEA PHLHPLWQTA+QFGA CTNQIDE VTHVVANS GTDKVNWAL+ GRFV
Sbjct: 1173 VFSRVFPVGEANPHLHPLWQTAEQFGAFCTNQIDEQVTHVVANSPGTDKVNWALNNGRFV 1232

Query: 664  VHPCWVEASALLYRRANEFDFAIK 593
            VHP WVEASALLYRRANE DFAIK
Sbjct: 1233 VHPGWVEASALLYRRANEQDFAIK 1256


>ref|XP_007139315.1| hypothetical protein PHAVU_008G019000g [Phaseolus vulgaris]
            gi|561012448|gb|ESW11309.1| hypothetical protein
            PHAVU_008G019000g [Phaseolus vulgaris]
          Length = 1272

 Score =  632 bits (1631), Expect = e-178
 Identities = 331/499 (66%), Positives = 375/499 (75%), Gaps = 8/499 (1%)
 Frame = -2

Query: 2065 GKTRMKPRDPRRVL-LNNTFKQNGCIGTEQLKTKGVPSSIIQASKDSLTV-----RQHAE 1904
            GK RMKPRDPRR+L  NN+ +++G I  E  K    P S I  + DS+       R   +
Sbjct: 773  GKIRMKPRDPRRILHTNNSVQKSGNIVNELHKAIVSPVSNILVTGDSVNAQKLEGRMDTK 832

Query: 1903 XXXXXXXXXXXXXXXAKQN--NIPGTLSDSQSATSLTTVAQTVSSQPIPRKIYNADVRVV 1730
                             +N  NI   +S SQ +++ +  AQ  SS  +P  +   + + V
Sbjct: 833  LVPTQSGAAPDITRQFTRNLKNIADIMSVSQESSTHSPAAQGFSSASVPLNVDRGEQKSV 892

Query: 1729 ATDSNNQESWTNSTPKEGAVVSFQSRNAWEDVEHLFEGFDDQQKATIRRERARRKEEQNK 1550
             ++S N  + T S P+  A  + +S++ W DVEHLFEG+D+QQKA I+RERARR EEQNK
Sbjct: 893  LSNSQNLHAGTGSAPEICAPGTSRSQSTWGDVEHLFEGYDEQQKAAIQRERARRIEEQNK 952

Query: 1549 LFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRFPHMGMWTK 1370
            +FAARK           LNSAKFVEVDPVHE ILRKKEE DR+KP RHLFRFPHMGMWTK
Sbjct: 953  MFAARKLCLVLDLDHTLLNSAKFVEVDPVHEEILRKKEELDREKPHRHLFRFPHMGMWTK 1012

Query: 1369 LRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRGDDEDPLDGD 1190
            LRPGIWNFLEKASKLYELHLYTMGNKLYA EMAKVLDP G LF  RVISRGDD D +DG+
Sbjct: 1013 LRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDTDSVDGE 1072

Query: 1189 ERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRHFGLSGPSLL 1010
            ER PK KDL+GVLGMES+VVIIDDS RVWPHNKLNLI +ERY YFPCSRR FGL GPSLL
Sbjct: 1073 ERAPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLL 1132

Query: 1009 EIGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLAGCRIVFSRV 830
            EI HDER + G LASSLAVIER+H NFF  QSL +VDVRNILA EQ+ +L+GCRIVFSRV
Sbjct: 1133 EIDHDERPEAGTLASSLAVIERLHQNFFSSQSLEEVDVRNILASEQRKILSGCRIVFSRV 1192

Query: 829  FPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGRFVVHPCW 650
            FPVGEA PHLHPLWQTA+QFGAVCTNQID+ VTHVVANSLGTDKVNWALSTGRFVVHP W
Sbjct: 1193 FPVGEANPHLHPLWQTAEQFGAVCTNQIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGW 1252

Query: 649  VEASALLYRRANEFDFAIK 593
            VEASALLYRRANE DFAIK
Sbjct: 1253 VEASALLYRRANEQDFAIK 1271


>ref|XP_004492029.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like isoform X2 [Cicer arietinum]
          Length = 1227

 Score =  632 bits (1631), Expect = e-178
 Identities = 328/505 (64%), Positives = 379/505 (75%), Gaps = 9/505 (1%)
 Frame = -2

Query: 2071 DLGKTRMKPRDPRRVLL-NNTFKQNGCIGTEQLKTKGVPSSIIQAS-----KDSLTVRQH 1910
            D GK RMKPRDPRR+L  +++ +++G  G+EQ K+   P+S  Q +        L VR  
Sbjct: 723  DSGKIRMKPRDPRRILHGSSSLQKSGSTGSEQSKSVVSPTSNNQGNGGNVNAQKLDVRVE 782

Query: 1909 AEXXXXXXXXXXXXXXXAKQN--NIPGTLSDSQS-ATSLTTVAQTVSSQPIPRKIYNADV 1739
             +                 +N  NI   +S SQ  +T L    Q VSS  +P  +  A++
Sbjct: 783  TKLAPTQSSAQPDITRQFTKNLKNIADIMSVSQEPSTQLPATTQNVSSASVPFTLDKAEL 842

Query: 1738 RVVATDSNNQESWTNSTPKEGAVVSFQSRNAWEDVEHLFEGFDDQQKATIRRERARRKEE 1559
            +    +S N +    S P+  A  S +S++ W DVEHLFEG+D++QKA I+RERARR EE
Sbjct: 843  KSGVPNSQNLQDGVGSAPETCAPGSSRSQSTWADVEHLFEGYDEKQKAAIQRERARRLEE 902

Query: 1558 QNKLFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRFPHMGM 1379
            QNK+FA++K           LNSAKFVEVDPVH+ ILRKKEEQDR+KP RHLFRFPHMGM
Sbjct: 903  QNKMFASKKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 962

Query: 1378 WTKLRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRGDDEDPL 1199
            WTKLRPG+WNFLEKASKLYELHLYTMGNKLYA EMAKVLDP G LF  RVISRGDD + +
Sbjct: 963  WTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDTESV 1022

Query: 1198 DGDERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRHFGLSGP 1019
            DGDER PK KDL+GV+GMESSVVI+DDS RVWPHNKLNLI +ERY YFPCSRR FGL GP
Sbjct: 1023 DGDERAPKSKDLEGVMGMESSVVIVDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGP 1082

Query: 1018 SLLEIGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLAGCRIVF 839
            SLLEI HDER + G LASSLAVIERIH NFF  QSL +VDVRNILA EQ+ +LAGCRIVF
Sbjct: 1083 SLLEIDHDERPEAGTLASSLAVIERIHQNFFASQSLEEVDVRNILASEQRKILAGCRIVF 1142

Query: 838  SRVFPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGRFVVH 659
            SRVFPVGEA PHLHPLWQTA+QFGAVC NQID+ VTHVVANSLGTDKVNWA+STGRFVVH
Sbjct: 1143 SRVFPVGEANPHLHPLWQTAEQFGAVCINQIDDQVTHVVANSLGTDKVNWAISTGRFVVH 1202

Query: 658  PCWVEASALLYRRANEFDFAIKTKQ 584
            P WVEASALLYRRANE DFAIK ++
Sbjct: 1203 PGWVEASALLYRRANEQDFAIKPEK 1227


>ref|XP_004492028.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like isoform X1 [Cicer arietinum]
          Length = 1247

 Score =  632 bits (1631), Expect = e-178
 Identities = 328/505 (64%), Positives = 379/505 (75%), Gaps = 9/505 (1%)
 Frame = -2

Query: 2071 DLGKTRMKPRDPRRVLL-NNTFKQNGCIGTEQLKTKGVPSSIIQAS-----KDSLTVRQH 1910
            D GK RMKPRDPRR+L  +++ +++G  G+EQ K+   P+S  Q +        L VR  
Sbjct: 743  DSGKIRMKPRDPRRILHGSSSLQKSGSTGSEQSKSVVSPTSNNQGNGGNVNAQKLDVRVE 802

Query: 1909 AEXXXXXXXXXXXXXXXAKQN--NIPGTLSDSQS-ATSLTTVAQTVSSQPIPRKIYNADV 1739
             +                 +N  NI   +S SQ  +T L    Q VSS  +P  +  A++
Sbjct: 803  TKLAPTQSSAQPDITRQFTKNLKNIADIMSVSQEPSTQLPATTQNVSSASVPFTLDKAEL 862

Query: 1738 RVVATDSNNQESWTNSTPKEGAVVSFQSRNAWEDVEHLFEGFDDQQKATIRRERARRKEE 1559
            +    +S N +    S P+  A  S +S++ W DVEHLFEG+D++QKA I+RERARR EE
Sbjct: 863  KSGVPNSQNLQDGVGSAPETCAPGSSRSQSTWADVEHLFEGYDEKQKAAIQRERARRLEE 922

Query: 1558 QNKLFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRFPHMGM 1379
            QNK+FA++K           LNSAKFVEVDPVH+ ILRKKEEQDR+KP RHLFRFPHMGM
Sbjct: 923  QNKMFASKKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 982

Query: 1378 WTKLRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRGDDEDPL 1199
            WTKLRPG+WNFLEKASKLYELHLYTMGNKLYA EMAKVLDP G LF  RVISRGDD + +
Sbjct: 983  WTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDTESV 1042

Query: 1198 DGDERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRHFGLSGP 1019
            DGDER PK KDL+GV+GMESSVVI+DDS RVWPHNKLNLI +ERY YFPCSRR FGL GP
Sbjct: 1043 DGDERAPKSKDLEGVMGMESSVVIVDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGP 1102

Query: 1018 SLLEIGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLAGCRIVF 839
            SLLEI HDER + G LASSLAVIERIH NFF  QSL +VDVRNILA EQ+ +LAGCRIVF
Sbjct: 1103 SLLEIDHDERPEAGTLASSLAVIERIHQNFFASQSLEEVDVRNILASEQRKILAGCRIVF 1162

Query: 838  SRVFPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGRFVVH 659
            SRVFPVGEA PHLHPLWQTA+QFGAVC NQID+ VTHVVANSLGTDKVNWA+STGRFVVH
Sbjct: 1163 SRVFPVGEANPHLHPLWQTAEQFGAVCINQIDDQVTHVVANSLGTDKVNWAISTGRFVVH 1222

Query: 658  PCWVEASALLYRRANEFDFAIKTKQ 584
            P WVEASALLYRRANE DFAIK ++
Sbjct: 1223 PGWVEASALLYRRANEQDFAIKPEK 1247


>gb|EXB81217.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Morus
            notabilis]
          Length = 1301

 Score =  626 bits (1615), Expect = e-176
 Identities = 324/502 (64%), Positives = 373/502 (74%), Gaps = 7/502 (1%)
 Frame = -2

Query: 2134 SMTLPSISSVAIGTVPLDLRGDLGKTRMKPRDPRRVLLNNTFKQNGCIGTEQLKTKGVPS 1955
            +++LP+ S VA  +    ++ +LGK RMKPRDPRRVL  N  +++  +G EQ K      
Sbjct: 768  AVSLPTTSQVATAS----MQDELGKIRMKPRDPRRVLHGNMLQKSWSLGHEQFKPIVSSV 823

Query: 1954 SIIQASKDSLT--VRQHAEXXXXXXXXXXXXXXXAKQ-----NNIPGTLSDSQSATSLTT 1796
            S    +KD+L   V++                  A+Q      NI   +S SQ++TS  T
Sbjct: 824  SCTPGNKDNLNGPVQEGQADKKQVPSQLVVQPDIARQFTKNLRNIADLMSVSQASTSPAT 883

Query: 1795 VAQTVSSQPIPRKIYNADVRVVATDSNNQESWTNSTPKEGAVVSFQSRNAWEDVEHLFEG 1616
            V+Q +SSQP+P K    DV+ V  +S +Q S TNSTP+    V  ++ NAW DVEHLFEG
Sbjct: 884  VSQNLSSQPLPVKPDRGDVKAVVPNSEDQHSGTNSTPETTLAVPSRTPNAWGDVEHLFEG 943

Query: 1615 FDDQQKATIRRERARRKEEQNKLFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKE 1436
            +DD+QKA I+RERARR EEQ K+F A K           LNSAKFVEVD VH+ ILRKKE
Sbjct: 944  YDDEQKAAIQRERARRLEEQKKMFDAHKLCLVLDLDHTLLNSAKFVEVDSVHDEILRKKE 1003

Query: 1435 EQDRQKPQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDP 1256
            EQDR+KPQRHLFRFPHMGMWTKLRPG+WNFLEKASKLYELHLYTMGNKLYA EMAKVLDP
Sbjct: 1004 EQDREKPQRHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDP 1063

Query: 1255 TGSLFGERVISRGDDEDPLDGDERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIA 1076
             G+LF  RVISRGDD DP DGDERVPK KDL+GVLGMESSVVIIDDS RVWPHNKLNLI 
Sbjct: 1064 MGTLFSGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIV 1123

Query: 1075 LERYIYFPCSRRHFGLSGPSLLEIGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDV 896
            +ERY YFPCSRR FGL GPSLLEI HDER + G LASSLAVIE+IH NFF H SL++VDV
Sbjct: 1124 VERYTYFPCSRRQFGLPGPSLLEIDHDERPEQGTLASSLAVIEKIHQNFFSHHSLDEVDV 1183

Query: 895  RNILAKEQKNVLAGCRIVFSRVFPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVAN 716
            RNILA EQ+ +LAGCRIVFSRVFPV E  PHLHPLWQTA+QFGAVCT QID+ VTHVVAN
Sbjct: 1184 RNILASEQRKILAGCRIVFSRVFPVSEVNPHLHPLWQTAEQFGAVCTTQIDDQVTHVVAN 1243

Query: 715  SLGTDKVNWALSTGRFVVHPCW 650
            S GTDKVNWAL+ G+F VHP W
Sbjct: 1244 SPGTDKVNWALANGKFAVHPGW 1265


>ref|XP_002297869.2| CTD phosphatase-like protein 3 [Populus trichocarpa]
            gi|550347145|gb|EEE82674.2| CTD phosphatase-like protein
            3 [Populus trichocarpa]
          Length = 1190

 Score =  623 bits (1606), Expect = e-175
 Identities = 330/547 (60%), Positives = 385/547 (70%), Gaps = 26/547 (4%)
 Frame = -2

Query: 2155 TQQKAAQSMTLPSISSVAIGTVP------------------------LDLRGDLGKTRMK 2048
            T    A+S + P IS+  +G +P                        +    + GK RMK
Sbjct: 646  TLSDPAKSTSHPPISNTVLGAIPTVNVASSQPSGIFPRPAGTPVPSQIATSDESGKIRMK 705

Query: 2047 PRDPRRVLLNNTFKQNGCIGTEQLKTKGVPSSIIQASKDSLTVRQHAEXXXXXXXXXXXX 1868
            PRDPRR L NN+ ++ G +G+EQ KT  +  +  Q +KD   V++               
Sbjct: 706  PRDPRRFLHNNSLQRAGSMGSEQFKTTTLTPTT-QGTKDDQNVQKQEGLAELKPTVPPDI 764

Query: 1867 XXXAKQN--NIPGTLSDSQSATSLTTVAQTVSSQPIPRKIYNADVRVVATDSNNQESWTN 1694
                 ++  NI   LS SQ++T+   ++Q V+SQP+  K    D +   + S+ Q++   
Sbjct: 765  SFPFTKSLENIADILSVSQASTTPPFISQNVASQPMQTKSERVDGKTGISISD-QKTGPA 823

Query: 1693 STPKEGAVVSFQSRNAWEDVEHLFEGFDDQQKATIRRERARRKEEQNKLFAARKXXXXXX 1514
            S+P E    S  S+N W+DVEHLFEG+DDQQKA I+RERARR EEQ K+FAARK      
Sbjct: 824  SSP-EVVAASSHSQNTWKDVEHLFEGYDDQQKAAIQRERARRLEEQKKMFAARKLCLVLD 882

Query: 1513 XXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRFPHMGMWTKLRPGIWNFLEKA 1334
                 LNSAK +    +H+ ILRKKEEQDR+KP RH+FR PHMGMWTKLRPGIWNFLEKA
Sbjct: 883  LDHTLLNSAKAILSSSLHDEILRKKEEQDREKPYRHIFRIPHMGMWTKLRPGIWNFLEKA 942

Query: 1333 SKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRGDDEDPLDGDERVPKIKDLDGV 1154
            SKL+ELHLYTMGNKLYA EMAKVLDP G LF  RVISRGDD DP DGDERVPK KDL+GV
Sbjct: 943  SKLFELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1002

Query: 1153 LGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRHFGLSGPSLLEIGHDERLDDGA 974
            LGMES VVIIDDS RVWPHNKLNLI +ERYIYFPCSRR FGL GPSLLEI HDER +DG 
Sbjct: 1003 LGMESGVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEDGT 1062

Query: 973  LASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLAGCRIVFSRVFPVGEAKPHLHP 794
            LA S AVIE+IH NFF H+SL++ DVRNILA EQ+ +L GCRI+FSRVFPVGE  PHLHP
Sbjct: 1063 LACSFAVIEKIHQNFFTHRSLDEADVRNILASEQRKILGGCRILFSRVFPVGEVNPHLHP 1122

Query: 793  LWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGRFVVHPCWVEASALLYRRAN 614
            LWQ A+QFGAVCTNQIDE VTHVVANSLGTDKVNWALSTGR VVHP WVEASALLYRRAN
Sbjct: 1123 LWQMAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRIVVHPGWVEASALLYRRAN 1182

Query: 613  EFDFAIK 593
            E DF+IK
Sbjct: 1183 EQDFSIK 1189


>gb|EYU42076.1| hypothetical protein MIMGU_mgv1a000356mg [Mimulus guttatus]
          Length = 1220

 Score =  622 bits (1603), Expect = e-175
 Identities = 329/517 (63%), Positives = 368/517 (71%), Gaps = 8/517 (1%)
 Frame = -2

Query: 2119 SISSVAIGTVPLDLRG----DLGKTRMKPRDPRRVLLNNTFKQNGCIGTEQLKTKGVPSS 1952
            +I  ++ GTV +  +     + GK RMKPRDPRRVL NN  +++     +Q K      S
Sbjct: 703  TIGQISAGTVQIPSQAVSVEESGKVRMKPRDPRRVLHNNAPQKDVTSVADQPKADASFGS 762

Query: 1951 IIQASKDSLTVRQHAEXXXXXXXXXXXXXXXAKQNNIPGTLSDSQSATSLTTVAQTVSSQ 1772
             +   K    + ++                     NI   LS SQ  T+   +AQ  S Q
Sbjct: 763  AMNTPKQEDQL-ENKMSSSSMKPPDITMQFTNNLRNIADLLSVSQICTTSPVLAQIPSLQ 821

Query: 1771 PIPRK-IYNADVRVVATDSNNQESWTNSTPKEGAVVSFQ---SRNAWEDVEHLFEGFDDQ 1604
            P     I   + R    +  N  + T+ T  E A  S     + NAW DVEHLFEGFDDQ
Sbjct: 822  PAQGDLIAGKETRGPIAEYGNIRNVTDITTSEAATSSPPRPLNANAWSDVEHLFEGFDDQ 881

Query: 1603 QKATIRRERARRKEEQNKLFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDR 1424
            QK  I+RERARR EEQNKLFA RK           LNSAKFVEVDP H+ +LRKKEEQDR
Sbjct: 882  QKVAIQRERARRLEEQNKLFAVRKLCLVLDLDHTLLNSAKFVEVDPQHDEMLRKKEEQDR 941

Query: 1423 QKPQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSL 1244
            +KP RHLFRFPHMGMWTKLRPG+WNFLEKASKLYELHLYTMGNK YA EMAK+LDP G L
Sbjct: 942  EKPHRHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKYYATEMAKLLDPKGEL 1001

Query: 1243 FGERVISRGDDEDPLDGDERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALERY 1064
            F  RVISRGDD +P D D+R PK KDL+GVLGMES VVIIDDS RVWPHNKLNLI +ERY
Sbjct: 1002 FSGRVISRGDDGEPFDSDDRAPKSKDLEGVLGMESGVVIIDDSIRVWPHNKLNLIVVERY 1061

Query: 1063 IYFPCSRRHFGLSGPSLLEIGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRNIL 884
            IYFPCSRR FGL GPSLLEI HDER +DG LAS   VIERIH NFF H+SLN+ DVRNIL
Sbjct: 1062 IYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASCSTVIERIHENFFGHESLNEADVRNIL 1121

Query: 883  AKEQKNVLAGCRIVFSRVFPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGT 704
            A EQ+ +LAGCRIVFSRVFPVGEAKPH+HPLWQTA+QFGAVC NQIDEHVTHVVANSLGT
Sbjct: 1122 ASEQRKILAGCRIVFSRVFPVGEAKPHMHPLWQTAEQFGAVCINQIDEHVTHVVANSLGT 1181

Query: 703  DKVNWALSTGRFVVHPCWVEASALLYRRANEFDFAIK 593
            DKVNWALSTG+FVVHP WVEASALLYRRANE DFAIK
Sbjct: 1182 DKVNWALSTGKFVVHPGWVEASALLYRRANEHDFAIK 1218


>ref|XP_006341905.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Solanum tuberosum]
          Length = 1218

 Score =  615 bits (1585), Expect = e-173
 Identities = 320/495 (64%), Positives = 370/495 (74%), Gaps = 6/495 (1%)
 Frame = -2

Query: 2056 RMKPRDPRRVLLNNTFKQNGCIGTEQLKT--KGVPSSI----IQASKDSLTVRQHAEXXX 1895
            RMKPRDPRRVL N    + G +G++Q KT   G  ++I     Q+ +D L  +       
Sbjct: 728  RMKPRDPRRVLHNTAVLKGGNVGSDQCKTGVAGTHATISNLGFQSQEDQLDRKSAVTLST 787

Query: 1894 XXXXXXXXXXXXAKQNNIPGTLSDSQSATSLTTVAQTVSSQPIPRKIYNADVRVVATDSN 1715
                         K  NI   +S S S TSL+  +QT  +Q +      ++ +   ++ +
Sbjct: 788  TPPDIARQFTKNLK--NIADMISVSPS-TSLSAASQT-QTQCLQSHQSRSEGKEAVSEPS 843

Query: 1714 NQESWTNSTPKEGAVVSFQSRNAWEDVEHLFEGFDDQQKATIRRERARRKEEQNKLFAAR 1535
             + +      ++G+  S Q + +W DVEHLFEG+ DQQ+A I+RERARR EEQ K+F+ R
Sbjct: 844  ERVNDAGLASEKGSPGSLQPQISWGDVEHLFEGYSDQQRADIQRERARRLEEQKKMFSVR 903

Query: 1534 KXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRFPHMGMWTKLRPGI 1355
            K           LNSAKFVE+DPVHE ILRKKEEQDR+KP RHLFRFPHMGMWTKLRPGI
Sbjct: 904  KLCLVLDLDHTLLNSAKFVEIDPVHEEILRKKEEQDREKPCRHLFRFPHMGMWTKLRPGI 963

Query: 1354 WNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRGDDEDPLDGDERVPK 1175
            WNFLEKAS L+ELHLYTMGNKLYA EMAK+LDP G LF  RVISRGDD DP DGDERVPK
Sbjct: 964  WNFLEKASNLFELHLYTMGNKLYATEMAKLLDPKGDLFAGRVISRGDDGDPFDGDERVPK 1023

Query: 1174 IKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRHFGLSGPSLLEIGHD 995
             KDL+GVLGMES+VVIIDDS RVWPHNKLNLI +ERYIYFPCSRR FGL GPSLLEI HD
Sbjct: 1024 SKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHD 1083

Query: 994  ERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLAGCRIVFSRVFPVGE 815
            ER +DG LAS L VI+RIH NFF H+S+++ DVRNILA EQK +LAGCRIVFSRVFPVGE
Sbjct: 1084 ERPEDGTLASCLGVIQRIHQNFFAHRSIDEADVRNILATEQKKILAGCRIVFSRVFPVGE 1143

Query: 814  AKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGRFVVHPCWVEASA 635
            A PHLHPLWQTA+QFGAVCT+QID+ VTHVVANSLGTDKVNWALSTGRFVVHP WVEASA
Sbjct: 1144 ANPHLHPLWQTAEQFGAVCTSQIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASA 1203

Query: 634  LLYRRANEFDFAIKT 590
            LLYRRANE DFAIK+
Sbjct: 1204 LLYRRANEHDFAIKS 1218


>ref|XP_004140651.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Cucumis sativus]
          Length = 1249

 Score =  612 bits (1579), Expect = e-172
 Identities = 323/515 (62%), Positives = 370/515 (71%)
 Frame = -2

Query: 2137 QSMTLPSISSVAIGTVPLDLRGDLGKTRMKPRDPRRVLLNNTFKQNGCIGTEQLKTKGVP 1958
            QS   PS S V      +  + DLGK RMKPRDPRRVL  N+ ++ G +G +QLK     
Sbjct: 750  QSAGTPSASPV------VGRQDDLGKVRMKPRDPRRVLHGNSLQKVGSLGNDQLKGVVPT 803

Query: 1957 SSIIQASKDSLTVRQHAEXXXXXXXXXXXXXXXAKQNNIPGTLSDSQSATSLTTVAQTVS 1778
            +S  + S+D     +                  + Q  +P      Q   +L  +A  +S
Sbjct: 804  ASNTEGSRDIPNGHKQ--------EGQGDSKLASSQTILPDI--GRQFTNNLKNIADIMS 853

Query: 1777 SQPIPRKIYNADVRVVATDSNNQESWTNSTPKEGAVVSFQSRNAWEDVEHLFEGFDDQQK 1598
                P    N+  + V + S + +  T +        S +S+ AW D+EHLF+ +DD+QK
Sbjct: 854  VPSPPTSSPNSSSKPVGSSSMDSKPVTTAFQAVDMAASSRSQGAWGDLEHLFDSYDDKQK 913

Query: 1597 ATIRRERARRKEEQNKLFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQK 1418
            A I+RERARR EEQ K+FAARK           LNSAKFVEVDPVH+ ILRKKEEQDR+K
Sbjct: 914  AAIQRERARRIEEQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREK 973

Query: 1417 PQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFG 1238
             QRHLFRFPHMGMWTKLRPG+WNFLEKAS+LYELHLYTMGNKLYA EMAKVLDP G LF 
Sbjct: 974  AQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPKGVLFA 1033

Query: 1237 ERVISRGDDEDPLDGDERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALERYIY 1058
             RVISRGDD DPLDGD+RVPK KDL+GVLGMES VVIIDDS RVWPHNK+NLI +ERY Y
Sbjct: 1034 GRVISRGDDGDPLDGDDRVPKSKDLEGVLGMESGVVIIDDSIRVWPHNKMNLIVVERYTY 1093

Query: 1057 FPCSRRHFGLSGPSLLEIGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRNILAK 878
            FPCSRR FGL GPSLLEI HDER +DG LASSL VI+RIH +FF +  L+ VDVR IL+ 
Sbjct: 1094 FPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLGVIQRIHQSFFSNPELDQVDVRTILSA 1153

Query: 877  EQKNVLAGCRIVFSRVFPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDK 698
            EQ+ +LAGCRIVFSRVFPVGEA PHLHPLWQTA+QFGA CTNQIDE VTHVVANSLGTDK
Sbjct: 1154 EQQKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAQCTNQIDEQVTHVVANSLGTDK 1213

Query: 697  VNWALSTGRFVVHPCWVEASALLYRRANEFDFAIK 593
            VNWALSTGRFVVHP WVEASALLYRRA E DFAIK
Sbjct: 1214 VNWALSTGRFVVHPGWVEASALLYRRATEQDFAIK 1248


>ref|XP_004310239.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Fragaria vesca subsp. vesca]
          Length = 1230

 Score =  612 bits (1578), Expect = e-172
 Identities = 332/559 (59%), Positives = 384/559 (68%), Gaps = 34/559 (6%)
 Frame = -2

Query: 2164 VADTQQKAA-----QSMTLPSISSVAIGTV-----PLDLRGDL---------------GK 2060
            +A+ QQ AA     +S+T P  SS   GT      P    G L               GK
Sbjct: 677  LAEAQQNAAAPARKESLTYPPSSSSIPGTAALVNDPSKTSGALLTPTICSQKTPTDEAGK 736

Query: 2059 TRMKPRDPRRVLLNNTFKQNGCIGTEQLKTKGVPSSIIQASKDSLTVRQH---------A 1907
             RMK RDPRR+L  N  + +G +G EQ +    P S  QA+ D +  ++           
Sbjct: 737  IRMKLRDPRRLLHGNALQNSGSVGHEQSRNIVPPLSSSQANNDDMNGKKQDSQADNNSVT 796

Query: 1906 EXXXXXXXXXXXXXXXAKQNNIPGTLSDSQSATSLTTVAQTVSSQPIPRKIYNADVRVVA 1727
                                NI   +S SQ +TS  T +Q +S++ I     N D++   
Sbjct: 797  SQSGALGAPDIASQFTKNLKNIADIISVSQVSTSPATPSQNLSTELISINPDNVDLK--- 853

Query: 1726 TDSNNQESWTNSTPKEGAVVSFQSRNAWEDVEHLFEGFDDQQKATIRRERARRKEEQNKL 1547
              +  Q + + S     A  + +S   W DVEHLFEG+DD+QKA I+RERARR EEQ K+
Sbjct: 854  --AEEQHTGSISASVPTAAGASRSPATWGDVEHLFEGYDDKQKAAIQRERARRIEEQKKM 911

Query: 1546 FAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRFPHMGMWTKL 1367
            FAA K           LNSAKFVEVDPVH+ ILRKKEEQDR++PQRHLFRF HMGMWTKL
Sbjct: 912  FAAHKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDRKEPQRHLFRFQHMGMWTKL 971

Query: 1366 RPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRGDDEDPLDGDE 1187
            RPG+W FLEKAS L+E+HLYTMGNKLYA EMAKVLDPTG+LF  RVISRGDD DP DGDE
Sbjct: 972  RPGVWKFLEKASHLFEMHLYTMGNKLYATEMAKVLDPTGALFAGRVISRGDDGDPYDGDE 1031

Query: 1186 RVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRHFGLSGPSLLE 1007
            RVPK KDL+GVLGMES+VVIIDDS RVWPHNKLNLI +ERY YFPCSRR FGL GPSLLE
Sbjct: 1032 RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLE 1091

Query: 1006 IGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLAGCRIVFSRVF 827
            I HDER +DG LASSLAVIE+IH  FF H SL++ DVRNILA EQ+ +L GCRIVFSRVF
Sbjct: 1092 IDHDERHEDGTLASSLAVIEKIHQIFFSHPSLDEADVRNILASEQQKILGGCRIVFSRVF 1151

Query: 826  PVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGRFVVHPCWV 647
            PVGE  PHLHPLWQTA+QFGAVCTNQID+ VTHVVANSLGTDKVNWALS+G++VVHP WV
Sbjct: 1152 PVGEVNPHLHPLWQTAEQFGAVCTNQIDDQVTHVVANSLGTDKVNWALSSGKYVVHPGWV 1211

Query: 646  EASALLYRRANEFDFAIKT 590
            EASALLYRRANE DFAIK+
Sbjct: 1212 EASALLYRRANEQDFAIKS 1230


>ref|XP_004157633.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain
            phosphatase-like 3-like [Cucumis sativus]
          Length = 1249

 Score =  612 bits (1577), Expect = e-172
 Identities = 323/515 (62%), Positives = 369/515 (71%)
 Frame = -2

Query: 2137 QSMTLPSISSVAIGTVPLDLRGDLGKTRMKPRDPRRVLLNNTFKQNGCIGTEQLKTKGVP 1958
            QS   PS S V      +  + DLGK RMKPRDPRRVL  N+ ++ G +G +QLK     
Sbjct: 750  QSAGTPSASPV------VGRQDDLGKVRMKPRDPRRVLHGNSLQKVGSLGNDQLKGVVPT 803

Query: 1957 SSIIQASKDSLTVRQHAEXXXXXXXXXXXXXXXAKQNNIPGTLSDSQSATSLTTVAQTVS 1778
            +S  + S+D     +                  + Q  +P      Q   +L  +A  +S
Sbjct: 804  ASNTEGSRDIPNGHKQ--------EGQGDSKLASSQTILPDI--GRQFTNNLKNIADIMS 853

Query: 1777 SQPIPRKIYNADVRVVATDSNNQESWTNSTPKEGAVVSFQSRNAWEDVEHLFEGFDDQQK 1598
                P    N+  + V + S + +  T +        S +S+ AW D+EHLF+ +DD+QK
Sbjct: 854  VPSPPTSSPNSSSKPVGSSSMDSKPVTTAFQAVDMAASSRSQGAWGDLEHLFDSYDDKQK 913

Query: 1597 ATIRRERARRKEEQNKLFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQK 1418
            A I+RERARR EEQ K+FAARK           LNSAKFVEVDPVH+ ILRKKEEQDR+K
Sbjct: 914  AAIQRERARRIEEQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREK 973

Query: 1417 PQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFG 1238
             QRHLFRFPHMGMWTKLRPG+WNFLEKAS+LYELHLYTMGNKLYA EMAKVLDP G LF 
Sbjct: 974  AQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPKGVLFA 1033

Query: 1237 ERVISRGDDEDPLDGDERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALERYIY 1058
             RVISRGDD DPLDGD+RVPK KDL+GVLGMES VVIIDDS RVWPHNK+NLI +ERY Y
Sbjct: 1034 GRVISRGDDGDPLDGDDRVPKSKDLEGVLGMESGVVIIDDSIRVWPHNKMNLIVVERYTY 1093

Query: 1057 FPCSRRHFGLSGPSLLEIGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRNILAK 878
            FPCSRR FGL GPSLLEI HDER +DG LASSL VI+RIH  FF +  L+ VDVR IL+ 
Sbjct: 1094 FPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLGVIQRIHQXFFSNPELDQVDVRTILSA 1153

Query: 877  EQKNVLAGCRIVFSRVFPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDK 698
            EQ+ +LAGCRIVFSRVFPVGEA PHLHPLWQTA+QFGA CTNQIDE VTHVVANSLGTDK
Sbjct: 1154 EQQKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAQCTNQIDEQVTHVVANSLGTDK 1213

Query: 697  VNWALSTGRFVVHPCWVEASALLYRRANEFDFAIK 593
            VNWALSTGRFVVHP WVEASALLYRRA E DFAIK
Sbjct: 1214 VNWALSTGRFVVHPGWVEASALLYRRATEQDFAIK 1248

