BLASTX nr result

ID: Achyranthes22_contig00016452 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes22_contig00016452
         (2805 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOY28304.1| C-terminal domain phosphatase-like 1 isoform 3 [T...   847   0.0  
gb|EOY28303.1| C-terminal domain phosphatase-like 1 isoform 2 [T...   847   0.0  
gb|EOY28302.1| C-terminal domain phosphatase-like 1 isoform 1 [T...   847   0.0  
gb|EMJ15747.1| hypothetical protein PRUPE_ppa000988mg [Prunus pe...   846   0.0  
ref|XP_002267987.2| PREDICTED: RNA polymerase II C-terminal doma...   846   0.0  
emb|CBI35690.3| unnamed protein product [Vitis vinifera]              846   0.0  
ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal doma...   844   0.0  
ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal doma...   835   0.0  
ref|XP_006583810.1| PREDICTED: RNA polymerase II C-terminal doma...   834   0.0  
ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Popu...   834   0.0  
ref|XP_002519032.1| double-stranded RNA binding protein, putativ...   832   0.0  
ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal doma...   830   0.0  
ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal doma...   830   0.0  
ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citr...   825   0.0  
gb|ESW31299.1| hypothetical protein PHAVU_002G226900g [Phaseolus...   822   0.0  
ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Popu...   818   0.0  
ref|XP_004505032.1| PREDICTED: RNA polymerase II C-terminal doma...   807   0.0  
emb|CAN72816.1| hypothetical protein VITISV_004100 [Vitis vinifera]   806   0.0  
ref|XP_004134718.1| PREDICTED: RNA polymerase II C-terminal doma...   805   0.0  
ref|XP_006597421.1| PREDICTED: RNA polymerase II C-terminal doma...   797   0.0  

>gb|EOY28304.1| C-terminal domain phosphatase-like 1 isoform 3 [Theobroma cacao]
          Length = 870

 Score =  847 bits (2187), Expect = 0.0
 Identities = 459/748 (61%), Positives = 535/748 (71%), Gaps = 10/748 (1%)
 Frame = -2

Query: 2216 MVKTVVYDGEIFLGEVEIY-----FENNNFNYKNDEIEKIVK----KGIYISHYSQESER 2064
            M K+VVY GE  LGEVEIY      +      + DE + +V     K I I + +Q SER
Sbjct: 4    MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63

Query: 2063 CPPLSVLHTVTAQSSSSSNGLCFKIMESNNKLLYFQQQQDSQLFALYNTCLRDNKTAVMP 1884
            CPPL+VLHT+T      S+G+CFK MES+    Y   Q    L  L++ C+RDNKTAVMP
Sbjct: 64   CPPLAVLHTIT------SSGICFK-MESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMP 116

Query: 1883 LGEQEIHLVAIRSRRMDGATPCFWGFSVIPGLYESCLGMLNLRCLGIVFDLDETLIVANT 1704
            +G+ E+HLVA+ SR  D   PCFWGF+V  GLY+SCL MLNLRCLGIVFDLDETLIVANT
Sbjct: 117  MGDCELHLVAMYSRNSD--RPCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANT 174

Query: 1703 LRSFEDRIEALQRKISVETDHQRITGMIGEVKRYQEDKAILKQYAETDQVVDNGKVHKIQ 1524
            +RSFEDRIEALQRK++ E D QR+ GM+ E+KRYQ+DKAILKQYAE DQVV+NGKV KIQ
Sbjct: 175  MRSFEDRIEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQ 234

Query: 1523 AEVIPALSDNHQTIVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGR 1344
            +EV+PALSDNHQ I+RPLIRLQ+KNI+LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGR
Sbjct: 235  SEVVPALSDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGR 294

Query: 1343 KRFEVYVCTMAERDYALEMWRLLDPDSNLIGGRELLERIVCVKSGSKKSLFNVFHGGICH 1164
            KRFEVYVCTMAERDYALEMWRLLDP+SNLI  +ELL+RIVCVKSGS+KSLFNVF  GICH
Sbjct: 295  KRFEVYVCTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICH 354

Query: 1163 PKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFF 984
            PKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFF
Sbjct: 355  PKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFF 414

Query: 983  KEFDEGLLQRISEVTFEDDPKDILSPPDVSNFLVSEDDGSGCDGNKESIGFDGMADAEVE 804
            +EFDEGLLQRI E+++EDD KDI SPPDV N+LVSEDD S  +GNK+ + FDGMADAEVE
Sbjct: 415  REFDEGLLQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVE 474

Query: 803  RRLKDXXXXXXXXXXXXXXXXXTVSLDSRLTSSLAFTV-ASSMTISQPAPQASIPSFHTN 627
            RRLK+                  ++LD RLT SL +T+ +SS +I   A Q SI SF   
Sbjct: 475  RRLKE------AISATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNM 528

Query: 626  LFPQAGPLARSLCNIGPQDLGLHSSPAQEEGEVPESELDPDTRRRLLILQHGQDMRESIS 447
             FP A P+ + +  +   +  L SSPA+EEGEVPESELDPDTRRRLLILQHGQD R+   
Sbjct: 529  QFPLAAPVVKPVAPVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTP 588

Query: 446  AEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAQSLGSWFP 267
             E                                                  QS GSWF 
Sbjct: 589  PE---------------------------------PAFPPVRPTMQVSVPRGQSRGSWFA 615

Query: 266  VEDHMSSGPMSRLPPKDFPVAPEGVHVEKHRPLPPFPRKVENSVWPDRNFSEKQRLPREA 87
             E+ MS   ++R  PK+FP+  E +H+EKHR  PPF  KVE+S+  DR   E QRL +EA
Sbjct: 616  AEEEMSPRQLNRAAPKEFPLDSERMHIEKHRH-PPFFPKVESSIPSDRLLRENQRLSKEA 674

Query: 86   PRRDDRLRPNYSFPSHQSFRGDEITLSR 3
              RDDRL  N++  S+ SF G+E+ LS+
Sbjct: 675  LHRDDRLGLNHTPSSYHSFSGEEMPLSQ 702


>gb|EOY28303.1| C-terminal domain phosphatase-like 1 isoform 2 [Theobroma cacao]
          Length = 984

 Score =  847 bits (2187), Expect = 0.0
 Identities = 459/748 (61%), Positives = 535/748 (71%), Gaps = 10/748 (1%)
 Frame = -2

Query: 2216 MVKTVVYDGEIFLGEVEIY-----FENNNFNYKNDEIEKIVK----KGIYISHYSQESER 2064
            M K+VVY GE  LGEVEIY      +      + DE + +V     K I I + +Q SER
Sbjct: 4    MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63

Query: 2063 CPPLSVLHTVTAQSSSSSNGLCFKIMESNNKLLYFQQQQDSQLFALYNTCLRDNKTAVMP 1884
            CPPL+VLHT+T      S+G+CFK MES+    Y   Q    L  L++ C+RDNKTAVMP
Sbjct: 64   CPPLAVLHTIT------SSGICFK-MESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMP 116

Query: 1883 LGEQEIHLVAIRSRRMDGATPCFWGFSVIPGLYESCLGMLNLRCLGIVFDLDETLIVANT 1704
            +G+ E+HLVA+ SR  D   PCFWGF+V  GLY+SCL MLNLRCLGIVFDLDETLIVANT
Sbjct: 117  MGDCELHLVAMYSRNSD--RPCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANT 174

Query: 1703 LRSFEDRIEALQRKISVETDHQRITGMIGEVKRYQEDKAILKQYAETDQVVDNGKVHKIQ 1524
            +RSFEDRIEALQRK++ E D QR+ GM+ E+KRYQ+DKAILKQYAE DQVV+NGKV KIQ
Sbjct: 175  MRSFEDRIEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQ 234

Query: 1523 AEVIPALSDNHQTIVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGR 1344
            +EV+PALSDNHQ I+RPLIRLQ+KNI+LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGR
Sbjct: 235  SEVVPALSDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGR 294

Query: 1343 KRFEVYVCTMAERDYALEMWRLLDPDSNLIGGRELLERIVCVKSGSKKSLFNVFHGGICH 1164
            KRFEVYVCTMAERDYALEMWRLLDP+SNLI  +ELL+RIVCVKSGS+KSLFNVF  GICH
Sbjct: 295  KRFEVYVCTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICH 354

Query: 1163 PKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFF 984
            PKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFF
Sbjct: 355  PKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFF 414

Query: 983  KEFDEGLLQRISEVTFEDDPKDILSPPDVSNFLVSEDDGSGCDGNKESIGFDGMADAEVE 804
            +EFDEGLLQRI E+++EDD KDI SPPDV N+LVSEDD S  +GNK+ + FDGMADAEVE
Sbjct: 415  REFDEGLLQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVE 474

Query: 803  RRLKDXXXXXXXXXXXXXXXXXTVSLDSRLTSSLAFTV-ASSMTISQPAPQASIPSFHTN 627
            RRLK+                  ++LD RLT SL +T+ +SS +I   A Q SI SF   
Sbjct: 475  RRLKE------AISATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNM 528

Query: 626  LFPQAGPLARSLCNIGPQDLGLHSSPAQEEGEVPESELDPDTRRRLLILQHGQDMRESIS 447
             FP A P+ + +  +   +  L SSPA+EEGEVPESELDPDTRRRLLILQHGQD R+   
Sbjct: 529  QFPLAAPVVKPVAPVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTP 588

Query: 446  AEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAQSLGSWFP 267
             E                                                  QS GSWF 
Sbjct: 589  PE---------------------------------PAFPPVRPTMQVSVPRGQSRGSWFA 615

Query: 266  VEDHMSSGPMSRLPPKDFPVAPEGVHVEKHRPLPPFPRKVENSVWPDRNFSEKQRLPREA 87
             E+ MS   ++R  PK+FP+  E +H+EKHR  PPF  KVE+S+  DR   E QRL +EA
Sbjct: 616  AEEEMSPRQLNRAAPKEFPLDSERMHIEKHRH-PPFFPKVESSIPSDRLLRENQRLSKEA 674

Query: 86   PRRDDRLRPNYSFPSHQSFRGDEITLSR 3
              RDDRL  N++  S+ SF G+E+ LS+
Sbjct: 675  LHRDDRLGLNHTPSSYHSFSGEEMPLSQ 702


>gb|EOY28302.1| C-terminal domain phosphatase-like 1 isoform 1 [Theobroma cacao]
          Length = 978

 Score =  847 bits (2187), Expect = 0.0
 Identities = 459/748 (61%), Positives = 535/748 (71%), Gaps = 10/748 (1%)
 Frame = -2

Query: 2216 MVKTVVYDGEIFLGEVEIY-----FENNNFNYKNDEIEKIVK----KGIYISHYSQESER 2064
            M K+VVY GE  LGEVEIY      +      + DE + +V     K I I + +Q SER
Sbjct: 4    MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63

Query: 2063 CPPLSVLHTVTAQSSSSSNGLCFKIMESNNKLLYFQQQQDSQLFALYNTCLRDNKTAVMP 1884
            CPPL+VLHT+T      S+G+CFK MES+    Y   Q    L  L++ C+RDNKTAVMP
Sbjct: 64   CPPLAVLHTIT------SSGICFK-MESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMP 116

Query: 1883 LGEQEIHLVAIRSRRMDGATPCFWGFSVIPGLYESCLGMLNLRCLGIVFDLDETLIVANT 1704
            +G+ E+HLVA+ SR  D   PCFWGF+V  GLY+SCL MLNLRCLGIVFDLDETLIVANT
Sbjct: 117  MGDCELHLVAMYSRNSD--RPCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANT 174

Query: 1703 LRSFEDRIEALQRKISVETDHQRITGMIGEVKRYQEDKAILKQYAETDQVVDNGKVHKIQ 1524
            +RSFEDRIEALQRK++ E D QR+ GM+ E+KRYQ+DKAILKQYAE DQVV+NGKV KIQ
Sbjct: 175  MRSFEDRIEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQ 234

Query: 1523 AEVIPALSDNHQTIVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGR 1344
            +EV+PALSDNHQ I+RPLIRLQ+KNI+LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGR
Sbjct: 235  SEVVPALSDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGR 294

Query: 1343 KRFEVYVCTMAERDYALEMWRLLDPDSNLIGGRELLERIVCVKSGSKKSLFNVFHGGICH 1164
            KRFEVYVCTMAERDYALEMWRLLDP+SNLI  +ELL+RIVCVKSGS+KSLFNVF  GICH
Sbjct: 295  KRFEVYVCTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICH 354

Query: 1163 PKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFF 984
            PKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFF
Sbjct: 355  PKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFF 414

Query: 983  KEFDEGLLQRISEVTFEDDPKDILSPPDVSNFLVSEDDGSGCDGNKESIGFDGMADAEVE 804
            +EFDEGLLQRI E+++EDD KDI SPPDV N+LVSEDD S  +GNK+ + FDGMADAEVE
Sbjct: 415  REFDEGLLQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVE 474

Query: 803  RRLKDXXXXXXXXXXXXXXXXXTVSLDSRLTSSLAFTV-ASSMTISQPAPQASIPSFHTN 627
            RRLK+                  ++LD RLT SL +T+ +SS +I   A Q SI SF   
Sbjct: 475  RRLKE------AISATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNM 528

Query: 626  LFPQAGPLARSLCNIGPQDLGLHSSPAQEEGEVPESELDPDTRRRLLILQHGQDMRESIS 447
             FP A P+ + +  +   +  L SSPA+EEGEVPESELDPDTRRRLLILQHGQD R+   
Sbjct: 529  QFPLAAPVVKPVAPVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTP 588

Query: 446  AEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAQSLGSWFP 267
             E                                                  QS GSWF 
Sbjct: 589  PE---------------------------------PAFPPVRPTMQVSVPRGQSRGSWFA 615

Query: 266  VEDHMSSGPMSRLPPKDFPVAPEGVHVEKHRPLPPFPRKVENSVWPDRNFSEKQRLPREA 87
             E+ MS   ++R  PK+FP+  E +H+EKHR  PPF  KVE+S+  DR   E QRL +EA
Sbjct: 616  AEEEMSPRQLNRAAPKEFPLDSERMHIEKHRH-PPFFPKVESSIPSDRLLRENQRLSKEA 674

Query: 86   PRRDDRLRPNYSFPSHQSFRGDEITLSR 3
              RDDRL  N++  S+ SF G+E+ LS+
Sbjct: 675  LHRDDRLGLNHTPSSYHSFSGEEMPLSQ 702


>gb|EMJ15747.1| hypothetical protein PRUPE_ppa000988mg [Prunus persica]
          Length = 940

 Score =  846 bits (2186), Expect = 0.0
 Identities = 455/739 (61%), Positives = 531/739 (71%), Gaps = 1/739 (0%)
 Frame = -2

Query: 2216 MVKTVVYDGEIFLGEVEIYFENNNFNYKNDEIEKIVKKGIYISHYSQESERCPPLSVLHT 2037
            M K+VVY GE  LGEVEIY E N    KN  +   +K+ I IS++SQ SERCPP++VLHT
Sbjct: 1    MYKSVVYKGEELLGEVEIYPEENENKNKNKNLVDELKE-IRISYFSQSSERCPPVAVLHT 59

Query: 2036 VTAQSSSSSNGLCFKIMESNNKLLYFQQQQDSQLFALYNTCLRDNKTAVMPLGEQEIHLV 1857
            +      SS+G+CFK MES        Q QD+ LF L+++C+ +NKTAVMPLG +E+HLV
Sbjct: 60   I------SSHGVCFK-MESKTS-----QSQDTPLFLLHSSCVMENKTAVMPLGGEELHLV 107

Query: 1856 AIRSRRMDGATPCFWGFSVIPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIE 1677
            A+RSR  D   PCFWGFSV PGLY SCL MLNLRCLGIVFDLDETLIVANT+RSFEDRIE
Sbjct: 108  AMRSRNGDKRYPCFWGFSVAPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIE 167

Query: 1676 ALQRKISVETDHQRITGMIGEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSD 1497
            ALQRKIS E D QRI+GM+ E+KRYQ+DK ILKQYAE DQVV+NG+V K Q+E +PALSD
Sbjct: 168  ALQRKISSEVDPQRISGMLAEIKRYQDDKFILKQYAENDQVVENGRVIKTQSEAVPALSD 227

Query: 1496 NHQTIVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT 1317
            NHQ I+RPLIRL +KNI+LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT
Sbjct: 228  NHQPIIRPLIRLHEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT 287

Query: 1316 MAERDYALEMWRLLDPDSNLIGGRELLERIVCVKSGSKKSLFNVFHGGICHPKMALVIDD 1137
            MAERDYALEMWRLLDPDSNLI   +LL+RIVCVKSGS+KSLFNVF   +CHPKMALVIDD
Sbjct: 288  MAERDYALEMWRLLDPDSNLINSNKLLDRIVCVKSGSRKSLFNVFQESLCHPKMALVIDD 347

Query: 1136 RLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFKEFDEGLLQ 957
            RLKVWD++DQPRVHVVPAFAPYYAPQAEANN +PVLCVARNVACNVRGGFF+EFD+ LLQ
Sbjct: 348  RLKVWDDRDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFREFDDSLLQ 407

Query: 956  RISEVTFEDDPKDILSPPDVSNFLVSEDDGSGCDGNKESIGFDGMADAEVERRLKDXXXX 777
            +I EV +EDD KD+ S PDVSN+LVSEDD S  +GN++ + FDG+ D EVERR+K+    
Sbjct: 408  KIPEVFYEDDIKDVPS-PDVSNYLVSEDDSSALNGNRDPLPFDGITDVEVERRMKE---- 462

Query: 776  XXXXXXXXXXXXXTVSLDSRLTSSLAFTVASSMTISQPAPQASIPSFHTNLFPQAGPLAR 597
                           S+D RL + L +TV  S T+S P  Q S+ SF +  FPQA  L +
Sbjct: 463  --ATPAASMVSSVFTSIDPRL-APLQYTVPPSSTLSLPTTQPSVMSFPSIQFPQAASLVK 519

Query: 596  SLCNIGPQDLGLHSSPAQEEGEVPESELDPDTRRRLLILQHGQDMRESISAEXXXXXXXX 417
             L ++G  +  L SSPA+EEGEVPESELDPDTRRRLLILQHGQD R+   +E        
Sbjct: 520  PLGHVGSAEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDQPPSE-------- 571

Query: 416  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAQSLGSWFPVEDHMSSGPM 237
                                                     AQS   WFPVE+ MS   +
Sbjct: 572  --------------------------PPFPVRPPMQASVPRAQSRPGWFPVEEEMSPRQL 605

Query: 236  SRLPPKDFPVAPEGVHVEKHRP-LPPFPRKVENSVWPDRNFSEKQRLPREAPRRDDRLRP 60
            SR+ PKD P+ PE V +EKHRP    F  KVENS+  DR   E QRLP+EA  RDDRLR 
Sbjct: 606  SRMVPKDLPLDPETVQIEKHRPHHSSFFPKVENSIPSDRILQENQRLPKEAFHRDDRLRF 665

Query: 59   NYSFPSHQSFRGDEITLSR 3
            N++   + S  G+EI LSR
Sbjct: 666  NHALSGYHSLSGEEIPLSR 684


>ref|XP_002267987.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            [Vitis vinifera]
          Length = 860

 Score =  846 bits (2186), Expect = 0.0
 Identities = 454/740 (61%), Positives = 528/740 (71%), Gaps = 2/740 (0%)
 Frame = -2

Query: 2216 MVKTVVYDGEIFLGEVEIYFENNNFNYKNDEIEKIVKKGIYISHYSQESERCPPLSVLHT 2037
            M K++VY+G+  +GEVEIY +N             + K I ISHYSQ SERCPPL+VLHT
Sbjct: 1    MYKSIVYEGDDVVGEVEIYPQNQGLE---------LMKEIRISHYSQPSERCPPLAVLHT 51

Query: 2036 VTAQSSSSSNGLCFKIMESNNKLLYFQQQQDSQLFALYNTCLRDNKTAVMPLGEQEIHLV 1857
            +T      S G+CFK MES+       Q QD+ L+ L++TC+R+NKTAVM LGE+E+HLV
Sbjct: 52   IT------SCGVCFK-MESSKA-----QSQDTPLYLLHSTCIRENKTAVMSLGEEELHLV 99

Query: 1856 AIRSRRMDGATPCFWGFSVIPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIE 1677
            A+ S++ DG  PCFWGF+V  GLY SCL MLNLRCLGIVFDLDETLIVANT+RSFEDRI+
Sbjct: 100  AMYSKKKDGQYPCFWGFNVALGLYSSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRID 159

Query: 1676 ALQRKISVETDHQRITGMIGEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSD 1497
            ALQRKI+ E D QRI+GM  EV+RYQ+D+ ILKQYAE DQVV+NGK+ K Q E++PALSD
Sbjct: 160  ALQRKINTEVDPQRISGMAAEVRRYQDDRNILKQYAENDQVVENGKLFKTQPEIVPALSD 219

Query: 1496 NHQTIVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT 1317
            NHQ IVRPLIRLQ+KNI+LTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT
Sbjct: 220  NHQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT 279

Query: 1316 MAERDYALEMWRLLDPDSNLIGGRELLERIVCVKSGSKKSLFNVFHGGICHPKMALVIDD 1137
            MAERDYALEMWRLLDP+SNLI  +ELL+RIVCVKSGS+KSLFNVF  GICHPKMALVIDD
Sbjct: 280  MAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVIDD 339

Query: 1136 RLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFKEFDEGLLQ 957
            RLKVWDEKDQPRVHVVPAFAPYYAPQAEANN I VLCVARNVACNVRGGFFKEFDEGLLQ
Sbjct: 340  RLKVWDEKDQPRVHVVPAFAPYYAPQAEANNAISVLCVARNVACNVRGGFFKEFDEGLLQ 399

Query: 956  RISEVTFEDDPKDILSPPDVSNFLVSEDDGSGCDGNKESIGFDGMADAEVERRLKDXXXX 777
            RI E+++EDD KDI S PDVSN+LVSEDD S  +GN++   FDGMAD EVER+LKD    
Sbjct: 400  RIPEISYEDDIKDIRSAPDVSNYLVSEDDASVSNGNRDQPCFDGMADVEVERKLKD---- 455

Query: 776  XXXXXXXXXXXXXTVSLDSRLTSSLAFTVASSMTIS-QPAPQASIPSFHTNLFPQAGPLA 600
                           SLD RL+  L F VA+S  ++ QPA Q SI  F    FPQ+  L 
Sbjct: 456  ------AISAPSTVTSLDPRLSPPLQFAVAASSGLAPQPAAQGSIMPFSNKQFPQSASLI 509

Query: 599  RSLCNIGPQDLGLHSSPAQEEGEVPESELDPDTRRRLLILQHGQDMRESISAEXXXXXXX 420
            + L      +  + SSPA+EEGEVPESELDPDTRRRLLILQHGQD RE  S++       
Sbjct: 510  KPLA----PEPTMQSSPAREEGEVPESELDPDTRRRLLILQHGQDTREHASSD------- 558

Query: 419  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAQSLGSWFPVEDHMSSGP 240
                                                       QS GSWFP ++ MS   
Sbjct: 559  ---------------------------PPFPVRPPIQVSVPRVQSRGSWFPADEEMSPRQ 591

Query: 239  MSRLPPKDFPVAPEGVHVEKHRP-LPPFPRKVENSVWPDRNFSEKQRLPREAPRRDDRLR 63
            ++R  PK+FP+  + +H+EKHRP  P F  KVE+S   DR   E QRL +E   RDDRLR
Sbjct: 592  LNRAVPKEFPLDSDTMHIEKHRPHHPSFFHKVESSASSDRILHENQRLSKEVLHRDDRLR 651

Query: 62   PNYSFPSHQSFRGDEITLSR 3
             N+S P + SF G+E+ L R
Sbjct: 652  LNHSLPGYHSFSGEEVPLGR 671


>emb|CBI35690.3| unnamed protein product [Vitis vinifera]
          Length = 788

 Score =  846 bits (2186), Expect = 0.0
 Identities = 454/740 (61%), Positives = 528/740 (71%), Gaps = 2/740 (0%)
 Frame = -2

Query: 2216 MVKTVVYDGEIFLGEVEIYFENNNFNYKNDEIEKIVKKGIYISHYSQESERCPPLSVLHT 2037
            M K++VY+G+  +GEVEIY +N             + K I ISHYSQ SERCPPL+VLHT
Sbjct: 1    MYKSIVYEGDDVVGEVEIYPQNQGLE---------LMKEIRISHYSQPSERCPPLAVLHT 51

Query: 2036 VTAQSSSSSNGLCFKIMESNNKLLYFQQQQDSQLFALYNTCLRDNKTAVMPLGEQEIHLV 1857
            +T      S G+CFK MES+       Q QD+ L+ L++TC+R+NKTAVM LGE+E+HLV
Sbjct: 52   IT------SCGVCFK-MESSKA-----QSQDTPLYLLHSTCIRENKTAVMSLGEEELHLV 99

Query: 1856 AIRSRRMDGATPCFWGFSVIPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIE 1677
            A+ S++ DG  PCFWGF+V  GLY SCL MLNLRCLGIVFDLDETLIVANT+RSFEDRI+
Sbjct: 100  AMYSKKKDGQYPCFWGFNVALGLYSSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRID 159

Query: 1676 ALQRKISVETDHQRITGMIGEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSD 1497
            ALQRKI+ E D QRI+GM  EV+RYQ+D+ ILKQYAE DQVV+NGK+ K Q E++PALSD
Sbjct: 160  ALQRKINTEVDPQRISGMAAEVRRYQDDRNILKQYAENDQVVENGKLFKTQPEIVPALSD 219

Query: 1496 NHQTIVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT 1317
            NHQ IVRPLIRLQ+KNI+LTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT
Sbjct: 220  NHQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT 279

Query: 1316 MAERDYALEMWRLLDPDSNLIGGRELLERIVCVKSGSKKSLFNVFHGGICHPKMALVIDD 1137
            MAERDYALEMWRLLDP+SNLI  +ELL+RIVCVKSGS+KSLFNVF  GICHPKMALVIDD
Sbjct: 280  MAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVIDD 339

Query: 1136 RLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFKEFDEGLLQ 957
            RLKVWDEKDQPRVHVVPAFAPYYAPQAEANN I VLCVARNVACNVRGGFFKEFDEGLLQ
Sbjct: 340  RLKVWDEKDQPRVHVVPAFAPYYAPQAEANNAISVLCVARNVACNVRGGFFKEFDEGLLQ 399

Query: 956  RISEVTFEDDPKDILSPPDVSNFLVSEDDGSGCDGNKESIGFDGMADAEVERRLKDXXXX 777
            RI E+++EDD KDI S PDVSN+LVSEDD S  +GN++   FDGMAD EVER+LKD    
Sbjct: 400  RIPEISYEDDIKDIRSAPDVSNYLVSEDDASVSNGNRDQPCFDGMADVEVERKLKD---- 455

Query: 776  XXXXXXXXXXXXXTVSLDSRLTSSLAFTVASSMTIS-QPAPQASIPSFHTNLFPQAGPLA 600
                           SLD RL+  L F VA+S  ++ QPA Q SI  F    FPQ+  L 
Sbjct: 456  ------AISAPSTVTSLDPRLSPPLQFAVAASSGLAPQPAAQGSIMPFSNKQFPQSASLI 509

Query: 599  RSLCNIGPQDLGLHSSPAQEEGEVPESELDPDTRRRLLILQHGQDMRESISAEXXXXXXX 420
            + L      +  + SSPA+EEGEVPESELDPDTRRRLLILQHGQD RE  S++       
Sbjct: 510  KPLA----PEPTMQSSPAREEGEVPESELDPDTRRRLLILQHGQDTREHASSD------- 558

Query: 419  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAQSLGSWFPVEDHMSSGP 240
                                                       QS GSWFP ++ MS   
Sbjct: 559  ---------------------------PPFPVRPPIQVSVPRVQSRGSWFPADEEMSPRQ 591

Query: 239  MSRLPPKDFPVAPEGVHVEKHRP-LPPFPRKVENSVWPDRNFSEKQRLPREAPRRDDRLR 63
            ++R  PK+FP+  + +H+EKHRP  P F  KVE+S   DR   E QRL +E   RDDRLR
Sbjct: 592  LNRAVPKEFPLDSDTMHIEKHRPHHPSFFHKVESSASSDRILHENQRLSKEVLHRDDRLR 651

Query: 62   PNYSFPSHQSFRGDEITLSR 3
             N+S P + SF G+E+ L R
Sbjct: 652  LNHSLPGYHSFSGEEVPLGR 671


>ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
          Length = 956

 Score =  844 bits (2180), Expect = 0.0
 Identities = 462/741 (62%), Positives = 534/741 (72%), Gaps = 2/741 (0%)
 Frame = -2

Query: 2219 KMVKTVVYDGEIFLGEVEIYFENNNFNYKNDEIEKIVKKGIYISHYSQESERCPPLSVLH 2040
            +M K+VVY GE+ +GEV++Y E NN NYKN  +     K I ISH+SQ SERCPPL+VLH
Sbjct: 2    RMYKSVVYQGEVVVGEVDVYPEENN-NYKNFHV-----KEIRISHFSQPSERCPPLAVLH 55

Query: 2039 TVTAQSSSSSNGLCFKIMESNNKLLYFQQQQDSQLFALYNTCLRDNKTAVMPLGEQEIHL 1860
            TVT      S G+CFK MES        QQQD  LF L++ C+R+NKTAVMPLG +EIHL
Sbjct: 56   TVT------SCGVCFK-MESKT------QQQDG-LFQLHSLCIRENKTAVMPLGGEEIHL 101

Query: 1859 VAIRSRRMDGATPCFWGFSVIPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRI 1680
            VA+ SR +D   PCFWGF V  GLY+SCL MLNLRCLGIVFDLDETLIVANT+RSFEDRI
Sbjct: 102  VAMHSRNVD--RPCFWGFIVALGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRI 159

Query: 1679 EALQRKISVETDHQRITGMIGEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALS 1500
            +ALQRKI+ E D QRI+GM  EVKRYQ+DK ILKQYAE DQVVDNG+V K+Q+E++PALS
Sbjct: 160  DALQRKINSEVDPQRISGMQAEVKRYQDDKNILKQYAENDQVVDNGRVIKVQSEIVPALS 219

Query: 1499 DNHQTIVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVC 1320
            D+HQ IVRPLIRLQDKNI+LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVC
Sbjct: 220  DSHQPIVRPLIRLQDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVC 279

Query: 1319 TMAERDYALEMWRLLDPDSNLIGGRELLERIVCVKSGSKKSLFNVFHGGICHPKMALVID 1140
            TMAERDYALEMWRLLDPDSNLI  +ELL RIVCVKSG KKSLFNVF  G+CHPKMALVID
Sbjct: 280  TMAERDYALEMWRLLDPDSNLINSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVID 339

Query: 1139 DRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFKEFDEGLL 960
            DRLKVWDEKDQPRVHVVPAFAPYYAPQAEA+NTIPVLCVARNVACNVRGGFFK+FD+GLL
Sbjct: 340  DRLKVWDEKDQPRVHVVPAFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLL 399

Query: 959  QRISEVTFEDDPKDILSPPDVSNFLVSEDDGSGCDGNKESIGFDGMADAEVERRLKDXXX 780
            Q+I ++ +EDD KDI SPPDVSN+LVSEDDGS  +G+++   FDGMADAEVER+LKD   
Sbjct: 400  QKIPQIAYEDDIKDIPSPPDVSNYLVSEDDGSISNGHRDPFLFDGMADAEVERKLKD--- 456

Query: 779  XXXXXXXXXXXXXXTVSLDSRLTSSLAFTVASSMTISQPAPQASIPSFHTNLFPQAGPLA 600
                          T +LD RLT SL +T+  S ++  P  QAS+  F    FPQ   L 
Sbjct: 457  ---ALSAASTIPVTTANLDPRLT-SLQYTMVPSGSVPPPTAQASMMPFPHVQFPQPATLV 512

Query: 599  RSLCNIGPQDLGLHSSPAQEEGEVPESELDPDTRRRLLILQHGQDMRESISAEXXXXXXX 420
            + +    P +  LHSSPA+EEGEVPESELDPDTRRRLLILQHGQD R+  SAE       
Sbjct: 513  KPMGQAAPSEPSLHSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHASAE------- 565

Query: 419  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAQSLGSWFPVEDHMSSGP 240
                                                        S G WFP E+ + S P
Sbjct: 566  --------------------------PPFPVRHPVQTSAPHVPSSRGVWFPAEEEIGSQP 599

Query: 239  MSRLPPKDFPVAPEGVHVEKHRP-LPPFPRKVENSVWPDRNFSEK-QRLPREAPRRDDRL 66
            ++R+ PK+FPV    + + K RP  P F  KVE+S+  DR   +  QRLP+E   RDDR 
Sbjct: 600  LNRVVPKEFPVDSGPLGIAKPRPHHPSFFSKVESSISSDRILHDSHQRLPKEMYHRDDRP 659

Query: 65   RPNYSFPSHQSFRGDEITLSR 3
            R N+   S++SF GD+I  SR
Sbjct: 660  RLNHMLSSYRSFSGDDIPFSR 680


>ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Glycine max]
          Length = 960

 Score =  835 bits (2158), Expect = 0.0
 Identities = 458/740 (61%), Positives = 528/740 (71%), Gaps = 2/740 (0%)
 Frame = -2

Query: 2216 MVKTVVYDGEIFLGEVEIYFENNNFNYKNDEIEKIVKKGIYISHYSQESERCPPLSVLHT 2037
            M K+VVY GE+ +GEV++Y E NN N   +  +    K I ISH+SQ SERCPPL+VLHT
Sbjct: 1    MYKSVVYQGEVVVGEVDVYPEENNNNNNKNYNKNFHVKEIRISHFSQPSERCPPLAVLHT 60

Query: 2036 VTAQSSSSSNGLCFKIMESNNKLLYFQQQQDSQLFALYNTCLRDNKTAVMPLGEQEIHLV 1857
            VT      S G+CFK MES        QQQD  LF L++ C+R+NKTAVMPLG +EIHLV
Sbjct: 61   VT------SCGVCFK-MESKT------QQQDG-LFQLHSLCIRENKTAVMPLGGEEIHLV 106

Query: 1856 AIRSRRMDGATPCFWGFSVIPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIE 1677
            A+ SR  D   PCFWGF V  GLY+SCL MLNLRCLGIVFDLDETLIVANT+RSFEDRI+
Sbjct: 107  AMHSRNDD--RPCFWGFIVTLGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRID 164

Query: 1676 ALQRKISVETDHQRITGMIGEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSD 1497
            ALQRKI+ E D QRI+GM  EVKRY +DK ILKQYAE DQVVDNG+V K+Q+E++PALSD
Sbjct: 165  ALQRKINSEVDPQRISGMQAEVKRYLDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSD 224

Query: 1496 NHQTIVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT 1317
            +HQ IVRPLIRLQDKNI+LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT
Sbjct: 225  SHQPIVRPLIRLQDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT 284

Query: 1316 MAERDYALEMWRLLDPDSNLIGGRELLERIVCVKSGSKKSLFNVFHGGICHPKMALVIDD 1137
            MAERDYALEMWRLLDPDSNLI  +ELL RIVCVKSG KKSLFNVF  G C PKMALVIDD
Sbjct: 285  MAERDYALEMWRLLDPDSNLINSKELLGRIVCVKSGLKKSLFNVFQDGSCDPKMALVIDD 344

Query: 1136 RLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFKEFDEGLLQ 957
            RLKVWDE+DQPRVHVVPAFAPYYAPQAEA+NTIPVLCVARNVACNVRGGFFK+FD+GLLQ
Sbjct: 345  RLKVWDERDQPRVHVVPAFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQ 404

Query: 956  RISEVTFEDDPKDILSPPDVSNFLVSEDDGSGCDGNKESIGFDGMADAEVERRLKDXXXX 777
            +I ++ +EDD KD+ SPPDVSN+LVSEDDGS  +GN++   FDGMADAEVER+LKD    
Sbjct: 405  KIPQIAYEDDIKDVPSPPDVSNYLVSEDDGSISNGNRDPFLFDGMADAEVERKLKD---- 460

Query: 776  XXXXXXXXXXXXXTVSLDSRLTSSLAFTVASSMTISQPAPQASIPSFHTNLFPQAGPLAR 597
                         T +LD RLT SL +T+  S ++  P  QAS+  F    FPQ   L +
Sbjct: 461  --ALAAASTFPVTTANLDPRLT-SLQYTMVPSGSVPPPTAQASMMPFPHVQFPQPATLVK 517

Query: 596  SLCNIGPQDLGLHSSPAQEEGEVPESELDPDTRRRLLILQHGQDMRESISAEXXXXXXXX 417
             +    P D  LHSSPA+EEGEVPESELDPDTRRRLLILQHGQD R+  SAE        
Sbjct: 518  PMGQAAPSDPSLHSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHASAE-------- 569

Query: 416  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAQSLGSWFPVEDHMSSGPM 237
                                                       S G WFPVE+ + S P+
Sbjct: 570  -------------------------PPFPVRHPVQASAPRVPSSRGVWFPVEEEIGSQPL 604

Query: 236  SRLPPKDFPVAPEGVHVEKHR-PLPPFPRKVENSVWPDRNFSEK-QRLPREAPRRDDRLR 63
            +R+ PK+FPV    + +EK R   P F  KVE+S+  DR   +  QRLP+E   RDDR R
Sbjct: 605  NRVVPKEFPVDSGPLGIEKPRLHHPSFFNKVESSISSDRILHDSHQRLPKEMYHRDDRPR 664

Query: 62   PNYSFPSHQSFRGDEITLSR 3
             N+   S++SF GD+I  SR
Sbjct: 665  LNHMLSSYRSFSGDDIPFSR 684


>ref|XP_006583810.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X2 [Glycine max]
          Length = 929

 Score =  834 bits (2155), Expect = 0.0
 Identities = 457/732 (62%), Positives = 528/732 (72%), Gaps = 2/732 (0%)
 Frame = -2

Query: 2219 KMVKTVVYDGEIFLGEVEIYFENNNFNYKNDEIEKIVKKGIYISHYSQESERCPPLSVLH 2040
            +M K+VVY GE+ +GEV++Y E NN NYKN  +     K I ISH+SQ SERCPPL+VLH
Sbjct: 2    RMYKSVVYQGEVVVGEVDVYPEENN-NYKNFHV-----KEIRISHFSQPSERCPPLAVLH 55

Query: 2039 TVTAQSSSSSNGLCFKIMESNNKLLYFQQQQDSQLFALYNTCLRDNKTAVMPLGEQEIHL 1860
            TVT      S G+CFK MES        QQQD  LF L++ C+R+NKTAVMPLG +EIHL
Sbjct: 56   TVT------SCGVCFK-MESKT------QQQDG-LFQLHSLCIRENKTAVMPLGGEEIHL 101

Query: 1859 VAIRSRRMDGATPCFWGFSVIPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRI 1680
            VA+ SR +D   PCFWGF V  GLY+SCL MLNLRCLGIVFDLDETLIVANT+RSFEDRI
Sbjct: 102  VAMHSRNVD--RPCFWGFIVALGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRI 159

Query: 1679 EALQRKISVETDHQRITGMIGEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALS 1500
            +ALQRKI+ E D QRI+GM  EVKRYQ+DK ILKQYAE DQVVDNG+V K+Q+E++PALS
Sbjct: 160  DALQRKINSEVDPQRISGMQAEVKRYQDDKNILKQYAENDQVVDNGRVIKVQSEIVPALS 219

Query: 1499 DNHQTIVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVC 1320
            D+HQ IVRPLIRLQDKNI+LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVC
Sbjct: 220  DSHQPIVRPLIRLQDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVC 279

Query: 1319 TMAERDYALEMWRLLDPDSNLIGGRELLERIVCVKSGSKKSLFNVFHGGICHPKMALVID 1140
            TMAERDYALEMWRLLDPDSNLI  +ELL RIVCVKSG KKSLFNVF  G+CHPKMALVID
Sbjct: 280  TMAERDYALEMWRLLDPDSNLINSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVID 339

Query: 1139 DRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFKEFDEGLL 960
            DRLKVWDEKDQPRVHVVPAFAPYYAPQAEA+NTIPVLCVARNVACNVRGGFFK+FD+GLL
Sbjct: 340  DRLKVWDEKDQPRVHVVPAFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLL 399

Query: 959  QRISEVTFEDDPKDILSPPDVSNFLVSEDDGSGCDGNKESIGFDGMADAEVERRLKDXXX 780
            Q+I ++ +EDD KDI SPPDVSN+LVSEDDGS  +G+++   FDGMADAEVER+LKD   
Sbjct: 400  QKIPQIAYEDDIKDIPSPPDVSNYLVSEDDGSISNGHRDPFLFDGMADAEVERKLKD--- 456

Query: 779  XXXXXXXXXXXXXXTVSLDSRLTSSLAFTVASSMTISQPAPQASIPSFHTNLFPQAGPLA 600
                          T +LD RLT SL +T+  S ++  P  QAS+  F    FPQ   L 
Sbjct: 457  ---ALSAASTIPVTTANLDPRLT-SLQYTMVPSGSVPPPTAQASMMPFPHVQFPQPATLV 512

Query: 599  RSLCNIGPQDLGLHSSPAQEEGEVPESELDPDTRRRLLILQHGQDMRESISAEXXXXXXX 420
            + +    P +  LHSSPA+EEGEVPESELDPDTRRRLLILQHGQD R+  SAE       
Sbjct: 513  KPMGQAAPSEPSLHSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHASAE------- 565

Query: 419  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAQSLGSWFPVEDHMSSGP 240
                                                        S G WFP E+ + S P
Sbjct: 566  --------------------------PPFPVRHPVQTSAPHVPSSRGVWFPAEEEIGSQP 599

Query: 239  MSRLPPKDFPVAPEGVHVEKHRP-LPPFPRKVENSVWPDRNFSEK-QRLPREAPRRDDRL 66
            ++R+ PK+FPV    + + K RP  P F  KVE+S+  DR   +  QRLP+E   RDDR 
Sbjct: 600  LNRVVPKEFPVDSGPLGIAKPRPHHPSFFSKVESSISSDRILHDSHQRLPKEMYHRDDRP 659

Query: 65   RPNYSFPSHQSF 30
            R N+   S++SF
Sbjct: 660  RLNHMLSSYRSF 671


>ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Populus trichocarpa]
            gi|550340277|gb|EEE85528.2| hypothetical protein
            POPTR_0004s04010g [Populus trichocarpa]
          Length = 996

 Score =  834 bits (2155), Expect = 0.0
 Identities = 453/765 (59%), Positives = 535/765 (69%), Gaps = 27/765 (3%)
 Frame = -2

Query: 2216 MVKTVVYDGEIFLGEVEIYF------ENNNFNYKNDEIEKIVKKGIYISHYSQESERCPP 2055
            M K+VVY G+  LGEVEIY       E  N N K   I++IVK+ I ISH+SQ SERCPP
Sbjct: 1    MYKSVVYKGDELLGEVEIYAQEQQQEEEENKNKKKRVIDEIVKE-IRISHFSQTSERCPP 59

Query: 2054 LSVLHTVTAQSSSSSNGLCFKIMESNNKLLYFQQQQDSQLFALYNTCLRDNKTAVMPLGE 1875
            L+VLHT+T      S G+CFK+ ES +       QQ+S L  L+++C+++NKTAVM LG 
Sbjct: 60   LAVLHTIT------SIGVCFKMEESTSSSTTKISQQESPLHLLHSSCIQENKTAVMHLGG 113

Query: 1874 QEIHLVAIRSRRMDGATPCFWGFSVIPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRS 1695
            +E+HLVA+ SR  +   PCFWGFSV PGLY+SCL MLNLRCLGIVFDLDETLIVANT+RS
Sbjct: 114  EELHLVAMPSRSNERQHPCFWGFSVAPGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRS 173

Query: 1694 FEDRIEALQRKISVETDHQRITGMIGEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEV 1515
            FEDRI+ALQRKIS E D QRI GM+ EVKRY +DK ILKQY E DQVV+NGKV K Q+EV
Sbjct: 174  FEDRIDALQRKISTEVDPQRILGMLSEVKRYHDDKNILKQYVENDQVVENGKVIKTQSEV 233

Query: 1514 IPALSDNHQTIVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRF 1335
            +PALSDNHQ +VRPLIRLQ+KNI+LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRF
Sbjct: 234  VPALSDNHQPMVRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRF 293

Query: 1334 EVYVCTMAERDYALEMWRLLDPDSNLIGGRELLERIVCVKSGSKKSLFNVFHGGICHPKM 1155
            EVYVCTMAERDYALEMWRLLDP+SNLI  +ELL+RIVCVKSG +KSLFNVF  GICHPKM
Sbjct: 294  EVYVCTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLRKSLFNVFQDGICHPKM 353

Query: 1154 ALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFKEF 975
            ALVIDDRLKVWDE+DQ RVHVVPAFAPYYAPQAE NN +PVLCVARNVACNVRGGFFKEF
Sbjct: 354  ALVIDDRLKVWDERDQSRVHVVPAFAPYYAPQAEVNNAVPVLCVARNVACNVRGGFFKEF 413

Query: 974  DEGLLQRISEVTFEDDPKDILSPPDVSNFLVSEDDGSGCDGNKESIGFDGMADAEVERRL 795
            DEGLLQ+I EV +EDD  +I SPPDVSN+LVSEDD S  +GN++ + FDGMADAEVER+L
Sbjct: 414  DEGLLQKIPEVAYEDDTDNIPSPPDVSNYLVSEDDASAVNGNRDQLSFDGMADAEVERQL 473

Query: 794  KDXXXXXXXXXXXXXXXXXTVSLDSRLTSSLAFTVA---SSMTISQPA------------ 660
            K+                   SLD RL  SL +T+A   SSM  SQP+            
Sbjct: 474  KEAVSASSAILSTIPSTVS--SLDPRLLQSLQYTIASSSSSMPTSQPSMLASQQPMPALQ 531

Query: 659  -----PQASIPSFHTNLFPQAGPLARSLCNIGPQDLGLHSSPAQEEGEVPESELDPDTRR 495
                  Q S+  F    FPQ  P  + L  + P +  L SSPA+EEGEVPESELDPDTRR
Sbjct: 532  PPKPPSQLSMTPFPNTQFPQVAPSVKQLGQVVPPEPSLQSSPAREEGEVPESELDPDTRR 591

Query: 494  RLLILQHGQDMRESISAEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 315
            RLLILQHG D R++  +E                                          
Sbjct: 592  RLLILQHGHDSRDNAPSE----------------------------------SPFPARPS 617

Query: 314  XXXXXXXAQSLGSWFPVEDHMSSGPMSRLPPKDFPVAPEGVHVEKHRP-LPPFPRKVENS 138
                    QS+GSW PVE+ MS   ++R  P++FP+  + +++EKHR   P F  KVE++
Sbjct: 618  TQVSAPRVQSVGSWVPVEEEMSPRQLNR-TPREFPLDSDPMNIEKHRTHHPSFFHKVESN 676

Query: 137  VWPDRNFSEKQRLPREAPRRDDRLRPNYSFPSHQSFRGDEITLSR 3
            +  DR   E QR P+EA  RDDR++ N+S  ++ SF+G+E  LSR
Sbjct: 677  IPSDRMIHENQRQPKEATYRDDRMKLNHSTSNYPSFQGEESPLSR 721


>ref|XP_002519032.1| double-stranded RNA binding protein, putative [Ricinus communis]
            gi|223541695|gb|EEF43243.1| double-stranded RNA binding
            protein, putative [Ricinus communis]
          Length = 978

 Score =  832 bits (2150), Expect = 0.0
 Identities = 443/752 (58%), Positives = 536/752 (71%), Gaps = 14/752 (1%)
 Frame = -2

Query: 2216 MVKTVVYDGEIFLGEVEIYFENNNFNYKNDEIEKI------------VKKGIYISHYSQE 2073
            M K+VVY G+  LGEVEIY +      + +E+++             + KGI ISH+SQ 
Sbjct: 1    MYKSVVYKGDELLGEVEIYAQQEQKLQQQEELQEQEQELKKKRVIDEILKGIRISHFSQA 60

Query: 2072 SERCPPLSVLHTVTAQSSSSSNGLCFKIMESNNKLLYFQQQQDSQLFALYNTCLRDNKTA 1893
            SERCPPL+VLHT+T      +NG+CFK MES N +       D+ L  L+++C++++KTA
Sbjct: 61   SERCPPLAVLHTIT------TNGICFK-MESKNSV-----SLDTPLHLLHSSCIQESKTA 108

Query: 1892 VMPL-GEQEIHLVAIRSRRMDGATPCFWGFSVIPGLYESCLGMLNLRCLGIVFDLDETLI 1716
            V+ L G +E+HLVA+ SR  +   PCFW F++  GLY+SCL MLNLRCLGIVFDLDETLI
Sbjct: 109  VVLLQGGEELHLVAMFSRNDERQYPCFWAFNISSGLYDSCLVMLNLRCLGIVFDLDETLI 168

Query: 1715 VANTLRSFEDRIEALQRKISVETDHQRITGMIGEVKRYQEDKAILKQYAETDQVVDNGKV 1536
            VANT+RSFEDRIEALQRKIS E D QRI+GM+ EVKRYQ+DK ILKQY + DQVV+NG+V
Sbjct: 169  VANTMRSFEDRIEALQRKISTELDPQRISGMLSEVKRYQDDKTILKQYVDNDQVVENGRV 228

Query: 1535 HKIQAEVIPALSDNHQTIVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLT 1356
             K Q EV+PALSDNHQTIVRPLIRLQ++NI+LTRINPQIRDTSVLVRLRPAWE+LRSYLT
Sbjct: 229  IKTQFEVVPALSDNHQTIVRPLIRLQERNIILTRINPQIRDTSVLVRLRPAWEELRSYLT 288

Query: 1355 ARGRKRFEVYVCTMAERDYALEMWRLLDPDSNLIGGRELLERIVCVKSGSKKSLFNVFHG 1176
            ARGRKRFEVYVCTMAERDYALEMWRLLDP+SNLI  +ELL+RIVCVKSG +KSLFNVF  
Sbjct: 289  ARGRKRFEVYVCTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLRKSLFNVFQD 348

Query: 1175 GICHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVR 996
            GICHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANN +PVLCVARNVACNVR
Sbjct: 349  GICHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVR 408

Query: 995  GGFFKEFDEGLLQRISEVTFEDDPKDILSPPDVSNFLVSEDDGSGCDGNKESIGFDGMAD 816
            GGFFKEFDEGLLQRI E++FEDD  DI SPPDVSN+LV EDD    +GN++ + FDGMAD
Sbjct: 409  GGFFKEFDEGLLQRIPEISFEDDMNDIPSPPDVSNYLVPEDDAFTSNGNRDPLSFDGMAD 468

Query: 815  AEVERRLKDXXXXXXXXXXXXXXXXXTVSLDSRLTSSLAFTVASSMTISQPAPQASIPSF 636
            AEVE+RLK+                   +LD+RL   L +T+ASS +I  P  Q ++ +F
Sbjct: 469  AEVEKRLKE------AISISSAFPSTVANLDARLVPPLQYTMASSSSIPVPTSQPAVVTF 522

Query: 635  HTNLFPQAGPLARSLCNIGPQDLGLHSSPAQEEGEVPESELDPDTRRRLLILQHGQDMRE 456
             +   PQA PL + L  + P +  L SSPA+EEGEVPESELDPDTRRRLLILQHGQD+R+
Sbjct: 523  PSMQLPQAAPLVKPLGQVVPSEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDLRD 582

Query: 455  SISAEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAQSLGS 276
               +E                                                  QS G+
Sbjct: 583  PAPSE--------------------------------SPFPVRPSNSMQVSVPRVQSRGN 610

Query: 275  WFPVEDHMSSGPMSRLPPKDFPVAPEGVHVEKHRP-LPPFPRKVENSVWPDRNFSEKQRL 99
            W PVE+ MS   ++R   ++FP+  E +H++KHRP  P F  KVE+S+  +R   E QRL
Sbjct: 611  WVPVEEEMSPRQLNRAVTREFPMDTEPMHIDKHRPHHPSFFPKVESSIPSERMPHENQRL 670

Query: 98   PREAPRRDDRLRPNYSFPSHQSFRGDEITLSR 3
            P+ AP +DDRLR N +  ++QS  G+E +LSR
Sbjct: 671  PKVAPYKDDRLRLNQTMSNYQSLSGEENSLSR 702


>ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Fragaria vesca subsp. vesca]
          Length = 955

 Score =  830 bits (2145), Expect = 0.0
 Identities = 448/735 (60%), Positives = 523/735 (71%), Gaps = 1/735 (0%)
 Frame = -2

Query: 2204 VVYDGEIFLGEVEIYFENNNFNYKNDEIEKIVKKGIYISHYSQESERCPPLSVLHTVTAQ 2025
            +VY GE  LGEVE+Y E  N     DE+     K I ISH+SQ SERCPP++VLHT+   
Sbjct: 4    LVYKGEELLGEVEVYPEELNNKKIWDEL-----KEIRISHFSQSSERCPPVAVLHTI--- 55

Query: 2024 SSSSSNGLCFKIMESNNKLLYFQQQQDSQLFALYNTCLRDNKTAVMPLGEQEIHLVAIRS 1845
               SSNG+CFK MES +       Q  S+LF L+++C+ +NKTAVM LG +E+HLVA+ S
Sbjct: 56   ---SSNGVCFK-MESKSSSS--SSQDTSRLFLLHSSCIMENKTAVMNLGVEELHLVAMYS 109

Query: 1844 RRMDGATPCFWGFSVIPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIEALQR 1665
            R      PCFWGFSV  GLY SCLGMLNLRCLGIVFDLDETLIVANT+RSFEDRIE LQR
Sbjct: 110  RNNQKQHPCFWGFSVSSGLYSSCLGMLNLRCLGIVFDLDETLIVANTMRSFEDRIEGLQR 169

Query: 1664 KISVETDHQRITGMIGEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSDNHQT 1485
            KI  E D QRI+GM  E+KRYQ+DK ILKQYAE DQVV+NG+V K Q+EV+PALSD+HQ 
Sbjct: 170  KIQCEVDAQRISGMQAEIKRYQDDKFILKQYAENDQVVENGRVIKTQSEVVPALSDSHQP 229

Query: 1484 IVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAER 1305
            I+RPLIRLQ+KNI+LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAER
Sbjct: 230  IIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAER 289

Query: 1304 DYALEMWRLLDPDSNLIGGRELLERIVCVKSGSKKSLFNVFHGGICHPKMALVIDDRLKV 1125
            DYALEMWRLLDP+SNLI   +LL+RIVCVKSG KKSLFNVF   +CHPKMALVIDDRLKV
Sbjct: 290  DYALEMWRLLDPESNLINANKLLDRIVCVKSGLKKSLFNVFQESLCHPKMALVIDDRLKV 349

Query: 1124 WDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFKEFDEGLLQRISE 945
            WD++DQPRVHVVPAFAPYYAPQAEANN +PVLCVARNVAC+VRGGFF+EFD+ LLQ+I E
Sbjct: 350  WDDRDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACSVRGGFFREFDDSLLQKIPE 409

Query: 944  VTFEDDPKDILSPPDVSNFLVSEDDGSGCDGNKESIGFDGMADAEVERRLKDXXXXXXXX 765
            + +ED+ KD  S PDVSNFLVSEDD S  +GN++ + FDGMADAEVERRLK+        
Sbjct: 410  IFYEDNIKD-FSSPDVSNFLVSEDDASASNGNRDQLPFDGMADAEVERRLKE-------A 461

Query: 764  XXXXXXXXXTVSLDSRLTSSLAFTVASSMTISQPAPQASIPSFHTNLFPQAGPLARSLCN 585
                      VS +    +SL +TV  S T+S P  Q S+  FH   FPQ+  L + L +
Sbjct: 462  TSAAPTVSSAVSNNDPRLASLQYTVPLSSTVSLPTNQPSMMPFHNVQFPQSASLVKPLGH 521

Query: 584  IGPQDLGLHSSPAQEEGEVPESELDPDTRRRLLILQHGQDMRESISAEXXXXXXXXXXXX 405
            +GP DLGLHSSPA+EEGEVPESELDPDTRRRLLILQHGQD RES+ +E            
Sbjct: 522  VGPADLGLHSSPAREEGEVPESELDPDTRRRLLILQHGQDTRESVPSE------------ 569

Query: 404  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAQSLGSWFPVEDHMSSGPMSRLP 225
                                                  QS G WFPVE+ MS   +SR+ 
Sbjct: 570  ----------------------PSFPVRPQVQVSVPRVQSRGGWFPVEEEMSPRKLSRMV 607

Query: 224  PKDFPVAPEGVHVEKHRP-LPPFPRKVENSVWPDRNFSEKQRLPREAPRRDDRLRPNYSF 48
            PK+ P+  E + +EKHR     F  KVENS+  DR   E QRLP+EA  RD+RLR N + 
Sbjct: 608  PKEPPLNSEPMQIEKHRSHHSAFFPKVENSMPSDRILQENQRLPKEAFHRDNRLRFNQAM 667

Query: 47   PSHQSFRGDEITLSR 3
              + SF G+E  L+R
Sbjct: 668  SGYHSFSGEEPPLNR 682


>ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Citrus sinensis]
          Length = 957

 Score =  830 bits (2143), Expect = 0.0
 Identities = 450/740 (60%), Positives = 523/740 (70%), Gaps = 2/740 (0%)
 Frame = -2

Query: 2216 MVKTVVYDGEIFLGEVEIYFENNNFNYKNDEIEKIVKKGIYISHYSQESERCPPLSVLHT 2037
            M KTV Y G+  LGEVEIY +      + +E  K V   I IS++S+ SERCPPL+VLHT
Sbjct: 1    MYKTVAYLGKEILGEVEIYPQQQGEGGEGEEKNKKVFDEIRISYFSEASERCPPLAVLHT 60

Query: 2036 VTAQSSSSSNGLCFKIMESNNKLLYFQQQQDSQLFALYNTCLRDNKTAVMPLG-EQEIHL 1860
            +TA      +G+CFK MES +         + QL  L+++C+R+NKTAVMPLG  +E+HL
Sbjct: 61   ITA------SGICFK-MESKSS-------DNIQLHLLHSSCIRENKTAVMPLGLTEELHL 106

Query: 1859 VAIRSRRMDGATPCFWGFSVIPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRI 1680
            VA+ SR  +   PCFW FSV  GLY SCL MLNLRCLGIVFDLDETLIVANT+RSFEDRI
Sbjct: 107  VAMYSRNNEKQYPCFWAFSVGSGLYNSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRI 166

Query: 1679 EALQRKISVETDHQRITGMIGEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALS 1500
            EAL RKIS E D QRI GM  EVKRYQ+DK ILKQYAE DQV +NGKV K+Q+EV+PALS
Sbjct: 167  EALLRKISTEVDPQRIAGMQAEVKRYQDDKNILKQYAENDQVNENGKVIKVQSEVVPALS 226

Query: 1499 DNHQTIVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVC 1320
            D+HQ +VRPLIRLQ+KNI+LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVC
Sbjct: 227  DSHQALVRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVC 286

Query: 1319 TMAERDYALEMWRLLDPDSNLIGGRELLERIVCVKSGSKKSLFNVFHGGICHPKMALVID 1140
            TMAERDYALEMWRLLDP+SNLI  +ELL+RIVCVKSGS+KSLFNVF  G CHPKMALVID
Sbjct: 287  TMAERDYALEMWRLLDPESNLINTKELLDRIVCVKSGSRKSLFNVFQDGTCHPKMALVID 346

Query: 1139 DRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFKEFDEGLL 960
            DRLKVWD+KDQPRVHVVPAFAPYYAPQAEANN IPVLCVARN+ACNVRGGFFKEFDEGLL
Sbjct: 347  DRLKVWDDKDQPRVHVVPAFAPYYAPQAEANNAIPVLCVARNIACNVRGGFFKEFDEGLL 406

Query: 959  QRISEVTFEDDPKDILSPPDVSNFLVSEDDGSGCDGNKESIGFDGMADAEVERRLKDXXX 780
            QRI E+++EDD KDI SPPDVSN+LVSEDD +  +G K+ + FDGMADAEVERRLK+   
Sbjct: 407  QRIPEISYEDDVKDIPSPPDVSNYLVSEDDAATANGIKDPLSFDGMADAEVERRLKE--- 463

Query: 779  XXXXXXXXXXXXXXTVSLDSRLTSSLAFTVASSMTISQPAPQASIPSFHTNLFPQAGPLA 600
                            +LD RL        +SS T + P  QA++       FP A  L 
Sbjct: 464  ---AIAASATISSAVANLDPRLAPFQYTMPSSSSTTTLPTSQAAVMPLANMQFPPATSLV 520

Query: 599  RSLCNIGPQDLGLHSSPAQEEGEVPESELDPDTRRRLLILQHGQDMRESISAEXXXXXXX 420
            + L ++GP +  L SSPA+EEGEVPESELDPDTRRRLLILQHG D RE+  +E       
Sbjct: 521  KPLGHVGPPEQSLQSSPAREEGEVPESELDPDTRRRLLILQHGMDTRENAPSE------- 573

Query: 419  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAQSLGSWFPVEDHMSSGP 240
                                                        S GSWFPVE+ MS   
Sbjct: 574  ---------------------------APFPARTQMQVSVPRVPSRGSWFPVEEEMSPRQ 606

Query: 239  MSRLPPKDFPVAPEGVHVEKHR-PLPPFPRKVENSVWPDRNFSEKQRLPREAPRRDDRLR 63
            ++R  PK+FP+  E + +EKHR P P F  K+EN    DR   E QR+P+EA RRDDRLR
Sbjct: 607  LNRAVPKEFPLNSEAMQIEKHRPPHPSFFPKIENPSTSDRP-HENQRMPKEALRRDDRLR 665

Query: 62   PNYSFPSHQSFRGDEITLSR 3
             N++   +QSF G+EI LSR
Sbjct: 666  LNHTLSDYQSFSGEEIPLSR 685


>ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citrus clementina]
            gi|557551913|gb|ESR62542.1| hypothetical protein
            CICLE_v10014168mg [Citrus clementina]
          Length = 957

 Score =  825 bits (2131), Expect = 0.0
 Identities = 449/740 (60%), Positives = 523/740 (70%), Gaps = 2/740 (0%)
 Frame = -2

Query: 2216 MVKTVVYDGEIFLGEVEIYFENNNFNYKNDEIEKIVKKGIYISHYSQESERCPPLSVLHT 2037
            M KTV Y G+  LGEVEIY +      + +E  K V   I IS++S+ SERCPPL+VLHT
Sbjct: 1    MYKTVAYLGKEILGEVEIYPQQQGEGGEGEEKNKKVFDEIRISYFSEASERCPPLAVLHT 60

Query: 2036 VTAQSSSSSNGLCFKIMESNNKLLYFQQQQDSQLFALYNTCLRDNKTAVMPLG-EQEIHL 1860
            +TA      +G+CFK MES +         + QL  L+++C+R+NKTAVM LG  +E+HL
Sbjct: 61   ITA------SGICFK-MESKSS-------DNVQLHLLHSSCIRENKTAVMLLGLTEELHL 106

Query: 1859 VAIRSRRMDGATPCFWGFSVIPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRI 1680
            VA+ SR  +   PCFW FSV  GLY SCL MLNLRCLGIVFDLDETLIVANT+RSFEDRI
Sbjct: 107  VAMYSRNNEKQYPCFWAFSVGSGLYNSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRI 166

Query: 1679 EALQRKISVETDHQRITGMIGEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALS 1500
            EAL RKIS E D QRI GM  EVKRYQ+DK ILKQYAE DQV +NGKV K+Q+EV+PALS
Sbjct: 167  EALLRKISTEVDPQRIAGMQAEVKRYQDDKNILKQYAENDQVNENGKVIKVQSEVVPALS 226

Query: 1499 DNHQTIVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVC 1320
            D+HQ +VRPLIRLQ+KNI+LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVC
Sbjct: 227  DSHQALVRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVC 286

Query: 1319 TMAERDYALEMWRLLDPDSNLIGGRELLERIVCVKSGSKKSLFNVFHGGICHPKMALVID 1140
            TMAERDYALEMWRLLDP+SNLI  +ELL+RIVCVKSGS+KSLFNVF  G CHPKMALVID
Sbjct: 287  TMAERDYALEMWRLLDPESNLINTKELLDRIVCVKSGSRKSLFNVFQDGTCHPKMALVID 346

Query: 1139 DRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFKEFDEGLL 960
            DRLKVWDEKDQ RVHVVPAFAPYYAPQAEANN IPVLCVARN+ACNVRGGFFKEFDEGLL
Sbjct: 347  DRLKVWDEKDQSRVHVVPAFAPYYAPQAEANNAIPVLCVARNIACNVRGGFFKEFDEGLL 406

Query: 959  QRISEVTFEDDPKDILSPPDVSNFLVSEDDGSGCDGNKESIGFDGMADAEVERRLKDXXX 780
            QRI E+++EDD K+I SPPDVSN+LVSEDD +  +G K+ + FDGMADAEVERRLK+   
Sbjct: 407  QRIPEISYEDDVKEIPSPPDVSNYLVSEDDAATANGIKDPLSFDGMADAEVERRLKE--- 463

Query: 779  XXXXXXXXXXXXXXTVSLDSRLTSSLAFTVASSMTISQPAPQASIPSFHTNLFPQAGPLA 600
                            +LD RL        +SS T + P  QA++       FP A  L 
Sbjct: 464  ---AIAASATISSAVANLDPRLAPFQYTMPSSSSTTTLPTSQAAVMPLANMQFPPATSLV 520

Query: 599  RSLCNIGPQDLGLHSSPAQEEGEVPESELDPDTRRRLLILQHGQDMRESISAEXXXXXXX 420
            + L ++GP +  L SSPA+EEGEVPESELDPDTRRRLLILQHG D RE+  +E       
Sbjct: 521  KPLGHVGPPEQCLQSSPAREEGEVPESELDPDTRRRLLILQHGMDTRENAPSE------- 573

Query: 419  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAQSLGSWFPVEDHMSSGP 240
                                                        S GSWFPVE+ MS   
Sbjct: 574  ---------------------------APFPARTQMQVSVPRVPSRGSWFPVEEEMSPRQ 606

Query: 239  MSRLPPKDFPVAPEGVHVEKHR-PLPPFPRKVENSVWPDRNFSEKQRLPREAPRRDDRLR 63
            ++R  PK+FP+  E + +EKHR P P F  K+ENS+  DR   E QR+P+EA RRDDRLR
Sbjct: 607  LNRAVPKEFPLNSEAMQIEKHRPPHPSFFPKIENSITSDRP-HENQRMPKEALRRDDRLR 665

Query: 62   PNYSFPSHQSFRGDEITLSR 3
             N++   +QSF G+EI LSR
Sbjct: 666  LNHTLSDYQSFSGEEIPLSR 685


>gb|ESW31299.1| hypothetical protein PHAVU_002G226900g [Phaseolus vulgaris]
          Length = 964

 Score =  822 bits (2122), Expect = 0.0
 Identities = 459/745 (61%), Positives = 522/745 (70%), Gaps = 7/745 (0%)
 Frame = -2

Query: 2216 MVKTVVYDGEIFLGEVEIYFENNNFNYKNDEIEKIVKKGIYISHYSQESERCPPLSVLHT 2037
            M K+VVY GE+ LGEVE+Y E NN  YKN  +     K I ISH+SQ SERCPPL+VLHT
Sbjct: 1    MYKSVVYQGEVVLGEVEVYPEENN--YKNFHV-----KEIRISHFSQPSERCPPLAVLHT 53

Query: 2036 VTAQSSSSSNGLCFKIMESNNKLLYFQQQQDSQLFALYNTCLRDNKTAVMPLGEQEIHLV 1857
            VT      S G+CFK MES        QQQD  LF L++ C+R+NKTAV+PLG +EIHLV
Sbjct: 54   VT------SCGVCFK-MESKT------QQQDG-LFHLHSLCIRENKTAVIPLGGEEIHLV 99

Query: 1856 AIRSRRMDGATPCFWGFSVIPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIE 1677
            A+ SR  D   P FWGF V  GLY+SCL MLNLRCLGIVFDLDETLIVANT+RSFEDRI+
Sbjct: 100  AMHSRNDD--RPRFWGFIVALGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRID 157

Query: 1676 ALQRKISVETDHQRITGMIGEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSD 1497
            ALQRKI+ E D QRI+GM  EVKRYQEDK ILKQYAE DQVVDNG+V K+Q+E++PALSD
Sbjct: 158  ALQRKINSEVDPQRISGMQAEVKRYQEDKNILKQYAENDQVVDNGRVVKVQSEIVPALSD 217

Query: 1496 NHQTIVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT 1317
            NHQ IVRPLIRLQDKNI+LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT
Sbjct: 218  NHQPIVRPLIRLQDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT 277

Query: 1316 MAERDYALEMWRLLDPDSNLIGGRELLERIVCVKSGSKKSLFNVFHGGICHPKMALVIDD 1137
            MAERDYALEMWRLLDPDSNLI  +ELL RIVCVKSG KKSLFNVF  G+CHPKMALVIDD
Sbjct: 278  MAERDYALEMWRLLDPDSNLINSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDD 337

Query: 1136 RLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFKEFDEGLLQ 957
            RLKVWDEKDQPRVHVVPAFAPYYAPQAEA+N+IPVLCVARNVACNVRGGFFKEFD+GLLQ
Sbjct: 338  RLKVWDEKDQPRVHVVPAFAPYYAPQAEASNSIPVLCVARNVACNVRGGFFKEFDDGLLQ 397

Query: 956  RISEVTFEDDPKDILSPPDVSNFLVSEDDGSGC--DGNKESIGFDGMADAEVERRLKDXX 783
            +I +V +EDD KDI  PPDVSN+LVSEDDGS    +GN++   FD M DAEVER+ K   
Sbjct: 398  KIPQVAYEDDIKDIPIPPDVSNYLVSEDDGSSAISNGNRDPFLFDSMGDAEVERKSKVPT 457

Query: 782  XXXXXXXXXXXXXXXTV---SLDSRLTSSLAFTVASSMTISQPAPQASIPSFHTNLFPQA 612
                            V   +LD RLT SL + + SS +   P  QAS+  F    FPQ 
Sbjct: 458  RAPNEHDALSAASTIPVTTANLDPRLT-SLQYAMVSSGSAPPPTAQASMMPFTHVQFPQP 516

Query: 611  GPLARSLCNIGPQDLGLHSSPAQEEGEVPESELDPDTRRRLLILQHGQDMRESISAEXXX 432
              L + +    P +  LHSSPA+EEGEVPESELDPDTRRRLLILQHGQD R+  S E   
Sbjct: 517  AALVKPMGQAAPSESSLHSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTSNE--- 573

Query: 431  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAQSLGSWFPVEDHM 252
                                                            S G WFP E+ +
Sbjct: 574  -------------------------------PTYAIRHPVPVSAPRVSSRGGWFPAEEDI 602

Query: 251  SSGPMSRLPPKDFPVAPEGVHVEKHRP-LPPFPRKVENSVWPDRNFSEK-QRLPREAPRR 78
             S P++R+ PK+F V    + +EKHRP  P F  KVE+S+  DR   +  QRLP+E   R
Sbjct: 603  GSQPLNRVVPKEFSVDSGSLVIEKHRPHHPSFFSKVESSISSDRILHDSHQRLPKEMYHR 662

Query: 77   DDRLRPNYSFPSHQSFRGDEITLSR 3
            DDR R N+   S++S   DEI  SR
Sbjct: 663  DDRPRSNHMLSSYRSLSVDEIPFSR 687


>ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Populus trichocarpa]
            gi|550327613|gb|ERP55122.1| hypothetical protein
            POPTR_0011s04910g [Populus trichocarpa]
          Length = 990

 Score =  818 bits (2113), Expect = 0.0
 Identities = 446/759 (58%), Positives = 527/759 (69%), Gaps = 21/759 (2%)
 Frame = -2

Query: 2216 MVKTVVYDGEIFLGEVEIYF------ENNNFNYKNDEIEKIVKKGIYISHYSQESERCPP 2055
            M K+VVY GE  LGEVEIY       E  N N +   I++IVK GI ISH+SQ SERCPP
Sbjct: 1    MYKSVVYKGEELLGEVEIYAQEQQQEEEENKNKRKRVIDEIVK-GIRISHFSQASERCPP 59

Query: 2054 LSVLHTVTAQSSSSSNGLCFKIMESN-NKLLYFQQQQDSQLFALYNTCLRDNKTAVMPLG 1878
            L+VLHT+T      S G+CFK+ ES  +       QQ+S L  L+++C+++NKTAVM LG
Sbjct: 60   LAVLHTIT------SIGVCFKMEESTASSSTKISSQQESPLRLLHSSCIQENKTAVMLLG 113

Query: 1877 EQEIHLVAIRSRRMDGATPCFWGFSVIPGLYESCLGMLNLRCLGIVFDLDETLIVANTLR 1698
             +E+HLVA+ SR  +   PCFWGF+V  GLY+SCL MLNLRCLGIVFDLDETLIVANT+R
Sbjct: 114  GEELHLVAMPSRSNERKHPCFWGFNVASGLYDSCLVMLNLRCLGIVFDLDETLIVANTMR 173

Query: 1697 SFEDRIEALQRKISVETDHQRITGMIGEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAE 1518
            SFED+IEALQ+KIS E D QRI  +I E+KRYQ+DK ILKQY E DQV++NGKV K Q E
Sbjct: 174  SFEDKIEALQKKISTEVDQQRILAIISEIKRYQDDKIILKQYVENDQVIENGKVIKTQFE 233

Query: 1517 VIPALSDNHQTIVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKR 1338
            V+PA SDNHQ +VRPLIRL +KNI+ TRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKR
Sbjct: 234  VVPAASDNHQPLVRPLIRLPEKNIIFTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKR 293

Query: 1337 FEVYVCTMAERDYALEMWRLLDPDSNLIGGRELLERIVCVKSGSKKSLFNVFHGGICHPK 1158
            FEVYVCTMAERDYALEMWRLLDP+SNLI   ELL+RIVCV SGS+KSLFNVF  GICHPK
Sbjct: 294  FEVYVCTMAERDYALEMWRLLDPESNLINSNELLDRIVCVSSGSRKSLFNVFQDGICHPK 353

Query: 1157 MALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFKE 978
            MALVIDDR+ VWDEKDQ RVHVVPAFAPYYAPQAEANN +P+LCVARNVACNVRGGFFKE
Sbjct: 354  MALVIDDRMNVWDEKDQSRVHVVPAFAPYYAPQAEANNAVPILCVARNVACNVRGGFFKE 413

Query: 977  FDEGLLQRISEVTFEDDPKDILSPPDVSNFLVSEDDGSGCDGNKESIGFDGMADAEVERR 798
            FDEGLLQ+I EV +EDD  +I SPPDVSN+LVSEDD S  +GN++   FD  ADAEVERR
Sbjct: 414  FDEGLLQKIPEVAYEDDTSNIPSPPDVSNYLVSEDDASAANGNRDPPSFDSTADAEVERR 473

Query: 797  LKDXXXXXXXXXXXXXXXXXTVSLDSRLTSSLAFTVASS---MTISQ----------PAP 657
            LK+                   SLD RL  SL + VASS   M  SQ          PA 
Sbjct: 474  LKEAVSASSTIPSTIPSTVS--SLDPRLLQSLQYAVASSSSLMPASQPSMLASQQPVPAS 531

Query: 656  QASIPSFHTNLFPQAGPLARSLCNIGPQDLGLHSSPAQEEGEVPESELDPDTRRRLLILQ 477
            Q S+  F    FPQ  PL + L  +   +  L SSPA+EEGEVPESELDPDTRRRLLILQ
Sbjct: 532  QTSMMPFPNTQFPQVAPLVKQLGQVVHPEPSLQSSPAREEGEVPESELDPDTRRRLLILQ 591

Query: 476  HGQDMRESISAEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 297
            HGQD R++  +E                                                
Sbjct: 592  HGQDSRDNAPSE----------------------------------SPFPARPSAPVSAA 617

Query: 296  XAQSLGSWFPVEDHMSSGPMSRLPPKDFPVAPEGVHVEKHRP-LPPFPRKVENSVWPDRN 120
              QS GSW PVE+ M+   ++R  P++FP+  + +++EKH+   P F  KVE+++  DR 
Sbjct: 618  HVQSRGSWVPVEEEMTPRQLNR-TPREFPLDSDPMNIEKHQTHHPSFFPKVESNIPSDRM 676

Query: 119  FSEKQRLPREAPRRDDRLRPNYSFPSHQSFRGDEITLSR 3
              E QRLP+EAP R+DR+R N+S P++ SF+ +E  LSR
Sbjct: 677  IHENQRLPKEAPYRNDRMRLNHSTPNYHSFQVEETPLSR 715


>ref|XP_004505032.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Cicer arietinum]
          Length = 951

 Score =  807 bits (2085), Expect = 0.0
 Identities = 446/743 (60%), Positives = 515/743 (69%), Gaps = 5/743 (0%)
 Frame = -2

Query: 2216 MVKTVVYDGEIFLGEVEIYFE--NNNFNYKNDEIEKIVKKGIYISHYSQESERCPPLSVL 2043
            M K++VY GE+ LGEV+IY E  NNN N+K           I ISH++Q SERC PL+VL
Sbjct: 1    MYKSLVYQGEVVLGEVDIYPEVNNNNKNFKE----------IRISHFTQPSERCLPLAVL 50

Query: 2042 HTVTAQSSSSSNGLCFKIMESNNKLLYFQQQQDSQLFALYNTCLRDNKTAVMPLGEQEIH 1863
            HT+T      S+G+CFK MES         QQ   LF L+N C R+NKTAVMPL  +E+H
Sbjct: 51   HTIT------SSGVCFK-MESKT-------QQQDPLFHLHNLCFRENKTAVMPLCGEEMH 96

Query: 1862 LVAIRSRRMDGATPCFWGFSVIPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDR 1683
            LVA+ SR      PCFWG+ V  GLY SCL MLNLRCLGIVFDLDETLIVANT+RSFEDR
Sbjct: 97   LVAMHSR--SNGRPCFWGYIVGMGLYNSCLMMLNLRCLGIVFDLDETLIVANTMRSFEDR 154

Query: 1682 IEALQRKISVETDHQRITGMIGEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPAL 1503
            I+ALQRKI+ E D QRI+GM  EVKRY EDK+ILKQY E DQVVDNGKV K Q+E++PAL
Sbjct: 155  IDALQRKINSEVDPQRISGMQAEVKRYLEDKSILKQYVENDQVVDNGKVLKAQSELVPAL 214

Query: 1502 SDNHQTIVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 1323
            SD+HQ IVRPLIRL +KNI+LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV
Sbjct: 215  SDSHQPIVRPLIRLHEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 274

Query: 1322 CTMAERDYALEMWRLLDPDSNLIGGRELLERIVCVKSGSKKSLFNVFHGGICHPKMALVI 1143
            CTMAERDYALEMWRLLDPDSNLI  +ELL RIVCVKSG KKSLFNVF  G+CHPKMALVI
Sbjct: 275  CTMAERDYALEMWRLLDPDSNLINSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVI 334

Query: 1142 DDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFKEFDEGL 963
            DDRLKVWDEKDQPRVHVVPAFAPYYAPQAEA+NTIPVLCVARNVACNVRGGFFK+FD+GL
Sbjct: 335  DDRLKVWDEKDQPRVHVVPAFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGL 394

Query: 962  LQRISEVTFEDDPKDILSPPDVSNFLVSEDDGSGCDGNKESIGFDGMADAEVERRLKDXX 783
            LQ+IS++ +E++ +DI   PDVSN+LVSEDDGS    N++   FDGMADAEVER+LKD  
Sbjct: 395  LQKISQIAYENNTRDISPAPDVSNYLVSEDDGSASYANRDPFAFDGMADAEVERKLKD-- 452

Query: 782  XXXXXXXXXXXXXXXTVSLDSRLTSSLAFTVASSMTISQPAPQAS-IPSFHTNLFPQAGP 606
                           T  LD RLTSSL +T+ S  ++  PA QAS IP  HT  FPQ   
Sbjct: 453  ----AISAASAIPMTTAKLDPRLTSSLQYTMVSPGSVLPPAAQASMIPLPHTQ-FPQPAT 507

Query: 605  LARSLCNIGPQDLGLHSSPAQEEGEVPESELDPDTRRRLLILQHGQDMRESISAEXXXXX 426
            L + +  + P +L LHSSPA+EEGEVPESELDPDTRRRLLILQHGQD R+  S+E     
Sbjct: 508  LVKPIGQVAPSELSLHSSPAREEGEVPESELDPDTRRRLLILQHGQDNRDHTSSEPPFPL 567

Query: 425  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAQSLGSWFPVEDHMSS 246
                                                            G WFPVE+ + S
Sbjct: 568  KHPVQVSARVPPR-----------------------------------GGWFPVEEEIGS 592

Query: 245  GPMSRLPPKDFPVAPEGVHVEKHR-PLPPFPRKVENSVWPDRNFSE-KQRLPREAPRRDD 72
             P +R+ PK+  +      +EKHR    PF  KV+ S+  DR   E  QRLP+E   RDD
Sbjct: 593  QPPNRVIPKEIALDSGPSRIEKHRLHQQPFFPKVDGSISSDRALHETNQRLPKEMYHRDD 652

Query: 71   RLRPNYSFPSHQSFRGDEITLSR 3
            R R ++   S+ S  GD+    R
Sbjct: 653  RSRVSHMLSSYPSLSGDDTPFGR 675


>emb|CAN72816.1| hypothetical protein VITISV_004100 [Vitis vinifera]
          Length = 894

 Score =  806 bits (2082), Expect = 0.0
 Identities = 439/740 (59%), Positives = 512/740 (69%), Gaps = 2/740 (0%)
 Frame = -2

Query: 2216 MVKTVVYDGEIFLGEVEIYFENNNFNYKNDEIEKIVKKGIYISHYSQESERCPPLSVLHT 2037
            M K++VY+G+  +GEVEIY +N             + K I ISHYSQ SERCPPL+VLHT
Sbjct: 1    MYKSIVYEGDDVVGEVEIYPQNQGLE---------LMKEIRISHYSQPSERCPPLAVLHT 51

Query: 2036 VTAQSSSSSNGLCFKIMESNNKLLYFQQQQDSQLFALYNTCLRDNKTAVMPLGEQEIHLV 1857
            +T      S G+CFK MES+       Q QD+ L+ L++TC+R+NKTAVM LGE+E+HLV
Sbjct: 52   IT------SCGVCFK-MESSKA-----QSQDTPLYLLHSTCIRENKTAVMSLGEEELHLV 99

Query: 1856 AIRSRRMDGATPCFWGFSVIPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIE 1677
            A+ S++ DG  PCFWGF+V  GLY SCL MLNLRCLGIVFDLDETLIVANT+RSFEDRI+
Sbjct: 100  AMYSKKKDGQYPCFWGFNVALGLYSSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRID 159

Query: 1676 ALQRKISVETDHQRITGMIGEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSD 1497
            ALQRKI+ E D QRI+GM+ EV                   V+NGK+ K Q E++PALSD
Sbjct: 160  ALQRKINTEVDPQRISGMVAEV-------------------VENGKLFKTQPEIVPALSD 200

Query: 1496 NHQTIVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT 1317
            NHQ IVRPLIRLQ+KNI+LTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT
Sbjct: 201  NHQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT 260

Query: 1316 MAERDYALEMWRLLDPDSNLIGGRELLERIVCVKSGSKKSLFNVFHGGICHPKMALVIDD 1137
            MAERDYALEMWRLLDP+SNLI  +ELL+RIVCVKSGS+KSLFNVF  GICHPKMALVIDD
Sbjct: 261  MAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVIDD 320

Query: 1136 RLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFKEFDEGLLQ 957
            RLKVWDEKDQPRVHVVPAFAPYYAPQAEANN I VLCVARNVACNVRGGFFKEFDEGLLQ
Sbjct: 321  RLKVWDEKDQPRVHVVPAFAPYYAPQAEANNAISVLCVARNVACNVRGGFFKEFDEGLLQ 380

Query: 956  RISEVTFEDDPKDILSPPDVSNFLVSEDDGSGCDGNKESIGFDGMADAEVERRLKDXXXX 777
            RI E+++ED+ KDI S PDVSN+LVSEDD S  +GN++   FDGMAD EVER+LKD    
Sbjct: 381  RIPEISYEDBIKDIRSAPDVSNYLVSEDDASVSNGNRDQPCFDGMADVEVERKLKD---- 436

Query: 776  XXXXXXXXXXXXXTVSLDSRLTSSLAFTVASSMTIS-QPAPQASIPSFHTNLFPQAGPLA 600
                           SLD RL+  L F VA+S  ++ QPA Q SI  F    FPQ+  L 
Sbjct: 437  ------AISAPSTVTSLDPRLSPPLQFAVAASSGLAPQPAAQGSIMPFSNKQFPQSASLI 490

Query: 599  RSLCNIGPQDLGLHSSPAQEEGEVPESELDPDTRRRLLILQHGQDMRESISAEXXXXXXX 420
            + L      +  + SSPA+EEGEVPESELDPDTRRRLLILQHGQD RE  S++       
Sbjct: 491  KPLA----PEPTMQSSPAREEGEVPESELDPDTRRRLLILQHGQDTREHASSD------- 539

Query: 419  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAQSLGSWFPVEDHMSSGP 240
                                                       QS GSWFP ++ MS   
Sbjct: 540  ---------------------------PPFPVRPPIQVSVPRVQSRGSWFPADEEMSPRQ 572

Query: 239  MSRLPPKDFPVAPEGVHVEKHRP-LPPFPRKVENSVWPDRNFSEKQRLPREAPRRDDRLR 63
            ++R  PK+FP+  + +H+EKHRP  P F  KVE+S   DR   E QRL +E   RDDRLR
Sbjct: 573  LNRAVPKEFPLDSDTMHIEKHRPHHPSFFHKVESSASSDRILHENQRLSKEVLHRDDRLR 632

Query: 62   PNYSFPSHQSFRGDEITLSR 3
             N+S P + SF G+E+ L R
Sbjct: 633  LNHSLPGYHSFSGEEVPLGR 652


>ref|XP_004134718.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Cucumis sativus] gi|449479317|ref|XP_004155567.1|
            PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II
            C-terminal domain phosphatase-like 1-like [Cucumis
            sativus]
          Length = 803

 Score =  805 bits (2079), Expect = 0.0
 Identities = 442/741 (59%), Positives = 523/741 (70%), Gaps = 3/741 (0%)
 Frame = -2

Query: 2216 MVKTVVYDGEIFLGEVEIYFENNNFNYKNDEIEKIVKKGIYISHYSQESERCPPLSVLHT 2037
            M K+VVY G+  LG+VEIY E  N  YKN E+     K I I+H+SQ SERCPPL+VLHT
Sbjct: 1    MYKSVVYHGDELLGDVEIYPEEKN-GYKNIEV-----KEIRITHFSQPSERCPPLAVLHT 54

Query: 2036 VTAQSSSSSNGLCFKIMESNNKLLYFQQQQDSQLFALYNTCLRDNKTAVMPLGEQEIHLV 1857
            + A      +G+CFK MES        Q QD+ L  L+++C+ +NKTA+M  G +E+HLV
Sbjct: 55   IAA------SGICFK-MESKTS-----QSQDTPLNLLHSSCIMENKTAIMMFGVEELHLV 102

Query: 1856 AIRSRRMDGATPCFWGFSVIPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIE 1677
            A+ SR +D   PCFWGF+V  GLY SCL MLNLRCLGIVFDLDETL+VANT+RSFEDRIE
Sbjct: 103  AMFSRDLDKQYPCFWGFNVAMGLYNSCLDMLNLRCLGIVFDLDETLVVANTMRSFEDRIE 162

Query: 1676 ALQRKISVETDHQRITGMIGEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSD 1497
            ALQRKIS E D QR  GM+ EVKRYQ+DK ILKQYAE DQV++NGKV K Q+EV+PALSD
Sbjct: 163  ALQRKISSEVDPQRANGMLAEVKRYQDDKIILKQYAENDQVIENGKVIKSQSEVVPALSD 222

Query: 1496 NHQTIVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT 1317
            NHQ +VRPLIRL +KNI+LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT
Sbjct: 223  NHQPVVRPLIRLHEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT 282

Query: 1316 MAERDYALEMWRLLDPDSNLIGGRELLERIVCVKSGSKKSLFNVFHGGICHPKMALVIDD 1137
            MAERDYALEMWRLLDPDSNLI  +ELL+RIVCVKSGS+KSLFNVF  G CHPKMALVIDD
Sbjct: 283  MAERDYALEMWRLLDPDSNLINPKELLDRIVCVKSGSRKSLFNVFQDGFCHPKMALVIDD 342

Query: 1136 RLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFKEFDEGLLQ 957
            RLKVWDEKDQPRVHVVPAFAPYYAP AE NN IPVLCVARNVACNVRGGFFKEFD+ LLQ
Sbjct: 343  RLKVWDEKDQPRVHVVPAFAPYYAPNAEGNNAIPVLCVARNVACNVRGGFFKEFDDILLQ 402

Query: 956  RISEVTFEDDPKDILSPPDVSNFLVSEDDGSGCDGNKESIGFDGMADAEVERRLKDXXXX 777
            +IS++++EDD  DI SPPDVSN+LVSED+ S  +GNK+   FDGM D EV+RR+KD    
Sbjct: 403  KISDISYEDDVNDIPSPPDVSNYLVSEDEYSIANGNKDMPTFDGMPDMEVDRRMKDAFLA 462

Query: 776  XXXXXXXXXXXXXTVSLDSRLTSSLAFTVAS-SMTISQPAPQASIPSFHTNLFPQAGPLA 600
                           S D R+ SSL +T+AS S ++  P  Q ++P F     P      
Sbjct: 463  SSTIN----------SADPRV-SSLQYTMASASCSVPLPPKQVTMPYFPNMPLPH----V 507

Query: 599  RSLCNIGPQDLGLHSSPAQEEGEVPESELDPDTRRRLLILQHGQDMRESISAEXXXXXXX 420
             S+ ++ P +  L SSPA+EEGEVPESELDPDTRRRLLILQHGQD RE +S+E       
Sbjct: 508  NSVAHVAPNEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRERLSSE------- 560

Query: 419  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAQSLGSWFPVEDHMSSGP 240
                                                      AQS G+W P+E+ MS   
Sbjct: 561  -------------------------PAFPARPPPLQQVAAPRAQSRGNWSPMEEEMSPRQ 595

Query: 239  MSRLPPKDFPVAPEGVHV-EKHRP-LPPFPRKVENSVWPDRNFSEKQRLPREAPRRDDRL 66
            ++R   KDFPV  E + + EKHR   P F  KV+NS+ PDR   + QRLP+EA  RDDR+
Sbjct: 596  LNRSARKDFPVDAEPMPMREKHRSNHPSFFAKVDNSILPDRIPHDNQRLPKEAFYRDDRM 655

Query: 65   RPNYSFPSHQSFRGDEITLSR 3
            R +    S+ +F G+EI +++
Sbjct: 656  RVSRRPSSYPAFSGEEIPMNQ 676


>ref|XP_006597421.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X3 [Glycine max]
          Length = 932

 Score =  797 bits (2059), Expect = 0.0
 Identities = 440/742 (59%), Positives = 517/742 (69%), Gaps = 5/742 (0%)
 Frame = -2

Query: 2216 MVKTVVYDGEIFLGEVEIYFENNNFNYKNDEIEKIVKKGIYISHYSQESERCPPLSVLHT 2037
            M +++VY GE+ +GEVEIY E         E + I  K I ISH+SQ SERCPPL+VLHT
Sbjct: 1    MKRSMVYHGEMEVGEVEIYPE---------EKKNIDLKEIRISHFSQPSERCPPLAVLHT 51

Query: 2036 VTAQSSSSSNGLCFKIMESNNKLLYFQQQQDSQLFALYNTCLRDNKTAVMPLGEQEIHLV 1857
            +T      S G+CFK+  S ++     +QQ   LF L+++C+R+NKTAVMPL  +EIHLV
Sbjct: 52   IT------SFGICFKMESSTSQT----RQQQDVLFHLHSSCIRENKTAVMPLRGEEIHLV 101

Query: 1856 AIRSRRMDGATPCFWGFSVIPGLYESCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIE 1677
            A+ SR  D   PCFWGF V  GLY SCL MLNLRCLGIVFDLDETL+VANT+RSFED+IE
Sbjct: 102  AMYSRNND--RPCFWGFIVASGLYNSCLTMLNLRCLGIVFDLDETLVVANTMRSFEDKIE 159

Query: 1676 ALQRKISVETDHQRITGMIGEVKRYQEDKAILKQYAETDQVVDNGKVHKIQAEVIPALSD 1497
             L RK++ E + QRI+ M  E+KRY +DK ILK+YAE DQVVDNGKV KIQ+E++PALSD
Sbjct: 160  VLHRKMNSEVNPQRISTMQAEIKRYLDDKNILKEYAENDQVVDNGKVIKIQSEIVPALSD 219

Query: 1496 NHQTIVRPLIRLQDKNIVLTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT 1317
            +HQ IVRPLIRLQ+KNI+LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCT
Sbjct: 220  SHQPIVRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCT 279

Query: 1316 MAERDYALEMWRLLDPDSNLIGGRELLERIVCVKSGSKKSLFNVFHGGICHPKMALVIDD 1137
            MAERDYALEMWRLLDP+ NLI  +ELL+RIVCVKSG KKSLFNVF  G+CH KMALVIDD
Sbjct: 280  MAERDYALEMWRLLDPELNLINSKELLDRIVCVKSGLKKSLFNVFQNGLCHLKMALVIDD 339

Query: 1136 RLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFKEFDEGLLQ 957
            RLKVWDEKDQP+VHVVPAFAPYYAPQAEA+N +P LC+AR+VACNVRGGFFK+FD+GLLQ
Sbjct: 340  RLKVWDEKDQPQVHVVPAFAPYYAPQAEASNAVPTLCLARSVACNVRGGFFKDFDDGLLQ 399

Query: 956  RISEVTFEDDPKDILSPPDVSNFLVSEDDGSGCDGNKESIGFDGMADAEVERRLKDXXXX 777
            +I  + +EDD KDI SPPDVSN+LVSEDD S  +GNK  + FDGMADAEVERRLKD    
Sbjct: 400  KIPLIAYEDDIKDIPSPPDVSNYLVSEDDASASNGNKNLLLFDGMADAEVERRLKD---- 455

Query: 776  XXXXXXXXXXXXXTVSLDSRL--TSSLAFT-VASSMTISQPAPQASIPSFHTNLFPQAGP 606
                         T +LD RL   SSL +T V+SS T+  P  QASI  F    FPQ   
Sbjct: 456  --AISASSTVPAMTTNLDPRLAFNSSLQYTMVSSSGTVPPPTAQASIVQFGNVQFPQPNT 513

Query: 605  LARSLCNIGPQDLGLHSSPAQEEGEVPESELDPDTRRRLLILQHGQDMRESISAEXXXXX 426
            L + +C + P    LHSSPA+EEGEVPESELD DTRRRLLILQHGQD RE  S+E     
Sbjct: 514  LVKPICQVTPPGPSLHSSPAREEGEVPESELDLDTRRRLLILQHGQDTREHTSSE----- 568

Query: 425  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAQSLGSWFPVEDHMSS 246
                                                          S   WF VE+ M  
Sbjct: 569  -----------------------------PPLPVRHPTQVSAPSVPSRRGWFSVEEEMGP 599

Query: 245  GPMSRLPPKDFPVAPEGVHVEKHRPL-PPFPRKVENSVWPDRNFSEK-QRLPREAPRRDD 72
              +++L PK+FPV  E +H+EK  P  P    KV++SV  DR F E  QRLP+E   RDD
Sbjct: 600  QQLNQLVPKEFPVGSEPLHIEKRWPRHPSLFSKVDDSVSSDRVFHESHQRLPKEVHHRDD 659

Query: 71   RLRPNYSFPSHQSFRGDEITLS 6
              R + S  S+ SF GD+I LS
Sbjct: 660  HSRLSQSLSSYHSFPGDDIPLS 681

