BLASTX nr result

ID: Ephedra27_contig00013113 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra27_contig00013113
         (2258 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006827806.1| hypothetical protein AMTR_s00009p00267690 [A...   723   0.0  
gb|EOY28304.1| C-terminal domain phosphatase-like 1 isoform 3 [T...   707   0.0  
gb|EOY28303.1| C-terminal domain phosphatase-like 1 isoform 2 [T...   707   0.0  
gb|EOY28302.1| C-terminal domain phosphatase-like 1 isoform 1 [T...   707   0.0  
ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Popu...   698   0.0  
ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal doma...   697   0.0  
ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal doma...   696   0.0  
ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal doma...   695   0.0  
ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal doma...   694   0.0  
ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal doma...   689   0.0  
ref|XP_004232844.1| PREDICTED: RNA polymerase II C-terminal doma...   689   0.0  
ref|XP_002519032.1| double-stranded RNA binding protein, putativ...   689   0.0  
ref|XP_006583810.1| PREDICTED: RNA polymerase II C-terminal doma...   686   0.0  
ref|XP_004134718.1| PREDICTED: RNA polymerase II C-terminal doma...   685   0.0  
ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citr...   684   0.0  
ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Popu...   679   0.0  
ref|XP_004505032.1| PREDICTED: RNA polymerase II C-terminal doma...   676   0.0  
ref|XP_003543063.1| PREDICTED: RNA polymerase II C-terminal doma...   658   0.0  
ref|XP_004976316.1| PREDICTED: RNA polymerase II C-terminal doma...   657   0.0  
ref|XP_006413749.1| hypothetical protein EUTSA_v10024324mg [Eutr...   655   0.0  

>ref|XP_006827806.1| hypothetical protein AMTR_s00009p00267690 [Amborella trichopoda]
            gi|548832426|gb|ERM95222.1| hypothetical protein
            AMTR_s00009p00267690 [Amborella trichopoda]
          Length = 942

 Score =  723 bits (1867), Expect = 0.0
 Identities = 399/775 (51%), Positives = 513/775 (66%), Gaps = 23/775 (2%)
 Frame = -3

Query: 2256 LLINGIRDTIRVSHLSAQSDRCPPLAVLHTISPTGVFIKVESKENQDNSLLKQLYITCLK 2077
            L +  I   IR+SHLS  S+RCPPLAVLHTI+  GV  K+E K     S L  LY TCLK
Sbjct: 25   LNLETITKEIRISHLSLPSERCPPLAVLHTIASCGVCFKLEFKSQSGESPLFSLYNTCLK 84

Query: 2076 EAKAAIVPLQGRELLLVAMLSKSRVDQLPCFWGYDVSSVLYNACLGMLNLRCLAIVFDLD 1897
            + K A++PL   EL LVAM SK++ +   CFWG+ +S  LYN+CL MLNLRCL+IVFDLD
Sbjct: 85   DNKTAVMPLGAEELHLVAMASKNKFELFSCFWGFRISLGLYNSCLAMLNLRCLSIVFDLD 144

Query: 1896 ETLIVANTLRSFEDKIDAIQRKLSAESDPLRVQGMVAEVRRYQEDRAMLKQYADADQIVD 1717
            ETLIVANT+RSFED+IDA+QRK+S+E DP RV GM+AEV+RYQ+D+ +LKQY ++DQ+V+
Sbjct: 145  ETLIVANTMRSFEDRIDALQRKISSEPDPQRVSGMLAEVKRYQDDKTILKQYVESDQVVE 204

Query: 1716 SNGVVLKAVTEFIPPHSDRGQSISRPLIRIQDKNMIFTRINPNIRDTSVLVRLRPAWEEL 1537
             NG V K   E +PP SD  Q+I RPLIR+Q++N+I TRINP IRDTSVLVR+RPAWE+L
Sbjct: 205  -NGKVFKLQNEIVPPLSDSHQAIVRPLIRLQERNIILTRINPVIRDTSVLVRMRPAWEDL 263

Query: 1536 RTYLTLKGRKRFEVYICTMSERDYALEMWRLLDGESRLINHGELLDRVLCVKPGSRKALG 1357
            R+YLT KGRKRFEVY+CTM+ERDYALEMWRLLD E+ LIN  +LLDR++CVK GSRK+L 
Sbjct: 264  RSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPEANLINPRQLLDRIVCVKSGSRKSLL 323

Query: 1356 NVFRVGFCHPKLAMVIDDRLKVWEDRDQPRVHVVPPFAPYYAPQAETSCPVPVLCVARNV 1177
             VF+ G CHPK+AMVIDDRLKVW+D+DQPRVHVVPP+APYYAPQAE +  VPVL VARNV
Sbjct: 324  TVFQDGICHPKMAMVIDDRLKVWDDKDQPRVHVVPPYAPYYAPQAEVNNAVPVLYVARNV 383

Query: 1176 ACNVRGGFFKDFDEVLLQKLSEVVYEFNVESLPPTPDVSNYLLTDDD---LNVNKDLPLP 1006
            ACNVR GFFKDFD+VLL+++ +V YE ++  LP  PD SNYLL++DD   LN NKDLP+P
Sbjct: 384  ACNVRSGFFKDFDDVLLKRIPDVFYEDDISCLPSAPDSSNYLLSEDDSSVLNGNKDLPIP 443

Query: 1005 EGMADSELEKKLGPQETNQVSLPNIIS-SGIENQPVVRL--------CDSTQPTFIPLVA 853
            EGM DSE+E++L        ++P   S +  E +P + L          S  P   P+  
Sbjct: 444  EGMVDSEVERRLKDANFAMQAMPTSTSNNNFERRPTMSLQHVASTSNMISQSPCQGPMSL 503

Query: 852  NHGRIPLAAPXXXXXXXXXXXLPIIPTVKPYMPQSHV-RLDCGIQTSPPREEGEVPESEV 676
            N+ +   A                +P++K   P  H+   D  +Q SP REEGEVPESE+
Sbjct: 504  NNKQYNHA----------------VPSLK---PSGHICSSDSTLQCSPGREEGEVPESEL 544

Query: 675  DPDTRRRLLILQHGQDTGKFNGADPGLP-----ARLQVTAPPNQVPGGWLGAEEEMSPRQ 511
            DPDTRRRLLILQHGQDT +    DP  P       LQ+  PP Q  G W   EEEMSPRQ
Sbjct: 545  DPDTRRRLLILQHGQDTREHGTIDPPPPPFPLRPALQIAVPPAQSHGPWFPVEEEMSPRQ 604

Query: 510  IVRTAQGPAPQQESFGVDKPRFPQTSLYPGSENPLVIDRGLDESKRMMTDEASFSEERLK 331
            +    +    + E+   D+ R      + G +  +  DR  +E++R ++ E  + ++RL 
Sbjct: 605  LSHPLREFPLEPEAVQFDRHR--ARPFFHGVDGSIPADRVFNEAQR-LSKEVQYRDDRLH 661

Query: 330  SNHIIPXXXXXXXXXXXSLNKTSSNCK-----TAQGNLLSASNCISALYKIAEYCNTKVD 166
             N                  ++SSN +     T Q     +   +  L  IA  C +KVD
Sbjct: 662  QNLPKTSYSSFPEVEEMPPGQSSSNTRDVPFATGQVPPQYSPTPVGVLKDIAIKCGSKVD 721

Query: 165  FRSWLSSARELEFSVEVLFDGKKISVAVGKTKKEAQQKASFDALKNMASQYMSQV 1
            FRS +    EL+FSVEV F G+KI   +GKT+KEAQ KAS  +++ +A  Y++Q+
Sbjct: 722  FRSMVVPTTELQFSVEVWFVGEKIGEGIGKTRKEAQFKASEASIRTLARTYLAQI 776


>gb|EOY28304.1| C-terminal domain phosphatase-like 1 isoform 3 [Theobroma cacao]
          Length = 870

 Score =  707 bits (1825), Expect = 0.0
 Identities = 389/761 (51%), Positives = 523/761 (68%), Gaps = 18/761 (2%)
 Frame = -3

Query: 2229 IRVSHLSAQSDRCPPLAVLHTISPTGVFIKVESKEN------QDNSLLKQLYITCLKEAK 2068
            IR+ +L+  S+RCPPLAVLHTI+ +G+  K+ES ++      QD+  L  L+  C+++ K
Sbjct: 52   IRIEYLTQGSERCPPLAVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNK 111

Query: 2067 AAIVPLQGRELLLVAMLSKSRVDQLPCFWGYDVSSVLYNACLGMLNLRCLAIVFDLDETL 1888
             A++P+   EL LVAM S++     PCFWG++VS  LY++CL MLNLRCL IVFDLDETL
Sbjct: 112  TAVMPMGDCELHLVAMYSRN--SDRPCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETL 169

Query: 1887 IVANTLRSFEDKIDAIQRKLSAESDPLRVQGMVAEVRRYQEDRAMLKQYADADQIVDSNG 1708
            IVANT+RSFED+I+A+QRK++ E DP RV GMVAE++RYQ+D+A+LKQYA+ DQ+V+ NG
Sbjct: 170  IVANTMRSFEDRIEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVE-NG 228

Query: 1707 VVLKAVTEFIPPHSDRGQSISRPLIRIQDKNMIFTRINPNIRDTSVLVRLRPAWEELRTY 1528
             V+K  +E +P  SD  Q I RPLIR+Q+KN+I TRINP IRDTSVLVRLRPAWE+LR+Y
Sbjct: 229  KVIKIQSEVVPALSDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSY 288

Query: 1527 LTLKGRKRFEVYICTMSERDYALEMWRLLDGESRLINHGELLDRVLCVKPGSRKALGNVF 1348
            LT +GRKRFEVY+CTM+ERDYALEMWRLLD ES LIN  ELLDR++CVK GSRK+L NVF
Sbjct: 289  LTARGRKRFEVYVCTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVF 348

Query: 1347 RVGFCHPKLAMVIDDRLKVWEDRDQPRVHVVPPFAPYYAPQAETSCPVPVLCVARNVACN 1168
            + G CHPK+A+VIDDRLKVW+++DQPRVHVVP FAPYYAPQAE +  +PVLCVARNVACN
Sbjct: 349  QDGICHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACN 408

Query: 1167 VRGGFFKDFDEVLLQKLSEVVYEFNVESLPPTPDVSNYLLTDDD---LNVNKDLPLPEGM 997
            VRGGFF++FDE LLQ++ E+ YE +++ +P  PDV NYL+++DD   LN NKD  L +GM
Sbjct: 409  VRGGFFREFDEGLLQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGM 468

Query: 996  ADSELEKKLGPQETNQVSLPNIISSGIENQPVVRLCDSTQPTFIPLVANHGRIPLAAPXX 817
            AD+E+E++L       +S  + +SS   N    RL  S Q T   + ++   IP +A   
Sbjct: 469  ADAEVERRL----KEAISATSTVSSAAINLD-PRLTPSLQYT---MPSSSSSIPPSASQP 520

Query: 816  XXXXXXXXXLPI-IPTVKPYMPQSHVRLDCGIQTSPPREEGEVPESEVDPDTRRRLLILQ 640
                      P+  P VKP  P +    +  +Q+SP REEGEVPESE+DPDTRRRLLILQ
Sbjct: 521  SIVSFSNMQFPLAAPVVKPVAPVAVP--EPSLQSSPAREEGEVPESELDPDTRRRLLILQ 578

Query: 639  HGQDTGKFNGADPGLP---ARLQVTAPPNQVPGGWLGAEEEMSPRQIVRTAQGPAP-QQE 472
            HGQDT      +P  P     +QV+ P  Q  G W  AEEEMSPRQ+ R A    P   E
Sbjct: 579  HGQDTRDHTPPEPAFPPVRPTMQVSVPRGQSRGSWFAAEEEMSPRQLNRAAPKEFPLDSE 638

Query: 471  SFGVDKPRFPQTSLYPGSENPLVIDRGLDESKRMMTDEASFSEERLKSNHIIPXXXXXXX 292
               ++K R P    +P  E+ +  DR L E++R ++ EA   ++RL  NH  P       
Sbjct: 639  RMHIEKHRHP--PFFPKVESSIPSDRLLRENQR-LSKEALHRDDRLGLNH-TPSSYHSFS 694

Query: 291  XXXXSLNKTSSNCK----TAQGNLLSASNCISALYKIAEYCNTKVDFRSWLSSARELEFS 124
                 L+++SS+ +     +   + S       L  IA  C  KV+FR  L ++ +L+FS
Sbjct: 695  GEEMPLSQSSSSHRDLDFESGRTVTSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFS 754

Query: 123  VEVLFDGKKISVAVGKTKKEAQQKASFDALKNMASQYMSQV 1
            +E  F G+K+   VG+T++EAQ++A+ +++KN+A+ Y+S++
Sbjct: 755  IEAWFAGEKVGEGVGRTRREAQRQAAEESIKNLANTYLSRI 795


>gb|EOY28303.1| C-terminal domain phosphatase-like 1 isoform 2 [Theobroma cacao]
          Length = 984

 Score =  707 bits (1825), Expect = 0.0
 Identities = 389/761 (51%), Positives = 523/761 (68%), Gaps = 18/761 (2%)
 Frame = -3

Query: 2229 IRVSHLSAQSDRCPPLAVLHTISPTGVFIKVESKEN------QDNSLLKQLYITCLKEAK 2068
            IR+ +L+  S+RCPPLAVLHTI+ +G+  K+ES ++      QD+  L  L+  C+++ K
Sbjct: 52   IRIEYLTQGSERCPPLAVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNK 111

Query: 2067 AAIVPLQGRELLLVAMLSKSRVDQLPCFWGYDVSSVLYNACLGMLNLRCLAIVFDLDETL 1888
             A++P+   EL LVAM S++     PCFWG++VS  LY++CL MLNLRCL IVFDLDETL
Sbjct: 112  TAVMPMGDCELHLVAMYSRN--SDRPCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETL 169

Query: 1887 IVANTLRSFEDKIDAIQRKLSAESDPLRVQGMVAEVRRYQEDRAMLKQYADADQIVDSNG 1708
            IVANT+RSFED+I+A+QRK++ E DP RV GMVAE++RYQ+D+A+LKQYA+ DQ+V+ NG
Sbjct: 170  IVANTMRSFEDRIEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVE-NG 228

Query: 1707 VVLKAVTEFIPPHSDRGQSISRPLIRIQDKNMIFTRINPNIRDTSVLVRLRPAWEELRTY 1528
             V+K  +E +P  SD  Q I RPLIR+Q+KN+I TRINP IRDTSVLVRLRPAWE+LR+Y
Sbjct: 229  KVIKIQSEVVPALSDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSY 288

Query: 1527 LTLKGRKRFEVYICTMSERDYALEMWRLLDGESRLINHGELLDRVLCVKPGSRKALGNVF 1348
            LT +GRKRFEVY+CTM+ERDYALEMWRLLD ES LIN  ELLDR++CVK GSRK+L NVF
Sbjct: 289  LTARGRKRFEVYVCTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVF 348

Query: 1347 RVGFCHPKLAMVIDDRLKVWEDRDQPRVHVVPPFAPYYAPQAETSCPVPVLCVARNVACN 1168
            + G CHPK+A+VIDDRLKVW+++DQPRVHVVP FAPYYAPQAE +  +PVLCVARNVACN
Sbjct: 349  QDGICHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACN 408

Query: 1167 VRGGFFKDFDEVLLQKLSEVVYEFNVESLPPTPDVSNYLLTDDD---LNVNKDLPLPEGM 997
            VRGGFF++FDE LLQ++ E+ YE +++ +P  PDV NYL+++DD   LN NKD  L +GM
Sbjct: 409  VRGGFFREFDEGLLQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGM 468

Query: 996  ADSELEKKLGPQETNQVSLPNIISSGIENQPVVRLCDSTQPTFIPLVANHGRIPLAAPXX 817
            AD+E+E++L       +S  + +SS   N    RL  S Q T   + ++   IP +A   
Sbjct: 469  ADAEVERRL----KEAISATSTVSSAAINLD-PRLTPSLQYT---MPSSSSSIPPSASQP 520

Query: 816  XXXXXXXXXLPI-IPTVKPYMPQSHVRLDCGIQTSPPREEGEVPESEVDPDTRRRLLILQ 640
                      P+  P VKP  P +    +  +Q+SP REEGEVPESE+DPDTRRRLLILQ
Sbjct: 521  SIVSFSNMQFPLAAPVVKPVAPVAVP--EPSLQSSPAREEGEVPESELDPDTRRRLLILQ 578

Query: 639  HGQDTGKFNGADPGLP---ARLQVTAPPNQVPGGWLGAEEEMSPRQIVRTAQGPAP-QQE 472
            HGQDT      +P  P     +QV+ P  Q  G W  AEEEMSPRQ+ R A    P   E
Sbjct: 579  HGQDTRDHTPPEPAFPPVRPTMQVSVPRGQSRGSWFAAEEEMSPRQLNRAAPKEFPLDSE 638

Query: 471  SFGVDKPRFPQTSLYPGSENPLVIDRGLDESKRMMTDEASFSEERLKSNHIIPXXXXXXX 292
               ++K R P    +P  E+ +  DR L E++R ++ EA   ++RL  NH  P       
Sbjct: 639  RMHIEKHRHP--PFFPKVESSIPSDRLLRENQR-LSKEALHRDDRLGLNH-TPSSYHSFS 694

Query: 291  XXXXSLNKTSSNCK----TAQGNLLSASNCISALYKIAEYCNTKVDFRSWLSSARELEFS 124
                 L+++SS+ +     +   + S       L  IA  C  KV+FR  L ++ +L+FS
Sbjct: 695  GEEMPLSQSSSSHRDLDFESGRTVTSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFS 754

Query: 123  VEVLFDGKKISVAVGKTKKEAQQKASFDALKNMASQYMSQV 1
            +E  F G+K+   VG+T++EAQ++A+ +++KN+A+ Y+S++
Sbjct: 755  IEAWFAGEKVGEGVGRTRREAQRQAAEESIKNLANTYLSRI 795


>gb|EOY28302.1| C-terminal domain phosphatase-like 1 isoform 1 [Theobroma cacao]
          Length = 978

 Score =  707 bits (1825), Expect = 0.0
 Identities = 389/761 (51%), Positives = 523/761 (68%), Gaps = 18/761 (2%)
 Frame = -3

Query: 2229 IRVSHLSAQSDRCPPLAVLHTISPTGVFIKVESKEN------QDNSLLKQLYITCLKEAK 2068
            IR+ +L+  S+RCPPLAVLHTI+ +G+  K+ES ++      QD+  L  L+  C+++ K
Sbjct: 52   IRIEYLTQGSERCPPLAVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNK 111

Query: 2067 AAIVPLQGRELLLVAMLSKSRVDQLPCFWGYDVSSVLYNACLGMLNLRCLAIVFDLDETL 1888
             A++P+   EL LVAM S++     PCFWG++VS  LY++CL MLNLRCL IVFDLDETL
Sbjct: 112  TAVMPMGDCELHLVAMYSRN--SDRPCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETL 169

Query: 1887 IVANTLRSFEDKIDAIQRKLSAESDPLRVQGMVAEVRRYQEDRAMLKQYADADQIVDSNG 1708
            IVANT+RSFED+I+A+QRK++ E DP RV GMVAE++RYQ+D+A+LKQYA+ DQ+V+ NG
Sbjct: 170  IVANTMRSFEDRIEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVE-NG 228

Query: 1707 VVLKAVTEFIPPHSDRGQSISRPLIRIQDKNMIFTRINPNIRDTSVLVRLRPAWEELRTY 1528
             V+K  +E +P  SD  Q I RPLIR+Q+KN+I TRINP IRDTSVLVRLRPAWE+LR+Y
Sbjct: 229  KVIKIQSEVVPALSDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSY 288

Query: 1527 LTLKGRKRFEVYICTMSERDYALEMWRLLDGESRLINHGELLDRVLCVKPGSRKALGNVF 1348
            LT +GRKRFEVY+CTM+ERDYALEMWRLLD ES LIN  ELLDR++CVK GSRK+L NVF
Sbjct: 289  LTARGRKRFEVYVCTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVF 348

Query: 1347 RVGFCHPKLAMVIDDRLKVWEDRDQPRVHVVPPFAPYYAPQAETSCPVPVLCVARNVACN 1168
            + G CHPK+A+VIDDRLKVW+++DQPRVHVVP FAPYYAPQAE +  +PVLCVARNVACN
Sbjct: 349  QDGICHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACN 408

Query: 1167 VRGGFFKDFDEVLLQKLSEVVYEFNVESLPPTPDVSNYLLTDDD---LNVNKDLPLPEGM 997
            VRGGFF++FDE LLQ++ E+ YE +++ +P  PDV NYL+++DD   LN NKD  L +GM
Sbjct: 409  VRGGFFREFDEGLLQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGM 468

Query: 996  ADSELEKKLGPQETNQVSLPNIISSGIENQPVVRLCDSTQPTFIPLVANHGRIPLAAPXX 817
            AD+E+E++L       +S  + +SS   N    RL  S Q T   + ++   IP +A   
Sbjct: 469  ADAEVERRL----KEAISATSTVSSAAINLD-PRLTPSLQYT---MPSSSSSIPPSASQP 520

Query: 816  XXXXXXXXXLPI-IPTVKPYMPQSHVRLDCGIQTSPPREEGEVPESEVDPDTRRRLLILQ 640
                      P+  P VKP  P +    +  +Q+SP REEGEVPESE+DPDTRRRLLILQ
Sbjct: 521  SIVSFSNMQFPLAAPVVKPVAPVAVP--EPSLQSSPAREEGEVPESELDPDTRRRLLILQ 578

Query: 639  HGQDTGKFNGADPGLP---ARLQVTAPPNQVPGGWLGAEEEMSPRQIVRTAQGPAP-QQE 472
            HGQDT      +P  P     +QV+ P  Q  G W  AEEEMSPRQ+ R A    P   E
Sbjct: 579  HGQDTRDHTPPEPAFPPVRPTMQVSVPRGQSRGSWFAAEEEMSPRQLNRAAPKEFPLDSE 638

Query: 471  SFGVDKPRFPQTSLYPGSENPLVIDRGLDESKRMMTDEASFSEERLKSNHIIPXXXXXXX 292
               ++K R P    +P  E+ +  DR L E++R ++ EA   ++RL  NH  P       
Sbjct: 639  RMHIEKHRHP--PFFPKVESSIPSDRLLRENQR-LSKEALHRDDRLGLNH-TPSSYHSFS 694

Query: 291  XXXXSLNKTSSNCK----TAQGNLLSASNCISALYKIAEYCNTKVDFRSWLSSARELEFS 124
                 L+++SS+ +     +   + S       L  IA  C  KV+FR  L ++ +L+FS
Sbjct: 695  GEEMPLSQSSSSHRDLDFESGRTVTSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFS 754

Query: 123  VEVLFDGKKISVAVGKTKKEAQQKASFDALKNMASQYMSQV 1
            +E  F G+K+   VG+T++EAQ++A+ +++KN+A+ Y+S++
Sbjct: 755  IEAWFAGEKVGEGVGRTRREAQRQAAEESIKNLANTYLSRI 795


>ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Populus trichocarpa]
            gi|550340277|gb|EEE85528.2| hypothetical protein
            POPTR_0004s04010g [Populus trichocarpa]
          Length = 996

 Score =  698 bits (1802), Expect = 0.0
 Identities = 389/780 (49%), Positives = 520/780 (66%), Gaps = 29/780 (3%)
 Frame = -3

Query: 2253 LINGIRDTIRVSHLSAQSDRCPPLAVLHTISPTGVFIKVE-------SKENQDNSLLKQL 2095
            +I+ I   IR+SH S  S+RCPPLAVLHTI+  GV  K+E       +K +Q  S L  L
Sbjct: 37   VIDEIVKEIRISHFSQTSERCPPLAVLHTITSIGVCFKMEESTSSSTTKISQQESPLHLL 96

Query: 2094 YITCLKEAKAAIVPLQGRELLLVAMLSKSRVDQLPCFWGYDVSSVLYNACLGMLNLRCLA 1915
            + +C++E K A++ L G EL LVAM S+S   Q PCFWG+ V+  LY++CL MLNLRCL 
Sbjct: 97   HSSCIQENKTAVMHLGGEELHLVAMPSRSNERQHPCFWGFSVAPGLYDSCLVMLNLRCLG 156

Query: 1914 IVFDLDETLIVANTLRSFEDKIDAIQRKLSAESDPLRVQGMVAEVRRYQEDRAMLKQYAD 1735
            IVFDLDETLIVANT+RSFED+IDA+QRK+S E DP R+ GM++EV+RY +D+ +LKQY +
Sbjct: 157  IVFDLDETLIVANTMRSFEDRIDALQRKISTEVDPQRILGMLSEVKRYHDDKNILKQYVE 216

Query: 1734 ADQIVDSNGVVLKAVTEFIPPHSDRGQSISRPLIRIQDKNMIFTRINPNIRDTSVLVRLR 1555
             DQ+V+ NG V+K  +E +P  SD  Q + RPLIR+Q+KN+I TRINP IRDTSVLVRLR
Sbjct: 217  NDQVVE-NGKVIKTQSEVVPALSDNHQPMVRPLIRLQEKNIILTRINPQIRDTSVLVRLR 275

Query: 1554 PAWEELRTYLTLKGRKRFEVYICTMSERDYALEMWRLLDGESRLINHGELLDRVLCVKPG 1375
            PAWE+LR+YLT +GRKRFEVY+CTM+ERDYALEMWRLLD ES LIN  ELLDR++CVK G
Sbjct: 276  PAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSG 335

Query: 1374 SRKALGNVFRVGFCHPKLAMVIDDRLKVWEDRDQPRVHVVPPFAPYYAPQAETSCPVPVL 1195
             RK+L NVF+ G CHPK+A+VIDDRLKVW++RDQ RVHVVP FAPYYAPQAE +  VPVL
Sbjct: 336  LRKSLFNVFQDGICHPKMALVIDDRLKVWDERDQSRVHVVPAFAPYYAPQAEVNNAVPVL 395

Query: 1194 CVARNVACNVRGGFFKDFDEVLLQKLSEVVYEFNVESLPPTPDVSNYLLTDDD---LNVN 1024
            CVARNVACNVRGGFFK+FDE LLQK+ EV YE + +++P  PDVSNYL+++DD   +N N
Sbjct: 396  CVARNVACNVRGGFFKEFDEGLLQKIPEVAYEDDTDNIPSPPDVSNYLVSEDDASAVNGN 455

Query: 1023 KDLPLPEGMADSELEKKLGPQETNQVSLPNIISSGIEN---------QPVVRLCDSTQPT 871
            +D    +GMAD+E+E++L    +   ++ + I S + +         Q  +    S+ PT
Sbjct: 456  RDQLSFDGMADAEVERQLKEAVSASSAILSTIPSTVSSLDPRLLQSLQYTIASSSSSMPT 515

Query: 870  FIP-LVANHGRIPLAAPXXXXXXXXXXXLP--IIPTVKPYMPQ--SHVRLDCGIQTSPPR 706
              P ++A+   +P   P            P    P V P + Q    V  +  +Q+SP R
Sbjct: 516  SQPSMLASQQPMPALQPPKPPSQLSMTPFPNTQFPQVAPSVKQLGQVVPPEPSLQSSPAR 575

Query: 705  EEGEVPESEVDPDTRRRLLILQHGQDTGKFNGADPGLPAR--LQVTAPPNQVPGGWLGAE 532
            EEGEVPESE+DPDTRRRLLILQHG D+     ++   PAR   QV+AP  Q  G W+  E
Sbjct: 576  EEGEVPESELDPDTRRRLLILQHGHDSRDNAPSESPFPARPSTQVSAPRVQSVGSWVPVE 635

Query: 531  EEMSPRQIVRTAQGPAPQQESFGVDKPRFPQTSLYPGSENPLVIDRGLDESKRMMTDEAS 352
            EEMSPRQ+ RT +      +   ++K R    S +   E+ +  DR + E++R    EA+
Sbjct: 636  EEMSPRQLNRTPREFPLDSDPMNIEKHRTHHPSFFHKVESNIPSDRMIHENQR-QPKEAT 694

Query: 351  FSEERLKSNHIIPXXXXXXXXXXXSLNKTSSNCK---TAQGNLLSASNCISALYKIAEYC 181
            + ++R+K NH               L+++SSN      ++    S    +  L +IA  C
Sbjct: 695  YRDDRMKLNH-STSNYPSFQGEESPLSRSSSNRDLDLESERAFSSTETPVEVLQEIAMKC 753

Query: 180  NTKVDFRSWLSSARELEFSVEVLFDGKKISVAVGKTKKEAQQKASFDALKNMASQYMSQV 1
             TKV+FR  L +  +L+FS+E  F G+K+    GKT++EAQ++A+  ++K +A  YMS+V
Sbjct: 754  GTKVEFRPALIATSDLQFSIETWFVGEKVGEGTGKTRREAQRQAAEGSIKKLAGIYMSRV 813


>ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Citrus sinensis]
          Length = 957

 Score =  697 bits (1798), Expect = 0.0
 Identities = 391/762 (51%), Positives = 512/762 (67%), Gaps = 17/762 (2%)
 Frame = -3

Query: 2235 DTIRVSHLSAQSDRCPPLAVLHTISPTGVFIKVESKENQDNSLLKQLYITCLKEAKAAIV 2056
            D IR+S+ S  S+RCPPLAVLHTI+ +G+  K+ESK + DN  L  L+ +C++E K A++
Sbjct: 38   DEIRISYFSEASERCPPLAVLHTITASGICFKMESKSS-DNIQLHLLHSSCIRENKTAVM 96

Query: 2055 PLQ-GRELLLVAMLSKSRVDQLPCFWGYDVSSVLYNACLGMLNLRCLAIVFDLDETLIVA 1879
            PL    EL LVAM S++   Q PCFW + V S LYN+CL MLNLRCL IVFDLDETLIVA
Sbjct: 97   PLGLTEELHLVAMYSRNNEKQYPCFWAFSVGSGLYNSCLTMLNLRCLGIVFDLDETLIVA 156

Query: 1878 NTLRSFEDKIDAIQRKLSAESDPLRVQGMVAEVRRYQEDRAMLKQYADADQIVDSNGVVL 1699
            NT+RSFED+I+A+ RK+S E DP R+ GM AEV+RYQ+D+ +LKQYA+ DQ V+ NG V+
Sbjct: 157  NTMRSFEDRIEALLRKISTEVDPQRIAGMQAEVKRYQDDKNILKQYAENDQ-VNENGKVI 215

Query: 1698 KAVTEFIPPHSDRGQSISRPLIRIQDKNMIFTRINPNIRDTSVLVRLRPAWEELRTYLTL 1519
            K  +E +P  SD  Q++ RPLIR+Q+KN+I TRINP IRDTSVLVRLRPAWE+LR+YLT 
Sbjct: 216  KVQSEVVPALSDSHQALVRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTA 275

Query: 1518 KGRKRFEVYICTMSERDYALEMWRLLDGESRLINHGELLDRVLCVKPGSRKALGNVFRVG 1339
            +GRKRFEVY+CTM+ERDYALEMWRLLD ES LIN  ELLDR++CVK GSRK+L NVF+ G
Sbjct: 276  RGRKRFEVYVCTMAERDYALEMWRLLDPESNLINTKELLDRIVCVKSGSRKSLFNVFQDG 335

Query: 1338 FCHPKLAMVIDDRLKVWEDRDQPRVHVVPPFAPYYAPQAETSCPVPVLCVARNVACNVRG 1159
             CHPK+A+VIDDRLKVW+D+DQPRVHVVP FAPYYAPQAE +  +PVLCVARN+ACNVRG
Sbjct: 336  TCHPKMALVIDDRLKVWDDKDQPRVHVVPAFAPYYAPQAEANNAIPVLCVARNIACNVRG 395

Query: 1158 GFFKDFDEVLLQKLSEVVYEFNVESLPPTPDVSNYLLTDDD---LNVNKDLPLPEGMADS 988
            GFFK+FDE LLQ++ E+ YE +V+ +P  PDVSNYL+++DD    N  KD    +GMAD+
Sbjct: 396  GFFKEFDEGLLQRIPEISYEDDVKDIPSPPDVSNYLVSEDDAATANGIKDPLSFDGMADA 455

Query: 987  ELEKKLGPQETNQVSLPNIISSGIEN-----QPVVRLCDSTQPTFIPLVANHGRIPLAAP 823
            E+E++L       ++    ISS + N      P      S+  T     +    +PLA  
Sbjct: 456  EVERRL----KEAIAASATISSAVANLDPRLAPFQYTMPSSSSTTTLPTSQAAVMPLA-- 509

Query: 822  XXXXXXXXXXXLPIIPTVKPYMPQSHV-RLDCGIQTSPPREEGEVPESEVDPDTRRRLLI 646
                        P    VKP     HV   +  +Q+SP REEGEVPESE+DPDTRRRLLI
Sbjct: 510  -------NMQFPPATSLVKPL---GHVGPPEQSLQSSPAREEGEVPESELDPDTRRRLLI 559

Query: 645  LQHGQDTGKFNGADPGLPARLQVTAPPNQVP--GGWLGAEEEMSPRQIVRTAQGPAP-QQ 475
            LQHG DT +   ++   PAR Q+     +VP  G W   EEEMSPRQ+ R      P   
Sbjct: 560  LQHGMDTRENAPSEAPFPARTQMQVSVPRVPSRGSWFPVEEEMSPRQLNRAVPKEFPLNS 619

Query: 474  ESFGVDKPRFPQTSLYPGSENPLVIDRGLDESKRMMTDEASFSEERLKSNHIIPXXXXXX 295
            E+  ++K R P  S +P  ENP   DR      + M  EA   ++RL+ NH +       
Sbjct: 620  EAMQIEKHRPPHPSFFPKIENPSTSDR--PHENQRMPKEALRRDDRLRLNHTL-SDYQSF 676

Query: 294  XXXXXSLNKTSSNCKTA---QGNLLSASNCIS-ALYKIAEYCNTKVDFRSWLSSARELEF 127
                  L+++SS+ +      G  +S++   S  L  IA  C TKV+FR  L ++ EL+F
Sbjct: 677  SGEEIPLSRSSSSSRDVDFESGRDVSSTETPSGVLQDIAMKCGTKVEFRPALVASTELQF 736

Query: 126  SVEVLFDGKKISVAVGKTKKEAQQKASFDALKNMASQYMSQV 1
            S+E  F G+KI   +G+T++EAQ++A+  ++K++A+ YM +V
Sbjct: 737  SIEAWFAGEKIGEGIGRTRREAQRQAAEGSIKHLANVYMLRV 778


>ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Solanum tuberosum]
          Length = 953

 Score =  696 bits (1796), Expect = 0.0
 Identities = 391/767 (50%), Positives = 507/767 (66%), Gaps = 26/767 (3%)
 Frame = -3

Query: 2229 IRVSHLSAQSDRCPPLAVLHTISPTGVFIKVE---SKENQDNSLLKQLYITCLKEAKAAI 2059
            IR+SH S  S+RCPPLAVLHT++ TG+  K+E   SK    +S L  L+ TCL++ K A+
Sbjct: 33   IRISHYSPSSERCPPLAVLHTVT-TGLSFKLEPTKSKPLTQDSPLTLLHSTCLRDNKTAV 91

Query: 2058 VPLQGRELLLVAMLSKSRVDQLPCFWGYDVSSVLYNACLGMLNLRCLAIVFDLDETLIVA 1879
            + L   EL LVAM SK+   Q PCFWG+ V+S LY++CL MLNLRCL IVFDLDETLIVA
Sbjct: 92   MSLGREELHLVAMQSKNIGGQCPCFWGFKVASGLYDSCLTMLNLRCLGIVFDLDETLIVA 151

Query: 1878 NTLRSFEDKIDAIQRKLSAESDPLRVQGMVAEVRRYQEDRAMLKQYADADQIVDSNGVVL 1699
            NT+RSFED+I+A+QRK+++ESDP R   M+AEV+RYQED+ +LKQYA+ DQ+VD NG V+
Sbjct: 152  NTMRSFEDRIEALQRKINSESDPQRASVMLAEVKRYQEDKIILKQYAENDQVVD-NGKVI 210

Query: 1698 KAVTEFIPPHSDRGQSISRPLIRIQDKNMIFTRINPNIRDTSVLVRLRPAWEELRTYLTL 1519
            K+ +E  P  SD  Q I RPLIR+QD+N+I TRINP IRDTSVLVRLRPAWE+LR+YLT 
Sbjct: 211  KSQSEVFPALSDNHQPIVRPLIRLQDRNIILTRINPMIRDTSVLVRLRPAWEDLRSYLTA 270

Query: 1518 KGRKRFEVYICTMSERDYALEMWRLLDGESRLINHGELLDRVLCVKPGSRKALGNVFRVG 1339
            +GRKRFEVY+CTM+ERDYALEMWRLLD +S LIN  ELLDR++CVK G RK+L NVF+ G
Sbjct: 271  RGRKRFEVYVCTMAERDYALEMWRLLDPDSNLINSQELLDRIVCVKSGLRKSLFNVFQDG 330

Query: 1338 FCHPKLAMVIDDRLKVWEDRDQPRVHVVPPFAPYYAPQAETSCPVPVLCVARNVACNVRG 1159
             CHPK+A+VIDDRLKVW+D+DQPRVHVVP FAPY+APQAE +  VPVLCVARNVACNVRG
Sbjct: 331  NCHPKMALVIDDRLKVWDDKDQPRVHVVPAFAPYFAPQAEGNNSVPVLCVARNVACNVRG 390

Query: 1158 GFFKDFDEVLLQKLSEVVYEFNVESLPPTPDVSNYLLTDDD---LNVNKDLPLPEGMADS 988
            GFFKDFDE LLQ++SEV YE +++ +P  PDVSNYL+++DD   +N NKD    +GMADS
Sbjct: 391  GFFKDFDEGLLQRISEVAYEDDIKQVPSAPDVSNYLISEDDPSAVNGNKDSLGFDGMADS 450

Query: 987  ELEKKLGPQETNQVSLPNIISSGIENQPVVRLCDSTQPTFIPLVANHGRIPLAAPXXXXX 808
            E+E++L        S+P+ ++             +  P  +P         L  P     
Sbjct: 451  EVERRLKEAMLASTSVPSQMT-------------NLDPRLVP--------ALQYPVPPVI 489

Query: 807  XXXXXXLPIIPTVKPYMPQSHVRL----------DCGIQTSPPREEGEVPESEVDPDTRR 658
                   P++P    ++PQ    L          D  +Q+SP REEGEVPESE+DPDTRR
Sbjct: 490  SQPSIQSPVVPFPTQHLPQVTSVLKSSVTQISPQDTSLQSSPAREEGEVPESELDPDTRR 549

Query: 657  RLLILQHGQDTGKFNGADPGLP--ARLQVTAPPNQVPGGWLGAEEEMSPRQIVRTAQGPA 484
            RLLILQHGQDT     ++P  P    LQV+ PP   P GW  AEEEMSPRQ+ R    P 
Sbjct: 550  RLLILQHGQDTRDQVSSEPKFPMGTPLQVSVPPRVQPHGWFPAEEEMSPRQLNR----PL 605

Query: 483  PQQ------ESFGVDKPRFPQTSLYPGSENPLVIDRGLDESKRMMTDEASFSEERLKSNH 322
            P +      ES  ++K R P     P  E  +  DR L E++R +  E    ++R++ + 
Sbjct: 606  PPKEFPLNPESMHINKHRPPHPPFLPKMETSMPSDRVLFENQR-LPKEVIPRDDRMRFSQ 664

Query: 321  IIPXXXXXXXXXXXSLNKTSSNCKTAQGNLLS--ASNCISALYKIAEYCNTKVDFRSWLS 148
              P             + +S+     +             AL  IA  C  KV+FRS   
Sbjct: 665  SQPSFRPPGEEVPLGRSSSSNRVLDLEPGHYDPYLETPAGALQDIAFKCGAKVEFRSSFL 724

Query: 147  SARELEFSVEVLFDGKKISVAVGKTKKEAQQKASFDALKNMASQYMS 7
            S+ EL+FS+EVLF G+K+    G+T++EAQ++A+ ++L  +A +Y+S
Sbjct: 725  SSPELQFSLEVLFAGEKVGEGTGRTRREAQRRAAEESLMYLADKYLS 771


>ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Glycine max]
          Length = 960

 Score =  695 bits (1793), Expect = 0.0
 Identities = 382/751 (50%), Positives = 502/751 (66%), Gaps = 10/751 (1%)
 Frame = -3

Query: 2229 IRVSHLSAQSDRCPPLAVLHTISPTGVFIKVESKENQDNSLLKQLYITCLKEAKAAIVPL 2050
            IR+SH S  S+RCPPLAVLHT++  GV  K+ESK  Q + L  QL+  C++E K A++PL
Sbjct: 40   IRISHFSQPSERCPPLAVLHTVTSCGVCFKMESKTQQQDGLF-QLHSLCIRENKTAVMPL 98

Query: 2049 QGRELLLVAMLSKSRVDQLPCFWGYDVSSVLYNACLGMLNLRCLAIVFDLDETLIVANTL 1870
             G E+ LVAM   SR D  PCFWG+ V+  LY++CL MLNLRCL IVFDLDETLIVANT+
Sbjct: 99   GGEEIHLVAM--HSRNDDRPCFWGFIVTLGLYDSCLVMLNLRCLGIVFDLDETLIVANTM 156

Query: 1869 RSFEDKIDAIQRKLSAESDPLRVQGMVAEVRRYQEDRAMLKQYADADQIVDSNGVVLKAV 1690
            RSFED+IDA+QRK+++E DP R+ GM AEV+RY +D+ +LKQYA+ DQ+VD NG V+K  
Sbjct: 157  RSFEDRIDALQRKINSEVDPQRISGMQAEVKRYLDDKNILKQYAENDQVVD-NGRVIKVQ 215

Query: 1689 TEFIPPHSDRGQSISRPLIRIQDKNMIFTRINPNIRDTSVLVRLRPAWEELRTYLTLKGR 1510
            +E +P  SD  Q I RPLIR+QDKN+I TRINP IRDTSVLVRLRPAWE+LR+YLT +GR
Sbjct: 216  SEIVPALSDSHQPIVRPLIRLQDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGR 275

Query: 1509 KRFEVYICTMSERDYALEMWRLLDGESRLINHGELLDRVLCVKPGSRKALGNVFRVGFCH 1330
            KRFEVY+CTM+ERDYALEMWRLLD +S LIN  ELL R++CVK G +K+L NVF+ G C 
Sbjct: 276  KRFEVYVCTMAERDYALEMWRLLDPDSNLINSKELLGRIVCVKSGLKKSLFNVFQDGSCD 335

Query: 1329 PKLAMVIDDRLKVWEDRDQPRVHVVPPFAPYYAPQAETSCPVPVLCVARNVACNVRGGFF 1150
            PK+A+VIDDRLKVW++RDQPRVHVVP FAPYYAPQAE S  +PVLCVARNVACNVRGGFF
Sbjct: 336  PKMALVIDDRLKVWDERDQPRVHVVPAFAPYYAPQAEASNTIPVLCVARNVACNVRGGFF 395

Query: 1149 KDFDEVLLQKLSEVVYEFNVESLPPTPDVSNYLLTDDD---LNVNKDLPLPEGMADSELE 979
            KDFD+ LLQK+ ++ YE +++ +P  PDVSNYL+++DD    N N+D  L +GMAD+E+E
Sbjct: 396  KDFDDGLLQKIPQIAYEDDIKDVPSPPDVSNYLVSEDDGSISNGNRDPFLFDGMADAEVE 455

Query: 978  KKLGPQETNQVSLPNIISSGIENQPVVRLCDSTQPTFIPLVANHGRIPLAAPXXXXXXXX 799
            +KL        + P + ++ ++ +       S Q T +P     G +P            
Sbjct: 456  RKLKDALAAASTFP-VTTANLDPR-----LTSLQYTMVP----SGSVPPPTAQASMMPFP 505

Query: 798  XXXLPIIPTVKPYMPQSHVRLDCGIQTSPPREEGEVPESEVDPDTRRRLLILQHGQDTGK 619
                P   T+   M Q+    D  + +SP REEGEVPESE+DPDTRRRLLILQHGQDT  
Sbjct: 506  HVQFPQPATLVKPMGQA-APSDPSLHSSPAREEGEVPESELDPDTRRRLLILQHGQDTRD 564

Query: 618  FNGADPGLPARLQVTAPPNQVP---GGWLGAEEEMSPRQIVRTAQGPAPQQES-FGVDKP 451
               A+P  P R  V A   +VP   G W   EEE+  + + R      P      G++KP
Sbjct: 565  HASAEPPFPVRHPVQASAPRVPSSRGVWFPVEEEIGSQPLNRVVPKEFPVDSGPLGIEKP 624

Query: 450  RFPQTSLYPGSENPLVIDRGLDESKRMMTDEASFSEERLKSNHIIPXXXXXXXXXXXSLN 271
            R    S +   E+ +  DR L +S + +  E    ++R + NH++               
Sbjct: 625  RLHHPSFFNKVESSISSDRILHDSHQRLPKEMYHRDDRPRLNHMLSSYRSFSGDDIPFSR 684

Query: 270  KTSSN---CKTAQGNLLSASNCISALYKIAEYCNTKVDFRSWLSSARELEFSVEVLFDGK 100
             +SS+      +  ++L A   ++ L++IA  C TKVDF S L ++ EL+FS+E  F GK
Sbjct: 685  SSSSHRDLDSESGHSVLHADTPVAVLHEIALKCGTKVDFMSSLVASTELKFSLEAWFSGK 744

Query: 99   KISVAVGKTKKEAQQKASFDALKNMASQYMS 7
            KI    G+T+KEAQ KA+ D+++++A  Y+S
Sbjct: 745  KIGHGFGRTRKEAQNKAAKDSIEHLADIYLS 775


>ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
          Length = 956

 Score =  694 bits (1791), Expect = 0.0
 Identities = 383/751 (50%), Positives = 506/751 (67%), Gaps = 10/751 (1%)
 Frame = -3

Query: 2229 IRVSHLSAQSDRCPPLAVLHTISPTGVFIKVESKENQDNSLLKQLYITCLKEAKAAIVPL 2050
            IR+SH S  S+RCPPLAVLHT++  GV  K+ESK  Q + L  QL+  C++E K A++PL
Sbjct: 36   IRISHFSQPSERCPPLAVLHTVTSCGVCFKMESKTQQQDGLF-QLHSLCIRENKTAVMPL 94

Query: 2049 QGRELLLVAMLSKSRVDQLPCFWGYDVSSVLYNACLGMLNLRCLAIVFDLDETLIVANTL 1870
             G E+ LVAM S++ VD+ PCFWG+ V+  LY++CL MLNLRCL IVFDLDETLIVANT+
Sbjct: 95   GGEEIHLVAMHSRN-VDR-PCFWGFIVALGLYDSCLVMLNLRCLGIVFDLDETLIVANTM 152

Query: 1869 RSFEDKIDAIQRKLSAESDPLRVQGMVAEVRRYQEDRAMLKQYADADQIVDSNGVVLKAV 1690
            RSFED+IDA+QRK+++E DP R+ GM AEV+RYQ+D+ +LKQYA+ DQ+VD NG V+K  
Sbjct: 153  RSFEDRIDALQRKINSEVDPQRISGMQAEVKRYQDDKNILKQYAENDQVVD-NGRVIKVQ 211

Query: 1689 TEFIPPHSDRGQSISRPLIRIQDKNMIFTRINPNIRDTSVLVRLRPAWEELRTYLTLKGR 1510
            +E +P  SD  Q I RPLIR+QDKN+I TRINP IRDTSVLVRLRPAWE+LR+YLT +GR
Sbjct: 212  SEIVPALSDSHQPIVRPLIRLQDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGR 271

Query: 1509 KRFEVYICTMSERDYALEMWRLLDGESRLINHGELLDRVLCVKPGSRKALGNVFRVGFCH 1330
            KRFEVY+CTM+ERDYALEMWRLLD +S LIN  ELL R++CVK G +K+L NVF+ G CH
Sbjct: 272  KRFEVYVCTMAERDYALEMWRLLDPDSNLINSKELLGRIVCVKSGLKKSLFNVFQDGLCH 331

Query: 1329 PKLAMVIDDRLKVWEDRDQPRVHVVPPFAPYYAPQAETSCPVPVLCVARNVACNVRGGFF 1150
            PK+A+VIDDRLKVW+++DQPRVHVVP FAPYYAPQAE S  +PVLCVARNVACNVRGGFF
Sbjct: 332  PKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEASNTIPVLCVARNVACNVRGGFF 391

Query: 1149 KDFDEVLLQKLSEVVYEFNVESLPPTPDVSNYLLTDDD---LNVNKDLPLPEGMADSELE 979
            KDFD+ LLQK+ ++ YE +++ +P  PDVSNYL+++DD    N ++D  L +GMAD+E+E
Sbjct: 392  KDFDDGLLQKIPQIAYEDDIKDIPSPPDVSNYLVSEDDGSISNGHRDPFLFDGMADAEVE 451

Query: 978  KKLGPQETNQVSLPNIISSGIENQPVVRLCDSTQPTFIPLVANHGRIPLAAPXXXXXXXX 799
            +KL    +   ++P + ++ ++ +       S Q T +P     G +P            
Sbjct: 452  RKLKDALSAASTIP-VTTANLDPR-----LTSLQYTMVP----SGSVPPPTAQASMMPFP 501

Query: 798  XXXLPIIPTVKPYMPQSHVRLDCGIQTSPPREEGEVPESEVDPDTRRRLLILQHGQDTGK 619
                P   T+   M Q+    +  + +SP REEGEVPESE+DPDTRRRLLILQHGQDT  
Sbjct: 502  HVQFPQPATLVKPMGQA-APSEPSLHSSPAREEGEVPESELDPDTRRRLLILQHGQDTRD 560

Query: 618  FNGADPGLPARLQVTAPPNQVP---GGWLGAEEEMSPRQIVRTAQGPAPQQES-FGVDKP 451
               A+P  P R  V      VP   G W  AEEE+  + + R      P      G+ KP
Sbjct: 561  HASAEPPFPVRHPVQTSAPHVPSSRGVWFPAEEEIGSQPLNRVVPKEFPVDSGPLGIAKP 620

Query: 450  RFPQTSLYPGSENPLVIDRGLDESKRMMTDEASFSEERLKSNHIIPXXXXXXXXXXXSLN 271
            R    S +   E+ +  DR L +S + +  E    ++R + NH++               
Sbjct: 621  RPHHPSFFSKVESSISSDRILHDSHQRLPKEMYHRDDRPRLNHMLSSYRSFSGDDIPFSR 680

Query: 270  KTSSN---CKTAQGNLLSASNCISALYKIAEYCNTKVDFRSWLSSARELEFSVEVLFDGK 100
              SS+      +  ++L A   ++ L +IA  C TKVDF S L ++ EL+FS+E  F GK
Sbjct: 681  SFSSHRDLDSESGHSVLHADTPVAVLQEIALKCGTKVDFISSLVASTELQFSMEAWFSGK 740

Query: 99   KISVAVGKTKKEAQQKASFDALKNMASQYMS 7
            KI   VG+T+KEAQ KA+ D++K++A  Y+S
Sbjct: 741  KIGHRVGRTRKEAQNKAAEDSIKHLADIYLS 771


>ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Fragaria vesca subsp. vesca]
          Length = 955

 Score =  689 bits (1778), Expect = 0.0
 Identities = 384/757 (50%), Positives = 507/757 (66%), Gaps = 15/757 (1%)
 Frame = -3

Query: 2229 IRVSHLSAQSDRCPPLAVLHTISPTGVFIKVESKEN----QDNSLLKQLYITCLKEAKAA 2062
            IR+SH S  S+RCPP+AVLHTIS  GV  K+ESK +    QD S L  L+ +C+ E K A
Sbjct: 34   IRISHFSQSSERCPPVAVLHTISSNGVCFKMESKSSSSSSQDTSRLFLLHSSCIMENKTA 93

Query: 2061 IVPLQGRELLLVAMLSKSRVDQLPCFWGYDVSSVLYNACLGMLNLRCLAIVFDLDETLIV 1882
            ++ L   EL LVAM S++   Q PCFWG+ VSS LY++CLGMLNLRCL IVFDLDETLIV
Sbjct: 94   VMNLGVEELHLVAMYSRNNQKQHPCFWGFSVSSGLYSSCLGMLNLRCLGIVFDLDETLIV 153

Query: 1881 ANTLRSFEDKIDAIQRKLSAESDPLRVQGMVAEVRRYQEDRAMLKQYADADQIVDSNGVV 1702
            ANT+RSFED+I+ +QRK+  E D  R+ GM AE++RYQ+D+ +LKQYA+ DQ+V+ NG V
Sbjct: 154  ANTMRSFEDRIEGLQRKIQCEVDAQRISGMQAEIKRYQDDKFILKQYAENDQVVE-NGRV 212

Query: 1701 LKAVTEFIPPHSDRGQSISRPLIRIQDKNMIFTRINPNIRDTSVLVRLRPAWEELRTYLT 1522
            +K  +E +P  SD  Q I RPLIR+Q+KN+I TRINP IRDTSVLVRLRPAWE+LR+YLT
Sbjct: 213  IKTQSEVVPALSDSHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLT 272

Query: 1521 LKGRKRFEVYICTMSERDYALEMWRLLDGESRLINHGELLDRVLCVKPGSRKALGNVFRV 1342
             +GRKRFEVY+CTM+ERDYALEMWRLLD ES LIN  +LLDR++CVK G +K+L NVF+ 
Sbjct: 273  ARGRKRFEVYVCTMAERDYALEMWRLLDPESNLINANKLLDRIVCVKSGLKKSLFNVFQE 332

Query: 1341 GFCHPKLAMVIDDRLKVWEDRDQPRVHVVPPFAPYYAPQAETSCPVPVLCVARNVACNVR 1162
              CHPK+A+VIDDRLKVW+DRDQPRVHVVP FAPYYAPQAE +  VPVLCVARNVAC+VR
Sbjct: 333  SLCHPKMALVIDDRLKVWDDRDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACSVR 392

Query: 1161 GGFFKDFDEVLLQKLSEVVYEFNVESLPPTPDVSNYLLTDDDL---NVNKDLPLPEGMAD 991
            GGFF++FD+ LLQK+ E+ YE N++    +PDVSN+L+++DD    N N+D    +GMAD
Sbjct: 393  GGFFREFDDSLLQKIPEIFYEDNIKDF-SSPDVSNFLVSEDDASASNGNRDQLPFDGMAD 451

Query: 990  SELEKKLGPQETNQVSLPNIISSGIENQPVVRLCDSTQPTFIPLVANHGRIPLAAPXXXX 811
            +E+E++L    +   + P + S+   N P +     T    +PL +      ++ P    
Sbjct: 452  AEVERRLKEATS---AAPTVSSAVSNNDPRLASLQYT----VPLSST-----VSLPTNQP 499

Query: 810  XXXXXXXLPIIPTVKPYMPQSHV-RLDCGIQTSPPREEGEVPESEVDPDTRRRLLILQHG 634
                   +    +     P  HV   D G+ +SP REEGEVPESE+DPDTRRRLLILQHG
Sbjct: 500  SMMPFHNVQFPQSASLVKPLGHVGPADLGLHSSPAREEGEVPESELDPDTRRRLLILQHG 559

Query: 633  QDTGKFNGADPGLPAR--LQVTAPPNQVPGGWLGAEEEMSPRQIVR-TAQGPAPQQESFG 463
            QDT +   ++P  P R  +QV+ P  Q  GGW   EEEMSPR++ R   + P    E   
Sbjct: 560  QDTRESVPSEPSFPVRPQVQVSVPRVQSRGGWFPVEEEMSPRKLSRMVPKEPPLNSEPMQ 619

Query: 462  VDKPRFPQTSLYPGSENPLVIDRGLDESKRMMTDEASFSEERLKSNHIIPXXXXXXXXXX 283
            ++K R   ++ +P  EN +  DR L E++R +  EA   + RL+ N  +           
Sbjct: 620  IEKHRSHHSAFFPKVENSMPSDRILQENQR-LPKEAFHRDNRLRFNQAM-SGYHSFSGEE 677

Query: 282  XSLNKTSSNCKT---AQGNLLS-ASNCISALYKIAEYCNTKVDFRSWLSSARELEFSVEV 115
              LN++SS+ +      G  +S A      L +IA  C TKV+FR  L  + EL+F VE 
Sbjct: 678  PPLNRSSSSNRDFDYESGRAISNAETPAGVLQEIAMKCGTKVEFRPALVPSTELQFYVEA 737

Query: 114  LFDGKKISVAVGKTKKEAQQKASFDALKNMASQYMSQ 4
             F G+KI    G+T++EA  +A+  +LKN+A+ Y+S+
Sbjct: 738  WFAGEKIGEGTGRTRREAHFQAAEGSLKNLANIYISR 774


>ref|XP_004232844.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Solanum lycopersicum]
          Length = 954

 Score =  689 bits (1778), Expect = 0.0
 Identities = 387/767 (50%), Positives = 504/767 (65%), Gaps = 26/767 (3%)
 Frame = -3

Query: 2229 IRVSHLSAQSDRCPPLAVLHTISPTGVFIKVE---SKENQDNSLLKQLYITCLKEAKAAI 2059
            IR+SH S  S+RCPPLAVLHT++ TG+  K+E   SK    +S L  L+ TCL++ K A+
Sbjct: 33   IRISHYSPSSERCPPLAVLHTVT-TGLSFKLEPTKSKPLTQDSPLTLLHSTCLRDNKTAV 91

Query: 2058 VPLQGRELLLVAMLSKSRVDQLPCFWGYDVSSVLYNACLGMLNLRCLAIVFDLDETLIVA 1879
            + L   EL LVAM SK+   Q PCFWG+ V+S LY++CL MLNLRCL IVFDLDETLIVA
Sbjct: 92   MSLGREELHLVAMQSKNIGGQCPCFWGFKVASGLYDSCLTMLNLRCLGIVFDLDETLIVA 151

Query: 1878 NTLRSFEDKIDAIQRKLSAESDPLRVQGMVAEVRRYQEDRAMLKQYADADQIVDSNGVVL 1699
            NT+RSFED+I+A+QRK+++ESDP R   M+AEV+RYQED+ +LKQYA+ DQ+VD NG V+
Sbjct: 152  NTMRSFEDRIEALQRKINSESDPQRASVMLAEVKRYQEDKIILKQYAENDQVVD-NGKVI 210

Query: 1698 KAVTEFIPPHSDRGQSISRPLIRIQDKNMIFTRINPNIRDTSVLVRLRPAWEELRTYLTL 1519
            ++ +E  P  SD  Q I RPLIR+QD+N+I TRINP IRDTSVLVRLRPAWE+LR+YLT 
Sbjct: 211  RSQSEVFPALSDNHQPIVRPLIRLQDRNIILTRINPMIRDTSVLVRLRPAWEDLRSYLTA 270

Query: 1518 KGRKRFEVYICTMSERDYALEMWRLLDGESRLINHGELLDRVLCVKPGSRKALGNVFRVG 1339
            +GRKRFEVY+CTM+ERDYALEMWRLLD +S LIN  ELLDR++CVK G RK+L NVF+ G
Sbjct: 271  RGRKRFEVYVCTMAERDYALEMWRLLDPDSNLINSQELLDRIVCVKSGLRKSLFNVFQDG 330

Query: 1338 FCHPKLAMVIDDRLKVWEDRDQPRVHVVPPFAPYYAPQAETSCPVPVLCVARNVACNVRG 1159
             CHPK+A+VIDDRLKVW+D+DQPRVHVVP FAPY+APQAE +  VPVLCVARNVACNVRG
Sbjct: 331  NCHPKMALVIDDRLKVWDDKDQPRVHVVPAFAPYFAPQAEGNNSVPVLCVARNVACNVRG 390

Query: 1158 GFFKDFDEVLLQKLSEVVYEFNVESLPPTPDVSNYLLTDDD---LNVNKDLPLPEGMADS 988
            GFFKDFDE LLQ++SEV YE +++ +P  PDVSNYL+++DD   +N NKD    +GMADS
Sbjct: 391  GFFKDFDEGLLQRISEVAYEDDIKQVPSAPDVSNYLISEDDPSAVNGNKDSLGFDGMADS 450

Query: 987  ELEKKLGPQETNQVSLPNIISSGIENQPVVRLCDSTQPTFIPLVANHGRIPLAAPXXXXX 808
            E+E++L        S+P+ ++             +  P  +P         L  P     
Sbjct: 451  EVERRLKEAMLASTSVPSQMT-------------NLDPRLVP--------ALQYPVPPVI 489

Query: 807  XXXXXXLPIIPTVKPYMPQSHVRL----------DCGIQTSPPREEGEVPESEVDPDTRR 658
                   P++P    ++PQ    L          D  +Q+SP REEGEVPESE+DPDTRR
Sbjct: 490  SQPSIQGPVVPFPTQHLPQVTSVLKSSVTQISPQDTSLQSSPAREEGEVPESELDPDTRR 549

Query: 657  RLLILQHGQDTGKFNGADPGLP--ARLQVTAPPNQVPGGWLGAEEEMSPRQIVRTAQGPA 484
            RLLILQHGQDT     ++P  P    LQV+ PP   P GW  AEEE+SPRQ+ R    P 
Sbjct: 550  RLLILQHGQDTRDQVSSEPKFPIGTPLQVSVPPRVQPHGWFPAEEEVSPRQLNR----PL 605

Query: 483  PQQ------ESFGVDKPRFPQTSLYPGSENPLVIDRGLDESKRMMTDEASFSEERLKSNH 322
            P +      ES  ++K R P     P  E  +  DR   E++R +  E    ++R++ + 
Sbjct: 606  PPKEFPLNPESMHINKHRPPHPPFLPKMETSMPSDRVFFENQR-LPKEVIPRDDRMRFSQ 664

Query: 321  IIPXXXXXXXXXXXSLNKTSSNCKTAQGNLLS--ASNCISALYKIAEYCNTKVDFRSWLS 148
              P             + +S+                   AL  IA  C  KV+FRS   
Sbjct: 665  SQPSFRPPGEDVSLGRSSSSNRVLDLDPGHYDPYLDTPAGALQDIAFKCGVKVEFRSSFL 724

Query: 147  SARELEFSVEVLFDGKKISVAVGKTKKEAQQKASFDALKNMASQYMS 7
            S+ EL+F +EVLF G+K+   +G+T++EAQ+ A+ ++L  +A +Y+S
Sbjct: 725  SSPELQFCLEVLFAGEKVGEGIGRTRREAQRHAAEESLMYLADKYLS 771


>ref|XP_002519032.1| double-stranded RNA binding protein, putative [Ricinus communis]
            gi|223541695|gb|EEF43243.1| double-stranded RNA binding
            protein, putative [Ricinus communis]
          Length = 978

 Score =  689 bits (1777), Expect = 0.0
 Identities = 384/764 (50%), Positives = 513/764 (67%), Gaps = 14/764 (1%)
 Frame = -3

Query: 2253 LINGIRDTIRVSHLSAQSDRCPPLAVLHTISPTGVFIKVESKENQD-NSLLKQLYITCLK 2077
            +I+ I   IR+SH S  S+RCPPLAVLHTI+  G+  K+ESK +   ++ L  L+ +C++
Sbjct: 44   VIDEILKGIRISHFSQASERCPPLAVLHTITTNGICFKMESKNSVSLDTPLHLLHSSCIQ 103

Query: 2076 EAKAAIVPLQG-RELLLVAMLSKSRVDQLPCFWGYDVSSVLYNACLGMLNLRCLAIVFDL 1900
            E+K A+V LQG  EL LVAM S++   Q PCFW +++SS LY++CL MLNLRCL IVFDL
Sbjct: 104  ESKTAVVLLQGGEELHLVAMFSRNDERQYPCFWAFNISSGLYDSCLVMLNLRCLGIVFDL 163

Query: 1899 DETLIVANTLRSFEDKIDAIQRKLSAESDPLRVQGMVAEVRRYQEDRAMLKQYADADQIV 1720
            DETLIVANT+RSFED+I+A+QRK+S E DP R+ GM++EV+RYQ+D+ +LKQY D DQ+V
Sbjct: 164  DETLIVANTMRSFEDRIEALQRKISTELDPQRISGMLSEVKRYQDDKTILKQYVDNDQVV 223

Query: 1719 DSNGVVLKAVTEFIPPHSDRGQSISRPLIRIQDKNMIFTRINPNIRDTSVLVRLRPAWEE 1540
            + NG V+K   E +P  SD  Q+I RPLIR+Q++N+I TRINP IRDTSVLVRLRPAWEE
Sbjct: 224  E-NGRVIKTQFEVVPALSDNHQTIVRPLIRLQERNIILTRINPQIRDTSVLVRLRPAWEE 282

Query: 1539 LRTYLTLKGRKRFEVYICTMSERDYALEMWRLLDGESRLINHGELLDRVLCVKPGSRKAL 1360
            LR+YLT +GRKRFEVY+CTM+ERDYALEMWRLLD ES LIN  ELLDR++CVK G RK+L
Sbjct: 283  LRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLRKSL 342

Query: 1359 GNVFRVGFCHPKLAMVIDDRLKVWEDRDQPRVHVVPPFAPYYAPQAETSCPVPVLCVARN 1180
             NVF+ G CHPK+A+VIDDRLKVW+++DQPRVHVVP FAPYYAPQAE +  VPVLCVARN
Sbjct: 343  FNVFQDGICHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARN 402

Query: 1179 VACNVRGGFFKDFDEVLLQKLSEVVYEFNVESLPPTPDVSNYLLTDDDL---NVNKDLPL 1009
            VACNVRGGFFK+FDE LLQ++ E+ +E ++  +P  PDVSNYL+ +DD    N N+D   
Sbjct: 403  VACNVRGGFFKEFDEGLLQRIPEISFEDDMNDIPSPPDVSNYLVPEDDAFTSNGNRDPLS 462

Query: 1008 PEGMADSELEKKLGPQETNQVSLPNIISSGIENQPVVRLCDSTQPTFIPLVANHGRIPLA 829
             +GMAD+E+EK+L       +S+ +   S + N    RL    Q T    +A+   IP+ 
Sbjct: 463  FDGMADAEVEKRL----KEAISISSAFPSTVANLD-ARLVPPLQYT----MASSSSIPVP 513

Query: 828  APXXXXXXXXXXXLP-IIPTVKPYMPQSHVRLDCGIQTSPPREEGEVPESEVDPDTRRRL 652
                         LP   P VKP      V  +  +Q+SP REEGEVPESE+DPDTRRRL
Sbjct: 514  TSQPAVVTFPSMQLPQAAPLVKPL--GQVVPSEPSLQSSPAREEGEVPESELDPDTRRRL 571

Query: 651  LILQHGQDTGKFNGADPGLPAR----LQVTAPPNQVPGGWLGAEEEMSPRQIVRTAQGPA 484
            LILQHGQD      ++   P R    +QV+ P  Q  G W+  EEEMSPRQ+ R      
Sbjct: 572  LILQHGQDLRDPAPSESPFPVRPSNSMQVSVPRVQSRGNWVPVEEEMSPRQLNRAVTREF 631

Query: 483  PQQ-ESFGVDKPRFPQTSLYPGSENPLVIDRGLDESKRMMTDEASFSEERLKSNHIIPXX 307
            P   E   +DK R    S +P  E+ +  +R   E++R +   A + ++RL+ N  +   
Sbjct: 632  PMDTEPMHIDKHRPHHPSFFPKVESSIPSERMPHENQR-LPKVAPYKDDRLRLNQTMSNY 690

Query: 306  XXXXXXXXXSLNKTSSNCK---TAQGNLLSASNCISALYKIAEYCNTKVDFRSWLSSARE 136
                         +SSN      +   + SA   +  L++I+  C  KV+F+  L ++R+
Sbjct: 691  QSLSGEENSLSRSSSSNRDLDVESDRAVSSAETPVRVLHEISMKCGAKVEFKHSLVNSRD 750

Query: 135  LEFSVEVLFDGKKISVAVGKTKKEAQQKASFDALKNMASQYMSQ 4
            L+FSVE  F G+++    G+T++EAQ  A+  ++KN+A+ Y+S+
Sbjct: 751  LQFSVEAWFAGERVGEGFGRTRREAQSVAAEASIKNLANIYISR 794


>ref|XP_006583810.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X2 [Glycine max]
          Length = 929

 Score =  686 bits (1771), Expect = 0.0
 Identities = 379/748 (50%), Positives = 499/748 (66%), Gaps = 7/748 (0%)
 Frame = -3

Query: 2229 IRVSHLSAQSDRCPPLAVLHTISPTGVFIKVESKENQDNSLLKQLYITCLKEAKAAIVPL 2050
            IR+SH S  S+RCPPLAVLHT++  GV  K+ESK  Q + L  QL+  C++E K A++PL
Sbjct: 36   IRISHFSQPSERCPPLAVLHTVTSCGVCFKMESKTQQQDGLF-QLHSLCIRENKTAVMPL 94

Query: 2049 QGRELLLVAMLSKSRVDQLPCFWGYDVSSVLYNACLGMLNLRCLAIVFDLDETLIVANTL 1870
             G E+ LVAM S++ VD+ PCFWG+ V+  LY++CL MLNLRCL IVFDLDETLIVANT+
Sbjct: 95   GGEEIHLVAMHSRN-VDR-PCFWGFIVALGLYDSCLVMLNLRCLGIVFDLDETLIVANTM 152

Query: 1869 RSFEDKIDAIQRKLSAESDPLRVQGMVAEVRRYQEDRAMLKQYADADQIVDSNGVVLKAV 1690
            RSFED+IDA+QRK+++E DP R+ GM AEV+RYQ+D+ +LKQYA+ DQ+VD NG V+K  
Sbjct: 153  RSFEDRIDALQRKINSEVDPQRISGMQAEVKRYQDDKNILKQYAENDQVVD-NGRVIKVQ 211

Query: 1689 TEFIPPHSDRGQSISRPLIRIQDKNMIFTRINPNIRDTSVLVRLRPAWEELRTYLTLKGR 1510
            +E +P  SD  Q I RPLIR+QDKN+I TRINP IRDTSVLVRLRPAWE+LR+YLT +GR
Sbjct: 212  SEIVPALSDSHQPIVRPLIRLQDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGR 271

Query: 1509 KRFEVYICTMSERDYALEMWRLLDGESRLINHGELLDRVLCVKPGSRKALGNVFRVGFCH 1330
            KRFEVY+CTM+ERDYALEMWRLLD +S LIN  ELL R++CVK G +K+L NVF+ G CH
Sbjct: 272  KRFEVYVCTMAERDYALEMWRLLDPDSNLINSKELLGRIVCVKSGLKKSLFNVFQDGLCH 331

Query: 1329 PKLAMVIDDRLKVWEDRDQPRVHVVPPFAPYYAPQAETSCPVPVLCVARNVACNVRGGFF 1150
            PK+A+VIDDRLKVW+++DQPRVHVVP FAPYYAPQAE S  +PVLCVARNVACNVRGGFF
Sbjct: 332  PKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEASNTIPVLCVARNVACNVRGGFF 391

Query: 1149 KDFDEVLLQKLSEVVYEFNVESLPPTPDVSNYLLTDDD---LNVNKDLPLPEGMADSELE 979
            KDFD+ LLQK+ ++ YE +++ +P  PDVSNYL+++DD    N ++D  L +GMAD+E+E
Sbjct: 392  KDFDDGLLQKIPQIAYEDDIKDIPSPPDVSNYLVSEDDGSISNGHRDPFLFDGMADAEVE 451

Query: 978  KKLGPQETNQVSLPNIISSGIENQPVVRLCDSTQPTFIPLVANHGRIPLAAPXXXXXXXX 799
            +KL    +   ++P + ++ ++ +       S Q T +P     G +P            
Sbjct: 452  RKLKDALSAASTIP-VTTANLDPR-----LTSLQYTMVP----SGSVPPPTAQASMMPFP 501

Query: 798  XXXLPIIPTVKPYMPQSHVRLDCGIQTSPPREEGEVPESEVDPDTRRRLLILQHGQDTGK 619
                P   T+   M Q+    +  + +SP REEGEVPESE+DPDTRRRLLILQHGQDT  
Sbjct: 502  HVQFPQPATLVKPMGQA-APSEPSLHSSPAREEGEVPESELDPDTRRRLLILQHGQDTRD 560

Query: 618  FNGADPGLPARLQVTAPPNQVP---GGWLGAEEEMSPRQIVRTAQGPAPQQES-FGVDKP 451
               A+P  P R  V      VP   G W  AEEE+  + + R      P      G+ KP
Sbjct: 561  HASAEPPFPVRHPVQTSAPHVPSSRGVWFPAEEEIGSQPLNRVVPKEFPVDSGPLGIAKP 620

Query: 450  RFPQTSLYPGSENPLVIDRGLDESKRMMTDEASFSEERLKSNHIIPXXXXXXXXXXXSLN 271
            R    S +   E+ +  DR L +S + +  E    ++R + NH++               
Sbjct: 621  RPHHPSFFSKVESSISSDRILHDSHQRLPKEMYHRDDRPRLNHMLSSYRSF--------- 671

Query: 270  KTSSNCKTAQGNLLSASNCISALYKIAEYCNTKVDFRSWLSSARELEFSVEVLFDGKKIS 91
                           +   ++ L +IA  C TKVDF S L ++ EL+FS+E  F GKKI 
Sbjct: 672  ---------------SDTPVAVLQEIALKCGTKVDFISSLVASTELQFSMEAWFSGKKIG 716

Query: 90   VAVGKTKKEAQQKASFDALKNMASQYMS 7
              VG+T+KEAQ KA+ D++K++A  Y+S
Sbjct: 717  HRVGRTRKEAQNKAAEDSIKHLADIYLS 744


>ref|XP_004134718.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Cucumis sativus] gi|449479317|ref|XP_004155567.1|
            PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II
            C-terminal domain phosphatase-like 1-like [Cucumis
            sativus]
          Length = 803

 Score =  685 bits (1768), Expect = 0.0
 Identities = 373/760 (49%), Positives = 510/760 (67%), Gaps = 18/760 (2%)
 Frame = -3

Query: 2229 IRVSHLSAQSDRCPPLAVLHTISPTGVFIKVESKENQDNSL-LKQLYITCLKEAKAAIVP 2053
            IR++H S  S+RCPPLAVLHTI+ +G+  K+ESK +Q     L  L+ +C+ E K AI+ 
Sbjct: 34   IRITHFSQPSERCPPLAVLHTIAASGICFKMESKTSQSQDTPLNLLHSSCIMENKTAIMM 93

Query: 2052 LQGRELLLVAMLSKSRVDQLPCFWGYDVSSVLYNACLGMLNLRCLAIVFDLDETLIVANT 1873
                EL LVAM S+    Q PCFWG++V+  LYN+CL MLNLRCL IVFDLDETL+VANT
Sbjct: 94   FGVEELHLVAMFSRDLDKQYPCFWGFNVAMGLYNSCLDMLNLRCLGIVFDLDETLVVANT 153

Query: 1872 LRSFEDKIDAIQRKLSAESDPLRVQGMVAEVRRYQEDRAMLKQYADADQIVDSNGVVLKA 1693
            +RSFED+I+A+QRK+S+E DP R  GM+AEV+RYQ+D+ +LKQYA+ DQ+++ NG V+K+
Sbjct: 154  MRSFEDRIEALQRKISSEVDPQRANGMLAEVKRYQDDKIILKQYAENDQVIE-NGKVIKS 212

Query: 1692 VTEFIPPHSDRGQSISRPLIRIQDKNMIFTRINPNIRDTSVLVRLRPAWEELRTYLTLKG 1513
             +E +P  SD  Q + RPLIR+ +KN+I TRINP IRDTSVLVRLRPAWE+LR+YLT +G
Sbjct: 213  QSEVVPALSDNHQPVVRPLIRLHEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARG 272

Query: 1512 RKRFEVYICTMSERDYALEMWRLLDGESRLINHGELLDRVLCVKPGSRKALGNVFRVGFC 1333
            RKRFEVY+CTM+ERDYALEMWRLLD +S LIN  ELLDR++CVK GSRK+L NVF+ GFC
Sbjct: 273  RKRFEVYVCTMAERDYALEMWRLLDPDSNLINPKELLDRIVCVKSGSRKSLFNVFQDGFC 332

Query: 1332 HPKLAMVIDDRLKVWEDRDQPRVHVVPPFAPYYAPQAETSCPVPVLCVARNVACNVRGGF 1153
            HPK+A+VIDDRLKVW+++DQPRVHVVP FAPYYAP AE +  +PVLCVARNVACNVRGGF
Sbjct: 333  HPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPNAEGNNAIPVLCVARNVACNVRGGF 392

Query: 1152 FKDFDEVLLQKLSEVVYEFNVESLPPTPDVSNYLLTDDDLNV---NKDLPLPEGMADSEL 982
            FK+FD++LLQK+S++ YE +V  +P  PDVSNYL+++D+ ++   NKD+P  +GM D E+
Sbjct: 393  FKEFDDILLQKISDISYEDDVNDIPSPPDVSNYLVSEDEYSIANGNKDMPTFDGMPDMEV 452

Query: 981  EKKLGPQETNQVSLPNIISSGIENQPVVRLCDSTQPTFIPLVANHGRIPLAAPXXXXXXX 802
            ++++             ++S   N    R+  S Q T      +   +PL          
Sbjct: 453  DRRMKDA---------FLASSTINSADPRV-SSLQYTMASASCS---VPLPPKQVTMPYF 499

Query: 801  XXXXLPIIPTVKPYMPQSHVRLDCGIQTSPPREEGEVPESEVDPDTRRRLLILQHGQDTG 622
                LP + +V    P      +  +Q+SP REEGEVPESE+DPDTRRRLLILQHGQDT 
Sbjct: 500  PNMPLPHVNSVAHVAPN-----EPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTR 554

Query: 621  KFNGADPGLPAR----LQVTAPPNQVPGGWLGAEEEMSPRQIVRTAQ-------GPAPQQ 475
            +   ++P  PAR     QV AP  Q  G W   EEEMSPRQ+ R+A+        P P +
Sbjct: 555  ERLSSEPAFPARPPPLQQVAAPRAQSRGNWSPMEEEMSPRQLNRSARKDFPVDAEPMPMR 614

Query: 474  ESFGVDKPRFPQTSLYPGSENPLVIDRGLDESKRMMTDEASFSEERLKSNHIIPXXXXXX 295
            E     K R    S +   +N ++ DR +    + +  EA + ++R++ +   P      
Sbjct: 615  E-----KHRSNHPSFFAKVDNSILPDR-IPHDNQRLPKEAFYRDDRMRVSR-RPSSYPAF 667

Query: 294  XXXXXSLNKTSSNCK---TAQGNLLSASNCISALYKIAEYCNTKVDFRSWLSSARELEFS 124
                  +N++SS  +      G  + +   + AL +IA    TKV+F+  L  + +L+FS
Sbjct: 668  SGEEIPMNQSSSRSRDDDIESGRSIWSETPVGALQEIAMKFGTKVEFKPGLVPSTDLQFS 727

Query: 123  VEVLFDGKKISVAVGKTKKEAQQKASFDALKNMASQYMSQ 4
            VE  F G+KI   +G T+++AQ++A+  ++KN+A+ Y+S+
Sbjct: 728  VEAWFVGEKIGEGIGHTRRDAQRQAAEGSIKNLANIYVSR 767


>ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citrus clementina]
            gi|557551913|gb|ESR62542.1| hypothetical protein
            CICLE_v10014168mg [Citrus clementina]
          Length = 957

 Score =  684 bits (1765), Expect = 0.0
 Identities = 386/762 (50%), Positives = 510/762 (66%), Gaps = 17/762 (2%)
 Frame = -3

Query: 2235 DTIRVSHLSAQSDRCPPLAVLHTISPTGVFIKVESKENQDNSLLKQLYITCLKEAKAAIV 2056
            D IR+S+ S  S+RCPPLAVLHTI+ +G+  K+ESK + DN  L  L+ +C++E K A++
Sbjct: 38   DEIRISYFSEASERCPPLAVLHTITASGICFKMESKSS-DNVQLHLLHSSCIRENKTAVM 96

Query: 2055 PLQ-GRELLLVAMLSKSRVDQLPCFWGYDVSSVLYNACLGMLNLRCLAIVFDLDETLIVA 1879
             L    EL LVAM S++   Q PCFW + V S LYN+CL MLNLRCL IVFDLDETLIVA
Sbjct: 97   LLGLTEELHLVAMYSRNNEKQYPCFWAFSVGSGLYNSCLTMLNLRCLGIVFDLDETLIVA 156

Query: 1878 NTLRSFEDKIDAIQRKLSAESDPLRVQGMVAEVRRYQEDRAMLKQYADADQIVDSNGVVL 1699
            NT+RSFED+I+A+ RK+S E DP R+ GM AEV+RYQ+D+ +LKQYA+ DQ V+ NG V+
Sbjct: 157  NTMRSFEDRIEALLRKISTEVDPQRIAGMQAEVKRYQDDKNILKQYAENDQ-VNENGKVI 215

Query: 1698 KAVTEFIPPHSDRGQSISRPLIRIQDKNMIFTRINPNIRDTSVLVRLRPAWEELRTYLTL 1519
            K  +E +P  SD  Q++ RPLIR+Q+KN+I TRINP IRDTSVLVRLRPAWE+LR+YLT 
Sbjct: 216  KVQSEVVPALSDSHQALVRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTA 275

Query: 1518 KGRKRFEVYICTMSERDYALEMWRLLDGESRLINHGELLDRVLCVKPGSRKALGNVFRVG 1339
            +GRKRFEVY+CTM+ERDYALEMWRLLD ES LIN  ELLDR++CVK GSRK+L NVF+ G
Sbjct: 276  RGRKRFEVYVCTMAERDYALEMWRLLDPESNLINTKELLDRIVCVKSGSRKSLFNVFQDG 335

Query: 1338 FCHPKLAMVIDDRLKVWEDRDQPRVHVVPPFAPYYAPQAETSCPVPVLCVARNVACNVRG 1159
             CHPK+A+VIDDRLKVW+++DQ RVHVVP FAPYYAPQAE +  +PVLCVARN+ACNVRG
Sbjct: 336  TCHPKMALVIDDRLKVWDEKDQSRVHVVPAFAPYYAPQAEANNAIPVLCVARNIACNVRG 395

Query: 1158 GFFKDFDEVLLQKLSEVVYEFNVESLPPTPDVSNYLLTDDD---LNVNKDLPLPEGMADS 988
            GFFK+FDE LLQ++ E+ YE +V+ +P  PDVSNYL+++DD    N  KD    +GMAD+
Sbjct: 396  GFFKEFDEGLLQRIPEISYEDDVKEIPSPPDVSNYLVSEDDAATANGIKDPLSFDGMADA 455

Query: 987  ELEKKLGPQETNQVSLPNIISSGIEN-----QPVVRLCDSTQPTFIPLVANHGRIPLAAP 823
            E+E++L       ++    ISS + N      P      S+  T     +    +PLA  
Sbjct: 456  EVERRL----KEAIAASATISSAVANLDPRLAPFQYTMPSSSSTTTLPTSQAAVMPLA-- 509

Query: 822  XXXXXXXXXXXLPIIPTVKPYMPQSHV-RLDCGIQTSPPREEGEVPESEVDPDTRRRLLI 646
                        P    VKP     HV   +  +Q+SP REEGEVPESE+DPDTRRRLLI
Sbjct: 510  -------NMQFPPATSLVKPL---GHVGPPEQCLQSSPAREEGEVPESELDPDTRRRLLI 559

Query: 645  LQHGQDTGKFNGADPGLPARLQVTAPPNQVP--GGWLGAEEEMSPRQIVRTAQGPAP-QQ 475
            LQHG DT +   ++   PAR Q+     +VP  G W   EEEMSPRQ+ R      P   
Sbjct: 560  LQHGMDTRENAPSEAPFPARTQMQVSVPRVPSRGSWFPVEEEMSPRQLNRAVPKEFPLNS 619

Query: 474  ESFGVDKPRFPQTSLYPGSENPLVIDRGLDESKRMMTDEASFSEERLKSNHIIPXXXXXX 295
            E+  ++K R P  S +P  EN +  DR      + M  EA   ++RL+ NH +       
Sbjct: 620  EAMQIEKHRPPHPSFFPKIENSITSDR--PHENQRMPKEALRRDDRLRLNHTL-SDYQSF 676

Query: 294  XXXXXSLNKTSSNCKTA---QGNLLSASNCIS-ALYKIAEYCNTKVDFRSWLSSARELEF 127
                  L+++SS+ +      G  +S++   S  L  IA  C TKV+FR  L ++ EL+F
Sbjct: 677  SGEEIPLSRSSSSSRDVDFESGRDVSSTETPSGVLQDIAMKCGTKVEFRPALVASTELQF 736

Query: 126  SVEVLFDGKKISVAVGKTKKEAQQKASFDALKNMASQYMSQV 1
            S+E  F G+KI   +G+T++EAQ++A+  ++K++A+ Y+ +V
Sbjct: 737  SIEAWFAGEKIGEGIGRTRREAQRQAAEGSIKHLANVYVLRV 778


>ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Populus trichocarpa]
            gi|550327613|gb|ERP55122.1| hypothetical protein
            POPTR_0011s04910g [Populus trichocarpa]
          Length = 990

 Score =  679 bits (1753), Expect = 0.0
 Identities = 372/774 (48%), Positives = 511/774 (66%), Gaps = 26/774 (3%)
 Frame = -3

Query: 2253 LINGIRDTIRVSHLSAQSDRCPPLAVLHTISPTGVFIKVESK--------ENQDNSLLKQ 2098
            +I+ I   IR+SH S  S+RCPPLAVLHTI+  GV  K+E           +Q  S L+ 
Sbjct: 37   VIDEIVKGIRISHFSQASERCPPLAVLHTITSIGVCFKMEESTASSSTKISSQQESPLRL 96

Query: 2097 LYITCLKEAKAAIVPLQGRELLLVAMLSKSRVDQLPCFWGYDVSSVLYNACLGMLNLRCL 1918
            L+ +C++E K A++ L G EL LVAM S+S   + PCFWG++V+S LY++CL MLNLRCL
Sbjct: 97   LHSSCIQENKTAVMLLGGEELHLVAMPSRSNERKHPCFWGFNVASGLYDSCLVMLNLRCL 156

Query: 1917 AIVFDLDETLIVANTLRSFEDKIDAIQRKLSAESDPLRVQGMVAEVRRYQEDRAMLKQYA 1738
             IVFDLDETLIVANT+RSFEDKI+A+Q+K+S E D  R+  +++E++RYQ+D+ +LKQY 
Sbjct: 157  GIVFDLDETLIVANTMRSFEDKIEALQKKISTEVDQQRILAIISEIKRYQDDKIILKQYV 216

Query: 1737 DADQIVDSNGVVLKAVTEFIPPHSDRGQSISRPLIRIQDKNMIFTRINPNIRDTSVLVRL 1558
            + DQ+++ NG V+K   E +P  SD  Q + RPLIR+ +KN+IFTRINP IRDTSVLVRL
Sbjct: 217  ENDQVIE-NGKVIKTQFEVVPAASDNHQPLVRPLIRLPEKNIIFTRINPQIRDTSVLVRL 275

Query: 1557 RPAWEELRTYLTLKGRKRFEVYICTMSERDYALEMWRLLDGESRLINHGELLDRVLCVKP 1378
            RPAWE+LR+YLT +GRKRFEVY+CTM+ERDYALEMWRLLD ES LIN  ELLDR++CV  
Sbjct: 276  RPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESNLINSNELLDRIVCVSS 335

Query: 1377 GSRKALGNVFRVGFCHPKLAMVIDDRLKVWEDRDQPRVHVVPPFAPYYAPQAETSCPVPV 1198
            GSRK+L NVF+ G CHPK+A+VIDDR+ VW+++DQ RVHVVP FAPYYAPQAE +  VP+
Sbjct: 336  GSRKSLFNVFQDGICHPKMALVIDDRMNVWDEKDQSRVHVVPAFAPYYAPQAEANNAVPI 395

Query: 1197 LCVARNVACNVRGGFFKDFDEVLLQKLSEVVYEFNVESLPPTPDVSNYLLTDDD---LNV 1027
            LCVARNVACNVRGGFFK+FDE LLQK+ EV YE +  ++P  PDVSNYL+++DD    N 
Sbjct: 396  LCVARNVACNVRGGFFKEFDEGLLQKIPEVAYEDDTSNIPSPPDVSNYLVSEDDASAANG 455

Query: 1026 NKDLPLPEGMADSELEKKLGPQETNQVSLPNIISSGIEN---------QPVVRLCDSTQP 874
            N+D P  +  AD+E+E++L    +   ++P+ I S + +         Q  V    S  P
Sbjct: 456  NRDPPSFDSTADAEVERRLKEAVSASSTIPSTIPSTVSSLDPRLLQSLQYAVASSSSLMP 515

Query: 873  TFIP-LVANHGRIPLAAPXXXXXXXXXXXLPIIPTVKPYMPQSHVRLDCGIQTSPPREEG 697
               P ++A+   +P A+              + P VK      H   +  +Q+SP REEG
Sbjct: 516  ASQPSMLASQQPVP-ASQTSMMPFPNTQFPQVAPLVKQLGQVVHP--EPSLQSSPAREEG 572

Query: 696  EVPESEVDPDTRRRLLILQHGQDTGKFNGADPGLPAR--LQVTAPPNQVPGGWLGAEEEM 523
            EVPESE+DPDTRRRLLILQHGQD+     ++   PAR    V+A   Q  G W+  EEEM
Sbjct: 573  EVPESELDPDTRRRLLILQHGQDSRDNAPSESPFPARPSAPVSAAHVQSRGSWVPVEEEM 632

Query: 522  SPRQIVRTAQGPAPQQESFGVDKPRFPQTSLYPGSENPLVIDRGLDESKRMMTDEASFSE 343
            +PRQ+ RT +      +   ++K +    S +P  E+ +  DR + E++R +  EA +  
Sbjct: 633  TPRQLNRTPREFPLDSDPMNIEKHQTHHPSFFPKVESNIPSDRMIHENQR-LPKEAPYRN 691

Query: 342  ERLKSNHIIPXXXXXXXXXXXSLNKTSSNCK---TAQGNLLSASNCISALYKIAEYCNTK 172
            +R++ NH  P            L+++SSN      ++     +   +  L +IA  C TK
Sbjct: 692  DRMRLNHSTP-NYHSFQVEETPLSRSSSNRDLDLESERAFTISETPVEVLQEIAMKCETK 750

Query: 171  VDFRSWLSSARELEFSVEVLFDGKKISVAVGKTKKEAQQKASFDALKNMASQYM 10
            V+FR  L ++ +L+FS+E  F G+K+    GKT++EAQ++A+  ++K +A  YM
Sbjct: 751  VEFRPALVASIDLQFSIEAWFAGEKVGEGTGKTRREAQRQAAEGSIKKLAGIYM 804


>ref|XP_004505032.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Cicer arietinum]
          Length = 951

 Score =  676 bits (1745), Expect = 0.0
 Identities = 379/757 (50%), Positives = 504/757 (66%), Gaps = 15/757 (1%)
 Frame = -3

Query: 2229 IRVSHLSAQSDRCPPLAVLHTISPTGVFIKVESKENQDNSLLKQLYITCLKEAKAAIVPL 2050
            IR+SH +  S+RC PLAVLHTI+ +GV  K+ESK  Q + L   L+  C +E K A++PL
Sbjct: 32   IRISHFTQPSERCLPLAVLHTITSSGVCFKMESKTQQQDPLF-HLHNLCFRENKTAVMPL 90

Query: 2049 QGRELLLVAMLSKSRVDQLPCFWGYDVSSVLYNACLGMLNLRCLAIVFDLDETLIVANTL 1870
             G E+ LVAM S+S  +  PCFWGY V   LYN+CL MLNLRCL IVFDLDETLIVANT+
Sbjct: 91   CGEEMHLVAMHSRS--NGRPCFWGYIVGMGLYNSCLMMLNLRCLGIVFDLDETLIVANTM 148

Query: 1869 RSFEDKIDAIQRKLSAESDPLRVQGMVAEVRRYQEDRAMLKQYADADQIVDSNGVVLKAV 1690
            RSFED+IDA+QRK+++E DP R+ GM AEV+RY ED+++LKQY + DQ+VD NG VLKA 
Sbjct: 149  RSFEDRIDALQRKINSEVDPQRISGMQAEVKRYLEDKSILKQYVENDQVVD-NGKVLKAQ 207

Query: 1689 TEFIPPHSDRGQSISRPLIRIQDKNMIFTRINPNIRDTSVLVRLRPAWEELRTYLTLKGR 1510
            +E +P  SD  Q I RPLIR+ +KN+I TRINP IRDTSVLVRLRPAWE+LR+YLT +GR
Sbjct: 208  SELVPALSDSHQPIVRPLIRLHEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGR 267

Query: 1509 KRFEVYICTMSERDYALEMWRLLDGESRLINHGELLDRVLCVKPGSRKALGNVFRVGFCH 1330
            KRFEVY+CTM+ERDYALEMWRLLD +S LIN  ELL R++CVK G +K+L NVF+ G CH
Sbjct: 268  KRFEVYVCTMAERDYALEMWRLLDPDSNLINSKELLGRIVCVKSGLKKSLFNVFQDGLCH 327

Query: 1329 PKLAMVIDDRLKVWEDRDQPRVHVVPPFAPYYAPQAETSCPVPVLCVARNVACNVRGGFF 1150
            PK+A+VIDDRLKVW+++DQPRVHVVP FAPYYAPQAE S  +PVLCVARNVACNVRGGFF
Sbjct: 328  PKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEASNTIPVLCVARNVACNVRGGFF 387

Query: 1149 KDFDEVLLQKLSEVVYEFNVESLPPTPDVSNYLLTDDDLN---VNKDLPLPEGMADSELE 979
            KDFD+ LLQK+S++ YE N   + P PDVSNYL+++DD +    N+D    +GMAD+E+E
Sbjct: 388  KDFDDGLLQKISQIAYENNTRDISPAPDVSNYLVSEDDGSASYANRDPFAFDGMADAEVE 447

Query: 978  KKLGPQETNQVSLPNIISSGIENQPVVRLCDSTQPTFIPLVANHGRIPLAAPXXXXXXXX 799
            +KL    +   ++P + ++ ++     RL  S Q T   +V+    +P AA         
Sbjct: 448  RKLKDAISAASAIP-MTTAKLD----PRLTSSLQYT---MVSPGSVLPPAAQASMIPLPH 499

Query: 798  XXXLPIIPTVKPYMPQSHVRLDCGIQTSPPREEGEVPESEVDPDTRRRLLILQHGQDTGK 619
                     VKP    +   L   + +SP REEGEVPESE+DPDTRRRLLILQHGQD   
Sbjct: 500  TQFPQPATLVKPIGQVAPSEL--SLHSSPAREEGEVPESELDPDTRRRLLILQHGQDNRD 557

Query: 618  FNGADPGLPAR--LQVTA--PPNQVPGGWLGAEEEMS---PRQIV--RTAQGPAPQQESF 466
               ++P  P +  +QV+A  PP    GGW   EEE+    P +++    A    P +   
Sbjct: 558  HTSSEPPFPLKHPVQVSARVPPR---GGWFPVEEEIGSQPPNRVIPKEIALDSGPSR--- 611

Query: 465  GVDKPRFPQTSLYPGSENPLVIDRGLDESKRMMTDEASFSEERLKSNHIIPXXXXXXXXX 286
             ++K R  Q   +P  +  +  DR L E+ + +  E    ++R + +H++          
Sbjct: 612  -IEKHRLHQQPFFPKVDGSISSDRALHETNQRLPKEMYHRDDRSRVSHMLSSYPSLSGDD 670

Query: 285  XXSLNKTSSN---CKTAQGNLLSASNCISALYKIAEYCNTKVDFRSWLSSARELEFSVEV 115
                  +SS+      +  ++ +A      L +IA  C TKV+F S L+++REL+FS+E 
Sbjct: 671  TPFGRSSSSHRDFDSESGHSVFNAETPAIVLQEIALKCGTKVEFTSSLAASRELQFSIEA 730

Query: 114  LFDGKKISVAVGKTKKEAQQKASFDALKNMASQYMSQ 4
             F GKKI    G+T+ EAQ KA+ D++K++A  Y+S+
Sbjct: 731  WFSGKKIGHGFGRTRMEAQYKAAEDSIKHLADIYLSR 767


>ref|XP_003543063.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
            gi|571500215|ref|XP_006594604.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 1-like
            isoform X2 [Glycine max]
          Length = 960

 Score =  658 bits (1697), Expect = 0.0
 Identities = 374/762 (49%), Positives = 500/762 (65%), Gaps = 21/762 (2%)
 Frame = -3

Query: 2229 IRVSHLSAQSDRCPPLAVLHTISPTGVFIKVES---KENQDNSLLKQLYITCLKEAKAAI 2059
            IR+SH S  S+RCPPLAVLHTI+  G+  K+ES   ++ Q    L  L+ +C++E K A+
Sbjct: 31   IRISHFSQPSERCPPLAVLHTITSFGICFKMESSTSQKRQQQDALFHLHSSCIRENKTAV 90

Query: 2058 VPLQGRELLLVAMLSKSRVDQLPCFWGYDVSSVLYNACLGMLNLRCLAIVFDLDETLIVA 1879
            +P++G E+ LVAM S++  +  PCFWG+ V+S LYN+CL MLNLRCL IVFDLDETL+VA
Sbjct: 91   MPVRGEEIHLVAMYSRN--NDRPCFWGFIVASGLYNSCLTMLNLRCLGIVFDLDETLVVA 148

Query: 1878 NTLRSFEDKIDAIQRKLSAESDPLRVQGMVAEVRRYQEDRAMLKQYADADQIVDSNGVVL 1699
            NT+RSFEDKI+ + RK+++E +P ++  M AE++RY +D+ +LK+YA+ DQ+VD NG V+
Sbjct: 149  NTMRSFEDKIEVLHRKMNSEVNPQQISAMQAEIKRYLDDKNILKEYAENDQVVD-NGKVI 207

Query: 1698 KAVTEFIPPHSDRGQSISRPLIRIQDKNMIFTRINPNIRDTSVLVRLRPAWEELRTYLTL 1519
            K  +E +P  SD  Q I RPLIR+Q+KN+I TRINP IRDTSVLVRLRPAWE+LR+YLT 
Sbjct: 208  KIQSESVPALSDSHQPIVRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTA 267

Query: 1518 KGRKRFEVYICTMSERDYALEMWRLLDGESRLINHGELLDRVLCVKPGSRKALGNVFRVG 1339
            +GRKRFEV++CTM+ERDYALEMWRLLD E  LIN  ELLDR++CVK G +K+L NVF+ G
Sbjct: 268  RGRKRFEVFVCTMAERDYALEMWRLLDPELNLINSKELLDRIVCVKSGLKKSLFNVFQNG 327

Query: 1338 FCHPKLAMVIDDRLKVWEDRDQPRVHVVPPFAPYYAPQAETSCPVPVLCVARNVACNVRG 1159
             CH K+A+VIDDRLKVW+++DQPRVHVVP FAPYY PQAE S  VP LC+ARNVACNVRG
Sbjct: 328  LCHLKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYTPQAEASNAVPFLCLARNVACNVRG 387

Query: 1158 GFFKDFDEVLLQKLSEVVYEFNVESLPPTPDVSNYLLTDDDL---NVNKDLPLPEGMADS 988
            GFFKDFD+ LLQK+  + YE +++ + P+PDVSNYL+++DD    N NK+L L +GMAD+
Sbjct: 388  GFFKDFDDGLLQKIPLIAYEDDIKDI-PSPDVSNYLVSEDDASASNGNKNLLLFDGMADA 446

Query: 987  ELEKKLGPQETNQVSLPNIISSGIEN-QPVVRLCDSTQPTFIPLVANHGRIPLAAPXXXX 811
            E+E++L     + +S  + I +   N  P +    S Q T   +V++ G +P        
Sbjct: 447  EVERRL----KDAISASSTILALTANIDPRLAFTSSLQYT---MVSSSGTVPPPTAQASV 499

Query: 810  XXXXXXXLPIIPT-VKPYMPQSHVRLDCGIQTSPPREEGEVPESEVDPDTRRRLLILQHG 634
                    P   T VKP    +H  L   + +SP REEGE+PESE+D DTRRR LILQHG
Sbjct: 500  VQFGNVQFPQPNTLVKPMSQVTHPGL--SLHSSPAREEGELPESELDLDTRRRFLILQHG 557

Query: 633  QDTGKFNGADPGLPAR--LQVTAPPNQVPG--GWLGAEEEMSPRQIVRTAQGPAP-QQES 469
            QDT +   ++P  P R   QV+AP + VP   GW   EEEM P+Q+        P   E 
Sbjct: 558  QDTRERMASEPPFPVRHPAQVSAPASSVPSRRGWFSVEEEMGPQQLNLPVPKEFPVDSEP 617

Query: 468  FGVDK--PRFPQTSLYPGSENPLVIDRGLDESKRMMTDEASFSEER------LKSNHIIP 313
            F ++K  PR P  S +    + +  DR   ES + +  E    ++R      L S H +P
Sbjct: 618  FHIEKRWPRHP--SFFSKVGDSISSDRVFHESHQRLPKEVHHRDDRSRLSQSLSSYHSLP 675

Query: 312  XXXXXXXXXXXSLNKTSSNCKTAQGNLLSASNCISALYKIAEYCNTKVDFRSWLSSAREL 133
                       S     S    +  +L  A      L +IA  C TKV+F S L ++ EL
Sbjct: 676  GDDIPLSGSSYSNRDFDSE---SGRSLFHADTTAGVLQEIALNCGTKVEFLSSLVASTEL 732

Query: 132  EFSVEVLFDGKKISVAVGKTKKEAQQKASFDALKNMASQYMS 7
            +FS+E  F GKKI    G+T++EAQ KA+  ++K +A  YMS
Sbjct: 733  QFSIEAWFAGKKIGEGFGRTRREAQSKAAGCSIKQLADIYMS 774


>ref|XP_004976316.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Setaria italica]
          Length = 944

 Score =  657 bits (1695), Expect = 0.0
 Identities = 363/754 (48%), Positives = 487/754 (64%), Gaps = 13/754 (1%)
 Frame = -3

Query: 2229 IRVSHLSAQSDRCPPLAVLHTISPTGVFIKVESKE----NQDNSLLKQLYITCLKEAKAA 2062
            IRV  LS  S+RCPPLAVLH ++     + +ES+     ++    L  ++  CL   K A
Sbjct: 32   IRVDRLSPPSERCPPLAVLHAVAAGARCLVMESRPTSTADEPPPPLVAMHTACLSGNKTA 91

Query: 2061 IVPLQGRELLLVAMLSKSRVDQLPCFWGYDVSSVLYNACLGMLNLRCLAIVFDLDETLIV 1882
            + PL   E+ LVAM SK  +    CFWGY V   LYN+CL MLNLRCL IVFDLDETL+V
Sbjct: 92   VFPLGAEEIHLVAMTSKRNLPNHACFWGYKVPLGLYNSCLTMLNLRCLGIVFDLDETLVV 151

Query: 1881 ANTLRSFEDKIDAIQRKLSAESDPLRVQGMVAEVRRYQEDRAMLKQYADADQIVDSNGVV 1702
            ANT RSFEDKIDA+QRKLS E+DP R+ GM+AE++RYQ+D+++LKQY ++DQ++D  G V
Sbjct: 152  ANTSRSFEDKIDAVQRKLSNETDPQRISGMLAEIKRYQDDKSILKQYIESDQVIDG-GEV 210

Query: 1701 LKAVTEFIPPHSDRGQS-ISRPLIRIQDKNMIFTRINPNIRDTSVLVRLRPAWEELRTYL 1525
             KA +E IPP +D  Q  ++RP+IR+Q+K++I TRINP+IRDTSVLVRLRPAW++LR YL
Sbjct: 211  YKAQSEVIPPLADNHQQPMTRPIIRLQEKHIILTRINPSIRDTSVLVRLRPAWDDLRNYL 270

Query: 1524 TLKGRKRFEVYICTMSERDYALEMWRLLDGESRLINHGELLDRVLCVKPGSRKALGNVFR 1345
              +GRKRFEVY+CTM+ERDYALEMWRLLD +S+LIN  +LLDR++CVK GSRK+L NVF 
Sbjct: 271  IARGRKRFEVYVCTMAERDYALEMWRLLDPDSKLINSVQLLDRLVCVKSGSRKSLLNVFH 330

Query: 1344 VGFCHPKLAMVIDDRLKVWEDRDQPRVHVVPPFAPYYAPQAETSCPVPVLCVARNVACNV 1165
             G CHP++A+VIDDRLKVW ++DQ RVHVVP FAPYYAPQAE + P+PVLCVARNVACNV
Sbjct: 331  DGSCHPRMALVIDDRLKVWNEKDQHRVHVVPAFAPYYAPQAEANFPIPVLCVARNVACNV 390

Query: 1164 RGGFFKDFDEVLLQKLSEVVYEFNVESLPPTPDVSNYLLTDDD----LNVNKDLPLPEGM 997
            RGGFFK+FDE +L ++SEV YE  ++ +P  PDVSNYL+++D+    +N+NKD    +GM
Sbjct: 391  RGGFFKEFDEGILPQISEVRYEDEMDGIPSAPDVSNYLISEDENSAIININKDPHAIDGM 450

Query: 996  ADSELEKKLGPQETNQVSLPNIISSGIENQPVVRLCDSTQPTFIPLVANHGRIPLAAPXX 817
            AD+E+EK++  + ++     N I++ I+   V        PT           P+AAP  
Sbjct: 451  ADAEVEKRM-KEASSCFQATNPITTDIDVMSVAAKQHFVTPT-------SSSTPIAAPPG 502

Query: 816  XXXXXXXXXLPIIPTVKPYMPQSHVRLDCGIQTSPPREEGEVPESEVDPDTRRRLLILQH 637
                     LP  P+     P +   L    Q SP REEGEVPESE+DPDTRRRLLILQH
Sbjct: 503  IIMPLNNEHLPQPPSFS--WPVTLSGLVDPSQGSPAREEGEVPESELDPDTRRRLLILQH 560

Query: 636  GQDTGKFNGADPGLPARLQVTAPPNQVPGGWLGAEEEMSPRQIVRTAQGPAPQQESFGVD 457
            GQDT +     P  P   QV+ PP Q  G WL  E+EM+PR + + +     + +S   D
Sbjct: 561  GQDTREAAQPFPDRPP-AQVSVPPVQSHGNWLSLEDEMNPRNLNKASTEFHLESDSVNYD 619

Query: 456  KPRFPQTSLYPGSENPLVIDRGLDESKRMMTDEASFSEERLKSNHIIPXXXXXXXXXXXS 277
              +    S +P  +NP+  DR   +++R         + R+  NH  P           +
Sbjct: 620  NKQPQHPSYFPDGDNPISADRHSYKNQRYPPRPLHNEDHRMLHNH-APATYRSFSGEDIA 678

Query: 276  LNKTSSNCKTAQGN----LLSASNCISALYKIAEYCNTKVDFRSWLSSARELEFSVEVLF 109
                 S  ++ Q       +     +  L +IA  C  KV++RS L    +L+FS+EV  
Sbjct: 679  TRHAPSRQRSRQMESGRYFIQHGGILGVLEEIAVKCGFKVEYRSTLCDTTDLQFSIEVWI 738

Query: 108  DGKKISVAVGKTKKEAQQKASFDALKNMASQYMS 7
             G+KI    GKT+KEAQ +A+  +L+N+A +++S
Sbjct: 739  FGEKIGEGFGKTRKEAQCQAADTSLRNLADKFLS 772


>ref|XP_006413749.1| hypothetical protein EUTSA_v10024324mg [Eutrema salsugineum]
            gi|557114919|gb|ESQ55202.1| hypothetical protein
            EUTSA_v10024324mg [Eutrema salsugineum]
          Length = 963

 Score =  655 bits (1689), Expect = 0.0
 Identities = 356/757 (47%), Positives = 493/757 (65%), Gaps = 14/757 (1%)
 Frame = -3

Query: 2229 IRVSHLSAQSDRCPPLAVLHTISPTGVFIKVESKENQDNSLLKQLYITCLKEAKAAIVPL 2050
            IR+SH S  S+RCPPLAVL T+S  G+  K+E+  +     L   Y +CL++ K A++ L
Sbjct: 51   IRISHFSQPSERCPPLAVLTTVSSCGLCFKLEASASPAQEPLSLFYSSCLRDNKTAVMLL 110

Query: 2049 QGRELLLVAMLSKSRVDQLPCFWGYDVSSVLYNACLGMLNLRCLAIVFDLDETLIVANTL 1870
               EL LVAM S++  +  PCFWG+ V+  +Y++CL MLNLRCL IVFDLDETL+VANT+
Sbjct: 111  GDEELHLVAMYSENIKNDRPCFWGFSVAPGIYDSCLVMLNLRCLGIVFDLDETLVVANTM 170

Query: 1869 RSFEDKIDAIQRKLSAESDPLRVQGMVAEVRRYQEDRAMLKQYADADQIVDSNGVVLKAV 1690
            RSFED+I+ +QR+++ E DP R+  M AE++RYQ+D+ +LKQY ++DQ+++ NG V+K  
Sbjct: 171  RSFEDRIEVLQRRINNEVDPQRIAVMGAEMKRYQDDKNLLKQYVESDQVIE-NGEVIKVQ 229

Query: 1689 TEFIPPHSDRGQSISRPLIRIQDKNMIFTRINPNIRDTSVLVRLRPAWEELRTYLTLKGR 1510
            +E +P  SD  Q + RPLIR+Q+KN+I TRINP IRDTSVLVRLRP+WEELR+YLT KGR
Sbjct: 230  SEIVPALSDNHQPLVRPLIRLQEKNIILTRINPMIRDTSVLVRLRPSWEELRSYLTAKGR 289

Query: 1509 KRFEVYICTMSERDYALEMWRLLDGESRLINHGELLDRVLCVKPGSRKALGNVFRVGFCH 1330
            KRFEVY+CTM+ERDYALEMWRLLD E  LIN  +LL R++CVKPG +K+L NVF    CH
Sbjct: 290  KRFEVYVCTMAERDYALEMWRLLDPEGNLINVNDLLTRIVCVKPGLKKSLFNVFLDATCH 349

Query: 1329 PKLAMVIDDRLKVWEDRDQPRVHVVPPFAPYYAPQAETSCPVPVLCVARNVACNVRGGFF 1150
            PK+A+VIDDRLKVW+++DQPRVHVVP FAPYY+PQAE +   PVLCVARNVAC VRGGFF
Sbjct: 350  PKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYSPQAEAAA-TPVLCVARNVACGVRGGFF 408

Query: 1149 KDFDEVLLQKLSEVVYEFNVESLPPTPDVSNYLLTDDD---LNVNKDLPLPEGMADSELE 979
            +DFD+ LLQ+++E+ YE +VE +P  PDVS+YL+++D+   LN NKD    +GMAD+E+E
Sbjct: 409  RDFDDSLLQRIAEISYENDVEDIPSPPDVSHYLVSEDETSGLNGNKDPLTFDGMADAEVE 468

Query: 978  KKLGPQETNQVSLPNIISSGIENQPVVRLCDSTQPTFIPLV-ANHGRIPLAAPXXXXXXX 802
            ++L       +S  +++       P +     + P   P+  A+   +P+  P       
Sbjct: 469  RRL----KEAISASSVVLPAANIDPRI-----SAPVQYPMASASSVSVPIPVPVPVVQQA 519

Query: 801  XXXXLPIIPTVKPYMP----QSHVRLDCGIQTSPPREEGEVPESEVDPDTRRRLLILQHG 634
                    P+++   P    +  +  +  +Q+SP REEGEVPESE+DPDTRRRLLILQHG
Sbjct: 520  PQPSAMAFPSIQFQQPTPIAKHMLPSEPSLQSSPAREEGEVPESELDPDTRRRLLILQHG 579

Query: 633  QDTGKFNGADPGLPARLQVTAPPNQVP--GGWLGAEEEMSPRQIVRTAQGPAP-QQESFG 463
            QDT     ++P  P R  V APP  V    GW   EEEM    + RT     P   E   
Sbjct: 580  QDTRDPAPSEPPFPQRPPVQAPPPHVQPRNGWFPVEEEMDQAPLRRTVSKEYPLDSEMIH 639

Query: 462  VDKPRFPQTSLYPGSENPLVIDRGLDESKRMMTDEASFSEERLKSNHIIPXXXXXXXXXX 283
            ++K R    S +   +N    DR L E++R    E+   +E+L+SN+ +P          
Sbjct: 640  MEKNRPRHPSFFSKIDNSTQSDRMLHENRR-PPKESLRRDEQLRSNNNLPGSHSFFGEEA 698

Query: 282  XSLNKTSSNCKT---AQGNLLSASNCISALYKIAEYCNTKVDFRSWLSSARELEFSVEVL 112
                 +S N      +  N+ +A N    L+ IA  C TKV+++  L ++ +L FSVE  
Sbjct: 699  SWNQSSSRNSDVDFISGRNVQAAENPAEVLHDIAVKCGTKVEYKPGLVASTDLRFSVETW 758

Query: 111  FDGKKISVAVGKTKKEAQQKASFDALKNMASQYMSQV 1
              G+KI   +GK+++EA  KA+  +++N+A  Y+S+V
Sbjct: 759  LSGEKIGEGIGKSRREALHKAAEVSIQNLADVYLSRV 795

