BLASTX nr result

ID: Mentha28_contig00012040 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00012040
         (2516 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU27926.1| hypothetical protein MIMGU_mgv1a000848mg [Mimulus...   987   0.0  
ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal doma...   826   0.0  
ref|XP_004232844.1| PREDICTED: RNA polymerase II C-terminal doma...   819   0.0  
ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citr...   805   0.0  
ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal doma...   805   0.0  
ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform...   799   0.0  
ref|XP_007025681.1| C-terminal domain phosphatase-like 1 isoform...   763   0.0  
ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal doma...   754   0.0  
ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Popu...   750   0.0  
ref|XP_007214548.1| hypothetical protein PRUPE_ppa000988mg [Prun...   746   0.0  
ref|XP_002519032.1| double-stranded RNA binding protein, putativ...   745   0.0  
ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Popu...   741   0.0  
ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal doma...   728   0.0  
ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal doma...   724   0.0  
emb|CAN72816.1| hypothetical protein VITISV_004100 [Vitis vinifera]   722   0.0  
ref|XP_007159305.1| hypothetical protein PHAVU_002G226900g [Phas...   714   0.0  
ref|XP_004505032.1| PREDICTED: RNA polymerase II C-terminal doma...   712   0.0  
ref|XP_003543063.1| PREDICTED: RNA polymerase II C-terminal doma...   706   0.0  
ref|XP_003545893.1| PREDICTED: RNA polymerase II C-terminal doma...   702   0.0  
ref|XP_006583810.1| PREDICTED: RNA polymerase II C-terminal doma...   697   0.0  

>gb|EYU27926.1| hypothetical protein MIMGU_mgv1a000848mg [Mimulus guttatus]
          Length = 962

 Score =  987 bits (2552), Expect = 0.0
 Identities = 508/701 (72%), Positives = 562/701 (80%), Gaps = 12/701 (1%)
 Frame = -1

Query: 2516 DLRTYLTAKGRKRFEVFVCTMAERDYALEMWRLLDPGSNLINSRDLLNRIVCVKSGCRKS 2337
            +LR YLTA+GRKRFEVFVCTMAERDYALEMWRLLDP  NLINSR+LL R+VCVKSG RKS
Sbjct: 263  ELRNYLTARGRKRFEVFVCTMAERDYALEMWRLLDPEFNLINSRELLERVVCVKSGFRKS 322

Query: 2336 LFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEVNNTVPVLCVAR 2157
            LFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAE NNT+PVLCVAR
Sbjct: 323  LFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVAR 382

Query: 2156 NVACNVRGCFFKDFDDGLLQRISEVAYEDDIRNAPSSPDVSNYLISEDDPSASNGIKDSN 1977
            NVACNVRG FFKDFDDGLLQ IS VAYEDDI++ PSSPDVSNYLISEDDPSAS G KDS 
Sbjct: 383  NVACNVRGGFFKDFDDGLLQLISGVAYEDDIKDVPSSPDVSNYLISEDDPSASGGNKDSL 442

Query: 1976 GFDGMADSEVERRLKET-STSSAASLPIANIDPRLTQALQYAVSSSSFTVXXXXXXXXXX 1800
             +DGMAD+EV+RRLK+  S SS A  PIAN+DP +   L Y   SSSFT           
Sbjct: 443  VYDGMADAEVQRRLKDAISASSTAPSPIANLDPIVASVLHYMAPSSSFTAPPPTTQGPAM 502

Query: 1799 PFTSQPFSQVG-MFKHPIAQLSQAETTVQSSPAREEGEVPESELDPDTRRRLLILQHGQD 1623
             F SQ   QV  + K P+ QL Q ETT +SSPAREEGEVPESELDPDTRRR+LILQHGQD
Sbjct: 503  SFPSQQMHQVATLLKPPLVQLGQGETTSRSSPAREEGEVPESELDPDTRRRMLILQHGQD 562

Query: 1622 MREPPPSEPQFPARPPMQASLPRAQTRGWFPVEEETTQGQLNRVA-PPNDFVLNAESNTI 1446
            MR P PSEPQFPAR PMQ S+PR Q  GWFPVEEE +  Q N+VA PP +F LN ES  I
Sbjct: 563  MRGPSPSEPQFPARTPMQVSVPRVQPHGWFPVEEEMSSRQPNQVALPPKEFPLNVESLPI 622

Query: 1445 DKIRAPHQPFLQKVEPSVPPGRVLLESQRLPKEAFSREDQLRLNQAVPDFPSFSGQDSPV 1266
            DK R  H PFLQ VEPS+PPGR+L ESQRLPKEA  REDQLRLNQ++PDF SF G+D+ V
Sbjct: 623  DKNRGHHSPFLQNVEPSIPPGRILPESQRLPKEAVPREDQLRLNQSLPDFHSFHGEDASV 682

Query: 1265 AQPPSASKDLDLEAGQIDPYTETCTGALQEIAFKCGTKVEFNQALVSSTELQFIVEVLFA 1086
            AQP SA+KD DLEAGQIDPY ETC GALQ+IAFKCGTKVEF Q L+SST LQF VEVLFA
Sbjct: 683  AQPSSANKDFDLEAGQIDPYIETCIGALQDIAFKCGTKVEFKQTLISSTGLQFFVEVLFA 742

Query: 1085 GEKIGQGIGRTXXXXXXXXXXESLVYLADKYLSQRRPD-SYVAGDGGRFTANQKENGFVS 909
            GE+IG+G+GRT           SL+YLADKYLS+ RPD +YV GDG R   NQKENGF S
Sbjct: 743  GERIGEGMGRTRREAQRQAAEGSLLYLADKYLSRSRPDFNYVPGDGSR-VGNQKENGFNS 801

Query: 908  DPNTSGYQSLPKEEGAPFSSA----RNLDPRIEPSKKPLGSSLAALKELCTMEGLSVAFQ 741
            + N+ GYQ LP EEG PFS+     R +DPR E SK+P+  S+ ALKE CTMEGL V FQ
Sbjct: 802  NANSFGYQPLPNEEGLPFSTVAAPPRIVDPRTEVSKRPIMGSITALKEFCTMEGLGVTFQ 861

Query: 740  TQPQFSAHPGQKNEVYAQVEINGQVLGKGIGLTWDEAKSEAAEKALGALKSMTVQFPYRH 561
            TQPQFSA+PGQ+NEVYAQVE+NGQVLGKGIGLTWDEA+S+AAEKAL  LKSM  QFPYRH
Sbjct: 862  TQPQFSANPGQRNEVYAQVEVNGQVLGKGIGLTWDEARSQAAEKALVTLKSMPGQFPYRH 921

Query: 560  QG-SPRSMHGVSSKRIKHDFSRVPQRM---GRYPRNGSPVP 450
            QG SPRSM  + +KR+K +F+RV QR+   GRYPRNGSPVP
Sbjct: 922  QGSSPRSMQSIPNKRVKQEFNRVSQRLPSFGRYPRNGSPVP 962


>ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Solanum tuberosum]
          Length = 953

 Score =  826 bits (2133), Expect = 0.0
 Identities = 445/699 (63%), Positives = 526/699 (75%), Gaps = 10/699 (1%)
 Frame = -1

Query: 2516 DLRTYLTAKGRKRFEVFVCTMAERDYALEMWRLLDPGSNLINSRDLLNRIVCVKSGCRKS 2337
            DLR+YLTA+GRKRFEV+VCTMAERDYALEMWRLLDP SNLINS++LL+RIVCVKSG RKS
Sbjct: 263  DLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNLINSQELLDRIVCVKSGLRKS 322

Query: 2336 LFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEVNNTVPVLCVAR 2157
            LFNVFQDGNCHPKMALVIDDRLKVWD+KDQPRVHVVPAFAPY+APQAE NN+VPVLCVAR
Sbjct: 323  LFNVFQDGNCHPKMALVIDDRLKVWDDKDQPRVHVVPAFAPYFAPQAEGNNSVPVLCVAR 382

Query: 2156 NVACNVRGCFFKDFDDGLLQRISEVAYEDDIRNAPSSPDVSNYLISEDDPSASNGIKDSN 1977
            NVACNVRG FFKDFD+GLLQRISEVAYEDDI+  PS+PDVSNYLISEDDPSA NG KDS 
Sbjct: 383  NVACNVRGGFFKDFDEGLLQRISEVAYEDDIKQVPSAPDVSNYLISEDDPSAVNGNKDSL 442

Query: 1976 GFDGMADSEVERRLKETSTSSAASLP--IANIDPRLTQALQYAVSSSSFTVXXXXXXXXX 1803
            GFDGMADSEVERRLKE   +S  S+P  + N+DPRL  ALQY V      +         
Sbjct: 443  GFDGMADSEVERRLKEAMLAS-TSVPSQMTNLDPRLVPALQYPVPP---VISQPSIQSPV 498

Query: 1802 XPFTSQPFSQV-GMFKHPIAQLSQAETTVQSSPAREEGEVPESELDPDTRRRLLILQHGQ 1626
             PF +Q   QV  + K  + Q+S  +T++QSSPAREEGEVPESELDPDTRRRLLILQHGQ
Sbjct: 499  VPFPTQHLPQVTSVLKSSVTQISPQDTSLQSSPAREEGEVPESELDPDTRRRLLILQHGQ 558

Query: 1625 DMREPPPSEPQFPARPPMQASL-PRAQTRGWFPVEEETTQGQLNRVAPPNDFVLNAESNT 1449
            D R+   SEP+FP   P+Q S+ PR Q  GWFP EEE +  QLNR  PP +F LN ES  
Sbjct: 559  DTRDQVSSEPKFPMGTPLQVSVPPRVQPHGWFPAEEEMSPRQLNRPLPPKEFPLNPESMH 618

Query: 1448 IDKIRAPHQPFLQKVEPSVPPGRVLLESQRLPKEAFSREDQLRLNQAVPDFPSFSGQDSP 1269
            I+K R PH PFL K+E S+P  RVL E+QRLPKE   R+D++R +Q+ P F    G++ P
Sbjct: 619  INKHRPPHPPFLPKMETSMPSDRVLFENQRLPKEVIPRDDRMRFSQSQPSFRP-PGEEVP 677

Query: 1268 VAQPPSASKDLDLEAGQIDPYTETCTGALQEIAFKCGTKVEFNQALVSSTELQFIVEVLF 1089
            + +  S+++ LDLE G  DPY ET  GALQ+IAFKCG KVEF  + +SS ELQF +EVLF
Sbjct: 678  LGRSSSSNRVLDLEPGHYDPYLETPAGALQDIAFKCGAKVEFRSSFLSSPELQFSLEVLF 737

Query: 1088 AGEKIGQGIGRTXXXXXXXXXXESLVYLADKYLSQRRPD-SYVAGDGGRFTANQKENGFV 912
            AGEK+G+G GRT          ESL+YLADKYLS  +PD S   GDG RF  N  +NGFV
Sbjct: 738  AGEKVGEGTGRTRREAQRRAAEESLMYLADKYLSCIKPDSSSTQGDGFRF-PNASDNGFV 796

Query: 911  SDPNTSGYQSLPKEEGAPFSSARNLDPRIEPSKKPLGSSLAALKELCTMEGLSVAFQTQP 732
             + +  GYQ       A     R LDPR+E  KK +G S+ AL+ELC +EGL +AFQTQP
Sbjct: 797  DNMSPFGYQDRVSHSFAS-EPPRVLDPRLEVFKKSVG-SVGALRELCAIEGLGLAFQTQP 854

Query: 731  QFSAHPGQKNEVYAQVEINGQVLGKGIGLTWDEAKSEAAEKALGALKSMTVQFPYRHQGS 552
            Q SA+PGQK+E+YAQVEI+GQV GKGIG TWD+AK++AAE+AL ALKS   QF  + QGS
Sbjct: 855  QLSANPGQKSEIYAQVEIDGQVFGKGIGSTWDDAKTQAAERALVALKSELAQFSQKRQGS 914

Query: 551  PRSM-HGVSSKRIKHDFSR-VPQRM---GRYPRNGSPVP 450
            PRS+  G S+KR+K ++SR V QR+   GR+P+N S +P
Sbjct: 915  PRSLQQGFSNKRLKPEYSRGVQQRVPLSGRFPKNTSAMP 953


>ref|XP_004232844.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Solanum lycopersicum]
          Length = 954

 Score =  819 bits (2115), Expect = 0.0
 Identities = 441/700 (63%), Positives = 524/700 (74%), Gaps = 11/700 (1%)
 Frame = -1

Query: 2516 DLRTYLTAKGRKRFEVFVCTMAERDYALEMWRLLDPGSNLINSRDLLNRIVCVKSGCRKS 2337
            DLR+YLTA+GRKRFEV+VCTMAERDYALEMWRLLDP SNLINS++LL+RIVCVKSG RKS
Sbjct: 263  DLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNLINSQELLDRIVCVKSGLRKS 322

Query: 2336 LFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEVNNTVPVLCVAR 2157
            LFNVFQDGNCHPKMALVIDDRLKVWD+KDQPRVHVVPAFAPY+APQAE NN+VPVLCVAR
Sbjct: 323  LFNVFQDGNCHPKMALVIDDRLKVWDDKDQPRVHVVPAFAPYFAPQAEGNNSVPVLCVAR 382

Query: 2156 NVACNVRGCFFKDFDDGLLQRISEVAYEDDIRNAPSSPDVSNYLISEDDPSASNGIKDSN 1977
            NVACNVRG FFKDFD+GLLQRISEVAYEDDI+  PS+PDVSNYLISEDDPSA NG KDS 
Sbjct: 383  NVACNVRGGFFKDFDEGLLQRISEVAYEDDIKQVPSAPDVSNYLISEDDPSAVNGNKDSL 442

Query: 1976 GFDGMADSEVERRLKETSTSSAASLP--IANIDPRLTQALQYAVSSSSFTVXXXXXXXXX 1803
            GFDGMADSEVERRLKE   +S  S+P  + N+DPRL  ALQY V      +         
Sbjct: 443  GFDGMADSEVERRLKEAMLAS-TSVPSQMTNLDPRLVPALQYPVPP---VISQPSIQGPV 498

Query: 1802 XPFTSQPFSQV-GMFKHPIAQLSQAETTVQSSPAREEGEVPESELDPDTRRRLLILQHGQ 1626
             PF +Q   QV  + K  + Q+S  +T++QSSPAREEGEVPESELDPDTRRRLLILQHGQ
Sbjct: 499  VPFPTQHLPQVTSVLKSSVTQISPQDTSLQSSPAREEGEVPESELDPDTRRRLLILQHGQ 558

Query: 1625 DMREPPPSEPQFPARPPMQASL-PRAQTRGWFPVEEETTQGQLNRVAPPNDFVLNAESNT 1449
            D R+   SEP+FP   P+Q S+ PR Q  GWFP EEE +  QLNR  PP +F LN ES  
Sbjct: 559  DTRDQVSSEPKFPIGTPLQVSVPPRVQPHGWFPAEEEVSPRQLNRPLPPKEFPLNPESMH 618

Query: 1448 IDKIRAPHQPFLQKVEPSVPPGRVLLESQRLPKEAFSREDQLRLNQAVPDFPSFSGQDSP 1269
            I+K R PH PFL K+E S+P  RV  E+QRLPKE   R+D++R +Q+ P F    G+D  
Sbjct: 619  INKHRPPHPPFLPKMETSMPSDRVFFENQRLPKEVIPRDDRMRFSQSQPSFRP-PGEDVS 677

Query: 1268 VAQPPSASKDLDLEAGQIDPYTETCTGALQEIAFKCGTKVEFNQALVSSTELQFIVEVLF 1089
            + +  S+++ LDL+ G  DPY +T  GALQ+IAFKCG KVEF  + +SS ELQF +EVLF
Sbjct: 678  LGRSSSSNRVLDLDPGHYDPYLDTPAGALQDIAFKCGVKVEFRSSFLSSPELQFCLEVLF 737

Query: 1088 AGEKIGQGIGRTXXXXXXXXXXESLVYLADKYLSQRRPD-SYVAGDGGRFTANQKENGFV 912
            AGEK+G+GIGRT          ESL+YLADKYLS  + D S   GDG RF  N  +NGFV
Sbjct: 738  AGEKVGEGIGRTRREAQRHAAEESLMYLADKYLSCIKADSSSTQGDGFRF-PNASDNGFV 796

Query: 911  SDPNTSGYQSLPKEEGAPFSSARNLDPRIEPSKKPLGSSLAALKELCTMEGLSVAFQTQP 732
             + +  GYQ       A     R LDPR+E  KK +G S+ AL+ELC +EGL +AFQTQP
Sbjct: 797  ENMSPFGYQDRVSHSFAS-EPPRVLDPRLEVFKKSVG-SVGALRELCAIEGLGLAFQTQP 854

Query: 731  QFSAHPGQKNEVYAQVEINGQVLGKGIGLTWDEAKSEAAEKALGALKSMTVQFPYRHQGS 552
            Q S +PGQK+E+YAQVEI+GQV GKGIG TWD+AK++AAE+AL ALKS   QF ++ QGS
Sbjct: 855  QLSVNPGQKSEIYAQVEIDGQVFGKGIGPTWDDAKTQAAERALVALKSELAQFSHKRQGS 914

Query: 551  PRSM--HGVSSKRIKHDFSR-VPQRM---GRYPRNGSPVP 450
            PRS+   G S+KR+K ++SR V QR+   GR+P+N S +P
Sbjct: 915  PRSLQQQGFSNKRLKPEYSRGVQQRVPLSGRFPKNTSAMP 954


>ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citrus clementina]
            gi|557551913|gb|ESR62542.1| hypothetical protein
            CICLE_v10014168mg [Citrus clementina]
          Length = 957

 Score =  805 bits (2079), Expect = 0.0
 Identities = 426/695 (61%), Positives = 514/695 (73%), Gaps = 6/695 (0%)
 Frame = -1

Query: 2516 DLRTYLTAKGRKRFEVFVCTMAERDYALEMWRLLDPGSNLINSRDLLNRIVCVKSGCRKS 2337
            DLR+YLTA+GRKRFEV+VCTMAERDYALEMWRLLDP SNLIN+++LL+RIVCVKSG RKS
Sbjct: 268  DLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESNLINTKELLDRIVCVKSGSRKS 327

Query: 2336 LFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEVNNTVPVLCVAR 2157
            LFNVFQDG CHPKMALVIDDRLKVWDEKDQ RVHVVPAFAPYYAPQAE NN +PVLCVAR
Sbjct: 328  LFNVFQDGTCHPKMALVIDDRLKVWDEKDQSRVHVVPAFAPYYAPQAEANNAIPVLCVAR 387

Query: 2156 NVACNVRGCFFKDFDDGLLQRISEVAYEDDIRNAPSSPDVSNYLISEDDPSASNGIKDSN 1977
            N+ACNVRG FFK+FD+GLLQRI E++YEDD++  PS PDVSNYL+SEDD + +NGIKD  
Sbjct: 388  NIACNVRGGFFKEFDEGLLQRIPEISYEDDVKEIPSPPDVSNYLVSEDDAATANGIKDPL 447

Query: 1976 GFDGMADSEVERRLKETSTSSAA-SLPIANIDPRLTQALQYAVSSSSFTVXXXXXXXXXX 1800
             FDGMAD+EVERRLKE   +SA  S  +AN+DPRL    QY + SSS T           
Sbjct: 448  SFDGMADAEVERRLKEAIAASATISSAVANLDPRLA-PFQYTMPSSSSTTTLPTSQAAVM 506

Query: 1799 PFTSQPFSQVGMFKHPIAQLSQAETTVQSSPAREEGEVPESELDPDTRRRLLILQHGQDM 1620
            P  +  F        P+  +   E  +QSSPAREEGEVPESELDPDTRRRLLILQHG D 
Sbjct: 507  PLANMQFPPATSLVKPLGHVGPPEQCLQSSPAREEGEVPESELDPDTRRRLLILQHGMDT 566

Query: 1619 REPPPSEPQFPARPPMQASLPRAQTRG-WFPVEEETTQGQLNRVAPPNDFVLNAESNTID 1443
            RE  PSE  FPAR  MQ S+PR  +RG WFPVEEE +  QLNR A P +F LN+E+  I+
Sbjct: 567  RENAPSEAPFPARTQMQVSVPRVPSRGSWFPVEEEMSPRQLNR-AVPKEFPLNSEAMQIE 625

Query: 1442 KIRAPHQPFLQKVEPSVPPGRVLLESQRLPKEAFSREDQLRLNQAVPDFPSFSGQDSPVA 1263
            K R PH  F  K+E S+   R   E+QR+PKEA  R+D+LRLN  + D+ SFSG++ P++
Sbjct: 626  KHRPPHPSFFPKIENSITSDRP-HENQRMPKEALRRDDRLRLNHTLSDYQSFSGEEIPLS 684

Query: 1262 QPPSASKDLDLEAGQIDPYTETCTGALQEIAFKCGTKVEFNQALVSSTELQFIVEVLFAG 1083
            +  S+S+D+D E+G+    TET +G LQ+IA KCGTKVEF  ALV+STELQF +E  FAG
Sbjct: 685  RSSSSSRDVDFESGRDVSSTETPSGVLQDIAMKCGTKVEFRPALVASTELQFSIEAWFAG 744

Query: 1082 EKIGQGIGRTXXXXXXXXXXESLVYLADKYLSQRRPDSYVA-GDGGRFTANQKENGFVSD 906
            EKIG+GIGRT           S+ +LA+ Y+ + + DS    GDG RF +N  EN F+ +
Sbjct: 745  EKIGEGIGRTRREAQRQAAEGSIKHLANVYVLRVKSDSGSGHGDGSRF-SNANENCFMGE 803

Query: 905  PNTSGYQSLPKEEGAPFSSARNLDPRIEPSKKPLGSSLAALKELCTMEGLSVAFQTQPQF 726
             N+ G Q L K+E      ++ +DPR+E SKK +G S++ALKELC  EGL V FQ QP  
Sbjct: 804  INSFGGQPLAKDESLSSEPSKLVDPRLEGSKKLMG-SVSALKELCMTEGLGVVFQQQPPS 862

Query: 725  SAHPGQKNEVYAQVEINGQVLGKGIGLTWDEAKSEAAEKALGALKSMTVQFPYRHQGSPR 546
            SA+  QK+EVYAQVEI+GQVLGKGIG TWDEAK +AAEKALG+L+SM  QFP +HQGSPR
Sbjct: 863  SANSVQKDEVYAQVEIDGQVLGKGIGSTWDEAKMQAAEKALGSLRSMFGQFPQKHQGSPR 922

Query: 545  SMHGVSSKRIKHDFSRVPQRM---GRYPRNGSPVP 450
            S+ G+ +KR+K +F RV QRM   GRYP+N  PVP
Sbjct: 923  SLQGMPNKRLKPEFPRVLQRMPPSGRYPKNAPPVP 957


>ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Citrus sinensis]
          Length = 957

 Score =  805 bits (2078), Expect = 0.0
 Identities = 425/695 (61%), Positives = 515/695 (74%), Gaps = 6/695 (0%)
 Frame = -1

Query: 2516 DLRTYLTAKGRKRFEVFVCTMAERDYALEMWRLLDPGSNLINSRDLLNRIVCVKSGCRKS 2337
            DLR+YLTA+GRKRFEV+VCTMAERDYALEMWRLLDP SNLIN+++LL+RIVCVKSG RKS
Sbjct: 268  DLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESNLINTKELLDRIVCVKSGSRKS 327

Query: 2336 LFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEVNNTVPVLCVAR 2157
            LFNVFQDG CHPKMALVIDDRLKVWD+KDQPRVHVVPAFAPYYAPQAE NN +PVLCVAR
Sbjct: 328  LFNVFQDGTCHPKMALVIDDRLKVWDDKDQPRVHVVPAFAPYYAPQAEANNAIPVLCVAR 387

Query: 2156 NVACNVRGCFFKDFDDGLLQRISEVAYEDDIRNAPSSPDVSNYLISEDDPSASNGIKDSN 1977
            N+ACNVRG FFK+FD+GLLQRI E++YEDD+++ PS PDVSNYL+SEDD + +NGIKD  
Sbjct: 388  NIACNVRGGFFKEFDEGLLQRIPEISYEDDVKDIPSPPDVSNYLVSEDDAATANGIKDPL 447

Query: 1976 GFDGMADSEVERRLKETSTSSAA-SLPIANIDPRLTQALQYAVSSSSFTVXXXXXXXXXX 1800
             FDGMAD+EVERRLKE   +SA  S  +AN+DPRL    QY + SSS T           
Sbjct: 448  SFDGMADAEVERRLKEAIAASATISSAVANLDPRLA-PFQYTMPSSSSTTTLPTSQAAVM 506

Query: 1799 PFTSQPFSQVGMFKHPIAQLSQAETTVQSSPAREEGEVPESELDPDTRRRLLILQHGQDM 1620
            P  +  F        P+  +   E ++QSSPAREEGEVPESELDPDTRRRLLILQHG D 
Sbjct: 507  PLANMQFPPATSLVKPLGHVGPPEQSLQSSPAREEGEVPESELDPDTRRRLLILQHGMDT 566

Query: 1619 REPPPSEPQFPARPPMQASLPRAQTRG-WFPVEEETTQGQLNRVAPPNDFVLNAESNTID 1443
            RE  PSE  FPAR  MQ S+PR  +RG WFPVEEE +  QLNR A P +F LN+E+  I+
Sbjct: 567  RENAPSEAPFPARTQMQVSVPRVPSRGSWFPVEEEMSPRQLNR-AVPKEFPLNSEAMQIE 625

Query: 1442 KIRAPHQPFLQKVEPSVPPGRVLLESQRLPKEAFSREDQLRLNQAVPDFPSFSGQDSPVA 1263
            K R PH  F  K+E      R   E+QR+PKEA  R+D+LRLN  + D+ SFSG++ P++
Sbjct: 626  KHRPPHPSFFPKIENPSTSDRP-HENQRMPKEALRRDDRLRLNHTLSDYQSFSGEEIPLS 684

Query: 1262 QPPSASKDLDLEAGQIDPYTETCTGALQEIAFKCGTKVEFNQALVSSTELQFIVEVLFAG 1083
            +  S+S+D+D E+G+    TET +G LQ+IA KCGTKVEF  ALV+STELQF +E  FAG
Sbjct: 685  RSSSSSRDVDFESGRDVSSTETPSGVLQDIAMKCGTKVEFRPALVASTELQFSIEAWFAG 744

Query: 1082 EKIGQGIGRTXXXXXXXXXXESLVYLADKYLSQRRPDSYVA-GDGGRFTANQKENGFVSD 906
            EKIG+GIGRT           S+ +LA+ Y+ + + DS    GDG RF +N  EN F+ +
Sbjct: 745  EKIGEGIGRTRREAQRQAAEGSIKHLANVYMLRVKSDSGSGHGDGSRF-SNANENCFMGE 803

Query: 905  PNTSGYQSLPKEEGAPFSSARNLDPRIEPSKKPLGSSLAALKELCTMEGLSVAFQTQPQF 726
             N+ G Q L K+E      ++ +DPR+E SKK +G S++ALKELC  EGL V FQ QP  
Sbjct: 804  INSFGGQPLAKDESLSSEPSKLVDPRLEGSKKLMG-SVSALKELCMTEGLGVVFQQQPPS 862

Query: 725  SAHPGQKNEVYAQVEINGQVLGKGIGLTWDEAKSEAAEKALGALKSMTVQFPYRHQGSPR 546
            SA+  QK+EVYAQVEI+GQVLGKGIG TWDEAK +AAEKALG+L+SM  QFP +HQGSPR
Sbjct: 863  SANSVQKDEVYAQVEIDGQVLGKGIGSTWDEAKMQAAEKALGSLRSMFGQFPQKHQGSPR 922

Query: 545  SMHGVSSKRIKHDFSRVPQRM---GRYPRNGSPVP 450
            S+ G+ +KR+K +F RV QRM   GRYP+N  PVP
Sbjct: 923  SLQGMPNKRLKPEFPRVLQRMPPSGRYPKNAPPVP 957


>ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform 1 [Theobroma cacao]
            gi|508781046|gb|EOY28302.1| C-terminal domain
            phosphatase-like 1 isoform 1 [Theobroma cacao]
          Length = 978

 Score =  799 bits (2064), Expect = 0.0
 Identities = 430/700 (61%), Positives = 515/700 (73%), Gaps = 11/700 (1%)
 Frame = -1

Query: 2516 DLRTYLTAKGRKRFEVFVCTMAERDYALEMWRLLDPGSNLINSRDLLNRIVCVKSGCRKS 2337
            DLR+YLTA+GRKRFEV+VCTMAERDYALEMWRLLDP SNLINS++LL+RIVCVKSG RKS
Sbjct: 284  DLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKS 343

Query: 2336 LFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEVNNTVPVLCVAR 2157
            LFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAE NNT+PVLCVAR
Sbjct: 344  LFNVFQDGICHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVAR 403

Query: 2156 NVACNVRGCFFKDFDDGLLQRISEVAYEDDIRNAPSSPDVSNYLISEDDPSASNGIKDSN 1977
            NVACNVRG FF++FD+GLLQRI E++YEDDI++ PS PDV NYL+SEDD SA NG KD  
Sbjct: 404  NVACNVRGGFFREFDEGLLQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPL 463

Query: 1976 GFDGMADSEVERRLKET-STSSAASLPIANIDPRLTQALQYAVSSSSFTVXXXXXXXXXX 1800
             FDGMAD+EVERRLKE  S +S  S    N+DPRLT +LQY + SSS ++          
Sbjct: 464  LFDGMADAEVERRLKEAISATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIV 523

Query: 1799 PFTSQPFSQVGMFKHPIAQLSQAETTVQSSPAREEGEVPESELDPDTRRRLLILQHGQDM 1620
             F++  F        P+A ++  E ++QSSPAREEGEVPESELDPDTRRRLLILQHGQD 
Sbjct: 524  SFSNMQFPLAAPVVKPVAPVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDT 583

Query: 1619 REPPPSEPQF-PARPPMQASLPRAQTRG-WFPVEEETTQGQLNRVAPPNDFVLNAESNTI 1446
            R+  P EP F P RP MQ S+PR Q+RG WF  EEE +  QLNR A P +F L++E   I
Sbjct: 584  RDHTPPEPAFPPVRPTMQVSVPRGQSRGSWFAAEEEMSPRQLNRAA-PKEFPLDSERMHI 642

Query: 1445 DKIRAPHQPFLQKVEPSVPPGRVLLESQRLPKEAFSREDQLRLNQAVPDFPSFSGQDSPV 1266
            +K R  H PF  KVE S+P  R+L E+QRL KEA  R+D+L LN     + SFSG++ P+
Sbjct: 643  EKHR--HPPFFPKVESSIPSDRLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPL 700

Query: 1265 AQPPSASKDLDLEAGQIDPYTETCTGALQEIAFKCGTKVEFNQALVSSTELQFIVEVLFA 1086
            +Q  S+ +DLD E+G+     ET  G LQ+IA KCG KVEF  ALV+S +LQF +E  FA
Sbjct: 701  SQSSSSHRDLDFESGRTVTSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFA 760

Query: 1085 GEKIGQGIGRTXXXXXXXXXXESLVYLADKYLSQRRPDSYVA-GDGGRFTANQKENGFVS 909
            GEK+G+G+GRT          ES+  LA+ YLS+ +PDS  A GD  R   N  +NGF S
Sbjct: 761  GEKVGEGVGRTRREAQRQAAEESIKNLANTYLSRIKPDSGSAEGDLSRL-HNINDNGFPS 819

Query: 908  DPNTSGYQSLPKEEGAPFSSA----RNLDPRIEPSKKPLGSSLAALKELCTMEGLSVAFQ 741
            + N+ G Q L KEE   FS+A    R  DPR+E SKK +G S+ ALKELC MEGL V FQ
Sbjct: 820  NVNSFGNQLLAKEESLSFSTASEQSRLADPRLEGSKKSMG-SVTALKELCMMEGLGVVFQ 878

Query: 740  TQPQFSAHPGQKNEVYAQVEINGQVLGKGIGLTWDEAKSEAAEKALGALKSMTVQFPYRH 561
             QP  S++  QK+EVYAQVEI+GQVLGKG GLTW+EAK +AAEKALG+L+SM  Q+  + 
Sbjct: 879  PQPPSSSNALQKDEVYAQVEIDGQVLGKGTGLTWEEAKMQAAEKALGSLRSMLGQYSQKR 938

Query: 560  QGSPRSMHGVSSKRIKHDFSRVPQRM---GRYPRNGSPVP 450
            QGSPRS+ G+ +KR+K +F RV QRM   GRYP+N  PVP
Sbjct: 939  QGSPRSLQGMQNKRLKPEFPRVLQRMPSSGRYPKNAPPVP 978


>ref|XP_007025681.1| C-terminal domain phosphatase-like 1 isoform 2 [Theobroma cacao]
            gi|508781047|gb|EOY28303.1| C-terminal domain
            phosphatase-like 1 isoform 2 [Theobroma cacao]
          Length = 984

 Score =  763 bits (1971), Expect = 0.0
 Identities = 411/665 (61%), Positives = 490/665 (73%), Gaps = 8/665 (1%)
 Frame = -1

Query: 2516 DLRTYLTAKGRKRFEVFVCTMAERDYALEMWRLLDPGSNLINSRDLLNRIVCVKSGCRKS 2337
            DLR+YLTA+GRKRFEV+VCTMAERDYALEMWRLLDP SNLINS++LL+RIVCVKSG RKS
Sbjct: 284  DLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKS 343

Query: 2336 LFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEVNNTVPVLCVAR 2157
            LFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAE NNT+PVLCVAR
Sbjct: 344  LFNVFQDGICHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVAR 403

Query: 2156 NVACNVRGCFFKDFDDGLLQRISEVAYEDDIRNAPSSPDVSNYLISEDDPSASNGIKDSN 1977
            NVACNVRG FF++FD+GLLQRI E++YEDDI++ PS PDV NYL+SEDD SA NG KD  
Sbjct: 404  NVACNVRGGFFREFDEGLLQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPL 463

Query: 1976 GFDGMADSEVERRLKET-STSSAASLPIANIDPRLTQALQYAVSSSSFTVXXXXXXXXXX 1800
             FDGMAD+EVERRLKE  S +S  S    N+DPRLT +LQY + SSS ++          
Sbjct: 464  LFDGMADAEVERRLKEAISATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIV 523

Query: 1799 PFTSQPFSQVGMFKHPIAQLSQAETTVQSSPAREEGEVPESELDPDTRRRLLILQHGQDM 1620
             F++  F        P+A ++  E ++QSSPAREEGEVPESELDPDTRRRLLILQHGQD 
Sbjct: 524  SFSNMQFPLAAPVVKPVAPVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDT 583

Query: 1619 REPPPSEPQF-PARPPMQASLPRAQTRG-WFPVEEETTQGQLNRVAPPNDFVLNAESNTI 1446
            R+  P EP F P RP MQ S+PR Q+RG WF  EEE +  QLNR A P +F L++E   I
Sbjct: 584  RDHTPPEPAFPPVRPTMQVSVPRGQSRGSWFAAEEEMSPRQLNRAA-PKEFPLDSERMHI 642

Query: 1445 DKIRAPHQPFLQKVEPSVPPGRVLLESQRLPKEAFSREDQLRLNQAVPDFPSFSGQDSPV 1266
            +K R  H PF  KVE S+P  R+L E+QRL KEA  R+D+L LN     + SFSG++ P+
Sbjct: 643  EKHR--HPPFFPKVESSIPSDRLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPL 700

Query: 1265 AQPPSASKDLDLEAGQIDPYTETCTGALQEIAFKCGTKVEFNQALVSSTELQFIVEVLFA 1086
            +Q  S+ +DLD E+G+     ET  G LQ+IA KCG KVEF  ALV+S +LQF +E  FA
Sbjct: 701  SQSSSSHRDLDFESGRTVTSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFA 760

Query: 1085 GEKIGQGIGRTXXXXXXXXXXESLVYLADKYLSQRRPDSYVA-GDGGRFTANQKENGFVS 909
            GEK+G+G+GRT          ES+  LA+ YLS+ +PDS  A GD  R   N  +NGF S
Sbjct: 761  GEKVGEGVGRTRREAQRQAAEESIKNLANTYLSRIKPDSGSAEGDLSRL-HNINDNGFPS 819

Query: 908  DPNTSGYQSLPKEEGAPFSSA----RNLDPRIEPSKKPLGSSLAALKELCTMEGLSVAFQ 741
            + N+ G Q L KEE   FS+A    R  DPR+E SKK +G S+ ALKELC MEGL V FQ
Sbjct: 820  NVNSFGNQLLAKEESLSFSTASEQSRLADPRLEGSKKSMG-SVTALKELCMMEGLGVVFQ 878

Query: 740  TQPQFSAHPGQKNEVYAQVEINGQVLGKGIGLTWDEAKSEAAEKALGALKSMTVQFPYRH 561
             QP  S++  QK+EVYAQVEI+GQVLGKG GLTW+EAK +AAEKALG+L+SM  Q+  + 
Sbjct: 879  PQPPSSSNALQKDEVYAQVEIDGQVLGKGTGLTWEEAKMQAAEKALGSLRSMLGQYSQKR 938

Query: 560  QGSPR 546
            QGSPR
Sbjct: 939  QGSPR 943


>ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Fragaria vesca subsp. vesca]
          Length = 955

 Score =  754 bits (1947), Expect = 0.0
 Identities = 412/699 (58%), Positives = 509/699 (72%), Gaps = 10/699 (1%)
 Frame = -1

Query: 2516 DLRTYLTAKGRKRFEVFVCTMAERDYALEMWRLLDPGSNLINSRDLLNRIVCVKSGCRKS 2337
            DLR+YLTA+GRKRFEV+VCTMAERDYALEMWRLLDP SNLIN+  LL+RIVCVKSG +KS
Sbjct: 266  DLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESNLINANKLLDRIVCVKSGLKKS 325

Query: 2336 LFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEVNNTVPVLCVAR 2157
            LFNVFQ+  CHPKMALVIDDRLKVWD++DQPRVHVVPAFAPYYAPQAE NN VPVLCVAR
Sbjct: 326  LFNVFQESLCHPKMALVIDDRLKVWDDRDQPRVHVVPAFAPYYAPQAEANNAVPVLCVAR 385

Query: 2156 NVACNVRGCFFKDFDDGLLQRISEVAYEDDIRNAPSSPDVSNYLISEDDPSASNGIKDSN 1977
            NVAC+VRG FF++FDD LLQ+I E+ YED+I++  SSPDVSN+L+SEDD SASNG +D  
Sbjct: 386  NVACSVRGGFFREFDDSLLQKIPEIFYEDNIKDF-SSPDVSNFLVSEDDASASNGNRDQL 444

Query: 1976 GFDGMADSEVERRLKE-TSTSSAASLPIANIDPRLTQALQYAVSSSSFTVXXXXXXXXXX 1800
             FDGMAD+EVERRLKE TS +   S  ++N DPRL  +LQY V  SS TV          
Sbjct: 445  PFDGMADAEVERRLKEATSAAPTVSSAVSNNDPRLA-SLQYTVPLSS-TVSLPTNQPSMM 502

Query: 1799 PFTSQPFSQVGMFKHPIAQLSQAETTVQSSPAREEGEVPESELDPDTRRRLLILQHGQDM 1620
            PF +  F Q      P+  +  A+  + SSPAREEGEVPESELDPDTRRRLLILQHGQD 
Sbjct: 503  PFHNVQFPQSASLVKPLGHVGPADLGLHSSPAREEGEVPESELDPDTRRRLLILQHGQDT 562

Query: 1619 REPPPSEPQFPARPPMQASLPRAQTR-GWFPVEEETTQGQLNRVAPPNDFVLNAESNTID 1443
            RE  PSEP FP RP +Q S+PR Q+R GWFPVEEE +  +L+R+  P +  LN+E   I+
Sbjct: 563  RESVPSEPSFPVRPQVQVSVPRVQSRGGWFPVEEEMSPRKLSRMV-PKEPPLNSEPMQIE 621

Query: 1442 KIRAPHQPFLQKVEPSVPPGRVLLESQRLPKEAFSREDQLRLNQAVPDFPSFSGQDSPVA 1263
            K R+ H  F  KVE S+P  R+L E+QRLPKEAF R+++LR NQA+  + SFSG++ P+ 
Sbjct: 622  KHRSHHSAFFPKVENSMPSDRILQENQRLPKEAFHRDNRLRFNQAMSGYHSFSGEEPPLN 681

Query: 1262 QPPSASKDLDLEAGQIDPYTETCTGALQEIAFKCGTKVEFNQALVSSTELQFIVEVLFAG 1083
            +  S+++D D E+G+     ET  G LQEIA KCGTKVEF  ALV STELQF VE  FAG
Sbjct: 682  RSSSSNRDFDYESGRAISNAETPAGVLQEIAMKCGTKVEFRPALVPSTELQFYVEAWFAG 741

Query: 1082 EKIGQGIGRTXXXXXXXXXXESLVYLADKYLSQRRPDSY-VAGDGGRFTANQKENGFVSD 906
            EKIG+G GRT           SL  LA+ Y+S+ +PD+  + GD  +F +N   NGF+ +
Sbjct: 742  EKIGEGTGRTRREAHFQAAEGSLKNLANIYISRGKPDALPIHGDASKF-SNVTNNGFMGN 800

Query: 905  PNTSGYQSLPKEEGAPFSS----ARNLDPRIEPSKKPLGSSLAALKELCTMEGLSVAFQT 738
             N+ G Q LPKE+    S+    +R LDPR++ S+K + SS++ALKELCTMEGLSV +Q 
Sbjct: 801  MNSFGTQPLPKEDSLSSSTSSEPSRPLDPRLDNSRKSV-SSVSALKELCTMEGLSVLYQP 859

Query: 737  QPQFSAHPGQKNEVYAQVEINGQVLGKGIGLTWDEAKSEAAEKALGALKSMTVQFPYRHQ 558
            +P    +  +K+EV+ Q EI+G+VLGKGIGLTWDEAK +AAEKALG L+S    +  + Q
Sbjct: 860  RPP-PPNSTEKDEVHVQAEIDGEVLGKGIGLTWDEAKMQAAEKALGNLRS--TLYGQKRQ 916

Query: 557  GSPRSMHGVSSKRIKHDFSRVPQRM---GRYPRNGSPVP 450
            GSPR + G+ SKR+K +F +V QRM    RY +N  PVP
Sbjct: 917  GSPRPLQGMPSKRLKQEFPQVLQRMPSSTRYSKNAPPVP 955


>ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Populus trichocarpa]
            gi|550340277|gb|EEE85528.2| hypothetical protein
            POPTR_0004s04010g [Populus trichocarpa]
          Length = 996

 Score =  750 bits (1936), Expect = 0.0
 Identities = 408/722 (56%), Positives = 509/722 (70%), Gaps = 33/722 (4%)
 Frame = -1

Query: 2516 DLRTYLTAKGRKRFEVFVCTMAERDYALEMWRLLDPGSNLINSRDLLNRIVCVKSGCRKS 2337
            DLR+YLTA+GRKRFEV+VCTMAERDYALEMWRLLDP SNLINS++LL+RIVCVKSG RKS
Sbjct: 280  DLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLRKS 339

Query: 2336 LFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEVNNTVPVLCVAR 2157
            LFNVFQDG CHPKMALVIDDRLKVWDE+DQ RVHVVPAFAPYYAPQAEVNN VPVLCVAR
Sbjct: 340  LFNVFQDGICHPKMALVIDDRLKVWDERDQSRVHVVPAFAPYYAPQAEVNNAVPVLCVAR 399

Query: 2156 NVACNVRGCFFKDFDDGLLQRISEVAYEDDIRNAPSSPDVSNYLISEDDPSASNGIKDSN 1977
            NVACNVRG FFK+FD+GLLQ+I EVAYEDD  N PS PDVSNYL+SEDD SA NG +D  
Sbjct: 400  NVACNVRGGFFKEFDEGLLQKIPEVAYEDDTDNIPSPPDVSNYLVSEDDASAVNGNRDQL 459

Query: 1976 GFDGMADSEVERRLKETSTSSAASL-----PIANIDPRLTQALQYAVSSSSFTV------ 1830
             FDGMAD+EVER+LKE  ++S+A L      ++++DPRL Q+LQY ++SSS ++      
Sbjct: 460  SFDGMADAEVERQLKEAVSASSAILSTIPSTVSSLDPRLLQSLQYTIASSSSSMPTSQPS 519

Query: 1829 -------------XXXXXXXXXXPFTSQPFSQVGMFKHPIAQLSQAETTVQSSPAREEGE 1689
                                   PF +  F QV      + Q+   E ++QSSPAREEGE
Sbjct: 520  MLASQQPMPALQPPKPPSQLSMTPFPNTQFPQVAPSVKQLGQVVPPEPSLQSSPAREEGE 579

Query: 1688 VPESELDPDTRRRLLILQHGQDMREPPPSEPQFPARPPMQASLPRAQTRG-WFPVEEETT 1512
            VPESELDPDTRRRLLILQHG D R+  PSE  FPARP  Q S PR Q+ G W PVEEE +
Sbjct: 580  VPESELDPDTRRRLLILQHGHDSRDNAPSESPFPARPSTQVSAPRVQSVGSWVPVEEEMS 639

Query: 1511 QGQLNRVAPPNDFVLNAESNTIDKIRAPHQPFLQKVEPSVPPGRVLLESQRLPKEAFSRE 1332
              QLNR   P +F L+++   I+K R  H  F  KVE ++P  R++ E+QR PKEA  R+
Sbjct: 640  PRQLNRT--PREFPLDSDPMNIEKHRTHHPSFFHKVESNIPSDRMIHENQRQPKEATYRD 697

Query: 1331 DQLRLNQAVPDFPSFSGQDSPVAQPPSASKDLDLEAGQIDPYTETCTGALQEIAFKCGTK 1152
            D+++LN +  ++PSF G++SP+++  S+++DLDLE+ +    TET    LQEIA KCGTK
Sbjct: 698  DRMKLNHSTSNYPSFQGEESPLSR-SSSNRDLDLESERAFSSTETPVEVLQEIAMKCGTK 756

Query: 1151 VEFNQALVSSTELQFIVEVLFAGEKIGQGIGRTXXXXXXXXXXESLVYLADKYLSQRRPD 972
            VEF  AL+++++LQF +E  F GEK+G+G G+T           S+  LA  Y+S+ +PD
Sbjct: 757  VEFRPALIATSDLQFSIETWFVGEKVGEGTGKTRREAQRQAAEGSIKKLAGIYMSRVKPD 816

Query: 971  S-YVAGDGGRFTANQKENGFVSDPNTSGYQSLPKEEGAPFSS----ARNLDPRIEPSKKP 807
            S  + GD  R+  +  +NGF+ D N+ G Q L K+E   +S+    +R LD R+E SKK 
Sbjct: 817  SGPMLGDSSRY-PSANDNGFLGDMNSFGNQPLLKDENITYSATSEPSRLLDQRLEGSKKS 875

Query: 806  LGSSLAALKELCTMEGLSVAFQTQPQFSAHPGQKNEVYAQVEINGQVLGKGIGLTWDEAK 627
            +G S+ ALKE C  EGL V F  Q   S +     EV+AQVEI+GQVLGKGIGLTWDEAK
Sbjct: 876  MG-SVTALKEFCMTEGLGVNFLAQTPLSTNSIPGEEVHAQVEIDGQVLGKGIGLTWDEAK 934

Query: 626  SEAAEKALGALKSMTVQFPYRHQGSPRSMHGVSSKRIKHDFSRVPQRM---GRYPRNGSP 456
             +AAEKALG+L++M  Q+  + QGSPR M G+ +KR+K +F RV QRM    RY +N SP
Sbjct: 935  MQAAEKALGSLRTMFGQYTPKRQGSPRLMQGMPNKRLKQEFPRVLQRMPSSARYHKNASP 994

Query: 455  VP 450
            VP
Sbjct: 995  VP 996


>ref|XP_007214548.1| hypothetical protein PRUPE_ppa000988mg [Prunus persica]
            gi|462410413|gb|EMJ15747.1| hypothetical protein
            PRUPE_ppa000988mg [Prunus persica]
          Length = 940

 Score =  746 bits (1925), Expect = 0.0
 Identities = 411/699 (58%), Positives = 492/699 (70%), Gaps = 10/699 (1%)
 Frame = -1

Query: 2516 DLRTYLTAKGRKRFEVFVCTMAERDYALEMWRLLDPGSNLINSRDLLNRIVCVKSGCRKS 2337
            DLR+YLTA+GRKRFEV+VCTMAERDYALEMWRLLDP SNLINS  LL+RIVCVKSG RKS
Sbjct: 268  DLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNLINSNKLLDRIVCVKSGSRKS 327

Query: 2336 LFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEVNNTVPVLCVAR 2157
            LFNVFQ+  CHPKMALVIDDRLKVWD++DQPRVHVVPAFAPYYAPQAE NN VPVLCVAR
Sbjct: 328  LFNVFQESLCHPKMALVIDDRLKVWDDRDQPRVHVVPAFAPYYAPQAEANNAVPVLCVAR 387

Query: 2156 NVACNVRGCFFKDFDDGLLQRISEVAYEDDIRNAPSSPDVSNYLISEDDPSASNGIKDSN 1977
            NVACNVRG FF++FDD LLQ+I EV YEDDI++ P SPDVSNYL+SEDD SA NG +D  
Sbjct: 388  NVACNVRGGFFREFDDSLLQKIPEVFYEDDIKDVP-SPDVSNYLVSEDDSSALNGNRDPL 446

Query: 1976 GFDGMADSEVERRLKE-TSTSSAASLPIANIDPRLTQALQYAVSSSSFTVXXXXXXXXXX 1800
             FDG+ D EVERR+KE T  +S  S    +IDPRL   LQY V  SS T+          
Sbjct: 447  PFDGITDVEVERRMKEATPAASMVSSVFTSIDPRLA-PLQYTVPPSS-TLSLPTTQPSVM 504

Query: 1799 PFTSQPFSQVGMFKHPIAQLSQAETTVQSSPAREEGEVPESELDPDTRRRLLILQHGQDM 1620
             F S  F Q      P+  +  AE ++QSSPAREEGEVPESELDPDTRRRLLILQHGQD 
Sbjct: 505  SFPSIQFPQAASLVKPLGHVGSAEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDT 564

Query: 1619 REPPPSEPQFPARPPMQASLPRAQTR-GWFPVEEETTQGQLNRVAPPNDFVLNAESNTID 1443
            R+ PPSEP FP RPPMQAS+PRAQ+R GWFPVEEE +  QL+R+  P D  L+ E+  I+
Sbjct: 565  RDQPPSEPPFPVRPPMQASVPRAQSRPGWFPVEEEMSPRQLSRMV-PKDLPLDPETVQIE 623

Query: 1442 KIRAPHQPFLQKVEPSVPPGRVLLESQRLPKEAFSREDQLRLNQAVPDFPSFSGQDSPVA 1263
            K R  H  F  KVE S+P  R+L E+QRLPKEAF R+D+LR N A+  + S SG++ P++
Sbjct: 624  KHRPHHSSFFPKVENSIPSDRILQENQRLPKEAFHRDDRLRFNHALSGYHSLSGEEIPLS 683

Query: 1262 QPPSASKDLDLEAGQIDPYTETCTGALQEIAFKCGTKVEFNQALVSSTELQFIVEVLFAG 1083
            +  S+++D+D E+G+     ET  G LQEIA KCG K                    FAG
Sbjct: 684  RSSSSNRDVDFESGRAISNAETPAGVLQEIAMKCGAK------------------AWFAG 725

Query: 1082 EKIGQGIGRTXXXXXXXXXXESLVYLADKYLSQRRPDSY-VAGDGGRFTANQKENGFVSD 906
            EKIG+G G+T           SL  LA+ YLS+ +PDS  V GD  +F  N   NGF  +
Sbjct: 726  EKIGEGSGKTRREAHYQAAEGSLKNLANIYLSRVKPDSVSVHGDMNKF-PNVNSNGFAGN 784

Query: 905  PNTSGYQSLPKEEGAPFSS----ARNLDPRIEPSKKPLGSSLAALKELCTMEGLSVAFQT 738
             N+ G Q  PKEE    S+    +R LDPR+E SKK + SS++ LKELC MEGL V FQ 
Sbjct: 785  LNSFGIQPFPKEESLSSSTSSEPSRPLDPRLEGSKKSM-SSVSTLKELCMMEGLGVVFQP 843

Query: 737  QPQFSAHPGQKNEVYAQVEINGQVLGKGIGLTWDEAKSEAAEKALGALKSMTVQFPYRHQ 558
            +P  S +  +K+EV+ QVEI+G+VLGKGIGLTWDEAK +AAEKALG+L S    +  + Q
Sbjct: 844  RPPPSTNSVEKDEVHVQVEIDGEVLGKGIGLTWDEAKMQAAEKALGSLTS--TLYAQKRQ 901

Query: 557  GSPRSMHGVSSKRIKHDFSRVPQRM---GRYPRNGSPVP 450
            GSPRS+ G+SSKR+K +F +V QRM    RYP+N  PVP
Sbjct: 902  GSPRSLQGMSSKRMKQEFPQVLQRMPSSARYPKNAPPVP 940


>ref|XP_002519032.1| double-stranded RNA binding protein, putative [Ricinus communis]
            gi|223541695|gb|EEF43243.1| double-stranded RNA binding
            protein, putative [Ricinus communis]
          Length = 978

 Score =  745 bits (1923), Expect = 0.0
 Identities = 400/701 (57%), Positives = 503/701 (71%), Gaps = 12/701 (1%)
 Frame = -1

Query: 2516 DLRTYLTAKGRKRFEVFVCTMAERDYALEMWRLLDPGSNLINSRDLLNRIVCVKSGCRKS 2337
            +LR+YLTA+GRKRFEV+VCTMAERDYALEMWRLLDP SNLINS++LL+RIVCVKSG RKS
Sbjct: 282  ELRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLRKS 341

Query: 2336 LFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEVNNTVPVLCVAR 2157
            LFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAE NN VPVLCVAR
Sbjct: 342  LFNVFQDGICHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNAVPVLCVAR 401

Query: 2156 NVACNVRGCFFKDFDDGLLQRISEVAYEDDIRNAPSSPDVSNYLISEDDPSASNGIKDSN 1977
            NVACNVRG FFK+FD+GLLQRI E+++EDD+ + PS PDVSNYL+ EDD   SNG +D  
Sbjct: 402  NVACNVRGGFFKEFDEGLLQRIPEISFEDDMNDIPSPPDVSNYLVPEDDAFTSNGNRDPL 461

Query: 1976 GFDGMADSEVERRLKET-STSSAASLPIANIDPRLTQALQYAVSSSSFTVXXXXXXXXXX 1800
             FDGMAD+EVE+RLKE  S SSA    +AN+D RL   LQY ++SSS ++          
Sbjct: 462  SFDGMADAEVEKRLKEAISISSAFPSTVANLDARLVPPLQYTMASSS-SIPVPTSQPAVV 520

Query: 1799 PFTSQPFSQVGMFKHPIAQLSQAETTVQSSPAREEGEVPESELDPDTRRRLLILQHGQDM 1620
             F S    Q      P+ Q+  +E ++QSSPAREEGEVPESELDPDTRRRLLILQHGQD+
Sbjct: 521  TFPSMQLPQAAPLVKPLGQVVPSEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDL 580

Query: 1619 REPPPSEPQFPARP--PMQASLPRAQTRG-WFPVEEETTQGQLNRVAPPNDFVLNAESNT 1449
            R+P PSE  FP RP   MQ S+PR Q+RG W PVEEE +  QLNR A   +F ++ E   
Sbjct: 581  RDPAPSESPFPVRPSNSMQVSVPRVQSRGNWVPVEEEMSPRQLNR-AVTREFPMDTEPMH 639

Query: 1448 IDKIRAPHQPFLQKVEPSVPPGRVLLESQRLPKEAFSREDQLRLNQAVPDFPSFSGQDSP 1269
            IDK R  H  F  KVE S+P  R+  E+QRLPK A  ++D+LRLNQ + ++ S SG+++ 
Sbjct: 640  IDKHRPHHPSFFPKVESSIPSERMPHENQRLPKVAPYKDDRLRLNQTMSNYQSLSGEENS 699

Query: 1268 VAQPPSASKDLDLEAGQIDPYTETCTGALQEIAFKCGTKVEFNQALVSSTELQFIVEVLF 1089
            +++  S+++DLD+E+ +     ET    L EI+ KCG KVEF  +LV+S +LQF VE  F
Sbjct: 700  LSRSSSSNRDLDVESDRAVSSAETPVRVLHEISMKCGAKVEFKHSLVNSRDLQFSVEAWF 759

Query: 1088 AGEKIGQGIGRTXXXXXXXXXXESLVYLADKYLSQRRPDS-YVAGDGGRFTANQKENGFV 912
            AGE++G+G GRT           S+  LA+ Y+S+ +PD+  + GD  ++ ++  +NGF+
Sbjct: 760  AGERVGEGFGRTRREAQSVAAEASIKNLANIYISRAKPDNGALHGDASKY-SSANDNGFL 818

Query: 911  SDPNTSGYQSLPKEEGAPFSSARN----LDPRIEPSKKPLGSSLAALKELCTMEGLSVAF 744
               N+ G Q LPK+E   +S +      LDPR+E SKK + SS+ ALKE C MEGL V F
Sbjct: 819  GHVNSFGSQPLPKDEILSYSDSSEQSGLLDPRLESSKKSM-SSVNALKEFCMMEGLGVNF 877

Query: 743  QTQPQFSAHPGQKNEVYAQVEINGQVLGKGIGLTWDEAKSEAAEKALGALKSMTVQFPYR 564
              Q   S++  Q  EV+AQVEI+GQV+GKGIG T+DEAK +AAEKALG+L++   +FP +
Sbjct: 878  LAQTPLSSNSVQNAEVHAQVEIDGQVMGKGIGSTFDEAKMQAAEKALGSLRTTFGRFPPK 937

Query: 563  HQGSPRSMHGVSSKRIKHDFSRVPQRM---GRYPRNGSPVP 450
             QGSPR + G+ +K +K +F RV QRM    RYP+N  PVP
Sbjct: 938  RQGSPRPVPGMPNKHLKPEFPRVLQRMPSSARYPKNAPPVP 978


>ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Populus trichocarpa]
            gi|550327613|gb|ERP55122.1| hypothetical protein
            POPTR_0011s04910g [Populus trichocarpa]
          Length = 990

 Score =  741 bits (1912), Expect = 0.0
 Identities = 406/716 (56%), Positives = 503/716 (70%), Gaps = 27/716 (3%)
 Frame = -1

Query: 2516 DLRTYLTAKGRKRFEVFVCTMAERDYALEMWRLLDPGSNLINSRDLLNRIVCVKSGCRKS 2337
            DLR+YLTA+GRKRFEV+VCTMAERDYALEMWRLLDP SNLINS +LL+RIVCV SG RKS
Sbjct: 281  DLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESNLINSNELLDRIVCVSSGSRKS 340

Query: 2336 LFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEVNNTVPVLCVAR 2157
            LFNVFQDG CHPKMALVIDDR+ VWDEKDQ RVHVVPAFAPYYAPQAE NN VP+LCVAR
Sbjct: 341  LFNVFQDGICHPKMALVIDDRMNVWDEKDQSRVHVVPAFAPYYAPQAEANNAVPILCVAR 400

Query: 2156 NVACNVRGCFFKDFDDGLLQRISEVAYEDDIRNAPSSPDVSNYLISEDDPSASNGIKDSN 1977
            NVACNVRG FFK+FD+GLLQ+I EVAYEDD  N PS PDVSNYL+SEDD SA+NG +D  
Sbjct: 401  NVACNVRGGFFKEFDEGLLQKIPEVAYEDDTSNIPSPPDVSNYLVSEDDASAANGNRDPP 460

Query: 1976 GFDGMADSEVERRLKETSTSSAASLP------IANIDPRLTQALQYAVSSSSFTV----- 1830
             FD  AD+EVERRLKE + S+++++P      ++++DPRL Q+LQYAV+SSS  +     
Sbjct: 461  SFDSTADAEVERRLKE-AVSASSTIPSTIPSTVSSLDPRLLQSLQYAVASSSSLMPASQP 519

Query: 1829 -------XXXXXXXXXXPFTSQPFSQVGMFKHPIAQLSQAETTVQSSPAREEGEVPESEL 1671
                             PF +  F QV      + Q+   E ++QSSPAREEGEVPESEL
Sbjct: 520  SMLASQQPVPASQTSMMPFPNTQFPQVAPLVKQLGQVVHPEPSLQSSPAREEGEVPESEL 579

Query: 1670 DPDTRRRLLILQHGQDMREPPPSEPQFPARPPMQASLPRAQTRG-WFPVEEETTQGQLNR 1494
            DPDTRRRLLILQHGQD R+  PSE  FPARP    S    Q+RG W PVEEE T  QLNR
Sbjct: 580  DPDTRRRLLILQHGQDSRDNAPSESPFPARPSAPVSAAHVQSRGSWVPVEEEMTPRQLNR 639

Query: 1493 VAPPNDFVLNAESNTIDKIRAPHQPFLQKVEPSVPPGRVLLESQRLPKEAFSREDQLRLN 1314
               P +F L+++   I+K +  H  F  KVE ++P  R++ E+QRLPKEA  R D++RLN
Sbjct: 640  T--PREFPLDSDPMNIEKHQTHHPSFFPKVESNIPSDRMIHENQRLPKEAPYRNDRMRLN 697

Query: 1313 QAVPDFPSFSGQDSPVAQPPSASKDLDLEAGQIDPYTETCTGALQEIAFKCGTKVEFNQA 1134
             + P++ SF  +++P+++  S+++DLDLE+ +    +ET    LQEIA KC TKVEF  A
Sbjct: 698  HSTPNYHSFQVEETPLSR-SSSNRDLDLESERAFTISETPVEVLQEIAMKCETKVEFRPA 756

Query: 1133 LVSSTELQFIVEVLFAGEKIGQGIGRTXXXXXXXXXXESLVYLADKYLSQRRPDS-YVAG 957
            LV+S +LQF +E  FAGEK+G+G G+T           S+  LA  Y+ + +PDS  + G
Sbjct: 757  LVASIDLQFSIEAWFAGEKVGEGTGKTRREAQRQAAEGSIKKLAGIYMLRAKPDSGPMHG 816

Query: 956  DGGRFTANQKENGFVSDPNTSGYQSLPKEEGAPFSSA----RNLDPRIEPSKKPLGSSLA 789
            D  R+  +  +NGF+ + N  G Q LPK+E   +S+A    R LDPR+E SKK  G S+ 
Sbjct: 817  DSSRY-PSANDNGFLGNMNLFGNQPLPKDELVAYSAASEPSRLLDPRLEGSKKSSG-SVT 874

Query: 788  ALKELCTMEGLSVAFQTQPQFSAHPGQKNEVYAQVEINGQVLGKGIGLTWDEAKSEAAEK 609
            ALKE CTMEGL V F  Q   SA+     EV+AQVEI+GQVLGKGIG TWDEAK +AAEK
Sbjct: 875  ALKEFCTMEGLVVNFLAQTPLSANSIPGEEVHAQVEIDGQVLGKGIGSTWDEAKMQAAEK 934

Query: 608  ALGALKSMTVQFPYRHQGSPRSMHGVSSKRIKHDFSRVPQRM---GRYPRNGSPVP 450
            ALG+L++M  Q+  + QGSPR M G+ +KR+K +F RV QRM    RY +N  PVP
Sbjct: 935  ALGSLRTMFGQYTQKRQGSPRPMQGMPNKRLKQEFPRVLQRMPPSARYHKNAPPVP 990


>ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Glycine max]
          Length = 960

 Score =  728 bits (1880), Expect = 0.0
 Identities = 401/702 (57%), Positives = 500/702 (71%), Gaps = 13/702 (1%)
 Frame = -1

Query: 2516 DLRTYLTAKGRKRFEVFVCTMAERDYALEMWRLLDPGSNLINSRDLLNRIVCVKSGCRKS 2337
            DLR+YLTA+GRKRFEV+VCTMAERDYALEMWRLLDP SNLINS++LL RIVCVKSG +KS
Sbjct: 265  DLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNLINSKELLGRIVCVKSGLKKS 324

Query: 2336 LFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEVNNTVPVLCVAR 2157
            LFNVFQDG+C PKMALVIDDRLKVWDE+DQPRVHVVPAFAPYYAPQAE +NT+PVLCVAR
Sbjct: 325  LFNVFQDGSCDPKMALVIDDRLKVWDERDQPRVHVVPAFAPYYAPQAEASNTIPVLCVAR 384

Query: 2156 NVACNVRGCFFKDFDDGLLQRISEVAYEDDIRNAPSSPDVSNYLISEDDPSASNGIKDSN 1977
            NVACNVRG FFKDFDDGLLQ+I ++AYEDDI++ PS PDVSNYL+SEDD S SNG +D  
Sbjct: 385  NVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKDVPSPPDVSNYLVSEDDGSISNGNRDPF 444

Query: 1976 GFDGMADSEVERRLKETSTSSAASLPI--ANIDPRLTQALQYAVSSSSFTVXXXXXXXXX 1803
             FDGMAD+EVER+LK+ + ++A++ P+  AN+DPRLT +LQY +  S  +V         
Sbjct: 445  LFDGMADAEVERKLKD-ALAAASTFPVTTANLDPRLT-SLQYTMVPSG-SVPPPTAQASM 501

Query: 1802 XPFTSQPFSQVGMFKHPIAQLSQAETTVQSSPAREEGEVPESELDPDTRRRLLILQHGQD 1623
             PF    F Q      P+ Q + ++ ++ SSPAREEGEVPESELDPDTRRRLLILQHGQD
Sbjct: 502  MPFPHVQFPQPATLVKPMGQAAPSDPSLHSSPAREEGEVPESELDPDTRRRLLILQHGQD 561

Query: 1622 MREPPPSEPQFPARPPMQASLPRA-QTRG-WFPVEEETTQGQLNRVAPPNDFVLNAESNT 1449
             R+   +EP FP R P+QAS PR   +RG WFPVEEE     LNRV  P +F +++    
Sbjct: 562  TRDHASAEPPFPVRHPVQASAPRVPSSRGVWFPVEEEIGSQPLNRVV-PKEFPVDSGPLG 620

Query: 1448 IDKIRAPHQPFLQKVEPSVPPGRVLLES-QRLPKEAFSREDQLRLNQAVPDFPSFSGQDS 1272
            I+K R  H  F  KVE S+   R+L +S QRLPKE + R+D+ RLN  +  + SFSG D 
Sbjct: 621  IEKPRLHHPSFFNKVESSISSDRILHDSHQRLPKEMYHRDDRPRLNHMLSSYRSFSGDDI 680

Query: 1271 PVAQPPSASKDLDLEAGQIDPYTETCTGALQEIAFKCGTKVEFNQALVSSTELQFIVEVL 1092
            P ++  S+ +DLD E+G    + +T    L EIA KCGTKV+F  +LV+STEL+F +E  
Sbjct: 681  PFSRSSSSHRDLDSESGHSVLHADTPVAVLHEIALKCGTKVDFMSSLVASTELKFSLEAW 740

Query: 1091 FAGEKIGQGIGRTXXXXXXXXXXESLVYLADKYLSQRRPD-SYVAGDGGRFTANQKENGF 915
            F+G+KIG G GRT          +S+ +LAD YLS  + +     GD   F  N  +NG+
Sbjct: 741  FSGKKIGHGFGRTRKEAQNKAAKDSIEHLADIYLSSAKDEPGSTYGDVSGF-PNVNDNGY 799

Query: 914  VSDPNTSGYQSLPKEEGAPFSSA---RNLDPRIEPSKKPLGSSLAALKELCTMEGLSVAF 744
            +   ++ G Q L KE+ A FSSA   R LDPR++ SK+ +G S++ALKELC MEGL V F
Sbjct: 800  MGIASSLGNQPLSKEDSASFSSASPSRALDPRLDVSKRSMG-SISALKELCMMEGLGVNF 858

Query: 743  QTQP-QFSAHPGQKNEVYAQVEINGQVLGKGIGLTWDEAKSEAAEKALGALKSMTVQFPY 567
             + P   S +  QK+EV+AQVEI+G++ GKGIGLTWDEAK +AAEKALG L+S   Q   
Sbjct: 859  LSTPAPVSTNSVQKDEVHAQVEIDGKIFGKGIGLTWDEAKMQAAEKALGNLRSKLGQSIQ 918

Query: 566  RHQGSPRSMHGVSSKRIKHDFSRVPQRM---GRYPRNGSPVP 450
            + Q SPR   G S+KR+K ++ R  QRM    RYPRN  P+P
Sbjct: 919  KMQSSPRPHQGFSNKRLKQEYPRTMQRMPSSARYPRNAPPIP 960


>ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
          Length = 956

 Score =  724 bits (1870), Expect = 0.0
 Identities = 401/702 (57%), Positives = 499/702 (71%), Gaps = 13/702 (1%)
 Frame = -1

Query: 2516 DLRTYLTAKGRKRFEVFVCTMAERDYALEMWRLLDPGSNLINSRDLLNRIVCVKSGCRKS 2337
            DLR+YLTA+GRKRFEV+VCTMAERDYALEMWRLLDP SNLINS++LL RIVCVKSG +KS
Sbjct: 261  DLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNLINSKELLGRIVCVKSGLKKS 320

Query: 2336 LFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEVNNTVPVLCVAR 2157
            LFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAE +NT+PVLCVAR
Sbjct: 321  LFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEASNTIPVLCVAR 380

Query: 2156 NVACNVRGCFFKDFDDGLLQRISEVAYEDDIRNAPSSPDVSNYLISEDDPSASNGIKDSN 1977
            NVACNVRG FFKDFDDGLLQ+I ++AYEDDI++ PS PDVSNYL+SEDD S SNG +D  
Sbjct: 381  NVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKDIPSPPDVSNYLVSEDDGSISNGHRDPF 440

Query: 1976 GFDGMADSEVERRLKETSTSSAASLPI--ANIDPRLTQALQYAVSSSSFTVXXXXXXXXX 1803
             FDGMAD+EVER+LK+ + S+A+++P+  AN+DPRLT +LQY +  S  +V         
Sbjct: 441  LFDGMADAEVERKLKD-ALSAASTIPVTTANLDPRLT-SLQYTMVPSG-SVPPPTAQASM 497

Query: 1802 XPFTSQPFSQVGMFKHPIAQLSQAETTVQSSPAREEGEVPESELDPDTRRRLLILQHGQD 1623
             PF    F Q      P+ Q + +E ++ SSPAREEGEVPESELDPDTRRRLLILQHGQD
Sbjct: 498  MPFPHVQFPQPATLVKPMGQAAPSEPSLHSSPAREEGEVPESELDPDTRRRLLILQHGQD 557

Query: 1622 MREPPPSEPQFPARPPMQASLPRA-QTRG-WFPVEEETTQGQLNRVAPPNDFVLNAESNT 1449
             R+   +EP FP R P+Q S P    +RG WFP EEE     LNRV  P +F +++    
Sbjct: 558  TRDHASAEPPFPVRHPVQTSAPHVPSSRGVWFPAEEEIGSQPLNRVV-PKEFPVDSGPLG 616

Query: 1448 IDKIRAPHQPFLQKVEPSVPPGRVLLES-QRLPKEAFSREDQLRLNQAVPDFPSFSGQDS 1272
            I K R  H  F  KVE S+   R+L +S QRLPKE + R+D+ RLN  +  + SFSG D 
Sbjct: 617  IAKPRPHHPSFFSKVESSISSDRILHDSHQRLPKEMYHRDDRPRLNHMLSSYRSFSGDDI 676

Query: 1271 PVAQPPSASKDLDLEAGQIDPYTETCTGALQEIAFKCGTKVEFNQALVSSTELQFIVEVL 1092
            P ++  S+ +DLD E+G    + +T    LQEIA KCGTKV+F  +LV+STELQF +E  
Sbjct: 677  PFSRSFSSHRDLDSESGHSVLHADTPVAVLQEIALKCGTKVDFISSLVASTELQFSMEAW 736

Query: 1091 FAGEKIGQGIGRTXXXXXXXXXXESLVYLADKYLSQRRPD-SYVAGDGGRFTANQKENGF 915
            F+G+KIG  +GRT          +S+ +LAD YLS  + +     GD   F  N  ++G+
Sbjct: 737  FSGKKIGHRVGRTRKEAQNKAAEDSIKHLADIYLSSAKDEPGSTYGDVSGF-PNVNDSGY 795

Query: 914  VSDPNTSGYQSLPKEEGAPFSSA---RNLDPRIEPSKKPLGSSLAALKELCTMEGLSVAF 744
            +   ++ G Q L KE+ A FS+A   R LDPR++ SK+ +G S+++LKELC MEGL V F
Sbjct: 796  MGIASSLGNQPLSKEDSASFSTASPSRVLDPRLDVSKRSMG-SISSLKELCMMEGLDVNF 854

Query: 743  QTQP-QFSAHPGQKNEVYAQVEINGQVLGKGIGLTWDEAKSEAAEKALGALKSMTVQFPY 567
             + P   S +  QK+EV+AQVEI+G+V GKGIGLTWDEAK +AAEKALG+L+S   Q   
Sbjct: 855  LSAPAPVSTNSVQKDEVHAQVEIDGKVFGKGIGLTWDEAKMQAAEKALGSLRSKLGQSIQ 914

Query: 566  RHQGSPRSMHGVSSKRIKHDFSRVPQRM---GRYPRNGSPVP 450
            + Q SPR   G S+KR+K ++ R  QRM    RYPRN  P+P
Sbjct: 915  KRQSSPRPHQGFSNKRLKQEYPRPMQRMPSSARYPRNAPPIP 956


>emb|CAN72816.1| hypothetical protein VITISV_004100 [Vitis vinifera]
          Length = 894

 Score =  722 bits (1864), Expect = 0.0
 Identities = 399/697 (57%), Positives = 489/697 (70%), Gaps = 8/697 (1%)
 Frame = -1

Query: 2516 DLRTYLTAKGRKRFEVFVCTMAERDYALEMWRLLDPGSNLINSRDLLNRIVCVKSGCRKS 2337
            DLR+YLTA+GRKRFEV+VCTMAERDYALEMWRLLDP SNLINS++LL+RIVCVKSG RKS
Sbjct: 241  DLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKS 300

Query: 2336 LFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEVNNTVPVLCVAR 2157
            LFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAE NN + VLCVAR
Sbjct: 301  LFNVFQDGICHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNAISVLCVAR 360

Query: 2156 NVACNVRGCFFKDFDDGLLQRISEVAYEDDIRNAPSSPDVSNYLISEDDPSASNGIKDSN 1977
            NVACNVRG FFK+FD+GLLQRI E++YED+I++  S+PDVSNYL+SEDD S SNG +D  
Sbjct: 361  NVACNVRGGFFKEFDEGLLQRIPEISYEDBIKDIRSAPDVSNYLVSEDDASVSNGNRDQP 420

Query: 1976 GFDGMADSEVERRLKETSTSSAASLPIANIDPRLTQALQYAVSSSSFTVXXXXXXXXXXP 1797
             FDGMAD EVER+LK+   + +A   + ++DPRL+  LQ+AV++SS             P
Sbjct: 421  CFDGMADVEVERKLKD---AISAPSTVTSLDPRLSPPLQFAVAASSGLAPQPAAQGSIMP 477

Query: 1796 FTSQPFSQVGMFKHPIAQLSQAETTVQSSPAREEGEVPESELDPDTRRRLLILQHGQDMR 1617
            F+++ F Q      P+A     E T+QSSPAREEGEVPESELDPDTRRRLLILQHGQD R
Sbjct: 478  FSNKQFPQSASLIKPLA----PEPTMQSSPAREEGEVPESELDPDTRRRLLILQHGQDTR 533

Query: 1616 EPPPSEPQFPARPPMQASLPRAQTRG-WFPVEEETTQGQLNRVAPPNDFVLNAESNTIDK 1440
            E   S+P FP RPP+Q S+PR Q+RG WFP +EE +  QLNR A P +F L++++  I+K
Sbjct: 534  EHASSDPPFPVRPPIQVSVPRVQSRGSWFPADEEMSPRQLNR-AVPKEFPLDSDTMHIEK 592

Query: 1439 IRAPHQPFLQKVEPSVPPGRVLLESQRLPKEAFSREDQLRLNQAVPDFPSFSGQDSPVAQ 1260
             R  H  F  KVE S    R+L E+QRL KE   R+D+LRLN ++P + SFSG++ P+ +
Sbjct: 593  HRPHHPSFFHKVESSASSDRILHENQRLSKEVLHRDDRLRLNHSLPGYHSFSGEEVPLGR 652

Query: 1259 PPSASKDLDLEAGQIDPYTETCTGALQEIAFKCGTKVEFNQALVSSTELQFIVEVLFAGE 1080
              S+++DLD E+G+  PY ET    L                      L+   EV   GE
Sbjct: 653  -SSSNRDLDFESGRGAPYAETPAVGL----------------------LRNCNEVWNQGE 689

Query: 1079 KIGQGIGRTXXXXXXXXXXESLVYLADKYLSQRRPDSYVAGDGGRFTANQKENGFVSDPN 900
            KIG+G G+T           SL+YL+ +YL          GD  RF  N  +N F+SD N
Sbjct: 690  KIGEGTGKTRREAQCQAAEASLMYLSYRYLH---------GDVNRF-PNASDNNFMSDTN 739

Query: 899  TSGYQSLPKEEGAPFS----SARNLDPRIEPSKKPLGSSLAALKELCTMEGLSVAFQTQP 732
            + GYQS PKE    FS    S+R LDPR+E SKK +G S++ALKELC MEGL V F +QP
Sbjct: 740  SFGYQSFPKEGSMSFSTASESSRLLDPRLESSKKSMG-SISALKELCMMEGLGVEFLSQP 798

Query: 731  QFSAHPGQKNEVYAQVEINGQVLGKGIGLTWDEAKSEAAEKALGALKSMTVQFPYRHQGS 552
              S++  QK E+ AQVEI+GQVLGKG G TWD+AK +AAEKALG+LKSM  QF  + QGS
Sbjct: 799  PLSSNSTQKEEICAQVEIDGQVLGKGTGSTWDDAKMQAAEKALGSLKSMLGQFSQKRQGS 858

Query: 551  PRSMHGVSSKRIKHDFSRVPQR---MGRYPRNGSPVP 450
            PRS+ G+  KR+K +F+R  QR    GRY +N SPVP
Sbjct: 859  PRSLQGM-GKRLKSEFTRGLQRTPSSGRYSKNTSPVP 894


>ref|XP_007159305.1| hypothetical protein PHAVU_002G226900g [Phaseolus vulgaris]
            gi|561032720|gb|ESW31299.1| hypothetical protein
            PHAVU_002G226900g [Phaseolus vulgaris]
          Length = 964

 Score =  714 bits (1844), Expect = 0.0
 Identities = 398/712 (55%), Positives = 498/712 (69%), Gaps = 23/712 (3%)
 Frame = -1

Query: 2516 DLRTYLTAKGRKRFEVFVCTMAERDYALEMWRLLDPGSNLINSRDLLNRIVCVKSGCRKS 2337
            DLR+YLTA+GRKRFEV+VCTMAERDYALEMWRLLDP SNLINS++LL RIVCVKSG +KS
Sbjct: 258  DLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNLINSKELLGRIVCVKSGLKKS 317

Query: 2336 LFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEVNNTVPVLCVAR 2157
            LFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAE +N++PVLCVAR
Sbjct: 318  LFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEASNSIPVLCVAR 377

Query: 2156 NVACNVRGCFFKDFDDGLLQRISEVAYEDDIRNAPSSPDVSNYLISEDDPSA--SNGIKD 1983
            NVACNVRG FFK+FDDGLLQ+I +VAYEDDI++ P  PDVSNYL+SEDD S+  SNG +D
Sbjct: 378  NVACNVRGGFFKEFDDGLLQKIPQVAYEDDIKDIPIPPDVSNYLVSEDDGSSAISNGNRD 437

Query: 1982 SNGFDGMADSEVERRLK--------ETSTSSAASLPI--ANIDPRLTQALQYAVSSSSFT 1833
               FD M D+EVER+ K          + S+A+++P+  AN+DPRLT +LQYA+ SS  +
Sbjct: 438  PFLFDSMGDAEVERKSKVPTRAPNEHDALSAASTIPVTTANLDPRLT-SLQYAMVSSG-S 495

Query: 1832 VXXXXXXXXXXPFTSQPFSQVGMFKHPIAQLSQAETTVQSSPAREEGEVPESELDPDTRR 1653
                       PFT   F Q      P+ Q + +E+++ SSPAREEGEVPESELDPDTRR
Sbjct: 496  APPPTAQASMMPFTHVQFPQPAALVKPMGQAAPSESSLHSSPAREEGEVPESELDPDTRR 555

Query: 1652 RLLILQHGQDMREPPPSEPQFPARPPMQASLPRAQTR-GWFPVEEETTQGQLNRVAPPND 1476
            RLLILQHGQD R+   +EP +  R P+  S PR  +R GWFP EE+     LNRV  P +
Sbjct: 556  RLLILQHGQDTRDHTSNEPTYAIRHPVPVSAPRVSSRGGWFPAEEDIGSQPLNRVV-PKE 614

Query: 1475 FVLNAESNTIDKIRAPHQPFLQKVEPSVPPGRVLLES-QRLPKEAFSREDQLRLNQAVPD 1299
            F +++ S  I+K R  H  F  KVE S+   R+L +S QRLPKE + R+D+ R N  +  
Sbjct: 615  FSVDSGSLVIEKHRPHHPSFFSKVESSISSDRILHDSHQRLPKEMYHRDDRPRSNHMLSS 674

Query: 1298 FPSFSGQDSPVAQPPSASKDLDLEAGQIDPYTETCTGALQEIAFKCGTKVEFNQALVSST 1119
            + S S  + P ++  S+ +DLD E+     + +T    LQEIA KCGTKVEF  +LV+ST
Sbjct: 675  YRSLSVDEIPFSRSSSSHRDLDSESSHSVFHADTPVVVLQEIALKCGTKVEFMSSLVAST 734

Query: 1118 ELQFIVEVLFAGEKIGQGIGRTXXXXXXXXXXESLVYLADKYLSQRRPD-SYVAGDGGRF 942
            ELQF +E  F+G+KIG G GRT          +S+ +LAD YLS  + +     GD G F
Sbjct: 735  ELQFSIEAWFSGKKIGHGFGRTRKEAQHKAAEDSIKHLADIYLSSAKDEPGSTYGDVGGF 794

Query: 941  TANQKENGFVSDPNTSGYQSLPKEEGAPFSSA----RNLDPRIEPSKKPLGSSLAALKEL 774
              N  +NG++   ++   Q LPKE+ A FS+A    R LDPR+E SK+P+G S++ALKEL
Sbjct: 795  -PNANDNGYMVIASSLSNQPLPKEDSASFSTASDPSRVLDPRLEVSKRPMG-SISALKEL 852

Query: 773  CTMEGLSVAFQTQP-QFSAHPGQKNEVYAQVEINGQVLGKGIGLTWDEAKSEAAEKALGA 597
            C MEGL V F + P   S +  QK+EV+AQVEI+G+V GKGIGLTWDEAK +AAEKALG+
Sbjct: 853  CMMEGLGVNFLSAPAPVSTNSLQKDEVHAQVEIDGKVFGKGIGLTWDEAKMQAAEKALGS 912

Query: 596  LKSMTVQFPYRHQGSPRSMHGVSSKRIKHDFSRVPQRM---GRYPRNGSPVP 450
            L+S   Q   + Q SPRS  G S+KR+K ++ R  QR+    RYPRN  P+P
Sbjct: 913  LRSKLGQSIQKRQSSPRSHQGFSNKRLKQEYPRAMQRIPSSTRYPRNAPPIP 964


>ref|XP_004505032.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Cicer arietinum]
          Length = 951

 Score =  712 bits (1839), Expect = 0.0
 Identities = 393/700 (56%), Positives = 486/700 (69%), Gaps = 11/700 (1%)
 Frame = -1

Query: 2516 DLRTYLTAKGRKRFEVFVCTMAERDYALEMWRLLDPGSNLINSRDLLNRIVCVKSGCRKS 2337
            DLR+YLTA+GRKRFEV+VCTMAERDYALEMWRLLDP SNLINS++LL RIVCVKSG +KS
Sbjct: 257  DLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNLINSKELLGRIVCVKSGLKKS 316

Query: 2336 LFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEVNNTVPVLCVAR 2157
            LFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAE +NT+PVLCVAR
Sbjct: 317  LFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEASNTIPVLCVAR 376

Query: 2156 NVACNVRGCFFKDFDDGLLQRISEVAYEDDIRNAPSSPDVSNYLISEDDPSASNGIKDSN 1977
            NVACNVRG FFKDFDDGLLQ+IS++AYE++ R+   +PDVSNYL+SEDD SAS   +D  
Sbjct: 377  NVACNVRGGFFKDFDDGLLQKISQIAYENNTRDISPAPDVSNYLVSEDDGSASYANRDPF 436

Query: 1976 GFDGMADSEVERRLKET-STSSAASLPIANIDPRLTQALQYAVSSSSFTVXXXXXXXXXX 1800
             FDGMAD+EVER+LK+  S +SA  +  A +DPRLT +LQY + S   +V          
Sbjct: 437  AFDGMADAEVERKLKDAISAASAIPMTTAKLDPRLTSSLQYTMVSPG-SVLPPAAQASMI 495

Query: 1799 PFTSQPFSQVGMFKHPIAQLSQAETTVQSSPAREEGEVPESELDPDTRRRLLILQHGQDM 1620
            P     F Q      PI Q++ +E ++ SSPAREEGEVPESELDPDTRRRLLILQHGQD 
Sbjct: 496  PLPHTQFPQPATLVKPIGQVAPSELSLHSSPAREEGEVPESELDPDTRRRLLILQHGQDN 555

Query: 1619 REPPPSEPQFPARPPMQASLPRAQTRGWFPVEEETTQGQLNRVAPPNDFVLNAESNTIDK 1440
            R+   SEP FP + P+Q S       GWFPVEEE      NRV  P +  L++  + I+K
Sbjct: 556  RDHTSSEPPFPLKHPVQVSARVPPRGGWFPVEEEIGSQPPNRVI-PKEIALDSGPSRIEK 614

Query: 1439 IRAPHQPFLQKVEPSVPPGRVLLE-SQRLPKEAFSREDQLRLNQAVPDFPSFSGQDSPVA 1263
             R   QPF  KV+ S+   R L E +QRLPKE + R+D+ R++  +  +PS SG D+P  
Sbjct: 615  HRLHQQPFFPKVDGSISSDRALHETNQRLPKEMYHRDDRSRVSHMLSSYPSLSGDDTPFG 674

Query: 1262 QPPSASKDLDLEAGQIDPYTETCTGALQEIAFKCGTKVEFNQALVSSTELQFIVEVLFAG 1083
            +  S+ +D D E+G      ET    LQEIA KCGTKVEF  +L +S ELQF +E  F+G
Sbjct: 675  RSSSSHRDFDSESGHSVFNAETPAIVLQEIALKCGTKVEFTSSLAASRELQFSIEAWFSG 734

Query: 1082 EKIGQGIGRTXXXXXXXXXXESLVYLADKYLSQRRPDSYVA-GDGGRFTANQKENGFVSD 906
            +KIG G GRT          +S+ +LAD YLS+ + +S  A GD   F  N  +NG+V +
Sbjct: 735  KKIGHGFGRTRMEAQYKAAEDSIKHLADIYLSRAKDESGSAFGDVSGF-PNANDNGYVGN 793

Query: 905  PNTSGYQSLPKEEGAPFSSA----RNLDPRIEPSKKPLGSSLAALKELCTMEGLSVAFQT 738
             ++ G Q LPKEE   FS+A    R LDPR++ SK+ +G S++ALKELC +EGL V F +
Sbjct: 794  VSSLGNQPLPKEESVSFSAASDPSRVLDPRLDVSKRSMG-SVSALKELCMVEGLGVNFLS 852

Query: 737  QPQFSAHPGQKNEVYAQVEINGQVLGKGIGLTWDEAKSEAAEKALGALK-SMTVQFPYRH 561
             P         +EV+AQVEI+GQV GKG G+TWDEAK +AAEKALG+L+ ++  Q   R 
Sbjct: 853  LPA-PVSTNSVDEVHAQVEIDGQVYGKGTGITWDEAKMQAAEKALGSLRTTIHGQGIQRR 911

Query: 560  QGSPRSMHGVSSKRIKHDFSRVPQRM---GRYPRNGSPVP 450
            Q SPR   G+S+KR+K +  R  QR    GRYPRN  P+P
Sbjct: 912  QLSPRPFQGLSNKRLKQEHPRTLQRFASSGRYPRNAPPIP 951


>ref|XP_003543063.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
            gi|571500215|ref|XP_006594604.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 1-like
            isoform X2 [Glycine max]
          Length = 960

 Score =  706 bits (1822), Expect = 0.0
 Identities = 396/705 (56%), Positives = 488/705 (69%), Gaps = 16/705 (2%)
 Frame = -1

Query: 2516 DLRTYLTAKGRKRFEVFVCTMAERDYALEMWRLLDPGSNLINSRDLLNRIVCVKSGCRKS 2337
            DLR+YLTA+GRKRFEVFVCTMAERDYALEMWRLLDP  NLINS++LL+RIVCVKSG +KS
Sbjct: 260  DLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPELNLINSKELLDRIVCVKSGLKKS 319

Query: 2336 LFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEVNNTVPVLCVAR 2157
            LFNVFQ+G CH KMALVIDDRLKVWDEKDQPRVHVVPAFAPYY PQAE +N VP LC+AR
Sbjct: 320  LFNVFQNGLCHLKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYTPQAEASNAVPFLCLAR 379

Query: 2156 NVACNVRGCFFKDFDDGLLQRISEVAYEDDIRNAPSSPDVSNYLISEDDPSASNGIKDSN 1977
            NVACNVRG FFKDFDDGLLQ+I  +AYEDDI++ P SPDVSNYL+SEDD SASNG K+  
Sbjct: 380  NVACNVRGGFFKDFDDGLLQKIPLIAYEDDIKDIP-SPDVSNYLVSEDDASASNGNKNLL 438

Query: 1976 GFDGMADSEVERRLKETSTSSAASLPI-ANIDPRL--TQALQYAVSSSSFTVXXXXXXXX 1806
             FDGMAD+EVERRLK+  ++S+  L + ANIDPRL  T +LQY + SSS TV        
Sbjct: 439  LFDGMADAEVERRLKDAISASSTILALTANIDPRLAFTSSLQYTMVSSSGTVPPPTAQAS 498

Query: 1805 XXPFTSQPFSQVGMFKHPIAQLSQAETTVQSSPAREEGEVPESELDPDTRRRLLILQHGQ 1626
               F +  F Q      P++Q++    ++ SSPAREEGE+PESELD DTRRR LILQHGQ
Sbjct: 499  VVQFGNVQFPQPNTLVKPMSQVTHPGLSLHSSPAREEGELPESELDLDTRRRFLILQHGQ 558

Query: 1625 DMREPPPSEPQFPARPPMQASLPRAQT---RGWFPVEEETTQGQLNRVAPPNDFVLNAES 1455
            D RE   SEP FP R P Q S P +     RGWF VEEE    QLN +  P +F +++E 
Sbjct: 559  DTRERMASEPPFPVRHPAQVSAPASSVPSRRGWFSVEEEMGPQQLN-LPVPKEFPVDSEP 617

Query: 1454 NTIDKIRAPHQPFLQKVEPSVPPGRVLLES-QRLPKEAFSREDQLRLNQAVPDFPSFSGQ 1278
              I+K    H  F  KV  S+   RV  ES QRLPKE   R+D+ RL+Q++  + S  G 
Sbjct: 618  FHIEKRWPRHPSFFSKVGDSISSDRVFHESHQRLPKEVHHRDDRSRLSQSLSSYHSLPGD 677

Query: 1277 DSPVAQPPSASKDLDLEAGQIDPYTETCTGALQEIAFKCGTKVEFNQALVSSTELQFIVE 1098
            D P++    +++D D E+G+   + +T  G LQEIA  CGTKVEF  +LV+STELQF +E
Sbjct: 678  DIPLSGSSYSNRDFDSESGRSLFHADTTAGVLQEIALNCGTKVEFLSSLVASTELQFSIE 737

Query: 1097 VLFAGEKIGQGIGRTXXXXXXXXXXESLVYLADKYLSQRRPDS-YVAGDGGRFTANQKEN 921
              FAG+KIG+G GRT           S+  LAD Y+S  + DS    GD   F  +  + 
Sbjct: 738  AWFAGKKIGEGFGRTRREAQSKAAGCSIKQLADIYMSHAKDDSGSTYGDVSGFHGSNND- 796

Query: 920  GFVSDPNTSGYQSLPKEEGAPFS----SARNLDPRIEPSKKPLGSSLAALKELCTMEGLS 753
            GFVS  N+ G Q LPKEE   FS    S+R  D R+E SK+    S++ALKELC MEGL+
Sbjct: 797  GFVSSGNSLGNQLLPKEESGSFSTASESSRVSDSRLEVSKRST-DSISALKELCMMEGLA 855

Query: 752  VAFQTQP-QFSAHPGQKNEVYAQVEINGQVLGKGIGLTWDEAKSEAAEKALGALKSMTVQ 576
             +FQ+ P   S H  QK+EV+AQVEI+GQ+ GKG G+TW+EAK +AA+KALG+L++M  Q
Sbjct: 856  ASFQSPPASASTHLTQKDEVHAQVEIDGQIFGKGFGVTWEEAKMQAAKKALGSLRTMFNQ 915

Query: 575  FPYRHQGSPRSMHGVSSKRIKHDFSRVPQRM---GRYPRNGSPVP 450
               +  GSPRSM G+++KR+K ++    QR+    RYPRN   VP
Sbjct: 916  GSLKRHGSPRSMQGLANKRLKPEYPPTLQRVPYSARYPRNAPLVP 960


>ref|XP_003545893.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
          Length = 958

 Score =  702 bits (1813), Expect = 0.0
 Identities = 393/702 (55%), Positives = 478/702 (68%), Gaps = 13/702 (1%)
 Frame = -1

Query: 2516 DLRTYLTAKGRKRFEVFVCTMAERDYALEMWRLLDPGSNLINSRDLLNRIVCVKSGCRKS 2337
            DLR+YLTA+GRKRFEVFVCTMAERDYALEMWRLLDP  NLINS++LL+RIVCVKSG +KS
Sbjct: 260  DLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPELNLINSKELLDRIVCVKSGLKKS 319

Query: 2336 LFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEVNNTVPVLCVAR 2157
            LFNVFQ+G CH KMALVIDDRLKVWDEKDQP+VHVVPAFAPYYAPQAE +N VP LC+AR
Sbjct: 320  LFNVFQNGLCHLKMALVIDDRLKVWDEKDQPQVHVVPAFAPYYAPQAEASNAVPTLCLAR 379

Query: 2156 NVACNVRGCFFKDFDDGLLQRISEVAYEDDIRNAPSSPDVSNYLISEDDPSASNGIKDSN 1977
            +VACNVRG FFKDFDDGLLQ+I  +AYEDDI++ PS PDVSNYL+SEDD SASNG K+  
Sbjct: 380  SVACNVRGGFFKDFDDGLLQKIPLIAYEDDIKDIPSPPDVSNYLVSEDDASASNGNKNLL 439

Query: 1976 GFDGMADSEVERRLKET-STSSAASLPIANIDPRL--TQALQYAVSSSSFTVXXXXXXXX 1806
             FDGMAD+EVERRLK+  S SS       N+DPRL    +LQY + SSS TV        
Sbjct: 440  LFDGMADAEVERRLKDAISASSTVPAMTTNLDPRLAFNSSLQYTMVSSSGTVPPPTAQAS 499

Query: 1805 XXPFTSQPFSQVGMFKHPIAQLSQAETTVQSSPAREEGEVPESELDPDTRRRLLILQHGQ 1626
               F +  F Q      PI Q++    ++ SSPAREEGEVPESELD DTRRRLLILQHGQ
Sbjct: 500  IVQFGNVQFPQPNTLVKPICQVTPPGPSLHSSPAREEGEVPESELDLDTRRRLLILQHGQ 559

Query: 1625 DMREPPPSEPQFPARPPMQASLPRAQT-RGWFPVEEETTQGQLNRVAPPNDFVLNAESNT 1449
            D RE   SEP  P R P Q S P   + RGWF VEEE    QLN++  P +F + +E   
Sbjct: 560  DTREHTSSEPPLPVRHPTQVSAPSVPSRRGWFSVEEEMGPQQLNQLV-PKEFPVGSEPLH 618

Query: 1448 IDKIRAPHQPFLQKVEPSVPPGRVLLES-QRLPKEAFSREDQLRLNQAVPDFPSFSGQDS 1272
            I+K    H     KV+ SV   RV  ES QRLPKE   R+D  RL+Q++  + SF G D 
Sbjct: 619  IEKRWPRHPSLFSKVDDSVSSDRVFHESHQRLPKEVHHRDDHSRLSQSLSSYHSFPGDDI 678

Query: 1271 PVAQPPSASKDLDLEAGQIDPYTETCTGALQEIAFKCGTKVEFNQALVSSTELQFIVEVL 1092
            P++    +++D D E+G+   + +   G LQEIA KCGTKVEF  +LV+ST LQF +E  
Sbjct: 679  PLSGSSYSNRDFDSESGRSLFHADITAGVLQEIALKCGTKVEFLSSLVASTALQFSIEAW 738

Query: 1091 FAGEKIGQGIGRTXXXXXXXXXXESLVYLADKYLSQRRPDS-YVAGDGGRFTANQKENGF 915
            FAG+K+G+G GRT           S+  LAD Y+S  + DS    GD   F  +   NGF
Sbjct: 739  FAGKKVGEGFGRTRREAQNKAAECSIKQLADIYMSHAKDDSGSTYGDVSGFHGS-NNNGF 797

Query: 914  VSDPNTSGYQSLPKEE---GAPFSSARNLDPRIEPSKKPLGSSLAALKELCTMEGLSVAF 744
            VS  N+ G Q LPKE         S+R  DPR+E SK+    S++ALKE C MEGL+  F
Sbjct: 798  VSSGNSLGNQLLPKESVSFSTSSDSSRVSDPRLEVSKRST-DSISALKEFCMMEGLAANF 856

Query: 743  QTQP-QFSAHPGQKNEVYAQVEINGQVLGKGIGLTWDEAKSEAAEKALGALKSMTVQFPY 567
            Q+ P   S H  QK+EV+AQVEI+GQ+ GKG GLTW+EAK +AA+KAL +L++M  Q   
Sbjct: 857  QSSPAPASTHFAQKDEVHAQVEIDGQIFGKGFGLTWEEAKMQAAKKALESLRTMFNQGTR 916

Query: 566  RHQGSPRSMHGVSSKRIKHDFSRVPQRM---GRYPRNGSPVP 450
            +  GSPRSM G+++KR+K ++ R  QR+    RYPRN   VP
Sbjct: 917  KRHGSPRSMQGLANKRLKQEYPRTLQRIPYSARYPRNAPLVP 958


>ref|XP_006583810.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X2 [Glycine max]
          Length = 929

 Score =  697 bits (1800), Expect = 0.0
 Identities = 395/702 (56%), Positives = 487/702 (69%), Gaps = 13/702 (1%)
 Frame = -1

Query: 2516 DLRTYLTAKGRKRFEVFVCTMAERDYALEMWRLLDPGSNLINSRDLLNRIVCVKSGCRKS 2337
            DLR+YLTA+GRKRFEV+VCTMAERDYALEMWRLLDP SNLINS++LL RIVCVKSG +KS
Sbjct: 261  DLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNLINSKELLGRIVCVKSGLKKS 320

Query: 2336 LFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEVNNTVPVLCVAR 2157
            LFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAE +NT+PVLCVAR
Sbjct: 321  LFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEASNTIPVLCVAR 380

Query: 2156 NVACNVRGCFFKDFDDGLLQRISEVAYEDDIRNAPSSPDVSNYLISEDDPSASNGIKDSN 1977
            NVACNVRG FFKDFDDGLLQ+I ++AYEDDI++ PS PDVSNYL+SEDD S SNG +D  
Sbjct: 381  NVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKDIPSPPDVSNYLVSEDDGSISNGHRDPF 440

Query: 1976 GFDGMADSEVERRLKETSTSSAASLPI--ANIDPRLTQALQYAVSSSSFTVXXXXXXXXX 1803
             FDGMAD+EVER+LK+ + S+A+++P+  AN+DPRLT +LQY +  S  +V         
Sbjct: 441  LFDGMADAEVERKLKD-ALSAASTIPVTTANLDPRLT-SLQYTMVPSG-SVPPPTAQASM 497

Query: 1802 XPFTSQPFSQVGMFKHPIAQLSQAETTVQSSPAREEGEVPESELDPDTRRRLLILQHGQD 1623
             PF    F Q      P+ Q + +E ++ SSPAREEGEVPESELDPDTRRRLLILQHGQD
Sbjct: 498  MPFPHVQFPQPATLVKPMGQAAPSEPSLHSSPAREEGEVPESELDPDTRRRLLILQHGQD 557

Query: 1622 MREPPPSEPQFPARPPMQASLPRA-QTRG-WFPVEEETTQGQLNRVAPPNDFVLNAESNT 1449
             R+   +EP FP R P+Q S P    +RG WFP EEE     LNRV  P +F +++    
Sbjct: 558  TRDHASAEPPFPVRHPVQTSAPHVPSSRGVWFPAEEEIGSQPLNRVV-PKEFPVDSGPLG 616

Query: 1448 IDKIRAPHQPFLQKVEPSVPPGRVLLES-QRLPKEAFSREDQLRLNQAVPDFPSFSGQDS 1272
            I K R  H  F  KVE S+   R+L +S QRLPKE + R+D+ RLN  +  + SFS  D+
Sbjct: 617  IAKPRPHHPSFFSKVESSISSDRILHDSHQRLPKEMYHRDDRPRLNHMLSSYRSFS--DT 674

Query: 1271 PVAQPPSASKDLDLEAGQIDPYTETCTGALQEIAFKCGTKVEFNQALVSSTELQFIVEVL 1092
            PVA                          LQEIA KCGTKV+F  +LV+STELQF +E  
Sbjct: 675  PVA-------------------------VLQEIALKCGTKVDFISSLVASTELQFSMEAW 709

Query: 1091 FAGEKIGQGIGRTXXXXXXXXXXESLVYLADKYLSQRRPD-SYVAGDGGRFTANQKENGF 915
            F+G+KIG  +GRT          +S+ +LAD YLS  + +     GD   F  N  ++G+
Sbjct: 710  FSGKKIGHRVGRTRKEAQNKAAEDSIKHLADIYLSSAKDEPGSTYGDVSGF-PNVNDSGY 768

Query: 914  VSDPNTSGYQSLPKEEGAPFSSA---RNLDPRIEPSKKPLGSSLAALKELCTMEGLSVAF 744
            +   ++ G Q L KE+ A FS+A   R LDPR++ SK+ +G S+++LKELC MEGL V F
Sbjct: 769  MGIASSLGNQPLSKEDSASFSTASPSRVLDPRLDVSKRSMG-SISSLKELCMMEGLDVNF 827

Query: 743  QTQP-QFSAHPGQKNEVYAQVEINGQVLGKGIGLTWDEAKSEAAEKALGALKSMTVQFPY 567
             + P   S +  QK+EV+AQVEI+G+V GKGIGLTWDEAK +AAEKALG+L+S   Q   
Sbjct: 828  LSAPAPVSTNSVQKDEVHAQVEIDGKVFGKGIGLTWDEAKMQAAEKALGSLRSKLGQSIQ 887

Query: 566  RHQGSPRSMHGVSSKRIKHDFSRVPQRM---GRYPRNGSPVP 450
            + Q SPR   G S+KR+K ++ R  QRM    RYPRN  P+P
Sbjct: 888  KRQSSPRPHQGFSNKRLKQEYPRPMQRMPSSARYPRNAPPIP 929

