BLASTX nr result

ID: Akebia22_contig00005503 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00005503
         (4189 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform...  1248   0.0  
ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Popu...  1237   0.0  
ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal doma...  1232   0.0  
ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citr...  1231   0.0  
ref|XP_007214548.1| hypothetical protein PRUPE_ppa000988mg [Prun...  1220   0.0  
ref|XP_007025681.1| C-terminal domain phosphatase-like 1 isoform...  1203   0.0  
ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal doma...  1201   0.0  
ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Popu...  1197   0.0  
ref|XP_002519032.1| double-stranded RNA binding protein, putativ...  1189   0.0  
ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal doma...  1184   0.0  
emb|CAN72816.1| hypothetical protein VITISV_004100 [Vitis vinifera]  1181   0.0  
ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal doma...  1172   0.0  
ref|XP_007159305.1| hypothetical protein PHAVU_002G226900g [Phas...  1165   0.0  
ref|XP_006583810.1| PREDICTED: RNA polymerase II C-terminal doma...  1155   0.0  
ref|XP_004505032.1| PREDICTED: RNA polymerase II C-terminal doma...  1154   0.0  
ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal doma...  1126   0.0  
ref|XP_003545893.1| PREDICTED: RNA polymerase II C-terminal doma...  1124   0.0  
ref|XP_006597421.1| PREDICTED: RNA polymerase II C-terminal doma...  1108   0.0  
ref|XP_004232844.1| PREDICTED: RNA polymerase II C-terminal doma...  1107   0.0  
ref|XP_003543063.1| PREDICTED: RNA polymerase II C-terminal doma...  1104   0.0  

>ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform 1 [Theobroma cacao]
            gi|508781046|gb|EOY28302.1| C-terminal domain
            phosphatase-like 1 isoform 1 [Theobroma cacao]
          Length = 978

 Score = 1248 bits (3230), Expect = 0.0
 Identities = 666/986 (67%), Positives = 751/986 (76%), Gaps = 34/986 (3%)
 Frame = +3

Query: 414  MFKSVVYQGNSLLGEVEIYPQNQ--------------KICMID---KEIRISHFSQPSER 542
            M+KSVVY+G  +LGEVEIYPQ Q              KI +++   KEIRI + +Q SER
Sbjct: 4    MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63

Query: 543  CPPLAVLHTIAPSGVCFKMESKLQ-----SKDSP-LFSLHSSCLREDKTAVMPLGEEELH 704
            CPPLAVLHTI  SG+CFKMES        S+DSP L  LHS C+R++KTAVMP+G+ ELH
Sbjct: 64   CPPLAVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCELH 123

Query: 705  LVAMSSRKNLEQYSCFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDR 884
            LVAM SR +     CFWGF VS GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDR
Sbjct: 124  LVAMYSRNS--DRPCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSFEDR 181

Query: 885  IDALQRKINTEMDPQRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAF 1064
            I+ALQRK+ TE+DPQRV+GM+AE+KRYQDDK+ILKQY ENDQVVENGKVIK+QSEVVPA 
Sbjct: 182  IEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVVPAL 241

Query: 1065 SDNYQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYV 1244
            SDN+QPI+RPLIRLQEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYV
Sbjct: 242  SDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 301

Query: 1245 CTMAERDYALEMWRLLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVI 1424
            CTMAERDYALEMWRLLDPESNLINSKELL R+VCVK+GSRKSL NVF DGICHPKMALVI
Sbjct: 302  CTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVI 361

Query: 1425 DDRLKVWDEKDQSRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGL 1604
            DDRLKVWDEKDQ RVHVVPAFAPYYAPQAEANN +PVLCVARNVACNVRGGFF+EFDEGL
Sbjct: 362  DDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGL 421

Query: 1605 LQLIREVFYEDGLANFPSAPDVSNFLISEDDVSA--SNKDPLGFEGVTDVE-ERRLKDVI 1775
            LQ I E+ YED + + PS PDV N+L+SEDD SA   NKDPL F+G+ D E ERRLK+ I
Sbjct: 422  LQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERRLKEAI 481

Query: 1776 QASSMVDS----LDPRPAPSLQNAVASSLSTISQPTPQGPIMSSQNNQF---TQVVKPLV 1934
             A+S V S    LDPR  PSLQ  + SS S+I     Q  I+S  N QF     VVKP+ 
Sbjct: 482  SATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPVVKPVA 541

Query: 1935 HVGPSEPSLQGSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXX 2114
             V   EPSLQ SPAREEGEVPESELDPDTRRRLLILQHGQDTRDHT  EP F        
Sbjct: 542  PVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPVRPTMQ 601

Query: 2115 XXXXXXXMQSRGSWFPLEEDMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHG 2294
                    QSRGSWF  EE+M PR+ NRA     PKE PL+SE MH +K   RHP FF  
Sbjct: 602  VSVPRG--QSRGSWFAAEEEMSPRQLNRA----APKEFPLDSERMHIEKH--RHPPFFPK 653

Query: 2295 AESSFPSDRMLNENQRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFE 2474
             ESS PSDR+L ENQR  KE    DDRL +NH+ S+Y SF GEE PL QS SS RD  FE
Sbjct: 654  VESSIPSDRLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFE 713

Query: 2475 SGRGSPPYTETPAGVLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTR 2654
            SGR +    ET AGVLQ+I +KCG KVEFR AL+AS +LQFSIE WF GEK+G G G+TR
Sbjct: 714  SGR-TVTSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFAGEKVGEGVGRTR 772

Query: 2655 KEAQRQAADSSLRNLANKYLSNVMHDPSSVRGDLNKPSHVNEMGL-GDSNSFGYQTLPKE 2831
            +EAQRQAA+ S++NLAN YLS +  D  S  GDL++  ++N+ G   + NSFG Q L KE
Sbjct: 773  REAQRQAAEESIKNLANTYLSRIKPDSGSAEGDLSRLHNINDNGFPSNVNSFGNQLLAKE 832

Query: 2832 DXXXXXXXXEPTRFLDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVE 3011
            +        E +R  DPRLE SKKSMGSV+ALKELCMMEGL +VF  QPP S+N + K E
Sbjct: 833  ESLSFSTASEQSRLADPRLEGSKKSMGSVTALKELCMMEGLGVVFQPQPPSSSNALQKDE 892

Query: 3012 VYAQVEIGGQILGKGVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKR 3191
            VYAQVEI GQ+LGKG G TW EAKMQAAE+AL +L+SMLG   Q R GSPR L    +KR
Sbjct: 893  VYAQVEIDGQVLGKGTGLTWEEAKMQAAEKALGSLRSMLGQYSQKRQGSPRSLQGMQNKR 952

Query: 3192 LKPEFPRMLQRINSSTRYSNNAPPVP 3269
            LKPEFPR+LQR+ SS RY  NAPPVP
Sbjct: 953  LKPEFPRVLQRMPSSGRYPKNAPPVP 978


>ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Populus trichocarpa]
            gi|550340277|gb|EEE85528.2| hypothetical protein
            POPTR_0004s04010g [Populus trichocarpa]
          Length = 996

 Score = 1237 bits (3201), Expect = 0.0
 Identities = 666/1008 (66%), Positives = 757/1008 (75%), Gaps = 56/1008 (5%)
 Frame = +3

Query: 414  MFKSVVYQGNSLLGEVEIYPQNQK-------------ICMIDKEIRISHFSQPSERCPPL 554
            M+KSVVY+G+ LLGEVEIY Q Q+             I  I KEIRISHFSQ SERCPPL
Sbjct: 1    MYKSVVYKGDELLGEVEIYAQEQQQEEEENKNKKKRVIDEIVKEIRISHFSQTSERCPPL 60

Query: 555  AVLHTIAPSGVCFKME-------SKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVA 713
            AVLHTI   GVCFKME       +K+  ++SPL  LHSSC++E+KTAVM LG EELHLVA
Sbjct: 61   AVLHTITSIGVCFKMEESTSSSTTKISQQESPLHLLHSSCIQENKTAVMHLGGEELHLVA 120

Query: 714  MSSRKNLEQYSCFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDA 893
            M SR N  Q+ CFWGF V+PGLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDA
Sbjct: 121  MPSRSNERQHPCFWGFSVAPGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDA 180

Query: 894  LQRKINTEMDPQRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDN 1073
            LQRKI+TE+DPQR+ GML+E+KRY DDK+ILKQYVENDQVVENGKVIK QSEVVPA SDN
Sbjct: 181  LQRKISTEVDPQRILGMLSEVKRYHDDKNILKQYVENDQVVENGKVIKTQSEVVPALSDN 240

Query: 1074 YQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTM 1253
            +QP+VRPLIRLQEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTM
Sbjct: 241  HQPMVRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTM 300

Query: 1254 AERDYALEMWRLLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDR 1433
            AERDYALEMWRLLDPESNLINSKELL R+VCVK+G RKSL NVF DGICHPKMALVIDDR
Sbjct: 301  AERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLRKSLFNVFQDGICHPKMALVIDDR 360

Query: 1434 LKVWDEKDQSRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQL 1613
            LKVWDE+DQSRVHVVPAFAPYYAPQAE NN VPVLCVARNVACNVRGGFFKEFDEGLLQ 
Sbjct: 361  LKVWDERDQSRVHVVPAFAPYYAPQAEVNNAVPVLCVARNVACNVRGGFFKEFDEGLLQK 420

Query: 1614 IREVFYEDGLANFPSAPDVSNFLISEDDVSA--SNKDPLGFEGVTDVE-ERRLKDVIQAS 1784
            I EV YED   N PS PDVSN+L+SEDD SA   N+D L F+G+ D E ER+LK+ + AS
Sbjct: 421  IPEVAYEDDTDNIPSPPDVSNYLVSEDDASAVNGNRDQLSFDGMADAEVERQLKEAVSAS 480

Query: 1785 --------SMVDSLDPRPAPSLQNAVASSLSTISQPTPQGPIMSSQ-------------- 1898
                    S V SLDPR   SLQ  +ASS S  S PT Q  +++SQ              
Sbjct: 481  SAILSTIPSTVSSLDPRLLQSLQYTIASSSS--SMPTSQPSMLASQQPMPALQPPKPPSQ 538

Query: 1899 -------NNQFTQV---VKPLVHVGPSEPSLQGSPAREEGEVPESELDPDTRRRLLILQH 2048
                   N QF QV   VK L  V P EPSLQ SPAREEGEVPESELDPDTRRRLLILQH
Sbjct: 539  LSMTPFPNTQFPQVAPSVKQLGQVVPPEPSLQSSPAREEGEVPESELDPDTRRRLLILQH 598

Query: 2049 GQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEEDMIPRKPNRAVPKSVPKES 2228
            G D+RD+  SE PF               +QS GSW P+EE+M PR+ NR      P+E 
Sbjct: 599  GHDSRDNAPSESPF---PARPSTQVSAPRVQSVGSWVPVEEEMSPRQLNR-----TPREF 650

Query: 2229 PLESETMHFDKRRARHPSFFHGAESSFPSDRMLNENQRFPKEVHQGDDRLRVNHSTSNYR 2408
            PL+S+ M+ +K R  HPSFFH  ES+ PSDRM++ENQR PKE    DDR+++NHSTSNY 
Sbjct: 651  PLDSDPMNIEKHRTHHPSFFHKVESNIPSDRMIHENQRQPKEATYRDDRMKLNHSTSNYP 710

Query: 2409 SFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIGIKCGTKVEFRSALIASAE 2588
            SF GEE+PL +S SS+RD   ES R     TETP  VLQEI +KCGTKVEFR ALIA+++
Sbjct: 711  SFQGEESPLSRS-SSNRDLDLESERAFSS-TETPVEVLQEIAMKCGTKVEFRPALIATSD 768

Query: 2589 LQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYLSNVMHDPSSVRGDLNKPS 2768
            LQFSIE WF GEK+G GTGKTR+EAQRQAA+ S++ LA  Y+S V  D   + GD ++  
Sbjct: 769  LQFSIETWFVGEKVGEGTGKTRREAQRQAAEGSIKKLAGIYMSRVKPDSGPMLGDSSRYP 828

Query: 2769 HVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPRLEESKKSMGSVSALKELCMM 2945
              N+ G LGD NSFG Q L K++        EP+R LD RLE SKKSMGSV+ALKE CM 
Sbjct: 829  SANDNGFLGDMNSFGNQPLLKDENITYSATSEPSRLLDQRLEGSKKSMGSVTALKEFCMT 888

Query: 2946 EGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGKGVGATWNEAKMQAAEEALWNLKSM 3125
            EGL + FLAQ PLSTN +   EV+AQVEI GQ+LGKG+G TW+EAKMQAAE+AL +L++M
Sbjct: 889  EGLGVNFLAQTPLSTNSIPGEEVHAQVEIDGQVLGKGIGLTWDEAKMQAAEKALGSLRTM 948

Query: 3126 LGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTRYSNNAPPVP 3269
             G     R GSPRL+    +KRLK EFPR+LQR+ SS RY  NA PVP
Sbjct: 949  FGQYTPKRQGSPRLMQGMPNKRLKQEFPRVLQRMPSSARYHKNASPVP 996


>ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Citrus sinensis]
          Length = 957

 Score = 1232 bits (3188), Expect = 0.0
 Identities = 656/972 (67%), Positives = 748/972 (76%), Gaps = 20/972 (2%)
 Frame = +3

Query: 414  MFKSVVYQGNSLLGEVEIYPQNQ--------KICMIDKEIRISHFSQPSERCPPLAVLHT 569
            M+K+V Y G  +LGEVEIYPQ Q        K   +  EIRIS+FS+ SERCPPLAVLHT
Sbjct: 1    MYKTVAYLGKEILGEVEIYPQQQGEGGEGEEKNKKVFDEIRISYFSEASERCPPLAVLHT 60

Query: 570  IAPSGVCFKMESKLQSKDSPLFSLHSSCLREDKTAVMPLG-EEELHLVAMSSRKNLEQYS 746
            I  SG+CFKMESK  S +  L  LHSSC+RE+KTAVMPLG  EELHLVAM SR N +QY 
Sbjct: 61   ITASGICFKMESK-SSDNIQLHLLHSSCIRENKTAVMPLGLTEELHLVAMYSRNNEKQYP 119

Query: 747  CFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDP 926
            CFW F V  GLYNSCL MLNLRCLGIVFDLDETLIVANTMRSFEDRI+AL RKI+TE+DP
Sbjct: 120  CFWAFSVGSGLYNSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKISTEVDP 179

Query: 927  QRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRL 1106
            QR++GM AE+KRYQDDK+ILKQY ENDQV ENGKVIKVQSEVVPA SD++Q +VRPLIRL
Sbjct: 180  QRIAGMQAEVKRYQDDKNILKQYAENDQVNENGKVIKVQSEVVPALSDSHQALVRPLIRL 239

Query: 1107 QEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWR 1286
            QEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWR
Sbjct: 240  QEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWR 299

Query: 1287 LLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSR 1466
            LLDPESNLIN+KELL R+VCVK+GSRKSL NVF DG CHPKMALVIDDRLKVWD+KDQ R
Sbjct: 300  LLDPESNLINTKELLDRIVCVKSGSRKSLFNVFQDGTCHPKMALVIDDRLKVWDDKDQPR 359

Query: 1467 VHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLA 1646
            VHVVPAFAPYYAPQAEANN +PVLCVARN+ACNVRGGFFKEFDEGLLQ I E+ YED + 
Sbjct: 360  VHVVPAFAPYYAPQAEANNAIPVLCVARNIACNVRGGFFKEFDEGLLQRIPEISYEDDVK 419

Query: 1647 NFPSAPDVSNFLISEDDVSASN--KDPLGFEGVTDVE-ERRLKDVIQASSMVDS----LD 1805
            + PS PDVSN+L+SEDD + +N  KDPL F+G+ D E ERRLK+ I AS+ + S    LD
Sbjct: 420  DIPSPPDVSNYLVSEDDAATANGIKDPLSFDGMADAEVERRLKEAIAASATISSAVANLD 479

Query: 1806 PRPAPSLQNAVASSLSTISQPTPQGPIMSSQNNQF---TQVVKPLVHVGPSEPSLQGSPA 1976
            PR AP  Q  + SS ST + PT Q  +M   N QF   T +VKPL HVGP E SLQ SPA
Sbjct: 480  PRLAP-FQYTMPSSSSTTTLPTSQAAVMPLANMQFPPATSLVKPLGHVGPPEQSLQSSPA 538

Query: 1977 REEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSW 2156
            REEGEVPESELDPDTRRRLLILQHG DTR++  SE PF               + SRGSW
Sbjct: 539  REEGEVPESELDPDTRRRLLILQHGMDTRENAPSEAPF---PARTQMQVSVPRVPSRGSW 595

Query: 2157 FPLEEDMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNEN 2336
            FP+EE+M PR+ NRA    VPKE PL SE M  +K R  HPSFF   E+   SDR  +EN
Sbjct: 596  FPVEEEMSPRQLNRA----VPKEFPLNSEAMQIEKHRPPHPSFFPKIENPSTSDRP-HEN 650

Query: 2337 QRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAG 2516
            QR PKE  + DDRLR+NH+ S+Y+SF GEE PL +S SS RD  FESGR     TETP+G
Sbjct: 651  QRMPKEALRRDDRLRLNHTLSDYQSFSGEEIPLSRSSSSSRDVDFESGR-DVSSTETPSG 709

Query: 2517 VLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRN 2696
            VLQ+I +KCGTKVEFR AL+AS ELQFSIE WF GEKIG G G+TR+EAQRQAA+ S+++
Sbjct: 710  VLQDIAMKCGTKVEFRPALVASTELQFSIEAWFAGEKIGEGIGRTRREAQRQAAEGSIKH 769

Query: 2697 LANKYLSNVMHDPSSVRGDLNKPSHVNE-MGLGDSNSFGYQTLPKEDXXXXXXXXEPTRF 2873
            LAN Y+  V  D  S  GD ++ S+ NE   +G+ NSFG Q L K++        EP++ 
Sbjct: 770  LANVYMLRVKSDSGSGHGDGSRFSNANENCFMGEINSFGGQPLAKDE----SLSSEPSKL 825

Query: 2874 LDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGK 3053
            +DPRLE SKK MGSVSALKELCM EGL +VF  QPP S N + K EVYAQVEI GQ+LGK
Sbjct: 826  VDPRLEGSKKLMGSVSALKELCMTEGLGVVFQQQPPSSANSVQKDEVYAQVEIDGQVLGK 885

Query: 3054 GVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINS 3233
            G+G+TW+EAKMQAAE+AL +L+SM G   Q   GSPR L    +KRLKPEFPR+LQR+  
Sbjct: 886  GIGSTWDEAKMQAAEKALGSLRSMFGQFPQKHQGSPRSLQGMPNKRLKPEFPRVLQRMPP 945

Query: 3234 STRYSNNAPPVP 3269
            S RY  NAPPVP
Sbjct: 946  SGRYPKNAPPVP 957


>ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citrus clementina]
            gi|557551913|gb|ESR62542.1| hypothetical protein
            CICLE_v10014168mg [Citrus clementina]
          Length = 957

 Score = 1231 bits (3186), Expect = 0.0
 Identities = 657/972 (67%), Positives = 747/972 (76%), Gaps = 20/972 (2%)
 Frame = +3

Query: 414  MFKSVVYQGNSLLGEVEIYPQNQ--------KICMIDKEIRISHFSQPSERCPPLAVLHT 569
            M+K+V Y G  +LGEVEIYPQ Q        K   +  EIRIS+FS+ SERCPPLAVLHT
Sbjct: 1    MYKTVAYLGKEILGEVEIYPQQQGEGGEGEEKNKKVFDEIRISYFSEASERCPPLAVLHT 60

Query: 570  IAPSGVCFKMESKLQSKDSPLFSLHSSCLREDKTAVMPLG-EEELHLVAMSSRKNLEQYS 746
            I  SG+CFKMESK  S +  L  LHSSC+RE+KTAVM LG  EELHLVAM SR N +QY 
Sbjct: 61   ITASGICFKMESK-SSDNVQLHLLHSSCIRENKTAVMLLGLTEELHLVAMYSRNNEKQYP 119

Query: 747  CFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDP 926
            CFW F V  GLYNSCL MLNLRCLGIVFDLDETLIVANTMRSFEDRI+AL RKI+TE+DP
Sbjct: 120  CFWAFSVGSGLYNSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKISTEVDP 179

Query: 927  QRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRL 1106
            QR++GM AE+KRYQDDK+ILKQY ENDQV ENGKVIKVQSEVVPA SD++Q +VRPLIRL
Sbjct: 180  QRIAGMQAEVKRYQDDKNILKQYAENDQVNENGKVIKVQSEVVPALSDSHQALVRPLIRL 239

Query: 1107 QEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWR 1286
            QEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWR
Sbjct: 240  QEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWR 299

Query: 1287 LLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSR 1466
            LLDPESNLIN+KELL R+VCVK+GSRKSL NVF DG CHPKMALVIDDRLKVWDEKDQSR
Sbjct: 300  LLDPESNLINTKELLDRIVCVKSGSRKSLFNVFQDGTCHPKMALVIDDRLKVWDEKDQSR 359

Query: 1467 VHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLA 1646
            VHVVPAFAPYYAPQAEANN +PVLCVARN+ACNVRGGFFKEFDEGLLQ I E+ YED + 
Sbjct: 360  VHVVPAFAPYYAPQAEANNAIPVLCVARNIACNVRGGFFKEFDEGLLQRIPEISYEDDVK 419

Query: 1647 NFPSAPDVSNFLISEDDVSASN--KDPLGFEGVTDVE-ERRLKDVIQASSMVDS----LD 1805
              PS PDVSN+L+SEDD + +N  KDPL F+G+ D E ERRLK+ I AS+ + S    LD
Sbjct: 420  EIPSPPDVSNYLVSEDDAATANGIKDPLSFDGMADAEVERRLKEAIAASATISSAVANLD 479

Query: 1806 PRPAPSLQNAVASSLSTISQPTPQGPIMSSQNNQF---TQVVKPLVHVGPSEPSLQGSPA 1976
            PR AP  Q  + SS ST + PT Q  +M   N QF   T +VKPL HVGP E  LQ SPA
Sbjct: 480  PRLAP-FQYTMPSSSSTTTLPTSQAAVMPLANMQFPPATSLVKPLGHVGPPEQCLQSSPA 538

Query: 1977 REEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSW 2156
            REEGEVPESELDPDTRRRLLILQHG DTR++  SE PF               + SRGSW
Sbjct: 539  REEGEVPESELDPDTRRRLLILQHGMDTRENAPSEAPF---PARTQMQVSVPRVPSRGSW 595

Query: 2157 FPLEEDMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNEN 2336
            FP+EE+M PR+ NRA    VPKE PL SE M  +K R  HPSFF   E+S  SDR  +EN
Sbjct: 596  FPVEEEMSPRQLNRA----VPKEFPLNSEAMQIEKHRPPHPSFFPKIENSITSDRP-HEN 650

Query: 2337 QRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAG 2516
            QR PKE  + DDRLR+NH+ S+Y+SF GEE PL +S SS RD  FESGR     TETP+G
Sbjct: 651  QRMPKEALRRDDRLRLNHTLSDYQSFSGEEIPLSRSSSSSRDVDFESGR-DVSSTETPSG 709

Query: 2517 VLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRN 2696
            VLQ+I +KCGTKVEFR AL+AS ELQFSIE WF GEKIG G G+TR+EAQRQAA+ S+++
Sbjct: 710  VLQDIAMKCGTKVEFRPALVASTELQFSIEAWFAGEKIGEGIGRTRREAQRQAAEGSIKH 769

Query: 2697 LANKYLSNVMHDPSSVRGDLNKPSHVNE-MGLGDSNSFGYQTLPKEDXXXXXXXXEPTRF 2873
            LAN Y+  V  D  S  GD ++ S+ NE   +G+ NSFG Q L K++        EP++ 
Sbjct: 770  LANVYVLRVKSDSGSGHGDGSRFSNANENCFMGEINSFGGQPLAKDE----SLSSEPSKL 825

Query: 2874 LDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGK 3053
            +DPRLE SKK MGSVSALKELCM EGL +VF  QPP S N + K EVYAQVEI GQ+LGK
Sbjct: 826  VDPRLEGSKKLMGSVSALKELCMTEGLGVVFQQQPPSSANSVQKDEVYAQVEIDGQVLGK 885

Query: 3054 GVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINS 3233
            G+G+TW+EAKMQAAE+AL +L+SM G   Q   GSPR L    +KRLKPEFPR+LQR+  
Sbjct: 886  GIGSTWDEAKMQAAEKALGSLRSMFGQFPQKHQGSPRSLQGMPNKRLKPEFPRVLQRMPP 945

Query: 3234 STRYSNNAPPVP 3269
            S RY  NAPPVP
Sbjct: 946  SGRYPKNAPPVP 957


>ref|XP_007214548.1| hypothetical protein PRUPE_ppa000988mg [Prunus persica]
            gi|462410413|gb|EMJ15747.1| hypothetical protein
            PRUPE_ppa000988mg [Prunus persica]
          Length = 940

 Score = 1220 bits (3156), Expect = 0.0
 Identities = 654/971 (67%), Positives = 744/971 (76%), Gaps = 19/971 (1%)
 Frame = +3

Query: 414  MFKSVVYQGNSLLGEVEIYPQ-----NQKICMID--KEIRISHFSQPSERCPPLAVLHTI 572
            M+KSVVY+G  LLGEVEIYP+     N+   ++D  KEIRIS+FSQ SERCPP+AVLHTI
Sbjct: 1    MYKSVVYKGEELLGEVEIYPEENENKNKNKNLVDELKEIRISYFSQSSERCPPVAVLHTI 60

Query: 573  APSGVCFKMESKL-QSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSC 749
            +  GVCFKMESK  QS+D+PLF LHSSC+ E+KTAVMPLG EELHLVAM SR   ++Y C
Sbjct: 61   SSHGVCFKMESKTSQSQDTPLFLLHSSCVMENKTAVMPLGGEELHLVAMRSRNGDKRYPC 120

Query: 750  FWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQ 929
            FWGF V+PGLYNSCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRI+ALQRKI++E+DPQ
Sbjct: 121  FWGFSVAPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKISSEVDPQ 180

Query: 930  RVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQ 1109
            R+SGMLAEIKRYQDDK ILKQY ENDQVVENG+VIK QSE VPA SDN+QPI+RPLIRL 
Sbjct: 181  RISGMLAEIKRYQDDKFILKQYAENDQVVENGRVIKTQSEAVPALSDNHQPIIRPLIRLH 240

Query: 1110 EKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRL 1289
            EKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRL
Sbjct: 241  EKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRL 300

Query: 1290 LDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRV 1469
            LDP+SNLINS +LL R+VCVK+GSRKSL NVF + +CHPKMALVIDDRLKVWD++DQ RV
Sbjct: 301  LDPDSNLINSNKLLDRIVCVKSGSRKSLFNVFQESLCHPKMALVIDDRLKVWDDRDQPRV 360

Query: 1470 HVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLAN 1649
            HVVPAFAPYYAPQAEANN VPVLCVARNVACNVRGGFF+EFD+ LLQ I EVFYED + +
Sbjct: 361  HVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFREFDDSLLQKIPEVFYEDDIKD 420

Query: 1650 FPSAPDVSNFLISEDDVSA--SNKDPLGFEGVTDVE-ERRLKDVIQASSMVD----SLDP 1808
             PS PDVSN+L+SEDD SA   N+DPL F+G+TDVE ERR+K+   A+SMV     S+DP
Sbjct: 421  VPS-PDVSNYLVSEDDSSALNGNRDPLPFDGITDVEVERRMKEATPAASMVSSVFTSIDP 479

Query: 1809 RPAPSLQNAVASSLSTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAR 1979
            R AP LQ  V  S ST+S PT Q  +MS  + QF Q   +VKPL HVG +EPSLQ SPAR
Sbjct: 480  RLAP-LQYTVPPS-STLSLPTTQPSVMSFPSIQFPQAASLVKPLGHVGSAEPSLQSSPAR 537

Query: 1980 EEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWF 2159
            EEGEVPESELDPDTRRRLLILQHGQDTRD   SEPPF                QSR  WF
Sbjct: 538  EEGEVPESELDPDTRRRLLILQHGQDTRDQPPSEPPF---PVRPPMQASVPRAQSRPGWF 594

Query: 2160 PLEEDMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNENQ 2339
            P+EE+M PR+ +R VPK +    PL+ ET+  +K R  H SFF   E+S PSDR+L ENQ
Sbjct: 595  PVEEEMSPRQLSRMVPKDL----PLDPETVQIEKHRPHHSSFFPKVENSIPSDRILQENQ 650

Query: 2340 RFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGV 2519
            R PKE    DDRLR NH+ S Y S  GEE PL +S SS+RD  FESGR +    ETPAGV
Sbjct: 651  RLPKEAFHRDDRLRFNHALSGYHSLSGEEIPLSRSSSSNRDVDFESGR-AISNAETPAGV 709

Query: 2520 LQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNL 2699
            LQEI +KCG K                   WF GEKIG G+GKTR+EA  QAA+ SL+NL
Sbjct: 710  LQEIAMKCGAK------------------AWFAGEKIGEGSGKTRREAHYQAAEGSLKNL 751

Query: 2700 ANKYLSNVMHDPSSVRGDLNKPSHVNEMGL-GDSNSFGYQTLPKEDXXXXXXXXEPTRFL 2876
            AN YLS V  D  SV GD+NK  +VN  G  G+ NSFG Q  PKE+        EP+R L
Sbjct: 752  ANIYLSRVKPDSVSVHGDMNKFPNVNSNGFAGNLNSFGIQPFPKEESLSSSTSSEPSRPL 811

Query: 2877 DPRLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGKG 3056
            DPRLE SKKSM SVS LKELCMMEGL +VF  +PP STN + K EV+ QVEI G++LGKG
Sbjct: 812  DPRLEGSKKSMSSVSTLKELCMMEGLGVVFQPRPPPSTNSVEKDEVHVQVEIDGEVLGKG 871

Query: 3057 VGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSS 3236
            +G TW+EAKMQAAE+AL +L S L    Q R GSPR L   SSKR+K EFP++LQR+ SS
Sbjct: 872  IGLTWDEAKMQAAEKALGSLTSTL--YAQKRQGSPRSLQGMSSKRMKQEFPQVLQRMPSS 929

Query: 3237 TRYSNNAPPVP 3269
             RY  NAPPVP
Sbjct: 930  ARYPKNAPPVP 940


>ref|XP_007025681.1| C-terminal domain phosphatase-like 1 isoform 2 [Theobroma cacao]
            gi|508781047|gb|EOY28303.1| C-terminal domain
            phosphatase-like 1 isoform 2 [Theobroma cacao]
          Length = 984

 Score = 1203 bits (3112), Expect = 0.0
 Identities = 643/951 (67%), Positives = 725/951 (76%), Gaps = 34/951 (3%)
 Frame = +3

Query: 414  MFKSVVYQGNSLLGEVEIYPQNQ--------------KICMID---KEIRISHFSQPSER 542
            M+KSVVY+G  +LGEVEIYPQ Q              KI +++   KEIRI + +Q SER
Sbjct: 4    MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63

Query: 543  CPPLAVLHTIAPSGVCFKMESKLQ-----SKDSP-LFSLHSSCLREDKTAVMPLGEEELH 704
            CPPLAVLHTI  SG+CFKMES        S+DSP L  LHS C+R++KTAVMP+G+ ELH
Sbjct: 64   CPPLAVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCELH 123

Query: 705  LVAMSSRKNLEQYSCFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDR 884
            LVAM SR +     CFWGF VS GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDR
Sbjct: 124  LVAMYSRNS--DRPCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSFEDR 181

Query: 885  IDALQRKINTEMDPQRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAF 1064
            I+ALQRK+ TE+DPQRV+GM+AE+KRYQDDK+ILKQY ENDQVVENGKVIK+QSEVVPA 
Sbjct: 182  IEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVVPAL 241

Query: 1065 SDNYQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYV 1244
            SDN+QPI+RPLIRLQEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYV
Sbjct: 242  SDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 301

Query: 1245 CTMAERDYALEMWRLLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVI 1424
            CTMAERDYALEMWRLLDPESNLINSKELL R+VCVK+GSRKSL NVF DGICHPKMALVI
Sbjct: 302  CTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVI 361

Query: 1425 DDRLKVWDEKDQSRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGL 1604
            DDRLKVWDEKDQ RVHVVPAFAPYYAPQAEANN +PVLCVARNVACNVRGGFF+EFDEGL
Sbjct: 362  DDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGL 421

Query: 1605 LQLIREVFYEDGLANFPSAPDVSNFLISEDDVSA--SNKDPLGFEGVTDVE-ERRLKDVI 1775
            LQ I E+ YED + + PS PDV N+L+SEDD SA   NKDPL F+G+ D E ERRLK+ I
Sbjct: 422  LQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERRLKEAI 481

Query: 1776 QASSMVDS----LDPRPAPSLQNAVASSLSTISQPTPQGPIMSSQNNQF---TQVVKPLV 1934
             A+S V S    LDPR  PSLQ  + SS S+I     Q  I+S  N QF     VVKP+ 
Sbjct: 482  SATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPVVKPVA 541

Query: 1935 HVGPSEPSLQGSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXX 2114
             V   EPSLQ SPAREEGEVPESELDPDTRRRLLILQHGQDTRDHT  EP F        
Sbjct: 542  PVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPVRPTMQ 601

Query: 2115 XXXXXXXMQSRGSWFPLEEDMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHG 2294
                    QSRGSWF  EE+M PR+ NRA     PKE PL+SE MH +K   RHP FF  
Sbjct: 602  VSVPRG--QSRGSWFAAEEEMSPRQLNRA----APKEFPLDSERMHIEKH--RHPPFFPK 653

Query: 2295 AESSFPSDRMLNENQRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFE 2474
             ESS PSDR+L ENQR  KE    DDRL +NH+ S+Y SF GEE PL QS SS RD  FE
Sbjct: 654  VESSIPSDRLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFE 713

Query: 2475 SGRGSPPYTETPAGVLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTR 2654
            SGR +    ET AGVLQ+I +KCG KVEFR AL+AS +LQFSIE WF GEK+G G G+TR
Sbjct: 714  SGR-TVTSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFAGEKVGEGVGRTR 772

Query: 2655 KEAQRQAADSSLRNLANKYLSNVMHDPSSVRGDLNKPSHVNEMGL-GDSNSFGYQTLPKE 2831
            +EAQRQAA+ S++NLAN YLS +  D  S  GDL++  ++N+ G   + NSFG Q L KE
Sbjct: 773  REAQRQAAEESIKNLANTYLSRIKPDSGSAEGDLSRLHNINDNGFPSNVNSFGNQLLAKE 832

Query: 2832 DXXXXXXXXEPTRFLDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVE 3011
            +        E +R  DPRLE SKKSMGSV+ALKELCMMEGL +VF  QPP S+N + K E
Sbjct: 833  ESLSFSTASEQSRLADPRLEGSKKSMGSVTALKELCMMEGLGVVFQPQPPSSSNALQKDE 892

Query: 3012 VYAQVEIGGQILGKGVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPR 3164
            VYAQVEI GQ+LGKG G TW EAKMQAAE+AL +L+SMLG   Q R GSPR
Sbjct: 893  VYAQVEIDGQVLGKGTGLTWEEAKMQAAEKALGSLRSMLGQYSQKRQGSPR 943


>ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Fragaria vesca subsp. vesca]
          Length = 955

 Score = 1201 bits (3106), Expect = 0.0
 Identities = 645/966 (66%), Positives = 740/966 (76%), Gaps = 18/966 (1%)
 Frame = +3

Query: 426  VVYQGNSLLGEVEIYPQ---NQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGVCFK 596
            +VY+G  LLGEVE+YP+   N+KI    KEIRISHFSQ SERCPP+AVLHTI+ +GVCFK
Sbjct: 4    LVYKGEELLGEVEVYPEELNNKKIWDELKEIRISHFSQSSERCPPVAVLHTISSNGVCFK 63

Query: 597  MESKLQSKDSP----LFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFI 764
            MESK  S  S     LF LHSSC+ E+KTAVM LG EELHLVAM SR N +Q+ CFWGF 
Sbjct: 64   MESKSSSSSSQDTSRLFLLHSSCIMENKTAVMNLGVEELHLVAMYSRNNQKQHPCFWGFS 123

Query: 765  VSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGM 944
            VS GLY+SCL MLNLRCLGIVFDLDETLIVANTMRSFEDRI+ LQRKI  E+D QR+SGM
Sbjct: 124  VSSGLYSSCLGMLNLRCLGIVFDLDETLIVANTMRSFEDRIEGLQRKIQCEVDAQRISGM 183

Query: 945  LAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNII 1124
             AEIKRYQDDK ILKQY ENDQVVENG+VIK QSEVVPA SD++QPI+RPLIRLQEKNII
Sbjct: 184  QAEIKRYQDDKFILKQYAENDQVVENGRVIKTQSEVVPALSDSHQPIIRPLIRLQEKNII 243

Query: 1125 LTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPES 1304
            LTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDPES
Sbjct: 244  LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPES 303

Query: 1305 NLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPA 1484
            NLIN+ +LL R+VCVK+G +KSL NVF + +CHPKMALVIDDRLKVWD++DQ RVHVVPA
Sbjct: 304  NLINANKLLDRIVCVKSGLKKSLFNVFQESLCHPKMALVIDDRLKVWDDRDQPRVHVVPA 363

Query: 1485 FAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAP 1664
            FAPYYAPQAEANN VPVLCVARNVAC+VRGGFF+EFD+ LLQ I E+FYED + +F S+P
Sbjct: 364  FAPYYAPQAEANNAVPVLCVARNVACSVRGGFFREFDDSLLQKIPEIFYEDNIKDF-SSP 422

Query: 1665 DVSNFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQA----SSMVDSLDPRPAPS 1823
            DVSNFL+SEDD SAS  N+D L F+G+ D E ERRLK+   A    SS V + DPR A S
Sbjct: 423  DVSNFLVSEDDASASNGNRDQLPFDGMADAEVERRLKEATSAAPTVSSAVSNNDPRLA-S 481

Query: 1824 LQNAVASSLSTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEGEV 1994
            LQ  V  S ST+S PT Q  +M   N QF Q   +VKPL HVGP++  L  SPAREEGEV
Sbjct: 482  LQYTVPLS-STVSLPTNQPSMMPFHNVQFPQSASLVKPLGHVGPADLGLHSSPAREEGEV 540

Query: 1995 PESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEED 2174
            PESELDPDTRRRLLILQHGQDTR+   SEP F               +QSRG WFP+EE+
Sbjct: 541  PESELDPDTRRRLLILQHGQDTRESVPSEPSF---PVRPQVQVSVPRVQSRGGWFPVEEE 597

Query: 2175 MIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNENQRFPKE 2354
            M PRK +R     VPKE PL SE M  +K R+ H +FF   E+S PSDR+L ENQR PKE
Sbjct: 598  MSPRKLSR----MVPKEPPLNSEPMQIEKHRSHHSAFFPKVENSMPSDRILQENQRLPKE 653

Query: 2355 VHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIG 2534
                D+RLR N + S Y SF GEE PL +S SS+RDF +ESGR +    ETPAGVLQEI 
Sbjct: 654  AFHRDNRLRFNQAMSGYHSFSGEEPPLNRSSSSNRDFDYESGR-AISNAETPAGVLQEIA 712

Query: 2535 IKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYL 2714
            +KCGTKVEFR AL+ S ELQF +E WF GEKIG GTG+TR+EA  QAA+ SL+NLAN Y+
Sbjct: 713  MKCGTKVEFRPALVPSTELQFYVEAWFAGEKIGEGTGRTRREAHFQAAEGSLKNLANIYI 772

Query: 2715 SNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPRLE 2891
            S    D   + GD +K S+V   G +G+ NSFG Q LPKED        EP+R LDPRL+
Sbjct: 773  SRGKPDALPIHGDASKFSNVTNNGFMGNMNSFGTQPLPKEDSLSSSTSSEPSRPLDPRLD 832

Query: 2892 ESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGKGVGATW 3071
             S+KS+ SVSALKELC MEGLS+++  +PP   N   K EV+ Q EI G++LGKG+G TW
Sbjct: 833  NSRKSVSSVSALKELCTMEGLSVLYQPRPP-PPNSTEKDEVHVQAEIDGEVLGKGIGLTW 891

Query: 3072 NEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTRYSN 3251
            +EAKMQAAE+AL NL+S L   GQ R GSPR L    SKRLK EFP++LQR+ SSTRYS 
Sbjct: 892  DEAKMQAAEKALGNLRSTL--YGQKRQGSPRPLQGMPSKRLKQEFPQVLQRMPSSTRYSK 949

Query: 3252 NAPPVP 3269
            NAPPVP
Sbjct: 950  NAPPVP 955


>ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Populus trichocarpa]
            gi|550327613|gb|ERP55122.1| hypothetical protein
            POPTR_0011s04910g [Populus trichocarpa]
          Length = 990

 Score = 1197 bits (3096), Expect = 0.0
 Identities = 647/1000 (64%), Positives = 743/1000 (74%), Gaps = 48/1000 (4%)
 Frame = +3

Query: 414  MFKSVVYQGNSLLGEVEIYPQNQK-------------ICMIDKEIRISHFSQPSERCPPL 554
            M+KSVVY+G  LLGEVEIY Q Q+             I  I K IRISHFSQ SERCPPL
Sbjct: 1    MYKSVVYKGEELLGEVEIYAQEQQQEEEENKNKRKRVIDEIVKGIRISHFSQASERCPPL 60

Query: 555  AVLHTIAPSGVCFKMESKLQS--------KDSPLFSLHSSCLREDKTAVMPLGEEELHLV 710
            AVLHTI   GVCFKME    S        ++SPL  LHSSC++E+KTAVM LG EELHLV
Sbjct: 61   AVLHTITSIGVCFKMEESTASSSTKISSQQESPLRLLHSSCIQENKTAVMLLGGEELHLV 120

Query: 711  AMSSRKNLEQYSCFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRID 890
            AM SR N  ++ CFWGF V+ GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFED+I+
Sbjct: 121  AMPSRSNERKHPCFWGFNVASGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDKIE 180

Query: 891  ALQRKINTEMDPQRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSD 1070
            ALQ+KI+TE+D QR+  +++EIKRYQDDK ILKQYVENDQV+ENGKVIK Q EVVPA SD
Sbjct: 181  ALQKKISTEVDQQRILAIISEIKRYQDDKIILKQYVENDQVIENGKVIKTQFEVVPAASD 240

Query: 1071 NYQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCT 1250
            N+QP+VRPLIRL EKNII TRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCT
Sbjct: 241  NHQPLVRPLIRLPEKNIIFTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT 300

Query: 1251 MAERDYALEMWRLLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDD 1430
            MAERDYALEMWRLLDPESNLINS ELL R+VCV +GSRKSL NVF DGICHPKMALVIDD
Sbjct: 301  MAERDYALEMWRLLDPESNLINSNELLDRIVCVSSGSRKSLFNVFQDGICHPKMALVIDD 360

Query: 1431 RLKVWDEKDQSRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQ 1610
            R+ VWDEKDQSRVHVVPAFAPYYAPQAEANN VP+LCVARNVACNVRGGFFKEFDEGLLQ
Sbjct: 361  RMNVWDEKDQSRVHVVPAFAPYYAPQAEANNAVPILCVARNVACNVRGGFFKEFDEGLLQ 420

Query: 1611 LIREVFYEDGLANFPSAPDVSNFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQA 1781
             I EV YED  +N PS PDVSN+L+SEDD SA+  N+DP  F+   D E ERRLK+ + A
Sbjct: 421  KIPEVAYEDDTSNIPSPPDVSNYLVSEDDASAANGNRDPPSFDSTADAEVERRLKEAVSA 480

Query: 1782 S--------SMVDSLDPRPAPSLQNAVASSLSTI--SQPT---PQGPIMSSQ-------N 1901
            S        S V SLDPR   SLQ AVASS S +  SQP+    Q P+ +SQ       N
Sbjct: 481  SSTIPSTIPSTVSSLDPRLLQSLQYAVASSSSLMPASQPSMLASQQPVPASQTSMMPFPN 540

Query: 1902 NQFTQV---VKPLVHVGPSEPSLQGSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHT 2072
             QF QV   VK L  V   EPSLQ SPAREEGEVPESELDPDTRRRLLILQHGQD+RD+ 
Sbjct: 541  TQFPQVAPLVKQLGQVVHPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDSRDNA 600

Query: 2073 SSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEEDMIPRKPNRAVPKSVPKESPLESETMH 2252
             SE PF               +QSRGSW P+EE+M PR+ NR      P+E PL+S+ M+
Sbjct: 601  PSESPF---PARPSAPVSAAHVQSRGSWVPVEEEMTPRQLNR-----TPREFPLDSDPMN 652

Query: 2253 FDKRRARHPSFFHGAESSFPSDRMLNENQRFPKEVHQGDDRLRVNHSTSNYRSFPGEETP 2432
             +K +  HPSFF   ES+ PSDRM++ENQR PKE    +DR+R+NHST NY SF  EETP
Sbjct: 653  IEKHQTHHPSFFPKVESNIPSDRMIHENQRLPKEAPYRNDRMRLNHSTPNYHSFQVEETP 712

Query: 2433 LGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIGIKCGTKVEFRSALIASAELQFSIEVW 2612
            L +S SS+RD   ES R     +ETP  VLQEI +KC TKVEFR AL+AS +LQFSIE W
Sbjct: 713  LSRS-SSNRDLDLESERAF-TISETPVEVLQEIAMKCETKVEFRPALVASIDLQFSIEAW 770

Query: 2613 FTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYLSNVMHDPSSVRGDLNKPSHVNEMG-L 2789
            F GEK+G GTGKTR+EAQRQAA+ S++ LA  Y+     D   + GD ++    N+ G L
Sbjct: 771  FAGEKVGEGTGKTRREAQRQAAEGSIKKLAGIYMLRAKPDSGPMHGDSSRYPSANDNGFL 830

Query: 2790 GDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPRLEESKKSMGSVSALKELCMMEGLSLVFL 2969
            G+ N FG Q LPK++        EP+R LDPRLE SKKS GSV+ALKE C MEGL + FL
Sbjct: 831  GNMNLFGNQPLPKDELVAYSAASEPSRLLDPRLEGSKKSSGSVTALKEFCTMEGLVVNFL 890

Query: 2970 AQPPLSTNLMNKVEVYAQVEIGGQILGKGVGATWNEAKMQAAEEALWNLKSMLGDSGQNR 3149
            AQ PLS N +   EV+AQVEI GQ+LGKG+G+TW+EAKMQAAE+AL +L++M G   Q R
Sbjct: 891  AQTPLSANSIPGEEVHAQVEIDGQVLGKGIGSTWDEAKMQAAEKALGSLRTMFGQYTQKR 950

Query: 3150 VGSPRLLHSFSSKRLKPEFPRMLQRINSSTRYSNNAPPVP 3269
             GSPR +    +KRLK EFPR+LQR+  S RY  NAPPVP
Sbjct: 951  QGSPRPMQGMPNKRLKQEFPRVLQRMPPSARYHKNAPPVP 990


>ref|XP_002519032.1| double-stranded RNA binding protein, putative [Ricinus communis]
            gi|223541695|gb|EEF43243.1| double-stranded RNA binding
            protein, putative [Ricinus communis]
          Length = 978

 Score = 1189 bits (3075), Expect = 0.0
 Identities = 634/985 (64%), Positives = 741/985 (75%), Gaps = 33/985 (3%)
 Frame = +3

Query: 414  MFKSVVYQGNSLLGEVEIYPQNQK--------------------ICMIDKEIRISHFSQP 533
            M+KSVVY+G+ LLGEVEIY Q ++                    I  I K IRISHFSQ 
Sbjct: 1    MYKSVVYKGDELLGEVEIYAQQEQKLQQQEELQEQEQELKKKRVIDEILKGIRISHFSQA 60

Query: 534  SERCPPLAVLHTIAPSGVCFKMESKLQ-SKDSPLFSLHSSCLREDKTAVMPL-GEEELHL 707
            SERCPPLAVLHTI  +G+CFKMESK   S D+PL  LHSSC++E KTAV+ L G EELHL
Sbjct: 61   SERCPPLAVLHTITTNGICFKMESKNSVSLDTPLHLLHSSCIQESKTAVVLLQGGEELHL 120

Query: 708  VAMSSRKNLEQYSCFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRI 887
            VAM SR +  QY CFW F +S GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRI
Sbjct: 121  VAMFSRNDERQYPCFWAFNISSGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRI 180

Query: 888  DALQRKINTEMDPQRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFS 1067
            +ALQRKI+TE+DPQR+SGML+E+KRYQDDK+ILKQYV+NDQVVENG+VIK Q EVVPA S
Sbjct: 181  EALQRKISTELDPQRISGMLSEVKRYQDDKTILKQYVDNDQVVENGRVIKTQFEVVPALS 240

Query: 1068 DNYQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVC 1247
            DN+Q IVRPLIRLQE+NIILTRINP IRDTSVLVRLRPAWE+LRSYLTA+GRKRFEVYVC
Sbjct: 241  DNHQTIVRPLIRLQERNIILTRINPQIRDTSVLVRLRPAWEELRSYLTARGRKRFEVYVC 300

Query: 1248 TMAERDYALEMWRLLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVID 1427
            TMAERDYALEMWRLLDPESNLINSKELL R+VCVK+G RKSL NVF DGICHPKMALVID
Sbjct: 301  TMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLRKSLFNVFQDGICHPKMALVID 360

Query: 1428 DRLKVWDEKDQSRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLL 1607
            DRLKVWDEKDQ RVHVVPAFAPYYAPQAEANN VPVLCVARNVACNVRGGFFKEFDEGLL
Sbjct: 361  DRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDEGLL 420

Query: 1608 QLIREVFYEDGLANFPSAPDVSNFLISEDD--VSASNKDPLGFEGVTDVE-ERRLKDVIQ 1778
            Q I E+ +ED + + PS PDVSN+L+ EDD   S  N+DPL F+G+ D E E+RLK+ I 
Sbjct: 421  QRIPEISFEDDMNDIPSPPDVSNYLVPEDDAFTSNGNRDPLSFDGMADAEVEKRLKEAIS 480

Query: 1779 AS----SMVDSLDPRPAPSLQNAVASSLSTISQPTPQGPIMSSQNNQFTQ---VVKPLVH 1937
             S    S V +LD R  P LQ  +ASS S+I  PT Q  +++  + Q  Q   +VKPL  
Sbjct: 481  ISSAFPSTVANLDARLVPPLQYTMASS-SSIPVPTSQPAVVTFPSMQLPQAAPLVKPLGQ 539

Query: 1938 VGPSEPSLQGSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXX 2117
            V PSEPSLQ SPAREEGEVPESELDPDTRRRLLILQHGQD RD   SE PF         
Sbjct: 540  VVPSEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDLRDPAPSESPF-PVRPSNSM 598

Query: 2118 XXXXXXMQSRGSWFPLEEDMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGA 2297
                  +QSRG+W P+EE+M PR+ NRA    V +E P+++E MH DK R  HPSFF   
Sbjct: 599  QVSVPRVQSRGNWVPVEEEMSPRQLNRA----VTREFPMDTEPMHIDKHRPHHPSFFPKV 654

Query: 2298 ESSFPSDRMLNENQRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFES 2477
            ESS PS+RM +ENQR PK     DDRLR+N + SNY+S  GEE  L +S SS+RD   ES
Sbjct: 655  ESSIPSERMPHENQRLPKVAPYKDDRLRLNQTMSNYQSLSGEENSLSRSSSSNRDLDVES 714

Query: 2478 GRGSPPYTETPAGVLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRK 2657
             R +    ETP  VL EI +KCG KVEF+ +L+ S +LQFS+E WF GE++G G G+TR+
Sbjct: 715  DR-AVSSAETPVRVLHEISMKCGAKVEFKHSLVNSRDLQFSVEAWFAGERVGEGFGRTRR 773

Query: 2658 EAQRQAADSSLRNLANKYLSNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKED 2834
            EAQ  AA++S++NLAN Y+S    D  ++ GD +K S  N+ G LG  NSFG Q LPK++
Sbjct: 774  EAQSVAAEASIKNLANIYISRAKPDNGALHGDASKYSSANDNGFLGHVNSFGSQPLPKDE 833

Query: 2835 XXXXXXXXEPTRFLDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEV 3014
                    E +  LDPRLE SKKSM SV+ALKE CMMEGL + FLAQ PLS+N +   EV
Sbjct: 834  ILSYSDSSEQSGLLDPRLESSKKSMSSVNALKEFCMMEGLGVNFLAQTPLSSNSVQNAEV 893

Query: 3015 YAQVEIGGQILGKGVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRL 3194
            +AQVEI GQ++GKG+G+T++EAKMQAAE+AL +L++  G     R GSPR +    +K L
Sbjct: 894  HAQVEIDGQVMGKGIGSTFDEAKMQAAEKALGSLRTTFGRFPPKRQGSPRPVPGMPNKHL 953

Query: 3195 KPEFPRMLQRINSSTRYSNNAPPVP 3269
            KPEFPR+LQR+ SS RY  NAPPVP
Sbjct: 954  KPEFPRVLQRMPSSARYPKNAPPVP 978


>ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
          Length = 956

 Score = 1184 bits (3062), Expect = 0.0
 Identities = 626/969 (64%), Positives = 752/969 (77%), Gaps = 16/969 (1%)
 Frame = +3

Query: 411  KMFKSVVYQGNSLLGEVEIYPQ---NQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPS 581
            +M+KSVVYQG  ++GEV++YP+   N K   + KEIRISHFSQPSERCPPLAVLHT+   
Sbjct: 2    RMYKSVVYQGEVVVGEVDVYPEENNNYKNFHV-KEIRISHFSQPSERCPPLAVLHTVTSC 60

Query: 582  GVCFKMESKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGF 761
            GVCFKMESK Q +D  LF LHS C+RE+KTAVMPLG EE+HLVAM SR N+++  CFWGF
Sbjct: 61   GVCFKMESKTQQQDG-LFQLHSLCIRENKTAVMPLGGEEIHLVAMHSR-NVDR-PCFWGF 117

Query: 762  IVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSG 941
            IV+ GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN+E+DPQR+SG
Sbjct: 118  IVALGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISG 177

Query: 942  MLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNI 1121
            M AE+KRYQDDK+ILKQY ENDQVV+NG+VIKVQSE+VPA SD++QPIVRPLIRLQ+KNI
Sbjct: 178  MQAEVKRYQDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRLQDKNI 237

Query: 1122 ILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPE 1301
            ILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDP+
Sbjct: 238  ILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPD 297

Query: 1302 SNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVP 1481
            SNLINSKELL R+VCVK+G +KSL NVF DG+CHPKMALVIDDRLKVWDEKDQ RVHVVP
Sbjct: 298  SNLINSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVP 357

Query: 1482 AFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSA 1661
            AFAPYYAPQAEA+N +PVLCVARNVACNVRGGFFK+FD+GLLQ I ++ YED + + PS 
Sbjct: 358  AFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKDIPSP 417

Query: 1662 PDVSNFLISEDDVSASN--KDPLGFEGVTDVE-ERRLKDVIQASSMVD----SLDPRPAP 1820
            PDVSN+L+SEDD S SN  +DP  F+G+ D E ER+LKD + A+S +     +LDPR   
Sbjct: 418  PDVSNYLVSEDDGSISNGHRDPFLFDGMADAEVERKLKDALSAASTIPVTTANLDPR-LT 476

Query: 1821 SLQNAVASSLSTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEGE 1991
            SLQ  +  S  ++  PT Q  +M   + QF Q   +VKP+    PSEPSL  SPAREEGE
Sbjct: 477  SLQYTMVPS-GSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSEPSLHSSPAREEGE 535

Query: 1992 VPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEE 2171
            VPESELDPDTRRRLLILQHGQDTRDH S+EPPF                 SRG WFP EE
Sbjct: 536  VPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQTSAPHVP--SSRGVWFPAEE 593

Query: 2172 DMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-NQRFP 2348
            ++  +  NR     VPKE P++S  +   K R  HPSFF   ESS  SDR+L++ +QR P
Sbjct: 594  EIGSQPLNRV----VPKEFPVDSGPLGIAKPRPHHPSFFSKVESSISSDRILHDSHQRLP 649

Query: 2349 KEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQE 2528
            KE++  DDR R+NH  S+YRSF G++ P  +S SS RD   ESG  S  + +TP  VLQE
Sbjct: 650  KEMYHRDDRPRLNHMLSSYRSFSGDDIPFSRSFSSHRDLDSESGH-SVLHADTPVAVLQE 708

Query: 2529 IGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANK 2708
            I +KCGTKV+F S+L+AS ELQFS+E WF+G+KIG   G+TRKEAQ +AA+ S+++LA+ 
Sbjct: 709  IALKCGTKVDFISSLVASTELQFSMEAWFSGKKIGHRVGRTRKEAQNKAAEDSIKHLADI 768

Query: 2709 YLSNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPR 2885
            YLS+   +P S  GD++   +VN+ G +G ++S G Q L KED         P+R LDPR
Sbjct: 769  YLSSAKDEPGSTYGDVSGFPNVNDSGYMGIASSLGNQPLSKED-SASFSTASPSRVLDPR 827

Query: 2886 LEESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILGKGVG 3062
            L+ SK+SMGS+S+LKELCMMEGL + FL+ P P+STN + K EV+AQVEI G++ GKG+G
Sbjct: 828  LDVSKRSMGSISSLKELCMMEGLDVNFLSAPAPVSTNSVQKDEVHAQVEIDGKVFGKGIG 887

Query: 3063 ATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTR 3242
             TW+EAKMQAAE+AL +L+S LG S Q R  SPR    FS+KRLK E+PR +QR+ SS R
Sbjct: 888  LTWDEAKMQAAEKALGSLRSKLGQSIQKRQSSPRPHQGFSNKRLKQEYPRPMQRMPSSAR 947

Query: 3243 YSNNAPPVP 3269
            Y  NAPP+P
Sbjct: 948  YPRNAPPIP 956


>emb|CAN72816.1| hypothetical protein VITISV_004100 [Vitis vinifera]
          Length = 894

 Score = 1181 bits (3056), Expect = 0.0
 Identities = 638/960 (66%), Positives = 725/960 (75%), Gaps = 8/960 (0%)
 Frame = +3

Query: 414  MFKSVVYQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGVCF 593
            M+KS+VY+G+ ++GEVEIYPQNQ + ++ KEIRISH+SQPSERCPPLAVLHTI   GVCF
Sbjct: 1    MYKSIVYEGDDVVGEVEIYPQNQGLELM-KEIRISHYSQPSERCPPLAVLHTITSCGVCF 59

Query: 594  KMES-KLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFIVS 770
            KMES K QS+D+PL+ LHS+C+RE+KTAVM LGEEELHLVAM S+K   QY CFWGF V+
Sbjct: 60   KMESSKAQSQDTPLYLLHSTCIRENKTAVMSLGEEELHLVAMYSKKKDGQYPCFWGFNVA 119

Query: 771  PGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGMLA 950
             GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTE+DPQR+SGM+A
Sbjct: 120  LGLYSSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEVDPQRISGMVA 179

Query: 951  EIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNIILT 1130
            E                   VVENGK+ K Q E+VPA SDN+QPIVRPLIRLQEKNIILT
Sbjct: 180  E-------------------VVENGKLFKTQPEIVPALSDNHQPIVRPLIRLQEKNIILT 220

Query: 1131 RINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPESNL 1310
            RINPLIRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDPESNL
Sbjct: 221  RINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESNL 280

Query: 1311 INSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPAFA 1490
            INSKELL R+VCVK+GSRKSL NVF DGICHPKMALVIDDRLKVWDEKDQ RVHVVPAFA
Sbjct: 281  INSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVIDDRLKVWDEKDQPRVHVVPAFA 340

Query: 1491 PYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAPDV 1670
            PYYAPQAEANN + VLCVARNVACNVRGGFFKEFDEGLLQ I E+ YED + +  SAPDV
Sbjct: 341  PYYAPQAEANNAISVLCVARNVACNVRGGFFKEFDEGLLQRIPEISYEDBIKDIRSAPDV 400

Query: 1671 SNFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQASSMVDSLDPRPAPSLQNAVA 1841
            SN+L+SEDD S S  N+D   F+G+ DVE ER+LKD I A S V SLDPR +P LQ AVA
Sbjct: 401  SNYLVSEDDASVSNGNRDQPCFDGMADVEVERKLKDAISAPSTVTSLDPRLSPPLQFAVA 460

Query: 1842 SSLSTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEGEVPESELD 2012
            +S     QP  QG IM   N QF Q   ++KPL      EP++Q SPAREEGEVPESELD
Sbjct: 461  ASSGLAPQPAAQGSIMPFSNKQFPQSASLIKPLA----PEPTMQSSPAREEGEVPESELD 516

Query: 2013 PDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEEDMIPRKP 2192
            PDTRRRLLILQHGQDTR+H SS+PPF               +QSRGSWFP +E+M PR+ 
Sbjct: 517  PDTRRRLLILQHGQDTREHASSDPPF---PVRPPIQVSVPRVQSRGSWFPADEEMSPRQL 573

Query: 2193 NRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNENQRFPKEVHQGDD 2372
            NRA    VPKE PL+S+TMH +K R  HPSFFH  ESS  SDR+L+ENQR  KEV   DD
Sbjct: 574  NRA----VPKEFPLDSDTMHIEKHRPHHPSFFHKVESSASSDRILHENQRLSKEVLHRDD 629

Query: 2373 RLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIGIKCGTK 2552
            RLR+NHS   Y SF GEE PLG+S SS+RD  FESGRG+ PY ETPA     +G+     
Sbjct: 630  RLRLNHSLPGYHSFSGEEVPLGRS-SSNRDLDFESGRGA-PYAETPA-----VGL----- 677

Query: 2553 VEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYLSNVMHD 2732
                        L+   EVW  GEKIG GTGKTR+EAQ QAA++SL  L+ +YL      
Sbjct: 678  ------------LRNCNEVWNQGEKIGEGTGKTRREAQCQAAEASLMYLSYRYL------ 719

Query: 2733 PSSVRGDLNK-PSHVNEMGLGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPRLEESKKSM 2909
                 GD+N+ P+  +   + D+NSFGYQ+ PKE         E +R LDPRLE SKKSM
Sbjct: 720  ----HGDVNRFPNASDNNFMSDTNSFGYQSFPKEGSMSFSTASESSRLLDPRLESSKKSM 775

Query: 2910 GSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGKGVGATWNEAKMQ 3089
            GS+SALKELCMMEGL + FL+QPPLS+N   K E+ AQVEI GQ+LGKG G+TW++AKMQ
Sbjct: 776  GSISALKELCMMEGLGVEFLSQPPLSSNSTQKEEICAQVEIDGQVLGKGTGSTWDDAKMQ 835

Query: 3090 AAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTRYSNNAPPVP 3269
            AAE+AL +LKSMLG   Q R GSPR L     KRLK EF R LQR  SS RYS N  PVP
Sbjct: 836  AAEKALGSLKSMLGQFSQKRQGSPRSLQGM-GKRLKSEFTRGLQRTPSSGRYSKNTSPVP 894


>ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Glycine max]
          Length = 960

 Score = 1172 bits (3032), Expect = 0.0
 Identities = 620/973 (63%), Positives = 742/973 (76%), Gaps = 21/973 (2%)
 Frame = +3

Query: 414  MFKSVVYQGNSLLGEVEIYPQ--------NQKICMIDKEIRISHFSQPSERCPPLAVLHT 569
            M+KSVVYQG  ++GEV++YP+        N       KEIRISHFSQPSERCPPLAVLHT
Sbjct: 1    MYKSVVYQGEVVVGEVDVYPEENNNNNNKNYNKNFHVKEIRISHFSQPSERCPPLAVLHT 60

Query: 570  IAPSGVCFKMESKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSC 749
            +   GVCFKMESK Q +D  LF LHS C+RE+KTAVMPLG EE+HLVAM SR   +   C
Sbjct: 61   VTSCGVCFKMESKTQQQDG-LFQLHSLCIRENKTAVMPLGGEEIHLVAMHSRN--DDRPC 117

Query: 750  FWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQ 929
            FWGFIV+ GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN+E+DPQ
Sbjct: 118  FWGFIVTLGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQ 177

Query: 930  RVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQ 1109
            R+SGM AE+KRY DDK+ILKQY ENDQVV+NG+VIKVQSE+VPA SD++QPIVRPLIRLQ
Sbjct: 178  RISGMQAEVKRYLDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRLQ 237

Query: 1110 EKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRL 1289
            +KNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRL
Sbjct: 238  DKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRL 297

Query: 1290 LDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRV 1469
            LDP+SNLINSKELL R+VCVK+G +KSL NVF DG C PKMALVIDDRLKVWDE+DQ RV
Sbjct: 298  LDPDSNLINSKELLGRIVCVKSGLKKSLFNVFQDGSCDPKMALVIDDRLKVWDERDQPRV 357

Query: 1470 HVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLAN 1649
            HVVPAFAPYYAPQAEA+N +PVLCVARNVACNVRGGFFK+FD+GLLQ I ++ YED + +
Sbjct: 358  HVVPAFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKD 417

Query: 1650 FPSAPDVSNFLISEDD--VSASNKDPLGFEGVTDVE-ERRLKDVIQASS----MVDSLDP 1808
             PS PDVSN+L+SEDD  +S  N+DP  F+G+ D E ER+LKD + A+S       +LDP
Sbjct: 418  VPSPPDVSNYLVSEDDGSISNGNRDPFLFDGMADAEVERKLKDALAAASTFPVTTANLDP 477

Query: 1809 RPAPSLQNAVASSLSTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAR 1979
            R   SLQ  +  S  ++  PT Q  +M   + QF Q   +VKP+    PS+PSL  SPAR
Sbjct: 478  R-LTSLQYTMVPS-GSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSDPSLHSSPAR 535

Query: 1980 EEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWF 2159
            EEGEVPESELDPDTRRRLLILQHGQDTRDH S+EPPF                 SRG WF
Sbjct: 536  EEGEVPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQASAPRVP--SSRGVWF 593

Query: 2160 PLEEDMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-N 2336
            P+EE++  +  NR     VPKE P++S  +  +K R  HPSFF+  ESS  SDR+L++ +
Sbjct: 594  PVEEEIGSQPLNRV----VPKEFPVDSGPLGIEKPRLHHPSFFNKVESSISSDRILHDSH 649

Query: 2337 QRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAG 2516
            QR PKE++  DDR R+NH  S+YRSF G++ P  +S SS RD   ESG  S  + +TP  
Sbjct: 650  QRLPKEMYHRDDRPRLNHMLSSYRSFSGDDIPFSRSSSSHRDLDSESGH-SVLHADTPVA 708

Query: 2517 VLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRN 2696
            VL EI +KCGTKV+F S+L+AS EL+FS+E WF+G+KIG G G+TRKEAQ +AA  S+ +
Sbjct: 709  VLHEIALKCGTKVDFMSSLVASTELKFSLEAWFSGKKIGHGFGRTRKEAQNKAAKDSIEH 768

Query: 2697 LANKYLSNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXEPTRF 2873
            LA+ YLS+   +P S  GD++   +VN+ G +G ++S G Q L KED         P+R 
Sbjct: 769  LADIYLSSAKDEPGSTYGDVSGFPNVNDNGYMGIASSLGNQPLSKED-SASFSSASPSRA 827

Query: 2874 LDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILG 3050
            LDPRL+ SK+SMGS+SALKELCMMEGL + FL+ P P+STN + K EV+AQVEI G+I G
Sbjct: 828  LDPRLDVSKRSMGSISALKELCMMEGLGVNFLSTPAPVSTNSVQKDEVHAQVEIDGKIFG 887

Query: 3051 KGVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRIN 3230
            KG+G TW+EAKMQAAE+AL NL+S LG S Q    SPR    FS+KRLK E+PR +QR+ 
Sbjct: 888  KGIGLTWDEAKMQAAEKALGNLRSKLGQSIQKMQSSPRPHQGFSNKRLKQEYPRTMQRMP 947

Query: 3231 SSTRYSNNAPPVP 3269
            SS RY  NAPP+P
Sbjct: 948  SSARYPRNAPPIP 960


>ref|XP_007159305.1| hypothetical protein PHAVU_002G226900g [Phaseolus vulgaris]
            gi|561032720|gb|ESW31299.1| hypothetical protein
            PHAVU_002G226900g [Phaseolus vulgaris]
          Length = 964

 Score = 1165 bits (3014), Expect = 0.0
 Identities = 622/977 (63%), Positives = 738/977 (75%), Gaps = 25/977 (2%)
 Frame = +3

Query: 414  MFKSVVYQGNSLLGEVEIYPQNQKICMID-KEIRISHFSQPSERCPPLAVLHTIAPSGVC 590
            M+KSVVYQG  +LGEVE+YP+         KEIRISHFSQPSERCPPLAVLHT+   GVC
Sbjct: 1    MYKSVVYQGEVVLGEVEVYPEENNYKNFHVKEIRISHFSQPSERCPPLAVLHTVTSCGVC 60

Query: 591  FKMESKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFIVS 770
            FKMESK Q +D  LF LHS C+RE+KTAV+PLG EE+HLVAM SR   +    FWGFIV+
Sbjct: 61   FKMESKTQQQDG-LFHLHSLCIRENKTAVIPLGGEEIHLVAMHSRN--DDRPRFWGFIVA 117

Query: 771  PGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGMLA 950
             GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN+E+DPQR+SGM A
Sbjct: 118  LGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISGMQA 177

Query: 951  EIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNIILT 1130
            E+KRYQ+DK+ILKQY ENDQVV+NG+V+KVQSE+VPA SDN+QPIVRPLIRLQ+KNIILT
Sbjct: 178  EVKRYQEDKNILKQYAENDQVVDNGRVVKVQSEIVPALSDNHQPIVRPLIRLQDKNIILT 237

Query: 1131 RINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPESNL 1310
            RINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDP+SNL
Sbjct: 238  RINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNL 297

Query: 1311 INSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPAFA 1490
            INSKELL R+VCVK+G +KSL NVF DG+CHPKMALVIDDRLKVWDEKDQ RVHVVPAFA
Sbjct: 298  INSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAFA 357

Query: 1491 PYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAPDV 1670
            PYYAPQAEA+N +PVLCVARNVACNVRGGFFKEFD+GLLQ I +V YED + + P  PDV
Sbjct: 358  PYYAPQAEASNSIPVLCVARNVACNVRGGFFKEFDDGLLQKIPQVAYEDDIKDIPIPPDV 417

Query: 1671 SNFLISEDD----VSASNKDPLGFEGVTDVE-ERRLK---------DVIQASSMVD---- 1796
            SN+L+SEDD    +S  N+DP  F+ + D E ER+ K         D + A+S +     
Sbjct: 418  SNYLVSEDDGSSAISNGNRDPFLFDSMGDAEVERKSKVPTRAPNEHDALSAASTIPVTTA 477

Query: 1797 SLDPRPAPSLQNAVASSLSTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQG 1967
            +LDPR   SLQ A+ SS  +   PT Q  +M   + QF Q   +VKP+    PSE SL  
Sbjct: 478  NLDPR-LTSLQYAMVSS-GSAPPPTAQASMMPFTHVQFPQPAALVKPMGQAAPSESSLHS 535

Query: 1968 SPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSR 2147
            SPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTS+EP +               + SR
Sbjct: 536  SPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTSNEPTY---AIRHPVPVSAPRVSSR 592

Query: 2148 GSWFPLEEDMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRML 2327
            G WFP EED+  +  NR     VPKE  ++S ++  +K R  HPSFF   ESS  SDR+L
Sbjct: 593  GGWFPAEEDIGSQPLNRV----VPKEFSVDSGSLVIEKHRPHHPSFFSKVESSISSDRIL 648

Query: 2328 NE-NQRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTE 2504
            ++ +QR PKE++  DDR R NH  S+YRS   +E P  +S SS RD   ES   S  + +
Sbjct: 649  HDSHQRLPKEMYHRDDRPRSNHMLSSYRSLSVDEIPFSRSSSSHRDLDSESSH-SVFHAD 707

Query: 2505 TPAGVLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADS 2684
            TP  VLQEI +KCGTKVEF S+L+AS ELQFSIE WF+G+KIG G G+TRKEAQ +AA+ 
Sbjct: 708  TPVVVLQEIALKCGTKVEFMSSLVASTELQFSIEAWFSGKKIGHGFGRTRKEAQHKAAED 767

Query: 2685 SLRNLANKYLSNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXE 2861
            S+++LA+ YLS+   +P S  GD+    + N+ G +  ++S   Q LPKED        +
Sbjct: 768  SIKHLADIYLSSAKDEPGSTYGDVGGFPNANDNGYMVIASSLSNQPLPKEDSASFSTASD 827

Query: 2862 PTRFLDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGG 3038
            P+R LDPRLE SK+ MGS+SALKELCMMEGL + FL+ P P+STN + K EV+AQVEI G
Sbjct: 828  PSRVLDPRLEVSKRPMGSISALKELCMMEGLGVNFLSAPAPVSTNSLQKDEVHAQVEIDG 887

Query: 3039 QILGKGVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRML 3218
            ++ GKG+G TW+EAKMQAAE+AL +L+S LG S Q R  SPR    FS+KRLK E+PR +
Sbjct: 888  KVFGKGIGLTWDEAKMQAAEKALGSLRSKLGQSIQKRQSSPRSHQGFSNKRLKQEYPRAM 947

Query: 3219 QRINSSTRYSNNAPPVP 3269
            QRI SSTRY  NAPP+P
Sbjct: 948  QRIPSSTRYPRNAPPIP 964


>ref|XP_006583810.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X2 [Glycine max]
          Length = 929

 Score = 1155 bits (2988), Expect = 0.0
 Identities = 615/969 (63%), Positives = 738/969 (76%), Gaps = 16/969 (1%)
 Frame = +3

Query: 411  KMFKSVVYQGNSLLGEVEIYPQ---NQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPS 581
            +M+KSVVYQG  ++GEV++YP+   N K   + KEIRISHFSQPSERCPPLAVLHT+   
Sbjct: 2    RMYKSVVYQGEVVVGEVDVYPEENNNYKNFHV-KEIRISHFSQPSERCPPLAVLHTVTSC 60

Query: 582  GVCFKMESKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGF 761
            GVCFKMESK Q +D  LF LHS C+RE+KTAVMPLG EE+HLVAM SR N+++  CFWGF
Sbjct: 61   GVCFKMESKTQQQDG-LFQLHSLCIRENKTAVMPLGGEEIHLVAMHSR-NVDR-PCFWGF 117

Query: 762  IVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSG 941
            IV+ GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN+E+DPQR+SG
Sbjct: 118  IVALGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISG 177

Query: 942  MLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNI 1121
            M AE+KRYQDDK+ILKQY ENDQVV+NG+VIKVQSE+VPA SD++QPIVRPLIRLQ+KNI
Sbjct: 178  MQAEVKRYQDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRLQDKNI 237

Query: 1122 ILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPE 1301
            ILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDP+
Sbjct: 238  ILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPD 297

Query: 1302 SNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVP 1481
            SNLINSKELL R+VCVK+G +KSL NVF DG+CHPKMALVIDDRLKVWDEKDQ RVHVVP
Sbjct: 298  SNLINSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVP 357

Query: 1482 AFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSA 1661
            AFAPYYAPQAEA+N +PVLCVARNVACNVRGGFFK+FD+GLLQ I ++ YED + + PS 
Sbjct: 358  AFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKDIPSP 417

Query: 1662 PDVSNFLISEDDVSASN--KDPLGFEGVTDVE-ERRLKDVIQASSMVD----SLDPRPAP 1820
            PDVSN+L+SEDD S SN  +DP  F+G+ D E ER+LKD + A+S +     +LDPR   
Sbjct: 418  PDVSNYLVSEDDGSISNGHRDPFLFDGMADAEVERKLKDALSAASTIPVTTANLDPR-LT 476

Query: 1821 SLQNAVASSLSTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEGE 1991
            SLQ  +  S  ++  PT Q  +M   + QF Q   +VKP+    PSEPSL  SPAREEGE
Sbjct: 477  SLQYTMVPS-GSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSEPSLHSSPAREEGE 535

Query: 1992 VPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEE 2171
            VPESELDPDTRRRLLILQHGQDTRDH S+EPPF                 SRG WFP EE
Sbjct: 536  VPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQTSAPHVP--SSRGVWFPAEE 593

Query: 2172 DMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-NQRFP 2348
            ++  +  NR     VPKE P++S  +   K R  HPSFF   ESS  SDR+L++ +QR P
Sbjct: 594  EIGSQPLNRV----VPKEFPVDSGPLGIAKPRPHHPSFFSKVESSISSDRILHDSHQRLP 649

Query: 2349 KEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQE 2528
            KE++  DDR R+NH  S+YRSF                            ++TP  VLQE
Sbjct: 650  KEMYHRDDRPRLNHMLSSYRSF----------------------------SDTPVAVLQE 681

Query: 2529 IGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANK 2708
            I +KCGTKV+F S+L+AS ELQFS+E WF+G+KIG   G+TRKEAQ +AA+ S+++LA+ 
Sbjct: 682  IALKCGTKVDFISSLVASTELQFSMEAWFSGKKIGHRVGRTRKEAQNKAAEDSIKHLADI 741

Query: 2709 YLSNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPR 2885
            YLS+   +P S  GD++   +VN+ G +G ++S G Q L KED         P+R LDPR
Sbjct: 742  YLSSAKDEPGSTYGDVSGFPNVNDSGYMGIASSLGNQPLSKED-SASFSTASPSRVLDPR 800

Query: 2886 LEESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILGKGVG 3062
            L+ SK+SMGS+S+LKELCMMEGL + FL+ P P+STN + K EV+AQVEI G++ GKG+G
Sbjct: 801  LDVSKRSMGSISSLKELCMMEGLDVNFLSAPAPVSTNSVQKDEVHAQVEIDGKVFGKGIG 860

Query: 3063 ATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTR 3242
             TW+EAKMQAAE+AL +L+S LG S Q R  SPR    FS+KRLK E+PR +QR+ SS R
Sbjct: 861  LTWDEAKMQAAEKALGSLRSKLGQSIQKRQSSPRPHQGFSNKRLKQEYPRPMQRMPSSAR 920

Query: 3243 YSNNAPPVP 3269
            Y  NAPP+P
Sbjct: 921  YPRNAPPIP 929


>ref|XP_004505032.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Cicer arietinum]
          Length = 951

 Score = 1154 bits (2986), Expect = 0.0
 Identities = 619/966 (64%), Positives = 728/966 (75%), Gaps = 14/966 (1%)
 Frame = +3

Query: 414  MFKSVVYQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGVCF 593
            M+KS+VYQG  +LGEV+IYP+        KEIRISHF+QPSERC PLAVLHTI  SGVCF
Sbjct: 1    MYKSLVYQGEVVLGEVDIYPEVNNNNKNFKEIRISHFTQPSERCLPLAVLHTITSSGVCF 60

Query: 594  KMESKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFIVSP 773
            KMESK Q +D PLF LH+ C RE+KTAVMPL  EE+HLVAM SR N     CFWG+IV  
Sbjct: 61   KMESKTQQQD-PLFHLHNLCFRENKTAVMPLCGEEMHLVAMHSRSNGRP--CFWGYIVGM 117

Query: 774  GLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGMLAE 953
            GLYNSCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN+E+DPQR+SGM AE
Sbjct: 118  GLYNSCLMMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISGMQAE 177

Query: 954  IKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNIILTR 1133
            +KRY +DKSILKQYVENDQVV+NGKV+K QSE+VPA SD++QPIVRPLIRL EKNIILTR
Sbjct: 178  VKRYLEDKSILKQYVENDQVVDNGKVLKAQSELVPALSDSHQPIVRPLIRLHEKNIILTR 237

Query: 1134 INPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPESNLI 1313
            INP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDP+SNLI
Sbjct: 238  INPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNLI 297

Query: 1314 NSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPAFAP 1493
            NSKELL R+VCVK+G +KSL NVF DG+CHPKMALVIDDRLKVWDEKDQ RVHVVPAFAP
Sbjct: 298  NSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAP 357

Query: 1494 YYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAPDVS 1673
            YYAPQAEA+N +PVLCVARNVACNVRGGFFK+FD+GLLQ I ++ YE+   +   APDVS
Sbjct: 358  YYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKISQIAYENNTRDISPAPDVS 417

Query: 1674 NFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQASSMVD----SLDPRPAPSLQN 1832
            N+L+SEDD SAS  N+DP  F+G+ D E ER+LKD I A+S +      LDPR   SLQ 
Sbjct: 418  NYLVSEDDGSASYANRDPFAFDGMADAEVERKLKDAISAASAIPMTTAKLDPRLTSSLQY 477

Query: 1833 AVASSLSTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEGEVPES 2003
             + S   ++  P  Q  ++   + QF Q   +VKP+  V PSE SL  SPAREEGEVPES
Sbjct: 478  TMVSP-GSVLPPAAQASMIPLPHTQFPQPATLVKPIGQVAPSELSLHSSPAREEGEVPES 536

Query: 2004 ELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEEDMIP 2183
            ELDPDTRRRLLILQHGQD RDHTSSEPPF               +  RG WFP+EE++  
Sbjct: 537  ELDPDTRRRLLILQHGQDNRDHTSSEPPF----PLKHPVQVSARVPPRGGWFPVEEEIGS 592

Query: 2184 RKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-NQRFPKEVH 2360
            + PNR     +PKE  L+S     +K R     FF   + S  SDR L+E NQR PKE++
Sbjct: 593  QPPNRV----IPKEIALDSGPSRIEKHRLHQQPFFPKVDGSISSDRALHETNQRLPKEMY 648

Query: 2361 QGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIGIK 2540
              DDR RV+H  S+Y S  G++TP G+S SS RDF  ESG  S    ETPA VLQEI +K
Sbjct: 649  HRDDRSRVSHMLSSYPSLSGDDTPFGRSSSSHRDFDSESGH-SVFNAETPAIVLQEIALK 707

Query: 2541 CGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYLSN 2720
            CGTKVEF S+L AS ELQFSIE WF+G+KIG G G+TR EAQ +AA+ S+++LA+ YLS 
Sbjct: 708  CGTKVEFTSSLAASRELQFSIEAWFSGKKIGHGFGRTRMEAQYKAAEDSIKHLADIYLSR 767

Query: 2721 VMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPRLEES 2897
               +  S  GD++   + N+ G +G+ +S G Q LPKE+        +P+R LDPRL+ S
Sbjct: 768  AKDESGSAFGDVSGFPNANDNGYVGNVSSLGNQPLPKEESVSFSAASDPSRVLDPRLDVS 827

Query: 2898 KKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILGKGVGATWN 3074
            K+SMGSVSALKELCM+EGL + FL+ P P+STN ++  EV+AQVEI GQ+ GKG G TW+
Sbjct: 828  KRSMGSVSALKELCMVEGLGVNFLSLPAPVSTNSVD--EVHAQVEIDGQVYGKGTGITWD 885

Query: 3075 EAKMQAAEEALWNLKSMLGDSG-QNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTRYSN 3251
            EAKMQAAE+AL +L++ +   G Q R  SPR     S+KRLK E PR LQR  SS RY  
Sbjct: 886  EAKMQAAEKALGSLRTTIHGQGIQRRQLSPRPFQGLSNKRLKQEHPRTLQRFASSGRYPR 945

Query: 3252 NAPPVP 3269
            NAPP+P
Sbjct: 946  NAPPIP 951


>ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Solanum tuberosum]
          Length = 953

 Score = 1126 bits (2913), Expect = 0.0
 Identities = 619/968 (63%), Positives = 717/968 (74%), Gaps = 16/968 (1%)
 Frame = +3

Query: 414  MFKS--VVYQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGV 587
            MFKS  V+Y+G  L+GEVEIY + + +   +K IRISH+S  SERCPPLAVLHT+  +G+
Sbjct: 1    MFKSTVVLYEGERLVGEVEIYCE-KGVLWGEKVIRISHYSPSSERCPPLAVLHTVT-TGL 58

Query: 588  CFKME---SKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWG 758
             FK+E   SK  ++DSPL  LHS+CLR++KTAVM LG EELHLVAM S+    Q  CFWG
Sbjct: 59   SFKLEPTKSKPLTQDSPLTLLHSTCLRDNKTAVMSLGREELHLVAMQSKNIGGQCPCFWG 118

Query: 759  FIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVS 938
            F V+ GLY+SCL MLNLRCLGIVFDLDETLIVANTMRSFEDRI+ALQRKIN+E DPQR S
Sbjct: 119  FKVASGLYDSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINSESDPQRAS 178

Query: 939  GMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKN 1118
             MLAE+KRYQ+DK ILKQY ENDQVV+NGKVIK QSEV PA SDN+QPIVRPLIRLQ++N
Sbjct: 179  VMLAEVKRYQEDKIILKQYAENDQVVDNGKVIKSQSEVFPALSDNHQPIVRPLIRLQDRN 238

Query: 1119 IILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDP 1298
            IILTRINP+IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDP
Sbjct: 239  IILTRINPMIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP 298

Query: 1299 ESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVV 1478
            +SNLINS+ELL R+VCVK+G RKSL NVF DG CHPKMALVIDDRLKVWD+KDQ RVHVV
Sbjct: 299  DSNLINSQELLDRIVCVKSGLRKSLFNVFQDGNCHPKMALVIDDRLKVWDDKDQPRVHVV 358

Query: 1479 PAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPS 1658
            PAFAPY+APQAE NN VPVLCVARNVACNVRGGFFK+FDEGLLQ I EV YED +   PS
Sbjct: 359  PAFAPYFAPQAEGNNSVPVLCVARNVACNVRGGFFKDFDEGLLQRISEVAYEDDIKQVPS 418

Query: 1659 APDVSNFLISEDDVSA--SNKDPLGFEGVTDVE-ERRLKDVIQASSMVDS----LDPRPA 1817
            APDVSN+LISEDD SA   NKD LGF+G+ D E ERRLK+ + AS+ V S    LDPR  
Sbjct: 419  APDVSNYLISEDDPSAVNGNKDSLGFDGMADSEVERRLKEAMLASTSVPSQMTNLDPRLV 478

Query: 1818 PSLQNAVASSLSTISQPTPQGPIMSSQNNQFTQVVKPLV-HVGPSEPSLQGSPAREEGEV 1994
            P+LQ  V   +S  S  +P  P  +    Q T V+K  V  + P + SLQ SPAREEGEV
Sbjct: 479  PALQYPVPPVISQPSIQSPVVPFPTQHLPQVTSVLKSSVTQISPQDTSLQSSPAREEGEV 538

Query: 1995 PESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEED 2174
            PESELDPDTRRRLLILQHGQDTRD  SSEP F               +Q  G WFP EE+
Sbjct: 539  PESELDPDTRRRLLILQHGQDTRDQVSSEPKF--PMGTPLQVSVPPRVQPHG-WFPAEEE 595

Query: 2175 MIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNENQRFPKE 2354
            M PR+ NR +P   PKE PL  E+MH +K R  HP F    E+S PSDR+L ENQR PKE
Sbjct: 596  MSPRQLNRPLP---PKEFPLNPESMHINKHRPPHPPFLPKMETSMPSDRVLFENQRLPKE 652

Query: 2355 VHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIG 2534
            V   DDR+R + S  ++R  PGEE PLG+S SS+R    E G    PY ETPAG LQ+I 
Sbjct: 653  VIPRDDRMRFSQSQPSFRP-PGEEVPLGRSSSSNRVLDLEPGH-YDPYLETPAGALQDIA 710

Query: 2535 IKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYL 2714
             KCG KVEFRS+ ++S ELQFS+EV F GEK+G GTG+TR+EAQR+AA+ SL  LA+KYL
Sbjct: 711  FKCGAKVEFRSSFLSSPELQFSLEVLFAGEKVGEGTGRTRREAQRRAAEESLMYLADKYL 770

Query: 2715 SNVMHDPSSVRGDLNKPSHVNEMGLGDSNS-FGYQTLPKEDXXXXXXXXEPTRFLDPRLE 2891
            S +  D SS +GD  +  + ++ G  D+ S FGYQ     D        EP R LDPRLE
Sbjct: 771  SCIKPDSSSTQGDGFRFPNASDNGFVDNMSPFGYQ-----DRVSHSFASEPPRVLDPRLE 825

Query: 2892 ESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGKGVGATW 3071
              KKS+GSV AL+ELC +EGL L F  QP LS N   K E+YAQVEI GQ+ GKG+G+TW
Sbjct: 826  VFKKSVGSVGALRELCAIEGLGLAFQTQPQLSANPGQKSEIYAQVEIDGQVFGKGIGSTW 885

Query: 3072 NEAKMQAAEEALWNLKSMLGDSGQNRVGSPR-LLHSFSSKRLKPEFPR-MLQRINSSTRY 3245
            ++AK QAAE AL  LKS L    Q R GSPR L   FS+KRLKPE+ R + QR+  S R+
Sbjct: 886  DDAKTQAAERALVALKSELAQFSQKRQGSPRSLQQGFSNKRLKPEYSRGVQQRVPLSGRF 945

Query: 3246 SNNAPPVP 3269
              N   +P
Sbjct: 946  PKNTSAMP 953


>ref|XP_003545893.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
          Length = 958

 Score = 1124 bits (2908), Expect = 0.0
 Identities = 606/970 (62%), Positives = 719/970 (74%), Gaps = 18/970 (1%)
 Frame = +3

Query: 414  MFKSVVYQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGVCF 593
            M +S+VY G   +GEVEIYP+ +K   + KEIRISHFSQPSERCPPLAVLHTI   G+CF
Sbjct: 1    MKRSMVYHGEMEVGEVEIYPEEKKNIDL-KEIRISHFSQPSERCPPLAVLHTITSFGICF 59

Query: 594  KMES---KLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFI 764
            KMES   + + +   LF LHSSC+RE+KTAVMPL  EE+HLVAM SR N     CFWGFI
Sbjct: 60   KMESSTSQTRQQQDVLFHLHSSCIRENKTAVMPLRGEEIHLVAMYSRNN--DRPCFWGFI 117

Query: 765  VSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGM 944
            V+ GLYNSCL MLNLRCLGIVFDLDETL+VANTMRSFED+I+ L RK+N+E++PQR+S M
Sbjct: 118  VASGLYNSCLTMLNLRCLGIVFDLDETLVVANTMRSFEDKIEVLHRKMNSEVNPQRISTM 177

Query: 945  LAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNII 1124
             AEIKRY DDK+ILK+Y ENDQVV+NGKVIK+QSE+VPA SD++QPIVRPLIRLQEKNII
Sbjct: 178  QAEIKRYLDDKNILKEYAENDQVVDNGKVIKIQSEIVPALSDSHQPIVRPLIRLQEKNII 237

Query: 1125 LTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPES 1304
            LTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEV+VCTMAERDYALEMWRLLDPE 
Sbjct: 238  LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPEL 297

Query: 1305 NLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPA 1484
            NLINSKELL R+VCVK+G +KSL NVF +G+CH KMALVIDDRLKVWDEKDQ +VHVVPA
Sbjct: 298  NLINSKELLDRIVCVKSGLKKSLFNVFQNGLCHLKMALVIDDRLKVWDEKDQPQVHVVPA 357

Query: 1485 FAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAP 1664
            FAPYYAPQAEA+N VP LC+AR+VACNVRGGFFK+FD+GLLQ I  + YED + + PS P
Sbjct: 358  FAPYYAPQAEASNAVPTLCLARSVACNVRGGFFKDFDDGLLQKIPLIAYEDDIKDIPSPP 417

Query: 1665 DVSNFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQASS----MVDSLDPRPA-- 1817
            DVSN+L+SEDD SAS  NK+ L F+G+ D E ERRLKD I ASS    M  +LDPR A  
Sbjct: 418  DVSNYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTVPAMTTNLDPRLAFN 477

Query: 1818 PSLQNAVASSLSTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEG 1988
             SLQ  + SS  T+  PT Q  I+   N QF Q   +VKP+  V P  PSL  SPAREEG
Sbjct: 478  SSLQYTMVSSSGTVPPPTAQASIVQFGNVQFPQPNTLVKPICQVTPPGPSLHSSPAREEG 537

Query: 1989 EVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLE 2168
            EVPESELD DTRRRLLILQHGQDTR+HTSSEPP                + SR  WF +E
Sbjct: 538  EVPESELDLDTRRRLLILQHGQDTREHTSSEPPL---PVRHPTQVSAPSVPSRRGWFSVE 594

Query: 2169 EDMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-NQRF 2345
            E+M P++ N+     VPKE P+ SE +H +KR  RHPS F   + S  SDR+ +E +QR 
Sbjct: 595  EEMGPQQLNQL----VPKEFPVGSEPLHIEKRWPRHPSLFSKVDDSVSSDRVFHESHQRL 650

Query: 2346 PKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQ 2525
            PKEVH  DD  R++ S S+Y SFPG++ PL  S  S+RDF  ESGR S  + +  AGVLQ
Sbjct: 651  PKEVHHRDDHSRLSQSLSSYHSFPGDDIPLSGSSYSNRDFDSESGR-SLFHADITAGVLQ 709

Query: 2526 EIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLAN 2705
            EI +KCGTKVEF S+L+AS  LQFSIE WF G+K+G G G+TR+EAQ +AA+ S++ LA+
Sbjct: 710  EIALKCGTKVEFLSSLVASTALQFSIEAWFAGKKVGEGFGRTRREAQNKAAECSIKQLAD 769

Query: 2706 KYLSNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDP 2882
             Y+S+   D  S  GD++     N  G +   NS G Q LPKE         + +R  DP
Sbjct: 770  IYMSHAKDDSGSTYGDVSGFHGSNNNGFVSSGNSLGNQLLPKES-VSFSTSSDSSRVSDP 828

Query: 2883 RLEESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILGKGV 3059
            RLE SK+S  S+SALKE CMMEGL+  F + P P ST+   K EV+AQVEI GQI GKG 
Sbjct: 829  RLEVSKRSTDSISALKEFCMMEGLAANFQSSPAPASTHFAQKDEVHAQVEIDGQIFGKGF 888

Query: 3060 GATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSST 3239
            G TW EAKMQAA++AL +L++M     + R GSPR +   ++KRLK E+PR LQRI  S 
Sbjct: 889  GLTWEEAKMQAAKKALESLRTMFNQGTRKRHGSPRSMQGLANKRLKQEYPRTLQRIPYSA 948

Query: 3240 RYSNNAPPVP 3269
            RY  NAP VP
Sbjct: 949  RYPRNAPLVP 958


>ref|XP_006597421.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X3 [Glycine max]
          Length = 932

 Score = 1108 bits (2865), Expect = 0.0
 Identities = 599/969 (61%), Positives = 712/969 (73%), Gaps = 17/969 (1%)
 Frame = +3

Query: 414  MFKSVVYQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGVCF 593
            M +S+VY G   +GEVEIYP+ +K   + KEIRISHFSQPSERCPPLAVLHTI   G+CF
Sbjct: 1    MKRSMVYHGEMEVGEVEIYPEEKKNIDL-KEIRISHFSQPSERCPPLAVLHTITSFGICF 59

Query: 594  KMES---KLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFI 764
            KMES   + + +   LF LHSSC+RE+KTAVMPL  EE+HLVAM SR N     CFWGFI
Sbjct: 60   KMESSTSQTRQQQDVLFHLHSSCIRENKTAVMPLRGEEIHLVAMYSRNN--DRPCFWGFI 117

Query: 765  VSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGM 944
            V+ GLYNSCL MLNLRCLGIVFDLDETL+VANTMRSFED+I+ L RK+N+E++PQR+S M
Sbjct: 118  VASGLYNSCLTMLNLRCLGIVFDLDETLVVANTMRSFEDKIEVLHRKMNSEVNPQRISTM 177

Query: 945  LAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNII 1124
             AEIKRY DDK+ILK+Y ENDQVV+NGKVIK+QSE+VPA SD++QPIVRPLIRLQEKNII
Sbjct: 178  QAEIKRYLDDKNILKEYAENDQVVDNGKVIKIQSEIVPALSDSHQPIVRPLIRLQEKNII 237

Query: 1125 LTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPES 1304
            LTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEV+VCTMAERDYALEMWRLLDPE 
Sbjct: 238  LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPEL 297

Query: 1305 NLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPA 1484
            NLINSKELL R+VCVK+G +KSL NVF +G+CH KMALVIDDRLKVWDEKDQ +VHVVPA
Sbjct: 298  NLINSKELLDRIVCVKSGLKKSLFNVFQNGLCHLKMALVIDDRLKVWDEKDQPQVHVVPA 357

Query: 1485 FAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAP 1664
            FAPYYAPQAEA+N VP LC+AR+VACNVRGGFFK+FD+GLLQ I  + YED + + PS P
Sbjct: 358  FAPYYAPQAEASNAVPTLCLARSVACNVRGGFFKDFDDGLLQKIPLIAYEDDIKDIPSPP 417

Query: 1665 DVSNFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQASS----MVDSLDPRPA-- 1817
            DVSN+L+SEDD SAS  NK+ L F+G+ D E ERRLKD I ASS    M  +LDPR A  
Sbjct: 418  DVSNYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTVPAMTTNLDPRLAFN 477

Query: 1818 PSLQNAVASSLSTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEG 1988
             SLQ  + SS  T+  PT Q  I+   N QF Q   +VKP+  V P  PSL  SPAREEG
Sbjct: 478  SSLQYTMVSSSGTVPPPTAQASIVQFGNVQFPQPNTLVKPICQVTPPGPSLHSSPAREEG 537

Query: 1989 EVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLE 2168
            EVPESELD DTRRRLLILQHGQDTR+HTSSEPP                + SR  WF +E
Sbjct: 538  EVPESELDLDTRRRLLILQHGQDTREHTSSEPPL---PVRHPTQVSAPSVPSRRGWFSVE 594

Query: 2169 EDMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-NQRF 2345
            E+M P++ N+     VPKE P+ SE +H +KR  RHPS F   + S  SDR+ +E +QR 
Sbjct: 595  EEMGPQQLNQL----VPKEFPVGSEPLHIEKRWPRHPSLFSKVDDSVSSDRVFHESHQRL 650

Query: 2346 PKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQ 2525
            PKEVH  DD  R++ S S+Y SFPG++ PL  S  S+RDF  ESGR S  + +  AGVLQ
Sbjct: 651  PKEVHHRDDHSRLSQSLSSYHSFPGDDIPLSGSSYSNRDFDSESGR-SLFHADITAGVLQ 709

Query: 2526 EIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLAN 2705
            EI +KCGTKVEF S+L+AS  LQFSIE WF G+K+G G G+TR+EAQ +AA+ S++ LA+
Sbjct: 710  EIALKCGTKVEFLSSLVASTALQFSIEAWFAGKKVGEGFGRTRREAQNKAAECSIKQLAD 769

Query: 2706 KYLSNVMHDPSSVRGDLNKPSHVNEMGLGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPR 2885
             Y+S+   D  S  GD++        G   SN+ G+ +                   DPR
Sbjct: 770  IYMSHAKDDSGSTYGDVS--------GFHGSNNNGFVS------------------SDPR 803

Query: 2886 LEESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILGKGVG 3062
            LE SK+S  S+SALKE CMMEGL+  F + P P ST+   K EV+AQVEI GQI GKG G
Sbjct: 804  LEVSKRSTDSISALKEFCMMEGLAANFQSSPAPASTHFAQKDEVHAQVEIDGQIFGKGFG 863

Query: 3063 ATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTR 3242
             TW EAKMQAA++AL +L++M     + R GSPR +   ++KRLK E+PR LQRI  S R
Sbjct: 864  LTWEEAKMQAAKKALESLRTMFNQGTRKRHGSPRSMQGLANKRLKQEYPRTLQRIPYSAR 923

Query: 3243 YSNNAPPVP 3269
            Y  NAP VP
Sbjct: 924  YPRNAPLVP 932


>ref|XP_004232844.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Solanum lycopersicum]
          Length = 954

 Score = 1107 bits (2862), Expect = 0.0
 Identities = 608/972 (62%), Positives = 709/972 (72%), Gaps = 20/972 (2%)
 Frame = +3

Query: 414  MFKSVV--YQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGV 587
            MFKS V  Y+G  L+GEVE+Y + + +   +K IRISH+S  SERCPPLAVLHT+  +G+
Sbjct: 1    MFKSTVLLYEGERLVGEVEMYGE-KGVVWGEKLIRISHYSPSSERCPPLAVLHTVT-TGL 58

Query: 588  CFKME---SKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWG 758
             FK+E   SK  ++DSPL  LHS+CLR++KTAVM LG EELHLVAM S+    Q  CFWG
Sbjct: 59   SFKLEPTKSKPLTQDSPLTLLHSTCLRDNKTAVMSLGREELHLVAMQSKNIGGQCPCFWG 118

Query: 759  FIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVS 938
            F V+ GLY+SCL MLNLRCLGIVFDLDETLIVANTMRSFEDRI+ALQRKIN+E DPQR S
Sbjct: 119  FKVASGLYDSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINSESDPQRAS 178

Query: 939  GMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKN 1118
             MLAE+KRYQ+DK ILKQY ENDQVV+NGKVI+ QSEV PA SDN+QPIVRPLIRLQ++N
Sbjct: 179  VMLAEVKRYQEDKIILKQYAENDQVVDNGKVIRSQSEVFPALSDNHQPIVRPLIRLQDRN 238

Query: 1119 IILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDP 1298
            IILTRINP+IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDP
Sbjct: 239  IILTRINPMIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP 298

Query: 1299 ESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVV 1478
            +SNLINS+ELL R+VCVK+G RKSL NVF DG CHPKMALVIDDRLKVWD+KDQ RVHVV
Sbjct: 299  DSNLINSQELLDRIVCVKSGLRKSLFNVFQDGNCHPKMALVIDDRLKVWDDKDQPRVHVV 358

Query: 1479 PAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPS 1658
            PAFAPY+APQAE NN VPVLCVARNVACNVRGGFFK+FDEGLLQ I EV YED +   PS
Sbjct: 359  PAFAPYFAPQAEGNNSVPVLCVARNVACNVRGGFFKDFDEGLLQRISEVAYEDDIKQVPS 418

Query: 1659 APDVSNFLISEDDVSA--SNKDPLGFEGVTDVE-ERRLKDVIQASSMVDS----LDPRPA 1817
            APDVSN+LISEDD SA   NKD LGF+G+ D E ERRLK+ + AS+ V S    LDPR  
Sbjct: 419  APDVSNYLISEDDPSAVNGNKDSLGFDGMADSEVERRLKEAMLASTSVPSQMTNLDPRLV 478

Query: 1818 PSLQNAVASSLSTISQPTPQGPIMSSQNNQFTQVVKPL----VHVGPSEPSLQGSPAREE 1985
            P+LQ  V      ISQP+ QGP++        QV   L      + P + SLQ SPAREE
Sbjct: 479  PALQYPVP---PVISQPSIQGPVVPFPTQHLPQVTSVLKSSVTQISPQDTSLQSSPAREE 535

Query: 1986 GEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPL 2165
            GEVPESELDPDTRRRLLILQHGQDTRD  SSEP F               +Q  G WFP 
Sbjct: 536  GEVPESELDPDTRRRLLILQHGQDTRDQVSSEPKF--PIGTPLQVSVPPRVQPHG-WFPA 592

Query: 2166 EEDMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNENQRF 2345
            EE++ PR+ NR +P   PKE PL  E+MH +K R  HP F    E+S PSDR+  ENQR 
Sbjct: 593  EEEVSPRQLNRPLP---PKEFPLNPESMHINKHRPPHPPFLPKMETSMPSDRVFFENQRL 649

Query: 2346 PKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQ 2525
            PKEV   DDR+R + S  ++R  PGE+  LG+S SS+R    + G    PY +TPAG LQ
Sbjct: 650  PKEVIPRDDRMRFSQSQPSFRP-PGEDVSLGRSSSSNRVLDLDPGH-YDPYLDTPAGALQ 707

Query: 2526 EIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLAN 2705
            +I  KCG KVEFRS+ ++S ELQF +EV F GEK+G G G+TR+EAQR AA+ SL  LA+
Sbjct: 708  DIAFKCGVKVEFRSSFLSSPELQFCLEVLFAGEKVGEGIGRTRREAQRHAAEESLMYLAD 767

Query: 2706 KYLSNVMHDPSSVRGDLNKPSHVNEMGLGDSNS-FGYQTLPKEDXXXXXXXXEPTRFLDP 2882
            KYLS +  D SS +GD  +  + ++ G  ++ S FGYQ     D        EP R LDP
Sbjct: 768  KYLSCIKADSSSTQGDGFRFPNASDNGFVENMSPFGYQ-----DRVSHSFASEPPRVLDP 822

Query: 2883 RLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGKGVG 3062
            RLE  KKS+GSV AL+ELC +EGL L F  QP LS N   K E+YAQVEI GQ+ GKG+G
Sbjct: 823  RLEVFKKSVGSVGALRELCAIEGLGLAFQTQPQLSVNPGQKSEIYAQVEIDGQVFGKGIG 882

Query: 3063 ATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLL--HSFSSKRLKPEFPR-MLQRINS 3233
             TW++AK QAAE AL  LKS L      R GSPR L    FS+KRLKPE+ R + QR+  
Sbjct: 883  PTWDDAKTQAAERALVALKSELAQFSHKRQGSPRSLQQQGFSNKRLKPEYSRGVQQRVPL 942

Query: 3234 STRYSNNAPPVP 3269
            S R+  N   +P
Sbjct: 943  SGRFPKNTSAMP 954


>ref|XP_003543063.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
            gi|571500215|ref|XP_006594604.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 1-like
            isoform X2 [Glycine max]
          Length = 960

 Score = 1104 bits (2855), Expect = 0.0
 Identities = 598/967 (61%), Positives = 707/967 (73%), Gaps = 18/967 (1%)
 Frame = +3

Query: 423  SVVYQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGVCFKME 602
            S+VY G   +GEV+IYP+  K   + KEIRISHFSQPSERCPPLAVLHTI   G+CFKME
Sbjct: 4    SMVYHGEMAVGEVKIYPEENKNMDL-KEIRISHFSQPSERCPPLAVLHTITSFGICFKME 62

Query: 603  SKLQSK---DSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFIVSP 773
            S    K      LF LHSSC+RE+KTAVMP+  EE+HLVAM SR N     CFWGFIV+ 
Sbjct: 63   SSTSQKRQQQDALFHLHSSCIRENKTAVMPVRGEEIHLVAMYSRNN--DRPCFWGFIVAS 120

Query: 774  GLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGMLAE 953
            GLYNSCL MLNLRCLGIVFDLDETL+VANTMRSFED+I+ L RK+N+E++PQ++S M AE
Sbjct: 121  GLYNSCLTMLNLRCLGIVFDLDETLVVANTMRSFEDKIEVLHRKMNSEVNPQQISAMQAE 180

Query: 954  IKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNIILTR 1133
            IKRY DDK+ILK+Y ENDQVV+NGKVIK+QSE VPA SD++QPIVRPLIRLQEKNIILTR
Sbjct: 181  IKRYLDDKNILKEYAENDQVVDNGKVIKIQSESVPALSDSHQPIVRPLIRLQEKNIILTR 240

Query: 1134 INPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPESNLI 1313
            INP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEV+VCTMAERDYALEMWRLLDPE NLI
Sbjct: 241  INPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPELNLI 300

Query: 1314 NSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPAFAP 1493
            NSKELL R+VCVK+G +KSL NVF +G+CH KMALVIDDRLKVWDEKDQ RVHVVPAFAP
Sbjct: 301  NSKELLDRIVCVKSGLKKSLFNVFQNGLCHLKMALVIDDRLKVWDEKDQPRVHVVPAFAP 360

Query: 1494 YYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAPDVS 1673
            YY PQAEA+N VP LC+ARNVACNVRGGFFK+FD+GLLQ I  + YED + + PS PDVS
Sbjct: 361  YYTPQAEASNAVPFLCLARNVACNVRGGFFKDFDDGLLQKIPLIAYEDDIKDIPS-PDVS 419

Query: 1674 NFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQASS----MVDSLDPRPA--PSL 1826
            N+L+SEDD SAS  NK+ L F+G+ D E ERRLKD I ASS    +  ++DPR A   SL
Sbjct: 420  NYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTILALTANIDPRLAFTSSL 479

Query: 1827 QNAVASSLSTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEGEVP 1997
            Q  + SS  T+  PT Q  ++   N QF Q   +VKP+  V     SL  SPAREEGE+P
Sbjct: 480  QYTMVSSSGTVPPPTAQASVVQFGNVQFPQPNTLVKPMSQVTHPGLSLHSSPAREEGELP 539

Query: 1998 ESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEEDM 2177
            ESELD DTRRR LILQHGQDTR+  +SEPPF               + SR  WF +EE+M
Sbjct: 540  ESELDLDTRRRFLILQHGQDTRERMASEPPF-PVRHPAQVSAPASSVPSRRGWFSVEEEM 598

Query: 2178 IPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-NQRFPKE 2354
             P++ N      VPKE P++SE  H +KR  RHPSFF     S  SDR+ +E +QR PKE
Sbjct: 599  GPQQLN----LPVPKEFPVDSEPFHIEKRWPRHPSFFSKVGDSISSDRVFHESHQRLPKE 654

Query: 2355 VHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIG 2534
            VH  DDR R++ S S+Y S PG++ PL  S  S+RDF  ESGR S  + +T AGVLQEI 
Sbjct: 655  VHHRDDRSRLSQSLSSYHSLPGDDIPLSGSSYSNRDFDSESGR-SLFHADTTAGVLQEIA 713

Query: 2535 IKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYL 2714
            + CGTKVEF S+L+AS ELQFSIE WF G+KIG G G+TR+EAQ +AA  S++ LA+ Y+
Sbjct: 714  LNCGTKVEFLSSLVASTELQFSIEAWFAGKKIGEGFGRTRREAQSKAAGCSIKQLADIYM 773

Query: 2715 SNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPRLE 2891
            S+   D  S  GD++     N  G +   NS G Q LPKE+        E +R  D RLE
Sbjct: 774  SHAKDDSGSTYGDVSGFHGSNNDGFVSSGNSLGNQLLPKEESGSFSTASESSRVSDSRLE 833

Query: 2892 ESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILGKGVGAT 3068
             SK+S  S+SALKELCMMEGL+  F + P   ST+L  K EV+AQVEI GQI GKG G T
Sbjct: 834  VSKRSTDSISALKELCMMEGLAASFQSPPASASTHLTQKDEVHAQVEIDGQIFGKGFGVT 893

Query: 3069 WNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTRYS 3248
            W EAKMQAA++AL +L++M       R GSPR +   ++KRLKPE+P  LQR+  S RY 
Sbjct: 894  WEEAKMQAAKKALGSLRTMFNQGSLKRHGSPRSMQGLANKRLKPEYPPTLQRVPYSARYP 953

Query: 3249 NNAPPVP 3269
             NAP VP
Sbjct: 954  RNAPLVP 960


Top