BLASTX nr result

ID: Akebia24_contig00006697 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00006697
         (4071 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform...  1243   0.0  
ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Popu...  1233   0.0  
ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal doma...  1227   0.0  
ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citr...  1226   0.0  
ref|XP_007214548.1| hypothetical protein PRUPE_ppa000988mg [Prun...  1215   0.0  
ref|XP_007025681.1| C-terminal domain phosphatase-like 1 isoform...  1198   0.0  
ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal doma...  1196   0.0  
ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Popu...  1192   0.0  
ref|XP_002519032.1| double-stranded RNA binding protein, putativ...  1184   0.0  
ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal doma...  1183   0.0  
emb|CAN72816.1| hypothetical protein VITISV_004100 [Vitis vinifera]  1180   0.0  
ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal doma...  1173   0.0  
ref|XP_007159305.1| hypothetical protein PHAVU_002G226900g [Phas...  1164   0.0  
ref|XP_006583810.1| PREDICTED: RNA polymerase II C-terminal doma...  1154   0.0  
ref|XP_004505032.1| PREDICTED: RNA polymerase II C-terminal doma...  1154   0.0  
ref|XP_003545893.1| PREDICTED: RNA polymerase II C-terminal doma...  1125   0.0  
ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal doma...  1122   0.0  
ref|XP_006597421.1| PREDICTED: RNA polymerase II C-terminal doma...  1108   0.0  
ref|XP_004232844.1| PREDICTED: RNA polymerase II C-terminal doma...  1104   0.0  
ref|XP_003543063.1| PREDICTED: RNA polymerase II C-terminal doma...  1104   0.0  

>ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform 1 [Theobroma cacao]
            gi|508781046|gb|EOY28302.1| C-terminal domain
            phosphatase-like 1 isoform 1 [Theobroma cacao]
          Length = 978

 Score = 1243 bits (3217), Expect = 0.0
 Identities = 664/986 (67%), Positives = 749/986 (75%), Gaps = 34/986 (3%)
 Frame = +1

Query: 394  MFKSVVYQGNSLLGEVEIYPQNQ--------------KICMID---KEIRISHFSQPSER 522
            M+KSVVY+G  +LGEVEIYPQ Q              KI +++   KEIRI + +Q SER
Sbjct: 4    MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63

Query: 523  CPPLAVLHTIAPSGVCFKMESKLQ-----SKDSP-LFSLHSSCLREDKTAVMPLGEEELH 684
            CPPLAVLHTI  SG+CFKMES        S+DSP L  LHS C+R++KTAVMP+G+ ELH
Sbjct: 64   CPPLAVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCELH 123

Query: 685  LVAMSSRKNLEQYSCFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDR 864
            LVAM SR +     CFWGF VS GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDR
Sbjct: 124  LVAMYSRNS--DRPCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSFEDR 181

Query: 865  IDALQRKINTEMDPQRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAF 1044
            I+ALQRK+ TE+DPQRV+GM+AE+KRYQDDK+ILKQY ENDQVVENGKVIK+QSEVVPA 
Sbjct: 182  IEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVVPAL 241

Query: 1045 SDNYQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYV 1224
            SDN+QPI+RPLIRLQEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYV
Sbjct: 242  SDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 301

Query: 1225 CTMAERDYALEMWRLLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVI 1404
            CTMAERDYALEMWRLLDPESNLINSKELL R+VCVK+GSRKSL NVF DGICHPKMALVI
Sbjct: 302  CTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVI 361

Query: 1405 DDRLKVWDEKDQSRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGL 1584
            DDRLKVWDEKDQ RVHVVPAFAPYYAPQAEANN +PVLCVARNVACNVRGGFF+EFDEGL
Sbjct: 362  DDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGL 421

Query: 1585 LQLIREVFYEDGLANFPSAPDVSNFLISEDDVSA--SNKDPLGFEGVTDVE-ERRLKDVI 1755
            LQ I E+ YED + + PS PDV N+L+SEDD SA   NKDPL F+G+ D E ERRLK+ I
Sbjct: 422  LQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERRLKEAI 481

Query: 1756 QA----SSMANSLDPRPAPSLQNAVASSLGTISQPTPQGPIMSSQNNQF---TQVVKPLV 1914
             A    SS A +LDPR  PSLQ  + SS  +I     Q  I+S  N QF     VVKP+ 
Sbjct: 482  SATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPVVKPVA 541

Query: 1915 HVGPSEPSLQGSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXX 2094
             V   EPSLQ SPAREEGEVPESELDPDTRRRLLILQHGQDTRDHT  EP F        
Sbjct: 542  PVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPVRPTMQ 601

Query: 2095 XXXXXXXMQSRGSWFPLEEDRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHG 2274
                    QSRGSWF  EE+  PR+ NRA     PKE PL+SE MH +K   RHP FF  
Sbjct: 602  VSVPRG--QSRGSWFAAEEEMSPRQLNRA----APKEFPLDSERMHIEKH--RHPPFFPK 653

Query: 2275 AESSFPSDRMLNENQRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFE 2454
             ESS PSDR+L ENQR  KE    DDRL +NH+ S+Y SF GEE PL QS SS RD  FE
Sbjct: 654  VESSIPSDRLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFE 713

Query: 2455 SGRGSPPYTETPAGVLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTR 2634
            SGR +    ET AGVLQ+I +KCG KVEFR AL+AS +LQFSIE WF GEK+G G G+TR
Sbjct: 714  SGR-TVTSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFAGEKVGEGVGRTR 772

Query: 2635 KEAQRQAADSSLRNLANKYLSNVMHDPSSVRGDLNKPSHVNEMGL-GDSNSFGYQTLPKE 2811
            +EAQRQAA+ S++NLAN YLS +  D  S  GDL++  ++N+ G   + NSFG Q L KE
Sbjct: 773  REAQRQAAEESIKNLANTYLSRIKPDSGSAEGDLSRLHNINDNGFPSNVNSFGNQLLAKE 832

Query: 2812 DXXXXXXXXEPTRFLDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVE 2991
            +        E +R  DPRLE SKKSMGSV+ALKELCMMEGL +VF  QPP S+N + K E
Sbjct: 833  ESLSFSTASEQSRLADPRLEGSKKSMGSVTALKELCMMEGLGVVFQPQPPSSSNALQKDE 892

Query: 2992 VYAQVEIGGQILGKGVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKR 3171
            VYAQVEI GQ+LGKG G TW EAKMQAAE+AL +L+SMLG   Q R GSPR L    +KR
Sbjct: 893  VYAQVEIDGQVLGKGTGLTWEEAKMQAAEKALGSLRSMLGQYSQKRQGSPRSLQGMQNKR 952

Query: 3172 LKPEFPRMLQRINSSTRYSNNAPPVP 3249
            LKPEFPR+LQR+ SS RY  NAPPVP
Sbjct: 953  LKPEFPRVLQRMPSSGRYPKNAPPVP 978


>ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Populus trichocarpa]
            gi|550340277|gb|EEE85528.2| hypothetical protein
            POPTR_0004s04010g [Populus trichocarpa]
          Length = 996

 Score = 1233 bits (3189), Expect = 0.0
 Identities = 663/1008 (65%), Positives = 756/1008 (75%), Gaps = 56/1008 (5%)
 Frame = +1

Query: 394  MFKSVVYQGNSLLGEVEIYPQNQK-------------ICMIDKEIRISHFSQPSERCPPL 534
            M+KSVVY+G+ LLGEVEIY Q Q+             I  I KEIRISHFSQ SERCPPL
Sbjct: 1    MYKSVVYKGDELLGEVEIYAQEQQQEEEENKNKKKRVIDEIVKEIRISHFSQTSERCPPL 60

Query: 535  AVLHTIAPSGVCFKME-------SKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVA 693
            AVLHTI   GVCFKME       +K+  ++SPL  LHSSC++E+KTAVM LG EELHLVA
Sbjct: 61   AVLHTITSIGVCFKMEESTSSSTTKISQQESPLHLLHSSCIQENKTAVMHLGGEELHLVA 120

Query: 694  MSSRKNLEQYSCFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDA 873
            M SR N  Q+ CFWGF V+PGLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDA
Sbjct: 121  MPSRSNERQHPCFWGFSVAPGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDA 180

Query: 874  LQRKINTEMDPQRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDN 1053
            LQRKI+TE+DPQR+ GML+E+KRY DDK+ILKQYVENDQVVENGKVIK QSEVVPA SDN
Sbjct: 181  LQRKISTEVDPQRILGMLSEVKRYHDDKNILKQYVENDQVVENGKVIKTQSEVVPALSDN 240

Query: 1054 YQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTM 1233
            +QP+VRPLIRLQEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTM
Sbjct: 241  HQPMVRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTM 300

Query: 1234 AERDYALEMWRLLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDR 1413
            AERDYALEMWRLLDPESNLINSKELL R+VCVK+G RKSL NVF DGICHPKMALVIDDR
Sbjct: 301  AERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLRKSLFNVFQDGICHPKMALVIDDR 360

Query: 1414 LKVWDEKDQSRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQL 1593
            LKVWDE+DQSRVHVVPAFAPYYAPQAE NN VPVLCVARNVACNVRGGFFKEFDEGLLQ 
Sbjct: 361  LKVWDERDQSRVHVVPAFAPYYAPQAEVNNAVPVLCVARNVACNVRGGFFKEFDEGLLQK 420

Query: 1594 IREVFYEDGLANFPSAPDVSNFLISEDDVSA--SNKDPLGFEGVTDVE-ERRLKDVIQAS 1764
            I EV YED   N PS PDVSN+L+SEDD SA   N+D L F+G+ D E ER+LK+ + AS
Sbjct: 421  IPEVAYEDDTDNIPSPPDVSNYLVSEDDASAVNGNRDQLSFDGMADAEVERQLKEAVSAS 480

Query: 1765 --------SMANSLDPRPAPSLQNAVASSLGTISQPTPQGPIMSSQ-------------- 1878
                    S  +SLDPR   SLQ  +ASS  + S PT Q  +++SQ              
Sbjct: 481  SAILSTIPSTVSSLDPRLLQSLQYTIASS--SSSMPTSQPSMLASQQPMPALQPPKPPSQ 538

Query: 1879 -------NNQFTQV---VKPLVHVGPSEPSLQGSPAREEGEVPESELDPDTRRRLLILQH 2028
                   N QF QV   VK L  V P EPSLQ SPAREEGEVPESELDPDTRRRLLILQH
Sbjct: 539  LSMTPFPNTQFPQVAPSVKQLGQVVPPEPSLQSSPAREEGEVPESELDPDTRRRLLILQH 598

Query: 2029 GQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEEDRIPRKPNRAVPKSVPKES 2208
            G D+RD+  SE PF               +QS GSW P+EE+  PR+ NR      P+E 
Sbjct: 599  GHDSRDNAPSESPF---PARPSTQVSAPRVQSVGSWVPVEEEMSPRQLNR-----TPREF 650

Query: 2209 PLESETMHFDKRRARHPSFFHGAESSFPSDRMLNENQRFPKEVHQGDDRLRVNHSTSNYR 2388
            PL+S+ M+ +K R  HPSFFH  ES+ PSDRM++ENQR PKE    DDR+++NHSTSNY 
Sbjct: 651  PLDSDPMNIEKHRTHHPSFFHKVESNIPSDRMIHENQRQPKEATYRDDRMKLNHSTSNYP 710

Query: 2389 SFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIGIKCGTKVEFRSALIASAE 2568
            SF GEE+PL +S SS+RD   ES R     TETP  VLQEI +KCGTKVEFR ALIA+++
Sbjct: 711  SFQGEESPLSRS-SSNRDLDLESERAFSS-TETPVEVLQEIAMKCGTKVEFRPALIATSD 768

Query: 2569 LQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYLSNVMHDPSSVRGDLNKPS 2748
            LQFSIE WF GEK+G GTGKTR+EAQRQAA+ S++ LA  Y+S V  D   + GD ++  
Sbjct: 769  LQFSIETWFVGEKVGEGTGKTRREAQRQAAEGSIKKLAGIYMSRVKPDSGPMLGDSSRYP 828

Query: 2749 HVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPRLEESKKSMGSVSALKELCMM 2925
              N+ G LGD NSFG Q L K++        EP+R LD RLE SKKSMGSV+ALKE CM 
Sbjct: 829  SANDNGFLGDMNSFGNQPLLKDENITYSATSEPSRLLDQRLEGSKKSMGSVTALKEFCMT 888

Query: 2926 EGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGKGVGATWNEAKMQAAEEALWNLKSM 3105
            EGL + FLAQ PLSTN +   EV+AQVEI GQ+LGKG+G TW+EAKMQAAE+AL +L++M
Sbjct: 889  EGLGVNFLAQTPLSTNSIPGEEVHAQVEIDGQVLGKGIGLTWDEAKMQAAEKALGSLRTM 948

Query: 3106 LGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTRYSNNAPPVP 3249
             G     R GSPRL+    +KRLK EFPR+LQR+ SS RY  NA PVP
Sbjct: 949  FGQYTPKRQGSPRLMQGMPNKRLKQEFPRVLQRMPSSARYHKNASPVP 996


>ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Citrus sinensis]
          Length = 957

 Score = 1227 bits (3175), Expect = 0.0
 Identities = 654/972 (67%), Positives = 746/972 (76%), Gaps = 20/972 (2%)
 Frame = +1

Query: 394  MFKSVVYQGNSLLGEVEIYPQNQ--------KICMIDKEIRISHFSQPSERCPPLAVLHT 549
            M+K+V Y G  +LGEVEIYPQ Q        K   +  EIRIS+FS+ SERCPPLAVLHT
Sbjct: 1    MYKTVAYLGKEILGEVEIYPQQQGEGGEGEEKNKKVFDEIRISYFSEASERCPPLAVLHT 60

Query: 550  IAPSGVCFKMESKLQSKDSPLFSLHSSCLREDKTAVMPLG-EEELHLVAMSSRKNLEQYS 726
            I  SG+CFKMESK  S +  L  LHSSC+RE+KTAVMPLG  EELHLVAM SR N +QY 
Sbjct: 61   ITASGICFKMESK-SSDNIQLHLLHSSCIRENKTAVMPLGLTEELHLVAMYSRNNEKQYP 119

Query: 727  CFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDP 906
            CFW F V  GLYNSCL MLNLRCLGIVFDLDETLIVANTMRSFEDRI+AL RKI+TE+DP
Sbjct: 120  CFWAFSVGSGLYNSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKISTEVDP 179

Query: 907  QRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRL 1086
            QR++GM AE+KRYQDDK+ILKQY ENDQV ENGKVIKVQSEVVPA SD++Q +VRPLIRL
Sbjct: 180  QRIAGMQAEVKRYQDDKNILKQYAENDQVNENGKVIKVQSEVVPALSDSHQALVRPLIRL 239

Query: 1087 QEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWR 1266
            QEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWR
Sbjct: 240  QEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWR 299

Query: 1267 LLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSR 1446
            LLDPESNLIN+KELL R+VCVK+GSRKSL NVF DG CHPKMALVIDDRLKVWD+KDQ R
Sbjct: 300  LLDPESNLINTKELLDRIVCVKSGSRKSLFNVFQDGTCHPKMALVIDDRLKVWDDKDQPR 359

Query: 1447 VHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLA 1626
            VHVVPAFAPYYAPQAEANN +PVLCVARN+ACNVRGGFFKEFDEGLLQ I E+ YED + 
Sbjct: 360  VHVVPAFAPYYAPQAEANNAIPVLCVARNIACNVRGGFFKEFDEGLLQRIPEISYEDDVK 419

Query: 1627 NFPSAPDVSNFLISEDDVSASN--KDPLGFEGVTDVE-ERRLKDVIQASSMANS----LD 1785
            + PS PDVSN+L+SEDD + +N  KDPL F+G+ D E ERRLK+ I AS+  +S    LD
Sbjct: 420  DIPSPPDVSNYLVSEDDAATANGIKDPLSFDGMADAEVERRLKEAIAASATISSAVANLD 479

Query: 1786 PRPAPSLQNAVASSLGTISQPTPQGPIMSSQNNQF---TQVVKPLVHVGPSEPSLQGSPA 1956
            PR AP  Q  + SS  T + PT Q  +M   N QF   T +VKPL HVGP E SLQ SPA
Sbjct: 480  PRLAP-FQYTMPSSSSTTTLPTSQAAVMPLANMQFPPATSLVKPLGHVGPPEQSLQSSPA 538

Query: 1957 REEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSW 2136
            REEGEVPESELDPDTRRRLLILQHG DTR++  SE PF               + SRGSW
Sbjct: 539  REEGEVPESELDPDTRRRLLILQHGMDTRENAPSEAPF---PARTQMQVSVPRVPSRGSW 595

Query: 2137 FPLEEDRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNEN 2316
            FP+EE+  PR+ NRA    VPKE PL SE M  +K R  HPSFF   E+   SDR  +EN
Sbjct: 596  FPVEEEMSPRQLNRA----VPKEFPLNSEAMQIEKHRPPHPSFFPKIENPSTSDRP-HEN 650

Query: 2317 QRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAG 2496
            QR PKE  + DDRLR+NH+ S+Y+SF GEE PL +S SS RD  FESGR     TETP+G
Sbjct: 651  QRMPKEALRRDDRLRLNHTLSDYQSFSGEEIPLSRSSSSSRDVDFESGR-DVSSTETPSG 709

Query: 2497 VLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRN 2676
            VLQ+I +KCGTKVEFR AL+AS ELQFSIE WF GEKIG G G+TR+EAQRQAA+ S+++
Sbjct: 710  VLQDIAMKCGTKVEFRPALVASTELQFSIEAWFAGEKIGEGIGRTRREAQRQAAEGSIKH 769

Query: 2677 LANKYLSNVMHDPSSVRGDLNKPSHVNE-MGLGDSNSFGYQTLPKEDXXXXXXXXEPTRF 2853
            LAN Y+  V  D  S  GD ++ S+ NE   +G+ NSFG Q L K++        EP++ 
Sbjct: 770  LANVYMLRVKSDSGSGHGDGSRFSNANENCFMGEINSFGGQPLAKDE----SLSSEPSKL 825

Query: 2854 LDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGK 3033
            +DPRLE SKK MGSVSALKELCM EGL +VF  QPP S N + K EVYAQVEI GQ+LGK
Sbjct: 826  VDPRLEGSKKLMGSVSALKELCMTEGLGVVFQQQPPSSANSVQKDEVYAQVEIDGQVLGK 885

Query: 3034 GVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINS 3213
            G+G+TW+EAKMQAAE+AL +L+SM G   Q   GSPR L    +KRLKPEFPR+LQR+  
Sbjct: 886  GIGSTWDEAKMQAAEKALGSLRSMFGQFPQKHQGSPRSLQGMPNKRLKPEFPRVLQRMPP 945

Query: 3214 STRYSNNAPPVP 3249
            S RY  NAPPVP
Sbjct: 946  SGRYPKNAPPVP 957


>ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citrus clementina]
            gi|557551913|gb|ESR62542.1| hypothetical protein
            CICLE_v10014168mg [Citrus clementina]
          Length = 957

 Score = 1226 bits (3173), Expect = 0.0
 Identities = 655/972 (67%), Positives = 745/972 (76%), Gaps = 20/972 (2%)
 Frame = +1

Query: 394  MFKSVVYQGNSLLGEVEIYPQNQ--------KICMIDKEIRISHFSQPSERCPPLAVLHT 549
            M+K+V Y G  +LGEVEIYPQ Q        K   +  EIRIS+FS+ SERCPPLAVLHT
Sbjct: 1    MYKTVAYLGKEILGEVEIYPQQQGEGGEGEEKNKKVFDEIRISYFSEASERCPPLAVLHT 60

Query: 550  IAPSGVCFKMESKLQSKDSPLFSLHSSCLREDKTAVMPLG-EEELHLVAMSSRKNLEQYS 726
            I  SG+CFKMESK  S +  L  LHSSC+RE+KTAVM LG  EELHLVAM SR N +QY 
Sbjct: 61   ITASGICFKMESK-SSDNVQLHLLHSSCIRENKTAVMLLGLTEELHLVAMYSRNNEKQYP 119

Query: 727  CFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDP 906
            CFW F V  GLYNSCL MLNLRCLGIVFDLDETLIVANTMRSFEDRI+AL RKI+TE+DP
Sbjct: 120  CFWAFSVGSGLYNSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKISTEVDP 179

Query: 907  QRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRL 1086
            QR++GM AE+KRYQDDK+ILKQY ENDQV ENGKVIKVQSEVVPA SD++Q +VRPLIRL
Sbjct: 180  QRIAGMQAEVKRYQDDKNILKQYAENDQVNENGKVIKVQSEVVPALSDSHQALVRPLIRL 239

Query: 1087 QEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWR 1266
            QEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWR
Sbjct: 240  QEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWR 299

Query: 1267 LLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSR 1446
            LLDPESNLIN+KELL R+VCVK+GSRKSL NVF DG CHPKMALVIDDRLKVWDEKDQSR
Sbjct: 300  LLDPESNLINTKELLDRIVCVKSGSRKSLFNVFQDGTCHPKMALVIDDRLKVWDEKDQSR 359

Query: 1447 VHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLA 1626
            VHVVPAFAPYYAPQAEANN +PVLCVARN+ACNVRGGFFKEFDEGLLQ I E+ YED + 
Sbjct: 360  VHVVPAFAPYYAPQAEANNAIPVLCVARNIACNVRGGFFKEFDEGLLQRIPEISYEDDVK 419

Query: 1627 NFPSAPDVSNFLISEDDVSASN--KDPLGFEGVTDVE-ERRLKDVIQASSMANS----LD 1785
              PS PDVSN+L+SEDD + +N  KDPL F+G+ D E ERRLK+ I AS+  +S    LD
Sbjct: 420  EIPSPPDVSNYLVSEDDAATANGIKDPLSFDGMADAEVERRLKEAIAASATISSAVANLD 479

Query: 1786 PRPAPSLQNAVASSLGTISQPTPQGPIMSSQNNQF---TQVVKPLVHVGPSEPSLQGSPA 1956
            PR AP  Q  + SS  T + PT Q  +M   N QF   T +VKPL HVGP E  LQ SPA
Sbjct: 480  PRLAP-FQYTMPSSSSTTTLPTSQAAVMPLANMQFPPATSLVKPLGHVGPPEQCLQSSPA 538

Query: 1957 REEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSW 2136
            REEGEVPESELDPDTRRRLLILQHG DTR++  SE PF               + SRGSW
Sbjct: 539  REEGEVPESELDPDTRRRLLILQHGMDTRENAPSEAPF---PARTQMQVSVPRVPSRGSW 595

Query: 2137 FPLEEDRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNEN 2316
            FP+EE+  PR+ NRA    VPKE PL SE M  +K R  HPSFF   E+S  SDR  +EN
Sbjct: 596  FPVEEEMSPRQLNRA----VPKEFPLNSEAMQIEKHRPPHPSFFPKIENSITSDRP-HEN 650

Query: 2317 QRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAG 2496
            QR PKE  + DDRLR+NH+ S+Y+SF GEE PL +S SS RD  FESGR     TETP+G
Sbjct: 651  QRMPKEALRRDDRLRLNHTLSDYQSFSGEEIPLSRSSSSSRDVDFESGR-DVSSTETPSG 709

Query: 2497 VLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRN 2676
            VLQ+I +KCGTKVEFR AL+AS ELQFSIE WF GEKIG G G+TR+EAQRQAA+ S+++
Sbjct: 710  VLQDIAMKCGTKVEFRPALVASTELQFSIEAWFAGEKIGEGIGRTRREAQRQAAEGSIKH 769

Query: 2677 LANKYLSNVMHDPSSVRGDLNKPSHVNE-MGLGDSNSFGYQTLPKEDXXXXXXXXEPTRF 2853
            LAN Y+  V  D  S  GD ++ S+ NE   +G+ NSFG Q L K++        EP++ 
Sbjct: 770  LANVYVLRVKSDSGSGHGDGSRFSNANENCFMGEINSFGGQPLAKDE----SLSSEPSKL 825

Query: 2854 LDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGK 3033
            +DPRLE SKK MGSVSALKELCM EGL +VF  QPP S N + K EVYAQVEI GQ+LGK
Sbjct: 826  VDPRLEGSKKLMGSVSALKELCMTEGLGVVFQQQPPSSANSVQKDEVYAQVEIDGQVLGK 885

Query: 3034 GVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINS 3213
            G+G+TW+EAKMQAAE+AL +L+SM G   Q   GSPR L    +KRLKPEFPR+LQR+  
Sbjct: 886  GIGSTWDEAKMQAAEKALGSLRSMFGQFPQKHQGSPRSLQGMPNKRLKPEFPRVLQRMPP 945

Query: 3214 STRYSNNAPPVP 3249
            S RY  NAPPVP
Sbjct: 946  SGRYPKNAPPVP 957


>ref|XP_007214548.1| hypothetical protein PRUPE_ppa000988mg [Prunus persica]
            gi|462410413|gb|EMJ15747.1| hypothetical protein
            PRUPE_ppa000988mg [Prunus persica]
          Length = 940

 Score = 1215 bits (3143), Expect = 0.0
 Identities = 651/971 (67%), Positives = 742/971 (76%), Gaps = 19/971 (1%)
 Frame = +1

Query: 394  MFKSVVYQGNSLLGEVEIYPQ-----NQKICMID--KEIRISHFSQPSERCPPLAVLHTI 552
            M+KSVVY+G  LLGEVEIYP+     N+   ++D  KEIRIS+FSQ SERCPP+AVLHTI
Sbjct: 1    MYKSVVYKGEELLGEVEIYPEENENKNKNKNLVDELKEIRISYFSQSSERCPPVAVLHTI 60

Query: 553  APSGVCFKMESKL-QSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSC 729
            +  GVCFKMESK  QS+D+PLF LHSSC+ E+KTAVMPLG EELHLVAM SR   ++Y C
Sbjct: 61   SSHGVCFKMESKTSQSQDTPLFLLHSSCVMENKTAVMPLGGEELHLVAMRSRNGDKRYPC 120

Query: 730  FWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQ 909
            FWGF V+PGLYNSCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRI+ALQRKI++E+DPQ
Sbjct: 121  FWGFSVAPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKISSEVDPQ 180

Query: 910  RVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQ 1089
            R+SGMLAEIKRYQDDK ILKQY ENDQVVENG+VIK QSE VPA SDN+QPI+RPLIRL 
Sbjct: 181  RISGMLAEIKRYQDDKFILKQYAENDQVVENGRVIKTQSEAVPALSDNHQPIIRPLIRLH 240

Query: 1090 EKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRL 1269
            EKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRL
Sbjct: 241  EKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRL 300

Query: 1270 LDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRV 1449
            LDP+SNLINS +LL R+VCVK+GSRKSL NVF + +CHPKMALVIDDRLKVWD++DQ RV
Sbjct: 301  LDPDSNLINSNKLLDRIVCVKSGSRKSLFNVFQESLCHPKMALVIDDRLKVWDDRDQPRV 360

Query: 1450 HVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLAN 1629
            HVVPAFAPYYAPQAEANN VPVLCVARNVACNVRGGFF+EFD+ LLQ I EVFYED + +
Sbjct: 361  HVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFREFDDSLLQKIPEVFYEDDIKD 420

Query: 1630 FPSAPDVSNFLISEDDVSA--SNKDPLGFEGVTDVE-ERRLKDVIQASSMAN----SLDP 1788
             PS PDVSN+L+SEDD SA   N+DPL F+G+TDVE ERR+K+   A+SM +    S+DP
Sbjct: 421  VPS-PDVSNYLVSEDDSSALNGNRDPLPFDGITDVEVERRMKEATPAASMVSSVFTSIDP 479

Query: 1789 RPAPSLQNAVASSLGTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAR 1959
            R AP LQ  V  S  T+S PT Q  +MS  + QF Q   +VKPL HVG +EPSLQ SPAR
Sbjct: 480  RLAP-LQYTVPPS-STLSLPTTQPSVMSFPSIQFPQAASLVKPLGHVGSAEPSLQSSPAR 537

Query: 1960 EEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWF 2139
            EEGEVPESELDPDTRRRLLILQHGQDTRD   SEPPF                QSR  WF
Sbjct: 538  EEGEVPESELDPDTRRRLLILQHGQDTRDQPPSEPPF---PVRPPMQASVPRAQSRPGWF 594

Query: 2140 PLEEDRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNENQ 2319
            P+EE+  PR+ +R VPK +    PL+ ET+  +K R  H SFF   E+S PSDR+L ENQ
Sbjct: 595  PVEEEMSPRQLSRMVPKDL----PLDPETVQIEKHRPHHSSFFPKVENSIPSDRILQENQ 650

Query: 2320 RFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGV 2499
            R PKE    DDRLR NH+ S Y S  GEE PL +S SS+RD  FESGR +    ETPAGV
Sbjct: 651  RLPKEAFHRDDRLRFNHALSGYHSLSGEEIPLSRSSSSNRDVDFESGR-AISNAETPAGV 709

Query: 2500 LQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNL 2679
            LQEI +KCG K                   WF GEKIG G+GKTR+EA  QAA+ SL+NL
Sbjct: 710  LQEIAMKCGAK------------------AWFAGEKIGEGSGKTRREAHYQAAEGSLKNL 751

Query: 2680 ANKYLSNVMHDPSSVRGDLNKPSHVNEMGL-GDSNSFGYQTLPKEDXXXXXXXXEPTRFL 2856
            AN YLS V  D  SV GD+NK  +VN  G  G+ NSFG Q  PKE+        EP+R L
Sbjct: 752  ANIYLSRVKPDSVSVHGDMNKFPNVNSNGFAGNLNSFGIQPFPKEESLSSSTSSEPSRPL 811

Query: 2857 DPRLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGKG 3036
            DPRLE SKKSM SVS LKELCMMEGL +VF  +PP STN + K EV+ QVEI G++LGKG
Sbjct: 812  DPRLEGSKKSMSSVSTLKELCMMEGLGVVFQPRPPPSTNSVEKDEVHVQVEIDGEVLGKG 871

Query: 3037 VGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSS 3216
            +G TW+EAKMQAAE+AL +L S L    Q R GSPR L   SSKR+K EFP++LQR+ SS
Sbjct: 872  IGLTWDEAKMQAAEKALGSLTSTL--YAQKRQGSPRSLQGMSSKRMKQEFPQVLQRMPSS 929

Query: 3217 TRYSNNAPPVP 3249
             RY  NAPPVP
Sbjct: 930  ARYPKNAPPVP 940


>ref|XP_007025681.1| C-terminal domain phosphatase-like 1 isoform 2 [Theobroma cacao]
            gi|508781047|gb|EOY28303.1| C-terminal domain
            phosphatase-like 1 isoform 2 [Theobroma cacao]
          Length = 984

 Score = 1198 bits (3099), Expect = 0.0
 Identities = 641/951 (67%), Positives = 723/951 (76%), Gaps = 34/951 (3%)
 Frame = +1

Query: 394  MFKSVVYQGNSLLGEVEIYPQNQ--------------KICMID---KEIRISHFSQPSER 522
            M+KSVVY+G  +LGEVEIYPQ Q              KI +++   KEIRI + +Q SER
Sbjct: 4    MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63

Query: 523  CPPLAVLHTIAPSGVCFKMESKLQ-----SKDSP-LFSLHSSCLREDKTAVMPLGEEELH 684
            CPPLAVLHTI  SG+CFKMES        S+DSP L  LHS C+R++KTAVMP+G+ ELH
Sbjct: 64   CPPLAVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCELH 123

Query: 685  LVAMSSRKNLEQYSCFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDR 864
            LVAM SR +     CFWGF VS GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDR
Sbjct: 124  LVAMYSRNS--DRPCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSFEDR 181

Query: 865  IDALQRKINTEMDPQRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAF 1044
            I+ALQRK+ TE+DPQRV+GM+AE+KRYQDDK+ILKQY ENDQVVENGKVIK+QSEVVPA 
Sbjct: 182  IEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVVPAL 241

Query: 1045 SDNYQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYV 1224
            SDN+QPI+RPLIRLQEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYV
Sbjct: 242  SDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 301

Query: 1225 CTMAERDYALEMWRLLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVI 1404
            CTMAERDYALEMWRLLDPESNLINSKELL R+VCVK+GSRKSL NVF DGICHPKMALVI
Sbjct: 302  CTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVI 361

Query: 1405 DDRLKVWDEKDQSRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGL 1584
            DDRLKVWDEKDQ RVHVVPAFAPYYAPQAEANN +PVLCVARNVACNVRGGFF+EFDEGL
Sbjct: 362  DDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGL 421

Query: 1585 LQLIREVFYEDGLANFPSAPDVSNFLISEDDVSA--SNKDPLGFEGVTDVE-ERRLKDVI 1755
            LQ I E+ YED + + PS PDV N+L+SEDD SA   NKDPL F+G+ D E ERRLK+ I
Sbjct: 422  LQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERRLKEAI 481

Query: 1756 QA----SSMANSLDPRPAPSLQNAVASSLGTISQPTPQGPIMSSQNNQF---TQVVKPLV 1914
             A    SS A +LDPR  PSLQ  + SS  +I     Q  I+S  N QF     VVKP+ 
Sbjct: 482  SATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPVVKPVA 541

Query: 1915 HVGPSEPSLQGSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXX 2094
             V   EPSLQ SPAREEGEVPESELDPDTRRRLLILQHGQDTRDHT  EP F        
Sbjct: 542  PVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPVRPTMQ 601

Query: 2095 XXXXXXXMQSRGSWFPLEEDRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHG 2274
                    QSRGSWF  EE+  PR+ NRA     PKE PL+SE MH +K   RHP FF  
Sbjct: 602  VSVPRG--QSRGSWFAAEEEMSPRQLNRA----APKEFPLDSERMHIEKH--RHPPFFPK 653

Query: 2275 AESSFPSDRMLNENQRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFE 2454
             ESS PSDR+L ENQR  KE    DDRL +NH+ S+Y SF GEE PL QS SS RD  FE
Sbjct: 654  VESSIPSDRLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFE 713

Query: 2455 SGRGSPPYTETPAGVLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTR 2634
            SGR +    ET AGVLQ+I +KCG KVEFR AL+AS +LQFSIE WF GEK+G G G+TR
Sbjct: 714  SGR-TVTSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFAGEKVGEGVGRTR 772

Query: 2635 KEAQRQAADSSLRNLANKYLSNVMHDPSSVRGDLNKPSHVNEMGL-GDSNSFGYQTLPKE 2811
            +EAQRQAA+ S++NLAN YLS +  D  S  GDL++  ++N+ G   + NSFG Q L KE
Sbjct: 773  REAQRQAAEESIKNLANTYLSRIKPDSGSAEGDLSRLHNINDNGFPSNVNSFGNQLLAKE 832

Query: 2812 DXXXXXXXXEPTRFLDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVE 2991
            +        E +R  DPRLE SKKSMGSV+ALKELCMMEGL +VF  QPP S+N + K E
Sbjct: 833  ESLSFSTASEQSRLADPRLEGSKKSMGSVTALKELCMMEGLGVVFQPQPPSSSNALQKDE 892

Query: 2992 VYAQVEIGGQILGKGVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPR 3144
            VYAQVEI GQ+LGKG G TW EAKMQAAE+AL +L+SMLG   Q R GSPR
Sbjct: 893  VYAQVEIDGQVLGKGTGLTWEEAKMQAAEKALGSLRSMLGQYSQKRQGSPR 943


>ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Fragaria vesca subsp. vesca]
          Length = 955

 Score = 1196 bits (3093), Expect = 0.0
 Identities = 642/966 (66%), Positives = 738/966 (76%), Gaps = 18/966 (1%)
 Frame = +1

Query: 406  VVYQGNSLLGEVEIYPQ---NQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGVCFK 576
            +VY+G  LLGEVE+YP+   N+KI    KEIRISHFSQ SERCPP+AVLHTI+ +GVCFK
Sbjct: 4    LVYKGEELLGEVEVYPEELNNKKIWDELKEIRISHFSQSSERCPPVAVLHTISSNGVCFK 63

Query: 577  MESKLQSKDSP----LFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFI 744
            MESK  S  S     LF LHSSC+ E+KTAVM LG EELHLVAM SR N +Q+ CFWGF 
Sbjct: 64   MESKSSSSSSQDTSRLFLLHSSCIMENKTAVMNLGVEELHLVAMYSRNNQKQHPCFWGFS 123

Query: 745  VSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGM 924
            VS GLY+SCL MLNLRCLGIVFDLDETLIVANTMRSFEDRI+ LQRKI  E+D QR+SGM
Sbjct: 124  VSSGLYSSCLGMLNLRCLGIVFDLDETLIVANTMRSFEDRIEGLQRKIQCEVDAQRISGM 183

Query: 925  LAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNII 1104
             AEIKRYQDDK ILKQY ENDQVVENG+VIK QSEVVPA SD++QPI+RPLIRLQEKNII
Sbjct: 184  QAEIKRYQDDKFILKQYAENDQVVENGRVIKTQSEVVPALSDSHQPIIRPLIRLQEKNII 243

Query: 1105 LTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPES 1284
            LTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDPES
Sbjct: 244  LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPES 303

Query: 1285 NLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPA 1464
            NLIN+ +LL R+VCVK+G +KSL NVF + +CHPKMALVIDDRLKVWD++DQ RVHVVPA
Sbjct: 304  NLINANKLLDRIVCVKSGLKKSLFNVFQESLCHPKMALVIDDRLKVWDDRDQPRVHVVPA 363

Query: 1465 FAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAP 1644
            FAPYYAPQAEANN VPVLCVARNVAC+VRGGFF+EFD+ LLQ I E+FYED + +F S+P
Sbjct: 364  FAPYYAPQAEANNAVPVLCVARNVACSVRGGFFREFDDSLLQKIPEIFYEDNIKDF-SSP 422

Query: 1645 DVSNFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQA----SSMANSLDPRPAPS 1803
            DVSNFL+SEDD SAS  N+D L F+G+ D E ERRLK+   A    SS  ++ DPR A S
Sbjct: 423  DVSNFLVSEDDASASNGNRDQLPFDGMADAEVERRLKEATSAAPTVSSAVSNNDPRLA-S 481

Query: 1804 LQNAVASSLGTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEGEV 1974
            LQ  V  S  T+S PT Q  +M   N QF Q   +VKPL HVGP++  L  SPAREEGEV
Sbjct: 482  LQYTVPLS-STVSLPTNQPSMMPFHNVQFPQSASLVKPLGHVGPADLGLHSSPAREEGEV 540

Query: 1975 PESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEED 2154
            PESELDPDTRRRLLILQHGQDTR+   SEP F               +QSRG WFP+EE+
Sbjct: 541  PESELDPDTRRRLLILQHGQDTRESVPSEPSF---PVRPQVQVSVPRVQSRGGWFPVEEE 597

Query: 2155 RIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNENQRFPKE 2334
              PRK +R     VPKE PL SE M  +K R+ H +FF   E+S PSDR+L ENQR PKE
Sbjct: 598  MSPRKLSR----MVPKEPPLNSEPMQIEKHRSHHSAFFPKVENSMPSDRILQENQRLPKE 653

Query: 2335 VHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIG 2514
                D+RLR N + S Y SF GEE PL +S SS+RDF +ESGR +    ETPAGVLQEI 
Sbjct: 654  AFHRDNRLRFNQAMSGYHSFSGEEPPLNRSSSSNRDFDYESGR-AISNAETPAGVLQEIA 712

Query: 2515 IKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYL 2694
            +KCGTKVEFR AL+ S ELQF +E WF GEKIG GTG+TR+EA  QAA+ SL+NLAN Y+
Sbjct: 713  MKCGTKVEFRPALVPSTELQFYVEAWFAGEKIGEGTGRTRREAHFQAAEGSLKNLANIYI 772

Query: 2695 SNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPRLE 2871
            S    D   + GD +K S+V   G +G+ NSFG Q LPKED        EP+R LDPRL+
Sbjct: 773  SRGKPDALPIHGDASKFSNVTNNGFMGNMNSFGTQPLPKEDSLSSSTSSEPSRPLDPRLD 832

Query: 2872 ESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGKGVGATW 3051
             S+KS+ SVSALKELC MEGLS+++  +PP   N   K EV+ Q EI G++LGKG+G TW
Sbjct: 833  NSRKSVSSVSALKELCTMEGLSVLYQPRPP-PPNSTEKDEVHVQAEIDGEVLGKGIGLTW 891

Query: 3052 NEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTRYSN 3231
            +EAKMQAAE+AL NL+S L   GQ R GSPR L    SKRLK EFP++LQR+ SSTRYS 
Sbjct: 892  DEAKMQAAEKALGNLRSTL--YGQKRQGSPRPLQGMPSKRLKQEFPQVLQRMPSSTRYSK 949

Query: 3232 NAPPVP 3249
            NAPPVP
Sbjct: 950  NAPPVP 955


>ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Populus trichocarpa]
            gi|550327613|gb|ERP55122.1| hypothetical protein
            POPTR_0011s04910g [Populus trichocarpa]
          Length = 990

 Score = 1192 bits (3084), Expect = 0.0
 Identities = 645/1000 (64%), Positives = 741/1000 (74%), Gaps = 48/1000 (4%)
 Frame = +1

Query: 394  MFKSVVYQGNSLLGEVEIYPQNQK-------------ICMIDKEIRISHFSQPSERCPPL 534
            M+KSVVY+G  LLGEVEIY Q Q+             I  I K IRISHFSQ SERCPPL
Sbjct: 1    MYKSVVYKGEELLGEVEIYAQEQQQEEEENKNKRKRVIDEIVKGIRISHFSQASERCPPL 60

Query: 535  AVLHTIAPSGVCFKMESKLQS--------KDSPLFSLHSSCLREDKTAVMPLGEEELHLV 690
            AVLHTI   GVCFKME    S        ++SPL  LHSSC++E+KTAVM LG EELHLV
Sbjct: 61   AVLHTITSIGVCFKMEESTASSSTKISSQQESPLRLLHSSCIQENKTAVMLLGGEELHLV 120

Query: 691  AMSSRKNLEQYSCFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRID 870
            AM SR N  ++ CFWGF V+ GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFED+I+
Sbjct: 121  AMPSRSNERKHPCFWGFNVASGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDKIE 180

Query: 871  ALQRKINTEMDPQRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSD 1050
            ALQ+KI+TE+D QR+  +++EIKRYQDDK ILKQYVENDQV+ENGKVIK Q EVVPA SD
Sbjct: 181  ALQKKISTEVDQQRILAIISEIKRYQDDKIILKQYVENDQVIENGKVIKTQFEVVPAASD 240

Query: 1051 NYQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCT 1230
            N+QP+VRPLIRL EKNII TRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCT
Sbjct: 241  NHQPLVRPLIRLPEKNIIFTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT 300

Query: 1231 MAERDYALEMWRLLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDD 1410
            MAERDYALEMWRLLDPESNLINS ELL R+VCV +GSRKSL NVF DGICHPKMALVIDD
Sbjct: 301  MAERDYALEMWRLLDPESNLINSNELLDRIVCVSSGSRKSLFNVFQDGICHPKMALVIDD 360

Query: 1411 RLKVWDEKDQSRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQ 1590
            R+ VWDEKDQSRVHVVPAFAPYYAPQAEANN VP+LCVARNVACNVRGGFFKEFDEGLLQ
Sbjct: 361  RMNVWDEKDQSRVHVVPAFAPYYAPQAEANNAVPILCVARNVACNVRGGFFKEFDEGLLQ 420

Query: 1591 LIREVFYEDGLANFPSAPDVSNFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQA 1761
             I EV YED  +N PS PDVSN+L+SEDD SA+  N+DP  F+   D E ERRLK+ + A
Sbjct: 421  KIPEVAYEDDTSNIPSPPDVSNYLVSEDDASAANGNRDPPSFDSTADAEVERRLKEAVSA 480

Query: 1762 S--------SMANSLDPRPAPSLQNAVA--SSLGTISQPT---PQGPIMSSQ-------N 1881
            S        S  +SLDPR   SLQ AVA  SSL   SQP+    Q P+ +SQ       N
Sbjct: 481  SSTIPSTIPSTVSSLDPRLLQSLQYAVASSSSLMPASQPSMLASQQPVPASQTSMMPFPN 540

Query: 1882 NQFTQV---VKPLVHVGPSEPSLQGSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHT 2052
             QF QV   VK L  V   EPSLQ SPAREEGEVPESELDPDTRRRLLILQHGQD+RD+ 
Sbjct: 541  TQFPQVAPLVKQLGQVVHPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDSRDNA 600

Query: 2053 SSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEEDRIPRKPNRAVPKSVPKESPLESETMH 2232
             SE PF               +QSRGSW P+EE+  PR+ NR      P+E PL+S+ M+
Sbjct: 601  PSESPF---PARPSAPVSAAHVQSRGSWVPVEEEMTPRQLNR-----TPREFPLDSDPMN 652

Query: 2233 FDKRRARHPSFFHGAESSFPSDRMLNENQRFPKEVHQGDDRLRVNHSTSNYRSFPGEETP 2412
             +K +  HPSFF   ES+ PSDRM++ENQR PKE    +DR+R+NHST NY SF  EETP
Sbjct: 653  IEKHQTHHPSFFPKVESNIPSDRMIHENQRLPKEAPYRNDRMRLNHSTPNYHSFQVEETP 712

Query: 2413 LGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIGIKCGTKVEFRSALIASAELQFSIEVW 2592
            L +S SS+RD   ES R     +ETP  VLQEI +KC TKVEFR AL+AS +LQFSIE W
Sbjct: 713  LSRS-SSNRDLDLESERAF-TISETPVEVLQEIAMKCETKVEFRPALVASIDLQFSIEAW 770

Query: 2593 FTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYLSNVMHDPSSVRGDLNKPSHVNEMG-L 2769
            F GEK+G GTGKTR+EAQRQAA+ S++ LA  Y+     D   + GD ++    N+ G L
Sbjct: 771  FAGEKVGEGTGKTRREAQRQAAEGSIKKLAGIYMLRAKPDSGPMHGDSSRYPSANDNGFL 830

Query: 2770 GDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPRLEESKKSMGSVSALKELCMMEGLSLVFL 2949
            G+ N FG Q LPK++        EP+R LDPRLE SKKS GSV+ALKE C MEGL + FL
Sbjct: 831  GNMNLFGNQPLPKDELVAYSAASEPSRLLDPRLEGSKKSSGSVTALKEFCTMEGLVVNFL 890

Query: 2950 AQPPLSTNLMNKVEVYAQVEIGGQILGKGVGATWNEAKMQAAEEALWNLKSMLGDSGQNR 3129
            AQ PLS N +   EV+AQVEI GQ+LGKG+G+TW+EAKMQAAE+AL +L++M G   Q R
Sbjct: 891  AQTPLSANSIPGEEVHAQVEIDGQVLGKGIGSTWDEAKMQAAEKALGSLRTMFGQYTQKR 950

Query: 3130 VGSPRLLHSFSSKRLKPEFPRMLQRINSSTRYSNNAPPVP 3249
             GSPR +    +KRLK EFPR+LQR+  S RY  NAPPVP
Sbjct: 951  QGSPRPMQGMPNKRLKQEFPRVLQRMPPSARYHKNAPPVP 990


>ref|XP_002519032.1| double-stranded RNA binding protein, putative [Ricinus communis]
            gi|223541695|gb|EEF43243.1| double-stranded RNA binding
            protein, putative [Ricinus communis]
          Length = 978

 Score = 1184 bits (3062), Expect = 0.0
 Identities = 632/985 (64%), Positives = 738/985 (74%), Gaps = 33/985 (3%)
 Frame = +1

Query: 394  MFKSVVYQGNSLLGEVEIYPQNQK--------------------ICMIDKEIRISHFSQP 513
            M+KSVVY+G+ LLGEVEIY Q ++                    I  I K IRISHFSQ 
Sbjct: 1    MYKSVVYKGDELLGEVEIYAQQEQKLQQQEELQEQEQELKKKRVIDEILKGIRISHFSQA 60

Query: 514  SERCPPLAVLHTIAPSGVCFKMESKLQ-SKDSPLFSLHSSCLREDKTAVMPL-GEEELHL 687
            SERCPPLAVLHTI  +G+CFKMESK   S D+PL  LHSSC++E KTAV+ L G EELHL
Sbjct: 61   SERCPPLAVLHTITTNGICFKMESKNSVSLDTPLHLLHSSCIQESKTAVVLLQGGEELHL 120

Query: 688  VAMSSRKNLEQYSCFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRI 867
            VAM SR +  QY CFW F +S GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRI
Sbjct: 121  VAMFSRNDERQYPCFWAFNISSGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRI 180

Query: 868  DALQRKINTEMDPQRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFS 1047
            +ALQRKI+TE+DPQR+SGML+E+KRYQDDK+ILKQYV+NDQVVENG+VIK Q EVVPA S
Sbjct: 181  EALQRKISTELDPQRISGMLSEVKRYQDDKTILKQYVDNDQVVENGRVIKTQFEVVPALS 240

Query: 1048 DNYQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVC 1227
            DN+Q IVRPLIRLQE+NIILTRINP IRDTSVLVRLRPAWE+LRSYLTA+GRKRFEVYVC
Sbjct: 241  DNHQTIVRPLIRLQERNIILTRINPQIRDTSVLVRLRPAWEELRSYLTARGRKRFEVYVC 300

Query: 1228 TMAERDYALEMWRLLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVID 1407
            TMAERDYALEMWRLLDPESNLINSKELL R+VCVK+G RKSL NVF DGICHPKMALVID
Sbjct: 301  TMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLRKSLFNVFQDGICHPKMALVID 360

Query: 1408 DRLKVWDEKDQSRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLL 1587
            DRLKVWDEKDQ RVHVVPAFAPYYAPQAEANN VPVLCVARNVACNVRGGFFKEFDEGLL
Sbjct: 361  DRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDEGLL 420

Query: 1588 QLIREVFYEDGLANFPSAPDVSNFLISEDD--VSASNKDPLGFEGVTDVE-ERRLKDVIQ 1758
            Q I E+ +ED + + PS PDVSN+L+ EDD   S  N+DPL F+G+ D E E+RLK+ I 
Sbjct: 421  QRIPEISFEDDMNDIPSPPDVSNYLVPEDDAFTSNGNRDPLSFDGMADAEVEKRLKEAIS 480

Query: 1759 ASSMANS----LDPRPAPSLQNAVASSLGTISQPTPQGPIMSSQNNQFTQ---VVKPLVH 1917
             SS   S    LD R  P LQ  +ASS  +I  PT Q  +++  + Q  Q   +VKPL  
Sbjct: 481  ISSAFPSTVANLDARLVPPLQYTMASS-SSIPVPTSQPAVVTFPSMQLPQAAPLVKPLGQ 539

Query: 1918 VGPSEPSLQGSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXX 2097
            V PSEPSLQ SPAREEGEVPESELDPDTRRRLLILQHGQD RD   SE PF         
Sbjct: 540  VVPSEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDLRDPAPSESPF-PVRPSNSM 598

Query: 2098 XXXXXXMQSRGSWFPLEEDRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGA 2277
                  +QSRG+W P+EE+  PR+ NRA    V +E P+++E MH DK R  HPSFF   
Sbjct: 599  QVSVPRVQSRGNWVPVEEEMSPRQLNRA----VTREFPMDTEPMHIDKHRPHHPSFFPKV 654

Query: 2278 ESSFPSDRMLNENQRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFES 2457
            ESS PS+RM +ENQR PK     DDRLR+N + SNY+S  GEE  L +S SS+RD   ES
Sbjct: 655  ESSIPSERMPHENQRLPKVAPYKDDRLRLNQTMSNYQSLSGEENSLSRSSSSNRDLDVES 714

Query: 2458 GRGSPPYTETPAGVLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRK 2637
             R +    ETP  VL EI +KCG KVEF+ +L+ S +LQFS+E WF GE++G G G+TR+
Sbjct: 715  DR-AVSSAETPVRVLHEISMKCGAKVEFKHSLVNSRDLQFSVEAWFAGERVGEGFGRTRR 773

Query: 2638 EAQRQAADSSLRNLANKYLSNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKED 2814
            EAQ  AA++S++NLAN Y+S    D  ++ GD +K S  N+ G LG  NSFG Q LPK++
Sbjct: 774  EAQSVAAEASIKNLANIYISRAKPDNGALHGDASKYSSANDNGFLGHVNSFGSQPLPKDE 833

Query: 2815 XXXXXXXXEPTRFLDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEV 2994
                    E +  LDPRLE SKKSM SV+ALKE CMMEGL + FLAQ PLS+N +   EV
Sbjct: 834  ILSYSDSSEQSGLLDPRLESSKKSMSSVNALKEFCMMEGLGVNFLAQTPLSSNSVQNAEV 893

Query: 2995 YAQVEIGGQILGKGVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRL 3174
            +AQVEI GQ++GKG+G+T++EAKMQAAE+AL +L++  G     R GSPR +    +K L
Sbjct: 894  HAQVEIDGQVMGKGIGSTFDEAKMQAAEKALGSLRTTFGRFPPKRQGSPRPVPGMPNKHL 953

Query: 3175 KPEFPRMLQRINSSTRYSNNAPPVP 3249
            KPEFPR+LQR+ SS RY  NAPPVP
Sbjct: 954  KPEFPRVLQRMPSSARYPKNAPPVP 978


>ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
          Length = 956

 Score = 1183 bits (3060), Expect = 0.0
 Identities = 627/969 (64%), Positives = 751/969 (77%), Gaps = 16/969 (1%)
 Frame = +1

Query: 391  KMFKSVVYQGNSLLGEVEIYPQ---NQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPS 561
            +M+KSVVYQG  ++GEV++YP+   N K   + KEIRISHFSQPSERCPPLAVLHT+   
Sbjct: 2    RMYKSVVYQGEVVVGEVDVYPEENNNYKNFHV-KEIRISHFSQPSERCPPLAVLHTVTSC 60

Query: 562  GVCFKMESKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGF 741
            GVCFKMESK Q +D  LF LHS C+RE+KTAVMPLG EE+HLVAM SR N+++  CFWGF
Sbjct: 61   GVCFKMESKTQQQDG-LFQLHSLCIRENKTAVMPLGGEEIHLVAMHSR-NVDR-PCFWGF 117

Query: 742  IVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSG 921
            IV+ GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN+E+DPQR+SG
Sbjct: 118  IVALGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISG 177

Query: 922  MLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNI 1101
            M AE+KRYQDDK+ILKQY ENDQVV+NG+VIKVQSE+VPA SD++QPIVRPLIRLQ+KNI
Sbjct: 178  MQAEVKRYQDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRLQDKNI 237

Query: 1102 ILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPE 1281
            ILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDP+
Sbjct: 238  ILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPD 297

Query: 1282 SNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVP 1461
            SNLINSKELL R+VCVK+G +KSL NVF DG+CHPKMALVIDDRLKVWDEKDQ RVHVVP
Sbjct: 298  SNLINSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVP 357

Query: 1462 AFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSA 1641
            AFAPYYAPQAEA+N +PVLCVARNVACNVRGGFFK+FD+GLLQ I ++ YED + + PS 
Sbjct: 358  AFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKDIPSP 417

Query: 1642 PDVSNFLISEDDVSASN--KDPLGFEGVTDVE-ERRLKDVIQASS----MANSLDPRPAP 1800
            PDVSN+L+SEDD S SN  +DP  F+G+ D E ER+LKD + A+S       +LDPR   
Sbjct: 418  PDVSNYLVSEDDGSISNGHRDPFLFDGMADAEVERKLKDALSAASTIPVTTANLDPR-LT 476

Query: 1801 SLQNAVASSLGTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEGE 1971
            SLQ  +  S G++  PT Q  +M   + QF Q   +VKP+    PSEPSL  SPAREEGE
Sbjct: 477  SLQYTMVPS-GSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSEPSLHSSPAREEGE 535

Query: 1972 VPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEE 2151
            VPESELDPDTRRRLLILQHGQDTRDH S+EPPF                 SRG WFP EE
Sbjct: 536  VPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQTSAPHVP--SSRGVWFPAEE 593

Query: 2152 DRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-NQRFP 2328
            +   +  NR     VPKE P++S  +   K R  HPSFF   ESS  SDR+L++ +QR P
Sbjct: 594  EIGSQPLNRV----VPKEFPVDSGPLGIAKPRPHHPSFFSKVESSISSDRILHDSHQRLP 649

Query: 2329 KEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQE 2508
            KE++  DDR R+NH  S+YRSF G++ P  +S SS RD   ESG  S  + +TP  VLQE
Sbjct: 650  KEMYHRDDRPRLNHMLSSYRSFSGDDIPFSRSFSSHRDLDSESGH-SVLHADTPVAVLQE 708

Query: 2509 IGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANK 2688
            I +KCGTKV+F S+L+AS ELQFS+E WF+G+KIG   G+TRKEAQ +AA+ S+++LA+ 
Sbjct: 709  IALKCGTKVDFISSLVASTELQFSMEAWFSGKKIGHRVGRTRKEAQNKAAEDSIKHLADI 768

Query: 2689 YLSNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPR 2865
            YLS+   +P S  GD++   +VN+ G +G ++S G Q L KED         P+R LDPR
Sbjct: 769  YLSSAKDEPGSTYGDVSGFPNVNDSGYMGIASSLGNQPLSKED-SASFSTASPSRVLDPR 827

Query: 2866 LEESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILGKGVG 3042
            L+ SK+SMGS+S+LKELCMMEGL + FL+ P P+STN + K EV+AQVEI G++ GKG+G
Sbjct: 828  LDVSKRSMGSISSLKELCMMEGLDVNFLSAPAPVSTNSVQKDEVHAQVEIDGKVFGKGIG 887

Query: 3043 ATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTR 3222
             TW+EAKMQAAE+AL +L+S LG S Q R  SPR    FS+KRLK E+PR +QR+ SS R
Sbjct: 888  LTWDEAKMQAAEKALGSLRSKLGQSIQKRQSSPRPHQGFSNKRLKQEYPRPMQRMPSSAR 947

Query: 3223 YSNNAPPVP 3249
            Y  NAPP+P
Sbjct: 948  YPRNAPPIP 956


>emb|CAN72816.1| hypothetical protein VITISV_004100 [Vitis vinifera]
          Length = 894

 Score = 1180 bits (3053), Expect = 0.0
 Identities = 637/960 (66%), Positives = 724/960 (75%), Gaps = 8/960 (0%)
 Frame = +1

Query: 394  MFKSVVYQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGVCF 573
            M+KS+VY+G+ ++GEVEIYPQNQ + ++ KEIRISH+SQPSERCPPLAVLHTI   GVCF
Sbjct: 1    MYKSIVYEGDDVVGEVEIYPQNQGLELM-KEIRISHYSQPSERCPPLAVLHTITSCGVCF 59

Query: 574  KMES-KLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFIVS 750
            KMES K QS+D+PL+ LHS+C+RE+KTAVM LGEEELHLVAM S+K   QY CFWGF V+
Sbjct: 60   KMESSKAQSQDTPLYLLHSTCIRENKTAVMSLGEEELHLVAMYSKKKDGQYPCFWGFNVA 119

Query: 751  PGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGMLA 930
             GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTE+DPQR+SGM+A
Sbjct: 120  LGLYSSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEVDPQRISGMVA 179

Query: 931  EIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNIILT 1110
            E                   VVENGK+ K Q E+VPA SDN+QPIVRPLIRLQEKNIILT
Sbjct: 180  E-------------------VVENGKLFKTQPEIVPALSDNHQPIVRPLIRLQEKNIILT 220

Query: 1111 RINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPESNL 1290
            RINPLIRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDPESNL
Sbjct: 221  RINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESNL 280

Query: 1291 INSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPAFA 1470
            INSKELL R+VCVK+GSRKSL NVF DGICHPKMALVIDDRLKVWDEKDQ RVHVVPAFA
Sbjct: 281  INSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVIDDRLKVWDEKDQPRVHVVPAFA 340

Query: 1471 PYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAPDV 1650
            PYYAPQAEANN + VLCVARNVACNVRGGFFKEFDEGLLQ I E+ YED + +  SAPDV
Sbjct: 341  PYYAPQAEANNAISVLCVARNVACNVRGGFFKEFDEGLLQRIPEISYEDBIKDIRSAPDV 400

Query: 1651 SNFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQASSMANSLDPRPAPSLQNAVA 1821
            SN+L+SEDD S S  N+D   F+G+ DVE ER+LKD I A S   SLDPR +P LQ AVA
Sbjct: 401  SNYLVSEDDASVSNGNRDQPCFDGMADVEVERKLKDAISAPSTVTSLDPRLSPPLQFAVA 460

Query: 1822 SSLGTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEGEVPESELD 1992
            +S G   QP  QG IM   N QF Q   ++KPL      EP++Q SPAREEGEVPESELD
Sbjct: 461  ASSGLAPQPAAQGSIMPFSNKQFPQSASLIKPLA----PEPTMQSSPAREEGEVPESELD 516

Query: 1993 PDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEEDRIPRKP 2172
            PDTRRRLLILQHGQDTR+H SS+PPF               +QSRGSWFP +E+  PR+ 
Sbjct: 517  PDTRRRLLILQHGQDTREHASSDPPF---PVRPPIQVSVPRVQSRGSWFPADEEMSPRQL 573

Query: 2173 NRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNENQRFPKEVHQGDD 2352
            NRA    VPKE PL+S+TMH +K R  HPSFFH  ESS  SDR+L+ENQR  KEV   DD
Sbjct: 574  NRA----VPKEFPLDSDTMHIEKHRPHHPSFFHKVESSASSDRILHENQRLSKEVLHRDD 629

Query: 2353 RLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIGIKCGTK 2532
            RLR+NHS   Y SF GEE PLG+S SS+RD  FESGRG+ PY ETPA     +G+     
Sbjct: 630  RLRLNHSLPGYHSFSGEEVPLGRS-SSNRDLDFESGRGA-PYAETPA-----VGL----- 677

Query: 2533 VEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYLSNVMHD 2712
                        L+   EVW  GEKIG GTGKTR+EAQ QAA++SL  L+ +YL      
Sbjct: 678  ------------LRNCNEVWNQGEKIGEGTGKTRREAQCQAAEASLMYLSYRYL------ 719

Query: 2713 PSSVRGDLNK-PSHVNEMGLGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPRLEESKKSM 2889
                 GD+N+ P+  +   + D+NSFGYQ+ PKE         E +R LDPRLE SKKSM
Sbjct: 720  ----HGDVNRFPNASDNNFMSDTNSFGYQSFPKEGSMSFSTASESSRLLDPRLESSKKSM 775

Query: 2890 GSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGKGVGATWNEAKMQ 3069
            GS+SALKELCMMEGL + FL+QPPLS+N   K E+ AQVEI GQ+LGKG G+TW++AKMQ
Sbjct: 776  GSISALKELCMMEGLGVEFLSQPPLSSNSTQKEEICAQVEIDGQVLGKGTGSTWDDAKMQ 835

Query: 3070 AAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTRYSNNAPPVP 3249
            AAE+AL +LKSMLG   Q R GSPR L     KRLK EF R LQR  SS RYS N  PVP
Sbjct: 836  AAEKALGSLKSMLGQFSQKRQGSPRSLQGM-GKRLKSEFTRGLQRTPSSGRYSKNTSPVP 894


>ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Glycine max]
          Length = 960

 Score = 1173 bits (3034), Expect = 0.0
 Identities = 621/973 (63%), Positives = 742/973 (76%), Gaps = 21/973 (2%)
 Frame = +1

Query: 394  MFKSVVYQGNSLLGEVEIYPQ--------NQKICMIDKEIRISHFSQPSERCPPLAVLHT 549
            M+KSVVYQG  ++GEV++YP+        N       KEIRISHFSQPSERCPPLAVLHT
Sbjct: 1    MYKSVVYQGEVVVGEVDVYPEENNNNNNKNYNKNFHVKEIRISHFSQPSERCPPLAVLHT 60

Query: 550  IAPSGVCFKMESKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSC 729
            +   GVCFKMESK Q +D  LF LHS C+RE+KTAVMPLG EE+HLVAM SR   +   C
Sbjct: 61   VTSCGVCFKMESKTQQQDG-LFQLHSLCIRENKTAVMPLGGEEIHLVAMHSRN--DDRPC 117

Query: 730  FWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQ 909
            FWGFIV+ GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN+E+DPQ
Sbjct: 118  FWGFIVTLGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQ 177

Query: 910  RVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQ 1089
            R+SGM AE+KRY DDK+ILKQY ENDQVV+NG+VIKVQSE+VPA SD++QPIVRPLIRLQ
Sbjct: 178  RISGMQAEVKRYLDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRLQ 237

Query: 1090 EKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRL 1269
            +KNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRL
Sbjct: 238  DKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRL 297

Query: 1270 LDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRV 1449
            LDP+SNLINSKELL R+VCVK+G +KSL NVF DG C PKMALVIDDRLKVWDE+DQ RV
Sbjct: 298  LDPDSNLINSKELLGRIVCVKSGLKKSLFNVFQDGSCDPKMALVIDDRLKVWDERDQPRV 357

Query: 1450 HVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLAN 1629
            HVVPAFAPYYAPQAEA+N +PVLCVARNVACNVRGGFFK+FD+GLLQ I ++ YED + +
Sbjct: 358  HVVPAFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKD 417

Query: 1630 FPSAPDVSNFLISEDD--VSASNKDPLGFEGVTDVE-ERRLKDVIQASS----MANSLDP 1788
             PS PDVSN+L+SEDD  +S  N+DP  F+G+ D E ER+LKD + A+S       +LDP
Sbjct: 418  VPSPPDVSNYLVSEDDGSISNGNRDPFLFDGMADAEVERKLKDALAAASTFPVTTANLDP 477

Query: 1789 RPAPSLQNAVASSLGTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAR 1959
            R   SLQ  +  S G++  PT Q  +M   + QF Q   +VKP+    PS+PSL  SPAR
Sbjct: 478  R-LTSLQYTMVPS-GSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSDPSLHSSPAR 535

Query: 1960 EEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWF 2139
            EEGEVPESELDPDTRRRLLILQHGQDTRDH S+EPPF                 SRG WF
Sbjct: 536  EEGEVPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQASAPRVP--SSRGVWF 593

Query: 2140 PLEEDRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-N 2316
            P+EE+   +  NR     VPKE P++S  +  +K R  HPSFF+  ESS  SDR+L++ +
Sbjct: 594  PVEEEIGSQPLNRV----VPKEFPVDSGPLGIEKPRLHHPSFFNKVESSISSDRILHDSH 649

Query: 2317 QRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAG 2496
            QR PKE++  DDR R+NH  S+YRSF G++ P  +S SS RD   ESG  S  + +TP  
Sbjct: 650  QRLPKEMYHRDDRPRLNHMLSSYRSFSGDDIPFSRSSSSHRDLDSESGH-SVLHADTPVA 708

Query: 2497 VLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRN 2676
            VL EI +KCGTKV+F S+L+AS EL+FS+E WF+G+KIG G G+TRKEAQ +AA  S+ +
Sbjct: 709  VLHEIALKCGTKVDFMSSLVASTELKFSLEAWFSGKKIGHGFGRTRKEAQNKAAKDSIEH 768

Query: 2677 LANKYLSNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXEPTRF 2853
            LA+ YLS+   +P S  GD++   +VN+ G +G ++S G Q L KED         P+R 
Sbjct: 769  LADIYLSSAKDEPGSTYGDVSGFPNVNDNGYMGIASSLGNQPLSKED-SASFSSASPSRA 827

Query: 2854 LDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILG 3030
            LDPRL+ SK+SMGS+SALKELCMMEGL + FL+ P P+STN + K EV+AQVEI G+I G
Sbjct: 828  LDPRLDVSKRSMGSISALKELCMMEGLGVNFLSTPAPVSTNSVQKDEVHAQVEIDGKIFG 887

Query: 3031 KGVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRIN 3210
            KG+G TW+EAKMQAAE+AL NL+S LG S Q    SPR    FS+KRLK E+PR +QR+ 
Sbjct: 888  KGIGLTWDEAKMQAAEKALGNLRSKLGQSIQKMQSSPRPHQGFSNKRLKQEYPRTMQRMP 947

Query: 3211 SSTRYSNNAPPVP 3249
            SS RY  NAPP+P
Sbjct: 948  SSARYPRNAPPIP 960


>ref|XP_007159305.1| hypothetical protein PHAVU_002G226900g [Phaseolus vulgaris]
            gi|561032720|gb|ESW31299.1| hypothetical protein
            PHAVU_002G226900g [Phaseolus vulgaris]
          Length = 964

 Score = 1164 bits (3012), Expect = 0.0
 Identities = 623/977 (63%), Positives = 737/977 (75%), Gaps = 25/977 (2%)
 Frame = +1

Query: 394  MFKSVVYQGNSLLGEVEIYPQNQKICMID-KEIRISHFSQPSERCPPLAVLHTIAPSGVC 570
            M+KSVVYQG  +LGEVE+YP+         KEIRISHFSQPSERCPPLAVLHT+   GVC
Sbjct: 1    MYKSVVYQGEVVLGEVEVYPEENNYKNFHVKEIRISHFSQPSERCPPLAVLHTVTSCGVC 60

Query: 571  FKMESKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFIVS 750
            FKMESK Q +D  LF LHS C+RE+KTAV+PLG EE+HLVAM SR   +    FWGFIV+
Sbjct: 61   FKMESKTQQQDG-LFHLHSLCIRENKTAVIPLGGEEIHLVAMHSRN--DDRPRFWGFIVA 117

Query: 751  PGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGMLA 930
             GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN+E+DPQR+SGM A
Sbjct: 118  LGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISGMQA 177

Query: 931  EIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNIILT 1110
            E+KRYQ+DK+ILKQY ENDQVV+NG+V+KVQSE+VPA SDN+QPIVRPLIRLQ+KNIILT
Sbjct: 178  EVKRYQEDKNILKQYAENDQVVDNGRVVKVQSEIVPALSDNHQPIVRPLIRLQDKNIILT 237

Query: 1111 RINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPESNL 1290
            RINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDP+SNL
Sbjct: 238  RINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNL 297

Query: 1291 INSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPAFA 1470
            INSKELL R+VCVK+G +KSL NVF DG+CHPKMALVIDDRLKVWDEKDQ RVHVVPAFA
Sbjct: 298  INSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAFA 357

Query: 1471 PYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAPDV 1650
            PYYAPQAEA+N +PVLCVARNVACNVRGGFFKEFD+GLLQ I +V YED + + P  PDV
Sbjct: 358  PYYAPQAEASNSIPVLCVARNVACNVRGGFFKEFDDGLLQKIPQVAYEDDIKDIPIPPDV 417

Query: 1651 SNFLISEDD----VSASNKDPLGFEGVTDVE-ERRLK---------DVIQASS----MAN 1776
            SN+L+SEDD    +S  N+DP  F+ + D E ER+ K         D + A+S       
Sbjct: 418  SNYLVSEDDGSSAISNGNRDPFLFDSMGDAEVERKSKVPTRAPNEHDALSAASTIPVTTA 477

Query: 1777 SLDPRPAPSLQNAVASSLGTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQG 1947
            +LDPR   SLQ A+ SS G+   PT Q  +M   + QF Q   +VKP+    PSE SL  
Sbjct: 478  NLDPR-LTSLQYAMVSS-GSAPPPTAQASMMPFTHVQFPQPAALVKPMGQAAPSESSLHS 535

Query: 1948 SPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSR 2127
            SPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTS+EP +               + SR
Sbjct: 536  SPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTSNEPTY---AIRHPVPVSAPRVSSR 592

Query: 2128 GSWFPLEEDRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRML 2307
            G WFP EED   +  NR     VPKE  ++S ++  +K R  HPSFF   ESS  SDR+L
Sbjct: 593  GGWFPAEEDIGSQPLNRV----VPKEFSVDSGSLVIEKHRPHHPSFFSKVESSISSDRIL 648

Query: 2308 NE-NQRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTE 2484
            ++ +QR PKE++  DDR R NH  S+YRS   +E P  +S SS RD   ES   S  + +
Sbjct: 649  HDSHQRLPKEMYHRDDRPRSNHMLSSYRSLSVDEIPFSRSSSSHRDLDSESSH-SVFHAD 707

Query: 2485 TPAGVLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADS 2664
            TP  VLQEI +KCGTKVEF S+L+AS ELQFSIE WF+G+KIG G G+TRKEAQ +AA+ 
Sbjct: 708  TPVVVLQEIALKCGTKVEFMSSLVASTELQFSIEAWFSGKKIGHGFGRTRKEAQHKAAED 767

Query: 2665 SLRNLANKYLSNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXE 2841
            S+++LA+ YLS+   +P S  GD+    + N+ G +  ++S   Q LPKED        +
Sbjct: 768  SIKHLADIYLSSAKDEPGSTYGDVGGFPNANDNGYMVIASSLSNQPLPKEDSASFSTASD 827

Query: 2842 PTRFLDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGG 3018
            P+R LDPRLE SK+ MGS+SALKELCMMEGL + FL+ P P+STN + K EV+AQVEI G
Sbjct: 828  PSRVLDPRLEVSKRPMGSISALKELCMMEGLGVNFLSAPAPVSTNSLQKDEVHAQVEIDG 887

Query: 3019 QILGKGVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRML 3198
            ++ GKG+G TW+EAKMQAAE+AL +L+S LG S Q R  SPR    FS+KRLK E+PR +
Sbjct: 888  KVFGKGIGLTWDEAKMQAAEKALGSLRSKLGQSIQKRQSSPRSHQGFSNKRLKQEYPRAM 947

Query: 3199 QRINSSTRYSNNAPPVP 3249
            QRI SSTRY  NAPP+P
Sbjct: 948  QRIPSSTRYPRNAPPIP 964


>ref|XP_006583810.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X2 [Glycine max]
          Length = 929

 Score = 1154 bits (2986), Expect = 0.0
 Identities = 616/969 (63%), Positives = 737/969 (76%), Gaps = 16/969 (1%)
 Frame = +1

Query: 391  KMFKSVVYQGNSLLGEVEIYPQ---NQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPS 561
            +M+KSVVYQG  ++GEV++YP+   N K   + KEIRISHFSQPSERCPPLAVLHT+   
Sbjct: 2    RMYKSVVYQGEVVVGEVDVYPEENNNYKNFHV-KEIRISHFSQPSERCPPLAVLHTVTSC 60

Query: 562  GVCFKMESKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGF 741
            GVCFKMESK Q +D  LF LHS C+RE+KTAVMPLG EE+HLVAM SR N+++  CFWGF
Sbjct: 61   GVCFKMESKTQQQDG-LFQLHSLCIRENKTAVMPLGGEEIHLVAMHSR-NVDR-PCFWGF 117

Query: 742  IVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSG 921
            IV+ GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN+E+DPQR+SG
Sbjct: 118  IVALGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISG 177

Query: 922  MLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNI 1101
            M AE+KRYQDDK+ILKQY ENDQVV+NG+VIKVQSE+VPA SD++QPIVRPLIRLQ+KNI
Sbjct: 178  MQAEVKRYQDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRLQDKNI 237

Query: 1102 ILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPE 1281
            ILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDP+
Sbjct: 238  ILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPD 297

Query: 1282 SNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVP 1461
            SNLINSKELL R+VCVK+G +KSL NVF DG+CHPKMALVIDDRLKVWDEKDQ RVHVVP
Sbjct: 298  SNLINSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVP 357

Query: 1462 AFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSA 1641
            AFAPYYAPQAEA+N +PVLCVARNVACNVRGGFFK+FD+GLLQ I ++ YED + + PS 
Sbjct: 358  AFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKDIPSP 417

Query: 1642 PDVSNFLISEDDVSASN--KDPLGFEGVTDVE-ERRLKDVIQASS----MANSLDPRPAP 1800
            PDVSN+L+SEDD S SN  +DP  F+G+ D E ER+LKD + A+S       +LDPR   
Sbjct: 418  PDVSNYLVSEDDGSISNGHRDPFLFDGMADAEVERKLKDALSAASTIPVTTANLDPR-LT 476

Query: 1801 SLQNAVASSLGTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEGE 1971
            SLQ  +  S G++  PT Q  +M   + QF Q   +VKP+    PSEPSL  SPAREEGE
Sbjct: 477  SLQYTMVPS-GSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSEPSLHSSPAREEGE 535

Query: 1972 VPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEE 2151
            VPESELDPDTRRRLLILQHGQDTRDH S+EPPF                 SRG WFP EE
Sbjct: 536  VPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQTSAPHVP--SSRGVWFPAEE 593

Query: 2152 DRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-NQRFP 2328
            +   +  NR     VPKE P++S  +   K R  HPSFF   ESS  SDR+L++ +QR P
Sbjct: 594  EIGSQPLNRV----VPKEFPVDSGPLGIAKPRPHHPSFFSKVESSISSDRILHDSHQRLP 649

Query: 2329 KEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQE 2508
            KE++  DDR R+NH  S+YRSF                            ++TP  VLQE
Sbjct: 650  KEMYHRDDRPRLNHMLSSYRSF----------------------------SDTPVAVLQE 681

Query: 2509 IGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANK 2688
            I +KCGTKV+F S+L+AS ELQFS+E WF+G+KIG   G+TRKEAQ +AA+ S+++LA+ 
Sbjct: 682  IALKCGTKVDFISSLVASTELQFSMEAWFSGKKIGHRVGRTRKEAQNKAAEDSIKHLADI 741

Query: 2689 YLSNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPR 2865
            YLS+   +P S  GD++   +VN+ G +G ++S G Q L KED         P+R LDPR
Sbjct: 742  YLSSAKDEPGSTYGDVSGFPNVNDSGYMGIASSLGNQPLSKED-SASFSTASPSRVLDPR 800

Query: 2866 LEESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILGKGVG 3042
            L+ SK+SMGS+S+LKELCMMEGL + FL+ P P+STN + K EV+AQVEI G++ GKG+G
Sbjct: 801  LDVSKRSMGSISSLKELCMMEGLDVNFLSAPAPVSTNSVQKDEVHAQVEIDGKVFGKGIG 860

Query: 3043 ATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTR 3222
             TW+EAKMQAAE+AL +L+S LG S Q R  SPR    FS+KRLK E+PR +QR+ SS R
Sbjct: 861  LTWDEAKMQAAEKALGSLRSKLGQSIQKRQSSPRPHQGFSNKRLKQEYPRPMQRMPSSAR 920

Query: 3223 YSNNAPPVP 3249
            Y  NAPP+P
Sbjct: 921  YPRNAPPIP 929


>ref|XP_004505032.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Cicer arietinum]
          Length = 951

 Score = 1154 bits (2984), Expect = 0.0
 Identities = 620/966 (64%), Positives = 727/966 (75%), Gaps = 14/966 (1%)
 Frame = +1

Query: 394  MFKSVVYQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGVCF 573
            M+KS+VYQG  +LGEV+IYP+        KEIRISHF+QPSERC PLAVLHTI  SGVCF
Sbjct: 1    MYKSLVYQGEVVLGEVDIYPEVNNNNKNFKEIRISHFTQPSERCLPLAVLHTITSSGVCF 60

Query: 574  KMESKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFIVSP 753
            KMESK Q +D PLF LH+ C RE+KTAVMPL  EE+HLVAM SR N     CFWG+IV  
Sbjct: 61   KMESKTQQQD-PLFHLHNLCFRENKTAVMPLCGEEMHLVAMHSRSNGRP--CFWGYIVGM 117

Query: 754  GLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGMLAE 933
            GLYNSCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN+E+DPQR+SGM AE
Sbjct: 118  GLYNSCLMMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISGMQAE 177

Query: 934  IKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNIILTR 1113
            +KRY +DKSILKQYVENDQVV+NGKV+K QSE+VPA SD++QPIVRPLIRL EKNIILTR
Sbjct: 178  VKRYLEDKSILKQYVENDQVVDNGKVLKAQSELVPALSDSHQPIVRPLIRLHEKNIILTR 237

Query: 1114 INPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPESNLI 1293
            INP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDP+SNLI
Sbjct: 238  INPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNLI 297

Query: 1294 NSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPAFAP 1473
            NSKELL R+VCVK+G +KSL NVF DG+CHPKMALVIDDRLKVWDEKDQ RVHVVPAFAP
Sbjct: 298  NSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAP 357

Query: 1474 YYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAPDVS 1653
            YYAPQAEA+N +PVLCVARNVACNVRGGFFK+FD+GLLQ I ++ YE+   +   APDVS
Sbjct: 358  YYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKISQIAYENNTRDISPAPDVS 417

Query: 1654 NFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQASS----MANSLDPRPAPSLQN 1812
            N+L+SEDD SAS  N+DP  F+G+ D E ER+LKD I A+S        LDPR   SLQ 
Sbjct: 418  NYLVSEDDGSASYANRDPFAFDGMADAEVERKLKDAISAASAIPMTTAKLDPRLTSSLQY 477

Query: 1813 AVASSLGTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEGEVPES 1983
             + S  G++  P  Q  ++   + QF Q   +VKP+  V PSE SL  SPAREEGEVPES
Sbjct: 478  TMVSP-GSVLPPAAQASMIPLPHTQFPQPATLVKPIGQVAPSELSLHSSPAREEGEVPES 536

Query: 1984 ELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEEDRIP 2163
            ELDPDTRRRLLILQHGQD RDHTSSEPPF               +  RG WFP+EE+   
Sbjct: 537  ELDPDTRRRLLILQHGQDNRDHTSSEPPF----PLKHPVQVSARVPPRGGWFPVEEEIGS 592

Query: 2164 RKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-NQRFPKEVH 2340
            + PNR     +PKE  L+S     +K R     FF   + S  SDR L+E NQR PKE++
Sbjct: 593  QPPNRV----IPKEIALDSGPSRIEKHRLHQQPFFPKVDGSISSDRALHETNQRLPKEMY 648

Query: 2341 QGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIGIK 2520
              DDR RV+H  S+Y S  G++TP G+S SS RDF  ESG  S    ETPA VLQEI +K
Sbjct: 649  HRDDRSRVSHMLSSYPSLSGDDTPFGRSSSSHRDFDSESGH-SVFNAETPAIVLQEIALK 707

Query: 2521 CGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYLSN 2700
            CGTKVEF S+L AS ELQFSIE WF+G+KIG G G+TR EAQ +AA+ S+++LA+ YLS 
Sbjct: 708  CGTKVEFTSSLAASRELQFSIEAWFSGKKIGHGFGRTRMEAQYKAAEDSIKHLADIYLSR 767

Query: 2701 VMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPRLEES 2877
               +  S  GD++   + N+ G +G+ +S G Q LPKE+        +P+R LDPRL+ S
Sbjct: 768  AKDESGSAFGDVSGFPNANDNGYVGNVSSLGNQPLPKEESVSFSAASDPSRVLDPRLDVS 827

Query: 2878 KKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILGKGVGATWN 3054
            K+SMGSVSALKELCM+EGL + FL+ P P+STN ++  EV+AQVEI GQ+ GKG G TW+
Sbjct: 828  KRSMGSVSALKELCMVEGLGVNFLSLPAPVSTNSVD--EVHAQVEIDGQVYGKGTGITWD 885

Query: 3055 EAKMQAAEEALWNLKSMLGDSG-QNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTRYSN 3231
            EAKMQAAE+AL +L++ +   G Q R  SPR     S+KRLK E PR LQR  SS RY  
Sbjct: 886  EAKMQAAEKALGSLRTTIHGQGIQRRQLSPRPFQGLSNKRLKQEHPRTLQRFASSGRYPR 945

Query: 3232 NAPPVP 3249
            NAPP+P
Sbjct: 946  NAPPIP 951


>ref|XP_003545893.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
          Length = 958

 Score = 1125 bits (2909), Expect = 0.0
 Identities = 606/970 (62%), Positives = 719/970 (74%), Gaps = 18/970 (1%)
 Frame = +1

Query: 394  MFKSVVYQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGVCF 573
            M +S+VY G   +GEVEIYP+ +K   + KEIRISHFSQPSERCPPLAVLHTI   G+CF
Sbjct: 1    MKRSMVYHGEMEVGEVEIYPEEKKNIDL-KEIRISHFSQPSERCPPLAVLHTITSFGICF 59

Query: 574  KMES---KLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFI 744
            KMES   + + +   LF LHSSC+RE+KTAVMPL  EE+HLVAM SR N     CFWGFI
Sbjct: 60   KMESSTSQTRQQQDVLFHLHSSCIRENKTAVMPLRGEEIHLVAMYSRNN--DRPCFWGFI 117

Query: 745  VSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGM 924
            V+ GLYNSCL MLNLRCLGIVFDLDETL+VANTMRSFED+I+ L RK+N+E++PQR+S M
Sbjct: 118  VASGLYNSCLTMLNLRCLGIVFDLDETLVVANTMRSFEDKIEVLHRKMNSEVNPQRISTM 177

Query: 925  LAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNII 1104
             AEIKRY DDK+ILK+Y ENDQVV+NGKVIK+QSE+VPA SD++QPIVRPLIRLQEKNII
Sbjct: 178  QAEIKRYLDDKNILKEYAENDQVVDNGKVIKIQSEIVPALSDSHQPIVRPLIRLQEKNII 237

Query: 1105 LTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPES 1284
            LTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEV+VCTMAERDYALEMWRLLDPE 
Sbjct: 238  LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPEL 297

Query: 1285 NLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPA 1464
            NLINSKELL R+VCVK+G +KSL NVF +G+CH KMALVIDDRLKVWDEKDQ +VHVVPA
Sbjct: 298  NLINSKELLDRIVCVKSGLKKSLFNVFQNGLCHLKMALVIDDRLKVWDEKDQPQVHVVPA 357

Query: 1465 FAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAP 1644
            FAPYYAPQAEA+N VP LC+AR+VACNVRGGFFK+FD+GLLQ I  + YED + + PS P
Sbjct: 358  FAPYYAPQAEASNAVPTLCLARSVACNVRGGFFKDFDDGLLQKIPLIAYEDDIKDIPSPP 417

Query: 1645 DVSNFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQASS----MANSLDPRPA-- 1797
            DVSN+L+SEDD SAS  NK+ L F+G+ D E ERRLKD I ASS    M  +LDPR A  
Sbjct: 418  DVSNYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTVPAMTTNLDPRLAFN 477

Query: 1798 PSLQNAVASSLGTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEG 1968
             SLQ  + SS GT+  PT Q  I+   N QF Q   +VKP+  V P  PSL  SPAREEG
Sbjct: 478  SSLQYTMVSSSGTVPPPTAQASIVQFGNVQFPQPNTLVKPICQVTPPGPSLHSSPAREEG 537

Query: 1969 EVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLE 2148
            EVPESELD DTRRRLLILQHGQDTR+HTSSEPP                + SR  WF +E
Sbjct: 538  EVPESELDLDTRRRLLILQHGQDTREHTSSEPPL---PVRHPTQVSAPSVPSRRGWFSVE 594

Query: 2149 EDRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-NQRF 2325
            E+  P++ N+     VPKE P+ SE +H +KR  RHPS F   + S  SDR+ +E +QR 
Sbjct: 595  EEMGPQQLNQL----VPKEFPVGSEPLHIEKRWPRHPSLFSKVDDSVSSDRVFHESHQRL 650

Query: 2326 PKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQ 2505
            PKEVH  DD  R++ S S+Y SFPG++ PL  S  S+RDF  ESGR S  + +  AGVLQ
Sbjct: 651  PKEVHHRDDHSRLSQSLSSYHSFPGDDIPLSGSSYSNRDFDSESGR-SLFHADITAGVLQ 709

Query: 2506 EIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLAN 2685
            EI +KCGTKVEF S+L+AS  LQFSIE WF G+K+G G G+TR+EAQ +AA+ S++ LA+
Sbjct: 710  EIALKCGTKVEFLSSLVASTALQFSIEAWFAGKKVGEGFGRTRREAQNKAAECSIKQLAD 769

Query: 2686 KYLSNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDP 2862
             Y+S+   D  S  GD++     N  G +   NS G Q LPKE         + +R  DP
Sbjct: 770  IYMSHAKDDSGSTYGDVSGFHGSNNNGFVSSGNSLGNQLLPKES-VSFSTSSDSSRVSDP 828

Query: 2863 RLEESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILGKGV 3039
            RLE SK+S  S+SALKE CMMEGL+  F + P P ST+   K EV+AQVEI GQI GKG 
Sbjct: 829  RLEVSKRSTDSISALKEFCMMEGLAANFQSSPAPASTHFAQKDEVHAQVEIDGQIFGKGF 888

Query: 3040 GATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSST 3219
            G TW EAKMQAA++AL +L++M     + R GSPR +   ++KRLK E+PR LQRI  S 
Sbjct: 889  GLTWEEAKMQAAKKALESLRTMFNQGTRKRHGSPRSMQGLANKRLKQEYPRTLQRIPYSA 948

Query: 3220 RYSNNAPPVP 3249
            RY  NAP VP
Sbjct: 949  RYPRNAPLVP 958


>ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Solanum tuberosum]
          Length = 953

 Score = 1122 bits (2903), Expect = 0.0
 Identities = 617/971 (63%), Positives = 714/971 (73%), Gaps = 19/971 (1%)
 Frame = +1

Query: 394  MFKS--VVYQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGV 567
            MFKS  V+Y+G  L+GEVEIY + + +   +K IRISH+S  SERCPPLAVLHT+  +G+
Sbjct: 1    MFKSTVVLYEGERLVGEVEIYCE-KGVLWGEKVIRISHYSPSSERCPPLAVLHTVT-TGL 58

Query: 568  CFKME---SKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWG 738
             FK+E   SK  ++DSPL  LHS+CLR++KTAVM LG EELHLVAM S+    Q  CFWG
Sbjct: 59   SFKLEPTKSKPLTQDSPLTLLHSTCLRDNKTAVMSLGREELHLVAMQSKNIGGQCPCFWG 118

Query: 739  FIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVS 918
            F V+ GLY+SCL MLNLRCLGIVFDLDETLIVANTMRSFEDRI+ALQRKIN+E DPQR S
Sbjct: 119  FKVASGLYDSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINSESDPQRAS 178

Query: 919  GMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKN 1098
             MLAE+KRYQ+DK ILKQY ENDQVV+NGKVIK QSEV PA SDN+QPIVRPLIRLQ++N
Sbjct: 179  VMLAEVKRYQEDKIILKQYAENDQVVDNGKVIKSQSEVFPALSDNHQPIVRPLIRLQDRN 238

Query: 1099 IILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDP 1278
            IILTRINP+IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDP
Sbjct: 239  IILTRINPMIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP 298

Query: 1279 ESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVV 1458
            +SNLINS+ELL R+VCVK+G RKSL NVF DG CHPKMALVIDDRLKVWD+KDQ RVHVV
Sbjct: 299  DSNLINSQELLDRIVCVKSGLRKSLFNVFQDGNCHPKMALVIDDRLKVWDDKDQPRVHVV 358

Query: 1459 PAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPS 1638
            PAFAPY+APQAE NN VPVLCVARNVACNVRGGFFK+FDEGLLQ I EV YED +   PS
Sbjct: 359  PAFAPYFAPQAEGNNSVPVLCVARNVACNVRGGFFKDFDEGLLQRISEVAYEDDIKQVPS 418

Query: 1639 APDVSNFLISEDDVSA--SNKDPLGFEGVTDVE-ERRLKDVIQAS----SMANSLDPRPA 1797
            APDVSN+LISEDD SA   NKD LGF+G+ D E ERRLK+ + AS    S   +LDPR  
Sbjct: 419  APDVSNYLISEDDPSAVNGNKDSLGFDGMADSEVERRLKEAMLASTSVPSQMTNLDPRLV 478

Query: 1798 PSLQNAVASSLGTISQPTPQGPIMSSQNNQFTQVVKPL----VHVGPSEPSLQGSPAREE 1965
            P+LQ  V      ISQP+ Q P++        QV   L      + P + SLQ SPAREE
Sbjct: 479  PALQYPVPP---VISQPSIQSPVVPFPTQHLPQVTSVLKSSVTQISPQDTSLQSSPAREE 535

Query: 1966 GEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPL 2145
            GEVPESELDPDTRRRLLILQHGQDTRD  SSEP F               +Q  G WFP 
Sbjct: 536  GEVPESELDPDTRRRLLILQHGQDTRDQVSSEPKF--PMGTPLQVSVPPRVQPHG-WFPA 592

Query: 2146 EEDRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNENQRF 2325
            EE+  PR+ NR +P   PKE PL  E+MH +K R  HP F    E+S PSDR+L ENQR 
Sbjct: 593  EEEMSPRQLNRPLP---PKEFPLNPESMHINKHRPPHPPFLPKMETSMPSDRVLFENQRL 649

Query: 2326 PKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQ 2505
            PKEV   DDR+R + S  ++R  PGEE PLG+S SS+R    E G    PY ETPAG LQ
Sbjct: 650  PKEVIPRDDRMRFSQSQPSFRP-PGEEVPLGRSSSSNRVLDLEPGH-YDPYLETPAGALQ 707

Query: 2506 EIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLAN 2685
            +I  KCG KVEFRS+ ++S ELQFS+EV F GEK+G GTG+TR+EAQR+AA+ SL  LA+
Sbjct: 708  DIAFKCGAKVEFRSSFLSSPELQFSLEVLFAGEKVGEGTGRTRREAQRRAAEESLMYLAD 767

Query: 2686 KYLSNVMHDPSSVRGDLNKPSHVNEMGLGDSNS-FGYQTLPKEDXXXXXXXXEPTRFLDP 2862
            KYLS +  D SS +GD  +  + ++ G  D+ S FGYQ     D        EP R LDP
Sbjct: 768  KYLSCIKPDSSSTQGDGFRFPNASDNGFVDNMSPFGYQ-----DRVSHSFASEPPRVLDP 822

Query: 2863 RLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGKGVG 3042
            RLE  KKS+GSV AL+ELC +EGL L F  QP LS N   K E+YAQVEI GQ+ GKG+G
Sbjct: 823  RLEVFKKSVGSVGALRELCAIEGLGLAFQTQPQLSANPGQKSEIYAQVEIDGQVFGKGIG 882

Query: 3043 ATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPR-LLHSFSSKRLKPEFPR-MLQRINSS 3216
            +TW++AK QAAE AL  LKS L    Q R GSPR L   FS+KRLKPE+ R + QR+  S
Sbjct: 883  STWDDAKTQAAERALVALKSELAQFSQKRQGSPRSLQQGFSNKRLKPEYSRGVQQRVPLS 942

Query: 3217 TRYSNNAPPVP 3249
             R+  N   +P
Sbjct: 943  GRFPKNTSAMP 953


>ref|XP_006597421.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X3 [Glycine max]
          Length = 932

 Score = 1108 bits (2866), Expect = 0.0
 Identities = 599/969 (61%), Positives = 712/969 (73%), Gaps = 17/969 (1%)
 Frame = +1

Query: 394  MFKSVVYQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGVCF 573
            M +S+VY G   +GEVEIYP+ +K   + KEIRISHFSQPSERCPPLAVLHTI   G+CF
Sbjct: 1    MKRSMVYHGEMEVGEVEIYPEEKKNIDL-KEIRISHFSQPSERCPPLAVLHTITSFGICF 59

Query: 574  KMES---KLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFI 744
            KMES   + + +   LF LHSSC+RE+KTAVMPL  EE+HLVAM SR N     CFWGFI
Sbjct: 60   KMESSTSQTRQQQDVLFHLHSSCIRENKTAVMPLRGEEIHLVAMYSRNN--DRPCFWGFI 117

Query: 745  VSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGM 924
            V+ GLYNSCL MLNLRCLGIVFDLDETL+VANTMRSFED+I+ L RK+N+E++PQR+S M
Sbjct: 118  VASGLYNSCLTMLNLRCLGIVFDLDETLVVANTMRSFEDKIEVLHRKMNSEVNPQRISTM 177

Query: 925  LAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNII 1104
             AEIKRY DDK+ILK+Y ENDQVV+NGKVIK+QSE+VPA SD++QPIVRPLIRLQEKNII
Sbjct: 178  QAEIKRYLDDKNILKEYAENDQVVDNGKVIKIQSEIVPALSDSHQPIVRPLIRLQEKNII 237

Query: 1105 LTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPES 1284
            LTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEV+VCTMAERDYALEMWRLLDPE 
Sbjct: 238  LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPEL 297

Query: 1285 NLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPA 1464
            NLINSKELL R+VCVK+G +KSL NVF +G+CH KMALVIDDRLKVWDEKDQ +VHVVPA
Sbjct: 298  NLINSKELLDRIVCVKSGLKKSLFNVFQNGLCHLKMALVIDDRLKVWDEKDQPQVHVVPA 357

Query: 1465 FAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAP 1644
            FAPYYAPQAEA+N VP LC+AR+VACNVRGGFFK+FD+GLLQ I  + YED + + PS P
Sbjct: 358  FAPYYAPQAEASNAVPTLCLARSVACNVRGGFFKDFDDGLLQKIPLIAYEDDIKDIPSPP 417

Query: 1645 DVSNFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQASS----MANSLDPRPA-- 1797
            DVSN+L+SEDD SAS  NK+ L F+G+ D E ERRLKD I ASS    M  +LDPR A  
Sbjct: 418  DVSNYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTVPAMTTNLDPRLAFN 477

Query: 1798 PSLQNAVASSLGTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEG 1968
             SLQ  + SS GT+  PT Q  I+   N QF Q   +VKP+  V P  PSL  SPAREEG
Sbjct: 478  SSLQYTMVSSSGTVPPPTAQASIVQFGNVQFPQPNTLVKPICQVTPPGPSLHSSPAREEG 537

Query: 1969 EVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLE 2148
            EVPESELD DTRRRLLILQHGQDTR+HTSSEPP                + SR  WF +E
Sbjct: 538  EVPESELDLDTRRRLLILQHGQDTREHTSSEPPL---PVRHPTQVSAPSVPSRRGWFSVE 594

Query: 2149 EDRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-NQRF 2325
            E+  P++ N+     VPKE P+ SE +H +KR  RHPS F   + S  SDR+ +E +QR 
Sbjct: 595  EEMGPQQLNQL----VPKEFPVGSEPLHIEKRWPRHPSLFSKVDDSVSSDRVFHESHQRL 650

Query: 2326 PKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQ 2505
            PKEVH  DD  R++ S S+Y SFPG++ PL  S  S+RDF  ESGR S  + +  AGVLQ
Sbjct: 651  PKEVHHRDDHSRLSQSLSSYHSFPGDDIPLSGSSYSNRDFDSESGR-SLFHADITAGVLQ 709

Query: 2506 EIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLAN 2685
            EI +KCGTKVEF S+L+AS  LQFSIE WF G+K+G G G+TR+EAQ +AA+ S++ LA+
Sbjct: 710  EIALKCGTKVEFLSSLVASTALQFSIEAWFAGKKVGEGFGRTRREAQNKAAECSIKQLAD 769

Query: 2686 KYLSNVMHDPSSVRGDLNKPSHVNEMGLGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPR 2865
             Y+S+   D  S  GD++        G   SN+ G+ +                   DPR
Sbjct: 770  IYMSHAKDDSGSTYGDVS--------GFHGSNNNGFVS------------------SDPR 803

Query: 2866 LEESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILGKGVG 3042
            LE SK+S  S+SALKE CMMEGL+  F + P P ST+   K EV+AQVEI GQI GKG G
Sbjct: 804  LEVSKRSTDSISALKEFCMMEGLAANFQSSPAPASTHFAQKDEVHAQVEIDGQIFGKGFG 863

Query: 3043 ATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTR 3222
             TW EAKMQAA++AL +L++M     + R GSPR +   ++KRLK E+PR LQRI  S R
Sbjct: 864  LTWEEAKMQAAKKALESLRTMFNQGTRKRHGSPRSMQGLANKRLKQEYPRTLQRIPYSAR 923

Query: 3223 YSNNAPPVP 3249
            Y  NAP VP
Sbjct: 924  YPRNAPLVP 932


>ref|XP_004232844.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Solanum lycopersicum]
          Length = 954

 Score = 1104 bits (2855), Expect = 0.0
 Identities = 607/972 (62%), Positives = 707/972 (72%), Gaps = 20/972 (2%)
 Frame = +1

Query: 394  MFKSVV--YQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGV 567
            MFKS V  Y+G  L+GEVE+Y + + +   +K IRISH+S  SERCPPLAVLHT+  +G+
Sbjct: 1    MFKSTVLLYEGERLVGEVEMYGE-KGVVWGEKLIRISHYSPSSERCPPLAVLHTVT-TGL 58

Query: 568  CFKME---SKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWG 738
             FK+E   SK  ++DSPL  LHS+CLR++KTAVM LG EELHLVAM S+    Q  CFWG
Sbjct: 59   SFKLEPTKSKPLTQDSPLTLLHSTCLRDNKTAVMSLGREELHLVAMQSKNIGGQCPCFWG 118

Query: 739  FIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVS 918
            F V+ GLY+SCL MLNLRCLGIVFDLDETLIVANTMRSFEDRI+ALQRKIN+E DPQR S
Sbjct: 119  FKVASGLYDSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINSESDPQRAS 178

Query: 919  GMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKN 1098
             MLAE+KRYQ+DK ILKQY ENDQVV+NGKVI+ QSEV PA SDN+QPIVRPLIRLQ++N
Sbjct: 179  VMLAEVKRYQEDKIILKQYAENDQVVDNGKVIRSQSEVFPALSDNHQPIVRPLIRLQDRN 238

Query: 1099 IILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDP 1278
            IILTRINP+IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDP
Sbjct: 239  IILTRINPMIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP 298

Query: 1279 ESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVV 1458
            +SNLINS+ELL R+VCVK+G RKSL NVF DG CHPKMALVIDDRLKVWD+KDQ RVHVV
Sbjct: 299  DSNLINSQELLDRIVCVKSGLRKSLFNVFQDGNCHPKMALVIDDRLKVWDDKDQPRVHVV 358

Query: 1459 PAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPS 1638
            PAFAPY+APQAE NN VPVLCVARNVACNVRGGFFK+FDEGLLQ I EV YED +   PS
Sbjct: 359  PAFAPYFAPQAEGNNSVPVLCVARNVACNVRGGFFKDFDEGLLQRISEVAYEDDIKQVPS 418

Query: 1639 APDVSNFLISEDDVSA--SNKDPLGFEGVTDVE-ERRLKDVIQAS----SMANSLDPRPA 1797
            APDVSN+LISEDD SA   NKD LGF+G+ D E ERRLK+ + AS    S   +LDPR  
Sbjct: 419  APDVSNYLISEDDPSAVNGNKDSLGFDGMADSEVERRLKEAMLASTSVPSQMTNLDPRLV 478

Query: 1798 PSLQNAVASSLGTISQPTPQGPIMSSQNNQFTQVVKPL----VHVGPSEPSLQGSPAREE 1965
            P+LQ  V      ISQP+ QGP++        QV   L      + P + SLQ SPAREE
Sbjct: 479  PALQYPVPP---VISQPSIQGPVVPFPTQHLPQVTSVLKSSVTQISPQDTSLQSSPAREE 535

Query: 1966 GEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPL 2145
            GEVPESELDPDTRRRLLILQHGQDTRD  SSEP F               +Q  G WFP 
Sbjct: 536  GEVPESELDPDTRRRLLILQHGQDTRDQVSSEPKF--PIGTPLQVSVPPRVQPHG-WFPA 592

Query: 2146 EEDRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNENQRF 2325
            EE+  PR+ NR +P   PKE PL  E+MH +K R  HP F    E+S PSDR+  ENQR 
Sbjct: 593  EEEVSPRQLNRPLP---PKEFPLNPESMHINKHRPPHPPFLPKMETSMPSDRVFFENQRL 649

Query: 2326 PKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQ 2505
            PKEV   DDR+R + S  ++R  PGE+  LG+S SS+R    + G    PY +TPAG LQ
Sbjct: 650  PKEVIPRDDRMRFSQSQPSFRP-PGEDVSLGRSSSSNRVLDLDPGH-YDPYLDTPAGALQ 707

Query: 2506 EIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLAN 2685
            +I  KCG KVEFRS+ ++S ELQF +EV F GEK+G G G+TR+EAQR AA+ SL  LA+
Sbjct: 708  DIAFKCGVKVEFRSSFLSSPELQFCLEVLFAGEKVGEGIGRTRREAQRHAAEESLMYLAD 767

Query: 2686 KYLSNVMHDPSSVRGDLNKPSHVNEMGLGDSNS-FGYQTLPKEDXXXXXXXXEPTRFLDP 2862
            KYLS +  D SS +GD  +  + ++ G  ++ S FGYQ     D        EP R LDP
Sbjct: 768  KYLSCIKADSSSTQGDGFRFPNASDNGFVENMSPFGYQ-----DRVSHSFASEPPRVLDP 822

Query: 2863 RLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGKGVG 3042
            RLE  KKS+GSV AL+ELC +EGL L F  QP LS N   K E+YAQVEI GQ+ GKG+G
Sbjct: 823  RLEVFKKSVGSVGALRELCAIEGLGLAFQTQPQLSVNPGQKSEIYAQVEIDGQVFGKGIG 882

Query: 3043 ATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLL--HSFSSKRLKPEFPR-MLQRINS 3213
             TW++AK QAAE AL  LKS L      R GSPR L    FS+KRLKPE+ R + QR+  
Sbjct: 883  PTWDDAKTQAAERALVALKSELAQFSHKRQGSPRSLQQQGFSNKRLKPEYSRGVQQRVPL 942

Query: 3214 STRYSNNAPPVP 3249
            S R+  N   +P
Sbjct: 943  SGRFPKNTSAMP 954


>ref|XP_003543063.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
            gi|571500215|ref|XP_006594604.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 1-like
            isoform X2 [Glycine max]
          Length = 960

 Score = 1104 bits (2855), Expect = 0.0
 Identities = 598/967 (61%), Positives = 707/967 (73%), Gaps = 18/967 (1%)
 Frame = +1

Query: 403  SVVYQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGVCFKME 582
            S+VY G   +GEV+IYP+  K   + KEIRISHFSQPSERCPPLAVLHTI   G+CFKME
Sbjct: 4    SMVYHGEMAVGEVKIYPEENKNMDL-KEIRISHFSQPSERCPPLAVLHTITSFGICFKME 62

Query: 583  SKLQSK---DSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFIVSP 753
            S    K      LF LHSSC+RE+KTAVMP+  EE+HLVAM SR N     CFWGFIV+ 
Sbjct: 63   SSTSQKRQQQDALFHLHSSCIRENKTAVMPVRGEEIHLVAMYSRNN--DRPCFWGFIVAS 120

Query: 754  GLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGMLAE 933
            GLYNSCL MLNLRCLGIVFDLDETL+VANTMRSFED+I+ L RK+N+E++PQ++S M AE
Sbjct: 121  GLYNSCLTMLNLRCLGIVFDLDETLVVANTMRSFEDKIEVLHRKMNSEVNPQQISAMQAE 180

Query: 934  IKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNIILTR 1113
            IKRY DDK+ILK+Y ENDQVV+NGKVIK+QSE VPA SD++QPIVRPLIRLQEKNIILTR
Sbjct: 181  IKRYLDDKNILKEYAENDQVVDNGKVIKIQSESVPALSDSHQPIVRPLIRLQEKNIILTR 240

Query: 1114 INPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPESNLI 1293
            INP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEV+VCTMAERDYALEMWRLLDPE NLI
Sbjct: 241  INPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPELNLI 300

Query: 1294 NSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPAFAP 1473
            NSKELL R+VCVK+G +KSL NVF +G+CH KMALVIDDRLKVWDEKDQ RVHVVPAFAP
Sbjct: 301  NSKELLDRIVCVKSGLKKSLFNVFQNGLCHLKMALVIDDRLKVWDEKDQPRVHVVPAFAP 360

Query: 1474 YYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAPDVS 1653
            YY PQAEA+N VP LC+ARNVACNVRGGFFK+FD+GLLQ I  + YED + + PS PDVS
Sbjct: 361  YYTPQAEASNAVPFLCLARNVACNVRGGFFKDFDDGLLQKIPLIAYEDDIKDIPS-PDVS 419

Query: 1654 NFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQASS----MANSLDPRPA--PSL 1806
            N+L+SEDD SAS  NK+ L F+G+ D E ERRLKD I ASS    +  ++DPR A   SL
Sbjct: 420  NYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTILALTANIDPRLAFTSSL 479

Query: 1807 QNAVASSLGTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEGEVP 1977
            Q  + SS GT+  PT Q  ++   N QF Q   +VKP+  V     SL  SPAREEGE+P
Sbjct: 480  QYTMVSSSGTVPPPTAQASVVQFGNVQFPQPNTLVKPMSQVTHPGLSLHSSPAREEGELP 539

Query: 1978 ESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEEDR 2157
            ESELD DTRRR LILQHGQDTR+  +SEPPF               + SR  WF +EE+ 
Sbjct: 540  ESELDLDTRRRFLILQHGQDTRERMASEPPF-PVRHPAQVSAPASSVPSRRGWFSVEEEM 598

Query: 2158 IPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-NQRFPKE 2334
             P++ N      VPKE P++SE  H +KR  RHPSFF     S  SDR+ +E +QR PKE
Sbjct: 599  GPQQLN----LPVPKEFPVDSEPFHIEKRWPRHPSFFSKVGDSISSDRVFHESHQRLPKE 654

Query: 2335 VHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIG 2514
            VH  DDR R++ S S+Y S PG++ PL  S  S+RDF  ESGR S  + +T AGVLQEI 
Sbjct: 655  VHHRDDRSRLSQSLSSYHSLPGDDIPLSGSSYSNRDFDSESGR-SLFHADTTAGVLQEIA 713

Query: 2515 IKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYL 2694
            + CGTKVEF S+L+AS ELQFSIE WF G+KIG G G+TR+EAQ +AA  S++ LA+ Y+
Sbjct: 714  LNCGTKVEFLSSLVASTELQFSIEAWFAGKKIGEGFGRTRREAQSKAAGCSIKQLADIYM 773

Query: 2695 SNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPRLE 2871
            S+   D  S  GD++     N  G +   NS G Q LPKE+        E +R  D RLE
Sbjct: 774  SHAKDDSGSTYGDVSGFHGSNNDGFVSSGNSLGNQLLPKEESGSFSTASESSRVSDSRLE 833

Query: 2872 ESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILGKGVGAT 3048
             SK+S  S+SALKELCMMEGL+  F + P   ST+L  K EV+AQVEI GQI GKG G T
Sbjct: 834  VSKRSTDSISALKELCMMEGLAASFQSPPASASTHLTQKDEVHAQVEIDGQIFGKGFGVT 893

Query: 3049 WNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTRYS 3228
            W EAKMQAA++AL +L++M       R GSPR +   ++KRLKPE+P  LQR+  S RY 
Sbjct: 894  WEEAKMQAAKKALGSLRTMFNQGSLKRHGSPRSMQGLANKRLKPEYPPTLQRVPYSARYP 953

Query: 3229 NNAPPVP 3249
             NAP VP
Sbjct: 954  RNAPLVP 960


Top