BLASTX nr result

ID: Papaver25_contig00029550 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver25_contig00029550
         (3804 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal doma...  1067   0.0  
ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citr...  1065   0.0  
ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform...  1064   0.0  
ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Popu...  1063   0.0  
ref|XP_002519032.1| double-stranded RNA binding protein, putativ...  1051   0.0  
ref|XP_007214548.1| hypothetical protein PRUPE_ppa000988mg [Prun...  1050   0.0  
ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal doma...  1041   0.0  
ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal doma...  1034   0.0  
ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Popu...  1030   0.0  
ref|XP_007025681.1| C-terminal domain phosphatase-like 1 isoform...  1028   0.0  
ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal doma...  1028   0.0  
ref|XP_007159305.1| hypothetical protein PHAVU_002G226900g [Phas...  1023   0.0  
emb|CAN72816.1| hypothetical protein VITISV_004100 [Vitis vinifera]  1015   0.0  
ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal doma...  1013   0.0  
ref|XP_003545893.1| PREDICTED: RNA polymerase II C-terminal doma...  1005   0.0  
ref|XP_004232844.1| PREDICTED: RNA polymerase II C-terminal doma...  1004   0.0  
ref|XP_004505032.1| PREDICTED: RNA polymerase II C-terminal doma...   991   0.0  
ref|XP_003543063.1| PREDICTED: RNA polymerase II C-terminal doma...   990   0.0  
ref|XP_006597420.1| PREDICTED: RNA polymerase II C-terminal doma...   979   0.0  
ref|XP_006597421.1| PREDICTED: RNA polymerase II C-terminal doma...   979   0.0  

>ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Citrus sinensis]
          Length = 957

 Score = 1067 bits (2760), Expect = 0.0
 Identities = 592/988 (59%), Positives = 697/988 (70%), Gaps = 15/988 (1%)
 Frame = +2

Query: 470  MFKSMVYEGNSLLGEVEIYPQNQ------QMDIIMLNKEIRISTYSQPSERCPPLAVLHT 631
            M+K++ Y G  +LGEVEIYPQ Q      +     +  EIRIS +S+ SERCPPLAVLHT
Sbjct: 1    MYKTVAYLGKEILGEVEIYPQQQGEGGEGEEKNKKVFDEIRISYFSEASERCPPLAVLHT 60

Query: 632  IAPNGVSFKMEASKSQSEDSL-LYTLHYSLLGENKTAVMGVG-EEELHLVAMPSRRNPSH 805
            I  +G+ FKME   S+S D++ L+ LH S + ENKTAVM +G  EELHLVAM SR N   
Sbjct: 61   ITASGICFKME---SKSSDNIQLHLLHSSCIRENKTAVMPLGLTEELHLVAMYSRNNEKQ 117

Query: 806  GACFWGFYAPQGLYNSSLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKITTEM 985
              CFW F    GLYNS L MLNLRCLGIVFDLDETLIVANTMRSFEDRI+AL RKI+TE+
Sbjct: 118  YPCFWAFSVGSGLYNSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKISTEV 177

Query: 986  DPQRVSGMLQEIKRYQDDKIILKQYVETDQIVENGRLIKVQSEVFPALSDNHQPIVRPLI 1165
            DPQR++GM  E+KRYQDDK ILKQY E DQ+ ENG++IKVQSEV PALSD+HQ +VRPLI
Sbjct: 178  DPQRIAGMQAEVKRYQDDKNILKQYAENDQVNENGKVIKVQSEVVPALSDSHQALVRPLI 237

Query: 1166 RLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEM 1345
            RLQEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEM
Sbjct: 238  RLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEM 297

Query: 1346 WRLLDPESNLISSNELLNRIVCVKAGLRKSLLNVFHGGICDPKMALVIDDRLKVWDEIDQ 1525
            WRLLDPESNLI++ ELL+RIVCVK+G RKSL NVF  G C PKMALVIDDRLKVWD+ DQ
Sbjct: 298  WRLLDPESNLINTKELLDRIVCVKSGSRKSLFNVFQDGTCHPKMALVIDDRLKVWDDKDQ 357

Query: 1526 PRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKDFDESLLQRIIGIFYEDD 1705
            PRVHVVPAFAPYYAPQAEANN +PVLCVARN+ACNVRGGFFK+FDE LLQRI  I YEDD
Sbjct: 358  PRVHVVPAFAPYYAPQAEANNAIPVLCVARNIACNVRGGFFKEFDEGLLQRIPEISYEDD 417

Query: 1706 VEEGPSPPDVSNFLISEDDGTAAN--RGPLRFEGMADVEVERKLKDIMPA----FSIPNN 1867
            V++ PSPPDVSN+L+SEDD   AN  + PL F+GMAD EVER+LK+ + A     S   N
Sbjct: 418  VKDIPSPPDVSNYLVSEDDAATANGIKDPLSFDGMADAEVERRLKEAIAASATISSAVAN 477

Query: 1868 IDSRFSTLQQFVGSSSNPTVPQPSPQGSVMPFHDXXXXXXXXXXXSFNPVNRVGPSESSL 2047
            +D R +  Q  + SSS+ T   P+ Q +VMP  +              P+  VGP E SL
Sbjct: 478  LDPRLAPFQYTMPSSSS-TTTLPTSQAAVMPLANMQFPPATSL---VKPLGHVGPPEQSL 533

Query: 2048 QNSPAREEGEVPESELDPDTRRRLLILQHGQDVREHPPKESXXXXXXXXXXXXXXXXIQS 2227
            Q+SPAREEGEVPESELDPDTRRRLLILQHG D RE+ P E+                + S
Sbjct: 534  QSSPAREEGEVPESELDPDTRRRLLILQHGMDTRENAPSEA--PFPARTQMQVSVPRVPS 591

Query: 2228 RGNWFPSEEETSPPKQNRALPKQVQKDFSVESEAMHFEKSRPQRSSSFQGTGAVQSDRTF 2407
            RG+WFP EEE SP + NRA+P    K+F + SEAM  EK RP   S F       +    
Sbjct: 592  RGSWFPVEEEMSPRQLNRAVP----KEFPLNSEAMQIEKHRPPHPSFFPKIENPSTSDRP 647

Query: 2408 HENLRLPKEVHIGDDRAKLSRTHPKYQSFSGEETPLGRSASNNKLSPSPKDEEMPLAQVA 2587
            HEN R+PKE    DDR +L+ T   YQSFSGEE PL RS+S                   
Sbjct: 648  HENQRMPKEALRRDDRLRLNHTLSDYQSFSGEEIPLSRSSS------------------- 688

Query: 2588 PSSKDLQTESGRGTSQYPDTPAKVLQDIAIKCGSKVVFRSALVGSAELQFSIEVWFSGEK 2767
             SS+D+  ESGR  S   +TP+ VLQDIA+KCG+KV FR ALV S ELQFSIE WF+GEK
Sbjct: 689  -SSRDVDFESGRDVSS-TETPSGVLQDIAMKCGTKVEFRPALVASTELQFSIEAWFAGEK 746

Query: 2768 IGEGIGKTRKEAQRQAAERSLRTLANRYLINTKPDPTMVYGDPNKPSNVDMNGYINDSNS 2947
            IGEGIG+TR+EAQRQAAE S++ LAN Y++  K D    +GD ++ SN + N ++ + NS
Sbjct: 747  IGEGIGRTRREAQRQAAEGSIKHLANVYMLRVKSDSGSGHGDGSRFSNANENCFMGEINS 806

Query: 2948 F-HQPFLNEDSVPASSTSEPSRYLNMRMEESQSNLDPISALKELCMDEGLKLDFRTRPTS 3124
            F  QP   ++S+    +SEPS+ ++ R+E S+  +  +SALKELCM EGL + F+ +P S
Sbjct: 807  FGGQPLAKDESL----SSEPSKLVDPRLEGSKKLMGSVSALKELCMTEGLGVVFQQQPPS 862

Query: 3125 STSLNYKGEAYAQVEIGGQVFGNGNGSTLDXXXXXXXXXXLMNLKVKLGLDTHKRIGSPR 3304
            S +   K E YAQVEI GQV G G GST D          L +L+   G    K  GSPR
Sbjct: 863  SANSVQKDEVYAQVEIDGQVLGKGIGSTWDEAKMQAAEKALGSLRSMFGQFPQKHQGSPR 922

Query: 3305 SLQAAVPNKRFKPEFPRVLQRIPPSARY 3388
            SLQ  +PNKR KPEFPRVLQR+PPS RY
Sbjct: 923  SLQ-GMPNKRLKPEFPRVLQRMPPSGRY 949


>ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citrus clementina]
            gi|557551913|gb|ESR62542.1| hypothetical protein
            CICLE_v10014168mg [Citrus clementina]
          Length = 957

 Score = 1065 bits (2755), Expect = 0.0
 Identities = 596/988 (60%), Positives = 700/988 (70%), Gaps = 15/988 (1%)
 Frame = +2

Query: 470  MFKSMVYEGNSLLGEVEIYPQNQ------QMDIIMLNKEIRISTYSQPSERCPPLAVLHT 631
            M+K++ Y G  +LGEVEIYPQ Q      +     +  EIRIS +S+ SERCPPLAVLHT
Sbjct: 1    MYKTVAYLGKEILGEVEIYPQQQGEGGEGEEKNKKVFDEIRISYFSEASERCPPLAVLHT 60

Query: 632  IAPNGVSFKMEASKSQSEDSLLYTLHYSLLGENKTAVMGVG-EEELHLVAMPSRRNPSHG 808
            I  +G+ FKME SKS S++  L+ LH S + ENKTAVM +G  EELHLVAM SR N    
Sbjct: 61   ITASGICFKME-SKS-SDNVQLHLLHSSCIRENKTAVMLLGLTEELHLVAMYSRNNEKQY 118

Query: 809  ACFWGFYAPQGLYNSSLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKITTEMD 988
             CFW F    GLYNS L MLNLRCLGIVFDLDETLIVANTMRSFEDRI+AL RKI+TE+D
Sbjct: 119  PCFWAFSVGSGLYNSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKISTEVD 178

Query: 989  PQRVSGMLQEIKRYQDDKIILKQYVETDQIVENGRLIKVQSEVFPALSDNHQPIVRPLIR 1168
            PQR++GM  E+KRYQDDK ILKQY E DQ+ ENG++IKVQSEV PALSD+HQ +VRPLIR
Sbjct: 179  PQRIAGMQAEVKRYQDDKNILKQYAENDQVNENGKVIKVQSEVVPALSDSHQALVRPLIR 238

Query: 1169 LQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMW 1348
            LQEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMW
Sbjct: 239  LQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMW 298

Query: 1349 RLLDPESNLISSNELLNRIVCVKAGLRKSLLNVFHGGICDPKMALVIDDRLKVWDEIDQP 1528
            RLLDPESNLI++ ELL+RIVCVK+G RKSL NVF  G C PKMALVIDDRLKVWDE DQ 
Sbjct: 299  RLLDPESNLINTKELLDRIVCVKSGSRKSLFNVFQDGTCHPKMALVIDDRLKVWDEKDQS 358

Query: 1529 RVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKDFDESLLQRIIGIFYEDDV 1708
            RVHVVPAFAPYYAPQAEANN +PVLCVARN+ACNVRGGFFK+FDE LLQRI  I YEDDV
Sbjct: 359  RVHVVPAFAPYYAPQAEANNAIPVLCVARNIACNVRGGFFKEFDEGLLQRIPEISYEDDV 418

Query: 1709 EEGPSPPDVSNFLISEDDGTAAN--RGPLRFEGMADVEVERKLKDIMPA----FSIPNNI 1870
            +E PSPPDVSN+L+SEDD   AN  + PL F+GMAD EVER+LK+ + A     S   N+
Sbjct: 419  KEIPSPPDVSNYLVSEDDAATANGIKDPLSFDGMADAEVERRLKEAIAASATISSAVANL 478

Query: 1871 DSRFSTLQQFVGSSSNPTVPQPSPQGSVMPFHDXXXXXXXXXXXSFNPVNRVGPSESSLQ 2050
            D R +  Q  + SSS+ T   P+ Q +VMP  +              P+  VGP E  LQ
Sbjct: 479  DPRLAPFQYTMPSSSS-TTTLPTSQAAVMPLANMQFPPATSL---VKPLGHVGPPEQCLQ 534

Query: 2051 NSPAREEGEVPESELDPDTRRRLLILQHGQDVREHPPKESXXXXXXXXXXXXXXXXIQSR 2230
            +SPAREEGEVPESELDPDTRRRLLILQHG D RE+ P E+                + SR
Sbjct: 535  SSPAREEGEVPESELDPDTRRRLLILQHGMDTRENAPSEA--PFPARTQMQVSVPRVPSR 592

Query: 2231 GNWFPSEEETSPPKQNRALPKQVQKDFSVESEAMHFEKSRPQRSSSF-QGTGAVQSDRTF 2407
            G+WFP EEE SP + NRA+P    K+F + SEAM  EK RP   S F +   ++ SDR  
Sbjct: 593  GSWFPVEEEMSPRQLNRAVP----KEFPLNSEAMQIEKHRPPHPSFFPKIENSITSDRP- 647

Query: 2408 HENLRLPKEVHIGDDRAKLSRTHPKYQSFSGEETPLGRSASNNKLSPSPKDEEMPLAQVA 2587
            HEN R+PKE    DDR +L+ T   YQSFSGEE PL RS+S                   
Sbjct: 648  HENQRMPKEALRRDDRLRLNHTLSDYQSFSGEEIPLSRSSS------------------- 688

Query: 2588 PSSKDLQTESGRGTSQYPDTPAKVLQDIAIKCGSKVVFRSALVGSAELQFSIEVWFSGEK 2767
             SS+D+  ESGR  S   +TP+ VLQDIA+KCG+KV FR ALV S ELQFSIE WF+GEK
Sbjct: 689  -SSRDVDFESGRDVSS-TETPSGVLQDIAMKCGTKVEFRPALVASTELQFSIEAWFAGEK 746

Query: 2768 IGEGIGKTRKEAQRQAAERSLRTLANRYLINTKPDPTMVYGDPNKPSNVDMNGYINDSNS 2947
            IGEGIG+TR+EAQRQAAE S++ LAN Y++  K D    +GD ++ SN + N ++ + NS
Sbjct: 747  IGEGIGRTRREAQRQAAEGSIKHLANVYVLRVKSDSGSGHGDGSRFSNANENCFMGEINS 806

Query: 2948 F-HQPFLNEDSVPASSTSEPSRYLNMRMEESQSNLDPISALKELCMDEGLKLDFRTRPTS 3124
            F  QP   ++S+    +SEPS+ ++ R+E S+  +  +SALKELCM EGL + F+ +P S
Sbjct: 807  FGGQPLAKDESL----SSEPSKLVDPRLEGSKKLMGSVSALKELCMTEGLGVVFQQQPPS 862

Query: 3125 STSLNYKGEAYAQVEIGGQVFGNGNGSTLDXXXXXXXXXXLMNLKVKLGLDTHKRIGSPR 3304
            S +   K E YAQVEI GQV G G GST D          L +L+   G    K  GSPR
Sbjct: 863  SANSVQKDEVYAQVEIDGQVLGKGIGSTWDEAKMQAAEKALGSLRSMFGQFPQKHQGSPR 922

Query: 3305 SLQAAVPNKRFKPEFPRVLQRIPPSARY 3388
            SLQ  +PNKR KPEFPRVLQR+PPS RY
Sbjct: 923  SLQ-GMPNKRLKPEFPRVLQRMPPSGRY 949


>ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform 1 [Theobroma cacao]
            gi|508781046|gb|EOY28302.1| C-terminal domain
            phosphatase-like 1 isoform 1 [Theobroma cacao]
          Length = 978

 Score = 1064 bits (2751), Expect = 0.0
 Identities = 594/1005 (59%), Positives = 707/1005 (70%), Gaps = 32/1005 (3%)
 Frame = +2

Query: 470  MFKSMVYEGNSLLGEVEIYPQNQ------------QMDIIMLN---KEIRISTYSQPSER 604
            M+KS+VY G  +LGEVEIYPQ Q            +  I+++    KEIRI   +Q SER
Sbjct: 4    MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63

Query: 605  CPPLAVLHTIAPNGVSFKMEASK----SQSEDSL-LYTLHYSLLGENKTAVMGVGEEELH 769
            CPPLAVLHTI  +G+ FKME+SK    S S+DS  L+ LH   + +NKTAVM +G+ ELH
Sbjct: 64   CPPLAVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCELH 123

Query: 770  LVAMPSRRNPSHGACFWGFYAPQGLYNSSLVMLNLRCLGIVFDLDETLIVANTMRSFEDR 949
            LVAM SR   S   CFWGF   +GLY+S L+MLNLRCLGIVFDLDETLIVANTMRSFEDR
Sbjct: 124  LVAMYSRN--SDRPCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSFEDR 181

Query: 950  IDALQRKITTEMDPQRVSGMLQEIKRYQDDKIILKQYVETDQIVENGRLIKVQSEVFPAL 1129
            I+ALQRK+TTE+DPQRV+GM+ E+KRYQDDK ILKQY E DQ+VENG++IK+QSEV PAL
Sbjct: 182  IEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVVPAL 241

Query: 1130 SDNHQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 1309
            SDNHQPI+RPLIRLQEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV
Sbjct: 242  SDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 301

Query: 1310 CTMAERDYALEMWRLLDPESNLISSNELLNRIVCVKAGLRKSLLNVFHGGICDPKMALVI 1489
            CTMAERDYALEMWRLLDPESNLI+S ELL+RIVCVK+G RKSL NVF  GIC PKMALVI
Sbjct: 302  CTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVI 361

Query: 1490 DDRLKVWDEIDQPRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKDFDESL 1669
            DDRLKVWDE DQPRVHVVPAFAPYYAPQAEANN +PVLCVARNVACNVRGGFF++FDE L
Sbjct: 362  DDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGL 421

Query: 1670 LQRIIGIFYEDDVEEGPSPPDVSNFLISEDDGTA--ANRGPLRFEGMADVEVERKLKDIM 1843
            LQRI  I YEDD+++ PSPPDV N+L+SEDD +A   N+ PL F+GMAD EVER+LK+ +
Sbjct: 422  LQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERRLKEAI 481

Query: 1844 PAFSIPN----NIDSRFSTLQQFVGSSSNPTVPQPSPQGSVMPFHDXXXXXXXXXXXSFN 2011
             A S  +    N+D R +   Q+   SS+ ++P  + Q S++ F +              
Sbjct: 482  SATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPV---VK 538

Query: 2012 PVNRVGPSESSLQNSPAREEGEVPESELDPDTRRRLLILQHGQDVREHPPKESXXXXXXX 2191
            PV  V   E SLQ+SPAREEGEVPESELDPDTRRRLLILQHGQD R+H P E        
Sbjct: 539  PVAPVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPVRP 598

Query: 2192 XXXXXXXXXIQSRGNWFPSEEETSPPKQNRALPKQVQKDFSVESEAMHFEKSR-----PQ 2356
                      QSRG+WF +EEE SP + NRA P    K+F ++SE MH EK R     P+
Sbjct: 599  TMQVSVPRG-QSRGSWFAAEEEMSPRQLNRAAP----KEFPLDSERMHIEKHRHPPFFPK 653

Query: 2357 RSSSFQGTGAVQSDRTFHENLRLPKEVHIGDDRAKLSRTHPKYQSFSGEETPLGRSASNN 2536
              SS      + SDR   EN RL KE    DDR  L+ T   Y SFSGE           
Sbjct: 654  VESS------IPSDRLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGE----------- 696

Query: 2537 KLSPSPKDEEMPLAQVAPSSKDLQTESGRGTSQYPDTPAKVLQDIAIKCGSKVVFRSALV 2716
                     EMPL+Q + S +DL  ESGR T    +T A VLQDIA+KCG+KV FR ALV
Sbjct: 697  ---------EMPLSQSSSSHRDLDFESGR-TVTSGETSAGVLQDIAMKCGAKVEFRPALV 746

Query: 2717 GSAELQFSIEVWFSGEKIGEGIGKTRKEAQRQAAERSLRTLANRYLINTKPDPTMVYGDP 2896
             S +LQFSIE WF+GEK+GEG+G+TR+EAQRQAAE S++ LAN YL   KPD     GD 
Sbjct: 747  ASLDLQFSIEAWFAGEKVGEGVGRTRREAQRQAAEESIKNLANTYLSRIKPDSGSAEGDL 806

Query: 2897 NKPSNVDMNGYINDSNSF-HQPFLNEDSVPASSTSEPSRYLNMRMEESQSNLDPISALKE 3073
            ++  N++ NG+ ++ NSF +Q    E+S+  S+ SE SR  + R+E S+ ++  ++ALKE
Sbjct: 807  SRLHNINDNGFPSNVNSFGNQLLAKEESLSFSTASEQSRLADPRLEGSKKSMGSVTALKE 866

Query: 3074 LCMDEGLKLDFRTRPTSSTSLNYKGEAYAQVEIGGQVFGNGNGSTLDXXXXXXXXXXLMN 3253
            LCM EGL + F+ +P SS++   K E YAQVEI GQV G G G T +          L +
Sbjct: 867  LCMMEGLGVVFQPQPPSSSNALQKDEVYAQVEIDGQVLGKGTGLTWEEAKMQAAEKALGS 926

Query: 3254 LKVKLGLDTHKRIGSPRSLQAAVPNKRFKPEFPRVLQRIPPSARY 3388
            L+  LG  + KR GSPRSLQ  + NKR KPEFPRVLQR+P S RY
Sbjct: 927  LRSMLGQYSQKRQGSPRSLQ-GMQNKRLKPEFPRVLQRMPSSGRY 970


>ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Populus trichocarpa]
            gi|550340277|gb|EEE85528.2| hypothetical protein
            POPTR_0004s04010g [Populus trichocarpa]
          Length = 996

 Score = 1063 bits (2750), Expect = 0.0
 Identities = 595/1021 (58%), Positives = 703/1021 (68%), Gaps = 48/1021 (4%)
 Frame = +2

Query: 470  MFKSMVYEGNSLLGEVEIYPQNQQMD-----------IIMLNKEIRISTYSQPSERCPPL 616
            M+KS+VY+G+ LLGEVEIY Q QQ +           I  + KEIRIS +SQ SERCPPL
Sbjct: 1    MYKSVVYKGDELLGEVEIYAQEQQQEEEENKNKKKRVIDEIVKEIRISHFSQTSERCPPL 60

Query: 617  AVLHTIAPNGVSFKMEASKSQS------EDSLLYTLHYSLLGENKTAVMGVGEEELHLVA 778
            AVLHTI   GV FKME S S S      ++S L+ LH S + ENKTAVM +G EELHLVA
Sbjct: 61   AVLHTITSIGVCFKMEESTSSSTTKISQQESPLHLLHSSCIQENKTAVMHLGGEELHLVA 120

Query: 779  MPSRRNPSHGACFWGFYAPQGLYNSSLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDA 958
            MPSR N     CFWGF    GLY+S LVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDA
Sbjct: 121  MPSRSNERQHPCFWGFSVAPGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDA 180

Query: 959  LQRKITTEMDPQRVSGMLQEIKRYQDDKIILKQYVETDQIVENGRLIKVQSEVFPALSDN 1138
            LQRKI+TE+DPQR+ GML E+KRY DDK ILKQYVE DQ+VENG++IK QSEV PALSDN
Sbjct: 181  LQRKISTEVDPQRILGMLSEVKRYHDDKNILKQYVENDQVVENGKVIKTQSEVVPALSDN 240

Query: 1139 HQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTM 1318
            HQP+VRPLIRLQEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTM
Sbjct: 241  HQPMVRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTM 300

Query: 1319 AERDYALEMWRLLDPESNLISSNELLNRIVCVKAGLRKSLLNVFHGGICDPKMALVIDDR 1498
            AERDYALEMWRLLDPESNLI+S ELL+RIVCVK+GLRKSL NVF  GIC PKMALVIDDR
Sbjct: 301  AERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLRKSLFNVFQDGICHPKMALVIDDR 360

Query: 1499 LKVWDEIDQPRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKDFDESLLQR 1678
            LKVWDE DQ RVHVVPAFAPYYAPQAE NN VPVLCVARNVACNVRGGFFK+FDE LLQ+
Sbjct: 361  LKVWDERDQSRVHVVPAFAPYYAPQAEVNNAVPVLCVARNVACNVRGGFFKEFDEGLLQK 420

Query: 1679 IIGIFYEDDVEEGPSPPDVSNFLISEDDGTA--ANRGPLRFEGMADVEVERKLKDIMPAF 1852
            I  + YEDD +  PSPPDVSN+L+SEDD +A   NR  L F+GMAD EVER+LK+ + A 
Sbjct: 421  IPEVAYEDDTDNIPSPPDVSNYLVSEDDASAVNGNRDQLSFDGMADAEVERQLKEAVSAS 480

Query: 1853 S-----IPNNIDS----RFSTLQQFVGSSSN--PT----------------VPQPSPQGS 1951
            S     IP+ + S       +LQ  + SSS+  PT                 P+P  Q S
Sbjct: 481  SAILSTIPSTVSSLDPRLLQSLQYTIASSSSSMPTSQPSMLASQQPMPALQPPKPPSQLS 540

Query: 1952 VMPFHDXXXXXXXXXXXSFNPVNRVGPSESSLQNSPAREEGEVPESELDPDTRRRLLILQ 2131
            + PF +           S   + +V P E SLQ+SPAREEGEVPESELDPDTRRRLLILQ
Sbjct: 541  MTPFPN---TQFPQVAPSVKQLGQVVPPEPSLQSSPAREEGEVPESELDPDTRRRLLILQ 597

Query: 2132 HGQDVREHPPKESXXXXXXXXXXXXXXXXIQSRGNWFPSEEETSPPKQNRALPKQVQKDF 2311
            HG D R++ P ES                +QS G+W P EEE SP + NR       ++F
Sbjct: 598  HGHDSRDNAPSES--PFPARPSTQVSAPRVQSVGSWVPVEEEMSPRQLNR-----TPREF 650

Query: 2312 SVESEAMHFEKSRPQRSSSFQGTGA-VQSDRTFHENLRLPKEVHIGDDRAKLSRTHPKYQ 2488
             ++S+ M+ EK R    S F    + + SDR  HEN R PKE    DDR KL+ +   Y 
Sbjct: 651  PLDSDPMNIEKHRTHHPSFFHKVESNIPSDRMIHENQRQPKEATYRDDRMKLNHSTSNYP 710

Query: 2489 SFSGEETPLGRSASNNKLSPSPKDEEMPLAQVAPSSKDLQTESGRGTSQYPDTPAKVLQD 2668
            SF GEE+PL RS+SN                     +DL  ES R  S   +TP +VLQ+
Sbjct: 711  SFQGEESPLSRSSSN---------------------RDLDLESERAFSS-TETPVEVLQE 748

Query: 2669 IAIKCGSKVVFRSALVGSAELQFSIEVWFSGEKIGEGIGKTRKEAQRQAAERSLRTLANR 2848
            IA+KCG+KV FR AL+ +++LQFSIE WF GEK+GEG GKTR+EAQRQAAE S++ LA  
Sbjct: 749  IAMKCGTKVEFRPALIATSDLQFSIETWFVGEKVGEGTGKTRREAQRQAAEGSIKKLAGI 808

Query: 2849 YLINTKPDPTMVYGDPNKPSNVDMNGYINDSNSF-HQPFLNEDSVPASSTSEPSRYLNMR 3025
            Y+   KPD   + GD ++  + + NG++ D NSF +QP L ++++  S+TSEPSR L+ R
Sbjct: 809  YMSRVKPDSGPMLGDSSRYPSANDNGFLGDMNSFGNQPLLKDENITYSATSEPSRLLDQR 868

Query: 3026 MEESQSNLDPISALKELCMDEGLKLDFRTRPTSSTSLNYKGEAYAQVEIGGQVFGNGNGS 3205
            +E S+ ++  ++ALKE CM EGL ++F  +   ST+     E +AQVEI GQV G G G 
Sbjct: 869  LEGSKKSMGSVTALKEFCMTEGLGVNFLAQTPLSTNSIPGEEVHAQVEIDGQVLGKGIGL 928

Query: 3206 TLDXXXXXXXXXXLMNLKVKLGLDTHKRIGSPRSLQAAVPNKRFKPEFPRVLQRIPPSAR 3385
            T D          L +L+   G  T KR GSPR +Q  +PNKR K EFPRVLQR+P SAR
Sbjct: 929  TWDEAKMQAAEKALGSLRTMFGQYTPKRQGSPRLMQ-GMPNKRLKQEFPRVLQRMPSSAR 987

Query: 3386 Y 3388
            Y
Sbjct: 988  Y 988


>ref|XP_002519032.1| double-stranded RNA binding protein, putative [Ricinus communis]
            gi|223541695|gb|EEF43243.1| double-stranded RNA binding
            protein, putative [Ricinus communis]
          Length = 978

 Score = 1051 bits (2718), Expect = 0.0
 Identities = 579/1002 (57%), Positives = 701/1002 (69%), Gaps = 29/1002 (2%)
 Frame = +2

Query: 470  MFKSMVYEGNSLLGEVEIYPQNQQ--------------------MDIIMLNKEIRISTYS 589
            M+KS+VY+G+ LLGEVEIY Q +Q                    +D I+  K IRIS +S
Sbjct: 1    MYKSVVYKGDELLGEVEIYAQQEQKLQQQEELQEQEQELKKKRVIDEIL--KGIRISHFS 58

Query: 590  QPSERCPPLAVLHTIAPNGVSFKMEASKSQSEDSLLYTLHYSLLGENKTAVMGV-GEEEL 766
            Q SERCPPLAVLHTI  NG+ FKME+  S S D+ L+ LH S + E+KTAV+ + G EEL
Sbjct: 59   QASERCPPLAVLHTITTNGICFKMESKNSVSLDTPLHLLHSSCIQESKTAVVLLQGGEEL 118

Query: 767  HLVAMPSRRNPSHGACFWGFYAPQGLYNSSLVMLNLRCLGIVFDLDETLIVANTMRSFED 946
            HLVAM SR +     CFW F    GLY+S LVMLNLRCLGIVFDLDETLIVANTMRSFED
Sbjct: 119  HLVAMFSRNDERQYPCFWAFNISSGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFED 178

Query: 947  RIDALQRKITTEMDPQRVSGMLQEIKRYQDDKIILKQYVETDQIVENGRLIKVQSEVFPA 1126
            RI+ALQRKI+TE+DPQR+SGML E+KRYQDDK ILKQYV+ DQ+VENGR+IK Q EV PA
Sbjct: 179  RIEALQRKISTELDPQRISGMLSEVKRYQDDKTILKQYVDNDQVVENGRVIKTQFEVVPA 238

Query: 1127 LSDNHQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVY 1306
            LSDNHQ IVRPLIRLQE+NIILTRINP IRDTSVLVRLRPAWE+LRSYLTARGRKRFEVY
Sbjct: 239  LSDNHQTIVRPLIRLQERNIILTRINPQIRDTSVLVRLRPAWEELRSYLTARGRKRFEVY 298

Query: 1307 VCTMAERDYALEMWRLLDPESNLISSNELLNRIVCVKAGLRKSLLNVFHGGICDPKMALV 1486
            VCTMAERDYALEMWRLLDPESNLI+S ELL+RIVCVK+GLRKSL NVF  GIC PKMALV
Sbjct: 299  VCTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLRKSLFNVFQDGICHPKMALV 358

Query: 1487 IDDRLKVWDEIDQPRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKDFDES 1666
            IDDRLKVWDE DQPRVHVVPAFAPYYAPQAEANN VPVLCVARNVACNVRGGFFK+FDE 
Sbjct: 359  IDDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDEG 418

Query: 1667 LLQRIIGIFYEDDVEEGPSPPDVSNFLISEDDG--TAANRGPLRFEGMADVEVERKLKD- 1837
            LLQRI  I +EDD+ + PSPPDVSN+L+ EDD   +  NR PL F+GMAD EVE++LK+ 
Sbjct: 419  LLQRIPEISFEDDMNDIPSPPDVSNYLVPEDDAFTSNGNRDPLSFDGMADAEVEKRLKEA 478

Query: 1838 --IMPAF-SIPNNIDSRFSTLQQFVGSSSNPTVPQPSPQGSVMPFHDXXXXXXXXXXXSF 2008
              I  AF S   N+D+R     Q+  +SS+ ++P P+ Q +V+ F               
Sbjct: 479  ISISSAFPSTVANLDARLVPPLQYTMASSS-SIPVPTSQPAVVTFPSMQLPQAAPL---V 534

Query: 2009 NPVNRVGPSESSLQNSPAREEGEVPESELDPDTRRRLLILQHGQDVREHPPKESXXXXXX 2188
             P+ +V PSE SLQ+SPAREEGEVPESELDPDTRRRLLILQHGQD+R+  P ES      
Sbjct: 535  KPLGQVVPSEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDLRDPAPSESPFPVRP 594

Query: 2189 XXXXXXXXXXIQSRGNWFPSEEETSPPKQNRALPKQVQKDFSVESEAMHFEKSRPQRSSS 2368
                      +QSRGNW P EEE SP + NRA    V ++F +++E MH +K RP   S 
Sbjct: 595  SNSMQVSVPRVQSRGNWVPVEEEMSPRQLNRA----VTREFPMDTEPMHIDKHRPHHPSF 650

Query: 2369 F-QGTGAVQSDRTFHENLRLPKEVHIGDDRAKLSRTHPKYQSFSGEETPLGRSASNNKLS 2545
            F +   ++ S+R  HEN RLPK     DDR +L++T   YQS SGEE  L RS+S+N   
Sbjct: 651  FPKVESSIPSERMPHENQRLPKVAPYKDDRLRLNQTMSNYQSLSGEENSLSRSSSSN--- 707

Query: 2546 PSPKDEEMPLAQVAPSSKDLQTESGRGTSQYPDTPAKVLQDIAIKCGSKVVFRSALVGSA 2725
                             +DL  ES R  S   +TP +VL +I++KCG+KV F+ +LV S 
Sbjct: 708  -----------------RDLDVESDRAVSS-AETPVRVLHEISMKCGAKVEFKHSLVNSR 749

Query: 2726 ELQFSIEVWFSGEKIGEGIGKTRKEAQRQAAERSLRTLANRYLINTKPDPTMVYGDPNKP 2905
            +LQFS+E WF+GE++GEG G+TR+EAQ  AAE S++ LAN Y+   KPD   ++GD +K 
Sbjct: 750  DLQFSVEAWFAGERVGEGFGRTRREAQSVAAEASIKNLANIYISRAKPDNGALHGDASKY 809

Query: 2906 SNVDMNGYINDSNSF-HQPFLNEDSVPASSTSEPSRYLNMRMEESQSNLDPISALKELCM 3082
            S+ + NG++   NSF  QP   ++ +  S +SE S  L+ R+E S+ ++  ++ALKE CM
Sbjct: 810  SSANDNGFLGHVNSFGSQPLPKDEILSYSDSSEQSGLLDPRLESSKKSMSSVNALKEFCM 869

Query: 3083 DEGLKLDFRTRPTSSTSLNYKGEAYAQVEIGGQVFGNGNGSTLDXXXXXXXXXXLMNLKV 3262
             EGL ++F  +   S++     E +AQVEI GQV G G GST D          L +L+ 
Sbjct: 870  MEGLGVNFLAQTPLSSNSVQNAEVHAQVEIDGQVMGKGIGSTFDEAKMQAAEKALGSLRT 929

Query: 3263 KLGLDTHKRIGSPRSLQAAVPNKRFKPEFPRVLQRIPPSARY 3388
              G    KR GSPR +   +PNK  KPEFPRVLQR+P SARY
Sbjct: 930  TFGRFPPKRQGSPRPV-PGMPNKHLKPEFPRVLQRMPSSARY 970


>ref|XP_007214548.1| hypothetical protein PRUPE_ppa000988mg [Prunus persica]
            gi|462410413|gb|EMJ15747.1| hypothetical protein
            PRUPE_ppa000988mg [Prunus persica]
          Length = 940

 Score = 1050 bits (2716), Expect = 0.0
 Identities = 585/986 (59%), Positives = 689/986 (69%), Gaps = 13/986 (1%)
 Frame = +2

Query: 470  MFKSMVYEGNSLLGEVEIYPQ-----NQQMDIIMLNKEIRISTYSQPSERCPPLAVLHTI 634
            M+KS+VY+G  LLGEVEIYP+     N+  +++   KEIRIS +SQ SERCPP+AVLHTI
Sbjct: 1    MYKSVVYKGEELLGEVEIYPEENENKNKNKNLVDELKEIRISYFSQSSERCPPVAVLHTI 60

Query: 635  APNGVSFKMEASKSQSEDSLLYTLHYSLLGENKTAVMGVGEEELHLVAMPSRRNPSHGAC 814
            + +GV FKME+  SQS+D+ L+ LH S + ENKTAVM +G EELHLVAM SR       C
Sbjct: 61   SSHGVCFKMESKTSQSQDTPLFLLHSSCVMENKTAVMPLGGEELHLVAMRSRNGDKRYPC 120

Query: 815  FWGFYAPQGLYNSSLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKITTEMDPQ 994
            FWGF    GLYNS LVMLNLRCLGIVFDLDETLIVANTMRSFEDRI+ALQRKI++E+DPQ
Sbjct: 121  FWGFSVAPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKISSEVDPQ 180

Query: 995  RVSGMLQEIKRYQDDKIILKQYVETDQIVENGRLIKVQSEVFPALSDNHQPIVRPLIRLQ 1174
            R+SGML EIKRYQDDK ILKQY E DQ+VENGR+IK QSE  PALSDNHQPI+RPLIRL 
Sbjct: 181  RISGMLAEIKRYQDDKFILKQYAENDQVVENGRVIKTQSEAVPALSDNHQPIIRPLIRLH 240

Query: 1175 EKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRL 1354
            EKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRL
Sbjct: 241  EKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRL 300

Query: 1355 LDPESNLISSNELLNRIVCVKAGLRKSLLNVFHGGICDPKMALVIDDRLKVWDEIDQPRV 1534
            LDP+SNLI+SN+LL+RIVCVK+G RKSL NVF   +C PKMALVIDDRLKVWD+ DQPRV
Sbjct: 301  LDPDSNLINSNKLLDRIVCVKSGSRKSLFNVFQESLCHPKMALVIDDRLKVWDDRDQPRV 360

Query: 1535 HVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKDFDESLLQRIIGIFYEDDVEE 1714
            HVVPAFAPYYAPQAEANN VPVLCVARNVACNVRGGFF++FD+SLLQ+I  +FYEDD+++
Sbjct: 361  HVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFREFDDSLLQKIPEVFYEDDIKD 420

Query: 1715 GPSPPDVSNFLISEDDGTA--ANRGPLRFEGMADVEVERKLKDIMPAFSIPN----NIDS 1876
             PS PDVSN+L+SEDD +A   NR PL F+G+ DVEVER++K+  PA S+ +    +ID 
Sbjct: 421  VPS-PDVSNYLVSEDDSSALNGNRDPLPFDGITDVEVERRMKEATPAASMVSSVFTSIDP 479

Query: 1877 RFSTLQQFVGSSSNPTVPQPSPQGSVMPFHDXXXXXXXXXXXSFNPVNRVGPSESSLQNS 2056
            R + LQ  V  SS  T+  P+ Q SVM F                P+  VG +E SLQ+S
Sbjct: 480  RLAPLQYTVPPSS--TLSLPTTQPSVMSFPSIQFPQAASL---VKPLGHVGSAEPSLQSS 534

Query: 2057 PAREEGEVPESELDPDTRRRLLILQHGQDVREHPPKESXXXXXXXXXXXXXXXXIQSRGN 2236
            PAREEGEVPESELDPDTRRRLLILQHGQD R+ PP E                  QSR  
Sbjct: 535  PAREEGEVPESELDPDTRRRLLILQHGQDTRDQPPSE--PPFPVRPPMQASVPRAQSRPG 592

Query: 2237 WFPSEEETSPPKQNRALPKQVQKDFSVESEAMHFEKSRPQRSSSF-QGTGAVQSDRTFHE 2413
            WFP EEE SP    R L + V KD  ++ E +  EK RP  SS F +   ++ SDR   E
Sbjct: 593  WFPVEEEMSP----RQLSRMVPKDLPLDPETVQIEKHRPHHSSFFPKVENSIPSDRILQE 648

Query: 2414 NLRLPKEVHIGDDRAKLSRTHPKYQSFSGEETPLGRSASNNKLSPSPKDEEMPLAQVAPS 2593
            N RLPKE    DDR + +     Y S SGEE PL RS+S+N                   
Sbjct: 649  NQRLPKEAFHRDDRLRFNHALSGYHSLSGEEIPLSRSSSSN------------------- 689

Query: 2594 SKDLQTESGRGTSQYPDTPAKVLQDIAIKCGSKVVFRSALVGSAELQFSIEVWFSGEKIG 2773
             +D+  ESGR  S   +TPA VLQ+IA+KCG+K                   WF+GEKIG
Sbjct: 690  -RDVDFESGRAISN-AETPAGVLQEIAMKCGAK------------------AWFAGEKIG 729

Query: 2774 EGIGKTRKEAQRQAAERSLRTLANRYLINTKPDPTMVYGDPNKPSNVDMNGYINDSNSFH 2953
            EG GKTR+EA  QAAE SL+ LAN YL   KPD   V+GD NK  NV+ NG+  + NSF 
Sbjct: 730  EGSGKTRREAHYQAAEGSLKNLANIYLSRVKPDSVSVHGDMNKFPNVNSNGFAGNLNSFG 789

Query: 2954 -QPFLNEDSVPASSTSEPSRYLNMRMEESQSNLDPISALKELCMDEGLKLDFRTRPTSST 3130
             QPF  E+S+ +S++SEPSR L+ R+E S+ ++  +S LKELCM EGL + F+ RP  ST
Sbjct: 790  IQPFPKEESLSSSTSSEPSRPLDPRLEGSKKSMSSVSTLKELCMMEGLGVVFQPRPPPST 849

Query: 3131 SLNYKGEAYAQVEIGGQVFGNGNGSTLDXXXXXXXXXXLMNLKVKLGLDTHKRIGSPRSL 3310
            +   K E + QVEI G+V G G G T D          L +L     L   KR GSPRSL
Sbjct: 850  NSVEKDEVHVQVEIDGEVLGKGIGLTWDEAKMQAAEKALGSLTST--LYAQKRQGSPRSL 907

Query: 3311 QAAVPNKRFKPEFPRVLQRIPPSARY 3388
            Q  + +KR K EFP+VLQR+P SARY
Sbjct: 908  Q-GMSSKRMKQEFPQVLQRMPSSARY 932


>ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Fragaria vesca subsp. vesca]
          Length = 955

 Score = 1041 bits (2693), Expect = 0.0
 Identities = 582/981 (59%), Positives = 688/981 (70%), Gaps = 12/981 (1%)
 Frame = +2

Query: 482  MVYEGNSLLGEVEIYPQNQQMDIIMLN-KEIRISTYSQPSERCPPLAVLHTIAPNGVSFK 658
            +VY+G  LLGEVE+YP+      I    KEIRIS +SQ SERCPP+AVLHTI+ NGV FK
Sbjct: 4    LVYKGEELLGEVEVYPEELNNKKIWDELKEIRISHFSQSSERCPPVAVLHTISSNGVCFK 63

Query: 659  MEASKSQS---EDSLLYTLHYSLLGENKTAVMGVGEEELHLVAMPSRRNPSHGACFWGFY 829
            ME+  S S   + S L+ LH S + ENKTAVM +G EELHLVAM SR N     CFWGF 
Sbjct: 64   MESKSSSSSSQDTSRLFLLHSSCIMENKTAVMNLGVEELHLVAMYSRNNQKQHPCFWGFS 123

Query: 830  APQGLYNSSLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKITTEMDPQRVSGM 1009
               GLY+S L MLNLRCLGIVFDLDETLIVANTMRSFEDRI+ LQRKI  E+D QR+SGM
Sbjct: 124  VSSGLYSSCLGMLNLRCLGIVFDLDETLIVANTMRSFEDRIEGLQRKIQCEVDAQRISGM 183

Query: 1010 LQEIKRYQDDKIILKQYVETDQIVENGRLIKVQSEVFPALSDNHQPIVRPLIRLQEKNII 1189
              EIKRYQDDK ILKQY E DQ+VENGR+IK QSEV PALSD+HQPI+RPLIRLQEKNII
Sbjct: 184  QAEIKRYQDDKFILKQYAENDQVVENGRVIKTQSEVVPALSDSHQPIIRPLIRLQEKNII 243

Query: 1190 LTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPES 1369
            LTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPES
Sbjct: 244  LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPES 303

Query: 1370 NLISSNELLNRIVCVKAGLRKSLLNVFHGGICDPKMALVIDDRLKVWDEIDQPRVHVVPA 1549
            NLI++N+LL+RIVCVK+GL+KSL NVF   +C PKMALVIDDRLKVWD+ DQPRVHVVPA
Sbjct: 304  NLINANKLLDRIVCVKSGLKKSLFNVFQESLCHPKMALVIDDRLKVWDDRDQPRVHVVPA 363

Query: 1550 FAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKDFDESLLQRIIGIFYEDDVEEGPSPP 1729
            FAPYYAPQAEANN VPVLCVARNVAC+VRGGFF++FD+SLLQ+I  IFYED++++  S P
Sbjct: 364  FAPYYAPQAEANNAVPVLCVARNVACSVRGGFFREFDDSLLQKIPEIFYEDNIKDF-SSP 422

Query: 1730 DVSNFLISEDDGTAA--NRGPLRFEGMADVEVERKLKDIMPA----FSIPNNIDSRFSTL 1891
            DVSNFL+SEDD +A+  NR  L F+GMAD EVER+LK+   A     S  +N D R ++L
Sbjct: 423  DVSNFLVSEDDASASNGNRDQLPFDGMADAEVERRLKEATSAAPTVSSAVSNNDPRLASL 482

Query: 1892 QQFVGSSSNPTVPQPSPQGSVMPFHDXXXXXXXXXXXSFNPVNRVGPSESSLQNSPAREE 2071
            Q  V  SS  TV  P+ Q S+MPFH+              P+  VGP++  L +SPAREE
Sbjct: 483  QYTVPLSS--TVSLPTNQPSMMPFHNVQFPQSASL---VKPLGHVGPADLGLHSSPAREE 537

Query: 2072 GEVPESELDPDTRRRLLILQHGQDVREHPPKESXXXXXXXXXXXXXXXXIQSRGNWFPSE 2251
            GEVPESELDPDTRRRLLILQHGQD RE  P E                 +QSRG WFP E
Sbjct: 538  GEVPESELDPDTRRRLLILQHGQDTRESVPSE--PSFPVRPQVQVSVPRVQSRGGWFPVE 595

Query: 2252 EETSPPKQNRALPKQVQKDFSVESEAMHFEKSRPQRSSSF-QGTGAVQSDRTFHENLRLP 2428
            EE SP K +R +PK+      + SE M  EK R   S+ F +   ++ SDR   EN RLP
Sbjct: 596  EEMSPRKLSRMVPKEP----PLNSEPMQIEKHRSHHSAFFPKVENSMPSDRILQENQRLP 651

Query: 2429 KEVHIGDDRAKLSRTHPKYQSFSGEETPLGRSASNNKLSPSPKDEEMPLAQVAPSSKDLQ 2608
            KE    D+R + ++    Y SFSGEE PL RS+S+N                    +D  
Sbjct: 652  KEAFHRDNRLRFNQAMSGYHSFSGEEPPLNRSSSSN--------------------RDFD 691

Query: 2609 TESGRGTSQYPDTPAKVLQDIAIKCGSKVVFRSALVGSAELQFSIEVWFSGEKIGEGIGK 2788
             ESGR  S   +TPA VLQ+IA+KCG+KV FR ALV S ELQF +E WF+GEKIGEG G+
Sbjct: 692  YESGRAISN-AETPAGVLQEIAMKCGTKVEFRPALVPSTELQFYVEAWFAGEKIGEGTGR 750

Query: 2789 TRKEAQRQAAERSLRTLANRYLINTKPDPTMVYGDPNKPSNVDMNGYINDSNSF-HQPFL 2965
            TR+EA  QAAE SL+ LAN Y+   KPD   ++GD +K SNV  NG++ + NSF  QP  
Sbjct: 751  TRREAHFQAAEGSLKNLANIYISRGKPDALPIHGDASKFSNVTNNGFMGNMNSFGTQPLP 810

Query: 2966 NEDSVPASSTSEPSRYLNMRMEESQSNLDPISALKELCMDEGLKLDFRTRPTSSTSLNYK 3145
             EDS+ +S++SEPSR L+ R++ S+ ++  +SALKELC  EGL + ++ RP    S   K
Sbjct: 811  KEDSLSSSTSSEPSRPLDPRLDNSRKSVSSVSALKELCTMEGLSVLYQPRPPPPNSTE-K 869

Query: 3146 GEAYAQVEIGGQVFGNGNGSTLDXXXXXXXXXXLMNLKVKLGLDTHKRIGSPRSLQAAVP 3325
             E + Q EI G+V G G G T D          L NL+    L   KR GSPR LQ  +P
Sbjct: 870  DEVHVQAEIDGEVLGKGIGLTWDEAKMQAAEKALGNLRST--LYGQKRQGSPRPLQ-GMP 926

Query: 3326 NKRFKPEFPRVLQRIPPSARY 3388
            +KR K EFP+VLQR+P S RY
Sbjct: 927  SKRLKQEFPQVLQRMPSSTRY 947


>ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
          Length = 956

 Score = 1034 bits (2674), Expect = 0.0
 Identities = 575/985 (58%), Positives = 693/985 (70%), Gaps = 10/985 (1%)
 Frame = +2

Query: 464  IKMFKSMVYEGNSLLGEVEIYPQNQQMDIIMLNKEIRISTYSQPSERCPPLAVLHTIAPN 643
            ++M+KS+VY+G  ++GEV++YP+          KEIRIS +SQPSERCPPLAVLHT+   
Sbjct: 1    MRMYKSVVYQGEVVVGEVDVYPEENNNYKNFHVKEIRISHFSQPSERCPPLAVLHTVTSC 60

Query: 644  GVSFKMEASKSQSEDSLLYTLHYSLLGENKTAVMGVGEEELHLVAMPSRRNPSHGACFWG 823
            GV FKME SK+Q +D L + LH   + ENKTAVM +G EE+HLVAM SR       CFWG
Sbjct: 61   GVCFKME-SKTQQQDGL-FQLHSLCIRENKTAVMPLGGEEIHLVAMHSRNVDR--PCFWG 116

Query: 824  FYAPQGLYNSSLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKITTEMDPQRVS 1003
            F    GLY+S LVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKI +E+DPQR+S
Sbjct: 117  FIVALGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRIS 176

Query: 1004 GMLQEIKRYQDDKIILKQYVETDQIVENGRLIKVQSEVFPALSDNHQPIVRPLIRLQEKN 1183
            GM  E+KRYQDDK ILKQY E DQ+V+NGR+IKVQSE+ PALSD+HQPIVRPLIRLQ+KN
Sbjct: 177  GMQAEVKRYQDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRLQDKN 236

Query: 1184 IILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP 1363
            IILTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP
Sbjct: 237  IILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP 296

Query: 1364 ESNLISSNELLNRIVCVKAGLRKSLLNVFHGGICDPKMALVIDDRLKVWDEIDQPRVHVV 1543
            +SNLI+S ELL RIVCVK+GL+KSL NVF  G+C PKMALVIDDRLKVWDE DQPRVHVV
Sbjct: 297  DSNLINSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVV 356

Query: 1544 PAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKDFDESLLQRIIGIFYEDDVEEGPS 1723
            PAFAPYYAPQAEA+N +PVLCVARNVACNVRGGFFKDFD+ LLQ+I  I YEDD+++ PS
Sbjct: 357  PAFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKDIPS 416

Query: 1724 PPDVSNFLISEDDGTAAN--RGPLRFEGMADVEVERKLKDIMPAFS-IP---NNIDSRFS 1885
            PPDVSN+L+SEDDG+ +N  R P  F+GMAD EVERKLKD + A S IP    N+D R +
Sbjct: 417  PPDVSNYLVSEDDGSISNGHRDPFLFDGMADAEVERKLKDALSAASTIPVTTANLDPRLT 476

Query: 1886 TLQQFVGSSSNPTVPQPSPQGSVMPFHDXXXXXXXXXXXSFNPVNRVGPSESSLQNSPAR 2065
            +LQ  +  S   +VP P+ Q S+MPF                P+ +  PSE SL +SPAR
Sbjct: 477  SLQYTMVPSG--SVPPPTAQASMMPF---PHVQFPQPATLVKPMGQAAPSEPSLHSSPAR 531

Query: 2066 EEGEVPESELDPDTRRRLLILQHGQDVREHPPKESXXXXXXXXXXXXXXXXIQSRGNWFP 2245
            EEGEVPESELDPDTRRRLLILQHGQD R+H   E                   SRG WFP
Sbjct: 532  EEGEVPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQTSAPHVP-SSRGVWFP 590

Query: 2246 SEEETSPPKQNRALPKQVQKDFSVESEAMHFEKSRPQRSSSFQGT-GAVQSDRTFHE-NL 2419
            +EEE      NR +P    K+F V+S  +   K RP   S F     ++ SDR  H+ + 
Sbjct: 591  AEEEIGSQPLNRVVP----KEFPVDSGPLGIAKPRPHHPSFFSKVESSISSDRILHDSHQ 646

Query: 2420 RLPKEVHIGDDRAKLSRTHPKYQSFSGEETPLGRSASNNKLSPSPKDEEMPLAQVAPSSK 2599
            RLPKE++  DDR +L+     Y+SFSG++ P  RS S                    S +
Sbjct: 647  RLPKEMYHRDDRPRLNHMLSSYRSFSGDDIPFSRSFS--------------------SHR 686

Query: 2600 DLQTESGRGTSQYPDTPAKVLQDIAIKCGSKVVFRSALVGSAELQFSIEVWFSGEKIGEG 2779
            DL +ESG     + DTP  VLQ+IA+KCG+KV F S+LV S ELQFS+E WFSG+KIG  
Sbjct: 687  DLDSESGHSV-LHADTPVAVLQEIALKCGTKVDFISSLVASTELQFSMEAWFSGKKIGHR 745

Query: 2780 IGKTRKEAQRQAAERSLRTLANRYLINTKPDPTMVYGDPNKPSNVDMNGYINDSNSF-HQ 2956
            +G+TRKEAQ +AAE S++ LA+ YL + K +P   YGD +   NV+ +GY+  ++S  +Q
Sbjct: 746  VGRTRKEAQNKAAEDSIKHLADIYLSSAKDEPGSTYGDVSGFPNVNDSGYMGIASSLGNQ 805

Query: 2957 PFLNEDSVPASSTSEPSRYLNMRMEESQSNLDPISALKELCMDEGLKLDFRTRPTS-STS 3133
            P   EDS  + ST+ PSR L+ R++ S+ ++  IS+LKELCM EGL ++F + P   ST+
Sbjct: 806  PLSKEDSA-SFSTASPSRVLDPRLDVSKRSMGSISSLKELCMMEGLDVNFLSAPAPVSTN 864

Query: 3134 LNYKGEAYAQVEIGGQVFGNGNGSTLDXXXXXXXXXXLMNLKVKLGLDTHKRIGSPRSLQ 3313
               K E +AQVEI G+VFG G G T D          L +L+ KLG    KR  SPR  Q
Sbjct: 865  SVQKDEVHAQVEIDGKVFGKGIGLTWDEAKMQAAEKALGSLRSKLGQSIQKRQSSPRPHQ 924

Query: 3314 AAVPNKRFKPEFPRVLQRIPPSARY 3388
                NKR K E+PR +QR+P SARY
Sbjct: 925  -GFSNKRLKQEYPRPMQRMPSSARY 948


>ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Populus trichocarpa]
            gi|550327613|gb|ERP55122.1| hypothetical protein
            POPTR_0011s04910g [Populus trichocarpa]
          Length = 990

 Score = 1030 bits (2662), Expect = 0.0
 Identities = 580/1015 (57%), Positives = 693/1015 (68%), Gaps = 42/1015 (4%)
 Frame = +2

Query: 470  MFKSMVYEGNSLLGEVEIYPQNQQMD-----------IIMLNKEIRISTYSQPSERCPPL 616
            M+KS+VY+G  LLGEVEIY Q QQ +           I  + K IRIS +SQ SERCPPL
Sbjct: 1    MYKSVVYKGEELLGEVEIYAQEQQQEEEENKNKRKRVIDEIVKGIRISHFSQASERCPPL 60

Query: 617  AVLHTIAPNGVSFKMEASKSQS-------EDSLLYTLHYSLLGENKTAVMGVGEEELHLV 775
            AVLHTI   GV FKME S + S       ++S L  LH S + ENKTAVM +G EELHLV
Sbjct: 61   AVLHTITSIGVCFKMEESTASSSTKISSQQESPLRLLHSSCIQENKTAVMLLGGEELHLV 120

Query: 776  AMPSRRNPSHGACFWGFYAPQGLYNSSLVMLNLRCLGIVFDLDETLIVANTMRSFEDRID 955
            AMPSR N     CFWGF    GLY+S LVMLNLRCLGIVFDLDETLIVANTMRSFED+I+
Sbjct: 121  AMPSRSNERKHPCFWGFNVASGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDKIE 180

Query: 956  ALQRKITTEMDPQRVSGMLQEIKRYQDDKIILKQYVETDQIVENGRLIKVQSEVFPALSD 1135
            ALQ+KI+TE+D QR+  ++ EIKRYQDDKIILKQYVE DQ++ENG++IK Q EV PA SD
Sbjct: 181  ALQKKISTEVDQQRILAIISEIKRYQDDKIILKQYVENDQVIENGKVIKTQFEVVPAASD 240

Query: 1136 NHQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT 1315
            NHQP+VRPLIRL EKNII TRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT
Sbjct: 241  NHQPLVRPLIRLPEKNIIFTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT 300

Query: 1316 MAERDYALEMWRLLDPESNLISSNELLNRIVCVKAGLRKSLLNVFHGGICDPKMALVIDD 1495
            MAERDYALEMWRLLDPESNLI+SNELL+RIVCV +G RKSL NVF  GIC PKMALVIDD
Sbjct: 301  MAERDYALEMWRLLDPESNLINSNELLDRIVCVSSGSRKSLFNVFQDGICHPKMALVIDD 360

Query: 1496 RLKVWDEIDQPRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKDFDESLLQ 1675
            R+ VWDE DQ RVHVVPAFAPYYAPQAEANN VP+LCVARNVACNVRGGFFK+FDE LLQ
Sbjct: 361  RMNVWDEKDQSRVHVVPAFAPYYAPQAEANNAVPILCVARNVACNVRGGFFKEFDEGLLQ 420

Query: 1676 RIIGIFYEDDVEEGPSPPDVSNFLISEDDGTAA--NRGPLRFEGMADVEVERKLKDIMPA 1849
            +I  + YEDD    PSPPDVSN+L+SEDD +AA  NR P  F+  AD EVER+LK+ + A
Sbjct: 421  KIPEVAYEDDTSNIPSPPDVSNYLVSEDDASAANGNRDPPSFDSTADAEVERRLKEAVSA 480

Query: 1850 FS-IPNNIDSRFS--------TLQQFVGSSSN------PTV-----PQPSPQGSVMPFHD 1969
             S IP+ I S  S        +LQ  V SSS+      P++     P P+ Q S+MPF +
Sbjct: 481  SSTIPSTIPSTVSSLDPRLLQSLQYAVASSSSLMPASQPSMLASQQPVPASQTSMMPFPN 540

Query: 1970 XXXXXXXXXXXSFNPVNRVGPSESSLQNSPAREEGEVPESELDPDTRRRLLILQHGQDVR 2149
                           V  V P E SLQ+SPAREEGEVPESELDPDTRRRLLILQHGQD R
Sbjct: 541  TQFPQVAPLVKQLGQV--VHP-EPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDSR 597

Query: 2150 EHPPKESXXXXXXXXXXXXXXXXIQSRGNWFPSEEETSPPKQNRALPKQVQKDFSVESEA 2329
            ++ P ES                +QSRG+W P EEE +P + NR       ++F ++S+ 
Sbjct: 598  DNAPSES--PFPARPSAPVSAAHVQSRGSWVPVEEEMTPRQLNR-----TPREFPLDSDP 650

Query: 2330 MHFEKSRPQRSSSFQGTGA-VQSDRTFHENLRLPKEVHIGDDRAKLSRTHPKYQSFSGEE 2506
            M+ EK +    S F    + + SDR  HEN RLPKE    +DR +L+ + P Y SF  EE
Sbjct: 651  MNIEKHQTHHPSFFPKVESNIPSDRMIHENQRLPKEAPYRNDRMRLNHSTPNYHSFQVEE 710

Query: 2507 TPLGRSASNNKLSPSPKDEEMPLAQVAPSSKDLQTESGRGTSQYPDTPAKVLQDIAIKCG 2686
            TPL RS+SN                     +DL  ES R  +   +TP +VLQ+IA+KC 
Sbjct: 711  TPLSRSSSN---------------------RDLDLESERAFT-ISETPVEVLQEIAMKCE 748

Query: 2687 SKVVFRSALVGSAELQFSIEVWFSGEKIGEGIGKTRKEAQRQAAERSLRTLANRYLINTK 2866
            +KV FR ALV S +LQFSIE WF+GEK+GEG GKTR+EAQRQAAE S++ LA  Y++  K
Sbjct: 749  TKVEFRPALVASIDLQFSIEAWFAGEKVGEGTGKTRREAQRQAAEGSIKKLAGIYMLRAK 808

Query: 2867 PDPTMVYGDPNKPSNVDMNGYINDSNSF-HQPFLNEDSVPASSTSEPSRYLNMRMEESQS 3043
            PD   ++GD ++  + + NG++ + N F +QP   ++ V  S+ SEPSR L+ R+E S+ 
Sbjct: 809  PDSGPMHGDSSRYPSANDNGFLGNMNLFGNQPLPKDELVAYSAASEPSRLLDPRLEGSKK 868

Query: 3044 NLDPISALKELCMDEGLKLDFRTRPTSSTSLNYKGEAYAQVEIGGQVFGNGNGSTLDXXX 3223
            +   ++ALKE C  EGL ++F  +   S +     E +AQVEI GQV G G GST D   
Sbjct: 869  SSGSVTALKEFCTMEGLVVNFLAQTPLSANSIPGEEVHAQVEIDGQVLGKGIGSTWDEAK 928

Query: 3224 XXXXXXXLMNLKVKLGLDTHKRIGSPRSLQAAVPNKRFKPEFPRVLQRIPPSARY 3388
                   L +L+   G  T KR GSPR +Q  +PNKR K EFPRVLQR+PPSARY
Sbjct: 929  MQAAEKALGSLRTMFGQYTQKRQGSPRPMQ-GMPNKRLKQEFPRVLQRMPPSARY 982


>ref|XP_007025681.1| C-terminal domain phosphatase-like 1 isoform 2 [Theobroma cacao]
            gi|508781047|gb|EOY28303.1| C-terminal domain
            phosphatase-like 1 isoform 2 [Theobroma cacao]
          Length = 984

 Score = 1028 bits (2658), Expect = 0.0
 Identities = 574/977 (58%), Positives = 685/977 (70%), Gaps = 32/977 (3%)
 Frame = +2

Query: 470  MFKSMVYEGNSLLGEVEIYPQNQ------------QMDIIMLN---KEIRISTYSQPSER 604
            M+KS+VY G  +LGEVEIYPQ Q            +  I+++    KEIRI   +Q SER
Sbjct: 4    MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63

Query: 605  CPPLAVLHTIAPNGVSFKMEASK----SQSEDSL-LYTLHYSLLGENKTAVMGVGEEELH 769
            CPPLAVLHTI  +G+ FKME+SK    S S+DS  L+ LH   + +NKTAVM +G+ ELH
Sbjct: 64   CPPLAVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCELH 123

Query: 770  LVAMPSRRNPSHGACFWGFYAPQGLYNSSLVMLNLRCLGIVFDLDETLIVANTMRSFEDR 949
            LVAM SR   S   CFWGF   +GLY+S L+MLNLRCLGIVFDLDETLIVANTMRSFEDR
Sbjct: 124  LVAMYSRN--SDRPCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSFEDR 181

Query: 950  IDALQRKITTEMDPQRVSGMLQEIKRYQDDKIILKQYVETDQIVENGRLIKVQSEVFPAL 1129
            I+ALQRK+TTE+DPQRV+GM+ E+KRYQDDK ILKQY E DQ+VENG++IK+QSEV PAL
Sbjct: 182  IEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVVPAL 241

Query: 1130 SDNHQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 1309
            SDNHQPI+RPLIRLQEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV
Sbjct: 242  SDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 301

Query: 1310 CTMAERDYALEMWRLLDPESNLISSNELLNRIVCVKAGLRKSLLNVFHGGICDPKMALVI 1489
            CTMAERDYALEMWRLLDPESNLI+S ELL+RIVCVK+G RKSL NVF  GIC PKMALVI
Sbjct: 302  CTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVI 361

Query: 1490 DDRLKVWDEIDQPRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKDFDESL 1669
            DDRLKVWDE DQPRVHVVPAFAPYYAPQAEANN +PVLCVARNVACNVRGGFF++FDE L
Sbjct: 362  DDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGL 421

Query: 1670 LQRIIGIFYEDDVEEGPSPPDVSNFLISEDDGTA--ANRGPLRFEGMADVEVERKLKDIM 1843
            LQRI  I YEDD+++ PSPPDV N+L+SEDD +A   N+ PL F+GMAD EVER+LK+ +
Sbjct: 422  LQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERRLKEAI 481

Query: 1844 PAFSIPN----NIDSRFSTLQQFVGSSSNPTVPQPSPQGSVMPFHDXXXXXXXXXXXSFN 2011
             A S  +    N+D R +   Q+   SS+ ++P  + Q S++ F +              
Sbjct: 482  SATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPV---VK 538

Query: 2012 PVNRVGPSESSLQNSPAREEGEVPESELDPDTRRRLLILQHGQDVREHPPKESXXXXXXX 2191
            PV  V   E SLQ+SPAREEGEVPESELDPDTRRRLLILQHGQD R+H P E        
Sbjct: 539  PVAPVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPVRP 598

Query: 2192 XXXXXXXXXIQSRGNWFPSEEETSPPKQNRALPKQVQKDFSVESEAMHFEKSR-----PQ 2356
                      QSRG+WF +EEE SP + NRA P    K+F ++SE MH EK R     P+
Sbjct: 599  TMQVSVPRG-QSRGSWFAAEEEMSPRQLNRAAP----KEFPLDSERMHIEKHRHPPFFPK 653

Query: 2357 RSSSFQGTGAVQSDRTFHENLRLPKEVHIGDDRAKLSRTHPKYQSFSGEETPLGRSASNN 2536
              SS      + SDR   EN RL KE    DDR  L+ T   Y SFSGE           
Sbjct: 654  VESS------IPSDRLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGE----------- 696

Query: 2537 KLSPSPKDEEMPLAQVAPSSKDLQTESGRGTSQYPDTPAKVLQDIAIKCGSKVVFRSALV 2716
                     EMPL+Q + S +DL  ESGR T    +T A VLQDIA+KCG+KV FR ALV
Sbjct: 697  ---------EMPLSQSSSSHRDLDFESGR-TVTSGETSAGVLQDIAMKCGAKVEFRPALV 746

Query: 2717 GSAELQFSIEVWFSGEKIGEGIGKTRKEAQRQAAERSLRTLANRYLINTKPDPTMVYGDP 2896
             S +LQFSIE WF+GEK+GEG+G+TR+EAQRQAAE S++ LAN YL   KPD     GD 
Sbjct: 747  ASLDLQFSIEAWFAGEKVGEGVGRTRREAQRQAAEESIKNLANTYLSRIKPDSGSAEGDL 806

Query: 2897 NKPSNVDMNGYINDSNSF-HQPFLNEDSVPASSTSEPSRYLNMRMEESQSNLDPISALKE 3073
            ++  N++ NG+ ++ NSF +Q    E+S+  S+ SE SR  + R+E S+ ++  ++ALKE
Sbjct: 807  SRLHNINDNGFPSNVNSFGNQLLAKEESLSFSTASEQSRLADPRLEGSKKSMGSVTALKE 866

Query: 3074 LCMDEGLKLDFRTRPTSSTSLNYKGEAYAQVEIGGQVFGNGNGSTLDXXXXXXXXXXLMN 3253
            LCM EGL + F+ +P SS++   K E YAQVEI GQV G G G T +          L +
Sbjct: 867  LCMMEGLGVVFQPQPPSSSNALQKDEVYAQVEIDGQVLGKGTGLTWEEAKMQAAEKALGS 926

Query: 3254 LKVKLGLDTHKRIGSPR 3304
            L+  LG  + KR GSPR
Sbjct: 927  LRSMLGQYSQKRQGSPR 943


>ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Glycine max]
          Length = 960

 Score = 1028 bits (2657), Expect = 0.0
 Identities = 573/989 (57%), Positives = 688/989 (69%), Gaps = 16/989 (1%)
 Frame = +2

Query: 470  MFKSMVYEGNSLLGEVEIYPQ------NQQMDIIMLNKEIRISTYSQPSERCPPLAVLHT 631
            M+KS+VY+G  ++GEV++YP+      N+  +     KEIRIS +SQPSERCPPLAVLHT
Sbjct: 1    MYKSVVYQGEVVVGEVDVYPEENNNNNNKNYNKNFHVKEIRISHFSQPSERCPPLAVLHT 60

Query: 632  IAPNGVSFKMEASKSQSEDSLLYTLHYSLLGENKTAVMGVGEEELHLVAMPSRRNPSHGA 811
            +   GV FKME SK+Q +D L + LH   + ENKTAVM +G EE+HLVAM SR +     
Sbjct: 61   VTSCGVCFKME-SKTQQQDGL-FQLHSLCIRENKTAVMPLGGEEIHLVAMHSRNDDR--P 116

Query: 812  CFWGFYAPQGLYNSSLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKITTEMDP 991
            CFWGF    GLY+S LVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKI +E+DP
Sbjct: 117  CFWGFIVTLGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDP 176

Query: 992  QRVSGMLQEIKRYQDDKIILKQYVETDQIVENGRLIKVQSEVFPALSDNHQPIVRPLIRL 1171
            QR+SGM  E+KRY DDK ILKQY E DQ+V+NGR+IKVQSE+ PALSD+HQPIVRPLIRL
Sbjct: 177  QRISGMQAEVKRYLDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRL 236

Query: 1172 QEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWR 1351
            Q+KNIILTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWR
Sbjct: 237  QDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWR 296

Query: 1352 LLDPESNLISSNELLNRIVCVKAGLRKSLLNVFHGGICDPKMALVIDDRLKVWDEIDQPR 1531
            LLDP+SNLI+S ELL RIVCVK+GL+KSL NVF  G CDPKMALVIDDRLKVWDE DQPR
Sbjct: 297  LLDPDSNLINSKELLGRIVCVKSGLKKSLFNVFQDGSCDPKMALVIDDRLKVWDERDQPR 356

Query: 1532 VHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKDFDESLLQRIIGIFYEDDVE 1711
            VHVVPAFAPYYAPQAEA+N +PVLCVARNVACNVRGGFFKDFD+ LLQ+I  I YEDD++
Sbjct: 357  VHVVPAFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIK 416

Query: 1712 EGPSPPDVSNFLISEDDGTAA--NRGPLRFEGMADVEVERKLKDIMPAFS----IPNNID 1873
            + PSPPDVSN+L+SEDDG+ +  NR P  F+GMAD EVERKLKD + A S       N+D
Sbjct: 417  DVPSPPDVSNYLVSEDDGSISNGNRDPFLFDGMADAEVERKLKDALAAASTFPVTTANLD 476

Query: 1874 SRFSTLQQFVGSSSNPTVPQPSPQGSVMPFHDXXXXXXXXXXXSFNPVNRVGPSESSLQN 2053
             R ++LQ  +  S   +VP P+ Q S+MPF                P+ +  PS+ SL +
Sbjct: 477  PRLTSLQYTMVPSG--SVPPPTAQASMMPF---PHVQFPQPATLVKPMGQAAPSDPSLHS 531

Query: 2054 SPAREEGEVPESELDPDTRRRLLILQHGQDVREHPPKESXXXXXXXXXXXXXXXXIQSRG 2233
            SPAREEGEVPESELDPDTRRRLLILQHGQD R+H   E                   SRG
Sbjct: 532  SPAREEGEVPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQASAPRVP-SSRG 590

Query: 2234 NWFPSEEETSPPKQNRALPKQVQKDFSVESEAMHFEKSRPQRSSSFQGT-GAVQSDRTFH 2410
             WFP EEE      NR +P    K+F V+S  +  EK R    S F     ++ SDR  H
Sbjct: 591  VWFPVEEEIGSQPLNRVVP----KEFPVDSGPLGIEKPRLHHPSFFNKVESSISSDRILH 646

Query: 2411 E-NLRLPKEVHIGDDRAKLSRTHPKYQSFSGEETPLGRSASNNKLSPSPKDEEMPLAQVA 2587
            + + RLPKE++  DDR +L+     Y+SFSG++ P  RS+S                   
Sbjct: 647  DSHQRLPKEMYHRDDRPRLNHMLSSYRSFSGDDIPFSRSSS------------------- 687

Query: 2588 PSSKDLQTESGRGTSQYPDTPAKVLQDIAIKCGSKVVFRSALVGSAELQFSIEVWFSGEK 2767
             S +DL +ESG     + DTP  VL +IA+KCG+KV F S+LV S EL+FS+E WFSG+K
Sbjct: 688  -SHRDLDSESGHSV-LHADTPVAVLHEIALKCGTKVDFMSSLVASTELKFSLEAWFSGKK 745

Query: 2768 IGEGIGKTRKEAQRQAAERSLRTLANRYLINTKPDPTMVYGDPNKPSNVDMNGYINDSNS 2947
            IG G G+TRKEAQ +AA+ S+  LA+ YL + K +P   YGD +   NV+ NGY+  ++S
Sbjct: 746  IGHGFGRTRKEAQNKAAKDSIEHLADIYLSSAKDEPGSTYGDVSGFPNVNDNGYMGIASS 805

Query: 2948 F-HQPFLNEDSVPASSTSEPSRYLNMRMEESQSNLDPISALKELCMDEGLKLDFRTRPTS 3124
              +QP   EDS   SS S PSR L+ R++ S+ ++  ISALKELCM EGL ++F + P  
Sbjct: 806  LGNQPLSKEDSASFSSAS-PSRALDPRLDVSKRSMGSISALKELCMMEGLGVNFLSTPAP 864

Query: 3125 -STSLNYKGEAYAQVEIGGQVFGNGNGSTLDXXXXXXXXXXLMNLKVKLGLDTHKRIGSP 3301
             ST+   K E +AQVEI G++FG G G T D          L NL+ KLG    K   SP
Sbjct: 865  VSTNSVQKDEVHAQVEIDGKIFGKGIGLTWDEAKMQAAEKALGNLRSKLGQSIQKMQSSP 924

Query: 3302 RSLQAAVPNKRFKPEFPRVLQRIPPSARY 3388
            R  Q    NKR K E+PR +QR+P SARY
Sbjct: 925  RPHQ-GFSNKRLKQEYPRTMQRMPSSARY 952


>ref|XP_007159305.1| hypothetical protein PHAVU_002G226900g [Phaseolus vulgaris]
            gi|561032720|gb|ESW31299.1| hypothetical protein
            PHAVU_002G226900g [Phaseolus vulgaris]
          Length = 964

 Score = 1023 bits (2645), Expect = 0.0
 Identities = 571/994 (57%), Positives = 687/994 (69%), Gaps = 21/994 (2%)
 Frame = +2

Query: 470  MFKSMVYEGNSLLGEVEIYPQNQQMDIIMLNKEIRISTYSQPSERCPPLAVLHTIAPNGV 649
            M+KS+VY+G  +LGEVE+YP+        + KEIRIS +SQPSERCPPLAVLHT+   GV
Sbjct: 1    MYKSVVYQGEVVLGEVEVYPEENNYKNFHV-KEIRISHFSQPSERCPPLAVLHTVTSCGV 59

Query: 650  SFKMEASKSQSEDSLLYTLHYSLLGENKTAVMGVGEEELHLVAMPSRRNPSHGACFWGFY 829
             FKME SK+Q +D L + LH   + ENKTAV+ +G EE+HLVAM SR +      FWGF 
Sbjct: 60   CFKME-SKTQQQDGLFH-LHSLCIRENKTAVIPLGGEEIHLVAMHSRNDDRPR--FWGFI 115

Query: 830  APQGLYNSSLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKITTEMDPQRVSGM 1009
               GLY+S LVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKI +E+DPQR+SGM
Sbjct: 116  VALGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISGM 175

Query: 1010 LQEIKRYQDDKIILKQYVETDQIVENGRLIKVQSEVFPALSDNHQPIVRPLIRLQEKNII 1189
              E+KRYQ+DK ILKQY E DQ+V+NGR++KVQSE+ PALSDNHQPIVRPLIRLQ+KNII
Sbjct: 176  QAEVKRYQEDKNILKQYAENDQVVDNGRVVKVQSEIVPALSDNHQPIVRPLIRLQDKNII 235

Query: 1190 LTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPES 1369
            LTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP+S
Sbjct: 236  LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDS 295

Query: 1370 NLISSNELLNRIVCVKAGLRKSLLNVFHGGICDPKMALVIDDRLKVWDEIDQPRVHVVPA 1549
            NLI+S ELL RIVCVK+GL+KSL NVF  G+C PKMALVIDDRLKVWDE DQPRVHVVPA
Sbjct: 296  NLINSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPA 355

Query: 1550 FAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKDFDESLLQRIIGIFYEDDVEEGPSPP 1729
            FAPYYAPQAEA+N +PVLCVARNVACNVRGGFFK+FD+ LLQ+I  + YEDD+++ P PP
Sbjct: 356  FAPYYAPQAEASNSIPVLCVARNVACNVRGGFFKEFDDGLLQKIPQVAYEDDIKDIPIPP 415

Query: 1730 DVSNFLISEDDGTAA----NRGPLRFEGMADVEVERKLK----------DIMPAFSIP-- 1861
            DVSN+L+SEDDG++A    NR P  F+ M D EVERK K           +  A +IP  
Sbjct: 416  DVSNYLVSEDDGSSAISNGNRDPFLFDSMGDAEVERKSKVPTRAPNEHDALSAASTIPVT 475

Query: 1862 -NNIDSRFSTLQQFVGSSSNPTVPQPSPQGSVMPFHDXXXXXXXXXXXSFNPVNRVGPSE 2038
              N+D R ++LQ  + SS   + P P+ Q S+MPF                P+ +  PSE
Sbjct: 476  TANLDPRLTSLQYAMVSSG--SAPPPTAQASMMPF---THVQFPQPAALVKPMGQAAPSE 530

Query: 2039 SSLQNSPAREEGEVPESELDPDTRRRLLILQHGQDVREHPPKESXXXXXXXXXXXXXXXX 2218
            SSL +SPAREEGEVPESELDPDTRRRLLILQHGQD R+H   E                 
Sbjct: 531  SSLHSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTSNE--PTYAIRHPVPVSAPR 588

Query: 2219 IQSRGNWFPSEEETSPPKQNRALPKQVQKDFSVESEAMHFEKSRPQRSSSFQGT-GAVQS 2395
            + SRG WFP+EE+      NR +P    K+FSV+S ++  EK RP   S F     ++ S
Sbjct: 589  VSSRGGWFPAEEDIGSQPLNRVVP----KEFSVDSGSLVIEKHRPHHPSFFSKVESSISS 644

Query: 2396 DRTFHE-NLRLPKEVHIGDDRAKLSRTHPKYQSFSGEETPLGRSASNNKLSPSPKDEEMP 2572
            DR  H+ + RLPKE++  DDR + +     Y+S S +E P  RS+S              
Sbjct: 645  DRILHDSHQRLPKEMYHRDDRPRSNHMLSSYRSLSVDEIPFSRSSS-------------- 690

Query: 2573 LAQVAPSSKDLQTESGRGTSQYPDTPAKVLQDIAIKCGSKVVFRSALVGSAELQFSIEVW 2752
                  S +DL +ES      + DTP  VLQ+IA+KCG+KV F S+LV S ELQFSIE W
Sbjct: 691  ------SHRDLDSESSHSVF-HADTPVVVLQEIALKCGTKVEFMSSLVASTELQFSIEAW 743

Query: 2753 FSGEKIGEGIGKTRKEAQRQAAERSLRTLANRYLINTKPDPTMVYGDPNKPSNVDMNGYI 2932
            FSG+KIG G G+TRKEAQ +AAE S++ LA+ YL + K +P   YGD     N + NGY+
Sbjct: 744  FSGKKIGHGFGRTRKEAQHKAAEDSIKHLADIYLSSAKDEPGSTYGDVGGFPNANDNGYM 803

Query: 2933 NDSNSF-HQPFLNEDSVPASSTSEPSRYLNMRMEESQSNLDPISALKELCMDEGLKLDFR 3109
              ++S  +QP   EDS   S+ S+PSR L+ R+E S+  +  ISALKELCM EGL ++F 
Sbjct: 804  VIASSLSNQPLPKEDSASFSTASDPSRVLDPRLEVSKRPMGSISALKELCMMEGLGVNFL 863

Query: 3110 TRPTS-STSLNYKGEAYAQVEIGGQVFGNGNGSTLDXXXXXXXXXXLMNLKVKLGLDTHK 3286
            + P   ST+   K E +AQVEI G+VFG G G T D          L +L+ KLG    K
Sbjct: 864  SAPAPVSTNSLQKDEVHAQVEIDGKVFGKGIGLTWDEAKMQAAEKALGSLRSKLGQSIQK 923

Query: 3287 RIGSPRSLQAAVPNKRFKPEFPRVLQRIPPSARY 3388
            R  SPRS Q    NKR K E+PR +QRIP S RY
Sbjct: 924  RQSSPRSHQ-GFSNKRLKQEYPRAMQRIPSSTRY 956


>emb|CAN72816.1| hypothetical protein VITISV_004100 [Vitis vinifera]
          Length = 894

 Score = 1015 bits (2625), Expect = 0.0
 Identities = 572/977 (58%), Positives = 666/977 (68%), Gaps = 4/977 (0%)
 Frame = +2

Query: 470  MFKSMVYEGNSLLGEVEIYPQNQQMDIIMLNKEIRISTYSQPSERCPPLAVLHTIAPNGV 649
            M+KS+VYEG+ ++GEVEIYPQNQ ++++   KEIRIS YSQPSERCPPLAVLHTI   GV
Sbjct: 1    MYKSIVYEGDDVVGEVEIYPQNQGLELM---KEIRISHYSQPSERCPPLAVLHTITSCGV 57

Query: 650  SFKMEASKSQSEDSLLYTLHYSLLGENKTAVMGVGEEELHLVAMPSRRNPSHGACFWGFY 829
             FKME+SK+QS+D+ LY LH + + ENKTAVM +GEEELHLVAM S++      CFWGF 
Sbjct: 58   CFKMESSKAQSQDTPLYLLHSTCIRENKTAVMSLGEEELHLVAMYSKKKDGQYPCFWGFN 117

Query: 830  APQGLYNSSLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKITTEMDPQRVSGM 1009
               GLY+S LVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKI TE+DPQR+SGM
Sbjct: 118  VALGLYSSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEVDPQRISGM 177

Query: 1010 LQEIKRYQDDKIILKQYVETDQIVENGRLIKVQSEVFPALSDNHQPIVRPLIRLQEKNII 1189
            + E+                   VENG+L K Q E+ PALSDNHQPIVRPLIRLQEKNII
Sbjct: 178  VAEV-------------------VENGKLFKTQPEIVPALSDNHQPIVRPLIRLQEKNII 218

Query: 1190 LTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPES 1369
            LTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPES
Sbjct: 219  LTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPES 278

Query: 1370 NLISSNELLNRIVCVKAGLRKSLLNVFHGGICDPKMALVIDDRLKVWDEIDQPRVHVVPA 1549
            NLI+S ELL+RIVCVK+G RKSL NVF  GIC PKMALVIDDRLKVWDE DQPRVHVVPA
Sbjct: 279  NLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVIDDRLKVWDEKDQPRVHVVPA 338

Query: 1550 FAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKDFDESLLQRIIGIFYEDDVEEGPSPP 1729
            FAPYYAPQAEANN + VLCVARNVACNVRGGFFK+FDE LLQRI  I YED++++  S P
Sbjct: 339  FAPYYAPQAEANNAISVLCVARNVACNVRGGFFKEFDEGLLQRIPEISYEDBIKDIRSAP 398

Query: 1730 DVSNFLISEDDGTAA--NRGPLRFEGMADVEVERKLKDIMPAFSIPNNIDSRFSTLQQFV 1903
            DVSN+L+SEDD + +  NR    F+GMADVEVERKLKD + A S   ++D R S   QF 
Sbjct: 399  DVSNYLVSEDDASVSNGNRDQPCFDGMADVEVERKLKDAISAPSTVTSLDPRLSPPLQFA 458

Query: 1904 GSSSNPTVPQPSPQGSVMPFHDXXXXXXXXXXXSFNPVNRVGPSESSLQNSPAREEGEVP 2083
             ++S+   PQP+ QGS+MPF +              P       E ++Q+SPAREEGEVP
Sbjct: 459  VAASSGLAPQPAAQGSIMPFSNKQFPQSASLIKPLAP-------EPTMQSSPAREEGEVP 511

Query: 2084 ESELDPDTRRRLLILQHGQDVREHPPKESXXXXXXXXXXXXXXXXIQSRGNWFPSEEETS 2263
            ESELDPDTRRRLLILQHGQD REH    S                +QSRG+WFP++EE S
Sbjct: 512  ESELDPDTRRRLLILQHGQDTREH--ASSDPPFPVRPPIQVSVPRVQSRGSWFPADEEMS 569

Query: 2264 PPKQNRALPKQVQKDFSVESEAMHFEKSRPQRSSSFQGT-GAVQSDRTFHENLRLPKEVH 2440
            P + NRA+P    K+F ++S+ MH EK RP   S F     +  SDR  HEN RL KEV 
Sbjct: 570  PRQLNRAVP----KEFPLDSDTMHIEKHRPHHPSFFHKVESSASSDRILHENQRLSKEVL 625

Query: 2441 IGDDRAKLSRTHPKYQSFSGEETPLGRSASNNKLSPSPKDEEMPLAQVAPSSKDLQTESG 2620
              DDR +L+ + P Y SFSGEE PLGRS+SN                     +DL  ESG
Sbjct: 626  HRDDRLRLNHSLPGYHSFSGEEVPLGRSSSN---------------------RDLDFESG 664

Query: 2621 RGTSQYPDTPAKVLQDIAIKCGSKVVFRSALVGSAELQFSIEVWFSGEKIGEGIGKTRKE 2800
            RG + Y +TPA  L                      L+   EVW  GEKIGEG GKTR+E
Sbjct: 665  RG-APYAETPAVGL----------------------LRNCNEVWNQGEKIGEGTGKTRRE 701

Query: 2801 AQRQAAERSLRTLANRYLINTKPDPTMVYGDPNKPSNVDMNGYINDSNSF-HQPFLNEDS 2977
            AQ QAAE SL  L+ RYL          +GD N+  N   N +++D+NSF +Q F  E S
Sbjct: 702  AQCQAAEASLMYLSYRYL----------HGDVNRFPNASDNNFMSDTNSFGYQSFPKEGS 751

Query: 2978 VPASSTSEPSRYLNMRMEESQSNLDPISALKELCMDEGLKLDFRTRPTSSTSLNYKGEAY 3157
            +  S+ SE SR L+ R+E S+ ++  ISALKELCM EGL ++F ++P  S++   K E  
Sbjct: 752  MSFSTASESSRLLDPRLESSKKSMGSISALKELCMMEGLGVEFLSQPPLSSNSTQKEEIC 811

Query: 3158 AQVEIGGQVFGNGNGSTLDXXXXXXXXXXLMNLKVKLGLDTHKRIGSPRSLQAAVPNKRF 3337
            AQVEI GQV G G GST D          L +LK  LG  + KR GSPRSLQ     KR 
Sbjct: 812  AQVEIDGQVLGKGTGSTWDDAKMQAAEKALGSLKSMLGQFSQKRQGSPRSLQGM--GKRL 869

Query: 3338 KPEFPRVLQRIPPSARY 3388
            K EF R LQR P S RY
Sbjct: 870  KSEFTRGLQRTPSSGRY 886


>ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Solanum tuberosum]
          Length = 953

 Score = 1013 bits (2620), Expect = 0.0
 Identities = 572/985 (58%), Positives = 681/985 (69%), Gaps = 12/985 (1%)
 Frame = +2

Query: 470  MFKSMV--YEGNSLLGEVEIYPQNQQMDIIMLNKEIRISTYSQPSERCPPLAVLHTIAPN 643
            MFKS V  YEG  L+GEVEIY +     ++   K IRIS YS  SERCPPLAVLHT+   
Sbjct: 1    MFKSTVVLYEGERLVGEVEIYCEK---GVLWGEKVIRISHYSPSSERCPPLAVLHTVT-T 56

Query: 644  GVSFKMEASKSQ--SEDSLLYTLHYSLLGENKTAVMGVGEEELHLVAMPSRRNPSHGACF 817
            G+SFK+E +KS+  ++DS L  LH + L +NKTAVM +G EELHLVAM S+       CF
Sbjct: 57   GLSFKLEPTKSKPLTQDSPLTLLHSTCLRDNKTAVMSLGREELHLVAMQSKNIGGQCPCF 116

Query: 818  WGFYAPQGLYNSSLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKITTEMDPQR 997
            WGF    GLY+S L MLNLRCLGIVFDLDETLIVANTMRSFEDRI+ALQRKI +E DPQR
Sbjct: 117  WGFKVASGLYDSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINSESDPQR 176

Query: 998  VSGMLQEIKRYQDDKIILKQYVETDQIVENGRLIKVQSEVFPALSDNHQPIVRPLIRLQE 1177
             S ML E+KRYQ+DKIILKQY E DQ+V+NG++IK QSEVFPALSDNHQPIVRPLIRLQ+
Sbjct: 177  ASVMLAEVKRYQEDKIILKQYAENDQVVDNGKVIKSQSEVFPALSDNHQPIVRPLIRLQD 236

Query: 1178 KNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLL 1357
            +NIILTRINP+IRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLL
Sbjct: 237  RNIILTRINPMIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLL 296

Query: 1358 DPESNLISSNELLNRIVCVKAGLRKSLLNVFHGGICDPKMALVIDDRLKVWDEIDQPRVH 1537
            DP+SNLI+S ELL+RIVCVK+GLRKSL NVF  G C PKMALVIDDRLKVWD+ DQPRVH
Sbjct: 297  DPDSNLINSQELLDRIVCVKSGLRKSLFNVFQDGNCHPKMALVIDDRLKVWDDKDQPRVH 356

Query: 1538 VVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKDFDESLLQRIIGIFYEDDVEEG 1717
            VVPAFAPY+APQAE NN VPVLCVARNVACNVRGGFFKDFDE LLQRI  + YEDD+++ 
Sbjct: 357  VVPAFAPYFAPQAEGNNSVPVLCVARNVACNVRGGFFKDFDEGLLQRISEVAYEDDIKQV 416

Query: 1718 PSPPDVSNFLISEDDGTA--ANRGPLRFEGMADVEVERKLKDIMPA-FSIPN---NIDSR 1879
            PS PDVSN+LISEDD +A   N+  L F+GMAD EVER+LK+ M A  S+P+   N+D R
Sbjct: 417  PSAPDVSNYLISEDDPSAVNGNKDSLGFDGMADSEVERRLKEAMLASTSVPSQMTNLDPR 476

Query: 1880 FSTLQQFVGSSSNPTVPQPSPQGSVMPFHDXXXXXXXXXXXSFNPVNRVGPSESSLQNSP 2059
                 Q+      P + QPS Q  V+PF             S   V ++ P ++SLQ+SP
Sbjct: 477  LVPALQY---PVPPVISQPSIQSPVVPFPTQHLPQVTSVLKS--SVTQISPQDTSLQSSP 531

Query: 2060 AREEGEVPESELDPDTRRRLLILQHGQDVREHPPKESXXXXXXXXXXXXXXXXIQSRGNW 2239
            AREEGEVPESELDPDTRRRLLILQHGQD R+    E                 +Q  G W
Sbjct: 532  AREEGEVPESELDPDTRRRLLILQHGQDTRDQVSSE-PKFPMGTPLQVSVPPRVQPHG-W 589

Query: 2240 FPSEEETSPPKQNRALPKQVQKDFSVESEAMHFEKSRPQRSSSF-QGTGAVQSDRTFHEN 2416
            FP+EEE SP + NR LP    K+F +  E+MH  K RP       +   ++ SDR   EN
Sbjct: 590  FPAEEEMSPRQLNRPLP---PKEFPLNPESMHINKHRPPHPPFLPKMETSMPSDRVLFEN 646

Query: 2417 LRLPKEVHIGDDRAKLSRTHPKYQSFSGEETPLGRSASNNKLSPSPKDEEMPLAQVAPSS 2596
             RLPKEV   DDR + S++ P ++   GEE PLGRS+S+N++                  
Sbjct: 647  QRLPKEVIPRDDRMRFSQSQPSFRP-PGEEVPLGRSSSSNRV------------------ 687

Query: 2597 KDLQTESGRGTSQYPDTPAKVLQDIAIKCGSKVVFRSALVGSAELQFSIEVWFSGEKIGE 2776
              L  E G     Y +TPA  LQDIA KCG+KV FRS+ + S ELQFS+EV F+GEK+GE
Sbjct: 688  --LDLEPGH-YDPYLETPAGALQDIAFKCGAKVEFRSSFLSSPELQFSLEVLFAGEKVGE 744

Query: 2777 GIGKTRKEAQRQAAERSLRTLANRYLINTKPDPTMVYGDPNKPSNVDMNGYINDSNSFHQ 2956
            G G+TR+EAQR+AAE SL  LA++YL   KPD +   GD  +  N   NG++++ +    
Sbjct: 745  GTGRTRREAQRRAAEESLMYLADKYLSCIKPDSSSTQGDGFRFPNASDNGFVDNMS---- 800

Query: 2957 PFLNEDSVPASSTSEPSRYLNMRMEESQSNLDPISALKELCMDEGLKLDFRTRPTSSTSL 3136
            PF  +D V  S  SEP R L+ R+E  + ++  + AL+ELC  EGL L F+T+P  S + 
Sbjct: 801  PFGYQDRVSHSFASEPPRVLDPRLEVFKKSVGSVGALRELCAIEGLGLAFQTQPQLSANP 860

Query: 3137 NYKGEAYAQVEIGGQVFGNGNGSTLDXXXXXXXXXXLMNLKVKLGLDTHKRIGSPRSLQA 3316
              K E YAQVEI GQVFG G GST D          L+ LK +L   + KR GSPRSLQ 
Sbjct: 861  GQKSEIYAQVEIDGQVFGKGIGSTWDDAKTQAAERALVALKSELAQFSQKRQGSPRSLQQ 920

Query: 3317 AVPNKRFKPEFPR-VLQRIPPSARY 3388
               NKR KPE+ R V QR+P S R+
Sbjct: 921  GFSNKRLKPEYSRGVQQRVPLSGRF 945


>ref|XP_003545893.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
          Length = 958

 Score = 1005 bits (2598), Expect = 0.0
 Identities = 556/987 (56%), Positives = 686/987 (69%), Gaps = 14/987 (1%)
 Frame = +2

Query: 470  MFKSMVYEGNSLLGEVEIYPQNQQ-MDIIMLNKEIRISTYSQPSERCPPLAVLHTIAPNG 646
            M +SMVY G   +GEVEIYP+ ++ +D+    KEIRIS +SQPSERCPPLAVLHTI   G
Sbjct: 1    MKRSMVYHGEMEVGEVEIYPEEKKNIDL----KEIRISHFSQPSERCPPLAVLHTITSFG 56

Query: 647  VSFKMEASKSQS--EDSLLYTLHYSLLGENKTAVMGVGEEELHLVAMPSRRNPSHGACFW 820
            + FKME+S SQ+  +  +L+ LH S + ENKTAVM +  EE+HLVAM SR N     CFW
Sbjct: 57   ICFKMESSTSQTRQQQDVLFHLHSSCIRENKTAVMPLRGEEIHLVAMYSRNNDR--PCFW 114

Query: 821  GFYAPQGLYNSSLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKITTEMDPQRV 1000
            GF    GLYNS L MLNLRCLGIVFDLDETL+VANTMRSFED+I+ L RK+ +E++PQR+
Sbjct: 115  GFIVASGLYNSCLTMLNLRCLGIVFDLDETLVVANTMRSFEDKIEVLHRKMNSEVNPQRI 174

Query: 1001 SGMLQEIKRYQDDKIILKQYVETDQIVENGRLIKVQSEVFPALSDNHQPIVRPLIRLQEK 1180
            S M  EIKRY DDK ILK+Y E DQ+V+NG++IK+QSE+ PALSD+HQPIVRPLIRLQEK
Sbjct: 175  STMQAEIKRYLDDKNILKEYAENDQVVDNGKVIKIQSEIVPALSDSHQPIVRPLIRLQEK 234

Query: 1181 NIILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLD 1360
            NIILTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYALEMWRLLD
Sbjct: 235  NIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLD 294

Query: 1361 PESNLISSNELLNRIVCVKAGLRKSLLNVFHGGICDPKMALVIDDRLKVWDEIDQPRVHV 1540
            PE NLI+S ELL+RIVCVK+GL+KSL NVF  G+C  KMALVIDDRLKVWDE DQP+VHV
Sbjct: 295  PELNLINSKELLDRIVCVKSGLKKSLFNVFQNGLCHLKMALVIDDRLKVWDEKDQPQVHV 354

Query: 1541 VPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKDFDESLLQRIIGIFYEDDVEEGP 1720
            VPAFAPYYAPQAEA+N VP LC+AR+VACNVRGGFFKDFD+ LLQ+I  I YEDD+++ P
Sbjct: 355  VPAFAPYYAPQAEASNAVPTLCLARSVACNVRGGFFKDFDDGLLQKIPLIAYEDDIKDIP 414

Query: 1721 SPPDVSNFLISEDDGTAA--NRGPLRFEGMADVEVERKLKDIMPAFS----IPNNIDSR- 1879
            SPPDVSN+L+SEDD +A+  N+  L F+GMAD EVER+LKD + A S    +  N+D R 
Sbjct: 415  SPPDVSNYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTVPAMTTNLDPRL 474

Query: 1880 -FSTLQQFVGSSSNPTVPQPSPQGSVMPFHDXXXXXXXXXXXSFNPVNRVGPSESSLQNS 2056
             F++  Q+   SS+ TVP P+ Q S++ F +              P+ +V P   SL +S
Sbjct: 475  AFNSSLQYTMVSSSGTVPPPTAQASIVQFGNVQFPQPNTL---VKPICQVTPPGPSLHSS 531

Query: 2057 PAREEGEVPESELDPDTRRRLLILQHGQDVREHPPKESXXXXXXXXXXXXXXXXIQSRGN 2236
            PAREEGEVPESELD DTRRRLLILQHGQD REH   E                 + SR  
Sbjct: 532  PAREEGEVPESELDLDTRRRLLILQHGQDTREHTSSE--PPLPVRHPTQVSAPSVPSRRG 589

Query: 2237 WFPSEEETSPPKQNRALPKQVQKDFSVESEAMHFEKSRPQRSSSFQGT-GAVQSDRTFHE 2413
            WF  EEE  P + N+ +P    K+F V SE +H EK  P+  S F     +V SDR FHE
Sbjct: 590  WFSVEEEMGPQQLNQLVP----KEFPVGSEPLHIEKRWPRHPSLFSKVDDSVSSDRVFHE 645

Query: 2414 -NLRLPKEVHIGDDRAKLSRTHPKYQSFSGEETPLGRSASNNKLSPSPKDEEMPLAQVAP 2590
             + RLPKEVH  DD ++LS++   Y SF G++ PL  S+ +N                  
Sbjct: 646  SHQRLPKEVHHRDDHSRLSQSLSSYHSFPGDDIPLSGSSYSN------------------ 687

Query: 2591 SSKDLQTESGRGTSQYPDTPAKVLQDIAIKCGSKVVFRSALVGSAELQFSIEVWFSGEKI 2770
              +D  +ESGR    + D  A VLQ+IA+KCG+KV F S+LV S  LQFSIE WF+G+K+
Sbjct: 688  --RDFDSESGRSLF-HADITAGVLQEIALKCGTKVEFLSSLVASTALQFSIEAWFAGKKV 744

Query: 2771 GEGIGKTRKEAQRQAAERSLRTLANRYLINTKPDPTMVYGDPNKPSNVDMNGYINDSNSF 2950
            GEG G+TR+EAQ +AAE S++ LA+ Y+ + K D    YGD +     + NG+++  NS 
Sbjct: 745  GEGFGRTRREAQNKAAECSIKQLADIYMSHAKDDSGSTYGDVSGFHGSNNNGFVSSGNSL 804

Query: 2951 HQPFLNEDSVPASSTSEPSRYLNMRMEESQSNLDPISALKELCMDEGLKLDFRTRPT-SS 3127
                L ++SV  S++S+ SR  + R+E S+ + D ISALKE CM EGL  +F++ P  +S
Sbjct: 805  GNQLLPKESVSFSTSSDSSRVSDPRLEVSKRSTDSISALKEFCMMEGLAANFQSSPAPAS 864

Query: 3128 TSLNYKGEAYAQVEIGGQVFGNGNGSTLDXXXXXXXXXXLMNLKVKLGLDTHKRIGSPRS 3307
            T    K E +AQVEI GQ+FG G G T +          L +L+      T KR GSPRS
Sbjct: 865  THFAQKDEVHAQVEIDGQIFGKGFGLTWEEAKMQAAKKALESLRTMFNQGTRKRHGSPRS 924

Query: 3308 LQAAVPNKRFKPEFPRVLQRIPPSARY 3388
            +Q  + NKR K E+PR LQRIP SARY
Sbjct: 925  MQ-GLANKRLKQEYPRTLQRIPYSARY 950


>ref|XP_004232844.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Solanum lycopersicum]
          Length = 954

 Score = 1004 bits (2596), Expect = 0.0
 Identities = 569/986 (57%), Positives = 678/986 (68%), Gaps = 13/986 (1%)
 Frame = +2

Query: 470  MFKSMV--YEGNSLLGEVEIYPQNQQMDIIMLNKEIRISTYSQPSERCPPLAVLHTIAPN 643
            MFKS V  YEG  L+GEVE+Y +     ++   K IRIS YS  SERCPPLAVLHT+   
Sbjct: 1    MFKSTVLLYEGERLVGEVEMYGEK---GVVWGEKLIRISHYSPSSERCPPLAVLHTVT-T 56

Query: 644  GVSFKMEASKSQ--SEDSLLYTLHYSLLGENKTAVMGVGEEELHLVAMPSRRNPSHGACF 817
            G+SFK+E +KS+  ++DS L  LH + L +NKTAVM +G EELHLVAM S+       CF
Sbjct: 57   GLSFKLEPTKSKPLTQDSPLTLLHSTCLRDNKTAVMSLGREELHLVAMQSKNIGGQCPCF 116

Query: 818  WGFYAPQGLYNSSLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKITTEMDPQR 997
            WGF    GLY+S L MLNLRCLGIVFDLDETLIVANTMRSFEDRI+ALQRKI +E DPQR
Sbjct: 117  WGFKVASGLYDSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINSESDPQR 176

Query: 998  VSGMLQEIKRYQDDKIILKQYVETDQIVENGRLIKVQSEVFPALSDNHQPIVRPLIRLQE 1177
             S ML E+KRYQ+DKIILKQY E DQ+V+NG++I+ QSEVFPALSDNHQPIVRPLIRLQ+
Sbjct: 177  ASVMLAEVKRYQEDKIILKQYAENDQVVDNGKVIRSQSEVFPALSDNHQPIVRPLIRLQD 236

Query: 1178 KNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLL 1357
            +NIILTRINP+IRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLL
Sbjct: 237  RNIILTRINPMIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLL 296

Query: 1358 DPESNLISSNELLNRIVCVKAGLRKSLLNVFHGGICDPKMALVIDDRLKVWDEIDQPRVH 1537
            DP+SNLI+S ELL+RIVCVK+GLRKSL NVF  G C PKMALVIDDRLKVWD+ DQPRVH
Sbjct: 297  DPDSNLINSQELLDRIVCVKSGLRKSLFNVFQDGNCHPKMALVIDDRLKVWDDKDQPRVH 356

Query: 1538 VVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKDFDESLLQRIIGIFYEDDVEEG 1717
            VVPAFAPY+APQAE NN VPVLCVARNVACNVRGGFFKDFDE LLQRI  + YEDD+++ 
Sbjct: 357  VVPAFAPYFAPQAEGNNSVPVLCVARNVACNVRGGFFKDFDEGLLQRISEVAYEDDIKQV 416

Query: 1718 PSPPDVSNFLISEDDGTA--ANRGPLRFEGMADVEVERKLKDIMPA-FSIPN---NIDSR 1879
            PS PDVSN+LISEDD +A   N+  L F+GMAD EVER+LK+ M A  S+P+   N+D R
Sbjct: 417  PSAPDVSNYLISEDDPSAVNGNKDSLGFDGMADSEVERRLKEAMLASTSVPSQMTNLDPR 476

Query: 1880 FSTLQQFVGSSSNPTVPQPSPQGSVMPFHDXXXXXXXXXXXSFNPVNRVGPSESSLQNSP 2059
                 Q+      P + QPS QG V+PF             S   V ++ P ++SLQ+SP
Sbjct: 477  LVPALQY---PVPPVISQPSIQGPVVPFPTQHLPQVTSVLKS--SVTQISPQDTSLQSSP 531

Query: 2060 AREEGEVPESELDPDTRRRLLILQHGQDVREHPPKESXXXXXXXXXXXXXXXXIQSRGNW 2239
            AREEGEVPESELDPDTRRRLLILQHGQD R+    E                 +Q  G W
Sbjct: 532  AREEGEVPESELDPDTRRRLLILQHGQDTRDQVSSE-PKFPIGTPLQVSVPPRVQPHG-W 589

Query: 2240 FPSEEETSPPKQNRALPKQVQKDFSVESEAMHFEKSRPQRSSSF-QGTGAVQSDRTFHEN 2416
            FP+EEE SP + NR LP    K+F +  E+MH  K RP       +   ++ SDR F EN
Sbjct: 590  FPAEEEVSPRQLNRPLP---PKEFPLNPESMHINKHRPPHPPFLPKMETSMPSDRVFFEN 646

Query: 2417 LRLPKEVHIGDDRAKLSRTHPKYQSFSGEETPLGRSASNNKLSPSPKDEEMPLAQVAPSS 2596
             RLPKEV   DDR + S++ P ++   GE+  LGRS+S+N++                  
Sbjct: 647  QRLPKEVIPRDDRMRFSQSQPSFRP-PGEDVSLGRSSSSNRV------------------ 687

Query: 2597 KDLQTESGRGTSQYPDTPAKVLQDIAIKCGSKVVFRSALVGSAELQFSIEVWFSGEKIGE 2776
              L  + G     Y DTPA  LQDIA KCG KV FRS+ + S ELQF +EV F+GEK+GE
Sbjct: 688  --LDLDPGH-YDPYLDTPAGALQDIAFKCGVKVEFRSSFLSSPELQFCLEVLFAGEKVGE 744

Query: 2777 GIGKTRKEAQRQAAERSLRTLANRYLINTKPDPTMVYGDPNKPSNVDMNGYINDSNSFHQ 2956
            GIG+TR+EAQR AAE SL  LA++YL   K D +   GD  +  N   NG++ + +    
Sbjct: 745  GIGRTRREAQRHAAEESLMYLADKYLSCIKADSSSTQGDGFRFPNASDNGFVENMS---- 800

Query: 2957 PFLNEDSVPASSTSEPSRYLNMRMEESQSNLDPISALKELCMDEGLKLDFRTRPTSSTSL 3136
            PF  +D V  S  SEP R L+ R+E  + ++  + AL+ELC  EGL L F+T+P  S + 
Sbjct: 801  PFGYQDRVSHSFASEPPRVLDPRLEVFKKSVGSVGALRELCAIEGLGLAFQTQPQLSVNP 860

Query: 3137 NYKGEAYAQVEIGGQVFGNGNGSTLDXXXXXXXXXXLMNLKVKLGLDTHKRIGSPRSL-Q 3313
              K E YAQVEI GQVFG G G T D          L+ LK +L   +HKR GSPRSL Q
Sbjct: 861  GQKSEIYAQVEIDGQVFGKGIGPTWDDAKTQAAERALVALKSELAQFSHKRQGSPRSLQQ 920

Query: 3314 AAVPNKRFKPEFPR-VLQRIPPSARY 3388
                NKR KPE+ R V QR+P S R+
Sbjct: 921  QGFSNKRLKPEYSRGVQQRVPLSGRF 946


>ref|XP_004505032.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Cicer arietinum]
          Length = 951

 Score =  991 bits (2561), Expect = 0.0
 Identities = 557/985 (56%), Positives = 676/985 (68%), Gaps = 12/985 (1%)
 Frame = +2

Query: 470  MFKSMVYEGNSLLGEVEIYPQ--NQQMDIIMLNKEIRISTYSQPSERCPPLAVLHTIAPN 643
            M+KS+VY+G  +LGEV+IYP+  N   +     KEIRIS ++QPSERC PLAVLHTI  +
Sbjct: 1    MYKSLVYQGEVVLGEVDIYPEVNNNNKNF----KEIRISHFTQPSERCLPLAVLHTITSS 56

Query: 644  GVSFKMEASKSQSEDSLLYTLHYSLLGENKTAVMGVGEEELHLVAMPSRRNPSHGACFWG 823
            GV FKME SK+Q +D L + LH     ENKTAVM +  EE+HLVAM SR N     CFWG
Sbjct: 57   GVCFKME-SKTQQQDPLFH-LHNLCFRENKTAVMPLCGEEMHLVAMHSRSNGR--PCFWG 112

Query: 824  FYAPQGLYNSSLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKITTEMDPQRVS 1003
            +    GLYNS L+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKI +E+DPQR+S
Sbjct: 113  YIVGMGLYNSCLMMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRIS 172

Query: 1004 GMLQEIKRYQDDKIILKQYVETDQIVENGRLIKVQSEVFPALSDNHQPIVRPLIRLQEKN 1183
            GM  E+KRY +DK ILKQYVE DQ+V+NG+++K QSE+ PALSD+HQPIVRPLIRL EKN
Sbjct: 173  GMQAEVKRYLEDKSILKQYVENDQVVDNGKVLKAQSELVPALSDSHQPIVRPLIRLHEKN 232

Query: 1184 IILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP 1363
            IILTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP
Sbjct: 233  IILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP 292

Query: 1364 ESNLISSNELLNRIVCVKAGLRKSLLNVFHGGICDPKMALVIDDRLKVWDEIDQPRVHVV 1543
            +SNLI+S ELL RIVCVK+GL+KSL NVF  G+C PKMALVIDDRLKVWDE DQPRVHVV
Sbjct: 293  DSNLINSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVV 352

Query: 1544 PAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKDFDESLLQRIIGIFYEDDVEEGPS 1723
            PAFAPYYAPQAEA+N +PVLCVARNVACNVRGGFFKDFD+ LLQ+I  I YE++  +   
Sbjct: 353  PAFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKISQIAYENNTRDISP 412

Query: 1724 PPDVSNFLISEDDGTA--ANRGPLRFEGMADVEVERKLKD-IMPAFSIP---NNIDSRFS 1885
             PDVSN+L+SEDDG+A  ANR P  F+GMAD EVERKLKD I  A +IP     +D R +
Sbjct: 413  APDVSNYLVSEDDGSASYANRDPFAFDGMADAEVERKLKDAISAASAIPMTTAKLDPRLT 472

Query: 1886 TLQQFVGSSSNPTVPQPSPQGSVMPFHDXXXXXXXXXXXSFNPVNRVGPSESSLQNSPAR 2065
            +  Q+   S    +P P+ Q S++P                 P+ +V PSE SL +SPAR
Sbjct: 473  SSLQYTMVSPGSVLP-PAAQASMIPL---PHTQFPQPATLVKPIGQVAPSELSLHSSPAR 528

Query: 2066 EEGEVPESELDPDTRRRLLILQHGQDVREHPPKESXXXXXXXXXXXXXXXXIQSRGNWFP 2245
            EEGEVPESELDPDTRRRLLILQHGQD R+H   E                 +  RG WFP
Sbjct: 529  EEGEVPESELDPDTRRRLLILQHGQDNRDHTSSE---PPFPLKHPVQVSARVPPRGGWFP 585

Query: 2246 SEEETSPPKQNRALPKQVQKDFSVESEAMHFEKSRPQRSSSF-QGTGAVQSDRTFHE-NL 2419
             EEE      NR +PK++    +++S     EK R  +   F +  G++ SDR  HE N 
Sbjct: 586  VEEEIGSQPPNRVIPKEI----ALDSGPSRIEKHRLHQQPFFPKVDGSISSDRALHETNQ 641

Query: 2420 RLPKEVHIGDDRAKLSRTHPKYQSFSGEETPLGRSASNNKLSPSPKDEEMPLAQVAPSSK 2599
            RLPKE++  DDR+++S     Y S SG++TP GRS+S                    S +
Sbjct: 642  RLPKEMYHRDDRSRVSHMLSSYPSLSGDDTPFGRSSS--------------------SHR 681

Query: 2600 DLQTESGRGTSQYPDTPAKVLQDIAIKCGSKVVFRSALVGSAELQFSIEVWFSGEKIGEG 2779
            D  +ESG       +TPA VLQ+IA+KCG+KV F S+L  S ELQFSIE WFSG+KIG G
Sbjct: 682  DFDSESGHSVFN-AETPAIVLQEIALKCGTKVEFTSSLAASRELQFSIEAWFSGKKIGHG 740

Query: 2780 IGKTRKEAQRQAAERSLRTLANRYLINTKPDPTMVYGDPNKPSNVDMNGYINDSNSF-HQ 2956
             G+TR EAQ +AAE S++ LA+ YL   K +    +GD +   N + NGY+ + +S  +Q
Sbjct: 741  FGRTRMEAQYKAAEDSIKHLADIYLSRAKDESGSAFGDVSGFPNANDNGYVGNVSSLGNQ 800

Query: 2957 PFLNEDSVPASSTSEPSRYLNMRMEESQSNLDPISALKELCMDEGLKLDFRTRPTSSTSL 3136
            P   E+SV  S+ S+PSR L+ R++ S+ ++  +SALKELCM EGL ++F + P +  S 
Sbjct: 801  PLPKEESVSFSAASDPSRVLDPRLDVSKRSMGSVSALKELCMVEGLGVNFLSLP-APVST 859

Query: 3137 NYKGEAYAQVEIGGQVFGNGNGSTLDXXXXXXXXXXLMNLKVKL-GLDTHKRIGSPRSLQ 3313
            N   E +AQVEI GQV+G G G T D          L +L+  + G    +R  SPR  Q
Sbjct: 860  NSVDEVHAQVEIDGQVYGKGTGITWDEAKMQAAEKALGSLRTTIHGQGIQRRQLSPRPFQ 919

Query: 3314 AAVPNKRFKPEFPRVLQRIPPSARY 3388
              + NKR K E PR LQR   S RY
Sbjct: 920  -GLSNKRLKQEHPRTLQRFASSGRY 943


>ref|XP_003543063.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
            gi|571500215|ref|XP_006594604.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 1-like
            isoform X2 [Glycine max]
          Length = 960

 Score =  990 bits (2560), Expect = 0.0
 Identities = 556/985 (56%), Positives = 676/985 (68%), Gaps = 15/985 (1%)
 Frame = +2

Query: 479  SMVYEGNSLLGEVEIYPQ-NQQMDIIMLNKEIRISTYSQPSERCPPLAVLHTIAPNGVSF 655
            SMVY G   +GEV+IYP+ N+ MD+    KEIRIS +SQPSERCPPLAVLHTI   G+ F
Sbjct: 4    SMVYHGEMAVGEVKIYPEENKNMDL----KEIRISHFSQPSERCPPLAVLHTITSFGICF 59

Query: 656  KMEASKSQS--EDSLLYTLHYSLLGENKTAVMGVGEEELHLVAMPSRRNPSHGACFWGFY 829
            KME+S SQ   +   L+ LH S + ENKTAVM V  EE+HLVAM SR N     CFWGF 
Sbjct: 60   KMESSTSQKRQQQDALFHLHSSCIRENKTAVMPVRGEEIHLVAMYSRNNDR--PCFWGFI 117

Query: 830  APQGLYNSSLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKITTEMDPQRVSGM 1009
               GLYNS L MLNLRCLGIVFDLDETL+VANTMRSFED+I+ L RK+ +E++PQ++S M
Sbjct: 118  VASGLYNSCLTMLNLRCLGIVFDLDETLVVANTMRSFEDKIEVLHRKMNSEVNPQQISAM 177

Query: 1010 LQEIKRYQDDKIILKQYVETDQIVENGRLIKVQSEVFPALSDNHQPIVRPLIRLQEKNII 1189
              EIKRY DDK ILK+Y E DQ+V+NG++IK+QSE  PALSD+HQPIVRPLIRLQEKNII
Sbjct: 178  QAEIKRYLDDKNILKEYAENDQVVDNGKVIKIQSESVPALSDSHQPIVRPLIRLQEKNII 237

Query: 1190 LTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPES 1369
            LTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYALEMWRLLDPE 
Sbjct: 238  LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPEL 297

Query: 1370 NLISSNELLNRIVCVKAGLRKSLLNVFHGGICDPKMALVIDDRLKVWDEIDQPRVHVVPA 1549
            NLI+S ELL+RIVCVK+GL+KSL NVF  G+C  KMALVIDDRLKVWDE DQPRVHVVPA
Sbjct: 298  NLINSKELLDRIVCVKSGLKKSLFNVFQNGLCHLKMALVIDDRLKVWDEKDQPRVHVVPA 357

Query: 1550 FAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKDFDESLLQRIIGIFYEDDVEEGPSPP 1729
            FAPYY PQAEA+N VP LC+ARNVACNVRGGFFKDFD+ LLQ+I  I YEDD+++ PS P
Sbjct: 358  FAPYYTPQAEASNAVPFLCLARNVACNVRGGFFKDFDDGLLQKIPLIAYEDDIKDIPS-P 416

Query: 1730 DVSNFLISEDDGTAA--NRGPLRFEGMADVEVERKLKDIMPAFS----IPNNIDSR--FS 1885
            DVSN+L+SEDD +A+  N+  L F+GMAD EVER+LKD + A S    +  NID R  F+
Sbjct: 417  DVSNYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTILALTANIDPRLAFT 476

Query: 1886 TLQQFVGSSSNPTVPQPSPQGSVMPFHDXXXXXXXXXXXSFNPVNRVGPSESSLQNSPAR 2065
            +  Q+   SS+ TVP P+ Q SV+ F +              P+++V     SL +SPAR
Sbjct: 477  SSLQYTMVSSSGTVPPPTAQASVVQFGNVQFPQPNTL---VKPMSQVTHPGLSLHSSPAR 533

Query: 2066 EEGEVPESELDPDTRRRLLILQHGQDVREHPPKESXXXXXXXXXXXXXXXXIQSRGNWFP 2245
            EEGE+PESELD DTRRR LILQHGQD RE    E                 + SR  WF 
Sbjct: 534  EEGELPESELDLDTRRRFLILQHGQDTRERMASEPPFPVRHPAQVSAPASSVPSRRGWFS 593

Query: 2246 SEEETSPPKQNRALPKQVQKDFSVESEAMHFEKSRPQRSSSFQGTG-AVQSDRTFHE-NL 2419
             EEE  P + N  +P    K+F V+SE  H EK  P+  S F   G ++ SDR FHE + 
Sbjct: 594  VEEEMGPQQLNLPVP----KEFPVDSEPFHIEKRWPRHPSFFSKVGDSISSDRVFHESHQ 649

Query: 2420 RLPKEVHIGDDRAKLSRTHPKYQSFSGEETPLGRSASNNKLSPSPKDEEMPLAQVAPSSK 2599
            RLPKEVH  DDR++LS++   Y S  G++ PL  S+ +N                    +
Sbjct: 650  RLPKEVHHRDDRSRLSQSLSSYHSLPGDDIPLSGSSYSN--------------------R 689

Query: 2600 DLQTESGRGTSQYPDTPAKVLQDIAIKCGSKVVFRSALVGSAELQFSIEVWFSGEKIGEG 2779
            D  +ESGR    + DT A VLQ+IA+ CG+KV F S+LV S ELQFSIE WF+G+KIGEG
Sbjct: 690  DFDSESGRSLF-HADTTAGVLQEIALNCGTKVEFLSSLVASTELQFSIEAWFAGKKIGEG 748

Query: 2780 IGKTRKEAQRQAAERSLRTLANRYLINTKPDPTMVYGDPNKPSNVDMNGYINDSNSFHQP 2959
             G+TR+EAQ +AA  S++ LA+ Y+ + K D    YGD +     + +G+++  NS    
Sbjct: 749  FGRTRREAQSKAAGCSIKQLADIYMSHAKDDSGSTYGDVSGFHGSNNDGFVSSGNSLGNQ 808

Query: 2960 FL-NEDSVPASSTSEPSRYLNMRMEESQSNLDPISALKELCMDEGLKLDFRTRPTS-STS 3133
             L  E+S   S+ SE SR  + R+E S+ + D ISALKELCM EGL   F++ P S ST 
Sbjct: 809  LLPKEESGSFSTASESSRVSDSRLEVSKRSTDSISALKELCMMEGLAASFQSPPASASTH 868

Query: 3134 LNYKGEAYAQVEIGGQVFGNGNGSTLDXXXXXXXXXXLMNLKVKLGLDTHKRIGSPRSLQ 3313
            L  K E +AQVEI GQ+FG G G T +          L +L+      + KR GSPRS+Q
Sbjct: 869  LTQKDEVHAQVEIDGQIFGKGFGVTWEEAKMQAAKKALGSLRTMFNQGSLKRHGSPRSMQ 928

Query: 3314 AAVPNKRFKPEFPRVLQRIPPSARY 3388
              + NKR KPE+P  LQR+P SARY
Sbjct: 929  -GLANKRLKPEYPPTLQRVPYSARY 952


>ref|XP_006597420.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X2 [Glycine max]
          Length = 937

 Score =  979 bits (2532), Expect = 0.0
 Identities = 544/985 (55%), Positives = 673/985 (68%), Gaps = 12/985 (1%)
 Frame = +2

Query: 470  MFKSMVYEGNSLLGEVEIYPQNQQ-MDIIMLNKEIRISTYSQPSERCPPLAVLHTIAPNG 646
            M +SMVY G   +GEVEIYP+ ++ +D+    KEIRIS +SQPSERCPPLAVLHTI   G
Sbjct: 1    MKRSMVYHGEMEVGEVEIYPEEKKNIDL----KEIRISHFSQPSERCPPLAVLHTITSFG 56

Query: 647  VSFKMEASKSQS--EDSLLYTLHYSLLGENKTAVMGVGEEELHLVAMPSRRNPSHGACFW 820
            + FKME+S SQ+  +  +L+ LH S + ENKTAVM +  EE+HLVAM SR N     CFW
Sbjct: 57   ICFKMESSTSQTRQQQDVLFHLHSSCIRENKTAVMPLRGEEIHLVAMYSRNNDR--PCFW 114

Query: 821  GFYAPQGLYNSSLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKITTEMDPQRV 1000
            GF    GLYNS L MLNLRCLGIVFDLDETL+VANTMRSFED+I+ L RK+ +E++PQR+
Sbjct: 115  GFIVASGLYNSCLTMLNLRCLGIVFDLDETLVVANTMRSFEDKIEVLHRKMNSEVNPQRI 174

Query: 1001 SGMLQEIKRYQDDKIILKQYVETDQIVENGRLIKVQSEVFPALSDNHQPIVRPLIRLQEK 1180
            S M  EIKRY DDK ILK+Y E DQ+V+NG++IK+QSE+ PALSD+HQPIVRPLIRLQEK
Sbjct: 175  STMQAEIKRYLDDKNILKEYAENDQVVDNGKVIKIQSEIVPALSDSHQPIVRPLIRLQEK 234

Query: 1181 NIILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLD 1360
            NIILTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYALEMWRLLD
Sbjct: 235  NIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLD 294

Query: 1361 PESNLISSNELLNRIVCVKAGLRKSLLNVFHGGICDPKMALVIDDRLKVWDEIDQPRVHV 1540
            PE NLI+S ELL+RIVCVK+GL+KSL NVF  G+C  KMALVIDDRLKVWDE DQP+VHV
Sbjct: 295  PELNLINSKELLDRIVCVKSGLKKSLFNVFQNGLCHLKMALVIDDRLKVWDEKDQPQVHV 354

Query: 1541 VPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKDFDESLLQRIIGIFYEDDVEEGP 1720
            VPAFAPYYAPQAEA+N VP LC+AR+VACNVRGGFFKDFD+ LLQ+I  I YEDD+++ P
Sbjct: 355  VPAFAPYYAPQAEASNAVPTLCLARSVACNVRGGFFKDFDDGLLQKIPLIAYEDDIKDIP 414

Query: 1721 SPPDVSNFLISEDDGTAA--NRGPLRFEGMADVEVERKLKDIMPAFS----IPNNIDSR- 1879
            SPPDVSN+L+SEDD +A+  N+  L F+GMAD EVER+LKD + A S    +  N+D R 
Sbjct: 415  SPPDVSNYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTVPAMTTNLDPRL 474

Query: 1880 -FSTLQQFVGSSSNPTVPQPSPQGSVMPFHDXXXXXXXXXXXSFNPVNRVGPSESSLQNS 2056
             F++  Q+   SS+ TVP P+ Q S++ F +              P+ +V P   SL +S
Sbjct: 475  AFNSSLQYTMVSSSGTVPPPTAQASIVQFGNVQFPQPNTL---VKPICQVTPPGPSLHSS 531

Query: 2057 PAREEGEVPESELDPDTRRRLLILQHGQDVREHPPKESXXXXXXXXXXXXXXXXIQSRGN 2236
            PAREEGEVPESELD DTRRRLLILQHGQD REH   E                 + SR  
Sbjct: 532  PAREEGEVPESELDLDTRRRLLILQHGQDTREHTSSE--PPLPVRHPTQVSAPSVPSRRG 589

Query: 2237 WFPSEEETSPPKQNRALPKQVQKDFSVESEAMHFEKSRPQRSSSFQGTGAVQSDRTFHEN 2416
            WF  EEE  P + N+ +P    K+F V SE +H EK  P+  S F               
Sbjct: 590  WFSVEEEMGPQQLNQLVP----KEFPVGSEPLHIEKRWPRHPSLF--------------- 630

Query: 2417 LRLPKEVHIGDDRAKLSRTHPKYQSFSGEETPLGRSASNNKLSPSPKDEEMPLAQVAPSS 2596
                 +VH  DD ++LS++   Y SF G++ PL  S+ +N                    
Sbjct: 631  ----SKVHHRDDHSRLSQSLSSYHSFPGDDIPLSGSSYSN-------------------- 666

Query: 2597 KDLQTESGRGTSQYPDTPAKVLQDIAIKCGSKVVFRSALVGSAELQFSIEVWFSGEKIGE 2776
            +D  +ESGR    + D  A VLQ+IA+KCG+KV F S+LV S  LQFSIE WF+G+K+GE
Sbjct: 667  RDFDSESGRSLF-HADITAGVLQEIALKCGTKVEFLSSLVASTALQFSIEAWFAGKKVGE 725

Query: 2777 GIGKTRKEAQRQAAERSLRTLANRYLINTKPDPTMVYGDPNKPSNVDMNGYINDSNSFHQ 2956
            G G+TR+EAQ +AAE S++ LA+ Y+ + K D    YGD +     + NG+++  NS   
Sbjct: 726  GFGRTRREAQNKAAECSIKQLADIYMSHAKDDSGSTYGDVSGFHGSNNNGFVSSGNSLGN 785

Query: 2957 PFLNEDSVPASSTSEPSRYLNMRMEESQSNLDPISALKELCMDEGLKLDFRTRPT-SSTS 3133
              L ++SV  S++S+ SR  + R+E S+ + D ISALKE CM EGL  +F++ P  +ST 
Sbjct: 786  QLLPKESVSFSTSSDSSRVSDPRLEVSKRSTDSISALKEFCMMEGLAANFQSSPAPASTH 845

Query: 3134 LNYKGEAYAQVEIGGQVFGNGNGSTLDXXXXXXXXXXLMNLKVKLGLDTHKRIGSPRSLQ 3313
               K E +AQVEI GQ+FG G G T +          L +L+      T KR GSPRS+Q
Sbjct: 846  FAQKDEVHAQVEIDGQIFGKGFGLTWEEAKMQAAKKALESLRTMFNQGTRKRHGSPRSMQ 905

Query: 3314 AAVPNKRFKPEFPRVLQRIPPSARY 3388
              + NKR K E+PR LQRIP SARY
Sbjct: 906  -GLANKRLKQEYPRTLQRIPYSARY 929


>ref|XP_006597421.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X3 [Glycine max]
          Length = 932

 Score =  979 bits (2530), Expect = 0.0
 Identities = 549/987 (55%), Positives = 674/987 (68%), Gaps = 14/987 (1%)
 Frame = +2

Query: 470  MFKSMVYEGNSLLGEVEIYPQNQQ-MDIIMLNKEIRISTYSQPSERCPPLAVLHTIAPNG 646
            M +SMVY G   +GEVEIYP+ ++ +D+    KEIRIS +SQPSERCPPLAVLHTI   G
Sbjct: 1    MKRSMVYHGEMEVGEVEIYPEEKKNIDL----KEIRISHFSQPSERCPPLAVLHTITSFG 56

Query: 647  VSFKMEASKSQS--EDSLLYTLHYSLLGENKTAVMGVGEEELHLVAMPSRRNPSHGACFW 820
            + FKME+S SQ+  +  +L+ LH S + ENKTAVM +  EE+HLVAM SR N     CFW
Sbjct: 57   ICFKMESSTSQTRQQQDVLFHLHSSCIRENKTAVMPLRGEEIHLVAMYSRNNDR--PCFW 114

Query: 821  GFYAPQGLYNSSLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKITTEMDPQRV 1000
            GF    GLYNS L MLNLRCLGIVFDLDETL+VANTMRSFED+I+ L RK+ +E++PQR+
Sbjct: 115  GFIVASGLYNSCLTMLNLRCLGIVFDLDETLVVANTMRSFEDKIEVLHRKMNSEVNPQRI 174

Query: 1001 SGMLQEIKRYQDDKIILKQYVETDQIVENGRLIKVQSEVFPALSDNHQPIVRPLIRLQEK 1180
            S M  EIKRY DDK ILK+Y E DQ+V+NG++IK+QSE+ PALSD+HQPIVRPLIRLQEK
Sbjct: 175  STMQAEIKRYLDDKNILKEYAENDQVVDNGKVIKIQSEIVPALSDSHQPIVRPLIRLQEK 234

Query: 1181 NIILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLD 1360
            NIILTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYALEMWRLLD
Sbjct: 235  NIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLD 294

Query: 1361 PESNLISSNELLNRIVCVKAGLRKSLLNVFHGGICDPKMALVIDDRLKVWDEIDQPRVHV 1540
            PE NLI+S ELL+RIVCVK+GL+KSL NVF  G+C  KMALVIDDRLKVWDE DQP+VHV
Sbjct: 295  PELNLINSKELLDRIVCVKSGLKKSLFNVFQNGLCHLKMALVIDDRLKVWDEKDQPQVHV 354

Query: 1541 VPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKDFDESLLQRIIGIFYEDDVEEGP 1720
            VPAFAPYYAPQAEA+N VP LC+AR+VACNVRGGFFKDFD+ LLQ+I  I YEDD+++ P
Sbjct: 355  VPAFAPYYAPQAEASNAVPTLCLARSVACNVRGGFFKDFDDGLLQKIPLIAYEDDIKDIP 414

Query: 1721 SPPDVSNFLISEDDGTAA--NRGPLRFEGMADVEVERKLKDIMPAFS----IPNNIDSR- 1879
            SPPDVSN+L+SEDD +A+  N+  L F+GMAD EVER+LKD + A S    +  N+D R 
Sbjct: 415  SPPDVSNYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTVPAMTTNLDPRL 474

Query: 1880 -FSTLQQFVGSSSNPTVPQPSPQGSVMPFHDXXXXXXXXXXXSFNPVNRVGPSESSLQNS 2056
             F++  Q+   SS+ TVP P+ Q S++ F +              P+ +V P   SL +S
Sbjct: 475  AFNSSLQYTMVSSSGTVPPPTAQASIVQFGNVQFPQPNTL---VKPICQVTPPGPSLHSS 531

Query: 2057 PAREEGEVPESELDPDTRRRLLILQHGQDVREHPPKESXXXXXXXXXXXXXXXXIQSRGN 2236
            PAREEGEVPESELD DTRRRLLILQHGQD REH   E                 + SR  
Sbjct: 532  PAREEGEVPESELDLDTRRRLLILQHGQDTREHTSSE--PPLPVRHPTQVSAPSVPSRRG 589

Query: 2237 WFPSEEETSPPKQNRALPKQVQKDFSVESEAMHFEKSRPQRSSSFQGT-GAVQSDRTFHE 2413
            WF  EEE  P + N+ +P    K+F V SE +H EK  P+  S F     +V SDR FHE
Sbjct: 590  WFSVEEEMGPQQLNQLVP----KEFPVGSEPLHIEKRWPRHPSLFSKVDDSVSSDRVFHE 645

Query: 2414 -NLRLPKEVHIGDDRAKLSRTHPKYQSFSGEETPLGRSASNNKLSPSPKDEEMPLAQVAP 2590
             + RLPKEVH  DD ++LS++   Y SF G++ PL  S+ +N                  
Sbjct: 646  SHQRLPKEVHHRDDHSRLSQSLSSYHSFPGDDIPLSGSSYSN------------------ 687

Query: 2591 SSKDLQTESGRGTSQYPDTPAKVLQDIAIKCGSKVVFRSALVGSAELQFSIEVWFSGEKI 2770
              +D  +ESGR    + D  A VLQ+IA+KCG+KV F S+LV S  LQFSIE WF+G+K+
Sbjct: 688  --RDFDSESGRSLF-HADITAGVLQEIALKCGTKVEFLSSLVASTALQFSIEAWFAGKKV 744

Query: 2771 GEGIGKTRKEAQRQAAERSLRTLANRYLINTKPDPTMVYGDPNKPSNVDMNGYINDSNSF 2950
            GEG G+TR+EAQ +AAE S++ LA+ Y+ + K D    YGD +     + NG++      
Sbjct: 745  GEGFGRTRREAQNKAAECSIKQLADIYMSHAKDDSGSTYGDVSGFHGSNNNGFV------ 798

Query: 2951 HQPFLNEDSVPASSTSEPSRYLNMRMEESQSNLDPISALKELCMDEGLKLDFRTRPT-SS 3127
                          +S+P      R+E S+ + D ISALKE CM EGL  +F++ P  +S
Sbjct: 799  --------------SSDP------RLEVSKRSTDSISALKEFCMMEGLAANFQSSPAPAS 838

Query: 3128 TSLNYKGEAYAQVEIGGQVFGNGNGSTLDXXXXXXXXXXLMNLKVKLGLDTHKRIGSPRS 3307
            T    K E +AQVEI GQ+FG G G T +          L +L+      T KR GSPRS
Sbjct: 839  THFAQKDEVHAQVEIDGQIFGKGFGLTWEEAKMQAAKKALESLRTMFNQGTRKRHGSPRS 898

Query: 3308 LQAAVPNKRFKPEFPRVLQRIPPSARY 3388
            +Q  + NKR K E+PR LQRIP SARY
Sbjct: 899  MQ-GLANKRLKQEYPRTLQRIPYSARY 924

