BLASTX nr result

ID: Catharanthus23_contig00009199 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00009199
         (4856 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006341905.1| PREDICTED: RNA polymerase II C-terminal doma...  1042   0.0  
ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal doma...  1033   0.0  
gb|EOX99661.1| RNA polymerase II C-terminal domain phosphatase-l...  1033   0.0  
ref|XP_004252660.1| PREDICTED: RNA polymerase II C-terminal doma...  1017   0.0  
ref|XP_002304648.2| hypothetical protein POPTR_0003s16280g [Popu...   999   0.0  
ref|XP_006438860.1| hypothetical protein CICLE_v10030535mg [Citr...   988   0.0  
emb|CBI35661.3| unnamed protein product [Vitis vinifera]              981   0.0  
gb|AAV92930.1| putative transcription regulator CPL1 [Solanum ly...   981   0.0  
gb|EXB81217.1| RNA polymerase II C-terminal domain phosphatase-l...   978   0.0  
ref|XP_002297869.2| CTD phosphatase-like protein 3 [Populus tric...   963   0.0  
ref|XP_006603006.1| PREDICTED: RNA polymerase II C-terminal doma...   949   0.0  
ref|XP_003530482.2| PREDICTED: RNA polymerase II C-terminal doma...   949   0.0  
ref|XP_002304714.2| hypothetical protein POPTR_0003s16280g [Popu...   949   0.0  
ref|XP_004140651.1| PREDICTED: RNA polymerase II C-terminal doma...   941   0.0  
ref|XP_004157633.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymera...   941   0.0  
ref|XP_004492028.1| PREDICTED: RNA polymerase II C-terminal doma...   936   0.0  
gb|ESW11309.1| hypothetical protein PHAVU_008G019000g [Phaseolus...   934   0.0  
ref|XP_004310239.1| PREDICTED: RNA polymerase II C-terminal doma...   929   0.0  
ref|XP_004492029.1| PREDICTED: RNA polymerase II C-terminal doma...   927   0.0  
ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative ...   925   0.0  

>ref|XP_006341905.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Solanum tuberosum]
          Length = 1218

 Score = 1042 bits (2695), Expect = 0.0
 Identities = 610/1214 (50%), Positives = 763/1214 (62%), Gaps = 28/1214 (2%)
 Frame = +3

Query: 627  RFW-MRDFLNYRISRSFNSGLHNLAWASGVQNKPITDFLVMEMPVTTAENNSSTDSNRVG 803
            R W MRD   Y ISR +  GL+NLAWA  VQNKP+ +  VM          +S +SN+  
Sbjct: 57   RVWTMRDAYKYPISRDYARGLYNLAWAQAVQNKPLDELFVM----------TSDNSNQCA 106

Query: 804  PDGLNKNNNERVVIEGDDEGXXXXXXXXXXXXXXXXXXXXXXXVVDADANDSN-SFGIRD 980
                N  +   + ++ DD+                         +D DA D   +FG   
Sbjct: 107  NANANVESKVIIDVDVDDDAKEEGELEEGE--------------IDLDAADLVLNFG--- 149

Query: 981  AGLEKGLDSILKGLEGITLEYAEKSFVEACSKLQNLFDSLQEMAEEGWLNKKEDLIQLSF 1160
                K  + + + L+ +TL+   KSF   CSKLQ    +L E+A     +K + LIQL  
Sbjct: 150  ----KEANFVREQLQSVTLDETHKSFSMVCSKLQTSLLALGELALSQ--DKNDILIQLFM 203

Query: 1161 AAIRTLNTVFCSMXXXXXXXXXXXIKRLLVDLLNQKHPVVSTEQLKEIEGIISSLDSFAV 1340
             A+RT+N+VF SM           + RLL     Q   ++S+EQLKE++ +I S++  AV
Sbjct: 204  TALRTINSVFYSMNQDQKQQNTDILSRLLFHAKTQLPALLSSEQLKEVDAVILSINQSAV 263

Query: 1341 CSGSKGNDINKEMQVAEGFSKNIIDISTGNVNQEL----------LKIKSSDQSEARTML 1490
             S ++ ND    ++V E   K +   S+ N NQ+           + IKSS   E     
Sbjct: 264  FSNTQDNDKVNGIKVVELLDKKVSHKSSENANQDFTAVNKYDLGAVSIKSSGLKEQSVSF 323

Query: 1491 DNLKYGGANSKYRGLSLPLLDLHKDHDADSLPSPTRETTPCLPIDKGFGMGHGVLKPEWP 1670
            +++K G ANSK +GLS+PLLDLHKDHD D+LPSPTRE  P  P+ K     HG++K + P
Sbjct: 324  ESVKPGLANSKAKGLSIPLLDLHKDHDEDTLPSPTREIGPQFPVAKAT-QAHGMVKLDLP 382

Query: 1671 IPRVALDTNKVPMHPYETEAVKAVSTYQQKFGRSSFFMTDRLPSPTPSEEGENVDADISA 1850
            I   +L+     +HPYET+A+KAVS+YQQKFGRSS F+++ LPSPTPSEEG++   DI  
Sbjct: 383  IFAGSLEKGNSLLHPYETDALKAVSSYQQKFGRSSLFVSENLPSPTPSEEGDSGKGDIGG 442

Query: 1851 EVSSSPKRHAKPEITPM-VGQLGVSSFPNMNNLSVQGLNSIQNAAPSSYGLNPLLKQSFA 2027
            EV+S    H    +    +GQ  +SS P  N L  QGL + + A P S+  NP L+ S A
Sbjct: 443  EVTSLDVVHNASHLNESSMGQPILSSVPQTNILDGQGLGTARTADPLSFLPNPSLRSSTA 502

Query: 2028 KSRDPRLRLVNSDATA-----GISALPN-ETK-EPLGGIISSKKQKIVEERVLDGPALKR 2186
            KSRDPRLRL  SDA A      I  +P+ + K E    +I SKKQK V+  V   P  KR
Sbjct: 503  KSRDPRLRLATSDAVAQNTNKNILPIPDIDLKLEASLEMIGSKKQKTVDLPVFGAPLPKR 562

Query: 2187 PKTELANSGFIHVGRAVPGNGGWLEDRVPVGFKVAARKPELGLVDPRMPGDVANSTSSNI 2366
             ++E  +S  +   R   GNGGWLEDR   G  + +        D  +   +   T++  
Sbjct: 563  QRSEQTDSIIVSDVRPSTGNGGWLEDRGTAGLPITSSNCATDSSDNDIR-KLEQVTATIA 621

Query: 2367 TMPNVSVGINDKLAIPGSTAS--MQSILTDLVVNPSILLNFLK-GQQMSANSTKSTS-QP 2534
            T+P+V V   +   + G + S  + S+L D+ +NPSI +N +K  QQ SA+++++T+ Q 
Sbjct: 622  TIPSVIVNAAENFPVTGISTSTTLHSLLKDIAINPSIWMNIIKMEQQKSADASRTTTAQA 681

Query: 2535 TSSNSILGAVPSTNLATPKPPVLGQGSAGILHTPSRTAALEESGTVRMKPRDPRRVLHSN 2714
            +SS SILGAVPST+   P+   +GQ S GIL TP+ TA+ +E   VRMKPRDPRRVLH+ 
Sbjct: 682  SSSKSILGAVPSTDAIAPRSSAIGQRSVGILQTPTHTASADEVAIVRMKPRDPRRVLHNT 741

Query: 2715 GLQAGKSMEIDQSQTKTSTLSVPGVMGNLNGQRQDHQREQISVXXXXXXXXXXXXXXXXX 2894
             +  G ++  DQ   KT        + NL  Q Q+ Q ++ S                  
Sbjct: 742  AVLKGGNVGSDQC--KTGVAGTHATISNLGFQSQEDQLDRKSAVTLSTTP---------- 789

Query: 2895 XXXGPDISLQFKENLKNIADILTVSQASSAQSTLPQLPSLQTAQTPQGRIDAKGALEVGS 3074
                PDI+ QF +NLKNIAD+++VS ++S  +        Q  Q+ Q R + K A+   S
Sbjct: 790  ----PDIARQFTKNLKNIADMISVSPSTSLSAA--SQTQTQCLQSHQSRSEGKEAVSEPS 843

Query: 3075 LQTRSV----KEVSAVSSRSQNNWGDVEHLFDGFDDQQKAAXXXXXXXXXXXXXKMFAAR 3242
             +        ++ S  S + Q +WGDVEHLF+G+ DQQ+A              KMF+ R
Sbjct: 844  ERVNDAGLASEKGSPGSLQPQISWGDVEHLFEGYSDQQRADIQRERARRLEEQKKMFSVR 903

Query: 3243 KXXXXXXXXXXXXNSAKFAEVDPVHDEILRKKEEQDREKPQRHLFRFPHMSMWTKLRPGI 3422
            K            NSAKF E+DPVH+EILRKKEEQDREKP RHLFRFPHM MWTKLRPGI
Sbjct: 904  KLCLVLDLDHTLLNSAKFVEIDPVHEEILRKKEEQDREKPCRHLFRFPHMGMWTKLRPGI 963

Query: 3423 WNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGELFAGRVISRGDDNDLIDGDERVPK 3602
            WNFLEKAS L+ELHLYTMGNKLYATEMAKLLDPKG+LFAGRVISRGDD D  DGDERVPK
Sbjct: 964  WNFLEKASNLFELHLYTMGNKLYATEMAKLLDPKGDLFAGRVISRGDDGDPFDGDERVPK 1023

Query: 3603 SKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLLGPSLLEIDHD 3782
            SKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGL GPSLLEIDHD
Sbjct: 1024 SKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHD 1083

Query: 3783 ERTEDGTLASSLAVIEKIHHDFFAHQALDEADVRNILASEQHKILAGCRIVFSRVFPVGE 3962
            ER EDGTLAS L VI++IH +FFAH+++DEADVRNILA+EQ KILAGCRIVFSRVFPVGE
Sbjct: 1084 ERPEDGTLASCLGVIQRIHQNFFAHRSIDEADVRNILATEQKKILAGCRIVFSRVFPVGE 1143

Query: 3963 ANPHLHPLWQTAEQFGAVCTNTIDDQVTHVVANSLGTDKVNWALNTGRFVVHPGWVEASA 4142
            ANPHLHPLWQTAEQFGAVCT+ IDDQVTHVVANSLGTDKVNWAL+TGRFVVHPGWVEASA
Sbjct: 1144 ANPHLHPLWQTAEQFGAVCTSQIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASA 1203

Query: 4143 LLYRRANEHDFAIK 4184
            LLYRRANEHDFAIK
Sbjct: 1204 LLYRRANEHDFAIK 1217


>ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Vitis vinifera]
          Length = 1238

 Score = 1033 bits (2671), Expect = 0.0
 Identities = 621/1232 (50%), Positives = 752/1232 (61%), Gaps = 46/1232 (3%)
 Frame = +3

Query: 627  RFW-MRDFLN----YRISRSFNSGLHNLAWASGVQNKPITDFLVMEMPVTTAENNSSTDS 791
            R W MRD  +    ++    +   L+NLAWA  VQNKP+ D  VM+         SS+ S
Sbjct: 52   RVWTMRDLQDLYKYHQACSGYTPRLYNLAWAQAVQNKPLNDIFVMD---DEESKRSSSSS 108

Query: 792  NRVGPDGLNKNNNERVVIEGDDEGXXXXXXXXXXXXXXXXXXXXXXXVVDADANDSNSFG 971
            N    D  +     +V+I  DD G                        +D++ +  +  G
Sbjct: 109  NTSRDDSSSAKEVAKVII--DDSGDEMDVKMDDVSEKEEGELEEGEIDLDSEPDVKDEGG 166

Query: 972  IRDAG----------LEKGLDSILKGLEGITLEYAEKSFVEACSKLQNLFDSLQEMAEEG 1121
            + D            L + + SI + LE +T+  AEKSF   CS+LQN   SLQ++  E 
Sbjct: 167  VLDVNEPEIDLKERELVERVKSIQEDLESVTVIEAEKSFSGVCSRLQNTLGSLQKVFGEK 226

Query: 1122 WLNK-----KEDLIQLSFAAIRTLNTVFCSMXXXXXXXXXXXIKRLLVDLLNQKHPVVST 1286
             + +     K+ L Q    AIR LN VFCSM             RLL  +     P+ S 
Sbjct: 227  VVGESSVPTKDALAQQLINAIRALNHVFCSMNSNQKELNKDVFSRLLSCVECGDSPIFSI 286

Query: 1287 EQLKEIEGIISSLDSFAVCSGSKGNDINKEMQVAEGFSKNIIDISTGNVNQELLKIKS-- 1460
            + +KE+E ++S LD+ A  S ++ +D   ++QV +G ++NI+D S  +  +     K   
Sbjct: 287  QHIKEVEVMMSFLDTPAAQSSAEASDKVNDVQVTDGMNRNILDSSVESSGRAFASAKKLS 346

Query: 1461 ----SDQSEARTMLDNLKYGGANSKYRGLSLPLLDLHKDHDADSLPSPTRETTPCLPIDK 1628
                S +S  +   D LK G ++S+ R +  PLLDLHKDHD DSLPSPT +   C P++K
Sbjct: 347  LDSISVESYNQNNPDALKPGLSSSRGRFIFGPLLDLHKDHDEDSLPSPTGKAPQCFPVNK 406

Query: 1629 GFGMGHGVLKPEWPIPRVALDTNKVPMHPYETEAVKAVSTYQQKFGRSSFFMTDRLPSPT 1808
                       E    +VA +T    MHPYET+A+KAVSTYQQKFG +SF   D+LPSPT
Sbjct: 407  S----------ELVTAKVAHETQDSIMHPYETDALKAVSTYQQKFGLTSFLPIDKLPSPT 456

Query: 1809 PSEEGENVDADISAEVSSSPKRHAKPEIT-PMVGQLGVSSFPNMNNLSVQGLNSIQNAAP 1985
            PSEE  +   DIS EVSSS    A      P +G   VSS P M++  VQG    +N + 
Sbjct: 457  PSEESGDTYGDISGEVSSSSTISAPITANAPALGHPIVSSAPQMDSSIVQGPTVGRNTSL 516

Query: 1986 SSYGLNPLLKQSF---AKSRDPRLRLVNSDATA------GISALPNETK-EPLGGIISSK 2135
             S G  P L  S    AKSRDPRLRL +SDA +       + A+ N  K +PLG I+SS+
Sbjct: 517  VSSG--PHLDSSVVASAKSRDPRLRLASSDAGSLDLNERPLPAVSNSPKVDPLGEIVSSR 574

Query: 2136 KQKIVEERVLDGPALKRPKTELANSGFIHVGRAVPGNGGWLEDRVPVGFKVAARKP--EL 2309
            KQK  EE +LDGP  KR +  L +   +   + V  +GGWLED   V  ++  R    E 
Sbjct: 575  KQKSAEEPLLDGPVTKRQRNGLTSPATVRDAQTVVASGGWLEDSNTVIPQMMNRNQLIEN 634

Query: 2310 GLVDPRMPGDVANSTSSNITMPNVSVGINDKLAI--PGSTASMQSILTDLVVNPSILLNF 2483
               DP+        T      P V+V  N+ L +    +TAS+QS+L D+ VNP++ +N 
Sbjct: 635  TGTDPKKLESKVTVTGIGCDKPYVTVNGNEHLPVVATSTTASLQSLLKDIAVNPAVWMNI 694

Query: 2484 LKG--QQMSANSTKSTSQPTSSNSILGAVPSTNLATPKPPVLGQGSAGILHTPSRTAALE 2657
                 QQ S +  K+T  P +SNSILG VP  ++A  KP  LGQ  AG L  P +T  ++
Sbjct: 695  FNKVEQQKSGDPAKNTVLPPTSNSILGVVPPASVAPLKPSALGQKPAGALQVP-QTGPMD 753

Query: 2658 ESGTVRMKPRDPRRVLHSNGLQAGKSMEIDQSQTKTSTLSVPGVMGNLNGQRQDHQREQI 2837
            ESG VRMKPRDPRR+LH+N  Q   S   +Q +T              N Q+Q+ Q E  
Sbjct: 754  ESGKVRMKPRDPRRILHANSFQRSGSSGSEQFKT--------------NAQKQEDQTETK 799

Query: 2838 SVXXXXXXXXXXXXXXXXXXXXGPDISLQFKENLKNIADILTVSQASSAQSTLPQLPSLQ 3017
            SV                     PDIS QF +NLKNIAD+++ SQASS   T PQ+ S Q
Sbjct: 800  SVPSHSVNP--------------PDISQQFTKNLKNIADLMSASQASSMTPTFPQILSSQ 845

Query: 3018 TAQTPQGRIDAKGALEVGSLQTR---SVKEVSAVSSRSQNNWGDVEHLFDGFDDQQKAAX 3188
            + Q    R+D K  +     Q     S  E +A   +S+N WGDVEHLFDG+DDQQKAA 
Sbjct: 846  SVQVNTDRMDVKATVSDSGDQLTANGSKPESAAGPPQSKNTWGDVEHLFDGYDDQQKAAI 905

Query: 3189 XXXXXXXXXXXXKMFAARKXXXXXXXXXXXXNSAKFAEVDPVHDEILRKKEEQDREKPQR 3368
                        KMF+ARK            NSAKF EVDPVHDEILRKKEEQDREK QR
Sbjct: 906  QRERARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKSQR 965

Query: 3369 HLFRFPHMSMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGELFAGRV 3548
            HLFRFPHM MWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAK+LDPKG LFAGRV
Sbjct: 966  HLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1025

Query: 3549 ISRGDDNDLIDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPC 3728
            IS+GDD D++DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY YFPC
Sbjct: 1026 ISKGDDGDVLDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPC 1085

Query: 3729 SRRQFGLLGPSLLEIDHDERTEDGTLASSLAVIEKIHHDFFAHQALDEADVRNILASEQH 3908
            SRRQFGL GPSLLEIDHDER EDGTLASSLAVIE+IH  FF+++ALDE DVRNILASEQ 
Sbjct: 1086 SRRQFGLPGPSLLEIDHDERPEDGTLASSLAVIERIHQSFFSNRALDEVDVRNILASEQR 1145

Query: 3909 KILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNTIDDQVTHVVANSLGTDKVNW 4088
            KILAGCRIVFSRVFPVGEANPHLHPLWQTAE FGAVCTN ID+QVTHVVANSLGTDKVNW
Sbjct: 1146 KILAGCRIVFSRVFPVGEANPHLHPLWQTAESFGAVCTNQIDEQVTHVVANSLGTDKVNW 1205

Query: 4089 ALNTGRFVVHPGWVEASALLYRRANEHDFAIK 4184
            AL+TGRFVVHPGWVEASALLYRRANE DFAIK
Sbjct: 1206 ALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1237


>gb|EOX99661.1| RNA polymerase II C-terminal domain phosphatase-like 3, putative
            [Theobroma cacao]
          Length = 1290

 Score = 1033 bits (2670), Expect = 0.0
 Identities = 626/1249 (50%), Positives = 773/1249 (61%), Gaps = 63/1249 (5%)
 Frame = +3

Query: 627  RFW-MRDFLNY-RISRSFNSGLHNLAWASGVQNKPITDFLV--MEMPVTTAENNSSTDS- 791
            R W M+D   Y  + R + SGL+N AWA  VQNKP+ +  V   E P      NS   S 
Sbjct: 77   RVWTMQDLCKYPSVIRGYASGLYNFAWAQAVQNKPLNEIFVKDFEQPQQDENKNSKRSSP 136

Query: 792  -------NRVGPDGLNKNNNERVVIEGDDEGXXXXXXXXXXXXXXXXXXXXXXXVVDADA 950
                   N     G + N   +VVI+ D E                         +D D+
Sbjct: 137  SSSVASVNSKEEKGSSGNLAVKVVIDDDSEDEMEEDKVVNLDKEEGELEEGE---IDLDS 193

Query: 951  NDSNSFGIRDAG-------LEKGLDSILKGLEGITLEYAEKSFVEACSKLQNLFDSLQEM 1109
                     + G       LEK  + I   LEG+T+  AEKSF   CS+L N  +SL+ +
Sbjct: 194  EPKEKVLSSEDGNVGNSDELEKRANLIRGVLEGVTVIEAEKSFEGVCSRLHNALESLRAL 253

Query: 1110 AEEGWLNKKEDLIQLSFAAIRTLNTVFCSMXXXXXXXXXXXIKRLLVDLLNQKHPVVSTE 1289
              E  +  K+ LIQL+F AI   N+ F ++           + RLL  +      +   +
Sbjct: 254  ILECSVPAKDALIQLAFGAI---NSAFVALNCNSKEQNVAILSRLLSIVKGHDPSLFPPD 310

Query: 1290 QLKEIEGIISSLDSFAVCSGSKGNDINKEMQVAEGFSKNIIDISTGNVNQELL---KIKS 1460
            ++KEI+ ++ SL+S A     +  D  K+M+V +G +K   D    N+  +L    K+ S
Sbjct: 311  KMKEIDVMLISLNSPA-----RAIDTEKDMKVVDGVNKKDPDALPENICHDLTVTNKLPS 365

Query: 1461 SDQ----SEARTMLDNLKYGGANSKYRGLSLPLLDLHKDHDADSLPSPTRETTPCLPIDK 1628
            S +    ++   + + LK G  N + RG+SLPLLDLHKDHDADSLPSPTRETTPCLP++K
Sbjct: 366  SAKFVINNKPNALTETLKPGVPNFRNRGISLPLLDLHKDHDADSLPSPTRETTPCLPVNK 425

Query: 1629 GFGMGHGVLKPEWPIPRVALDTNKVPMHPYETEAVKAVSTYQQKFGRSSFFMTDRLPSPT 1808
                G  ++K  +   + + D     +HPYET+A+KA STYQQKFG+ SFF +DRLPSPT
Sbjct: 426  PLTSGDVMVKSGFMTGKGSHDAEGDKLHPYETDALKAFSTYQQKFGQGSFFSSDRLPSPT 485

Query: 1809 PSEEGENVDADISAEVSSSPK-RHAKPEITPMVGQLGVSSFPNMNNLS--VQGLNSIQNA 1979
            PSEE  +   D   EVSSS    + KP + P++G   VSS P +++ S  +QG  + +NA
Sbjct: 486  PSEESGDEGGDNGGEVSSSSSIGNFKPNL-PILGHPIVSSAPLVDSASSSLQGQITTRNA 544

Query: 1980 APSSYGLNPLLKQSFAKSRDPRLRLVNSDATA---GISALPNETK-EPLGGIISSKKQKI 2147
             P S  ++ ++ +S AKSRDPRL   NS+A+A       L N +K  P+GGI+ S+K+K 
Sbjct: 545  TPMS-SVSNIVSKSLAKSRDPRLWFANSNASALDLNERLLHNASKVAPVGGIMDSRKKKS 603

Query: 2148 VEERVLDGPALKRPKTELANSGFIHVGRAVPGNGGWLEDRVPVGFKVAARKPELGLVDPR 2327
            VEE +LD PALKR + EL N G     + V G GGWLED   +G ++  R      ++  
Sbjct: 604  VEEPILDSPALKRQRNELENLGVARDVQTVSGIGGWLEDTDAIGSQITNRNQTAENLESN 663

Query: 2328 MP----GDVANSTSSNITMPNVSVGINDKLAIPG-STASMQSILTDLVVNPSILLNFLK- 2489
                  G  ++ST S  T  N++VG N+++ +   ST S+ ++L D+ VNP++L+N LK 
Sbjct: 664  SRKMDNGVTSSSTLSGKT--NITVGTNEQVPVTSTSTPSLPALLKDIAVNPTMLINILKM 721

Query: 2490 ---------GQQMSANSTKSTSQPTSSNSILGAVPSTNLATPKPPV--LGQGSAGILHTP 2636
                      QQ S +  KST    SSNS+LG V STN+  P P V  +   S+GI   P
Sbjct: 722  GQQQRLGAEAQQKSPDPVKSTFHQPSSNSLLGVVSSTNVI-PSPSVNNVPSISSGISSKP 780

Query: 2637 S---RTAALEESGTVRMKPRDPRRVLHSNGLQAGKSMEIDQSQTKTS-TLSVPGVMGNLN 2804
            +   +  + +ESG +RMKPRDPRRVLH N LQ   SM +DQ +T  + T S  G   NLN
Sbjct: 781  AGNLQVPSPDESGKIRMKPRDPRRVLHGNSLQRSGSMGLDQLKTNGALTSSTQGSKDNLN 840

Query: 2805 GQRQDHQREQISVXXXXXXXXXXXXXXXXXXXXGPDISLQFKENLKNIADILTVSQASSA 2984
             Q+ D Q E   +                     PDI+ QF  NLKNIADI++VSQA   
Sbjct: 841  AQKLDSQTESKPMQSQLVPP--------------PDITQQFTNNLKNIADIMSVSQA--- 883

Query: 2985 QSTLPQLPSLQTAQTPQGRIDAKGALEVGSLQTRS---------VKEVSAVSSRSQNNWG 3137
               L  LP +     PQ  +    ++++ +L + S           E  A   RSQN WG
Sbjct: 884  ---LTSLPPVSHNLVPQPVLIKSDSMDMKALVSNSEDQQTGAGLAPEAGATGPRSQNAWG 940

Query: 3138 DVEHLFDGFDDQQKAAXXXXXXXXXXXXXKMFAARKXXXXXXXXXXXXNSAKFAEVDPVH 3317
            DVEHLF+ +DDQQKAA             KMF+ARK            NSAKF EVDPVH
Sbjct: 941  DVEHLFERYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFIEVDPVH 1000

Query: 3318 DEILRKKEEQDREKPQRHLFRFPHMSMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAT 3497
            +EILRKKEEQDREKP+RHLFRF HM MWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAT
Sbjct: 1001 EEILRKKEEQDREKPERHLFRFHHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAT 1060

Query: 3498 EMAKLLDPKGELFAGRVISRGDDNDLIDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWP 3677
            EMAK+LDPKG LFAGRVISRGDD D  DGDERVP+SKDLEGVLGMESAVVIIDDSVRVWP
Sbjct: 1061 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESAVVIIDDSVRVWP 1120

Query: 3678 HNKLNLIVVERYIYFPCSRRQFGLLGPSLLEIDHDERTEDGTLASSLAVIEKIHHDFFAH 3857
            HNKLNLIVVERY YFPCSRRQFGLLGPSLLEIDHDER EDGTLASSLAVIE+IH DFF+H
Sbjct: 1121 HNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIERIHQDFFSH 1180

Query: 3858 QALDEADVRNILASEQHKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNTIDD 4037
            Q LD+ DVRNILASEQ KILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTN ID+
Sbjct: 1181 QNLDDVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDE 1240

Query: 4038 QVTHVVANSLGTDKVNWALNTGRFVVHPGWVEASALLYRRANEHDFAIK 4184
             VTHVVANSLGTDKVNWAL+TG+FVVHPGWVEASALLYRRANE DFAIK
Sbjct: 1241 HVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASALLYRRANEVDFAIK 1289


>ref|XP_004252660.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Solanum lycopersicum]
          Length = 1211

 Score = 1017 bits (2629), Expect = 0.0
 Identities = 602/1219 (49%), Positives = 751/1219 (61%), Gaps = 33/1219 (2%)
 Frame = +3

Query: 627  RFW-MRDFLNYRISRSFNSGLHNLAWASGVQNKPITDFLVMEMPVTTAENNSSTDSNRVG 803
            R W MRD   Y ISR +  GL+NLAWA  VQNKP+ +  VM          +S +SN+  
Sbjct: 60   RVWTMRDVYKYPISRDYARGLYNLAWAQAVQNKPLDELFVM----------TSDNSNQCA 109

Query: 804  PDGLNKNNNERVVIEGDDEGXXXXXXXXXXXXXXXXXXXXXXXVVDADANDSNSFGIRDA 983
                  N   +V+I+ D                           VD DA +       + 
Sbjct: 110  ------NGESKVIIDVD---------------------------VDDDAKEEGELEEGEI 136

Query: 984  GLE---------KGLDSILKGLEGITLEYAEKSFVEACSKLQNLFDSLQEMAEEGWLNKK 1136
             L+         K  + I + L+ +TL+   KSF   CSKLQ    +L E+A     +K 
Sbjct: 137  DLDSADLVVNFGKEANFIREQLQSVTLDETHKSFSMVCSKLQTSLLALGELALS--QDKN 194

Query: 1137 EDLIQLSFAAIRTLNTVFCSMXXXXXXXXXXXIKRLLVDLLNQKHPVVSTEQLKEIEGII 1316
            + LIQL   A+RT+N+VF SM           + RLL +   Q   ++S+EQLKE++ +I
Sbjct: 195  DILIQLFMTALRTINSVFYSMNDHQKQQNTDILSRLLFNAKTQLPALLSSEQLKELDALI 254

Query: 1317 SSLDSFAVCSGSKGNDINKEMQVAEGFSKNIIDISTGNVNQEL----------LKIKSSD 1466
             S++   V S ++ ND    + V +         S+ N NQ+           + IKSS 
Sbjct: 255  LSINHSLVSSNTQDNDTVNGINVVQLLDMKDSHKSSENANQDFTSVNKYDLGDVSIKSSG 314

Query: 1467 QSEARTMLDNLKYGGANSKYRGLSLPLLDLHKDHDADSLPSPTRETTPCLPIDKGFGMGH 1646
              E     +++K G  NSK +GLS PLLDLHKDHD D+LPSPTR+  P  P  +     H
Sbjct: 315  LKEQSVSSESVKPGLDNSKAKGLSFPLLDLHKDHDEDTLPSPTRQIGPQFPATQ----TH 370

Query: 1647 GVLKPEWPIPRVALDTNKVPMHPYETEAVKAVSTYQQKFGRSSFFMTDRLPSPTPSEEGE 1826
            G++K + PI   +LD     +HPYET+A+KAVS+YQQKFGRSS F+++ LPSPTPSEE +
Sbjct: 371  GMVKLDLPIFPASLDKGNSLLHPYETDALKAVSSYQQKFGRSSLFVSENLPSPTPSEEDD 430

Query: 1827 NVDADISAEVSSSPKRHAKPEIT-PMVGQLGVSSFPNMNNLSVQGLNSIQNAAPSSYGLN 2003
            +   D   EV+S    H    +    +GQ  +SS P  N L  QGL + + A P S+  N
Sbjct: 431  SGKGDTGGEVTSFDVVHNASHLNESSMGQPILSSVPQTNILDGQGLGTTRTADPLSFLPN 490

Query: 2004 PLLKQSFAKSRDPRLRLVNSDATAGISALP----NETKEPLGGIISSKKQKIVEERVLDG 2171
            P L+ S AKSRDPRLRL  SD  A  + LP    +   E    +I SKKQK V+    D 
Sbjct: 491  PSLRSSTAKSRDPRLRLATSDTVAQNTILPIPDIDLKLEASLEMIVSKKQKTVDLSAFDA 550

Query: 2172 PALKRPKTELANSGFIHVGRAVPGNGGWLEDRVPVGFKVAARKPELGLVDPRMPGDVANS 2351
            P  KR ++E  +S  +   R   GNGGWLEDR      + +        D  +   +   
Sbjct: 551  PLPKRQRSEQTDSIIVSDVRPSIGNGGWLEDRGTAELPITSSNCATYNSDNDI-RKLEQV 609

Query: 2352 TSSNITMPNVSVGINDKLAIPG--STASMQSILTDLVVNPSILLNFLK-GQQMSANSTK- 2519
            T++  T+P+V V   +   + G  ++ ++ S+L D+ +NPSI +N +K  QQ SA++++ 
Sbjct: 610  TATIATIPSVIVNAAENFPVTGISTSTTLHSLLKDIAINPSIWMNIIKTEQQKSADASRT 669

Query: 2520 STSQPTSSNSILGAVPSTNLATPKPPVLGQGSAGILHTPSRTAALEESGTVRMKPRDPRR 2699
            +T+Q +SS SILGAVPST    P+   +GQ S GIL TP+ TA+ +E   VRMKPRDPRR
Sbjct: 670  NTAQASSSKSILGAVPSTVAVAPRSSAIGQRSVGILQTPTHTASADEVAIVRMKPRDPRR 729

Query: 2700 VLHSNGLQAGKSMEIDQSQTKTSTLSVPGVMGNLNGQRQDHQREQISVXXXXXXXXXXXX 2879
            VLHS  +  G S+ +D  Q KT        + NL+ Q Q+ Q ++ S             
Sbjct: 730  VLHSTAVLKGGSVGLD--QCKTGVAGTHATISNLSFQSQEDQLDRKSAVTLSTTP----- 782

Query: 2880 XXXXXXXXGPDISLQFKENLKNIADILTVSQASSAQSTLPQLPSLQTAQTPQGRIDAKGA 3059
                     PDI+ QF +NLKNIAD+++VS  S++ S   Q  +L   Q  Q R + KGA
Sbjct: 783  ---------PDIACQFTKNLKNIADMISVS-PSTSPSVASQTQTL-CIQAYQSRSEVKGA 831

Query: 3060 LEVGSLQTRSV----KEVSAVSSRSQNNWGDVEHLFDGFDDQQKAAXXXXXXXXXXXXXK 3227
            +   S          ++ S  S + Q +WGDVEHLF+G+ DQQ+A              K
Sbjct: 832  VSEPSEWVNDAGLASEKGSPGSLQPQISWGDVEHLFEGYSDQQRADIQRERTRRLEEQKK 891

Query: 3228 MFAARKXXXXXXXXXXXXNSAKFAEVDPVHDEILRKKEEQDREKPQRHLFRFPHMSMWTK 3407
            MF+ RK            NSAKF E+DPVH+EILRKKEEQDREKP RHLFRFPHM MWTK
Sbjct: 892  MFSVRKLCLVLDLDHTLLNSAKFVEIDPVHEEILRKKEEQDREKPYRHLFRFPHMGMWTK 951

Query: 3408 LRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGELFAGRVISRGDDNDLIDGD 3587
            LRPGIWNFLEKAS L+ELHLYTMGNKLYATEMAKLLDPKG+LFAGRVISRGDD D  DGD
Sbjct: 952  LRPGIWNFLEKASNLFELHLYTMGNKLYATEMAKLLDPKGDLFAGRVISRGDDGDPFDGD 1011

Query: 3588 ERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLLGPSLL 3767
            ERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGL GPSLL
Sbjct: 1012 ERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLL 1071

Query: 3768 EIDHDERTEDGTLASSLAVIEKIHHDFFAHQALDEADVRNILASEQHKILAGCRIVFSRV 3947
            EIDHDER EDGTLAS L VI++IH +FF H+++DEADVRNILA+EQ KILAGCRIVFSRV
Sbjct: 1072 EIDHDERPEDGTLASCLGVIQRIHQNFFTHRSIDEADVRNILATEQKKILAGCRIVFSRV 1131

Query: 3948 FPVGEANPHLHPLWQTAEQFGAVCTNTIDDQVTHVVANSLGTDKVNWALNTGRFVVHPGW 4127
            FPVGEA+PHLHPLWQTAEQFGAVCT+ IDDQVTHVVANSLGTDKVNWAL+TGR VVHPGW
Sbjct: 1132 FPVGEASPHLHPLWQTAEQFGAVCTSQIDDQVTHVVANSLGTDKVNWALSTGRSVVHPGW 1191

Query: 4128 VEASALLYRRANEHDFAIK 4184
            VEASALLYRRANEHDFAIK
Sbjct: 1192 VEASALLYRRANEHDFAIK 1210


>ref|XP_002304648.2| hypothetical protein POPTR_0003s16280g [Populus trichocarpa]
            gi|550343308|gb|EEE79627.2| hypothetical protein
            POPTR_0003s16280g [Populus trichocarpa]
          Length = 1247

 Score =  999 bits (2584), Expect = 0.0
 Identities = 598/1235 (48%), Positives = 735/1235 (59%), Gaps = 52/1235 (4%)
 Frame = +3

Query: 636  MRDFLNYRISRSFNSGLHNLAWASGVQNKPITDFLVMEMPVTTAENNSSTDSNRVGPDGL 815
            +RD   Y++   + SGL+NLAWA  VQNKP+ +  V E+ V  +   SS  S       +
Sbjct: 68   VRDLYKYQVGGGYMSGLYNLAWAQAVQNKPLNELFV-EVEVDDSSQKSSVSS-------V 119

Query: 816  NKNNNERVVIEGDDEGXXXXXXXXXXXXXXXXXXXXXXXVVDADANDSNSFGIRDAGLEK 995
            N +  ++  +  DD G                        +D D+   +  G+     EK
Sbjct: 120  NSSKEDKRTVVIDDSGDEMDVVKVIDIEKEEGELEEGE--IDLDSEGKSEGGMVSVDTEK 177

Query: 996  GLDSILKGLEGITLEYAEKSFVEACSKLQNLFDSLQEMAE--EGWLNKKEDLIQLSFAAI 1169
             + SI + LE +++   +KSF   C KL N  +SL+E+    E     K+ L++L F AI
Sbjct: 178  RVKSIREDLESVSVIKDDKSFEAVCLKLHNALESLKELVRVNENGFPSKDSLVRLLFTAI 237

Query: 1170 RTLNTVFCSMXXXXXXXXXXXIKRLLVDLLNQKHP-VVSTEQLKEIEGIISSLDSFAVCS 1346
              +N+ F SM             R L  L+N   P   S E  KE+    +  D   V  
Sbjct: 238  GAVNSFFSSMNQKLKEQNKGVFMRFL-SLVNSHDPSFFSPEHTKEVCDFCN-FDFRIVSL 295

Query: 1347 GSKGNDINKEMQVAEGFSKNIIDISTGNVNQELLKIKSSDQSEARTMLDNLKYGGANSKY 1526
                  +N+    AE F  N  + S                      ++  K G  + K 
Sbjct: 296  CYDLTTMNRLPSAAESFVHNKPNFS----------------------IEPPKPGVPSFKS 333

Query: 1527 RGLSLPLLDLHKDHDADSLPSPTRETTPCLPIDKGFGMGHGVLKPEWPIPRVALDTNKVP 1706
            RG+ LPLLDL K HD DSLPSPTRET P  P+ +   +G G++    P+P+VA  T +  
Sbjct: 334  RGVLLPLLDLKKFHDEDSLPSPTRETAPSFPVQRLLPIGDGMISSGLPVPKVASITEEPR 393

Query: 1707 MHPYETEAVKAVSTYQQKFGRSSFFMTDRLPSPTPSEEGENVDADISAEVSSSPKRHAKP 1886
            +HPYET+A+KAVS+YQ+KF  +SFF T+ LPSPTPSEE  N D D + EVSSS   + + 
Sbjct: 394  VHPYETDALKAVSSYQKKFNLNSFF-TNELPSPTPSEESGNGDGDTAGEVSSSSTVNYRT 452

Query: 1887 EITPMVGQLGVSSFPN--------------MNNLSVQGLNSIQNAAPSSYGLNPLLKQSF 2024
               P+  +   S  P+              +NN S++ +   +N+AP S G +  +K S 
Sbjct: 453  VNPPVSDRKSASPSPSPPPPPPPPPPPPPHLNNSSIRVVIPTRNSAPVSSGTSSTVKAS- 511

Query: 2025 AKSRDPRLRLVNSDATA------GISALPNETK-EPLGGIISSKKQKIVEERVLDGPALK 2183
            AKSRDPRLR VN+DA+A       +  + N  + EP G I  S+KQKI EE VLDG +LK
Sbjct: 512  AKSRDPRLRYVNTDASALDQNQRTLLMVNNPPRAEPSGAIAGSRKQKI-EEDVLDGTSLK 570

Query: 2184 RPKTELANSGFIHVGRAVPGNGGWLEDRVPVGFKVA-----------ARKPELGLVDPRM 2330
            R +    N G +   R++ G GGWLED      +              ++   G+V P  
Sbjct: 571  RQRNSFDNFGVVRDIRSMTGTGGWLEDTDMAEPQTVNKNQWAENAEPGQRINNGVVCPST 630

Query: 2331 PGDVAN-STSSNITMP----NVSVGINDKLAIPGSTASMQSILTDLVVNPSILLNFLK-- 2489
               +++ S S N+ +P    N   G         +TAS+  +L D+ VNP++L+N LK  
Sbjct: 631  GSVMSSVSCSGNVQVPVMGINTIAGSEQAPVTSTTTASLPDLLKDITVNPTMLINILKMG 690

Query: 2490 --------GQQMSANSTKSTSQPTSSNSILGAVPSTNLATPKPPVLGQGSAGILHTPSRT 2645
                    GQQ  A+  KSTS P SSN++LGA+P  N  +  P  +   SAG    PS+ 
Sbjct: 691  QQQRLALDGQQKLADPAKSTSHPPSSNTVLGAIPEVNAVSSLPSGILPRSAGKAQGPSQI 750

Query: 2646 AALEESGTVRMKPRDPRRVLHSNGLQAGKSMEIDQSQTKTSTLSVPGVMGNLNGQRQDHQ 2825
            A  +ESG +RMKPRDPRRVLH+N LQ   S+  +Q +T T T +  G   N N Q+Q+  
Sbjct: 751  ATTDESGKIRMKPRDPRRVLHNNALQRAGSLGSEQFKTTTLTSTTQGTKDNQNLQKQEGL 810

Query: 2826 REQISVXXXXXXXXXXXXXXXXXXXXGPDISLQFKENLKNIADILTVSQASSAQSTLPQL 3005
             E   V                     PDIS  F ++LKNIADI++VSQ  +    + Q 
Sbjct: 811  AELKPVVP-------------------PDISSPFTKSLKNIADIVSVSQTCTTPPFVSQN 851

Query: 3006 PSLQTAQTPQGRIDAKGALEVGS--LQTRSVKEVSAVSSRSQNNWGDVEHLFDGFDDQQK 3179
             + Q  Q    R+D K  +      +   S  EV A SS SQN W DVEHLF+G+DDQQK
Sbjct: 852  VASQPVQIKSDRVDGKTGISNSDQKMGPASSPEVVAASSLSQNTWEDVEHLFEGYDDQQK 911

Query: 3180 AAXXXXXXXXXXXXXKMFAARKXXXXXXXXXXXXNSAKFAEVDPVHDEILRKKEEQDREK 3359
            AA             K+FAARK            NSAKF EVDPVHDEILRKKEEQDREK
Sbjct: 912  AAIQRERARRIEEQKKLFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREK 971

Query: 3360 PQRHLFRFPHMSMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGELFA 3539
            P RHLFRFPHM MWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAK+LDPKG LFA
Sbjct: 972  PYRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFA 1031

Query: 3540 GRVISRGDDNDLIDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIY 3719
            GRV+SRGDD DL+DGDERVPKSKDLEGVLGMES VVIIDDS+RVWPHNKLNLIVVERYIY
Sbjct: 1032 GRVVSRGDDGDLLDGDERVPKSKDLEGVLGMESGVVIIDDSLRVWPHNKLNLIVVERYIY 1091

Query: 3720 FPCSRRQFGLLGPSLLEIDHDERTEDGTLASSLAVIEKIHHDFFAHQALDEADVRNILAS 3899
            FPCSRRQFGL GPSLLEIDHDER EDGTLA SLAVIE+IH +FF H +LDEADVRNILAS
Sbjct: 1092 FPCSRRQFGLPGPSLLEIDHDERPEDGTLACSLAVIERIHQNFFTHHSLDEADVRNILAS 1151

Query: 3900 EQHKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNTIDDQVTHVVANSLGTDK 4079
            EQ KILAGCRIVFSRVFPVGE NPHLHPLWQ+AEQFGAVCTN ID+QVTHVVANSLGTDK
Sbjct: 1152 EQRKILAGCRIVFSRVFPVGEVNPHLHPLWQSAEQFGAVCTNQIDEQVTHVVANSLGTDK 1211

Query: 4080 VNWALNTGRFVVHPGWVEASALLYRRANEHDFAIK 4184
            VNWAL+TGRFVVHPGWVEASALLYRRANE DFAIK
Sbjct: 1212 VNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1246


>ref|XP_006438860.1| hypothetical protein CICLE_v10030535mg [Citrus clementina]
            gi|568858958|ref|XP_006483010.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 3-like
            [Citrus sinensis] gi|557541056|gb|ESR52100.1|
            hypothetical protein CICLE_v10030535mg [Citrus
            clementina]
          Length = 1234

 Score =  988 bits (2554), Expect = 0.0
 Identities = 602/1228 (49%), Positives = 748/1228 (60%), Gaps = 42/1228 (3%)
 Frame = +3

Query: 627  RFW-MRDFLNY--RISRSFNSGLHNLAWASGVQNKPITDFLVMEMPVTTAENNSSTDSNR 797
            R W MRD  N    I R +  GLHNLAWA  VQNKP+ +  VME         SS  S+ 
Sbjct: 51   RVWTMRDLYNKYPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSV 110

Query: 798  VGPD-GLNKNNNERVVIEG---DDEGXXXXXXXXXXXXXXXXXXXXXXXVVDADANDSNS 965
               + G     +++ V+E    DD G                        +++++N+  S
Sbjct: 111  ASVNSGAAAGKDDKKVVEKVVIDDSGDEIEKEEGELEEGEIELD------LESESNEKVS 164

Query: 966  FGIRDAGLEKGLDSILKGLEGITLEYAEKSFVEACSKLQNLFDSLQEMAEEGWLNKKEDL 1145
              +++      ++SI + LE +     + SF   CSKL+   +SL+E+  E  +  K+ L
Sbjct: 165  EQVKEEMKLINVESIREALESVLR--GDISFEGVCSKLEFTLESLRELVNENNVPTKDAL 222

Query: 1146 IQLSFAAIRTLNTVFCSMXXXXXXXXXXXIKRLLVDLLNQKHPVVSTEQLKEIEGIISSL 1325
            IQL+F+A++++++VFCSM           + RLL  + + + P+ S+ Q+KE+E ++SSL
Sbjct: 223  IQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSVIKSHEPPLFSSNQIKEMEAMLSSL 282

Query: 1326 DSFAVCSGSKGNDINKEMQVAEGFSKNIIDISTGNVNQEL-------LKIKSSDQSEART 1484
             +       + ND  K+M    G +    +I T N   +L       L + S  Q++   
Sbjct: 283  VT-------RANDKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNKP-- 333

Query: 1485 MLDNLKYGGANSKYRGLSLPLLDLHKDHDADSLPSPTRETTPCLPIDKGFGMGHGVLKPE 1664
             L+  K G    + RG+ LPLLD HK HD DSLPSPTRETTP +P+ +   +G GV+K  
Sbjct: 334  -LEASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGVVK-S 391

Query: 1665 WPIPRVALDTNKVPMHP-YETEAVKAVSTYQQKFGRSSFFMTDRLPSPTPSEEGENVDAD 1841
            W          +V   P YET+A++A S+YQQKFGR+SFFM   LPSPTPSEE  + D D
Sbjct: 392  WAAAAKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGD 451

Query: 1842 ISAEVSSSPK-RHAKPEITPMVGQLGVSSFPN-----MNNLSVQGLNSIQNAAPSSYGLN 2003
               E+SS+      KP   P +GQ  VSS P      M+  SVQ L +  N+AP+S G N
Sbjct: 452  TGGEISSATAVDQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYN 511

Query: 2004 PLLK-----QSFAKSRDPRLRLVNSDAT----AGISALPNETK-EPLGGIISSKKQKIVE 2153
            P++K     ++  KSRDPRLR  +S+A          L N  K EP+G ++SS+KQK VE
Sbjct: 512  PVVKPNPVVKAPIKSRDPRLRFASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVE 571

Query: 2154 ERVLDGPALKRPKTELANSGFIHVGRAVPGNGGWLEDRVPVGFKVAARKPELGLVDPRMP 2333
            E VLDGPALKR +    NSG +   + + G+GGWLED      ++  R     LVD    
Sbjct: 572  EPVLDGPALKRQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNL---LVDSAES 628

Query: 2334 GD--VANSTSSNITM--PNVSVGINDKL--AIPGSTASMQSILTDLVVNPSILLNFLK-- 2489
                + N  +S IT   PNV V  N+      P +T S+ ++L D+ VNP++LLN LK  
Sbjct: 629  NSRKLDNGATSPITSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMG 688

Query: 2490 GQQMSANSTKSTSQPTSSNSILGAVPSTNLATPKPPVLGQGSAGILHTPSRTAALEESGT 2669
             QQ  A   +  S  +S N++   +PS+    P   V     +GIL  P     ++E G 
Sbjct: 689  QQQKLAADAQQKSNDSSMNTMHPPIPSS---IPPVSVTCSIPSGILSKP-----MDELGK 740

Query: 2670 VRMKPRDPRRVLHSNGLQAGKSMEIDQSQTKTSTLSVPGVMGNLNGQRQDHQREQISVXX 2849
            VRMKPRDPRRVLH N LQ   S+  +      S     G   NLN Q+Q    E   V  
Sbjct: 741  VRMKPRDPRRVLHGNALQRSGSLGPEFKTDGPSAPCTQGSKENLNFQKQLGAPEAKPVLS 800

Query: 2850 XXXXXXXXXXXXXXXXXXGPDISLQFKENLKNIADILTVSQASSAQSTLPQLPSLQTAQT 3029
                               PDI+ QF +NLK+IAD ++VSQ  +++  + Q   +Q  Q 
Sbjct: 801  QSVLQ--------------PDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQI 846

Query: 3030 PQGRIDAKGAL---EVGSLQTRSVKEVSAVSSRSQNNWGDVEHLFDGFDDQQKAAXXXXX 3200
              G  D K  +   +     T S  E   V +  Q+ WGDVEHLF+G+DDQQKAA     
Sbjct: 847  KSGA-DMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKER 905

Query: 3201 XXXXXXXXKMFAARKXXXXXXXXXXXXNSAKFAEVDPVHDEILRKKEEQDREKPQRHLFR 3380
                    KMF+ARK            NSAKF EVDPVHDEILRKKEEQDREKP RHLFR
Sbjct: 906  TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965

Query: 3381 FPHMSMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGELFAGRVISRG 3560
            FPHM MWTKLRPGIW FLE+ASKL+E+HLYTMGNKLYATEMAK+LDPKG LFAGRVISRG
Sbjct: 966  FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025

Query: 3561 DDNDLIDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQ 3740
            DD D  DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY YFPCSRRQ
Sbjct: 1026 DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085

Query: 3741 FGLLGPSLLEIDHDERTEDGTLASSLAVIEKIHHDFFAHQALDEADVRNILASEQHKILA 3920
            FGLLGPSLLEIDHDER+EDGTLASSL VIE++H  FF+HQ+LD+ DVRNILA+EQ KILA
Sbjct: 1086 FGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILA 1145

Query: 3921 GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNTIDDQVTHVVANSLGTDKVNWALNT 4100
            GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCT  IDDQVTHVVANSLGTDKVNWAL+T
Sbjct: 1146 GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST 1205

Query: 4101 GRFVVHPGWVEASALLYRRANEHDFAIK 4184
            GRFVVHPGWVEASALLYRRANE DFAIK
Sbjct: 1206 GRFVVHPGWVEASALLYRRANEQDFAIK 1233


>emb|CBI35661.3| unnamed protein product [Vitis vinifera]
          Length = 1184

 Score =  981 bits (2537), Expect = 0.0
 Identities = 605/1217 (49%), Positives = 729/1217 (59%), Gaps = 31/1217 (2%)
 Frame = +3

Query: 627  RFW-MRDFLN----YRISRSFNSGLHNLAWASGVQNKPITDFLVMEMPVTTAENNSSTDS 791
            R W MRD  +    ++    +   L+NLAWA  VQNKP+ D  V+         + S D 
Sbjct: 92   RVWTMRDLQDLYKYHQACSGYTPRLYNLAWAQAVQNKPLNDIFVII--------DDSGDE 143

Query: 792  NRVGPDGLNKNNN---ERVVIEGDDEGXXXXXXXXXXXXXXXXXXXXXXXVVDADANDSN 962
              V  D +++      E   I+ D E                        V+D +  + +
Sbjct: 144  MDVKMDDVSEKEEGELEEGEIDLDSE----------------PDVKDEGGVLDVNEPEID 187

Query: 963  SFGIRDAGLEKGLDSILKGLEGITLEYAEKSFVEACSKLQNLFDSLQEMAEEGWLNK--- 1133
               +++  L + + SI + LE +T+  AEKSF   CS+LQN   SLQ++  E  + +   
Sbjct: 188  ---LKERELVERVKSIQEDLESVTVIEAEKSFSGVCSRLQNTLGSLQKVFGEKVVGESSV 244

Query: 1134 --KEDLIQLSFAAIRTLNTVFCSMXXXXXXXXXXXIKRLLVDLLNQKHPVVSTEQLKEIE 1307
              K+ L Q    AIR LN VFCSM             RLL  +     P+ S + +KE+E
Sbjct: 245  PTKDALAQQLINAIRALNHVFCSMNSNQKELNKDVFSRLLSCVECGDSPIFSIQHIKEVE 304

Query: 1308 GIISSLDSFAVCSGSKGNDINKEMQVAEGFSKNIIDISTGNVNQELLKIKSSDQSEARTM 1487
             ++S LD+ A  S ++ +D   ++QV +G ++NI+D              SS +S  R  
Sbjct: 305  VMMSFLDTPAAQSSAEASDKVNDVQVTDGMNRNILD--------------SSVESSGRAF 350

Query: 1488 LDNLKYGGANSKYRGLSLPLLDLHKDHDADSLPSPTRETTPCLPIDKGFGMGHGVLKPEW 1667
                K+ G     R +  PLLDLHKDHD DSLPSPT +   C P++K           E 
Sbjct: 351  ASAKKFRG-----RFIFGPLLDLHKDHDEDSLPSPTGKAPQCFPVNKS----------EL 395

Query: 1668 PIPRVALDTNKVPMHPYETEAVKAVSTYQQKFGRSSFFMTDRLPSPTPSEEGENVDADIS 1847
               +VA +T    MHPYET+A+KAVSTYQQKFG +SF   D+LPSPTPSEE  +   DIS
Sbjct: 396  VTAKVAHETQDSIMHPYETDALKAVSTYQQKFGLTSFLPIDKLPSPTPSEESGDTYGDIS 455

Query: 1848 AEVSSSPKRHAKPEIT-PMVGQLGVSSFPNMNNLSVQGLNSIQNAAPSSYGLNPLLKQSF 2024
             EVSSS    A      P +G   VSS P M+   VQGL   +N    +   N +L+ S 
Sbjct: 456  GEVSSSSTISAPITANAPALGHPIVSSAPQMD--IVQGLVVPRNTGAVNSRFNSILRAS- 512

Query: 2025 AKSRDPRLRLVNSDATA------GISALPNETK-EPLGGIISSKKQKIVEERVLDGPALK 2183
            AKSRDPRLRL +SDA +       + A+ N  K +PLG I+SS+KQK  EE +LDGP  K
Sbjct: 513  AKSRDPRLRLASSDAGSLDLNERPLPAVSNSPKVDPLGEIVSSRKQKSAEEPLLDGPVTK 572

Query: 2184 RPKTELANSGFIHVGRAVPGNGGWLEDRVPVGFKVAARKPELGLVDPRMPGDVANSTSSN 2363
            R +  L +                LE +V V                         T   
Sbjct: 573  RQRNGLTSPATK------------LESKVTV-------------------------TGIG 595

Query: 2364 ITMPNVSVGINDKLAI--PGSTASMQSILTDLVVNPSILLNFLKG--QQMSANSTKSTSQ 2531
               P V+V  N+ L +    +TAS+QS+L D+ VNP++ +N      QQ S +  K+T  
Sbjct: 596  CDKPYVTVNGNEHLPVVATSTTASLQSLLKDIAVNPAVWMNIFNKVEQQKSGDPAKNTVL 655

Query: 2532 PTSSNSILGAVPSTNLATPKPPVLGQGSAGILHTPSRTAAL---EESGTVRMKPRDPRRV 2702
            P +SNSILG VP  ++A  KP  LGQ  AG L  P +T  +   +ESG VRMKPRDPRR+
Sbjct: 656  PPTSNSILGVVPPASVAPLKPSALGQKPAGALQVP-QTGPMNPQDESGKVRMKPRDPRRI 714

Query: 2703 LHSNGLQAGKSMEIDQSQTKTSTLSVPGVMGNLNGQRQDHQREQISVXXXXXXXXXXXXX 2882
            LH+N  Q   S   +Q +T              N Q+Q+ Q E  SV             
Sbjct: 715  LHANSFQRSGSSGSEQFKT--------------NAQKQEDQTETKSVPSHSVNP------ 754

Query: 2883 XXXXXXXGPDISLQFKENLKNIADILTVSQASSAQSTLPQLPSLQTAQTPQGRIDAKGAL 3062
                    PDIS QF +NLKNIAD+++ SQASS   T PQ+ S Q+ Q    R+D K  +
Sbjct: 755  --------PDISQQFTKNLKNIADLMSASQASSMTPTFPQILSSQSVQVNTDRMDVKATV 806

Query: 3063 EVGSLQTR---SVKEVSAVSSRSQNNWGDVEHLFDGFDDQQKAAXXXXXXXXXXXXXKMF 3233
                 Q     S  E +A   +S+N WGDVEHLFDG+DDQQKAA             KMF
Sbjct: 807  SDSGDQLTANGSKPESAAGPPQSKNTWGDVEHLFDGYDDQQKAAIQRERARRIEEQKKMF 866

Query: 3234 AARKXXXXXXXXXXXXNSAKFAEVDPVHDEILRKKEEQDREKPQRHLFRFPHMSMWTKLR 3413
            +ARK            NSAKF EVDPVHDEILRKKEEQDREK QRHLFRFPHM MWTKLR
Sbjct: 867  SARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKSQRHLFRFPHMGMWTKLR 926

Query: 3414 PGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGELFAGRVISRGDDNDLIDGDER 3593
            PGIWNFLEKASKLYELHLYTMGNKLYATEMAK+LDPKG LFAGRVIS+GDD D++DGDER
Sbjct: 927  PGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKGDDGDVLDGDER 986

Query: 3594 VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLLGPSLLEI 3773
            VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY YFPCSRRQFGL GPSLLEI
Sbjct: 987  VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEI 1046

Query: 3774 DHDERTEDGTLASSLAVIEKIHHDFFAHQALDEADVRNILASEQHKILAGCRIVFSRVFP 3953
            DHDER EDGTLASSLAVIE+IH  FF+++ALDE DVRNILASEQ KILAGCRIVFSRVFP
Sbjct: 1047 DHDERPEDGTLASSLAVIERIHQSFFSNRALDEVDVRNILASEQRKILAGCRIVFSRVFP 1106

Query: 3954 VGEANPHLHPLWQTAEQFGAVCTNTIDDQVTHVVANSLGTDKVNWALNTGRFVVHPGWVE 4133
            VGEANPHLHPLWQTAE FGAVCTN ID+QVTHVVANSLGTDKVNWAL+TGRFVVHPGWVE
Sbjct: 1107 VGEANPHLHPLWQTAESFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVE 1166

Query: 4134 ASALLYRRANEHDFAIK 4184
            ASALLYRRANE DFAIK
Sbjct: 1167 ASALLYRRANEQDFAIK 1183


>gb|AAV92930.1| putative transcription regulator CPL1 [Solanum lycopersicum]
          Length = 1227

 Score =  981 bits (2536), Expect = 0.0
 Identities = 597/1254 (47%), Positives = 744/1254 (59%), Gaps = 68/1254 (5%)
 Frame = +3

Query: 627  RFW-MRDFLNYRISRSFNSGLHNLAWASGVQNKPITDFLVMEMPVTTAENNSSTDSNRVG 803
            R W MRD   Y ISR +  GL+NLAWA  VQNKP+ +  VM          +S +SN+  
Sbjct: 60   RVWTMRDVYKYPISRDYARGLYNLAWAQAVQNKPLDELFVM----------TSDNSNQCA 109

Query: 804  PDGLNKNNNERVVIEGDDEGXXXXXXXXXXXXXXXXXXXXXXXVVDADANDSNSFGIRDA 983
                  N   +V+I+ D                           VD DA +       + 
Sbjct: 110  ------NGESKVIIDVD---------------------------VDDDAKEEGELEEGEI 136

Query: 984  GLE---------KGLDSILKGLEGITLEYAEKSFVEACSKLQNLFDSLQEMAEEGWLNKK 1136
             L+         K  + I + L+ +TL+   KSF   CSKLQ    +L E+A     +K 
Sbjct: 137  DLDSADLVVNFGKEANFIREQLQSVTLDETHKSFSMVCSKLQTSLLALGELALSQ--DKN 194

Query: 1137 EDLIQLSFAAIRTLNTVFCSMXXXXXXXXXXXIKRLLVDLLNQKHPVVSTEQLKEIEGII 1316
            + LIQL   A+RT+N+VF SM           + RLL +   Q   ++S+EQLKE++ +I
Sbjct: 195  DILIQLFMTALRTINSVFYSMNDHQKQQNTDILSRLLFNAKTQLPALLSSEQLKELDALI 254

Query: 1317 SSLDSFAVCSGSKGNDINKEMQVAEGFSKNIIDISTGNVNQEL----------LKIKSSD 1466
             S++   V S ++ ND    + V +         S+ N NQ+           + IKSS 
Sbjct: 255  LSINHSLVSSNTQDNDTVNGINVVQLLDMKDSHKSSENANQDFTSVNKYDLGDVSIKSSG 314

Query: 1467 QSEARTMLDNLKYGGANSKYRGLSLPLLDLHKDHDADSLPSPTRETTPCLPIDKGFGMGH 1646
              E     +++K G  NSK +GLS PLLDLHKDHD D+LPSPTR+  P  P  +     H
Sbjct: 315  LKEQSVSSESVKPGLDNSKAKGLSFPLLDLHKDHDEDTLPSPTRQIGPQFPATQT----H 370

Query: 1647 GVLKPEWPIPRVALDTNKVPMHPYETEAVKAVSTYQQKFGRSSFFMTDRLPSPTPSEEGE 1826
            G++K + PI   +LD     +HPYET+A+KAVS+YQQKFGRSS F+++ LPSPTPSEE +
Sbjct: 371  GMVKLDLPIFPASLDKGNSLLHPYETDALKAVSSYQQKFGRSSLFVSENLPSPTPSEEDD 430

Query: 1827 NVDADISAEVSSSPKRHAKPEITPM-VGQLGVSSFPNMNNLSVQGLNSIQNAAPSSYGLN 2003
            +   D   EV+S    H    +    +GQ  +SS P  N L  QGL + + A P S+  N
Sbjct: 431  SGKGDTGGEVTSFDVVHNASHLNESSMGQPILSSVPQTNILDGQGLGTTRTADPLSFLPN 490

Query: 2004 PLLKQSFAKSRDPRLRLVNSDATAGISALP----NETKEPLGGIISSKKQKIVEERVLDG 2171
            P L+ S AKSRDPRLRL  SD  A  + LP    +   E    +I SKKQK V+    D 
Sbjct: 491  PSLRSSTAKSRDPRLRLATSDTVAQNTILPIPDIDLKLEASLEMIVSKKQKTVDLSAFDA 550

Query: 2172 PALKRPKTELANSGFIHVGRAVPGNGGWLEDRVPVGFKVAARKPELGLVDPRMPGDVANS 2351
            P  KR ++E  +S  +   R   GNGGWLEDR      + +        D  +   +   
Sbjct: 551  PLPKRQRSEQTDSIIVSDVRPSIGNGGWLEDRGTAELPITSSNCATYNSDNDIR-KLEQV 609

Query: 2352 TSSNITMPNVSVGINDKLAIPGSTAS--MQSILTDLVVNPSILLNFLKG-QQMSANSTKS 2522
            T++  T+P+V V   +   + G + S  + S+L D+ +NPSI +N +K  QQ SA+++++
Sbjct: 610  TATIATIPSVIVNAAENFPVTGISTSTTLHSLLKDIAINPSIWMNIIKTEQQKSADASRT 669

Query: 2523 -TSQPTSSNSILGAVPSTNLATPKPPVLGQGSAGILHTPSRTAAL--------------- 2654
             T+Q +SS SILGAVPST    P+   +GQ S GIL TP+ TA+                
Sbjct: 670  NTAQASSSKSILGAVPSTVAVAPRSSAIGQRSVGILQTPTHTASAASSIYNLLMNDFIYS 729

Query: 2655 --------------------EESGTVRMKPRDPRRVLHSNGLQAGKSMEIDQSQTKTSTL 2774
                                +E   VRMKPRDPRRVLHS  +  G S+ +DQ   KT   
Sbjct: 730  VIFTASIAQFPFYFFLTFSRDEVAIVRMKPRDPRRVLHSTAVLKGGSVGLDQC--KTGVA 787

Query: 2775 SVPGVMGNLNGQRQDHQREQISVXXXXXXXXXXXXXXXXXXXXGPDISLQFKENLKNIAD 2954
                 + NL+ Q Q+ Q ++ S                      PDI+ QF +NLKNIAD
Sbjct: 788  GTHATISNLSFQSQEDQLDRKSAVTLSTTP--------------PDIACQFTKNLKNIAD 833

Query: 2955 ILTVSQASSAQSTLPQLPSLQTAQTPQGRIDAKGALEVGSLQTRSV----KEVSAVSSRS 3122
            +++VS ++S  S   Q  +L   Q  Q R + KGA+   S          ++ S  S + 
Sbjct: 834  MISVSPSTSP-SVASQTQTL-CIQAYQSRSEVKGAVSEPSEWVNDAGLASEKGSPGSLQP 891

Query: 3123 QNNWGDVEHLFDGFDDQQKAAXXXXXXXXXXXXXKMFAARKXXXXXXXXXXXXNSAKFAE 3302
            Q +WGDVEHLF+G+ DQQ+A              KMF+                   F E
Sbjct: 892  QISWGDVEHLFEGYSDQQRADIQRERTRRLEEQKKMFS-------------------FVE 932

Query: 3303 VDPVHDEILRKKEEQDREKPQRHLFRFPHMSMWTKLRPGIWNFLEKASKLYELHLYTMGN 3482
            +DPVH+EILRKKEEQDREKP RHLFRFPHM MWTKLRPGIWNFLEKAS L+ELHLYTMGN
Sbjct: 933  IDPVHEEILRKKEEQDREKPYRHLFRFPHMGMWTKLRPGIWNFLEKASNLFELHLYTMGN 992

Query: 3483 KLYATEMAKLLDPKGELFAGRVISRGDDNDLIDGDERVPKSKDLEGVLGMESAVVIIDDS 3662
            KLYATEMAKLLDPKG+LFAGRVISRGDD D  DGDERVPKSKDLEGVLGMESAVVIIDDS
Sbjct: 993  KLYATEMAKLLDPKGDLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDS 1052

Query: 3663 VRVWPHNKLNLIVVERYIYFPCSRRQFGLLGPSLLEIDHDERTEDGTLASSLAVIEKIHH 3842
            VRVWPHNKLNLIVVERYIYFPCSRRQFGL GPSLLEIDHDER EDGTLAS L VI++IH 
Sbjct: 1053 VRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASCLGVIQRIHQ 1112

Query: 3843 DFFAHQALDEADVRNILASEQHKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCT 4022
            +FF H+++DEADVRNILA+EQ KILAGCRIVFSRVFPVGEA+PHLHPLWQTAEQFGAVCT
Sbjct: 1113 NFFTHRSIDEADVRNILATEQKKILAGCRIVFSRVFPVGEASPHLHPLWQTAEQFGAVCT 1172

Query: 4023 NTIDDQVTHVVANSLGTDKVNWALNTGRFVVHPGWVEASALLYRRANEHDFAIK 4184
            + IDDQVTHVVANSLGTDKVNWAL+TGR VVHPGWVEASALLYRRANEHDFAIK
Sbjct: 1173 SQIDDQVTHVVANSLGTDKVNWALSTGRSVVHPGWVEASALLYRRANEHDFAIK 1226


>gb|EXB81217.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Morus
            notabilis]
          Length = 1301

 Score =  978 bits (2527), Expect = 0.0
 Identities = 600/1226 (48%), Positives = 745/1226 (60%), Gaps = 59/1226 (4%)
 Frame = +3

Query: 627  RFW-MRD-FLNYRISRSFNSGLHNLAWASGVQNKPITDFLVMEMP-------VTTAENNS 779
            R W MRD + NY   R + +GL+NLAWA  VQNKP+ +  VM++        V ++ + +
Sbjct: 62   RVWTMRDLYANYPGFRGYTTGLYNLAWAQAVQNKPLNEIFVMDVDADDSSRVVLSSASPA 121

Query: 780  STDSNRVGPDGLNKNNN-ERVVIEGD----DEGXXXXXXXXXXXXXXXXXXXXXXXVVDA 944
                 R G +G+ +    E+VVI+      +EG                         D 
Sbjct: 122  VNSGRREGKNGVKEVEKVEKVVIDDSADEMEEGELEEGEIDLESEPTQKPAGEEAKDGDL 181

Query: 945  DANDSNSFGI----RDAGLEKGLDSILKGLEGITLEYAEKSFVEACSKLQNLFDSLQEMA 1112
            +    N  G+    R   LEK +D I + L  + +  AEKSF E CS+LQ   +SL+ + 
Sbjct: 182  NCEAENVGGLEVDSRRDELEKRVDLIWETLGSVNVVNAEKSFEEVCSRLQRTLESLRGVL 241

Query: 1113 EEGWLN--KKEDLIQLSFAAIRTLNTVFCSMXXXXXXXXXXXIKRLLVDLLNQKHPVVST 1286
             E   +   K+ +IQ+S  AI+ +N+VFCSM           + RL   + N   P+ S 
Sbjct: 242  SEKEFSFPTKDVVIQMSITAIQVVNSVFCSMSVNQKEQKKETLSRLFCSVKNCGTPLFSP 301

Query: 1287 EQLKEIEGIISSLDSFAVCSGSKGNDINKEMQVAEGFSK---NIIDISTGNVNQELLKIK 1457
            EQ KEIE +ISSL+   V   S  +D  KE Q+ E   +   N+ + +  N + E   +K
Sbjct: 302  EQTKEIELMISSLNPLNVLPSSGASDKEKETQIIERLHEMDSNLTNANAENASIERTSVK 361

Query: 1458 -------SSDQSEARTMLDNLKYGGANSKYRGLSLPLLDLHKDHDADSLPSPTRETTPCL 1616
                   S   S   T+ + L+ G    K RGL LPLLDLHKDHDADSLPSPTRE   C 
Sbjct: 362  LPQDCVASVVHSNPITLPELLRPGTLAFKGRGLLLPLLDLHKDHDADSLPSPTREAPSCF 421

Query: 1617 PIDKGFGMGHGVLKPEWPIPRVALDTNKVPMHPYETEAVKAVSTYQQKFGRSSFFMTDRL 1796
            P+ K  G+  G++KP     +VA    +  +H YET+A+KAVSTYQQKFGR SF M+DRL
Sbjct: 422  PVYKPLGVADGIIKPVSTTAKVAPGAEESRLHRYETDALKAVSTYQQKFGRGSFLMSDRL 481

Query: 1797 PSPTPSEEGENVDADISAEVSSS-PKRHAKPEITPMVGQLGVSSFPNMNNLSVQGLNSIQ 1973
            PSPTPSEE +  D DI+ EVSSS    + +    P++    V+S   +++ ++QG  + +
Sbjct: 482  PSPTPSEECDEED-DINQEVSSSLTSGNLRTPAIPILRPSVVTSSVPVSSPTMQGPIAAK 540

Query: 1974 NAAPSSYGLNPLLKQSFAKSRDPRLRLVNSDATA------GISALPNETKEPLGGIISSK 2135
            NAAP   G N  +K S A+SRDPRLR  NSDA A       ++A+ N  K   G   SS+
Sbjct: 541  NAAPVGSGSNSTMKAS-ARSRDPRLRFANSDAGALDLNQRPLTAVHNGPKVEPGDPTSSR 599

Query: 2136 KQKIVEERVLDGPALKRPKTELANSGFIHVGRAVPGNGGWLEDRVPVGFKVAARKP--EL 2309
            KQ+IVEE  LDGPALKR +     S  I V +   G GGWLED    G ++  +    E 
Sbjct: 600  KQRIVEEPNLDGPALKRQRHAFV-SAKIDV-KTASGVGGWLEDNGTTGPQIMNKNQLVEN 657

Query: 2310 GLVDPRMPGDVANSTSSNITMPNVSVGINDKLAIPGSTA--SMQSILTDLVVNPSILLNF 2483
               DPR    + N    N   PN+     +++ + G++   ++ +IL D+ VNP+I ++ 
Sbjct: 658  AEADPRKSIHLVNGPIMN-NGPNIG---KEQVPVTGTSTPDALPAILKDIAVNPTIFMDI 713

Query: 2484 LK--GQQM--------SANSTKSTSQPTSSNSILGAVPSTNLATPKPPVLGQGSAGILHT 2633
            L   GQQ          ++S+K+T+ P  +NSILGA P  N+A  K   + Q  A  L T
Sbjct: 714  LNKLGQQQLLAADAQQKSDSSKNTTHPPGTNSILGAAPLVNVAPSKASGILQTPAVSLPT 773

Query: 2634 PSRTAAL---EESGTVRMKPRDPRRVLHSNGLQAGKSMEIDQSQTKTSTLS-VPGVMGNL 2801
             S+ A     +E G +RMKPRDPRRVLH N LQ   S+  +Q +   S++S  PG   NL
Sbjct: 774  TSQVATASMQDELGKIRMKPRDPRRVLHGNMLQKSWSLGHEQFKPIVSSVSCTPGNKDNL 833

Query: 2802 NGQRQDHQREQISVXXXXXXXXXXXXXXXXXXXXGPDISLQFKENLKNIADILTVSQASS 2981
            NG  Q+ Q ++  V                     PDI+ QF +NL+NIAD+++VSQAS+
Sbjct: 834  NGPVQEGQADKKQVPSQLVVQ--------------PDIARQFTKNLRNIADLMSVSQAST 879

Query: 2982 AQSTLPQLPSLQTAQTPQGRIDAKGALEVGSLQ---TRSVKEVS-AVSSRSQNNWGDVEH 3149
            + +T+ Q  S Q       R D K  +     Q   T S  E + AV SR+ N WGDVEH
Sbjct: 880  SPATVSQNLSSQPLPVKPDRGDVKAVVPNSEDQHSGTNSTPETTLAVPSRTPNAWGDVEH 939

Query: 3150 LFDGFDDQQKAAXXXXXXXXXXXXXKMFAARKXXXXXXXXXXXXNSAKFAEVDPVHDEIL 3329
            LF+G+DD+QKAA             KMF A K            NSAKF EVD VHDEIL
Sbjct: 940  LFEGYDDEQKAAIQRERARRLEEQKKMFDAHKLCLVLDLDHTLLNSAKFVEVDSVHDEIL 999

Query: 3330 RKKEEQDREKPQRHLFRFPHMSMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAK 3509
            RKKEEQDREKPQRHLFRFPHM MWTKLRPG+WNFLEKASKLYELHLYTMGNKLYATEMAK
Sbjct: 1000 RKKEEQDREKPQRHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAK 1059

Query: 3510 LLDPKGELFAGRVISRGDDNDLIDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKL 3689
            +LDP G LF+GRVISRGDD D  DGDERVPKSKDLEGVLGMES+VVIIDDSVRVWPHNKL
Sbjct: 1060 VLDPMGTLFSGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESSVVIIDDSVRVWPHNKL 1119

Query: 3690 NLIVVERYIYFPCSRRQFGLLGPSLLEIDHDERTEDGTLASSLAVIEKIHHDFFAHQALD 3869
            NLIVVERY YFPCSRRQFGL GPSLLEIDHDER E GTLASSLAVIEKIH +FF+H +LD
Sbjct: 1120 NLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEQGTLASSLAVIEKIHQNFFSHHSLD 1179

Query: 3870 EADVRNILASEQHKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNTIDDQVTH 4049
            E DVRNILASEQ KILAGCRIVFSRVFPV E NPHLHPLWQTAEQFGAVCT  IDDQVTH
Sbjct: 1180 EVDVRNILASEQRKILAGCRIVFSRVFPVSEVNPHLHPLWQTAEQFGAVCTTQIDDQVTH 1239

Query: 4050 VVANSLGTDKVNWALNTGRFVVHPGW 4127
            VVANS GTDKVNWAL  G+F VHPGW
Sbjct: 1240 VVANSPGTDKVNWALANGKFAVHPGW 1265


>ref|XP_002297869.2| CTD phosphatase-like protein 3 [Populus trichocarpa]
            gi|550347145|gb|EEE82674.2| CTD phosphatase-like protein
            3 [Populus trichocarpa]
          Length = 1190

 Score =  963 bits (2489), Expect = 0.0
 Identities = 585/1223 (47%), Positives = 725/1223 (59%), Gaps = 40/1223 (3%)
 Frame = +3

Query: 636  MRDFLNYRISRSFNSGLHNLAWASGVQNKPITDFLVMEMPVTTAENNSSTDSNRVGPDGL 815
            +RD   Y++   + SGL+NLAWA  VQNKP+ +       +T   ++S  + + V    +
Sbjct: 61   VRDLYKYQVGGGYMSGLYNLAWARAVQNKPLNE-------LTVVIDDSGDEMDVVKVIDI 113

Query: 816  NKNNNERVVIEGDDEGXXXXXXXXXXXXXXXXXXXXXXXVVDADANDSNSFGIRDAGLEK 995
             K   E  + EG+ +                         +D++     S G+    +E 
Sbjct: 114  EKEEGE--LEEGEID-------------------------LDSEPVVVQSEGMVSVDVEN 146

Query: 996  GLDSILKGLEGITLEYAEKSFVEACSKLQNLFDSLQEMA--EEGWLNKKEDLIQLSFAAI 1169
             + SI K LE +++   EKSF   C KL  + +SL+E+    +     K+ L+QL F AI
Sbjct: 147  RVKSIRKDLESVSVIETEKSFEAVCLKLHKVLESLKELVGGNDNSFPSKDGLVQLLFMAI 206

Query: 1170 RTLNTVFCSMXXXXXXXXXXXIKRLLVDLLNQKHPVVSTEQLKEIEGIISSLDSFAVCSG 1349
            R +N+VFCSM             R    L +   P  S  Q KE+     + DS A  +G
Sbjct: 207  RVVNSVFCSMNKKLKEQNKGVFSRFFSLLNSHYPPFFSPGQNKEVLNENHN-DSLAKTAG 265

Query: 1350 SKGNDINKEMQVAEGFSKNIIDISTGNVNQELLKIKSSDQSEARTMLDNLKYGGANSKYR 1529
                 +++++  AE F +N                K +   EA         G  + K R
Sbjct: 266  YDLTTMSEKLPAAETFVQN----------------KPNKSIEAPK-----PPGVPSFKSR 304

Query: 1530 GLSLPLLDLHKDHDADSLPSPTRETTPCLPIDKGFGMGHGVLKPEWPIPRVALDTNKVPM 1709
            G+ LPLLDL K HD DSLPSPT+ETTP  P+ +   +G G++    P+P+V     +  M
Sbjct: 305  GVLLPLLDLKKYHDEDSLPSPTQETTP-FPVQRLLAIGDGMVSSGLPVPKVTPVAEEPRM 363

Query: 1710 HPYETEAVKAVSTYQQKFGRSSFFMTDRLPSPTPSEEGENVDADISAEVSSSP------- 1868
            HPYET+A+KAVS+YQQKF R+SFF T+ LPSPTPSEE  N D D + EVSSS        
Sbjct: 364  HPYETDALKAVSSYQQKFNRNSFF-TNELPSPTPSEESGNGDGDTAGEVSSSSTVVNYRT 422

Query: 1869 -------KRHAKPEITPMVGQLGVSSFPNMNNLSVQGLNSIQNAAPSSYGLNPLLKQSFA 2027
                   +++A P   P+         P+ ++ +++G+   +N+AP S G +  +K S A
Sbjct: 423  VNPPVSDQKNAPPSPPPLPPPP-----PHPDSSNIRGVVPTRNSAPVSSGPSSTIKAS-A 476

Query: 2028 KSRDPRLRLVNSDATA---GISALPNETK----EPLGGIISSKKQKIVEERVLDGPALKR 2186
            KSRDPRLR VN DA A      ALP        EP G I+ SKK KI EE VLD P+LKR
Sbjct: 477  KSRDPRLRYVNIDACALDHNQRALPMVNNLPRVEPAGAIVGSKKHKI-EEDVLDDPSLKR 535

Query: 2187 PKTELANSGFIHVGRAVPGNGGWLEDRVPVGFKVAARKPELGLVDPRMPGDVAN-STSSN 2363
             +    N G +    ++ G GGWLED             E   V+     + +N + S N
Sbjct: 536  QRNSFDNYGAVRDIESMTGTGGWLED---------TDMAEPQTVNKNQWAENSNVNGSGN 586

Query: 2364 ITMPNVSV----GINDKLAIPGSTASMQSILTDLVVNPSILLNFLK----------GQQM 2501
               P + +    G         +T S+  +L D+ VNP++L+N LK          GQQ 
Sbjct: 587  AQSPFMGISNITGSEQAQVTSTATTSLPDLLKDIAVNPTMLINILKMGQQQRLALDGQQT 646

Query: 2502 SANSTKSTSQPTSSNSILGAVPSTNLATPKPPVLGQGSAGILHTPSRTAALEESGTVRMK 2681
             ++  KSTS P  SN++LGA+P+ N+A+ +P  +    AG    PS+ A  +ESG +RMK
Sbjct: 647  LSDPAKSTSHPPISNTVLGAIPTVNVASSQPSGIFPRPAGT-PVPSQIATSDESGKIRMK 705

Query: 2682 PRDPRRVLHSNGLQAGKSMEIDQSQTKTSTLSVPGVMGNLNGQRQDHQREQISVXXXXXX 2861
            PRDPRR LH+N LQ   SM  +Q +T T T +  G   + N Q+Q+   E          
Sbjct: 706  PRDPRRFLHNNSLQRAGSMGSEQFKTTTLTPTTQGTKDDQNVQKQEGLAE---------- 755

Query: 2862 XXXXXXXXXXXXXXGPDISLQFKENLKNIADILTVSQASSAQSTLPQLPSLQTAQTPQGR 3041
                           PDIS  F ++L+NIADIL+VSQAS+    + Q  + Q  QT   R
Sbjct: 756  ---------LKPTVPPDISFPFTKSLENIADILSVSQASTTPPFISQNVASQPMQTKSER 806

Query: 3042 IDAKGALEVGSLQT--RSVKEVSAVSSRSQNNWGDVEHLFDGFDDQQKAAXXXXXXXXXX 3215
            +D K  + +   +T   S  EV A SS SQN W DVEHLF+G+DDQQKAA          
Sbjct: 807  VDGKTGISISDQKTGPASSPEVVAASSHSQNTWKDVEHLFEGYDDQQKAAIQRERARRLE 866

Query: 3216 XXXKMFAARKXXXXXXXXXXXXNSAKFAEVDPVHDEILRKKEEQDREKPQRHLFRFPHMS 3395
               KMFAARK            NSAK      +HDEILRKKEEQDREKP RH+FR PHM 
Sbjct: 867  EQKKMFAARKLCLVLDLDHTLLNSAKAILSSSLHDEILRKKEEQDREKPYRHIFRIPHMG 926

Query: 3396 MWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGELFAGRVISRGDDNDL 3575
            MWTKLRPGIWNFLEKASKL+ELHLYTMGNKLYATEMAK+LDPKG LFAGRVISRGDD D 
Sbjct: 927  MWTKLRPGIWNFLEKASKLFELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 986

Query: 3576 IDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLLG 3755
             DGDERVPKSKDLEGVLGMES VVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGL G
Sbjct: 987  FDGDERVPKSKDLEGVLGMESGVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPG 1046

Query: 3756 PSLLEIDHDERTEDGTLASSLAVIEKIHHDFFAHQALDEADVRNILASEQHKILAGCRIV 3935
            PSLLEIDHDER EDGTLA S AVIEKIH +FF H++LDEADVRNILASEQ KIL GCRI+
Sbjct: 1047 PSLLEIDHDERPEDGTLACSFAVIEKIHQNFFTHRSLDEADVRNILASEQRKILGGCRIL 1106

Query: 3936 FSRVFPVGEANPHLHPLWQTAEQFGAVCTNTIDDQVTHVVANSLGTDKVNWALNTGRFVV 4115
            FSRVFPVGE NPHLHPLWQ AEQFGAVCTN ID+QVTHVVANSLGTDKVNWAL+TGR VV
Sbjct: 1107 FSRVFPVGEVNPHLHPLWQMAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRIVV 1166

Query: 4116 HPGWVEASALLYRRANEHDFAIK 4184
            HPGWVEASALLYRRANE DF+IK
Sbjct: 1167 HPGWVEASALLYRRANEQDFSIK 1189


>ref|XP_006603006.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Glycine max]
          Length = 1257

 Score =  949 bits (2453), Expect = 0.0
 Identities = 581/1221 (47%), Positives = 726/1221 (59%), Gaps = 46/1221 (3%)
 Frame = +3

Query: 660  ISRSFNSGLHNLAWASGVQNKPITDFLVMEMPVTTAENNSSTDSNRVGPDGLNKNNNERV 839
            I R + SGL+NLAWA  VQNKP+ D  VME+      N++S +SNR+    +N    + V
Sbjct: 78   ICRGYASGLYNLAWAQAVQNKPLNDIFVMEVDSDANANSNSNNSNRLASVAVNPK--DVV 135

Query: 840  VIEGDDEGXXXXXXXXXXXXXXXXXXXXXXXV-----------VDADANDSNSFGIRDAG 986
            V++ D E                        V           V  D ++S   G+R   
Sbjct: 136  VVDVDKEEGELEEGEIDADAEPEGEAESVVAVPVVSDSEKLDDVKRDVSNSEQLGVRGV- 194

Query: 987  LEKGLDSILKGLEGITLEYAEKSFVEACSKLQNLFDSLQEMAEEGWLNKKEDLIQLSFAA 1166
                       LEG+T+    +SF + CSKLQN   +L E+      ++++DL++LSF A
Sbjct: 195  -----------LEGVTVANVAESFAQTCSKLQN---ALPEVLSRPADSERDDLVRLSFNA 240

Query: 1167 IRTLNTVFCSMXXXXXXXXXXXIKRLLVDLLNQKHP-VVSTEQLKEIEGIISSLDSFAVC 1343
               + +VFCSM           I RLL  + +Q+   + S E +KEI+G+++++D F   
Sbjct: 241  TEVVYSVFCSMDSLKKEQNKDSILRLLSFVKDQQQAQLFSPEHIKEIQGMMTAIDYFGAL 300

Query: 1344 SGSKGNDINKEMQVAEGFSKNIIDISTGNVNQELLKIKSSDQSEARTMLDNLKYGGANSK 1523
              S+     KE+Q      +     +      EL+       S+       LK+G  + K
Sbjct: 301  VNSEAIGKEKELQTTVQTHEIKTQENQAVEAAELISYNKPLHSDIIGASHALKFGQNSIK 360

Query: 1524 YRGLSLPLLDLHKDHDADSLPSPTRETTPCLPIDKGFGMGHGVL-------KPEWPIPRV 1682
             RG+ LPLLDLHKDHDADSLPSPTRE   C P++K   +G  ++       KPE    ++
Sbjct: 361  GRGVLLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGEPMVSSGSAAAKPE--SGKM 418

Query: 1683 ALDTNKVPMHPYETEAVKAVSTYQQKFGRSSFFMTDRLPSPTPSEEGENVDADISAEVSS 1862
             LD+     H YET+A+KAVSTYQQKFGRSS F  D+ PSPTPS + E+   D + EVSS
Sbjct: 419  ELDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPSGDCEDEIVDTNEEVSS 478

Query: 1863 SPKRHAKPEITPMVGQLGVSSFPNMNNLSVQGLNS--IQNAAPSSYGLNPLLKQSFAKSR 2036
            +          P +  L   S  + +  S+ G  S  +  A P S     L  +S AK+R
Sbjct: 479  ASTGDFLTSTKPTLLDLPPVSATSTDRSSLHGFISSRVDAAGPGS-----LPVKSSAKNR 533

Query: 2037 DPRLRLVNSDATA---GISALPNETKEPLGGIISSKKQKIVEERVLDGPALKRPKTELAN 2207
            DPRLR VNSDA+A     + + N  K    G   S+KQK  EE  LD    KR K+ L N
Sbjct: 534  DPRLRFVNSDASAVDNPSTLIHNMPKVEYAGTTISRKQKAAEEPSLDVTVSKRQKSPLEN 593

Query: 2208 SGFIHVGRAVPGNGGWLEDRVPVGFKVAARKPELGLVDPRMPGDVANSTSSNITMP---N 2378
            +   ++     G GGWLE+    G +   R   +    P  P    N+ SS+ T     N
Sbjct: 594  TEH-NMSEVRTGIGGWLEEHTGPGAQFIERNHLMDKFGPE-PQKTLNTVSSSCTGSDNFN 651

Query: 2379 VSVGINDKLAIPGST--ASMQSILTDLVVNPSILLNFLK---GQQMSANS-TKSTSQPTS 2540
             +   N++  I  S   AS+ ++L    VNP++L+N L+    Q+ SA+S T     PTS
Sbjct: 652  ATSIRNEQAPITSSNVLASLPALLKGAAVNPTMLVNLLRIAEAQKKSADSATNMLLHPTS 711

Query: 2541 SNSILGAVPSTNLATPKPPVLGQGSAGILHTPSRTAAL-----EESGTVRMKPRDPRRVL 2705
            SNS +G   + ++ +     L Q S G+L   S++ ++     ++SG +RMKPRDPRR+L
Sbjct: 712  SNSAMGTDSTASIGSSMATGLLQSSVGMLPVSSQSTSMTQTLQDDSGKIRMKPRDPRRIL 771

Query: 2706 HSNG-LQAGKSMEIDQSQTKTSTLSV-PGVMGNLNGQRQDHQREQISVXXXXXXXXXXXX 2879
            H+N  +Q   ++  +Q +   S +S   G   N+N Q+ + + +   V            
Sbjct: 772  HTNNTIQKSGNLGNEQFKAIVSPVSNNQGTGDNVNAQKLEGRVDSKLVPTQPSAQ----- 826

Query: 2880 XXXXXXXXGPDISLQFKENLKNIADILTVSQASSAQSTLPQLPSLQTAQTPQGRIDAKGA 3059
                     PDI+ QF  NLKNIADI++VSQ SS  + + Q+ S  +      R + K  
Sbjct: 827  ---------PDIARQFARNLKNIADIMSVSQESSTHTPVAQIFSSASVPLTSDRGEQKSV 877

Query: 3060 ------LEVGSLQTRSVKEVSAVSSRSQNNWGDVEHLFDGFDDQQKAAXXXXXXXXXXXX 3221
                  LE G +        ++ + RSQN WGDVEHLF+G+D+QQKAA            
Sbjct: 878  VSNSQNLEAGMVSAHET--AASGTCRSQNTWGDVEHLFEGYDEQQKAAIQRERARRIEEQ 935

Query: 3222 XKMFAARKXXXXXXXXXXXXNSAKFAEVDPVHDEILRKKEEQDREKPQRHLFRFPHMSMW 3401
             KMFAARK            NSAKF EVDPVHDEILRKKEEQDREKP RHLFRFPHM MW
Sbjct: 936  NKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 995

Query: 3402 TKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGELFAGRVISRGDDNDLID 3581
            TKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAK+LDPKG LFAGRVISRGDD D +D
Sbjct: 996  TKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGLLFAGRVISRGDDTDSVD 1055

Query: 3582 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLLGPS 3761
            G+ER PKSKDLEGVLGMES+VVIIDDSVRVWPHNKLNLIVVERY YFPCSRRQFGL GPS
Sbjct: 1056 GEERAPKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPS 1115

Query: 3762 LLEIDHDERTEDGTLASSLAVIEKIHHDFFAHQALDEADVRNILASEQHKILAGCRIVFS 3941
            LLEIDHDER E GTLASSLAVIEKIH  FFA ++L+E DVRNILASEQ KILAGCRIVFS
Sbjct: 1116 LLEIDHDERPEAGTLASSLAVIEKIHQIFFASRSLEEVDVRNILASEQRKILAGCRIVFS 1175

Query: 3942 RVFPVGEANPHLHPLWQTAEQFGAVCTNTIDDQVTHVVANSLGTDKVNWALNTGRFVVHP 4121
            RVFPVGEANPHLHPLWQTAEQFGA CTN ID+QVTHVVANS GTDKVNWALN GRFVVHP
Sbjct: 1176 RVFPVGEANPHLHPLWQTAEQFGAFCTNQIDEQVTHVVANSPGTDKVNWALNNGRFVVHP 1235

Query: 4122 GWVEASALLYRRANEHDFAIK 4184
            GWVEASALLYRRANE DFAIK
Sbjct: 1236 GWVEASALLYRRANEQDFAIK 1256


>ref|XP_003530482.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Glycine max]
          Length = 1261

 Score =  949 bits (2452), Expect = 0.0
 Identities = 580/1212 (47%), Positives = 727/1212 (59%), Gaps = 37/1212 (3%)
 Frame = +3

Query: 660  ISRSFNSGLHNLAWASGVQNKPITDFLVMEMPVTTAENNSSTDSNRVGPDGLNKNNNERV 839
            I R + SGL+NLAWA  VQNKP+ D  VME+      N++   S+R+    +N    + V
Sbjct: 78   ICRGYASGLYNLAWAQAVQNKPLNDIFVMEVDSDANANSNRNSSHRLASVAVNPK--DVV 135

Query: 840  VIEGD-DEGXXXXXXXXXXXXXXXXXXXXXXXVVDADANDSNSFGIRDAGLEKGLDSILK 1016
            V++ D +EG                       V D++  D     + D+  + G   +L 
Sbjct: 136  VVDVDKEEGELEEGEIDADAEPEGEAESVVVAVSDSEKLDDVKMDVSDSE-QLGARGVL- 193

Query: 1017 GLEGITLEYAEKSFVEACSKLQNLFDSLQEMAEEGWLNKKEDLIQLSFAAIRTLNTVFCS 1196
              EG+T+    +SF + CSKLQN   +L E+      ++K+DL++LSF A   + +VFCS
Sbjct: 194  --EGVTVANVVESFAQTCSKLQN---TLPEVLSRPAGSEKDDLVRLSFNATEVVYSVFCS 248

Query: 1197 MXXXXXXXXXXXIKRLLVDLLNQKHP-VVSTEQLKEIEGIISSLDSFAVCSGSKGNDINK 1373
            M           I RLL  + +Q+   + S E +KEI+G+++++DS      S+     K
Sbjct: 249  MDSSEKEQNKDSILRLLSFVKDQQQAQLFSPEHVKEIQGMMTAIDSVGALVNSEAIGKEK 308

Query: 1374 EMQVAE-------GFSKNIIDISTGNVNQ-----ELLKIKSSDQSEARTMLDNLKYGGAN 1517
            E+Q  E            I +I T   NQ     EL+        +       LK+G  +
Sbjct: 309  ELQTTEIKTQENSAVEVQIHEIKTQE-NQAVEAAELISYSKPLHRDITGTSQALKFGQNS 367

Query: 1518 SKYRGLSLPLLDLHKDHDADSLPSPTRETTPCLPIDKGFGMGHGVLKPEWPIPRVALDTN 1697
             K RG+ LPLLDLHKDHDADSLPSPTRE   C P++K   +G  +++      ++ LD+ 
Sbjct: 368  IKGRGVLLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGESMVRSGSASAKMELDSE 427

Query: 1698 KVPMHPYETEAVKAVSTYQQKFGRSSFFMTDRLPSPTPSEEGENVDADISAEVSSSPKRH 1877
                H YET+A+KAVSTYQQKFGRSS F  D+ PSPTPS + E+   D + EVSS+    
Sbjct: 428  GSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPSGDCEDEVVDTNEEVSSASTGD 487

Query: 1878 AKPEITPMVGQLGVSSFPNMNNLSVQGLNS--IQNAAPSSYGLNPLLKQSFAKSRDPRLR 2051
                  P +      S  +M+  S+ G  S  +    P S+ +     +S AK+RDPRLR
Sbjct: 488  FLTSTKPTLLDQPPVSATSMDRSSMHGFISSRVDATGPGSFPV-----KSSAKNRDPRLR 542

Query: 2052 LVNSDATA--GISALPNE-TKEPLGGIISSKKQKIVEERVLDGPALKRPKTELANSGFIH 2222
             +NSDA+A   +S L N  +K    G   S+KQK  EE  LD    KR K+ L N+   +
Sbjct: 543  FINSDASAVDNLSTLINNMSKVEYSGTTISRKQKAAEEPSLDVTVSKRLKSSLENTEH-N 601

Query: 2223 VGRAVPGNGGWLEDRVPVGFKVAARKPELGLVDPRMPGDVANSTSSNITMP---NVSVGI 2393
            +     G+GGWLE+    G ++  R   +    P     + N+ SS+ T     N +   
Sbjct: 602  MSEVRTGSGGWLEENTGPGAQLIERNHLMDKFGPEAKKTL-NTVSSSCTGSDNFNATSIR 660

Query: 2394 NDKLAIPGST--ASMQSILTDLVVNPSILLNFLKGQQMSANSTKSTS----QPTSSNSIL 2555
            N++  I  S   AS+ ++L +  VNP +L+N L+  +    S  S +     PTSSN  +
Sbjct: 661  NEQAPITASNVLASLPALLKEASVNPIMLVNILRLAEAQKKSADSAAIMLLHPTSSNPAM 720

Query: 2556 GAVPSTNLATPKPPVLGQGSAGILHTPSRTAAL-----EESGTVRMKPRDPRRVLHSNGL 2720
            G   + ++ +     L Q S G+L   S++ +      ++SG +RMKPRDPRR+LH+N  
Sbjct: 721  GTDSTASIGSSMATGLLQSSVGMLPVSSQSTSTAQTLQDDSGKIRMKPRDPRRILHTNNT 780

Query: 2721 QAGKSMEIDQSQTKTSTLSVPGVMGNLNGQRQDHQREQISVXXXXXXXXXXXXXXXXXXX 2900
               KS ++   Q K     V            ++QR   +V                   
Sbjct: 781  -IQKSGDLGNEQFKAIVSPV-----------SNNQRTGDNVNAPKLEGRVDNKLVPTQSS 828

Query: 2901 XGPDISLQFKENLKNIADILTVSQASSAQSTLPQLPSLQTAQTPQGRIDAKGALEVG-SL 3077
              PDI+ QF  NLKNIADI++VSQ SS  + + Q  S  +      R + K  +    +L
Sbjct: 829  AQPDIARQFTRNLKNIADIMSVSQESSTHTPVSQNFSSASVPLTSDRGEQKSVVSSSQNL 888

Query: 3078 QT--RSVKEVSA-VSSRSQNNWGDVEHLFDGFDDQQKAAXXXXXXXXXXXXXKMFAARKX 3248
            Q    S  E +A V+SRSQ+ WGDVEHLF+G+D+QQKAA             KMFAARK 
Sbjct: 889  QADMASAHETAASVTSRSQSTWGDVEHLFEGYDEQQKAAIQRERARRIEEQNKMFAARKL 948

Query: 3249 XXXXXXXXXXXNSAKFAEVDPVHDEILRKKEEQDREKPQRHLFRFPHMSMWTKLRPGIWN 3428
                       NSAKF EVDP+HDEILRKKEEQDREKP RHLFRFPHM MWTKLRPGIWN
Sbjct: 949  CLVLDLDHTLLNSAKFVEVDPLHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWN 1008

Query: 3429 FLEKASKLYELHLYTMGNKLYATEMAKLLDPKGELFAGRVISRGDDNDLIDGDERVPKSK 3608
            FLEKASKLYELHLYTMGNKLYATEMAK+LDPKG LFAGRVISRGDD D +DG+ERVPKSK
Sbjct: 1009 FLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDTDSVDGEERVPKSK 1068

Query: 3609 DLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLLGPSLLEIDHDER 3788
            DLEGVLGMES+VVIIDDSVRVWPHNKLNLIVVERY YFPCSRRQFGL GPSLLEIDHDER
Sbjct: 1069 DLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDER 1128

Query: 3789 TEDGTLASSLAVIEKIHHDFFAHQALDEADVRNILASEQHKILAGCRIVFSRVFPVGEAN 3968
             E GTLASSLAVIEKIH  FFA Q+L+E DVRNILASEQ KILAGCRIVFSRVFPVGEAN
Sbjct: 1129 PEAGTLASSLAVIEKIHQIFFASQSLEEVDVRNILASEQRKILAGCRIVFSRVFPVGEAN 1188

Query: 3969 PHLHPLWQTAEQFGAVCTNTIDDQVTHVVANSLGTDKVNWALNTGRFVVHPGWVEASALL 4148
            PHLHPLWQTAEQFGAVCTN ID+QVTHVVANS GTDKVNWALN GRFVVHPGWVEASALL
Sbjct: 1189 PHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSPGTDKVNWALNNGRFVVHPGWVEASALL 1248

Query: 4149 YRRANEHDFAIK 4184
            YRRANE DFAIK
Sbjct: 1249 YRRANEQDFAIK 1260


>ref|XP_002304714.2| hypothetical protein POPTR_0003s16280g [Populus trichocarpa]
            gi|550343307|gb|EEE79693.2| hypothetical protein
            POPTR_0003s16280g [Populus trichocarpa]
          Length = 1030

 Score =  949 bits (2452), Expect = 0.0
 Identities = 547/1036 (52%), Positives = 663/1036 (63%), Gaps = 57/1036 (5%)
 Frame = +3

Query: 1248 VDLLNQKHP-VVSTEQLKEIEGIISSLDSFAVCSGSKGNDINKEMQVAEGFSKNIIDIST 1424
            + L+N   P   S E  KEIE ++SSLDS  + S S+  +  +E QV+   ++   D  +
Sbjct: 17   LSLVNSHDPSFFSPEHTKEIELMVSSLDSHDILSSSRAGE-ERETQVSGKVNERDNDSLS 75

Query: 1425 GNVNQELLKI-------KSSDQSEARTMLDNLKYGGANSKYRGLSLPLLDLHKDHDADSL 1583
                 +L  +       +S   ++    ++  K G  + K RG+ LPLLDL K HD DSL
Sbjct: 76   KTAGYDLTTMNRLPSAAESFVHNKPNFSIEPPKPGVPSFKSRGVLLPLLDLKKFHDEDSL 135

Query: 1584 PSPTRETTPCLPIDKGFGMGHGVLKPEWPIPRVALDTNKVPMHPYETEAVKAVSTYQQKF 1763
            PSPTRET P  P+ +   +G G++    P+P+VA  T +  +HPYET+A+KAVS+YQ+KF
Sbjct: 136  PSPTRETAPSFPVQRLLPIGDGMISSGLPVPKVASITEEPRVHPYETDALKAVSSYQKKF 195

Query: 1764 GRSSFFMTDRLPSPTPSEEGENVDADISAEVSSSPKRHAKPEITPMVGQLGVSSFPN--- 1934
              +SFF T+ LPSPTPSEE  N D D + EVSSS   + +    P+  +   S  P+   
Sbjct: 196  NLNSFF-TNELPSPTPSEESGNGDGDTAGEVSSSSTVNYRTVNPPVSDRKSASPSPSPPP 254

Query: 1935 -----------MNNLSVQGLNSIQNAAPSSYGLNPLLKQSFAKSRDPRLRLVNSDATA-- 2075
                       +NN S++ +   +N+AP S G +  +K S AKSRDPRLR VN+DA+A  
Sbjct: 255  PPPPPPPPPPHLNNSSIRVVIPTRNSAPVSSGTSSTVKAS-AKSRDPRLRYVNTDASALD 313

Query: 2076 ----GISALPNETK-EPLGGIISSKKQKIVEERVLDGPALKRPKTELANSGFIHVGRAVP 2240
                 +  + N  + EP G I  S+KQKI EE VLDG +LKR +    N G +   R++ 
Sbjct: 314  QNQRTLLMVNNPPRAEPSGAIAGSRKQKI-EEDVLDGTSLKRQRNSFDNFGVVRDIRSMT 372

Query: 2241 GNGGWLEDRVPVGFKVA-----------ARKPELGLVDPRMPGDVAN-STSSNITMP--- 2375
            G GGWLED      +              ++   G+V P     +++ S S N+ +P   
Sbjct: 373  GTGGWLEDTDMAEPQTVNKNQWAENAEPGQRINNGVVCPSTGSVMSSVSCSGNVQVPVMG 432

Query: 2376 -NVSVGINDKLAIPGSTASMQSILTDLVVNPSILLNFLK----------GQQMSANSTKS 2522
             N   G         +TAS+  +L D+ VNP++L+N LK          GQQ  A+  KS
Sbjct: 433  INTIAGSEQAPVTSTTTASLPDLLKDITVNPTMLINILKMGQQQRLALDGQQKLADPAKS 492

Query: 2523 TSQPTSSNSILGAVPSTNLATPKPPVLGQGSAGILHTPSRTAALEESGTVRMKPRDPRRV 2702
            TS P SSN++LGA+P  N  +  P  +   SAG    PS+ A  +ESG +RMKPRDPRRV
Sbjct: 493  TSHPPSSNTVLGAIPEVNAVSSLPSGILPRSAGKAQGPSQIATTDESGKIRMKPRDPRRV 552

Query: 2703 LHSNGLQAGKSMEIDQSQTKTSTLSVPGVMGNLNGQRQDHQREQISVXXXXXXXXXXXXX 2882
            LH+N LQ   S+  +Q +T T T +  G   N N Q+Q+   E   V             
Sbjct: 553  LHNNALQRAGSLGSEQFKTTTLTSTTQGTKDNQNLQKQEGLAELKPVVP----------- 601

Query: 2883 XXXXXXXGPDISLQFKENLKNIADILTVSQASSAQSTLPQLPSLQTAQTPQGRIDAKGAL 3062
                    PDIS  F ++LKNIADI++VSQ  +    + Q  + Q  Q    R+D K  +
Sbjct: 602  --------PDISSPFTKSLKNIADIVSVSQTCTTPPFVSQNVASQPVQIKSDRVDGKTGI 653

Query: 3063 EVGS--LQTRSVKEVSAVSSRSQNNWGDVEHLFDGFDDQQKAAXXXXXXXXXXXXXKMFA 3236
                  +   S  EV A SS SQN W DVEHLF+G+DDQQKAA             K+FA
Sbjct: 654  SNSDQKMGPASSPEVVAASSLSQNTWEDVEHLFEGYDDQQKAAIQRERARRIEEQKKLFA 713

Query: 3237 ARKXXXXXXXXXXXXNSAKFAEVDPVHDEILRKKEEQDREKPQRHLFRFPHMSMWTKLRP 3416
            ARK            NSAKF EVDPVHDEILRKKEEQDREKP RHLFRFPHM MWTKLRP
Sbjct: 714  ARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPYRHLFRFPHMGMWTKLRP 773

Query: 3417 GIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGELFAGRVISRGDDNDLIDGDERV 3596
            GIWNFLEKASKLYELHLYTMGNKLYATEMAK+LDPKG LFAGRV+SRGDD DL+DGDERV
Sbjct: 774  GIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVVSRGDDGDLLDGDERV 833

Query: 3597 PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLLGPSLLEID 3776
            PKSKDLEGVLGMES VVIIDDS+RVWPHNKLNLIVVERYIYFPCSRRQFGL GPSLLEID
Sbjct: 834  PKSKDLEGVLGMESGVVIIDDSLRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEID 893

Query: 3777 HDERTEDGTLASSLAVIEKIHHDFFAHQALDEADVRNILASEQHKILAGCRIVFSRVFPV 3956
            HDER EDGTLA SLAVIE+IH +FF H +LDEADVRNILASEQ KILAGCRIVFSRVFPV
Sbjct: 894  HDERPEDGTLACSLAVIERIHQNFFTHHSLDEADVRNILASEQRKILAGCRIVFSRVFPV 953

Query: 3957 GEANPHLHPLWQTAEQFGAVCTNTIDDQVTHVVANSLGTDKVNWALNTGRFVVHPGWVEA 4136
            GE NPHLHPLWQ+AEQFGAVCTN ID+QVTHVVANSLGTDKVNWAL+TGRFVVHPGWVEA
Sbjct: 954  GEVNPHLHPLWQSAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEA 1013

Query: 4137 SALLYRRANEHDFAIK 4184
            SALLYRRANE DFAIK
Sbjct: 1014 SALLYRRANEQDFAIK 1029


>ref|XP_004140651.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Cucumis sativus]
          Length = 1249

 Score =  941 bits (2433), Expect = 0.0
 Identities = 594/1247 (47%), Positives = 734/1247 (58%), Gaps = 61/1247 (4%)
 Frame = +3

Query: 627  RFW-MRD-FLNYRISR-SFNSGLHNLAWASGVQNKPITDFLVMEMPV-TTAENNSSTDSN 794
            R W M D + NY   R  + SGL+NLAWA  VQNKP+ D  VME  +   ++++SST   
Sbjct: 54   RVWTMSDLYKNYPAMRHGYASGLYNLAWAQAVQNKPLNDIFVMEADLDEKSKHSSSTPFG 113

Query: 795  RVGPDGLNKNNNE-RVVIE--GD-----------DEGXXXXXXXXXXXXXXXXXXXXXXX 932
                DG N    E RVVI+  GD           +EG                       
Sbjct: 114  NAKDDGSNTTKEEDRVVIDDSGDEMNCDNANGEKEEGELEEGEIDMDTEFVEEVADSKAM 173

Query: 933  VVDADANDSN--SFGIRDAGLEKGLDSILKGLEGITLEYAEKSFVEACSKLQNLFDSLQE 1106
            + D+   D N   F +    L++ L  I K L+G+T++ A+KSF E CS++ +  ++  E
Sbjct: 174  LSDSRDMDINGQEFDLETKELDELLKFIQKTLDGVTIDAAQKSFQEVCSQIHSSIETFVE 233

Query: 1107 MAEEGWLNKKEDLIQLSFAAIRTLNTVFCSMXXXXXXXXXXXIKRLLVDLLNQKHPVVST 1286
            + +   + +K+ LIQ  +AA+R +N+VFCSM           + RLL  + N   P+ S 
Sbjct: 234  LLQGKVVPRKDALIQRLYAALRLINSVFCSMNLSEKEEHKEHLSRLLSYVKNCDPPLFSP 293

Query: 1287 EQLKEIEGIISSLDSFAVCSGSKGNDINKEMQVAEG---------FSKNIIDISTGN-VN 1436
            EQ+K +E  + S DS       +G+    E+ +  G         ++     ++  N + 
Sbjct: 294  EQIKSVEVKMPSTDSLDHLPSMRGSAKEVEIHIPNGVKDMDFYSAYTSTSSQLTPSNKLA 353

Query: 1437 QELLKIKSSDQSEARTMLDNLKYGGANSKYRGLSLPLLDLHKDHDADSLPSPTRETTPCL 1616
             + +      ++    + + L+ G ++ K RG  LPLLDLHKDHDADSLPSPTRE     
Sbjct: 354  SDSIPFGVKGKNNLNILSEGLQSGVSSIKGRGPLLPLLDLHKDHDADSLPSPTREAPTIF 413

Query: 1617 PIDKGFGMGHGVLKPEWPIPRVALDTNKVPMHPYETEAVKAVSTYQQKFGRSSFFMTDRL 1796
             + K    G+   K  +P+     D ++   HPYET+A+KAVSTYQQKFGRSSF M DRL
Sbjct: 414  SVQKS---GNAPTKMAFPV-----DGSR--SHPYETDALKAVSTYQQKFGRSSFSMADRL 463

Query: 1797 PSPTPSEEGENVDADISAEVSSSP-KRHAKPEITPMVGQLGVSS-------FPNMNNLSV 1952
            PSPTPSEE +    DI  EVSSS   R  K       GQ   S+       FPNM++ S 
Sbjct: 464  PSPTPSEEHDG-GGDIGGEVSSSSIIRSLKSSNVSKPGQKSNSASNVSTGLFPNMDSSST 522

Query: 1953 QGLNSIQNAAPSSYGLNPLLKQSFAKSRDPRLRLVNSDATA------GISALPNETKEPL 2114
            + L S  N AP S   NP +K   AKSRDPRLR+VNSDA+        ++++ + +    
Sbjct: 523  RVLISPLNVAPPSSVSNPTVK-PLAKSRDPRLRIVNSDASGMDLNPRTMASVQSSSILES 581

Query: 2115 GGIISSKKQKIVEERVLDGPALKRPKTELANSGFIHVG-RAVPGNGGWLEDRVPVGFKVA 2291
               +  +KQK+  E   DGP +KR +    N        RAV G+GGWLED +P G ++ 
Sbjct: 582  AATLHLRKQKMDGEPNTDGPEVKRLRIGSQNLAVAASDVRAVSGSGGWLEDTMPAGPRLF 641

Query: 2292 AR-KPELGLVDPRMPGDVA-NSTSSNITMPNVSVGINDKLAIPGSTASMQSILTDLVVNP 2465
             R + E+   +     +V  NS S N   P V+   ND        AS+ S+L D+VVNP
Sbjct: 642  NRNQMEIAEANATEKSNVTNNSGSGNECTPTVN-NSND--------ASLPSLLKDIVVNP 692

Query: 2466 SILLNFLKGQQM----------SANSTKSTSQPTSSNSILGAVPSTNLATPKPPVLGQGS 2615
            ++LLN LK  Q           S+   K+   PTS N   G+ P  N       +L Q S
Sbjct: 693  TMLLNLLKMSQQQQLAAELKLKSSEPEKNAICPTSLNPCQGSSPLINAPVATSGIL-QQS 751

Query: 2616 AGILHTPSRTAAL---EESGTVRMKPRDPRRVLHSNGLQAGKSMEIDQSQTKTSTLS-VP 2783
            AG   TPS +  +   ++ G VRMKPRDPRRVLH N LQ   S+  DQ +    T S   
Sbjct: 752  AG---TPSASPVVGRQDDLGKVRMKPRDPRRVLHGNSLQKVGSLGNDQLKGVVPTASNTE 808

Query: 2784 GVMGNLNGQRQDHQREQISVXXXXXXXXXXXXXXXXXXXXGPDISLQFKENLKNIADILT 2963
            G     NG +Q+ Q +                         PDI  QF  NLKNIADI++
Sbjct: 809  GSRDIPNGHKQEGQGDSKLASSQTIL---------------PDIGRQFTNNLKNIADIMS 853

Query: 2964 VSQASSAQSTLPQLPSLQTAQTPQGRIDAKGALEVGSLQTRSVKEVSAVSSRSQNNWGDV 3143
            V          P   S  ++  P G      +++   + T       A SSRSQ  WGD+
Sbjct: 854  VPS--------PPTSSPNSSSKPVG----SSSMDSKPVTTAFQAVDMAASSRSQGAWGDL 901

Query: 3144 EHLFDGFDDQQKAAXXXXXXXXXXXXXKMFAARKXXXXXXXXXXXXNSAKFAEVDPVHDE 3323
            EHLFD +DD+QKAA             KMFAARK            NSAKF EVDPVHDE
Sbjct: 902  EHLFDSYDDKQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDE 961

Query: 3324 ILRKKEEQDREKPQRHLFRFPHMSMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEM 3503
            ILRKKEEQDREK QRHLFRFPHM MWTKLRPG+WNFLEKAS+LYELHLYTMGNKLYATEM
Sbjct: 962  ILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHLYTMGNKLYATEM 1021

Query: 3504 AKLLDPKGELFAGRVISRGDDNDLIDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHN 3683
            AK+LDPKG LFAGRVISRGDD D +DGD+RVPKSKDLEGVLGMES VVIIDDS+RVWPHN
Sbjct: 1022 AKVLDPKGVLFAGRVISRGDDGDPLDGDDRVPKSKDLEGVLGMESGVVIIDDSIRVWPHN 1081

Query: 3684 KLNLIVVERYIYFPCSRRQFGLLGPSLLEIDHDERTEDGTLASSLAVIEKIHHDFFAHQA 3863
            K+NLIVVERY YFPCSRRQFGLLGPSLLEIDHDER EDGTLASSL VI++IH  FF++  
Sbjct: 1082 KMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLGVIQRIHQSFFSNPE 1141

Query: 3864 LDEADVRNILASEQHKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNTIDDQV 4043
            LD+ DVR IL++EQ KILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGA CTN ID+QV
Sbjct: 1142 LDQVDVRTILSAEQQKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAQCTNQIDEQV 1201

Query: 4044 THVVANSLGTDKVNWALNTGRFVVHPGWVEASALLYRRANEHDFAIK 4184
            THVVANSLGTDKVNWAL+TGRFVVHPGWVEASALLYRRA E DFAIK
Sbjct: 1202 THVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRATEQDFAIK 1248


>ref|XP_004157633.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain
            phosphatase-like 3-like [Cucumis sativus]
          Length = 1249

 Score =  941 bits (2432), Expect = 0.0
 Identities = 594/1247 (47%), Positives = 734/1247 (58%), Gaps = 61/1247 (4%)
 Frame = +3

Query: 627  RFW-MRD-FLNYRISR-SFNSGLHNLAWASGVQNKPITDFLVMEMPV-TTAENNSSTDSN 794
            R W M D + NY   R  + SGL+NLAWA  VQNKP+ D  VME  +   ++++SST   
Sbjct: 54   RVWTMSDLYKNYPAMRHGYASGLYNLAWAQAVQNKPLNDIFVMEADLDEKSKHSSSTPFG 113

Query: 795  RVGPDGLNKNNNE-RVVIE--GD-----------DEGXXXXXXXXXXXXXXXXXXXXXXX 932
                DG N    E RVVI+  GD           +EG                       
Sbjct: 114  NAKDDGSNTTKEEDRVVIDDSGDEMNCDNANGEKEEGELEEGEIDMDTEFVEEVADSKAM 173

Query: 933  VVDADANDSN--SFGIRDAGLEKGLDSILKGLEGITLEYAEKSFVEACSKLQNLFDSLQE 1106
            + D+   D N   F +    L++ L  I K L+G+T++ A+KSF E CS++ +  ++  E
Sbjct: 174  LSDSRDMDINGQEFDLETKELDELLKFIQKTLDGVTIDAAQKSFQEVCSQIHSSIETFVE 233

Query: 1107 MAEEGWLNKKEDLIQLSFAAIRTLNTVFCSMXXXXXXXXXXXIKRLLVDLLNQKHPVVST 1286
            + +   + +K+ LIQ  +AA+R +N+VFCSM           + RLL  + N   P+ S 
Sbjct: 234  LLQGKVVPRKDALIQRLYAALRLINSVFCSMNLSEKEEHKEHLSRLLSYVKNCDPPLFSP 293

Query: 1287 EQLKEIEGIISSLDSFAVCSGSKGNDINKEMQVAEG---------FSKNIIDISTGN-VN 1436
            EQ+K +E  + S DS       +G+    E+ +  G         ++     ++  N + 
Sbjct: 294  EQIKSVEVKMPSTDSLDHLPSMRGSAKEVEIHIPNGVKDMDFYSAYTSTSSQLTPSNKLA 353

Query: 1437 QELLKIKSSDQSEARTMLDNLKYGGANSKYRGLSLPLLDLHKDHDADSLPSPTRETTPCL 1616
             + +      ++    + + L+ G ++ K RG  LPLLDLHKDHDADSLPSPTRE     
Sbjct: 354  SDSIPFGVKGKNNLNILSEGLQSGVSSIKGRGPLLPLLDLHKDHDADSLPSPTREAPTIF 413

Query: 1617 PIDKGFGMGHGVLKPEWPIPRVALDTNKVPMHPYETEAVKAVSTYQQKFGRSSFFMTDRL 1796
             + K    G+   K  +P+     D ++   HPYET+A+KAVSTYQQKFGRSSF M DRL
Sbjct: 414  SVQKS---GNAPTKMAFPV-----DGSR--SHPYETDALKAVSTYQQKFGRSSFSMADRL 463

Query: 1797 PSPTPSEEGENVDADISAEVSSSP-KRHAKPEITPMVGQLGVSS-------FPNMNNLSV 1952
            PSPTPSEE +    DI  EVSSS   R  K       GQ   S+       FPNM++ S 
Sbjct: 464  PSPTPSEEHDG-GGDIGGEVSSSSIIRSLKSSNVSKPGQKSNSASNVSTGLFPNMDSSST 522

Query: 1953 QGLNSIQNAAPSSYGLNPLLKQSFAKSRDPRLRLVNSDATA------GISALPNETKEPL 2114
            + L S  N AP S   NP +K   AKSRDPRLR+VNSDA+        ++++ + +    
Sbjct: 523  RVLISPLNVAPPSSVSNPTVK-PLAKSRDPRLRIVNSDASGMDLNPRTMASVQSSSILES 581

Query: 2115 GGIISSKKQKIVEERVLDGPALKRPKTELANSGFIHVG-RAVPGNGGWLEDRVPVGFKVA 2291
               +  +KQK+  E   DGP +KR +    N        RAV G+GGWLED +P G ++ 
Sbjct: 582  AATLHLRKQKMDGEPNTDGPEVKRLRIGSQNLAVAASDVRAVSGSGGWLEDTMPAGPRLF 641

Query: 2292 AR-KPELGLVDPRMPGDVA-NSTSSNITMPNVSVGINDKLAIPGSTASMQSILTDLVVNP 2465
             R + E+   +     +V  NS S N   P V+   ND        AS+ S+L D+VVNP
Sbjct: 642  NRNQMEIAEANATEKSNVTNNSGSGNECTPTVN-NSND--------ASLPSLLKDIVVNP 692

Query: 2466 SILLNFLKGQQM----------SANSTKSTSQPTSSNSILGAVPSTNLATPKPPVLGQGS 2615
            ++LLN LK  Q           S+   K+   PTS N   G+ P  N       +L Q S
Sbjct: 693  TMLLNLLKMSQQQQLAAELKLKSSEPEKNAICPTSLNPCQGSSPLINAPVATSGIL-QQS 751

Query: 2616 AGILHTPSRTAAL---EESGTVRMKPRDPRRVLHSNGLQAGKSMEIDQSQTKTSTLS-VP 2783
            AG   TPS +  +   ++ G VRMKPRDPRRVLH N LQ   S+  DQ +    T S   
Sbjct: 752  AG---TPSASPVVGRQDDLGKVRMKPRDPRRVLHGNSLQKVGSLGNDQLKGVVPTASNTE 808

Query: 2784 GVMGNLNGQRQDHQREQISVXXXXXXXXXXXXXXXXXXXXGPDISLQFKENLKNIADILT 2963
            G     NG +Q+ Q +                         PDI  QF  NLKNIADI++
Sbjct: 809  GSRDIPNGHKQEGQGDSKLASSQTIL---------------PDIGRQFTNNLKNIADIMS 853

Query: 2964 VSQASSAQSTLPQLPSLQTAQTPQGRIDAKGALEVGSLQTRSVKEVSAVSSRSQNNWGDV 3143
            V          P   S  ++  P G      +++   + T       A SSRSQ  WGD+
Sbjct: 854  VPS--------PPTSSPNSSSKPVG----SSSMDSKPVTTAFQAVDMAASSRSQGAWGDL 901

Query: 3144 EHLFDGFDDQQKAAXXXXXXXXXXXXXKMFAARKXXXXXXXXXXXXNSAKFAEVDPVHDE 3323
            EHLFD +DD+QKAA             KMFAARK            NSAKF EVDPVHDE
Sbjct: 902  EHLFDSYDDKQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDE 961

Query: 3324 ILRKKEEQDREKPQRHLFRFPHMSMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEM 3503
            ILRKKEEQDREK QRHLFRFPHM MWTKLRPG+WNFLEKAS+LYELHLYTMGNKLYATEM
Sbjct: 962  ILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHLYTMGNKLYATEM 1021

Query: 3504 AKLLDPKGELFAGRVISRGDDNDLIDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHN 3683
            AK+LDPKG LFAGRVISRGDD D +DGD+RVPKSKDLEGVLGMES VVIIDDS+RVWPHN
Sbjct: 1022 AKVLDPKGVLFAGRVISRGDDGDPLDGDDRVPKSKDLEGVLGMESGVVIIDDSIRVWPHN 1081

Query: 3684 KLNLIVVERYIYFPCSRRQFGLLGPSLLEIDHDERTEDGTLASSLAVIEKIHHDFFAHQA 3863
            K+NLIVVERY YFPCSRRQFGLLGPSLLEIDHDER EDGTLASSL VI++IH  FF++  
Sbjct: 1082 KMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLGVIQRIHQXFFSNPE 1141

Query: 3864 LDEADVRNILASEQHKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNTIDDQV 4043
            LD+ DVR IL++EQ KILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGA CTN ID+QV
Sbjct: 1142 LDQVDVRTILSAEQQKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAQCTNQIDEQV 1201

Query: 4044 THVVANSLGTDKVNWALNTGRFVVHPGWVEASALLYRRANEHDFAIK 4184
            THVVANSLGTDKVNWAL+TGRFVVHPGWVEASALLYRRA E DFAIK
Sbjct: 1202 THVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRATEQDFAIK 1248


>ref|XP_004492028.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like isoform X1 [Cicer arietinum]
          Length = 1247

 Score =  936 bits (2419), Expect = 0.0
 Identities = 575/1219 (47%), Positives = 740/1219 (60%), Gaps = 44/1219 (3%)
 Frame = +3

Query: 660  ISRSFNSGLHNLAWASGVQNKPITDFLVMEMPVTTAEN-NSSTDSNRVGPDGLNKNNNER 836
            I R + SGL+NLAWA  VQNKP+ D  VME+   +  N NS+ DSN  G   LN    E 
Sbjct: 82   ICRGYASGLYNLAWAQAVQNKPLNDIFVMELDSDSNANANSNNDSNN-GNGDLNMPLKEV 140

Query: 837  VVIEGDDEGXXXXXXXXXXXXXXXXXXXXXXXVVDADANDSNSFGIRDAGLEKGLDSILK 1016
            V+++ D+                          +D D +D+    +   G E   +S ++
Sbjct: 141  VMVDDDEREEGELEEGE----------------IDGD-DDTGGVMVGGDGSETVSESDIR 183

Query: 1017 G-LEGITLEYAEKSFVEACSKLQNLFDSLQEMAEEGWLNKKEDLIQLSFAAIRTLNTVFC 1193
              LEG+T+    +SF E  S+L  +  S  ++     +++K+ +I+L + AI  +++VFC
Sbjct: 184  DFLEGVTVANVAESFAETISRLLRVLQS--KLLSGPAVSEKDYVIRLLYNAIEIVHSVFC 241

Query: 1194 SMXXXXXXXXXXXIKRLLVDLLNQKHPVVSTEQLKEIEGIISSLDSFAVCSGSKGNDINK 1373
            SM           I RLL  L N+   + S E +KEI+ +I+++D+      S       
Sbjct: 242  SMDNLQKEDNKDNIIRLLYFLKNEHTQLFSPEHMKEIQVMITAIDTVDALGNS------- 294

Query: 1374 EMQVAEGFSKNIIDISTGNVN----QELLKIKSSDQSEARTMLDNLKYGGANSKYRGLSL 1541
             + V  G   + +DI T  +      EL+       S      + L  G +N K RG+ L
Sbjct: 295  -VVVGNGEKLDTLDIKTRQIQGLKASELISSSKLVHSNLTEASEALLSGQSNIKGRGVML 353

Query: 1542 PLLDLHKDHDADSLPSPTRETTPCLPIDKGFGMGHGVLKPEWPIP------RVALDTNKV 1703
            PL DLHK HD DSLPSPTRE     P++K F +G G+ +P  P        ++ LDT   
Sbjct: 354  PLFDLHKVHDLDSLPSPTREAPSFFPVNKLFSVGDGMDRPGLPSAGKTEAVKMELDTENS 413

Query: 1704 PMHPYETEAVKAVSTYQQKFGRSSFFMTDRLPSPTPSEEGENVDADISAEVSSSPKRHAK 1883
              H YET+A+KAVSTYQQKFGRSS+F  D+ PSPTPS + E   AD + EVSS+    + 
Sbjct: 414  KNHLYETDALKAVSTYQQKFGRSSYFTDDKFPSPTPSGDCEEGVADANEEVSSASIAVSL 473

Query: 1884 PEITPMVGQLGVSSFPNMNNLSVQGL--NSIQNAAPSSYGLNPLLKQSFAKSRDPRLRLV 2057
                P++ Q+ VSS  +++  S+ GL  + I+ A+  +Y +     ++ A+SRDPRLR +
Sbjct: 474  TSSKPLLDQMPVSS-TSVDRSSMHGLINSRIEAASSVTYPV-----KTSARSRDPRLRFI 527

Query: 2058 NSDATA-------GISALPNETKEPLGGIISSKKQKIVEERVLDGPALKRPKTELANSGF 2216
            NSDA+A       G + +P   K    G + S+KQK  EE  LD  A KR ++ L NS  
Sbjct: 528  NSDASALDLNQSLGTNNMP---KVENAGRVISRKQKTTEELSLDATAPKRLRSSLENSRH 584

Query: 2217 -IHVGRAVPGNGGWLEDRVPVGFKVAARKPELGLVDPRMPGDVANSTSSNITMPNVSVGI 2393
                 R + GNGGWLE+    G  +  R   +   +  +   +  STSS  +    +   
Sbjct: 585  NTREERTMAGNGGWLEENRVAGSHLIERNHLMQKGETELKKTM--STSSGYSTVTSNGNE 642

Query: 2394 NDKLAIPGSTASMQSILTDLVVNPSILLNFLKGQQ--MSANSTK-----STSQPTSSNSI 2552
               + +  + A++  +L ++ VNP++LLN L  QQ  ++A + K     +TS    +NS 
Sbjct: 643  QAPVTVSNTAAALPGLLKNIAVNPTMLLNILLEQQQRLAAEANKKPVDSATSTMHLTNSA 702

Query: 2553 LGAVPSTNLATPKPPVLGQGSAGILHTPSRTAA-----LEESGTVRMKPRDPRRVLH-SN 2714
             G   + N        L Q S G+L   ++ A+     LE+SG +RMKPRDPRR+LH S+
Sbjct: 703  RGPDATVNTGPAMTAGLPQSSVGMLPASTQAASMAHTLLEDSGKIRMKPRDPRRILHGSS 762

Query: 2715 GLQAGKSMEIDQSQTKTS-TLSVPGVMGNLNGQRQDHQREQISVXXXXXXXXXXXXXXXX 2891
             LQ   S   +QS++  S T +  G  GN+N Q+ D + E                    
Sbjct: 763  SLQKSGSTGSEQSKSVVSPTSNNQGNGGNVNAQKLDVRVET--------------KLAPT 808

Query: 2892 XXXXGPDISLQFKENLKNIADILTVSQASSAQSTLPQLPSLQTAQTPQGRIDAKGALEVG 3071
                 PDI+ QF +NLKNIADI++VSQ  S Q  LP      ++ +    +D K  L+ G
Sbjct: 809  QSSAQPDITRQFTKNLKNIADIMSVSQEPSTQ--LPATTQNVSSASVPFTLD-KAELKSG 865

Query: 3072 SLQTRSVKE--------VSAVSSRSQNNWGDVEHLFDGFDDQQKAAXXXXXXXXXXXXXK 3227
               ++++++         +  SSRSQ+ W DVEHLF+G+D++QKAA             K
Sbjct: 866  VPNSQNLQDGVGSAPETCAPGSSRSQSTWADVEHLFEGYDEKQKAAIQRERARRLEEQNK 925

Query: 3228 MFAARKXXXXXXXXXXXXNSAKFAEVDPVHDEILRKKEEQDREKPQRHLFRFPHMSMWTK 3407
            MFA++K            NSAKF EVDPVHDEILRKKEEQDREKP RHLFRFPHM MWTK
Sbjct: 926  MFASKKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTK 985

Query: 3408 LRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGELFAGRVISRGDDNDLIDGD 3587
            LRPG+WNFLEKASKLYELHLYTMGNKLYATEMAK+LDPKG LFAGRVISRGDD + +DGD
Sbjct: 986  LRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDTESVDGD 1045

Query: 3588 ERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLLGPSLL 3767
            ER PKSKDLEGV+GMES+VVI+DDSVRVWPHNKLNLIVVERY YFPCSRRQFGL GPSLL
Sbjct: 1046 ERAPKSKDLEGVMGMESSVVIVDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLL 1105

Query: 3768 EIDHDERTEDGTLASSLAVIEKIHHDFFAHQALDEADVRNILASEQHKILAGCRIVFSRV 3947
            EIDHDER E GTLASSLAVIE+IH +FFA Q+L+E DVRNILASEQ KILAGCRIVFSRV
Sbjct: 1106 EIDHDERPEAGTLASSLAVIERIHQNFFASQSLEEVDVRNILASEQRKILAGCRIVFSRV 1165

Query: 3948 FPVGEANPHLHPLWQTAEQFGAVCTNTIDDQVTHVVANSLGTDKVNWALNTGRFVVHPGW 4127
            FPVGEANPHLHPLWQTAEQFGAVC N IDDQVTHVVANSLGTDKVNWA++TGRFVVHPGW
Sbjct: 1166 FPVGEANPHLHPLWQTAEQFGAVCINQIDDQVTHVVANSLGTDKVNWAISTGRFVVHPGW 1225

Query: 4128 VEASALLYRRANEHDFAIK 4184
            VEASALLYRRANE DFAIK
Sbjct: 1226 VEASALLYRRANEQDFAIK 1244


>gb|ESW11309.1| hypothetical protein PHAVU_008G019000g [Phaseolus vulgaris]
          Length = 1272

 Score =  934 bits (2413), Expect = 0.0
 Identities = 578/1222 (47%), Positives = 733/1222 (59%), Gaps = 47/1222 (3%)
 Frame = +3

Query: 660  ISRSFNSGLHNLAWASGVQNKPITDFLVMEMPVTTAENNSSTDSNRVGPDGLNKNNNERV 839
            I R + SGL+NLAWA  VQNKP+ D  VME+      N++S +SNR  P  ++ N  E +
Sbjct: 77   ICRGYASGLYNLAWAQAVQNKPLNDIFVMELDSEANANSNSNNSNR--PSSVSVNPKEVM 134

Query: 840  VIEGD-DEGXXXXXXXXXXXXXXXXXXXXXXX-VVDADANDSNSFGIRDAGLEKGLDSIL 1013
            V++ D +EG                        VV    +DS  FG++    +     + 
Sbjct: 135  VVDVDREEGELEEGEIDADADPEAEAESVVAASVVSETVSDSEQFGVKKGVSDSEQLGVR 194

Query: 1014 KGLEGITLEYAEKSFVEACSKLQNLFDSLQEMAEEGWLNKKEDLIQLSFAAIRTLNTVFC 1193
              LEG+T+    +SF +  S+L N   +L ++      ++K+DLI+LSF AI  + +VF 
Sbjct: 195  DVLEGVTVANVAESFAQTSSRLLN---ALPQVFSRPADSEKDDLIRLSFNAIEVVYSVFR 251

Query: 1194 SMXXXXXXXXXXXIKRLLVDLLNQKHP-VVSTEQLKEIEGIISSLDSFAVCSGSKGNDIN 1370
            SM           I RLL    ++K   + S E +KEI+ +++++DS      ++   + 
Sbjct: 252  SMDSSDKEQNKNSILRLLSSAKDKKQAQLFSPEHIKEIQDMMTAIDSVGALGSNEAIYME 311

Query: 1371 KEMQVAEGFSKNIIDISTGNVNQELLKIKSSDQSEARTMLDN--------------LKYG 1508
             E+Q  E  S+   + S   V    +KI+ +    A  ++ +              LK+G
Sbjct: 312  TELQTPEIKSQ---ENSALEVQTRGIKIQENQAVVATELVSSIKPLHSDIIGASRALKFG 368

Query: 1509 GANSKYRGLSLPLLDLHKDHDADSLPSPTRETTPCLPIDKGFGMGHGVLKP-----EWPI 1673
              + K RG+ LPLLDLHKDHDADSLPSPTRE   C P++K   +G  ++K      +   
Sbjct: 369  QNSIKGRGVLLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGEVMVKSGSAAAKMQP 428

Query: 1674 PRVALDTNKVPMHPYETEAVKAVSTYQQKFGRSSFFMTDRLPSPTPSEEGENVDADISAE 1853
             ++ +D+     H YET+A+KAVSTYQQKFGRSS F  D+LPSPTPS + +++  D + E
Sbjct: 429  GKLEVDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKLPSPTPSGDCDDMAVDTNEE 488

Query: 1854 VSSSPKRHAKPEITPMVGQLGVSSFPNMNNLSVQGLNSIQNAAPSSYGLNPLLKQSFAKS 2033
            VSS+          P +      S  +++   + GL S +  A  S G  P+  +S AKS
Sbjct: 489  VSSASTSGFLTSTKPTLLDQPPVSATSVDKSRLLGLISSRVDAAGS-GSFPV--KSSAKS 545

Query: 2034 RDPRLRLVNSDATA---GISALPNETKEPLGGIISSKKQKIVEERVLDGPALKRPKTELA 2204
            RDPR RL+NS+A+A     +   N  K    G   S+KQK VEE   D    KR K+ L 
Sbjct: 546  RDPRRRLINSEASAVDNQFTVTHNMPKVEYAGSTISRKQKAVEEPSFDLTVSKRLKSSLE 605

Query: 2205 NSGF-IHVGRAVPGNGGWLEDRVPVGFKVAARKPELGLVDPRMPGDVANSTSSNITMPNV 2381
            N        R + G+GGWLED    G ++  +   +    P     +   +SS     N 
Sbjct: 606  NIEHNTSEVRTIAGSGGWLEDITGPGTQLIEKNHLIDKFAPEPKRTLNTVSSSGSVNFNA 665

Query: 2382 SVGINDKLAIPGST--ASMQSILTDLVVNPSILLNFLKGQQM-------SANSTKSTSQP 2534
            +   N++  I  +   +S+ +I  D+VVNP++LL+ L  Q+        SA+S  +   P
Sbjct: 666  TSIRNEQAPITSNNVPSSLPAIFKDIVVNPTMLLSLLMEQKRLVDAQNNSADSATNMLHP 725

Query: 2535 TSSNSILGAVPSTNLATPKPPVLGQGSAGILHTPSR---TAALEE--SGTVRMKPRDPRR 2699
            TSSNS +G   + ++ +     L Q S G+L   S+   TA L++  SG +RMKPRDPRR
Sbjct: 726  TSSNSAMGTDSTASIVSSMATGL-QTSVGMLPVSSQSTSTAQLQDDYSGKIRMKPRDPRR 784

Query: 2700 VLHSNGLQAGKSMEIDQSQTKTSTLSVPGVM---GNLNGQRQDHQREQISVXXXXXXXXX 2870
            +LH+N     KS  I     K     V  ++    ++N Q+ + + +   V         
Sbjct: 785  ILHTNN-SVQKSGNIVNELHKAIVSPVSNILVTGDSVNAQKLEGRMDTKLVPTQSGA--- 840

Query: 2871 XXXXXXXXXXXGPDISLQFKENLKNIADILTVSQASSAQSTLPQLPSLQTAQTPQGRIDA 3050
                        PDI+ QF  NLKNIADI++VSQ SS  S   Q  S  +      R + 
Sbjct: 841  -----------APDITRQFTRNLKNIADIMSVSQESSTHSPAAQGFSSASVPLNVDRGEQ 889

Query: 3051 KGALEVGS---LQTRSVKEVSAV-SSRSQNNWGDVEHLFDGFDDQQKAAXXXXXXXXXXX 3218
            K  L         T S  E+ A  +SRSQ+ WGDVEHLF+G+D+QQKAA           
Sbjct: 890  KSVLSNSQNLHAGTGSAPEICAPGTSRSQSTWGDVEHLFEGYDEQQKAAIQRERARRIEE 949

Query: 3219 XXKMFAARKXXXXXXXXXXXXNSAKFAEVDPVHDEILRKKEEQDREKPQRHLFRFPHMSM 3398
              KMFAARK            NSAKF EVDPVH+EILRKKEE DREKP RHLFRFPHM M
Sbjct: 950  QNKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHEEILRKKEELDREKPHRHLFRFPHMGM 1009

Query: 3399 WTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGELFAGRVISRGDDNDLI 3578
            WTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAK+LDPKG LFAGRVISRGDD D +
Sbjct: 1010 WTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDTDSV 1069

Query: 3579 DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLLGP 3758
            DG+ER PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY YFPCSRRQFGL GP
Sbjct: 1070 DGEERAPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGP 1129

Query: 3759 SLLEIDHDERTEDGTLASSLAVIEKIHHDFFAHQALDEADVRNILASEQHKILAGCRIVF 3938
            SLLEIDHDER E GTLASSLAVIE++H +FF+ Q+L+E DVRNILASEQ KIL+GCRIVF
Sbjct: 1130 SLLEIDHDERPEAGTLASSLAVIERLHQNFFSSQSLEEVDVRNILASEQRKILSGCRIVF 1189

Query: 3939 SRVFPVGEANPHLHPLWQTAEQFGAVCTNTIDDQVTHVVANSLGTDKVNWALNTGRFVVH 4118
            SRVFPVGEANPHLHPLWQTAEQFGAVCTN IDDQVTHVVANSLGTDKVNWAL+TGRFVVH
Sbjct: 1190 SRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDDQVTHVVANSLGTDKVNWALSTGRFVVH 1249

Query: 4119 PGWVEASALLYRRANEHDFAIK 4184
            PGWVEASALLYRRANE DFAIK
Sbjct: 1250 PGWVEASALLYRRANEQDFAIK 1271


>ref|XP_004310239.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Fragaria vesca subsp. vesca]
          Length = 1230

 Score =  929 bits (2400), Expect = 0.0
 Identities = 579/1220 (47%), Positives = 721/1220 (59%), Gaps = 34/1220 (2%)
 Frame = +3

Query: 627  RFW-MRDFLNYRISRSFNSG-LHNLAWASGVQNKPITDFLVMEMPVTTAENNSSTDSNRV 800
            RFW   + L +   R    G L NLAWA  VQNKP  D LV        +++  +   + 
Sbjct: 53   RFWTFHEVLAHPHFRGIGGGGLANLAWAQAVQNKPFNDLLVK------LDSDEKSKQQQQ 106

Query: 801  GPDGLNKNNNERVVIEGDDEGXXXXXXXXXXXXXXXXXXXXXXXVVDADANDSNSFGIRD 980
                ++  N + V+I+  DE                          +   ND  +  + +
Sbjct: 107  QRSSVSSGNEKVVIIDSGDEMDVEKEEEELEEGEIGFDS-------ECGDNDKAAGSVGN 159

Query: 981  AGLEKGLDSILKGLEGITLEYAEKSFVEACSKLQNLFDSLQEMAEEGWLNKKEDLIQLSF 1160
               EK ++ + + LE +T+  AEKSF + C +  +  +SL+ +  E  ++ KE L+Q  F
Sbjct: 160  GVWEKRVNLLREALESLTITEAEKSFGDVCHRFLDSLESLRGVLSEINVSTKEALVQQLF 219

Query: 1161 AAIRTLNTVFCSMXXXXXXXXXXXIKRLLVDLLNQKHPVVSTEQLKEIEGIISSLDSFAV 1340
             A+R +++VF SM           + R+L    +   P  + EQLKEIE + SS+DS   
Sbjct: 220  NAVRAISSVFRSMSADQKEQNKDVLSRILSSAKSDPSPFPA-EQLKEIEVMSSSMDSPQT 278

Query: 1341 CSGSKGNDINKEMQVAEGFSKNIIDISTGNVNQELLKIKSSDQSEARTMLDNL------- 1499
             +G+K N I    Q   G  K   D S  N +       ++      +++ +        
Sbjct: 279  KAGTKENGI----QCINGVYKTDSDTSGANASHVFTYAANTGSDTQVSVVHSNPNISSEV 334

Query: 1500 -KYGGANSKYRGLSLPLLDLHKDHDADSLPSPTRETTPCLPIDKGFGMGHGVLKPE-WPI 1673
             + G ++ K RGL LPLLDLH DHD DSLPSPTRE   C P  K   + +G++K   W  
Sbjct: 335  PRSGSSSFKGRGLMLPLLDLHMDHDEDSLPSPTREPPACFPAQKPVVVENGMVKKSGWET 394

Query: 1674 PRVALDTNKVPMHPYETEAVKAVSTYQQKFGRSSFFMTDRLPSPTPSEE-GENVDADISA 1850
             R ALD     MH YETEA+KAVS+YQQKF R+SF +T  LPSPTPSEE G+N D     
Sbjct: 395  ARAALDVEGSKMHVYETEALKAVSSYQQKFSRNSF-LTSELPSPTPSEEEGDNGDDAAVG 453

Query: 1851 EVSSSP-KRHAKPEITPMVGQLGVSSFPNMN---NLSVQGLNSIQNAAPSSYGLNPLLKQ 2018
            EVSSS    + +    P+ G+  VSS P      +  + GL + + A+P S G N +  +
Sbjct: 454  EVSSSSASNNVRTPQPPVSGRQVVSSVPATTLPGSSGMHGLITAKTASPVSLGSN-MPNK 512

Query: 2019 SFAKSRDPRLRLVNSDATA------GISALPNETKEPLGGIISSKKQKIVEERVLDGPAL 2180
            S AKSRDPRLR  NSDA A          + N  K      +SS+K K  E+   DGP  
Sbjct: 513  SSAKSRDPRLRFANSDAGALTLNQQSSIQVHNAPKVDSVITLSSRKHKSPEDSNFDGPES 572

Query: 2181 KRPKTELANSGFIHVGRAVPGNGGWLEDRVPVGFKVAARKP--ELGLVDPRMPGDVANST 2354
            KR +   ANS      +   GNG WLED   VG  +  R    E    DPR   +V++S 
Sbjct: 573  KRQRG--ANSVVGWGAKTSFGNGVWLEDGSSVGPHLINRNQTVEKKEADPRKMVNVSSSP 630

Query: 2355 SSNITMPNVSVGINDKLAIPG-STASMQSILTDLVVNPSILLNFLKGQQMSANST----- 2516
             +     N     N+K+ +   S  S+ +I  D+ VNP++L+N LK  +   N+      
Sbjct: 631  GTVEGNSNGQNTANEKVPLVAPSLVSLPAIFKDIAVNPTMLVNILKLAEAQQNAAAPARK 690

Query: 2517 KSTSQPTSSNSILGAVPSTNLATPKPPVLGQGSAGILHTP---SRTAALEESGTVRMKPR 2687
            +S + P SS+SI G     N  +         ++G L TP   S+    +E+G +RMK R
Sbjct: 691  ESLTYPPSSSSIPGTAALVNDPSK--------TSGALLTPTICSQKTPTDEAGKIRMKLR 742

Query: 2688 DPRRVLHSNGLQAGKSMEIDQSQTKTSTLSVPGVMGN-LNGQRQDHQREQISVXXXXXXX 2864
            DPRR+LH N LQ   S+  +QS+     LS      + +NG++QD Q +  SV       
Sbjct: 743  DPRRLLHGNALQNSGSVGHEQSRNIVPPLSSSQANNDDMNGKKQDSQADNNSVTSQSGAL 802

Query: 2865 XXXXXXXXXXXXXGPDISLQFKENLKNIADILTVSQASSAQSTLPQLPSLQTAQTPQGRI 3044
                          PDI+ QF +NLKNIADI++VSQ S++ +T  Q  S +        +
Sbjct: 803  G------------APDIASQFTKNLKNIADIISVSQVSTSPATPSQNLSTELISINPDNV 850

Query: 3045 DAKGALEVGSLQTRSVKEVSAVSSRSQNNWGDVEHLFDGFDDQQKAAXXXXXXXXXXXXX 3224
            D K   +     + SV   +A +SRS   WGDVEHLF+G+DD+QKAA             
Sbjct: 851  DLKAEEQHTGSISASVP-TAAGASRSPATWGDVEHLFEGYDDKQKAAIQRERARRIEEQK 909

Query: 3225 KMFAARKXXXXXXXXXXXXNSAKFAEVDPVHDEILRKKEEQDREKPQRHLFRFPHMSMWT 3404
            KMFAA K            NSAKF EVDPVHDEILRKKEEQDR++PQRHLFRF HM MWT
Sbjct: 910  KMFAAHKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDRKEPQRHLFRFQHMGMWT 969

Query: 3405 KLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGELFAGRVISRGDDNDLIDG 3584
            KLRPG+W FLEKAS L+E+HLYTMGNKLYATEMAK+LDP G LFAGRVISRGDD D  DG
Sbjct: 970  KLRPGVWKFLEKASHLFEMHLYTMGNKLYATEMAKVLDPTGALFAGRVISRGDDGDPYDG 1029

Query: 3585 DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLLGPSL 3764
            DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY YFPCSRRQFGLLGPSL
Sbjct: 1030 DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSL 1089

Query: 3765 LEIDHDERTEDGTLASSLAVIEKIHHDFFAHQALDEADVRNILASEQHKILAGCRIVFSR 3944
            LEIDHDER EDGTLASSLAVIEKIH  FF+H +LDEADVRNILASEQ KIL GCRIVFSR
Sbjct: 1090 LEIDHDERHEDGTLASSLAVIEKIHQIFFSHPSLDEADVRNILASEQQKILGGCRIVFSR 1149

Query: 3945 VFPVGEANPHLHPLWQTAEQFGAVCTNTIDDQVTHVVANSLGTDKVNWALNTGRFVVHPG 4124
            VFPVGE NPHLHPLWQTAEQFGAVCTN IDDQVTHVVANSLGTDKVNWAL++G++VVHPG
Sbjct: 1150 VFPVGEVNPHLHPLWQTAEQFGAVCTNQIDDQVTHVVANSLGTDKVNWALSSGKYVVHPG 1209

Query: 4125 WVEASALLYRRANEHDFAIK 4184
            WVEASALLYRRANE DFAIK
Sbjct: 1210 WVEASALLYRRANEQDFAIK 1229


>ref|XP_004492029.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like isoform X2 [Cicer arietinum]
          Length = 1227

 Score =  927 bits (2397), Expect = 0.0
 Identities = 571/1218 (46%), Positives = 735/1218 (60%), Gaps = 43/1218 (3%)
 Frame = +3

Query: 660  ISRSFNSGLHNLAWASGVQNKPITDFLVMEMPVTTAENNSSTDSNRVGPDGLNKNNNERV 839
            I R + SGL+NLAWA  VQNKP+ D  VME+     +++S+ +   V  D   +   E  
Sbjct: 82   ICRGYASGLYNLAWAQAVQNKPLNDIFVMEL-----DSDSNANVVMVDDDEREEGELEEG 136

Query: 840  VIEGDDEGXXXXXXXXXXXXXXXXXXXXXXXVVDADANDSNSFG-IRDAGLEKGLDSILK 1016
             I+GDD+                        +V  D +++ S   IRD            
Sbjct: 137  EIDGDDD--------------------TGGVMVGGDGSETVSESDIRDF----------- 165

Query: 1017 GLEGITLEYAEKSFVEACSKLQNLFDSLQEMAEEGWLNKKEDLIQLSFAAIRTLNTVFCS 1196
             LEG+T+    +SF E  S+L  +  S  ++     +++K+ +I+L + AI  +++VFCS
Sbjct: 166  -LEGVTVANVAESFAETISRLLRVLQS--KLLSGPAVSEKDYVIRLLYNAIEIVHSVFCS 222

Query: 1197 MXXXXXXXXXXXIKRLLVDLLNQKHPVVSTEQLKEIEGIISSLDSFAVCSGSKGNDINKE 1376
            M           I RLL  L N+   + S E +KEI+ +I+++D+      S        
Sbjct: 223  MDNLQKEDNKDNIIRLLYFLKNEHTQLFSPEHMKEIQVMITAIDTVDALGNS-------- 274

Query: 1377 MQVAEGFSKNIIDISTGNVN----QELLKIKSSDQSEARTMLDNLKYGGANSKYRGLSLP 1544
            + V  G   + +DI T  +      EL+       S      + L  G +N K RG+ LP
Sbjct: 275  VVVGNGEKLDTLDIKTRQIQGLKASELISSSKLVHSNLTEASEALLSGQSNIKGRGVMLP 334

Query: 1545 LLDLHKDHDADSLPSPTRETTPCLPIDKGFGMGHGVLKPEWPIP------RVALDTNKVP 1706
            L DLHK HD DSLPSPTRE     P++K F +G G+ +P  P        ++ LDT    
Sbjct: 335  LFDLHKVHDLDSLPSPTREAPSFFPVNKLFSVGDGMDRPGLPSAGKTEAVKMELDTENSK 394

Query: 1707 MHPYETEAVKAVSTYQQKFGRSSFFMTDRLPSPTPSEEGENVDADISAEVSSSPKRHAKP 1886
             H YET+A+KAVSTYQQKFGRSS+F  D+ PSPTPS + E   AD + EVSS+    +  
Sbjct: 395  NHLYETDALKAVSTYQQKFGRSSYFTDDKFPSPTPSGDCEEGVADANEEVSSASIAVSLT 454

Query: 1887 EITPMVGQLGVSSFPNMNNLSVQGL--NSIQNAAPSSYGLNPLLKQSFAKSRDPRLRLVN 2060
               P++ Q+ VSS  +++  S+ GL  + I+ A+  +Y +     ++ A+SRDPRLR +N
Sbjct: 455  SSKPLLDQMPVSS-TSVDRSSMHGLINSRIEAASSVTYPV-----KTSARSRDPRLRFIN 508

Query: 2061 SDATA-------GISALPNETKEPLGGIISSKKQKIVEERVLDGPALKRPKTELANSGF- 2216
            SDA+A       G + +P   K    G + S+KQK  EE  LD  A KR ++ L NS   
Sbjct: 509  SDASALDLNQSLGTNNMP---KVENAGRVISRKQKTTEELSLDATAPKRLRSSLENSRHN 565

Query: 2217 IHVGRAVPGNGGWLEDRVPVGFKVAARKPELGLVDPRMPGDVANSTSSNITMPNVSVGIN 2396
                R + GNGGWLE+    G  +  R   +   +  +   +  STSS  +    +    
Sbjct: 566  TREERTMAGNGGWLEENRVAGSHLIERNHLMQKGETELKKTM--STSSGYSTVTSNGNEQ 623

Query: 2397 DKLAIPGSTASMQSILTDLVVNPSILLNFLKGQQ--MSANSTK-----STSQPTSSNSIL 2555
              + +  + A++  +L ++ VNP++LLN L  QQ  ++A + K     +TS    +NS  
Sbjct: 624  APVTVSNTAAALPGLLKNIAVNPTMLLNILLEQQQRLAAEANKKPVDSATSTMHLTNSAR 683

Query: 2556 GAVPSTNLATPKPPVLGQGSAGILHTPSRTAA-----LEESGTVRMKPRDPRRVLH-SNG 2717
            G   + N        L Q S G+L   ++ A+     LE+SG +RMKPRDPRR+LH S+ 
Sbjct: 684  GPDATVNTGPAMTAGLPQSSVGMLPASTQAASMAHTLLEDSGKIRMKPRDPRRILHGSSS 743

Query: 2718 LQAGKSMEIDQSQTKTS-TLSVPGVMGNLNGQRQDHQREQISVXXXXXXXXXXXXXXXXX 2894
            LQ   S   +QS++  S T +  G  GN+N Q+ D + E                     
Sbjct: 744  LQKSGSTGSEQSKSVVSPTSNNQGNGGNVNAQKLDVRVET--------------KLAPTQ 789

Query: 2895 XXXGPDISLQFKENLKNIADILTVSQASSAQSTLPQLPSLQTAQTPQGRIDAKGALEVGS 3074
                PDI+ QF +NLKNIADI++VSQ  S Q  LP      ++ +    +D K  L+ G 
Sbjct: 790  SSAQPDITRQFTKNLKNIADIMSVSQEPSTQ--LPATTQNVSSASVPFTLD-KAELKSGV 846

Query: 3075 LQTRSVKE--------VSAVSSRSQNNWGDVEHLFDGFDDQQKAAXXXXXXXXXXXXXKM 3230
              ++++++         +  SSRSQ+ W DVEHLF+G+D++QKAA             KM
Sbjct: 847  PNSQNLQDGVGSAPETCAPGSSRSQSTWADVEHLFEGYDEKQKAAIQRERARRLEEQNKM 906

Query: 3231 FAARKXXXXXXXXXXXXNSAKFAEVDPVHDEILRKKEEQDREKPQRHLFRFPHMSMWTKL 3410
            FA++K            NSAKF EVDPVHDEILRKKEEQDREKP RHLFRFPHM MWTKL
Sbjct: 907  FASKKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKL 966

Query: 3411 RPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGELFAGRVISRGDDNDLIDGDE 3590
            RPG+WNFLEKASKLYELHLYTMGNKLYATEMAK+LDPKG LFAGRVISRGDD + +DGDE
Sbjct: 967  RPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDTESVDGDE 1026

Query: 3591 RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLLGPSLLE 3770
            R PKSKDLEGV+GMES+VVI+DDSVRVWPHNKLNLIVVERY YFPCSRRQFGL GPSLLE
Sbjct: 1027 RAPKSKDLEGVMGMESSVVIVDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLE 1086

Query: 3771 IDHDERTEDGTLASSLAVIEKIHHDFFAHQALDEADVRNILASEQHKILAGCRIVFSRVF 3950
            IDHDER E GTLASSLAVIE+IH +FFA Q+L+E DVRNILASEQ KILAGCRIVFSRVF
Sbjct: 1087 IDHDERPEAGTLASSLAVIERIHQNFFASQSLEEVDVRNILASEQRKILAGCRIVFSRVF 1146

Query: 3951 PVGEANPHLHPLWQTAEQFGAVCTNTIDDQVTHVVANSLGTDKVNWALNTGRFVVHPGWV 4130
            PVGEANPHLHPLWQTAEQFGAVC N IDDQVTHVVANSLGTDKVNWA++TGRFVVHPGWV
Sbjct: 1147 PVGEANPHLHPLWQTAEQFGAVCINQIDDQVTHVVANSLGTDKVNWAISTGRFVVHPGWV 1206

Query: 4131 EASALLYRRANEHDFAIK 4184
            EASALLYRRANE DFAIK
Sbjct: 1207 EASALLYRRANEQDFAIK 1224


>ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
            gi|223548611|gb|EEF50102.1| RNA polymerase II ctd
            phosphatase, putative [Ricinus communis]
          Length = 1195

 Score =  925 bits (2391), Expect = 0.0
 Identities = 579/1247 (46%), Positives = 731/1247 (58%), Gaps = 61/1247 (4%)
 Frame = +3

Query: 627  RFW-MRDFLNYRISRSFNSGLHNLAWASGVQ------NKPITDF---LVMEMPVTTAENN 776
            R W + D   Y++     SGL+NLAWA  VQ      NKP+ +    +V E+  ++  ++
Sbjct: 63   RVWTISDLYRYQMVGGHVSGLYNLAWAQAVQSKPGKSNKPLNELFADVVEELDESSKRSS 122

Query: 777  SSTDSNRVGPDGLNKNNNERVVIEG---DDEGXXXXXXXXXXXXXXXXXXXXXXXVVDAD 947
             S+ +  V  +  + +  ++ V+E    DD G                       VV+ +
Sbjct: 123  PSSSAASVNSNNKDGDEEKKKVVEKVVIDDNGDEMMDDNNRNKIVD---------VVEKE 173

Query: 948  ANDSNSFGIRDAGLEKGL-----DSILKGLEGITLEYAEKSFVEACSKLQNLFDSLQEMA 1112
              +    G  D  +E G      D +   ++G+ +E  EK F +   K+ ++ D+L+ + 
Sbjct: 174  EGELEE-GEIDLDMEPGEKANNGDVLNMNIDGLEVESGEKGFEK---KMNSIRDALESVT 229

Query: 1113 EEGWLNKKEDLIQLSFAAIRTLNTVFCSMXXXXXXXXXXXIKRLLVDLLNQKHPVVSTEQ 1292
                       I+   A   +    F S                      +K P++ST  
Sbjct: 230  -----------IEFVLACTDSSGVSFSSFSE------------------KEKEPLISTVV 260

Query: 1293 LKEIEGIISSLDSFAVCSGSKGNDINKEMQVAEGFSKNIIDISTGNVNQELLKIKSSD-- 1466
             K                  K ND+N +              S+G+    + K+ +    
Sbjct: 261  NK------------------KDNDVNGK--------------SSGHDMSAVNKLPTDSFV 288

Query: 1467 QSEARTMLDNLKYGGANSKYRGLSLPLLDLHKDHDADSLPSPTRETTPCLPIDKGFGMGH 1646
             ++A   ++  K G ++ K R   LPLLDLHKDHDADSLPSPTRE+   LP        +
Sbjct: 289  NNKANLSIEGPKTGVSSFKSRAALLPLLDLHKDHDADSLPSPTRESALPLP-------AY 341

Query: 1647 GVLKPEWPIPRVALDTNKVPMHPYETEAVKAVSTYQQKFGRSSFFMTDRLPSPTPSEEGE 1826
             VL P     ++ LDT    MHPYET+A+KAVS+YQQKF +SSF +TDRLPSPTPSEE  
Sbjct: 342  RVLTP-----KMVLDTGNSRMHPYETDALKAVSSYQQKFSKSSFALTDRLPSPTPSEESG 396

Query: 1827 NVDADISAEVSSSPKRHAKPEITPMV-GQLGVS-SFPNMNNLSVQGLNSIQNAAPSSYGL 2000
            N D D   EVSSS    +     P+  GQ   S S P M+  S+ G+ SI++A  +S   
Sbjct: 397  NGDGDTGGEVSSSLSVSSFRPANPLTSGQSNASISLPRMDGSSLPGVISIKSAVRASSAP 456

Query: 2001 NPLLKQSFAKSRDPRLRLVNSDATA------GISALPNETKEPLGGIISSKKQKIVEERV 2162
            +  +K S AKSRDPRLR VNSD+ A       +  +     EP+GG ++ K+QKIV++ +
Sbjct: 457  SLTVKAS-AKSRDPRLRFVNSDSNALDQNHRAVPVVNTLKVEPIGGTMNKKRQKIVDDPI 515

Query: 2163 LDGPALKRPKTELANSGFIHVGRAVPGNGGWLEDRVPVGFKVAARKPELGLV--DPRMP- 2333
             DG +LKR K  L NSG +   + + G+GGWLED   VG +   +   +     DPR   
Sbjct: 516  PDGHSLKRQKNALENSGVVRDVKTMVGSGGWLEDTDMVGPQTMNKNQLVDNAESDPRRKD 575

Query: 2334 -GDVANSTS----------SNITMPNVSVGINDKLA-IPGSTASMQSILTDLVVNPSILL 2477
             G V  S+S            I +   SV I  +L  + GSTA++  +L ++ VNP++L+
Sbjct: 576  GGGVCTSSSCISSVNISGTEQIPVTGTSVPIGGELVPVKGSTAAIPDLLKNIAVNPTMLI 635

Query: 2478 NFLK----------GQQMSANSTKSTSQPTSSNSILGAVPSTNLATPKPPVLGQGSAGIL 2627
            N LK           QQ   +  KST+ P +SNS+LG VP          V+G   +GIL
Sbjct: 636  NILKMGQQQRLALEAQQKPVDPAKSTTYPLNSNSMLGTVP----------VVGAAHSGIL 685

Query: 2628 HTPSRTAAL-------EESGTVRMKPRDPRRVLHSNGLQAGKSMEIDQSQTKTSTLSV-P 2783
              P+ T  +       ++ G +RMKPRDPRRVLH+N LQ   SM  +  +T  +++ +  
Sbjct: 686  PRPAGTVQVSPQLGTADDLGKIRMKPRDPRRVLHNNALQRNGSMGSEHLKTNLTSIPINQ 745

Query: 2784 GVMGNLNGQRQDHQREQISVXXXXXXXXXXXXXXXXXXXXGPDISLQFKENLKNIADILT 2963
                N N Q+Q+ Q E+  V                     PDIS+ F +NLKNIADI++
Sbjct: 746  ETKDNQNLQKQEGQVEKKPVPLQSLAL--------------PDISMPFTKNLKNIADIVS 791

Query: 2964 VSQASSAQSTLPQLPSLQTAQTPQGRIDAKGALEVGSLQTRSVKEVSAVSSRSQNNWGDV 3143
            VS AS++Q  +PQ P+ Q  +T     D    L +GS    +    +A   R+QN WGDV
Sbjct: 792  VSHASTSQPLVPQNPASQPMRTTISSSDQ--FLGIGSAPGAAA--AAAAGPRTQNAWGDV 847

Query: 3144 EHLFDGFDDQQKAAXXXXXXXXXXXXXKMFAARKXXXXXXXXXXXXNSAKFAEVDPVHDE 3323
            EHLF+G++DQQKAA             K+F+ARK            NSAKF EVDPVHDE
Sbjct: 848  EHLFEGYNDQQKAAIQRERARRIEEQKKLFSARKLCLVLDLDHTLLNSAKFVEVDPVHDE 907

Query: 3324 ILRKKEEQDREKPQRHLFRFPHMSMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEM 3503
            ILRKKEEQDREK  RHLFRFPHM MWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEM
Sbjct: 908  ILRKKEEQDREKAHRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEM 967

Query: 3504 AKLLDPKGELFAGRVISRGDDNDLIDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHN 3683
            AK+LDP G LF GRVISRGDD +  DGDER+PKSKDLEGVLGMES VVI+DDSVRVWPHN
Sbjct: 968  AKVLDPTGVLFNGRVISRGDDGEPFDGDERIPKSKDLEGVLGMESGVVIMDDSVRVWPHN 1027

Query: 3684 KLNLIVVERYIYFPCSRRQFGLLGPSLLEIDHDERTEDGTLASSLAVIEKIHHDFFAHQA 3863
            KLNLIVVERYIYFPCSRRQFGL GPSLLEIDHDER EDGTLA SLAVIE+IH +FF H +
Sbjct: 1028 KLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEDGTLACSLAVIERIHQNFFTHPS 1087

Query: 3864 LDEADVRNILASEQHKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNTIDDQV 4043
            LDEADVRNILASEQ KILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTN ID+QV
Sbjct: 1088 LDEADVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQV 1147

Query: 4044 THVVANSLGTDKVNWALNTGRFVVHPGWVEASALLYRRANEHDFAIK 4184
            THVVANSLGTDKVNWAL+TGRFVV+PGWVEASALLYRRANE DFAIK
Sbjct: 1148 THVVANSLGTDKVNWALSTGRFVVYPGWVEASALLYRRANEQDFAIK 1194