BLASTX nr result

ID: Paeonia23_contig00010959 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia23_contig00010959
         (4700 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal doma...  1258   0.0  
ref|XP_007043830.1| RNA polymerase II C-terminal domain phosphat...  1217   0.0  
emb|CBI35661.3| unnamed protein product [Vitis vinifera]             1167   0.0  
gb|EXB81217.1| RNA polymerase II C-terminal domain phosphatase-l...  1133   0.0  
ref|XP_006438860.1| hypothetical protein CICLE_v10030535mg [Citr...  1120   0.0  
ref|XP_002304648.2| hypothetical protein POPTR_0003s16280g [Popu...  1077   0.0  
ref|XP_002297869.2| CTD phosphatase-like protein 3 [Populus tric...  1057   0.0  
ref|XP_006438858.1| hypothetical protein CICLE_v10030535mg [Citr...  1055   0.0  
ref|XP_006603006.1| PREDICTED: RNA polymerase II C-terminal doma...  1048   0.0  
ref|XP_003530482.2| PREDICTED: RNA polymerase II C-terminal doma...  1044   0.0  
ref|XP_004157633.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymera...  1042   0.0  
ref|XP_004140651.1| PREDICTED: RNA polymerase II C-terminal doma...  1041   0.0  
ref|XP_006341905.1| PREDICTED: RNA polymerase II C-terminal doma...  1026   0.0  
ref|XP_007139315.1| hypothetical protein PHAVU_008G019000g [Phas...  1006   0.0  
ref|XP_004310239.1| PREDICTED: RNA polymerase II C-terminal doma...  1004   0.0  
ref|XP_004492028.1| PREDICTED: RNA polymerase II C-terminal doma...  1000   0.0  
ref|XP_004492029.1| PREDICTED: RNA polymerase II C-terminal doma...   995   0.0  
ref|XP_002304714.2| hypothetical protein POPTR_0003s16280g [Popu...   981   0.0  
ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative ...   970   0.0  
ref|XP_007225412.1| hypothetical protein PRUPE_ppa000589mg [Prun...   961   0.0  

>ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Vitis vinifera]
          Length = 1238

 Score = 1258 bits (3255), Expect = 0.0
 Identities = 733/1332 (55%), Positives = 899/1332 (67%), Gaps = 22/1332 (1%)
 Frame = -2

Query: 4546 EKETLVTMARSSILIEDVEEGEISDTASIEEISEDDFKKQDVRILKEQEVRVLKESSKPQ 4367
            EKE  + M      IEDVEEGEISD+AS+EEISE+DF KQ+VR+L+E         +KP+
Sbjct: 3    EKENNIMMG-----IEDVEEGEISDSASVEEISEEDFNKQEVRVLRE---------AKPK 48

Query: 4366 VEPSRGWESQEVFDAYK--RDTYEFRRGLYNFAWAQAVQNRPLNNPIVFAVEVEPDVGSN 4193
             + +R W  +++ D YK  +    +   LYN AWAQAVQN+PLN+  +F ++ E    S+
Sbjct: 49   AD-TRVWTMRDLQDLYKYHQACSGYTPRLYNLAWAQAVQNKPLND--IFVMDDEESKRSS 105

Query: 4192 RXXXXXXXSGANESKEDTNVISXXXXXXXXXXXXXELEEGEIDLDSEEPTEKLAVGGGEE 4013
                      +N S++D++                  E  ++ +D           G E 
Sbjct: 106  S--------SSNTSRDDSSSAK---------------EVAKVIIDDS---------GDEM 133

Query: 4012 GVLNDSVIVKEEGLNHSPSVKEGP--LDDSVGVREDVILNDSASVSEWEIDSKEKELVER 3839
             V  D V  KEEG      ++EG   LD    V+++  + D   V+E EID KE+ELVER
Sbjct: 134  DVKMDDVSEKEEG-----ELEEGEIDLDSEPDVKDEGGVLD---VNEPEIDLKERELVER 185

Query: 3838 VNSIRETLDNANFTENAQKSFEGACSRLQNSLESLKEIISEG----ASVSSKDDLIQQSY 3671
            V SI+E L++    E A+KSF G CSRLQN+L SL+++  E     +SV +KD L QQ  
Sbjct: 186  VKSIQEDLESVTVIE-AEKSFSGVCSRLQNTLGSLQKVFGEKVVGESSVPTKDALAQQLI 244

Query: 3670 TAIQSVNSVFCTSNHSQKEQNKAIFSRLLTFVMSQDHPLFSPEEMKEINGILSSMDSPAV 3491
             AI+++N VFC+ N +QKE NK +FSRLL+ V   D P+FS + +KE+  ++S +D+PA 
Sbjct: 245  NAIRALNHVFCSMNSNQKELNKDVFSRLLSCVECGDSPIFSIQHIKEVEVMMSFLDTPAA 304

Query: 3490 FSSFDFSNKEEEKPITCGVNHNVSDAFAENASHEVLNSSKNNFLLDSSSVESFLNKNAPD 3311
             SS + S+K  +  +T G+N N+ D+  E++     ++ K +  LDS SVES+ N+N PD
Sbjct: 305  QSSAEASDKVNDVQVTDGMNRNILDSSVESSGRAFASAKKLS--LDSISVESY-NQNNPD 361

Query: 3310 VLSEALKPALYDLRGRGVLAPLLDLHKDHDADNLPSPTRGTTPCLPVFKLSAIGHGVAKP 3131
                ALKP L   RGR +  PLLDLHKDHD D+LPSPT     C PV           K 
Sbjct: 362  ----ALKPGLSSSRGRFIFGPLLDLHKDHDEDSLPSPTGKAPQCFPV----------NKS 407

Query: 3130 EVAPARVAHERENSIMHPYETEALKAVSTYQQKFGRSSFFSQDRLPSPTPSEECDDADGD 2951
            E+  A+VAHE ++SIMHPYET+ALKAVSTYQQKFG +SF   D+LPSPTPSEE  D  GD
Sbjct: 408  ELVTAKVAHETQDSIMHPYETDALKAVSTYQQKFGLTSFLPIDKLPSPTPSEESGDTYGD 467

Query: 2950 TGGEVSSSLTVGNVKNVNLPILARPVVSSAPHMDVSSAR--TSG--SMIFGSNPTLK--- 2792
              GEVSSS T+      N P L  P+VSSAP MD S  +  T G  + +  S P L    
Sbjct: 468  ISGEVSSSSTISAPITANAPALGHPIVSSAPQMDSSIVQGPTVGRNTSLVSSGPHLDSSV 527

Query: 2791 -ASSKSRDPRLRLANSDVGAMDINQGPLPVVHNAPKVDPIGAVISSRKHKSVEEPL-DGP 2618
             AS+KSRDPRLRLA+SD G++D+N+ PLP V N+PKVDP+G ++SSRK KS EEPL DGP
Sbjct: 528  VASAKSRDPRLRLASSDAGSLDLNERPLPAVSNSPKVDPLGEIVSSRKQKSAEEPLLDGP 587

Query: 2617 ALKRQRNGSTN-YDVRDAKTVPESGGWLENDAGTVGPQFMNQSHLIENTENYLKKMETVV 2441
              KRQRNG T+   VRDA+TV  SGGWLE D+ TV PQ MN++ LIENT    KK+E+ V
Sbjct: 588  VTKRQRNGLTSPATVRDAQTVVASGGWLE-DSNTVIPQMMNRNQLIENTGTDPKKLESKV 646

Query: 2440 SFSSTAPGKPIVT-SGNEQVPLTGTSTIGDLRNYTPNPIMVQYLLKLQRLESEALKKPFL 2264
            + +     KP VT +GNE +P+  TST   L++   + I V   + +        +K   
Sbjct: 647  TVTGIGCDKPYVTVNGNEHLPVVATSTTASLQSLLKD-IAVNPAVWMNIFNKVEQQKSGD 705

Query: 2263 PPSSTVLTPISNSINSIVGAVPFGNVAPINPTGLGQKPAGILQVPQIVTVEDPGKVRMKP 2084
            P  +TVL P SNSI   +G VP  +VAP+ P+ LGQKPAG LQVPQ   +++ GKVRMKP
Sbjct: 706  PAKNTVLPPTSNSI---LGVVPPASVAPLKPSALGQKPAGALQVPQTGPMDESGKVRMKP 762

Query: 2083 RDPRRILHNNTYQKSASLGTEQFKFNGAPTLSTLGSRGNSNDNKFVDQSDRNSIASQP-S 1907
            RDPRRILH N++Q+S S G+EQFK               +N  K  DQ++  S+ S   +
Sbjct: 763  RDPRRILHANSFQRSGSSGSEQFK---------------TNAQKQEDQTETKSVPSHSVN 807

Query: 1906 LPDIARQFTKNLENIANILSXXXXXXXXXPVAPQILPSQPLQTNLDRVDAKSMNSDSVER 1727
             PDI++QFTKNL+NIA+++S            PQIL SQ +Q N DR+D K+  SDS ++
Sbjct: 808  PPDISQQFTKNLKNIADLMSASQASSMTPTF-PQILSSQSVQVNTDRMDVKATVSDSGDQ 866

Query: 1726 PTETCLTPDEGAAGTAKPQNMWGDVELLFEGYDDQQKAAIHTERTRRLEEQKKMFAARKX 1547
             T     P E AAG  + +N WGDVE LF+GYDDQQKAAI  ER RR+EEQKKMF+ARK 
Sbjct: 867  LTANGSKP-ESAAGPPQSKNTWGDVEHLFDGYDDQQKAAIQRERARRIEEQKKMFSARKL 925

Query: 1546 XXXXXXXXXXLNSAKFIEVEPMHDEILRKKEEQDREKPQRHLFRFPHMGMWTKLRPGIWN 1367
                      LNSAKF+EV+P+HDEILRKKEEQDREK QRHLFRFPHMGMWTKLRPGIWN
Sbjct: 926  CLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKSQRHLFRFPHMGMWTKLRPGIWN 985

Query: 1366 FLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKXXXXXXXXXXV--PKSK 1193
            FLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISK             PKSK
Sbjct: 986  FLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKGDDGDVLDGDERVPKSK 1045

Query: 1192 DMEGVLGMDSAVVIIDDSIRVWPHNKHNLIVVERYMYFPCSRRQFGLFGPSLLELDRDER 1013
            D+EGVLGM+SAVVIIDDS+RVWPHNK NLIVVERY YFPCSRRQFGL GPSLLE+D DER
Sbjct: 1046 DLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDER 1105

Query: 1012 PEDGTLASCLSVIERIHQIFFSQRSLDDVDVRNILACEQRKILSGCRILFSRVFPVGEAK 833
            PEDGTLAS L+VIERIHQ FFS R+LD+VDVRNILA EQRKIL+GCRI+FSRVFPVGEA 
Sbjct: 1106 PEDGTLASSLAVIERIHQSFFSNRALDEVDVRNILASEQRKILAGCRIVFSRVFPVGEAN 1165

Query: 832  PHLHPLWQTAEQFGAVCTTQIDEHVTHVVAISLGTDKVNWALHTGRYVVHPGWLEASALL 653
            PHLHPLWQTAE FGAVCT QIDE VTHVVA SLGTDKVNWAL TGR+VVHPGW+EASALL
Sbjct: 1166 PHLHPLWQTAESFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALL 1225

Query: 652  YRRANEQDFAIK 617
            YRRANEQDFAIK
Sbjct: 1226 YRRANEQDFAIK 1237


>ref|XP_007043830.1| RNA polymerase II C-terminal domain phosphatase-like 3, putative
            [Theobroma cacao] gi|508707765|gb|EOX99661.1| RNA
            polymerase II C-terminal domain phosphatase-like 3,
            putative [Theobroma cacao]
          Length = 1290

 Score = 1217 bits (3149), Expect = 0.0
 Identities = 720/1334 (53%), Positives = 881/1334 (66%), Gaps = 25/1334 (1%)
 Frame = -2

Query: 4543 KETLVTMARSSILIEDVEEGEISDTASIEEISEDDFKKQDVRILKEQEVRVLKESSKPQV 4364
            +ETL  M +    +EDVEEGEISD+ASIEEISE+DF KQDV+ILKE +     E++    
Sbjct: 18   EETLGEMGKDETKVEDVEEGEISDSASIEEISEEDFNKQDVKILKESKSSKGGEANSN-- 75

Query: 4363 EPSRGWESQEVFDAYKRDTYEFRRGLYNFAWAQAVQNRPLNNPIVFAVEVEPDVGSNRXX 4184
              SR W  Q++   Y      +  GLYNFAWAQAVQN+PLN   V   E +P    N+  
Sbjct: 76   --SRVWTMQDLCK-YPSVIRGYASGLYNFAWAQAVQNKPLNEIFVKDFE-QPQQDENK-- 129

Query: 4183 XXXXXSGANESKEDTNVISXXXXXXXXXXXXXELEEGEIDLDSEEPTEKLAVGGGEEGVL 4004
                   +  S   ++V S              ++   ID DSE+  E       E+ V+
Sbjct: 130  ------NSKRSSPSSSVASVNSKEEKGSSGNLAVKV-VIDDDSEDEME-------EDKVV 175

Query: 4003 NDSVIVKEEGLNHSPSVKEGPLDDSVGVREDVILNDSASVSEWEIDSKEKELVERVNSIR 3824
            N   + KEEG      ++EG +D     +E V+ ++  +V   +      EL +R N IR
Sbjct: 176  N---LDKEEG-----ELEEGEIDLDSEPKEKVLSSEDGNVGNSD------ELEKRANLIR 221

Query: 3823 ETLDNANFTENAQKSFEGACSRLQNSLESLKEIISEGASVSSKDDLIQQSYTAIQSVNSV 3644
              L+     E A+KSFEG CSRL N+LESL+ +I E  SV +KD LIQ ++ AI   NS 
Sbjct: 222  GVLEGVTVIE-AEKSFEGVCSRLHNALESLRALILE-CSVPAKDALIQLAFGAI---NSA 276

Query: 3643 FCTSNHSQKEQNKAIFSRLLTFVMSQDHPLFSPEEMKEINGILSSMDSPAVFSSFDFSNK 3464
            F   N + KEQN AI SRLL+ V   D  LF P++MKEI+ +L S++SPA        + 
Sbjct: 277  FVALNCNSKEQNVAILSRLLSIVKGHDPSLFPPDKMKEIDVMLISLNSPA-----RAIDT 331

Query: 3463 EEEKPITCGVNHNVSDAFAENASHEVLNSSKNNFLLDSSSVESFLNKNAPDVLSEALKPA 3284
            E++  +  GVN    DA  EN  H++  ++K    L SS+   F+  N P+ L+E LKP 
Sbjct: 332  EKDMKVVDGVNKKDPDALPENICHDLTVTNK----LPSSA--KFVINNKPNALTETLKPG 385

Query: 3283 LYDLRGRGVLAPLLDLHKDHDADNLPSPTRGTTPCLPVFKLSAIGHGVAKPEVAPARVAH 3104
            + + R RG+  PLLDLHKDHDAD+LPSPTR TTPCLPV K    G  + K      + +H
Sbjct: 386  VPNFRNRGISLPLLDLHKDHDADSLPSPTRETTPCLPVNKPLTSGDVMVKSGFMTGKGSH 445

Query: 3103 ERENSIMHPYETEALKAVSTYQQKFGRSSFFSQDRLPSPTPSEECDDADGDTGGEVSSSL 2924
            + E   +HPYET+ALKA STYQQKFG+ SFFS DRLPSPTPSEE  D  GD GGEVSSS 
Sbjct: 446  DAEGDKLHPYETDALKAFSTYQQKFGQGSFFSSDRLPSPTPSEESGDEGGDNGGEVSSSS 505

Query: 2923 TVGNVKNVNLPILARPVVSSAPHMDVSSARTSGS--------MIFGSNPTLKASSKSRDP 2768
            ++GN K  NLPIL  P+VSSAP +D +S+   G         M   SN   K+ +KSRDP
Sbjct: 506  SIGNFKP-NLPILGHPIVSSAPLVDSASSSLQGQITTRNATPMSSVSNIVSKSLAKSRDP 564

Query: 2767 RLRLANSDVGAMDINQGPLPVVHNAPKVDPIGAVISSRKHKSVEEP-LDGPALKRQRNGS 2591
            RL  ANS+  A+D+N+    ++HNA KV P+G ++ SRK KSVEEP LD PALKRQRN  
Sbjct: 565  RLWFANSNASALDLNER---LLHNASKVAPVGGIMDSRKKKSVEEPILDSPALKRQRNEL 621

Query: 2590 TNYDV-RDAKTVPESGGWLENDAGTVGPQFMNQSHLIENTENYLKKMETVVSFSSTAPGK 2414
             N  V RD +TV   GGWLE D   +G Q  N++   EN E+  +KM+  V+ SST  GK
Sbjct: 622  ENLGVARDVQTVSGIGGWLE-DTDAIGSQITNRNQTAENLESNSRKMDNGVTSSSTLSGK 680

Query: 2413 PIVTSG-NEQVPLTGTSTIGD---LRNYTPNPIMVQYLLKL---QRLESEALKKPFLPPS 2255
              +T G NEQVP+T TST      L++   NP M+  +LK+   QRL +EA +K   P  
Sbjct: 681  TNITVGTNEQVPVTSTSTPSLPALLKDIAVNPTMLINILKMGQQQRLGAEAQQKSPDPVK 740

Query: 2254 STVLTPISNSINSIVGAV-----PFGNVAPINPTGLGQKPAGILQVPQIVTVEDPGKVRM 2090
            ST   P SNS+  +V +      P  N  P   +G+  KPAG LQVP   + ++ GK+RM
Sbjct: 741  STFHQPSSNSLLGVVSSTNVIPSPSVNNVPSISSGISSKPAGNLQVP---SPDESGKIRM 797

Query: 2089 KPRDPRRILHNNTYQKSASLGTEQFKFNGAPTLSTLGSRGNSNDNKFVDQSDRNSIASQ- 1913
            KPRDPRR+LH N+ Q+S S+G +Q K NGA T ST GS+ N N  K   Q++   + SQ 
Sbjct: 798  KPRDPRRVLHGNSLQRSGSMGLDQLKTNGALTSSTQGSKDNLNAQKLDSQTESKPMQSQL 857

Query: 1912 PSLPDIARQFTKNLENIANILSXXXXXXXXXPVAPQILPSQPLQTNLDRVDAKSMNSDSV 1733
               PDI +QFT NL+NIA+I+S         PV+  ++P QP+    D +D K++ S+S 
Sbjct: 858  VPPPDITQQFTNNLKNIADIMSVSQALTSLPPVSHNLVP-QPVLIKSDSMDMKALVSNSE 916

Query: 1732 ERPTETCLTPDEGAAGTAKPQNMWGDVELLFEGYDDQQKAAIHTERTRRLEEQKKMFAAR 1553
            ++ T   L P+ GA G  + QN WGDVE LFE YDDQQKAAI  ER RR+EEQKKMF+AR
Sbjct: 917  DQQTGAGLAPEAGATGP-RSQNAWGDVEHLFERYDDQQKAAIQRERARRIEEQKKMFSAR 975

Query: 1552 KXXXXXXXXXXXLNSAKFIEVEPMHDEILRKKEEQDREKPQRHLFRFPHMGMWTKLRPGI 1373
            K           LNSAKFIEV+P+H+EILRKKEEQDREKP+RHLFRF HMGMWTKLRPGI
Sbjct: 976  KLCLVLDLDHTLLNSAKFIEVDPVHEEILRKKEEQDREKPERHLFRFHHMGMWTKLRPGI 1035

Query: 1372 WNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKXXXXXXXXXXV--PK 1199
            WNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS+             P+
Sbjct: 1036 WNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPR 1095

Query: 1198 SKDMEGVLGMDSAVVIIDDSIRVWPHNKHNLIVVERYMYFPCSRRQFGLFGPSLLELDRD 1019
            SKD+EGVLGM+SAVVIIDDS+RVWPHNK NLIVVERY YFPCSRRQFGL GPSLLE+D D
Sbjct: 1096 SKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHD 1155

Query: 1018 ERPEDGTLASCLSVIERIHQIFFSQRSLDDVDVRNILACEQRKILSGCRILFSRVFPVGE 839
            ERPEDGTLAS L+VIERIHQ FFS ++LDDVDVRNILA EQRKIL+GCRI+FSRVFPVGE
Sbjct: 1156 ERPEDGTLASSLAVIERIHQDFFSHQNLDDVDVRNILASEQRKILAGCRIVFSRVFPVGE 1215

Query: 838  AKPHLHPLWQTAEQFGAVCTTQIDEHVTHVVAISLGTDKVNWALHTGRYVVHPGWLEASA 659
            A PHLHPLWQTAEQFGAVCT QIDEHVTHVVA SLGTDKVNWAL TG++VVHPGW+EASA
Sbjct: 1216 ANPHLHPLWQTAEQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKFVVHPGWVEASA 1275

Query: 658  LLYRRANEQDFAIK 617
            LLYRRANE DFAIK
Sbjct: 1276 LLYRRANEVDFAIK 1289


>emb|CBI35661.3| unnamed protein product [Vitis vinifera]
          Length = 1184

 Score = 1167 bits (3018), Expect = 0.0
 Identities = 687/1314 (52%), Positives = 833/1314 (63%), Gaps = 18/1314 (1%)
 Frame = -2

Query: 4504 IEDVEEGEISDTASIEEISEDDFKKQDVRILKEQEVRVLKESSKPQVEPSRGWESQEVFD 4325
            IEDVEEGEISD+AS+EEISE+DF KQ+VR+L+E         +KP+ + +R W  +++ D
Sbjct: 52   IEDVEEGEISDSASVEEISEEDFNKQEVRVLRE---------AKPKAD-TRVWTMRDLQD 101

Query: 4324 AYK--RDTYEFRRGLYNFAWAQAVQNRPLNNPIVFAVEVEPDVGSNRXXXXXXXSGANES 4151
             YK  +    +   LYN AWAQAVQN+PLN+  V    +  D G          S   E 
Sbjct: 102  LYKYHQACSGYTPRLYNLAWAQAVQNKPLNDIFV----IIDDSGDEMDVKMDDVSEKEEG 157

Query: 4150 KEDTNVISXXXXXXXXXXXXXELEEGEIDLDSEEPTEKLAVGGGEEGVLNDSVIVKEEGL 3971
            +                     LEEGEIDLDSE                     VK+EG 
Sbjct: 158  E---------------------LEEGEIDLDSEPD-------------------VKDEG- 176

Query: 3970 NHSPSVKEGPLDDSVGVREDVILNDSASVSEWEIDSKEKELVERVNSIRETLDNANFTEN 3791
                    G LD                V+E EID KE+ELVERV SI+E L++    E 
Sbjct: 177  --------GVLD----------------VNEPEIDLKERELVERVKSIQEDLESVTVIE- 211

Query: 3790 AQKSFEGACSRLQNSLESLKEIISEG----ASVSSKDDLIQQSYTAIQSVNSVFCTSNHS 3623
            A+KSF G CSRLQN+L SL+++  E     +SV +KD L QQ   AI+++N VFC+ N +
Sbjct: 212  AEKSFSGVCSRLQNTLGSLQKVFGEKVVGESSVPTKDALAQQLINAIRALNHVFCSMNSN 271

Query: 3622 QKEQNKAIFSRLLTFVMSQDHPLFSPEEMKEINGILSSMDSPAVFSSFDFSNKEEEKPIT 3443
            QKE NK +FSRLL+ V   D P+FS + +KE+  ++S +D+PA  SS + S+K  +  +T
Sbjct: 272  QKELNKDVFSRLLSCVECGDSPIFSIQHIKEVEVMMSFLDTPAAQSSAEASDKVNDVQVT 331

Query: 3442 CGVNHNVSDAFAENASHEVLNSSKNNFLLDSSSVESFLNKNAPDVLSEALKPALYDLRGR 3263
             G+N N+ D+  E++     ++ K                                 RGR
Sbjct: 332  DGMNRNILDSSVESSGRAFASAKK--------------------------------FRGR 359

Query: 3262 GVLAPLLDLHKDHDADNLPSPTRGTTPCLPVFKLSAIGHGVAKPEVAPARVAHERENSIM 3083
             +  PLLDLHKDHD D+LPSPT     C PV           K E+  A+VAHE ++SIM
Sbjct: 360  FIFGPLLDLHKDHDEDSLPSPTGKAPQCFPV----------NKSELVTAKVAHETQDSIM 409

Query: 3082 HPYETEALKAVSTYQQKFGRSSFFSQDRLPSPTPSEECDDADGDTGGEVSSSLTVGNVKN 2903
            HPYET+ALKAVSTYQQKFG +SF   D+LPSPTPSEE  D  GD  GEVSSS T+     
Sbjct: 410  HPYETDALKAVSTYQQKFGLTSFLPIDKLPSPTPSEESGDTYGDISGEVSSSSTISAPIT 469

Query: 2902 VNLPILARPVVSSAPHMDVSSA----RTSGSMIFGSNPTLKASSKSRDPRLRLANSDVGA 2735
             N P L  P+VSSAP MD+       R +G++    N  L+AS+KSRDPRLRLA+SD G+
Sbjct: 470  ANAPALGHPIVSSAPQMDIVQGLVVPRNTGAVNSRFNSILRASAKSRDPRLRLASSDAGS 529

Query: 2734 MDINQGPLPVVHNAPKVDPIGAVISSRKHKSVEEPL-DGPALKRQRNGSTNYDVRDAKTV 2558
            +D+N+ PLP V N+PKVDP+G ++SSRK KS EEPL DGP  KRQRNG T+         
Sbjct: 530  LDLNERPLPAVSNSPKVDPLGEIVSSRKQKSAEEPLLDGPVTKRQRNGLTS--------- 580

Query: 2557 PESGGWLENDAGTVGPQFMNQSHLIENTENYLKKMETVVSFSSTAPGKPIVT-SGNEQVP 2381
            P +                              K+E+ V+ +     KP VT +GNE +P
Sbjct: 581  PAT------------------------------KLESKVTVTGIGCDKPYVTVNGNEHLP 610

Query: 2380 LTGTSTIGDLRNYTPNPIMVQYLLKLQRLESEALKKPFLPPSSTVLTPISNSINSIVGAV 2201
            +  TST   L++   + I V   + +        +K   P  +TVL P SNSI   +G V
Sbjct: 611  VVATSTTASLQSLLKD-IAVNPAVWMNIFNKVEQQKSGDPAKNTVLPPTSNSI---LGVV 666

Query: 2200 PFGNVAPINPTGLGQKPAGILQVPQIVTV---EDPGKVRMKPRDPRRILHNNTYQKSASL 2030
            P  +VAP+ P+ LGQKPAG LQVPQ   +   ++ GKVRMKPRDPRRILH N++Q+S S 
Sbjct: 667  PPASVAPLKPSALGQKPAGALQVPQTGPMNPQDESGKVRMKPRDPRRILHANSFQRSGSS 726

Query: 2029 GTEQFKFNGAPTLSTLGSRGNSNDNKFVDQSDRNSIASQP-SLPDIARQFTKNLENIANI 1853
            G+EQFK               +N  K  DQ++  S+ S   + PDI++QFTKNL+NIA++
Sbjct: 727  GSEQFK---------------TNAQKQEDQTETKSVPSHSVNPPDISQQFTKNLKNIADL 771

Query: 1852 LSXXXXXXXXXPVAPQILPSQPLQTNLDRVDAKSMNSDSVERPTETCLTPDEGAAGTAKP 1673
            +S            PQIL SQ +Q N DR+D K+  SDS ++ T     P E AAG  + 
Sbjct: 772  MSASQASSMTPTF-PQILSSQSVQVNTDRMDVKATVSDSGDQLTANGSKP-ESAAGPPQS 829

Query: 1672 QNMWGDVELLFEGYDDQQKAAIHTERTRRLEEQKKMFAARKXXXXXXXXXXXLNSAKFIE 1493
            +N WGDVE LF+GYDDQQKAAI  ER RR+EEQKKMF+ARK           LNSAKF+E
Sbjct: 830  KNTWGDVEHLFDGYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFVE 889

Query: 1492 VEPMHDEILRKKEEQDREKPQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGN 1313
            V+P+HDEILRKKEEQDREK QRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGN
Sbjct: 890  VDPVHDEILRKKEEQDREKSQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGN 949

Query: 1312 KLYATEMAKVLDPKGVLFAGRVISKXXXXXXXXXXV--PKSKDMEGVLGMDSAVVIIDDS 1139
            KLYATEMAKVLDPKGVLFAGRVISK             PKSKD+EGVLGM+SAVVIIDDS
Sbjct: 950  KLYATEMAKVLDPKGVLFAGRVISKGDDGDVLDGDERVPKSKDLEGVLGMESAVVIIDDS 1009

Query: 1138 IRVWPHNKHNLIVVERYMYFPCSRRQFGLFGPSLLELDRDERPEDGTLASCLSVIERIHQ 959
            +RVWPHNK NLIVVERY YFPCSRRQFGL GPSLLE+D DERPEDGTLAS L+VIERIHQ
Sbjct: 1010 VRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASSLAVIERIHQ 1069

Query: 958  IFFSQRSLDDVDVRNILACEQRKILSGCRILFSRVFPVGEAKPHLHPLWQTAEQFGAVCT 779
             FFS R+LD+VDVRNILA EQRKIL+GCRI+FSRVFPVGEA PHLHPLWQTAE FGAVCT
Sbjct: 1070 SFFSNRALDEVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAESFGAVCT 1129

Query: 778  TQIDEHVTHVVAISLGTDKVNWALHTGRYVVHPGWLEASALLYRRANEQDFAIK 617
             QIDE VTHVVA SLGTDKVNWAL TGR+VVHPGW+EASALLYRRANEQDFAIK
Sbjct: 1130 NQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1183


>gb|EXB81217.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Morus
            notabilis]
          Length = 1301

 Score = 1133 bits (2931), Expect = 0.0
 Identities = 692/1322 (52%), Positives = 842/1322 (63%), Gaps = 41/1322 (3%)
 Frame = -2

Query: 4516 SSILIEDVEEGEISDTASIEEISEDDFKKQDVRILKEQEVRVLKESSKPQVE--PSRGWE 4343
            S  ++EDVEEGEISD+AS+EEISE+DF KQ+       +V  + +S+  + +   SR W 
Sbjct: 6    SGRVVEDVEEGEISDSASVEEISEEDFNKQEGNGTGSGKVMSVSDSNSKESKFGDSRVWT 65

Query: 4342 SQEVFDAYKRDTYEFRRGLYNFAWAQAVQNRPLNNPIVFAVEVE-----------PDVGS 4196
             ++++  Y      +  GLYN AWAQAVQN+PLN   V  V+ +           P V S
Sbjct: 66   MRDLYANYP-GFRGYTTGLYNLAWAQAVQNKPLNEIFVMDVDADDSSRVVLSSASPAVNS 124

Query: 4195 NRXXXXXXXSGANESKEDTNVISXXXXXXXXXXXXXELEEGEIDLDSEEPTEKLAVGGGE 4016
             R        G N  KE   V               ELEEGEIDL+SE PT+K A   GE
Sbjct: 125  GRRE------GKNGVKEVEKVEKVVIDDSADEMEEGELEEGEIDLESE-PTQKPA---GE 174

Query: 4015 EGVLNDSVIVKEEGLNHSPSVKEGPLDDSVGVREDVILNDSASVSEWEIDSKEKELVERV 3836
            E                    K+G L+            ++ +V   E+DS+  EL +RV
Sbjct: 175  EA-------------------KDGDLNC-----------EAENVGGLEVDSRRDELEKRV 204

Query: 3835 NSIRETLDNANFTENAQKSFEGACSRLQNSLESLKEIISEGA-SVSSKDDLIQQSYTAIQ 3659
            + I ETL + N   NA+KSFE  CSRLQ +LESL+ ++SE   S  +KD +IQ S TAIQ
Sbjct: 205  DLIWETLGSVNVV-NAEKSFEEVCSRLQRTLESLRGVLSEKEFSFPTKDVVIQMSITAIQ 263

Query: 3658 SVNSVFCTSNHSQKEQNKAIFSRLLTFVMSQDHPLFSPEEMKEINGILSSMDSPAVFSSF 3479
             VNSVFC+ + +QKEQ K   SRL   V +   PLFSPE+ KEI  ++SS++   V  S 
Sbjct: 264  VVNSVFCSMSVNQKEQKKETLSRLFCSVKNCGTPLFSPEQTKEIELMISSLNPLNVLPSS 323

Query: 3478 DFSNKEEEKPITCGVNH---NVSDAFAENASHEVLNSSKNNFLLDSSSVESFLNKNAPDV 3308
              S+KE+E  I   ++    N+++A AENAS E     + +  L    V S ++ N P  
Sbjct: 324  GASDKEKETQIIERLHEMDSNLTNANAENASIE-----RTSVKLPQDCVASVVHSN-PIT 377

Query: 3307 LSEALKPALYDLRGRGVLAPLLDLHKDHDADNLPSPTRGTTPCLPVFKLSAIGHGVAKPE 3128
            L E L+P     +GRG+L PLLDLHKDHDAD+LPSPTR    C PV+K   +  G+ KP 
Sbjct: 378  LPELLRPGTLAFKGRGLLLPLLDLHKDHDADSLPSPTREAPSCFPVYKPLGVADGIIKPV 437

Query: 3127 VAPARVAHERENSIMHPYETEALKAVSTYQQKFGRSSFFSQDRLPSPTPSEECDDADGDT 2948
               A+VA   E S +H YET+ALKAVSTYQQKFGR SF   DRLPSPTPSEECD+ D D 
Sbjct: 438  STTAKVAPGAEESRLHRYETDALKAVSTYQQKFGRGSFLMSDRLPSPTPSEECDEED-DI 496

Query: 2947 GGEVSSSLTVGNVKNVNLPILARPVVSSAPHMDVSS--------ARTSGSMIFGSNPTLK 2792
              EVSSSLT GN++   +PIL   VV+S+  + VSS        A+ +  +  GSN T+K
Sbjct: 497  NQEVSSSLTSGNLRTPAIPILRPSVVTSS--VPVSSPTMQGPIAAKNAAPVGSGSNSTMK 554

Query: 2791 ASSKSRDPRLRLANSDVGAMDINQGPLPVVHNAPKVDPIGAVISSRKHKSVEEP-LDGPA 2615
            AS++SRDPRLR ANSD GA+D+NQ PL  VHN PKV+P G   SSRK + VEEP LDGPA
Sbjct: 555  ASARSRDPRLRFANSDAGALDLNQRPLTAVHNGPKVEP-GDPTSSRKQRIVEEPNLDGPA 613

Query: 2614 LKRQRNGSTNYDVRDAKTVPESGGWLENDAGTVGPQFMNQSHLIENTENYLKKMETVVSF 2435
            LKRQR+   +  + D KT    GGWLE D GT GPQ MN++ L+EN E   +K   +V+ 
Sbjct: 614  LKRQRHAFVSAKI-DVKTASGVGGWLE-DNGTTGPQIMNKNQLVENAEADPRKSIHLVNG 671

Query: 2434 SSTAPGKPIVTSGNEQVPLTGTSTIGDL----RNYTPNP-IMVQYLLKL---QRLESEAL 2279
                 G  I   G EQVP+TGTST   L    ++   NP I +  L KL   Q L ++A 
Sbjct: 672  PIMNNGPNI---GKEQVPVTGTSTPDALPAILKDIAVNPTIFMDILNKLGQQQLLAADAQ 728

Query: 2278 KKPFLPPSSTVLTPISNSINSIVGAVPFGNVAPINPTGLGQKPAGIL----QVPQIVTVE 2111
            +K      S+  T      NSI+GA P  NVAP   +G+ Q PA  L    QV      +
Sbjct: 729  QKS----DSSKNTTHPPGTNSILGAAPLVNVAPSKASGILQTPAVSLPTTSQVATASMQD 784

Query: 2110 DPGKVRMKPRDPRRILHNNTYQKSASLGTEQFKFNGAPTLSTLGSRGNSNDNKFVDQSDR 1931
            + GK+RMKPRDPRR+LH N  QKS SLG EQFK   +    T G++ N N      Q+D+
Sbjct: 785  ELGKIRMKPRDPRRVLHGNMLQKSWSLGHEQFKPIVSSVSCTPGNKDNLNGPVQEGQADK 844

Query: 1930 NSIASQPSL-PDIARQFTKNLENIANILSXXXXXXXXXPVAPQILPSQPLQTNLDRVDAK 1754
              + SQ  + PDIARQFTKNL NIA+++S          V+ Q L SQPL    DR D K
Sbjct: 845  KQVPSQLVVQPDIARQFTKNLRNIADLMSVSQASTSPATVS-QNLSSQPLPVKPDRGDVK 903

Query: 1753 SMNSDSVERPTETCLTPDEGAAGTAKPQNMWGDVELLFEGYDDQQKAAIHTERTRRLEEQ 1574
            ++  +S ++ + T  TP+   A  ++  N WGDVE LFEGYDD+QKAAI  ER RRLEEQ
Sbjct: 904  AVVPNSEDQHSGTNSTPETTLAVPSRTPNAWGDVEHLFEGYDDEQKAAIQRERARRLEEQ 963

Query: 1573 KKMFAARKXXXXXXXXXXXLNSAKFIEVEPMHDEILRKKEEQDREKPQRHLFRFPHMGMW 1394
            KKMF A K           LNSAKF+EV+ +HDEILRKKEEQDREKPQRHLFRFPHMGMW
Sbjct: 964  KKMFDAHKLCLVLDLDHTLLNSAKFVEVDSVHDEILRKKEEQDREKPQRHLFRFPHMGMW 1023

Query: 1393 TKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKXXXXXXXX 1214
            TKLRPG+WNFLEKASKLYELHLYTMGNKLYATEMAKVLDP G LF+GRVIS+        
Sbjct: 1024 TKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPMGTLFSGRVISRGDDGDPFD 1083

Query: 1213 XXV--PKSKDMEGVLGMDSAVVIIDDSIRVWPHNKHNLIVVERYMYFPCSRRQFGLFGPS 1040
                 PKSKD+EGVLGM+S+VVIIDDS+RVWPHNK NLIVVERY YFPCSRRQFGL GPS
Sbjct: 1084 GDERVPKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPS 1143

Query: 1039 LLELDRDERPEDGTLASCLSVIERIHQIFFSQRSLDDVDVRNILACEQRKILSGCRILFS 860
            LLE+D DERPE GTLAS L+VIE+IHQ FFS  SLD+VDVRNILA EQRKIL+GCRI+FS
Sbjct: 1144 LLEIDHDERPEQGTLASSLAVIEKIHQNFFSHHSLDEVDVRNILASEQRKILAGCRIVFS 1203

Query: 859  RVFPVGEAKPHLHPLWQTAEQFGAVCTTQIDEHVTHVVAISLGTDKVNWALHTGRYVVHP 680
            RVFPV E  PHLHPLWQTAEQFGAVCTTQID+ VTHVVA S GTDKVNWAL  G++ VHP
Sbjct: 1204 RVFPVSEVNPHLHPLWQTAEQFGAVCTTQIDDQVTHVVANSPGTDKVNWALANGKFAVHP 1263

Query: 679  GW 674
            GW
Sbjct: 1264 GW 1265


>ref|XP_006438860.1| hypothetical protein CICLE_v10030535mg [Citrus clementina]
            gi|568858958|ref|XP_006483010.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 3-like
            [Citrus sinensis] gi|557541056|gb|ESR52100.1|
            hypothetical protein CICLE_v10030535mg [Citrus
            clementina]
          Length = 1234

 Score = 1120 bits (2898), Expect = 0.0
 Identities = 679/1334 (50%), Positives = 852/1334 (63%), Gaps = 39/1334 (2%)
 Frame = -2

Query: 4501 EDVEEGEISDTASIEEISEDDFKKQDVRILKEQEVRVLKESSKPQV----EPSRGWESQE 4334
            +DVEEGEISDTAS+EEISE+DFK     I +E+ V+V+KE+   +V      +R W  ++
Sbjct: 3    KDVEEGEISDTASVEEISEEDFK-----IKQEEVVKVVKETKPIKVGGGEAAARVWTMRD 57

Query: 4333 VFDAYKRDTYEFRRGLYNFAWAQAVQNRPLNNPIVFAVEVEPDVGSNRXXXXXXXS---- 4166
            +++ Y      +  GL+N AWAQAVQN+PLN   +F +E E D  S R       +    
Sbjct: 58   LYNKYPAICRGYGPGLHNLAWAQAVQNKPLNE--IFVMEAEQDDVSKRSSPASSVASVNS 115

Query: 4165 GANESKEDTNVISXXXXXXXXXXXXXELEEGEIDLDSEEPTEKLAVGGGEEGVLNDS--V 3992
            GA   K+D  V+                                     E+ V++DS   
Sbjct: 116  GAAAGKDDKKVV-------------------------------------EKVVIDDSGDE 138

Query: 3991 IVKEEGLNHSPSVKEGPLDDSVGVREDVILNDSASVSEWEIDSKEKELVERVNSIRETLD 3812
            I KEEG      ++EG ++       D+    +  VSE     KE+  +  V SIRE L+
Sbjct: 139  IEKEEG-----ELEEGEIE------LDLESESNEKVSE---QVKEEMKLINVESIREALE 184

Query: 3811 NANFTENAQKSFEGACSRLQNSLESLKEIISEGASVSSKDDLIQQSYTAIQSVNSVFCTS 3632
            +         SFEG CS+L+ +LESL+E+++E  +V +KD LIQ +++A+QSV+SVFC+ 
Sbjct: 185  SVL---RGDISFEGVCSKLEFTLESLRELVNEN-NVPTKDALIQLAFSAVQSVHSVFCSM 240

Query: 3631 NHSQKEQNKAIFSRLLTFVMSQDHPLFSPEEMKEINGILSSMDSPAVFSSFDFSNKEEEK 3452
            NH  KEQNK I SRLL+ + S + PLFS  ++KE+  +LSS+ + A       ++KE++ 
Sbjct: 241  NHVLKEQNKEILSRLLSVIKSHEPPLFSSNQIKEMEAMLSSLVTRA-------NDKEKDM 293

Query: 3451 PITCGVNHNVSDAFAENASHEVLNSSKNNFLLDSSSVESFLNKNAPDVLSEALKPALYDL 3272
                GVN   S+   ENA +++    K    +DS      L +N P    EA KP     
Sbjct: 294  LAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDS------LMQNKP---LEASKPGPPGY 344

Query: 3271 RGRGVLAPLLDLHKDHDADNLPSPTRGTTPCLPVFKLSAIGHGVAKPEVAPARVAHEREN 3092
            R RGVL PLLD HK HD D+LPSPTR TTP +PV +   +G GV K   A A+++H  E 
Sbjct: 345  RSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGVVKSWAAAAKLSHNAEV 404

Query: 3091 SIMHPYETEALKAVSTYQQKFGRSSFFSQDRLPSPTPSEECDDADGDTGGEVSSSLTVGN 2912
                 YET+AL+A S+YQQKFGR+SFF    LPSPTPSEE  D DGDTGGE+SS+  V  
Sbjct: 405  HKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAVDQ 464

Query: 2911 VKNVNLPILARPVVSSAPH-----MDVSSAR-----------TSG-SMIFGSNPTLKASS 2783
             K VN+P L +  VSS P      MD+SS +           +SG + +   NP +KA  
Sbjct: 465  PKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVVKAPI 524

Query: 2782 KSRDPRLRLANSDVGAMDINQGPLPVVHNAPKVDPIGAVISSRKHKSVEEP-LDGPALKR 2606
            KSRDPRLR A+S+  A+++N  P P++HNAPKV+P+G V+SSRK K+VEEP LDGPALKR
Sbjct: 525  KSRDPRLRFASSN--ALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKR 582

Query: 2605 QRNGSTNYDV-RDAKTVPESGGWLENDAGTVGPQFMNQSHLIENTENYLKKMETVVSFSS 2429
            QRNG  N  V RD K +  SGGWLE D     PQ MN++ L+++ E+  +K++   +   
Sbjct: 583  QRNGFENSGVVRDEKNIYGSGGWLE-DTDMFEPQIMNRNLLVDSAESNSRKLDNGATSPI 641

Query: 2428 TAPGKPIVTSGNEQVPLTGTSTIGDL----RNYTPNPIMVQYLLKL---QRLESEALKKP 2270
            T+    +V SGNE  P T  ST   L    ++   NP M+  +LK+   Q+L ++A +K 
Sbjct: 642  TSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQKS 701

Query: 2269 FLPPSSTVLTPISNSINSIVGAVPFGNVAPINPTGLGQKPAGILQVPQIVTVEDPGKVRM 2090
                 +T+  PI +SI             P++ T     P+GIL  P    +++ GKVRM
Sbjct: 702  NDSSMNTMHPPIPSSI------------PPVSVTC--SIPSGILSKP----MDELGKVRM 743

Query: 2089 KPRDPRRILHNNTYQKSASLGTEQFKFNGAPTLSTLGSRGNSNDNKFVDQSDRNSIASQP 1910
            KPRDPRR+LH N  Q+S SLG E FK +G     T GS+ N N  K +   +   + SQ 
Sbjct: 744  KPRDPRRVLHGNALQRSGSLGPE-FKTDGPSAPCTQGSKENLNFQKQLGAPEAKPVLSQS 802

Query: 1909 SL-PDIARQFTKNLENIANILSXXXXXXXXXPVAPQILPSQPLQTNLDRVDAKSMNSDSV 1733
             L PDI +QFTKNL++IA+ +S          V+ Q  P QP Q      D K++ ++  
Sbjct: 803  VLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVS-QNSPIQPGQIK-SGADMKAVVTNHD 860

Query: 1732 ERPTETCLTPDEGAAGTAKPQNMWGDVELLFEGYDDQQKAAIHTERTRRLEEQKKMFAAR 1553
            ++ T T   P+ G  G A PQ+ WGDVE LFEGYDDQQKAAI  ERTRRLEEQKKMF+AR
Sbjct: 861  DKQTGTGSGPEAGPVG-AHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSAR 919

Query: 1552 KXXXXXXXXXXXLNSAKFIEVEPMHDEILRKKEEQDREKPQRHLFRFPHMGMWTKLRPGI 1373
            K           LNSAKF EV+P+HDEILRKKEEQDREKP RHLFRFPHMGMWTKLRPGI
Sbjct: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979

Query: 1372 WNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKXXXXXXXXXXV--PK 1199
            W FLE+ASKL+E+HLYTMGNKLYATEMAKVLDPKGVLFAGRVIS+             PK
Sbjct: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPK 1039

Query: 1198 SKDMEGVLGMDSAVVIIDDSIRVWPHNKHNLIVVERYMYFPCSRRQFGLFGPSLLELDRD 1019
            SKD+EGVLGM+SAVVIIDDS+RVWPHNK NLIVVERY YFPCSRRQFGL GPSLLE+D D
Sbjct: 1040 SKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHD 1099

Query: 1018 ERPEDGTLASCLSVIERIHQIFFSQRSLDDVDVRNILACEQRKILSGCRILFSRVFPVGE 839
            ER EDGTLAS L VIER+H+IFFS +SLDDVDVRNILA EQRKIL+GCRI+FSRVFPVGE
Sbjct: 1100 ERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGE 1159

Query: 838  AKPHLHPLWQTAEQFGAVCTTQIDEHVTHVVAISLGTDKVNWALHTGRYVVHPGWLEASA 659
            A PHLHPLWQTAEQFGAVCT  ID+ VTHVVA SLGTDKVNWAL TGR+VVHPGW+EASA
Sbjct: 1160 ANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASA 1219

Query: 658  LLYRRANEQDFAIK 617
            LLYRRANEQDFAIK
Sbjct: 1220 LLYRRANEQDFAIK 1233


>ref|XP_002304648.2| hypothetical protein POPTR_0003s16280g [Populus trichocarpa]
            gi|550343308|gb|EEE79627.2| hypothetical protein
            POPTR_0003s16280g [Populus trichocarpa]
          Length = 1247

 Score = 1077 bits (2784), Expect = 0.0
 Identities = 671/1363 (49%), Positives = 827/1363 (60%), Gaps = 53/1363 (3%)
 Frame = -2

Query: 4546 EKETLVTMARSSILIEDVEEGEISDTASIEEISEDDFKKQDVRILKEQEVRVLKESSKPQ 4367
            + ET    A  S  +EDVEEGEISDTAS+EEISEDDF KQ+V ++KE        +S  +
Sbjct: 3    KNETASAAANGSGKMEDVEEGEISDTASVEEISEDDFNKQEVVVVKETPSSTTNNNSSSK 62

Query: 4366 VEPSRGWESQEVFDAYKRDTYEFR------RGLYNFAWAQAVQNRPLNNPIVFAVEVEPD 4205
                     Q+V+    RD Y+++       GLYN AWAQAVQN+PLN      VEVE D
Sbjct: 63   ---------QKVWTV--RDLYKYQVGGGYMSGLYNLAWAQAVQNKPLNE---LFVEVEVD 108

Query: 4204 VGSNRXXXXXXXSGANESKEDTNVISXXXXXXXXXXXXXELEEGEIDLDSEEPTEKLAVG 4025
              S +          N SKED   +                    ID DS          
Sbjct: 109  DSSQKSSVS----SVNSSKEDKRTVV-------------------ID-DS---------- 134

Query: 4024 GGEEGVLNDSVIVKEEGLNHSPSVKEGPLDDSVGVREDVILNDSASVSEWEIDSKEKELV 3845
            G E  V+    I KEEG      ++EG +D            DS   SE  + S + E  
Sbjct: 135  GDEMDVVKVIDIEKEEG-----ELEEGEID-----------LDSEGKSEGGMVSVDTE-- 176

Query: 3844 ERVNSIRETLDNANFTENAQKSFEGACSRLQNSLESLKEIISEGAS-VSSKDDLIQQSYT 3668
            +RV SIRE L++ +  ++  KSFE  C +L N+LESLKE++    +   SKD L++  +T
Sbjct: 177  KRVKSIREDLESVSVIKD-DKSFEAVCLKLHNALESLKELVRVNENGFPSKDSLVRLLFT 235

Query: 3667 AIQSVNSVFCTSNHSQKEQNKAIFSRLLTFVMSQDHPLFSPEEMKEINGILSSMDSPAVF 3488
            AI +VNS F + N   KEQNK +F R L+ V S D   FSPE  KE+            F
Sbjct: 236  AIGAVNSFFSSMNQKLKEQNKGVFMRFLSLVNSHDPSFFSPEHTKEV----------CDF 285

Query: 3487 SSFDFSNKEEEKPITCGVNHNVSDAFAENASHEVLNSSKNNFLLDSSSVESFLNKNAPDV 3308
             +FDF                VS  +     + +            S+ ESF++ N P+ 
Sbjct: 286  CNFDF--------------RIVSLCYDLTTMNRL-----------PSAAESFVH-NKPNF 319

Query: 3307 LSEALKPALYDLRGRGVLAPLLDLHKDHDADNLPSPTRGTTPCLPVFKLSAIGHGVAKPE 3128
              E  KP +   + RGVL PLLDL K HD D+LPSPTR T P  PV +L  IG G+    
Sbjct: 320  SIEPPKPGVPSFKSRGVLLPLLDLKKFHDEDSLPSPTRETAPSFPVQRLLPIGDGMISSG 379

Query: 3127 VAPARVAHERENSIMHPYETEALKAVSTYQQKFGRSSFFSQDRLPSPTPSEECDDADGDT 2948
            +   +VA   E   +HPYET+ALKAVS+YQ+KF  +SFF+ + LPSPTPSEE  + DGDT
Sbjct: 380  LPVPKVASITEEPRVHPYETDALKAVSSYQKKFNLNSFFTNE-LPSPTPSEESGNGDGDT 438

Query: 2947 GGEVSSSLTVGNVKNVNLPILARPVVSSA--------------PHMDVSS------ARTS 2828
             GEVSSS TV N + VN P+  R   S +              PH++ SS       R S
Sbjct: 439  AGEVSSSSTV-NYRTVNPPVSDRKSASPSPSPPPPPPPPPPPPPHLNNSSIRVVIPTRNS 497

Query: 2827 GSMIFGSNPTLKASSKSRDPRLRLANSDVGAMDINQGPLPVVHNAPKVDPIGAVISSRKH 2648
              +  G++ T+KAS+KSRDPRLR  N+D  A+D NQ  L +V+N P+ +P GA+  SRK 
Sbjct: 498  APVSSGTSSTVKASAKSRDPRLRYVNTDASALDQNQRTLLMVNNPPRAEPSGAIAGSRKQ 557

Query: 2647 KSVEEPLDGPALKRQRNGSTNYD-VRDAKTVPESGGWLENDAGTVGPQFMNQSHLIENTE 2471
            K  E+ LDG +LKRQRN   N+  VRD +++  +GGWLE D     PQ +N++   EN E
Sbjct: 558  KIEEDVLDGTSLKRQRNSFDNFGVVRDIRSMTGTGGWLE-DTDMAEPQTVNKNQWAENAE 616

Query: 2470 NYLKKMETVVSFSSTAPGKPIVTSGNEQVPLTGTSTIGD-----------------LRNY 2342
               +    VV  S+ +    +  SGN QVP+ G +TI                   L++ 
Sbjct: 617  PGQRINNGVVCPSTGSVMSSVSCSGNVQVPVMGINTIAGSEQAPVTSTTTASLPDLLKDI 676

Query: 2341 TPNPIMVQYLLKL---QRLESEALKKPFLPPSSTVLTPISNSINSIVGAVPFGNVAPINP 2171
            T NP M+  +LK+   QRL  +  +K   P  ST   P   S N+++GA+P  N     P
Sbjct: 677  TVNPTMLINILKMGQQQRLALDGQQKLADPAKSTSHPP---SSNTVLGAIPEVNAVSSLP 733

Query: 2170 TGLGQKPAGILQVP-QIVTVEDPGKVRMKPRDPRRILHNNTYQKSASLGTEQFKFNGAPT 1994
            +G+  + AG  Q P QI T ++ GK+RMKPRDPRR+LHNN  Q++ SLG+EQFK     T
Sbjct: 734  SGILPRSAGKAQGPSQIATTDESGKIRMKPRDPRRVLHNNALQRAGSLGSEQFK-TTTLT 792

Query: 1993 LSTLGSRGNSNDNKFVDQSDRNSIASQPSLPDIARQFTKNLENIANILSXXXXXXXXXPV 1814
             +T G++ N N  K    ++   +      PDI+  FTK+L+NIA+I+S          V
Sbjct: 793  STTQGTKDNQNLQKQEGLAELKPVVP----PDISSPFTKSLKNIADIVSVSQTCTTPPFV 848

Query: 1813 APQILPSQPLQTNLDRVDAKS--MNSDSVERPTETCLTPDEGAAGTAKPQNMWGDVELLF 1640
            + Q + SQP+Q   DRVD K+   NSD    P  +     E  A ++  QN W DVE LF
Sbjct: 849  S-QNVASQPVQIKSDRVDGKTGISNSDQKMGPASS----PEVVAASSLSQNTWEDVEHLF 903

Query: 1639 EGYDDQQKAAIHTERTRRLEEQKKMFAARKXXXXXXXXXXXLNSAKFIEVEPMHDEILRK 1460
            EGYDDQQKAAI  ER RR+EEQKK+FAARK           LNSAKF+EV+P+HDEILRK
Sbjct: 904  EGYDDQQKAAIQRERARRIEEQKKLFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRK 963

Query: 1459 KEEQDREKPQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVL 1280
            KEEQDREKP RHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVL
Sbjct: 964  KEEQDREKPYRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVL 1023

Query: 1279 DPKGVLFAGRVISK--XXXXXXXXXXVPKSKDMEGVLGMDSAVVIIDDSIRVWPHNKHNL 1106
            DPKGVLFAGRV+S+            VPKSKD+EGVLGM+S VVIIDDS+RVWPHNK NL
Sbjct: 1024 DPKGVLFAGRVVSRGDDGDLLDGDERVPKSKDLEGVLGMESGVVIIDDSLRVWPHNKLNL 1083

Query: 1105 IVVERYMYFPCSRRQFGLFGPSLLELDRDERPEDGTLASCLSVIERIHQIFFSQRSLDDV 926
            IVVERY+YFPCSRRQFGL GPSLLE+D DERPEDGTLA  L+VIERIHQ FF+  SLD+ 
Sbjct: 1084 IVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEDGTLACSLAVIERIHQNFFTHHSLDEA 1143

Query: 925  DVRNILACEQRKILSGCRILFSRVFPVGEAKPHLHPLWQTAEQFGAVCTTQIDEHVTHVV 746
            DVRNILA EQRKIL+GCRI+FSRVFPVGE  PHLHPLWQ+AEQFGAVCT QIDE VTHVV
Sbjct: 1144 DVRNILASEQRKILAGCRIVFSRVFPVGEVNPHLHPLWQSAEQFGAVCTNQIDEQVTHVV 1203

Query: 745  AISLGTDKVNWALHTGRYVVHPGWLEASALLYRRANEQDFAIK 617
            A SLGTDKVNWAL TGR+VVHPGW+EASALLYRRANEQDFAIK
Sbjct: 1204 ANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1246


>ref|XP_002297869.2| CTD phosphatase-like protein 3 [Populus trichocarpa]
            gi|550347145|gb|EEE82674.2| CTD phosphatase-like protein
            3 [Populus trichocarpa]
          Length = 1190

 Score = 1057 bits (2734), Expect = 0.0
 Identities = 655/1331 (49%), Positives = 809/1331 (60%), Gaps = 35/1331 (2%)
 Frame = -2

Query: 4504 IEDVEEGEISDTASIEEISEDDFKKQDVRILKEQEVRVLKESSKPQVEPSRGWESQEVFD 4325
            IEDVEEGEISDTAS+EEISE+DF KQ+V I+KE               PS    SQ+V+ 
Sbjct: 15   IEDVEEGEISDTASVEEISEEDFNKQEVVIVKE--------------TPSSNNSSQKVWT 60

Query: 4324 AYKRDTYEFR------RGLYNFAWAQAVQNRPLNNPIVFAVEVEPDVGSNRXXXXXXXSG 4163
               RD Y+++       GLYN AWA+AVQN+PLN   V                      
Sbjct: 61   V--RDLYKYQVGGGYMSGLYNLAWARAVQNKPLNELTVV--------------------- 97

Query: 4162 ANESKEDTNVISXXXXXXXXXXXXXELEEGEIDLDSEEPTEKLAVGGGEEGVLNDSVIVK 3983
             ++S ++ +V+              ELEEGEIDLDSE                   V+V+
Sbjct: 98   IDDSGDEMDVVK----VIDIEKEEGELEEGEIDLDSE------------------PVVVQ 135

Query: 3982 EEGLNHSPSVKEGPLDDSVGVREDVILNDSASVSEWEIDSKEKELVERVNSIRETLDNAN 3803
             EG+              V V                      ++  RV SIR+ L++ +
Sbjct: 136  SEGM--------------VSV----------------------DVENRVKSIRKDLESVS 159

Query: 3802 FTENAQKSFEGACSRLQNSLESLKEII-SEGASVSSKDDLIQQSYTAIQSVNSVFCTSNH 3626
              E  +KSFE  C +L   LESLKE++     S  SKD L+Q  + AI+ VNSVFC+ N 
Sbjct: 160  VIE-TEKSFEAVCLKLHKVLESLKELVGGNDNSFPSKDGLVQLLFMAIRVVNSVFCSMNK 218

Query: 3625 SQKEQNKAIFSRLLTFVMSQDHPLFSPEEMKEINGILSSMDSPAVFSSFDFSNKEEEKPI 3446
              KEQNK +FSR  + + S   P FSP + KE+                           
Sbjct: 219  KLKEQNKGVFSRFFSLLNSHYPPFFSPGQNKEV--------------------------- 251

Query: 3445 TCGVNHNVSDAFAENASHEVLNSSKNNFLLDSSSVESFLNKNAPDVLSEALK-PALYDLR 3269
               +N N +D+ A+ A +++   S+        + E+F+ +N P+   EA K P +   +
Sbjct: 252  ---LNENHNDSLAKTAGYDLTTMSEK-----LPAAETFV-QNKPNKSIEAPKPPGVPSFK 302

Query: 3268 GRGVLAPLLDLHKDHDADNLPSPTRGTTPCLPVFKLSAIGHGVAKPEVAPARVAHERENS 3089
             RGVL PLLDL K HD D+LPSPT+ TTP  PV +L AIG G+    +   +V    E  
Sbjct: 303  SRGVLLPLLDLKKYHDEDSLPSPTQETTP-FPVQRLLAIGDGMVSSGLPVPKVTPVAEEP 361

Query: 3088 IMHPYETEALKAVSTYQQKFGRSSFFSQDRLPSPTPSEECDDADGDTGGEVSSSLTVGNV 2909
             MHPYET+ALKAVS+YQQKF R+SFF+ + LPSPTPSEE  + DGDT GEVSSS TV N 
Sbjct: 362  RMHPYETDALKAVSSYQQKFNRNSFFTNE-LPSPTPSEESGNGDGDTAGEVSSSSTVVNY 420

Query: 2908 KNVNLPILAR--------PVVSSAPHMDVSS------ARTSGSMIFGSNPTLKASSKSRD 2771
            + VN P+  +        P+    PH D S+       R S  +  G + T+KAS+KSRD
Sbjct: 421  RTVNPPVSDQKNAPPSPPPLPPPPPHPDSSNIRGVVPTRNSAPVSSGPSSTIKASAKSRD 480

Query: 2770 PRLRLANSDVGAMDINQGPLPVVHNAPKVDPIGAVISSRKHKSVEEPLDGPALKRQRNGS 2591
            PRLR  N D  A+D NQ  LP+V+N P+V+P GA++ S+KHK  E+ LD P+LKRQRN  
Sbjct: 481  PRLRYVNIDACALDHNQRALPMVNNLPRVEPAGAIVGSKKHKIEEDVLDDPSLKRQRNSF 540

Query: 2590 TNYD-VRDAKTVPESGGWLENDAGTVGPQFMNQSHLIENTENYLKKMETVVSFSSTAPG- 2417
             NY  VRD +++  +GGWLE D     PQ +N++   EN+          V+ S  A   
Sbjct: 541  DNYGAVRDIESMTGTGGWLE-DTDMAEPQTVNKNQWAENSN---------VNGSGNAQSP 590

Query: 2416 ----KPIVTSGNEQVPLTGTSTIGD-LRNYTPNPIMVQYLLKL---QRLESEALKKPFLP 2261
                  I  S   QV  T T+++ D L++   NP M+  +LK+   QRL  +  +    P
Sbjct: 591  FMGISNITGSEQAQVTSTATTSLPDLLKDIAVNPTMLINILKMGQQQRLALDGQQTLSDP 650

Query: 2260 PSSTVLTPISNSINSIVGAVPFGNVAPINPTGLGQKPAGILQVPQIVTVEDPGKVRMKPR 2081
              ST   PIS   N+++GA+P  NVA   P+G+  +PAG     QI T ++ GK+RMKPR
Sbjct: 651  AKSTSHPPIS---NTVLGAIPTVNVASSQPSGIFPRPAGTPVPSQIATSDESGKIRMKPR 707

Query: 2080 DPRRILHNNTYQKSASLGTEQFKFNGAPTLSTLGSRGNSNDNKFVDQSDRNSIASQPSL- 1904
            DPRR LHNN+ Q++ S+G+EQFK     T  T  ++G  +D     Q        +P++ 
Sbjct: 708  DPRRFLHNNSLQRAGSMGSEQFK----TTTLTPTTQGTKDDQNV--QKQEGLAELKPTVP 761

Query: 1903 PDIARQFTKNLENIANILSXXXXXXXXXPVAPQILPSQPLQTNLDRVDAKSMNSDSVERP 1724
            PDI+  FTK+LENIA+ILS          ++ Q + SQP+QT  +RVD K+  S S ++ 
Sbjct: 762  PDISFPFTKSLENIADILSVSQASTTPPFIS-QNVASQPMQTKSERVDGKTGISIS-DQK 819

Query: 1723 TETCLTPDEGAAGTAKPQNMWGDVELLFEGYDDQQKAAIHTERTRRLEEQKKMFAARKXX 1544
            T    +P E  A ++  QN W DVE LFEGYDDQQKAAI  ER RRLEEQKKMFAARK  
Sbjct: 820  TGPASSP-EVVAASSHSQNTWKDVEHLFEGYDDQQKAAIQRERARRLEEQKKMFAARKLC 878

Query: 1543 XXXXXXXXXLNSAKFIEVEPMHDEILRKKEEQDREKPQRHLFRFPHMGMWTKLRPGIWNF 1364
                     LNSAK I    +HDEILRKKEEQDREKP RH+FR PHMGMWTKLRPGIWNF
Sbjct: 879  LVLDLDHTLLNSAKAILSSSLHDEILRKKEEQDREKPYRHIFRIPHMGMWTKLRPGIWNF 938

Query: 1363 LEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISK--XXXXXXXXXXVPKSKD 1190
            LEKASKL+ELHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS+            VPKSKD
Sbjct: 939  LEKASKLFELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 998

Query: 1189 MEGVLGMDSAVVIIDDSIRVWPHNKHNLIVVERYMYFPCSRRQFGLFGPSLLELDRDERP 1010
            +EGVLGM+S VVIIDDS+RVWPHNK NLIVVERY+YFPCSRRQFGL GPSLLE+D DERP
Sbjct: 999  LEGVLGMESGVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERP 1058

Query: 1009 EDGTLASCLSVIERIHQIFFSQRSLDDVDVRNILACEQRKILSGCRILFSRVFPVGEAKP 830
            EDGTLA   +VIE+IHQ FF+ RSLD+ DVRNILA EQRKIL GCRILFSRVFPVGE  P
Sbjct: 1059 EDGTLACSFAVIEKIHQNFFTHRSLDEADVRNILASEQRKILGGCRILFSRVFPVGEVNP 1118

Query: 829  HLHPLWQTAEQFGAVCTTQIDEHVTHVVAISLGTDKVNWALHTGRYVVHPGWLEASALLY 650
            HLHPLWQ AEQFGAVCT QIDE VTHVVA SLGTDKVNWAL TGR VVHPGW+EASALLY
Sbjct: 1119 HLHPLWQMAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRIVVHPGWVEASALLY 1178

Query: 649  RRANEQDFAIK 617
            RRANEQDF+IK
Sbjct: 1179 RRANEQDFSIK 1189


>ref|XP_006438858.1| hypothetical protein CICLE_v10030535mg [Citrus clementina]
            gi|557541054|gb|ESR52098.1| hypothetical protein
            CICLE_v10030535mg [Citrus clementina]
          Length = 1208

 Score = 1055 bits (2727), Expect = 0.0
 Identities = 648/1300 (49%), Positives = 819/1300 (63%), Gaps = 39/1300 (3%)
 Frame = -2

Query: 4501 EDVEEGEISDTASIEEISEDDFKKQDVRILKEQEVRVLKESSKPQV----EPSRGWESQE 4334
            +DVEEGEISDTAS+EEISE+DFK     I +E+ V+V+KE+   +V      +R W  ++
Sbjct: 3    KDVEEGEISDTASVEEISEEDFK-----IKQEEVVKVVKETKPIKVGGGEAAARVWTMRD 57

Query: 4333 VFDAYKRDTYEFRRGLYNFAWAQAVQNRPLNNPIVFAVEVEPDVGSNRXXXXXXXS---- 4166
            +++ Y      +  GL+N AWAQAVQN+PLN   +F +E E D  S R       +    
Sbjct: 58   LYNKYPAICRGYGPGLHNLAWAQAVQNKPLNE--IFVMEAEQDDVSKRSSPASSVASVNS 115

Query: 4165 GANESKEDTNVISXXXXXXXXXXXXXELEEGEIDLDSEEPTEKLAVGGGEEGVLNDS--V 3992
            GA   K+D  V+                                     E+ V++DS   
Sbjct: 116  GAAAGKDDKKVV-------------------------------------EKVVIDDSGDE 138

Query: 3991 IVKEEGLNHSPSVKEGPLDDSVGVREDVILNDSASVSEWEIDSKEKELVERVNSIRETLD 3812
            I KEEG      ++EG ++       D+    +  VSE     KE+  +  V SIRE L+
Sbjct: 139  IEKEEG-----ELEEGEIE------LDLESESNEKVSE---QVKEEMKLINVESIREALE 184

Query: 3811 NANFTENAQKSFEGACSRLQNSLESLKEIISEGASVSSKDDLIQQSYTAIQSVNSVFCTS 3632
            +         SFEG CS+L+ +LESL+E+++E  +V +KD LIQ +++A+QSV+SVFC+ 
Sbjct: 185  SVL---RGDISFEGVCSKLEFTLESLRELVNEN-NVPTKDALIQLAFSAVQSVHSVFCSM 240

Query: 3631 NHSQKEQNKAIFSRLLTFVMSQDHPLFSPEEMKEINGILSSMDSPAVFSSFDFSNKEEEK 3452
            NH  KEQNK I SRLL+ + S + PLFS  ++KE+  +LSS+ + A       ++KE++ 
Sbjct: 241  NHVLKEQNKEILSRLLSVIKSHEPPLFSSNQIKEMEAMLSSLVTRA-------NDKEKDM 293

Query: 3451 PITCGVNHNVSDAFAENASHEVLNSSKNNFLLDSSSVESFLNKNAPDVLSEALKPALYDL 3272
                GVN   S+   ENA +++    K    +DS      L +N P    EA KP     
Sbjct: 294  LAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDS------LMQNKP---LEASKPGPPGY 344

Query: 3271 RGRGVLAPLLDLHKDHDADNLPSPTRGTTPCLPVFKLSAIGHGVAKPEVAPARVAHEREN 3092
            R RGVL PLLD HK HD D+LPSPTR TTP +PV +   +G GV K   A A+++H  E 
Sbjct: 345  RSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGVVKSWAAAAKLSHNAEV 404

Query: 3091 SIMHPYETEALKAVSTYQQKFGRSSFFSQDRLPSPTPSEECDDADGDTGGEVSSSLTVGN 2912
                 YET+AL+A S+YQQKFGR+SFF    LPSPTPSEE  D DGDTGGE+SS+  V  
Sbjct: 405  HKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAVDQ 464

Query: 2911 VKNVNLPILARPVVSSAPH-----MDVSSAR-----------TSG-SMIFGSNPTLKASS 2783
             K VN+P L +  VSS P      MD+SS +           +SG + +   NP +KA  
Sbjct: 465  PKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVVKAPI 524

Query: 2782 KSRDPRLRLANSDVGAMDINQGPLPVVHNAPKVDPIGAVISSRKHKSVEEP-LDGPALKR 2606
            KSRDPRLR A+S+  A+++N  P P++HNAPKV+P+G V+SSRK K+VEEP LDGPALKR
Sbjct: 525  KSRDPRLRFASSN--ALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKR 582

Query: 2605 QRNGSTNYDV-RDAKTVPESGGWLENDAGTVGPQFMNQSHLIENTENYLKKMETVVSFSS 2429
            QRNG  N  V RD K +  SGGWLE D     PQ MN++ L+++ E+  +K++   +   
Sbjct: 583  QRNGFENSGVVRDEKNIYGSGGWLE-DTDMFEPQIMNRNLLVDSAESNSRKLDNGATSPI 641

Query: 2428 TAPGKPIVTSGNEQVPLTGTSTIGDL----RNYTPNPIMVQYLLKL---QRLESEALKKP 2270
            T+    +V SGNE  P T  ST   L    ++   NP M+  +LK+   Q+L ++A +K 
Sbjct: 642  TSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQKS 701

Query: 2269 FLPPSSTVLTPISNSINSIVGAVPFGNVAPINPTGLGQKPAGILQVPQIVTVEDPGKVRM 2090
                 +T+  PI +SI             P++ T     P+GIL  P    +++ GKVRM
Sbjct: 702  NDSSMNTMHPPIPSSI------------PPVSVTC--SIPSGILSKP----MDELGKVRM 743

Query: 2089 KPRDPRRILHNNTYQKSASLGTEQFKFNGAPTLSTLGSRGNSNDNKFVDQSDRNSIASQP 1910
            KPRDPRR+LH N  Q+S SLG E FK +G     T GS+ N N  K +   +   + SQ 
Sbjct: 744  KPRDPRRVLHGNALQRSGSLGPE-FKTDGPSAPCTQGSKENLNFQKQLGAPEAKPVLSQS 802

Query: 1909 SL-PDIARQFTKNLENIANILSXXXXXXXXXPVAPQILPSQPLQTNLDRVDAKSMNSDSV 1733
             L PDI +QFTKNL++IA+ +S          V+ Q  P QP Q      D K++ ++  
Sbjct: 803  VLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVS-QNSPIQPGQIK-SGADMKAVVTNHD 860

Query: 1732 ERPTETCLTPDEGAAGTAKPQNMWGDVELLFEGYDDQQKAAIHTERTRRLEEQKKMFAAR 1553
            ++ T T   P+ G  G A PQ+ WGDVE LFEGYDDQQKAAI  ERTRRLEEQKKMF+AR
Sbjct: 861  DKQTGTGSGPEAGPVG-AHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSAR 919

Query: 1552 KXXXXXXXXXXXLNSAKFIEVEPMHDEILRKKEEQDREKPQRHLFRFPHMGMWTKLRPGI 1373
            K           LNSAKF EV+P+HDEILRKKEEQDREKP RHLFRFPHMGMWTKLRPGI
Sbjct: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979

Query: 1372 WNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKXXXXXXXXXXV--PK 1199
            W FLE+ASKL+E+HLYTMGNKLYATEMAKVLDPKGVLFAGRVIS+             PK
Sbjct: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPK 1039

Query: 1198 SKDMEGVLGMDSAVVIIDDSIRVWPHNKHNLIVVERYMYFPCSRRQFGLFGPSLLELDRD 1019
            SKD+EGVLGM+SAVVIIDDS+RVWPHNK NLIVVERY YFPCSRRQFGL GPSLLE+D D
Sbjct: 1040 SKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHD 1099

Query: 1018 ERPEDGTLASCLSVIERIHQIFFSQRSLDDVDVRNILACEQRKILSGCRILFSRVFPVGE 839
            ER EDGTLAS L VIER+H+IFFS +SLDDVDVRNILA EQRKIL+GCRI+FSRVFPVGE
Sbjct: 1100 ERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGE 1159

Query: 838  AKPHLHPLWQTAEQFGAVCTTQIDEHVTHVVAISLGTDKV 719
            A PHLHPLWQTAEQFGAVCT  ID+ VTHVVA SLGTDKV
Sbjct: 1160 ANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKV 1199


>ref|XP_006603006.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Glycine max]
          Length = 1257

 Score = 1048 bits (2709), Expect = 0.0
 Identities = 648/1327 (48%), Positives = 809/1327 (60%), Gaps = 31/1327 (2%)
 Frame = -2

Query: 4504 IEDVEEGEISDTASIEEISEDDFKKQDVRILKEQEVRVLKESSKPQVEPSRGWESQEVFD 4325
            +EDVEEGEISDTAS+EEIS +DF KQDV+        VL  ++KP    +R W   +++ 
Sbjct: 22   VEDVEEGEISDTASVEEISAEDFNKQDVK--------VLNNNNKPNGSDARVWAVHDLYS 73

Query: 4324 AYKRDTYEFRRGLYNFAWAQAVQNRPLNNPIVFAVEVEPDVGSNRXXXXXXXSGANESKE 4145
             Y      +  GLYN AWAQAVQN+PLN+  +F +EV+ D  +N        S +N S  
Sbjct: 74   KYPTICRGYASGLYNLAWAQAVQNKPLND--IFVMEVDSDANAN--------SNSNNSNR 123

Query: 4144 DTNVI--SXXXXXXXXXXXXXELEEGEIDLDSEEPTEKLAVGGGEEGVLNDSVIVKEEGL 3971
              +V                 ELEEGEID D+E         G  E V+   V+   E  
Sbjct: 124  LASVAVNPKDVVVVDVDKEEGELEEGEIDADAEPE-------GEAESVVAVPVVSDSE-- 174

Query: 3970 NHSPSVKEGPLDDSVGVREDVILNDSASVSEWEIDSKEKELVERVNSIRETLDNANFTEN 3791
                      LDD   V+ DV              S  ++L      +R  L+      N
Sbjct: 175  ---------KLDD---VKRDV--------------SNSEQL-----GVRGVLEGVT-VAN 202

Query: 3790 AQKSFEGACSRLQNSLESLKEIISEGASVSSKDDLIQQSYTAIQSVNSVFCTSNHSQKEQ 3611
              +SF   CS+LQN   +L E++S  A  S +DDL++ S+ A + V SVFC+ +  +KEQ
Sbjct: 203  VAESFAQTCSKLQN---ALPEVLSRPAD-SERDDLVRLSFNATEVVYSVFCSMDSLKKEQ 258

Query: 3610 NKAIFSRLLTFVMSQDH-PLFSPEEMKEINGILSSMDSPAVFSSFDFSNKEEEKPITCGV 3434
            NK    RLL+FV  Q    LFSPE +KEI G+++++D      + +   KE+E   T   
Sbjct: 259  NKDSILRLLSFVKDQQQAQLFSPEHIKEIQGMMTAIDYFGALVNSEAIGKEKELQTTV-- 316

Query: 3433 NHNVSDAFAENASHEVLNSSKNNFLLDSSSVESFLNKNAPDVL--SEALKPALYDLRGRG 3260
                        +HE+   ++ N  ++++ + S+      D++  S ALK     ++GRG
Sbjct: 317  -----------QTHEI--KTQENQAVEAAELISYNKPLHSDIIGASHALKFGQNSIKGRG 363

Query: 3259 VLAPLLDLHKDHDADNLPSPTRGTTPCLPVFKLSAIGH-------GVAKPEVAPARVAHE 3101
            VL PLLDLHKDHDAD+LPSPTR    C PV KL ++G          AKPE    ++  +
Sbjct: 364  VLLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGEPMVSSGSAAAKPE--SGKMELD 421

Query: 3100 RENSIMHPYETEALKAVSTYQQKFGRSSFFSQDRLPSPTPSEECDDADGDTGGEVSSSLT 2921
             E S  H YET+ALKAVSTYQQKFGRSS F+ D+ PSPTPS +C+D   DT  EVSS+ T
Sbjct: 422  SEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPSGDCEDEIVDTNEEVSSAST 481

Query: 2920 VGNVKNVNLPILARPVVSSAPHMDVSSARTSGSMIFGSNP---TLKASSKSRDPRLRLAN 2750
               + +    +L  P VS+      S      S +  + P    +K+S+K+RDPRLR  N
Sbjct: 482  GDFLTSTKPTLLDLPPVSATSTDRSSLHGFISSRVDAAGPGSLPVKSSAKNRDPRLRFVN 541

Query: 2749 SDVGAMDINQGPLPVVHNAPKVDPIGAVISSRKHKSVEEP-LDGPALKRQRNGSTNYDVR 2573
            SD  A+D    P  ++HN PKV+  G  I SRK K+ EEP LD    KRQ++   N +  
Sbjct: 542  SDASAVD---NPSTLIHNMPKVEYAGTTI-SRKQKAAEEPSLDVTVSKRQKSPLENTEHN 597

Query: 2572 DAKTVPESGGWLENDAGTVGPQFMNQSHLIENTENYLKKMETVVSFSSTAPGKPIVTS-G 2396
             ++     GGWLE   G  G QF+ ++HL++      +K    VS S T       TS  
Sbjct: 598  MSEVRTGIGGWLEEHTGP-GAQFIERNHLMDKFGPEPQKTLNTVSSSCTGSDNFNATSIR 656

Query: 2395 NEQVPLTGTSTIGD----LRNYTPNPIMVQYLLKLQRLESEALKKPFLPPSSTVLTPISN 2228
            NEQ P+T ++ +      L+    NP M+  LL++    +EA KK     ++ +L P S+
Sbjct: 657  NEQAPITSSNVLASLPALLKGAAVNPTMLVNLLRI----AEAQKKSADSATNMLLHPTSS 712

Query: 2227 SINSIVGAVPFGNVAPINPTGLGQKPAGILQVPQIVT------VEDPGKVRMKPRDPRRI 2066
              NS +G     ++     TGL Q   G+L V    T       +D GK+RMKPRDPRRI
Sbjct: 713  --NSAMGTDSTASIGSSMATGLLQSSVGMLPVSSQSTSMTQTLQDDSGKIRMKPRDPRRI 770

Query: 2065 LH-NNTYQKSASLGTEQFKFNGAPTLSTLGSRGNSNDNKFVDQSDRNSIASQPSL-PDIA 1892
            LH NNT QKS +LG EQFK   +P  +  G+  N N  K   + D   + +QPS  PDIA
Sbjct: 771  LHTNNTIQKSGNLGNEQFKAIVSPVSNNQGTGDNVNAQKLEGRVDSKLVPTQPSAQPDIA 830

Query: 1891 RQFTKNLENIANILSXXXXXXXXXPVAPQILPSQPLQTNLDRVDAKSMNSDSVERPTETC 1712
            RQF +NL+NIA+I+S         PVA QI  S  +    DR + KS+ S+S        
Sbjct: 831  RQFARNLKNIADIMSVSQESSTHTPVA-QIFSSASVPLTSDRGEQKSVVSNSQNLEAGMV 889

Query: 1711 LTPDEGAAGTAKPQNMWGDVELLFEGYDDQQKAAIHTERTRRLEEQKKMFAARKXXXXXX 1532
               +  A+GT + QN WGDVE LFEGYD+QQKAAI  ER RR+EEQ KMFAARK      
Sbjct: 890  SAHETAASGTCRSQNTWGDVEHLFEGYDEQQKAAIQRERARRIEEQNKMFAARKLCLVLD 949

Query: 1531 XXXXXLNSAKFIEVEPMHDEILRKKEEQDREKPQRHLFRFPHMGMWTKLRPGIWNFLEKA 1352
                 LNSAKF+EV+P+HDEILRKKEEQDREKP RHLFRFPHMGMWTKLRPGIWNFLEKA
Sbjct: 950  LDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWNFLEKA 1009

Query: 1351 SKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISK--XXXXXXXXXXVPKSKDMEGV 1178
            SKLYELHLYTMGNKLYATEMAKVLDPKG+LFAGRVIS+             PKSKD+EGV
Sbjct: 1010 SKLYELHLYTMGNKLYATEMAKVLDPKGLLFAGRVISRGDDTDSVDGEERAPKSKDLEGV 1069

Query: 1177 LGMDSAVVIIDDSIRVWPHNKHNLIVVERYMYFPCSRRQFGLFGPSLLELDRDERPEDGT 998
            LGM+S+VVIIDDS+RVWPHNK NLIVVERY YFPCSRRQFGL GPSLLE+D DERPE GT
Sbjct: 1070 LGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEAGT 1129

Query: 997  LASCLSVIERIHQIFFSQRSLDDVDVRNILACEQRKILSGCRILFSRVFPVGEAKPHLHP 818
            LAS L+VIE+IHQIFF+ RSL++VDVRNILA EQRKIL+GCRI+FSRVFPVGEA PHLHP
Sbjct: 1130 LASSLAVIEKIHQIFFASRSLEEVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHP 1189

Query: 817  LWQTAEQFGAVCTTQIDEHVTHVVAISLGTDKVNWALHTGRYVVHPGWLEASALLYRRAN 638
            LWQTAEQFGA CT QIDE VTHVVA S GTDKVNWAL+ GR+VVHPGW+EASALLYRRAN
Sbjct: 1190 LWQTAEQFGAFCTNQIDEQVTHVVANSPGTDKVNWALNNGRFVVHPGWVEASALLYRRAN 1249

Query: 637  EQDFAIK 617
            EQDFAIK
Sbjct: 1250 EQDFAIK 1256


>ref|XP_003530482.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Glycine max]
          Length = 1261

 Score = 1044 bits (2699), Expect = 0.0
 Identities = 644/1318 (48%), Positives = 811/1318 (61%), Gaps = 23/1318 (1%)
 Frame = -2

Query: 4501 EDVEEGEISDTASIEEISEDDFKKQDVRILKEQEVRVLKESSKPQVEPSRGWESQEVFDA 4322
            EDVEEGEISDTAS+EEIS +DF KQDV++L          ++KP    +R W   +++  
Sbjct: 23   EDVEEGEISDTASVEEISAEDFNKQDVKLLNN--------NNKPNGSDARVWAVHDLYSK 74

Query: 4321 YKRDTYEFRRGLYNFAWAQAVQNRPLNNPIVFAVEVEPDVGSNRXXXXXXXSGANESKED 4142
            Y      +  GLYN AWAQAVQN+PLN+  V  V+ + +  SNR       S A   K+ 
Sbjct: 75   YPTICRGYASGLYNLAWAQAVQNKPLNDIFVMEVDSDANANSNRNSSHRLASVAVNPKDV 134

Query: 4141 TNVISXXXXXXXXXXXXXELEEGEIDLDSEEPTEKLAVGGGEEGVLNDSVIVKEEGLNHS 3962
              V               ELEEGEID D+E   E             +SV+V        
Sbjct: 135  VVV--------DVDKEEGELEEGEIDADAEPEGEA------------ESVVVA------- 167

Query: 3961 PSVKEGPLDDSVGVREDVILNDSASVSEWEIDSKEKELVERVNSIRETLDNANFTENAQK 3782
                               ++DS  + + ++D  + E +     + E +  AN  E    
Sbjct: 168  -------------------VSDSEKLDDVKMDVSDSEQLG-ARGVLEGVTVANVVE---- 203

Query: 3781 SFEGACSRLQNSLESLKEIISEGASVSSKDDLIQQSYTAIQSVNSVFCTSNHSQKEQNKA 3602
            SF   CS+LQN+L    E++S  A  S KDDL++ S+ A + V SVFC+ + S+KEQNK 
Sbjct: 204  SFAQTCSKLQNTLP---EVLSRPAG-SEKDDLVRLSFNATEVVYSVFCSMDSSEKEQNKD 259

Query: 3601 IFSRLLTFVMSQDHP-LFSPEEMKEINGILSSMDSPAVFSSFDFSNKEEEKPITCGVNHN 3425
               RLL+FV  Q    LFSPE +KEI G+++++DS     + +   KE+E   T      
Sbjct: 260  SILRLLSFVKDQQQAQLFSPEHVKEIQGMMTAIDSVGALVNSEAIGKEKELQTT--EIKT 317

Query: 3424 VSDAFAENASHEVLNSSKNNFLLDSSSVESFLNKNAPDVL--SEALKPALYDLRGRGVLA 3251
              ++  E   HE+   ++ N  ++++ + S+      D+   S+ALK     ++GRGVL 
Sbjct: 318  QENSAVEVQIHEI--KTQENQAVEAAELISYSKPLHRDITGTSQALKFGQNSIKGRGVLL 375

Query: 3250 PLLDLHKDHDADNLPSPTRGTTPCLPVFKLSAIGHGVAKPEVAPARVAHERENSIMHPYE 3071
            PLLDLHKDHDAD+LPSPTR    C PV KL ++G  + +   A A++  + E S  H YE
Sbjct: 376  PLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGESMVRSGSASAKMELDSEGSKFHLYE 435

Query: 3070 TEALKAVSTYQQKFGRSSFFSQDRLPSPTPSEECDDADGDTGGEVSSSLTVGNVKNVNLP 2891
            T+ALKAVSTYQQKFGRSS F+ D+ PSPTPS +C+D   DT  EVSS+ T   + +    
Sbjct: 436  TDALKAVSTYQQKFGRSSLFTNDKFPSPTPSGDCEDEVVDTNEEVSSASTGDFLTSTKPT 495

Query: 2890 ILARPVVSSAPHMDVSSARTS-GSMIFGSNP---TLKASSKSRDPRLRLANSDVGAMDIN 2723
            +L +P VS A  MD SS      S +  + P    +K+S+K+RDPRLR  NSD  A+D N
Sbjct: 496  LLDQPPVS-ATSMDRSSMHGFISSRVDATGPGSFPVKSSAKNRDPRLRFINSDASAVD-N 553

Query: 2722 QGPLPVVHNAPKVDPIGAVISSRKHKSVEEP-LDGPALKRQRNGSTNYDVRDAKTVPESG 2546
               L  ++N  KV+  G  IS RK K+ EEP LD    KR ++   N +   ++    SG
Sbjct: 554  LSTL--INNMSKVEYSGTTIS-RKQKAAEEPSLDVTVSKRLKSSLENTEHNMSEVRTGSG 610

Query: 2545 GWLENDAGTVGPQFMNQSHLIENTENYLKKMETVVSFSSTAPGKPIVTS-GNEQVPLTGT 2369
            GWLE + G  G Q + ++HL++      KK    VS S T       TS  NEQ P+T +
Sbjct: 611  GWLEENTGP-GAQLIERNHLMDKFGPEAKKTLNTVSSSCTGSDNFNATSIRNEQAPITAS 669

Query: 2368 STIGDL----RNYTPNPIMVQYLLKLQRLESEALKKPFLPPSSTVLTPISNSINSIVGAV 2201
            + +  L    +  + NPIM+  +L+L    +EA KK     +  +L P S+  N  +G  
Sbjct: 670  NVLASLPALLKEASVNPIMLVNILRL----AEAQKKSADSAAIMLLHPTSS--NPAMGTD 723

Query: 2200 PFGNVAPINPTGLGQKPAGILQVPQIVTV------EDPGKVRMKPRDPRRILH-NNTYQK 2042
               ++     TGL Q   G+L V    T       +D GK+RMKPRDPRRILH NNT QK
Sbjct: 724  STASIGSSMATGLLQSSVGMLPVSSQSTSTAQTLQDDSGKIRMKPRDPRRILHTNNTIQK 783

Query: 2041 SASLGTEQFKFNGAPTLSTLGSRGNSNDNKFVDQSDRNSIASQPSL-PDIARQFTKNLEN 1865
            S  LG EQFK   +P  +   +  N N  K   + D   + +Q S  PDIARQFT+NL+N
Sbjct: 784  SGDLGNEQFKAIVSPVSNNQRTGDNVNAPKLEGRVDNKLVPTQSSAQPDIARQFTRNLKN 843

Query: 1864 IANILSXXXXXXXXXPVAPQILPSQPLQTNLDRVDAKSMNSDSVERPTETCLTPDEGAAG 1685
            IA+I+S         PV+ Q   S  +    DR + KS+ S S     +     +  A+ 
Sbjct: 844  IADIMSVSQESSTHTPVS-QNFSSASVPLTSDRGEQKSVVSSSQNLQADMASAHETAASV 902

Query: 1684 TAKPQNMWGDVELLFEGYDDQQKAAIHTERTRRLEEQKKMFAARKXXXXXXXXXXXLNSA 1505
            T++ Q+ WGDVE LFEGYD+QQKAAI  ER RR+EEQ KMFAARK           LNSA
Sbjct: 903  TSRSQSTWGDVEHLFEGYDEQQKAAIQRERARRIEEQNKMFAARKLCLVLDLDHTLLNSA 962

Query: 1504 KFIEVEPMHDEILRKKEEQDREKPQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLY 1325
            KF+EV+P+HDEILRKKEEQDREKP RHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLY
Sbjct: 963  KFVEVDPLHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLY 1022

Query: 1324 TMGNKLYATEMAKVLDPKGVLFAGRVISKXXXXXXXXXXV--PKSKDMEGVLGMDSAVVI 1151
            TMGNKLYATEMAKVLDPKGVLFAGRVIS+             PKSKD+EGVLGM+S+VVI
Sbjct: 1023 TMGNKLYATEMAKVLDPKGVLFAGRVISRGDDTDSVDGEERVPKSKDLEGVLGMESSVVI 1082

Query: 1150 IDDSIRVWPHNKHNLIVVERYMYFPCSRRQFGLFGPSLLELDRDERPEDGTLASCLSVIE 971
            IDDS+RVWPHNK NLIVVERY YFPCSRRQFGL GPSLLE+D DERPE GTLAS L+VIE
Sbjct: 1083 IDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEAGTLASSLAVIE 1142

Query: 970  RIHQIFFSQRSLDDVDVRNILACEQRKILSGCRILFSRVFPVGEAKPHLHPLWQTAEQFG 791
            +IHQIFF+ +SL++VDVRNILA EQRKIL+GCRI+FSRVFPVGEA PHLHPLWQTAEQFG
Sbjct: 1143 KIHQIFFASQSLEEVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFG 1202

Query: 790  AVCTTQIDEHVTHVVAISLGTDKVNWALHTGRYVVHPGWLEASALLYRRANEQDFAIK 617
            AVCT QIDE VTHVVA S GTDKVNWAL+ GR+VVHPGW+EASALLYRRANEQDFAIK
Sbjct: 1203 AVCTNQIDEQVTHVVANSPGTDKVNWALNNGRFVVHPGWVEASALLYRRANEQDFAIK 1260


>ref|XP_004157633.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain
            phosphatase-like 3-like [Cucumis sativus]
          Length = 1249

 Score = 1042 bits (2694), Expect = 0.0
 Identities = 643/1329 (48%), Positives = 801/1329 (60%), Gaps = 26/1329 (1%)
 Frame = -2

Query: 4525 MARSSIL-IEDVEEGEISDTASIEEISEDDFKKQDVRILKEQEVRVLKESSKPQVEPSRG 4349
            M +  IL IEDVEEGEISDTAS+EEISE+DF K D     +  V      SK     +R 
Sbjct: 1    MGKDEILKIEDVEEGEISDTASVEEISEEDFNKLDSSASPKVVV-----PSKDSNRETRV 55

Query: 4348 WESQEVFDAYKRDTYEFRRGLYNFAWAQAVQNRPLNNPIVFAVEVEPDV----GSNRXXX 4181
            W   +++  Y    + +  GLYN AWAQAVQN+PLN+  +F +E + D      S+    
Sbjct: 56   WTMSDLYKNYPAMRHGYASGLYNLAWAQAVQNKPLND--IFVMEADLDEKSKHSSSTPFG 113

Query: 4180 XXXXSGANESKEDTNVISXXXXXXXXXXXXXELEEGEIDLDSEEPTEKLAVGGGEEGVLN 4001
                 G+N +KE+  V+                    ID   +E     A G  EEG L 
Sbjct: 114  NAKDDGSNTTKEEDRVV--------------------IDDSGDEMNCDNANGEKEEGELE 153

Query: 4000 DSVIVKEEGLNHSPSVKEGPLDDSVGVREDVILNDSASVSEWEIDSKEKELVERVNSIRE 3821
            +  I  +       +  +  L DS     D+ +N        E D + KEL E +  I++
Sbjct: 154  EGEIDMDTEFVEEVADSKAMLSDS----RDMDINGQ------EFDLETKELDELLKFIQK 203

Query: 3820 TLDNANFTENAQKSFEGACSRLQNSLESLKEIISEGASVSSKDDLIQQSYTAIQSVNSVF 3641
            TLD     + AQKSF+  CS++ +S+E+  E++ +G  V  KD LIQ+ Y A++ +NSVF
Sbjct: 204  TLDGVTI-DAAQKSFQEVCSQIHSSIETFVELL-QGKVVPRKDALIQRLYAALRLINSVF 261

Query: 3640 CTSNHSQKEQNKAIFSRLLTFVMSQDHPLFSPEEMKEINGILSSMDSPAVFSSFDFSNKE 3461
            C+ N S+KE++K   SRLL++V + D PLFSPE++K +   + S DS     S   S KE
Sbjct: 262  CSMNLSEKEEHKEHLSRLLSYVKNCDPPLFSPEQIKSVEVKMPSTDSLDHLPSMRGSAKE 321

Query: 3460 EEKPITCGVNHNVSDAFAENASHEVLNSSKNNFLLDSSSVESFLNKNAPDVLSEALKPAL 3281
             E  I  GV      +   + S ++  S+K   L   S       KN  ++LSE L+  +
Sbjct: 322  VEIHIPNGVKDMDFYSAYTSTSSQLTPSNK---LASDSIPFGVKGKNNLNILSEGLQSGV 378

Query: 3280 YDLRGRGVLAPLLDLHKDHDADNLPSPTRGTTPCLPVFKLSAIGHGVAKPEVAPARVAHE 3101
              ++GRG L PLLDLHKDHDAD+LPSPTR       +F +   G+       AP ++A  
Sbjct: 379  SSIKGRGPLLPLLDLHKDHDADSLPSPTREAPT---IFSVQKSGN-------APTKMAFP 428

Query: 3100 RENSIMHPYETEALKAVSTYQQKFGRSSFFSQDRLPSPTPSEECDDADGDTGGEVSSSLT 2921
             + S  HPYET+ALKAVSTYQQKFGRSSF   DRLPSPTPSEE  D  GD GGEVSSS  
Sbjct: 429  VDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEE-HDGGGDIGGEVSSSSI 487

Query: 2920 VGNVKNVNLPILARPVVSSA-------PHMDVSSARTSGSMI------FGSNPTLKASSK 2780
            + ++K+ N+    +   S++       P+MD SS R   S +        SNPT+K  +K
Sbjct: 488  IRSLKSSNVSKPGQKSNSASNVSTGLFPNMDSSSTRVLISPLNVAPPSSVSNPTVKPLAK 547

Query: 2779 SRDPRLRLANSDVGAMDINQGPLPVVHNAPKVDPIGAVISSRKHKSVEEP-LDGPALKRQ 2603
            SRDPRLR+ NSD   MD+N   +  V ++  ++   A +  RK K   EP  DGP +KR 
Sbjct: 548  SRDPRLRIVNSDASGMDLNPRTMASVQSSSILES-AATLHLRKQKMDGEPNTDGPEVKRL 606

Query: 2602 RNGSTNYDVR--DAKTVPESGGWLENDAGTVGPQFMNQSHLIENTENYLKKMETVVSFSS 2429
            R GS N  V   D + V  SGGWLE D    GP+  N++ +     N  +K     +  S
Sbjct: 607  RIGSQNLAVAASDVRAVSGSGGWLE-DTMPAGPRLFNRNQMEIAEANATEKSNVTNNSGS 665

Query: 2428 TAPGKPIVTSGNEQVPLTGTSTIGDLRNYTPNPIMVQYLLKLQRLESEA--LKKPFLPPS 2255
                 P V + N+       S    L++   NP M+  LLK+ + +  A  LK     P 
Sbjct: 666  GNECTPTVNNSND------ASLPSLLKDIVVNPTMLLNLLKMSQQQQLAAELKLKSSEPE 719

Query: 2254 STVLTPISNSINSIVGAVPFGNVAPINPTGLGQKPAGILQVPQIVTVEDP-GKVRMKPRD 2078
               + P   S+N   G+ P  N AP+  +G+ Q+ AG      +V  +D  GKVRMKPRD
Sbjct: 720  KNAICP--TSLNPCQGSSPLIN-APVATSGILQQSAGTPSASPVVGRQDDLGKVRMKPRD 776

Query: 2077 PRRILHNNTYQKSASLGTEQFKFNGAPTLSTLGSRGNSNDNKFVDQSDRNSIASQPSLPD 1898
            PRR+LH N+ QK  SLG +Q K       +T GSR   N +K   Q D    +SQ  LPD
Sbjct: 777  PRRVLHGNSLQKVGSLGNDQLKGVVPTASNTEGSRDIPNGHKQEGQGDSKLASSQTILPD 836

Query: 1897 IARQFTKNLENIANILSXXXXXXXXXPVAPQILPSQPLQTNLDRVDAKSMNSDSVERPTE 1718
            I RQFT NL+NIA+I+S               +PS P  T+     +K + S S++    
Sbjct: 837  IGRQFTNNLKNIADIMS---------------VPSPP--TSSPNSSSKPVGSSSMDSKPV 879

Query: 1717 TCLTPDEGAAGTAKPQNMWGDVELLFEGYDDQQKAAIHTERTRRLEEQKKMFAARKXXXX 1538
            T        A +++ Q  WGD+E LF+ YDD+QKAAI  ER RR+EEQKKMFAARK    
Sbjct: 880  TTAFQAVDMAASSRSQGAWGDLEHLFDSYDDKQKAAIQRERARRIEEQKKMFAARKLCLV 939

Query: 1537 XXXXXXXLNSAKFIEVEPMHDEILRKKEEQDREKPQRHLFRFPHMGMWTKLRPGIWNFLE 1358
                   LNSAKF+EV+P+HDEILRKKEEQDREK QRHLFRFPHMGMWTKLRPG+WNFLE
Sbjct: 940  LDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLE 999

Query: 1357 KASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKXXXXXXXXXXV--PKSKDME 1184
            KAS+LYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS+             PKSKD+E
Sbjct: 1000 KASELYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPLDGDDRVPKSKDLE 1059

Query: 1183 GVLGMDSAVVIIDDSIRVWPHNKHNLIVVERYMYFPCSRRQFGLFGPSLLELDRDERPED 1004
            GVLGM+S VVIIDDSIRVWPHNK NLIVVERY YFPCSRRQFGL GPSLLE+D DERPED
Sbjct: 1060 GVLGMESGVVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPED 1119

Query: 1003 GTLASCLSVIERIHQIFFSQRSLDDVDVRNILACEQRKILSGCRILFSRVFPVGEAKPHL 824
            GTLAS L VI+RIHQ FFS   LD VDVR IL+ EQ+KIL+GCRI+FSRVFPVGEA PHL
Sbjct: 1120 GTLASSLGVIQRIHQXFFSNPELDQVDVRTILSAEQQKILAGCRIVFSRVFPVGEANPHL 1179

Query: 823  HPLWQTAEQFGAVCTTQIDEHVTHVVAISLGTDKVNWALHTGRYVVHPGWLEASALLYRR 644
            HPLWQTAEQFGA CT QIDE VTHVVA SLGTDKVNWAL TGR+VVHPGW+EASALLYRR
Sbjct: 1180 HPLWQTAEQFGAQCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRR 1239

Query: 643  ANEQDFAIK 617
            A EQDFAIK
Sbjct: 1240 ATEQDFAIK 1248


>ref|XP_004140651.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Cucumis sativus]
          Length = 1249

 Score = 1041 bits (2693), Expect = 0.0
 Identities = 643/1329 (48%), Positives = 801/1329 (60%), Gaps = 26/1329 (1%)
 Frame = -2

Query: 4525 MARSSIL-IEDVEEGEISDTASIEEISEDDFKKQDVRILKEQEVRVLKESSKPQVEPSRG 4349
            M +  IL IEDVEEGEISDTAS+EEISE+DF K D     +  V      SK     +R 
Sbjct: 1    MGKDEILKIEDVEEGEISDTASVEEISEEDFNKLDSSASPKVVV-----PSKDSNRETRV 55

Query: 4348 WESQEVFDAYKRDTYEFRRGLYNFAWAQAVQNRPLNNPIVFAVEVEPDV----GSNRXXX 4181
            W   +++  Y    + +  GLYN AWAQAVQN+PLN+  +F +E + D      S+    
Sbjct: 56   WTMSDLYKNYPAMRHGYASGLYNLAWAQAVQNKPLND--IFVMEADLDEKSKHSSSTPFG 113

Query: 4180 XXXXSGANESKEDTNVISXXXXXXXXXXXXXELEEGEIDLDSEEPTEKLAVGGGEEGVLN 4001
                 G+N +KE+  V+                    ID   +E     A G  EEG L 
Sbjct: 114  NAKDDGSNTTKEEDRVV--------------------IDDSGDEMNCDNANGEKEEGELE 153

Query: 4000 DSVIVKEEGLNHSPSVKEGPLDDSVGVREDVILNDSASVSEWEIDSKEKELVERVNSIRE 3821
            +  I  +       +  +  L DS     D+ +N        E D + KEL E +  I++
Sbjct: 154  EGEIDMDTEFVEEVADSKAMLSDS----RDMDINGQ------EFDLETKELDELLKFIQK 203

Query: 3820 TLDNANFTENAQKSFEGACSRLQNSLESLKEIISEGASVSSKDDLIQQSYTAIQSVNSVF 3641
            TLD     + AQKSF+  CS++ +S+E+  E++ +G  V  KD LIQ+ Y A++ +NSVF
Sbjct: 204  TLDGVTI-DAAQKSFQEVCSQIHSSIETFVELL-QGKVVPRKDALIQRLYAALRLINSVF 261

Query: 3640 CTSNHSQKEQNKAIFSRLLTFVMSQDHPLFSPEEMKEINGILSSMDSPAVFSSFDFSNKE 3461
            C+ N S+KE++K   SRLL++V + D PLFSPE++K +   + S DS     S   S KE
Sbjct: 262  CSMNLSEKEEHKEHLSRLLSYVKNCDPPLFSPEQIKSVEVKMPSTDSLDHLPSMRGSAKE 321

Query: 3460 EEKPITCGVNHNVSDAFAENASHEVLNSSKNNFLLDSSSVESFLNKNAPDVLSEALKPAL 3281
             E  I  GV      +   + S ++  S+K   L   S       KN  ++LSE L+  +
Sbjct: 322  VEIHIPNGVKDMDFYSAYTSTSSQLTPSNK---LASDSIPFGVKGKNNLNILSEGLQSGV 378

Query: 3280 YDLRGRGVLAPLLDLHKDHDADNLPSPTRGTTPCLPVFKLSAIGHGVAKPEVAPARVAHE 3101
              ++GRG L PLLDLHKDHDAD+LPSPTR       +F +   G+       AP ++A  
Sbjct: 379  SSIKGRGPLLPLLDLHKDHDADSLPSPTREAPT---IFSVQKSGN-------APTKMAFP 428

Query: 3100 RENSIMHPYETEALKAVSTYQQKFGRSSFFSQDRLPSPTPSEECDDADGDTGGEVSSSLT 2921
             + S  HPYET+ALKAVSTYQQKFGRSSF   DRLPSPTPSEE  D  GD GGEVSSS  
Sbjct: 429  VDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEE-HDGGGDIGGEVSSSSI 487

Query: 2920 VGNVKNVNLPILARPVVSSA-------PHMDVSSARTSGSMI------FGSNPTLKASSK 2780
            + ++K+ N+    +   S++       P+MD SS R   S +        SNPT+K  +K
Sbjct: 488  IRSLKSSNVSKPGQKSNSASNVSTGLFPNMDSSSTRVLISPLNVAPPSSVSNPTVKPLAK 547

Query: 2779 SRDPRLRLANSDVGAMDINQGPLPVVHNAPKVDPIGAVISSRKHKSVEEP-LDGPALKRQ 2603
            SRDPRLR+ NSD   MD+N   +  V ++  ++   A +  RK K   EP  DGP +KR 
Sbjct: 548  SRDPRLRIVNSDASGMDLNPRTMASVQSSSILES-AATLHLRKQKMDGEPNTDGPEVKRL 606

Query: 2602 RNGSTNYDVR--DAKTVPESGGWLENDAGTVGPQFMNQSHLIENTENYLKKMETVVSFSS 2429
            R GS N  V   D + V  SGGWLE D    GP+  N++ +     N  +K     +  S
Sbjct: 607  RIGSQNLAVAASDVRAVSGSGGWLE-DTMPAGPRLFNRNQMEIAEANATEKSNVTNNSGS 665

Query: 2428 TAPGKPIVTSGNEQVPLTGTSTIGDLRNYTPNPIMVQYLLKLQRLESEA--LKKPFLPPS 2255
                 P V + N+       S    L++   NP M+  LLK+ + +  A  LK     P 
Sbjct: 666  GNECTPTVNNSND------ASLPSLLKDIVVNPTMLLNLLKMSQQQQLAAELKLKSSEPE 719

Query: 2254 STVLTPISNSINSIVGAVPFGNVAPINPTGLGQKPAGILQVPQIVTVEDP-GKVRMKPRD 2078
               + P   S+N   G+ P  N AP+  +G+ Q+ AG      +V  +D  GKVRMKPRD
Sbjct: 720  KNAICP--TSLNPCQGSSPLIN-APVATSGILQQSAGTPSASPVVGRQDDLGKVRMKPRD 776

Query: 2077 PRRILHNNTYQKSASLGTEQFKFNGAPTLSTLGSRGNSNDNKFVDQSDRNSIASQPSLPD 1898
            PRR+LH N+ QK  SLG +Q K       +T GSR   N +K   Q D    +SQ  LPD
Sbjct: 777  PRRVLHGNSLQKVGSLGNDQLKGVVPTASNTEGSRDIPNGHKQEGQGDSKLASSQTILPD 836

Query: 1897 IARQFTKNLENIANILSXXXXXXXXXPVAPQILPSQPLQTNLDRVDAKSMNSDSVERPTE 1718
            I RQFT NL+NIA+I+S               +PS P  T+     +K + S S++    
Sbjct: 837  IGRQFTNNLKNIADIMS---------------VPSPP--TSSPNSSSKPVGSSSMDSKPV 879

Query: 1717 TCLTPDEGAAGTAKPQNMWGDVELLFEGYDDQQKAAIHTERTRRLEEQKKMFAARKXXXX 1538
            T        A +++ Q  WGD+E LF+ YDD+QKAAI  ER RR+EEQKKMFAARK    
Sbjct: 880  TTAFQAVDMAASSRSQGAWGDLEHLFDSYDDKQKAAIQRERARRIEEQKKMFAARKLCLV 939

Query: 1537 XXXXXXXLNSAKFIEVEPMHDEILRKKEEQDREKPQRHLFRFPHMGMWTKLRPGIWNFLE 1358
                   LNSAKF+EV+P+HDEILRKKEEQDREK QRHLFRFPHMGMWTKLRPG+WNFLE
Sbjct: 940  LDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLE 999

Query: 1357 KASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKXXXXXXXXXXV--PKSKDME 1184
            KAS+LYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS+             PKSKD+E
Sbjct: 1000 KASELYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPLDGDDRVPKSKDLE 1059

Query: 1183 GVLGMDSAVVIIDDSIRVWPHNKHNLIVVERYMYFPCSRRQFGLFGPSLLELDRDERPED 1004
            GVLGM+S VVIIDDSIRVWPHNK NLIVVERY YFPCSRRQFGL GPSLLE+D DERPED
Sbjct: 1060 GVLGMESGVVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPED 1119

Query: 1003 GTLASCLSVIERIHQIFFSQRSLDDVDVRNILACEQRKILSGCRILFSRVFPVGEAKPHL 824
            GTLAS L VI+RIHQ FFS   LD VDVR IL+ EQ+KIL+GCRI+FSRVFPVGEA PHL
Sbjct: 1120 GTLASSLGVIQRIHQSFFSNPELDQVDVRTILSAEQQKILAGCRIVFSRVFPVGEANPHL 1179

Query: 823  HPLWQTAEQFGAVCTTQIDEHVTHVVAISLGTDKVNWALHTGRYVVHPGWLEASALLYRR 644
            HPLWQTAEQFGA CT QIDE VTHVVA SLGTDKVNWAL TGR+VVHPGW+EASALLYRR
Sbjct: 1180 HPLWQTAEQFGAQCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRR 1239

Query: 643  ANEQDFAIK 617
            A EQDFAIK
Sbjct: 1240 ATEQDFAIK 1248


>ref|XP_006341905.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Solanum tuberosum]
          Length = 1218

 Score = 1026 bits (2654), Expect = 0.0
 Identities = 614/1314 (46%), Positives = 820/1314 (62%), Gaps = 18/1314 (1%)
 Frame = -2

Query: 4504 IEDVEEGEISDTASIEEISEDDFKKQDVRILKEQEVRVLKESSKPQVEPSRGWESQEVFD 4325
            +EDVEEGEISD+AS+EEISED F +QD     + ++   +  ++     +R W  +   D
Sbjct: 7    VEDVEEGEISDSASVEEISEDAFNRQDPPTTTKIKIASNENQNQNSTTTTRVWTMR---D 63

Query: 4324 AYKRD-TYEFRRGLYNFAWAQAVQNRPLNNPIVFAVEVEPDVGSNRXXXXXXXSGANESK 4148
            AYK   + ++ RGLYN AWAQAVQN+PL+   V   +      SN+       + AN + 
Sbjct: 64   AYKYPISRDYARGLYNLAWAQAVQNKPLDELFVMTSD-----NSNQ------CANANANV 112

Query: 4147 EDTNVISXXXXXXXXXXXXXELEEGEIDLDSEEPTEKLAVGGGEEGVLNDSVIVKEEGLN 3968
            E   +I                   ++D+D +                      KEEG  
Sbjct: 113  ESKVII-------------------DVDVDDD---------------------AKEEG-- 130

Query: 3967 HSPSVKEGPLDDSVGVREDVILNDSASVSEWEIDSKEKELV-ERVNSIRETLDNANFTEN 3791
                ++EG +D       D++LN            KE   V E++ S+  TLD  +    
Sbjct: 131  ---ELEEGEIDLDAA---DLVLN----------FGKEANFVREQLQSV--TLDETH---- 168

Query: 3790 AQKSFEGACSRLQNSLESLKEIISEGASVSSKDDLIQQSYTAIQSVNSVFCTSNHSQKEQ 3611
              KSF   CS+LQ SL +L E+     S    D LIQ   TA++++NSVF + N  QK+Q
Sbjct: 169  --KSFSMVCSKLQTSLLALGEL---ALSQDKNDILIQLFMTALRTINSVFYSMNQDQKQQ 223

Query: 3610 NKAIFSRLLTFVMSQDHPLFSPEEMKEINGILSSMDSPAVFSSFDFSNKEEEKPITCGVN 3431
            N  I SRLL    +Q   L S E++KE++ ++ S++  AVFS+   ++K     +   ++
Sbjct: 224  NTDILSRLLFHAKTQLPALLSSEQLKEVDAVILSINQSAVFSNTQDNDKVNGIKVVELLD 283

Query: 3430 HNVSDAFAENASHEVLNSSKNNFLLDSSSVESFLNKNAPDVLSEALKPALYDLRGRGVLA 3251
              VS   +ENA+ +   ++ N + L + S++S   K    V  E++KP L + + +G+  
Sbjct: 284  KKVSHKSSENANQDF--TAVNKYDLGAVSIKSSGLKE-QSVSFESVKPGLANSKAKGLSI 340

Query: 3250 PLLDLHKDHDADNLPSPTRGTTPCLPVFKLSAIGHGVAKPEVAPARVAHERENSIMHPYE 3071
            PLLDLHKDHD D LPSPTR   P  PV K +   HG+ K ++     + E+ NS++HPYE
Sbjct: 341  PLLDLHKDHDEDTLPSPTREIGPQFPVAKATQ-AHGMVKLDLPIFAGSLEKGNSLLHPYE 399

Query: 3070 TEALKAVSTYQQKFGRSSFFSQDRLPSPTPSEECDDADGDTGGEVSSSLTVGNVKNVNLP 2891
            T+ALKAVS+YQQKFGRSS F  + LPSPTPSEE D   GD GGEV+S   V N  ++N  
Sbjct: 400  TDALKAVSSYQQKFGRSSLFVSENLPSPTPSEEGDSGKGDIGGEVTSLDVVHNASHLNES 459

Query: 2890 ILARPVVSSAPHMDV------SSARTSGSMIFGSNPTLKAS-SKSRDPRLRLANSDVGAM 2732
             + +P++SS P  ++       +ART+  + F  NP+L++S +KSRDPRLRLA SD  A 
Sbjct: 460  SMGQPILSSVPQTNILDGQGLGTARTADPLSFLPNPSLRSSTAKSRDPRLRLATSDAVAQ 519

Query: 2731 DINQGPLPVVHNAPKVDPIGAVISSRKHKSVEEPLDG-PALKRQRNGST-NYDVRDAKTV 2558
            + N+  LP+     K++    +I S+K K+V+ P+ G P  KRQR+  T +  V D +  
Sbjct: 520  NTNKNILPIPDIDLKLEASLEMIGSKKQKTVDLPVFGAPLPKRQRSEQTDSIIVSDVRPS 579

Query: 2557 PESGGWLENDAGTVGPQFMNQSHLIENTENYLKKMETVVSFSSTAPGKPIVTSGNEQVPL 2378
              +GGWLE D GT G    + +   ++++N ++K+E V +  +T P   ++ +  E  P+
Sbjct: 580  TGNGGWLE-DRGTAGLPITSSNCATDSSDNDIRKLEQVTATIATIPS--VIVNAAENFPV 636

Query: 2377 TGTSTIGD----LRNYTPNPIMVQYLLKLQRLESEALKKPFLPPSSTVLTPISNSINSIV 2210
            TG ST       L++   NP +   ++K+++ +S          +S   T  ++S  SI+
Sbjct: 637  TGISTSTTLHSLLKDIAINPSIWMNIIKMEQQKS--------ADASRTTTAQASSSKSIL 688

Query: 2209 GAVPFGNVAPINPTGLGQKPAGILQVP-QIVTVEDPGKVRMKPRDPRRILHNNTYQKSAS 2033
            GAVP  +      + +GQ+  GILQ P    + ++   VRMKPRDPRR+LHN    K  +
Sbjct: 689  GAVPSTDAIAPRSSAIGQRSVGILQTPTHTASADEVAIVRMKPRDPRRVLHNTAVLKGGN 748

Query: 2032 LGTEQFKFNGAPTLSTLGSRGNSNDNKFVDQSDRNSIASQPSLPDIARQFTKNLENIANI 1853
            +G++Q K   A T +T+ + G  +    +D+    ++++ P  PDIARQFTKNL+NIA++
Sbjct: 749  VGSDQCKTGVAGTHATISNLGFQSQEDQLDRKSAVTLSTTP--PDIARQFTKNLKNIADM 806

Query: 1852 LSXXXXXXXXXPVAPQILPSQPLQTNLDRVDAKSMNSDSVERPTETCLTPDEGAAGTAKP 1673
            +S             Q   +Q LQ++  R + K   S+  ER  +  L  ++G+ G+ +P
Sbjct: 807  ISVSPSTSLSAASQTQ---TQCLQSHQSRSEGKEAVSEPSERVNDAGLASEKGSPGSLQP 863

Query: 1672 QNMWGDVELLFEGYDDQQKAAIHTERTRRLEEQKKMFAARKXXXXXXXXXXXLNSAKFIE 1493
            Q  WGDVE LFEGY DQQ+A I  ER RRLEEQKKMF+ RK           LNSAKF+E
Sbjct: 864  QISWGDVEHLFEGYSDQQRADIQRERARRLEEQKKMFSVRKLCLVLDLDHTLLNSAKFVE 923

Query: 1492 VEPMHDEILRKKEEQDREKPQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGN 1313
            ++P+H+EILRKKEEQDREKP RHLFRFPHMGMWTKLRPGIWNFLEKAS L+ELHLYTMGN
Sbjct: 924  IDPVHEEILRKKEEQDREKPCRHLFRFPHMGMWTKLRPGIWNFLEKASNLFELHLYTMGN 983

Query: 1312 KLYATEMAKVLDPKGVLFAGRVISK--XXXXXXXXXXVPKSKDMEGVLGMDSAVVIIDDS 1139
            KLYATEMAK+LDPKG LFAGRVIS+            VPKSKD+EGVLGM+SAVVIIDDS
Sbjct: 984  KLYATEMAKLLDPKGDLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDS 1043

Query: 1138 IRVWPHNKHNLIVVERYMYFPCSRRQFGLFGPSLLELDRDERPEDGTLASCLSVIERIHQ 959
            +RVWPHNK NLIVVERY+YFPCSRRQFGL GPSLLE+D DERPEDGTLASCL VI+RIHQ
Sbjct: 1044 VRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASCLGVIQRIHQ 1103

Query: 958  IFFSQRSLDDVDVRNILACEQRKILSGCRILFSRVFPVGEAKPHLHPLWQTAEQFGAVCT 779
             FF+ RS+D+ DVRNILA EQ+KIL+GCRI+FSRVFPVGEA PHLHPLWQTAEQFGAVCT
Sbjct: 1104 NFFAHRSIDEADVRNILATEQKKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCT 1163

Query: 778  TQIDEHVTHVVAISLGTDKVNWALHTGRYVVHPGWLEASALLYRRANEQDFAIK 617
            +QID+ VTHVVA SLGTDKVNWAL TGR+VVHPGW+EASALLYRRANE DFAIK
Sbjct: 1164 SQIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEHDFAIK 1217


>ref|XP_007139315.1| hypothetical protein PHAVU_008G019000g [Phaseolus vulgaris]
            gi|561012448|gb|ESW11309.1| hypothetical protein
            PHAVU_008G019000g [Phaseolus vulgaris]
          Length = 1272

 Score = 1006 bits (2602), Expect = 0.0
 Identities = 644/1344 (47%), Positives = 810/1344 (60%), Gaps = 36/1344 (2%)
 Frame = -2

Query: 4540 ETLVTMARSSILIEDVEEGEISDTASIEEISEDDFKKQDVRILKEQEVRVLKESSKPQVE 4361
            E L  + +    +EDVEEGEISDTAS+EEISE DF KQDV++           ++KP   
Sbjct: 10   ENLGKLEKMGKEVEDVEEGEISDTASVEEISEADFNKQDVKV---------NNNNKPNGS 60

Query: 4360 PSRGWESQEVFDAYKRDTYEFRRGLYNFAWAQAVQNRPLNNPIVFAVEVEPDVGSNRXXX 4181
             +R W  ++++  Y      +  GLYN AWAQAVQN+PLN+  V  ++ E +  SN    
Sbjct: 61   DARVWSVRDIYTKYPTICRGYASGLYNLAWAQAVQNKPLNDIFVMELDSEANANSNSNNS 120

Query: 4180 XXXXSGANESKEDTNVISXXXXXXXXXXXXXELEEGEIDLDSEEPTEKLAVGGGEEGVLN 4001
                S +   KE   V               ELEEGEID D++   E        E V+ 
Sbjct: 121  NRPSSVSVNPKEVMVV--------DVDREEGELEEGEIDADADPEAEA-------ESVVA 165

Query: 4000 DSVIVKEEGLNHSPSVKEGPLD-DSVGVREDVILNDSASVSEWEIDSKEKELVERVNSIR 3824
             SV+ +    +    VK+G  D + +GVR+                            + 
Sbjct: 166  ASVVSETVSDSEQFGVKKGVSDSEQLGVRD----------------------------VL 197

Query: 3823 ETLDNANFTENAQKSFEGACSRLQNSLESLKEIISEGASVSSKDDLIQQSYTAIQSVNSV 3644
            E +  AN  E    SF    SRL N+L    ++ S  A  S KDDLI+ S+ AI+ V SV
Sbjct: 198  EGVTVANVAE----SFAQTSSRLLNALP---QVFSRPAD-SEKDDLIRLSFNAIEVVYSV 249

Query: 3643 FCTSNHSQKEQNKAIFSRLLTFVMSQDHP-LFSPEEMKEINGILSSMDSPAVFSSFDFSN 3467
            F + + S KEQNK    RLL+    +    LFSPE +KEI  +++++DS     S +   
Sbjct: 250  FRSMDSSDKEQNKNSILRLLSSAKDKKQAQLFSPEHIKEIQDMMTAIDSVGALGSNEAIY 309

Query: 3466 KEEEKPITCGVNHNVSDAFAENASHEVLNSS---KNNFLLDSSSVESFLNKNAPDVL--S 3302
             E E                EN++ EV       + N  + ++ + S +     D++  S
Sbjct: 310  METEL-------QTPEIKSQENSALEVQTRGIKIQENQAVVATELVSSIKPLHSDIIGAS 362

Query: 3301 EALKPALYDLRGRGVLAPLLDLHKDHDADNLPSPTRGTTPCLPVFKLSAIGH-----GVA 3137
             ALK     ++GRGVL PLLDLHKDHDAD+LPSPTR    C PV KL ++G      G A
Sbjct: 363  RALKFGQNSIKGRGVLLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGEVMVKSGSA 422

Query: 3136 KPEVAPARVAHERENSIMHPYETEALKAVSTYQQKFGRSSFFSQDRLPSPTPSEECDDAD 2957
              ++ P ++  + E S  H YET+ALKAVSTYQQKFGRSS F+ D+LPSPTPS +CDD  
Sbjct: 423  AAKMQPGKLEVDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKLPSPTPSGDCDDMA 482

Query: 2956 GDTGGEVSSSLTVGNVKNVNLPILARPVVSSAPHMDVSSARTSGSMIF-------GSNPT 2798
             DT  EVSS+ T G + +    +L +P VS+     V  +R  G +         GS P 
Sbjct: 483  VDTNEEVSSASTSGFLTSTKPTLLDQPPVSAT---SVDKSRLLGLISSRVDAAGSGSFP- 538

Query: 2797 LKASSKSRDPRLRLANSDVGAMDINQGPLPVVHNAPKVDPIGAVISSRKHKSVEEP-LDG 2621
            +K+S+KSRDPR RL NS+  A+D NQ    V HN PKV+  G+ IS RK K+VEEP  D 
Sbjct: 539  VKSSAKSRDPRRRLINSEASAVD-NQ--FTVTHNMPKVEYAGSTIS-RKQKAVEEPSFDL 594

Query: 2620 PALKRQRNGSTN--YDVRDAKTVPESGGWLENDAGTVGPQFMNQSHLIENTENYLKKMET 2447
               KR ++   N  ++  + +T+  SGGWLE+  G  G Q + ++HLI+      K+   
Sbjct: 595  TVSKRLKSSLENIEHNTSEVRTIAGSGGWLEDITGP-GTQLIEKNHLIDKFAPEPKRTLN 653

Query: 2446 VVSFSSTAPGKPIVTSGNEQVPLTGTSTIGDL----RNYTPNPIMVQYLLKLQRLESEAL 2279
             VS SS +      +  NEQ P+T  +    L    ++   NP M+  LL  Q+   +A 
Sbjct: 654  TVS-SSGSVNFNATSIRNEQAPITSNNVPSSLPAIFKDIVVNPTMLLSLLMEQKRLVDAQ 712

Query: 2278 KKPFLPPSSTVLTPISNSINSIVGAVPFGNVAPINPTGLGQKPAGILQVPQIVT----VE 2111
                   ++ +L P S+  NS +G     ++     TGL Q   G+L V    T    ++
Sbjct: 713  NNS-ADSATNMLHPTSS--NSAMGTDSTASIVSSMATGL-QTSVGMLPVSSQSTSTAQLQ 768

Query: 2110 DP--GKVRMKPRDPRRILH-NNTYQKSASLGTEQFKFNGAPTLSTLGSRGNSNDNKFVDQ 1940
            D   GK+RMKPRDPRRILH NN+ QKS ++  E  K   +P  + L +  + N  K   +
Sbjct: 769  DDYSGKIRMKPRDPRRILHTNNSVQKSGNIVNELHKAIVSPVSNILVTGDSVNAQKLEGR 828

Query: 1939 SDRNSIASQP-SLPDIARQFTKNLENIANILSXXXXXXXXXPVAPQILPSQPLQTNLDRV 1763
             D   + +Q  + PDI RQFT+NL+NIA+I+S         P A Q   S  +  N+DR 
Sbjct: 829  MDTKLVPTQSGAAPDITRQFTRNLKNIADIMSVSQESSTHSPAA-QGFSSASVPLNVDRG 887

Query: 1762 DAKSMNSDSVERPTETCLTPDEGAAGTAKPQNMWGDVELLFEGYDDQQKAAIHTERTRRL 1583
            + KS+ S+S      T   P+  A GT++ Q+ WGDVE LFEGYD+QQKAAI  ER RR+
Sbjct: 888  EQKSVLSNSQNLHAGTGSAPEICAPGTSRSQSTWGDVEHLFEGYDEQQKAAIQRERARRI 947

Query: 1582 EEQKKMFAARKXXXXXXXXXXXLNSAKFIEVEPMHDEILRKKEEQDREKPQRHLFRFPHM 1403
            EEQ KMFAARK           LNSAKF+EV+P+H+EILRKKEE DREKP RHLFRFPHM
Sbjct: 948  EEQNKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHEEILRKKEELDREKPHRHLFRFPHM 1007

Query: 1402 GMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKXXXXX 1223
            GMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS+     
Sbjct: 1008 GMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDTD 1067

Query: 1222 XXXXXV--PKSKDMEGVLGMDSAVVIIDDSIRVWPHNKHNLIVVERYMYFPCSRRQFGLF 1049
                    PKSKD+EGVLGM+SAVVIIDDS+RVWPHNK NLIVVERY YFPCSRRQFGL 
Sbjct: 1068 SVDGEERAPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLP 1127

Query: 1048 GPSLLELDRDERPEDGTLASCLSVIERIHQIFFSQRSLDDVDVRNILACEQRKILSGCRI 869
            GPSLLE+D DERPE GTLAS L+VIER+HQ FFS +SL++VDVRNILA EQRKILSGCRI
Sbjct: 1128 GPSLLEIDHDERPEAGTLASSLAVIERLHQNFFSSQSLEEVDVRNILASEQRKILSGCRI 1187

Query: 868  LFSRVFPVGEAKPHLHPLWQTAEQFGAVCTTQIDEHVTHVVAISLGTDKVNWALHTGRYV 689
            +FSRVFPVGEA PHLHPLWQTAEQFGAVCT QID+ VTHVVA SLGTDKVNWAL TGR+V
Sbjct: 1188 VFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDDQVTHVVANSLGTDKVNWALSTGRFV 1247

Query: 688  VHPGWLEASALLYRRANEQDFAIK 617
            VHPGW+EASALLYRRANEQDFAIK
Sbjct: 1248 VHPGWVEASALLYRRANEQDFAIK 1271


>ref|XP_004310239.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Fragaria vesca subsp. vesca]
          Length = 1230

 Score = 1004 bits (2596), Expect = 0.0
 Identities = 638/1329 (48%), Positives = 789/1329 (59%), Gaps = 29/1329 (2%)
 Frame = -2

Query: 4516 SSILIEDVEEGEISDTASIEEISEDDFKKQDVRILKEQEVRVLKESSKPQVEPSRGWESQ 4337
            SS  + DVEEGEI D+ S+EEISE+DF KQ+ + ++        +S+    + +R W   
Sbjct: 6    SSREVVDVEEGEIPDSNSVEEISEEDFVKQESKAVE-------PKSNGGSGDGARFWTFH 58

Query: 4336 EVFDAYKRDTYEFRRGLYNFAWAQAVQNRPLNNPIVFAVEVEPDVGSNRXXXXXXXSGAN 4157
            EV  A+         GL N AWAQAVQN+P N+     V+++ D  S +         + 
Sbjct: 59   EVL-AHPHFRGIGGGGLANLAWAQAVQNKPFND---LLVKLDSDEKSKQQQQQRSSVSSG 114

Query: 4156 ESKEDTNVISXXXXXXXXXXXXXELEEGEIDLDSEEPTEKLAVGGGEEGVLNDSVIVKEE 3977
              K    VI              ELEEGEI  DSE      A G    GV          
Sbjct: 115  NEKV---VIIDSGDEMDVEKEEEELEEGEIGFDSECGDNDKAAGSVGNGV---------- 161

Query: 3976 GLNHSPSVKEGPLDDSVGVREDVILNDSASVSEWEIDSKEKELVERVNSIRETLDNANFT 3797
                                             WE         +RVN +RE L++   T
Sbjct: 162  ---------------------------------WE---------KRVNLLREALESLTIT 179

Query: 3796 ENAQKSFEGACSRLQNSLESLKEIISEGASVSSKDDLIQQSYTAIQSVNSVFCTSNHSQK 3617
            E A+KSF   C R  +SLESL+ ++SE  +VS+K+ L+QQ + A+++++SVF + +  QK
Sbjct: 180  E-AEKSFGDVCHRFLDSLESLRGVLSE-INVSTKEALVQQLFNAVRAISSVFRSMSADQK 237

Query: 3616 EQNKAIFSRLLTFVMSQDHPLFSPEEMKEINGILSSMDSPAVFSSFDFSNKEEEKPITC- 3440
            EQNK + SR+L+   S   P F  E++KEI  + SSMDSP   +        +E  I C 
Sbjct: 238  EQNKDVLSRILSSAKSDPSP-FPAEQLKEIEVMSSSMDSPQTKAG------TKENGIQCI 290

Query: 3439 -GVNHNVSDAFAENASHEVLNSSKNNFLLDSSSVESFLNKNAPDVLSEALKPALYDLRGR 3263
             GV    SD    NASH V   + N     S +  S ++ N P++ SE  +      +GR
Sbjct: 291  NGVYKTDSDTSGANASH-VFTYAANT---GSDTQVSVVHSN-PNISSEVPRSGSSSFKGR 345

Query: 3262 GVLAPLLDLHKDHDADNLPSPTRGTTPCLPVFKLSAIGHG-VAKPEVAPARVAHERENSI 3086
            G++ PLLDLH DHD D+LPSPTR    C P  K   + +G V K     AR A + E S 
Sbjct: 346  GLMLPLLDLHMDHDEDSLPSPTREPPACFPAQKPVVVENGMVKKSGWETARAALDVEGSK 405

Query: 3085 MHPYETEALKAVSTYQQKFGRSSFFSQDRLPSPTPS-EECDDADGDTGGEVSSSLTVGNV 2909
            MH YETEALKAVS+YQQKF R+SF + + LPSPTPS EE D+ D    GEVSSS    NV
Sbjct: 406  MHVYETEALKAVSSYQQKFSRNSFLTSE-LPSPTPSEEEGDNGDDAAVGEVSSSSASNNV 464

Query: 2908 KNVNLPILARPVVSSAPHMDVS---------SARTSGSMIFGSNPTLKASSKSRDPRLRL 2756
            +    P+  R VVSS P   +          +A+T+  +  GSN   K+S+KSRDPRLR 
Sbjct: 465  RTPQPPVSGRQVVSSVPATTLPGSSGMHGLITAKTASPVSLGSNMPNKSSAKSRDPRLRF 524

Query: 2755 ANSDVGAMDINQGPLPVVHNAPKVDPIGAVISSRKHKSVEEP-LDGPALKRQRNGSTNYD 2579
            ANSD GA+ +NQ     VHNAPKVD +   +SSRKHKS E+   DGP  KRQR G+ +  
Sbjct: 525  ANSDAGALTLNQQSSIQVHNAPKVDSV-ITLSSRKHKSPEDSNFDGPESKRQR-GANSVV 582

Query: 2578 VRDAKTVPESGGWLENDAGTVGPQFMNQSHLIENTENYLKKMETVVSFSSTAPGKPI-VT 2402
               AKT   +G WLE D  +VGP  +N++  +E  E   +KM  V S   T  G      
Sbjct: 583  GWGAKTSFGNGVWLE-DGSSVGPHLINRNQTVEKKEADPRKMVNVSSSPGTVEGNSNGQN 641

Query: 2401 SGNEQVPLTGTSTI---GDLRNYTPNPIMVQYLLKLQRLESEALKKPFLPPSSTVLT--P 2237
            + NE+VPL   S +      ++   NP M+  +LKL    +EA +    P     LT  P
Sbjct: 642  TANEKVPLVAPSLVSLPAIFKDIAVNPTMLVNILKL----AEAQQNAAAPARKESLTYPP 697

Query: 2236 ISNSINSIVGAVPFGNVAPINPTGLGQKPAGILQVPQIVTVEDP----GKVRMKPRDPRR 2069
             S+SI         G  A +N      K +G L  P I + + P    GK+RMK RDPRR
Sbjct: 698  SSSSIP--------GTAALVNDP---SKTSGALLTPTICSQKTPTDEAGKIRMKLRDPRR 746

Query: 2068 ILHNNTYQKSASLGTEQFKFNGAPTLSTLGSRGNSNDNKFVDQSDRNSIASQP---SLPD 1898
            +LH N  Q S S+G EQ +    P  S+  +  + N  K   Q+D NS+ SQ      PD
Sbjct: 747  LLHGNALQNSGSVGHEQSRNIVPPLSSSQANNDDMNGKKQDSQADNNSVTSQSGALGAPD 806

Query: 1897 IARQFTKNLENIANILSXXXXXXXXXPVAPQILPSQPLQTNLDRVDAKSMNSDSVERPTE 1718
            IA QFTKNL+NIA+I+S                PSQ L T L  ++  +++  + E+ T 
Sbjct: 807  IASQFTKNLKNIADIISVSQVS------TSPATPSQNLSTELISINPDNVDLKAEEQHTG 860

Query: 1717 TCLTPDEGAAGTAKPQNMWGDVELLFEGYDDQQKAAIHTERTRRLEEQKKMFAARKXXXX 1538
            +       AAG ++    WGDVE LFEGYDD+QKAAI  ER RR+EEQKKMFAA K    
Sbjct: 861  SISASVPTAAGASRSPATWGDVEHLFEGYDDKQKAAIQRERARRIEEQKKMFAAHKLCLV 920

Query: 1537 XXXXXXXLNSAKFIEVEPMHDEILRKKEEQDREKPQRHLFRFPHMGMWTKLRPGIWNFLE 1358
                   LNSAKF+EV+P+HDEILRKKEEQDR++PQRHLFRF HMGMWTKLRPG+W FLE
Sbjct: 921  LDLDHTLLNSAKFVEVDPVHDEILRKKEEQDRKEPQRHLFRFQHMGMWTKLRPGVWKFLE 980

Query: 1357 KASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISK--XXXXXXXXXXVPKSKDME 1184
            KAS L+E+HLYTMGNKLYATEMAKVLDP G LFAGRVIS+            VPKSKD+E
Sbjct: 981  KASHLFEMHLYTMGNKLYATEMAKVLDPTGALFAGRVISRGDDGDPYDGDERVPKSKDLE 1040

Query: 1183 GVLGMDSAVVIIDDSIRVWPHNKHNLIVVERYMYFPCSRRQFGLFGPSLLELDRDERPED 1004
            GVLGM+SAVVIIDDS+RVWPHNK NLIVVERY YFPCSRRQFGL GPSLLE+D DER ED
Sbjct: 1041 GVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERHED 1100

Query: 1003 GTLASCLSVIERIHQIFFSQRSLDDVDVRNILACEQRKILSGCRILFSRVFPVGEAKPHL 824
            GTLAS L+VIE+IHQIFFS  SLD+ DVRNILA EQ+KIL GCRI+FSRVFPVGE  PHL
Sbjct: 1101 GTLASSLAVIEKIHQIFFSHPSLDEADVRNILASEQQKILGGCRIVFSRVFPVGEVNPHL 1160

Query: 823  HPLWQTAEQFGAVCTTQIDEHVTHVVAISLGTDKVNWALHTGRYVVHPGWLEASALLYRR 644
            HPLWQTAEQFGAVCT QID+ VTHVVA SLGTDKVNWAL +G+YVVHPGW+EASALLYRR
Sbjct: 1161 HPLWQTAEQFGAVCTNQIDDQVTHVVANSLGTDKVNWALSSGKYVVHPGWVEASALLYRR 1220

Query: 643  ANEQDFAIK 617
            ANEQDFAIK
Sbjct: 1221 ANEQDFAIK 1229


>ref|XP_004492028.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like isoform X1 [Cicer arietinum]
          Length = 1247

 Score = 1000 bits (2586), Expect = 0.0
 Identities = 625/1331 (46%), Positives = 801/1331 (60%), Gaps = 35/1331 (2%)
 Frame = -2

Query: 4504 IEDVEEGEISDTASIEEISEDDFKKQDVRILKEQEVRVLKESSKPQVEP---SRGWESQE 4334
            +EDVEEGEISDTAS+ EISE+DF KQDV       V+V   S   + +    +R W   +
Sbjct: 22   VEDVEEGEISDTASVVEISEEDFNKQDV-------VKVNNNSDSDKAKTGGDARVWAVHD 74

Query: 4333 VFDAYKRDTYEFRRGLYNFAWAQAVQNRPLNNPIVFAVEVEPDVGSNRXXXXXXXSGANE 4154
            ++  Y      +  GLYN AWAQAVQN+PLN+  +F +E++ D  +N          +N 
Sbjct: 75   LYSKYPTICRGYASGLYNLAWAQAVQNKPLND--IFVMELDSDSNANANSNND----SNN 128

Query: 4153 SKEDTNV-ISXXXXXXXXXXXXXELEEGEIDLDSEEPTEKLAVGGGEEGVLNDSVIVKEE 3977
               D N+ +              ELEEGEID D +  T  + VGG               
Sbjct: 129  GNGDLNMPLKEVVMVDDDEREEGELEEGEIDGDDD--TGGVMVGG--------------- 171

Query: 3976 GLNHSPSVKEGPLDDSVGVREDVILNDSASVSEWEIDSKEKELVERVNSIRETLDNANFT 3797
                                     + S +VSE              + IR+ L+     
Sbjct: 172  -------------------------DGSETVSE--------------SDIRDFLEGVTVA 192

Query: 3796 ENAQKSFEGACSRLQNSLESLKEIISEGASVSSKDDLIQQSYTAIQSVNSVFCTSNHSQK 3617
             N  +SF    SRL   L+S    +  G +VS KD +I+  Y AI+ V+SVFC+ ++ QK
Sbjct: 193  -NVAESFAETISRLLRVLQSK---LLSGPAVSEKDYVIRLLYNAIEIVHSVFCSMDNLQK 248

Query: 3616 EQNKAIFSRLLTFVMSQDHPLFSPEEMKEINGILSSMDS-PAVFSSFDFSNKEEEKPITC 3440
            E NK    RLL F+ ++   LFSPE MKEI  +++++D+  A+ +S    N E+   +  
Sbjct: 249  EDNKDNIIRLLYFLKNEHTQLFSPEHMKEIQVMITAIDTVDALGNSVVVGNGEKLDTLDI 308

Query: 3439 GVNHNVSDAFAENASHEVLNSSKNNFLLDSSSVESFLNKNAPDVLSEALKPALYDLRGRG 3260
                          + E+++SSK   L+ S+  E+          SEAL     +++GRG
Sbjct: 309  KTRQ-----IQGLKASELISSSK---LVHSNLTEA----------SEALLSGQSNIKGRG 350

Query: 3259 VLAPLLDLHKDHDADNLPSPTRGTTPCLPVFKLSAIGHGVAKPEVAPA------RVAHER 3098
            V+ PL DLHK HD D+LPSPTR      PV KL ++G G+ +P +  A      ++  + 
Sbjct: 351  VMLPLFDLHKVHDLDSLPSPTREAPSFFPVNKLFSVGDGMDRPGLPSAGKTEAVKMELDT 410

Query: 3097 ENSIMHPYETEALKAVSTYQQKFGRSSFFSQDRLPSPTPSEECDDADGDTGGEVSS---- 2930
            ENS  H YET+ALKAVSTYQQKFGRSS+F+ D+ PSPTPS +C++   D   EVSS    
Sbjct: 411  ENSKNHLYETDALKAVSTYQQKFGRSSYFTDDKFPSPTPSGDCEEGVADANEEVSSASIA 470

Query: 2929 -SLTVGNVKNVNLPILARPVVSSAPHMDVSSARTSGSMIFGSNPTLKASSKSRDPRLRLA 2753
             SLT        +P+ +  V  S+ H  ++S   + S +      +K S++SRDPRLR  
Sbjct: 471  VSLTSSKPLLDQMPVSSTSVDRSSMHGLINSRIEAASSV---TYPVKTSARSRDPRLRFI 527

Query: 2752 NSDVGAMDINQGPLPVVHNAPKVDPIGAVISSRKHKSVEE-PLDGPALKRQRNGSTN--Y 2582
            NSD  A+D+NQ      +N PKV+  G VIS RK K+ EE  LD  A KR R+   N  +
Sbjct: 528  NSDASALDLNQSL--GTNNMPKVENAGRVIS-RKQKTTEELSLDATAPKRLRSSLENSRH 584

Query: 2581 DVRDAKTVPESGGWLENDAGTVGPQFMNQSHLIENTENYLKKMETVVSFSSTAPGKPIVT 2402
            + R+ +T+  +GGWLE +    G   + ++HL++  E  LKK  +  S  ST     + +
Sbjct: 585  NTREERTMAGNGGWLEENR-VAGSHLIERNHLMQKGETELKKTMSTSSGYST-----VTS 638

Query: 2401 SGNEQVPLTGTSTI----GDLRNYTPNPIMVQYLL--KLQRLESEALKKPFLPPSSTVLT 2240
            +GNEQ P+T ++T     G L+N   NP M+  +L  + QRL +EA KKP    +ST+  
Sbjct: 639  NGNEQAPVTVSNTAAALPGLLKNIAVNPTMLLNILLEQQQRLAAEANKKPVDSATSTM-- 696

Query: 2239 PISNSINSIVGAVPFGNVAPINPTGLGQKPAGILQVPQIVT------VEDPGKVRMKPRD 2078
               +  NS  G     N  P    GL Q   G+L             +ED GK+RMKPRD
Sbjct: 697  ---HLTNSARGPDATVNTGPAMTAGLPQSSVGMLPASTQAASMAHTLLEDSGKIRMKPRD 753

Query: 2077 PRRILH-NNTYQKSASLGTEQFKFNGAPTLSTLGSRGNSNDNKFVDQSDRNSIASQPSL- 1904
            PRRILH +++ QKS S G+EQ K   +PT +  G+ GN N  K   + +     +Q S  
Sbjct: 754  PRRILHGSSSLQKSGSTGSEQSKSVVSPTSNNQGNGGNVNAQKLDVRVETKLAPTQSSAQ 813

Query: 1903 PDIARQFTKNLENIANILSXXXXXXXXXPVAPQILPSQPLQTNLDRVDAKSMNSDSVERP 1724
            PDI RQFTKNL+NIA+I+S         P   Q + S  +   LD+ + KS   +S    
Sbjct: 814  PDITRQFTKNLKNIADIMSVSQEPSTQLPATTQNVSSASVPFTLDKAELKSGVPNSQNLQ 873

Query: 1723 TETCLTPDEGAAGTAKPQNMWGDVELLFEGYDDQQKAAIHTERTRRLEEQKKMFAARKXX 1544
                  P+  A G+++ Q+ W DVE LFEGYD++QKAAI  ER RRLEEQ KMFA++K  
Sbjct: 874  DGVGSAPETCAPGSSRSQSTWADVEHLFEGYDEKQKAAIQRERARRLEEQNKMFASKKLC 933

Query: 1543 XXXXXXXXXLNSAKFIEVEPMHDEILRKKEEQDREKPQRHLFRFPHMGMWTKLRPGIWNF 1364
                     LNSAKF+EV+P+HDEILRKKEEQDREKP RHLFRFPHMGMWTKLRPG+WNF
Sbjct: 934  LVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGVWNF 993

Query: 1363 LEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKXXXXXXXXXXV--PKSKD 1190
            LEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS+             PKSKD
Sbjct: 994  LEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDTESVDGDERAPKSKD 1053

Query: 1189 MEGVLGMDSAVVIIDDSIRVWPHNKHNLIVVERYMYFPCSRRQFGLFGPSLLELDRDERP 1010
            +EGV+GM+S+VVI+DDS+RVWPHNK NLIVVERY YFPCSRRQFGL GPSLLE+D DERP
Sbjct: 1054 LEGVMGMESSVVIVDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERP 1113

Query: 1009 EDGTLASCLSVIERIHQIFFSQRSLDDVDVRNILACEQRKILSGCRILFSRVFPVGEAKP 830
            E GTLAS L+VIERIHQ FF+ +SL++VDVRNILA EQRKIL+GCRI+FSRVFPVGEA P
Sbjct: 1114 EAGTLASSLAVIERIHQNFFASQSLEEVDVRNILASEQRKILAGCRIVFSRVFPVGEANP 1173

Query: 829  HLHPLWQTAEQFGAVCTTQIDEHVTHVVAISLGTDKVNWALHTGRYVVHPGWLEASALLY 650
            HLHPLWQTAEQFGAVC  QID+ VTHVVA SLGTDKVNWA+ TGR+VVHPGW+EASALLY
Sbjct: 1174 HLHPLWQTAEQFGAVCINQIDDQVTHVVANSLGTDKVNWAISTGRFVVHPGWVEASALLY 1233

Query: 649  RRANEQDFAIK 617
            RRANEQDFAIK
Sbjct: 1234 RRANEQDFAIK 1244


>ref|XP_004492029.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like isoform X2 [Cicer arietinum]
          Length = 1227

 Score =  995 bits (2572), Expect = 0.0
 Identities = 622/1330 (46%), Positives = 799/1330 (60%), Gaps = 34/1330 (2%)
 Frame = -2

Query: 4504 IEDVEEGEISDTASIEEISEDDFKKQDVRILKEQEVRVLKESSKPQVEP---SRGWESQE 4334
            +EDVEEGEISDTAS+ EISE+DF KQDV       V+V   S   + +    +R W   +
Sbjct: 22   VEDVEEGEISDTASVVEISEEDFNKQDV-------VKVNNNSDSDKAKTGGDARVWAVHD 74

Query: 4333 VFDAYKRDTYEFRRGLYNFAWAQAVQNRPLNNPIVFAVEVEPDVGSNRXXXXXXXSGANE 4154
            ++  Y      +  GLYN AWAQAVQN+PLN+  +F +E++ D  +N           ++
Sbjct: 75   LYSKYPTICRGYASGLYNLAWAQAVQNKPLND--IFVMELDSDSNANVVMVD------DD 126

Query: 4153 SKEDTNVISXXXXXXXXXXXXXELEEGEIDLDSEEPTEKLAVGGGEEGVLNDSVIVKEEG 3974
             +E+                   LEEGEID D +  T  + VGG                
Sbjct: 127  EREEGE-----------------LEEGEIDGDDD--TGGVMVGG---------------- 151

Query: 3973 LNHSPSVKEGPLDDSVGVREDVILNDSASVSEWEIDSKEKELVERVNSIRETLDNANFTE 3794
                                    + S +VSE              + IR+ L+      
Sbjct: 152  ------------------------DGSETVSE--------------SDIRDFLEGVTVA- 172

Query: 3793 NAQKSFEGACSRLQNSLESLKEIISEGASVSSKDDLIQQSYTAIQSVNSVFCTSNHSQKE 3614
            N  +SF    SRL   L+S    +  G +VS KD +I+  Y AI+ V+SVFC+ ++ QKE
Sbjct: 173  NVAESFAETISRLLRVLQSK---LLSGPAVSEKDYVIRLLYNAIEIVHSVFCSMDNLQKE 229

Query: 3613 QNKAIFSRLLTFVMSQDHPLFSPEEMKEINGILSSMDS-PAVFSSFDFSNKEEEKPITCG 3437
             NK    RLL F+ ++   LFSPE MKEI  +++++D+  A+ +S    N E+   +   
Sbjct: 230  DNKDNIIRLLYFLKNEHTQLFSPEHMKEIQVMITAIDTVDALGNSVVVGNGEKLDTLDIK 289

Query: 3436 VNHNVSDAFAENASHEVLNSSKNNFLLDSSSVESFLNKNAPDVLSEALKPALYDLRGRGV 3257
                         + E+++SSK   L+ S+  E+          SEAL     +++GRGV
Sbjct: 290  TRQ-----IQGLKASELISSSK---LVHSNLTEA----------SEALLSGQSNIKGRGV 331

Query: 3256 LAPLLDLHKDHDADNLPSPTRGTTPCLPVFKLSAIGHGVAKPEVAPA------RVAHERE 3095
            + PL DLHK HD D+LPSPTR      PV KL ++G G+ +P +  A      ++  + E
Sbjct: 332  MLPLFDLHKVHDLDSLPSPTREAPSFFPVNKLFSVGDGMDRPGLPSAGKTEAVKMELDTE 391

Query: 3094 NSIMHPYETEALKAVSTYQQKFGRSSFFSQDRLPSPTPSEECDDADGDTGGEVSS----- 2930
            NS  H YET+ALKAVSTYQQKFGRSS+F+ D+ PSPTPS +C++   D   EVSS     
Sbjct: 392  NSKNHLYETDALKAVSTYQQKFGRSSYFTDDKFPSPTPSGDCEEGVADANEEVSSASIAV 451

Query: 2929 SLTVGNVKNVNLPILARPVVSSAPHMDVSSARTSGSMIFGSNPTLKASSKSRDPRLRLAN 2750
            SLT        +P+ +  V  S+ H  ++S   + S +      +K S++SRDPRLR  N
Sbjct: 452  SLTSSKPLLDQMPVSSTSVDRSSMHGLINSRIEAASSV---TYPVKTSARSRDPRLRFIN 508

Query: 2749 SDVGAMDINQGPLPVVHNAPKVDPIGAVISSRKHKSVEE-PLDGPALKRQRNGSTN--YD 2579
            SD  A+D+NQ      +N PKV+  G VIS RK K+ EE  LD  A KR R+   N  ++
Sbjct: 509  SDASALDLNQSL--GTNNMPKVENAGRVIS-RKQKTTEELSLDATAPKRLRSSLENSRHN 565

Query: 2578 VRDAKTVPESGGWLENDAGTVGPQFMNQSHLIENTENYLKKMETVVSFSSTAPGKPIVTS 2399
             R+ +T+  +GGWLE +    G   + ++HL++  E  LKK  +  S  ST     + ++
Sbjct: 566  TREERTMAGNGGWLEENR-VAGSHLIERNHLMQKGETELKKTMSTSSGYST-----VTSN 619

Query: 2398 GNEQVPLTGTSTI----GDLRNYTPNPIMVQYLL--KLQRLESEALKKPFLPPSSTVLTP 2237
            GNEQ P+T ++T     G L+N   NP M+  +L  + QRL +EA KKP    +ST+   
Sbjct: 620  GNEQAPVTVSNTAAALPGLLKNIAVNPTMLLNILLEQQQRLAAEANKKPVDSATSTM--- 676

Query: 2236 ISNSINSIVGAVPFGNVAPINPTGLGQKPAGILQVPQIVT------VEDPGKVRMKPRDP 2075
              +  NS  G     N  P    GL Q   G+L             +ED GK+RMKPRDP
Sbjct: 677  --HLTNSARGPDATVNTGPAMTAGLPQSSVGMLPASTQAASMAHTLLEDSGKIRMKPRDP 734

Query: 2074 RRILH-NNTYQKSASLGTEQFKFNGAPTLSTLGSRGNSNDNKFVDQSDRNSIASQPSL-P 1901
            RRILH +++ QKS S G+EQ K   +PT +  G+ GN N  K   + +     +Q S  P
Sbjct: 735  RRILHGSSSLQKSGSTGSEQSKSVVSPTSNNQGNGGNVNAQKLDVRVETKLAPTQSSAQP 794

Query: 1900 DIARQFTKNLENIANILSXXXXXXXXXPVAPQILPSQPLQTNLDRVDAKSMNSDSVERPT 1721
            DI RQFTKNL+NIA+I+S         P   Q + S  +   LD+ + KS   +S     
Sbjct: 795  DITRQFTKNLKNIADIMSVSQEPSTQLPATTQNVSSASVPFTLDKAELKSGVPNSQNLQD 854

Query: 1720 ETCLTPDEGAAGTAKPQNMWGDVELLFEGYDDQQKAAIHTERTRRLEEQKKMFAARKXXX 1541
                 P+  A G+++ Q+ W DVE LFEGYD++QKAAI  ER RRLEEQ KMFA++K   
Sbjct: 855  GVGSAPETCAPGSSRSQSTWADVEHLFEGYDEKQKAAIQRERARRLEEQNKMFASKKLCL 914

Query: 1540 XXXXXXXXLNSAKFIEVEPMHDEILRKKEEQDREKPQRHLFRFPHMGMWTKLRPGIWNFL 1361
                    LNSAKF+EV+P+HDEILRKKEEQDREKP RHLFRFPHMGMWTKLRPG+WNFL
Sbjct: 915  VLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGVWNFL 974

Query: 1360 EKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKXXXXXXXXXXV--PKSKDM 1187
            EKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS+             PKSKD+
Sbjct: 975  EKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDTESVDGDERAPKSKDL 1034

Query: 1186 EGVLGMDSAVVIIDDSIRVWPHNKHNLIVVERYMYFPCSRRQFGLFGPSLLELDRDERPE 1007
            EGV+GM+S+VVI+DDS+RVWPHNK NLIVVERY YFPCSRRQFGL GPSLLE+D DERPE
Sbjct: 1035 EGVMGMESSVVIVDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPE 1094

Query: 1006 DGTLASCLSVIERIHQIFFSQRSLDDVDVRNILACEQRKILSGCRILFSRVFPVGEAKPH 827
             GTLAS L+VIERIHQ FF+ +SL++VDVRNILA EQRKIL+GCRI+FSRVFPVGEA PH
Sbjct: 1095 AGTLASSLAVIERIHQNFFASQSLEEVDVRNILASEQRKILAGCRIVFSRVFPVGEANPH 1154

Query: 826  LHPLWQTAEQFGAVCTTQIDEHVTHVVAISLGTDKVNWALHTGRYVVHPGWLEASALLYR 647
            LHPLWQTAEQFGAVC  QID+ VTHVVA SLGTDKVNWA+ TGR+VVHPGW+EASALLYR
Sbjct: 1155 LHPLWQTAEQFGAVCINQIDDQVTHVVANSLGTDKVNWAISTGRFVVHPGWVEASALLYR 1214

Query: 646  RANEQDFAIK 617
            RANEQDFAIK
Sbjct: 1215 RANEQDFAIK 1224


>ref|XP_002304714.2| hypothetical protein POPTR_0003s16280g [Populus trichocarpa]
            gi|550343307|gb|EEE79693.2| hypothetical protein
            POPTR_0003s16280g [Populus trichocarpa]
          Length = 1030

 Score =  981 bits (2536), Expect = 0.0
 Identities = 563/1051 (53%), Positives = 693/1051 (65%), Gaps = 46/1051 (4%)
 Frame = -2

Query: 3631 NHSQKEQNKAIFSRLLTFVMSQDHPLFSPEEMKEINGILSSMDSPAVFSSFDFSNKEEEK 3452
            N   KEQNK +F R L+ V S D   FSPE  KEI  ++SS+DS  + SS   + +E E 
Sbjct: 2    NQKLKEQNKGVFMRFLSLVNSHDPSFFSPEHTKEIELMVSSLDSHDILSS-SRAGEERET 60

Query: 3451 PITCGVNHNVSDAFAENASHEVLNSSKNNFLLDSSSVESFLNKNAPDVLSEALKPALYDL 3272
             ++  VN   +D+ ++ A +++   ++       S+ ESF++ N P+   E  KP +   
Sbjct: 61   QVSGKVNERDNDSLSKTAGYDLTTMNRL-----PSAAESFVH-NKPNFSIEPPKPGVPSF 114

Query: 3271 RGRGVLAPLLDLHKDHDADNLPSPTRGTTPCLPVFKLSAIGHGVAKPEVAPARVAHEREN 3092
            + RGVL PLLDL K HD D+LPSPTR T P  PV +L  IG G+    +   +VA   E 
Sbjct: 115  KSRGVLLPLLDLKKFHDEDSLPSPTRETAPSFPVQRLLPIGDGMISSGLPVPKVASITEE 174

Query: 3091 SIMHPYETEALKAVSTYQQKFGRSSFFSQDRLPSPTPSEECDDADGDTGGEVSSSLTVGN 2912
              +HPYET+ALKAVS+YQ+KF  +SFF+ + LPSPTPSEE  + DGDT GEVSSS TV N
Sbjct: 175  PRVHPYETDALKAVSSYQKKFNLNSFFTNE-LPSPTPSEESGNGDGDTAGEVSSSSTV-N 232

Query: 2911 VKNVNLPILARPVVSSAP--------------HMDVSS------ARTSGSMIFGSNPTLK 2792
             + VN P+  R   S +P              H++ SS       R S  +  G++ T+K
Sbjct: 233  YRTVNPPVSDRKSASPSPSPPPPPPPPPPPPPHLNNSSIRVVIPTRNSAPVSSGTSSTVK 292

Query: 2791 ASSKSRDPRLRLANSDVGAMDINQGPLPVVHNAPKVDPIGAVISSRKHKSVEEPLDGPAL 2612
            AS+KSRDPRLR  N+D  A+D NQ  L +V+N P+ +P GA+  SRK K  E+ LDG +L
Sbjct: 293  ASAKSRDPRLRYVNTDASALDQNQRTLLMVNNPPRAEPSGAIAGSRKQKIEEDVLDGTSL 352

Query: 2611 KRQRNGSTNYDV-RDAKTVPESGGWLENDAGTVGPQFMNQSHLIENTENYLKKMETVVSF 2435
            KRQRN   N+ V RD +++  +GGWLE D     PQ +N++   EN E   +    VV  
Sbjct: 353  KRQRNSFDNFGVVRDIRSMTGTGGWLE-DTDMAEPQTVNKNQWAENAEPGQRINNGVVCP 411

Query: 2434 SSTAPGKPIVTSGNEQVPLTGTSTIGD-----------------LRNYTPNPIMVQYLLK 2306
            S+ +    +  SGN QVP+ G +TI                   L++ T NP M+  +LK
Sbjct: 412  STGSVMSSVSCSGNVQVPVMGINTIAGSEQAPVTSTTTASLPDLLKDITVNPTMLINILK 471

Query: 2305 L---QRLESEALKKPFLPPSSTVLTPISNSINSIVGAVPFGNVAPINPTGLGQKPAGILQ 2135
            +   QRL  +  +K   P  ST   P SN++   +GA+P  N     P+G+  + AG  Q
Sbjct: 472  MGQQQRLALDGQQKLADPAKSTSHPPSSNTV---LGAIPEVNAVSSLPSGILPRSAGKAQ 528

Query: 2134 VP-QIVTVEDPGKVRMKPRDPRRILHNNTYQKSASLGTEQFKFNGAPTLSTLGSRGNSND 1958
             P QI T ++ GK+RMKPRDPRR+LHNN  Q++ SLG+EQFK     T +T G++ N N 
Sbjct: 529  GPSQIATTDESGKIRMKPRDPRRVLHNNALQRAGSLGSEQFKTTTL-TSTTQGTKDNQNL 587

Query: 1957 NKFVDQSDRNSIASQPSLPDIARQFTKNLENIANILSXXXXXXXXXPVAPQILPSQPLQT 1778
             K    ++   +      PDI+  FTK+L+NIA+I+S          V+ Q + SQP+Q 
Sbjct: 588  QKQEGLAELKPVVP----PDISSPFTKSLKNIADIVSVSQTCTTPPFVS-QNVASQPVQI 642

Query: 1777 NLDRVDAKS--MNSDSVERPTETCLTPDEGAAGTAKPQNMWGDVELLFEGYDDQQKAAIH 1604
              DRVD K+   NSD    P  +     E  A ++  QN W DVE LFEGYDDQQKAAI 
Sbjct: 643  KSDRVDGKTGISNSDQKMGPASS----PEVVAASSLSQNTWEDVEHLFEGYDDQQKAAIQ 698

Query: 1603 TERTRRLEEQKKMFAARKXXXXXXXXXXXLNSAKFIEVEPMHDEILRKKEEQDREKPQRH 1424
             ER RR+EEQKK+FAARK           LNSAKF+EV+P+HDEILRKKEEQDREKP RH
Sbjct: 699  RERARRIEEQKKLFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPYRH 758

Query: 1423 LFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1244
            LFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRV+
Sbjct: 759  LFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVV 818

Query: 1243 SKXXXXXXXXXXV--PKSKDMEGVLGMDSAVVIIDDSIRVWPHNKHNLIVVERYMYFPCS 1070
            S+             PKSKD+EGVLGM+S VVIIDDS+RVWPHNK NLIVVERY+YFPCS
Sbjct: 819  SRGDDGDLLDGDERVPKSKDLEGVLGMESGVVIIDDSLRVWPHNKLNLIVVERYIYFPCS 878

Query: 1069 RRQFGLFGPSLLELDRDERPEDGTLASCLSVIERIHQIFFSQRSLDDVDVRNILACEQRK 890
            RRQFGL GPSLLE+D DERPEDGTLA  L+VIERIHQ FF+  SLD+ DVRNILA EQRK
Sbjct: 879  RRQFGLPGPSLLEIDHDERPEDGTLACSLAVIERIHQNFFTHHSLDEADVRNILASEQRK 938

Query: 889  ILSGCRILFSRVFPVGEAKPHLHPLWQTAEQFGAVCTTQIDEHVTHVVAISLGTDKVNWA 710
            IL+GCRI+FSRVFPVGE  PHLHPLWQ+AEQFGAVCT QIDE VTHVVA SLGTDKVNWA
Sbjct: 939  ILAGCRIVFSRVFPVGEVNPHLHPLWQSAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWA 998

Query: 709  LHTGRYVVHPGWLEASALLYRRANEQDFAIK 617
            L TGR+VVHPGW+EASALLYRRANEQDFAIK
Sbjct: 999  LSTGRFVVHPGWVEASALLYRRANEQDFAIK 1029


>ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
            gi|223548611|gb|EEF50102.1| RNA polymerase II ctd
            phosphatase, putative [Ricinus communis]
          Length = 1195

 Score =  970 bits (2508), Expect = 0.0
 Identities = 625/1352 (46%), Positives = 787/1352 (58%), Gaps = 56/1352 (4%)
 Frame = -2

Query: 4504 IEDVEEGEISDTASIEEISEDDFKKQDVRILKEQEVRVLKESSKPQVEPSRGWESQEVFD 4325
            + DVEEGEISDTASIEEISE+DF KQDV ++K        E++K + + +       + D
Sbjct: 12   VGDVEEGEISDTASIEEISEEDFNKQDVVVVKPPSSN--NETTKQKEQGNGNGRVWTISD 69

Query: 4324 AYKRDTYEFR-RGLYNFAWAQAVQ------NRPLNNPIVFAVEVEPDVGSNRXXXXXXXS 4166
             Y+         GLYN AWAQAVQ      N+PLN      VE E D  S R       +
Sbjct: 70   LYRYQMVGGHVSGLYNLAWAQAVQSKPGKSNKPLNELFADVVE-ELDESSKRSSPSSSAA 128

Query: 4165 GANESKEDTNVISXXXXXXXXXXXXXELEEGEIDLDSEEPTEKLAVGGGEEGVLNDS--- 3995
              N + +D                         D + ++  EK+ +    + +++D+   
Sbjct: 129  SVNSNNKDG------------------------DEEKKKVVEKVVIDDNGDEMMDDNNRN 164

Query: 3994 ----VIVKEEGLNHSPSVKEGPLDDSVGVREDVILND--SASVSEWEIDSKEKELVERVN 3833
                V+ KEEG      ++EG +D  +   E     D  + ++   E++S EK   +++N
Sbjct: 165  KIVDVVEKEEG-----ELEEGEIDLDMEPGEKANNGDVLNMNIDGLEVESGEKGFEKKMN 219

Query: 3832 SIRETLDNANFTENAQKSFEGACSRLQNSLESLKEIISEGASVSSKDDLIQQSYTAIQSV 3653
            SIR+ L++          F  AC+             S G S SS               
Sbjct: 220  SIRDALESVTI------EFVLACTD------------SSGVSFSS--------------- 246

Query: 3652 NSVFCTSNHSQKEQNKAIFSRLLTFVMSQDHPLFSPEEMKEINGILSSMDSPAVFSSFDF 3473
                     S+KE+   I     T V  +D+         ++NG  S  D  A       
Sbjct: 247  --------FSEKEKEPLI----STVVNKKDN---------DVNGKSSGHDMSA------- 278

Query: 3472 SNKEEEKPITCGVNHNVSDAFAENASHEVLNSSKNNFLLDSSSVESFLNKNAPDVLSEAL 3293
                        VN   +D+F  N ++  +   K       + V SF             
Sbjct: 279  ------------VNKLPTDSFVNNKANLSIEGPK-------TGVSSF------------- 306

Query: 3292 KPALYDLRGRGVLAPLLDLHKDHDADNLPSPTRGTTPCLPVFKLSAIGHGVAKPEVAPAR 3113
                   + R  L PLLDLHKDHDAD+LPSPTR +   LP ++            V   +
Sbjct: 307  -------KSRAALLPLLDLHKDHDADSLPSPTRESALPLPAYR------------VLTPK 347

Query: 3112 VAHERENSIMHPYETEALKAVSTYQQKFGRSSFFSQDRLPSPTPSEECDDADGDTGGEVS 2933
            +  +  NS MHPYET+ALKAVS+YQQKF +SSF   DRLPSPTPSEE  + DGDTGGEVS
Sbjct: 348  MVLDTGNSRMHPYETDALKAVSSYQQKFSKSSFALTDRLPSPTPSEESGNGDGDTGGEVS 407

Query: 2932 SSLTVGNVKNVNLPILARPVVSSAPHMDVSSARTSGSMIFG-----------SNP--TLK 2792
            SSL+V + +  N      P+ S   +  +S  R  GS + G           S P  T+K
Sbjct: 408  SSLSVSSFRPAN------PLTSGQSNASISLPRMDGSSLPGVISIKSAVRASSAPSLTVK 461

Query: 2791 ASSKSRDPRLRLANSDVGAMDINQGPLPVVHNAPKVDPIGAVISSRKHKSVEEPL-DGPA 2615
            AS+KSRDPRLR  NSD  A+D N   +PVV N  KV+PIG  ++ ++ K V++P+ DG +
Sbjct: 462  ASAKSRDPRLRFVNSDSNALDQNHRAVPVV-NTLKVEPIGGTMNKKRQKIVDDPIPDGHS 520

Query: 2614 LKRQRNGSTNYD-VRDAKTVPESGGWLENDAGTVGPQFMNQSHLIENTENYLKKMETVVS 2438
            LKRQ+N   N   VRD KT+  SGGWLE D   VGPQ MN++ L++N E+  ++ +    
Sbjct: 521  LKRQKNALENSGVVRDVKTMVGSGGWLE-DTDMVGPQTMNKNQLVDNAESDPRRKDGGGV 579

Query: 2437 FSSTAPGKPIVTSGNEQVPLTGTST-IGD---------------LRNYTPNPIMVQYLLK 2306
             +S++    +  SG EQ+P+TGTS  IG                L+N   NP M+  +LK
Sbjct: 580  CTSSSCISSVNISGTEQIPVTGTSVPIGGELVPVKGSTAAIPDLLKNIAVNPTMLINILK 639

Query: 2305 L---QRLESEALKKPFLPPSSTVLTPISNSINSIVGAVPFGNVAPINPTGLGQKPAGILQ 2135
            +   QRL  EA +KP  P  ST   P+++  NS++G VP    A    +G+  +PAG +Q
Sbjct: 640  MGQQQRLALEAQQKPVDPAKSTTY-PLNS--NSMLGTVPVVGAA---HSGILPRPAGTVQ 693

Query: 2134 V-PQIVTVEDPGKVRMKPRDPRRILHNNTYQKSASLGTEQFKFNGAPTLSTLGSRGNSND 1958
            V PQ+ T +D GK+RMKPRDPRR+LHNN  Q++ S+G+E  K N         ++ N N 
Sbjct: 694  VSPQLGTADDLGKIRMKPRDPRRVLHNNALQRNGSMGSEHLKTNLTSIPINQETKDNQNL 753

Query: 1957 NKFVDQSDRNSIASQP-SLPDIARQFTKNLENIANILSXXXXXXXXXPVAPQILPSQPLQ 1781
             K   Q ++  +  Q  +LPDI+  FTKNL+NIA+I+S          V PQ   SQP++
Sbjct: 754  QKQEGQVEKKPVPLQSLALPDISMPFTKNLKNIADIVSVSHASTSQPLV-PQNPASQPMR 812

Query: 1780 TNLDRVDAKSMNSDSVERPTETCLTPDEGAAGTAKP--QNMWGDVELLFEGYDDQQKAAI 1607
            T +   D + +   S          P   AA  A P  QN WGDVE LFEGY+DQQKAAI
Sbjct: 813  TTISSSD-QFLGIGS---------APGAAAAAAAGPRTQNAWGDVEHLFEGYNDQQKAAI 862

Query: 1606 HTERTRRLEEQKKMFAARKXXXXXXXXXXXLNSAKFIEVEPMHDEILRKKEEQDREKPQR 1427
              ER RR+EEQKK+F+ARK           LNSAKF+EV+P+HDEILRKKEEQDREK  R
Sbjct: 863  QRERARRIEEQKKLFSARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAHR 922

Query: 1426 HLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1247
            HLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDP GVLF GRV
Sbjct: 923  HLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGVLFNGRV 982

Query: 1246 ISK--XXXXXXXXXXVPKSKDMEGVLGMDSAVVIIDDSIRVWPHNKHNLIVVERYMYFPC 1073
            IS+            +PKSKD+EGVLGM+S VVI+DDS+RVWPHNK NLIVVERY+YFPC
Sbjct: 983  ISRGDDGEPFDGDERIPKSKDLEGVLGMESGVVIMDDSVRVWPHNKLNLIVVERYIYFPC 1042

Query: 1072 SRRQFGLFGPSLLELDRDERPEDGTLASCLSVIERIHQIFFSQRSLDDVDVRNILACEQR 893
            SRRQFGL GPSLLE+D DERPEDGTLA  L+VIERIHQ FF+  SLD+ DVRNILA EQR
Sbjct: 1043 SRRQFGLPGPSLLEIDHDERPEDGTLACSLAVIERIHQNFFTHPSLDEADVRNILASEQR 1102

Query: 892  KILSGCRILFSRVFPVGEAKPHLHPLWQTAEQFGAVCTTQIDEHVTHVVAISLGTDKVNW 713
            KIL+GCRI+FSRVFPVGEA PHLHPLWQTAEQFGAVCT QIDE VTHVVA SLGTDKVNW
Sbjct: 1103 KILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNW 1162

Query: 712  ALHTGRYVVHPGWLEASALLYRRANEQDFAIK 617
            AL TGR+VV+PGW+EASALLYRRANEQDFAIK
Sbjct: 1163 ALSTGRFVVYPGWVEASALLYRRANEQDFAIK 1194


>ref|XP_007225412.1| hypothetical protein PRUPE_ppa000589mg [Prunus persica]
            gi|462422348|gb|EMJ26611.1| hypothetical protein
            PRUPE_ppa000589mg [Prunus persica]
          Length = 1085

 Score =  961 bits (2485), Expect = 0.0
 Identities = 598/1139 (52%), Positives = 717/1139 (62%), Gaps = 32/1139 (2%)
 Frame = -2

Query: 3937 DDSVGVREDVILNDSASVSEWEI-DSKEKELVERVNSIRETLDNANFTENAQKSFEGACS 3761
            D+S  V EDV         E EI DS   E +   + +++    A       KS  G  S
Sbjct: 4    DESAKVVEDV--------EEGEISDSTSVEEISEEDFVKQEKQEAKVPPMESKSNGGDAS 55

Query: 3760 RLQNSLESLKEIISEGASVSSKDDLIQQSYTAIQSVNSVFCTSNHSQKEQNKAIFSRLLT 3581
            R+     ++++I +               Y   +   S       +Q  QNK +      
Sbjct: 56   RVW----TMRDIYN---------------YPGFRGYGSGLANLAWAQAVQNKPLNE---L 93

Query: 3580 FVMSQDHPLFSPEEMKEINGILSSMDSPAVFSSFDFSNKEEEKPITCGVNHNVSDAFAEN 3401
            FVM  D    S E++K  +  LS   S    S  D SN +E   +    + +  D   E 
Sbjct: 94   FVMDVD----SEEKLKRSSSSLSVNSS----SREDRSNAKEMDKVVIVDSGDEMDVEKEE 145

Query: 3400 ASHEV----LNSS---KNNFLLDSSSVESFLNKNAPDVLSEALKPALYDLRGRGVLAPLL 3242
               E     L+S      N     +  +S +  N P +LSE  +P +  L+GRGVL PLL
Sbjct: 146  GELEEGEIDLDSEPADNENAAAGEAKDDSAVVHNNPIMLSEVPRPGVSSLKGRGVLLPLL 205

Query: 3241 DLHKDHDADNLPSPTRGTTPCLPVFKLSAIGHGVAKP--EVAPARVAHERENSIMHPYET 3068
            DLHKDHDAD+LPSPTR T  C PV     +  G+ K   + A ARVA   E+S +H YET
Sbjct: 206  DLHKDHDADSLPSPTRETPSCFPVQNTLVVADGMVKSASDTATARVALNAEDSRLHSYET 265

Query: 3067 EALKAVSTYQQKFGRSSFFSQDRLPSPTPSEECDDADGDTGGEVSSSLTVGNVKNVNLPI 2888
            EALKAVS+YQQKF RSSF   +RLPSPTPSE+  + D DTGGEVSSS    N++    PI
Sbjct: 266  EALKAVSSYQQKFNRSSFLMSERLPSPTPSEDGGNGDDDTGGEVSSSFA-SNLRTSCPPI 324

Query: 2887 LARPVVSSAPHMDVSSARTSGSMIFGS--------NPTLKASSKSRDPRLRLANSDVGAM 2732
              R +VS +P + V S    G     S        + T+KAS+KSRDPRLR ANSD+GA+
Sbjct: 325  SGRQIVSPSP-IPVGSPSMQGRATAKSAAPPNSEPSMTIKASAKSRDPRLRFANSDMGAL 383

Query: 2731 DINQGPLPVVHNAPKVDPIGAVISSRKHKSVEEP-LDGPALKRQRNGSTNYD-VRDAKTV 2558
            ++NQ P  VVH+APKVD +   +SSRK K +EE   DGPALKRQRN   N   V DAKT 
Sbjct: 384  NLNQQPSTVVHSAPKVDSV-ITLSSRKQKPLEESRFDGPALKRQRNALENSGIVGDAKTA 442

Query: 2557 PESGGWLENDAGTVGPQFMNQSHLIENTENYLKKMETVVSFSSTAPGKPI-VTSGNEQVP 2381
              SGGWLE D G VGP   +++  +EN E   + +  V+S  ST         S NE V 
Sbjct: 443  SGSGGWLE-DIGGVGPHLNSKNQTVENAETDPRNVVKVLSSPSTVDCNTNGPNSANEHVS 501

Query: 2380 LTGTSTIGD---LRNYTPNPIMVQYLLKL---QRLESEALKKPFLPPSSTVLTPISNSIN 2219
            L G S       L++   NP M+  LLK+   QR+ SEA +K   PP  T+  P S+S  
Sbjct: 502  LMGASMASLPELLKDIAVNPTMLLNLLKMGQQQRVASEAHQKSADPPK-TMTHPTSSS-- 558

Query: 2218 SIVGAVPFGNVAPINPTGLGQKPAGILQVP-QIVTVEDPGKVRMKPRDPRRILHNNTYQK 2042
            SI+ +   GNV P   +G+ Q PAG L V  Q   +++ GKVRMKPRDPRR LH N  QK
Sbjct: 559  SILVSAALGNV-PSKTSGILQTPAGTLPVSSQKALMDESGKVRMKPRDPRRALHGNALQK 617

Query: 2041 SASLGTEQFKFNGAPTLSTL-GSRGNSNDNKFVDQSDRNSIASQP-SLPDIARQFTKNLE 1868
            S SLG EQF+ N  P LS + G++ N N      Q+D+  + SQ    PDI RQFTKNL+
Sbjct: 618  SGSLGQEQFR-NIIPPLSAIQGNKDNLNG-----QADKKLVTSQSLDAPDITRQFTKNLK 671

Query: 1867 NIANILSXXXXXXXXXPVAPQILPSQPLQTNLDRVDAKSMNSDSVERPTETCLTPDEGAA 1688
            NIA+I+S          +A Q + SQ +    +R+D K       +RP E+    +  AA
Sbjct: 672  NIADIMSVSNVSTSPA-IASQSVSSQLVPIKPERIDLKPEE----QRP-ESISASEAAAA 725

Query: 1687 GTAKPQNMWGDVELLFEGYDDQQKAAIHTERTRRLEEQKKMFAARKXXXXXXXXXXXLNS 1508
            G ++   MWGDVE LFEGYDDQQKAAI  ERTRR+EEQKKMFAA K           LNS
Sbjct: 726  GPSRSPVMWGDVEHLFEGYDDQQKAAIQRERTRRIEEQKKMFAAHKLCLVLDLDHTLLNS 785

Query: 1507 AKFIEVEPMHDEILRKKEEQDREKPQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHL 1328
            AKF+EV+P+HDEILRKKEEQDREKPQRHLFRF HMGMWTKLRPGIWNFLEKAS+L+ELHL
Sbjct: 786  AKFVEVDPVHDEILRKKEEQDREKPQRHLFRFHHMGMWTKLRPGIWNFLEKASQLFELHL 845

Query: 1327 YTMGNKLYATEMAKVLDPKGVLFAGRVISKXXXXXXXXXXV--PKSKDMEGVLGMDSAVV 1154
            YTMGNKLYATEMAKVLDP G LFAGRVIS+             PKSKD+EGVLGM+SAVV
Sbjct: 846  YTMGNKLYATEMAKVLDPTGALFAGRVISRGDDGDPEDGDERIPKSKDLEGVLGMESAVV 905

Query: 1153 IIDDSIRVWPHNKHNLIVVERYMYFPCSRRQFGLFGPSLLELDRDERPEDGTLASCLSVI 974
            IIDDS+RVWPHNK NLIVVERY YFPCSRRQFGL GPSLLE+D DER EDGTLAS L+VI
Sbjct: 906  IIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERQEDGTLASSLAVI 965

Query: 973  ERIHQIFFSQRSLDDVDVRNILACEQRKILSGCRILFSRVFPVGEAKPHLHPLWQTAEQF 794
            E+IHQ+FFS  SLD+ DVRNILA EQRKIL+GCRI+FSRVFPVGE KPHLHPLWQTAEQF
Sbjct: 966  EKIHQLFFSHSSLDEADVRNILASEQRKILAGCRIVFSRVFPVGEVKPHLHPLWQTAEQF 1025

Query: 793  GAVCTTQIDEHVTHVVAISLGTDKVNWALHTGRYVVHPGWLEASALLYRRANEQDFAIK 617
            GAVCT QID+ VTHVVA SLGTDKVNWAL +G+YVVHPGW+EASALLYRRANEQDFAIK
Sbjct: 1026 GAVCTNQIDDQVTHVVANSLGTDKVNWALSSGKYVVHPGWVEASALLYRRANEQDFAIK 1084



 Score = 89.4 bits (220), Expect = 2e-14
 Identities = 69/182 (37%), Positives = 93/182 (51%), Gaps = 5/182 (2%)
 Frame = -2

Query: 4516 SSILIEDVEEGEISDTASIEEISEDDFKKQDVRILKEQEVRVLKESSKPQ-VEPSRGWES 4340
            S+ ++EDVEEGEISD+ S+EEISE+DF KQ+     +QE +V    SK    + SR W  
Sbjct: 6    SAKVVEDVEEGEISDSTSVEEISEEDFVKQE-----KQEAKVPPMESKSNGGDASRVWTM 60

Query: 4339 QEVFDAYKRDTYEFRRGLYNFAWAQAVQNRPLNNPIVFAVEVEPDV----GSNRXXXXXX 4172
            +++++      Y    GL N AWAQAVQN+PLN   V  V+ E  +     S        
Sbjct: 61   RDIYNYPGFRGY--GSGLANLAWAQAVQNKPLNELFVMDVDSEEKLKRSSSSLSVNSSSR 118

Query: 4171 XSGANESKEDTNVISXXXXXXXXXXXXXELEEGEIDLDSEEPTEKLAVGGGEEGVLNDSV 3992
               +N  + D  VI              ELEEGEIDLDSE    + A  G  +   +DS 
Sbjct: 119  EDRSNAKEMDKVVIVDSGDEMDVEKEEGELEEGEIDLDSEPADNENAAAGEAK---DDSA 175

Query: 3991 IV 3986
            +V
Sbjct: 176  VV 177


Top