BLASTX nr result

ID: Akebia25_contig00012555 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00012555
         (4118 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal doma...   894   0.0  
ref|XP_007043830.1| RNA polymerase II C-terminal domain phosphat...   887   0.0  
emb|CBI35661.3| unnamed protein product [Vitis vinifera]              868   0.0  
gb|EXB81217.1| RNA polymerase II C-terminal domain phosphatase-l...   846   0.0  
ref|XP_002304648.2| hypothetical protein POPTR_0003s16280g [Popu...   845   0.0  
ref|XP_006438860.1| hypothetical protein CICLE_v10030535mg [Citr...   844   0.0  
ref|XP_002304714.2| hypothetical protein POPTR_0003s16280g [Popu...   828   0.0  
ref|XP_004310239.1| PREDICTED: RNA polymerase II C-terminal doma...   818   0.0  
ref|XP_002297869.2| CTD phosphatase-like protein 3 [Populus tric...   807   0.0  
ref|XP_004140651.1| PREDICTED: RNA polymerase II C-terminal doma...   806   0.0  
ref|XP_004157633.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymera...   805   0.0  
ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative ...   799   0.0  
ref|XP_007225412.1| hypothetical protein PRUPE_ppa000589mg [Prun...   790   0.0  
ref|XP_004492029.1| PREDICTED: RNA polymerase II C-terminal doma...   786   0.0  
ref|XP_004492028.1| PREDICTED: RNA polymerase II C-terminal doma...   786   0.0  
ref|XP_006603006.1| PREDICTED: RNA polymerase II C-terminal doma...   784   0.0  
ref|XP_003530482.2| PREDICTED: RNA polymerase II C-terminal doma...   784   0.0  
ref|XP_007139315.1| hypothetical protein PHAVU_008G019000g [Phas...   780   0.0  
ref|XP_006438858.1| hypothetical protein CICLE_v10030535mg [Citr...   780   0.0  
ref|XP_006341905.1| PREDICTED: RNA polymerase II C-terminal doma...   780   0.0  

>ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Vitis vinifera]
          Length = 1238

 Score =  894 bits (2309), Expect = 0.0
 Identities = 547/1100 (49%), Positives = 687/1100 (62%), Gaps = 36/1100 (3%)
 Frame = -2

Query: 3709 VEEEGIWVVSSDRNLQLREFEM--KIKSIREALDTATVKDPKESYNEVCSRFQISLESLQ 3536
            V++EG  +  ++  + L+E E+  ++KSI+E L++ TV + ++S++ VCSR Q +L SLQ
Sbjct: 161  VKDEGGVLDVNEPEIDLKERELVERVKSIQEDLESVTVIEAEKSFSGVCSRLQNTLGSLQ 220

Query: 3535 LMRLD-----NCVPVVDALIQQIFMGIQAVNSVFYSMNPEQQEQDKDILLRLLTYVKDQD 3371
             +  +     + VP  DAL QQ+   I+A+N VF SMN  Q+E +KD+  RLL+ V+  D
Sbjct: 221  KVFGEKVVGESSVPTKDALAQQLINAIRALNHVFCSMNSNQKELNKDVFSRLLSCVECGD 280

Query: 3370 FALFASKQKKEIEAMIRCVESQSISSSV-IMDKKKERGVINGMTXXXXXXXXXNPSGHSL 3194
              +F+ +  KE+E M+  +++ +  SS    DK  +  V +GM          + SG + 
Sbjct: 281  SPIFSIQHIKEVEVMMSFLDTPAAQSSAEASDKVNDVQVTDGMNRNILDSSVES-SGRAF 339

Query: 3193 NSTKKFHLEPVSVR-YSDQNDSNMEFKTLKSKGYGGVGSL-NLPSDHDGDR----TLDTL 3032
             S KK  L+ +SV  Y+  N   ++     S+G    G L +L  DHD D     T    
Sbjct: 340  ASAKKLSLDSISVESYNQNNPDALKPGLSSSRGRFIFGPLLDLHKDHDEDSLPSPTGKAP 399

Query: 3031 QLFPIQNQSDSATCTVSDKTEDAAVLPYETDALRAVSTYQQRFN-SSFLLSNRLPSPTPX 2855
            Q FP+ N+S+  T  V+ +T+D+ + PYETDAL+AVSTYQQ+F  +SFL  ++LPSPTP 
Sbjct: 400  QCFPV-NKSELVTAKVAHETQDSIMHPYETDALKAVSTYQQKFGLTSFLPIDKLPSPTPS 458

Query: 2854 XXXXXXXXXXXXXXXXXSTASSIKTENPSVPVQPFTYATPIVDCSSEQESIPAKTGQL-- 2681
                             ST S+  T N      P   + P +D S  Q     +   L  
Sbjct: 459  EESGDTYGDISGEVSSSSTISAPITANAPALGHPIVSSAPQMDSSIVQGPTVGRNTSLVS 518

Query: 2680 -GSCLNPILKAMPKSKDPRR--------ALDLKQQSLTVDYNVPKREPIVGTMCSRKHKI 2528
             G  L+  + A  KS+DPR         +LDL ++ L    N PK +P+   + SRK K 
Sbjct: 519  SGPHLDSSVVASAKSRDPRLRLASSDAGSLDLNERPLPAVSNSPKVDPLGEIVSSRKQKS 578

Query: 2527 VKESVLDGHNLKRQRNGLTSSEVPVSRNEQMASRNDRWFGSGCSALPQHNNNKTCLAENM 2348
             +E +LDG   KRQRNGLTS      R+ Q    +  W     + +PQ  N    L EN 
Sbjct: 579  AEEPLLDGPVTKRQRNGLTSPATV--RDAQTVVASGGWLEDSNTVIPQMMNRNQ-LIENT 635

Query: 2347 RTDFRKLENGKFCSGKRQD---TNSGGNQQLPVFGTNTFVSLPSL-KDISMNPTMLVNLI 2180
             TD +KLE+    +G   D       GN+ LPV  T+T  SL SL KDI++NP + +N+ 
Sbjct: 636  GTDPKKLESKVTVTGIGCDKPYVTVNGNEHLPVVATSTTASLQSLLKDIAVNPAVWMNIF 695

Query: 2179 --MEQQKLVADTQQKPAQSMTLPSISSVAVGTVPLAN----DPSSNSAKIVQKSQIPAQI 2018
              +EQQK        PA++  LP  S+  +G VP A+     PS+   K     Q+P Q 
Sbjct: 696  NKVEQQK-----SGDPAKNTVLPPTSNSILGVVPPASVAPLKPSALGQKPAGALQVP-QT 749

Query: 2017 VPMDLREDLGKTRMKPRDPRRILHNNTFKQNGCIGTEQLKTKGVPSSIIQASKDSLTVRQ 1838
             PMD   + GK RMKPRDPRRILH N+F+++G  G+EQ KT        Q  +D    + 
Sbjct: 750  GPMD---ESGKVRMKPRDPRRILHANSFQRSGSSGSEQFKTNA------QKQEDQTETKS 800

Query: 1837 HAEXXXXXXXXXXXXXXXAKQNNIPGTLFDSQSATSFTTVAQTVSSQPIPRKIDNADVRV 1658
                               K  NI   +  SQ+++   T  Q +SSQ +    D  DV+ 
Sbjct: 801  VPSHSVNPPDISQQFTKNLK--NIADLMSASQASSMTPTFPQILSSQSVQVNTDRMDVKA 858

Query: 1657 VATDSNNQESWTNSTPKEGAAVSFQSQNAWEDVEHLFEGFDDQQKATIRRERARRKEEQN 1478
              +DS +Q +   S P E AA   QS+N W DVEHLF+G+DDQQKA I+RERARR EEQ 
Sbjct: 859  TVSDSGDQLTANGSKP-ESAAGPPQSKNTWGDVEHLFDGYDDQQKAAIQRERARRIEEQK 917

Query: 1477 KLFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRFPHMGMWT 1298
            K+F+ARK           LNSAKFVEVDPVH+ ILRKKEEQDR+K QRHLFRFPHMGMWT
Sbjct: 918  KMFSARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKSQRHLFRFPHMGMWT 977

Query: 1297 KLRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRGDDEDPLDG 1118
            KLRPGIWNFLEKASKLYELHLYTMGNKLYA EMAKVLDP G LF  RVIS+GDD D LDG
Sbjct: 978  KLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKGDDGDVLDG 1037

Query: 1117 DERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRHFGLSGPSL 938
            DERVPK KDL+GVLGMES+VVIIDDS RVWPHNKLNLI +ERY YFPCSRR FGL GPSL
Sbjct: 1038 DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSL 1097

Query: 937  LEIGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLAGCRIVFSR 758
            LEI HDER +DG LASSLAVIERIH +FF +++L++VDVRNILA EQ+ +LAGCRIVFSR
Sbjct: 1098 LEIDHDERPEDGTLASSLAVIERIHQSFFSNRALDEVDVRNILASEQRKILAGCRIVFSR 1157

Query: 757  VFPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGRFVVHPC 578
            VFPVGEA PHLHPLWQTA+ FGAVCTNQIDE VTHVVANSLGTDKVNWALSTGRFVVHP 
Sbjct: 1158 VFPVGEANPHLHPLWQTAESFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPG 1217

Query: 577  WVEASALLYRRANEFDFAIK 518
            WVEASALLYRRANE DFAIK
Sbjct: 1218 WVEASALLYRRANEQDFAIK 1237


>ref|XP_007043830.1| RNA polymerase II C-terminal domain phosphatase-like 3, putative
            [Theobroma cacao] gi|508707765|gb|EOX99661.1| RNA
            polymerase II C-terminal domain phosphatase-like 3,
            putative [Theobroma cacao]
          Length = 1290

 Score =  887 bits (2293), Expect = 0.0
 Identities = 544/1105 (49%), Positives = 686/1105 (62%), Gaps = 59/1105 (5%)
 Frame = -2

Query: 3655 EFEMKIKSIREALDTATVKDPKESYNEVCSRFQISLESLQLMRLDNCVPVVDALIQQIFM 3476
            E E +   IR  L+  TV + ++S+  VCSR   +LESL+ + L+  VP  DALIQ  F 
Sbjct: 212  ELEKRANLIRGVLEGVTVIEAEKSFEGVCSRLHNALESLRALILECSVPAKDALIQLAF- 270

Query: 3475 GIQAVNSVFYSMNPEQQEQDKDILLRLLTYVKDQDFALFASKQKKEIEAMIRCVESQSIS 3296
               A+NS F ++N   +EQ+  IL RLL+ VK  D +LF   + KEI+ M+  + S + +
Sbjct: 271  --GAINSAFVALNCNSKEQNVAILSRLLSIVKGHDPSLFPPDKMKEIDVMLISLNSPARA 328

Query: 3295 SSVIMDKKKERGVINGMTXXXXXXXXXNPSGHSLNSTKKFHLEPVSVRYSDQNDSNMEFK 3116
                +D +K+  V++G+          N   H L  T K    P S ++   N  N   +
Sbjct: 329  ----IDTEKDMKVVDGVNKKDPDALPENIC-HDLTVTNKL---PSSAKFVINNKPNALTE 380

Query: 3115 TLK-------SKGYGGVGSLNLPSDHDGDR----TLDTLQLFPIQNQSDSATCTV----- 2984
            TLK       ++G   +  L+L  DHD D     T +T    P+     S    V     
Sbjct: 381  TLKPGVPNFRNRGIS-LPLLDLHKDHDADSLPSPTRETTPCLPVNKPLTSGDVMVKSGFM 439

Query: 2983 ----SDKTEDAAVLPYETDALRAVSTYQQRFNS-SFLLSNRLPSPTPXXXXXXXXXXXXX 2819
                S   E   + PYETDAL+A STYQQ+F   SF  S+RLPSPTP             
Sbjct: 440  TGKGSHDAEGDKLHPYETDALKAFSTYQQKFGQGSFFSSDRLPSPTPSEESGDEGGDNGG 499

Query: 2818 XXXXXSTASSIKTENPSVPV--QPFTYATPIVDCSSE--QESIPAKTGQ-LGSCLNPILK 2654
                  ++SSI    P++P+   P   + P+VD +S   Q  I  +    + S  N + K
Sbjct: 500  EVS---SSSSIGNFKPNLPILGHPIVSSAPLVDSASSSLQGQITTRNATPMSSVSNIVSK 556

Query: 2653 AMPKSKDPR--------RALDLKQQSLTVDYNVPKREPIVGTMCSRKHKIVKESVLDGHN 2498
            ++ KS+DPR         ALDL ++ L   +N  K  P+ G M SRK K V+E +LD   
Sbjct: 557  SLAKSRDPRLWFANSNASALDLNERLL---HNASKVAPVGGIMDSRKKKSVEEPILDSPA 613

Query: 2497 LKRQRNGLTSSEVPVSRNEQMASRNDRWFGSGCSALPQHNNNKTCLAENMRTDFRKLENG 2318
            LKRQRN L +  + V+R+ Q  S    W      A+     N+   AEN+ ++ RK++NG
Sbjct: 614  LKRQRNELEN--LGVARDVQTVSGIGGWL-EDTDAIGSQITNRNQTAENLESNSRKMDNG 670

Query: 2317 ----KFCSGKRQDTNSGGNQQLPVFGTNTFVSLPSL-KDISMNPTMLVNLIM--EQQKLV 2159
                   SGK   T  G N+Q+PV  T+T  SLP+L KDI++NPTML+N++   +QQ+L 
Sbjct: 671  VTSSSTLSGKTNIT-VGTNEQVPVTSTST-PSLPALLKDIAVNPTMLINILKMGQQQRLG 728

Query: 2158 ADTQQK-----------PAQSMTLPSISSVAVGTVPLANDPSSNSAKIVQKSQIPAQIVP 2012
            A+ QQK           P+ +  L  +SS  V   P  N+  S S+ I  K   PA  + 
Sbjct: 729  AEAQQKSPDPVKSTFHQPSSNSLLGVVSSTNVIPSPSVNNVPSISSGISSK---PAGNLQ 785

Query: 2011 MDLREDLGKTRMKPRDPRRILHNNTFKQNGCIGTEQLKTKGVPSSIIQASKDSLTVRQ-- 1838
            +   ++ GK RMKPRDPRR+LH N+ +++G +G +QLKT G  +S  Q SKD+L  ++  
Sbjct: 786  VPSPDESGKIRMKPRDPRRVLHGNSLQRSGSMGLDQLKTNGALTSSTQGSKDNLNAQKLD 845

Query: 1837 ---HAEXXXXXXXXXXXXXXXAKQN--NIPGTLFDSQSATSFTTVAQTVSSQPIPRKIDN 1673
                ++                  N  NI   +  SQ+ TS   V+  +  QP+  K D+
Sbjct: 846  SQTESKPMQSQLVPPPDITQQFTNNLKNIADIMSVSQALTSLPPVSHNLVPQPVLIKSDS 905

Query: 1672 ADVRVVATDSNNQESWTNSTPKEGAAVSFQSQNAWEDVEHLFEGFDDQQKATIRRERARR 1493
             D++ + ++S +Q++     P+ GA    +SQNAW DVEHLFE +DDQQKA I+RERARR
Sbjct: 906  MDMKALVSNSEDQQTGAGLAPEAGATGP-RSQNAWGDVEHLFERYDDQQKAAIQRERARR 964

Query: 1492 KEEQNKLFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRFPH 1313
             EEQ K+F+ARK           LNSAKF+EVDPVHE ILRKKEEQDR+KP+RHLFRF H
Sbjct: 965  IEEQKKMFSARKLCLVLDLDHTLLNSAKFIEVDPVHEEILRKKEEQDREKPERHLFRFHH 1024

Query: 1312 MGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRGDDE 1133
            MGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYA EMAKVLDP G LF  RVISRGDD 
Sbjct: 1025 MGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1084

Query: 1132 DPLDGDERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRHFGL 953
            DP DGDERVP+ KDL+GVLGMES+VVIIDDS RVWPHNKLNLI +ERY YFPCSRR FGL
Sbjct: 1085 DPFDGDERVPRSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 1144

Query: 952  SGPSLLEIGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLAGCR 773
             GPSLLEI HDER +DG LASSLAVIERIH +FF HQ+L+DVDVRNILA EQ+ +LAGCR
Sbjct: 1145 LGPSLLEIDHDERPEDGTLASSLAVIERIHQDFFSHQNLDDVDVRNILASEQRKILAGCR 1204

Query: 772  IVFSRVFPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGRF 593
            IVFSRVFPVGEA PHLHPLWQTA+QFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTG+F
Sbjct: 1205 IVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGKF 1264

Query: 592  VVHPCWVEASALLYRRANEFDFAIK 518
            VVHP WVEASALLYRRANE DFAIK
Sbjct: 1265 VVHPGWVEASALLYRRANEVDFAIK 1289


>emb|CBI35661.3| unnamed protein product [Vitis vinifera]
          Length = 1184

 Score =  868 bits (2244), Expect = 0.0
 Identities = 534/1092 (48%), Positives = 670/1092 (61%), Gaps = 28/1092 (2%)
 Frame = -2

Query: 3709 VEEEGIWVVSSDRNLQLREFEM--KIKSIREALDTATVKDPKESYNEVCSRFQISLESLQ 3536
            V++EG  +  ++  + L+E E+  ++KSI+E L++ TV + ++S++ VCSR Q +L SLQ
Sbjct: 172  VKDEGGVLDVNEPEIDLKERELVERVKSIQEDLESVTVIEAEKSFSGVCSRLQNTLGSLQ 231

Query: 3535 LMRLD-----NCVPVVDALIQQIFMGIQAVNSVFYSMNPEQQEQDKDILLRLLTYVKDQD 3371
             +  +     + VP  DAL QQ+   I+A+N VF SMN  Q+E +KD+  RLL+ V+  D
Sbjct: 232  KVFGEKVVGESSVPTKDALAQQLINAIRALNHVFCSMNSNQKELNKDVFSRLLSCVECGD 291

Query: 3370 FALFASKQKKEIEAMIRCVESQSISSSV-IMDKKKERGVINGMTXXXXXXXXXNPSGHSL 3194
              +F+ +  KE+E M+  +++ +  SS    DK  +  V +GM          + SG + 
Sbjct: 292  SPIFSIQHIKEVEVMMSFLDTPAAQSSAEASDKVNDVQVTDGMNRNILDSSVES-SGRAF 350

Query: 3193 NSTKKFHLEPVSVRYSDQNDSNMEFKTLKSKGYGGVGSLNLPSDHDGDR----TLDTLQL 3026
             S KKF                  F  L          L+L  DHD D     T    Q 
Sbjct: 351  ASAKKFR-------------GRFIFGPL----------LDLHKDHDEDSLPSPTGKAPQC 387

Query: 3025 FPIQNQSDSATCTVSDKTEDAAVLPYETDALRAVSTYQQRFN-SSFLLSNRLPSPTPXXX 2849
            FP+ N+S+  T  V+ +T+D+ + PYETDAL+AVSTYQQ+F  +SFL  ++LPSPTP   
Sbjct: 388  FPV-NKSELVTAKVAHETQDSIMHPYETDALKAVSTYQQKFGLTSFLPIDKLPSPTPSEE 446

Query: 2848 XXXXXXXXXXXXXXXSTASSIKTENPSVPVQPFTYATPIVDCSSEQESIPAKTGQLGSCL 2669
                           ST S+  T N      P   + P +D   +   +P  TG + S  
Sbjct: 447  SGDTYGDISGEVSSSSTISAPITANAPALGHPIVSSAPQMDIV-QGLVVPRNTGAVNSRF 505

Query: 2668 NPILKAMPKSKDPRR--------ALDLKQQSLTVDYNVPKREPIVGTMCSRKHKIVKESV 2513
            N IL+A  KS+DPR         +LDL ++ L    N PK +P+   + SRK K  +E +
Sbjct: 506  NSILRASAKSRDPRLRLASSDAGSLDLNERPLPAVSNSPKVDPLGEIVSSRKQKSAEEPL 565

Query: 2512 LDGHNLKRQRNGLTSSEVPVSRNEQMASRNDRWFGSGCSALPQHNNNKTCLAENMRTDFR 2333
            LDG   KRQRNGLTS   P ++ E   +      G GC        +K  +  N      
Sbjct: 566  LDGPVTKRQRNGLTS---PATKLESKVTVT----GIGC--------DKPYVTVN------ 604

Query: 2332 KLENGKFCSGKRQDTNSGGNQQLPVFGTNTFVSLPSL-KDISMNPTMLVNLI--MEQQKL 2162
                              GN+ LPV  T+T  SL SL KDI++NP + +N+   +EQQK 
Sbjct: 605  ------------------GNEHLPVVATSTTASLQSLLKDIAVNPAVWMNIFNKVEQQK- 645

Query: 2161 VADTQQKPAQSMTLPSISSVAVGTVPLAN----DPSSNSAKIVQKSQIPAQIVPMDLRED 1994
                   PA++  LP  S+  +G VP A+     PS+   K     Q+P Q  PM+ +++
Sbjct: 646  ----SGDPAKNTVLPPTSNSILGVVPPASVAPLKPSALGQKPAGALQVP-QTGPMNPQDE 700

Query: 1993 LGKTRMKPRDPRRILHNNTFKQNGCIGTEQLKTKGVPSSIIQASKDSLTVRQHAEXXXXX 1814
             GK RMKPRDPRRILH N+F+++G  G+EQ KT        Q  +D    +         
Sbjct: 701  SGKVRMKPRDPRRILHANSFQRSGSSGSEQFKTNA------QKQEDQTETKSVPSHSVNP 754

Query: 1813 XXXXXXXXXXAKQNNIPGTLFDSQSATSFTTVAQTVSSQPIPRKIDNADVRVVATDSNNQ 1634
                       K  NI   +  SQ+++   T  Q +SSQ +    D  DV+   +DS +Q
Sbjct: 755  PDISQQFTKNLK--NIADLMSASQASSMTPTFPQILSSQSVQVNTDRMDVKATVSDSGDQ 812

Query: 1633 ESWTNSTPKEGAAVSFQSQNAWEDVEHLFEGFDDQQKATIRRERARRKEEQNKLFAARKX 1454
             +   S P E AA   QS+N W DVEHLF+G+DDQQKA I+RERARR EEQ K+F+ARK 
Sbjct: 813  LTANGSKP-ESAAGPPQSKNTWGDVEHLFDGYDDQQKAAIQRERARRIEEQKKMFSARKL 871

Query: 1453 XXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRFPHMGMWTKLRPGIWN 1274
                      LNSAKFVEVDPVH+ ILRKKEEQDR+K QRHLFRFPHMGMWTKLRPGIWN
Sbjct: 872  CLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKSQRHLFRFPHMGMWTKLRPGIWN 931

Query: 1273 FLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRGDDEDPLDGDERVPKIK 1094
            FLEKASKLYELHLYTMGNKLYA EMAKVLDP G LF  RVIS+GDD D LDGDERVPK K
Sbjct: 932  FLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKGDDGDVLDGDERVPKSK 991

Query: 1093 DLDGVLGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRHFGLSGPSLLEIGHDER 914
            DL+GVLGMES+VVIIDDS RVWPHNKLNLI +ERY YFPCSRR FGL GPSLLEI HDER
Sbjct: 992  DLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDER 1051

Query: 913  LDDGALASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLAGCRIVFSRVFPVGEAK 734
             +DG LASSLAVIERIH +FF +++L++VDVRNILA EQ+ +LAGCRIVFSRVFPVGEA 
Sbjct: 1052 PEDGTLASSLAVIERIHQSFFSNRALDEVDVRNILASEQRKILAGCRIVFSRVFPVGEAN 1111

Query: 733  PHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGRFVVHPCWVEASALL 554
            PHLHPLWQTA+ FGAVCTNQIDE VTHVVANSLGTDKVNWALSTGRFVVHP WVEASALL
Sbjct: 1112 PHLHPLWQTAESFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALL 1171

Query: 553  YRRANEFDFAIK 518
            YRRANE DFAIK
Sbjct: 1172 YRRANEQDFAIK 1183


>gb|EXB81217.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Morus
            notabilis]
          Length = 1301

 Score =  846 bits (2186), Expect = 0.0
 Identities = 527/1089 (48%), Positives = 661/1089 (60%), Gaps = 62/1089 (5%)
 Frame = -2

Query: 3655 EFEMKIKSIREALDTATVKDPKESYNEVCSRFQISLESLQ--LMRLDNCVPVVDALIQQI 3482
            E E ++  I E L +  V + ++S+ EVCSR Q +LESL+  L   +   P  D +IQ  
Sbjct: 199  ELEKRVDLIWETLGSVNVVNAEKSFEEVCSRLQRTLESLRGVLSEKEFSFPTKDVVIQMS 258

Query: 3481 FMGIQAVNSVFYSMNPEQQEQDKDILLRLLTYVKDQDFALFASKQKKEIEAMIRCVESQS 3302
               IQ VNSVF SM+  Q+EQ K+ L RL   VK+    LF+ +Q KEIE MI  +   +
Sbjct: 259  ITAIQVVNSVFCSMSVNQKEQKKETLSRLFCSVKNCGTPLFSPEQTKEIELMISSLNPLN 318

Query: 3301 I-SSSVIMDKKKERGVINGMTXXXXXXXXXNPSGHSLNSTKK----------FHLEPVSV 3155
            +  SS   DK+KE  +I  +          N    S+  T             H  P+++
Sbjct: 319  VLPSSGASDKEKETQIIERLHEMDSNLTNANAENASIERTSVKLPQDCVASVVHSNPITL 378

Query: 3154 RYSDQNDSNMEFKTLKSKGYGGV-GSLNLPSDHDGDR----TLDTLQLFPIQN------- 3011
                     +   TL  KG G +   L+L  DHD D     T +    FP+         
Sbjct: 379  ------PELLRPGTLAFKGRGLLLPLLDLHKDHDADSLPSPTREAPSCFPVYKPLGVADG 432

Query: 3010 --QSDSATCTVSDKTEDAAVLPYETDALRAVSTYQQRFN-SSFLLSNRLPSPTPXXXXXX 2840
              +  S T  V+   E++ +  YETDAL+AVSTYQQ+F   SFL+S+RLPSPTP      
Sbjct: 433  IIKPVSTTAKVAPGAEESRLHRYETDALKAVSTYQQKFGRGSFLMSDRLPSPTPSEECDE 492

Query: 2839 XXXXXXXXXXXXSTASSIKTENPSVPVQPFTYATPIVDCSSE--QESIPAKTGQ-LGSCL 2669
                         T+ +++T  P++P+   +  T  V  SS   Q  I AK    +GS  
Sbjct: 493  EDDINQEVSSSL-TSGNLRT--PAIPILRPSVVTSSVPVSSPTMQGPIAAKNAAPVGSGS 549

Query: 2668 NPILKAMPKSKDPRR--------ALDLKQQSLTVDYNVPKREPIVGTMCSRKHKIVKESV 2513
            N  +KA  +S+DPR         ALDL Q+ LT  +N PK EP   T  SRK +IV+E  
Sbjct: 550  NSTMKASARSRDPRLRFANSDAGALDLNQRPLTAVHNGPKVEPGDPTS-SRKQRIVEEPN 608

Query: 2512 LDGHNLKRQRNGLTSSEVPVSRNEQMASRNDRWFGSGCSALPQHNNNKTCLAENMRTDFR 2333
            LDG  LKRQR+   S+++ V    + AS    W     +  PQ  N K  L EN   D R
Sbjct: 609  LDGPALKRQRHAFVSAKIDV----KTASGVGGWLEDNGTTGPQIMN-KNQLVENAEADPR 663

Query: 2332 K---LENGKFCSGKRQDTNSG---GNQQLPVFGTNTFVSLPS-LKDISMNPTM---LVNL 2183
            K   L NG          N+G   G +Q+PV GT+T  +LP+ LKDI++NPT+   ++N 
Sbjct: 664  KSIHLVNGPIM-------NNGPNIGKEQVPVTGTSTPDALPAILKDIAVNPTIFMDILNK 716

Query: 2182 IMEQQKLVADTQQKP--AQSMTLPSISSVAVGTVPLANDPSSNSAKIVQKSQIP----AQ 2021
            + +QQ L AD QQK   +++ T P  ++  +G  PL N   S ++ I+Q   +     +Q
Sbjct: 717  LGQQQLLAADAQQKSDSSKNTTHPPGTNSILGAAPLVNVAPSKASGILQTPAVSLPTTSQ 776

Query: 2020 IVPMDLREDLGKTRMKPRDPRRILHNNTFKQNGCIGTEQLKTKGVPSSIIQASKDSLT-- 1847
            +    ++++LGK RMKPRDPRR+LH N  +++  +G EQ K      S    +KD+L   
Sbjct: 777  VATASMQDELGKIRMKPRDPRRVLHGNMLQKSWSLGHEQFKPIVSSVSCTPGNKDNLNGP 836

Query: 1846 VRQHAEXXXXXXXXXXXXXXXAKQ-----NNIPGTLFDSQSATSFTTVAQTVSSQPIPRK 1682
            V++                  A+Q      NI   +  SQ++TS  TV+Q +SSQP+P K
Sbjct: 837  VQEGQADKKQVPSQLVVQPDIARQFTKNLRNIADLMSVSQASTSPATVSQNLSSQPLPVK 896

Query: 1681 IDNADVRVVATDSNNQESWTNSTPKEGAAVSFQSQNAWEDVEHLFEGFDDQQKATIRRER 1502
             D  DV+ V  +S +Q S TNSTP+   AV  ++ NAW DVEHLFEG+DD+QKA I+RER
Sbjct: 897  PDRGDVKAVVPNSEDQHSGTNSTPETTLAVPSRTPNAWGDVEHLFEGYDDEQKAAIQRER 956

Query: 1501 ARRKEEQNKLFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFR 1322
            ARR EEQ K+F A K           LNSAKFVEVD VH+ ILRKKEEQDR+KPQRHLFR
Sbjct: 957  ARRLEEQKKMFDAHKLCLVLDLDHTLLNSAKFVEVDSVHDEILRKKEEQDREKPQRHLFR 1016

Query: 1321 FPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRG 1142
            FPHMGMWTKLRPG+WNFLEKASKLYELHLYTMGNKLYA EMAKVLDP G+LF  RVISRG
Sbjct: 1017 FPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPMGTLFSGRVISRG 1076

Query: 1141 DDEDPLDGDERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRH 962
            DD DP DGDERVPK KDL+GVLGMESSVVIIDDS RVWPHNKLNLI +ERY YFPCSRR 
Sbjct: 1077 DDGDPFDGDERVPKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1136

Query: 961  FGLSGPSLLEIGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLA 782
            FGL GPSLLEI HDER + G LASSLAVIE+IH NFF H SL++VDVRNILA EQ+ +LA
Sbjct: 1137 FGLPGPSLLEIDHDERPEQGTLASSLAVIEKIHQNFFSHHSLDEVDVRNILASEQRKILA 1196

Query: 781  GCRIVFSRVFPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALST 602
            GCRIVFSRVFPV E  PHLHPLWQTA+QFGAVCT QID+ VTHVVANS GTDKVNWAL+ 
Sbjct: 1197 GCRIVFSRVFPVSEVNPHLHPLWQTAEQFGAVCTTQIDDQVTHVVANSPGTDKVNWALAN 1256

Query: 601  GRFVVHPCW 575
            G+F VHP W
Sbjct: 1257 GKFAVHPGW 1265


>ref|XP_002304648.2| hypothetical protein POPTR_0003s16280g [Populus trichocarpa]
            gi|550343308|gb|EEE79627.2| hypothetical protein
            POPTR_0003s16280g [Populus trichocarpa]
          Length = 1247

 Score =  845 bits (2183), Expect = 0.0
 Identities = 531/1106 (48%), Positives = 660/1106 (59%), Gaps = 62/1106 (5%)
 Frame = -2

Query: 3649 EMKIKSIREALDTATVKDPKESYNEVCSRFQISLESL-QLMRL-DNCVPVVDALIQQIFM 3476
            E ++KSIRE L++ +V    +S+  VC +   +LESL +L+R+ +N  P  D+L++ +F 
Sbjct: 176  EKRVKSIREDLESVSVIKDDKSFEAVCLKLHNALESLKELVRVNENGFPSKDSLVRLLFT 235

Query: 3475 GIQAVNSVFYSMNPEQQEQDKDILLRLLTYVKDQDFALFASKQKKEIEAMIRCVESQSIS 3296
             I AVNS F SMN + +EQ+K + +R L+ V   D + F+ +  KE+             
Sbjct: 236  AIGAVNSFFSSMNQKLKEQNKGVFMRFLSLVNSHDPSFFSPEHTKEVCDFCNF------- 288

Query: 3295 SSVIMDKKKERGVINGMTXXXXXXXXXNPSGHSLNSTKKFHLEPVSVRYSDQNDSNMEFK 3116
               I+    +   +N +           P+         F +EP            +   
Sbjct: 289  DFRIVSLCYDLTTMNRLPSAAESFVHNKPN---------FSIEPPKPGVPSFKSRGVLLP 339

Query: 3115 TLKSKGYGGVGSLNLPSDHDGDRTLDTLQLFPIQN---------QSDSATCTVSDKTEDA 2963
             L  K +    SL  P       T +T   FP+Q           S      V+  TE+ 
Sbjct: 340  LLDLKKFHDEDSLPSP-------TRETAPSFPVQRLLPIGDGMISSGLPVPKVASITEEP 392

Query: 2962 AVLPYETDALRAVSTYQQRFNSSFLLSNRLPSPTPXXXXXXXXXXXXXXXXXXSTASSIK 2783
             V PYETDAL+AVS+YQ++FN +   +N LPSPTP                  ST +  +
Sbjct: 393  RVHPYETDALKAVSSYQKKFNLNSFFTNELPSPTPSEESGNGDGDTAGEVSSSSTVN-YR 451

Query: 2782 TENPSV--------------PVQPFTYATPIVDCSSEQESIPAK-TGQLGSCLNPILKAM 2648
            T NP V              P  P     P ++ SS +  IP + +  + S  +  +KA 
Sbjct: 452  TVNPPVSDRKSASPSPSPPPPPPPPPPPPPHLNNSSIRVVIPTRNSAPVSSGTSSTVKAS 511

Query: 2647 PKSKDPR--------RALDLKQQSLTVDYNVPKREPIVGTMCSRKHKIVKESVLDGHNLK 2492
             KS+DPR         ALD  Q++L +  N P+ EP      SRK KI +E VLDG +LK
Sbjct: 512  AKSRDPRLRYVNTDASALDQNQRTLLMVNNPPRAEPSGAIAGSRKQKI-EEDVLDGTSLK 570

Query: 2491 RQRNGLTSSEVPVSRNEQMASRNDRWFGSGCSALPQHNNNKTCLAENMRTDFRKLENGKF 2312
            RQRN   +    V R+ +  +    W      A PQ   NK   AEN     +++ NG  
Sbjct: 571  RQRNSFDN--FGVVRDIRSMTGTGGWLEDTDMAEPQ-TVNKNQWAENAEPG-QRINNGVV 626

Query: 2311 CSGKRQDTNS---GGNQQLPVFGTNTFV-------------SLPSL-KDISMNPTMLVNL 2183
            C       +S    GN Q+PV G NT               SLP L KDI++NPTML+N+
Sbjct: 627  CPSTGSVMSSVSCSGNVQVPVMGINTIAGSEQAPVTSTTTASLPDLLKDITVNPTMLINI 686

Query: 2182 IM--EQQKLVADTQQK---PAQSMTLPSISSVAVGTVPLANDPSSNSAKIVQKS----QI 2030
            +   +QQ+L  D QQK   PA+S + P  S+  +G +P  N  SS  + I+ +S    Q 
Sbjct: 687  LKMGQQQRLALDGQQKLADPAKSTSHPPSSNTVLGAIPEVNAVSSLPSGILPRSAGKAQG 746

Query: 2029 PAQIVPMDLREDLGKTRMKPRDPRRILHNNTFKQNGCIGTEQLKTKGVPSSIIQASKDSL 1850
            P+QI   D   + GK RMKPRDPRR+LHNN  ++ G +G+EQ KT  + S+  Q +KD+ 
Sbjct: 747  PSQIATTD---ESGKIRMKPRDPRRVLHNNALQRAGSLGSEQFKTTTLTSTT-QGTKDNQ 802

Query: 1849 TVRQHAEXXXXXXXXXXXXXXXAKQN--NIPGTLFDSQSATSFTTVAQTVSSQPIPRKID 1676
             +++                    ++  NI   +  SQ+ T+   V+Q V+SQP+  K D
Sbjct: 803  NLQKQEGLAELKPVVPPDISSPFTKSLKNIADIVSVSQTCTTPPFVSQNVASQPVQIKSD 862

Query: 1675 NADVRVVATDSNNQESWTNSTPKEGAAVSFQSQNAWEDVEHLFEGFDDQQKATIRRERAR 1496
              D +   ++S+ Q+    S+P+  AA S  SQN WEDVEHLFEG+DDQQKA I+RERAR
Sbjct: 863  RVDGKTGISNSD-QKMGPASSPEVVAASSL-SQNTWEDVEHLFEGYDDQQKAAIQRERAR 920

Query: 1495 RKEEQNKLFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRFP 1316
            R EEQ KLFAARK           LNSAKFVEVDPVH+ ILRKKEEQDR+KP RHLFRFP
Sbjct: 921  RIEEQKKLFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPYRHLFRFP 980

Query: 1315 HMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRGDD 1136
            HMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYA EMAKVLDP G LF  RV+SRGDD
Sbjct: 981  HMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVVSRGDD 1040

Query: 1135 EDPLDGDERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRHFG 956
             D LDGDERVPK KDL+GVLGMES VVIIDDS RVWPHNKLNLI +ERYIYFPCSRR FG
Sbjct: 1041 GDLLDGDERVPKSKDLEGVLGMESGVVIIDDSLRVWPHNKLNLIVVERYIYFPCSRRQFG 1100

Query: 955  LSGPSLLEIGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLAGC 776
            L GPSLLEI HDER +DG LA SLAVIERIH NFF H SL++ DVRNILA EQ+ +LAGC
Sbjct: 1101 LPGPSLLEIDHDERPEDGTLACSLAVIERIHQNFFTHHSLDEADVRNILASEQRKILAGC 1160

Query: 775  RIVFSRVFPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGR 596
            RIVFSRVFPVGE  PHLHPLWQ+A+QFGAVCTNQIDE VTHVVANSLGTDKVNWALSTGR
Sbjct: 1161 RIVFSRVFPVGEVNPHLHPLWQSAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGR 1220

Query: 595  FVVHPCWVEASALLYRRANEFDFAIK 518
            FVVHP WVEASALLYRRANE DFAIK
Sbjct: 1221 FVVHPGWVEASALLYRRANEQDFAIK 1246


>ref|XP_006438860.1| hypothetical protein CICLE_v10030535mg [Citrus clementina]
            gi|568858958|ref|XP_006483010.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 3-like
            [Citrus sinensis] gi|557541056|gb|ESR52100.1|
            hypothetical protein CICLE_v10030535mg [Citrus
            clementina]
          Length = 1234

 Score =  844 bits (2180), Expect = 0.0
 Identities = 541/1107 (48%), Positives = 678/1107 (61%), Gaps = 58/1107 (5%)
 Frame = -2

Query: 3664 QLREFEMK---IKSIREALDTATVKDPKESYNEVCSRFQISLESLQLMRLDNCVPVVDAL 3494
            Q++E EMK   ++SIREAL++    D   S+  VCS+ + +LESL+ +  +N VP  DAL
Sbjct: 166  QVKE-EMKLINVESIREALESVLRGDI--SFEGVCSKLEFTLESLRELVNENNVPTKDAL 222

Query: 3493 IQQIFMGIQAVNSVFYSMNPEQQEQDKDILLRLLTYVKDQDFALFASKQKKEIEAMIRCV 3314
            IQ  F  +Q+V+SVF SMN   +EQ+K+IL RLL+ +K  +  LF+S Q KE+EAM+  +
Sbjct: 223  IQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSVIKSHEPPLFSSNQIKEMEAMLSSL 282

Query: 3313 ESQSISSSVIMDKKKERGVINGMTXXXXXXXXXNPSGHSLNSTKKFHLEPVSVRYSDQND 3134
             +++       DK+K+   ++G+          N + + LN  +K  L PV        D
Sbjct: 283  VTRA------NDKEKDMLAMHGVNGKDSNIVTEN-AVNDLNFKEKVPL-PV--------D 326

Query: 3133 SNMEFKTLKSK-----GYGGVGSLNLPSD----HDGDR----TLDTLQLFPIQN------ 3011
            S M+ K L++      GY   G L    D    HD D     T +T    P+Q       
Sbjct: 327  SLMQNKPLEASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGD 386

Query: 3010 ---QSDSATCTVSDKTEDAAVLPYETDALRAVSTYQQRFN-SSFLLSNRLPSPTPXXXXX 2843
               +S +A   +S   E      YETDALRA S+YQQ+F  +SF +++ LPSPTP     
Sbjct: 387  GVVKSWAAAAKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESG 446

Query: 2842 XXXXXXXXXXXXXSTASSIKTEN------PSVPVQPFTYATPIVDCSSEQ------ESIP 2699
                         +     K  N        V  QP   + P+ D SS Q       S P
Sbjct: 447  DGDGDTGGEISSATAVDQPKPVNMPTLGQQPVSSQPMDISQPM-DISSVQALTTANNSAP 505

Query: 2698 AKTGQLGSCL-NPILKAMPKSKDPR------RALDLKQQSLTVDYNVPKREPIVGTMCSR 2540
            A +G       NP++KA  KS+DPR       AL+L  Q   + +N PK EP+   M SR
Sbjct: 506  ASSGYNPVVKPNPVVKAPIKSRDPRLRFASSNALNLNHQPAPILHNAPKVEPVGRVMSSR 565

Query: 2539 KHKIVKESVLDGHNLKRQRNGLTSSEVPVSRNEQMASRNDRWFGSGCSALPQHNNNKTCL 2360
            K K V+E VLDG  LKRQRNG  +S V   R+E+    +  W        PQ  N +  L
Sbjct: 566  KQKTVEEPVLDGPALKRQRNGFENSGVV--RDEKNIYGSGGWLEDTDMFEPQIMN-RNLL 622

Query: 2359 AENMRTDFRKLENGK---FCSGKRQDTNSGGNQQLPVFGTNTFVSLPSL-KDISMNPTML 2192
             ++  ++ RKL+NG      SG      SG N+  P    +T VSLP+L KDI++NPTML
Sbjct: 623  VDSAESNSRKLDNGATSPITSGTPNVVVSG-NEPAPATTPSTTVSLPALLKDIAVNPTML 681

Query: 2191 VNLIM--EQQKLVADTQQKPAQSMTLPSISSVAVGTVPLANDPSSNSAKIVQKSQIPAQI 2018
            +N++   +QQKL AD QQK   S ++ ++      ++P  +   S  + I+ K       
Sbjct: 682  LNILKMGQQQKLAADAQQKSNDS-SMNTMHPPIPSSIPPVSVTCSIPSGILSK------- 733

Query: 2017 VPMDLREDLGKTRMKPRDPRRILHNNTFKQNGCIGTEQLKTKGVPSSIIQASKDSLTVRQ 1838
             PMD   +LGK RMKPRDPRR+LH N  +++G +G E  KT G  +   Q SK++L  ++
Sbjct: 734  -PMD---ELGKVRMKPRDPRRVLHGNALQRSGSLGPE-FKTDGPSAPCTQGSKENLNFQK 788

Query: 1837 H-----AEXXXXXXXXXXXXXXXAKQN--NIPGTLFDSQSATSFTTVAQTVSSQPIPRKI 1679
                  A+                 +N  +I   +  SQ  TS   V+Q    QP   K 
Sbjct: 789  QLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIK- 847

Query: 1678 DNADVRVVATDSNNQESWTNSTPKEGAAVSFQSQNAWEDVEHLFEGFDDQQKATIRRERA 1499
              AD++ V T+ +++++ T S P+ G  V    Q+AW DVEHLFEG+DDQQKA I++ER 
Sbjct: 848  SGADMKAVVTNHDDKQTGTGSGPEAGP-VGAHPQSAWGDVEHLFEGYDDQQKAAIQKERT 906

Query: 1498 RRKEEQNKLFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRF 1319
            RR EEQ K+F+ARK           LNSAKF EVDPVH+ ILRKKEEQDR+KP RHLFRF
Sbjct: 907  RRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF 966

Query: 1318 PHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRGD 1139
            PHMGMWTKLRPGIW FLE+ASKL+E+HLYTMGNKLYA EMAKVLDP G LF  RVISRGD
Sbjct: 967  PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026

Query: 1138 DEDPLDGDERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRHF 959
            D DP DGDERVPK KDL+GVLGMES+VVIIDDS RVWPHNKLNLI +ERY YFPCSRR F
Sbjct: 1027 DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQF 1086

Query: 958  GLSGPSLLEIGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLAG 779
            GL GPSLLEI HDER +DG LASSL VIER+H  FF HQSL+DVDVRNILA EQ+ +LAG
Sbjct: 1087 GLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAG 1146

Query: 778  CRIVFSRVFPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTG 599
            CRIVFSRVFPVGEA PHLHPLWQTA+QFGAVCT  ID+ VTHVVANSLGTDKVNWALSTG
Sbjct: 1147 CRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTG 1206

Query: 598  RFVVHPCWVEASALLYRRANEFDFAIK 518
            RFVVHP WVEASALLYRRANE DFAIK
Sbjct: 1207 RFVVHPGWVEASALLYRRANEQDFAIK 1233


>ref|XP_002304714.2| hypothetical protein POPTR_0003s16280g [Populus trichocarpa]
            gi|550343307|gb|EEE79693.2| hypothetical protein
            POPTR_0003s16280g [Populus trichocarpa]
          Length = 1030

 Score =  828 bits (2138), Expect = 0.0
 Identities = 515/1043 (49%), Positives = 635/1043 (60%), Gaps = 68/1043 (6%)
 Frame = -2

Query: 3442 MNPEQQEQDKDILLRLLTYVKDQDFALFASKQKKEIEAMIRCVESQSISSSVIMDKKKER 3263
            MN + +EQ+K + +R L+ V   D + F+ +  KEIE M+  ++S  I SS    +++E 
Sbjct: 1    MNQKLKEQNKGVFMRFLSLVNSHDPSFFSPEHTKEIELMVSSLDSHDILSSSRAGEERET 60

Query: 3262 GVINGMTXXXXXXXXXNPSGHSLNSTKKFHLEPVSVRYSDQNDSNMEFK----TLKSKGY 3095
             V +G             +G+ L +  +      S  ++  N S    K    + KS+G 
Sbjct: 61   QV-SGKVNERDNDSLSKTAGYDLTTMNRLPSAAESFVHNKPNFSIEPPKPGVPSFKSRGV 119

Query: 3094 GGVGSLNLPSDHDGDR----TLDTLQLFPIQN---------QSDSATCTVSDKTEDAAVL 2954
              +  L+L   HD D     T +T   FP+Q           S      V+  TE+  V 
Sbjct: 120  L-LPLLDLKKFHDEDSLPSPTRETAPSFPVQRLLPIGDGMISSGLPVPKVASITEEPRVH 178

Query: 2953 PYETDALRAVSTYQQRFNSSFLLSNRLPSPTPXXXXXXXXXXXXXXXXXXSTASSIKTEN 2774
            PYETDAL+AVS+YQ++FN +   +N LPSPTP                  ST +  +T N
Sbjct: 179  PYETDALKAVSSYQKKFNLNSFFTNELPSPTPSEESGNGDGDTAGEVSSSSTVN-YRTVN 237

Query: 2773 PSV--------------PVQPFTYATPIVDCSSEQESIPAK-TGQLGSCLNPILKAMPKS 2639
            P V              P  P     P ++ SS +  IP + +  + S  +  +KA  KS
Sbjct: 238  PPVSDRKSASPSPSPPPPPPPPPPPPPHLNNSSIRVVIPTRNSAPVSSGTSSTVKASAKS 297

Query: 2638 KDPR--------RALDLKQQSLTVDYNVPKREPIVGTMCSRKHKIVKESVLDGHNLKRQR 2483
            +DPR         ALD  Q++L +  N P+ EP      SRK KI +E VLDG +LKRQR
Sbjct: 298  RDPRLRYVNTDASALDQNQRTLLMVNNPPRAEPSGAIAGSRKQKI-EEDVLDGTSLKRQR 356

Query: 2482 NGLTSSEVPVSRNEQMASRNDRWFGSGCSALPQHNNNKTCLAENMRTDFRKLENGKFCSG 2303
            N   +    V R+ +  +    W      A PQ   NK   AEN     +++ NG  C  
Sbjct: 357  NSFDN--FGVVRDIRSMTGTGGWLEDTDMAEPQ-TVNKNQWAENAEPG-QRINNGVVCPS 412

Query: 2302 KRQDTNS---GGNQQLPVFGTNTFV-------------SLPSL-KDISMNPTMLVNLIM- 2177
                 +S    GN Q+PV G NT               SLP L KDI++NPTML+N++  
Sbjct: 413  TGSVMSSVSCSGNVQVPVMGINTIAGSEQAPVTSTTTASLPDLLKDITVNPTMLINILKM 472

Query: 2176 -EQQKLVADTQQK---PAQSMTLPSISSVAVGTVPLANDPSSNSAKIVQKS----QIPAQ 2021
             +QQ+L  D QQK   PA+S + P  S+  +G +P  N  SS  + I+ +S    Q P+Q
Sbjct: 473  GQQQRLALDGQQKLADPAKSTSHPPSSNTVLGAIPEVNAVSSLPSGILPRSAGKAQGPSQ 532

Query: 2020 IVPMDLREDLGKTRMKPRDPRRILHNNTFKQNGCIGTEQLKTKGVPSSIIQASKDSLTVR 1841
            I   D   + GK RMKPRDPRR+LHNN  ++ G +G+EQ KT  + S+  Q +KD+  ++
Sbjct: 533  IATTD---ESGKIRMKPRDPRRVLHNNALQRAGSLGSEQFKTTTLTSTT-QGTKDNQNLQ 588

Query: 1840 QHAEXXXXXXXXXXXXXXXAKQN--NIPGTLFDSQSATSFTTVAQTVSSQPIPRKIDNAD 1667
            +                    ++  NI   +  SQ+ T+   V+Q V+SQP+  K D  D
Sbjct: 589  KQEGLAELKPVVPPDISSPFTKSLKNIADIVSVSQTCTTPPFVSQNVASQPVQIKSDRVD 648

Query: 1666 VRVVATDSNNQESWTNSTPKEGAAVSFQSQNAWEDVEHLFEGFDDQQKATIRRERARRKE 1487
             +   ++S+ Q+    S+P+  AA S  SQN WEDVEHLFEG+DDQQKA I+RERARR E
Sbjct: 649  GKTGISNSD-QKMGPASSPEVVAASSL-SQNTWEDVEHLFEGYDDQQKAAIQRERARRIE 706

Query: 1486 EQNKLFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRFPHMG 1307
            EQ KLFAARK           LNSAKFVEVDPVH+ ILRKKEEQDR+KP RHLFRFPHMG
Sbjct: 707  EQKKLFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPYRHLFRFPHMG 766

Query: 1306 MWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRGDDEDP 1127
            MWTKLRPGIWNFLEKASKLYELHLYTMGNKLYA EMAKVLDP G LF  RV+SRGDD D 
Sbjct: 767  MWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVVSRGDDGDL 826

Query: 1126 LDGDERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRHFGLSG 947
            LDGDERVPK KDL+GVLGMES VVIIDDS RVWPHNKLNLI +ERYIYFPCSRR FGL G
Sbjct: 827  LDGDERVPKSKDLEGVLGMESGVVIIDDSLRVWPHNKLNLIVVERYIYFPCSRRQFGLPG 886

Query: 946  PSLLEIGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLAGCRIV 767
            PSLLEI HDER +DG LA SLAVIERIH NFF H SL++ DVRNILA EQ+ +LAGCRIV
Sbjct: 887  PSLLEIDHDERPEDGTLACSLAVIERIHQNFFTHHSLDEADVRNILASEQRKILAGCRIV 946

Query: 766  FSRVFPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGRFVV 587
            FSRVFPVGE  PHLHPLWQ+A+QFGAVCTNQIDE VTHVVANSLGTDKVNWALSTGRFVV
Sbjct: 947  FSRVFPVGEVNPHLHPLWQSAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVV 1006

Query: 586  HPCWVEASALLYRRANEFDFAIK 518
            HP WVEASALLYRRANE DFAIK
Sbjct: 1007 HPGWVEASALLYRRANEQDFAIK 1029


>ref|XP_004310239.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Fragaria vesca subsp. vesca]
          Length = 1230

 Score =  818 bits (2113), Expect = 0.0
 Identities = 515/1099 (46%), Positives = 659/1099 (59%), Gaps = 53/1099 (4%)
 Frame = -2

Query: 3652 FEMKIKSIREALDTATVKDPKESYNEVCSRFQISLESLQLMRLDNCVPVVDALIQQIFMG 3473
            +E ++  +REAL++ T+ + ++S+ +VC RF  SLESL+ +  +  V   +AL+QQ+F  
Sbjct: 162  WEKRVNLLREALESLTITEAEKSFGDVCHRFLDSLESLRGVLSEINVSTKEALVQQLFNA 221

Query: 3472 IQAVNSVFYSMNPEQQEQDKDILLRLLTYVKDQDFALFASKQKKEIEAMIRCVESQSISS 3293
            ++A++SVF SM+ +Q+EQ+KD+L R+L+  K  D + F ++Q KEIE M     S S+ S
Sbjct: 222  VRAISSVFRSMSADQKEQNKDVLSRILSSAKS-DPSPFPAEQLKEIEVM-----SSSMDS 275

Query: 3292 SVIMDKKKERGV--INGMTXXXXXXXXXNPSGHSLNSTKKFHLEPVSVRYSDQNDSNMEF 3119
                   KE G+  ING+          N S     +        VSV +S+ N S+ E 
Sbjct: 276  PQTKAGTKENGIQCINGVYKTDSDTSGANASHVFTYAANTGSDTQVSVVHSNPNISS-EV 334

Query: 3118 KTLKSKGYGGVGS----LNLPSDHDGDR----TLDTLQLFPIQN----------QSDSAT 2993
                S  + G G     L+L  DHD D     T +    FP Q           +S   T
Sbjct: 335  PRSGSSSFKGRGLMLPLLDLHMDHDEDSLPSPTREPPACFPAQKPVVVENGMVKKSGWET 394

Query: 2992 CTVSDKTEDAAVLPYETDALRAVSTYQQRFNSSFLLSNRLPSPTPXXXXXXXXXXXXXXX 2813
               +   E + +  YET+AL+AVS+YQQ+F+ +  L++ LPSPTP               
Sbjct: 395  ARAALDVEGSKMHVYETEALKAVSSYQQKFSRNSFLTSELPSPTPSEEEGDNGDDAAVGE 454

Query: 2812 XXXSTASS-IKTENPSVP---VQPFTYATPIVDCSSEQESIPAKTGQ---LGSCLNPILK 2654
               S+AS+ ++T  P V    V     AT +   S     I AKT     LGS  N   K
Sbjct: 455  VSSSSASNNVRTPQPPVSGRQVVSSVPATTLPGSSGMHGLITAKTASPVSLGS--NMPNK 512

Query: 2653 AMPKSKDPRR--------ALDLKQQSLTVDYNVPKREPIVGTMCSRKHKIVKESVLDGHN 2498
            +  KS+DPR         AL L QQS    +N PK + ++ T+ SRKHK  ++S  DG  
Sbjct: 513  SSAKSRDPRLRFANSDAGALTLNQQSSIQVHNAPKVDSVI-TLSSRKHKSPEDSNFDGPE 571

Query: 2497 LKRQRNGLTSSEVPVSRNEQMASRNDRWFGSGCSALPQHNNNKTCLAENMRTDFRKLENG 2318
             KRQR     +   V    + +  N  W   G S  P H  N+    E    D RK+ N 
Sbjct: 572  SKRQRG----ANSVVGWGAKTSFGNGVWLEDGSSVGP-HLINRNQTVEKKEADPRKMVNV 626

Query: 2317 KFCSGKRQDTNSG---GNQQLPVFGTNTFVSLPSL-KDISMNPTMLVNLIMEQQKLVADT 2150
                G  +  ++G    N+++P+   +  VSLP++ KDI++NPTMLVN++      +A+ 
Sbjct: 627  SSSPGTVEGNSNGQNTANEKVPLVAPS-LVSLPAIFKDIAVNPTMLVNILK-----LAEA 680

Query: 2149 QQKPA-----QSMTLPSISSVAVGTVPLANDPSSNSAKIVQKSQIPAQIVPMDLREDLGK 1985
            QQ  A     +S+T P  SS   GT  L NDPS  S  ++  + I +Q  P D   + GK
Sbjct: 681  QQNAAAPARKESLTYPPSSSSIPGTAALVNDPSKTSGALLTPT-ICSQKTPTD---EAGK 736

Query: 1984 TRMKPRDPRRILHNNTFKQNGCIGTEQLKTKGVPSSIIQASKDSLTVRQH---------A 1832
             RMK RDPRR+LH N  + +G +G EQ +    P S  QA+ D +  ++           
Sbjct: 737  IRMKLRDPRRLLHGNALQNSGSVGHEQSRNIVPPLSSSQANNDDMNGKKQDSQADNNSVT 796

Query: 1831 EXXXXXXXXXXXXXXXAKQNNIPGTLFDSQSATSFTTVAQTVSSQPIPRKIDNADVRVVA 1652
                                NI   +  SQ +TS  T +Q +S++ I    DN D++   
Sbjct: 797  SQSGALGAPDIASQFTKNLKNIADIISVSQVSTSPATPSQNLSTELISINPDNVDLK--- 853

Query: 1651 TDSNNQESWTNSTPKEGAAVSFQSQNAWEDVEHLFEGFDDQQKATIRRERARRKEEQNKL 1472
              +  Q + + S     AA + +S   W DVEHLFEG+DD+QKA I+RERARR EEQ K+
Sbjct: 854  --AEEQHTGSISASVPTAAGASRSPATWGDVEHLFEGYDDKQKAAIQRERARRIEEQKKM 911

Query: 1471 FAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRFPHMGMWTKL 1292
            FAA K           LNSAKFVEVDPVH+ ILRKKEEQDR++PQRHLFRF HMGMWTKL
Sbjct: 912  FAAHKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDRKEPQRHLFRFQHMGMWTKL 971

Query: 1291 RPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRGDDEDPLDGDE 1112
            RPG+W FLEKAS L+E+HLYTMGNKLYA EMAKVLDPTG+LF  RVISRGDD DP DGDE
Sbjct: 972  RPGVWKFLEKASHLFEMHLYTMGNKLYATEMAKVLDPTGALFAGRVISRGDDGDPYDGDE 1031

Query: 1111 RVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRHFGLSGPSLLE 932
            RVPK KDL+GVLGMES+VVIIDDS RVWPHNKLNLI +ERY YFPCSRR FGL GPSLLE
Sbjct: 1032 RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLE 1091

Query: 931  IGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLAGCRIVFSRVF 752
            I HDER +DG LASSLAVIE+IH  FF H SL++ DVRNILA EQ+ +L GCRIVFSRVF
Sbjct: 1092 IDHDERHEDGTLASSLAVIEKIHQIFFSHPSLDEADVRNILASEQQKILGGCRIVFSRVF 1151

Query: 751  PVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGRFVVHPCWV 572
            PVGE  PHLHPLWQTA+QFGAVCTNQID+ VTHVVANSLGTDKVNWALS+G++VVHP WV
Sbjct: 1152 PVGEVNPHLHPLWQTAEQFGAVCTNQIDDQVTHVVANSLGTDKVNWALSSGKYVVHPGWV 1211

Query: 571  EASALLYRRANEFDFAIKT 515
            EASALLYRRANE DFAIK+
Sbjct: 1212 EASALLYRRANEQDFAIKS 1230


>ref|XP_002297869.2| CTD phosphatase-like protein 3 [Populus trichocarpa]
            gi|550347145|gb|EEE82674.2| CTD phosphatase-like protein
            3 [Populus trichocarpa]
          Length = 1190

 Score =  807 bits (2085), Expect = 0.0
 Identities = 507/1107 (45%), Positives = 649/1107 (58%), Gaps = 50/1107 (4%)
 Frame = -2

Query: 3688 VVSSDRNLQLREFEMKIKSIREALDTATVKDPKESYNEVCSRFQISLESLQLM--RLDNC 3515
            VV     +   + E ++KSIR+ L++ +V + ++S+  VC +    LESL+ +    DN 
Sbjct: 132  VVVQSEGMVSVDVENRVKSIRKDLESVSVIETEKSFEAVCLKLHKVLESLKELVGGNDNS 191

Query: 3514 VPVVDALIQQIFMGIQAVNSVFYSMNPEQQEQDKDILLRLLTYVKDQDFALFASKQKKEI 3335
             P  D L+Q +FM I+ VNSVF SMN + +EQ+K +  R  + +       F+  Q KE+
Sbjct: 192  FPSKDGLVQLLFMAIRVVNSVFCSMNKKLKEQNKGVFSRFFSLLNSHYPPFFSPGQNKEV 251

Query: 3334 EAMIRCVESQSISSSVIMDKKKERGVINGMTXXXXXXXXXNPSGHSLNS-TKKFHLEPVS 3158
               +    + S++ +                           +G+ L + ++K       
Sbjct: 252  ---LNENHNDSLAKT---------------------------AGYDLTTMSEKLPAAETF 281

Query: 3157 VRYSDQNDSNMEFKTLKSKGYGGVGS-------LNLPSDHDGDRTLDTLQL---FPIQN- 3011
            V    QN  N   +  K  G     S       L+L   HD D      Q    FP+Q  
Sbjct: 282  V----QNKPNKSIEAPKPPGVPSFKSRGVLLPLLDLKKYHDEDSLPSPTQETTPFPVQRL 337

Query: 3010 --------QSDSATCTVSDKTEDAAVLPYETDALRAVSTYQQRFNSSFLLSNRLPSPTPX 2855
                     S      V+   E+  + PYETDAL+AVS+YQQ+FN +   +N LPSPTP 
Sbjct: 338  LAIGDGMVSSGLPVPKVTPVAEEPRMHPYETDALKAVSSYQQKFNRNSFFTNELPSPTPS 397

Query: 2854 XXXXXXXXXXXXXXXXXSTASSIKTENPSVPVQ--------PFTYATPIVDCSSEQESIP 2699
                             ST  + +T NP V  Q        P     P  D S+ +  +P
Sbjct: 398  EESGNGDGDTAGEVSSSSTVVNYRTVNPPVSDQKNAPPSPPPLPPPPPHPDSSNIRGVVP 457

Query: 2698 AK-TGQLGSCLNPILKAMPKSKDPRR--------ALDLKQQSLTVDYNVPKREPIVGTMC 2546
             + +  + S  +  +KA  KS+DPR         ALD  Q++L +  N+P+ EP    + 
Sbjct: 458  TRNSAPVSSGPSSTIKASAKSRDPRLRYVNIDACALDHNQRALPMVNNLPRVEPAGAIVG 517

Query: 2545 SRKHKIVKESVLDGHNLKRQRNGLTSSEVPVSRNEQMASRNDRWFGSGCSALPQHNNNKT 2366
            S+KHKI +E VLD  +LKRQRN   +      R+ +  +    W      A PQ   NK 
Sbjct: 518  SKKHKI-EEDVLDDPSLKRQRNSFDN--YGAVRDIESMTGTGGWLEDTDMAEPQ-TVNKN 573

Query: 2365 CLAENMRTDFRKLENGKFCSGKRQDTNSGGNQQLPVFGTNTFVSLPSL-KDISMNPTMLV 2189
              AEN   +     +G   S     +N  G++Q  V  T T  SLP L KDI++NPTML+
Sbjct: 574  QWAENSNVN----GSGNAQSPFMGISNITGSEQAQVTSTAT-TSLPDLLKDIAVNPTMLI 628

Query: 2188 NLIM--EQQKLVADTQQK---PAQSMTLPSISSVAVGTVPLANDPSSNSAKIVQK---SQ 2033
            N++   +QQ+L  D QQ    PA+S + P IS+  +G +P  N  SS  + I  +   + 
Sbjct: 629  NILKMGQQQRLALDGQQTLSDPAKSTSHPPISNTVLGAIPTVNVASSQPSGIFPRPAGTP 688

Query: 2032 IPAQIVPMDLREDLGKTRMKPRDPRRILHNNTFKQNGCIGTEQLKTKGVPSSIIQASKDS 1853
            +P+QI   D   + GK RMKPRDPRR LHNN+ ++ G +G+EQ KT  +  +  Q +KD 
Sbjct: 689  VPSQIATSD---ESGKIRMKPRDPRRFLHNNSLQRAGSMGSEQFKTTTLTPTT-QGTKDD 744

Query: 1852 LTVRQHAEXXXXXXXXXXXXXXXAKQN--NIPGTLFDSQSATSFTTVAQTVSSQPIPRKI 1679
              V++                    ++  NI   L  SQ++T+   ++Q V+SQP+  K 
Sbjct: 745  QNVQKQEGLAELKPTVPPDISFPFTKSLENIADILSVSQASTTPPFISQNVASQPMQTKS 804

Query: 1678 DNADVRVVATDSNNQESWTNSTPKEGAAVSFQSQNAWEDVEHLFEGFDDQQKATIRRERA 1499
            +  D +   + S+ Q++   S+P E  A S  SQN W+DVEHLFEG+DDQQKA I+RERA
Sbjct: 805  ERVDGKTGISISD-QKTGPASSP-EVVAASSHSQNTWKDVEHLFEGYDDQQKAAIQRERA 862

Query: 1498 RRKEEQNKLFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRF 1319
            RR EEQ K+FAARK           LNSAK +    +H+ ILRKKEEQDR+KP RH+FR 
Sbjct: 863  RRLEEQKKMFAARKLCLVLDLDHTLLNSAKAILSSSLHDEILRKKEEQDREKPYRHIFRI 922

Query: 1318 PHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRGD 1139
            PHMGMWTKLRPGIWNFLEKASKL+ELHLYTMGNKLYA EMAKVLDP G LF  RVISRGD
Sbjct: 923  PHMGMWTKLRPGIWNFLEKASKLFELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 982

Query: 1138 DEDPLDGDERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRHF 959
            D DP DGDERVPK KDL+GVLGMES VVIIDDS RVWPHNKLNLI +ERYIYFPCSRR F
Sbjct: 983  DGDPFDGDERVPKSKDLEGVLGMESGVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQF 1042

Query: 958  GLSGPSLLEIGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLAG 779
            GL GPSLLEI HDER +DG LA S AVIE+IH NFF H+SL++ DVRNILA EQ+ +L G
Sbjct: 1043 GLPGPSLLEIDHDERPEDGTLACSFAVIEKIHQNFFTHRSLDEADVRNILASEQRKILGG 1102

Query: 778  CRIVFSRVFPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTG 599
            CRI+FSRVFPVGE  PHLHPLWQ A+QFGAVCTNQIDE VTHVVANSLGTDKVNWALSTG
Sbjct: 1103 CRILFSRVFPVGEVNPHLHPLWQMAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTG 1162

Query: 598  RFVVHPCWVEASALLYRRANEFDFAIK 518
            R VVHP WVEASALLYRRANE DF+IK
Sbjct: 1163 RIVVHPGWVEASALLYRRANEQDFSIK 1189


>ref|XP_004140651.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Cucumis sativus]
          Length = 1249

 Score =  806 bits (2081), Expect = 0.0
 Identities = 498/1087 (45%), Positives = 650/1087 (59%), Gaps = 36/1087 (3%)
 Frame = -2

Query: 3670 NLQLREFEMKIKSIREALDTATVKDPKESYNEVCSRFQISLESLQLMRLDNCVPVVDALI 3491
            +L+ +E +  +K I++ LD  T+   ++S+ EVCS+   S+E+   +     VP  DALI
Sbjct: 188  DLETKELDELLKFIQKTLDGVTIDAAQKSFQEVCSQIHSSIETFVELLQGKVVPRKDALI 247

Query: 3490 QQIFMGIQAVNSVFYSMNPEQQEQDKDILLRLLTYVKDQDFALFASKQKKEIEAMIRCVE 3311
            Q+++  ++ +NSVF SMN  ++E+ K+ L RLL+YVK+ D  LF+ +Q K +E  +   +
Sbjct: 248  QRLYAALRLINSVFCSMNLSEKEEHKEHLSRLLSYVKNCDPPLFSPEQIKSVEVKMPSTD 307

Query: 3310 S-QSISSSVIMDKKKERGVINGMTXXXXXXXXXNPSGHSLNSTKKFHLEPVSVRYSDQND 3134
            S   + S     K+ E  + NG+          + S   L  + K   + +      +N+
Sbjct: 308  SLDHLPSMRGSAKEVEIHIPNGVKDMDFYSAYTSTSS-QLTPSNKLASDSIPFGVKGKNN 366

Query: 3133 SNMEFKTLKS-----KGYGG-VGSLNLPSDHDGDR----TLDTLQLFPIQNQSDSATCTV 2984
             N+  + L+S     KG G  +  L+L  DHD D     T +   +F +Q +S +A   +
Sbjct: 367  LNILSEGLQSGVSSIKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQ-KSGNAPTKM 425

Query: 2983 SDKTEDAAVLPYETDALRAVSTYQQRF-NSSFLLSNRLPSPTPXXXXXXXXXXXXXXXXX 2807
            +   + +   PYETDAL+AVSTYQQ+F  SSF +++RLPSPTP                 
Sbjct: 426  AFPVDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTP-SEEHDGGGDIGGEVSS 484

Query: 2806 XSTASSIKTENPSVPVQPFTYAT-------PIVDCSSEQESI-PAKTGQLGSCLNPILKA 2651
             S   S+K+ N S P Q    A+       P +D SS +  I P       S  NP +K 
Sbjct: 485  SSIIRSLKSSNVSKPGQKSNSASNVSTGLFPNMDSSSTRVLISPLNVAPPSSVSNPTVKP 544

Query: 2650 MPKSKDPR--------RALDLKQQSLTVDYNVPKREPIVGTMCSRKHKIVKESVLDGHNL 2495
            + KS+DPR          +DL  +++    +    E    T+  RK K+  E   DG  +
Sbjct: 545  LAKSRDPRLRIVNSDASGMDLNPRTMASVQSSSILES-AATLHLRKQKMDGEPNTDGPEV 603

Query: 2494 KRQRNGLTSSEVPVSRNEQMASRNDRWFGSGCSALPQ-HNNNKTCLAENMRTDFRKLENG 2318
            KR R G  +  V  S + +  S +  W      A P+  N N+  +AE   T+   + N 
Sbjct: 604  KRLRIGSQNLAVAAS-DVRAVSGSGGWLEDTMPAGPRLFNRNQMEIAEANATEKSNVTN- 661

Query: 2317 KFCSGKRQDTNSG-GNQQLPVFGTNTFVSLPS-LKDISMNPTMLVNLI--MEQQKLVADT 2150
                      NSG GN+  P    +   SLPS LKDI +NPTML+NL+   +QQ+L A+ 
Sbjct: 662  ----------NSGSGNECTPTVNNSNDASLPSLLKDIVVNPTMLLNLLKMSQQQQLAAEL 711

Query: 2149 QQK---PAQSMTLPSISSVAVGTVPLANDPSSNSAKIVQKSQIPAQIVPMDLREDLGKTR 1979
            + K   P ++   P+  +   G+ PL N P + S  + Q +  P+    +  ++DLGK R
Sbjct: 712  KLKSSEPEKNAICPTSLNPCQGSSPLINAPVATSGILQQSAGTPSASPVVGRQDDLGKVR 771

Query: 1978 MKPRDPRRILHNNTFKQNGCIGTEQLKTKGVPSSIIQASKDSLTVRQHAEXXXXXXXXXX 1799
            MKPRDPRR+LH N+ ++ G +G +QLK     +S  + S+D     +             
Sbjct: 772  MKPRDPRRVLHGNSLQKVGSLGNDQLKGVVPTASNTEGSRDIPNGHKQ--------EGQG 823

Query: 1798 XXXXXAKQNNIPGTLFDSQSATSFTTVAQTVSSQPIPRKIDNADVRVVATDSNNQESWTN 1619
                 + Q  +P      Q   +   +A  +S    P    N+  + V + S + +  T 
Sbjct: 824  DSKLASSQTILPD--IGRQFTNNLKNIADIMSVPSPPTSSPNSSSKPVGSSSMDSKPVTT 881

Query: 1618 STPKEGAAVSFQSQNAWEDVEHLFEGFDDQQKATIRRERARRKEEQNKLFAARKXXXXXX 1439
            +      A S +SQ AW D+EHLF+ +DD+QKA I+RERARR EEQ K+FAARK      
Sbjct: 882  AFQAVDMAASSRSQGAWGDLEHLFDSYDDKQKAAIQRERARRIEEQKKMFAARKLCLVLD 941

Query: 1438 XXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRFPHMGMWTKLRPGIWNFLEKA 1259
                 LNSAKFVEVDPVH+ ILRKKEEQDR+K QRHLFRFPHMGMWTKLRPG+WNFLEKA
Sbjct: 942  LDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKA 1001

Query: 1258 SKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRGDDEDPLDGDERVPKIKDLDGV 1079
            S+LYELHLYTMGNKLYA EMAKVLDP G LF  RVISRGDD DPLDGD+RVPK KDL+GV
Sbjct: 1002 SELYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPLDGDDRVPKSKDLEGV 1061

Query: 1078 LGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRHFGLSGPSLLEIGHDERLDDGA 899
            LGMES VVIIDDS RVWPHNK+NLI +ERY YFPCSRR FGL GPSLLEI HDER +DG 
Sbjct: 1062 LGMESGVVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGT 1121

Query: 898  LASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLAGCRIVFSRVFPVGEAKPHLHP 719
            LASSL VI+RIH +FF +  L+ VDVR IL+ EQ+ +LAGCRIVFSRVFPVGEA PHLHP
Sbjct: 1122 LASSLGVIQRIHQSFFSNPELDQVDVRTILSAEQQKILAGCRIVFSRVFPVGEANPHLHP 1181

Query: 718  LWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGRFVVHPCWVEASALLYRRAN 539
            LWQTA+QFGA CTNQIDE VTHVVANSLGTDKVNWALSTGRFVVHP WVEASALLYRRA 
Sbjct: 1182 LWQTAEQFGAQCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRAT 1241

Query: 538  EFDFAIK 518
            E DFAIK
Sbjct: 1242 EQDFAIK 1248


>ref|XP_004157633.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain
            phosphatase-like 3-like [Cucumis sativus]
          Length = 1249

 Score =  805 bits (2079), Expect = 0.0
 Identities = 498/1087 (45%), Positives = 649/1087 (59%), Gaps = 36/1087 (3%)
 Frame = -2

Query: 3670 NLQLREFEMKIKSIREALDTATVKDPKESYNEVCSRFQISLESLQLMRLDNCVPVVDALI 3491
            +L+ +E +  +K I++ LD  T+   ++S+ EVCS+   S+E+   +     VP  DALI
Sbjct: 188  DLETKELDELLKFIQKTLDGVTIDAAQKSFQEVCSQIHSSIETFVELLQGKVVPRKDALI 247

Query: 3490 QQIFMGIQAVNSVFYSMNPEQQEQDKDILLRLLTYVKDQDFALFASKQKKEIEAMIRCVE 3311
            Q+++  ++ +NSVF SMN  ++E+ K+ L RLL+YVK+ D  LF+ +Q K +E  +   +
Sbjct: 248  QRLYAALRLINSVFCSMNLSEKEEHKEHLSRLLSYVKNCDPPLFSPEQIKSVEVKMPSTD 307

Query: 3310 S-QSISSSVIMDKKKERGVINGMTXXXXXXXXXNPSGHSLNSTKKFHLEPVSVRYSDQND 3134
            S   + S     K+ E  + NG+          + S   L  + K   + +      +N+
Sbjct: 308  SLDHLPSMRGSAKEVEIHIPNGVKDMDFYSAYTSTSS-QLTPSNKLASDSIPFGVKGKNN 366

Query: 3133 SNMEFKTLKS-----KGYGG-VGSLNLPSDHDGDR----TLDTLQLFPIQNQSDSATCTV 2984
             N+  + L+S     KG G  +  L+L  DHD D     T +   +F +Q +S +A   +
Sbjct: 367  LNILSEGLQSGVSSIKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQ-KSGNAPTKM 425

Query: 2983 SDKTEDAAVLPYETDALRAVSTYQQRF-NSSFLLSNRLPSPTPXXXXXXXXXXXXXXXXX 2807
            +   + +   PYETDAL+AVSTYQQ+F  SSF +++RLPSPTP                 
Sbjct: 426  AFPVDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTP-SEEHDGGGDIGGEVSS 484

Query: 2806 XSTASSIKTENPSVPVQPFTYAT-------PIVDCSSEQESI-PAKTGQLGSCLNPILKA 2651
             S   S+K+ N S P Q    A+       P +D SS +  I P       S  NP +K 
Sbjct: 485  SSIIRSLKSSNVSKPGQKSNSASNVSTGLFPNMDSSSTRVLISPLNVAPPSSVSNPTVKP 544

Query: 2650 MPKSKDPR--------RALDLKQQSLTVDYNVPKREPIVGTMCSRKHKIVKESVLDGHNL 2495
            + KS+DPR          +DL  +++    +    E    T+  RK K+  E   DG  +
Sbjct: 545  LAKSRDPRLRIVNSDASGMDLNPRTMASVQSSSILES-AATLHLRKQKMDGEPNTDGPEV 603

Query: 2494 KRQRNGLTSSEVPVSRNEQMASRNDRWFGSGCSALPQ-HNNNKTCLAENMRTDFRKLENG 2318
            KR R G  +  V  S + +  S +  W      A P+  N N+  +AE   T+   + N 
Sbjct: 604  KRLRIGSQNLAVAAS-DVRAVSGSGGWLEDTMPAGPRLFNRNQMEIAEANATEKSNVTN- 661

Query: 2317 KFCSGKRQDTNSG-GNQQLPVFGTNTFVSLPS-LKDISMNPTMLVNLI--MEQQKLVADT 2150
                      NSG GN+  P    +   SLPS LKDI +NPTML+NL+   +QQ+L A+ 
Sbjct: 662  ----------NSGSGNECTPTVNNSNDASLPSLLKDIVVNPTMLLNLLKMSQQQQLAAEL 711

Query: 2149 QQK---PAQSMTLPSISSVAVGTVPLANDPSSNSAKIVQKSQIPAQIVPMDLREDLGKTR 1979
            + K   P ++   P+  +   G+ PL N P + S  + Q +  P+    +  ++DLGK R
Sbjct: 712  KLKSSEPEKNAICPTSLNPCQGSSPLINAPVATSGILQQSAGTPSASPVVGRQDDLGKVR 771

Query: 1978 MKPRDPRRILHNNTFKQNGCIGTEQLKTKGVPSSIIQASKDSLTVRQHAEXXXXXXXXXX 1799
            MKPRDPRR+LH N+ ++ G +G +QLK     +S  + S+D     +             
Sbjct: 772  MKPRDPRRVLHGNSLQKVGSLGNDQLKGVVPTASNTEGSRDIPNGHKQ--------EGQG 823

Query: 1798 XXXXXAKQNNIPGTLFDSQSATSFTTVAQTVSSQPIPRKIDNADVRVVATDSNNQESWTN 1619
                 + Q  +P      Q   +   +A  +S    P    N+  + V + S + +  T 
Sbjct: 824  DSKLASSQTILPD--IGRQFTNNLKNIADIMSVPSPPTSSPNSSSKPVGSSSMDSKPVTT 881

Query: 1618 STPKEGAAVSFQSQNAWEDVEHLFEGFDDQQKATIRRERARRKEEQNKLFAARKXXXXXX 1439
            +      A S +SQ AW D+EHLF+ +DD+QKA I+RERARR EEQ K+FAARK      
Sbjct: 882  AFQAVDMAASSRSQGAWGDLEHLFDSYDDKQKAAIQRERARRIEEQKKMFAARKLCLVLD 941

Query: 1438 XXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRFPHMGMWTKLRPGIWNFLEKA 1259
                 LNSAKFVEVDPVH+ ILRKKEEQDR+K QRHLFRFPHMGMWTKLRPG+WNFLEKA
Sbjct: 942  LDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKA 1001

Query: 1258 SKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRGDDEDPLDGDERVPKIKDLDGV 1079
            S+LYELHLYTMGNKLYA EMAKVLDP G LF  RVISRGDD DPLDGD+RVPK KDL+GV
Sbjct: 1002 SELYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPLDGDDRVPKSKDLEGV 1061

Query: 1078 LGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRHFGLSGPSLLEIGHDERLDDGA 899
            LGMES VVIIDDS RVWPHNK+NLI +ERY YFPCSRR FGL GPSLLEI HDER +DG 
Sbjct: 1062 LGMESGVVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGT 1121

Query: 898  LASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLAGCRIVFSRVFPVGEAKPHLHP 719
            LASSL VI+RIH  FF +  L+ VDVR IL+ EQ+ +LAGCRIVFSRVFPVGEA PHLHP
Sbjct: 1122 LASSLGVIQRIHQXFFSNPELDQVDVRTILSAEQQKILAGCRIVFSRVFPVGEANPHLHP 1181

Query: 718  LWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGRFVVHPCWVEASALLYRRAN 539
            LWQTA+QFGA CTNQIDE VTHVVANSLGTDKVNWALSTGRFVVHP WVEASALLYRRA 
Sbjct: 1182 LWQTAEQFGAQCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRAT 1241

Query: 538  EFDFAIK 518
            E DFAIK
Sbjct: 1242 EQDFAIK 1248


>ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
            gi|223548611|gb|EEF50102.1| RNA polymerase II ctd
            phosphatase, putative [Ricinus communis]
          Length = 1195

 Score =  799 bits (2064), Expect = 0.0
 Identities = 504/1090 (46%), Positives = 643/1090 (58%), Gaps = 62/1090 (5%)
 Frame = -2

Query: 3601 KDPKESYNEVCSRFQISLESLQLMRLDNCVPVVDALIQQIFMGIQAVNSVFYSMNPEQQE 3422
            KD  E   +V  +  I     ++M  +N   +VD + ++   G      +   M P ++ 
Sbjct: 135  KDGDEEKKKVVEKVVIDDNGDEMMDDNNRNKIVDVVEKE--EGELEEGEIDLDMEPGEKA 192

Query: 3421 QDKDIL------LRLLTYVKDQDFALFASKQKKE---IEAMIRCVESQSISSSVIMDKKK 3269
             + D+L      L + +  K  +  + + +   E   IE ++ C +S  +S S   +K+K
Sbjct: 193  NNGDVLNMNIDGLEVESGEKGFEKKMNSIRDALESVTIEFVLACTDSSGVSFSSFSEKEK 252

Query: 3268 ERGVINGMTXXXXXXXXXNPSGHSLNSTKKFHLEPVSVRYSDQNDSNMEFKTLKSKGYGG 3089
            E  +I+ +            SGH +++  K   +         N +N+  +  K+    G
Sbjct: 253  EP-LISTVVNKKDNDVNGKSSGHDMSAVNKLPTDSFV-----NNKANLSIEGPKT----G 302

Query: 3088 VGS----------LNLPSDHDGDRTLDTLQ--LFPIQNQSDSATCTVSDKTEDAAVLPYE 2945
            V S          L+L  DHD D      +    P+          V D T ++ + PYE
Sbjct: 303  VSSFKSRAALLPLLDLHKDHDADSLPSPTRESALPLPAYRVLTPKMVLD-TGNSRMHPYE 361

Query: 2944 TDALRAVSTYQQRFN-SSFLLSNRLPSPTPXXXXXXXXXXXXXXXXXXSTASSIKTENPS 2768
            TDAL+AVS+YQQ+F+ SSF L++RLPSPTP                   + SS +  NP 
Sbjct: 362  TDALKAVSSYQQKFSKSSFALTDRLPSPTPSEESGNGDGDTGGEVSSSLSVSSFRPANPL 421

Query: 2767 VPVQP-FTYATPIVDCSSEQESIPAKTG-QLGSCLNPILKAMPKSKDPR--------RAL 2618
               Q   + + P +D SS    I  K+  +  S  +  +KA  KS+DPR         AL
Sbjct: 422  TSGQSNASISLPRMDGSSLPGVISIKSAVRASSAPSLTVKASAKSRDPRLRFVNSDSNAL 481

Query: 2617 DLKQQSLTVDYNVPKREPIVGTMCSRKHKIVKESVLDGHNLKRQRNGLTSSEVPVSRNEQ 2438
            D   +++ V  N  K EPI GTM  ++ KIV + + DGH+LKRQ+N L +S V   R+ +
Sbjct: 482  DQNHRAVPV-VNTLKVEPIGGTMNKKRQKIVDDPIPDGHSLKRQKNALENSGVV--RDVK 538

Query: 2437 MASRNDRWFGSGCSALPQHNNNKTCLAENMRTDFRKLENGKFCSGKR--QDTNSGGNQQL 2264
                +  W        PQ   NK  L +N  +D R+ + G  C+        N  G +Q+
Sbjct: 539  TMVGSGGWLEDTDMVGPQ-TMNKNQLVDNAESDPRRKDGGGVCTSSSCISSVNISGTEQI 597

Query: 2263 PVFGTNTFV------------SLPSL-KDISMNPTMLVNLIM--EQQKLVADTQQKP--- 2138
            PV GT+  +            ++P L K+I++NPTML+N++   +QQ+L  + QQKP   
Sbjct: 598  PVTGTSVPIGGELVPVKGSTAAIPDLLKNIAVNPTMLINILKMGQQQRLALEAQQKPVDP 657

Query: 2137 AQSMTLPSISSVAVGTVPLANDPSSNSAKIVQKSQIPAQIVP-MDLREDLGKTRMKPRDP 1961
            A+S T P  S+  +GTVP+     S    I+ +     Q+ P +   +DLGK RMKPRDP
Sbjct: 658  AKSTTYPLNSNSMLGTVPVVGAAHSG---ILPRPAGTVQVSPQLGTADDLGKIRMKPRDP 714

Query: 1960 RRILHNNTFKQNGCIGTEQLKTKGVPSSIIQASKDSLTV-RQHAEXXXXXXXXXXXXXXX 1784
            RR+LHNN  ++NG +G+E LKT      I Q +KD+  + +Q  +               
Sbjct: 715  RRVLHNNALQRNGSMGSEHLKTNLTSIPINQETKDNQNLQKQEGQVEKKPVPLQSLALPD 774

Query: 1783 AKQ------NNIPGTLFDSQSATSFTTVAQTVSSQPIPRKIDNADVRVVATDSNNQESWT 1622
                      NI   +  S ++TS   V Q  +SQP+   I ++D          Q    
Sbjct: 775  ISMPFTKNLKNIADIVSVSHASTSQPLVPQNPASQPMRTTISSSD----------QFLGI 824

Query: 1621 NSTPKEGAAVSF--QSQNAWEDVEHLFEGFDDQQKATIRRERARRKEEQNKLFAARKXXX 1448
             S P   AA +   ++QNAW DVEHLFEG++DQQKA I+RERARR EEQ KLF+ARK   
Sbjct: 825  GSAPGAAAAAAAGPRTQNAWGDVEHLFEGYNDQQKAAIQRERARRIEEQKKLFSARKLCL 884

Query: 1447 XXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRFPHMGMWTKLRPGIWNFL 1268
                    LNSAKFVEVDPVH+ ILRKKEEQDR+K  RHLFRFPHMGMWTKLRPGIWNFL
Sbjct: 885  VLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAHRHLFRFPHMGMWTKLRPGIWNFL 944

Query: 1267 EKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRGDDEDPLDGDERVPKIKDL 1088
            EKASKLYELHLYTMGNKLYA EMAKVLDPTG LF  RVISRGDD +P DGDER+PK KDL
Sbjct: 945  EKASKLYELHLYTMGNKLYATEMAKVLDPTGVLFNGRVISRGDDGEPFDGDERIPKSKDL 1004

Query: 1087 DGVLGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRHFGLSGPSLLEIGHDERLD 908
            +GVLGMES VVI+DDS RVWPHNKLNLI +ERYIYFPCSRR FGL GPSLLEI HDER +
Sbjct: 1005 EGVLGMESGVVIMDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPE 1064

Query: 907  DGALASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLAGCRIVFSRVFPVGEAKPH 728
            DG LA SLAVIERIH NFF H SL++ DVRNILA EQ+ +LAGCRIVFSRVFPVGEA PH
Sbjct: 1065 DGTLACSLAVIERIHQNFFTHPSLDEADVRNILASEQRKILAGCRIVFSRVFPVGEANPH 1124

Query: 727  LHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGRFVVHPCWVEASALLYR 548
            LHPLWQTA+QFGAVCTNQIDE VTHVVANSLGTDKVNWALSTGRFVV+P WVEASALLYR
Sbjct: 1125 LHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVYPGWVEASALLYR 1184

Query: 547  RANEFDFAIK 518
            RANE DFAIK
Sbjct: 1185 RANEQDFAIK 1194


>ref|XP_007225412.1| hypothetical protein PRUPE_ppa000589mg [Prunus persica]
            gi|462422348|gb|EMJ26611.1| hypothetical protein
            PRUPE_ppa000589mg [Prunus persica]
          Length = 1085

 Score =  790 bits (2040), Expect = 0.0
 Identities = 468/896 (52%), Positives = 575/896 (64%), Gaps = 42/896 (4%)
 Frame = -2

Query: 3079 LNLPSDHDGDR----TLDTLQLFPIQNQ-----------SDSATCTVSDKTEDAAVLPYE 2945
            L+L  DHD D     T +T   FP+QN            SD+AT  V+   ED+ +  YE
Sbjct: 205  LDLHKDHDADSLPSPTRETPSCFPVQNTLVVADGMVKSASDTATARVALNAEDSRLHSYE 264

Query: 2944 TDALRAVSTYQQRFN-SSFLLSNRLPSPTPXXXXXXXXXXXXXXXXXXSTASSIKTENPS 2768
            T+AL+AVS+YQQ+FN SSFL+S RLPSPTP                    AS+++T  P 
Sbjct: 265  TEALKAVSSYQQKFNRSSFLMSERLPSPTPSEDGGNGDDDTGGEVSSSF-ASNLRTSCPP 323

Query: 2767 VPVQPFTYATPI-VDCSSEQESIPAKTGQL-GSCLNPILKAMPKSKDPRR--------AL 2618
            +  +     +PI V   S Q    AK+     S  +  +KA  KS+DPR         AL
Sbjct: 324  ISGRQIVSPSPIPVGSPSMQGRATAKSAAPPNSEPSMTIKASAKSRDPRLRFANSDMGAL 383

Query: 2617 DLKQQSLTVDYNVPKREPIVGTMCSRKHKIVKESVLDGHNLKRQRNGLTSSEVPVSRNEQ 2438
            +L QQ  TV ++ PK + ++ T+ SRK K ++ES  DG  LKRQRN L +S +    + +
Sbjct: 384  NLNQQPSTVVHSAPKVDSVI-TLSSRKQKPLEESRFDGPALKRQRNALENSGIV--GDAK 440

Query: 2437 MASRNDRWFGSGCSALPQHNNNKTCLAENMRTDFRKLENGKFCSGKRQDTNSGG----NQ 2270
             AS +  W       +  H N+K    EN  TD R +      S    D N+ G    N+
Sbjct: 441  TASGSGGWL-EDIGGVGPHLNSKNQTVENAETDPRNVVK-VLSSPSTVDCNTNGPNSANE 498

Query: 2269 QLPVFGTNTFVSLPSL-KDISMNPTMLVNLIM--EQQKLVADTQQK---PAQSMTLPSIS 2108
             + + G +   SLP L KDI++NPTML+NL+   +QQ++ ++  QK   P ++MT P+ S
Sbjct: 499  HVSLMGAS-MASLPELLKDIAVNPTMLLNLLKMGQQQRVASEAHQKSADPPKTMTHPTSS 557

Query: 2107 SVAVGTVPLANDPSSNSAKIVQKSQIPAQIVPMD----LREDLGKTRMKPRDPRRILHNN 1940
            S  + +  L N PS  S  +    Q PA  +P+     L ++ GK RMKPRDPRR LH N
Sbjct: 558  SSILVSAALGNVPSKTSGIL----QTPAGTLPVSSQKALMDESGKVRMKPRDPRRALHGN 613

Query: 1939 TFKQNGCIGTEQLKTKGVPSSIIQASKDSLTVRQHAEXXXXXXXXXXXXXXXAKQN--NI 1766
              +++G +G EQ +    P S IQ +KD+L  +   +                 +N  NI
Sbjct: 614  ALQKSGSLGQEQFRNIIPPLSAIQGNKDNLNGQADKKLVTSQSLDAPDITRQFTKNLKNI 673

Query: 1765 PGTLFDSQSATSFTTVAQTVSSQPIPRKIDNADVRVVATDSNNQESWTNSTPKEGAAVSF 1586
               +  S  +TS    +Q+VSSQ +P K +  D++        Q   + S  +  AA   
Sbjct: 674  ADIMSVSNVSTSPAIASQSVSSQLVPIKPERIDLK-----PEEQRPESISASEAAAAGPS 728

Query: 1585 QSQNAWEDVEHLFEGFDDQQKATIRRERARRKEEQNKLFAARKXXXXXXXXXXXLNSAKF 1406
            +S   W DVEHLFEG+DDQQKA I+RER RR EEQ K+FAA K           LNSAKF
Sbjct: 729  RSPVMWGDVEHLFEGYDDQQKAAIQRERTRRIEEQKKMFAAHKLCLVLDLDHTLLNSAKF 788

Query: 1405 VEVDPVHEMILRKKEEQDRQKPQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTM 1226
            VEVDPVH+ ILRKKEEQDR+KPQRHLFRF HMGMWTKLRPGIWNFLEKAS+L+ELHLYTM
Sbjct: 789  VEVDPVHDEILRKKEEQDREKPQRHLFRFHHMGMWTKLRPGIWNFLEKASQLFELHLYTM 848

Query: 1225 GNKLYAREMAKVLDPTGSLFGERVISRGDDEDPLDGDERVPKIKDLDGVLGMESSVVIID 1046
            GNKLYA EMAKVLDPTG+LF  RVISRGDD DP DGDER+PK KDL+GVLGMES+VVIID
Sbjct: 849  GNKLYATEMAKVLDPTGALFAGRVISRGDDGDPEDGDERIPKSKDLEGVLGMESAVVIID 908

Query: 1045 DSARVWPHNKLNLIALERYIYFPCSRRHFGLSGPSLLEIGHDERLDDGALASSLAVIERI 866
            DS RVWPHNKLNLI +ERY YFPCSRR FGL GPSLLEI HDER +DG LASSLAVIE+I
Sbjct: 909  DSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERQEDGTLASSLAVIEKI 968

Query: 865  HHNFFLHQSLNDVDVRNILAKEQKNVLAGCRIVFSRVFPVGEAKPHLHPLWQTAQQFGAV 686
            H  FF H SL++ DVRNILA EQ+ +LAGCRIVFSRVFPVGE KPHLHPLWQTA+QFGAV
Sbjct: 969  HQLFFSHSSLDEADVRNILASEQRKILAGCRIVFSRVFPVGEVKPHLHPLWQTAEQFGAV 1028

Query: 685  CTNQIDEHVTHVVANSLGTDKVNWALSTGRFVVHPCWVEASALLYRRANEFDFAIK 518
            CTNQID+ VTHVVANSLGTDKVNWALS+G++VVHP WVEASALLYRRANE DFAIK
Sbjct: 1029 CTNQIDDQVTHVVANSLGTDKVNWALSSGKYVVHPGWVEASALLYRRANEQDFAIK 1084


>ref|XP_004492029.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like isoform X2 [Cicer arietinum]
          Length = 1227

 Score =  786 bits (2030), Expect = 0.0
 Identities = 501/1083 (46%), Positives = 648/1083 (59%), Gaps = 42/1083 (3%)
 Frame = -2

Query: 3631 IREALDTATVKDPKESYNEVCSRFQISLESLQLMRLDNCVPVVDALIQQIFMGIQAVNSV 3452
            IR+ L+  TV +  ES+ E  SR    L+S  L      V   D +I+ ++  I+ V+SV
Sbjct: 162  IRDFLEGVTVANVAESFAETISRLLRVLQSKLLS--GPAVSEKDYVIRLLYNAIEIVHSV 219

Query: 3451 FYSMNPEQQEQDKDILLRLLTYVKDQDFALFASKQKKEIEAMIRCVES-QSISSSVIMDK 3275
            F SM+  Q+E +KD ++RLL ++K++   LF+ +  KEI+ MI  +++  ++ +SV++  
Sbjct: 220  FCSMDNLQKEDNKDNIIRLLYFLKNEHTQLFSPEHMKEIQVMITAIDTVDALGNSVVVGN 279

Query: 3274 KKERGVINGMTXXXXXXXXXNPSGHSLNSTKKFH---LEPVSVRYSDQNDSNMEFKTLKS 3104
             ++   ++  T           +   ++S+K  H    E      S Q++       L  
Sbjct: 280  GEKLDTLDIKTRQIQGLK----ASELISSSKLVHSNLTEASEALLSGQSNIKGRGVMLPL 335

Query: 3103 KGYGGVGSLN-LPSD-HDGDRTLDTLQLFPIQNQSD-----SATCTVSDK----TEDAAV 2957
                 V  L+ LPS   +        +LF + +  D     SA  T + K    TE++  
Sbjct: 336  FDLHKVHDLDSLPSPTREAPSFFPVNKLFSVGDGMDRPGLPSAGKTEAVKMELDTENSKN 395

Query: 2956 LPYETDALRAVSTYQQRFN-SSFLLSNRLPSPTPXXXXXXXXXXXXXXXXXXSTASSIKT 2780
              YETDAL+AVSTYQQ+F  SS+   ++ PSPTP                  S A S+ +
Sbjct: 396  HLYETDALKAVSTYQQKFGRSSYFTDDKFPSPTPSGDCEEGVADANEEVSSASIAVSLTS 455

Query: 2779 ENPSVPVQPFTYATPIVDCSSEQESIPAKTGQLGSCLNPILKAMPKSKDPR--------R 2624
              P +   P +  +  VD SS    I ++     S   P+ K   +S+DPR         
Sbjct: 456  SKPLLDQMPVSSTS--VDRSSMHGLINSRIEAASSVTYPV-KTSARSRDPRLRFINSDAS 512

Query: 2623 ALDLKQQSLTVDYNVPKREPIVGTMCSRKHKIVKESVLDGHNLKRQRNGLTSSEVPVSRN 2444
            ALDL Q   T   N+PK E   G + SRK K  +E  LD    KR R+ L +S    +R 
Sbjct: 513  ALDLNQSLGT--NNMPKVEN-AGRVISRKQKTTEELSLDATAPKRLRSSLENSRHN-TRE 568

Query: 2443 EQMASRNDRWFGSGCSALPQHNNNKTCLAENMRTDFRKLENGKFCSGKRQDTNSGGNQQL 2264
            E+  + N  W      A   H   +  L +   T+ +K  +    S       S GN+Q 
Sbjct: 569  ERTMAGNGGWLEENRVA-GSHLIERNHLMQKGETELKKTMS---TSSGYSTVTSNGNEQA 624

Query: 2263 PVFGTNTFVSLPSL-KDISMNPTMLVNLIMEQQ-KLVADTQQKPAQSMTLPS-ISSVAVG 2093
            PV  +NT  +LP L K+I++NPTML+N+++EQQ +L A+  +KP  S T    +++ A G
Sbjct: 625  PVTVSNTAAALPGLLKNIAVNPTMLLNILLEQQQRLAAEANKKPVDSATSTMHLTNSARG 684

Query: 2092 TVPLANDPSSNSAKIVQKS--QIPAQI----VPMDLREDLGKTRMKPRDPRRILH-NNTF 1934
                 N   + +A + Q S   +PA      +   L ED GK RMKPRDPRRILH +++ 
Sbjct: 685  PDATVNTGPAMTAGLPQSSVGMLPASTQAASMAHTLLEDSGKIRMKPRDPRRILHGSSSL 744

Query: 1933 KQNGCIGTEQLKTKGVPSSIIQAS-----KDSLTVRQHAEXXXXXXXXXXXXXXXAKQN- 1772
            +++G  G+EQ K+   P+S  Q +        L VR   +                 +N 
Sbjct: 745  QKSGSTGSEQSKSVVSPTSNNQGNGGNVNAQKLDVRVETKLAPTQSSAQPDITRQFTKNL 804

Query: 1771 -NIPGTLFDSQS-ATSFTTVAQTVSSQPIPRKIDNADVRVVATDSNNQESWTNSTPKEGA 1598
             NI   +  SQ  +T      Q VSS  +P  +D A+++    +S N +    S P+  A
Sbjct: 805  KNIADIMSVSQEPSTQLPATTQNVSSASVPFTLDKAELKSGVPNSQNLQDGVGSAPETCA 864

Query: 1597 AVSFQSQNAWEDVEHLFEGFDDQQKATIRRERARRKEEQNKLFAARKXXXXXXXXXXXLN 1418
              S +SQ+ W DVEHLFEG+D++QKA I+RERARR EEQNK+FA++K           LN
Sbjct: 865  PGSSRSQSTWADVEHLFEGYDEKQKAAIQRERARRLEEQNKMFASKKLCLVLDLDHTLLN 924

Query: 1417 SAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELH 1238
            SAKFVEVDPVH+ ILRKKEEQDR+KP RHLFRFPHMGMWTKLRPG+WNFLEKASKLYELH
Sbjct: 925  SAKFVEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGVWNFLEKASKLYELH 984

Query: 1237 LYTMGNKLYAREMAKVLDPTGSLFGERVISRGDDEDPLDGDERVPKIKDLDGVLGMESSV 1058
            LYTMGNKLYA EMAKVLDP G LF  RVISRGDD + +DGDER PK KDL+GV+GMESSV
Sbjct: 985  LYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDTESVDGDERAPKSKDLEGVMGMESSV 1044

Query: 1057 VIIDDSARVWPHNKLNLIALERYIYFPCSRRHFGLSGPSLLEIGHDERLDDGALASSLAV 878
            VI+DDS RVWPHNKLNLI +ERY YFPCSRR FGL GPSLLEI HDER + G LASSLAV
Sbjct: 1045 VIVDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEAGTLASSLAV 1104

Query: 877  IERIHHNFFLHQSLNDVDVRNILAKEQKNVLAGCRIVFSRVFPVGEAKPHLHPLWQTAQQ 698
            IERIH NFF  QSL +VDVRNILA EQ+ +LAGCRIVFSRVFPVGEA PHLHPLWQTA+Q
Sbjct: 1105 IERIHQNFFASQSLEEVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQ 1164

Query: 697  FGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGRFVVHPCWVEASALLYRRANEFDFAIK 518
            FGAVC NQID+ VTHVVANSLGTDKVNWA+STGRFVVHP WVEASALLYRRANE DFAIK
Sbjct: 1165 FGAVCINQIDDQVTHVVANSLGTDKVNWAISTGRFVVHPGWVEASALLYRRANEQDFAIK 1224

Query: 517  TKQ 509
             ++
Sbjct: 1225 PEK 1227


>ref|XP_004492028.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like isoform X1 [Cicer arietinum]
          Length = 1247

 Score =  786 bits (2030), Expect = 0.0
 Identities = 501/1083 (46%), Positives = 648/1083 (59%), Gaps = 42/1083 (3%)
 Frame = -2

Query: 3631 IREALDTATVKDPKESYNEVCSRFQISLESLQLMRLDNCVPVVDALIQQIFMGIQAVNSV 3452
            IR+ L+  TV +  ES+ E  SR    L+S  L      V   D +I+ ++  I+ V+SV
Sbjct: 182  IRDFLEGVTVANVAESFAETISRLLRVLQSKLLS--GPAVSEKDYVIRLLYNAIEIVHSV 239

Query: 3451 FYSMNPEQQEQDKDILLRLLTYVKDQDFALFASKQKKEIEAMIRCVES-QSISSSVIMDK 3275
            F SM+  Q+E +KD ++RLL ++K++   LF+ +  KEI+ MI  +++  ++ +SV++  
Sbjct: 240  FCSMDNLQKEDNKDNIIRLLYFLKNEHTQLFSPEHMKEIQVMITAIDTVDALGNSVVVGN 299

Query: 3274 KKERGVINGMTXXXXXXXXXNPSGHSLNSTKKFH---LEPVSVRYSDQNDSNMEFKTLKS 3104
             ++   ++  T           +   ++S+K  H    E      S Q++       L  
Sbjct: 300  GEKLDTLDIKTRQIQGLK----ASELISSSKLVHSNLTEASEALLSGQSNIKGRGVMLPL 355

Query: 3103 KGYGGVGSLN-LPSD-HDGDRTLDTLQLFPIQNQSD-----SATCTVSDK----TEDAAV 2957
                 V  L+ LPS   +        +LF + +  D     SA  T + K    TE++  
Sbjct: 356  FDLHKVHDLDSLPSPTREAPSFFPVNKLFSVGDGMDRPGLPSAGKTEAVKMELDTENSKN 415

Query: 2956 LPYETDALRAVSTYQQRFN-SSFLLSNRLPSPTPXXXXXXXXXXXXXXXXXXSTASSIKT 2780
              YETDAL+AVSTYQQ+F  SS+   ++ PSPTP                  S A S+ +
Sbjct: 416  HLYETDALKAVSTYQQKFGRSSYFTDDKFPSPTPSGDCEEGVADANEEVSSASIAVSLTS 475

Query: 2779 ENPSVPVQPFTYATPIVDCSSEQESIPAKTGQLGSCLNPILKAMPKSKDPR--------R 2624
              P +   P +  +  VD SS    I ++     S   P+ K   +S+DPR         
Sbjct: 476  SKPLLDQMPVSSTS--VDRSSMHGLINSRIEAASSVTYPV-KTSARSRDPRLRFINSDAS 532

Query: 2623 ALDLKQQSLTVDYNVPKREPIVGTMCSRKHKIVKESVLDGHNLKRQRNGLTSSEVPVSRN 2444
            ALDL Q   T   N+PK E   G + SRK K  +E  LD    KR R+ L +S    +R 
Sbjct: 533  ALDLNQSLGT--NNMPKVEN-AGRVISRKQKTTEELSLDATAPKRLRSSLENSRHN-TRE 588

Query: 2443 EQMASRNDRWFGSGCSALPQHNNNKTCLAENMRTDFRKLENGKFCSGKRQDTNSGGNQQL 2264
            E+  + N  W      A   H   +  L +   T+ +K  +    S       S GN+Q 
Sbjct: 589  ERTMAGNGGWLEENRVA-GSHLIERNHLMQKGETELKKTMS---TSSGYSTVTSNGNEQA 644

Query: 2263 PVFGTNTFVSLPSL-KDISMNPTMLVNLIMEQQ-KLVADTQQKPAQSMTLPS-ISSVAVG 2093
            PV  +NT  +LP L K+I++NPTML+N+++EQQ +L A+  +KP  S T    +++ A G
Sbjct: 645  PVTVSNTAAALPGLLKNIAVNPTMLLNILLEQQQRLAAEANKKPVDSATSTMHLTNSARG 704

Query: 2092 TVPLANDPSSNSAKIVQKS--QIPAQI----VPMDLREDLGKTRMKPRDPRRILH-NNTF 1934
                 N   + +A + Q S   +PA      +   L ED GK RMKPRDPRRILH +++ 
Sbjct: 705  PDATVNTGPAMTAGLPQSSVGMLPASTQAASMAHTLLEDSGKIRMKPRDPRRILHGSSSL 764

Query: 1933 KQNGCIGTEQLKTKGVPSSIIQAS-----KDSLTVRQHAEXXXXXXXXXXXXXXXAKQN- 1772
            +++G  G+EQ K+   P+S  Q +        L VR   +                 +N 
Sbjct: 765  QKSGSTGSEQSKSVVSPTSNNQGNGGNVNAQKLDVRVETKLAPTQSSAQPDITRQFTKNL 824

Query: 1771 -NIPGTLFDSQS-ATSFTTVAQTVSSQPIPRKIDNADVRVVATDSNNQESWTNSTPKEGA 1598
             NI   +  SQ  +T      Q VSS  +P  +D A+++    +S N +    S P+  A
Sbjct: 825  KNIADIMSVSQEPSTQLPATTQNVSSASVPFTLDKAELKSGVPNSQNLQDGVGSAPETCA 884

Query: 1597 AVSFQSQNAWEDVEHLFEGFDDQQKATIRRERARRKEEQNKLFAARKXXXXXXXXXXXLN 1418
              S +SQ+ W DVEHLFEG+D++QKA I+RERARR EEQNK+FA++K           LN
Sbjct: 885  PGSSRSQSTWADVEHLFEGYDEKQKAAIQRERARRLEEQNKMFASKKLCLVLDLDHTLLN 944

Query: 1417 SAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELH 1238
            SAKFVEVDPVH+ ILRKKEEQDR+KP RHLFRFPHMGMWTKLRPG+WNFLEKASKLYELH
Sbjct: 945  SAKFVEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGVWNFLEKASKLYELH 1004

Query: 1237 LYTMGNKLYAREMAKVLDPTGSLFGERVISRGDDEDPLDGDERVPKIKDLDGVLGMESSV 1058
            LYTMGNKLYA EMAKVLDP G LF  RVISRGDD + +DGDER PK KDL+GV+GMESSV
Sbjct: 1005 LYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDTESVDGDERAPKSKDLEGVMGMESSV 1064

Query: 1057 VIIDDSARVWPHNKLNLIALERYIYFPCSRRHFGLSGPSLLEIGHDERLDDGALASSLAV 878
            VI+DDS RVWPHNKLNLI +ERY YFPCSRR FGL GPSLLEI HDER + G LASSLAV
Sbjct: 1065 VIVDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEAGTLASSLAV 1124

Query: 877  IERIHHNFFLHQSLNDVDVRNILAKEQKNVLAGCRIVFSRVFPVGEAKPHLHPLWQTAQQ 698
            IERIH NFF  QSL +VDVRNILA EQ+ +LAGCRIVFSRVFPVGEA PHLHPLWQTA+Q
Sbjct: 1125 IERIHQNFFASQSLEEVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQ 1184

Query: 697  FGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGRFVVHPCWVEASALLYRRANEFDFAIK 518
            FGAVC NQID+ VTHVVANSLGTDKVNWA+STGRFVVHP WVEASALLYRRANE DFAIK
Sbjct: 1185 FGAVCINQIDDQVTHVVANSLGTDKVNWAISTGRFVVHPGWVEASALLYRRANEQDFAIK 1244

Query: 517  TKQ 509
             ++
Sbjct: 1245 PEK 1247


>ref|XP_006603006.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Glycine max]
          Length = 1257

 Score =  784 bits (2025), Expect = 0.0
 Identities = 501/1088 (46%), Positives = 645/1088 (59%), Gaps = 50/1088 (4%)
 Frame = -2

Query: 3631 IREALDTATVKDPKESYNEVCSRFQISLESLQLMRLDNCVPVVDALIQQIFMGIQAVNSV 3452
            +R  L+  TV +  ES+ + CS+ Q +L  +     D+     D L++  F   + V SV
Sbjct: 191  VRGVLEGVTVANVAESFAQTCSKLQNALPEVLSRPADS---ERDDLVRLSFNATEVVYSV 247

Query: 3451 FYSMNPEQQEQDKDILLRLLTYVKDQDFA-LFASKQKKEIEAMIRCVES-QSISSSVIMD 3278
            F SM+  ++EQ+KD +LRLL++VKDQ  A LF+ +  KEI+ M+  ++   ++ +S  + 
Sbjct: 248  FCSMDSLKKEQNKDSILRLLSFVKDQQQAQLFSPEHIKEIQGMMTAIDYFGALVNSEAIG 307

Query: 3277 KKKERGVINGMTXXXXXXXXXNPSGHSLNSTKKFHLEPVSVRYSDQNDSNMEFKTLKSKG 3098
            K+KE                   +   ++  K  H + +   ++      ++F     KG
Sbjct: 308  KEKELQTTVQTHEIKTQENQAVEAAELISYNKPLHSDIIGASHA------LKFGQNSIKG 361

Query: 3097 YGGV-GSLNLPSDHDGDR----TLDTLQLFPIQ----------NQSDSATCTVSDK---- 2975
             G +   L+L  DHD D     T +    FP+           +   +A    S K    
Sbjct: 362  RGVLLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGEPMVSSGSAAAKPESGKMELD 421

Query: 2974 TEDAAVLPYETDALRAVSTYQQRFNSSFLLSN-RLPSPTPXXXXXXXXXXXXXXXXXXST 2798
            +E +    YETDAL+AVSTYQQ+F  S L +N + PSPTP                  ST
Sbjct: 422  SEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPSGDCEDEIVDTNEEVSSAST 481

Query: 2797 ASSIKTENPSVPVQPFTYATPIVDCSSEQESIPAKTGQLGSCLNPILKAMPKSKDPRRAL 2618
               + +  P++   P   AT   D SS    I ++    G    P+ K+  K++DPR   
Sbjct: 482  GDFLTSTKPTLLDLPPVSATS-TDRSSLHGFISSRVDAAGPGSLPV-KSSAKNRDPRLRF 539

Query: 2617 DLKQQSL-----TVDYNVPKREPIVGTMCSRKHKIVKESVLDGHNLKRQRNGLTSSEVPV 2453
                 S      T+ +N+PK E   GT  SRK K  +E  LD    KRQ++ L ++E  +
Sbjct: 540  VNSDASAVDNPSTLIHNMPKVE-YAGTTISRKQKAAEEPSLDVTVSKRQKSPLENTEHNM 598

Query: 2452 SR-NEQMASRNDRWFGSGCSALPQHNNNKTCLAENMRTDFRKLEN--GKFCSGKRQ-DTN 2285
            S     +    +   G G   + +++     L +    + +K  N     C+G    +  
Sbjct: 599  SEVRTGIGGWLEEHTGPGAQFIERNH-----LMDKFGPEPQKTLNTVSSSCTGSDNFNAT 653

Query: 2284 SGGNQQLPVFGTNTFVSLPSL-KDISMNPTMLVNLIMEQQKLVADTQQKPAQSMTL---- 2120
            S  N+Q P+  +N   SLP+L K  ++NPTMLVNL+      +A+ Q+K A S T     
Sbjct: 654  SIRNEQAPITSSNVLASLPALLKGAAVNPTMLVNLLR-----IAEAQKKSADSATNMLLH 708

Query: 2119 PSISSVAVGTVPLANDPSSNSAKIVQKS----QIPAQIVPMD--LREDLGKTRMKPRDPR 1958
            P+ S+ A+GT   A+  SS +  ++Q S     + +Q   M   L++D GK RMKPRDPR
Sbjct: 709  PTSSNSAMGTDSTASIGSSMATGLLQSSVGMLPVSSQSTSMTQTLQDDSGKIRMKPRDPR 768

Query: 1957 RILH-NNTFKQNGCIGTEQLKTKGVPSSIIQASKDSLTV-----RQHAEXXXXXXXXXXX 1796
            RILH NNT +++G +G EQ K    P S  Q + D++       R  ++           
Sbjct: 769  RILHTNNTIQKSGNLGNEQFKAIVSPVSNNQGTGDNVNAQKLEGRVDSKLVPTQPSAQPD 828

Query: 1795 XXXXAKQN--NIPGTLFDSQSATSFTTVAQTVSSQPIPRKIDNADVRVVATDSNNQESWT 1622
                  +N  NI   +  SQ +++ T VAQ  SS  +P   D  + + V ++S N E+  
Sbjct: 829  IARQFARNLKNIADIMSVSQESSTHTPVAQIFSSASVPLTSDRGEQKSVVSNSQNLEAGM 888

Query: 1621 NSTPKEGAAVSFQSQNAWEDVEHLFEGFDDQQKATIRRERARRKEEQNKLFAARKXXXXX 1442
             S  +  A+ + +SQN W DVEHLFEG+D+QQKA I+RERARR EEQNK+FAARK     
Sbjct: 889  VSAHETAASGTCRSQNTWGDVEHLFEGYDEQQKAAIQRERARRIEEQNKMFAARKLCLVL 948

Query: 1441 XXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRFPHMGMWTKLRPGIWNFLEK 1262
                  LNSAKFVEVDPVH+ ILRKKEEQDR+KP RHLFRFPHMGMWTKLRPGIWNFLEK
Sbjct: 949  DLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWNFLEK 1008

Query: 1261 ASKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRGDDEDPLDGDERVPKIKDLDG 1082
            ASKLYELHLYTMGNKLYA EMAKVLDP G LF  RVISRGDD D +DG+ER PK KDL+G
Sbjct: 1009 ASKLYELHLYTMGNKLYATEMAKVLDPKGLLFAGRVISRGDDTDSVDGEERAPKSKDLEG 1068

Query: 1081 VLGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRHFGLSGPSLLEIGHDERLDDG 902
            VLGMESSVVIIDDS RVWPHNKLNLI +ERY YFPCSRR FGL GPSLLEI HDER + G
Sbjct: 1069 VLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEAG 1128

Query: 901  ALASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLAGCRIVFSRVFPVGEAKPHLH 722
             LASSLAVIE+IH  FF  +SL +VDVRNILA EQ+ +LAGCRIVFSRVFPVGEA PHLH
Sbjct: 1129 TLASSLAVIEKIHQIFFASRSLEEVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLH 1188

Query: 721  PLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGRFVVHPCWVEASALLYRRA 542
            PLWQTA+QFGA CTNQIDE VTHVVANS GTDKVNWAL+ GRFVVHP WVEASALLYRRA
Sbjct: 1189 PLWQTAEQFGAFCTNQIDEQVTHVVANSPGTDKVNWALNNGRFVVHPGWVEASALLYRRA 1248

Query: 541  NEFDFAIK 518
            NE DFAIK
Sbjct: 1249 NEQDFAIK 1256


>ref|XP_003530482.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Glycine max]
          Length = 1261

 Score =  784 bits (2024), Expect = 0.0
 Identities = 508/1124 (45%), Positives = 659/1124 (58%), Gaps = 61/1124 (5%)
 Frame = -2

Query: 3706 EEEGIWVVSSDRNLQLREFEMKIKS-----IREALDTATVKDPKESYNEVCSRFQISLES 3542
            E E + V  SD   +L + +M +        R  L+  TV +  ES+ + CS+ Q +L  
Sbjct: 160  EAESVVVAVSDSE-KLDDVKMDVSDSEQLGARGVLEGVTVANVVESFAQTCSKLQNTLPE 218

Query: 3541 LQLMRLDNCVPVVDALIQQIFMGIQAVNSVFYSMNPEQQEQDKDILLRLLTYVKDQDFA- 3365
            +      +     D L++  F   + V SVF SM+  ++EQ+KD +LRLL++VKDQ  A 
Sbjct: 219  VLSRPAGS---EKDDLVRLSFNATEVVYSVFCSMDSSEKEQNKDSILRLLSFVKDQQQAQ 275

Query: 3364 LFASKQKKEIEAMIRCVESQ-SISSSVIMDKKKERGVINGMTXXXXXXXXXNPSGHSLNS 3188
            LF+ +  KEI+ M+  ++S  ++ +S  + K+KE       T             H + +
Sbjct: 276  LFSPEHVKEIQGMMTAIDSVGALVNSEAIGKEKELQTTEIKTQENSAVEVQI---HEIKT 332

Query: 3187 TKKFHLEPVS-VRYSDQ-------NDSNMEFKTLKSKGYGGV-GSLNLPSDHDGDR---- 3047
             +   +E    + YS             ++F     KG G +   L+L  DHD D     
Sbjct: 333  QENQAVEAAELISYSKPLHRDITGTSQALKFGQNSIKGRGVLLPLLDLHKDHDADSLPSP 392

Query: 3046 TLDTLQLFPIQN---------QSDSATCTVSDKTEDAAVLPYETDALRAVSTYQQRFNSS 2894
            T +    FP+           +S SA+  +   +E +    YETDAL+AVSTYQQ+F  S
Sbjct: 393  TREAPSCFPVNKLLSVGESMVRSGSASAKMELDSEGSKFHLYETDALKAVSTYQQKFGRS 452

Query: 2893 FLLSN-RLPSPTPXXXXXXXXXXXXXXXXXXSTASSIKTENPSVPVQPFTYATPIVDCSS 2717
             L +N + PSPTP                  ST   + +  P++  QP   AT + D SS
Sbjct: 453  SLFTNDKFPSPTPSGDCEDEVVDTNEEVSSASTGDFLTSTKPTLLDQPPVSATSM-DRSS 511

Query: 2716 EQESIPAKTGQLGSCLNPILKAMPKSKDPRRALDLKQQSL-----TVDYNVPKREPIVGT 2552
                I ++    G    P+ K+  K++DPR        S      T+  N+ K E   GT
Sbjct: 512  MHGFISSRVDATGPGSFPV-KSSAKNRDPRLRFINSDASAVDNLSTLINNMSKVE-YSGT 569

Query: 2551 MCSRKHKIVKESVLDGHNLKRQRNGLTSSEVPVSRNEQMASRNDRWF----GSGCSALPQ 2384
              SRK K  +E  LD    KR ++ L ++E  +S   ++ + +  W     G G   + +
Sbjct: 570  TISRKQKAAEEPSLDVTVSKRLKSSLENTEHNMS---EVRTGSGGWLEENTGPGAQLIER 626

Query: 2383 HNNNKTCLAENMRTDFRKLEN--GKFCSGKRQ-DTNSGGNQQLPVFGTNTFVSLPSL-KD 2216
            ++     L +    + +K  N     C+G    +  S  N+Q P+  +N   SLP+L K+
Sbjct: 627  NH-----LMDKFGPEAKKTLNTVSSSCTGSDNFNATSIRNEQAPITASNVLASLPALLKE 681

Query: 2215 ISMNPTMLVNLIMEQQKLVADTQQKPAQSMTL----PSISSVAVGTVPLANDPSSNSAKI 2048
             S+NP MLVN++      +A+ Q+K A S  +    P+ S+ A+GT   A+  SS +  +
Sbjct: 682  ASVNPIMLVNILR-----LAEAQKKSADSAAIMLLHPTSSNPAMGTDSTASIGSSMATGL 736

Query: 2047 VQKS--QIPAQI----VPMDLREDLGKTRMKPRDPRRILH-NNTFKQNGCIGTEQLKTKG 1889
            +Q S   +P           L++D GK RMKPRDPRRILH NNT +++G +G EQ K   
Sbjct: 737  LQSSVGMLPVSSQSTSTAQTLQDDSGKIRMKPRDPRRILHTNNTIQKSGDLGNEQFKAIV 796

Query: 1888 VPSSIIQASKDSLTV-----RQHAEXXXXXXXXXXXXXXXAKQN--NIPGTLFDSQSATS 1730
             P S  Q + D++       R   +                 +N  NI   +  SQ +++
Sbjct: 797  SPVSNNQRTGDNVNAPKLEGRVDNKLVPTQSSAQPDIARQFTRNLKNIADIMSVSQESST 856

Query: 1729 FTTVAQTVSSQPIPRKIDNADVRVVATDSNNQESWTNSTPKEGAAVSFQSQNAWEDVEHL 1550
             T V+Q  SS  +P   D  + + V + S N ++   S  +  A+V+ +SQ+ W DVEHL
Sbjct: 857  HTPVSQNFSSASVPLTSDRGEQKSVVSSSQNLQADMASAHETAASVTSRSQSTWGDVEHL 916

Query: 1549 FEGFDDQQKATIRRERARRKEEQNKLFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILR 1370
            FEG+D+QQKA I+RERARR EEQNK+FAARK           LNSAKFVEVDP+H+ ILR
Sbjct: 917  FEGYDEQQKAAIQRERARRIEEQNKMFAARKLCLVLDLDHTLLNSAKFVEVDPLHDEILR 976

Query: 1369 KKEEQDRQKPQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKV 1190
            KKEEQDR+KP RHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYA EMAKV
Sbjct: 977  KKEEQDREKPHRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKV 1036

Query: 1189 LDPTGSLFGERVISRGDDEDPLDGDERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLN 1010
            LDP G LF  RVISRGDD D +DG+ERVPK KDL+GVLGMESSVVIIDDS RVWPHNKLN
Sbjct: 1037 LDPKGVLFAGRVISRGDDTDSVDGEERVPKSKDLEGVLGMESSVVIIDDSVRVWPHNKLN 1096

Query: 1009 LIALERYIYFPCSRRHFGLSGPSLLEIGHDERLDDGALASSLAVIERIHHNFFLHQSLND 830
            LI +ERY YFPCSRR FGL GPSLLEI HDER + G LASSLAVIE+IH  FF  QSL +
Sbjct: 1097 LIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEAGTLASSLAVIEKIHQIFFASQSLEE 1156

Query: 829  VDVRNILAKEQKNVLAGCRIVFSRVFPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHV 650
            VDVRNILA EQ+ +LAGCRIVFSRVFPVGEA PHLHPLWQTA+QFGAVCTNQIDE VTHV
Sbjct: 1157 VDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHV 1216

Query: 649  VANSLGTDKVNWALSTGRFVVHPCWVEASALLYRRANEFDFAIK 518
            VANS GTDKVNWAL+ GRFVVHP WVEASALLYRRANE DFAIK
Sbjct: 1217 VANSPGTDKVNWALNNGRFVVHPGWVEASALLYRRANEQDFAIK 1260


>ref|XP_007139315.1| hypothetical protein PHAVU_008G019000g [Phaseolus vulgaris]
            gi|561012448|gb|ESW11309.1| hypothetical protein
            PHAVU_008G019000g [Phaseolus vulgaris]
          Length = 1272

 Score =  780 bits (2015), Expect = 0.0
 Identities = 509/1102 (46%), Positives = 644/1102 (58%), Gaps = 64/1102 (5%)
 Frame = -2

Query: 3631 IREALDTATVKDPKESYNEVCSRFQISLESLQLMRLDNCVPVVDALIQQIFMGIQAVNSV 3452
            +R+ L+  TV +  ES+ +  SR   +L  +     D+     D LI+  F  I+ V SV
Sbjct: 193  VRDVLEGVTVANVAESFAQTSSRLLNALPQVFSRPADS---EKDDLIRLSFNAIEVVYSV 249

Query: 3451 FYSMNPEQQEQDKDILLRLLTYVKDQDFA-LFASKQKKEIEAMIRCVESQSISSS---VI 3284
            F SM+   +EQ+K+ +LRLL+  KD+  A LF+ +  KEI+ M+  ++S     S   + 
Sbjct: 250  FRSMDSSDKEQNKNSILRLLSSAKDKKQAQLFSPEHIKEIQDMMTAIDSVGALGSNEAIY 309

Query: 3283 MD--------KKKERGVINGMTXXXXXXXXXNPSGHSL-NSTKKFHLEPVSVRYSDQNDS 3131
            M+        K +E   +   T               L +S K  H + +    +     
Sbjct: 310  METELQTPEIKSQENSALEVQTRGIKIQENQAVVATELVSSIKPLHSDIIGASRA----- 364

Query: 3130 NMEFKTLKSKGYGGV-GSLNLPSDHDGDR----TLDTLQLFPIQNQSDSATCTVSDKTED 2966
             ++F     KG G +   L+L  DHD D     T +    FP+          V   +  
Sbjct: 365  -LKFGQNSIKGRGVLLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGEVMVKSGSAA 423

Query: 2965 AAVLP--------------YETDALRAVSTYQQRFNSSFLLSN-RLPSPTPXXXXXXXXX 2831
            A + P              YETDAL+AVSTYQQ+F  S L +N +LPSPTP         
Sbjct: 424  AKMQPGKLEVDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKLPSPTPSGDCDDMAV 483

Query: 2830 XXXXXXXXXSTASSIKTENPSVPVQPFTYATPIVDCSSEQESIPAKTGQLGSCLNPILKA 2651
                     ST+  + +  P++  QP   AT  VD S     I ++    GS   P+ K+
Sbjct: 484  DTNEEVSSASTSGFLTSTKPTLLDQPPVSATS-VDKSRLLGLISSRVDAAGSGSFPV-KS 541

Query: 2650 MPKSKDPRRALDLKQQS-----LTVDYNVPKREPIVGTMCSRKHKIVKESVLDGHNLKRQ 2486
              KS+DPRR L   + S      TV +N+PK E   G+  SRK K V+E   D    KR 
Sbjct: 542  SAKSRDPRRRLINSEASAVDNQFTVTHNMPKVE-YAGSTISRKQKAVEEPSFDLTVSKRL 600

Query: 2485 RNGLTSSEVPVSRNEQMASRNDRWF----GSGCSALPQHNNNKTCLAENMRTDFRKLENG 2318
            ++ L + E   S    +A     W     G G   + +++     L +    + ++  N 
Sbjct: 601  KSSLENIEHNTSEVRTIAGSGG-WLEDITGPGTQLIEKNH-----LIDKFAPEPKRTLNT 654

Query: 2317 KFCSGKRQ-DTNSGGNQQLPVFGTNTFVSLPSL-KDISMNPTMLVNLIMEQQKLVADTQQ 2144
               SG    +  S  N+Q P+   N   SLP++ KDI +NPTML++L+MEQ++LV D Q 
Sbjct: 655  VSSSGSVNFNATSIRNEQAPITSNNVPSSLPAIFKDIVVNPTMLLSLLMEQKRLV-DAQN 713

Query: 2143 KPAQSMTL---PSISSVAVGTVPLANDPSSNSAKIVQKSQIPAQIVPMD--------LRE 1997
              A S T    P+ S+ A+GT   A+  SS +  +    Q    ++P+         L++
Sbjct: 714  NSADSATNMLHPTSSNSAMGTDSTASIVSSMATGL----QTSVGMLPVSSQSTSTAQLQD 769

Query: 1996 DL-GKTRMKPRDPRRILH-NNTFKQNGCIGTEQLKTKGVPSSIIQASKDSLTV-----RQ 1838
            D  GK RMKPRDPRRILH NN+ +++G I  E  K    P S I  + DS+       R 
Sbjct: 770  DYSGKIRMKPRDPRRILHTNNSVQKSGNIVNELHKAIVSPVSNILVTGDSVNAQKLEGRM 829

Query: 1837 HAEXXXXXXXXXXXXXXXAKQN--NIPGTLFDSQSATSFTTVAQTVSSQPIPRKIDNADV 1664
              +                 +N  NI   +  SQ +++ +  AQ  SS  +P  +D  + 
Sbjct: 830  DTKLVPTQSGAAPDITRQFTRNLKNIADIMSVSQESSTHSPAAQGFSSASVPLNVDRGEQ 889

Query: 1663 RVVATDSNNQESWTNSTPKEGAAVSFQSQNAWEDVEHLFEGFDDQQKATIRRERARRKEE 1484
            + V ++S N  + T S P+  A  + +SQ+ W DVEHLFEG+D+QQKA I+RERARR EE
Sbjct: 890  KSVLSNSQNLHAGTGSAPEICAPGTSRSQSTWGDVEHLFEGYDEQQKAAIQRERARRIEE 949

Query: 1483 QNKLFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRFPHMGM 1304
            QNK+FAARK           LNSAKFVEVDPVHE ILRKKEE DR+KP RHLFRFPHMGM
Sbjct: 950  QNKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHEEILRKKEELDREKPHRHLFRFPHMGM 1009

Query: 1303 WTKLRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRGDDEDPL 1124
            WTKLRPGIWNFLEKASKLYELHLYTMGNKLYA EMAKVLDP G LF  RVISRGDD D +
Sbjct: 1010 WTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDTDSV 1069

Query: 1123 DGDERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRHFGLSGP 944
            DG+ER PK KDL+GVLGMES+VVIIDDS RVWPHNKLNLI +ERY YFPCSRR FGL GP
Sbjct: 1070 DGEERAPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGP 1129

Query: 943  SLLEIGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLAGCRIVF 764
            SLLEI HDER + G LASSLAVIER+H NFF  QSL +VDVRNILA EQ+ +L+GCRIVF
Sbjct: 1130 SLLEIDHDERPEAGTLASSLAVIERLHQNFFSSQSLEEVDVRNILASEQRKILSGCRIVF 1189

Query: 763  SRVFPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGRFVVH 584
            SRVFPVGEA PHLHPLWQTA+QFGAVCTNQID+ VTHVVANSLGTDKVNWALSTGRFVVH
Sbjct: 1190 SRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDDQVTHVVANSLGTDKVNWALSTGRFVVH 1249

Query: 583  PCWVEASALLYRRANEFDFAIK 518
            P WVEASALLYRRANE DFAIK
Sbjct: 1250 PGWVEASALLYRRANEQDFAIK 1271


>ref|XP_006438858.1| hypothetical protein CICLE_v10030535mg [Citrus clementina]
            gi|557541054|gb|ESR52098.1| hypothetical protein
            CICLE_v10030535mg [Citrus clementina]
          Length = 1208

 Score =  780 bits (2015), Expect = 0.0
 Identities = 509/1073 (47%), Positives = 646/1073 (60%), Gaps = 58/1073 (5%)
 Frame = -2

Query: 3664 QLREFEMK---IKSIREALDTATVKDPKESYNEVCSRFQISLESLQLMRLDNCVPVVDAL 3494
            Q++E EMK   ++SIREAL++    D   S+  VCS+ + +LESL+ +  +N VP  DAL
Sbjct: 166  QVKE-EMKLINVESIREALESVLRGDI--SFEGVCSKLEFTLESLRELVNENNVPTKDAL 222

Query: 3493 IQQIFMGIQAVNSVFYSMNPEQQEQDKDILLRLLTYVKDQDFALFASKQKKEIEAMIRCV 3314
            IQ  F  +Q+V+SVF SMN   +EQ+K+IL RLL+ +K  +  LF+S Q KE+EAM+  +
Sbjct: 223  IQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSVIKSHEPPLFSSNQIKEMEAMLSSL 282

Query: 3313 ESQSISSSVIMDKKKERGVINGMTXXXXXXXXXNPSGHSLNSTKKFHLEPVSVRYSDQND 3134
             +++       DK+K+   ++G+          N + + LN  +K  L PV        D
Sbjct: 283  VTRA------NDKEKDMLAMHGVNGKDSNIVTEN-AVNDLNFKEKVPL-PV--------D 326

Query: 3133 SNMEFKTLKSK-----GYGGVGSLNLPSD----HDGDR----TLDTLQLFPIQN------ 3011
            S M+ K L++      GY   G L    D    HD D     T +T    P+Q       
Sbjct: 327  SLMQNKPLEASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGD 386

Query: 3010 ---QSDSATCTVSDKTEDAAVLPYETDALRAVSTYQQRFN-SSFLLSNRLPSPTPXXXXX 2843
               +S +A   +S   E      YETDALRA S+YQQ+F  +SF +++ LPSPTP     
Sbjct: 387  GVVKSWAAAAKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESG 446

Query: 2842 XXXXXXXXXXXXXSTASSIKTEN------PSVPVQPFTYATPIVDCSSEQ------ESIP 2699
                         +     K  N        V  QP   + P+ D SS Q       S P
Sbjct: 447  DGDGDTGGEISSATAVDQPKPVNMPTLGQQPVSSQPMDISQPM-DISSVQALTTANNSAP 505

Query: 2698 AKTGQLGSCL-NPILKAMPKSKDPR------RALDLKQQSLTVDYNVPKREPIVGTMCSR 2540
            A +G       NP++KA  KS+DPR       AL+L  Q   + +N PK EP+   M SR
Sbjct: 506  ASSGYNPVVKPNPVVKAPIKSRDPRLRFASSNALNLNHQPAPILHNAPKVEPVGRVMSSR 565

Query: 2539 KHKIVKESVLDGHNLKRQRNGLTSSEVPVSRNEQMASRNDRWFGSGCSALPQHNNNKTCL 2360
            K K V+E VLDG  LKRQRNG  +S V   R+E+    +  W        PQ  N +  L
Sbjct: 566  KQKTVEEPVLDGPALKRQRNGFENSGVV--RDEKNIYGSGGWLEDTDMFEPQIMN-RNLL 622

Query: 2359 AENMRTDFRKLENGK---FCSGKRQDTNSGGNQQLPVFGTNTFVSLPSL-KDISMNPTML 2192
             ++  ++ RKL+NG      SG      SG N+  P    +T VSLP+L KDI++NPTML
Sbjct: 623  VDSAESNSRKLDNGATSPITSGTPNVVVSG-NEPAPATTPSTTVSLPALLKDIAVNPTML 681

Query: 2191 VNLIM--EQQKLVADTQQKPAQSMTLPSISSVAVGTVPLANDPSSNSAKIVQKSQIPAQI 2018
            +N++   +QQKL AD QQK   S ++ ++      ++P  +   S  + I+ K       
Sbjct: 682  LNILKMGQQQKLAADAQQKSNDS-SMNTMHPPIPSSIPPVSVTCSIPSGILSK------- 733

Query: 2017 VPMDLREDLGKTRMKPRDPRRILHNNTFKQNGCIGTEQLKTKGVPSSIIQASKDSLTVRQ 1838
             PMD   +LGK RMKPRDPRR+LH N  +++G +G E  KT G  +   Q SK++L  ++
Sbjct: 734  -PMD---ELGKVRMKPRDPRRVLHGNALQRSGSLGPE-FKTDGPSAPCTQGSKENLNFQK 788

Query: 1837 H-----AEXXXXXXXXXXXXXXXAKQN--NIPGTLFDSQSATSFTTVAQTVSSQPIPRKI 1679
                  A+                 +N  +I   +  SQ  TS   V+Q    QP   K 
Sbjct: 789  QLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIK- 847

Query: 1678 DNADVRVVATDSNNQESWTNSTPKEGAAVSFQSQNAWEDVEHLFEGFDDQQKATIRRERA 1499
              AD++ V T+ +++++ T S P+ G  V    Q+AW DVEHLFEG+DDQQKA I++ER 
Sbjct: 848  SGADMKAVVTNHDDKQTGTGSGPEAGP-VGAHPQSAWGDVEHLFEGYDDQQKAAIQKERT 906

Query: 1498 RRKEEQNKLFAARKXXXXXXXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRF 1319
            RR EEQ K+F+ARK           LNSAKF EVDPVH+ ILRKKEEQDR+KP RHLFRF
Sbjct: 907  RRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF 966

Query: 1318 PHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRGD 1139
            PHMGMWTKLRPGIW FLE+ASKL+E+HLYTMGNKLYA EMAKVLDP G LF  RVISRGD
Sbjct: 967  PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026

Query: 1138 DEDPLDGDERVPKIKDLDGVLGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRHF 959
            D DP DGDERVPK KDL+GVLGMES+VVIIDDS RVWPHNKLNLI +ERY YFPCSRR F
Sbjct: 1027 DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQF 1086

Query: 958  GLSGPSLLEIGHDERLDDGALASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLAG 779
            GL GPSLLEI HDER +DG LASSL VIER+H  FF HQSL+DVDVRNILA EQ+ +LAG
Sbjct: 1087 GLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAG 1146

Query: 778  CRIVFSRVFPVGEAKPHLHPLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKV 620
            CRIVFSRVFPVGEA PHLHPLWQTA+QFGAVCT  ID+ VTHVVANSLGTDKV
Sbjct: 1147 CRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKV 1199


>ref|XP_006341905.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Solanum tuberosum]
          Length = 1218

 Score =  780 bits (2013), Expect = 0.0
 Identities = 490/1089 (44%), Positives = 636/1089 (58%), Gaps = 43/1089 (3%)
 Frame = -2

Query: 3652 FEMKIKSIREALDTATVKDPKESYNEVCSRFQISLESLQLMRLDNCVPVVDALIQQIFMG 3473
            F  +   +RE L + T+ +  +S++ VCS+ Q SL +L  + L       D LIQ     
Sbjct: 148  FGKEANFVREQLQSVTLDETHKSFSMVCSKLQTSLLALGELALSQ--DKNDILIQLFMTA 205

Query: 3472 IQAVNSVFYSMNPEQQEQDKDILLRLLTYVKDQDFALFASKQKKEIEAMIRCVESQSISS 3293
            ++ +NSVFYSMN +Q++Q+ DIL RLL + K Q  AL +S+Q KE++A+I  +   ++ S
Sbjct: 206  LRTINSVFYSMNQDQKQQNTDILSRLLFHAKTQLPALLSSEQLKEVDAVILSINQSAVFS 265

Query: 3292 SVIMDKKKERGV-INGMTXXXXXXXXXNPSGHSLNSTKKFHLEPVSVRYSDQNDSNMEFK 3116
            +   D  K  G+ +  +            +     +  K+ L  VS++ S   + ++ F+
Sbjct: 266  NT-QDNDKVNGIKVVELLDKKVSHKSSENANQDFTAVNKYDLGAVSIKSSGLKEQSVSFE 324

Query: 3115 TLK-----SKGYG-GVGSLNLPSDHDGDR----TLDTLQLFPIQNQSDSATCTVSDKTED 2966
            ++K     SK  G  +  L+L  DHD D     T +    FP+   + +      D    
Sbjct: 325  SVKPGLANSKAKGLSIPLLDLHKDHDEDTLPSPTREIGPQFPVAKATQAHGMVKLDLPIF 384

Query: 2965 AAVL--------PYETDALRAVSTYQQRFN-SSFLLSNRLPSPTPXXXXXXXXXXXXXXX 2813
            A  L        PYETDAL+AVS+YQQ+F  SS  +S  LPSPTP               
Sbjct: 385  AGSLEKGNSLLHPYETDALKAVSSYQQKFGRSSLFVSENLPSPTPSEEGDSGKGDIGGEV 444

Query: 2812 XXXSTASSIKTENPSVPVQPFTYATPIVDCSSEQESIPAKTGQLGSCL-NPILKAMP-KS 2639
                   +    N S   QP   + P  +    Q    A+T    S L NP L++   KS
Sbjct: 445  TSLDVVHNASHLNESSMGQPILSSVPQTNILDGQGLGTARTADPLSFLPNPSLRSSTAKS 504

Query: 2638 KDPRRAL--------DLKQQSLTVDYNVPKREPIVGTMCSRKHKIVKESVLDGHNLKRQR 2483
            +DPR  L        +  +  L +     K E  +  + S+K K V   V      KRQR
Sbjct: 505  RDPRLRLATSDAVAQNTNKNILPIPDIDLKLEASLEMIGSKKQKTVDLPVFGAPLPKRQR 564

Query: 2482 NGLTSSEVPVSRNEQMASRNDRWFGS-GCSALPQHNNNKTCLAENMRTDFRKLENGKFCS 2306
            +  T S +    + + ++ N  W    G + LP  ++N  C  ++   D RKLE      
Sbjct: 565  SEQTDSIIV--SDVRPSTGNGGWLEDRGTAGLPITSSN--CATDSSDNDIRKLEQVTATI 620

Query: 2305 GKRQDTNSGGNQQLPVFGTNTFVSLPSL-KDISMNPTMLVNLI-MEQQKLVADTQQKPAQ 2132
                       +  PV G +T  +L SL KDI++NP++ +N+I MEQQK    ++   AQ
Sbjct: 621  ATIPSVIVNAAENFPVTGISTSTTLHSLLKDIAINPSIWMNIIKMEQQKSADASRTTTAQ 680

Query: 2131 SMTLPSISSVAVGTVPLANDPSSNSAKIVQKS----QIPAQIVPMDLREDLGKTRMKPRD 1964
            + +  SI    +G VP  +  +  S+ I Q+S    Q P      D   ++   RMKPRD
Sbjct: 681  ASSSKSI----LGAVPSTDAIAPRSSAIGQRSVGILQTPTHTASAD---EVAIVRMKPRD 733

Query: 1963 PRRILHNNTFKQNGCIGTEQLKT--KGVPSSI----IQASKDSLTVRQHAEXXXXXXXXX 1802
            PRR+LHN    + G +G++Q KT   G  ++I     Q+ +D L  +             
Sbjct: 734  PRRVLHNTAVLKGGNVGSDQCKTGVAGTHATISNLGFQSQEDQLDRKSAVTLSTTPPDIA 793

Query: 1801 XXXXXXAKQNNIPGTLFDSQSATSFTTVAQTVSSQPIPRKIDNADVRVVATDSNNQESWT 1622
                   K  NI   +  S S TS +  +QT  +Q +      ++ +   ++ + + +  
Sbjct: 794  RQFTKNLK--NIADMISVSPS-TSLSAASQT-QTQCLQSHQSRSEGKEAVSEPSERVNDA 849

Query: 1621 NSTPKEGAAVSFQSQNAWEDVEHLFEGFDDQQKATIRRERARRKEEQNKLFAARKXXXXX 1442
                ++G+  S Q Q +W DVEHLFEG+ DQQ+A I+RERARR EEQ K+F+ RK     
Sbjct: 850  GLASEKGSPGSLQPQISWGDVEHLFEGYSDQQRADIQRERARRLEEQKKMFSVRKLCLVL 909

Query: 1441 XXXXXXLNSAKFVEVDPVHEMILRKKEEQDRQKPQRHLFRFPHMGMWTKLRPGIWNFLEK 1262
                  LNSAKFVE+DPVHE ILRKKEEQDR+KP RHLFRFPHMGMWTKLRPGIWNFLEK
Sbjct: 910  DLDHTLLNSAKFVEIDPVHEEILRKKEEQDREKPCRHLFRFPHMGMWTKLRPGIWNFLEK 969

Query: 1261 ASKLYELHLYTMGNKLYAREMAKVLDPTGSLFGERVISRGDDEDPLDGDERVPKIKDLDG 1082
            AS L+ELHLYTMGNKLYA EMAK+LDP G LF  RVISRGDD DP DGDERVPK KDL+G
Sbjct: 970  ASNLFELHLYTMGNKLYATEMAKLLDPKGDLFAGRVISRGDDGDPFDGDERVPKSKDLEG 1029

Query: 1081 VLGMESSVVIIDDSARVWPHNKLNLIALERYIYFPCSRRHFGLSGPSLLEIGHDERLDDG 902
            VLGMES+VVIIDDS RVWPHNKLNLI +ERYIYFPCSRR FGL GPSLLEI HDER +DG
Sbjct: 1030 VLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEDG 1089

Query: 901  ALASSLAVIERIHHNFFLHQSLNDVDVRNILAKEQKNVLAGCRIVFSRVFPVGEAKPHLH 722
             LAS L VI+RIH NFF H+S+++ DVRNILA EQK +LAGCRIVFSRVFPVGEA PHLH
Sbjct: 1090 TLASCLGVIQRIHQNFFAHRSIDEADVRNILATEQKKILAGCRIVFSRVFPVGEANPHLH 1149

Query: 721  PLWQTAQQFGAVCTNQIDEHVTHVVANSLGTDKVNWALSTGRFVVHPCWVEASALLYRRA 542
            PLWQTA+QFGAVCT+QID+ VTHVVANSLGTDKVNWALSTGRFVVHP WVEASALLYRRA
Sbjct: 1150 PLWQTAEQFGAVCTSQIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRA 1209

Query: 541  NEFDFAIKT 515
            NE DFAIK+
Sbjct: 1210 NEHDFAIKS 1218

