BLASTX nr result

ID: Magnolia22_contig00004154 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Magnolia22_contig00004154
         (4184 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_010241993.1 PREDICTED: RNA polymerase II C-terminal domain ph...  1337   0.0  
XP_010918441.1 PREDICTED: RNA polymerase II C-terminal domain ph...  1276   0.0  
ONI10364.1 hypothetical protein PRUPE_4G043500 [Prunus persica]      1250   0.0  
XP_008225045.1 PREDICTED: RNA polymerase II C-terminal domain ph...  1249   0.0  
XP_010932999.1 PREDICTED: RNA polymerase II C-terminal domain ph...  1246   0.0  
XP_008809393.1 PREDICTED: RNA polymerase II C-terminal domain ph...  1243   0.0  
XP_002267987.3 PREDICTED: RNA polymerase II C-terminal domain ph...  1241   0.0  
XP_008809392.1 PREDICTED: RNA polymerase II C-terminal domain ph...  1239   0.0  
XP_018856749.1 PREDICTED: RNA polymerase II C-terminal domain ph...  1236   0.0  
XP_010918443.1 PREDICTED: RNA polymerase II C-terminal domain ph...  1231   0.0  
XP_007025680.2 PREDICTED: RNA polymerase II C-terminal domain ph...  1229   0.0  
EOY28302.1 C-terminal domain phosphatase-like 1 isoform 1 [Theob...  1228   0.0  
GAV74436.1 dsrm domain-containing protein/NIF domain-containing ...  1227   0.0  
JAT50607.1 RNA polymerase II C-terminal domain phosphatase-like ...  1223   0.0  
XP_008371347.1 PREDICTED: RNA polymerase II C-terminal domain ph...  1222   0.0  
XP_002305017.2 hypothetical protein POPTR_0004s04010g [Populus t...  1221   0.0  
XP_012091568.1 PREDICTED: RNA polymerase II C-terminal domain ph...  1219   0.0  
KDP20941.1 hypothetical protein JCGZ_21412 [Jatropha curcas]         1219   0.0  
XP_017696775.1 PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II...  1218   0.0  
XP_011027882.1 PREDICTED: RNA polymerase II C-terminal domain ph...  1218   0.0  

>XP_010241993.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            [Nelumbo nucifera]
          Length = 948

 Score = 1337 bits (3461), Expect = 0.0
 Identities = 683/960 (71%), Positives = 775/960 (80%), Gaps = 5/960 (0%)
 Frame = +3

Query: 420  MFKSAFYHGNSLLGEVEIYLKNPNIDIPNKEIRISHFSPPSERCPPLAVLHTIAAGGACF 599
            MFKS  Y GNS LGEVEI+ +N  ID+ NKE RISHFS PSERCPPLAVLHTIA  G C 
Sbjct: 1    MFKSVVYQGNSPLGEVEIFPQNQEIDMTNKEFRISHFSQPSERCPPLAVLHTIAPCGVCL 60

Query: 600  KMXXXXXXXXXXXXXXXXXXXXHTACLRENKTAIMQLGEEELHLVAMSSRKNFEQYACFW 779
            KM                    H++CLRENKTA++ LGEEELHLVAM +RK  EQ  CFW
Sbjct: 61   KMESKSQSGDSPLFSL------HSSCLRENKTAVVPLGEEELHLVAMPTRKIGEQCLCFW 114

Query: 780  GFNVMPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRI 959
            GFNV PGLYNSCLVMLNLRCLGIVFDLDETL+VANTMRSFEDRIDALQRKI++EVDPQRI
Sbjct: 115  GFNVAPGLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIDALQRKISTEVDPQRI 174

Query: 960  SGMLAEVKRYQDDKVILKQYAENDQVVENGKVIKVQSEIVPPLSDNHQPIIRPLIRLQEK 1139
            +GM+AEVKRYQDDK+ILKQYAENDQV++NGKVIKVQSEIVP LSDNHQPI+RPLIRLQE+
Sbjct: 175  AGMIAEVKRYQDDKIILKQYAENDQVIDNGKVIKVQSEIVPALSDNHQPIVRPLIRLQER 234

Query: 1140 NIILTRINPVIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVYVCTMAERDYALEMWRLLD 1319
            NIILTRINP IRDTSVLVRLRPAWEDLRSYL ARGRKRFEVYVCTMAERDYALEMWRLLD
Sbjct: 235  NIILTRINPGIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLD 294

Query: 1320 PDSNLINPKELLDRIVCVKAGLKKSLLNVFQDGICHPKMALVIDDRLKVWEDKDQPRVHV 1499
            PDSNLIN KELLDRIVCVKAG +KSLLNVFQ GICHPKMALVIDDRLKVW++KDQPRVHV
Sbjct: 295  PDSNLINTKELLDRIVCVKAGSRKSLLNVFQVGICHPKMALVIDDRLKVWDEKDQPRVHV 354

Query: 1500 VPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDVLLQWISDAFYEDEMIDFP 1679
            VPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFD+VLLQ I + FYED+M  FP
Sbjct: 355  VPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDEVLLQRIPEIFYEDDMAGFP 414

Query: 1680 PAPDVSNYLVSEDDASASNGNKDLLSFEGMTDVDVERRLKDANVQALQVSSAVNNIDPRL 1859
              PDVSNYL+SEDD SASNGNKD L FEG+TDV+VERRLKD    A+  SS VN++DPRL
Sbjct: 415  SPPDVSNYLISEDDTSASNGNKDPLCFEGITDVEVERRLKD----AIPASSLVNSLDPRL 470

Query: 1860 ASSIQHVLASSSSMIP-QTSQGP-ISFQNIQYPQAIPTVKPLGLVGPSEPSLQSSPAREE 2033
               IQH +ASSSS +   TSQGP + F N Q+P      KPL  VGP E SLQSSPAREE
Sbjct: 471  -PLIQHAVASSSSSVSLPTSQGPMMPFPNKQFPHVATLAKPLVQVGPPELSLQSSPAREE 529

Query: 2034 GEVPESELDPDTRRRLLILQHGQDTREQTSNDPPFPVRPALQISMPPVQSHGSYFTMEEE 2213
            GEVPESELDPDTRRRLLILQHGQDTRE TS++PPFPVRP LQ+S+P VQSHGS+F  EEE
Sbjct: 530  GEVPESELDPDTRRRLLILQHGQDTREHTSSEPPFPVRPPLQVSVPAVQSHGSWFPSEEE 589

Query: 2214 MSPRKMNRA--REFPIEPEALHFDKNRSPHPSFYHGLENSLPPNRSLHENQRFPKELRHG 2387
            MSPR++NR   +EFP+EPEA+HFDK+R   P F+ GLE+S+P +RSL+ENQR  KE+   
Sbjct: 590  MSPRQLNRTIPKEFPLEPEAVHFDKHRPRRPPFFQGLESSIPSDRSLNENQRLAKEVHQT 649

Query: 2388 DDRFRPNHPLPNYHS-SGDEMPLGRTSSGNRNLHFESGRVTAQYPETPAGVLQEIAMKCG 2564
            DDR R NH +  +   SG+E+PLGR+SS NR+L FESGR   QYPETPAGV+QEIAMKCG
Sbjct: 650  DDRMRINHSVSGHRPLSGEELPLGRSSSSNRDLQFESGRGNLQYPETPAGVVQEIAMKCG 709

Query: 2565 TKVEFRPALAATTELQFSIEVWFVGEKIGEGIGKTRKEAQRQAAESSLRTLANKYISRVS 2744
            TKVEFR  L A+TELQFS EV+F+GEK+GEGIG+TRKEAQ QAAE+S+R LANKY+S + 
Sbjct: 710  TKVEFRHGLVASTELQFSFEVYFMGEKVGEGIGRTRKEAQHQAAENSIRNLANKYLSHIK 769

Query: 2745 PDLNAVHGDSNKLSHKNDNGFLRDANSFGYPAFPREDPLPIASTSDQSRFLDQRQEGSLK 2924
             D N+ HGD NKLSH N+NG L D NSFG   F +ED L ++++S+ SRF++ R EGS K
Sbjct: 770  SDPNSSHGDGNKLSHGNENGLLNDTNSFGSLPFSKEDSLSLSTSSESSRFVETRLEGSKK 829

Query: 2925 GMAPVSSLKELCMVEGLSLVFHAQPPLSTLSSHKGEVYAEVEIAGQILGKGIGLTWXXXX 3104
             +  +S+LKELC VEGL+L F   PP+S  S+ KGE+YAEVE+AG +LGKGIG +W    
Sbjct: 830  SVGSLSALKELCTVEGLNLAFQ-MPPISANSTQKGEIYAEVEVAGHVLGKGIGSSWDEAK 888

Query: 3105 XXXXXXXXXXXXFMLSQGSQERLGSPRSSHGIPNKRLKTEYPRTLQRVPSSTRYSKNGPP 3284
                         MLSQ +Q+R GSPRS  GI +KRLK E+ R LQR+PSS RY KN PP
Sbjct: 889  IQAADEALGNLKLMLSQNTQKRPGSPRSLQGISSKRLKPEFSRVLQRIPSSGRYPKNTPP 948


>XP_010918441.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X1 [Elaeis guineensis]
          Length = 950

 Score = 1276 bits (3302), Expect = 0.0
 Identities = 662/961 (68%), Positives = 747/961 (77%), Gaps = 4/961 (0%)
 Frame = +3

Query: 420  MFKSAFYHGNSLLGEVEIYLKNPNIDIPNKEIRISHFSPPSERCPPLAVLHTIAAGGACF 599
            MFKSA YHGNSL+GEVEI  +N N     +EIRISHFSPPSERCPPLAVLHTIA+    F
Sbjct: 1    MFKSAVYHGNSLIGEVEISPQNSNPGAWLREIRISHFSPPSERCPPLAVLHTIASASVSF 60

Query: 600  KMXXXXXXXXXXXXXXXXXXXXHTACLRENKTAIMQLGEEELHLVAMSSRKNFEQYACFW 779
            KM                    H ACLR+ KTA++ LGEEELHLVAM  RKN   YACFW
Sbjct: 61   KMESKSPPSDESQLCSL-----HAACLRDQKTAVIPLGEEELHLVAMKPRKNLMHYACFW 115

Query: 780  GFNVMPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRI 959
            GFNV  GLYNSCL MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKI+SE DPQR+
Sbjct: 116  GFNVASGLYNSCLGMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKISSETDPQRV 175

Query: 960  SGMLAEVKRYQDDKVILKQYAENDQVVENGKVIKVQSEIVPPLSDNHQPIIRPLIRLQEK 1139
            +GMLAEVKRYQDDK ILKQYAENDQVVENG V KVQSE+VPPLSDNHQ I RP+IRLQEK
Sbjct: 176  TGMLAEVKRYQDDKSILKQYAENDQVVENGNVFKVQSEVVPPLSDNHQLITRPIIRLQEK 235

Query: 1140 NIILTRINPVIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVYVCTMAERDYALEMWRLLD 1319
            NIILTR+NP IRDTSVLVRLRPAWE+LRSYL ARGRKRFEVYVCTMAE+DYALEMWRLLD
Sbjct: 236  NIILTRVNPSIRDTSVLVRLRPAWEELRSYLTARGRKRFEVYVCTMAEKDYALEMWRLLD 295

Query: 1320 PDSNLINPKELLDRIVCVKAGLKKSLLNVFQDGICHPKMALVIDDRLKVWEDKDQPRVHV 1499
            PDS+LIN  +LLDRIVCVK+G +KSLLNVFQDGICHPKMALVIDDRLKVW+DKDQPRVHV
Sbjct: 296  PDSSLINAMQLLDRIVCVKSGSRKSLLNVFQDGICHPKMALVIDDRLKVWDDKDQPRVHV 355

Query: 1500 VPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDVLLQWISDAFYEDEMIDFP 1679
            VPAFAPYYAPQAEAN  VPVLCVARNVACNVRGGFFK+FD+ LL  ISD FYEDEM DFP
Sbjct: 356  VPAFAPYYAPQAEANGNVPVLCVARNVACNVRGGFFKDFDEGLLPRISDIFYEDEMKDFP 415

Query: 1680 PAPDVSNYLVSEDDASASNGNKDLLSFEGMTDVDVERRLKDANVQALQVSSAVNNIDPRL 1859
             APDV NYL+SEDD + SNG+KDLL  EGMTD +VERRLK+AN     +   VN  DP  
Sbjct: 416  SAPDVGNYLISEDDNATSNGSKDLLCSEGMTDAEVERRLKEANGNVQAIYPMVNTFDPSS 475

Query: 1860 ASSIQHVLASSSSMIPQ--TSQGPISFQNIQYPQAIPTVKPLGLVGPSEPSLQSSPAREE 2033
             SSIQHV+ASSS  +P    +Q  +   N Q PQ I   +PLG  G  EPSLQ SPAREE
Sbjct: 476  MSSIQHVMASSSG-VPSLAATQVMMPLPNNQCPQPIALGRPLGQPGLPEPSLQGSPAREE 534

Query: 2034 GEVPESELDPDTRRRLLILQHGQDTREQTSNDPPFPVRPALQISMPPVQSHGSYFTMEEE 2213
            GEVPESELDPDTRRRLLILQHGQD R+ T   P FPVRP L +++ PVQS GS+F +EEE
Sbjct: 535  GEVPESELDPDTRRRLLILQHGQDIRDPT---PQFPVRPPLHVAVSPVQSRGSWFPLEEE 591

Query: 2214 MSPRKMNRA-REFPIEPEALHFDKNRSPHPSFYHGLENSLPPNRSLHENQRFPKELRHGD 2390
            M+PR+++RA +EF +EPE + FDK R  H S+Y   ENS+  +R L+EN+R   +LRHGD
Sbjct: 592  MNPRQLSRAPKEFSLEPETVCFDKKRPNHQSYYRTGENSISSDRVLNENRRLAMQLRHGD 651

Query: 2391 DRFRPNHPLPNYHS-SGDEMPLGRTSSGNRNLHFESGRVTAQYPETPAGVLQEIAMKCGT 2567
            DR RPNH   N  S SG+EMP+GR SS +R++ FESG+VT QY  TPAGVLQ+IA KCG 
Sbjct: 652  DRLRPNHAAANCDSFSGEEMPIGRISSSHRDIQFESGQVTVQYAGTPAGVLQDIATKCGA 711

Query: 2568 KVEFRPALAATTELQFSIEVWFVGEKIGEGIGKTRKEAQRQAAESSLRTLANKYISRVSP 2747
            KVEFR AL  TTELQFS+EVWFVGEKIGEGIGKTRKEAQ+QAAE SLRTLANKY+S  + 
Sbjct: 712  KVEFRTALCDTTELQFSVEVWFVGEKIGEGIGKTRKEAQQQAAEFSLRTLANKYLSNATS 771

Query: 2748 DLNAVHGDSNKLSHKNDNGFLRDANSFGYPAFPREDPLPIASTSDQSRFLDQRQEGSLKG 2927
            D   + GD  K S+  +NGF+ D NSFGYPA+ R+D L +ASTS++SRFLD R EGS K 
Sbjct: 772  D--TLRGDMLKPSNAKENGFISDPNSFGYPAYVRDDLLGVASTSEESRFLDLRLEGSKKS 829

Query: 2928 MAPVSSLKELCMVEGLSLVFHAQPPLSTLSSHKGEVYAEVEIAGQILGKGIGLTWXXXXX 3107
             A V++LKELC +EG +L+F  QP  ST S  KGEVYA+VE+AGQILGKG+G TW     
Sbjct: 830  TASVAALKELCTIEGFNLIFQPQPSASTDSVGKGEVYAQVEVAGQILGKGVGTTWEEAKL 889

Query: 3108 XXXXXXXXXXXFMLSQGSQERLGSPRSSHGIPNKRLKTEYPRTLQRVPSSTRYSKNGPPV 3287
                        ML Q +Q+R GSPRS    PNKRLK ++ R LQR+PSS RYSKN  PV
Sbjct: 890  QAAEEALGTLKSMLGQFTQKRSGSPRSVSAAPNKRLKPDFSRMLQRIPSSGRYSKNETPV 949

Query: 3288 P 3290
            P
Sbjct: 950  P 950


>ONI10364.1 hypothetical protein PRUPE_4G043500 [Prunus persica]
          Length = 958

 Score = 1250 bits (3235), Expect = 0.0
 Identities = 647/967 (66%), Positives = 747/967 (77%), Gaps = 10/967 (1%)
 Frame = +3

Query: 420  MFKSAFYHGNSLLGEVEIYL-------KNPNIDIPNKEIRISHFSPPSERCPPLAVLHTI 578
            M+KS  Y G  LLGEVEIY        KN N+    KEIRIS+FS  SERCPP+AVLHTI
Sbjct: 1    MYKSVVYKGEELLGEVEIYPEENENKNKNKNLVDELKEIRISYFSQSSERCPPVAVLHTI 60

Query: 579  AAGGACFKMXXXXXXXXXXXXXXXXXXXXHTACLRENKTAIMQLGEEELHLVAMSSRKNF 758
            ++ G CFKM                    H++C+ ENKTA+M LG EELHLVAM SR   
Sbjct: 61   SSHGVCFKMESKTSQSQDTPLFLL-----HSSCVMENKTAVMPLGGEELHLVAMRSRNGD 115

Query: 759  EQYACFWGFNVMPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINS 938
            ++Y CFWGF+V PGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRI+ALQRKI+S
Sbjct: 116  KRYPCFWGFSVAPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKISS 175

Query: 939  EVDPQRISGMLAEVKRYQDDKVILKQYAENDQVVENGKVIKVQSEIVPPLSDNHQPIIRP 1118
            EVDPQRISGMLAE+KRYQDDK ILKQYAENDQVVENG+VIK QSE VP LSDNHQPIIRP
Sbjct: 176  EVDPQRISGMLAEIKRYQDDKFILKQYAENDQVVENGRVIKTQSEAVPALSDNHQPIIRP 235

Query: 1119 LIRLQEKNIILTRINPVIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVYVCTMAERDYAL 1298
            LIRL EKNIILTRINP IRDTSVLVRLRPAWEDLRSYL ARGRKRFEVYVCTMAERDYAL
Sbjct: 236  LIRLHEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYAL 295

Query: 1299 EMWRLLDPDSNLINPKELLDRIVCVKAGLKKSLLNVFQDGICHPKMALVIDDRLKVWEDK 1478
            EMWRLLDPDSNLIN  +LLDRIVCVK+G +KSL NVFQ+ +CHPKMALVIDDRLKVW+D+
Sbjct: 296  EMWRLLDPDSNLINSNKLLDRIVCVKSGSRKSLFNVFQESLCHPKMALVIDDRLKVWDDR 355

Query: 1479 DQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDVLLQWISDAFYE 1658
            DQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFF+EFDD LLQ I + FYE
Sbjct: 356  DQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFREFDDSLLQKIPEVFYE 415

Query: 1659 DEMIDFPPAPDVSNYLVSEDDASASNGNKDLLSFEGMTDVDVERRLKDANVQALQVSSAV 1838
            D++ D  P+PDVSNYLVSEDD+SA NGN+D L F+G+TDV+VERR+K+A   A  VSS  
Sbjct: 416  DDIKDV-PSPDVSNYLVSEDDSSALNGNRDPLPFDGITDVEVERRMKEATPAASMVSSVF 474

Query: 1839 NNIDPRLASSIQHVLASSSSMIPQTSQGPISFQNIQYPQAIPTVKPLGLVGPSEPSLQSS 2018
             +IDPRLA     V  SS+  +P T    +SF +IQ+PQA   VKPLG VG +EPSLQSS
Sbjct: 475  TSIDPRLAPLQYTVPPSSTLSLPTTQPSVMSFPSIQFPQAASLVKPLGHVGSAEPSLQSS 534

Query: 2019 PAREEGEVPESELDPDTRRRLLILQHGQDTREQTSNDPPFPVRPALQISMPPVQSHGSYF 2198
            PAREEGEVPESELDPDTRRRLLILQHGQDTR+Q  ++PPFPVRP +Q S+P  QS   +F
Sbjct: 535  PAREEGEVPESELDPDTRRRLLILQHGQDTRDQPPSEPPFPVRPPMQASVPRAQSRPGWF 594

Query: 2199 TMEEEMSPRKMNR--AREFPIEPEALHFDKNRSPHPSFYHGLENSLPPNRSLHENQRFPK 2372
             +EEEMSPR+++R   ++ P++PE +  +K+R  H SF+  +ENS+P +R L ENQR PK
Sbjct: 595  PVEEEMSPRQLSRMVPKDLPLDPETVQIEKHRPHHSSFFPKVENSIPSDRILQENQRLPK 654

Query: 2373 ELRHGDDRFRPNHPLPNYHS-SGDEMPLGRTSSGNRNLHFESGRVTAQYPETPAGVLQEI 2549
            E  H DDR R NH L  YHS SG+E+PL R+SS NR++ FESGR  +   ETPAGVLQEI
Sbjct: 655  EAFHRDDRLRFNHALSGYHSLSGEEIPLSRSSSSNRDVDFESGRAISN-AETPAGVLQEI 713

Query: 2550 AMKCGTKVEFRPALAATTELQFSIEVWFVGEKIGEGIGKTRKEAQRQAAESSLRTLANKY 2729
            AMKCG KVEFRPAL A TELQF +E WF GEKIGEG GKTR+EA  QAAE SL+ LAN Y
Sbjct: 714  AMKCGAKVEFRPALVAGTELQFYVEAWFAGEKIGEGSGKTRREAHYQAAEGSLKNLANIY 773

Query: 2730 ISRVSPDLNAVHGDSNKLSHKNDNGFLRDANSFGYPAFPREDPLPIASTSDQSRFLDQRQ 2909
            +SRV PD  +VHGD NK  + N NGF  + NSFG   FP+E+ L  +++S+ SR LD R 
Sbjct: 774  LSRVKPDSVSVHGDMNKFPNVNSNGFAGNLNSFGIQPFPKEESLSSSTSSEPSRPLDPRL 833

Query: 2910 EGSLKGMAPVSSLKELCMVEGLSLVFHAQPPLSTLSSHKGEVYAEVEIAGQILGKGIGLT 3089
            EGS K M+ VS+LKELCM+EGL +VF  +PP ST S  K EV+ +VEI G++LGKGIGLT
Sbjct: 834  EGSKKSMSSVSTLKELCMMEGLGVVFQPRPPPSTNSVEKDEVHVQVEIDGEVLGKGIGLT 893

Query: 3090 WXXXXXXXXXXXXXXXXFMLSQGSQERLGSPRSSHGIPNKRLKTEYPRTLQRVPSSTRYS 3269
            W                  L   +Q+R GSPRS  G+ +KR+K E+P+ LQR+PSS RY 
Sbjct: 894  WDEAKMQAAEKALGSLTSTLY--AQKRQGSPRSLQGMSSKRMKQEFPQVLQRMPSSARYP 951

Query: 3270 KNGPPVP 3290
            KN PPVP
Sbjct: 952  KNAPPVP 958


>XP_008225045.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            [Prunus mume]
          Length = 959

 Score = 1249 bits (3233), Expect = 0.0
 Identities = 649/969 (66%), Positives = 753/969 (77%), Gaps = 12/969 (1%)
 Frame = +3

Query: 420  MFKSAFYHGNSLLGEVEIYL-------KNPNIDIPNKEIRISHFSPPSERCPPLAVLHTI 578
            M+KS  Y G  LLGEVEIY        KN N+    KEIRIS+FS  SERCPP+AVLHTI
Sbjct: 1    MYKSVVYKGEELLGEVEIYPEENENKNKNKNLVDELKEIRISYFSQSSERCPPVAVLHTI 60

Query: 579  AAGGACFKMXXXXXXXXXXXXXXXXXXXXHTACLRENKTAIMQLGEEELHLVAMSSRKNF 758
            ++ G CFKM                    H++C+ ENKTA+M LG EELHLVAM SR + 
Sbjct: 61   SSHGVCFKMESKTSQSQDTPLFLL-----HSSCVMENKTAVMPLGGEELHLVAMHSRNSD 115

Query: 759  EQYACFWGFNVMPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINS 938
            ++Y CFWGF+V PGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRI+ALQRKI+S
Sbjct: 116  KRYPCFWGFSVAPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKISS 175

Query: 939  EVDPQRISGMLAEVKRYQDDKVILKQYAENDQVVENGKVIKVQSEIVPPLSDNHQPIIRP 1118
            EVD QRISGMLAE+KRYQDDK ILKQYAENDQVVENG+VIK QSE VP LSDNHQPIIRP
Sbjct: 176  EVDSQRISGMLAEIKRYQDDKFILKQYAENDQVVENGRVIKTQSEAVPALSDNHQPIIRP 235

Query: 1119 LIRLQEKNIILTRINPVIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVYVCTMAERDYAL 1298
            LIRL EKNIILTRINP IRDTSVLVRLRPAWEDLRSYL ARGRKRFEVYVCTMAERDYAL
Sbjct: 236  LIRLLEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYAL 295

Query: 1299 EMWRLLDPDSNLINPKELLDRIVCVKAGLKKSLLNVFQDGICHPKMALVIDDRLKVWEDK 1478
            EMWRLLDPDSNLIN  +LLDRIVCVK+G +KSL NVFQ+ +CHPKMALVIDDRLKVW+D+
Sbjct: 296  EMWRLLDPDSNLINSNKLLDRIVCVKSGSRKSLFNVFQESLCHPKMALVIDDRLKVWDDR 355

Query: 1479 DQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDVLLQWISDAFYE 1658
            DQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFF+EFDD LLQ I + FYE
Sbjct: 356  DQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFREFDDSLLQKIPEVFYE 415

Query: 1659 DEMIDFPPAPDVSNYLVSEDDASASNGNKDLLSFEGMTDVDVERRLKDANVQALQVSSAV 1838
            D++ D  P+PDVSNYLVSEDD+SA NGN+D L F+G+TDV+VERR+K+A   A  VSS V
Sbjct: 416  DDIKDV-PSPDVSNYLVSEDDSSALNGNRDPLPFDGITDVEVERRMKEATSAASMVSSVV 474

Query: 1839 NNIDPRLASSIQHVLASSSS--MIPQTSQGPISFQNIQYPQAIPTVKPLGLVGPSEPSLQ 2012
             +IDPRLA S+Q+ +A SSS   +P T    +SF +IQ+PQA   VKPLG VG +EPSLQ
Sbjct: 475  TSIDPRLA-SLQYTVAPSSSTLSLPTTQPSVMSFPSIQFPQAASLVKPLGHVGSTEPSLQ 533

Query: 2013 SSPAREEGEVPESELDPDTRRRLLILQHGQDTREQTSNDPPFPVRPALQISMPPVQSHGS 2192
            SSPAREEGEVPESELDPDTRRRLLILQHGQDTR+Q  ++PPFPVRP +Q S+P  QS   
Sbjct: 534  SSPAREEGEVPESELDPDTRRRLLILQHGQDTRDQPPSEPPFPVRPPMQASVPRAQSRPG 593

Query: 2193 YFTMEEEMSPRKMNR--AREFPIEPEALHFDKNRSPHPSFYHGLENSLPPNRSLHENQRF 2366
            +F +EEEMSPR+++R   ++ P++PE +  +K+R  H SF+  +ENS+P +R L ENQR 
Sbjct: 594  WFPVEEEMSPRQLSRMVPKDLPLDPEPVQIEKHRPHHSSFFPKVENSIPSDRILQENQRL 653

Query: 2367 PKELRHGDDRFRPNHPLPNYHS-SGDEMPLGRTSSGNRNLHFESGRVTAQYPETPAGVLQ 2543
            PKE  H DDR R NH L  YHS SG+E+PL R+SS NR++ FESGR  +   ETPAGVLQ
Sbjct: 654  PKEAFHRDDRLRFNHALSGYHSLSGEEIPLSRSSSSNRDVDFESGRAISN-AETPAGVLQ 712

Query: 2544 EIAMKCGTKVEFRPALAATTELQFSIEVWFVGEKIGEGIGKTRKEAQRQAAESSLRTLAN 2723
            EIAMKCG KVEFRPAL A+ ELQF +E WF GEKIGEG GKTR+EA  QAAE SL+ LAN
Sbjct: 713  EIAMKCGAKVEFRPALVASMELQFYVEAWFAGEKIGEGSGKTRREAHYQAAEGSLKNLAN 772

Query: 2724 KYISRVSPDLNAVHGDSNKLSHKNDNGFLRDANSFGYPAFPREDPLPIASTSDQSRFLDQ 2903
             Y+SRV PD  +VHGD NK  + N NGF  + NSFG   FP+E+ L  +++S+ SR LD 
Sbjct: 773  IYLSRVKPDSVSVHGDMNKFPNVNSNGFAGNLNSFGIQPFPKEESLSSSTSSEPSRPLDP 832

Query: 2904 RQEGSLKGMAPVSSLKELCMVEGLSLVFHAQPPLSTLSSHKGEVYAEVEIAGQILGKGIG 3083
            R EGS K M+ VS+LKELCM+EGL +VF  +PP ST S  K EV+ +VEI G++LGKGIG
Sbjct: 833  RLEGSKKSMSSVSTLKELCMMEGLGVVFQPRPPPSTNSVEKDEVHVQVEIDGEVLGKGIG 892

Query: 3084 LTWXXXXXXXXXXXXXXXXFMLSQGSQERLGSPRSSHGIPNKRLKTEYPRTLQRVPSSTR 3263
            LTW                  L   +Q+R GSPRS  G+ +KR+K E+P+ LQR+PSS R
Sbjct: 893  LTWDEAKMQAAEKALGSLTSTLY--AQKRQGSPRSLQGMSSKRMKQEFPQVLQRMPSSAR 950

Query: 3264 YSKNGPPVP 3290
            Y KN PPVP
Sbjct: 951  YPKNAPPVP 959


>XP_010932999.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X1 [Elaeis guineensis]
          Length = 954

 Score = 1246 bits (3224), Expect = 0.0
 Identities = 646/961 (67%), Positives = 737/961 (76%), Gaps = 4/961 (0%)
 Frame = +3

Query: 420  MFKSAFYHGNSLLGEVEIYLKNPNIDIPNKEIRISHFSPPSERCPPLAVLHTIAAGGACF 599
            MFKSA YHGNSL+GE EI+ +N N     +EIRISHFSP SERCPPLAVLHTIA+GG  F
Sbjct: 1    MFKSAVYHGNSLIGEAEIFPQNSNPGAWVREIRISHFSPSSERCPPLAVLHTIASGGVSF 60

Query: 600  KMXXXXXXXXXXXXXXXXXXXXHTACLRENKTAIMQLGEEELHLVAMSSRKNFEQYACFW 779
            KM                    H ACLRENKTA++ LGEEELHLVAM+SRKN  QYACFW
Sbjct: 61   KMESKSAPSDESPLCSL-----HAACLRENKTAVIPLGEEELHLVAMNSRKNLMQYACFW 115

Query: 780  GFNVMPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRI 959
            GFNV  GLYNSCL MLNLRCLGIVFDLDETLIVANT+RSFEDRIDALQRKI++E DPQR+
Sbjct: 116  GFNVASGLYNSCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIDALQRKISTETDPQRV 175

Query: 960  SGMLAEVKRYQDDKVILKQYAENDQVVENGKVIKVQSEIVPPLSDNHQPIIRPLIRLQEK 1139
            +GMLAE+KRYQDDK ILKQYAE DQVVENGKV +VQSE+VPPLSD+H  I RP++RLQEK
Sbjct: 176  TGMLAELKRYQDDKSILKQYAEIDQVVENGKVYQVQSEVVPPLSDSHHLITRPVLRLQEK 235

Query: 1140 NIILTRINPVIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVYVCTMAERDYALEMWRLLD 1319
            NIILTR+NP IRDTSVLVRLRPAWE+LRSYL ARGRKRFEVYVCTMAERDYALEMWRLLD
Sbjct: 236  NIILTRVNPSIRDTSVLVRLRPAWEELRSYLTARGRKRFEVYVCTMAERDYALEMWRLLD 295

Query: 1320 PDSNLINPKELLDRIVCVKAGLKKSLLNVFQDGICHPKMALVIDDRLKVWEDKDQPRVHV 1499
            PDS+LI+   L+DRIVCVK+G +KSLL+VFQDGICHPKMALVIDDRLKVW++KDQPRVHV
Sbjct: 296  PDSSLISSTRLIDRIVCVKSGSRKSLLSVFQDGICHPKMALVIDDRLKVWDEKDQPRVHV 355

Query: 1500 VPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDVLLQWISDAFYEDEMIDFP 1679
            VPAFAPYYAPQAEAN  VPVLCVARNVACNVRGGFFKEFD+ LL  ISD FYEDE  DFP
Sbjct: 356  VPAFAPYYAPQAEANGNVPVLCVARNVACNVRGGFFKEFDEGLLPRISDIFYEDEWKDFP 415

Query: 1680 PAPDVSNYLVSEDDASASNGNKDLLSFEGMTDVDVERRLKDANVQALQVSSAVNNIDPRL 1859
             APDV NYL+SEDD + S GNKD L F+GMTD +VERRLK+AN     V   VNN+D R 
Sbjct: 416  SAPDVGNYLISEDDNATSIGNKDQLCFKGMTDAEVERRLKEANCNVQAVHPMVNNLDLRS 475

Query: 1860 ASSIQHVLASSSSMIPQT-SQGPISFQNIQYPQAIPTVKPLGL-VGPSEPSLQSSPAREE 2033
            ASSIQHV+ASSS++ P T +Q  +   N Q  Q I   +PL    G  EPSLQ SPAREE
Sbjct: 476  ASSIQHVMASSSAVPPLTATQAMMPLPNNQCSQPIALGRPLVCQPGLPEPSLQGSPAREE 535

Query: 2034 GEVPESELDPDTRRRLLILQHGQDTREQTSNDPPFPVRPALQISMPPVQSHGSYFTMEEE 2213
            GEVPESELDPDTRRRLLILQHGQDTR+ T   PPF VR  L  ++PPVQS G++F MEEE
Sbjct: 536  GEVPESELDPDTRRRLLILQHGQDTRDPT---PPFTVRSPLHEAVPPVQSQGNWFPMEEE 592

Query: 2214 MSPRKMNRA-REFPIEPEALHFDKNRSPHPSFYHGLENSLPPNRSLHENQRFPKELRHGD 2390
            M+P+++NRA +EF +EPE +H +K R  H S++   ENS+   R LHENQR P +L  GD
Sbjct: 593  MNPKQLNRAPKEFTVEPETVHVNKKRPHHQSYFRSGENSISSERVLHENQRLPMQLHPGD 652

Query: 2391 DRFRPNHPLPNYHS-SGDEMPLGRTSSGNRNLHFESGRVTAQYPETPAGVLQEIAMKCGT 2567
            DR RPNH   NY+   G+EMP G  SS +R L FE G   AQ  ETPAGVLQ IAMKCG 
Sbjct: 653  DRLRPNHAAANYNCFPGEEMPAGLISSSHRGLQFEPGWAIAQCAETPAGVLQNIAMKCGA 712

Query: 2568 KVEFRPALAATTELQFSIEVWFVGEKIGEGIGKTRKEAQRQAAESSLRTLANKYISRVSP 2747
            KVEFR AL  TTEL+F +EVWFVGEK+GEGIGKTRKEA +QAAE SLRTLA+KY+S    
Sbjct: 713  KVEFRTALCDTTELKFCMEVWFVGEKVGEGIGKTRKEAHQQAAEISLRTLADKYLSNARS 772

Query: 2748 DLNAVHGDSNKLSHKNDNGFLRDANSFGYPAFPREDPLPIASTSDQSRFLDQRQEGSLKG 2927
            D N +HGD +K SH  +NGF+ D NSFGYPA  R+D LP+ASTS++SRF+DQR EGS K 
Sbjct: 773  DSNTLHGDMHKPSHIKENGFISDLNSFGYPACARDDVLPVASTSEESRFMDQRLEGSNKT 832

Query: 2928 MAPVSSLKELCMVEGLSLVFHAQPPLSTLSSHKGEVYAEVEIAGQILGKGIGLTWXXXXX 3107
               V+ LKELC +EG +L F A    S  S  KGEVYA+VE+AGQI+G G+G TW     
Sbjct: 833  ATSVAVLKELCTIEGFTLGFQAPTSPSASSVSKGEVYAQVEVAGQIVGIGVGTTWEEAKL 892

Query: 3108 XXXXXXXXXXXFMLSQGSQERLGSPRSSHGIPNKRLKTEYPRTLQRVPSSTRYSKNGPPV 3287
                        ML Q + +R GSPRS    PNKRLK ++ R LQR+PSS RYS++  PV
Sbjct: 893  KAAEEALGTLKSMLGQFTHKRSGSPRSPSATPNKRLKPDFSRVLQRIPSSGRYSRSETPV 952

Query: 3288 P 3290
            P
Sbjct: 953  P 953


>XP_008809393.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X2 [Phoenix dactylifera]
          Length = 950

 Score = 1243 bits (3217), Expect = 0.0
 Identities = 650/960 (67%), Positives = 732/960 (76%), Gaps = 3/960 (0%)
 Frame = +3

Query: 420  MFKSAFYHGNSLLGEVEIYLKNPNIDIPNKEIRISHFSPPSERCPPLAVLHTIAAGGACF 599
            MF+SA YHGNSL+GE EI  +N N     +EIRISHFS PSERCPPLAVLHTIA+ G  F
Sbjct: 1    MFESAVYHGNSLIGEAEISPQNSNPGAWLREIRISHFSLPSERCPPLAVLHTIASAGVSF 60

Query: 600  KMXXXXXXXXXXXXXXXXXXXXHTACLRENKTAIMQLGEEELHLVAMSSRKNFEQYACFW 779
            KM                    H ACL+E KTA++ LGEEELHLVAM SRKN   YACFW
Sbjct: 61   KMESKSPPSDESQLCSL-----HAACLKEQKTAVIPLGEEELHLVAMKSRKNLVHYACFW 115

Query: 780  GFNVMPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRI 959
            GFNV  GLYNSCL MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKI+SE DPQR+
Sbjct: 116  GFNVASGLYNSCLGMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKISSETDPQRV 175

Query: 960  SGMLAEVKRYQDDKVILKQYAENDQVVENGKVIKVQSEIVPPLSDNHQPIIRPLIRLQEK 1139
            +GMLAEVKRYQDDK ILKQYAENDQVVENG V KVQSEIVPPLSDNH  I RP+IRL EK
Sbjct: 176  TGMLAEVKRYQDDKSILKQYAENDQVVENGNVFKVQSEIVPPLSDNHPLITRPIIRLHEK 235

Query: 1140 NIILTRINPVIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVYVCTMAERDYALEMWRLLD 1319
            NIILTR+NP IRDTSVLVRLRPAWE+LRSYL ARGRKRFEVYVCTMAE+DYALEMWRLLD
Sbjct: 236  NIILTRVNPSIRDTSVLVRLRPAWEELRSYLTARGRKRFEVYVCTMAEKDYALEMWRLLD 295

Query: 1320 PDSNLINPKELLDRIVCVKAGLKKSLLNVFQDGICHPKMALVIDDRLKVWEDKDQPRVHV 1499
            PDS LIN   LLDRIVCVK+G +KSLLNVFQDGICHPKMALVIDDRLKVW +KDQPRVHV
Sbjct: 296  PDSRLINSMRLLDRIVCVKSGSRKSLLNVFQDGICHPKMALVIDDRLKVWYEKDQPRVHV 355

Query: 1500 VPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDVLLQWISDAFYEDEMIDFP 1679
            VPAFAPYYAPQAEAN  VPVLCVARNVACNVRGGFFK+FD+ +L  ISD FYEDEM DFP
Sbjct: 356  VPAFAPYYAPQAEANGNVPVLCVARNVACNVRGGFFKDFDEGVLPRISDIFYEDEMKDFP 415

Query: 1680 PAPDVSNYLVSEDDASASNGNKDLLSFEGMTDVDVERRLKDANVQALQVSSAVNNIDPRL 1859
             APDV NYL+SEDD + SNGNKD L  EGMTD +VERRLK+AN     V   VN +D R 
Sbjct: 416  SAPDVGNYLISEDDNATSNGNKDQLCSEGMTDAEVERRLKEANGNVQVVHPMVNTLDLRS 475

Query: 1860 ASSIQHVLASSSSMIPQT-SQGPISFQNIQYPQAIPTVKPLGLVGPSEPSLQSSPAREEG 2036
             S IQ V+ASSS + P T +Q  +   N Q PQ I   +PLG  G  EPSLQ SPAREEG
Sbjct: 476  MSPIQPVMASSSCVPPLTATQVMMPLPNNQCPQPIALGRPLGQPGLPEPSLQGSPAREEG 535

Query: 2037 EVPESELDPDTRRRLLILQHGQDTREQTSNDPPFPVRPALQISMPPVQSHGSYFTMEEEM 2216
            EVPESELDPDTRRRLLILQHGQD R+ T   P FPVR  L +++ PVQS GS+F +EEEM
Sbjct: 536  EVPESELDPDTRRRLLILQHGQDIRDPT---PQFPVRTPLHVAVSPVQSRGSWFPLEEEM 592

Query: 2217 SPRKMNRA-REFPIEPEALHFDKNRSPHPSFYHGLENSLPPNRSLHENQRFPKELRHGDD 2393
            +PR+ +RA +EFP+EPE +  DK R  H S+Y   ENS+  +R L+EN+R   +L HGDD
Sbjct: 593  NPRQPSRAPKEFPLEPETVCLDKKRPNHQSYYRSGENSISSDRVLNENRRLAMQLHHGDD 652

Query: 2394 RFRPNHPLPNYHS-SGDEMPLGRTSSGNRNLHFESGRVTAQYPETPAGVLQEIAMKCGTK 2570
            R RPNH   NY S  G+EMP GR SS ++++ FESGR TAQY  TPAGVLQ+IA KCG K
Sbjct: 653  RLRPNHAAANYDSFPGEEMPTGRISSSHKDIQFESGRATAQYARTPAGVLQDIATKCGAK 712

Query: 2571 VEFRPALAATTELQFSIEVWFVGEKIGEGIGKTRKEAQRQAAESSLRTLANKYISRVSPD 2750
            VEFR AL  TTELQFS+EVWFVGEKIGEGIGKTRKEAQ+QA + SLRTLANKY+S  + D
Sbjct: 713  VEFRTALCDTTELQFSMEVWFVGEKIGEGIGKTRKEAQQQATDFSLRTLANKYLSNATSD 772

Query: 2751 LNAVHGDSNKLSHKNDNGFLRDANSFGYPAFPREDPLPIASTSDQSRFLDQRQEGSLKGM 2930
               + GD  K S+  +NGF+ DANS GYPA+ R+D L +ASTS++SRF+D R EGS K  
Sbjct: 773  --TLRGDMLKPSNAKENGFISDANSSGYPAYARDDLLAVASTSEESRFMDLRLEGSKKST 830

Query: 2931 APVSSLKELCMVEGLSLVFHAQPPLSTLSSHKGEVYAEVEIAGQILGKGIGLTWXXXXXX 3110
              +++LKELC +EG SL F AQP  ST S  KGEV  +VE+AGQILGKG+G TW      
Sbjct: 831  TSIAALKELCTIEGFSLNFQAQPSPSTDSVSKGEVCTQVEVAGQILGKGVGTTWEEAKLQ 890

Query: 3111 XXXXXXXXXXFMLSQGSQERLGSPRSSHGIPNKRLKTEYPRTLQRVPSSTRYSKNGPPVP 3290
                       ML Q +Q+R GSPRS    PNKRLK ++ R LQR+PSS RYSKN   VP
Sbjct: 891  AAEEALGTLKSMLGQFTQKRSGSPRSVSATPNKRLKPDFSRMLQRIPSSGRYSKNETHVP 950


>XP_002267987.3 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            [Vitis vinifera]
          Length = 935

 Score = 1241 bits (3210), Expect = 0.0
 Identities = 644/962 (66%), Positives = 748/962 (77%), Gaps = 5/962 (0%)
 Frame = +3

Query: 420  MFKSAFYHGNSLLGEVEIYLKNPNIDIPNKEIRISHFSPPSERCPPLAVLHTIAAGGACF 599
            M+KS  Y G+ ++GEVEIY +N  +++  KEIRISH+S PSERCPPLAVLHTI + G CF
Sbjct: 1    MYKSIVYEGDDVVGEVEIYPQNQGLELM-KEIRISHYSQPSERCPPLAVLHTITSCGVCF 59

Query: 600  KMXXXXXXXXXXXXXXXXXXXXHTACLRENKTAIMQLGEEELHLVAMSSRKNFEQYACFW 779
            KM                    H+ C+RENKTA+M LGEEELHLVAM S+K   QY CFW
Sbjct: 60   KMESSKAQSQDTPLYLL-----HSTCIRENKTAVMSLGEEELHLVAMYSKKKDGQYPCFW 114

Query: 780  GFNVMPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRI 959
            GFNV  GLY+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN+EVDPQRI
Sbjct: 115  GFNVALGLYSSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEVDPQRI 174

Query: 960  SGMLAEVKRYQDDKVILKQYAENDQVVENGKVIKVQSEIVPPLSDNHQPIIRPLIRLQEK 1139
            SGM AEV+RYQDD+ ILKQYAENDQVVENGK+ K Q EIVP LSDNHQPI+RPLIRLQEK
Sbjct: 175  SGMAAEVRRYQDDRNILKQYAENDQVVENGKLFKTQPEIVPALSDNHQPIVRPLIRLQEK 234

Query: 1140 NIILTRINPVIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVYVCTMAERDYALEMWRLLD 1319
            NIILTRINP+IRDTSVLVRLRPAWEDLRSYL ARGRKRFEVYVCTMAERDYALEMWRLLD
Sbjct: 235  NIILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLD 294

Query: 1320 PDSNLINPKELLDRIVCVKAGLKKSLLNVFQDGICHPKMALVIDDRLKVWEDKDQPRVHV 1499
            P+SNLIN KELLDRIVCVK+G +KSL NVFQDGICHPKMALVIDDRLKVW++KDQPRVHV
Sbjct: 295  PESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVIDDRLKVWDEKDQPRVHV 354

Query: 1500 VPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDVLLQWISDAFYEDEMIDFP 1679
            VPAFAPYYAPQAEANNA+ VLCVARNVACNVRGGFFKEFD+ LLQ I +  YED++ D  
Sbjct: 355  VPAFAPYYAPQAEANNAISVLCVARNVACNVRGGFFKEFDEGLLQRIPEISYEDDIKDIR 414

Query: 1680 PAPDVSNYLVSEDDASASNGNKDLLSFEGMTDVDVERRLKDANVQALQVSSAVNNIDPRL 1859
             APDVSNYLVSEDDAS SNGN+D   F+GM DV+VER+LKD    A+   S V ++DPRL
Sbjct: 415  SAPDVSNYLVSEDDASVSNGNRDQPCFDGMADVEVERKLKD----AISAPSTVTSLDPRL 470

Query: 1860 ASSIQHVLASSSSMIPQ-TSQGPI-SFQNIQYPQAIPTVKPLGLVGPSEPSLQSSPAREE 2033
            +  +Q  +A+SS + PQ  +QG I  F N Q+PQ+   +KPL      EP++QSSPAREE
Sbjct: 471  SPPLQFAVAASSGLAPQPAAQGSIMPFSNKQFPQSASLIKPLA----PEPTMQSSPAREE 526

Query: 2034 GEVPESELDPDTRRRLLILQHGQDTREQTSNDPPFPVRPALQISMPPVQSHGSYFTMEEE 2213
            GEVPESELDPDTRRRLLILQHGQDTRE  S+DPPFPVRP +Q+S+P VQS GS+F  +EE
Sbjct: 527  GEVPESELDPDTRRRLLILQHGQDTREHASSDPPFPVRPPIQVSVPRVQSRGSWFPADEE 586

Query: 2214 MSPRKMNRA--REFPIEPEALHFDKNRSPHPSFYHGLENSLPPNRSLHENQRFPKELRHG 2387
            MSPR++NRA  +EFP++ + +H +K+R  HPSF+H +E+S   +R LHENQR  KE+ H 
Sbjct: 587  MSPRQLNRAVPKEFPLDSDTMHIEKHRPHHPSFFHKVESSASSDRILHENQRLSKEVLHR 646

Query: 2388 DDRFRPNHPLPNYHS-SGDEMPLGRTSSGNRNLHFESGRVTAQYPETPAGVLQEIAMKCG 2564
            DDR R NH LP YHS SG+E+PLGR+SS NR+L FESGR  A Y ETPA  LQEIAMKCG
Sbjct: 647  DDRLRLNHSLPGYHSFSGEEVPLGRSSS-NRDLDFESGR-GAPYAETPAVGLQEIAMKCG 704

Query: 2565 TKVEFRPALAATTELQFSIEVWFVGEKIGEGIGKTRKEAQRQAAESSLRTLANKYISRVS 2744
            TK+EFRP+L A TELQFSIEVWF GEKIGEG GKTR+EAQ QAAE+SL  L+ +Y+    
Sbjct: 705  TKLEFRPSLVAATELQFSIEVWFAGEKIGEGTGKTRREAQCQAAEASLMYLSYRYL---- 760

Query: 2745 PDLNAVHGDSNKLSHKNDNGFLRDANSFGYPAFPREDPLPIASTSDQSRFLDQRQEGSLK 2924
                  HGD N+  + +DN F+ D NSFGY +FP+E  +  ++ S+ SR LD R E S K
Sbjct: 761  ------HGDVNRFPNASDNNFMSDTNSFGYQSFPKEGSMSFSTASESSRLLDPRLESSKK 814

Query: 2925 GMAPVSSLKELCMVEGLSLVFHAQPPLSTLSSHKGEVYAEVEIAGQILGKGIGLTWXXXX 3104
             M  +S+LKELCM+EGL + F +QPPLS+ S+ K E+ A+VEI GQ+LGKG G TW    
Sbjct: 815  SMGSISALKELCMMEGLGVEFLSQPPLSSNSTQKEEICAQVEIDGQVLGKGTGSTWDDAK 874

Query: 3105 XXXXXXXXXXXXFMLSQGSQERLGSPRSSHGIPNKRLKTEYPRTLQRVPSSTRYSKNGPP 3284
                         ML Q SQ+R GSPRS  G+  KRLK+E+ R LQR PSS RYSKN  P
Sbjct: 875  MQAAEKALGSLKSMLGQFSQKRQGSPRSLQGM-GKRLKSEFTRGLQRTPSSGRYSKNTSP 933

Query: 3285 VP 3290
            VP
Sbjct: 934  VP 935


>XP_008809392.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X1 [Phoenix dactylifera]
          Length = 962

 Score = 1239 bits (3206), Expect = 0.0
 Identities = 650/972 (66%), Positives = 732/972 (75%), Gaps = 15/972 (1%)
 Frame = +3

Query: 420  MFKSAFYHGNSLLGEVEIYLKNPNIDIPNKEIRISHFSPPSERCPPLAVLHTIAAGGACF 599
            MF+SA YHGNSL+GE EI  +N N     +EIRISHFS PSERCPPLAVLHTIA+ G  F
Sbjct: 1    MFESAVYHGNSLIGEAEISPQNSNPGAWLREIRISHFSLPSERCPPLAVLHTIASAGVSF 60

Query: 600  KMXXXXXXXXXXXXXXXXXXXXHTACLRENKTAIMQLGEEELHLVAMSSRKNFEQYACFW 779
            KM                    H ACL+E KTA++ LGEEELHLVAM SRKN   YACFW
Sbjct: 61   KMESKSPPSDESQLCSL-----HAACLKEQKTAVIPLGEEELHLVAMKSRKNLVHYACFW 115

Query: 780  GFNVMPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRI 959
            GFNV  GLYNSCL MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKI+SE DPQR+
Sbjct: 116  GFNVASGLYNSCLGMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKISSETDPQRV 175

Query: 960  SGMLAEVKRYQDDKVILKQYAENDQVVENGKVIKVQSEIVPPLSDNHQPIIRPLIRLQEK 1139
            +GMLAEVKRYQDDK ILKQYAENDQVVENG V KVQSEIVPPLSDNH  I RP+IRL EK
Sbjct: 176  TGMLAEVKRYQDDKSILKQYAENDQVVENGNVFKVQSEIVPPLSDNHPLITRPIIRLHEK 235

Query: 1140 NIILTRINPVIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVYVCTMAERDYALEMWRLLD 1319
            NIILTR+NP IRDTSVLVRLRPAWE+LRSYL ARGRKRFEVYVCTMAE+DYALEMWRLLD
Sbjct: 236  NIILTRVNPSIRDTSVLVRLRPAWEELRSYLTARGRKRFEVYVCTMAEKDYALEMWRLLD 295

Query: 1320 PDSNLINPKELLDRIVCVKAGLKKSLLNVFQDGICHPKMALVIDDRLKVWEDKDQPRVHV 1499
            PDS LIN   LLDRIVCVK+G +KSLLNVFQDGICHPKMALVIDDRLKVW +KDQPRVHV
Sbjct: 296  PDSRLINSMRLLDRIVCVKSGSRKSLLNVFQDGICHPKMALVIDDRLKVWYEKDQPRVHV 355

Query: 1500 VPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDVLLQWISDAFYEDEMIDFP 1679
            VPAFAPYYAPQAEAN  VPVLCVARNVACNVRGGFFK+FD+ +L  ISD FYEDEM DFP
Sbjct: 356  VPAFAPYYAPQAEANGNVPVLCVARNVACNVRGGFFKDFDEGVLPRISDIFYEDEMKDFP 415

Query: 1680 PAPDVSNYLVSEDDASASNGNKDLLSFEGMTDVDVERRLKDANVQALQVSSAVNNIDPRL 1859
             APDV NYL+SEDD + SNGNKD L  EGMTD +VERRLK+AN     V   VN +D R 
Sbjct: 416  SAPDVGNYLISEDDNATSNGNKDQLCSEGMTDAEVERRLKEANGNVQVVHPMVNTLDLRS 475

Query: 1860 ASSIQHVLASSSSMIPQT-SQGPISFQNIQYPQAIPTVKPLGLVGPSEPSLQSSPAREEG 2036
             S IQ V+ASSS + P T +Q  +   N Q PQ I   +PLG  G  EPSLQ SPAREEG
Sbjct: 476  MSPIQPVMASSSCVPPLTATQVMMPLPNNQCPQPIALGRPLGQPGLPEPSLQGSPAREEG 535

Query: 2037 EVPESELDPDTRRRLLILQHGQDTREQTSNDPPFPVRPALQISMPPVQSHGSYFTMEEEM 2216
            EVPESELDPDTRRRLLILQHGQD R+ T   P FPVR  L +++ PVQS GS+F +EEEM
Sbjct: 536  EVPESELDPDTRRRLLILQHGQDIRDPT---PQFPVRTPLHVAVSPVQSRGSWFPLEEEM 592

Query: 2217 SPRKMNRA-REFPIEPEALHFDKNRSPHPSFYHGLENSLPPNRSLHENQRFPKELRHGDD 2393
            +PR+ +RA +EFP+EPE +  DK R  H S+Y   ENS+  +R L+EN+R   +L HGDD
Sbjct: 593  NPRQPSRAPKEFPLEPETVCLDKKRPNHQSYYRSGENSISSDRVLNENRRLAMQLHHGDD 652

Query: 2394 RFRPNHPLPNYHS-------------SGDEMPLGRTSSGNRNLHFESGRVTAQYPETPAG 2534
            R RPNH   NY S              G+EMP GR SS ++++ FESGR TAQY  TPAG
Sbjct: 653  RLRPNHAAANYDSFPGVLFPNQTLDFEGEEMPTGRISSSHKDIQFESGRATAQYARTPAG 712

Query: 2535 VLQEIAMKCGTKVEFRPALAATTELQFSIEVWFVGEKIGEGIGKTRKEAQRQAAESSLRT 2714
            VLQ+IA KCG KVEFR AL  TTELQFS+EVWFVGEKIGEGIGKTRKEAQ+QA + SLRT
Sbjct: 713  VLQDIATKCGAKVEFRTALCDTTELQFSMEVWFVGEKIGEGIGKTRKEAQQQATDFSLRT 772

Query: 2715 LANKYISRVSPDLNAVHGDSNKLSHKNDNGFLRDANSFGYPAFPREDPLPIASTSDQSRF 2894
            LANKY+S  + D   + GD  K S+  +NGF+ DANS GYPA+ R+D L +ASTS++SRF
Sbjct: 773  LANKYLSNATSD--TLRGDMLKPSNAKENGFISDANSSGYPAYARDDLLAVASTSEESRF 830

Query: 2895 LDQRQEGSLKGMAPVSSLKELCMVEGLSLVFHAQPPLSTLSSHKGEVYAEVEIAGQILGK 3074
            +D R EGS K    +++LKELC +EG SL F AQP  ST S  KGEV  +VE+AGQILGK
Sbjct: 831  MDLRLEGSKKSTTSIAALKELCTIEGFSLNFQAQPSPSTDSVSKGEVCTQVEVAGQILGK 890

Query: 3075 GIGLTWXXXXXXXXXXXXXXXXFMLSQGSQERLGSPRSSHGIPNKRLKTEYPRTLQRVPS 3254
            G+G TW                 ML Q +Q+R GSPRS    PNKRLK ++ R LQR+PS
Sbjct: 891  GVGTTWEEAKLQAAEEALGTLKSMLGQFTQKRSGSPRSVSATPNKRLKPDFSRMLQRIPS 950

Query: 3255 STRYSKNGPPVP 3290
            S RYSKN   VP
Sbjct: 951  SGRYSKNETHVP 962


>XP_018856749.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            [Juglans regia]
          Length = 949

 Score = 1236 bits (3198), Expect = 0.0
 Identities = 642/964 (66%), Positives = 741/964 (76%), Gaps = 7/964 (0%)
 Frame = +3

Query: 420  MFKSAFYHGNSLLGEVEIYLKNPNIDIPN---KEIRISHFSPPSERCPPLAVLHTIAAGG 590
            M+K+  Y G  LLGEVE+Y    N D P    KEIRIS+FS  SERCPPLAVLHTIA+ G
Sbjct: 3    MYKAVVYQGEELLGEVEMYPVGNNNDNPMIEAKEIRISYFSQASERCPPLAVLHTIASSG 62

Query: 591  ACFKMXXXXXXXXXXXXXXXXXXXXHTACLRENKTAIMQLGEEELHLVAMSSRKNFEQYA 770
             CFKM                    H++C +ENKTA++++G EELH VAMSSR   +QY 
Sbjct: 63   ICFKMESKSSQSQDSPLYLL-----HSSCFKENKTAVIEVGGEELHFVAMSSRNTDKQYP 117

Query: 771  CFWGFNVMPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDP 950
            CFWGFNV  GLY+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDP
Sbjct: 118  CFWGFNVASGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDP 177

Query: 951  QRISGMLAEVKRYQDDKVILKQYAENDQVVENGKVIKVQSEIVPPLSDNHQPIIRPLIRL 1130
            QRISGM+AEVKRYQDDK ILKQYAENDQVVENGKVIK QSE+VP +SDNHQPI+RP+IRL
Sbjct: 178  QRISGMVAEVKRYQDDKHILKQYAENDQVVENGKVIKSQSEVVPAVSDNHQPIVRPIIRL 237

Query: 1131 QEKNIILTRINPVIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVYVCTMAERDYALEMWR 1310
            QEKNIILTRINP IRDTSVLVRLRPAW+DLRSYLIARGRKRFEVYVCTMAERDYALEMWR
Sbjct: 238  QEKNIILTRINPQIRDTSVLVRLRPAWDDLRSYLIARGRKRFEVYVCTMAERDYALEMWR 297

Query: 1311 LLDPDSNLINPKELLDRIVCVKAGLKKSLLNVFQDGICHPKMALVIDDRLKVWEDKDQPR 1490
            LLDPDSNLIN KELL RIVCVK+G +KSL NVFQDG+CHPKMALVIDDRLKVW++KDQPR
Sbjct: 298  LLDPDSNLINSKELLARIVCVKSGSRKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPR 357

Query: 1491 VHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDVLLQWISDAFYEDEMI 1670
            VHVVPAFAPYYAPQAEANNA+PVLCVARNVACNVRGGFFKEFD+ LLQ IS+  +ED++ 
Sbjct: 358  VHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFDEGLLQKISEIAHEDDIK 417

Query: 1671 DFPPAPDVSNYLVSEDDASASNGNKDLLSFEGMTDVDVERRLKDANVQALQVSSAVNNID 1850
            D P  PDVSNYLVSEDDASASNGN+D LSF+GM D +VERRLKDA   +  +SSAV N+ 
Sbjct: 418  DIPFPPDVSNYLVSEDDASASNGNRDPLSFDGMADAEVERRLKDAISASSTISSAVANLV 477

Query: 1851 PRLASSIQHVLASSSSMIPQTSQGPIS-FQNIQYPQAIPTVKPLGLVGPSEPSLQSSPAR 2027
            PRL  ++Q+ + S+SS IP T+   +S F +IQ+PQ     KP G +GP EPSLQSSPAR
Sbjct: 478  PRLVPALQNTITSASSSIPLTTTQVLSHFPSIQFPQPASLAKPAGHIGPQEPSLQSSPAR 537

Query: 2028 EEGEVPESELDPDTRRRLLILQHGQDTREQTSNDPPFPVRPALQISMPPVQSHGSYFTME 2207
            EEGEVPESELDPDTRRRLLILQHGQDTRE  S +P FPVRP +Q+  P VQS G +F +E
Sbjct: 538  EEGEVPESELDPDTRRRLLILQHGQDTREHASTEPQFPVRPPIQVPAPRVQSRGGWFPVE 597

Query: 2208 EEMSPRKMNR--AREFPIEPEALHFDKNRSPHPSFYHGLENSLPPNRSLHENQRFPKELR 2381
            EEM P++++R  A+EFP++PE++H +K+   HP F+  +E+S+  +R LHENQR  KE  
Sbjct: 598  EEMGPQQLSRAGAKEFPLDPESMHIEKHGPHHPPFFPKVESSINSDRVLHENQRLQKEAF 657

Query: 2382 HGDDRFRPNHPLPNYHS-SGDEMPLGRTSSGNRNLHFESGRVTAQYPETPAGVLQEIAMK 2558
            H DDR R NH L +YHS SG+++PL R SS NR+L FESGR      ETPAGVLQ+IAMK
Sbjct: 658  HRDDRLRLNHTLSSYHSFSGEDIPLSRPSSSNRDLDFESGRGVPN-EETPAGVLQDIAMK 716

Query: 2559 CGTKVEFRPALAATTELQFSIEVWFVGEKIGEGIGKTRKEAQRQAAESSLRTLANKYISR 2738
            CGTKVEFRPAL  + ELQFS+E WF GEKIGEGIG+TR+EAQRQAAE SL+ LAN YI  
Sbjct: 717  CGTKVEFRPALIGSMELQFSMEAWFAGEKIGEGIGRTRREAQRQAAEGSLKNLANVYI-- 774

Query: 2739 VSPDLNAVHGDSNKLSHKNDNGFLRDANSFGYPAFPREDPLPIASTSDQSRFLDQRQEGS 2918
                    HGD + L + N NGFL   NSFG     +E+P+  ++ S+ SR LD R EGS
Sbjct: 775  --------HGDWSIL-NANGNGFLGSVNSFGDQPLSKEEPVSFSAASEPSRPLDPRLEGS 825

Query: 2919 LKGMAPVSSLKELCMVEGLSLVFHAQPPLSTLSSHKGEVYAEVEIAGQILGKGIGLTWXX 3098
             K M  VS+LKELC +EGL + F  +PP S  S    EVYA+VEI GQ+LGKGIGLTW  
Sbjct: 826  KKLMGSVSALKELCTMEGLDVAFQPRPPPSGNSVQNDEVYAQVEIDGQVLGKGIGLTWDE 885

Query: 3099 XXXXXXXXXXXXXXFMLSQGSQERLGSPRSSHGIPNKRLKTEYPRTLQRVPSSTRYSKNG 3278
                           ML Q + +R    RS HGI NKR+K E+ R LQR+PSS RY+KN 
Sbjct: 886  AKMQAAEKALGSLRSMLGQSNPKRPDFSRSLHGISNKRMKPEFSRVLQRMPSSARYAKNA 945

Query: 3279 PPVP 3290
            PPVP
Sbjct: 946  PPVP 949


>XP_010918443.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X2 [Elaeis guineensis]
          Length = 915

 Score = 1231 bits (3186), Expect = 0.0
 Identities = 639/925 (69%), Positives = 721/925 (77%), Gaps = 4/925 (0%)
 Frame = +3

Query: 420  MFKSAFYHGNSLLGEVEIYLKNPNIDIPNKEIRISHFSPPSERCPPLAVLHTIAAGGACF 599
            MFKSA YHGNSL+GEVEI  +N N     +EIRISHFSPPSERCPPLAVLHTIA+    F
Sbjct: 1    MFKSAVYHGNSLIGEVEISPQNSNPGAWLREIRISHFSPPSERCPPLAVLHTIASASVSF 60

Query: 600  KMXXXXXXXXXXXXXXXXXXXXHTACLRENKTAIMQLGEEELHLVAMSSRKNFEQYACFW 779
            KM                    H ACLR+ KTA++ LGEEELHLVAM  RKN   YACFW
Sbjct: 61   KMESKSPPSDESQLCSL-----HAACLRDQKTAVIPLGEEELHLVAMKPRKNLMHYACFW 115

Query: 780  GFNVMPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRI 959
            GFNV  GLYNSCL MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKI+SE DPQR+
Sbjct: 116  GFNVASGLYNSCLGMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKISSETDPQRV 175

Query: 960  SGMLAEVKRYQDDKVILKQYAENDQVVENGKVIKVQSEIVPPLSDNHQPIIRPLIRLQEK 1139
            +GMLAEVKRYQDDK ILKQYAENDQVVENG V KVQSE+VPPLSDNHQ I RP+IRLQEK
Sbjct: 176  TGMLAEVKRYQDDKSILKQYAENDQVVENGNVFKVQSEVVPPLSDNHQLITRPIIRLQEK 235

Query: 1140 NIILTRINPVIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVYVCTMAERDYALEMWRLLD 1319
            NIILTR+NP IRDTSVLVRLRPAWE+LRSYL ARGRKRFEVYVCTMAE+DYALEMWRLLD
Sbjct: 236  NIILTRVNPSIRDTSVLVRLRPAWEELRSYLTARGRKRFEVYVCTMAEKDYALEMWRLLD 295

Query: 1320 PDSNLINPKELLDRIVCVKAGLKKSLLNVFQDGICHPKMALVIDDRLKVWEDKDQPRVHV 1499
            PDS+LIN  +LLDRIVCVK+G +KSLLNVFQDGICHPKMALVIDDRLKVW+DKDQPRVHV
Sbjct: 296  PDSSLINAMQLLDRIVCVKSGSRKSLLNVFQDGICHPKMALVIDDRLKVWDDKDQPRVHV 355

Query: 1500 VPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDVLLQWISDAFYEDEMIDFP 1679
            VPAFAPYYAPQAEAN  VPVLCVARNVACNVRGGFFK+FD+ LL  ISD FYEDEM DFP
Sbjct: 356  VPAFAPYYAPQAEANGNVPVLCVARNVACNVRGGFFKDFDEGLLPRISDIFYEDEMKDFP 415

Query: 1680 PAPDVSNYLVSEDDASASNGNKDLLSFEGMTDVDVERRLKDANVQALQVSSAVNNIDPRL 1859
             APDV NYL+SEDD + SNG+KDLL  EGMTD +VERRLK+AN     +   VN  DP  
Sbjct: 416  SAPDVGNYLISEDDNATSNGSKDLLCSEGMTDAEVERRLKEANGNVQAIYPMVNTFDPSS 475

Query: 1860 ASSIQHVLASSSSMIPQ--TSQGPISFQNIQYPQAIPTVKPLGLVGPSEPSLQSSPAREE 2033
             SSIQHV+ASSS  +P    +Q  +   N Q PQ I   +PLG  G  EPSLQ SPAREE
Sbjct: 476  MSSIQHVMASSSG-VPSLAATQVMMPLPNNQCPQPIALGRPLGQPGLPEPSLQGSPAREE 534

Query: 2034 GEVPESELDPDTRRRLLILQHGQDTREQTSNDPPFPVRPALQISMPPVQSHGSYFTMEEE 2213
            GEVPESELDPDTRRRLLILQHGQD R+ T   P FPVRP L +++ PVQS GS+F +EEE
Sbjct: 535  GEVPESELDPDTRRRLLILQHGQDIRDPT---PQFPVRPPLHVAVSPVQSRGSWFPLEEE 591

Query: 2214 MSPRKMNRA-REFPIEPEALHFDKNRSPHPSFYHGLENSLPPNRSLHENQRFPKELRHGD 2390
            M+PR+++RA +EF +EPE + FDK R  H S+Y   ENS+  +R L+EN+R   +LRHGD
Sbjct: 592  MNPRQLSRAPKEFSLEPETVCFDKKRPNHQSYYRTGENSISSDRVLNENRRLAMQLRHGD 651

Query: 2391 DRFRPNHPLPNYHS-SGDEMPLGRTSSGNRNLHFESGRVTAQYPETPAGVLQEIAMKCGT 2567
            DR RPNH   N  S SG+EMP+GR SS +R++ FESG+VT QY  TPAGVLQ+IA KCG 
Sbjct: 652  DRLRPNHAAANCDSFSGEEMPIGRISSSHRDIQFESGQVTVQYAGTPAGVLQDIATKCGA 711

Query: 2568 KVEFRPALAATTELQFSIEVWFVGEKIGEGIGKTRKEAQRQAAESSLRTLANKYISRVSP 2747
            KVEFR AL  TTELQFS+EVWFVGEKIGEGIGKTRKEAQ+QAAE SLRTLANKY+S  + 
Sbjct: 712  KVEFRTALCDTTELQFSVEVWFVGEKIGEGIGKTRKEAQQQAAEFSLRTLANKYLSNATS 771

Query: 2748 DLNAVHGDSNKLSHKNDNGFLRDANSFGYPAFPREDPLPIASTSDQSRFLDQRQEGSLKG 2927
            D   + GD  K S+  +NGF+ D NSFGYPA+ R+D L +ASTS++SRFLD R EGS K 
Sbjct: 772  D--TLRGDMLKPSNAKENGFISDPNSFGYPAYVRDDLLGVASTSEESRFLDLRLEGSKKS 829

Query: 2928 MAPVSSLKELCMVEGLSLVFHAQPPLSTLSSHKGEVYAEVEIAGQILGKGIGLTWXXXXX 3107
             A V++LKELC +EG +L+F  QP  ST S  KGEVYA+VE+AGQILGKG+G TW     
Sbjct: 830  TASVAALKELCTIEGFNLIFQPQPSASTDSVGKGEVYAQVEVAGQILGKGVGTTWEEAKL 889

Query: 3108 XXXXXXXXXXXFMLSQGSQERLGSP 3182
                        ML Q +Q+R GSP
Sbjct: 890  QAAEEALGTLKSMLGQFTQKRSGSP 914


>XP_007025680.2 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            [Theobroma cacao]
          Length = 978

 Score = 1229 bits (3180), Expect = 0.0
 Identities = 635/980 (64%), Positives = 741/980 (75%), Gaps = 23/980 (2%)
 Frame = +3

Query: 420  MFKSAFYHGNSLLGEVEIY-----------------LKNPNIDIPNKEIRISHFSPPSER 548
            M+KS  Y G  +LGEVEIY                  K   ++   KEIRI + +  SER
Sbjct: 4    MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63

Query: 549  CPPLAVLHTIAAGGACFKMXXXXXXXXXXXXXXXXXXXXHTACLRENKTAIMQLGEEELH 728
            CPPLAVLHTI + G CFKM                    H+ C+R+NKTA+M +G+ ELH
Sbjct: 64   CPPLAVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCELH 123

Query: 729  LVAMSSRKNFEQYACFWGFNVMPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDR 908
            LVAM SR +     CFWGFNV  GLY+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDR
Sbjct: 124  LVAMYSRNS--DRPCFWGFNVSRGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDR 181

Query: 909  IDALQRKINSEVDPQRISGMLAEVKRYQDDKVILKQYAENDQVVENGKVIKVQSEIVPPL 1088
            I+ALQRK+ +EVDPQR++GM+AE+KRYQDDK ILKQYAENDQVVENGKVIK+QSE+VP L
Sbjct: 182  IEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVVPAL 241

Query: 1089 SDNHQPIIRPLIRLQEKNIILTRINPVIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVYV 1268
            SDNHQPIIRPLIRLQEKNIILTRINP IRDTSVLVRLRPAWEDLRSYL ARGRKRFEVYV
Sbjct: 242  SDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 301

Query: 1269 CTMAERDYALEMWRLLDPDSNLINPKELLDRIVCVKAGLKKSLLNVFQDGICHPKMALVI 1448
            CTMAERDYALEMWRLLDP+SNLIN KELLDRIVCVK+G +KSL NVFQDGICHPKMALVI
Sbjct: 302  CTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVI 361

Query: 1449 DDRLKVWEDKDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDVL 1628
            DDRLKVW++KDQPRVHVVPAFAPYYAPQAEANN +PVLCVARNVACNVRGGFF+EFD+ L
Sbjct: 362  DDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGL 421

Query: 1629 LQWISDAFYEDEMIDFPPAPDVSNYLVSEDDASASNGNKDLLSFEGMTDVDVERRLKDAN 1808
            LQ I +  YED++ D P  PDV NYLVSEDD SA NGNKD L F+GM D +VERRLK+A 
Sbjct: 422  LQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERRLKEAI 481

Query: 1809 VQALQVSSAVNNIDPRLASSIQHVLASSSSMIPQTSQGP--ISFQNIQYPQAIPTVKPLG 1982
                 VSSA  N+DPRL  S+Q+ + SSSS IP ++  P  +SF N+Q+P A P VKP+ 
Sbjct: 482  SATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPVVKPVA 541

Query: 1983 LVGPSEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTREQTSNDPPF-PVRPALQ 2159
             V   EPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTR+ T  +P F PVRP +Q
Sbjct: 542  PVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPVRPTMQ 601

Query: 2160 ISMPPVQSHGSYFTMEEEMSPRKMNRA--REFPIEPEALHFDKNRSPHPSFYHGLENSLP 2333
            +S+P  QS GS+F  EEEMSPR++NRA  +EFP++ E +H +K+R  HP F+  +E+S+P
Sbjct: 602  VSVPRGQSRGSWFAAEEEMSPRQLNRAAPKEFPLDSERMHIEKHR--HPPFFPKVESSIP 659

Query: 2334 PNRSLHENQRFPKELRHGDDRFRPNHPLPNYHS-SGDEMPLGRTSSGNRNLHFESGRVTA 2510
             +R L ENQR  KE  H DDR   NH   +YHS SG+EMPL ++SS +R+L FESGR T 
Sbjct: 660  SDRLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFESGR-TV 718

Query: 2511 QYPETPAGVLQEIAMKCGTKVEFRPALAATTELQFSIEVWFVGEKIGEGIGKTRKEAQRQ 2690
               ET AGVLQ+IAMKCG KVEFRPAL A+ +LQFSIE WF GEK+GEG+G+TR+EAQRQ
Sbjct: 719  TSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFAGEKVGEGVGRTRREAQRQ 778

Query: 2691 AAESSLRTLANKYISRVSPDLNAVHGDSNKLSHKNDNGFLRDANSFGYPAFPREDPLPIA 2870
            AAE S++ LAN Y+SR+ PD  +  GD ++L + NDNGF  + NSFG     +E+ L  +
Sbjct: 779  AAEESIKNLANTYLSRIKPDSGSAEGDLSRLHNINDNGFPGNVNSFGNQLLAKEESLSFS 838

Query: 2871 STSDQSRFLDQRQEGSLKGMAPVSSLKELCMVEGLSLVFHAQPPLSTLSSHKGEVYAEVE 3050
            + S+QSR  D R EGS K M  V++LKELCM+EGL +VF  QPP S+ +  K EVYA+VE
Sbjct: 839  TASEQSRLADPRLEGSKKSMGSVTALKELCMMEGLGVVFQPQPPSSSNALQKDEVYAQVE 898

Query: 3051 IAGQILGKGIGLTWXXXXXXXXXXXXXXXXFMLSQGSQERLGSPRSSHGIPNKRLKTEYP 3230
            I GQ+LGKG GLTW                 ML Q SQ+R GSPRS  G+ NKRLK E+P
Sbjct: 899  IDGQVLGKGTGLTWEEAKMQAAEKALGSLRSMLGQYSQKRQGSPRSLQGMQNKRLKPEFP 958

Query: 3231 RTLQRVPSSTRYSKNGPPVP 3290
            R LQR+PSS RY KN PPVP
Sbjct: 959  RVLQRMPSSGRYPKNAPPVP 978


>EOY28302.1 C-terminal domain phosphatase-like 1 isoform 1 [Theobroma cacao]
          Length = 978

 Score = 1228 bits (3178), Expect = 0.0
 Identities = 634/980 (64%), Positives = 741/980 (75%), Gaps = 23/980 (2%)
 Frame = +3

Query: 420  MFKSAFYHGNSLLGEVEIY-----------------LKNPNIDIPNKEIRISHFSPPSER 548
            M+KS  Y G  +LGEVEIY                  K   ++   KEIRI + +  SER
Sbjct: 4    MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63

Query: 549  CPPLAVLHTIAAGGACFKMXXXXXXXXXXXXXXXXXXXXHTACLRENKTAIMQLGEEELH 728
            CPPLAVLHTI + G CFKM                    H+ C+R+NKTA+M +G+ ELH
Sbjct: 64   CPPLAVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCELH 123

Query: 729  LVAMSSRKNFEQYACFWGFNVMPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDR 908
            LVAM SR +     CFWGFNV  GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDR
Sbjct: 124  LVAMYSRNS--DRPCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSFEDR 181

Query: 909  IDALQRKINSEVDPQRISGMLAEVKRYQDDKVILKQYAENDQVVENGKVIKVQSEIVPPL 1088
            I+ALQRK+ +EVDPQR++GM+AE+KRYQDDK ILKQYAENDQVVENGKVIK+QSE+VP L
Sbjct: 182  IEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVVPAL 241

Query: 1089 SDNHQPIIRPLIRLQEKNIILTRINPVIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVYV 1268
            SDNHQPIIRPLIRLQEKNIILTRINP IRDTSVLVRLRPAWEDLRSYL ARGRKRFEVYV
Sbjct: 242  SDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 301

Query: 1269 CTMAERDYALEMWRLLDPDSNLINPKELLDRIVCVKAGLKKSLLNVFQDGICHPKMALVI 1448
            CTMAERDYALEMWRLLDP+SNLIN KELLDRIVCVK+G +KSL NVFQDGICHPKMALVI
Sbjct: 302  CTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVI 361

Query: 1449 DDRLKVWEDKDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDVL 1628
            DDRLKVW++KDQPRVHVVPAFAPYYAPQAEANN +PVLCVARNVACNVRGGFF+EFD+ L
Sbjct: 362  DDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGL 421

Query: 1629 LQWISDAFYEDEMIDFPPAPDVSNYLVSEDDASASNGNKDLLSFEGMTDVDVERRLKDAN 1808
            LQ I +  YED++ D P  PDV NYLVSEDD SA NGNKD L F+GM D +VERRLK+A 
Sbjct: 422  LQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERRLKEAI 481

Query: 1809 VQALQVSSAVNNIDPRLASSIQHVLASSSSMIPQTSQGP--ISFQNIQYPQAIPTVKPLG 1982
                 VSSA  N+DPRL  S+Q+ + SSSS IP ++  P  +SF N+Q+P A P VKP+ 
Sbjct: 482  SATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPVVKPVA 541

Query: 1983 LVGPSEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTREQTSNDPPF-PVRPALQ 2159
             V   EPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTR+ T  +P F PVRP +Q
Sbjct: 542  PVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPVRPTMQ 601

Query: 2160 ISMPPVQSHGSYFTMEEEMSPRKMNRA--REFPIEPEALHFDKNRSPHPSFYHGLENSLP 2333
            +S+P  QS GS+F  EEEMSPR++NRA  +EFP++ E +H +K+R  HP F+  +E+S+P
Sbjct: 602  VSVPRGQSRGSWFAAEEEMSPRQLNRAAPKEFPLDSERMHIEKHR--HPPFFPKVESSIP 659

Query: 2334 PNRSLHENQRFPKELRHGDDRFRPNHPLPNYHS-SGDEMPLGRTSSGNRNLHFESGRVTA 2510
             +R L ENQR  KE  H DDR   NH   +YHS SG+EMPL ++SS +R+L FESGR T 
Sbjct: 660  SDRLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFESGR-TV 718

Query: 2511 QYPETPAGVLQEIAMKCGTKVEFRPALAATTELQFSIEVWFVGEKIGEGIGKTRKEAQRQ 2690
               ET AGVLQ+IAMKCG KVEFRPAL A+ +LQFSIE WF GEK+GEG+G+TR+EAQRQ
Sbjct: 719  TSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFAGEKVGEGVGRTRREAQRQ 778

Query: 2691 AAESSLRTLANKYISRVSPDLNAVHGDSNKLSHKNDNGFLRDANSFGYPAFPREDPLPIA 2870
            AAE S++ LAN Y+SR+ PD  +  GD ++L + NDNGF  + NSFG     +E+ L  +
Sbjct: 779  AAEESIKNLANTYLSRIKPDSGSAEGDLSRLHNINDNGFPSNVNSFGNQLLAKEESLSFS 838

Query: 2871 STSDQSRFLDQRQEGSLKGMAPVSSLKELCMVEGLSLVFHAQPPLSTLSSHKGEVYAEVE 3050
            + S+QSR  D R EGS K M  V++LKELCM+EGL +VF  QPP S+ +  K EVYA+VE
Sbjct: 839  TASEQSRLADPRLEGSKKSMGSVTALKELCMMEGLGVVFQPQPPSSSNALQKDEVYAQVE 898

Query: 3051 IAGQILGKGIGLTWXXXXXXXXXXXXXXXXFMLSQGSQERLGSPRSSHGIPNKRLKTEYP 3230
            I GQ+LGKG GLTW                 ML Q SQ+R GSPRS  G+ NKRLK E+P
Sbjct: 899  IDGQVLGKGTGLTWEEAKMQAAEKALGSLRSMLGQYSQKRQGSPRSLQGMQNKRLKPEFP 958

Query: 3231 RTLQRVPSSTRYSKNGPPVP 3290
            R LQR+PSS RY KN PPVP
Sbjct: 959  RVLQRMPSSGRYPKNAPPVP 978


>GAV74436.1 dsrm domain-containing protein/NIF domain-containing protein
            [Cephalotus follicularis]
          Length = 980

 Score = 1227 bits (3174), Expect = 0.0
 Identities = 644/988 (65%), Positives = 746/988 (75%), Gaps = 31/988 (3%)
 Frame = +3

Query: 420  MFKSAFYHGNSLLGEVEIYLKNPNI--------------------DIPNKEIRISHFSPP 539
            MFK+  Y G  +LGEVEIY +                        ++  KEIRIS+ S  
Sbjct: 1    MFKTVVYQGEEILGEVEIYPQQLQHGGGGGGEDEEEQEEKRKIIEEVLRKEIRISYLSQG 60

Query: 540  SERCPPLAVLHTI---AAGGACFKMXXXXXXXXXXXXXXXXXXXXHTACLRENKTAIMQL 710
            SERCPPLAVLHTI   +    CFKM                    H++C+R+NKTA+M L
Sbjct: 61   SERCPPLAVLHTITCSSGSSVCFKMESPKSSLTQQQDSPLHLL--HSSCIRDNKTAVMPL 118

Query: 711  GEEELHLVAMSSRKNFEQYACFWGFNVMPGLYNSCLVMLNLRCLGIVFDLDETLIVANTM 890
            G EELHLVAM SR N  QY CFWGFNV  GLYNSCLVMLNLRCLGIVFDLDETL+VANTM
Sbjct: 119  GAEELHLVAMCSRSNERQYPCFWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLVVANTM 178

Query: 891  RSFEDRIDALQRKINSEVDPQRISGMLAEVKRYQDDKVILKQYAENDQVVENGKVIKVQS 1070
            RSFEDRI+ALQRKI++EVDPQR+SGMLAE+KRYQDDK ILKQYAENDQVVENGKVIK+QS
Sbjct: 179  RSFEDRIEALQRKISTEVDPQRMSGMLAEIKRYQDDKNILKQYAENDQVVENGKVIKIQS 238

Query: 1071 EIVPPLSDNHQPIIRPLIRLQEKNIILTRINPVIRDTSVLVRLRPAWEDLRSYLIARGRK 1250
            E+VP LSDNHQPI+RPLIRLQEKNIILTRINP+IRDTSVLVRLRPAWEDLRSYL A+GRK
Sbjct: 239  EVVPALSDNHQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRK 298

Query: 1251 RFEVYVCTMAERDYALEMWRLLDPDSNLINPKELLDRIVCVKAGLKKSLLNVFQDGICHP 1430
            RFEVYVCTMAERDYALEMWRLLDPDSNLIN KELLDRIVCVK+G +KSL NVFQDGICHP
Sbjct: 299  RFEVYVCTMAERDYALEMWRLLDPDSNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHP 358

Query: 1431 KMALVIDDRLKVWEDKDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFK 1610
            KMALVIDDRLKVW +KDQPRVHVVPAFAPYYAPQAEANN +PVLCVARNVACNVRGGFFK
Sbjct: 359  KMALVIDDRLKVWNEKDQPRVHVVPAFAPYYAPQAEANNVIPVLCVARNVACNVRGGFFK 418

Query: 1611 EFDDVLLQWISDAFYEDEMIDFPPAPDVSNYLVSEDDASASNGNKDLLSFEGMTDVDVER 1790
            +FD+ LLQ I D  YED++ D P  PDVSNYLVSEDDA ASNGNKD LSF+GM DV+VER
Sbjct: 419  DFDEGLLQRIPDILYEDDIKDIPFPPDVSNYLVSEDDAPASNGNKDPLSFDGMADVEVER 478

Query: 1791 RLKDANVQALQVSSAVNNIDPRLASSIQHVLASSSSMIPQ-TSQGP-ISFQNIQYPQAIP 1964
            RLK    +A+  SS V NIDPRLA  +   +A+ S+  P  TSQ P + F  +Q+PQ   
Sbjct: 479  RLK----EAISASSTVANIDPRLA-PLPFTMATFSNPAPMPTSQVPVVPFPTMQFPQVTS 533

Query: 1965 TVKPLGLVG--PSEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTREQTSNDP-P 2135
             VKP+G VG  P+EPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQD R+   ++P P
Sbjct: 534  LVKPVGHVGPAPAEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDLRDYALSEPSP 593

Query: 2136 FPVRPALQISMPPVQSHGSYFTMEEEMSPRKMNR--AREFPIEPEALHFDKNRSPHPSFY 2309
            FPVR  +Q+S+P V S  S+  +EEEMSPR++NR  ARE+P++ E L  +K++  HP F+
Sbjct: 594  FPVRTPIQVSVPRVPSRRSWHPVEEEMSPRQLNRAVAREYPLDAEPLQIEKHQPQHPPFF 653

Query: 2310 HGLENSLPPNRSLHENQRFPKELRHGDDRFRPNHPLPNYHS-SGDEMPLGRTSSGNRNLH 2486
              +E+S+P +R  HENQR PKE    DDR R N  L +YHS SG+E+PL R+SS NR+L 
Sbjct: 654  PKVESSIPSDRIFHENQRLPKEASQRDDRLRLNQILSSYHSFSGEEIPLSRSSSSNRDLD 713

Query: 2487 FESGRVTAQYPETPAGVLQEIAMKCGTKVEFRPALAATTELQFSIEVWFVGEKIGEGIGK 2666
            FESGR  +   ETPA  LQ+IA+KCGTKVEF+PAL A+TELQFSIE WF GEKIGEGIG 
Sbjct: 714  FESGRGVSN-AETPAVALQDIALKCGTKVEFKPALVASTELQFSIEAWFAGEKIGEGIGA 772

Query: 2667 TRKEAQRQAAESSLRTLANKYISRVSPDLNAVHGDSNKLSHKNDNGFLRDANSFGYPAFP 2846
            TR+EAQRQAAE S++ LA+ Y+SR+ PD  + HGD ++  + NDNGF  + NSFG P+ P
Sbjct: 773  TRREAQRQAAEGSIKKLADIYMSRIKPDSVSPHGDISRFPNANDNGFSANVNSFGSPSLP 832

Query: 2847 REDPLPIASTSDQSRFLDQRQEGSLKGMAPVSSLKELCMVEGLSLVFHAQPPLSTLSSHK 3026
            +ED +  ++ S+ SR LD R E S K M  VS+LKELCM+EGL +VF AQPP S  S  K
Sbjct: 833  KEDSVSYSTASESSRLLDPRPESSKKSMGSVSALKELCMMEGLGVVFQAQPPPSANSIQK 892

Query: 3027 GEVYAEVEIAGQILGKGIGLTWXXXXXXXXXXXXXXXXFMLSQGSQERLGSPRSSHGIPN 3206
             EV+A+VEI GQ+LGKGIGLTW                 ML    Q+R  SPRS  GIP 
Sbjct: 893  DEVHAQVEIDGQVLGKGIGLTWDEAKMQAAEKALGSLRSMLGHYPQKRQISPRSLQGIPG 952

Query: 3207 KRLKTEYPRTLQRVPSSTRYSKNGPPVP 3290
            KRLK E+PR LQR+PSS RY +N PPVP
Sbjct: 953  KRLKPEFPRVLQRMPSSGRYPRNAPPVP 980


>JAT50607.1 RNA polymerase II C-terminal domain phosphatase-like 1 [Anthurium
            amnicola]
          Length = 949

 Score = 1223 bits (3164), Expect = 0.0
 Identities = 633/961 (65%), Positives = 748/961 (77%), Gaps = 5/961 (0%)
 Frame = +3

Query: 420  MFKSAFYHGNSLLGEVEIYLKNPNIDIP-NKEIRISHFSPPSERCPPLAVLHTIAAGGAC 596
            M KS  YHGNS LGEV+IY +NPN+     +EIRISHFSPPSERCPPLAVL+TIA+ G C
Sbjct: 1    MLKSVIYHGNSPLGEVDIYHQNPNMTSAWTREIRISHFSPPSERCPPLAVLYTIASAGPC 60

Query: 597  FKMXXXXXXXXXXXXXXXXXXXXHTACLRENKTAIMQLGEEELHLVAMSSRKNFEQYACF 776
             ++                    H  C+RE KTAI+ LG+EELHLVAM  R+NFE+ +CF
Sbjct: 61   VRIESRCLQPDASPLFLL-----HATCVREKKTAIVLLGDEELHLVAMPYRRNFEKCSCF 115

Query: 777  WGFNVMPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQR 956
            WGFN + GLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRID+LQRKIN+E DPQR
Sbjct: 116  WGFNTVQGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDSLQRKINTETDPQR 175

Query: 957  ISGMLAEVKRYQDDKVILKQYAENDQVVENGKVIKVQSEIVPPLSDNHQPIIRPLIRLQE 1136
            +S +LAEVKRYQDD+ ILKQYAENDQVV+NGKV KVQSEIVPPLSD HQPI RP+IRLQE
Sbjct: 176  VSAILAEVKRYQDDQCILKQYAENDQVVDNGKVFKVQSEIVPPLSDGHQPISRPIIRLQE 235

Query: 1137 KNIILTRINPVIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVYVCTMAERDYALEMWRLL 1316
            KNIILTR+NP IRDTSVLVRLRPAW+DLRSYL ARGRKRFEVYVCTMAERDYALEMWRLL
Sbjct: 236  KNIILTRVNPAIRDTSVLVRLRPAWDDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLL 295

Query: 1317 DPDSNLINPKELLDRIVCVKAGLKKSLLNVFQDGICHPKMALVIDDRLKVWEDKDQPRVH 1496
            DP+SNLIN K+LLDRIVCVK+G +KSL NVFQDGICHPKMALVIDDRLKVW+DKDQPRVH
Sbjct: 296  DPESNLINSKQLLDRIVCVKSGSRKSLFNVFQDGICHPKMALVIDDRLKVWDDKDQPRVH 355

Query: 1497 VVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDVLLQWISDAFYEDEMIDF 1676
            VVPAFAPYYAPQAEAN A+PVLCVARNVACNVRGGFFKEFD+ LL  ISD FYEDE++DF
Sbjct: 356  VVPAFAPYYAPQAEANGAIPVLCVARNVACNVRGGFFKEFDEGLLSRISDVFYEDELVDF 415

Query: 1677 PPAPDVSNYLVSEDDASASNGNKDLLSFEGMTDVDVERRLKDANVQALQVSSAVNNIDPR 1856
            P APDV NYL+SEDD SASNGNKD +SFEGM D +VERRLK+AN       S ++N+DPR
Sbjct: 416  PSAPDVGNYLISEDDTSASNGNKDPVSFEGMADAEVERRLKEANFNVQANPSILSNLDPR 475

Query: 1857 LASSIQHVLASSSSMIPQTSQGPIS-FQNIQYPQAIPTVKPLGLVGPSEPSLQSSPAREE 2033
              SS+QHVLASS  M+ Q +   I+  +N QY Q   +VKP+   G  E SLQ SPAREE
Sbjct: 476  ALSSLQHVLASSGIMVTQGATQVIAPLRNNQYTQT-ASVKPVSQPGLPETSLQGSPAREE 534

Query: 2034 GEVPESELDPDTRRRLLILQHGQDTREQTSNDPPFPVRPALQISM-PPVQSHGSYFTMEE 2210
            GEVPESELDP+TRRRLLILQHGQDTRE   +D   P    +Q+S+ PPVQ HG+++ +EE
Sbjct: 535  GEVPESELDPNTRRRLLILQHGQDTREHIPSDSALPAIVPMQVSVTPPVQPHGNWYPLEE 594

Query: 2211 EMSPRKMNRA-REFPIEPEALHFDKNRSPHPSFYHGLENSLPPNRSLHENQRFPKELRHG 2387
            EMSPRKMNR+ +EFP EP+A+ FDK+RS H SF+H +ENS+ P R++HE+QR  KE+  G
Sbjct: 595  EMSPRKMNRSPKEFPFEPDAVRFDKSRS-HHSFFHRMENSIRPERTIHESQRLSKEVYIG 653

Query: 2388 DDRFRPNHPLPNYHS-SGDEMPLGRTSSGNRNLHFESGRVTAQYPETPAGVLQEIAMKCG 2564
            D + R +H + NYHS  G+++P+GR SS NR+ HFESGRVT+QY +TPAGVL +IA KCG
Sbjct: 654  DGKLRLDHAVSNYHSVPGEDVPVGRMSSNNRDAHFESGRVTSQYSDTPAGVLLDIATKCG 713

Query: 2565 TKVEFRPALAATTELQFSIEVWFVGEKIGEGIGKTRKEAQRQAAESSLRTLANKYISRVS 2744
             KVEFR ALA TTELQFSIEVWFVGEKIGEGIG+TRK+AQ QAAESSLR LANKY+S +S
Sbjct: 714  NKVEFRTALADTTELQFSIEVWFVGEKIGEGIGRTRKDAQHQAAESSLRYLANKYLSTIS 773

Query: 2745 PDLNAVHGDSNKLSHKNDNGFLRDANSFGYPAFPREDPLPIASTSDQSRFLDQRQEGSLK 2924
             D  ++HG+   +S   +NG + + +SF YPA   +D L   STS+  +F + R + S  
Sbjct: 774  LDPGSIHGE---VSLFKENGSVSNKSSFWYPASQGDDHL--TSTSEPVQFQELRVDRSKA 828

Query: 2925 GMAPVSSLKELCMVEGLSLVFHAQPPLSTLSSHKGEVYAEVEIAGQILGKGIGLTWXXXX 3104
              A +++LKELC++EG ++ F  QP +S+    KGEV A+VEIAGQ+LGKG G+TW    
Sbjct: 829  STAAIATLKELCILEGFNMTFQKQPAVSSDLVPKGEVIAQVEIAGQLLGKGSGMTWEEAK 888

Query: 3105 XXXXXXXXXXXXFMLSQGSQERLGSPRSSHGIPNKRLKTEYPRTLQRVPSSTRYSKNGPP 3284
                         M  Q +Q+R  SPR +H I +KRLK +  RTLQRVPS+ RY K+  P
Sbjct: 889  LQAALEAIGNLQSMRGQFTQKRSASPRQAHTISSKRLKQDLSRTLQRVPSA-RYFKDDAP 947

Query: 3285 V 3287
            V
Sbjct: 948  V 948


>XP_008371347.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            [Malus domestica]
          Length = 960

 Score = 1222 bits (3163), Expect = 0.0
 Identities = 635/962 (66%), Positives = 742/962 (77%), Gaps = 11/962 (1%)
 Frame = +3

Query: 438  YHGNSLLGEVEIYL----KNPNIDIPNKEIRISHFSPPSERCPPLAVLHTI-AAGGACFK 602
            Y G  LLGEVEIY      N N+    KEIRIS+FS PSERCPP+AVLHTI ++ G CFK
Sbjct: 6    YKGEDLLGEVEIYPTVNENNKNVQDVLKEIRISYFSQPSERCPPVAVLHTINSSNGVCFK 65

Query: 603  MXXXXXXXXXXXXXXXXXXXXHTACLRENKTAIMQLGEEELHLVAMSSRKNFEQYACFWG 782
            M                    H++  +ENKTA+M LG EELHLVAM SR   +Q+ CFWG
Sbjct: 66   MMESKTSPLSSPDTPLFLL--HSSMTQENKTAVMPLGGEELHLVAMQSRNGGKQFPCFWG 123

Query: 783  FNVMPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRIS 962
            F V  GLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRI+ALQRKI++EVDP RIS
Sbjct: 124  FYVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKISTEVDPLRIS 183

Query: 963  GMLAEVKRYQDDKVILKQYAENDQVVENGKVIKVQSEIVPPLSDNHQPIIRPLIRLQEKN 1142
            GMLAE+KRYQDDK ILKQYAENDQVV+NG+V+K QSE+VP LSDNHQPIIRPLIRL EKN
Sbjct: 184  GMLAEIKRYQDDKFILKQYAENDQVVDNGRVVKTQSEVVPALSDNHQPIIRPLIRLHEKN 243

Query: 1143 IILTRINPVIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVYVCTMAERDYALEMWRLLDP 1322
            IILTRINP IRDTSVLVRLRPAWEDLRSYL ARGRKRFEVYVCTMAERDYALEMWRLLDP
Sbjct: 244  IILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP 303

Query: 1323 DSNLINPKELLDRIVCVKAGLKKSLLNVFQDGICHPKMALVIDDRLKVWEDKDQPRVHVV 1502
            DSNLIN  +LLDRIVCVK+G +KSL NVFQ+ +CHPKMALVIDDRLKVW+++DQPRVHVV
Sbjct: 304  DSNLINSNKLLDRIVCVKSGSRKSLFNVFQESLCHPKMALVIDDRLKVWDERDQPRVHVV 363

Query: 1503 PAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDVLLQWISDAFYEDEMIDFPP 1682
            PAFAPYYAPQAEANN VPVLCVARNVACNVRGGFFKEFDD LLQ I + FYED++ D  P
Sbjct: 364  PAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDSLLQKIPEFFYEDDIKDV-P 422

Query: 1683 APDVSNYLVSEDDASASNGNKDLLSFEGMTDVDVERRLKDANVQALQVSSAVNNIDPRLA 1862
            +PDVSN+LVSEDD SA NGN+D L+F+GM D +VERRLK+A   AL  SS V NIDPRLA
Sbjct: 423  SPDVSNHLVSEDDPSALNGNRDPLTFDGMADAEVERRLKEATSAALTASSVVTNIDPRLA 482

Query: 1863 SSIQHVLASSSS--MIPQTSQGPISFQNIQYPQAIPTVKPLGLVGPSEPSLQSSPAREEG 2036
             S+Q+ +A SSS   +P + Q P++F NIQ+PQ    VKPLG +G +EPSL SSPAREEG
Sbjct: 483  -SLQYSMAPSSSTTSLPSSQQSPMTFPNIQFPQGASVVKPLGHLGAAEPSLHSSPAREEG 541

Query: 2037 EVPESELDPDTRRRLLILQHGQDTREQTSNDPPFPVRPALQISMPPVQSHGSYFTMEEEM 2216
            EVPESELDPDTRRRLLILQHGQDTRE   ++PPF VRP +Q S+P VQ    +F +EEEM
Sbjct: 542  EVPESELDPDTRRRLLILQHGQDTREPPPSEPPFAVRPPVQASVPRVQPRPGWFPVEEEM 601

Query: 2217 SPRKMNRA--REFPIEPEALHFDKNRSPHPSFYHGLENSLPPNRSLHENQRFPKELRHGD 2390
            SPR+++R   +E P++P+ +  +K+R  H SF+  ++NS+P +R L ENQRFPKE  H D
Sbjct: 602  SPRQLSRTVPKELPLDPDPMQIEKHRPHHSSFFSKVDNSIPSDRILQENQRFPKEAFHRD 661

Query: 2391 DRFRPNHPLPNYHS-SGDEMPLGRTSSGNRNLHFESGRVTAQYPETPAGVLQEIAMKCGT 2567
            DR R NH    YHS SG+E+PL R+ S NR++ FESGR  +   ETPAG LQEIAMKCG 
Sbjct: 662  DRLRFNHASAGYHSVSGEEIPLSRSPSMNRDVDFESGRAISN-AETPAGALQEIAMKCGA 720

Query: 2568 KVEFRPALAATTELQFSIEVWFVGEKIGEGIGKTRKEAQRQAAESSLRTLANKYISRVSP 2747
            KVEFRPAL A+TELQF +E WF GEKIGEG GKTR+EA  QAAE SL+ LAN Y+SRV P
Sbjct: 721  KVEFRPALVASTELQFYVEAWFAGEKIGEGTGKTRREAHFQAAEGSLKNLANIYLSRVKP 780

Query: 2748 DLNAVHGDSNKLSHKNDNGFLRDANSFGYPAFPREDPLPIASTSDQSRFLDQRQEGSLKG 2927
            D   VHG+ +K S+ N+NGF+ +ANSFG  +FP+E+ L  +++S+ SR LD R EG  K 
Sbjct: 781  DSVPVHGEMSKFSNANNNGFVGNANSFGIQSFPKEESLSSSTSSEPSRPLDPRLEGFQKS 840

Query: 2928 MAPVSSLKELCMVEGL-SLVFHAQPPLSTLSSHKGEVYAEVEIAGQILGKGIGLTWXXXX 3104
            M  VS+LKELCM+EGL  +VF  +PP S  S  K EV+ +VEI G++LGKGIGLTW    
Sbjct: 841  MNSVSALKELCMIEGLGGVVFQPRPPPSANSVEKDEVHVQVEIDGEVLGKGIGLTWDEAK 900

Query: 3105 XXXXXXXXXXXXFMLSQGSQERLGSPRSSHGIPNKRLKTEYPRTLQRVPSSTRYSKNGPP 3284
                          L   +Q+R GSPRS  G+PNKR+K E+P+ LQR+PSS RY KN PP
Sbjct: 901  MQAAEKALGSLRSTLF--AQKRQGSPRSFQGMPNKRMKQEFPQVLQRMPSSARYPKNAPP 958

Query: 3285 VP 3290
            VP
Sbjct: 959  VP 960


>XP_002305017.2 hypothetical protein POPTR_0004s04010g [Populus trichocarpa]
            EEE85528.2 hypothetical protein POPTR_0004s04010g
            [Populus trichocarpa]
          Length = 996

 Score = 1221 bits (3158), Expect = 0.0
 Identities = 642/998 (64%), Positives = 747/998 (74%), Gaps = 41/998 (4%)
 Frame = +3

Query: 420  MFKSAFYHGNSLLGEVEIYLK--------NPN-----IDIPNKEIRISHFSPPSERCPPL 560
            M+KS  Y G+ LLGEVEIY +        N N     ID   KEIRISHFS  SERCPPL
Sbjct: 1    MYKSVVYKGDELLGEVEIYAQEQQQEEEENKNKKKRVIDEIVKEIRISHFSQTSERCPPL 60

Query: 561  AVLHTIAAGGACFKMXXXXXXXXXXXXXXXXXXXX-HTACLRENKTAIMQLGEEELHLVA 737
            AVLHTI + G CFKM                     H++C++ENKTA+M LG EELHLVA
Sbjct: 61   AVLHTITSIGVCFKMEESTSSSTTKISQQESPLHLLHSSCIQENKTAVMHLGGEELHLVA 120

Query: 738  MSSRKNFEQYACFWGFNVMPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDA 917
            M SR N  Q+ CFWGF+V PGLY+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDA
Sbjct: 121  MPSRSNERQHPCFWGFSVAPGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDA 180

Query: 918  LQRKINSEVDPQRISGMLAEVKRYQDDKVILKQYAENDQVVENGKVIKVQSEIVPPLSDN 1097
            LQRKI++EVDPQRI GML+EVKRY DDK ILKQY ENDQVVENGKVIK QSE+VP LSDN
Sbjct: 181  LQRKISTEVDPQRILGMLSEVKRYHDDKNILKQYVENDQVVENGKVIKTQSEVVPALSDN 240

Query: 1098 HQPIIRPLIRLQEKNIILTRINPVIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVYVCTM 1277
            HQP++RPLIRLQEKNIILTRINP IRDTSVLVRLRPAWEDLRSYL ARGRKRFEVYVCTM
Sbjct: 241  HQPMVRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTM 300

Query: 1278 AERDYALEMWRLLDPDSNLINPKELLDRIVCVKAGLKKSLLNVFQDGICHPKMALVIDDR 1457
            AERDYALEMWRLLDP+SNLIN KELLDRIVCVK+GL+KSL NVFQDGICHPKMALVIDDR
Sbjct: 301  AERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLRKSLFNVFQDGICHPKMALVIDDR 360

Query: 1458 LKVWEDKDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDVLLQW 1637
            LKVW+++DQ RVHVVPAFAPYYAPQAE NNAVPVLCVARNVACNVRGGFFKEFD+ LLQ 
Sbjct: 361  LKVWDERDQSRVHVVPAFAPYYAPQAEVNNAVPVLCVARNVACNVRGGFFKEFDEGLLQK 420

Query: 1638 ISDAFYEDEMIDFPPAPDVSNYLVSEDDASASNGNKDLLSFEGMTDVDVERRLKDA---- 1805
            I +  YED+  + P  PDVSNYLVSEDDASA NGN+D LSF+GM D +VER+LK+A    
Sbjct: 421  IPEVAYEDDTDNIPSPPDVSNYLVSEDDASAVNGNRDQLSFDGMADAEVERQLKEAVSAS 480

Query: 1806 NVQALQVSSAVNNIDPRLASSIQHVLASSSSMIPQT------SQGPI------------- 1928
            +     + S V+++DPRL  S+Q+ +ASSSS +P +      SQ P+             
Sbjct: 481  SAILSTIPSTVSSLDPRLLQSLQYTIASSSSSMPTSQPSMLASQQPMPALQPPKPPSQLS 540

Query: 1929 --SFQNIQYPQAIPTVKPLGLVGPSEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQ 2102
               F N Q+PQ  P+VK LG V P EPSLQSSPAREEGEVPESELDPDTRRRLLILQHG 
Sbjct: 541  MTPFPNTQFPQVAPSVKQLGQVVPPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGH 600

Query: 2103 DTREQTSNDPPFPVRPALQISMPPVQSHGSYFTMEEEMSPRKMNRA-REFPIEPEALHFD 2279
            D+R+   ++ PFP RP+ Q+S P VQS GS+  +EEEMSPR++NR  REFP++ + ++ +
Sbjct: 601  DSRDNAPSESPFPARPSTQVSAPRVQSVGSWVPVEEEMSPRQLNRTPREFPLDSDPMNIE 660

Query: 2280 KNRSPHPSFYHGLENSLPPNRSLHENQRFPKELRHGDDRFRPNHPLPNYHS-SGDEMPLG 2456
            K+R+ HPSF+H +E+++P +R +HENQR PKE  + DDR + NH   NY S  G+E PL 
Sbjct: 661  KHRTHHPSFFHKVESNIPSDRMIHENQRQPKEATYRDDRMKLNHSTSNYPSFQGEESPLS 720

Query: 2457 RTSSGNRNLHFESGRVTAQYPETPAGVLQEIAMKCGTKVEFRPALAATTELQFSIEVWFV 2636
            R+SS NR+L  ES R  +   ETP  VLQEIAMKCGTKVEFRPAL AT++LQFSIE WFV
Sbjct: 721  RSSS-NRDLDLESERAFSS-TETPVEVLQEIAMKCGTKVEFRPALIATSDLQFSIETWFV 778

Query: 2637 GEKIGEGIGKTRKEAQRQAAESSLRTLANKYISRVSPDLNAVHGDSNKLSHKNDNGFLRD 2816
            GEK+GEG GKTR+EAQRQAAE S++ LA  Y+SRV PD   + GDS++    NDNGFL D
Sbjct: 779  GEKVGEGTGKTRREAQRQAAEGSIKKLAGIYMSRVKPDSGPMLGDSSRYPSANDNGFLGD 838

Query: 2817 ANSFGYPAFPREDPLPIASTSDQSRFLDQRQEGSLKGMAPVSSLKELCMVEGLSLVFHAQ 2996
             NSFG     +++ +  ++TS+ SR LDQR EGS K M  V++LKE CM EGL + F AQ
Sbjct: 839  MNSFGNQPLLKDENITYSATSEPSRLLDQRLEGSKKSMGSVTALKEFCMTEGLGVNFLAQ 898

Query: 2997 PPLSTLSSHKGEVYAEVEIAGQILGKGIGLTWXXXXXXXXXXXXXXXXFMLSQGSQERLG 3176
             PLST S    EV+A+VEI GQ+LGKGIGLTW                 M  Q + +R G
Sbjct: 899  TPLSTNSIPGEEVHAQVEIDGQVLGKGIGLTWDEAKMQAAEKALGSLRTMFGQYTPKRQG 958

Query: 3177 SPRSSHGIPNKRLKTEYPRTLQRVPSSTRYSKNGPPVP 3290
            SPR   G+PNKRLK E+PR LQR+PSS RY KN  PVP
Sbjct: 959  SPRLMQGMPNKRLKQEFPRVLQRMPSSARYHKNASPVP 996


>XP_012091568.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            [Jatropha curcas] XP_012091569.1 PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 1
            [Jatropha curcas]
          Length = 976

 Score = 1219 bits (3155), Expect = 0.0
 Identities = 629/977 (64%), Positives = 741/977 (75%), Gaps = 20/977 (2%)
 Frame = +3

Query: 420  MFKSAFYHGNSLLGEVEIYL-----------KNPNID--IPNKEIRISHFSPPSERCPPL 560
            M+KSA Y G  LLGEVEIY            K   ID  +  KEIRISHFS PSERCPPL
Sbjct: 7    MYKSAVYKGEELLGEVEIYPQQHQQQEEENNKKKLIDEILMGKEIRISHFSQPSERCPPL 66

Query: 561  AVLHTIAAGGACFKMXXXXXXXXXXXXXXXXXXXXHTACLRENKTAIMQLGEEELHLVAM 740
            AVLHTI  G  CFKM                    H++C++ENKTA++ LG EELHLVA+
Sbjct: 67   AVLHTITCG-MCFKMESKNSLSLDTPLHLL-----HSSCIQENKTAVVPLGGEELHLVAI 120

Query: 741  SSRKNFEQYACFWGFNVMPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDAL 920
             SR N  QY CFWGFNV  GLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRI+AL
Sbjct: 121  YSRNNERQYPCFWGFNVSAGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL 180

Query: 921  QRKINSEVDPQRISGMLAEVKRYQDDKVILKQYAENDQVVENGKVIKVQSEIVPPLSDNH 1100
            QRKIN+EVDPQRI+GML+EVKRYQDDK ILKQY ENDQV+ENG+VIK Q E+VP LSDNH
Sbjct: 181  QRKINTEVDPQRIAGMLSEVKRYQDDKTILKQYVENDQVIENGRVIKTQFEVVPALSDNH 240

Query: 1101 QPIIRPLIRLQEKNIILTRINPVIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVYVCTMA 1280
            Q I+RPLIRLQE+NIILTRINP IRDTSVLVRLRPAWEDLRSYL ARGRKRFEVYVCTMA
Sbjct: 241  QTIVRPLIRLQERNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMA 300

Query: 1281 ERDYALEMWRLLDPDSNLINPKELLDRIVCVKAGLKKSLLNVFQDGICHPKMALVIDDRL 1460
            ERDYALEMWRLLDP+SNLI+ KELLDRIVCVK+GL+KSL NVFQDG+CHPKMALVIDDRL
Sbjct: 301  ERDYALEMWRLLDPESNLISSKELLDRIVCVKSGLRKSLFNVFQDGVCHPKMALVIDDRL 360

Query: 1461 KVWEDKDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDVLLQWI 1640
            KVW++KDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFD+ LLQ I
Sbjct: 361  KVWDEKDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDEGLLQRI 420

Query: 1641 SDAFYEDEMIDFPPAPDVSNYLVSEDDASASNGNKDLLSFEGMTDVDVERRLKDANVQAL 1820
             D  YED+  D P  PDVS+YL+SEDDAS SNG++D LSF+GM D +VE+RLK+A   A 
Sbjct: 421  PDISYEDDFNDIPSPPDVSSYLISEDDASTSNGHRDPLSFDGMADAEVEKRLKEAISAAS 480

Query: 1821 QVSSAVNNIDPRLASSIQHVLASSSSMIPQTSQGPI--SFQNIQYPQAIPTVKPLGLVGP 1994
               + VNN+DPR+  ++Q+ LASSSS IP ++  P+   F NIQ+PQA   VKPL  VGP
Sbjct: 481  LFPATVNNLDPRVIPALQYSLASSSSSIPVSTSQPLVMPFSNIQFPQAASLVKPLAQVGP 540

Query: 1995 SEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTREQTSNDPPFPVRPALQISMPP 2174
             EPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTR+  S++   PVRP++Q+S+P 
Sbjct: 541  PEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDNVSSESQIPVRPSMQVSVPR 600

Query: 2175 VQSHGSYFTMEEEMSPRKMNRA--REFPIEPEALHFDKNRSPHPSFYHGLENSLPPNR-- 2342
            VQS GS+  +EEEMSPR++N    REFP+E E +H +K++  HPSF+  +EN +  +R  
Sbjct: 601  VQSRGSWVPVEEEMSPRQLNLTVPREFPLELEPMHIEKHQPHHPSFFPKVENPISSDRMG 660

Query: 2343 SLHENQRFPKELRHGDDRFRPNHPLPNYHS-SGDEMPLGRTSSGNRNLHFESGRVTAQYP 2519
             ++EN R PK   + DDR R NH + NYH  SG+E+PL R+SS NR+  FES R  +   
Sbjct: 661  MVNENLRLPKAAPYRDDRLRSNHTMANYHPLSGEEIPLSRSSSSNRDPDFESERAVSS-A 719

Query: 2520 ETPAGVLQEIAMKCGTKVEFRPALAATTELQFSIEVWFVGEKIGEGIGKTRKEAQRQAAE 2699
            ETP   LQEIAMKCG KVEFR +L  + +LQFS E WF GE++GEGIGKTR+EAQR AAE
Sbjct: 720  ETPVEALQEIAMKCGAKVEFRASLVDSRDLQFSTEAWFAGERVGEGIGKTRREAQRLAAE 779

Query: 2700 SSLRTLANKYISRVSPDLNAVHGDSNKLSHKNDNGFLRDANSFGYPAFPREDPLPIASTS 2879
            SS++ LAN Y+ R  PD  A+HGD+++ S  NDNG+L + NSFG    P+++P+  ++ S
Sbjct: 780  SSIKNLANIYMQRAKPDNGAMHGDASRYSSANDNGYLGNVNSFGSQPLPKDEPVSSSAAS 839

Query: 2880 DQSRFLDQRQEGSLKGMAPVSSLKELCMVEGLSLVFHAQPPLSTLSSHKGEVYAEVEIAG 3059
            +Q R  D R + S K +  V++LKE CM+EGL L F +  PLS+ S  K EVYA+VEI G
Sbjct: 840  EQLRLPDPRLDSSKKAVGSVTALKEFCMMEGLGLNFLSPTPLSSNSLQKDEVYAQVEIDG 899

Query: 3060 QILGKGIGLTWXXXXXXXXXXXXXXXXFMLSQGSQERLGSPRSSHGIPNKRLKTEYPRTL 3239
            Q++GKGIG TW                 M  Q + +R GSPR + G+ NKRLK E+PR L
Sbjct: 900  QVMGKGIGSTWDEAKMQAAERALGSLRTMFGQFTPKRQGSPRPTQGMSNKRLKPEFPRGL 959

Query: 3240 QRVPSSTRYSKNGPPVP 3290
            QR+PSSTRY KN PPVP
Sbjct: 960  QRMPSSTRYPKNAPPVP 976


>KDP20941.1 hypothetical protein JCGZ_21412 [Jatropha curcas]
          Length = 970

 Score = 1219 bits (3155), Expect = 0.0
 Identities = 629/977 (64%), Positives = 741/977 (75%), Gaps = 20/977 (2%)
 Frame = +3

Query: 420  MFKSAFYHGNSLLGEVEIYL-----------KNPNID--IPNKEIRISHFSPPSERCPPL 560
            M+KSA Y G  LLGEVEIY            K   ID  +  KEIRISHFS PSERCPPL
Sbjct: 1    MYKSAVYKGEELLGEVEIYPQQHQQQEEENNKKKLIDEILMGKEIRISHFSQPSERCPPL 60

Query: 561  AVLHTIAAGGACFKMXXXXXXXXXXXXXXXXXXXXHTACLRENKTAIMQLGEEELHLVAM 740
            AVLHTI  G  CFKM                    H++C++ENKTA++ LG EELHLVA+
Sbjct: 61   AVLHTITCG-MCFKMESKNSLSLDTPLHLL-----HSSCIQENKTAVVPLGGEELHLVAI 114

Query: 741  SSRKNFEQYACFWGFNVMPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDAL 920
             SR N  QY CFWGFNV  GLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRI+AL
Sbjct: 115  YSRNNERQYPCFWGFNVSAGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL 174

Query: 921  QRKINSEVDPQRISGMLAEVKRYQDDKVILKQYAENDQVVENGKVIKVQSEIVPPLSDNH 1100
            QRKIN+EVDPQRI+GML+EVKRYQDDK ILKQY ENDQV+ENG+VIK Q E+VP LSDNH
Sbjct: 175  QRKINTEVDPQRIAGMLSEVKRYQDDKTILKQYVENDQVIENGRVIKTQFEVVPALSDNH 234

Query: 1101 QPIIRPLIRLQEKNIILTRINPVIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVYVCTMA 1280
            Q I+RPLIRLQE+NIILTRINP IRDTSVLVRLRPAWEDLRSYL ARGRKRFEVYVCTMA
Sbjct: 235  QTIVRPLIRLQERNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMA 294

Query: 1281 ERDYALEMWRLLDPDSNLINPKELLDRIVCVKAGLKKSLLNVFQDGICHPKMALVIDDRL 1460
            ERDYALEMWRLLDP+SNLI+ KELLDRIVCVK+GL+KSL NVFQDG+CHPKMALVIDDRL
Sbjct: 295  ERDYALEMWRLLDPESNLISSKELLDRIVCVKSGLRKSLFNVFQDGVCHPKMALVIDDRL 354

Query: 1461 KVWEDKDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDVLLQWI 1640
            KVW++KDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFD+ LLQ I
Sbjct: 355  KVWDEKDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDEGLLQRI 414

Query: 1641 SDAFYEDEMIDFPPAPDVSNYLVSEDDASASNGNKDLLSFEGMTDVDVERRLKDANVQAL 1820
             D  YED+  D P  PDVS+YL+SEDDAS SNG++D LSF+GM D +VE+RLK+A   A 
Sbjct: 415  PDISYEDDFNDIPSPPDVSSYLISEDDASTSNGHRDPLSFDGMADAEVEKRLKEAISAAS 474

Query: 1821 QVSSAVNNIDPRLASSIQHVLASSSSMIPQTSQGPI--SFQNIQYPQAIPTVKPLGLVGP 1994
               + VNN+DPR+  ++Q+ LASSSS IP ++  P+   F NIQ+PQA   VKPL  VGP
Sbjct: 475  LFPATVNNLDPRVIPALQYSLASSSSSIPVSTSQPLVMPFSNIQFPQAASLVKPLAQVGP 534

Query: 1995 SEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTREQTSNDPPFPVRPALQISMPP 2174
             EPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTR+  S++   PVRP++Q+S+P 
Sbjct: 535  PEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDNVSSESQIPVRPSMQVSVPR 594

Query: 2175 VQSHGSYFTMEEEMSPRKMNRA--REFPIEPEALHFDKNRSPHPSFYHGLENSLPPNR-- 2342
            VQS GS+  +EEEMSPR++N    REFP+E E +H +K++  HPSF+  +EN +  +R  
Sbjct: 595  VQSRGSWVPVEEEMSPRQLNLTVPREFPLELEPMHIEKHQPHHPSFFPKVENPISSDRMG 654

Query: 2343 SLHENQRFPKELRHGDDRFRPNHPLPNYHS-SGDEMPLGRTSSGNRNLHFESGRVTAQYP 2519
             ++EN R PK   + DDR R NH + NYH  SG+E+PL R+SS NR+  FES R  +   
Sbjct: 655  MVNENLRLPKAAPYRDDRLRSNHTMANYHPLSGEEIPLSRSSSSNRDPDFESERAVSS-A 713

Query: 2520 ETPAGVLQEIAMKCGTKVEFRPALAATTELQFSIEVWFVGEKIGEGIGKTRKEAQRQAAE 2699
            ETP   LQEIAMKCG KVEFR +L  + +LQFS E WF GE++GEGIGKTR+EAQR AAE
Sbjct: 714  ETPVEALQEIAMKCGAKVEFRASLVDSRDLQFSTEAWFAGERVGEGIGKTRREAQRLAAE 773

Query: 2700 SSLRTLANKYISRVSPDLNAVHGDSNKLSHKNDNGFLRDANSFGYPAFPREDPLPIASTS 2879
            SS++ LAN Y+ R  PD  A+HGD+++ S  NDNG+L + NSFG    P+++P+  ++ S
Sbjct: 774  SSIKNLANIYMQRAKPDNGAMHGDASRYSSANDNGYLGNVNSFGSQPLPKDEPVSSSAAS 833

Query: 2880 DQSRFLDQRQEGSLKGMAPVSSLKELCMVEGLSLVFHAQPPLSTLSSHKGEVYAEVEIAG 3059
            +Q R  D R + S K +  V++LKE CM+EGL L F +  PLS+ S  K EVYA+VEI G
Sbjct: 834  EQLRLPDPRLDSSKKAVGSVTALKEFCMMEGLGLNFLSPTPLSSNSLQKDEVYAQVEIDG 893

Query: 3060 QILGKGIGLTWXXXXXXXXXXXXXXXXFMLSQGSQERLGSPRSSHGIPNKRLKTEYPRTL 3239
            Q++GKGIG TW                 M  Q + +R GSPR + G+ NKRLK E+PR L
Sbjct: 894  QVMGKGIGSTWDEAKMQAAERALGSLRTMFGQFTPKRQGSPRPTQGMSNKRLKPEFPRGL 953

Query: 3240 QRVPSSTRYSKNGPPVP 3290
            QR+PSSTRY KN PPVP
Sbjct: 954  QRMPSSTRYPKNAPPVP 970


>XP_017696775.1 PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain
            phosphatase-like 1 [Phoenix dactylifera]
          Length = 942

 Score = 1218 bits (3152), Expect = 0.0
 Identities = 634/950 (66%), Positives = 727/950 (76%), Gaps = 4/950 (0%)
 Frame = +3

Query: 420  MFKSAFYHGNSLLGEVEIYLKNPNIDIPNKEIRISHFSPPSERCPPLAVLHTIAAGGACF 599
            MFKSA YHGNSL+GE EI+ +N N     +EIRISHFSP SERC PLAVLHTIA+GG  F
Sbjct: 1    MFKSAVYHGNSLIGEAEIFPQNSNPGAWVREIRISHFSPSSERCLPLAVLHTIASGGVSF 60

Query: 600  KMXXXXXXXXXXXXXXXXXXXXHTACLRENKTAIMQLGEEELHLVAMSSRKNFEQYACFW 779
            KM                    H ACLRENKTA++ LG EELHLVAM   K       F 
Sbjct: 61   KMESRSPPSDESPLCSL-----HAACLRENKTAVIPLGGEELHLVAMFREKPHAP-CMFL 114

Query: 780  GFNVMPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRI 959
            GFNV  GLYNSCL MLNLRCLGIVFDLDETLIVANT+RSFEDRIDALQRK+++E DPQR+
Sbjct: 115  GFNVASGLYNSCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIDALQRKLSNETDPQRV 174

Query: 960  SGMLAEVKRYQDDKVILKQYAENDQVVENGKVIKVQSEIVPPLSDNHQPIIRPLIRLQEK 1139
            +GMLAE+KRYQDDK ILKQYAENDQVVENGKV KVQSE+VPPLSD+HQ I RP+IRLQEK
Sbjct: 175  TGMLAEIKRYQDDKSILKQYAENDQVVENGKVYKVQSEVVPPLSDSHQLITRPVIRLQEK 234

Query: 1140 NIILTRINPVIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVYVCTMAERDYALEMWRLLD 1319
            NIILTR+NP+IRDTSVLVRLRPAWE+LRSYLIARGRKRFEVYVCTMAERDYALEMWRLLD
Sbjct: 235  NIILTRVNPLIRDTSVLVRLRPAWEELRSYLIARGRKRFEVYVCTMAERDYALEMWRLLD 294

Query: 1320 PDSNLINPKELLDRIVCVKAGLKKSLLNVFQDGICHPKMALVIDDRLKVWEDKDQPRVHV 1499
            PDS+LI+  +LLDRIVCVK+G +KSLL+VFQDGICHPKMALVIDDRLKVW++KDQPRVH 
Sbjct: 295  PDSSLISSIQLLDRIVCVKSGSRKSLLSVFQDGICHPKMALVIDDRLKVWDEKDQPRVHC 354

Query: 1500 VPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDVLLQWISDAFYEDEMIDFP 1679
            VPAFAPYYAPQAEAN  VPVLCVARNVACNVRGGFFKEFD+ LL  ISD+FYEDE  DFP
Sbjct: 355  VPAFAPYYAPQAEANGNVPVLCVARNVACNVRGGFFKEFDEGLLPRISDSFYEDEWKDFP 414

Query: 1680 PAPDVSNYLVSEDDASASNGNKDLLSFEGMTDVDVERRLKDANVQALQVSSAVNNIDPRL 1859
             APDV NYL+SEDD + SNGNKD L FEGMTD +VERRLK+AN     +   VNN DPR 
Sbjct: 415  SAPDVGNYLISEDDNATSNGNKDQLCFEGMTDAEVERRLKEANCNVQAIHPMVNNFDPRS 474

Query: 1860 ASSIQHVLASSSSMIPQT-SQGPISFQNIQYPQAIPTVKPLGL-VGPSEPSLQSSPAREE 2033
             SSIQHV+ASSS+ +PQT +Q  +   N   PQ I   +PL    G  EPSLQ SPAREE
Sbjct: 475  VSSIQHVMASSSAALPQTATQAMMPLPNNNCPQPIALGRPLVCQSGLPEPSLQGSPAREE 534

Query: 2034 GEVPESELDPDTRRRLLILQHGQDTREQTSNDPPFPVRPALQISMPPVQSHGSYFTMEEE 2213
            GEVPESELDPDTRRRLLILQHGQDTR+ T   P F VR  L +++PPVQS G++F +EEE
Sbjct: 535  GEVPESELDPDTRRRLLILQHGQDTRDPT---PSFTVRSPLHVAVPPVQSRGNWFPLEEE 591

Query: 2214 MSPRKMNR-AREFPIEPEALHFDKNRSPHPSFYHGLENSLPPNRSLHENQRFPKELRHGD 2390
            M+PR+++R  +EF +EPE + F+K R  H S++   ENS+  +R LHEN+  P +L  GD
Sbjct: 592  MNPRQLSREPKEFTLEPETIRFNKKRPNHQSYFRSGENSISSDRVLHENRGLPMQLHQGD 651

Query: 2391 DRFRPNHPLPNYHS-SGDEMPLGRTSSGNRNLHFESGRVTAQYPETPAGVLQEIAMKCGT 2567
            DR RPNH   NY+S  G+EMP G  SS +++  FESGR TA+Y ETPAGVLQ IAMKCG 
Sbjct: 652  DRLRPNHAAANYNSFPGEEMPAGLISSSHKDTQFESGRATARYAETPAGVLQNIAMKCGA 711

Query: 2568 KVEFRPALAATTELQFSIEVWFVGEKIGEGIGKTRKEAQRQAAESSLRTLANKYISRVSP 2747
            KVEFR AL  TT LQFS+EVWFVG K+GEGIGKTRKEAQ+QAAE SLRTLANKY+S    
Sbjct: 712  KVEFRTALCDTTNLQFSMEVWFVGGKLGEGIGKTRKEAQQQAAEISLRTLANKYLSNARS 771

Query: 2748 DLNAVHGDSNKLSHKNDNGFLRDANSFGYPAFPREDPLPIASTSDQSRFLDQRQEGSLKG 2927
            D +  HGD  K  H  +NGF  D NSFGYPA  R+D LP+ASTS++SR +DQR EG  K 
Sbjct: 772  DPSXSHGDMLKPFHIKENGFTSDLNSFGYPACARDDVLPVASTSEESRLMDQRLEGPNKT 831

Query: 2928 MAPVSSLKELCMVEGLSLVFHAQPPLSTLSSHKGEVYAEVEIAGQILGKGIGLTWXXXXX 3107
             A V++LK+LC ++G +LVF AQ   S  S  KGEVYA+VE+AGQILGKG+G TW     
Sbjct: 832  AAAVAALKDLCTIKGFNLVFQAQSSPSAGSVSKGEVYAQVEVAGQILGKGVGTTWEEAKL 891

Query: 3108 XXXXXXXXXXXFMLSQGSQERLGSPRSSHGIPNKRLKTEYPRTLQRVPSS 3257
                        ML Q +Q+  GSPRS    PNKRLK ++ R LQR+PSS
Sbjct: 892  QAAEEALGALKSMLGQFTQKHSGSPRSLSATPNKRLKADFSRLLQRIPSS 941


>XP_011027882.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            [Populus euphratica] XP_011027883.1 PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 1
            [Populus euphratica]
          Length = 996

 Score = 1218 bits (3151), Expect = 0.0
 Identities = 639/998 (64%), Positives = 748/998 (74%), Gaps = 41/998 (4%)
 Frame = +3

Query: 420  MFKSAFYHGNSLLGEVEIYLK--------NPN-----IDIPNKEIRISHFSPPSERCPPL 560
            M+KS  Y G+ LLGEVEIY +        N N     ID   KEIRISHFS  SERCPPL
Sbjct: 1    MYKSVAYKGDELLGEVEIYAQEQQQEEEENKNKKKRVIDEIVKEIRISHFSQTSERCPPL 60

Query: 561  AVLHTIAAGGACFKMXXXXXXXXXXXXXXXXXXXX-HTACLRENKTAIMQLGEEELHLVA 737
            AVLHTI + G CFKM                     H++C++ENKTA+M LG EELHLVA
Sbjct: 61   AVLHTITSIGVCFKMEESTSSSTTKISQQESPLHLLHSSCIQENKTAVMHLGGEELHLVA 120

Query: 738  MSSRKNFEQYACFWGFNVMPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDA 917
            M SR N +Q+ CFWGF+V PGLY+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDA
Sbjct: 121  MLSRSNEKQHPCFWGFSVAPGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDA 180

Query: 918  LQRKINSEVDPQRISGMLAEVKRYQDDKVILKQYAENDQVVENGKVIKVQSEIVPPLSDN 1097
            LQRKI++E+DPQRI GML+EVKRYQDDK ILKQY ENDQVVENGKVIK QSE+VP LSDN
Sbjct: 181  LQRKISTELDPQRILGMLSEVKRYQDDKNILKQYVENDQVVENGKVIKTQSEVVPALSDN 240

Query: 1098 HQPIIRPLIRLQEKNIILTRINPVIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVYVCTM 1277
            HQP++RPLIRLQEKNIILTRINP IRDTSVLVRLRPAWEDLRSYL ARGRKRFEVYVCTM
Sbjct: 241  HQPMVRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTM 300

Query: 1278 AERDYALEMWRLLDPDSNLINPKELLDRIVCVKAGLKKSLLNVFQDGICHPKMALVIDDR 1457
            AERDYALEMWRLLDP+SNLIN KELLDRIVCVK+GL+KSL NVFQDGICHPKMALVIDDR
Sbjct: 301  AERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLRKSLFNVFQDGICHPKMALVIDDR 360

Query: 1458 LKVWEDKDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDVLLQW 1637
            LKVW+++DQ RVHVVPAFAPYYAPQAE NNAVPVLCVARNVACNVRGGFFKEFD+ LLQ 
Sbjct: 361  LKVWDERDQSRVHVVPAFAPYYAPQAEVNNAVPVLCVARNVACNVRGGFFKEFDEGLLQK 420

Query: 1638 ISDAFYEDEMIDFPPAPDVSNYLVSEDDASASNGNKDLLSFEGMTDVDVERRLKDA---- 1805
            I +  YED+  + P  PDVSNYLVSEDDASA NGN+D LSF+GM D +VER+LK+A    
Sbjct: 421  IPEVAYEDDTDNIPSPPDVSNYLVSEDDASAVNGNRDQLSFDGMADAEVERQLKEAVSSS 480

Query: 1806 NVQALQVSSAVNNIDPRLASSIQHVLASSSSMIPQT------SQGPI------------- 1928
            +     + S V+++DPRL  S+Q+ +ASSSS +P +      SQ P+             
Sbjct: 481  SAILSTIPSTVSSLDPRLLQSLQYTIASSSSSMPTSQPSMLASQQPMPALQPPKPPSQLS 540

Query: 1929 --SFQNIQYPQAIPTVKPLGLVGPSEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQ 2102
               F N Q+PQ  P++K LG V P EPSLQSSPAREEGEVPESELDPDTRRRLLILQHG 
Sbjct: 541  MTPFPNTQFPQVAPSIKQLGQVVPPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGH 600

Query: 2103 DTREQTSNDPPFPVRPALQISMPPVQSHGSYFTMEEEMSPRKMNRA-REFPIEPEALHFD 2279
            D+R+   ++ PFP RP+ Q++ P VQS GS+  +EEEMSPR++NR  REFP++ + ++ +
Sbjct: 601  DSRDNAPSESPFPARPSTQVAAPRVQSVGSWVPVEEEMSPRQLNRTPREFPLDSDLMNIE 660

Query: 2280 KNRSPHPSFYHGLENSLPPNRSLHENQRFPKELRHGDDRFRPNHPLPNYHS-SGDEMPLG 2456
            K+R  HPSF+H +E+++P +R +HENQR PKE  + DDR + NH   NY S  G+E PL 
Sbjct: 661  KHRPHHPSFFHKVESNIPSDRMIHENQRLPKEATYRDDRMKLNHSTSNYPSFQGEESPLS 720

Query: 2457 RTSSGNRNLHFESGRVTAQYPETPAGVLQEIAMKCGTKVEFRPALAATTELQFSIEVWFV 2636
            R+SS NR+L  ES R  +   ETPA VLQEIAMKCGTKVEFR AL AT++LQFSIE WF+
Sbjct: 721  RSSS-NRDLDLESERAFSS-TETPAEVLQEIAMKCGTKVEFRSALIATSDLQFSIETWFL 778

Query: 2637 GEKIGEGIGKTRKEAQRQAAESSLRTLANKYISRVSPDLNAVHGDSNKLSHKNDNGFLRD 2816
            GEK+GEG GKTR+EAQRQAAE S++ LA  Y+SR  PD   + GDS++    NDNGFL D
Sbjct: 779  GEKVGEGTGKTRREAQRQAAEGSIKKLAGIYMSRSKPDSGPMLGDSSRYPSANDNGFLGD 838

Query: 2817 ANSFGYPAFPREDPLPIASTSDQSRFLDQRQEGSLKGMAPVSSLKELCMVEGLSLVFHAQ 2996
             NSFG     +++ +  ++TS+ SR LDQR EGS K M  V++LKE CM EGL + F AQ
Sbjct: 839  MNSFGNQPLLKDENITYSATSEPSRLLDQRLEGSKKSMGSVTALKEFCMTEGLGVNFLAQ 898

Query: 2997 PPLSTLSSHKGEVYAEVEIAGQILGKGIGLTWXXXXXXXXXXXXXXXXFMLSQGSQERLG 3176
             PLST S    EV+A+VEI GQ+LGKGIGLTW                 M  Q + +R G
Sbjct: 899  TPLSTNSIPGEEVHAQVEIDGQVLGKGIGLTWDEAKMQAAEKALGSLRTMFGQYTPKRQG 958

Query: 3177 SPRSSHGIPNKRLKTEYPRTLQRVPSSTRYSKNGPPVP 3290
            SPR   G+PNKRLK E+PR LQR+PSS RY KN PPVP
Sbjct: 959  SPRLMQGMPNKRLKQEFPRVLQRMPSSARYHKNAPPVP 996

