BLASTX nr result

ID: Forsythia22_contig00000976 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00000976
         (4178 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011096251.1| PREDICTED: RNA polymerase II C-terminal doma...  1417   0.0  
ref|XP_012849005.1| PREDICTED: RNA polymerase II C-terminal doma...  1365   0.0  
ref|XP_012849004.1| PREDICTED: RNA polymerase II C-terminal doma...  1358   0.0  
ref|XP_011086104.1| PREDICTED: RNA polymerase II C-terminal doma...  1352   0.0  
emb|CDO99573.1| unnamed protein product [Coffea canephora]           1348   0.0  
ref|XP_009623032.1| PREDICTED: RNA polymerase II C-terminal doma...  1313   0.0  
ref|XP_009789678.1| PREDICTED: RNA polymerase II C-terminal doma...  1312   0.0  
ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal doma...  1284   0.0  
ref|XP_004232844.1| PREDICTED: RNA polymerase II C-terminal doma...  1273   0.0  
ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform...  1249   0.0  
ref|XP_002267987.3| PREDICTED: RNA polymerase II C-terminal doma...  1244   0.0  
ref|XP_008225045.1| PREDICTED: RNA polymerase II C-terminal doma...  1239   0.0  
ref|XP_012091568.1| PREDICTED: RNA polymerase II C-terminal doma...  1226   0.0  
gb|KDP20941.1| hypothetical protein JCGZ_21412 [Jatropha curcas]     1226   0.0  
ref|XP_008371347.1| PREDICTED: RNA polymerase II C-terminal doma...  1221   0.0  
ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citr...  1220   0.0  
ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal doma...  1219   0.0  
ref|XP_009336327.1| PREDICTED: RNA polymerase II C-terminal doma...  1218   0.0  
ref|XP_008383777.1| PREDICTED: RNA polymerase II C-terminal doma...  1215   0.0  
ref|XP_012455431.1| PREDICTED: RNA polymerase II C-terminal doma...  1211   0.0  

>ref|XP_011096251.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X1 [Sesamum indicum]
          Length = 951

 Score = 1417 bits (3667), Expect = 0.0
 Identities = 736/965 (76%), Positives = 803/965 (83%), Gaps = 1/965 (0%)
 Frame = -2

Query: 3739 MYGKSVVVYEGERVLGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTIT 3560
            MY K V VYEGERVLGE  ++P      VV G  L+EIRISHYS PSERCPPLAVLHTI 
Sbjct: 1    MYRKLVAVYEGERVLGEAELHPPD----VVLGNELREIRISHYSPPSERCPPLAVLHTIN 56

Query: 3559 STGIAFKLESSQSPQHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYEGQVSC 3380
            +TGI FKLES+   ++QDSPL++LHATCLRDNKTAV S+GG E+ LVAMHSRK EGQ  C
Sbjct: 57   ATGICFKLESTA--KNQDSPLSLLHATCLRDNKTAVASVGGGEIQLVAMHSRKCEGQYPC 114

Query: 3379 FWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDPQ 3200
            FWGFNVAS LYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL RKINSESDPQ
Sbjct: 115  FWGFNVASSLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINSESDPQ 174

Query: 3199 RVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRLQ 3020
            RVAGM+AE KRYQDDKS+LKQYAE+DQVIDNGKV+KSQSEVVPALSE H PIVRPLIRLQ
Sbjct: 175  RVAGMLAEVKRYQDDKSVLKQYAESDQVIDNGKVVKSQSEVVPALSETHQPIVRPLIRLQ 234

Query: 3019 DKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRL 2840
            D+NIILTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRL
Sbjct: 235  DRNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRL 294

Query: 2839 LDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRV 2660
            LDPESNLI+S ELL+RIVCVK+G RKSLFNVFQ GNCHPKMALVIDDRLKVWDEKDQPRV
Sbjct: 295  LDPESNLINSRELLDRIVCVKSGLRKSLFNVFQAGNCHPKMALVIDDRLKVWDEKDQPRV 354

Query: 2659 HVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIKE 2480
            HVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLL R+S VAYEDD+++
Sbjct: 355  HVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLPRISGVAYEDDMRD 414

Query: 2479 VPPPLDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSID 2300
            VP   DVSNYL SEDDPS SSGNKDS+GFDGMADAEVERR LKEA+SASST PL + ++D
Sbjct: 415  VPSSPDVSNYLISEDDPSASSGNKDSLGFDGMADAEVERR-LKEATSASSTVPLPIPNLD 473

Query: 2299 PRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNSP 2120
            PRI  AL            P T+ G A SFP              LAQ G AETTLQ+SP
Sbjct: 474  PRITPALHYAVPSSSFTVPPQTIHGSAMSFPGQQLSQVTTLLKPPLAQLGQAETTLQSSP 533

Query: 2119 AREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRVQSRGWFPV 1940
            AREEGEVPESELDPDTRRRLLILQHGQD R+H P E QFPARP MQVSVPRVQ RGWFPV
Sbjct: 534  AREEGEVPESELDPDTRRRLLILQHGQDMREHPPSESQFPARPSMQVSVPRVQPRGWFPV 593

Query: 1939 EEEMSPRQPNRVTXXXXXXXXXXXXXIDKHRSHHPPFIHKVESSVPPGRVLLESQKMPKE 1760
            EEEMSPRQ N+V              IDK+R+ HPPF+HKVE  +PPGRV LE+Q+  KE
Sbjct: 594  EEEMSPRQLNQV-----PPPNAESIPIDKNRARHPPFLHKVEPPIPPGRV-LENQRTQKE 647

Query: 1759 ALPREEPLRINQSLPNFHSLSGEDAPIAQPSSGNKDLDLEAGQIDLYPETPAGALQDIAY 1580
            ALPR + LR+NQSLP+FHS SGED  + +PSS NKDLDLEAGQID Y ET  GALQDIA+
Sbjct: 648  ALPRGDQLRLNQSLPDFHSFSGEDGSVNEPSSANKDLDLEAGQIDPYTETCTGALQDIAF 707

Query: 1579 KCGAKVEFRQALVSSMELQFSVEVLFAGEKIGEGIGRTXXXXXXXXXQGSLMNLADKYLS 1400
            KCGAKVEF+QALVSS ELQF VEVLFAGE+IGEG+GRT         +GSL+ LADKYLS
Sbjct: 708  KCGAKVEFKQALVSSTELQFFVEVLFAGERIGEGVGRTRREAQRQAAEGSLLCLADKYLS 767

Query: 1399 R-KPDSSSIPGDGSRFANPYDNGLIGDINSFGYQPLPKEEVASFSMASESPRILDPRLEA 1223
            + +PDSS + GDGSRFAN  DNG++ D +SFG+Q + KE    FS A+ +PRILDPR+EA
Sbjct: 768  QLRPDSSHVTGDGSRFANQKDNGVLSDTSSFGHQSMLKEGAVPFS-AAPTPRILDPRIEA 826

Query: 1222 SKKSMGSIAELKELCMMQGLGVAFQTQPQFSGNPGQKNEVYAQVEIDGQVLGKGIGLTWD 1043
            SKK MGSI+ LKELCM +GLGVAFQTQPQFS NPGQKNEVYAQVEI+GQVLGKGIGLTWD
Sbjct: 827  SKKPMGSISALKELCMTEGLGVAFQTQPQFSANPGQKNEVYAQVEINGQVLGKGIGLTWD 886

Query: 1042 EAKTQAAEKALGTLRSMPGQFSYKRQDSPRPLQGMPNKRFKPEFSRSFQRMPSSARYPKN 863
            EAK++AAEKALG L+SM GQF Y+ Q SPR  QGMPNKR K EFSR  QRMPSS RYPKN
Sbjct: 887  EAKSEAAEKALGALKSMLGQFPYRHQGSPRSAQGMPNKRVKQEFSRVPQRMPSSGRYPKN 946

Query: 862  ASPVP 848
             SPVP
Sbjct: 947  GSPVP 951


>ref|XP_012849005.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X2 [Erythranthe guttatus]
            gi|604315220|gb|EYU27926.1| hypothetical protein
            MIMGU_mgv1a000848mg [Erythranthe guttata]
          Length = 962

 Score = 1365 bits (3532), Expect = 0.0
 Identities = 703/968 (72%), Positives = 785/968 (81%), Gaps = 4/968 (0%)
 Frame = -2

Query: 3739 MYGKSVVVYEGERVLGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTIT 3560
            MY   VVVYEGERVLGE  +     QDGVV G+ LKEIRISHYS PSERCPPLAVLHTI 
Sbjct: 1    MYRNLVVVYEGERVLGEAEL---NLQDGVVLGKGLKEIRISHYSPPSERCPPLAVLHTIN 57

Query: 3559 STGIAFKLESSQSPQHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYEGQVSC 3380
            STGI FKLE++   ++Q+SPL+ LHA+CLRDNKTAV+ +GG E+ LVAMHSRKYEG   C
Sbjct: 58   STGICFKLEATT--KNQESPLSHLHASCLRDNKTAVVPIGGAEIQLVAMHSRKYEGGNPC 115

Query: 3379 FWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDPQ 3200
            FWGFNVAS +YNSCLVMLNLRCLGIVFDLDETL+VANTMRSFEDRIEAL RKINSESD Q
Sbjct: 116  FWGFNVASSVYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKINSESDQQ 175

Query: 3199 RVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRLQ 3020
            R +GM+AE KRYQDDK+ILKQYAE+DQVI+NGKVIKSQSEVVPALS  H PIVRPLIRLQ
Sbjct: 176  RASGMVAEVKRYQDDKNILKQYAESDQVIENGKVIKSQSEVVPALSGTHQPIVRPLIRLQ 235

Query: 3019 DKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRL 2840
            D+NIILTRINP IRDTSVLVRLRPAWE+LR+YLTARGRKRFEVFVCTMAERDYALEMWRL
Sbjct: 236  DRNIILTRINPLIRDTSVLVRLRPAWEELRNYLTARGRKRFEVFVCTMAERDYALEMWRL 295

Query: 2839 LDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRV 2660
            LDPE NLI+S ELLER+VCVK+G RKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRV
Sbjct: 296  LDPEFNLINSRELLERVVCVKSGFRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRV 355

Query: 2659 HVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIKE 2480
            HVVPAFAPYYAPQAEANNT+PVLCVARNVACNVRGGFFK+FDDGLLQ +S VAYEDDIK+
Sbjct: 356  HVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFKDFDDGLLQLISGVAYEDDIKD 415

Query: 2479 VPPPLDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSID 2300
            VP   DVSNYL SEDDPS S GNKDS+ +DGMADAEV+RR LK+A SASSTAP  + ++D
Sbjct: 416  VPSSPDVSNYLISEDDPSASGGNKDSLVYDGMADAEVQRR-LKDAISASSTAPSPIANLD 474

Query: 2299 PRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNSP 2120
            P +AS L            P T QGPA SFP+             L Q G  ETT ++SP
Sbjct: 475  PIVASVLHYMAPSSSFTAPPPTTQGPAMSFPSQQMHQVATLLKPPLVQLGQGETTSRSSP 534

Query: 2119 AREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRVQSRGWFPV 1940
            AREEGEVPESELDPDTRRR+LILQHGQD R  +P EPQFPAR PMQVSVPRVQ  GWFPV
Sbjct: 535  AREEGEVPESELDPDTRRRMLILQHGQDMRGPSPSEPQFPARTPMQVSVPRVQPHGWFPV 594

Query: 1939 EEEMSPRQPNRVT-XXXXXXXXXXXXXIDKHRSHHPPFIHKVESSVPPGRVLLESQKMPK 1763
            EEEMS RQPN+V               IDK+R HH PF+  VE S+PPGR+L ESQ++PK
Sbjct: 595  EEEMSSRQPNQVALPPKEFPLNVESLPIDKNRGHHSPFLQNVEPSIPPGRILPESQRLPK 654

Query: 1762 EALPREEPLRINQSLPNFHSLSGEDAPIAQPSSGNKDLDLEAGQIDLYPETPAGALQDIA 1583
            EA+PRE+ LR+NQSLP+FHS  GEDA +AQPSS NKD DLEAGQID Y ET  GALQDIA
Sbjct: 655  EAVPREDQLRLNQSLPDFHSFHGEDASVAQPSSANKDFDLEAGQIDPYIETCIGALQDIA 714

Query: 1582 YKCGAKVEFRQALVSSMELQFSVEVLFAGEKIGEGIGRTXXXXXXXXXQGSLMNLADKYL 1403
            +KCG KVEF+Q L+SS  LQF VEVLFAGE+IGEG+GRT         +GSL+ LADKYL
Sbjct: 715  FKCGTKVEFKQTLISSTGLQFFVEVLFAGERIGEGMGRTRREAQRQAAEGSLLYLADKYL 774

Query: 1402 SR-KPDSSSIPGDGSRFANPYDNGLIGDINSFGYQPLPKEEVASFSMASESPRILDPRLE 1226
            SR +PD + +PGDGSR  N  +NG   + NSFGYQPLP EE   FS  +  PRI+DPR E
Sbjct: 775  SRSRPDFNYVPGDGSRVGNQKENGFNSNANSFGYQPLPNEEGLPFSTVAAPPRIVDPRTE 834

Query: 1225 ASKKS-MGSIAELKELCMMQGLGVAFQTQPQFSGNPGQKNEVYAQVEIDGQVLGKGIGLT 1049
             SK+  MGSI  LKE C M+GLGV FQTQPQFS NPGQ+NEVYAQVE++GQVLGKGIGLT
Sbjct: 835  VSKRPIMGSITALKEFCTMEGLGVTFQTQPQFSANPGQRNEVYAQVEVNGQVLGKGIGLT 894

Query: 1048 WDEAKTQAAEKALGTLRSMPGQFSYKRQ-DSPRPLQGMPNKRFKPEFSRSFQRMPSSARY 872
            WDEA++QAAEKAL TL+SMPGQF Y+ Q  SPR +Q +PNKR K EF+R  QR+PS  RY
Sbjct: 895  WDEARSQAAEKALVTLKSMPGQFPYRHQGSSPRSMQSIPNKRVKQEFNRVSQRLPSFGRY 954

Query: 871  PKNASPVP 848
            P+N SPVP
Sbjct: 955  PRNGSPVP 962


>ref|XP_012849004.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X1 [Erythranthe guttatus]
          Length = 967

 Score = 1358 bits (3516), Expect = 0.0
 Identities = 703/973 (72%), Positives = 785/973 (80%), Gaps = 9/973 (0%)
 Frame = -2

Query: 3739 MYGKSVVVYEGERVLGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTIT 3560
            MY   VVVYEGERVLGE  +     QDGVV G+ LKEIRISHYS PSERCPPLAVLHTI 
Sbjct: 1    MYRNLVVVYEGERVLGEAEL---NLQDGVVLGKGLKEIRISHYSPPSERCPPLAVLHTIN 57

Query: 3559 STGIAFKLESSQSPQHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYEGQVSC 3380
            STGI FKLE++   ++Q+SPL+ LHA+CLRDNKTAV+ +GG E+ LVAMHSRKYEG   C
Sbjct: 58   STGICFKLEATT--KNQESPLSHLHASCLRDNKTAVVPIGGAEIQLVAMHSRKYEGGNPC 115

Query: 3379 FWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDPQ 3200
            FWGFNVAS +YNSCLVMLNLRCLGIVFDLDETL+VANTMRSFEDRIEAL RKINSESD Q
Sbjct: 116  FWGFNVASSVYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKINSESDQQ 175

Query: 3199 RVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRLQ 3020
            R +GM+AE KRYQDDK+ILKQYAE+DQVI+NGKVIKSQSEVVPALS  H PIVRPLIRLQ
Sbjct: 176  RASGMVAEVKRYQDDKNILKQYAESDQVIENGKVIKSQSEVVPALSGTHQPIVRPLIRLQ 235

Query: 3019 DKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRL 2840
            D+NIILTRINP IRDTSVLVRLRPAWE+LR+YLTARGRKRFEVFVCTMAERDYALEMWRL
Sbjct: 236  DRNIILTRINPLIRDTSVLVRLRPAWEELRNYLTARGRKRFEVFVCTMAERDYALEMWRL 295

Query: 2839 LDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRV 2660
            LDPE NLI+S ELLER+VCVK+G RKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRV
Sbjct: 296  LDPEFNLINSRELLERVVCVKSGFRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRV 355

Query: 2659 HVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIKE 2480
            HVVPAFAPYYAPQAEANNT+PVLCVARNVACNVRGGFFK+FDDGLLQ +S VAYEDDIK+
Sbjct: 356  HVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFKDFDDGLLQLISGVAYEDDIKD 415

Query: 2479 VPPPLDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSID 2300
            VP   DVSNYL SEDDPS S GNKDS+ +DGMADAEV+RR LK+A SASSTAP  + ++D
Sbjct: 416  VPSSPDVSNYLISEDDPSASGGNKDSLVYDGMADAEVQRR-LKDAISASSTAPSPIANLD 474

Query: 2299 PRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNSP 2120
            P +AS L            P T QGPA SFP+             L Q G  ETT ++SP
Sbjct: 475  PIVASVLHYMAPSSSFTAPPPTTQGPAMSFPSQQMHQVATLLKPPLVQLGQGETTSRSSP 534

Query: 2119 AREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRVQSRGWFPV 1940
            AREEGEVPESELDPDTRRR+LILQHGQD R  +P EPQFPAR PMQVSVPRVQ  GWFPV
Sbjct: 535  AREEGEVPESELDPDTRRRMLILQHGQDMRGPSPSEPQFPARTPMQVSVPRVQPHGWFPV 594

Query: 1939 EEEMSPRQPNRVT-XXXXXXXXXXXXXIDKHRSHHPPFIHKVESSVPPGRVLLESQKMPK 1763
            EEEMS RQPN+V               IDK+R HH PF+  VE S+PPGR+L ESQ++PK
Sbjct: 595  EEEMSSRQPNQVALPPKEFPLNVESLPIDKNRGHHSPFLQNVEPSIPPGRILPESQRLPK 654

Query: 1762 E-----ALPREEPLRINQSLPNFHSLSGEDAPIAQPSSGNKDLDLEAGQIDLYPETPAGA 1598
            E     A+PRE+ LR+NQSLP+FHS  GEDA +AQPSS NKD DLEAGQID Y ET  GA
Sbjct: 655  EVVDFSAVPREDQLRLNQSLPDFHSFHGEDASVAQPSSANKDFDLEAGQIDPYIETCIGA 714

Query: 1597 LQDIAYKCGAKVEFRQALVSSMELQFSVEVLFAGEKIGEGIGRTXXXXXXXXXQGSLMNL 1418
            LQDIA+KCG KVEF+Q L+SS  LQF VEVLFAGE+IGEG+GRT         +GSL+ L
Sbjct: 715  LQDIAFKCGTKVEFKQTLISSTGLQFFVEVLFAGERIGEGMGRTRREAQRQAAEGSLLYL 774

Query: 1417 ADKYLSR-KPDSSSIPGDGSRFANPYDNGLIGDINSFGYQPLPKEEVASFSMASESPRIL 1241
            ADKYLSR +PD + +PGDGSR  N  +NG   + NSFGYQPLP EE   FS  +  PRI+
Sbjct: 775  ADKYLSRSRPDFNYVPGDGSRVGNQKENGFNSNANSFGYQPLPNEEGLPFSTVAAPPRIV 834

Query: 1240 DPRLEASKKS-MGSIAELKELCMMQGLGVAFQTQPQFSGNPGQKNEVYAQVEIDGQVLGK 1064
            DPR E SK+  MGSI  LKE C M+GLGV FQTQPQFS NPGQ+NEVYAQVE++GQVLGK
Sbjct: 835  DPRTEVSKRPIMGSITALKEFCTMEGLGVTFQTQPQFSANPGQRNEVYAQVEVNGQVLGK 894

Query: 1063 GIGLTWDEAKTQAAEKALGTLRSMPGQFSYKRQ-DSPRPLQGMPNKRFKPEFSRSFQRMP 887
            GIGLTWDEA++QAAEKAL TL+SMPGQF Y+ Q  SPR +Q +PNKR K EF+R  QR+P
Sbjct: 895  GIGLTWDEARSQAAEKALVTLKSMPGQFPYRHQGSSPRSMQSIPNKRVKQEFNRVSQRLP 954

Query: 886  SSARYPKNASPVP 848
            S  RYP+N SPVP
Sbjct: 955  SFGRYPRNGSPVP 967


>ref|XP_011086104.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            [Sesamum indicum]
          Length = 951

 Score = 1352 bits (3498), Expect = 0.0
 Identities = 698/965 (72%), Positives = 789/965 (81%), Gaps = 1/965 (0%)
 Frame = -2

Query: 3739 MYGKSVVVYEGERVLGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTIT 3560
            MYGK V+VYEGER+LGEV +   Q Q G VWGE +KEIRISHYS PSERCPPLAVLHTI 
Sbjct: 1    MYGKLVLVYEGERLLGEVEL---QRQGGGVWGEEIKEIRISHYSPPSERCPPLAVLHTIN 57

Query: 3559 STGIAFKLESSQSPQHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYEGQVSC 3380
            STGI FKLES+   ++ DSPL++LHATCLRDNKTAV  +G  E+HLVAMHSRKYEGQ  C
Sbjct: 58   STGICFKLESTA--KNVDSPLSILHATCLRDNKTAVAIIGEGEIHLVAMHSRKYEGQHPC 115

Query: 3379 FWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDPQ 3200
            FWGFNVAS LYNSCL +LNLRCLGIVFDLDETLIVANTMRSFEDRI++L RK+NSESDPQ
Sbjct: 116  FWGFNVASSLYNSCLALLNLRCLGIVFDLDETLIVANTMRSFEDRIDSLQRKVNSESDPQ 175

Query: 3199 RVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRLQ 3020
            RV+ M+AE KRYQDDK+ILKQYAE+DQVIDNGKVI+SQSEVV ALS+ H  IVRPLIRLQ
Sbjct: 176  RVSSMLAEVKRYQDDKNILKQYAESDQVIDNGKVIRSQSEVVLALSDNHQTIVRPLIRLQ 235

Query: 3019 DKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRL 2840
            D+NIILTRINP IRDTSVLVRLRPAWEDL+SYLTA+GRKRFEVFVCTMAERDYALEMWRL
Sbjct: 236  DRNIILTRINPLIRDTSVLVRLRPAWEDLKSYLTAKGRKRFEVFVCTMAERDYALEMWRL 295

Query: 2839 LDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRV 2660
            LDPESNLI+  +LL+RIVCVK+GSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRV
Sbjct: 296  LDPESNLINPRDLLDRIVCVKSGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRV 355

Query: 2659 HVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIKE 2480
            HVVPAFAPY+APQAEANN++PVLC+ARNVACNVRGGFFKEFD+ L+QR+S VAYEDDIK+
Sbjct: 356  HVVPAFAPYFAPQAEANNSIPVLCLARNVACNVRGGFFKEFDESLIQRISGVAYEDDIKD 415

Query: 2479 VPPPLDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSID 2300
            +P P DVSNYL  EDDPS S+GNKD+IGFDGMADAEVERR LKE+ SASSTA   + ++D
Sbjct: 416  MPSPPDVSNYLFPEDDPSASNGNKDAIGFDGMADAEVERR-LKESMSASSTAVTPVINLD 474

Query: 2299 PRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNSP 2120
            PRIASA+Q           P  +QGPA  F               L+  G  ETT ++SP
Sbjct: 475  PRIASAIQFAVPSSSFTVHPPKIQGPAVPFLGQQLPPVTTLPKPPLSHLGQGETTFRSSP 534

Query: 2119 AREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRVQSRGWFPV 1940
            AREEGEVPESELDPDTRRRLLILQHGQD R+H P EPQFPARPP+QV VPRV  RGWFPV
Sbjct: 535  AREEGEVPESELDPDTRRRLLILQHGQDMREHPPSEPQFPARPPLQVPVPRVHPRGWFPV 594

Query: 1939 EEEMSPRQPNRVTXXXXXXXXXXXXXIDKHRSHHPPFIHKVESSVPPGRVLLESQKMPKE 1760
            EEEMSPRQ   +T             IDKHR+HHPPF+HKV++SV PGRV +ESQ++PKE
Sbjct: 595  EEEMSPRQLKLMT--PPMEFNTESLPIDKHRTHHPPFLHKVDTSVSPGRV-IESQRLPKE 651

Query: 1759 ALPREEPLRINQSLPNFHSLSGEDAPIAQPSSGNKDLDLEAGQIDLYPETPAGALQDIAY 1580
             +PRE+ LR++Q LP+ HS S ED+ +AQ  S N+DLDLEAGQID Y E    ALQDIA+
Sbjct: 652  EIPREKLLRLSQPLPDSHSFSSEDSAMAQLPSANEDLDLEAGQIDPYGENSTEALQDIAF 711

Query: 1579 KCGAKVEFRQALVSSMELQFSVEVLFAGEKIGEGIGRTXXXXXXXXXQGSLMNLADKYLS 1400
            KCG KVEF+QALVSS ELQFSV+VLFAGE+IGEGIGRT         +GSL+ LADKYLS
Sbjct: 712  KCGTKVEFKQALVSSTELQFSVKVLFAGERIGEGIGRTRREAQRHATEGSLLYLADKYLS 771

Query: 1399 R-KPDSSSIPGDGSRFANPYDNGLIGDINSFGYQPLPKEEVASFSMASESPRILDPRLEA 1223
            + KPDSS++P DGSR     DNG IGD+NS G+Q +P+ +      A   PRILD   EA
Sbjct: 772  QLKPDSSNMPEDGSRVGKLKDNGFIGDVNSVGHQSVPRAQT-----AVAPPRILDLSTEA 826

Query: 1222 SKKSMGSIAELKELCMMQGLGVAFQTQPQFSGNPGQKNEVYAQVEIDGQVLGKGIGLTWD 1043
            SK+SM SI+ LKELC M+GLGVAFQT+PQFS N GQK EVYA+VEIDGQVLGKGIGLTWD
Sbjct: 827  SKRSMSSISALKELCNMEGLGVAFQTRPQFSSNRGQKTEVYAEVEIDGQVLGKGIGLTWD 886

Query: 1042 EAKTQAAEKALGTLRSMPGQFSYKRQDSPRPLQGMPNKRFKPEFSRSFQRMPSSARYPKN 863
            EAK+QAAEKALG L+SM GQF YKR  SPR +Q MP KR KPEFSR+ QRM SS R+PKN
Sbjct: 887  EAKSQAAEKALGALKSMLGQFPYKRPGSPRSMQDMPIKRLKPEFSRALQRMQSSPRHPKN 946

Query: 862  ASPVP 848
            A+P P
Sbjct: 947  AAPCP 951


>emb|CDO99573.1| unnamed protein product [Coffea canephora]
          Length = 968

 Score = 1348 bits (3490), Expect = 0.0
 Identities = 696/967 (71%), Positives = 777/967 (80%), Gaps = 8/967 (0%)
 Frame = -2

Query: 3724 VVVYEGERVLGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTITST--- 3554
            V + EGERVLGEV VY    Q+GVVW     EIRIS YSQPSERCPPLAVLHT+TS+   
Sbjct: 7    VNLIEGERVLGEVEVYSIDDQNGVVWDR--DEIRISEYSQPSERCPPLAVLHTVTSSSSD 64

Query: 3553 --GIAFKLESSQSPQHQDSPLTVLHATCLRDNKTAVMSLGGE-ELHLVAMHSRKYEGQVS 3383
              G+ FKLE     Q Q+SPL++LHATCLR+NKTA+M L  E ELHLVAMHSR++EGQ  
Sbjct: 65   SGGLCFKLELKDKSQ-QNSPLSILHATCLRENKTAIMPLDEEDELHLVAMHSRQHEGQFP 123

Query: 3382 CFWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDP 3203
            CFWGF VAS LYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL RKINSE D 
Sbjct: 124  CFWGFIVASRLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINSEVDQ 183

Query: 3202 QRVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRL 3023
            QRV+ M+AE KRYQDDK+ILKQYAENDQV+DNGKV+KSQ EVV ALS+ H  IVRPL+RL
Sbjct: 184  QRVSAMLAEIKRYQDDKNILKQYAENDQVVDNGKVVKSQPEVVLALSDNHQTIVRPLLRL 243

Query: 3022 QDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWR 2843
            Q+KNIILTRINPQIRDTSVLVRLRPAWEDLR+YLTARGRKRFEV+VCTMAERDYALEMWR
Sbjct: 244  QEKNIILTRINPQIRDTSVLVRLRPAWEDLRNYLTARGRKRFEVYVCTMAERDYALEMWR 303

Query: 2842 LLDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPR 2663
            LLDP+SNLID +ELL+RIVCVK+G RKSLFNVFQ GNCHPKMALVIDDRLKVWDEKDQPR
Sbjct: 304  LLDPDSNLIDPKELLDRIVCVKSGLRKSLFNVFQHGNCHPKMALVIDDRLKVWDEKDQPR 363

Query: 2662 VHVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIK 2483
            VHVVPAFAPYYAPQAEANN +PVLCVARNVACNVRGGFFKEFD+GLLQR+SEVAYEDDIK
Sbjct: 364  VHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFDEGLLQRISEVAYEDDIK 423

Query: 2482 EVPPPLDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSI 2303
            E+P P DVSNYL SEDDPS S+GNKDS+GFDGMAD EVERR LKEA SASSTAPL + ++
Sbjct: 424  EIPSPPDVSNYLISEDDPSASNGNKDSLGFDGMADVEVERR-LKEAISASSTAPLAIPNL 482

Query: 2302 DPRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQR-GPAETTLQN 2126
            DP+I + +Q             TM GP   FP+             + Q   P E +LQ+
Sbjct: 483  DPKIVATVQ-YAVPSSISVLQPTMSGPVVPFPSQQLSQVTSVLKNPINQAILPPEASLQS 541

Query: 2125 SPAREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRVQSRGWF 1946
            SPAREEGEVPESELDPDTRRRLLILQHGQD+R+    EPQFP R P+QVS PR Q RGWF
Sbjct: 542  SPAREEGEVPESELDPDTRRRLLILQHGQDSRERTSSEPQFPVRTPLQVSAPRAQGRGWF 601

Query: 1945 PVEEEMSPRQPNRVTXXXXXXXXXXXXXIDKHRSHHPPFIHKVESSVPPGRVLLESQKMP 1766
            P++EEMSPRQ NRV              I+KHRS H PF+HK ES+VPP R  LE+Q+M 
Sbjct: 602  PIDEEMSPRQLNRVVPPKDFPLRSEPMEIEKHRSSHSPFLHKAESAVPPDRAFLENQRML 661

Query: 1765 KEALPREEPLRINQSLPNFHSLSGEDAPIAQPSSGNKDLDLEAGQIDLYPETPAGALQDI 1586
            KE LPRE+ LR+NQ + +F S SGE+A + + SS N+DLDLE+GQID   ETP GAL DI
Sbjct: 662  KETLPREDNLRLNQPVASFPSFSGEEASMVRSSSANRDLDLESGQIDPQAETPIGALHDI 721

Query: 1585 AYKCGAKVEFRQALVSSMELQFSVEVLFAGEKIGEGIGRTXXXXXXXXXQGSLMNLADKY 1406
            A+KCG KVEF+QALVSS ELQF  EV FAGEKIGEG+GRT           SLMNLADKY
Sbjct: 722  AFKCGTKVEFKQALVSSSELQFCAEVWFAGEKIGEGLGRTRREAQRHAADSSLMNLADKY 781

Query: 1405 LSR-KPDSSSIPGDGSRFANPYDNGLIGDINSFGYQPLPKEEVASFSMASESPRILDPRL 1229
            +S  KPDSSS+PG+  RF N  +NG   D +S+GYQ LPKEE  SFS AS  PR+LD RL
Sbjct: 782  ISSLKPDSSSVPGEWRRFPNTSNNGFANDFSSWGYQQLPKEEPGSFSTASMPPRVLDSRL 841

Query: 1228 EASKKSMGSIAELKELCMMQGLGVAFQTQPQFSGNPGQKNEVYAQVEIDGQVLGKGIGLT 1049
            EASK+ +G IA LKELC M+GLG+AFQTQPQ S NPGQKNEVYAQVEIDGQVLGKGIG+ 
Sbjct: 842  EASKRPVGPIAALKELCSMEGLGLAFQTQPQLSANPGQKNEVYAQVEIDGQVLGKGIGIN 901

Query: 1048 WDEAKTQAAEKALGTLRSMPGQFSYKRQDSPRPLQGMPNKRFKPEFSRSFQRMPSSARYP 869
            WDEAK+QAAEKALGTL+SM G + +KRQ SPRP QGM +KR KPEFSR  QRMPSSARYP
Sbjct: 902  WDEAKSQAAEKALGTLKSMLGSYGHKRQGSPRPWQGMSSKRLKPEFSRVLQRMPSSARYP 961

Query: 868  KNASPVP 848
            KNASPVP
Sbjct: 962  KNASPVP 968


>ref|XP_009623032.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            [Nicotiana tomentosiformis]
            gi|697137919|ref|XP_009623033.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 1
            [Nicotiana tomentosiformis]
          Length = 965

 Score = 1313 bits (3399), Expect = 0.0
 Identities = 683/973 (70%), Positives = 773/973 (79%), Gaps = 9/973 (0%)
 Frame = -2

Query: 3739 MYGKSVVVYEGERVLGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTIT 3560
            MY   VV+YEGERV+GE+ +     ++GVVWGE++  IRISHYS PSERCPPLAVLHTIT
Sbjct: 1    MYNSVVVLYEGERVVGELELL-YGGENGVVWGEKV--IRISHYSPPSERCPPLAVLHTIT 57

Query: 3559 ST-----GIAFKLESSQSPQ-HQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKY 3398
            S+     GI+FKLE ++S    QDSPL +LH+TCLRDNKTAV+SLG EELHLVAM S+ +
Sbjct: 58   SSSTTGNGISFKLEPTKSKSLSQDSPLFLLHSTCLRDNKTAVVSLGREELHLVAMQSKNF 117

Query: 3397 EGQVSCFWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKIN 3218
             GQ  CFWGF VASGLYNSCL MLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKIN
Sbjct: 118  GGQCPCFWGFKVASGLYNSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKIN 177

Query: 3217 SESDPQRVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVR 3038
            SESDPQR + M++E KRYQ+DK  LKQYAENDQVIDNGKVIKSQSEV PALS+ H PIVR
Sbjct: 178  SESDPQRASAMLSEVKRYQEDKIFLKQYAENDQVIDNGKVIKSQSEVFPALSDNHQPIVR 237

Query: 3037 PLIRLQDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYA 2858
            PLIRLQD+NIILTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYA
Sbjct: 238  PLIRLQDRNIILTRINPMIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYA 297

Query: 2857 LEMWRLLDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDE 2678
            LEMWRLLDP+SNLI+S+ELL+RIVCVK+G RKSLFNVFQDGNCHPKMALVIDDRLKVWDE
Sbjct: 298  LEMWRLLDPDSNLINSKELLDRIVCVKSGLRKSLFNVFQDGNCHPKMALVIDDRLKVWDE 357

Query: 2677 KDQPRVHVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAY 2498
            KDQPRVHVVPAFAPY++PQAE NN+VPVLCVARNVACNVRGGFFK+FD+GLLQR+SEVAY
Sbjct: 358  KDQPRVHVVPAFAPYFSPQAEGNNSVPVLCVARNVACNVRGGFFKDFDEGLLQRISEVAY 417

Query: 2497 EDDIKEVPPPLDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPL 2318
            EDDIK+VP   DVSNYL SEDDPS  +GNKDS+GFDGMAD EVERR LKEA  AS++ P 
Sbjct: 418  EDDIKQVPSAPDVSNYLLSEDDPSAVNGNKDSLGFDGMADTEVERR-LKEAMLASTSVPS 476

Query: 2317 TMTSIDPRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAET 2138
             MT+ DPRIA ALQ            +T+Q P   FP              + Q  P +T
Sbjct: 477  QMTNSDPRIAPALQ---YPVPPAISQSTIQAPVVPFPAQHLPQVTSVLKSSVTQLSPQDT 533

Query: 2137 TLQNSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSV-PRVQ 1961
            +LQ+SPAREEGEVPESELDPDTRRRLLILQHGQDTRD    EPQFP   P+QVSV PRVQ
Sbjct: 534  SLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDQVSSEPQFPMGTPLQVSVPPRVQ 593

Query: 1960 SRGWFPVEEEMSPRQPNRVTXXXXXXXXXXXXXIDKHRSHHPPFIHKVESSVPPGRVLLE 1781
              GWFPVEEEMSPRQ NR               I+K+R  HPPF+ K+E+SVP  RVL E
Sbjct: 594  PHGWFPVEEEMSPRQLNRALPPKEFPLNSETMHINKNRPPHPPFLPKMETSVPSDRVLFE 653

Query: 1780 SQKMPKEALPREEPLRINQSLPNFHSLSGEDAPIAQPSSGNKDLDLEAGQIDLYPETPAG 1601
            SQ++PKE +PR++ +R +QS P FH + GE+  + + SS N+DLDLE G  D Y ETPAG
Sbjct: 654  SQRLPKEVIPRDDRMRFSQSQPTFHPMPGEEVSLGRSSSSNRDLDLEPGHYDPYLETPAG 713

Query: 1600 ALQDIAYKCGAKVEFRQALVSSMELQFSVEVLFAGEKIGEGIGRTXXXXXXXXXQGSLMN 1421
            ALQDIA+KCGAKVEF+  L+SS ELQFSVEV FAGEKIGEGIGRT         + SLMN
Sbjct: 714  ALQDIAFKCGAKVEFKSGLLSSPELQFSVEVWFAGEKIGEGIGRTRREAQRQAAEESLMN 773

Query: 1420 LADKYLSR-KPDSSSIPGDGSRFANPYDNGLIGDINSFGYQPLPKEEVASFSMASESPRI 1244
            LADKYLSR KPD SS  GDG RF N  DNG + D++ FGYQ   KE+  S S ASE  R+
Sbjct: 774  LADKYLSRLKPDPSSTAGDGFRFPNASDNGFVDDMSPFGYQSYLKEDRVSHSFASEPSRV 833

Query: 1243 LDPRLEASKKSMGSIAELKELCMMQGLGVAFQTQPQFSGNPGQKNEVYAQVEIDGQVLGK 1064
            LDPRLE  KKS+GS+A L+ELC ++GLG+AFQTQPQ S NPG K E+YAQVEIDGQV GK
Sbjct: 834  LDPRLEVLKKSVGSVASLRELCAIEGLGLAFQTQPQLSANPG-KTEIYAQVEIDGQVFGK 892

Query: 1063 GIGLTWDEAKTQAAEKALGTLRSMPGQFSYKRQDSPRPL-QGMPNKRFKPEFSRSFQRMP 887
            GIG TWD+AK QAAE+AL  L+S  GQFS+KRQ SPR L QG  NKR +PE+SR  QR+P
Sbjct: 893  GIGSTWDDAKAQAAERALVALKSELGQFSHKRQGSPRSLQQGFSNKRLRPEYSRGMQRLP 952

Query: 886  SSARYPKNASPVP 848
            SS R+PKN S +P
Sbjct: 953  SSGRFPKNTSAMP 965


>ref|XP_009789678.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            [Nicotiana sylvestris] gi|698485837|ref|XP_009789679.1|
            PREDICTED: RNA polymerase II C-terminal domain
            phosphatase-like 1 [Nicotiana sylvestris]
            gi|698485839|ref|XP_009789680.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 1
            [Nicotiana sylvestris]
          Length = 965

 Score = 1312 bits (3396), Expect = 0.0
 Identities = 683/973 (70%), Positives = 775/973 (79%), Gaps = 9/973 (0%)
 Frame = -2

Query: 3739 MYGKSVVVYEGERVLGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTIT 3560
            MY   VV+YEGERV+GE+ +     ++GVVWGE++  IRISHYS PSERCPPLAVLHTIT
Sbjct: 1    MYKSVVVLYEGERVVGELELL-YGGENGVVWGEKV--IRISHYSPPSERCPPLAVLHTIT 57

Query: 3559 ST-----GIAFKLESSQSPQ-HQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKY 3398
            S+     GI+FKLE ++S    QDSPL +LH+TCLRDNKTAV+SLG EELHLVAM S+ +
Sbjct: 58   SSSTTGNGISFKLEPTKSKSLSQDSPLFLLHSTCLRDNKTAVVSLGREELHLVAMQSKNF 117

Query: 3397 EGQVSCFWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKIN 3218
             GQ  CFWGF VASGLYNSCL MLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKIN
Sbjct: 118  GGQCPCFWGFKVASGLYNSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKIN 177

Query: 3217 SESDPQRVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVR 3038
            SESDPQR + M+AE KRYQ+DK  LKQYAENDQVIDNGKVIKSQSEV PALS+ H PIVR
Sbjct: 178  SESDPQRASAMLAEVKRYQEDKIFLKQYAENDQVIDNGKVIKSQSEVFPALSDNHQPIVR 237

Query: 3037 PLIRLQDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYA 2858
            PLIRLQD+NIILTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYA
Sbjct: 238  PLIRLQDRNIILTRINPMIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYA 297

Query: 2857 LEMWRLLDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDE 2678
            LEMWRLLDP+SNLI+S+ELL+RIVCVK+G RKSLFNVFQDGNCHPKMALVIDDRLKVWDE
Sbjct: 298  LEMWRLLDPDSNLINSKELLDRIVCVKSGLRKSLFNVFQDGNCHPKMALVIDDRLKVWDE 357

Query: 2677 KDQPRVHVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAY 2498
            KDQPRVHVVPAFAPY++PQAE NN+VPVLCVARNVACNVRGGFFK+FD+GLLQR+SEVAY
Sbjct: 358  KDQPRVHVVPAFAPYFSPQAEGNNSVPVLCVARNVACNVRGGFFKDFDEGLLQRISEVAY 417

Query: 2497 EDDIKEVPPPLDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPL 2318
            EDDIK+VP   DVSNYL SEDDPS  +G+KDS+GFDGMAD EVERR LKEA  AS++ P 
Sbjct: 418  EDDIKQVPSAPDVSNYLISEDDPSAVNGSKDSLGFDGMADTEVERR-LKEAMLASTSVPS 476

Query: 2317 TMTSIDPRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAET 2138
             MT+ DPRIA ALQ            +T+Q P   FP              + Q  P +T
Sbjct: 477  QMTNSDPRIAPALQ---YPVPPAISQSTIQAPVVPFPAQHLPQVTSVLKSSVTQLSPQDT 533

Query: 2137 TLQNSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSV-PRVQ 1961
            +LQ+SPAREEGEVPESELDPDTRRRLLILQHGQDTRD    EPQFP   P+QVSV PRVQ
Sbjct: 534  SLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDQVSSEPQFPMGTPLQVSVPPRVQ 593

Query: 1960 SRGWFPVEEEMSPRQPNRVTXXXXXXXXXXXXXIDKHRSHHPPFIHKVESSVPPGRVLLE 1781
              GWFPVEEEMSPRQ NR               I+K+R  HPPF+ K+E+SVP  RVL E
Sbjct: 594  PHGWFPVEEEMSPRQLNRALPPKEFPLNSESMHINKNRPPHPPFLPKMETSVPSDRVLFE 653

Query: 1780 SQKMPKEALPREEPLRINQSLPNFHSLSGEDAPIAQPSSGNKDLDLEAGQIDLYPETPAG 1601
            SQ++PKE +PR++ +R +QS P+FHS+ GE+  + + SS ++DLDLE G  D Y ETPAG
Sbjct: 654  SQRLPKEVIPRDDRMRFSQSQPSFHSMPGEEVSLGRSSSSSRDLDLEPGHYDPYLETPAG 713

Query: 1600 ALQDIAYKCGAKVEFRQALVSSMELQFSVEVLFAGEKIGEGIGRTXXXXXXXXXQGSLMN 1421
            ALQDIA+KCGAKVEF+  L+SS ELQFSVEV FAGEKIGEGIGRT         + SLMN
Sbjct: 714  ALQDIAFKCGAKVEFKSGLLSSPELQFSVEVWFAGEKIGEGIGRTRREAQRQAAEESLMN 773

Query: 1420 LADKYLSR-KPDSSSIPGDGSRFANPYDNGLIGDINSFGYQPLPKEEVASFSMASESPRI 1244
            LADKYLSR KPD SS  GDG RF N  DNG + D++ FGYQ   KE+  S S ASE  R+
Sbjct: 774  LADKYLSRLKPDPSSTAGDGFRFPNASDNGFVDDMSPFGYQSYLKEDRVSHSFASEPSRV 833

Query: 1243 LDPRLEASKKSMGSIAELKELCMMQGLGVAFQTQPQFSGNPGQKNEVYAQVEIDGQVLGK 1064
            LDPRLE  KKS+GS+A L+ELC ++GLG+AFQTQPQ S NPG K E+YAQVEIDGQV GK
Sbjct: 834  LDPRLEVLKKSVGSVASLRELCAIEGLGLAFQTQPQLSANPG-KTEIYAQVEIDGQVFGK 892

Query: 1063 GIGLTWDEAKTQAAEKALGTLRSMPGQFSYKRQDSPRPL-QGMPNKRFKPEFSRSFQRMP 887
            GIG TWD+AK QAAE+AL  L+S  GQFS+KRQ SPR L QG  NKR +PE+SR  QR+P
Sbjct: 893  GIGSTWDDAKAQAAERALVALKSELGQFSHKRQGSPRSLQQGFSNKRLRPEYSRGMQRLP 952

Query: 886  SSARYPKNASPVP 848
            SS R+PKN S +P
Sbjct: 953  SSGRFPKNTSAMP 965


>ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Solanum tuberosum]
          Length = 953

 Score = 1284 bits (3323), Expect = 0.0
 Identities = 669/969 (69%), Positives = 768/969 (79%), Gaps = 5/969 (0%)
 Frame = -2

Query: 3739 MYGKSVVVYEGERVLGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTIT 3560
            M+  +VV+YEGER++GEV +Y    + GV+WGE++  IRISHYS  SERCPPLAVLHT+T
Sbjct: 1    MFKSTVVLYEGERLVGEVEIY---CEKGVLWGEKV--IRISHYSPSSERCPPLAVLHTVT 55

Query: 3559 STGIAFKLESSQS-PQHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYEGQVS 3383
             TG++FKLE ++S P  QDSPLT+LH+TCLRDNKTAVMSLG EELHLVAM S+   GQ  
Sbjct: 56   -TGLSFKLEPTKSKPLTQDSPLTLLHSTCLRDNKTAVMSLGREELHLVAMQSKNIGGQCP 114

Query: 3382 CFWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDP 3203
            CFWGF VASGLY+SCL MLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL RKINSESDP
Sbjct: 115  CFWGFKVASGLYDSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINSESDP 174

Query: 3202 QRVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRL 3023
            QR + M+AE KRYQ+DK ILKQYAENDQV+DNGKVIKSQSEV PALS+ H PIVRPLIRL
Sbjct: 175  QRASVMLAEVKRYQEDKIILKQYAENDQVVDNGKVIKSQSEVFPALSDNHQPIVRPLIRL 234

Query: 3022 QDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWR 2843
            QD+NIILTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYALEMWR
Sbjct: 235  QDRNIILTRINPMIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWR 294

Query: 2842 LLDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPR 2663
            LLDP+SNLI+S+ELL+RIVCVK+G RKSLFNVFQDGNCHPKMALVIDDRLKVWD+KDQPR
Sbjct: 295  LLDPDSNLINSQELLDRIVCVKSGLRKSLFNVFQDGNCHPKMALVIDDRLKVWDDKDQPR 354

Query: 2662 VHVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIK 2483
            VHVVPAFAPY+APQAE NN+VPVLCVARNVACNVRGGFFK+FD+GLLQR+SEVAYEDDIK
Sbjct: 355  VHVVPAFAPYFAPQAEGNNSVPVLCVARNVACNVRGGFFKDFDEGLLQRISEVAYEDDIK 414

Query: 2482 EVPPPLDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSI 2303
            +VP   DVSNYL SEDDPS  +GNKDS+GFDGMAD+EVERR LKEA  AS++ P  MT++
Sbjct: 415  QVPSAPDVSNYLISEDDPSAVNGNKDSLGFDGMADSEVERR-LKEAMLASTSVPSQMTNL 473

Query: 2302 DPRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNS 2123
            DPR+  ALQ             ++Q P   FP              + Q  P +T+LQ+S
Sbjct: 474  DPRLVPALQ---YPVPPVISQPSIQSPVVPFPTQHLPQVTSVLKSSVTQISPQDTSLQSS 530

Query: 2122 PAREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSV-PRVQSRGWF 1946
            PAREEGEVPESELDPDTRRRLLILQHGQDTRD    EP+FP   P+QVSV PRVQ  GWF
Sbjct: 531  PAREEGEVPESELDPDTRRRLLILQHGQDTRDQVSSEPKFPMGTPLQVSVPPRVQPHGWF 590

Query: 1945 PVEEEMSPRQPNRVTXXXXXXXXXXXXXIDKHRSHHPPFIHKVESSVPPGRVLLESQKMP 1766
            P EEEMSPRQ NR               I+KHR  HPPF+ K+E+S+P  RVL E+Q++P
Sbjct: 591  PAEEEMSPRQLNRPLPPKEFPLNPESMHINKHRPPHPPFLPKMETSMPSDRVLFENQRLP 650

Query: 1765 KEALPREEPLRINQSLPNFHSLSGEDAPIAQPSSGNKDLDLEAGQIDLYPETPAGALQDI 1586
            KE +PR++ +R +QS P+F    GE+ P+ + SS N+ LDLE G  D Y ETPAGALQDI
Sbjct: 651  KEVIPRDDRMRFSQSQPSFRP-PGEEVPLGRSSSSNRVLDLEPGHYDPYLETPAGALQDI 709

Query: 1585 AYKCGAKVEFRQALVSSMELQFSVEVLFAGEKIGEGIGRTXXXXXXXXXQGSLMNLADKY 1406
            A+KCGAKVEFR + +SS ELQFS+EVLFAGEK+GEG GRT         + SLM LADKY
Sbjct: 710  AFKCGAKVEFRSSFLSSPELQFSLEVLFAGEKVGEGTGRTRREAQRRAAEESLMYLADKY 769

Query: 1405 LS-RKPDSSSIPGDGSRFANPYDNGLIGDINSFGYQPLPKEEVASFSMASESPRILDPRL 1229
            LS  KPDSSS  GDG RF N  DNG + +++ FGYQ     +  S S ASE PR+LDPRL
Sbjct: 770  LSCIKPDSSSTQGDGFRFPNASDNGFVDNMSPFGYQ-----DRVSHSFASEPPRVLDPRL 824

Query: 1228 EASKKSMGSIAELKELCMMQGLGVAFQTQPQFSGNPGQKNEVYAQVEIDGQVLGKGIGLT 1049
            E  KKS+GS+  L+ELC ++GLG+AFQTQPQ S NPGQK+E+YAQVEIDGQV GKGIG T
Sbjct: 825  EVFKKSVGSVGALRELCAIEGLGLAFQTQPQLSANPGQKSEIYAQVEIDGQVFGKGIGST 884

Query: 1048 WDEAKTQAAEKALGTLRSMPGQFSYKRQDSPRPL-QGMPNKRFKPEFSRSF-QRMPSSAR 875
            WD+AKTQAAE+AL  L+S   QFS KRQ SPR L QG  NKR KPE+SR   QR+P S R
Sbjct: 885  WDDAKTQAAERALVALKSELAQFSQKRQGSPRSLQQGFSNKRLKPEYSRGVQQRVPLSGR 944

Query: 874  YPKNASPVP 848
            +PKN S +P
Sbjct: 945  FPKNTSAMP 953


>ref|XP_004232844.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            [Solanum lycopersicum]
          Length = 954

 Score = 1273 bits (3295), Expect = 0.0
 Identities = 664/970 (68%), Positives = 766/970 (78%), Gaps = 6/970 (0%)
 Frame = -2

Query: 3739 MYGKSVVVYEGERVLGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTIT 3560
            M+  +V++YEGER++GEV +Y    + GVVWGE+L  IRISHYS  SERCPPLAVLHT+T
Sbjct: 1    MFKSTVLLYEGERLVGEVEMY---GEKGVVWGEKL--IRISHYSPSSERCPPLAVLHTVT 55

Query: 3559 STGIAFKLESSQS-PQHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYEGQVS 3383
             TG++FKLE ++S P  QDSPLT+LH+TCLRDNKTAVMSLG EELHLVAM S+   GQ  
Sbjct: 56   -TGLSFKLEPTKSKPLTQDSPLTLLHSTCLRDNKTAVMSLGREELHLVAMQSKNIGGQCP 114

Query: 3382 CFWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDP 3203
            CFWGF VASGLY+SCL MLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL RKINSESDP
Sbjct: 115  CFWGFKVASGLYDSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINSESDP 174

Query: 3202 QRVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRL 3023
            QR + M+AE KRYQ+DK ILKQYAENDQV+DNGKVI+SQSEV PALS+ H PIVRPLIRL
Sbjct: 175  QRASVMLAEVKRYQEDKIILKQYAENDQVVDNGKVIRSQSEVFPALSDNHQPIVRPLIRL 234

Query: 3022 QDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWR 2843
            QD+NIILTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYALEMWR
Sbjct: 235  QDRNIILTRINPMIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWR 294

Query: 2842 LLDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPR 2663
            LLDP+SNLI+S+ELL+RIVCVK+G RKSLFNVFQDGNCHPKMALVIDDRLKVWD+KDQPR
Sbjct: 295  LLDPDSNLINSQELLDRIVCVKSGLRKSLFNVFQDGNCHPKMALVIDDRLKVWDDKDQPR 354

Query: 2662 VHVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIK 2483
            VHVVPAFAPY+APQAE NN+VPVLCVARNVACNVRGGFFK+FD+GLLQR+SEVAYEDDIK
Sbjct: 355  VHVVPAFAPYFAPQAEGNNSVPVLCVARNVACNVRGGFFKDFDEGLLQRISEVAYEDDIK 414

Query: 2482 EVPPPLDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSI 2303
            +VP   DVSNYL SEDDPS  +GNKDS+GFDGMAD+EVERR LKEA  AS++ P  MT++
Sbjct: 415  QVPSAPDVSNYLISEDDPSAVNGNKDSLGFDGMADSEVERR-LKEAMLASTSVPSQMTNL 473

Query: 2302 DPRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNS 2123
            DPR+  ALQ             ++QGP   FP              + Q  P +T+LQ+S
Sbjct: 474  DPRLVPALQ---YPVPPVISQPSIQGPVVPFPTQHLPQVTSVLKSSVTQISPQDTSLQSS 530

Query: 2122 PAREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSV-PRVQSRGWF 1946
            PAREEGEVPESELDPDTRRRLLILQHGQDTRD    EP+FP   P+QVSV PRVQ  GWF
Sbjct: 531  PAREEGEVPESELDPDTRRRLLILQHGQDTRDQVSSEPKFPIGTPLQVSVPPRVQPHGWF 590

Query: 1945 PVEEEMSPRQPNRVTXXXXXXXXXXXXXIDKHRSHHPPFIHKVESSVPPGRVLLESQKMP 1766
            P EEE+SPRQ NR               I+KHR  HPPF+ K+E+S+P  RV  E+Q++P
Sbjct: 591  PAEEEVSPRQLNRPLPPKEFPLNPESMHINKHRPPHPPFLPKMETSMPSDRVFFENQRLP 650

Query: 1765 KEALPREEPLRINQSLPNFHSLSGEDAPIAQPSSGNKDLDLEAGQIDLYPETPAGALQDI 1586
            KE +PR++ +R +QS P+F    GED  + + SS N+ LDL+ G  D Y +TPAGALQDI
Sbjct: 651  KEVIPRDDRMRFSQSQPSFRP-PGEDVSLGRSSSSNRVLDLDPGHYDPYLDTPAGALQDI 709

Query: 1585 AYKCGAKVEFRQALVSSMELQFSVEVLFAGEKIGEGIGRTXXXXXXXXXQGSLMNLADKY 1406
            A+KCG KVEFR + +SS ELQF +EVLFAGEK+GEGIGRT         + SLM LADKY
Sbjct: 710  AFKCGVKVEFRSSFLSSPELQFCLEVLFAGEKVGEGIGRTRREAQRHAAEESLMYLADKY 769

Query: 1405 LS-RKPDSSSIPGDGSRFANPYDNGLIGDINSFGYQPLPKEEVASFSMASESPRILDPRL 1229
            LS  K DSSS  GDG RF N  DNG + +++ FGYQ     +  S S ASE PR+LDPRL
Sbjct: 770  LSCIKADSSSTQGDGFRFPNASDNGFVENMSPFGYQ-----DRVSHSFASEPPRVLDPRL 824

Query: 1228 EASKKSMGSIAELKELCMMQGLGVAFQTQPQFSGNPGQKNEVYAQVEIDGQVLGKGIGLT 1049
            E  KKS+GS+  L+ELC ++GLG+AFQTQPQ S NPGQK+E+YAQVEIDGQV GKGIG T
Sbjct: 825  EVFKKSVGSVGALRELCAIEGLGLAFQTQPQLSVNPGQKSEIYAQVEIDGQVFGKGIGPT 884

Query: 1048 WDEAKTQAAEKALGTLRSMPGQFSYKRQDSPRPL--QGMPNKRFKPEFSRSF-QRMPSSA 878
            WD+AKTQAAE+AL  L+S   QFS+KRQ SPR L  QG  NKR KPE+SR   QR+P S 
Sbjct: 885  WDDAKTQAAERALVALKSELAQFSHKRQGSPRSLQQQGFSNKRLKPEYSRGVQQRVPLSG 944

Query: 877  RYPKNASPVP 848
            R+PKN S +P
Sbjct: 945  RFPKNTSAMP 954


>ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform 1 [Theobroma cacao]
            gi|508781046|gb|EOY28302.1| C-terminal domain
            phosphatase-like 1 isoform 1 [Theobroma cacao]
          Length = 978

 Score = 1249 bits (3232), Expect = 0.0
 Identities = 664/978 (67%), Positives = 751/978 (76%), Gaps = 20/978 (2%)
 Frame = -2

Query: 3721 VVYEGERVLGEVAVYPTQSQDG-------------VVWGERLKEIRISHYSQPSERCPPL 3581
            VVY GE VLGEV +YP Q                 +V  E +KEIRI + +Q SERCPPL
Sbjct: 8    VVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSERCPPL 67

Query: 3580 AVLHTITSTGIAFKLESSQSPQHQDS----PLTVLHATCLRDNKTAVMSLGGEELHLVAM 3413
            AVLHTITS+GI FK+ESS+   +  S    PL +LH+ C+RDNKTAVM +G  ELHLVAM
Sbjct: 68   AVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCELHLVAM 127

Query: 3412 HSRKYEGQVSCFWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL 3233
            +SR  +    CFWGFNV+ GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL
Sbjct: 128  YSRNSDRP--CFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL 185

Query: 3232 LRKINSESDPQRVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVH 3053
             RK+ +E DPQRVAGM+AE KRYQDDK+ILKQYAENDQV++NGKVIK QSEVVPALS+ H
Sbjct: 186  QRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVVPALSDNH 245

Query: 3052 LPIVRPLIRLQDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMA 2873
             PI+RPLIRLQ+KNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMA
Sbjct: 246  QPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMA 305

Query: 2872 ERDYALEMWRLLDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRL 2693
            ERDYALEMWRLLDPESNLI+S+ELL+RIVCVK+GSRKSLFNVFQDG CHPKMALVIDDRL
Sbjct: 306  ERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVIDDRL 365

Query: 2692 KVWDEKDQPRVHVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRM 2513
            KVWDEKDQPRVHVVPAFAPYYAPQAEANNT+PVLCVARNVACNVRGGFF+EFD+GLLQR+
Sbjct: 366  KVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGLLQRI 425

Query: 2512 SEVAYEDDIKEVPPPLDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSAS 2333
             E++YEDDIK++P P DV NYL SEDD S  +GNKD + FDGMADAEVERR LKEA SA+
Sbjct: 426  PEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERR-LKEAISAT 484

Query: 2332 STAPLTMTSIDPRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQR 2153
            ST      ++DPR+  +LQ           P+  Q    SF N                 
Sbjct: 485  STVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPVVKPVAPVA 544

Query: 2152 GPAETTLQNSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQF-PARPPMQVS 1976
             P E +LQ+SPAREEGEVPESELDPDTRRRLLILQHGQDTRDH P EP F P RP MQVS
Sbjct: 545  VP-EPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPVRPTMQVS 603

Query: 1975 VPRVQSRG-WFPVEEEMSPRQPNRVTXXXXXXXXXXXXXIDKHRSHHPPFIHKVESSVPP 1799
            VPR QSRG WF  EEEMSPRQ NR               I+KHR  HPPF  KVESS+P 
Sbjct: 604  VPRGQSRGSWFAAEEEMSPRQLNRAA-PKEFPLDSERMHIEKHR--HPPFFPKVESSIPS 660

Query: 1798 GRVLLESQKMPKEALPREEPLRINQSLPNFHSLSGEDAPIAQPSSGNKDLDLEAGQIDLY 1619
             R+L E+Q++ KEAL R++ L +N +  ++HS SGE+ P++Q SS ++DLD E+G+    
Sbjct: 661  DRLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFESGRTVTS 720

Query: 1618 PETPAGALQDIAYKCGAKVEFRQALVSSMELQFSVEVLFAGEKIGEGIGRTXXXXXXXXX 1439
             ET AG LQDIA KCGAKVEFR ALV+S++LQFS+E  FAGEK+GEG+GRT         
Sbjct: 721  GETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFAGEKVGEGVGRTRREAQRQAA 780

Query: 1438 QGSLMNLADKYLSR-KPDSSSIPGDGSRFANPYDNGLIGDINSFGYQPLPKEEVASFSMA 1262
            + S+ NLA+ YLSR KPDS S  GD SR  N  DNG   ++NSFG Q L KEE  SFS A
Sbjct: 781  EESIKNLANTYLSRIKPDSGSAEGDLSRLHNINDNGFPSNVNSFGNQLLAKEESLSFSTA 840

Query: 1261 SESPRILDPRLEASKKSMGSIAELKELCMMQGLGVAFQTQPQFSGNPGQKNEVYAQVEID 1082
            SE  R+ DPRLE SKKSMGS+  LKELCMM+GLGV FQ QP  S N  QK+EVYAQVEID
Sbjct: 841  SEQSRLADPRLEGSKKSMGSVTALKELCMMEGLGVVFQPQPPSSSNALQKDEVYAQVEID 900

Query: 1081 GQVLGKGIGLTWDEAKTQAAEKALGTLRSMPGQFSYKRQDSPRPLQGMPNKRFKPEFSRS 902
            GQVLGKG GLTW+EAK QAAEKALG+LRSM GQ+S KRQ SPR LQGM NKR KPEF R 
Sbjct: 901  GQVLGKGTGLTWEEAKMQAAEKALGSLRSMLGQYSQKRQGSPRSLQGMQNKRLKPEFPRV 960

Query: 901  FQRMPSSARYPKNASPVP 848
             QRMPSS RYPKNA PVP
Sbjct: 961  LQRMPSSGRYPKNAPPVP 978


>ref|XP_002267987.3| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            [Vitis vinifera]
          Length = 935

 Score = 1244 bits (3218), Expect = 0.0
 Identities = 651/959 (67%), Positives = 747/959 (77%), Gaps = 1/959 (0%)
 Frame = -2

Query: 3721 VVYEGERVLGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTITSTGIAF 3542
            +VYEG+ V+GEV +YP     G+   E +KEIRISHYSQPSERCPPLAVLHTITS G+ F
Sbjct: 5    IVYEGDDVVGEVEIYP--QNQGL---ELMKEIRISHYSQPSERCPPLAVLHTITSCGVCF 59

Query: 3541 KLESSQSPQHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYEGQVSCFWGFNV 3362
            K+ESS++ Q QD+PL +LH+TC+R+NKTAVMSLG EELHLVAM+S+K +GQ  CFWGFNV
Sbjct: 60   KMESSKA-QSQDTPLYLLHSTCIRENKTAVMSLGEEELHLVAMYSKKKDGQYPCFWGFNV 118

Query: 3361 ASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDPQRVAGMM 3182
            A GLY+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRI+AL RKIN+E DPQR++GM 
Sbjct: 119  ALGLYSSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEVDPQRISGMA 178

Query: 3181 AEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRLQDKNIIL 3002
            AE +RYQDD++ILKQYAENDQV++NGK+ K+Q E+VPALS+ H PIVRPLIRLQ+KNIIL
Sbjct: 179  AEVRRYQDDRNILKQYAENDQVVENGKLFKTQPEIVPALSDNHQPIVRPLIRLQEKNIIL 238

Query: 3001 TRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPESN 2822
            TRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYALEMWRLLDPESN
Sbjct: 239  TRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESN 298

Query: 2821 LIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAF 2642
            LI+S+ELL+RIVCVK+GSRKSLFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVHVVPAF
Sbjct: 299  LINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVIDDRLKVWDEKDQPRVHVVPAF 358

Query: 2641 APYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIKEVPPPLD 2462
            APYYAPQAEANN + VLCVARNVACNVRGGFFKEFD+GLLQR+ E++YEDDIK++    D
Sbjct: 359  APYYAPQAEANNAISVLCVARNVACNVRGGFFKEFDEGLLQRIPEISYEDDIKDIRSAPD 418

Query: 2461 VSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSIDPRIASA 2282
            VSNYL SEDD SVS+GN+D   FDGMAD EVER+ LK+A S    AP T+TS+DPR++  
Sbjct: 419  VSNYLVSEDDASVSNGNRDQPCFDGMADVEVERK-LKDAIS----APSTVTSLDPRLSPP 473

Query: 2281 LQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNSPAREEGE 2102
            LQ               QG    F N                    E T+Q+SPAREEGE
Sbjct: 474  LQFAVAASSGLAPQPAAQGSIMPFSNKQFPQSASLIKPL-----APEPTMQSSPAREEGE 528

Query: 2101 VPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRVQSRG-WFPVEEEMS 1925
            VPESELDPDTRRRLLILQHGQDTR+HA  +P FP RPP+QVSVPRVQSRG WFP +EEMS
Sbjct: 529  VPESELDPDTRRRLLILQHGQDTREHASSDPPFPVRPPIQVSVPRVQSRGSWFPADEEMS 588

Query: 1924 PRQPNRVTXXXXXXXXXXXXXIDKHRSHHPPFIHKVESSVPPGRVLLESQKMPKEALPRE 1745
            PRQ NR               I+KHR HHP F HKVESS    R+L E+Q++ KE L R+
Sbjct: 589  PRQLNRAV-PKEFPLDSDTMHIEKHRPHHPSFFHKVESSASSDRILHENQRLSKEVLHRD 647

Query: 1744 EPLRINQSLPNFHSLSGEDAPIAQPSSGNKDLDLEAGQIDLYPETPAGALQDIAYKCGAK 1565
            + LR+N SLP +HS SGE+ P+ + SS N+DLD E+G+   Y ETPA  LQ+IA KCG K
Sbjct: 648  DRLRLNHSLPGYHSFSGEEVPLGR-SSSNRDLDFESGRGAPYAETPAVGLQEIAMKCGTK 706

Query: 1564 VEFRQALVSSMELQFSVEVLFAGEKIGEGIGRTXXXXXXXXXQGSLMNLADKYLSRKPDS 1385
            +EFR +LV++ ELQFS+EV FAGEKIGEG G+T         + SLM L+ +YL      
Sbjct: 707  LEFRPSLVAATELQFSIEVWFAGEKIGEGTGKTRREAQCQAAEASLMYLSYRYLH----- 761

Query: 1384 SSIPGDGSRFANPYDNGLIGDINSFGYQPLPKEEVASFSMASESPRILDPRLEASKKSMG 1205
                GD +RF N  DN  + D NSFGYQ  PKE   SFS ASES R+LDPRLE+SKKSMG
Sbjct: 762  ----GDVNRFPNASDNNFMSDTNSFGYQSFPKEGSMSFSTASESSRLLDPRLESSKKSMG 817

Query: 1204 SIAELKELCMMQGLGVAFQTQPQFSGNPGQKNEVYAQVEIDGQVLGKGIGLTWDEAKTQA 1025
            SI+ LKELCMM+GLGV F +QP  S N  QK E+ AQVEIDGQVLGKG G TWD+AK QA
Sbjct: 818  SISALKELCMMEGLGVEFLSQPPLSSNSTQKEEICAQVEIDGQVLGKGTGSTWDDAKMQA 877

Query: 1024 AEKALGTLRSMPGQFSYKRQDSPRPLQGMPNKRFKPEFSRSFQRMPSSARYPKNASPVP 848
            AEKALG+L+SM GQFS KRQ SPR LQGM  KR K EF+R  QR PSS RY KN SPVP
Sbjct: 878  AEKALGSLKSMLGQFSQKRQGSPRSLQGM-GKRLKSEFTRGLQRTPSSGRYSKNTSPVP 935


>ref|XP_008225045.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            [Prunus mume]
          Length = 959

 Score = 1239 bits (3206), Expect = 0.0
 Identities = 651/965 (67%), Positives = 756/965 (78%), Gaps = 7/965 (0%)
 Frame = -2

Query: 3721 VVYEGERVLGEVAVYPTQSQD-----GVVWGERLKEIRISHYSQPSERCPPLAVLHTITS 3557
            VVY+GE +LGEV +YP ++++      +V  + LKEIRIS++SQ SERCPP+AVLHTI+S
Sbjct: 5    VVYKGEELLGEVEIYPEENENKNKNKNLV--DELKEIRISYFSQSSERCPPVAVLHTISS 62

Query: 3556 TGIAFKLESSQSPQHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYEGQVSCF 3377
             G+ FK+ES  S Q QD+PL +LH++C+ +NKTAVM LGGEELHLVAMHSR  + +  CF
Sbjct: 63   HGVCFKMESKTS-QSQDTPLFLLHSSCVMENKTAVMPLGGEELHLVAMHSRNSDKRYPCF 121

Query: 3376 WGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDPQR 3197
            WGF+VA GLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL RKI+SE D QR
Sbjct: 122  WGFSVAPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKISSEVDSQR 181

Query: 3196 VAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRLQD 3017
            ++GM+AE KRYQDDK ILKQYAENDQV++NG+VIK+QSE VPALS+ H PI+RPLIRL +
Sbjct: 182  ISGMLAEIKRYQDDKFILKQYAENDQVVENGRVIKTQSEAVPALSDNHQPIIRPLIRLLE 241

Query: 3016 KNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLL 2837
            KNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYALEMWRLL
Sbjct: 242  KNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLL 301

Query: 2836 DPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVH 2657
            DP+SNLI+S +LL+RIVCVK+GSRKSLFNVFQ+  CHPKMALVIDDRLKVWD++DQPRVH
Sbjct: 302  DPDSNLINSNKLLDRIVCVKSGSRKSLFNVFQESLCHPKMALVIDDRLKVWDDRDQPRVH 361

Query: 2656 VVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIKEV 2477
            VVPAFAPYYAPQAEANN VPVLCVARNVACNVRGGFF+EFDD LLQ++ EV YEDDIK+V
Sbjct: 362  VVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFREFDDSLLQKIPEVFYEDDIKDV 421

Query: 2476 PPPLDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSIDP 2297
            P P DVSNYL SEDD S  +GN+D + FDG+ D EVERR +KEA+SA+S     +TSIDP
Sbjct: 422  PSP-DVSNYLVSEDDSSALNGNRDPLPFDGITDVEVERR-MKEATSAASMVSSVVTSIDP 479

Query: 2296 RIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNSPA 2117
            R+AS LQ             T Q    SFP+             L   G  E +LQ+SPA
Sbjct: 480  RLAS-LQYTVAPSSSTLSLPTTQPSVMSFPS-IQFPQAASLVKPLGHVGSTEPSLQSSPA 537

Query: 2116 REEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRVQSR-GWFPV 1940
            REEGEVPESELDPDTRRRLLILQHGQDTRD  P EP FP RPPMQ SVPR QSR GWFPV
Sbjct: 538  REEGEVPESELDPDTRRRLLILQHGQDTRDQPPSEPPFPVRPPMQASVPRAQSRPGWFPV 597

Query: 1939 EEEMSPRQPNRVTXXXXXXXXXXXXXIDKHRSHHPPFIHKVESSVPPGRVLLESQKMPKE 1760
            EEEMSPRQ +R+              I+KHR HH  F  KVE+S+P  R+L E+Q++PKE
Sbjct: 598  EEEMSPRQLSRMV-PKDLPLDPEPVQIEKHRPHHSSFFPKVENSIPSDRILQENQRLPKE 656

Query: 1759 ALPREEPLRINQSLPNFHSLSGEDAPIAQPSSGNKDLDLEAGQIDLYPETPAGALQDIAY 1580
            A  R++ LR N +L  +HSLSGE+ P+++ SS N+D+D E+G+     ETPAG LQ+IA 
Sbjct: 657  AFHRDDRLRFNHALSGYHSLSGEEIPLSRSSSSNRDVDFESGRAISNAETPAGVLQEIAM 716

Query: 1579 KCGAKVEFRQALVSSMELQFSVEVLFAGEKIGEGIGRTXXXXXXXXXQGSLMNLADKYLS 1400
            KCGAKVEFR ALV+SMELQF VE  FAGEKIGEG G+T         +GSL NLA+ YLS
Sbjct: 717  KCGAKVEFRPALVASMELQFYVEAWFAGEKIGEGSGKTRREAHYQAAEGSLKNLANIYLS 776

Query: 1399 R-KPDSSSIPGDGSRFANPYDNGLIGDINSFGYQPLPKEEVASFSMASESPRILDPRLEA 1223
            R KPDS S+ GD ++F N   NG  G++NSFG QP PKEE  S S +SE  R LDPRLE 
Sbjct: 777  RVKPDSVSVHGDMNKFPNVNSNGFAGNLNSFGIQPFPKEESLSSSTSSEPSRPLDPRLEG 836

Query: 1222 SKKSMGSIAELKELCMMQGLGVAFQTQPQFSGNPGQKNEVYAQVEIDGQVLGKGIGLTWD 1043
            SKKSM S++ LKELCMM+GLGV FQ +P  S N  +K+EV+ QVEIDG+VLGKGIGLTWD
Sbjct: 837  SKKSMSSVSTLKELCMMEGLGVVFQPRPPPSTNSVEKDEVHVQVEIDGEVLGKGIGLTWD 896

Query: 1042 EAKTQAAEKALGTLRSMPGQFSYKRQDSPRPLQGMPNKRFKPEFSRSFQRMPSSARYPKN 863
            EAK QAAEKALG+L S    ++ KRQ SPR LQGM +KR K EF +  QRMPSSARYPKN
Sbjct: 897  EAKMQAAEKALGSLTST--LYAQKRQGSPRSLQGMSSKRMKQEFPQVLQRMPSSARYPKN 954

Query: 862  ASPVP 848
            A PVP
Sbjct: 955  APPVP 959


>ref|XP_012091568.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            [Jatropha curcas] gi|802784113|ref|XP_012091569.1|
            PREDICTED: RNA polymerase II C-terminal domain
            phosphatase-like 1 [Jatropha curcas]
          Length = 976

 Score = 1226 bits (3171), Expect = 0.0
 Identities = 640/973 (65%), Positives = 754/973 (77%), Gaps = 16/973 (1%)
 Frame = -2

Query: 3718 VYEGERVLGEVAVYPTQSQ------------DGVVWGERLKEIRISHYSQPSERCPPLAV 3575
            VY+GE +LGEV +YP Q Q            D ++ G   KEIRISH+SQPSERCPPLAV
Sbjct: 12   VYKGEELLGEVEIYPQQHQQQEEENNKKKLIDEILMG---KEIRISHFSQPSERCPPLAV 68

Query: 3574 LHTITSTGIAFKLESSQSPQHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYE 3395
            LHTIT  G+ FK+ES  S    D+PL +LH++C+++NKTAV+ LGGEELHLVA++SR  E
Sbjct: 69   LHTITC-GMCFKMESKNSLS-LDTPLHLLHSSCIQENKTAVVPLGGEELHLVAIYSRNNE 126

Query: 3394 GQVSCFWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINS 3215
             Q  CFWGFNV++GLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL RKIN+
Sbjct: 127  RQYPCFWGFNVSAGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINT 186

Query: 3214 ESDPQRVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRP 3035
            E DPQR+AGM++E KRYQDDK+ILKQY ENDQVI+NG+VIK+Q EVVPALS+ H  IVRP
Sbjct: 187  EVDPQRIAGMLSEVKRYQDDKTILKQYVENDQVIENGRVIKTQFEVVPALSDNHQTIVRP 246

Query: 3034 LIRLQDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYAL 2855
            LIRLQ++NIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYAL
Sbjct: 247  LIRLQERNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYAL 306

Query: 2854 EMWRLLDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEK 2675
            EMWRLLDPESNLI S+ELL+RIVCVK+G RKSLFNVFQDG CHPKMALVIDDRLKVWDEK
Sbjct: 307  EMWRLLDPESNLISSKELLDRIVCVKSGLRKSLFNVFQDGVCHPKMALVIDDRLKVWDEK 366

Query: 2674 DQPRVHVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYE 2495
            DQPRVHVVPAFAPYYAPQAEANN VPVLCVARNVACNVRGGFFKEFD+GLLQR+ +++YE
Sbjct: 367  DQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDEGLLQRIPDISYE 426

Query: 2494 DDIKEVPPPLDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLT 2315
            DD  ++P P DVS+YL SEDD S S+G++D + FDGMADAEVE+R LKEA SA+S  P T
Sbjct: 427  DDFNDIPSPPDVSSYLISEDDASTSNGHRDPLSFDGMADAEVEKR-LKEAISAASLFPAT 485

Query: 2314 MTSIDPRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETT 2135
            + ++DPR+  ALQ            +T Q     F N             LAQ GP E +
Sbjct: 486  VNNLDPRVIPALQYSLASSSSSIPVSTSQPLVMPFSN-IQFPQAASLVKPLAQVGPPEPS 544

Query: 2134 LQNSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRVQSR 1955
            LQ+SPAREEGEVPESELDPDTRRRLLILQHGQDTRD+   E Q P RP MQVSVPRVQSR
Sbjct: 545  LQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDNVSSESQIPVRPSMQVSVPRVQSR 604

Query: 1954 G-WFPVEEEMSPRQPNRVTXXXXXXXXXXXXXIDKHRSHHPPFIHKVESSVPPGR--VLL 1784
            G W PVEEEMSPRQ N +T             I+KH+ HHP F  KVE+ +   R  ++ 
Sbjct: 605  GSWVPVEEEMSPRQLN-LTVPREFPLELEPMHIEKHQPHHPSFFPKVENPISSDRMGMVN 663

Query: 1783 ESQKMPKEALPREEPLRINQSLPNFHSLSGEDAPIAQPSSGNKDLDLEAGQIDLYPETPA 1604
            E+ ++PK A  R++ LR N ++ N+H LSGE+ P+++ SS N+D D E+ +     ETP 
Sbjct: 664  ENLRLPKAAPYRDDRLRSNHTMANYHPLSGEEIPLSRSSSSNRDPDFESERAVSSAETPV 723

Query: 1603 GALQDIAYKCGAKVEFRQALVSSMELQFSVEVLFAGEKIGEGIGRTXXXXXXXXXQGSLM 1424
             ALQ+IA KCGAKVEFR +LV S +LQFS E  FAGE++GEGIG+T         + S+ 
Sbjct: 724  EALQEIAMKCGAKVEFRASLVDSRDLQFSTEAWFAGERVGEGIGKTRREAQRLAAESSIK 783

Query: 1423 NLADKYLSR-KPDSSSIPGDGSRFANPYDNGLIGDINSFGYQPLPKEEVASFSMASESPR 1247
            NLA+ Y+ R KPD+ ++ GD SR+++  DNG +G++NSFG QPLPK+E  S S ASE  R
Sbjct: 784  NLANIYMQRAKPDNGAMHGDASRYSSANDNGYLGNVNSFGSQPLPKDEPVSSSAASEQLR 843

Query: 1246 ILDPRLEASKKSMGSIAELKELCMMQGLGVAFQTQPQFSGNPGQKNEVYAQVEIDGQVLG 1067
            + DPRL++SKK++GS+  LKE CMM+GLG+ F +    S N  QK+EVYAQVEIDGQV+G
Sbjct: 844  LPDPRLDSSKKAVGSVTALKEFCMMEGLGLNFLSPTPLSSNSLQKDEVYAQVEIDGQVMG 903

Query: 1066 KGIGLTWDEAKTQAAEKALGTLRSMPGQFSYKRQDSPRPLQGMPNKRFKPEFSRSFQRMP 887
            KGIG TWDEAK QAAE+ALG+LR+M GQF+ KRQ SPRP QGM NKR KPEF R  QRMP
Sbjct: 904  KGIGSTWDEAKMQAAERALGSLRTMFGQFTPKRQGSPRPTQGMSNKRLKPEFPRGLQRMP 963

Query: 886  SSARYPKNASPVP 848
            SS RYPKNA PVP
Sbjct: 964  SSTRYPKNAPPVP 976


>gb|KDP20941.1| hypothetical protein JCGZ_21412 [Jatropha curcas]
          Length = 970

 Score = 1226 bits (3171), Expect = 0.0
 Identities = 640/973 (65%), Positives = 754/973 (77%), Gaps = 16/973 (1%)
 Frame = -2

Query: 3718 VYEGERVLGEVAVYPTQSQ------------DGVVWGERLKEIRISHYSQPSERCPPLAV 3575
            VY+GE +LGEV +YP Q Q            D ++ G   KEIRISH+SQPSERCPPLAV
Sbjct: 6    VYKGEELLGEVEIYPQQHQQQEEENNKKKLIDEILMG---KEIRISHFSQPSERCPPLAV 62

Query: 3574 LHTITSTGIAFKLESSQSPQHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYE 3395
            LHTIT  G+ FK+ES  S    D+PL +LH++C+++NKTAV+ LGGEELHLVA++SR  E
Sbjct: 63   LHTITC-GMCFKMESKNSLS-LDTPLHLLHSSCIQENKTAVVPLGGEELHLVAIYSRNNE 120

Query: 3394 GQVSCFWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINS 3215
             Q  CFWGFNV++GLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL RKIN+
Sbjct: 121  RQYPCFWGFNVSAGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINT 180

Query: 3214 ESDPQRVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRP 3035
            E DPQR+AGM++E KRYQDDK+ILKQY ENDQVI+NG+VIK+Q EVVPALS+ H  IVRP
Sbjct: 181  EVDPQRIAGMLSEVKRYQDDKTILKQYVENDQVIENGRVIKTQFEVVPALSDNHQTIVRP 240

Query: 3034 LIRLQDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYAL 2855
            LIRLQ++NIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYAL
Sbjct: 241  LIRLQERNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYAL 300

Query: 2854 EMWRLLDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEK 2675
            EMWRLLDPESNLI S+ELL+RIVCVK+G RKSLFNVFQDG CHPKMALVIDDRLKVWDEK
Sbjct: 301  EMWRLLDPESNLISSKELLDRIVCVKSGLRKSLFNVFQDGVCHPKMALVIDDRLKVWDEK 360

Query: 2674 DQPRVHVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYE 2495
            DQPRVHVVPAFAPYYAPQAEANN VPVLCVARNVACNVRGGFFKEFD+GLLQR+ +++YE
Sbjct: 361  DQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDEGLLQRIPDISYE 420

Query: 2494 DDIKEVPPPLDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLT 2315
            DD  ++P P DVS+YL SEDD S S+G++D + FDGMADAEVE+R LKEA SA+S  P T
Sbjct: 421  DDFNDIPSPPDVSSYLISEDDASTSNGHRDPLSFDGMADAEVEKR-LKEAISAASLFPAT 479

Query: 2314 MTSIDPRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETT 2135
            + ++DPR+  ALQ            +T Q     F N             LAQ GP E +
Sbjct: 480  VNNLDPRVIPALQYSLASSSSSIPVSTSQPLVMPFSN-IQFPQAASLVKPLAQVGPPEPS 538

Query: 2134 LQNSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRVQSR 1955
            LQ+SPAREEGEVPESELDPDTRRRLLILQHGQDTRD+   E Q P RP MQVSVPRVQSR
Sbjct: 539  LQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDNVSSESQIPVRPSMQVSVPRVQSR 598

Query: 1954 G-WFPVEEEMSPRQPNRVTXXXXXXXXXXXXXIDKHRSHHPPFIHKVESSVPPGR--VLL 1784
            G W PVEEEMSPRQ N +T             I+KH+ HHP F  KVE+ +   R  ++ 
Sbjct: 599  GSWVPVEEEMSPRQLN-LTVPREFPLELEPMHIEKHQPHHPSFFPKVENPISSDRMGMVN 657

Query: 1783 ESQKMPKEALPREEPLRINQSLPNFHSLSGEDAPIAQPSSGNKDLDLEAGQIDLYPETPA 1604
            E+ ++PK A  R++ LR N ++ N+H LSGE+ P+++ SS N+D D E+ +     ETP 
Sbjct: 658  ENLRLPKAAPYRDDRLRSNHTMANYHPLSGEEIPLSRSSSSNRDPDFESERAVSSAETPV 717

Query: 1603 GALQDIAYKCGAKVEFRQALVSSMELQFSVEVLFAGEKIGEGIGRTXXXXXXXXXQGSLM 1424
             ALQ+IA KCGAKVEFR +LV S +LQFS E  FAGE++GEGIG+T         + S+ 
Sbjct: 718  EALQEIAMKCGAKVEFRASLVDSRDLQFSTEAWFAGERVGEGIGKTRREAQRLAAESSIK 777

Query: 1423 NLADKYLSR-KPDSSSIPGDGSRFANPYDNGLIGDINSFGYQPLPKEEVASFSMASESPR 1247
            NLA+ Y+ R KPD+ ++ GD SR+++  DNG +G++NSFG QPLPK+E  S S ASE  R
Sbjct: 778  NLANIYMQRAKPDNGAMHGDASRYSSANDNGYLGNVNSFGSQPLPKDEPVSSSAASEQLR 837

Query: 1246 ILDPRLEASKKSMGSIAELKELCMMQGLGVAFQTQPQFSGNPGQKNEVYAQVEIDGQVLG 1067
            + DPRL++SKK++GS+  LKE CMM+GLG+ F +    S N  QK+EVYAQVEIDGQV+G
Sbjct: 838  LPDPRLDSSKKAVGSVTALKEFCMMEGLGLNFLSPTPLSSNSLQKDEVYAQVEIDGQVMG 897

Query: 1066 KGIGLTWDEAKTQAAEKALGTLRSMPGQFSYKRQDSPRPLQGMPNKRFKPEFSRSFQRMP 887
            KGIG TWDEAK QAAE+ALG+LR+M GQF+ KRQ SPRP QGM NKR KPEF R  QRMP
Sbjct: 898  KGIGSTWDEAKMQAAERALGSLRTMFGQFTPKRQGSPRPTQGMSNKRLKPEFPRGLQRMP 957

Query: 886  SSARYPKNASPVP 848
            SS RYPKNA PVP
Sbjct: 958  SSTRYPKNAPPVP 970


>ref|XP_008371347.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            [Malus domestica]
          Length = 960

 Score = 1221 bits (3158), Expect = 0.0
 Identities = 648/963 (67%), Positives = 752/963 (78%), Gaps = 6/963 (0%)
 Frame = -2

Query: 3718 VYEGERVLGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTITST-GIAF 3542
            VY+GE +LGEV +YPT +++     + LKEIRIS++SQPSERCPP+AVLHTI S+ G+ F
Sbjct: 5    VYKGEDLLGEVEIYPTVNENNKNVQDVLKEIRISYFSQPSERCPPVAVLHTINSSNGVCF 64

Query: 3541 KL-ESSQSP-QHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYEGQVSCFWGF 3368
            K+ ES  SP    D+PL +LH++  ++NKTAVM LGGEELHLVAM SR    Q  CFWGF
Sbjct: 65   KMMESKTSPLSSPDTPLFLLHSSMTQENKTAVMPLGGEELHLVAMQSRNGGKQFPCFWGF 124

Query: 3367 NVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDPQRVAG 3188
             VASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL RKI++E DP R++G
Sbjct: 125  YVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKISTEVDPLRISG 184

Query: 3187 MMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRLQDKNI 3008
            M+AE KRYQDDK ILKQYAENDQV+DNG+V+K+QSEVVPALS+ H PI+RPLIRL +KNI
Sbjct: 185  MLAEIKRYQDDKFILKQYAENDQVVDNGRVVKTQSEVVPALSDNHQPIIRPLIRLHEKNI 244

Query: 3007 ILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPE 2828
            ILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYALEMWRLLDP+
Sbjct: 245  ILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPD 304

Query: 2827 SNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVP 2648
            SNLI+S +LL+RIVCVK+GSRKSLFNVFQ+  CHPKMALVIDDRLKVWDE+DQPRVHVVP
Sbjct: 305  SNLINSNKLLDRIVCVKSGSRKSLFNVFQESLCHPKMALVIDDRLKVWDERDQPRVHVVP 364

Query: 2647 AFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIKEVPPP 2468
            AFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDD LLQ++ E  YEDDIK+VP P
Sbjct: 365  AFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDSLLQKIPEFFYEDDIKDVPSP 424

Query: 2467 LDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSIDPRIA 2288
             DVSN+L SEDDPS  +GN+D + FDGMADAEVERR LKEA+SA+ TA   +T+IDPR+A
Sbjct: 425  -DVSNHLVSEDDPSALNGNRDPLTFDGMADAEVERR-LKEATSAALTASSVVTNIDPRLA 482

Query: 2287 SALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNSPAREE 2108
            S              P++ Q P T FPN             L   G AE +L +SPAREE
Sbjct: 483  SLQYSMAPSSSTTSLPSSQQSPMT-FPN-IQFPQGASVVKPLGHLGAAEPSLHSSPAREE 540

Query: 2107 GEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRVQSR-GWFPVEEE 1931
            GEVPESELDPDTRRRLLILQHGQDTR+  P EP F  RPP+Q SVPRVQ R GWFPVEEE
Sbjct: 541  GEVPESELDPDTRRRLLILQHGQDTREPPPSEPPFAVRPPVQASVPRVQPRPGWFPVEEE 600

Query: 1930 MSPRQPNRVTXXXXXXXXXXXXXIDKHRSHHPPFIHKVESSVPPGRVLLESQKMPKEALP 1751
            MSPRQ +R T             I+KHR HH  F  KV++S+P  R+L E+Q+ PKEA  
Sbjct: 601  MSPRQLSR-TVPKELPLDPDPMQIEKHRPHHSSFFSKVDNSIPSDRILQENQRFPKEAFH 659

Query: 1750 REEPLRINQSLPNFHSLSGEDAPIAQPSSGNKDLDLEAGQIDLYPETPAGALQDIAYKCG 1571
            R++ LR N +   +HS+SGE+ P+++  S N+D+D E+G+     ETPAGALQ+IA KCG
Sbjct: 660  RDDRLRFNHASAGYHSVSGEEIPLSRSPSMNRDVDFESGRAISNAETPAGALQEIAMKCG 719

Query: 1570 AKVEFRQALVSSMELQFSVEVLFAGEKIGEGIGRTXXXXXXXXXQGSLMNLADKYLSR-K 1394
            AKVEFR ALV+S ELQF VE  FAGEKIGEG G+T         +GSL NLA+ YLSR K
Sbjct: 720  AKVEFRPALVASTELQFYVEAWFAGEKIGEGTGKTRREAHFQAAEGSLKNLANIYLSRVK 779

Query: 1393 PDSSSIPGDGSRFANPYDNGLIGDINSFGYQPLPKEEVASFSMASESPRILDPRLEASKK 1214
            PDS  + G+ S+F+N  +NG +G+ NSFG Q  PKEE  S S +SE  R LDPRLE  +K
Sbjct: 780  PDSVPVHGEMSKFSNANNNGFVGNANSFGIQSFPKEESLSSSTSSEPSRPLDPRLEGFQK 839

Query: 1213 SMGSIAELKELCMMQGL-GVAFQTQPQFSGNPGQKNEVYAQVEIDGQVLGKGIGLTWDEA 1037
            SM S++ LKELCM++GL GV FQ +P  S N  +K+EV+ QVEIDG+VLGKGIGLTWDEA
Sbjct: 840  SMNSVSALKELCMIEGLGGVVFQPRPPPSANSVEKDEVHVQVEIDGEVLGKGIGLTWDEA 899

Query: 1036 KTQAAEKALGTLRSMPGQFSYKRQDSPRPLQGMPNKRFKPEFSRSFQRMPSSARYPKNAS 857
            K QAAEKALG+LRS    F+ KRQ SPR  QGMPNKR K EF +  QRMPSSARYPKNA 
Sbjct: 900  KMQAAEKALGSLRST--LFAQKRQGSPRSFQGMPNKRMKQEFPQVLQRMPSSARYPKNAP 957

Query: 856  PVP 848
            PVP
Sbjct: 958  PVP 960


>ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citrus clementina]
            gi|557551913|gb|ESR62542.1| hypothetical protein
            CICLE_v10014168mg [Citrus clementina]
          Length = 957

 Score = 1220 bits (3157), Expect = 0.0
 Identities = 648/965 (67%), Positives = 745/965 (77%), Gaps = 7/965 (0%)
 Frame = -2

Query: 3721 VVYEGERVLGEVAVYPTQSQDGVVWGERLK----EIRISHYSQPSERCPPLAVLHTITST 3554
            V Y G+ +LGEV +YP Q  +G    E+ K    EIRIS++S+ SERCPPLAVLHTIT++
Sbjct: 5    VAYLGKEILGEVEIYPQQQGEGGEGEEKNKKVFDEIRISYFSEASERCPPLAVLHTITAS 64

Query: 3553 GIAFKLESSQSPQHQDSPLTVLHATCLRDNKTAVMSLG-GEELHLVAMHSRKYEGQVSCF 3377
            GI FK+ES  S   Q   L +LH++C+R+NKTAVM LG  EELHLVAM+SR  E Q  CF
Sbjct: 65   GICFKMESKSSDNVQ---LHLLHSSCIRENKTAVMLLGLTEELHLVAMYSRNNEKQYPCF 121

Query: 3376 WGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDPQR 3197
            W F+V SGLYNSCL MLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKI++E DPQR
Sbjct: 122  WAFSVGSGLYNSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKISTEVDPQR 181

Query: 3196 VAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRLQD 3017
            +AGM AE KRYQDDK+ILKQYAENDQV +NGKVIK QSEVVPALS+ H  +VRPLIRLQ+
Sbjct: 182  IAGMQAEVKRYQDDKNILKQYAENDQVNENGKVIKVQSEVVPALSDSHQALVRPLIRLQE 241

Query: 3016 KNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLL 2837
            KNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYALEMWRLL
Sbjct: 242  KNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLL 301

Query: 2836 DPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVH 2657
            DPESNLI+++ELL+RIVCVK+GSRKSLFNVFQDG CHPKMALVIDDRLKVWDEKDQ RVH
Sbjct: 302  DPESNLINTKELLDRIVCVKSGSRKSLFNVFQDGTCHPKMALVIDDRLKVWDEKDQSRVH 361

Query: 2656 VVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIKEV 2477
            VVPAFAPYYAPQAEANN +PVLCVARN+ACNVRGGFFKEFD+GLLQR+ E++YEDD+KE+
Sbjct: 362  VVPAFAPYYAPQAEANNAIPVLCVARNIACNVRGGFFKEFDEGLLQRIPEISYEDDVKEI 421

Query: 2476 PPPLDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSIDP 2297
            P P DVSNYL SEDD + ++G KD + FDGMADAEVERR LKEA +AS+T    + ++DP
Sbjct: 422  PSPPDVSNYLVSEDDAATANGIKDPLSFDGMADAEVERR-LKEAIAASATISSAVANLDP 480

Query: 2296 RIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNSPA 2117
            R+A   Q             T Q       N             L   GP E  LQ+SPA
Sbjct: 481  RLA-PFQYTMPSSSSTTTLPTSQAAVMPLAN-MQFPPATSLVKPLGHVGPPEQCLQSSPA 538

Query: 2116 REEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRVQSRG-WFPV 1940
            REEGEVPESELDPDTRRRLLILQHG DTR++AP E  FPAR  MQVSVPRV SRG WFPV
Sbjct: 539  REEGEVPESELDPDTRRRLLILQHGMDTRENAPSEAPFPARTQMQVSVPRVPSRGSWFPV 598

Query: 1939 EEEMSPRQPNRVTXXXXXXXXXXXXXIDKHRSHHPPFIHKVESSVPPGRVLLESQKMPKE 1760
            EEEMSPRQ NR               I+KHR  HP F  K+E+S+   R   E+Q+MPKE
Sbjct: 599  EEEMSPRQLNRAV-PKEFPLNSEAMQIEKHRPPHPSFFPKIENSITSDRP-HENQRMPKE 656

Query: 1759 ALPREEPLRINQSLPNFHSLSGEDAPIAQPSSGNKDLDLEAGQIDLYPETPAGALQDIAY 1580
            AL R++ LR+N +L ++ S SGE+ P+++ SS ++D+D E+G+     ETP+G LQDIA 
Sbjct: 657  ALRRDDRLRLNHTLSDYQSFSGEEIPLSRSSSSSRDVDFESGRDVSSTETPSGVLQDIAM 716

Query: 1579 KCGAKVEFRQALVSSMELQFSVEVLFAGEKIGEGIGRTXXXXXXXXXQGSLMNLADKYLS 1400
            KCG KVEFR ALV+S ELQFS+E  FAGEKIGEGIGRT         +GS+ +LA+ Y+ 
Sbjct: 717  KCGTKVEFRPALVASTELQFSIEAWFAGEKIGEGIGRTRREAQRQAAEGSIKHLANVYVL 776

Query: 1399 R-KPDSSSIPGDGSRFANPYDNGLIGDINSFGYQPLPKEEVASFSMASESPRILDPRLEA 1223
            R K DS S  GDGSRF+N  +N  +G+INSFG QPL K+E    S++SE  +++DPRLE 
Sbjct: 777  RVKSDSGSGHGDGSRFSNANENCFMGEINSFGGQPLAKDE----SLSSEPSKLVDPRLEG 832

Query: 1222 SKKSMGSIAELKELCMMQGLGVAFQTQPQFSGNPGQKNEVYAQVEIDGQVLGKGIGLTWD 1043
            SKK MGS++ LKELCM +GLGV FQ QP  S N  QK+EVYAQVEIDGQVLGKGIG TWD
Sbjct: 833  SKKLMGSVSALKELCMTEGLGVVFQQQPPSSANSVQKDEVYAQVEIDGQVLGKGIGSTWD 892

Query: 1042 EAKTQAAEKALGTLRSMPGQFSYKRQDSPRPLQGMPNKRFKPEFSRSFQRMPSSARYPKN 863
            EAK QAAEKALG+LRSM GQF  K Q SPR LQGMPNKR KPEF R  QRMP S RYPKN
Sbjct: 893  EAKMQAAEKALGSLRSMFGQFPQKHQGSPRSLQGMPNKRLKPEFPRVLQRMPPSGRYPKN 952

Query: 862  ASPVP 848
            A PVP
Sbjct: 953  APPVP 957


>ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Citrus sinensis] gi|641857111|gb|KDO75877.1|
            hypothetical protein CISIN_1g002166mg [Citrus sinensis]
          Length = 957

 Score = 1219 bits (3153), Expect = 0.0
 Identities = 646/965 (66%), Positives = 745/965 (77%), Gaps = 7/965 (0%)
 Frame = -2

Query: 3721 VVYEGERVLGEVAVYPTQSQDGVVWGERLK----EIRISHYSQPSERCPPLAVLHTITST 3554
            V Y G+ +LGEV +YP Q  +G    E+ K    EIRIS++S+ SERCPPLAVLHTIT++
Sbjct: 5    VAYLGKEILGEVEIYPQQQGEGGEGEEKNKKVFDEIRISYFSEASERCPPLAVLHTITAS 64

Query: 3553 GIAFKLESSQSPQHQDSPLTVLHATCLRDNKTAVMSLG-GEELHLVAMHSRKYEGQVSCF 3377
            GI FK+ES  S   Q   L +LH++C+R+NKTAVM LG  EELHLVAM+SR  E Q  CF
Sbjct: 65   GICFKMESKSSDNIQ---LHLLHSSCIRENKTAVMPLGLTEELHLVAMYSRNNEKQYPCF 121

Query: 3376 WGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDPQR 3197
            W F+V SGLYNSCL MLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKI++E DPQR
Sbjct: 122  WAFSVGSGLYNSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKISTEVDPQR 181

Query: 3196 VAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRLQD 3017
            +AGM AE KRYQDDK+ILKQYAENDQV +NGKVIK QSEVVPALS+ H  +VRPLIRLQ+
Sbjct: 182  IAGMQAEVKRYQDDKNILKQYAENDQVNENGKVIKVQSEVVPALSDSHQALVRPLIRLQE 241

Query: 3016 KNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLL 2837
            KNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYALEMWRLL
Sbjct: 242  KNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLL 301

Query: 2836 DPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVH 2657
            DPESNLI+++ELL+RIVCVK+GSRKSLFNVFQDG CHPKMALVIDDRLKVWD+KDQPRVH
Sbjct: 302  DPESNLINTKELLDRIVCVKSGSRKSLFNVFQDGTCHPKMALVIDDRLKVWDDKDQPRVH 361

Query: 2656 VVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIKEV 2477
            VVPAFAPYYAPQAEANN +PVLCVARN+ACNVRGGFFKEFD+GLLQR+ E++YEDD+K++
Sbjct: 362  VVPAFAPYYAPQAEANNAIPVLCVARNIACNVRGGFFKEFDEGLLQRIPEISYEDDVKDI 421

Query: 2476 PPPLDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSIDP 2297
            P P DVSNYL SEDD + ++G KD + FDGMADAEVERR LKEA +AS+T    + ++DP
Sbjct: 422  PSPPDVSNYLVSEDDAATANGIKDPLSFDGMADAEVERR-LKEAIAASATISSAVANLDP 480

Query: 2296 RIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNSPA 2117
            R+A   Q             T Q       N             L   GP E +LQ+SPA
Sbjct: 481  RLA-PFQYTMPSSSSTTTLPTSQAAVMPLAN-MQFPPATSLVKPLGHVGPPEQSLQSSPA 538

Query: 2116 REEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRVQSRG-WFPV 1940
            REEGEVPESELDPDTRRRLLILQHG DTR++AP E  FPAR  MQVSVPRV SRG WFPV
Sbjct: 539  REEGEVPESELDPDTRRRLLILQHGMDTRENAPSEAPFPARTQMQVSVPRVPSRGSWFPV 598

Query: 1939 EEEMSPRQPNRVTXXXXXXXXXXXXXIDKHRSHHPPFIHKVESSVPPGRVLLESQKMPKE 1760
            EEEMSPRQ NR               I+KHR  HP F  K+E+     R   E+Q+MPKE
Sbjct: 599  EEEMSPRQLNRAV-PKEFPLNSEAMQIEKHRPPHPSFFPKIENPSTSDRP-HENQRMPKE 656

Query: 1759 ALPREEPLRINQSLPNFHSLSGEDAPIAQPSSGNKDLDLEAGQIDLYPETPAGALQDIAY 1580
            AL R++ LR+N +L ++ S SGE+ P+++ SS ++D+D E+G+     ETP+G LQDIA 
Sbjct: 657  ALRRDDRLRLNHTLSDYQSFSGEEIPLSRSSSSSRDVDFESGRDVSSTETPSGVLQDIAM 716

Query: 1579 KCGAKVEFRQALVSSMELQFSVEVLFAGEKIGEGIGRTXXXXXXXXXQGSLMNLADKYLS 1400
            KCG KVEFR ALV+S ELQFS+E  FAGEKIGEGIGRT         +GS+ +LA+ Y+ 
Sbjct: 717  KCGTKVEFRPALVASTELQFSIEAWFAGEKIGEGIGRTRREAQRQAAEGSIKHLANVYML 776

Query: 1399 R-KPDSSSIPGDGSRFANPYDNGLIGDINSFGYQPLPKEEVASFSMASESPRILDPRLEA 1223
            R K DS S  GDGSRF+N  +N  +G+INSFG QPL K+E    S++SE  +++DPRLE 
Sbjct: 777  RVKSDSGSGHGDGSRFSNANENCFMGEINSFGGQPLAKDE----SLSSEPSKLVDPRLEG 832

Query: 1222 SKKSMGSIAELKELCMMQGLGVAFQTQPQFSGNPGQKNEVYAQVEIDGQVLGKGIGLTWD 1043
            SKK MGS++ LKELCM +GLGV FQ QP  S N  QK+EVYAQVEIDGQVLGKGIG TWD
Sbjct: 833  SKKLMGSVSALKELCMTEGLGVVFQQQPPSSANSVQKDEVYAQVEIDGQVLGKGIGSTWD 892

Query: 1042 EAKTQAAEKALGTLRSMPGQFSYKRQDSPRPLQGMPNKRFKPEFSRSFQRMPSSARYPKN 863
            EAK QAAEKALG+LRSM GQF  K Q SPR LQGMPNKR KPEF R  QRMP S RYPKN
Sbjct: 893  EAKMQAAEKALGSLRSMFGQFPQKHQGSPRSLQGMPNKRLKPEFPRVLQRMPPSGRYPKN 952

Query: 862  ASPVP 848
            A PVP
Sbjct: 953  APPVP 957


>ref|XP_009336327.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            [Pyrus x bretschneideri]
          Length = 960

 Score = 1218 bits (3151), Expect = 0.0
 Identities = 647/963 (67%), Positives = 752/963 (78%), Gaps = 6/963 (0%)
 Frame = -2

Query: 3718 VYEGERVLGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTITST-GIAF 3542
            VY+GE +LGEV +YPT +++     + LKEIRIS++SQPSERCPP+AVLHTI S+ G+ F
Sbjct: 5    VYKGEDLLGEVEIYPTVNENNKNVLDELKEIRISYFSQPSERCPPVAVLHTINSSNGVCF 64

Query: 3541 KL-ESSQSP-QHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYEGQVSCFWGF 3368
            KL ES  SP    D+PL +LH++  ++NKTAVM LGGEELHLVAM SR    Q  CFWGF
Sbjct: 65   KLMESKTSPLSSPDTPLFLLHSSMTQENKTAVMPLGGEELHLVAMQSRNGGKQCPCFWGF 124

Query: 3367 NVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDPQRVAG 3188
             VASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL RKI++E DPQR++G
Sbjct: 125  YVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKISTEVDPQRISG 184

Query: 3187 MMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRLQDKNI 3008
            M AE KRYQDDK ILKQYAENDQV+DNG+V+K+QSEVVPALS+ H PI+RPLIRL +KNI
Sbjct: 185  MFAEIKRYQDDKFILKQYAENDQVVDNGRVVKTQSEVVPALSDNHQPIIRPLIRLHEKNI 244

Query: 3007 ILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPE 2828
            ILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYALEMWRLLDP+
Sbjct: 245  ILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPD 304

Query: 2827 SNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVP 2648
            SNLI+S +LL+RIVCVK+GSRKSLFNVFQ+  CHPKMALVIDDRLKVWD++DQPRVHVVP
Sbjct: 305  SNLINSNKLLDRIVCVKSGSRKSLFNVFQESLCHPKMALVIDDRLKVWDDRDQPRVHVVP 364

Query: 2647 AFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIKEVPPP 2468
            AFAPYYAPQAE NNTVPVLCVARNVACNVRGGFFKEFDD LLQ++ E+ YEDDIK+VP P
Sbjct: 365  AFAPYYAPQAEGNNTVPVLCVARNVACNVRGGFFKEFDDSLLQKIPEIFYEDDIKDVPSP 424

Query: 2467 LDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSIDPRIA 2288
             DVSN+L SEDD S  +GN+D + FDGMADAEVERR LKEA+SA+ TA L +T+IDPR+A
Sbjct: 425  -DVSNHLVSEDDTSAVNGNRDPLAFDGMADAEVERR-LKEATSAALTASLVVTNIDPRLA 482

Query: 2287 SALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNSPAREE 2108
            S LQ             + Q  A +FPN             L   G AE +L +SPAREE
Sbjct: 483  S-LQYSIAPSSSTTSLPSSQQSAMTFPN-IQFPQAASVVKPLGHLGAAEPSLHSSPAREE 540

Query: 2107 GEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRVQSR-GWFPVEEE 1931
            GEVPESELDPDTRRRLLILQHGQDTR+  P EP F ARPP+Q SVPRVQ   GWFPVEEE
Sbjct: 541  GEVPESELDPDTRRRLLILQHGQDTREPPPSEPPFAARPPVQASVPRVQPHPGWFPVEEE 600

Query: 1930 MSPRQPNRVTXXXXXXXXXXXXXIDKHRSHHPPFIHKVESSVPPGRVLLESQKMPKEALP 1751
            MSPRQ +R+              I+KHR HH  F  KV++S+P  R+L ++Q+ PKEA  
Sbjct: 601  MSPRQLSRMV-PKELPLDPDPMQIEKHRPHHSAFFSKVDNSIPSDRILQDNQRFPKEAFH 659

Query: 1750 REEPLRINQSLPNFHSLSGEDAPIAQPSSGNKDLDLEAGQIDLYPETPAGALQDIAYKCG 1571
            R++ LR N +   +HS+SGE+ P+++ SS N+D+D E+G+     ETPAGALQ+IA KCG
Sbjct: 660  RDDRLRFNHASAGYHSVSGEEIPLSRSSSMNRDVDFESGRAISNAETPAGALQEIAMKCG 719

Query: 1570 AKVEFRQALVSSMELQFSVEVLFAGEKIGEGIGRTXXXXXXXXXQGSLMNLADKYLSR-K 1394
            AKVEFR ALV+S ELQF VE  FAGEKIGEG G+T         +GSL NLA+ YLSR K
Sbjct: 720  AKVEFRPALVASTELQFYVEAWFAGEKIGEGSGKTRREAHFQAAEGSLKNLANIYLSRVK 779

Query: 1393 PDSSSIPGDGSRFANPYDNGLIGDINSFGYQPLPKEEVASFSMASESPRILDPRLEASKK 1214
             DS  + G+ S+F+N  +NG +G+ NSFG Q  PKEE  S S +SE  R LDPRLE  +K
Sbjct: 780  LDSVPVNGEMSKFSNVNNNGFVGNANSFGIQSFPKEESLSSSTSSEPSRPLDPRLEGFQK 839

Query: 1213 SMGSIAELKELCMMQGL-GVAFQTQPQFSGNPGQKNEVYAQVEIDGQVLGKGIGLTWDEA 1037
            SM S++ LKELCMM+GL GV FQ +P  S N  +K+EV+ QVEIDG+VLGKGIGLTWDEA
Sbjct: 840  SMNSVSALKELCMMEGLGGVVFQPRPPPSANSVEKDEVHVQVEIDGEVLGKGIGLTWDEA 899

Query: 1036 KTQAAEKALGTLRSMPGQFSYKRQDSPRPLQGMPNKRFKPEFSRSFQRMPSSARYPKNAS 857
            K QAAEKALG+LRS    F+ KRQ SPR  QGMPNKR K EF +  QRMPSSARYPKNA 
Sbjct: 900  KMQAAEKALGSLRST--LFAQKRQVSPRSFQGMPNKRMKQEFPQVLQRMPSSARYPKNAP 957

Query: 856  PVP 848
            PVP
Sbjct: 958  PVP 960


>ref|XP_008383777.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X1 [Malus domestica]
          Length = 956

 Score = 1215 bits (3144), Expect = 0.0
 Identities = 643/962 (66%), Positives = 752/962 (78%), Gaps = 5/962 (0%)
 Frame = -2

Query: 3718 VYEGERVLGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTITSTGIAFK 3539
            VY+GE +LGEV +YPT++++  +  E LKEIRIS++SQPSERCPP+AVLHTI+S G+ FK
Sbjct: 5    VYKGEDLLGEVEIYPTENENKNIQDE-LKEIRISYFSQPSERCPPVAVLHTISSNGVCFK 63

Query: 3538 L-ESSQSPQH-QDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYEGQVSCFWGFN 3365
            + ES  SP   QD+PL +LH++  ++NKTAVM LGGEELHLVAM SR  + Q  CFWGF 
Sbjct: 64   MMESKTSPSSSQDTPLFLLHSSMTQENKTAVMPLGGEELHLVAMQSRNGDKQCPCFWGFY 123

Query: 3364 VASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDPQRVAGM 3185
            VASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL RKI++E DPQR++GM
Sbjct: 124  VASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKISTEVDPQRISGM 183

Query: 3184 MAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRLQDKNII 3005
            +AE KRYQDDK ILKQYAENDQV+DNG+VIK+Q+EVVPALS+ H PI+RPLIRL +KNII
Sbjct: 184  LAEIKRYQDDKFILKQYAENDQVLDNGRVIKTQAEVVPALSDNHQPIIRPLIRLHEKNII 243

Query: 3004 LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPES 2825
            LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYALEMWRLLDP+S
Sbjct: 244  LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDS 303

Query: 2824 NLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPA 2645
            NLI+  +LL+RIVCVK+GSRKSLF+VFQ+  CHPKMALVIDDRLKVWD++DQPRVHVVPA
Sbjct: 304  NLINPTKLLDRIVCVKSGSRKSLFSVFQESLCHPKMALVIDDRLKVWDDRDQPRVHVVPA 363

Query: 2644 FAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIKEVPPPL 2465
            FAPYYAPQAEANN VPVLCVARNVACNVRGGFF+EFDD LLQ++ E+ YEDDIK+VP P 
Sbjct: 364  FAPYYAPQAEANNAVPVLCVARNVACNVRGGFFREFDDSLLQKIPEIFYEDDIKDVPSP- 422

Query: 2464 DVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSIDPRIAS 2285
            DVSNYL SEDD S  +GN+D + FDGMAD EVERR LKEA+SA+ TA   +T++DPR+AS
Sbjct: 423  DVSNYLVSEDDGSAINGNRDPLTFDGMADIEVERR-LKEATSAALTASSVVTNVDPRLAS 481

Query: 2284 ALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNSPAREEG 2105
             LQ             + Q  A  FP+             L   G AE +L +SPAREEG
Sbjct: 482  -LQYSMAPSSSIISLPSSQPSAMHFPS-IQFPQAASVVKPLGHLGAAEPSLHSSPAREEG 539

Query: 2104 EVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRVQSR-GWFPVEEEM 1928
            EVPESELDPDTRRRLLILQHGQDTR+  P EP FP R P+Q SVPRVQ R GWFPVEEEM
Sbjct: 540  EVPESELDPDTRRRLLILQHGQDTREPPPSEPPFPVRSPVQASVPRVQPRPGWFPVEEEM 599

Query: 1927 SPRQPNRVTXXXXXXXXXXXXXIDKHRSHHPPFIHKVESSVPPGRVLLESQKMPKEALPR 1748
            SPRQ +R+              I+KHR HH  F  KV++S+P  R+L E+Q++PKEA  R
Sbjct: 600  SPRQLSRMV-PKELPLDPDPMQIEKHRPHHSSFFSKVDNSIPSDRILQENQRLPKEAFHR 658

Query: 1747 EEPLRINQSLPNFHSLSGEDAPIAQPSSGNKDLDLEAGQIDLYPETPAGALQDIAYKCGA 1568
            ++ LR N  L  +HS+SGE+ P+++ SS N+D+D E+GQ     ETPAGALQ+IA KCGA
Sbjct: 659  DDRLRFNHELAGYHSMSGEEIPLSRSSSMNRDVDFESGQAISNAETPAGALQEIAMKCGA 718

Query: 1567 KVEFRQALVSSMELQFSVEVLFAGEKIGEGIGRTXXXXXXXXXQGSLMNLADKYLSR-KP 1391
            KVEFR ALV+S ELQF VE  FAGEKIGEG G+T         +GSL NLA+ YLSR K 
Sbjct: 719  KVEFRPALVASAELQFYVEASFAGEKIGEGTGKTRREAHFQAAEGSLKNLANVYLSRFKH 778

Query: 1390 DSSSIPGDGSRFANPYDNGLIGDINSFGYQPLPKEEVASFSMASESPRILDPRLEASKKS 1211
            DS  + G+  +F N  +NG +G+ NSFG Q  PK+E  S S +SES R LDPRLE  KKS
Sbjct: 779  DSVPVQGEMIKFPNVNNNGFVGNANSFGIQSFPKDE--SLSSSSESSRPLDPRLEGPKKS 836

Query: 1210 MGSIAELKELCMMQGL-GVAFQTQPQFSGNPGQKNEVYAQVEIDGQVLGKGIGLTWDEAK 1034
            M S++ LKELCMM+GL GV FQ +P  S N  +K+EV+ QVEIDG+VLGKGIGLTWDEAK
Sbjct: 837  MSSVSALKELCMMEGLGGVVFQPRPPPSANSVEKDEVHVQVEIDGEVLGKGIGLTWDEAK 896

Query: 1033 TQAAEKALGTLRSMPGQFSYKRQDSPRPLQGMPNKRFKPEFSRSFQRMPSSARYPKNASP 854
             QAAEKAL +LR  P  F+ KRQ SPR  QGMPNKR K EF +  QRMPSS+RYPKNA P
Sbjct: 897  MQAAEKALRSLR--PTLFAQKRQGSPRSFQGMPNKRMKQEFPQVLQRMPSSSRYPKNAPP 954

Query: 853  VP 848
            VP
Sbjct: 955  VP 956


>ref|XP_012455431.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            [Gossypium raimondii] gi|763802547|gb|KJB69485.1|
            hypothetical protein B456_011G025900 [Gossypium
            raimondii]
          Length = 973

 Score = 1211 bits (3132), Expect = 0.0
 Identities = 645/976 (66%), Positives = 740/976 (75%), Gaps = 18/976 (1%)
 Frame = -2

Query: 3721 VVYEGERVLGEVAVYPTQSQ----------DGVVWGERLKEIRISHYSQPSERCPPLAVL 3572
            VV  G+ VLGEV +YP Q Q             V  E +KEIRI + +Q SERCPPLAVL
Sbjct: 7    VVCRGDEVLGEVEIYPQQQQLREEEEEYGGKITVMEEEMKEIRIGYLTQGSERCPPLAVL 66

Query: 3571 HTITSTGIAFKLESSQSPQHQDS-----PLTVLHATCLRDNKTAVMSLGGEELHLVAMHS 3407
            HTITSTGI FK+ESS+   +  S     PL +LH+ C+RDNKTAVM +G  ELHLVAM+S
Sbjct: 67   HTITSTGICFKMESSKDNNYSSSFQDTPPLHLLHSECIRDNKTAVMPMGDCELHLVAMYS 126

Query: 3406 RKYEGQVSCFWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLR 3227
            R  +    CFWGFNVA GLY+SCLVMLNLRCLGIVFDLDETL+VANTMRSFEDRIEAL R
Sbjct: 127  RNSDRP--CFWGFNVARGLYDSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQR 184

Query: 3226 KINSESDPQRVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLP 3047
            K+N+E D QR AGMMAE KRYQDDK+ILKQYAENDQV++NGKVIK QSE+V  LS+ H P
Sbjct: 185  KMNTEVDTQRAAGMMAEIKRYQDDKAILKQYAENDQVVENGKVIKVQSEIVQPLSDNHQP 244

Query: 3046 IVRPLIRLQDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAER 2867
            I+RPLIRLQ+KNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAER
Sbjct: 245  IIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAER 304

Query: 2866 DYALEMWRLLDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKV 2687
            DYALEMWRLLDPESNLI+S+ELL+RIVCVK+G RKSLFNVFQDG CHPKMALVIDDRLKV
Sbjct: 305  DYALEMWRLLDPESNLINSKELLDRIVCVKSGLRKSLFNVFQDGICHPKMALVIDDRLKV 364

Query: 2686 WDEKDQPRVHVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSE 2507
            WDEKDQPRVHVVPAFAPY+APQAEANNT+PVLCVARNVACNVRGGFF+EFD+GLLQ++ E
Sbjct: 365  WDEKDQPRVHVVPAFAPYFAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGLLQKIPE 424

Query: 2506 VAYEDDIKEVPPPLDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASST 2327
            ++YEDDIK++P P DV NYL SEDD S S+ NKD   FDGMADAEVERR LKEA SA+ST
Sbjct: 425  ISYEDDIKDIPSPPDVGNYLVSEDDTSASTANKDPPIFDGMADAEVERR-LKEAISAAST 483

Query: 2326 APLTMTSIDPRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGP 2147
                  ++DPR+AS+LQ              +Q    S+PN                  P
Sbjct: 484  VSSASINLDPRLASSLQ-FTMPSSSSVPLLAVQSSMASYPNMQFPQAAQVIKPVAPVVSP 542

Query: 2146 AETTLQNSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQF-PARPPMQVSVP 1970
             E +LQ+SPAREEGEVPESELDPDTRRRLLILQHGQDTRDH P EP F PARP MQV V 
Sbjct: 543  -EPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPARPAMQVPVS 601

Query: 1969 RVQSRG-WFPVEEEMSPRQPNRVTXXXXXXXXXXXXXIDKHRSHHPPFIHKVESSVPPGR 1793
            R QSRG WF  +EEMSPRQ NR               ++KHR   PPF  KVES +P  R
Sbjct: 602  RAQSRGSWFSSDEEMSPRQLNRAV-PKEFPLDSEQMHMEKHRG--PPFFPKVESPIPSER 658

Query: 1792 VLLESQKMPKEALPREEPLRINQSLPNFHSLSGEDAPIAQPSSGNKDLDLEAGQIDLYPE 1613
            +L E+Q++PKEAL R++ L +N +  ++HS  GE+ P+ + SS +KDLD E+G+     E
Sbjct: 659  LLRENQRLPKEALHRDDRLGLNHTPSSYHSFPGEEMPLGRSSSSHKDLDFESGRTIPSGE 718

Query: 1612 TPAGALQDIAYKCGAKVEFRQALVSSMELQFSVEVLFAGEKIGEGIGRTXXXXXXXXXQG 1433
            TPAG LQDIA KCGAKVEFR ALV+SM+LQFS+E  FAGEK+GEG GRT         + 
Sbjct: 719  TPAGVLQDIAMKCGAKVEFRPALVASMDLQFSIEAWFAGEKVGEGTGRTRREAQRQAAED 778

Query: 1432 SLMNLADKYLSR-KPDSSSIPGDGSRFANPYDNGLIGDINSFGYQPLPKEEVASFSMASE 1256
            S+ +LA+ YLSR KPD+ S  GD SR AN  +NG  G++N +G Q  PKEE   FS A E
Sbjct: 779  SIKSLANTYLSRIKPDTGSTQGDLSRSANTNENGFPGNLNLYGNQQSPKEESMPFSNAPE 838

Query: 1255 SPRILDPRLEASKKSMGSIAELKELCMMQGLGVAFQTQPQFSGNPGQKNEVYAQVEIDGQ 1076
              R+LDPRLE S++SMGS+  LKELCMM+GLGV FQ QP  S N  QK+EVYA+VE+DGQ
Sbjct: 839  PSRLLDPRLEGSRRSMGSVTALKELCMMEGLGVVFQAQPPAS-NTLQKDEVYAEVEVDGQ 897

Query: 1075 VLGKGIGLTWDEAKTQAAEKALGTLRSMPGQFSYKRQDSPRPLQGMPNKRFKPEFSRSFQ 896
            VLGKG G TW+EAK QAAEKALG+LRSM GQF+ KRQ SPR LQ MP+KR KPEF R   
Sbjct: 898  VLGKGTGFTWEEAKMQAAEKALGSLRSMLGQFTQKRQGSPRSLQDMPSKRLKPEFPRVLH 957

Query: 895  RMPSSARYPKNASPVP 848
            RMPSS RY KNA PVP
Sbjct: 958  RMPSSGRYHKNAPPVP 973


Top