BLASTX nr result
ID: Forsythia23_contig00010829
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00010829 (2220 letters)

Database: ./nr
69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_011096253.1| PREDICTED: RNA polymerase II C-terminal doma...   928   0.0
ref|XP_011096251.1| PREDICTED: RNA polymerase II C-terminal doma...   928   0.0
ref|XP_011086104.1| PREDICTED: RNA polymerase II C-terminal doma...   904   0.0
ref|XP_012849004.1| PREDICTED: RNA polymerase II C-terminal doma...   898   0.0
ref|XP_012849005.1| PREDICTED: RNA polymerase II C-terminal doma...   898   0.0
ref|XP_009623032.1| PREDICTED: RNA polymerase II C-terminal doma...   878   0.0
ref|XP_004232844.1| PREDICTED: RNA polymerase II C-terminal doma...   877   0.0
ref|XP_009789678.1| PREDICTED: RNA polymerase II C-terminal doma...   877   0.0
ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal doma...   875   0.0
emb|CDO99573.1| unnamed protein product [Coffea canephora]            868   0.0
ref|XP_002267987.3| PREDICTED: RNA polymerase II C-terminal doma...   849   0.0
emb|CBI35690.3| unnamed protein product [Vitis vinifera]              849   0.0
ref|XP_010100555.1| RNA polymerase II C-terminal domain phosphat...   845   0.0
ref|XP_012091568.1| PREDICTED: RNA polymerase II C-terminal doma...   841   0.0
gb|KDP20941.1| hypothetical protein JCGZ_21412 [Jatropha curcas]      841   0.0
ref|XP_006583810.1| PREDICTED: RNA polymerase II C-terminal doma...   835   0.0
ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal doma...   835   0.0
ref|XP_007025682.1| C-terminal domain phosphatase-like 1 isoform...   832   0.0
ref|XP_007025681.1| C-terminal domain phosphatase-like 1 isoform...
832 0.0 ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform... 832 0.0 >ref|XP_011096253.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 isoform X2 [Sesamum indicum] Length = 768 Score = 928 bits (2399), Expect = 0.0 Identities = 478/593 (80%), Positives = 509/593 (85%) Frame = -1 Query: 1779 MYGKSVVVYEGERALGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTIT 1600 MY K V VYEGER LGE ++P VV G L+EIRISHYS PSERCPPLAVLHTI Sbjct: 1 MYRKLVAVYEGERVLGEAELHPPD----VVLGNELREIRISHYSPPSERCPPLAVLHTIN 56 Query: 1599 STGIAFKLESSQSPQHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYEGQVSC 1420 +TGI FKLES+ ++QDSPL++LHATCLRDNKTAV S+GG E+ LVAMHSRK EGQ C Sbjct: 57 ATGICFKLESTA--KNQDSPLSLLHATCLRDNKTAVASVGGGEIQLVAMHSRKCEGQYPC 114 Query: 1419 FWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDPQ 1240 FWGFNVAS LYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL RKINSESDPQ Sbjct: 115 FWGFNVASSLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINSESDPQ 174 Query: 1239 RVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRLQ 1060 RVAGM+AE KRYQDDKS+LKQYAE+DQVIDNGKV+KSQSEVVPALSE H PIVRPLIRLQ Sbjct: 175 RVAGMLAEVKRYQDDKSVLKQYAESDQVIDNGKVVKSQSEVVPALSETHQPIVRPLIRLQ 234 Query: 1059 DKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRL 880 D+NIILTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRL Sbjct: 235 DRNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRL 294 Query: 879 LDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRV 700 LDPESNLI+S ELL+RIVCVK+G RKSLFNVFQ GNCHPKMALVIDDRLKVWDEKDQPRV Sbjct: 295 LDPESNLINSRELLDRIVCVKSGLRKSLFNVFQAGNCHPKMALVIDDRLKVWDEKDQPRV 354 Query: 699 HVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIKE 520 HVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLL R+S VAYEDD+++ Sbjct: 355 HVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLPRISGVAYEDDMRD 414 Query: 519 VPSPPDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSID 340 VPS PDVSNYL SEDDPS SSGNKDS+GFDGMADAEVERR LKEA+SASST PL + ++D Sbjct: 415 
VPSSPDVSNYLISEDDPSASSGNKDSLGFDGMADAEVERR-LKEATSASSTVPLPIPNLD 473 Query: 339 PRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNSP 160 PRI AL P T+ G A SFP LAQ G AETTLQ+SP Sbjct: 474 PRITPALHYAVPSSSFTVPPQTIHGSAMSFPGQQLSQVTTLLKPPLAQLGQAETTLQSSP 533 Query: 159 AREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRVQ 1 AREEGEVPESELDPDTRRRLLILQHGQD R+H P E QFPARP MQVSVPRVQ Sbjct: 534 AREEGEVPESELDPDTRRRLLILQHGQDMREHPPSESQFPARPSMQVSVPRVQ 586 >ref|XP_011096251.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 isoform X1 [Sesamum indicum] Length = 951 Score = 928 bits (2399), Expect = 0.0 Identities = 478/593 (80%), Positives = 509/593 (85%) Frame = -1 Query: 1779 MYGKSVVVYEGERALGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTIT 1600 MY K V VYEGER LGE ++P VV G L+EIRISHYS PSERCPPLAVLHTI Sbjct: 1 MYRKLVAVYEGERVLGEAELHPPD----VVLGNELREIRISHYSPPSERCPPLAVLHTIN 56 Query: 1599 STGIAFKLESSQSPQHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYEGQVSC 1420 +TGI FKLES+ ++QDSPL++LHATCLRDNKTAV S+GG E+ LVAMHSRK EGQ C Sbjct: 57 ATGICFKLESTA--KNQDSPLSLLHATCLRDNKTAVASVGGGEIQLVAMHSRKCEGQYPC 114 Query: 1419 FWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDPQ 1240 FWGFNVAS LYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL RKINSESDPQ Sbjct: 115 FWGFNVASSLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINSESDPQ 174 Query: 1239 RVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRLQ 1060 RVAGM+AE KRYQDDKS+LKQYAE+DQVIDNGKV+KSQSEVVPALSE H PIVRPLIRLQ Sbjct: 175 RVAGMLAEVKRYQDDKSVLKQYAESDQVIDNGKVVKSQSEVVPALSETHQPIVRPLIRLQ 234 Query: 1059 DKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRL 880 D+NIILTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRL Sbjct: 235 DRNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRL 294 Query: 879 LDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRV 700 LDPESNLI+S ELL+RIVCVK+G RKSLFNVFQ GNCHPKMALVIDDRLKVWDEKDQPRV Sbjct: 295 LDPESNLINSRELLDRIVCVKSGLRKSLFNVFQAGNCHPKMALVIDDRLKVWDEKDQPRV 354 Query: 699 
HVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIKE 520 HVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLL R+S VAYEDD+++ Sbjct: 355 HVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLPRISGVAYEDDMRD 414 Query: 519 VPSPPDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSID 340 VPS PDVSNYL SEDDPS SSGNKDS+GFDGMADAEVERR LKEA+SASST PL + ++D Sbjct: 415 VPSSPDVSNYLISEDDPSASSGNKDSLGFDGMADAEVERR-LKEATSASSTVPLPIPNLD 473 Query: 339 PRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNSP 160 PRI AL P T+ G A SFP LAQ G AETTLQ+SP Sbjct: 474 PRITPALHYAVPSSSFTVPPQTIHGSAMSFPGQQLSQVTTLLKPPLAQLGQAETTLQSSP 533 Query: 159 AREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRVQ 1 AREEGEVPESELDPDTRRRLLILQHGQD R+H P E QFPARP MQVSVPRVQ Sbjct: 534 AREEGEVPESELDPDTRRRLLILQHGQDMREHPPSESQFPARPSMQVSVPRVQ 586 >ref|XP_011086104.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 [Sesamum indicum] Length = 951 Score = 904 bits (2336), Expect = 0.0 Identities = 458/592 (77%), Positives = 507/592 (85%) Frame = -1 Query: 1779 MYGKSVVVYEGERALGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTIT 1600 MYGK V+VYEGER LGEV + Q Q G VWGE +KEIRISHYS PSERCPPLAVLHTI Sbjct: 1 MYGKLVLVYEGERLLGEVEL---QRQGGGVWGEEIKEIRISHYSPPSERCPPLAVLHTIN 57 Query: 1599 STGIAFKLESSQSPQHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYEGQVSC 1420 STGI FKLES+ ++ DSPL++LHATCLRDNKTAV +G E+HLVAMHSRKYEGQ C Sbjct: 58 STGICFKLESTA--KNVDSPLSILHATCLRDNKTAVAIIGEGEIHLVAMHSRKYEGQHPC 115 Query: 1419 FWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDPQ 1240 FWGFNVAS LYNSCL +LNLRCLGIVFDLDETLIVANTMRSFEDRI++L RK+NSESDPQ Sbjct: 116 FWGFNVASSLYNSCLALLNLRCLGIVFDLDETLIVANTMRSFEDRIDSLQRKVNSESDPQ 175 Query: 1239 RVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRLQ 1060 RV+ M+AE KRYQDDK+ILKQYAE+DQVIDNGKVI+SQSEVV ALS+ H IVRPLIRLQ Sbjct: 176 RVSSMLAEVKRYQDDKNILKQYAESDQVIDNGKVIRSQSEVVLALSDNHQTIVRPLIRLQ 235 Query: 1059 DKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRL 880 D+NIILTRINP IRDTSVLVRLRPAWEDL+SYLTA+GRKRFEVFVCTMAERDYALEMWRL 
Sbjct: 236 DRNIILTRINPLIRDTSVLVRLRPAWEDLKSYLTAKGRKRFEVFVCTMAERDYALEMWRL 295 Query: 879 LDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRV 700 LDPESNLI+ +LL+RIVCVK+GSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRV Sbjct: 296 LDPESNLINPRDLLDRIVCVKSGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRV 355 Query: 699 HVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIKE 520 HVVPAFAPY+APQAEANN++PVLC+ARNVACNVRGGFFKEFD+ L+QR+S VAYEDDIK+ Sbjct: 356 HVVPAFAPYFAPQAEANNSIPVLCLARNVACNVRGGFFKEFDESLIQRISGVAYEDDIKD 415 Query: 519 VPSPPDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSID 340 +PSPPDVSNYL EDDPS S+GNKD+IGFDGMADAEVERR LKE+ SASSTA + ++D Sbjct: 416 MPSPPDVSNYLFPEDDPSASNGNKDAIGFDGMADAEVERR-LKESMSASSTAVTPVINLD 474 Query: 339 PRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNSP 160 PRIASA+Q P +QGPA F L+ G ETT ++SP Sbjct: 475 PRIASAIQFAVPSSSFTVHPPKIQGPAVPFLGQQLPPVTTLPKPPLSHLGQGETTFRSSP 534 Query: 159 AREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRV 4 AREEGEVPESELDPDTRRRLLILQHGQD R+H P EPQFPARPP+QV VPRV Sbjct: 535 AREEGEVPESELDPDTRRRLLILQHGQDMREHPPSEPQFPARPPLQVPVPRV 586 >ref|XP_012849004.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 isoform X1 [Erythranthe guttatus] Length = 967 Score = 898 bits (2320), Expect = 0.0 Identities = 461/593 (77%), Positives = 503/593 (84%) Frame = -1 Query: 1779 MYGKSVVVYEGERALGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTIT 1600 MY VVVYEGER LGE + QDGVV G+ LKEIRISHYS PSERCPPLAVLHTI Sbjct: 1 MYRNLVVVYEGERVLGEAEL---NLQDGVVLGKGLKEIRISHYSPPSERCPPLAVLHTIN 57 Query: 1599 STGIAFKLESSQSPQHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYEGQVSC 1420 STGI FKLE++ ++Q+SPL+ LHA+CLRDNKTAV+ +GG E+ LVAMHSRKYEG C Sbjct: 58 STGICFKLEATT--KNQESPLSHLHASCLRDNKTAVVPIGGAEIQLVAMHSRKYEGGNPC 115 Query: 1419 FWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDPQ 1240 FWGFNVAS +YNSCLVMLNLRCLGIVFDLDETL+VANTMRSFEDRIEAL RKINSESD Q Sbjct: 116 FWGFNVASSVYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKINSESDQQ 175 Query: 1239 
RVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRLQ 1060 R +GM+AE KRYQDDK+ILKQYAE+DQVI+NGKVIKSQSEVVPALS H PIVRPLIRLQ Sbjct: 176 RASGMVAEVKRYQDDKNILKQYAESDQVIENGKVIKSQSEVVPALSGTHQPIVRPLIRLQ 235 Query: 1059 DKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRL 880 D+NIILTRINP IRDTSVLVRLRPAWE+LR+YLTARGRKRFEVFVCTMAERDYALEMWRL Sbjct: 236 DRNIILTRINPLIRDTSVLVRLRPAWEELRNYLTARGRKRFEVFVCTMAERDYALEMWRL 295 Query: 879 LDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRV 700 LDPE NLI+S ELLER+VCVK+G RKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRV Sbjct: 296 LDPEFNLINSRELLERVVCVKSGFRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRV 355 Query: 699 HVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIKE 520 HVVPAFAPYYAPQAEANNT+PVLCVARNVACNVRGGFFK+FDDGLLQ +S VAYEDDIK+ Sbjct: 356 HVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFKDFDDGLLQLISGVAYEDDIKD 415 Query: 519 VPSPPDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSID 340 VPS PDVSNYL SEDDPS S GNKDS+ +DGMADAEV+RR LK+A SASSTAP + ++D Sbjct: 416 VPSSPDVSNYLISEDDPSASGGNKDSLVYDGMADAEVQRR-LKDAISASSTAPSPIANLD 474 Query: 339 PRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNSP 160 P +AS L P T QGPA SFP+ L Q G ETT ++SP Sbjct: 475 PIVASVLHYMAPSSSFTAPPPTTQGPAMSFPSQQMHQVATLLKPPLVQLGQGETTSRSSP 534 Query: 159 AREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRVQ 1 AREEGEVPESELDPDTRRR+LILQHGQD R +P EPQFPAR PMQVSVPRVQ Sbjct: 535 AREEGEVPESELDPDTRRRMLILQHGQDMRGPSPSEPQFPARTPMQVSVPRVQ 587 >ref|XP_012849005.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 isoform X2 [Erythranthe guttatus] gi|604315220|gb|EYU27926.1| hypothetical protein MIMGU_mgv1a000848mg [Erythranthe guttata] Length = 962 Score = 898 bits (2320), Expect = 0.0 Identities = 461/593 (77%), Positives = 503/593 (84%) Frame = -1 Query: 1779 MYGKSVVVYEGERALGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTIT 1600 MY VVVYEGER LGE + QDGVV G+ LKEIRISHYS PSERCPPLAVLHTI Sbjct: 1 MYRNLVVVYEGERVLGEAEL---NLQDGVVLGKGLKEIRISHYSPPSERCPPLAVLHTIN 57 Query: 1599 
STGIAFKLESSQSPQHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYEGQVSC 1420 STGI FKLE++ ++Q+SPL+ LHA+CLRDNKTAV+ +GG E+ LVAMHSRKYEG C Sbjct: 58 STGICFKLEATT--KNQESPLSHLHASCLRDNKTAVVPIGGAEIQLVAMHSRKYEGGNPC 115 Query: 1419 FWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDPQ 1240 FWGFNVAS +YNSCLVMLNLRCLGIVFDLDETL+VANTMRSFEDRIEAL RKINSESD Q Sbjct: 116 FWGFNVASSVYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIEALQRKINSESDQQ 175 Query: 1239 RVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRLQ 1060 R +GM+AE KRYQDDK+ILKQYAE+DQVI+NGKVIKSQSEVVPALS H PIVRPLIRLQ Sbjct: 176 RASGMVAEVKRYQDDKNILKQYAESDQVIENGKVIKSQSEVVPALSGTHQPIVRPLIRLQ 235 Query: 1059 DKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRL 880 D+NIILTRINP IRDTSVLVRLRPAWE+LR+YLTARGRKRFEVFVCTMAERDYALEMWRL Sbjct: 236 DRNIILTRINPLIRDTSVLVRLRPAWEELRNYLTARGRKRFEVFVCTMAERDYALEMWRL 295 Query: 879 LDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRV 700 LDPE NLI+S ELLER+VCVK+G RKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRV Sbjct: 296 LDPEFNLINSRELLERVVCVKSGFRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRV 355 Query: 699 HVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIKE 520 HVVPAFAPYYAPQAEANNT+PVLCVARNVACNVRGGFFK+FDDGLLQ +S VAYEDDIK+ Sbjct: 356 HVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFKDFDDGLLQLISGVAYEDDIKD 415 Query: 519 VPSPPDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSID 340 VPS PDVSNYL SEDDPS S GNKDS+ +DGMADAEV+RR LK+A SASSTAP + ++D Sbjct: 416 VPSSPDVSNYLISEDDPSASGGNKDSLVYDGMADAEVQRR-LKDAISASSTAPSPIANLD 474 Query: 339 PRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNSP 160 P +AS L P T QGPA SFP+ L Q G ETT ++SP Sbjct: 475 PIVASVLHYMAPSSSFTAPPPTTQGPAMSFPSQQMHQVATLLKPPLVQLGQGETTSRSSP 534 Query: 159 AREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRVQ 1 AREEGEVPESELDPDTRRR+LILQHGQD R +P EPQFPAR PMQVSVPRVQ Sbjct: 535 AREEGEVPESELDPDTRRRMLILQHGQDMRGPSPSEPQFPARTPMQVSVPRVQ 587 >ref|XP_009623032.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 [Nicotiana tomentosiformis] 
gi|697137919|ref|XP_009623033.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 [Nicotiana tomentosiformis] Length = 965 Score = 878 bits (2269), Expect = 0.0 Identities = 454/600 (75%), Positives = 502/600 (83%), Gaps = 7/600 (1%) Frame = -1 Query: 1779 MYGKSVVVYEGERALGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTIT 1600 MY VV+YEGER +GE+ + ++GVVWGE++ IRISHYS PSERCPPLAVLHTIT Sbjct: 1 MYNSVVVLYEGERVVGELELL-YGGENGVVWGEKV--IRISHYSPPSERCPPLAVLHTIT 57 Query: 1599 ST-----GIAFKLESSQSPQ-HQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKY 1438 S+ GI+FKLE ++S QDSPL +LH+TCLRDNKTAV+SLG EELHLVAM S+ + Sbjct: 58 SSSTTGNGISFKLEPTKSKSLSQDSPLFLLHSTCLRDNKTAVVSLGREELHLVAMQSKNF 117 Query: 1437 EGQVSCFWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKIN 1258 GQ CFWGF VASGLYNSCL MLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKIN Sbjct: 118 GGQCPCFWGFKVASGLYNSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKIN 177 Query: 1257 SESDPQRVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVR 1078 SESDPQR + M++E KRYQ+DK LKQYAENDQVIDNGKVIKSQSEV PALS+ H PIVR Sbjct: 178 SESDPQRASAMLSEVKRYQEDKIFLKQYAENDQVIDNGKVIKSQSEVFPALSDNHQPIVR 237 Query: 1077 PLIRLQDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYA 898 PLIRLQD+NIILTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYA Sbjct: 238 PLIRLQDRNIILTRINPMIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYA 297 Query: 897 LEMWRLLDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDE 718 LEMWRLLDP+SNLI+S+ELL+RIVCVK+G RKSLFNVFQDGNCHPKMALVIDDRLKVWDE Sbjct: 298 LEMWRLLDPDSNLINSKELLDRIVCVKSGLRKSLFNVFQDGNCHPKMALVIDDRLKVWDE 357 Query: 717 KDQPRVHVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAY 538 KDQPRVHVVPAFAPY++PQAE NN+VPVLCVARNVACNVRGGFFK+FD+GLLQR+SEVAY Sbjct: 358 KDQPRVHVVPAFAPYFSPQAEGNNSVPVLCVARNVACNVRGGFFKDFDEGLLQRISEVAY 417 Query: 537 EDDIKEVPSPPDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPL 358 EDDIK+VPS PDVSNYL SEDDPS +GNKDS+GFDGMAD EVERR LKEA AS++ P Sbjct: 418 EDDIKQVPSAPDVSNYLLSEDDPSAVNGNKDSLGFDGMADTEVERR-LKEAMLASTSVPS 476 Query: 357 
TMTSIDPRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAET 178 MT+ DPRIA ALQ +T+Q P FP + Q P +T Sbjct: 477 QMTNSDPRIAPALQ---YPVPPAISQSTIQAPVVPFPAQHLPQVTSVLKSSVTQLSPQDT 533 Query: 177 TLQNSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSV-PRVQ 1 +LQ+SPAREEGEVPESELDPDTRRRLLILQHGQDTRD EPQFP P+QVSV PRVQ Sbjct: 534 SLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDQVSSEPQFPMGTPLQVSVPPRVQ 593 >ref|XP_004232844.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 [Solanum lycopersicum] Length = 954 Score = 877 bits (2267), Expect = 0.0 Identities = 450/595 (75%), Positives = 503/595 (84%), Gaps = 2/595 (0%) Frame = -1 Query: 1779 MYGKSVVVYEGERALGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTIT 1600 M+ +V++YEGER +GEV +Y + GVVWGE+L IRISHYS SERCPPLAVLHT+T Sbjct: 1 MFKSTVLLYEGERLVGEVEMY---GEKGVVWGEKL--IRISHYSPSSERCPPLAVLHTVT 55 Query: 1599 STGIAFKLESSQS-PQHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYEGQVS 1423 TG++FKLE ++S P QDSPLT+LH+TCLRDNKTAVMSLG EELHLVAM S+ GQ Sbjct: 56 -TGLSFKLEPTKSKPLTQDSPLTLLHSTCLRDNKTAVMSLGREELHLVAMQSKNIGGQCP 114 Query: 1422 CFWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDP 1243 CFWGF VASGLY+SCL MLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL RKINSESDP Sbjct: 115 CFWGFKVASGLYDSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINSESDP 174 Query: 1242 QRVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRL 1063 QR + M+AE KRYQ+DK ILKQYAENDQV+DNGKVI+SQSEV PALS+ H PIVRPLIRL Sbjct: 175 QRASVMLAEVKRYQEDKIILKQYAENDQVVDNGKVIRSQSEVFPALSDNHQPIVRPLIRL 234 Query: 1062 QDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWR 883 QD+NIILTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYALEMWR Sbjct: 235 QDRNIILTRINPMIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWR 294 Query: 882 LLDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPR 703 LLDP+SNLI+S+ELL+RIVCVK+G RKSLFNVFQDGNCHPKMALVIDDRLKVWD+KDQPR Sbjct: 295 LLDPDSNLINSQELLDRIVCVKSGLRKSLFNVFQDGNCHPKMALVIDDRLKVWDDKDQPR 354 Query: 702 VHVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIK 523 VHVVPAFAPY+APQAE 
NN+VPVLCVARNVACNVRGGFFK+FD+GLLQR+SEVAYEDDIK Sbjct: 355 VHVVPAFAPYFAPQAEGNNSVPVLCVARNVACNVRGGFFKDFDEGLLQRISEVAYEDDIK 414 Query: 522 EVPSPPDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSI 343 +VPS PDVSNYL SEDDPS +GNKDS+GFDGMAD+EVERR LKEA AS++ P MT++ Sbjct: 415 QVPSAPDVSNYLISEDDPSAVNGNKDSLGFDGMADSEVERR-LKEAMLASTSVPSQMTNL 473 Query: 342 DPRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNS 163 DPR+ ALQ ++QGP FP + Q P +T+LQ+S Sbjct: 474 DPRLVPALQ---YPVPPVISQPSIQGPVVPFPTQHLPQVTSVLKSSVTQISPQDTSLQSS 530 Query: 162 PAREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSV-PRVQ 1 PAREEGEVPESELDPDTRRRLLILQHGQDTRD EP+FP P+QVSV PRVQ Sbjct: 531 PAREEGEVPESELDPDTRRRLLILQHGQDTRDQVSSEPKFPIGTPLQVSVPPRVQ 585 >ref|XP_009789678.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 [Nicotiana sylvestris] gi|698485837|ref|XP_009789679.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 [Nicotiana sylvestris] gi|698485839|ref|XP_009789680.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 [Nicotiana sylvestris] Length = 965 Score = 877 bits (2265), Expect = 0.0 Identities = 454/600 (75%), Positives = 502/600 (83%), Gaps = 7/600 (1%) Frame = -1 Query: 1779 MYGKSVVVYEGERALGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTIT 1600 MY VV+YEGER +GE+ + ++GVVWGE++ IRISHYS PSERCPPLAVLHTIT Sbjct: 1 MYKSVVVLYEGERVVGELELL-YGGENGVVWGEKV--IRISHYSPPSERCPPLAVLHTIT 57 Query: 1599 ST-----GIAFKLESSQSPQ-HQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKY 1438 S+ GI+FKLE ++S QDSPL +LH+TCLRDNKTAV+SLG EELHLVAM S+ + Sbjct: 58 SSSTTGNGISFKLEPTKSKSLSQDSPLFLLHSTCLRDNKTAVVSLGREELHLVAMQSKNF 117 Query: 1437 EGQVSCFWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKIN 1258 GQ CFWGF VASGLYNSCL MLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKIN Sbjct: 118 GGQCPCFWGFKVASGLYNSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKIN 177 Query: 1257 SESDPQRVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVR 1078 SESDPQR + M+AE KRYQ+DK LKQYAENDQVIDNGKVIKSQSEV PALS+ H PIVR Sbjct: 178 
SESDPQRASAMLAEVKRYQEDKIFLKQYAENDQVIDNGKVIKSQSEVFPALSDNHQPIVR 237 Query: 1077 PLIRLQDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYA 898 PLIRLQD+NIILTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYA Sbjct: 238 PLIRLQDRNIILTRINPMIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYA 297 Query: 897 LEMWRLLDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDE 718 LEMWRLLDP+SNLI+S+ELL+RIVCVK+G RKSLFNVFQDGNCHPKMALVIDDRLKVWDE Sbjct: 298 LEMWRLLDPDSNLINSKELLDRIVCVKSGLRKSLFNVFQDGNCHPKMALVIDDRLKVWDE 357 Query: 717 KDQPRVHVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAY 538 KDQPRVHVVPAFAPY++PQAE NN+VPVLCVARNVACNVRGGFFK+FD+GLLQR+SEVAY Sbjct: 358 KDQPRVHVVPAFAPYFSPQAEGNNSVPVLCVARNVACNVRGGFFKDFDEGLLQRISEVAY 417 Query: 537 EDDIKEVPSPPDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPL 358 EDDIK+VPS PDVSNYL SEDDPS +G+KDS+GFDGMAD EVERR LKEA AS++ P Sbjct: 418 EDDIKQVPSAPDVSNYLISEDDPSAVNGSKDSLGFDGMADTEVERR-LKEAMLASTSVPS 476 Query: 357 TMTSIDPRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAET 178 MT+ DPRIA ALQ +T+Q P FP + Q P +T Sbjct: 477 QMTNSDPRIAPALQ---YPVPPAISQSTIQAPVVPFPAQHLPQVTSVLKSSVTQLSPQDT 533 Query: 177 TLQNSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSV-PRVQ 1 +LQ+SPAREEGEVPESELDPDTRRRLLILQHGQDTRD EPQFP P+QVSV PRVQ Sbjct: 534 SLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDQVSSEPQFPMGTPLQVSVPPRVQ 593 >ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like [Solanum tuberosum] Length = 953 Score = 875 bits (2262), Expect = 0.0 Identities = 449/595 (75%), Positives = 502/595 (84%), Gaps = 2/595 (0%) Frame = -1 Query: 1779 MYGKSVVVYEGERALGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTIT 1600 M+ +VV+YEGER +GEV +Y + GV+WGE++ IRISHYS SERCPPLAVLHT+T Sbjct: 1 MFKSTVVLYEGERLVGEVEIY---CEKGVLWGEKV--IRISHYSPSSERCPPLAVLHTVT 55 Query: 1599 STGIAFKLESSQS-PQHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYEGQVS 1423 TG++FKLE ++S P QDSPLT+LH+TCLRDNKTAVMSLG EELHLVAM S+ GQ Sbjct: 56 -TGLSFKLEPTKSKPLTQDSPLTLLHSTCLRDNKTAVMSLGREELHLVAMQSKNIGGQCP 114 Query: 1422 
CFWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDP 1243 CFWGF VASGLY+SCL MLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL RKINSESDP Sbjct: 115 CFWGFKVASGLYDSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINSESDP 174 Query: 1242 QRVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRL 1063 QR + M+AE KRYQ+DK ILKQYAENDQV+DNGKVIKSQSEV PALS+ H PIVRPLIRL Sbjct: 175 QRASVMLAEVKRYQEDKIILKQYAENDQVVDNGKVIKSQSEVFPALSDNHQPIVRPLIRL 234 Query: 1062 QDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWR 883 QD+NIILTRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYALEMWR Sbjct: 235 QDRNIILTRINPMIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWR 294 Query: 882 LLDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPR 703 LLDP+SNLI+S+ELL+RIVCVK+G RKSLFNVFQDGNCHPKMALVIDDRLKVWD+KDQPR Sbjct: 295 LLDPDSNLINSQELLDRIVCVKSGLRKSLFNVFQDGNCHPKMALVIDDRLKVWDDKDQPR 354 Query: 702 VHVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIK 523 VHVVPAFAPY+APQAE NN+VPVLCVARNVACNVRGGFFK+FD+GLLQR+SEVAYEDDIK Sbjct: 355 VHVVPAFAPYFAPQAEGNNSVPVLCVARNVACNVRGGFFKDFDEGLLQRISEVAYEDDIK 414 Query: 522 EVPSPPDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSI 343 +VPS PDVSNYL SEDDPS +GNKDS+GFDGMAD+EVERR LKEA AS++ P MT++ Sbjct: 415 QVPSAPDVSNYLISEDDPSAVNGNKDSLGFDGMADSEVERR-LKEAMLASTSVPSQMTNL 473 Query: 342 DPRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNS 163 DPR+ ALQ ++Q P FP + Q P +T+LQ+S Sbjct: 474 DPRLVPALQ---YPVPPVISQPSIQSPVVPFPTQHLPQVTSVLKSSVTQISPQDTSLQSS 530 Query: 162 PAREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSV-PRVQ 1 PAREEGEVPESELDPDTRRRLLILQHGQDTRD EP+FP P+QVSV PRVQ Sbjct: 531 PAREEGEVPESELDPDTRRRLLILQHGQDTRDQVSSEPKFPMGTPLQVSVPPRVQ 585 >emb|CDO99573.1| unnamed protein product [Coffea canephora] Length = 968 Score = 868 bits (2244), Expect = 0.0 Identities = 449/595 (75%), Positives = 494/595 (83%), Gaps = 7/595 (1%) Frame = -1 Query: 1764 VVVYEGERALGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTITST--- 1594 V + EGER LGEV VY Q+GVVW EIRIS YSQPSERCPPLAVLHT+TS+ Sbjct: 7 
VNLIEGERVLGEVEVYSIDDQNGVVWDR--DEIRISEYSQPSERCPPLAVLHTVTSSSSD 64 Query: 1593 --GIAFKLESSQSPQHQDSPLTVLHATCLRDNKTAVMSLGGE-ELHLVAMHSRKYEGQVS 1423 G+ FKLE Q Q+SPL++LHATCLR+NKTA+M L E ELHLVAMHSR++EGQ Sbjct: 65 SGGLCFKLELKDKSQ-QNSPLSILHATCLRENKTAIMPLDEEDELHLVAMHSRQHEGQFP 123 Query: 1422 CFWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDP 1243 CFWGF VAS LYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL RKINSE D Sbjct: 124 CFWGFIVASRLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINSEVDQ 183 Query: 1242 QRVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRL 1063 QRV+ M+AE KRYQDDK+ILKQYAENDQV+DNGKV+KSQ EVV ALS+ H IVRPL+RL Sbjct: 184 QRVSAMLAEIKRYQDDKNILKQYAENDQVVDNGKVVKSQPEVVLALSDNHQTIVRPLLRL 243 Query: 1062 QDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWR 883 Q+KNIILTRINPQIRDTSVLVRLRPAWEDLR+YLTARGRKRFEV+VCTMAERDYALEMWR Sbjct: 244 QEKNIILTRINPQIRDTSVLVRLRPAWEDLRNYLTARGRKRFEVYVCTMAERDYALEMWR 303 Query: 882 LLDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPR 703 LLDP+SNLID +ELL+RIVCVK+G RKSLFNVFQ GNCHPKMALVIDDRLKVWDEKDQPR Sbjct: 304 LLDPDSNLIDPKELLDRIVCVKSGLRKSLFNVFQHGNCHPKMALVIDDRLKVWDEKDQPR 363 Query: 702 VHVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIK 523 VHVVPAFAPYYAPQAEANN +PVLCVARNVACNVRGGFFKEFD+GLLQR+SEVAYEDDIK Sbjct: 364 VHVVPAFAPYYAPQAEANNAIPVLCVARNVACNVRGGFFKEFDEGLLQRISEVAYEDDIK 423 Query: 522 EVPSPPDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSI 343 E+PSPPDVSNYL SEDDPS S+GNKDS+GFDGMAD EVERR LKEA SASSTAPL + ++ Sbjct: 424 EIPSPPDVSNYLISEDDPSASNGNKDSLGFDGMADVEVERR-LKEAISASSTAPLAIPNL 482 Query: 342 DPRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQR-GPAETTLQN 166 DP+I + +Q TM GP FP+ + Q P E +LQ+ Sbjct: 483 DPKIVATVQ-YAVPSSISVLQPTMSGPVVPFPSQQLSQVTSVLKNPINQAILPPEASLQS 541 Query: 165 SPAREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRVQ 1 SPAREEGEVPESELDPDTRRRLLILQHGQD+R+ EPQFP R P+QVS PR Q Sbjct: 542 SPAREEGEVPESELDPDTRRRLLILQHGQDSRERTSSEPQFPVRTPLQVSAPRAQ 596 >ref|XP_002267987.3| PREDICTED: RNA polymerase II C-terminal domain 
phosphatase-like 1 [Vitis vinifera] Length = 935 Score = 849 bits (2193), Expect = 0.0 Identities = 432/587 (73%), Positives = 490/587 (83%) Frame = -1 Query: 1761 VVYEGERALGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTITSTGIAF 1582 +VYEG+ +GEV +YP G+ E +KEIRISHYSQPSERCPPLAVLHTITS G+ F Sbjct: 5 IVYEGDDVVGEVEIYP--QNQGL---ELMKEIRISHYSQPSERCPPLAVLHTITSCGVCF 59 Query: 1581 KLESSQSPQHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYEGQVSCFWGFNV 1402 K+ESS++ Q QD+PL +LH+TC+R+NKTAVMSLG EELHLVAM+S+K +GQ CFWGFNV Sbjct: 60 KMESSKA-QSQDTPLYLLHSTCIRENKTAVMSLGEEELHLVAMYSKKKDGQYPCFWGFNV 118 Query: 1401 ASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDPQRVAGMM 1222 A GLY+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRI+AL RKIN+E DPQR++GM Sbjct: 119 ALGLYSSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEVDPQRISGMA 178 Query: 1221 AEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRLQDKNIIL 1042 AE +RYQDD++ILKQYAENDQV++NGK+ K+Q E+VPALS+ H PIVRPLIRLQ+KNIIL Sbjct: 179 AEVRRYQDDRNILKQYAENDQVVENGKLFKTQPEIVPALSDNHQPIVRPLIRLQEKNIIL 238 Query: 1041 TRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPESN 862 TRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYALEMWRLLDPESN Sbjct: 239 TRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESN 298 Query: 861 LIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAF 682 LI+S+ELL+RIVCVK+GSRKSLFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVHVVPAF Sbjct: 299 LINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVIDDRLKVWDEKDQPRVHVVPAF 358 Query: 681 APYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIKEVPSPPD 502 APYYAPQAEANN + VLCVARNVACNVRGGFFKEFD+GLLQR+ E++YEDDIK++ S PD Sbjct: 359 APYYAPQAEANNAISVLCVARNVACNVRGGFFKEFDEGLLQRIPEISYEDDIKDIRSAPD 418 Query: 501 VSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSIDPRIASA 322 VSNYL SEDD SVS+GN+D FDGMAD EVER+ LK+A S AP T+TS+DPR++ Sbjct: 419 VSNYLVSEDDASVSNGNRDQPCFDGMADVEVERK-LKDAIS----APSTVTSLDPRLSPP 473 Query: 321 LQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNSPAREEGE 142 LQ QG F N E T+Q+SPAREEGE Sbjct: 474 
LQFAVAASSGLAPQPAAQGSIMPFSNKQFPQSASLIKPL-----APEPTMQSSPAREEGE 528 Query: 141 VPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRVQ 1 VPESELDPDTRRRLLILQHGQDTR+HA +P FP RPP+QVSVPRVQ Sbjct: 529 VPESELDPDTRRRLLILQHGQDTREHASSDPPFPVRPPIQVSVPRVQ 575 >emb|CBI35690.3| unnamed protein product [Vitis vinifera] Length = 788 Score = 849 bits (2193), Expect = 0.0 Identities = 432/587 (73%), Positives = 490/587 (83%) Frame = -1 Query: 1761 VVYEGERALGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTITSTGIAF 1582 +VYEG+ +GEV +YP G+ E +KEIRISHYSQPSERCPPLAVLHTITS G+ F Sbjct: 5 IVYEGDDVVGEVEIYP--QNQGL---ELMKEIRISHYSQPSERCPPLAVLHTITSCGVCF 59 Query: 1581 KLESSQSPQHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYEGQVSCFWGFNV 1402 K+ESS++ Q QD+PL +LH+TC+R+NKTAVMSLG EELHLVAM+S+K +GQ CFWGFNV Sbjct: 60 KMESSKA-QSQDTPLYLLHSTCIRENKTAVMSLGEEELHLVAMYSKKKDGQYPCFWGFNV 118 Query: 1401 ASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDPQRVAGMM 1222 A GLY+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRI+AL RKIN+E DPQR++GM Sbjct: 119 ALGLYSSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEVDPQRISGMA 178 Query: 1221 AEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRLQDKNIIL 1042 AE +RYQDD++ILKQYAENDQV++NGK+ K+Q E+VPALS+ H PIVRPLIRLQ+KNIIL Sbjct: 179 AEVRRYQDDRNILKQYAENDQVVENGKLFKTQPEIVPALSDNHQPIVRPLIRLQEKNIIL 238 Query: 1041 TRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPESN 862 TRINP IRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYALEMWRLLDPESN Sbjct: 239 TRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESN 298 Query: 861 LIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAF 682 LI+S+ELL+RIVCVK+GSRKSLFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVHVVPAF Sbjct: 299 LINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVIDDRLKVWDEKDQPRVHVVPAF 358 Query: 681 APYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIKEVPSPPD 502 APYYAPQAEANN + VLCVARNVACNVRGGFFKEFD+GLLQR+ E++YEDDIK++ S PD Sbjct: 359 APYYAPQAEANNAISVLCVARNVACNVRGGFFKEFDEGLLQRIPEISYEDDIKDIRSAPD 418 Query: 501 VSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSIDPRIASA 322 VSNYL SEDD SVS+GN+D FDGMAD 
EVER+ LK+A S AP T+TS+DPR++ Sbjct: 419 VSNYLVSEDDASVSNGNRDQPCFDGMADVEVERK-LKDAIS----APSTVTSLDPRLSPP 473 Query: 321 LQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNSPAREEGE 142 LQ QG F N E T+Q+SPAREEGE Sbjct: 474 LQFAVAASSGLAPQPAAQGSIMPFSNKQFPQSASLIKPL-----APEPTMQSSPAREEGE 528 Query: 141 VPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRVQ 1 VPESELDPDTRRRLLILQHGQDTR+HA +P FP RPP+QVSVPRVQ Sbjct: 529 VPESELDPDTRRRLLILQHGQDTREHASSDPPFPVRPPIQVSVPRVQ 575 >ref|XP_010100555.1| RNA polymerase II C-terminal domain phosphatase-like 1 [Morus notabilis] gi|587894270|gb|EXB82798.1| RNA polymerase II C-terminal domain phosphatase-like 1 [Morus notabilis] Length = 998 Score = 845 bits (2183), Expect = 0.0 Identities = 443/589 (75%), Positives = 489/589 (83%), Gaps = 2/589 (0%) Frame = -1 Query: 1761 VVYEGERALGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTITST-GIA 1585 VVY+G+ LGEV +YP ++ + LKEIRISH+S PSERCPPLAVLHTITS+ G+ Sbjct: 5 VVYKGKELLGEVEIYPGENNIDHRIIDDLKEIRISHFSPPSERCPPLAVLHTITSSFGVC 64 Query: 1584 FKLESSQSPQHQDSPLTVLHATCLRDNKTAVMSLG-GEELHLVAMHSRKYEGQVSCFWGF 1408 FK+ES S QDSPL +LH++C+ +NKTAVMSLG GEELHLVAM+SR + Q CFWGF Sbjct: 65 FKMESKTS-HSQDSPLFLLHSSCVMENKTAVMSLGAGEELHLVAMYSRNSDKQYPCFWGF 123 Query: 1407 NVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDPQRVAG 1228 NVASGLYNSCL MLNLRCL IVFDLDETLIVANTMRSFEDRIEAL RKI+SESDPQR++G Sbjct: 124 NVASGLYNSCLGMLNLRCLSIVFDLDETLIVANTMRSFEDRIEALQRKISSESDPQRMSG 183 Query: 1227 MMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRLQDKNI 1048 M+AE KRYQDDKSILKQY ENDQV+DNG+VIK QSEVVPALS+ H PIVRPLIRL +KNI Sbjct: 184 MLAEVKRYQDDKSILKQYVENDQVVDNGRVIKVQSEVVPALSDNHQPIVRPLIRLHEKNI 243 Query: 1047 ILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPE 868 ILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYALEMWRLLDP Sbjct: 244 ILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPH 303 Query: 867 SNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVP 688 SNLI+S+ LLERIVCVK+G RKSLFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVHVVP Sbjct: 304 
SNLINSKALLERIVCVKSGLRKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVP 363 Query: 687 AFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIKEVPSP 508 AFAPYYAPQAEANN VPVLCVARNVACNVRGGFFKEFDDGLLQ++ EV+YEDDIK +PSP Sbjct: 364 AFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDGLLQKIPEVSYEDDIKHIPSP 423 Query: 507 PDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSIDPRIA 328 PDVSNYL SEDD S S+GN+D FDGMADAEVERR LKEA SA+S+A + DPR+ Sbjct: 424 PDVSNYLASEDDGSASNGNRDLPAFDGMADAEVERR-LKEAISAASSA----INPDPRL- 477 Query: 327 SALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNSPAREE 148 S LQ P T Q FPN + G E++LQ+SPAREE Sbjct: 478 SPLQYTVPSSSGSVPPPTTQVSMMPFPNIQFPQVASVVKPYI---GSVESSLQSSPAREE 534 Query: 147 GEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRVQ 1 GEVPESELDPDTRRRLLILQHGQDTR+H P EP FPARPPMQV +P+VQ Sbjct: 535 GEVPESELDPDTRRRLLILQHGQDTREHTPTEPPFPARPPMQVPLPQVQ 583 >ref|XP_012091568.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 [Jatropha curcas] gi|802784113|ref|XP_012091569.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 [Jatropha curcas] Length = 976 Score = 841 bits (2173), Expect = 0.0 Identities = 434/598 (72%), Positives = 490/598 (81%), Gaps = 12/598 (2%) Frame = -1 Query: 1758 VYEGERALGEVAVYPTQSQ------------DGVVWGERLKEIRISHYSQPSERCPPLAV 1615 VY+GE LGEV +YP Q Q D ++ G KEIRISH+SQPSERCPPLAV Sbjct: 12 VYKGEELLGEVEIYPQQHQQQEEENNKKKLIDEILMG---KEIRISHFSQPSERCPPLAV 68 Query: 1614 LHTITSTGIAFKLESSQSPQHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYE 1435 LHTIT G+ FK+ES S D+PL +LH++C+++NKTAV+ LGGEELHLVA++SR E Sbjct: 69 LHTITC-GMCFKMESKNSLS-LDTPLHLLHSSCIQENKTAVVPLGGEELHLVAIYSRNNE 126 Query: 1434 GQVSCFWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINS 1255 Q CFWGFNV++GLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL RKIN+ Sbjct: 127 RQYPCFWGFNVSAGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINT 186 Query: 1254 ESDPQRVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRP 1075 E DPQR+AGM++E KRYQDDK+ILKQY ENDQVI+NG+VIK+Q EVVPALS+ H IVRP Sbjct: 187 
EVDPQRIAGMLSEVKRYQDDKTILKQYVENDQVIENGRVIKTQFEVVPALSDNHQTIVRP 246 Query: 1074 LIRLQDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYAL 895 LIRLQ++NIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYAL Sbjct: 247 LIRLQERNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYAL 306 Query: 894 EMWRLLDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEK 715 EMWRLLDPESNLI S+ELL+RIVCVK+G RKSLFNVFQDG CHPKMALVIDDRLKVWDEK Sbjct: 307 EMWRLLDPESNLISSKELLDRIVCVKSGLRKSLFNVFQDGVCHPKMALVIDDRLKVWDEK 366 Query: 714 DQPRVHVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYE 535 DQPRVHVVPAFAPYYAPQAEANN VPVLCVARNVACNVRGGFFKEFD+GLLQR+ +++YE Sbjct: 367 DQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDEGLLQRIPDISYE 426 Query: 534 DDIKEVPSPPDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLT 355 DD ++PSPPDVS+YL SEDD S S+G++D + FDGMADAEVE+R LKEA SA+S P T Sbjct: 427 DDFNDIPSPPDVSSYLISEDDASTSNGHRDPLSFDGMADAEVEKR-LKEAISAASLFPAT 485 Query: 354 MTSIDPRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETT 175 + ++DPR+ ALQ +T Q F N LAQ GP E + Sbjct: 486 VNNLDPRVIPALQYSLASSSSSIPVSTSQPLVMPFSN-IQFPQAASLVKPLAQVGPPEPS 544 Query: 174 LQNSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRVQ 1 LQ+SPAREEGEVPESELDPDTRRRLLILQHGQDTRD+ E Q P RP MQVSVPRVQ Sbjct: 545 LQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDNVSSESQIPVRPSMQVSVPRVQ 602 >gb|KDP20941.1| hypothetical protein JCGZ_21412 [Jatropha curcas] Length = 970 Score = 841 bits (2173), Expect = 0.0 Identities = 434/598 (72%), Positives = 490/598 (81%), Gaps = 12/598 (2%) Frame = -1 Query: 1758 VYEGERALGEVAVYPTQSQ------------DGVVWGERLKEIRISHYSQPSERCPPLAV 1615 VY+GE LGEV +YP Q Q D ++ G KEIRISH+SQPSERCPPLAV Sbjct: 6 VYKGEELLGEVEIYPQQHQQQEEENNKKKLIDEILMG---KEIRISHFSQPSERCPPLAV 62 Query: 1614 LHTITSTGIAFKLESSQSPQHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYE 1435 LHTIT G+ FK+ES S D+PL +LH++C+++NKTAV+ LGGEELHLVA++SR E Sbjct: 63 LHTITC-GMCFKMESKNSLS-LDTPLHLLHSSCIQENKTAVVPLGGEELHLVAIYSRNNE 120 Query: 1434 GQVSCFWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINS 1255 Q 
CFWGFNV++GLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL RKIN+ Sbjct: 121 RQYPCFWGFNVSAGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINT 180 Query: 1254 ESDPQRVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRP 1075 E DPQR+AGM++E KRYQDDK+ILKQY ENDQVI+NG+VIK+Q EVVPALS+ H IVRP Sbjct: 181 EVDPQRIAGMLSEVKRYQDDKTILKQYVENDQVIENGRVIKTQFEVVPALSDNHQTIVRP 240 Query: 1074 LIRLQDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYAL 895 LIRLQ++NIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYAL Sbjct: 241 LIRLQERNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYAL 300 Query: 894 EMWRLLDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEK 715 EMWRLLDPESNLI S+ELL+RIVCVK+G RKSLFNVFQDG CHPKMALVIDDRLKVWDEK Sbjct: 301 EMWRLLDPESNLISSKELLDRIVCVKSGLRKSLFNVFQDGVCHPKMALVIDDRLKVWDEK 360 Query: 714 DQPRVHVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYE 535 DQPRVHVVPAFAPYYAPQAEANN VPVLCVARNVACNVRGGFFKEFD+GLLQR+ +++YE Sbjct: 361 DQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDEGLLQRIPDISYE 420 Query: 534 DDIKEVPSPPDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLT 355 DD ++PSPPDVS+YL SEDD S S+G++D + FDGMADAEVE+R LKEA SA+S P T Sbjct: 421 DDFNDIPSPPDVSSYLISEDDASTSNGHRDPLSFDGMADAEVEKR-LKEAISAASLFPAT 479 Query: 354 MTSIDPRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQRGPAETT 175 + ++DPR+ ALQ +T Q F N LAQ GP E + Sbjct: 480 VNNLDPRVIPALQYSLASSSSSIPVSTSQPLVMPFSN-IQFPQAASLVKPLAQVGPPEPS 538 Query: 174 LQNSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRVQ 1 LQ+SPAREEGEVPESELDPDTRRRLLILQHGQDTRD+ E Q P RP MQVSVPRVQ Sbjct: 539 LQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDNVSSESQIPVRPSMQVSVPRVQ 596 >ref|XP_006583810.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like isoform X2 [Glycine max] Length = 929 Score = 835 bits (2156), Expect = 0.0 Identities = 431/590 (73%), Positives = 487/590 (82%), Gaps = 4/590 (0%) Frame = -1 Query: 1761 VVYEGERALGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTITSTGIAF 1582 VVY+GE +GEV VYP ++ + + +KEIRISH+SQPSERCPPLAVLHT+TS G+ F Sbjct: 7 
VVYQGEVVVGEVDVYPEENNNYKNF--HVKEIRISHFSQPSERCPPLAVLHTVTSCGVCF 64 Query: 1581 KLESSQSPQHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYEGQVSCFWGFNV 1402 K+ES Q QD L LH+ C+R+NKTAVM LGGEE+HLVAMHSR + CFWGF V Sbjct: 65 KMESKT--QQQDG-LFQLHSLCIRENKTAVMPLGGEEIHLVAMHSRNVDRP--CFWGFIV 119 Query: 1401 ASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDPQRVAGMM 1222 A GLY+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRI+AL RKINSE DPQR++GM Sbjct: 120 ALGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISGMQ 179 Query: 1221 AEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRLQDKNIIL 1042 AE KRYQDDK+ILKQYAENDQV+DNG+VIK QSE+VPALS+ H PIVRPLIRLQDKNIIL Sbjct: 180 AEVKRYQDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRLQDKNIIL 239 Query: 1041 TRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPESN 862 TRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYALEMWRLLDP+SN Sbjct: 240 TRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSN 299 Query: 861 LIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAF 682 LI+S+ELL RIVCVK+G +KSLFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVHVVPAF Sbjct: 300 LINSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAF 359 Query: 681 APYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIKEVPSPPD 502 APYYAPQAEA+NT+PVLCVARNVACNVRGGFFK+FDDGLLQ++ ++AYEDDIK++PSPPD Sbjct: 360 APYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKDIPSPPD 419 Query: 501 VSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSIDPRIASA 322 VSNYL SEDD S+S+G++D FDGMADAEVER+ LK+A SA+ST P+T ++DPR+ S Sbjct: 420 VSNYLVSEDDGSISNGHRDPFLFDGMADAEVERK-LKDALSAASTIPVTTANLDPRLTS- 477 Query: 321 LQXXXXXXXXXXXPTT----MQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNSPAR 154 LQ PT M P FP + Q P+E +L +SPAR Sbjct: 478 LQYTMVPSGSVPPPTAQASMMPFPHVQFPQ------PATLVKPMGQAAPSEPSLHSSPAR 531 Query: 153 EEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRV 4 EEGEVPESELDPDTRRRLLILQHGQDTRDHA EP FP R P+Q S P V Sbjct: 532 EEGEVPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQTSAPHV 581 >ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 
1-like isoform X1 [Glycine max] gi|734332252|gb|KHN07275.1| RNA polymerase II C-terminal domain phosphatase-like 1 [Glycine soja] Length = 956 Score = 835 bits (2156), Expect = 0.0 Identities = 431/590 (73%), Positives = 487/590 (82%), Gaps = 4/590 (0%) Frame = -1 Query: 1761 VVYEGERALGEVAVYPTQSQDGVVWGERLKEIRISHYSQPSERCPPLAVLHTITSTGIAF 1582 VVY+GE +GEV VYP ++ + + +KEIRISH+SQPSERCPPLAVLHT+TS G+ F Sbjct: 7 VVYQGEVVVGEVDVYPEENNNYKNF--HVKEIRISHFSQPSERCPPLAVLHTVTSCGVCF 64 Query: 1581 KLESSQSPQHQDSPLTVLHATCLRDNKTAVMSLGGEELHLVAMHSRKYEGQVSCFWGFNV 1402 K+ES Q QD L LH+ C+R+NKTAVM LGGEE+HLVAMHSR + CFWGF V Sbjct: 65 KMESKT--QQQDG-LFQLHSLCIRENKTAVMPLGGEEIHLVAMHSRNVDRP--CFWGFIV 119 Query: 1401 ASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKINSESDPQRVAGMM 1222 A GLY+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRI+AL RKINSE DPQR++GM Sbjct: 120 ALGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISGMQ 179 Query: 1221 AEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVHLPIVRPLIRLQDKNIIL 1042 AE KRYQDDK+ILKQYAENDQV+DNG+VIK QSE+VPALS+ H PIVRPLIRLQDKNIIL Sbjct: 180 AEVKRYQDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRLQDKNIIL 239 Query: 1041 TRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPESN 862 TRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYALEMWRLLDP+SN Sbjct: 240 TRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSN 299 Query: 861 LIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRLKVWDEKDQPRVHVVPAF 682 LI+S+ELL RIVCVK+G +KSLFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVHVVPAF Sbjct: 300 LINSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAF 359 Query: 681 APYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRMSEVAYEDDIKEVPSPPD 502 APYYAPQAEA+NT+PVLCVARNVACNVRGGFFK+FDDGLLQ++ ++AYEDDIK++PSPPD Sbjct: 360 APYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKDIPSPPD 419 Query: 501 VSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSASSTAPLTMTSIDPRIASA 322 VSNYL SEDD S+S+G++D FDGMADAEVER+ LK+A SA+ST P+T ++DPR+ S Sbjct: 420 VSNYLVSEDDGSISNGHRDPFLFDGMADAEVERK-LKDALSAASTIPVTTANLDPRLTS- 477 Query: 321 
LQXXXXXXXXXXXPTT----MQGPATSFPNXXXXXXXXXXXXXLAQRGPAETTLQNSPAR 154 LQ PT M P FP + Q P+E +L +SPAR Sbjct: 478 LQYTMVPSGSVPPPTAQASMMPFPHVQFPQ------PATLVKPMGQAAPSEPSLHSSPAR 531 Query: 153 EEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQFPARPPMQVSVPRV 4 EEGEVPESELDPDTRRRLLILQHGQDTRDHA EP FP R P+Q S P V Sbjct: 532 EEGEVPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQTSAPHV 581 >ref|XP_007025682.1| C-terminal domain phosphatase-like 1 isoform 3 [Theobroma cacao] gi|508781048|gb|EOY28304.1| C-terminal domain phosphatase-like 1 isoform 3 [Theobroma cacao] Length = 870 Score = 832 bits (2150), Expect = 0.0 Identities = 434/605 (71%), Positives = 484/605 (80%), Gaps = 18/605 (2%) Frame = -1 Query: 1761 VVYEGERALGEVAVYPTQSQDG-------------VVWGERLKEIRISHYSQPSERCPPL 1621 VVY GE LGEV +YP Q +V E +KEIRI + +Q SERCPPL Sbjct: 8 VVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSERCPPL 67 Query: 1620 AVLHTITSTGIAFKLESSQSPQHQDS----PLTVLHATCLRDNKTAVMSLGGEELHLVAM 1453 AVLHTITS+GI FK+ESS+ + S PL +LH+ C+RDNKTAVM +G ELHLVAM Sbjct: 68 AVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCELHLVAM 127 Query: 1452 HSRKYEGQVSCFWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL 1273 +SR + CFWGFNV+ GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL Sbjct: 128 YSRNSDRP--CFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL 185 Query: 1272 LRKINSESDPQRVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVH 1093 RK+ +E DPQRVAGM+AE KRYQDDK+ILKQYAENDQV++NGKVIK QSEVVPALS+ H Sbjct: 186 QRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVVPALSDNH 245 Query: 1092 LPIVRPLIRLQDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMA 913 PI+RPLIRLQ+KNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMA Sbjct: 246 QPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMA 305 Query: 912 ERDYALEMWRLLDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRL 733 ERDYALEMWRLLDPESNLI+S+ELL+RIVCVK+GSRKSLFNVFQDG CHPKMALVIDDRL Sbjct: 306 ERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVIDDRL 365 Query: 732 KVWDEKDQPRVHVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRM 553 
KVWDEKDQPRVHVVPAFAPYYAPQAEANNT+PVLCVARNVACNVRGGFF+EFD+GLLQR+ Sbjct: 366 KVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGLLQRI 425 Query: 552 SEVAYEDDIKEVPSPPDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSAS 373 E++YEDDIK++PSPPDV NYL SEDD S +GNKD + FDGMADAEVERR LKEA SA+ Sbjct: 426 PEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERR-LKEAISAT 484 Query: 372 STAPLTMTSIDPRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQR 193 ST ++DPR+ +LQ P+ Q SF N Sbjct: 485 STVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPVVKPVAPVA 544 Query: 192 GPAETTLQNSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQF-PARPPMQVS 16 P E +LQ+SPAREEGEVPESELDPDTRRRLLILQHGQDTRDH P EP F P RP MQVS Sbjct: 545 VP-EPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPVRPTMQVS 603 Query: 15 VPRVQ 1 VPR Q Sbjct: 604 VPRGQ 608 >ref|XP_007025681.1| C-terminal domain phosphatase-like 1 isoform 2 [Theobroma cacao] gi|508781047|gb|EOY28303.1| C-terminal domain phosphatase-like 1 isoform 2 [Theobroma cacao] Length = 984 Score = 832 bits (2150), Expect = 0.0 Identities = 434/605 (71%), Positives = 484/605 (80%), Gaps = 18/605 (2%) Frame = -1 Query: 1761 VVYEGERALGEVAVYPTQSQDG-------------VVWGERLKEIRISHYSQPSERCPPL 1621 VVY GE LGEV +YP Q +V E +KEIRI + +Q SERCPPL Sbjct: 8 VVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSERCPPL 67 Query: 1620 AVLHTITSTGIAFKLESSQSPQHQDS----PLTVLHATCLRDNKTAVMSLGGEELHLVAM 1453 AVLHTITS+GI FK+ESS+ + S PL +LH+ C+RDNKTAVM +G ELHLVAM Sbjct: 68 AVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCELHLVAM 127 Query: 1452 HSRKYEGQVSCFWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL 1273 +SR + CFWGFNV+ GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL Sbjct: 128 YSRNSDRP--CFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL 185 Query: 1272 LRKINSESDPQRVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVH 1093 RK+ +E DPQRVAGM+AE KRYQDDK+ILKQYAENDQV++NGKVIK QSEVVPALS+ H Sbjct: 186 QRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVVPALSDNH 245 Query: 1092 LPIVRPLIRLQDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMA 913 
PI+RPLIRLQ+KNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMA Sbjct: 246 QPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMA 305 Query: 912 ERDYALEMWRLLDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRL 733 ERDYALEMWRLLDPESNLI+S+ELL+RIVCVK+GSRKSLFNVFQDG CHPKMALVIDDRL Sbjct: 306 ERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVIDDRL 365 Query: 732 KVWDEKDQPRVHVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRM 553 KVWDEKDQPRVHVVPAFAPYYAPQAEANNT+PVLCVARNVACNVRGGFF+EFD+GLLQR+ Sbjct: 366 KVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGLLQRI 425 Query: 552 SEVAYEDDIKEVPSPPDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSAS 373 E++YEDDIK++PSPPDV NYL SEDD S +GNKD + FDGMADAEVERR LKEA SA+ Sbjct: 426 PEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERR-LKEAISAT 484 Query: 372 STAPLTMTSIDPRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQR 193 ST ++DPR+ +LQ P+ Q SF N Sbjct: 485 STVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPVVKPVAPVA 544 Query: 192 GPAETTLQNSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQF-PARPPMQVS 16 P E +LQ+SPAREEGEVPESELDPDTRRRLLILQHGQDTRDH P EP F P RP MQVS Sbjct: 545 VP-EPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPVRPTMQVS 603 Query: 15 VPRVQ 1 VPR Q Sbjct: 604 VPRGQ 608 >ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform 1 [Theobroma cacao] gi|508781046|gb|EOY28302.1| C-terminal domain phosphatase-like 1 isoform 1 [Theobroma cacao] Length = 978 Score = 832 bits (2150), Expect = 0.0 Identities = 434/605 (71%), Positives = 484/605 (80%), Gaps = 18/605 (2%) Frame = -1 Query: 1761 VVYEGERALGEVAVYPTQSQDG-------------VVWGERLKEIRISHYSQPSERCPPL 1621 VVY GE LGEV +YP Q +V E +KEIRI + +Q SERCPPL Sbjct: 8 VVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSERCPPL 67 Query: 1620 AVLHTITSTGIAFKLESSQSPQHQDS----PLTVLHATCLRDNKTAVMSLGGEELHLVAM 1453 AVLHTITS+GI FK+ESS+ + S PL +LH+ C+RDNKTAVM +G ELHLVAM Sbjct: 68 AVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCELHLVAM 127 Query: 1452 HSRKYEGQVSCFWGFNVASGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL 1273 
+SR + CFWGFNV+ GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL Sbjct: 128 YSRNSDRP--CFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSFEDRIEAL 185 Query: 1272 LRKINSESDPQRVAGMMAEFKRYQDDKSILKQYAENDQVIDNGKVIKSQSEVVPALSEVH 1093 RK+ +E DPQRVAGM+AE KRYQDDK+ILKQYAENDQV++NGKVIK QSEVVPALS+ H Sbjct: 186 QRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVVPALSDNH 245 Query: 1092 LPIVRPLIRLQDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMA 913 PI+RPLIRLQ+KNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMA Sbjct: 246 QPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMA 305 Query: 912 ERDYALEMWRLLDPESNLIDSEELLERIVCVKAGSRKSLFNVFQDGNCHPKMALVIDDRL 733 ERDYALEMWRLLDPESNLI+S+ELL+RIVCVK+GSRKSLFNVFQDG CHPKMALVIDDRL Sbjct: 306 ERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVIDDRL 365 Query: 732 KVWDEKDQPRVHVVPAFAPYYAPQAEANNTVPVLCVARNVACNVRGGFFKEFDDGLLQRM 553 KVWDEKDQPRVHVVPAFAPYYAPQAEANNT+PVLCVARNVACNVRGGFF+EFD+GLLQR+ Sbjct: 366 KVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGLLQRI 425 Query: 552 SEVAYEDDIKEVPSPPDVSNYLNSEDDPSVSSGNKDSIGFDGMADAEVERRLLKEASSAS 373 E++YEDDIK++PSPPDV NYL SEDD S +GNKD + FDGMADAEVERR LKEA SA+ Sbjct: 426 PEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERR-LKEAISAT 484 Query: 372 STAPLTMTSIDPRIASALQXXXXXXXXXXXPTTMQGPATSFPNXXXXXXXXXXXXXLAQR 193 ST ++DPR+ +LQ P+ Q SF N Sbjct: 485 STVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPVVKPVAPVA 544 Query: 192 GPAETTLQNSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHAPVEPQF-PARPPMQVS 16 P E +LQ+SPAREEGEVPESELDPDTRRRLLILQHGQDTRDH P EP F P RP MQVS Sbjct: 545 VP-EPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPVRPTMQVS 603 Query: 15 VPRVQ 1 VPR Q Sbjct: 604 VPRGQ 608