BLASTX nr result
ID: Anemarrhena21_contig00001715
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Anemarrhena21_contig00001715 (2991 letters)

Database: ./nr
69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010918442.1| PREDICTED: RNA polymerase II C-terminal doma...  1034   0.0
ref|XP_010918441.1| PREDICTED: RNA polymerase II C-terminal doma...  1034   0.0
ref|XP_010918443.1| PREDICTED: RNA polymerase II C-terminal doma...  1026   0.0
ref|XP_008809393.1| PREDICTED: RNA polymerase II C-terminal doma...  1020   0.0
ref|XP_010932999.1| PREDICTED: RNA polymerase II C-terminal doma...  1019   0.0
ref|XP_008809392.1| PREDICTED: RNA polymerase II C-terminal doma...  1015   0.0
ref|XP_008775881.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymera...  1006   0.0
ref|XP_010933000.1| PREDICTED: RNA polymerase II C-terminal doma...  1001   0.0
ref|XP_009413132.1| PREDICTED: RNA polymerase II C-terminal doma...   965   0.0
ref|XP_010241993.1| PREDICTED: RNA polymerase II C-terminal doma...   936   0.0
ref|XP_008225045.1| PREDICTED: RNA polymerase II C-terminal doma...   915   0.0
ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform...   910   0.0
ref|XP_012455431.1| PREDICTED: RNA polymerase II C-terminal doma...   910   0.0
ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Popu...   909   0.0
ref|XP_007025681.1| C-terminal domain phosphatase-like 1 isoform...   905   0.0
ref|XP_011027882.1| PREDICTED: RNA polymerase II C-terminal doma...   904   0.0
ref|XP_002267987.3| PREDICTED: RNA polymerase II C-terminal doma...   902   0.0
ref|XP_012091568.1| PREDICTED: RNA polymerase II C-terminal doma...
902 0.0 gb|KDP20941.1| hypothetical protein JCGZ_21412 [Jatropha curcas] 902 0.0 ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal doma... 898 0.0 >ref|XP_010918442.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 isoform X2 [Elaeis guineensis] Length = 941 Score = 1034 bits (2674), Expect = 0.0 Identities = 560/931 (60%), Positives = 676/931 (72%), Gaps = 16/931 (1%) Frame = -1 Query: 2757 MLRSLVYKENSFIGEAEIRPQNNKSGVWTREIRISHLSQASDRCPPLAVLYTIAADGFCF 2578 M +S VY NS IGE EI PQN+ G W REIRISH S S+RCPPLAVL+TIA+ F Sbjct: 1 MFKSAVYHGNSLIGEVEISPQNSNPGAWLREIRISHFSPPSERCPPLAVLHTIASASVSF 60 Query: 2577 KMEXXXXXXXXXXXXL-HAACLREKKTAVVPLGNEELHLVAMPSRTNPLQYSCFWGFTVA 2401 KME HAACLR++KTAV+PLG EELHLVAM R N + Y+CFWGF VA Sbjct: 61 KMESKSPPSDESQLCSLHAACLRDQKTAVIPLGEEELHLVAMKPRKNLMHYACFWGFNVA 120 Query: 2400 SGLYNSCLLMLNLRCLGIVFDLDETLVVANTIRSFEDRISNLQQKISNETEHQRLATMLA 2221 SGLYNSCL MLNLRCLGIVFDLDETL+VANT+RSFEDRI LQ+KIS+ET+ QR+ MLA Sbjct: 121 SGLYNSCLGMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKISSETDPQRVTGMLA 180 Query: 2220 EVKRYQDDKLILKQFIENDQVVENGKVFKVQTEVVRPLSESHQQINRPVIRLPEKNIILT 2041 EVKRYQDDK ILKQ+ ENDQVVENG VFKVQ+EVV PLS++HQ I RP+IRL EKNIILT Sbjct: 181 EVKRYQDDKSILKQYAENDQVVENGNVFKVQSEVVPPLSDNHQLITRPIIRLQEKNIILT 240 Query: 2040 RVNPTIRDTSVLVRVRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEIWRLLDPESSL 1861 RVNP+IRDTSVLVR+RPAWE+LRSYLTARGRKRFEVYVCTMAE+DYALE+WRLLDP+SSL Sbjct: 241 RVNPSIRDTSVLVRLRPAWEELRSYLTARGRKRFEVYVCTMAEKDYALEMWRLLDPDSSL 300 Query: 1860 INSDKLLNRIVCVKSELKKSLLNVFQDGKCHPKMALVIDDRLTVWDLKDKRRVHVVPAFA 1681 IN+ +LL+RIVCVKS +KSLLNVFQDG CHPKMALVIDDRL VWD KD+ RVHVVPAFA Sbjct: 301 INAMQLLDRIVCVKSGSRKSLLNVFQDGICHPKMALVIDDRLKVWDDKDQPRVHVVPAFA 360 Query: 1680 PYYAPQAEVASSVAVLCVARNVACNVRGGFFKEYDEGLLPRIAAAYYEDEMRDLPSAPDV 1501 PYYAPQAE +V VLCVARNVACNVRGGFFK++DEGLLPRI+ +YEDEM+D PSAPDV Sbjct: 361 PYYAPQAEANGNVPVLCVARNVACNVRGGFFKDFDEGLLPRISDIFYEDEMKDFPSAPDV 420 Query: 1500 SNYLIPEEDTSTTNGNADPLCFDGMVDAEVERRLKEASCNSQAVPPMFNNFNQMQMPSVH 1321 NYLI E+D +T+NG+ D LC +GM 
DAEVERRLKEA+ N QA+ PM N F+ M S+ Sbjct: 421 GNYLISEDDNATSNGSKDLLCSEGMTDAEVERRLKEANGNVQAIYPMVNTFDPSSMSSIQ 480 Query: 1320 PVMPSSYGXXXXXXXXXXXPYSNNPFPQAIVSVNPVGQPSVSEHSFQGSPVREEGEVNES 1141 VM SS G P NN PQ I P+GQP + E S QGSP REEGEV ES Sbjct: 481 HVMASSSGVPSLAATQVMMPLPNNQCPQPIALGRPLGQPGLPEPSLQGSPAREEGEVPES 540 Query: 1140 ELDPDTRRRLLILQHGQDIRGP-PPYQLRPHLEVXXXXXXXXXXXXXXXXEVKPKQLNEV 964 ELDPDTRRRLLILQHGQDIR P P + +RP L V E+ P+QL+ Sbjct: 541 ELDPDTRRRLLILQHGQDIRDPTPQFPVRPPLHVAVSPVQSRGSWFPLEEEMNPRQLSRA 600 Query: 963 SREYHLQPETT---RHQRSHLSSFFSGEKDSNPTDRVNHRNKKLSTEVQSGSDNLRRNRS 793 +E+ L+PET + + +H S + +GE +S +DRV + N++L+ +++ G D LR N + Sbjct: 601 PKEFSLEPETVCFDKKRPNHQSYYRTGE-NSISSDRVLNENRRLAMQLRHGDDRLRPNHA 659 Query: 792 GSNS---LKDDMSRHSISTRNRDERFKAGHV-IEFSKDPVEVLQGIAAASGAKVEYRTAL 625 +N ++M IS+ +RD +F++G V ++++ P VLQ IA GAKVE+RTAL Sbjct: 660 AANCDSFSGEEMPIGRISSSHRDIQFESGQVTVQYAGTPAGVLQDIATKCGAKVEFRTAL 719 Query: 624 LNTIELQHSVEAWFVGEKIGEGTGRTRKEAHKLAVDKAIENLANNYLNDSSKPDSFRDRE 445 +T ELQ SVE WFVGEKIGEG G+TRKEA + A + ++ LAN YL++++ D+ R Sbjct: 720 CDTTELQFSVEVWFVGEKIGEGIGKTRKEAQQQAAEFSLRTLANKYLSNATS-DTLRGDM 778 Query: 444 I--SHTKKIDFLRNSNLSTFS--MSDPL---SNTTEDSRSLNHRLEGSIKTSDSVATLKE 286 + S+ K+ F+ + N + + D L ++T+E+SR L+ RLEGS K++ SVA LKE Sbjct: 779 LKPSNAKENGFISDPNSFGYPAYVRDDLLGVASTSEESRFLDLRLEGSKKSTASVAALKE 838 Query: 285 LCTSKGFSLAFKADQCHPASSDDKKEVYAQVEVAGQILGKGTGVTWNVAKSLAAEEAVAN 106 LCT +GF+L F+ S K EVYAQVEVAGQILGKG G TW AK AAEEA+ Sbjct: 839 LCTIEGFNLIFQPQPSASTDSVGKGEVYAQVEVAGQILGKGVGTTWEEAKLQAAEEALGT 898 Query: 105 LKSMLGQFTQKYINSPRLLQTAVEKSWRTDF 13 LKSMLGQFTQK SPR + A K + DF Sbjct: 899 LKSMLGQFTQKRSGSPRSVSAAPNKRLKPDF 929 >ref|XP_010918441.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 isoform X1 [Elaeis guineensis] Length = 950 Score = 1034 bits (2674), Expect = 0.0 Identities = 560/931 (60%), Positives = 676/931 (72%), Gaps = 16/931 (1%) Frame = -1 Query: 2757 MLRSLVYKENSFIGEAEIRPQNNKSGVWTREIRISHLSQASDRCPPLAVLYTIAADGFCF 2578 M +S VY NS IGE EI 
PQN+ G W REIRISH S S+RCPPLAVL+TIA+ F Sbjct: 1 MFKSAVYHGNSLIGEVEISPQNSNPGAWLREIRISHFSPPSERCPPLAVLHTIASASVSF 60 Query: 2577 KMEXXXXXXXXXXXXL-HAACLREKKTAVVPLGNEELHLVAMPSRTNPLQYSCFWGFTVA 2401 KME HAACLR++KTAV+PLG EELHLVAM R N + Y+CFWGF VA Sbjct: 61 KMESKSPPSDESQLCSLHAACLRDQKTAVIPLGEEELHLVAMKPRKNLMHYACFWGFNVA 120 Query: 2400 SGLYNSCLLMLNLRCLGIVFDLDETLVVANTIRSFEDRISNLQQKISNETEHQRLATMLA 2221 SGLYNSCL MLNLRCLGIVFDLDETL+VANT+RSFEDRI LQ+KIS+ET+ QR+ MLA Sbjct: 121 SGLYNSCLGMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKISSETDPQRVTGMLA 180 Query: 2220 EVKRYQDDKLILKQFIENDQVVENGKVFKVQTEVVRPLSESHQQINRPVIRLPEKNIILT 2041 EVKRYQDDK ILKQ+ ENDQVVENG VFKVQ+EVV PLS++HQ I RP+IRL EKNIILT Sbjct: 181 EVKRYQDDKSILKQYAENDQVVENGNVFKVQSEVVPPLSDNHQLITRPIIRLQEKNIILT 240 Query: 2040 RVNPTIRDTSVLVRVRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEIWRLLDPESSL 1861 RVNP+IRDTSVLVR+RPAWE+LRSYLTARGRKRFEVYVCTMAE+DYALE+WRLLDP+SSL Sbjct: 241 RVNPSIRDTSVLVRLRPAWEELRSYLTARGRKRFEVYVCTMAEKDYALEMWRLLDPDSSL 300 Query: 1860 INSDKLLNRIVCVKSELKKSLLNVFQDGKCHPKMALVIDDRLTVWDLKDKRRVHVVPAFA 1681 IN+ +LL+RIVCVKS +KSLLNVFQDG CHPKMALVIDDRL VWD KD+ RVHVVPAFA Sbjct: 301 INAMQLLDRIVCVKSGSRKSLLNVFQDGICHPKMALVIDDRLKVWDDKDQPRVHVVPAFA 360 Query: 1680 PYYAPQAEVASSVAVLCVARNVACNVRGGFFKEYDEGLLPRIAAAYYEDEMRDLPSAPDV 1501 PYYAPQAE +V VLCVARNVACNVRGGFFK++DEGLLPRI+ +YEDEM+D PSAPDV Sbjct: 361 PYYAPQAEANGNVPVLCVARNVACNVRGGFFKDFDEGLLPRISDIFYEDEMKDFPSAPDV 420 Query: 1500 SNYLIPEEDTSTTNGNADPLCFDGMVDAEVERRLKEASCNSQAVPPMFNNFNQMQMPSVH 1321 NYLI E+D +T+NG+ D LC +GM DAEVERRLKEA+ N QA+ PM N F+ M S+ Sbjct: 421 GNYLISEDDNATSNGSKDLLCSEGMTDAEVERRLKEANGNVQAIYPMVNTFDPSSMSSIQ 480 Query: 1320 PVMPSSYGXXXXXXXXXXXPYSNNPFPQAIVSVNPVGQPSVSEHSFQGSPVREEGEVNES 1141 VM SS G P NN PQ I P+GQP + E S QGSP REEGEV ES Sbjct: 481 HVMASSSGVPSLAATQVMMPLPNNQCPQPIALGRPLGQPGLPEPSLQGSPAREEGEVPES 540 Query: 1140 ELDPDTRRRLLILQHGQDIRGP-PPYQLRPHLEVXXXXXXXXXXXXXXXXEVKPKQLNEV 964 ELDPDTRRRLLILQHGQDIR P P + +RP L V E+ P+QL+ Sbjct: 541 ELDPDTRRRLLILQHGQDIRDPTPQFPVRPPLHVAVSPVQSRGSWFPLEEEMNPRQLSRA 600 Query: 963 
SREYHLQPETT---RHQRSHLSSFFSGEKDSNPTDRVNHRNKKLSTEVQSGSDNLRRNRS 793 +E+ L+PET + + +H S + +GE +S +DRV + N++L+ +++ G D LR N + Sbjct: 601 PKEFSLEPETVCFDKKRPNHQSYYRTGE-NSISSDRVLNENRRLAMQLRHGDDRLRPNHA 659 Query: 792 GSNS---LKDDMSRHSISTRNRDERFKAGHV-IEFSKDPVEVLQGIAAASGAKVEYRTAL 625 +N ++M IS+ +RD +F++G V ++++ P VLQ IA GAKVE+RTAL Sbjct: 660 AANCDSFSGEEMPIGRISSSHRDIQFESGQVTVQYAGTPAGVLQDIATKCGAKVEFRTAL 719 Query: 624 LNTIELQHSVEAWFVGEKIGEGTGRTRKEAHKLAVDKAIENLANNYLNDSSKPDSFRDRE 445 +T ELQ SVE WFVGEKIGEG G+TRKEA + A + ++ LAN YL++++ D+ R Sbjct: 720 CDTTELQFSVEVWFVGEKIGEGIGKTRKEAQQQAAEFSLRTLANKYLSNATS-DTLRGDM 778 Query: 444 I--SHTKKIDFLRNSNLSTFS--MSDPL---SNTTEDSRSLNHRLEGSIKTSDSVATLKE 286 + S+ K+ F+ + N + + D L ++T+E+SR L+ RLEGS K++ SVA LKE Sbjct: 779 LKPSNAKENGFISDPNSFGYPAYVRDDLLGVASTSEESRFLDLRLEGSKKSTASVAALKE 838 Query: 285 LCTSKGFSLAFKADQCHPASSDDKKEVYAQVEVAGQILGKGTGVTWNVAKSLAAEEAVAN 106 LCT +GF+L F+ S K EVYAQVEVAGQILGKG G TW AK AAEEA+ Sbjct: 839 LCTIEGFNLIFQPQPSASTDSVGKGEVYAQVEVAGQILGKGVGTTWEEAKLQAAEEALGT 898 Query: 105 LKSMLGQFTQKYINSPRLLQTAVEKSWRTDF 13 LKSMLGQFTQK SPR + A K + DF Sbjct: 899 LKSMLGQFTQKRSGSPRSVSAAPNKRLKPDF 929 >ref|XP_010918443.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 isoform X3 [Elaeis guineensis] Length = 915 Score = 1026 bits (2653), Expect = 0.0 Identities = 555/916 (60%), Positives = 669/916 (73%), Gaps = 16/916 (1%) Frame = -1 Query: 2757 MLRSLVYKENSFIGEAEIRPQNNKSGVWTREIRISHLSQASDRCPPLAVLYTIAADGFCF 2578 M +S VY NS IGE EI PQN+ G W REIRISH S S+RCPPLAVL+TIA+ F Sbjct: 1 MFKSAVYHGNSLIGEVEISPQNSNPGAWLREIRISHFSPPSERCPPLAVLHTIASASVSF 60 Query: 2577 KMEXXXXXXXXXXXXL-HAACLREKKTAVVPLGNEELHLVAMPSRTNPLQYSCFWGFTVA 2401 KME HAACLR++KTAV+PLG EELHLVAM R N + Y+CFWGF VA Sbjct: 61 KMESKSPPSDESQLCSLHAACLRDQKTAVIPLGEEELHLVAMKPRKNLMHYACFWGFNVA 120 Query: 2400 SGLYNSCLLMLNLRCLGIVFDLDETLVVANTIRSFEDRISNLQQKISNETEHQRLATMLA 2221 SGLYNSCL MLNLRCLGIVFDLDETL+VANT+RSFEDRI LQ+KIS+ET+ QR+ MLA Sbjct: 121 SGLYNSCLGMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKISSETDPQRVTGMLA 
180 Query: 2220 EVKRYQDDKLILKQFIENDQVVENGKVFKVQTEVVRPLSESHQQINRPVIRLPEKNIILT 2041 EVKRYQDDK ILKQ+ ENDQVVENG VFKVQ+EVV PLS++HQ I RP+IRL EKNIILT Sbjct: 181 EVKRYQDDKSILKQYAENDQVVENGNVFKVQSEVVPPLSDNHQLITRPIIRLQEKNIILT 240 Query: 2040 RVNPTIRDTSVLVRVRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEIWRLLDPESSL 1861 RVNP+IRDTSVLVR+RPAWE+LRSYLTARGRKRFEVYVCTMAE+DYALE+WRLLDP+SSL Sbjct: 241 RVNPSIRDTSVLVRLRPAWEELRSYLTARGRKRFEVYVCTMAEKDYALEMWRLLDPDSSL 300 Query: 1860 INSDKLLNRIVCVKSELKKSLLNVFQDGKCHPKMALVIDDRLTVWDLKDKRRVHVVPAFA 1681 IN+ +LL+RIVCVKS +KSLLNVFQDG CHPKMALVIDDRL VWD KD+ RVHVVPAFA Sbjct: 301 INAMQLLDRIVCVKSGSRKSLLNVFQDGICHPKMALVIDDRLKVWDDKDQPRVHVVPAFA 360 Query: 1680 PYYAPQAEVASSVAVLCVARNVACNVRGGFFKEYDEGLLPRIAAAYYEDEMRDLPSAPDV 1501 PYYAPQAE +V VLCVARNVACNVRGGFFK++DEGLLPRI+ +YEDEM+D PSAPDV Sbjct: 361 PYYAPQAEANGNVPVLCVARNVACNVRGGFFKDFDEGLLPRISDIFYEDEMKDFPSAPDV 420 Query: 1500 SNYLIPEEDTSTTNGNADPLCFDGMVDAEVERRLKEASCNSQAVPPMFNNFNQMQMPSVH 1321 NYLI E+D +T+NG+ D LC +GM DAEVERRLKEA+ N QA+ PM N F+ M S+ Sbjct: 421 GNYLISEDDNATSNGSKDLLCSEGMTDAEVERRLKEANGNVQAIYPMVNTFDPSSMSSIQ 480 Query: 1320 PVMPSSYGXXXXXXXXXXXPYSNNPFPQAIVSVNPVGQPSVSEHSFQGSPVREEGEVNES 1141 VM SS G P NN PQ I P+GQP + E S QGSP REEGEV ES Sbjct: 481 HVMASSSGVPSLAATQVMMPLPNNQCPQPIALGRPLGQPGLPEPSLQGSPAREEGEVPES 540 Query: 1140 ELDPDTRRRLLILQHGQDIRGP-PPYQLRPHLEVXXXXXXXXXXXXXXXXEVKPKQLNEV 964 ELDPDTRRRLLILQHGQDIR P P + +RP L V E+ P+QL+ Sbjct: 541 ELDPDTRRRLLILQHGQDIRDPTPQFPVRPPLHVAVSPVQSRGSWFPLEEEMNPRQLSRA 600 Query: 963 SREYHLQPETT---RHQRSHLSSFFSGEKDSNPTDRVNHRNKKLSTEVQSGSDNLRRNRS 793 +E+ L+PET + + +H S + +GE +S +DRV + N++L+ +++ G D LR N + Sbjct: 601 PKEFSLEPETVCFDKKRPNHQSYYRTGE-NSISSDRVLNENRRLAMQLRHGDDRLRPNHA 659 Query: 792 GSNS---LKDDMSRHSISTRNRDERFKAGHV-IEFSKDPVEVLQGIAAASGAKVEYRTAL 625 +N ++M IS+ +RD +F++G V ++++ P VLQ IA GAKVE+RTAL Sbjct: 660 AANCDSFSGEEMPIGRISSSHRDIQFESGQVTVQYAGTPAGVLQDIATKCGAKVEFRTAL 719 Query: 624 LNTIELQHSVEAWFVGEKIGEGTGRTRKEAHKLAVDKAIENLANNYLNDSSKPDSFRDRE 445 +T ELQ SVE WFVGEKIGEG G+TRKEA + A + ++ LAN 
YL++++ D+ R Sbjct: 720 CDTTELQFSVEVWFVGEKIGEGIGKTRKEAQQQAAEFSLRTLANKYLSNATS-DTLRGDM 778 Query: 444 I--SHTKKIDFLRNSNLSTFS--MSDPL---SNTTEDSRSLNHRLEGSIKTSDSVATLKE 286 + S+ K+ F+ + N + + D L ++T+E+SR L+ RLEGS K++ SVA LKE Sbjct: 779 LKPSNAKENGFISDPNSFGYPAYVRDDLLGVASTSEESRFLDLRLEGSKKSTASVAALKE 838 Query: 285 LCTSKGFSLAFKADQCHPASSDDKKEVYAQVEVAGQILGKGTGVTWNVAKSLAAEEAVAN 106 LCT +GF+L F+ S K EVYAQVEVAGQILGKG G TW AK AAEEA+ Sbjct: 839 LCTIEGFNLIFQPQPSASTDSVGKGEVYAQVEVAGQILGKGVGTTWEEAKLQAAEEALGT 898 Query: 105 LKSMLGQFTQKYINSP 58 LKSMLGQFTQK SP Sbjct: 899 LKSMLGQFTQKRSGSP 914 >ref|XP_008809393.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 isoform X2 [Phoenix dactylifera] Length = 950 Score = 1020 bits (2637), Expect = 0.0 Identities = 553/931 (59%), Positives = 668/931 (71%), Gaps = 16/931 (1%) Frame = -1 Query: 2757 MLRSLVYKENSFIGEAEIRPQNNKSGVWTREIRISHLSQASDRCPPLAVLYTIAADGFCF 2578 M S VY NS IGEAEI PQN+ G W REIRISH S S+RCPPLAVL+TIA+ G F Sbjct: 1 MFESAVYHGNSLIGEAEISPQNSNPGAWLREIRISHFSLPSERCPPLAVLHTIASAGVSF 60 Query: 2577 KMEXXXXXXXXXXXXL-HAACLREKKTAVVPLGNEELHLVAMPSRTNPLQYSCFWGFTVA 2401 KME HAACL+E+KTAV+PLG EELHLVAM SR N + Y+CFWGF VA Sbjct: 61 KMESKSPPSDESQLCSLHAACLKEQKTAVIPLGEEELHLVAMKSRKNLVHYACFWGFNVA 120 Query: 2400 SGLYNSCLLMLNLRCLGIVFDLDETLVVANTIRSFEDRISNLQQKISNETEHQRLATMLA 2221 SGLYNSCL MLNLRCLGIVFDLDETL+VANT+RSFEDRI LQ+KIS+ET+ QR+ MLA Sbjct: 121 SGLYNSCLGMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKISSETDPQRVTGMLA 180 Query: 2220 EVKRYQDDKLILKQFIENDQVVENGKVFKVQTEVVRPLSESHQQINRPVIRLPEKNIILT 2041 EVKRYQDDK ILKQ+ ENDQVVENG VFKVQ+E+V PLS++H I RP+IRL EKNIILT Sbjct: 181 EVKRYQDDKSILKQYAENDQVVENGNVFKVQSEIVPPLSDNHPLITRPIIRLHEKNIILT 240 Query: 2040 RVNPTIRDTSVLVRVRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEIWRLLDPESSL 1861 RVNP+IRDTSVLVR+RPAWE+LRSYLTARGRKRFEVYVCTMAE+DYALE+WRLLDP+S L Sbjct: 241 RVNPSIRDTSVLVRLRPAWEELRSYLTARGRKRFEVYVCTMAEKDYALEMWRLLDPDSRL 300 Query: 1860 INSDKLLNRIVCVKSELKKSLLNVFQDGKCHPKMALVIDDRLTVWDLKDKRRVHVVPAFA 1681 INS +LL+RIVCVKS +KSLLNVFQDG CHPKMALVIDDRL 
VW KD+ RVHVVPAFA Sbjct: 301 INSMRLLDRIVCVKSGSRKSLLNVFQDGICHPKMALVIDDRLKVWYEKDQPRVHVVPAFA 360 Query: 1680 PYYAPQAEVASSVAVLCVARNVACNVRGGFFKEYDEGLLPRIAAAYYEDEMRDLPSAPDV 1501 PYYAPQAE +V VLCVARNVACNVRGGFFK++DEG+LPRI+ +YEDEM+D PSAPDV Sbjct: 361 PYYAPQAEANGNVPVLCVARNVACNVRGGFFKDFDEGVLPRISDIFYEDEMKDFPSAPDV 420 Query: 1500 SNYLIPEEDTSTTNGNADPLCFDGMVDAEVERRLKEASCNSQAVPPMFNNFNQMQMPSVH 1321 NYLI E+D +T+NGN D LC +GM DAEVERRLKEA+ N Q V PM N + M + Sbjct: 421 GNYLISEDDNATSNGNKDQLCSEGMTDAEVERRLKEANGNVQVVHPMVNTLDLRSMSPIQ 480 Query: 1320 PVMPSSYGXXXXXXXXXXXPYSNNPFPQAIVSVNPVGQPSVSEHSFQGSPVREEGEVNES 1141 PVM SS P NN PQ I P+GQP + E S QGSP REEGEV ES Sbjct: 481 PVMASSSCVPPLTATQVMMPLPNNQCPQPIALGRPLGQPGLPEPSLQGSPAREEGEVPES 540 Query: 1140 ELDPDTRRRLLILQHGQDIRGP-PPYQLRPHLEVXXXXXXXXXXXXXXXXEVKPKQLNEV 964 ELDPDTRRRLLILQHGQDIR P P + +R L V E+ P+Q + Sbjct: 541 ELDPDTRRRLLILQHGQDIRDPTPQFPVRTPLHVAVSPVQSRGSWFPLEEEMNPRQPSRA 600 Query: 963 SREYHLQPETT---RHQRSHLSSFFSGEKDSNPTDRVNHRNKKLSTEVQSGSDNLRRNRS 793 +E+ L+PET + + +H S + SGE +S +DRV + N++L+ ++ G D LR N + Sbjct: 601 PKEFPLEPETVCLDKKRPNHQSYYRSGE-NSISSDRVLNENRRLAMQLHHGDDRLRPNHA 659 Query: 792 GSNSLK---DDMSRHSISTRNRDERFKAGH-VIEFSKDPVEVLQGIAAASGAKVEYRTAL 625 +N ++M IS+ ++D +F++G ++++ P VLQ IA GAKVE+RTAL Sbjct: 660 AANYDSFPGEEMPTGRISSSHKDIQFESGRATAQYARTPAGVLQDIATKCGAKVEFRTAL 719 Query: 624 LNTIELQHSVEAWFVGEKIGEGTGRTRKEAHKLAVDKAIENLANNYLNDSSKPDSFRDRE 445 +T ELQ S+E WFVGEKIGEG G+TRKEA + A D ++ LAN YL++++ D+ R Sbjct: 720 CDTTELQFSMEVWFVGEKIGEGIGKTRKEAQQQATDFSLRTLANKYLSNATS-DTLRGDM 778 Query: 444 I--SHTKKIDFLRNSNLS---TFSMSDPL--SNTTEDSRSLNHRLEGSIKTSDSVATLKE 286 + S+ K+ F+ ++N S ++ D L ++T+E+SR ++ RLEGS K++ S+A LKE Sbjct: 779 LKPSNAKENGFISDANSSGYPAYARDDLLAVASTSEESRFMDLRLEGSKKSTTSIAALKE 838 Query: 285 LCTSKGFSLAFKADQCHPASSDDKKEVYAQVEVAGQILGKGTGVTWNVAKSLAAEEAVAN 106 LCT +GFSL F+A S K EV QVEVAGQILGKG G TW AK AAEEA+ Sbjct: 839 LCTIEGFSLNFQAQPSPSTDSVSKGEVCTQVEVAGQILGKGVGTTWEEAKLQAAEEALGT 898 Query: 105 LKSMLGQFTQKYINSPRLLQTAVEKSWRTDF 13 LKSMLGQFTQK SPR + K + DF 
Sbjct: 899 LKSMLGQFTQKRSGSPRSVSATPNKRLKPDF 929 >ref|XP_010932999.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 isoform X1 [Elaeis guineensis] Length = 954 Score = 1019 bits (2635), Expect = 0.0 Identities = 554/934 (59%), Positives = 663/934 (70%), Gaps = 19/934 (2%) Frame = -1 Query: 2757 MLRSLVYKENSFIGEAEIRPQNNKSGVWTREIRISHLSQASDRCPPLAVLYTIAADGFCF 2578 M +S VY NS IGEAEI PQN+ G W REIRISH S +S+RCPPLAVL+TIA+ G F Sbjct: 1 MFKSAVYHGNSLIGEAEIFPQNSNPGAWVREIRISHFSPSSERCPPLAVLHTIASGGVSF 60 Query: 2577 KMEXXXXXXXXXXXXL-HAACLREKKTAVVPLGNEELHLVAMPSRTNPLQYSCFWGFTVA 2401 KME HAACLRE KTAV+PLG EELHLVAM SR N +QY+CFWGF VA Sbjct: 61 KMESKSAPSDESPLCSLHAACLRENKTAVIPLGEEELHLVAMNSRKNLMQYACFWGFNVA 120 Query: 2400 SGLYNSCLLMLNLRCLGIVFDLDETLVVANTIRSFEDRISNLQQKISNETEHQRLATMLA 2221 SGLYNSCL MLNLRCLGIVFDLDETL+VANT+RSFEDRI LQ+KIS ET+ QR+ MLA Sbjct: 121 SGLYNSCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIDALQRKISTETDPQRVTGMLA 180 Query: 2220 EVKRYQDDKLILKQFIENDQVVENGKVFKVQTEVVRPLSESHQQINRPVIRLPEKNIILT 2041 E+KRYQDDK ILKQ+ E DQVVENGKV++VQ+EVV PLS+SH I RPV+RL EKNIILT Sbjct: 181 ELKRYQDDKSILKQYAEIDQVVENGKVYQVQSEVVPPLSDSHHLITRPVLRLQEKNIILT 240 Query: 2040 RVNPTIRDTSVLVRVRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEIWRLLDPESSL 1861 RVNP+IRDTSVLVR+RPAWE+LRSYLTARGRKRFEVYVCTMAERDYALE+WRLLDP+SSL Sbjct: 241 RVNPSIRDTSVLVRLRPAWEELRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSSL 300 Query: 1860 INSDKLLNRIVCVKSELKKSLLNVFQDGKCHPKMALVIDDRLTVWDLKDKRRVHVVPAFA 1681 I+S +L++RIVCVKS +KSLL+VFQDG CHPKMALVIDDRL VWD KD+ RVHVVPAFA Sbjct: 301 ISSTRLIDRIVCVKSGSRKSLLSVFQDGICHPKMALVIDDRLKVWDEKDQPRVHVVPAFA 360 Query: 1680 PYYAPQAEVASSVAVLCVARNVACNVRGGFFKEYDEGLLPRIAAAYYEDEMRDLPSAPDV 1501 PYYAPQAE +V VLCVARNVACNVRGGFFKE+DEGLLPRI+ +YEDE +D PSAPDV Sbjct: 361 PYYAPQAEANGNVPVLCVARNVACNVRGGFFKEFDEGLLPRISDIFYEDEWKDFPSAPDV 420 Query: 1500 SNYLIPEEDTSTTNGNADPLCFDGMVDAEVERRLKEASCNSQAVPPMFNNFNQMQMPSVH 1321 NYLI E+D +T+ GN D LCF GM DAEVERRLKEA+CN QAV PM NN + S+ Sbjct: 421 GNYLISEDDNATSIGNKDQLCFKGMTDAEVERRLKEANCNVQAVHPMVNNLDLRSASSIQ 480 Query: 
1320 PVMPSSYGXXXXXXXXXXXPYSNNPFPQAIVSVNP-VGQPSVSEHSFQGSPVREEGEVNE 1144 VM SS P NN Q I P V QP + E S QGSP REEGEV E Sbjct: 481 HVMASSSAVPPLTATQAMMPLPNNQCSQPIALGRPLVCQPGLPEPSLQGSPAREEGEVPE 540 Query: 1143 SELDPDTRRRLLILQHGQDIRGP-PPYQLRPHLEVXXXXXXXXXXXXXXXXEVKPKQLNE 967 SELDPDTRRRLLILQHGQD R P PP+ +R L E+ PKQLN Sbjct: 541 SELDPDTRRRLLILQHGQDTRDPTPPFTVRSPLHEAVPPVQSQGNWFPMEEEMNPKQLNR 600 Query: 966 VSREYHLQPET--TRHQRSHLSSFFSGEKDSNPTDRVNHRNKKLSTEVQSGSDNLRRNRS 793 +E+ ++PET +R H S+F ++S ++RV H N++L ++ G D LR N + Sbjct: 601 APKEFTVEPETVHVNKKRPHHQSYFRSGENSISSERVLHENQRLPMQLHPGDDRLRPNHA 660 Query: 792 GSN---SLKDDMSRHSISTRNRDERFKAGHVI-EFSKDPVEVLQGIAAASGAKVEYRTAL 625 +N ++M IS+ +R +F+ G I + ++ P VLQ IA GAKVE+RTAL Sbjct: 661 AANYNCFPGEEMPAGLISSSHRGLQFEPGWAIAQCAETPAGVLQNIAMKCGAKVEFRTAL 720 Query: 624 LNTIELQHSVEAWFVGEKIGEGTGRTRKEAHKLAVDKAIENLANNYLNDSSKPDSFRDRE 445 +T EL+ +E WFVGEK+GEG G+TRKEAH+ A + ++ LA+ YL+++ + + Sbjct: 721 CDTTELKFCMEVWFVGEKVGEGIGKTRKEAHQQAAEISLRTLADKYLSNARSDSNTLHGD 780 Query: 444 I---SHTKKIDFLRNSNLSTFSMSD-------PLSNTTEDSRSLNHRLEGSIKTSDSVAT 295 + SH K+ F+ S+L++F P+++T+E+SR ++ RLEGS KT+ SVA Sbjct: 781 MHKPSHIKENGFI--SDLNSFGYPACARDDVLPVASTSEESRFMDQRLEGSNKTATSVAV 838 Query: 294 LKELCTSKGFSLAFKADQCHPASSDDKKEVYAQVEVAGQILGKGTGVTWNVAKSLAAEEA 115 LKELCT +GF+L F+A ASS K EVYAQVEVAGQI+G G G TW AK AAEEA Sbjct: 839 LKELCTIEGFTLGFQAPTSPSASSVSKGEVYAQVEVAGQIVGIGVGTTWEEAKLKAAEEA 898 Query: 114 VANLKSMLGQFTQKYINSPRLLQTAVEKSWRTDF 13 + LKSMLGQFT K SPR K + DF Sbjct: 899 LGTLKSMLGQFTHKRSGSPRSPSATPNKRLKPDF 932 >ref|XP_008809392.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 isoform X1 [Phoenix dactylifera] Length = 962 Score = 1015 bits (2625), Expect = 0.0 Identities = 553/943 (58%), Positives = 668/943 (70%), Gaps = 28/943 (2%) Frame = -1 Query: 2757 MLRSLVYKENSFIGEAEIRPQNNKSGVWTREIRISHLSQASDRCPPLAVLYTIAADGFCF 2578 M S VY NS IGEAEI PQN+ G W REIRISH S S+RCPPLAVL+TIA+ G F Sbjct: 1 MFESAVYHGNSLIGEAEISPQNSNPGAWLREIRISHFSLPSERCPPLAVLHTIASAGVSF 60 Query: 2577 
KMEXXXXXXXXXXXXL-HAACLREKKTAVVPLGNEELHLVAMPSRTNPLQYSCFWGFTVA 2401 KME HAACL+E+KTAV+PLG EELHLVAM SR N + Y+CFWGF VA Sbjct: 61 KMESKSPPSDESQLCSLHAACLKEQKTAVIPLGEEELHLVAMKSRKNLVHYACFWGFNVA 120 Query: 2400 SGLYNSCLLMLNLRCLGIVFDLDETLVVANTIRSFEDRISNLQQKISNETEHQRLATMLA 2221 SGLYNSCL MLNLRCLGIVFDLDETL+VANT+RSFEDRI LQ+KIS+ET+ QR+ MLA Sbjct: 121 SGLYNSCLGMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKISSETDPQRVTGMLA 180 Query: 2220 EVKRYQDDKLILKQFIENDQVVENGKVFKVQTEVVRPLSESHQQINRPVIRLPEKNIILT 2041 EVKRYQDDK ILKQ+ ENDQVVENG VFKVQ+E+V PLS++H I RP+IRL EKNIILT Sbjct: 181 EVKRYQDDKSILKQYAENDQVVENGNVFKVQSEIVPPLSDNHPLITRPIIRLHEKNIILT 240 Query: 2040 RVNPTIRDTSVLVRVRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEIWRLLDPESSL 1861 RVNP+IRDTSVLVR+RPAWE+LRSYLTARGRKRFEVYVCTMAE+DYALE+WRLLDP+S L Sbjct: 241 RVNPSIRDTSVLVRLRPAWEELRSYLTARGRKRFEVYVCTMAEKDYALEMWRLLDPDSRL 300 Query: 1860 INSDKLLNRIVCVKSELKKSLLNVFQDGKCHPKMALVIDDRLTVWDLKDKRRVHVVPAFA 1681 INS +LL+RIVCVKS +KSLLNVFQDG CHPKMALVIDDRL VW KD+ RVHVVPAFA Sbjct: 301 INSMRLLDRIVCVKSGSRKSLLNVFQDGICHPKMALVIDDRLKVWYEKDQPRVHVVPAFA 360 Query: 1680 PYYAPQAEVASSVAVLCVARNVACNVRGGFFKEYDEGLLPRIAAAYYEDEMRDLPSAPDV 1501 PYYAPQAE +V VLCVARNVACNVRGGFFK++DEG+LPRI+ +YEDEM+D PSAPDV Sbjct: 361 PYYAPQAEANGNVPVLCVARNVACNVRGGFFKDFDEGVLPRISDIFYEDEMKDFPSAPDV 420 Query: 1500 SNYLIPEEDTSTTNGNADPLCFDGMVDAEVERRLKEASCNSQAVPPMFNNFNQMQMPSVH 1321 NYLI E+D +T+NGN D LC +GM DAEVERRLKEA+ N Q V PM N + M + Sbjct: 421 GNYLISEDDNATSNGNKDQLCSEGMTDAEVERRLKEANGNVQVVHPMVNTLDLRSMSPIQ 480 Query: 1320 PVMPSSYGXXXXXXXXXXXPYSNNPFPQAIVSVNPVGQPSVSEHSFQGSPVREEGEVNES 1141 PVM SS P NN PQ I P+GQP + E S QGSP REEGEV ES Sbjct: 481 PVMASSSCVPPLTATQVMMPLPNNQCPQPIALGRPLGQPGLPEPSLQGSPAREEGEVPES 540 Query: 1140 ELDPDTRRRLLILQHGQDIRGP-PPYQLRPHLEVXXXXXXXXXXXXXXXXEVKPKQLNEV 964 ELDPDTRRRLLILQHGQDIR P P + +R L V E+ P+Q + Sbjct: 541 ELDPDTRRRLLILQHGQDIRDPTPQFPVRTPLHVAVSPVQSRGSWFPLEEEMNPRQPSRA 600 Query: 963 SREYHLQPETT---RHQRSHLSSFFSGEKDSNPTDRVNHRNKKLSTEVQSGSDNLRRNRS 793 +E+ L+PET + + +H S + SGE +S +DRV + N++L+ ++ G D LR N + Sbjct: 
601 PKEFPLEPETVCLDKKRPNHQSYYRSGE-NSISSDRVLNENRRLAMQLHHGDDRLRPNHA 659 Query: 792 GSNSLK---------------DDMSRHSISTRNRDERFKAGH-VIEFSKDPVEVLQGIAA 661 +N ++M IS+ ++D +F++G ++++ P VLQ IA Sbjct: 660 AANYDSFPGVLFPNQTLDFEGEEMPTGRISSSHKDIQFESGRATAQYARTPAGVLQDIAT 719 Query: 660 ASGAKVEYRTALLNTIELQHSVEAWFVGEKIGEGTGRTRKEAHKLAVDKAIENLANNYLN 481 GAKVE+RTAL +T ELQ S+E WFVGEKIGEG G+TRKEA + A D ++ LAN YL+ Sbjct: 720 KCGAKVEFRTALCDTTELQFSMEVWFVGEKIGEGIGKTRKEAQQQATDFSLRTLANKYLS 779 Query: 480 DSSKPDSFRDREI--SHTKKIDFLRNSNLS---TFSMSDPL--SNTTEDSRSLNHRLEGS 322 +++ D+ R + S+ K+ F+ ++N S ++ D L ++T+E+SR ++ RLEGS Sbjct: 780 NATS-DTLRGDMLKPSNAKENGFISDANSSGYPAYARDDLLAVASTSEESRFMDLRLEGS 838 Query: 321 IKTSDSVATLKELCTSKGFSLAFKADQCHPASSDDKKEVYAQVEVAGQILGKGTGVTWNV 142 K++ S+A LKELCT +GFSL F+A S K EV QVEVAGQILGKG G TW Sbjct: 839 KKSTTSIAALKELCTIEGFSLNFQAQPSPSTDSVSKGEVCTQVEVAGQILGKGVGTTWEE 898 Query: 141 AKSLAAEEAVANLKSMLGQFTQKYINSPRLLQTAVEKSWRTDF 13 AK AAEEA+ LKSMLGQFTQK SPR + K + DF Sbjct: 899 AKLQAAEEALGTLKSMLGQFTQKRSGSPRSVSATPNKRLKPDF 941 >ref|XP_008775881.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain phosphatase-like 1 [Phoenix dactylifera] Length = 945 Score = 1006 bits (2600), Expect = 0.0 Identities = 555/934 (59%), Positives = 658/934 (70%), Gaps = 19/934 (2%) Frame = -1 Query: 2757 MLRSLVYKENSFIGEAEIRPQNNKSGVWTREIRISHLSQASDRCPPLAVLYTIAADGFCF 2578 M +S VY NS IGEAEI PQN+ G W REIRISH S +S+RC PLAVL+TIA+ G F Sbjct: 1 MFKSAVYHGNSLIGEAEIFPQNSNPGAWVREIRISHFSPSSERCLPLAVLHTIASGGVSF 60 Query: 2577 KMEXXXXXXXXXXXXL-HAACLREKKTAVVPLGNEELHLVAMPSRTNPLQYSCFWGFTVA 2401 KME HAACLRE KTAV+PLG EELHLVAM S N + ++CFWG VA Sbjct: 61 KMESRSPPSDESPLCSLHAACLRENKTAVIPLGGEELHLVAMNSGKNLMHHACFWGXNVA 120 Query: 2400 SGLYNSCLLMLNLRCLGIVFDLDETLVVANTIRSFEDRISNLQQKISNETEHQRLATMLA 2221 SGLYNSCL MLNLRCLGIVFDLDETL+VANT+RSFEDRI LQ+K+SNET+ QR+ MLA Sbjct: 121 SGLYNSCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIDALQRKLSNETDPQRVTGMLA 180 Query: 2220 EVKRYQDDKLILKQFIENDQVVENGKVFKVQTEVVRPLSESHQQINRPVIRLPEKNIILT 2041 E+KRYQDDK 
ILKQ+ ENDQVVENGKV+KVQ+EVV PLS+SHQ I RPVIRL EKNIILT Sbjct: 181 EIKRYQDDKSILKQYAENDQVVENGKVYKVQSEVVPPLSDSHQLITRPVIRLQEKNIILT 240 Query: 2040 RVNPTIRDTSVLVRVRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEIWRLLDPESSL 1861 RVNP IRDTSVLVR+RPAWE+LRSYL ARGRKRFEVYVCTMAERDYALE+WRLLDP+SSL Sbjct: 241 RVNPLIRDTSVLVRLRPAWEELRSYLIARGRKRFEVYVCTMAERDYALEMWRLLDPDSSL 300 Query: 1860 INSDKLLNRIVCVKSELKKSLLNVFQDGKCHPKMALVIDDRLTVWDLKDKRRVHVVPAFA 1681 I+S +LL+RIVCVKS +KSLL+VFQDG CHPKMALVIDDRL VWD KD+ RVH VPAFA Sbjct: 301 ISSIQLLDRIVCVKSGSRKSLLSVFQDGICHPKMALVIDDRLKVWDEKDQPRVHCVPAFA 360 Query: 1680 PYYAPQAEVASSVAVLCVARNVACNVRGGFFKEYDEGLLPRIAAAYYEDEMRDLPSAPDV 1501 PYYAPQAE +V VLCVARNVACNVRGGFFKE+DEGLLPRI+ ++YEDE +D PSAPDV Sbjct: 361 PYYAPQAEANGNVPVLCVARNVACNVRGGFFKEFDEGLLPRISDSFYEDEWKDFPSAPDV 420 Query: 1500 SNYLIPEEDTSTTNGNADPLCFDGMVDAEVERRLKEASCNSQAVPPMFNNFNQMQMPSVH 1321 NYLI E+D +T+NGN D LCF+GM DAEVERRLK A+ PM NNF+ + S+ Sbjct: 421 GNYLISEDDNATSNGNKDQLCFEGMTDAEVERRLK-------AIHPMVNNFDPRSVSSIQ 473 Query: 1320 PVMPSSYGXXXXXXXXXXXPYSNNPFPQAIVSVNP-VGQPSVSEHSFQGSPVREEGEVNE 1144 VM SS P NN PQ I P V Q + E S QGSP REEGEV E Sbjct: 474 HVMASSSAALPQTATQAMMPLPNNNCPQPIALGRPLVCQSGLPEPSLQGSPAREEGEVPE 533 Query: 1143 SELDPDTRRRLLILQHGQDIRGP-PPYQLRPHLEVXXXXXXXXXXXXXXXXEVKPKQLNE 967 SELDPDTRRRLLILQHGQD R P P + +R L V E+ P+QL+ Sbjct: 534 SELDPDTRRRLLILQHGQDTRDPTPSFTVRSPLHVAVPPVQSRGNWFPLEEEMNPRQLSR 593 Query: 966 VSREYHLQPETTRHQR---SHLSSFFSGEKDSNPTDRVNHRNKKLSTEVQSGSDNLRRNR 796 +E+ L+PET R + +H S F SGE +S +DRV H N+ L ++ G D LR N Sbjct: 594 EPKEFTLEPETIRFNKKRPNHQSYFRSGE-NSISSDRVLHENRGLPMQLHQGDDRLRPNH 652 Query: 795 SGSNSLK---DDMSRHSISTRNRDERFKAGH-VIEFSKDPVEVLQGIAAASGAKVEYRTA 628 + +N ++M IS+ ++D +F++G +++ P VLQ IA GAKVE+RTA Sbjct: 653 AAANYNSFPGEEMPAGLISSSHKDTQFESGRATARYAETPAGVLQNIAMKCGAKVEFRTA 712 Query: 627 LLNTIELQHSVEAWFVGEKIGEGTGRTRKEAHKLAVDKAIENLANNYL-NDSSKPDSFRD 451 L +T LQ S+E WFVG K+GEG G+TRKEA + A + ++ LAN YL N S P S D Sbjct: 713 LCDTTNLQFSMEVWFVGGKLGEGIGKTRKEAQQQAAEISLRTLANKYLSNARSDPSSHGD 772 Query: 450 
R-EISHTKKIDFLRNSNLSTFSMSD-------PLSNTTEDSRSLNHRLEGSIKTSDSVAT 295 + H K+ F S+L++F P+++T+E+SR ++ RLEG KT+ +VA Sbjct: 773 MLKPFHIKENGF--TSDLNSFGYPACARDDVLPVASTSEESRLMDQRLEGPNKTAAAVAA 830 Query: 294 LKELCTSKGFSLAFKADQCHPASSDDKKEVYAQVEVAGQILGKGTGVTWNVAKSLAAEEA 115 LK+LCT KGF+L F+A A S K EVYAQVEVAGQILGKG G TW AK AAEEA Sbjct: 831 LKDLCTIKGFNLVFQAQSSPSAGSVSKGEVYAQVEVAGQILGKGVGTTWEEAKLQAAEEA 890 Query: 114 VANLKSMLGQFTQKYINSPRLLQTAVEKSWRTDF 13 + LKSMLGQFTQK+ SPR L K + DF Sbjct: 891 LGALKSMLGQFTQKHSGSPRSLSATPNKRLKADF 924 >ref|XP_010933000.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 isoform X2 [Elaeis guineensis] Length = 915 Score = 1001 bits (2588), Expect = 0.0 Identities = 546/924 (59%), Positives = 644/924 (69%), Gaps = 9/924 (0%) Frame = -1 Query: 2757 MLRSLVYKENSFIGEAEIRPQNNKSGVWTREIRISHLSQASDRCPPLAVLYTIAADGFCF 2578 M +S VY NS IGEAEI PQN+ G W REIRISH S +S+RCPPLAVL+TIA+ G F Sbjct: 1 MFKSAVYHGNSLIGEAEIFPQNSNPGAWVREIRISHFSPSSERCPPLAVLHTIASGGVSF 60 Query: 2577 KMEXXXXXXXXXXXXL-HAACLREKKTAVVPLGNEELHLVAMPSRTNPLQYSCFWGFTVA 2401 KME HAACLRE KTAV+PLG EELHLVAM SR N +QY+CFWGF VA Sbjct: 61 KMESKSAPSDESPLCSLHAACLRENKTAVIPLGEEELHLVAMNSRKNLMQYACFWGFNVA 120 Query: 2400 SGLYNSCLLMLNLRCLGIVFDLDETLVVANTIRSFEDRISNLQQKISNETEHQRLATMLA 2221 SGLYNSCL MLNLRCLGIVFDLDETL+VANT+RSFEDRI LQ+KIS ET+ QR+ MLA Sbjct: 121 SGLYNSCLGMLNLRCLGIVFDLDETLIVANTLRSFEDRIDALQRKISTETDPQRVTGMLA 180 Query: 2220 EVKRYQDDKLILKQFIENDQVVENGKVFKVQTEVVRPLSESHQQINRPVIRLPEKNIILT 2041 E+KRYQDDK ILKQ+ E DQVVENGKV++VQ+EVV PLS+SH I RPV+RL EKNIILT Sbjct: 181 ELKRYQDDKSILKQYAEIDQVVENGKVYQVQSEVVPPLSDSHHLITRPVLRLQEKNIILT 240 Query: 2040 RVNPTIRDTSVLVRVRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEIWRLLDPESSL 1861 RVNP+IRDTSVLVR+RPAWE+LRSYLTARGRKRFEVYVCTMAERDYALE+WRLLDP+SSL Sbjct: 241 RVNPSIRDTSVLVRLRPAWEELRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSSL 300 Query: 1860 INSDKLLNRIVCVKSELKKSLLNVFQDGKCHPKMALVIDDRLTVWDLKDKRRVHVVPAFA 1681 I+S +L++RIVCVKS +KSLL+VFQDG CHPKMALVIDDRL VWD KD+ RVHVVPAFA Sbjct: 301 
ISSTRLIDRIVCVKSGSRKSLLSVFQDGICHPKMALVIDDRLKVWDEKDQPRVHVVPAFA 360 Query: 1680 PYYAPQAEVASSVAVLCVARNVACNVRGGFFKEYDEGLLPRIAAAYYEDEMRDLPSAPDV 1501 PYYAPQAE +V VLCVARNVACNVRGGFFKE+DEGLLPRI+ +YEDE +D PSAPDV Sbjct: 361 PYYAPQAEANGNVPVLCVARNVACNVRGGFFKEFDEGLLPRISDIFYEDEWKDFPSAPDV 420 Query: 1500 SNYLIPEEDTSTTNGNADPLCFDGMVDAEVERRLKEASCNSQAVPPMFNNFNQMQMPSVH 1321 NYLI E+D +T+ GN D LCF GM DAEVERRLKEA+CN QAV PM NN + S+ Sbjct: 421 GNYLISEDDNATSIGNKDQLCFKGMTDAEVERRLKEANCNVQAVHPMVNNLDLRSASSIQ 480 Query: 1320 PVMPSSYGXXXXXXXXXXXPYSNNPFPQAIVSVNP-VGQPSVSEHSFQGSPVREEGEVNE 1144 VM SS P NN Q I P V QP + E S QGSP REEGEV E Sbjct: 481 HVMASSSAVPPLTATQAMMPLPNNQCSQPIALGRPLVCQPGLPEPSLQGSPAREEGEVPE 540 Query: 1143 SELDPDTRRRLLILQHGQDIRGP-PPYQLRPHLEVXXXXXXXXXXXXXXXXEVKPKQLNE 967 SELDPDTRRRLLILQHGQD R P PP+ +R L E+ PKQLN Sbjct: 541 SELDPDTRRRLLILQHGQDTRDPTPPFTVRSPLHEAVPPVQSQGNWFPMEEEMNPKQLNR 600 Query: 966 VSREYHLQPET--TRHQRSHLSSFFSGEKDSNPTDRVNHRNKKLSTEVQSGSDNLRRNRS 793 +E+ ++PET +R H S+F ++S ++RV H N++L ++ G D LR N + Sbjct: 601 APKEFTVEPETVHVNKKRPHHQSYFRSGENSISSERVLHENQRLPMQLHPGDDRLRPNHA 660 Query: 792 GSN---SLKDDMSRHSISTRNRDERFKAGHVI-EFSKDPVEVLQGIAAASGAKVEYRTAL 625 +N ++M IS+ +R +F+ G I + ++ P VLQ IA GAKVE+RTAL Sbjct: 661 AANYNCFPGEEMPAGLISSSHRGLQFEPGWAIAQCAETPAGVLQNIAMKCGAKVEFRTAL 720 Query: 624 LNTIELQHSVEAWFVGEKIGEGTGRTRKEAHKLAVDKAIENLANNYLNDSSKPDSFRDRE 445 +T EL+ +E WFVGEK+GEG G+TRKEAH+ A + ++ LA +D Sbjct: 721 CDTTELKFCMEVWFVGEKVGEGIGKTRKEAHQQAAEISLRTLAACARDDVL--------- 771 Query: 444 ISHTKKIDFLRNSNLSTFSMSDPLSNTTEDSRSLNHRLEGSIKTSDSVATLKELCTSKGF 265 P+++T+E+SR ++ RLEGS KT+ SVA LKELCT +GF Sbjct: 772 ----------------------PVASTSEESRFMDQRLEGSNKTATSVAVLKELCTIEGF 809 Query: 264 SLAFKADQCHPASSDDKKEVYAQVEVAGQILGKGTGVTWNVAKSLAAEEAVANLKSMLGQ 85 +L F+A ASS K EVYAQVEVAGQI+G G G TW AK AAEEA+ LKSMLGQ Sbjct: 810 TLGFQAPTSPSASSVSKGEVYAQVEVAGQIVGIGVGTTWEEAKLKAAEEALGTLKSMLGQ 869 Query: 84 FTQKYINSPRLLQTAVEKSWRTDF 13 FT K SPR K + DF Sbjct: 870 FTHKRSGSPRSPSATPNKRLKPDF 893 >ref|XP_009413132.1| PREDICTED: RNA 
polymerase II C-terminal domain phosphatase-like 1 isoform X1 [Musa acuminata subsp. malaccensis] gi|695050309|ref|XP_009413133.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 isoform X1 [Musa acuminata subsp. malaccensis] Length = 949 Score = 965 bits (2495), Expect = 0.0 Identities = 523/929 (56%), Positives = 650/929 (69%), Gaps = 15/929 (1%) Frame = -1 Query: 2757 MLRSLVYKENSFIGEAEIRPQNNKSGVWTREIRISHLSQASDRCPPLAVLYTIAADGFCF 2578 M S VY ENS +GE E+ PQN +G W REIRISHLS +S+RCPPLA+L+T+A+ F Sbjct: 1 MFNSAVYYENSLVGEVEVYPQNPNTGSWLREIRISHLSPSSERCPPLAILHTVASGVVRF 60 Query: 2577 KMEXXXXXXXXXXXXL-HAACLREKKTAVVPLGNEELHLVAMPSRTNPLQYSCFWGFTVA 2401 KME HA E KTAV+ LG EELHLVAM SR NP+ Y+CFWGF+V Sbjct: 61 KMESKSPLSKDSPMSSLHATLFSENKTAVIALGEEELHLVAMASRKNPMPYACFWGFSVL 120 Query: 2400 SGLYNSCLLMLNLRCLGIVFDLDETLVVANTIRSFEDRISNLQQKISNETEHQRLATMLA 2221 S LY S LLMLNLRCLGIVFDLDETL+VANT+RSFEDRI LQ+KISNET+ R+A ML Sbjct: 121 SRLYESSLLMLNLRCLGIVFDLDETLLVANTMRSFEDRIDALQRKISNETDPLRIAGMLT 180 Query: 2220 EVKRYQDDKLILKQFIENDQVVENGKVFKVQTEVVRPLSESHQQINRPVIRLPEKNIILT 2041 E+KRYQDDK ILKQ+ ENDQVVENGKVFKVQ+E+V PLS++HQ I RPVIR+ EK+IILT Sbjct: 181 EIKRYQDDKSILKQYAENDQVVENGKVFKVQSEMVPPLSDNHQLITRPVIRIQEKSIILT 240 Query: 2040 RVNPTIRDTSVLVRVRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEIWRLLDPESSL 1861 RVNP+IRDTSVLVR+RPAWEDLRSYLTARGRKRFEVYVCTMAERDYALE+WRLLDP+SSL Sbjct: 241 RVNPSIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSSL 300 Query: 1860 INSDKLLNRIVCVKSELKKSLLNVFQDGKCHPKMALVIDDRLTVWDLKDKRRVHVVPAFA 1681 INS KLL+RIVCVKS +KSLLNVFQDG CHPKMALVIDDRL VWD KD+ RVHVVPAFA Sbjct: 301 INSSKLLDRIVCVKSGSRKSLLNVFQDGICHPKMALVIDDRLKVWDEKDQPRVHVVPAFA 360 Query: 1680 PYYAPQAEVASSVAVLCVARNVACNVRGGFFKEYDEGLLPRIAAAYYEDEMRDLPSAPDV 1501 PYYAPQAE S++ VLCVARNVACNVRGGFFK++DEG+LPRI+ YEDEM+D P APDV Sbjct: 361 PYYAPQAEANSTIPVLCVARNVACNVRGGFFKDFDEGILPRISEVLYEDEMKDFPPAPDV 420 Query: 1500 SNYLIPEEDTSTTNGNADPLCFDGMVDAEVERRLKEASCNSQAVPPMFNNFNQMQMPSVH 1321 N+LI E+D T N N D +C DGM DAEV +RLKEASC+ QAV PM NF + S+ Sbjct: 421 
GNFLISEDDALTANANKDQVCLDGMEDAEVGKRLKEASCSMQAVQPMVTNFGPRPVSSLQ 480 Query: 1320 PVMPSSYGXXXXXXXXXXXPYSNNPFPQAIVSVNPVGQPSVSEHSFQGSPVREEGEVNES 1141 V PSS+ P NN Q++ P+GQ + E SFQGSP REEGEV ES Sbjct: 481 NV-PSSFNTTSLTAMRMAVPLPNNQCAQSVPVGRPLGQLASPEPSFQGSPAREEGEVPES 539 Query: 1140 ELDPDTRRRLLILQHGQDIRGP-PPYQLRPHLEVXXXXXXXXXXXXXXXXEVKPKQLNEV 964 ELDPDTRRRLLILQHGQD R P P + + P L V E+ P+Q + Sbjct: 540 ELDPDTRRRLLILQHGQDTREPTPSFPVSPPLRVSIPPVQPQGSWFPLEEEIDPRQQDSA 599 Query: 963 SREYHLQPETTRH-QRSHLSSFFSGEKDSNPTDRVNHRNKKLSTEVQSGSDNLRRNRSGS 787 +E+ +P+ R+ +RS SF G ++S P+DRV H ++L ++++G D L+ N S S Sbjct: 600 PKEFSREPDPGRYRKRSRHPSFMHGGENSVPSDRVLHEPRRLPIQLRNGGDRLQLNNSLS 659 Query: 786 NSLK---DDMSRHSISTRNRDERFKAGH-VIEFSKDPVEVLQGIAAASGAKVEYRTALLN 619 N ++M +R++D + + I+ + P VLQ IA KVE+R+ L + Sbjct: 660 NFNSFQGEEMPMGRNFSRHKDAQLEPKQATIKQAGSPPGVLQEIAIKCRNKVEFRSTLCD 719 Query: 618 TIELQHSVEAWFVGEKIGEGTGRTRKEAHKLAVDKAIENLANNYLNDS-SKPDSFRD--R 448 T ELQ S+E WFVGEK+GEG G+TRKEA A D ++ NLA+ YL+++ P++ Sbjct: 720 TAELQFSIEVWFVGEKVGEGVGKTRKEAQHRAADMSLRNLADKYLSNALGGPNTVHGDLL 779 Query: 447 EISHTKKIDFLRNSNLSTFSMSD-----PLSNTTEDSRSLNHRLEGSIKTSDSVATLKEL 283 ++ TK++ L +SN + P+++T+EDSRS++ RLE S +TS + +LKEL Sbjct: 780 KLPQTKEMGLLSDSNSYGYQPCPRNDLLPVASTSEDSRSMDQRLESSRRTS-ATTSLKEL 838 Query: 282 CTSKGFSLAFKADQCHPASSDDKKEVYAQVEVAGQILGKGTGVTWNVAKSLAAEEAVANL 103 C +GF L F+A+ S K EV AQVE+A QILG+G G++W AK AAEEA+ L Sbjct: 839 CVMEGFDLVFRAEPSPSNGSISKGEVSAQVEIARQILGRGVGMSWEDAKLQAAEEALGTL 898 Query: 102 KSMLGQFTQKYINSPRLLQTAVEKSWRTD 16 +SMLGQ++QK+ +SP L K ++ + Sbjct: 899 RSMLGQYSQKHSSSPGSLSMMSNKRFKPE 927 >ref|XP_010241993.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 [Nelumbo nucifera] Length = 948 Score = 936 bits (2419), Expect = 0.0 Identities = 523/935 (55%), Positives = 638/935 (68%), Gaps = 20/935 (2%) Frame = -1 Query: 2757 MLRSLVYKENSFIGEAEIRPQNNKSGVWTREIRISHLSQASDRCPPLAVLYTIAADGFCF 2578 M +S+VY+ NS +GE EI PQN + + +E RISH SQ S+RCPPLAVL+TIA G C Sbjct: 1 
MFKSVVYQGNSPLGEVEIFPQNQEIDMTNKEFRISHFSQPSERCPPLAVLHTIAPCGVCL 60 Query: 2577 KMEXXXXXXXXXXXXLHAACLREKKTAVVPLGNEELHLVAMPSRTNPLQYSCFWGFTVAS 2398 KME LH++CLRE KTAVVPLG EELHLVAMP+R Q CFWGF VA Sbjct: 61 KMESKSQSGDSPLFSLHSSCLRENKTAVVPLGEEELHLVAMPTRKIGEQCLCFWGFNVAP 120 Query: 2397 GLYNSCLLMLNLRCLGIVFDLDETLVVANTIRSFEDRISNLQQKISNETEHQRLATMLAE 2218 GLYNSCL+MLNLRCLGIVFDLDETLVVANT+RSFEDRI LQ+KIS E + QR+A M+AE Sbjct: 121 GLYNSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIDALQRKISTEVDPQRIAGMIAE 180 Query: 2217 VKRYQDDKLILKQFIENDQVVENGKVFKVQTEVVRPLSESHQQINRPVIRLPEKNIILTR 2038 VKRYQDDK+ILKQ+ ENDQV++NGKV KVQ+E+V LS++HQ I RP+IRL E+NIILTR Sbjct: 181 VKRYQDDKIILKQYAENDQVIDNGKVIKVQSEIVPALSDNHQPIVRPLIRLQERNIILTR 240 Query: 2037 VNPTIRDTSVLVRVRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEIWRLLDPESSLI 1858 +NP IRDTSVLVR+RPAWEDLRSYLTARGRKRFEVYVCTMAERDYALE+WRLLDP+S+LI Sbjct: 241 INPGIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNLI 300 Query: 1857 NSDKLLNRIVCVKSELKKSLLNVFQDGKCHPKMALVIDDRLTVWDLKDKRRVHVVPAFAP 1678 N+ +LL+RIVCVK+ +KSLLNVFQ G CHPKMALVIDDRL VWD KD+ RVHVVPAFAP Sbjct: 301 NTKELLDRIVCVKAGSRKSLLNVFQVGICHPKMALVIDDRLKVWDEKDQPRVHVVPAFAP 360 Query: 1677 YYAPQAEVASSVAVLCVARNVACNVRGGFFKEYDEGLLPRIAAAYYEDEMRDLPSAPDVS 1498 YYAPQAE ++V VLCVARNVACNVRGGFFKE+DE LL RI +YED+M PS PDVS Sbjct: 361 YYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDEVLLQRIPEIFYEDDMAGFPSPPDVS 420 Query: 1497 NYLIPEEDTSTTNGNADPLCFDGMVDAEVERRLKEASCNSQAVPPMFNNFNQMQMPSV-H 1321 NYLI E+DTS +NGN DPLCF+G+ D EVERRLK+A S V N ++P + H Sbjct: 421 NYLISEDDTSASNGNKDPLCFEGITDVEVERRLKDAIPASSLV-----NSLDPRLPLIQH 475 Query: 1320 PVMPSSYGXXXXXXXXXXXPYSNNPFPQAIVSVNPVGQPSVSEHSFQGSPVREEGEVNES 1141 V SS P+ N FP P+ Q E S Q SP REEGEV ES Sbjct: 476 AVASSSSSVSLPTSQGPMMPFPNKQFPHVATLAKPLVQVGPPELSLQSSPAREEGEVPES 535 Query: 1140 ELDPDTRRRLLILQHGQDIR----GPPPYQLRPHLEVXXXXXXXXXXXXXXXXEVKPKQL 973 ELDPDTRRRLLILQHGQD R PP+ +RP L+V E+ P+QL Sbjct: 536 ELDPDTRRRLLILQHGQDTREHTSSEPPFPVRPPLQVSVPAVQSHGSWFPSEEEMSPRQL 595 Query: 972 NE-VSREYHLQPETTR--HQRSHLSSFFSGEKDSNPTDRVNHRNKKLSTEVQSGSDNLRR 802 N + 
+E+ L+PE R FF G + S P+DR + N++L+ EV D +R Sbjct: 596 NRTIPKEFPLEPEAVHFDKHRPRRPPFFQGLESSIPSDRSLNENQRLAKEVHQTDDRMRI 655 Query: 801 NRSGSNSLK---DDMSRHSISTRNRDERFKAGH-VIEFSKDPVEVLQGIAAASGAKVEYR 634 N S S +++ S+ NRD +F++G +++ + P V+Q IA G KVE+R Sbjct: 656 NHSVSGHRPLSGEELPLGRSSSSNRDLQFESGRGNLQYPETPAGVVQEIAMKCGTKVEFR 715 Query: 633 TALLNTIELQHSVEAWFVGEKIGEGTGRTRKEAHKLAVDKAIENLANNYLND-SSKPDSF 457 L+ + ELQ S E +F+GEK+GEG GRTRKEA A + +I NLAN YL+ S P+S Sbjct: 716 HGLVASTELQFSFEVYFMGEKVGEGIGRTRKEAQHQAAENSIRNLANKYLSHIKSDPNSS 775 Query: 456 R--DREISHTKKIDFLRNSN---LSTFSMSDPLSNTT--EDSRSLNHRLEGSIKTSDSVA 298 ++SH + L ++N FS D LS +T E SR + RLEGS K+ S++ Sbjct: 776 HGDGNKLSHGNENGLLNDTNSFGSLPFSKEDSLSLSTSSESSRFVETRLEGSKKSVGSLS 835 Query: 297 TLKELCTSKGFSLAFKADQCHPASSDDKKEVYAQVEVAGQILGKGTGVTWNVAKSLAAEE 118 LKELCT +G +LAF+ A+S K E+YA+VEVAG +LGKG G +W+ AK AA+E Sbjct: 836 ALKELCTVEGLNLAFQMPPI-SANSTQKGEIYAEVEVAGHVLGKGIGSSWDEAKIQAADE 894 Query: 117 AVANLKSMLGQFTQKYINSPRLLQTAVEKSWRTDF 13 A+ NLK ML Q TQK SPR LQ K + +F Sbjct: 895 ALGNLKLMLSQNTQKRPGSPRSLQGISSKRLKPEF 929 >ref|XP_008225045.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 [Prunus mume] Length = 959 Score = 915 bits (2364), Expect = 0.0 Identities = 507/942 (53%), Positives = 643/942 (68%), Gaps = 27/942 (2%) Frame = -1 Query: 2757 MLRSLVYKENSFIGEAEIRPQNNKSGVWTR-------EIRISHLSQASDRCPPLAVLYTI 2599 M +S+VYK +GE EI P+ N++ + EIRIS+ SQ+S+RCPP+AVL+TI Sbjct: 1 MYKSVVYKGEELLGEVEIYPEENENKNKNKNLVDELKEIRISYFSQSSERCPPVAVLHTI 60 Query: 2598 AADGFCFKMEXXXXXXXXXXXXL-HAACLREKKTAVVPLGNEELHLVAMPSRTNPLQYSC 2422 ++ G CFKME L H++C+ E KTAV+PLG EELHLVAM SR + +Y C Sbjct: 61 SSHGVCFKMESKTSQSQDTPLFLLHSSCVMENKTAVMPLGGEELHLVAMHSRNSDKRYPC 120 Query: 2421 FWGFTVASGLYNSCLLMLNLRCLGIVFDLDETLVVANTIRSFEDRISNLQQKISNETEHQ 2242 FWGF+VA GLYNSCL+MLNLRCLGIVFDLDETL+VANT+RSFEDRI LQ+KIS+E + Q Sbjct: 121 FWGFSVAPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKISSEVDSQ 180 Query: 2241 RLATMLAEVKRYQDDKLILKQFIENDQVVENGKVFKVQTEVVRPLSESHQQINRPVIRLP 2062 R++ 
MLAE+KRYQDDK ILKQ+ ENDQVVENG+V K Q+E V LS++HQ I RP+IRL Sbjct: 181 RISGMLAEIKRYQDDKFILKQYAENDQVVENGRVIKTQSEAVPALSDNHQPIIRPLIRLL 240 Query: 2061 EKNIILTRVNPTIRDTSVLVRVRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEIWRL 1882 EKNIILTR+NP IRDTSVLVR+RPAWEDLRSYLTARGRKRFEVYVCTMAERDYALE+WRL Sbjct: 241 EKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRL 300 Query: 1881 LDPESSLINSDKLLNRIVCVKSELKKSLLNVFQDGKCHPKMALVIDDRLTVWDLKDKRRV 1702 LDP+S+LINS+KLL+RIVCVKS +KSL NVFQ+ CHPKMALVIDDRL VWD +D+ RV Sbjct: 301 LDPDSNLINSNKLLDRIVCVKSGSRKSLFNVFQESLCHPKMALVIDDRLKVWDDRDQPRV 360 Query: 1701 HVVPAFAPYYAPQAEVASSVAVLCVARNVACNVRGGFFKEYDEGLLPRIAAAYYEDEMRD 1522 HVVPAFAPYYAPQAE ++V VLCVARNVACNVRGGFF+E+D+ LL +I +YED+++D Sbjct: 361 HVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFREFDDSLLQKIPEVFYEDDIKD 420 Query: 1521 LPSAPDVSNYLIPEEDTSTTNGNADPLCFDGMVDAEVERRLKEASCNSQAVPPMFNNFNQ 1342 +PS PDVSNYL+ E+D+S NGN DPL FDG+ D EVERR+KEA+ + V + + + Sbjct: 421 VPS-PDVSNYLVSEDDSSALNGNRDPLPFDGITDVEVERRMKEATSAASMVSSVVTSIDP 479 Query: 1341 MQMPSVHPVMPSSYGXXXXXXXXXXXPYSNNPFPQAIVSVNPVGQPSVSEHSFQGSPVRE 1162 + V PSS + + FPQA V P+G +E S Q SP RE Sbjct: 480 RLASLQYTVAPSSSTLSLPTTQPSVMSFPSIQFPQAASLVKPLGHVGSTEPSLQSSPARE 539 Query: 1161 EGEVNESELDPDTRRRLLILQHGQDIR----GPPPYQLRPHLEVXXXXXXXXXXXXXXXX 994 EGEV ESELDPDTRRRLLILQHGQD R PP+ +RP ++ Sbjct: 540 EGEVPESELDPDTRRRLLILQHGQDTRDQPPSEPPFPVRPPMQASVPRAQSRPGWFPVEE 599 Query: 993 EVKPKQLNE-VSREYHLQPETTR--HQRSHLSSFFSGEKDSNPTDRVNHRNKKLSTEVQS 823 E+ P+QL+ V ++ L PE + R H SSFF ++S P+DR+ N++L E Sbjct: 600 EMSPRQLSRMVPKDLPLDPEPVQIEKHRPHHSSFFPKVENSIPSDRILQENQRLPKEAFH 659 Query: 822 GSDNLRRNR--SGSNSLK-DDMSRHSISTRNRDERFKAGHVIEFSKDPVEVLQGIAAASG 652 D LR N SG +SL +++ S+ NRD F++G I ++ P VLQ IA G Sbjct: 660 RDDRLRFNHALSGYHSLSGEEIPLSRSSSSNRDVDFESGRAISNAETPAGVLQEIAMKCG 719 Query: 651 AKVEYRTALLNTIELQHSVEAWFVGEKIGEGTGRTRKEAHKLAVDKAIENLANNYLNDSS 472 AKVE+R AL+ ++ELQ VEAWF GEKIGEG+G+TR+EAH A + +++NLAN YL+ Sbjct: 720 AKVEFRPALVASMELQFYVEAWFAGEKIGEGSGKTRREAHYQAAEGSLKNLANIYLS-RV 778 Query: 471 
KPDSFR----DREISHTKKIDFLRNSN---LSTFSMSDPLSNTT--EDSRSLNHRLEGSI 319 KPDS + + F N N + F + LS++T E SR L+ RLEGS Sbjct: 779 KPDSVSVHGDMNKFPNVNSNGFAGNLNSFGIQPFPKEESLSSSTSSEPSRPLDPRLEGSK 838 Query: 318 KTSDSVATLKELCTSKGFSLAFKADQCHPASSDDKKEVYAQVEVAGQILGKGTGVTWNVA 139 K+ SV+TLKELC +G + F+ +S +K EV+ QVE+ G++LGKG G+TW+ A Sbjct: 839 KSMSSVSTLKELCMMEGLGVVFQPRPPPSTNSVEKDEVHVQVEIDGEVLGKGIGLTWDEA 898 Query: 138 KSLAAEEAVANLKSMLGQFTQKYINSPRLLQTAVEKSWRTDF 13 K AAE+A+ +L S L + QK SPR LQ K + +F Sbjct: 899 KMQAAEKALGSLTSTL--YAQKRQGSPRSLQGMSSKRMKQEF 938 >ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform 1 [Theobroma cacao] gi|508781046|gb|EOY28302.1| C-terminal domain phosphatase-like 1 isoform 1 [Theobroma cacao] Length = 978 Score = 910 bits (2352), Expect = 0.0 Identities = 509/957 (53%), Positives = 633/957 (66%), Gaps = 42/957 (4%) Frame = -1 Query: 2757 MLRSLVYKENSFIGEAEIRPQNNKSGVWT-----------------REIRISHLSQASDR 2629 M +S+VY+ +GE EI PQ +EIRI +L+Q S+R Sbjct: 4 MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63 Query: 2628 CPPLAVLYTIAADGFCFKMEXXXXXXXXXXXXL------HAACLREKKTAVVPLGNEELH 2467 CPPLAVL+TI + G CFKME H+ C+R+ KTAV+P+G+ ELH Sbjct: 64 CPPLAVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCELH 123 Query: 2466 LVAMPSRTNPLQYSCFWGFTVASGLYNSCLLMLNLRCLGIVFDLDETLVVANTIRSFEDR 2287 LVAM SR + CFWGF V+ GLY+SCLLMLNLRCLGIVFDLDETL+VANT+RSFEDR Sbjct: 124 LVAMYSRNS--DRPCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSFEDR 181 Query: 2286 ISNLQQKISNETEHQRLATMLAEVKRYQDDKLILKQFIENDQVVENGKVFKVQTEVVRPL 2107 I LQ+K++ E + QR+A M+AE+KRYQDDK ILKQ+ ENDQVVENGKV K+Q+EVV L Sbjct: 182 IEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVVPAL 241 Query: 2106 SESHQQINRPVIRLPEKNIILTRVNPTIRDTSVLVRVRPAWEDLRSYLTARGRKRFEVYV 1927 S++HQ I RP+IRL EKNIILTR+NP IRDTSVLVR+RPAWEDLRSYLTARGRKRFEVYV Sbjct: 242 SDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 301 Query: 1926 CTMAERDYALEIWRLLDPESSLINSDKLLNRIVCVKSELKKSLLNVFQDGKCHPKMALVI 1747 CTMAERDYALE+WRLLDPES+LINS +LL+RIVCVKS +KSL NVFQDG 
CHPKMALVI Sbjct: 302 CTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVI 361 Query: 1746 DDRLTVWDLKDKRRVHVVPAFAPYYAPQAEVASSVAVLCVARNVACNVRGGFFKEYDEGL 1567 DDRL VWD KD+ RVHVVPAFAPYYAPQAE +++ VLCVARNVACNVRGGFF+E+DEGL Sbjct: 362 DDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGL 421 Query: 1566 LPRIAAAYYEDEMRDLPSAPDVSNYLIPEEDTSTTNGNADPLCFDGMVDAEVERRLKEAS 1387 L RI YED+++D+PS PDV NYL+ E+DTS NGN DPL FDGM DAEVERRLKEA Sbjct: 422 LQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERRLKEAI 481 Query: 1386 CNSQAVPPMFNNFNQMQMPSVHPVMP-SSYGXXXXXXXXXXXPYSNNPFPQAIVSVNPVG 1210 + V N + PS+ MP SS +SN FP A V PV Sbjct: 482 SATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPVVKPVA 541 Query: 1209 QPSVSEHSFQGSPVREEGEVNESELDPDTRRRLLILQHGQDIRGPPPYQ-----LRPHLE 1045 +V E S Q SP REEGEV ESELDPDTRRRLLILQHGQD R P + +RP ++ Sbjct: 542 PVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPVRPTMQ 601 Query: 1044 VXXXXXXXXXXXXXXXXEVKPKQLNEVS-REYHLQPETTRHQRSHLSSFFSGEKDSNPTD 868 V E+ P+QLN + +E+ L E ++ FF + S P+D Sbjct: 602 VSVPRGQSRGSWFAAEEEMSPRQLNRAAPKEFPLDSERMHIEKHRHPPFFPKVESSIPSD 661 Query: 867 RVNHRNKKLSTEVQSGSDNLRRNRSGSNSLK---DDMSRHSISTRNRDERFKAGHVIEFS 697 R+ N++LS E D L N + S+ ++M S+ +RD F++G + Sbjct: 662 RLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFESGRTVTSG 721 Query: 696 KDPVEVLQGIAAASGAKVEYRTALLNTIELQHSVEAWFVGEKIGEGTGRTRKEAHKLAVD 517 + VLQ IA GAKVE+R AL+ +++LQ S+EAWF GEK+GEG GRTR+EA + A + Sbjct: 722 ETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFAGEKVGEGVGRTRREAQRQAAE 781 Query: 516 KAIENLANNYLNDSSKPDS-FRDREISHTKKI-DFLRNSNLSTF-------SMSDPLSNT 364 ++I+NLAN YL+ KPDS + ++S I D SN+++F S S Sbjct: 782 ESIKNLANTYLS-RIKPDSGSAEGDLSRLHNINDNGFPSNVNSFGNQLLAKEESLSFSTA 840 Query: 363 TEDSRSLNHRLEGSIKTSDSVATLKELCTSKGFSLAFKADQCHPASSDDKKEVYAQVEVA 184 +E SR + RLEGS K+ SV LKELC +G + F+ +++ K EVYAQVE+ Sbjct: 841 SEQSRLADPRLEGSKKSMGSVTALKELCMMEGLGVVFQPQPPSSSNALQKDEVYAQVEID 900 Query: 183 GQILGKGTGVTWNVAKSLAAEEAVANLKSMLGQFTQKYINSPRLLQTAVEKSWRTDF 13 GQ+LGKGTG+TW AK AAE+A+ +L+SMLGQ++QK SPR LQ K 
+ +F Sbjct: 901 GQVLGKGTGLTWEEAKMQAAEKALGSLRSMLGQYSQKRQGSPRSLQGMQNKRLKPEF 957 >ref|XP_012455431.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 [Gossypium raimondii] gi|763802547|gb|KJB69485.1| hypothetical protein B456_011G025900 [Gossypium raimondii] Length = 973 Score = 910 bits (2351), Expect = 0.0 Identities = 512/955 (53%), Positives = 632/955 (66%), Gaps = 40/955 (4%) Frame = -1 Query: 2757 MLRSLVYKENSFIGEAEIRPQNNK--------SGVWT------REIRISHLSQASDRCPP 2620 M +S+V + + +GE EI PQ + G T +EIRI +L+Q S+RCPP Sbjct: 3 MYKSVVCRGDEVLGEVEIYPQQQQLREEEEEYGGKITVMEEEMKEIRIGYLTQGSERCPP 62 Query: 2619 LAVLYTIAADGFCFKMEXXXXXXXXXXXXL-------HAACLREKKTAVVPLGNEELHLV 2461 LAVL+TI + G CFKME H+ C+R+ KTAV+P+G+ ELHLV Sbjct: 63 LAVLHTITSTGICFKMESSKDNNYSSSFQDTPPLHLLHSECIRDNKTAVMPMGDCELHLV 122 Query: 2460 AMPSRTNPLQYSCFWGFTVASGLYNSCLLMLNLRCLGIVFDLDETLVVANTIRSFEDRIS 2281 AM SR + CFWGF VA GLY+SCL+MLNLRCLGIVFDLDETLVVANT+RSFEDRI Sbjct: 123 AMYSRNS--DRPCFWGFNVARGLYDSCLVMLNLRCLGIVFDLDETLVVANTMRSFEDRIE 180 Query: 2280 NLQQKISNETEHQRLATMLAEVKRYQDDKLILKQFIENDQVVENGKVFKVQTEVVRPLSE 2101 LQ+K++ E + QR A M+AE+KRYQDDK ILKQ+ ENDQVVENGKV KVQ+E+V+PLS+ Sbjct: 181 ALQRKMNTEVDTQRAAGMMAEIKRYQDDKAILKQYAENDQVVENGKVIKVQSEIVQPLSD 240 Query: 2100 SHQQINRPVIRLPEKNIILTRVNPTIRDTSVLVRVRPAWEDLRSYLTARGRKRFEVYVCT 1921 +HQ I RP+IRL EKNIILTR+NP IRDTSVLVR+RPAWEDLRSYLTARGRKRFEVYVCT Sbjct: 241 NHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT 300 Query: 1920 MAERDYALEIWRLLDPESSLINSDKLLNRIVCVKSELKKSLLNVFQDGKCHPKMALVIDD 1741 MAERDYALE+WRLLDPES+LINS +LL+RIVCVKS L+KSL NVFQDG CHPKMALVIDD Sbjct: 301 MAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLRKSLFNVFQDGICHPKMALVIDD 360 Query: 1740 RLTVWDLKDKRRVHVVPAFAPYYAPQAEVASSVAVLCVARNVACNVRGGFFKEYDEGLLP 1561 RL VWD KD+ RVHVVPAFAPY+APQAE +++ VLCVARNVACNVRGGFF+E+DEGLL Sbjct: 361 RLKVWDEKDQPRVHVVPAFAPYFAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGLLQ 420 Query: 1560 RIAAAYYEDEMRDLPSAPDVSNYLIPEEDTSTTNGNADPLCFDGMVDAEVERRLKEASCN 1381 +I YED+++D+PS PDV NYL+ E+DTS + N DP FDGM DAEVERRLKEA 
Sbjct: 421 KIPEISYEDDIKDIPSPPDVGNYLVSEDDTSASTANKDPPIFDGMADAEVERRLKEAISA 480 Query: 1380 SQAVPPMFNNFNQMQMPSVHPVMPSSYGXXXXXXXXXXXPYSNNPFPQAIVSVNPVGQPS 1201 + V N + S+ MPSS Y N FPQA + PV Sbjct: 481 ASTVSSASINLDPRLASSLQFTMPSSSSVPLLAVQSSMASYPNMQFPQAAQVIKPVAPVV 540 Query: 1200 VSEHSFQGSPVREEGEVNESELDPDTRRRLLILQHGQDIRGPPPYQ-----LRPHLEVXX 1036 E S Q SP REEGEV ESELDPDTRRRLLILQHGQD R P + RP ++V Sbjct: 541 SPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPARPAMQVPV 600 Query: 1035 XXXXXXXXXXXXXXEVKPKQLNE-VSREYHLQPETTRHQRSHLSSFFSGEKDSNPTDRVN 859 E+ P+QLN V +E+ L E ++ FF + P++R+ Sbjct: 601 SRAQSRGSWFSSDEEMSPRQLNRAVPKEFPLDSEQMHMEKHRGPPFFPKVESPIPSERLL 660 Query: 858 HRNKKLSTEVQSGSDNLRRNRSGSNSLK---DDMSRHSISTRNRDERFKAGHVIEFSKDP 688 N++L E D L N + S+ ++M S+ ++D F++G I + P Sbjct: 661 RENQRLPKEALHRDDRLGLNHTPSSYHSFPGEEMPLGRSSSSHKDLDFESGRTIPSGETP 720 Query: 687 VEVLQGIAAASGAKVEYRTALLNTIELQHSVEAWFVGEKIGEGTGRTRKEAHKLAVDKAI 508 VLQ IA GAKVE+R AL+ +++LQ S+EAWF GEK+GEGTGRTR+EA + A + +I Sbjct: 721 AGVLQDIAMKCGAKVEFRPALVASMDLQFSIEAWFAGEKVGEGTGRTRREAQRQAAEDSI 780 Query: 507 ENLANNYLNDSSKPDSFRDR----EISHTKKIDFLRNSNL-----STFSMSDPLSNTTED 355 ++LAN YL+ KPD+ + ++T + F N NL S S P SN E Sbjct: 781 KSLANTYLS-RIKPDTGSTQGDLSRSANTNENGFPGNLNLYGNQQSPKEESMPFSNAPEP 839 Query: 354 SRSLNHRLEGSIKTSDSVATLKELCTSKGFSLAFKADQCHPASSD-DKKEVYAQVEVAGQ 178 SR L+ RLEGS ++ SV LKELC +G + F+A PAS+ K EVYA+VEV GQ Sbjct: 840 SRLLDPRLEGSRRSMGSVTALKELCMMEGLGVVFQAQP--PASNTLQKDEVYAEVEVDGQ 897 Query: 177 ILGKGTGVTWNVAKSLAAEEAVANLKSMLGQFTQKYINSPRLLQTAVEKSWRTDF 13 +LGKGTG TW AK AAE+A+ +L+SMLGQFTQK SPR LQ K + +F Sbjct: 898 VLGKGTGFTWEEAKMQAAEKALGSLRSMLGQFTQKRQGSPRSLQDMPSKRLKPEF 952 >ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Populus trichocarpa] gi|550340277|gb|EEE85528.2| hypothetical protein POPTR_0004s04010g [Populus trichocarpa] Length = 996 Score = 909 bits (2350), Expect = 0.0 Identities = 516/990 (52%), Positives = 636/990 (64%), Gaps = 75/990 (7%) Frame = -1 Query: 2757 
MLRSLVYKENSFIGEAEIRPQ----------NNKSGVW---TREIRISHLSQASDRCPPL 2617 M +S+VYK + +GE EI Q N K V +EIRISH SQ S+RCPPL Sbjct: 1 MYKSVVYKGDELLGEVEIYAQEQQQEEEENKNKKKRVIDEIVKEIRISHFSQTSERCPPL 60 Query: 2616 AVLYTIAADGFCFKMEXXXXXXXXXXXXL-------HAACLREKKTAVVPLGNEELHLVA 2458 AVL+TI + G CFKME H++C++E KTAV+ LG EELHLVA Sbjct: 61 AVLHTITSIGVCFKMEESTSSSTTKISQQESPLHLLHSSCIQENKTAVMHLGGEELHLVA 120 Query: 2457 MPSRTNPLQYSCFWGFTVASGLYNSCLLMLNLRCLGIVFDLDETLVVANTIRSFEDRISN 2278 MPSR+N Q+ CFWGF+VA GLY+SCL+MLNLRCLGIVFDLDETL+VANT+RSFEDRI Sbjct: 121 MPSRSNERQHPCFWGFSVAPGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDA 180 Query: 2277 LQQKISNETEHQRLATMLAEVKRYQDDKLILKQFIENDQVVENGKVFKVQTEVVRPLSES 2098 LQ+KIS E + QR+ ML+EVKRY DDK ILKQ++ENDQVVENGKV K Q+EVV LS++ Sbjct: 181 LQRKISTEVDPQRILGMLSEVKRYHDDKNILKQYVENDQVVENGKVIKTQSEVVPALSDN 240 Query: 2097 HQQINRPVIRLPEKNIILTRVNPTIRDTSVLVRVRPAWEDLRSYLTARGRKRFEVYVCTM 1918 HQ + RP+IRL EKNIILTR+NP IRDTSVLVR+RPAWEDLRSYLTARGRKRFEVYVCTM Sbjct: 241 HQPMVRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTM 300 Query: 1917 AERDYALEIWRLLDPESSLINSDKLLNRIVCVKSELKKSLLNVFQDGKCHPKMALVIDDR 1738 AERDYALE+WRLLDPES+LINS +LL+RIVCVKS L+KSL NVFQDG CHPKMALVIDDR Sbjct: 301 AERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLRKSLFNVFQDGICHPKMALVIDDR 360 Query: 1737 LTVWDLKDKRRVHVVPAFAPYYAPQAEVASSVAVLCVARNVACNVRGGFFKEYDEGLLPR 1558 L VWD +D+ RVHVVPAFAPYYAPQAEV ++V VLCVARNVACNVRGGFFKE+DEGLL + Sbjct: 361 LKVWDERDQSRVHVVPAFAPYYAPQAEVNNAVPVLCVARNVACNVRGGFFKEFDEGLLQK 420 Query: 1557 IAAAYYEDEMRDLPSAPDVSNYLIPEEDTSTTNGNADPLCFDGMVDAEVERRLKEASCNS 1378 I YED+ ++PS PDVSNYL+ E+D S NGN D L FDGM DAEVER+LKEA S Sbjct: 421 IPEVAYEDDTDNIPSPPDVSNYLVSEDDASAVNGNRDQLSFDGMADAEVERQLKEAVSAS 480 Query: 1377 QAV------------PPMFNNF--------------------NQMQMPSVHPVMPSSYGX 1294 A+ P + + +Q MP++ P P S Sbjct: 481 SAILSTIPSTVSSLDPRLLQSLQYTIASSSSSMPTSQPSMLASQQPMPALQPPKPPS--- 537 Query: 1293 XXXXXXXXXXPYSNNPFPQAIVSVNPVGQPSVSEHSFQGSPVREEGEVNESELDPDTRRR 1114 P+ N FPQ SV +GQ E S Q SP REEGEV ESELDPDTRRR Sbjct: 538 
-----QLSMTPFPNTQFPQVAPSVKQLGQVVPPEPSLQSSPAREEGEVPESELDPDTRRR 592 Query: 1113 LLILQHGQDIR----GPPPYQLRPHLEVXXXXXXXXXXXXXXXXEVKPKQLNEVSREYHL 946 LLILQHG D R P+ RP +V E+ P+QLN RE+ L Sbjct: 593 LLILQHGHDSRDNAPSESPFPARPSTQVSAPRVQSVGSWVPVEEEMSPRQLNRTPREFPL 652 Query: 945 --QPETTRHQRSHLSSFFSGEKDSNPTDRVNHRNKKLSTEVQSGSDNLRRNRSGSN--SL 778 P R+H SFF + + P+DR+ H N++ E D ++ N S SN S Sbjct: 653 DSDPMNIEKHRTHHPSFFHKVESNIPSDRMIHENQRQPKEATYRDDRMKLNHSTSNYPSF 712 Query: 777 KDDMSRHSISTRNRDERFKAGHVIEFSKDPVEVLQGIAAASGAKVEYRTALLNTIELQHS 598 + + S S S+ NRD ++ ++ PVEVLQ IA G KVE+R AL+ T +LQ S Sbjct: 713 QGEESPLSRSSSNRDLDLESERAFSSTETPVEVLQEIAMKCGTKVEFRPALIATSDLQFS 772 Query: 597 VEAWFVGEKIGEGTGRTRKEAHKLAVDKAIENLANNY-----------LNDSSKPDSFRD 451 +E WFVGEK+GEGTG+TR+EA + A + +I+ LA Y L DSS+ S D Sbjct: 773 IETWFVGEKVGEGTGKTRREAQRQAAEGSIKKLAGIYMSRVKPDSGPMLGDSSRYPSAND 832 Query: 450 R----EISHTKKIDFLRNSNLSTFSMSDPLSNTTEDSRSLNHRLEGSIKTSDSVATLKEL 283 +++ L++ N++ S T+E SR L+ RLEGS K+ SV LKE Sbjct: 833 NGFLGDMNSFGNQPLLKDENIT-------YSATSEPSRLLDQRLEGSKKSMGSVTALKEF 885 Query: 282 CTSKGFSLAFKADQCHPASSDDKKEVYAQVEVAGQILGKGTGVTWNVAKSLAAEEAVANL 103 C ++G + F A +S +EV+AQVE+ GQ+LGKG G+TW+ AK AAE+A+ +L Sbjct: 886 CMTEGLGVNFLAQTPLSTNSIPGEEVHAQVEIDGQVLGKGIGLTWDEAKMQAAEKALGSL 945 Query: 102 KSMLGQFTQKYINSPRLLQTAVEKSWRTDF 13 ++M GQ+T K SPRL+Q K + +F Sbjct: 946 RTMFGQYTPKRQGSPRLMQGMPNKRLKQEF 975 >ref|XP_007025681.1| C-terminal domain phosphatase-like 1 isoform 2 [Theobroma cacao] gi|508781047|gb|EOY28303.1| C-terminal domain phosphatase-like 1 isoform 2 [Theobroma cacao] Length = 984 Score = 905 bits (2339), Expect = 0.0 Identities = 505/943 (53%), Positives = 627/943 (66%), Gaps = 42/943 (4%) Frame = -1 Query: 2757 MLRSLVYKENSFIGEAEIRPQNNKSGVWT-----------------REIRISHLSQASDR 2629 M +S+VY+ +GE EI PQ +EIRI +L+Q S+R Sbjct: 4 MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63 Query: 2628 CPPLAVLYTIAADGFCFKMEXXXXXXXXXXXXL------HAACLREKKTAVVPLGNEELH 2467 CPPLAVL+TI + G CFKME H+ C+R+ KTAV+P+G+ ELH Sbjct: 64 
CPPLAVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCELH 123 Query: 2466 LVAMPSRTNPLQYSCFWGFTVASGLYNSCLLMLNLRCLGIVFDLDETLVVANTIRSFEDR 2287 LVAM SR + CFWGF V+ GLY+SCLLMLNLRCLGIVFDLDETL+VANT+RSFEDR Sbjct: 124 LVAMYSRNS--DRPCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSFEDR 181 Query: 2286 ISNLQQKISNETEHQRLATMLAEVKRYQDDKLILKQFIENDQVVENGKVFKVQTEVVRPL 2107 I LQ+K++ E + QR+A M+AE+KRYQDDK ILKQ+ ENDQVVENGKV K+Q+EVV L Sbjct: 182 IEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVVPAL 241 Query: 2106 SESHQQINRPVIRLPEKNIILTRVNPTIRDTSVLVRVRPAWEDLRSYLTARGRKRFEVYV 1927 S++HQ I RP+IRL EKNIILTR+NP IRDTSVLVR+RPAWEDLRSYLTARGRKRFEVYV Sbjct: 242 SDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 301 Query: 1926 CTMAERDYALEIWRLLDPESSLINSDKLLNRIVCVKSELKKSLLNVFQDGKCHPKMALVI 1747 CTMAERDYALE+WRLLDPES+LINS +LL+RIVCVKS +KSL NVFQDG CHPKMALVI Sbjct: 302 CTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVI 361 Query: 1746 DDRLTVWDLKDKRRVHVVPAFAPYYAPQAEVASSVAVLCVARNVACNVRGGFFKEYDEGL 1567 DDRL VWD KD+ RVHVVPAFAPYYAPQAE +++ VLCVARNVACNVRGGFF+E+DEGL Sbjct: 362 DDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGL 421 Query: 1566 LPRIAAAYYEDEMRDLPSAPDVSNYLIPEEDTSTTNGNADPLCFDGMVDAEVERRLKEAS 1387 L RI YED+++D+PS PDV NYL+ E+DTS NGN DPL FDGM DAEVERRLKEA Sbjct: 422 LQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERRLKEAI 481 Query: 1386 CNSQAVPPMFNNFNQMQMPSVHPVMP-SSYGXXXXXXXXXXXPYSNNPFPQAIVSVNPVG 1210 + V N + PS+ MP SS +SN FP A V PV Sbjct: 482 SATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPVVKPVA 541 Query: 1209 QPSVSEHSFQGSPVREEGEVNESELDPDTRRRLLILQHGQDIRGPPPYQ-----LRPHLE 1045 +V E S Q SP REEGEV ESELDPDTRRRLLILQHGQD R P + +RP ++ Sbjct: 542 PVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPVRPTMQ 601 Query: 1044 VXXXXXXXXXXXXXXXXEVKPKQLNEVS-REYHLQPETTRHQRSHLSSFFSGEKDSNPTD 868 V E+ P+QLN + +E+ L E ++ FF + S P+D Sbjct: 602 VSVPRGQSRGSWFAAEEEMSPRQLNRAAPKEFPLDSERMHIEKHRHPPFFPKVESSIPSD 661 Query: 867 RVNHRNKKLSTEVQSGSDNLRRNRSGSNSLK---DDMSRHSISTRNRDERFKAGHVIEFS 697 
R+ N++LS E D L N + S+ ++M S+ +RD F++G + Sbjct: 662 RLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFESGRTVTSG 721 Query: 696 KDPVEVLQGIAAASGAKVEYRTALLNTIELQHSVEAWFVGEKIGEGTGRTRKEAHKLAVD 517 + VLQ IA GAKVE+R AL+ +++LQ S+EAWF GEK+GEG GRTR+EA + A + Sbjct: 722 ETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFAGEKVGEGVGRTRREAQRQAAE 781 Query: 516 KAIENLANNYLNDSSKPDS-FRDREISHTKKI-DFLRNSNLSTF-------SMSDPLSNT 364 ++I+NLAN YL+ KPDS + ++S I D SN+++F S S Sbjct: 782 ESIKNLANTYLS-RIKPDSGSAEGDLSRLHNINDNGFPSNVNSFGNQLLAKEESLSFSTA 840 Query: 363 TEDSRSLNHRLEGSIKTSDSVATLKELCTSKGFSLAFKADQCHPASSDDKKEVYAQVEVA 184 +E SR + RLEGS K+ SV LKELC +G + F+ +++ K EVYAQVE+ Sbjct: 841 SEQSRLADPRLEGSKKSMGSVTALKELCMMEGLGVVFQPQPPSSSNALQKDEVYAQVEID 900 Query: 183 GQILGKGTGVTWNVAKSLAAEEAVANLKSMLGQFTQKYINSPR 55 GQ+LGKGTG+TW AK AAE+A+ +L+SMLGQ++QK SPR Sbjct: 901 GQVLGKGTGLTWEEAKMQAAEKALGSLRSMLGQYSQKRQGSPR 943 >ref|XP_011027882.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 [Populus euphratica] gi|743847022|ref|XP_011027883.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 [Populus euphratica] Length = 996 Score = 904 bits (2337), Expect = 0.0 Identities = 512/990 (51%), Positives = 636/990 (64%), Gaps = 75/990 (7%) Frame = -1 Query: 2757 MLRSLVYKENSFIGEAEIRPQ----------NNKSGVW---TREIRISHLSQASDRCPPL 2617 M +S+ YK + +GE EI Q N K V +EIRISH SQ S+RCPPL Sbjct: 1 MYKSVAYKGDELLGEVEIYAQEQQQEEEENKNKKKRVIDEIVKEIRISHFSQTSERCPPL 60 Query: 2616 AVLYTIAADGFCFKMEXXXXXXXXXXXXL-------HAACLREKKTAVVPLGNEELHLVA 2458 AVL+TI + G CFKME H++C++E KTAV+ LG EELHLVA Sbjct: 61 AVLHTITSIGVCFKMEESTSSSTTKISQQESPLHLLHSSCIQENKTAVMHLGGEELHLVA 120 Query: 2457 MPSRTNPLQYSCFWGFTVASGLYNSCLLMLNLRCLGIVFDLDETLVVANTIRSFEDRISN 2278 M SR+N Q+ CFWGF+VA GLY+SCL+MLNLRCLGIVFDLDETL+VANT+RSFEDRI Sbjct: 121 MLSRSNEKQHPCFWGFSVAPGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDA 180 Query: 2277 LQQKISNETEHQRLATMLAEVKRYQDDKLILKQFIENDQVVENGKVFKVQTEVVRPLSES 2098 LQ+KIS E + QR+ ML+EVKRYQDDK ILKQ++ENDQVVENGKV K Q+EVV LS++ Sbjct: 181 
LQRKISTELDPQRILGMLSEVKRYQDDKNILKQYVENDQVVENGKVIKTQSEVVPALSDN 240 Query: 2097 HQQINRPVIRLPEKNIILTRVNPTIRDTSVLVRVRPAWEDLRSYLTARGRKRFEVYVCTM 1918 HQ + RP+IRL EKNIILTR+NP IRDTSVLVR+RPAWEDLRSYLTARGRKRFEVYVCTM Sbjct: 241 HQPMVRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTM 300 Query: 1917 AERDYALEIWRLLDPESSLINSDKLLNRIVCVKSELKKSLLNVFQDGKCHPKMALVIDDR 1738 AERDYALE+WRLLDPES+LINS +LL+RIVCVKS L+KSL NVFQDG CHPKMALVIDDR Sbjct: 301 AERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLRKSLFNVFQDGICHPKMALVIDDR 360 Query: 1737 LTVWDLKDKRRVHVVPAFAPYYAPQAEVASSVAVLCVARNVACNVRGGFFKEYDEGLLPR 1558 L VWD +D+ RVHVVPAFAPYYAPQAEV ++V VLCVARNVACNVRGGFFKE+DEGLL + Sbjct: 361 LKVWDERDQSRVHVVPAFAPYYAPQAEVNNAVPVLCVARNVACNVRGGFFKEFDEGLLQK 420 Query: 1557 IAAAYYEDEMRDLPSAPDVSNYLIPEEDTSTTNGNADPLCFDGMVDAEVERRLKEASCNS 1378 I YED+ ++PS PDVSNYL+ E+D S NGN D L FDGM DAEVER+LKEA +S Sbjct: 421 IPEVAYEDDTDNIPSPPDVSNYLVSEDDASAVNGNRDQLSFDGMADAEVERQLKEAVSSS 480 Query: 1377 QAV------------PPMFNNF--------------------NQMQMPSVHPVMPSSYGX 1294 A+ P + + +Q MP++ P P S Sbjct: 481 SAILSTIPSTVSSLDPRLLQSLQYTIASSSSSMPTSQPSMLASQQPMPALQPPKPPS--- 537 Query: 1293 XXXXXXXXXXPYSNNPFPQAIVSVNPVGQPSVSEHSFQGSPVREEGEVNESELDPDTRRR 1114 P+ N FPQ S+ +GQ E S Q SP REEGEV ESELDPDTRRR Sbjct: 538 -----QLSMTPFPNTQFPQVAPSIKQLGQVVPPEPSLQSSPAREEGEVPESELDPDTRRR 592 Query: 1113 LLILQHGQDIR----GPPPYQLRPHLEVXXXXXXXXXXXXXXXXEVKPKQLNEVSREYHL 946 LLILQHG D R P+ RP +V E+ P+QLN RE+ L Sbjct: 593 LLILQHGHDSRDNAPSESPFPARPSTQVAAPRVQSVGSWVPVEEEMSPRQLNRTPREFPL 652 Query: 945 QPE--TTRHQRSHLSSFFSGEKDSNPTDRVNHRNKKLSTEVQSGSDNLRRNRSGSN--SL 778 + R H SFF + + P+DR+ H N++L E D ++ N S SN S Sbjct: 653 DSDLMNIEKHRPHHPSFFHKVESNIPSDRMIHENQRLPKEATYRDDRMKLNHSTSNYPSF 712 Query: 777 KDDMSRHSISTRNRDERFKAGHVIEFSKDPVEVLQGIAAASGAKVEYRTALLNTIELQHS 598 + + S S S+ NRD ++ ++ P EVLQ IA G KVE+R+AL+ T +LQ S Sbjct: 713 QGEESPLSRSSSNRDLDLESERAFSSTETPAEVLQEIAMKCGTKVEFRSALIATSDLQFS 772 Query: 597 VEAWFVGEKIGEGTGRTRKEAHKLAVDKAIENLANNY-----------LNDSSKPDSFRD 451 +E WF+GEK+GEGTG+TR+EA + A + +I+ LA Y L 
DSS+ S D Sbjct: 773 IETWFLGEKVGEGTGKTRREAQRQAAEGSIKKLAGIYMSRSKPDSGPMLGDSSRYPSAND 832 Query: 450 R----EISHTKKIDFLRNSNLSTFSMSDPLSNTTEDSRSLNHRLEGSIKTSDSVATLKEL 283 +++ L++ N++ S T+E SR L+ RLEGS K+ SV LKE Sbjct: 833 NGFLGDMNSFGNQPLLKDENIT-------YSATSEPSRLLDQRLEGSKKSMGSVTALKEF 885 Query: 282 CTSKGFSLAFKADQCHPASSDDKKEVYAQVEVAGQILGKGTGVTWNVAKSLAAEEAVANL 103 C ++G + F A +S +EV+AQVE+ GQ+LGKG G+TW+ AK AAE+A+ +L Sbjct: 886 CMTEGLGVNFLAQTPLSTNSIPGEEVHAQVEIDGQVLGKGIGLTWDEAKMQAAEKALGSL 945 Query: 102 KSMLGQFTQKYINSPRLLQTAVEKSWRTDF 13 ++M GQ+T K SPRL+Q K + +F Sbjct: 946 RTMFGQYTPKRQGSPRLMQGMPNKRLKQEF 975 >ref|XP_002267987.3| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 [Vitis vinifera] Length = 935 Score = 902 bits (2331), Expect = 0.0 Identities = 504/931 (54%), Positives = 630/931 (67%), Gaps = 13/931 (1%) Frame = -1 Query: 2757 MLRSLVYKENSFIGEAEIRPQNNKSGVWTREIRISHLSQASDRCPPLAVLYTIAADGFCF 2578 M +S+VY+ + +GE EI PQN + +EIRISH SQ S+RCPPLAVL+TI + G CF Sbjct: 1 MYKSIVYEGDDVVGEVEIYPQNQGLELM-KEIRISHYSQPSERCPPLAVLHTITSCGVCF 59 Query: 2577 KMEXXXXXXXXXXXXL-HAACLREKKTAVVPLGNEELHLVAMPSRTNPLQYSCFWGFTVA 2401 KME L H+ C+RE KTAV+ LG EELHLVAM S+ QY CFWGF VA Sbjct: 60 KMESSKAQSQDTPLYLLHSTCIRENKTAVMSLGEEELHLVAMYSKKKDGQYPCFWGFNVA 119 Query: 2400 SGLYNSCLLMLNLRCLGIVFDLDETLVVANTIRSFEDRISNLQQKISNETEHQRLATMLA 2221 GLY+SCL+MLNLRCLGIVFDLDETL+VANT+RSFEDRI LQ+KI+ E + QR++ M A Sbjct: 120 LGLYSSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEVDPQRISGMAA 179 Query: 2220 EVKRYQDDKLILKQFIENDQVVENGKVFKVQTEVVRPLSESHQQINRPVIRLPEKNIILT 2041 EV+RYQDD+ ILKQ+ ENDQVVENGK+FK Q E+V LS++HQ I RP+IRL EKNIILT Sbjct: 180 EVRRYQDDRNILKQYAENDQVVENGKLFKTQPEIVPALSDNHQPIVRPLIRLQEKNIILT 239 Query: 2040 RVNPTIRDTSVLVRVRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEIWRLLDPESSL 1861 R+NP IRDTSVLVR+RPAWEDLRSYLTARGRKRFEVYVCTMAERDYALE+WRLLDPES+L Sbjct: 240 RINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESNL 299 Query: 1860 INSDKLLNRIVCVKSELKKSLLNVFQDGKCHPKMALVIDDRLTVWDLKDKRRVHVVPAFA 1681 INS +LL+RIVCVKS +KSL NVFQDG 
CHPKMALVIDDRL VWD KD+ RVHVVPAFA Sbjct: 300 INSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVIDDRLKVWDEKDQPRVHVVPAFA 359 Query: 1680 PYYAPQAEVASSVAVLCVARNVACNVRGGFFKEYDEGLLPRIAAAYYEDEMRDLPSAPDV 1501 PYYAPQAE ++++VLCVARNVACNVRGGFFKE+DEGLL RI YED+++D+ SAPDV Sbjct: 360 PYYAPQAEANNAISVLCVARNVACNVRGGFFKEFDEGLLQRIPEISYEDDIKDIRSAPDV 419 Query: 1500 SNYLIPEEDTSTTNGNADPLCFDGMVDAEVERRLKEASCNSQAVPPMFNNFNQMQMPSVH 1321 SNYL+ E+D S +NGN D CFDGM D EVER+LK+A + P + + P + Sbjct: 420 SNYLVSEDDASVSNGNRDQPCFDGMADVEVERKLKDAI----SAPSTVTSLDPRLSPPLQ 475 Query: 1320 PVMPSSYG-XXXXXXXXXXXPYSNNPFPQAIVSVNPVGQPSVSEHSFQGSPVREEGEVNE 1144 + +S G P+SN FPQ+ + P+ E + Q SP REEGEV E Sbjct: 476 FAVAASSGLAPQPAAQGSIMPFSNKQFPQSASLIKPL----APEPTMQSSPAREEGEVPE 531 Query: 1143 SELDPDTRRRLLILQHGQDIR----GPPPYQLRPHLEVXXXXXXXXXXXXXXXXEVKPKQ 976 SELDPDTRRRLLILQHGQD R PP+ +RP ++V E+ P+Q Sbjct: 532 SELDPDTRRRLLILQHGQDTREHASSDPPFPVRPPIQVSVPRVQSRGSWFPADEEMSPRQ 591 Query: 975 LNE-VSREYHLQPET--TRHQRSHLSSFFSGEKDSNPTDRVNHRNKKLSTEVQSGSDNLR 805 LN V +E+ L +T R H SFF + S +DR+ H N++LS EV D LR Sbjct: 592 LNRAVPKEFPLDSDTMHIEKHRPHHPSFFHKVESSASSDRILHENQRLSKEVLHRDDRLR 651 Query: 804 RNRS--GSNSLKDDMSRHSISTRNRDERFKAGHVIEFSKDPVEVLQGIAAASGAKVEYRT 631 N S G +S + S+ NRD F++G +++ P LQ IA G K+E+R Sbjct: 652 LNHSLPGYHSFSGEEVPLGRSSSNRDLDFESGRGAPYAETPAVGLQEIAMKCGTKLEFRP 711 Query: 630 ALLNTIELQHSVEAWFVGEKIGEGTGRTRKEAHKLAVDKAIENLANNYLNDSSK--PDSF 457 +L+ ELQ S+E WF GEKIGEGTG+TR+EA A + ++ L+ YL+ P++ Sbjct: 712 SLVAATELQFSIEVWFAGEKIGEGTGKTRREAQCQAAEASLMYLSYRYLHGDVNRFPNAS 771 Query: 456 RDREISHTKKIDFLRNSNLSTFSMSDPLSNTTEDSRSLNHRLEGSIKTSDSVATLKELCT 277 + +S T + S SMS S +E SR L+ RLE S K+ S++ LKELC Sbjct: 772 DNNFMSDTNSFGY--QSFPKEGSMS--FSTASESSRLLDPRLESSKKSMGSISALKELCM 827 Query: 276 SKGFSLAFKADQCHPASSDDKKEVYAQVEVAGQILGKGTGVTWNVAKSLAAEEAVANLKS 97 +G + F + ++S K+E+ AQVE+ GQ+LGKGTG TW+ AK AAE+A+ +LKS Sbjct: 828 MEGLGVEFLSQPPLSSNSTQKEEICAQVEIDGQVLGKGTGSTWDDAKMQAAEKALGSLKS 887 Query: 96 MLGQFTQKYINSPRLLQTAVEKSWRTDFR*G 4 MLGQF+QK SPR LQ + K +++F G Sbjct: 888 
MLGQFSQKRQGSPRSLQ-GMGKRLKSEFTRG 917 >ref|XP_012091568.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 [Jatropha curcas] gi|802784113|ref|XP_012091569.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1 [Jatropha curcas] Length = 976 Score = 902 bits (2330), Expect = 0.0 Identities = 505/950 (53%), Positives = 629/950 (66%), Gaps = 35/950 (3%) Frame = -1 Query: 2757 MLRSLVYKENSFIGEAEIRPQ--------NNKSGV-----WTREIRISHLSQASDRCPPL 2617 M +S VYK +GE EI PQ NNK + +EIRISH SQ S+RCPPL Sbjct: 7 MYKSAVYKGEELLGEVEIYPQQHQQQEEENNKKKLIDEILMGKEIRISHFSQPSERCPPL 66 Query: 2616 AVLYTIAADGFCFKMEXXXXXXXXXXXXL-HAACLREKKTAVVPLGNEELHLVAMPSRTN 2440 AVL+TI G CFKME L H++C++E KTAVVPLG EELHLVA+ SR N Sbjct: 67 AVLHTITC-GMCFKMESKNSLSLDTPLHLLHSSCIQENKTAVVPLGGEELHLVAIYSRNN 125 Query: 2439 PLQYSCFWGFTVASGLYNSCLLMLNLRCLGIVFDLDETLVVANTIRSFEDRISNLQQKIS 2260 QY CFWGF V++GLYNSCL+MLNLRCLGIVFDLDETL+VANT+RSFEDRI LQ+KI+ Sbjct: 126 ERQYPCFWGFNVSAGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKIN 185 Query: 2259 NETEHQRLATMLAEVKRYQDDKLILKQFIENDQVVENGKVFKVQTEVVRPLSESHQQINR 2080 E + QR+A ML+EVKRYQDDK ILKQ++ENDQV+ENG+V K Q EVV LS++HQ I R Sbjct: 186 TEVDPQRIAGMLSEVKRYQDDKTILKQYVENDQVIENGRVIKTQFEVVPALSDNHQTIVR 245 Query: 2079 PVIRLPEKNIILTRVNPTIRDTSVLVRVRPAWEDLRSYLTARGRKRFEVYVCTMAERDYA 1900 P+IRL E+NIILTR+NP IRDTSVLVR+RPAWEDLRSYLTARGRKRFEVYVCTMAERDYA Sbjct: 246 PLIRLQERNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYA 305 Query: 1899 LEIWRLLDPESSLINSDKLLNRIVCVKSELKKSLLNVFQDGKCHPKMALVIDDRLTVWDL 1720 LE+WRLLDPES+LI+S +LL+RIVCVKS L+KSL NVFQDG CHPKMALVIDDRL VWD Sbjct: 306 LEMWRLLDPESNLISSKELLDRIVCVKSGLRKSLFNVFQDGVCHPKMALVIDDRLKVWDE 365 Query: 1719 KDKRRVHVVPAFAPYYAPQAEVASSVAVLCVARNVACNVRGGFFKEYDEGLLPRIAAAYY 1540 KD+ RVHVVPAFAPYYAPQAE ++V VLCVARNVACNVRGGFFKE+DEGLL RI Y Sbjct: 366 KDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDEGLLQRIPDISY 425 Query: 1539 EDEMRDLPSAPDVSNYLIPEEDTSTTNGNADPLCFDGMVDAEVERRLKEASCNSQAVPPM 1360 ED+ D+PS PDVS+YLI E+D ST+NG+ DPL FDGM DAEVE+RLKEA + P Sbjct: 426 
EDDFNDIPSPPDVSSYLISEDDASTSNGHRDPLSFDGMADAEVEKRLKEAISAASLFPAT 485 Query: 1359 FNNFNQMQMPSV-HPVMPSSYGXXXXXXXXXXXPYSNNPFPQAIVSVNPVGQPSVSEHSF 1183 NN + +P++ + + SS P+SN FPQA V P+ Q E S Sbjct: 486 VNNLDPRVIPALQYSLASSSSSIPVSTSQPLVMPFSNIQFPQAASLVKPLAQVGPPEPSL 545 Query: 1182 QGSPVREEGEVNESELDPDTRRRLLILQHGQDIR----GPPPYQLRPHLEVXXXXXXXXX 1015 Q SP REEGEV ESELDPDTRRRLLILQHGQD R +RP ++V Sbjct: 546 QSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDNVSSESQIPVRPSMQVSVPRVQSRG 605 Query: 1014 XXXXXXXEVKPKQLN-EVSREY--HLQPETTRHQRSHLSSFFSGEKDSNPTDRVN--HRN 850 E+ P+QLN V RE+ L+P + H SFF ++ +DR+ + N Sbjct: 606 SWVPVEEEMSPRQLNLTVPREFPLELEPMHIEKHQPHHPSFFPKVENPISSDRMGMVNEN 665 Query: 849 KKLSTEVQSGSDNLRRNRSGSN---SLKDDMSRHSISTRNRDERFKAGHVIEFSKDPVEV 679 +L D LR N + +N +++ S+ NRD F++ + ++ PVE Sbjct: 666 LRLPKAAPYRDDRLRSNHTMANYHPLSGEEIPLSRSSSSNRDPDFESERAVSSAETPVEA 725 Query: 678 LQGIAAASGAKVEYRTALLNTIELQHSVEAWFVGEKIGEGTGRTRKEAHKLAVDKAIENL 499 LQ IA GAKVE+R +L+++ +LQ S EAWF GE++GEG G+TR+EA +LA + +I+NL Sbjct: 726 LQEIAMKCGAKVEFRASLVDSRDLQFSTEAWFAGERVGEGIGKTRREAQRLAAESSIKNL 785 Query: 498 ANNYL------NDSSKPDSFRDREISHTKKIDFLRNSNLSTFSMSDPLSNT--TEDSRSL 343 AN Y+ N + D+ R + + + + +P+S++ +E R Sbjct: 786 ANIYMQRAKPDNGAMHGDASRYSSANDNGYLGNVNSFGSQPLPKDEPVSSSAASEQLRLP 845 Query: 342 NHRLEGSIKTSDSVATLKELCTSKGFSLAFKADQCHPASSDDKKEVYAQVEVAGQILGKG 163 + RL+ S K SV LKE C +G L F + ++S K EVYAQVE+ GQ++GKG Sbjct: 846 DPRLDSSKKAVGSVTALKEFCMMEGLGLNFLSPTPLSSNSLQKDEVYAQVEIDGQVMGKG 905 Query: 162 TGVTWNVAKSLAAEEAVANLKSMLGQFTQKYINSPRLLQTAVEKSWRTDF 13 G TW+ AK AAE A+ +L++M GQFT K SPR Q K + +F Sbjct: 906 IGSTWDEAKMQAAERALGSLRTMFGQFTPKRQGSPRPTQGMSNKRLKPEF 955 >gb|KDP20941.1| hypothetical protein JCGZ_21412 [Jatropha curcas] Length = 970 Score = 902 bits (2330), Expect = 0.0 Identities = 505/950 (53%), Positives = 629/950 (66%), Gaps = 35/950 (3%) Frame = -1 Query: 2757 MLRSLVYKENSFIGEAEIRPQ--------NNKSGV-----WTREIRISHLSQASDRCPPL 2617 M +S VYK +GE EI PQ NNK + +EIRISH SQ S+RCPPL Sbjct: 1 
MYKSAVYKGEELLGEVEIYPQQHQQQEEENNKKKLIDEILMGKEIRISHFSQPSERCPPL 60 Query: 2616 AVLYTIAADGFCFKMEXXXXXXXXXXXXL-HAACLREKKTAVVPLGNEELHLVAMPSRTN 2440 AVL+TI G CFKME L H++C++E KTAVVPLG EELHLVA+ SR N Sbjct: 61 AVLHTITC-GMCFKMESKNSLSLDTPLHLLHSSCIQENKTAVVPLGGEELHLVAIYSRNN 119 Query: 2439 PLQYSCFWGFTVASGLYNSCLLMLNLRCLGIVFDLDETLVVANTIRSFEDRISNLQQKIS 2260 QY CFWGF V++GLYNSCL+MLNLRCLGIVFDLDETL+VANT+RSFEDRI LQ+KI+ Sbjct: 120 ERQYPCFWGFNVSAGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKIN 179 Query: 2259 NETEHQRLATMLAEVKRYQDDKLILKQFIENDQVVENGKVFKVQTEVVRPLSESHQQINR 2080 E + QR+A ML+EVKRYQDDK ILKQ++ENDQV+ENG+V K Q EVV LS++HQ I R Sbjct: 180 TEVDPQRIAGMLSEVKRYQDDKTILKQYVENDQVIENGRVIKTQFEVVPALSDNHQTIVR 239 Query: 2079 PVIRLPEKNIILTRVNPTIRDTSVLVRVRPAWEDLRSYLTARGRKRFEVYVCTMAERDYA 1900 P+IRL E+NIILTR+NP IRDTSVLVR+RPAWEDLRSYLTARGRKRFEVYVCTMAERDYA Sbjct: 240 PLIRLQERNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYA 299 Query: 1899 LEIWRLLDPESSLINSDKLLNRIVCVKSELKKSLLNVFQDGKCHPKMALVIDDRLTVWDL 1720 LE+WRLLDPES+LI+S +LL+RIVCVKS L+KSL NVFQDG CHPKMALVIDDRL VWD Sbjct: 300 LEMWRLLDPESNLISSKELLDRIVCVKSGLRKSLFNVFQDGVCHPKMALVIDDRLKVWDE 359 Query: 1719 KDKRRVHVVPAFAPYYAPQAEVASSVAVLCVARNVACNVRGGFFKEYDEGLLPRIAAAYY 1540 KD+ RVHVVPAFAPYYAPQAE ++V VLCVARNVACNVRGGFFKE+DEGLL RI Y Sbjct: 360 KDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDEGLLQRIPDISY 419 Query: 1539 EDEMRDLPSAPDVSNYLIPEEDTSTTNGNADPLCFDGMVDAEVERRLKEASCNSQAVPPM 1360 ED+ D+PS PDVS+YLI E+D ST+NG+ DPL FDGM DAEVE+RLKEA + P Sbjct: 420 EDDFNDIPSPPDVSSYLISEDDASTSNGHRDPLSFDGMADAEVEKRLKEAISAASLFPAT 479 Query: 1359 FNNFNQMQMPSV-HPVMPSSYGXXXXXXXXXXXPYSNNPFPQAIVSVNPVGQPSVSEHSF 1183 NN + +P++ + + SS P+SN FPQA V P+ Q E S Sbjct: 480 VNNLDPRVIPALQYSLASSSSSIPVSTSQPLVMPFSNIQFPQAASLVKPLAQVGPPEPSL 539 Query: 1182 QGSPVREEGEVNESELDPDTRRRLLILQHGQDIR----GPPPYQLRPHLEVXXXXXXXXX 1015 Q SP REEGEV ESELDPDTRRRLLILQHGQD R +RP ++V Sbjct: 540 QSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDNVSSESQIPVRPSMQVSVPRVQSRG 599 Query: 1014 XXXXXXXEVKPKQLN-EVSREY--HLQPETTRHQRSHLSSFFSGEKDSNPTDRVN--HRN 
850 E+ P+QLN V RE+ L+P + H SFF ++ +DR+ + N Sbjct: 600 SWVPVEEEMSPRQLNLTVPREFPLELEPMHIEKHQPHHPSFFPKVENPISSDRMGMVNEN 659 Query: 849 KKLSTEVQSGSDNLRRNRSGSN---SLKDDMSRHSISTRNRDERFKAGHVIEFSKDPVEV 679 +L D LR N + +N +++ S+ NRD F++ + ++ PVE Sbjct: 660 LRLPKAAPYRDDRLRSNHTMANYHPLSGEEIPLSRSSSSNRDPDFESERAVSSAETPVEA 719 Query: 678 LQGIAAASGAKVEYRTALLNTIELQHSVEAWFVGEKIGEGTGRTRKEAHKLAVDKAIENL 499 LQ IA GAKVE+R +L+++ +LQ S EAWF GE++GEG G+TR+EA +LA + +I+NL Sbjct: 720 LQEIAMKCGAKVEFRASLVDSRDLQFSTEAWFAGERVGEGIGKTRREAQRLAAESSIKNL 779 Query: 498 ANNYL------NDSSKPDSFRDREISHTKKIDFLRNSNLSTFSMSDPLSNT--TEDSRSL 343 AN Y+ N + D+ R + + + + +P+S++ +E R Sbjct: 780 ANIYMQRAKPDNGAMHGDASRYSSANDNGYLGNVNSFGSQPLPKDEPVSSSAASEQLRLP 839 Query: 342 NHRLEGSIKTSDSVATLKELCTSKGFSLAFKADQCHPASSDDKKEVYAQVEVAGQILGKG 163 + RL+ S K SV LKE C +G L F + ++S K EVYAQVE+ GQ++GKG Sbjct: 840 DPRLDSSKKAVGSVTALKEFCMMEGLGLNFLSPTPLSSNSLQKDEVYAQVEIDGQVMGKG 899 Query: 162 TGVTWNVAKSLAAEEAVANLKSMLGQFTQKYINSPRLLQTAVEKSWRTDF 13 G TW+ AK AAE A+ +L++M GQFT K SPR Q K + +F Sbjct: 900 IGSTWDEAKMQAAERALGSLRTMFGQFTPKRQGSPRPTQGMSNKRLKPEF 949 >ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like [Citrus sinensis] gi|641857111|gb|KDO75877.1| hypothetical protein CISIN_1g002166mg [Citrus sinensis] Length = 957 Score = 898 bits (2321), Expect = 0.0 Identities = 503/939 (53%), Positives = 620/939 (66%), Gaps = 24/939 (2%) Frame = -1 Query: 2757 MLRSLVYKENSFIGEAEIRPQNNKSGVWTRE--------IRISHLSQASDRCPPLAVLYT 2602 M +++ Y +GE EI PQ G E IRIS+ S+AS+RCPPLAVL+T Sbjct: 1 MYKTVAYLGKEILGEVEIYPQQQGEGGEGEEKNKKVFDEIRISYFSEASERCPPLAVLHT 60 Query: 2601 IAADGFCFKMEXXXXXXXXXXXXLHAACLREKKTAVVPLG-NEELHLVAMPSRTNPLQYS 2425 I A G CFKME H++C+RE KTAV+PLG EELHLVAM SR N QY Sbjct: 61 ITASGICFKMESKSSDNIQLHLL-HSSCIRENKTAVMPLGLTEELHLVAMYSRNNEKQYP 119 Query: 2424 CFWGFTVASGLYNSCLLMLNLRCLGIVFDLDETLVVANTIRSFEDRISNLQQKISNETEH 2245 CFW F+V SGLYNSCL MLNLRCLGIVFDLDETL+VANT+RSFEDRI L +KIS E + Sbjct: 120 
CFWAFSVGSGLYNSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKISTEVDP 179 Query: 2244 QRLATMLAEVKRYQDDKLILKQFIENDQVVENGKVFKVQTEVVRPLSESHQQINRPVIRL 2065 QR+A M AEVKRYQDDK ILKQ+ ENDQV ENGKV KVQ+EVV LS+SHQ + RP+IRL Sbjct: 180 QRIAGMQAEVKRYQDDKNILKQYAENDQVNENGKVIKVQSEVVPALSDSHQALVRPLIRL 239 Query: 2064 PEKNIILTRVNPTIRDTSVLVRVRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEIWR 1885 EKNIILTR+NP IRDTSVLVR+RPAWEDLRSYLTARGRKRFEVYVCTMAERDYALE+WR Sbjct: 240 QEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWR 299 Query: 1884 LLDPESSLINSDKLLNRIVCVKSELKKSLLNVFQDGKCHPKMALVIDDRLTVWDLKDKRR 1705 LLDPES+LIN+ +LL+RIVCVKS +KSL NVFQDG CHPKMALVIDDRL VWD KD+ R Sbjct: 300 LLDPESNLINTKELLDRIVCVKSGSRKSLFNVFQDGTCHPKMALVIDDRLKVWDDKDQPR 359 Query: 1704 VHVVPAFAPYYAPQAEVASSVAVLCVARNVACNVRGGFFKEYDEGLLPRIAAAYYEDEMR 1525 VHVVPAFAPYYAPQAE +++ VLCVARN+ACNVRGGFFKE+DEGLL RI YED+++ Sbjct: 360 VHVVPAFAPYYAPQAEANNAIPVLCVARNIACNVRGGFFKEFDEGLLQRIPEISYEDDVK 419 Query: 1524 DLPSAPDVSNYLIPEEDTSTTNGNADPLCFDGMVDAEVERRLKEASCNSQAVPPMFNNFN 1345 D+PS PDVSNYL+ E+D +T NG DPL FDGM DAEVERRLKEA S + N + Sbjct: 420 DIPSPPDVSNYLVSEDDAATANGIKDPLSFDGMADAEVERRLKEAIAASATISSAVANLD 479 Query: 1344 QMQMPSVHPVMPSSYGXXXXXXXXXXXPYSNNPFPQAIVSVNPVGQPSVSEHSFQGSPVR 1165 P + + SS P +N FP A V P+G E S Q SP R Sbjct: 480 PRLAPFQYTMPSSSSTTTLPTSQAAVMPLANMQFPPATSLVKPLGHVGPPEQSLQSSPAR 539 Query: 1164 EEGEVNESELDPDTRRRLLILQHGQDIR----GPPPYQLRPHLEVXXXXXXXXXXXXXXX 997 EEGEV ESELDPDTRRRLLILQHG D R P+ R ++V Sbjct: 540 EEGEVPESELDPDTRRRLLILQHGMDTRENAPSEAPFPARTQMQVSVPRVPSRGSWFPVE 599 Query: 996 XEVKPKQLNE-VSREYHLQPET---TRHQRSHLSSFFSGEKDSNPTDRVNHRNKKLSTEV 829 E+ P+QLN V +E+ L E +H+ H SFF ++ + +DR H N+++ E Sbjct: 600 EEMSPRQLNRAVPKEFPLNSEAMQIEKHRPPH-PSFFPKIENPSTSDR-PHENQRMPKEA 657 Query: 828 QSGSDNLRRNRSGSNSLK---DDMSRHSISTRNRDERFKAGHVIEFSKDPVEVLQGIAAA 658 D LR N + S+ +++ S+ +RD F++G + ++ P VLQ IA Sbjct: 658 LRRDDRLRLNHTLSDYQSFSGEEIPLSRSSSSSRDVDFESGRDVSSTETPSGVLQDIAMK 717 Query: 657 SGAKVEYRTALLNTIELQHSVEAWFVGEKIGEGTGRTRKEAHKLAVDKAIENLANNYL-- 484 G KVE+R AL+ + ELQ S+EAWF 
GEKIGEG GRTR+EA + A + +I++LAN Y+ Sbjct: 718 CGTKVEFRPALVASTELQFSIEAWFAGEKIGEGIGRTRREAQRQAAEGSIKHLANVYMLR 777 Query: 483 --NDSSKPDSFRDREISHTKKIDFLRNSNLSTFSMSDPLSNTTEDSRSLNHRLEGSIKTS 310 +DS R + + ++ ++ S ++E S+ ++ RLEGS K Sbjct: 778 VKSDSGSGHGDGSRFSNANENCFMGEINSFGGQPLAKDESLSSEPSKLVDPRLEGSKKLM 837 Query: 309 DSVATLKELCTSKGFSLAFKADQCHPASSDDKKEVYAQVEVAGQILGKGTGVTWNVAKSL 130 SV+ LKELC ++G + F+ A+S K EVYAQVE+ GQ+LGKG G TW+ AK Sbjct: 838 GSVSALKELCMTEGLGVVFQQQPPSSANSVQKDEVYAQVEIDGQVLGKGIGSTWDEAKMQ 897 Query: 129 AAEEAVANLKSMLGQFTQKYINSPRLLQTAVEKSWRTDF 13 AAE+A+ +L+SM GQF QK+ SPR LQ K + +F Sbjct: 898 AAEKALGSLRSMFGQFPQKHQGSPRSLQGMPNKRLKPEF 936