BLASTX nr result
ID: Papaver31_contig00043238
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Papaver31_contig00043238
         (1621 letters)

Database: ./nr
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010645384.1| PREDICTED: RNA polymerase II C-terminal doma...   495   e-137
ref|XP_010647279.1| PREDICTED: RNA polymerase II C-terminal doma...   494   e-137
ref|XP_010265619.1| PREDICTED: RNA polymerase II C-terminal doma...   490   e-135
ref|XP_010265618.1| PREDICTED: RNA polymerase II C-terminal doma...   486   e-134
ref|XP_008242970.1| PREDICTED: RNA polymerase II C-terminal doma...   477   e-131
ref|XP_007204345.1| hypothetical protein PRUPE_ppa005647mg [Prun...   476   e-131
gb|KNA16741.1| hypothetical protein SOVF_085710 isoform B [Spina...   476   e-131
ref|XP_012481529.1| PREDICTED: RNA polymerase II C-terminal doma...   476   e-131
ref|XP_010693335.1| PREDICTED: RNA polymerase II C-terminal doma...   476   e-131
gb|KHG05109.1| RNA polymerase II C-terminal domain phosphatase-l...   476   e-131
ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Popu...   475   e-131
ref|XP_007014445.1| RNA polymerase II ctd phosphatase, putative ...   475   e-131
gb|KNA16740.1| hypothetical protein SOVF_085710 isoform A [Spina...   475   e-131
ref|XP_011018018.1| PREDICTED: RNA polymerase II C-terminal doma...   473   e-130
ref|XP_012078975.1| PREDICTED: RNA polymerase II C-terminal doma...   472   e-130
gb|KNA16742.1| hypothetical protein SOVF_085710 isoform C [Spina...   471   e-130
gb|KJB27893.1| hypothetical protein B456_005G016300 [Gossypium r...   471   e-130
ref|XP_012481530.1| PREDICTED: RNA polymerase II C-terminal doma...
470 e-129 gb|KNA25939.1| hypothetical protein SOVF_002000 isoform B [Spina... 468 e-129 ref|XP_002324546.2| hypothetical protein POPTR_0018s11760g [Popu... 468 e-129 >ref|XP_010645384.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4 [Vitis vinifera] Length = 458 Score = 495 bits (1275), Expect = e-137 Identities = 248/424 (58%), Positives = 312/424 (73%), Gaps = 1/424 (0%) Frame = -2 Query: 1473 DHDGEEEEVNVED-ADFAAKRSKRQRLDTLDDDEGIHCSTSLEPVPQSSDVSVQLEICAH 1297 + + E++E ED +D KR KRQ+++ + E STS + Q+ +V++ + C H Sbjct: 35 EQEAEDDEQEAEDESDSEYKRVKRQKVEEFESIEEHPGSTSDGSLEQNLEVTITKDTCTH 94 Query: 1296 PSLIKGMCVKCGRMVDDVSAVALGYIHKDLKVGSEELARLRNKDLKSLFCKKKXXXXXXX 1117 P + + +C++CG+ ++ S VA GYIHKDL++GS+E+ARLR+ DLK+L KK Sbjct: 95 PGVFRELCIRCGQKMEGGSGVAFGYIHKDLRLGSDEIARLRDTDLKNLLRHKKLYLVLDL 154 Query: 1116 XXXXLNSTHFQHISPDEEYLKSQTDSLQDLPHGNLFQLDSMRMFTKLRPFVRTFLKEASN 937 LNST I+P+E YLK+QTD LQ GNLF L++M M TKLRP+V TFLKEAS Sbjct: 155 DHTLLNSTRLLDITPEELYLKNQTDPLQGGLKGNLFMLNTMHMLTKLRPYVHTFLKEASK 214 Query: 936 MFEMYVYTMGERPYAMEMARLLDPGKIYFDSRVISQADCTQKHQKGLDVVLGAESAVIIL 757 MFEMY+YTMGER YA+EMA+LLDP ++YF SRVISQADCTQ+HQKGLDVVLG ESAV+IL Sbjct: 215 MFEMYIYTMGERSYALEMAKLLDPERVYFSSRVISQADCTQRHQKGLDVVLGQESAVLIL 274 Query: 756 DDTEYVWRQHKENLILMDRYHYFSSSGRSFNLNNRSLSELKRDESEPDGALATILEVLKR 577 DDTE VW++HK+NLILM+RYH+F+SS R F N +SLSELK DESEPDGALAT+L+VL+R Sbjct: 275 DDTESVWQKHKDNLILMERYHFFASSCRQFGFNCKSLSELKSDESEPDGALATVLKVLQR 334 Query: 576 IHXXXXXXXXXXXLMRRDVRQVLKTVRSEVLKGCKLVFSRVWKIGEQVENQRLWEVAEEL 397 IH RDVRQV+K VR EVLKGCK+VFSRV+ Q EN LW +AE+L Sbjct: 335 IHSMFFDPELGDDFSGRDVRQVVKRVRKEVLKGCKIVFSRVFPTRFQAENHHLWRMAEQL 394 Query: 396 GAICCKELDASVTHVVSTDMGTEKSRWAVKYKKFLVHPSWIEAANFLWQRQPEDSFAVND 217 GA C ELD SVTHVVSTD GTEKSRWA++ KKFLVHP WIEAAN+ WQ+QPE++F VN Sbjct: 395 GATCATELDPSVTHVVSTDAGTEKSRWALQEKKFLVHPGWIEAANYFWQKQPEENFPVNQ 454 Query: 216 SKKE 205 K + Sbjct: 455 KKNQ 458 >ref|XP_010647279.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4 [Vitis vinifera] 
Length = 466 Score = 494 bits (1272), Expect = e-137 Identities = 247/424 (58%), Positives = 312/424 (73%), Gaps = 1/424 (0%) Frame = -2 Query: 1473 DHDGEEEEVNVED-ADFAAKRSKRQRLDTLDDDEGIHCSTSLEPVPQSSDVSVQLEICAH 1297 + + E++E ED +D KR KRQ+++ + E STS + Q+ +V++ + C H Sbjct: 43 EQEAEDDEQEAEDESDSEYKRVKRQKVEEFESIEEHPGSTSDGSLEQNLEVTITKDTCTH 102 Query: 1296 PSLIKGMCVKCGRMVDDVSAVALGYIHKDLKVGSEELARLRNKDLKSLFCKKKXXXXXXX 1117 P + + +C++CG+ ++ S VA GYIHKDL++GS+E+ARLR+ DLK+L KK Sbjct: 103 PGVFRELCIRCGQKMEGGSGVAFGYIHKDLRLGSDEIARLRDTDLKNLLRHKKLYLVLDL 162 Query: 1116 XXXXLNSTHFQHISPDEEYLKSQTDSLQDLPHGNLFQLDSMRMFTKLRPFVRTFLKEASN 937 LNST I+P+E YLK+QTD LQ GNLF L++M M TKLRP+V TFLKEAS Sbjct: 163 DHTLLNSTRLLDITPEELYLKNQTDPLQGGLKGNLFMLNTMHMLTKLRPYVHTFLKEASK 222 Query: 936 MFEMYVYTMGERPYAMEMARLLDPGKIYFDSRVISQADCTQKHQKGLDVVLGAESAVIIL 757 MFEMY+YTMGER YA+EMA+LLDP ++YF SRVISQADCTQ+HQKGLDVVLG ESAV+IL Sbjct: 223 MFEMYIYTMGERSYALEMAKLLDPERVYFSSRVISQADCTQRHQKGLDVVLGQESAVLIL 282 Query: 756 DDTEYVWRQHKENLILMDRYHYFSSSGRSFNLNNRSLSELKRDESEPDGALATILEVLKR 577 DDTE VW++HK+NLILM+RYH+F+SS R F N +SLSELK DESEPDGALAT+L+VL+R Sbjct: 283 DDTESVWQKHKDNLILMERYHFFASSCRQFGFNCKSLSELKSDESEPDGALATVLKVLQR 342 Query: 576 IHXXXXXXXXXXXLMRRDVRQVLKTVRSEVLKGCKLVFSRVWKIGEQVENQRLWEVAEEL 397 IH RDVRQV+K VR +VLKGCK+VFSRV+ Q EN LW +AE+L Sbjct: 343 IHSMFFDPELGDDFSGRDVRQVVKRVRKDVLKGCKIVFSRVFPTRFQAENHHLWRMAEQL 402 Query: 396 GAICCKELDASVTHVVSTDMGTEKSRWAVKYKKFLVHPSWIEAANFLWQRQPEDSFAVND 217 GA C ELD SVTHVVSTD GTEKSRWA++ KKFLVHP WIEAAN+ WQ+QPE++F VN Sbjct: 403 GATCATELDPSVTHVVSTDAGTEKSRWALQEKKFLVHPGWIEAANYFWQKQPEENFPVNQ 462 Query: 216 SKKE 205 K + Sbjct: 463 KKNQ 466 >ref|XP_010265619.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4 isoform X2 [Nelumbo nucifera] Length = 449 Score = 490 bits (1261), Expect = e-135 Identities = 256/450 (56%), Positives = 323/450 (71%), Gaps = 2/450 (0%) Frame = -2 Query: 1554 FAALLNEELAGIXXXXXXXXXXXXXDQDHDGEEEEVNVEDADFAAKRSKRQRLDTLDDDE 1375 FAALL+ EL + D G +E+ +D DF +R K++++D L++ E 
Sbjct: 17 FAALLDAELDTVS-------------SDASGGQED---DDEDFNIERIKKRKVDELENVE 60 Query: 1374 GIHCSTSLEPVPQSSDVSVQLEICA-HPSLIKGMCVKCGRMVDDVSAVALGYIHKDLKVG 1198 + STS+ + Q D S + +IC HP I+ MC++CG+ +D S VA GYIHKDLK+G Sbjct: 61 DLQGSTSVGALQQELDTSKE-DICPPHPGFIREMCIRCGQRQEDGSGVAFGYIHKDLKLG 119 Query: 1197 SEELARLRNKDLKSLFCKKKXXXXXXXXXXXLNSTHFQHISPDEEYLKSQTDSLQDLPHG 1018 EE+ARLR D K L +K LNST +SP+EEYLK QTDSL D+ +G Sbjct: 120 MEEIARLRGADHKKLLHGRKLYLVLDLDHTLLNSTRLIDLSPEEEYLKGQTDSLNDVLNG 179 Query: 1017 NLFQLDSMRMFTKLRPFVRTFLKEASNMFEMYVYTMGERPYAMEMARLLDPGKIYFDSRV 838 +LF+LDSM M TKLRPFV TFLKEAS+MFEMYVYTM ER YA+E+A+LLDPG +YF SRV Sbjct: 180 SLFRLDSMTMLTKLRPFVHTFLKEASSMFEMYVYTMAERSYALEIAKLLDPGGVYFSSRV 239 Query: 837 ISQADCTQKHQKGLDVVLGAESAVIILDDTEYVWRQHKENLILMDRYHYFSSSGRSFNLN 658 ISQ +CTQ+HQKGLDVVLGAESAV+ILDDTE VW++H+ENLILM+RYHYFSSS R F + Sbjct: 240 ISQDNCTQRHQKGLDVVLGAESAVVILDDTEIVWQKHRENLILMERYHYFSSSCRQFGFS 299 Query: 657 NRSLSELKRDESEPDGALATILEVLKRIHXXXXXXXXXXXLMR-RDVRQVLKTVRSEVLK 481 +SLSELKRDE E +GALAT+L+VLKRIH + RDVRQV+K +R +VLK Sbjct: 300 AKSLSELKRDECESEGALATVLKVLKRIHEMFFNELVFGADLESRDVRQVMKAIRQDVLK 359 Query: 480 GCKLVFSRVWKIGEQVENQRLWEVAEELGAICCKELDASVTHVVSTDMGTEKSRWAVKYK 301 GCK+VFSRV+ EN +LW++AE+LGA C ELD+SVTHVVSTD GTEK+RWAV++K Sbjct: 360 GCKIVFSRVFPTKFHAENHQLWKIAEQLGATCSTELDSSVTHVVSTDTGTEKARWAVQHK 419 Query: 300 KFLVHPSWIEAANFLWQRQPEDSFAVNDSK 211 K LVHP WIEA N+ W+RQ E++FAV +K Sbjct: 420 KHLVHPQWIEATNYFWERQSEENFAVKKNK 449 >ref|XP_010265618.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Nelumbo nucifera] Length = 451 Score = 486 bits (1251), Expect = e-134 Identities = 255/450 (56%), Positives = 323/450 (71%), Gaps = 2/450 (0%) Frame = -2 Query: 1554 FAALLNEELAGIXXXXXXXXXXXXXDQDHDGEEEEVNVEDADFAAKRSKRQRLDTLDDDE 1375 FAALL+ EL + D G +E+ + ED + + R K++++D L++ E Sbjct: 17 FAALLDAELDTVS-------------SDASGGQEDDD-EDFNIESFRIKKRKVDELENVE 62 Query: 1374 GIHCSTSLEPVPQSSDVSVQLEICA-HPSLIKGMCVKCGRMVDDVSAVALGYIHKDLKVG 1198 + STS+ + Q D S + +IC HP I+ MC++CG+ +D 
S VA GYIHKDLK+G Sbjct: 63 DLQGSTSVGALQQELDTSKE-DICPPHPGFIREMCIRCGQRQEDGSGVAFGYIHKDLKLG 121 Query: 1197 SEELARLRNKDLKSLFCKKKXXXXXXXXXXXLNSTHFQHISPDEEYLKSQTDSLQDLPHG 1018 EE+ARLR D K L +K LNST +SP+EEYLK QTDSL D+ +G Sbjct: 122 MEEIARLRGADHKKLLHGRKLYLVLDLDHTLLNSTRLIDLSPEEEYLKGQTDSLNDVLNG 181 Query: 1017 NLFQLDSMRMFTKLRPFVRTFLKEASNMFEMYVYTMGERPYAMEMARLLDPGKIYFDSRV 838 +LF+LDSM M TKLRPFV TFLKEAS+MFEMYVYTM ER YA+E+A+LLDPG +YF SRV Sbjct: 182 SLFRLDSMTMLTKLRPFVHTFLKEASSMFEMYVYTMAERSYALEIAKLLDPGGVYFSSRV 241 Query: 837 ISQADCTQKHQKGLDVVLGAESAVIILDDTEYVWRQHKENLILMDRYHYFSSSGRSFNLN 658 ISQ +CTQ+HQKGLDVVLGAESAV+ILDDTE VW++H+ENLILM+RYHYFSSS R F + Sbjct: 242 ISQDNCTQRHQKGLDVVLGAESAVVILDDTEIVWQKHRENLILMERYHYFSSSCRQFGFS 301 Query: 657 NRSLSELKRDESEPDGALATILEVLKRIHXXXXXXXXXXXLMR-RDVRQVLKTVRSEVLK 481 +SLSELKRDE E +GALAT+L+VLKRIH + RDVRQV+K +R +VLK Sbjct: 302 AKSLSELKRDECESEGALATVLKVLKRIHEMFFNELVFGADLESRDVRQVMKAIRQDVLK 361 Query: 480 GCKLVFSRVWKIGEQVENQRLWEVAEELGAICCKELDASVTHVVSTDMGTEKSRWAVKYK 301 GCK+VFSRV+ EN +LW++AE+LGA C ELD+SVTHVVSTD GTEK+RWAV++K Sbjct: 362 GCKIVFSRVFPTKFHAENHQLWKIAEQLGATCSTELDSSVTHVVSTDTGTEKARWAVQHK 421 Query: 300 KFLVHPSWIEAANFLWQRQPEDSFAVNDSK 211 K LVHP WIEA N+ W+RQ E++FAV +K Sbjct: 422 KHLVHPQWIEATNYFWERQSEENFAVKKNK 451 >ref|XP_008242970.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Prunus mume] Length = 449 Score = 477 bits (1228), Expect = e-131 Identities = 241/420 (57%), Positives = 303/420 (72%) Frame = -2 Query: 1470 HDGEEEEVNVEDADFAAKRSKRQRLDTLDDDEGIHCSTSLEPVPQSSDVSVQLEICAHPS 1291 H +EE + E D + + +KR++++ L + H STS V ++S+ S + +IC HP Sbjct: 30 HSSPDEEADYESDDGSERSTKRRKVENLGSIDKTHGSTSQVFVEENSEASPKTDICTHPG 89 Query: 1290 LIKGMCVKCGRMVDDVSAVALGYIHKDLKVGSEELARLRNKDLKSLFCKKKXXXXXXXXX 1111 +K +C+ CG+ VD+ S V LGYIHKD + ++E+ R+R+ D+K KK Sbjct: 90 SVKDLCIVCGQRVDEKSGVPLGYIHKDFWLNNDEIDRVRSTDIKKSLHLKKLYLVLDLDH 149 Query: 1110 XXLNSTHFQHISPDEEYLKSQTDSLQDLPHGNLFQLDSMRMFTKLRPFVRTFLKEASNMF 931 LNSTH H++ +EEYL SQTDSLQD+ +G+LF++D M M 
TKLRPFVR FLKEAS MF Sbjct: 150 TLLNSTHLNHMTAEEEYLHSQTDSLQDVSNGSLFRVDVMHMMTKLRPFVRKFLKEASEMF 209 Query: 930 EMYVYTMGERPYAMEMARLLDPGKIYFDSRVISQADCTQKHQKGLDVVLGAESAVIILDD 751 EMY+YTMGER YA+EMA+LLDP K YF RVIS+ D TQKHQKGLDVVLG ESA +ILDD Sbjct: 210 EMYIYTMGERAYALEMAKLLDPRKEYFGDRVISRDDGTQKHQKGLDVVLGQESAALILDD 269 Query: 750 TEYVWRQHKENLILMDRYHYFSSSGRSFNLNNRSLSELKRDESEPDGALATILEVLKRIH 571 TE W +HK+NLILM+RYH+F SS F + +SLSELK DESEP+GALAT+LEVLKR H Sbjct: 270 TENAWTKHKDNLILMERYHFFRSSCHQFGFHCKSLSELKSDESEPEGALATVLEVLKRTH 329 Query: 570 XXXXXXXXXXXLMRRDVRQVLKTVRSEVLKGCKLVFSRVWKIGEQVENQRLWEVAEELGA 391 L+ RDVRQVLKT+R E+LKGCK+VFSRV+ Q EN +LW++AE+LGA Sbjct: 330 -NMFFYESKDNLIDRDVRQVLKTLRKEILKGCKIVFSRVFPSKFQAENHQLWKMAEQLGA 388 Query: 390 ICCKELDASVTHVVSTDMGTEKSRWAVKYKKFLVHPSWIEAANFLWQRQPEDSFAVNDSK 211 C ELD SVTHVVSTD GTEKSRWAVK KKFLVHP WIEA+N++W +Q ED F V +K Sbjct: 389 ACSTELDPSVTHVVSTDAGTEKSRWAVKEKKFLVHPQWIEASNYMWLKQAEDKFPVKQTK 448 >ref|XP_007204345.1| hypothetical protein PRUPE_ppa005647mg [Prunus persica] gi|462399876|gb|EMJ05544.1| hypothetical protein PRUPE_ppa005647mg [Prunus persica] Length = 449 Score = 476 bits (1226), Expect = e-131 Identities = 242/420 (57%), Positives = 303/420 (72%) Frame = -2 Query: 1470 HDGEEEEVNVEDADFAAKRSKRQRLDTLDDDEGIHCSTSLEPVPQSSDVSVQLEICAHPS 1291 H +EE + E D + + +KR++++ L + STS V ++S+ S + +IC HP Sbjct: 30 HSSPDEEADYESDDGSERSTKRRKVENLGSIDETQGSTSQIFVEENSEASPKKDICTHPG 89 Query: 1290 LIKGMCVKCGRMVDDVSAVALGYIHKDLKVGSEELARLRNKDLKSLFCKKKXXXXXXXXX 1111 +K +C+ CG+ VD+ S V LGYIHKD + ++E+ R+R+ D+K KK Sbjct: 90 SVKDLCIVCGQRVDEKSGVPLGYIHKDFWLNNDEIDRVRSTDIKKSLHLKKLYLVLDLDH 149 Query: 1110 XXLNSTHFQHISPDEEYLKSQTDSLQDLPHGNLFQLDSMRMFTKLRPFVRTFLKEASNMF 931 LNSTH H++ +EEYL SQTDSLQD+ G+LF++D M M TKLRPFVR FLKEAS MF Sbjct: 150 TLLNSTHLNHMTAEEEYLHSQTDSLQDVSDGSLFRVDVMHMMTKLRPFVRKFLKEASEMF 209 Query: 930 EMYVYTMGERPYAMEMARLLDPGKIYFDSRVISQADCTQKHQKGLDVVLGAESAVIILDD 751 EMY+YTMGER YA+EMA+LLDP K YF RVIS+ D TQKHQKGLDVVLG ESA +ILDD Sbjct: 210 
EMYIYTMGERAYALEMAKLLDPRKEYFGDRVISRDDGTQKHQKGLDVVLGHESAALILDD 269 Query: 750 TEYVWRQHKENLILMDRYHYFSSSGRSFNLNNRSLSELKRDESEPDGALATILEVLKRIH 571 TE W +HK+NLILM+RYH+F SS F + +SLSELK DESEP+GALAT+LEVLKRIH Sbjct: 270 TENAWTKHKDNLILMERYHFFRSSCHQFGFHCKSLSELKSDESEPEGALATVLEVLKRIH 329 Query: 570 XXXXXXXXXXXLMRRDVRQVLKTVRSEVLKGCKLVFSRVWKIGEQVENQRLWEVAEELGA 391 L+ RDVRQVLKT+R E+LKGCK+VFSRV+ Q EN +LW++AE+LGA Sbjct: 330 -NMFFYESKDNLIDRDVRQVLKTLRKEILKGCKIVFSRVFPSKFQAENHQLWKMAEQLGA 388 Query: 390 ICCKELDASVTHVVSTDMGTEKSRWAVKYKKFLVHPSWIEAANFLWQRQPEDSFAVNDSK 211 C ELD SVTHVVSTD GTEKSRWAVK KKFLVHP WIEA+N++W +Q ED F VN +K Sbjct: 389 TCSTELDLSVTHVVSTDAGTEKSRWAVKEKKFLVHPQWIEASNYMWLKQAEDKFPVNQTK 448 >gb|KNA16741.1| hypothetical protein SOVF_085710 isoform B [Spinacia oleracea] Length = 452 Score = 476 bits (1225), Expect = e-131 Identities = 243/449 (54%), Positives = 311/449 (69%) Frame = -2 Query: 1554 FAALLNEELAGIXXXXXXXXXXXXXDQDHDGEEEEVNVEDADFAAKRSKRQRLDTLDDDE 1375 FAALL+ EL DG ++ +D D A+ + +R L+ D Sbjct: 17 FAALLDAELDS---------------DSSDGSPDQDCSDDEDNNAEGERTKRCKVLEMDS 61 Query: 1374 GIHCSTSLEPVPQSSDVSVQLEICAHPSLIKGMCVKCGRMVDDVSAVALGYIHKDLKVGS 1195 + S + + C HP ++ MC+ CG+ +DD + VA GYIHKDL++GS Sbjct: 62 RVEVQGSNSNGFTEQTIEAITDSCTHPGFLRDMCICCGKRMDDGAGVAFGYIHKDLRLGS 121 Query: 1194 EELARLRNKDLKSLFCKKKXXXXXXXXXXXLNSTHFQHISPDEEYLKSQTDSLQDLPHGN 1015 +E++RLR+ D+++L KK LNST + I+ +E+YLKSQTDS +D+ G+ Sbjct: 122 DEVSRLRDADVRTLLRNKKLYLVLDLDHTLLNSTRLEDINSEEDYLKSQTDSFEDISKGS 181 Query: 1014 LFQLDSMRMFTKLRPFVRTFLKEASNMFEMYVYTMGERPYAMEMARLLDPGKIYFDSRVI 835 LF+LD MRM TKLRP+VRTFL+EASNMFEMY+YTMGER YA+EMA+LLDPG +YF+SRVI Sbjct: 182 LFRLDKMRMMTKLRPYVRTFLQEASNMFEMYIYTMGERAYAIEMAKLLDPGSLYFNSRVI 241 Query: 834 SQADCTQKHQKGLDVVLGAESAVIILDDTEYVWRQHKENLILMDRYHYFSSSGRSFNLNN 655 SQADCTQ+HQKGLDVVLG +SAV+ILDDTE VW++HK+NLILM+RYHYFSSS R F N Sbjct: 242 SQADCTQRHQKGLDVVLGKDSAVLILDDTEAVWQRHKDNLILMERYHYFSSSCRQFGFNC 301 Query: 654 RSLSELKRDESEPDGALATILEVLKRIHXXXXXXXXXXXLMRRDVRQVLKTVRSEVLKGC 475 +SLSELK DE+E DGALAT+L VLK+IH 
RDVRQVLK +R+EVL C Sbjct: 302 KSLSELKGDENEADGALATVLGVLKKIHSNFFDAEHGNDFAARDVRQVLKKIRNEVLGDC 361 Query: 474 KLVFSRVWKIGEQVENQRLWEVAEELGAICCKELDASVTHVVSTDMGTEKSRWAVKYKKF 295 K+VFSRV+ Q EN LW++AE+LGA C E+D++VTHVVSTD GTEKSRWAV+ K+ Sbjct: 362 KIVFSRVFPTKFQAENHHLWKMAEQLGARCAVEVDSTVTHVVSTDAGTEKSRWAVENGKY 421 Query: 294 LVHPSWIEAANFLWQRQPEDSFAVNDSKK 208 LVHP W+EAAN+LW ++PE F V SKK Sbjct: 422 LVHPKWLEAANYLWSKKPEQEFPVVLSKK 450 >ref|XP_012481529.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Gossypium raimondii] gi|763760638|gb|KJB27892.1| hypothetical protein B456_005G016300 [Gossypium raimondii] Length = 470 Score = 476 bits (1225), Expect = e-131 Identities = 244/423 (57%), Positives = 304/423 (71%) Frame = -2 Query: 1473 DHDGEEEEVNVEDADFAAKRSKRQRLDTLDDDEGIHCSTSLEPVPQSSDVSVQLEICAHP 1294 D D ++EE + D D R+KR + + LDD EG STS + + +VS+ + C HP Sbjct: 50 DDDSDDEEDDSND-DLNDHRNKRCKTEKLDDLEGPQGSTSQGLIEEKLEVSLNKDTCTHP 108 Query: 1293 SLIKGMCVKCGRMVDDVSAVALGYIHKDLKVGSEELARLRNKDLKSLFCKKKXXXXXXXX 1114 MC+ CG+ VDD S V GYIHK L++G++E+ RLR+ D+K+L KK Sbjct: 109 GSFGQMCILCGQRVDDESGVTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHKKLYLVLDLD 168 Query: 1113 XXXLNSTHFQHISPDEEYLKSQTDSLQDLPHGNLFQLDSMRMFTKLRPFVRTFLKEASNM 934 LNST H++ +EEYLK Q+DS+QD+ G+LF L+ M M TKLRPFVRTFLKEAS M Sbjct: 169 HTLLNSTQLNHLTAEEEYLKGQSDSMQDVSKGSLFMLEFMHMMTKLRPFVRTFLKEASEM 228 Query: 933 FEMYVYTMGERPYAMEMARLLDPGKIYFDSRVISQADCTQKHQKGLDVVLGAESAVIILD 754 FEMY+YTMG+RPYA+EMA+LLDP K YF+ RVIS+ D TQKHQKGLDVVLG +SAV+ILD Sbjct: 229 FEMYIYTMGDRPYALEMAKLLDPKKEYFNGRVISRDDGTQKHQKGLDVVLGQDSAVVILD 288 Query: 753 DTEYVWRQHKENLILMDRYHYFSSSGRSFNLNNRSLSELKRDESEPDGALATILEVLKRI 574 DTE W +HK+NLILM+RYH+F+SS R F + RSLS+LK DESEPDGALA+IL++L++I Sbjct: 289 DTENAWTKHKDNLILMERYHFFASSCRQFGFDCRSLSQLKSDESEPDGALASILKILRQI 348 Query: 573 HXXXXXXXXXXXLMRRDVRQVLKTVRSEVLKGCKLVFSRVWKIGEQVENQRLWEVAEELG 394 H L RDVRQVLKTVR EVLK CK+VFSRV+ Q EN LW++AE+LG Sbjct: 349 H-HIFFDELDSDLASRDVRQVLKTVRKEVLKDCKIVFSRVFPTKFQPENHLLWKMAEQLG 407 Query: 393 
AICCKELDASVTHVVSTDMGTEKSRWAVKYKKFLVHPSWIEAANFLWQRQPEDSFAVNDS 214 A C E D+SVTHVVS D GTEKSRWAVK KFLVHP WIEAANF W +QPE+ F V+ + Sbjct: 408 ATCSTETDSSVTHVVSMDAGTEKSRWAVKENKFLVHPRWIEAANFFWLKQPEEKFPVSQT 467 Query: 213 KKE 205 K + Sbjct: 468 KNQ 470 >ref|XP_010693335.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4 [Beta vulgaris subsp. vulgaris] gi|870846670|gb|KMS99186.1| hypothetical protein BVRB_2g047230 [Beta vulgaris subsp. vulgaris] Length = 434 Score = 476 bits (1224), Expect = e-131 Identities = 240/398 (60%), Positives = 296/398 (74%), Gaps = 4/398 (1%) Frame = -2 Query: 1386 DDDEGIHCSTSLEPVPQSSDVSVQLE-ICAHPSLIKGMCVKCGRMVDDVSAVALGYIHKD 1210 D+D + + D V++E C HP ++ +C+ CG+ +DD + VA GYIHKD Sbjct: 37 DEDNNVEGARMKRRKVLEIDSKVEVEGSCTHPGFLRDLCIGCGKRMDDGAGVAFGYIHKD 96 Query: 1209 LKVGSEELARLRNKDLKSLFCKKKXXXXXXXXXXXLNSTHFQHISPDEEYLKSQTDSLQD 1030 L++G++E++RLRN D++SL KK LNST + I+ +EEYLKSQTDS QD Sbjct: 97 LRLGNDEISRLRNADVRSLLRHKKLYLVLDLDHTLLNSTRLEDINSEEEYLKSQTDSFQD 156 Query: 1029 LPHGNLFQLDSMRMFTKLRPFVRTFLKEASNMFEMYVYTMGERPYAMEMARLLDPGKIYF 850 + G+LF+LD MRM TKLRP+VRTFL+EAS+MFEMY+YTMGERPYA+EMA+LLDPG +YF Sbjct: 157 IAKGSLFRLDMMRMMTKLRPYVRTFLEEASSMFEMYIYTMGERPYAIEMAKLLDPGNLYF 216 Query: 849 DSRVISQADCTQKHQKGLDVVLGAESAVIILDDTEYVWRQHKENLILMDRYHYFSSSGRS 670 +SRVISQADCTQ+HQKGLDVVLG ESAV+ILDDTE VWR+HK+NLILM+RYHYFSSS R Sbjct: 217 NSRVISQADCTQRHQKGLDVVLGQESAVLILDDTEGVWRRHKDNLILMERYHYFSSSCRQ 276 Query: 669 FNLNNRSLSELKRDESEPDGALATILEVLKRIH---XXXXXXXXXXXLMRRDVRQVLKTV 499 F + +SLSELK DE+E DGALAT+L VLK+IH RDVRQVLK Sbjct: 277 FGYSCKSLSELKGDENEADGALATVLGVLKKIHSKFFDPEHGDESDDFAARDVRQVLKQF 336 Query: 498 RSEVLKGCKLVFSRVWKIGEQVENQRLWEVAEELGAICCKELDASVTHVVSTDMGTEKSR 319 R EVLK CKLVFSRV+ Q +N LW++AE+LGA C ELD+SVTHVVSTD GTEKSR Sbjct: 337 RKEVLKDCKLVFSRVFPTKFQADNHHLWKMAEKLGATCSMELDSSVTHVVSTDSGTEKSR 396 Query: 318 WAVKYKKFLVHPSWIEAANFLWQRQPEDSFAVNDSKKE 205 WAV+ KFLVHP W+EAAN+LW RQPED F V+ +K + Sbjct: 397 WAVQNGKFLVHPRWLEAANYLWNRQPEDQFPVHLTKSK 434 >gb|KHG05109.1| RNA polymerase II 
C-terminal domain phosphatase-like 4 [Gossypium arboreum] Length = 404 Score = 476 bits (1224), Expect = e-131 Identities = 238/404 (58%), Positives = 301/404 (74%) Frame = -2 Query: 1416 RSKRQRLDTLDDDEGIHCSTSLEPVPQSSDVSVQLEICAHPSLIKGMCVKCGRMVDDVSA 1237 R+KR + + LDD EG+ STS + + +VS+ + C+HP MC+ CG+ VDD S+ Sbjct: 2 RNKRCKTEKLDDLEGLQGSTSQGLIEEKLEVSLNKDTCSHPGSFGQMCILCGQRVDDESS 61 Query: 1236 VALGYIHKDLKVGSEELARLRNKDLKSLFCKKKXXXXXXXXXXXLNSTHFQHISPDEEYL 1057 V GYIHK L++G++E+ RLR+ D+K+L KK LNST H++ +EEYL Sbjct: 62 VTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHKKLYLVLDLDHTLLNSTQLNHLTAEEEYL 121 Query: 1056 KSQTDSLQDLPHGNLFQLDSMRMFTKLRPFVRTFLKEASNMFEMYVYTMGERPYAMEMAR 877 K Q+DSLQD+ G+LF L+ M+M TKLRPFVRTFLKEAS MFEMY+YTMG+RPYA+EMA+ Sbjct: 122 KGQSDSLQDVSKGSLFMLEFMQMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEMAK 181 Query: 876 LLDPGKIYFDSRVISQADCTQKHQKGLDVVLGAESAVIILDDTEYVWRQHKENLILMDRY 697 LLDP K YF+ RVIS+ D TQKHQKGLDVVLG +SAV+ILDDTE W +HK+NLILM+RY Sbjct: 182 LLDPKKEYFNGRVISRDDGTQKHQKGLDVVLGQDSAVVILDDTENAWTKHKDNLILMERY 241 Query: 696 HYFSSSGRSFNLNNRSLSELKRDESEPDGALATILEVLKRIHXXXXXXXXXXXLMRRDVR 517 H+F+SS R F + +SLS+LK DESEPDGALA+IL++L++IH L RDVR Sbjct: 242 HFFASSCRQFGFDCKSLSQLKSDESEPDGALASILKILRQIH-HIFFDELDSDLASRDVR 300 Query: 516 QVLKTVRSEVLKGCKLVFSRVWKIGEQVENQRLWEVAEELGAICCKELDASVTHVVSTDM 337 QVLKTVR EVLK CK+VFSRV+ Q EN LW++AE+LGA C E D+SVTH+VS D Sbjct: 301 QVLKTVRKEVLKNCKIVFSRVFPTKFQPENHLLWKMAEQLGATCSTETDSSVTHIVSMDA 360 Query: 336 GTEKSRWAVKYKKFLVHPSWIEAANFLWQRQPEDSFAVNDSKKE 205 GTEKSRWAVK KFLVHP WIEAANF WQ+QPE++F V+ +K + Sbjct: 361 GTEKSRWAVKENKFLVHPRWIEAANFFWQKQPEENFPVSQTKNQ 404 >ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa] gi|550318538|gb|EEF03112.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa] Length = 472 Score = 475 bits (1223), Expect = e-131 Identities = 239/422 (56%), Positives = 303/422 (71%) Frame = -2 Query: 1476 QDHDGEEEEVNVEDADFAAKRSKRQRLDTLDDDEGIHCSTSLEPVPQSSDVSVQLEICAH 1297 QD + EE++ D+DF KR KR +++T++ E +TS + +S+ S+ EIC H Sbjct: 
55 QDKEAEEDD----DSDFQRKRVKRSKVETVEIVEDDGGTTSFASLKHNSEASISKEICTH 110 Query: 1296 PSLIKGMCVKCGRMVDDVSAVALGYIHKDLKVGSEELARLRNKDLKSLFCKKKXXXXXXX 1117 P MC+ CG+++D S V GYIHK L++G++E+ RLRN D+K+L KK Sbjct: 111 PGSFGTMCIVCGQLLDGESGVTFGYIHKGLRLGNDEIVRLRNTDMKNLLRHKKLYLILDL 170 Query: 1116 XXXXLNSTHFQHISPDEEYLKSQTDSLQDLPHGNLFQLDSMRMFTKLRPFVRTFLKEASN 937 LNST H++ DEEYL QTDSLQD+ G+LF L SM+M TKLRPFVRTFLKEAS Sbjct: 171 DHTLLNSTQLMHMTLDEEYLNGQTDSLQDVSKGSLFMLSSMQMMTKLRPFVRTFLKEASQ 230 Query: 936 MFEMYVYTMGERPYAMEMARLLDPGKIYFDSRVISQADCTQKHQKGLDVVLGAESAVIIL 757 MFEMY+YTMG+R YA+EMA+LLDPG+ YF+++VIS+ D TQ+HQKGLDVVLG ESAV+IL Sbjct: 231 MFEMYIYTMGDRAYALEMAKLLDPGREYFNAKVISRDDGTQRHQKGLDVVLGQESAVLIL 290 Query: 756 DDTEYVWRQHKENLILMDRYHYFSSSGRSFNLNNRSLSELKRDESEPDGALATILEVLKR 577 DDTE W +HK+NLILM+RYH+F+SS F N +SLSE K DESE +GALA+IL+VL++ Sbjct: 291 DDTENAWMKHKDNLILMERYHFFASSCHQFGFNCKSLSEQKTDESESEGALASILKVLRK 350 Query: 576 IHXXXXXXXXXXXLMRRDVRQVLKTVRSEVLKGCKLVFSRVWKIGEQVENQRLWEVAEEL 397 IH RDVRQVLKTVR +VLKGCK+VFSRV+ Q +N LW +AE+L Sbjct: 351 IHQIFFEELEENMD-GRDVRQVLKTVRKDVLKGCKIVFSRVFPTQSQADNHHLWRMAEQL 409 Query: 396 GAICCKELDASVTHVVSTDMGTEKSRWAVKYKKFLVHPSWIEAANFLWQRQPEDSFAVND 217 GA C ELD SVTHVVS D GTEKS WA+K+ KFLV P WIEAAN+ WQRQPE++F+ N Sbjct: 410 GATCSTELDPSVTHVVSKDSGTEKSHWALKHNKFLVQPGWIEAANYFWQRQPEENFSFNQ 469 Query: 216 SK 211 K Sbjct: 470 IK 471 >ref|XP_007014445.1| RNA polymerase II ctd phosphatase, putative isoform 1 [Theobroma cacao] gi|508784808|gb|EOY32064.1| RNA polymerase II ctd phosphatase, putative isoform 1 [Theobroma cacao] Length = 469 Score = 475 bits (1223), Expect = e-131 Identities = 247/454 (54%), Positives = 315/454 (69%), Gaps = 4/454 (0%) Frame = -2 Query: 1554 FAALLNEELAGIXXXXXXXXXXXXXDQDHDGEEEEVNVEDADFAAKRSKRQRLDTLDDDE 1375 FAALL+ EL D D++ + + + +D D ++R+KR + + L+D E Sbjct: 17 FAALLDAELEVGSSGSSPDEEDVEADGDNNNDNNDDHDDDDDLDSQRNKRCKTEKLEDLE 76 Query: 1374 GIHCSTSL----EPVPQSSDVSVQLEICAHPSLIKGMCVKCGRMVDDVSAVALGYIHKDL 1207 STS + + +++S++ +IC HP MC+ CG+ +DD S V GYIHK L Sbjct: 77 
ESRGSTSQGLIEDKIVIHAELSLKKDICTHPGSFGQMCILCGQRLDDESGVTFGYIHKGL 136 Query: 1206 KVGSEELARLRNKDLKSLFCKKKXXXXXXXXXXXLNSTHFQHISPDEEYLKSQTDSLQDL 1027 ++G++E+ RLR+ D+K+L KK LNST H++PDEEYLK Q+DSLQD+ Sbjct: 137 RLGNDEIVRLRSTDMKNLLRHKKLYLVLDLDHTLLNSTQLMHLTPDEEYLKGQSDSLQDV 196 Query: 1026 PHGNLFQLDSMRMFTKLRPFVRTFLKEASNMFEMYVYTMGERPYAMEMARLLDPGKIYFD 847 G+LF LD M M TKLRPFVRTFLKEAS MFEMY+YTMG+RPYA+EMA+LLDP + YF Sbjct: 197 SRGSLFMLDFMHMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEMAKLLDPRREYFS 256 Query: 846 SRVISQADCTQKHQKGLDVVLGAESAVIILDDTEYVWRQHKENLILMDRYHYFSSSGRSF 667 RVIS+ D TQKHQKGLDVVLG ESAV+ILDDTE W +HK+NLILM+RYHYF+SS F Sbjct: 257 DRVISRDDGTQKHQKGLDVVLGQESAVVILDDTENAWMKHKDNLILMERYHYFASSCHQF 316 Query: 666 NLNNRSLSELKRDESEPDGALATILEVLKRIHXXXXXXXXXXXLMRRDVRQVLKTVRSEV 487 +SLS+LK DESEPDGALA++L+ L++IH L RDVRQVLKTV+ EV Sbjct: 317 GYKCKSLSQLKSDESEPDGALASVLKALRQIH-HMFFDELDCNLASRDVRQVLKTVQEEV 375 Query: 486 LKGCKLVFSRVWKIGEQVENQRLWEVAEELGAICCKELDASVTHVVSTDMGTEKSRWAVK 307 LKGCK+VFS V+ E+ LW++AE+LGA C E D SVTHVVSTD GTEKSRWAVK Sbjct: 376 LKGCKIVFSHVFPTNFPAESHPLWKMAEQLGATCSTETDLSVTHVVSTDAGTEKSRWAVK 435 Query: 306 YKKFLVHPSWIEAANFLWQRQPEDSFAVNDSKKE 205 KKFLVHP WIEA N+LWQ+QPE++F V+ K + Sbjct: 436 EKKFLVHPRWIEATNYLWQKQPEENFPVSQGKNQ 469 >gb|KNA16740.1| hypothetical protein SOVF_085710 isoform A [Spinacia oleracea] Length = 453 Score = 475 bits (1222), Expect = e-131 Identities = 244/450 (54%), Positives = 313/450 (69%), Gaps = 1/450 (0%) Frame = -2 Query: 1554 FAALLNEELAGIXXXXXXXXXXXXXDQDHDGEEEEVNVEDADFAAKRSKRQRLDTLDDDE 1375 FAALL+ EL DG ++ +D D A+ + +R L+ D Sbjct: 17 FAALLDAELDS---------------DSSDGSPDQDCSDDEDNNAEGERTKRCKVLEMDS 61 Query: 1374 GIHCSTS-LEPVPQSSDVSVQLEICAHPSLIKGMCVKCGRMVDDVSAVALGYIHKDLKVG 1198 + S + + V + C HP ++ MC+ CG+ +DD + VA GYIHKDL++G Sbjct: 62 RVEVQGSNSNGFTEQTIVEAITDSCTHPGFLRDMCICCGKRMDDGAGVAFGYIHKDLRLG 121 Query: 1197 SEELARLRNKDLKSLFCKKKXXXXXXXXXXXLNSTHFQHISPDEEYLKSQTDSLQDLPHG 1018 S+E++RLR+ D+++L KK LNST + I+ +E+YLKSQTDS +D+ G Sbjct: 122 
SDEVSRLRDADVRTLLRNKKLYLVLDLDHTLLNSTRLEDINSEEDYLKSQTDSFEDISKG 181 Query: 1017 NLFQLDSMRMFTKLRPFVRTFLKEASNMFEMYVYTMGERPYAMEMARLLDPGKIYFDSRV 838 +LF+LD MRM TKLRP+VRTFL+EASNMFEMY+YTMGER YA+EMA+LLDPG +YF+SRV Sbjct: 182 SLFRLDKMRMMTKLRPYVRTFLQEASNMFEMYIYTMGERAYAIEMAKLLDPGSLYFNSRV 241 Query: 837 ISQADCTQKHQKGLDVVLGAESAVIILDDTEYVWRQHKENLILMDRYHYFSSSGRSFNLN 658 ISQADCTQ+HQKGLDVVLG +SAV+ILDDTE VW++HK+NLILM+RYHYFSSS R F N Sbjct: 242 ISQADCTQRHQKGLDVVLGKDSAVLILDDTEAVWQRHKDNLILMERYHYFSSSCRQFGFN 301 Query: 657 NRSLSELKRDESEPDGALATILEVLKRIHXXXXXXXXXXXLMRRDVRQVLKTVRSEVLKG 478 +SLSELK DE+E DGALAT+L VLK+IH RDVRQVLK +R+EVL Sbjct: 302 CKSLSELKGDENEADGALATVLGVLKKIHSNFFDAEHGNDFAARDVRQVLKKIRNEVLGD 361 Query: 477 CKLVFSRVWKIGEQVENQRLWEVAEELGAICCKELDASVTHVVSTDMGTEKSRWAVKYKK 298 CK+VFSRV+ Q EN LW++AE+LGA C E+D++VTHVVSTD GTEKSRWAV+ K Sbjct: 362 CKIVFSRVFPTKFQAENHHLWKMAEQLGARCAVEVDSTVTHVVSTDAGTEKSRWAVENGK 421 Query: 297 FLVHPSWIEAANFLWQRQPEDSFAVNDSKK 208 +LVHP W+EAAN+LW ++PE F V SKK Sbjct: 422 YLVHPKWLEAANYLWSKKPEQEFPVVLSKK 451 >ref|XP_011018018.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4 [Populus euphratica] Length = 472 Score = 473 bits (1218), Expect = e-130 Identities = 240/422 (56%), Positives = 302/422 (71%) Frame = -2 Query: 1476 QDHDGEEEEVNVEDADFAAKRSKRQRLDTLDDDEGIHCSTSLEPVPQSSDVSVQLEICAH 1297 QD + EE++ D+DF +KR KR +++TL+ E + SL + +S+VS+ EIC H Sbjct: 55 QDKEAEEDD----DSDFQSKRVKRSKVETLEIVEDDGGAASLASLKHNSEVSISKEICTH 110 Query: 1296 PSLIKGMCVKCGRMVDDVSAVALGYIHKDLKVGSEELARLRNKDLKSLFCKKKXXXXXXX 1117 P MC+ CG+++D S V GYIHK L++G++E+ RLRN D+K+L KK Sbjct: 111 PGSFGTMCIVCGQLLDGESGVTFGYIHKGLRLGNDEIVRLRNTDMKNLLRHKKLYLILDL 170 Query: 1116 XXXXLNSTHFQHISPDEEYLKSQTDSLQDLPHGNLFQLDSMRMFTKLRPFVRTFLKEASN 937 LNST H++ DEEYL QT SLQD+ G+LF L SM+M TKLRPFVRTFLKEAS Sbjct: 171 DHTLLNSTQLMHMTLDEEYLNGQTASLQDVSKGSLFMLSSMQMMTKLRPFVRTFLKEASQ 230 Query: 936 MFEMYVYTMGERPYAMEMARLLDPGKIYFDSRVISQADCTQKHQKGLDVVLGAESAVIIL 757 MFEMY+YTMG+R YA+EMA+LLDPG+ YF+++VIS+ D TQ+HQKGLDVVLG ESAV+IL Sbjct: 231 
MFEMYIYTMGDRAYALEMAKLLDPGREYFNAKVISRDDGTQRHQKGLDVVLGQESAVLIL 290 Query: 756 DDTEYVWRQHKENLILMDRYHYFSSSGRSFNLNNRSLSELKRDESEPDGALATILEVLKR 577 DDTE W +HK+NLILM+RYH+F+SS F N +SLSE DESE +GALA+IL+VL++ Sbjct: 291 DDTENAWMKHKDNLILMERYHFFASSCHQFGFNCKSLSEQNTDESESEGALASILKVLRK 350 Query: 576 IHXXXXXXXXXXXLMRRDVRQVLKTVRSEVLKGCKLVFSRVWKIGEQVENQRLWEVAEEL 397 IH RDVRQVLKTVR +VLKGCK+VFSRV+ Q N LW +AE+L Sbjct: 351 IHQIFFEELEENMD-GRDVRQVLKTVRKDVLKGCKIVFSRVFPTQSQANNHHLWRMAEQL 409 Query: 396 GAICCKELDASVTHVVSTDMGTEKSRWAVKYKKFLVHPSWIEAANFLWQRQPEDSFAVND 217 GA C ELD SVTHVVS D GTEKS WA K+ KFLV P WIEAAN+ WQRQPE++F+VN Sbjct: 410 GATCSTELDPSVTHVVSKDSGTEKSHWASKHNKFLVQPGWIEAANYFWQRQPEENFSVNQ 469 Query: 216 SK 211 K Sbjct: 470 IK 471 >ref|XP_012078975.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4 [Jatropha curcas] gi|802640739|ref|XP_012078976.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4 [Jatropha curcas] gi|643722394|gb|KDP32215.1| hypothetical protein JCGZ_13822 [Jatropha curcas] Length = 470 Score = 472 bits (1215), Expect = e-130 Identities = 244/454 (53%), Positives = 317/454 (69%), Gaps = 4/454 (0%) Frame = -2 Query: 1554 FAALLNEELAGIXXXXXXXXXXXXXDQDHDGEEEEVNVE----DADFAAKRSKRQRLDTL 1387 FAALL+ EL +++ + EEEE + D D +KR KR R++TL Sbjct: 17 FAALLDAELDSKSSDSSPNDDDEEEEEEEEEEEEEEAKDEPEDDPDIESKRIKRSRVETL 76 Query: 1386 DDDEGIHCSTSLEPVPQSSDVSVQLEICAHPSLIKGMCVKCGRMVDDVSAVALGYIHKDL 1207 ++ E ST + + S C HP MC+ CG+ +++ + V L YIHK L Sbjct: 77 ENVEDPKGSTFHGSLDLNLGASSSKVACTHPGSFGDMCIICGQRLNEETGVTLAYIHKGL 136 Query: 1206 KVGSEELARLRNKDLKSLFCKKKXXXXXXXXXXXLNSTHFQHISPDEEYLKSQTDSLQDL 1027 ++G++E+ RLRN D K+L KK LNST H++ +EEYLKSQ DSLQD+ Sbjct: 137 RLGNDEIVRLRNSDTKNLLRHKKLYLVLDLDHTLLNSTQLMHMTAEEEYLKSQLDSLQDV 196 Query: 1026 PHGNLFQLDSMRMFTKLRPFVRTFLKEASNMFEMYVYTMGERPYAMEMARLLDPGKIYFD 847 +G+LF+LD M M TKLRP+V TFLKEAS MFEMY+YTMG+R YA+EMA+LLDP + YF+ Sbjct: 197 SNGSLFKLDFMHMMTKLRPYVHTFLKEASQMFEMYIYTMGDRAYALEMAKLLDPRREYFN 256 Query: 846 
SRVISQADCTQKHQKGLDVVLGAESAVIILDDTEYVWRQHKENLILMDRYHYFSSSGRSF 667 +RVIS+ D TQ+HQKGLD+VLG ESAV+ILDDTE W +HK+NLILM+RYH+F+SS F Sbjct: 257 ARVISRDDGTQRHQKGLDIVLGQESAVLILDDTETAWTKHKDNLILMERYHFFASSCHQF 316 Query: 666 NLNNRSLSELKRDESEPDGALATILEVLKRIHXXXXXXXXXXXLMRRDVRQVLKTVRSEV 487 + +SLSELK DES+ DGALA++L+VL+RIH L RDVRQVLKTVR +V Sbjct: 317 GFSCKSLSELKSDESDSDGALASVLKVLRRIHHIFFDELMDVNLDSRDVRQVLKTVRKDV 376 Query: 486 LKGCKLVFSRVWKIGEQVENQRLWEVAEELGAICCKELDASVTHVVSTDMGTEKSRWAVK 307 L+GCK+VFSRV+ Q N +LW++AE+LGAIC ELD+S+THVVST+ GTEKSRWA+K Sbjct: 377 LEGCKIVFSRVFPTQFQANNHQLWKMAEQLGAICSTELDSSITHVVSTEAGTEKSRWAMK 436 Query: 306 YKKFLVHPSWIEAANFLWQRQPEDSFAVNDSKKE 205 KKFLVHP WIEAAN+LWQRQPE++F+VN K + Sbjct: 437 NKKFLVHPRWIEAANYLWQRQPEENFSVNQPKHQ 470 >gb|KNA16742.1| hypothetical protein SOVF_085710 isoform C [Spinacia oleracea] Length = 450 Score = 471 bits (1212), Expect = e-130 Identities = 243/449 (54%), Positives = 315/449 (70%) Frame = -2 Query: 1554 FAALLNEELAGIXXXXXXXXXXXXXDQDHDGEEEEVNVEDADFAAKRSKRQRLDTLDDDE 1375 FAALL+ EL D +++ + ED + +R K +D+ + + Sbjct: 17 FAALLDAELDS-------------DSSDGSPDQDCSDDEDNNAEGERCKVLEMDSRVEVQ 63 Query: 1374 GIHCSTSLEPVPQSSDVSVQLEICAHPSLIKGMCVKCGRMVDDVSAVALGYIHKDLKVGS 1195 G + + E + V + C HP ++ MC+ CG+ +DD + VA GYIHKDL++GS Sbjct: 64 GSNSNGFTE----QTIVEAITDSCTHPGFLRDMCICCGKRMDDGAGVAFGYIHKDLRLGS 119 Query: 1194 EELARLRNKDLKSLFCKKKXXXXXXXXXXXLNSTHFQHISPDEEYLKSQTDSLQDLPHGN 1015 +E++RLR+ D+++L KK LNST + I+ +E+YLKSQTDS +D+ G+ Sbjct: 120 DEVSRLRDADVRTLLRNKKLYLVLDLDHTLLNSTRLEDINSEEDYLKSQTDSFEDISKGS 179 Query: 1014 LFQLDSMRMFTKLRPFVRTFLKEASNMFEMYVYTMGERPYAMEMARLLDPGKIYFDSRVI 835 LF+LD MRM TKLRP+VRTFL+EASNMFEMY+YTMGER YA+EMA+LLDPG +YF+SRVI Sbjct: 180 LFRLDKMRMMTKLRPYVRTFLQEASNMFEMYIYTMGERAYAIEMAKLLDPGSLYFNSRVI 239 Query: 834 SQADCTQKHQKGLDVVLGAESAVIILDDTEYVWRQHKENLILMDRYHYFSSSGRSFNLNN 655 SQADCTQ+HQKGLDVVLG +SAV+ILDDTE VW++HK+NLILM+RYHYFSSS R F N Sbjct: 240 SQADCTQRHQKGLDVVLGKDSAVLILDDTEAVWQRHKDNLILMERYHYFSSSCRQFGFNC 299 Query: 654 
RSLSELKRDESEPDGALATILEVLKRIHXXXXXXXXXXXLMRRDVRQVLKTVRSEVLKGC 475 +SLSELK DE+E DGALAT+L VLK+IH RDVRQVLK +R+EVL C Sbjct: 300 KSLSELKGDENEADGALATVLGVLKKIHSNFFDAEHGNDFAARDVRQVLKKIRNEVLGDC 359 Query: 474 KLVFSRVWKIGEQVENQRLWEVAEELGAICCKELDASVTHVVSTDMGTEKSRWAVKYKKF 295 K+VFSRV+ Q EN LW++AE+LGA C E+D++VTHVVSTD GTEKSRWAV+ K+ Sbjct: 360 KIVFSRVFPTKFQAENHHLWKMAEQLGARCAVEVDSTVTHVVSTDAGTEKSRWAVENGKY 419 Query: 294 LVHPSWIEAANFLWQRQPEDSFAVNDSKK 208 LVHP W+EAAN+LW ++PE F V SKK Sbjct: 420 LVHPKWLEAANYLWSKKPEQEFPVVLSKK 448 >gb|KJB27893.1| hypothetical protein B456_005G016300 [Gossypium raimondii] Length = 469 Score = 471 bits (1212), Expect = e-130 Identities = 244/423 (57%), Positives = 304/423 (71%) Frame = -2 Query: 1473 DHDGEEEEVNVEDADFAAKRSKRQRLDTLDDDEGIHCSTSLEPVPQSSDVSVQLEICAHP 1294 D D ++EE + D D R+KR + + LDD EG STS + + + VS+ + C HP Sbjct: 50 DDDSDDEEDDSND-DLNDHRNKRCKTEKLDDLEGPQGSTS-QGLIEEKLVSLNKDTCTHP 107 Query: 1293 SLIKGMCVKCGRMVDDVSAVALGYIHKDLKVGSEELARLRNKDLKSLFCKKKXXXXXXXX 1114 MC+ CG+ VDD S V GYIHK L++G++E+ RLR+ D+K+L KK Sbjct: 108 GSFGQMCILCGQRVDDESGVTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHKKLYLVLDLD 167 Query: 1113 XXXLNSTHFQHISPDEEYLKSQTDSLQDLPHGNLFQLDSMRMFTKLRPFVRTFLKEASNM 934 LNST H++ +EEYLK Q+DS+QD+ G+LF L+ M M TKLRPFVRTFLKEAS M Sbjct: 168 HTLLNSTQLNHLTAEEEYLKGQSDSMQDVSKGSLFMLEFMHMMTKLRPFVRTFLKEASEM 227 Query: 933 FEMYVYTMGERPYAMEMARLLDPGKIYFDSRVISQADCTQKHQKGLDVVLGAESAVIILD 754 FEMY+YTMG+RPYA+EMA+LLDP K YF+ RVIS+ D TQKHQKGLDVVLG +SAV+ILD Sbjct: 228 FEMYIYTMGDRPYALEMAKLLDPKKEYFNGRVISRDDGTQKHQKGLDVVLGQDSAVVILD 287 Query: 753 DTEYVWRQHKENLILMDRYHYFSSSGRSFNLNNRSLSELKRDESEPDGALATILEVLKRI 574 DTE W +HK+NLILM+RYH+F+SS R F + RSLS+LK DESEPDGALA+IL++L++I Sbjct: 288 DTENAWTKHKDNLILMERYHFFASSCRQFGFDCRSLSQLKSDESEPDGALASILKILRQI 347 Query: 573 HXXXXXXXXXXXLMRRDVRQVLKTVRSEVLKGCKLVFSRVWKIGEQVENQRLWEVAEELG 394 H L RDVRQVLKTVR EVLK CK+VFSRV+ Q EN LW++AE+LG Sbjct: 348 H-HIFFDELDSDLASRDVRQVLKTVRKEVLKDCKIVFSRVFPTKFQPENHLLWKMAEQLG 406 Query: 393 
AICCKELDASVTHVVSTDMGTEKSRWAVKYKKFLVHPSWIEAANFLWQRQPEDSFAVNDS 214 A C E D+SVTHVVS D GTEKSRWAVK KFLVHP WIEAANF W +QPE+ F V+ + Sbjct: 407 ATCSTETDSSVTHVVSMDAGTEKSRWAVKENKFLVHPRWIEAANFFWLKQPEEKFPVSQT 466 Query: 213 KKE 205 K + Sbjct: 467 KNQ 469 >ref|XP_012481530.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4 isoform X2 [Gossypium raimondii] Length = 404 Score = 470 bits (1209), Expect = e-129 Identities = 238/404 (58%), Positives = 295/404 (73%) Frame = -2 Query: 1416 RSKRQRLDTLDDDEGIHCSTSLEPVPQSSDVSVQLEICAHPSLIKGMCVKCGRMVDDVSA 1237 R+KR + + LDD EG STS + + +VS+ + C HP MC+ CG+ VDD S Sbjct: 2 RNKRCKTEKLDDLEGPQGSTSQGLIEEKLEVSLNKDTCTHPGSFGQMCILCGQRVDDESG 61 Query: 1236 VALGYIHKDLKVGSEELARLRNKDLKSLFCKKKXXXXXXXXXXXLNSTHFQHISPDEEYL 1057 V GYIHK L++G++E+ RLR+ D+K+L KK LNST H++ +EEYL Sbjct: 62 VTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHKKLYLVLDLDHTLLNSTQLNHLTAEEEYL 121 Query: 1056 KSQTDSLQDLPHGNLFQLDSMRMFTKLRPFVRTFLKEASNMFEMYVYTMGERPYAMEMAR 877 K Q+DS+QD+ G+LF L+ M M TKLRPFVRTFLKEAS MFEMY+YTMG+RPYA+EMA+ Sbjct: 122 KGQSDSMQDVSKGSLFMLEFMHMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEMAK 181 Query: 876 LLDPGKIYFDSRVISQADCTQKHQKGLDVVLGAESAVIILDDTEYVWRQHKENLILMDRY 697 LLDP K YF+ RVIS+ D TQKHQKGLDVVLG +SAV+ILDDTE W +HK+NLILM+RY Sbjct: 182 LLDPKKEYFNGRVISRDDGTQKHQKGLDVVLGQDSAVVILDDTENAWTKHKDNLILMERY 241 Query: 696 HYFSSSGRSFNLNNRSLSELKRDESEPDGALATILEVLKRIHXXXXXXXXXXXLMRRDVR 517 H+F+SS R F + RSLS+LK DESEPDGALA+IL++L++IH L RDVR Sbjct: 242 HFFASSCRQFGFDCRSLSQLKSDESEPDGALASILKILRQIH-HIFFDELDSDLASRDVR 300 Query: 516 QVLKTVRSEVLKGCKLVFSRVWKIGEQVENQRLWEVAEELGAICCKELDASVTHVVSTDM 337 QVLKTVR EVLK CK+VFSRV+ Q EN LW++AE+LGA C E D+SVTHVVS D Sbjct: 301 QVLKTVRKEVLKDCKIVFSRVFPTKFQPENHLLWKMAEQLGATCSTETDSSVTHVVSMDA 360 Query: 336 GTEKSRWAVKYKKFLVHPSWIEAANFLWQRQPEDSFAVNDSKKE 205 GTEKSRWAVK KFLVHP WIEAANF W +QPE+ F V+ +K + Sbjct: 361 GTEKSRWAVKENKFLVHPRWIEAANFFWLKQPEEKFPVSQTKNQ 404 >gb|KNA25939.1| hypothetical protein SOVF_002000 isoform B [Spinacia oleracea] Length = 452 Score = 468 bits (1203), 
Expect = e-129 Identities = 240/451 (53%), Positives = 315/451 (69%), Gaps = 1/451 (0%) Frame = -2 Query: 1554 FAALLNEELAGIXXXXXXXXXXXXXDQDHD-GEEEEVNVEDADFAAKRSKRQRLDTLDDD 1378 FAALL+ EL D D ++E+ N E A KR K +DT + Sbjct: 17 FAALLDAELDS---------DSSDGSPDQDCSDDEDNNAEGARI--KRRKVLEMDTRVEV 65 Query: 1377 EGIHCSTSLEPVPQSSDVSVQLEICAHPSLIKGMCVKCGRMVDDVSAVALGYIHKDLKVG 1198 +G + + E + + + C HP ++ MC+ CG+ +DD + VA GYIHKDL++G Sbjct: 66 QGSNSNGFTEKAIEEATT----DSCTHPGFLRDMCICCGKRMDDGAGVAFGYIHKDLRLG 121 Query: 1197 SEELARLRNKDLKSLFCKKKXXXXXXXXXXXLNSTHFQHISPDEEYLKSQTDSLQDLPHG 1018 S+E++RLR+ D+++L KK LNST + I+ +E+YLKSQTDS +D+ G Sbjct: 122 SDEVSRLRDADVRTLLRNKKLYLVLDLDHTLLNSTRLEDINSEEDYLKSQTDSFEDISKG 181 Query: 1017 NLFQLDSMRMFTKLRPFVRTFLKEASNMFEMYVYTMGERPYAMEMARLLDPGKIYFDSRV 838 +LF+LD +RM TKLRP+VRTFL+EASNMFEMY+YTMGER YA+EMA+LLDPG +YF+SRV Sbjct: 182 SLFRLDKIRMMTKLRPYVRTFLQEASNMFEMYIYTMGERAYAIEMAKLLDPGNVYFNSRV 241 Query: 837 ISQADCTQKHQKGLDVVLGAESAVIILDDTEYVWRQHKENLILMDRYHYFSSSGRSFNLN 658 IS+ADCT++HQKGLDVVLG +SAV+ILDDTE VW++HK+NLILM+RYHYFSSS R F N Sbjct: 242 ISKADCTRRHQKGLDVVLGKDSAVLILDDTEAVWQRHKDNLILMERYHYFSSSCRQFGFN 301 Query: 657 NRSLSELKRDESEPDGALATILEVLKRIHXXXXXXXXXXXLMRRDVRQVLKTVRSEVLKG 478 +SLSELK DE+E DGALAT+L VLK+IH RDVRQVL+ +R+E+L+G Sbjct: 302 CKSLSELKGDENEADGALATVLGVLKKIHSNFFNPEHGDDFAARDVRQVLRKIRNEILRG 361 Query: 477 CKLVFSRVWKIGEQVENQRLWEVAEELGAICCKELDASVTHVVSTDMGTEKSRWAVKYKK 298 CK+VFSRV+ Q EN LW++AE+LGA C E+D+SVTHVVS GTEKSRWAV+ Sbjct: 362 CKIVFSRVFSTESQAENHHLWKMAEQLGATCAVEVDSSVTHVVSEYAGTEKSRWAVQNGN 421 Query: 297 FLVHPSWIEAANFLWQRQPEDSFAVNDSKKE 205 FLVHP W+EAAN+LW ++PE F V +K + Sbjct: 422 FLVHPKWLEAANYLWSKKPEQQFPVELTKSK 452 >ref|XP_002324546.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa] gi|550318537|gb|EEF03111.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa] Length = 468 Score = 468 bits (1203), Expect = e-129 Identities = 235/422 (55%), Positives = 300/422 (71%) Frame = -2 Query: 1476 
QDHDGEEEEVNVEDADFAAKRSKRQRLDTLDDDEGIHCSTSLEPVPQSSDVSVQLEICAH 1297 QD + EE++ D+DF KR KR +++T++ E +TS + +S+ S+ EIC H Sbjct: 55 QDKEAEEDD----DSDFQRKRVKRSKVETVEIVEDDGGTTSFASLKHNSEASISKEICTH 110 Query: 1296 PSLIKGMCVKCGRMVDDVSAVALGYIHKDLKVGSEELARLRNKDLKSLFCKKKXXXXXXX 1117 P MC+ CG+++D S V GYIHK L++G++E+ RLRN D+K+L KK Sbjct: 111 PGSFGTMCIVCGQLLDGESGVTFGYIHKGLRLGNDEIVRLRNTDMKNLLRHKKLYLILDL 170 Query: 1116 XXXXLNSTHFQHISPDEEYLKSQTDSLQDLPHGNLFQLDSMRMFTKLRPFVRTFLKEASN 937 LNST H++ DEEYL QTDSLQD+ G+LF L SM+M TKLRPFVRTFLKEAS Sbjct: 171 DHTLLNSTQLMHMTLDEEYLNGQTDSLQDVSKGSLFMLSSMQMMTKLRPFVRTFLKEASQ 230 Query: 936 MFEMYVYTMGERPYAMEMARLLDPGKIYFDSRVISQADCTQKHQKGLDVVLGAESAVIIL 757 MFEMY+YTMG+R YA+EMA+LLDPG+ YF+++VIS+ D TQ+HQKGLDVVLG ESAV+IL Sbjct: 231 MFEMYIYTMGDRAYALEMAKLLDPGREYFNAKVISRDDGTQRHQKGLDVVLGQESAVLIL 290 Query: 756 DDTEYVWRQHKENLILMDRYHYFSSSGRSFNLNNRSLSELKRDESEPDGALATILEVLKR 577 DDTE W +HK+NLILM+RYH+F+SS F N +SLSE K DESE +GALA+IL+VL++ Sbjct: 291 DDTENAWMKHKDNLILMERYHFFASSCHQFGFNCKSLSEQKTDESESEGALASILKVLRK 350 Query: 576 IHXXXXXXXXXXXLMRRDVRQVLKTVRSEVLKGCKLVFSRVWKIGEQVENQRLWEVAEEL 397 IH + QVLKTVR +VLKGCK+VFSRV+ Q +N LW +AE+L Sbjct: 351 IHQIFFEDHILSLAL-----QVLKTVRKDVLKGCKIVFSRVFPTQSQADNHHLWRMAEQL 405 Query: 396 GAICCKELDASVTHVVSTDMGTEKSRWAVKYKKFLVHPSWIEAANFLWQRQPEDSFAVND 217 GA C ELD SVTHVVS D GTEKS WA+K+ KFLV P WIEAAN+ WQRQPE++F+ N Sbjct: 406 GATCSTELDPSVTHVVSKDSGTEKSHWALKHNKFLVQPGWIEAANYFWQRQPEENFSFNQ 465 Query: 216 SK 211 K Sbjct: 466 IK 467