BLASTX nr result
ID: Akebia27_contig00023057
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00023057
         (1289 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal doma...   159   2e-36
ref|XP_007043830.1| RNA polymerase II C-terminal domain phosphat...   153   2e-34
ref|XP_002304648.2| hypothetical protein POPTR_0003s16280g [Popu...   146   2e-32
ref|XP_004157633.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymera...   145   4e-32
ref|XP_004140651.1| PREDICTED: RNA polymerase II C-terminal doma...   145   4e-32
ref|XP_006438860.1| hypothetical protein CICLE_v10030535mg [Citr...   144   8e-32
ref|XP_006438858.1| hypothetical protein CICLE_v10030535mg [Citr...   144   8e-32
ref|XP_006438857.1| hypothetical protein CICLE_v10030535mg [Citr...   144   8e-32
gb|EXB81217.1| RNA polymerase II C-terminal domain phosphatase-l...   142   4e-31
emb|CBI35661.3| unnamed protein product [Vitis vinifera]              137   1e-29
ref|XP_004310239.1| PREDICTED: RNA polymerase II C-terminal doma...   130   1e-27
ref|XP_003530482.2| PREDICTED: RNA polymerase II C-terminal doma...   127   1e-26
ref|XP_002297869.2| CTD phosphatase-like protein 3 [Populus tric...   125   3e-26
ref|XP_006603006.1| PREDICTED: RNA polymerase II C-terminal doma...   125   4e-26
gb|EYU42076.1| hypothetical protein MIMGU_mgv1a000356mg [Mimulus...   114   9e-23
ref|XP_002881287.1| hypothetical protein ARALYDRAFT_482300 [Arab...   111   6e-22
ref|XP_007139315.1| hypothetical protein PHAVU_008G019000g [Phas...   110   1e-21
ref|XP_003621644.1| RNA polymerase II C-terminal domain phosphat...   107   1e-20
ref|XP_006662962.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymera...
106 2e-20 ref|XP_006341905.1| PREDICTED: RNA polymerase II C-terminal doma... 106 2e-20 >ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Vitis vinifera] Length = 1238 Score = 159 bits (403), Expect = 2e-36 Identities = 122/277 (44%), Positives = 155/277 (55%), Gaps = 14/277 (5%) Frame = -3 Query: 792 SVEEISEEDF-KQEAKVLN---PKGGDSRVW----MGDLLNY-PVSSNYGSGLYNFAWAQ 640 SVEEISEEDF KQE +VL PK D+RVW + DL Y S Y LYN AWAQ Sbjct: 25 SVEEISEEDFNKQEVRVLREAKPKA-DTRVWTMRDLQDLYKYHQACSGYTPRLYNLAWAQ 83 Query: 639 AVQNKPLSEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXSMKEVCNVIIDDSSEEID 460 AVQNKPL++I + D +E+SKRS S+ S KEV VIIDDS +E+D Sbjct: 84 AVQNKPLNDIFVMD---DEESKRSSSS------SNTSRDDSSSAKEVAKVIIDDSGDEMD 134 Query: 459 SKAQDVXXXXXXXXXXXXXXXELDSEMVEETEGGWSNANDSLPSDSGRNSEGEFEKQIKS 280 K DV LDSE + EGG + N+ P + E E +++KS Sbjct: 135 VKMDDVSEKEEGELEEGEID--LDSEPDVKDEGGVLDVNE--PEIDLK--ERELVERVKS 188 Query: 279 IRGALETVTVKYAEKSFHGVCLELQASLDSLKLM-----IMENGALDVDDLIQQSFTGIQ 115 I+ LE+VTV AEKSF GVC LQ +L SL+ + + E+ D L QQ I+ Sbjct: 189 IQEDLESVTVIEAEKSFSGVCSRLQNTLGSLQKVFGEKVVGESSVPTKDALAQQLINAIR 248 Query: 114 AINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFS 4 A+N VFCSMN Q+E NKD+F RLL+ V+ D+ +FS Sbjct: 249 ALNHVFCSMNSNQKELNKDVFSRLLSCVECGDSPIFS 285 >ref|XP_007043830.1| RNA polymerase II C-terminal domain phosphatase-like 3, putative [Theobroma cacao] gi|508707765|gb|EOX99661.1| RNA polymerase II C-terminal domain phosphatase-like 3, putative [Theobroma cacao] Length = 1290 Score = 153 bits (386), Expect = 2e-34 Identities = 116/280 (41%), Positives = 147/280 (52%), Gaps = 16/280 (5%) Frame = -3 Query: 792 SVEEISEEDF-KQEAKVL----NPKGGD----SRVW-MGDLLNYP-VSSNYGSGLYNFAW 646 S+EEISEEDF KQ+ K+L + KGG+ SRVW M DL YP V Y SGLYNFAW Sbjct: 44 SIEEISEEDFNKQDVKILKESKSSKGGEANSNSRVWTMQDLCKYPSVIRGYASGLYNFAW 103 Query: 645 AQAVQNKPLSEILMRDFE----SEEKSKRSGSNLLXXXXXXXXXXXXXSMKEVCNVIIDD 478 AQAVQNKPL+EI ++DFE E K+ + S S V+IDD Sbjct: 104 AQAVQNKPLNEIFVKDFEQPQQDENKNSKRSSPSSSVASVNSKEEKGSSGNLAVKVVIDD 163 
Query: 477 SSEEIDSKAQDVXXXXXXXXXXXXXXXELDSEMVEETEGGWSNANDSLPSDSGR-NSEGE 301 SE+ + + V +LDSE E+ L S+ G + E Sbjct: 164 DSED-EMEEDKVVNLDKEEGELEEGEIDLDSEPKEKV----------LSSEDGNVGNSDE 212 Query: 300 FEKQIKSIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGALDVDDLIQQSFTG 121 EK+ IRG LE VTV AEKSF GVC L +L+SL+ +I+E D LIQ +F Sbjct: 213 LEKRANLIRGVLEGVTVIEAEKSFEGVCSRLHNALESLRALILECSVPAKDALIQLAF-- 270 Query: 120 IQAINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFSP 1 AINS F ++N +EQN + RLL+ VK D +LF P Sbjct: 271 -GAINSAFVALNCNSKEQNVAILSRLLSIVKGHDPSLFPP 309 >ref|XP_002304648.2| hypothetical protein POPTR_0003s16280g [Populus trichocarpa] gi|550343308|gb|EEE79627.2| hypothetical protein POPTR_0003s16280g [Populus trichocarpa] Length = 1247 Score = 146 bits (368), Expect = 2e-32 Identities = 107/278 (38%), Positives = 149/278 (53%), Gaps = 14/278 (5%) Frame = -3 Query: 792 SVEEISEEDF-KQEAKVL---------NPKGGDSRVW-MGDLLNYPVSSNYGSGLYNFAW 646 SVEEISE+DF KQE V+ N +VW + DL Y V Y SGLYN AW Sbjct: 30 SVEEISEDDFNKQEVVVVKETPSSTTNNNSSSKQKVWTVRDLYKYQVGGGYMSGLYNLAW 89 Query: 645 AQAVQNKPLSEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXSMKEVCNVIIDDSSEE 466 AQAVQNKPL+E+ + + E ++ S++S + + ++ V+IDDS +E Sbjct: 90 AQAVQNKPLNELFV-EVEVDDSSQKSSVSSVNSSK-----------EDKRTVVIDDSGDE 137 Query: 465 IDSKAQDVXXXXXXXXXXXXXXXELDSEMVE-ETEGGWSNANDSLPSDSGRNSEGEFEKQ 289 +D +D E E E E G + + S+ G S + EK+ Sbjct: 138 MD------------------VVKVIDIEKEEGELEEGEIDLDSEGKSEGGMVSV-DTEKR 178 Query: 288 IKSIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIM--ENGALDVDDLIQQSFTGIQ 115 +KSIR LE+V+V +KSF VCL+L +L+SLK ++ ENG D L++ FT I Sbjct: 179 VKSIREDLESVSVIKDDKSFEAVCLKLHNALESLKELVRVNENGFPSKDSLVRLLFTAIG 238 Query: 114 AINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFSP 1 A+NS F SMN K +EQNK +F+R L+ V S D + FSP Sbjct: 239 AVNSFFSSMNQKLKEQNKGVFMRFLSLVNSHDPSFFSP 276 >ref|XP_004157633.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain phosphatase-like 3-like [Cucumis sativus] Length = 1249 Score = 145 bits (366), Expect = 4e-32 Identities = 101/279 (36%), Positives = 152/279 (54%), Gaps 
= 15/279 (5%) Frame = -3 Query: 792 SVEEISEEDFKQEAKVLNPK--------GGDSRVW-MGDLL-NYPVSSN-YGSGLYNFAW 646 SVEEISEEDF + +PK ++RVW M DL NYP + Y SGLYN AW Sbjct: 22 SVEEISEEDFNKLDSSASPKVVVPSKDSNRETRVWTMSDLYKNYPAMRHGYASGLYNLAW 81 Query: 645 AQAVQNKPLSEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXSMKEVCNVIIDDSSEE 466 AQAVQNKPL++I + + + +EKSK S S + KE V+IDDS +E Sbjct: 82 AQAVQNKPLNDIFVMEADLDEKSKHSSST----PFGNAKDDGSNTTKEEDRVVIDDSGDE 137 Query: 465 IDSKAQDVXXXXXXXXXXXXXXXELDSEMVEETEGGWSNANDSLPSD-SGRNSE---GEF 298 ++ D ++D+E VEE + +DS D +G+ + E Sbjct: 138 MNC---DNANGEKEEGELEEGEIDMDTEFVEEVADSKAMLSDSRDMDINGQEFDLETKEL 194 Query: 297 EKQIKSIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGALDVDDLIQQSFTGI 118 ++ +K I+ L+ VT+ A+KSF VC ++ +S+++ ++ D LIQ+ + + Sbjct: 195 DELLKFIQKTLDGVTIDAAQKSFQEVCSQIHSSIETFVELLQGKVVPRKDALIQRLYAAL 254 Query: 117 QAINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFSP 1 + INSVFCSMN ++E++K+ RLL++VK+ D LFSP Sbjct: 255 RLINSVFCSMNLSEKEEHKEHLSRLLSYVKNCDPPLFSP 293 >ref|XP_004140651.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Cucumis sativus] Length = 1249 Score = 145 bits (366), Expect = 4e-32 Identities = 101/279 (36%), Positives = 152/279 (54%), Gaps = 15/279 (5%) Frame = -3 Query: 792 SVEEISEEDFKQEAKVLNPK--------GGDSRVW-MGDLL-NYPVSSN-YGSGLYNFAW 646 SVEEISEEDF + +PK ++RVW M DL NYP + Y SGLYN AW Sbjct: 22 SVEEISEEDFNKLDSSASPKVVVPSKDSNRETRVWTMSDLYKNYPAMRHGYASGLYNLAW 81 Query: 645 AQAVQNKPLSEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXSMKEVCNVIIDDSSEE 466 AQAVQNKPL++I + + + +EKSK S S + KE V+IDDS +E Sbjct: 82 AQAVQNKPLNDIFVMEADLDEKSKHSSST----PFGNAKDDGSNTTKEEDRVVIDDSGDE 137 Query: 465 IDSKAQDVXXXXXXXXXXXXXXXELDSEMVEETEGGWSNANDSLPSD-SGRNSE---GEF 298 ++ D ++D+E VEE + +DS D +G+ + E Sbjct: 138 MNC---DNANGEKEEGELEEGEIDMDTEFVEEVADSKAMLSDSRDMDINGQEFDLETKEL 194 Query: 297 EKQIKSIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGALDVDDLIQQSFTGI 118 ++ +K I+ L+ VT+ A+KSF VC ++ +S+++ ++ D LIQ+ + + Sbjct: 195 DELLKFIQKTLDGVTIDAAQKSFQEVCSQIHSSIETFVELLQGKVVPRKDALIQRLYAAL 254 Query: 117 
QAINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFSP 1 + INSVFCSMN ++E++K+ RLL++VK+ D LFSP Sbjct: 255 RLINSVFCSMNLSEKEEHKEHLSRLLSYVKNCDPPLFSP 293 >ref|XP_006438860.1| hypothetical protein CICLE_v10030535mg [Citrus clementina] gi|568858958|ref|XP_006483010.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Citrus sinensis] gi|557541056|gb|ESR52100.1| hypothetical protein CICLE_v10030535mg [Citrus clementina] Length = 1234 Score = 144 bits (363), Expect = 8e-32 Identities = 105/278 (37%), Positives = 148/278 (53%), Gaps = 15/278 (5%) Frame = -3 Query: 792 SVEEISEEDFK----------QEAKVLNPKGGDS--RVW-MGDLLN-YP-VSSNYGSGLY 658 SVEEISEEDFK +E K + GG++ RVW M DL N YP + YG GL+ Sbjct: 15 SVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNKYPAICRGYGPGLH 74 Query: 657 NFAWAQAVQNKPLSEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXSMKEVCNVIIDD 478 N AWAQAVQNKPL+EI + + E ++ SKRS K V V+IDD Sbjct: 75 NLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVASVNSGAAAGKDDKKVVEKVVIDD 134 Query: 477 SSEEIDSKAQDVXXXXXXXXXXXXXXXELDSEMVEETEGGWSNANDSLPSDSGRNSEGEF 298 S +EI+ + + +EE E ++S S + E Sbjct: 135 SGDEIEKEEGE----------------------LEEGEIELDLESESNEKVSEQVKEEMK 172 Query: 297 EKQIKSIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGALDVDDLIQQSFTGI 118 ++SIR ALE+V + SF GVC +L+ +L+SL+ ++ EN D LIQ +F+ + Sbjct: 173 LINVESIREALESVL--RGDISFEGVCSKLEFTLESLRELVNENNVPTKDALIQLAFSAV 230 Query: 117 QAINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFS 4 Q+++SVFCSMN +EQNK++ RLL+ +KS + LFS Sbjct: 231 QSVHSVFCSMNHVLKEQNKEILSRLLSVIKSHEPPLFS 268 >ref|XP_006438858.1| hypothetical protein CICLE_v10030535mg [Citrus clementina] gi|557541054|gb|ESR52098.1| hypothetical protein CICLE_v10030535mg [Citrus clementina] Length = 1208 Score = 144 bits (363), Expect = 8e-32 Identities = 105/278 (37%), Positives = 148/278 (53%), Gaps = 15/278 (5%) Frame = -3 Query: 792 SVEEISEEDFK----------QEAKVLNPKGGDS--RVW-MGDLLN-YP-VSSNYGSGLY 658 SVEEISEEDFK +E K + GG++ RVW M DL N YP + YG GL+ Sbjct: 15 SVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNKYPAICRGYGPGLH 74 Query: 657 
NFAWAQAVQNKPLSEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXSMKEVCNVIIDD 478 N AWAQAVQNKPL+EI + + E ++ SKRS K V V+IDD Sbjct: 75 NLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVASVNSGAAAGKDDKKVVEKVVIDD 134 Query: 477 SSEEIDSKAQDVXXXXXXXXXXXXXXXELDSEMVEETEGGWSNANDSLPSDSGRNSEGEF 298 S +EI+ + + +EE E ++S S + E Sbjct: 135 SGDEIEKEEGE----------------------LEEGEIELDLESESNEKVSEQVKEEMK 172 Query: 297 EKQIKSIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGALDVDDLIQQSFTGI 118 ++SIR ALE+V + SF GVC +L+ +L+SL+ ++ EN D LIQ +F+ + Sbjct: 173 LINVESIREALESVL--RGDISFEGVCSKLEFTLESLRELVNENNVPTKDALIQLAFSAV 230 Query: 117 QAINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFS 4 Q+++SVFCSMN +EQNK++ RLL+ +KS + LFS Sbjct: 231 QSVHSVFCSMNHVLKEQNKEILSRLLSVIKSHEPPLFS 268 >ref|XP_006438857.1| hypothetical protein CICLE_v10030535mg [Citrus clementina] gi|567892677|ref|XP_006438859.1| hypothetical protein CICLE_v10030535mg [Citrus clementina] gi|557541053|gb|ESR52097.1| hypothetical protein CICLE_v10030535mg [Citrus clementina] gi|557541055|gb|ESR52099.1| hypothetical protein CICLE_v10030535mg [Citrus clementina] Length = 1118 Score = 144 bits (363), Expect = 8e-32 Identities = 105/278 (37%), Positives = 148/278 (53%), Gaps = 15/278 (5%) Frame = -3 Query: 792 SVEEISEEDFK----------QEAKVLNPKGGDS--RVW-MGDLLN-YP-VSSNYGSGLY 658 SVEEISEEDFK +E K + GG++ RVW M DL N YP + YG GL+ Sbjct: 15 SVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNKYPAICRGYGPGLH 74 Query: 657 NFAWAQAVQNKPLSEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXSMKEVCNVIIDD 478 N AWAQAVQNKPL+EI + + E ++ SKRS K V V+IDD Sbjct: 75 NLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVASVNSGAAAGKDDKKVVEKVVIDD 134 Query: 477 SSEEIDSKAQDVXXXXXXXXXXXXXXXELDSEMVEETEGGWSNANDSLPSDSGRNSEGEF 298 S +EI+ + + +EE E ++S S + E Sbjct: 135 SGDEIEKEEGE----------------------LEEGEIELDLESESNEKVSEQVKEEMK 172 Query: 297 EKQIKSIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGALDVDDLIQQSFTGI 118 ++SIR ALE+V + SF GVC +L+ +L+SL+ ++ EN D LIQ +F+ + Sbjct: 173 LINVESIREALESVL--RGDISFEGVCSKLEFTLESLRELVNENNVPTKDALIQLAFSAV 230 Query: 117 
QAINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFS 4 Q+++SVFCSMN +EQNK++ RLL+ +KS + LFS Sbjct: 231 QSVHSVFCSMNHVLKEQNKEILSRLLSVIKSHEPPLFS 268 >gb|EXB81217.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Morus notabilis] Length = 1301 Score = 142 bits (357), Expect = 4e-31 Identities = 112/294 (38%), Positives = 153/294 (52%), Gaps = 30/294 (10%) Frame = -3 Query: 792 SVEEISEEDF-KQEA------KVLN--------PKGGDSRVW-MGDLL-NYPVSSNYGSG 664 SVEEISEEDF KQE KV++ K GDSRVW M DL NYP Y +G Sbjct: 23 SVEEISEEDFNKQEGNGTGSGKVMSVSDSNSKESKFGDSRVWTMRDLYANYPGFRGYTTG 82 Query: 663 LYNFAWAQAVQNKPLSEILMRDFESEEKSK--RSGSNLLXXXXXXXXXXXXXSMKEVCNV 490 LYN AWAQAVQNKPL+EI + D ++++ S+ S ++ +++V V Sbjct: 83 LYNLAWAQAVQNKPLNEIFVMDVDADDSSRVVLSSASPAVNSGRREGKNGVKEVEKVEKV 142 Query: 489 IIDDSSEEIDSKAQDVXXXXXXXXXXXXXXXELDSEMVEETEGGWSNAND---------S 337 +IDDS++E++ + +L+SE ++ G + D Sbjct: 143 VIDDSADEMEEGELE------------EGEIDLESEPTQKPAGEEAKDGDLNCEAENVGG 190 Query: 336 LPSDSGRNSEGEFEKQIKSIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMEN--G 163 L DS R+ E EK++ I L +V V AEKSF VC LQ +L+SL+ ++ E Sbjct: 191 LEVDSRRD---ELEKRVDLIWETLGSVNVVNAEKSFEEVCSRLQRTLESLRGVLSEKEFS 247 Query: 162 ALDVDDLIQQSFTGIQAINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFSP 1 D +IQ S T IQ +NSVFCSM+ Q+EQ K+ RL VK+ T LFSP Sbjct: 248 FPTKDVVIQMSITAIQVVNSVFCSMSVNQKEQKKETLSRLFCSVKNCGTPLFSP 301 >emb|CBI35661.3| unnamed protein product [Vitis vinifera] Length = 1184 Score = 137 bits (345), Expect = 1e-29 Identities = 111/277 (40%), Positives = 140/277 (50%), Gaps = 14/277 (5%) Frame = -3 Query: 792 SVEEISEEDF-KQEAKVLN---PKGGDSRVW----MGDLLNY-PVSSNYGSGLYNFAWAQ 640 SVEEISEEDF KQE +VL PK D+RVW + DL Y S Y LYN AWAQ Sbjct: 65 SVEEISEEDFNKQEVRVLREAKPKA-DTRVWTMRDLQDLYKYHQACSGYTPRLYNLAWAQ 123 Query: 639 AVQNKPLSEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXSMKEVCNVIIDDSSEEID 460 AVQNKPL++I VIIDDS +E+D Sbjct: 124 AVQNKPLNDIF--------------------------------------VIIDDSGDEMD 145 Query: 459 SKAQDVXXXXXXXXXXXXXXXELDSEMVEETEGGWSNANDSLPSDSGRNSEGEFEKQIKS 280 K DV LDSE + EGG + N+ P + E E +++KS Sbjct: 146 
VKMDDVSEKEEGELEEGEID--LDSEPDVKDEGGVLDVNE--PEIDLK--ERELVERVKS 199 Query: 279 IRGALETVTVKYAEKSFHGVCLELQASLDSLKLM-----IMENGALDVDDLIQQSFTGIQ 115 I+ LE+VTV AEKSF GVC LQ +L SL+ + + E+ D L QQ I+ Sbjct: 200 IQEDLESVTVIEAEKSFSGVCSRLQNTLGSLQKVFGEKVVGESSVPTKDALAQQLINAIR 259 Query: 114 AINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFS 4 A+N VFCSMN Q+E NKD+F RLL+ V+ D+ +FS Sbjct: 260 ALNHVFCSMNSNQKELNKDVFSRLLSCVECGDSPIFS 296 >ref|XP_004310239.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Fragaria vesca subsp. vesca] Length = 1230 Score = 130 bits (327), Expect = 1e-27 Identities = 94/264 (35%), Positives = 139/264 (52%), Gaps = 8/264 (3%) Frame = -3 Query: 792 SVEEISEEDF-KQEAKVLNPK-----GGDSRVW-MGDLLNYPVSSNYGSG-LYNFAWAQA 637 SVEEISEEDF KQE+K + PK G +R W ++L +P G G L N AWAQA Sbjct: 23 SVEEISEEDFVKQESKAVEPKSNGGSGDGARFWTFHEVLAHPHFRGIGGGGLANLAWAQA 82 Query: 636 VQNKPLSEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXSMKEVCNVIIDDSSEEIDS 457 VQNKP +++L++ +S+EKSK+ V+I DS +E+D Sbjct: 83 VQNKPFNDLLVK-LDSDEKSKQQQQQRSSVSSGNE------------KVVIIDSGDEMDV 129 Query: 456 KAQDVXXXXXXXXXXXXXXXELDSEMVEETEGGWSNANDSLPSDSGRNSEGEFEKQIKSI 277 + ++ E +EE E G+ + +G G +EK++ + Sbjct: 130 EKEE--------------------EELEEGEIGFDSECGDNDKAAGSVGNGVWEKRVNLL 169 Query: 276 RGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGALDVDDLIQQSFTGIQAINSVF 97 R ALE++T+ AEKSF VC SL+SL+ ++ E + L+QQ F ++AI+SVF Sbjct: 170 REALESLTITEAEKSFGDVCHRFLDSLESLRGVLSEINVSTKEALVQQLFNAVRAISSVF 229 Query: 96 CSMNPKQQEQNKDLFLRLLTHVKS 25 SM+ Q+EQNKD+ R+L+ KS Sbjct: 230 RSMSADQKEQNKDVLSRILSSAKS 253 >ref|XP_003530482.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Glycine max] Length = 1261 Score = 127 bits (319), Expect = 1e-26 Identities = 101/275 (36%), Positives = 138/275 (50%), Gaps = 11/275 (4%) Frame = -3 Query: 792 SVEEISEEDF-KQEAKVLN----PKGGDSRVW-MGDLLN-YP-VSSNYGSGLYNFAWAQA 637 SVEEIS EDF KQ+ K+LN P G D+RVW + DL + YP + Y SGLYN AWAQA Sbjct: 35 SVEEISAEDFNKQDVKLLNNNNKPNGSDARVWAVHDLYSKYPTICRGYASGLYNLAWAQA 94 Query: 636 
VQNKPLSEILMRDFESEEK--SKRSGSNLLXXXXXXXXXXXXXSMKEVCNVIIDDSSEEI 463 VQNKPL++I + + +S+ S R+ S+ L K+V V +D E+ Sbjct: 95 VQNKPLNDIFVMEVDSDANANSNRNSSHRLASVAVNP--------KDVVVVDVDKEEGEL 146 Query: 462 DSKAQDVXXXXXXXXXXXXXXXELDSEMVEETEGGWSNANDSLPSDSGRNSEGEFEKQIK 283 + D D+E E E +DS D + + E+ Sbjct: 147 EEGEIDA-----------------DAEPEGEAESVVVAVSDSEKLDDVKMDVSDSEQL-- 187 Query: 282 SIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGALDVDDLIQQSFTGIQAINS 103 RG LE VTV +SF C +LQ +L + + + DDL++ SF + + S Sbjct: 188 GARGVLEGVTVANVVESFAQTCSKLQNTLPEV---LSRPAGSEKDDLVRLSFNATEVVYS 244 Query: 102 VFCSMNPKQQEQNKDLFLRLLTHVK-SQDTTLFSP 1 VFCSM+ ++EQNKD LRLL+ VK Q LFSP Sbjct: 245 VFCSMDSSEKEQNKDSILRLLSFVKDQQQAQLFSP 279 >ref|XP_002297869.2| CTD phosphatase-like protein 3 [Populus trichocarpa] gi|550347145|gb|EEE82674.2| CTD phosphatase-like protein 3 [Populus trichocarpa] Length = 1190 Score = 125 bits (315), Expect = 3e-26 Identities = 96/273 (35%), Positives = 130/273 (47%), Gaps = 9/273 (3%) Frame = -3 Query: 792 SVEEISEEDFKQEAKVL---NPKGGDS--RVW-MGDLLNYPVSSNYGSGLYNFAWAQAVQ 631 SVEEISEEDF ++ V+ P +S +VW + DL Y V Y SGLYN AWA+AVQ Sbjct: 28 SVEEISEEDFNKQEVVIVKETPSSNNSSQKVWTVRDLYKYQVGGGYMSGLYNLAWARAVQ 87 Query: 630 NKPLSEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXSMKEVCNVIIDDSSEEIDS-K 454 NKPL+E+ V+IDDS +E+D K Sbjct: 88 NKPLNEL--------------------------------------TVVIDDSGDEMDVVK 109 Query: 453 AQDVXXXXXXXXXXXXXXXELDSEMVEETEGGWSNANDSLPSDSGRNSEGEFEKQIKSIR 274 D+ + E E EG ++ + S + E ++KSIR Sbjct: 110 VIDI-----------------EKEEGELEEGEIDLDSEPVVVQSEGMVSVDVENRVKSIR 152 Query: 273 GALETVTVKYAEKSFHGVCLELQASLDSLKLMI--MENGALDVDDLIQQSFTGIQAINSV 100 LE+V+V EKSF VCL+L L+SLK ++ +N D L+Q F I+ +NSV Sbjct: 153 KDLESVSVIETEKSFEAVCLKLHKVLESLKELVGGNDNSFPSKDGLVQLLFMAIRVVNSV 212 Query: 99 FCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFSP 1 FCSMN K +EQNK +F R + + S FSP Sbjct: 213 FCSMNKKLKEQNKGVFSRFFSLLNSHYPPFFSP 245 >ref|XP_006603006.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Glycine max] Length = 1257 Score = 125 bits (314), 
Expect = 4e-26 Identities = 96/274 (35%), Positives = 136/274 (49%), Gaps = 10/274 (3%) Frame = -3 Query: 792 SVEEISEEDF-KQEAKVLN----PKGGDSRVW-MGDLLN-YP-VSSNYGSGLYNFAWAQA 637 SVEEIS EDF KQ+ KVLN P G D+RVW + DL + YP + Y SGLYN AWAQA Sbjct: 35 SVEEISAEDFNKQDVKVLNNNNKPNGSDARVWAVHDLYSKYPTICRGYASGLYNLAWAQA 94 Query: 636 VQNKPLSEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXSMKEVCNVIIDDSSEEIDS 457 VQNKPL++I + + +S+ + + +N + K+V V +D E++ Sbjct: 95 VQNKPLNDIFVMEVDSDANANSNSNN------SNRLASVAVNPKDVVVVDVDKEEGELEE 148 Query: 456 KAQDVXXXXXXXXXXXXXXXEL-DSEMVEETEGGWSNANDSLPSDSGRNSEGEFEKQIKS 280 D + DSE +++ + SN+ Sbjct: 149 GEIDADAEPEGEAESVVAVPVVSDSEKLDDVKRDVSNSEQL------------------G 190 Query: 279 IRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGALDVDDLIQQSFTGIQAINSV 100 +RG LE VTV +SF C +LQ +L + + + DDL++ SF + + SV Sbjct: 191 VRGVLEGVTVANVAESFAQTCSKLQNALPEV---LSRPADSERDDLVRLSFNATEVVYSV 247 Query: 99 FCSMNPKQQEQNKDLFLRLLTHVK-SQDTTLFSP 1 FCSM+ ++EQNKD LRLL+ VK Q LFSP Sbjct: 248 FCSMDSLKKEQNKDSILRLLSFVKDQQQAQLFSP 281 >gb|EYU42076.1| hypothetical protein MIMGU_mgv1a000356mg [Mimulus guttatus] Length = 1220 Score = 114 bits (285), Expect = 9e-23 Identities = 94/298 (31%), Positives = 135/298 (45%), Gaps = 34/298 (11%) Frame = -3 Query: 792 SVEEISEEDFKQEAKVLNPK---------------------------------GGDSRVW 712 S+EEISEEDF + + L P GG +RVW Sbjct: 28 SIEEISEEDFNAK-QALQPSPPPAPPLKSSLNSSHINVVTSNNNNNNSNNSAGGGGARVW 86 Query: 711 -MGDLLNYPVSSNYGSGLYNFAWAQAVQNKPLSEILMRDFESEEKSKRSGSNLLXXXXXX 535 M DL Y V+S + GLYN AWAQAV NK L E+LM +E SN Sbjct: 87 TMKDLYEYQVASKHYPGLYNLAWAQAVNNKSLDEVLMM----KEDGNNDRSNGGISDTSS 142 Query: 534 XXXXXXXSMKEVCNVIIDDSSEEIDSKAQDVXXXXXXXXXXXXXXXELDSEMVEETEGGW 355 K V +V ++ EE + + ++ LDSE+V Sbjct: 143 SKSSKTNDSKVVIDVEVEGGMEEGELEEGEID---------------LDSELVVR----- 182 Query: 354 SNANDSLPSDSGRNSEGEFEKQIKSIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMI 175 N + ++ ++S S +++ SI+ LE++ V A S+H +C L+ ++ SL+ M+ Sbjct: 183 -NMDFNVETNSNEKS-----RRVDSIKRELESLNVADAIISYHRLCSSLKNTIVSLQEMV 236 Query: 174 
MENGALDVDDLIQQSFTGIQAINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFSP 1 +E + D L+Q T IQ + SVF SM+PK +EQNK + RLL V S LFSP Sbjct: 237 LEGSFAEKDTLVQLLLTAIQTLYSVFSSMSPKLKEQNKPILSRLLARVTSLKPPLFSP 294 >ref|XP_002881287.1| hypothetical protein ARALYDRAFT_482300 [Arabidopsis lyrata subsp. lyrata] gi|297327126|gb|EFH57546.1| hypothetical protein ARALYDRAFT_482300 [Arabidopsis lyrata subsp. lyrata] Length = 1248 Score = 111 bits (278), Expect = 6e-22 Identities = 84/246 (34%), Positives = 120/246 (48%), Gaps = 4/246 (1%) Frame = -3 Query: 729 GDSRVW-MGDLLN-YPVSSNYG-SGLYNFAWAQAVQNKPLSEILMRDFESEEKSKRSGSN 559 G+SRVW M DLL YP Y SGL NFAW+QAVQNK L+E L+ D+E E K Sbjct: 72 GNSRVWTMEDLLTKYPGYRLYATSGLSNFAWSQAVQNKSLNEGLVMDYEPRESDK----- 126 Query: 558 LLXXXXXXXXXXXXXSMKEVCNVIIDDSSEEIDSKAQDVXXXXXXXXXXXXXXXELDSEM 379 ++I+DS +E K + L + + Sbjct: 127 ----------------------IVIEDSGDE---KEEGELEEGEIDLVENASDDNLVASV 161 Query: 378 VEETEGGWSNANDSLPSDSGRNSEGEFEKQIKSIRGALETVTVKYAEKSFHGVCLELQAS 199 +ETE + D + D E + EK++K IRG LE+ ++ A+ F GVC + + Sbjct: 162 DKETESVVLISADKV-EDDRIQKEIDLEKKVKLIRGVLESTSLVEAQTGFEGVCSRILGA 220 Query: 198 LDSLKLMIMENGALDV-DDLIQQSFTGIQAINSVFCSMNPKQQEQNKDLFLRLLTHVKSQ 22 L+SL+ ++ +N D L+Q SF +Q INSVFCS+N +E+NK+ RLLT V Sbjct: 221 LESLRELVSDNDDFPKRDTLVQLSFASLQTINSVFCSLNNVSKERNKETMSRLLTLVNDH 280 Query: 21 DTTLFS 4 + S Sbjct: 281 FSRFLS 286 >ref|XP_007139315.1| hypothetical protein PHAVU_008G019000g [Phaseolus vulgaris] gi|561012448|gb|ESW11309.1| hypothetical protein PHAVU_008G019000g [Phaseolus vulgaris] Length = 1272 Score = 110 bits (276), Expect = 1e-21 Identities = 93/273 (34%), Positives = 134/273 (49%), Gaps = 9/273 (3%) Frame = -3 Query: 792 SVEEISEEDF-KQEAKVLN---PKGGDSRVW-MGDLLN-YP-VSSNYGSGLYNFAWAQAV 634 SVEEISE DF KQ+ KV N P G D+RVW + D+ YP + Y SGLYN AWAQAV Sbjct: 35 SVEEISEADFNKQDVKVNNNNKPNGSDARVWSVRDIYTKYPTICRGYASGLYNLAWAQAV 94 Query: 633 QNKPLSEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXSMKEVCNVIIDDSSEEIDSK 454 QNKPL++I + + +SE + + +N + KEV V +D E++ Sbjct: 95 
QNKPLNDIFVMELDSEANANSNSNN------SNRPSSVSVNPKEVMVVDVDREEGELEEG 148 Query: 453 AQDVXXXXXXXXXXXXXXXELDSEMVEETEGGWSNANDSL-PSDSGRNSEGEFEKQIKSI 277 D + D E E+ S ++++ S+ +G + + + Sbjct: 149 EIDA---------------DADPEAEAESVVAASVVSETVSDSEQFGVKKGVSDSEQLGV 193 Query: 276 RGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGALDVDDLIQQSFTGIQAINSVF 97 R LE VTV +SF L L++L + + DDLI+ SF I+ + SVF Sbjct: 194 RDVLEGVTVANVAESFAQTSSRL---LNALPQVFSRPADSEKDDLIRLSFNAIEVVYSVF 250 Query: 96 CSMNPKQQEQNKDLFLRLLTHVK-SQDTTLFSP 1 SM+ +EQNK+ LRLL+ K + LFSP Sbjct: 251 RSMDSSDKEQNKNSILRLLSSAKDKKQAQLFSP 283 >ref|XP_003621644.1| RNA polymerase II C-terminal domain phosphatase-like protein [Medicago truncatula] gi|355496659|gb|AES77862.1| RNA polymerase II C-terminal domain phosphatase-like protein [Medicago truncatula] Length = 1213 Score = 107 bits (266), Expect = 1e-20 Identities = 87/280 (31%), Positives = 133/280 (47%), Gaps = 23/280 (8%) Frame = -3 Query: 792 SVEEISEEDFKQ--EAKVLNPK------------------GGDSRVW-MGDLLN-YP-VS 682 S+EEI+EEDFK+ + KV N GGDSRVW + DL + YP + Sbjct: 35 SLEEITEEDFKKGDDVKVNNSDVKTDKSDNKVKTGGGGGGGGDSRVWAVQDLYSKYPTIC 94 Query: 681 SNYGSGLYNFAWAQAVQNKPLSEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXSMKE 502 Y SGLYN AWAQAVQNKPL++I + + + + + S S KE Sbjct: 95 RGYASGLYNLAWAQAVQNKPLNDIFVMELDKNANANSNNSG-------NKDGELNKSSKE 147 Query: 501 VCNVIIDDSSEEIDSKAQDVXXXXXXXXXXXXXXXELDSEMVEETEGGWSNANDSLPSDS 322 + V++DD E+ + + ++ E +G + + S++ Sbjct: 148 I--VVVDDDDEKEEGELEE-----------------------GEIDGDADDDCVIVGSEN 182 Query: 321 GRNSEGEFEKQIKSIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGALDVDDL 142 NSE + +RG LE VTV +SF C +Q +L S ++ + DDL Sbjct: 183 FSNSE------VLGVRGVLEGVTVASVAESFAETCRRIQGTLQSKVFSGFDSA--EKDDL 234 Query: 141 IQQSFTGIQAINSVFCSMNPKQQEQNKDLFLRLLTHVKSQ 22 ++ F ++ + SVFC M+ Q+E+NKD RLL+ +K+Q Sbjct: 235 VRLLFNAVEVVYSVFCCMDNLQKEENKDNISRLLSFLKNQ 274 >ref|XP_006662962.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain phosphatase-like 3-like [Oryza brachyantha] Length = 1267 Score = 106 bits (265), Expect = 
2e-20 Identities = 82/280 (29%), Positives = 131/280 (46%), Gaps = 16/280 (5%) Frame = -3 Query: 792 SVEEISEEDFKQEAKVLNPKGGD-------------SRVWMGDLLNYPVSSNYGSGLYNF 652 S+EEIS +DFK+E+ G SRVWMG Y + +Y ++F Sbjct: 43 SLEEISADDFKKESSGGGGGAGAGAGTGGGVAAAQRSRVWMG----YSMPRSYAPAFHSF 98 Query: 651 AWAQAVQNKPLSEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXSMKEVCNVIIDDSS 472 AWAQAVQNKPL D + E + D+ Sbjct: 99 AWAQAVQNKPLVPRAAADEDEVEHVVDTS---------------------------DEEK 131 Query: 471 EEIDSKAQDVXXXXXXXXXXXXXXXELDSEMVEETEGGWSNANDSLPSDSGRNSEG-EFE 295 EE + + + LDS++ E+ E S A D + +G E +F+ Sbjct: 132 EEGEIEEGEAVQSSESPPRAQPETIVLDSDVPEKPE---SAAMDGVTIPAGAEEEDMDFD 188 Query: 294 KQIKSIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGA--LDVDDLIQQSFTG 121 +++ SI LET++++ AEKSF G C L+ S ++LK + E G+ +D L+QQ+F Sbjct: 189 QRVGSILEELETISIEEAEKSFEGACTRLRTSFENLKPLFPETGSPMPMLDTLVQQAFIA 248 Query: 120 IQAINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFSP 1 I I +V S + ++EQ K++ L+LL H+K++ + + +P Sbjct: 249 IDTITTVANSYDMPKREQTKNMLLKLLFHIKNRYSYMLTP 288 >ref|XP_006341905.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3-like [Solanum tuberosum] Length = 1218 Score = 106 bits (265), Expect = 2e-20 Identities = 87/277 (31%), Positives = 129/277 (46%), Gaps = 14/277 (5%) Frame = -3 Query: 792 SVEEISEEDFKQE-------AKVLNPKGGD------SRVW-MGDLLNYPVSSNYGSGLYN 655 SVEEISE+ F ++ K+ + + + +RVW M D YP+S +Y GLYN Sbjct: 20 SVEEISEDAFNRQDPPTTTKIKIASNENQNQNSTTTTRVWTMRDAYKYPISRDYARGLYN 79 Query: 654 FAWAQAVQNKPLSEILMRDFESEEKSKRSGSNLLXXXXXXXXXXXXXSMKEVCNVIIDDS 475 AWAQAVQNKPL E+ + ++ + + +N+ K + +V +DD Sbjct: 80 LAWAQAVQNKPLDELFVMTSDNSNQCANANANV--------------ESKVIIDVDVDDD 125 Query: 474 SEEIDSKAQDVXXXXXXXXXXXXXXXELDSEMVEETEGGWSNANDSLPSDSGRNSEGEFE 295 ++E + E+ EE E A+ L F Sbjct: 126 AKE-------------------------EGEL-EEGEIDLDAADLVL----------NFG 149 Query: 294 KQIKSIRGALETVTVKYAEKSFHGVCLELQASLDSLKLMIMENGALDVDDLIQQSFTGIQ 115 K+ +R L++VT+ KSF VC +LQ SL +L + + D+ LIQ T ++ Sbjct: 150 KEANFVREQLQSVTLDETHKSFSMVCSKLQTSLLALGELALSQDKNDI--LIQLFMTALR 207 
Query: 114 AINSVFCSMNPKQQEQNKDLFLRLLTHVKSQDTTLFS 4 INSVF SMN Q++QN D+ RLL H K+Q L S Sbjct: 208 TINSVFYSMNQDQKQQNTDILSRLLFHAKTQLPALLS 244