BLASTX nr result
ID: Stemona21_contig00004688
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Stemona21_contig00004688
         (2796 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_004962192.1| PREDICTED: RNA polymerase II C-terminal doma...   494   e-137
ref|XP_002437361.1| hypothetical protein SORBIDRAFT_10g025580 [S...   494   e-137
gb|AFW77884.1| CPL3 [Zea mays]                                        491   e-136
ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Popu...   490   e-135
ref|XP_006654357.1| PREDICTED: RNA polymerase II C-terminal doma...   490   e-135
ref|NP_001152445.1| CPL3 [Zea mays] gi|195656359|gb|ACG47647.1| ...   489   e-135
gb|EMS57931.1| RNA polymerase II C-terminal domain phosphatase-l...   485   e-134
ref|XP_003566293.1| PREDICTED: RNA polymerase II C-terminal doma...   484   e-134
ref|NP_001055440.1| Os05g0390500 [Oryza sativa Japonica Group] g...   482   e-133
gb|EOY32064.1| RNA polymerase II ctd phosphatase, putative isofo...   481   e-132
dbj|BAK07377.1| predicted protein [Hordeum vulgare subsp. vulgare]    480   e-132
ref|XP_002439741.1| hypothetical protein SORBIDRAFT_09g019310 [S...   479   e-132
ref|XP_002324546.2| hypothetical protein POPTR_0018s11760g [Popu...   477   e-131
ref|XP_003579679.1| PREDICTED: RNA polymerase II C-terminal doma...   477   e-131
gb|EMJ05544.1| hypothetical protein PRUPE_ppa005647mg [Prunus pe...   476   e-131
gb|EEC79156.1| hypothetical protein OsI_19829 [Oryza sativa Indi...   476   e-131
ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal doma...   471   e-130
ref|XP_006343662.1| PREDICTED: RNA polymerase II C-terminal doma...   469   e-129
ref|XP_004242583.1| PREDICTED: RNA polymerase II C-terminal doma...
469 e-129 ref|XP_004242582.1| PREDICTED: RNA polymerase II C-terminal doma... 468 e-129 >ref|XP_004962192.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Setaria italica] Length = 543 Score = 494 bits (1273), Expect = e-137 Identities = 245/407 (60%), Positives = 311/407 (76%), Gaps = 2/407 (0%) Frame = +2 Query: 1019 IQEGRIKRRKLEEVEVIKEYQGSVASGTIKERAAISSKCKDDSICRHPVVLKGLCVTCG- 1195 +++ KRR++EE + Q S + A SK +C HP GLC CG Sbjct: 66 LEQNSAKRRRVEE-----QSQDQGTSIRPDKIATGPSKNVQVEVCPHPGYFGGLCFRCGK 120 Query: 1196 -QLEDDGSGVALGYIHKDLKLRTEEINRLREADLKTSLEKKRXXXXXXXXHTLINSTRLA 1372 Q E+D SGVA GYIHK L+L T EI+RLR ADLK L +++ HTLINST+L Sbjct: 121 PQDEEDASGVAFGYIHKGLRLGTSEIDRLRGADLKNLLRERKLVLILDLDHTLINSTKLQ 180 Query: 1373 DISPEEEYLVRQADALKNDPNRSLFKLDSMHMLTKLRPFVRTFLEEANRLFEMYIYTMGE 1552 DIS E L + ALK+DP+RS+F LDSM MLTKLRPFVR FL+EA+ +FEMYIYTMG+ Sbjct: 181 DISSAENELGIRTAALKDDPDRSIFSLDSMQMLTKLRPFVRNFLKEASNMFEMYIYTMGD 240 Query: 1553 RSYALEIAKLLDPGSVYFNSKVISQADCTERHRKGLDVVLGSESIVVILDDTEFVWRKHR 1732 ++YA+EIAKLLDP +VYF SKVIS +DCT+RH+KGLDV+LG+ES+ VILDDTE+VW+KH+ Sbjct: 241 KAYAIEIAKLLDPSNVYFPSKVISNSDCTQRHQKGLDVILGAESVAVILDDTEYVWQKHK 300 Query: 1733 ENLLLMERYHFFASSCRQFGFNAKSLSESQKDERESDGALSMILNVLKRTHQMFFDQVLN 1912 ENL+LMERYH+FASSCRQFGF KSLSES +DERESDGAL+ +L+VLKR H +FFD + Sbjct: 301 ENLILMERYHYFASSCRQFGFGVKSLSESMQDERESDGALATVLDVLKRIHTIFFDTAVE 360 Query: 1913 DDYSSRDVRQVLRRIRLEILQGCKVVFSRVFPAKSKAEDQPIWRMAEQLGATCATEVDAS 2092 SSRDVRQV++ +R E+L+GCK+VFSRVFP S+ ++Q +W+MAE LGA C+T+VD++ Sbjct: 361 TALSSRDVRQVIKTVRKEVLEGCKLVFSRVFPNTSRPQEQMMWKMAEHLGAVCSTDVDST 420 Query: 2093 VTHVVSADTGTQKARWALQNGKFLVNPRWIEAANYFWERQREEDFPI 2233 VTHVV+ D GT+KARWA++N KFLV+PRWIEAAN+ W RQ EEDFP+ Sbjct: 421 VTHVVAVDLGTEKARWAVKNKKFLVHPRWIEAANFRWHRQPEEDFPV 467 >ref|XP_002437361.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor] gi|241915584|gb|EER88728.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor] Length = 558 Score = 494 bits (1273), Expect = e-137 Identities = 247/419 
(58%), Positives = 316/419 (75%), Gaps = 2/419 (0%) Frame = +2 Query: 1001 DYDESDIQEGRIKRRKLEEVEVIKEYQGSVASGTIKERAAISSKCKDDSICRHPVVLKGL 1180 D + +++ KRR++EE +++ SV I A SK C HP GL Sbjct: 62 DPEVEAVEQNGTKRRRVEEQ--LQDQGTSVRPDKIPTGA---SKNVQVEACPHPGYFGGL 116 Query: 1181 CVTCGQLEDDG--SGVALGYIHKDLKLRTEEINRLREADLKTSLEKKRXXXXXXXXHTLI 1354 C CG+ +D+ SGVA GYIHK L+L T EI+RLR ADLK L +++ HTLI Sbjct: 117 CFRCGKPQDEENVSGVAFGYIHKGLRLGTSEIDRLRGADLKNLLRERKLVLILDLDHTLI 176 Query: 1355 NSTRLADISPEEEYLVRQADALKNDPNRSLFKLDSMHMLTKLRPFVRTFLEEANRLFEMY 1534 NST+L DIS E+ L Q A K+DPNRS+F LDSM MLTKLRPFVR FL+EA+ +FEMY Sbjct: 177 NSTKLQDISSAEKDLGIQTAASKDDPNRSIFSLDSMQMLTKLRPFVREFLKEASNMFEMY 236 Query: 1535 IYTMGERSYALEIAKLLDPGSVYFNSKVISQADCTERHRKGLDVVLGSESIVVILDDTEF 1714 IYTMG+++YA+EIAKLLDP ++YF SKVIS +DCT+RH+KGLDV+LG+ES+ VILDDTE+ Sbjct: 237 IYTMGDKAYAIEIAKLLDPSNIYFPSKVISNSDCTQRHQKGLDVILGAESVAVILDDTEY 296 Query: 1715 VWRKHRENLLLMERYHFFASSCRQFGFNAKSLSESQKDERESDGALSMILNVLKRTHQMF 1894 VW+KH+ENL+LMERYHFFASSCRQFGF +SLSES +DERESDGAL+ +L+VLKR H +F Sbjct: 297 VWQKHKENLILMERYHFFASSCRQFGFGVRSLSESMQDERESDGALATVLDVLKRIHSIF 356 Query: 1895 FDQVLNDDYSSRDVRQVLRRIRLEILQGCKVVFSRVFPAKSKAEDQPIWRMAEQLGATCA 2074 FD + D SS+DVRQV++ +R EILQGCK+VFSRVFP ++ ++Q +W+MAE LGA C+ Sbjct: 357 FDLAVETDLSSQDVRQVIKAVRKEILQGCKIVFSRVFPNNTRPQEQMLWKMAEHLGAVCS 416 Query: 2075 TEVDASVTHVVSADTGTQKARWALQNGKFLVNPRWIEAANYFWERQREEDFPISNLKNQ 2251 T+VD+SVTHVV+ D GT+KARW + N KFLV+PRWIEAAN+ W RQ EEDFP++ K + Sbjct: 417 TDVDSSVTHVVTVDLGTEKARWGVANKKFLVHPRWIEAANFRWHRQPEEDFPVTAPKEK 475 >gb|AFW77884.1| CPL3 [Zea mays] Length = 533 Score = 491 bits (1263), Expect = e-136 Identities = 248/413 (60%), Positives = 309/413 (74%), Gaps = 2/413 (0%) Frame = +2 Query: 1019 IQEGRIKRRKLEEVEVIKEYQGSVASGTIKERAAISSKCKDDSICRHPVVLKGLCVTCG- 1195 I++ KRR++EE ++ SV I A SK C HP GLC+ CG Sbjct: 66 IEQNGTKRRRVEEQ--CQDQGTSVRPDKIPTGA---SKIVQVEACPHPGHFGGLCIICGK 120 Query: 1196 -QLEDDGSGVALGYIHKDLKLRTEEINRLREADLKTSLEKKRXXXXXXXXHTLINSTRLA 1372 Q E+D SGVA GYIHK L+L T EI+RLR ADLK L 
+++ HTLINST+L Sbjct: 121 PQDEEDVSGVAFGYIHKGLRLGTSEIDRLRGADLKNLLRERKLVLILDLDHTLINSTKLQ 180 Query: 1373 DISPEEEYLVRQADALKNDPNRSLFKLDSMHMLTKLRPFVRTFLEEANRLFEMYIYTMGE 1552 DIS E+ L Q+ A K+DPNRS+F LD M MLTKLRPFVR FL+EA+ +FEMYIYTMG+ Sbjct: 181 DISSAEKDLGIQSAASKDDPNRSIFALDLMPMLTKLRPFVREFLKEASNMFEMYIYTMGD 240 Query: 1553 RSYALEIAKLLDPGSVYFNSKVISQADCTERHRKGLDVVLGSESIVVILDDTEFVWRKHR 1732 ++YA+EIAKLLDP ++YF SKVIS +DCT+RH+KGLDV+LG+ES+ VILDDTE+VW+KH+ Sbjct: 241 KAYAIEIAKLLDPSNIYFPSKVISNSDCTQRHQKGLDVILGAESVAVILDDTEYVWQKHK 300 Query: 1733 ENLLLMERYHFFASSCRQFGFNAKSLSESQKDERESDGALSMILNVLKRTHQMFFDQVLN 1912 ENL+LMERYHFFASSCRQFGF +SLSES +DERESDGAL+ +L+VLKR H FFD Sbjct: 301 ENLILMERYHFFASSCRQFGFGVRSLSESLQDERESDGALATVLDVLKRIHATFFDMAAE 360 Query: 1913 DDYSSRDVRQVLRRIRLEILQGCKVVFSRVFPAKSKAEDQPIWRMAEQLGATCATEVDAS 2092 D SSRD+RQV++ +R EILQGCK+VFSRVFP ++ ++Q +W+MAE LGA C +VD S Sbjct: 361 TDLSSRDIRQVIKTLRKEILQGCKIVFSRVFPNNTRPQEQMVWKMAEYLGAVCVKDVDPS 420 Query: 2093 VTHVVSADTGTQKARWALQNGKFLVNPRWIEAANYFWERQREEDFPISNLKNQ 2251 VTHVV+ D GT+KARW L N KFLV+PRWIEAAN+ W RQ EEDFP++ K + Sbjct: 421 VTHVVTVDLGTEKARWGLNNKKFLVHPRWIEAANFRWHRQPEEDFPVTAPKEK 473 >ref|XP_002324547.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa] gi|550318538|gb|EEF03112.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa] Length = 472 Score = 490 bits (1262), Expect = e-135 Identities = 239/419 (57%), Positives = 315/419 (75%) Frame = +2 Query: 992 QVNDYDESDIQEGRIKRRKLEEVEVIKEYQGSVASGTIKERAAISSKCKDDSICRHPVVL 1171 + + D+SD Q R+KR K+E VE++++ G+ + ++K + S IC HP Sbjct: 58 EAEEDDDSDFQRKRVKRSKVETVEIVEDDGGTTSFASLKHNSEASIS---KEICTHPGSF 114 Query: 1172 KGLCVTCGQLEDDGSGVALGYIHKDLKLRTEEINRLREADLKTSLEKKRXXXXXXXXHTL 1351 +C+ CGQL D SGV GYIHK L+L +EI RLR D+K L K+ HTL Sbjct: 115 GTMCIVCGQLLDGESGVTFGYIHKGLRLGNDEIVRLRNTDMKNLLRHKKLYLILDLDHTL 174 Query: 1352 INSTRLADISPEEEYLVRQADALKNDPNRSLFKLDSMHMLTKLRPFVRTFLEEANRLFEM 1531 +NST+L ++ +EEYL Q D+L++ SLF L SM M+TKLRPFVRTFL+EA+++FEM Sbjct: 175 
LNSTQLMHMTLDEEYLNGQTDSLQDVSKGSLFMLSSMQMMTKLRPFVRTFLKEASQMFEM 234 Query: 1532 YIYTMGERSYALEIAKLLDPGSVYFNSKVISQADCTERHRKGLDVVLGSESIVVILDDTE 1711 YIYTMG+R+YALE+AKLLDPG YFN+KVIS+ D T+RH+KGLDVVLG ES V+ILDDTE Sbjct: 235 YIYTMGDRAYALEMAKLLDPGREYFNAKVISRDDGTQRHQKGLDVVLGQESAVLILDDTE 294 Query: 1712 FVWRKHRENLLLMERYHFFASSCRQFGFNAKSLSESQKDERESDGALSMILNVLKRTHQM 1891 W KH++NL+LMERYHFFASSC QFGFN KSLSE + DE ES+GAL+ IL VL++ HQ+ Sbjct: 295 NAWMKHKDNLILMERYHFFASSCHQFGFNCKSLSEQKTDESESEGALASILKVLRKIHQI 354 Query: 1892 FFDQVLNDDYSSRDVRQVLRRIRLEILQGCKVVFSRVFPAKSKAEDQPIWRMAEQLGATC 2071 FF++ L ++ RDVRQVL+ +R ++L+GCK+VFSRVFP +S+A++ +WRMAEQLGATC Sbjct: 355 FFEE-LEENMDGRDVRQVLKTVRKDVLKGCKIVFSRVFPTQSQADNHHLWRMAEQLGATC 413 Query: 2072 ATEVDASVTHVVSADTGTQKARWALQNGKFLVNPRWIEAANYFWERQREEDFPISNLKN 2248 +TE+D SVTHVVS D+GT+K+ WAL++ KFLV P WIEAANYFW+RQ EE+F + +KN Sbjct: 414 STELDPSVTHVVSKDSGTEKSHWALKHNKFLVQPGWIEAANYFWQRQPEENFSFNQIKN 472 >ref|XP_006654357.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Oryza brachyantha] Length = 557 Score = 490 bits (1261), Expect = e-135 Identities = 244/417 (58%), Positives = 315/417 (75%), Gaps = 3/417 (0%) Frame = +2 Query: 995 VNDYDESDIQEGRIKRRKLEEVEVIKEYQGSVASGTIKERAAISSKCKDDSICR-HPVVL 1171 V + +E+ +++ KRR++E+ ++Q + + SSK IC HP Sbjct: 64 VVEQEEAKVEQSSSKRRRVED-----QHQDEGKAMRPNDDTVGSSKDVKIEICPPHPGFF 118 Query: 1172 KGLCVTCG--QLEDDGSGVALGYIHKDLKLRTEEINRLREADLKTSLEKKRXXXXXXXXH 1345 GLC CG Q E+D GVA GYIHK L L T EI+RLR ADLK L ++R H Sbjct: 119 GGLCFKCGKKQDEEDVPGVAFGYIHKGLTLGTSEIDRLRGADLKNLLRERRLVLILDLDH 178 Query: 1346 TLINSTRLADISPEEEYLVRQADALKNDPNRSLFKLDSMHMLTKLRPFVRTFLEEANRLF 1525 TLINST+L D+S E L Q+ A K+DPNRSLF+LD+M MLTKLRPFVR FL+EA+ +F Sbjct: 179 TLINSTKLLDLSAAENELGIQSAASKDDPNRSLFRLDAMQMLTKLRPFVREFLKEASNMF 238 Query: 1526 EMYIYTMGERSYALEIAKLLDPGSVYFNSKVISQADCTERHRKGLDVVLGSESIVVILDD 1705 EMYIYTMG+++YA+EIAKLLDP +VYF S VIS +DCT+RH+KGLDV+LG+ES+ VILDD Sbjct: 239 EMYIYTMGDKAYAIEIAKLLDPENVYFGSNVISNSDCTQRHQKGLDVILGAESLAVILDD 298 Query: 1706 
TEFVWRKHRENLLLMERYHFFASSCRQFGFNAKSLSESQKDERESDGALSMILNVLKRTH 1885 TE+VW+KH+ENL+LMERYH+FASSCRQFGF+A+SLSES +DERE DGAL+ IL++L+R H Sbjct: 299 TEYVWQKHKENLILMERYHYFASSCRQFGFSARSLSESMQDEREGDGALATILDILRRIH 358 Query: 1886 QMFFDQVLNDDYSSRDVRQVLRRIRLEILQGCKVVFSRVFPAKSKAEDQPIWRMAEQLGA 2065 +FFD + + SRDVRQV++R+R EIL GCK+VF+RVFP + +DQ +W+MAEQLGA Sbjct: 359 SIFFDSAVQNPLPSRDVRQVIKRVRQEILDGCKLVFTRVFPLHQRPQDQMLWKMAEQLGA 418 Query: 2066 TCATEVDASVTHVVSADTGTQKARWALQNGKFLVNPRWIEAANYFWERQREEDFPIS 2236 C T+VD+ VTHVV+ D GT+KARWA+ N KFLV+PRWIEAAN+ W RQ+EEDFP++ Sbjct: 419 VCCTDVDSMVTHVVALDLGTEKARWAVGNKKFLVHPRWIEAANFRWHRQQEEDFPVA 475 >ref|NP_001152445.1| CPL3 [Zea mays] gi|195656359|gb|ACG47647.1| CPL3 [Zea mays] Length = 531 Score = 489 bits (1260), Expect = e-135 Identities = 247/413 (59%), Positives = 309/413 (74%), Gaps = 2/413 (0%) Frame = +2 Query: 1019 IQEGRIKRRKLEEVEVIKEYQGSVASGTIKERAAISSKCKDDSICRHPVVLKGLCVTCG- 1195 I++ KRR++EE ++ SV I A SK C HP GLC+ CG Sbjct: 64 IEQNGTKRRRVEEQ--CQDQGTSVRPDKIPTGA---SKIVQVEACPHPGHFGGLCIICGK 118 Query: 1196 -QLEDDGSGVALGYIHKDLKLRTEEINRLREADLKTSLEKKRXXXXXXXXHTLINSTRLA 1372 Q E+D SGVA GYIHK L+L T EI+RLR ADLK L +++ HTLINST+L Sbjct: 119 PQDEEDVSGVAFGYIHKGLRLGTSEIDRLRGADLKNLLRERKLVLILDLDHTLINSTKLQ 178 Query: 1373 DISPEEEYLVRQADALKNDPNRSLFKLDSMHMLTKLRPFVRTFLEEANRLFEMYIYTMGE 1552 DIS E+ L Q+ A K+DPNRS+F LD M MLTKLRPFVR FL+EA+ +FEMYIYTMG+ Sbjct: 179 DISSAEKDLGIQSAASKDDPNRSIFALDLMPMLTKLRPFVREFLKEASNMFEMYIYTMGD 238 Query: 1553 RSYALEIAKLLDPGSVYFNSKVISQADCTERHRKGLDVVLGSESIVVILDDTEFVWRKHR 1732 ++YA+EIAKLLDP ++YF SKVIS +DCT+RH+KGLDV+LG+ES+ VILDDTE+VW+KH+ Sbjct: 239 KAYAIEIAKLLDPSNIYFPSKVISNSDCTQRHQKGLDVILGAESVAVILDDTEYVWQKHK 298 Query: 1733 ENLLLMERYHFFASSCRQFGFNAKSLSESQKDERESDGALSMILNVLKRTHQMFFDQVLN 1912 ENL+LMERYHFFASSCRQFGF +SLSES +DERESDGAL+ +L+VLKR H FFD Sbjct: 299 ENLILMERYHFFASSCRQFGFGVRSLSESLQDERESDGALATVLDVLKRIHATFFDMAAE 358 Query: 1913 DDYSSRDVRQVLRRIRLEILQGCKVVFSRVFPAKSKAEDQPIWRMAEQLGATCATEVDAS 2092 D SSRD+RQV++ +R EILQGCK+VFSRVFP ++ ++Q +W+MAE LGA C 
+VD S Sbjct: 359 TDLSSRDIRQVIKTLRKEILQGCKIVFSRVFPNNTRPQEQMVWKMAEYLGAVCVKDVDPS 418 Query: 2093 VTHVVSADTGTQKARWALQNGKFLVNPRWIEAANYFWERQREEDFPISNLKNQ 2251 VTHVV+ D GT+K+RW L N KFLV+PRWIEAAN+ W RQ EEDFP++ K + Sbjct: 419 VTHVVTVDLGTEKSRWGLNNKKFLVHPRWIEAANFRWHRQPEEDFPVTAPKEK 471 >gb|EMS57931.1| RNA polymerase II C-terminal domain phosphatase-like 4 [Triticum urartu] Length = 589 Score = 485 bits (1248), Expect = e-134 Identities = 245/412 (59%), Positives = 309/412 (75%), Gaps = 3/412 (0%) Frame = +2 Query: 1025 EGRIKRRKLEEVEVIKEYQGSVASGTIKERAAISSKCKDDSICR-HPVVLKGLCVTCG-- 1195 E KRRK++ +YQ + E + SS+ IC HP GLC CG Sbjct: 118 EASAKRRKVKV-----QYQDRETAIRPDEDSIGSSEDAQIKICPPHPGYFGGLCFRCGKR 172 Query: 1196 QLEDDGSGVALGYIHKDLKLRTEEINRLREADLKTSLEKKRXXXXXXXXHTLINSTRLAD 1375 Q E+D GVA GY+HK L+L T EI+RLR +DLK L +++ HTLINST+L D Sbjct: 173 QDEEDVPGVAFGYVHKGLRLGTTEIDRLRGSDLKNLLRERKLILILDLDHTLINSTKLHD 232 Query: 1376 ISPEEEYLVRQADALKNDPNRSLFKLDSMHMLTKLRPFVRTFLEEANRLFEMYIYTMGER 1555 IS E L Q A K+DPN SLF L+ M MLTKLRPFVR FL+EA+ +FEMYIYTMG++ Sbjct: 233 ISAAENNLGIQTAASKDDPNGSLFTLEGMQMLTKLRPFVRKFLKEASNMFEMYIYTMGDK 292 Query: 1556 SYALEIAKLLDPGSVYFNSKVISQADCTERHRKGLDVVLGSESIVVILDDTEFVWRKHRE 1735 +YA+EIAKLLDP +VYFNSKVIS +DCT+RH+KGLD+VLG+ES+ VILDDTE+VW+KH+E Sbjct: 293 AYAIEIAKLLDPRNVYFNSKVISNSDCTQRHQKGLDMVLGAESVAVILDDTEYVWQKHKE 352 Query: 1736 NLLLMERYHFFASSCRQFGFNAKSLSESQKDERESDGALSMILNVLKRTHQMFFDQVLND 1915 NL+LMERYH+FASSCRQFGF+ KSLSE +DER SDGAL+ IL+VLKR H +FFD + Sbjct: 353 NLILMERYHYFASSCRQFGFSVKSLSEFMQDERGSDGALATILDVLKRIHTIFFDSAVET 412 Query: 1916 DYSSRDVRQVLRRIRLEILQGCKVVFSRVFPAKSKAEDQPIWRMAEQLGATCATEVDASV 2095 SSRDVRQV++R+R E+LQGCK+VFSRVFP+ S+ +DQ IW+MAEQLGA C+ +VD+++ Sbjct: 413 ALSSRDVRQVIKRVRQEVLQGCKLVFSRVFPSSSRPQDQFIWKMAEQLGAICSADVDSTI 472 Query: 2096 THVVSADTGTQKARWALQNGKFLVNPRWIEAANYFWERQREEDFPISNLKNQ 2251 THVV+ D GT KARWA+ N K LV+PRWIEA+N+ W RQ+EEDFP+ KN+ Sbjct: 473 THVVAVDVGTDKARWAVNNNKILVHPRWIEASNFRWHRQQEEDFPVKVKKNE 524 >ref|XP_003566293.1| PREDICTED: RNA polymerase II C-terminal 
domain phosphatase-like 4-like [Brachypodium distachyon] Length = 492 Score = 484 bits (1247), Expect = e-134 Identities = 246/414 (59%), Positives = 315/414 (76%), Gaps = 3/414 (0%) Frame = +2 Query: 1019 IQEGRIKRRKLEEVEVIKEYQGSVASGTIKERAAISSKCKDDSICR-HPVVLKGLCVTCG 1195 +++G KRR++EE ++Q + E A S K + IC HP L+GLC+ CG Sbjct: 67 VEQGSTKRRRVEE-----QHQDRGTAMRPDEDAIGSFKDAEIKICPPHPGFLRGLCIKCG 121 Query: 1196 QLED--DGSGVALGYIHKDLKLRTEEINRLREADLKTSLEKKRXXXXXXXXHTLINSTRL 1369 +++D D GVA GYIH+ L+L T EI RLR +DLK L +++ HTLINSTRL Sbjct: 122 KIQDEEDVPGVACGYIHEGLRLGTSEIERLRGSDLKKLLRERKLVLILDLDHTLINSTRL 181 Query: 1370 ADISPEEEYLVRQADALKNDPNRSLFKLDSMHMLTKLRPFVRTFLEEANRLFEMYIYTMG 1549 DIS E L Q ALK+DP+RSLF L+ MHMLTKLRPFVR FL+EA+ +FEMYIYTMG Sbjct: 182 HDISAAEMDLGIQTAALKDDPDRSLFTLERMHMLTKLRPFVRRFLKEASNMFEMYIYTMG 241 Query: 1550 ERSYALEIAKLLDPGSVYFNSKVISQADCTERHRKGLDVVLGSESIVVILDDTEFVWRKH 1729 +++Y++E+AKLLDPG+VYF SKVIS +DCT+RH+KGLDVVLG+ESI VILDDTE VW+KH Sbjct: 242 DKAYSIEVAKLLDPGNVYFGSKVISNSDCTQRHQKGLDVVLGAESIAVILDDTEDVWQKH 301 Query: 1730 RENLLLMERYHFFASSCRQFGFNAKSLSESQKDERESDGALSMILNVLKRTHQMFFDQVL 1909 +ENL+LMERYH+FASSCRQFGF+ +SLSE DERESDGALS IL+VLKR H +FFD + Sbjct: 302 KENLILMERYHYFASSCRQFGFSVRSLSELMVDERESDGALSTILDVLKRIHTIFFDSGV 361 Query: 1910 NDDYSSRDVRQVLRRIRLEILQGCKVVFSRVFPAKSKAEDQPIWRMAEQLGATCATEVDA 2089 SSR + V++R+R E+LQGCK+VFSRVFP+ S +DQ IW+MAE+LGA+C VD+ Sbjct: 362 ETALSSRTL-MVIKRVRQEVLQGCKLVFSRVFPSNSCPQDQIIWKMAEKLGASCCAHVDS 420 Query: 2090 SVTHVVSADTGTQKARWALQNGKFLVNPRWIEAANYFWERQREEDFPISNLKNQ 2251 +VTHVV+ D GT+KARWA++N KFL++PRWIEA+NY W RQ EEDFP++ K + Sbjct: 421 TVTHVVAVDVGTEKARWAVENKKFLLHPRWIEASNYRWRRQPEEDFPVAGRKEK 474 >ref|NP_001055440.1| Os05g0390500 [Oryza sativa Japonica Group] gi|57863785|gb|AAS86390.2| unknown protein [Oryza sativa Japonica Group] gi|113578991|dbj|BAF17354.1| Os05g0390500 [Oryza sativa Japonica Group] gi|215695102|dbj|BAG90293.1| unnamed protein product [Oryza sativa Japonica Group] gi|222631469|gb|EEE63601.1| hypothetical protein OsJ_18418 [Oryza 
sativa Japonica Group] Length = 536 Score = 482 bits (1241), Expect = e-133 Identities = 241/421 (57%), Positives = 316/421 (75%), Gaps = 2/421 (0%) Frame = +2 Query: 995 VNDYDESDIQEGRIKRRKLEEVEVIKEYQGSVASGTIKERAAISSKCKDDSICRHPVVLK 1174 V + +++ +++ KRR++E+ + +V + ++ S K D HP Sbjct: 65 VVEQEDAIVEQSSTKRRRVED----QHRHQAVVMKSDEDTVGSSKDVKIDECPPHPGFFG 120 Query: 1175 GLCVTCG--QLEDDGSGVALGYIHKDLKLRTEEINRLREADLKTSLEKKRXXXXXXXXHT 1348 GLC CG Q E+D GVA GYIHK L+L T EI+RLR ADLK L +++ HT Sbjct: 121 GLCYRCGKRQDEEDVPGVAFGYIHKGLRLGTTEIDRLRGADLKNLLRERKLVLILDLDHT 180 Query: 1349 LINSTRLADISPEEEYLVRQADALKNDPNRSLFKLDSMHMLTKLRPFVRTFLEEANRLFE 1528 LINST+L D+S E L Q+ A + P+RSLF L++M MLTKLRPFVR FL+EA+ +FE Sbjct: 181 LINSTKLFDLSAAENELGIQSAAKEVVPDRSLFTLETMQMLTKLRPFVRRFLKEASDMFE 240 Query: 1529 MYIYTMGERSYALEIAKLLDPGSVYFNSKVISQADCTERHRKGLDVVLGSESIVVILDDT 1708 MYIYTMG+++YA+EIAKLLDP +VYF SKVIS +DCT+RH+KGLDVVLG ES+ VILDDT Sbjct: 241 MYIYTMGDKAYAIEIAKLLDPDNVYFGSKVISNSDCTQRHQKGLDVVLGDESVAVILDDT 300 Query: 1709 EFVWRKHRENLLLMERYHFFASSCRQFGFNAKSLSESQKDERESDGALSMILNVLKRTHQ 1888 E+VW+KH+ENL+LMERYH+FASSCRQFGF A+SLSE+ +DERE+DGAL+ IL+VL+R H Sbjct: 301 EYVWQKHKENLILMERYHYFASSCRQFGFGARSLSETMQDERENDGALATILDVLERIHT 360 Query: 1889 MFFDQVLNDDYSSRDVRQVLRRIRLEILQGCKVVFSRVFPAKSKAEDQPIWRMAEQLGAT 2068 +FFD SSRDVRQV++R+R E+LQGCK+VF+RVFP + +DQ IW+MAEQLGA Sbjct: 361 IFFDPDDQKPLSSRDVRQVIKRVRQEVLQGCKLVFTRVFPLHQRQQDQMIWKMAEQLGAV 420 Query: 2069 CATEVDASVTHVVSADTGTQKARWALQNGKFLVNPRWIEAANYFWERQREEDFPISNLKN 2248 C T+VD++VTHVV+ D GT+KARWA+ N KFLV+PRWIEAAN+ W+RQ+EEDFP++ K Sbjct: 421 CCTDVDSTVTHVVALDLGTEKARWAVSNKKFLVHPRWIEAANFRWQRQQEEDFPVARPKE 480 Query: 2249 Q 2251 + Sbjct: 481 K 481 >gb|EOY32064.1| RNA polymerase II ctd phosphatase, putative isoform 1 [Theobroma cacao] Length = 469 Score = 481 bits (1237), Expect = e-132 Identities = 234/416 (56%), Positives = 307/416 (73%), Gaps = 1/416 (0%) Frame = +2 Query: 1007 DESDIQEGRIKRRKLEEVEVIKEYQGSVASGTIKERAAISSKCK-DDSICRHPVVLKGLC 1183 D+ D+ R KR K E++E ++E +GS + G I+++ I ++ IC 
HP +C Sbjct: 55 DDDDLDSQRNKRCKTEKLEDLEESRGSTSQGLIEDKIVIHAELSLKKDICTHPGSFGQMC 114 Query: 1184 VTCGQLEDDGSGVALGYIHKDLKLRTEEINRLREADLKTSLEKKRXXXXXXXXHTLINST 1363 + CGQ DD SGV GYIHK L+L +EI RLR D+K L K+ HTL+NST Sbjct: 115 ILCGQRLDDESGVTFGYIHKGLRLGNDEIVRLRSTDMKNLLRHKKLYLVLDLDHTLLNST 174 Query: 1364 RLADISPEEEYLVRQADALKNDPNRSLFKLDSMHMLTKLRPFVRTFLEEANRLFEMYIYT 1543 +L ++P+EEYL Q+D+L++ SLF LD MHM+TKLRPFVRTFL+EA+ +FEMYIYT Sbjct: 175 QLMHLTPDEEYLKGQSDSLQDVSRGSLFMLDFMHMMTKLRPFVRTFLKEASEMFEMYIYT 234 Query: 1544 MGERSYALEIAKLLDPGSVYFNSKVISQADCTERHRKGLDVVLGSESIVVILDDTEFVWR 1723 MG+R YALE+AKLLDP YF+ +VIS+ D T++H+KGLDVVLG ES VVILDDTE W Sbjct: 235 MGDRPYALEMAKLLDPRREYFSDRVISRDDGTQKHQKGLDVVLGQESAVVILDDTENAWM 294 Query: 1724 KHRENLLLMERYHFFASSCRQFGFNAKSLSESQKDERESDGALSMILNVLKRTHQMFFDQ 1903 KH++NL+LMERYH+FASSC QFG+ KSLS+ + DE E DGAL+ +L L++ H MFFD+ Sbjct: 295 KHKDNLILMERYHYFASSCHQFGYKCKSLSQLKSDESEPDGALASVLKALRQIHHMFFDE 354 Query: 1904 VLNDDYSSRDVRQVLRRIRLEILQGCKVVFSRVFPAKSKAEDQPIWRMAEQLGATCATEV 2083 L+ + +SRDVRQVL+ ++ E+L+GCK+VFS VFP AE P+W+MAEQLGATC+TE Sbjct: 355 -LDCNLASRDVRQVLKTVQEEVLKGCKIVFSHVFPTNFPAESHPLWKMAEQLGATCSTET 413 Query: 2084 DASVTHVVSADTGTQKARWALQNGKFLVNPRWIEAANYFWERQREEDFPISNLKNQ 2251 D SVTHVVS D GT+K+RWA++ KFLV+PRWIEA NY W++Q EE+FP+S KNQ Sbjct: 414 DLSVTHVVSTDAGTEKSRWAVKEKKFLVHPRWIEATNYLWQKQPEENFPVSQGKNQ 469 >dbj|BAK07377.1| predicted protein [Hordeum vulgare subsp. 
vulgare] Length = 488 Score = 480 bits (1235), Expect = e-132 Identities = 247/420 (58%), Positives = 310/420 (73%), Gaps = 7/420 (1%) Frame = +2 Query: 1007 DESDIQEGRIKRRKLEEVEVIKEYQGSVASGTIKERAAISSKCKDDSICR---HPVVLKG 1177 D + +G KRR++EE ++ + T E I S KD I + HP G Sbjct: 61 DLDAVGKGSNKRRRVEE------HRQDQGTATRPEEDVIGS-VKDAQIKKCPPHPGFFGG 113 Query: 1178 LCVTCG--QLEDDGSGVALGYIHKDLKLRTEEINRLREADLKTSLEKKRXXXXXXXXHTL 1351 LC+ CG Q E+D GVA GYIHK L+L T E++RLRE+++K L +++ HTL Sbjct: 114 LCINCGKSQDEEDVPGVAFGYIHKGLRLGTSEMDRLRESEVKNLLRERKLVLILDLDHTL 173 Query: 1352 INSTRLADISPEEEYLVRQADALKN--DPNRSLFKLDSMHMLTKLRPFVRTFLEEANRLF 1525 INSTRL DIS E L Q A KN DP RSLF L MHMLTKLRPFVR FLEEA+ +F Sbjct: 174 INSTRLHDISAAEMDLGIQTAASKNADDPERSLFTLQGMHMLTKLRPFVRKFLEEASNMF 233 Query: 1526 EMYIYTMGERSYALEIAKLLDPGSVYFNSKVISQADCTERHRKGLDVVLGSESIVVILDD 1705 +MYIYTMG+++YA+EIAKLLDPG+VYF+SKVIS +DCT+RH+KGLDVVLG + + VI+DD Sbjct: 234 DMYIYTMGDKAYAIEIAKLLDPGNVYFDSKVISNSDCTQRHQKGLDVVLGDDKVAVIIDD 293 Query: 1706 TEFVWRKHRENLLLMERYHFFASSCRQFGFNAKSLSESQKDERESDGALSMILNVLKRTH 1885 TE VW+KH+ENL+LMERYH+FA+SCRQFGF+ +SLSE +DERESDGAL+ IL+VLKR H Sbjct: 294 TEHVWQKHKENLILMERYHYFAASCRQFGFSDQSLSELMQDERESDGALATILDVLKRIH 353 Query: 1886 QMFFDQVLNDDYSSRDVRQVLRRIRLEILQGCKVVFSRVFPAKSKAEDQPIWRMAEQLGA 2065 +FFD + SSRDVRQV++R+R E+LQGCK+VFSRVFP+ +++DQ +W+MAEQLGA Sbjct: 354 TIFFDSGVETALSSRDVRQVIKRVRQEVLQGCKLVFSRVFPSDCRSQDQIMWKMAEQLGA 413 Query: 2066 TCATEVDASVTHVVSADTGTQKARWALQNGKFLVNPRWIEAANYFWERQREEDFPISNLK 2245 C +EVD SVTHVV+ GT+KARWA N KFL++PRWIEA NY W RQ EEDFP+ LK Sbjct: 414 VCCSEVDPSVTHVVAVHAGTEKARWAAGNKKFLLHPRWIEACNYRWHRQPEEDFPVPGLK 473 >ref|XP_002439741.1| hypothetical protein SORBIDRAFT_09g019310 [Sorghum bicolor] gi|241945026|gb|EES18171.1| hypothetical protein SORBIDRAFT_09g019310 [Sorghum bicolor] Length = 547 Score = 479 bits (1233), Expect = e-132 Identities = 243/414 (58%), Positives = 311/414 (75%), Gaps = 3/414 (0%) Frame = +2 Query: 1019 IQEGRIKRRKLEEVEVIKEYQGSVASGTIKERAAISSKCKDDSICRHPVVLKGLCVTCGQ 1198 +++ KRR++EE ++ 
SV I A SK C HP ++GLC CG Sbjct: 70 VEQNATKRRRVEEQH--RDQGTSVRPDKIPTGA---SKNIQVEACPHPGYIRGLCYICGN 124 Query: 1199 LEDDG--SGVALGYIHKDLKLRTEEINRLREADLKTSLEKKRXXXXXXXXHTLINSTRLA 1372 +D+ SGVAL YI K L+LRT EI+RLR ADLK L +++ HTLINST+L Sbjct: 125 PQDEEYISGVALDYIDKGLRLRTSEIDRLRCADLKNLLRERKLVLILDLDHTLINSTKLQ 184 Query: 1373 DISPEEEYLVRQADALKNDPNRSLFKLDSMHMLTKLRPFVRTFLEEANRLFEMYIYTMGE 1552 +IS E+ L Q A K+DPNRS+F L+SM +LTKLRPFVR FL+EA+ +FEMYIYTMG+ Sbjct: 185 NISSAEKDLGIQTAASKDDPNRSIFALESMQLLTKLRPFVREFLKEASNMFEMYIYTMGD 244 Query: 1553 RSYALEIAKLLDPGSVYFNSKVISQADCTERHRKGLDVVLGSESIVVILDDTEFVWRKHR 1732 ++YA+EIAKLLDP ++YF KVIS +DCT+RH+KGLDV+LG+ S+ VILDDTEFVW+KH+ Sbjct: 245 KAYAIEIAKLLDPSNIYFPLKVISNSDCTKRHQKGLDVILGAASVAVILDDTEFVWKKHK 304 Query: 1733 ENLLLMERYHFFASSCRQFGFNAKSLSESQKDERESDGALSMILNVLKRTHQMFFDQ-VL 1909 ENL+LMERYHFFASSCR+FGF +SLSE +DERESDGAL+ +L+VLKR H +FFD V Sbjct: 305 ENLILMERYHFFASSCREFGFAVRSLSELMQDERESDGALATVLDVLKRIHAIFFDMAVE 364 Query: 1910 NDDYSSRDVRQVLRRIRLEILQGCKVVFSRVFPAKSKAEDQPIWRMAEQLGATCATEVDA 2089 DD SSRDVRQV++ +R EILQGCK+VFSRVFP ++ + Q +W+MAE LGA C+T+VD+ Sbjct: 365 TDDLSSRDVRQVIKAVRKEILQGCKIVFSRVFPNNTRPQKQMVWKMAEYLGAVCSTDVDS 424 Query: 2090 SVTHVVSADTGTQKARWALQNGKFLVNPRWIEAANYFWERQREEDFPISNLKNQ 2251 SVTHVV+ D GT+KARW + N KFLV+PRWIEAAN+ W RQ EEDFP++ K + Sbjct: 425 SVTHVVTVDLGTEKARWGVANKKFLVHPRWIEAANFRWHRQPEEDFPVTAPKEK 478 >ref|XP_002324546.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa] gi|550318537|gb|EEF03111.2| hypothetical protein POPTR_0018s11760g [Populus trichocarpa] Length = 468 Score = 477 bits (1227), Expect = e-131 Identities = 235/419 (56%), Positives = 309/419 (73%) Frame = +2 Query: 992 QVNDYDESDIQEGRIKRRKLEEVEVIKEYQGSVASGTIKERAAISSKCKDDSICRHPVVL 1171 + + D+SD Q R+KR K+E VE++++ G+ + ++K + S IC HP Sbjct: 58 EAEEDDDSDFQRKRVKRSKVETVEIVEDDGGTTSFASLKHNSEASIS---KEICTHPGSF 114 Query: 1172 KGLCVTCGQLEDDGSGVALGYIHKDLKLRTEEINRLREADLKTSLEKKRXXXXXXXXHTL 1351 +C+ CGQL D SGV GYIHK L+L +EI RLR D+K L K+ HTL Sbjct: 115 
GTMCIVCGQLLDGESGVTFGYIHKGLRLGNDEIVRLRNTDMKNLLRHKKLYLILDLDHTL 174 Query: 1352 INSTRLADISPEEEYLVRQADALKNDPNRSLFKLDSMHMLTKLRPFVRTFLEEANRLFEM 1531 +NST+L ++ +EEYL Q D+L++ SLF L SM M+TKLRPFVRTFL+EA+++FEM Sbjct: 175 LNSTQLMHMTLDEEYLNGQTDSLQDVSKGSLFMLSSMQMMTKLRPFVRTFLKEASQMFEM 234 Query: 1532 YIYTMGERSYALEIAKLLDPGSVYFNSKVISQADCTERHRKGLDVVLGSESIVVILDDTE 1711 YIYTMG+R+YALE+AKLLDPG YFN+KVIS+ D T+RH+KGLDVVLG ES V+ILDDTE Sbjct: 235 YIYTMGDRAYALEMAKLLDPGREYFNAKVISRDDGTQRHQKGLDVVLGQESAVLILDDTE 294 Query: 1712 FVWRKHRENLLLMERYHFFASSCRQFGFNAKSLSESQKDERESDGALSMILNVLKRTHQM 1891 W KH++NL+LMERYHFFASSC QFGFN KSLSE + DE ES+GAL+ IL VL++ HQ+ Sbjct: 295 NAWMKHKDNLILMERYHFFASSCHQFGFNCKSLSEQKTDESESEGALASILKVLRKIHQI 354 Query: 1892 FFDQVLNDDYSSRDVRQVLRRIRLEILQGCKVVFSRVFPAKSKAEDQPIWRMAEQLGATC 2071 FF +D+ QVL+ +R ++L+GCK+VFSRVFP +S+A++ +WRMAEQLGATC Sbjct: 355 FF-----EDHILSLALQVLKTVRKDVLKGCKIVFSRVFPTQSQADNHHLWRMAEQLGATC 409 Query: 2072 ATEVDASVTHVVSADTGTQKARWALQNGKFLVNPRWIEAANYFWERQREEDFPISNLKN 2248 +TE+D SVTHVVS D+GT+K+ WAL++ KFLV P WIEAANYFW+RQ EE+F + +KN Sbjct: 410 STELDPSVTHVVSKDSGTEKSHWALKHNKFLVQPGWIEAANYFWQRQPEENFSFNQIKN 468 >ref|XP_003579679.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Brachypodium distachyon] Length = 493 Score = 477 bits (1227), Expect = e-131 Identities = 240/410 (58%), Positives = 310/410 (75%), Gaps = 5/410 (1%) Frame = +2 Query: 1019 IQEGRIKRRKLEEVEVIKEYQGSVASGTIKERAAISSKCKDDSI--CR-HPVVLKGLCVT 1189 +++ KRRK VI++ Q TIK CKD I C HP GLC Sbjct: 66 VEQSSTKRRK-----VIEQVQDRGI--TIKPDEDAKGSCKDSQIKICPPHPGFFGGLCFR 118 Query: 1190 CG--QLEDDGSGVALGYIHKDLKLRTEEINRLREADLKTSLEKKRXXXXXXXXHTLINST 1363 CG Q E+D GVA GYIHK L+L T EI+RLR +++K+ L +++ HTLINST Sbjct: 119 CGKRQDEEDVPGVAFGYIHKGLRLGTSEIDRLRGSNVKSLLRERKLVLILDLDHTLINST 178 Query: 1364 RLADISPEEEYLVRQADALKNDPNRSLFKLDSMHMLTKLRPFVRTFLEEANRLFEMYIYT 1543 +L DIS E L Q A ++ P +SLF L++M MLTKLRPFV FL+EA+ +FEMYIYT Sbjct: 179 KLHDISAAERDLGIQTFASEDAPEKSLFTLEAMQMLTKLRPFVCKFLKEASNMFEMYIYT 238 Query: 1544 
MGERSYALEIAKLLDPGSVYFNSKVISQADCTERHRKGLDVVLGSESIVVILDDTEFVWR 1723 MG+++YA+EIAKLLDPG+VYF SKVIS +DCT+RH+KGLDVVLG+E++ +ILDDTE+VW+ Sbjct: 239 MGDKAYAIEIAKLLDPGNVYFGSKVISNSDCTQRHQKGLDVVLGAENVAIILDDTEYVWQ 298 Query: 1724 KHRENLLLMERYHFFASSCRQFGFNAKSLSESQKDERESDGALSMILNVLKRTHQMFFDQ 1903 KH+ENL+LMERYH+FASSCRQFGF+ K+LSES +DERESDGAL+ L+VLKR H +FFD Sbjct: 299 KHKENLILMERYHYFASSCRQFGFSVKALSESMQDERESDGALATTLDVLKRIHTLFFDS 358 Query: 1904 VLNDDYSSRDVRQVLRRIRLEILQGCKVVFSRVFPAKSKAEDQPIWRMAEQLGATCATEV 2083 + SSRDVRQV++++R E+LQGCKVVFSRVFP+ S+ +DQ IW+MAEQLGA C ++ Sbjct: 359 AVETALSSRDVRQVIKKVRQEVLQGCKVVFSRVFPSSSRPQDQIIWKMAEQLGAICCADM 418 Query: 2084 DASVTHVVSADTGTQKARWALQNGKFLVNPRWIEAANYFWERQREEDFPI 2233 D++VTHVV+ D+GT+KARWA+ N K LV+PRWIEA+N+ W RQ+EEDFP+ Sbjct: 419 DSTVTHVVAVDSGTEKARWAVGNNKILVHPRWIEASNFRWHRQQEEDFPV 468 >gb|EMJ05544.1| hypothetical protein PRUPE_ppa005647mg [Prunus persica] Length = 449 Score = 476 bits (1226), Expect = e-131 Identities = 238/415 (57%), Positives = 306/415 (73%) Frame = +2 Query: 1001 DYDESDIQEGRIKRRKLEEVEVIKEYQGSVASGTIKERAAISSKCKDDSICRHPVVLKGL 1180 DY+ D E KRRK+E + I E QGS + ++E + S K KD IC HP +K L Sbjct: 38 DYESDDGSERSTKRRKVENLGSIDETQGSTSQIFVEENSEASPK-KD--ICTHPGSVKDL 94 Query: 1181 CVTCGQLEDDGSGVALGYIHKDLKLRTEEINRLREADLKTSLEKKRXXXXXXXXHTLINS 1360 C+ CGQ D+ SGV LGYIHKD L +EI+R+R D+K SL K+ HTL+NS Sbjct: 95 CIVCGQRVDEKSGVPLGYIHKDFWLNNDEIDRVRSTDIKKSLHLKKLYLVLDLDHTLLNS 154 Query: 1361 TRLADISPEEEYLVRQADALKNDPNRSLFKLDSMHMLTKLRPFVRTFLEEANRLFEMYIY 1540 T L ++ EEEYL Q D+L++ + SLF++D MHM+TKLRPFVR FL+EA+ +FEMYIY Sbjct: 155 THLNHMTAEEEYLHSQTDSLQDVSDGSLFRVDVMHMMTKLRPFVRKFLKEASEMFEMYIY 214 Query: 1541 TMGERSYALEIAKLLDPGSVYFNSKVISQADCTERHRKGLDVVLGSESIVVILDDTEFVW 1720 TMGER+YALE+AKLLDP YF +VIS+ D T++H+KGLDVVLG ES +ILDDTE W Sbjct: 215 TMGERAYALEMAKLLDPRKEYFGDRVISRDDGTQKHQKGLDVVLGHESAALILDDTENAW 274 Query: 1721 RKHRENLLLMERYHFFASSCRQFGFNAKSLSESQKDERESDGALSMILNVLKRTHQMFFD 1900 KH++NL+LMERYHFF SSC QFGF+ KSLSE + DE E +GAL+ +L VLKR H MFF Sbjct: 275 
TKHKDNLILMERYHFFRSSCHQFGFHCKSLSELKSDESEPEGALATVLEVLKRIHNMFFY 334 Query: 1901 QVLNDDYSSRDVRQVLRRIRLEILQGCKVVFSRVFPAKSKAEDQPIWRMAEQLGATCATE 2080 + D+ RDVRQVL+ +R EIL+GCK+VFSRVFP+K +AE+ +W+MAEQLGATC+TE Sbjct: 335 E-SKDNLIDRDVRQVLKTLRKEILKGCKIVFSRVFPSKFQAENHQLWKMAEQLGATCSTE 393 Query: 2081 VDASVTHVVSADTGTQKARWALQNGKFLVNPRWIEAANYFWERQREEDFPISNLK 2245 +D SVTHVVS D GT+K+RWA++ KFLV+P+WIEA+NY W +Q E+ FP++ K Sbjct: 394 LDLSVTHVVSTDAGTEKSRWAVKEKKFLVHPQWIEASNYMWLKQAEDKFPVNQTK 448 >gb|EEC79156.1| hypothetical protein OsI_19829 [Oryza sativa Indica Group] Length = 574 Score = 476 bits (1225), Expect = e-131 Identities = 235/381 (61%), Positives = 296/381 (77%), Gaps = 2/381 (0%) Frame = +2 Query: 1115 AAISSKCKDDSICRHPVVLKGLCVTCG--QLEDDGSGVALGYIHKDLKLRTEEINRLREA 1288 A S K D HP GLC CG Q E+D GVA GYIHK L+L T EI+RLR A Sbjct: 127 AGSSKDVKIDECPPHPGFFGGLCYRCGKRQDEEDVPGVAFGYIHKGLRLGTTEIDRLRGA 186 Query: 1289 DLKTSLEKKRXXXXXXXXHTLINSTRLADISPEEEYLVRQADALKNDPNRSLFKLDSMHM 1468 DLK L +++ HTLINST+L D+S E L Q+ A + P+RSLF L++M M Sbjct: 187 DLKNLLRERKLVLILDLDHTLINSTKLFDLSAAENELGIQSAAKEVVPDRSLFTLETMQM 246 Query: 1469 LTKLRPFVRTFLEEANRLFEMYIYTMGERSYALEIAKLLDPGSVYFNSKVISQADCTERH 1648 LTKLRPFVR FL+EA+ +FEMYIYTMG+++YA+EIAKLLDP +VYF SKVIS +DCT+RH Sbjct: 247 LTKLRPFVRRFLKEASDMFEMYIYTMGDKAYAIEIAKLLDPDNVYFGSKVISNSDCTQRH 306 Query: 1649 RKGLDVVLGSESIVVILDDTEFVWRKHRENLLLMERYHFFASSCRQFGFNAKSLSESQKD 1828 +KGLDVVLG ES+ VILDDTE+VW+KH+ENL+LMERYH+FASSCRQFGF A+SLSE+ +D Sbjct: 307 QKGLDVVLGDESVAVILDDTEYVWQKHKENLILMERYHYFASSCRQFGFGARSLSETMQD 366 Query: 1829 ERESDGALSMILNVLKRTHQMFFDQVLNDDYSSRDVRQVLRRIRLEILQGCKVVFSRVFP 2008 ERE+DGAL+ IL+VL+R H +FFD SSRDVRQV++R+R E+LQGCK+VF+RVFP Sbjct: 367 ERENDGALATILDVLERIHTIFFDPDDQKPLSSRDVRQVIKRVRQEVLQGCKLVFTRVFP 426 Query: 2009 AKSKAEDQPIWRMAEQLGATCATEVDASVTHVVSADTGTQKARWALQNGKFLVNPRWIEA 2188 + +DQ +W+MAEQLGA C T+VD++VTHVV+ D GT+KARWA+ N KFLV+PRWIEA Sbjct: 427 LHQRQQDQMLWKMAEQLGAVCCTDVDSTVTHVVALDLGTEKARWAVSNKKFLVHPRWIEA 486 Query: 2189 ANYFWERQREEDFPISNLKNQ 2251 AN+ W+RQ+EEDFP++ K 
+ Sbjct: 487 ANFRWQRQQEEDFPVARPKEK 507 >ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Cucumis sativus] Length = 452 Score = 471 bits (1212), Expect = e-130 Identities = 233/412 (56%), Positives = 305/412 (74%) Frame = +2 Query: 1010 ESDIQEGRIKRRKLEEVEVIKEYQGSVASGTIKERAAISSKCKDDSICRHPVVLKGLCVT 1189 +++ + RIKRRK+E++E +E + ++ + SK +C HP +C+ Sbjct: 41 DNNAESVRIKRRKVEKLENSEE---DIMHEVEEQSLEVLSK---QQLCSHPGSFGNMCII 94 Query: 1190 CGQLEDDGSGVALGYIHKDLKLRTEEINRLREADLKTSLEKKRXXXXXXXXHTLINSTRL 1369 CGQ D+ SGV GYIHK+L+L +EINR+R ++K L++K+ HTL+NST L Sbjct: 95 CGQRLDEESGVTFGYIHKELRLNNDEINRMRNKEMKELLQRKKLILVLDLDHTLLNSTEL 154 Query: 1370 ADISPEEEYLVRQADALKNDPNRSLFKLDSMHMLTKLRPFVRTFLEEANRLFEMYIYTMG 1549 ++ EEEYL Q D+L + SLF L+S+H +TKLRPFV +FL+EA++LFEMYIYTMG Sbjct: 155 RYLTVEEEYLRSQTDSLDDVTKGSLFLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMG 214 Query: 1550 ERSYALEIAKLLDPGSVYFNSKVISQADCTERHRKGLDVVLGSESIVVILDDTEFVWRKH 1729 ER YA E+AKLLDP YF+SKVIS+ D T++H+KGLDVVLG ES V+ILDDTE W KH Sbjct: 215 ERRYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKH 274 Query: 1730 RENLLLMERYHFFASSCRQFGFNAKSLSESQKDERESDGALSMILNVLKRTHQMFFDQVL 1909 +ENL+LMERYHFFASSCRQFGFN KSLSE + DE E+DGAL+ IL VLK+ H MFF++V Sbjct: 275 KENLILMERYHFFASSCRQFGFNCKSLSELKNDESETDGALTTILKVLKQVHHMFFNEV- 333 Query: 1910 NDDYSSRDVRQVLRRIRLEILQGCKVVFSRVFPAKSKAEDQPIWRMAEQLGATCATEVDA 2089 + D RDVRQVL+ +R E+L+GCKVVFSRVFP K +AE+ +W+M EQLG TC+TE+D Sbjct: 334 SGDLVDRDVRQVLKTVRAEVLEGCKVVFSRVFPTKFQAENHQLWKMVEQLGGTCSTELDQ 393 Query: 2090 SVTHVVSADTGTQKARWALQNGKFLVNPRWIEAANYFWERQREEDFPISNLK 2245 SVTHVV+ D GT+K+RWAL+ KFLV+PRWIEA+NYFW+RQ EE+F + K Sbjct: 394 SVTHVVATDAGTEKSRWALKEKKFLVHPRWIEASNYFWKRQMEENFTVEQTK 445 >ref|XP_006343662.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Solanum tuberosum] Length = 478 Score = 469 bits (1208), Expect = e-129 Identities = 233/414 (56%), Positives = 300/414 (72%) Frame = +2 Query: 998 NDYDESDIQEGRIKRRKLEEVEVIKEYQGSVASGTIKERAAISSKCKDDSICRHPVVLKG 
1177 +D D+ I R K+RK+E +E + Q SV+ G E + S +C HP V+ G Sbjct: 68 DDDDDGSIDSSRSKKRKIELIEAAVDPQSSVSRGEPAETSGASLAL---DVCTHPGVMGG 124 Query: 1178 LCVTCGQLEDDGSGVALGYIHKDLKLRTEEINRLREADLKTSLEKKRXXXXXXXXHTLIN 1357 +C+ CGQ +D SGVA GYIHK+L+L +E+ RLR+ DLK L K+ HTL+N Sbjct: 125 MCIRCGQKVEDESGVAFGYIHKNLRLADDEVARLRDKDLKNLLRHKKLILVLDLDHTLLN 184 Query: 1358 STRLADISPEEEYLVRQADALKNDPNRSLFKLDSMHMLTKLRPFVRTFLEEANRLFEMYI 1537 STRLADIS EE YL Q + L + +LFKLD +HM+TKLRPFV TFL+EA+ LFEMYI Sbjct: 185 STRLADISAEESYLKDQREVLPDALRNNLFKLDWIHMMTKLRPFVHTFLKEASSLFEMYI 244 Query: 1538 YTMGERSYALEIAKLLDPGSVYFNSKVISQADCTERHRKGLDVVLGSESIVVILDDTEFV 1717 YTMGER YALE+A LLDPG +YF+S+VI+Q+D T RH+KGLDVVLG ES V+ILDDTE V Sbjct: 245 YTMGERPYALEMASLLDPGGIYFHSRVIAQSDSTRRHQKGLDVVLGQESAVLILDDTEVV 304 Query: 1718 WRKHRENLLLMERYHFFASSCRQFGFNAKSLSESQKDERESDGALSMILNVLKRTHQMFF 1897 W KHRENL+LM+RYHFF SSCRQFG KSLSE + DE E++GAL+ +L VL+R H++FF Sbjct: 305 WGKHRENLILMDRYHFFTSSCRQFGLKCKSLSEQKSDENEAEGALASVLEVLQRIHRLFF 364 Query: 1898 DQVLNDDYSSRDVRQVLRRIRLEILQGCKVVFSRVFPAKSKAEDQPIWRMAEQLGATCAT 2077 D D+ RDVRQVL+ +R EIL+GCK+VF+ V P + + E+ W++AE+LGAT +T Sbjct: 365 DLERGDNIMERDVRQVLKTVRKEILKGCKIVFTGVIPIQCQPENHHYWKLAEKLGATFST 424 Query: 2078 EVDASVTHVVSADTGTQKARWALQNGKFLVNPRWIEAANYFWERQREEDFPISN 2239 EVD SVTHVVS + T+K+R AL+ KFLV+P WIEAANY W + EE+FP+S+ Sbjct: 425 EVDESVTHVVSMNDKTEKSRQALREKKFLVHPSWIEAANYLWRKPPEENFPVSS 478 >ref|XP_004242583.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Solanum lycopersicum] Length = 472 Score = 469 bits (1208), Expect = e-129 Identities = 233/413 (56%), Positives = 301/413 (72%) Frame = +2 Query: 1001 DYDESDIQEGRIKRRKLEEVEVIKEYQGSVASGTIKERAAISSKCKDDSICRHPVVLKGL 1180 D D+ +I R K+RK+E +E + Q V+ G E + S +C HP V+ G+ Sbjct: 63 DGDDGNIDSRRSKKRKIELIEAAVDPQSLVSRGESAETSGASLAL---DVCTHPGVMGGM 119 Query: 1181 CVTCGQLEDDGSGVALGYIHKDLKLRTEEINRLREADLKTSLEKKRXXXXXXXXHTLINS 1360 C+ CGQ +D SGVA GYIHK+L+L +E+ RLRE DLK L ++ HTL+NS Sbjct: 120 
CIRCGQKVEDESGVAFGYIHKNLRLADDEVARLREKDLKNLLRHRKLILVLDLDHTLLNS 179 Query: 1361 TRLADISPEEEYLVRQADALKNDPNRSLFKLDSMHMLTKLRPFVRTFLEEANRLFEMYIY 1540 TRLADIS EE YL Q + L + +LFKLD +HM+TKLRPFV TFL+EA+ LFEMYIY Sbjct: 180 TRLADISAEESYLKDQREVLPDALRSNLFKLDWIHMMTKLRPFVHTFLKEASSLFEMYIY 239 Query: 1541 TMGERSYALEIAKLLDPGSVYFNSKVISQADCTERHRKGLDVVLGSESIVVILDDTEFVW 1720 TMGER YALE+AKLLDPG +YF+S+VI+Q+D T RH+KGLDVVLG ES V+ILDDTE VW Sbjct: 240 TMGERPYALEMAKLLDPGGIYFHSRVIAQSDSTRRHQKGLDVVLGQESAVLILDDTEVVW 299 Query: 1721 RKHRENLLLMERYHFFASSCRQFGFNAKSLSESQKDERESDGALSMILNVLKRTHQMFFD 1900 KHRENL+LM+RYHFF SSCRQFG KSLSE + DE E++GAL+ +L VL+R H++FFD Sbjct: 300 GKHRENLILMDRYHFFTSSCRQFGLKCKSLSEQKSDENEAEGALASVLEVLQRIHRLFFD 359 Query: 1901 QVLNDDYSSRDVRQVLRRIRLEILQGCKVVFSRVFPAKSKAEDQPIWRMAEQLGATCATE 2080 D+ RDVRQVL+ +R EIL+GCK+VF+ V P + + E+ W++AE+LGAT +TE Sbjct: 360 PERGDNIMERDVRQVLKTVRKEILKGCKIVFTGVIPIQCQPENHYYWKLAEKLGATFSTE 419 Query: 2081 VDASVTHVVSADTGTQKARWALQNGKFLVNPRWIEAANYFWERQREEDFPISN 2239 VD SVTHVVS + T+K+R A++ KFLV+PRWIEAANY W + EE+FP+S+ Sbjct: 420 VDESVTHVVSMNDKTEKSRQAVREKKFLVHPRWIEAANYLWRKPPEENFPVSS 472 >ref|XP_004242582.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 4-like [Solanum lycopersicum] Length = 512 Score = 468 bits (1205), Expect = e-129 Identities = 235/416 (56%), Positives = 302/416 (72%), Gaps = 2/416 (0%) Frame = +2 Query: 998 NDYDESD--IQEGRIKRRKLEEVEVIKEYQGSVASGTIKERAAISSKCKDDSICRHPVVL 1171 ND + D I R K+RK+E +E + Q SV+ G E + S +C HP V+ Sbjct: 100 NDTGDGDGSIDSSRSKKRKIELIEGAVDPQSSVSRGEPAETSGASMAL---DVCTHPGVM 156 Query: 1172 KGLCVTCGQLEDDGSGVALGYIHKDLKLRTEEINRLREADLKTSLEKKRXXXXXXXXHTL 1351 G+C+ CGQ +D SGVA GYIHK+L+L +E+ RLRE DLK L ++ HTL Sbjct: 157 GGMCIRCGQKVEDESGVAFGYIHKNLRLADDEVARLREKDLKNLLRHRKLILVLDLDHTL 216 Query: 1352 INSTRLADISPEEEYLVRQADALKNDPNRSLFKLDSMHMLTKLRPFVRTFLEEANRLFEM 1531 +NSTRLADIS EE YL Q + L + +LFKLD +HM+TKLRPFV TFL+EA+ LFEM Sbjct: 217 LNSTRLADISAEESYLKDQREVLPDALRSNLFKLDWIHMMTKLRPFVHTFLKEASSLFEM 276 Query: 1532 
YIYTMGERSYALEIAKLLDPGSVYFNSKVISQADCTERHRKGLDVVLGSESIVVILDDTE 1711 YIYTMGER YALE+AKLLDPG +YF+S+VI+Q+D T RH+KGLDVVLG ES V+ILDDTE Sbjct: 277 YIYTMGERPYALEMAKLLDPGGIYFHSRVIAQSDSTRRHQKGLDVVLGQESAVLILDDTE 336 Query: 1712 FVWRKHRENLLLMERYHFFASSCRQFGFNAKSLSESQKDERESDGALSMILNVLKRTHQM 1891 VW KHRENL+LM+RYHFF SSCRQFG KSLSE + DE E++GAL+ +L VL+R H++ Sbjct: 337 VVWGKHRENLILMDRYHFFTSSCRQFGLKCKSLSEQKSDENEAEGALASVLEVLQRIHRL 396 Query: 1892 FFDQVLNDDYSSRDVRQVLRRIRLEILQGCKVVFSRVFPAKSKAEDQPIWRMAEQLGATC 2071 FFD D+ RDVRQVL+ +R EIL+GCK+VF+ V P + + E+ W++AE+LGAT Sbjct: 397 FFDPERGDNIMERDVRQVLKTVRKEILKGCKIVFTGVIPIQCQPENHYYWKLAEKLGATF 456 Query: 2072 ATEVDASVTHVVSADTGTQKARWALQNGKFLVNPRWIEAANYFWERQREEDFPISN 2239 +TEVD SVTHVVS + T+K+R A++ KFLV+PRWIEAANY W + EE+FP+S+ Sbjct: 457 STEVDESVTHVVSMNDKTEKSRQAVREKKFLVHPRWIEAANYLWRKPPEENFPVSS 512