BLASTX nr result
ID: Rheum21_contig00017204
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00017204 (1729 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                          (bits)  Value

gb|EOY28302.1| C-terminal domain phosphatase-like 1 isoform 1 [T...   313   2e-82
ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citr...   299   3e-78
ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal doma...   296   1e-77
gb|EOY28303.1| C-terminal domain phosphatase-like 1 isoform 2 [T...   288   3e-75
ref|XP_002519032.1| double-stranded RNA binding protein, putativ...   276   2e-71
ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Popu...   272   4e-70
ref|XP_003545893.1| PREDICTED: RNA polymerase II C-terminal doma...   271   8e-70
ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal doma...   262   3e-67
ref|XP_003543063.1| PREDICTED: RNA polymerase II C-terminal doma...   261   5e-67
gb|ESW31299.1| hypothetical protein PHAVU_002G226900g [Phaseolus...   261   7e-67
ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal doma...   260   1e-66
ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal doma...   258   7e-66
ref|XP_002869873.1| hypothetical protein ARALYDRAFT_492708 [Arab...   256   2e-65
gb|EMJ15747.1| hypothetical protein PRUPE_ppa000988mg [Prunus pe...   255   4e-65
ref|XP_004232844.1| PREDICTED: RNA polymerase II C-terminal doma...   255   4e-65
ref|XP_006413749.1| hypothetical protein EUTSA_v10024324mg [Eutr...   254   6e-65
dbj|BAJ34643.1| unnamed protein product [Thellungiella halophila]     254   6e-65
ref|XP_006597420.1| PREDICTED: RNA polymerase II C-terminal doma...   253   1e-64
ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal doma...   253   2e-64
ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Popu...   253   2e-64

>gb|EOY28302.1| C-terminal domain phosphatase-like 1 isoform 1 [Theobroma cacao] Length = 978 Score = 313 bits (801), Expect = 2e-82 Identities = 190/406 (46%), Positives = 249/406 (61%), Gaps = 12/406 (2%) Frame = +2 Query: 2 PRLQSRRGWF--EDQMGSGQPGRAVAKEYTSDG--LHVEKQRPLPPPFSKKMDNFVRSER 169 PR QSR WF E++M Q RA KE+ D +H+EK R PPF K+++ + S+R Sbjct: 605 PRGQSRGSWFAAEEEMSPRQLNRAAPKEFPLDSERMHIEKHRH--PPFFPKVESSIPSDR 662 Query: 170 PYAQKQRFSREGPRRDDSF------RSFKSFQGEDVSISRSSSGNKDFENESGRSYSSNK 331 + QR S+E RDD S+ SF GE++ +S+SSS ++D + ES Sbjct: 663 LLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFES-------- 714 Query: 332 DWDSDSGRSSNYRDSENEPGQSVSLSETPAGALRDIARKFGAKVEYKSSIAASADLRFLM 511 G++V+ ET AG L+DIA K GAKVE++ ++ AS DL+F + Sbjct: 715 -------------------GRTVTSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSI 755 Query: 512 EVSFAGERVGEGKGKTRREAQHYAAEASLMNLADTYLSHIKSESSPKRGDVSRMAGAND- 688 E FAGE+VGEG G+TRREAQ AAE S+ NLA+TYLS IK +S GD+SR+ ND Sbjct: 756 EAWFAGEKVGEGVGRTRREAQRQAAEESIKNLANTYLSRIKPDSGSAEGDLSRLHNINDN 815 Query: 689 GFFCDSNSFGNQLRHSEEPRRLSEPVEAAGSHDGDGGSQDLTSEGTLKSMSSVSALTELC 868 GF + NSFGNQL EE S E + D EG+ KSM SV+AL ELC Sbjct: 816 GFPSNVNSFGNQLLAKEESLSFSTASEQSRLADP-------RLEGSKKSMGSVTALKELC 868 Query: 869 MAEGFGLDFKIERPHSADSLHKDEVYAQVEIDGQAFSRGTGLTWDEAKVQAAEKALTNLK 1048 M EG G+ F+ + P S+++L KDEVYAQVEIDGQ +GTGLTW+EAK+QAAEKAL +L+ Sbjct: 869 MMEGLGVVFQPQPPSSSNALQKDEVYAQVEIDGQVLGKGTGLTWEEAKMQAAEKALGSLR 928 Query: 1049 SMLGYSNQKRPVSPRAVQGMSTKRTRSDLSRGLQRYPAS-RYARNA 1183 SMLG +QKR SPR++QGM KR + + R LQR P+S RY +NA Sbjct: 929 SMLGQYSQKRQGSPRSLQGMQNKRLKPEFPRVLQRMPSSGRYPKNA 974

>ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citrus clementina] gi|557551913|gb|ESR62542.1| hypothetical protein CICLE_v10014168mg [Citrus clementina] Length = 957 Score = 299 bits (765), Expect = 3e-78 Identities = 181/406 (44%), Positives = 243/406 (59%), Gaps = 12/406 (2%) Frame = +2 Query: 2
PRLQSRRGWF--EDQMGSGQPGRAVAKEY--TSDGLHVEKQRPLPPPFSKKMDNFVRSER 169 PR+ SR WF E++M Q RAV KE+ S+ + +EK RP P F K++N + S+R Sbjct: 587 PRVPSRGSWFPVEEEMSPRQLNRAVPKEFPLNSEAMQIEKHRPPHPSFFPKIENSITSDR 646 Query: 170 PYAQKQRFSREGPRRDDSFR------SFKSFQGEDVSISRSSSGNKDFENESGRSYSSNK 331 P+ + QR +E RRDD R ++SF GE++ +SRSSS ++D + ESG Sbjct: 647 PH-ENQRMPKEALRRDDRLRLNHTLSDYQSFSGEEIPLSRSSSSSRDVDFESG------- 698 Query: 332 DWDSDSGRSSNYRDSENEPGQSVSLSETPAGALRDIARKFGAKVEYKSSIAASADLRFLM 511 + VS +ETP+G L+DIA K G KVE++ ++ AS +L+F + Sbjct: 699 --------------------RDVSSTETPSGVLQDIAMKCGTKVEFRPALVASTELQFSI 738 Query: 512 EVSFAGERVGEGKGKTRREAQHYAAEASLMNLADTYLSHIKSESSPKRGDVSRMAGANDG 691 E FAGE++GEG G+TRREAQ AAE S+ +LA+ Y+ +KS+S GD SR + AN+ Sbjct: 739 EAWFAGEKIGEGIGRTRREAQRQAAEGSIKHLANVYVLRVKSDSGSGHGDGSRFSNANEN 798 Query: 692 FFC-DSNSFGNQLRHSEEPRRLSEPVEAAGSHDGDGGSQDLTSEGTLKSMSSVSALTELC 868 F + NSFG Q +E SEP + D EG+ K M SVSAL ELC Sbjct: 799 CFMGEINSFGGQPLAKDESLS-SEPSKLV----------DPRLEGSKKLMGSVSALKELC 847 Query: 869 MAEGFGLDFKIERPHSADSLHKDEVYAQVEIDGQAFSRGTGLTWDEAKVQAAEKALTNLK 1048 M EG G+ F+ + P SA+S+ KDEVYAQVEIDGQ +G G TWDEAK+QAAEKAL +L+ Sbjct: 848 MTEGLGVVFQQQPPSSANSVQKDEVYAQVEIDGQVLGKGIGSTWDEAKMQAAEKALGSLR 907 Query: 1049 SMLGYSNQKRPVSPRAVQGMSTKRTRSDLSRGLQRYPAS-RYARNA 1183 SM G QK SPR++QGM KR + + R LQR P S RY +NA Sbjct: 908 SMFGQFPQKHQGSPRSLQGMPNKRLKPEFPRVLQRMPPSGRYPKNA 953 >ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like [Citrus sinensis] Length = 957 Score = 296 bits (759), Expect = 1e-77 Identities = 181/406 (44%), Positives = 242/406 (59%), Gaps = 12/406 (2%) Frame = +2 Query: 2 PRLQSRRGWF--EDQMGSGQPGRAVAKEY--TSDGLHVEKQRPLPPPFSKKMDNFVRSER 169 PR+ SR WF E++M Q RAV KE+ S+ + +EK RP P F K++N S+R Sbjct: 587 PRVPSRGSWFPVEEEMSPRQLNRAVPKEFPLNSEAMQIEKHRPPHPSFFPKIENPSTSDR 646 Query: 170 PYAQKQRFSREGPRRDDSFR------SFKSFQGEDVSISRSSSGNKDFENESGRSYSSNK 331 P+ + QR +E RRDD R ++SF GE++ +SRSSS ++D + ESG Sbjct: 647 PH-ENQRMPKEALRRDDRLRLNHTLSDYQSFSGEEIPLSRSSSSSRDVDFESG------- 
698 Query: 332 DWDSDSGRSSNYRDSENEPGQSVSLSETPAGALRDIARKFGAKVEYKSSIAASADLRFLM 511 + VS +ETP+G L+DIA K G KVE++ ++ AS +L+F + Sbjct: 699 --------------------RDVSSTETPSGVLQDIAMKCGTKVEFRPALVASTELQFSI 738 Query: 512 EVSFAGERVGEGKGKTRREAQHYAAEASLMNLADTYLSHIKSESSPKRGDVSRMAGANDG 691 E FAGE++GEG G+TRREAQ AAE S+ +LA+ Y+ +KS+S GD SR + AN+ Sbjct: 739 EAWFAGEKIGEGIGRTRREAQRQAAEGSIKHLANVYMLRVKSDSGSGHGDGSRFSNANEN 798 Query: 692 FFC-DSNSFGNQLRHSEEPRRLSEPVEAAGSHDGDGGSQDLTSEGTLKSMSSVSALTELC 868 F + NSFG Q +E SEP + D EG+ K M SVSAL ELC Sbjct: 799 CFMGEINSFGGQPLAKDESLS-SEPSKLV----------DPRLEGSKKLMGSVSALKELC 847 Query: 869 MAEGFGLDFKIERPHSADSLHKDEVYAQVEIDGQAFSRGTGLTWDEAKVQAAEKALTNLK 1048 M EG G+ F+ + P SA+S+ KDEVYAQVEIDGQ +G G TWDEAK+QAAEKAL +L+ Sbjct: 848 MTEGLGVVFQQQPPSSANSVQKDEVYAQVEIDGQVLGKGIGSTWDEAKMQAAEKALGSLR 907 Query: 1049 SMLGYSNQKRPVSPRAVQGMSTKRTRSDLSRGLQRYPAS-RYARNA 1183 SM G QK SPR++QGM KR + + R LQR P S RY +NA Sbjct: 908 SMFGQFPQKHQGSPRSLQGMPNKRLKPEFPRVLQRMPPSGRYPKNA 953 >gb|EOY28303.1| C-terminal domain phosphatase-like 1 isoform 2 [Theobroma cacao] Length = 984 Score = 288 bits (738), Expect(2) = 3e-75 Identities = 176/377 (46%), Positives = 228/377 (60%), Gaps = 11/377 (2%) Frame = +2 Query: 2 PRLQSRRGWF--EDQMGSGQPGRAVAKEYTSDG--LHVEKQRPLPPPFSKKMDNFVRSER 169 PR QSR WF E++M Q RA KE+ D +H+EK R PPF K+++ + S+R Sbjct: 605 PRGQSRGSWFAAEEEMSPRQLNRAAPKEFPLDSERMHIEKHRH--PPFFPKVESSIPSDR 662 Query: 170 PYAQKQRFSREGPRRDDSF------RSFKSFQGEDVSISRSSSGNKDFENESGRSYSSNK 331 + QR S+E RDD S+ SF GE++ +S+SSS ++D + ES Sbjct: 663 LLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFES-------- 714 Query: 332 DWDSDSGRSSNYRDSENEPGQSVSLSETPAGALRDIARKFGAKVEYKSSIAASADLRFLM 511 G++V+ ET AG L+DIA K GAKVE++ ++ AS DL+F + Sbjct: 715 -------------------GRTVTSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSI 755 Query: 512 EVSFAGERVGEGKGKTRREAQHYAAEASLMNLADTYLSHIKSESSPKRGDVSRMAGAND- 688 E FAGE+VGEG G+TRREAQ AAE S+ NLA+TYLS IK +S GD+SR+ ND Sbjct: 756 EAWFAGEKVGEGVGRTRREAQRQAAEESIKNLANTYLSRIKPDSGSAEGDLSRLHNINDN 815 Query: 689 
GFFCDSNSFGNQLRHSEEPRRLSEPVEAAGSHDGDGGSQDLTSEGTLKSMSSVSALTELC 868 GF + NSFGNQL EE S E D EG+ KSM SV+AL ELC Sbjct: 816 GFPSNVNSFGNQLLAKEESLSFSTASE-------QSRLADPRLEGSKKSMGSVTALKELC 868 Query: 869 MAEGFGLDFKIERPHSADSLHKDEVYAQVEIDGQAFSRGTGLTWDEAKVQAAEKALTNLK 1048 M EG G+ F+ + P S+++L KDEVYAQVEIDGQ +GTGLTW+EAK+QAAEKAL +L+ Sbjct: 869 MMEGLGVVFQPQPPSSSNALQKDEVYAQVEIDGQVLGKGTGLTWEEAKMQAAEKALGSLR 928 Query: 1049 SMLGYSNQKRPVSPRAV 1099 SMLG +QKR SPR V Sbjct: 929 SMLGQYSQKRQGSPRCV 945 Score = 22.7 bits (47), Expect(2) = 3e-75 Identities = 7/11 (63%), Positives = 9/11 (81%) Frame = +3 Query: 1089 QGRCKACQLNA 1121 QG CK C++NA Sbjct: 974 QGHCKVCKINA 984 >ref|XP_002519032.1| double-stranded RNA binding protein, putative [Ricinus communis] gi|223541695|gb|EEF43243.1| double-stranded RNA binding protein, putative [Ricinus communis] Length = 978 Score = 276 bits (706), Expect = 2e-71 Identities = 173/406 (42%), Positives = 234/406 (57%), Gaps = 12/406 (2%) Frame = +2 Query: 2 PRLQSRRGWF--EDQMGSGQPGRAVAKEYTSDG--LHVEKQRPLPPPFSKKMDNFVRSER 169 PR+QSR W E++M Q RAV +E+ D +H++K RP P F K+++ + SER Sbjct: 603 PRVQSRGNWVPVEEEMSPRQLNRAVTREFPMDTEPMHIDKHRPHHPSFFPKVESSIPSER 662 Query: 170 PYAQKQRFSREGPRRDDSFR------SFKSFQGEDVSISRSSSGNKDFENESGRSYSSNK 331 + QR + P +DD R +++S GE+ S+SRSSS N+D + ES R+ Sbjct: 663 MPHENQRLPKVAPYKDDRLRLNQTMSNYQSLSGEENSLSRSSSSNRDLDVESDRA----- 717 Query: 332 DWDSDSGRSSNYRDSENEPGQSVSLSETPAGALRDIARKFGAKVEYKSSIAASADLRFLM 511 VS +ETP L +I+ K GAKVE+K S+ S DL+F + Sbjct: 718 ----------------------VSSAETPVRVLHEISMKCGAKVEFKHSLVNSRDLQFSV 755 Query: 512 EVSFAGERVGEGKGKTRREAQHYAAEASLMNLADTYLSHIKSESSPKRGDVSRMAGAND- 688 E FAGERVGEG G+TRREAQ AAEAS+ NLA+ Y+S K ++ GD S+ + AND Sbjct: 756 EAWFAGERVGEGFGRTRREAQSVAAEASIKNLANIYISRAKPDNGALHGDASKYSSANDN 815 Query: 689 GFFCDSNSFGNQLRHSEEPRRLSEPVEAAGSHDGDGGSQDLTSEGTLKSMSSVSALTELC 868 GF NSFG+Q +E S+ E +G D E + KSMSSV+AL E C Sbjct: 816 GFLGHVNSFGSQPLPKDEILSYSDSSEQSGLLDP-------RLESSKKSMSSVNALKEFC 868 Query: 869 
MAEGFGLDFKIERPHSADSLHKDEVYAQVEIDGQAFSRGTGLTWDEAKVQAAEKALTNLK 1048 M EG G++F + P S++S+ EV+AQVEIDGQ +G G T+DEAK+QAAEKAL +L+ Sbjct: 869 MMEGLGVNFLAQTPLSSNSVQNAEVHAQVEIDGQVMGKGIGSTFDEAKMQAAEKALGSLR 928 Query: 1049 SMLGYSNQKRPVSPRAVQGMSTKRTRSDLSRGLQRYPAS-RYARNA 1183 + G KR SPR V GM K + + R LQR P+S RY +NA Sbjct: 929 TTFGRFPPKRQGSPRPVPGMPNKHLKPEFPRVLQRMPSSARYPKNA 974 >ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Populus trichocarpa] gi|550340277|gb|EEE85528.2| hypothetical protein POPTR_0004s04010g [Populus trichocarpa] Length = 996 Score = 272 bits (695), Expect = 4e-70 Identities = 174/406 (42%), Positives = 234/406 (57%), Gaps = 11/406 (2%) Frame = +2 Query: 2 PRLQSRRGWF--EDQMGSGQPGRAVAK-EYTSDGLHVEKQRPLPPPFSKKMDNFVRSERP 172 PR+QS W E++M Q R + SD +++EK R P F K+++ + S+R Sbjct: 623 PRVQSVGSWVPVEEEMSPRQLNRTPREFPLDSDPMNIEKHRTHHPSFFHKVESNIPSDRM 682 Query: 173 YAQKQRFSREGPRRDDSFR------SFKSFQGEDVSISRSSSGNKDFENESGRSYSSNKD 334 + QR +E RDD + ++ SFQGE+ +SRSSS N+D + ES R++ Sbjct: 683 IHENQRQPKEATYRDDRMKLNHSTSNYPSFQGEESPLSRSSS-NRDLDLESERAF----- 736 Query: 335 WDSDSGRSSNYRDSENEPGQSVSLSETPAGALRDIARKFGAKVEYKSSIAASADLRFLME 514 S +ETP L++IA K G KVE++ ++ A++DL+F +E Sbjct: 737 ----------------------SSTETPVEVLQEIAMKCGTKVEFRPALIATSDLQFSIE 774 Query: 515 VSFAGERVGEGKGKTRREAQHYAAEASLMNLADTYLSHIKSESSPKRGDVSRMAGAND-G 691 F GE+VGEG GKTRREAQ AAE S+ LA Y+S +K +S P GD SR AND G Sbjct: 775 TWFVGEKVGEGTGKTRREAQRQAAEGSIKKLAGIYMSRVKPDSGPMLGDSSRYPSANDNG 834 Query: 692 FFCDSNSFGNQLRHSEEPRRLSEPVEAAGSHDGDGGSQDLTSEGTLKSMSSVSALTELCM 871 F D NSFGNQ +E S E + D Q L EG+ KSM SV+AL E CM Sbjct: 835 FLGDMNSFGNQPLLKDENITYSATSEPSRLLD-----QRL--EGSKKSMGSVTALKEFCM 887 Query: 872 AEGFGLDFKIERPHSADSLHKDEVYAQVEIDGQAFSRGTGLTWDEAKVQAAEKALTNLKS 1051 EG G++F + P S +S+ +EV+AQVEIDGQ +G GLTWDEAK+QAAEKAL +L++ Sbjct: 888 TEGLGVNFLAQTPLSTNSIPGEEVHAQVEIDGQVLGKGIGLTWDEAKMQAAEKALGSLRT 947 Query: 1052 MLGYSNQKRPVSPRAVQGMSTKRTRSDLSRGLQRYPAS-RYARNAS 1186 M G KR SPR +QGM KR + + R LQR P+S RY +NAS Sbjct: 948 
MFGQYTPKRQGSPRLMQGMPNKRLKQEFPRVLQRMPSSARYHKNAS 993 >ref|XP_003545893.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like isoform X1 [Glycine max] Length = 958 Score = 271 bits (692), Expect = 8e-70 Identities = 173/408 (42%), Positives = 236/408 (57%), Gaps = 14/408 (3%) Frame = +2 Query: 2 PRLQSRRGWF--EDQMGSGQPGRAVAKEYT--SDGLHVEKQRPLPPPFSKKMDNFVRSER 169 P + SRRGWF E++MG Q + V KE+ S+ LH+EK+ P P K+D+ V S+R Sbjct: 582 PSVPSRRGWFSVEEEMGPQQLNQLVPKEFPVGSEPLHIEKRWPRHPSLFSKVDDSVSSDR 641 Query: 170 PYAQK-QRFSREGPRRDD------SFRSFKSFQGEDVSISRSSSGNKDFENESGRSYSSN 328 + + QR +E RDD S S+ SF G+D+ +S SS N+DF++ES Sbjct: 642 VFHESHQRLPKEVHHRDDHSRLSQSLSSYHSFPGDDIPLSGSSYSNRDFDSES------- 694 Query: 329 KDWDSDSGRSSNYRDSENEPGQSVSLSETPAGALRDIARKFGAKVEYKSSIAASADLRFL 508 G+S+ ++ AG L++IA K G KVE+ SS+ AS L+F Sbjct: 695 --------------------GRSLFHADITAGVLQEIALKCGTKVEFLSSLVASTALQFS 734 Query: 509 MEVSFAGERVGEGKGKTRREAQHYAAEASLMNLADTYLSHIKSESSPKRGDVSRMAGA-N 685 +E FAG++VGEG G+TRREAQ+ AAE S+ LAD Y+SH K +S GDVS G+ N Sbjct: 735 IEAWFAGKKVGEGFGRTRREAQNKAAECSIKQLADIYMSHAKDDSGSTYGDVSGFHGSNN 794 Query: 686 DGFFCDSNSFGNQLRHSEEPRRLSEPVEAAGSHDGDGGSQDLTSEGTLKSMSSVSALTEL 865 +GF NS GNQL E V + S D S D E + +S S+SAL E Sbjct: 795 NGFVSSGNSLGNQLLPKES-------VSFSTSSDSSRVS-DPRLEVSKRSTDSISALKEF 846 Query: 866 CMAEGFGLDFKIE-RPHSADSLHKDEVYAQVEIDGQAFSRGTGLTWDEAKVQAAEKALTN 1042 CM EG +F+ P S KDEV+AQVEIDGQ F +G GLTW+EAK+QAA+KAL + Sbjct: 847 CMMEGLAANFQSSPAPASTHFAQKDEVHAQVEIDGQIFGKGFGLTWEEAKMQAAKKALES 906 Query: 1043 LKSMLGYSNQKRPVSPRAVQGMSTKRTRSDLSRGLQRYP-ASRYARNA 1183 L++M +KR SPR++QG++ KR + + R LQR P ++RY RNA Sbjct: 907 LRTMFNQGTRKRHGSPRSMQGLANKRLKQEYPRTLQRIPYSARYPRNA 954 >ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like [Fragaria vesca subsp. 
vesca] Length = 955 Score = 262 bits (670), Expect = 3e-67 Identities = 166/406 (40%), Positives = 232/406 (57%), Gaps = 12/406 (2%) Frame = +2 Query: 2 PRLQSRRGWF--EDQMGSGQPGRAVAKE--YTSDGLHVEKQRPLPPPFSKKMDNFVRSER 169 PR+QSR GWF E++M + R V KE S+ + +EK R F K++N + S+R Sbjct: 583 PRVQSRGGWFPVEEEMSPRKLSRMVPKEPPLNSEPMQIEKHRSHHSAFFPKVENSMPSDR 642 Query: 170 PYAQKQRFSREGPRRDDSFR------SFKSFQGEDVSISRSSSGNKDFENESGRSYSSNK 331 + QR +E RD+ R + SF GE+ ++RSSS N+DF+ ESG Sbjct: 643 ILQENQRLPKEAFHRDNRLRFNQAMSGYHSFSGEEPPLNRSSSSNRDFDYESG------- 695 Query: 332 DWDSDSGRSSNYRDSENEPGQSVSLSETPAGALRDIARKFGAKVEYKSSIAASADLRFLM 511 +++S +ETPAG L++IA K G KVE++ ++ S +L+F + Sbjct: 696 --------------------RAISNAETPAGVLQEIAMKCGTKVEFRPALVPSTELQFYV 735 Query: 512 EVSFAGERVGEGKGKTRREAQHYAAEASLMNLADTYLSHIKSESSPKRGDVSRMAG-AND 688 E FAGE++GEG G+TRREA AAE SL NLA+ Y+S K ++ P GD S+ + N+ Sbjct: 736 EAWFAGEKIGEGTGRTRREAHFQAAEGSLKNLANIYISRGKPDALPIHGDASKFSNVTNN 795 Query: 689 GFFCDSNSFGNQLRHSEEPRRLSEPVEAAGSHDGDGGSQDLTSEGTLKSMSSVSALTELC 868 GF + NSFG Q E+ S E + D + + KS+SSVSAL ELC Sbjct: 796 GFMGNMNSFGTQPLPKEDSLSSSTSSEPS-------RPLDPRLDNSRKSVSSVSALKELC 848 Query: 869 MAEGFGLDFKIERPHSADSLHKDEVYAQVEIDGQAFSRGTGLTWDEAKVQAAEKALTNLK 1048 EG + ++ RP +S KDEV+ Q EIDG+ +G GLTWDEAK+QAAEKAL NL+ Sbjct: 849 TMEGLSVLYQ-PRPPPPNSTEKDEVHVQAEIDGEVLGKGIGLTWDEAKMQAAEKALGNLR 907 Query: 1049 SMLGYSNQKRPVSPRAVQGMSTKRTRSDLSRGLQRYPAS-RYARNA 1183 S L QKR SPR +QGM +KR + + + LQR P+S RY++NA Sbjct: 908 STL--YGQKRQGSPRPLQGMPSKRLKQEFPQVLQRMPSSTRYSKNA 951 >ref|XP_003543063.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like isoform X1 [Glycine max] gi|571500215|ref|XP_006594604.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like isoform X2 [Glycine max] Length = 960 Score = 261 bits (668), Expect = 5e-67 Identities = 167/406 (41%), Positives = 231/406 (56%), Gaps = 16/406 (3%) Frame = +2 Query: 14 SRRGWF--EDQMGSGQPGRAVAKEYTSDG--LHVEKQRPLPPPFSKKMDNFVRSERPYAQ 181 SRRGWF E++MG Q V KE+ D H+EK+ P P F K+ + + 
S+R + + Sbjct: 587 SRRGWFSVEEEMGPQQLNLPVPKEFPVDSEPFHIEKRWPRHPSFFSKVGDSISSDRVFHE 646 Query: 182 K-QRFSREGPRRDD------SFRSFKSFQGEDVSISRSSSGNKDFENESGRSYSSNKDWD 340 QR +E RDD S S+ S G+D+ +S SS N+DF++ES Sbjct: 647 SHQRLPKEVHHRDDRSRLSQSLSSYHSLPGDDIPLSGSSYSNRDFDSES----------- 695 Query: 341 SDSGRSSNYRDSENEPGQSVSLSETPAGALRDIARKFGAKVEYKSSIAASADLRFLMEVS 520 G+S+ ++T AG L++IA G KVE+ SS+ AS +L+F +E Sbjct: 696 ----------------GRSLFHADTTAGVLQEIALNCGTKVEFLSSLVASTELQFSIEAW 739 Query: 521 FAGERVGEGKGKTRREAQHYAAEASLMNLADTYLSHIKSESSPKRGDVSRMAGA-NDGFF 697 FAG+++GEG G+TRREAQ AA S+ LAD Y+SH K +S GDVS G+ NDGF Sbjct: 740 FAGKKIGEGFGRTRREAQSKAAGCSIKQLADIYMSHAKDDSGSTYGDVSGFHGSNNDGFV 799 Query: 698 CDSNSFGNQLRHSEEPRRLSEPVEAAGSHDGDGGSQDLTSEGTLKSMSSVSALTELCMAE 877 NS GNQL EE S E++ D E + +S S+SAL ELCM E Sbjct: 800 SSGNSLGNQLLPKEESGSFSTASESSRVSDS-------RLEVSKRSTDSISALKELCMME 852 Query: 878 GFGLDFKIERPHSADSLH---KDEVYAQVEIDGQAFSRGTGLTWDEAKVQAAEKALTNLK 1048 G F + P ++ S H KDEV+AQVEIDGQ F +G G+TW+EAK+QAA+KAL +L+ Sbjct: 853 GLAASF--QSPPASASTHLTQKDEVHAQVEIDGQIFGKGFGVTWEEAKMQAAKKALGSLR 910 Query: 1049 SMLGYSNQKRPVSPRAVQGMSTKRTRSDLSRGLQRYP-ASRYARNA 1183 +M + KR SPR++QG++ KR + + LQR P ++RY RNA Sbjct: 911 TMFNQGSLKRHGSPRSMQGLANKRLKPEYPPTLQRVPYSARYPRNA 956 >gb|ESW31299.1| hypothetical protein PHAVU_002G226900g [Phaseolus vulgaris] Length = 964 Score = 261 bits (667), Expect = 7e-67 Identities = 169/408 (41%), Positives = 231/408 (56%), Gaps = 14/408 (3%) Frame = +2 Query: 2 PRLQSRRGWF--EDQMGSGQPGRAVAKEYTSDG--LHVEKQRPLPPPFSKKMDNFVRSER 169 PR+ SR GWF E+ +GS R V KE++ D L +EK RP P F K+++ + S+R Sbjct: 587 PRVSSRGGWFPAEEDIGSQPLNRVVPKEFSVDSGSLVIEKHRPHHPSFFSKVESSISSDR 646 Query: 170 P-YAQKQRFSREGPRRDDSFRS------FKSFQGEDVSISRSSSGNKDFENESGRSYSSN 328 + QR +E RDD RS ++S +++ SRSSS Sbjct: 647 ILHDSHQRLPKEMYHRDDRPRSNHMLSSYRSLSVDEIPFSRSSS---------------- 690 Query: 329 KDWDSDSGRSSNYRDSENEPGQSVSLSETPAGALRDIARKFGAKVEYKSSIAASADLRFL 508 ++RD ++E SV ++TP L++IA K G KVE+ SS+ AS +L+F Sbjct: 691 
-----------SHRDLDSESSHSVFHADTPVVVLQEIALKCGTKVEFMSSLVASTELQFS 739 Query: 509 MEVSFAGERVGEGKGKTRREAQHYAAEASLMNLADTYLSHIKSESSPKRGDVSRMAGAND 688 +E F+G+++G G G+TR+EAQH AAE S+ +LAD YLS K E GDV AND Sbjct: 740 IEAWFSGKKIGHGFGRTRKEAQHKAAEDSIKHLADIYLSSAKDEPGSTYGDVGGFPNAND 799 Query: 689 -GFFCDSNSFGNQLRHSEEPRRLSEPVEAAGSHDGDGGSQDLTSEGTLKSMSSVSALTEL 865 G+ ++S NQ E+ S + + D E + + M S+SAL EL Sbjct: 800 NGYMVIASSLSNQPLPKEDSASFSTASDPSRVLDP-------RLEVSKRPMGSISALKEL 852 Query: 866 CMAEGFGLDF-KIERPHSADSLHKDEVYAQVEIDGQAFSRGTGLTWDEAKVQAAEKALTN 1042 CM EG G++F P S +SL KDEV+AQVEIDG+ F +G GLTWDEAK+QAAEKAL + Sbjct: 853 CMMEGLGVNFLSAPAPVSTNSLQKDEVHAQVEIDGKVFGKGIGLTWDEAKMQAAEKALGS 912 Query: 1043 LKSMLGYSNQKRPVSPRAVQGMSTKRTRSDLSRGLQRYPAS-RYARNA 1183 L+S LG S QKR SPR+ QG S KR + + R +QR P+S RY RNA Sbjct: 913 LRSKLGQSIQKRQSSPRSHQGFSNKRLKQEYPRAMQRIPSSTRYPRNA 960 >ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like [Glycine max] Length = 960 Score = 260 bits (664), Expect = 1e-66 Identities = 171/409 (41%), Positives = 234/409 (57%), Gaps = 15/409 (3%) Frame = +2 Query: 2 PRLQSRRG-WF--EDQMGSGQPGRAVAKEYTSDG--LHVEKQRPLPPPFSKKMDNFVRSE 166 PR+ S RG WF E+++GS R V KE+ D L +EK R P F K+++ + S+ Sbjct: 583 PRVPSSRGVWFPVEEEIGSQPLNRVVPKEFPVDSGPLGIEKPRLHHPSFFNKVESSISSD 642 Query: 167 RP-YAQKQRFSREGPRRDDSFR------SFKSFQGEDVSISRSSSGNKDFENESGRSYSS 325 R + QR +E RDD R S++SF G+D+ SRSSS Sbjct: 643 RILHDSHQRLPKEMYHRDDRPRLNHMLSSYRSFSGDDIPFSRSSS--------------- 687 Query: 326 NKDWDSDSGRSSNYRDSENEPGQSVSLSETPAGALRDIARKFGAKVEYKSSIAASADLRF 505 ++RD ++E G SV ++TP L +IA K G KV++ SS+ AS +L+F Sbjct: 688 ------------SHRDLDSESGHSVLHADTPVAVLHEIALKCGTKVDFMSSLVASTELKF 735 Query: 506 LMEVSFAGERVGEGKGKTRREAQHYAAEASLMNLADTYLSHIKSESSPKRGDVSRMAGAN 685 +E F+G+++G G G+TR+EAQ+ AA+ S+ +LAD YLS K E GDVS N Sbjct: 736 SLEAWFSGKKIGHGFGRTRKEAQNKAAKDSIEHLADIYLSSAKDEPGSTYGDVSGFPNVN 795 Query: 686 D-GFFCDSNSFGNQLRHSEEPRRLSEPVEAAGSHDGDGGSQDLTSEGTLKSMSSVSALTE 862 D G+ ++S GNQ LS+ A+ S + D + + +SM S+SAL E Sbjct: 
796 DNGYMGIASSLGNQ--------PLSKEDSASFSSASPSRALDPRLDVSKRSMGSISALKE 847 Query: 863 LCMAEGFGLDF-KIERPHSADSLHKDEVYAQVEIDGQAFSRGTGLTWDEAKVQAAEKALT 1039 LCM EG G++F P S +S+ KDEV+AQVEIDG+ F +G GLTWDEAK+QAAEKAL Sbjct: 848 LCMMEGLGVNFLSTPAPVSTNSVQKDEVHAQVEIDGKIFGKGIGLTWDEAKMQAAEKALG 907 Query: 1040 NLKSMLGYSNQKRPVSPRAVQGMSTKRTRSDLSRGLQRYPAS-RYARNA 1183 NL+S LG S QK SPR QG S KR + + R +QR P+S RY RNA Sbjct: 908 NLRSKLGQSIQKMQSSPRPHQGFSNKRLKQEYPRTMQRMPSSARYPRNA 956 >ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like [Solanum tuberosum] Length = 953 Score = 258 bits (658), Expect = 7e-66 Identities = 183/413 (44%), Positives = 231/413 (55%), Gaps = 18/413 (4%) Frame = +2 Query: 2 PRLQSRRGWF--EDQMGSGQPGRAVA-KEY--TSDGLHVEKQRPLPPPFSKKMDNFVRSE 166 PR+Q GWF E++M Q R + KE+ + +H+ K RP PPF KM+ + S+ Sbjct: 582 PRVQPH-GWFPAEEEMSPRQLNRPLPPKEFPLNPESMHINKHRPPHPPFLPKMETSMPSD 640 Query: 167 RPYAQKQRFSREGPRRDDSFR---SFKSFQ--GEDVSISRSSSGNKDFENESGRSYSSNK 331 R + QR +E RDD R S SF+ GE+V + RSSS N Sbjct: 641 RVLFENQRLPKEVIPRDDRMRFSQSQPSFRPPGEEVPLGRSSSSN--------------- 685 Query: 332 DWDSDSGRSSNYRDSENEPGQSVSLSETPAGALRDIARKFGAKVEYKSSIAASADLRFLM 511 R + EPG ETPAGAL+DIA K GAKVE++SS +S +L+F + Sbjct: 686 ------------RVLDLEPGHYDPYLETPAGALQDIAFKCGAKVEFRSSFLSSPELQFSL 733 Query: 512 EVSFAGERVGEGKGKTRREAQHYAAEASLMNLADTYLSHIKSESSPKRGDVSRMAGANDG 691 EV FAGE+VGEG G+TRREAQ AAE SLM LAD YLS IK +SS +GD R A+D Sbjct: 734 EVLFAGEKVGEGTGRTRREAQRRAAEESLMYLADKYLSCIKPDSSSTQGDGFRFPNASDN 793 Query: 692 FFCDSNS-FGNQLRHS----EEPRRLSEPVEAAGSHDGDGGSQDLTSEGTLKSMSSVSAL 856 F D+ S FG Q R S EP R+ +P E KS+ SV AL Sbjct: 794 GFVDNMSPFGYQDRVSHSFASEPPRVLDP----------------RLEVFKKSVGSVGAL 837 Query: 857 TELCMAEGFGLDFKIERPHSADSLHKDEVYAQVEIDGQAFSRGTGLTWDEAKVQAAEKAL 1036 ELC EG GL F+ + SA+ K E+YAQVEIDGQ F +G G TWD+AK QAAE+AL Sbjct: 838 RELCAIEGLGLAFQTQPQLSANPGQKSEIYAQVEIDGQVFGKGIGSTWDDAKTQAAERAL 897 Query: 1037 TNLKSMLGYSNQKRPVSPRAV-QGMSTKRTRSDLSRGL-QRYPAS-RYARNAS 1186 LKS L +QKR SPR++ QG S KR + + SRG+ QR P 
S R+ +N S Sbjct: 898 VALKSELAQFSQKRQGSPRSLQQGFSNKRLKPEYSRGVQQRVPLSGRFPKNTS 950 >ref|XP_002869873.1| hypothetical protein ARALYDRAFT_492708 [Arabidopsis lyrata subsp. lyrata] gi|297315709|gb|EFH46132.1| hypothetical protein ARALYDRAFT_492708 [Arabidopsis lyrata subsp. lyrata] Length = 965 Score = 256 bits (655), Expect = 2e-65 Identities = 158/396 (39%), Positives = 214/396 (54%), Gaps = 5/396 (1%) Frame = +2 Query: 2 PRLQSRRGWF--EDQMGSGQPGRAVAKEYTSDG--LHVEKQRPLPPPFSKKMDNFVRSER 169 P +QSR GWF E++M Q RAV+KEY D +H+EK RP P F K+DN +S+R Sbjct: 604 PHVQSRNGWFPVEEEMDPAQIRRAVSKEYPLDSEMIHMEKHRPRHPSFFSKIDNSTQSDR 663 Query: 170 PYAQKQRFSREGPRRDDSFRSFKSFQGEDVSISRSSSGNKDFENESGRSYSSNKDWDSDS 349 + +R +E RRD+ R + G S Y W+ S Sbjct: 664 MLHENRRQPKESLRRDEQLRPNNNLPG------------------SHPFYGEEASWNQSS 705 Query: 350 GRSSNYRDSENEPGQSVSLSETPAGALRDIARKFGAKVEYKSSIAASADLRFLMEVSFAG 529 R+S D + P +SVS +E+ A L IA K G KVEY+ S+ AS +LRF +E + Sbjct: 706 SRNS---DLDFLPERSVSATESSADVLHGIAIKCGTKVEYRPSLVASTNLRFSVEAWLSN 762 Query: 530 ERVGEGKGKTRREAQHYAAEASLMNLADTYLSHIKSESSPKRGDVSRMAGANDGFFCDSN 709 E++GEG GK+RREA H AAEAS+ NLAD Y+ H + P D S N Sbjct: 763 EKIGEGIGKSRREALHKAAEASIQNLADVYI-HANGDPGPSHRDASPFTNGN-------M 814 Query: 710 SFGNQLRHSEEPRRLSEPVEAAGSHDGDGGSQDLTSEGTLKSMSSVSALTELCMAEGFGL 889 GN +P E S D EG+++ S++AL ELC +EGF + Sbjct: 815 IMGNASALDNQPFARDETAMPVSSRPTDP-----RLEGSMRHTGSITALRELCASEGFEM 869 Query: 890 DFKIERPHSADSLHKDEVYAQVEIDGQAFSRGTGLTWDEAKVQAAEKALTNLKSMLGYSN 1069 F+ +RP +D +H+DE+ AQVEIDG+ G G TWDEA++QAAE+AL +++SMLG Sbjct: 870 SFQSQRPLPSDMVHRDELRAQVEIDGRVVGEGVGSTWDEARMQAAERALCSVRSMLGQPV 929 Query: 1070 QKRPVSPRAVQGMSTKRTRSDLSRGLQRYPAS-RYA 1174 KR SPR+ GMS KR + D R LQR P+S RY+ Sbjct: 930 HKRQGSPRSFAGMSNKRLKPDFQRSLQRMPSSGRYS 965 >gb|EMJ15747.1| hypothetical protein PRUPE_ppa000988mg [Prunus persica] Length = 940 Score = 255 bits (652), Expect = 4e-65 Identities = 171/406 (42%), Positives = 225/406 (55%), Gaps = 12/406 (2%) Frame = +2 Query: 2 
PRLQSRRGWF--EDQMGSGQPGRAVAKEYTSDG--LHVEKQRPLPPPFSKKMDNFVRSER 169 PR QSR GWF E++M Q R V K+ D + +EK RP F K++N + S+R Sbjct: 585 PRAQSRPGWFPVEEEMSPRQLSRMVPKDLPLDPETVQIEKHRPHHSSFFPKVENSIPSDR 644 Query: 170 PYAQKQRFSREGPRRDDSFR------SFKSFQGEDVSISRSSSGNKDFENESGRSYSSNK 331 + QR +E RDD R + S GE++ +SRSSS N+D + ESG Sbjct: 645 ILQENQRLPKEAFHRDDRLRFNHALSGYHSLSGEEIPLSRSSSSNRDVDFESG------- 697 Query: 332 DWDSDSGRSSNYRDSENEPGQSVSLSETPAGALRDIARKFGAKVEYKSSIAASADLRFLM 511 +++S +ETPAG L++IA K GAK Sbjct: 698 --------------------RAISNAETPAGVLQEIAMKCGAKAW--------------- 722 Query: 512 EVSFAGERVGEGKGKTRREAQHYAAEASLMNLADTYLSHIKSESSPKRGDVSRMAGAN-D 688 FAGE++GEG GKTRREA + AAE SL NLA+ YLS +K +S GD+++ N + Sbjct: 723 ---FAGEKIGEGSGKTRREAHYQAAEGSLKNLANIYLSRVKPDSVSVHGDMNKFPNVNSN 779 Query: 689 GFFCDSNSFGNQLRHSEEPRRLSEPVEAAGSHDGDGGSQDLTSEGTLKSMSSVSALTELC 868 GF + NSFG Q EE S E + D EG+ KSMSSVS L ELC Sbjct: 780 GFAGNLNSFGIQPFPKEESLSSSTSSEPSRPLDP-------RLEGSKKSMSSVSTLKELC 832 Query: 869 MAEGFGLDFKIERPHSADSLHKDEVYAQVEIDGQAFSRGTGLTWDEAKVQAAEKALTNLK 1048 M EG G+ F+ P S +S+ KDEV+ QVEIDG+ +G GLTWDEAK+QAAEKAL +L Sbjct: 833 MMEGLGVVFQPRPPPSTNSVEKDEVHVQVEIDGEVLGKGIGLTWDEAKMQAAEKALGSLT 892 Query: 1049 SMLGYSNQKRPVSPRAVQGMSTKRTRSDLSRGLQRYPAS-RYARNA 1183 S L Y+ QKR SPR++QGMS+KR + + + LQR P+S RY +NA Sbjct: 893 STL-YA-QKRQGSPRSLQGMSSKRMKQEFPQVLQRMPSSARYPKNA 936 >ref|XP_004232844.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like [Solanum lycopersicum] Length = 954 Score = 255 bits (652), Expect = 4e-65 Identities = 181/414 (43%), Positives = 233/414 (56%), Gaps = 19/414 (4%) Frame = +2 Query: 2 PRLQSRRGWF--EDQMGSGQPGRAVA-KEY--TSDGLHVEKQRPLPPPFSKKMDNFVRSE 166 PR+Q GWF E+++ Q R + KE+ + +H+ K RP PPF KM+ + S+ Sbjct: 582 PRVQPH-GWFPAEEEVSPRQLNRPLPPKEFPLNPESMHINKHRPPHPPFLPKMETSMPSD 640 Query: 167 RPYAQKQRFSREGPRRDDSFR---SFKSFQ--GEDVSISRSSSGNKDFENESGRSYSSNK 331 R + + QR +E RDD R S SF+ GEDVS+ RSSS SN+ Sbjct: 641 RVFFENQRLPKEVIPRDDRMRFSQSQPSFRPPGEDVSLGRSSS--------------SNR 686 Query: 332 
DWDSDSGRSSNYRDSENEPGQSVSLSETPAGALRDIARKFGAKVEYKSSIAASADLRFLM 511 D D G Y D TPAGAL+DIA K G KVE++SS +S +L+F + Sbjct: 687 VLDLDPGHYDPYLD-------------TPAGALQDIAFKCGVKVEFRSSFLSSPELQFCL 733 Query: 512 EVSFAGERVGEGKGKTRREAQHYAAEASLMNLADTYLSHIKSESSPKRGDVSRMAGAND- 688 EV FAGE+VGEG G+TRREAQ +AAE SLM LAD YLS IK++SS +GD R A+D Sbjct: 734 EVLFAGEKVGEGIGRTRREAQRHAAEESLMYLADKYLSCIKADSSSTQGDGFRFPNASDN 793 Query: 689 GFFCDSNSFGNQLRHS----EEPRRLSEPVEAAGSHDGDGGSQDLTSEGTLKSMSSVSAL 856 GF + + FG Q R S EP R+ +P E KS+ SV AL Sbjct: 794 GFVENMSPFGYQDRVSHSFASEPPRVLDP----------------RLEVFKKSVGSVGAL 837 Query: 857 TELCMAEGFGLDFKIERPHSADSLHKDEVYAQVEIDGQAFSRGTGLTWDEAKVQAAEKAL 1036 ELC EG GL F+ + S + K E+YAQVEIDGQ F +G G TWD+AK QAAE+AL Sbjct: 838 RELCAIEGLGLAFQTQPQLSVNPGQKSEIYAQVEIDGQVFGKGIGPTWDDAKTQAAERAL 897 Query: 1037 TNLKSMLGYSNQKRPVSPRAV--QGMSTKRTRSDLSRGL-QRYPAS-RYARNAS 1186 LKS L + KR SPR++ QG S KR + + SRG+ QR P S R+ +N S Sbjct: 898 VALKSELAQFSHKRQGSPRSLQQQGFSNKRLKPEYSRGVQQRVPLSGRFPKNTS 951 >ref|XP_006413749.1| hypothetical protein EUTSA_v10024324mg [Eutrema salsugineum] gi|557114919|gb|ESQ55202.1| hypothetical protein EUTSA_v10024324mg [Eutrema salsugineum] Length = 963 Score = 254 bits (650), Expect = 6e-65 Identities = 159/402 (39%), Positives = 222/402 (55%), Gaps = 11/402 (2%) Frame = +2 Query: 2 PRLQSRRGWF--EDQMGSGQPGRAVAKEYTSDG--LHVEKQRPLPPPFSKKMDNFVRSER 169 P +Q R GWF E++M R V+KEY D +H+EK RP P F K+DN +S+R Sbjct: 603 PHVQPRNGWFPVEEEMDQAPLRRTVSKEYPLDSEMIHMEKNRPRHPSFFSKIDNSTQSDR 662 Query: 170 PYAQKQRFSREGPRRDDSFRSFK------SFQGEDVSISRSSSGNKDFENESGRSYSSNK 331 + +R +E RRD+ RS SF GE+ S ++SSS N D + SGR+ Sbjct: 663 MLHENRRPPKESLRRDEQLRSNNNLPGSHSFFGEEASWNQSSSRNSDVDFISGRN----- 717 Query: 332 DWDSDSGRSSNYRDSENEPGQSVSLSETPAGALRDIARKFGAKVEYKSSIAASADLRFLM 511 V +E PA L DIA K G KVEYK + AS DLRF + Sbjct: 718 ----------------------VQAAENPAEVLHDIAVKCGTKVEYKPGLVASTDLRFSV 755 Query: 512 EVSFAGERVGEGKGKTRREAQHYAAEASLMNLADTYLSHIKSESSPKRGDVSRMAGANDG 691 E +GE++GEG GK+RREA H AAE S+ NLAD YLS + + P D S + N 
Sbjct: 756 ETWLSGEKIGEGIGKSRREALHKAAEVSIQNLADVYLSRVNGDPGPSHRDASPFSNGN-M 814 Query: 692 FFCDSNSFGNQLRHSEEPRRLSEPVEAAGSHDGDGGSQDLTSEGTLKSMSSVSALTELCM 871 ++N+ NQ +E + P+ + + D EG+L+ S++AL ELC Sbjct: 815 VMGNANTLDNQPFARDE---TAMPIPSRPT--------DPRLEGSLRHTGSITALRELCA 863 Query: 872 AEGFGLDFKIERPHSADSLHKDEVYAQVEIDGQAFSRGTGLTWDEAKVQAAEKALTNLKS 1051 +EGF + F+ +RP +D +H+DE++AQVEIDG+ G G TWDEA++QAAE+AL +++S Sbjct: 864 SEGFEMAFQSQRPLPSDMVHRDELHAQVEIDGRVLGEGVGSTWDEARMQAAERALCSVRS 923 Query: 1052 MLGYSNQKRPVSPRAVQGMSTKRTRSDLSRGLQRYPAS-RYA 1174 ML +R SPR+ GM KR + D R +QR P+S RY+ Sbjct: 924 MLPL--HRRQESPRSFAGMPNKRLKPDFQRSMQRMPSSGRYS 963 >dbj|BAJ34643.1| unnamed protein product [Thellungiella halophila] Length = 502 Score = 254 bits (650), Expect = 6e-65 Identities = 159/402 (39%), Positives = 222/402 (55%), Gaps = 11/402 (2%) Frame = +2 Query: 2 PRLQSRRGWF--EDQMGSGQPGRAVAKEYTSDG--LHVEKQRPLPPPFSKKMDNFVRSER 169 P +Q R GWF E++M R V+KEY D +H+EK RP P F K+DN +S+R Sbjct: 142 PHVQPRNGWFPVEEEMDQAPLRRTVSKEYPLDSEMIHMEKNRPRHPSFFSKIDNSTQSDR 201 Query: 170 PYAQKQRFSREGPRRDDSFRSFK------SFQGEDVSISRSSSGNKDFENESGRSYSSNK 331 + +R +E RRD+ RS SF GE+ S ++SSS N D + SGR+ Sbjct: 202 MLHENRRPPKESLRRDEQLRSNNNLPGSHSFFGEEASWNQSSSRNSDVDFISGRN----- 256 Query: 332 DWDSDSGRSSNYRDSENEPGQSVSLSETPAGALRDIARKFGAKVEYKSSIAASADLRFLM 511 V +E PA L DIA K G KVEYK + AS DLRF + Sbjct: 257 ----------------------VQAAENPAEVLHDIAVKCGTKVEYKPGLVASTDLRFSV 294 Query: 512 EVSFAGERVGEGKGKTRREAQHYAAEASLMNLADTYLSHIKSESSPKRGDVSRMAGANDG 691 E +GE++GEG GK+RREA H AAE S+ NLAD YLS + + P D S + N Sbjct: 295 ETWLSGEKIGEGIGKSRREALHKAAEVSIQNLADVYLSRVNGDPGPSHRDASPFSNGN-M 353 Query: 692 FFCDSNSFGNQLRHSEEPRRLSEPVEAAGSHDGDGGSQDLTSEGTLKSMSSVSALTELCM 871 ++N+ NQ +E + P+ + + D EG+L+ S++AL ELC Sbjct: 354 VMGNANTLDNQPFARDE---TAMPIPSRPT--------DPRLEGSLRHTGSITALRELCA 402 Query: 872 AEGFGLDFKIERPHSADSLHKDEVYAQVEIDGQAFSRGTGLTWDEAKVQAAEKALTNLKS 1051 +EGF + F+ +RP +D +H+DE++AQVEIDG+ G G TWDEA++QAAE+AL +++S Sbjct: 403 
SEGFEMAFQSQRPLPSDMVHRDELHAQVEIDGRVLGEGVGSTWDEARMQAAERALCSVRS 462 Query: 1052 MLGYSNQKRPVSPRAVQGMSTKRTRSDLSRGLQRYPAS-RYA 1174 ML +R SPR+ GM KR + D R +QR P+S RY+ Sbjct: 463 MLPL--HRRQESPRSFAGMPNKRLKPDFQRSMQRMPSSGRYS 502 >ref|XP_006597420.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like isoform X2 [Glycine max] Length = 937 Score = 253 bits (647), Expect = 1e-64 Identities = 164/401 (40%), Positives = 224/401 (55%), Gaps = 7/401 (1%) Frame = +2 Query: 2 PRLQSRRGWF--EDQMGSGQPGRAVAKEYT--SDGLHVEKQRPLPPPFSKKMDNFVRSER 169 P + SRRGWF E++MG Q + V KE+ S+ LH+EK+ P P K+ + Sbjct: 582 PSVPSRRGWFSVEEEMGPQQLNQLVPKEFPVGSEPLHIEKRWPRHPSLFSKVHH------ 635 Query: 170 PYAQKQRFSREGPRRDDSFRSFKSFQGEDVSISRSSSGNKDFENESGRSYSSNKDWDSDS 349 + R S S+ SF G+D+ +S SS N+DF++ES Sbjct: 636 --------RDDHSRLSQSLSSYHSFPGDDIPLSGSSYSNRDFDSES-------------- 673 Query: 350 GRSSNYRDSENEPGQSVSLSETPAGALRDIARKFGAKVEYKSSIAASADLRFLMEVSFAG 529 G+S+ ++ AG L++IA K G KVE+ SS+ AS L+F +E FAG Sbjct: 674 -------------GRSLFHADITAGVLQEIALKCGTKVEFLSSLVASTALQFSIEAWFAG 720 Query: 530 ERVGEGKGKTRREAQHYAAEASLMNLADTYLSHIKSESSPKRGDVSRMAGA-NDGFFCDS 706 ++VGEG G+TRREAQ+ AAE S+ LAD Y+SH K +S GDVS G+ N+GF Sbjct: 721 KKVGEGFGRTRREAQNKAAECSIKQLADIYMSHAKDDSGSTYGDVSGFHGSNNNGFVSSG 780 Query: 707 NSFGNQLRHSEEPRRLSEPVEAAGSHDGDGGSQDLTSEGTLKSMSSVSALTELCMAEGFG 886 NS GNQL E V + S D S D E + +S S+SAL E CM EG Sbjct: 781 NSLGNQLLPKES-------VSFSTSSDSSRVS-DPRLEVSKRSTDSISALKEFCMMEGLA 832 Query: 887 LDFKIE-RPHSADSLHKDEVYAQVEIDGQAFSRGTGLTWDEAKVQAAEKALTNLKSMLGY 1063 +F+ P S KDEV+AQVEIDGQ F +G GLTW+EAK+QAA+KAL +L++M Sbjct: 833 ANFQSSPAPASTHFAQKDEVHAQVEIDGQIFGKGFGLTWEEAKMQAAKKALESLRTMFNQ 892 Query: 1064 SNQKRPVSPRAVQGMSTKRTRSDLSRGLQRYP-ASRYARNA 1183 +KR SPR++QG++ KR + + R LQR P ++RY RNA Sbjct: 893 GTRKRHGSPRSMQGLANKRLKQEYPRTLQRIPYSARYPRNA 933 >ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like isoform X1 [Glycine max] Length = 956 Score = 253 bits (646), Expect = 2e-64 Identities = 170/409 (41%), 
Positives = 232/409 (56%), Gaps = 15/409 (3%) Frame = +2 Query: 2 PRLQSRRG-WF--EDQMGSGQPGRAVAKEYTSDG--LHVEKQRPLPPPFSKKMDNFVRSE 166 P + S RG WF E+++GS R V KE+ D L + K RP P F K+++ + S+ Sbjct: 579 PHVPSSRGVWFPAEEEIGSQPLNRVVPKEFPVDSGPLGIAKPRPHHPSFFSKVESSISSD 638 Query: 167 RP-YAQKQRFSREGPRRDDSFR------SFKSFQGEDVSISRSSSGNKDFENESGRSYSS 325 R + QR +E RDD R S++SF G+D+ SRS +SS Sbjct: 639 RILHDSHQRLPKEMYHRDDRPRLNHMLSSYRSFSGDDIPFSRS--------------FSS 684 Query: 326 NKDWDSDSGRSSNYRDSENEPGQSVSLSETPAGALRDIARKFGAKVEYKSSIAASADLRF 505 ++D DS+SG S + D TP L++IA K G KV++ SS+ AS +L+F Sbjct: 685 HRDLDSESGHSVLHAD-------------TPVAVLQEIALKCGTKVDFISSLVASTELQF 731 Query: 506 LMEVSFAGERVGEGKGKTRREAQHYAAEASLMNLADTYLSHIKSESSPKRGDVSRMAGAN 685 ME F+G+++G G+TR+EAQ+ AAE S+ +LAD YLS K E GDVS N Sbjct: 732 SMEAWFSGKKIGHRVGRTRKEAQNKAAEDSIKHLADIYLSSAKDEPGSTYGDVSGFPNVN 791 Query: 686 D-GFFCDSNSFGNQLRHSEEPRRLSEPVEAAGSHDGDGGSQDLTSEGTLKSMSSVSALTE 862 D G+ ++S GNQ LS+ A+ S D + + +SM S+S+L E Sbjct: 792 DSGYMGIASSLGNQ--------PLSKEDSASFSTASPSRVLDPRLDVSKRSMGSISSLKE 843 Query: 863 LCMAEGFGLDF-KIERPHSADSLHKDEVYAQVEIDGQAFSRGTGLTWDEAKVQAAEKALT 1039 LCM EG ++F P S +S+ KDEV+AQVEIDG+ F +G GLTWDEAK+QAAEKAL Sbjct: 844 LCMMEGLDVNFLSAPAPVSTNSVQKDEVHAQVEIDGKVFGKGIGLTWDEAKMQAAEKALG 903 Query: 1040 NLKSMLGYSNQKRPVSPRAVQGMSTKRTRSDLSRGLQRYPAS-RYARNA 1183 +L+S LG S QKR SPR QG S KR + + R +QR P+S RY RNA Sbjct: 904 SLRSKLGQSIQKRQSSPRPHQGFSNKRLKQEYPRPMQRMPSSARYPRNA 952 >ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Populus trichocarpa] gi|550327613|gb|ERP55122.1| hypothetical protein POPTR_0011s04910g [Populus trichocarpa] Length = 990 Score = 253 bits (646), Expect = 2e-64 Identities = 166/412 (40%), Positives = 225/412 (54%), Gaps = 20/412 (4%) Frame = +2 Query: 8 LQSRRGWF--EDQMGSGQPGRAVAK-EYTSDGLHVEKQRPLPPPFSKKMDNFVRSERPYA 178 +QSR W E++M Q R + SD +++EK + P F K+++ + S+R Sbjct: 619 VQSRGSWVPVEEEMTPRQLNRTPREFPLDSDPMNIEKHQTHHPSFFPKVESNIPSDRMIH 678 Query: 179 QKQRFSREGPRRDDSFR------SFKSFQGEDVSISRSSSGNKDFENESGRSYSSNKDWD 
340 + QR +E P R+D R ++ SFQ E+ +SRSSS N+D + ES R+++ Sbjct: 679 ENQRLPKEAPYRNDRMRLNHSTPNYHSFQVEETPLSRSSS-NRDLDLESERAFT------ 731 Query: 341 SDSGRSSNYRDSENEPGQSVSLSETPAGALRDIARKFGAKVEYKSSIAASADLRFLMEVS 520 +SETP L++IA K KVE++ ++ AS DL+F +E Sbjct: 732 ---------------------ISETPVEVLQEIAMKCETKVEFRPALVASIDLQFSIEAW 770 Query: 521 FAGERVGEGKGKTRREAQHYAAEASLMNLADTYLSHIKSESSPKRGDVSRMAGAND-GFF 697 FAGE+VGEG GKTRREAQ AAE S+ LA Y+ K +S P GD SR AND GF Sbjct: 771 FAGEKVGEGTGKTRREAQRQAAEGSIKKLAGIYMLRAKPDSGPMHGDSSRYPSANDNGFL 830 Query: 698 CDSNSFGNQ---------LRHSEEPRRLSEPVEAAGSHDGDGGSQDLTSEGTLKSMSSVS 850 + N FGNQ + EP RL +P EG+ KS SV+ Sbjct: 831 GNMNLFGNQPLPKDELVAYSAASEPSRLLDP----------------RLEGSKKSSGSVT 874 Query: 851 ALTELCMAEGFGLDFKIERPHSADSLHKDEVYAQVEIDGQAFSRGTGLTWDEAKVQAAEK 1030 AL E C EG ++F + P SA+S+ +EV+AQVEIDGQ +G G TWDEAK+QAAEK Sbjct: 875 ALKEFCTMEGLVVNFLAQTPLSANSIPGEEVHAQVEIDGQVLGKGIGSTWDEAKMQAAEK 934 Query: 1031 ALTNLKSMLGYSNQKRPVSPRAVQGMSTKRTRSDLSRGLQRYPAS-RYARNA 1183 AL +L++M G QKR SPR +QGM KR + + R LQR P S RY +NA Sbjct: 935 ALGSLRTMFGQYTQKRQGSPRPMQGMPNKRLKQEFPRVLQRMPPSARYHKNA 986