BLASTX nr result
ID: Sinomenium21_contig00023885
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00023885
         (2546 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform...   475   e-131
ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citr...   462   e-127
ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal doma...   460   e-126
ref|XP_007025681.1| C-terminal domain phosphatase-like 1 isoform...   453   e-124
ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Popu...   433   e-118
ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal doma...   423   e-115
ref|XP_007214548.1| hypothetical protein PRUPE_ppa000988mg [Prun...   419   e-114
ref|XP_002519032.1| double-stranded RNA binding protein, putativ...   414   e-113
ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Popu...   413   e-112
ref|XP_007159305.1| hypothetical protein PHAVU_002G226900g [Phas...   404   e-109
ref|XP_006827806.1| hypothetical protein AMTR_s00009p00267690 [A...   399   e-108
ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal doma...   394   e-106
ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal doma...   390   e-105
emb|CAN72816.1| hypothetical protein VITISV_004100 [Vitis vinifera]   387   e-104
ref|XP_004505032.1| PREDICTED: RNA polymerase II C-terminal doma...   377   e-101
ref|XP_003545893.1| PREDICTED: RNA polymerase II C-terminal doma...   375   e-101
ref|XP_006597421.1| PREDICTED: RNA polymerase II C-terminal doma...   373   e-100
ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal doma...
371 1e-99 ref|XP_003543063.1| PREDICTED: RNA polymerase II C-terminal doma... 371 1e-99 ref|XP_007025682.1| C-terminal domain phosphatase-like 1 isoform... 365 7e-98 >ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform 1 [Theobroma cacao] gi|508781046|gb|EOY28302.1| C-terminal domain phosphatase-like 1 isoform 1 [Theobroma cacao] Length = 978 Score = 475 bits (1222), Expect = e-131 Identities = 261/462 (56%), Positives = 305/462 (66%), Gaps = 2/462 (0%) Frame = +3 Query: 3 SFNEKQFP-QAPLVKPLGHLGASEPSLQGSPGREEGEVPESELDPDTRRRLLILQHGQDT 179 SF+ QFP AP+VKP+ + EPSLQ SP REEGEVPESELDPDTRRRLLILQHGQDT Sbjct: 524 SFSNMQFPLAAPVVKPVAPVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDT 583 Query: 180 RDHTSRESSFP-VRPAIQVPAPPSQTRGSWFLLEEEMSPRQLNRAVPKALPKEPPFELES 356 RDHT E +FP VRP +QV P Q+RGSWF EEEMSPRQLNRA PK E P + E Sbjct: 584 RDHTPPEPAFPPVRPTMQVSVPRGQSRGSWFAAEEEMSPRQLNRAAPK----EFPLDSER 639 Query: 357 IHFDKHRHPRSTFFHGVENSVSSDKTLHENRRFFKEVHRGDDRMRPNHTVSKYASFPGDE 536 +H +KHRHP FF VE+S+ SD+ L EN+R KE DDR+ NHT S Y SF G+E Sbjct: 640 MHIEKHRHP--PFFPKVESSIPSDRLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEE 697 Query: 537 MPLGSSSSSKRDFHFESERGTPTYRETPAGVLQDIAMKCGTKVEFRTTLLASRELQFSIE 716 MPL SSSS RD FES R T T ET AGVLQDIAMKCG KVEFR L+AS +LQFSIE Sbjct: 698 MPLSQSSSSHRDLDFESGR-TVTSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIE 756 Query: 717 VWFSGEKISEGIGRTRKEAQHQAAELSLRNLANKYLSTVSADNSSGNEDCSKVXXXXXXX 896 WF+GEK+ EG+GRTR+EAQ QAAE S++NLAN YLS + D+ S D S++ Sbjct: 757 AWFAGEKVGEGVGRTRREAQRQAAEESIKNLANTYLSRIKPDSGSAEGDLSRLHNINDNG 816 Query: 897 XXXXXXXXXXXXFMNXXXXXXXXXXXXXRFLGSRLEGSKKSLGSVSALKELCMAEGLALV 1076 R RLEGSKKS+GSV+ALKELCM EGL +V Sbjct: 817 FPSNVNSFGNQLLAKEESLSFSTASEQSRLADPRLEGSKKSMGSVTALKELCMMEGLGVV 876 Query: 1077 FQAPPQLSTSSIHKGEAFAQVEIGGQVLGNGIGSTWDEAKIQAAEEALGNLRSMLGQGTQ 1256 FQ P S++++ K E +AQVEI GQVLG G G TW+EAK+QAAE+ALG+LRSMLGQ +Q Sbjct: 877 FQPQPPSSSNALQKDEVYAQVEIDGQVLGKGTGLTWEEAKMQAAEKALGSLRSMLGQYSQ 936 Query: 1257 KXXXXXXXXXXXXXXXXXXDFPRVMQRVPSSTRYSSKASSVP 1382 K +FPRV+QR+PSS RY A VP Sbjct: 937 
KRQGSPRSLQGMQNKRLKPEFPRVLQRMPSSGRYPKNAPPVP 978 >ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citrus clementina] gi|557551913|gb|ESR62542.1| hypothetical protein CICLE_v10014168mg [Citrus clementina] Length = 957 Score = 462 bits (1188), Expect = e-127 Identities = 256/456 (56%), Positives = 302/456 (66%), Gaps = 1/456 (0%) Frame = +3 Query: 18 QFPQAP-LVKPLGHLGASEPSLQGSPGREEGEVPESELDPDTRRRLLILQHGQDTRDHTS 194 QFP A LVKPLGH+G E LQ SP REEGEVPESELDPDTRRRLLILQHG DTR++ Sbjct: 512 QFPPATSLVKPLGHVGPPEQCLQSSPAREEGEVPESELDPDTRRRLLILQHGMDTRENAP 571 Query: 195 RESSFPVRPAIQVPAPPSQTRGSWFLLEEEMSPRQLNRAVPKALPKEPPFELESIHFDKH 374 E+ FP R +QV P +RGSWF +EEEMSPRQLNRAV PKE P E++ +KH Sbjct: 572 SEAPFPARTQMQVSVPRVPSRGSWFPVEEEMSPRQLNRAV----PKEFPLNSEAMQIEKH 627 Query: 375 RHPRSTFFHGVENSVSSDKTLHENRRFFKEVHRGDDRMRPNHTVSKYASFPGDEMPLGSS 554 R P +FF +ENS++SD+ HEN+R KE R DDR+R NHT+S Y SF G+E+PL S Sbjct: 628 RPPHPSFFPKIENSITSDRP-HENQRMPKEALRRDDRLRLNHTLSDYQSFSGEEIPLSRS 686 Query: 555 SSSKRDFHFESERGTPTYRETPAGVLQDIAMKCGTKVEFRTTLLASRELQFSIEVWFSGE 734 SSS RD FES R + ETP+GVLQDIAMKCGTKVEFR L+AS ELQFSIE WF+GE Sbjct: 687 SSSSRDVDFESGRDVSS-TETPSGVLQDIAMKCGTKVEFRPALVASTELQFSIEAWFAGE 745 Query: 735 KISEGIGRTRKEAQHQAAELSLRNLANKYLSTVSADNSSGNEDCSKVXXXXXXXXXXXXX 914 KI EGIGRTR+EAQ QAAE S+++LAN Y+ V +D+ SG+ D S+ Sbjct: 746 KIGEGIGRTRREAQRQAAEGSIKHLANVYVLRVKSDSGSGHGDGSRFSNANENCFMGEIN 805 Query: 915 XXXXXXFMNXXXXXXXXXXXXXRFLGSRLEGSKKSLGSVSALKELCMAEGLALVFQAPPQ 1094 + + RLEGSKK +GSVSALKELCM EGL +VFQ P Sbjct: 806 SFGGQPLAK----DESLSSEPSKLVDPRLEGSKKLMGSVSALKELCMTEGLGVVFQQQPP 861 Query: 1095 LSTSSIHKGEAFAQVEIGGQVLGNGIGSTWDEAKIQAAEEALGNLRSMLGQGTQKXXXXX 1274 S +S+ K E +AQVEI GQVLG GIGSTWDEAK+QAAE+ALG+LRSM GQ QK Sbjct: 862 SSANSVQKDEVYAQVEIDGQVLGKGIGSTWDEAKMQAAEKALGSLRSMFGQFPQKHQGSP 921 Query: 1275 XXXXXXXXXXXXXDFPRVMQRVPSSTRYSSKASSVP 1382 +FPRV+QR+P S RY A VP Sbjct: 922 RSLQGMPNKRLKPEFPRVLQRMPPSGRYPKNAPPVP 957 >ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like [Citrus 
sinensis] Length = 957 Score = 460 bits (1184), Expect = e-126 Identities = 256/456 (56%), Positives = 301/456 (66%), Gaps = 1/456 (0%) Frame = +3 Query: 18 QFPQAP-LVKPLGHLGASEPSLQGSPGREEGEVPESELDPDTRRRLLILQHGQDTRDHTS 194 QFP A LVKPLGH+G E SLQ SP REEGEVPESELDPDTRRRLLILQHG DTR++ Sbjct: 512 QFPPATSLVKPLGHVGPPEQSLQSSPAREEGEVPESELDPDTRRRLLILQHGMDTRENAP 571 Query: 195 RESSFPVRPAIQVPAPPSQTRGSWFLLEEEMSPRQLNRAVPKALPKEPPFELESIHFDKH 374 E+ FP R +QV P +RGSWF +EEEMSPRQLNRAV PKE P E++ +KH Sbjct: 572 SEAPFPARTQMQVSVPRVPSRGSWFPVEEEMSPRQLNRAV----PKEFPLNSEAMQIEKH 627 Query: 375 RHPRSTFFHGVENSVSSDKTLHENRRFFKEVHRGDDRMRPNHTVSKYASFPGDEMPLGSS 554 R P +FF +EN +SD+ HEN+R KE R DDR+R NHT+S Y SF G+E+PL S Sbjct: 628 RPPHPSFFPKIENPSTSDRP-HENQRMPKEALRRDDRLRLNHTLSDYQSFSGEEIPLSRS 686 Query: 555 SSSKRDFHFESERGTPTYRETPAGVLQDIAMKCGTKVEFRTTLLASRELQFSIEVWFSGE 734 SSS RD FES R + ETP+GVLQDIAMKCGTKVEFR L+AS ELQFSIE WF+GE Sbjct: 687 SSSSRDVDFESGRDVSS-TETPSGVLQDIAMKCGTKVEFRPALVASTELQFSIEAWFAGE 745 Query: 735 KISEGIGRTRKEAQHQAAELSLRNLANKYLSTVSADNSSGNEDCSKVXXXXXXXXXXXXX 914 KI EGIGRTR+EAQ QAAE S+++LAN Y+ V +D+ SG+ D S+ Sbjct: 746 KIGEGIGRTRREAQRQAAEGSIKHLANVYMLRVKSDSGSGHGDGSRFSNANENCFMGEIN 805 Query: 915 XXXXXXFMNXXXXXXXXXXXXXRFLGSRLEGSKKSLGSVSALKELCMAEGLALVFQAPPQ 1094 + + RLEGSKK +GSVSALKELCM EGL +VFQ P Sbjct: 806 SFGGQPLAK----DESLSSEPSKLVDPRLEGSKKLMGSVSALKELCMTEGLGVVFQQQPP 861 Query: 1095 LSTSSIHKGEAFAQVEIGGQVLGNGIGSTWDEAKIQAAEEALGNLRSMLGQGTQKXXXXX 1274 S +S+ K E +AQVEI GQVLG GIGSTWDEAK+QAAE+ALG+LRSM GQ QK Sbjct: 862 SSANSVQKDEVYAQVEIDGQVLGKGIGSTWDEAKMQAAEKALGSLRSMFGQFPQKHQGSP 921 Query: 1275 XXXXXXXXXXXXXDFPRVMQRVPSSTRYSSKASSVP 1382 +FPRV+QR+P S RY A VP Sbjct: 922 RSLQGMPNKRLKPEFPRVLQRMPPSGRYPKNAPPVP 957 >ref|XP_007025681.1| C-terminal domain phosphatase-like 1 isoform 2 [Theobroma cacao] gi|508781047|gb|EOY28303.1| C-terminal domain phosphatase-like 1 isoform 2 [Theobroma cacao] Length = 984 Score = 453 bits (1166), Expect = e-124 Identities = 247/421 (58%), Positives = 288/421 (68%), Gaps = 2/421 (0%) 
Frame = +3 Query: 3 SFNEKQFP-QAPLVKPLGHLGASEPSLQGSPGREEGEVPESELDPDTRRRLLILQHGQDT 179 SF+ QFP AP+VKP+ + EPSLQ SP REEGEVPESELDPDTRRRLLILQHGQDT Sbjct: 524 SFSNMQFPLAAPVVKPVAPVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDT 583 Query: 180 RDHTSRESSFP-VRPAIQVPAPPSQTRGSWFLLEEEMSPRQLNRAVPKALPKEPPFELES 356 RDHT E +FP VRP +QV P Q+RGSWF EEEMSPRQLNRA PK E P + E Sbjct: 584 RDHTPPEPAFPPVRPTMQVSVPRGQSRGSWFAAEEEMSPRQLNRAAPK----EFPLDSER 639 Query: 357 IHFDKHRHPRSTFFHGVENSVSSDKTLHENRRFFKEVHRGDDRMRPNHTVSKYASFPGDE 536 +H +KHRHP FF VE+S+ SD+ L EN+R KE DDR+ NHT S Y SF G+E Sbjct: 640 MHIEKHRHP--PFFPKVESSIPSDRLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEE 697 Query: 537 MPLGSSSSSKRDFHFESERGTPTYRETPAGVLQDIAMKCGTKVEFRTTLLASRELQFSIE 716 MPL SSSS RD FES R T T ET AGVLQDIAMKCG KVEFR L+AS +LQFSIE Sbjct: 698 MPLSQSSSSHRDLDFESGR-TVTSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIE 756 Query: 717 VWFSGEKISEGIGRTRKEAQHQAAELSLRNLANKYLSTVSADNSSGNEDCSKVXXXXXXX 896 WF+GEK+ EG+GRTR+EAQ QAAE S++NLAN YLS + D+ S D S++ Sbjct: 757 AWFAGEKVGEGVGRTRREAQRQAAEESIKNLANTYLSRIKPDSGSAEGDLSRLHNINDNG 816 Query: 897 XXXXXXXXXXXXFMNXXXXXXXXXXXXXRFLGSRLEGSKKSLGSVSALKELCMAEGLALV 1076 R RLEGSKKS+GSV+ALKELCM EGL +V Sbjct: 817 FPSNVNSFGNQLLAKEESLSFSTASEQSRLADPRLEGSKKSMGSVTALKELCMMEGLGVV 876 Query: 1077 FQAPPQLSTSSIHKGEAFAQVEIGGQVLGNGIGSTWDEAKIQAAEEALGNLRSMLGQGTQ 1256 FQ P S++++ K E +AQVEI GQVLG G G TW+EAK+QAAE+ALG+LRSMLGQ +Q Sbjct: 877 FQPQPPSSSNALQKDEVYAQVEIDGQVLGKGTGLTWEEAKMQAAEKALGSLRSMLGQYSQ 936 Query: 1257 K 1259 K Sbjct: 937 K 937 >ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Populus trichocarpa] gi|550340277|gb|EEE85528.2| hypothetical protein POPTR_0004s04010g [Populus trichocarpa] Length = 996 Score = 433 bits (1114), Expect = e-118 Identities = 244/460 (53%), Positives = 291/460 (63%), Gaps = 1/460 (0%) Frame = +3 Query: 6 FNEKQFPQ-APLVKPLGHLGASEPSLQGSPGREEGEVPESELDPDTRRRLLILQHGQDTR 182 F QFPQ AP VK LG + EPSLQ SP REEGEVPESELDPDTRRRLLILQHG D+R Sbjct: 544 FPNTQFPQVAPSVKQLGQVVPPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGHDSR 603 Query: 183 
DHTSRESSFPVRPAIQVPAPPSQTRGSWFLLEEEMSPRQLNRAVPKALPKEPPFELESIH 362 D+ ES FP RP+ QV AP Q+ GSW +EEEMSPRQLNR P+E P + + ++ Sbjct: 604 DNAPSESPFPARPSTQVSAPRVQSVGSWVPVEEEMSPRQLNRT-----PREFPLDSDPMN 658 Query: 363 FDKHRHPRSTFFHGVENSVSSDKTLHENRRFFKEVHRGDDRMRPNHTVSKYASFPGDEMP 542 +KHR +FFH VE+++ SD+ +HEN+R KE DDRM+ NH+ S Y SF G+E P Sbjct: 659 IEKHRTHHPSFFHKVESNIPSDRMIHENQRQPKEATYRDDRMKLNHSTSNYPSFQGEESP 718 Query: 543 LGSSSSSKRDFHFESERGTPTYRETPAGVLQDIAMKCGTKVEFRTTLLASRELQFSIEVW 722 L S SSS RD ESER + ETP VLQ+IAMKCGTKVEFR L+A+ +LQFSIE W Sbjct: 719 L-SRSSSNRDLDLESERAFSS-TETPVEVLQEIAMKCGTKVEFRPALIATSDLQFSIETW 776 Query: 723 FSGEKISEGIGRTRKEAQHQAAELSLRNLANKYLSTVSADNSSGNEDCSKVXXXXXXXXX 902 F GEK+ EG G+TR+EAQ QAAE S++ LA Y+S V D+ D S+ Sbjct: 777 FVGEKVGEGTGKTRREAQRQAAEGSIKKLAGIYMSRVKPDSGPMLGDSSRYPSANDNGFL 836 Query: 903 XXXXXXXXXXFMNXXXXXXXXXXXXXRFLGSRLEGSKKSLGSVSALKELCMAEGLALVFQ 1082 + R L RLEGSKKS+GSV+ALKE CM EGL + F Sbjct: 837 GDMNSFGNQPLLKDENITYSATSEPSRLLDQRLEGSKKSMGSVTALKEFCMTEGLGVNFL 896 Query: 1083 APPQLSTSSIHKGEAFAQVEIGGQVLGNGIGSTWDEAKIQAAEEALGNLRSMLGQGTQKX 1262 A LST+SI E AQVEI GQVLG GIG TWDEAK+QAAE+ALG+LR+M GQ T K Sbjct: 897 AQTPLSTNSIPGEEVHAQVEIDGQVLGKGIGLTWDEAKMQAAEKALGSLRTMFGQYTPKR 956 Query: 1263 XXXXXXXXXXXXXXXXXDFPRVMQRVPSSTRYSSKASSVP 1382 +FPRV+QR+PSS RY AS VP Sbjct: 957 QGSPRLMQGMPNKRLKQEFPRVLQRMPSSARYHKNASPVP 996 >ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like [Fragaria vesca subsp. 
vesca] Length = 955 Score = 423 bits (1088), Expect = e-115 Identities = 243/460 (52%), Positives = 287/460 (62%), Gaps = 1/460 (0%) Frame = +3 Query: 6 FNEKQFPQ-APLVKPLGHLGASEPSLQGSPGREEGEVPESELDPDTRRRLLILQHGQDTR 182 F+ QFPQ A LVKPLGH+G ++ L SP REEGEVPESELDPDTRRRLLILQHGQDTR Sbjct: 504 FHNVQFPQSASLVKPLGHVGPADLGLHSSPAREEGEVPESELDPDTRRRLLILQHGQDTR 563 Query: 183 DHTSRESSFPVRPAIQVPAPPSQTRGSWFLLEEEMSPRQLNRAVPKALPKEPPFELESIH 362 + E SFPVRP +QV P Q+RG WF +EEEMSPR+L+R V PKEPP E + Sbjct: 564 ESVPSEPSFPVRPQVQVSVPRVQSRGGWFPVEEEMSPRKLSRMV----PKEPPLNSEPMQ 619 Query: 363 FDKHRHPRSTFFHGVENSVSSDKTLHENRRFFKEVHRGDDRMRPNHTVSKYASFPGDEMP 542 +KHR S FF VENS+ SD+ L EN+R KE D+R+R N +S Y SF G+E P Sbjct: 620 IEKHRSHHSAFFPKVENSMPSDRILQENQRLPKEAFHRDNRLRFNQAMSGYHSFSGEEPP 679 Query: 543 LGSSSSSKRDFHFESERGTPTYRETPAGVLQDIAMKCGTKVEFRTTLLASRELQFSIEVW 722 L SSSS RDF +ES R + ETPAGVLQ+IAMKCGTKVEFR L+ S ELQF +E W Sbjct: 680 LNRSSSSNRDFDYESGRAI-SNAETPAGVLQEIAMKCGTKVEFRPALVPSTELQFYVEAW 738 Query: 723 FSGEKISEGIGRTRKEAQHQAAELSLRNLANKYLSTVSADNSSGNEDCSKVXXXXXXXXX 902 F+GEKI EG GRTR+EA QAAE SL+NLAN Y+S D + D SK Sbjct: 739 FAGEKIGEGTGRTRREAHFQAAEGSLKNLANIYISRGKPDALPIHGDASKFSNVTNNGFM 798 Query: 903 XXXXXXXXXXFMNXXXXXXXXXXXXXRFLGSRLEGSKKSLGSVSALKELCMAEGLALVFQ 1082 R L RL+ S+KS+ SVSALKELC EGL++++Q Sbjct: 799 GNMNSFGTQPLPKEDSLSSSTSSEPSRPLDPRLDNSRKSVSSVSALKELCTMEGLSVLYQ 858 Query: 1083 APPQLSTSSIHKGEAFAQVEIGGQVLGNGIGSTWDEAKIQAAEEALGNLRSMLGQGTQKX 1262 P +S K E Q EI G+VLG GIG TWDEAK+QAAE+ALGNLRS L QK Sbjct: 859 PRPP-PPNSTEKDEVHVQAEIDGEVLGKGIGLTWDEAKMQAAEKALGNLRSTL--YGQKR 915 Query: 1263 XXXXXXXXXXXXXXXXXDFPRVMQRVPSSTRYSSKASSVP 1382 +FP+V+QR+PSSTRYS A VP Sbjct: 916 QGSPRPLQGMPSKRLKQEFPQVLQRMPSSTRYSKNAPPVP 955 >ref|XP_007214548.1| hypothetical protein PRUPE_ppa000988mg [Prunus persica] gi|462410413|gb|EMJ15747.1| hypothetical protein PRUPE_ppa000988mg [Prunus persica] Length = 940 Score = 419 bits (1077), Expect = e-114 Identities = 242/461 (52%), Positives = 284/461 (61%), Gaps = 1/461 (0%) Frame = +3 Query: 3 
SFNEKQFPQAP-LVKPLGHLGASEPSLQGSPGREEGEVPESELDPDTRRRLLILQHGQDT 179 SF QFPQA LVKPLGH+G++EPSLQ SP REEGEVPESELDPDTRRRLLILQHGQDT Sbjct: 505 SFPSIQFPQAASLVKPLGHVGSAEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDT 564 Query: 180 RDHTSRESSFPVRPAIQVPAPPSQTRGSWFLLEEEMSPRQLNRAVPKALPKEPPFELESI 359 RD E FPVRP +Q P +Q+R WF +EEEMSPRQL+R VPK LP +P E++ Sbjct: 565 RDQPPSEPPFPVRPPMQASVPRAQSRPGWFPVEEEMSPRQLSRMVPKDLPLDP----ETV 620 Query: 360 HFDKHRHPRSTFFHGVENSVSSDKTLHENRRFFKEVHRGDDRMRPNHTVSKYASFPGDEM 539 +KHR S+FF VENS+ SD+ L EN+R KE DDR+R NH +S Y S G+E+ Sbjct: 621 QIEKHRPHHSSFFPKVENSIPSDRILQENQRLPKEAFHRDDRLRFNHALSGYHSLSGEEI 680 Query: 540 PLGSSSSSKRDFHFESERGTPTYRETPAGVLQDIAMKCGTKVEFRTTLLASRELQFSIEV 719 PL SSSS RD FES R + ETPAGVLQ+IAMKCG K Sbjct: 681 PLSRSSSSNRDVDFESGRAI-SNAETPAGVLQEIAMKCGAK------------------A 721 Query: 720 WFSGEKISEGIGRTRKEAQHQAAELSLRNLANKYLSTVSADNSSGNEDCSKVXXXXXXXX 899 WF+GEKI EG G+TR+EA +QAAE SL+NLAN YLS V D+ S + D +K Sbjct: 722 WFAGEKIGEGSGKTRREAHYQAAEGSLKNLANIYLSRVKPDSVSVHGDMNKFPNVNSNGF 781 Query: 900 XXXXXXXXXXXFMNXXXXXXXXXXXXXRFLGSRLEGSKKSLGSVSALKELCMAEGLALVF 1079 F R L RLEGSKKS+ SVS LKELCM EGL +VF Sbjct: 782 AGNLNSFGIQPFPKEESLSSSTSSEPSRPLDPRLEGSKKSMSSVSTLKELCMMEGLGVVF 841 Query: 1080 QAPPQLSTSSIHKGEAFAQVEIGGQVLGNGIGSTWDEAKIQAAEEALGNLRSMLGQGTQK 1259 Q P ST+S+ K E QVEI G+VLG GIG TWDEAK+QAAE+ALG+L S L QK Sbjct: 842 QPRPPPSTNSVEKDEVHVQVEIDGEVLGKGIGLTWDEAKMQAAEKALGSLTSTL--YAQK 899 Query: 1260 XXXXXXXXXXXXXXXXXXDFPRVMQRVPSSTRYSSKASSVP 1382 +FP+V+QR+PSS RY A VP Sbjct: 900 RQGSPRSLQGMSSKRMKQEFPQVLQRMPSSARYPKNAPPVP 940 >ref|XP_002519032.1| double-stranded RNA binding protein, putative [Ricinus communis] gi|223541695|gb|EEF43243.1| double-stranded RNA binding protein, putative [Ricinus communis] Length = 978 Score = 414 bits (1065), Expect = e-113 Identities = 234/463 (50%), Positives = 287/463 (61%), Gaps = 3/463 (0%) Frame = +3 Query: 3 SFNEKQFPQA-PLVKPLGHLGASEPSLQGSPGREEGEVPESELDPDTRRRLLILQHGQDT 179 +F Q PQA PLVKPLG + SEPSLQ SP REEGEVPESELDPDTRRRLLILQHGQD Sbjct: 521 
TFPSMQLPQAAPLVKPLGQVVPSEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDL 580 Query: 180 RDHTSRESSFPVRPA--IQVPAPPSQTRGSWFLLEEEMSPRQLNRAVPKALPKEPPFELE 353 RD ES FPVRP+ +QV P Q+RG+W +EEEMSPRQLNRAV + E P + E Sbjct: 581 RDPAPSESPFPVRPSNSMQVSVPRVQSRGNWVPVEEEMSPRQLNRAVTR----EFPMDTE 636 Query: 354 SIHFDKHRHPRSTFFHGVENSVSSDKTLHENRRFFKEVHRGDDRMRPNHTVSKYASFPGD 533 +H DKHR +FF VE+S+ S++ HEN+R K DDR+R N T+S Y S G+ Sbjct: 637 PMHIDKHRPHHPSFFPKVESSIPSERMPHENQRLPKVAPYKDDRLRLNQTMSNYQSLSGE 696 Query: 534 EMPLGSSSSSKRDFHFESERGTPTYRETPAGVLQDIAMKCGTKVEFRTTLLASRELQFSI 713 E L SSSS RD ES+R + ETP VL +I+MKCG KVEF+ +L+ SR+LQFS+ Sbjct: 697 ENSLSRSSSSNRDLDVESDRAVSS-AETPVRVLHEISMKCGAKVEFKHSLVNSRDLQFSV 755 Query: 714 EVWFSGEKISEGIGRTRKEAQHQAAELSLRNLANKYLSTVSADNSSGNEDCSKVXXXXXX 893 E WF+GE++ EG GRTR+EAQ AAE S++NLAN Y+S DN + + D SK Sbjct: 756 EAWFAGERVGEGFGRTRREAQSVAAEASIKNLANIYISRAKPDNGALHGDASKYSSANDN 815 Query: 894 XXXXXXXXXXXXXFMNXXXXXXXXXXXXXRFLGSRLEGSKKSLGSVSALKELCMAEGLAL 1073 L RLE SKKS+ SV+ALKE CM EGL + Sbjct: 816 GFLGHVNSFGSQPLPKDEILSYSDSSEQSGLLDPRLESSKKSMSSVNALKEFCMMEGLGV 875 Query: 1074 VFQAPPQLSTSSIHKGEAFAQVEIGGQVLGNGIGSTWDEAKIQAAEEALGNLRSMLGQGT 1253 F A LS++S+ E AQVEI GQV+G GIGST+DEAK+QAAE+ALG+LR+ G+ Sbjct: 876 NFLAQTPLSSNSVQNAEVHAQVEIDGQVMGKGIGSTFDEAKMQAAEKALGSLRTTFGRFP 935 Query: 1254 QKXXXXXXXXXXXXXXXXXXDFPRVMQRVPSSTRYSSKASSVP 1382 K +FPRV+QR+PSS RY A VP Sbjct: 936 PKRQGSPRPVPGMPNKHLKPEFPRVLQRMPSSARYPKNAPPVP 978 >ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Populus trichocarpa] gi|550327613|gb|ERP55122.1| hypothetical protein POPTR_0011s04910g [Populus trichocarpa] Length = 990 Score = 413 bits (1061), Expect = e-112 Identities = 237/460 (51%), Positives = 284/460 (61%), Gaps = 1/460 (0%) Frame = +3 Query: 6 FNEKQFPQ-APLVKPLGHLGASEPSLQGSPGREEGEVPESELDPDTRRRLLILQHGQDTR 182 F QFPQ APLVK LG + EPSLQ SP REEGEVPESELDPDTRRRLLILQHGQD+R Sbjct: 538 FPNTQFPQVAPLVKQLGQVVHPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDSR 597 Query: 183 DHTSRESSFPVRPAIQVPAPPSQTRGSWFLLEEEMSPRQLNRAVPKALPKEPPFELESIH 362 
D+ ES FP RP+ V A Q+RGSW +EEEM+PRQLNR P+E P + + ++ Sbjct: 598 DNAPSESPFPARPSAPVSAAHVQSRGSWVPVEEEMTPRQLNRT-----PREFPLDSDPMN 652 Query: 363 FDKHRHPRSTFFHGVENSVSSDKTLHENRRFFKEVHRGDDRMRPNHTVSKYASFPGDEMP 542 +KH+ +FF VE+++ SD+ +HEN+R KE +DRMR NH+ Y SF +E P Sbjct: 653 IEKHQTHHPSFFPKVESNIPSDRMIHENQRLPKEAPYRNDRMRLNHSTPNYHSFQVEETP 712 Query: 543 LGSSSSSKRDFHFESERGTPTYRETPAGVLQDIAMKCGTKVEFRTTLLASRELQFSIEVW 722 L S SSS RD ESER T ETP VLQ+IAMKC TKVEFR L+AS +LQFSIE W Sbjct: 713 L-SRSSSNRDLDLESERAF-TISETPVEVLQEIAMKCETKVEFRPALVASIDLQFSIEAW 770 Query: 723 FSGEKISEGIGRTRKEAQHQAAELSLRNLANKYLSTVSADNSSGNEDCSKVXXXXXXXXX 902 F+GEK+ EG G+TR+EAQ QAAE S++ LA Y+ D+ + D S+ Sbjct: 771 FAGEKVGEGTGKTRREAQRQAAEGSIKKLAGIYMLRAKPDSGPMHGDSSRYPSANDNGFL 830 Query: 903 XXXXXXXXXXFMNXXXXXXXXXXXXXRFLGSRLEGSKKSLGSVSALKELCMAEGLALVFQ 1082 R L RLEGSKKS GSV+ALKE C EGL + F Sbjct: 831 GNMNLFGNQPLPKDELVAYSAASEPSRLLDPRLEGSKKSSGSVTALKEFCTMEGLVVNFL 890 Query: 1083 APPQLSTSSIHKGEAFAQVEIGGQVLGNGIGSTWDEAKIQAAEEALGNLRSMLGQGTQKX 1262 A LS +SI E AQVEI GQVLG GIGSTWDEAK+QAAE+ALG+LR+M GQ TQK Sbjct: 891 AQTPLSANSIPGEEVHAQVEIDGQVLGKGIGSTWDEAKMQAAEKALGSLRTMFGQYTQKR 950 Query: 1263 XXXXXXXXXXXXXXXXXDFPRVMQRVPSSTRYSSKASSVP 1382 +FPRV+QR+P S RY A VP Sbjct: 951 QGSPRPMQGMPNKRLKQEFPRVLQRMPPSARYHKNAPPVP 990 >ref|XP_007159305.1| hypothetical protein PHAVU_002G226900g [Phaseolus vulgaris] gi|561032720|gb|ESW31299.1| hypothetical protein PHAVU_002G226900g [Phaseolus vulgaris] Length = 964 Score = 404 bits (1037), Expect = e-109 Identities = 230/462 (49%), Positives = 283/462 (61%), Gaps = 3/462 (0%) Frame = +3 Query: 6 FNEKQFPQ-APLVKPLGHLGASEPSLQGSPGREEGEVPESELDPDTRRRLLILQHGQDTR 182 F QFPQ A LVKP+G SE SL SP REEGEVPESELDPDTRRRLLILQHGQDTR Sbjct: 508 FTHVQFPQPAALVKPMGQAAPSESSLHSSPAREEGEVPESELDPDTRRRLLILQHGQDTR 567 Query: 183 DHTSRESSFPVRPAIQVPAPPSQTRGSWFLLEEEMSPRQLNRAVPKALPKEPPFELESIH 362 DHTS E ++ +R + V AP +RG WF EE++ + LNR V PKE + S+ Sbjct: 568 DHTSNEPTYAIRHPVPVSAPRVSSRGGWFPAEEDIGSQPLNRVV----PKEFSVDSGSLV 623 Query: 363 
FDKHRHPRSTFFHGVENSVSSDKTLHE-NRRFFKEVHRGDDRMRPNHTVSKYASFPGDEM 539 +KHR +FF VE+S+SSD+ LH+ ++R KE++ DDR R NH +S Y S DE+ Sbjct: 624 IEKHRPHHPSFFSKVESSISSDRILHDSHQRLPKEMYHRDDRPRSNHMLSSYRSLSVDEI 683 Query: 540 PLGSSSSSKRDFHFESERGTPTYRETPAGVLQDIAMKCGTKVEFRTTLLASRELQFSIEV 719 P SSSS RD ES + +TP VLQ+IA+KCGTKVEF ++L+AS ELQFSIE Sbjct: 684 PFSRSSSSHRDLDSESSHSV-FHADTPVVVLQEIALKCGTKVEFMSSLVASTELQFSIEA 742 Query: 720 WFSGEKISEGIGRTRKEAQHQAAELSLRNLANKYLSTVSADNSSGNEDCSKVXXXXXXXX 899 WFSG+KI G GRTRKEAQH+AAE S+++LA+ YLS+ + S D Sbjct: 743 WFSGKKIGHGFGRTRKEAQHKAAEDSIKHLADIYLSSAKDEPGSTYGDVGGFPNANDNGY 802 Query: 900 XXXXXXXXXXXFMNXXXXXXXXXXXXXRFLGSRLEGSKKSLGSVSALKELCMAEGLALVF 1079 R L RLE SK+ +GS+SALKELCM EGL + F Sbjct: 803 MVIASSLSNQPLPKEDSASFSTASDPSRVLDPRLEVSKRPMGSISALKELCMMEGLGVNF 862 Query: 1080 -QAPPQLSTSSIHKGEAFAQVEIGGQVLGNGIGSTWDEAKIQAAEEALGNLRSMLGQGTQ 1256 AP +ST+S+ K E AQVEI G+V G GIG TWDEAK+QAAE+ALG+LRS LGQ Q Sbjct: 863 LSAPAPVSTNSLQKDEVHAQVEIDGKVFGKGIGLTWDEAKMQAAEKALGSLRSKLGQSIQ 922 Query: 1257 KXXXXXXXXXXXXXXXXXXDFPRVMQRVPSSTRYSSKASSVP 1382 K ++PR MQR+PSSTRY A +P Sbjct: 923 KRQSSPRSHQGFSNKRLKQEYPRAMQRIPSSTRYPRNAPPIP 964 >ref|XP_006827806.1| hypothetical protein AMTR_s00009p00267690 [Amborella trichopoda] gi|548832426|gb|ERM95222.1| hypothetical protein AMTR_s00009p00267690 [Amborella trichopoda] Length = 942 Score = 399 bits (1025), Expect = e-108 Identities = 219/424 (51%), Positives = 283/424 (66%), Gaps = 5/424 (1%) Frame = +3 Query: 3 SFNEKQFPQA-PLVKPLGHLGASEPSLQGSPGREEGEVPESELDPDTRRRLLILQHGQDT 179 S N KQ+ A P +KP GH+ +S+ +LQ SPGREEGEVPESELDPDTRRRLLILQHGQDT Sbjct: 502 SLNNKQYNHAVPSLKPSGHICSSDSTLQCSPGREEGEVPESELDPDTRRRLLILQHGQDT 561 Query: 180 RDHTSRESS---FPVRPAIQVPAPPSQTRGSWFLLEEEMSPRQLNRAVPKALPKEPPFEL 350 R+H + + FP+RPA+Q+ PP+Q+ G WF +EEEMSPRQL+ + +E P E Sbjct: 562 REHGTIDPPPPPFPLRPALQIAVPPAQSHGPWFPVEEEMSPRQLSHPL-----REFPLEP 616 Query: 351 ESIHFDKHRHPRSTFFHGVENSVSSDKTLHENRRFFKEVHRGDDRMRPNHTVSKYASFPG 530 E++ FD+HR FFHGV+ S+ +D+ +E +R KEV DDR+ N + Y+SFP Sbjct: 617 
EAVQFDRHR--ARPFFHGVDGSIPADRVFNEAQRLSKEVQYRDDRLHQNLPKTSYSSFPE 674 Query: 531 -DEMPLGSSSSSKRDFHFESERGTPTYRETPAGVLQDIAMKCGTKVEFRTTLLASRELQF 707 +EMP G SSS+ RD F + + P Y TP GVL+DIA+KCG+KV+FR+ ++ + ELQF Sbjct: 675 VEEMPPGQSSSNTRDVPFATGQVPPQYSPTPVGVLKDIAIKCGSKVDFRSMVVPTTELQF 734 Query: 708 SIEVWFSGEKISEGIGRTRKEAQHQAAELSLRNLANKYLSTVSADNSSGNEDCSKVXXXX 887 S+EVWF GEKI EGIG+TRKEAQ +A+E S+R LA YL+ +S D G C + Sbjct: 735 SVEVWFVGEKIGEGIGKTRKEAQFKASEASIRTLARTYLAQISPDIGLG---CGDMDDRS 791 Query: 888 XXXXXXXXXXXXXXXFMNXXXXXXXXXXXXXRFLGSRLEGSKKSLGSVSALKELCMAEGL 1067 + RFL RLEGSK+S+G VS+LKELC EGL Sbjct: 792 LGSDNGLMGDSISSAGLREDSLPIASTSEQQRFLDQRLEGSKQSIGVVSSLKELCSVEGL 851 Query: 1068 ALVFQAPPQLSTSSIHKGEAFAQVEIGGQVLGNGIGSTWDEAKIQAAEEALGNLRSMLGQ 1247 +LVF+ P T S HKGE +AQVEI G+VLG G+GS+W+EAKIQAAE+ALG+L+S L Q Sbjct: 852 SLVFKELPP--TGSNHKGEVYAQVEIAGRVLGEGVGSSWEEAKIQAAEDALGSLKSSLIQ 909 Query: 1248 GTQK 1259 TQK Sbjct: 910 RTQK 913 >ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like [Glycine max] Length = 960 Score = 394 bits (1011), Expect = e-106 Identities = 228/464 (49%), Positives = 288/464 (62%), Gaps = 5/464 (1%) Frame = +3 Query: 6 FNEKQFPQ-APLVKPLGHLGASEPSLQGSPGREEGEVPESELDPDTRRRLLILQHGQDTR 182 F QFPQ A LVKP+G S+PSL SP REEGEVPESELDPDTRRRLLILQHGQDTR Sbjct: 504 FPHVQFPQPATLVKPMGQAAPSDPSLHSSPAREEGEVPESELDPDTRRRLLILQHGQDTR 563 Query: 183 DHTSRESSFPVRPAIQVPAP--PSQTRGSWFLLEEEMSPRQLNRAVPKALPKEPPFELES 356 DH S E FPVR +Q AP PS +RG WF +EEE+ + LNR V PKE P + Sbjct: 564 DHASAEPPFPVRHPVQASAPRVPS-SRGVWFPVEEEIGSQPLNRVV----PKEFPVDSGP 618 Query: 357 IHFDKHRHPRSTFFHGVENSVSSDKTLHE-NRRFFKEVHRGDDRMRPNHTVSKYASFPGD 533 + +K R +FF+ VE+S+SSD+ LH+ ++R KE++ DDR R NH +S Y SF GD Sbjct: 619 LGIEKPRLHHPSFFNKVESSISSDRILHDSHQRLPKEMYHRDDRPRLNHMLSSYRSFSGD 678 Query: 534 EMPLGSSSSSKRDFHFESERGTPTYRETPAGVLQDIAMKCGTKVEFRTTLLASRELQFSI 713 ++P SSSS RD ES + +TP VL +IA+KCGTKV+F ++L+AS EL+FS+ Sbjct: 679 DIPFSRSSSSHRDLDSESGHSV-LHADTPVAVLHEIALKCGTKVDFMSSLVASTELKFSL 737 Query: 714 
EVWFSGEKISEGIGRTRKEAQHQAAELSLRNLANKYLSTVSADNSSGNEDCSKVXXXXXX 893 E WFSG+KI G GRTRKEAQ++AA+ S+ +LA+ YLS+ + S D S Sbjct: 738 EAWFSGKKIGHGFGRTRKEAQNKAAKDSIEHLADIYLSSAKDEPGSTYGDVSGFPNVNDN 797 Query: 894 XXXXXXXXXXXXXFMNXXXXXXXXXXXXXRFLGSRLEGSKKSLGSVSALKELCMAEGLAL 1073 ++ R L RL+ SK+S+GS+SALKELCM EGL + Sbjct: 798 GYMGIASSLGNQP-LSKEDSASFSSASPSRALDPRLDVSKRSMGSISALKELCMMEGLGV 856 Query: 1074 VF-QAPPQLSTSSIHKGEAFAQVEIGGQVLGNGIGSTWDEAKIQAAEEALGNLRSMLGQG 1250 F P +ST+S+ K E AQVEI G++ G GIG TWDEAK+QAAE+ALGNLRS LGQ Sbjct: 857 NFLSTPAPVSTNSVQKDEVHAQVEIDGKIFGKGIGLTWDEAKMQAAEKALGNLRSKLGQS 916 Query: 1251 TQKXXXXXXXXXXXXXXXXXXDFPRVMQRVPSSTRYSSKASSVP 1382 QK ++PR MQR+PSS RY A +P Sbjct: 917 IQKMQSSPRPHQGFSNKRLKQEYPRTMQRMPSSARYPRNAPPIP 960 >ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like isoform X1 [Glycine max] Length = 956 Score = 390 bits (1002), Expect = e-105 Identities = 230/464 (49%), Positives = 287/464 (61%), Gaps = 5/464 (1%) Frame = +3 Query: 6 FNEKQFPQ-APLVKPLGHLGASEPSLQGSPGREEGEVPESELDPDTRRRLLILQHGQDTR 182 F QFPQ A LVKP+G SEPSL SP REEGEVPESELDPDTRRRLLILQHGQDTR Sbjct: 500 FPHVQFPQPATLVKPMGQAAPSEPSLHSSPAREEGEVPESELDPDTRRRLLILQHGQDTR 559 Query: 183 DHTSRESSFPVRPAIQVPAP--PSQTRGSWFLLEEEMSPRQLNRAVPKALPKEPPFELES 356 DH S E FPVR +Q AP PS +RG WF EEE+ + LNR V PKE P + Sbjct: 560 DHASAEPPFPVRHPVQTSAPHVPS-SRGVWFPAEEEIGSQPLNRVV----PKEFPVDSGP 614 Query: 357 IHFDKHRHPRSTFFHGVENSVSSDKTLHE-NRRFFKEVHRGDDRMRPNHTVSKYASFPGD 533 + K R +FF VE+S+SSD+ LH+ ++R KE++ DDR R NH +S Y SF GD Sbjct: 615 LGIAKPRPHHPSFFSKVESSISSDRILHDSHQRLPKEMYHRDDRPRLNHMLSSYRSFSGD 674 Query: 534 EMPLGSSSSSKRDFHFESERGTPTYRETPAGVLQDIAMKCGTKVEFRTTLLASRELQFSI 713 ++P S SS RD ES + +TP VLQ+IA+KCGTKV+F ++L+AS ELQFS+ Sbjct: 675 DIPFSRSFSSHRDLDSESGHSV-LHADTPVAVLQEIALKCGTKVDFISSLVASTELQFSM 733 Query: 714 EVWFSGEKISEGIGRTRKEAQHQAAELSLRNLANKYLSTVSADNSSGNEDCSKVXXXXXX 893 E WFSG+KI +GRTRKEAQ++AAE S+++LA+ YLS+ + S D S Sbjct: 734 EAWFSGKKIGHRVGRTRKEAQNKAAEDSIKHLADIYLSSAKDEPGSTYGDVSGFPNVNDS 793 
Query: 894 XXXXXXXXXXXXXFMNXXXXXXXXXXXXXRFLGSRLEGSKKSLGSVSALKELCMAEGLAL 1073 ++ R L RL+ SK+S+GS+S+LKELCM EGL + Sbjct: 794 GYMGIASSLGNQP-LSKEDSASFSTASPSRVLDPRLDVSKRSMGSISSLKELCMMEGLDV 852 Query: 1074 VF-QAPPQLSTSSIHKGEAFAQVEIGGQVLGNGIGSTWDEAKIQAAEEALGNLRSMLGQG 1250 F AP +ST+S+ K E AQVEI G+V G GIG TWDEAK+QAAE+ALG+LRS LGQ Sbjct: 853 NFLSAPAPVSTNSVQKDEVHAQVEIDGKVFGKGIGLTWDEAKMQAAEKALGSLRSKLGQS 912 Query: 1251 TQKXXXXXXXXXXXXXXXXXXDFPRVMQRVPSSTRYSSKASSVP 1382 QK ++PR MQR+PSS RY A +P Sbjct: 913 IQKRQSSPRPHQGFSNKRLKQEYPRPMQRMPSSARYPRNAPPIP 956 >emb|CAN72816.1| hypothetical protein VITISV_004100 [Vitis vinifera] Length = 894 Score = 387 bits (995), Expect = e-104 Identities = 234/461 (50%), Positives = 281/461 (60%), Gaps = 2/461 (0%) Frame = +3 Query: 6 FNEKQFPQ-APLVKPLGHLGASEPSLQGSPGREEGEVPESELDPDTRRRLLILQHGQDTR 182 F+ KQFPQ A L+KPL A EP++Q SP REEGEVPESELDPDTRRRLLILQHGQDTR Sbjct: 478 FSNKQFPQSASLIKPL----APEPTMQSSPAREEGEVPESELDPDTRRRLLILQHGQDTR 533 Query: 183 DHTSRESSFPVRPAIQVPAPPSQTRGSWFLLEEEMSPRQLNRAVPKALPKEPPFELESIH 362 +H S + FPVRP IQV P Q+RGSWF +EEMSPRQLNRAV PKE P + +++H Sbjct: 534 EHASSDPPFPVRPPIQVSVPRVQSRGSWFPADEEMSPRQLNRAV----PKEFPLDSDTMH 589 Query: 363 FDKHRHPRSTFFHGVENSVSSDKTLHENRRFFKEVHRGDDRMRPNHTVSKYASFPGDEMP 542 +KHR +FFH VE+S SSD+ LHEN+R KEV DDR+R NH++ Y SF G+E+P Sbjct: 590 IEKHRPHHPSFFHKVESSASSDRILHENQRLSKEVLHRDDRLRLNHSLPGYHSFSGEEVP 649 Query: 543 LGSSSSSKRDFHFESERGTPTYRETPA-GVLQDIAMKCGTKVEFRTTLLASRELQFSIEV 719 LG SSS+ RD FES RG P Y ETPA G+L++ C EV Sbjct: 650 LGRSSSN-RDLDFESGRGAP-YAETPAVGLLRN----CN-------------------EV 684 Query: 720 WFSGEKISEGIGRTRKEAQHQAAELSLRNLANKYLSTVSADNSSGNEDCSKVXXXXXXXX 899 W GEKI EG G+TR+EAQ QAAE SL L+ +YL + D ++ Sbjct: 685 WNQGEKIGEGTGKTRREAQCQAAEASLMYLSYRYL----------HGDVNRFPNASDNNF 734 Query: 900 XXXXXXXXXXXFMNXXXXXXXXXXXXXRFLGSRLEGSKKSLGSVSALKELCMAEGLALVF 1079 F R L RLE SKKS+GS+SALKELCM EGL + F Sbjct: 735 MSDTNSFGYQSFPKEGSMSFSTASESSRLLDPRLESSKKSMGSISALKELCMMEGLGVEF 794 Query: 1080 
QAPPQLSTSSIHKGEAFAQVEIGGQVLGNGIGSTWDEAKIQAAEEALGNLRSMLGQGTQK 1259 + P LS++S K E AQVEI GQVLG G GSTWD+AK+QAAE+ALG+L+SMLGQ +QK Sbjct: 795 LSQPPLSSNSTQKEEICAQVEIDGQVLGKGTGSTWDDAKMQAAEKALGSLKSMLGQFSQK 854 Query: 1260 XXXXXXXXXXXXXXXXXXDFPRVMQRVPSSTRYSSKASSVP 1382 +F R +QR PSS RYS S VP Sbjct: 855 -RQGSPRSLQGMGKRLKSEFTRGLQRTPSSGRYSKNTSPVP 894 >ref|XP_004505032.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like [Cicer arietinum] Length = 951 Score = 377 bits (969), Expect = e-101 Identities = 229/463 (49%), Positives = 282/463 (60%), Gaps = 8/463 (1%) Frame = +3 Query: 18 QFPQ-APLVKPLGHLGASEPSLQGSPGREEGEVPESELDPDTRRRLLILQHGQDTRDHTS 194 QFPQ A LVKP+G + SE SL SP REEGEVPESELDPDTRRRLLILQHGQD RDHTS Sbjct: 501 QFPQPATLVKPIGQVAPSELSLHSSPAREEGEVPESELDPDTRRRLLILQHGQDNRDHTS 560 Query: 195 RESSFPVRPAIQVPA--PPSQTRGSWFLLEEEMSPRQLNRAVPKALPKEP-PFELESIHF 365 E FP++ +QV A PP RG WF +EEE+ + NR +PK + + P +E Sbjct: 561 SEPPFPLKHPVQVSARVPP---RGGWFPVEEEIGSQPPNRVIPKEIALDSGPSRIE---- 613 Query: 366 DKHRHPRSTFFHGVENSVSSDKTLHE-NRRFFKEVHRGDDRMRPNHTVSKYASFPGDEMP 542 KHR + FF V+ S+SSD+ LHE N+R KE++ DDR R +H +S Y S GD+ P Sbjct: 614 -KHRLHQQPFFPKVDGSISSDRALHETNQRLPKEMYHRDDRSRVSHMLSSYPSLSGDDTP 672 Query: 543 LGSSSSSKRDFHFESERGTPTYR-ETPAGVLQDIAMKCGTKVEFRTTLLASRELQFSIEV 719 G SSSS RDF +SE G + ETPA VLQ+IA+KCGTKVEF ++L ASRELQFSIE Sbjct: 673 FGRSSSSHRDF--DSESGHSVFNAETPAIVLQEIALKCGTKVEFTSSLAASRELQFSIEA 730 Query: 720 WFSGEKISEGIGRTRKEAQHQAAELSLRNLANKYLSTVSADNSSGNEDCSKVXXXXXXXX 899 WFSG+KI G GRTR EAQ++AAE S+++LA+ YLS ++ S D S Sbjct: 731 WFSGKKIGHGFGRTRMEAQYKAAEDSIKHLADIYLSRAKDESGSAFGDVSGFPNANDNGY 790 Query: 900 XXXXXXXXXXXFMNXXXXXXXXXXXXXRFLGSRLEGSKKSLGSVSALKELCMAEGLALVF 1079 R L RL+ SK+S+GSVSALKELCM EGL + F Sbjct: 791 VGNVSSLGNQPLPKEESVSFSAASDPSRVLDPRLDVSKRSMGSVSALKELCMVEGLGVNF 850 Query: 1080 -QAPPQLSTSSIHKGEAFAQVEIGGQVLGNGIGSTWDEAKIQAAEEALGNLRSML-GQGT 1253 P +ST+S+ E AQVEI GQV G G G TWDEAK+QAAE+ALG+LR+ + GQG Sbjct: 851 LSLPAPVSTNSV--DEVHAQVEIDGQVYGKGTGITWDEAKMQAAEKALGSLRTTIHGQGI 908 
Query: 1254 QKXXXXXXXXXXXXXXXXXXDFPRVMQRVPSSTRYSSKASSVP 1382 Q+ + PR +QR SS RY A +P Sbjct: 909 QRRQLSPRPFQGLSNKRLKQEHPRTLQRFASSGRYPRNAPPIP 951 >ref|XP_003545893.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like isoform X1 [Glycine max] Length = 958 Score = 375 bits (963), Expect = e-101 Identities = 223/464 (48%), Positives = 277/464 (59%), Gaps = 5/464 (1%) Frame = +3 Query: 6 FNEKQFPQA-PLVKPLGHLGASEPSLQGSPGREEGEVPESELDPDTRRRLLILQHGQDTR 182 F QFPQ LVKP+ + PSL SP REEGEVPESELD DTRRRLLILQHGQDTR Sbjct: 503 FGNVQFPQPNTLVKPICQVTPPGPSLHSSPAREEGEVPESELDLDTRRRLLILQHGQDTR 562 Query: 183 DHTSRESSFPVRPAIQVPAPPSQTRGSWFLLEEEMSPRQLNRAVPKALPKEPPFELESIH 362 +HTS E PVR QV AP +R WF +EEEM P+QLN+ V PKE P E +H Sbjct: 563 EHTSSEPPLPVRHPTQVSAPSVPSRRGWFSVEEEMGPQQLNQLV----PKEFPVGSEPLH 618 Query: 363 FDKH--RHPRSTFFHGVENSVSSDKTLHE-NRRFFKEVHRGDDRMRPNHTVSKYASFPGD 533 +K RHP + F V++SVSSD+ HE ++R KEVH DD R + ++S Y SFPGD Sbjct: 619 IEKRWPRHP--SLFSKVDDSVSSDRVFHESHQRLPKEVHHRDDHSRLSQSLSSYHSFPGD 676 Query: 534 EMPLGSSSSSKRDFHFESERGTPTYRETPAGVLQDIAMKCGTKVEFRTTLLASRELQFSI 713 ++PL SS S RDF ES R + + AGVLQ+IA+KCGTKVEF ++L+AS LQFSI Sbjct: 677 DIPLSGSSYSNRDFDSESGRSL-FHADITAGVLQEIALKCGTKVEFLSSLVASTALQFSI 735 Query: 714 EVWFSGEKISEGIGRTRKEAQHQAAELSLRNLANKYLSTVSADNSSGNEDCSKVXXXXXX 893 E WF+G+K+ EG GRTR+EAQ++AAE S++ LA+ Y+S D+ S D S Sbjct: 736 EAWFAGKKVGEGFGRTRREAQNKAAECSIKQLADIYMSHAKDDSGSTYGDVSG-FHGSNN 794 Query: 894 XXXXXXXXXXXXXFMNXXXXXXXXXXXXXRFLGSRLEGSKKSLGSVSALKELCMAEGLAL 1073 + R RLE SK+S S+SALKE CM EGLA Sbjct: 795 NGFVSSGNSLGNQLLPKESVSFSTSSDSSRVSDPRLEVSKRSTDSISALKEFCMMEGLAA 854 Query: 1074 VFQ-APPQLSTSSIHKGEAFAQVEIGGQVLGNGIGSTWDEAKIQAAEEALGNLRSMLGQG 1250 FQ +P ST K E AQVEI GQ+ G G G TW+EAK+QAA++AL +LR+M QG Sbjct: 855 NFQSSPAPASTHFAQKDEVHAQVEIDGQIFGKGFGLTWEEAKMQAAKKALESLRTMFNQG 914 Query: 1251 TQKXXXXXXXXXXXXXXXXXXDFPRVMQRVPSSTRYSSKASSVP 1382 T+K ++PR +QR+P S RY A VP Sbjct: 915 TRKRHGSPRSMQGLANKRLKQEYPRTLQRIPYSARYPRNAPLVP 958 >ref|XP_006597421.1| PREDICTED: RNA polymerase II 
C-terminal domain phosphatase-like 1-like isoform X3 [Glycine max] Length = 932 Score = 373 bits (957), Expect = e-100 Identities = 224/466 (48%), Positives = 278/466 (59%), Gaps = 7/466 (1%) Frame = +3 Query: 6 FNEKQFPQA-PLVKPLGHLGASEPSLQGSPGREEGEVPESELDPDTRRRLLILQHGQDTR 182 F QFPQ LVKP+ + PSL SP REEGEVPESELD DTRRRLLILQHGQDTR Sbjct: 503 FGNVQFPQPNTLVKPICQVTPPGPSLHSSPAREEGEVPESELDLDTRRRLLILQHGQDTR 562 Query: 183 DHTSRESSFPVRPAIQVPAPPSQTRGSWFLLEEEMSPRQLNRAVPKALPKEPPFELESIH 362 +HTS E PVR QV AP +R WF +EEEM P+QLN+ V PKE P E +H Sbjct: 563 EHTSSEPPLPVRHPTQVSAPSVPSRRGWFSVEEEMGPQQLNQLV----PKEFPVGSEPLH 618 Query: 363 FDKH--RHPRSTFFHGVENSVSSDKTLHE-NRRFFKEVHRGDDRMRPNHTVSKYASFPGD 533 +K RHP + F V++SVSSD+ HE ++R KEVH DD R + ++S Y SFPGD Sbjct: 619 IEKRWPRHP--SLFSKVDDSVSSDRVFHESHQRLPKEVHHRDDHSRLSQSLSSYHSFPGD 676 Query: 534 EMPLGSSSSSKRDFHFESERGTPTYRETPAGVLQDIAMKCGTKVEFRTTLLASRELQFSI 713 ++PL SS S RDF ES R + + AGVLQ+IA+KCGTKVEF ++L+AS LQFSI Sbjct: 677 DIPLSGSSYSNRDFDSESGRSL-FHADITAGVLQEIALKCGTKVEFLSSLVASTALQFSI 735 Query: 714 EVWFSGEKISEGIGRTRKEAQHQAAELSLRNLANKYLSTVSADNSSGNEDCSKVXXXXXX 893 E WF+G+K+ EG GRTR+EAQ++AAE S++ LA+ Y+S D+ S D S Sbjct: 736 EAWFAGKKVGEGFGRTRREAQNKAAECSIKQLADIYMSHAKDDSGSTYGDVS-------- 787 Query: 894 XXXXXXXXXXXXXFMNXXXXXXXXXXXXXRFLGS--RLEGSKKSLGSVSALKELCMAEGL 1067 F+ S RLE SK+S S+SALKE CM EGL Sbjct: 788 ---------------------GFHGSNNNGFVSSDPRLEVSKRSTDSISALKEFCMMEGL 826 Query: 1068 ALVFQ-APPQLSTSSIHKGEAFAQVEIGGQVLGNGIGSTWDEAKIQAAEEALGNLRSMLG 1244 A FQ +P ST K E AQVEI GQ+ G G G TW+EAK+QAA++AL +LR+M Sbjct: 827 AANFQSSPAPASTHFAQKDEVHAQVEIDGQIFGKGFGLTWEEAKMQAAKKALESLRTMFN 886 Query: 1245 QGTQKXXXXXXXXXXXXXXXXXXDFPRVMQRVPSSTRYSSKASSVP 1382 QGT+K ++PR +QR+P S RY A VP Sbjct: 887 QGTRKRHGSPRSMQGLANKRLKQEYPRTLQRIPYSARYPRNAPLVP 932 >ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like [Solanum tuberosum] Length = 953 Score = 371 bits (952), Expect = 1e-99 Identities = 227/463 (49%), Positives = 268/463 (57%), Gaps = 4/463 (0%) Frame 
= +3 Query: 6 FNEKQFPQAPLV--KPLGHLGASEPSLQGSPGREEGEVPESELDPDTRRRLLILQHGQDT 179 F + PQ V + + + SLQ SP REEGEVPESELDPDTRRRLLILQHGQDT Sbjct: 501 FPTQHLPQVTSVLKSSVTQISPQDTSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDT 560 Query: 180 RDHTSRESSFPVRPAIQVPAPPSQTRGSWFLLEEEMSPRQLNRAVPKALPKEPPFELESI 359 RD S E FP+ +QV PP WF EEEMSPRQLNR +P PKE P ES+ Sbjct: 561 RDQVSSEPKFPMGTPLQVSVPPRVQPHGWFPAEEEMSPRQLNRPLP---PKEFPLNPESM 617 Query: 360 HFDKHRHPRSTFFHGVENSVSSDKTLHENRRFFKEVHRGDDRMRPNHTVSKYASFPGDEM 539 H +KHR P F +E S+ SD+ L EN+R KEV DDRMR + + + PG+E+ Sbjct: 618 HINKHRPPHPPFLPKMETSMPSDRVLFENQRLPKEVIPRDDRMRFSQSQPSFRP-PGEEV 676 Query: 540 PLGSSSSSKRDFHFESERGTPTYRETPAGVLQDIAMKCGTKVEFRTTLLASRELQFSIEV 719 PLG SSSS R E P Y ETPAG LQDIA KCG KVEFR++ L+S ELQFS+EV Sbjct: 677 PLGRSSSSNRVLDLEPGHYDP-YLETPAGALQDIAFKCGAKVEFRSSFLSSPELQFSLEV 735 Query: 720 WFSGEKISEGIGRTRKEAQHQAAELSLRNLANKYLSTVSADNSSGNEDCSKVXXXXXXXX 899 F+GEK+ EG GRTR+EAQ +AAE SL LA+KYLS + D+SS D Sbjct: 736 LFAGEKVGEGTGRTRREAQRRAAEESLMYLADKYLSCIKPDSSSTQGD-----GFRFPNA 790 Query: 900 XXXXXXXXXXXFMNXXXXXXXXXXXXXRFLGSRLEGSKKSLGSVSALKELCMAEGLALVF 1079 F R L RLE KKS+GSV AL+ELC EGL L F Sbjct: 791 SDNGFVDNMSPFGYQDRVSHSFASEPPRVLDPRLEVFKKSVGSVGALRELCAIEGLGLAF 850 Query: 1080 QAPPQLSTSSIHKGEAFAQVEIGGQVLGNGIGSTWDEAKIQAAEEALGNLRSMLGQGTQK 1259 Q PQLS + K E +AQVEI GQV G GIGSTWD+AK QAAE AL L+S L Q +QK Sbjct: 851 QTQPQLSANPGQKSEIYAQVEIDGQVFGKGIGSTWDDAKTQAAERALVALKSELAQFSQK 910 Query: 1260 -XXXXXXXXXXXXXXXXXXDFPR-VMQRVPSSTRYSSKASSVP 1382 ++ R V QRVP S R+ S++P Sbjct: 911 RQGSPRSLQQGFSNKRLKPEYSRGVQQRVPLSGRFPKNTSAMP 953 >ref|XP_003543063.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like isoform X1 [Glycine max] gi|571500215|ref|XP_006594604.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like isoform X2 [Glycine max] Length = 960 Score = 371 bits (952), Expect = 1e-99 Identities = 223/466 (47%), Positives = 274/466 (58%), Gaps = 7/466 (1%) Frame = +3 Query: 6 FNEKQFPQA-PLVKPLGHLGASEPSLQGSPGREEGEVPESELDPDTRRRLLILQHGQDTR 
182 F QFPQ LVKP+ + SL SP REEGE+PESELD DTRRR LILQHGQDTR Sbjct: 502 FGNVQFPQPNTLVKPMSQVTHPGLSLHSSPAREEGELPESELDLDTRRRFLILQHGQDTR 561 Query: 183 DHTSRESSFPVRPAIQVPAPPSQ--TRGSWFLLEEEMSPRQLNRAVPKALPKEPPFELES 356 + + E FPVR QV AP S +R WF +EEEM P+QLN VPK E P + E Sbjct: 562 ERMASEPPFPVRHPAQVSAPASSVPSRRGWFSVEEEMGPQQLNLPVPK----EFPVDSEP 617 Query: 357 IHFDKH--RHPRSTFFHGVENSVSSDKTLHEN-RRFFKEVHRGDDRMRPNHTVSKYASFP 527 H +K RHP +FF V +S+SSD+ HE+ +R KEVH DDR R + ++S Y S P Sbjct: 618 FHIEKRWPRHP--SFFSKVGDSISSDRVFHESHQRLPKEVHHRDDRSRLSQSLSSYHSLP 675 Query: 528 GDEMPLGSSSSSKRDFHFESERGTPTYRETPAGVLQDIAMKCGTKVEFRTTLLASRELQF 707 GD++PL SS S RDF ES R + +T AGVLQ+IA+ CGTKVEF ++L+AS ELQF Sbjct: 676 GDDIPLSGSSYSNRDFDSESGRSL-FHADTTAGVLQEIALNCGTKVEFLSSLVASTELQF 734 Query: 708 SIEVWFSGEKISEGIGRTRKEAQHQAAELSLRNLANKYLSTVSADNSSGNEDCSKVXXXX 887 SIE WF+G+KI EG GRTR+EAQ +AA S++ LA+ Y+S D+ S D S Sbjct: 735 SIEAWFAGKKIGEGFGRTRREAQSKAAGCSIKQLADIYMSHAKDDSGSTYGDVSGFHGSN 794 Query: 888 XXXXXXXXXXXXXXXFMNXXXXXXXXXXXXXRFLGSRLEGSKKSLGSVSALKELCMAEGL 1067 R SRLE SK+S S+SALKELCM EGL Sbjct: 795 NDGFVSSGNSLGNQLLPKEESGSFSTASESSRVSDSRLEVSKRSTDSISALKELCMMEGL 854 Query: 1068 ALVFQAPP-QLSTSSIHKGEAFAQVEIGGQVLGNGIGSTWDEAKIQAAEEALGNLRSMLG 1244 A FQ+PP ST K E AQVEI GQ+ G G G TW+EAK+QAA++ALG+LR+M Sbjct: 855 AASFQSPPASASTHLTQKDEVHAQVEIDGQIFGKGFGVTWEEAKMQAAKKALGSLRTMFN 914 Query: 1245 QGTQKXXXXXXXXXXXXXXXXXXDFPRVMQRVPSSTRYSSKASSVP 1382 QG+ K ++P +QRVP S RY A VP Sbjct: 915 QGSLKRHGSPRSMQGLANKRLKPEYPPTLQRVPYSARYPRNAPLVP 960 >ref|XP_007025682.1| C-terminal domain phosphatase-like 1 isoform 3 [Theobroma cacao] gi|508781048|gb|EOY28304.1| C-terminal domain phosphatase-like 1 isoform 3 [Theobroma cacao] Length = 870 Score = 365 bits (936), Expect = 7e-98 Identities = 202/351 (57%), Positives = 232/351 (66%), Gaps = 2/351 (0%) Frame = +3 Query: 3 SFNEKQFP-QAPLVKPLGHLGASEPSLQGSPGREEGEVPESELDPDTRRRLLILQHGQDT 179 SF+ QFP AP+VKP+ + EPSLQ SP REEGEVPESELDPDTRRRLLILQHGQDT Sbjct: 524 
SFSNMQFPLAAPVVKPVAPVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDT 583 Query: 180 RDHTSRESSFP-VRPAIQVPAPPSQTRGSWFLLEEEMSPRQLNRAVPKALPKEPPFELES 356 RDHT E +FP VRP +QV P Q+RGSWF EEEMSPRQLNRA PK E P + E Sbjct: 584 RDHTPPEPAFPPVRPTMQVSVPRGQSRGSWFAAEEEMSPRQLNRAAPK----EFPLDSER 639 Query: 357 IHFDKHRHPRSTFFHGVENSVSSDKTLHENRRFFKEVHRGDDRMRPNHTVSKYASFPGDE 536 +H +KHRHP FF VE+S+ SD+ L EN+R KE DDR+ NHT S Y SF G+E Sbjct: 640 MHIEKHRHP--PFFPKVESSIPSDRLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEE 697 Query: 537 MPLGSSSSSKRDFHFESERGTPTYRETPAGVLQDIAMKCGTKVEFRTTLLASRELQFSIE 716 MPL SSSS RD FES R T T ET AGVLQDIAMKCG KVEFR L+AS +LQFSIE Sbjct: 698 MPLSQSSSSHRDLDFESGR-TVTSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIE 756 Query: 717 VWFSGEKISEGIGRTRKEAQHQAAELSLRNLANKYLSTVSADNSSGNEDCSKVXXXXXXX 896 WF+GEK+ EG+GRTR+EAQ QAAE S++NLAN YLS + D+ S D S++ Sbjct: 757 AWFAGEKVGEGVGRTRREAQRQAAEESIKNLANTYLSRIKPDSGSAEGDLSRLHNINDNG 816 Query: 897 XXXXXXXXXXXXFMNXXXXXXXXXXXXXRFLGSRLEGSKKSLGSVSALKEL 1049 R RLEGSKKS+GSV+ALKEL Sbjct: 817 FPSNVNSFGNQLLAKEESLSFSTASEQSRLADPRLEGSKKSMGSVTALKEL 867