BLASTX nr result
ID: Akebia22_contig00005503
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia22_contig00005503 (4189 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform... 1248 0.0 ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Popu... 1237 0.0 ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal doma... 1232 0.0 ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citr... 1231 0.0 ref|XP_007214548.1| hypothetical protein PRUPE_ppa000988mg [Prun... 1220 0.0 ref|XP_007025681.1| C-terminal domain phosphatase-like 1 isoform... 1203 0.0 ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal doma... 1201 0.0 ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Popu... 1197 0.0 ref|XP_002519032.1| double-stranded RNA binding protein, putativ... 1189 0.0 ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal doma... 1184 0.0 emb|CAN72816.1| hypothetical protein VITISV_004100 [Vitis vinifera] 1181 0.0 ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal doma... 1172 0.0 ref|XP_007159305.1| hypothetical protein PHAVU_002G226900g [Phas... 1165 0.0 ref|XP_006583810.1| PREDICTED: RNA polymerase II C-terminal doma... 1155 0.0 ref|XP_004505032.1| PREDICTED: RNA polymerase II C-terminal doma... 1154 0.0 ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal doma... 1126 0.0 ref|XP_003545893.1| PREDICTED: RNA polymerase II C-terminal doma... 1124 0.0 ref|XP_006597421.1| PREDICTED: RNA polymerase II C-terminal doma... 1108 0.0 ref|XP_004232844.1| PREDICTED: RNA polymerase II C-terminal doma... 1107 0.0 ref|XP_003543063.1| PREDICTED: RNA polymerase II C-terminal doma... 1104 0.0 >ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform 1 [Theobroma cacao] gi|508781046|gb|EOY28302.1| C-terminal domain phosphatase-like 1 isoform 1 [Theobroma cacao] Length = 978 Score = 1248 bits (3230), Expect = 0.0 Identities = 666/986 (67%), Positives = 751/986 (76%), Gaps = 34/986 (3%) Frame = +3 Query: 414 MFKSVVYQGNSLLGEVEIYPQNQ--------------KICMID---KEIRISHFSQPSER 542 M+KSVVY+G +LGEVEIYPQ Q KI +++ KEIRI + +Q SER Sbjct: 4 MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63 Query: 543 CPPLAVLHTIAPSGVCFKMESKLQ-----SKDSP-LFSLHSSCLREDKTAVMPLGEEELH 704 CPPLAVLHTI SG+CFKMES S+DSP L LHS C+R++KTAVMP+G+ ELH Sbjct: 64 CPPLAVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCELH 123 Query: 705 LVAMSSRKNLEQYSCFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDR 884 LVAM SR + CFWGF VS GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDR Sbjct: 124 LVAMYSRNS--DRPCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSFEDR 181 Query: 885 IDALQRKINTEMDPQRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAF 1064 I+ALQRK+ TE+DPQRV+GM+AE+KRYQDDK+ILKQY ENDQVVENGKVIK+QSEVVPA Sbjct: 182 IEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVVPAL 241 Query: 1065 SDNYQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYV 1244 SDN+QPI+RPLIRLQEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYV Sbjct: 242 SDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 301 Query: 1245 CTMAERDYALEMWRLLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVI 1424 CTMAERDYALEMWRLLDPESNLINSKELL R+VCVK+GSRKSL NVF DGICHPKMALVI Sbjct: 302 CTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVI 361 Query: 1425 DDRLKVWDEKDQSRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGL 1604 DDRLKVWDEKDQ RVHVVPAFAPYYAPQAEANN +PVLCVARNVACNVRGGFF+EFDEGL Sbjct: 362 DDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGL 421 Query: 1605 LQLIREVFYEDGLANFPSAPDVSNFLISEDDVSA--SNKDPLGFEGVTDVE-ERRLKDVI 1775 LQ I E+ YED + + PS PDV N+L+SEDD SA NKDPL F+G+ D E ERRLK+ I Sbjct: 422 LQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERRLKEAI 481 Query: 1776 QASSMVDS----LDPRPAPSLQNAVASSLSTISQPTPQGPIMSSQNNQF---TQVVKPLV 1934 A+S V S LDPR PSLQ + SS S+I Q I+S N QF VVKP+ Sbjct: 482 SATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPVVKPVA 541 Query: 1935 HVGPSEPSLQGSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXX 2114 V EPSLQ SPAREEGEVPESELDPDTRRRLLILQHGQDTRDHT EP F Sbjct: 542 PVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPVRPTMQ 601 Query: 2115 XXXXXXXMQSRGSWFPLEEDMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHG 2294 QSRGSWF EE+M PR+ NRA PKE PL+SE MH +K RHP FF Sbjct: 602 VSVPRG--QSRGSWFAAEEEMSPRQLNRA----APKEFPLDSERMHIEKH--RHPPFFPK 653 Query: 2295 AESSFPSDRMLNENQRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFE 2474 ESS PSDR+L ENQR KE DDRL +NH+ S+Y SF GEE PL QS SS RD FE Sbjct: 654 VESSIPSDRLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFE 713 Query: 2475 SGRGSPPYTETPAGVLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTR 2654 SGR + ET AGVLQ+I +KCG KVEFR AL+AS +LQFSIE WF GEK+G G G+TR Sbjct: 714 SGR-TVTSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFAGEKVGEGVGRTR 772 Query: 2655 KEAQRQAADSSLRNLANKYLSNVMHDPSSVRGDLNKPSHVNEMGL-GDSNSFGYQTLPKE 2831 +EAQRQAA+ S++NLAN YLS + D S GDL++ ++N+ G + NSFG Q L KE Sbjct: 773 REAQRQAAEESIKNLANTYLSRIKPDSGSAEGDLSRLHNINDNGFPSNVNSFGNQLLAKE 832 Query: 2832 DXXXXXXXXEPTRFLDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVE 3011 + E +R DPRLE SKKSMGSV+ALKELCMMEGL +VF QPP S+N + K E Sbjct: 833 ESLSFSTASEQSRLADPRLEGSKKSMGSVTALKELCMMEGLGVVFQPQPPSSSNALQKDE 892 Query: 3012 VYAQVEIGGQILGKGVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKR 3191 VYAQVEI GQ+LGKG G TW EAKMQAAE+AL +L+SMLG Q R GSPR L +KR Sbjct: 893 VYAQVEIDGQVLGKGTGLTWEEAKMQAAEKALGSLRSMLGQYSQKRQGSPRSLQGMQNKR 952 Query: 3192 LKPEFPRMLQRINSSTRYSNNAPPVP 3269 LKPEFPR+LQR+ SS RY NAPPVP Sbjct: 953 LKPEFPRVLQRMPSSGRYPKNAPPVP 978 >ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Populus trichocarpa] gi|550340277|gb|EEE85528.2| hypothetical protein POPTR_0004s04010g [Populus trichocarpa] Length = 996 Score = 1237 bits (3201), Expect = 0.0 Identities = 666/1008 (66%), Positives = 757/1008 (75%), Gaps = 56/1008 (5%) Frame = +3 Query: 414 MFKSVVYQGNSLLGEVEIYPQNQK-------------ICMIDKEIRISHFSQPSERCPPL 554 M+KSVVY+G+ LLGEVEIY Q Q+ I I KEIRISHFSQ SERCPPL Sbjct: 1 MYKSVVYKGDELLGEVEIYAQEQQQEEEENKNKKKRVIDEIVKEIRISHFSQTSERCPPL 60 Query: 555 AVLHTIAPSGVCFKME-------SKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVA 713 AVLHTI GVCFKME +K+ ++SPL LHSSC++E+KTAVM LG EELHLVA Sbjct: 61 AVLHTITSIGVCFKMEESTSSSTTKISQQESPLHLLHSSCIQENKTAVMHLGGEELHLVA 120 Query: 714 MSSRKNLEQYSCFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDA 893 M SR N Q+ CFWGF V+PGLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDA Sbjct: 121 MPSRSNERQHPCFWGFSVAPGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDA 180 Query: 894 LQRKINTEMDPQRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDN 1073 LQRKI+TE+DPQR+ GML+E+KRY DDK+ILKQYVENDQVVENGKVIK QSEVVPA SDN Sbjct: 181 LQRKISTEVDPQRILGMLSEVKRYHDDKNILKQYVENDQVVENGKVIKTQSEVVPALSDN 240 Query: 1074 YQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTM 1253 +QP+VRPLIRLQEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTM Sbjct: 241 HQPMVRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTM 300 Query: 1254 AERDYALEMWRLLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDR 1433 AERDYALEMWRLLDPESNLINSKELL R+VCVK+G RKSL NVF DGICHPKMALVIDDR Sbjct: 301 AERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLRKSLFNVFQDGICHPKMALVIDDR 360 Query: 1434 LKVWDEKDQSRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQL 1613 LKVWDE+DQSRVHVVPAFAPYYAPQAE NN VPVLCVARNVACNVRGGFFKEFDEGLLQ Sbjct: 361 LKVWDERDQSRVHVVPAFAPYYAPQAEVNNAVPVLCVARNVACNVRGGFFKEFDEGLLQK 420 Query: 1614 IREVFYEDGLANFPSAPDVSNFLISEDDVSA--SNKDPLGFEGVTDVE-ERRLKDVIQAS 1784 I EV YED N PS PDVSN+L+SEDD SA N+D L F+G+ D E ER+LK+ + AS Sbjct: 421 IPEVAYEDDTDNIPSPPDVSNYLVSEDDASAVNGNRDQLSFDGMADAEVERQLKEAVSAS 480 Query: 1785 --------SMVDSLDPRPAPSLQNAVASSLSTISQPTPQGPIMSSQ-------------- 1898 S V SLDPR SLQ +ASS S S PT Q +++SQ Sbjct: 481 SAILSTIPSTVSSLDPRLLQSLQYTIASSSS--SMPTSQPSMLASQQPMPALQPPKPPSQ 538 Query: 1899 -------NNQFTQV---VKPLVHVGPSEPSLQGSPAREEGEVPESELDPDTRRRLLILQH 2048 N QF QV VK L V P EPSLQ SPAREEGEVPESELDPDTRRRLLILQH Sbjct: 539 LSMTPFPNTQFPQVAPSVKQLGQVVPPEPSLQSSPAREEGEVPESELDPDTRRRLLILQH 598 Query: 2049 GQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEEDMIPRKPNRAVPKSVPKES 2228 G D+RD+ SE PF +QS GSW P+EE+M PR+ NR P+E Sbjct: 599 GHDSRDNAPSESPF---PARPSTQVSAPRVQSVGSWVPVEEEMSPRQLNR-----TPREF 650 Query: 2229 PLESETMHFDKRRARHPSFFHGAESSFPSDRMLNENQRFPKEVHQGDDRLRVNHSTSNYR 2408 PL+S+ M+ +K R HPSFFH ES+ PSDRM++ENQR PKE DDR+++NHSTSNY Sbjct: 651 PLDSDPMNIEKHRTHHPSFFHKVESNIPSDRMIHENQRQPKEATYRDDRMKLNHSTSNYP 710 Query: 2409 SFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIGIKCGTKVEFRSALIASAE 2588 SF GEE+PL +S SS+RD ES R TETP VLQEI +KCGTKVEFR ALIA+++ Sbjct: 711 SFQGEESPLSRS-SSNRDLDLESERAFSS-TETPVEVLQEIAMKCGTKVEFRPALIATSD 768 Query: 2589 LQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYLSNVMHDPSSVRGDLNKPS 2768 LQFSIE WF GEK+G GTGKTR+EAQRQAA+ S++ LA Y+S V D + GD ++ Sbjct: 769 LQFSIETWFVGEKVGEGTGKTRREAQRQAAEGSIKKLAGIYMSRVKPDSGPMLGDSSRYP 828 Query: 2769 HVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPRLEESKKSMGSVSALKELCMM 2945 N+ G LGD NSFG Q L K++ EP+R LD RLE SKKSMGSV+ALKE CM Sbjct: 829 SANDNGFLGDMNSFGNQPLLKDENITYSATSEPSRLLDQRLEGSKKSMGSVTALKEFCMT 888 Query: 2946 EGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGKGVGATWNEAKMQAAEEALWNLKSM 3125 EGL + FLAQ PLSTN + EV+AQVEI GQ+LGKG+G TW+EAKMQAAE+AL +L++M Sbjct: 889 EGLGVNFLAQTPLSTNSIPGEEVHAQVEIDGQVLGKGIGLTWDEAKMQAAEKALGSLRTM 948 Query: 3126 LGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTRYSNNAPPVP 3269 G R GSPRL+ +KRLK EFPR+LQR+ SS RY NA PVP Sbjct: 949 FGQYTPKRQGSPRLMQGMPNKRLKQEFPRVLQRMPSSARYHKNASPVP 996 >ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like [Citrus sinensis] Length = 957 Score = 1232 bits (3188), Expect = 0.0 Identities = 656/972 (67%), Positives = 748/972 (76%), Gaps = 20/972 (2%) Frame = +3 Query: 414 MFKSVVYQGNSLLGEVEIYPQNQ--------KICMIDKEIRISHFSQPSERCPPLAVLHT 569 M+K+V Y G +LGEVEIYPQ Q K + EIRIS+FS+ SERCPPLAVLHT Sbjct: 1 MYKTVAYLGKEILGEVEIYPQQQGEGGEGEEKNKKVFDEIRISYFSEASERCPPLAVLHT 60 Query: 570 IAPSGVCFKMESKLQSKDSPLFSLHSSCLREDKTAVMPLG-EEELHLVAMSSRKNLEQYS 746 I SG+CFKMESK S + L LHSSC+RE+KTAVMPLG EELHLVAM SR N +QY Sbjct: 61 ITASGICFKMESK-SSDNIQLHLLHSSCIRENKTAVMPLGLTEELHLVAMYSRNNEKQYP 119 Query: 747 CFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDP 926 CFW F V GLYNSCL MLNLRCLGIVFDLDETLIVANTMRSFEDRI+AL RKI+TE+DP Sbjct: 120 CFWAFSVGSGLYNSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKISTEVDP 179 Query: 927 QRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRL 1106 QR++GM AE+KRYQDDK+ILKQY ENDQV ENGKVIKVQSEVVPA SD++Q +VRPLIRL Sbjct: 180 QRIAGMQAEVKRYQDDKNILKQYAENDQVNENGKVIKVQSEVVPALSDSHQALVRPLIRL 239 Query: 1107 QEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWR 1286 QEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWR Sbjct: 240 QEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWR 299 Query: 1287 LLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSR 1466 LLDPESNLIN+KELL R+VCVK+GSRKSL NVF DG CHPKMALVIDDRLKVWD+KDQ R Sbjct: 300 LLDPESNLINTKELLDRIVCVKSGSRKSLFNVFQDGTCHPKMALVIDDRLKVWDDKDQPR 359 Query: 1467 VHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLA 1646 VHVVPAFAPYYAPQAEANN +PVLCVARN+ACNVRGGFFKEFDEGLLQ I E+ YED + Sbjct: 360 VHVVPAFAPYYAPQAEANNAIPVLCVARNIACNVRGGFFKEFDEGLLQRIPEISYEDDVK 419 Query: 1647 NFPSAPDVSNFLISEDDVSASN--KDPLGFEGVTDVE-ERRLKDVIQASSMVDS----LD 1805 + PS PDVSN+L+SEDD + +N KDPL F+G+ D E ERRLK+ I AS+ + S LD Sbjct: 420 DIPSPPDVSNYLVSEDDAATANGIKDPLSFDGMADAEVERRLKEAIAASATISSAVANLD 479 Query: 1806 PRPAPSLQNAVASSLSTISQPTPQGPIMSSQNNQF---TQVVKPLVHVGPSEPSLQGSPA 1976 PR AP Q + SS ST + PT Q +M N QF T +VKPL HVGP E SLQ SPA Sbjct: 480 PRLAP-FQYTMPSSSSTTTLPTSQAAVMPLANMQFPPATSLVKPLGHVGPPEQSLQSSPA 538 Query: 1977 REEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSW 2156 REEGEVPESELDPDTRRRLLILQHG DTR++ SE PF + SRGSW Sbjct: 539 REEGEVPESELDPDTRRRLLILQHGMDTRENAPSEAPF---PARTQMQVSVPRVPSRGSW 595 Query: 2157 FPLEEDMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNEN 2336 FP+EE+M PR+ NRA VPKE PL SE M +K R HPSFF E+ SDR +EN Sbjct: 596 FPVEEEMSPRQLNRA----VPKEFPLNSEAMQIEKHRPPHPSFFPKIENPSTSDRP-HEN 650 Query: 2337 QRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAG 2516 QR PKE + DDRLR+NH+ S+Y+SF GEE PL +S SS RD FESGR TETP+G Sbjct: 651 QRMPKEALRRDDRLRLNHTLSDYQSFSGEEIPLSRSSSSSRDVDFESGR-DVSSTETPSG 709 Query: 2517 VLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRN 2696 VLQ+I +KCGTKVEFR AL+AS ELQFSIE WF GEKIG G G+TR+EAQRQAA+ S+++ Sbjct: 710 VLQDIAMKCGTKVEFRPALVASTELQFSIEAWFAGEKIGEGIGRTRREAQRQAAEGSIKH 769 Query: 2697 LANKYLSNVMHDPSSVRGDLNKPSHVNE-MGLGDSNSFGYQTLPKEDXXXXXXXXEPTRF 2873 LAN Y+ V D S GD ++ S+ NE +G+ NSFG Q L K++ EP++ Sbjct: 770 LANVYMLRVKSDSGSGHGDGSRFSNANENCFMGEINSFGGQPLAKDE----SLSSEPSKL 825 Query: 2874 LDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGK 3053 +DPRLE SKK MGSVSALKELCM EGL +VF QPP S N + K EVYAQVEI GQ+LGK Sbjct: 826 VDPRLEGSKKLMGSVSALKELCMTEGLGVVFQQQPPSSANSVQKDEVYAQVEIDGQVLGK 885 Query: 3054 GVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINS 3233 G+G+TW+EAKMQAAE+AL +L+SM G Q GSPR L +KRLKPEFPR+LQR+ Sbjct: 886 GIGSTWDEAKMQAAEKALGSLRSMFGQFPQKHQGSPRSLQGMPNKRLKPEFPRVLQRMPP 945 Query: 3234 STRYSNNAPPVP 3269 S RY NAPPVP Sbjct: 946 SGRYPKNAPPVP 957 >ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citrus clementina] gi|557551913|gb|ESR62542.1| hypothetical protein CICLE_v10014168mg [Citrus clementina] Length = 957 Score = 1231 bits (3186), Expect = 0.0 Identities = 657/972 (67%), Positives = 747/972 (76%), Gaps = 20/972 (2%) Frame = +3 Query: 414 MFKSVVYQGNSLLGEVEIYPQNQ--------KICMIDKEIRISHFSQPSERCPPLAVLHT 569 M+K+V Y G +LGEVEIYPQ Q K + EIRIS+FS+ SERCPPLAVLHT Sbjct: 1 MYKTVAYLGKEILGEVEIYPQQQGEGGEGEEKNKKVFDEIRISYFSEASERCPPLAVLHT 60 Query: 570 IAPSGVCFKMESKLQSKDSPLFSLHSSCLREDKTAVMPLG-EEELHLVAMSSRKNLEQYS 746 I SG+CFKMESK S + L LHSSC+RE+KTAVM LG EELHLVAM SR N +QY Sbjct: 61 ITASGICFKMESK-SSDNVQLHLLHSSCIRENKTAVMLLGLTEELHLVAMYSRNNEKQYP 119 Query: 747 CFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDP 926 CFW F V GLYNSCL MLNLRCLGIVFDLDETLIVANTMRSFEDRI+AL RKI+TE+DP Sbjct: 120 CFWAFSVGSGLYNSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKISTEVDP 179 Query: 927 QRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRL 1106 QR++GM AE+KRYQDDK+ILKQY ENDQV ENGKVIKVQSEVVPA SD++Q +VRPLIRL Sbjct: 180 QRIAGMQAEVKRYQDDKNILKQYAENDQVNENGKVIKVQSEVVPALSDSHQALVRPLIRL 239 Query: 1107 QEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWR 1286 QEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWR Sbjct: 240 QEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWR 299 Query: 1287 LLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSR 1466 LLDPESNLIN+KELL R+VCVK+GSRKSL NVF DG CHPKMALVIDDRLKVWDEKDQSR Sbjct: 300 LLDPESNLINTKELLDRIVCVKSGSRKSLFNVFQDGTCHPKMALVIDDRLKVWDEKDQSR 359 Query: 1467 VHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLA 1646 VHVVPAFAPYYAPQAEANN +PVLCVARN+ACNVRGGFFKEFDEGLLQ I E+ YED + Sbjct: 360 VHVVPAFAPYYAPQAEANNAIPVLCVARNIACNVRGGFFKEFDEGLLQRIPEISYEDDVK 419 Query: 1647 NFPSAPDVSNFLISEDDVSASN--KDPLGFEGVTDVE-ERRLKDVIQASSMVDS----LD 1805 PS PDVSN+L+SEDD + +N KDPL F+G+ D E ERRLK+ I AS+ + S LD Sbjct: 420 EIPSPPDVSNYLVSEDDAATANGIKDPLSFDGMADAEVERRLKEAIAASATISSAVANLD 479 Query: 1806 PRPAPSLQNAVASSLSTISQPTPQGPIMSSQNNQF---TQVVKPLVHVGPSEPSLQGSPA 1976 PR AP Q + SS ST + PT Q +M N QF T +VKPL HVGP E LQ SPA Sbjct: 480 PRLAP-FQYTMPSSSSTTTLPTSQAAVMPLANMQFPPATSLVKPLGHVGPPEQCLQSSPA 538 Query: 1977 REEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSW 2156 REEGEVPESELDPDTRRRLLILQHG DTR++ SE PF + SRGSW Sbjct: 539 REEGEVPESELDPDTRRRLLILQHGMDTRENAPSEAPF---PARTQMQVSVPRVPSRGSW 595 Query: 2157 FPLEEDMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNEN 2336 FP+EE+M PR+ NRA VPKE PL SE M +K R HPSFF E+S SDR +EN Sbjct: 596 FPVEEEMSPRQLNRA----VPKEFPLNSEAMQIEKHRPPHPSFFPKIENSITSDRP-HEN 650 Query: 2337 QRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAG 2516 QR PKE + DDRLR+NH+ S+Y+SF GEE PL +S SS RD FESGR TETP+G Sbjct: 651 QRMPKEALRRDDRLRLNHTLSDYQSFSGEEIPLSRSSSSSRDVDFESGR-DVSSTETPSG 709 Query: 2517 VLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRN 2696 VLQ+I +KCGTKVEFR AL+AS ELQFSIE WF GEKIG G G+TR+EAQRQAA+ S+++ Sbjct: 710 VLQDIAMKCGTKVEFRPALVASTELQFSIEAWFAGEKIGEGIGRTRREAQRQAAEGSIKH 769 Query: 2697 LANKYLSNVMHDPSSVRGDLNKPSHVNE-MGLGDSNSFGYQTLPKEDXXXXXXXXEPTRF 2873 LAN Y+ V D S GD ++ S+ NE +G+ NSFG Q L K++ EP++ Sbjct: 770 LANVYVLRVKSDSGSGHGDGSRFSNANENCFMGEINSFGGQPLAKDE----SLSSEPSKL 825 Query: 2874 LDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGK 3053 +DPRLE SKK MGSVSALKELCM EGL +VF QPP S N + K EVYAQVEI GQ+LGK Sbjct: 826 VDPRLEGSKKLMGSVSALKELCMTEGLGVVFQQQPPSSANSVQKDEVYAQVEIDGQVLGK 885 Query: 3054 GVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINS 3233 G+G+TW+EAKMQAAE+AL +L+SM G Q GSPR L +KRLKPEFPR+LQR+ Sbjct: 886 GIGSTWDEAKMQAAEKALGSLRSMFGQFPQKHQGSPRSLQGMPNKRLKPEFPRVLQRMPP 945 Query: 3234 STRYSNNAPPVP 3269 S RY NAPPVP Sbjct: 946 SGRYPKNAPPVP 957 >ref|XP_007214548.1| hypothetical protein PRUPE_ppa000988mg [Prunus persica] gi|462410413|gb|EMJ15747.1| hypothetical protein PRUPE_ppa000988mg [Prunus persica] Length = 940 Score = 1220 bits (3156), Expect = 0.0 Identities = 654/971 (67%), Positives = 744/971 (76%), Gaps = 19/971 (1%) Frame = +3 Query: 414 MFKSVVYQGNSLLGEVEIYPQ-----NQKICMID--KEIRISHFSQPSERCPPLAVLHTI 572 M+KSVVY+G LLGEVEIYP+ N+ ++D KEIRIS+FSQ SERCPP+AVLHTI Sbjct: 1 MYKSVVYKGEELLGEVEIYPEENENKNKNKNLVDELKEIRISYFSQSSERCPPVAVLHTI 60 Query: 573 APSGVCFKMESKL-QSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSC 749 + GVCFKMESK QS+D+PLF LHSSC+ E+KTAVMPLG EELHLVAM SR ++Y C Sbjct: 61 SSHGVCFKMESKTSQSQDTPLFLLHSSCVMENKTAVMPLGGEELHLVAMRSRNGDKRYPC 120 Query: 750 FWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQ 929 FWGF V+PGLYNSCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRI+ALQRKI++E+DPQ Sbjct: 121 FWGFSVAPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKISSEVDPQ 180 Query: 930 RVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQ 1109 R+SGMLAEIKRYQDDK ILKQY ENDQVVENG+VIK QSE VPA SDN+QPI+RPLIRL Sbjct: 181 RISGMLAEIKRYQDDKFILKQYAENDQVVENGRVIKTQSEAVPALSDNHQPIIRPLIRLH 240 Query: 1110 EKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRL 1289 EKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRL Sbjct: 241 EKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRL 300 Query: 1290 LDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRV 1469 LDP+SNLINS +LL R+VCVK+GSRKSL NVF + +CHPKMALVIDDRLKVWD++DQ RV Sbjct: 301 LDPDSNLINSNKLLDRIVCVKSGSRKSLFNVFQESLCHPKMALVIDDRLKVWDDRDQPRV 360 Query: 1470 HVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLAN 1649 HVVPAFAPYYAPQAEANN VPVLCVARNVACNVRGGFF+EFD+ LLQ I EVFYED + + Sbjct: 361 HVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFREFDDSLLQKIPEVFYEDDIKD 420 Query: 1650 FPSAPDVSNFLISEDDVSA--SNKDPLGFEGVTDVE-ERRLKDVIQASSMVD----SLDP 1808 PS PDVSN+L+SEDD SA N+DPL F+G+TDVE ERR+K+ A+SMV S+DP Sbjct: 421 VPS-PDVSNYLVSEDDSSALNGNRDPLPFDGITDVEVERRMKEATPAASMVSSVFTSIDP 479 Query: 1809 RPAPSLQNAVASSLSTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAR 1979 R AP LQ V S ST+S PT Q +MS + QF Q +VKPL HVG +EPSLQ SPAR Sbjct: 480 RLAP-LQYTVPPS-STLSLPTTQPSVMSFPSIQFPQAASLVKPLGHVGSAEPSLQSSPAR 537 Query: 1980 EEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWF 2159 EEGEVPESELDPDTRRRLLILQHGQDTRD SEPPF QSR WF Sbjct: 538 EEGEVPESELDPDTRRRLLILQHGQDTRDQPPSEPPF---PVRPPMQASVPRAQSRPGWF 594 Query: 2160 PLEEDMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNENQ 2339 P+EE+M PR+ +R VPK + PL+ ET+ +K R H SFF E+S PSDR+L ENQ Sbjct: 595 PVEEEMSPRQLSRMVPKDL----PLDPETVQIEKHRPHHSSFFPKVENSIPSDRILQENQ 650 Query: 2340 RFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGV 2519 R PKE DDRLR NH+ S Y S GEE PL +S SS+RD FESGR + ETPAGV Sbjct: 651 RLPKEAFHRDDRLRFNHALSGYHSLSGEEIPLSRSSSSNRDVDFESGR-AISNAETPAGV 709 Query: 2520 LQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNL 2699 LQEI +KCG K WF GEKIG G+GKTR+EA QAA+ SL+NL Sbjct: 710 LQEIAMKCGAK------------------AWFAGEKIGEGSGKTRREAHYQAAEGSLKNL 751 Query: 2700 ANKYLSNVMHDPSSVRGDLNKPSHVNEMGL-GDSNSFGYQTLPKEDXXXXXXXXEPTRFL 2876 AN YLS V D SV GD+NK +VN G G+ NSFG Q PKE+ EP+R L Sbjct: 752 ANIYLSRVKPDSVSVHGDMNKFPNVNSNGFAGNLNSFGIQPFPKEESLSSSTSSEPSRPL 811 Query: 2877 DPRLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGKG 3056 DPRLE SKKSM SVS LKELCMMEGL +VF +PP STN + K EV+ QVEI G++LGKG Sbjct: 812 DPRLEGSKKSMSSVSTLKELCMMEGLGVVFQPRPPPSTNSVEKDEVHVQVEIDGEVLGKG 871 Query: 3057 VGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSS 3236 +G TW+EAKMQAAE+AL +L S L Q R GSPR L SSKR+K EFP++LQR+ SS Sbjct: 872 IGLTWDEAKMQAAEKALGSLTSTL--YAQKRQGSPRSLQGMSSKRMKQEFPQVLQRMPSS 929 Query: 3237 TRYSNNAPPVP 3269 RY NAPPVP Sbjct: 930 ARYPKNAPPVP 940 >ref|XP_007025681.1| C-terminal domain phosphatase-like 1 isoform 2 [Theobroma cacao] gi|508781047|gb|EOY28303.1| C-terminal domain phosphatase-like 1 isoform 2 [Theobroma cacao] Length = 984 Score = 1203 bits (3112), Expect = 0.0 Identities = 643/951 (67%), Positives = 725/951 (76%), Gaps = 34/951 (3%) Frame = +3 Query: 414 MFKSVVYQGNSLLGEVEIYPQNQ--------------KICMID---KEIRISHFSQPSER 542 M+KSVVY+G +LGEVEIYPQ Q KI +++ KEIRI + +Q SER Sbjct: 4 MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63 Query: 543 CPPLAVLHTIAPSGVCFKMESKLQ-----SKDSP-LFSLHSSCLREDKTAVMPLGEEELH 704 CPPLAVLHTI SG+CFKMES S+DSP L LHS C+R++KTAVMP+G+ ELH Sbjct: 64 CPPLAVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCELH 123 Query: 705 LVAMSSRKNLEQYSCFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDR 884 LVAM SR + CFWGF VS GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDR Sbjct: 124 LVAMYSRNS--DRPCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSFEDR 181 Query: 885 IDALQRKINTEMDPQRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAF 1064 I+ALQRK+ TE+DPQRV+GM+AE+KRYQDDK+ILKQY ENDQVVENGKVIK+QSEVVPA Sbjct: 182 IEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVVPAL 241 Query: 1065 SDNYQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYV 1244 SDN+QPI+RPLIRLQEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYV Sbjct: 242 SDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 301 Query: 1245 CTMAERDYALEMWRLLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVI 1424 CTMAERDYALEMWRLLDPESNLINSKELL R+VCVK+GSRKSL NVF DGICHPKMALVI Sbjct: 302 CTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVI 361 Query: 1425 DDRLKVWDEKDQSRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGL 1604 DDRLKVWDEKDQ RVHVVPAFAPYYAPQAEANN +PVLCVARNVACNVRGGFF+EFDEGL Sbjct: 362 DDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGL 421 Query: 1605 LQLIREVFYEDGLANFPSAPDVSNFLISEDDVSA--SNKDPLGFEGVTDVE-ERRLKDVI 1775 LQ I E+ YED + + PS PDV N+L+SEDD SA NKDPL F+G+ D E ERRLK+ I Sbjct: 422 LQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERRLKEAI 481 Query: 1776 QASSMVDS----LDPRPAPSLQNAVASSLSTISQPTPQGPIMSSQNNQF---TQVVKPLV 1934 A+S V S LDPR PSLQ + SS S+I Q I+S N QF VVKP+ Sbjct: 482 SATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPVVKPVA 541 Query: 1935 HVGPSEPSLQGSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXX 2114 V EPSLQ SPAREEGEVPESELDPDTRRRLLILQHGQDTRDHT EP F Sbjct: 542 PVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPVRPTMQ 601 Query: 2115 XXXXXXXMQSRGSWFPLEEDMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHG 2294 QSRGSWF EE+M PR+ NRA PKE PL+SE MH +K RHP FF Sbjct: 602 VSVPRG--QSRGSWFAAEEEMSPRQLNRA----APKEFPLDSERMHIEKH--RHPPFFPK 653 Query: 2295 AESSFPSDRMLNENQRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFE 2474 ESS PSDR+L ENQR KE DDRL +NH+ S+Y SF GEE PL QS SS RD FE Sbjct: 654 VESSIPSDRLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFE 713 Query: 2475 SGRGSPPYTETPAGVLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTR 2654 SGR + ET AGVLQ+I +KCG KVEFR AL+AS +LQFSIE WF GEK+G G G+TR Sbjct: 714 SGR-TVTSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFAGEKVGEGVGRTR 772 Query: 2655 KEAQRQAADSSLRNLANKYLSNVMHDPSSVRGDLNKPSHVNEMGL-GDSNSFGYQTLPKE 2831 +EAQRQAA+ S++NLAN YLS + D S GDL++ ++N+ G + NSFG Q L KE Sbjct: 773 REAQRQAAEESIKNLANTYLSRIKPDSGSAEGDLSRLHNINDNGFPSNVNSFGNQLLAKE 832 Query: 2832 DXXXXXXXXEPTRFLDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVE 3011 + E +R DPRLE SKKSMGSV+ALKELCMMEGL +VF QPP S+N + K E Sbjct: 833 ESLSFSTASEQSRLADPRLEGSKKSMGSVTALKELCMMEGLGVVFQPQPPSSSNALQKDE 892 Query: 3012 VYAQVEIGGQILGKGVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPR 3164 VYAQVEI GQ+LGKG G TW EAKMQAAE+AL +L+SMLG Q R GSPR Sbjct: 893 VYAQVEIDGQVLGKGTGLTWEEAKMQAAEKALGSLRSMLGQYSQKRQGSPR 943 >ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like [Fragaria vesca subsp. vesca] Length = 955 Score = 1201 bits (3106), Expect = 0.0 Identities = 645/966 (66%), Positives = 740/966 (76%), Gaps = 18/966 (1%) Frame = +3 Query: 426 VVYQGNSLLGEVEIYPQ---NQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGVCFK 596 +VY+G LLGEVE+YP+ N+KI KEIRISHFSQ SERCPP+AVLHTI+ +GVCFK Sbjct: 4 LVYKGEELLGEVEVYPEELNNKKIWDELKEIRISHFSQSSERCPPVAVLHTISSNGVCFK 63 Query: 597 MESKLQSKDSP----LFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFI 764 MESK S S LF LHSSC+ E+KTAVM LG EELHLVAM SR N +Q+ CFWGF Sbjct: 64 MESKSSSSSSQDTSRLFLLHSSCIMENKTAVMNLGVEELHLVAMYSRNNQKQHPCFWGFS 123 Query: 765 VSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGM 944 VS GLY+SCL MLNLRCLGIVFDLDETLIVANTMRSFEDRI+ LQRKI E+D QR+SGM Sbjct: 124 VSSGLYSSCLGMLNLRCLGIVFDLDETLIVANTMRSFEDRIEGLQRKIQCEVDAQRISGM 183 Query: 945 LAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNII 1124 AEIKRYQDDK ILKQY ENDQVVENG+VIK QSEVVPA SD++QPI+RPLIRLQEKNII Sbjct: 184 QAEIKRYQDDKFILKQYAENDQVVENGRVIKTQSEVVPALSDSHQPIIRPLIRLQEKNII 243 Query: 1125 LTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPES 1304 LTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDPES Sbjct: 244 LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPES 303 Query: 1305 NLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPA 1484 NLIN+ +LL R+VCVK+G +KSL NVF + +CHPKMALVIDDRLKVWD++DQ RVHVVPA Sbjct: 304 NLINANKLLDRIVCVKSGLKKSLFNVFQESLCHPKMALVIDDRLKVWDDRDQPRVHVVPA 363 Query: 1485 FAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAP 1664 FAPYYAPQAEANN VPVLCVARNVAC+VRGGFF+EFD+ LLQ I E+FYED + +F S+P Sbjct: 364 FAPYYAPQAEANNAVPVLCVARNVACSVRGGFFREFDDSLLQKIPEIFYEDNIKDF-SSP 422 Query: 1665 DVSNFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQA----SSMVDSLDPRPAPS 1823 DVSNFL+SEDD SAS N+D L F+G+ D E ERRLK+ A SS V + DPR A S Sbjct: 423 DVSNFLVSEDDASASNGNRDQLPFDGMADAEVERRLKEATSAAPTVSSAVSNNDPRLA-S 481 Query: 1824 LQNAVASSLSTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEGEV 1994 LQ V S ST+S PT Q +M N QF Q +VKPL HVGP++ L SPAREEGEV Sbjct: 482 LQYTVPLS-STVSLPTNQPSMMPFHNVQFPQSASLVKPLGHVGPADLGLHSSPAREEGEV 540 Query: 1995 PESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEED 2174 PESELDPDTRRRLLILQHGQDTR+ SEP F +QSRG WFP+EE+ Sbjct: 541 PESELDPDTRRRLLILQHGQDTRESVPSEPSF---PVRPQVQVSVPRVQSRGGWFPVEEE 597 Query: 2175 MIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNENQRFPKE 2354 M PRK +R VPKE PL SE M +K R+ H +FF E+S PSDR+L ENQR PKE Sbjct: 598 MSPRKLSR----MVPKEPPLNSEPMQIEKHRSHHSAFFPKVENSMPSDRILQENQRLPKE 653 Query: 2355 VHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIG 2534 D+RLR N + S Y SF GEE PL +S SS+RDF +ESGR + ETPAGVLQEI Sbjct: 654 AFHRDNRLRFNQAMSGYHSFSGEEPPLNRSSSSNRDFDYESGR-AISNAETPAGVLQEIA 712 Query: 2535 IKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYL 2714 +KCGTKVEFR AL+ S ELQF +E WF GEKIG GTG+TR+EA QAA+ SL+NLAN Y+ Sbjct: 713 MKCGTKVEFRPALVPSTELQFYVEAWFAGEKIGEGTGRTRREAHFQAAEGSLKNLANIYI 772 Query: 2715 SNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPRLE 2891 S D + GD +K S+V G +G+ NSFG Q LPKED EP+R LDPRL+ Sbjct: 773 SRGKPDALPIHGDASKFSNVTNNGFMGNMNSFGTQPLPKEDSLSSSTSSEPSRPLDPRLD 832 Query: 2892 ESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGKGVGATW 3071 S+KS+ SVSALKELC MEGLS+++ +PP N K EV+ Q EI G++LGKG+G TW Sbjct: 833 NSRKSVSSVSALKELCTMEGLSVLYQPRPP-PPNSTEKDEVHVQAEIDGEVLGKGIGLTW 891 Query: 3072 NEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTRYSN 3251 +EAKMQAAE+AL NL+S L GQ R GSPR L SKRLK EFP++LQR+ SSTRYS Sbjct: 892 DEAKMQAAEKALGNLRSTL--YGQKRQGSPRPLQGMPSKRLKQEFPQVLQRMPSSTRYSK 949 Query: 3252 NAPPVP 3269 NAPPVP Sbjct: 950 NAPPVP 955 >ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Populus trichocarpa] gi|550327613|gb|ERP55122.1| hypothetical protein POPTR_0011s04910g [Populus trichocarpa] Length = 990 Score = 1197 bits (3096), Expect = 0.0 Identities = 647/1000 (64%), Positives = 743/1000 (74%), Gaps = 48/1000 (4%) Frame = +3 Query: 414 MFKSVVYQGNSLLGEVEIYPQNQK-------------ICMIDKEIRISHFSQPSERCPPL 554 M+KSVVY+G LLGEVEIY Q Q+ I I K IRISHFSQ SERCPPL Sbjct: 1 MYKSVVYKGEELLGEVEIYAQEQQQEEEENKNKRKRVIDEIVKGIRISHFSQASERCPPL 60 Query: 555 AVLHTIAPSGVCFKMESKLQS--------KDSPLFSLHSSCLREDKTAVMPLGEEELHLV 710 AVLHTI GVCFKME S ++SPL LHSSC++E+KTAVM LG EELHLV Sbjct: 61 AVLHTITSIGVCFKMEESTASSSTKISSQQESPLRLLHSSCIQENKTAVMLLGGEELHLV 120 Query: 711 AMSSRKNLEQYSCFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRID 890 AM SR N ++ CFWGF V+ GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFED+I+ Sbjct: 121 AMPSRSNERKHPCFWGFNVASGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDKIE 180 Query: 891 ALQRKINTEMDPQRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSD 1070 ALQ+KI+TE+D QR+ +++EIKRYQDDK ILKQYVENDQV+ENGKVIK Q EVVPA SD Sbjct: 181 ALQKKISTEVDQQRILAIISEIKRYQDDKIILKQYVENDQVIENGKVIKTQFEVVPAASD 240 Query: 1071 NYQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCT 1250 N+QP+VRPLIRL EKNII TRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCT Sbjct: 241 NHQPLVRPLIRLPEKNIIFTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT 300 Query: 1251 MAERDYALEMWRLLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDD 1430 MAERDYALEMWRLLDPESNLINS ELL R+VCV +GSRKSL NVF DGICHPKMALVIDD Sbjct: 301 MAERDYALEMWRLLDPESNLINSNELLDRIVCVSSGSRKSLFNVFQDGICHPKMALVIDD 360 Query: 1431 RLKVWDEKDQSRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQ 1610 R+ VWDEKDQSRVHVVPAFAPYYAPQAEANN VP+LCVARNVACNVRGGFFKEFDEGLLQ Sbjct: 361 RMNVWDEKDQSRVHVVPAFAPYYAPQAEANNAVPILCVARNVACNVRGGFFKEFDEGLLQ 420 Query: 1611 LIREVFYEDGLANFPSAPDVSNFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQA 1781 I EV YED +N PS PDVSN+L+SEDD SA+ N+DP F+ D E ERRLK+ + A Sbjct: 421 KIPEVAYEDDTSNIPSPPDVSNYLVSEDDASAANGNRDPPSFDSTADAEVERRLKEAVSA 480 Query: 1782 S--------SMVDSLDPRPAPSLQNAVASSLSTI--SQPT---PQGPIMSSQ-------N 1901 S S V SLDPR SLQ AVASS S + SQP+ Q P+ +SQ N Sbjct: 481 SSTIPSTIPSTVSSLDPRLLQSLQYAVASSSSLMPASQPSMLASQQPVPASQTSMMPFPN 540 Query: 1902 NQFTQV---VKPLVHVGPSEPSLQGSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHT 2072 QF QV VK L V EPSLQ SPAREEGEVPESELDPDTRRRLLILQHGQD+RD+ Sbjct: 541 TQFPQVAPLVKQLGQVVHPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDSRDNA 600 Query: 2073 SSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEEDMIPRKPNRAVPKSVPKESPLESETMH 2252 SE PF +QSRGSW P+EE+M PR+ NR P+E PL+S+ M+ Sbjct: 601 PSESPF---PARPSAPVSAAHVQSRGSWVPVEEEMTPRQLNR-----TPREFPLDSDPMN 652 Query: 2253 FDKRRARHPSFFHGAESSFPSDRMLNENQRFPKEVHQGDDRLRVNHSTSNYRSFPGEETP 2432 +K + HPSFF ES+ PSDRM++ENQR PKE +DR+R+NHST NY SF EETP Sbjct: 653 IEKHQTHHPSFFPKVESNIPSDRMIHENQRLPKEAPYRNDRMRLNHSTPNYHSFQVEETP 712 Query: 2433 LGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIGIKCGTKVEFRSALIASAELQFSIEVW 2612 L +S SS+RD ES R +ETP VLQEI +KC TKVEFR AL+AS +LQFSIE W Sbjct: 713 LSRS-SSNRDLDLESERAF-TISETPVEVLQEIAMKCETKVEFRPALVASIDLQFSIEAW 770 Query: 2613 FTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYLSNVMHDPSSVRGDLNKPSHVNEMG-L 2789 F GEK+G GTGKTR+EAQRQAA+ S++ LA Y+ D + GD ++ N+ G L Sbjct: 771 FAGEKVGEGTGKTRREAQRQAAEGSIKKLAGIYMLRAKPDSGPMHGDSSRYPSANDNGFL 830 Query: 2790 GDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPRLEESKKSMGSVSALKELCMMEGLSLVFL 2969 G+ N FG Q LPK++ EP+R LDPRLE SKKS GSV+ALKE C MEGL + FL Sbjct: 831 GNMNLFGNQPLPKDELVAYSAASEPSRLLDPRLEGSKKSSGSVTALKEFCTMEGLVVNFL 890 Query: 2970 AQPPLSTNLMNKVEVYAQVEIGGQILGKGVGATWNEAKMQAAEEALWNLKSMLGDSGQNR 3149 AQ PLS N + EV+AQVEI GQ+LGKG+G+TW+EAKMQAAE+AL +L++M G Q R Sbjct: 891 AQTPLSANSIPGEEVHAQVEIDGQVLGKGIGSTWDEAKMQAAEKALGSLRTMFGQYTQKR 950 Query: 3150 VGSPRLLHSFSSKRLKPEFPRMLQRINSSTRYSNNAPPVP 3269 GSPR + +KRLK EFPR+LQR+ S RY NAPPVP Sbjct: 951 QGSPRPMQGMPNKRLKQEFPRVLQRMPPSARYHKNAPPVP 990 >ref|XP_002519032.1| double-stranded RNA binding protein, putative [Ricinus communis] gi|223541695|gb|EEF43243.1| double-stranded RNA binding protein, putative [Ricinus communis] Length = 978 Score = 1189 bits (3075), Expect = 0.0 Identities = 634/985 (64%), Positives = 741/985 (75%), Gaps = 33/985 (3%) Frame = +3 Query: 414 MFKSVVYQGNSLLGEVEIYPQNQK--------------------ICMIDKEIRISHFSQP 533 M+KSVVY+G+ LLGEVEIY Q ++ I I K IRISHFSQ Sbjct: 1 MYKSVVYKGDELLGEVEIYAQQEQKLQQQEELQEQEQELKKKRVIDEILKGIRISHFSQA 60 Query: 534 SERCPPLAVLHTIAPSGVCFKMESKLQ-SKDSPLFSLHSSCLREDKTAVMPL-GEEELHL 707 SERCPPLAVLHTI +G+CFKMESK S D+PL LHSSC++E KTAV+ L G EELHL Sbjct: 61 SERCPPLAVLHTITTNGICFKMESKNSVSLDTPLHLLHSSCIQESKTAVVLLQGGEELHL 120 Query: 708 VAMSSRKNLEQYSCFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRI 887 VAM SR + QY CFW F +S GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRI Sbjct: 121 VAMFSRNDERQYPCFWAFNISSGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRI 180 Query: 888 DALQRKINTEMDPQRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFS 1067 +ALQRKI+TE+DPQR+SGML+E+KRYQDDK+ILKQYV+NDQVVENG+VIK Q EVVPA S Sbjct: 181 EALQRKISTELDPQRISGMLSEVKRYQDDKTILKQYVDNDQVVENGRVIKTQFEVVPALS 240 Query: 1068 DNYQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVC 1247 DN+Q IVRPLIRLQE+NIILTRINP IRDTSVLVRLRPAWE+LRSYLTA+GRKRFEVYVC Sbjct: 241 DNHQTIVRPLIRLQERNIILTRINPQIRDTSVLVRLRPAWEELRSYLTARGRKRFEVYVC 300 Query: 1248 TMAERDYALEMWRLLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVID 1427 TMAERDYALEMWRLLDPESNLINSKELL R+VCVK+G RKSL NVF DGICHPKMALVID Sbjct: 301 TMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLRKSLFNVFQDGICHPKMALVID 360 Query: 1428 DRLKVWDEKDQSRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLL 1607 DRLKVWDEKDQ RVHVVPAFAPYYAPQAEANN VPVLCVARNVACNVRGGFFKEFDEGLL Sbjct: 361 DRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDEGLL 420 Query: 1608 QLIREVFYEDGLANFPSAPDVSNFLISEDD--VSASNKDPLGFEGVTDVE-ERRLKDVIQ 1778 Q I E+ +ED + + PS PDVSN+L+ EDD S N+DPL F+G+ D E E+RLK+ I Sbjct: 421 QRIPEISFEDDMNDIPSPPDVSNYLVPEDDAFTSNGNRDPLSFDGMADAEVEKRLKEAIS 480 Query: 1779 AS----SMVDSLDPRPAPSLQNAVASSLSTISQPTPQGPIMSSQNNQFTQ---VVKPLVH 1937 S S V +LD R P LQ +ASS S+I PT Q +++ + Q Q +VKPL Sbjct: 481 ISSAFPSTVANLDARLVPPLQYTMASS-SSIPVPTSQPAVVTFPSMQLPQAAPLVKPLGQ 539 Query: 1938 VGPSEPSLQGSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXX 2117 V PSEPSLQ SPAREEGEVPESELDPDTRRRLLILQHGQD RD SE PF Sbjct: 540 VVPSEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDLRDPAPSESPF-PVRPSNSM 598 Query: 2118 XXXXXXMQSRGSWFPLEEDMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGA 2297 +QSRG+W P+EE+M PR+ NRA V +E P+++E MH DK R HPSFF Sbjct: 599 QVSVPRVQSRGNWVPVEEEMSPRQLNRA----VTREFPMDTEPMHIDKHRPHHPSFFPKV 654 Query: 2298 ESSFPSDRMLNENQRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFES 2477 ESS PS+RM +ENQR PK DDRLR+N + SNY+S GEE L +S SS+RD ES Sbjct: 655 ESSIPSERMPHENQRLPKVAPYKDDRLRLNQTMSNYQSLSGEENSLSRSSSSNRDLDVES 714 Query: 2478 GRGSPPYTETPAGVLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRK 2657 R + ETP VL EI +KCG KVEF+ +L+ S +LQFS+E WF GE++G G G+TR+ Sbjct: 715 DR-AVSSAETPVRVLHEISMKCGAKVEFKHSLVNSRDLQFSVEAWFAGERVGEGFGRTRR 773 Query: 2658 EAQRQAADSSLRNLANKYLSNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKED 2834 EAQ AA++S++NLAN Y+S D ++ GD +K S N+ G LG NSFG Q LPK++ Sbjct: 774 EAQSVAAEASIKNLANIYISRAKPDNGALHGDASKYSSANDNGFLGHVNSFGSQPLPKDE 833 Query: 2835 XXXXXXXXEPTRFLDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEV 3014 E + LDPRLE SKKSM SV+ALKE CMMEGL + FLAQ PLS+N + EV Sbjct: 834 ILSYSDSSEQSGLLDPRLESSKKSMSSVNALKEFCMMEGLGVNFLAQTPLSSNSVQNAEV 893 Query: 3015 YAQVEIGGQILGKGVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRL 3194 +AQVEI GQ++GKG+G+T++EAKMQAAE+AL +L++ G R GSPR + +K L Sbjct: 894 HAQVEIDGQVMGKGIGSTFDEAKMQAAEKALGSLRTTFGRFPPKRQGSPRPVPGMPNKHL 953 Query: 3195 KPEFPRMLQRINSSTRYSNNAPPVP 3269 KPEFPR+LQR+ SS RY NAPPVP Sbjct: 954 KPEFPRVLQRMPSSARYPKNAPPVP 978 >ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like isoform X1 [Glycine max] Length = 956 Score = 1184 bits (3062), Expect = 0.0 Identities = 626/969 (64%), Positives = 752/969 (77%), Gaps = 16/969 (1%) Frame = +3 Query: 411 KMFKSVVYQGNSLLGEVEIYPQ---NQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPS 581 +M+KSVVYQG ++GEV++YP+ N K + KEIRISHFSQPSERCPPLAVLHT+ Sbjct: 2 RMYKSVVYQGEVVVGEVDVYPEENNNYKNFHV-KEIRISHFSQPSERCPPLAVLHTVTSC 60 Query: 582 GVCFKMESKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGF 761 GVCFKMESK Q +D LF LHS C+RE+KTAVMPLG EE+HLVAM SR N+++ CFWGF Sbjct: 61 GVCFKMESKTQQQDG-LFQLHSLCIRENKTAVMPLGGEEIHLVAMHSR-NVDR-PCFWGF 117 Query: 762 IVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSG 941 IV+ GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN+E+DPQR+SG Sbjct: 118 IVALGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISG 177 Query: 942 MLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNI 1121 M AE+KRYQDDK+ILKQY ENDQVV+NG+VIKVQSE+VPA SD++QPIVRPLIRLQ+KNI Sbjct: 178 MQAEVKRYQDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRLQDKNI 237 Query: 1122 ILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPE 1301 ILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDP+ Sbjct: 238 ILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPD 297 Query: 1302 SNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVP 1481 SNLINSKELL R+VCVK+G +KSL NVF DG+CHPKMALVIDDRLKVWDEKDQ RVHVVP Sbjct: 298 SNLINSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVP 357 Query: 1482 AFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSA 1661 AFAPYYAPQAEA+N +PVLCVARNVACNVRGGFFK+FD+GLLQ I ++ YED + + PS Sbjct: 358 AFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKDIPSP 417 Query: 1662 PDVSNFLISEDDVSASN--KDPLGFEGVTDVE-ERRLKDVIQASSMVD----SLDPRPAP 1820 PDVSN+L+SEDD S SN +DP F+G+ D E ER+LKD + A+S + +LDPR Sbjct: 418 PDVSNYLVSEDDGSISNGHRDPFLFDGMADAEVERKLKDALSAASTIPVTTANLDPR-LT 476 Query: 1821 SLQNAVASSLSTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEGE 1991 SLQ + S ++ PT Q +M + QF Q +VKP+ PSEPSL SPAREEGE Sbjct: 477 SLQYTMVPS-GSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSEPSLHSSPAREEGE 535 Query: 1992 VPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEE 2171 VPESELDPDTRRRLLILQHGQDTRDH S+EPPF SRG WFP EE Sbjct: 536 VPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQTSAPHVP--SSRGVWFPAEE 593 Query: 2172 DMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-NQRFP 2348 ++ + NR VPKE P++S + K R HPSFF ESS SDR+L++ +QR P Sbjct: 594 EIGSQPLNRV----VPKEFPVDSGPLGIAKPRPHHPSFFSKVESSISSDRILHDSHQRLP 649 Query: 2349 KEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQE 2528 KE++ DDR R+NH S+YRSF G++ P +S SS RD ESG S + +TP VLQE Sbjct: 650 KEMYHRDDRPRLNHMLSSYRSFSGDDIPFSRSFSSHRDLDSESGH-SVLHADTPVAVLQE 708 Query: 2529 IGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANK 2708 I +KCGTKV+F S+L+AS ELQFS+E WF+G+KIG G+TRKEAQ +AA+ S+++LA+ Sbjct: 709 IALKCGTKVDFISSLVASTELQFSMEAWFSGKKIGHRVGRTRKEAQNKAAEDSIKHLADI 768 Query: 2709 YLSNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPR 2885 YLS+ +P S GD++ +VN+ G +G ++S G Q L KED P+R LDPR Sbjct: 769 YLSSAKDEPGSTYGDVSGFPNVNDSGYMGIASSLGNQPLSKED-SASFSTASPSRVLDPR 827 Query: 2886 LEESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILGKGVG 3062 L+ SK+SMGS+S+LKELCMMEGL + FL+ P P+STN + K EV+AQVEI G++ GKG+G Sbjct: 828 LDVSKRSMGSISSLKELCMMEGLDVNFLSAPAPVSTNSVQKDEVHAQVEIDGKVFGKGIG 887 Query: 3063 ATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTR 3242 TW+EAKMQAAE+AL +L+S LG S Q R SPR FS+KRLK E+PR +QR+ SS R Sbjct: 888 LTWDEAKMQAAEKALGSLRSKLGQSIQKRQSSPRPHQGFSNKRLKQEYPRPMQRMPSSAR 947 Query: 3243 YSNNAPPVP 3269 Y NAPP+P Sbjct: 948 YPRNAPPIP 956 >emb|CAN72816.1| hypothetical protein VITISV_004100 [Vitis vinifera] Length = 894 Score = 1181 bits (3056), Expect = 0.0 Identities = 638/960 (66%), Positives = 725/960 (75%), Gaps = 8/960 (0%) Frame = +3 Query: 414 MFKSVVYQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGVCF 593 M+KS+VY+G+ ++GEVEIYPQNQ + ++ KEIRISH+SQPSERCPPLAVLHTI GVCF Sbjct: 1 MYKSIVYEGDDVVGEVEIYPQNQGLELM-KEIRISHYSQPSERCPPLAVLHTITSCGVCF 59 Query: 594 KMES-KLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFIVS 770 KMES K QS+D+PL+ LHS+C+RE+KTAVM LGEEELHLVAM S+K QY CFWGF V+ Sbjct: 60 KMESSKAQSQDTPLYLLHSTCIRENKTAVMSLGEEELHLVAMYSKKKDGQYPCFWGFNVA 119 Query: 771 PGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGMLA 950 GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTE+DPQR+SGM+A Sbjct: 120 LGLYSSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEVDPQRISGMVA 179 Query: 951 EIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNIILT 1130 E VVENGK+ K Q E+VPA SDN+QPIVRPLIRLQEKNIILT Sbjct: 180 E-------------------VVENGKLFKTQPEIVPALSDNHQPIVRPLIRLQEKNIILT 220 Query: 1131 RINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPESNL 1310 RINPLIRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDPESNL Sbjct: 221 RINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESNL 280 Query: 1311 INSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPAFA 1490 INSKELL R+VCVK+GSRKSL NVF DGICHPKMALVIDDRLKVWDEKDQ RVHVVPAFA Sbjct: 281 INSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVIDDRLKVWDEKDQPRVHVVPAFA 340 Query: 1491 PYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAPDV 1670 PYYAPQAEANN + VLCVARNVACNVRGGFFKEFDEGLLQ I E+ YED + + SAPDV Sbjct: 341 PYYAPQAEANNAISVLCVARNVACNVRGGFFKEFDEGLLQRIPEISYEDBIKDIRSAPDV 400 Query: 1671 SNFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQASSMVDSLDPRPAPSLQNAVA 1841 SN+L+SEDD S S N+D F+G+ DVE ER+LKD I A S V SLDPR +P LQ AVA Sbjct: 401 SNYLVSEDDASVSNGNRDQPCFDGMADVEVERKLKDAISAPSTVTSLDPRLSPPLQFAVA 460 Query: 1842 SSLSTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEGEVPESELD 2012 +S QP QG IM N QF Q ++KPL EP++Q SPAREEGEVPESELD Sbjct: 461 ASSGLAPQPAAQGSIMPFSNKQFPQSASLIKPLA----PEPTMQSSPAREEGEVPESELD 516 Query: 2013 PDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEEDMIPRKP 2192 PDTRRRLLILQHGQDTR+H SS+PPF +QSRGSWFP +E+M PR+ Sbjct: 517 PDTRRRLLILQHGQDTREHASSDPPF---PVRPPIQVSVPRVQSRGSWFPADEEMSPRQL 573 Query: 2193 NRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNENQRFPKEVHQGDD 2372 NRA VPKE PL+S+TMH +K R HPSFFH ESS SDR+L+ENQR KEV DD Sbjct: 574 NRA----VPKEFPLDSDTMHIEKHRPHHPSFFHKVESSASSDRILHENQRLSKEVLHRDD 629 Query: 2373 RLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIGIKCGTK 2552 RLR+NHS Y SF GEE PLG+S SS+RD FESGRG+ PY ETPA +G+ Sbjct: 630 RLRLNHSLPGYHSFSGEEVPLGRS-SSNRDLDFESGRGA-PYAETPA-----VGL----- 677 Query: 2553 VEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYLSNVMHD 2732 L+ EVW GEKIG GTGKTR+EAQ QAA++SL L+ +YL Sbjct: 678 ------------LRNCNEVWNQGEKIGEGTGKTRREAQCQAAEASLMYLSYRYL------ 719 Query: 2733 PSSVRGDLNK-PSHVNEMGLGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPRLEESKKSM 2909 GD+N+ P+ + + D+NSFGYQ+ PKE E +R LDPRLE SKKSM Sbjct: 720 ----HGDVNRFPNASDNNFMSDTNSFGYQSFPKEGSMSFSTASESSRLLDPRLESSKKSM 775 Query: 2910 GSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGKGVGATWNEAKMQ 3089 GS+SALKELCMMEGL + FL+QPPLS+N K E+ AQVEI GQ+LGKG G+TW++AKMQ Sbjct: 776 GSISALKELCMMEGLGVEFLSQPPLSSNSTQKEEICAQVEIDGQVLGKGTGSTWDDAKMQ 835 Query: 3090 AAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTRYSNNAPPVP 3269 AAE+AL +LKSMLG Q R GSPR L KRLK EF R LQR SS RYS N PVP Sbjct: 836 AAEKALGSLKSMLGQFSQKRQGSPRSLQGM-GKRLKSEFTRGLQRTPSSGRYSKNTSPVP 894 >ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like [Glycine max] Length = 960 Score = 1172 bits (3032), Expect = 0.0 Identities = 620/973 (63%), Positives = 742/973 (76%), Gaps = 21/973 (2%) Frame = +3 Query: 414 MFKSVVYQGNSLLGEVEIYPQ--------NQKICMIDKEIRISHFSQPSERCPPLAVLHT 569 M+KSVVYQG ++GEV++YP+ N KEIRISHFSQPSERCPPLAVLHT Sbjct: 1 MYKSVVYQGEVVVGEVDVYPEENNNNNNKNYNKNFHVKEIRISHFSQPSERCPPLAVLHT 60 Query: 570 IAPSGVCFKMESKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSC 749 + GVCFKMESK Q +D LF LHS C+RE+KTAVMPLG EE+HLVAM SR + C Sbjct: 61 VTSCGVCFKMESKTQQQDG-LFQLHSLCIRENKTAVMPLGGEEIHLVAMHSRN--DDRPC 117 Query: 750 FWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQ 929 FWGFIV+ GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN+E+DPQ Sbjct: 118 FWGFIVTLGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQ 177 Query: 930 RVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQ 1109 R+SGM AE+KRY DDK+ILKQY ENDQVV+NG+VIKVQSE+VPA SD++QPIVRPLIRLQ Sbjct: 178 RISGMQAEVKRYLDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRLQ 237 Query: 1110 EKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRL 1289 +KNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRL Sbjct: 238 DKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRL 297 Query: 1290 LDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRV 1469 LDP+SNLINSKELL R+VCVK+G +KSL NVF DG C PKMALVIDDRLKVWDE+DQ RV Sbjct: 298 LDPDSNLINSKELLGRIVCVKSGLKKSLFNVFQDGSCDPKMALVIDDRLKVWDERDQPRV 357 Query: 1470 HVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLAN 1649 HVVPAFAPYYAPQAEA+N +PVLCVARNVACNVRGGFFK+FD+GLLQ I ++ YED + + Sbjct: 358 HVVPAFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKD 417 Query: 1650 FPSAPDVSNFLISEDD--VSASNKDPLGFEGVTDVE-ERRLKDVIQASS----MVDSLDP 1808 PS PDVSN+L+SEDD +S N+DP F+G+ D E ER+LKD + A+S +LDP Sbjct: 418 VPSPPDVSNYLVSEDDGSISNGNRDPFLFDGMADAEVERKLKDALAAASTFPVTTANLDP 477 Query: 1809 RPAPSLQNAVASSLSTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAR 1979 R SLQ + S ++ PT Q +M + QF Q +VKP+ PS+PSL SPAR Sbjct: 478 R-LTSLQYTMVPS-GSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSDPSLHSSPAR 535 Query: 1980 EEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWF 2159 EEGEVPESELDPDTRRRLLILQHGQDTRDH S+EPPF SRG WF Sbjct: 536 EEGEVPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQASAPRVP--SSRGVWF 593 Query: 2160 PLEEDMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-N 2336 P+EE++ + NR VPKE P++S + +K R HPSFF+ ESS SDR+L++ + Sbjct: 594 PVEEEIGSQPLNRV----VPKEFPVDSGPLGIEKPRLHHPSFFNKVESSISSDRILHDSH 649 Query: 2337 QRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAG 2516 QR PKE++ DDR R+NH S+YRSF G++ P +S SS RD ESG S + +TP Sbjct: 650 QRLPKEMYHRDDRPRLNHMLSSYRSFSGDDIPFSRSSSSHRDLDSESGH-SVLHADTPVA 708 Query: 2517 VLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRN 2696 VL EI +KCGTKV+F S+L+AS EL+FS+E WF+G+KIG G G+TRKEAQ +AA S+ + Sbjct: 709 VLHEIALKCGTKVDFMSSLVASTELKFSLEAWFSGKKIGHGFGRTRKEAQNKAAKDSIEH 768 Query: 2697 LANKYLSNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXEPTRF 2873 LA+ YLS+ +P S GD++ +VN+ G +G ++S G Q L KED P+R Sbjct: 769 LADIYLSSAKDEPGSTYGDVSGFPNVNDNGYMGIASSLGNQPLSKED-SASFSSASPSRA 827 Query: 2874 LDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILG 3050 LDPRL+ SK+SMGS+SALKELCMMEGL + FL+ P P+STN + K EV+AQVEI G+I G Sbjct: 828 LDPRLDVSKRSMGSISALKELCMMEGLGVNFLSTPAPVSTNSVQKDEVHAQVEIDGKIFG 887 Query: 3051 KGVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRIN 3230 KG+G TW+EAKMQAAE+AL NL+S LG S Q SPR FS+KRLK E+PR +QR+ Sbjct: 888 KGIGLTWDEAKMQAAEKALGNLRSKLGQSIQKMQSSPRPHQGFSNKRLKQEYPRTMQRMP 947 Query: 3231 SSTRYSNNAPPVP 3269 SS RY NAPP+P Sbjct: 948 SSARYPRNAPPIP 960 >ref|XP_007159305.1| hypothetical protein PHAVU_002G226900g [Phaseolus vulgaris] gi|561032720|gb|ESW31299.1| hypothetical protein PHAVU_002G226900g [Phaseolus vulgaris] Length = 964 Score = 1165 bits (3014), Expect = 0.0 Identities = 622/977 (63%), Positives = 738/977 (75%), Gaps = 25/977 (2%) Frame = +3 Query: 414 MFKSVVYQGNSLLGEVEIYPQNQKICMID-KEIRISHFSQPSERCPPLAVLHTIAPSGVC 590 M+KSVVYQG +LGEVE+YP+ KEIRISHFSQPSERCPPLAVLHT+ GVC Sbjct: 1 MYKSVVYQGEVVLGEVEVYPEENNYKNFHVKEIRISHFSQPSERCPPLAVLHTVTSCGVC 60 Query: 591 FKMESKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFIVS 770 FKMESK Q +D LF LHS C+RE+KTAV+PLG EE+HLVAM SR + FWGFIV+ Sbjct: 61 FKMESKTQQQDG-LFHLHSLCIRENKTAVIPLGGEEIHLVAMHSRN--DDRPRFWGFIVA 117 Query: 771 PGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGMLA 950 GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN+E+DPQR+SGM A Sbjct: 118 LGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISGMQA 177 Query: 951 EIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNIILT 1130 E+KRYQ+DK+ILKQY ENDQVV+NG+V+KVQSE+VPA SDN+QPIVRPLIRLQ+KNIILT Sbjct: 178 EVKRYQEDKNILKQYAENDQVVDNGRVVKVQSEIVPALSDNHQPIVRPLIRLQDKNIILT 237 Query: 1131 RINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPESNL 1310 RINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDP+SNL Sbjct: 238 RINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNL 297 Query: 1311 INSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPAFA 1490 INSKELL R+VCVK+G +KSL NVF DG+CHPKMALVIDDRLKVWDEKDQ RVHVVPAFA Sbjct: 298 INSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAFA 357 Query: 1491 PYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAPDV 1670 PYYAPQAEA+N +PVLCVARNVACNVRGGFFKEFD+GLLQ I +V YED + + P PDV Sbjct: 358 PYYAPQAEASNSIPVLCVARNVACNVRGGFFKEFDDGLLQKIPQVAYEDDIKDIPIPPDV 417 Query: 1671 SNFLISEDD----VSASNKDPLGFEGVTDVE-ERRLK---------DVIQASSMVD---- 1796 SN+L+SEDD +S N+DP F+ + D E ER+ K D + A+S + Sbjct: 418 SNYLVSEDDGSSAISNGNRDPFLFDSMGDAEVERKSKVPTRAPNEHDALSAASTIPVTTA 477 Query: 1797 SLDPRPAPSLQNAVASSLSTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQG 1967 +LDPR SLQ A+ SS + PT Q +M + QF Q +VKP+ PSE SL Sbjct: 478 NLDPR-LTSLQYAMVSS-GSAPPPTAQASMMPFTHVQFPQPAALVKPMGQAAPSESSLHS 535 Query: 1968 SPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSR 2147 SPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTS+EP + + SR Sbjct: 536 SPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTSNEPTY---AIRHPVPVSAPRVSSR 592 Query: 2148 GSWFPLEEDMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRML 2327 G WFP EED+ + NR VPKE ++S ++ +K R HPSFF ESS SDR+L Sbjct: 593 GGWFPAEEDIGSQPLNRV----VPKEFSVDSGSLVIEKHRPHHPSFFSKVESSISSDRIL 648 Query: 2328 NE-NQRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTE 2504 ++ +QR PKE++ DDR R NH S+YRS +E P +S SS RD ES S + + Sbjct: 649 HDSHQRLPKEMYHRDDRPRSNHMLSSYRSLSVDEIPFSRSSSSHRDLDSESSH-SVFHAD 707 Query: 2505 TPAGVLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADS 2684 TP VLQEI +KCGTKVEF S+L+AS ELQFSIE WF+G+KIG G G+TRKEAQ +AA+ Sbjct: 708 TPVVVLQEIALKCGTKVEFMSSLVASTELQFSIEAWFSGKKIGHGFGRTRKEAQHKAAED 767 Query: 2685 SLRNLANKYLSNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXE 2861 S+++LA+ YLS+ +P S GD+ + N+ G + ++S Q LPKED + Sbjct: 768 SIKHLADIYLSSAKDEPGSTYGDVGGFPNANDNGYMVIASSLSNQPLPKEDSASFSTASD 827 Query: 2862 PTRFLDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGG 3038 P+R LDPRLE SK+ MGS+SALKELCMMEGL + FL+ P P+STN + K EV+AQVEI G Sbjct: 828 PSRVLDPRLEVSKRPMGSISALKELCMMEGLGVNFLSAPAPVSTNSLQKDEVHAQVEIDG 887 Query: 3039 QILGKGVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRML 3218 ++ GKG+G TW+EAKMQAAE+AL +L+S LG S Q R SPR FS+KRLK E+PR + Sbjct: 888 KVFGKGIGLTWDEAKMQAAEKALGSLRSKLGQSIQKRQSSPRSHQGFSNKRLKQEYPRAM 947 Query: 3219 QRINSSTRYSNNAPPVP 3269 QRI SSTRY NAPP+P Sbjct: 948 QRIPSSTRYPRNAPPIP 964 >ref|XP_006583810.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like isoform X2 [Glycine max] Length = 929 Score = 1155 bits (2988), Expect = 0.0 Identities = 615/969 (63%), Positives = 738/969 (76%), Gaps = 16/969 (1%) Frame = +3 Query: 411 KMFKSVVYQGNSLLGEVEIYPQ---NQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPS 581 +M+KSVVYQG ++GEV++YP+ N K + KEIRISHFSQPSERCPPLAVLHT+ Sbjct: 2 RMYKSVVYQGEVVVGEVDVYPEENNNYKNFHV-KEIRISHFSQPSERCPPLAVLHTVTSC 60 Query: 582 GVCFKMESKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGF 761 GVCFKMESK Q +D LF LHS C+RE+KTAVMPLG EE+HLVAM SR N+++ CFWGF Sbjct: 61 GVCFKMESKTQQQDG-LFQLHSLCIRENKTAVMPLGGEEIHLVAMHSR-NVDR-PCFWGF 117 Query: 762 IVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSG 941 IV+ GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN+E+DPQR+SG Sbjct: 118 IVALGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISG 177 Query: 942 MLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNI 1121 M AE+KRYQDDK+ILKQY ENDQVV+NG+VIKVQSE+VPA SD++QPIVRPLIRLQ+KNI Sbjct: 178 MQAEVKRYQDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRLQDKNI 237 Query: 1122 ILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPE 1301 ILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDP+ Sbjct: 238 ILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPD 297 Query: 1302 SNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVP 1481 SNLINSKELL R+VCVK+G +KSL NVF DG+CHPKMALVIDDRLKVWDEKDQ RVHVVP Sbjct: 298 SNLINSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVP 357 Query: 1482 AFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSA 1661 AFAPYYAPQAEA+N +PVLCVARNVACNVRGGFFK+FD+GLLQ I ++ YED + + PS Sbjct: 358 AFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKDIPSP 417 Query: 1662 PDVSNFLISEDDVSASN--KDPLGFEGVTDVE-ERRLKDVIQASSMVD----SLDPRPAP 1820 PDVSN+L+SEDD S SN +DP F+G+ D E ER+LKD + A+S + +LDPR Sbjct: 418 PDVSNYLVSEDDGSISNGHRDPFLFDGMADAEVERKLKDALSAASTIPVTTANLDPR-LT 476 Query: 1821 SLQNAVASSLSTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEGE 1991 SLQ + S ++ PT Q +M + QF Q +VKP+ PSEPSL SPAREEGE Sbjct: 477 SLQYTMVPS-GSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSEPSLHSSPAREEGE 535 Query: 1992 VPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEE 2171 VPESELDPDTRRRLLILQHGQDTRDH S+EPPF SRG WFP EE Sbjct: 536 VPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQTSAPHVP--SSRGVWFPAEE 593 Query: 2172 DMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-NQRFP 2348 ++ + NR VPKE P++S + K R HPSFF ESS SDR+L++ +QR P Sbjct: 594 EIGSQPLNRV----VPKEFPVDSGPLGIAKPRPHHPSFFSKVESSISSDRILHDSHQRLP 649 Query: 2349 KEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQE 2528 KE++ DDR R+NH S+YRSF ++TP VLQE Sbjct: 650 KEMYHRDDRPRLNHMLSSYRSF----------------------------SDTPVAVLQE 681 Query: 2529 IGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANK 2708 I +KCGTKV+F S+L+AS ELQFS+E WF+G+KIG G+TRKEAQ +AA+ S+++LA+ Sbjct: 682 IALKCGTKVDFISSLVASTELQFSMEAWFSGKKIGHRVGRTRKEAQNKAAEDSIKHLADI 741 Query: 2709 YLSNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPR 2885 YLS+ +P S GD++ +VN+ G +G ++S G Q L KED P+R LDPR Sbjct: 742 YLSSAKDEPGSTYGDVSGFPNVNDSGYMGIASSLGNQPLSKED-SASFSTASPSRVLDPR 800 Query: 2886 LEESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILGKGVG 3062 L+ SK+SMGS+S+LKELCMMEGL + FL+ P P+STN + K EV+AQVEI G++ GKG+G Sbjct: 801 LDVSKRSMGSISSLKELCMMEGLDVNFLSAPAPVSTNSVQKDEVHAQVEIDGKVFGKGIG 860 Query: 3063 ATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTR 3242 TW+EAKMQAAE+AL +L+S LG S Q R SPR FS+KRLK E+PR +QR+ SS R Sbjct: 861 LTWDEAKMQAAEKALGSLRSKLGQSIQKRQSSPRPHQGFSNKRLKQEYPRPMQRMPSSAR 920 Query: 3243 YSNNAPPVP 3269 Y NAPP+P Sbjct: 921 YPRNAPPIP 929 >ref|XP_004505032.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like [Cicer arietinum] Length = 951 Score = 1154 bits (2986), Expect = 0.0 Identities = 619/966 (64%), Positives = 728/966 (75%), Gaps = 14/966 (1%) Frame = +3 Query: 414 MFKSVVYQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGVCF 593 M+KS+VYQG +LGEV+IYP+ KEIRISHF+QPSERC PLAVLHTI SGVCF Sbjct: 1 MYKSLVYQGEVVLGEVDIYPEVNNNNKNFKEIRISHFTQPSERCLPLAVLHTITSSGVCF 60 Query: 594 KMESKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFIVSP 773 KMESK Q +D PLF LH+ C RE+KTAVMPL EE+HLVAM SR N CFWG+IV Sbjct: 61 KMESKTQQQD-PLFHLHNLCFRENKTAVMPLCGEEMHLVAMHSRSNGRP--CFWGYIVGM 117 Query: 774 GLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGMLAE 953 GLYNSCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN+E+DPQR+SGM AE Sbjct: 118 GLYNSCLMMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISGMQAE 177 Query: 954 IKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNIILTR 1133 +KRY +DKSILKQYVENDQVV+NGKV+K QSE+VPA SD++QPIVRPLIRL EKNIILTR Sbjct: 178 VKRYLEDKSILKQYVENDQVVDNGKVLKAQSELVPALSDSHQPIVRPLIRLHEKNIILTR 237 Query: 1134 INPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPESNLI 1313 INP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDP+SNLI Sbjct: 238 INPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNLI 297 Query: 1314 NSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPAFAP 1493 NSKELL R+VCVK+G +KSL NVF DG+CHPKMALVIDDRLKVWDEKDQ RVHVVPAFAP Sbjct: 298 NSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAP 357 Query: 1494 YYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAPDVS 1673 YYAPQAEA+N +PVLCVARNVACNVRGGFFK+FD+GLLQ I ++ YE+ + APDVS Sbjct: 358 YYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKISQIAYENNTRDISPAPDVS 417 Query: 1674 NFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQASSMVD----SLDPRPAPSLQN 1832 N+L+SEDD SAS N+DP F+G+ D E ER+LKD I A+S + LDPR SLQ Sbjct: 418 NYLVSEDDGSASYANRDPFAFDGMADAEVERKLKDAISAASAIPMTTAKLDPRLTSSLQY 477 Query: 1833 AVASSLSTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEGEVPES 2003 + S ++ P Q ++ + QF Q +VKP+ V PSE SL SPAREEGEVPES Sbjct: 478 TMVSP-GSVLPPAAQASMIPLPHTQFPQPATLVKPIGQVAPSELSLHSSPAREEGEVPES 536 Query: 2004 ELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEEDMIP 2183 ELDPDTRRRLLILQHGQD RDHTSSEPPF + RG WFP+EE++ Sbjct: 537 ELDPDTRRRLLILQHGQDNRDHTSSEPPF----PLKHPVQVSARVPPRGGWFPVEEEIGS 592 Query: 2184 RKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-NQRFPKEVH 2360 + PNR +PKE L+S +K R FF + S SDR L+E NQR PKE++ Sbjct: 593 QPPNRV----IPKEIALDSGPSRIEKHRLHQQPFFPKVDGSISSDRALHETNQRLPKEMY 648 Query: 2361 QGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIGIK 2540 DDR RV+H S+Y S G++TP G+S SS RDF ESG S ETPA VLQEI +K Sbjct: 649 HRDDRSRVSHMLSSYPSLSGDDTPFGRSSSSHRDFDSESGH-SVFNAETPAIVLQEIALK 707 Query: 2541 CGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYLSN 2720 CGTKVEF S+L AS ELQFSIE WF+G+KIG G G+TR EAQ +AA+ S+++LA+ YLS Sbjct: 708 CGTKVEFTSSLAASRELQFSIEAWFSGKKIGHGFGRTRMEAQYKAAEDSIKHLADIYLSR 767 Query: 2721 VMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPRLEES 2897 + S GD++ + N+ G +G+ +S G Q LPKE+ +P+R LDPRL+ S Sbjct: 768 AKDESGSAFGDVSGFPNANDNGYVGNVSSLGNQPLPKEESVSFSAASDPSRVLDPRLDVS 827 Query: 2898 KKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILGKGVGATWN 3074 K+SMGSVSALKELCM+EGL + FL+ P P+STN ++ EV+AQVEI GQ+ GKG G TW+ Sbjct: 828 KRSMGSVSALKELCMVEGLGVNFLSLPAPVSTNSVD--EVHAQVEIDGQVYGKGTGITWD 885 Query: 3075 EAKMQAAEEALWNLKSMLGDSG-QNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTRYSN 3251 EAKMQAAE+AL +L++ + G Q R SPR S+KRLK E PR LQR SS RY Sbjct: 886 EAKMQAAEKALGSLRTTIHGQGIQRRQLSPRPFQGLSNKRLKQEHPRTLQRFASSGRYPR 945 Query: 3252 NAPPVP 3269 NAPP+P Sbjct: 946 NAPPIP 951 >ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like [Solanum tuberosum] Length = 953 Score = 1126 bits (2913), Expect = 0.0 Identities = 619/968 (63%), Positives = 717/968 (74%), Gaps = 16/968 (1%) Frame = +3 Query: 414 MFKS--VVYQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGV 587 MFKS V+Y+G L+GEVEIY + + + +K IRISH+S SERCPPLAVLHT+ +G+ Sbjct: 1 MFKSTVVLYEGERLVGEVEIYCE-KGVLWGEKVIRISHYSPSSERCPPLAVLHTVT-TGL 58 Query: 588 CFKME---SKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWG 758 FK+E SK ++DSPL LHS+CLR++KTAVM LG EELHLVAM S+ Q CFWG Sbjct: 59 SFKLEPTKSKPLTQDSPLTLLHSTCLRDNKTAVMSLGREELHLVAMQSKNIGGQCPCFWG 118 Query: 759 FIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVS 938 F V+ GLY+SCL MLNLRCLGIVFDLDETLIVANTMRSFEDRI+ALQRKIN+E DPQR S Sbjct: 119 FKVASGLYDSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINSESDPQRAS 178 Query: 939 GMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKN 1118 MLAE+KRYQ+DK ILKQY ENDQVV+NGKVIK QSEV PA SDN+QPIVRPLIRLQ++N Sbjct: 179 VMLAEVKRYQEDKIILKQYAENDQVVDNGKVIKSQSEVFPALSDNHQPIVRPLIRLQDRN 238 Query: 1119 IILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDP 1298 IILTRINP+IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDP Sbjct: 239 IILTRINPMIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP 298 Query: 1299 ESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVV 1478 +SNLINS+ELL R+VCVK+G RKSL NVF DG CHPKMALVIDDRLKVWD+KDQ RVHVV Sbjct: 299 DSNLINSQELLDRIVCVKSGLRKSLFNVFQDGNCHPKMALVIDDRLKVWDDKDQPRVHVV 358 Query: 1479 PAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPS 1658 PAFAPY+APQAE NN VPVLCVARNVACNVRGGFFK+FDEGLLQ I EV YED + PS Sbjct: 359 PAFAPYFAPQAEGNNSVPVLCVARNVACNVRGGFFKDFDEGLLQRISEVAYEDDIKQVPS 418 Query: 1659 APDVSNFLISEDDVSA--SNKDPLGFEGVTDVE-ERRLKDVIQASSMVDS----LDPRPA 1817 APDVSN+LISEDD SA NKD LGF+G+ D E ERRLK+ + AS+ V S LDPR Sbjct: 419 APDVSNYLISEDDPSAVNGNKDSLGFDGMADSEVERRLKEAMLASTSVPSQMTNLDPRLV 478 Query: 1818 PSLQNAVASSLSTISQPTPQGPIMSSQNNQFTQVVKPLV-HVGPSEPSLQGSPAREEGEV 1994 P+LQ V +S S +P P + Q T V+K V + P + SLQ SPAREEGEV Sbjct: 479 PALQYPVPPVISQPSIQSPVVPFPTQHLPQVTSVLKSSVTQISPQDTSLQSSPAREEGEV 538 Query: 1995 PESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEED 2174 PESELDPDTRRRLLILQHGQDTRD SSEP F +Q G WFP EE+ Sbjct: 539 PESELDPDTRRRLLILQHGQDTRDQVSSEPKF--PMGTPLQVSVPPRVQPHG-WFPAEEE 595 Query: 2175 MIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNENQRFPKE 2354 M PR+ NR +P PKE PL E+MH +K R HP F E+S PSDR+L ENQR PKE Sbjct: 596 MSPRQLNRPLP---PKEFPLNPESMHINKHRPPHPPFLPKMETSMPSDRVLFENQRLPKE 652 Query: 2355 VHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIG 2534 V DDR+R + S ++R PGEE PLG+S SS+R E G PY ETPAG LQ+I Sbjct: 653 VIPRDDRMRFSQSQPSFRP-PGEEVPLGRSSSSNRVLDLEPGH-YDPYLETPAGALQDIA 710 Query: 2535 IKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYL 2714 KCG KVEFRS+ ++S ELQFS+EV F GEK+G GTG+TR+EAQR+AA+ SL LA+KYL Sbjct: 711 FKCGAKVEFRSSFLSSPELQFSLEVLFAGEKVGEGTGRTRREAQRRAAEESLMYLADKYL 770 Query: 2715 SNVMHDPSSVRGDLNKPSHVNEMGLGDSNS-FGYQTLPKEDXXXXXXXXEPTRFLDPRLE 2891 S + D SS +GD + + ++ G D+ S FGYQ D EP R LDPRLE Sbjct: 771 SCIKPDSSSTQGDGFRFPNASDNGFVDNMSPFGYQ-----DRVSHSFASEPPRVLDPRLE 825 Query: 2892 ESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGKGVGATW 3071 KKS+GSV AL+ELC +EGL L F QP LS N K E+YAQVEI GQ+ GKG+G+TW Sbjct: 826 VFKKSVGSVGALRELCAIEGLGLAFQTQPQLSANPGQKSEIYAQVEIDGQVFGKGIGSTW 885 Query: 3072 NEAKMQAAEEALWNLKSMLGDSGQNRVGSPR-LLHSFSSKRLKPEFPR-MLQRINSSTRY 3245 ++AK QAAE AL LKS L Q R GSPR L FS+KRLKPE+ R + QR+ S R+ Sbjct: 886 DDAKTQAAERALVALKSELAQFSQKRQGSPRSLQQGFSNKRLKPEYSRGVQQRVPLSGRF 945 Query: 3246 SNNAPPVP 3269 N +P Sbjct: 946 PKNTSAMP 953 >ref|XP_003545893.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like isoform X1 [Glycine max] Length = 958 Score = 1124 bits (2908), Expect = 0.0 Identities = 606/970 (62%), Positives = 719/970 (74%), Gaps = 18/970 (1%) Frame = +3 Query: 414 MFKSVVYQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGVCF 593 M +S+VY G +GEVEIYP+ +K + KEIRISHFSQPSERCPPLAVLHTI G+CF Sbjct: 1 MKRSMVYHGEMEVGEVEIYPEEKKNIDL-KEIRISHFSQPSERCPPLAVLHTITSFGICF 59 Query: 594 KMES---KLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFI 764 KMES + + + LF LHSSC+RE+KTAVMPL EE+HLVAM SR N CFWGFI Sbjct: 60 KMESSTSQTRQQQDVLFHLHSSCIRENKTAVMPLRGEEIHLVAMYSRNN--DRPCFWGFI 117 Query: 765 VSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGM 944 V+ GLYNSCL MLNLRCLGIVFDLDETL+VANTMRSFED+I+ L RK+N+E++PQR+S M Sbjct: 118 VASGLYNSCLTMLNLRCLGIVFDLDETLVVANTMRSFEDKIEVLHRKMNSEVNPQRISTM 177 Query: 945 LAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNII 1124 AEIKRY DDK+ILK+Y ENDQVV+NGKVIK+QSE+VPA SD++QPIVRPLIRLQEKNII Sbjct: 178 QAEIKRYLDDKNILKEYAENDQVVDNGKVIKIQSEIVPALSDSHQPIVRPLIRLQEKNII 237 Query: 1125 LTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPES 1304 LTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEV+VCTMAERDYALEMWRLLDPE Sbjct: 238 LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPEL 297 Query: 1305 NLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPA 1484 NLINSKELL R+VCVK+G +KSL NVF +G+CH KMALVIDDRLKVWDEKDQ +VHVVPA Sbjct: 298 NLINSKELLDRIVCVKSGLKKSLFNVFQNGLCHLKMALVIDDRLKVWDEKDQPQVHVVPA 357 Query: 1485 FAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAP 1664 FAPYYAPQAEA+N VP LC+AR+VACNVRGGFFK+FD+GLLQ I + YED + + PS P Sbjct: 358 FAPYYAPQAEASNAVPTLCLARSVACNVRGGFFKDFDDGLLQKIPLIAYEDDIKDIPSPP 417 Query: 1665 DVSNFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQASS----MVDSLDPRPA-- 1817 DVSN+L+SEDD SAS NK+ L F+G+ D E ERRLKD I ASS M +LDPR A Sbjct: 418 DVSNYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTVPAMTTNLDPRLAFN 477 Query: 1818 PSLQNAVASSLSTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEG 1988 SLQ + SS T+ PT Q I+ N QF Q +VKP+ V P PSL SPAREEG Sbjct: 478 SSLQYTMVSSSGTVPPPTAQASIVQFGNVQFPQPNTLVKPICQVTPPGPSLHSSPAREEG 537 Query: 1989 EVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLE 2168 EVPESELD DTRRRLLILQHGQDTR+HTSSEPP + SR WF +E Sbjct: 538 EVPESELDLDTRRRLLILQHGQDTREHTSSEPPL---PVRHPTQVSAPSVPSRRGWFSVE 594 Query: 2169 EDMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-NQRF 2345 E+M P++ N+ VPKE P+ SE +H +KR RHPS F + S SDR+ +E +QR Sbjct: 595 EEMGPQQLNQL----VPKEFPVGSEPLHIEKRWPRHPSLFSKVDDSVSSDRVFHESHQRL 650 Query: 2346 PKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQ 2525 PKEVH DD R++ S S+Y SFPG++ PL S S+RDF ESGR S + + AGVLQ Sbjct: 651 PKEVHHRDDHSRLSQSLSSYHSFPGDDIPLSGSSYSNRDFDSESGR-SLFHADITAGVLQ 709 Query: 2526 EIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLAN 2705 EI +KCGTKVEF S+L+AS LQFSIE WF G+K+G G G+TR+EAQ +AA+ S++ LA+ Sbjct: 710 EIALKCGTKVEFLSSLVASTALQFSIEAWFAGKKVGEGFGRTRREAQNKAAECSIKQLAD 769 Query: 2706 KYLSNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDP 2882 Y+S+ D S GD++ N G + NS G Q LPKE + +R DP Sbjct: 770 IYMSHAKDDSGSTYGDVSGFHGSNNNGFVSSGNSLGNQLLPKES-VSFSTSSDSSRVSDP 828 Query: 2883 RLEESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILGKGV 3059 RLE SK+S S+SALKE CMMEGL+ F + P P ST+ K EV+AQVEI GQI GKG Sbjct: 829 RLEVSKRSTDSISALKEFCMMEGLAANFQSSPAPASTHFAQKDEVHAQVEIDGQIFGKGF 888 Query: 3060 GATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSST 3239 G TW EAKMQAA++AL +L++M + R GSPR + ++KRLK E+PR LQRI S Sbjct: 889 GLTWEEAKMQAAKKALESLRTMFNQGTRKRHGSPRSMQGLANKRLKQEYPRTLQRIPYSA 948 Query: 3240 RYSNNAPPVP 3269 RY NAP VP Sbjct: 949 RYPRNAPLVP 958 >ref|XP_006597421.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like isoform X3 [Glycine max] Length = 932 Score = 1108 bits (2865), Expect = 0.0 Identities = 599/969 (61%), Positives = 712/969 (73%), Gaps = 17/969 (1%) Frame = +3 Query: 414 MFKSVVYQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGVCF 593 M +S+VY G +GEVEIYP+ +K + KEIRISHFSQPSERCPPLAVLHTI G+CF Sbjct: 1 MKRSMVYHGEMEVGEVEIYPEEKKNIDL-KEIRISHFSQPSERCPPLAVLHTITSFGICF 59 Query: 594 KMES---KLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFI 764 KMES + + + LF LHSSC+RE+KTAVMPL EE+HLVAM SR N CFWGFI Sbjct: 60 KMESSTSQTRQQQDVLFHLHSSCIRENKTAVMPLRGEEIHLVAMYSRNN--DRPCFWGFI 117 Query: 765 VSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGM 944 V+ GLYNSCL MLNLRCLGIVFDLDETL+VANTMRSFED+I+ L RK+N+E++PQR+S M Sbjct: 118 VASGLYNSCLTMLNLRCLGIVFDLDETLVVANTMRSFEDKIEVLHRKMNSEVNPQRISTM 177 Query: 945 LAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNII 1124 AEIKRY DDK+ILK+Y ENDQVV+NGKVIK+QSE+VPA SD++QPIVRPLIRLQEKNII Sbjct: 178 QAEIKRYLDDKNILKEYAENDQVVDNGKVIKIQSEIVPALSDSHQPIVRPLIRLQEKNII 237 Query: 1125 LTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPES 1304 LTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEV+VCTMAERDYALEMWRLLDPE Sbjct: 238 LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPEL 297 Query: 1305 NLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPA 1484 NLINSKELL R+VCVK+G +KSL NVF +G+CH KMALVIDDRLKVWDEKDQ +VHVVPA Sbjct: 298 NLINSKELLDRIVCVKSGLKKSLFNVFQNGLCHLKMALVIDDRLKVWDEKDQPQVHVVPA 357 Query: 1485 FAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAP 1664 FAPYYAPQAEA+N VP LC+AR+VACNVRGGFFK+FD+GLLQ I + YED + + PS P Sbjct: 358 FAPYYAPQAEASNAVPTLCLARSVACNVRGGFFKDFDDGLLQKIPLIAYEDDIKDIPSPP 417 Query: 1665 DVSNFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQASS----MVDSLDPRPA-- 1817 DVSN+L+SEDD SAS NK+ L F+G+ D E ERRLKD I ASS M +LDPR A Sbjct: 418 DVSNYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTVPAMTTNLDPRLAFN 477 Query: 1818 PSLQNAVASSLSTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEG 1988 SLQ + SS T+ PT Q I+ N QF Q +VKP+ V P PSL SPAREEG Sbjct: 478 SSLQYTMVSSSGTVPPPTAQASIVQFGNVQFPQPNTLVKPICQVTPPGPSLHSSPAREEG 537 Query: 1989 EVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLE 2168 EVPESELD DTRRRLLILQHGQDTR+HTSSEPP + SR WF +E Sbjct: 538 EVPESELDLDTRRRLLILQHGQDTREHTSSEPPL---PVRHPTQVSAPSVPSRRGWFSVE 594 Query: 2169 EDMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-NQRF 2345 E+M P++ N+ VPKE P+ SE +H +KR RHPS F + S SDR+ +E +QR Sbjct: 595 EEMGPQQLNQL----VPKEFPVGSEPLHIEKRWPRHPSLFSKVDDSVSSDRVFHESHQRL 650 Query: 2346 PKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQ 2525 PKEVH DD R++ S S+Y SFPG++ PL S S+RDF ESGR S + + AGVLQ Sbjct: 651 PKEVHHRDDHSRLSQSLSSYHSFPGDDIPLSGSSYSNRDFDSESGR-SLFHADITAGVLQ 709 Query: 2526 EIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLAN 2705 EI +KCGTKVEF S+L+AS LQFSIE WF G+K+G G G+TR+EAQ +AA+ S++ LA+ Sbjct: 710 EIALKCGTKVEFLSSLVASTALQFSIEAWFAGKKVGEGFGRTRREAQNKAAECSIKQLAD 769 Query: 2706 KYLSNVMHDPSSVRGDLNKPSHVNEMGLGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPR 2885 Y+S+ D S GD++ G SN+ G+ + DPR Sbjct: 770 IYMSHAKDDSGSTYGDVS--------GFHGSNNNGFVS------------------SDPR 803 Query: 2886 LEESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILGKGVG 3062 LE SK+S S+SALKE CMMEGL+ F + P P ST+ K EV+AQVEI GQI GKG G Sbjct: 804 LEVSKRSTDSISALKEFCMMEGLAANFQSSPAPASTHFAQKDEVHAQVEIDGQIFGKGFG 863 Query: 3063 ATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTR 3242 TW EAKMQAA++AL +L++M + R GSPR + ++KRLK E+PR LQRI S R Sbjct: 864 LTWEEAKMQAAKKALESLRTMFNQGTRKRHGSPRSMQGLANKRLKQEYPRTLQRIPYSAR 923 Query: 3243 YSNNAPPVP 3269 Y NAP VP Sbjct: 924 YPRNAPLVP 932 >ref|XP_004232844.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like [Solanum lycopersicum] Length = 954 Score = 1107 bits (2862), Expect = 0.0 Identities = 608/972 (62%), Positives = 709/972 (72%), Gaps = 20/972 (2%) Frame = +3 Query: 414 MFKSVV--YQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGV 587 MFKS V Y+G L+GEVE+Y + + + +K IRISH+S SERCPPLAVLHT+ +G+ Sbjct: 1 MFKSTVLLYEGERLVGEVEMYGE-KGVVWGEKLIRISHYSPSSERCPPLAVLHTVT-TGL 58 Query: 588 CFKME---SKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWG 758 FK+E SK ++DSPL LHS+CLR++KTAVM LG EELHLVAM S+ Q CFWG Sbjct: 59 SFKLEPTKSKPLTQDSPLTLLHSTCLRDNKTAVMSLGREELHLVAMQSKNIGGQCPCFWG 118 Query: 759 FIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVS 938 F V+ GLY+SCL MLNLRCLGIVFDLDETLIVANTMRSFEDRI+ALQRKIN+E DPQR S Sbjct: 119 FKVASGLYDSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINSESDPQRAS 178 Query: 939 GMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKN 1118 MLAE+KRYQ+DK ILKQY ENDQVV+NGKVI+ QSEV PA SDN+QPIVRPLIRLQ++N Sbjct: 179 VMLAEVKRYQEDKIILKQYAENDQVVDNGKVIRSQSEVFPALSDNHQPIVRPLIRLQDRN 238 Query: 1119 IILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDP 1298 IILTRINP+IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDP Sbjct: 239 IILTRINPMIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP 298 Query: 1299 ESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVV 1478 +SNLINS+ELL R+VCVK+G RKSL NVF DG CHPKMALVIDDRLKVWD+KDQ RVHVV Sbjct: 299 DSNLINSQELLDRIVCVKSGLRKSLFNVFQDGNCHPKMALVIDDRLKVWDDKDQPRVHVV 358 Query: 1479 PAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPS 1658 PAFAPY+APQAE NN VPVLCVARNVACNVRGGFFK+FDEGLLQ I EV YED + PS Sbjct: 359 PAFAPYFAPQAEGNNSVPVLCVARNVACNVRGGFFKDFDEGLLQRISEVAYEDDIKQVPS 418 Query: 1659 APDVSNFLISEDDVSA--SNKDPLGFEGVTDVE-ERRLKDVIQASSMVDS----LDPRPA 1817 APDVSN+LISEDD SA NKD LGF+G+ D E ERRLK+ + AS+ V S LDPR Sbjct: 419 APDVSNYLISEDDPSAVNGNKDSLGFDGMADSEVERRLKEAMLASTSVPSQMTNLDPRLV 478 Query: 1818 PSLQNAVASSLSTISQPTPQGPIMSSQNNQFTQVVKPL----VHVGPSEPSLQGSPAREE 1985 P+LQ V ISQP+ QGP++ QV L + P + SLQ SPAREE Sbjct: 479 PALQYPVP---PVISQPSIQGPVVPFPTQHLPQVTSVLKSSVTQISPQDTSLQSSPAREE 535 Query: 1986 GEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPL 2165 GEVPESELDPDTRRRLLILQHGQDTRD SSEP F +Q G WFP Sbjct: 536 GEVPESELDPDTRRRLLILQHGQDTRDQVSSEPKF--PIGTPLQVSVPPRVQPHG-WFPA 592 Query: 2166 EEDMIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNENQRF 2345 EE++ PR+ NR +P PKE PL E+MH +K R HP F E+S PSDR+ ENQR Sbjct: 593 EEEVSPRQLNRPLP---PKEFPLNPESMHINKHRPPHPPFLPKMETSMPSDRVFFENQRL 649 Query: 2346 PKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQ 2525 PKEV DDR+R + S ++R PGE+ LG+S SS+R + G PY +TPAG LQ Sbjct: 650 PKEVIPRDDRMRFSQSQPSFRP-PGEDVSLGRSSSSNRVLDLDPGH-YDPYLDTPAGALQ 707 Query: 2526 EIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLAN 2705 +I KCG KVEFRS+ ++S ELQF +EV F GEK+G G G+TR+EAQR AA+ SL LA+ Sbjct: 708 DIAFKCGVKVEFRSSFLSSPELQFCLEVLFAGEKVGEGIGRTRREAQRHAAEESLMYLAD 767 Query: 2706 KYLSNVMHDPSSVRGDLNKPSHVNEMGLGDSNS-FGYQTLPKEDXXXXXXXXEPTRFLDP 2882 KYLS + D SS +GD + + ++ G ++ S FGYQ D EP R LDP Sbjct: 768 KYLSCIKADSSSTQGDGFRFPNASDNGFVENMSPFGYQ-----DRVSHSFASEPPRVLDP 822 Query: 2883 RLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGKGVG 3062 RLE KKS+GSV AL+ELC +EGL L F QP LS N K E+YAQVEI GQ+ GKG+G Sbjct: 823 RLEVFKKSVGSVGALRELCAIEGLGLAFQTQPQLSVNPGQKSEIYAQVEIDGQVFGKGIG 882 Query: 3063 ATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLL--HSFSSKRLKPEFPR-MLQRINS 3233 TW++AK QAAE AL LKS L R GSPR L FS+KRLKPE+ R + QR+ Sbjct: 883 PTWDDAKTQAAERALVALKSELAQFSHKRQGSPRSLQQQGFSNKRLKPEYSRGVQQRVPL 942 Query: 3234 STRYSNNAPPVP 3269 S R+ N +P Sbjct: 943 SGRFPKNTSAMP 954 >ref|XP_003543063.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like isoform X1 [Glycine max] gi|571500215|ref|XP_006594604.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like isoform X2 [Glycine max] Length = 960 Score = 1104 bits (2855), Expect = 0.0 Identities = 598/967 (61%), Positives = 707/967 (73%), Gaps = 18/967 (1%) Frame = +3 Query: 423 SVVYQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGVCFKME 602 S+VY G +GEV+IYP+ K + KEIRISHFSQPSERCPPLAVLHTI G+CFKME Sbjct: 4 SMVYHGEMAVGEVKIYPEENKNMDL-KEIRISHFSQPSERCPPLAVLHTITSFGICFKME 62 Query: 603 SKLQSK---DSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFIVSP 773 S K LF LHSSC+RE+KTAVMP+ EE+HLVAM SR N CFWGFIV+ Sbjct: 63 SSTSQKRQQQDALFHLHSSCIRENKTAVMPVRGEEIHLVAMYSRNN--DRPCFWGFIVAS 120 Query: 774 GLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGMLAE 953 GLYNSCL MLNLRCLGIVFDLDETL+VANTMRSFED+I+ L RK+N+E++PQ++S M AE Sbjct: 121 GLYNSCLTMLNLRCLGIVFDLDETLVVANTMRSFEDKIEVLHRKMNSEVNPQQISAMQAE 180 Query: 954 IKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNIILTR 1133 IKRY DDK+ILK+Y ENDQVV+NGKVIK+QSE VPA SD++QPIVRPLIRLQEKNIILTR Sbjct: 181 IKRYLDDKNILKEYAENDQVVDNGKVIKIQSESVPALSDSHQPIVRPLIRLQEKNIILTR 240 Query: 1134 INPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPESNLI 1313 INP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEV+VCTMAERDYALEMWRLLDPE NLI Sbjct: 241 INPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPELNLI 300 Query: 1314 NSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPAFAP 1493 NSKELL R+VCVK+G +KSL NVF +G+CH KMALVIDDRLKVWDEKDQ RVHVVPAFAP Sbjct: 301 NSKELLDRIVCVKSGLKKSLFNVFQNGLCHLKMALVIDDRLKVWDEKDQPRVHVVPAFAP 360 Query: 1494 YYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAPDVS 1673 YY PQAEA+N VP LC+ARNVACNVRGGFFK+FD+GLLQ I + YED + + PS PDVS Sbjct: 361 YYTPQAEASNAVPFLCLARNVACNVRGGFFKDFDDGLLQKIPLIAYEDDIKDIPS-PDVS 419 Query: 1674 NFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQASS----MVDSLDPRPA--PSL 1826 N+L+SEDD SAS NK+ L F+G+ D E ERRLKD I ASS + ++DPR A SL Sbjct: 420 NYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTILALTANIDPRLAFTSSL 479 Query: 1827 QNAVASSLSTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEGEVP 1997 Q + SS T+ PT Q ++ N QF Q +VKP+ V SL SPAREEGE+P Sbjct: 480 QYTMVSSSGTVPPPTAQASVVQFGNVQFPQPNTLVKPMSQVTHPGLSLHSSPAREEGELP 539 Query: 1998 ESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXXMQSRGSWFPLEEDM 2177 ESELD DTRRR LILQHGQDTR+ +SEPPF + SR WF +EE+M Sbjct: 540 ESELDLDTRRRFLILQHGQDTRERMASEPPF-PVRHPAQVSAPASSVPSRRGWFSVEEEM 598 Query: 2178 IPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-NQRFPKE 2354 P++ N VPKE P++SE H +KR RHPSFF S SDR+ +E +QR PKE Sbjct: 599 GPQQLN----LPVPKEFPVDSEPFHIEKRWPRHPSFFSKVGDSISSDRVFHESHQRLPKE 654 Query: 2355 VHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIG 2534 VH DDR R++ S S+Y S PG++ PL S S+RDF ESGR S + +T AGVLQEI Sbjct: 655 VHHRDDRSRLSQSLSSYHSLPGDDIPLSGSSYSNRDFDSESGR-SLFHADTTAGVLQEIA 713 Query: 2535 IKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYL 2714 + CGTKVEF S+L+AS ELQFSIE WF G+KIG G G+TR+EAQ +AA S++ LA+ Y+ Sbjct: 714 LNCGTKVEFLSSLVASTELQFSIEAWFAGKKIGEGFGRTRREAQSKAAGCSIKQLADIYM 773 Query: 2715 SNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXXEPTRFLDPRLE 2891 S+ D S GD++ N G + NS G Q LPKE+ E +R D RLE Sbjct: 774 SHAKDDSGSTYGDVSGFHGSNNDGFVSSGNSLGNQLLPKEESGSFSTASESSRVSDSRLE 833 Query: 2892 ESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILGKGVGAT 3068 SK+S S+SALKELCMMEGL+ F + P ST+L K EV+AQVEI GQI GKG G T Sbjct: 834 VSKRSTDSISALKELCMMEGLAASFQSPPASASTHLTQKDEVHAQVEIDGQIFGKGFGVT 893 Query: 3069 WNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTRYS 3248 W EAKMQAA++AL +L++M R GSPR + ++KRLKPE+P LQR+ S RY Sbjct: 894 WEEAKMQAAKKALGSLRTMFNQGSLKRHGSPRSMQGLANKRLKPEYPPTLQRVPYSARYP 953 Query: 3249 NNAPPVP 3269 NAP VP Sbjct: 954 RNAPLVP 960