BLASTX nr result
ID: Akebia25_contig00017228
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia25_contig00017228 (4267 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform... 1243 0.0 ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Popu... 1233 0.0 ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal doma... 1227 0.0 ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citr... 1226 0.0 ref|XP_007214548.1| hypothetical protein PRUPE_ppa000988mg [Prun... 1215 0.0 ref|XP_007025681.1| C-terminal domain phosphatase-like 1 isoform... 1198 0.0 ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal doma... 1196 0.0 ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Popu... 1192 0.0 ref|XP_002519032.1| double-stranded RNA binding protein, putativ... 1184 0.0 ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal doma... 1183 0.0 emb|CAN72816.1| hypothetical protein VITISV_004100 [Vitis vinifera] 1180 0.0 ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal doma... 1173 0.0 ref|XP_007159305.1| hypothetical protein PHAVU_002G226900g [Phas... 1164 0.0 ref|XP_006583810.1| PREDICTED: RNA polymerase II C-terminal doma... 1154 0.0 ref|XP_004505032.1| PREDICTED: RNA polymerase II C-terminal doma... 1154 0.0 ref|XP_003545893.1| PREDICTED: RNA polymerase II C-terminal doma... 1125 0.0 ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal doma... 1122 0.0 ref|XP_006597421.1| PREDICTED: RNA polymerase II C-terminal doma... 1108 0.0 ref|XP_004232844.1| PREDICTED: RNA polymerase II C-terminal doma... 1104 0.0 ref|XP_003543063.1| PREDICTED: RNA polymerase II C-terminal doma... 1104 0.0 >ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform 1 [Theobroma cacao] gi|508781046|gb|EOY28302.1| C-terminal domain phosphatase-like 1 isoform 1 [Theobroma cacao] Length = 978 Score = 1243 bits (3217), Expect = 0.0 Identities = 665/986 (67%), Positives = 750/986 (76%), Gaps = 34/986 (3%) Frame = -3 Query: 3854 MFKSVVYQGNSLLGEVEIYPQNQ--------------KICMID---KEIRISHFSQPSER 3726 M+KSVVY+G +LGEVEIYPQ Q KI +++ KEIRI + +Q SER Sbjct: 4 MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63 Query: 3725 CPPLAVLHTIAPSGVCFKMESKLQ-----SKDSP-LFSLHSSCLREDKTAVMPLGEEELH 3564 CPPLAVLHTI SG+CFKMES S+DSP L LHS C+R++KTAVMP+G+ ELH Sbjct: 64 CPPLAVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCELH 123 Query: 3563 LVAMSSRKNLEQYSCFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDR 3384 LVAM SR + CFWGF VS GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDR Sbjct: 124 LVAMYSRNS--DRPCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSFEDR 181 Query: 3383 IDALQRKINTEMDPQRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAF 3204 I+ALQRK+ TE+DPQRV+GM+AE+KRYQDDK+ILKQY ENDQVVENGKVIK+QSEVVPA Sbjct: 182 IEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVVPAL 241 Query: 3203 SDNYQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYV 3024 SDN+QPI+RPLIRLQEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYV Sbjct: 242 SDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 301 Query: 3023 CTMAERDYALEMWRLLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVI 2844 CTMAERDYALEMWRLLDPESNLINSKELL R+VCVK+GSRKSL NVF DGICHPKMALVI Sbjct: 302 CTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVI 361 Query: 2843 DDRLKVWDEKDQSRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGL 2664 DDRLKVWDEKDQ RVHVVPAFAPYYAPQAEANN +PVLCVARNVACNVRGGFF+EFDEGL Sbjct: 362 DDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGL 421 Query: 2663 LQLIREVFYEDGLANFPSAPDVSNFLISEDDVSA--SNKDPLGFEGVTDVE-ERRLKDVI 2493 LQ I E+ YED + + PS PDV N+L+SEDD SA NKDPL F+G+ D E ERRLK+ I Sbjct: 422 LQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERRLKEAI 481 Query: 2492 QA----SSMANSLDPRPAPSLQNAVASSLGTISQPTPQGPIMSSQNNQF---TQVVKPLV 2334 A SS A +LDPR PSLQ + SS +I Q I+S N QF VVKP+ Sbjct: 482 SATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPVVKPVA 541 Query: 2333 HVGPSEPSLQGSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXX 2154 V EPSLQ SPAREEGEVPESELDPDTRRRLLILQHGQDTRDHT EP F Sbjct: 542 PVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPVRPTMQ 601 Query: 2153 XXXXXXPMQSRGSWFPLEEDRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHG 1974 QSRGSWF EE+ PR+ NRA PKE PL+SE MH +K RHP FF Sbjct: 602 VSVPRG--QSRGSWFAAEEEMSPRQLNRA----APKEFPLDSERMHIEKH--RHPPFFPK 653 Query: 1973 AESSFPSDRMLNENQRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFE 1794 ESS PSDR+L ENQR KE DDRL +NH+ S+Y SF GEE PL QS SS RD FE Sbjct: 654 VESSIPSDRLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFE 713 Query: 1793 SGRGSPPYTETPAGVLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTR 1614 SGR + ET AGVLQ+I +KCG KVEFR AL+AS +LQFSIE WF GEK+G G G+TR Sbjct: 714 SGR-TVTSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFAGEKVGEGVGRTR 772 Query: 1613 KEAQRQAADSSLRNLANKYLSNVMHDPSSVRGDLNKPSHVNEMGL-GDSNSFGYQTLPKE 1437 +EAQRQAA+ S++NLAN YLS + D S GDL++ ++N+ G + NSFG Q L KE Sbjct: 773 REAQRQAAEESIKNLANTYLSRIKPDSGSAEGDLSRLHNINDNGFPSNVNSFGNQLLAKE 832 Query: 1436 DXXXXXXXSEPTRFLDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVE 1257 + SE +R DPRLE SKKSMGSV+ALKELCMMEGL +VF QPP S+N + K E Sbjct: 833 ESLSFSTASEQSRLADPRLEGSKKSMGSVTALKELCMMEGLGVVFQPQPPSSSNALQKDE 892 Query: 1256 VYAQVEIGGQILGKGVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKR 1077 VYAQVEI GQ+LGKG G TW EAKMQAAE+AL +L+SMLG Q R GSPR L +KR Sbjct: 893 VYAQVEIDGQVLGKGTGLTWEEAKMQAAEKALGSLRSMLGQYSQKRQGSPRSLQGMQNKR 952 Query: 1076 LKPEFPRMLQRINSSTRYSNNAPPVP 999 LKPEFPR+LQR+ SS RY NAPPVP Sbjct: 953 LKPEFPRVLQRMPSSGRYPKNAPPVP 978 >ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Populus trichocarpa] gi|550340277|gb|EEE85528.2| hypothetical protein POPTR_0004s04010g [Populus trichocarpa] Length = 996 Score = 1233 bits (3189), Expect = 0.0 Identities = 664/1008 (65%), Positives = 757/1008 (75%), Gaps = 56/1008 (5%) Frame = -3 Query: 3854 MFKSVVYQGNSLLGEVEIYPQNQK-------------ICMIDKEIRISHFSQPSERCPPL 3714 M+KSVVY+G+ LLGEVEIY Q Q+ I I KEIRISHFSQ SERCPPL Sbjct: 1 MYKSVVYKGDELLGEVEIYAQEQQQEEEENKNKKKRVIDEIVKEIRISHFSQTSERCPPL 60 Query: 3713 AVLHTIAPSGVCFKME-------SKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVA 3555 AVLHTI GVCFKME +K+ ++SPL LHSSC++E+KTAVM LG EELHLVA Sbjct: 61 AVLHTITSIGVCFKMEESTSSSTTKISQQESPLHLLHSSCIQENKTAVMHLGGEELHLVA 120 Query: 3554 MSSRKNLEQYSCFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDA 3375 M SR N Q+ CFWGF V+PGLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDA Sbjct: 121 MPSRSNERQHPCFWGFSVAPGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDA 180 Query: 3374 LQRKINTEMDPQRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDN 3195 LQRKI+TE+DPQR+ GML+E+KRY DDK+ILKQYVENDQVVENGKVIK QSEVVPA SDN Sbjct: 181 LQRKISTEVDPQRILGMLSEVKRYHDDKNILKQYVENDQVVENGKVIKTQSEVVPALSDN 240 Query: 3194 YQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTM 3015 +QP+VRPLIRLQEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTM Sbjct: 241 HQPMVRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTM 300 Query: 3014 AERDYALEMWRLLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDR 2835 AERDYALEMWRLLDPESNLINSKELL R+VCVK+G RKSL NVF DGICHPKMALVIDDR Sbjct: 301 AERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLRKSLFNVFQDGICHPKMALVIDDR 360 Query: 2834 LKVWDEKDQSRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQL 2655 LKVWDE+DQSRVHVVPAFAPYYAPQAE NN VPVLCVARNVACNVRGGFFKEFDEGLLQ Sbjct: 361 LKVWDERDQSRVHVVPAFAPYYAPQAEVNNAVPVLCVARNVACNVRGGFFKEFDEGLLQK 420 Query: 2654 IREVFYEDGLANFPSAPDVSNFLISEDDVSA--SNKDPLGFEGVTDVE-ERRLKDVIQAS 2484 I EV YED N PS PDVSN+L+SEDD SA N+D L F+G+ D E ER+LK+ + AS Sbjct: 421 IPEVAYEDDTDNIPSPPDVSNYLVSEDDASAVNGNRDQLSFDGMADAEVERQLKEAVSAS 480 Query: 2483 --------SMANSLDPRPAPSLQNAVASSLGTISQPTPQGPIMSSQ-------------- 2370 S +SLDPR SLQ +ASS + S PT Q +++SQ Sbjct: 481 SAILSTIPSTVSSLDPRLLQSLQYTIASS--SSSMPTSQPSMLASQQPMPALQPPKPPSQ 538 Query: 2369 -------NNQFTQV---VKPLVHVGPSEPSLQGSPAREEGEVPESELDPDTRRRLLILQH 2220 N QF QV VK L V P EPSLQ SPAREEGEVPESELDPDTRRRLLILQH Sbjct: 539 LSMTPFPNTQFPQVAPSVKQLGQVVPPEPSLQSSPAREEGEVPESELDPDTRRRLLILQH 598 Query: 2219 GQDTRDHTSSEPPFXXXXXXXXXXXXXXPMQSRGSWFPLEEDRIPRKPNRAVPKSVPKES 2040 G D+RD+ SE PF +QS GSW P+EE+ PR+ NR P+E Sbjct: 599 GHDSRDNAPSESPF---PARPSTQVSAPRVQSVGSWVPVEEEMSPRQLNR-----TPREF 650 Query: 2039 PLESETMHFDKRRARHPSFFHGAESSFPSDRMLNENQRFPKEVHQGDDRLRVNHSTSNYR 1860 PL+S+ M+ +K R HPSFFH ES+ PSDRM++ENQR PKE DDR+++NHSTSNY Sbjct: 651 PLDSDPMNIEKHRTHHPSFFHKVESNIPSDRMIHENQRQPKEATYRDDRMKLNHSTSNYP 710 Query: 1859 SFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIGIKCGTKVEFRSALIASAE 1680 SF GEE+PL +S SS+RD ES R TETP VLQEI +KCGTKVEFR ALIA+++ Sbjct: 711 SFQGEESPLSRS-SSNRDLDLESERAFSS-TETPVEVLQEIAMKCGTKVEFRPALIATSD 768 Query: 1679 LQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYLSNVMHDPSSVRGDLNKPS 1500 LQFSIE WF GEK+G GTGKTR+EAQRQAA+ S++ LA Y+S V D + GD ++ Sbjct: 769 LQFSIETWFVGEKVGEGTGKTRREAQRQAAEGSIKKLAGIYMSRVKPDSGPMLGDSSRYP 828 Query: 1499 HVNEMG-LGDSNSFGYQTLPKEDXXXXXXXSEPTRFLDPRLEESKKSMGSVSALKELCMM 1323 N+ G LGD NSFG Q L K++ SEP+R LD RLE SKKSMGSV+ALKE CM Sbjct: 829 SANDNGFLGDMNSFGNQPLLKDENITYSATSEPSRLLDQRLEGSKKSMGSVTALKEFCMT 888 Query: 1322 EGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGKGVGATWNEAKMQAAEEALWNLKSM 1143 EGL + FLAQ PLSTN + EV+AQVEI GQ+LGKG+G TW+EAKMQAAE+AL +L++M Sbjct: 889 EGLGVNFLAQTPLSTNSIPGEEVHAQVEIDGQVLGKGIGLTWDEAKMQAAEKALGSLRTM 948 Query: 1142 LGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTRYSNNAPPVP 999 G R GSPRL+ +KRLK EFPR+LQR+ SS RY NA PVP Sbjct: 949 FGQYTPKRQGSPRLMQGMPNKRLKQEFPRVLQRMPSSARYHKNASPVP 996 >ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like [Citrus sinensis] Length = 957 Score = 1227 bits (3175), Expect = 0.0 Identities = 655/972 (67%), Positives = 747/972 (76%), Gaps = 20/972 (2%) Frame = -3 Query: 3854 MFKSVVYQGNSLLGEVEIYPQNQ--------KICMIDKEIRISHFSQPSERCPPLAVLHT 3699 M+K+V Y G +LGEVEIYPQ Q K + EIRIS+FS+ SERCPPLAVLHT Sbjct: 1 MYKTVAYLGKEILGEVEIYPQQQGEGGEGEEKNKKVFDEIRISYFSEASERCPPLAVLHT 60 Query: 3698 IAPSGVCFKMESKLQSKDSPLFSLHSSCLREDKTAVMPLG-EEELHLVAMSSRKNLEQYS 3522 I SG+CFKMESK S + L LHSSC+RE+KTAVMPLG EELHLVAM SR N +QY Sbjct: 61 ITASGICFKMESK-SSDNIQLHLLHSSCIRENKTAVMPLGLTEELHLVAMYSRNNEKQYP 119 Query: 3521 CFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDP 3342 CFW F V GLYNSCL MLNLRCLGIVFDLDETLIVANTMRSFEDRI+AL RKI+TE+DP Sbjct: 120 CFWAFSVGSGLYNSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKISTEVDP 179 Query: 3341 QRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRL 3162 QR++GM AE+KRYQDDK+ILKQY ENDQV ENGKVIKVQSEVVPA SD++Q +VRPLIRL Sbjct: 180 QRIAGMQAEVKRYQDDKNILKQYAENDQVNENGKVIKVQSEVVPALSDSHQALVRPLIRL 239 Query: 3161 QEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWR 2982 QEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWR Sbjct: 240 QEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWR 299 Query: 2981 LLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSR 2802 LLDPESNLIN+KELL R+VCVK+GSRKSL NVF DG CHPKMALVIDDRLKVWD+KDQ R Sbjct: 300 LLDPESNLINTKELLDRIVCVKSGSRKSLFNVFQDGTCHPKMALVIDDRLKVWDDKDQPR 359 Query: 2801 VHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLA 2622 VHVVPAFAPYYAPQAEANN +PVLCVARN+ACNVRGGFFKEFDEGLLQ I E+ YED + Sbjct: 360 VHVVPAFAPYYAPQAEANNAIPVLCVARNIACNVRGGFFKEFDEGLLQRIPEISYEDDVK 419 Query: 2621 NFPSAPDVSNFLISEDDVSASN--KDPLGFEGVTDVE-ERRLKDVIQASSMANS----LD 2463 + PS PDVSN+L+SEDD + +N KDPL F+G+ D E ERRLK+ I AS+ +S LD Sbjct: 420 DIPSPPDVSNYLVSEDDAATANGIKDPLSFDGMADAEVERRLKEAIAASATISSAVANLD 479 Query: 2462 PRPAPSLQNAVASSLGTISQPTPQGPIMSSQNNQF---TQVVKPLVHVGPSEPSLQGSPA 2292 PR AP Q + SS T + PT Q +M N QF T +VKPL HVGP E SLQ SPA Sbjct: 480 PRLAP-FQYTMPSSSSTTTLPTSQAAVMPLANMQFPPATSLVKPLGHVGPPEQSLQSSPA 538 Query: 2291 REEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXPMQSRGSW 2112 REEGEVPESELDPDTRRRLLILQHG DTR++ SE PF + SRGSW Sbjct: 539 REEGEVPESELDPDTRRRLLILQHGMDTRENAPSEAPF---PARTQMQVSVPRVPSRGSW 595 Query: 2111 FPLEEDRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNEN 1932 FP+EE+ PR+ NRA VPKE PL SE M +K R HPSFF E+ SDR +EN Sbjct: 596 FPVEEEMSPRQLNRA----VPKEFPLNSEAMQIEKHRPPHPSFFPKIENPSTSDRP-HEN 650 Query: 1931 QRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAG 1752 QR PKE + DDRLR+NH+ S+Y+SF GEE PL +S SS RD FESGR TETP+G Sbjct: 651 QRMPKEALRRDDRLRLNHTLSDYQSFSGEEIPLSRSSSSSRDVDFESGR-DVSSTETPSG 709 Query: 1751 VLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRN 1572 VLQ+I +KCGTKVEFR AL+AS ELQFSIE WF GEKIG G G+TR+EAQRQAA+ S+++ Sbjct: 710 VLQDIAMKCGTKVEFRPALVASTELQFSIEAWFAGEKIGEGIGRTRREAQRQAAEGSIKH 769 Query: 1571 LANKYLSNVMHDPSSVRGDLNKPSHVNE-MGLGDSNSFGYQTLPKEDXXXXXXXSEPTRF 1395 LAN Y+ V D S GD ++ S+ NE +G+ NSFG Q L K++ SEP++ Sbjct: 770 LANVYMLRVKSDSGSGHGDGSRFSNANENCFMGEINSFGGQPLAKDE----SLSSEPSKL 825 Query: 1394 LDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGK 1215 +DPRLE SKK MGSVSALKELCM EGL +VF QPP S N + K EVYAQVEI GQ+LGK Sbjct: 826 VDPRLEGSKKLMGSVSALKELCMTEGLGVVFQQQPPSSANSVQKDEVYAQVEIDGQVLGK 885 Query: 1214 GVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINS 1035 G+G+TW+EAKMQAAE+AL +L+SM G Q GSPR L +KRLKPEFPR+LQR+ Sbjct: 886 GIGSTWDEAKMQAAEKALGSLRSMFGQFPQKHQGSPRSLQGMPNKRLKPEFPRVLQRMPP 945 Query: 1034 STRYSNNAPPVP 999 S RY NAPPVP Sbjct: 946 SGRYPKNAPPVP 957 >ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citrus clementina] gi|557551913|gb|ESR62542.1| hypothetical protein CICLE_v10014168mg [Citrus clementina] Length = 957 Score = 1226 bits (3173), Expect = 0.0 Identities = 656/972 (67%), Positives = 746/972 (76%), Gaps = 20/972 (2%) Frame = -3 Query: 3854 MFKSVVYQGNSLLGEVEIYPQNQ--------KICMIDKEIRISHFSQPSERCPPLAVLHT 3699 M+K+V Y G +LGEVEIYPQ Q K + EIRIS+FS+ SERCPPLAVLHT Sbjct: 1 MYKTVAYLGKEILGEVEIYPQQQGEGGEGEEKNKKVFDEIRISYFSEASERCPPLAVLHT 60 Query: 3698 IAPSGVCFKMESKLQSKDSPLFSLHSSCLREDKTAVMPLG-EEELHLVAMSSRKNLEQYS 3522 I SG+CFKMESK S + L LHSSC+RE+KTAVM LG EELHLVAM SR N +QY Sbjct: 61 ITASGICFKMESK-SSDNVQLHLLHSSCIRENKTAVMLLGLTEELHLVAMYSRNNEKQYP 119 Query: 3521 CFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDP 3342 CFW F V GLYNSCL MLNLRCLGIVFDLDETLIVANTMRSFEDRI+AL RKI+TE+DP Sbjct: 120 CFWAFSVGSGLYNSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKISTEVDP 179 Query: 3341 QRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRL 3162 QR++GM AE+KRYQDDK+ILKQY ENDQV ENGKVIKVQSEVVPA SD++Q +VRPLIRL Sbjct: 180 QRIAGMQAEVKRYQDDKNILKQYAENDQVNENGKVIKVQSEVVPALSDSHQALVRPLIRL 239 Query: 3161 QEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWR 2982 QEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWR Sbjct: 240 QEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWR 299 Query: 2981 LLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSR 2802 LLDPESNLIN+KELL R+VCVK+GSRKSL NVF DG CHPKMALVIDDRLKVWDEKDQSR Sbjct: 300 LLDPESNLINTKELLDRIVCVKSGSRKSLFNVFQDGTCHPKMALVIDDRLKVWDEKDQSR 359 Query: 2801 VHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLA 2622 VHVVPAFAPYYAPQAEANN +PVLCVARN+ACNVRGGFFKEFDEGLLQ I E+ YED + Sbjct: 360 VHVVPAFAPYYAPQAEANNAIPVLCVARNIACNVRGGFFKEFDEGLLQRIPEISYEDDVK 419 Query: 2621 NFPSAPDVSNFLISEDDVSASN--KDPLGFEGVTDVE-ERRLKDVIQASSMANS----LD 2463 PS PDVSN+L+SEDD + +N KDPL F+G+ D E ERRLK+ I AS+ +S LD Sbjct: 420 EIPSPPDVSNYLVSEDDAATANGIKDPLSFDGMADAEVERRLKEAIAASATISSAVANLD 479 Query: 2462 PRPAPSLQNAVASSLGTISQPTPQGPIMSSQNNQF---TQVVKPLVHVGPSEPSLQGSPA 2292 PR AP Q + SS T + PT Q +M N QF T +VKPL HVGP E LQ SPA Sbjct: 480 PRLAP-FQYTMPSSSSTTTLPTSQAAVMPLANMQFPPATSLVKPLGHVGPPEQCLQSSPA 538 Query: 2291 REEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXPMQSRGSW 2112 REEGEVPESELDPDTRRRLLILQHG DTR++ SE PF + SRGSW Sbjct: 539 REEGEVPESELDPDTRRRLLILQHGMDTRENAPSEAPF---PARTQMQVSVPRVPSRGSW 595 Query: 2111 FPLEEDRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNEN 1932 FP+EE+ PR+ NRA VPKE PL SE M +K R HPSFF E+S SDR +EN Sbjct: 596 FPVEEEMSPRQLNRA----VPKEFPLNSEAMQIEKHRPPHPSFFPKIENSITSDRP-HEN 650 Query: 1931 QRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAG 1752 QR PKE + DDRLR+NH+ S+Y+SF GEE PL +S SS RD FESGR TETP+G Sbjct: 651 QRMPKEALRRDDRLRLNHTLSDYQSFSGEEIPLSRSSSSSRDVDFESGR-DVSSTETPSG 709 Query: 1751 VLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRN 1572 VLQ+I +KCGTKVEFR AL+AS ELQFSIE WF GEKIG G G+TR+EAQRQAA+ S+++ Sbjct: 710 VLQDIAMKCGTKVEFRPALVASTELQFSIEAWFAGEKIGEGIGRTRREAQRQAAEGSIKH 769 Query: 1571 LANKYLSNVMHDPSSVRGDLNKPSHVNE-MGLGDSNSFGYQTLPKEDXXXXXXXSEPTRF 1395 LAN Y+ V D S GD ++ S+ NE +G+ NSFG Q L K++ SEP++ Sbjct: 770 LANVYVLRVKSDSGSGHGDGSRFSNANENCFMGEINSFGGQPLAKDE----SLSSEPSKL 825 Query: 1394 LDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGK 1215 +DPRLE SKK MGSVSALKELCM EGL +VF QPP S N + K EVYAQVEI GQ+LGK Sbjct: 826 VDPRLEGSKKLMGSVSALKELCMTEGLGVVFQQQPPSSANSVQKDEVYAQVEIDGQVLGK 885 Query: 1214 GVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINS 1035 G+G+TW+EAKMQAAE+AL +L+SM G Q GSPR L +KRLKPEFPR+LQR+ Sbjct: 886 GIGSTWDEAKMQAAEKALGSLRSMFGQFPQKHQGSPRSLQGMPNKRLKPEFPRVLQRMPP 945 Query: 1034 STRYSNNAPPVP 999 S RY NAPPVP Sbjct: 946 SGRYPKNAPPVP 957 >ref|XP_007214548.1| hypothetical protein PRUPE_ppa000988mg [Prunus persica] gi|462410413|gb|EMJ15747.1| hypothetical protein PRUPE_ppa000988mg [Prunus persica] Length = 940 Score = 1215 bits (3143), Expect = 0.0 Identities = 652/971 (67%), Positives = 743/971 (76%), Gaps = 19/971 (1%) Frame = -3 Query: 3854 MFKSVVYQGNSLLGEVEIYPQ-----NQKICMID--KEIRISHFSQPSERCPPLAVLHTI 3696 M+KSVVY+G LLGEVEIYP+ N+ ++D KEIRIS+FSQ SERCPP+AVLHTI Sbjct: 1 MYKSVVYKGEELLGEVEIYPEENENKNKNKNLVDELKEIRISYFSQSSERCPPVAVLHTI 60 Query: 3695 APSGVCFKMESKL-QSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSC 3519 + GVCFKMESK QS+D+PLF LHSSC+ E+KTAVMPLG EELHLVAM SR ++Y C Sbjct: 61 SSHGVCFKMESKTSQSQDTPLFLLHSSCVMENKTAVMPLGGEELHLVAMRSRNGDKRYPC 120 Query: 3518 FWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQ 3339 FWGF V+PGLYNSCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRI+ALQRKI++E+DPQ Sbjct: 121 FWGFSVAPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKISSEVDPQ 180 Query: 3338 RVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQ 3159 R+SGMLAEIKRYQDDK ILKQY ENDQVVENG+VIK QSE VPA SDN+QPI+RPLIRL Sbjct: 181 RISGMLAEIKRYQDDKFILKQYAENDQVVENGRVIKTQSEAVPALSDNHQPIIRPLIRLH 240 Query: 3158 EKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRL 2979 EKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRL Sbjct: 241 EKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRL 300 Query: 2978 LDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRV 2799 LDP+SNLINS +LL R+VCVK+GSRKSL NVF + +CHPKMALVIDDRLKVWD++DQ RV Sbjct: 301 LDPDSNLINSNKLLDRIVCVKSGSRKSLFNVFQESLCHPKMALVIDDRLKVWDDRDQPRV 360 Query: 2798 HVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLAN 2619 HVVPAFAPYYAPQAEANN VPVLCVARNVACNVRGGFF+EFD+ LLQ I EVFYED + + Sbjct: 361 HVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFREFDDSLLQKIPEVFYEDDIKD 420 Query: 2618 FPSAPDVSNFLISEDDVSA--SNKDPLGFEGVTDVE-ERRLKDVIQASSMAN----SLDP 2460 PS PDVSN+L+SEDD SA N+DPL F+G+TDVE ERR+K+ A+SM + S+DP Sbjct: 421 VPS-PDVSNYLVSEDDSSALNGNRDPLPFDGITDVEVERRMKEATPAASMVSSVFTSIDP 479 Query: 2459 RPAPSLQNAVASSLGTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAR 2289 R AP LQ V S T+S PT Q +MS + QF Q +VKPL HVG +EPSLQ SPAR Sbjct: 480 RLAP-LQYTVPPS-STLSLPTTQPSVMSFPSIQFPQAASLVKPLGHVGSAEPSLQSSPAR 537 Query: 2288 EEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXPMQSRGSWF 2109 EEGEVPESELDPDTRRRLLILQHGQDTRD SEPPF QSR WF Sbjct: 538 EEGEVPESELDPDTRRRLLILQHGQDTRDQPPSEPPF---PVRPPMQASVPRAQSRPGWF 594 Query: 2108 PLEEDRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNENQ 1929 P+EE+ PR+ +R VPK + PL+ ET+ +K R H SFF E+S PSDR+L ENQ Sbjct: 595 PVEEEMSPRQLSRMVPKDL----PLDPETVQIEKHRPHHSSFFPKVENSIPSDRILQENQ 650 Query: 1928 RFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGV 1749 R PKE DDRLR NH+ S Y S GEE PL +S SS+RD FESGR + ETPAGV Sbjct: 651 RLPKEAFHRDDRLRFNHALSGYHSLSGEEIPLSRSSSSNRDVDFESGR-AISNAETPAGV 709 Query: 1748 LQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNL 1569 LQEI +KCG K WF GEKIG G+GKTR+EA QAA+ SL+NL Sbjct: 710 LQEIAMKCGAK------------------AWFAGEKIGEGSGKTRREAHYQAAEGSLKNL 751 Query: 1568 ANKYLSNVMHDPSSVRGDLNKPSHVNEMGL-GDSNSFGYQTLPKEDXXXXXXXSEPTRFL 1392 AN YLS V D SV GD+NK +VN G G+ NSFG Q PKE+ SEP+R L Sbjct: 752 ANIYLSRVKPDSVSVHGDMNKFPNVNSNGFAGNLNSFGIQPFPKEESLSSSTSSEPSRPL 811 Query: 1391 DPRLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGKG 1212 DPRLE SKKSM SVS LKELCMMEGL +VF +PP STN + K EV+ QVEI G++LGKG Sbjct: 812 DPRLEGSKKSMSSVSTLKELCMMEGLGVVFQPRPPPSTNSVEKDEVHVQVEIDGEVLGKG 871 Query: 1211 VGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSS 1032 +G TW+EAKMQAAE+AL +L S L Q R GSPR L SSKR+K EFP++LQR+ SS Sbjct: 872 IGLTWDEAKMQAAEKALGSLTSTL--YAQKRQGSPRSLQGMSSKRMKQEFPQVLQRMPSS 929 Query: 1031 TRYSNNAPPVP 999 RY NAPPVP Sbjct: 930 ARYPKNAPPVP 940 >ref|XP_007025681.1| C-terminal domain phosphatase-like 1 isoform 2 [Theobroma cacao] gi|508781047|gb|EOY28303.1| C-terminal domain phosphatase-like 1 isoform 2 [Theobroma cacao] Length = 984 Score = 1198 bits (3099), Expect = 0.0 Identities = 642/951 (67%), Positives = 724/951 (76%), Gaps = 34/951 (3%) Frame = -3 Query: 3854 MFKSVVYQGNSLLGEVEIYPQNQ--------------KICMID---KEIRISHFSQPSER 3726 M+KSVVY+G +LGEVEIYPQ Q KI +++ KEIRI + +Q SER Sbjct: 4 MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63 Query: 3725 CPPLAVLHTIAPSGVCFKMESKLQ-----SKDSP-LFSLHSSCLREDKTAVMPLGEEELH 3564 CPPLAVLHTI SG+CFKMES S+DSP L LHS C+R++KTAVMP+G+ ELH Sbjct: 64 CPPLAVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCELH 123 Query: 3563 LVAMSSRKNLEQYSCFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDR 3384 LVAM SR + CFWGF VS GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDR Sbjct: 124 LVAMYSRNS--DRPCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSFEDR 181 Query: 3383 IDALQRKINTEMDPQRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAF 3204 I+ALQRK+ TE+DPQRV+GM+AE+KRYQDDK+ILKQY ENDQVVENGKVIK+QSEVVPA Sbjct: 182 IEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVVPAL 241 Query: 3203 SDNYQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYV 3024 SDN+QPI+RPLIRLQEKNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYV Sbjct: 242 SDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 301 Query: 3023 CTMAERDYALEMWRLLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVI 2844 CTMAERDYALEMWRLLDPESNLINSKELL R+VCVK+GSRKSL NVF DGICHPKMALVI Sbjct: 302 CTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVI 361 Query: 2843 DDRLKVWDEKDQSRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGL 2664 DDRLKVWDEKDQ RVHVVPAFAPYYAPQAEANN +PVLCVARNVACNVRGGFF+EFDEGL Sbjct: 362 DDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGL 421 Query: 2663 LQLIREVFYEDGLANFPSAPDVSNFLISEDDVSA--SNKDPLGFEGVTDVE-ERRLKDVI 2493 LQ I E+ YED + + PS PDV N+L+SEDD SA NKDPL F+G+ D E ERRLK+ I Sbjct: 422 LQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERRLKEAI 481 Query: 2492 QA----SSMANSLDPRPAPSLQNAVASSLGTISQPTPQGPIMSSQNNQF---TQVVKPLV 2334 A SS A +LDPR PSLQ + SS +I Q I+S N QF VVKP+ Sbjct: 482 SATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPVVKPVA 541 Query: 2333 HVGPSEPSLQGSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXX 2154 V EPSLQ SPAREEGEVPESELDPDTRRRLLILQHGQDTRDHT EP F Sbjct: 542 PVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPVRPTMQ 601 Query: 2153 XXXXXXPMQSRGSWFPLEEDRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHG 1974 QSRGSWF EE+ PR+ NRA PKE PL+SE MH +K RHP FF Sbjct: 602 VSVPRG--QSRGSWFAAEEEMSPRQLNRA----APKEFPLDSERMHIEKH--RHPPFFPK 653 Query: 1973 AESSFPSDRMLNENQRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFE 1794 ESS PSDR+L ENQR KE DDRL +NH+ S+Y SF GEE PL QS SS RD FE Sbjct: 654 VESSIPSDRLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFE 713 Query: 1793 SGRGSPPYTETPAGVLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTR 1614 SGR + ET AGVLQ+I +KCG KVEFR AL+AS +LQFSIE WF GEK+G G G+TR Sbjct: 714 SGR-TVTSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFAGEKVGEGVGRTR 772 Query: 1613 KEAQRQAADSSLRNLANKYLSNVMHDPSSVRGDLNKPSHVNEMGL-GDSNSFGYQTLPKE 1437 +EAQRQAA+ S++NLAN YLS + D S GDL++ ++N+ G + NSFG Q L KE Sbjct: 773 REAQRQAAEESIKNLANTYLSRIKPDSGSAEGDLSRLHNINDNGFPSNVNSFGNQLLAKE 832 Query: 1436 DXXXXXXXSEPTRFLDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVE 1257 + SE +R DPRLE SKKSMGSV+ALKELCMMEGL +VF QPP S+N + K E Sbjct: 833 ESLSFSTASEQSRLADPRLEGSKKSMGSVTALKELCMMEGLGVVFQPQPPSSSNALQKDE 892 Query: 1256 VYAQVEIGGQILGKGVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPR 1104 VYAQVEI GQ+LGKG G TW EAKMQAAE+AL +L+SMLG Q R GSPR Sbjct: 893 VYAQVEIDGQVLGKGTGLTWEEAKMQAAEKALGSLRSMLGQYSQKRQGSPR 943 >ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like [Fragaria vesca subsp. vesca] Length = 955 Score = 1196 bits (3093), Expect = 0.0 Identities = 643/966 (66%), Positives = 739/966 (76%), Gaps = 18/966 (1%) Frame = -3 Query: 3842 VVYQGNSLLGEVEIYPQ---NQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGVCFK 3672 +VY+G LLGEVE+YP+ N+KI KEIRISHFSQ SERCPP+AVLHTI+ +GVCFK Sbjct: 4 LVYKGEELLGEVEVYPEELNNKKIWDELKEIRISHFSQSSERCPPVAVLHTISSNGVCFK 63 Query: 3671 MESKLQSKDSP----LFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFI 3504 MESK S S LF LHSSC+ E+KTAVM LG EELHLVAM SR N +Q+ CFWGF Sbjct: 64 MESKSSSSSSQDTSRLFLLHSSCIMENKTAVMNLGVEELHLVAMYSRNNQKQHPCFWGFS 123 Query: 3503 VSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGM 3324 VS GLY+SCL MLNLRCLGIVFDLDETLIVANTMRSFEDRI+ LQRKI E+D QR+SGM Sbjct: 124 VSSGLYSSCLGMLNLRCLGIVFDLDETLIVANTMRSFEDRIEGLQRKIQCEVDAQRISGM 183 Query: 3323 LAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNII 3144 AEIKRYQDDK ILKQY ENDQVVENG+VIK QSEVVPA SD++QPI+RPLIRLQEKNII Sbjct: 184 QAEIKRYQDDKFILKQYAENDQVVENGRVIKTQSEVVPALSDSHQPIIRPLIRLQEKNII 243 Query: 3143 LTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPES 2964 LTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDPES Sbjct: 244 LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPES 303 Query: 2963 NLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPA 2784 NLIN+ +LL R+VCVK+G +KSL NVF + +CHPKMALVIDDRLKVWD++DQ RVHVVPA Sbjct: 304 NLINANKLLDRIVCVKSGLKKSLFNVFQESLCHPKMALVIDDRLKVWDDRDQPRVHVVPA 363 Query: 2783 FAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAP 2604 FAPYYAPQAEANN VPVLCVARNVAC+VRGGFF+EFD+ LLQ I E+FYED + +F S+P Sbjct: 364 FAPYYAPQAEANNAVPVLCVARNVACSVRGGFFREFDDSLLQKIPEIFYEDNIKDF-SSP 422 Query: 2603 DVSNFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQA----SSMANSLDPRPAPS 2445 DVSNFL+SEDD SAS N+D L F+G+ D E ERRLK+ A SS ++ DPR A S Sbjct: 423 DVSNFLVSEDDASASNGNRDQLPFDGMADAEVERRLKEATSAAPTVSSAVSNNDPRLA-S 481 Query: 2444 LQNAVASSLGTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEGEV 2274 LQ V S T+S PT Q +M N QF Q +VKPL HVGP++ L SPAREEGEV Sbjct: 482 LQYTVPLS-STVSLPTNQPSMMPFHNVQFPQSASLVKPLGHVGPADLGLHSSPAREEGEV 540 Query: 2273 PESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXPMQSRGSWFPLEED 2094 PESELDPDTRRRLLILQHGQDTR+ SEP F +QSRG WFP+EE+ Sbjct: 541 PESELDPDTRRRLLILQHGQDTRESVPSEPSF---PVRPQVQVSVPRVQSRGGWFPVEEE 597 Query: 2093 RIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNENQRFPKE 1914 PRK +R VPKE PL SE M +K R+ H +FF E+S PSDR+L ENQR PKE Sbjct: 598 MSPRKLSR----MVPKEPPLNSEPMQIEKHRSHHSAFFPKVENSMPSDRILQENQRLPKE 653 Query: 1913 VHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIG 1734 D+RLR N + S Y SF GEE PL +S SS+RDF +ESGR + ETPAGVLQEI Sbjct: 654 AFHRDNRLRFNQAMSGYHSFSGEEPPLNRSSSSNRDFDYESGR-AISNAETPAGVLQEIA 712 Query: 1733 IKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYL 1554 +KCGTKVEFR AL+ S ELQF +E WF GEKIG GTG+TR+EA QAA+ SL+NLAN Y+ Sbjct: 713 MKCGTKVEFRPALVPSTELQFYVEAWFAGEKIGEGTGRTRREAHFQAAEGSLKNLANIYI 772 Query: 1553 SNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXSEPTRFLDPRLE 1377 S D + GD +K S+V G +G+ NSFG Q LPKED SEP+R LDPRL+ Sbjct: 773 SRGKPDALPIHGDASKFSNVTNNGFMGNMNSFGTQPLPKEDSLSSSTSSEPSRPLDPRLD 832 Query: 1376 ESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGKGVGATW 1197 S+KS+ SVSALKELC MEGLS+++ +PP N K EV+ Q EI G++LGKG+G TW Sbjct: 833 NSRKSVSSVSALKELCTMEGLSVLYQPRPP-PPNSTEKDEVHVQAEIDGEVLGKGIGLTW 891 Query: 1196 NEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTRYSN 1017 +EAKMQAAE+AL NL+S L GQ R GSPR L SKRLK EFP++LQR+ SSTRYS Sbjct: 892 DEAKMQAAEKALGNLRSTL--YGQKRQGSPRPLQGMPSKRLKQEFPQVLQRMPSSTRYSK 949 Query: 1016 NAPPVP 999 NAPPVP Sbjct: 950 NAPPVP 955 >ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Populus trichocarpa] gi|550327613|gb|ERP55122.1| hypothetical protein POPTR_0011s04910g [Populus trichocarpa] Length = 990 Score = 1192 bits (3084), Expect = 0.0 Identities = 646/1000 (64%), Positives = 742/1000 (74%), Gaps = 48/1000 (4%) Frame = -3 Query: 3854 MFKSVVYQGNSLLGEVEIYPQNQK-------------ICMIDKEIRISHFSQPSERCPPL 3714 M+KSVVY+G LLGEVEIY Q Q+ I I K IRISHFSQ SERCPPL Sbjct: 1 MYKSVVYKGEELLGEVEIYAQEQQQEEEENKNKRKRVIDEIVKGIRISHFSQASERCPPL 60 Query: 3713 AVLHTIAPSGVCFKMESKLQS--------KDSPLFSLHSSCLREDKTAVMPLGEEELHLV 3558 AVLHTI GVCFKME S ++SPL LHSSC++E+KTAVM LG EELHLV Sbjct: 61 AVLHTITSIGVCFKMEESTASSSTKISSQQESPLRLLHSSCIQENKTAVMLLGGEELHLV 120 Query: 3557 AMSSRKNLEQYSCFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRID 3378 AM SR N ++ CFWGF V+ GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFED+I+ Sbjct: 121 AMPSRSNERKHPCFWGFNVASGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDKIE 180 Query: 3377 ALQRKINTEMDPQRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSD 3198 ALQ+KI+TE+D QR+ +++EIKRYQDDK ILKQYVENDQV+ENGKVIK Q EVVPA SD Sbjct: 181 ALQKKISTEVDQQRILAIISEIKRYQDDKIILKQYVENDQVIENGKVIKTQFEVVPAASD 240 Query: 3197 NYQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCT 3018 N+QP+VRPLIRL EKNII TRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCT Sbjct: 241 NHQPLVRPLIRLPEKNIIFTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT 300 Query: 3017 MAERDYALEMWRLLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDD 2838 MAERDYALEMWRLLDPESNLINS ELL R+VCV +GSRKSL NVF DGICHPKMALVIDD Sbjct: 301 MAERDYALEMWRLLDPESNLINSNELLDRIVCVSSGSRKSLFNVFQDGICHPKMALVIDD 360 Query: 2837 RLKVWDEKDQSRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQ 2658 R+ VWDEKDQSRVHVVPAFAPYYAPQAEANN VP+LCVARNVACNVRGGFFKEFDEGLLQ Sbjct: 361 RMNVWDEKDQSRVHVVPAFAPYYAPQAEANNAVPILCVARNVACNVRGGFFKEFDEGLLQ 420 Query: 2657 LIREVFYEDGLANFPSAPDVSNFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQA 2487 I EV YED +N PS PDVSN+L+SEDD SA+ N+DP F+ D E ERRLK+ + A Sbjct: 421 KIPEVAYEDDTSNIPSPPDVSNYLVSEDDASAANGNRDPPSFDSTADAEVERRLKEAVSA 480 Query: 2486 S--------SMANSLDPRPAPSLQNAVA--SSLGTISQPT---PQGPIMSSQ-------N 2367 S S +SLDPR SLQ AVA SSL SQP+ Q P+ +SQ N Sbjct: 481 SSTIPSTIPSTVSSLDPRLLQSLQYAVASSSSLMPASQPSMLASQQPVPASQTSMMPFPN 540 Query: 2366 NQFTQV---VKPLVHVGPSEPSLQGSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHT 2196 QF QV VK L V EPSLQ SPAREEGEVPESELDPDTRRRLLILQHGQD+RD+ Sbjct: 541 TQFPQVAPLVKQLGQVVHPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDSRDNA 600 Query: 2195 SSEPPFXXXXXXXXXXXXXXPMQSRGSWFPLEEDRIPRKPNRAVPKSVPKESPLESETMH 2016 SE PF +QSRGSW P+EE+ PR+ NR P+E PL+S+ M+ Sbjct: 601 PSESPF---PARPSAPVSAAHVQSRGSWVPVEEEMTPRQLNR-----TPREFPLDSDPMN 652 Query: 2015 FDKRRARHPSFFHGAESSFPSDRMLNENQRFPKEVHQGDDRLRVNHSTSNYRSFPGEETP 1836 +K + HPSFF ES+ PSDRM++ENQR PKE +DR+R+NHST NY SF EETP Sbjct: 653 IEKHQTHHPSFFPKVESNIPSDRMIHENQRLPKEAPYRNDRMRLNHSTPNYHSFQVEETP 712 Query: 1835 LGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIGIKCGTKVEFRSALIASAELQFSIEVW 1656 L +S SS+RD ES R +ETP VLQEI +KC TKVEFR AL+AS +LQFSIE W Sbjct: 713 LSRS-SSNRDLDLESERAF-TISETPVEVLQEIAMKCETKVEFRPALVASIDLQFSIEAW 770 Query: 1655 FTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYLSNVMHDPSSVRGDLNKPSHVNEMG-L 1479 F GEK+G GTGKTR+EAQRQAA+ S++ LA Y+ D + GD ++ N+ G L Sbjct: 771 FAGEKVGEGTGKTRREAQRQAAEGSIKKLAGIYMLRAKPDSGPMHGDSSRYPSANDNGFL 830 Query: 1478 GDSNSFGYQTLPKEDXXXXXXXSEPTRFLDPRLEESKKSMGSVSALKELCMMEGLSLVFL 1299 G+ N FG Q LPK++ SEP+R LDPRLE SKKS GSV+ALKE C MEGL + FL Sbjct: 831 GNMNLFGNQPLPKDELVAYSAASEPSRLLDPRLEGSKKSSGSVTALKEFCTMEGLVVNFL 890 Query: 1298 AQPPLSTNLMNKVEVYAQVEIGGQILGKGVGATWNEAKMQAAEEALWNLKSMLGDSGQNR 1119 AQ PLS N + EV+AQVEI GQ+LGKG+G+TW+EAKMQAAE+AL +L++M G Q R Sbjct: 891 AQTPLSANSIPGEEVHAQVEIDGQVLGKGIGSTWDEAKMQAAEKALGSLRTMFGQYTQKR 950 Query: 1118 VGSPRLLHSFSSKRLKPEFPRMLQRINSSTRYSNNAPPVP 999 GSPR + +KRLK EFPR+LQR+ S RY NAPPVP Sbjct: 951 QGSPRPMQGMPNKRLKQEFPRVLQRMPPSARYHKNAPPVP 990 >ref|XP_002519032.1| double-stranded RNA binding protein, putative [Ricinus communis] gi|223541695|gb|EEF43243.1| double-stranded RNA binding protein, putative [Ricinus communis] Length = 978 Score = 1184 bits (3062), Expect = 0.0 Identities = 633/985 (64%), Positives = 739/985 (75%), Gaps = 33/985 (3%) Frame = -3 Query: 3854 MFKSVVYQGNSLLGEVEIYPQNQK--------------------ICMIDKEIRISHFSQP 3735 M+KSVVY+G+ LLGEVEIY Q ++ I I K IRISHFSQ Sbjct: 1 MYKSVVYKGDELLGEVEIYAQQEQKLQQQEELQEQEQELKKKRVIDEILKGIRISHFSQA 60 Query: 3734 SERCPPLAVLHTIAPSGVCFKMESKLQ-SKDSPLFSLHSSCLREDKTAVMPL-GEEELHL 3561 SERCPPLAVLHTI +G+CFKMESK S D+PL LHSSC++E KTAV+ L G EELHL Sbjct: 61 SERCPPLAVLHTITTNGICFKMESKNSVSLDTPLHLLHSSCIQESKTAVVLLQGGEELHL 120 Query: 3560 VAMSSRKNLEQYSCFWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRI 3381 VAM SR + QY CFW F +S GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRI Sbjct: 121 VAMFSRNDERQYPCFWAFNISSGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRI 180 Query: 3380 DALQRKINTEMDPQRVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFS 3201 +ALQRKI+TE+DPQR+SGML+E+KRYQDDK+ILKQYV+NDQVVENG+VIK Q EVVPA S Sbjct: 181 EALQRKISTELDPQRISGMLSEVKRYQDDKTILKQYVDNDQVVENGRVIKTQFEVVPALS 240 Query: 3200 DNYQPIVRPLIRLQEKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVC 3021 DN+Q IVRPLIRLQE+NIILTRINP IRDTSVLVRLRPAWE+LRSYLTA+GRKRFEVYVC Sbjct: 241 DNHQTIVRPLIRLQERNIILTRINPQIRDTSVLVRLRPAWEELRSYLTARGRKRFEVYVC 300 Query: 3020 TMAERDYALEMWRLLDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVID 2841 TMAERDYALEMWRLLDPESNLINSKELL R+VCVK+G RKSL NVF DGICHPKMALVID Sbjct: 301 TMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLRKSLFNVFQDGICHPKMALVID 360 Query: 2840 DRLKVWDEKDQSRVHVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLL 2661 DRLKVWDEKDQ RVHVVPAFAPYYAPQAEANN VPVLCVARNVACNVRGGFFKEFDEGLL Sbjct: 361 DRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDEGLL 420 Query: 2660 QLIREVFYEDGLANFPSAPDVSNFLISEDD--VSASNKDPLGFEGVTDVE-ERRLKDVIQ 2490 Q I E+ +ED + + PS PDVSN+L+ EDD S N+DPL F+G+ D E E+RLK+ I Sbjct: 421 QRIPEISFEDDMNDIPSPPDVSNYLVPEDDAFTSNGNRDPLSFDGMADAEVEKRLKEAIS 480 Query: 2489 ASSMANS----LDPRPAPSLQNAVASSLGTISQPTPQGPIMSSQNNQFTQ---VVKPLVH 2331 SS S LD R P LQ +ASS +I PT Q +++ + Q Q +VKPL Sbjct: 481 ISSAFPSTVANLDARLVPPLQYTMASS-SSIPVPTSQPAVVTFPSMQLPQAAPLVKPLGQ 539 Query: 2330 VGPSEPSLQGSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXX 2151 V PSEPSLQ SPAREEGEVPESELDPDTRRRLLILQHGQD RD SE PF Sbjct: 540 VVPSEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDLRDPAPSESPF-PVRPSNSM 598 Query: 2150 XXXXXPMQSRGSWFPLEEDRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGA 1971 +QSRG+W P+EE+ PR+ NRA V +E P+++E MH DK R HPSFF Sbjct: 599 QVSVPRVQSRGNWVPVEEEMSPRQLNRA----VTREFPMDTEPMHIDKHRPHHPSFFPKV 654 Query: 1970 ESSFPSDRMLNENQRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFES 1791 ESS PS+RM +ENQR PK DDRLR+N + SNY+S GEE L +S SS+RD ES Sbjct: 655 ESSIPSERMPHENQRLPKVAPYKDDRLRLNQTMSNYQSLSGEENSLSRSSSSNRDLDVES 714 Query: 1790 GRGSPPYTETPAGVLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRK 1611 R + ETP VL EI +KCG KVEF+ +L+ S +LQFS+E WF GE++G G G+TR+ Sbjct: 715 DR-AVSSAETPVRVLHEISMKCGAKVEFKHSLVNSRDLQFSVEAWFAGERVGEGFGRTRR 773 Query: 1610 EAQRQAADSSLRNLANKYLSNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKED 1434 EAQ AA++S++NLAN Y+S D ++ GD +K S N+ G LG NSFG Q LPK++ Sbjct: 774 EAQSVAAEASIKNLANIYISRAKPDNGALHGDASKYSSANDNGFLGHVNSFGSQPLPKDE 833 Query: 1433 XXXXXXXSEPTRFLDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEV 1254 SE + LDPRLE SKKSM SV+ALKE CMMEGL + FLAQ PLS+N + EV Sbjct: 834 ILSYSDSSEQSGLLDPRLESSKKSMSSVNALKEFCMMEGLGVNFLAQTPLSSNSVQNAEV 893 Query: 1253 YAQVEIGGQILGKGVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRL 1074 +AQVEI GQ++GKG+G+T++EAKMQAAE+AL +L++ G R GSPR + +K L Sbjct: 894 HAQVEIDGQVMGKGIGSTFDEAKMQAAEKALGSLRTTFGRFPPKRQGSPRPVPGMPNKHL 953 Query: 1073 KPEFPRMLQRINSSTRYSNNAPPVP 999 KPEFPR+LQR+ SS RY NAPPVP Sbjct: 954 KPEFPRVLQRMPSSARYPKNAPPVP 978 >ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like isoform X1 [Glycine max] Length = 956 Score = 1183 bits (3060), Expect = 0.0 Identities = 627/969 (64%), Positives = 752/969 (77%), Gaps = 16/969 (1%) Frame = -3 Query: 3857 KMFKSVVYQGNSLLGEVEIYPQ---NQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPS 3687 +M+KSVVYQG ++GEV++YP+ N K + KEIRISHFSQPSERCPPLAVLHT+ Sbjct: 2 RMYKSVVYQGEVVVGEVDVYPEENNNYKNFHV-KEIRISHFSQPSERCPPLAVLHTVTSC 60 Query: 3686 GVCFKMESKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGF 3507 GVCFKMESK Q +D LF LHS C+RE+KTAVMPLG EE+HLVAM SR N+++ CFWGF Sbjct: 61 GVCFKMESKTQQQDG-LFQLHSLCIRENKTAVMPLGGEEIHLVAMHSR-NVDR-PCFWGF 117 Query: 3506 IVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSG 3327 IV+ GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN+E+DPQR+SG Sbjct: 118 IVALGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISG 177 Query: 3326 MLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNI 3147 M AE+KRYQDDK+ILKQY ENDQVV+NG+VIKVQSE+VPA SD++QPIVRPLIRLQ+KNI Sbjct: 178 MQAEVKRYQDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRLQDKNI 237 Query: 3146 ILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPE 2967 ILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDP+ Sbjct: 238 ILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPD 297 Query: 2966 SNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVP 2787 SNLINSKELL R+VCVK+G +KSL NVF DG+CHPKMALVIDDRLKVWDEKDQ RVHVVP Sbjct: 298 SNLINSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVP 357 Query: 2786 AFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSA 2607 AFAPYYAPQAEA+N +PVLCVARNVACNVRGGFFK+FD+GLLQ I ++ YED + + PS Sbjct: 358 AFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKDIPSP 417 Query: 2606 PDVSNFLISEDDVSASN--KDPLGFEGVTDVE-ERRLKDVIQASS----MANSLDPRPAP 2448 PDVSN+L+SEDD S SN +DP F+G+ D E ER+LKD + A+S +LDPR Sbjct: 418 PDVSNYLVSEDDGSISNGHRDPFLFDGMADAEVERKLKDALSAASTIPVTTANLDPR-LT 476 Query: 2447 SLQNAVASSLGTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEGE 2277 SLQ + S G++ PT Q +M + QF Q +VKP+ PSEPSL SPAREEGE Sbjct: 477 SLQYTMVPS-GSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSEPSLHSSPAREEGE 535 Query: 2276 VPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXPMQSRGSWFPLEE 2097 VPESELDPDTRRRLLILQHGQDTRDH S+EPPF SRG WFP EE Sbjct: 536 VPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQTSAPHVP--SSRGVWFPAEE 593 Query: 2096 DRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-NQRFP 1920 + + NR VPKE P++S + K R HPSFF ESS SDR+L++ +QR P Sbjct: 594 EIGSQPLNRV----VPKEFPVDSGPLGIAKPRPHHPSFFSKVESSISSDRILHDSHQRLP 649 Query: 1919 KEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQE 1740 KE++ DDR R+NH S+YRSF G++ P +S SS RD ESG S + +TP VLQE Sbjct: 650 KEMYHRDDRPRLNHMLSSYRSFSGDDIPFSRSFSSHRDLDSESGH-SVLHADTPVAVLQE 708 Query: 1739 IGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANK 1560 I +KCGTKV+F S+L+AS ELQFS+E WF+G+KIG G+TRKEAQ +AA+ S+++LA+ Sbjct: 709 IALKCGTKVDFISSLVASTELQFSMEAWFSGKKIGHRVGRTRKEAQNKAAEDSIKHLADI 768 Query: 1559 YLSNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXSEPTRFLDPR 1383 YLS+ +P S GD++ +VN+ G +G ++S G Q L KED + P+R LDPR Sbjct: 769 YLSSAKDEPGSTYGDVSGFPNVNDSGYMGIASSLGNQPLSKED-SASFSTASPSRVLDPR 827 Query: 1382 LEESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILGKGVG 1206 L+ SK+SMGS+S+LKELCMMEGL + FL+ P P+STN + K EV+AQVEI G++ GKG+G Sbjct: 828 LDVSKRSMGSISSLKELCMMEGLDVNFLSAPAPVSTNSVQKDEVHAQVEIDGKVFGKGIG 887 Query: 1205 ATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTR 1026 TW+EAKMQAAE+AL +L+S LG S Q R SPR FS+KRLK E+PR +QR+ SS R Sbjct: 888 LTWDEAKMQAAEKALGSLRSKLGQSIQKRQSSPRPHQGFSNKRLKQEYPRPMQRMPSSAR 947 Query: 1025 YSNNAPPVP 999 Y NAPP+P Sbjct: 948 YPRNAPPIP 956 >emb|CAN72816.1| hypothetical protein VITISV_004100 [Vitis vinifera] Length = 894 Score = 1180 bits (3053), Expect = 0.0 Identities = 638/960 (66%), Positives = 725/960 (75%), Gaps = 8/960 (0%) Frame = -3 Query: 3854 MFKSVVYQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGVCF 3675 M+KS+VY+G+ ++GEVEIYPQNQ + ++ KEIRISH+SQPSERCPPLAVLHTI GVCF Sbjct: 1 MYKSIVYEGDDVVGEVEIYPQNQGLELM-KEIRISHYSQPSERCPPLAVLHTITSCGVCF 59 Query: 3674 KMES-KLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFIVS 3498 KMES K QS+D+PL+ LHS+C+RE+KTAVM LGEEELHLVAM S+K QY CFWGF V+ Sbjct: 60 KMESSKAQSQDTPLYLLHSTCIRENKTAVMSLGEEELHLVAMYSKKKDGQYPCFWGFNVA 119 Query: 3497 PGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGMLA 3318 GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTE+DPQR+SGM+A Sbjct: 120 LGLYSSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEVDPQRISGMVA 179 Query: 3317 EIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNIILT 3138 E VVENGK+ K Q E+VPA SDN+QPIVRPLIRLQEKNIILT Sbjct: 180 E-------------------VVENGKLFKTQPEIVPALSDNHQPIVRPLIRLQEKNIILT 220 Query: 3137 RINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPESNL 2958 RINPLIRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDPESNL Sbjct: 221 RINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESNL 280 Query: 2957 INSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPAFA 2778 INSKELL R+VCVK+GSRKSL NVF DGICHPKMALVIDDRLKVWDEKDQ RVHVVPAFA Sbjct: 281 INSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVIDDRLKVWDEKDQPRVHVVPAFA 340 Query: 2777 PYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAPDV 2598 PYYAPQAEANN + VLCVARNVACNVRGGFFKEFDEGLLQ I E+ YED + + SAPDV Sbjct: 341 PYYAPQAEANNAISVLCVARNVACNVRGGFFKEFDEGLLQRIPEISYEDBIKDIRSAPDV 400 Query: 2597 SNFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQASSMANSLDPRPAPSLQNAVA 2427 SN+L+SEDD S S N+D F+G+ DVE ER+LKD I A S SLDPR +P LQ AVA Sbjct: 401 SNYLVSEDDASVSNGNRDQPCFDGMADVEVERKLKDAISAPSTVTSLDPRLSPPLQFAVA 460 Query: 2426 SSLGTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEGEVPESELD 2256 +S G QP QG IM N QF Q ++KPL EP++Q SPAREEGEVPESELD Sbjct: 461 ASSGLAPQPAAQGSIMPFSNKQFPQSASLIKPLA----PEPTMQSSPAREEGEVPESELD 516 Query: 2255 PDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXPMQSRGSWFPLEEDRIPRKP 2076 PDTRRRLLILQHGQDTR+H SS+PPF +QSRGSWFP +E+ PR+ Sbjct: 517 PDTRRRLLILQHGQDTREHASSDPPF---PVRPPIQVSVPRVQSRGSWFPADEEMSPRQL 573 Query: 2075 NRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNENQRFPKEVHQGDD 1896 NRA VPKE PL+S+TMH +K R HPSFFH ESS SDR+L+ENQR KEV DD Sbjct: 574 NRA----VPKEFPLDSDTMHIEKHRPHHPSFFHKVESSASSDRILHENQRLSKEVLHRDD 629 Query: 1895 RLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIGIKCGTK 1716 RLR+NHS Y SF GEE PLG+S SS+RD FESGRG+ PY ETPA +G+ Sbjct: 630 RLRLNHSLPGYHSFSGEEVPLGRS-SSNRDLDFESGRGA-PYAETPA-----VGL----- 677 Query: 1715 VEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYLSNVMHD 1536 L+ EVW GEKIG GTGKTR+EAQ QAA++SL L+ +YL Sbjct: 678 ------------LRNCNEVWNQGEKIGEGTGKTRREAQCQAAEASLMYLSYRYL------ 719 Query: 1535 PSSVRGDLNK-PSHVNEMGLGDSNSFGYQTLPKEDXXXXXXXSEPTRFLDPRLEESKKSM 1359 GD+N+ P+ + + D+NSFGYQ+ PKE SE +R LDPRLE SKKSM Sbjct: 720 ----HGDVNRFPNASDNNFMSDTNSFGYQSFPKEGSMSFSTASESSRLLDPRLESSKKSM 775 Query: 1358 GSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGKGVGATWNEAKMQ 1179 GS+SALKELCMMEGL + FL+QPPLS+N K E+ AQVEI GQ+LGKG G+TW++AKMQ Sbjct: 776 GSISALKELCMMEGLGVEFLSQPPLSSNSTQKEEICAQVEIDGQVLGKGTGSTWDDAKMQ 835 Query: 1178 AAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTRYSNNAPPVP 999 AAE+AL +LKSMLG Q R GSPR L KRLK EF R LQR SS RYS N PVP Sbjct: 836 AAEKALGSLKSMLGQFSQKRQGSPRSLQGM-GKRLKSEFTRGLQRTPSSGRYSKNTSPVP 894 >ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like [Glycine max] Length = 960 Score = 1173 bits (3034), Expect = 0.0 Identities = 621/973 (63%), Positives = 743/973 (76%), Gaps = 21/973 (2%) Frame = -3 Query: 3854 MFKSVVYQGNSLLGEVEIYPQ--------NQKICMIDKEIRISHFSQPSERCPPLAVLHT 3699 M+KSVVYQG ++GEV++YP+ N KEIRISHFSQPSERCPPLAVLHT Sbjct: 1 MYKSVVYQGEVVVGEVDVYPEENNNNNNKNYNKNFHVKEIRISHFSQPSERCPPLAVLHT 60 Query: 3698 IAPSGVCFKMESKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSC 3519 + GVCFKMESK Q +D LF LHS C+RE+KTAVMPLG EE+HLVAM SR + C Sbjct: 61 VTSCGVCFKMESKTQQQDG-LFQLHSLCIRENKTAVMPLGGEEIHLVAMHSRN--DDRPC 117 Query: 3518 FWGFIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQ 3339 FWGFIV+ GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN+E+DPQ Sbjct: 118 FWGFIVTLGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQ 177 Query: 3338 RVSGMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQ 3159 R+SGM AE+KRY DDK+ILKQY ENDQVV+NG+VIKVQSE+VPA SD++QPIVRPLIRLQ Sbjct: 178 RISGMQAEVKRYLDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRLQ 237 Query: 3158 EKNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRL 2979 +KNIILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRL Sbjct: 238 DKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRL 297 Query: 2978 LDPESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRV 2799 LDP+SNLINSKELL R+VCVK+G +KSL NVF DG C PKMALVIDDRLKVWDE+DQ RV Sbjct: 298 LDPDSNLINSKELLGRIVCVKSGLKKSLFNVFQDGSCDPKMALVIDDRLKVWDERDQPRV 357 Query: 2798 HVVPAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLAN 2619 HVVPAFAPYYAPQAEA+N +PVLCVARNVACNVRGGFFK+FD+GLLQ I ++ YED + + Sbjct: 358 HVVPAFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKD 417 Query: 2618 FPSAPDVSNFLISEDD--VSASNKDPLGFEGVTDVE-ERRLKDVIQASS----MANSLDP 2460 PS PDVSN+L+SEDD +S N+DP F+G+ D E ER+LKD + A+S +LDP Sbjct: 418 VPSPPDVSNYLVSEDDGSISNGNRDPFLFDGMADAEVERKLKDALAAASTFPVTTANLDP 477 Query: 2459 RPAPSLQNAVASSLGTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAR 2289 R SLQ + S G++ PT Q +M + QF Q +VKP+ PS+PSL SPAR Sbjct: 478 R-LTSLQYTMVPS-GSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSDPSLHSSPAR 535 Query: 2288 EEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXPMQSRGSWF 2109 EEGEVPESELDPDTRRRLLILQHGQDTRDH S+EPPF SRG WF Sbjct: 536 EEGEVPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQASAPRVP--SSRGVWF 593 Query: 2108 PLEEDRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-N 1932 P+EE+ + NR VPKE P++S + +K R HPSFF+ ESS SDR+L++ + Sbjct: 594 PVEEEIGSQPLNRV----VPKEFPVDSGPLGIEKPRLHHPSFFNKVESSISSDRILHDSH 649 Query: 1931 QRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAG 1752 QR PKE++ DDR R+NH S+YRSF G++ P +S SS RD ESG S + +TP Sbjct: 650 QRLPKEMYHRDDRPRLNHMLSSYRSFSGDDIPFSRSSSSHRDLDSESGH-SVLHADTPVA 708 Query: 1751 VLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRN 1572 VL EI +KCGTKV+F S+L+AS EL+FS+E WF+G+KIG G G+TRKEAQ +AA S+ + Sbjct: 709 VLHEIALKCGTKVDFMSSLVASTELKFSLEAWFSGKKIGHGFGRTRKEAQNKAAKDSIEH 768 Query: 1571 LANKYLSNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXSEPTRF 1395 LA+ YLS+ +P S GD++ +VN+ G +G ++S G Q L KED + P+R Sbjct: 769 LADIYLSSAKDEPGSTYGDVSGFPNVNDNGYMGIASSLGNQPLSKED-SASFSSASPSRA 827 Query: 1394 LDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILG 1218 LDPRL+ SK+SMGS+SALKELCMMEGL + FL+ P P+STN + K EV+AQVEI G+I G Sbjct: 828 LDPRLDVSKRSMGSISALKELCMMEGLGVNFLSTPAPVSTNSVQKDEVHAQVEIDGKIFG 887 Query: 1217 KGVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRIN 1038 KG+G TW+EAKMQAAE+AL NL+S LG S Q SPR FS+KRLK E+PR +QR+ Sbjct: 888 KGIGLTWDEAKMQAAEKALGNLRSKLGQSIQKMQSSPRPHQGFSNKRLKQEYPRTMQRMP 947 Query: 1037 SSTRYSNNAPPVP 999 SS RY NAPP+P Sbjct: 948 SSARYPRNAPPIP 960 >ref|XP_007159305.1| hypothetical protein PHAVU_002G226900g [Phaseolus vulgaris] gi|561032720|gb|ESW31299.1| hypothetical protein PHAVU_002G226900g [Phaseolus vulgaris] Length = 964 Score = 1164 bits (3012), Expect = 0.0 Identities = 624/977 (63%), Positives = 738/977 (75%), Gaps = 25/977 (2%) Frame = -3 Query: 3854 MFKSVVYQGNSLLGEVEIYPQNQKICMID-KEIRISHFSQPSERCPPLAVLHTIAPSGVC 3678 M+KSVVYQG +LGEVE+YP+ KEIRISHFSQPSERCPPLAVLHT+ GVC Sbjct: 1 MYKSVVYQGEVVLGEVEVYPEENNYKNFHVKEIRISHFSQPSERCPPLAVLHTVTSCGVC 60 Query: 3677 FKMESKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFIVS 3498 FKMESK Q +D LF LHS C+RE+KTAV+PLG EE+HLVAM SR + FWGFIV+ Sbjct: 61 FKMESKTQQQDG-LFHLHSLCIRENKTAVIPLGGEEIHLVAMHSRN--DDRPRFWGFIVA 117 Query: 3497 PGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGMLA 3318 GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN+E+DPQR+SGM A Sbjct: 118 LGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISGMQA 177 Query: 3317 EIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNIILT 3138 E+KRYQ+DK+ILKQY ENDQVV+NG+V+KVQSE+VPA SDN+QPIVRPLIRLQ+KNIILT Sbjct: 178 EVKRYQEDKNILKQYAENDQVVDNGRVVKVQSEIVPALSDNHQPIVRPLIRLQDKNIILT 237 Query: 3137 RINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPESNL 2958 RINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDP+SNL Sbjct: 238 RINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNL 297 Query: 2957 INSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPAFA 2778 INSKELL R+VCVK+G +KSL NVF DG+CHPKMALVIDDRLKVWDEKDQ RVHVVPAFA Sbjct: 298 INSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAFA 357 Query: 2777 PYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAPDV 2598 PYYAPQAEA+N +PVLCVARNVACNVRGGFFKEFD+GLLQ I +V YED + + P PDV Sbjct: 358 PYYAPQAEASNSIPVLCVARNVACNVRGGFFKEFDDGLLQKIPQVAYEDDIKDIPIPPDV 417 Query: 2597 SNFLISEDD----VSASNKDPLGFEGVTDVE-ERRLK---------DVIQASS----MAN 2472 SN+L+SEDD +S N+DP F+ + D E ER+ K D + A+S Sbjct: 418 SNYLVSEDDGSSAISNGNRDPFLFDSMGDAEVERKSKVPTRAPNEHDALSAASTIPVTTA 477 Query: 2471 SLDPRPAPSLQNAVASSLGTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQG 2301 +LDPR SLQ A+ SS G+ PT Q +M + QF Q +VKP+ PSE SL Sbjct: 478 NLDPR-LTSLQYAMVSS-GSAPPPTAQASMMPFTHVQFPQPAALVKPMGQAAPSESSLHS 535 Query: 2300 SPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXPMQSR 2121 SPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTS+EP + + SR Sbjct: 536 SPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTSNEPTY---AIRHPVPVSAPRVSSR 592 Query: 2120 GSWFPLEEDRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRML 1941 G WFP EED + NR VPKE ++S ++ +K R HPSFF ESS SDR+L Sbjct: 593 GGWFPAEEDIGSQPLNRV----VPKEFSVDSGSLVIEKHRPHHPSFFSKVESSISSDRIL 648 Query: 1940 NE-NQRFPKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTE 1764 ++ +QR PKE++ DDR R NH S+YRS +E P +S SS RD ES S + + Sbjct: 649 HDSHQRLPKEMYHRDDRPRSNHMLSSYRSLSVDEIPFSRSSSSHRDLDSESSH-SVFHAD 707 Query: 1763 TPAGVLQEIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADS 1584 TP VLQEI +KCGTKVEF S+L+AS ELQFSIE WF+G+KIG G G+TRKEAQ +AA+ Sbjct: 708 TPVVVLQEIALKCGTKVEFMSSLVASTELQFSIEAWFSGKKIGHGFGRTRKEAQHKAAED 767 Query: 1583 SLRNLANKYLSNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXSE 1407 S+++LA+ YLS+ +P S GD+ + N+ G + ++S Q LPKED S+ Sbjct: 768 SIKHLADIYLSSAKDEPGSTYGDVGGFPNANDNGYMVIASSLSNQPLPKEDSASFSTASD 827 Query: 1406 PTRFLDPRLEESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGG 1230 P+R LDPRLE SK+ MGS+SALKELCMMEGL + FL+ P P+STN + K EV+AQVEI G Sbjct: 828 PSRVLDPRLEVSKRPMGSISALKELCMMEGLGVNFLSAPAPVSTNSLQKDEVHAQVEIDG 887 Query: 1229 QILGKGVGATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRML 1050 ++ GKG+G TW+EAKMQAAE+AL +L+S LG S Q R SPR FS+KRLK E+PR + Sbjct: 888 KVFGKGIGLTWDEAKMQAAEKALGSLRSKLGQSIQKRQSSPRSHQGFSNKRLKQEYPRAM 947 Query: 1049 QRINSSTRYSNNAPPVP 999 QRI SSTRY NAPP+P Sbjct: 948 QRIPSSTRYPRNAPPIP 964 >ref|XP_006583810.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like isoform X2 [Glycine max] Length = 929 Score = 1154 bits (2986), Expect = 0.0 Identities = 616/969 (63%), Positives = 738/969 (76%), Gaps = 16/969 (1%) Frame = -3 Query: 3857 KMFKSVVYQGNSLLGEVEIYPQ---NQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPS 3687 +M+KSVVYQG ++GEV++YP+ N K + KEIRISHFSQPSERCPPLAVLHT+ Sbjct: 2 RMYKSVVYQGEVVVGEVDVYPEENNNYKNFHV-KEIRISHFSQPSERCPPLAVLHTVTSC 60 Query: 3686 GVCFKMESKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGF 3507 GVCFKMESK Q +D LF LHS C+RE+KTAVMPLG EE+HLVAM SR N+++ CFWGF Sbjct: 61 GVCFKMESKTQQQDG-LFQLHSLCIRENKTAVMPLGGEEIHLVAMHSR-NVDR-PCFWGF 117 Query: 3506 IVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSG 3327 IV+ GLY+SCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN+E+DPQR+SG Sbjct: 118 IVALGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISG 177 Query: 3326 MLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNI 3147 M AE+KRYQDDK+ILKQY ENDQVV+NG+VIKVQSE+VPA SD++QPIVRPLIRLQ+KNI Sbjct: 178 MQAEVKRYQDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRLQDKNI 237 Query: 3146 ILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPE 2967 ILTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDP+ Sbjct: 238 ILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPD 297 Query: 2966 SNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVP 2787 SNLINSKELL R+VCVK+G +KSL NVF DG+CHPKMALVIDDRLKVWDEKDQ RVHVVP Sbjct: 298 SNLINSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVP 357 Query: 2786 AFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSA 2607 AFAPYYAPQAEA+N +PVLCVARNVACNVRGGFFK+FD+GLLQ I ++ YED + + PS Sbjct: 358 AFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKDIPSP 417 Query: 2606 PDVSNFLISEDDVSASN--KDPLGFEGVTDVE-ERRLKDVIQASS----MANSLDPRPAP 2448 PDVSN+L+SEDD S SN +DP F+G+ D E ER+LKD + A+S +LDPR Sbjct: 418 PDVSNYLVSEDDGSISNGHRDPFLFDGMADAEVERKLKDALSAASTIPVTTANLDPR-LT 476 Query: 2447 SLQNAVASSLGTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEGE 2277 SLQ + S G++ PT Q +M + QF Q +VKP+ PSEPSL SPAREEGE Sbjct: 477 SLQYTMVPS-GSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSEPSLHSSPAREEGE 535 Query: 2276 VPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXPMQSRGSWFPLEE 2097 VPESELDPDTRRRLLILQHGQDTRDH S+EPPF SRG WFP EE Sbjct: 536 VPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQTSAPHVP--SSRGVWFPAEE 593 Query: 2096 DRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-NQRFP 1920 + + NR VPKE P++S + K R HPSFF ESS SDR+L++ +QR P Sbjct: 594 EIGSQPLNRV----VPKEFPVDSGPLGIAKPRPHHPSFFSKVESSISSDRILHDSHQRLP 649 Query: 1919 KEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQE 1740 KE++ DDR R+NH S+YRSF ++TP VLQE Sbjct: 650 KEMYHRDDRPRLNHMLSSYRSF----------------------------SDTPVAVLQE 681 Query: 1739 IGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANK 1560 I +KCGTKV+F S+L+AS ELQFS+E WF+G+KIG G+TRKEAQ +AA+ S+++LA+ Sbjct: 682 IALKCGTKVDFISSLVASTELQFSMEAWFSGKKIGHRVGRTRKEAQNKAAEDSIKHLADI 741 Query: 1559 YLSNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXSEPTRFLDPR 1383 YLS+ +P S GD++ +VN+ G +G ++S G Q L KED + P+R LDPR Sbjct: 742 YLSSAKDEPGSTYGDVSGFPNVNDSGYMGIASSLGNQPLSKED-SASFSTASPSRVLDPR 800 Query: 1382 LEESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILGKGVG 1206 L+ SK+SMGS+S+LKELCMMEGL + FL+ P P+STN + K EV+AQVEI G++ GKG+G Sbjct: 801 LDVSKRSMGSISSLKELCMMEGLDVNFLSAPAPVSTNSVQKDEVHAQVEIDGKVFGKGIG 860 Query: 1205 ATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTR 1026 TW+EAKMQAAE+AL +L+S LG S Q R SPR FS+KRLK E+PR +QR+ SS R Sbjct: 861 LTWDEAKMQAAEKALGSLRSKLGQSIQKRQSSPRPHQGFSNKRLKQEYPRPMQRMPSSAR 920 Query: 1025 YSNNAPPVP 999 Y NAPP+P Sbjct: 921 YPRNAPPIP 929 >ref|XP_004505032.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like [Cicer arietinum] Length = 951 Score = 1154 bits (2984), Expect = 0.0 Identities = 621/966 (64%), Positives = 728/966 (75%), Gaps = 14/966 (1%) Frame = -3 Query: 3854 MFKSVVYQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGVCF 3675 M+KS+VYQG +LGEV+IYP+ KEIRISHF+QPSERC PLAVLHTI SGVCF Sbjct: 1 MYKSLVYQGEVVLGEVDIYPEVNNNNKNFKEIRISHFTQPSERCLPLAVLHTITSSGVCF 60 Query: 3674 KMESKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFIVSP 3495 KMESK Q +D PLF LH+ C RE+KTAVMPL EE+HLVAM SR N CFWG+IV Sbjct: 61 KMESKTQQQD-PLFHLHNLCFRENKTAVMPLCGEEMHLVAMHSRSNGRP--CFWGYIVGM 117 Query: 3494 GLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGMLAE 3315 GLYNSCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN+E+DPQR+SGM AE Sbjct: 118 GLYNSCLMMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISGMQAE 177 Query: 3314 IKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNIILTR 3135 +KRY +DKSILKQYVENDQVV+NGKV+K QSE+VPA SD++QPIVRPLIRL EKNIILTR Sbjct: 178 VKRYLEDKSILKQYVENDQVVDNGKVLKAQSELVPALSDSHQPIVRPLIRLHEKNIILTR 237 Query: 3134 INPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPESNLI 2955 INP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDP+SNLI Sbjct: 238 INPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNLI 297 Query: 2954 NSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPAFAP 2775 NSKELL R+VCVK+G +KSL NVF DG+CHPKMALVIDDRLKVWDEKDQ RVHVVPAFAP Sbjct: 298 NSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAP 357 Query: 2774 YYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAPDVS 2595 YYAPQAEA+N +PVLCVARNVACNVRGGFFK+FD+GLLQ I ++ YE+ + APDVS Sbjct: 358 YYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKISQIAYENNTRDISPAPDVS 417 Query: 2594 NFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQASS----MANSLDPRPAPSLQN 2436 N+L+SEDD SAS N+DP F+G+ D E ER+LKD I A+S LDPR SLQ Sbjct: 418 NYLVSEDDGSASYANRDPFAFDGMADAEVERKLKDAISAASAIPMTTAKLDPRLTSSLQY 477 Query: 2435 AVASSLGTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEGEVPES 2265 + S G++ P Q ++ + QF Q +VKP+ V PSE SL SPAREEGEVPES Sbjct: 478 TMVSP-GSVLPPAAQASMIPLPHTQFPQPATLVKPIGQVAPSELSLHSSPAREEGEVPES 536 Query: 2264 ELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXPMQSRGSWFPLEEDRIP 2085 ELDPDTRRRLLILQHGQD RDHTSSEPPF + RG WFP+EE+ Sbjct: 537 ELDPDTRRRLLILQHGQDNRDHTSSEPPF----PLKHPVQVSARVPPRGGWFPVEEEIGS 592 Query: 2084 RKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-NQRFPKEVH 1908 + PNR +PKE L+S +K R FF + S SDR L+E NQR PKE++ Sbjct: 593 QPPNRV----IPKEIALDSGPSRIEKHRLHQQPFFPKVDGSISSDRALHETNQRLPKEMY 648 Query: 1907 QGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIGIK 1728 DDR RV+H S+Y S G++TP G+S SS RDF ESG S ETPA VLQEI +K Sbjct: 649 HRDDRSRVSHMLSSYPSLSGDDTPFGRSSSSHRDFDSESGH-SVFNAETPAIVLQEIALK 707 Query: 1727 CGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYLSN 1548 CGTKVEF S+L AS ELQFSIE WF+G+KIG G G+TR EAQ +AA+ S+++LA+ YLS Sbjct: 708 CGTKVEFTSSLAASRELQFSIEAWFSGKKIGHGFGRTRMEAQYKAAEDSIKHLADIYLSR 767 Query: 1547 VMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXSEPTRFLDPRLEES 1371 + S GD++ + N+ G +G+ +S G Q LPKE+ S+P+R LDPRL+ S Sbjct: 768 AKDESGSAFGDVSGFPNANDNGYVGNVSSLGNQPLPKEESVSFSAASDPSRVLDPRLDVS 827 Query: 1370 KKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILGKGVGATWN 1194 K+SMGSVSALKELCM+EGL + FL+ P P+STN ++ EV+AQVEI GQ+ GKG G TW+ Sbjct: 828 KRSMGSVSALKELCMVEGLGVNFLSLPAPVSTNSVD--EVHAQVEIDGQVYGKGTGITWD 885 Query: 1193 EAKMQAAEEALWNLKSMLGDSG-QNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTRYSN 1017 EAKMQAAE+AL +L++ + G Q R SPR S+KRLK E PR LQR SS RY Sbjct: 886 EAKMQAAEKALGSLRTTIHGQGIQRRQLSPRPFQGLSNKRLKQEHPRTLQRFASSGRYPR 945 Query: 1016 NAPPVP 999 NAPP+P Sbjct: 946 NAPPIP 951 >ref|XP_003545893.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like isoform X1 [Glycine max] Length = 958 Score = 1125 bits (2909), Expect = 0.0 Identities = 607/970 (62%), Positives = 720/970 (74%), Gaps = 18/970 (1%) Frame = -3 Query: 3854 MFKSVVYQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGVCF 3675 M +S+VY G +GEVEIYP+ +K + KEIRISHFSQPSERCPPLAVLHTI G+CF Sbjct: 1 MKRSMVYHGEMEVGEVEIYPEEKKNIDL-KEIRISHFSQPSERCPPLAVLHTITSFGICF 59 Query: 3674 KMES---KLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFI 3504 KMES + + + LF LHSSC+RE+KTAVMPL EE+HLVAM SR N CFWGFI Sbjct: 60 KMESSTSQTRQQQDVLFHLHSSCIRENKTAVMPLRGEEIHLVAMYSRNN--DRPCFWGFI 117 Query: 3503 VSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGM 3324 V+ GLYNSCL MLNLRCLGIVFDLDETL+VANTMRSFED+I+ L RK+N+E++PQR+S M Sbjct: 118 VASGLYNSCLTMLNLRCLGIVFDLDETLVVANTMRSFEDKIEVLHRKMNSEVNPQRISTM 177 Query: 3323 LAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNII 3144 AEIKRY DDK+ILK+Y ENDQVV+NGKVIK+QSE+VPA SD++QPIVRPLIRLQEKNII Sbjct: 178 QAEIKRYLDDKNILKEYAENDQVVDNGKVIKIQSEIVPALSDSHQPIVRPLIRLQEKNII 237 Query: 3143 LTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPES 2964 LTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEV+VCTMAERDYALEMWRLLDPE Sbjct: 238 LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPEL 297 Query: 2963 NLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPA 2784 NLINSKELL R+VCVK+G +KSL NVF +G+CH KMALVIDDRLKVWDEKDQ +VHVVPA Sbjct: 298 NLINSKELLDRIVCVKSGLKKSLFNVFQNGLCHLKMALVIDDRLKVWDEKDQPQVHVVPA 357 Query: 2783 FAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAP 2604 FAPYYAPQAEA+N VP LC+AR+VACNVRGGFFK+FD+GLLQ I + YED + + PS P Sbjct: 358 FAPYYAPQAEASNAVPTLCLARSVACNVRGGFFKDFDDGLLQKIPLIAYEDDIKDIPSPP 417 Query: 2603 DVSNFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQASS----MANSLDPRPA-- 2451 DVSN+L+SEDD SAS NK+ L F+G+ D E ERRLKD I ASS M +LDPR A Sbjct: 418 DVSNYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTVPAMTTNLDPRLAFN 477 Query: 2450 PSLQNAVASSLGTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEG 2280 SLQ + SS GT+ PT Q I+ N QF Q +VKP+ V P PSL SPAREEG Sbjct: 478 SSLQYTMVSSSGTVPPPTAQASIVQFGNVQFPQPNTLVKPICQVTPPGPSLHSSPAREEG 537 Query: 2279 EVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXPMQSRGSWFPLE 2100 EVPESELD DTRRRLLILQHGQDTR+HTSSEPP + SR WF +E Sbjct: 538 EVPESELDLDTRRRLLILQHGQDTREHTSSEPPL---PVRHPTQVSAPSVPSRRGWFSVE 594 Query: 2099 EDRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-NQRF 1923 E+ P++ N+ VPKE P+ SE +H +KR RHPS F + S SDR+ +E +QR Sbjct: 595 EEMGPQQLNQL----VPKEFPVGSEPLHIEKRWPRHPSLFSKVDDSVSSDRVFHESHQRL 650 Query: 1922 PKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQ 1743 PKEVH DD R++ S S+Y SFPG++ PL S S+RDF ESGR S + + AGVLQ Sbjct: 651 PKEVHHRDDHSRLSQSLSSYHSFPGDDIPLSGSSYSNRDFDSESGR-SLFHADITAGVLQ 709 Query: 1742 EIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLAN 1563 EI +KCGTKVEF S+L+AS LQFSIE WF G+K+G G G+TR+EAQ +AA+ S++ LA+ Sbjct: 710 EIALKCGTKVEFLSSLVASTALQFSIEAWFAGKKVGEGFGRTRREAQNKAAECSIKQLAD 769 Query: 1562 KYLSNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXSEPTRFLDP 1386 Y+S+ D S GD++ N G + NS G Q LPKE S+ +R DP Sbjct: 770 IYMSHAKDDSGSTYGDVSGFHGSNNNGFVSSGNSLGNQLLPKES-VSFSTSSDSSRVSDP 828 Query: 1385 RLEESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILGKGV 1209 RLE SK+S S+SALKE CMMEGL+ F + P P ST+ K EV+AQVEI GQI GKG Sbjct: 829 RLEVSKRSTDSISALKEFCMMEGLAANFQSSPAPASTHFAQKDEVHAQVEIDGQIFGKGF 888 Query: 1208 GATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSST 1029 G TW EAKMQAA++AL +L++M + R GSPR + ++KRLK E+PR LQRI S Sbjct: 889 GLTWEEAKMQAAKKALESLRTMFNQGTRKRHGSPRSMQGLANKRLKQEYPRTLQRIPYSA 948 Query: 1028 RYSNNAPPVP 999 RY NAP VP Sbjct: 949 RYPRNAPLVP 958 >ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like [Solanum tuberosum] Length = 953 Score = 1122 bits (2903), Expect = 0.0 Identities = 618/971 (63%), Positives = 715/971 (73%), Gaps = 19/971 (1%) Frame = -3 Query: 3854 MFKS--VVYQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGV 3681 MFKS V+Y+G L+GEVEIY + + + +K IRISH+S SERCPPLAVLHT+ +G+ Sbjct: 1 MFKSTVVLYEGERLVGEVEIYCE-KGVLWGEKVIRISHYSPSSERCPPLAVLHTVT-TGL 58 Query: 3680 CFKME---SKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWG 3510 FK+E SK ++DSPL LHS+CLR++KTAVM LG EELHLVAM S+ Q CFWG Sbjct: 59 SFKLEPTKSKPLTQDSPLTLLHSTCLRDNKTAVMSLGREELHLVAMQSKNIGGQCPCFWG 118 Query: 3509 FIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVS 3330 F V+ GLY+SCL MLNLRCLGIVFDLDETLIVANTMRSFEDRI+ALQRKIN+E DPQR S Sbjct: 119 FKVASGLYDSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINSESDPQRAS 178 Query: 3329 GMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKN 3150 MLAE+KRYQ+DK ILKQY ENDQVV+NGKVIK QSEV PA SDN+QPIVRPLIRLQ++N Sbjct: 179 VMLAEVKRYQEDKIILKQYAENDQVVDNGKVIKSQSEVFPALSDNHQPIVRPLIRLQDRN 238 Query: 3149 IILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDP 2970 IILTRINP+IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDP Sbjct: 239 IILTRINPMIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP 298 Query: 2969 ESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVV 2790 +SNLINS+ELL R+VCVK+G RKSL NVF DG CHPKMALVIDDRLKVWD+KDQ RVHVV Sbjct: 299 DSNLINSQELLDRIVCVKSGLRKSLFNVFQDGNCHPKMALVIDDRLKVWDDKDQPRVHVV 358 Query: 2789 PAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPS 2610 PAFAPY+APQAE NN VPVLCVARNVACNVRGGFFK+FDEGLLQ I EV YED + PS Sbjct: 359 PAFAPYFAPQAEGNNSVPVLCVARNVACNVRGGFFKDFDEGLLQRISEVAYEDDIKQVPS 418 Query: 2609 APDVSNFLISEDDVSA--SNKDPLGFEGVTDVE-ERRLKDVIQAS----SMANSLDPRPA 2451 APDVSN+LISEDD SA NKD LGF+G+ D E ERRLK+ + AS S +LDPR Sbjct: 419 APDVSNYLISEDDPSAVNGNKDSLGFDGMADSEVERRLKEAMLASTSVPSQMTNLDPRLV 478 Query: 2450 PSLQNAVASSLGTISQPTPQGPIMSSQNNQFTQVVKPL----VHVGPSEPSLQGSPAREE 2283 P+LQ V ISQP+ Q P++ QV L + P + SLQ SPAREE Sbjct: 479 PALQYPVPP---VISQPSIQSPVVPFPTQHLPQVTSVLKSSVTQISPQDTSLQSSPAREE 535 Query: 2282 GEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXPMQSRGSWFPL 2103 GEVPESELDPDTRRRLLILQHGQDTRD SSEP F +Q G WFP Sbjct: 536 GEVPESELDPDTRRRLLILQHGQDTRDQVSSEPKF--PMGTPLQVSVPPRVQPHG-WFPA 592 Query: 2102 EEDRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNENQRF 1923 EE+ PR+ NR +P PKE PL E+MH +K R HP F E+S PSDR+L ENQR Sbjct: 593 EEEMSPRQLNRPLP---PKEFPLNPESMHINKHRPPHPPFLPKMETSMPSDRVLFENQRL 649 Query: 1922 PKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQ 1743 PKEV DDR+R + S ++R PGEE PLG+S SS+R E G PY ETPAG LQ Sbjct: 650 PKEVIPRDDRMRFSQSQPSFRP-PGEEVPLGRSSSSNRVLDLEPGH-YDPYLETPAGALQ 707 Query: 1742 EIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLAN 1563 +I KCG KVEFRS+ ++S ELQFS+EV F GEK+G GTG+TR+EAQR+AA+ SL LA+ Sbjct: 708 DIAFKCGAKVEFRSSFLSSPELQFSLEVLFAGEKVGEGTGRTRREAQRRAAEESLMYLAD 767 Query: 1562 KYLSNVMHDPSSVRGDLNKPSHVNEMGLGDSNS-FGYQTLPKEDXXXXXXXSEPTRFLDP 1386 KYLS + D SS +GD + + ++ G D+ S FGYQ D SEP R LDP Sbjct: 768 KYLSCIKPDSSSTQGDGFRFPNASDNGFVDNMSPFGYQ-----DRVSHSFASEPPRVLDP 822 Query: 1385 RLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGKGVG 1206 RLE KKS+GSV AL+ELC +EGL L F QP LS N K E+YAQVEI GQ+ GKG+G Sbjct: 823 RLEVFKKSVGSVGALRELCAIEGLGLAFQTQPQLSANPGQKSEIYAQVEIDGQVFGKGIG 882 Query: 1205 ATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPR-LLHSFSSKRLKPEFPR-MLQRINSS 1032 +TW++AK QAAE AL LKS L Q R GSPR L FS+KRLKPE+ R + QR+ S Sbjct: 883 STWDDAKTQAAERALVALKSELAQFSQKRQGSPRSLQQGFSNKRLKPEYSRGVQQRVPLS 942 Query: 1031 TRYSNNAPPVP 999 R+ N +P Sbjct: 943 GRFPKNTSAMP 953 >ref|XP_006597421.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like isoform X3 [Glycine max] Length = 932 Score = 1108 bits (2866), Expect = 0.0 Identities = 599/969 (61%), Positives = 712/969 (73%), Gaps = 17/969 (1%) Frame = -3 Query: 3854 MFKSVVYQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGVCF 3675 M +S+VY G +GEVEIYP+ +K + KEIRISHFSQPSERCPPLAVLHTI G+CF Sbjct: 1 MKRSMVYHGEMEVGEVEIYPEEKKNIDL-KEIRISHFSQPSERCPPLAVLHTITSFGICF 59 Query: 3674 KMES---KLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFI 3504 KMES + + + LF LHSSC+RE+KTAVMPL EE+HLVAM SR N CFWGFI Sbjct: 60 KMESSTSQTRQQQDVLFHLHSSCIRENKTAVMPLRGEEIHLVAMYSRNN--DRPCFWGFI 117 Query: 3503 VSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGM 3324 V+ GLYNSCL MLNLRCLGIVFDLDETL+VANTMRSFED+I+ L RK+N+E++PQR+S M Sbjct: 118 VASGLYNSCLTMLNLRCLGIVFDLDETLVVANTMRSFEDKIEVLHRKMNSEVNPQRISTM 177 Query: 3323 LAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNII 3144 AEIKRY DDK+ILK+Y ENDQVV+NGKVIK+QSE+VPA SD++QPIVRPLIRLQEKNII Sbjct: 178 QAEIKRYLDDKNILKEYAENDQVVDNGKVIKIQSEIVPALSDSHQPIVRPLIRLQEKNII 237 Query: 3143 LTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPES 2964 LTRINP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEV+VCTMAERDYALEMWRLLDPE Sbjct: 238 LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPEL 297 Query: 2963 NLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPA 2784 NLINSKELL R+VCVK+G +KSL NVF +G+CH KMALVIDDRLKVWDEKDQ +VHVVPA Sbjct: 298 NLINSKELLDRIVCVKSGLKKSLFNVFQNGLCHLKMALVIDDRLKVWDEKDQPQVHVVPA 357 Query: 2783 FAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAP 2604 FAPYYAPQAEA+N VP LC+AR+VACNVRGGFFK+FD+GLLQ I + YED + + PS P Sbjct: 358 FAPYYAPQAEASNAVPTLCLARSVACNVRGGFFKDFDDGLLQKIPLIAYEDDIKDIPSPP 417 Query: 2603 DVSNFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQASS----MANSLDPRPA-- 2451 DVSN+L+SEDD SAS NK+ L F+G+ D E ERRLKD I ASS M +LDPR A Sbjct: 418 DVSNYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTVPAMTTNLDPRLAFN 477 Query: 2450 PSLQNAVASSLGTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEG 2280 SLQ + SS GT+ PT Q I+ N QF Q +VKP+ V P PSL SPAREEG Sbjct: 478 SSLQYTMVSSSGTVPPPTAQASIVQFGNVQFPQPNTLVKPICQVTPPGPSLHSSPAREEG 537 Query: 2279 EVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXPMQSRGSWFPLE 2100 EVPESELD DTRRRLLILQHGQDTR+HTSSEPP + SR WF +E Sbjct: 538 EVPESELDLDTRRRLLILQHGQDTREHTSSEPPL---PVRHPTQVSAPSVPSRRGWFSVE 594 Query: 2099 EDRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-NQRF 1923 E+ P++ N+ VPKE P+ SE +H +KR RHPS F + S SDR+ +E +QR Sbjct: 595 EEMGPQQLNQL----VPKEFPVGSEPLHIEKRWPRHPSLFSKVDDSVSSDRVFHESHQRL 650 Query: 1922 PKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQ 1743 PKEVH DD R++ S S+Y SFPG++ PL S S+RDF ESGR S + + AGVLQ Sbjct: 651 PKEVHHRDDHSRLSQSLSSYHSFPGDDIPLSGSSYSNRDFDSESGR-SLFHADITAGVLQ 709 Query: 1742 EIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLAN 1563 EI +KCGTKVEF S+L+AS LQFSIE WF G+K+G G G+TR+EAQ +AA+ S++ LA+ Sbjct: 710 EIALKCGTKVEFLSSLVASTALQFSIEAWFAGKKVGEGFGRTRREAQNKAAECSIKQLAD 769 Query: 1562 KYLSNVMHDPSSVRGDLNKPSHVNEMGLGDSNSFGYQTLPKEDXXXXXXXSEPTRFLDPR 1383 Y+S+ D S GD++ G SN+ G+ + DPR Sbjct: 770 IYMSHAKDDSGSTYGDVS--------GFHGSNNNGFVS------------------SDPR 803 Query: 1382 LEESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILGKGVG 1206 LE SK+S S+SALKE CMMEGL+ F + P P ST+ K EV+AQVEI GQI GKG G Sbjct: 804 LEVSKRSTDSISALKEFCMMEGLAANFQSSPAPASTHFAQKDEVHAQVEIDGQIFGKGFG 863 Query: 1205 ATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTR 1026 TW EAKMQAA++AL +L++M + R GSPR + ++KRLK E+PR LQRI S R Sbjct: 864 LTWEEAKMQAAKKALESLRTMFNQGTRKRHGSPRSMQGLANKRLKQEYPRTLQRIPYSAR 923 Query: 1025 YSNNAPPVP 999 Y NAP VP Sbjct: 924 YPRNAPLVP 932 >ref|XP_004232844.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like [Solanum lycopersicum] Length = 954 Score = 1104 bits (2855), Expect = 0.0 Identities = 608/972 (62%), Positives = 708/972 (72%), Gaps = 20/972 (2%) Frame = -3 Query: 3854 MFKSVV--YQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGV 3681 MFKS V Y+G L+GEVE+Y + + + +K IRISH+S SERCPPLAVLHT+ +G+ Sbjct: 1 MFKSTVLLYEGERLVGEVEMYGE-KGVVWGEKLIRISHYSPSSERCPPLAVLHTVT-TGL 58 Query: 3680 CFKME---SKLQSKDSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWG 3510 FK+E SK ++DSPL LHS+CLR++KTAVM LG EELHLVAM S+ Q CFWG Sbjct: 59 SFKLEPTKSKPLTQDSPLTLLHSTCLRDNKTAVMSLGREELHLVAMQSKNIGGQCPCFWG 118 Query: 3509 FIVSPGLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVS 3330 F V+ GLY+SCL MLNLRCLGIVFDLDETLIVANTMRSFEDRI+ALQRKIN+E DPQR S Sbjct: 119 FKVASGLYDSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINSESDPQRAS 178 Query: 3329 GMLAEIKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKN 3150 MLAE+KRYQ+DK ILKQY ENDQVV+NGKVI+ QSEV PA SDN+QPIVRPLIRLQ++N Sbjct: 179 VMLAEVKRYQEDKIILKQYAENDQVVDNGKVIRSQSEVFPALSDNHQPIVRPLIRLQDRN 238 Query: 3149 IILTRINPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDP 2970 IILTRINP+IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEVYVCTMAERDYALEMWRLLDP Sbjct: 239 IILTRINPMIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP 298 Query: 2969 ESNLINSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVV 2790 +SNLINS+ELL R+VCVK+G RKSL NVF DG CHPKMALVIDDRLKVWD+KDQ RVHVV Sbjct: 299 DSNLINSQELLDRIVCVKSGLRKSLFNVFQDGNCHPKMALVIDDRLKVWDDKDQPRVHVV 358 Query: 2789 PAFAPYYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPS 2610 PAFAPY+APQAE NN VPVLCVARNVACNVRGGFFK+FDEGLLQ I EV YED + PS Sbjct: 359 PAFAPYFAPQAEGNNSVPVLCVARNVACNVRGGFFKDFDEGLLQRISEVAYEDDIKQVPS 418 Query: 2609 APDVSNFLISEDDVSA--SNKDPLGFEGVTDVE-ERRLKDVIQAS----SMANSLDPRPA 2451 APDVSN+LISEDD SA NKD LGF+G+ D E ERRLK+ + AS S +LDPR Sbjct: 419 APDVSNYLISEDDPSAVNGNKDSLGFDGMADSEVERRLKEAMLASTSVPSQMTNLDPRLV 478 Query: 2450 PSLQNAVASSLGTISQPTPQGPIMSSQNNQFTQVVKPL----VHVGPSEPSLQGSPAREE 2283 P+LQ V ISQP+ QGP++ QV L + P + SLQ SPAREE Sbjct: 479 PALQYPVPP---VISQPSIQGPVVPFPTQHLPQVTSVLKSSVTQISPQDTSLQSSPAREE 535 Query: 2282 GEVPESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXPMQSRGSWFPL 2103 GEVPESELDPDTRRRLLILQHGQDTRD SSEP F +Q G WFP Sbjct: 536 GEVPESELDPDTRRRLLILQHGQDTRDQVSSEPKF--PIGTPLQVSVPPRVQPHG-WFPA 592 Query: 2102 EEDRIPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNENQRF 1923 EE+ PR+ NR +P PKE PL E+MH +K R HP F E+S PSDR+ ENQR Sbjct: 593 EEEVSPRQLNRPLP---PKEFPLNPESMHINKHRPPHPPFLPKMETSMPSDRVFFENQRL 649 Query: 1922 PKEVHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQ 1743 PKEV DDR+R + S ++R PGE+ LG+S SS+R + G PY +TPAG LQ Sbjct: 650 PKEVIPRDDRMRFSQSQPSFRP-PGEDVSLGRSSSSNRVLDLDPGH-YDPYLDTPAGALQ 707 Query: 1742 EIGIKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLAN 1563 +I KCG KVEFRS+ ++S ELQF +EV F GEK+G G G+TR+EAQR AA+ SL LA+ Sbjct: 708 DIAFKCGVKVEFRSSFLSSPELQFCLEVLFAGEKVGEGIGRTRREAQRHAAEESLMYLAD 767 Query: 1562 KYLSNVMHDPSSVRGDLNKPSHVNEMGLGDSNS-FGYQTLPKEDXXXXXXXSEPTRFLDP 1386 KYLS + D SS +GD + + ++ G ++ S FGYQ D SEP R LDP Sbjct: 768 KYLSCIKADSSSTQGDGFRFPNASDNGFVENMSPFGYQ-----DRVSHSFASEPPRVLDP 822 Query: 1385 RLEESKKSMGSVSALKELCMMEGLSLVFLAQPPLSTNLMNKVEVYAQVEIGGQILGKGVG 1206 RLE KKS+GSV AL+ELC +EGL L F QP LS N K E+YAQVEI GQ+ GKG+G Sbjct: 823 RLEVFKKSVGSVGALRELCAIEGLGLAFQTQPQLSVNPGQKSEIYAQVEIDGQVFGKGIG 882 Query: 1205 ATWNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLL--HSFSSKRLKPEFPR-MLQRINS 1035 TW++AK QAAE AL LKS L R GSPR L FS+KRLKPE+ R + QR+ Sbjct: 883 PTWDDAKTQAAERALVALKSELAQFSHKRQGSPRSLQQQGFSNKRLKPEYSRGVQQRVPL 942 Query: 1034 STRYSNNAPPVP 999 S R+ N +P Sbjct: 943 SGRFPKNTSAMP 954 >ref|XP_003543063.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like isoform X1 [Glycine max] gi|571500215|ref|XP_006594604.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1-like isoform X2 [Glycine max] Length = 960 Score = 1104 bits (2855), Expect = 0.0 Identities = 599/967 (61%), Positives = 708/967 (73%), Gaps = 18/967 (1%) Frame = -3 Query: 3845 SVVYQGNSLLGEVEIYPQNQKICMIDKEIRISHFSQPSERCPPLAVLHTIAPSGVCFKME 3666 S+VY G +GEV+IYP+ K + KEIRISHFSQPSERCPPLAVLHTI G+CFKME Sbjct: 4 SMVYHGEMAVGEVKIYPEENKNMDL-KEIRISHFSQPSERCPPLAVLHTITSFGICFKME 62 Query: 3665 SKLQSK---DSPLFSLHSSCLREDKTAVMPLGEEELHLVAMSSRKNLEQYSCFWGFIVSP 3495 S K LF LHSSC+RE+KTAVMP+ EE+HLVAM SR N CFWGFIV+ Sbjct: 63 SSTSQKRQQQDALFHLHSSCIRENKTAVMPVRGEEIHLVAMYSRNN--DRPCFWGFIVAS 120 Query: 3494 GLYNSCLIMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEMDPQRVSGMLAE 3315 GLYNSCL MLNLRCLGIVFDLDETL+VANTMRSFED+I+ L RK+N+E++PQ++S M AE Sbjct: 121 GLYNSCLTMLNLRCLGIVFDLDETLVVANTMRSFEDKIEVLHRKMNSEVNPQQISAMQAE 180 Query: 3314 IKRYQDDKSILKQYVENDQVVENGKVIKVQSEVVPAFSDNYQPIVRPLIRLQEKNIILTR 3135 IKRY DDK+ILK+Y ENDQVV+NGKVIK+QSE VPA SD++QPIVRPLIRLQEKNIILTR Sbjct: 181 IKRYLDDKNILKEYAENDQVVDNGKVIKIQSESVPALSDSHQPIVRPLIRLQEKNIILTR 240 Query: 3134 INPLIRDTSVLVRLRPAWEDLRSYLTAKGRKRFEVYVCTMAERDYALEMWRLLDPESNLI 2955 INP IRDTSVLVRLRPAWEDLRSYLTA+GRKRFEV+VCTMAERDYALEMWRLLDPE NLI Sbjct: 241 INPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPELNLI 300 Query: 2954 NSKELLHRVVCVKAGSRKSLVNVFHDGICHPKMALVIDDRLKVWDEKDQSRVHVVPAFAP 2775 NSKELL R+VCVK+G +KSL NVF +G+CH KMALVIDDRLKVWDEKDQ RVHVVPAFAP Sbjct: 301 NSKELLDRIVCVKSGLKKSLFNVFQNGLCHLKMALVIDDRLKVWDEKDQPRVHVVPAFAP 360 Query: 2774 YYAPQAEANNVVPVLCVARNVACNVRGGFFKEFDEGLLQLIREVFYEDGLANFPSAPDVS 2595 YY PQAEA+N VP LC+ARNVACNVRGGFFK+FD+GLLQ I + YED + + PS PDVS Sbjct: 361 YYTPQAEASNAVPFLCLARNVACNVRGGFFKDFDDGLLQKIPLIAYEDDIKDIPS-PDVS 419 Query: 2594 NFLISEDDVSAS--NKDPLGFEGVTDVE-ERRLKDVIQASS----MANSLDPRPA--PSL 2442 N+L+SEDD SAS NK+ L F+G+ D E ERRLKD I ASS + ++DPR A SL Sbjct: 420 NYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTILALTANIDPRLAFTSSL 479 Query: 2441 QNAVASSLGTISQPTPQGPIMSSQNNQFTQ---VVKPLVHVGPSEPSLQGSPAREEGEVP 2271 Q + SS GT+ PT Q ++ N QF Q +VKP+ V SL SPAREEGE+P Sbjct: 480 QYTMVSSSGTVPPPTAQASVVQFGNVQFPQPNTLVKPMSQVTHPGLSLHSSPAREEGELP 539 Query: 2270 ESELDPDTRRRLLILQHGQDTRDHTSSEPPFXXXXXXXXXXXXXXPMQSRGSWFPLEEDR 2091 ESELD DTRRR LILQHGQDTR+ +SEPPF + SR WF +EE+ Sbjct: 540 ESELDLDTRRRFLILQHGQDTRERMASEPPF-PVRHPAQVSAPASSVPSRRGWFSVEEEM 598 Query: 2090 IPRKPNRAVPKSVPKESPLESETMHFDKRRARHPSFFHGAESSFPSDRMLNE-NQRFPKE 1914 P++ N VPKE P++SE H +KR RHPSFF S SDR+ +E +QR PKE Sbjct: 599 GPQQLN----LPVPKEFPVDSEPFHIEKRWPRHPSFFSKVGDSISSDRVFHESHQRLPKE 654 Query: 1913 VHQGDDRLRVNHSTSNYRSFPGEETPLGQSGSSDRDFHFESGRGSPPYTETPAGVLQEIG 1734 VH DDR R++ S S+Y S PG++ PL S S+RDF ESGR S + +T AGVLQEI Sbjct: 655 VHHRDDRSRLSQSLSSYHSLPGDDIPLSGSSYSNRDFDSESGR-SLFHADTTAGVLQEIA 713 Query: 1733 IKCGTKVEFRSALIASAELQFSIEVWFTGEKIGVGTGKTRKEAQRQAADSSLRNLANKYL 1554 + CGTKVEF S+L+AS ELQFSIE WF G+KIG G G+TR+EAQ +AA S++ LA+ Y+ Sbjct: 714 LNCGTKVEFLSSLVASTELQFSIEAWFAGKKIGEGFGRTRREAQSKAAGCSIKQLADIYM 773 Query: 1553 SNVMHDPSSVRGDLNKPSHVNEMG-LGDSNSFGYQTLPKEDXXXXXXXSEPTRFLDPRLE 1377 S+ D S GD++ N G + NS G Q LPKE+ SE +R D RLE Sbjct: 774 SHAKDDSGSTYGDVSGFHGSNNDGFVSSGNSLGNQLLPKEESGSFSTASESSRVSDSRLE 833 Query: 1376 ESKKSMGSVSALKELCMMEGLSLVFLAQP-PLSTNLMNKVEVYAQVEIGGQILGKGVGAT 1200 SK+S S+SALKELCMMEGL+ F + P ST+L K EV+AQVEI GQI GKG G T Sbjct: 834 VSKRSTDSISALKELCMMEGLAASFQSPPASASTHLTQKDEVHAQVEIDGQIFGKGFGVT 893 Query: 1199 WNEAKMQAAEEALWNLKSMLGDSGQNRVGSPRLLHSFSSKRLKPEFPRMLQRINSSTRYS 1020 W EAKMQAA++AL +L++M R GSPR + ++KRLKPE+P LQR+ S RY Sbjct: 894 WEEAKMQAAKKALGSLRTMFNQGSLKRHGSPRSMQGLANKRLKPEYPPTLQRVPYSARYP 953 Query: 1019 NNAPPVP 999 NAP VP Sbjct: 954 RNAPLVP 960