BLASTX nr result

ID: Akebia25_contig00005884

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00005884
         (1590 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform...   241   2e-66
ref|XP_007025681.1| C-terminal domain phosphatase-like 1 isoform...   242   3e-61
ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal doma...   216   4e-60
ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal doma...   228   6e-57
ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citr...   228   8e-57
ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Popu...   225   4e-56
gb|EXB82797.1| RNA polymerase II C-terminal domain phosphatase-l...   221   7e-55
ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Popu...   221   9e-55
ref|XP_007214548.1| hypothetical protein PRUPE_ppa000988mg [Prun...   197   1e-52
ref|XP_002519032.1| double-stranded RNA binding protein, putativ...   211   7e-52
ref|XP_007159305.1| hypothetical protein PHAVU_002G226900g [Phas...   209   3e-51
gb|EYU27926.1| hypothetical protein MIMGU_mgv1a000848mg [Mimulus...   196   9e-51
ref|XP_006827806.1| hypothetical protein AMTR_s00009p00267690 [A...   204   7e-50
ref|XP_003545893.1| PREDICTED: RNA polymerase II C-terminal doma...   192   3e-49
ref|XP_006597420.1| PREDICTED: RNA polymerase II C-terminal doma...   192   3e-49
ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal doma...   196   3e-47
ref|XP_003543063.1| PREDICTED: RNA polymerase II C-terminal doma...   196   3e-47
ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal doma...   195   6e-47
ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal doma...   194   7e-47
ref|XP_004505032.1| PREDICTED: RNA polymerase II C-terminal doma...   193   2e-46

>ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform 1 [Theobroma cacao]
            gi|508781046|gb|EOY28302.1| C-terminal domain
            phosphatase-like 1 isoform 1 [Theobroma cacao]
          Length = 978

 Score =  241 bits (616), Expect(2) = 2e-66
 Identities = 134/239 (56%), Positives = 167/239 (69%), Gaps = 4/239 (1%)
 Frame = -1

Query: 1590 SDSSKRDLHFELERGSPPYAETPTGVLQDIAVRCGAKVEFRPALLASTELQFSIEVWFAG 1411
            S SS RDL FE  R +    ET  GVLQDIA++CGAKVEFRPAL+AS +LQFSIE WFAG
Sbjct: 703  SSSSHRDLDFESGR-TVTSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFAG 761

Query: 1410 EKISEGIGKTRKEAQHQASERCIKNLADKYLSVTMPDRSAVLEDLSKLSHENEN----YS 1243
            EK+ EG+G+TR+EAQ QA+E  IKNLA+ YLS   PD  +   DLS+L + N+N      
Sbjct: 762  EKVGEGVGRTRREAQRQAAEESIKNLANTYLSRIKPDSGSAEGDLSRLHNINDNGFPSNV 821

Query: 1242 NSFGHQPFPKEEPMPISITSELSRLMDPRLEGSKKSVGAVSALTELCIMEGLTLGFQSQP 1063
            NSFG+Q   KEE +  S  SE SRL DPRLEGSKKS+G+V+AL ELC+MEGL + FQ QP
Sbjct: 822  NSFGNQLLAKEESLSFSTASEQSRLADPRLEGSKKSMGSVTALKELCMMEGLGVVFQPQP 881

Query: 1062 SLSTSSIHKGEARVQAEXXXXXXXXXXXLTWDEAKIRAAEEALGNLKSMLDQGTHKRLG 886
              S++++ K E   Q E           LTW+EAK++AAE+ALG+L+SML Q + KR G
Sbjct: 882  PSSSNALQKDEVYAQVEIDGQVLGKGTGLTWEEAKMQAAEKALGSLRSMLGQYSQKRQG 940



 Score = 40.0 bits (92), Expect(2) = 2e-66
 Identities = 19/38 (50%), Positives = 25/38 (65%)
 Frame = -2

Query: 887  GSPRLLQELPSKQLKPDFS*VLHPMPPAVRYSDKASPI 774
            GSPR LQ + +K+LKP+F  VL  MP + RY   A P+
Sbjct: 940  GSPRSLQGMQNKRLKPEFPRVLQRMPSSGRYPKNAPPV 977


>ref|XP_007025681.1| C-terminal domain phosphatase-like 1 isoform 2 [Theobroma cacao]
            gi|508781047|gb|EOY28303.1| C-terminal domain
            phosphatase-like 1 isoform 2 [Theobroma cacao]
          Length = 984

 Score =  242 bits (618), Expect = 3e-61
 Identities = 135/244 (55%), Positives = 169/244 (69%), Gaps = 4/244 (1%)
 Frame = -1

Query: 1590 SDSSKRDLHFELERGSPPYAETPTGVLQDIAVRCGAKVEFRPALLASTELQFSIEVWFAG 1411
            S SS RDL FE  R +    ET  GVLQDIA++CGAKVEFRPAL+AS +LQFSIE WFAG
Sbjct: 703  SSSSHRDLDFESGR-TVTSGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFAG 761

Query: 1410 EKISEGIGKTRKEAQHQASERCIKNLADKYLSVTMPDRSAVLEDLSKLSHENEN----YS 1243
            EK+ EG+G+TR+EAQ QA+E  IKNLA+ YLS   PD  +   DLS+L + N+N      
Sbjct: 762  EKVGEGVGRTRREAQRQAAEESIKNLANTYLSRIKPDSGSAEGDLSRLHNINDNGFPSNV 821

Query: 1242 NSFGHQPFPKEEPMPISITSELSRLMDPRLEGSKKSVGAVSALTELCIMEGLTLGFQSQP 1063
            NSFG+Q   KEE +  S  SE SRL DPRLEGSKKS+G+V+AL ELC+MEGL + FQ QP
Sbjct: 822  NSFGNQLLAKEESLSFSTASEQSRLADPRLEGSKKSMGSVTALKELCMMEGLGVVFQPQP 881

Query: 1062 SLSTSSIHKGEARVQAEXXXXXXXXXXXLTWDEAKIRAAEEALGNLKSMLDQGTHKRLGF 883
              S++++ K E   Q E           LTW+EAK++AAE+ALG+L+SML Q + KR G 
Sbjct: 882  PSSSNALQKDEVYAQVEIDGQVLGKGTGLTWEEAKMQAAEKALGSLRSMLGQYSQKRQGS 941

Query: 882  SKVV 871
             + V
Sbjct: 942  PRCV 945


>ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Fragaria vesca subsp. vesca]
          Length = 955

 Score =  216 bits (551), Expect(2) = 4e-60
 Identities = 123/230 (53%), Positives = 153/230 (66%), Gaps = 4/230 (1%)
 Frame = -1

Query: 1590 SDSSKRDLHFELERGSPPYAETPTGVLQDIAVRCGAKVEFRPALLASTELQFSIEVWFAG 1411
            S SS RD  +E  R     AETP GVLQ+IA++CG KVEFRPAL+ STELQF +E WFAG
Sbjct: 683  SSSSNRDFDYESGRAISN-AETPAGVLQEIAMKCGTKVEFRPALVPSTELQFYVEAWFAG 741

Query: 1410 EKISEGIGKTRKEAQHQASERCIKNLADKYLSVTMPDRSAVLEDLSKLSHENEN----YS 1243
            EKI EG G+TR+EA  QA+E  +KNLA+ Y+S   PD   +  D SK S+   N      
Sbjct: 742  EKIGEGTGRTRREAHFQAAEGSLKNLANIYISRGKPDALPIHGDASKFSNVTNNGFMGNM 801

Query: 1242 NSFGHQPFPKEEPMPISITSELSRLMDPRLEGSKKSVGAVSALTELCIMEGLTLGFQSQP 1063
            NSFG QP PKE+ +  S +SE SR +DPRL+ S+KSV +VSAL ELC MEGL++ +Q +P
Sbjct: 802  NSFGTQPLPKEDSLSSSTSSEPSRPLDPRLDNSRKSVSSVSALKELCTMEGLSVLYQPRP 861

Query: 1062 SLSTSSIHKGEARVQAEXXXXXXXXXXXLTWDEAKIRAAEEALGNLKSML 913
                +S  K E  VQAE           LTWDEAK++AAE+ALGNL+S L
Sbjct: 862  P-PPNSTEKDEVHVQAEIDGEVLGKGIGLTWDEAKMQAAEKALGNLRSTL 910



 Score = 43.9 bits (102), Expect(2) = 4e-60
 Identities = 21/38 (55%), Positives = 26/38 (68%)
 Frame = -2

Query: 887  GSPRLLQELPSKQLKPDFS*VLHPMPPAVRYSDKASPI 774
            GSPR LQ +PSK+LK +F  VL  MP + RYS  A P+
Sbjct: 917  GSPRPLQGMPSKRLKQEFPQVLQRMPSSTRYSKNAPPV 954


>ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Citrus sinensis]
          Length = 957

 Score =  228 bits (581), Expect = 6e-57
 Identities = 131/248 (52%), Positives = 163/248 (65%), Gaps = 4/248 (1%)
 Frame = -1

Query: 1590 SDSSKRDLHFELERGSPPYAETPTGVLQDIAVRCGAKVEFRPALLASTELQFSIEVWFAG 1411
            S SS RD+ FE  R      ETP+GVLQDIA++CG KVEFRPAL+ASTELQFSIE WFAG
Sbjct: 686  SSSSSRDVDFESGRDVSS-TETPSGVLQDIAMKCGTKVEFRPALVASTELQFSIEAWFAG 744

Query: 1410 EKISEGIGKTRKEAQHQASERCIKNLADKYLSVTMPDRSAVLEDLSKLSHENEN----YS 1243
            EKI EGIG+TR+EAQ QA+E  IK+LA+ Y+     D  +   D S+ S+ NEN      
Sbjct: 745  EKIGEGIGRTRREAQRQAAEGSIKHLANVYMLRVKSDSGSGHGDGSRFSNANENCFMGEI 804

Query: 1242 NSFGHQPFPKEEPMPISITSELSRLMDPRLEGSKKSVGAVSALTELCIMEGLTLGFQSQP 1063
            NSFG QP  K+E    S++SE S+L+DPRLEGSKK +G+VSAL ELC+ EGL + FQ QP
Sbjct: 805  NSFGGQPLAKDE----SLSSEPSKLVDPRLEGSKKLMGSVSALKELCMTEGLGVVFQQQP 860

Query: 1062 SLSTSSIHKGEARVQAEXXXXXXXXXXXLTWDEAKIRAAEEALGNLKSMLDQGTHKRLGF 883
              S +S+ K E   Q E            TWDEAK++AAE+ALG+L+SM  Q   K  G 
Sbjct: 861  PSSANSVQKDEVYAQVEIDGQVLGKGIGSTWDEAKMQAAEKALGSLRSMFGQFPQKHQGS 920

Query: 882  SKVVTRTP 859
             + +   P
Sbjct: 921  PRSLQGMP 928


>ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citrus clementina]
            gi|557551913|gb|ESR62542.1| hypothetical protein
            CICLE_v10014168mg [Citrus clementina]
          Length = 957

 Score =  228 bits (580), Expect = 8e-57
 Identities = 131/248 (52%), Positives = 163/248 (65%), Gaps = 4/248 (1%)
 Frame = -1

Query: 1590 SDSSKRDLHFELERGSPPYAETPTGVLQDIAVRCGAKVEFRPALLASTELQFSIEVWFAG 1411
            S SS RD+ FE  R      ETP+GVLQDIA++CG KVEFRPAL+ASTELQFSIE WFAG
Sbjct: 686  SSSSSRDVDFESGRDVSS-TETPSGVLQDIAMKCGTKVEFRPALVASTELQFSIEAWFAG 744

Query: 1410 EKISEGIGKTRKEAQHQASERCIKNLADKYLSVTMPDRSAVLEDLSKLSHENEN----YS 1243
            EKI EGIG+TR+EAQ QA+E  IK+LA+ Y+     D  +   D S+ S+ NEN      
Sbjct: 745  EKIGEGIGRTRREAQRQAAEGSIKHLANVYVLRVKSDSGSGHGDGSRFSNANENCFMGEI 804

Query: 1242 NSFGHQPFPKEEPMPISITSELSRLMDPRLEGSKKSVGAVSALTELCIMEGLTLGFQSQP 1063
            NSFG QP  K+E    S++SE S+L+DPRLEGSKK +G+VSAL ELC+ EGL + FQ QP
Sbjct: 805  NSFGGQPLAKDE----SLSSEPSKLVDPRLEGSKKLMGSVSALKELCMTEGLGVVFQQQP 860

Query: 1062 SLSTSSIHKGEARVQAEXXXXXXXXXXXLTWDEAKIRAAEEALGNLKSMLDQGTHKRLGF 883
              S +S+ K E   Q E            TWDEAK++AAE+ALG+L+SM  Q   K  G 
Sbjct: 861  PSSANSVQKDEVYAQVEIDGQVLGKGIGSTWDEAKMQAAEKALGSLRSMFGQFPQKHQGS 920

Query: 882  SKVVTRTP 859
             + +   P
Sbjct: 921  PRSLQGMP 928


>ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Populus trichocarpa]
            gi|550340277|gb|EEE85528.2| hypothetical protein
            POPTR_0004s04010g [Populus trichocarpa]
          Length = 996

 Score =  225 bits (574), Expect = 4e-56
 Identities = 127/246 (51%), Positives = 161/246 (65%), Gaps = 4/246 (1%)
 Frame = -1

Query: 1584 SSKRDLHFELERGSPPYAETPTGVLQDIAVRCGAKVEFRPALLASTELQFSIEVWFAGEK 1405
            SS RDL  E ER      ETP  VLQ+IA++CG KVEFRPAL+A+++LQFSIE WF GEK
Sbjct: 723  SSNRDLDLESERAFSS-TETPVEVLQEIAMKCGTKVEFRPALIATSDLQFSIETWFVGEK 781

Query: 1404 ISEGIGKTRKEAQHQASERCIKNLADKYLSVTMPDRSAVLEDLSKLSHENENY----SNS 1237
            + EG GKTR+EAQ QA+E  IK LA  Y+S   PD   +L D S+    N+N      NS
Sbjct: 782  VGEGTGKTRREAQRQAAEGSIKKLAGIYMSRVKPDSGPMLGDSSRYPSANDNGFLGDMNS 841

Query: 1236 FGHQPFPKEEPMPISITSELSRLMDPRLEGSKKSVGAVSALTELCIMEGLTLGFQSQPSL 1057
            FG+QP  K+E +  S TSE SRL+D RLEGSKKS+G+V+AL E C+ EGL + F +Q  L
Sbjct: 842  FGNQPLLKDENITYSATSEPSRLLDQRLEGSKKSMGSVTALKEFCMTEGLGVNFLAQTPL 901

Query: 1056 STSSIHKGEARVQAEXXXXXXXXXXXLTWDEAKIRAAEEALGNLKSMLDQGTHKRLGFSK 877
            ST+SI   E   Q E           LTWDEAK++AAE+ALG+L++M  Q T KR G  +
Sbjct: 902  STNSIPGEEVHAQVEIDGQVLGKGIGLTWDEAKMQAAEKALGSLRTMFGQYTPKRQGSPR 961

Query: 876  VVTRTP 859
            ++   P
Sbjct: 962  LMQGMP 967


>gb|EXB82797.1| RNA polymerase II C-terminal domain phosphatase-like 1 [Morus
            notabilis]
          Length = 440

 Score =  221 bits (563), Expect = 7e-55
 Identities = 123/232 (53%), Positives = 151/232 (65%), Gaps = 4/232 (1%)
 Frame = -1

Query: 1590 SDSSKRDLHFELERGSPPYAETPTGVLQDIAVRCGAKVEFRPALLASTELQFSIEVWFAG 1411
            S SS R+L F+    +   AETP GVLQ+I ++CG KVEFRPAL+A  ELQFS+E WFAG
Sbjct: 167  SFSSNRELDFD-SGPAVSNAETPAGVLQEIGMKCGTKVEFRPALVACAELQFSVEAWFAG 225

Query: 1410 EKISEGIGKTRKEAQHQASERCIKNLADKYLSVTMPDRSAVLEDLSKLSHENEN----YS 1243
            EKI EGIG+TR+EAQ QA+E  +KNLAD YLS   PD  +++ D++K    N+N      
Sbjct: 226  EKIGEGIGRTRREAQLQAAEISLKNLADMYLSRVKPDSGSLVVDMTKFPDANDNGFVSNV 285

Query: 1242 NSFGHQPFPKEEPMPISITSELSRLMDPRLEGSKKSVGAVSALTELCIMEGLTLGFQSQP 1063
            NSFG   FPKEE +  S  SE SRL   RLEGSKKS+ +VSAL E C+ EGL L F  QP
Sbjct: 286  NSFGSHSFPKEESLSYSTASEPSRLFGARLEGSKKSMSSVSALKEYCMTEGLGLAFHPQP 345

Query: 1062 SLSTSSIHKGEARVQAEXXXXXXXXXXXLTWDEAKIRAAEEALGNLKSMLDQ 907
              S   I K E   Q E           +TWDEAK++AAE+ALG+L+SM  Q
Sbjct: 346  LPSNGPIQKDEVYAQVEIDGQVLGKGIGMTWDEAKLQAAEKALGSLRSMYGQ 397


>ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Populus trichocarpa]
            gi|550327613|gb|ERP55122.1| hypothetical protein
            POPTR_0011s04910g [Populus trichocarpa]
          Length = 990

 Score =  221 bits (562), Expect = 9e-55
 Identities = 125/246 (50%), Positives = 155/246 (63%), Gaps = 4/246 (1%)
 Frame = -1

Query: 1584 SSKRDLHFELERGSPPYAETPTGVLQDIAVRCGAKVEFRPALLASTELQFSIEVWFAGEK 1405
            SS RDL  E ER     +ETP  VLQ+IA++C  KVEFRPAL+AS +LQFSIE WFAGEK
Sbjct: 717  SSNRDLDLESERAFT-ISETPVEVLQEIAMKCETKVEFRPALVASIDLQFSIEAWFAGEK 775

Query: 1404 ISEGIGKTRKEAQHQASERCIKNLADKYLSVTMPDRSAVLEDLSKLSHENEN----YSNS 1237
            + EG GKTR+EAQ QA+E  IK LA  Y+    PD   +  D S+    N+N      N 
Sbjct: 776  VGEGTGKTRREAQRQAAEGSIKKLAGIYMLRAKPDSGPMHGDSSRYPSANDNGFLGNMNL 835

Query: 1236 FGHQPFPKEEPMPISITSELSRLMDPRLEGSKKSVGAVSALTELCIMEGLTLGFQSQPSL 1057
            FG+QP PK+E +  S  SE SRL+DPRLEGSKKS G+V+AL E C MEGL + F +Q  L
Sbjct: 836  FGNQPLPKDELVAYSAASEPSRLLDPRLEGSKKSSGSVTALKEFCTMEGLVVNFLAQTPL 895

Query: 1056 STSSIHKGEARVQAEXXXXXXXXXXXLTWDEAKIRAAEEALGNLKSMLDQGTHKRLGFSK 877
            S +SI   E   Q E            TWDEAK++AAE+ALG+L++M  Q T KR G  +
Sbjct: 896  SANSIPGEEVHAQVEIDGQVLGKGIGSTWDEAKMQAAEKALGSLRTMFGQYTQKRQGSPR 955

Query: 876  VVTRTP 859
             +   P
Sbjct: 956  PMQGMP 961


>ref|XP_007214548.1| hypothetical protein PRUPE_ppa000988mg [Prunus persica]
            gi|462410413|gb|EMJ15747.1| hypothetical protein
            PRUPE_ppa000988mg [Prunus persica]
          Length = 940

 Score =  197 bits (500), Expect(2) = 1e-52
 Identities = 117/230 (50%), Positives = 144/230 (62%), Gaps = 4/230 (1%)
 Frame = -1

Query: 1590 SDSSKRDLHFELERGSPPYAETPTGVLQDIAVRCGAKVEFRPALLASTELQFSIEVWFAG 1411
            S SS RD+ FE  R     AETP GVLQ+IA++CGAK                   WFAG
Sbjct: 685  SSSSNRDVDFESGRAISN-AETPAGVLQEIAMKCGAKA------------------WFAG 725

Query: 1410 EKISEGIGKTRKEAQHQASERCIKNLADKYLSVTMPDRSAVLEDLSKLSHENEN----YS 1243
            EKI EG GKTR+EA +QA+E  +KNLA+ YLS   PD  +V  D++K  + N N      
Sbjct: 726  EKIGEGSGKTRREAHYQAAEGSLKNLANIYLSRVKPDSVSVHGDMNKFPNVNSNGFAGNL 785

Query: 1242 NSFGHQPFPKEEPMPISITSELSRLMDPRLEGSKKSVGAVSALTELCIMEGLTLGFQSQP 1063
            NSFG QPFPKEE +  S +SE SR +DPRLEGSKKS+ +VS L ELC+MEGL + FQ +P
Sbjct: 786  NSFGIQPFPKEESLSSSTSSEPSRPLDPRLEGSKKSMSSVSTLKELCMMEGLGVVFQPRP 845

Query: 1062 SLSTSSIHKGEARVQAEXXXXXXXXXXXLTWDEAKIRAAEEALGNLKSML 913
              ST+S+ K E  VQ E           LTWDEAK++AAE+ALG+L S L
Sbjct: 846  PPSTNSVEKDEVHVQVEIDGEVLGKGIGLTWDEAKMQAAEKALGSLTSTL 895



 Score = 38.5 bits (88), Expect(2) = 1e-52
 Identities = 18/38 (47%), Positives = 24/38 (63%)
 Frame = -2

Query: 887  GSPRLLQELPSKQLKPDFS*VLHPMPPAVRYSDKASPI 774
            GSPR LQ + SK++K +F  VL  MP + RY   A P+
Sbjct: 902  GSPRSLQGMSSKRMKQEFPQVLQRMPSSARYPKNAPPV 939


>ref|XP_002519032.1| double-stranded RNA binding protein, putative [Ricinus communis]
            gi|223541695|gb|EEF43243.1| double-stranded RNA binding
            protein, putative [Ricinus communis]
          Length = 978

 Score =  211 bits (537), Expect = 7e-52
 Identities = 117/248 (47%), Positives = 159/248 (64%), Gaps = 4/248 (1%)
 Frame = -1

Query: 1590 SDSSKRDLHFELERGSPPYAETPTGVLQDIAVRCGAKVEFRPALLASTELQFSIEVWFAG 1411
            S SS RDL  E +R     AETP  VL +I+++CGAKVEF+ +L+ S +LQFS+E WFAG
Sbjct: 703  SSSSNRDLDVESDRAVSS-AETPVRVLHEISMKCGAKVEFKHSLVNSRDLQFSVEAWFAG 761

Query: 1410 EKISEGIGKTRKEAQHQASERCIKNLADKYLSVTMPDRSAVLEDLSKLSHENEN----YS 1243
            E++ EG G+TR+EAQ  A+E  IKNLA+ Y+S   PD  A+  D SK S  N+N    + 
Sbjct: 762  ERVGEGFGRTRREAQSVAAEASIKNLANIYISRAKPDNGALHGDASKYSSANDNGFLGHV 821

Query: 1242 NSFGHQPFPKEEPMPISITSELSRLMDPRLEGSKKSVGAVSALTELCIMEGLTLGFQSQP 1063
            NSFG QP PK+E +  S +SE S L+DPRLE SKKS+ +V+AL E C+MEGL + F +Q 
Sbjct: 822  NSFGSQPLPKDEILSYSDSSEQSGLLDPRLESSKKSMSSVNALKEFCMMEGLGVNFLAQT 881

Query: 1062 SLSTSSIHKGEARVQAEXXXXXXXXXXXLTWDEAKIRAAEEALGNLKSMLDQGTHKRLGF 883
             LS++S+   E   Q E            T+DEAK++AAE+ALG+L++   +   KR G 
Sbjct: 882  PLSSNSVQNAEVHAQVEIDGQVMGKGIGSTFDEAKMQAAEKALGSLRTTFGRFPPKRQGS 941

Query: 882  SKVVTRTP 859
             + V   P
Sbjct: 942  PRPVPGMP 949


>ref|XP_007159305.1| hypothetical protein PHAVU_002G226900g [Phaseolus vulgaris]
            gi|561032720|gb|ESW31299.1| hypothetical protein
            PHAVU_002G226900g [Phaseolus vulgaris]
          Length = 964

 Score =  209 bits (532), Expect = 3e-51
 Identities = 117/238 (49%), Positives = 157/238 (65%), Gaps = 5/238 (2%)
 Frame = -1

Query: 1590 SDSSKRDLHFELERGSPPYAETPTGVLQDIAVRCGAKVEFRPALLASTELQFSIEVWFAG 1411
            S SS RDL  E    S  +A+TP  VLQ+IA++CG KVEF  +L+ASTELQFSIE WF+G
Sbjct: 688  SSSSHRDLDSESSH-SVFHADTPVVVLQEIALKCGTKVEFMSSLVASTELQFSIEAWFSG 746

Query: 1410 EKISEGIGKTRKEAQHQASERCIKNLADKYLSVTMPDRSAVLEDLSKLSHENEN----YS 1243
            +KI  G G+TRKEAQH+A+E  IK+LAD YLS    +  +   D+    + N+N     +
Sbjct: 747  KKIGHGFGRTRKEAQHKAAEDSIKHLADIYLSSAKDEPGSTYGDVGGFPNANDNGYMVIA 806

Query: 1242 NSFGHQPFPKEEPMPISITSELSRLMDPRLEGSKKSVGAVSALTELCIMEGLTLGFQSQP 1063
            +S  +QP PKE+    S  S+ SR++DPRLE SK+ +G++SAL ELC+MEGL + F S P
Sbjct: 807  SSLSNQPLPKEDSASFSTASDPSRVLDPRLEVSKRPMGSISALKELCMMEGLGVNFLSAP 866

Query: 1062 S-LSTSSIHKGEARVQAEXXXXXXXXXXXLTWDEAKIRAAEEALGNLKSMLDQGTHKR 892
            + +ST+S+ K E   Q E           LTWDEAK++AAE+ALG+L+S L Q   KR
Sbjct: 867  APVSTNSLQKDEVHAQVEIDGKVFGKGIGLTWDEAKMQAAEKALGSLRSKLGQSIQKR 924


>gb|EYU27926.1| hypothetical protein MIMGU_mgv1a000848mg [Mimulus guttatus]
          Length = 962

 Score =  196 bits (499), Expect(2) = 9e-51
 Identities = 112/240 (46%), Positives = 151/240 (62%), Gaps = 6/240 (2%)
 Frame = -1

Query: 1581 SKRDLHFELERGS-PPYAETPTGVLQDIAVRCGAKVEFRPALLASTELQFSIEVWFAGEK 1405
            S  +  F+LE G   PY ET  G LQDIA +CG KVEF+  L++ST LQF +EV FAGE+
Sbjct: 686  SSANKDFDLEAGQIDPYIETCIGALQDIAFKCGTKVEFKQTLISSTGLQFFVEVLFAGER 745

Query: 1404 ISEGIGKTRKEAQHQASERCIKNLADKYLSVTMPDRSAVLEDLSKLSHENEN----YSNS 1237
            I EG+G+TR+EAQ QA+E  +  LADKYLS + PD + V  D S++ ++ EN     +NS
Sbjct: 746  IGEGMGRTRREAQRQAAEGSLLYLADKYLSRSRPDFNYVPGDGSRVGNQKENGFNSNANS 805

Query: 1236 FGHQPFPKEEPMPISITSELSRLMDPRLEGSKKSV-GAVSALTELCIMEGLTLGFQSQPS 1060
            FG+QP P EE +P S  +   R++DPR E SK+ + G+++AL E C MEGL + FQ+QP 
Sbjct: 806  FGYQPLPNEEGLPFSTVAAPPRIVDPRTEVSKRPIMGSITALKEFCTMEGLGVTFQTQPQ 865

Query: 1059 LSTSSIHKGEARVQAEXXXXXXXXXXXLTWDEAKIRAAEEALGNLKSMLDQGTHKRLGFS 880
             S +   + E   Q E           LTWDEA+ +AAE+AL  LKSM  Q  ++  G S
Sbjct: 866  FSANPGQRNEVYAQVEVNGQVLGKGIGLTWDEARSQAAEKALVTLKSMPGQFPYRHQGSS 925



 Score = 32.7 bits (73), Expect(2) = 9e-51
 Identities = 14/37 (37%), Positives = 23/37 (62%)
 Frame = -2

Query: 884  SPRLLQELPSKQLKPDFS*VLHPMPPAVRYSDKASPI 774
            SPR +Q +P+K++K +F+ V   +P   RY    SP+
Sbjct: 925  SPRSMQSIPNKRVKQEFNRVSQRLPSFGRYPRNGSPV 961


>ref|XP_006827806.1| hypothetical protein AMTR_s00009p00267690 [Amborella trichopoda]
            gi|548832426|gb|ERM95222.1| hypothetical protein
            AMTR_s00009p00267690 [Amborella trichopoda]
          Length = 942

 Score =  204 bits (520), Expect = 7e-50
 Identities = 115/235 (48%), Positives = 150/235 (63%), Gaps = 2/235 (0%)
 Frame = -1

Query: 1590 SDSSKRDLHFELERGSPPYAETPTGVLQDIAVRCGAKVEFRPALLASTELQFSIEVWFAG 1411
            S S+ RD+ F   +  P Y+ TP GVL+DIA++CG+KV+FR  ++ +TELQFS+EVWF G
Sbjct: 683  SSSNTRDVPFATGQVPPQYSPTPVGVLKDIAIKCGSKVDFRSMVVPTTELQFSVEVWFVG 742

Query: 1410 EKISEGIGKTRKEAQHQASERCIKNLADKYLSVTMPDRSAVLEDLS--KLSHENENYSNS 1237
            EKI EGIGKTRKEAQ +ASE  I+ LA  YL+   PD      D+    L  +N    +S
Sbjct: 743  EKIGEGIGKTRKEAQFKASEASIRTLARTYLAQISPDIGLGCGDMDDRSLGSDNGLMGDS 802

Query: 1236 FGHQPFPKEEPMPISITSELSRLMDPRLEGSKKSVGAVSALTELCIMEGLTLGFQSQPSL 1057
                   +E+ +PI+ TSE  R +D RLEGSK+S+G VS+L ELC +EGL+L F+  P  
Sbjct: 803  ISSAGL-REDSLPIASTSEQQRFLDQRLEGSKQSIGVVSSLKELCSVEGLSLVFKELP-- 859

Query: 1056 STSSIHKGEARVQAEXXXXXXXXXXXLTWDEAKIRAAEEALGNLKSMLDQGTHKR 892
             T S HKGE   Q E            +W+EAKI+AAE+ALG+LKS L Q T KR
Sbjct: 860  PTGSNHKGEVYAQVEIAGRVLGEGVGSSWEEAKIQAAEDALGSLKSSLIQRTQKR 914


>ref|XP_003545893.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
          Length = 958

 Score =  192 bits (487), Expect(2) = 3e-49
 Identities = 113/241 (46%), Positives = 151/241 (62%), Gaps = 6/241 (2%)
 Frame = -1

Query: 1590 SDSSKRDLHFELERG-SPPYAETPTGVLQDIAVRCGAKVEFRPALLASTELQFSIEVWFA 1414
            S SS  +  F+ E G S  +A+   GVLQ+IA++CG KVEF  +L+AST LQFSIE WFA
Sbjct: 681  SGSSYSNRDFDSESGRSLFHADITAGVLQEIALKCGTKVEFLSSLVASTALQFSIEAWFA 740

Query: 1413 GEKISEGIGKTRKEAQHQASERCIKNLADKYLSVTMPDRSAVLEDLSKLSHENEN----Y 1246
            G+K+ EG G+TR+EAQ++A+E  IK LAD Y+S    D  +   D+S     N N     
Sbjct: 741  GKKVGEGFGRTRREAQNKAAECSIKQLADIYMSHAKDDSGSTYGDVSGFHGSNNNGFVSS 800

Query: 1245 SNSFGHQPFPKEEPMPISITSELSRLMDPRLEGSKKSVGAVSALTELCIMEGLTLGFQSQ 1066
             NS G+Q  PKE  +  S +S+ SR+ DPRLE SK+S  ++SAL E C+MEGL   FQS 
Sbjct: 801  GNSLGNQLLPKES-VSFSTSSDSSRVSDPRLEVSKRSTDSISALKEFCMMEGLAANFQSS 859

Query: 1065 PS-LSTSSIHKGEARVQAEXXXXXXXXXXXLTWDEAKIRAAEEALGNLKSMLDQGTHKRL 889
            P+  ST    K E   Q E           LTW+EAK++AA++AL +L++M +QGT KR 
Sbjct: 860  PAPASTHFAQKDEVHAQVEIDGQIFGKGFGLTWEEAKMQAAKKALESLRTMFNQGTRKRH 919

Query: 888  G 886
            G
Sbjct: 920  G 920



 Score = 32.3 bits (72), Expect(2) = 3e-49
 Identities = 17/40 (42%), Positives = 25/40 (62%)
 Frame = -2

Query: 887  GSPRLLQELPSKQLKPDFS*VLHPMPPAVRYSDKASPIVP 768
            GSPR +Q L +K+LK ++   L  +P + RY   A P+VP
Sbjct: 920  GSPRSMQGLANKRLKQEYPRTLQRIPYSARYPRNA-PLVP 958


>ref|XP_006597420.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X2 [Glycine max]
          Length = 937

 Score =  192 bits (487), Expect(2) = 3e-49
 Identities = 113/241 (46%), Positives = 151/241 (62%), Gaps = 6/241 (2%)
 Frame = -1

Query: 1590 SDSSKRDLHFELERG-SPPYAETPTGVLQDIAVRCGAKVEFRPALLASTELQFSIEVWFA 1414
            S SS  +  F+ E G S  +A+   GVLQ+IA++CG KVEF  +L+AST LQFSIE WFA
Sbjct: 660  SGSSYSNRDFDSESGRSLFHADITAGVLQEIALKCGTKVEFLSSLVASTALQFSIEAWFA 719

Query: 1413 GEKISEGIGKTRKEAQHQASERCIKNLADKYLSVTMPDRSAVLEDLSKLSHENEN----Y 1246
            G+K+ EG G+TR+EAQ++A+E  IK LAD Y+S    D  +   D+S     N N     
Sbjct: 720  GKKVGEGFGRTRREAQNKAAECSIKQLADIYMSHAKDDSGSTYGDVSGFHGSNNNGFVSS 779

Query: 1245 SNSFGHQPFPKEEPMPISITSELSRLMDPRLEGSKKSVGAVSALTELCIMEGLTLGFQSQ 1066
             NS G+Q  PKE  +  S +S+ SR+ DPRLE SK+S  ++SAL E C+MEGL   FQS 
Sbjct: 780  GNSLGNQLLPKES-VSFSTSSDSSRVSDPRLEVSKRSTDSISALKEFCMMEGLAANFQSS 838

Query: 1065 PS-LSTSSIHKGEARVQAEXXXXXXXXXXXLTWDEAKIRAAEEALGNLKSMLDQGTHKRL 889
            P+  ST    K E   Q E           LTW+EAK++AA++AL +L++M +QGT KR 
Sbjct: 839  PAPASTHFAQKDEVHAQVEIDGQIFGKGFGLTWEEAKMQAAKKALESLRTMFNQGTRKRH 898

Query: 888  G 886
            G
Sbjct: 899  G 899



 Score = 32.3 bits (72), Expect(2) = 3e-49
 Identities = 17/40 (42%), Positives = 25/40 (62%)
 Frame = -2

Query: 887  GSPRLLQELPSKQLKPDFS*VLHPMPPAVRYSDKASPIVP 768
            GSPR +Q L +K+LK ++   L  +P + RY   A P+VP
Sbjct: 899  GSPRSMQGLANKRLKQEYPRTLQRIPYSARYPRNA-PLVP 937


>ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
          Length = 956

 Score =  196 bits (497), Expect = 3e-47
 Identities = 112/238 (47%), Positives = 159/238 (66%), Gaps = 5/238 (2%)
 Frame = -1

Query: 1590 SDSSKRDLHFELERGSPPYAETPTGVLQDIAVRCGAKVEFRPALLASTELQFSIEVWFAG 1411
            S SS RDL  E    S  +A+TP  VLQ+IA++CG KV+F  +L+ASTELQFS+E WF+G
Sbjct: 681  SFSSHRDLDSESGH-SVLHADTPVAVLQEIALKCGTKVDFISSLVASTELQFSMEAWFSG 739

Query: 1410 EKISEGIGKTRKEAQHQASERCIKNLADKYLSVTMPDRSAVLEDLSKLSHENEN----YS 1243
            +KI   +G+TRKEAQ++A+E  IK+LAD YLS    +  +   D+S   + N++     +
Sbjct: 740  KKIGHRVGRTRKEAQNKAAEDSIKHLADIYLSSAKDEPGSTYGDVSGFPNVNDSGYMGIA 799

Query: 1242 NSFGHQPFPKEEPMPISITSELSRLMDPRLEGSKKSVGAVSALTELCIMEGLTLGFQSQP 1063
            +S G+QP  KE+    S T+  SR++DPRL+ SK+S+G++S+L ELC+MEGL + F S P
Sbjct: 800  SSLGNQPLSKEDSASFS-TASPSRVLDPRLDVSKRSMGSISSLKELCMMEGLDVNFLSAP 858

Query: 1062 S-LSTSSIHKGEARVQAEXXXXXXXXXXXLTWDEAKIRAAEEALGNLKSMLDQGTHKR 892
            + +ST+S+ K E   Q E           LTWDEAK++AAE+ALG+L+S L Q   KR
Sbjct: 859  APVSTNSVQKDEVHAQVEIDGKVFGKGIGLTWDEAKMQAAEKALGSLRSKLGQSIQKR 916


>ref|XP_003543063.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
            gi|571500215|ref|XP_006594604.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 1-like
            isoform X2 [Glycine max]
          Length = 960

 Score =  196 bits (497), Expect = 3e-47
 Identities = 116/241 (48%), Positives = 150/241 (62%), Gaps = 6/241 (2%)
 Frame = -1

Query: 1590 SDSSKRDLHFELERG-SPPYAETPTGVLQDIAVRCGAKVEFRPALLASTELQFSIEVWFA 1414
            S SS  +  F+ E G S  +A+T  GVLQ+IA+ CG KVEF  +L+ASTELQFSIE WFA
Sbjct: 682  SGSSYSNRDFDSESGRSLFHADTTAGVLQEIALNCGTKVEFLSSLVASTELQFSIEAWFA 741

Query: 1413 GEKISEGIGKTRKEAQHQASERCIKNLADKYLSVTMPDRSAVLEDLSKLSHENE----NY 1246
            G+KI EG G+TR+EAQ +A+   IK LAD Y+S    D  +   D+S     N     + 
Sbjct: 742  GKKIGEGFGRTRREAQSKAAGCSIKQLADIYMSHAKDDSGSTYGDVSGFHGSNNDGFVSS 801

Query: 1245 SNSFGHQPFPKEEPMPISITSELSRLMDPRLEGSKKSVGAVSALTELCIMEGLTLGFQSQ 1066
             NS G+Q  PKEE    S  SE SR+ D RLE SK+S  ++SAL ELC+MEGL   FQS 
Sbjct: 802  GNSLGNQLLPKEESGSFSTASESSRVSDSRLEVSKRSTDSISALKELCMMEGLAASFQSP 861

Query: 1065 P-SLSTSSIHKGEARVQAEXXXXXXXXXXXLTWDEAKIRAAEEALGNLKSMLDQGTHKRL 889
            P S ST    K E   Q E           +TW+EAK++AA++ALG+L++M +QG+ KR 
Sbjct: 862  PASASTHLTQKDEVHAQVEIDGQIFGKGFGVTWEEAKMQAAKKALGSLRTMFNQGSLKRH 921

Query: 888  G 886
            G
Sbjct: 922  G 922


>ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Solanum tuberosum]
          Length = 953

 Score =  195 bits (495), Expect = 6e-47
 Identities = 117/236 (49%), Positives = 145/236 (61%), Gaps = 1/236 (0%)
 Frame = -1

Query: 1590 SDSSKRDLHFELERGS-PPYAETPTGVLQDIAVRCGAKVEFRPALLASTELQFSIEVWFA 1414
            S SS R L  +LE G   PY ETP G LQDIA +CGAKVEFR + L+S ELQFS+EV FA
Sbjct: 681  SSSSNRVL--DLEPGHYDPYLETPAGALQDIAFKCGAKVEFRSSFLSSPELQFSLEVLFA 738

Query: 1413 GEKISEGIGKTRKEAQHQASERCIKNLADKYLSVTMPDRSAVLEDLSKLSHENENYSNSF 1234
            GEK+ EG G+TR+EAQ +A+E  +  LADKYLS   PD S+   D  +  + ++N     
Sbjct: 739  GEKVGEGTGRTRREAQRRAAEESLMYLADKYLSCIKPDSSSTQGDGFRFPNASDN-GFVD 797

Query: 1233 GHQPFPKEEPMPISITSELSRLMDPRLEGSKKSVGAVSALTELCIMEGLTLGFQSQPSLS 1054
               PF  ++ +  S  SE  R++DPRLE  KKSVG+V AL ELC +EGL L FQ+QP LS
Sbjct: 798  NMSPFGYQDRVSHSFASEPPRVLDPRLEVFKKSVGSVGALRELCAIEGLGLAFQTQPQLS 857

Query: 1053 TSSIHKGEARVQAEXXXXXXXXXXXLTWDEAKIRAAEEALGNLKSMLDQGTHKRLG 886
             +   K E   Q E            TWD+AK +AAE AL  LKS L Q + KR G
Sbjct: 858  ANPGQKSEIYAQVEIDGQVFGKGIGSTWDDAKTQAAERALVALKSELAQFSQKRQG 913


>ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Glycine max]
          Length = 960

 Score =  194 bits (494), Expect = 7e-47
 Identities = 111/237 (46%), Positives = 155/237 (65%), Gaps = 5/237 (2%)
 Frame = -1

Query: 1590 SDSSKRDLHFELERGSPPYAETPTGVLQDIAVRCGAKVEFRPALLASTELQFSIEVWFAG 1411
            S SS RDL  E    S  +A+TP  VL +IA++CG KV+F  +L+ASTEL+FS+E WF+G
Sbjct: 685  SSSSHRDLDSESGH-SVLHADTPVAVLHEIALKCGTKVDFMSSLVASTELKFSLEAWFSG 743

Query: 1410 EKISEGIGKTRKEAQHQASERCIKNLADKYLSVTMPDRSAVLEDLSKLSHENEN----YS 1243
            +KI  G G+TRKEAQ++A++  I++LAD YLS    +  +   D+S   + N+N     +
Sbjct: 744  KKIGHGFGRTRKEAQNKAAKDSIEHLADIYLSSAKDEPGSTYGDVSGFPNVNDNGYMGIA 803

Query: 1242 NSFGHQPFPKEEPMPISITSELSRLMDPRLEGSKKSVGAVSALTELCIMEGLTLGFQSQP 1063
            +S G+QP  KE+    S  S  SR +DPRL+ SK+S+G++SAL ELC+MEGL + F S P
Sbjct: 804  SSLGNQPLSKEDSASFSSASP-SRALDPRLDVSKRSMGSISALKELCMMEGLGVNFLSTP 862

Query: 1062 S-LSTSSIHKGEARVQAEXXXXXXXXXXXLTWDEAKIRAAEEALGNLKSMLDQGTHK 895
            + +ST+S+ K E   Q E           LTWDEAK++AAE+ALGNL+S L Q   K
Sbjct: 863  APVSTNSVQKDEVHAQVEIDGKIFGKGIGLTWDEAKMQAAEKALGNLRSKLGQSIQK 919


>ref|XP_004505032.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Cicer arietinum]
          Length = 951

 Score =  193 bits (491), Expect = 2e-46
 Identities = 114/240 (47%), Positives = 158/240 (65%), Gaps = 7/240 (2%)
 Frame = -1

Query: 1590 SDSSKRDLHFELERGSPPY-AETPTGVLQDIAVRCGAKVEFRPALLASTELQFSIEVWFA 1414
            S SS RD  F+ E G   + AETP  VLQ+IA++CG KVEF  +L AS ELQFSIE WF+
Sbjct: 676  SSSSHRD--FDSESGHSVFNAETPAIVLQEIALKCGTKVEFTSSLAASRELQFSIEAWFS 733

Query: 1413 GEKISEGIGKTRKEAQHQASERCIKNLADKYLSVTMPDRSAVLEDLSKLSHENEN----Y 1246
            G+KI  G G+TR EAQ++A+E  IK+LAD YLS    +  +   D+S   + N+N     
Sbjct: 734  GKKIGHGFGRTRMEAQYKAAEDSIKHLADIYLSRAKDESGSAFGDVSGFPNANDNGYVGN 793

Query: 1245 SNSFGHQPFPKEEPMPISITSELSRLMDPRLEGSKKSVGAVSALTELCIMEGLTLGFQSQ 1066
             +S G+QP PKEE +  S  S+ SR++DPRL+ SK+S+G+VSAL ELC++EGL + F S 
Sbjct: 794  VSSLGNQPLPKEESVSFSAASDPSRVLDPRLDVSKRSMGSVSALKELCMVEGLGVNFLSL 853

Query: 1065 PS-LSTSSIHKGEARVQAEXXXXXXXXXXXLTWDEAKIRAAEEALGNLKSML-DQGTHKR 892
            P+ +ST+S+   E   Q E           +TWDEAK++AAE+ALG+L++ +  QG  +R
Sbjct: 854  PAPVSTNSV--DEVHAQVEIDGQVYGKGTGITWDEAKMQAAEKALGSLRTTIHGQGIQRR 911

