BLASTX nr result

ID: Wisteria21_contig00001002 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Wisteria21_contig00001002
         (1650 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004492028.1| PREDICTED: RNA polymerase II C-terminal doma...   730   0.0  
gb|KHN47532.1| RNA polymerase II C-terminal domain phosphatase-l...   660   0.0  
gb|KRH01519.1| hypothetical protein GLYMA_18G282300 [Glycine max]     655   0.0  
gb|KRH01517.1| hypothetical protein GLYMA_18G282300 [Glycine max...   655   0.0  
gb|KHN13828.1| RNA polymerase II C-terminal domain phosphatase-l...   655   0.0  
ref|XP_006603006.1| PREDICTED: RNA polymerase II C-terminal doma...   655   0.0  
ref|XP_003530482.2| PREDICTED: RNA polymerase II C-terminal doma...   647   0.0  
ref|XP_007139315.1| hypothetical protein PHAVU_008G019000g [Phas...   642   0.0  
gb|KOM36704.1| hypothetical protein LR48_Vigan03g008500 [Vigna a...   623   e-175
ref|XP_013447776.1| carboxy-terminal domain phosphatase-like pro...   618   e-174
ref|XP_003621644.2| carboxy-terminal domain phosphatase-like pro...   618   e-174
ref|XP_014497833.1| PREDICTED: RNA polymerase II C-terminal doma...   608   e-171
ref|XP_010100046.1| RNA polymerase II C-terminal domain phosphat...   400   e-108
ref|XP_007043830.1| RNA polymerase II C-terminal domain phosphat...   390   e-105
ref|XP_008222368.1| PREDICTED: RNA polymerase II C-terminal doma...   389   e-105
ref|XP_003609604.2| carboxy-terminal domain phosphatase-like pro...   379   e-102
ref|XP_007225412.1| hypothetical protein PRUPE_ppa000589mg [Prun...   377   e-101
ref|XP_009378819.1| PREDICTED: RNA polymerase II C-terminal doma...   372   e-100
ref|XP_009359893.1| PREDICTED: RNA polymerase II C-terminal doma...   361   1e-96
ref|XP_008369646.1| PREDICTED: RNA polymerase II C-terminal doma...   358   8e-96

>ref|XP_004492028.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Cicer arietinum]
          Length = 1247

 Score =  730 bits (1885), Expect = 0.0
 Identities = 399/552 (72%), Positives = 431/552 (78%), Gaps = 2/552 (0%)
 Frame = -1

Query: 1650 GMDRSGLPSS-KTEAGKMELDSEGSKLHFYETDALKAVSTYQQKFGRSSFFTNDKLPSPT 1474
            GMDR GLPS+ KTEA KMELD+E SK H YETDALKAVSTYQQKFGRSS+FT+DK PSPT
Sbjct: 389  GMDRPGLPSAGKTEAVKMELDTENSKNHLYETDALKAVSTYQQKFGRSSYFTDDKFPSPT 448

Query: 1473 PSGDCEEGVVDTNDEVXXXXXXXXXXXSKPTLLDQMPVSSTSMDRSSMHGLINSRIDAAG 1294
            PSGDCEEGV D N+EV           SKP LLDQMPVSSTS+DRSSMHGLINSRI+AA 
Sbjct: 449  PSGDCEEGVADANEEVSSASIAVSLTSSKP-LLDQMPVSSTSVDRSSMHGLINSRIEAAS 507

Query: 1293 SGSYPVKTSAKSRDPRLRFINSDASALDLNQPSGTHNMPKVEYAGTIISRKQKAIEEPSL 1114
            S +YPVKTSA+SRDPRLRFINSDASALDLNQ  GT+NMPKVE AG +ISRKQK  EE SL
Sbjct: 508  SVTYPVKTSARSRDPRLRFINSDASALDLNQSLGTNNMPKVENAGRVISRKQKTTEELSL 567

Query: 1113 DATVSKRLRSSLENPERNTREVRTTAGNGGWLEETTVAGSQLIERNHLMEKVETEPKKTV 934
            DAT  KRLRSSLEN   NTRE RT AGNGGWLEE  VAGS LIERNHLM+K ETE KKT+
Sbjct: 568  DATAPKRLRSSLENSRHNTREERTMAGNGGWLEENRVAGSHLIERNHLMQKGETELKKTM 627

Query: 933  STSSGNFNVASNGNEQAPVTSSNTAVSLPALLKDIAVNPTMLLNILIEQQQRLXXXXXXX 754
            STSSG   V SNGNEQAPVT SNTA +LP LLK+IAVNPTMLLNIL+EQQQRL       
Sbjct: 628  STSSGYSTVTSNGNEQAPVTVSNTAAALPGLLKNIAVNPTMLLNILLEQQQRLAAEANKK 687

Query: 753  XXXXXXSTGQLTSSNSAMGTETTVNIGPSMTAGLPQGSAGMLPVXXXXXXXXXSLQDDSG 574
                  ST  LT  NSA G + TVN GP+MTAGLPQ S GMLP          +L +DSG
Sbjct: 688  PVDSATSTMHLT--NSARGPDATVNTGPAMTAGLPQSSVGMLPASTQAASMAHTLLEDSG 745

Query: 573  KFRMKPRDPRRILHGSNTLQKSGSLGSEQFKAIVSPMSNNQGTGDNVNAQKPEVRAETKL 394
            K RMKPRDPRRILHGS++LQKSGS GSEQ K++VSP SNNQG G NVNAQK +VR ETKL
Sbjct: 746  KIRMKPRDPRRILHGSSSLQKSGSTGSEQSKSVVSPTSNNQGNGGNVNAQKLDVRVETKL 805

Query: 393  ASTQSIAQPDITRQFTRNLKNIADIISVSQESSNHSPA-TQNVSSASVPFTLDKKEQKSA 217
            A TQS AQPDITRQFT+NLKNIADI+SVSQE S   PA TQNVSSASVPFTLDK E KS 
Sbjct: 806  APTQSSAQPDITRQFTKNLKNIADIMSVSQEPSTQLPATTQNVSSASVPFTLDKAELKSG 865

Query: 216  VPNSQNLQAGIGSAPESCASGSSRSQSTWSDVEHLFEGYDXXXXXXXXXXXXXXXXXQNK 37
            VPNSQNLQ G+GSAPE+CA GSSRSQSTW+DVEHLFEGYD                 QNK
Sbjct: 866  VPNSQNLQDGVGSAPETCAPGSSRSQSTWADVEHLFEGYDEKQKAAIQRERARRLEEQNK 925

Query: 36   MFAARKLSLVLD 1
            MFA++KL LVLD
Sbjct: 926  MFASKKLCLVLD 937


>gb|KHN47532.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Glycine soja]
          Length = 1263

 Score =  660 bits (1703), Expect = 0.0
 Identities = 365/555 (65%), Positives = 408/555 (73%), Gaps = 6/555 (1%)
 Frame = -1

Query: 1647 MDRSGLPSSKTEAGKMELDSEGSKLHFYETDALKAVSTYQQKFGRSSFFTNDKLPSPTPS 1468
            M RSG  S+K E+GKMELDSEGSK H YETDALKAVSTYQQKFGRSS FTNDK PSPTPS
Sbjct: 409  MVRSGSASAKMESGKMELDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPS 468

Query: 1467 GDCEEGVVDTNDEVXXXXXXXXXXXSKPTLLDQMPVSSTSMDRSSMHGLINSRIDAAGSG 1288
            GDCE+ VVDT +EV           +KPTLLDQ PVS+TSMDRSSMHG I+SR+DAAG G
Sbjct: 469  GDCEDEVVDTIEEVSSASTGDFLTSTKPTLLDQPPVSATSMDRSSMHGFISSRVDAAGPG 528

Query: 1287 SYPVKTSAKSRDPRLRFINSDASALDLNQPSGTHNMPKVEYAGTIISRKQKAIEEPSLDA 1108
            S+PVK+SAK+RDPRLRFINSDASA+D N  +  +NM KVEY+GT ISRKQKA EEPSLD 
Sbjct: 529  SFPVKSSAKNRDPRLRFINSDASAVD-NLSTLINNMSKVEYSGTTISRKQKAAEEPSLDV 587

Query: 1107 TVSKRLRSSLENPERNTREVRTTAGNGGWLEETTVAGSQLIERNHLMEKVETEPKKTVST 928
            TVSKRL+SSLEN E N  EVRT  G+GGWLEE T  G+QLIERNHLM+K   E KKT++T
Sbjct: 588  TVSKRLKSSLENTEHNMSEVRT--GSGGWLEENTGPGAQLIERNHLMDKFGPEAKKTLNT 645

Query: 927  ------SSGNFNVASNGNEQAPVTSSNTAVSLPALLKDIAVNPTMLLNILIEQQQRLXXX 766
                   S NFN  S  NEQAP+T+SN   SLPALLK+ +VNP ML+NIL     RL   
Sbjct: 646  VSSSCTGSDNFNATSIRNEQAPITASNVLASLPALLKEASVNPIMLVNIL-----RLAEA 700

Query: 765  XXXXXXXXXXSTGQLTSSNSAMGTETTVNIGPSMTAGLPQGSAGMLPVXXXXXXXXXSLQ 586
                           TSSN AMGT++T +IG SM  GL Q S GMLPV         +LQ
Sbjct: 701  QKKSADSAAIMLLHPTSSNPAMGTDSTASIGSSMATGLLQSSVGMLPVSSQSTSTAQTLQ 760

Query: 585  DDSGKFRMKPRDPRRILHGSNTLQKSGSLGSEQFKAIVSPMSNNQGTGDNVNAQKPEVRA 406
            DDSGK RMKPRDPRRILH +NT+QKSG LG+EQFKAIVSP+SNNQ TGDNVNAQK E R 
Sbjct: 761  DDSGKIRMKPRDPRRILHTNNTIQKSGDLGNEQFKAIVSPVSNNQRTGDNVNAQKLEGRV 820

Query: 405  ETKLASTQSIAQPDITRQFTRNLKNIADIISVSQESSNHSPATQNVSSASVPFTLDKKEQ 226
            + KL  TQS AQPDI RQFTRNLKNIADI+SVSQESS H+P +QN SSASVP T D+ EQ
Sbjct: 821  DNKLVPTQSSAQPDIARQFTRNLKNIADIMSVSQESSTHTPVSQNFSSASVPLTSDRGEQ 880

Query: 225  KSAVPNSQNLQAGIGSAPESCASGSSRSQSTWSDVEHLFEGYDXXXXXXXXXXXXXXXXX 46
            KS V +SQNLQA + SA E+ AS +SRSQSTW DVEHLFEGYD                 
Sbjct: 881  KSVVSSSQNLQADMASAHETAASVTSRSQSTWGDVEHLFEGYDEQQKAAIQRERARRIEE 940

Query: 45   QNKMFAARKLSLVLD 1
            QNKMFAARKL LVLD
Sbjct: 941  QNKMFAARKLCLVLD 955


>gb|KRH01519.1| hypothetical protein GLYMA_18G282300 [Glycine max]
          Length = 1100

 Score =  655 bits (1690), Expect = 0.0
 Identities = 359/555 (64%), Positives = 404/555 (72%), Gaps = 6/555 (1%)
 Frame = -1

Query: 1647 MDRSGLPSSKTEAGKMELDSEGSKLHFYETDALKAVSTYQQKFGRSSFFTNDKLPSPTPS 1468
            M  SG  ++K E+GKMELDSEGSK H YETDALKAVSTYQQKFGRSS FTNDK PSPTPS
Sbjct: 403  MVSSGSAAAKPESGKMELDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPS 462

Query: 1467 GDCEEGVVDTNDEVXXXXXXXXXXXSKPTLLDQMPVSSTSMDRSSMHGLINSRIDAAGSG 1288
            GDCE+ +VDTN+EV           +KPTLLD  PVS+TS DRSS+HG I+SR+DAAG G
Sbjct: 463  GDCEDEIVDTNEEVSSASTGDFLTSTKPTLLDLPPVSATSTDRSSLHGFISSRVDAAGPG 522

Query: 1287 SYPVKTSAKSRDPRLRFINSDASALDLNQPSGTHNMPKVEYAGTIISRKQKAIEEPSLDA 1108
            S PVK+SAK+RDPRLRF+NSDASA+D N  +  HNMPKVEYAGT ISRKQKA EEPSLD 
Sbjct: 523  SLPVKSSAKNRDPRLRFVNSDASAVD-NPSTLIHNMPKVEYAGTTISRKQKAAEEPSLDV 581

Query: 1107 TVSKRLRSSLENPERNTREVRTTAGNGGWLEETTVAGSQLIERNHLMEKVETEPKKTVST 928
            TVSKR +S LEN E N  EVRT  G GGWLEE T  G+Q IERNHLM+K   EP+KT++T
Sbjct: 582  TVSKRQKSPLENTEHNMSEVRT--GIGGWLEEHTGPGAQFIERNHLMDKFGPEPQKTLNT 639

Query: 927  ------SSGNFNVASNGNEQAPVTSSNTAVSLPALLKDIAVNPTMLLNILIEQQQRLXXX 766
                   S NFN  S  NEQAP+TSSN   SLPALLK  AVNPTML+N+L     R+   
Sbjct: 640  VSSSCTGSDNFNATSIRNEQAPITSSNVLASLPALLKGAAVNPTMLVNLL-----RIAEA 694

Query: 765  XXXXXXXXXXSTGQLTSSNSAMGTETTVNIGPSMTAGLPQGSAGMLPVXXXXXXXXXSLQ 586
                           TSSNSAMGT++T +IG SM  GL Q S GMLPV         +LQ
Sbjct: 695  QKKSADSATNMLLHPTSSNSAMGTDSTASIGSSMATGLLQSSVGMLPVSSQSTSMTQTLQ 754

Query: 585  DDSGKFRMKPRDPRRILHGSNTLQKSGSLGSEQFKAIVSPMSNNQGTGDNVNAQKPEVRA 406
            DDSGK RMKPRDPRRILH +NT+QKSG+LG+EQFKAIVSP+SNNQGTGDNVNAQK E R 
Sbjct: 755  DDSGKIRMKPRDPRRILHTNNTIQKSGNLGNEQFKAIVSPVSNNQGTGDNVNAQKLEGRV 814

Query: 405  ETKLASTQSIAQPDITRQFTRNLKNIADIISVSQESSNHSPATQNVSSASVPFTLDKKEQ 226
            ++KL  TQ  AQPDI RQF RNLKNIADI+SVSQESS H+P  Q  SSASVP T D+ EQ
Sbjct: 815  DSKLVPTQPSAQPDIARQFARNLKNIADIMSVSQESSTHTPVAQIFSSASVPLTSDRGEQ 874

Query: 225  KSAVPNSQNLQAGIGSAPESCASGSSRSQSTWSDVEHLFEGYDXXXXXXXXXXXXXXXXX 46
            KS V NSQNL+AG+ SA E+ ASG+ RSQ+TW DVEHLFEGYD                 
Sbjct: 875  KSVVSNSQNLEAGMVSAHETAASGTCRSQNTWGDVEHLFEGYDEQQKAAIQRERARRIEE 934

Query: 45   QNKMFAARKLSLVLD 1
            QNKMFAARKL LVLD
Sbjct: 935  QNKMFAARKLCLVLD 949


>gb|KRH01517.1| hypothetical protein GLYMA_18G282300 [Glycine max]
            gi|947051989|gb|KRH01518.1| hypothetical protein
            GLYMA_18G282300 [Glycine max]
          Length = 1260

 Score =  655 bits (1690), Expect = 0.0
 Identities = 359/555 (64%), Positives = 404/555 (72%), Gaps = 6/555 (1%)
 Frame = -1

Query: 1647 MDRSGLPSSKTEAGKMELDSEGSKLHFYETDALKAVSTYQQKFGRSSFFTNDKLPSPTPS 1468
            M  SG  ++K E+GKMELDSEGSK H YETDALKAVSTYQQKFGRSS FTNDK PSPTPS
Sbjct: 403  MVSSGSAAAKPESGKMELDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPS 462

Query: 1467 GDCEEGVVDTNDEVXXXXXXXXXXXSKPTLLDQMPVSSTSMDRSSMHGLINSRIDAAGSG 1288
            GDCE+ +VDTN+EV           +KPTLLD  PVS+TS DRSS+HG I+SR+DAAG G
Sbjct: 463  GDCEDEIVDTNEEVSSASTGDFLTSTKPTLLDLPPVSATSTDRSSLHGFISSRVDAAGPG 522

Query: 1287 SYPVKTSAKSRDPRLRFINSDASALDLNQPSGTHNMPKVEYAGTIISRKQKAIEEPSLDA 1108
            S PVK+SAK+RDPRLRF+NSDASA+D N  +  HNMPKVEYAGT ISRKQKA EEPSLD 
Sbjct: 523  SLPVKSSAKNRDPRLRFVNSDASAVD-NPSTLIHNMPKVEYAGTTISRKQKAAEEPSLDV 581

Query: 1107 TVSKRLRSSLENPERNTREVRTTAGNGGWLEETTVAGSQLIERNHLMEKVETEPKKTVST 928
            TVSKR +S LEN E N  EVRT  G GGWLEE T  G+Q IERNHLM+K   EP+KT++T
Sbjct: 582  TVSKRQKSPLENTEHNMSEVRT--GIGGWLEEHTGPGAQFIERNHLMDKFGPEPQKTLNT 639

Query: 927  ------SSGNFNVASNGNEQAPVTSSNTAVSLPALLKDIAVNPTMLLNILIEQQQRLXXX 766
                   S NFN  S  NEQAP+TSSN   SLPALLK  AVNPTML+N+L     R+   
Sbjct: 640  VSSSCTGSDNFNATSIRNEQAPITSSNVLASLPALLKGAAVNPTMLVNLL-----RIAEA 694

Query: 765  XXXXXXXXXXSTGQLTSSNSAMGTETTVNIGPSMTAGLPQGSAGMLPVXXXXXXXXXSLQ 586
                           TSSNSAMGT++T +IG SM  GL Q S GMLPV         +LQ
Sbjct: 695  QKKSADSATNMLLHPTSSNSAMGTDSTASIGSSMATGLLQSSVGMLPVSSQSTSMTQTLQ 754

Query: 585  DDSGKFRMKPRDPRRILHGSNTLQKSGSLGSEQFKAIVSPMSNNQGTGDNVNAQKPEVRA 406
            DDSGK RMKPRDPRRILH +NT+QKSG+LG+EQFKAIVSP+SNNQGTGDNVNAQK E R 
Sbjct: 755  DDSGKIRMKPRDPRRILHTNNTIQKSGNLGNEQFKAIVSPVSNNQGTGDNVNAQKLEGRV 814

Query: 405  ETKLASTQSIAQPDITRQFTRNLKNIADIISVSQESSNHSPATQNVSSASVPFTLDKKEQ 226
            ++KL  TQ  AQPDI RQF RNLKNIADI+SVSQESS H+P  Q  SSASVP T D+ EQ
Sbjct: 815  DSKLVPTQPSAQPDIARQFARNLKNIADIMSVSQESSTHTPVAQIFSSASVPLTSDRGEQ 874

Query: 225  KSAVPNSQNLQAGIGSAPESCASGSSRSQSTWSDVEHLFEGYDXXXXXXXXXXXXXXXXX 46
            KS V NSQNL+AG+ SA E+ ASG+ RSQ+TW DVEHLFEGYD                 
Sbjct: 875  KSVVSNSQNLEAGMVSAHETAASGTCRSQNTWGDVEHLFEGYDEQQKAAIQRERARRIEE 934

Query: 45   QNKMFAARKLSLVLD 1
            QNKMFAARKL LVLD
Sbjct: 935  QNKMFAARKLCLVLD 949


>gb|KHN13828.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Glycine soja]
          Length = 1251

 Score =  655 bits (1690), Expect = 0.0
 Identities = 359/555 (64%), Positives = 404/555 (72%), Gaps = 6/555 (1%)
 Frame = -1

Query: 1647 MDRSGLPSSKTEAGKMELDSEGSKLHFYETDALKAVSTYQQKFGRSSFFTNDKLPSPTPS 1468
            M  SG  ++K E+GKMELDSEGSK H YETDALKAVSTYQQKFGRSS FTNDK PSPTPS
Sbjct: 397  MVSSGSAAAKPESGKMELDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPS 456

Query: 1467 GDCEEGVVDTNDEVXXXXXXXXXXXSKPTLLDQMPVSSTSMDRSSMHGLINSRIDAAGSG 1288
            GDCE+ +VDTN+EV           +KPTLLD  PVS+TS DRSS+HG I+SR+DAAG G
Sbjct: 457  GDCEDEIVDTNEEVSSASTGDFLTSTKPTLLDLPPVSATSTDRSSLHGFISSRVDAAGPG 516

Query: 1287 SYPVKTSAKSRDPRLRFINSDASALDLNQPSGTHNMPKVEYAGTIISRKQKAIEEPSLDA 1108
            S PVK+SAK+RDPRLRF+NSDASA+D N  +  HNMPKVEYAGT ISRKQKA EEPSLD 
Sbjct: 517  SLPVKSSAKNRDPRLRFVNSDASAVD-NPSTLIHNMPKVEYAGTTISRKQKAAEEPSLDV 575

Query: 1107 TVSKRLRSSLENPERNTREVRTTAGNGGWLEETTVAGSQLIERNHLMEKVETEPKKTVST 928
            TVSKR +S LEN E N  EVRT  G GGWLEE T  G+Q IERNHLM+K   EP+KT++T
Sbjct: 576  TVSKRQKSPLENTEHNMSEVRT--GIGGWLEEHTGPGAQFIERNHLMDKFGPEPQKTLNT 633

Query: 927  ------SSGNFNVASNGNEQAPVTSSNTAVSLPALLKDIAVNPTMLLNILIEQQQRLXXX 766
                   S NFN  S  NEQAP+TSSN   SLPALLK  AVNPTML+N+L     R+   
Sbjct: 634  VSSSCTGSDNFNATSIRNEQAPITSSNVLASLPALLKGAAVNPTMLVNLL-----RIAEA 688

Query: 765  XXXXXXXXXXSTGQLTSSNSAMGTETTVNIGPSMTAGLPQGSAGMLPVXXXXXXXXXSLQ 586
                           TSSNSAMGT++T +IG SM  GL Q S GMLPV         +LQ
Sbjct: 689  QKKSADSATNMLLHPTSSNSAMGTDSTASIGSSMATGLLQSSVGMLPVSSQSTSMTQTLQ 748

Query: 585  DDSGKFRMKPRDPRRILHGSNTLQKSGSLGSEQFKAIVSPMSNNQGTGDNVNAQKPEVRA 406
            DDSGK RMKPRDPRRILH +NT+QKSG+LG+EQFKAIVSP+SNNQGTGDNVNAQK E R 
Sbjct: 749  DDSGKIRMKPRDPRRILHTNNTIQKSGNLGNEQFKAIVSPVSNNQGTGDNVNAQKLEGRV 808

Query: 405  ETKLASTQSIAQPDITRQFTRNLKNIADIISVSQESSNHSPATQNVSSASVPFTLDKKEQ 226
            ++KL  TQ  AQPDI RQF RNLKNIADI+SVSQESS H+P  Q  SSASVP T D+ EQ
Sbjct: 809  DSKLVPTQPSAQPDIARQFARNLKNIADIMSVSQESSTHTPVAQIFSSASVPLTSDRGEQ 868

Query: 225  KSAVPNSQNLQAGIGSAPESCASGSSRSQSTWSDVEHLFEGYDXXXXXXXXXXXXXXXXX 46
            KS V NSQNL+AG+ SA E+ ASG+ RSQ+TW DVEHLFEGYD                 
Sbjct: 869  KSVVSNSQNLEAGMVSAHETAASGTCRSQNTWGDVEHLFEGYDEQQKAAIQRERARRIEE 928

Query: 45   QNKMFAARKLSLVLD 1
            QNKMFAARKL LVLD
Sbjct: 929  QNKMFAARKLCLVLD 943


>ref|XP_006603006.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Glycine max] gi|947051986|gb|KRH01515.1|
            hypothetical protein GLYMA_18G282300 [Glycine max]
            gi|947051987|gb|KRH01516.1| hypothetical protein
            GLYMA_18G282300 [Glycine max]
          Length = 1257

 Score =  655 bits (1690), Expect = 0.0
 Identities = 359/555 (64%), Positives = 404/555 (72%), Gaps = 6/555 (1%)
 Frame = -1

Query: 1647 MDRSGLPSSKTEAGKMELDSEGSKLHFYETDALKAVSTYQQKFGRSSFFTNDKLPSPTPS 1468
            M  SG  ++K E+GKMELDSEGSK H YETDALKAVSTYQQKFGRSS FTNDK PSPTPS
Sbjct: 403  MVSSGSAAAKPESGKMELDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPS 462

Query: 1467 GDCEEGVVDTNDEVXXXXXXXXXXXSKPTLLDQMPVSSTSMDRSSMHGLINSRIDAAGSG 1288
            GDCE+ +VDTN+EV           +KPTLLD  PVS+TS DRSS+HG I+SR+DAAG G
Sbjct: 463  GDCEDEIVDTNEEVSSASTGDFLTSTKPTLLDLPPVSATSTDRSSLHGFISSRVDAAGPG 522

Query: 1287 SYPVKTSAKSRDPRLRFINSDASALDLNQPSGTHNMPKVEYAGTIISRKQKAIEEPSLDA 1108
            S PVK+SAK+RDPRLRF+NSDASA+D N  +  HNMPKVEYAGT ISRKQKA EEPSLD 
Sbjct: 523  SLPVKSSAKNRDPRLRFVNSDASAVD-NPSTLIHNMPKVEYAGTTISRKQKAAEEPSLDV 581

Query: 1107 TVSKRLRSSLENPERNTREVRTTAGNGGWLEETTVAGSQLIERNHLMEKVETEPKKTVST 928
            TVSKR +S LEN E N  EVRT  G GGWLEE T  G+Q IERNHLM+K   EP+KT++T
Sbjct: 582  TVSKRQKSPLENTEHNMSEVRT--GIGGWLEEHTGPGAQFIERNHLMDKFGPEPQKTLNT 639

Query: 927  ------SSGNFNVASNGNEQAPVTSSNTAVSLPALLKDIAVNPTMLLNILIEQQQRLXXX 766
                   S NFN  S  NEQAP+TSSN   SLPALLK  AVNPTML+N+L     R+   
Sbjct: 640  VSSSCTGSDNFNATSIRNEQAPITSSNVLASLPALLKGAAVNPTMLVNLL-----RIAEA 694

Query: 765  XXXXXXXXXXSTGQLTSSNSAMGTETTVNIGPSMTAGLPQGSAGMLPVXXXXXXXXXSLQ 586
                           TSSNSAMGT++T +IG SM  GL Q S GMLPV         +LQ
Sbjct: 695  QKKSADSATNMLLHPTSSNSAMGTDSTASIGSSMATGLLQSSVGMLPVSSQSTSMTQTLQ 754

Query: 585  DDSGKFRMKPRDPRRILHGSNTLQKSGSLGSEQFKAIVSPMSNNQGTGDNVNAQKPEVRA 406
            DDSGK RMKPRDPRRILH +NT+QKSG+LG+EQFKAIVSP+SNNQGTGDNVNAQK E R 
Sbjct: 755  DDSGKIRMKPRDPRRILHTNNTIQKSGNLGNEQFKAIVSPVSNNQGTGDNVNAQKLEGRV 814

Query: 405  ETKLASTQSIAQPDITRQFTRNLKNIADIISVSQESSNHSPATQNVSSASVPFTLDKKEQ 226
            ++KL  TQ  AQPDI RQF RNLKNIADI+SVSQESS H+P  Q  SSASVP T D+ EQ
Sbjct: 815  DSKLVPTQPSAQPDIARQFARNLKNIADIMSVSQESSTHTPVAQIFSSASVPLTSDRGEQ 874

Query: 225  KSAVPNSQNLQAGIGSAPESCASGSSRSQSTWSDVEHLFEGYDXXXXXXXXXXXXXXXXX 46
            KS V NSQNL+AG+ SA E+ ASG+ RSQ+TW DVEHLFEGYD                 
Sbjct: 875  KSVVSNSQNLEAGMVSAHETAASGTCRSQNTWGDVEHLFEGYDEQQKAAIQRERARRIEE 934

Query: 45   QNKMFAARKLSLVLD 1
            QNKMFAARKL LVLD
Sbjct: 935  QNKMFAARKLCLVLD 949


>ref|XP_003530482.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Glycine max] gi|947096623|gb|KRH45208.1|
            hypothetical protein GLYMA_08G257900 [Glycine max]
          Length = 1261

 Score =  647 bits (1670), Expect = 0.0
 Identities = 357/547 (65%), Positives = 400/547 (73%), Gaps = 6/547 (1%)
 Frame = -1

Query: 1623 SKTEAGKMELDSEGSKLHFYETDALKAVSTYQQKFGRSSFFTNDKLPSPTPSGDCEEGVV 1444
            S + + KMELDSEGSK H YETDALKAVSTYQQKFGRSS FTNDK PSPTPSGDCE+ VV
Sbjct: 415  SGSASAKMELDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPSGDCEDEVV 474

Query: 1443 DTNDEVXXXXXXXXXXXSKPTLLDQMPVSSTSMDRSSMHGLINSRIDAAGSGSYPVKTSA 1264
            DTN+EV           +KPTLLDQ PVS+TSMDRSSMHG I+SR+DA G GS+PVK+SA
Sbjct: 475  DTNEEVSSASTGDFLTSTKPTLLDQPPVSATSMDRSSMHGFISSRVDATGPGSFPVKSSA 534

Query: 1263 KSRDPRLRFINSDASALDLNQPSGTHNMPKVEYAGTIISRKQKAIEEPSLDATVSKRLRS 1084
            K+RDPRLRFINSDASA+D N  +  +NM KVEY+GT ISRKQKA EEPSLD TVSKRL+S
Sbjct: 535  KNRDPRLRFINSDASAVD-NLSTLINNMSKVEYSGTTISRKQKAAEEPSLDVTVSKRLKS 593

Query: 1083 SLENPERNTREVRTTAGNGGWLEETTVAGSQLIERNHLMEKVETEPKKTVST------SS 922
            SLEN E N  EVRT  G+GGWLEE T  G+QLIERNHLM+K   E KKT++T       S
Sbjct: 594  SLENTEHNMSEVRT--GSGGWLEENTGPGAQLIERNHLMDKFGPEAKKTLNTVSSSCTGS 651

Query: 921  GNFNVASNGNEQAPVTSSNTAVSLPALLKDIAVNPTMLLNILIEQQQRLXXXXXXXXXXX 742
             NFN  S  NEQAP+T+SN   SLPALLK+ +VNP ML+NIL     RL           
Sbjct: 652  DNFNATSIRNEQAPITASNVLASLPALLKEASVNPIMLVNIL-----RLAEAQKKSADSA 706

Query: 741  XXSTGQLTSSNSAMGTETTVNIGPSMTAGLPQGSAGMLPVXXXXXXXXXSLQDDSGKFRM 562
                   TSSN AMGT++T +IG SM  GL Q S GMLPV         +LQDDSGK RM
Sbjct: 707  AIMLLHPTSSNPAMGTDSTASIGSSMATGLLQSSVGMLPVSSQSTSTAQTLQDDSGKIRM 766

Query: 561  KPRDPRRILHGSNTLQKSGSLGSEQFKAIVSPMSNNQGTGDNVNAQKPEVRAETKLASTQ 382
            KPRDPRRILH +NT+QKSG LG+EQFKAIVSP+SNNQ TGDNVNA K E R + KL  TQ
Sbjct: 767  KPRDPRRILHTNNTIQKSGDLGNEQFKAIVSPVSNNQRTGDNVNAPKLEGRVDNKLVPTQ 826

Query: 381  SIAQPDITRQFTRNLKNIADIISVSQESSNHSPATQNVSSASVPFTLDKKEQKSAVPNSQ 202
            S AQPDI RQFTRNLKNIADI+SVSQESS H+P +QN SSASVP T D+ EQKS V +SQ
Sbjct: 827  SSAQPDIARQFTRNLKNIADIMSVSQESSTHTPVSQNFSSASVPLTSDRGEQKSVVSSSQ 886

Query: 201  NLQAGIGSAPESCASGSSRSQSTWSDVEHLFEGYDXXXXXXXXXXXXXXXXXQNKMFAAR 22
            NLQA + SA E+ AS +SRSQSTW DVEHLFEGYD                 QNKMFAAR
Sbjct: 887  NLQADMASAHETAASVTSRSQSTWGDVEHLFEGYDEQQKAAIQRERARRIEEQNKMFAAR 946

Query: 21   KLSLVLD 1
            KL LVLD
Sbjct: 947  KLCLVLD 953


>ref|XP_007139315.1| hypothetical protein PHAVU_008G019000g [Phaseolus vulgaris]
            gi|561012448|gb|ESW11309.1| hypothetical protein
            PHAVU_008G019000g [Phaseolus vulgaris]
          Length = 1272

 Score =  642 bits (1656), Expect = 0.0
 Identities = 354/555 (63%), Positives = 413/555 (74%), Gaps = 6/555 (1%)
 Frame = -1

Query: 1647 MDRSGLPSSKTEAGKMELDSEGSKLHFYETDALKAVSTYQQKFGRSSFFTNDKLPSPTPS 1468
            M +SG  ++K + GK+E+DSEGSK H YETDALKAVSTYQQKFGRSS FTNDKLPSPTPS
Sbjct: 416  MVKSGSAAAKMQPGKLEVDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKLPSPTPS 475

Query: 1467 GDCEEGVVDTNDEVXXXXXXXXXXXSKPTLLDQMPVSSTSMDRSSMHGLINSRIDAAGSG 1288
            GDC++  VDTN+EV           +KPTLLDQ PVS+TS+D+S + GLI+SR+DAAGSG
Sbjct: 476  GDCDDMAVDTNEEVSSASTSGFLTSTKPTLLDQPPVSATSVDKSRLLGLISSRVDAAGSG 535

Query: 1287 SYPVKTSAKSRDPRLRFINSDASALDLNQPSGTHNMPKVEYAGTIISRKQKAIEEPSLDA 1108
            S+PVK+SAKSRDPR R INS+ASA+D NQ + THNMPKVEYAG+ ISRKQKA+EEPS D 
Sbjct: 536  SFPVKSSAKSRDPRRRLINSEASAVD-NQFTVTHNMPKVEYAGSTISRKQKAVEEPSFDL 594

Query: 1107 TVSKRLRSSLENPERNTREVRTTAGNGGWLEETTVAGSQLIERNHLMEKVETEPKKTVST 928
            TVSKRL+SSLEN E NT EVRT AG+GGWLE+ T  G+QLIE+NHL++K   EPK+T++T
Sbjct: 595  TVSKRLKSSLENIEHNTSEVRTIAGSGGWLEDITGPGTQLIEKNHLIDKFAPEPKRTLNT 654

Query: 927  --SSG--NFNVASNGNEQAPVTSSNTAVSLPALLKDIAVNPTMLLNILIEQQQRLXXXXX 760
              SSG  NFN  S  NEQAP+TS+N   SLPA+ KDI VNPTMLL++L+EQ++ +     
Sbjct: 655  VSSSGSVNFNATSIRNEQAPITSNNVPSSLPAIFKDIVVNPTMLLSLLMEQKRLVDAQNN 714

Query: 759  XXXXXXXXSTGQL--TSSNSAMGTETTVNIGPSMTAGLPQGSAGMLPVXXXXXXXXXSLQ 586
                     T  L  TSSNSAMGT++T +I  SM  GL Q S GMLPV            
Sbjct: 715  SADSA----TNMLHPTSSNSAMGTDSTASIVSSMATGL-QTSVGMLPVSSQSTSTAQLQD 769

Query: 585  DDSGKFRMKPRDPRRILHGSNTLQKSGSLGSEQFKAIVSPMSNNQGTGDNVNAQKPEVRA 406
            D SGK RMKPRDPRRILH +N++QKSG++ +E  KAIVSP+SN   TGD+VNAQK E R 
Sbjct: 770  DYSGKIRMKPRDPRRILHTNNSVQKSGNIVNELHKAIVSPVSNILVTGDSVNAQKLEGRM 829

Query: 405  ETKLASTQSIAQPDITRQFTRNLKNIADIISVSQESSNHSPATQNVSSASVPFTLDKKEQ 226
            +TKL  TQS A PDITRQFTRNLKNIADI+SVSQESS HSPA Q  SSASVP  +D+ EQ
Sbjct: 830  DTKLVPTQSGAAPDITRQFTRNLKNIADIMSVSQESSTHSPAAQGFSSASVPLNVDRGEQ 889

Query: 225  KSAVPNSQNLQAGIGSAPESCASGSSRSQSTWSDVEHLFEGYDXXXXXXXXXXXXXXXXX 46
            KS + NSQNL AG GSAPE CA G+SRSQSTW DVEHLFEGYD                 
Sbjct: 890  KSVLSNSQNLHAGTGSAPEICAPGTSRSQSTWGDVEHLFEGYDEQQKAAIQRERARRIEE 949

Query: 45   QNKMFAARKLSLVLD 1
            QNKMFAARKL LVLD
Sbjct: 950  QNKMFAARKLCLVLD 964


>gb|KOM36704.1| hypothetical protein LR48_Vigan03g008500 [Vigna angularis]
          Length = 1275

 Score =  623 bits (1607), Expect = e-175
 Identities = 349/551 (63%), Positives = 405/551 (73%), Gaps = 9/551 (1%)
 Frame = -1

Query: 1626 SSKTEAGKMELDSEGS-KLHFYETDALKAVSTYQQKFGRSSFFTNDKLPSPTPSGDCEEG 1450
            ++K + GK+E+DSEGS K H YETDALKAVSTYQQKFGRSS FTNDKLPSPTPSGDC++ 
Sbjct: 421  AAKMQPGKVEVDSEGSTKFHLYETDALKAVSTYQQKFGRSSLFTNDKLPSPTPSGDCDDM 480

Query: 1449 VVDTNDEVXXXXXXXXXXXSKPTLLDQMPVSSTSMDRSSMHGLINSRIDAAGSGSYPVKT 1270
            VVDTN+EV           +KPTL+DQ PVS TSMD S + GLINSR+DAAG GS+PVK+
Sbjct: 481  VVDTNEEVSSASIGGFLTTTKPTLIDQPPVSGTSMDNSRLLGLINSRVDAAGPGSFPVKS 540

Query: 1269 SAKSRDPRLRFINSDASALDLNQPSGTHNMPKVEYAGTIISRKQKAIEEPSLDATVSKRL 1090
            SAKSRDPR R INS+A+A+D N     +NMPKVEYAG+ ISRKQKA+EEP  D TVSKRL
Sbjct: 541  SAKSRDPRRRLINSEANAVD-NHSVVINNMPKVEYAGSAISRKQKAVEEP-FDVTVSKRL 598

Query: 1089 RSSLENPERNTREVRTTAGNGGWLEETTVAGSQLIERNHLMEKVETEPKKTVSTSSGN-- 916
            +SSLEN E N+ +VRT AG GGWLE+ T  G++LIE+N+LM+K   EPKKT++T S +  
Sbjct: 599  KSSLENIEHNSSQVRTIAGTGGWLEDNTGPGTELIEKNNLMDKFAPEPKKTLNTVSSSCS 658

Query: 915  ----FNVASNGNEQAPVTSSNTAVSLPALLKDIAVNPTMLLNILIEQQQRLXXXXXXXXX 748
                FN  S  NEQ P+TSSN A SLPA+LKDI VNPTMLL ++ EQQ RL         
Sbjct: 659  GSVAFNATSIRNEQVPITSSNIASSLPAVLKDIVVNPTMLLGLIFEQQNRLRNAVNKSSD 718

Query: 747  XXXXSTGQLTSSNSAMGTETTVNIGPSMTAGLPQGSAGMLPVXXXXXXXXXSLQDD-SGK 571
                     TSSNSA GT++TV+IG SM  GL Q S GMLPV         SLQDD SGK
Sbjct: 719  SATNILNP-TSSNSATGTDSTVSIGSSMATGL-QTSVGMLPVSSQSTSTAQSLQDDYSGK 776

Query: 570  FRMKPRDPRRILHGSNTLQKSGSLGSEQFKAIVSPMSNNQGTGDNVNAQKPEVRAETKLA 391
             RMKPRDPRRILH +N++QKSG++ +E  KAIVSP+SN+Q TG+NVNAQK E R +TKL 
Sbjct: 777  IRMKPRDPRRILHTNNSVQKSGNIVNELHKAIVSPVSNSQVTGENVNAQKLEGRVDTKLV 836

Query: 390  STQSIAQPDITRQFTRNLKNIADIISVSQESSNHSPATQNVSSASVPFTLDKKEQKSAVP 211
             TQS A PDITRQFT+NLKNIADI+SVSQESS HS A Q+ SSASVP  +D+ EQKS V 
Sbjct: 837  PTQSGAAPDITRQFTKNLKNIADIMSVSQESSTHSTAAQSFSSASVPLNIDRGEQKSVVS 896

Query: 210  NSQNLQAG-IGSAPESCASGSSRSQSTWSDVEHLFEGYDXXXXXXXXXXXXXXXXXQNKM 34
            NSQNLQAG +GSA E CA G+SRSQSTW DVEHLFEGYD                 QNKM
Sbjct: 897  NSQNLQAGTVGSAHEICAPGTSRSQSTWGDVEHLFEGYDEQQKAAIQRERARRIEEQNKM 956

Query: 33   FAARKLSLVLD 1
            FAARKL LVLD
Sbjct: 957  FAARKLCLVLD 967


>ref|XP_013447776.1| carboxy-terminal domain phosphatase-like protein, putative [Medicago
            truncatula] gi|657376865|gb|KEH21861.1| carboxy-terminal
            domain phosphatase-like protein, putative [Medicago
            truncatula]
          Length = 958

 Score =  618 bits (1594), Expect = e-174
 Identities = 336/552 (60%), Positives = 384/552 (69%), Gaps = 2/552 (0%)
 Frame = -1

Query: 1650 GMDRSGLPSSK-TEAGKMELDSEGSKLHFYETDALKAVSTYQQKFGRSSFFTNDKLPSPT 1474
            G+DR GLP +  TEA KMELD + SKLH YETDALKAVSTYQQKF RSS+FT+DK PSPT
Sbjct: 145  GIDRFGLPPAVCTEAEKMELDGKDSKLHIYETDALKAVSTYQQKFSRSSYFTDDKFPSPT 204

Query: 1473 PSGDCEEGVVDTNDEVXXXXXXXXXXXSKPTLLDQMPVSSTSMDRSSMHGLINSRIDAAG 1294
            PSGDCE   VDTNDEV            KP  LDQ+PVSSTS+DR +MHGL++SRIDA G
Sbjct: 205  PSGDCEGEAVDTNDEVSSASIASSLTSFKPPPLDQIPVSSTSLDRPNMHGLVDSRIDATG 264

Query: 1293 SGSYPVKTSAKSRDPRLRFINSDASALDLNQPSGTHNMPKVEYAGTIISRKQKAIEEPSL 1114
            SGSYP K+SAKSRDPRLRFIN DAS LDLNQ  GTH+MP+VEY G +ISRKQK +EEPSL
Sbjct: 265  SGSYPAKSSAKSRDPRLRFINPDASTLDLNQSLGTHSMPRVEYGGRVISRKQKTVEEPSL 324

Query: 1113 DATVSKRLRSSLENPERNTREVRTTAGNGGWLEETTVAGSQLIERNHLMEKVETEPKKTV 934
            DAT  KRLR SLEN E NTRE R  AG GGW EE TVAGSQL ERNHLM+K ETE K+T+
Sbjct: 325  DATAPKRLRRSLENSEHNTREERAMAGKGGWFEENTVAGSQLAERNHLMQKGETELKRTI 384

Query: 933  STSSGNFNVASNGNEQAPVTSSNTAVSLPA-LLKDIAVNPTMLLNILIEQQQRLXXXXXX 757
            STSS N  V++NGNE A VTSS+   SLP  LL ++AVNP ML+++++E Q         
Sbjct: 385  STSSSNLTVSNNGNELASVTSSSATASLPTYLLNNVAVNPAMLIHMILEHQHN------- 437

Query: 756  XXXXXXXSTGQLTSSNSAMGTETTVNIGPSMTAGLPQGSAGMLPVXXXXXXXXXSLQDDS 577
                   +  Q    +SA GT+ TVN GP+MTAGL Q S G+LP          +L +DS
Sbjct: 438  ------EAEAQKKPVDSARGTDATVNTGPAMTAGLTQSSVGILPASSPATSMTQTLPEDS 491

Query: 576  GKFRMKPRDPRRILHGSNTLQKSGSLGSEQFKAIVSPMSNNQGTGDNVNAQKPEVRAETK 397
            GK RMKPRDPRR LHGS+TLQK                               +VR ETK
Sbjct: 492  GKIRMKPRDPRRFLHGSSTLQKF------------------------------DVRVETK 521

Query: 396  LASTQSIAQPDITRQFTRNLKNIADIISVSQESSNHSPATQNVSSASVPFTLDKKEQKSA 217
            LA  QSIAQPDITRQFT+NLKNIADI+SV QE+S++ PATQNVSSASVPF  D+ EQKS 
Sbjct: 522  LAPIQSIAQPDITRQFTKNLKNIADIMSVPQETSSNPPATQNVSSASVPFMSDRSEQKSG 581

Query: 216  VPNSQNLQAGIGSAPESCASGSSRSQSTWSDVEHLFEGYDXXXXXXXXXXXXXXXXXQNK 37
            VPNSQNL+ G+GSAPE+CA GSSR Q+TW+DVEHLFE YD                 Q K
Sbjct: 582  VPNSQNLKDGVGSAPETCAPGSSRPQNTWADVEHLFEAYDVKQKAAIQRERSRRLEEQKK 641

Query: 36   MFAARKLSLVLD 1
            MFAARKL LVLD
Sbjct: 642  MFAARKLCLVLD 653


>ref|XP_003621644.2| carboxy-terminal domain phosphatase-like protein, putative [Medicago
            truncatula] gi|657376864|gb|AES77862.2| carboxy-terminal
            domain phosphatase-like protein, putative [Medicago
            truncatula]
          Length = 1209

 Score =  618 bits (1594), Expect = e-174
 Identities = 336/552 (60%), Positives = 384/552 (69%), Gaps = 2/552 (0%)
 Frame = -1

Query: 1650 GMDRSGLPSSK-TEAGKMELDSEGSKLHFYETDALKAVSTYQQKFGRSSFFTNDKLPSPT 1474
            G+DR GLP +  TEA KMELD + SKLH YETDALKAVSTYQQKF RSS+FT+DK PSPT
Sbjct: 396  GIDRFGLPPAVCTEAEKMELDGKDSKLHIYETDALKAVSTYQQKFSRSSYFTDDKFPSPT 455

Query: 1473 PSGDCEEGVVDTNDEVXXXXXXXXXXXSKPTLLDQMPVSSTSMDRSSMHGLINSRIDAAG 1294
            PSGDCE   VDTNDEV            KP  LDQ+PVSSTS+DR +MHGL++SRIDA G
Sbjct: 456  PSGDCEGEAVDTNDEVSSASIASSLTSFKPPPLDQIPVSSTSLDRPNMHGLVDSRIDATG 515

Query: 1293 SGSYPVKTSAKSRDPRLRFINSDASALDLNQPSGTHNMPKVEYAGTIISRKQKAIEEPSL 1114
            SGSYP K+SAKSRDPRLRFIN DAS LDLNQ  GTH+MP+VEY G +ISRKQK +EEPSL
Sbjct: 516  SGSYPAKSSAKSRDPRLRFINPDASTLDLNQSLGTHSMPRVEYGGRVISRKQKTVEEPSL 575

Query: 1113 DATVSKRLRSSLENPERNTREVRTTAGNGGWLEETTVAGSQLIERNHLMEKVETEPKKTV 934
            DAT  KRLR SLEN E NTRE R  AG GGW EE TVAGSQL ERNHLM+K ETE K+T+
Sbjct: 576  DATAPKRLRRSLENSEHNTREERAMAGKGGWFEENTVAGSQLAERNHLMQKGETELKRTI 635

Query: 933  STSSGNFNVASNGNEQAPVTSSNTAVSLPA-LLKDIAVNPTMLLNILIEQQQRLXXXXXX 757
            STSS N  V++NGNE A VTSS+   SLP  LL ++AVNP ML+++++E Q         
Sbjct: 636  STSSSNLTVSNNGNELASVTSSSATASLPTYLLNNVAVNPAMLIHMILEHQHN------- 688

Query: 756  XXXXXXXSTGQLTSSNSAMGTETTVNIGPSMTAGLPQGSAGMLPVXXXXXXXXXSLQDDS 577
                   +  Q    +SA GT+ TVN GP+MTAGL Q S G+LP          +L +DS
Sbjct: 689  ------EAEAQKKPVDSARGTDATVNTGPAMTAGLTQSSVGILPASSPATSMTQTLPEDS 742

Query: 576  GKFRMKPRDPRRILHGSNTLQKSGSLGSEQFKAIVSPMSNNQGTGDNVNAQKPEVRAETK 397
            GK RMKPRDPRR LHGS+TLQK                               +VR ETK
Sbjct: 743  GKIRMKPRDPRRFLHGSSTLQKF------------------------------DVRVETK 772

Query: 396  LASTQSIAQPDITRQFTRNLKNIADIISVSQESSNHSPATQNVSSASVPFTLDKKEQKSA 217
            LA  QSIAQPDITRQFT+NLKNIADI+SV QE+S++ PATQNVSSASVPF  D+ EQKS 
Sbjct: 773  LAPIQSIAQPDITRQFTKNLKNIADIMSVPQETSSNPPATQNVSSASVPFMSDRSEQKSG 832

Query: 216  VPNSQNLQAGIGSAPESCASGSSRSQSTWSDVEHLFEGYDXXXXXXXXXXXXXXXXXQNK 37
            VPNSQNL+ G+GSAPE+CA GSSR Q+TW+DVEHLFE YD                 Q K
Sbjct: 833  VPNSQNLKDGVGSAPETCAPGSSRPQNTWADVEHLFEAYDVKQKAAIQRERSRRLEEQKK 892

Query: 36   MFAARKLSLVLD 1
            MFAARKL LVLD
Sbjct: 893  MFAARKLCLVLD 904


>ref|XP_014497833.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Vigna radiata var. radiata]
          Length = 1267

 Score =  608 bits (1567), Expect = e-171
 Identities = 347/558 (62%), Positives = 402/558 (72%), Gaps = 9/558 (1%)
 Frame = -1

Query: 1647 MDRSGLPSSKTEAGKMELDSEGS-KLHFYETDALKAVSTYQQKFGRSSFFTNDKLPSPTP 1471
            M +SG  ++K + GKME+DSEGS K H YETDALKAVSTYQQKFGRSS FTNDKLPSPTP
Sbjct: 415  MVKSG-SAAKMQPGKMEVDSEGSTKFHLYETDALKAVSTYQQKFGRSSLFTNDKLPSPTP 473

Query: 1470 SGDCEEGVVDTNDEVXXXXXXXXXXXSKPTLLDQMPVSSTSMDRSSMHGLINSRIDAAGS 1291
            SGDC++ VVDTN+EV           +KPTL+DQ PVS+TSMD S + GLIN+R+DAAG 
Sbjct: 474  SGDCDDMVVDTNEEVSSASTGGFLTTTKPTLIDQPPVSATSMDNSRLLGLINTRVDAAGP 533

Query: 1290 GSYPVKTSAKSRDPRLRFINSDASALDLNQPSGTHNMPKVEYAGTIISRKQKAIEEPSLD 1111
            GS+PVK+SAKSRDPR R IN +A+A+D N     +NMPKVEYAG+ ISRKQKA+EEP  D
Sbjct: 534  GSFPVKSSAKSRDPRRRLINPEANAVD-NHSIVINNMPKVEYAGSTISRKQKAVEEP-FD 591

Query: 1110 ATVSKRLRSSLENPERNTREVRTTAGNGGWLEETTVAGSQLIERNHLMEKVETEPKKTVS 931
             TVSKRL+SSLEN E N+ +VRT AG GGWLE+ T  G+QLIE+N+LM+K   EPKKT++
Sbjct: 592  VTVSKRLKSSLENIEHNSSQVRTIAGTGGWLEDNTGPGTQLIEKNNLMDKFAPEPKKTLN 651

Query: 930  TSSGN------FNVASNGNEQAPVTSSNTAVSLPALLKDIAVNPTMLLNILIEQQQRLXX 769
            T S +      FN  S  NEQ P+TSSN A SLPA+LKDI VNPTMLL ++ EQQ RL  
Sbjct: 652  TVSSSCSGSVGFNATSIRNEQVPITSSNIASSLPAVLKDIVVNPTMLLGLIFEQQNRLRN 711

Query: 768  XXXXXXXXXXXSTGQLTSSNSAMGTETTVNIGPSMTAGLPQGSAGMLPVXXXXXXXXXSL 589
                            TSSNSA G ++TV+IG SM  GL Q S G+LPV         SL
Sbjct: 712  AVNKSSESATNILNP-TSSNSAAGADSTVSIGSSMATGL-QTSVGILPVSSQSTSTAQSL 769

Query: 588  QDD-SGKFRMKPRDPRRILHGSNTLQKSGSLGSEQFKAIVSPMSNNQGTGDNVNAQKPEV 412
            QDD SGK RMKPRDPRRILH +N++QKSG+        IVSP+SN+Q TGDNVNAQK E 
Sbjct: 770  QDDYSGKIRMKPRDPRRILHTNNSVQKSGN--------IVSPVSNSQVTGDNVNAQKLEG 821

Query: 411  RAETKLASTQSIAQPDITRQFTRNLKNIADIISVSQESSNHSPATQNVSSASVPFTLDKK 232
            R +TKL   QS A PDITRQFT+NLKNIADI+SVSQESS HS A Q+ SSASVP  +D+ 
Sbjct: 822  RVDTKLVPPQSGAAPDITRQFTKNLKNIADIMSVSQESSTHSTAAQSFSSASVPLNIDRG 881

Query: 231  EQKSAVPNSQNLQAG-IGSAPESCASGSSRSQSTWSDVEHLFEGYDXXXXXXXXXXXXXX 55
            EQKS V NSQNLQAG +GSA E CA G+SRSQSTW DVEHLFEGYD              
Sbjct: 882  EQKSVVSNSQNLQAGTVGSAHEICAPGTSRSQSTWGDVEHLFEGYDEQQKAAIQRERARR 941

Query: 54   XXXQNKMFAARKLSLVLD 1
               QNKMFAARKL LVLD
Sbjct: 942  IEEQNKMFAARKLCLVLD 959


>ref|XP_010100046.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Morus
            notabilis] gi|587892642|gb|EXB81217.1| RNA polymerase II
            C-terminal domain phosphatase-like 3 [Morus notabilis]
          Length = 1301

 Score =  400 bits (1027), Expect = e-108
 Identities = 244/553 (44%), Positives = 325/553 (58%), Gaps = 8/553 (1%)
 Frame = -1

Query: 1635 GLPSSKTEAGKMELDSEGSKLHFYETDALKAVSTYQQKFGRSSFFTNDKLPSPTPSGDCE 1456
            G+    +   K+   +E S+LH YETDALKAVSTYQQKFGR SF  +D+LPSPTPS +C+
Sbjct: 432  GIIKPVSTTAKVAPGAEESRLHRYETDALKAVSTYQQKFGRGSFLMSDRLPSPTPSEECD 491

Query: 1455 EGVVDTNDEVXXXXXXXXXXXSK-PTLLDQMPVSSTSMDRSSMHGLINSRIDA-AGSGSY 1282
            E   D N EV              P L   +  SS  +   +M G I ++  A  GSGS 
Sbjct: 492  EED-DINQEVSSSLTSGNLRTPAIPILRPSVVTSSVPVSSPTMQGPIAAKNAAPVGSGSN 550

Query: 1281 P-VKTSAKSRDPRLRFINSDASALDLNQP--SGTHNMPKVEYAGTIISRKQKAIEEPSLD 1111
              +K SA+SRDPRLRF NSDA ALDLNQ   +  HN PKVE      SRKQ+ +EEP+LD
Sbjct: 551  STMKASARSRDPRLRFANSDAGALDLNQRPLTAVHNGPKVEPGDPTSSRKQRIVEEPNLD 610

Query: 1110 ATVSKRLRSSLENPERNTREVRTTAGNGGWLEETTVAGSQLIERNHLMEKVETEPKKTVS 931
                KR R +  + +    +V+T +G GGWLE+    G Q++ +N L+E  E +P+K++ 
Sbjct: 611  GPALKRQRHAFVSAKI---DVKTASGVGGWLEDNGTTGPQIMNKNQLVENAEADPRKSIH 667

Query: 930  TSSGNF--NVASNGNEQAPVTSSNTAVSLPALLKDIAVNPTMLLNILIEQ-QQRLXXXXX 760
              +G    N  + G EQ PVT ++T  +LPA+LKDIAVNPT+ ++IL +  QQ+L     
Sbjct: 668  LVNGPIMNNGPNIGKEQVPVTGTSTPDALPAILKDIAVNPTIFMDILNKLGQQQLLAADA 727

Query: 759  XXXXXXXXSTGQLTSSNSAMGTETTVNIGPSMTAGLPQGSAGMLPVXXXXXXXXXSLQDD 580
                    +T     +NS +G    VN+ PS  +G+ Q  A  LP           +QD+
Sbjct: 728  QQKSDSSKNTTHPPGTNSILGAAPLVNVAPSKASGILQTPAVSLPTTSQVATAS--MQDE 785

Query: 579  SGKFRMKPRDPRRILHGSNTLQKSGSLGSEQFKAIVSPMSNNQGTGDNVNAQKPEVRAET 400
             GK RMKPRDPRR+LHG N LQKS SLG EQFK IVS +S   G  DN+N    E +A+ 
Sbjct: 786  LGKIRMKPRDPRRVLHG-NMLQKSWSLGHEQFKPIVSSVSCTPGNKDNLNGPVQEGQADK 844

Query: 399  KLASTQSIAQPDITRQFTRNLKNIADIISVSQESSNHSPATQNVSSASVPFTLDKKEQKS 220
            K   +Q + QPDI RQFT+NL+NIAD++SVSQ S++ +  +QN+SS  +P   D+ + K+
Sbjct: 845  KQVPSQLVVQPDIARQFTKNLRNIADLMSVSQASTSPATVSQNLSSQPLPVKPDRGDVKA 904

Query: 219  AVPNSQNLQAGIGSAPESCASGSSRSQSTWSDVEHLFEGYDXXXXXXXXXXXXXXXXXQN 40
             VPNS++  +G  S PE+  +  SR+ + W DVEHLFEGYD                 Q 
Sbjct: 905  VVPNSEDQHSGTNSTPETTLAVPSRTPNAWGDVEHLFEGYDDEQKAAIQRERARRLEEQK 964

Query: 39   KMFAARKLSLVLD 1
            KMF A KL LVLD
Sbjct: 965  KMFDAHKLCLVLD 977


>ref|XP_007043830.1| RNA polymerase II C-terminal domain phosphatase-like 3, putative
            [Theobroma cacao] gi|508707765|gb|EOX99661.1| RNA
            polymerase II C-terminal domain phosphatase-like 3,
            putative [Theobroma cacao]
          Length = 1290

 Score =  390 bits (1002), Expect = e-105
 Identities = 255/556 (45%), Positives = 329/556 (59%), Gaps = 20/556 (3%)
 Frame = -1

Query: 1608 GKMELDSEGSKLHFYETDALKAVSTYQQKFGRSSFFTNDKLPSPTPSGDCEEGVVDTNDE 1429
            GK   D+EG KLH YETDALKA STYQQKFG+ SFF++D+LPSPTPS +  +   D   E
Sbjct: 441  GKGSHDAEGDKLHPYETDALKAFSTYQQKFGQGSFFSSDRLPSPTPSEESGDEGGDNGGE 500

Query: 1428 VXXXXXXXXXXXSKPTLLDQMPVSSTSMDR--SSMHGLINSRIDAAGSGSYPV--KTSAK 1261
            V           + P L   +  S+  +D   SS+ G I +R     S    +  K+ AK
Sbjct: 501  VSSSSSIGNFKPNLPILGHPIVSSAPLVDSASSSLQGQITTRNATPMSSVSNIVSKSLAK 560

Query: 1260 SRDPRLRFINSDASALDLNQPSGTHNMPKVEYAGTII-SRKQKAIEEPSLDATVSKRLRS 1084
            SRDPRL F NS+ASALDLN+    HN  KV   G I+ SRK+K++EEP LD+   KR R+
Sbjct: 561  SRDPRLWFANSNASALDLNERL-LHNASKVAPVGGIMDSRKKKSVEEPILDSPALKRQRN 619

Query: 1083 SLENPERNTREVRTTAGNGGWLEETTVAGSQLIERNHLMEKVETEPKK------TVSTSS 922
             LEN     R+V+T +G GGWLE+T   GSQ+  RN   E +E+  +K      + ST S
Sbjct: 620  ELENLGV-ARDVQTVSGIGGWLEDTDAIGSQITNRNQTAENLESNSRKMDNGVTSSSTLS 678

Query: 921  GNFNVASNGNEQAPVTSSNTAVSLPALLKDIAVNPTMLLNIL-IEQQQRLXXXXXXXXXX 745
            G  N+    NEQ PVTS++T  SLPALLKDIAVNPTML+NIL + QQQRL          
Sbjct: 679  GKTNITVGTNEQVPVTSTSTP-SLPALLKDIAVNPTMLINILKMGQQQRLGAEAQQKSPD 737

Query: 744  XXXSTGQLTSSNSAMGTETTVNI--------GPSMTAGLPQGSAGMLPVXXXXXXXXXSL 589
               ST    SSNS +G  ++ N+         PS+++G+    AG L V           
Sbjct: 738  PVKSTFHQPSSNSLLGVVSSTNVIPSPSVNNVPSISSGISSKPAGNLQVPSP-------- 789

Query: 588  QDDSGKFRMKPRDPRRILHGSNTLQKSGSLGSEQFKAIVSPMSNNQGTGDNVNAQKPEVR 409
             D+SGK RMKPRDPRR+LHG N+LQ+SGS+G +Q K   +  S+ QG+ DN+NAQK + +
Sbjct: 790  -DESGKIRMKPRDPRRVLHG-NSLQRSGSMGLDQLKTNGALTSSTQGSKDNLNAQKLDSQ 847

Query: 408  AETKLASTQSIAQPDITRQFTRNLKNIADIISVSQESSNHSPATQNVSSASVPFTLDKKE 229
             E+K   +Q +  PDIT+QFT NLKNIADI+SVSQ  ++  P + N+    V    D  +
Sbjct: 848  TESKPMQSQLVPPPDITQQFTNNLKNIADIMSVSQALTSLPPVSHNLVPQPVLIKSDSMD 907

Query: 228  QKSAVPNSQNLQAGIGSAPESCASGSSRSQSTWSDVEHLFEGYDXXXXXXXXXXXXXXXX 49
             K+ V NS++ Q G G APE+ A+G  RSQ+ W DVEHLFE YD                
Sbjct: 908  MKALVSNSEDQQTGAGLAPEAGATG-PRSQNAWGDVEHLFERYDDQQKAAIQRERARRIE 966

Query: 48   XQNKMFAARKLSLVLD 1
             Q KMF+ARKL LVLD
Sbjct: 967  EQKKMFSARKLCLVLD 982


>ref|XP_008222368.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Prunus mume]
          Length = 1194

 Score =  389 bits (999), Expect = e-105
 Identities = 258/555 (46%), Positives = 325/555 (58%), Gaps = 13/555 (2%)
 Frame = -1

Query: 1626 SSKTEAGKMELDSEGSKLHFYETDALKAVSTYQQKFGRSSFFTNDKLPSPTPSGDCEEGV 1447
            +S T   ++ L++E S+LH YET+ALKAVS+YQQKF RSSF  +++LPSPTPS D   G 
Sbjct: 354  ASDTATARVALNAEDSRLHSYETEALKAVSSYQQKFNRSSFLMSERLPSPTPSEDGGNGD 413

Query: 1446 VDTNDEVXXXXXXXXXXXSKPTLLDQ-MPVSSTSMDRSSMHGLINSRIDAAGSGSYP--- 1279
             DT  EV             P    Q +  S   +  SSM G   ++  AA   S P   
Sbjct: 414  DDTGGEVSSSSASNLRTSCSPMSGRQIVSPSPIPVGSSSMQGRATAK-SAAPPNSEPSMT 472

Query: 1278 VKTSAKSRDPRLRFINSDASALDLNQPSGT--HNMPKVEYAGTIISRKQKAIEEPSLDAT 1105
            +K SAKSRDPRLRF NSD  AL+LNQ   T  H+ PKV+   T+ SRKQK +EE   D  
Sbjct: 473  IKASAKSRDPRLRFANSDMGALNLNQQPSTVVHSAPKVDSVITLSSRKQKPLEESRFDGP 532

Query: 1104 VSKRLRSSLENPERNTREVRTTAGNGGWLEETTVAGSQLIERNHLMEKVETEPKKTVSTS 925
              KR R++LEN      + +T +G+GGWLE+    G  L  +N  +E  ET+P+K V   
Sbjct: 533  ALKRQRNALEN-SGIVGDAKTASGSGGWLEDIGGVGPHLNSKNQTVENAETDPRKVVKVL 591

Query: 924  S------GNFNVASNGNEQAPVTSSNTAVSLPALLKDIAVNPTMLLNIL-IEQQQRLXXX 766
            S      GN N  ++ NE   +  ++TA SLPALLKDIAVNPTMLLN+L + QQQRL   
Sbjct: 592  SSPSIVDGNTNGPNSANEHVSLMGASTA-SLPALLKDIAVNPTMLLNLLKMGQQQRLAAE 650

Query: 765  XXXXXXXXXXSTGQLTSSNSAMGTETTVNIGPSMTAGLPQGSAGMLPVXXXXXXXXXSLQ 586
                      +T   TSS+S + +    N+ PS T+G+ Q  AG LPV         +L 
Sbjct: 651  AQQKSADPPKTTTHPTSSSSILVSAALGNV-PSKTSGILQTPAGTLPV-----SSQKALM 704

Query: 585  DDSGKFRMKPRDPRRILHGSNTLQKSGSLGSEQFKAIVSPMSNNQGTGDNVNAQKPEVRA 406
            D+SGK RMKPRDPRR LHG N LQKSGSLG EQF+ IV P+S+ QG  DN+N Q     A
Sbjct: 705  DESGKVRMKPRDPRRALHG-NALQKSGSLGHEQFRNIVPPLSSIQGNKDNLNGQ-----A 758

Query: 405  ETKLASTQSIAQPDITRQFTRNLKNIADIISVSQESSNHSPATQNVSSASVPFTLDKKEQ 226
            + K  + QS+  PDITRQFT+NLKNIADI+SVS  S++ + A+Q+VSS  VP    K E+
Sbjct: 759  DKKPVTAQSLDAPDITRQFTKNLKNIADIMSVSNVSTSPAIASQSVSSQPVPI---KPER 815

Query: 225  KSAVPNSQNLQAGIGSAPESCASGSSRSQSTWSDVEHLFEGYDXXXXXXXXXXXXXXXXX 46
                P  Q  ++   SA E+ A+G SRS   W DVEHLFEGYD                 
Sbjct: 816  IDLKPEEQRPES--ISASEAAAAGPSRSPVMWGDVEHLFEGYDDQQKAAIQRERTRRIEE 873

Query: 45   QNKMFAARKLSLVLD 1
            Q KMFAA KL LVLD
Sbjct: 874  QKKMFAAHKLCLVLD 888


>ref|XP_003609604.2| carboxy-terminal domain phosphatase-like protein, putative [Medicago
            truncatula] gi|657390855|gb|AES91801.2| carboxy-terminal
            domain phosphatase-like protein, putative [Medicago
            truncatula]
          Length = 1074

 Score =  379 bits (974), Expect = e-102
 Identities = 222/402 (55%), Positives = 253/402 (62%)
 Frame = -1

Query: 1650 GMDRSGLPSSKTEAGKMELDSEGSKLHFYETDALKAVSTYQQKFGRSSFFTNDKLPSPTP 1471
            G DR GLP  KTEA KMELD +  KLH +ETDALKA ST QQKF RSSFFT+D+ PSPTP
Sbjct: 336  GTDRFGLPPVKTEAEKMELDGKDYKLHIHETDALKAASTCQQKFSRSSFFTDDEFPSPTP 395

Query: 1470 SGDCEEGVVDTNDEVXXXXXXXXXXXSKPTLLDQMPVSSTSMDRSSMHGLINSRIDAAGS 1291
            SGDCE G VDTNDEV           SKP  LDQM VSST ++RS+MHGLINSRIDA+G+
Sbjct: 396  SGDCEGGAVDTNDEVSSASIASSLTSSKPPPLDQMLVSSTYINRSNMHGLINSRIDASGA 455

Query: 1290 GSYPVKTSAKSRDPRLRFINSDASALDLNQPSGTHNMPKVEYAGTIISRKQKAIEEPSLD 1111
            GSYP KTS KSRDPRLRF  SD       Q S  + MPKVEYA  +ISRK+K +EE SLD
Sbjct: 456  GSYPAKTSVKSRDPRLRFNISD-------QSSTKNIMPKVEYAEGVISRKRKTVEESSLD 508

Query: 1110 ATVSKRLRSSLENPERNTREVRTTAGNGGWLEETTVAGSQLIERNHLMEKVETEPKKTVS 931
            AT  KRL  SLEN + N+RE +T    GGWL E TVA                       
Sbjct: 509  ATAPKRLTRSLENSQHNSREEQTMDAKGGWLAENTVA----------------------- 545

Query: 930  TSSGNFNVASNGNEQAPVTSSNTAVSLPALLKDIAVNPTMLLNILIEQQQRLXXXXXXXX 751
                N    SNGNEQAPV SS  A  L AL    +VN TMLLN L++  QRL        
Sbjct: 546  ---SNLTTTSNGNEQAPVISSCAATPLLALFNSESVNSTMLLNKLLDIHQRLAEVKRPIN 602

Query: 750  XXXXXSTGQLTSSNSAMGTETTVNIGPSMTAGLPQGSAGMLPVXXXXXXXXXSLQDDSGK 571
                     LT+SNSA GT +TVN  P+MT+G+PQ S GMLP          +LQ DS K
Sbjct: 603  FATSAL--HLTNSNSARGTNSTVNTSPTMTSGVPQNSIGMLPTSSPTTSMAQTLQVDSEK 660

Query: 570  FRMKPRDPRRILHGSNTLQKSGSLGSEQFKAIVSPMSNNQGT 445
              +KPRDPRR LH S+T+QKSGSLGS+Q KAIVSPM N +G+
Sbjct: 661  ICLKPRDPRRSLHASSTVQKSGSLGSKQSKAIVSPMPNIKGS 702



 Score = 62.0 bits (149), Expect = 2e-06
 Identities = 38/83 (45%), Positives = 48/83 (57%), Gaps = 1/83 (1%)
 Frame = -1

Query: 246 TLDKKEQKSAVPNSQNLQAGIGSAPESCASGSSRSQSTW-SDVEHLFEGYDXXXXXXXXX 70
           +L  K+ K+ V    N++   GSA E+CASGS +  +TW ++VEHL EGYD         
Sbjct: 683 SLGSKQSKAIVSPMPNIK---GSAHETCASGSCQPHNTWAANVEHLLEGYDAQQKAVIQR 739

Query: 69  XXXXXXXXQNKMFAARKLSLVLD 1
                   QNKMFAARKL LVLD
Sbjct: 740 ERARRLEEQNKMFAARKLCLVLD 762


>ref|XP_007225412.1| hypothetical protein PRUPE_ppa000589mg [Prunus persica]
            gi|462422348|gb|EMJ26611.1| hypothetical protein
            PRUPE_ppa000589mg [Prunus persica]
          Length = 1085

 Score =  377 bits (968), Expect = e-101
 Identities = 252/555 (45%), Positives = 321/555 (57%), Gaps = 13/555 (2%)
 Frame = -1

Query: 1626 SSKTEAGKMELDSEGSKLHFYETDALKAVSTYQQKFGRSSFFTNDKLPSPTPSGDCEEGV 1447
            +S T   ++ L++E S+LH YET+ALKAVS+YQQKF RSSF  +++LPSPTPS D   G 
Sbjct: 243  ASDTATARVALNAEDSRLHSYETEALKAVSSYQQKFNRSSFLMSERLPSPTPSEDGGNGD 302

Query: 1446 VDTNDEVXXXXXXXXXXXSKPTLLDQ-MPVSSTSMDRSSMHGLINSRIDAAGSGSYP--- 1279
             DT  EV             P    Q +  S   +   SM G   ++  AA   S P   
Sbjct: 303  DDTGGEVSSSFASNLRTSCPPISGRQIVSPSPIPVGSPSMQGRATAK-SAAPPNSEPSMT 361

Query: 1278 VKTSAKSRDPRLRFINSDASALDLNQPSGT--HNMPKVEYAGTIISRKQKAIEEPSLDAT 1105
            +K SAKSRDPRLRF NSD  AL+LNQ   T  H+ PKV+   T+ SRKQK +EE   D  
Sbjct: 362  IKASAKSRDPRLRFANSDMGALNLNQQPSTVVHSAPKVDSVITLSSRKQKPLEESRFDGP 421

Query: 1104 VSKRLRSSLENPERNTREVRTTAGNGGWLEETTVAGSQLIERNHLMEKVETEPKKTV--- 934
              KR R++LEN      + +T +G+GGWLE+    G  L  +N  +E  ET+P+  V   
Sbjct: 422  ALKRQRNALEN-SGIVGDAKTASGSGGWLEDIGGVGPHLNSKNQTVENAETDPRNVVKVL 480

Query: 933  ---STSSGNFNVASNGNEQAPVTSSNTAVSLPALLKDIAVNPTMLLNIL-IEQQQRLXXX 766
               ST   N N  ++ NE   +  ++ A SLP LLKDIAVNPTMLLN+L + QQQR+   
Sbjct: 481  SSPSTVDCNTNGPNSANEHVSLMGASMA-SLPELLKDIAVNPTMLLNLLKMGQQQRVASE 539

Query: 765  XXXXXXXXXXSTGQLTSSNSAMGTETTVNIGPSMTAGLPQGSAGMLPVXXXXXXXXXSLQ 586
                      +    TSS+S + +    N+ PS T+G+ Q  AG LPV         +L 
Sbjct: 540  AHQKSADPPKTMTHPTSSSSILVSAALGNV-PSKTSGILQTPAGTLPV-----SSQKALM 593

Query: 585  DDSGKFRMKPRDPRRILHGSNTLQKSGSLGSEQFKAIVSPMSNNQGTGDNVNAQKPEVRA 406
            D+SGK RMKPRDPRR LHG N LQKSGSLG EQF+ I+ P+S  QG  DN+N Q     A
Sbjct: 594  DESGKVRMKPRDPRRALHG-NALQKSGSLGQEQFRNIIPPLSAIQGNKDNLNGQ-----A 647

Query: 405  ETKLASTQSIAQPDITRQFTRNLKNIADIISVSQESSNHSPATQNVSSASVPFTLDKKEQ 226
            + KL ++QS+  PDITRQFT+NLKNIADI+SVS  S++ + A+Q+VSS  VP    K E+
Sbjct: 648  DKKLVTSQSLDAPDITRQFTKNLKNIADIMSVSNVSTSPAIASQSVSSQLVPI---KPER 704

Query: 225  KSAVPNSQNLQAGIGSAPESCASGSSRSQSTWSDVEHLFEGYDXXXXXXXXXXXXXXXXX 46
                P  Q  ++   SA E+ A+G SRS   W DVEHLFEGYD                 
Sbjct: 705  IDLKPEEQRPES--ISASEAAAAGPSRSPVMWGDVEHLFEGYDDQQKAAIQRERTRRIEE 762

Query: 45   QNKMFAARKLSLVLD 1
            Q KMFAA KL LVLD
Sbjct: 763  QKKMFAAHKLCLVLD 777


>ref|XP_009378819.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Pyrus x bretschneideri]
          Length = 1294

 Score =  372 bits (956), Expect = e-100
 Identities = 245/562 (43%), Positives = 325/562 (57%), Gaps = 22/562 (3%)
 Frame = -1

Query: 1620 KTEAGKMELDSEGSKLHFYETDALKAVSTYQQKFGRSSFFTNDKLPSPTPSGDCEEGVVD 1441
            + +  ++  ++ GS LH Y+TDA+KAVSTYQQ   R+SFF +++LPSPTPS D + G  D
Sbjct: 443  RPDTSRVTPNAGGSGLHPYDTDAIKAVSTYQQ-INRTSFFMSERLPSPTPSEDGDNGDDD 501

Query: 1440 TNDEVXXXXXXXXXXXSKPTLLDQMPVSSTSMD--RSSMHGLINSRIDAAGSG--SYPVK 1273
            T  EV             P +L Q  VS + +    SSM      +  A GS   +  +K
Sbjct: 502  TVGEVSSSSASNLRSSVPP-ILGQQVVSPSPIPVVSSSMQERFTGKSAAPGSSGSNITIK 560

Query: 1272 TSAKSRDPRLRFINSDASALDLN-QPSGTHNMPKVEYAGTIISRKQKAIEEPSLDATVSK 1096
               ++RDPRLRF NSD  AL+ N QP   HN PKV+   T+ SRKQK +E+   D    K
Sbjct: 561  APTRNRDPRLRFSNSDMGALNPNPQPLTVHNAPKVDSVITLSSRKQKPLEDSKFDGPALK 620

Query: 1095 RLRSSLENPERNTREVRTTAGNGGWLEETTVAGSQLIERNHLMEKVETEPKKTV------ 934
            R R++L+N     ++ +T +G+GGWLE+    G  LI +N  +E   ++P++ V      
Sbjct: 621  RQRNTLDN-SGFVKDPKTASGSGGWLEDIGGVGPHLISKNQTVENTVSDPRQVVNDVSSC 679

Query: 933  STSSGNFNVASNGNEQAPVTSSNTAVSLPALLKDIAVN----------PTMLLNIL-IEQ 787
            ST+ GN N A++ NE   VT  +TA SLPAL+KDIAVN          PTMLLNIL + Q
Sbjct: 680  STADGNSNGANSSNEHLSVTDLSTA-SLPALVKDIAVNSAIFKDIAVNPTMLLNILKLGQ 738

Query: 786  QQRLXXXXXXXXXXXXXSTGQLTSSNSAMGTETTVNIGPSMTAGLPQGSAGMLPVXXXXX 607
            QQRL             S     SS+S + +  +V+I PS T  + Q  AG LPV     
Sbjct: 739  QQRLAAEAQQKSADPEKSMTNPISSSSILRSNASVDI-PSKTTAMLQTLAGTLPVSSQKA 797

Query: 606  XXXXSLQDDSGKFRMKPRDPRRILHGSNTLQKSGSLGSEQFKAIVSPMSNNQGTGDNVNA 427
                   D+SGK RMKPRDPRR+LHG N LQKSGSLG EQF+ IV+P+S++QG  DN+N 
Sbjct: 798  PT-----DESGKVRMKPRDPRRVLHG-NALQKSGSLGQEQFRNIVTPLSSSQGNKDNLNG 851

Query: 426  QKPEVRAETKLASTQSIAQPDITRQFTRNLKNIADIISVSQESSNHSPATQNVSSASVPF 247
            QK + + + KL ++QS+  PDI RQFT+NLKNIADIISVS  S++ + A+Q+VSS  VP 
Sbjct: 852  QKHDGQVDMKLVTSQSVEAPDIARQFTKNLKNIADIISVSNGSTSPTLASQSVSSQPVPI 911

Query: 246  TLDKKEQKSAVPNSQNLQAGIGSAPESCASGSSRSQSTWSDVEHLFEGYDXXXXXXXXXX 67
              ++ +     P ++    G  SA E+ A+G   S   W DVEHLFEGYD          
Sbjct: 912  KTERID-----PKTEEQGTGSISASEAAAAGPPHSAPMWGDVEHLFEGYDDQQKAAIQRE 966

Query: 66   XXXXXXXQNKMFAARKLSLVLD 1
                   Q KMFAARKL LVLD
Sbjct: 967  RARRIEEQKKMFAARKLCLVLD 988


>ref|XP_009359893.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Pyrus x bretschneideri]
          Length = 1278

 Score =  361 bits (926), Expect = 1e-96
 Identities = 242/542 (44%), Positives = 315/542 (58%), Gaps = 12/542 (2%)
 Frame = -1

Query: 1590 SEGSKLHFYETDALKAVSTYQQKFGRSSFFTNDKLPSPTPSGDCEEGVVDTNDEVXXXXX 1411
            +EGS LH Y+TDALKAVSTYQ    RSSFF +++LPSPTPS D ++G  DT  EV     
Sbjct: 452  AEGSGLHPYDTDALKAVSTYQH-INRSSFFMSERLPSPTPSEDGDKGDDDTVGEVSSSSS 510

Query: 1410 XXXXXXSKPTLLDQMPVSS--TSMDRSSMHGLINSRIDA-AGSGS-YPVKTSAKSRDPRL 1243
                  S P +L Q  VS     +  SSM G    +  A A SGS   +K   ++RDPRL
Sbjct: 511  ASNLRTSVPPILGQQVVSPFPIPVGSSSMQGRFTGKSAAPASSGSNITIKAPTRNRDPRL 570

Query: 1242 RFINSDASALDLN-QPSGTHNMPKVEYAGTIISRKQKAIEEPSLDATVSKRLRSSLENPE 1066
            RF NSD   L+ N QP   H+ PK++   T+ SRKQK IE+   D    KR R++L N  
Sbjct: 571  RFSNSDVGVLNHNPQPLTVHSAPKIDSVITLSSRKQKPIEDSKFDGPALKRQRNTLGN-S 629

Query: 1065 RNTREVRTTAGNGGWLEETTVAGSQLIERNHLMEKVETEPKKTV------STSSGNFNVA 904
               ++ +T +G+ GWLE+    G  LI  N  +E   ++P+K V      ST  GN N  
Sbjct: 630  GFVKDPKTASGSCGWLEDIGDFGPHLISNNQTVENTLSDPRKVVNVVSSPSTVDGNSNGP 689

Query: 903  SNGNEQAPVTSSNTAVSLPALLKDIAVNPTMLLNIL-IEQQQRLXXXXXXXXXXXXXSTG 727
            ++ NE   +T  +TA SLPALLK  AVNPTMLLNIL + QQQRL             S  
Sbjct: 690  NSSNEHVALTDLSTA-SLPALLK--AVNPTMLLNILKLGQQQRLAAEAQPKSAYPEKSMT 746

Query: 726  QLTSSNSAMGTETTVNIGPSMTAGLPQGSAGMLPVXXXXXXXXXSLQDDSGKFRMKPRDP 547
               SSNS  G++ +VN+ PS TA + Q     LP+            D+SGK RMKPRDP
Sbjct: 747  HPISSNSIPGSDASVNV-PSKTAAMLQ----TLPISSQKAP-----MDESGKVRMKPRDP 796

Query: 546  RRILHGSNTLQKSGSLGSEQFKAIVSPMSNNQGTGDNVNAQKPEVRAETKLASTQSIAQP 367
            RRILHG N LQKSGSLG EQF+ IV+P+ ++QG  DN+  +K + +++ KL ++QS+  P
Sbjct: 797  RRILHG-NVLQKSGSLGQEQFRNIVTPLPSSQGNKDNLTGEKQDGQSDMKLVTSQSVEAP 855

Query: 366  DITRQFTRNLKNIADIISVSQESSNHSPATQNVSSASVPFTLDKKEQKSAVPNSQNLQAG 187
            DI RQFT+NLKNIAD++SVS   ++ + A+Q+V S  VP   ++ + K     ++  + G
Sbjct: 856  DIARQFTKNLKNIADMMSVSNGLTSPAIASQSVPSQPVPINSERIDSK-----AEEQRTG 910

Query: 186  IGSAPESCASGSSRSQSTWSDVEHLFEGYDXXXXXXXXXXXXXXXXXQNKMFAARKLSLV 7
              SA E+ A+G SRS   W DVEHLFEGYD                 Q KMFAARKL LV
Sbjct: 911  SSSAFEAAAAGPSRSAPMWGDVEHLFEGYDDQQKVAIQRERARRIEEQKKMFAARKLCLV 970

Query: 6    LD 1
            LD
Sbjct: 971  LD 972


>ref|XP_008369646.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Malus domestica]
          Length = 1278

 Score =  358 bits (919), Expect = 8e-96
 Identities = 240/543 (44%), Positives = 315/543 (58%), Gaps = 12/543 (2%)
 Frame = -1

Query: 1593 DSEGSKLHFYETDALKAVSTYQQKFGRSSFFTNDKLPSPTPSGDCEEGVVDTNDEVXXXX 1414
            ++EGS LH Y+TDALKAVSTYQ    RSSFF +++LPSPTPS D ++G  DT  EV    
Sbjct: 451  NAEGSGLHPYDTDALKAVSTYQH-INRSSFFMSERLPSPTPSEDGDKGDADTVGEVSSSS 509

Query: 1413 XXXXXXXSKPTLLDQMPVSS--TSMDRSSMHGLINSRIDA-AGSGS-YPVKTSAKSRDPR 1246
                   S P +L Q  VS     +  SSM      +  A A SGS   +K   ++RDPR
Sbjct: 510  SASNLRTSGPPILGQQVVSPFPIPVGSSSMQERFTGKSAAPASSGSKITIKAPTRNRDPR 569

Query: 1245 LRFINSDASALDLN-QPSGTHNMPKVEYAGTIISRKQKAIEEPSLDATVSKRLRSSLENP 1069
            LRF NSD   L+ N QP   H+ PK++   T+ SRKQK +E+  LD    KR R++L N 
Sbjct: 570  LRFSNSDVGVLNHNPQPLTVHSAPKIDSVITLSSRKQKPLEDSKLDGPALKRQRNTLGN- 628

Query: 1068 ERNTREVRTTAGNGGWLEETTVAGSQLIERNHLMEKVETEPKKTV------STSSGNFNV 907
                ++ +T +G+ GWLE+    G  LI  N  +E  +++P+K V      ST  GN N 
Sbjct: 629  SGFVKDPKTASGSCGWLEDIGGVGPHLISNNQTVENTQSDPRKVVNVVSSPSTVDGNSNG 688

Query: 906  ASNGNEQAPVTSSNTAVSLPALLKDIAVNPTMLLNIL-IEQQQRLXXXXXXXXXXXXXST 730
             ++ NE   +T  +TA SLPALLK  AVNPTMLLNIL + QQQRL             S 
Sbjct: 689  PNSSNEHVSLTDLSTA-SLPALLK--AVNPTMLLNILKLGQQQRLAAEAQPKSAYPEKSM 745

Query: 729  GQLTSSNSAMGTETTVNIGPSMTAGLPQGSAGMLPVXXXXXXXXXSLQDDSGKFRMKPRD 550
                SSNS  G++ +VN+ PS T  + Q     LPV          L D+SGK  MKPRD
Sbjct: 746  THPISSNSIPGSDASVNV-PSKTTAMLQ----TLPVSSQKA-----LMDESGKVCMKPRD 795

Query: 549  PRRILHGSNTLQKSGSLGSEQFKAIVSPMSNNQGTGDNVNAQKPEVRAETKLASTQSIAQ 370
            PRRILHG N LQKS SLG EQF+ IV+P+ ++QG  DN+  +K + +++ KL ++QS+  
Sbjct: 796  PRRILHG-NALQKSRSLGQEQFRNIVTPLPSSQGNKDNLTGEKQDGQSDMKLVTSQSVEA 854

Query: 369  PDITRQFTRNLKNIADIISVSQESSNHSPATQNVSSASVPFTLDKKEQKSAVPNSQNLQA 190
            PDI RQFT+NLKNIAD++SVS   ++ + A+Q+V S  VP   ++ +     P ++  + 
Sbjct: 855  PDIARQFTKNLKNIADMMSVSNGLTSPAIASQSVPSQPVPINSERID-----PKAEEQRT 909

Query: 189  GIGSAPESCASGSSRSQSTWSDVEHLFEGYDXXXXXXXXXXXXXXXXXQNKMFAARKLSL 10
            G  SA E+ A+G SRS   W DVEHLFEGYD                 Q KMFAARKL L
Sbjct: 910  GSSSAFEAAAAGPSRSAPMWGDVEHLFEGYDDQQKVAIQRERSRRIEEQKKMFAARKLCL 969

Query: 9    VLD 1
            VLD
Sbjct: 970  VLD 972


Top