BLASTX nr result

ID: Glycyrrhiza29_contig00004654 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza29_contig00004654
         (1551 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_003545893.1 PREDICTED: RNA polymerase II C-terminal domain ph...   757   0.0  
XP_019425587.1 PREDICTED: RNA polymerase II C-terminal domain ph...   737   0.0  
KRH21483.1 hypothetical protein GLYMA_13G241400 [Glycine max]         724   0.0  
KHN08543.1 RNA polymerase II C-terminal domain phosphatase-like ...   724   0.0  
XP_003543063.1 PREDICTED: RNA polymerase II C-terminal domain ph...   724   0.0  
XP_006597421.1 PREDICTED: RNA polymerase II C-terminal domain ph...   721   0.0  
XP_006597420.1 PREDICTED: RNA polymerase II C-terminal domain ph...   719   0.0  
KRH21488.1 hypothetical protein GLYMA_13G241400 [Glycine max]         707   0.0  
XP_014621299.1 PREDICTED: RNA polymerase II C-terminal domain ph...   707   0.0  
KRH21482.1 hypothetical protein GLYMA_13G241400 [Glycine max]         688   0.0  
XP_015956482.1 PREDICTED: RNA polymerase II C-terminal domain ph...   698   0.0  
XP_016189791.1 PREDICTED: RNA polymerase II C-terminal domain ph...   696   0.0  
XP_017440613.1 PREDICTED: RNA polymerase II C-terminal domain ph...   692   0.0  
KOM31067.1 hypothetical protein LR48_Vigan01g062200 [Vigna angul...   692   0.0  
XP_003529311.2 PREDICTED: RNA polymerase II C-terminal domain ph...   691   0.0  
XP_014508623.1 PREDICTED: RNA polymerase II C-terminal domain ph...   691   0.0  
KRH21479.1 hypothetical protein GLYMA_13G241400 [Glycine max]         688   0.0  
XP_003542763.1 PREDICTED: RNA polymerase II C-terminal domain ph...   688   0.0  
KHN10024.1 RNA polymerase II C-terminal domain phosphatase-like ...   688   0.0  
KRH10837.1 hypothetical protein GLYMA_15G072000 [Glycine max]         682   0.0  

>XP_003545893.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X1 [Glycine max] KRH10835.1 hypothetical protein
            GLYMA_15G072000 [Glycine max]
          Length = 958

 Score =  757 bits (1955), Expect = 0.0
 Identities = 393/525 (74%), Positives = 439/525 (83%), Gaps = 9/525 (1%)
 Frame = +3

Query: 3    NYLVSEGDASASNGNKDSLKFDGMADAEVERRLKDAISASSTIPAMTSNLDPRLAFASSI 182
            NYLVSE DASASNGNK+ L FDGMADAEVERRLKDAISASST+PAMT+NLDPRLAF SS+
Sbjct: 421  NYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTVPAMTTNLDPRLAFNSSL 480

Query: 183  QYTMATSS-TVPPSTAQAAGMQFGNLQSPQSTALVKPMGQIVPLEHSLHSSPAREEGEVP 359
            QYTM +SS TVPP TAQA+ +QFGN+Q PQ   LVKP+ Q+ P   SLHSSPAREEGEVP
Sbjct: 481  QYTMVSSSGTVPPPTAQASIVQFGNVQFPQPNTLVKPICQVTPPGPSLHSSPAREEGEVP 540

Query: 360  ESELDPDTRRRLLILQHGKDTREHTSVEPPFPIRHPIQASPPHVPSRGGWLSVEENLGTQ 539
            ESELD DTRRRLLILQHG+DTREHTS EPP P+RHP Q S P VPSR GW SVEE +G Q
Sbjct: 541  ESELDLDTRRRLLILQHGQDTREHTSSEPPLPVRHPTQVSAPSVPSRRGWFSVEEEMGPQ 600

Query: 540  QLSQVIPKEFPIDSEPLRIEKHWPHHPSLSSKVDNSIPSDRIFHENLRRLPKEVHH---- 707
            QL+Q++PKEFP+ SEPL IEK WP HPSL SKVD+S+ SDR+FHE+ +RLPKEVHH    
Sbjct: 601  QLNQLVPKEFPVGSEPLHIEKRWPRHPSLFSKVDDSVSSDRVFHESHQRLPKEVHHRDDH 660

Query: 708  ----HTLSSYHSFSGDVIPLGSSSSNNMDLDSESERSLFHADSTDGVLREIALKCGTKVE 875
                 +LSSYHSF GD IPL  SS +N D DSES RSLFHAD T GVL+EIALKCGTKVE
Sbjct: 661  SRLSQSLSSYHSFPGDDIPLSGSSYSNRDFDSESGRSLFHADITAGVLQEIALKCGTKVE 720

Query: 876  FWSSLVASTELQFSIEAWFAGKKIGEGIGRTRREAQHKAAEYSIKQLADIYLSRAKDDPG 1055
            F SSLVAST LQFSIEAWFAGKK+GEG GRTRREAQ+KAAE SIKQLADIY+S AKDD G
Sbjct: 721  FLSSLVASTALQFSIEAWFAGKKVGEGFGRTRREAQNKAAECSIKQLADIYMSHAKDDSG 780

Query: 1056 STYGEVSGFHGSHENGFVGSVNSLVNDLLPKEESVPFSTLSEPSRNLDPRLEVSKRSMGS 1235
            STYG+VSGFHGS+ NGFV S NSL N LLPK ESV FST S+ SR  DPRLEVSKRS  S
Sbjct: 781  STYGDVSGFHGSNNNGFVSSGNSLGNQLLPK-ESVSFSTSSDSSRVSDPRLEVSKRSTDS 839

Query: 1236 ISALKELCMMEGLAVSFQSPPAPVSTNFVQKDKVHAQVEIDGQVFGKGIGLTWDEAKVQA 1415
            ISALKE CMMEGLA +FQS PAP ST+F QKD+VHAQVEIDGQ+FGKG GLTW+EAK+QA
Sbjct: 840  ISALKEFCMMEGLAANFQSSPAPASTHFAQKDEVHAQVEIDGQIFGKGFGLTWEEAKMQA 899

Query: 1416 AEKALGSLRTVLDQGTQKRQRSPRSSLQGLSNKRLRQDNSRTMQR 1550
            A+KAL SLRT+ +QGT+KR  SPR S+QGL+NKRL+Q+  RT+QR
Sbjct: 900  AKKALESLRTMFNQGTRKRHGSPR-SMQGLANKRLKQEYPRTLQR 943


>XP_019425587.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X1 [Lupinus angustifolius] XP_019425589.1
            PREDICTED: RNA polymerase II C-terminal domain
            phosphatase-like 1 isoform X1 [Lupinus angustifolius]
            OIV91757.1 hypothetical protein TanjilG_26610 [Lupinus
            angustifolius]
          Length = 963

 Score =  737 bits (1902), Expect = 0.0
 Identities = 383/525 (72%), Positives = 432/525 (82%), Gaps = 9/525 (1%)
 Frame = +3

Query: 3    NYLVSEGDASASNGNKDSLKFDGMADAEVERRLKDAISASSTIPAMTSNLDPRLAFASSI 182
            NYLVSE DASASNGN DSL+FDGMADAEVERRLK+A+ A+S+IP +T+NLDPRLA  SS+
Sbjct: 428  NYLVSEDDASASNGNIDSLQFDGMADAEVERRLKEALLAASSIPPITANLDPRLA--SSL 485

Query: 183  QYTMATSS-TVPPSTAQAAGMQFGNLQSPQSTALVKPMGQIVPLEHSLHSSPAREEGEVP 359
            QYT A+SS TVPP T QA  +Q  N+Q PQS  LVKPM Q+ P E SLHSSPAREEGEVP
Sbjct: 486  QYTTASSSGTVPPPTVQAPVIQIANMQFPQSATLVKPMSQVAP-EQSLHSSPAREEGEVP 544

Query: 360  ESELDPDTRRRLLILQHGKDTREHTSVEPPFPIRHPIQASPPHVPSRGGWLSVEENLGTQ 539
            ESELDPDTRRRLLILQHG+D RE+TS EPPFP+R P+Q SPPH+PS  GW  V+E  G Q
Sbjct: 545  ESELDPDTRRRLLILQHGQDIRENTSSEPPFPVRLPVQVSPPHIPSHAGWFPVKEERGPQ 604

Query: 540  QLSQVIPKEFPIDSEPLRIEKHWPHHPSLSSKVDNSIPSDRIFHENLRRLPKEVHH---- 707
            QL++V+PKEFP++SEPL IEK WP  PS  S VDN + SDRI HEN +RLPKEV+H    
Sbjct: 605  QLNRVVPKEFPVESEPLHIEKKWPRRPSFFSNVDNPMSSDRILHENHQRLPKEVYHRDDR 664

Query: 708  ----HTLSSYHSFSGDVIPLGSSSSNNMDLDSESERSLFHADSTDGVLREIALKCGTKVE 875
                HT S YHSF+GD IPLGS+SS+N DLDSES   LF+ADS  GVLREIALKCGT+VE
Sbjct: 665  LRLNHTHSGYHSFAGDDIPLGSTSSSNWDLDSESGHPLFYADSPAGVLREIALKCGTRVE 724

Query: 876  FWSSLVASTELQFSIEAWFAGKKIGEGIGRTRREAQHKAAEYSIKQLADIYLSRAKDDPG 1055
            F SSLVASTELQFSIEAWFAGKKIGEGIGRTR+EAQ+KAAE SIKQLADIY+S  K D G
Sbjct: 725  FLSSLVASTELQFSIEAWFAGKKIGEGIGRTRKEAQYKAAEDSIKQLADIYMSHTKADSG 784

Query: 1056 STYGEVSGFHGSHENGFVGSVNSLVNDLLPKEESVPFSTLSEPSRNLDPRLEVSKRSMGS 1235
            STYG+V+ F G  +NGF+ SVNSL N LLPKEE   FST S+P R LDPR EV KRSMGS
Sbjct: 785  STYGDVTAFPGVEDNGFMSSVNSLGNQLLPKEELDSFSTASDPLRGLDPRFEV-KRSMGS 843

Query: 1236 ISALKELCMMEGLAVSFQSPPAPVSTNFVQKDKVHAQVEIDGQVFGKGIGLTWDEAKVQA 1415
            ISALKELCMMEGL VSFQSPP PVSTNFVQKD+VHAQVEIDGQVFGKGIGLTW+EAK+QA
Sbjct: 844  ISALKELCMMEGLGVSFQSPPTPVSTNFVQKDEVHAQVEIDGQVFGKGIGLTWNEAKMQA 903

Query: 1416 AEKALGSLRTVLDQGTQKRQRSPRSSLQGLSNKRLRQDNSRTMQR 1550
            A+KALGSLRT+L +GTQKRQ SP    +G SNKR++Q+  RT QR
Sbjct: 904  ADKALGSLRTMLGEGTQKRQGSPLRPWRGFSNKRMKQEYPRTPQR 948


>KRH21483.1 hypothetical protein GLYMA_13G241400 [Glycine max]
          Length = 681

 Score =  724 bits (1870), Expect = 0.0
 Identities = 381/527 (72%), Positives = 426/527 (80%), Gaps = 11/527 (2%)
 Frame = +3

Query: 3    NYLVSEGDASASNGNKDSLKFDGMADAEVERRLKDAISASSTIPAMTSNLDPRLAFASSI 182
            NYLVSE DASASNGNK+ L FDGMADAEVERRLKDAISASSTI A+T+N+DPRLAF SS+
Sbjct: 141  NYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTILALTANIDPRLAFTSSL 200

Query: 183  QYTMATSS-TVPPSTAQAAGMQFGNLQSPQSTALVKPMGQIVPLEHSLHSSPAREEGEVP 359
            QYTM +SS TVPP TAQA+ +QFGN+Q PQ   LVKPM Q+     SLHSSPAREEGE+P
Sbjct: 201  QYTMVSSSGTVPPPTAQASVVQFGNVQFPQPNTLVKPMSQVTHPGLSLHSSPAREEGELP 260

Query: 360  ESELDPDTRRRLLILQHGKDTREHTSVEPPFPIRHPIQASPP--HVPSRGGWLSVEENLG 533
            ESELD DTRRR LILQHG+DTRE  + EPPFP+RHP Q S P   VPSR GW SVEE +G
Sbjct: 261  ESELDLDTRRRFLILQHGQDTRERMASEPPFPVRHPAQVSAPASSVPSRRGWFSVEEEMG 320

Query: 534  TQQLSQVIPKEFPIDSEPLRIEKHWPHHPSLSSKVDNSIPSDRIFHENLRRLPKEVHHH- 710
             QQL+  +PKEFP+DSEP  IEK WP HPS  SKV +SI SDR+FHE+ +RLPKEVHH  
Sbjct: 321  PQQLNLPVPKEFPVDSEPFHIEKRWPRHPSFFSKVGDSISSDRVFHESHQRLPKEVHHRD 380

Query: 711  -------TLSSYHSFSGDVIPLGSSSSNNMDLDSESERSLFHADSTDGVLREIALKCGTK 869
                   +LSSYHS  GD IPL  SS +N D DSES RSLFHAD+T GVL+EIAL CGTK
Sbjct: 381  DRSRLSQSLSSYHSLPGDDIPLSGSSYSNRDFDSESGRSLFHADTTAGVLQEIALNCGTK 440

Query: 870  VEFWSSLVASTELQFSIEAWFAGKKIGEGIGRTRREAQHKAAEYSIKQLADIYLSRAKDD 1049
            VEF SSLVASTELQFSIEAWFAGKKIGEG GRTRREAQ KAA  SIKQLADIY+S AKDD
Sbjct: 441  VEFLSSLVASTELQFSIEAWFAGKKIGEGFGRTRREAQSKAAGCSIKQLADIYMSHAKDD 500

Query: 1050 PGSTYGEVSGFHGSHENGFVGSVNSLVNDLLPKEESVPFSTLSEPSRNLDPRLEVSKRSM 1229
             GSTYG+VSGFHGS+ +GFV S NSL N LLPKEES  FST SE SR  D RLEVSKRS 
Sbjct: 501  SGSTYGDVSGFHGSNNDGFVSSGNSLGNQLLPKEESGSFSTASESSRVSDSRLEVSKRST 560

Query: 1230 GSISALKELCMMEGLAVSFQSPPAPVSTNFVQKDKVHAQVEIDGQVFGKGIGLTWDEAKV 1409
             SISALKELCMMEGLA SFQSPPA  ST+  QKD+VHAQVEIDGQ+FGKG G+TW+EAK+
Sbjct: 561  DSISALKELCMMEGLAASFQSPPASASTHLTQKDEVHAQVEIDGQIFGKGFGVTWEEAKM 620

Query: 1410 QAAEKALGSLRTVLDQGTQKRQRSPRSSLQGLSNKRLRQDNSRTMQR 1550
            QAA+KALGSLRT+ +QG+ KR  SPR S+QGL+NKRL+ +   T+QR
Sbjct: 621  QAAKKALGSLRTMFNQGSLKRHGSPR-SMQGLANKRLKPEYPPTLQR 666


>KHN08543.1 RNA polymerase II C-terminal domain phosphatase-like 1 [Glycine soja]
          Length = 960

 Score =  724 bits (1870), Expect = 0.0
 Identities = 381/527 (72%), Positives = 426/527 (80%), Gaps = 11/527 (2%)
 Frame = +3

Query: 3    NYLVSEGDASASNGNKDSLKFDGMADAEVERRLKDAISASSTIPAMTSNLDPRLAFASSI 182
            NYLVSE DASASNGNK+ L FDGMADAEVERRLKDAISASSTI A+T+N+DPRLAF SS+
Sbjct: 420  NYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTILALTANIDPRLAFTSSL 479

Query: 183  QYTMATSS-TVPPSTAQAAGMQFGNLQSPQSTALVKPMGQIVPLEHSLHSSPAREEGEVP 359
            QYTM +SS TVPP TAQA+ +QFGN+Q PQ   LVKPM Q+     SLHSSPAREEGE+P
Sbjct: 480  QYTMVSSSGTVPPPTAQASVVQFGNVQFPQPNTLVKPMSQVTHPGLSLHSSPAREEGELP 539

Query: 360  ESELDPDTRRRLLILQHGKDTREHTSVEPPFPIRHPIQASPP--HVPSRGGWLSVEENLG 533
            ESELD DTRRR LILQHG+DTRE  + EPPFP+RHP Q S P   VPSR GW SVEE +G
Sbjct: 540  ESELDLDTRRRFLILQHGQDTRERMASEPPFPVRHPAQVSAPASSVPSRRGWFSVEEEMG 599

Query: 534  TQQLSQVIPKEFPIDSEPLRIEKHWPHHPSLSSKVDNSIPSDRIFHENLRRLPKEVHHH- 710
             QQL+  +PKEFP+DSEP  IEK WP HPS  SKV +SI SDR+FHE+ +RLPKEVHH  
Sbjct: 600  PQQLNLPVPKEFPVDSEPFHIEKRWPRHPSFFSKVGDSISSDRVFHESHQRLPKEVHHRD 659

Query: 711  -------TLSSYHSFSGDVIPLGSSSSNNMDLDSESERSLFHADSTDGVLREIALKCGTK 869
                   +LSSYHS  GD IPL  SS +N D DSES RSLFHAD+T GVL+EIAL CGTK
Sbjct: 660  DRSRLSQSLSSYHSLPGDDIPLSGSSYSNRDFDSESGRSLFHADTTAGVLQEIALNCGTK 719

Query: 870  VEFWSSLVASTELQFSIEAWFAGKKIGEGIGRTRREAQHKAAEYSIKQLADIYLSRAKDD 1049
            VEF SSLVASTELQFSIEAWFAGKKIGEG GRTRREAQ KAA  SIKQLADIY+S AKDD
Sbjct: 720  VEFLSSLVASTELQFSIEAWFAGKKIGEGFGRTRREAQSKAAGCSIKQLADIYMSHAKDD 779

Query: 1050 PGSTYGEVSGFHGSHENGFVGSVNSLVNDLLPKEESVPFSTLSEPSRNLDPRLEVSKRSM 1229
             GSTYG+VSGFHGS+ +GFV S NSL N LLPKEES  FST SE SR  D RLEVSKRS 
Sbjct: 780  SGSTYGDVSGFHGSNNDGFVSSGNSLGNQLLPKEESGSFSTASESSRVSDSRLEVSKRST 839

Query: 1230 GSISALKELCMMEGLAVSFQSPPAPVSTNFVQKDKVHAQVEIDGQVFGKGIGLTWDEAKV 1409
             SISALKELCMMEGLA SFQSPPA  ST+  QKD+VHAQVEIDGQ+FGKG G+TW+EAK+
Sbjct: 840  DSISALKELCMMEGLAASFQSPPASASTHLTQKDEVHAQVEIDGQIFGKGFGVTWEEAKM 899

Query: 1410 QAAEKALGSLRTVLDQGTQKRQRSPRSSLQGLSNKRLRQDNSRTMQR 1550
            QAA+KALGSLRT+ +QG+ KR  SPR S+QGL+NKRL+ +   T+QR
Sbjct: 900  QAAKKALGSLRTMFNQGSLKRHGSPR-SMQGLANKRLKPEYPPTLQR 945


>XP_003543063.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X1 [Glycine max] XP_006594604.1 PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 1
            isoform X1 [Glycine max] KRH21480.1 hypothetical protein
            GLYMA_13G241400 [Glycine max] KRH21481.1 hypothetical
            protein GLYMA_13G241400 [Glycine max]
          Length = 960

 Score =  724 bits (1870), Expect = 0.0
 Identities = 381/527 (72%), Positives = 426/527 (80%), Gaps = 11/527 (2%)
 Frame = +3

Query: 3    NYLVSEGDASASNGNKDSLKFDGMADAEVERRLKDAISASSTIPAMTSNLDPRLAFASSI 182
            NYLVSE DASASNGNK+ L FDGMADAEVERRLKDAISASSTI A+T+N+DPRLAF SS+
Sbjct: 420  NYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTILALTANIDPRLAFTSSL 479

Query: 183  QYTMATSS-TVPPSTAQAAGMQFGNLQSPQSTALVKPMGQIVPLEHSLHSSPAREEGEVP 359
            QYTM +SS TVPP TAQA+ +QFGN+Q PQ   LVKPM Q+     SLHSSPAREEGE+P
Sbjct: 480  QYTMVSSSGTVPPPTAQASVVQFGNVQFPQPNTLVKPMSQVTHPGLSLHSSPAREEGELP 539

Query: 360  ESELDPDTRRRLLILQHGKDTREHTSVEPPFPIRHPIQASPP--HVPSRGGWLSVEENLG 533
            ESELD DTRRR LILQHG+DTRE  + EPPFP+RHP Q S P   VPSR GW SVEE +G
Sbjct: 540  ESELDLDTRRRFLILQHGQDTRERMASEPPFPVRHPAQVSAPASSVPSRRGWFSVEEEMG 599

Query: 534  TQQLSQVIPKEFPIDSEPLRIEKHWPHHPSLSSKVDNSIPSDRIFHENLRRLPKEVHHH- 710
             QQL+  +PKEFP+DSEP  IEK WP HPS  SKV +SI SDR+FHE+ +RLPKEVHH  
Sbjct: 600  PQQLNLPVPKEFPVDSEPFHIEKRWPRHPSFFSKVGDSISSDRVFHESHQRLPKEVHHRD 659

Query: 711  -------TLSSYHSFSGDVIPLGSSSSNNMDLDSESERSLFHADSTDGVLREIALKCGTK 869
                   +LSSYHS  GD IPL  SS +N D DSES RSLFHAD+T GVL+EIAL CGTK
Sbjct: 660  DRSRLSQSLSSYHSLPGDDIPLSGSSYSNRDFDSESGRSLFHADTTAGVLQEIALNCGTK 719

Query: 870  VEFWSSLVASTELQFSIEAWFAGKKIGEGIGRTRREAQHKAAEYSIKQLADIYLSRAKDD 1049
            VEF SSLVASTELQFSIEAWFAGKKIGEG GRTRREAQ KAA  SIKQLADIY+S AKDD
Sbjct: 720  VEFLSSLVASTELQFSIEAWFAGKKIGEGFGRTRREAQSKAAGCSIKQLADIYMSHAKDD 779

Query: 1050 PGSTYGEVSGFHGSHENGFVGSVNSLVNDLLPKEESVPFSTLSEPSRNLDPRLEVSKRSM 1229
             GSTYG+VSGFHGS+ +GFV S NSL N LLPKEES  FST SE SR  D RLEVSKRS 
Sbjct: 780  SGSTYGDVSGFHGSNNDGFVSSGNSLGNQLLPKEESGSFSTASESSRVSDSRLEVSKRST 839

Query: 1230 GSISALKELCMMEGLAVSFQSPPAPVSTNFVQKDKVHAQVEIDGQVFGKGIGLTWDEAKV 1409
             SISALKELCMMEGLA SFQSPPA  ST+  QKD+VHAQVEIDGQ+FGKG G+TW+EAK+
Sbjct: 840  DSISALKELCMMEGLAASFQSPPASASTHLTQKDEVHAQVEIDGQIFGKGFGVTWEEAKM 899

Query: 1410 QAAEKALGSLRTVLDQGTQKRQRSPRSSLQGLSNKRLRQDNSRTMQR 1550
            QAA+KALGSLRT+ +QG+ KR  SPR S+QGL+NKRL+ +   T+QR
Sbjct: 900  QAAKKALGSLRTMFNQGSLKRHGSPR-SMQGLANKRLKPEYPPTLQR 945


>XP_006597421.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X3 [Glycine max] KRH10836.1 hypothetical protein
            GLYMA_15G072000 [Glycine max]
          Length = 932

 Score =  721 bits (1861), Expect = 0.0
 Identities = 376/525 (71%), Positives = 421/525 (80%), Gaps = 9/525 (1%)
 Frame = +3

Query: 3    NYLVSEGDASASNGNKDSLKFDGMADAEVERRLKDAISASSTIPAMTSNLDPRLAFASSI 182
            NYLVSE DASASNGNK+ L FDGMADAEVERRLKDAISASST+PAMT+NLDPRLAF SS+
Sbjct: 421  NYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTVPAMTTNLDPRLAFNSSL 480

Query: 183  QYTMATSS-TVPPSTAQAAGMQFGNLQSPQSTALVKPMGQIVPLEHSLHSSPAREEGEVP 359
            QYTM +SS TVPP TAQA+ +QFGN+Q PQ   LVKP+ Q+ P   SLHSSPAREEGEVP
Sbjct: 481  QYTMVSSSGTVPPPTAQASIVQFGNVQFPQPNTLVKPICQVTPPGPSLHSSPAREEGEVP 540

Query: 360  ESELDPDTRRRLLILQHGKDTREHTSVEPPFPIRHPIQASPPHVPSRGGWLSVEENLGTQ 539
            ESELD DTRRRLLILQHG+DTREHTS EPP P+RHP Q S P VPSR GW SVEE +G Q
Sbjct: 541  ESELDLDTRRRLLILQHGQDTREHTSSEPPLPVRHPTQVSAPSVPSRRGWFSVEEEMGPQ 600

Query: 540  QLSQVIPKEFPIDSEPLRIEKHWPHHPSLSSKVDNSIPSDRIFHENLRRLPKEVHH---- 707
            QL+Q++PKEFP+ SEPL IEK WP HPSL SKVD+S+ SDR+FHE+ +RLPKEVHH    
Sbjct: 601  QLNQLVPKEFPVGSEPLHIEKRWPRHPSLFSKVDDSVSSDRVFHESHQRLPKEVHHRDDH 660

Query: 708  ----HTLSSYHSFSGDVIPLGSSSSNNMDLDSESERSLFHADSTDGVLREIALKCGTKVE 875
                 +LSSYHSF GD IPL  SS +N D DSES RSLFHAD T GVL+EIALKCGTKVE
Sbjct: 661  SRLSQSLSSYHSFPGDDIPLSGSSYSNRDFDSESGRSLFHADITAGVLQEIALKCGTKVE 720

Query: 876  FWSSLVASTELQFSIEAWFAGKKIGEGIGRTRREAQHKAAEYSIKQLADIYLSRAKDDPG 1055
            F SSLVAST LQFSIEAWFAGKK+GEG GRTRREAQ+KAAE SIKQLADIY+S AKDD G
Sbjct: 721  FLSSLVASTALQFSIEAWFAGKKVGEGFGRTRREAQNKAAECSIKQLADIYMSHAKDDSG 780

Query: 1056 STYGEVSGFHGSHENGFVGSVNSLVNDLLPKEESVPFSTLSEPSRNLDPRLEVSKRSMGS 1235
            STYG+VSGFHGS+ NGFV S                           DPRLEVSKRS  S
Sbjct: 781  STYGDVSGFHGSNNNGFVSS---------------------------DPRLEVSKRSTDS 813

Query: 1236 ISALKELCMMEGLAVSFQSPPAPVSTNFVQKDKVHAQVEIDGQVFGKGIGLTWDEAKVQA 1415
            ISALKE CMMEGLA +FQS PAP ST+F QKD+VHAQVEIDGQ+FGKG GLTW+EAK+QA
Sbjct: 814  ISALKEFCMMEGLAANFQSSPAPASTHFAQKDEVHAQVEIDGQIFGKGFGLTWEEAKMQA 873

Query: 1416 AEKALGSLRTVLDQGTQKRQRSPRSSLQGLSNKRLRQDNSRTMQR 1550
            A+KAL SLRT+ +QGT+KR  SPR S+QGL+NKRL+Q+  RT+QR
Sbjct: 874  AKKALESLRTMFNQGTRKRHGSPR-SMQGLANKRLKQEYPRTLQR 917


>XP_006597420.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X2 [Glycine max]
          Length = 937

 Score =  719 bits (1855), Expect = 0.0
 Identities = 378/517 (73%), Positives = 421/517 (81%), Gaps = 1/517 (0%)
 Frame = +3

Query: 3    NYLVSEGDASASNGNKDSLKFDGMADAEVERRLKDAISASSTIPAMTSNLDPRLAFASSI 182
            NYLVSE DASASNGNK+ L FDGMADAEVERRLKDAISASST+PAMT+NLDPRLAF SS+
Sbjct: 421  NYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTVPAMTTNLDPRLAFNSSL 480

Query: 183  QYTMATSS-TVPPSTAQAAGMQFGNLQSPQSTALVKPMGQIVPLEHSLHSSPAREEGEVP 359
            QYTM +SS TVPP TAQA+ +QFGN+Q PQ   LVKP+ Q+ P   SLHSSPAREEGEVP
Sbjct: 481  QYTMVSSSGTVPPPTAQASIVQFGNVQFPQPNTLVKPICQVTPPGPSLHSSPAREEGEVP 540

Query: 360  ESELDPDTRRRLLILQHGKDTREHTSVEPPFPIRHPIQASPPHVPSRGGWLSVEENLGTQ 539
            ESELD DTRRRLLILQHG+DTREHTS EPP P+RHP Q S P VPSR GW SVEE +G Q
Sbjct: 541  ESELDLDTRRRLLILQHGQDTREHTSSEPPLPVRHPTQVSAPSVPSRRGWFSVEEEMGPQ 600

Query: 540  QLSQVIPKEFPIDSEPLRIEKHWPHHPSLSSKVDNSIPSDRIFHENLRRLPKEVHHHTLS 719
            QL+Q++PKEFP+ SEPL IEK WP HPSL SKV +     R+               +LS
Sbjct: 601  QLNQLVPKEFPVGSEPLHIEKRWPRHPSLFSKVHHRDDHSRL-------------SQSLS 647

Query: 720  SYHSFSGDVIPLGSSSSNNMDLDSESERSLFHADSTDGVLREIALKCGTKVEFWSSLVAS 899
            SYHSF GD IPL  SS +N D DSES RSLFHAD T GVL+EIALKCGTKVEF SSLVAS
Sbjct: 648  SYHSFPGDDIPLSGSSYSNRDFDSESGRSLFHADITAGVLQEIALKCGTKVEFLSSLVAS 707

Query: 900  TELQFSIEAWFAGKKIGEGIGRTRREAQHKAAEYSIKQLADIYLSRAKDDPGSTYGEVSG 1079
            T LQFSIEAWFAGKK+GEG GRTRREAQ+KAAE SIKQLADIY+S AKDD GSTYG+VSG
Sbjct: 708  TALQFSIEAWFAGKKVGEGFGRTRREAQNKAAECSIKQLADIYMSHAKDDSGSTYGDVSG 767

Query: 1080 FHGSHENGFVGSVNSLVNDLLPKEESVPFSTLSEPSRNLDPRLEVSKRSMGSISALKELC 1259
            FHGS+ NGFV S NSL N LLPK ESV FST S+ SR  DPRLEVSKRS  SISALKE C
Sbjct: 768  FHGSNNNGFVSSGNSLGNQLLPK-ESVSFSTSSDSSRVSDPRLEVSKRSTDSISALKEFC 826

Query: 1260 MMEGLAVSFQSPPAPVSTNFVQKDKVHAQVEIDGQVFGKGIGLTWDEAKVQAAEKALGSL 1439
            MMEGLA +FQS PAP ST+F QKD+VHAQVEIDGQ+FGKG GLTW+EAK+QAA+KAL SL
Sbjct: 827  MMEGLAANFQSSPAPASTHFAQKDEVHAQVEIDGQIFGKGFGLTWEEAKMQAAKKALESL 886

Query: 1440 RTVLDQGTQKRQRSPRSSLQGLSNKRLRQDNSRTMQR 1550
            RT+ +QGT+KR  SPR S+QGL+NKRL+Q+  RT+QR
Sbjct: 887  RTMFNQGTRKRHGSPR-SMQGLANKRLKQEYPRTLQR 922


>KRH21488.1 hypothetical protein GLYMA_13G241400 [Glycine max]
          Length = 648

 Score =  707 bits (1826), Expect = 0.0
 Identities = 370/508 (72%), Positives = 411/508 (80%), Gaps = 11/508 (2%)
 Frame = +3

Query: 3    NYLVSEGDASASNGNKDSLKFDGMADAEVERRLKDAISASSTIPAMTSNLDPRLAFASSI 182
            NYLVSE DASASNGNK+ L FDGMADAEVERRLKDAISASSTI A+T+N+DPRLAF SS+
Sbjct: 141  NYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTILALTANIDPRLAFTSSL 200

Query: 183  QYTMATSS-TVPPSTAQAAGMQFGNLQSPQSTALVKPMGQIVPLEHSLHSSPAREEGEVP 359
            QYTM +SS TVPP TAQA+ +QFGN+Q PQ   LVKPM Q+     SLHSSPAREEGE+P
Sbjct: 201  QYTMVSSSGTVPPPTAQASVVQFGNVQFPQPNTLVKPMSQVTHPGLSLHSSPAREEGELP 260

Query: 360  ESELDPDTRRRLLILQHGKDTREHTSVEPPFPIRHPIQASPP--HVPSRGGWLSVEENLG 533
            ESELD DTRRR LILQHG+DTRE  + EPPFP+RHP Q S P   VPSR GW SVEE +G
Sbjct: 261  ESELDLDTRRRFLILQHGQDTRERMASEPPFPVRHPAQVSAPASSVPSRRGWFSVEEEMG 320

Query: 534  TQQLSQVIPKEFPIDSEPLRIEKHWPHHPSLSSKVDNSIPSDRIFHENLRRLPKEVHHH- 710
             QQL+  +PKEFP+DSEP  IEK WP HPS  SKV +SI SDR+FHE+ +RLPKEVHH  
Sbjct: 321  PQQLNLPVPKEFPVDSEPFHIEKRWPRHPSFFSKVGDSISSDRVFHESHQRLPKEVHHRD 380

Query: 711  -------TLSSYHSFSGDVIPLGSSSSNNMDLDSESERSLFHADSTDGVLREIALKCGTK 869
                   +LSSYHS  GD IPL  SS +N D DSES RSLFHAD+T GVL+EIAL CGTK
Sbjct: 381  DRSRLSQSLSSYHSLPGDDIPLSGSSYSNRDFDSESGRSLFHADTTAGVLQEIALNCGTK 440

Query: 870  VEFWSSLVASTELQFSIEAWFAGKKIGEGIGRTRREAQHKAAEYSIKQLADIYLSRAKDD 1049
            VEF SSLVASTELQFSIEAWFAGKKIGEG GRTRREAQ KAA  SIKQLADIY+S AKDD
Sbjct: 441  VEFLSSLVASTELQFSIEAWFAGKKIGEGFGRTRREAQSKAAGCSIKQLADIYMSHAKDD 500

Query: 1050 PGSTYGEVSGFHGSHENGFVGSVNSLVNDLLPKEESVPFSTLSEPSRNLDPRLEVSKRSM 1229
             GSTYG+VSGFHGS+ +GFV S NSL N LLPKEES  FST SE SR  D RLEVSKRS 
Sbjct: 501  SGSTYGDVSGFHGSNNDGFVSSGNSLGNQLLPKEESGSFSTASESSRVSDSRLEVSKRST 560

Query: 1230 GSISALKELCMMEGLAVSFQSPPAPVSTNFVQKDKVHAQVEIDGQVFGKGIGLTWDEAKV 1409
             SISALKELCMMEGLA SFQSPPA  ST+  QKD+VHAQVEIDGQ+FGKG G+TW+EAK+
Sbjct: 561  DSISALKELCMMEGLAASFQSPPASASTHLTQKDEVHAQVEIDGQIFGKGFGVTWEEAKM 620

Query: 1410 QAAEKALGSLRTVLDQGTQKRQRSPRSS 1493
            QAA+KALGSLRT+ +QG+ KR  SPR +
Sbjct: 621  QAAKKALGSLRTMFNQGSLKRHGSPREN 648


>XP_014621299.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X3 [Glycine max] KRH21485.1 hypothetical protein
            GLYMA_13G241400 [Glycine max] KRH21486.1 hypothetical
            protein GLYMA_13G241400 [Glycine max]
          Length = 927

 Score =  707 bits (1826), Expect = 0.0
 Identities = 370/508 (72%), Positives = 411/508 (80%), Gaps = 11/508 (2%)
 Frame = +3

Query: 3    NYLVSEGDASASNGNKDSLKFDGMADAEVERRLKDAISASSTIPAMTSNLDPRLAFASSI 182
            NYLVSE DASASNGNK+ L FDGMADAEVERRLKDAISASSTI A+T+N+DPRLAF SS+
Sbjct: 420  NYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTILALTANIDPRLAFTSSL 479

Query: 183  QYTMATSS-TVPPSTAQAAGMQFGNLQSPQSTALVKPMGQIVPLEHSLHSSPAREEGEVP 359
            QYTM +SS TVPP TAQA+ +QFGN+Q PQ   LVKPM Q+     SLHSSPAREEGE+P
Sbjct: 480  QYTMVSSSGTVPPPTAQASVVQFGNVQFPQPNTLVKPMSQVTHPGLSLHSSPAREEGELP 539

Query: 360  ESELDPDTRRRLLILQHGKDTREHTSVEPPFPIRHPIQASPP--HVPSRGGWLSVEENLG 533
            ESELD DTRRR LILQHG+DTRE  + EPPFP+RHP Q S P   VPSR GW SVEE +G
Sbjct: 540  ESELDLDTRRRFLILQHGQDTRERMASEPPFPVRHPAQVSAPASSVPSRRGWFSVEEEMG 599

Query: 534  TQQLSQVIPKEFPIDSEPLRIEKHWPHHPSLSSKVDNSIPSDRIFHENLRRLPKEVHHH- 710
             QQL+  +PKEFP+DSEP  IEK WP HPS  SKV +SI SDR+FHE+ +RLPKEVHH  
Sbjct: 600  PQQLNLPVPKEFPVDSEPFHIEKRWPRHPSFFSKVGDSISSDRVFHESHQRLPKEVHHRD 659

Query: 711  -------TLSSYHSFSGDVIPLGSSSSNNMDLDSESERSLFHADSTDGVLREIALKCGTK 869
                   +LSSYHS  GD IPL  SS +N D DSES RSLFHAD+T GVL+EIAL CGTK
Sbjct: 660  DRSRLSQSLSSYHSLPGDDIPLSGSSYSNRDFDSESGRSLFHADTTAGVLQEIALNCGTK 719

Query: 870  VEFWSSLVASTELQFSIEAWFAGKKIGEGIGRTRREAQHKAAEYSIKQLADIYLSRAKDD 1049
            VEF SSLVASTELQFSIEAWFAGKKIGEG GRTRREAQ KAA  SIKQLADIY+S AKDD
Sbjct: 720  VEFLSSLVASTELQFSIEAWFAGKKIGEGFGRTRREAQSKAAGCSIKQLADIYMSHAKDD 779

Query: 1050 PGSTYGEVSGFHGSHENGFVGSVNSLVNDLLPKEESVPFSTLSEPSRNLDPRLEVSKRSM 1229
             GSTYG+VSGFHGS+ +GFV S NSL N LLPKEES  FST SE SR  D RLEVSKRS 
Sbjct: 780  SGSTYGDVSGFHGSNNDGFVSSGNSLGNQLLPKEESGSFSTASESSRVSDSRLEVSKRST 839

Query: 1230 GSISALKELCMMEGLAVSFQSPPAPVSTNFVQKDKVHAQVEIDGQVFGKGIGLTWDEAKV 1409
             SISALKELCMMEGLA SFQSPPA  ST+  QKD+VHAQVEIDGQ+FGKG G+TW+EAK+
Sbjct: 840  DSISALKELCMMEGLAASFQSPPASASTHLTQKDEVHAQVEIDGQIFGKGFGVTWEEAKM 899

Query: 1410 QAAEKALGSLRTVLDQGTQKRQRSPRSS 1493
            QAA+KALGSLRT+ +QG+ KR  SPR +
Sbjct: 900  QAAKKALGSLRTMFNQGSLKRHGSPREN 927


>KRH21482.1 hypothetical protein GLYMA_13G241400 [Glycine max]
          Length = 660

 Score =  688 bits (1776), Expect = 0.0
 Identities = 366/519 (70%), Positives = 409/519 (78%), Gaps = 3/519 (0%)
 Frame = +3

Query: 3    NYLVSEGDASASNGNKDSLKFDGMADAEVERRLKDAISASSTIPAMTSNLDPRLAFASSI 182
            NYLVSE DASASNGNK+ L FDGMADAEVERRLKDAISASSTI A+T+N+DPRLAF SS+
Sbjct: 141  NYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTILALTANIDPRLAFTSSL 200

Query: 183  QYTMATSS-TVPPSTAQAAGMQFGNLQSPQSTALVKPMGQIVPLEHSLHSSPAREEGEVP 359
            QYTM +SS TVPP TAQA+ +QFGN+Q PQ   LVKPM Q+     SLHSSPAREEGE+P
Sbjct: 201  QYTMVSSSGTVPPPTAQASVVQFGNVQFPQPNTLVKPMSQVTHPGLSLHSSPAREEGELP 260

Query: 360  ESELDPDTRRRLLILQHGKDTREHTSVEPPFPIRHPIQASPP--HVPSRGGWLSVEENLG 533
            ESELD DTRRR LILQHG+DTRE  + EPPFP+RHP Q S P   VPSR GW SVEE +G
Sbjct: 261  ESELDLDTRRRFLILQHGQDTRERMASEPPFPVRHPAQVSAPASSVPSRRGWFSVEEEMG 320

Query: 534  TQQLSQVIPKEFPIDSEPLRIEKHWPHHPSLSSKVDNSIPSDRIFHENLRRLPKEVHHHT 713
             QQL+  +PKEFP+DSEP  IEK WP HPS  SKV +     R+               +
Sbjct: 321  PQQLNLPVPKEFPVDSEPFHIEKRWPRHPSFFSKVHHRDDRSRL-------------SQS 367

Query: 714  LSSYHSFSGDVIPLGSSSSNNMDLDSESERSLFHADSTDGVLREIALKCGTKVEFWSSLV 893
            LSSYHS  GD IPL  SS +N D DSES RSLFHAD+T GVL+EIAL CGTKVEF SSLV
Sbjct: 368  LSSYHSLPGDDIPLSGSSYSNRDFDSESGRSLFHADTTAGVLQEIALNCGTKVEFLSSLV 427

Query: 894  ASTELQFSIEAWFAGKKIGEGIGRTRREAQHKAAEYSIKQLADIYLSRAKDDPGSTYGEV 1073
            ASTELQFSIEAWFAGKKIGEG GRTRREAQ KAA  SIKQLADIY+S AKDD GSTYG+V
Sbjct: 428  ASTELQFSIEAWFAGKKIGEGFGRTRREAQSKAAGCSIKQLADIYMSHAKDDSGSTYGDV 487

Query: 1074 SGFHGSHENGFVGSVNSLVNDLLPKEESVPFSTLSEPSRNLDPRLEVSKRSMGSISALKE 1253
            SGFHGS+ +GFV S NSL N LLPKEES  FST SE SR  D RLEVSKRS  SISALKE
Sbjct: 488  SGFHGSNNDGFVSSGNSLGNQLLPKEESGSFSTASESSRVSDSRLEVSKRSTDSISALKE 547

Query: 1254 LCMMEGLAVSFQSPPAPVSTNFVQKDKVHAQVEIDGQVFGKGIGLTWDEAKVQAAEKALG 1433
            LCMMEGLA SFQSPPA  ST+  QKD+VHAQVEIDGQ+FGKG G+TW+EAK+QAA+KALG
Sbjct: 548  LCMMEGLAASFQSPPASASTHLTQKDEVHAQVEIDGQIFGKGFGVTWEEAKMQAAKKALG 607

Query: 1434 SLRTVLDQGTQKRQRSPRSSLQGLSNKRLRQDNSRTMQR 1550
            SLRT+ +QG+ KR  SPR S+QGL+NKRL+ +   T+QR
Sbjct: 608  SLRTMFNQGSLKRHGSPR-SMQGLANKRLKPEYPPTLQR 645


>XP_015956482.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X1 [Arachis duranensis]
          Length = 962

 Score =  698 bits (1801), Expect = 0.0
 Identities = 366/525 (69%), Positives = 423/525 (80%), Gaps = 9/525 (1%)
 Frame = +3

Query: 3    NYLVSEGDASASNGNKDSLKFDGMADAEVERRLKDAISASSTIPAMTSNLDPRLAFASSI 182
            NYLV+E DASA NGN+D L FDGMADAEVERRLKDAISA S IPA+T+NLDPRL   SS+
Sbjct: 428  NYLVAEDDASALNGNRDPLSFDGMADAEVERRLKDAISAVSAIPAITANLDPRLT--SSL 485

Query: 183  QYTMATSSTVPPSTAQAAGMQFGNLQSPQSTALVKPMGQIVPLEHSLHSSPAREEGEVPE 362
            QYTMA+S + P   AQA+ MQF ++Q PQ   LVKPM Q  P E SLH SPAREEGEVPE
Sbjct: 486  QYTMASSGSGPLPAAQASMMQFPSVQYPQQATLVKPMVQTAPSEPSLHGSPAREEGEVPE 545

Query: 363  SELDPDTRRRLLILQHGKDTREHTSVEPPFPIRHPIQAS-PPHVPSRGGWLSVEENLGTQ 539
            SELDPDTRRRLLILQHG+DTREHT  EP FP+R P+Q S PP VP RGGW  VEE +G Q
Sbjct: 546  SELDPDTRRRLLILQHGQDTREHTPSEPSFPVRQPVQVSAPPRVPPRGGWFPVEEEIGPQ 605

Query: 540  QLSQVIPKEFPIDSEPLRIEKHWPHHPSLSSKVDNSIPSDRIFHENLRRLPKEVHHH--- 710
            QL++VIP+EFP+D+EPLRIEKH P HP    KVDNS+  DR+ HE+ +RLPKE++H    
Sbjct: 606  QLNRVIPREFPVDTEPLRIEKHRPPHPPFFPKVDNSVSPDRVLHESHQRLPKEMYHRDDR 665

Query: 711  -----TLSSYHSFSGDVIPLGSSSSNNMDLDSESERSLFHADSTDGVLREIALKCGTKVE 875
                   SSYHSFSGD   L  SSS++ DL+SES   L  AD+  GVL+EIALKCGTKVE
Sbjct: 666  TRLSQAPSSYHSFSGDDNSLSRSSSSHKDLESESHSPL-SADTPVGVLQEIALKCGTKVE 724

Query: 876  FWSSLVASTELQFSIEAWFAGKKIGEGIGRTRREAQHKAAEYSIKQLADIYLSRAKDDPG 1055
            F S LVASTELQFSIEAWF+G+K+GEG GR+R+EAQH+AAE+SIKQLADIYLSRAK + G
Sbjct: 725  FKSCLVASTELQFSIEAWFSGRKVGEGFGRSRKEAQHRAAEHSIKQLADIYLSRAKAETG 784

Query: 1056 STYGEVSGFHGSHENGFVGSVNSLVNDLLPKEESVPFSTLSEPSRNLDPRLEVSKRSMGS 1235
            STYG+VSGF  +++NG+VG++NS+ N  L KEES  FST S+PSR LDPRLEVSKRSMGS
Sbjct: 785  STYGDVSGFQ-ANDNGYVGNINSIGNQPLSKEESFSFSTASDPSRVLDPRLEVSKRSMGS 843

Query: 1236 ISALKELCMMEGLAVSFQSPPAPVSTNFVQKDKVHAQVEIDGQVFGKGIGLTWDEAKVQA 1415
            +SALKELCMMEGL VSFQSPPAPVS N +QKD++HAQVEIDGQVFG+GIGLTWDEAK+QA
Sbjct: 844  VSALKELCMMEGLGVSFQSPPAPVSLNPIQKDEIHAQVEIDGQVFGEGIGLTWDEAKMQA 903

Query: 1416 AEKALGSLRTVLDQGTQKRQRSPRSSLQGLSNKRLRQDNSRTMQR 1550
            AEKALGSLRT+L Q   KRQ SPR  + GL NKRL+ D  RT+QR
Sbjct: 904  AEKALGSLRTMLGQSIPKRQGSPR-PVHGLPNKRLKHDYPRTLQR 947


>XP_016189791.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X1 [Arachis ipaensis]
          Length = 962

 Score =  696 bits (1795), Expect = 0.0
 Identities = 364/525 (69%), Positives = 423/525 (80%), Gaps = 9/525 (1%)
 Frame = +3

Query: 3    NYLVSEGDASASNGNKDSLKFDGMADAEVERRLKDAISASSTIPAMTSNLDPRLAFASSI 182
            NYLV+E DASA NGN+D L FDGMADAEVERRLKDAISA S IP++T+NLDPRLA  SS+
Sbjct: 428  NYLVAEDDASALNGNRDPLSFDGMADAEVERRLKDAISAVSAIPSITANLDPRLA--SSL 485

Query: 183  QYTMATSSTVPPSTAQAAGMQFGNLQSPQSTALVKPMGQIVPLEHSLHSSPAREEGEVPE 362
            QYTMA+S + P   AQA+ MQF ++Q PQ   LVKPM Q  P E SLH SPAREEGEVPE
Sbjct: 486  QYTMASSGSGPLPAAQASMMQFPSVQYPQQATLVKPMVQTAPSEPSLHGSPAREEGEVPE 545

Query: 363  SELDPDTRRRLLILQHGKDTREHTSVEPPFPIRHPIQASPP-HVPSRGGWLSVEENLGTQ 539
            SELDPDTRRRLLILQHG+DTREHT  EP FP+R P+Q S P  VP RGGW  VEE +G Q
Sbjct: 546  SELDPDTRRRLLILQHGQDTREHTPSEPSFPVRQPVQVSAPARVPPRGGWFPVEEEIGPQ 605

Query: 540  QLSQVIPKEFPIDSEPLRIEKHWPHHPSLSSKVDNSIPSDRIFHENLRRLPKEVHHH--- 710
            QL++V+P+EFP+D+EPLRIEKH P HP    KVDNS+  DR+ HE+ +RLPKE++H    
Sbjct: 606  QLNRVVPREFPVDTEPLRIEKHRPPHPPFFPKVDNSVSPDRVLHESHQRLPKEIYHRDDR 665

Query: 711  -----TLSSYHSFSGDVIPLGSSSSNNMDLDSESERSLFHADSTDGVLREIALKCGTKVE 875
                   SSYHSFSGD   L  SSS++ DL+SES   L  AD+  GVL+EIALKCGTKVE
Sbjct: 666  TRLSQAPSSYHSFSGDDNSLSRSSSSHKDLESESHSPL-SADTPVGVLQEIALKCGTKVE 724

Query: 876  FWSSLVASTELQFSIEAWFAGKKIGEGIGRTRREAQHKAAEYSIKQLADIYLSRAKDDPG 1055
            F S LVASTELQFSIEAWF+G+K+GEG GR+R+EAQH+AAE+SIKQLADIYLSRAK + G
Sbjct: 725  FKSCLVASTELQFSIEAWFSGRKVGEGFGRSRKEAQHRAAEHSIKQLADIYLSRAKAETG 784

Query: 1056 STYGEVSGFHGSHENGFVGSVNSLVNDLLPKEESVPFSTLSEPSRNLDPRLEVSKRSMGS 1235
            STYG+VSGF  +++NG+VG++NS+ N  L KEES  FST S+PSR LDPRLEVSKRSMGS
Sbjct: 785  STYGDVSGFQ-ANDNGYVGNINSIGNQPLSKEESFSFSTASDPSRVLDPRLEVSKRSMGS 843

Query: 1236 ISALKELCMMEGLAVSFQSPPAPVSTNFVQKDKVHAQVEIDGQVFGKGIGLTWDEAKVQA 1415
            +SALKELCMMEGL VSFQSPPAPVS N +QKD++HAQVEIDGQVFG+GIGLTWDEAK+QA
Sbjct: 844  VSALKELCMMEGLGVSFQSPPAPVSLNPIQKDEIHAQVEIDGQVFGEGIGLTWDEAKMQA 903

Query: 1416 AEKALGSLRTVLDQGTQKRQRSPRSSLQGLSNKRLRQDNSRTMQR 1550
            AEKALGSLRT+L Q   KRQ SPR  + GL NKRL+ D  RT+QR
Sbjct: 904  AEKALGSLRTMLGQSIPKRQGSPR-PVHGLPNKRLKHDYPRTLQR 947


>XP_017440613.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X1 [Vigna angularis] BAT73753.1 hypothetical
            protein VIGAN_01127800 [Vigna angularis var. angularis]
          Length = 954

 Score =  692 bits (1787), Expect = 0.0
 Identities = 362/526 (68%), Positives = 420/526 (79%), Gaps = 10/526 (1%)
 Frame = +3

Query: 3    NYLVSEGDASA--SNGNKDSLKFDGMADAEVERRLKDAISASSTIPAMTSNLDPRLAFAS 176
            NYLVSE D S+  SNGN+D    DGMADAEVER+LKDA+SA+STIP  T+NLDPRL   +
Sbjct: 419  NYLVSEDDGSSAISNGNRDPFLLDGMADAEVERKLKDALSAASTIPVTTANLDPRL---T 475

Query: 177  SIQYTMATSSTVPPSTAQAAGMQFGNLQSPQSTALVKPMGQIVPLEHSLHSSPAREEGEV 356
            S+QYTM++ S VPP TAQA+ M F ++Q PQ  ALVKPMGQ  P E SLH SPAREEGEV
Sbjct: 476  SLQYTMSSGS-VPPPTAQASMMPFTHVQFPQPAALVKPMGQAAPSESSLHGSPAREEGEV 534

Query: 357  PESELDPDTRRRLLILQHGKDTREHTSVEPPFPIRHPIQASPPHVPSRGGWLSVEENLGT 536
            PESELDPDTRRRLLILQHG+DTR+H S EP +PIRHP+  S P V SRGGW   EE++G+
Sbjct: 535  PESELDPDTRRRLLILQHGQDTRDHASTEPTYPIRHPMPVSAPRVSSRGGWFPAEEDIGS 594

Query: 537  QQLSQVIPKEFPIDSEPLRIEKHWPHHPSLSSKVDNSIPSDRIFHENLRRLPKEVHH--- 707
            Q L++V+ KEF +DS PL IEKH PHHPS  SKV++SI SDRI H++ +RLPKE++H   
Sbjct: 595  QPLNRVVSKEFSVDSGPLGIEKHRPHHPSFFSKVESSISSDRILHDSHQRLPKEMYHRDD 654

Query: 708  -----HTLSSYHSFSGDVIPLGSSSSNNMDLDSESERSLFHADSTDGVLREIALKCGTKV 872
                 H LSSY S SGD +P   SSS++ DLD+ES  S+FHAD+   VL+EIALKCGTKV
Sbjct: 655  RPRSNHMLSSYRSLSGDELPFSRSSSSHRDLDTESGNSVFHADTPVVVLQEIALKCGTKV 714

Query: 873  EFWSSLVASTELQFSIEAWFAGKKIGEGIGRTRREAQHKAAEYSIKQLADIYLSRAKDDP 1052
            EF SSLVAS ELQFSIEAWF+GKKIG G GRTR+EAQHKAAE SIK LADIYLS AKD+P
Sbjct: 715  EFMSSLVASAELQFSIEAWFSGKKIGHGFGRTRKEAQHKAAEDSIKHLADIYLSSAKDEP 774

Query: 1053 GSTYGEVSGFHGSHENGFVGSVNSLVNDLLPKEESVPFSTLSEPSRNLDPRLEVSKRSMG 1232
            GSTYG+V GF  S++NG++   +SL N  LPKE+S  F T S+PSR LDPRLEVSKR MG
Sbjct: 775  GSTYGDVGGFPNSNDNGYMVIASSLSNQSLPKEDSASFLTASDPSRVLDPRLEVSKRPMG 834

Query: 1233 SISALKELCMMEGLAVSFQSPPAPVSTNFVQKDKVHAQVEIDGQVFGKGIGLTWDEAKVQ 1412
            SISALKELCM+EGL V+F S PAPVSTN +QKD+VHAQVEIDG+VFGKGIGLTWDEAK+Q
Sbjct: 835  SISALKELCMIEGLGVNFLSAPAPVSTNSLQKDEVHAQVEIDGKVFGKGIGLTWDEAKMQ 894

Query: 1413 AAEKALGSLRTVLDQGTQKRQRSPRSSLQGLSNKRLRQDNSRTMQR 1550
            AAEKALGSLR+ L Q  QKRQ SPRS  QG SNKRL+Q+  RTMQR
Sbjct: 895  AAEKALGSLRSKLGQSIQKRQSSPRSH-QGFSNKRLKQEYPRTMQR 939


>KOM31067.1 hypothetical protein LR48_Vigan01g062200 [Vigna angularis]
          Length = 964

 Score =  692 bits (1787), Expect = 0.0
 Identities = 362/526 (68%), Positives = 420/526 (79%), Gaps = 10/526 (1%)
 Frame = +3

Query: 3    NYLVSEGDASA--SNGNKDSLKFDGMADAEVERRLKDAISASSTIPAMTSNLDPRLAFAS 176
            NYLVSE D S+  SNGN+D    DGMADAEVER+LKDA+SA+STIP  T+NLDPRL   +
Sbjct: 429  NYLVSEDDGSSAISNGNRDPFLLDGMADAEVERKLKDALSAASTIPVTTANLDPRL---T 485

Query: 177  SIQYTMATSSTVPPSTAQAAGMQFGNLQSPQSTALVKPMGQIVPLEHSLHSSPAREEGEV 356
            S+QYTM++ S VPP TAQA+ M F ++Q PQ  ALVKPMGQ  P E SLH SPAREEGEV
Sbjct: 486  SLQYTMSSGS-VPPPTAQASMMPFTHVQFPQPAALVKPMGQAAPSESSLHGSPAREEGEV 544

Query: 357  PESELDPDTRRRLLILQHGKDTREHTSVEPPFPIRHPIQASPPHVPSRGGWLSVEENLGT 536
            PESELDPDTRRRLLILQHG+DTR+H S EP +PIRHP+  S P V SRGGW   EE++G+
Sbjct: 545  PESELDPDTRRRLLILQHGQDTRDHASTEPTYPIRHPMPVSAPRVSSRGGWFPAEEDIGS 604

Query: 537  QQLSQVIPKEFPIDSEPLRIEKHWPHHPSLSSKVDNSIPSDRIFHENLRRLPKEVHH--- 707
            Q L++V+ KEF +DS PL IEKH PHHPS  SKV++SI SDRI H++ +RLPKE++H   
Sbjct: 605  QPLNRVVSKEFSVDSGPLGIEKHRPHHPSFFSKVESSISSDRILHDSHQRLPKEMYHRDD 664

Query: 708  -----HTLSSYHSFSGDVIPLGSSSSNNMDLDSESERSLFHADSTDGVLREIALKCGTKV 872
                 H LSSY S SGD +P   SSS++ DLD+ES  S+FHAD+   VL+EIALKCGTKV
Sbjct: 665  RPRSNHMLSSYRSLSGDELPFSRSSSSHRDLDTESGNSVFHADTPVVVLQEIALKCGTKV 724

Query: 873  EFWSSLVASTELQFSIEAWFAGKKIGEGIGRTRREAQHKAAEYSIKQLADIYLSRAKDDP 1052
            EF SSLVAS ELQFSIEAWF+GKKIG G GRTR+EAQHKAAE SIK LADIYLS AKD+P
Sbjct: 725  EFMSSLVASAELQFSIEAWFSGKKIGHGFGRTRKEAQHKAAEDSIKHLADIYLSSAKDEP 784

Query: 1053 GSTYGEVSGFHGSHENGFVGSVNSLVNDLLPKEESVPFSTLSEPSRNLDPRLEVSKRSMG 1232
            GSTYG+V GF  S++NG++   +SL N  LPKE+S  F T S+PSR LDPRLEVSKR MG
Sbjct: 785  GSTYGDVGGFPNSNDNGYMVIASSLSNQSLPKEDSASFLTASDPSRVLDPRLEVSKRPMG 844

Query: 1233 SISALKELCMMEGLAVSFQSPPAPVSTNFVQKDKVHAQVEIDGQVFGKGIGLTWDEAKVQ 1412
            SISALKELCM+EGL V+F S PAPVSTN +QKD+VHAQVEIDG+VFGKGIGLTWDEAK+Q
Sbjct: 845  SISALKELCMIEGLGVNFLSAPAPVSTNSLQKDEVHAQVEIDGKVFGKGIGLTWDEAKMQ 904

Query: 1413 AAEKALGSLRTVLDQGTQKRQRSPRSSLQGLSNKRLRQDNSRTMQR 1550
            AAEKALGSLR+ L Q  QKRQ SPRS  QG SNKRL+Q+  RTMQR
Sbjct: 905  AAEKALGSLRSKLGQSIQKRQSSPRSH-QGFSNKRLKQEYPRTMQR 949


>XP_003529311.2 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X1 [Glycine max] KHN07275.1 RNA polymerase II
            C-terminal domain phosphatase-like 1 [Glycine soja]
            KRH50009.1 hypothetical protein GLYMA_07G194800 [Glycine
            max]
          Length = 956

 Score =  691 bits (1783), Expect = 0.0
 Identities = 362/525 (68%), Positives = 420/525 (80%), Gaps = 9/525 (1%)
 Frame = +3

Query: 3    NYLVSEGDASASNGNKDSLKFDGMADAEVERRLKDAISASSTIPAMTSNLDPRLAFASSI 182
            NYLVSE D S SNG++D   FDGMADAEVER+LKDA+SA+STIP  T+NLDPRL   +S+
Sbjct: 422  NYLVSEDDGSISNGHRDPFLFDGMADAEVERKLKDALSAASTIPVTTANLDPRL---TSL 478

Query: 183  QYTMATSSTVPPSTAQAAGMQFGNLQSPQSTALVKPMGQIVPLEHSLHSSPAREEGEVPE 362
            QYTM  S +VPP TAQA+ M F ++Q PQ   LVKPMGQ  P E SLHSSPAREEGEVPE
Sbjct: 479  QYTMVPSGSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSEPSLHSSPAREEGEVPE 538

Query: 363  SELDPDTRRRLLILQHGKDTREHTSVEPPFPIRHPIQASPPHVPS-RGGWLSVEENLGTQ 539
            SELDPDTRRRLLILQHG+DTR+H S EPPFP+RHP+Q S PHVPS RG W   EE +G+Q
Sbjct: 539  SELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQTSAPHVPSSRGVWFPAEEEIGSQ 598

Query: 540  QLSQVIPKEFPIDSEPLRIEKHWPHHPSLSSKVDNSIPSDRIFHENLRRLPKEVHH---- 707
             L++V+PKEFP+DS PL I K  PHHPS  SKV++SI SDRI H++ +RLPKE++H    
Sbjct: 599  PLNRVVPKEFPVDSGPLGIAKPRPHHPSFFSKVESSISSDRILHDSHQRLPKEMYHRDDR 658

Query: 708  ----HTLSSYHSFSGDVIPLGSSSSNNMDLDSESERSLFHADSTDGVLREIALKCGTKVE 875
                H LSSY SFSGD IP   S S++ DLDSES  S+ HAD+   VL+EIALKCGTKV+
Sbjct: 659  PRLNHMLSSYRSFSGDDIPFSRSFSSHRDLDSESGHSVLHADTPVAVLQEIALKCGTKVD 718

Query: 876  FWSSLVASTELQFSIEAWFAGKKIGEGIGRTRREAQHKAAEYSIKQLADIYLSRAKDDPG 1055
            F SSLVASTELQFS+EAWF+GKKIG  +GRTR+EAQ+KAAE SIK LADIYLS AKD+PG
Sbjct: 719  FISSLVASTELQFSMEAWFSGKKIGHRVGRTRKEAQNKAAEDSIKHLADIYLSSAKDEPG 778

Query: 1056 STYGEVSGFHGSHENGFVGSVNSLVNDLLPKEESVPFSTLSEPSRNLDPRLEVSKRSMGS 1235
            STYG+VSGF   +++G++G  +SL N  L KE+S  FST S PSR LDPRL+VSKRSMGS
Sbjct: 779  STYGDVSGFPNVNDSGYMGIASSLGNQPLSKEDSASFSTAS-PSRVLDPRLDVSKRSMGS 837

Query: 1236 ISALKELCMMEGLAVSFQSPPAPVSTNFVQKDKVHAQVEIDGQVFGKGIGLTWDEAKVQA 1415
            IS+LKELCMMEGL V+F S PAPVSTN VQKD+VHAQVEIDG+VFGKGIGLTWDEAK+QA
Sbjct: 838  ISSLKELCMMEGLDVNFLSAPAPVSTNSVQKDEVHAQVEIDGKVFGKGIGLTWDEAKMQA 897

Query: 1416 AEKALGSLRTVLDQGTQKRQRSPRSSLQGLSNKRLRQDNSRTMQR 1550
            AEKALGSLR+ L Q  QKRQ SPR   QG SNKRL+Q+  R MQR
Sbjct: 898  AEKALGSLRSKLGQSIQKRQSSPRPH-QGFSNKRLKQEYPRPMQR 941


>XP_014508623.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            [Vigna radiata var. radiata]
          Length = 954

 Score =  691 bits (1782), Expect = 0.0
 Identities = 363/526 (69%), Positives = 418/526 (79%), Gaps = 10/526 (1%)
 Frame = +3

Query: 3    NYLVSEGDASA--SNGNKDSLKFDGMADAEVERRLKDAISASSTIPAMTSNLDPRLAFAS 176
            NYLVSE D S+  SNGN+D   FDGMADAEVER+LKDA+SA+STIP  T+NLDPRL   +
Sbjct: 419  NYLVSEDDGSSAISNGNRDPFLFDGMADAEVERKLKDALSAASTIPVTTANLDPRL---T 475

Query: 177  SIQYTMATSSTVPPSTAQAAGMQFGNLQSPQSTALVKPMGQIVPLEHSLHSSPAREEGEV 356
            S+QYTM++ S VPP TAQA+ + F ++Q PQ  ALVKPMGQ  P E SLH SPAREEGEV
Sbjct: 476  SLQYTMSSGS-VPPPTAQASMLPFTHVQFPQPAALVKPMGQAAPSESSLHGSPAREEGEV 534

Query: 357  PESELDPDTRRRLLILQHGKDTREHTSVEPPFPIRHPIQASPPHVPSRGGWLSVEENLGT 536
            PESELDPDTRRRLLILQHG+DTR+H S EP +PIRHP+  S P V SRGGW   EE++G+
Sbjct: 535  PESELDPDTRRRLLILQHGQDTRDHASTEPTYPIRHPMPVSAPRVSSRGGWFPAEEDIGS 594

Query: 537  QQLSQVIPKEFPIDSEPLRIEKHWPHHPSLSSKVDNSIPSDRIFHENLRRLPKEVHH--- 707
            Q L++V+PKEF +DS PL IEKH PHHPS  SKV++SI SDRI H++ +RLPKE++H   
Sbjct: 595  QPLNRVVPKEFSVDSGPLGIEKHRPHHPSFFSKVESSISSDRILHDSHQRLPKEMYHRDD 654

Query: 708  -----HTLSSYHSFSGDVIPLGSSSSNNMDLDSESERSLFHADSTDGVLREIALKCGTKV 872
                 H LSSY S SGD +P   SSS++ DLDSES  S FHAD    VL+EIALKCGTKV
Sbjct: 655  RPRSNHMLSSYRSLSGDELPFSRSSSSHRDLDSESGNSGFHADPPVVVLQEIALKCGTKV 714

Query: 873  EFWSSLVASTELQFSIEAWFAGKKIGEGIGRTRREAQHKAAEYSIKQLADIYLSRAKDDP 1052
            EF SSLVAS ELQFSIEAWF+GKKIG G GRTR+EAQHKAAE SIK LADIYLS AKD+P
Sbjct: 715  EFMSSLVASAELQFSIEAWFSGKKIGHGFGRTRKEAQHKAAEDSIKHLADIYLSSAKDEP 774

Query: 1053 GSTYGEVSGFHGSHENGFVGSVNSLVNDLLPKEESVPFSTLSEPSRNLDPRLEVSKRSMG 1232
            GSTYG+V GF  S++NG++   +SL N  L KE+S  FS  S+ SR LDPRLEVSKR MG
Sbjct: 775  GSTYGDVGGFPNSNDNGYMVIASSLSNQSLAKEDSASFSIASDASRVLDPRLEVSKRPMG 834

Query: 1233 SISALKELCMMEGLAVSFQSPPAPVSTNFVQKDKVHAQVEIDGQVFGKGIGLTWDEAKVQ 1412
            SISALKELCMMEGL V+F S PAPVSTN +QKD+VHAQVEIDG+VFGKGIGLTWDEAK+Q
Sbjct: 835  SISALKELCMMEGLGVNFLSAPAPVSTNSLQKDEVHAQVEIDGKVFGKGIGLTWDEAKMQ 894

Query: 1413 AAEKALGSLRTVLDQGTQKRQRSPRSSLQGLSNKRLRQDNSRTMQR 1550
            AAEKALGSLR+ L Q  QKRQ SPRS  QG SNKRL+Q+  RTMQR
Sbjct: 895  AAEKALGSLRSKLGQSIQKRQSSPRSH-QGFSNKRLKQEYPRTMQR 939


>KRH21479.1 hypothetical protein GLYMA_13G241400 [Glycine max]
          Length = 939

 Score =  688 bits (1776), Expect = 0.0
 Identities = 366/519 (70%), Positives = 409/519 (78%), Gaps = 3/519 (0%)
 Frame = +3

Query: 3    NYLVSEGDASASNGNKDSLKFDGMADAEVERRLKDAISASSTIPAMTSNLDPRLAFASSI 182
            NYLVSE DASASNGNK+ L FDGMADAEVERRLKDAISASSTI A+T+N+DPRLAF SS+
Sbjct: 420  NYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTILALTANIDPRLAFTSSL 479

Query: 183  QYTMATSS-TVPPSTAQAAGMQFGNLQSPQSTALVKPMGQIVPLEHSLHSSPAREEGEVP 359
            QYTM +SS TVPP TAQA+ +QFGN+Q PQ   LVKPM Q+     SLHSSPAREEGE+P
Sbjct: 480  QYTMVSSSGTVPPPTAQASVVQFGNVQFPQPNTLVKPMSQVTHPGLSLHSSPAREEGELP 539

Query: 360  ESELDPDTRRRLLILQHGKDTREHTSVEPPFPIRHPIQASPP--HVPSRGGWLSVEENLG 533
            ESELD DTRRR LILQHG+DTRE  + EPPFP+RHP Q S P   VPSR GW SVEE +G
Sbjct: 540  ESELDLDTRRRFLILQHGQDTRERMASEPPFPVRHPAQVSAPASSVPSRRGWFSVEEEMG 599

Query: 534  TQQLSQVIPKEFPIDSEPLRIEKHWPHHPSLSSKVDNSIPSDRIFHENLRRLPKEVHHHT 713
             QQL+  +PKEFP+DSEP  IEK WP HPS  SKV +     R+               +
Sbjct: 600  PQQLNLPVPKEFPVDSEPFHIEKRWPRHPSFFSKVHHRDDRSRL-------------SQS 646

Query: 714  LSSYHSFSGDVIPLGSSSSNNMDLDSESERSLFHADSTDGVLREIALKCGTKVEFWSSLV 893
            LSSYHS  GD IPL  SS +N D DSES RSLFHAD+T GVL+EIAL CGTKVEF SSLV
Sbjct: 647  LSSYHSLPGDDIPLSGSSYSNRDFDSESGRSLFHADTTAGVLQEIALNCGTKVEFLSSLV 706

Query: 894  ASTELQFSIEAWFAGKKIGEGIGRTRREAQHKAAEYSIKQLADIYLSRAKDDPGSTYGEV 1073
            ASTELQFSIEAWFAGKKIGEG GRTRREAQ KAA  SIKQLADIY+S AKDD GSTYG+V
Sbjct: 707  ASTELQFSIEAWFAGKKIGEGFGRTRREAQSKAAGCSIKQLADIYMSHAKDDSGSTYGDV 766

Query: 1074 SGFHGSHENGFVGSVNSLVNDLLPKEESVPFSTLSEPSRNLDPRLEVSKRSMGSISALKE 1253
            SGFHGS+ +GFV S NSL N LLPKEES  FST SE SR  D RLEVSKRS  SISALKE
Sbjct: 767  SGFHGSNNDGFVSSGNSLGNQLLPKEESGSFSTASESSRVSDSRLEVSKRSTDSISALKE 826

Query: 1254 LCMMEGLAVSFQSPPAPVSTNFVQKDKVHAQVEIDGQVFGKGIGLTWDEAKVQAAEKALG 1433
            LCMMEGLA SFQSPPA  ST+  QKD+VHAQVEIDGQ+FGKG G+TW+EAK+QAA+KALG
Sbjct: 827  LCMMEGLAASFQSPPASASTHLTQKDEVHAQVEIDGQIFGKGFGVTWEEAKMQAAKKALG 886

Query: 1434 SLRTVLDQGTQKRQRSPRSSLQGLSNKRLRQDNSRTMQR 1550
            SLRT+ +QG+ KR  SPR S+QGL+NKRL+ +   T+QR
Sbjct: 887  SLRTMFNQGSLKRHGSPR-SMQGLANKRLKPEYPPTLQR 924


>XP_003542763.1 PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            [Glycine max] KRH20483.1 hypothetical protein
            GLYMA_13G181700 [Glycine max]
          Length = 960

 Score =  688 bits (1775), Expect = 0.0
 Identities = 358/525 (68%), Positives = 420/525 (80%), Gaps = 9/525 (1%)
 Frame = +3

Query: 3    NYLVSEGDASASNGNKDSLKFDGMADAEVERRLKDAISASSTIPAMTSNLDPRLAFASSI 182
            NYLVSE D S SNGN+D   FDGMADAEVER+LKDA++A+ST P  T+NLDPRL   +S+
Sbjct: 426  NYLVSEDDGSISNGNRDPFLFDGMADAEVERKLKDALAAASTFPVTTANLDPRL---TSL 482

Query: 183  QYTMATSSTVPPSTAQAAGMQFGNLQSPQSTALVKPMGQIVPLEHSLHSSPAREEGEVPE 362
            QYTM  S +VPP TAQA+ M F ++Q PQ   LVKPMGQ  P + SLHSSPAREEGEVPE
Sbjct: 483  QYTMVPSGSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSDPSLHSSPAREEGEVPE 542

Query: 363  SELDPDTRRRLLILQHGKDTREHTSVEPPFPIRHPIQASPPHVPS-RGGWLSVEENLGTQ 539
            SELDPDTRRRLLILQHG+DTR+H S EPPFP+RHP+QAS P VPS RG W  VEE +G+Q
Sbjct: 543  SELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQASAPRVPSSRGVWFPVEEEIGSQ 602

Query: 540  QLSQVIPKEFPIDSEPLRIEKHWPHHPSLSSKVDNSIPSDRIFHENLRRLPKEVHH---- 707
             L++V+PKEFP+DS PL IEK   HHPS  +KV++SI SDRI H++ +RLPKE++H    
Sbjct: 603  PLNRVVPKEFPVDSGPLGIEKPRLHHPSFFNKVESSISSDRILHDSHQRLPKEMYHRDDR 662

Query: 708  ----HTLSSYHSFSGDVIPLGSSSSNNMDLDSESERSLFHADSTDGVLREIALKCGTKVE 875
                H LSSY SFSGD IP   SSS++ DLDSES  S+ HAD+   VL EIALKCGTKV+
Sbjct: 663  PRLNHMLSSYRSFSGDDIPFSRSSSSHRDLDSESGHSVLHADTPVAVLHEIALKCGTKVD 722

Query: 876  FWSSLVASTELQFSIEAWFAGKKIGEGIGRTRREAQHKAAEYSIKQLADIYLSRAKDDPG 1055
            F SSLVASTEL+FS+EAWF+GKKIG G GRTR+EAQ+KAA+ SI+ LADIYLS AKD+PG
Sbjct: 723  FMSSLVASTELKFSLEAWFSGKKIGHGFGRTRKEAQNKAAKDSIEHLADIYLSSAKDEPG 782

Query: 1056 STYGEVSGFHGSHENGFVGSVNSLVNDLLPKEESVPFSTLSEPSRNLDPRLEVSKRSMGS 1235
            STYG+VSGF   ++NG++G  +SL N  L KE+S  FS+ S PSR LDPRL+VSKRSMGS
Sbjct: 783  STYGDVSGFPNVNDNGYMGIASSLGNQPLSKEDSASFSSAS-PSRALDPRLDVSKRSMGS 841

Query: 1236 ISALKELCMMEGLAVSFQSPPAPVSTNFVQKDKVHAQVEIDGQVFGKGIGLTWDEAKVQA 1415
            ISALKELCMMEGL V+F S PAPVSTN VQKD+VHAQVEIDG++FGKGIGLTWDEAK+QA
Sbjct: 842  ISALKELCMMEGLGVNFLSTPAPVSTNSVQKDEVHAQVEIDGKIFGKGIGLTWDEAKMQA 901

Query: 1416 AEKALGSLRTVLDQGTQKRQRSPRSSLQGLSNKRLRQDNSRTMQR 1550
            AEKALG+LR+ L Q  QK Q SPR   QG SNKRL+Q+  RTMQR
Sbjct: 902  AEKALGNLRSKLGQSIQKMQSSPRPH-QGFSNKRLKQEYPRTMQR 945


>KHN10024.1 RNA polymerase II C-terminal domain phosphatase-like 1 [Glycine soja]
          Length = 961

 Score =  688 bits (1775), Expect = 0.0
 Identities = 358/525 (68%), Positives = 420/525 (80%), Gaps = 9/525 (1%)
 Frame = +3

Query: 3    NYLVSEGDASASNGNKDSLKFDGMADAEVERRLKDAISASSTIPAMTSNLDPRLAFASSI 182
            NYLVSE D S SNGN+D   FDGMADAEVER+LKDA++A+ST P  T+NLDPRL   +S+
Sbjct: 427  NYLVSEDDGSISNGNRDPFLFDGMADAEVERKLKDALAAASTFPVTTANLDPRL---TSL 483

Query: 183  QYTMATSSTVPPSTAQAAGMQFGNLQSPQSTALVKPMGQIVPLEHSLHSSPAREEGEVPE 362
            QYTM  S +VPP TAQA+ M F ++Q PQ   LVKPMGQ  P + SLHSSPAREEGEVPE
Sbjct: 484  QYTMVPSGSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSDPSLHSSPAREEGEVPE 543

Query: 363  SELDPDTRRRLLILQHGKDTREHTSVEPPFPIRHPIQASPPHVPS-RGGWLSVEENLGTQ 539
            SELDPDTRRRLLILQHG+DTR+H S EPPFP+RHP+QAS P VPS RG W  VEE +G+Q
Sbjct: 544  SELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQASAPRVPSSRGVWFPVEEEIGSQ 603

Query: 540  QLSQVIPKEFPIDSEPLRIEKHWPHHPSLSSKVDNSIPSDRIFHENLRRLPKEVHH---- 707
             L++V+PKEFP+DS PL IEK   HHPS  +KV++SI SDRI H++ +RLPKE++H    
Sbjct: 604  PLNRVVPKEFPVDSGPLGIEKPRLHHPSFFNKVESSISSDRILHDSHQRLPKEMYHRDDR 663

Query: 708  ----HTLSSYHSFSGDVIPLGSSSSNNMDLDSESERSLFHADSTDGVLREIALKCGTKVE 875
                H LSSY SFSGD IP   SSS++ DLDSES  S+ HAD+   VL EIALKCGTKV+
Sbjct: 664  PRLNHMLSSYRSFSGDDIPFSRSSSSHRDLDSESGHSVLHADTPVAVLHEIALKCGTKVD 723

Query: 876  FWSSLVASTELQFSIEAWFAGKKIGEGIGRTRREAQHKAAEYSIKQLADIYLSRAKDDPG 1055
            F SSLVASTEL+FS+EAWF+GKKIG G GRTR+EAQ+KAA+ SI+ LADIYLS AKD+PG
Sbjct: 724  FMSSLVASTELKFSLEAWFSGKKIGHGFGRTRKEAQNKAAKDSIEHLADIYLSSAKDEPG 783

Query: 1056 STYGEVSGFHGSHENGFVGSVNSLVNDLLPKEESVPFSTLSEPSRNLDPRLEVSKRSMGS 1235
            STYG+VSGF   ++NG++G  +SL N  L KE+S  FS+ S PSR LDPRL+VSKRSMGS
Sbjct: 784  STYGDVSGFPNVNDNGYMGIASSLGNQPLSKEDSASFSSAS-PSRALDPRLDVSKRSMGS 842

Query: 1236 ISALKELCMMEGLAVSFQSPPAPVSTNFVQKDKVHAQVEIDGQVFGKGIGLTWDEAKVQA 1415
            ISALKELCMMEGL V+F S PAPVSTN VQKD+VHAQVEIDG++FGKGIGLTWDEAK+QA
Sbjct: 843  ISALKELCMMEGLGVNFLSTPAPVSTNSVQKDEVHAQVEIDGKIFGKGIGLTWDEAKMQA 902

Query: 1416 AEKALGSLRTVLDQGTQKRQRSPRSSLQGLSNKRLRQDNSRTMQR 1550
            AEKALG+LR+ L Q  QK Q SPR   QG SNKRL+Q+  RTMQR
Sbjct: 903  AEKALGNLRSKLGQSIQKMQSSPRPH-QGFSNKRLKQEYPRTMQR 946


>KRH10837.1 hypothetical protein GLYMA_15G072000 [Glycine max]
          Length = 911

 Score =  682 bits (1761), Expect = 0.0
 Identities = 361/517 (69%), Positives = 403/517 (77%), Gaps = 1/517 (0%)
 Frame = +3

Query: 3    NYLVSEGDASASNGNKDSLKFDGMADAEVERRLKDAISASSTIPAMTSNLDPRLAFASSI 182
            NYLVSE DASASNGNK+ L FDGMADAEVERRLKDAISASST+PAMT+NLDPRLAF SS+
Sbjct: 421  NYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTVPAMTTNLDPRLAFNSSL 480

Query: 183  QYTMATSS-TVPPSTAQAAGMQFGNLQSPQSTALVKPMGQIVPLEHSLHSSPAREEGEVP 359
            QYTM +SS TVPP TAQA+ +QFGN+Q PQ   LVKP+ Q+ P   SLHSSPAREEGEVP
Sbjct: 481  QYTMVSSSGTVPPPTAQASIVQFGNVQFPQPNTLVKPICQVTPPGPSLHSSPAREEGEVP 540

Query: 360  ESELDPDTRRRLLILQHGKDTREHTSVEPPFPIRHPIQASPPHVPSRGGWLSVEENLGTQ 539
            ESELD DTRRRLLILQHG+DTREHTS EPP P+RHP Q S P VPSR GW SVEE +G Q
Sbjct: 541  ESELDLDTRRRLLILQHGQDTREHTSSEPPLPVRHPTQVSAPSVPSRRGWFSVEEEMGPQ 600

Query: 540  QLSQVIPKEFPIDSEPLRIEKHWPHHPSLSSKVDNSIPSDRIFHENLRRLPKEVHHHTLS 719
            QL+Q++PKEFP+ SEPL IEK WP HPSL SKV +     R+               +LS
Sbjct: 601  QLNQLVPKEFPVGSEPLHIEKRWPRHPSLFSKVHHRDDHSRL-------------SQSLS 647

Query: 720  SYHSFSGDVIPLGSSSSNNMDLDSESERSLFHADSTDGVLREIALKCGTKVEFWSSLVAS 899
            SYHSF GD IPL  SS +N D DSES RSLFHAD T GVL+EIALKCGTKVEF SSLVAS
Sbjct: 648  SYHSFPGDDIPLSGSSYSNRDFDSESGRSLFHADITAGVLQEIALKCGTKVEFLSSLVAS 707

Query: 900  TELQFSIEAWFAGKKIGEGIGRTRREAQHKAAEYSIKQLADIYLSRAKDDPGSTYGEVSG 1079
            T LQFSIEAWFAGKK+GEG GRTRREAQ+KAAE SIKQLADIY+S AKDD GSTYG+VSG
Sbjct: 708  TALQFSIEAWFAGKKVGEGFGRTRREAQNKAAECSIKQLADIYMSHAKDDSGSTYGDVSG 767

Query: 1080 FHGSHENGFVGSVNSLVNDLLPKEESVPFSTLSEPSRNLDPRLEVSKRSMGSISALKELC 1259
            FHGS+ NGFV S                           DPRLEVSKRS  SISALKE C
Sbjct: 768  FHGSNNNGFVSS---------------------------DPRLEVSKRSTDSISALKEFC 800

Query: 1260 MMEGLAVSFQSPPAPVSTNFVQKDKVHAQVEIDGQVFGKGIGLTWDEAKVQAAEKALGSL 1439
            MMEGLA +FQS PAP ST+F QKD+VHAQVEIDGQ+FGKG GLTW+EAK+QAA+KAL SL
Sbjct: 801  MMEGLAANFQSSPAPASTHFAQKDEVHAQVEIDGQIFGKGFGLTWEEAKMQAAKKALESL 860

Query: 1440 RTVLDQGTQKRQRSPRSSLQGLSNKRLRQDNSRTMQR 1550
            RT+ +QGT+KR  SPR S+QGL+NKRL+Q+  RT+QR
Sbjct: 861  RTMFNQGTRKRHGSPR-SMQGLANKRLKQEYPRTLQR 896

