BLASTX nr result

ID: Astragalus23_contig00018107 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00018107
         (3824 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_014508623.1| RNA polymerase II C-terminal domain phosphat...  1534   0.0  
ref|XP_020222315.1| RNA polymerase II C-terminal domain phosphat...  1532   0.0  
ref|XP_017440613.1| PREDICTED: RNA polymerase II C-terminal doma...  1530   0.0  
ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal doma...  1523   0.0  
gb|KOM31067.1| hypothetical protein LR48_Vigan01g062200 [Vigna a...  1522   0.0  
ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal doma...  1520   0.0  
gb|KHN10024.1| RNA polymerase II C-terminal domain phosphatase-l...  1520   0.0  
ref|XP_007159305.1| hypothetical protein PHAVU_002G226900g [Phas...  1513   0.0  
ref|XP_004505032.1| PREDICTED: RNA polymerase II C-terminal doma...  1510   0.0  
ref|XP_013457069.1| double-stranded RNA-binding motif protein [M...  1504   0.0  
gb|KRH20482.1| hypothetical protein GLYMA_13G181700 [Glycine max]    1504   0.0  
ref|XP_006583810.1| PREDICTED: RNA polymerase II C-terminal doma...  1501   0.0  
ref|XP_017440623.1| PREDICTED: RNA polymerase II C-terminal doma...  1474   0.0  
gb|KRH20484.1| hypothetical protein GLYMA_13G181700 [Glycine max]    1459   0.0  
ref|XP_015956482.1| RNA polymerase II C-terminal domain phosphat...  1443   0.0  
ref|XP_016189791.1| RNA polymerase II C-terminal domain phosphat...  1439   0.0  
gb|KRH20485.1| hypothetical protein GLYMA_13G181700 [Glycine max]    1414   0.0  
ref|XP_012572568.1| PREDICTED: RNA polymerase II C-terminal doma...  1405   0.0  
ref|XP_019425587.1| PREDICTED: RNA polymerase II C-terminal doma...  1373   0.0  
ref|XP_019445654.1| PREDICTED: RNA polymerase II C-terminal doma...  1366   0.0  

>ref|XP_014508623.1| RNA polymerase II C-terminal domain phosphatase-like 1 [Vigna radiata
            var. radiata]
          Length = 954

 Score = 1534 bits (3971), Expect = 0.0
 Identities = 777/963 (80%), Positives = 832/963 (86%), Gaps = 4/963 (0%)
 Frame = +3

Query: 459  MYKSVVYQGDVVLGEVDIYPEDNN-KNFNVKEIRISHFSQPSERCPPLAVLHSVTSCGVC 635
            MYKSVVYQG++VLGEV++YPE+NN KNF+VKEIRISHFSQPSERCPPLAVLH+VTSCGVC
Sbjct: 1    MYKSVVYQGELVLGEVEVYPEENNYKNFHVKEIRISHFSQPSERCPPLAVLHTVTSCGVC 60

Query: 636  FKMESKTQQQDQLFHLHSLCIRDNKTAVMPLYGDEIHLVAMHSRNDDRPCFWGFVVAAGL 815
            FKMESKTQQQD LFHLHSLCIR+NKTAV+PL G+EIHLVAMHSRNDDRPCFWGF+VA GL
Sbjct: 61   FKMESKTQQQDGLFHLHSLCIRENKTAVIPLGGEEIHLVAMHSRNDDRPCFWGFIVALGL 120

Query: 816  YNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINLEVDPQRISGMQAEVK 995
            Y+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN EVDPQRISGMQAEVK
Sbjct: 121  YDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISGMQAEVK 180

Query: 996  RYLDDKSILKQYAENDQVFDNGKLIKVQSEIVPALSDNHQPIVRPLIRLHEKNIILTRIN 1175
            RYLDDK+ILKQYAENDQV DNG++IKVQSEIVPALSDNHQPIVRPLIRLH+KNIILTRIN
Sbjct: 181  RYLDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDNHQPIVRPLIRLHDKNIILTRIN 240

Query: 1176 PQIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVFVCTMAERDYALEMWRLLDPGLNLINS 1355
            PQIRDTSVLVRLRPAWEDLRSYL ARGRKRFEV+VCTMAERDYALEMWRLLDP  NLINS
Sbjct: 241  PQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNLINS 300

Query: 1356 RELLGRIVCVKSGLKKSLFNVFQDGSCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYY 1535
            +ELLGRIVCVKSGLKKSLFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYY
Sbjct: 301  KELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYY 360

Query: 1536 APQAEATNTVPVLCVARNVACNVRGGXXXXXXXXLLQKVPQIAYEXXXXXXXXXXXXSNY 1715
            APQAEA+NT+PVLCVARNVACNVRGG        LLQK+PQ+AYE            SNY
Sbjct: 361  APQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQVAYEDDIKDIPTPPDVSNY 420

Query: 1716 LASEDDGST--SNGNRDPLLFDGMADAEVERKLKDALSAASAIPMTTANLDPRLTSSALP 1889
            L SEDDGS+  SNGNRDP LFDGMADAEVERKLKDALSAAS IP+TTANLDPRLTS  L 
Sbjct: 421  LVSEDDGSSAISNGNRDPFLFDGMADAEVERKLKDALSAASTIPVTTANLDPRLTS--LQ 478

Query: 1890 YTMVSSGSVPPPTAQASTMQFSHLQIPQPAALAKPMGQVSPFESSLHSSPAREEGEVPES 2069
            YTM SSGSVPPPTAQAS + F+H+Q PQPAAL KPMGQ +P ESSLH SPAREEGEVPES
Sbjct: 479  YTM-SSGSVPPPTAQASMLPFTHVQFPQPAALVKPMGQAAPSESSLHGSPAREEGEVPES 537

Query: 2070 ELDPDTRRRLLILQHGQDIRDHASSEPPFPAKHPIQASSHPIQASTRVPLRGGWFPVEEE 2249
            ELDPDTRRRLLILQHGQD RDHAS+EP +P +HP+  S      + RV  RGGWFP EE+
Sbjct: 538  ELDPDTRRRLLILQHGQDTRDHASTEPTYPIRHPMPVS------APRVSSRGGWFPAEED 591

Query: 2250 IGSQPVNRAVPKDFPLDSGPLRIEKHRPHHPSFFPKVDSSISSDRILHENQQRMPKEMYH 2429
            IGSQP+NR VPK+F +DSGPL IEKHRPHHPSFF KV+SSISSDRILH++ QR+PKEMYH
Sbjct: 592  IGSQPLNRVVPKEFSVDSGPLGIEKHRPHHPSFFSKVESSISSDRILHDSHQRLPKEMYH 651

Query: 2430 RDDRSRVSHMLSSYHSLPGDDIPFGR-XXXXXXXXXXXXXXXXXADTPAVVLQEIALKCG 2606
            RDDR R +HMLSSY SL GD++PF R                  AD P VVLQEIALKCG
Sbjct: 652  RDDRPRSNHMLSSYRSLSGDELPFSRSSSSHRDLDSESGNSGFHADPPVVVLQEIALKCG 711

Query: 2607 TKVEFTPSLVASAELQFSIEAWFNGKKIGHGFGRTRKEAQHKAAEDSIKHLADMYLSRAK 2786
            TKVEF  SLVASAELQFSIEAWF+GKKIGHGFGRTRKEAQHKAAEDSIKHLAD+YLS AK
Sbjct: 712  TKVEFMSSLVASAELQFSIEAWFSGKKIGHGFGRTRKEAQHKAAEDSIKHLADIYLSSAK 771

Query: 2787 DEPGSTYGDVSGFPNANDNGYVGSVSSLVNQPLPKEESASFSAASDPSRVLDPRLEVSKR 2966
            DEPGSTYGDV GFPN+NDNGY+   SSL NQ L KE+SASFS ASD SRVLDPRLEVSKR
Sbjct: 772  DEPGSTYGDVGGFPNSNDNGYMVIASSLSNQSLAKEDSASFSIASDASRVLDPRLEVSKR 831

Query: 2967 SMGSISALKELCTMEGLGVSFLSLPAQGSTNSVHRDEVHAQVEIDGQVFGKGIGLTWDEA 3146
             MGSISALKELC MEGLGV+FLS PA  STNS+ +DEVHAQVEIDG+VFGKGIGLTWDEA
Sbjct: 832  PMGSISALKELCMMEGLGVNFLSAPAPVSTNSLQKDEVHAQVEIDGKVFGKGIGLTWDEA 891

Query: 3147 KMQAAEKALGSLRTTYXXXXXXXXXXXXXXXXXXNKRLKQEYPRTLQRFPSSARYPRNAP 3326
            KMQAAEKALGSLR+                    NKRLKQEYPRT+QR PSS RYPRNAP
Sbjct: 892  KMQAAEKALGSLRSKLGQSIQKRQSSPRSHQGFSNKRLKQEYPRTMQRIPSSTRYPRNAP 951

Query: 3327 PIP 3335
            PIP
Sbjct: 952  PIP 954


>ref|XP_020222315.1| RNA polymerase II C-terminal domain phosphatase-like 1 [Cajanus
            cajan]
          Length = 956

 Score = 1532 bits (3967), Expect = 0.0
 Identities = 781/964 (81%), Positives = 832/964 (86%), Gaps = 5/964 (0%)
 Frame = +3

Query: 459  MYKSVVYQGDVVLGEVDIYPEDNN-KNFNVKEIRISHFSQPSERCPPLAVLHSVTSCGVC 635
            MYKSVVYQG+VVLGEV++YPE+NN KNF+VKEIRISHFSQPSERC PLAVLH+VTSCGVC
Sbjct: 1    MYKSVVYQGEVVLGEVEVYPEENNYKNFHVKEIRISHFSQPSERCSPLAVLHTVTSCGVC 60

Query: 636  FKMESKTQQQDQLFHLHSLCIRDNKTAVMPLYGDEIHLVAMHSRNDDRPCFWGFVVAAGL 815
            FKMESKTQQQD LFHLHSLCIR+NKTAVMPL G+EIHLVAMHSRNDDRPCFW F+VA GL
Sbjct: 61   FKMESKTQQQDGLFHLHSLCIRENKTAVMPLGGEEIHLVAMHSRNDDRPCFWAFIVALGL 120

Query: 816  YNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINLEVDPQRISGMQAEVK 995
            Y+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRI+ALQRKIN EVDPQRISGMQAEVK
Sbjct: 121  YDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINSEVDPQRISGMQAEVK 180

Query: 996  RYLDDKSILKQYAENDQVFDNGKLIKVQSEIVPALSDNHQPIVRPLIRLHEKNIILTRIN 1175
            RYLDDKSILKQYAENDQV DNG++IK+QSEIVPA+SD+HQPIVRPLIRL +KNIILTRIN
Sbjct: 181  RYLDDKSILKQYAENDQVVDNGRVIKIQSEIVPAISDSHQPIVRPLIRLQDKNIILTRIN 240

Query: 1176 PQIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVFVCTMAERDYALEMWRLLDPGLNLINS 1355
            PQIRDTSVLVRLRPAWEDLRSYL ARGRKRFEV+VCTMAERDYALEMWRLLDP  NLINS
Sbjct: 241  PQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNLINS 300

Query: 1356 RELLGRIVCVKSGLKKSLFNVFQDGSCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYY 1535
            +ELLGRIVCVKSGLKKSLFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYY
Sbjct: 301  KELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYY 360

Query: 1536 APQAEATNTVPVLCVARNVACNVRGGXXXXXXXXLLQKVPQIAYEXXXXXXXXXXXXSNY 1715
            APQAEA+NT+PVLCVARNVACNVRGG        LLQK+ Q+A+E            SNY
Sbjct: 361  APQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKISQVAHEDDIKDIPSPPDVSNY 420

Query: 1716 LASEDDGSTSNGNRDPLLFDGMADAEVERKLKDALSAASAIPMTTANLDPRLTSSALPYT 1895
            L SEDDGS SNGNRDP LFDGMADAEVERKLKDALSAAS IP+TT NLDPRLTS  L YT
Sbjct: 421  LVSEDDGSISNGNRDPFLFDGMADAEVERKLKDALSAASTIPVTTTNLDPRLTS--LQYT 478

Query: 1896 MVSSGSVPPPTAQASTMQFSHLQIPQPA-ALAKPMGQVSPFESSLHSSPAREEGEVPESE 2072
            MVSSGSVPP TAQAS M F H+Q PQPA  L KPMGQ +P ESSLHSSPAREEGEVPESE
Sbjct: 479  MVSSGSVPPSTAQASMMPFPHVQFPQPATTLVKPMGQAAPSESSLHSSPAREEGEVPESE 538

Query: 2073 LDPDTRRRLLILQHGQDIRDHASSEPPFPAKHPIQASSHPIQASTRVPLRGGWFPVEEEI 2252
            LDPDTRRRLLILQHGQD RDHASSEPPFP +HP+Q S   + +S     RGGWFP EEEI
Sbjct: 539  LDPDTRRRLLILQHGQDTRDHASSEPPFPIRHPVQVSVPRVPSS-----RGGWFPGEEEI 593

Query: 2253 GSQPVNRAVPKDFPLDSGPLRIEKHRPHHPSFFPKVDSSISSDRILHENQQRMPKEMYHR 2432
            GSQP+NR VPK+FPLDSGPL IEKHRPHH SFF KV+SSISSDRILH++ QR+PKEMYHR
Sbjct: 594  GSQPLNRVVPKEFPLDSGPLGIEKHRPHHQSFFSKVESSISSDRILHDSHQRLPKEMYHR 653

Query: 2433 DDRSRVSHMLSSYHSLPGDDIPFGR-XXXXXXXXXXXXXXXXXADTPAVVLQEIALKCGT 2609
            D+R R++H LSSY  L GDDIPF R                  ADTPAVVLQEIALKCGT
Sbjct: 654  DERPRLNHALSSYR-LSGDDIPFSRSSSSHRDLDSESSHSVLHADTPAVVLQEIALKCGT 712

Query: 2610 KVEFTPSLVASAELQFSIEAWFNGKKIGHGFGRTRKEAQHKAAEDSIKHLADMYLSRAKD 2789
            KVEF  SLV+SAELQFSIEAWF+GKK+GHGFGRTRKEAQHKAAEDSIKHLAD+YLS AKD
Sbjct: 713  KVEFMSSLVSSAELQFSIEAWFSGKKVGHGFGRTRKEAQHKAAEDSIKHLADIYLSSAKD 772

Query: 2790 EPGSTYGDVSGFPNANDNGYVGSVSSLVNQPLPKEESASFSAASDPSRVLDPRLEVSKRS 2969
            EPGSTYGDVSGFPNANDNGY+G  SSL NQ LPKE+SASFS ASDPSRVLDPRLEVSKRS
Sbjct: 773  EPGSTYGDVSGFPNANDNGYMGMTSSLGNQSLPKEDSASFSTASDPSRVLDPRLEVSKRS 832

Query: 2970 MGSISALKELCTMEGLGVSFLSLPAQGSTNSVHRDEVHAQVEIDGQVFGKGIGLTWDEAK 3149
            MGSISALKELC MEGLGV+FL  PA  STNSV +DEVHAQVEIDG+VFGKGIGLTWDEA+
Sbjct: 833  MGSISALKELCMMEGLGVNFLPTPAPVSTNSVQKDEVHAQVEIDGKVFGKGIGLTWDEAR 892

Query: 3150 MQAAEKALGSLRTT--YXXXXXXXXXXXXXXXXXXNKRLKQEYPRTLQRFPSSARYPRNA 3323
             QAAEKALGSLR+                      NKRLKQEYPRTLQR PSSARYPRNA
Sbjct: 893  TQAAEKALGSLRSKLGQSIQSQRRQNSPRSHQGFSNKRLKQEYPRTLQRVPSSARYPRNA 952

Query: 3324 PPIP 3335
            PPIP
Sbjct: 953  PPIP 956


>ref|XP_017440613.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X1 [Vigna angularis]
 dbj|BAT73753.1| hypothetical protein VIGAN_01127800 [Vigna angularis var. angularis]
          Length = 954

 Score = 1530 bits (3962), Expect = 0.0
 Identities = 775/963 (80%), Positives = 831/963 (86%), Gaps = 4/963 (0%)
 Frame = +3

Query: 459  MYKSVVYQGDVVLGEVDIYPEDNN-KNFNVKEIRISHFSQPSERCPPLAVLHSVTSCGVC 635
            MYKSVVYQG++VLGEV++YPE+NN KNF++KEIRISHFSQPSERCPPLAVLH+VTSCGVC
Sbjct: 1    MYKSVVYQGELVLGEVEVYPEENNYKNFHLKEIRISHFSQPSERCPPLAVLHTVTSCGVC 60

Query: 636  FKMESKTQQQDQLFHLHSLCIRDNKTAVMPLYGDEIHLVAMHSRNDDRPCFWGFVVAAGL 815
            FKMESKTQQQD LF LHSLCIR+NKTAV+PL G+EIHLVAMHSRNDDRPCFWGF+VA GL
Sbjct: 61   FKMESKTQQQDGLFQLHSLCIRENKTAVIPLGGEEIHLVAMHSRNDDRPCFWGFIVALGL 120

Query: 816  YNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINLEVDPQRISGMQAEVK 995
            Y+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN EVDPQRISGMQAEVK
Sbjct: 121  YDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISGMQAEVK 180

Query: 996  RYLDDKSILKQYAENDQVFDNGKLIKVQSEIVPALSDNHQPIVRPLIRLHEKNIILTRIN 1175
            RYLDDK+ILKQYAENDQV DNG++IKVQSEIVPALSDNHQPIVRPLIRLH+KNIILTRIN
Sbjct: 181  RYLDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDNHQPIVRPLIRLHDKNIILTRIN 240

Query: 1176 PQIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVFVCTMAERDYALEMWRLLDPGLNLINS 1355
            PQIRDTSVLVRLRPAWEDLRSYL ARGRKRFEV+VCTMAERDYALEMWRLLDP  NLINS
Sbjct: 241  PQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNLINS 300

Query: 1356 RELLGRIVCVKSGLKKSLFNVFQDGSCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYY 1535
            +ELLGRIVCVKSGLKKSLFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYY
Sbjct: 301  KELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYY 360

Query: 1536 APQAEATNTVPVLCVARNVACNVRGGXXXXXXXXLLQKVPQIAYEXXXXXXXXXXXXSNY 1715
            APQAEA+NT+PVLCVARNVACNVRGG        LLQK+PQ+AYE            SNY
Sbjct: 361  APQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQVAYEDDIKDIPTPPDVSNY 420

Query: 1716 LASEDDGST--SNGNRDPLLFDGMADAEVERKLKDALSAASAIPMTTANLDPRLTSSALP 1889
            L SEDDGS+  SNGNRDP L DGMADAEVERKLKDALSAAS IP+TTANLDPRLTS  L 
Sbjct: 421  LVSEDDGSSAISNGNRDPFLLDGMADAEVERKLKDALSAASTIPVTTANLDPRLTS--LQ 478

Query: 1890 YTMVSSGSVPPPTAQASTMQFSHLQIPQPAALAKPMGQVSPFESSLHSSPAREEGEVPES 2069
            YTM SSGSVPPPTAQAS M F+H+Q PQPAAL KPMGQ +P ESSLH SPAREEGEVPES
Sbjct: 479  YTM-SSGSVPPPTAQASMMPFTHVQFPQPAALVKPMGQAAPSESSLHGSPAREEGEVPES 537

Query: 2070 ELDPDTRRRLLILQHGQDIRDHASSEPPFPAKHPIQASSHPIQASTRVPLRGGWFPVEEE 2249
            ELDPDTRRRLLILQHGQD RDHAS+EP +P +HP+  S      + RV  RGGWFP EE+
Sbjct: 538  ELDPDTRRRLLILQHGQDTRDHASTEPTYPIRHPMPVS------APRVSSRGGWFPAEED 591

Query: 2250 IGSQPVNRAVPKDFPLDSGPLRIEKHRPHHPSFFPKVDSSISSDRILHENQQRMPKEMYH 2429
            IGSQP+NR V K+F +DSGPL IEKHRPHHPSFF KV+SSISSDRILH++ QR+PKEMYH
Sbjct: 592  IGSQPLNRVVSKEFSVDSGPLGIEKHRPHHPSFFSKVESSISSDRILHDSHQRLPKEMYH 651

Query: 2430 RDDRSRVSHMLSSYHSLPGDDIPFGR-XXXXXXXXXXXXXXXXXADTPAVVLQEIALKCG 2606
            RDDR R +HMLSSY SL GD++PF R                  ADTP VVLQEIALKCG
Sbjct: 652  RDDRPRSNHMLSSYRSLSGDELPFSRSSSSHRDLDTESGNSVFHADTPVVVLQEIALKCG 711

Query: 2607 TKVEFTPSLVASAELQFSIEAWFNGKKIGHGFGRTRKEAQHKAAEDSIKHLADMYLSRAK 2786
            TKVEF  SLVASAELQFSIEAWF+GKKIGHGFGRTRKEAQHKAAEDSIKHLAD+YLS AK
Sbjct: 712  TKVEFMSSLVASAELQFSIEAWFSGKKIGHGFGRTRKEAQHKAAEDSIKHLADIYLSSAK 771

Query: 2787 DEPGSTYGDVSGFPNANDNGYVGSVSSLVNQPLPKEESASFSAASDPSRVLDPRLEVSKR 2966
            DEPGSTYGDV GFPN+NDNGY+   SSL NQ LPKE+SASF  ASDPSRVLDPRLEVSKR
Sbjct: 772  DEPGSTYGDVGGFPNSNDNGYMVIASSLSNQSLPKEDSASFLTASDPSRVLDPRLEVSKR 831

Query: 2967 SMGSISALKELCTMEGLGVSFLSLPAQGSTNSVHRDEVHAQVEIDGQVFGKGIGLTWDEA 3146
             MGSISALKELC +EGLGV+FLS PA  STNS+ +DEVHAQVEIDG+VFGKGIGLTWDEA
Sbjct: 832  PMGSISALKELCMIEGLGVNFLSAPAPVSTNSLQKDEVHAQVEIDGKVFGKGIGLTWDEA 891

Query: 3147 KMQAAEKALGSLRTTYXXXXXXXXXXXXXXXXXXNKRLKQEYPRTLQRFPSSARYPRNAP 3326
            KMQAAEKALGSLR+                    NKRLKQEYPRT+QR PSS RYPRNAP
Sbjct: 892  KMQAAEKALGSLRSKLGQSIQKRQSSPRSHQGFSNKRLKQEYPRTMQRIPSSTRYPRNAP 951

Query: 3327 PIP 3335
            PIP
Sbjct: 952  PIP 954


>ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            [Glycine max]
 gb|KRH20483.1| hypothetical protein GLYMA_13G181700 [Glycine max]
          Length = 960

 Score = 1523 bits (3944), Expect = 0.0
 Identities = 770/968 (79%), Positives = 830/968 (85%), Gaps = 9/968 (0%)
 Frame = +3

Query: 459  MYKSVVYQGDVVLGEVDIYPEDNN--------KNFNVKEIRISHFSQPSERCPPLAVLHS 614
            MYKSVVYQG+VV+GEVD+YPE+NN        KNF+VKEIRISHFSQPSERCPPLAVLH+
Sbjct: 1    MYKSVVYQGEVVVGEVDVYPEENNNNNNKNYNKNFHVKEIRISHFSQPSERCPPLAVLHT 60

Query: 615  VTSCGVCFKMESKTQQQDQLFHLHSLCIRDNKTAVMPLYGDEIHLVAMHSRNDDRPCFWG 794
            VTSCGVCFKMESKTQQQD LF LHSLCIR+NKTAVMPL G+EIHLVAMHSRNDDRPCFWG
Sbjct: 61   VTSCGVCFKMESKTQQQDGLFQLHSLCIRENKTAVMPLGGEEIHLVAMHSRNDDRPCFWG 120

Query: 795  FVVAAGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINLEVDPQRIS 974
            F+V  GLY+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN EVDPQRIS
Sbjct: 121  FIVTLGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRIS 180

Query: 975  GMQAEVKRYLDDKSILKQYAENDQVFDNGKLIKVQSEIVPALSDNHQPIVRPLIRLHEKN 1154
            GMQAEVKRYLDDK+ILKQYAENDQV DNG++IKVQSEIVPALSD+HQPIVRPLIRL +KN
Sbjct: 181  GMQAEVKRYLDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRLQDKN 240

Query: 1155 IILTRINPQIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVFVCTMAERDYALEMWRLLDP 1334
            IILTRINPQIRDTSVLVRLRPAWEDLRSYL ARGRKRFEV+VCTMAERDYALEMWRLLDP
Sbjct: 241  IILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP 300

Query: 1335 GLNLINSRELLGRIVCVKSGLKKSLFNVFQDGSCHPKMALVIDDRLKVWDEKDQPRVHVV 1514
              NLINS+ELLGRIVCVKSGLKKSLFNVFQDGSC PKMALVIDDRLKVWDE+DQPRVHVV
Sbjct: 301  DSNLINSKELLGRIVCVKSGLKKSLFNVFQDGSCDPKMALVIDDRLKVWDERDQPRVHVV 360

Query: 1515 PAFAPYYAPQAEATNTVPVLCVARNVACNVRGGXXXXXXXXLLQKVPQIAYEXXXXXXXX 1694
            PAFAPYYAPQAEA+NT+PVLCVARNVACNVRGG        LLQK+PQIAYE        
Sbjct: 361  PAFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKDVPS 420

Query: 1695 XXXXSNYLASEDDGSTSNGNRDPLLFDGMADAEVERKLKDALSAASAIPMTTANLDPRLT 1874
                SNYL SEDDGS SNGNRDP LFDGMADAEVERKLKDAL+AAS  P+TTANLDPRLT
Sbjct: 421  PPDVSNYLVSEDDGSISNGNRDPFLFDGMADAEVERKLKDALAAASTFPVTTANLDPRLT 480

Query: 1875 SSALPYTMVSSGSVPPPTAQASTMQFSHLQIPQPAALAKPMGQVSPFESSLHSSPAREEG 2054
            S  L YTMV SGSVPPPTAQAS M F H+Q PQPA L KPMGQ +P + SLHSSPAREEG
Sbjct: 481  S--LQYTMVPSGSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSDPSLHSSPAREEG 538

Query: 2055 EVPESELDPDTRRRLLILQHGQDIRDHASSEPPFPAKHPIQASSHPIQASTRVPLRGGWF 2234
            EVPESELDPDTRRRLLILQHGQD RDHAS+EPPFP +HP+QAS+  + +S     RG WF
Sbjct: 539  EVPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQASAPRVPSS-----RGVWF 593

Query: 2235 PVEEEIGSQPVNRAVPKDFPLDSGPLRIEKHRPHHPSFFPKVDSSISSDRILHENQQRMP 2414
            PVEEEIGSQP+NR VPK+FP+DSGPL IEK R HHPSFF KV+SSISSDRILH++ QR+P
Sbjct: 594  PVEEEIGSQPLNRVVPKEFPVDSGPLGIEKPRLHHPSFFNKVESSISSDRILHDSHQRLP 653

Query: 2415 KEMYHRDDRSRVSHMLSSYHSLPGDDIPFGR-XXXXXXXXXXXXXXXXXADTPAVVLQEI 2591
            KEMYHRDDR R++HMLSSY S  GDDIPF R                  ADTP  VL EI
Sbjct: 654  KEMYHRDDRPRLNHMLSSYRSFSGDDIPFSRSSSSHRDLDSESGHSVLHADTPVAVLHEI 713

Query: 2592 ALKCGTKVEFTPSLVASAELQFSIEAWFNGKKIGHGFGRTRKEAQHKAAEDSIKHLADMY 2771
            ALKCGTKV+F  SLVAS EL+FS+EAWF+GKKIGHGFGRTRKEAQ+KAA+DSI+HLAD+Y
Sbjct: 714  ALKCGTKVDFMSSLVASTELKFSLEAWFSGKKIGHGFGRTRKEAQNKAAKDSIEHLADIY 773

Query: 2772 LSRAKDEPGSTYGDVSGFPNANDNGYVGSVSSLVNQPLPKEESASFSAASDPSRVLDPRL 2951
            LS AKDEPGSTYGDVSGFPN NDNGY+G  SSL NQPL KE+SASFS+AS PSR LDPRL
Sbjct: 774  LSSAKDEPGSTYGDVSGFPNVNDNGYMGIASSLGNQPLSKEDSASFSSAS-PSRALDPRL 832

Query: 2952 EVSKRSMGSISALKELCTMEGLGVSFLSLPAQGSTNSVHRDEVHAQVEIDGQVFGKGIGL 3131
            +VSKRSMGSISALKELC MEGLGV+FLS PA  STNSV +DEVHAQVEIDG++FGKGIGL
Sbjct: 833  DVSKRSMGSISALKELCMMEGLGVNFLSTPAPVSTNSVQKDEVHAQVEIDGKIFGKGIGL 892

Query: 3132 TWDEAKMQAAEKALGSLRTTYXXXXXXXXXXXXXXXXXXNKRLKQEYPRTLQRFPSSARY 3311
            TWDEAKMQAAEKALG+LR+                    NKRLKQEYPRT+QR PSSARY
Sbjct: 893  TWDEAKMQAAEKALGNLRSKLGQSIQKMQSSPRPHQGFSNKRLKQEYPRTMQRMPSSARY 952

Query: 3312 PRNAPPIP 3335
            PRNAPPIP
Sbjct: 953  PRNAPPIP 960


>gb|KOM31067.1| hypothetical protein LR48_Vigan01g062200 [Vigna angularis]
          Length = 964

 Score = 1522 bits (3941), Expect = 0.0
 Identities = 775/973 (79%), Positives = 831/973 (85%), Gaps = 14/973 (1%)
 Frame = +3

Query: 459  MYKSVVYQGDVVLGEVDIYPEDNN-KNFNVKEIRISHFSQPSERCPPLAVLHSVTSCGVC 635
            MYKSVVYQG++VLGEV++YPE+NN KNF++KEIRISHFSQPSERCPPLAVLH+VTSCGVC
Sbjct: 1    MYKSVVYQGELVLGEVEVYPEENNYKNFHLKEIRISHFSQPSERCPPLAVLHTVTSCGVC 60

Query: 636  FKMESKTQQQDQLFHLHSLCIRDNK----------TAVMPLYGDEIHLVAMHSRNDDRPC 785
            FKMESKTQQQD LF LHSLCIR+NK          TAV+PL G+EIHLVAMHSRNDDRPC
Sbjct: 61   FKMESKTQQQDGLFQLHSLCIRENKFSLDFNYVMMTAVIPLGGEEIHLVAMHSRNDDRPC 120

Query: 786  FWGFVVAAGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINLEVDPQ 965
            FWGF+VA GLY+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN EVDPQ
Sbjct: 121  FWGFIVALGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQ 180

Query: 966  RISGMQAEVKRYLDDKSILKQYAENDQVFDNGKLIKVQSEIVPALSDNHQPIVRPLIRLH 1145
            RISGMQAEVKRYLDDK+ILKQYAENDQV DNG++IKVQSEIVPALSDNHQPIVRPLIRLH
Sbjct: 181  RISGMQAEVKRYLDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDNHQPIVRPLIRLH 240

Query: 1146 EKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVFVCTMAERDYALEMWRL 1325
            +KNIILTRINPQIRDTSVLVRLRPAWEDLRSYL ARGRKRFEV+VCTMAERDYALEMWRL
Sbjct: 241  DKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRL 300

Query: 1326 LDPGLNLINSRELLGRIVCVKSGLKKSLFNVFQDGSCHPKMALVIDDRLKVWDEKDQPRV 1505
            LDP  NLINS+ELLGRIVCVKSGLKKSLFNVFQDG CHPKMALVIDDRLKVWDEKDQPRV
Sbjct: 301  LDPDSNLINSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRV 360

Query: 1506 HVVPAFAPYYAPQAEATNTVPVLCVARNVACNVRGGXXXXXXXXLLQKVPQIAYEXXXXX 1685
            HVVPAFAPYYAPQAEA+NT+PVLCVARNVACNVRGG        LLQK+PQ+AYE     
Sbjct: 361  HVVPAFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQVAYEDDIKD 420

Query: 1686 XXXXXXXSNYLASEDDGST--SNGNRDPLLFDGMADAEVERKLKDALSAASAIPMTTANL 1859
                   SNYL SEDDGS+  SNGNRDP L DGMADAEVERKLKDALSAAS IP+TTANL
Sbjct: 421  IPTPPDVSNYLVSEDDGSSAISNGNRDPFLLDGMADAEVERKLKDALSAASTIPVTTANL 480

Query: 1860 DPRLTSSALPYTMVSSGSVPPPTAQASTMQFSHLQIPQPAALAKPMGQVSPFESSLHSSP 2039
            DPRLTS  L YTM SSGSVPPPTAQAS M F+H+Q PQPAAL KPMGQ +P ESSLH SP
Sbjct: 481  DPRLTS--LQYTM-SSGSVPPPTAQASMMPFTHVQFPQPAALVKPMGQAAPSESSLHGSP 537

Query: 2040 AREEGEVPESELDPDTRRRLLILQHGQDIRDHASSEPPFPAKHPIQASSHPIQASTRVPL 2219
            AREEGEVPESELDPDTRRRLLILQHGQD RDHAS+EP +P +HP+  S      + RV  
Sbjct: 538  AREEGEVPESELDPDTRRRLLILQHGQDTRDHASTEPTYPIRHPMPVS------APRVSS 591

Query: 2220 RGGWFPVEEEIGSQPVNRAVPKDFPLDSGPLRIEKHRPHHPSFFPKVDSSISSDRILHEN 2399
            RGGWFP EE+IGSQP+NR V K+F +DSGPL IEKHRPHHPSFF KV+SSISSDRILH++
Sbjct: 592  RGGWFPAEEDIGSQPLNRVVSKEFSVDSGPLGIEKHRPHHPSFFSKVESSISSDRILHDS 651

Query: 2400 QQRMPKEMYHRDDRSRVSHMLSSYHSLPGDDIPFGR-XXXXXXXXXXXXXXXXXADTPAV 2576
             QR+PKEMYHRDDR R +HMLSSY SL GD++PF R                  ADTP V
Sbjct: 652  HQRLPKEMYHRDDRPRSNHMLSSYRSLSGDELPFSRSSSSHRDLDTESGNSVFHADTPVV 711

Query: 2577 VLQEIALKCGTKVEFTPSLVASAELQFSIEAWFNGKKIGHGFGRTRKEAQHKAAEDSIKH 2756
            VLQEIALKCGTKVEF  SLVASAELQFSIEAWF+GKKIGHGFGRTRKEAQHKAAEDSIKH
Sbjct: 712  VLQEIALKCGTKVEFMSSLVASAELQFSIEAWFSGKKIGHGFGRTRKEAQHKAAEDSIKH 771

Query: 2757 LADMYLSRAKDEPGSTYGDVSGFPNANDNGYVGSVSSLVNQPLPKEESASFSAASDPSRV 2936
            LAD+YLS AKDEPGSTYGDV GFPN+NDNGY+   SSL NQ LPKE+SASF  ASDPSRV
Sbjct: 772  LADIYLSSAKDEPGSTYGDVGGFPNSNDNGYMVIASSLSNQSLPKEDSASFLTASDPSRV 831

Query: 2937 LDPRLEVSKRSMGSISALKELCTMEGLGVSFLSLPAQGSTNSVHRDEVHAQVEIDGQVFG 3116
            LDPRLEVSKR MGSISALKELC +EGLGV+FLS PA  STNS+ +DEVHAQVEIDG+VFG
Sbjct: 832  LDPRLEVSKRPMGSISALKELCMIEGLGVNFLSAPAPVSTNSLQKDEVHAQVEIDGKVFG 891

Query: 3117 KGIGLTWDEAKMQAAEKALGSLRTTYXXXXXXXXXXXXXXXXXXNKRLKQEYPRTLQRFP 3296
            KGIGLTWDEAKMQAAEKALGSLR+                    NKRLKQEYPRT+QR P
Sbjct: 892  KGIGLTWDEAKMQAAEKALGSLRSKLGQSIQKRQSSPRSHQGFSNKRLKQEYPRTMQRIP 951

Query: 3297 SSARYPRNAPPIP 3335
            SS RYPRNAPPIP
Sbjct: 952  SSTRYPRNAPPIP 964


>ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X1 [Glycine max]
 gb|KHN07275.1| RNA polymerase II C-terminal domain phosphatase-like 1 [Glycine soja]
 gb|KRH50009.1| hypothetical protein GLYMA_07G194800 [Glycine max]
          Length = 956

 Score = 1520 bits (3936), Expect = 0.0
 Identities = 771/963 (80%), Positives = 826/963 (85%), Gaps = 3/963 (0%)
 Frame = +3

Query: 456  KMYKSVVYQGDVVLGEVDIYPEDNN--KNFNVKEIRISHFSQPSERCPPLAVLHSVTSCG 629
            +MYKSVVYQG+VV+GEVD+YPE+NN  KNF+VKEIRISHFSQPSERCPPLAVLH+VTSCG
Sbjct: 2    RMYKSVVYQGEVVVGEVDVYPEENNNYKNFHVKEIRISHFSQPSERCPPLAVLHTVTSCG 61

Query: 630  VCFKMESKTQQQDQLFHLHSLCIRDNKTAVMPLYGDEIHLVAMHSRNDDRPCFWGFVVAA 809
            VCFKMESKTQQQD LF LHSLCIR+NKTAVMPL G+EIHLVAMHSRN DRPCFWGF+VA 
Sbjct: 62   VCFKMESKTQQQDGLFQLHSLCIRENKTAVMPLGGEEIHLVAMHSRNVDRPCFWGFIVAL 121

Query: 810  GLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINLEVDPQRISGMQAE 989
            GLY+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN EVDPQRISGMQAE
Sbjct: 122  GLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISGMQAE 181

Query: 990  VKRYLDDKSILKQYAENDQVFDNGKLIKVQSEIVPALSDNHQPIVRPLIRLHEKNIILTR 1169
            VKRY DDK+ILKQYAENDQV DNG++IKVQSEIVPALSD+HQPIVRPLIRL +KNIILTR
Sbjct: 182  VKRYQDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRLQDKNIILTR 241

Query: 1170 INPQIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVFVCTMAERDYALEMWRLLDPGLNLI 1349
            INPQIRDTSVLVRLRPAWEDLRSYL ARGRKRFEV+VCTMAERDYALEMWRLLDP  NLI
Sbjct: 242  INPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNLI 301

Query: 1350 NSRELLGRIVCVKSGLKKSLFNVFQDGSCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAP 1529
            NS+ELLGRIVCVKSGLKKSLFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVHVVPAFAP
Sbjct: 302  NSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAP 361

Query: 1530 YYAPQAEATNTVPVLCVARNVACNVRGGXXXXXXXXLLQKVPQIAYEXXXXXXXXXXXXS 1709
            YYAPQAEA+NT+PVLCVARNVACNVRGG        LLQK+PQIAYE            S
Sbjct: 362  YYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKDIPSPPDVS 421

Query: 1710 NYLASEDDGSTSNGNRDPLLFDGMADAEVERKLKDALSAASAIPMTTANLDPRLTSSALP 1889
            NYL SEDDGS SNG+RDP LFDGMADAEVERKLKDALSAAS IP+TTANLDPRLTS  L 
Sbjct: 422  NYLVSEDDGSISNGHRDPFLFDGMADAEVERKLKDALSAASTIPVTTANLDPRLTS--LQ 479

Query: 1890 YTMVSSGSVPPPTAQASTMQFSHLQIPQPAALAKPMGQVSPFESSLHSSPAREEGEVPES 2069
            YTMV SGSVPPPTAQAS M F H+Q PQPA L KPMGQ +P E SLHSSPAREEGEVPES
Sbjct: 480  YTMVPSGSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSEPSLHSSPAREEGEVPES 539

Query: 2070 ELDPDTRRRLLILQHGQDIRDHASSEPPFPAKHPIQASSHPIQASTRVPLRGGWFPVEEE 2249
            ELDPDTRRRLLILQHGQD RDHAS+EPPFP +HP+Q S+  + +S     RG WFP EEE
Sbjct: 540  ELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQTSAPHVPSS-----RGVWFPAEEE 594

Query: 2250 IGSQPVNRAVPKDFPLDSGPLRIEKHRPHHPSFFPKVDSSISSDRILHENQQRMPKEMYH 2429
            IGSQP+NR VPK+FP+DSGPL I K RPHHPSFF KV+SSISSDRILH++ QR+PKEMYH
Sbjct: 595  IGSQPLNRVVPKEFPVDSGPLGIAKPRPHHPSFFSKVESSISSDRILHDSHQRLPKEMYH 654

Query: 2430 RDDRSRVSHMLSSYHSLPGDDIPFGR-XXXXXXXXXXXXXXXXXADTPAVVLQEIALKCG 2606
            RDDR R++HMLSSY S  GDDIPF R                  ADTP  VLQEIALKCG
Sbjct: 655  RDDRPRLNHMLSSYRSFSGDDIPFSRSFSSHRDLDSESGHSVLHADTPVAVLQEIALKCG 714

Query: 2607 TKVEFTPSLVASAELQFSIEAWFNGKKIGHGFGRTRKEAQHKAAEDSIKHLADMYLSRAK 2786
            TKV+F  SLVAS ELQFS+EAWF+GKKIGH  GRTRKEAQ+KAAEDSIKHLAD+YLS AK
Sbjct: 715  TKVDFISSLVASTELQFSMEAWFSGKKIGHRVGRTRKEAQNKAAEDSIKHLADIYLSSAK 774

Query: 2787 DEPGSTYGDVSGFPNANDNGYVGSVSSLVNQPLPKEESASFSAASDPSRVLDPRLEVSKR 2966
            DEPGSTYGDVSGFPN ND+GY+G  SSL NQPL KE+SASFS AS PSRVLDPRL+VSKR
Sbjct: 775  DEPGSTYGDVSGFPNVNDSGYMGIASSLGNQPLSKEDSASFSTAS-PSRVLDPRLDVSKR 833

Query: 2967 SMGSISALKELCTMEGLGVSFLSLPAQGSTNSVHRDEVHAQVEIDGQVFGKGIGLTWDEA 3146
            SMGSIS+LKELC MEGL V+FLS PA  STNSV +DEVHAQVEIDG+VFGKGIGLTWDEA
Sbjct: 834  SMGSISSLKELCMMEGLDVNFLSAPAPVSTNSVQKDEVHAQVEIDGKVFGKGIGLTWDEA 893

Query: 3147 KMQAAEKALGSLRTTYXXXXXXXXXXXXXXXXXXNKRLKQEYPRTLQRFPSSARYPRNAP 3326
            KMQAAEKALGSLR+                    NKRLKQEYPR +QR PSSARYPRNAP
Sbjct: 894  KMQAAEKALGSLRSKLGQSIQKRQSSPRPHQGFSNKRLKQEYPRPMQRMPSSARYPRNAP 953

Query: 3327 PIP 3335
            PIP
Sbjct: 954  PIP 956


>gb|KHN10024.1| RNA polymerase II C-terminal domain phosphatase-like 1 [Glycine soja]
          Length = 961

 Score = 1520 bits (3935), Expect = 0.0
 Identities = 769/969 (79%), Positives = 829/969 (85%), Gaps = 10/969 (1%)
 Frame = +3

Query: 459  MYKSVVYQGDVVLGEVDIYPEDNN---------KNFNVKEIRISHFSQPSERCPPLAVLH 611
            MYKSVVYQG+VV+GEVD+YPE+NN         KNF+VKEIRISHFSQPSERCPPLAVLH
Sbjct: 1    MYKSVVYQGEVVVGEVDVYPEENNNNNNNKNYNKNFHVKEIRISHFSQPSERCPPLAVLH 60

Query: 612  SVTSCGVCFKMESKTQQQDQLFHLHSLCIRDNKTAVMPLYGDEIHLVAMHSRNDDRPCFW 791
            +VTSCGVCFKMESKTQQQ  LF LHSLCIR+NKTAVMPL G+EIHLVAMHSRNDDRPCFW
Sbjct: 61   TVTSCGVCFKMESKTQQQAGLFQLHSLCIRENKTAVMPLGGEEIHLVAMHSRNDDRPCFW 120

Query: 792  GFVVAAGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINLEVDPQRI 971
            GF+V  GLY+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN EVDPQRI
Sbjct: 121  GFIVTLGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRI 180

Query: 972  SGMQAEVKRYLDDKSILKQYAENDQVFDNGKLIKVQSEIVPALSDNHQPIVRPLIRLHEK 1151
            SGMQAEVKRYLDDK+ILKQYAENDQV DNG++IKVQSEIVPALSD+HQPIVRPLIRL +K
Sbjct: 181  SGMQAEVKRYLDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRLQDK 240

Query: 1152 NIILTRINPQIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVFVCTMAERDYALEMWRLLD 1331
            NIILTRINPQIRDTSVLVRLRPAWEDLRSYL ARGRKRFEV+VCTMAERDYALEMWRLLD
Sbjct: 241  NIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLD 300

Query: 1332 PGLNLINSRELLGRIVCVKSGLKKSLFNVFQDGSCHPKMALVIDDRLKVWDEKDQPRVHV 1511
            P  NLINS+ELLGRIVCVKSGLKKSLFNVFQDGSC PKMALVIDDRLKVWDE+DQPRVHV
Sbjct: 301  PDSNLINSKELLGRIVCVKSGLKKSLFNVFQDGSCDPKMALVIDDRLKVWDERDQPRVHV 360

Query: 1512 VPAFAPYYAPQAEATNTVPVLCVARNVACNVRGGXXXXXXXXLLQKVPQIAYEXXXXXXX 1691
            VPAFAPYYAPQAEA+NT+PVLCVARNVACNVRGG        LLQK+PQIAYE       
Sbjct: 361  VPAFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKDVP 420

Query: 1692 XXXXXSNYLASEDDGSTSNGNRDPLLFDGMADAEVERKLKDALSAASAIPMTTANLDPRL 1871
                 SNYL SEDDGS SNGNRDP LFDGMADAEVERKLKDAL+AAS  P+TTANLDPRL
Sbjct: 421  SPPDVSNYLVSEDDGSISNGNRDPFLFDGMADAEVERKLKDALAAASTFPVTTANLDPRL 480

Query: 1872 TSSALPYTMVSSGSVPPPTAQASTMQFSHLQIPQPAALAKPMGQVSPFESSLHSSPAREE 2051
            TS  L YTMV SGSVPPPTAQAS M F H+Q PQPA L KPMGQ +P + SLHSSPAREE
Sbjct: 481  TS--LQYTMVPSGSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSDPSLHSSPAREE 538

Query: 2052 GEVPESELDPDTRRRLLILQHGQDIRDHASSEPPFPAKHPIQASSHPIQASTRVPLRGGW 2231
            GEVPESELDPDTRRRLLILQHGQD RDHAS+EPPFP +HP+QAS+  + +S     RG W
Sbjct: 539  GEVPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQASAPRVPSS-----RGVW 593

Query: 2232 FPVEEEIGSQPVNRAVPKDFPLDSGPLRIEKHRPHHPSFFPKVDSSISSDRILHENQQRM 2411
            FPVEEEIGSQP+NR VPK+FP+DSGPL IEK R HHPSFF KV+SSISSDRILH++ QR+
Sbjct: 594  FPVEEEIGSQPLNRVVPKEFPVDSGPLGIEKPRLHHPSFFNKVESSISSDRILHDSHQRL 653

Query: 2412 PKEMYHRDDRSRVSHMLSSYHSLPGDDIPFGR-XXXXXXXXXXXXXXXXXADTPAVVLQE 2588
            PKEMYHRDDR R++HMLSSY S  GDDIPF R                  ADTP  VL E
Sbjct: 654  PKEMYHRDDRPRLNHMLSSYRSFSGDDIPFSRSSSSHRDLDSESGHSVLHADTPVAVLHE 713

Query: 2589 IALKCGTKVEFTPSLVASAELQFSIEAWFNGKKIGHGFGRTRKEAQHKAAEDSIKHLADM 2768
            IALKCGTKV+F  SLVAS EL+FS+EAWF+GKKIGHGFGRTRKEAQ+KAA+DSI+HLAD+
Sbjct: 714  IALKCGTKVDFMSSLVASTELKFSLEAWFSGKKIGHGFGRTRKEAQNKAAKDSIEHLADI 773

Query: 2769 YLSRAKDEPGSTYGDVSGFPNANDNGYVGSVSSLVNQPLPKEESASFSAASDPSRVLDPR 2948
            YLS AKDEPGSTYGDVSGFPN NDNGY+G  SSL NQPL KE+SASFS+AS PSR LDPR
Sbjct: 774  YLSSAKDEPGSTYGDVSGFPNVNDNGYMGIASSLGNQPLSKEDSASFSSAS-PSRALDPR 832

Query: 2949 LEVSKRSMGSISALKELCTMEGLGVSFLSLPAQGSTNSVHRDEVHAQVEIDGQVFGKGIG 3128
            L+VSKRSMGSISALKELC MEGLGV+FLS PA  STNSV +DEVHAQVEIDG++FGKGIG
Sbjct: 833  LDVSKRSMGSISALKELCMMEGLGVNFLSTPAPVSTNSVQKDEVHAQVEIDGKIFGKGIG 892

Query: 3129 LTWDEAKMQAAEKALGSLRTTYXXXXXXXXXXXXXXXXXXNKRLKQEYPRTLQRFPSSAR 3308
            LTWDEAKMQAAEKALG+LR+                    NKRLKQEYPRT+QR PSSAR
Sbjct: 893  LTWDEAKMQAAEKALGNLRSKLGQSIQKMQSSPRPHQGFSNKRLKQEYPRTMQRMPSSAR 952

Query: 3309 YPRNAPPIP 3335
            YPRNAPPIP
Sbjct: 953  YPRNAPPIP 961


>ref|XP_007159305.1| hypothetical protein PHAVU_002G226900g [Phaseolus vulgaris]
 gb|ESW31299.1| hypothetical protein PHAVU_002G226900g [Phaseolus vulgaris]
          Length = 964

 Score = 1513 bits (3916), Expect = 0.0
 Identities = 770/972 (79%), Positives = 824/972 (84%), Gaps = 13/972 (1%)
 Frame = +3

Query: 459  MYKSVVYQGDVVLGEVDIYPEDNN-KNFNVKEIRISHFSQPSERCPPLAVLHSVTSCGVC 635
            MYKSVVYQG+VVLGEV++YPE+NN KNF+VKEIRISHFSQPSERCPPLAVLH+VTSCGVC
Sbjct: 1    MYKSVVYQGEVVLGEVEVYPEENNYKNFHVKEIRISHFSQPSERCPPLAVLHTVTSCGVC 60

Query: 636  FKMESKTQQQDQLFHLHSLCIRDNKTAVMPLYGDEIHLVAMHSRNDDRPCFWGFVVAAGL 815
            FKMESKTQQQD LFHLHSLCIR+NKTAV+PL G+EIHLVAMHSRNDDRP FWGF+VA GL
Sbjct: 61   FKMESKTQQQDGLFHLHSLCIRENKTAVIPLGGEEIHLVAMHSRNDDRPRFWGFIVALGL 120

Query: 816  YNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINLEVDPQRISGMQAEVK 995
            Y+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN EVDPQRISGMQAEVK
Sbjct: 121  YDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISGMQAEVK 180

Query: 996  RYLDDKSILKQYAENDQVFDNGKLIKVQSEIVPALSDNHQPIVRPLIRLHEKNIILTRIN 1175
            RY +DK+ILKQYAENDQV DNG+++KVQSEIVPALSDNHQPIVRPLIRL +KNIILTRIN
Sbjct: 181  RYQEDKNILKQYAENDQVVDNGRVVKVQSEIVPALSDNHQPIVRPLIRLQDKNIILTRIN 240

Query: 1176 PQIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVFVCTMAERDYALEMWRLLDPGLNLINS 1355
            PQIRDTSVLVRLRPAWEDLRSYL ARGRKRFEV+VCTMAERDYALEMWRLLDP  NLINS
Sbjct: 241  PQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNLINS 300

Query: 1356 RELLGRIVCVKSGLKKSLFNVFQDGSCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYY 1535
            +ELLGRIVCVKSGLKKSLFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYY
Sbjct: 301  KELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYY 360

Query: 1536 APQAEATNTVPVLCVARNVACNVRGGXXXXXXXXLLQKVPQIAYEXXXXXXXXXXXXSNY 1715
            APQAEA+N++PVLCVARNVACNVRGG        LLQK+PQ+AYE            SNY
Sbjct: 361  APQAEASNSIPVLCVARNVACNVRGGFFKEFDDGLLQKIPQVAYEDDIKDIPIPPDVSNY 420

Query: 1716 LASEDDGST--SNGNRDPLLFDGMADAEVERKLK---------DALSAASAIPMTTANLD 1862
            L SEDDGS+  SNGNRDP LFD M DAEVERK K         DALSAAS IP+TTANLD
Sbjct: 421  LVSEDDGSSAISNGNRDPFLFDSMGDAEVERKSKVPTRAPNEHDALSAASTIPVTTANLD 480

Query: 1863 PRLTSSALPYTMVSSGSVPPPTAQASTMQFSHLQIPQPAALAKPMGQVSPFESSLHSSPA 2042
            PRLTS  L Y MVSSGS PPPTAQAS M F+H+Q PQPAAL KPMGQ +P ESSLHSSPA
Sbjct: 481  PRLTS--LQYAMVSSGSAPPPTAQASMMPFTHVQFPQPAALVKPMGQAAPSESSLHSSPA 538

Query: 2043 REEGEVPESELDPDTRRRLLILQHGQDIRDHASSEPPFPAKHPIQASSHPIQASTRVPLR 2222
            REEGEVPESELDPDTRRRLLILQHGQD RDH S+EP +  +HP+  S      + RV  R
Sbjct: 539  REEGEVPESELDPDTRRRLLILQHGQDTRDHTSNEPTYAIRHPVPVS------APRVSSR 592

Query: 2223 GGWFPVEEEIGSQPVNRAVPKDFPLDSGPLRIEKHRPHHPSFFPKVDSSISSDRILHENQ 2402
            GGWFP EE+IGSQP+NR VPK+F +DSG L IEKHRPHHPSFF KV+SSISSDRILH++ 
Sbjct: 593  GGWFPAEEDIGSQPLNRVVPKEFSVDSGSLVIEKHRPHHPSFFSKVESSISSDRILHDSH 652

Query: 2403 QRMPKEMYHRDDRSRVSHMLSSYHSLPGDDIPFGR-XXXXXXXXXXXXXXXXXADTPAVV 2579
            QR+PKEMYHRDDR R +HMLSSY SL  D+IPF R                  ADTP VV
Sbjct: 653  QRLPKEMYHRDDRPRSNHMLSSYRSLSVDEIPFSRSSSSHRDLDSESSHSVFHADTPVVV 712

Query: 2580 LQEIALKCGTKVEFTPSLVASAELQFSIEAWFNGKKIGHGFGRTRKEAQHKAAEDSIKHL 2759
            LQEIALKCGTKVEF  SLVAS ELQFSIEAWF+GKKIGHGFGRTRKEAQHKAAEDSIKHL
Sbjct: 713  LQEIALKCGTKVEFMSSLVASTELQFSIEAWFSGKKIGHGFGRTRKEAQHKAAEDSIKHL 772

Query: 2760 ADMYLSRAKDEPGSTYGDVSGFPNANDNGYVGSVSSLVNQPLPKEESASFSAASDPSRVL 2939
            AD+YLS AKDEPGSTYGDV GFPNANDNGY+   SSL NQPLPKE+SASFS ASDPSRVL
Sbjct: 773  ADIYLSSAKDEPGSTYGDVGGFPNANDNGYMVIASSLSNQPLPKEDSASFSTASDPSRVL 832

Query: 2940 DPRLEVSKRSMGSISALKELCTMEGLGVSFLSLPAQGSTNSVHRDEVHAQVEIDGQVFGK 3119
            DPRLEVSKR MGSISALKELC MEGLGV+FLS PA  STNS+ +DEVHAQVEIDG+VFGK
Sbjct: 833  DPRLEVSKRPMGSISALKELCMMEGLGVNFLSAPAPVSTNSLQKDEVHAQVEIDGKVFGK 892

Query: 3120 GIGLTWDEAKMQAAEKALGSLRTTYXXXXXXXXXXXXXXXXXXNKRLKQEYPRTLQRFPS 3299
            GIGLTWDEAKMQAAEKALGSLR+                    NKRLKQEYPR +QR PS
Sbjct: 893  GIGLTWDEAKMQAAEKALGSLRSKLGQSIQKRQSSPRSHQGFSNKRLKQEYPRAMQRIPS 952

Query: 3300 SARYPRNAPPIP 3335
            S RYPRNAPPIP
Sbjct: 953  STRYPRNAPPIP 964


>ref|XP_004505032.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X1 [Cicer arietinum]
          Length = 951

 Score = 1510 bits (3909), Expect = 0.0
 Identities = 769/961 (80%), Positives = 817/961 (85%), Gaps = 2/961 (0%)
 Frame = +3

Query: 459  MYKSVVYQGDVVLGEVDIYPEDNNKNFNVKEIRISHFSQPSERCPPLAVLHSVTSCGVCF 638
            MYKS+VYQG+VVLGEVDIYPE NN N N KEIRISHF+QPSERC PLAVLH++TS GVCF
Sbjct: 1    MYKSLVYQGEVVLGEVDIYPEVNNNNKNFKEIRISHFTQPSERCLPLAVLHTITSSGVCF 60

Query: 639  KMESKTQQQDQLFHLHSLCIRDNKTAVMPLYGDEIHLVAMHSRNDDRPCFWGFVVAAGLY 818
            KMESKTQQQD LFHLH+LC R+NKTAVMPL G+E+HLVAMHSR++ RPCFWG++V  GLY
Sbjct: 61   KMESKTQQQDPLFHLHNLCFRENKTAVMPLCGEEMHLVAMHSRSNGRPCFWGYIVGMGLY 120

Query: 819  NSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINLEVDPQRISGMQAEVKR 998
            NSCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN EVDPQRISGMQAEVKR
Sbjct: 121  NSCLMMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISGMQAEVKR 180

Query: 999  YLDDKSILKQYAENDQVFDNGKLIKVQSEIVPALSDNHQPIVRPLIRLHEKNIILTRINP 1178
            YL+DKSILKQY ENDQV DNGK++K QSE+VPALSD+HQPIVRPLIRLHEKNIILTRINP
Sbjct: 181  YLEDKSILKQYVENDQVVDNGKVLKAQSELVPALSDSHQPIVRPLIRLHEKNIILTRINP 240

Query: 1179 QIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVFVCTMAERDYALEMWRLLDPGLNLINSR 1358
            QIRDTSVLVRLRPAWEDLRSYL ARGRKRFEV+VCTMAERDYALEMWRLLDP  NLINS+
Sbjct: 241  QIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNLINSK 300

Query: 1359 ELLGRIVCVKSGLKKSLFNVFQDGSCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYA 1538
            ELLGRIVCVKSGLKKSLFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYA
Sbjct: 301  ELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYA 360

Query: 1539 PQAEATNTVPVLCVARNVACNVRGGXXXXXXXXLLQKVPQIAYEXXXXXXXXXXXXSNYL 1718
            PQAEA+NT+PVLCVARNVACNVRGG        LLQK+ QIAYE            SNYL
Sbjct: 361  PQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKISQIAYENNTRDISPAPDVSNYL 420

Query: 1719 ASEDDGSTSNGNRDPLLFDGMADAEVERKLKDALSAASAIPMTTANLDPRLTSSALPYTM 1898
             SEDDGS S  NRDP  FDGMADAEVERKLKDA+SAASAIPMTTA LDPRLTSS L YTM
Sbjct: 421  VSEDDGSASYANRDPFAFDGMADAEVERKLKDAISAASAIPMTTAKLDPRLTSS-LQYTM 479

Query: 1899 VSSGSVPPPTAQASTMQFSHLQIPQPAALAKPMGQVSPFESSLHSSPAREEGEVPESELD 2078
            VS GSV PP AQAS +   H Q PQPA L KP+GQV+P E SLHSSPAREEGEVPESELD
Sbjct: 480  VSPGSVLPPAAQASMIPLPHTQFPQPATLVKPIGQVAPSELSLHSSPAREEGEVPESELD 539

Query: 2079 PDTRRRLLILQHGQDIRDHASSEPPFPAKHPIQASSHPIQASTRVPLRGGWFPVEEEIGS 2258
            PDTRRRLLILQHGQD RDH SSEPPFP KHP+Q S+       RVP RGGWFPVEEEIGS
Sbjct: 540  PDTRRRLLILQHGQDNRDHTSSEPPFPLKHPVQVSA-------RVPPRGGWFPVEEEIGS 592

Query: 2259 QPVNRAVPKDFPLDSGPLRIEKHRPHHPSFFPKVDSSISSDRILHENQQRMPKEMYHRDD 2438
            QP NR +PK+  LDSGP RIEKHR H   FFPKVD SISSDR LHE  QR+PKEMYHRDD
Sbjct: 593  QPPNRVIPKEIALDSGPSRIEKHRLHQQPFFPKVDGSISSDRALHETNQRLPKEMYHRDD 652

Query: 2439 RSRVSHMLSSYHSLPGDDIPFGR-XXXXXXXXXXXXXXXXXADTPAVVLQEIALKCGTKV 2615
            RSRVSHMLSSY SL GDD PFGR                  A+TPA+VLQEIALKCGTKV
Sbjct: 653  RSRVSHMLSSYPSLSGDDTPFGRSSSSHRDFDSESGHSVFNAETPAIVLQEIALKCGTKV 712

Query: 2616 EFTPSLVASAELQFSIEAWFNGKKIGHGFGRTRKEAQHKAAEDSIKHLADMYLSRAKDEP 2795
            EFT SL AS ELQFSIEAWF+GKKIGHGFGRTR EAQ+KAAEDSIKHLAD+YLSRAKDE 
Sbjct: 713  EFTSSLAASRELQFSIEAWFSGKKIGHGFGRTRMEAQYKAAEDSIKHLADIYLSRAKDES 772

Query: 2796 GSTYGDVSGFPNANDNGYVGSVSSLVNQPLPKEESASFSAASDPSRVLDPRLEVSKRSMG 2975
            GS +GDVSGFPNANDNGYVG+VSSL NQPLPKEES SFSAASDPSRVLDPRL+VSKRSMG
Sbjct: 773  GSAFGDVSGFPNANDNGYVGNVSSLGNQPLPKEESVSFSAASDPSRVLDPRLDVSKRSMG 832

Query: 2976 SISALKELCTMEGLGVSFLSLPAQGSTNSVHRDEVHAQVEIDGQVFGKGIGLTWDEAKMQ 3155
            S+SALKELC +EGLGV+FLSLPA  STNSV  DEVHAQVEIDGQV+GKG G+TWDEAKMQ
Sbjct: 833  SVSALKELCMVEGLGVNFLSLPAPVSTNSV--DEVHAQVEIDGQVYGKGTGITWDEAKMQ 890

Query: 3156 AAEKALGSLRTT-YXXXXXXXXXXXXXXXXXXNKRLKQEYPRTLQRFPSSARYPRNAPPI 3332
            AAEKALGSLRTT +                  NKRLKQE+PRTLQRF SS RYPRNAPPI
Sbjct: 891  AAEKALGSLRTTIHGQGIQRRQLSPRPFQGLSNKRLKQEHPRTLQRFASSGRYPRNAPPI 950

Query: 3333 P 3335
            P
Sbjct: 951  P 951


>ref|XP_013457069.1| double-stranded RNA-binding motif protein [Medicago truncatula]
 gb|KEH31100.1| double-stranded RNA-binding motif protein [Medicago truncatula]
          Length = 958

 Score = 1504 bits (3895), Expect = 0.0
 Identities = 763/966 (78%), Positives = 821/966 (84%), Gaps = 7/966 (0%)
 Frame = +3

Query: 459  MYKSVVYQGDVVLGEVDIYPEDNN------KNFNVKEIRISHFSQPSERCPPLAVLHSVT 620
            MYKSVVYQG+V+LGEVDIYPE NN      KNF+VKEIRI+ FSQPSERC PLAVLH++T
Sbjct: 1    MYKSVVYQGEVMLGEVDIYPEVNNNINNKNKNFDVKEIRITQFSQPSERCSPLAVLHTIT 60

Query: 621  SCGVCFKMESKTQQQDQLFHLHSLCIRDNKTAVMPLYGDEIHLVAMHSRNDDRPCFWGFV 800
            +  VCFKMESKTQ Q+QL HLHSLCIR+NKTAVMPLYG+E+HLVAMHSRNDDRPCFWGF+
Sbjct: 61   T--VCFKMESKTQHQNQLLHLHSLCIRENKTAVMPLYGEELHLVAMHSRNDDRPCFWGFI 118

Query: 801  VAAGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINLEVDPQRISGM 980
            VA GLYNS +V+LNLRCLGIVFDLDETL+VANTMRSFEDRIDALQRK+N EVDPQRISGM
Sbjct: 119  VATGLYNSSVVLLNLRCLGIVFDLDETLVVANTMRSFEDRIDALQRKVNSEVDPQRISGM 178

Query: 981  QAEVKRYLDDKSILKQYAENDQVFDNGKLIKVQSEIVPALSDNHQPIVRPLIRLHEKNII 1160
            QAE+KRY +DKSILKQYAENDQV DNGK+IKVQSE+VPALSD+HQPIVRPLIRLHEKNII
Sbjct: 179  QAEIKRYQEDKSILKQYAENDQVVDNGKVIKVQSELVPALSDSHQPIVRPLIRLHEKNII 238

Query: 1161 LTRINPQIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVFVCTMAERDYALEMWRLLDPGL 1340
            LTRINPQIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVFVCTMAERDYALEMWRLLDP  
Sbjct: 239  LTRINPQIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVFVCTMAERDYALEMWRLLDPDS 298

Query: 1341 NLINSRELLGRIVCVKSGLKKSLFNVFQDGSCHPKMALVIDDRLKVWDEKDQPRVHVVPA 1520
            NLIN++ELLGRIVCVKSGLKKSLFNVFQDGSCHPKMALVIDDRLKVWDEKDQPRVHVVPA
Sbjct: 299  NLINAKELLGRIVCVKSGLKKSLFNVFQDGSCHPKMALVIDDRLKVWDEKDQPRVHVVPA 358

Query: 1521 FAPYYAPQAEATNTVPVLCVARNVACNVRGGXXXXXXXXLLQKVPQIAYEXXXXXXXXXX 1700
            FAPYYAPQAEA+NT+PVLCVARNVACNVRGG        LLQK+PQIAYE          
Sbjct: 359  FAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKDIPPAP 418

Query: 1701 XXSNYLASEDDGSTSNGNRDPLLFDGMADAEVERKLKDALSAASAIPMTTANLDPRLTSS 1880
              SNYL SEDDGS S GNRDP LFDGMADAEVERKLKDA+SA SAIPMTTA LDPRLTSS
Sbjct: 419  DVSNYLVSEDDGSASYGNRDPFLFDGMADAEVERKLKDAISATSAIPMTTAKLDPRLTSS 478

Query: 1881 ALPYTMVSSGSVPPPTAQASTMQFSHLQIPQPAALAKPMGQVSPFESSLHSSPAREEGEV 2060
             L YTMVS GSVPPP   AS +Q  H Q  QPA   KPM QV+P ESSLHSSPAREEGEV
Sbjct: 479  -LQYTMVSPGSVPPPAPHASMIQLPHTQFLQPATPVKPMVQVAPLESSLHSSPAREEGEV 537

Query: 2061 PESELDPDTRRRLLILQHGQDIRDHASSEPPFPAKHPIQASSHPIQASTRVPLRGGWFPV 2240
             ESELDPDTRRRLLILQHGQDIRDH SSEPPFP +HP     +P+Q STR P RGGWFPV
Sbjct: 538  GESELDPDTRRRLLILQHGQDIRDHTSSEPPFPVRHP-----NPVQVSTRAPSRGGWFPV 592

Query: 2241 EEEIGSQPVNRAVPKDFPLDSGPLRIEKHRPHHPSFFPKVDSSISSDRILHENQQRMPKE 2420
            EEEIGSQP NR +PK+  +DSGP R+EKHRPH PSFF KVD SISSDR LHE+ QR+PKE
Sbjct: 593  EEEIGSQPPNRVLPKEILVDSGPSRMEKHRPHQPSFFSKVDGSISSDRALHESHQRLPKE 652

Query: 2421 MYHRDDRSRVSHMLSSYHSLPGDDIPFGR-XXXXXXXXXXXXXXXXXADTPAVVLQEIAL 2597
            +YHRDDRSRV+HML SYHSL GDDI FGR                  A+TPA VLQEIAL
Sbjct: 653  IYHRDDRSRVNHMLPSYHSLSGDDILFGRSSSSHRDLDSESGNSVLHAETPAAVLQEIAL 712

Query: 2598 KCGTKVEFTPSLVASAELQFSIEAWFNGKKIGHGFGRTRKEAQHKAAEDSIKHLADMYLS 2777
            KCGTKVE+T SLVAS ELQFS+EAWF+GKK+G G GRTR EA++KAAEDSIKHLAD+YLS
Sbjct: 713  KCGTKVEYTSSLVASRELQFSVEAWFSGKKVGQGIGRTRMEARYKAAEDSIKHLADIYLS 772

Query: 2778 RAKDEPGSTYGDVSGFPNANDNGYVGSVSSLVNQPLPKEESASFSAASDPSRVLDPRLEV 2957
            RAKDEPGS YGDVSGFPNANDNGYVG+VSSL N PLPKEE+ SFSAASD SRVLDPRLEV
Sbjct: 773  RAKDEPGSAYGDVSGFPNANDNGYVGNVSSLGNHPLPKEEAVSFSAASDLSRVLDPRLEV 832

Query: 2958 SKRSMGSISALKELCTMEGLGVSFLSLPAQGSTNSVHRDEVHAQVEIDGQVFGKGIGLTW 3137
            SKRS GS+SALKELC MEGLGV+FLS+PA  STNSV +DEV+AQVEIDGQV+GKG GLTW
Sbjct: 833  SKRSTGSVSALKELCMMEGLGVNFLSVPAPLSTNSVQKDEVYAQVEIDGQVYGKGTGLTW 892

Query: 3138 DEAKMQAAEKALGSLRTTYXXXXXXXXXXXXXXXXXXNKRLKQEYPRTLQRFPSSARYPR 3317
            DEAKMQAAEKALGSLR  +                  NKRLKQE+PRTLQRF SS RYPR
Sbjct: 893  DEAKMQAAEKALGSLRPMHGHSIQRRQSSPRPFQGFSNKRLKQEHPRTLQRFASSGRYPR 952

Query: 3318 NAPPIP 3335
            NAP IP
Sbjct: 953  NAPAIP 958


>gb|KRH20482.1| hypothetical protein GLYMA_13G181700 [Glycine max]
          Length = 933

 Score = 1504 bits (3894), Expect = 0.0
 Identities = 762/967 (78%), Positives = 823/967 (85%), Gaps = 8/967 (0%)
 Frame = +3

Query: 459  MYKSVVYQGDVVLGEVDIYPEDNN--------KNFNVKEIRISHFSQPSERCPPLAVLHS 614
            MYKSVVYQG+VV+GEVD+YPE+NN        KNF+VKEIRISHFSQPSERCPPLAVLH+
Sbjct: 1    MYKSVVYQGEVVVGEVDVYPEENNNNNNKNYNKNFHVKEIRISHFSQPSERCPPLAVLHT 60

Query: 615  VTSCGVCFKMESKTQQQDQLFHLHSLCIRDNKTAVMPLYGDEIHLVAMHSRNDDRPCFWG 794
            VTSCGVCFKMESKTQQQD LF LHSLCIR+NKTAVMPL G+EIHLVAMHSRNDDRPCFWG
Sbjct: 61   VTSCGVCFKMESKTQQQDGLFQLHSLCIRENKTAVMPLGGEEIHLVAMHSRNDDRPCFWG 120

Query: 795  FVVAAGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINLEVDPQRIS 974
            F+V  GLY+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN EVDPQRIS
Sbjct: 121  FIVTLGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRIS 180

Query: 975  GMQAEVKRYLDDKSILKQYAENDQVFDNGKLIKVQSEIVPALSDNHQPIVRPLIRLHEKN 1154
            GMQAEVKRYLDDK+ILKQYAENDQV DNG++IKVQSEIVPALSD+HQPIVRPLIRL +KN
Sbjct: 181  GMQAEVKRYLDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRLQDKN 240

Query: 1155 IILTRINPQIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVFVCTMAERDYALEMWRLLDP 1334
            IILTRINPQIRDTSVLVRLRPAWEDLRSYL ARGRKRFEV+VCTMAERDYALEMWRLLDP
Sbjct: 241  IILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP 300

Query: 1335 GLNLINSRELLGRIVCVKSGLKKSLFNVFQDGSCHPKMALVIDDRLKVWDEKDQPRVHVV 1514
              NLINS+ELLGRIVCVKSGLKKSLFNVFQDGSC PKMALVIDDRLKVWDE+DQPRVHVV
Sbjct: 301  DSNLINSKELLGRIVCVKSGLKKSLFNVFQDGSCDPKMALVIDDRLKVWDERDQPRVHVV 360

Query: 1515 PAFAPYYAPQAEATNTVPVLCVARNVACNVRGGXXXXXXXXLLQKVPQIAYEXXXXXXXX 1694
            PAFAPYYAPQAEA+NT+PVLCVARNVACNVRGG        LLQK+PQIAYE        
Sbjct: 361  PAFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKDVPS 420

Query: 1695 XXXXSNYLASEDDGSTSNGNRDPLLFDGMADAEVERKLKDALSAASAIPMTTANLDPRLT 1874
                SNYL SEDDGS SNGNRDP LFDGMADAEVERKLKDAL+AAS  P+TTANLDPRLT
Sbjct: 421  PPDVSNYLVSEDDGSISNGNRDPFLFDGMADAEVERKLKDALAAASTFPVTTANLDPRLT 480

Query: 1875 SSALPYTMVSSGSVPPPTAQASTMQFSHLQIPQPAALAKPMGQVSPFESSLHSSPAREEG 2054
            S  L YTMV SGSVPPPTAQAS M F H+Q PQPA L KPMGQ +P + SLHSSPAREEG
Sbjct: 481  S--LQYTMVPSGSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSDPSLHSSPAREEG 538

Query: 2055 EVPESELDPDTRRRLLILQHGQDIRDHASSEPPFPAKHPIQASSHPIQASTRVPLRGGWF 2234
            EVPESELDPDTRRRLLILQHGQD RDHAS+EPPFP +HP+QAS+  + +S     RG WF
Sbjct: 539  EVPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQASAPRVPSS-----RGVWF 593

Query: 2235 PVEEEIGSQPVNRAVPKDFPLDSGPLRIEKHRPHHPSFFPKVDSSISSDRILHENQQRMP 2414
            PVEEEIGSQP+NR VPK+FP+DSGPL IEK R HHPSFF KV+SSISSDRILH++ QR+P
Sbjct: 594  PVEEEIGSQPLNRVVPKEFPVDSGPLGIEKPRLHHPSFFNKVESSISSDRILHDSHQRLP 653

Query: 2415 KEMYHRDDRSRVSHMLSSYHSLPGDDIPFGRXXXXXXXXXXXXXXXXXADTPAVVLQEIA 2594
            KEMYHRDDR R++HMLSSY S                           +DTP  VL EIA
Sbjct: 654  KEMYHRDDRPRLNHMLSSYRSF--------------------------SDTPVAVLHEIA 687

Query: 2595 LKCGTKVEFTPSLVASAELQFSIEAWFNGKKIGHGFGRTRKEAQHKAAEDSIKHLADMYL 2774
            LKCGTKV+F  SLVAS EL+FS+EAWF+GKKIGHGFGRTRKEAQ+KAA+DSI+HLAD+YL
Sbjct: 688  LKCGTKVDFMSSLVASTELKFSLEAWFSGKKIGHGFGRTRKEAQNKAAKDSIEHLADIYL 747

Query: 2775 SRAKDEPGSTYGDVSGFPNANDNGYVGSVSSLVNQPLPKEESASFSAASDPSRVLDPRLE 2954
            S AKDEPGSTYGDVSGFPN NDNGY+G  SSL NQPL KE+SASFS+AS PSR LDPRL+
Sbjct: 748  SSAKDEPGSTYGDVSGFPNVNDNGYMGIASSLGNQPLSKEDSASFSSAS-PSRALDPRLD 806

Query: 2955 VSKRSMGSISALKELCTMEGLGVSFLSLPAQGSTNSVHRDEVHAQVEIDGQVFGKGIGLT 3134
            VSKRSMGSISALKELC MEGLGV+FLS PA  STNSV +DEVHAQVEIDG++FGKGIGLT
Sbjct: 807  VSKRSMGSISALKELCMMEGLGVNFLSTPAPVSTNSVQKDEVHAQVEIDGKIFGKGIGLT 866

Query: 3135 WDEAKMQAAEKALGSLRTTYXXXXXXXXXXXXXXXXXXNKRLKQEYPRTLQRFPSSARYP 3314
            WDEAKMQAAEKALG+LR+                    NKRLKQEYPRT+QR PSSARYP
Sbjct: 867  WDEAKMQAAEKALGNLRSKLGQSIQKMQSSPRPHQGFSNKRLKQEYPRTMQRMPSSARYP 926

Query: 3315 RNAPPIP 3335
            RNAPPIP
Sbjct: 927  RNAPPIP 933


>ref|XP_006583810.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X2 [Glycine max]
          Length = 929

 Score = 1501 bits (3886), Expect = 0.0
 Identities = 763/962 (79%), Positives = 819/962 (85%), Gaps = 2/962 (0%)
 Frame = +3

Query: 456  KMYKSVVYQGDVVLGEVDIYPEDNN--KNFNVKEIRISHFSQPSERCPPLAVLHSVTSCG 629
            +MYKSVVYQG+VV+GEVD+YPE+NN  KNF+VKEIRISHFSQPSERCPPLAVLH+VTSCG
Sbjct: 2    RMYKSVVYQGEVVVGEVDVYPEENNNYKNFHVKEIRISHFSQPSERCPPLAVLHTVTSCG 61

Query: 630  VCFKMESKTQQQDQLFHLHSLCIRDNKTAVMPLYGDEIHLVAMHSRNDDRPCFWGFVVAA 809
            VCFKMESKTQQQD LF LHSLCIR+NKTAVMPL G+EIHLVAMHSRN DRPCFWGF+VA 
Sbjct: 62   VCFKMESKTQQQDGLFQLHSLCIRENKTAVMPLGGEEIHLVAMHSRNVDRPCFWGFIVAL 121

Query: 810  GLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINLEVDPQRISGMQAE 989
            GLY+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN EVDPQRISGMQAE
Sbjct: 122  GLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISGMQAE 181

Query: 990  VKRYLDDKSILKQYAENDQVFDNGKLIKVQSEIVPALSDNHQPIVRPLIRLHEKNIILTR 1169
            VKRY DDK+ILKQYAENDQV DNG++IKVQSEIVPALSD+HQPIVRPLIRL +KNIILTR
Sbjct: 182  VKRYQDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRLQDKNIILTR 241

Query: 1170 INPQIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVFVCTMAERDYALEMWRLLDPGLNLI 1349
            INPQIRDTSVLVRLRPAWEDLRSYL ARGRKRFEV+VCTMAERDYALEMWRLLDP  NLI
Sbjct: 242  INPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNLI 301

Query: 1350 NSRELLGRIVCVKSGLKKSLFNVFQDGSCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAP 1529
            NS+ELLGRIVCVKSGLKKSLFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVHVVPAFAP
Sbjct: 302  NSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAP 361

Query: 1530 YYAPQAEATNTVPVLCVARNVACNVRGGXXXXXXXXLLQKVPQIAYEXXXXXXXXXXXXS 1709
            YYAPQAEA+NT+PVLCVARNVACNVRGG        LLQK+PQIAYE            S
Sbjct: 362  YYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKDIPSPPDVS 421

Query: 1710 NYLASEDDGSTSNGNRDPLLFDGMADAEVERKLKDALSAASAIPMTTANLDPRLTSSALP 1889
            NYL SEDDGS SNG+RDP LFDGMADAEVERKLKDALSAAS IP+TTANLDPRLTS  L 
Sbjct: 422  NYLVSEDDGSISNGHRDPFLFDGMADAEVERKLKDALSAASTIPVTTANLDPRLTS--LQ 479

Query: 1890 YTMVSSGSVPPPTAQASTMQFSHLQIPQPAALAKPMGQVSPFESSLHSSPAREEGEVPES 2069
            YTMV SGSVPPPTAQAS M F H+Q PQPA L KPMGQ +P E SLHSSPAREEGEVPES
Sbjct: 480  YTMVPSGSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSEPSLHSSPAREEGEVPES 539

Query: 2070 ELDPDTRRRLLILQHGQDIRDHASSEPPFPAKHPIQASSHPIQASTRVPLRGGWFPVEEE 2249
            ELDPDTRRRLLILQHGQD RDHAS+EPPFP +HP+Q S+  + +S     RG WFP EEE
Sbjct: 540  ELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQTSAPHVPSS-----RGVWFPAEEE 594

Query: 2250 IGSQPVNRAVPKDFPLDSGPLRIEKHRPHHPSFFPKVDSSISSDRILHENQQRMPKEMYH 2429
            IGSQP+NR VPK+FP+DSGPL I K RPHHPSFF KV+SSISSDRILH++ QR+PKEMYH
Sbjct: 595  IGSQPLNRVVPKEFPVDSGPLGIAKPRPHHPSFFSKVESSISSDRILHDSHQRLPKEMYH 654

Query: 2430 RDDRSRVSHMLSSYHSLPGDDIPFGRXXXXXXXXXXXXXXXXXADTPAVVLQEIALKCGT 2609
            RDDR R++HMLSSY S                           +DTP  VLQEIALKCGT
Sbjct: 655  RDDRPRLNHMLSSYRSF--------------------------SDTPVAVLQEIALKCGT 688

Query: 2610 KVEFTPSLVASAELQFSIEAWFNGKKIGHGFGRTRKEAQHKAAEDSIKHLADMYLSRAKD 2789
            KV+F  SLVAS ELQFS+EAWF+GKKIGH  GRTRKEAQ+KAAEDSIKHLAD+YLS AKD
Sbjct: 689  KVDFISSLVASTELQFSMEAWFSGKKIGHRVGRTRKEAQNKAAEDSIKHLADIYLSSAKD 748

Query: 2790 EPGSTYGDVSGFPNANDNGYVGSVSSLVNQPLPKEESASFSAASDPSRVLDPRLEVSKRS 2969
            EPGSTYGDVSGFPN ND+GY+G  SSL NQPL KE+SASFS AS PSRVLDPRL+VSKRS
Sbjct: 749  EPGSTYGDVSGFPNVNDSGYMGIASSLGNQPLSKEDSASFSTAS-PSRVLDPRLDVSKRS 807

Query: 2970 MGSISALKELCTMEGLGVSFLSLPAQGSTNSVHRDEVHAQVEIDGQVFGKGIGLTWDEAK 3149
            MGSIS+LKELC MEGL V+FLS PA  STNSV +DEVHAQVEIDG+VFGKGIGLTWDEAK
Sbjct: 808  MGSISSLKELCMMEGLDVNFLSAPAPVSTNSVQKDEVHAQVEIDGKVFGKGIGLTWDEAK 867

Query: 3150 MQAAEKALGSLRTTYXXXXXXXXXXXXXXXXXXNKRLKQEYPRTLQRFPSSARYPRNAPP 3329
            MQAAEKALGSLR+                    NKRLKQEYPR +QR PSSARYPRNAPP
Sbjct: 868  MQAAEKALGSLRSKLGQSIQKRQSSPRPHQGFSNKRLKQEYPRPMQRMPSSARYPRNAPP 927

Query: 3330 IP 3335
            IP
Sbjct: 928  IP 929


>ref|XP_017440623.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X2 [Vigna angularis]
          Length = 932

 Score = 1474 bits (3816), Expect = 0.0
 Identities = 754/963 (78%), Positives = 809/963 (84%), Gaps = 4/963 (0%)
 Frame = +3

Query: 459  MYKSVVYQGDVVLGEVDIYPEDNN-KNFNVKEIRISHFSQPSERCPPLAVLHSVTSCGVC 635
            MYKSVVYQG++VLGEV++YPE+NN KNF++KEIRISHFSQPSERCPPLAVLH+VTSCGVC
Sbjct: 1    MYKSVVYQGELVLGEVEVYPEENNYKNFHLKEIRISHFSQPSERCPPLAVLHTVTSCGVC 60

Query: 636  FKMESKTQQQDQLFHLHSLCIRDNKTAVMPLYGDEIHLVAMHSRNDDRPCFWGFVVAAGL 815
            FKMESKTQQQD LF LHSLCIR+NKTAV+PL G+EIHLVAMHSRNDDRPCFWGF+VA GL
Sbjct: 61   FKMESKTQQQDGLFQLHSLCIRENKTAVIPLGGEEIHLVAMHSRNDDRPCFWGFIVALGL 120

Query: 816  YNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINLEVDPQRISGMQAEVK 995
            Y+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN EVDPQRISGMQAEVK
Sbjct: 121  YDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISGMQAEVK 180

Query: 996  RYLDDKSILKQYAENDQVFDNGKLIKVQSEIVPALSDNHQPIVRPLIRLHEKNIILTRIN 1175
            RYLDDK+ILKQYAENDQV DNG++IKVQSEIVPALSDNHQPIVRPLIRLH+KNIILTRIN
Sbjct: 181  RYLDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDNHQPIVRPLIRLHDKNIILTRIN 240

Query: 1176 PQIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVFVCTMAERDYALEMWRLLDPGLNLINS 1355
            PQIRDTSVLVRLRPAWEDLRSYL ARGRKRFEV+VCTMAERDYALEMWRLLDP  NLINS
Sbjct: 241  PQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNLINS 300

Query: 1356 RELLGRIVCVKSGLKKSLFNVFQDGSCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYY 1535
            +ELLGRIVCVKSGLKKSLFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYY
Sbjct: 301  KELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYY 360

Query: 1536 APQAEATNTVPVLCVARNVACNVRGGXXXXXXXXLLQKVPQIAYEXXXXXXXXXXXXSNY 1715
            APQAEA+NT+PVLCVARNVACNVRGG        LLQK+PQ+AYE            SNY
Sbjct: 361  APQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQVAYEDDIKDIPTPPDVSNY 420

Query: 1716 LASEDDGST--SNGNRDPLLFDGMADAEVERKLKDALSAASAIPMTTANLDPRLTSSALP 1889
            L SEDDGS+  SNGNRDP L DGMADAEVERKLKDALSAAS IP+TTANLDPRLTS  L 
Sbjct: 421  LVSEDDGSSAISNGNRDPFLLDGMADAEVERKLKDALSAASTIPVTTANLDPRLTS--LQ 478

Query: 1890 YTMVSSGSVPPPTAQASTMQFSHLQIPQPAALAKPMGQVSPFESSLHSSPAREEGEVPES 2069
            YTM SSGSVPPPTAQAS M F+H+Q PQPAAL KPMGQ +P ESSLH SPAREEGEVPES
Sbjct: 479  YTM-SSGSVPPPTAQASMMPFTHVQFPQPAALVKPMGQAAPSESSLHGSPAREEGEVPES 537

Query: 2070 ELDPDTRRRLLILQHGQDIRDHASSEPPFPAKHPIQASSHPIQASTRVPLRGGWFPVEEE 2249
            ELDPDTRRRLLILQHGQD RDHAS+EP +P +HP+  S      + RV  RGGWFP EE+
Sbjct: 538  ELDPDTRRRLLILQHGQDTRDHASTEPTYPIRHPMPVS------APRVSSRGGWFPAEED 591

Query: 2250 IGSQPVNRAVPKDFPLDSGPLRIEKHRPHHPSFFPKVDSSISSDRILHENQQRMPKEMYH 2429
            IGSQP+NR V K+F +DSGPL IEKHRPHHPSFF KV+SSISSDRILH++ QR+PKEMYH
Sbjct: 592  IGSQPLNRVVSKEFSVDSGPLGIEKHRPHHPSFFSKVESSISSDRILHDSHQRLPKEMYH 651

Query: 2430 RDDRSRVSHMLSSYHSLPGDDIPFGR-XXXXXXXXXXXXXXXXXADTPAVVLQEIALKCG 2606
            RDDR R +HMLSSY SL GD++PF R                  ADTP VVLQEIALKCG
Sbjct: 652  RDDRPRSNHMLSSYRSLSGDELPFSRSSSSHRDLDTESGNSVFHADTPVVVLQEIALKCG 711

Query: 2607 TKVEFTPSLVASAELQFSIEAWFNGKKIGHGFGRTRKEAQHKAAEDSIKHLADMYLSRAK 2786
            TKVEF  SLVASAELQFSIEAWF+GKKIGHGFGRTRKEAQHKAAEDSIKHLAD+YLS AK
Sbjct: 712  TKVEFMSSLVASAELQFSIEAWFSGKKIGHGFGRTRKEAQHKAAEDSIKHLADIYLSSAK 771

Query: 2787 DEPGSTYGDVSGFPNANDNGYVGSVSSLVNQPLPKEESASFSAASDPSRVLDPRLEVSKR 2966
            DEPGSTYGDV GFPN+NDNGY+   SSL NQ LPKE+SASF  ASDPSRVLDPRLEVSKR
Sbjct: 772  DEPGSTYGDVGGFPNSNDNGYMVIASSLSNQSLPKEDSASFLTASDPSRVLDPRLEVSKR 831

Query: 2967 SMGSISALKELCTMEGLGVSFLSLPAQGSTNSVHRDEVHAQVEIDGQVFGKGIGLTWDEA 3146
             MGSISALKELC +EGLGV+FLS PA  STNS+ +DEVHA                    
Sbjct: 832  PMGSISALKELCMIEGLGVNFLSAPAPVSTNSLQKDEVHA-------------------- 871

Query: 3147 KMQAAEKALGSLRTTYXXXXXXXXXXXXXXXXXXNKRLKQEYPRTLQRFPSSARYPRNAP 3326
              QAAEKALGSLR+                    NKRLKQEYPRT+QR PSS RYPRNAP
Sbjct: 872  --QAAEKALGSLRSKLGQSIQKRQSSPRSHQGFSNKRLKQEYPRTMQRIPSSTRYPRNAP 929

Query: 3327 PIP 3335
            PIP
Sbjct: 930  PIP 932


>gb|KRH20484.1| hypothetical protein GLYMA_13G181700 [Glycine max]
          Length = 907

 Score = 1459 bits (3776), Expect = 0.0
 Identities = 735/909 (80%), Positives = 792/909 (87%), Gaps = 9/909 (0%)
 Frame = +3

Query: 459  MYKSVVYQGDVVLGEVDIYPEDNN--------KNFNVKEIRISHFSQPSERCPPLAVLHS 614
            MYKSVVYQG+VV+GEVD+YPE+NN        KNF+VKEIRISHFSQPSERCPPLAVLH+
Sbjct: 1    MYKSVVYQGEVVVGEVDVYPEENNNNNNKNYNKNFHVKEIRISHFSQPSERCPPLAVLHT 60

Query: 615  VTSCGVCFKMESKTQQQDQLFHLHSLCIRDNKTAVMPLYGDEIHLVAMHSRNDDRPCFWG 794
            VTSCGVCFKMESKTQQQD LF LHSLCIR+NKTAVMPL G+EIHLVAMHSRNDDRPCFWG
Sbjct: 61   VTSCGVCFKMESKTQQQDGLFQLHSLCIRENKTAVMPLGGEEIHLVAMHSRNDDRPCFWG 120

Query: 795  FVVAAGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINLEVDPQRIS 974
            F+V  GLY+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN EVDPQRIS
Sbjct: 121  FIVTLGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRIS 180

Query: 975  GMQAEVKRYLDDKSILKQYAENDQVFDNGKLIKVQSEIVPALSDNHQPIVRPLIRLHEKN 1154
            GMQAEVKRYLDDK+ILKQYAENDQV DNG++IKVQSEIVPALSD+HQPIVRPLIRL +KN
Sbjct: 181  GMQAEVKRYLDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRLQDKN 240

Query: 1155 IILTRINPQIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVFVCTMAERDYALEMWRLLDP 1334
            IILTRINPQIRDTSVLVRLRPAWEDLRSYL ARGRKRFEV+VCTMAERDYALEMWRLLDP
Sbjct: 241  IILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP 300

Query: 1335 GLNLINSRELLGRIVCVKSGLKKSLFNVFQDGSCHPKMALVIDDRLKVWDEKDQPRVHVV 1514
              NLINS+ELLGRIVCVKSGLKKSLFNVFQDGSC PKMALVIDDRLKVWDE+DQPRVHVV
Sbjct: 301  DSNLINSKELLGRIVCVKSGLKKSLFNVFQDGSCDPKMALVIDDRLKVWDERDQPRVHVV 360

Query: 1515 PAFAPYYAPQAEATNTVPVLCVARNVACNVRGGXXXXXXXXLLQKVPQIAYEXXXXXXXX 1694
            PAFAPYYAPQAEA+NT+PVLCVARNVACNVRGG        LLQK+PQIAYE        
Sbjct: 361  PAFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKDVPS 420

Query: 1695 XXXXSNYLASEDDGSTSNGNRDPLLFDGMADAEVERKLKDALSAASAIPMTTANLDPRLT 1874
                SNYL SEDDGS SNGNRDP LFDGMADAEVERKLKDAL+AAS  P+TTANLDPRLT
Sbjct: 421  PPDVSNYLVSEDDGSISNGNRDPFLFDGMADAEVERKLKDALAAASTFPVTTANLDPRLT 480

Query: 1875 SSALPYTMVSSGSVPPPTAQASTMQFSHLQIPQPAALAKPMGQVSPFESSLHSSPAREEG 2054
            S  L YTMV SGSVPPPTAQAS M F H+Q PQPA L KPMGQ +P + SLHSSPAREEG
Sbjct: 481  S--LQYTMVPSGSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSDPSLHSSPAREEG 538

Query: 2055 EVPESELDPDTRRRLLILQHGQDIRDHASSEPPFPAKHPIQASSHPIQASTRVPLRGGWF 2234
            EVPESELDPDTRRRLLILQHGQD RDHAS+EPPFP +HP+QAS+  + +S     RG WF
Sbjct: 539  EVPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQASAPRVPSS-----RGVWF 593

Query: 2235 PVEEEIGSQPVNRAVPKDFPLDSGPLRIEKHRPHHPSFFPKVDSSISSDRILHENQQRMP 2414
            PVEEEIGSQP+NR VPK+FP+DSGPL IEK R HHPSFF KV+SSISSDRILH++ QR+P
Sbjct: 594  PVEEEIGSQPLNRVVPKEFPVDSGPLGIEKPRLHHPSFFNKVESSISSDRILHDSHQRLP 653

Query: 2415 KEMYHRDDRSRVSHMLSSYHSLPGDDIPFGR-XXXXXXXXXXXXXXXXXADTPAVVLQEI 2591
            KEMYHRDDR R++HMLSSY S  GDDIPF R                  ADTP  VL EI
Sbjct: 654  KEMYHRDDRPRLNHMLSSYRSFSGDDIPFSRSSSSHRDLDSESGHSVLHADTPVAVLHEI 713

Query: 2592 ALKCGTKVEFTPSLVASAELQFSIEAWFNGKKIGHGFGRTRKEAQHKAAEDSIKHLADMY 2771
            ALKCGTKV+F  SLVAS EL+FS+EAWF+GKKIGHGFGRTRKEAQ+KAA+DSI+HLAD+Y
Sbjct: 714  ALKCGTKVDFMSSLVASTELKFSLEAWFSGKKIGHGFGRTRKEAQNKAAKDSIEHLADIY 773

Query: 2772 LSRAKDEPGSTYGDVSGFPNANDNGYVGSVSSLVNQPLPKEESASFSAASDPSRVLDPRL 2951
            LS AKDEPGSTYGDVSGFPN NDNGY+G  SSL NQPL KE+SASFS+AS PSR LDPRL
Sbjct: 774  LSSAKDEPGSTYGDVSGFPNVNDNGYMGIASSLGNQPLSKEDSASFSSAS-PSRALDPRL 832

Query: 2952 EVSKRSMGSISALKELCTMEGLGVSFLSLPAQGSTNSVHRDEVHAQVEIDGQVFGKGIGL 3131
            +VSKRSMGSISALKELC MEGLGV+FLS PA  STNSV +DEVHAQVEIDG++FGKGIGL
Sbjct: 833  DVSKRSMGSISALKELCMMEGLGVNFLSTPAPVSTNSVQKDEVHAQVEIDGKIFGKGIGL 892

Query: 3132 TWDEAKMQA 3158
            TWDEAKMQA
Sbjct: 893  TWDEAKMQA 901


>ref|XP_015956482.1| RNA polymerase II C-terminal domain phosphatase-like 1 [Arachis
            duranensis]
          Length = 962

 Score = 1443 bits (3736), Expect = 0.0
 Identities = 738/969 (76%), Positives = 800/969 (82%), Gaps = 10/969 (1%)
 Frame = +3

Query: 459  MYKSVVYQGDVVLGEVDIYPEDNN------KNFNVKEIRISHFSQPSERCPPLAVLHSVT 620
            MYKSVVY+GDVVLGEVD+YPE+NN      KN +V+EIRI+HFSQPSERCPPLAVLH+VT
Sbjct: 1    MYKSVVYKGDVVLGEVDLYPEENNNNYLKHKNIDVREIRITHFSQPSERCPPLAVLHTVT 60

Query: 621  SCGVCFKMESK---TQQQDQLFHLHSLCIRDNKTAVMPLYG-DEIHLVAMHSRNDDRPCF 788
            SCGVCFKMESK   TQQQD L HLHSLCI++NKTAVMPL G +EIHLVAM+SR++DRPCF
Sbjct: 61   SCGVCFKMESKNLQTQQQDALSHLHSLCIKENKTAVMPLGGGEEIHLVAMYSRHNDRPCF 120

Query: 789  WGFVVAAGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINLEVDPQR 968
            WGF+VA+GLY+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRI+ALQRKIN EVD QR
Sbjct: 121  WGFIVASGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINSEVDLQR 180

Query: 969  ISGMQAEVKRYLDDKSILKQYAENDQVFDNGKLIKVQSEIVPALSDNHQPIVRPLIRLHE 1148
            ISGMQAEVKRYLDDK+ILKQY ENDQV DNGK+IKVQ EIVPALSD+H PIVRPLIRL E
Sbjct: 181  ISGMQAEVKRYLDDKNILKQYVENDQVVDNGKVIKVQPEIVPALSDSHLPIVRPLIRLPE 240

Query: 1149 KNIILTRINPQIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVFVCTMAERDYALEMWRLL 1328
            KNIILTRINPQIRDTSVLVRLRPAWEDLR YL ARGRKRFEV+VCTMAERDYALEMWRLL
Sbjct: 241  KNIILTRINPQIRDTSVLVRLRPAWEDLRGYLTARGRKRFEVYVCTMAERDYALEMWRLL 300

Query: 1329 DPGLNLINSRELLGRIVCVKSGLKKSLFNVFQDGSCHPKMALVIDDRLKVWDEKDQPRVH 1508
            DP LNLINS+ELL RIVCVK+GLKKSLFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVH
Sbjct: 301  DPDLNLINSKELLDRIVCVKAGLKKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVH 360

Query: 1509 VVPAFAPYYAPQAEATNTVPVLCVARNVACNVRGGXXXXXXXXLLQKVPQIAYEXXXXXX 1688
            VVPAFAPYYAPQAEA N VPVLCVARNVACNVRGG        LLQKVPQIAYE      
Sbjct: 361  VVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKDFDDGLLQKVPQIAYEDDIKDI 420

Query: 1689 XXXXXXSNYLASEDDGSTSNGNRDPLLFDGMADAEVERKLKDALSAASAIPMTTANLDPR 1868
                  SNYL +EDD S  NGNRDPL FDGMADAEVER+LKDA+SA SAIP  TANLDPR
Sbjct: 421  PSAPDVSNYLVAEDDASALNGNRDPLSFDGMADAEVERRLKDAISAVSAIPAITANLDPR 480

Query: 1869 LTSSALPYTMVSSGSVPPPTAQASTMQFSHLQIPQPAALAKPMGQVSPFESSLHSSPARE 2048
            LTSS L YTM SSGS P P AQAS MQF  +Q PQ A L KPM Q +P E SLH SPARE
Sbjct: 481  LTSS-LQYTMASSGSGPLPAAQASMMQFPSVQYPQQATLVKPMVQTAPSEPSLHGSPARE 539

Query: 2049 EGEVPESELDPDTRRRLLILQHGQDIRDHASSEPPFPAKHPIQASSHPIQASTRVPLRGG 2228
            EGEVPESELDPDTRRRLLILQHGQD R+H  SEP FP + P+Q S+ P     RVP RGG
Sbjct: 540  EGEVPESELDPDTRRRLLILQHGQDTREHTPSEPSFPVRQPVQVSAPP-----RVPPRGG 594

Query: 2229 WFPVEEEIGSQPVNRAVPKDFPLDSGPLRIEKHRPHHPSFFPKVDSSISSDRILHENQQR 2408
            WFPVEEEIG Q +NR +P++FP+D+ PLRIEKHRP HP FFPKVD+S+S DR+LHE+ QR
Sbjct: 595  WFPVEEEIGPQQLNRVIPREFPVDTEPLRIEKHRPPHPPFFPKVDNSVSPDRVLHESHQR 654

Query: 2409 MPKEMYHRDDRSRVSHMLSSYHSLPGDDIPFGRXXXXXXXXXXXXXXXXXADTPAVVLQE 2588
            +PKEMYHRDDR+R+S   SSYHS  GDD    R                 ADTP  VLQE
Sbjct: 655  LPKEMYHRDDRTRLSQAPSSYHSFSGDDNSLSRSSSSHKDLESESHSPLSADTPVGVLQE 714

Query: 2589 IALKCGTKVEFTPSLVASAELQFSIEAWFNGKKIGHGFGRTRKEAQHKAAEDSIKHLADM 2768
            IALKCGTKVEF   LVAS ELQFSIEAWF+G+K+G GFGR+RKEAQH+AAE SIK LAD+
Sbjct: 715  IALKCGTKVEFKSCLVASTELQFSIEAWFSGRKVGEGFGRSRKEAQHRAAEHSIKQLADI 774

Query: 2769 YLSRAKDEPGSTYGDVSGFPNANDNGYVGSVSSLVNQPLPKEESASFSAASDPSRVLDPR 2948
            YLSRAK E GSTYGDVSGF  ANDNGYVG+++S+ NQPL KEES SFS ASDPSRVLDPR
Sbjct: 775  YLSRAKAETGSTYGDVSGF-QANDNGYVGNINSIGNQPLSKEESFSFSTASDPSRVLDPR 833

Query: 2949 LEVSKRSMGSISALKELCTMEGLGVSFLSLPAQGSTNSVHRDEVHAQVEIDGQVFGKGIG 3128
            LEVSKRSMGS+SALKELC MEGLGVSF S PA  S N + +DE+HAQVEIDGQVFG+GIG
Sbjct: 834  LEVSKRSMGSVSALKELCMMEGLGVSFQSPPAPVSLNPIQKDEIHAQVEIDGQVFGEGIG 893

Query: 3129 LTWDEAKMQAAEKALGSLRTTYXXXXXXXXXXXXXXXXXXNKRLKQEYPRTLQRFPSSAR 3308
            LTWDEAKMQAAEKALGSLRT                    NKRLK +YPRTLQR PSSAR
Sbjct: 894  LTWDEAKMQAAEKALGSLRTMLGQSIPKRQGSPRPVHGLPNKRLKHDYPRTLQRIPSSAR 953

Query: 3309 YPRNAPPIP 3335
            YPRNAPP+P
Sbjct: 954  YPRNAPPVP 962


>ref|XP_016189791.1| RNA polymerase II C-terminal domain phosphatase-like 1 [Arachis
            ipaensis]
          Length = 962

 Score = 1439 bits (3725), Expect = 0.0
 Identities = 737/969 (76%), Positives = 798/969 (82%), Gaps = 10/969 (1%)
 Frame = +3

Query: 459  MYKSVVYQGDVVLGEVDIYPEDNN------KNFNVKEIRISHFSQPSERCPPLAVLHSVT 620
            MYKSVVY+GDVVLGEVD+YPE+NN      KN +V+EIRI+HFSQPSERCPPLAVLH+VT
Sbjct: 1    MYKSVVYKGDVVLGEVDLYPEENNNNYLKHKNIDVREIRITHFSQPSERCPPLAVLHTVT 60

Query: 621  SCGVCFKMESK---TQQQDQLFHLHSLCIRDNKTAVMPLYG-DEIHLVAMHSRNDDRPCF 788
            SCGVCFKMESK   TQQQD L HLHSLCI++NKTAVMPL G +EIHLVAM+SR++DRPCF
Sbjct: 61   SCGVCFKMESKNLQTQQQDALSHLHSLCIKENKTAVMPLGGGEEIHLVAMYSRHNDRPCF 120

Query: 789  WGFVVAAGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINLEVDPQR 968
            WGF+VA+GLY+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRI+ALQRKIN EVD QR
Sbjct: 121  WGFIVASGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINSEVDLQR 180

Query: 969  ISGMQAEVKRYLDDKSILKQYAENDQVFDNGKLIKVQSEIVPALSDNHQPIVRPLIRLHE 1148
            ISGMQAEVKRYLDDK+ILKQY ENDQV DNGK+IKVQ EIVPALSD+H PIVRPLIRL E
Sbjct: 181  ISGMQAEVKRYLDDKNILKQYVENDQVVDNGKVIKVQPEIVPALSDSHLPIVRPLIRLPE 240

Query: 1149 KNIILTRINPQIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVFVCTMAERDYALEMWRLL 1328
            KNIILTRINPQIRDTSVLVRLRPAWEDLR YL ARGRKRFEV+VCTMAERDYALEMWRLL
Sbjct: 241  KNIILTRINPQIRDTSVLVRLRPAWEDLRGYLTARGRKRFEVYVCTMAERDYALEMWRLL 300

Query: 1329 DPGLNLINSRELLGRIVCVKSGLKKSLFNVFQDGSCHPKMALVIDDRLKVWDEKDQPRVH 1508
            DP LNLINS+ELL RIVCVK+GLKKSLFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVH
Sbjct: 301  DPDLNLINSKELLDRIVCVKAGLKKSLFNVFQDGFCHPKMALVIDDRLKVWDEKDQPRVH 360

Query: 1509 VVPAFAPYYAPQAEATNTVPVLCVARNVACNVRGGXXXXXXXXLLQKVPQIAYEXXXXXX 1688
            VVPAFAPYYAPQAEA N VPVLCVARNVACNVRGG        LLQKVPQIAYE      
Sbjct: 361  VVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKDFDDGLLQKVPQIAYEDDIKDI 420

Query: 1689 XXXXXXSNYLASEDDGSTSNGNRDPLLFDGMADAEVERKLKDALSAASAIPMTTANLDPR 1868
                  SNYL +EDD S  NGNRDPL FDGMADAEVER+LKDA+SA SAIP  TANLDPR
Sbjct: 421  PSAPDVSNYLVAEDDASALNGNRDPLSFDGMADAEVERRLKDAISAVSAIPSITANLDPR 480

Query: 1869 LTSSALPYTMVSSGSVPPPTAQASTMQFSHLQIPQPAALAKPMGQVSPFESSLHSSPARE 2048
            L SS L YTM SSGS P P AQAS MQF  +Q PQ A L KPM Q +P E SLH SPARE
Sbjct: 481  LASS-LQYTMASSGSGPLPAAQASMMQFPSVQYPQQATLVKPMVQTAPSEPSLHGSPARE 539

Query: 2049 EGEVPESELDPDTRRRLLILQHGQDIRDHASSEPPFPAKHPIQASSHPIQASTRVPLRGG 2228
            EGEVPESELDPDTRRRLLILQHGQD R+H  SEP FP + P+Q S     A  RVP RGG
Sbjct: 540  EGEVPESELDPDTRRRLLILQHGQDTREHTPSEPSFPVRQPVQVS-----APARVPPRGG 594

Query: 2229 WFPVEEEIGSQPVNRAVPKDFPLDSGPLRIEKHRPHHPSFFPKVDSSISSDRILHENQQR 2408
            WFPVEEEIG Q +NR VP++FP+D+ PLRIEKHRP HP FFPKVD+S+S DR+LHE+ QR
Sbjct: 595  WFPVEEEIGPQQLNRVVPREFPVDTEPLRIEKHRPPHPPFFPKVDNSVSPDRVLHESHQR 654

Query: 2409 MPKEMYHRDDRSRVSHMLSSYHSLPGDDIPFGRXXXXXXXXXXXXXXXXXADTPAVVLQE 2588
            +PKE+YHRDDR+R+S   SSYHS  GDD    R                 ADTP  VLQE
Sbjct: 655  LPKEIYHRDDRTRLSQAPSSYHSFSGDDNSLSRSSSSHKDLESESHSPLSADTPVGVLQE 714

Query: 2589 IALKCGTKVEFTPSLVASAELQFSIEAWFNGKKIGHGFGRTRKEAQHKAAEDSIKHLADM 2768
            IALKCGTKVEF   LVAS ELQFSIEAWF+G+K+G GFGR+RKEAQH+AAE SIK LAD+
Sbjct: 715  IALKCGTKVEFKSCLVASTELQFSIEAWFSGRKVGEGFGRSRKEAQHRAAEHSIKQLADI 774

Query: 2769 YLSRAKDEPGSTYGDVSGFPNANDNGYVGSVSSLVNQPLPKEESASFSAASDPSRVLDPR 2948
            YLSRAK E GSTYGDVSGF  ANDNGYVG+++S+ NQPL KEES SFS ASDPSRVLDPR
Sbjct: 775  YLSRAKAETGSTYGDVSGF-QANDNGYVGNINSIGNQPLSKEESFSFSTASDPSRVLDPR 833

Query: 2949 LEVSKRSMGSISALKELCTMEGLGVSFLSLPAQGSTNSVHRDEVHAQVEIDGQVFGKGIG 3128
            LEVSKRSMGS+SALKELC MEGLGVSF S PA  S N + +DE+HAQVEIDGQVFG+GIG
Sbjct: 834  LEVSKRSMGSVSALKELCMMEGLGVSFQSPPAPVSLNPIQKDEIHAQVEIDGQVFGEGIG 893

Query: 3129 LTWDEAKMQAAEKALGSLRTTYXXXXXXXXXXXXXXXXXXNKRLKQEYPRTLQRFPSSAR 3308
            LTWDEAKMQAAEKALGSLRT                    NKRLK +YPRTLQR PSSAR
Sbjct: 894  LTWDEAKMQAAEKALGSLRTMLGQSIPKRQGSPRPVHGLPNKRLKHDYPRTLQRIPSSAR 953

Query: 3309 YPRNAPPIP 3335
            YPRNAPP+P
Sbjct: 954  YPRNAPPVP 962


>gb|KRH20485.1| hypothetical protein GLYMA_13G181700 [Glycine max]
          Length = 880

 Score = 1414 bits (3660), Expect = 0.0
 Identities = 714/886 (80%), Positives = 769/886 (86%), Gaps = 9/886 (1%)
 Frame = +3

Query: 459  MYKSVVYQGDVVLGEVDIYPEDNN--------KNFNVKEIRISHFSQPSERCPPLAVLHS 614
            MYKSVVYQG+VV+GEVD+YPE+NN        KNF+VKEIRISHFSQPSERCPPLAVLH+
Sbjct: 1    MYKSVVYQGEVVVGEVDVYPEENNNNNNKNYNKNFHVKEIRISHFSQPSERCPPLAVLHT 60

Query: 615  VTSCGVCFKMESKTQQQDQLFHLHSLCIRDNKTAVMPLYGDEIHLVAMHSRNDDRPCFWG 794
            VTSCGVCFKMESKTQQQD LF LHSLCIR+NKTAVMPL G+EIHLVAMHSRNDDRPCFWG
Sbjct: 61   VTSCGVCFKMESKTQQQDGLFQLHSLCIRENKTAVMPLGGEEIHLVAMHSRNDDRPCFWG 120

Query: 795  FVVAAGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINLEVDPQRIS 974
            F+V  GLY+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN EVDPQRIS
Sbjct: 121  FIVTLGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRIS 180

Query: 975  GMQAEVKRYLDDKSILKQYAENDQVFDNGKLIKVQSEIVPALSDNHQPIVRPLIRLHEKN 1154
            GMQAEVKRYLDDK+ILKQYAENDQV DNG++IKVQSEIVPALSD+HQPIVRPLIRL +KN
Sbjct: 181  GMQAEVKRYLDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRLQDKN 240

Query: 1155 IILTRINPQIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVFVCTMAERDYALEMWRLLDP 1334
            IILTRINPQIRDTSVLVRLRPAWEDLRSYL ARGRKRFEV+VCTMAERDYALEMWRLLDP
Sbjct: 241  IILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP 300

Query: 1335 GLNLINSRELLGRIVCVKSGLKKSLFNVFQDGSCHPKMALVIDDRLKVWDEKDQPRVHVV 1514
              NLINS+ELLGRIVCVKSGLKKSLFNVFQDGSC PKMALVIDDRLKVWDE+DQPRVHVV
Sbjct: 301  DSNLINSKELLGRIVCVKSGLKKSLFNVFQDGSCDPKMALVIDDRLKVWDERDQPRVHVV 360

Query: 1515 PAFAPYYAPQAEATNTVPVLCVARNVACNVRGGXXXXXXXXLLQKVPQIAYEXXXXXXXX 1694
            PAFAPYYAPQAEA+NT+PVLCVARNVACNVRGG        LLQK+PQIAYE        
Sbjct: 361  PAFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKDVPS 420

Query: 1695 XXXXSNYLASEDDGSTSNGNRDPLLFDGMADAEVERKLKDALSAASAIPMTTANLDPRLT 1874
                SNYL SEDDGS SNGNRDP LFDGMADAEVERKLKDAL+AAS  P+TTANLDPRLT
Sbjct: 421  PPDVSNYLVSEDDGSISNGNRDPFLFDGMADAEVERKLKDALAAASTFPVTTANLDPRLT 480

Query: 1875 SSALPYTMVSSGSVPPPTAQASTMQFSHLQIPQPAALAKPMGQVSPFESSLHSSPAREEG 2054
            S  L YTMV SGSVPPPTAQAS M F H+Q PQPA L KPMGQ +P + SLHSSPAREEG
Sbjct: 481  S--LQYTMVPSGSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSDPSLHSSPAREEG 538

Query: 2055 EVPESELDPDTRRRLLILQHGQDIRDHASSEPPFPAKHPIQASSHPIQASTRVPLRGGWF 2234
            EVPESELDPDTRRRLLILQHGQD RDHAS+EPPFP +HP+QAS+  + +S     RG WF
Sbjct: 539  EVPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQASAPRVPSS-----RGVWF 593

Query: 2235 PVEEEIGSQPVNRAVPKDFPLDSGPLRIEKHRPHHPSFFPKVDSSISSDRILHENQQRMP 2414
            PVEEEIGSQP+NR VPK+FP+DSGPL IEK R HHPSFF KV+SSISSDRILH++ QR+P
Sbjct: 594  PVEEEIGSQPLNRVVPKEFPVDSGPLGIEKPRLHHPSFFNKVESSISSDRILHDSHQRLP 653

Query: 2415 KEMYHRDDRSRVSHMLSSYHSLPGDDIPFGR-XXXXXXXXXXXXXXXXXADTPAVVLQEI 2591
            KEMYHRDDR R++HMLSSY S  GDDIPF R                  ADTP  VL EI
Sbjct: 654  KEMYHRDDRPRLNHMLSSYRSFSGDDIPFSRSSSSHRDLDSESGHSVLHADTPVAVLHEI 713

Query: 2592 ALKCGTKVEFTPSLVASAELQFSIEAWFNGKKIGHGFGRTRKEAQHKAAEDSIKHLADMY 2771
            ALKCGTKV+F  SLVAS EL+FS+EAWF+GKKIGHGFGRTRKEAQ+KAA+DSI+HLAD+Y
Sbjct: 714  ALKCGTKVDFMSSLVASTELKFSLEAWFSGKKIGHGFGRTRKEAQNKAAKDSIEHLADIY 773

Query: 2772 LSRAKDEPGSTYGDVSGFPNANDNGYVGSVSSLVNQPLPKEESASFSAASDPSRVLDPRL 2951
            LS AKDEPGSTYGDVSGFPN NDNGY+G  SSL NQPL KE+SASFS+AS PSR LDPRL
Sbjct: 774  LSSAKDEPGSTYGDVSGFPNVNDNGYMGIASSLGNQPLSKEDSASFSSAS-PSRALDPRL 832

Query: 2952 EVSKRSMGSISALKELCTMEGLGVSFLSLPAQGSTNSVHRDEVHAQ 3089
            +VSKRSMGSISALKELC MEGLGV+FLS PA  STNSV +DEVHAQ
Sbjct: 833  DVSKRSMGSISALKELCMMEGLGVNFLSTPAPVSTNSVQKDEVHAQ 878


>ref|XP_012572568.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X2 [Cicer arietinum]
          Length = 881

 Score = 1405 bits (3637), Expect = 0.0
 Identities = 712/880 (80%), Positives = 758/880 (86%), Gaps = 1/880 (0%)
 Frame = +3

Query: 459  MYKSVVYQGDVVLGEVDIYPEDNNKNFNVKEIRISHFSQPSERCPPLAVLHSVTSCGVCF 638
            MYKS+VYQG+VVLGEVDIYPE NN N N KEIRISHF+QPSERC PLAVLH++TS GVCF
Sbjct: 1    MYKSLVYQGEVVLGEVDIYPEVNNNNKNFKEIRISHFTQPSERCLPLAVLHTITSSGVCF 60

Query: 639  KMESKTQQQDQLFHLHSLCIRDNKTAVMPLYGDEIHLVAMHSRNDDRPCFWGFVVAAGLY 818
            KMESKTQQQD LFHLH+LC R+NKTAVMPL G+E+HLVAMHSR++ RPCFWG++V  GLY
Sbjct: 61   KMESKTQQQDPLFHLHNLCFRENKTAVMPLCGEEMHLVAMHSRSNGRPCFWGYIVGMGLY 120

Query: 819  NSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINLEVDPQRISGMQAEVKR 998
            NSCL+MLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKIN EVDPQRISGMQAEVKR
Sbjct: 121  NSCLMMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISGMQAEVKR 180

Query: 999  YLDDKSILKQYAENDQVFDNGKLIKVQSEIVPALSDNHQPIVRPLIRLHEKNIILTRINP 1178
            YL+DKSILKQY ENDQV DNGK++K QSE+VPALSD+HQPIVRPLIRLHEKNIILTRINP
Sbjct: 181  YLEDKSILKQYVENDQVVDNGKVLKAQSELVPALSDSHQPIVRPLIRLHEKNIILTRINP 240

Query: 1179 QIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVFVCTMAERDYALEMWRLLDPGLNLINSR 1358
            QIRDTSVLVRLRPAWEDLRSYL ARGRKRFEV+VCTMAERDYALEMWRLLDP  NLINS+
Sbjct: 241  QIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNLINSK 300

Query: 1359 ELLGRIVCVKSGLKKSLFNVFQDGSCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYA 1538
            ELLGRIVCVKSGLKKSLFNVFQDG CHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYA
Sbjct: 301  ELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYYA 360

Query: 1539 PQAEATNTVPVLCVARNVACNVRGGXXXXXXXXLLQKVPQIAYEXXXXXXXXXXXXSNYL 1718
            PQAEA+NT+PVLCVARNVACNVRGG        LLQK+ QIAYE            SNYL
Sbjct: 361  PQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKISQIAYENNTRDISPAPDVSNYL 420

Query: 1719 ASEDDGSTSNGNRDPLLFDGMADAEVERKLKDALSAASAIPMTTANLDPRLTSSALPYTM 1898
             SEDDGS S  NRDP  FDGMADAEVERKLKDA+SAASAIPMTTA LDPRLTSS L YTM
Sbjct: 421  VSEDDGSASYANRDPFAFDGMADAEVERKLKDAISAASAIPMTTAKLDPRLTSS-LQYTM 479

Query: 1899 VSSGSVPPPTAQASTMQFSHLQIPQPAALAKPMGQVSPFESSLHSSPAREEGEVPESELD 2078
            VS GSV PP AQAS +   H Q PQPA L KP+GQV+P E SLHSSPAREEGEVPESELD
Sbjct: 480  VSPGSVLPPAAQASMIPLPHTQFPQPATLVKPIGQVAPSELSLHSSPAREEGEVPESELD 539

Query: 2079 PDTRRRLLILQHGQDIRDHASSEPPFPAKHPIQASSHPIQASTRVPLRGGWFPVEEEIGS 2258
            PDTRRRLLILQHGQD RDH SSEPPFP KHP+Q S+       RVP RGGWFPVEEEIGS
Sbjct: 540  PDTRRRLLILQHGQDNRDHTSSEPPFPLKHPVQVSA-------RVPPRGGWFPVEEEIGS 592

Query: 2259 QPVNRAVPKDFPLDSGPLRIEKHRPHHPSFFPKVDSSISSDRILHENQQRMPKEMYHRDD 2438
            QP NR +PK+  LDSGP RIEKHR H   FFPKVD SISSDR LHE  QR+PKEMYHRDD
Sbjct: 593  QPPNRVIPKEIALDSGPSRIEKHRLHQQPFFPKVDGSISSDRALHETNQRLPKEMYHRDD 652

Query: 2439 RSRVSHMLSSYHSLPGDDIPFGR-XXXXXXXXXXXXXXXXXADTPAVVLQEIALKCGTKV 2615
            RSRVSHMLSSY SL GDD PFGR                  A+TPA+VLQEIALKCGTKV
Sbjct: 653  RSRVSHMLSSYPSLSGDDTPFGRSSSSHRDFDSESGHSVFNAETPAIVLQEIALKCGTKV 712

Query: 2616 EFTPSLVASAELQFSIEAWFNGKKIGHGFGRTRKEAQHKAAEDSIKHLADMYLSRAKDEP 2795
            EFT SL AS ELQFSIEAWF+GKKIGHGFGRTR EAQ+KAAEDSIKHLAD+YLSRAKDE 
Sbjct: 713  EFTSSLAASRELQFSIEAWFSGKKIGHGFGRTRMEAQYKAAEDSIKHLADIYLSRAKDES 772

Query: 2796 GSTYGDVSGFPNANDNGYVGSVSSLVNQPLPKEESASFSAASDPSRVLDPRLEVSKRSMG 2975
            GS +GDVSGFPNANDNGYVG+VSSL NQPLPKEES SFSAASDPSRVLDPRL+VSKRSMG
Sbjct: 773  GSAFGDVSGFPNANDNGYVGNVSSLGNQPLPKEESVSFSAASDPSRVLDPRLDVSKRSMG 832

Query: 2976 SISALKELCTMEGLGVSFLSLPAQGSTNSVHRDEVHAQVE 3095
            S+SALKELC +EGLGV+FLSLPA  STNSV  DEVHAQ++
Sbjct: 833  SVSALKELCMVEGLGVNFLSLPAPVSTNSV--DEVHAQLK 870


>ref|XP_019425587.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X1 [Lupinus angustifolius]
 ref|XP_019425589.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X1 [Lupinus angustifolius]
 gb|OIV91757.1| hypothetical protein TanjilG_26610 [Lupinus angustifolius]
          Length = 963

 Score = 1373 bits (3555), Expect = 0.0
 Identities = 709/971 (73%), Positives = 783/971 (80%), Gaps = 12/971 (1%)
 Frame = +3

Query: 459  MYKSVVYQGDVVLGEVDIYPEDNNKN-------FNVKEIRISHFSQPSERCPPLAVLHSV 617
            MYKSVVYQG++VLGEVDIYP++ NK         N+KEIRI++FS+ SERCPPLAVLH++
Sbjct: 1    MYKSVVYQGEMVLGEVDIYPDEKNKKSMMMMMMMNLKEIRITNFSKQSERCPPLAVLHTI 60

Query: 618  TS--CGVCFKMESK-TQQQDQLFHLHSLCIRDNKTAVMPLYGDEIHLVAMHSRNDDRPCF 788
            TS  CGVCFKMESK TQQQD LFHLHS CIR+NKTAV+PLYG+E+HLVAM+SR +DRPCF
Sbjct: 61   TSSSCGVCFKMESKLTQQQDLLFHLHSTCIRENKTAVVPLYGEELHLVAMYSRINDRPCF 120

Query: 789  WGFVVAAGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINLEVDPQR 968
            WGF+VA GLY+SCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRI+ALQRKIN E+DPQR
Sbjct: 121  WGFIVAPGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINSEIDPQR 180

Query: 969  ISGMQAEVKRYLDDKSILKQYAENDQVFDNGKLIKVQSEIVPALSDNHQPIVRPLIRLHE 1148
            ISGMQAE++RYL+DKSILKQYAENDQV DNGK+IK+QSEIVP+LS +HQPIVRPLIRL E
Sbjct: 181  ISGMQAEIRRYLEDKSILKQYAENDQVVDNGKVIKIQSEIVPSLSGSHQPIVRPLIRLQE 240

Query: 1149 KNIILTRINPQIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVFVCTMAERDYALEMWRLL 1328
            KNIILTRINP IRDTSVLVRLRPAWEDLRSYL ARGRKRFEVFVCTMAERDYALEMWRLL
Sbjct: 241  KNIILTRINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLL 300

Query: 1329 DPGLNLINSRELLGRIVCVKSGLKKSLFNVFQDGSCHPKMALVIDDRLKVWDEKDQPRVH 1508
            DP LNLI+S+ELL RIVCVKSGLKKSLFNVFQDGSCHPKMALVIDDRLKVWDEKDQPRVH
Sbjct: 301  DPDLNLISSKELLDRIVCVKSGLKKSLFNVFQDGSCHPKMALVIDDRLKVWDEKDQPRVH 360

Query: 1509 VVPAFAPYYAPQAEATNTVPVLCVARNVACNVRGGXXXXXXXXLLQKVPQIAYEXXXXXX 1688
            VVPAFAPYYAPQAEA+N +P+LCVARNVACNVRGG        LLQK+P IAYE      
Sbjct: 361  VVPAFAPYYAPQAEASNAIPILCVARNVACNVRGGFFKDFDDGLLQKIPLIAYEDDINDI 420

Query: 1689 XXXXXXSNYLASEDDGSTSNGNRDPLLFDGMADAEVERKLKDALSAASAIPMTTANLDPR 1868
                  SNYL SEDD S SNGN D L FDGMADAEVER+LK+AL AAS+IP  TANLDPR
Sbjct: 421  PSPPDVSNYLVSEDDASASNGNIDSLQFDGMADAEVERRLKEALLAASSIPPITANLDPR 480

Query: 1869 LTSSALPYTMVSSGSVPPPTAQASTMQFSHLQIPQPAALAKPMGQVSPFESSLHSSPARE 2048
            L SS    T  SSG+VPPPT QA  +Q +++Q PQ A L KPM QV+P E SLHSSPARE
Sbjct: 481  LASSLQYTTASSSGTVPPPTVQAPVIQIANMQFPQSATLVKPMSQVAP-EQSLHSSPARE 539

Query: 2049 EGEVPESELDPDTRRRLLILQHGQDIRDHASSEPPFPAKHPIQASSHPIQASTRVPLRGG 2228
            EGEVPESELDPDTRRRLLILQHGQDIR++ SSEPPFP + P+Q S         +P   G
Sbjct: 540  EGEVPESELDPDTRRRLLILQHGQDIRENTSSEPPFPVRLPVQVS------PPHIPSHAG 593

Query: 2229 WFPVEEEIGSQPVNRAVPKDFPLDSGPLRIEKHRPHHPSFFPKVDSSISSDRILHENQQR 2408
            WFPV+EE G Q +NR VPK+FP++S PL IEK  P  PSFF  VD+ +SSDRILHEN QR
Sbjct: 594  WFPVKEERGPQQLNRVVPKEFPVESEPLHIEKKWPRRPSFFSNVDNPMSSDRILHENHQR 653

Query: 2409 MPKEMYHRDDRSRVSHMLSSYHSLPGDDIPFG-RXXXXXXXXXXXXXXXXXADTPAVVLQ 2585
            +PKE+YHRDDR R++H  S YHS  GDDIP G                   AD+PA VL+
Sbjct: 654  LPKEVYHRDDRLRLNHTHSGYHSFAGDDIPLGSTSSSNWDLDSESGHPLFYADSPAGVLR 713

Query: 2586 EIALKCGTKVEFTPSLVASAELQFSIEAWFNGKKIGHGFGRTRKEAQHKAAEDSIKHLAD 2765
            EIALKCGT+VEF  SLVAS ELQFSIEAWF GKKIG G GRTRKEAQ+KAAEDSIK LAD
Sbjct: 714  EIALKCGTRVEFLSSLVASTELQFSIEAWFAGKKIGEGIGRTRKEAQYKAAEDSIKQLAD 773

Query: 2766 MYLSRAKDEPGSTYGDVSGFPNANDNGYVGSVSSLVNQPLPKEESASFSAASDPSRVLDP 2945
            +Y+S  K + GSTYGDV+ FP   DNG++ SV+SL NQ LPKEE  SFS ASDP R LDP
Sbjct: 774  IYMSHTKADSGSTYGDVTAFPGVEDNGFMSSVNSLGNQLLPKEELDSFSTASDPLRGLDP 833

Query: 2946 RLEVSKRSMGSISALKELCTMEGLGVSFLSLPAQGSTNSVHRDEVHAQVEIDGQVFGKGI 3125
            R EV KRSMGSISALKELC MEGLGVSF S P   STN V +DEVHAQVEIDGQVFGKGI
Sbjct: 834  RFEV-KRSMGSISALKELCMMEGLGVSFQSPPTPVSTNFVQKDEVHAQVEIDGQVFGKGI 892

Query: 3126 GLTWDEAKMQAAEKALGSLRTTY-XXXXXXXXXXXXXXXXXXNKRLKQEYPRTLQRFPSS 3302
            GLTW+EAKMQAA+KALGSLRT                     NKR+KQEYPRT QR PSS
Sbjct: 893  GLTWNEAKMQAADKALGSLRTMLGEGTQKRQGSPLRPWRGFSNKRMKQEYPRTPQRIPSS 952

Query: 3303 ARYPRNAPPIP 3335
            ARYPRNAPP+P
Sbjct: 953  ARYPRNAPPVP 963


>ref|XP_019445654.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 1
            isoform X2 [Lupinus angustifolius]
 gb|OIW10434.1| hypothetical protein TanjilG_24994 [Lupinus angustifolius]
          Length = 952

 Score = 1366 bits (3535), Expect = 0.0
 Identities = 704/960 (73%), Positives = 783/960 (81%), Gaps = 3/960 (0%)
 Frame = +3

Query: 459  MYKSVVYQGDVVLGEVDIYPEDNNKNFNVKEIRISHFSQPSERCPPLAVLHSVTSCGVCF 638
            MY+SVVYQG+VVLG V I+PE+  K  ++KEIRI+HFSQ SERCPPLAVLH+VTS  VCF
Sbjct: 1    MYESVVYQGNVVLGSVVIHPEE--KIIHLKEIRINHFSQSSERCPPLAVLHTVTSASVCF 58

Query: 639  KMESK-TQQQDQLFHLHSLCIRDNKTAVMPLYGDEIHLVAMHSRNDDRPCFWGFVVAAGL 815
            KMESK TQQQD LFHLHSLCIR+NKTA+MP   +EIHLVAM+SRN+D PCFWG+VVA+GL
Sbjct: 59   KMESKITQQQDGLFHLHSLCIRENKTAIMPFGSEEIHLVAMYSRNNDSPCFWGYVVASGL 118

Query: 816  YNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINLEVDPQRISGMQAEVK 995
            Y+SCLVMLNLRCL IVFDLDETLIVANTMRSFEDRI+ALQRKIN EVDPQRI+GMQAEVK
Sbjct: 119  YDSCLVMLNLRCLAIVFDLDETLIVANTMRSFEDRIEALQRKINSEVDPQRITGMQAEVK 178

Query: 996  RYLDDKSILKQYAENDQVFDNGKLIKVQSEIVPALSDNHQPIVRPLIRLHEKNIILTRIN 1175
            RYLDDK+ILKQYAENDQV DNG+++KVQSEIVPALSD+HQ IVRPLIRL EKNIILTRIN
Sbjct: 179  RYLDDKNILKQYAENDQVVDNGRVMKVQSEIVPALSDSHQSIVRPLIRLQEKNIILTRIN 238

Query: 1176 PQIRDTSVLVRLRPAWEDLRSYLIARGRKRFEVFVCTMAERDYALEMWRLLDPGLNLINS 1355
            PQIRDTSVLVRLRPAWEDLR YL ARGRKRFEV+VCTMAERDYALEMWRLLDP LNLIN+
Sbjct: 239  PQIRDTSVLVRLRPAWEDLRGYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDLNLINA 298

Query: 1356 RELLGRIVCVKSGLKKSLFNVFQDGSCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYY 1535
            +ELL RIVCVKSGLKKSLFNVFQ+G CHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYY
Sbjct: 299  KELLDRIVCVKSGLKKSLFNVFQNGFCHPKMALVIDDRLKVWDEKDQPRVHVVPAFAPYY 358

Query: 1536 APQAEATNTVPVLCVARNVACNVRGGXXXXXXXXLLQKVPQIAYEXXXXXXXXXXXXSNY 1715
            APQAEA+N +PVLCVARNVACNVRGG        LLQK+PQIAYE            SNY
Sbjct: 359  APQAEASNAIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYE-DDIKDVPSPDVSNY 417

Query: 1716 LASEDDGSTSNGNRDPLLFDGMADAEVERKLKDA-LSAASAIPMTTANLDPRLTSSALPY 1892
            L SEDD S SNGNRDP+LFDGMADA++ER+LK+A ++AAS+IP+ TANLDPRLT + L Y
Sbjct: 418  LVSEDDPSASNGNRDPILFDGMADADIERRLKEAIIAAASSIPVATANLDPRLT-TLLQY 476

Query: 1893 TMVSSGSVPPPTAQASTMQFSHLQIPQPAALAKPMGQVSPFESSLHSSPAREEGEVPESE 2072
            T+V S SV PPTAQ S MQFS +   QPA L KPMGQV+P E SLHSSPAREEGEVPESE
Sbjct: 477  TVVPSDSVSPPTAQPSMMQFSSVPFLQPATLVKPMGQVAPPEPSLHSSPAREEGEVPESE 536

Query: 2073 LDPDTRRRLLILQHGQDIRDHASSEPPFPAKHPIQASSHPIQASTRVPLRGGWFPVEEEI 2252
            LDPDTRRRLLILQHGQDIRDH SSEPPFP   P+Q        +  VP RG WFP EEEI
Sbjct: 537  LDPDTRRRLLILQHGQDIRDHTSSEPPFPISQPMQV------PTPHVPARGAWFPGEEEI 590

Query: 2253 GSQPVNRAVPKDFPLDSGPLRIEKHRPHHPSFFPKVDSSISSDRILHENQQRMPKEMYHR 2432
            GSQ ++RAVPK+F +DS PL IEKH PHHPSFF K D++ISSD+IL+E+ QR+PKEM+ R
Sbjct: 591  GSQQLSRAVPKEFRVDSEPLHIEKHGPHHPSFFSKADNAISSDKILNESHQRLPKEMFQR 650

Query: 2433 DDRSRVSHMLSSYHSLPGDDIPFGR-XXXXXXXXXXXXXXXXXADTPAVVLQEIALKCGT 2609
            D+RSR+SH LSSYHS  GDDI   R                  ADTPA V+QEIALKCGT
Sbjct: 651  DNRSRLSHKLSSYHSFSGDDIRLSRSFSSHRGLDSESGHSLLHADTPAGVVQEIALKCGT 710

Query: 2610 KVEFTPSLVASAELQFSIEAWFNGKKIGHGFGRTRKEAQHKAAEDSIKHLADMYLSRAKD 2789
            KVEFT SLVAS ELQFS+EAWF+G++IG G G+TRKEAQHKA+E+S+K+LAD+YLSRAK 
Sbjct: 711  KVEFTSSLVASTELQFSVEAWFSGRRIGQGLGKTRKEAQHKASEESLKYLADIYLSRAKA 770

Query: 2790 EPGSTYGDVSGFPNANDNGYVGSVSSLVNQPLPKEESASFSAASDPSRVLDPRLEVSKRS 2969
            + GSTYG+  GF  ANDNG+VG+V+S  NQ  PKE+S SFS +SD SRVLD R EVSKR 
Sbjct: 771  DSGSTYGNAKGFSKANDNGHVGNVNSPGNQSWPKEDSVSFSTSSDSSRVLDHRFEVSKRP 830

Query: 2970 MGSISALKELCTMEGLGVSFLSLPAQGSTNSVHRDEVHAQVEIDGQVFGKGIGLTWDEAK 3149
            MGS+SALKELC MEGL V F S  A  S NS  +D V+AQVEIDGQVFGKGIGLTWDEAK
Sbjct: 831  MGSVSALKELCMMEGLSVRFQSPDAPASPNSTQKDAVYAQVEIDGQVFGKGIGLTWDEAK 890

Query: 3150 MQAAEKALGSLRTTYXXXXXXXXXXXXXXXXXXNKRLKQEYPRTLQRFPSSARYPRNAPP 3329
            MQAAEKALGSLRT                     KR K EYPR LQR PSS RY RNAPP
Sbjct: 891  MQAAEKALGSLRTMLGQNLQKRQDSPRSSPGLPTKRSKHEYPRNLQRIPSSGRYLRNAPP 950

