BLASTX nr result

ID: Astragalus22_contig00010840 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00010840
         (4218 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004492028.1| PREDICTED: RNA polymerase II C-terminal doma...  1567   0.0  
ref|XP_020218578.1| RNA polymerase II C-terminal domain phosphat...  1439   0.0  
ref|XP_007139315.1| hypothetical protein PHAVU_008G019000g [Phas...  1413   0.0  
ref|XP_003621644.2| carboxy-terminal domain phosphatase-like pro...  1383   0.0  
ref|XP_006603006.1| PREDICTED: RNA polymerase II C-terminal doma...  1365   0.0  
gb|KHN13828.1| RNA polymerase II C-terminal domain phosphatase-l...  1363   0.0  
ref|XP_014626407.1| PREDICTED: RNA polymerase II C-terminal doma...  1359   0.0  
dbj|GAU14128.1| hypothetical protein TSUD_169530 [Trifolium subt...  1352   0.0  
ref|XP_003530482.2| PREDICTED: RNA polymerase II C-terminal doma...  1352   0.0  
gb|KHN47532.1| RNA polymerase II C-terminal domain phosphatase-l...  1346   0.0  
dbj|BAT83124.1| hypothetical protein VIGAN_04022800 [Vigna angul...  1311   0.0  
ref|XP_017419004.1| PREDICTED: RNA polymerase II C-terminal doma...  1309   0.0  
ref|XP_019419694.1| PREDICTED: RNA polymerase II C-terminal doma...  1308   0.0  
gb|PNY08592.1| RNA polymerase II C-terminal domain phosphatase 3...  1303   0.0  
ref|XP_014497833.1| RNA polymerase II C-terminal domain phosphat...  1294   0.0  
gb|OIW17294.1| hypothetical protein TanjilG_22406 [Lupinus angus...  1269   0.0  
ref|XP_019419706.1| PREDICTED: RNA polymerase II C-terminal doma...  1240   0.0  
ref|XP_019455025.1| PREDICTED: RNA polymerase II C-terminal doma...  1234   0.0  
ref|XP_013447776.1| carboxy-terminal domain phosphatase-like pro...  1224   0.0  
gb|OIW05466.1| hypothetical protein TanjilG_12057 [Lupinus angus...  1217   0.0  

>ref|XP_004492028.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Cicer arietinum]
          Length = 1247

 Score = 1567 bits (4057), Expect = 0.0
 Identities = 849/1255 (67%), Positives = 940/1255 (74%), Gaps = 17/1255 (1%)
 Frame = +3

Query: 327  MVFGSFLDCSKLKNRXXXXXXXXXXXXXXISDTASVEEISADDFNKQXXXXXXXXXXXXX 506
            MVFGSFLDC   KN               ISDTASV EIS +DFNKQ             
Sbjct: 1    MVFGSFLDCENSKNFEGMGKEVEDVEEGEISDTASVVEISEEDFNKQDVVKVNNNSDSDK 60

Query: 507  XXXX---RVWAVHDLYSKYPTISRGYASGLYNLAWAQAVQNKPLNDIFIMEXXXXXXXXX 677
                   RVWAVHDLYSKYPTI RGYASGLYNLAWAQAVQNKPLNDIF+ME         
Sbjct: 61   AKTGGDARVWAVHDLYSKYPTICRGYASGLYNLAWAQAVQNKPLNDIFVMELDSDSNANA 120

Query: 678  XXXXXXXXXXXX------QXXXXXXXXXXXXXXXXXXXXXXXXXXXXMAGGDASETVSDS 839
                              +                            M GGD SETVS+S
Sbjct: 121  NSNNDSNNGNGDLNMPLKEVVMVDDDEREEGELEEGEIDGDDDTGGVMVGGDGSETVSES 180

Query: 840  ELLGVKDVLEGVTVANVAESFAETCTRIQSAVQSKVFTGPADSEKDDLVRLSFNAIEVVY 1019
            +   ++D LEGVTVANVAESFAET +R+   +QSK+ +GPA SEKD ++RL +NAIE+V+
Sbjct: 181  D---IRDFLEGVTVANVAESFAETISRLLRVLQSKLLSGPAVSEKDYVIRLLYNAIEIVH 237

Query: 1020 SVFCSMYNLQKEENKDNILRLLSFLKDEGTHLFSPEHMKEIEVMITAINSVGSLGSSEAV 1199
            SVFCSM NLQKE+NKDNI+RLL FLK+E T LFSPEHMKEI+VMITAI++V +LG+S  V
Sbjct: 238  SVFCSMDNLQKEDNKDNIIRLLYFLKNEHTQLFSPEHMKEIQVMITAIDTVDALGNSVVV 297

Query: 1200 GKEENLETHETKTWEISAVKYGGELISFSKPGNSNSIEASEASKSGQSIIKGRGVXXXXX 1379
            G  E L+T + KT +I  +K   ELIS SK  +SN  EASEA  SGQS IKGRGV     
Sbjct: 298  GNGEKLDTLDIKTRQIQGLK-ASELISSSKLVHSNLTEASEALLSGQSNIKGRGVMLPLF 356

Query: 1380 XXXXXXXXXXXXXPTREAPSCFPVKKSLSVGEGMDKSGLPLAGKTGCGKMELDGEGSKFH 1559
                         PTREAPS FPV K  SVG+GMD+ GLP AGKT   KMELD E SK H
Sbjct: 357  DLHKVHDLDSLPSPTREAPSFFPVNKLFSVGDGMDRPGLPSAGKTEAVKMELDTENSKNH 416

Query: 1560 LYETDALRAVSTYQQKFGRSSFFTNDELPSPTPSGDCEEGVVDTNDEVXXXXXXXXXXXX 1739
            LYETDAL+AVSTYQQKFGRSS+FT+D+ PSPTPSGDCEEGV D N+EV            
Sbjct: 417  LYETDALKAVSTYQQKFGRSSYFTDDKFPSPTPSGDCEEGVADANEEVSSASIAVSLTSS 476

Query: 1740 KPTLLDQIPVXXXXXXXXXXVHGLVNSRIDAAGSGSYAGKTSAKSRDPRLRFINSDASTL 1919
            KP LLDQ+PV          +HGL+NSRI+AA S +Y  KTSA+SRDPRLRFINSDAS L
Sbjct: 477  KP-LLDQMPVSSTSVDRSS-MHGLINSRIEAASSVTYPVKTSARSRDPRLRFINSDASAL 534

Query: 1920 DLNQPSGTHNMPKVEYGGTITSRKQKTVEEPSLDATVTKKLRRSLENTEHSMREARTMAG 2099
            DLNQ  GT+NMPKVE  G + SRKQKT EE SLDAT  K+LR SLEN+ H+ RE RTMAG
Sbjct: 535  DLNQSLGTNNMPKVENAGRVISRKQKTTEELSLDATAPKRLRSSLENSRHNTREERTMAG 594

Query: 2100 NGGWLEDTTLAGSQLIERNHLMQKGETELNTTFSTSSGNLNVTSNGNEQAPVT-SSTTAS 2276
            NGGWLE+  +AGS LIERNHLMQKGETEL  T STSSG   VTSNGNEQAPVT S+T A+
Sbjct: 595  NGGWLEENRVAGSHLIERNHLMQKGETELKKTMSTSSGYSTVTSNGNEQAPVTVSNTAAA 654

Query: 2277 LPDLLKGIAVNPTMLLNILMEQQ-RLAAEAKKNSADSASSTLHLRSSNSAKGADTTVNIG 2453
            LP LLK IAVNPTMLLNIL+EQQ RLAAEA K   DSA+ST+HL  +NSA+G D TVN G
Sbjct: 655  LPGLLKNIAVNPTMLLNILLEQQQRLAAEANKKPVDSATSTMHL--TNSARGPDATVNTG 712

Query: 2454 PAMTAGIPQNSVGMPPVSSQAASMAQRIQEDSGKIRMKPRDPRRILHGSGALQKSGKLGS 2633
            PAMTAG+PQ+SVGM P S+QAASMA  + EDSGKIRMKPRDPRRILHGS +LQKSG  GS
Sbjct: 713  PAMTAGLPQSSVGMLPASTQAASMAHTLLEDSGKIRMKPRDPRRILHGSSSLQKSGSTGS 772

Query: 2634 EQFKAIVSPMSNNQGARDNVNAQKSEVGVGTKLVPTKSIAPPDITRQFTSNLKNIADLMS 2813
            EQ K++VSP SNNQG   NVNAQK +V V TKL PT+S A PDITRQFT NLKNIAD+MS
Sbjct: 773  EQSKSVVSPTSNNQGNGGNVNAQKLDVRVETKLAPTQSSAQPDITRQFTKNLKNIADIMS 832

Query: 2814 VPQESSDHSPAT-QNVSSASVPFTLDKAE-----QSSQNLQAVTGLAPETCASGSSRSQS 2975
            V QE S   PAT QNVSSASVPFTLDKAE      +SQNLQ   G APETCA GSSRSQS
Sbjct: 833  VSQEPSTQLPATTQNVSSASVPFTLDKAELKSGVPNSQNLQDGVGSAPETCAPGSSRSQS 892

Query: 2976 TWADVEHLFEGYNEQQKAAIQRERARRLEEQNKMFSARKXXXXXXXXXXXXNSAKFVEVD 3155
            TWADVEHLFEGY+E+QKAAIQRERARRLEEQNKMF+++K            NSAKFVEVD
Sbjct: 893  TWADVEHLFEGYDEKQKAAIQRERARRLEEQNKMFASKKLCLVLDLDHTLLNSAKFVEVD 952

Query: 3156 PVYDEILRKKEQEDQEMPQRHLFRFSHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKA 3335
            PV+DEILRKKE++D+E P RHLFRF HMGMWTKLRPGVWNFLEKASKLYELHLYTMGNK 
Sbjct: 953  PVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKL 1012

Query: 3336 YATQMAKVLDPKGLLFAGRVISRGDDTESIDGDERAPRNKDLEGVLGMEXXXXXXXXXXR 3515
            YAT+MAKVLDPKG+LFAGRVISRGDDTES+DGDERAP++KDLEGV+GME          R
Sbjct: 1013 YATEMAKVLDPKGVLFAGRVISRGDDTESVDGDERAPKSKDLEGVMGMESSVVIVDDSVR 1072

Query: 3516 VWPHNRPNLIVVERYLYFPSSRRQFGLTGQSLLEVDRDEVPEAGTLAVCLGVIEKLHQTF 3695
            VWPHN+ NLIVVERY YFP SRRQFGL G SLLE+D DE PEAGTLA  L VIE++HQ F
Sbjct: 1073 VWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEAGTLASSLAVIERIHQNF 1132

Query: 3696 FASQSLEEADVRNILATEQRKILAGCRIVFSRMFPVGETNPHLHPLWQTAEQFGAVCTNQ 3875
            FASQSLEE DVRNILA+EQRKILAGCRIVFSR+FPVGE NPHLHPLWQTAEQFGAVC NQ
Sbjct: 1133 FASQSLEEVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCINQ 1192

Query: 3876 IDEQVTHVVASSPGTDKVNWAFSTGRFVVLPGWVEASALLYRRLNEQDFAIKPEK 4040
            ID+QVTHVVA+S GTDKVNWA STGRFVV PGWVEASALLYRR NEQDFAIKPEK
Sbjct: 1193 IDDQVTHVVANSLGTDKVNWAISTGRFVVHPGWVEASALLYRRANEQDFAIKPEK 1247


>ref|XP_020218578.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Cajanus
            cajan]
          Length = 1248

 Score = 1439 bits (3724), Expect = 0.0
 Identities = 793/1255 (63%), Positives = 895/1255 (71%), Gaps = 19/1255 (1%)
 Frame = +3

Query: 327  MVFGSFLDCSKLKNRXXXXXXXXXXXXXXISDTASVEEISADDFNKQXXXXXXXXXXXXX 506
            MVFGS  DC  LK                ISDTASVEEI+ +DFNKQ             
Sbjct: 1    MVFGSLFDCENLKGLEKMGKEVEDVEEGEISDTASVEEITEEDFNKQDVKVNNNNNKPNG 60

Query: 507  XXXXRVWAVHDLYSKYPTISRGYASGLYNLAWAQAVQNKPLNDIFIMEXXXXXXXXXXXX 686
                RVWAVHDLYSKYPTI RGYASGLYNLAWAQAVQNKPLNDIF+ME            
Sbjct: 61   SDA-RVWAVHDLYSKYPTICRGYASGLYNLAWAQAVQNKPLNDIFVMELDSDGNANSNSN 119

Query: 687  XXXXXXXXXQXXXXXXXXXXXXXXXXXXXXXXXXXXXXMAGGD-----ASETVSDSELLG 851
                                                   A G+     ASETV+DSE LG
Sbjct: 120  NSNRPSSGSVNSKEVVMVDVDKEEGELEEGEIDADADPEAEGETVVVSASETVADSEQLG 179

Query: 852  VKDVLEGVTVANVAESFAETCTRIQSAVQSKVFTGPADSEKDDLVRLSFNAIEVVYSVFC 1031
            ++DVLEGVTVANVAESF + C R+Q+A+  +V + PAD EKD LVRLSFNAIEVVYSVFC
Sbjct: 180  LRDVLEGVTVANVAESFPQACYRLQNALP-QVSSRPADPEKDGLVRLSFNAIEVVYSVFC 238

Query: 1032 SMYNLQKEENKDNILRLLSFLKDEG-THLFSPEHMKEIEVMITAINSVGSLGSSEAVGKE 1208
            SM   +KE+NKDNILRLLSF+KD+    LFSPEH+KEI+ M+ AI+SVG+L +  A+GKE
Sbjct: 239  SMETSEKEQNKDNILRLLSFVKDQQQAQLFSPEHIKEIQGMMIAIDSVGALSNRGAIGKE 298

Query: 1209 ENLETHETKTWEISAVKYGGELISFSKPGNSNSIEASEASKSGQSIIKGRGVXXXXXXXX 1388
            + LETH+ K  + S      ELIS SK  ++++I A + SK GQ+ IKGRG+        
Sbjct: 299  KELETHDIKMEDPSVEV--AELISSSKHLHTDTIGALQVSKFGQNSIKGRGILLPLLDLH 356

Query: 1389 XXXXXXXXXXPTREAPSCFPVKKSLSVGEGMD-KSGLPLAGKTGCGKMELDGEGSKFHLY 1565
                      PTREAPSCFPV K LS GE +  KSG P    T   KMEL  E SK H Y
Sbjct: 357  KDHDADSLPSPTREAPSCFPVNKLLSTGESLVIKSGSPTP-ITVAEKMELGSEDSKLHHY 415

Query: 1566 ETDALRAVSTYQQKFGRSSFFTNDELPSPTPSGDCEEGVVDTNDEVXXXXXXXXXXXXKP 1745
            ETDA +AVSTYQQKFGRSS FTND+LPSPTPSGDCE+ VVDTN+E+            KP
Sbjct: 416  ETDAFKAVSTYQQKFGRSSLFTNDKLPSPTPSGDCEDEVVDTNEEISSASTSGFLTSTKP 475

Query: 1746 TLLDQIPVXXXXXXXXXXVHGLVNSRIDAAGSGSYAGKTSAKSRDPRLRFINSDASTLDL 1925
             LLD   V          + GL++SR+D AG GSY  K+S+KSRDPRLRFINSDA+ +D 
Sbjct: 476  ILLDHQAVSATSMDRSG-LQGLISSRVDTAGHGSYPVKSSSKSRDPRLRFINSDATAVD- 533

Query: 1926 NQPSGTHNMPKVEYGGTITSRKQKTVEEPSLDATVTKKLRRSLENTEHSMREARTMAGNG 2105
            NQ +   NMPKVEY G I SRKQK  EEPSLD TV+K+ + SLEN EH+M E RT AG+G
Sbjct: 534  NQSTIIPNMPKVEYAGAIISRKQKAAEEPSLDVTVSKRQKSSLENAEHNMSEVRTAAGSG 593

Query: 2106 GWLEDTTLAGSQLIERNHLMQKGETE----LNTTFSTSSG--NLNVTSNGNEQAPVTSST 2267
            GWLE+ T  G+QLIERNHLM K   E    LNT  S+  G  N N TS  NEQAPVTS+ 
Sbjct: 594  GWLEEITGPGAQLIERNHLMDKFGPEPRKTLNTVSSSCDGSANFNATSIRNEQAPVTSNN 653

Query: 2268 -TASLPDLLKGIAVNPTMLLNILMEQQRLAAEAKKNSADSASSTLHLRSSNSAKGADTTV 2444
             TASLP LLK IAVNPTMLLNIL EQQ   AEAKK SADSA++ LHL S NS  G D+TV
Sbjct: 654  VTASLPALLKDIAVNPTMLLNILREQQLRLAEAKKKSADSATNVLHLTSLNSTMGTDSTV 713

Query: 2445 NIGPAMTAGIPQNSVGMPPVSSQAASMAQRIQEDSGKIRMKPRDPRRILHGSGALQKSGK 2624
            +IG +MT G+  +SVGM PVSSQ+ S A  +Q+DSGKIRMKPRDPRRILH + A QKSG 
Sbjct: 714  SIGSSMTTGVLHSSVGMHPVSSQSTSTAHSLQDDSGKIRMKPRDPRRILHSNNATQKSGC 773

Query: 2625 LGSEQFKAIVSPMSNNQGARDNVNAQKSEVGVGTKLVPTKSIAPPDITRQFTSNLKNIAD 2804
            LG+EQFK  VSP SNNQG  DN+   K E  V TKLV ++S A PDI RQFT NLKNIAD
Sbjct: 774  LGNEQFKVTVSPASNNQGTGDNIKVPKLEGRVDTKLVTSQSSAAPDIARQFTKNLKNIAD 833

Query: 2805 LMSVPQESSDHSPATQNVSSASVPFTLDKAEQ-----SSQNLQAVTGLAPETCASGSSRS 2969
            +MSV QESS+HSPA QN SSASVP TLD+ EQ     +SQNLQA  G A ET ASG+SRS
Sbjct: 834  IMSVSQESSNHSPAAQNFSSASVPLTLDRGEQKPVVSNSQNLQAGVGSANETGASGASRS 893

Query: 2970 QSTWADVEHLFEGYNEQQKAAIQRERARRLEEQNKMFSARKXXXXXXXXXXXXNSAKFVE 3149
            QSTW DVEHLF+GY+EQQKAAIQRERARR++EQNKMF+ARK            NSAKFVE
Sbjct: 894  QSTWGDVEHLFDGYDEQQKAAIQRERARRIDEQNKMFAARKLCLVLDLDHTLLNSAKFVE 953

Query: 3150 VDPVYDEILRKKEQEDQEMPQRHLFRFSHMGMWTKLRPGVWNFLEKASKLYELHLYTMGN 3329
            VDPV++EILRKKE++D+E P RHLFRF HMGMWTKLRPG+WNFLEKASKLYELHLYTMGN
Sbjct: 954  VDPVHEEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGN 1013

Query: 3330 KAYATQMAKVLDPKGLLFAGRVISRGDDTESIDGDERAPRNKDLEGVLGMEXXXXXXXXX 3509
            K YAT+MAKVLDPKG+LFAGRVISRGDDT+S DGDER P++KDLEGVLGME         
Sbjct: 1014 KLYATEMAKVLDPKGVLFAGRVISRGDDTDSGDGDERVPKSKDLEGVLGMESSVVIIDDS 1073

Query: 3510 XRVWPHNRPNLIVVERYLYFPSSRRQFGLTGQSLLEVDRDEVPEAGTLAVCLGVIEKLHQ 3689
             RVWPHN+ NLIVVERY YFP SRRQFGL G SLLE+D DE PEAGTLA  L VIE++HQ
Sbjct: 1074 VRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEAGTLASSLAVIERIHQ 1133

Query: 3690 TFFASQSLEEADVRNILATEQRKILAGCRIVFSRMFPVGETNPHLHPLWQTAEQFGAVCT 3869
             FFASQSLE+ DVRNILA+EQRKILAGCRIVFSR+FP    NPHLHPLWQTAEQFGAVCT
Sbjct: 1134 NFFASQSLEDVDVRNILASEQRKILAGCRIVFSRVFPTSLQNPHLHPLWQTAEQFGAVCT 1193

Query: 3870 NQIDEQVTHVVASSPGTDKVNWAFSTGRFVVLPGWVEASALLYRRLNEQDFAIKP 4034
              ID+QVTHVVA   GTDKVNWA STGRFVV PGWVEASALLYRR NEQDFAIKP
Sbjct: 1194 ANIDDQVTHVVAHCLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIKP 1248


>ref|XP_007139315.1| hypothetical protein PHAVU_008G019000g [Phaseolus vulgaris]
 gb|ESW11309.1| hypothetical protein PHAVU_008G019000g [Phaseolus vulgaris]
          Length = 1272

 Score = 1413 bits (3658), Expect = 0.0
 Identities = 791/1280 (61%), Positives = 908/1280 (70%), Gaps = 44/1280 (3%)
 Frame = +3

Query: 327  MVFGSFLDCSKLKNRXXXXXXXXXXXXXXISDTASVEEISADDFNKQXXXXXXXXXXXXX 506
            MVFGS LDC  L                 ISDTASVEEIS  DFNKQ             
Sbjct: 1    MVFGSLLDCENLGKLEKMGKEVEDVEEGEISDTASVEEISEADFNKQDVKVNNNNKPNGS 60

Query: 507  XXXXRVWAVHDLYSKYPTISRGYASGLYNLAWAQAVQNKPLNDIFIMEXXXXXXXXXXXX 686
                RVW+V D+Y+KYPTI RGYASGLYNLAWAQAVQNKPLNDIF+ME            
Sbjct: 61   DA--RVWSVRDIYTKYPTICRGYASGLYNLAWAQAVQNKPLNDIFVMELDSEANANSNSN 118

Query: 687  XXXXXXXXXQXXXXXXXXXXXXXXXXXXXXXXXXXXXXMAGGD---ASETVSD----SEL 845
                                                   A  +   A+  VS+    SE 
Sbjct: 119  NSNRPSSVSVNPKEVMVVDVDREEGELEEGEIDADADPEAEAESVVAASVVSETVSDSEQ 178

Query: 846  LGVK------------DVLEGVTVANVAESFAETCTRIQSAVQSKVFTGPADSEKDDLVR 989
             GVK            DVLEGVTVANVAESFA+T +R+ +A+  +VF+ PADSEKDDL+R
Sbjct: 179  FGVKKGVSDSEQLGVRDVLEGVTVANVAESFAQTSSRLLNAL-PQVFSRPADSEKDDLIR 237

Query: 990  LSFNAIEVVYSVFCSMYNLQKEENKDNILRLLSFLKD-EGTHLFSPEHMKEIEVMITAIN 1166
            LSFNAIEVVYSVF SM +  KE+NK++ILRLLS  KD +   LFSPEH+KEI+ M+TAI+
Sbjct: 238  LSFNAIEVVYSVFRSMDSSDKEQNKNSILRLLSSAKDKKQAQLFSPEHIKEIQDMMTAID 297

Query: 1167 SVGSLGSSEAVGKEENLETHETKTWEISAVK--------------YGGELISFSKPGNSN 1304
            SVG+LGS+EA+  E  L+T E K+ E SA++                 EL+S  KP +S+
Sbjct: 298  SVGALGSNEAIYMETELQTPEIKSQENSALEVQTRGIKIQENQAVVATELVSSIKPLHSD 357

Query: 1305 SIEASEASKSGQSIIKGRGVXXXXXXXXXXXXXXXXXXPTREAPSCFPVKKSLSVGEGMD 1484
             I AS A K GQ+ IKGRGV                  PTREAPSCFPV K LSVGE M 
Sbjct: 358  IIGASRALKFGQNSIKGRGVLLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGEVMV 417

Query: 1485 KSGLPLAGKTGCGKMELDGEGSKFHLYETDALRAVSTYQQKFGRSSFFTNDELPSPTPSG 1664
            KSG   A K   GK+E+D EGSKFHLYETDAL+AVSTYQQKFGRSS FTND+LPSPTPSG
Sbjct: 418  KSG-SAAAKMQPGKLEVDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKLPSPTPSG 476

Query: 1665 DCEEGVVDTNDEVXXXXXXXXXXXXKPTLLDQIPVXXXXXXXXXXVHGLVNSRIDAAGSG 1844
            DC++  VDTN+EV            KPTLLDQ PV          + GL++SR+DAAGSG
Sbjct: 477  DCDDMAVDTNEEVSSASTSGFLTSTKPTLLDQPPVSATSVDKSRLL-GLISSRVDAAGSG 535

Query: 1845 SYAGKTSAKSRDPRLRFINSDASTLDLNQPSGTHNMPKVEYGGTITSRKQKTVEEPSLDA 2024
            S+  K+SAKSRDPR R INS+AS +D NQ + THNMPKVEY G+  SRKQK VEEPS D 
Sbjct: 536  SFPVKSSAKSRDPRRRLINSEASAVD-NQFTVTHNMPKVEYAGSTISRKQKAVEEPSFDL 594

Query: 2025 TVTKKLRRSLENTEHSMREARTMAGNGGWLEDTTLAGSQLIERNHLMQKGETE----LNT 2192
            TV+K+L+ SLEN EH+  E RT+AG+GGWLED T  G+QLIE+NHL+ K   E    LNT
Sbjct: 595  TVSKRLKSSLENIEHNTSEVRTIAGSGGWLEDITGPGTQLIEKNHLIDKFAPEPKRTLNT 654

Query: 2193 TFSTSSGNLNVTSNGNEQAPVTSSTT-ASLPDLLKGIAVNPTMLLNILMEQQRLAAEAKK 2369
              S+ S N N TS  NEQAP+TS+   +SLP + K I VNPTMLL++LMEQ+RL  +A+ 
Sbjct: 655  VSSSGSVNFNATSIRNEQAPITSNNVPSSLPAIFKDIVVNPTMLLSLLMEQKRL-VDAQN 713

Query: 2370 NSADSASSTLHLRSSNSAKGADTTVNIGPAMTAGIPQNSVGMPPVSSQAASMAQRIQEDS 2549
            NSADSA++ LH  SSNSA G D+T +I  +M  G+ Q SVGM PVSSQ+ S AQ   + S
Sbjct: 714  NSADSATNMLHPTSSNSAMGTDSTASIVSSMATGL-QTSVGMLPVSSQSTSTAQLQDDYS 772

Query: 2550 GKIRMKPRDPRRILHGSGALQKSGKLGSEQFKAIVSPMSNNQGARDNVNAQKSEVGVGTK 2729
            GKIRMKPRDPRRILH + ++QKSG + +E  KAIVSP+SN     D+VNAQK E  + TK
Sbjct: 773  GKIRMKPRDPRRILHTNNSVQKSGNIVNELHKAIVSPVSNILVTGDSVNAQKLEGRMDTK 832

Query: 2730 LVPTKSIAPPDITRQFTSNLKNIADLMSVPQESSDHSPATQNVSSASVPFTLDKAEQ--- 2900
            LVPT+S A PDITRQFT NLKNIAD+MSV QESS HSPA Q  SSASVP  +D+ EQ   
Sbjct: 833  LVPTQSGAAPDITRQFTRNLKNIADIMSVSQESSTHSPAAQGFSSASVPLNVDRGEQKSV 892

Query: 2901 --SSQNLQAVTGLAPETCASGSSRSQSTWADVEHLFEGYNEQQKAAIQRERARRLEEQNK 3074
              +SQNL A TG APE CA G+SRSQSTW DVEHLFEGY+EQQKAAIQRERARR+EEQNK
Sbjct: 893  LSNSQNLHAGTGSAPEICAPGTSRSQSTWGDVEHLFEGYDEQQKAAIQRERARRIEEQNK 952

Query: 3075 MFSARKXXXXXXXXXXXXNSAKFVEVDPVYDEILRKKEQEDQEMPQRHLFRFSHMGMWTK 3254
            MF+ARK            NSAKFVEVDPV++EILRKKE+ D+E P RHLFRF HMGMWTK
Sbjct: 953  MFAARKLCLVLDLDHTLLNSAKFVEVDPVHEEILRKKEELDREKPHRHLFRFPHMGMWTK 1012

Query: 3255 LRPGVWNFLEKASKLYELHLYTMGNKAYATQMAKVLDPKGLLFAGRVISRGDDTESIDGD 3434
            LRPG+WNFLEKASKLYELHLYTMGNK YAT+MAKVLDPKG+LFAGRVISRGDDT+S+DG+
Sbjct: 1013 LRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDTDSVDGE 1072

Query: 3435 ERAPRNKDLEGVLGMEXXXXXXXXXXRVWPHNRPNLIVVERYLYFPSSRRQFGLTGQSLL 3614
            ERAP++KDLEGVLGME          RVWPHN+ NLIVVERY YFP SRRQFGL G SLL
Sbjct: 1073 ERAPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLL 1132

Query: 3615 EVDRDEVPEAGTLAVCLGVIEKLHQTFFASQSLEEADVRNILATEQRKILAGCRIVFSRM 3794
            E+D DE PEAGTLA  L VIE+LHQ FF+SQSLEE DVRNILA+EQRKIL+GCRIVFSR+
Sbjct: 1133 EIDHDERPEAGTLASSLAVIERLHQNFFSSQSLEEVDVRNILASEQRKILSGCRIVFSRV 1192

Query: 3795 FPVGETNPHLHPLWQTAEQFGAVCTNQIDEQVTHVVASSPGTDKVNWAFSTGRFVVLPGW 3974
            FPVGE NPHLHPLWQTAEQFGAVCTNQID+QVTHVVA+S GTDKVNWA STGRFVV PGW
Sbjct: 1193 FPVGEANPHLHPLWQTAEQFGAVCTNQIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGW 1252

Query: 3975 VEASALLYRRLNEQDFAIKP 4034
            VEASALLYRR NEQDFAIKP
Sbjct: 1253 VEASALLYRRANEQDFAIKP 1272


>ref|XP_003621644.2| carboxy-terminal domain phosphatase-like protein, putative [Medicago
            truncatula]
 gb|AES77862.2| carboxy-terminal domain phosphatase-like protein, putative [Medicago
            truncatula]
          Length = 1209

 Score = 1383 bits (3579), Expect = 0.0
 Identities = 771/1268 (60%), Positives = 874/1268 (68%), Gaps = 30/1268 (2%)
 Frame = +3

Query: 327  MVFGSFLDCSKLKNRXXXXXXXXXXXXXXISDTASVEEISADDFNK-------------- 464
            MVFGS LD    KN               ISD+AS+EEI+ +DF K              
Sbjct: 1    MVFGSLLDFEISKNLIEMGKEVEDVEEGEISDSASLEEITEEDFKKGDDVKVNNSDVKTD 60

Query: 465  QXXXXXXXXXXXXXXXXXRVWAVHDLYSKYPTISRGYASGLYNLAWAQAVQNKPLNDIFI 644
            +                 RVWAV DLYSKYPTI RGYASGLYNLAWAQAVQNKPLNDIF+
Sbjct: 61   KSDNKVKTGGGGGGGGDSRVWAVQDLYSKYPTICRGYASGLYNLAWAQAVQNKPLNDIFV 120

Query: 645  MEXXXXXXXXXXXXXXXXXXXXXQXXXXXXXXXXXXXXXXXXXXXXXXXXXXMAGGDASE 824
            ME                     +                               GDA +
Sbjct: 121  ME-------LDKNANANSNNSGNKDGELNKSSKEIVVVDDDDEKEEGELEEGEIDGDADD 173

Query: 825  --TVSDSELLGVKDVL------EGVTVANVAESFAETCTRIQSAVQSKVFTGPADSEKDD 980
               +  SE     +VL      EGVTVA+VAESFAETC RIQ  +QSKVF+G   +EKDD
Sbjct: 174  DCVIVGSENFSNSEVLGVRGVLEGVTVASVAESFAETCRRIQGTLQSKVFSGFDSAEKDD 233

Query: 981  LVRLSFNAIEVVYSVFCSMYNLQKEENKDNILRLLSFLKDEGTHLFSPEHMKEIEVMITA 1160
            LVRL FNA+EVVYSVFC M NLQKEENKDNI RLLSFLK++  HLF+ EHMK+I+VMIT 
Sbjct: 234  LVRLLFNAVEVVYSVFCCMDNLQKEENKDNISRLLSFLKNQ--HLFTMEHMKKIQVMITV 291

Query: 1161 INSVGSLGSSEAVGKEENLETHETKTWEISAVKYGGELISFSKPGNSNSIEASEASKSGQ 1340
            I+SV +LG++E VGKEE +E   T T +I  +K   E IS S+  + NS  ASEA + GQ
Sbjct: 292  IDSVFALGNNEVVGKEEKVEALNT-TEQIPGLK-ADEYISSSQLVHDNSTYASEALQYGQ 349

Query: 1341 SIIKGRGVXXXXXXXXXXXXXXXXXXPTREAPSCFPVKKSLS-VGEGMDKSGLPLAGKTG 1517
            S + GRG+                  PTREAPSCFPV K  S +G+G+D+ GLP A  T 
Sbjct: 350  SNVVGRGLMLPLFDLHKDHDLDSLPSPTREAPSCFPVNKLFSDLGDGIDRFGLPPAVCTE 409

Query: 1518 CGKMELDGEGSKFHLYETDALRAVSTYQQKFGRSSFFTNDELPSPTPSGDCEEGVVDTND 1697
              KMELDG+ SK H+YETDAL+AVSTYQQKF RSS+FT+D+ PSPTPSGDCE   VDTND
Sbjct: 410  AEKMELDGKDSKLHIYETDALKAVSTYQQKFSRSSYFTDDKFPSPTPSGDCEGEAVDTND 469

Query: 1698 EVXXXXXXXXXXXXKPTLLDQIPVXXXXXXXXXXVHGLVNSRIDAAGSGSYAGKTSAKSR 1877
            EV            KP  LDQIPV          +HGLV+SRIDA GSGSY  K+SAKSR
Sbjct: 470  EVSSASIASSLTSFKPPPLDQIPV-SSTSLDRPNMHGLVDSRIDATGSGSYPAKSSAKSR 528

Query: 1878 DPRLRFINSDASTLDLNQPSGTHNMPKVEYGGTITSRKQKTVEEPSLDATVTKKLRRSLE 2057
            DPRLRFIN DASTLDLNQ  GTH+MP+VEYGG + SRKQKTVEEPSLDAT  K+LRRSLE
Sbjct: 529  DPRLRFINPDASTLDLNQSLGTHSMPRVEYGGRVISRKQKTVEEPSLDATAPKRLRRSLE 588

Query: 2058 NTEHSMREARTMAGNGGWLEDTTLAGSQLIERNHLMQKGETELNTTFSTSSGNLNVTSNG 2237
            N+EH+ RE R MAG GGW E+ T+AGSQL ERNHLMQKGETEL  T STSS NL V++NG
Sbjct: 589  NSEHNTREERAMAGKGGWFEENTVAGSQLAERNHLMQKGETELKRTISTSSSNLTVSNNG 648

Query: 2238 NEQAPVTSST-TASLPD-LLKGIAVNPTMLLNILMEQQRLAAEAKKNSADSASSTLHLRS 2411
            NE A VTSS+ TASLP  LL  +AVNP ML+++++E Q   AEA+K   D          
Sbjct: 649  NELASVTSSSATASLPTYLLNNVAVNPAMLIHMILEHQHNEAEAQKKPVD---------- 698

Query: 2412 SNSAKGADTTVNIGPAMTAGIPQNSVGMPPVSSQAASMAQRIQEDSGKIRMKPRDPRRIL 2591
              SA+G D TVN GPAMTAG+ Q+SVG+ P SS A SM Q + EDSGKIRMKPRDPRR L
Sbjct: 699  --SARGTDATVNTGPAMTAGLTQSSVGILPASSPATSMTQTLPEDSGKIRMKPRDPRRFL 756

Query: 2592 HGSGALQKSGKLGSEQFKAIVSPMSNNQGARDNVNAQKSEVGVGTKLVPTKSIAPPDITR 2771
            HGS  L                              QK +V V TKL P +SIA PDITR
Sbjct: 757  HGSSTL------------------------------QKFDVRVETKLAPIQSIAQPDITR 786

Query: 2772 QFTSNLKNIADLMSVPQESSDHSPATQNVSSASVPFTLDKAEQ-----SSQNLQAVTGLA 2936
            QFT NLKNIAD+MSVPQE+S + PATQNVSSASVPF  D++EQ     +SQNL+   G A
Sbjct: 787  QFTKNLKNIADIMSVPQETSSNPPATQNVSSASVPFMSDRSEQKSGVPNSQNLKDGVGSA 846

Query: 2937 PETCASGSSRSQSTWADVEHLFEGYNEQQKAAIQRERARRLEEQNKMFSARKXXXXXXXX 3116
            PETCA GSSR Q+TWADVEHLFE Y+ +QKAAIQRER+RRLEEQ KMF+ARK        
Sbjct: 847  PETCAPGSSRPQNTWADVEHLFEAYDVKQKAAIQRERSRRLEEQKKMFAARKLCLVLDLD 906

Query: 3117 XXXXNSAKFVEVDPVYDEILRKKEQEDQEMPQRHLFRFSHMGMWTKLRPGVWNFLEKASK 3296
                NSAKFVEVDPV+DE+LRKKEQED+E PQRHLFRF HMGMWTKLRPGVWNFLEKA K
Sbjct: 907  HTLLNSAKFVEVDPVHDEMLRKKEQEDREKPQRHLFRFPHMGMWTKLRPGVWNFLEKAGK 966

Query: 3297 LYELHLYTMGNKAYATQMAKVLDPKGLLFAGRVISRGDDTESIDGDERAPRNKDLEGVLG 3476
            L+E+HLYTMGNK YAT+MAKVLDPKG+LFAGRVISRGDD E+ D      ++KDLEGVLG
Sbjct: 967  LFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDAETAD-----TKSKDLEGVLG 1021

Query: 3477 MEXXXXXXXXXXRVWPHNRPNLIVVERYLYFPSSRRQFGLTGQSLLEVDRDEVPEAGTLA 3656
            ME          RVWPHN+ NLIVVERY YFP SRRQFGL G SLLE+D DE PE+GTLA
Sbjct: 1022 MESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPESGTLA 1081

Query: 3657 VCLGVIEKLHQTFFASQSLEEADVRNILATEQRKILAGCRIVFSRMFPVGETNPHLHPLW 3836
              LGVIE++HQ FFASQSLEE DVRNILA+EQRKIL GCRIVFSRMFPVG+ NPHLHPLW
Sbjct: 1082 SSLGVIERIHQNFFASQSLEEVDVRNILASEQRKILDGCRIVFSRMFPVGDANPHLHPLW 1141

Query: 3837 QTAEQFGAVCTNQIDEQVTHVVASSPGTDKVNWAFSTGRFVVLPGWVEASALLYRRLNEQ 4016
            QTAEQFGA CTNQID+QVTHVVA SPGTDKVNWA + G+FVV PGWVEASALLYRR NEQ
Sbjct: 1142 QTAEQFGASCTNQIDDQVTHVVAHSPGTDKVNWAIANGKFVVHPGWVEASALLYRRANEQ 1201

Query: 4017 DFAIKPEK 4040
            DFAIK +K
Sbjct: 1202 DFAIKLDK 1209


>ref|XP_006603006.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            isoform X2 [Glycine max]
 gb|KRH01515.1| hypothetical protein GLYMA_18G282300 [Glycine max]
 gb|KRH01516.1| hypothetical protein GLYMA_18G282300 [Glycine max]
          Length = 1257

 Score = 1365 bits (3532), Expect = 0.0
 Identities = 737/1092 (67%), Positives = 838/1092 (76%), Gaps = 18/1092 (1%)
 Frame = +3

Query: 813  DASETVSDSELLGVKDVLEGVTVANVAESFAETCTRIQSAVQSKVFTGPADSEKDDLVRL 992
            D    VS+SE LGV+ VLEGVTVANVAESFA+TC+++Q+A+  +V + PADSE+DDLVRL
Sbjct: 178  DVKRDVSNSEQLGVRGVLEGVTVANVAESFAQTCSKLQNALP-EVLSRPADSERDDLVRL 236

Query: 993  SFNAIEVVYSVFCSMYNLQKEENKDNILRLLSFLKDEG-THLFSPEHMKEIEVMITAINS 1169
            SFNA EVVYSVFCSM +L+KE+NKD+ILRLLSF+KD+    LFSPEH+KEI+ M+TAI+ 
Sbjct: 237  SFNATEVVYSVFCSMDSLKKEQNKDSILRLLSFVKDQQQAQLFSPEHIKEIQGMMTAIDY 296

Query: 1170 VGSLGSSEAVGKEENLET----HETKTWEISAVKYGGELISFSKPGNSNSIEASEASKSG 1337
             G+L +SEA+GKE+ L+T    HE KT E  AV+   ELIS++KP +S+ I AS A K G
Sbjct: 297  FGALVNSEAIGKEKELQTTVQTHEIKTQENQAVE-AAELISYNKPLHSDIIGASHALKFG 355

Query: 1338 QSIIKGRGVXXXXXXXXXXXXXXXXXXPTREAPSCFPVKKSLSVGEGMDKSGLPLAGKTG 1517
            Q+ IKGRGV                  PTREAPSCFPV K LSVGE M  SG   A K  
Sbjct: 356  QNSIKGRGVLLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGEPMVSSG-SAAAKPE 414

Query: 1518 CGKMELDGEGSKFHLYETDALRAVSTYQQKFGRSSFFTNDELPSPTPSGDCEEGVVDTND 1697
             GKMELD EGSKFHLYETDAL+AVSTYQQKFGRSS FTND+ PSPTPSGDCE+ +VDTN+
Sbjct: 415  SGKMELDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPSGDCEDEIVDTNE 474

Query: 1698 EVXXXXXXXXXXXXKPTLLDQIPVXXXXXXXXXXVHGLVNSRIDAAGSGSYAGKTSAKSR 1877
            EV            KPTLLD  PV          +HG ++SR+DAAG GS   K+SAK+R
Sbjct: 475  EVSSASTGDFLTSTKPTLLDLPPVSATSTDRSS-LHGFISSRVDAAGPGSLPVKSSAKNR 533

Query: 1878 DPRLRFINSDASTLDLNQPSGTHNMPKVEYGGTITSRKQKTVEEPSLDATVTKKLRRSLE 2057
            DPRLRF+NSDAS +D N  +  HNMPKVEY GT  SRKQK  EEPSLD TV+K+ +  LE
Sbjct: 534  DPRLRFVNSDASAVD-NPSTLIHNMPKVEYAGTTISRKQKAAEEPSLDVTVSKRQKSPLE 592

Query: 2058 NTEHSMREARTMAGNGGWLEDTTLAGSQLIERNHLMQKGETE----LNTTFS--TSSGNL 2219
            NTEH+M E RT  G GGWLE+ T  G+Q IERNHLM K   E    LNT  S  T S N 
Sbjct: 593  NTEHNMSEVRT--GIGGWLEEHTGPGAQFIERNHLMDKFGPEPQKTLNTVSSSCTGSDNF 650

Query: 2220 NVTSNGNEQAPVTSSTT-ASLPDLLKGIAVNPTMLLNILMEQQRLAAEAKKNSADSASST 2396
            N TS  NEQAP+TSS   ASLP LLKG AVNPTML+N+L       AEA+K SADSA++ 
Sbjct: 651  NATSIRNEQAPITSSNVLASLPALLKGAAVNPTMLVNLLR-----IAEAQKKSADSATNM 705

Query: 2397 L-HLRSSNSAKGADTTVNIGPAMTAGIPQNSVGMPPVSSQAASMAQRIQEDSGKIRMKPR 2573
            L H  SSNSA G D+T +IG +M  G+ Q+SVGM PVSSQ+ SM Q +Q+DSGKIRMKPR
Sbjct: 706  LLHPTSSNSAMGTDSTASIGSSMATGLLQSSVGMLPVSSQSTSMTQTLQDDSGKIRMKPR 765

Query: 2574 DPRRILHGSGALQKSGKLGSEQFKAIVSPMSNNQGARDNVNAQKSEVGVGTKLVPTKSIA 2753
            DPRRILH +  +QKSG LG+EQFKAIVSP+SNNQG  DNVNAQK E  V +KLVPT+  A
Sbjct: 766  DPRRILHTNNTIQKSGNLGNEQFKAIVSPVSNNQGTGDNVNAQKLEGRVDSKLVPTQPSA 825

Query: 2754 PPDITRQFTSNLKNIADLMSVPQESSDHSPATQNVSSASVPFTLDKAEQ-----SSQNLQ 2918
             PDI RQF  NLKNIAD+MSV QESS H+P  Q  SSASVP T D+ EQ     +SQNL+
Sbjct: 826  QPDIARQFARNLKNIADIMSVSQESSTHTPVAQIFSSASVPLTSDRGEQKSVVSNSQNLE 885

Query: 2919 AVTGLAPETCASGSSRSQSTWADVEHLFEGYNEQQKAAIQRERARRLEEQNKMFSARKXX 3098
            A    A ET ASG+ RSQ+TW DVEHLFEGY+EQQKAAIQRERARR+EEQNKMF+ARK  
Sbjct: 886  AGMVSAHETAASGTCRSQNTWGDVEHLFEGYDEQQKAAIQRERARRIEEQNKMFAARKLC 945

Query: 3099 XXXXXXXXXXNSAKFVEVDPVYDEILRKKEQEDQEMPQRHLFRFSHMGMWTKLRPGVWNF 3278
                      NSAKFVEVDPV+DEILRKKE++D+E P RHLFRF HMGMWTKLRPG+WNF
Sbjct: 946  LVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWNF 1005

Query: 3279 LEKASKLYELHLYTMGNKAYATQMAKVLDPKGLLFAGRVISRGDDTESIDGDERAPRNKD 3458
            LEKASKLYELHLYTMGNK YAT+MAKVLDPKGLLFAGRVISRGDDT+S+DG+ERAP++KD
Sbjct: 1006 LEKASKLYELHLYTMGNKLYATEMAKVLDPKGLLFAGRVISRGDDTDSVDGEERAPKSKD 1065

Query: 3459 LEGVLGMEXXXXXXXXXXRVWPHNRPNLIVVERYLYFPSSRRQFGLTGQSLLEVDRDEVP 3638
            LEGVLGME          RVWPHN+ NLIVVERY YFP SRRQFGL G SLLE+D DE P
Sbjct: 1066 LEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERP 1125

Query: 3639 EAGTLAVCLGVIEKLHQTFFASQSLEEADVRNILATEQRKILAGCRIVFSRMFPVGETNP 3818
            EAGTLA  L VIEK+HQ FFAS+SLEE DVRNILA+EQRKILAGCRIVFSR+FPVGE NP
Sbjct: 1126 EAGTLASSLAVIEKIHQIFFASRSLEEVDVRNILASEQRKILAGCRIVFSRVFPVGEANP 1185

Query: 3819 HLHPLWQTAEQFGAVCTNQIDEQVTHVVASSPGTDKVNWAFSTGRFVVLPGWVEASALLY 3998
            HLHPLWQTAEQFGA CTNQIDEQVTHVVA+SPGTDKVNWA + GRFVV PGWVEASALLY
Sbjct: 1186 HLHPLWQTAEQFGAFCTNQIDEQVTHVVANSPGTDKVNWALNNGRFVVHPGWVEASALLY 1245

Query: 3999 RRLNEQDFAIKP 4034
            RR NEQDFAIKP
Sbjct: 1246 RRANEQDFAIKP 1257



 Score =  127 bits (320), Expect = 9e-26
 Identities = 68/108 (62%), Positives = 71/108 (65%)
 Frame = +3

Query: 327 MVFGSFLDCSKLKNRXXXXXXXXXXXXXXISDTASVEEISADDFNKQXXXXXXXXXXXXX 506
           M+FGS LDC KL                 ISDTASVEEISA+DFNKQ             
Sbjct: 1   MIFGSLLDCEKLGKLEKMGKEVEDVEEGEISDTASVEEISAEDFNKQDVKVLNNNNKPNG 60

Query: 507 XXXXRVWAVHDLYSKYPTISRGYASGLYNLAWAQAVQNKPLNDIFIME 650
               RVWAVHDLYSKYPTI RGYASGLYNLAWAQAVQNKPLNDIF+ME
Sbjct: 61  SDA-RVWAVHDLYSKYPTICRGYASGLYNLAWAQAVQNKPLNDIFVME 107


>gb|KHN13828.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Glycine soja]
          Length = 1251

 Score = 1363 bits (3527), Expect = 0.0
 Identities = 737/1102 (66%), Positives = 838/1102 (76%), Gaps = 28/1102 (2%)
 Frame = +3

Query: 813  DASETVSDSELLGVKDVLEGVTVANVAESFAETCTRIQSAVQSKVFTGPADSEKDDLVRL 992
            D    VS+SE LGV+ VLEGVTVANVAESFA+TC+++Q+A+  +V + PADSE+DDLVRL
Sbjct: 161  DVKRDVSNSEQLGVRGVLEGVTVANVAESFAQTCSKLQNALP-EVLSRPADSERDDLVRL 219

Query: 993  SFNAIEVVYSVFCSMYNLQKEENKDNILRLLSFLKDEG-THLFSPEHMKEIEVMITAINS 1169
            SFNA EVVYSVFCSM +L+KE+NKD+ILRLLSF+KD+    LFSPEH+KEI+ M+TAI+ 
Sbjct: 220  SFNATEVVYSVFCSMDSLKKEQNKDSILRLLSFVKDQQQAQLFSPEHIKEIQGMMTAIDY 279

Query: 1170 VGSLGSSEAVGKEENLETHETKTWEISAVKY--------------GGELISFSKPGNSNS 1307
             G+L +SEA+GKE+ L+T E KT E SAV+                 ELIS++KP +S+ 
Sbjct: 280  FGALVNSEAIGKEKELQTTEIKTQENSAVEVQIHEIKTQENQAVEAAELISYNKPLHSDI 339

Query: 1308 IEASEASKSGQSIIKGRGVXXXXXXXXXXXXXXXXXXPTREAPSCFPVKKSLSVGEGMDK 1487
            I AS A K GQ+ IKGRGV                  PTREAPSCFPV K LSVGE M  
Sbjct: 340  IGASHALKFGQNSIKGRGVLLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGEPMVS 399

Query: 1488 SGLPLAGKTGCGKMELDGEGSKFHLYETDALRAVSTYQQKFGRSSFFTNDELPSPTPSGD 1667
            SG   A K   GKMELD EGSKFHLYETDAL+AVSTYQQKFGRSS FTND+ PSPTPSGD
Sbjct: 400  SG-SAAAKPESGKMELDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPSGD 458

Query: 1668 CEEGVVDTNDEVXXXXXXXXXXXXKPTLLDQIPVXXXXXXXXXXVHGLVNSRIDAAGSGS 1847
            CE+ +VDTN+EV            KPTLLD  PV          +HG ++SR+DAAG GS
Sbjct: 459  CEDEIVDTNEEVSSASTGDFLTSTKPTLLDLPPVSATSTDRSS-LHGFISSRVDAAGPGS 517

Query: 1848 YAGKTSAKSRDPRLRFINSDASTLDLNQPSGTHNMPKVEYGGTITSRKQKTVEEPSLDAT 2027
               K+SAK+RDPRLRF+NSDAS +D N  +  HNMPKVEY GT  SRKQK  EEPSLD T
Sbjct: 518  LPVKSSAKNRDPRLRFVNSDASAVD-NPSTLIHNMPKVEYAGTTISRKQKAAEEPSLDVT 576

Query: 2028 VTKKLRRSLENTEHSMREARTMAGNGGWLEDTTLAGSQLIERNHLMQKGETE----LNTT 2195
            V+K+ +  LENTEH+M E RT  G GGWLE+ T  G+Q IERNHLM K   E    LNT 
Sbjct: 577  VSKRQKSPLENTEHNMSEVRT--GIGGWLEEHTGPGAQFIERNHLMDKFGPEPQKTLNTV 634

Query: 2196 FS--TSSGNLNVTSNGNEQAPVTSSTT-ASLPDLLKGIAVNPTMLLNILMEQQRLAAEAK 2366
             S  T S N N TS  NEQAP+TSS   ASLP LLKG AVNPTML+N+L       AEA+
Sbjct: 635  SSSCTGSDNFNATSIRNEQAPITSSNVLASLPALLKGAAVNPTMLVNLLR-----IAEAQ 689

Query: 2367 KNSADSASSTL-HLRSSNSAKGADTTVNIGPAMTAGIPQNSVGMPPVSSQAASMAQRIQE 2543
            K SADSA++ L H  SSNSA G D+T +IG +M  G+ Q+SVGM PVSSQ+ SM Q +Q+
Sbjct: 690  KKSADSATNMLLHPTSSNSAMGTDSTASIGSSMATGLLQSSVGMLPVSSQSTSMTQTLQD 749

Query: 2544 DSGKIRMKPRDPRRILHGSGALQKSGKLGSEQFKAIVSPMSNNQGARDNVNAQKSEVGVG 2723
            DSGKIRMKPRDPRRILH +  +QKSG LG+EQFKAIVSP+SNNQG  DNVNAQK E  V 
Sbjct: 750  DSGKIRMKPRDPRRILHTNNTIQKSGNLGNEQFKAIVSPVSNNQGTGDNVNAQKLEGRVD 809

Query: 2724 TKLVPTKSIAPPDITRQFTSNLKNIADLMSVPQESSDHSPATQNVSSASVPFTLDKAEQ- 2900
            +KLVPT+  A PDI RQF  NLKNIAD+MSV QESS H+P  Q  SSASVP T D+ EQ 
Sbjct: 810  SKLVPTQPSAQPDIARQFARNLKNIADIMSVSQESSTHTPVAQIFSSASVPLTSDRGEQK 869

Query: 2901 ----SSQNLQAVTGLAPETCASGSSRSQSTWADVEHLFEGYNEQQKAAIQRERARRLEEQ 3068
                +SQNL+A    A ET ASG+ RSQ+TW DVEHLFEGY+EQQKAAIQRERARR+EEQ
Sbjct: 870  SVVSNSQNLEAGMVSAHETAASGTCRSQNTWGDVEHLFEGYDEQQKAAIQRERARRIEEQ 929

Query: 3069 NKMFSARKXXXXXXXXXXXXNSAKFVEVDPVYDEILRKKEQEDQEMPQRHLFRFSHMGMW 3248
            NKMF+ARK            NSAKFVEVDPV+DEILRKKE++D+E P RHLFRF HMGMW
Sbjct: 930  NKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 989

Query: 3249 TKLRPGVWNFLEKASKLYELHLYTMGNKAYATQMAKVLDPKGLLFAGRVISRGDDTESID 3428
            TKLRPG+WNFLEKASKLYELHLYTMGNK YAT+MAKVLDPKGLLFAGRVISRGDDT+S+D
Sbjct: 990  TKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGLLFAGRVISRGDDTDSVD 1049

Query: 3429 GDERAPRNKDLEGVLGMEXXXXXXXXXXRVWPHNRPNLIVVERYLYFPSSRRQFGLTGQS 3608
            G+ERAP++KDLEGVLGME          RVWPHN+ NLIVVERY YFP SRRQFGL G S
Sbjct: 1050 GEERAPKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPS 1109

Query: 3609 LLEVDRDEVPEAGTLAVCLGVIEKLHQTFFASQSLEEADVRNILATEQRKILAGCRIVFS 3788
            LLE+D DE PEAGTLA  L VIEK+HQ FFAS+SLEE DVRNILA+EQRKILAGCRIVFS
Sbjct: 1110 LLEIDHDERPEAGTLASSLAVIEKIHQIFFASRSLEEVDVRNILASEQRKILAGCRIVFS 1169

Query: 3789 RMFPVGETNPHLHPLWQTAEQFGAVCTNQIDEQVTHVVASSPGTDKVNWAFSTGRFVVLP 3968
            R+FPVGE NPHLHPLWQTAEQFGA CTNQIDEQVTHVVA+SPGTDKVNWA + GRFVV P
Sbjct: 1170 RVFPVGEANPHLHPLWQTAEQFGAFCTNQIDEQVTHVVANSPGTDKVNWALNNGRFVVHP 1229

Query: 3969 GWVEASALLYRRLNEQDFAIKP 4034
            GWVEASALLYRR NEQDFAIKP
Sbjct: 1230 GWVEASALLYRRANEQDFAIKP 1251



 Score =  114 bits (286), Expect = 9e-22
 Identities = 59/79 (74%), Positives = 61/79 (77%)
 Frame = +3

Query: 414 ISDTASVEEISADDFNKQXXXXXXXXXXXXXXXXXRVWAVHDLYSKYPTISRGYASGLYN 593
           ISDTASVEEISA+DFNKQ                 RVWAVHDLYSKYPTI RGYASGLYN
Sbjct: 13  ISDTASVEEISAEDFNKQDVKVLNNNNKPNGSDA-RVWAVHDLYSKYPTICRGYASGLYN 71

Query: 594 LAWAQAVQNKPLNDIFIME 650
           LAWAQAVQNKPLNDIF+ME
Sbjct: 72  LAWAQAVQNKPLNDIFVME 90


>ref|XP_014626407.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            isoform X1 [Glycine max]
 gb|KRH01517.1| hypothetical protein GLYMA_18G282300 [Glycine max]
 gb|KRH01518.1| hypothetical protein GLYMA_18G282300 [Glycine max]
          Length = 1260

 Score = 1359 bits (3518), Expect = 0.0
 Identities = 737/1095 (67%), Positives = 838/1095 (76%), Gaps = 21/1095 (1%)
 Frame = +3

Query: 813  DASETVSDSELLGVKDVLEGVTVANVAESFAETCTRIQSAVQSKVFTGPADSEKDDLVRL 992
            D    VS+SE LGV+ VLEGVTVANVAESFA+TC+++Q+A+  +V + PADSE+DDLVRL
Sbjct: 178  DVKRDVSNSEQLGVRGVLEGVTVANVAESFAQTCSKLQNALP-EVLSRPADSERDDLVRL 236

Query: 993  SFNAIEVVYSVFCSMYNLQKEENKDNILRLLSFLKDEG-THLFSPEHMKEIEVMITAINS 1169
            SFNA EVVYSVFCSM +L+KE+NKD+ILRLLSF+KD+    LFSPEH+KEI+ M+TAI+ 
Sbjct: 237  SFNATEVVYSVFCSMDSLKKEQNKDSILRLLSFVKDQQQAQLFSPEHIKEIQGMMTAIDY 296

Query: 1170 VGSLGSSEAVGKEENLET----HETKTWEISAVKYGGELISFSKPGNSNSIEASEASKSG 1337
             G+L +SEA+GKE+ L+T    HE KT E  AV+   ELIS++KP +S+ I AS A K G
Sbjct: 297  FGALVNSEAIGKEKELQTTVQTHEIKTQENQAVE-AAELISYNKPLHSDIIGASHALKFG 355

Query: 1338 QSIIKGRGVXXXXXXXXXXXXXXXXXXPTREAPSCFPVKKSLSVGEGMDKSGLPLAGKTG 1517
            Q+ IKGRGV                  PTREAPSCFPV K LSVGE M  SG   A K  
Sbjct: 356  QNSIKGRGVLLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGEPMVSSG-SAAAKPE 414

Query: 1518 CGKMELDGEGSKFHLYETDALRAVSTYQQKFGRSSFFTNDELPSPTPSGDCEEGVVDTND 1697
             GKMELD EGSKFHLYETDAL+AVSTYQQKFGRSS FTND+ PSPTPSGDCE+ +VDTN+
Sbjct: 415  SGKMELDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPSGDCEDEIVDTNE 474

Query: 1698 EVXXXXXXXXXXXXKPTLLDQIPVXXXXXXXXXXVHGLVNSRIDAAGSGSYAGKTSAKSR 1877
            EV            KPTLLD  PV          +HG ++SR+DAAG GS   K+SAK+R
Sbjct: 475  EVSSASTGDFLTSTKPTLLDLPPVSATSTDRSS-LHGFISSRVDAAGPGSLPVKSSAKNR 533

Query: 1878 DPRLRFINSDASTLDLNQPSGTHNMPKVEYGGTITSRKQKTVEEPSLDATVTKKLRRSLE 2057
            DPRLRF+NSDAS +D N  +  HNMPKVEY GT  SRKQK  EEPSLD TV+K+ +  LE
Sbjct: 534  DPRLRFVNSDASAVD-NPSTLIHNMPKVEYAGTTISRKQKAAEEPSLDVTVSKRQKSPLE 592

Query: 2058 NTEHSMREARTMAGNGGWLEDTTLAGSQLIERNHLMQKGETE----LNTTFS--TSSGNL 2219
            NTEH+M E RT  G GGWLE+ T  G+Q IERNHLM K   E    LNT  S  T S N 
Sbjct: 593  NTEHNMSEVRT--GIGGWLEEHTGPGAQFIERNHLMDKFGPEPQKTLNTVSSSCTGSDNF 650

Query: 2220 NVTSNGNEQAPVTSSTT-ASLPDLLKGIAVNPTMLLNILMEQQRLAAEAKKNSADSASST 2396
            N TS  NEQAP+TSS   ASLP LLKG AVNPTML+N+L       AEA+K SADSA++ 
Sbjct: 651  NATSIRNEQAPITSSNVLASLPALLKGAAVNPTMLVNLLR-----IAEAQKKSADSATNM 705

Query: 2397 L-HLRSSNSAKGADTTVNIGPAMTAGIPQNSVGMPPVSSQAASMAQRIQEDSGKIRMKPR 2573
            L H  SSNSA G D+T +IG +M  G+ Q+SVGM PVSSQ+ SM Q +Q+DSGKIRMKPR
Sbjct: 706  LLHPTSSNSAMGTDSTASIGSSMATGLLQSSVGMLPVSSQSTSMTQTLQDDSGKIRMKPR 765

Query: 2574 DPRRILHGSGALQKSGKLGSEQFKAIVSPMSNNQGARDNVNAQKSEVGVGTKLVPTKSIA 2753
            DPRRILH +  +QKSG LG+EQFKAIVSP+SNNQG  DNVNAQK E  V +KLVPT+  A
Sbjct: 766  DPRRILHTNNTIQKSGNLGNEQFKAIVSPVSNNQGTGDNVNAQKLEGRVDSKLVPTQPSA 825

Query: 2754 PPDITRQFTSNLKNIADLMSVPQESSDHSPATQNVSSASVPFTLDKAEQ-----SSQNLQ 2918
             PDI RQF  NLKNIAD+MSV QESS H+P  Q  SSASVP T D+ EQ     +SQNL+
Sbjct: 826  QPDIARQFARNLKNIADIMSVSQESSTHTPVAQIFSSASVPLTSDRGEQKSVVSNSQNLE 885

Query: 2919 AVTGLAPETCASGSSRSQSTWADVEHLFEGYNEQQKAAIQRERARRLEEQNKMFSARKXX 3098
            A    A ET ASG+ RSQ+TW DVEHLFEGY+EQQKAAIQRERARR+EEQNKMF+ARK  
Sbjct: 886  AGMVSAHETAASGTCRSQNTWGDVEHLFEGYDEQQKAAIQRERARRIEEQNKMFAARKLC 945

Query: 3099 XXXXXXXXXXNSAKFVEVDPVYDEILRKKEQEDQEMPQRHLFRFSHMGMWTKLRPGVWNF 3278
                      NSAKFVEVDPV+DEILRKKE++D+E P RHLFRF HMGMWTKLRPG+WNF
Sbjct: 946  LVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWNF 1005

Query: 3279 LEKASKLYELHLYTMGNKAYATQMAKVLDPKGLLFAGRVISRGDDTESIDGDERAPRNKD 3458
            LEKASKLYELHLYTMGNK YAT+MAKVLDPKGLLFAGRVISRGDDT+S+DG+ERAP++KD
Sbjct: 1006 LEKASKLYELHLYTMGNKLYATEMAKVLDPKGLLFAGRVISRGDDTDSVDGEERAPKSKD 1065

Query: 3459 LEGVLGMEXXXXXXXXXXRVWPHNRPNLIVVE---RYLYFPSSRRQFGLTGQSLLEVDRD 3629
            LEGVLGME          RVWPHN+ NLIVVE   RY YFP SRRQFGL G SLLE+D D
Sbjct: 1066 LEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERHCRYTYFPCSRRQFGLPGPSLLEIDHD 1125

Query: 3630 EVPEAGTLAVCLGVIEKLHQTFFASQSLEEADVRNILATEQRKILAGCRIVFSRMFPVGE 3809
            E PEAGTLA  L VIEK+HQ FFAS+SLEE DVRNILA+EQRKILAGCRIVFSR+FPVGE
Sbjct: 1126 ERPEAGTLASSLAVIEKIHQIFFASRSLEEVDVRNILASEQRKILAGCRIVFSRVFPVGE 1185

Query: 3810 TNPHLHPLWQTAEQFGAVCTNQIDEQVTHVVASSPGTDKVNWAFSTGRFVVLPGWVEASA 3989
             NPHLHPLWQTAEQFGA CTNQIDEQVTHVVA+SPGTDKVNWA + GRFVV PGWVEASA
Sbjct: 1186 ANPHLHPLWQTAEQFGAFCTNQIDEQVTHVVANSPGTDKVNWALNNGRFVVHPGWVEASA 1245

Query: 3990 LLYRRLNEQDFAIKP 4034
            LLYRR NEQDFAIKP
Sbjct: 1246 LLYRRANEQDFAIKP 1260



 Score =  127 bits (320), Expect = 9e-26
 Identities = 68/108 (62%), Positives = 71/108 (65%)
 Frame = +3

Query: 327 MVFGSFLDCSKLKNRXXXXXXXXXXXXXXISDTASVEEISADDFNKQXXXXXXXXXXXXX 506
           M+FGS LDC KL                 ISDTASVEEISA+DFNKQ             
Sbjct: 1   MIFGSLLDCEKLGKLEKMGKEVEDVEEGEISDTASVEEISAEDFNKQDVKVLNNNNKPNG 60

Query: 507 XXXXRVWAVHDLYSKYPTISRGYASGLYNLAWAQAVQNKPLNDIFIME 650
               RVWAVHDLYSKYPTI RGYASGLYNLAWAQAVQNKPLNDIF+ME
Sbjct: 61  SDA-RVWAVHDLYSKYPTICRGYASGLYNLAWAQAVQNKPLNDIFVME 107


>dbj|GAU14128.1| hypothetical protein TSUD_169530 [Trifolium subterraneum]
          Length = 1204

 Score = 1352 bits (3500), Expect = 0.0
 Identities = 723/1070 (67%), Positives = 816/1070 (76%), Gaps = 12/1070 (1%)
 Frame = +3

Query: 801  MAGGDASETVSDSELLGVKDVLEGVTVANVAESFAETCTRIQSAVQSKVFTGPADSEKDD 980
            M  GDA ETVSDSE+LGVK VLEG+TVANVAESFAETCTRIQ A+QSK+F+G A SEK D
Sbjct: 159  MLSGDAFETVSDSEVLGVKAVLEGITVANVAESFAETCTRIQGALQSKMFSGLAGSEKVD 218

Query: 981  LVRLSFNAIEVVYSVFCSMYNLQKEENKDNILRLLSFLKDEGTHLFSPEHMKEIEVMITA 1160
            L+ LSFNAI+VVYSVFCSM +LQKEENKDNILRLLSFLK+E  HLF+PEH KEI+VMITA
Sbjct: 219  LICLSFNAIKVVYSVFCSMEHLQKEENKDNILRLLSFLKNEHAHLFTPEHTKEIQVMITA 278

Query: 1161 INSVGSLGSSEAVGKEENLETHETKTWEISAVKYGGELISFSKPGNSNSIEASEASKSGQ 1340
            I+SV +LG+++ +G EE LE  + KT +I  +K   ELIS SK    N    SE  K GQ
Sbjct: 279  IDSVDALGNNDVIGNEEKLEALD-KTQQILGLK-ANELISSSKLVLDNLTYPSEVFKYGQ 336

Query: 1341 SIIKGRGVXXXXXXXXXXXXXXXXXXPTREAPSCFPVKKSLSVGEGMDKSGLPLAGKTGC 1520
            S IKGRGV                  PTREAPS F      SVG+G+D+ GLP A KT  
Sbjct: 337  SNIKGRGVMLPLFDLHKVHDLDSLPSPTREAPSGFAGNNLFSVGDGVDRFGLPPAVKTEV 396

Query: 1521 GKMELDGEGSKFHLYETDALRAVSTYQQKFGRSSFFTNDELPSPTPSGDCEEGVVDTNDE 1700
             KMELD + SKFH Y+TDAL+AVSTYQQKF +SSFFT+D+ PSPTPSGDCE GVVDTNDE
Sbjct: 397  EKMELDSKDSKFHNYDTDALKAVSTYQQKFSQSSFFTDDKFPSPTPSGDCEGGVVDTNDE 456

Query: 1701 VXXXXXXXXXXXXKPTLLDQIPVXXXXXXXXXXV-----HGLVNSRIDAAGSGSYAGKTS 1865
            V            +P  LD +P           +     HGL+NSR+DA  SGSY  K S
Sbjct: 457  VSSASIASLLTSSRPPPLDAMPAASSSSSSSSSIDRSSMHGLMNSRLDATSSGSYPVKNS 516

Query: 1866 AKSRDPRLRFINSDASTLDLNQPSGTHNMPKVEYGGTITSRKQKTVEEPSLDATVTKKLR 2045
            AKSRDPRLRFINSDASTLDLNQP G +NMP VEY G + SRKQKT EEPSLDAT  K+LR
Sbjct: 517  AKSRDPRLRFINSDASTLDLNQPLGKNNMPTVEYPGRVISRKQKT-EEPSLDATAPKRLR 575

Query: 2046 RSLENTEHSMREARTMAGNGGWLEDTTLAGSQLIERNHLMQKGETELNTTFSTSSGNLNV 2225
            RSLE +EH+ RE RT+AG GGW E+ T+A                EL  T STSSGNL V
Sbjct: 576  RSLEKSEHNTREERTIAGKGGWFEENTVA----------------ELERTMSTSSGNLTV 619

Query: 2226 TSNGNEQAPVT-SSTTASLPDLLKGIAVNPTMLLNILMEQQ-RLAAEAKKNSADSASSTL 2399
            TS+GNEQ PVT SSTTASLP +L+ +AVNPT+L+NIL++QQ RLAAE +K   DSA+S L
Sbjct: 620  TSDGNEQTPVTGSSTTASLPVILQNMAVNPTILMNILIDQQQRLAAEPQKKPVDSATSIL 679

Query: 2400 HLRSSNSAKGADTTVNIGPAMTAGIPQNSVGMPPVSSQAASMAQRIQEDSGKIRMKPRDP 2579
            HL +SNSA+G  TTVN GPAMTAG+PQ+SVG+ P SS A SMAQ +Q DSGKIRMKPRDP
Sbjct: 680  HLTNSNSARGTATTVNTGPAMTAGLPQSSVGILPASSPATSMAQPLQVDSGKIRMKPRDP 739

Query: 2580 RRILHGSGALQKSGKLGSEQFKAIVSPMSNNQGARDNVNAQKSEVGVGTKLVPTKSIAPP 2759
            RRILHG   LQKSG LGSEQ KAIVSP  NNQG  DNVNAQK +V    KL PT+SI  P
Sbjct: 740  RRILHGISTLQKSGNLGSEQSKAIVSPTPNNQGTGDNVNAQKLDVRAAAKLAPTQSITQP 799

Query: 2760 DITRQFTSNLKNIADLMSVPQESSDHSPATQNVSSASVPFTLDKAEQ-----SSQNLQAV 2924
            DITRQFT NLKNIAD+MSVPQE S H  ATQNVSSASVPFT D+AEQ     +SQNL+  
Sbjct: 800  DITRQFTRNLKNIADIMSVPQEPSTHPLATQNVSSASVPFTSDRAEQKSIVPNSQNLKDG 859

Query: 2925 TGLAPETCASGSSRSQSTWADVEHLFEGYNEQQKAAIQRERARRLEEQNKMFSARKXXXX 3104
             G APETC SGSSR Q+TWADVEHLF+GY+E+QKAAIQRER RRLEEQNKMF+A+K    
Sbjct: 860  VGSAPETCTSGSSRPQNTWADVEHLFDGYDEKQKAAIQRERTRRLEEQNKMFAAKKLCLV 919

Query: 3105 XXXXXXXXNSAKFVEVDPVYDEILRKKEQEDQEMPQRHLFRFSHMGMWTKLRPGVWNFLE 3284
                    NSAKFVEVDPV+DEILRKKE++D+E PQRHLFRF HMGMWTKLRPGVWNFLE
Sbjct: 920  LDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPQRHLFRFPHMGMWTKLRPGVWNFLE 979

Query: 3285 KASKLYELHLYTMGNKAYATQMAKVLDPKGLLFAGRVISRGDDTESIDGDERAPRNKDLE 3464
            KASKL+E+HLYTMGNK YAT+MAKVLDPKG+LFAGRVISRGDD E++D      ++KDLE
Sbjct: 980  KASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDAETVD-----TKSKDLE 1034

Query: 3465 GVLGMEXXXXXXXXXXRVWPHNRPNLIVVERYLYFPSSRRQFGLTGQSLLEVDRDEVPEA 3644
            GVLGME          RVWPHN+ NLIVVERY YFP SRRQFGL G SLLE+D DE P+ 
Sbjct: 1035 GVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPDT 1094

Query: 3645 GTLAVCLGVIEKLHQTFFASQSLEEADVRNILATEQRKILAGCRIVFSRMFPVGETNPHL 3824
            GTLA  LGVIE++HQ FFAS+SLEE DVRNILA+EQRKIL GCRIVFSR+FPVGE NPHL
Sbjct: 1095 GTLASSLGVIERIHQNFFASESLEEVDVRNILASEQRKILDGCRIVFSRIFPVGEANPHL 1154

Query: 3825 HPLWQTAEQFGAVCTNQIDEQVTHVVASSPGTDKVNWAFSTGRFVVLPGW 3974
            HPLWQTAEQFGA CTNQID+QVTHVVA+S GTDKVNWA S G+FVV P W
Sbjct: 1155 HPLWQTAEQFGASCTNQIDDQVTHVVANSLGTDKVNWATSNGKFVVYPSW 1204



 Score =  104 bits (259), Expect = 1e-18
 Identities = 56/88 (63%), Positives = 58/88 (65%), Gaps = 9/88 (10%)
 Frame = +3

Query: 414 ISDTASVEEISADDFNK---------QXXXXXXXXXXXXXXXXXRVWAVHDLYSKYPTIS 566
           ISDTASVEEIS +DFNK                           RVWAV DLYSKYPTI 
Sbjct: 13  ISDTASVEEISEEDFNKPDVVKVNNNSDKVKSGSGGGGGGGGDSRVWAVQDLYSKYPTIC 72

Query: 567 RGYASGLYNLAWAQAVQNKPLNDIFIME 650
           RGYASGLYNLAWAQAVQNKPLNDIF+ME
Sbjct: 73  RGYASGLYNLAWAQAVQNKPLNDIFVME 100


>ref|XP_003530482.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Glycine max]
 gb|KRH45208.1| hypothetical protein GLYMA_08G257900 [Glycine max]
          Length = 1261

 Score = 1352 bits (3499), Expect = 0.0
 Identities = 733/1102 (66%), Positives = 833/1102 (75%), Gaps = 28/1102 (2%)
 Frame = +3

Query: 813  DASETVSDSELLGVKDVLEGVTVANVAESFAETCTRIQSAVQSKVFTGPADSEKDDLVRL 992
            D    VSDSE LG + VLEGVTVANV ESFA+TC+++Q+ +  +V + PA SEKDDLVRL
Sbjct: 176  DVKMDVSDSEQLGARGVLEGVTVANVVESFAQTCSKLQNTLP-EVLSRPAGSEKDDLVRL 234

Query: 993  SFNAIEVVYSVFCSMYNLQKEENKDNILRLLSFLKDEG-THLFSPEHMKEIEVMITAINS 1169
            SFNA EVVYSVFCSM + +KE+NKD+ILRLLSF+KD+    LFSPEH+KEI+ M+TAI+S
Sbjct: 235  SFNATEVVYSVFCSMDSSEKEQNKDSILRLLSFVKDQQQAQLFSPEHVKEIQGMMTAIDS 294

Query: 1170 VGSLGSSEAVGKEENLETHETKTWEISAVKY--------------GGELISFSKPGNSNS 1307
            VG+L +SEA+GKE+ L+T E KT E SAV+                 ELIS+SKP + + 
Sbjct: 295  VGALVNSEAIGKEKELQTTEIKTQENSAVEVQIHEIKTQENQAVEAAELISYSKPLHRDI 354

Query: 1308 IEASEASKSGQSIIKGRGVXXXXXXXXXXXXXXXXXXPTREAPSCFPVKKSLSVGEGMDK 1487
               S+A K GQ+ IKGRGV                  PTREAPSCFPV K LSVGE M +
Sbjct: 355  TGTSQALKFGQNSIKGRGVLLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGESMVR 414

Query: 1488 SGLPLAGKTGCGKMELDGEGSKFHLYETDALRAVSTYQQKFGRSSFFTNDELPSPTPSGD 1667
            SG      +   KMELD EGSKFHLYETDAL+AVSTYQQKFGRSS FTND+ PSPTPSGD
Sbjct: 415  SG------SASAKMELDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPSGD 468

Query: 1668 CEEGVVDTNDEVXXXXXXXXXXXXKPTLLDQIPVXXXXXXXXXXVHGLVNSRIDAAGSGS 1847
            CE+ VVDTN+EV            KPTLLDQ PV          +HG ++SR+DA G GS
Sbjct: 469  CEDEVVDTNEEVSSASTGDFLTSTKPTLLDQPPVSATSMDRSS-MHGFISSRVDATGPGS 527

Query: 1848 YAGKTSAKSRDPRLRFINSDASTLDLNQPSGTHNMPKVEYGGTITSRKQKTVEEPSLDAT 2027
            +  K+SAK+RDPRLRFINSDAS +D N  +  +NM KVEY GT  SRKQK  EEPSLD T
Sbjct: 528  FPVKSSAKNRDPRLRFINSDASAVD-NLSTLINNMSKVEYSGTTISRKQKAAEEPSLDVT 586

Query: 2028 VTKKLRRSLENTEHSMREARTMAGNGGWLEDTTLAGSQLIERNHLMQKGETE----LNTT 2195
            V+K+L+ SLENTEH+M E RT  G+GGWLE+ T  G+QLIERNHLM K   E    LNT 
Sbjct: 587  VSKRLKSSLENTEHNMSEVRT--GSGGWLEENTGPGAQLIERNHLMDKFGPEAKKTLNTV 644

Query: 2196 FS--TSSGNLNVTSNGNEQAPVTSSTT-ASLPDLLKGIAVNPTMLLNILMEQQRLAAEAK 2366
             S  T S N N TS  NEQAP+T+S   ASLP LLK  +VNP ML+NIL       AEA+
Sbjct: 645  SSSCTGSDNFNATSIRNEQAPITASNVLASLPALLKEASVNPIMLVNILR-----LAEAQ 699

Query: 2367 KNSADSAS-STLHLRSSNSAKGADTTVNIGPAMTAGIPQNSVGMPPVSSQAASMAQRIQE 2543
            K SADSA+   LH  SSN A G D+T +IG +M  G+ Q+SVGM PVSSQ+ S AQ +Q+
Sbjct: 700  KKSADSAAIMLLHPTSSNPAMGTDSTASIGSSMATGLLQSSVGMLPVSSQSTSTAQTLQD 759

Query: 2544 DSGKIRMKPRDPRRILHGSGALQKSGKLGSEQFKAIVSPMSNNQGARDNVNAQKSEVGVG 2723
            DSGKIRMKPRDPRRILH +  +QKSG LG+EQFKAIVSP+SNNQ   DNVNA K E  V 
Sbjct: 760  DSGKIRMKPRDPRRILHTNNTIQKSGDLGNEQFKAIVSPVSNNQRTGDNVNAPKLEGRVD 819

Query: 2724 TKLVPTKSIAPPDITRQFTSNLKNIADLMSVPQESSDHSPATQNVSSASVPFTLDKAEQ- 2900
             KLVPT+S A PDI RQFT NLKNIAD+MSV QESS H+P +QN SSASVP T D+ EQ 
Sbjct: 820  NKLVPTQSSAQPDIARQFTRNLKNIADIMSVSQESSTHTPVSQNFSSASVPLTSDRGEQK 879

Query: 2901 ----SSQNLQAVTGLAPETCASGSSRSQSTWADVEHLFEGYNEQQKAAIQRERARRLEEQ 3068
                SSQNLQA    A ET AS +SRSQSTW DVEHLFEGY+EQQKAAIQRERARR+EEQ
Sbjct: 880  SVVSSSQNLQADMASAHETAASVTSRSQSTWGDVEHLFEGYDEQQKAAIQRERARRIEEQ 939

Query: 3069 NKMFSARKXXXXXXXXXXXXNSAKFVEVDPVYDEILRKKEQEDQEMPQRHLFRFSHMGMW 3248
            NKMF+ARK            NSAKFVEVDP++DEILRKKE++D+E P RHLFRF HMGMW
Sbjct: 940  NKMFAARKLCLVLDLDHTLLNSAKFVEVDPLHDEILRKKEEQDREKPHRHLFRFPHMGMW 999

Query: 3249 TKLRPGVWNFLEKASKLYELHLYTMGNKAYATQMAKVLDPKGLLFAGRVISRGDDTESID 3428
            TKLRPG+WNFLEKASKLYELHLYTMGNK YAT+MAKVLDPKG+LFAGRVISRGDDT+S+D
Sbjct: 1000 TKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDTDSVD 1059

Query: 3429 GDERAPRNKDLEGVLGMEXXXXXXXXXXRVWPHNRPNLIVVERYLYFPSSRRQFGLTGQS 3608
            G+ER P++KDLEGVLGME          RVWPHN+ NLIVVERY YFP SRRQFGL G S
Sbjct: 1060 GEERVPKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPS 1119

Query: 3609 LLEVDRDEVPEAGTLAVCLGVIEKLHQTFFASQSLEEADVRNILATEQRKILAGCRIVFS 3788
            LLE+D DE PEAGTLA  L VIEK+HQ FFASQSLEE DVRNILA+EQRKILAGCRIVFS
Sbjct: 1120 LLEIDHDERPEAGTLASSLAVIEKIHQIFFASQSLEEVDVRNILASEQRKILAGCRIVFS 1179

Query: 3789 RMFPVGETNPHLHPLWQTAEQFGAVCTNQIDEQVTHVVASSPGTDKVNWAFSTGRFVVLP 3968
            R+FPVGE NPHLHPLWQTAEQFGAVCTNQIDEQVTHVVA+SPGTDKVNWA + GRFVV P
Sbjct: 1180 RVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSPGTDKVNWALNNGRFVVHP 1239

Query: 3969 GWVEASALLYRRLNEQDFAIKP 4034
            GWVEASALLYRR NEQDFAIKP
Sbjct: 1240 GWVEASALLYRRANEQDFAIKP 1261



 Score =  125 bits (314), Expect = 5e-25
 Identities = 68/108 (62%), Positives = 70/108 (64%)
 Frame = +3

Query: 327 MVFGSFLDCSKLKNRXXXXXXXXXXXXXXISDTASVEEISADDFNKQXXXXXXXXXXXXX 506
           MVFGS LDC  L                 ISDTASVEEISA+DFNKQ             
Sbjct: 1   MVFGSLLDCEVLGKLEKMGKEAEDVEEGEISDTASVEEISAEDFNKQDVKLLNNNNKPNG 60

Query: 507 XXXXRVWAVHDLYSKYPTISRGYASGLYNLAWAQAVQNKPLNDIFIME 650
               RVWAVHDLYSKYPTI RGYASGLYNLAWAQAVQNKPLNDIF+ME
Sbjct: 61  SDA-RVWAVHDLYSKYPTICRGYASGLYNLAWAQAVQNKPLNDIFVME 107


>gb|KHN47532.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Glycine soja]
          Length = 1263

 Score = 1346 bits (3483), Expect = 0.0
 Identities = 734/1102 (66%), Positives = 833/1102 (75%), Gaps = 28/1102 (2%)
 Frame = +3

Query: 813  DASETVSDSELLGVKDVLEGVTVANVAESFAETCTRIQSAVQSKVFTGPADSEKDDLVRL 992
            D    VSDSE LG + VLEGVTVANV ESFA+TC+++Q+ +  +V + PA SEKDDLVRL
Sbjct: 176  DVKMDVSDSEQLGARGVLEGVTVANVVESFAQTCSKLQNTLP-EVLSRPAGSEKDDLVRL 234

Query: 993  SFNAIEVVYSVFCSMYNLQKEENKDNILRLLSFLKDEG-THLFSPEHMKEIEVMITAINS 1169
            SFNA EVV   FCSM + +KE+NKD+ILRLLSF+KD+    LFSPEH+KEI+ M+TAI+S
Sbjct: 235  SFNATEVV---FCSMDSSEKEQNKDSILRLLSFVKDQQQAQLFSPEHVKEIQGMLTAIDS 291

Query: 1170 VGSLGSSEAVGKEENLETHETKTWEISAVKY--------------GGELISFSKPGNSNS 1307
            VG+L +SEA+GKE+ L+T E KT E SAV+                 ELIS+SKP + + 
Sbjct: 292  VGALVNSEAIGKEKELQTTEIKTQENSAVEVQIHEIKTQENQAVEAAELISYSKPLHRDI 351

Query: 1308 IEASEASKSGQSIIKGRGVXXXXXXXXXXXXXXXXXXPTREAPSCFPVKKSLSVGEGMDK 1487
               S+A K GQ+ IKGRGV                  PTREAPSCFPV K LSVGE M +
Sbjct: 352  TGTSQALKFGQNSIKGRGVLLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGESMVR 411

Query: 1488 SGLPLAGKTGCGKMELDGEGSKFHLYETDALRAVSTYQQKFGRSSFFTNDELPSPTPSGD 1667
            SG   A K   GKMELD EGSKFHLYETDAL+AVSTYQQKFGRSS FTND+ PSPTPSGD
Sbjct: 412  SGSASA-KMESGKMELDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPSGD 470

Query: 1668 CEEGVVDTNDEVXXXXXXXXXXXXKPTLLDQIPVXXXXXXXXXXVHGLVNSRIDAAGSGS 1847
            CE+ VVDT +EV            KPTLLDQ PV          +HG ++SR+DAAG GS
Sbjct: 471  CEDEVVDTIEEVSSASTGDFLTSTKPTLLDQPPVSATSMDRSS-MHGFISSRVDAAGPGS 529

Query: 1848 YAGKTSAKSRDPRLRFINSDASTLDLNQPSGTHNMPKVEYGGTITSRKQKTVEEPSLDAT 2027
            +  K+SAK+RDPRLRFINSDAS +D N  +  +NM KVEY GT  SRKQK  EEPSLD T
Sbjct: 530  FPVKSSAKNRDPRLRFINSDASAVD-NLSTLINNMSKVEYSGTTISRKQKAAEEPSLDVT 588

Query: 2028 VTKKLRRSLENTEHSMREARTMAGNGGWLEDTTLAGSQLIERNHLMQKGETE----LNTT 2195
            V+K+L+ SLENTEH+M E RT  G+GGWLE+ T  G+QLIERNHLM K   E    LNT 
Sbjct: 589  VSKRLKSSLENTEHNMSEVRT--GSGGWLEENTGPGAQLIERNHLMDKFGPEAKKTLNTV 646

Query: 2196 FS--TSSGNLNVTSNGNEQAPVTSSTT-ASLPDLLKGIAVNPTMLLNILMEQQRLAAEAK 2366
             S  T S N N TS  NEQAP+T+S   ASLP LLK  +VNP ML+NIL       AEA+
Sbjct: 647  SSSCTGSDNFNATSIRNEQAPITASNVLASLPALLKEASVNPIMLVNILR-----LAEAQ 701

Query: 2367 KNSADSAS-STLHLRSSNSAKGADTTVNIGPAMTAGIPQNSVGMPPVSSQAASMAQRIQE 2543
            K SADSA+   LH  SSN A G D+T +IG +M  G+ Q+SVGM PVSSQ+ S AQ +Q+
Sbjct: 702  KKSADSAAIMLLHPTSSNPAMGTDSTASIGSSMATGLLQSSVGMLPVSSQSTSTAQTLQD 761

Query: 2544 DSGKIRMKPRDPRRILHGSGALQKSGKLGSEQFKAIVSPMSNNQGARDNVNAQKSEVGVG 2723
            DSGKIRMKPRDPRRILH +  +QKSG LG+EQFKAIVSP+SNNQ   DNVNAQK E  V 
Sbjct: 762  DSGKIRMKPRDPRRILHTNNTIQKSGDLGNEQFKAIVSPVSNNQRTGDNVNAQKLEGRVD 821

Query: 2724 TKLVPTKSIAPPDITRQFTSNLKNIADLMSVPQESSDHSPATQNVSSASVPFTLDKAEQ- 2900
             KLVPT+S A PDI RQFT NLKNIAD+MSV QESS H+P +QN SSASVP T D+ EQ 
Sbjct: 822  NKLVPTQSSAQPDIARQFTRNLKNIADIMSVSQESSTHTPVSQNFSSASVPLTSDRGEQK 881

Query: 2901 ----SSQNLQAVTGLAPETCASGSSRSQSTWADVEHLFEGYNEQQKAAIQRERARRLEEQ 3068
                SSQNLQA    A ET AS +SRSQSTW DVEHLFEGY+EQQKAAIQRERARR+EEQ
Sbjct: 882  SVVSSSQNLQADMASAHETAASVTSRSQSTWGDVEHLFEGYDEQQKAAIQRERARRIEEQ 941

Query: 3069 NKMFSARKXXXXXXXXXXXXNSAKFVEVDPVYDEILRKKEQEDQEMPQRHLFRFSHMGMW 3248
            NKMF+ARK            NSAKFVEVDP++DEILRKKE++D+E P RHLFRF HMGMW
Sbjct: 942  NKMFAARKLCLVLDLDHTLLNSAKFVEVDPLHDEILRKKEEQDREKPHRHLFRFPHMGMW 1001

Query: 3249 TKLRPGVWNFLEKASKLYELHLYTMGNKAYATQMAKVLDPKGLLFAGRVISRGDDTESID 3428
            TKLRPG+WNFLEKASKLYELHLYTMGNK YAT+MAKVLDPKG+LFAGRVISRGDDT+S+D
Sbjct: 1002 TKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDTDSVD 1061

Query: 3429 GDERAPRNKDLEGVLGMEXXXXXXXXXXRVWPHNRPNLIVVERYLYFPSSRRQFGLTGQS 3608
            G+ER P++KDLEGVLGME          RVWPHN+ NLIVVERY YFP SRRQFGL G S
Sbjct: 1062 GEERVPKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPS 1121

Query: 3609 LLEVDRDEVPEAGTLAVCLGVIEKLHQTFFASQSLEEADVRNILATEQRKILAGCRIVFS 3788
            LLE+D DE PEAGTLA  L VIEK+HQ FFASQSLEE DVRNILA+EQRKILAGCRIVFS
Sbjct: 1122 LLEIDHDERPEAGTLASSLAVIEKIHQIFFASQSLEEVDVRNILASEQRKILAGCRIVFS 1181

Query: 3789 RMFPVGETNPHLHPLWQTAEQFGAVCTNQIDEQVTHVVASSPGTDKVNWAFSTGRFVVLP 3968
            R+FPVGE NPHLHPLWQTAEQFGAVCTNQIDEQVTHVVA+SPGTDKVNWA + GRFVV P
Sbjct: 1182 RVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSPGTDKVNWALNNGRFVVHP 1241

Query: 3969 GWVEASALLYRRLNEQDFAIKP 4034
            GWVEASALLYRR NEQDFAIKP
Sbjct: 1242 GWVEASALLYRRANEQDFAIKP 1263



 Score =  125 bits (314), Expect = 5e-25
 Identities = 68/108 (62%), Positives = 70/108 (64%)
 Frame = +3

Query: 327 MVFGSFLDCSKLKNRXXXXXXXXXXXXXXISDTASVEEISADDFNKQXXXXXXXXXXXXX 506
           MVFGS LDC  L                 ISDTASVEEISA+DFNKQ             
Sbjct: 1   MVFGSLLDCEVLGKLEKMGKEAEDVEEGEISDTASVEEISAEDFNKQDVKLLNNNNKPNG 60

Query: 507 XXXXRVWAVHDLYSKYPTISRGYASGLYNLAWAQAVQNKPLNDIFIME 650
               RVWAVHDLYSKYPTI RGYASGLYNLAWAQAVQNKPLNDIF+ME
Sbjct: 61  SDA-RVWAVHDLYSKYPTICRGYASGLYNLAWAQAVQNKPLNDIFVME 107


>dbj|BAT83124.1| hypothetical protein VIGAN_04022800 [Vigna angularis var. angularis]
          Length = 1275

 Score = 1311 bits (3394), Expect = 0.0
 Identities = 720/1099 (65%), Positives = 832/1099 (75%), Gaps = 30/1099 (2%)
 Frame = +3

Query: 828  VSDSELLGVKDVLEGVTVANVAESFAETCTRIQSAVQSKVFTGPADSEKDDLVRLSFNAI 1007
            VSDSE LGV++VLEGVTVANVAESF +T +R+ +A+  +VF+ PADSEKDDL+RLSFNAI
Sbjct: 184  VSDSEQLGVRNVLEGVTVANVAESFVQTSSRLLNALP-EVFSRPADSEKDDLIRLSFNAI 242

Query: 1008 EVVYSVFCSMYNLQKEENKDNILRLLSFLKD-EGTHLFSPEHMKEIEVMITAINSVGSLG 1184
            EVVYSVF SM +  KE NKDNILRLLS +KD E   LFSP+H++EI+ M+TAI+SVG+LG
Sbjct: 243  EVVYSVFRSMDSSDKERNKDNILRLLSSVKDQEQAQLFSPKHIEEIQGMMTAIDSVGALG 302

Query: 1185 SSEAVGKEENLETHETKTWEISAVKY--------------GGELISFSKPGNSNSIEASE 1322
            SSEA+  +   +T E K+ E SA++                  LIS  KP +S+ I  S 
Sbjct: 303  SSEAIYTKTESQTPEIKSQENSALEVQTHSINIQENQAVEATALISSVKPLHSDIIGGSR 362

Query: 1323 ASKSGQSIIKGRGVXXXXXXXXXXXXXXXXXXPTREAPSCFPVKKSLSVGEGMDKSGLPL 1502
            A K GQ+ IKGRG+                  PTREAPSCFPV K LSVGE M KSG   
Sbjct: 363  ALKFGQNSIKGRGILLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGEAMVKSGS-- 420

Query: 1503 AGKTGCGKMELDGEGS-KFHLYETDALRAVSTYQQKFGRSSFFTNDELPSPTPSGDCEEG 1679
            A K   GK+E+D EGS KFHLYETDAL+AVSTYQQKFGRSS FTND+LPSPTPSGDC++ 
Sbjct: 421  AAKMQPGKVEVDSEGSTKFHLYETDALKAVSTYQQKFGRSSLFTNDKLPSPTPSGDCDDM 480

Query: 1680 VVDTNDEVXXXXXXXXXXXXKPTLLDQIPVXXXXXXXXXXVHGLVNSRIDAAGSGSYAGK 1859
            VVDTN+EV            KPTL+DQ PV          + GL+NSR+DAAG GS+  K
Sbjct: 481  VVDTNEEVSSASIGGFLTTTKPTLIDQPPVSGTSMDNSRLL-GLINSRVDAAGPGSFPVK 539

Query: 1860 TSAKSRDPRLRFINSDASTLDLNQPSGTHNMPKVEYGGTITSRKQKTVEEPSLDATVTKK 2039
            +SAKSRDPR R INS+A+ +D N     +NMPKVEY G+  SRKQK VEEP  D TV+K+
Sbjct: 540  SSAKSRDPRRRLINSEANAVD-NHSVVINNMPKVEYAGSAISRKQKAVEEP-FDVTVSKR 597

Query: 2040 LRRSLENTEHSMREARTMAGNGGWLEDTTLAGSQLIERNHLMQKGETE----LNTTFSTS 2207
            L+ SLEN EH+  + RT+AG GGWLED T  G++LIE+N+LM K   E    LNT  S+ 
Sbjct: 598  LKSSLENIEHNSSQVRTIAGTGGWLEDNTGPGTELIEKNNLMDKFAPEPKKTLNTVSSSC 657

Query: 2208 SGNL--NVTSNGNEQAPVTSSTTAS-LPDLLKGIAVNPTMLLNILMEQQRLAAEAKKNSA 2378
            SG++  N TS  NEQ P+TSS  AS LP +LK I VNPTMLL ++ EQQ     A   S+
Sbjct: 658  SGSVAFNATSIRNEQVPITSSNIASSLPAVLKDIVVNPTMLLGLIFEQQNRLRNAVNKSS 717

Query: 2379 DSASSTLHLRSSNSAKGADTTVNIGPAMTAGIPQNSVGMPPVSSQAASMAQRIQED-SGK 2555
            DSA++ L+  SSNSA G D+TV+IG +M  G+ Q SVGM PVSSQ+ S AQ +Q+D SGK
Sbjct: 718  DSATNILNPTSSNSATGTDSTVSIGSSMATGL-QTSVGMLPVSSQSTSTAQSLQDDYSGK 776

Query: 2556 IRMKPRDPRRILHGSGALQKSGKLGSEQFKAIVSPMSNNQGARDNVNAQKSEVGVGTKLV 2735
            IRMKPRDPRRILH + ++QKSG + +E  KAIVSP+SN+Q   +NVNAQK E  V TKLV
Sbjct: 777  IRMKPRDPRRILHTNNSVQKSGNIVNELHKAIVSPVSNSQVTGENVNAQKLEGRVDTKLV 836

Query: 2736 PTKSIAPPDITRQFTSNLKNIADLMSVPQESSDHSPATQNVSSASVPFTLDKAEQ----- 2900
            PT+S A PDITRQFT NLKNIAD+MSV QESS HS A Q+ SSASVP  +D+ EQ     
Sbjct: 837  PTQSGAAPDITRQFTKNLKNIADIMSVSQESSTHSTAAQSFSSASVPLNIDRGEQKSVVS 896

Query: 2901 SSQNLQAVT-GLAPETCASGSSRSQSTWADVEHLFEGYNEQQKAAIQRERARRLEEQNKM 3077
            +SQNLQA T G A E CA G+SRSQSTW DVEHLFEGY+EQQKAAIQRERARR+EEQNKM
Sbjct: 897  NSQNLQAGTVGSAHEICAPGTSRSQSTWGDVEHLFEGYDEQQKAAIQRERARRIEEQNKM 956

Query: 3078 FSARKXXXXXXXXXXXXNSAKFVEVDPVYDEILRKKEQEDQEMPQRHLFRFSHMGMWTKL 3257
            F+ARK            NSAKFVEVDPV+DEILRKKE++D+E P RHLFRF HMGMWTKL
Sbjct: 957  FAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKL 1016

Query: 3258 RPGVWNFLEKASKLYELHLYTMGNKAYATQMAKVLDPKGLLFAGRVISRGDDTESIDGDE 3437
            RPG+WNFLEKASKLYELHLYTMGNK YAT+MAKVLDPKG+LFAGRVISRGDDT+S+DG+E
Sbjct: 1017 RPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDTDSVDGEE 1076

Query: 3438 RAPRNKDLEGVLGMEXXXXXXXXXXRVWPHNRPNLIVVERYLYFPSSRRQFGLTGQSLLE 3617
            RAP++KDLEGVLGME          RVWPHN+ NLIVVERY YFP SRRQFGL G SLLE
Sbjct: 1077 RAPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLE 1136

Query: 3618 VDRDEVPEAGTLAVCLGVIEKLHQTFFASQSLEEADVRNILATEQRKILAGCRIVFSRMF 3797
            +D DE PEAGTLA  L VIE++HQ FFASQSLEE DVRNILA+EQRKILAGCRIVFSR+F
Sbjct: 1137 IDHDERPEAGTLASSLAVIERIHQNFFASQSLEEVDVRNILASEQRKILAGCRIVFSRVF 1196

Query: 3798 PVGETNPHLHPLWQTAEQFGAVCTNQIDEQVTHVVASSPGTDKVNWAFSTGRFVVLPGWV 3977
            PVGE NPHLHPLWQTAEQFGAVCTNQIDEQVTHVVA+S GTDKVNWA STGRFVV PGWV
Sbjct: 1197 PVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWV 1256

Query: 3978 EASALLYRRLNEQDFAIKP 4034
            EASALLYRR NEQDFAIKP
Sbjct: 1257 EASALLYRRANEQDFAIKP 1275



 Score =  124 bits (312), Expect = 8e-25
 Identities = 68/108 (62%), Positives = 69/108 (63%)
 Frame = +3

Query: 327 MVFGSFLDCSKLKNRXXXXXXXXXXXXXXISDTASVEEISADDFNKQXXXXXXXXXXXXX 506
           MVFGS LDC KL                 ISDTASVEEIS  DFNKQ             
Sbjct: 1   MVFGSLLDCQKLGKLEKMGKEVEDVEEGEISDTASVEEISEADFNKQDVKVNNNNKPNGS 60

Query: 507 XXXXRVWAVHDLYSKYPTISRGYASGLYNLAWAQAVQNKPLNDIFIME 650
               RVWAVHDLYSKYPTI RGYASGLYNLAWAQAVQNKPLNDIF+ME
Sbjct: 61  DA--RVWAVHDLYSKYPTICRGYASGLYNLAWAQAVQNKPLNDIFVME 106


>ref|XP_017419004.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Vigna angularis]
 gb|KOM36704.1| hypothetical protein LR48_Vigan03g008500 [Vigna angularis]
          Length = 1275

 Score = 1309 bits (3387), Expect = 0.0
 Identities = 719/1099 (65%), Positives = 831/1099 (75%), Gaps = 30/1099 (2%)
 Frame = +3

Query: 828  VSDSELLGVKDVLEGVTVANVAESFAETCTRIQSAVQSKVFTGPADSEKDDLVRLSFNAI 1007
            VSDSE LGV++VLEGVTVANVAESF +T +R+ +A+  +VF+ PADSEKDDL+RLSFNAI
Sbjct: 184  VSDSEQLGVRNVLEGVTVANVAESFVQTSSRLLNALP-EVFSRPADSEKDDLIRLSFNAI 242

Query: 1008 EVVYSVFCSMYNLQKEENKDNILRLLSFLKD-EGTHLFSPEHMKEIEVMITAINSVGSLG 1184
            EVVYSVF SM +  KE NKDNILRLLS +KD E   LFSP+H++EI+ M+TAI+SVG+LG
Sbjct: 243  EVVYSVFRSMDSSDKERNKDNILRLLSSVKDQEQAQLFSPKHIEEIQGMMTAIDSVGALG 302

Query: 1185 SSEAVGKEENLETHETKTWEISAVKY--------------GGELISFSKPGNSNSIEASE 1322
            SSEA+  +   +T E K+ E SA++                  LIS  KP +S+ I  S 
Sbjct: 303  SSEAIYTKTESQTPEIKSQENSALEVQTHSINIQENQAVEATALISSVKPLHSDIIGGSR 362

Query: 1323 ASKSGQSIIKGRGVXXXXXXXXXXXXXXXXXXPTREAPSCFPVKKSLSVGEGMDKSGLPL 1502
            A K GQ+ IKGRG+                  PTREAPSCFPV K LSVGE M KS    
Sbjct: 363  ALKFGQNSIKGRGILLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGEAMVKSDS-- 420

Query: 1503 AGKTGCGKMELDGEGS-KFHLYETDALRAVSTYQQKFGRSSFFTNDELPSPTPSGDCEEG 1679
            A K   GK+E+D EGS KFHLYETDAL+AVSTYQQKFGRSS FTND+LPSPTPSGDC++ 
Sbjct: 421  AAKMQPGKVEVDSEGSTKFHLYETDALKAVSTYQQKFGRSSLFTNDKLPSPTPSGDCDDM 480

Query: 1680 VVDTNDEVXXXXXXXXXXXXKPTLLDQIPVXXXXXXXXXXVHGLVNSRIDAAGSGSYAGK 1859
            VVDTN+EV            KPTL+DQ PV          + GL+NSR+DAAG GS+  K
Sbjct: 481  VVDTNEEVSSASIGGFLTTTKPTLIDQPPVSGTSMDNSRLL-GLINSRVDAAGPGSFPVK 539

Query: 1860 TSAKSRDPRLRFINSDASTLDLNQPSGTHNMPKVEYGGTITSRKQKTVEEPSLDATVTKK 2039
            +SAKSRDPR R INS+A+ +D N     +NMPKVEY G+  SRKQK VEEP  D TV+K+
Sbjct: 540  SSAKSRDPRRRLINSEANAVD-NHSVVINNMPKVEYAGSAISRKQKAVEEP-FDVTVSKR 597

Query: 2040 LRRSLENTEHSMREARTMAGNGGWLEDTTLAGSQLIERNHLMQKGETE----LNTTFSTS 2207
            L+ SLEN EH+  + RT+AG GGWLED T  G++LIE+N+LM K   E    LNT  S+ 
Sbjct: 598  LKSSLENIEHNSSQVRTIAGTGGWLEDNTGPGTELIEKNNLMDKFAPEPKKTLNTVSSSC 657

Query: 2208 SGNL--NVTSNGNEQAPVTSSTTAS-LPDLLKGIAVNPTMLLNILMEQQRLAAEAKKNSA 2378
            SG++  N TS  NEQ P+TSS  AS LP +LK I VNPTMLL ++ EQQ     A   S+
Sbjct: 658  SGSVAFNATSIRNEQVPITSSNIASSLPAVLKDIVVNPTMLLGLIFEQQNRLRNAVNKSS 717

Query: 2379 DSASSTLHLRSSNSAKGADTTVNIGPAMTAGIPQNSVGMPPVSSQAASMAQRIQED-SGK 2555
            DSA++ L+  SSNSA G D+TV+IG +M  G+ Q SVGM PVSSQ+ S AQ +Q+D SGK
Sbjct: 718  DSATNILNPTSSNSATGTDSTVSIGSSMATGL-QTSVGMLPVSSQSTSTAQSLQDDYSGK 776

Query: 2556 IRMKPRDPRRILHGSGALQKSGKLGSEQFKAIVSPMSNNQGARDNVNAQKSEVGVGTKLV 2735
            IRMKPRDPRRILH + ++QKSG + +E  KAIVSP+SN+Q   +NVNAQK E  V TKLV
Sbjct: 777  IRMKPRDPRRILHTNNSVQKSGNIVNELHKAIVSPVSNSQVTGENVNAQKLEGRVDTKLV 836

Query: 2736 PTKSIAPPDITRQFTSNLKNIADLMSVPQESSDHSPATQNVSSASVPFTLDKAEQ----- 2900
            PT+S A PDITRQFT NLKNIAD+MSV QESS HS A Q+ SSASVP  +D+ EQ     
Sbjct: 837  PTQSGAAPDITRQFTKNLKNIADIMSVSQESSTHSTAAQSFSSASVPLNIDRGEQKSVVS 896

Query: 2901 SSQNLQAVT-GLAPETCASGSSRSQSTWADVEHLFEGYNEQQKAAIQRERARRLEEQNKM 3077
            +SQNLQA T G A E CA G+SRSQSTW DVEHLFEGY+EQQKAAIQRERARR+EEQNKM
Sbjct: 897  NSQNLQAGTVGSAHEICAPGTSRSQSTWGDVEHLFEGYDEQQKAAIQRERARRIEEQNKM 956

Query: 3078 FSARKXXXXXXXXXXXXNSAKFVEVDPVYDEILRKKEQEDQEMPQRHLFRFSHMGMWTKL 3257
            F+ARK            NSAKFVEVDPV+DEILRKKE++D+E P RHLFRF HMGMWTKL
Sbjct: 957  FAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKL 1016

Query: 3258 RPGVWNFLEKASKLYELHLYTMGNKAYATQMAKVLDPKGLLFAGRVISRGDDTESIDGDE 3437
            RPG+WNFLEKASKLYELHLYTMGNK YAT+MAKVLDPKG+LFAGRVISRGDDT+S+DG+E
Sbjct: 1017 RPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDTDSVDGEE 1076

Query: 3438 RAPRNKDLEGVLGMEXXXXXXXXXXRVWPHNRPNLIVVERYLYFPSSRRQFGLTGQSLLE 3617
            RAP++KDLEGVLGME          RVWPHN+ NLIVVERY YFP SRRQFGL G SLLE
Sbjct: 1077 RAPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLE 1136

Query: 3618 VDRDEVPEAGTLAVCLGVIEKLHQTFFASQSLEEADVRNILATEQRKILAGCRIVFSRMF 3797
            +D DE PEAGTLA  L VIE++HQ FFASQSLEE DVRNILA+EQRKILAGCRIVFSR+F
Sbjct: 1137 IDHDERPEAGTLASSLAVIERIHQNFFASQSLEEVDVRNILASEQRKILAGCRIVFSRVF 1196

Query: 3798 PVGETNPHLHPLWQTAEQFGAVCTNQIDEQVTHVVASSPGTDKVNWAFSTGRFVVLPGWV 3977
            PVGE NPHLHPLWQTAEQFGAVCTNQIDEQVTHVVA+S GTDKVNWA STGRFVV PGWV
Sbjct: 1197 PVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWV 1256

Query: 3978 EASALLYRRLNEQDFAIKP 4034
            EASALLYRR NEQDFAIKP
Sbjct: 1257 EASALLYRRANEQDFAIKP 1275



 Score =  124 bits (312), Expect = 8e-25
 Identities = 68/108 (62%), Positives = 69/108 (63%)
 Frame = +3

Query: 327 MVFGSFLDCSKLKNRXXXXXXXXXXXXXXISDTASVEEISADDFNKQXXXXXXXXXXXXX 506
           MVFGS LDC KL                 ISDTASVEEIS  DFNKQ             
Sbjct: 1   MVFGSLLDCQKLGKLEKMGKEVEDVEEGEISDTASVEEISEADFNKQDVKVNNNNKPNGS 60

Query: 507 XXXXRVWAVHDLYSKYPTISRGYASGLYNLAWAQAVQNKPLNDIFIME 650
               RVWAVHDLYSKYPTI RGYASGLYNLAWAQAVQNKPLNDIF+ME
Sbjct: 61  DA--RVWAVHDLYSKYPTICRGYASGLYNLAWAQAVQNKPLNDIFVME 106


>ref|XP_019419694.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Lupinus angustifolius]
          Length = 1263

 Score = 1308 bits (3385), Expect = 0.0
 Identities = 729/1244 (58%), Positives = 854/1244 (68%), Gaps = 37/1244 (2%)
 Frame = +3

Query: 414  ISDTASVEEISADDFNKQXXXXXXXXXXXXXXXXXRVWAVHDLYSKYPTISRGYASGLYN 593
            ISDTASVEEIS +DF KQ                 RVWAV+DLY+KYPTI  GYASGLYN
Sbjct: 34   ISDTASVEEISEEDFKKQQDVIKVNDNKPKEDSTARVWAVNDLYTKYPTICSGYASGLYN 93

Query: 594  LAWAQAVQNKPLNDIFIMEXXXXXXXXXXXXXXXXXXXXXQXXXXXXXXXXXXXXXXXXX 773
            LAWAQAVQNKPLNDIF+M+                                         
Sbjct: 94   LAWAQAVQNKPLNDIFVMDVVNDANVDNSNQLCSGTVKEDVVMNGQVVIDLDKDDGELEE 153

Query: 774  XXXXXXXXXMAGGDASETVSDSELLGVKDVLEGVTVANVAESFAETCTRIQSAVQSKVFT 953
                       G    + V +SE   V++VLEG T A+V E FAE+C R+QSA+      
Sbjct: 154  GEIDGDADPEGGN--VQCVLNSEGKSVREVLEGFTNASVEELFAESCGRLQSALH----- 206

Query: 954  GPADSEKDD-LVRLSFNAIEVVYSVFCSMYNLQKEENKDNILRLLSFLKDEGTHLFSPEH 1130
              A S+KDD LV LSFNAIEV+YSVF SM + QKE+NKD I+RLL   KD+  HL +PE 
Sbjct: 207  --AVSDKDDVLVGLSFNAIEVIYSVFSSMESSQKEQNKDTIIRLLYLAKDQQGHLLTPEQ 264

Query: 1131 MKEIEVMITAINSVGSLGSSEAVGKEENLETHETKTWEISAVKYGGELISFSKPGNSNSI 1310
            +KEI VMI +++S G+L +SEA+ KE+  +++E KT EI   +  GELIS SKP +S SI
Sbjct: 265  LKEILVMIASLDSTGALANSEAINKEKESQSNEIKTLEIQD-RSAGELISSSKPLDSISI 323

Query: 1311 EASEASKSGQSIIKGRGVXXXXXXXXXXXXXXXXXXPTREAPSCFP-------------- 1448
              SEA K GQS  K RGV                  PTRE PSCFP              
Sbjct: 324  GVSEALKFGQSNFKSRGVLVPLFDLHKAHDIDSLPSPTRETPSCFPLNNAFSVGEEVDRP 383

Query: 1449 -----------VKKSLSVGEGMDKSGLPLAGKTGCGKMELDGEGSKFHLYETDALRAVST 1595
                       V KS S GEGM +S LP A KT    ME+D EGSK H Y TDA++AVS+
Sbjct: 384  VLPTHELASFPVNKSFSAGEGMIRSELP-ASKTEAVNMEVDSEGSKLHSYVTDAVKAVSS 442

Query: 1596 YQQKFGRSSFFTNDELPSPTPSGDCEEGVVDTNDEVXXXXXXXXXXXXKPTL--LDQIPV 1769
            YQQKFGRS+FF ++ELPSPTPSGDCE+  VDTN+EV            KP    L+Q+  
Sbjct: 443  YQQKFGRSTFFMSEELPSPTPSGDCEDAAVDTNEEVSSASVAGSAISIKPPSQSLNQLHA 502

Query: 1770 XXXXXXXXXXVHGLVNSRIDAAGSGSYAGKTSAKSRDPRLRFINSDASTLDLN-QPSGTH 1946
                      +HGL++SRIDAA S SY+ K S KSRDPRLR INSDAS LDLN Q S  +
Sbjct: 503  SSASTDRSG-MHGLISSRIDAADSRSYSRKPSVKSRDPRLRVINSDASALDLNHQRSLIN 561

Query: 1947 NMPKVEYGGTITSRKQKTVEEPSLDATVTKKLRRSLENTEHSMREARTMAGNGGWLEDTT 2126
            NMP +E  GTI SRKQK  EEPSLD  V+K+L+ SLEN EH  R+ RT  GN GWLE+ +
Sbjct: 562  NMPNMENDGTIISRKQKVPEEPSLDVAVSKRLKTSLENLEHKTRDPRTGTGNRGWLEEIS 621

Query: 2127 LAGSQLIERNHLMQKGETELNTTFST--SSGNLNVTSNGNEQAPVTS-STTASLPDLLKG 2297
              GSQ I RN++  + +  + T  S+   SGN N+TSNGN+QAPV S +TT S+P + K 
Sbjct: 622  ALGSQSIVRNNVEAEPKRTMGTVNSSCAGSGNFNLTSNGNQQAPVASINTTTSIPAVWKD 681

Query: 2298 IAVNPTMLLNILMEQQRLAAEAKKNSADSASSTLHLRSSNSAKGADTTVNIGPAMTAGIP 2477
            + V+P ML+NILME++RLA E K  S D + +TL+L S+NSA G   T++IG ++T G+ 
Sbjct: 682  LTVSPAMLVNILMERKRLATETKNKSDDYSMNTLNLASANSAMGIGPTMSIGTSVTTGLQ 741

Query: 2478 QNSVGMPPVSSQAASMAQRIQEDSGKIRMKPRDPRRILHGSGALQKSGKLGSEQFKAIVS 2657
            QNSVGM P+SS A +  +   +DSGKIRMKPRDPRR+LHG   + KSG L SEQ  AIV 
Sbjct: 742  QNSVGMLPISSPATTTVRSPHDDSGKIRMKPRDPRRVLHGR-TIPKSGILASEQSNAIVL 800

Query: 2658 PMSNNQGARDNVNAQKSEVGVGTKLVPTKSIAPPDITRQFTSNLKNIADLMSVPQESSDH 2837
            P SNN    DNV+A K EV   TKL P++SIAPPDI   FT NLKNIAD +SV Q+SS++
Sbjct: 801  PTSNNLDTGDNVSASKLEVRADTKLAPSQSIAPPDIAGPFTKNLKNIADTISVTQQSSNN 860

Query: 2838 SPATQNVSSASVPFTLDKAEQ-----SSQNLQAVTGLAPETCASGSSRSQSTWADVEHLF 3002
            SPATQ  SSA V  T D+ EQ     SSQNLQA  G APETCAS SS  QS+W DVEHLF
Sbjct: 861  SPATQAFSSAPV-LTSDRVEQKPVVSSSQNLQASVGSAPETCASVSSTPQSSWGDVEHLF 919

Query: 3003 EGYNEQQKAAIQRERARRLEEQNKMFSARKXXXXXXXXXXXXNSAKFVEVDPVYDEILRK 3182
            +GY+E+QKAAIQRERARR+EEQNKMF+ARK            NSAKFVEVDPV+DEILRK
Sbjct: 920  DGYDEKQKAAIQRERARRIEEQNKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRK 979

Query: 3183 KEQEDQEMPQRHLFRFSHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKAYATQMAKVL 3362
            KE++D+E P RHLFRF H+GMWTKLRPG+WNFLEKA KL+ELHLYTMGNK YAT+MAKVL
Sbjct: 980  KEEQDREKPHRHLFRFPHLGMWTKLRPGIWNFLEKARKLFELHLYTMGNKLYATEMAKVL 1039

Query: 3363 DPKGLLFAGRVISRGDDTESIDGDERAPRNKDLEGVLGMEXXXXXXXXXXRVWPHNRPNL 3542
            DPKG LF GRVISRGDDT+S+DG+ERAP++KDLEGVLGME          RVWPHN+ NL
Sbjct: 1040 DPKGTLFNGRVISRGDDTDSVDGEERAPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNL 1099

Query: 3543 IVVERYLYFPSSRRQFGLTGQSLLEVDRDEVPEAGTLAVCLGVIEKLHQTFFASQSLEEA 3722
            IVVERY YFP SRRQFGL G SLLE+D DE PEAGTLA  LGVIE++HQ FFASQSLEE 
Sbjct: 1100 IVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEAGTLASSLGVIERIHQNFFASQSLEEV 1159

Query: 3723 DVRNILATEQRKILAGCRIVFSRMFPVGETNPHLHPLWQTAEQFGAVCTNQIDEQVTHVV 3902
            DVR+ILA+EQRKILAGCRIVFSR+FPVGE NP+LHPLWQTAEQFGAVCTN ID+QVTHVV
Sbjct: 1160 DVRSILASEQRKILAGCRIVFSRVFPVGEANPNLHPLWQTAEQFGAVCTNHIDDQVTHVV 1219

Query: 3903 ASSPGTDKVNWAFSTGRFVVLPGWVEASALLYRRLNEQDFAIKP 4034
            A+SPGTDKVNWA S GRFVV P WVEASALLYRR NEQDFAIKP
Sbjct: 1220 ANSPGTDKVNWALSIGRFVVHPAWVEASALLYRRANEQDFAIKP 1263


>gb|PNY08592.1| RNA polymerase II C-terminal domain phosphatase 3-like protein
            [Trifolium pratense]
          Length = 1189

 Score = 1303 bits (3372), Expect = 0.0
 Identities = 701/1064 (65%), Positives = 799/1064 (75%), Gaps = 6/1064 (0%)
 Frame = +3

Query: 801  MAGGDASETVSDSELLGVKDVLEGVTVANVAESFAETCTRIQSAVQSKVFTGPADSEKDD 980
            M  GD  ETVS  E+LG+K VLEG+TVANVAE FAETCTRIQ  ++SKVF+G A SEKD+
Sbjct: 157  MLSGDDFETVS--EVLGIKAVLEGITVANVAEPFAETCTRIQGVLRSKVFSGLAGSEKDN 214

Query: 981  LVRLSFNAIEVVYSVFCSMYNLQKEENKDNILRLLSFLKDEGTHLFSPEHMKEIEVMITA 1160
            LV LSFNAI+VVYSVFCSM + QKEENKDNILRLLSFLK+E  HLF+PEH+KEI+VMI A
Sbjct: 215  LVCLSFNAIKVVYSVFCSMEHSQKEENKDNILRLLSFLKNEHAHLFTPEHIKEIQVMINA 274

Query: 1161 INSVGSLGSSEAVGKEENLETHETKTWEISAVKYGGELISFSKPGNSNSIEASEASKSGQ 1340
            I  V + G+S+ +GKEE LE  + KT EI  +K   ELIS SK    N    SE  K GQ
Sbjct: 275  IEYVDASGNSDVIGKEEKLEALD-KTQEILGLK-ASELISSSKLVLDNLTYPSEVFKYGQ 332

Query: 1341 SIIKGRGVXXXXXXXXXXXXXXXXXXPTREAPSCFPVKKSLSVGEGMDKSGLPLAGKTGC 1520
            S IK RGV                  PTREAPS F   K  SVG+G+D+ GLP A KT  
Sbjct: 333  SNIKSRGVMLPLFDLHKVHDLDSLPSPTREAPSGFAGNKLFSVGDGVDRFGLPPAVKTEV 392

Query: 1521 GKMELDGEGSKFHLYETDALRAVSTYQQKFGRSSFFTNDELPSPTPSGDCEEGVVDTNDE 1700
             KMELD + SKFH Y+TDAL+AVSTYQQKF +SSFFT+D+ PSPTPSGDCE GVVDTNDE
Sbjct: 393  EKMELDNKDSKFHNYDTDALKAVSTYQQKFSQSSFFTDDKFPSPTPSGDCEGGVVDTNDE 452

Query: 1701 VXXXXXXXXXXXXKPTLLDQIPVXXXXXXXXXXVHGLVNSRIDAAGSGSYAGKTSAKSRD 1880
            V            +P  LD +PV          +HGL+NSRIDA  SGSY  K SAKSRD
Sbjct: 453  VSSASVTSLLTSSRPPPLDPMPVSSSSSTDKSSMHGLMNSRIDATVSGSYPVKNSAKSRD 512

Query: 1881 PRLRFINSDASTLDLNQPSGTHNMPKVEYGGTITSRKQKTVEEPSLDATVTKKLRRSLEN 2060
            PRLRFINSDASTLDLNQP   +NMP VEY G + SRKQKT EE SLDAT  K+LRRSLEN
Sbjct: 513  PRLRFINSDASTLDLNQPLRANNMPTVEYPGRVISRKQKT-EESSLDATAPKRLRRSLEN 571

Query: 2061 TEHSMREARTMAGNGGWLEDTTLAGSQLIERNHLMQKGETELNTTFSTSSGNLNVTSNGN 2240
            +EH+ R  RT+AG GGW E++T+A                EL  T +TSSGNL VTS+GN
Sbjct: 572  SEHNTRAERTVAGKGGWFEESTVA----------------ELERTMNTSSGNLTVTSDGN 615

Query: 2241 EQAPVTSSTTASLPDLLKGIAVNPTMLLNILMEQQ-RLAAEAKKNSADSASSTLHLRSSN 2417
            EQAPVT  +TA+   +++ +AVNPT+L+NIL++QQ RLAAEA+K   DSA+S LHL +SN
Sbjct: 616  EQAPVTGCSTAASLPVVQNMAVNPTILMNILLDQQQRLAAEAQKKPVDSATSILHLTNSN 675

Query: 2418 SAKGADTTVNIGPAMTAGIPQNSVGMPPVSSQAASMAQRIQEDSGKIRMKPRDPRRILHG 2597
            SA+G     N G AMTAG+PQ+SVG+ P SS A S  Q +Q DSGKIRMKPRDPRRILHG
Sbjct: 676  SARG-----NTGSAMTAGLPQSSVGILPASSPATSTTQPLQVDSGKIRMKPRDPRRILHG 730

Query: 2598 SGALQKSGKLGSEQFKAIVSPMSNNQGARDNVNAQKSEVGVGTKLVPTKSIAPPDITRQF 2777
               LQKS  LGSEQ KAIVSP  NNQG  DNVNAQK +V    KL P +SI  PDITRQF
Sbjct: 731  ISTLQKSENLGSEQSKAIVSPTPNNQGTGDNVNAQKLDVRAAAKLAPIQSITQPDITRQF 790

Query: 2778 TSNLKNIADLMSVPQESSDHSPATQNVSSASVPFTLDKAEQ-----SSQNLQAVTGLAPE 2942
            T NLKNIAD+MSVPQE S +  ATQNVSSASVPFT D+AEQ     +SQNL+   G APE
Sbjct: 791  TRNLKNIADIMSVPQEPSTNPLATQNVSSASVPFTSDRAEQKSSVPNSQNLKDGVGSAPE 850

Query: 2943 TCASGSSRSQSTWADVEHLFEGYNEQQKAAIQRERARRLEEQNKMFSARKXXXXXXXXXX 3122
            TCASGSSR Q+TWADVEHLFEGY+E+QKAAIQRER RRL+EQNKMF+A+K          
Sbjct: 851  TCASGSSRPQNTWADVEHLFEGYDEKQKAAIQRERTRRLDEQNKMFAAKKLCLVLDLDHT 910

Query: 3123 XXNSAKFVEVDPVYDEILRKKEQEDQEMPQRHLFRFSHMGMWTKLRPGVWNFLEKASKLY 3302
              NSAKFVEVDPV+DEILRKKE++D+E PQRHLFRF HMGMWTKLRPGVWNFLEKASKL+
Sbjct: 911  LLNSAKFVEVDPVHDEILRKKEEQDREKPQRHLFRFPHMGMWTKLRPGVWNFLEKASKLF 970

Query: 3303 ELHLYTMGNKAYATQMAKVLDPKGLLFAGRVISRGDDTESIDGDERAPRNKDLEGVLGME 3482
            E+HLYTMGNK YAT+MAKVLDPKG+LFAGRVISRGDD E++D      ++KDLEGVLGME
Sbjct: 971  EMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDAETVD-----TKSKDLEGVLGME 1025

Query: 3483 XXXXXXXXXXRVWPHNRPNLIVVERYLYFPSSRRQFGLTGQSLLEVDRDEVPEAGTLAVC 3662
                      RVWPHN+ NLIVVERY YFP SRRQFGL G SLLE+D DE P+ GTLA  
Sbjct: 1026 SSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPDTGTLASS 1085

Query: 3663 LGVIEKLHQTFFASQSLEEADVRNILATEQRKILAGCRIVFSRMFPVGETNPHLHPLWQT 3842
            LGVIE++H  FFAS+SLEE DVRNILA+EQRKIL GCRIVFSR+FPVGE NPHLHPLWQT
Sbjct: 1086 LGVIERIHHNFFASESLEEVDVRNILASEQRKILDGCRIVFSRVFPVGEANPHLHPLWQT 1145

Query: 3843 AEQFGAVCTNQIDEQVTHVVASSPGTDKVNWAFSTGRFVVLPGW 3974
            AEQFGA CTNQID+QVTHVVA+S GTDKVNWA + G+FVV P W
Sbjct: 1146 AEQFGASCTNQIDDQVTHVVANSLGTDKVNWAMANGKFVVYPSW 1189



 Score =  103 bits (257), Expect = 2e-18
 Identities = 55/87 (63%), Positives = 58/87 (66%), Gaps = 8/87 (9%)
 Frame = +3

Query: 414 ISDTASVEEISADDFNK--------QXXXXXXXXXXXXXXXXXRVWAVHDLYSKYPTISR 569
           ISDTASVEEI+ +DFNK                          RVWAV DLYSKYPTI R
Sbjct: 13  ISDTASVEEITEEDFNKPDVVKVNNNNSDKVKSGSGGGGGGDSRVWAVQDLYSKYPTICR 72

Query: 570 GYASGLYNLAWAQAVQNKPLNDIFIME 650
           GYASGLYNLAWAQAVQNKPLNDIF+ME
Sbjct: 73  GYASGLYNLAWAQAVQNKPLNDIFVME 99


>ref|XP_014497833.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Vigna radiata
            var. radiata]
          Length = 1267

 Score = 1294 bits (3349), Expect = 0.0
 Identities = 715/1099 (65%), Positives = 824/1099 (74%), Gaps = 30/1099 (2%)
 Frame = +3

Query: 828  VSDSELLGVKDVLEGVTVANVAESFAETCTRIQSAVQSKVFTGPADSEKDDLVRLSFNAI 1007
            VSDSE LGV+DVLEGVTVANVAESF +T +R+ +A+ S+V + PADSEKDDL+RLSFNAI
Sbjct: 184  VSDSEQLGVRDVLEGVTVANVAESFVQTSSRLLNAL-SEVLSRPADSEKDDLIRLSFNAI 242

Query: 1008 EVVYSVFCSMYNLQKEENKDNILRLLSFLKD-EGTHLFSPEHMKEIEVMITAINSVGSLG 1184
            EVVYSVF SM +  KE NKD+ILRLLS +KD E   L SP+H+KEI+ M+TAI+SVG+LG
Sbjct: 243  EVVYSVFRSMDSSDKERNKDSILRLLSSVKDQEQAQLLSPKHIKEIQGMMTAIDSVGALG 302

Query: 1185 SSEAVGKEENLETHETKTWEISAVKY--------------GGELISFSKPGNSNSIEASE 1322
            SSE +  +   +T E K+ E SA++                  LIS  KP +S+ I  S 
Sbjct: 303  SSEPIYMKTESQTPEIKSQENSALEVQTHAINIQENQAVEATALISSVKPLHSDIIGGSR 362

Query: 1323 ASKSGQSIIKGRGVXXXXXXXXXXXXXXXXXXPTREAPSCFPVKKSLSVGEGMDKSGLPL 1502
            A K GQ+ IKGRG+                  PTREAPSCFPV K LSVGE M KSG   
Sbjct: 363  ALKLGQNSIKGRGILLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGEAMVKSGS-- 420

Query: 1503 AGKTGCGKMELDGEGS-KFHLYETDALRAVSTYQQKFGRSSFFTNDELPSPTPSGDCEEG 1679
            A K   GKME+D EGS KFHLYETDAL+AVSTYQQKFGRSS FTND+LPSPTPSGDC++ 
Sbjct: 421  AAKMQPGKMEVDSEGSTKFHLYETDALKAVSTYQQKFGRSSLFTNDKLPSPTPSGDCDDM 480

Query: 1680 VVDTNDEVXXXXXXXXXXXXKPTLLDQIPVXXXXXXXXXXVHGLVNSRIDAAGSGSYAGK 1859
            VVDTN+EV            KPTL+DQ PV          + GL+N+R+DAAG GS+  K
Sbjct: 481  VVDTNEEVSSASTGGFLTTTKPTLIDQPPVSATSMDNSRLL-GLINTRVDAAGPGSFPVK 539

Query: 1860 TSAKSRDPRLRFINSDASTLDLNQPSGTHNMPKVEYGGTITSRKQKTVEEPSLDATVTKK 2039
            +SAKSRDPR R IN +A+ +D N     +NMPKVEY G+  SRKQK VEEP  D TV+K+
Sbjct: 540  SSAKSRDPRRRLINPEANAVD-NHSIVINNMPKVEYAGSTISRKQKAVEEP-FDVTVSKR 597

Query: 2040 LRRSLENTEHSMREARTMAGNGGWLEDTTLAGSQLIERNHLMQKGETE----LNTTFSTS 2207
            L+ SLEN EH+  + RT+AG GGWLED T  G+QLIE+N+LM K   E    LNT  S+ 
Sbjct: 598  LKSSLENIEHNSSQVRTIAGTGGWLEDNTGPGTQLIEKNNLMDKFAPEPKKTLNTVSSSC 657

Query: 2208 SGNL--NVTSNGNEQAPVTSSTTAS-LPDLLKGIAVNPTMLLNILMEQQRLAAEAKKNSA 2378
            SG++  N TS  NEQ P+TSS  AS LP +LK I VNPTMLL ++ EQQ     A   S+
Sbjct: 658  SGSVGFNATSIRNEQVPITSSNIASSLPAVLKDIVVNPTMLLGLIFEQQNRLRNAVNKSS 717

Query: 2379 DSASSTLHLRSSNSAKGADTTVNIGPAMTAGIPQNSVGMPPVSSQAASMAQRIQED-SGK 2555
            +SA++ L+  SSNSA GAD+TV+IG +M  G+ Q SVG+ PVSSQ+ S AQ +Q+D SGK
Sbjct: 718  ESATNILNPTSSNSAAGADSTVSIGSSMATGL-QTSVGILPVSSQSTSTAQSLQDDYSGK 776

Query: 2556 IRMKPRDPRRILHGSGALQKSGKLGSEQFKAIVSPMSNNQGARDNVNAQKSEVGVGTKLV 2735
            IRMKPRDPRRILH + ++QKSG         IVSP+SN+Q   DNVNAQK E  V TKLV
Sbjct: 777  IRMKPRDPRRILHTNNSVQKSGN--------IVSPVSNSQVTGDNVNAQKLEGRVDTKLV 828

Query: 2736 PTKSIAPPDITRQFTSNLKNIADLMSVPQESSDHSPATQNVSSASVPFTLDKAEQ----- 2900
            P +S A PDITRQFT NLKNIAD+MSV QESS HS A Q+ SSASVP  +D+ EQ     
Sbjct: 829  PPQSGAAPDITRQFTKNLKNIADIMSVSQESSTHSTAAQSFSSASVPLNIDRGEQKSVVS 888

Query: 2901 SSQNLQAVT-GLAPETCASGSSRSQSTWADVEHLFEGYNEQQKAAIQRERARRLEEQNKM 3077
            +SQNLQA T G A E CA G+SRSQSTW DVEHLFEGY+EQQKAAIQRERARR+EEQNKM
Sbjct: 889  NSQNLQAGTVGSAHEICAPGTSRSQSTWGDVEHLFEGYDEQQKAAIQRERARRIEEQNKM 948

Query: 3078 FSARKXXXXXXXXXXXXNSAKFVEVDPVYDEILRKKEQEDQEMPQRHLFRFSHMGMWTKL 3257
            F+ARK            NSAKFVEVDPV+DEILRKKE++D+E P RHLFRF HMGMWTKL
Sbjct: 949  FAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKL 1008

Query: 3258 RPGVWNFLEKASKLYELHLYTMGNKAYATQMAKVLDPKGLLFAGRVISRGDDTESIDGDE 3437
            RPG+WNFLEKASKLYELHLYTMGNK YAT+MAKVLDPKG+LFAGRVISRGDDT+S+DG+E
Sbjct: 1009 RPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDTDSVDGEE 1068

Query: 3438 RAPRNKDLEGVLGMEXXXXXXXXXXRVWPHNRPNLIVVERYLYFPSSRRQFGLTGQSLLE 3617
            RAP++KDLEGVLGME          RVWPHN+ NLIVVERY YFP SRRQFGL G SLLE
Sbjct: 1069 RAPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLE 1128

Query: 3618 VDRDEVPEAGTLAVCLGVIEKLHQTFFASQSLEEADVRNILATEQRKILAGCRIVFSRMF 3797
            +D DE PEAGTLA  L VIE++HQ FFASQSLEE DVRNILA+EQRKILAGCRIVFSR+F
Sbjct: 1129 IDHDERPEAGTLASSLAVIERIHQNFFASQSLEEVDVRNILASEQRKILAGCRIVFSRVF 1188

Query: 3798 PVGETNPHLHPLWQTAEQFGAVCTNQIDEQVTHVVASSPGTDKVNWAFSTGRFVVLPGWV 3977
            PVGE NPHLHPLWQTAEQFGAVCTNQIDEQVTHVVA+S GTDKVNWA STGRFVV PGWV
Sbjct: 1189 PVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWV 1248

Query: 3978 EASALLYRRLNEQDFAIKP 4034
            EASALLYRR NEQDFAIKP
Sbjct: 1249 EASALLYRRANEQDFAIKP 1267



 Score =  124 bits (312), Expect = 8e-25
 Identities = 68/108 (62%), Positives = 69/108 (63%)
 Frame = +3

Query: 327 MVFGSFLDCSKLKNRXXXXXXXXXXXXXXISDTASVEEISADDFNKQXXXXXXXXXXXXX 506
           MVFGS LDC KL                 ISDTASVEEIS  DFNKQ             
Sbjct: 1   MVFGSLLDCQKLGKLEKMGKEVEDVEEGEISDTASVEEISEADFNKQDVKVNNNNKPNGS 60

Query: 507 XXXXRVWAVHDLYSKYPTISRGYASGLYNLAWAQAVQNKPLNDIFIME 650
               RVWAVHDLYSKYPTI RGYASGLYNLAWAQAVQNKPLNDIF+ME
Sbjct: 61  DA--RVWAVHDLYSKYPTICRGYASGLYNLAWAQAVQNKPLNDIFVME 106


>gb|OIW17294.1| hypothetical protein TanjilG_22406 [Lupinus angustifolius]
          Length = 1236

 Score = 1269 bits (3285), Expect = 0.0
 Identities = 711/1227 (57%), Positives = 836/1227 (68%), Gaps = 37/1227 (3%)
 Frame = +3

Query: 414  ISDTASVEEISADDFNKQXXXXXXXXXXXXXXXXXRVWAVHDLYSKYPTISRGYASGLYN 593
            ISDTASVEEIS +DF KQ                 RVWAV+DLY+KYPTI  GYASGLYN
Sbjct: 13   ISDTASVEEISEEDFKKQQDVIKVNDNKPKEDSTARVWAVNDLYTKYPTICSGYASGLYN 72

Query: 594  LAWAQAVQNKPLNDIFIMEXXXXXXXXXXXXXXXXXXXXXQXXXXXXXXXXXXXXXXXXX 773
            LAWAQAVQNKPLNDIF+M+                                         
Sbjct: 73   LAWAQAVQNKPLNDIFVMDVVNDANVDNSNQLCSGTVKEDVVMNGQVVIDLDKDDGELEE 132

Query: 774  XXXXXXXXXMAGGDASETVSDSELLGVKDVLEGVTVANVAESFAETCTRIQSAVQSKVFT 953
                       G    + V +SE   V++VLEG T A+V E FAE+C R+QSA+      
Sbjct: 133  GEIDGDADPEGGN--VQCVLNSEGKSVREVLEGFTNASVEELFAESCGRLQSALH----- 185

Query: 954  GPADSEKDD-LVRLSFNAIEVVYSVFCSMYNLQKEENKDNILRLLSFLKDEGTHLFSPEH 1130
              A S+KDD LV LSFNAIEV+YSVF SM + QKE+NKD I+RLL   KD+  HL +PE 
Sbjct: 186  --AVSDKDDVLVGLSFNAIEVIYSVFSSMESSQKEQNKDTIIRLLYLAKDQQGHLLTPEQ 243

Query: 1131 MKEIEVMITAINSVGSLGSSEAVGKEENLETHETKTWEISAVKYGGELISFSKPGNSNSI 1310
            +KEI VMI +++S G+L +SEA+ KE+  +++E KT EI   +  GELIS SKP +S SI
Sbjct: 244  LKEILVMIASLDSTGALANSEAINKEKESQSNEIKTLEIQD-RSAGELISSSKPLDSISI 302

Query: 1311 EASEASKSGQSIIKGRGVXXXXXXXXXXXXXXXXXXPTREAPSCFP-------------- 1448
              SEA K GQS  K RGV                  PTRE PSCFP              
Sbjct: 303  GVSEALKFGQSNFKSRGVLVPLFDLHKAHDIDSLPSPTRETPSCFPLNNAFSVGEEVDRP 362

Query: 1449 -----------VKKSLSVGEGMDKSGLPLAGKTGCGKMELDGEGSKFHLYETDALRAVST 1595
                       V KS S GEGM +S LP A KT    ME+D EGSK H Y TDA++AVS+
Sbjct: 363  VLPTHELASFPVNKSFSAGEGMIRSELP-ASKTEAVNMEVDSEGSKLHSYVTDAVKAVSS 421

Query: 1596 YQQKFGRSSFFTNDELPSPTPSGDCEEGVVDTNDEVXXXXXXXXXXXXKPTL--LDQIPV 1769
            YQQKFGRS+FF ++ELPSPTPSGDCE+  VDTN+EV            KP    L+Q+  
Sbjct: 422  YQQKFGRSTFFMSEELPSPTPSGDCEDAAVDTNEEVSSASVAGSAISIKPPSQSLNQLHA 481

Query: 1770 XXXXXXXXXXVHGLVNSRIDAAGSGSYAGKTSAKSRDPRLRFINSDASTLDLN-QPSGTH 1946
                      +HGL++SRIDAA S SY+ K S KSRDPRLR INSDAS LDLN Q S  +
Sbjct: 482  SSASTDRSG-MHGLISSRIDAADSRSYSRKPSVKSRDPRLRVINSDASALDLNHQRSLIN 540

Query: 1947 NMPKVEYGGTITSRKQKTVEEPSLDATVTKKLRRSLENTEHSMREARTMAGNGGWLEDTT 2126
            NMP +E  GTI SRKQK  EEPSLD  V+K+L+ SLEN EH  R+ RT  GN GWLE+ +
Sbjct: 541  NMPNMENDGTIISRKQKVPEEPSLDVAVSKRLKTSLENLEHKTRDPRTGTGNRGWLEEIS 600

Query: 2127 LAGSQLIERNHLMQKGETELNTTFST--SSGNLNVTSNGNEQAPVTS-STTASLPDLLKG 2297
              GSQ I RN++  + +  + T  S+   SGN N+TSNGN+QAPV S +TT S+P + K 
Sbjct: 601  ALGSQSIVRNNVEAEPKRTMGTVNSSCAGSGNFNLTSNGNQQAPVASINTTTSIPAVWKD 660

Query: 2298 IAVNPTMLLNILMEQQRLAAEAKKNSADSASSTLHLRSSNSAKGADTTVNIGPAMTAGIP 2477
            + V+P ML+NILME++RLA E K  S D + +TL+L S+NSA G   T++IG ++T G+ 
Sbjct: 661  LTVSPAMLVNILMERKRLATETKNKSDDYSMNTLNLASANSAMGIGPTMSIGTSVTTGLQ 720

Query: 2478 QNSVGMPPVSSQAASMAQRIQEDSGKIRMKPRDPRRILHGSGALQKSGKLGSEQFKAIVS 2657
            QNSVGM P+SS A +  +   +DSGKIRMKPRDPRR+LHG   + KSG L SEQ  AIV 
Sbjct: 721  QNSVGMLPISSPATTTVRSPHDDSGKIRMKPRDPRRVLHGR-TIPKSGILASEQSNAIVL 779

Query: 2658 PMSNNQGARDNVNAQKSEVGVGTKLVPTKSIAPPDITRQFTSNLKNIADLMSVPQESSDH 2837
            P SNN    DNV+A K EV   TKL P++SIAPPDI   FT NLKNIAD +SV Q+SS++
Sbjct: 780  PTSNNLDTGDNVSASKLEVRADTKLAPSQSIAPPDIAGPFTKNLKNIADTISVTQQSSNN 839

Query: 2838 SPATQNVSSASVPFTLDKAEQ-----SSQNLQAVTGLAPETCASGSSRSQSTWADVEHLF 3002
            SPATQ  SSA V  T D+ EQ     SSQNLQA  G APETCAS SS  QS+W DVEHLF
Sbjct: 840  SPATQAFSSAPV-LTSDRVEQKPVVSSSQNLQASVGSAPETCASVSSTPQSSWGDVEHLF 898

Query: 3003 EGYNEQQKAAIQRERARRLEEQNKMFSARKXXXXXXXXXXXXNSAKFVEVDPVYDEILRK 3182
            +GY+E+QKAAIQRERARR+EEQNKMF+ARK            NSAKFVEVDPV+DEILRK
Sbjct: 899  DGYDEKQKAAIQRERARRIEEQNKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRK 958

Query: 3183 KEQEDQEMPQRHLFRFSHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKAYATQMAKVL 3362
            KE++D+E P RHLFRF H+GMWTKLRPG+WNFLEKA KL+ELHLYTMGNK YAT+MAKVL
Sbjct: 959  KEEQDREKPHRHLFRFPHLGMWTKLRPGIWNFLEKARKLFELHLYTMGNKLYATEMAKVL 1018

Query: 3363 DPKGLLFAGRVISRGDDTESIDGDERAPRNKDLEGVLGMEXXXXXXXXXXRVWPHNRPNL 3542
            DPKG LF GRVISRGDDT+S+DG+ERAP++KDLEGVLGME          RVWPHN+ NL
Sbjct: 1019 DPKGTLFNGRVISRGDDTDSVDGEERAPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNL 1078

Query: 3543 IVVERYLYFPSSRRQFGLTGQSLLEVDRDEVPEAGTLAVCLGVIEKLHQTFFASQSLEEA 3722
            IVVERY YFP SRRQFGL G SLLE+D DE PEAGTLA  LGVIE++HQ FFASQSLEE 
Sbjct: 1079 IVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEAGTLASSLGVIERIHQNFFASQSLEEV 1138

Query: 3723 DVRNILATEQRKILAGCRIVFSRMFPVGETNPHLHPLWQTAEQFGAVCTNQIDEQVTHVV 3902
            DVR+ILA+EQRKILAGCRIVFSR+FPVGE NP+LHPLWQTAEQFGAVCTN ID+QVTHVV
Sbjct: 1139 DVRSILASEQRKILAGCRIVFSRVFPVGEANPNLHPLWQTAEQFGAVCTNHIDDQVTHVV 1198

Query: 3903 ASSPGTDKVNWAFSTGRFVVLPGWVEA 3983
            A+SPGTDKVNWA S GRFVV P  V A
Sbjct: 1199 ANSPGTDKVNWALSIGRFVVHPACVGA 1225


>ref|XP_019419706.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Lupinus angustifolius]
          Length = 1260

 Score = 1240 bits (3209), Expect = 0.0
 Identities = 673/1107 (60%), Positives = 798/1107 (72%), Gaps = 36/1107 (3%)
 Frame = +3

Query: 822  ETVSDSELLGVKDVLEGVTVANVAESFAETCTRIQSAVQSKVFTGPADSEKDDL-VRLSF 998
            ++V +SE   V++VLEG T+ANV E+FAE C ++QS +QS      A S+KDD+ VRL F
Sbjct: 162  QSVLNSEGKSVREVLEGFTIANVEEAFAENCGKLQSVLQSL----HAVSDKDDIIVRLLF 217

Query: 999  NAIEVVYSVFCSMYNLQKEENKDNILRLLSFLKDEGTHLFSPEHMKEIEVMITAINSVGS 1178
            NAIEV+YSVFCSM + QK++N D ILR+L F+ D+  HL +PE +KEI+VMI A++S+G+
Sbjct: 218  NAIEVIYSVFCSMESSQKKQNNDKILRILYFVTDQQAHLLTPEQLKEIQVMIGALDSIGA 277

Query: 1179 LGSSEAVGKEENLETHETKTWEISAVKYGGELISFSKPGNSNSIEASEASKSGQSIIKGR 1358
            L + E +GKE+  +++E KT E    +  GELIS SKP +S SI  SE  K GQS  KGR
Sbjct: 278  LDNGEPIGKEKESQSNEIKTLETQD-RRAGELISSSKPLDSISIGVSEPLKFGQSNFKGR 336

Query: 1359 GVXXXXXXXXXXXXXXXXXXPTREAPSCFPVKKSLSVGEGMDKSGLPLAG---------- 1508
            GV                  PTRE PS FPV  + SV EG+ + GLP             
Sbjct: 337  GVLVPLFDLHKDHDIDSLPSPTRETPSFFPVSNAFSVAEGVVRHGLPTRAIASFPVSKPF 396

Query: 1509 --------------KTGCGKMELDGEGSKFHLYETDALRAVSTYQQKFGRSSFFTNDELP 1646
                          KT    ME+D EGSK H Y TDAL+AVS+YQQKFGRS+FFT++ELP
Sbjct: 397  SAGEEMIRSELPPSKTEAVNMEVDSEGSKLHSYVTDALKAVSSYQQKFGRSTFFTSEELP 456

Query: 1647 SPTPSGDCEEGVVDTNDEVXXXXXXXXXXXXKPTL--LDQIPVXXXXXXXXXXVHGLVNS 1820
            SPTPSGDCE+  VDTN+EV            KP    L+Q+P           +HGL +S
Sbjct: 457  SPTPSGDCEDVAVDTNEEVSSASVAGSAISIKPPSQSLNQLPTSSASTDRSS-MHGLSSS 515

Query: 1821 RIDAAGSGSYAGKTSAKSRDPRLRFINSDASTLDLN-QPSGTHNMPKVEYGGTITSRKQK 1997
            RID AGS SY+ KTS KSRDPRLR INSDAS LDLN QPS  +N+P +E G TI SRKQK
Sbjct: 516  RIDEAGSRSYSRKTSVKSRDPRLRLINSDASALDLNHQPSLMNNVPNMENGRTIISRKQK 575

Query: 1998 TVEEPSLDATVTKKLRRSLENTEHSMREARTMAGNGGWLEDTTLAGSQLIERNHLMQKGE 2177
              EEPSLD  V+K+L+ SLEN EH  R+ RT A   GWLE+T+  GSQ I RN++  + +
Sbjct: 576  AAEEPSLDVAVSKRLKTSLENPEHKTRDPRTAARKRGWLEETSAVGSQSIVRNNVDAEPK 635

Query: 2178 TELNTTFS--TSSGNLNVTSNGNEQAPV-TSSTTASLPDLLKGIAVNPTMLLNILMEQQR 2348
              + T  S  T SGN N+TSNGN+QAP+ TS+TT S+P   K +AV+P +L+NILME+QR
Sbjct: 636  MTMTTVNSSCTGSGNFNLTSNGNQQAPMATSNTTTSIPAAWKDLAVSPAILVNILMERQR 695

Query: 2349 LAAEAKKNSADSASSTLHLRSSNSAKGADTTVNIGPAMTAGIPQNSVGMPPVSSQAASMA 2528
            LAAEAKK S D + + LHL S+NSA G   T++IG +MT G+ QNSVGM P+SS A +  
Sbjct: 696  LAAEAKKKSDDYSINVLHLASANSAMGTGPTMSIGTSMTTGLQQNSVGMLPISSPATTTV 755

Query: 2529 QRIQEDSGKIRMKPRDPRRILHGSGALQKSGKLGSEQFKAIVSPMSNNQGARDNVNAQKS 2708
            +  Q+DSGKIRMKPRDPRR LH  G + KSG L SEQ K +V P SN     DNV+A K 
Sbjct: 756  RSPQDDSGKIRMKPRDPRRFLH-RGTIPKSGILASEQSKEVVIPTSNTPDTGDNVSAPKL 814

Query: 2709 EVGVGTKLVPTKSIAPPDITRQFTSNLKNIADLMSVPQESSDHSPATQNVSSASVPFTLD 2888
            EV   TKL  ++SIAPPDI   FT NLKNIA+ +SV Q+SS+++PATQ  SSA    T D
Sbjct: 815  EVRADTKLTASQSIAPPDIAGPFTRNLKNIANTISVTQQSSNNAPATQTFSSAPA-LTSD 873

Query: 2889 KAEQ-----SSQNLQAVTGLAPETCASGSSRSQSTWADVEHLFEGYNEQQKAAIQRERAR 3053
            + EQ     SSQNLQA  G APETCAS SS  QS+W DVEHLF+GY+E+QKAAIQRERAR
Sbjct: 874  RVEQKPVVSSSQNLQASIGSAPETCASVSSTPQSSWGDVEHLFDGYDEKQKAAIQRERAR 933

Query: 3054 RLEEQNKMFSARKXXXXXXXXXXXXNSAKFVEVDPVYDEILRKKEQEDQEMPQRHLFRFS 3233
            R+EEQNKMF+ARK            NSAKFVEVDPV+DEILRKKE++D+E P RHLFRF 
Sbjct: 934  RIEEQNKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHRHLFRFP 993

Query: 3234 HMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKAYATQMAKVLDPKGLLFAGRVISRGDD 3413
            H+GMWTKLRPG+WNFLEKASKL+ELHLYTMGNK YAT+MAKVLDP+G LF GRVISRGDD
Sbjct: 994  HLGMWTKLRPGIWNFLEKASKLFELHLYTMGNKLYATEMAKVLDPRGTLFNGRVISRGDD 1053

Query: 3414 TESIDGDERAPRNKDLEGVLGMEXXXXXXXXXXRVWPHNRPNLIVVERYLYFPSSRRQFG 3593
             +S+DG+ERAP++KDLEGVLGME          RVWPHN+ NLIVVERY YFP SRRQFG
Sbjct: 1054 IDSVDGEERAPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 1113

Query: 3594 LTGQSLLEVDRDEVPEAGTLAVCLGVIEKLHQTFFASQSLEEADVRNILATEQRKILAGC 3773
            L G SLLE+D DE PEAGTLA  LGVIE++HQ FFASQSLEE DVRNILA+EQRKILAGC
Sbjct: 1114 LPGPSLLEIDHDERPEAGTLASSLGVIERIHQNFFASQSLEEVDVRNILASEQRKILAGC 1173

Query: 3774 RIVFSRMFPVGETNPHLHPLWQTAEQFGAVCTNQIDEQVTHVVASSPGTDKVNWAFSTGR 3953
            RIVFSR+FPVGE NPHLHPLWQTAEQFGAVCTN ID+QVTHVVA+S GTDKVNWA S GR
Sbjct: 1174 RIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNHIDDQVTHVVANSLGTDKVNWALSIGR 1233

Query: 3954 FVVLPGWVEASALLYRRLNEQDFAIKP 4034
            FVV P WVEASALLYRR NEQDFAIKP
Sbjct: 1234 FVVHPSWVEASALLYRRANEQDFAIKP 1260



 Score =  105 bits (263), Expect = 5e-19
 Identities = 52/79 (65%), Positives = 58/79 (73%)
 Frame = +3

Query: 414 ISDTASVEEISADDFNKQXXXXXXXXXXXXXXXXXRVWAVHDLYSKYPTISRGYASGLYN 593
           ISD+ASVEEIS +DF KQ                 RVWAV+DLY+KYPTI  GYASGLYN
Sbjct: 34  ISDSASVEEISEEDFKKQQDVVKVNDNKPKEDSTARVWAVNDLYTKYPTICSGYASGLYN 93

Query: 594 LAWAQAVQNKPLNDIFIME 650
           LAWAQAVQNKPLNDIF+M+
Sbjct: 94  LAWAQAVQNKPLNDIFVMD 112


>ref|XP_019455025.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like 3
            [Lupinus angustifolius]
          Length = 1219

 Score = 1234 bits (3194), Expect = 0.0
 Identities = 685/1167 (58%), Positives = 813/1167 (69%), Gaps = 15/1167 (1%)
 Frame = +3

Query: 519  RVWAVHDLYSKYPTISRGYASGLYNLAWAQAVQNKPLNDIFIMEXXXXXXXXXXXXXXXX 698
            RVWAV+D Y KYP+I RGYASGL+N AWAQAVQNKPLN+I +M+                
Sbjct: 53   RVWAVNDFYPKYPSICRGYASGLHNHAWAQAVQNKPLNEIVVMDNDSNGNDDNSNRLRSG 112

Query: 699  XXXXXQXXXXXXXXXXXXXXXXXXXXXXXXXXXXMAGGDASETVSDSELLGVKDVLEGVT 878
                                              +  G+ S+ V +SE   V+++LEG T
Sbjct: 113  SVKEDNVSVNGHVVIDLDKEDGELEEGEIDDDADIEEGNVSQCVLNSEEKNVREILEGFT 172

Query: 879  VANVAESFAETCTRIQSAVQSKVFTGPADSEKDD-LVRLSFNAIEVVYSVFCSMYNLQKE 1055
            + NV ESFAE+C ++Q+A+QS      A ++KDD LVRLSFNAI+V++S+FCSM + QKE
Sbjct: 173  IDNVEESFAESCDKLQNALQSL----RAVTDKDDVLVRLSFNAIQVIFSLFCSMESSQKE 228

Query: 1056 ENKDNILRLLSFLKDEGTHLFSPEHMKEIEVMITAINSVGSLGSSEAVGKEENLETHETK 1235
            +NKD I RLLSF+KD   HLF+PE +KE  VMIT ++S G+L  SE + KE+  +++E K
Sbjct: 229  KNKDYIPRLLSFVKDRQAHLFAPECLKE--VMITTLDSNGALVDSETIVKEKESQSNEMK 286

Query: 1236 TWEISAVKYGGELISFSKPGNSNSIEASEASKSGQSIIKGRGVXXXXXXXXXXXXXXXXX 1415
            T EI   +  G+LIS SKP +S  I ASEA +SG S  KGRGV                 
Sbjct: 287  TLEIQD-RRTGDLISSSKPFDSIPIGASEALESGPSNFKGRGVLVPLFDLHKDHDIDNLP 345

Query: 1416 XPTREAPSCFPVKKSLSVGEGMD--KSGLPLAGKTGCGKMELDGEGSKFHLYETDALRAV 1589
              TREAPSCFPV  + SVGEGM   +SG P++   G  KME+D EGS  H Y TDAL+AV
Sbjct: 346  SSTREAPSCFPVNNASSVGEGMGMVRSGFPVS-MAGAVKMEVDSEGSNLHPYVTDALKAV 404

Query: 1590 STYQQKFGRSSFFTNDELPSPTPSGDCEEGVVDTNDEVXXXXXXXXXXXXKPTL--LDQI 1763
            S+YQQKFGRSS FT++ELPSPTPSGDCE   +DTN+EV            KP L   DQ+
Sbjct: 405  SSYQQKFGRSSLFTSEELPSPTPSGDCEGAAIDTNEEVSSVSVAGSAMSTKPPLPSSDQL 464

Query: 1764 PVXXXXXXXXXXVHGLVNSRIDAAGSGSYAGKTSAKSRDPRLRFINSDASTLDLN-QPSG 1940
             V          +HGL +S +DA GSGS   K S K RDPRLR INSDAS LDLN +PS 
Sbjct: 465  LVSASASKDRSSMHGLSSSGVDATGSGSLPRKPSVKPRDPRLRLINSDASALDLNHRPSL 524

Query: 1941 THNMPKVEYGGTITSRKQKTVEEPSLDATVTKKLRRSLENTEHSMREARTMAGNGGWLED 2120
             +NMPKVE   TI SRKQK  EEP LD  V+K+L+ SLEN+EH+ R+ RT A N GWLE+
Sbjct: 525  VNNMPKVE---TIISRKQKAAEEPPLDVAVSKRLKTSLENSEHNTRDPRTAARNYGWLEE 581

Query: 2121 TTLAGSQLIERNHLMQKGETELNTTFS--TSSGNLNVTSNGNEQAPV-TSSTTASLPDLL 2291
             T  GS LIERN++    +  ++T  S  T S   NVTSN N+Q PV TS+ T S+P + 
Sbjct: 582  MTPVGSPLIERNNVEADPKKTISTVNSLCTGSSYFNVTSNVNQQVPVATSNATVSIPAVW 641

Query: 2292 KGIAVNPTMLLNILMEQQRLAA-EAKKNSADSASSTLHLRSSNSAKGADTTVNIGPAMTA 2468
            K +AVNPTML+NIL+E+ +LAA EAKK   D + ++LHL ++NSA G   T++ G +MT 
Sbjct: 642  KDLAVNPTMLVNILLERHKLAAAEAKKKPDDYSRNSLHLANANSALGTGPTMSFGTSMTT 701

Query: 2469 GIPQNSVGMPPVSSQAASMAQRIQEDSGKIRMKPRDPRRILHGSGALQKSGKLGSEQFKA 2648
            G  +NSVGM PVS+ A +  + IQ++SG +RMKPRDPR ILHG+  L K G LG EQ +A
Sbjct: 702  GFQKNSVGMLPVSAPATTAGKSIQDNSGNVRMKPRDPRCILHGN-TLPKIGGLGREQSEA 760

Query: 2649 IVSPMSNNQGARDNVNAQKSEVGVGTKLVPTKSIAPPDITRQFTSNLKNIADLMSVPQES 2828
            IVSP  NNQG  DNV+A K EV   TKL P++S A PDI  QF  NLKNIAD++SV + S
Sbjct: 761  IVSPTPNNQGKCDNVSAPKLEVRSDTKLAPSQSSASPDIAGQFPKNLKNIADIISVTRPS 820

Query: 2829 SDHSPATQNVSSASVPFTLDKAEQ-----SSQNLQAVTGLAPETCASGSSRSQSTWADVE 2993
            S+ SPATQ  SSA V  T D+ EQ     SSQNLQA  G  PETCAS S   QSTW DVE
Sbjct: 821  SNDSPATQTFSSAPV-LTSDRVEQKPVASSSQNLQAGVGSVPETCASVSLTPQSTWGDVE 879

Query: 2994 HLFEGYNEQQKAAIQRERARRLEEQNKMFSARKXXXXXXXXXXXXNSAKFVEVDPVYDEI 3173
            HLF+GY+E+QKAAIQRERARR+EEQNKMF+ARK            NSAKFVEVDPV++EI
Sbjct: 880  HLFKGYDEKQKAAIQRERARRIEEQNKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHEEI 939

Query: 3174 LRKKEQEDQEMPQRHLFRFSHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKAYATQMA 3353
            LRKKE++D+E   RHLFRF HMGMWTKLRPG+WNFLEKASKL+ELH+YTMGNK YAT+MA
Sbjct: 940  LRKKEKQDREKHHRHLFRFPHMGMWTKLRPGIWNFLEKASKLFELHVYTMGNKRYATEMA 999

Query: 3354 KVLDPKGLLFAGRVISRGDDTESIDGDERAPRNKDLEGVLGMEXXXXXXXXXXRVWPHNR 3533
            KVLDPKG LF GRVISRGDDT+S+DG+ERAP+ KDLEGVLGME          RVWPHN+
Sbjct: 1000 KVLDPKGTLFKGRVISRGDDTDSVDGEERAPKIKDLEGVLGMESAVVIIDDSVRVWPHNK 1059

Query: 3534 PNLIVVERYLYFPSSRRQFGLTGQSLLEVDRDEVPEAGTLAVCLGVIEKLHQTFFASQSL 3713
             NLIVVERY YFP SRRQFGL G SLLE+D DE PEAGTLA  LGVIE+LHQ FFASQSL
Sbjct: 1060 LNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEAGTLASSLGVIERLHQNFFASQSL 1119

Query: 3714 EEADVRNILATEQRKILAGCRIVFSRMFPVGETNPHLHPLWQTAEQFGAVCTNQIDEQVT 3893
            EE DVRNILA+EQRKILAGCRIVFSRMFPV E NPHLHPLWQTAEQFGAVCTN ID+ VT
Sbjct: 1120 EEVDVRNILASEQRKILAGCRIVFSRMFPVDEANPHLHPLWQTAEQFGAVCTNHIDDHVT 1179

Query: 3894 HVVASSPGTDKVNWAFSTGRFVVLPGW 3974
            HVV  SPGTDKV WA STGRFVV P W
Sbjct: 1180 HVVTCSPGTDKVTWALSTGRFVVHPSW 1206


>ref|XP_013447776.1| carboxy-terminal domain phosphatase-like protein, putative [Medicago
            truncatula]
 gb|KEH21861.1| carboxy-terminal domain phosphatase-like protein, putative [Medicago
            truncatula]
          Length = 958

 Score = 1224 bits (3166), Expect = 0.0
 Identities = 660/1010 (65%), Positives = 748/1010 (74%), Gaps = 8/1010 (0%)
 Frame = +3

Query: 1035 MYNLQKEENKDNILRLLSFLKDEGTHLFSPEHMKEIEVMITAINSVGSLGSSEAVGKEEN 1214
            M NLQKEENKDNI RLLSFLK++  HLF+ EHMK+I+VMIT I+SV +LG++E VGKEE 
Sbjct: 1    MDNLQKEENKDNISRLLSFLKNQ--HLFTMEHMKKIQVMITVIDSVFALGNNEVVGKEEK 58

Query: 1215 LETHETKTWEISAVKYGGELISFSKPGNSNSIEASEASKSGQSIIKGRGVXXXXXXXXXX 1394
            +E   T T +I  +K   E IS S+  + NS  ASEA + GQS + GRG+          
Sbjct: 59   VEALNT-TEQIPGLK-ADEYISSSQLVHDNSTYASEALQYGQSNVVGRGLMLPLFDLHKD 116

Query: 1395 XXXXXXXXPTREAPSCFPVKKSLS-VGEGMDKSGLPLAGKTGCGKMELDGEGSKFHLYET 1571
                    PTREAPSCFPV K  S +G+G+D+ GLP A  T   KMELDG+ SK H+YET
Sbjct: 117  HDLDSLPSPTREAPSCFPVNKLFSDLGDGIDRFGLPPAVCTEAEKMELDGKDSKLHIYET 176

Query: 1572 DALRAVSTYQQKFGRSSFFTNDELPSPTPSGDCEEGVVDTNDEVXXXXXXXXXXXXKPTL 1751
            DAL+AVSTYQQKF RSS+FT+D+ PSPTPSGDCE   VDTNDEV            KP  
Sbjct: 177  DALKAVSTYQQKFSRSSYFTDDKFPSPTPSGDCEGEAVDTNDEVSSASIASSLTSFKPPP 236

Query: 1752 LDQIPVXXXXXXXXXXVHGLVNSRIDAAGSGSYAGKTSAKSRDPRLRFINSDASTLDLNQ 1931
            LDQIPV          +HGLV+SRIDA GSGSY  K+SAKSRDPRLRFIN DASTLDLNQ
Sbjct: 237  LDQIPVSSTSLDRPN-MHGLVDSRIDATGSGSYPAKSSAKSRDPRLRFINPDASTLDLNQ 295

Query: 1932 PSGTHNMPKVEYGGTITSRKQKTVEEPSLDATVTKKLRRSLENTEHSMREARTMAGNGGW 2111
              GTH+MP+VEYGG + SRKQKTVEEPSLDAT  K+LRRSLEN+EH+ RE R MAG GGW
Sbjct: 296  SLGTHSMPRVEYGGRVISRKQKTVEEPSLDATAPKRLRRSLENSEHNTREERAMAGKGGW 355

Query: 2112 LEDTTLAGSQLIERNHLMQKGETELNTTFSTSSGNLNVTSNGNEQAPVTSST-TASLPD- 2285
             E+ T+AGSQL ERNHLMQKGETEL  T STSS NL V++NGNE A VTSS+ TASLP  
Sbjct: 356  FEENTVAGSQLAERNHLMQKGETELKRTISTSSSNLTVSNNGNELASVTSSSATASLPTY 415

Query: 2286 LLKGIAVNPTMLLNILMEQQRLAAEAKKNSADSASSTLHLRSSNSAKGADTTVNIGPAMT 2465
            LL  +AVNP ML+++++E Q   AEA+K   D            SA+G D TVN GPAMT
Sbjct: 416  LLNNVAVNPAMLIHMILEHQHNEAEAQKKPVD------------SARGTDATVNTGPAMT 463

Query: 2466 AGIPQNSVGMPPVSSQAASMAQRIQEDSGKIRMKPRDPRRILHGSGALQKSGKLGSEQFK 2645
            AG+ Q+SVG+ P SS A SM Q + EDSGKIRMKPRDPRR LHGS  L            
Sbjct: 464  AGLTQSSVGILPASSPATSMTQTLPEDSGKIRMKPRDPRRFLHGSSTL------------ 511

Query: 2646 AIVSPMSNNQGARDNVNAQKSEVGVGTKLVPTKSIAPPDITRQFTSNLKNIADLMSVPQE 2825
                              QK +V V TKL P +SIA PDITRQFT NLKNIAD+MSVPQE
Sbjct: 512  ------------------QKFDVRVETKLAPIQSIAQPDITRQFTKNLKNIADIMSVPQE 553

Query: 2826 SSDHSPATQNVSSASVPFTLDKAEQ-----SSQNLQAVTGLAPETCASGSSRSQSTWADV 2990
            +S + PATQNVSSASVPF  D++EQ     +SQNL+   G APETCA GSSR Q+TWADV
Sbjct: 554  TSSNPPATQNVSSASVPFMSDRSEQKSGVPNSQNLKDGVGSAPETCAPGSSRPQNTWADV 613

Query: 2991 EHLFEGYNEQQKAAIQRERARRLEEQNKMFSARKXXXXXXXXXXXXNSAKFVEVDPVYDE 3170
            EHLFE Y+ +QKAAIQRER+RRLEEQ KMF+ARK            NSAKFVEVDPV+DE
Sbjct: 614  EHLFEAYDVKQKAAIQRERSRRLEEQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDE 673

Query: 3171 ILRKKEQEDQEMPQRHLFRFSHMGMWTKLRPGVWNFLEKASKLYELHLYTMGNKAYATQM 3350
            +LRKKEQED+E PQRHLFRF HMGMWTKLRPGVWNFLEKA KL+E+HLYTMGNK YAT+M
Sbjct: 674  MLRKKEQEDREKPQRHLFRFPHMGMWTKLRPGVWNFLEKAGKLFEMHLYTMGNKLYATEM 733

Query: 3351 AKVLDPKGLLFAGRVISRGDDTESIDGDERAPRNKDLEGVLGMEXXXXXXXXXXRVWPHN 3530
            AKVLDPKG+LFAGRVISRGDD E+ D      ++KDLEGVLGME          RVWPHN
Sbjct: 734  AKVLDPKGVLFAGRVISRGDDAETAD-----TKSKDLEGVLGMESSVVIIDDSVRVWPHN 788

Query: 3531 RPNLIVVERYLYFPSSRRQFGLTGQSLLEVDRDEVPEAGTLAVCLGVIEKLHQTFFASQS 3710
            + NLIVVERY YFP SRRQFGL G SLLE+D DE PE+GTLA  LGVIE++HQ FFASQS
Sbjct: 789  KLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPESGTLASSLGVIERIHQNFFASQS 848

Query: 3711 LEEADVRNILATEQRKILAGCRIVFSRMFPVGETNPHLHPLWQTAEQFGAVCTNQIDEQV 3890
            LEE DVRNILA+EQRKIL GCRIVFSRMFPVG+ NPHLHPLWQTAEQFGA CTNQID+QV
Sbjct: 849  LEEVDVRNILASEQRKILDGCRIVFSRMFPVGDANPHLHPLWQTAEQFGASCTNQIDDQV 908

Query: 3891 THVVASSPGTDKVNWAFSTGRFVVLPGWVEASALLYRRLNEQDFAIKPEK 4040
            THVVA SPGTDKVNWA + G+FVV PGWVEASALLYRR NEQDFAIK +K
Sbjct: 909  THVVAHSPGTDKVNWAIANGKFVVHPGWVEASALLYRRANEQDFAIKLDK 958


>gb|OIW05466.1| hypothetical protein TanjilG_12057 [Lupinus angustifolius]
          Length = 1238

 Score = 1217 bits (3150), Expect = 0.0
 Identities = 685/1190 (57%), Positives = 810/1190 (68%), Gaps = 38/1190 (3%)
 Frame = +3

Query: 519  RVWAVHDLYSKYPTISRGYASGLYNLAWAQAVQNKPLNDIFIMEXXXXXXXXXXXXXXXX 698
            RVWAV+D Y KYP+I RGYASGL+N AWAQAVQNKPLN+I +M+                
Sbjct: 53   RVWAVNDFYPKYPSICRGYASGLHNHAWAQAVQNKPLNEIVVMDNDSNGNDDNSNRLRSG 112

Query: 699  XXXXXQXXXXXXXXXXXXXXXXXXXXXXXXXXXXMAGGDASETVSDSELLGVKDVLEGVT 878
                                              +  G+ S+ V +SE   V+++LEG T
Sbjct: 113  SVKEDNVSVNGHVVIDLDKEDGELEEGEIDDDADIEEGNVSQCVLNSEEKNVREILEGFT 172

Query: 879  VANVAESFAETCTRIQSAVQSKVFTGPADSEKDD-LVRLSFNAIEVVYSVFCSMYNLQKE 1055
            + NV ESFAE+C ++Q+A+QS      A ++KDD LVRLSFNAI++    FCSM + QKE
Sbjct: 173  IDNVEESFAESCDKLQNALQSL----RAVTDKDDVLVRLSFNAIQL----FCSMESSQKE 224

Query: 1056 ENKDNILRLLSFLKDEGTHLFSPEHMKEIEVMITAINSVGSLGSSEAVGKEENLETHETK 1235
            +NKD I RLLSF+KD   HLF+PE +KE  VMIT ++S G+L  SE + KE+  +++E K
Sbjct: 225  KNKDYIPRLLSFVKDRQAHLFAPECLKE--VMITTLDSNGALVDSETIVKEKESQSNEMK 282

Query: 1236 TWEISAVKYGGELISFSKPGNSNSIEASEASKSGQSIIKGRGVXXXXXXXXXXXXXXXXX 1415
            T EI   +  G+LIS SKP +S  I ASEA +SG S  KGRGV                 
Sbjct: 283  TLEIQD-RRTGDLISSSKPFDSIPIGASEALESGPSNFKGRGVLVPLFDLHKDHDIDNLP 341

Query: 1416 XPTREAPSCFPVK-------------------------KSLSVGEGMDKSGLPLAGKTGC 1520
              TREAPSCFPV                          KS SVGEGM +SG P++   G 
Sbjct: 342  SSTREAPSCFPVNNASSVGEGMVRPVLPTREAPRFCLNKSFSVGEGMVRSGFPVS-MAGA 400

Query: 1521 GKMELDGEGSKFHLYETDALRAVSTYQQKFGRSSFFTNDELPSPTPSGDCEEGVVDTNDE 1700
             KME+D EGS  H Y TDAL+AVS+YQQKFGRSS FT++ELPSPTPSGDCE   +DTN+E
Sbjct: 401  VKMEVDSEGSNLHPYVTDALKAVSSYQQKFGRSSLFTSEELPSPTPSGDCEGAAIDTNEE 460

Query: 1701 VXXXXXXXXXXXXKPTL--LDQIPVXXXXXXXXXXVHGLVNSRIDAAGSGSYAGKTSAKS 1874
            V            KP L   DQ+ V          +HGL +S +DA GSGS   K S K 
Sbjct: 461  VSSVSVAGSAMSTKPPLPSSDQLLVSASASKDRSSMHGLSSSGVDATGSGSLPRKPSVKP 520

Query: 1875 RDPRLRFINSDASTLDLN-QPSGTHNMPKVEYGGTITSRKQKTVEEPSLDATVTKKLRRS 2051
            RDPRLR INSDAS LDLN +PS  +NMPKVE   TI SRKQK  EEP LD  V+K+L+ S
Sbjct: 521  RDPRLRLINSDASALDLNHRPSLVNNMPKVE---TIISRKQKAAEEPPLDVAVSKRLKTS 577

Query: 2052 LENTEHSMREARTMAGNGGWLEDTTLAGSQLIERNHLMQKGETELNTTFS--TSSGNLNV 2225
            LEN+EH+ R+ RT A N GWLE+ T  GS LIERN++    +  ++T  S  T S   NV
Sbjct: 578  LENSEHNTRDPRTAARNYGWLEEMTPVGSPLIERNNVEADPKKTISTVNSLCTGSSYFNV 637

Query: 2226 TSNGNEQAPV-TSSTTASLPDLLKGIAVNPTMLLNILMEQQRLAA-EAKKNSADSASSTL 2399
            TSN N+Q PV TS+ T S+P + K +AVNPTML+NIL+E+ +LAA EAKK   D + ++L
Sbjct: 638  TSNVNQQVPVATSNATVSIPAVWKDLAVNPTMLVNILLERHKLAAAEAKKKPDDYSRNSL 697

Query: 2400 HLRSSNSAKGADTTVNIGPAMTAGIPQNSVGMPPVSSQAASMAQRIQEDSGKIRMKPRDP 2579
            HL ++NSA G   T++ G +MT G  +NSVGM PVS+ A +  + IQ++SG +RMKPRDP
Sbjct: 698  HLANANSALGTGPTMSFGTSMTTGFQKNSVGMLPVSAPATTAGKSIQDNSGNVRMKPRDP 757

Query: 2580 RRILHGSGALQKSGKLGSEQFKAIVSPMSNNQGARDNVNAQKSEVGVGTKLVPTKSIAPP 2759
            R ILHG+  L K G LG EQ +AIVSP  NNQG  DNV+A K EV   TKL P++S A P
Sbjct: 758  RCILHGN-TLPKIGGLGREQSEAIVSPTPNNQGKCDNVSAPKLEVRSDTKLAPSQSSASP 816

Query: 2760 DITRQFTSNLKNIADLMSVPQESSDHSPATQNVSSASVPFTLDKAEQ-----SSQNLQAV 2924
            DI  QF  NLKNIAD++SV + SS+ SPATQ  SSA V  T D+ EQ     SSQNLQA 
Sbjct: 817  DIAGQFPKNLKNIADIISVTRPSSNDSPATQTFSSAPV-LTSDRVEQKPVASSSQNLQAG 875

Query: 2925 TGLAPETCASGSSRSQSTWADVEHLFEGYNEQQKAAIQRERARRLEEQNKMFSARKXXXX 3104
             G  PETCAS S   QSTW DVEHLF+GY+E+QKAAIQRERARR+EEQNKMF+ARK    
Sbjct: 876  VGSVPETCASVSLTPQSTWGDVEHLFKGYDEKQKAAIQRERARRIEEQNKMFAARKLCLV 935

Query: 3105 XXXXXXXXNSAKFVEVDPVYDEILRKKEQEDQEMPQRHLFRFSHMGMWTKLRPGVWNFLE 3284
                    NSAKFVEVDPV++EILRKKE++D+E   RHLFRF HMGMWTKLRPG+WNFLE
Sbjct: 936  LDLDHTLLNSAKFVEVDPVHEEILRKKEKQDREKHHRHLFRFPHMGMWTKLRPGIWNFLE 995

Query: 3285 KASKLYELHLYTMGNKAYATQMAKVLDPKGLLFAGRVISRGDDTESIDGDERAPRNKDLE 3464
            KASKL+ELH+YTMGNK YAT+MAKVLDPKG LF GRVISRGDDT+S+DG+ERAP+ KDLE
Sbjct: 996  KASKLFELHVYTMGNKRYATEMAKVLDPKGTLFKGRVISRGDDTDSVDGEERAPKIKDLE 1055

Query: 3465 GVLGMEXXXXXXXXXXRVWPHNRPNLIVVERYLYFPSSRRQFGLTGQSLLEVDRDEVPEA 3644
            GVLGME          RVWPHN+ NLIVVERY YFP SRRQFGL G SLLE+D DE PEA
Sbjct: 1056 GVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEA 1115

Query: 3645 GTLAVCLGVIEKLHQTFFASQSLEEADVRNILATEQRKILAGCRIVFSRMFPVGETNPHL 3824
            GTLA  LGVIE+LHQ FFASQSLEE DVRNILA+EQRKILAGCRIVFSRMFPV E NPHL
Sbjct: 1116 GTLASSLGVIERLHQNFFASQSLEEVDVRNILASEQRKILAGCRIVFSRMFPVDEANPHL 1175

Query: 3825 HPLWQTAEQFGAVCTNQIDEQVTHVVASSPGTDKVNWAFSTGRFVVLPGW 3974
            HPLWQTAEQFGAVCTN ID+ VTHVV  SPGTDKV WA STGRFVV P W
Sbjct: 1176 HPLWQTAEQFGAVCTNHIDDHVTHVVTCSPGTDKVTWALSTGRFVVHPSW 1225

