BLASTX nr result

ID: Paeonia22_contig00005823 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia22_contig00005823
         (3500 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform...  1407   0.0  
ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal doma...  1398   0.0  
ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citr...  1396   0.0  
ref|XP_007214548.1| hypothetical protein PRUPE_ppa000988mg [Prun...  1374   0.0  
ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Popu...  1371   0.0  
ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal doma...  1368   0.0  
ref|XP_007025681.1| C-terminal domain phosphatase-like 1 isoform...  1354   0.0  
ref|XP_002519032.1| double-stranded RNA binding protein, putativ...  1343   0.0  
ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal doma...  1330   0.0  
ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal doma...  1330   0.0  
emb|CAN72816.1| hypothetical protein VITISV_004100 [Vitis vinifera]  1327   0.0  
ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Popu...  1318   0.0  
ref|XP_007159305.1| hypothetical protein PHAVU_002G226900g [Phas...  1316   0.0  
ref|XP_004505032.1| PREDICTED: RNA polymerase II C-terminal doma...  1300   0.0  
ref|XP_006583810.1| PREDICTED: RNA polymerase II C-terminal doma...  1289   0.0  
ref|XP_003545893.1| PREDICTED: RNA polymerase II C-terminal doma...  1287   0.0  
ref|XP_006597421.1| PREDICTED: RNA polymerase II C-terminal doma...  1255   0.0  
ref|XP_006597420.1| PREDICTED: RNA polymerase II C-terminal doma...  1255   0.0  
ref|XP_003543063.1| PREDICTED: RNA polymerase II C-terminal doma...  1253   0.0  
ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal doma...  1241   0.0  

>ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform 1 [Theobroma cacao]
            gi|508781046|gb|EOY28302.1| C-terminal domain
            phosphatase-like 1 isoform 1 [Theobroma cacao]
          Length = 978

 Score = 1407 bits (3641), Expect = 0.0
 Identities = 729/980 (74%), Positives = 810/980 (82%), Gaps = 40/980 (4%)
 Frame = -3

Query: 3333 MYKSVVYRGDDLLGEVEIYPQNQNVDMRD-----------------REIRISHFSQSSER 3205
            MYKSVVYRG+++LGEVEIYPQ Q    +                  +EIRI + +Q SER
Sbjct: 4    MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63

Query: 3204 CPPLAVLHTITSCGVCFKMESPN----SQSQDSP-LFVLHSSCLRENKTAVMSLGEEELH 3040
            CPPLAVLHTITS G+CFKMES      S SQDSP L +LHS C+R+NKTAVM +G+ ELH
Sbjct: 64   CPPLAVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCELH 123

Query: 3039 LVAMYSRKNERQSPCFWGFIIASGLYNSCLVMLNLRCLGIVFDLDETLIVANTWRSFEDR 2860
            LVAMYSR ++R  PCFWGF ++ GLY+SCL+MLNLRCLGIVFDLDETLIVANT RSFEDR
Sbjct: 124  LVAMYSRNSDR--PCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSFEDR 181

Query: 2859 IEALQRKISTETDLQRIAGMQAEVKRYQDDKIILKQYAENDQVIENGKVIKVQSEVVPAL 2680
            IEALQRK++TE D QR+AGM AE+KRYQDDK ILKQYAENDQV+ENGKVIK+QSEVVPAL
Sbjct: 182  IEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVVPAL 241

Query: 2679 SVNHQPIVRPIIRLQEKNVILTRINPKVRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 2500
            S NHQPI+RP+IRLQEKN+ILTRINP++RDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV
Sbjct: 242  SDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 301

Query: 2499 CTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLEKSLFNVFQKGNCHPKMALVI 2320
            CTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSG  KSLFNVFQ G CHPKMALVI
Sbjct: 302  CTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVI 361

Query: 2319 DDRLKVWDENDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDGL 2140
            DDRLKVWDE DQPRVHVVPAFAPYYAPQAEANN +PVLCVARNVACNVRGGFF+EFD+GL
Sbjct: 362  DDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGL 421

Query: 2139 LQRIPEVAYEDDVRDIPSPPDVSNYLVSEDDASALNVNKDPLRSDGMADSEVERRLKEAI 1960
            LQRIPE++YEDD++DIPSPPDV NYLVSEDD SALN NKDPL  DGMAD+EVERRLKEAI
Sbjct: 422  LQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERRLKEAI 481

Query: 1959 LASSVVPSAVANLDPRLGSSLQYTVASSIPILLPT-SQTSVMPLANKQFPQVASLIKPLV 1783
             A+S V SA  NLDPRL  SLQYT+ SS   + P+ SQ S++  +N QFP  A ++KP+ 
Sbjct: 482  SATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPVVKPVA 541

Query: 1782 HVGPSESSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDNRDHTSSEPPF-PVRPSLQ 1606
             V   E SLQSSPAREEGEVPESELDPDTRRRLLILQHGQD RDHT  EP F PVRP++Q
Sbjct: 542  PVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPVRPTMQ 601

Query: 1605 VSVPRVQSRGNWFPVEEEMSPRQLNRAVPKEFPLDSEVMHIDKQRGHPQSFFHKVESSIQ 1426
            VSVPR QSRG+WF  EEEMSPRQLNRA PKEFPLDSE MHI+K R HP  FF KVESSI 
Sbjct: 602  VSVPRGQSRGSWFAAEEEMSPRQLNRAAPKEFPLDSERMHIEKHR-HP-PFFPKVESSIP 659

Query: 1425 CDRILHENQRLPKETLHKDDRLRLNHPLSSFHSFPGEEAPLGRSSSSNRNLDVESGRSVP 1246
             DR+L ENQRL KE LH+DDRL LNH  SS+HSF GEE PL +SSSS+R+LD ESGR+V 
Sbjct: 660  SDRLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFESGRTVT 719

Query: 1245 YVETPVGMLQDIATKCGTKVEFRPALVAAAELQFSIEVWFAGEKIGEGIGRTRREAQHQA 1066
              ET  G+LQDIA KCG KVEFRPALVA+ +LQFSIE WFAGEK+GEG+GRTRREAQ QA
Sbjct: 720  SGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFAGEKVGEGVGRTRREAQRQA 779

Query: 1065 AQGSLMHLANRYLSHV---------------NTNDNGFVSDGHSSGYQQFLKEESLSFST 931
            A+ S+ +LAN YLS +               N NDNGF S+ +S G Q   KEESLSFST
Sbjct: 780  AEESIKNLANTYLSRIKPDSGSAEGDLSRLHNINDNGFPSNVNSFGNQLLAKEESLSFST 839

Query: 930  TSDPSRQPDPRLDSSKKSMGSVSALKELCMMEGLGVVFQGQPP-PSNSIQKDEVYAQVEV 754
             S+ SR  DPRL+ SKKSMGSV+ALKELCMMEGLGVVFQ QPP  SN++QKDEVYAQVE+
Sbjct: 840  ASEQSRLADPRLEGSKKSMGSVTALKELCMMEGLGVVFQPQPPSSSNALQKDEVYAQVEI 899

Query: 753  DGQVLGKGIGLTWDEAKMQAAEKALGSLKSMHGGQYSHKRQGSPRSFQGMSSKRFKSEFP 574
            DGQVLGKG GLTW+EAKMQAAEKALGSL+SM  GQYS KRQGSPRS QGM +KR K EFP
Sbjct: 900  DGQVLGKGTGLTWEEAKMQAAEKALGSLRSML-GQYSQKRQGSPRSLQGMQNKRLKPEFP 958

Query: 573  RAVQRMPSASRYPKNASPVP 514
            R +QRMPS+ RYPKNA PVP
Sbjct: 959  RVLQRMPSSGRYPKNAPPVP 978


>ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Citrus sinensis]
          Length = 957

 Score = 1398 bits (3618), Expect = 0.0
 Identities = 728/967 (75%), Positives = 800/967 (82%), Gaps = 27/967 (2%)
 Frame = -3

Query: 3333 MYKSVVYRGDDLLGEVEIYPQNQ---------NVDMRDREIRISHFSQSSERCPPLAVLH 3181
            MYK+V Y G ++LGEVEIYPQ Q         N  + D EIRIS+FS++SERCPPLAVLH
Sbjct: 1    MYKTVAYLGKEILGEVEIYPQQQGEGGEGEEKNKKVFD-EIRISYFSEASERCPPLAVLH 59

Query: 3180 TITSCGVCFKMESPNSQSQDSPLFVLHSSCLRENKTAVMSLG-EEELHLVAMYSRKNERQ 3004
            TIT+ G+CFKMES +S +    L +LHSSC+RENKTAVM LG  EELHLVAMYSR NE+Q
Sbjct: 60   TITASGICFKMESKSSDNIQ--LHLLHSSCIRENKTAVMPLGLTEELHLVAMYSRNNEKQ 117

Query: 3003 SPCFWGFIIASGLYNSCLVMLNLRCLGIVFDLDETLIVANTWRSFEDRIEALQRKISTET 2824
             PCFW F + SGLYNSCL MLNLRCLGIVFDLDETLIVANT RSFEDRIEAL RKISTE 
Sbjct: 118  YPCFWAFSVGSGLYNSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKISTEV 177

Query: 2823 DLQRIAGMQAEVKRYQDDKIILKQYAENDQVIENGKVIKVQSEVVPALSVNHQPIVRPII 2644
            D QRIAGMQAEVKRYQDDK ILKQYAENDQV ENGKVIKVQSEVVPALS +HQ +VRP+I
Sbjct: 178  DPQRIAGMQAEVKRYQDDKNILKQYAENDQVNENGKVIKVQSEVVPALSDSHQALVRPLI 237

Query: 2643 RLQEKNVILTRINPKVRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEM 2464
            RLQEKN+ILTRINP++RDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEM
Sbjct: 238  RLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEM 297

Query: 2463 WRLLDPESNLINSKELLDRIVCVKSGLEKSLFNVFQKGNCHPKMALVIDDRLKVWDENDQ 2284
            WRLLDPESNLIN+KELLDRIVCVKSG  KSLFNVFQ G CHPKMALVIDDRLKVWD+ DQ
Sbjct: 298  WRLLDPESNLINTKELLDRIVCVKSGSRKSLFNVFQDGTCHPKMALVIDDRLKVWDDKDQ 357

Query: 2283 PRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDGLLQRIPEVAYEDD 2104
            PRVHVVPAFAPYYAPQAEANNA+PVLCVARN+ACNVRGGFFKEFD+GLLQRIPE++YEDD
Sbjct: 358  PRVHVVPAFAPYYAPQAEANNAIPVLCVARNIACNVRGGFFKEFDEGLLQRIPEISYEDD 417

Query: 2103 VRDIPSPPDVSNYLVSEDDASALNVNKDPLRSDGMADSEVERRLKEAILASSVVPSAVAN 1924
            V+DIPSPPDVSNYLVSEDDA+  N  KDPL  DGMAD+EVERRLKEAI AS+ + SAVAN
Sbjct: 418  VKDIPSPPDVSNYLVSEDDAATANGIKDPLSFDGMADAEVERRLKEAIAASATISSAVAN 477

Query: 1923 LDPRLGSSLQYTV-ASSIPILLPTSQTSVMPLANKQFPQVASLIKPLVHVGPSESSLQSS 1747
            LDPRL +  QYT+ +SS    LPTSQ +VMPLAN QFP   SL+KPL HVGP E SLQSS
Sbjct: 478  LDPRL-APFQYTMPSSSSTTTLPTSQAAVMPLANMQFPPATSLVKPLGHVGPPEQSLQSS 536

Query: 1746 PAREEGEVPESELDPDTRRRLLILQHGQDNRDHTSSEPPFPVRPSLQVSVPRVQSRGNWF 1567
            PAREEGEVPESELDPDTRRRLLILQHG D R++  SE PFP R  +QVSVPRV SRG+WF
Sbjct: 537  PAREEGEVPESELDPDTRRRLLILQHGMDTRENAPSEAPFPARTQMQVSVPRVPSRGSWF 596

Query: 1566 PVEEEMSPRQLNRAVPKEFPLDSEVMHIDKQRGHPQSFFHKVESSIQCDRILHENQRLPK 1387
            PVEEEMSPRQLNRAVPKEFPL+SE M I+K R    SFF K+E+    DR  HENQR+PK
Sbjct: 597  PVEEEMSPRQLNRAVPKEFPLNSEAMQIEKHRPPHPSFFPKIENPSTSDR-PHENQRMPK 655

Query: 1386 ETLHKDDRLRLNHPLSSFHSFPGEEAPLGRSSSSNRNLDVESGRSVPYVETPVGMLQDIA 1207
            E L +DDRLRLNH LS + SF GEE PL RSSSS+R++D ESGR V   ETP G+LQDIA
Sbjct: 656  EALRRDDRLRLNHTLSDYQSFSGEEIPLSRSSSSSRDVDFESGRDVSSTETPSGVLQDIA 715

Query: 1206 TKCGTKVEFRPALVAAAELQFSIEVWFAGEKIGEGIGRTRREAQHQAAQGSLMHLANRYL 1027
             KCGTKVEFRPALVA+ ELQFSIE WFAGEKIGEGIGRTRREAQ QAA+GS+ HLAN Y+
Sbjct: 716  MKCGTKVEFRPALVASTELQFSIEAWFAGEKIGEGIGRTRREAQRQAAEGSIKHLANVYM 775

Query: 1026 SHV---------------NTNDNGFVSDGHSSGYQQFLKEESLSFSTTSDPSRQPDPRLD 892
              V               N N+N F+ + +S G Q   K+ESLS    S+PS+  DPRL+
Sbjct: 776  LRVKSDSGSGHGDGSRFSNANENCFMGEINSFGGQPLAKDESLS----SEPSKLVDPRLE 831

Query: 891  SSKKSMGSVSALKELCMMEGLGVVFQGQPPPS-NSIQKDEVYAQVEVDGQVLGKGIGLTW 715
             SKK MGSVSALKELCM EGLGVVFQ QPP S NS+QKDEVYAQVE+DGQVLGKGIG TW
Sbjct: 832  GSKKLMGSVSALKELCMTEGLGVVFQQQPPSSANSVQKDEVYAQVEIDGQVLGKGIGSTW 891

Query: 714  DEAKMQAAEKALGSLKSMHGGQYSHKRQGSPRSFQGMSSKRFKSEFPRAVQRMPSASRYP 535
            DEAKMQAAEKALGSL+SM  GQ+  K QGSPRS QGM +KR K EFPR +QRMP + RYP
Sbjct: 892  DEAKMQAAEKALGSLRSMF-GQFPQKHQGSPRSLQGMPNKRLKPEFPRVLQRMPPSGRYP 950

Query: 534  KNASPVP 514
            KNA PVP
Sbjct: 951  KNAPPVP 957


>ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citrus clementina]
            gi|557551913|gb|ESR62542.1| hypothetical protein
            CICLE_v10014168mg [Citrus clementina]
          Length = 957

 Score = 1396 bits (3614), Expect = 0.0
 Identities = 728/967 (75%), Positives = 800/967 (82%), Gaps = 27/967 (2%)
 Frame = -3

Query: 3333 MYKSVVYRGDDLLGEVEIYPQNQ---------NVDMRDREIRISHFSQSSERCPPLAVLH 3181
            MYK+V Y G ++LGEVEIYPQ Q         N  + D EIRIS+FS++SERCPPLAVLH
Sbjct: 1    MYKTVAYLGKEILGEVEIYPQQQGEGGEGEEKNKKVFD-EIRISYFSEASERCPPLAVLH 59

Query: 3180 TITSCGVCFKMESPNSQSQDSPLFVLHSSCLRENKTAVMSLG-EEELHLVAMYSRKNERQ 3004
            TIT+ G+CFKMES +S +    L +LHSSC+RENKTAVM LG  EELHLVAMYSR NE+Q
Sbjct: 60   TITASGICFKMESKSSDNVQ--LHLLHSSCIRENKTAVMLLGLTEELHLVAMYSRNNEKQ 117

Query: 3003 SPCFWGFIIASGLYNSCLVMLNLRCLGIVFDLDETLIVANTWRSFEDRIEALQRKISTET 2824
             PCFW F + SGLYNSCL MLNLRCLGIVFDLDETLIVANT RSFEDRIEAL RKISTE 
Sbjct: 118  YPCFWAFSVGSGLYNSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALLRKISTEV 177

Query: 2823 DLQRIAGMQAEVKRYQDDKIILKQYAENDQVIENGKVIKVQSEVVPALSVNHQPIVRPII 2644
            D QRIAGMQAEVKRYQDDK ILKQYAENDQV ENGKVIKVQSEVVPALS +HQ +VRP+I
Sbjct: 178  DPQRIAGMQAEVKRYQDDKNILKQYAENDQVNENGKVIKVQSEVVPALSDSHQALVRPLI 237

Query: 2643 RLQEKNVILTRINPKVRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEM 2464
            RLQEKN+ILTRINP++RDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEM
Sbjct: 238  RLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEM 297

Query: 2463 WRLLDPESNLINSKELLDRIVCVKSGLEKSLFNVFQKGNCHPKMALVIDDRLKVWDENDQ 2284
            WRLLDPESNLIN+KELLDRIVCVKSG  KSLFNVFQ G CHPKMALVIDDRLKVWDE DQ
Sbjct: 298  WRLLDPESNLINTKELLDRIVCVKSGSRKSLFNVFQDGTCHPKMALVIDDRLKVWDEKDQ 357

Query: 2283 PRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDGLLQRIPEVAYEDD 2104
             RVHVVPAFAPYYAPQAEANNA+PVLCVARN+ACNVRGGFFKEFD+GLLQRIPE++YEDD
Sbjct: 358  SRVHVVPAFAPYYAPQAEANNAIPVLCVARNIACNVRGGFFKEFDEGLLQRIPEISYEDD 417

Query: 2103 VRDIPSPPDVSNYLVSEDDASALNVNKDPLRSDGMADSEVERRLKEAILASSVVPSAVAN 1924
            V++IPSPPDVSNYLVSEDDA+  N  KDPL  DGMAD+EVERRLKEAI AS+ + SAVAN
Sbjct: 418  VKEIPSPPDVSNYLVSEDDAATANGIKDPLSFDGMADAEVERRLKEAIAASATISSAVAN 477

Query: 1923 LDPRLGSSLQYTV-ASSIPILLPTSQTSVMPLANKQFPQVASLIKPLVHVGPSESSLQSS 1747
            LDPRL +  QYT+ +SS    LPTSQ +VMPLAN QFP   SL+KPL HVGP E  LQSS
Sbjct: 478  LDPRL-APFQYTMPSSSSTTTLPTSQAAVMPLANMQFPPATSLVKPLGHVGPPEQCLQSS 536

Query: 1746 PAREEGEVPESELDPDTRRRLLILQHGQDNRDHTSSEPPFPVRPSLQVSVPRVQSRGNWF 1567
            PAREEGEVPESELDPDTRRRLLILQHG D R++  SE PFP R  +QVSVPRV SRG+WF
Sbjct: 537  PAREEGEVPESELDPDTRRRLLILQHGMDTRENAPSEAPFPARTQMQVSVPRVPSRGSWF 596

Query: 1566 PVEEEMSPRQLNRAVPKEFPLDSEVMHIDKQRGHPQSFFHKVESSIQCDRILHENQRLPK 1387
            PVEEEMSPRQLNRAVPKEFPL+SE M I+K R    SFF K+E+SI  DR  HENQR+PK
Sbjct: 597  PVEEEMSPRQLNRAVPKEFPLNSEAMQIEKHRPPHPSFFPKIENSITSDR-PHENQRMPK 655

Query: 1386 ETLHKDDRLRLNHPLSSFHSFPGEEAPLGRSSSSNRNLDVESGRSVPYVETPVGMLQDIA 1207
            E L +DDRLRLNH LS + SF GEE PL RSSSS+R++D ESGR V   ETP G+LQDIA
Sbjct: 656  EALRRDDRLRLNHTLSDYQSFSGEEIPLSRSSSSSRDVDFESGRDVSSTETPSGVLQDIA 715

Query: 1206 TKCGTKVEFRPALVAAAELQFSIEVWFAGEKIGEGIGRTRREAQHQAAQGSLMHLANRYL 1027
             KCGTKVEFRPALVA+ ELQFSIE WFAGEKIGEGIGRTRREAQ QAA+GS+ HLAN Y+
Sbjct: 716  MKCGTKVEFRPALVASTELQFSIEAWFAGEKIGEGIGRTRREAQRQAAEGSIKHLANVYV 775

Query: 1026 SHV---------------NTNDNGFVSDGHSSGYQQFLKEESLSFSTTSDPSRQPDPRLD 892
              V               N N+N F+ + +S G Q   K+ESLS    S+PS+  DPRL+
Sbjct: 776  LRVKSDSGSGHGDGSRFSNANENCFMGEINSFGGQPLAKDESLS----SEPSKLVDPRLE 831

Query: 891  SSKKSMGSVSALKELCMMEGLGVVFQGQPPPS-NSIQKDEVYAQVEVDGQVLGKGIGLTW 715
             SKK MGSVSALKELCM EGLGVVFQ QPP S NS+QKDEVYAQVE+DGQVLGKGIG TW
Sbjct: 832  GSKKLMGSVSALKELCMTEGLGVVFQQQPPSSANSVQKDEVYAQVEIDGQVLGKGIGSTW 891

Query: 714  DEAKMQAAEKALGSLKSMHGGQYSHKRQGSPRSFQGMSSKRFKSEFPRAVQRMPSASRYP 535
            DEAKMQAAEKALGSL+SM  GQ+  K QGSPRS QGM +KR K EFPR +QRMP + RYP
Sbjct: 892  DEAKMQAAEKALGSLRSMF-GQFPQKHQGSPRSLQGMPNKRLKPEFPRVLQRMPPSGRYP 950

Query: 534  KNASPVP 514
            KNA PVP
Sbjct: 951  KNAPPVP 957


>ref|XP_007214548.1| hypothetical protein PRUPE_ppa000988mg [Prunus persica]
            gi|462410413|gb|EMJ15747.1| hypothetical protein
            PRUPE_ppa000988mg [Prunus persica]
          Length = 940

 Score = 1374 bits (3557), Expect = 0.0
 Identities = 705/963 (73%), Positives = 790/963 (82%), Gaps = 23/963 (2%)
 Frame = -3

Query: 3333 MYKSVVYRGDDLLGEVEIYPQ-----NQNVDMRD--REIRISHFSQSSERCPPLAVLHTI 3175
            MYKSVVY+G++LLGEVEIYP+     N+N ++ D  +EIRIS+FSQSSERCPP+AVLHTI
Sbjct: 1    MYKSVVYKGEELLGEVEIYPEENENKNKNKNLVDELKEIRISYFSQSSERCPPVAVLHTI 60

Query: 3174 TSCGVCFKMESPNSQSQDSPLFVLHSSCLRENKTAVMSLGEEELHLVAMYSRKNERQSPC 2995
            +S GVCFKMES  SQSQD+PLF+LHSSC+ ENKTAVM LG EELHLVAM SR  +++ PC
Sbjct: 61   SSHGVCFKMESKTSQSQDTPLFLLHSSCVMENKTAVMPLGGEELHLVAMRSRNGDKRYPC 120

Query: 2994 FWGFIIASGLYNSCLVMLNLRCLGIVFDLDETLIVANTWRSFEDRIEALQRKISTETDLQ 2815
            FWGF +A GLYNSCLVMLNLRCLGIVFDLDETLIVANT RSFEDRIEALQRKIS+E D Q
Sbjct: 121  FWGFSVAPGLYNSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKISSEVDPQ 180

Query: 2814 RIAGMQAEVKRYQDDKIILKQYAENDQVIENGKVIKVQSEVVPALSVNHQPIVRPIIRLQ 2635
            RI+GM AE+KRYQDDK ILKQYAENDQV+ENG+VIK QSE VPALS NHQPI+RP+IRL 
Sbjct: 181  RISGMLAEIKRYQDDKFILKQYAENDQVVENGRVIKTQSEAVPALSDNHQPIIRPLIRLH 240

Query: 2634 EKNVILTRINPKVRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRL 2455
            EKN+ILTRINP++RDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRL
Sbjct: 241  EKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRL 300

Query: 2454 LDPESNLINSKELLDRIVCVKSGLEKSLFNVFQKGNCHPKMALVIDDRLKVWDENDQPRV 2275
            LDP+SNLINS +LLDRIVCVKSG  KSLFNVFQ+  CHPKMALVIDDRLKVWD+ DQPRV
Sbjct: 301  LDPDSNLINSNKLLDRIVCVKSGSRKSLFNVFQESLCHPKMALVIDDRLKVWDDRDQPRV 360

Query: 2274 HVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDGLLQRIPEVAYEDDVRD 2095
            HVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFF+EFDD LLQ+IPEV YEDD++D
Sbjct: 361  HVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFREFDDSLLQKIPEVFYEDDIKD 420

Query: 2094 IPSPPDVSNYLVSEDDASALNVNKDPLRSDGMADSEVERRLKEAILASSVVPSAVANLDP 1915
            +PS PDVSNYLVSEDD+SALN N+DPL  DG+ D EVERR+KEA  A+S+V S   ++DP
Sbjct: 421  VPS-PDVSNYLVSEDDSSALNGNRDPLPFDGITDVEVERRMKEATPAASMVSSVFTSIDP 479

Query: 1914 RLGSSLQYTVASSIPILLPTSQTSVMPLANKQFPQVASLIKPLVHVGPSESSLQSSPARE 1735
            RL + LQYTV  S  + LPT+Q SVM   + QFPQ ASL+KPL HVG +E SLQSSPARE
Sbjct: 480  RL-APLQYTVPPSSTLSLPTTQPSVMSFPSIQFPQAASLVKPLGHVGSAEPSLQSSPARE 538

Query: 1734 EGEVPESELDPDTRRRLLILQHGQDNRDHTSSEPPFPVRPSLQVSVPRVQSRGNWFPVEE 1555
            EGEVPESELDPDTRRRLLILQHGQD RD   SEPPFPVRP +Q SVPR QSR  WFPVEE
Sbjct: 539  EGEVPESELDPDTRRRLLILQHGQDTRDQPPSEPPFPVRPPMQASVPRAQSRPGWFPVEE 598

Query: 1554 EMSPRQLNRAVPKEFPLDSEVMHIDKQRGHPQSFFHKVESSIQCDRILHENQRLPKETLH 1375
            EMSPRQL+R VPK+ PLD E + I+K R H  SFF KVE+SI  DRIL ENQRLPKE  H
Sbjct: 599  EMSPRQLSRMVPKDLPLDPETVQIEKHRPHHSSFFPKVENSIPSDRILQENQRLPKEAFH 658

Query: 1374 KDDRLRLNHPLSSFHSFPGEEAPLGRSSSSNRNLDVESGRSVPYVETPVGMLQDIATKCG 1195
            +DDRLR NH LS +HS  GEE PL RSSSSNR++D ESGR++   ETP G+LQ+IA KCG
Sbjct: 659  RDDRLRFNHALSGYHSLSGEEIPLSRSSSSNRDVDFESGRAISNAETPAGVLQEIAMKCG 718

Query: 1194 TKVEFRPALVAAAELQFSIEVWFAGEKIGEGIGRTRREAQHQAAQGSLMHLANRYLSHV- 1018
             K                   WFAGEKIGEG G+TRREA +QAA+GSL +LAN YLS V 
Sbjct: 719  AK------------------AWFAGEKIGEGSGKTRREAHYQAAEGSLKNLANIYLSRVK 760

Query: 1017 --------------NTNDNGFVSDGHSSGYQQFLKEESLSFSTTSDPSRQPDPRLDSSKK 880
                          N N NGF  + +S G Q F KEESLS ST+S+PSR  DPRL+ SKK
Sbjct: 761  PDSVSVHGDMNKFPNVNSNGFAGNLNSFGIQPFPKEESLSSSTSSEPSRPLDPRLEGSKK 820

Query: 879  SMGSVSALKELCMMEGLGVVFQGQPPPS-NSIQKDEVYAQVEVDGQVLGKGIGLTWDEAK 703
            SM SVS LKELCMMEGLGVVFQ +PPPS NS++KDEV+ QVE+DG+VLGKGIGLTWDEAK
Sbjct: 821  SMSSVSTLKELCMMEGLGVVFQPRPPPSTNSVEKDEVHVQVEIDGEVLGKGIGLTWDEAK 880

Query: 702  MQAAEKALGSLKSMHGGQYSHKRQGSPRSFQGMSSKRFKSEFPRAVQRMPSASRYPKNAS 523
            MQAAEKALGSL S     Y+ KRQGSPRS QGMSSKR K EFP+ +QRMPS++RYPKNA 
Sbjct: 881  MQAAEKALGSLTST---LYAQKRQGSPRSLQGMSSKRMKQEFPQVLQRMPSSARYPKNAP 937

Query: 522  PVP 514
            PVP
Sbjct: 938  PVP 940


>ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Populus trichocarpa]
            gi|550340277|gb|EEE85528.2| hypothetical protein
            POPTR_0004s04010g [Populus trichocarpa]
          Length = 996

 Score = 1371 bits (3549), Expect = 0.0
 Identities = 718/999 (71%), Positives = 803/999 (80%), Gaps = 59/999 (5%)
 Frame = -3

Query: 3333 MYKSVVYRGDDLLGEVEIYPQNQN-------------VDMRDREIRISHFSQSSERCPPL 3193
            MYKSVVY+GD+LLGEVEIY Q Q              +D   +EIRISHFSQ+SERCPPL
Sbjct: 1    MYKSVVYKGDELLGEVEIYAQEQQQEEEENKNKKKRVIDEIVKEIRISHFSQTSERCPPL 60

Query: 3192 AVLHTITSCGVCFKMESPNSQS------QDSPLFVLHSSCLRENKTAVMSLGEEELHLVA 3031
            AVLHTITS GVCFKME   S S      Q+SPL +LHSSC++ENKTAVM LG EELHLVA
Sbjct: 61   AVLHTITSIGVCFKMEESTSSSTTKISQQESPLHLLHSSCIQENKTAVMHLGGEELHLVA 120

Query: 3030 MYSRKNERQSPCFWGFIIASGLYNSCLVMLNLRCLGIVFDLDETLIVANTWRSFEDRIEA 2851
            M SR NERQ PCFWGF +A GLY+SCLVMLNLRCLGIVFDLDETLIVANT RSFEDRI+A
Sbjct: 121  MPSRSNERQHPCFWGFSVAPGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDA 180

Query: 2850 LQRKISTETDLQRIAGMQAEVKRYQDDKIILKQYAENDQVIENGKVIKVQSEVVPALSVN 2671
            LQRKISTE D QRI GM +EVKRY DDK ILKQY ENDQV+ENGKVIK QSEVVPALS N
Sbjct: 181  LQRKISTEVDPQRILGMLSEVKRYHDDKNILKQYVENDQVVENGKVIKTQSEVVPALSDN 240

Query: 2670 HQPIVRPIIRLQEKNVILTRINPKVRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTM 2491
            HQP+VRP+IRLQEKN+ILTRINP++RDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTM
Sbjct: 241  HQPMVRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTM 300

Query: 2490 AERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLEKSLFNVFQKGNCHPKMALVIDDR 2311
            AERDYALEMWRLLDPESNLINSKELLDRIVCVKSGL KSLFNVFQ G CHPKMALVIDDR
Sbjct: 301  AERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLRKSLFNVFQDGICHPKMALVIDDR 360

Query: 2310 LKVWDENDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDGLLQR 2131
            LKVWDE DQ RVHVVPAFAPYYAPQAE NNAVPVLCVARNVACNVRGGFFKEFD+GLLQ+
Sbjct: 361  LKVWDERDQSRVHVVPAFAPYYAPQAEVNNAVPVLCVARNVACNVRGGFFKEFDEGLLQK 420

Query: 2130 IPEVAYEDDVRDIPSPPDVSNYLVSEDDASALNVNKDPLRSDGMADSEVERRLKEAILAS 1951
            IPEVAYEDD  +IPSPPDVSNYLVSEDDASA+N N+D L  DGMAD+EVER+LKEA+ AS
Sbjct: 421  IPEVAYEDDTDNIPSPPDVSNYLVSEDDASAVNGNRDQLSFDGMADAEVERQLKEAVSAS 480

Query: 1950 ----SVVPSAVANLDPRLGSSLQYTVASS-----------------IPIL---LPTSQTS 1843
                S +PS V++LDPRL  SLQYT+ASS                 +P L    P SQ S
Sbjct: 481  SAILSTIPSTVSSLDPRLLQSLQYTIASSSSSMPTSQPSMLASQQPMPALQPPKPPSQLS 540

Query: 1842 VMPLANKQFPQVASLIKPLVHVGPSESSLQSSPAREEGEVPESELDPDTRRRLLILQHGQ 1663
            + P  N QFPQVA  +K L  V P E SLQSSPAREEGEVPESELDPDTRRRLLILQHG 
Sbjct: 541  MTPFPNTQFPQVAPSVKQLGQVVPPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGH 600

Query: 1662 DNRDHTSSEPPFPVRPSLQVSVPRVQSRGNWFPVEEEMSPRQLNRAVPKEFPLDSEVMHI 1483
            D+RD+  SE PFP RPS QVS PRVQS G+W PVEEEMSPRQLNR  P+EFPLDS+ M+I
Sbjct: 601  DSRDNAPSESPFPARPSTQVSAPRVQSVGSWVPVEEEMSPRQLNR-TPREFPLDSDPMNI 659

Query: 1482 DKQRGHPQSFFHKVESSIQCDRILHENQRLPKETLHKDDRLRLNHPLSSFHSFPGEEAPL 1303
            +K R H  SFFHKVES+I  DR++HENQR PKE  ++DDR++LNH  S++ SF GEE+PL
Sbjct: 660  EKHRTHHPSFFHKVESNIPSDRMIHENQRQPKEATYRDDRMKLNHSTSNYPSFQGEESPL 719

Query: 1302 GRSSSSNRNLDVESGRSVPYVETPVGMLQDIATKCGTKVEFRPALVAAAELQFSIEVWFA 1123
             R SSSNR+LD+ES R+    ETPV +LQ+IA KCGTKVEFRPAL+A ++LQFSIE WF 
Sbjct: 720  SR-SSSNRDLDLESERAFSSTETPVEVLQEIAMKCGTKVEFRPALIATSDLQFSIETWFV 778

Query: 1122 GEKIGEGIGRTRREAQHQAAQGSLMHLANRYLSHV---------------NTNDNGFVSD 988
            GEK+GEG G+TRREAQ QAA+GS+  LA  Y+S V               + NDNGF+ D
Sbjct: 779  GEKVGEGTGKTRREAQRQAAEGSIKKLAGIYMSRVKPDSGPMLGDSSRYPSANDNGFLGD 838

Query: 987  GHSSGYQQFLKEESLSFSTTSDPSRQPDPRLDSSKKSMGSVSALKELCMMEGLGVVFQGQ 808
             +S G Q  LK+E++++S TS+PSR  D RL+ SKKSMGSV+ALKE CM EGLGV F  Q
Sbjct: 839  MNSFGNQPLLKDENITYSATSEPSRLLDQRLEGSKKSMGSVTALKEFCMTEGLGVNFLAQ 898

Query: 807  PPPS-NSIQKDEVYAQVEVDGQVLGKGIGLTWDEAKMQAAEKALGSLKSMHGGQYSHKRQ 631
             P S NSI  +EV+AQVE+DGQVLGKGIGLTWDEAKMQAAEKALGSL++M  GQY+ KRQ
Sbjct: 899  TPLSTNSIPGEEVHAQVEIDGQVLGKGIGLTWDEAKMQAAEKALGSLRTMF-GQYTPKRQ 957

Query: 630  GSPRSFQGMSSKRFKSEFPRAVQRMPSASRYPKNASPVP 514
            GSPR  QGM +KR K EFPR +QRMPS++RY KNASPVP
Sbjct: 958  GSPRLMQGMPNKRLKQEFPRVLQRMPSSARYHKNASPVP 996


>ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Fragaria vesca subsp. vesca]
          Length = 955

 Score = 1368 bits (3542), Expect = 0.0
 Identities = 694/957 (72%), Positives = 794/957 (82%), Gaps = 21/957 (2%)
 Frame = -3

Query: 3321 VVYRGDDLLGEVEIYPQ---NQNVDMRDREIRISHFSQSSERCPPLAVLHTITSCGVCFK 3151
            +VY+G++LLGEVE+YP+   N+ +    +EIRISHFSQSSERCPP+AVLHTI+S GVCFK
Sbjct: 4    LVYKGEELLGEVEVYPEELNNKKIWDELKEIRISHFSQSSERCPPVAVLHTISSNGVCFK 63

Query: 3150 MESPNSQSQD---SPLFVLHSSCLRENKTAVMSLGEEELHLVAMYSRKNERQSPCFWGFI 2980
            MES +S S     S LF+LHSSC+ ENKTAVM+LG EELHLVAMYSR N++Q PCFWGF 
Sbjct: 64   MESKSSSSSSQDTSRLFLLHSSCIMENKTAVMNLGVEELHLVAMYSRNNQKQHPCFWGFS 123

Query: 2979 IASGLYNSCLVMLNLRCLGIVFDLDETLIVANTWRSFEDRIEALQRKISTETDLQRIAGM 2800
            ++SGLY+SCL MLNLRCLGIVFDLDETLIVANT RSFEDRIE LQRKI  E D QRI+GM
Sbjct: 124  VSSGLYSSCLGMLNLRCLGIVFDLDETLIVANTMRSFEDRIEGLQRKIQCEVDAQRISGM 183

Query: 2799 QAEVKRYQDDKIILKQYAENDQVIENGKVIKVQSEVVPALSVNHQPIVRPIIRLQEKNVI 2620
            QAE+KRYQDDK ILKQYAENDQV+ENG+VIK QSEVVPALS +HQPI+RP+IRLQEKN+I
Sbjct: 184  QAEIKRYQDDKFILKQYAENDQVVENGRVIKTQSEVVPALSDSHQPIIRPLIRLQEKNII 243

Query: 2619 LTRINPKVRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPES 2440
            LTRINP++RDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPES
Sbjct: 244  LTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPES 303

Query: 2439 NLINSKELLDRIVCVKSGLEKSLFNVFQKGNCHPKMALVIDDRLKVWDENDQPRVHVVPA 2260
            NLIN+ +LLDRIVCVKSGL+KSLFNVFQ+  CHPKMALVIDDRLKVWD+ DQPRVHVVPA
Sbjct: 304  NLINANKLLDRIVCVKSGLKKSLFNVFQESLCHPKMALVIDDRLKVWDDRDQPRVHVVPA 363

Query: 2259 FAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDGLLQRIPEVAYEDDVRDIPSPP 2080
            FAPYYAPQAEANNAVPVLCVARNVAC+VRGGFF+EFDD LLQ+IPE+ YED+++D  S P
Sbjct: 364  FAPYYAPQAEANNAVPVLCVARNVACSVRGGFFREFDDSLLQKIPEIFYEDNIKDF-SSP 422

Query: 2079 DVSNYLVSEDDASALNVNKDPLRSDGMADSEVERRLKEAILASSVVPSAVANLDPRLGSS 1900
            DVSN+LVSEDDASA N N+D L  DGMAD+EVERRLKEA  A+  V SAV+N DPRL +S
Sbjct: 423  DVSNFLVSEDDASASNGNRDQLPFDGMADAEVERRLKEATSAAPTVSSAVSNNDPRL-AS 481

Query: 1899 LQYTVASSIPILLPTSQTSVMPLANKQFPQVASLIKPLVHVGPSESSLQSSPAREEGEVP 1720
            LQYTV  S  + LPT+Q S+MP  N QFPQ ASL+KPL HVGP++  L SSPAREEGEVP
Sbjct: 482  LQYTVPLSSTVSLPTNQPSMMPFHNVQFPQSASLVKPLGHVGPADLGLHSSPAREEGEVP 541

Query: 1719 ESELDPDTRRRLLILQHGQDNRDHTSSEPPFPVRPSLQVSVPRVQSRGNWFPVEEEMSPR 1540
            ESELDPDTRRRLLILQHGQD R+   SEP FPVRP +QVSVPRVQSRG WFPVEEEMSPR
Sbjct: 542  ESELDPDTRRRLLILQHGQDTRESVPSEPSFPVRPQVQVSVPRVQSRGGWFPVEEEMSPR 601

Query: 1539 QLNRAVPKEFPLDSEVMHIDKQRGHPQSFFHKVESSIQCDRILHENQRLPKETLHKDDRL 1360
            +L+R VPKE PL+SE M I+K R H  +FF KVE+S+  DRIL ENQRLPKE  H+D+RL
Sbjct: 602  KLSRMVPKEPPLNSEPMQIEKHRSHHSAFFPKVENSMPSDRILQENQRLPKEAFHRDNRL 661

Query: 1359 RLNHPLSSFHSFPGEEAPLGRSSSSNRNLDVESGRSVPYVETPVGMLQDIATKCGTKVEF 1180
            R N  +S +HSF GEE PL RSSSSNR+ D ESGR++   ETP G+LQ+IA KCGTKVEF
Sbjct: 662  RFNQAMSGYHSFSGEEPPLNRSSSSNRDFDYESGRAISNAETPAGVLQEIAMKCGTKVEF 721

Query: 1179 RPALVAAAELQFSIEVWFAGEKIGEGIGRTRREAQHQAAQGSLMHLANRYLSH------- 1021
            RPALV + ELQF +E WFAGEKIGEG GRTRREA  QAA+GSL +LAN Y+S        
Sbjct: 722  RPALVPSTELQFYVEAWFAGEKIGEGTGRTRREAHFQAAEGSLKNLANIYISRGKPDALP 781

Query: 1020 --------VNTNDNGFVSDGHSSGYQQFLKEESLSFSTTSDPSRQPDPRLDSSKKSMGSV 865
                     N  +NGF+ + +S G Q   KE+SLS ST+S+PSR  DPRLD+S+KS+ SV
Sbjct: 782  IHGDASKFSNVTNNGFMGNMNSFGTQPLPKEDSLSSSTSSEPSRPLDPRLDNSRKSVSSV 841

Query: 864  SALKELCMMEGLGVVFQGQPPPSNSIQKDEVYAQVEVDGQVLGKGIGLTWDEAKMQAAEK 685
            SALKELC MEGL V++Q +PPP NS +KDEV+ Q E+DG+VLGKGIGLTWDEAKMQAAEK
Sbjct: 842  SALKELCTMEGLSVLYQPRPPPPNSTEKDEVHVQAEIDGEVLGKGIGLTWDEAKMQAAEK 901

Query: 684  ALGSLKSMHGGQYSHKRQGSPRSFQGMSSKRFKSEFPRAVQRMPSASRYPKNASPVP 514
            ALG+L+S     Y  KRQGSPR  QGM SKR K EFP+ +QRMPS++RY KNA PVP
Sbjct: 902  ALGNLRST---LYGQKRQGSPRPLQGMPSKRLKQEFPQVLQRMPSSTRYSKNAPPVP 955


>ref|XP_007025681.1| C-terminal domain phosphatase-like 1 isoform 2 [Theobroma cacao]
            gi|508781047|gb|EOY28303.1| C-terminal domain
            phosphatase-like 1 isoform 2 [Theobroma cacao]
          Length = 984

 Score = 1354 bits (3504), Expect = 0.0
 Identities = 704/945 (74%), Positives = 782/945 (82%), Gaps = 40/945 (4%)
 Frame = -3

Query: 3333 MYKSVVYRGDDLLGEVEIYPQNQNVDMRD-----------------REIRISHFSQSSER 3205
            MYKSVVYRG+++LGEVEIYPQ Q    +                  +EIRI + +Q SER
Sbjct: 4    MYKSVVYRGEEVLGEVEIYPQQQLQQQQQLREEEDERKIMVMEEEMKEIRIEYLTQGSER 63

Query: 3204 CPPLAVLHTITSCGVCFKMESPN----SQSQDSP-LFVLHSSCLRENKTAVMSLGEEELH 3040
            CPPLAVLHTITS G+CFKMES      S SQDSP L +LHS C+R+NKTAVM +G+ ELH
Sbjct: 64   CPPLAVLHTITSSGICFKMESSKDNNYSSSQDSPPLHLLHSECIRDNKTAVMPMGDCELH 123

Query: 3039 LVAMYSRKNERQSPCFWGFIIASGLYNSCLVMLNLRCLGIVFDLDETLIVANTWRSFEDR 2860
            LVAMYSR ++R  PCFWGF ++ GLY+SCL+MLNLRCLGIVFDLDETLIVANT RSFEDR
Sbjct: 124  LVAMYSRNSDR--PCFWGFNVSRGLYDSCLLMLNLRCLGIVFDLDETLIVANTMRSFEDR 181

Query: 2859 IEALQRKISTETDLQRIAGMQAEVKRYQDDKIILKQYAENDQVIENGKVIKVQSEVVPAL 2680
            IEALQRK++TE D QR+AGM AE+KRYQDDK ILKQYAENDQV+ENGKVIK+QSEVVPAL
Sbjct: 182  IEALQRKMTTEVDPQRVAGMVAEMKRYQDDKAILKQYAENDQVVENGKVIKIQSEVVPAL 241

Query: 2679 SVNHQPIVRPIIRLQEKNVILTRINPKVRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 2500
            S NHQPI+RP+IRLQEKN+ILTRINP++RDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV
Sbjct: 242  SDNHQPIIRPLIRLQEKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYV 301

Query: 2499 CTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLEKSLFNVFQKGNCHPKMALVI 2320
            CTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSG  KSLFNVFQ G CHPKMALVI
Sbjct: 302  CTMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVI 361

Query: 2319 DDRLKVWDENDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDGL 2140
            DDRLKVWDE DQPRVHVVPAFAPYYAPQAEANN +PVLCVARNVACNVRGGFF+EFD+GL
Sbjct: 362  DDRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNTIPVLCVARNVACNVRGGFFREFDEGL 421

Query: 2139 LQRIPEVAYEDDVRDIPSPPDVSNYLVSEDDASALNVNKDPLRSDGMADSEVERRLKEAI 1960
            LQRIPE++YEDD++DIPSPPDV NYLVSEDD SALN NKDPL  DGMAD+EVERRLKEAI
Sbjct: 422  LQRIPEISYEDDIKDIPSPPDVGNYLVSEDDTSALNGNKDPLLFDGMADAEVERRLKEAI 481

Query: 1959 LASSVVPSAVANLDPRLGSSLQYTVASSIPILLPT-SQTSVMPLANKQFPQVASLIKPLV 1783
             A+S V SA  NLDPRL  SLQYT+ SS   + P+ SQ S++  +N QFP  A ++KP+ 
Sbjct: 482  SATSTVSSAAINLDPRLTPSLQYTMPSSSSSIPPSASQPSIVSFSNMQFPLAAPVVKPVA 541

Query: 1782 HVGPSESSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDNRDHTSSEPPF-PVRPSLQ 1606
             V   E SLQSSPAREEGEVPESELDPDTRRRLLILQHGQD RDHT  EP F PVRP++Q
Sbjct: 542  PVAVPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPVRPTMQ 601

Query: 1605 VSVPRVQSRGNWFPVEEEMSPRQLNRAVPKEFPLDSEVMHIDKQRGHPQSFFHKVESSIQ 1426
            VSVPR QSRG+WF  EEEMSPRQLNRA PKEFPLDSE MHI+K R HP  FF KVESSI 
Sbjct: 602  VSVPRGQSRGSWFAAEEEMSPRQLNRAAPKEFPLDSERMHIEKHR-HP-PFFPKVESSIP 659

Query: 1425 CDRILHENQRLPKETLHKDDRLRLNHPLSSFHSFPGEEAPLGRSSSSNRNLDVESGRSVP 1246
             DR+L ENQRL KE LH+DDRL LNH  SS+HSF GEE PL +SSSS+R+LD ESGR+V 
Sbjct: 660  SDRLLRENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFESGRTVT 719

Query: 1245 YVETPVGMLQDIATKCGTKVEFRPALVAAAELQFSIEVWFAGEKIGEGIGRTRREAQHQA 1066
              ET  G+LQDIA KCG KVEFRPALVA+ +LQFSIE WFAGEK+GEG+GRTRREAQ QA
Sbjct: 720  SGETSAGVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFAGEKVGEGVGRTRREAQRQA 779

Query: 1065 AQGSLMHLANRYLSHV---------------NTNDNGFVSDGHSSGYQQFLKEESLSFST 931
            A+ S+ +LAN YLS +               N NDNGF S+ +S G Q   KEESLSFST
Sbjct: 780  AEESIKNLANTYLSRIKPDSGSAEGDLSRLHNINDNGFPSNVNSFGNQLLAKEESLSFST 839

Query: 930  TSDPSRQPDPRLDSSKKSMGSVSALKELCMMEGLGVVFQGQPP-PSNSIQKDEVYAQVEV 754
             S+ SR  DPRL+ SKKSMGSV+ALKELCMMEGLGVVFQ QPP  SN++QKDEVYAQVE+
Sbjct: 840  ASEQSRLADPRLEGSKKSMGSVTALKELCMMEGLGVVFQPQPPSSSNALQKDEVYAQVEI 899

Query: 753  DGQVLGKGIGLTWDEAKMQAAEKALGSLKSMHGGQYSHKRQGSPR 619
            DGQVLGKG GLTW+EAKMQAAEKALGSL+SM  GQYS KRQGSPR
Sbjct: 900  DGQVLGKGTGLTWEEAKMQAAEKALGSLRSML-GQYSQKRQGSPR 943


>ref|XP_002519032.1| double-stranded RNA binding protein, putative [Ricinus communis]
            gi|223541695|gb|EEF43243.1| double-stranded RNA binding
            protein, putative [Ricinus communis]
          Length = 978

 Score = 1343 bits (3477), Expect = 0.0
 Identities = 700/979 (71%), Positives = 794/979 (81%), Gaps = 39/979 (3%)
 Frame = -3

Query: 3333 MYKSVVYRGDDLLGEVEIYPQN-----QNVDMRDRE---------------IRISHFSQS 3214
            MYKSVVY+GD+LLGEVEIY Q      Q  +++++E               IRISHFSQ+
Sbjct: 1    MYKSVVYKGDELLGEVEIYAQQEQKLQQQEELQEQEQELKKKRVIDEILKGIRISHFSQA 60

Query: 3213 SERCPPLAVLHTITSCGVCFKMESPNSQSQDSPLFVLHSSCLRENKTAVMSL-GEEELHL 3037
            SERCPPLAVLHTIT+ G+CFKMES NS S D+PL +LHSSC++E+KTAV+ L G EELHL
Sbjct: 61   SERCPPLAVLHTITTNGICFKMESKNSVSLDTPLHLLHSSCIQESKTAVVLLQGGEELHL 120

Query: 3036 VAMYSRKNERQSPCFWGFIIASGLYNSCLVMLNLRCLGIVFDLDETLIVANTWRSFEDRI 2857
            VAM+SR +ERQ PCFW F I+SGLY+SCLVMLNLRCLGIVFDLDETLIVANT RSFEDRI
Sbjct: 121  VAMFSRNDERQYPCFWAFNISSGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRI 180

Query: 2856 EALQRKISTETDLQRIAGMQAEVKRYQDDKIILKQYAENDQVIENGKVIKVQSEVVPALS 2677
            EALQRKISTE D QRI+GM +EVKRYQDDK ILKQY +NDQV+ENG+VIK Q EVVPALS
Sbjct: 181  EALQRKISTELDPQRISGMLSEVKRYQDDKTILKQYVDNDQVVENGRVIKTQFEVVPALS 240

Query: 2676 VNHQPIVRPIIRLQEKNVILTRINPKVRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVC 2497
             NHQ IVRP+IRLQE+N+ILTRINP++RDTSVLVRLRPAWE+LRSYLTARGRKRFEVYVC
Sbjct: 241  DNHQTIVRPLIRLQERNIILTRINPQIRDTSVLVRLRPAWEELRSYLTARGRKRFEVYVC 300

Query: 2496 TMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLEKSLFNVFQKGNCHPKMALVID 2317
            TMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGL KSLFNVFQ G CHPKMALVID
Sbjct: 301  TMAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLRKSLFNVFQDGICHPKMALVID 360

Query: 2316 DRLKVWDENDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDGLL 2137
            DRLKVWDE DQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFD+GLL
Sbjct: 361  DRLKVWDEKDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDEGLL 420

Query: 2136 QRIPEVAYEDDVRDIPSPPDVSNYLVSEDDASALNVNKDPLRSDGMADSEVERRLKEAIL 1957
            QRIPE+++EDD+ DIPSPPDVSNYLV EDDA   N N+DPL  DGMAD+EVE+RLKEAI 
Sbjct: 421  QRIPEISFEDDMNDIPSPPDVSNYLVPEDDAFTSNGNRDPLSFDGMADAEVEKRLKEAIS 480

Query: 1956 ASSVVPSAVANLDPRLGSSLQYTVASSIPILLPTSQTSVMPLANKQFPQVASLIKPLVHV 1777
             SS  PS VANLD RL   LQYT+ASS  I +PTSQ +V+   + Q PQ A L+KPL  V
Sbjct: 481  ISSAFPSTVANLDARLVPPLQYTMASSSSIPVPTSQPAVVTFPSMQLPQAAPLVKPLGQV 540

Query: 1776 GPSESSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDNRDHTSSEPPFPVRP--SLQV 1603
             PSE SLQSSPAREEGEVPESELDPDTRRRLLILQHGQD RD   SE PFPVRP  S+QV
Sbjct: 541  VPSEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDLRDPAPSESPFPVRPSNSMQV 600

Query: 1602 SVPRVQSRGNWFPVEEEMSPRQLNRAVPKEFPLDSEVMHIDKQRGHPQSFFHKVESSIQC 1423
            SVPRVQSRGNW PVEEEMSPRQLNRAV +EFP+D+E MHIDK R H  SFF KVESSI  
Sbjct: 601  SVPRVQSRGNWVPVEEEMSPRQLNRAVTREFPMDTEPMHIDKHRPHHPSFFPKVESSIPS 660

Query: 1422 DRILHENQRLPKETLHKDDRLRLNHPLSSFHSFPGEEAPLGRSSSSNRNLDVESGRSVPY 1243
            +R+ HENQRLPK   +KDDRLRLN  +S++ S  GEE  L RSSSSNR+LDVES R+V  
Sbjct: 661  ERMPHENQRLPKVAPYKDDRLRLNQTMSNYQSLSGEENSLSRSSSSNRDLDVESDRAVSS 720

Query: 1242 VETPVGMLQDIATKCGTKVEFRPALVAAAELQFSIEVWFAGEKIGEGIGRTRREAQHQAA 1063
             ETPV +L +I+ KCG KVEF+ +LV + +LQFS+E WFAGE++GEG GRTRREAQ  AA
Sbjct: 721  AETPVRVLHEISMKCGAKVEFKHSLVNSRDLQFSVEAWFAGERVGEGFGRTRREAQSVAA 780

Query: 1062 QGSLMHLANRYLS---------------HVNTNDNGFVSDGHSSGYQQFLKEESLSFSTT 928
            + S+ +LAN Y+S               + + NDNGF+   +S G Q   K+E LS+S +
Sbjct: 781  EASIKNLANIYISRAKPDNGALHGDASKYSSANDNGFLGHVNSFGSQPLPKDEILSYSDS 840

Query: 927  SDPSRQPDPRLDSSKKSMGSVSALKELCMMEGLGVVFQGQPP-PSNSIQKDEVYAQVEVD 751
            S+ S   DPRL+SSKKSM SV+ALKE CMMEGLGV F  Q P  SNS+Q  EV+AQVE+D
Sbjct: 841  SEQSGLLDPRLESSKKSMSSVNALKEFCMMEGLGVNFLAQTPLSSNSVQNAEVHAQVEID 900

Query: 750  GQVLGKGIGLTWDEAKMQAAEKALGSLKSMHGGQYSHKRQGSPRSFQGMSSKRFKSEFPR 571
            GQV+GKGIG T+DEAKMQAAEKALGSL++   G++  KRQGSPR   GM +K  K EFPR
Sbjct: 901  GQVMGKGIGSTFDEAKMQAAEKALGSLRTTF-GRFPPKRQGSPRPVPGMPNKHLKPEFPR 959

Query: 570  AVQRMPSASRYPKNASPVP 514
             +QRMPS++RYPKNA PVP
Sbjct: 960  VLQRMPSSARYPKNAPPVP 978


>ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
          Length = 956

 Score = 1330 bits (3442), Expect = 0.0
 Identities = 680/963 (70%), Positives = 791/963 (82%), Gaps = 21/963 (2%)
 Frame = -3

Query: 3339 LKMYKSVVYRGDDLLGEVEIYPQNQN--VDMRDREIRISHFSQSSERCPPLAVLHTITSC 3166
            ++MYKSVVY+G+ ++GEV++YP+  N   +   +EIRISHFSQ SERCPPLAVLHT+TSC
Sbjct: 1    MRMYKSVVYQGEVVVGEVDVYPEENNNYKNFHVKEIRISHFSQPSERCPPLAVLHTVTSC 60

Query: 3165 GVCFKMESPNSQSQDSPLFVLHSSCLRENKTAVMSLGEEELHLVAMYSRKNERQSPCFWG 2986
            GVCFKMES  +Q QD  LF LHS C+RENKTAVM LG EE+HLVAM+SR  +R  PCFWG
Sbjct: 61   GVCFKMES-KTQQQDG-LFQLHSLCIRENKTAVMPLGGEEIHLVAMHSRNVDR--PCFWG 116

Query: 2985 FIIASGLYNSCLVMLNLRCLGIVFDLDETLIVANTWRSFEDRIEALQRKISTETDLQRIA 2806
            FI+A GLY+SCLVMLNLRCLGIVFDLDETLIVANT RSFEDRI+ALQRKI++E D QRI+
Sbjct: 117  FIVALGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRIS 176

Query: 2805 GMQAEVKRYQDDKIILKQYAENDQVIENGKVIKVQSEVVPALSVNHQPIVRPIIRLQEKN 2626
            GMQAEVKRYQDDK ILKQYAENDQV++NG+VIKVQSE+VPALS +HQPIVRP+IRLQ+KN
Sbjct: 177  GMQAEVKRYQDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRLQDKN 236

Query: 2625 VILTRINPKVRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP 2446
            +ILTRINP++RDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP
Sbjct: 237  IILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP 296

Query: 2445 ESNLINSKELLDRIVCVKSGLEKSLFNVFQKGNCHPKMALVIDDRLKVWDENDQPRVHVV 2266
            +SNLINSKELL RIVCVKSGL+KSLFNVFQ G CHPKMALVIDDRLKVWDE DQPRVHVV
Sbjct: 297  DSNLINSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVV 356

Query: 2265 PAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDGLLQRIPEVAYEDDVRDIPS 2086
            PAFAPYYAPQAEA+N +PVLCVARNVACNVRGGFFK+FDDGLLQ+IP++AYEDD++DIPS
Sbjct: 357  PAFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKDIPS 416

Query: 2085 PPDVSNYLVSEDDASALNVNKDPLRSDGMADSEVERRLKEAILASSVVPSAVANLDPRLG 1906
            PPDVSNYLVSEDD S  N ++DP   DGMAD+EVER+LK+A+ A+S +P   ANLDPRL 
Sbjct: 417  PPDVSNYLVSEDDGSISNGHRDPFLFDGMADAEVERKLKDALSAASTIPVTTANLDPRL- 475

Query: 1905 SSLQYTVASSIPILLPTSQTSVMPLANKQFPQVASLIKPLVHVGPSESSLQSSPAREEGE 1726
            +SLQYT+  S  +  PT+Q S+MP  + QFPQ A+L+KP+    PSE SL SSPAREEGE
Sbjct: 476  TSLQYTMVPSGSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSEPSLHSSPAREEGE 535

Query: 1725 VPESELDPDTRRRLLILQHGQDNRDHTSSEPPFPVRPSLQVSVPRV-QSRGNWFPVEEEM 1549
            VPESELDPDTRRRLLILQHGQD RDH S+EPPFPVR  +Q S P V  SRG WFP EEE+
Sbjct: 536  VPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQTSAPHVPSSRGVWFPAEEEI 595

Query: 1548 SPRQLNRAVPKEFPLDSEVMHIDKQRGHPQSFFHKVESSIQCDRILHE-NQRLPKETLHK 1372
              + LNR VPKEFP+DS  + I K R H  SFF KVESSI  DRILH+ +QRLPKE  H+
Sbjct: 596  GSQPLNRVVPKEFPVDSGPLGIAKPRPHHPSFFSKVESSISSDRILHDSHQRLPKEMYHR 655

Query: 1371 DDRLRLNHPLSSFHSFPGEEAPLGRSSSSNRNLDVESGRSVPYVETPVGMLQDIATKCGT 1192
            DDR RLNH LSS+ SF G++ P  RS SS+R+LD ESG SV + +TPV +LQ+IA KCGT
Sbjct: 656  DDRPRLNHMLSSYRSFSGDDIPFSRSFSSHRDLDSESGHSVLHADTPVAVLQEIALKCGT 715

Query: 1191 KVEFRPALVAAAELQFSIEVWFAGEKIGEGIGRTRREAQHQAAQGSLMHLANRYLSHV-- 1018
            KV+F  +LVA+ ELQFS+E WF+G+KIG  +GRTR+EAQ++AA+ S+ HLA+ YLS    
Sbjct: 716  KVDFISSLVASTELQFSMEAWFSGKKIGHRVGRTRKEAQNKAAEDSIKHLADIYLSSAKD 775

Query: 1017 -------------NTNDNGFVSDGHSSGYQQFLKEESLSFSTTSDPSRQPDPRLDSSKKS 877
                         N ND+G++    S G Q   KE+S SFST S PSR  DPRLD SK+S
Sbjct: 776  EPGSTYGDVSGFPNVNDSGYMGIASSLGNQPLSKEDSASFSTAS-PSRVLDPRLDVSKRS 834

Query: 876  MGSVSALKELCMMEGLGVVFQGQPPP--SNSIQKDEVYAQVEVDGQVLGKGIGLTWDEAK 703
            MGS+S+LKELCMMEGL V F   P P  +NS+QKDEV+AQVE+DG+V GKGIGLTWDEAK
Sbjct: 835  MGSISSLKELCMMEGLDVNFLSAPAPVSTNSVQKDEVHAQVEIDGKVFGKGIGLTWDEAK 894

Query: 702  MQAAEKALGSLKSMHGGQYSHKRQGSPRSFQGMSSKRFKSEFPRAVQRMPSASRYPKNAS 523
            MQAAEKALGSL+S   GQ   KRQ SPR  QG S+KR K E+PR +QRMPS++RYP+NA 
Sbjct: 895  MQAAEKALGSLRSKL-GQSIQKRQSSPRPHQGFSNKRLKQEYPRPMQRMPSSARYPRNAP 953

Query: 522  PVP 514
            P+P
Sbjct: 954  PIP 956


>ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Glycine max]
          Length = 960

 Score = 1330 bits (3442), Expect = 0.0
 Identities = 678/967 (70%), Positives = 792/967 (81%), Gaps = 27/967 (2%)
 Frame = -3

Query: 3333 MYKSVVYRGDDLLGEVEIYPQ--------NQNVDMRDREIRISHFSQSSERCPPLAVLHT 3178
            MYKSVVY+G+ ++GEV++YP+        N N +   +EIRISHFSQ SERCPPLAVLHT
Sbjct: 1    MYKSVVYQGEVVVGEVDVYPEENNNNNNKNYNKNFHVKEIRISHFSQPSERCPPLAVLHT 60

Query: 3177 ITSCGVCFKMESPNSQSQDSPLFVLHSSCLRENKTAVMSLGEEELHLVAMYSRKNERQSP 2998
            +TSCGVCFKMES  +Q QD  LF LHS C+RENKTAVM LG EE+HLVAM+SR ++R  P
Sbjct: 61   VTSCGVCFKMES-KTQQQDG-LFQLHSLCIRENKTAVMPLGGEEIHLVAMHSRNDDR--P 116

Query: 2997 CFWGFIIASGLYNSCLVMLNLRCLGIVFDLDETLIVANTWRSFEDRIEALQRKISTETDL 2818
            CFWGFI+  GLY+SCLVMLNLRCLGIVFDLDETLIVANT RSFEDRI+ALQRKI++E D 
Sbjct: 117  CFWGFIVTLGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDP 176

Query: 2817 QRIAGMQAEVKRYQDDKIILKQYAENDQVIENGKVIKVQSEVVPALSVNHQPIVRPIIRL 2638
            QRI+GMQAEVKRY DDK ILKQYAENDQV++NG+VIKVQSE+VPALS +HQPIVRP+IRL
Sbjct: 177  QRISGMQAEVKRYLDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRL 236

Query: 2637 QEKNVILTRINPKVRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWR 2458
            Q+KN+ILTRINP++RDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWR
Sbjct: 237  QDKNIILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWR 296

Query: 2457 LLDPESNLINSKELLDRIVCVKSGLEKSLFNVFQKGNCHPKMALVIDDRLKVWDENDQPR 2278
            LLDP+SNLINSKELL RIVCVKSGL+KSLFNVFQ G+C PKMALVIDDRLKVWDE DQPR
Sbjct: 297  LLDPDSNLINSKELLGRIVCVKSGLKKSLFNVFQDGSCDPKMALVIDDRLKVWDERDQPR 356

Query: 2277 VHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDGLLQRIPEVAYEDDVR 2098
            VHVVPAFAPYYAPQAEA+N +PVLCVARNVACNVRGGFFK+FDDGLLQ+IP++AYEDD++
Sbjct: 357  VHVVPAFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIK 416

Query: 2097 DIPSPPDVSNYLVSEDDASALNVNKDPLRSDGMADSEVERRLKEAILASSVVPSAVANLD 1918
            D+PSPPDVSNYLVSEDD S  N N+DP   DGMAD+EVER+LK+A+ A+S  P   ANLD
Sbjct: 417  DVPSPPDVSNYLVSEDDGSISNGNRDPFLFDGMADAEVERKLKDALAAASTFPVTTANLD 476

Query: 1917 PRLGSSLQYTVASSIPILLPTSQTSVMPLANKQFPQVASLIKPLVHVGPSESSLQSSPAR 1738
            PRL +SLQYT+  S  +  PT+Q S+MP  + QFPQ A+L+KP+    PS+ SL SSPAR
Sbjct: 477  PRL-TSLQYTMVPSGSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSDPSLHSSPAR 535

Query: 1737 EEGEVPESELDPDTRRRLLILQHGQDNRDHTSSEPPFPVRPSLQVSVPRV-QSRGNWFPV 1561
            EEGEVPESELDPDTRRRLLILQHGQD RDH S+EPPFPVR  +Q S PRV  SRG WFPV
Sbjct: 536  EEGEVPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQASAPRVPSSRGVWFPV 595

Query: 1560 EEEMSPRQLNRAVPKEFPLDSEVMHIDKQRGHPQSFFHKVESSIQCDRILHE-NQRLPKE 1384
            EEE+  + LNR VPKEFP+DS  + I+K R H  SFF+KVESSI  DRILH+ +QRLPKE
Sbjct: 596  EEEIGSQPLNRVVPKEFPVDSGPLGIEKPRLHHPSFFNKVESSISSDRILHDSHQRLPKE 655

Query: 1383 TLHKDDRLRLNHPLSSFHSFPGEEAPLGRSSSSNRNLDVESGRSVPYVETPVGMLQDIAT 1204
              H+DDR RLNH LSS+ SF G++ P  RSSSS+R+LD ESG SV + +TPV +L +IA 
Sbjct: 656  MYHRDDRPRLNHMLSSYRSFSGDDIPFSRSSSSHRDLDSESGHSVLHADTPVAVLHEIAL 715

Query: 1203 KCGTKVEFRPALVAAAELQFSIEVWFAGEKIGEGIGRTRREAQHQAAQGSLMHLANRYLS 1024
            KCGTKV+F  +LVA+ EL+FS+E WF+G+KIG G GRTR+EAQ++AA+ S+ HLA+ YLS
Sbjct: 716  KCGTKVDFMSSLVASTELKFSLEAWFSGKKIGHGFGRTRKEAQNKAAKDSIEHLADIYLS 775

Query: 1023 HV---------------NTNDNGFVSDGHSSGYQQFLKEESLSFSTTSDPSRQPDPRLDS 889
                             N NDNG++    S G Q   KE+S SFS+ S PSR  DPRLD 
Sbjct: 776  SAKDEPGSTYGDVSGFPNVNDNGYMGIASSLGNQPLSKEDSASFSSAS-PSRALDPRLDV 834

Query: 888  SKKSMGSVSALKELCMMEGLGVVFQGQPPP--SNSIQKDEVYAQVEVDGQVLGKGIGLTW 715
            SK+SMGS+SALKELCMMEGLGV F   P P  +NS+QKDEV+AQVE+DG++ GKGIGLTW
Sbjct: 835  SKRSMGSISALKELCMMEGLGVNFLSTPAPVSTNSVQKDEVHAQVEIDGKIFGKGIGLTW 894

Query: 714  DEAKMQAAEKALGSLKSMHGGQYSHKRQGSPRSFQGMSSKRFKSEFPRAVQRMPSASRYP 535
            DEAKMQAAEKALG+L+S   GQ   K Q SPR  QG S+KR K E+PR +QRMPS++RYP
Sbjct: 895  DEAKMQAAEKALGNLRSKL-GQSIQKMQSSPRPHQGFSNKRLKQEYPRTMQRMPSSARYP 953

Query: 534  KNASPVP 514
            +NA P+P
Sbjct: 954  RNAPPIP 960


>emb|CAN72816.1| hypothetical protein VITISV_004100 [Vitis vinifera]
          Length = 894

 Score = 1327 bits (3434), Expect = 0.0
 Identities = 692/948 (72%), Positives = 769/948 (81%), Gaps = 8/948 (0%)
 Frame = -3

Query: 3333 MYKSVVYRGDDLLGEVEIYPQNQNVDMRDREIRISHFSQSSERCPPLAVLHTITSCGVCF 3154
            MYKS+VY GDD++GEVEIYPQNQ +++  +EIRISH+SQ SERCPPLAVLHTITSCGVCF
Sbjct: 1    MYKSIVYEGDDVVGEVEIYPQNQGLELM-KEIRISHYSQPSERCPPLAVLHTITSCGVCF 59

Query: 3153 KMESPNSQSQDSPLFVLHSSCLRENKTAVMSLGEEELHLVAMYSRKNERQSPCFWGFIIA 2974
            KMES  +QSQD+PL++LHS+C+RENKTAVMSLGEEELHLVAMYS+K + Q PCFWGF +A
Sbjct: 60   KMESSKAQSQDTPLYLLHSTCIRENKTAVMSLGEEELHLVAMYSKKKDGQYPCFWGFNVA 119

Query: 2973 SGLYNSCLVMLNLRCLGIVFDLDETLIVANTWRSFEDRIEALQRKISTETDLQRIAGMQA 2794
             GLY+SCLVMLNLRCLGIVFDLDETLIVANT RSFEDRI+ALQRKI+TE D QRI+GM A
Sbjct: 120  LGLYSSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINTEVDPQRISGMVA 179

Query: 2793 EVKRYQDDKIILKQYAENDQVIENGKVIKVQSEVVPALSVNHQPIVRPIIRLQEKNVILT 2614
            EV                   +ENGK+ K Q E+VPALS NHQPIVRP+IRLQEKN+ILT
Sbjct: 180  EV-------------------VENGKLFKTQPEIVPALSDNHQPIVRPLIRLQEKNIILT 220

Query: 2613 RINPKVRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESNL 2434
            RINP +RDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESNL
Sbjct: 221  RINPLIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESNL 280

Query: 2433 INSKELLDRIVCVKSGLEKSLFNVFQKGNCHPKMALVIDDRLKVWDENDQPRVHVVPAFA 2254
            INSKELLDRIVCVKSG  KSLFNVFQ G CHPKMALVIDDRLKVWDE DQPRVHVVPAFA
Sbjct: 281  INSKELLDRIVCVKSGSRKSLFNVFQDGICHPKMALVIDDRLKVWDEKDQPRVHVVPAFA 340

Query: 2253 PYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDGLLQRIPEVAYEDDVRDIPSPPDV 2074
            PYYAPQAEANNA+ VLCVARNVACNVRGGFFKEFD+GLLQRIPE++YED+++DI S PDV
Sbjct: 341  PYYAPQAEANNAISVLCVARNVACNVRGGFFKEFDEGLLQRIPEISYEDBIKDIRSAPDV 400

Query: 2073 SNYLVSEDDASALNVNKDPLRSDGMADSEVERRLKEAILASSVVPSAVANLDPRLGSSLQ 1894
            SNYLVSEDDAS  N N+D    DGMAD EVER+LK+AI A    PS V +LDPRL   LQ
Sbjct: 401  SNYLVSEDDASVSNGNRDQPCFDGMADVEVERKLKDAISA----PSTVTSLDPRLSPPLQ 456

Query: 1893 YTVASSIPIL-LPTSQTSVMPLANKQFPQVASLIKPLVHVGPSESSLQSSPAREEGEVPE 1717
            + VA+S  +   P +Q S+MP +NKQFPQ ASLIKPL      E ++QSSPAREEGEVPE
Sbjct: 457  FAVAASSGLAPQPAAQGSIMPFSNKQFPQSASLIKPLA----PEPTMQSSPAREEGEVPE 512

Query: 1716 SELDPDTRRRLLILQHGQDNRDHTSSEPPFPVRPSLQVSVPRVQSRGNWFPVEEEMSPRQ 1537
            SELDPDTRRRLLILQHGQD R+H SS+PPFPVRP +QVSVPRVQSRG+WFP +EEMSPRQ
Sbjct: 513  SELDPDTRRRLLILQHGQDTREHASSDPPFPVRPPIQVSVPRVQSRGSWFPADEEMSPRQ 572

Query: 1536 LNRAVPKEFPLDSEVMHIDKQRGHPQSFFHKVESSIQCDRILHENQRLPKETLHKDDRLR 1357
            LNRAVPKEFPLDS+ MHI+K R H  SFFHKVESS   DRILHENQRL KE LH+DDRLR
Sbjct: 573  LNRAVPKEFPLDSDTMHIEKHRPHHPSFFHKVESSASSDRILHENQRLSKEVLHRDDRLR 632

Query: 1356 LNHPLSSFHSFPGEEAPLGRSSSSNRNLDVESGRSVPYVETP-VGMLQDIATKCGTKVEF 1180
            LNH L  +HSF GEE PLGR SSSNR+LD ESGR  PY ETP VG+L++    C      
Sbjct: 633  LNHSLPGYHSFSGEEVPLGR-SSSNRDLDFESGRGAPYAETPAVGLLRN----CN----- 682

Query: 1179 RPALVAAAELQFSIEVWFAGEKIGEGIGRTRREAQHQAAQGSLMHLANRYL-----SHVN 1015
                          EVW  GEKIGEG G+TRREAQ QAA+ SLM+L+ RYL        N
Sbjct: 683  --------------EVWNQGEKIGEGTGKTRREAQCQAAEASLMYLSYRYLHGDVNRFPN 728

Query: 1014 TNDNGFVSDGHSSGYQQFLKEESLSFSTTSDPSRQPDPRLDSSKKSMGSVSALKELCMME 835
             +DN F+SD +S GYQ F KE S+SFST S+ SR  DPRL+SSKKSMGS+SALKELCMME
Sbjct: 729  ASDNNFMSDTNSFGYQSFPKEGSMSFSTASESSRLLDPRLESSKKSMGSISALKELCMME 788

Query: 834  GLGVVFQGQPP-PSNSIQKDEVYAQVEVDGQVLGKGIGLTWDEAKMQAAEKALGSLKSMH 658
            GLGV F  QPP  SNS QK+E+ AQVE+DGQVLGKG G TWD+AKMQAAEKALGSLKSM 
Sbjct: 789  GLGVEFLSQPPLSSNSTQKEEICAQVEIDGQVLGKGTGSTWDDAKMQAAEKALGSLKSML 848

Query: 657  GGQYSHKRQGSPRSFQGMSSKRFKSEFPRAVQRMPSASRYPKNASPVP 514
             GQ+S KRQGSPRS QGM  KR KSEF R +QR PS+ RY KN SPVP
Sbjct: 849  -GQFSQKRQGSPRSLQGM-GKRLKSEFTRGLQRTPSSGRYSKNTSPVP 894


>ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Populus trichocarpa]
            gi|550327613|gb|ERP55122.1| hypothetical protein
            POPTR_0011s04910g [Populus trichocarpa]
          Length = 990

 Score = 1318 bits (3412), Expect = 0.0
 Identities = 690/993 (69%), Positives = 783/993 (78%), Gaps = 53/993 (5%)
 Frame = -3

Query: 3333 MYKSVVYRGDDLLGEVEIYPQNQNVDMRDRE-------------IRISHFSQSSERCPPL 3193
            MYKSVVY+G++LLGEVEIY Q Q  +  + +             IRISHFSQ+SERCPPL
Sbjct: 1    MYKSVVYKGEELLGEVEIYAQEQQQEEEENKNKRKRVIDEIVKGIRISHFSQASERCPPL 60

Query: 3192 AVLHTITSCGVCFKMESPNSQS-------QDSPLFVLHSSCLRENKTAVMSLGEEELHLV 3034
            AVLHTITS GVCFKME   + S       Q+SPL +LHSSC++ENKTAVM LG EELHLV
Sbjct: 61   AVLHTITSIGVCFKMEESTASSSTKISSQQESPLRLLHSSCIQENKTAVMLLGGEELHLV 120

Query: 3033 AMYSRKNERQSPCFWGFIIASGLYNSCLVMLNLRCLGIVFDLDETLIVANTWRSFEDRIE 2854
            AM SR NER+ PCFWGF +ASGLY+SCLVMLNLRCLGIVFDLDETLIVANT RSFED+IE
Sbjct: 121  AMPSRSNERKHPCFWGFNVASGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDKIE 180

Query: 2853 ALQRKISTETDLQRIAGMQAEVKRYQDDKIILKQYAENDQVIENGKVIKVQSEVVPALSV 2674
            ALQ+KISTE D QRI  + +E+KRYQDDKIILKQY ENDQVIENGKVIK Q EVVPA S 
Sbjct: 181  ALQKKISTEVDQQRILAIISEIKRYQDDKIILKQYVENDQVIENGKVIKTQFEVVPAASD 240

Query: 2673 NHQPIVRPIIRLQEKNVILTRINPKVRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT 2494
            NHQP+VRP+IRL EKN+I TRINP++RDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT
Sbjct: 241  NHQPLVRPLIRLPEKNIIFTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCT 300

Query: 2493 MAERDYALEMWRLLDPESNLINSKELLDRIVCVKSGLEKSLFNVFQKGNCHPKMALVIDD 2314
            MAERDYALEMWRLLDPESNLINS ELLDRIVCV SG  KSLFNVFQ G CHPKMALVIDD
Sbjct: 301  MAERDYALEMWRLLDPESNLINSNELLDRIVCVSSGSRKSLFNVFQDGICHPKMALVIDD 360

Query: 2313 RLKVWDENDQPRVHVVPAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDGLLQ 2134
            R+ VWDE DQ RVHVVPAFAPYYAPQAEANNAVP+LCVARNVACNVRGGFFKEFD+GLLQ
Sbjct: 361  RMNVWDEKDQSRVHVVPAFAPYYAPQAEANNAVPILCVARNVACNVRGGFFKEFDEGLLQ 420

Query: 2133 RIPEVAYEDDVRDIPSPPDVSNYLVSEDDASALNVNKDPLRSDGMADSEVERRLKEAILA 1954
            +IPEVAYEDD  +IPSPPDVSNYLVSEDDASA N N+DP   D  AD+EVERRLKEA+ A
Sbjct: 421  KIPEVAYEDDTSNIPSPPDVSNYLVSEDDASAANGNRDPPSFDSTADAEVERRLKEAVSA 480

Query: 1953 S----SVVPSAVANLDPRLGSSLQYTVASSIPIL-------------LPTSQTSVMPLAN 1825
            S    S +PS V++LDPRL  SLQY VASS  ++             +P SQTS+MP  N
Sbjct: 481  SSTIPSTIPSTVSSLDPRLLQSLQYAVASSSSLMPASQPSMLASQQPVPASQTSMMPFPN 540

Query: 1824 KQFPQVASLIKPLVHVGPSESSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDNRDHT 1645
             QFPQVA L+K L  V   E SLQSSPAREEGEVPESELDPDTRRRLLILQHGQD+RD+ 
Sbjct: 541  TQFPQVAPLVKQLGQVVHPEPSLQSSPAREEGEVPESELDPDTRRRLLILQHGQDSRDNA 600

Query: 1644 SSEPPFPVRPSLQVSVPRVQSRGNWFPVEEEMSPRQLNRAVPKEFPLDSEVMHIDKQRGH 1465
             SE PFP RPS  VS   VQSRG+W PVEEEM+PRQLNR  P+EFPLDS+ M+I+K + H
Sbjct: 601  PSESPFPARPSAPVSAAHVQSRGSWVPVEEEMTPRQLNR-TPREFPLDSDPMNIEKHQTH 659

Query: 1464 PQSFFHKVESSIQCDRILHENQRLPKETLHKDDRLRLNHPLSSFHSFPGEEAPLGRSSSS 1285
              SFF KVES+I  DR++HENQRLPKE  +++DR+RLNH   ++HSF  EE PL R SSS
Sbjct: 660  HPSFFPKVESNIPSDRMIHENQRLPKEAPYRNDRMRLNHSTPNYHSFQVEETPLSR-SSS 718

Query: 1284 NRNLDVESGRSVPYVETPVGMLQDIATKCGTKVEFRPALVAAAELQFSIEVWFAGEKIGE 1105
            NR+LD+ES R+    ETPV +LQ+IA KC TKVEFRPALVA+ +LQFSIE WFAGEK+GE
Sbjct: 719  NRDLDLESERAFTISETPVEVLQEIAMKCETKVEFRPALVASIDLQFSIEAWFAGEKVGE 778

Query: 1104 GIGRTRREAQHQAAQGSLMHLANRYL---------------SHVNTNDNGFVSDGHSSGY 970
            G G+TRREAQ QAA+GS+  LA  Y+                + + NDNGF+ + +  G 
Sbjct: 779  GTGKTRREAQRQAAEGSIKKLAGIYMLRAKPDSGPMHGDSSRYPSANDNGFLGNMNLFGN 838

Query: 969  QQFLKEESLSFSTTSDPSRQPDPRLDSSKKSMGSVSALKELCMMEGLGVVFQGQPPPS-N 793
            Q   K+E +++S  S+PSR  DPRL+ SKKS GSV+ALKE C MEGL V F  Q P S N
Sbjct: 839  QPLPKDELVAYSAASEPSRLLDPRLEGSKKSSGSVTALKEFCTMEGLVVNFLAQTPLSAN 898

Query: 792  SIQKDEVYAQVEVDGQVLGKGIGLTWDEAKMQAAEKALGSLKSMHGGQYSHKRQGSPRSF 613
            SI  +EV+AQVE+DGQVLGKGIG TWDEAKMQAAEKALGSL++M  GQY+ KRQGSPR  
Sbjct: 899  SIPGEEVHAQVEIDGQVLGKGIGSTWDEAKMQAAEKALGSLRTMF-GQYTQKRQGSPRPM 957

Query: 612  QGMSSKRFKSEFPRAVQRMPSASRYPKNASPVP 514
            QGM +KR K EFPR +QRMP ++RY KNA PVP
Sbjct: 958  QGMPNKRLKQEFPRVLQRMPPSARYHKNAPPVP 990


>ref|XP_007159305.1| hypothetical protein PHAVU_002G226900g [Phaseolus vulgaris]
            gi|561032720|gb|ESW31299.1| hypothetical protein
            PHAVU_002G226900g [Phaseolus vulgaris]
          Length = 964

 Score = 1316 bits (3407), Expect = 0.0
 Identities = 679/970 (70%), Positives = 786/970 (81%), Gaps = 30/970 (3%)
 Frame = -3

Query: 3333 MYKSVVYRGDDLLGEVEIYPQNQNV-DMRDREIRISHFSQSSERCPPLAVLHTITSCGVC 3157
            MYKSVVY+G+ +LGEVE+YP+  N  +   +EIRISHFSQ SERCPPLAVLHT+TSCGVC
Sbjct: 1    MYKSVVYQGEVVLGEVEVYPEENNYKNFHVKEIRISHFSQPSERCPPLAVLHTVTSCGVC 60

Query: 3156 FKMESPNSQSQDSPLFVLHSSCLRENKTAVMSLGEEELHLVAMYSRKNERQSPCFWGFII 2977
            FKMES  +Q QD  LF LHS C+RENKTAV+ LG EE+HLVAM+SR ++R  P FWGFI+
Sbjct: 61   FKMES-KTQQQDG-LFHLHSLCIRENKTAVIPLGGEEIHLVAMHSRNDDR--PRFWGFIV 116

Query: 2976 ASGLYNSCLVMLNLRCLGIVFDLDETLIVANTWRSFEDRIEALQRKISTETDLQRIAGMQ 2797
            A GLY+SCLVMLNLRCLGIVFDLDETLIVANT RSFEDRI+ALQRKI++E D QRI+GMQ
Sbjct: 117  ALGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISGMQ 176

Query: 2796 AEVKRYQDDKIILKQYAENDQVIENGKVIKVQSEVVPALSVNHQPIVRPIIRLQEKNVIL 2617
            AEVKRYQ+DK ILKQYAENDQV++NG+V+KVQSE+VPALS NHQPIVRP+IRLQ+KN+IL
Sbjct: 177  AEVKRYQEDKNILKQYAENDQVVDNGRVVKVQSEIVPALSDNHQPIVRPLIRLQDKNIIL 236

Query: 2616 TRINPKVRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESN 2437
            TRINP++RDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP+SN
Sbjct: 237  TRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSN 296

Query: 2436 LINSKELLDRIVCVKSGLEKSLFNVFQKGNCHPKMALVIDDRLKVWDENDQPRVHVVPAF 2257
            LINSKELL RIVCVKSGL+KSLFNVFQ G CHPKMALVIDDRLKVWDE DQPRVHVVPAF
Sbjct: 297  LINSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAF 356

Query: 2256 APYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDGLLQRIPEVAYEDDVRDIPSPPD 2077
            APYYAPQAEA+N++PVLCVARNVACNVRGGFFKEFDDGLLQ+IP+VAYEDD++DIP PPD
Sbjct: 357  APYYAPQAEASNSIPVLCVARNVACNVRGGFFKEFDDGLLQKIPQVAYEDDIKDIPIPPD 416

Query: 2076 VSNYLVSEDDASAL--NVNKDPLRSDGMADSEVERRLK---------EAILASSVVPSAV 1930
            VSNYLVSEDD S+   N N+DP   D M D+EVER+ K         +A+ A+S +P   
Sbjct: 417  VSNYLVSEDDGSSAISNGNRDPFLFDSMGDAEVERKSKVPTRAPNEHDALSAASTIPVTT 476

Query: 1929 ANLDPRLGSSLQYTVASSIPILLPTSQTSVMPLANKQFPQVASLIKPLVHVGPSESSLQS 1750
            ANLDPRL +SLQY + SS     PT+Q S+MP  + QFPQ A+L+KP+    PSESSL S
Sbjct: 477  ANLDPRL-TSLQYAMVSSGSAPPPTAQASMMPFTHVQFPQPAALVKPMGQAAPSESSLHS 535

Query: 1749 SPAREEGEVPESELDPDTRRRLLILQHGQDNRDHTSSEPPFPVRPSLQVSVPRVQSRGNW 1570
            SPAREEGEVPESELDPDTRRRLLILQHGQD RDHTS+EP + +R  + VS PRV SRG W
Sbjct: 536  SPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTSNEPTYAIRHPVPVSAPRVSSRGGW 595

Query: 1569 FPVEEEMSPRQLNRAVPKEFPLDSEVMHIDKQRGHPQSFFHKVESSIQCDRILHE-NQRL 1393
            FP EE++  + LNR VPKEF +DS  + I+K R H  SFF KVESSI  DRILH+ +QRL
Sbjct: 596  FPAEEDIGSQPLNRVVPKEFSVDSGSLVIEKHRPHHPSFFSKVESSISSDRILHDSHQRL 655

Query: 1392 PKETLHKDDRLRLNHPLSSFHSFPGEEAPLGRSSSSNRNLDVESGRSVPYVETPVGMLQD 1213
            PKE  H+DDR R NH LSS+ S   +E P  RSSSS+R+LD ES  SV + +TPV +LQ+
Sbjct: 656  PKEMYHRDDRPRSNHMLSSYRSLSVDEIPFSRSSSSHRDLDSESSHSVFHADTPVVVLQE 715

Query: 1212 IATKCGTKVEFRPALVAAAELQFSIEVWFAGEKIGEGIGRTRREAQHQAAQGSLMHLANR 1033
            IA KCGTKVEF  +LVA+ ELQFSIE WF+G+KIG G GRTR+EAQH+AA+ S+ HLA+ 
Sbjct: 716  IALKCGTKVEFMSSLVASTELQFSIEAWFSGKKIGHGFGRTRKEAQHKAAEDSIKHLADI 775

Query: 1032 YLSHV---------------NTNDNGFVSDGHSSGYQQFLKEESLSFSTTSDPSRQPDPR 898
            YLS                 N NDNG++    S   Q   KE+S SFST SDPSR  DPR
Sbjct: 776  YLSSAKDEPGSTYGDVGGFPNANDNGYMVIASSLSNQPLPKEDSASFSTASDPSRVLDPR 835

Query: 897  LDSSKKSMGSVSALKELCMMEGLGVVFQGQPPP--SNSIQKDEVYAQVEVDGQVLGKGIG 724
            L+ SK+ MGS+SALKELCMMEGLGV F   P P  +NS+QKDEV+AQVE+DG+V GKGIG
Sbjct: 836  LEVSKRPMGSISALKELCMMEGLGVNFLSAPAPVSTNSLQKDEVHAQVEIDGKVFGKGIG 895

Query: 723  LTWDEAKMQAAEKALGSLKSMHGGQYSHKRQGSPRSFQGMSSKRFKSEFPRAVQRMPSAS 544
            LTWDEAKMQAAEKALGSL+S   GQ   KRQ SPRS QG S+KR K E+PRA+QR+PS++
Sbjct: 896  LTWDEAKMQAAEKALGSLRSKL-GQSIQKRQSSPRSHQGFSNKRLKQEYPRAMQRIPSST 954

Query: 543  RYPKNASPVP 514
            RYP+NA P+P
Sbjct: 955  RYPRNAPPIP 964


>ref|XP_004505032.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Cicer arietinum]
          Length = 951

 Score = 1300 bits (3365), Expect = 0.0
 Identities = 666/956 (69%), Positives = 767/956 (80%), Gaps = 16/956 (1%)
 Frame = -3

Query: 3333 MYKSVVYRGDDLLGEVEIYPQNQNVDMRDREIRISHFSQSSERCPPLAVLHTITSCGVCF 3154
            MYKS+VY+G+ +LGEV+IYP+  N +   +EIRISHF+Q SERC PLAVLHTITS GVCF
Sbjct: 1    MYKSLVYQGEVVLGEVDIYPEVNNNNKNFKEIRISHFTQPSERCLPLAVLHTITSSGVCF 60

Query: 3153 KMESPNSQSQDSPLFVLHSSCLRENKTAVMSLGEEELHLVAMYSRKNERQSPCFWGFIIA 2974
            KMES  +Q QD PLF LH+ C RENKTAVM L  EE+HLVAM+SR N R  PCFWG+I+ 
Sbjct: 61   KMES-KTQQQD-PLFHLHNLCFRENKTAVMPLCGEEMHLVAMHSRSNGR--PCFWGYIVG 116

Query: 2973 SGLYNSCLVMLNLRCLGIVFDLDETLIVANTWRSFEDRIEALQRKISTETDLQRIAGMQA 2794
             GLYNSCL+MLNLRCLGIVFDLDETLIVANT RSFEDRI+ALQRKI++E D QRI+GMQA
Sbjct: 117  MGLYNSCLMMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRISGMQA 176

Query: 2793 EVKRYQDDKIILKQYAENDQVIENGKVIKVQSEVVPALSVNHQPIVRPIIRLQEKNVILT 2614
            EVKRY +DK ILKQY ENDQV++NGKV+K QSE+VPALS +HQPIVRP+IRL EKN+ILT
Sbjct: 177  EVKRYLEDKSILKQYVENDQVVDNGKVLKAQSELVPALSDSHQPIVRPLIRLHEKNIILT 236

Query: 2613 RINPKVRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPESNL 2434
            RINP++RDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP+SNL
Sbjct: 237  RINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPDSNL 296

Query: 2433 INSKELLDRIVCVKSGLEKSLFNVFQKGNCHPKMALVIDDRLKVWDENDQPRVHVVPAFA 2254
            INSKELL RIVCVKSGL+KSLFNVFQ G CHPKMALVIDDRLKVWDE DQPRVHVVPAFA
Sbjct: 297  INSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVVPAFA 356

Query: 2253 PYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDGLLQRIPEVAYEDDVRDIPSPPDV 2074
            PYYAPQAEA+N +PVLCVARNVACNVRGGFFK+FDDGLLQ+I ++AYE++ RDI   PDV
Sbjct: 357  PYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKISQIAYENNTRDISPAPDV 416

Query: 2073 SNYLVSEDDASALNVNKDPLRSDGMADSEVERRLKEAILASSVVPSAVANLDPRLGSSLQ 1894
            SNYLVSEDD SA   N+DP   DGMAD+EVER+LK+AI A+S +P   A LDPRL SSLQ
Sbjct: 417  SNYLVSEDDGSASYANRDPFAFDGMADAEVERKLKDAISAASAIPMTTAKLDPRLTSSLQ 476

Query: 1893 YTVASSIPILLPTSQTSVMPLANKQFPQVASLIKPLVHVGPSESSLQSSPAREEGEVPES 1714
            YT+ S   +L P +Q S++PL + QFPQ A+L+KP+  V PSE SL SSPAREEGEVPES
Sbjct: 477  YTMVSPGSVLPPAAQASMIPLPHTQFPQPATLVKPIGQVAPSELSLHSSPAREEGEVPES 536

Query: 1713 ELDPDTRRRLLILQHGQDNRDHTSSEPPFPVRPSLQVSVPRVQSRGNWFPVEEEMSPRQL 1534
            ELDPDTRRRLLILQHGQDNRDHTSSEPPFP++  +QVS  RV  RG WFPVEEE+  +  
Sbjct: 537  ELDPDTRRRLLILQHGQDNRDHTSSEPPFPLKHPVQVSA-RVPPRGGWFPVEEEIGSQPP 595

Query: 1533 NRAVPKEFPLDSEVMHIDKQRGHPQSFFHKVESSIQCDRILHE-NQRLPKETLHKDDRLR 1357
            NR +PKE  LDS    I+K R H Q FF KV+ SI  DR LHE NQRLPKE  H+DDR R
Sbjct: 596  NRVIPKEIALDSGPSRIEKHRLHQQPFFPKVDGSISSDRALHETNQRLPKEMYHRDDRSR 655

Query: 1356 LNHPLSSFHSFPGEEAPLGRSSSSNRNLDVESGRSVPYVETPVGMLQDIATKCGTKVEFR 1177
            ++H LSS+ S  G++ P GRSSSS+R+ D ESG SV   ETP  +LQ+IA KCGTKVEF 
Sbjct: 656  VSHMLSSYPSLSGDDTPFGRSSSSHRDFDSESGHSVFNAETPAIVLQEIALKCGTKVEFT 715

Query: 1176 PALVAAAELQFSIEVWFAGEKIGEGIGRTRREAQHQAAQGSLMHLANRYLSHV------- 1018
             +L A+ ELQFSIE WF+G+KIG G GRTR EAQ++AA+ S+ HLA+ YLS         
Sbjct: 716  SSLAASRELQFSIEAWFSGKKIGHGFGRTRMEAQYKAAEDSIKHLADIYLSRAKDESGSA 775

Query: 1017 --------NTNDNGFVSDGHSSGYQQFLKEESLSFSTTSDPSRQPDPRLDSSKKSMGSVS 862
                    N NDNG+V +  S G Q   KEES+SFS  SDPSR  DPRLD SK+SMGSVS
Sbjct: 776  FGDVSGFPNANDNGYVGNVSSLGNQPLPKEESVSFSAASDPSRVLDPRLDVSKRSMGSVS 835

Query: 861  ALKELCMMEGLGVVFQGQPPPSNSIQKDEVYAQVEVDGQVLGKGIGLTWDEAKMQAAEKA 682
            ALKELCM+EGLGV F   P P ++   DEV+AQVE+DGQV GKG G+TWDEAKMQAAEKA
Sbjct: 836  ALKELCMVEGLGVNFLSLPAPVSTNSVDEVHAQVEIDGQVYGKGTGITWDEAKMQAAEKA 895

Query: 681  LGSLKSMHGGQYSHKRQGSPRSFQGMSSKRFKSEFPRAVQRMPSASRYPKNASPVP 514
            LGSL++   GQ   +RQ SPR FQG+S+KR K E PR +QR  S+ RYP+NA P+P
Sbjct: 896  LGSLRTTIHGQGIQRRQLSPRPFQGLSNKRLKQEHPRTLQRFASSGRYPRNAPPIP 951


>ref|XP_006583810.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X2 [Glycine max]
          Length = 929

 Score = 1289 bits (3336), Expect = 0.0
 Identities = 666/963 (69%), Positives = 772/963 (80%), Gaps = 21/963 (2%)
 Frame = -3

Query: 3339 LKMYKSVVYRGDDLLGEVEIYPQNQN--VDMRDREIRISHFSQSSERCPPLAVLHTITSC 3166
            ++MYKSVVY+G+ ++GEV++YP+  N   +   +EIRISHFSQ SERCPPLAVLHT+TSC
Sbjct: 1    MRMYKSVVYQGEVVVGEVDVYPEENNNYKNFHVKEIRISHFSQPSERCPPLAVLHTVTSC 60

Query: 3165 GVCFKMESPNSQSQDSPLFVLHSSCLRENKTAVMSLGEEELHLVAMYSRKNERQSPCFWG 2986
            GVCFKMES  +Q QD  LF LHS C+RENKTAVM LG EE+HLVAM+SR  +R  PCFWG
Sbjct: 61   GVCFKMES-KTQQQDG-LFQLHSLCIRENKTAVMPLGGEEIHLVAMHSRNVDR--PCFWG 116

Query: 2985 FIIASGLYNSCLVMLNLRCLGIVFDLDETLIVANTWRSFEDRIEALQRKISTETDLQRIA 2806
            FI+A GLY+SCLVMLNLRCLGIVFDLDETLIVANT RSFEDRI+ALQRKI++E D QRI+
Sbjct: 117  FIVALGLYDSCLVMLNLRCLGIVFDLDETLIVANTMRSFEDRIDALQRKINSEVDPQRIS 176

Query: 2805 GMQAEVKRYQDDKIILKQYAENDQVIENGKVIKVQSEVVPALSVNHQPIVRPIIRLQEKN 2626
            GMQAEVKRYQDDK ILKQYAENDQV++NG+VIKVQSE+VPALS +HQPIVRP+IRLQ+KN
Sbjct: 177  GMQAEVKRYQDDKNILKQYAENDQVVDNGRVIKVQSEIVPALSDSHQPIVRPLIRLQDKN 236

Query: 2625 VILTRINPKVRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP 2446
            +ILTRINP++RDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP
Sbjct: 237  IILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP 296

Query: 2445 ESNLINSKELLDRIVCVKSGLEKSLFNVFQKGNCHPKMALVIDDRLKVWDENDQPRVHVV 2266
            +SNLINSKELL RIVCVKSGL+KSLFNVFQ G CHPKMALVIDDRLKVWDE DQPRVHVV
Sbjct: 297  DSNLINSKELLGRIVCVKSGLKKSLFNVFQDGLCHPKMALVIDDRLKVWDEKDQPRVHVV 356

Query: 2265 PAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDGLLQRIPEVAYEDDVRDIPS 2086
            PAFAPYYAPQAEA+N +PVLCVARNVACNVRGGFFK+FDDGLLQ+IP++AYEDD++DIPS
Sbjct: 357  PAFAPYYAPQAEASNTIPVLCVARNVACNVRGGFFKDFDDGLLQKIPQIAYEDDIKDIPS 416

Query: 2085 PPDVSNYLVSEDDASALNVNKDPLRSDGMADSEVERRLKEAILASSVVPSAVANLDPRLG 1906
            PPDVSNYLVSEDD S  N ++DP   DGMAD+EVER+LK+A+ A+S +P   ANLDPRL 
Sbjct: 417  PPDVSNYLVSEDDGSISNGHRDPFLFDGMADAEVERKLKDALSAASTIPVTTANLDPRL- 475

Query: 1905 SSLQYTVASSIPILLPTSQTSVMPLANKQFPQVASLIKPLVHVGPSESSLQSSPAREEGE 1726
            +SLQYT+  S  +  PT+Q S+MP  + QFPQ A+L+KP+    PSE SL SSPAREEGE
Sbjct: 476  TSLQYTMVPSGSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSEPSLHSSPAREEGE 535

Query: 1725 VPESELDPDTRRRLLILQHGQDNRDHTSSEPPFPVRPSLQVSVPRV-QSRGNWFPVEEEM 1549
            VPESELDPDTRRRLLILQHGQD RDH S+EPPFPVR  +Q S P V  SRG WFP EEE+
Sbjct: 536  VPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQTSAPHVPSSRGVWFPAEEEI 595

Query: 1548 SPRQLNRAVPKEFPLDSEVMHIDKQRGHPQSFFHKVESSIQCDRILHE-NQRLPKETLHK 1372
              + LNR VPKEFP+DS  + I K R H  SFF KVESSI  DRILH+ +QRLPKE  H+
Sbjct: 596  GSQPLNRVVPKEFPVDSGPLGIAKPRPHHPSFFSKVESSISSDRILHDSHQRLPKEMYHR 655

Query: 1371 DDRLRLNHPLSSFHSFPGEEAPLGRSSSSNRNLDVESGRSVPYVETPVGMLQDIATKCGT 1192
            DDR RLNH LSS+ SF                            +TPV +LQ+IA KCGT
Sbjct: 656  DDRPRLNHMLSSYRSFS---------------------------DTPVAVLQEIALKCGT 688

Query: 1191 KVEFRPALVAAAELQFSIEVWFAGEKIGEGIGRTRREAQHQAAQGSLMHLANRYLSHV-- 1018
            KV+F  +LVA+ ELQFS+E WF+G+KIG  +GRTR+EAQ++AA+ S+ HLA+ YLS    
Sbjct: 689  KVDFISSLVASTELQFSMEAWFSGKKIGHRVGRTRKEAQNKAAEDSIKHLADIYLSSAKD 748

Query: 1017 -------------NTNDNGFVSDGHSSGYQQFLKEESLSFSTTSDPSRQPDPRLDSSKKS 877
                         N ND+G++    S G Q   KE+S SFST S PSR  DPRLD SK+S
Sbjct: 749  EPGSTYGDVSGFPNVNDSGYMGIASSLGNQPLSKEDSASFSTAS-PSRVLDPRLDVSKRS 807

Query: 876  MGSVSALKELCMMEGLGVVFQGQPPP--SNSIQKDEVYAQVEVDGQVLGKGIGLTWDEAK 703
            MGS+S+LKELCMMEGL V F   P P  +NS+QKDEV+AQVE+DG+V GKGIGLTWDEAK
Sbjct: 808  MGSISSLKELCMMEGLDVNFLSAPAPVSTNSVQKDEVHAQVEIDGKVFGKGIGLTWDEAK 867

Query: 702  MQAAEKALGSLKSMHGGQYSHKRQGSPRSFQGMSSKRFKSEFPRAVQRMPSASRYPKNAS 523
            MQAAEKALGSL+S   GQ   KRQ SPR  QG S+KR K E+PR +QRMPS++RYP+NA 
Sbjct: 868  MQAAEKALGSLRSKL-GQSIQKRQSSPRPHQGFSNKRLKQEYPRPMQRMPSSARYPRNAP 926

Query: 522  PVP 514
            P+P
Sbjct: 927  PIP 929


>ref|XP_003545893.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
          Length = 958

 Score = 1287 bits (3331), Expect = 0.0
 Identities = 663/964 (68%), Positives = 780/964 (80%), Gaps = 24/964 (2%)
 Frame = -3

Query: 3333 MYKSVVYRGDDLLGEVEIYPQNQ-NVDMRDREIRISHFSQSSERCPPLAVLHTITSCGVC 3157
            M +S+VY G+  +GEVEIYP+ + N+D++  EIRISHFSQ SERCPPLAVLHTITS G+C
Sbjct: 1    MKRSMVYHGEMEVGEVEIYPEEKKNIDLK--EIRISHFSQPSERCPPLAVLHTITSFGIC 58

Query: 3156 FKMESPNSQS--QDSPLFVLHSSCLRENKTAVMSLGEEELHLVAMYSRKNERQSPCFWGF 2983
            FKMES  SQ+  Q   LF LHSSC+RENKTAVM L  EE+HLVAMYSR N+R  PCFWGF
Sbjct: 59   FKMESSTSQTRQQQDVLFHLHSSCIRENKTAVMPLRGEEIHLVAMYSRNNDR--PCFWGF 116

Query: 2982 IIASGLYNSCLVMLNLRCLGIVFDLDETLIVANTWRSFEDRIEALQRKISTETDLQRIAG 2803
            I+ASGLYNSCL MLNLRCLGIVFDLDETL+VANT RSFED+IE L RK+++E + QRI+ 
Sbjct: 117  IVASGLYNSCLTMLNLRCLGIVFDLDETLVVANTMRSFEDKIEVLHRKMNSEVNPQRIST 176

Query: 2802 MQAEVKRYQDDKIILKQYAENDQVIENGKVIKVQSEVVPALSVNHQPIVRPIIRLQEKNV 2623
            MQAE+KRY DDK ILK+YAENDQV++NGKVIK+QSE+VPALS +HQPIVRP+IRLQEKN+
Sbjct: 177  MQAEIKRYLDDKNILKEYAENDQVVDNGKVIKIQSEIVPALSDSHQPIVRPLIRLQEKNI 236

Query: 2622 ILTRINPKVRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPE 2443
            ILTRINP++RDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYALEMWRLLDPE
Sbjct: 237  ILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPE 296

Query: 2442 SNLINSKELLDRIVCVKSGLEKSLFNVFQKGNCHPKMALVIDDRLKVWDENDQPRVHVVP 2263
             NLINSKELLDRIVCVKSGL+KSLFNVFQ G CH KMALVIDDRLKVWDE DQP+VHVVP
Sbjct: 297  LNLINSKELLDRIVCVKSGLKKSLFNVFQNGLCHLKMALVIDDRLKVWDEKDQPQVHVVP 356

Query: 2262 AFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDGLLQRIPEVAYEDDVRDIPSP 2083
            AFAPYYAPQAEA+NAVP LC+AR+VACNVRGGFFK+FDDGLLQ+IP +AYEDD++DIPSP
Sbjct: 357  AFAPYYAPQAEASNAVPTLCLARSVACNVRGGFFKDFDDGLLQKIPLIAYEDDIKDIPSP 416

Query: 2082 PDVSNYLVSEDDASALNVNKDPLRSDGMADSEVERRLKEAILASSVVPSAVANLDPRL-- 1909
            PDVSNYLVSEDDASA N NK+ L  DGMAD+EVERRLK+AI ASS VP+   NLDPRL  
Sbjct: 417  PDVSNYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTVPAMTTNLDPRLAF 476

Query: 1908 GSSLQYT-VASSIPILLPTSQTSVMPLANKQFPQVASLIKPLVHVGPSESSLQSSPAREE 1732
             SSLQYT V+SS  +  PT+Q S++   N QFPQ  +L+KP+  V P   SL SSPAREE
Sbjct: 477  NSSLQYTMVSSSGTVPPPTAQASIVQFGNVQFPQPNTLVKPICQVTPPGPSLHSSPAREE 536

Query: 1731 GEVPESELDPDTRRRLLILQHGQDNRDHTSSEPPFPVRPSLQVSVPRVQSRGNWFPVEEE 1552
            GEVPESELD DTRRRLLILQHGQD R+HTSSEPP PVR   QVS P V SR  WF VEEE
Sbjct: 537  GEVPESELDLDTRRRLLILQHGQDTREHTSSEPPLPVRHPTQVSAPSVPSRRGWFSVEEE 596

Query: 1551 MSPRQLNRAVPKEFPLDSEVMHIDKQRGHPQSFFHKVESSIQCDRILHE-NQRLPKETLH 1375
            M P+QLN+ VPKEFP+ SE +HI+K+     S F KV+ S+  DR+ HE +QRLPKE  H
Sbjct: 597  MGPQQLNQLVPKEFPVGSEPLHIEKRWPRHPSLFSKVDDSVSSDRVFHESHQRLPKEVHH 656

Query: 1374 KDDRLRLNHPLSSFHSFPGEEAPLGRSSSSNRNLDVESGRSVPYVETPVGMLQDIATKCG 1195
            +DD  RL+  LSS+HSFPG++ PL  SS SNR+ D ESGRS+ + +   G+LQ+IA KCG
Sbjct: 657  RDDHSRLSQSLSSYHSFPGDDIPLSGSSYSNRDFDSESGRSLFHADITAGVLQEIALKCG 716

Query: 1194 TKVEFRPALVAAAELQFSIEVWFAGEKIGEGIGRTRREAQHQAAQGSLMHLANRYLSHV- 1018
            TKVEF  +LVA+  LQFSIE WFAG+K+GEG GRTRREAQ++AA+ S+  LA+ Y+SH  
Sbjct: 717  TKVEFLSSLVASTALQFSIEAWFAGKKVGEGFGRTRREAQNKAAECSIKQLADIYMSHAK 776

Query: 1017 --------------NTNDNGFVSDGHSSGYQQFLKEESLSFSTTSDPSRQPDPRLDSSKK 880
                           +N+NGFVS G+S G  Q L +ES+SFST+SD SR  DPRL+ SK+
Sbjct: 777  DDSGSTYGDVSGFHGSNNNGFVSSGNSLG-NQLLPKESVSFSTSSDSSRVSDPRLEVSKR 835

Query: 879  SMGSVSALKELCMMEGLGVVFQGQPPPSNS--IQKDEVYAQVEVDGQVLGKGIGLTWDEA 706
            S  S+SALKE CMMEGL   FQ  P P+++   QKDEV+AQVE+DGQ+ GKG GLTW+EA
Sbjct: 836  STDSISALKEFCMMEGLAANFQSSPAPASTHFAQKDEVHAQVEIDGQIFGKGFGLTWEEA 895

Query: 705  KMQAAEKALGSLKSMHGGQYSHKRQGSPRSFQGMSSKRFKSEFPRAVQRMPSASRYPKNA 526
            KMQAA+KAL SL++M   Q + KR GSPRS QG+++KR K E+PR +QR+P ++RYP+NA
Sbjct: 896  KMQAAKKALESLRTMF-NQGTRKRHGSPRSMQGLANKRLKQEYPRTLQRIPYSARYPRNA 954

Query: 525  SPVP 514
              VP
Sbjct: 955  PLVP 958


>ref|XP_006597421.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X3 [Glycine max]
          Length = 932

 Score = 1255 bits (3248), Expect = 0.0
 Identities = 648/952 (68%), Positives = 761/952 (79%), Gaps = 12/952 (1%)
 Frame = -3

Query: 3333 MYKSVVYRGDDLLGEVEIYPQNQ-NVDMRDREIRISHFSQSSERCPPLAVLHTITSCGVC 3157
            M +S+VY G+  +GEVEIYP+ + N+D++  EIRISHFSQ SERCPPLAVLHTITS G+C
Sbjct: 1    MKRSMVYHGEMEVGEVEIYPEEKKNIDLK--EIRISHFSQPSERCPPLAVLHTITSFGIC 58

Query: 3156 FKMESPNSQS--QDSPLFVLHSSCLRENKTAVMSLGEEELHLVAMYSRKNERQSPCFWGF 2983
            FKMES  SQ+  Q   LF LHSSC+RENKTAVM L  EE+HLVAMYSR N+R  PCFWGF
Sbjct: 59   FKMESSTSQTRQQQDVLFHLHSSCIRENKTAVMPLRGEEIHLVAMYSRNNDR--PCFWGF 116

Query: 2982 IIASGLYNSCLVMLNLRCLGIVFDLDETLIVANTWRSFEDRIEALQRKISTETDLQRIAG 2803
            I+ASGLYNSCL MLNLRCLGIVFDLDETL+VANT RSFED+IE L RK+++E + QRI+ 
Sbjct: 117  IVASGLYNSCLTMLNLRCLGIVFDLDETLVVANTMRSFEDKIEVLHRKMNSEVNPQRIST 176

Query: 2802 MQAEVKRYQDDKIILKQYAENDQVIENGKVIKVQSEVVPALSVNHQPIVRPIIRLQEKNV 2623
            MQAE+KRY DDK ILK+YAENDQV++NGKVIK+QSE+VPALS +HQPIVRP+IRLQEKN+
Sbjct: 177  MQAEIKRYLDDKNILKEYAENDQVVDNGKVIKIQSEIVPALSDSHQPIVRPLIRLQEKNI 236

Query: 2622 ILTRINPKVRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPE 2443
            ILTRINP++RDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYALEMWRLLDPE
Sbjct: 237  ILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPE 296

Query: 2442 SNLINSKELLDRIVCVKSGLEKSLFNVFQKGNCHPKMALVIDDRLKVWDENDQPRVHVVP 2263
             NLINSKELLDRIVCVKSGL+KSLFNVFQ G CH KMALVIDDRLKVWDE DQP+VHVVP
Sbjct: 297  LNLINSKELLDRIVCVKSGLKKSLFNVFQNGLCHLKMALVIDDRLKVWDEKDQPQVHVVP 356

Query: 2262 AFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDGLLQRIPEVAYEDDVRDIPSP 2083
            AFAPYYAPQAEA+NAVP LC+AR+VACNVRGGFFK+FDDGLLQ+IP +AYEDD++DIPSP
Sbjct: 357  AFAPYYAPQAEASNAVPTLCLARSVACNVRGGFFKDFDDGLLQKIPLIAYEDDIKDIPSP 416

Query: 2082 PDVSNYLVSEDDASALNVNKDPLRSDGMADSEVERRLKEAILASSVVPSAVANLDPRL-- 1909
            PDVSNYLVSEDDASA N NK+ L  DGMAD+EVERRLK+AI ASS VP+   NLDPRL  
Sbjct: 417  PDVSNYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTVPAMTTNLDPRLAF 476

Query: 1908 GSSLQYT-VASSIPILLPTSQTSVMPLANKQFPQVASLIKPLVHVGPSESSLQSSPAREE 1732
             SSLQYT V+SS  +  PT+Q S++   N QFPQ  +L+KP+  V P   SL SSPAREE
Sbjct: 477  NSSLQYTMVSSSGTVPPPTAQASIVQFGNVQFPQPNTLVKPICQVTPPGPSLHSSPAREE 536

Query: 1731 GEVPESELDPDTRRRLLILQHGQDNRDHTSSEPPFPVRPSLQVSVPRVQSRGNWFPVEEE 1552
            GEVPESELD DTRRRLLILQHGQD R+HTSSEPP PVR   QVS P V SR  WF VEEE
Sbjct: 537  GEVPESELDLDTRRRLLILQHGQDTREHTSSEPPLPVRHPTQVSAPSVPSRRGWFSVEEE 596

Query: 1551 MSPRQLNRAVPKEFPLDSEVMHIDKQRGHPQSFFHKVESSIQCDRILHE-NQRLPKETLH 1375
            M P+QLN+ VPKEFP+ SE +HI+K+     S F KV+ S+  DR+ HE +QRLPKE  H
Sbjct: 597  MGPQQLNQLVPKEFPVGSEPLHIEKRWPRHPSLFSKVDDSVSSDRVFHESHQRLPKEVHH 656

Query: 1374 KDDRLRLNHPLSSFHSFPGEEAPLGRSSSSNRNLDVESGRSVPYVETPVGMLQDIATKCG 1195
            +DD  RL+  LSS+HSFPG++ PL  SS SNR+ D ESGRS+ + +   G+LQ+IA KCG
Sbjct: 657  RDDHSRLSQSLSSYHSFPGDDIPLSGSSYSNRDFDSESGRSLFHADITAGVLQEIALKCG 716

Query: 1194 TKVEFRPALVAAAELQFSIEVWFAGEKIGEGIGRTRREAQHQAAQGSLMHLANRYLSHVN 1015
            TKVEF  +LVA+  LQFSIE WFAG+K+GEG GRTRREAQ++AA+ S+  LA+ Y+SH  
Sbjct: 717  TKVEFLSSLVASTALQFSIEAWFAGKKVGEGFGRTRREAQNKAAECSIKQLADIYMSHAK 776

Query: 1014 TNDN---GFVSDGHSSGYQQFLKEESLSFSTTSDPSRQPDPRLDSSKKSMGSVSALKELC 844
             +     G VS  H S    F+                 DPRL+ SK+S  S+SALKE C
Sbjct: 777  DDSGSTYGDVSGFHGSNNNGFVSS---------------DPRLEVSKRSTDSISALKEFC 821

Query: 843  MMEGLGVVFQGQPPPSNS--IQKDEVYAQVEVDGQVLGKGIGLTWDEAKMQAAEKALGSL 670
            MMEGL   FQ  P P+++   QKDEV+AQVE+DGQ+ GKG GLTW+EAKMQAA+KAL SL
Sbjct: 822  MMEGLAANFQSSPAPASTHFAQKDEVHAQVEIDGQIFGKGFGLTWEEAKMQAAKKALESL 881

Query: 669  KSMHGGQYSHKRQGSPRSFQGMSSKRFKSEFPRAVQRMPSASRYPKNASPVP 514
            ++M   Q + KR GSPRS QG+++KR K E+PR +QR+P ++RYP+NA  VP
Sbjct: 882  RTMF-NQGTRKRHGSPRSMQGLANKRLKQEYPRTLQRIPYSARYPRNAPLVP 932


>ref|XP_006597420.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X2 [Glycine max]
          Length = 937

 Score = 1255 bits (3248), Expect = 0.0
 Identities = 652/963 (67%), Positives = 765/963 (79%), Gaps = 23/963 (2%)
 Frame = -3

Query: 3333 MYKSVVYRGDDLLGEVEIYPQNQ-NVDMRDREIRISHFSQSSERCPPLAVLHTITSCGVC 3157
            M +S+VY G+  +GEVEIYP+ + N+D++  EIRISHFSQ SERCPPLAVLHTITS G+C
Sbjct: 1    MKRSMVYHGEMEVGEVEIYPEEKKNIDLK--EIRISHFSQPSERCPPLAVLHTITSFGIC 58

Query: 3156 FKMESPNSQS--QDSPLFVLHSSCLRENKTAVMSLGEEELHLVAMYSRKNERQSPCFWGF 2983
            FKMES  SQ+  Q   LF LHSSC+RENKTAVM L  EE+HLVAMYSR N+R  PCFWGF
Sbjct: 59   FKMESSTSQTRQQQDVLFHLHSSCIRENKTAVMPLRGEEIHLVAMYSRNNDR--PCFWGF 116

Query: 2982 IIASGLYNSCLVMLNLRCLGIVFDLDETLIVANTWRSFEDRIEALQRKISTETDLQRIAG 2803
            I+ASGLYNSCL MLNLRCLGIVFDLDETL+VANT RSFED+IE L RK+++E + QRI+ 
Sbjct: 117  IVASGLYNSCLTMLNLRCLGIVFDLDETLVVANTMRSFEDKIEVLHRKMNSEVNPQRIST 176

Query: 2802 MQAEVKRYQDDKIILKQYAENDQVIENGKVIKVQSEVVPALSVNHQPIVRPIIRLQEKNV 2623
            MQAE+KRY DDK ILK+YAENDQV++NGKVIK+QSE+VPALS +HQPIVRP+IRLQEKN+
Sbjct: 177  MQAEIKRYLDDKNILKEYAENDQVVDNGKVIKIQSEIVPALSDSHQPIVRPLIRLQEKNI 236

Query: 2622 ILTRINPKVRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPE 2443
            ILTRINP++RDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYALEMWRLLDPE
Sbjct: 237  ILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPE 296

Query: 2442 SNLINSKELLDRIVCVKSGLEKSLFNVFQKGNCHPKMALVIDDRLKVWDENDQPRVHVVP 2263
             NLINSKELLDRIVCVKSGL+KSLFNVFQ G CH KMALVIDDRLKVWDE DQP+VHVVP
Sbjct: 297  LNLINSKELLDRIVCVKSGLKKSLFNVFQNGLCHLKMALVIDDRLKVWDEKDQPQVHVVP 356

Query: 2262 AFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDGLLQRIPEVAYEDDVRDIPSP 2083
            AFAPYYAPQAEA+NAVP LC+AR+VACNVRGGFFK+FDDGLLQ+IP +AYEDD++DIPSP
Sbjct: 357  AFAPYYAPQAEASNAVPTLCLARSVACNVRGGFFKDFDDGLLQKIPLIAYEDDIKDIPSP 416

Query: 2082 PDVSNYLVSEDDASALNVNKDPLRSDGMADSEVERRLKEAILASSVVPSAVANLDPRL-- 1909
            PDVSNYLVSEDDASA N NK+ L  DGMAD+EVERRLK+AI ASS VP+   NLDPRL  
Sbjct: 417  PDVSNYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTVPAMTTNLDPRLAF 476

Query: 1908 GSSLQYT-VASSIPILLPTSQTSVMPLANKQFPQVASLIKPLVHVGPSESSLQSSPAREE 1732
             SSLQYT V+SS  +  PT+Q S++   N QFPQ  +L+KP+  V P   SL SSPAREE
Sbjct: 477  NSSLQYTMVSSSGTVPPPTAQASIVQFGNVQFPQPNTLVKPICQVTPPGPSLHSSPAREE 536

Query: 1731 GEVPESELDPDTRRRLLILQHGQDNRDHTSSEPPFPVRPSLQVSVPRVQSRGNWFPVEEE 1552
            GEVPESELD DTRRRLLILQHGQD R+HTSSEPP PVR   QVS P V SR  WF VEEE
Sbjct: 537  GEVPESELDLDTRRRLLILQHGQDTREHTSSEPPLPVRHPTQVSAPSVPSRRGWFSVEEE 596

Query: 1551 MSPRQLNRAVPKEFPLDSEVMHIDKQRGHPQSFFHKVESSIQCDRILHENQRLPKETLHK 1372
            M P+QLN+ VPKEFP+ SE +HI+K+     S F KV                     H+
Sbjct: 597  MGPQQLNQLVPKEFPVGSEPLHIEKRWPRHPSLFSKVH--------------------HR 636

Query: 1371 DDRLRLNHPLSSFHSFPGEEAPLGRSSSSNRNLDVESGRSVPYVETPVGMLQDIATKCGT 1192
            DD  RL+  LSS+HSFPG++ PL  SS SNR+ D ESGRS+ + +   G+LQ+IA KCGT
Sbjct: 637  DDHSRLSQSLSSYHSFPGDDIPLSGSSYSNRDFDSESGRSLFHADITAGVLQEIALKCGT 696

Query: 1191 KVEFRPALVAAAELQFSIEVWFAGEKIGEGIGRTRREAQHQAAQGSLMHLANRYLSHV-- 1018
            KVEF  +LVA+  LQFSIE WFAG+K+GEG GRTRREAQ++AA+ S+  LA+ Y+SH   
Sbjct: 697  KVEFLSSLVASTALQFSIEAWFAGKKVGEGFGRTRREAQNKAAECSIKQLADIYMSHAKD 756

Query: 1017 -------------NTNDNGFVSDGHSSGYQQFLKEESLSFSTTSDPSRQPDPRLDSSKKS 877
                          +N+NGFVS G+S G  Q L +ES+SFST+SD SR  DPRL+ SK+S
Sbjct: 757  DSGSTYGDVSGFHGSNNNGFVSSGNSLG-NQLLPKESVSFSTSSDSSRVSDPRLEVSKRS 815

Query: 876  MGSVSALKELCMMEGLGVVFQGQPPPSNS--IQKDEVYAQVEVDGQVLGKGIGLTWDEAK 703
              S+SALKE CMMEGL   FQ  P P+++   QKDEV+AQVE+DGQ+ GKG GLTW+EAK
Sbjct: 816  TDSISALKEFCMMEGLAANFQSSPAPASTHFAQKDEVHAQVEIDGQIFGKGFGLTWEEAK 875

Query: 702  MQAAEKALGSLKSMHGGQYSHKRQGSPRSFQGMSSKRFKSEFPRAVQRMPSASRYPKNAS 523
            MQAA+KAL SL++M   Q + KR GSPRS QG+++KR K E+PR +QR+P ++RYP+NA 
Sbjct: 876  MQAAKKALESLRTMF-NQGTRKRHGSPRSMQGLANKRLKQEYPRTLQRIPYSARYPRNAP 934

Query: 522  PVP 514
             VP
Sbjct: 935  LVP 937


>ref|XP_003543063.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
            gi|571500215|ref|XP_006594604.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 1-like
            isoform X2 [Glycine max]
          Length = 960

 Score = 1253 bits (3241), Expect = 0.0
 Identities = 658/966 (68%), Positives = 768/966 (79%), Gaps = 26/966 (2%)
 Frame = -3

Query: 3333 MYKSVVYRGDDLLGEVEIYPQ-NQNVDMRDREIRISHFSQSSERCPPLAVLHTITSCGVC 3157
            M  S+VY G+  +GEV+IYP+ N+N+D++  EIRISHFSQ SERCPPLAVLHTITS G+C
Sbjct: 1    MPTSMVYHGEMAVGEVKIYPEENKNMDLK--EIRISHFSQPSERCPPLAVLHTITSFGIC 58

Query: 3156 FKMESPNSQS--QDSPLFVLHSSCLRENKTAVMSLGEEELHLVAMYSRKNERQSPCFWGF 2983
            FKMES  SQ   Q   LF LHSSC+RENKTAVM +  EE+HLVAMYSR N+R  PCFWGF
Sbjct: 59   FKMESSTSQKRQQQDALFHLHSSCIRENKTAVMPVRGEEIHLVAMYSRNNDR--PCFWGF 116

Query: 2982 IIASGLYNSCLVMLNLRCLGIVFDLDETLIVANTWRSFEDRIEALQRKISTETDLQRIAG 2803
            I+ASGLYNSCL MLNLRCLGIVFDLDETL+VANT RSFED+IE L RK+++E + Q+I+ 
Sbjct: 117  IVASGLYNSCLTMLNLRCLGIVFDLDETLVVANTMRSFEDKIEVLHRKMNSEVNPQQISA 176

Query: 2802 MQAEVKRYQDDKIILKQYAENDQVIENGKVIKVQSEVVPALSVNHQPIVRPIIRLQEKNV 2623
            MQAE+KRY DDK ILK+YAENDQV++NGKVIK+QSE VPALS +HQPIVRP+IRLQEKN+
Sbjct: 177  MQAEIKRYLDDKNILKEYAENDQVVDNGKVIKIQSESVPALSDSHQPIVRPLIRLQEKNI 236

Query: 2622 ILTRINPKVRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDPE 2443
            ILTRINP++RDTSVLVRLRPAWEDLRSYLTARGRKRFEV+VCTMAERDYALEMWRLLDPE
Sbjct: 237  ILTRINPQIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVFVCTMAERDYALEMWRLLDPE 296

Query: 2442 SNLINSKELLDRIVCVKSGLEKSLFNVFQKGNCHPKMALVIDDRLKVWDENDQPRVHVVP 2263
             NLINSKELLDRIVCVKSGL+KSLFNVFQ G CH KMALVIDDRLKVWDE DQPRVHVVP
Sbjct: 297  LNLINSKELLDRIVCVKSGLKKSLFNVFQNGLCHLKMALVIDDRLKVWDEKDQPRVHVVP 356

Query: 2262 AFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDGLLQRIPEVAYEDDVRDIPSP 2083
            AFAPYY PQAEA+NAVP LC+ARNVACNVRGGFFK+FDDGLLQ+IP +AYEDD++DIPS 
Sbjct: 357  AFAPYYTPQAEASNAVPFLCLARNVACNVRGGFFKDFDDGLLQKIPLIAYEDDIKDIPS- 415

Query: 2082 PDVSNYLVSEDDASALNVNKDPLRSDGMADSEVERRLKEAILASSVVPSAVANLDPRLG- 1906
            PDVSNYLVSEDDASA N NK+ L  DGMAD+EVERRLK+AI ASS + +  AN+DPRL  
Sbjct: 416  PDVSNYLVSEDDASASNGNKNLLLFDGMADAEVERRLKDAISASSTILALTANIDPRLAF 475

Query: 1905 -SSLQYT-VASSIPILLPTSQTSVMPLANKQFPQVASLIKPLVHVGPSESSLQSSPAREE 1732
             SSLQYT V+SS  +  PT+Q SV+   N QFPQ  +L+KP+  V     SL SSPAREE
Sbjct: 476  TSSLQYTMVSSSGTVPPPTAQASVVQFGNVQFPQPNTLVKPMSQVTHPGLSLHSSPAREE 535

Query: 1731 GEVPESELDPDTRRRLLILQHGQDNRDHTSSEPPFPVRPSLQVSVP--RVQSRGNWFPVE 1558
            GE+PESELD DTRRR LILQHGQD R+  +SEPPFPVR   QVS P   V SR  WF VE
Sbjct: 536  GELPESELDLDTRRRFLILQHGQDTRERMASEPPFPVRHPAQVSAPASSVPSRRGWFSVE 595

Query: 1557 EEMSPRQLNRAVPKEFPLDSEVMHIDKQRGHPQSFFHKVESSIQCDRILHE-NQRLPKET 1381
            EEM P+QLN  VPKEFP+DSE  HI+K+     SFF KV  SI  DR+ HE +QRLPKE 
Sbjct: 596  EEMGPQQLNLPVPKEFPVDSEPFHIEKRWPRHPSFFSKVGDSISSDRVFHESHQRLPKEV 655

Query: 1380 LHKDDRLRLNHPLSSFHSFPGEEAPLGRSSSSNRNLDVESGRSVPYVETPVGMLQDIATK 1201
             H+DDR RL+  LSS+HS PG++ PL  SS SNR+ D ESGRS+ + +T  G+LQ+IA  
Sbjct: 656  HHRDDRSRLSQSLSSYHSLPGDDIPLSGSSYSNRDFDSESGRSLFHADTTAGVLQEIALN 715

Query: 1200 CGTKVEFRPALVAAAELQFSIEVWFAGEKIGEGIGRTRREAQHQAAQGSLMHLANRYLSH 1021
            CGTKVEF  +LVA+ ELQFSIE WFAG+KIGEG GRTRREAQ +AA  S+  LA+ Y+SH
Sbjct: 716  CGTKVEFLSSLVASTELQFSIEAWFAGKKIGEGFGRTRREAQSKAAGCSIKQLADIYMSH 775

Query: 1020 V---------------NTNDNGFVSDGHSSGYQQFLKEESLSFSTTSDPSRQPDPRLDSS 886
                             +N++GFVS G+S G Q   KEES SFST S+ SR  D RL+ S
Sbjct: 776  AKDDSGSTYGDVSGFHGSNNDGFVSSGNSLGNQLLPKEESGSFSTASESSRVSDSRLEVS 835

Query: 885  KKSMGSVSALKELCMMEGLGVVFQGQPPPSNS--IQKDEVYAQVEVDGQVLGKGIGLTWD 712
            K+S  S+SALKELCMMEGL   FQ  P  +++   QKDEV+AQVE+DGQ+ GKG G+TW+
Sbjct: 836  KRSTDSISALKELCMMEGLAASFQSPPASASTHLTQKDEVHAQVEIDGQIFGKGFGVTWE 895

Query: 711  EAKMQAAEKALGSLKSMHGGQYSHKRQGSPRSFQGMSSKRFKSEFPRAVQRMPSASRYPK 532
            EAKMQAA+KALGSL++M   Q S KR GSPRS QG+++KR K E+P  +QR+P ++RYP+
Sbjct: 896  EAKMQAAKKALGSLRTMF-NQGSLKRHGSPRSMQGLANKRLKPEYPPTLQRVPYSARYPR 954

Query: 531  NASPVP 514
            NA  VP
Sbjct: 955  NAPLVP 960


>ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Solanum tuberosum]
          Length = 953

 Score = 1241 bits (3212), Expect = 0.0
 Identities = 661/965 (68%), Positives = 767/965 (79%), Gaps = 25/965 (2%)
 Frame = -3

Query: 3333 MYKS--VVYRGDDLLGEVEIYPQNQNVDMRDREIRISHFSQSSERCPPLAVLHTITSCGV 3160
            M+KS  V+Y G+ L+GEVEIY + + V   ++ IRISH+S SSERCPPLAVLHT+T+ G+
Sbjct: 1    MFKSTVVLYEGERLVGEVEIYCE-KGVLWGEKVIRISHYSPSSERCPPLAVLHTVTT-GL 58

Query: 3159 CFKMESPNSQ--SQDSPLFVLHSSCLRENKTAVMSLGEEELHLVAMYSRKNERQSPCFWG 2986
             FK+E   S+  +QDSPL +LHS+CLR+NKTAVMSLG EELHLVAM S+    Q PCFWG
Sbjct: 59   SFKLEPTKSKPLTQDSPLTLLHSTCLRDNKTAVMSLGREELHLVAMQSKNIGGQCPCFWG 118

Query: 2985 FIIASGLYNSCLVMLNLRCLGIVFDLDETLIVANTWRSFEDRIEALQRKISTETDLQRIA 2806
            F +ASGLY+SCL MLNLRCLGIVFDLDETLIVANT RSFEDRIEALQRKI++E+D QR +
Sbjct: 119  FKVASGLYDSCLTMLNLRCLGIVFDLDETLIVANTMRSFEDRIEALQRKINSESDPQRAS 178

Query: 2805 GMQAEVKRYQDDKIILKQYAENDQVIENGKVIKVQSEVVPALSVNHQPIVRPIIRLQEKN 2626
             M AEVKRYQ+DKIILKQYAENDQV++NGKVIK QSEV PALS NHQPIVRP+IRLQ++N
Sbjct: 179  VMLAEVKRYQEDKIILKQYAENDQVVDNGKVIKSQSEVFPALSDNHQPIVRPLIRLQDRN 238

Query: 2625 VILTRINPKVRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP 2446
            +ILTRINP +RDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP
Sbjct: 239  IILTRINPMIRDTSVLVRLRPAWEDLRSYLTARGRKRFEVYVCTMAERDYALEMWRLLDP 298

Query: 2445 ESNLINSKELLDRIVCVKSGLEKSLFNVFQKGNCHPKMALVIDDRLKVWDENDQPRVHVV 2266
            +SNLINS+ELLDRIVCVKSGL KSLFNVFQ GNCHPKMALVIDDRLKVWD+ DQPRVHVV
Sbjct: 299  DSNLINSQELLDRIVCVKSGLRKSLFNVFQDGNCHPKMALVIDDRLKVWDDKDQPRVHVV 358

Query: 2265 PAFAPYYAPQAEANNAVPVLCVARNVACNVRGGFFKEFDDGLLQRIPEVAYEDDVRDIPS 2086
            PAFAPY+APQAE NN+VPVLCVARNVACNVRGGFFK+FD+GLLQRI EVAYEDD++ +PS
Sbjct: 359  PAFAPYFAPQAEGNNSVPVLCVARNVACNVRGGFFKDFDEGLLQRISEVAYEDDIKQVPS 418

Query: 2085 PPDVSNYLVSEDDASALNVNKDPLRSDGMADSEVERRLKEAILASSVVPSAVANLDPRLG 1906
             PDVSNYL+SEDD SA+N NKD L  DGMADSEVERRLKEA+LAS+ VPS + NLDPRL 
Sbjct: 419  APDVSNYLISEDDPSAVNGNKDSLGFDGMADSEVERRLKEAMLASTSVPSQMTNLDPRLV 478

Query: 1905 SSLQYTVASSIPILLPTSQTSVMPLANKQFPQVASLIKPLV-HVGPSESSLQSSPAREEG 1729
             +LQY V   I    P+ Q+ V+P   +  PQV S++K  V  + P ++SLQSSPAREEG
Sbjct: 479  PALQYPVPPVIS--QPSIQSPVVPFPTQHLPQVTSVLKSSVTQISPQDTSLQSSPAREEG 536

Query: 1728 EVPESELDPDTRRRLLILQHGQDNRDHTSSEPPFPVRPSLQVSV-PRVQSRGNWFPVEEE 1552
            EVPESELDPDTRRRLLILQHGQD RD  SSEP FP+   LQVSV PRVQ  G WFP EEE
Sbjct: 537  EVPESELDPDTRRRLLILQHGQDTRDQVSSEPKFPMGTPLQVSVPPRVQPHG-WFPAEEE 595

Query: 1551 MSPRQLNRAV-PKEFPLDSEVMHIDKQRGHPQSFFHKVESSIQCDRILHENQRLPKETLH 1375
            MSPRQLNR + PKEFPL+ E MHI+K R     F  K+E+S+  DR+L ENQRLPKE + 
Sbjct: 596  MSPRQLNRPLPPKEFPLNPESMHINKHRPPHPPFLPKMETSMPSDRVLFENQRLPKEVIP 655

Query: 1374 KDDRLRLNHPLSSFHSFPGEEAPLGRSSSSNRNLDVESGRSVPYVETPVGMLQDIATKCG 1195
            +DDR+R +    SF   PGEE PLGRSSSSNR LD+E G   PY+ETP G LQDIA KCG
Sbjct: 656  RDDRMRFSQSQPSFRP-PGEEVPLGRSSSSNRVLDLEPGHYDPYLETPAGALQDIAFKCG 714

Query: 1194 TKVEFRPALVAAAELQFSIEVWFAGEKIGEGIGRTRREAQHQAAQGSLMHLANRYLSHV- 1018
             KVEFR + +++ ELQFS+EV FAGEK+GEG GRTRREAQ +AA+ SLM+LA++YLS + 
Sbjct: 715  AKVEFRSSFLSSPELQFSLEVLFAGEKVGEGTGRTRREAQRRAAEESLMYLADKYLSCIK 774

Query: 1017 --------------NTNDNGFVSDGHSSGYQQFLKEESLSFSTTSDPSRQPDPRLDSSKK 880
                          N +DNGFV +    GYQ     + +S S  S+P R  DPRL+  KK
Sbjct: 775  PDSSSTQGDGFRFPNASDNGFVDNMSPFGYQ-----DRVSHSFASEPPRVLDPRLEVFKK 829

Query: 879  SMGSVSALKELCMMEGLGVVFQGQPPPS-NSIQKDEVYAQVEVDGQVLGKGIGLTWDEAK 703
            S+GSV AL+ELC +EGLG+ FQ QP  S N  QK E+YAQVE+DGQV GKGIG TWD+AK
Sbjct: 830  SVGSVGALRELCAIEGLGLAFQTQPQLSANPGQKSEIYAQVEIDGQVFGKGIGSTWDDAK 889

Query: 702  MQAAEKALGSLKSMHGGQYSHKRQGSPRSF-QGMSSKRFKSEFPRAV-QRMPSASRYPKN 529
             QAAE+AL +LKS    Q+S KRQGSPRS  QG S+KR K E+ R V QR+P + R+PKN
Sbjct: 890  TQAAERALVALKS-ELAQFSQKRQGSPRSLQQGFSNKRLKPEYSRGVQQRVPLSGRFPKN 948

Query: 528  ASPVP 514
             S +P
Sbjct: 949  TSAMP 953


Top