BLASTX nr result

ID: Mentha22_contig00037585 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00037585
         (1225 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU27926.1| hypothetical protein MIMGU_mgv1a000848mg [Mimulus...   469   e-130
gb|EYU43412.1| hypothetical protein MIMGU_mgv1a0014621mg, partia...   440   e-121
ref|XP_004232844.1| PREDICTED: RNA polymerase II C-terminal doma...   409   e-111
ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal doma...   408   e-111
ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citr...   391   e-106
ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal doma...   387   e-105
ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform...   374   e-101
ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal doma...   366   9e-99
ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Popu...   357   6e-96
ref|XP_002519032.1| double-stranded RNA binding protein, putativ...   345   2e-92
ref|XP_007025681.1| C-terminal domain phosphatase-like 1 isoform...   345   2e-92
ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Popu...   342   1e-91
ref|XP_007159305.1| hypothetical protein PHAVU_002G226900g [Phas...   341   4e-91
ref|XP_007214548.1| hypothetical protein PRUPE_ppa000988mg [Prun...   338   3e-90
ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal doma...   335   3e-89
ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal doma...   331   3e-88
ref|XP_003543063.1| PREDICTED: RNA polymerase II C-terminal doma...   329   1e-87
ref|XP_003545893.1| PREDICTED: RNA polymerase II C-terminal doma...   327   6e-87
ref|XP_006597421.1| PREDICTED: RNA polymerase II C-terminal doma...   320   6e-85
dbj|BAF01152.1| hypothetical protein [Arabidopsis thaliana]           320   6e-85

>gb|EYU27926.1| hypothetical protein MIMGU_mgv1a000848mg [Mimulus guttatus]
          Length = 962

 Score =  469 bits (1208), Expect = e-130
 Identities = 256/422 (60%), Positives = 307/422 (72%), Gaps = 17/422 (4%)
 Frame = +3

Query: 9    SSPATEEGEIPQSELDPDTRRRLLILQHGQDVREQPQNETQFXXXXXXXXXXXXIHPQGW 188
            SSPA EEGE+P+SELDPDTRRR+LILQHGQD+R    +E QF            + P GW
Sbjct: 532  SSPAREEGEVPESELDPDTRRRMLILQHGQDMRGPSPSEPQFPARTPMQVSVPRVQPHGW 591

Query: 189  FPVEEEMALRQLNRVS-PPLEF--RAEALPVDNIRARHSTFVHEMQPSIPPGRVR-ENQR 356
            FPVEEEM+ RQ N+V+ PP EF    E+LP+D  R  HS F+  ++PSIPPGR+  E+QR
Sbjct: 592  FPVEEEMSSRQPNQVALPPKEFPLNVESLPIDKNRGHHSPFLQNVEPSIPPGRILPESQR 651

Query: 357  LPKEELPRQDHLRLNEPLPDFRSISGEDGSMAQISSANKDLDLEAGQIDPSHLTSAGALQ 536
            LPKE +PR+D LRLN+ LPDF S  GED S+AQ SSANKD DLEAGQIDP   T  GALQ
Sbjct: 652  LPKEAVPREDQLRLNQSLPDFHSFHGEDASVAQPSSANKDFDLEAGQIDPYIETCIGALQ 711

Query: 537  EIAFKSGTKVEFKQTLVSSTELQFFVEVLFAGQKIGEGIGRTRREAQHHAAEGSLFYLAD 716
            +IAFK GTKVEFKQTL+SST LQFFVEVLFAG++IGEG+GRTRREAQ  AAEGSL YLAD
Sbjct: 712  DIAFKCGTKVEFKQTLISSTGLQFFVEVLFAGERIGEGMGRTRREAQRQAAEGSLLYLAD 771

Query: 717  KYLSQLNPDSSNMPRDGISLGKLKDNYLSDDMSS--------REATP-SRASASPRIIGL 869
            KYLS+  PD + +P DG  +G  K+N  + + +S         E  P S  +A PRI+  
Sbjct: 772  KYLSRSRPDFNYVPGDGSRVGNQKENGFNSNANSFGYQPLPNEEGLPFSTVAAPPRIVDP 831

Query: 870  RVEASKKP-TNSIFALKELCMMEGLSVAYQTRPQFLA--GQKNEVYAEVEVDGQVFGKGI 1040
            R E SK+P   SI ALKE C MEGL V +QT+PQF A  GQ+NEVYA+VEV+GQV GKGI
Sbjct: 832  RTEVSKRPIMGSITALKEFCTMEGLGVTFQTQPQFSANPGQRNEVYAQVEVNGQVLGKGI 891

Query: 1041 GLTWDEAKSKAAENALGALRPMLGQFPHKRQG-SPRLMQEISSKRLKPEPSRILQRMPSS 1217
            GLTWDEA+S+AAE AL  L+ M GQFP++ QG SPR MQ I +KR+K E +R+ QR+PS 
Sbjct: 892  GLTWDEARSQAAEKALVTLKSMPGQFPYRHQGSSPRSMQSIPNKRVKQEFNRVSQRLPSF 951

Query: 1218 TR 1223
             R
Sbjct: 952  GR 953


>gb|EYU43412.1| hypothetical protein MIMGU_mgv1a0014621mg, partial [Mimulus guttatus]
          Length = 526

 Score =  440 bits (1131), Expect = e-121
 Identities = 247/412 (59%), Positives = 290/412 (70%), Gaps = 7/412 (1%)
 Frame = +3

Query: 9    SSPATEEGEIPQSELDPDTRRRLLILQHGQDVREQPQNETQFXXXXXXXXXXXX-IHPQG 185
            SSPA EEGE+ + ELDPDTRRRLLILQHGQDVRE P +E+QF               P G
Sbjct: 135  SSPALEEGEVLEPELDPDTRRRLLILQHGQDVRESPPSESQFPARPPPMQAPTPRAPPHG 194

Query: 186  WFPVEEEMALRQLNRVSPPLEFRAEA-LPVDNIRARHSTFVHEMQPSIPPGRVRENQRLP 362
            WFP+EEEM  RQ+NR +PP++F A+   PVDNIR  H  F+H+M+ ++ PGRV ENQRLP
Sbjct: 195  WFPIEEEMNPRQVNRAAPPVDFIAQPPFPVDNIRTLHPPFLHKMEAAMSPGRVLENQRLP 254

Query: 363  KEELPRQDHLRLNEPLPDFRSISGEDGSMAQISSANKDLDLEAGQIDP-SHLTSAGALQE 539
            K+ELPR + LRL +P+PD+   SG+  ++AQ+ S NKDLDLE GQID  S  +S G L++
Sbjct: 255  KKELPRDEFLRLPQPVPDYHFFSGDGSTVAQLPSTNKDLDLEDGQIDAWSETSSTGVLED 314

Query: 540  IAFKSGTKVEFKQTLVSSTELQFFVEVLFAGQKIGEGIGRTRREAQHHAAEGSLFYLADK 719
            IAFK GTKVEF+  LV ST LQF VEV FAG+K+GEGIGRTRREAQ  AAEGSL YLADK
Sbjct: 315  IAFKCGTKVEFRHILVPSTALQFCVEVFFAGEKVGEGIGRTRREAQRQAAEGSLLYLADK 374

Query: 720  YLSQLNPDSSNMPRDGISLGKLKDNYLSDDMSSREATPSRASASPRIIGLRVEASKKPTN 899
            YLSQL PDSS MP++                   EA     +A  RI   RVEASKK  +
Sbjct: 375  YLSQLQPDSSYMPKE-------------------EAV---QTAPLRIRDPRVEASKKSMS 412

Query: 900  SIFALKELCMMEGLSVAYQTRPQF--LAGQKNEVYAEVEVDGQVFGKGIGLTWDEAKSKA 1073
            SI ALKELCM EGL VAYQT+ QF      KNEVYAEVE++GQV GKGIGLTW+EAKS+A
Sbjct: 413  SIAALKELCMREGLDVAYQTQSQFSGFRAHKNEVYAEVEINGQVLGKGIGLTWEEAKSQA 472

Query: 1074 AENALGALRPMLG-QFPHKRQGSPRLMQEISS-KRLKPEPSRILQRMPSSTR 1223
            AE A+GA+  MLG Q P+KR  SPR MQ +SS KR KP  SR L RMPSS R
Sbjct: 473  AEKAIGAMNSMLGQQAPYKRMDSPRSMQGMSSNKRFKPGYSRALHRMPSSVR 524


>ref|XP_004232844.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Solanum lycopersicum]
          Length = 954

 Score =  409 bits (1052), Expect = e-111
 Identities = 229/420 (54%), Positives = 288/420 (68%), Gaps = 13/420 (3%)
 Frame = +3

Query: 3    LHSSPATEEGEIPQSELDPDTRRRLLILQHGQDVREQPQNETQFXXXXXXXXXXXX-IHP 179
            L SSPA EEGE+P+SELDPDTRRRLLILQHGQD R+Q  +E +F             + P
Sbjct: 527  LQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDQVSSEPKFPIGTPLQVSVPPRVQP 586

Query: 180  QGWFPVEEEMALRQLNRVSPPLEF--RAEALPVDNIRARHSTFVHEMQPSIPPGRVR-EN 350
             GWFP EEE++ RQLNR  PP EF    E++ ++  R  H  F+ +M+ S+P  RV  EN
Sbjct: 587  HGWFPAEEEVSPRQLNRPLPPKEFPLNPESMHINKHRPPHPPFLPKMETSMPSDRVFFEN 646

Query: 351  QRLPKEELPRQDHLRLNEPLPDFRSISGEDGSMAQISSANKDLDLEAGQIDPSHLTSAGA 530
            QRLPKE +PR D +R ++  P FR   GED S+ + SS+N+ LDL+ G  DP   T AGA
Sbjct: 647  QRLPKEVIPRDDRMRFSQSQPSFRP-PGEDVSLGRSSSSNRVLDLDPGHYDPYLDTPAGA 705

Query: 531  LQEIAFKSGTKVEFKQTLVSSTELQFFVEVLFAGQKIGEGIGRTRREAQHHAAEGSLFYL 710
            LQ+IAFK G KVEF+ + +SS ELQF +EVLFAG+K+GEGIGRTRREAQ HAAE SL YL
Sbjct: 706  LQDIAFKCGVKVEFRSSFLSSPELQFCLEVLFAGEKVGEGIGRTRREAQRHAAEESLMYL 765

Query: 711  ADKYLSQLNPDSSNMPRDGISLGKLKDNYLSDDMS----SREATPSRASASPRIIGLRVE 878
            ADKYLS +  DSS+   DG       DN   ++MS        + S AS  PR++  R+E
Sbjct: 766  ADKYLSCIKADSSSTQGDGFRFPNASDNGFVENMSPFGYQDRVSHSFASEPPRVLDPRLE 825

Query: 879  ASKKPTNSIFALKELCMMEGLSVAYQTRPQFLA--GQKNEVYAEVEVDGQVFGKGIGLTW 1052
              KK   S+ AL+ELC +EGL +A+QT+PQ     GQK+E+YA+VE+DGQVFGKGIG TW
Sbjct: 826  VFKKSVGSVGALRELCAIEGLGLAFQTQPQLSVNPGQKSEIYAQVEIDGQVFGKGIGPTW 885

Query: 1053 DEAKSKAAENALGALRPMLGQFPHKRQGSPRLMQE--ISSKRLKPEPSR-ILQRMPSSTR 1223
            D+AK++AAE AL AL+  L QF HKRQGSPR +Q+   S+KRLKPE SR + QR+P S R
Sbjct: 886  DDAKTQAAERALVALKSELAQFSHKRQGSPRSLQQQGFSNKRLKPEYSRGVQQRVPLSGR 945


>ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Solanum tuberosum]
          Length = 953

 Score =  408 bits (1049), Expect = e-111
 Identities = 230/419 (54%), Positives = 285/419 (68%), Gaps = 12/419 (2%)
 Frame = +3

Query: 3    LHSSPATEEGEIPQSELDPDTRRRLLILQHGQDVREQPQNETQFXXXXXXXXXXXX-IHP 179
            L SSPA EEGE+P+SELDPDTRRRLLILQHGQD R+Q  +E +F             + P
Sbjct: 527  LQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDQVSSEPKFPMGTPLQVSVPPRVQP 586

Query: 180  QGWFPVEEEMALRQLNRVSPPLEF--RAEALPVDNIRARHSTFVHEMQPSIPPGRVR-EN 350
             GWFP EEEM+ RQLNR  PP EF    E++ ++  R  H  F+ +M+ S+P  RV  EN
Sbjct: 587  HGWFPAEEEMSPRQLNRPLPPKEFPLNPESMHINKHRPPHPPFLPKMETSMPSDRVLFEN 646

Query: 351  QRLPKEELPRQDHLRLNEPLPDFRSISGEDGSMAQISSANKDLDLEAGQIDPSHLTSAGA 530
            QRLPKE +PR D +R ++  P FR   GE+  + + SS+N+ LDLE G  DP   T AGA
Sbjct: 647  QRLPKEVIPRDDRMRFSQSQPSFRP-PGEEVPLGRSSSSNRVLDLEPGHYDPYLETPAGA 705

Query: 531  LQEIAFKSGTKVEFKQTLVSSTELQFFVEVLFAGQKIGEGIGRTRREAQHHAAEGSLFYL 710
            LQ+IAFK G KVEF+ + +SS ELQF +EVLFAG+K+GEG GRTRREAQ  AAE SL YL
Sbjct: 706  LQDIAFKCGAKVEFRSSFLSSPELQFSLEVLFAGEKVGEGTGRTRREAQRRAAEESLMYL 765

Query: 711  ADKYLSQLNPDSSNMPRDGISLGKLKDNYLSDDMS----SREATPSRASASPRIIGLRVE 878
            ADKYLS + PDSS+   DG       DN   D+MS        + S AS  PR++  R+E
Sbjct: 766  ADKYLSCIKPDSSSTQGDGFRFPNASDNGFVDNMSPFGYQDRVSHSFASEPPRVLDPRLE 825

Query: 879  ASKKPTNSIFALKELCMMEGLSVAYQTRPQFLA--GQKNEVYAEVEVDGQVFGKGIGLTW 1052
              KK   S+ AL+ELC +EGL +A+QT+PQ  A  GQK+E+YA+VE+DGQVFGKGIG TW
Sbjct: 826  VFKKSVGSVGALRELCAIEGLGLAFQTQPQLSANPGQKSEIYAQVEIDGQVFGKGIGSTW 885

Query: 1053 DEAKSKAAENALGALRPMLGQFPHKRQGSPR-LMQEISSKRLKPEPSR-ILQRMPSSTR 1223
            D+AK++AAE AL AL+  L QF  KRQGSPR L Q  S+KRLKPE SR + QR+P S R
Sbjct: 886  DDAKTQAAERALVALKSELAQFSQKRQGSPRSLQQGFSNKRLKPEYSRGVQQRVPLSGR 944


>ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citrus clementina]
            gi|557551913|gb|ESR62542.1| hypothetical protein
            CICLE_v10014168mg [Citrus clementina]
          Length = 957

 Score =  391 bits (1004), Expect = e-106
 Identities = 215/416 (51%), Positives = 276/416 (66%), Gaps = 9/416 (2%)
 Frame = +3

Query: 3    LHSSPATEEGEIPQSELDPDTRRRLLILQHGQDVREQPQNETQFXXXXXXXXXXXXIHPQ 182
            L SSPA EEGE+P+SELDPDTRRRLLILQHG D RE   +E  F            +  +
Sbjct: 533  LQSSPAREEGEVPESELDPDTRRRLLILQHGMDTRENAPSEAPFPARTQMQVSVPRVPSR 592

Query: 183  G-WFPVEEEMALRQLNRVSPP-LEFRAEALPVDNIRARHSTFVHEMQPSIPPGRVRENQR 356
            G WFPVEEEM+ RQLNR  P      +EA+ ++  R  H +F  +++ SI   R  ENQR
Sbjct: 593  GSWFPVEEEMSPRQLNRAVPKEFPLNSEAMQIEKHRPPHPSFFPKIENSITSDRPHENQR 652

Query: 357  LPKEELPRQDHLRLNEPLPDFRSISGEDGSMAQISSANKDLDLEAGQIDPSHLTSAGALQ 536
            +PKE L R D LRLN  L D++S SGE+  +++ SS+++D+D E+G+   S  T +G LQ
Sbjct: 653  MPKEALRRDDRLRLNHTLSDYQSFSGEEIPLSRSSSSSRDVDFESGRDVSSTETPSGVLQ 712

Query: 537  EIAFKSGTKVEFKQTLVSSTELQFFVEVLFAGQKIGEGIGRTRREAQHHAAEGSLFYLAD 716
            +IA K GTKVEF+  LV+STELQF +E  FAG+KIGEGIGRTRREAQ  AAEGS+ +LA+
Sbjct: 713  DIAMKCGTKVEFRPALVASTELQFSIEAWFAGEKIGEGIGRTRREAQRQAAEGSIKHLAN 772

Query: 717  KYLSQLNPDSSNMPRDGISLGKLKDNYLSDDMSSREATP-----SRASASPRIIGLRVEA 881
             Y+ ++  DS +   DG       +N    +++S    P     S +S   +++  R+E 
Sbjct: 773  VYVLRVKSDSGSGHGDGSRFSNANENCFMGEINSFGGQPLAKDESLSSEPSKLVDPRLEG 832

Query: 882  SKKPTNSIFALKELCMMEGLSVAYQTRPQFLAG--QKNEVYAEVEVDGQVFGKGIGLTWD 1055
            SKK   S+ ALKELCM EGL V +Q +P   A   QK+EVYA+VE+DGQV GKGIG TWD
Sbjct: 833  SKKLMGSVSALKELCMTEGLGVVFQQQPPSSANSVQKDEVYAQVEIDGQVLGKGIGSTWD 892

Query: 1056 EAKSKAAENALGALRPMLGQFPHKRQGSPRLMQEISSKRLKPEPSRILQRMPSSTR 1223
            EAK +AAE ALG+LR M GQFP K QGSPR +Q + +KRLKPE  R+LQRMP S R
Sbjct: 893  EAKMQAAEKALGSLRSMFGQFPQKHQGSPRSLQGMPNKRLKPEFPRVLQRMPPSGR 948


>ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Citrus sinensis]
          Length = 957

 Score =  387 bits (994), Expect = e-105
 Identities = 213/416 (51%), Positives = 274/416 (65%), Gaps = 9/416 (2%)
 Frame = +3

Query: 3    LHSSPATEEGEIPQSELDPDTRRRLLILQHGQDVREQPQNETQFXXXXXXXXXXXXIHPQ 182
            L SSPA EEGE+P+SELDPDTRRRLLILQHG D RE   +E  F            +  +
Sbjct: 533  LQSSPAREEGEVPESELDPDTRRRLLILQHGMDTRENAPSEAPFPARTQMQVSVPRVPSR 592

Query: 183  G-WFPVEEEMALRQLNRVSPP-LEFRAEALPVDNIRARHSTFVHEMQPSIPPGRVRENQR 356
            G WFPVEEEM+ RQLNR  P      +EA+ ++  R  H +F  +++      R  ENQR
Sbjct: 593  GSWFPVEEEMSPRQLNRAVPKEFPLNSEAMQIEKHRPPHPSFFPKIENPSTSDRPHENQR 652

Query: 357  LPKEELPRQDHLRLNEPLPDFRSISGEDGSMAQISSANKDLDLEAGQIDPSHLTSAGALQ 536
            +PKE L R D LRLN  L D++S SGE+  +++ SS+++D+D E+G+   S  T +G LQ
Sbjct: 653  MPKEALRRDDRLRLNHTLSDYQSFSGEEIPLSRSSSSSRDVDFESGRDVSSTETPSGVLQ 712

Query: 537  EIAFKSGTKVEFKQTLVSSTELQFFVEVLFAGQKIGEGIGRTRREAQHHAAEGSLFYLAD 716
            +IA K GTKVEF+  LV+STELQF +E  FAG+KIGEGIGRTRREAQ  AAEGS+ +LA+
Sbjct: 713  DIAMKCGTKVEFRPALVASTELQFSIEAWFAGEKIGEGIGRTRREAQRQAAEGSIKHLAN 772

Query: 717  KYLSQLNPDSSNMPRDGISLGKLKDNYLSDDMSSREATP-----SRASASPRIIGLRVEA 881
             Y+ ++  DS +   DG       +N    +++S    P     S +S   +++  R+E 
Sbjct: 773  VYMLRVKSDSGSGHGDGSRFSNANENCFMGEINSFGGQPLAKDESLSSEPSKLVDPRLEG 832

Query: 882  SKKPTNSIFALKELCMMEGLSVAYQTRPQFLAG--QKNEVYAEVEVDGQVFGKGIGLTWD 1055
            SKK   S+ ALKELCM EGL V +Q +P   A   QK+EVYA+VE+DGQV GKGIG TWD
Sbjct: 833  SKKLMGSVSALKELCMTEGLGVVFQQQPPSSANSVQKDEVYAQVEIDGQVLGKGIGSTWD 892

Query: 1056 EAKSKAAENALGALRPMLGQFPHKRQGSPRLMQEISSKRLKPEPSRILQRMPSSTR 1223
            EAK +AAE ALG+LR M GQFP K QGSPR +Q + +KRLKPE  R+LQRMP S R
Sbjct: 893  EAKMQAAEKALGSLRSMFGQFPQKHQGSPRSLQGMPNKRLKPEFPRVLQRMPPSGR 948


>ref|XP_007025680.1| C-terminal domain phosphatase-like 1 isoform 1 [Theobroma cacao]
            gi|508781046|gb|EOY28302.1| C-terminal domain
            phosphatase-like 1 isoform 1 [Theobroma cacao]
          Length = 978

 Score =  374 bits (961), Expect = e-101
 Identities = 216/424 (50%), Positives = 274/424 (64%), Gaps = 17/424 (4%)
 Frame = +3

Query: 3    LHSSPATEEGEIPQSELDPDTRRRLLILQHGQDVREQPQNETQFXXXXXXXXXXXXIHPQ 182
            L SSPA EEGE+P+SELDPDTRRRLLILQHGQD R+    E  F              P+
Sbjct: 550  LQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPVRPTMQVSV---PR 606

Query: 183  G-----WFPVEEEMALRQLNRVSPPLEFRAEALPVDNIRARHSTFVHEMQPSIPPGRV-R 344
            G     WF  EEEM+ RQLNR +P  EF  ++  +   + RH  F  +++ SIP  R+ R
Sbjct: 607  GQSRGSWFAAEEEMSPRQLNRAAPK-EFPLDSERMHIEKHRHPPFFPKVESSIPSDRLLR 665

Query: 345  ENQRLPKEELPRQDHLRLNEPLPDFRSISGEDGSMAQISSANKDLDLEAGQIDPSHLTSA 524
            ENQRL KE L R D L LN     + S SGE+  ++Q SS+++DLD E+G+   S  TSA
Sbjct: 666  ENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFESGRTVTSGETSA 725

Query: 525  GALQEIAFKSGTKVEFKQTLVSSTELQFFVEVLFAGQKIGEGIGRTRREAQHHAAEGSLF 704
            G LQ+IA K G KVEF+  LV+S +LQF +E  FAG+K+GEG+GRTRREAQ  AAE S+ 
Sbjct: 726  GVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFAGEKVGEGVGRTRREAQRQAAEESIK 785

Query: 705  YLADKYLSQLNPDSSNMPRDGISLGKLKDNYLSDDMSS---------REATPSRASASPR 857
             LA+ YLS++ PDS +   D   L  + DN    +++S            + S AS   R
Sbjct: 786  NLANTYLSRIKPDSGSAEGDLSRLHNINDNGFPSNVNSFGNQLLAKEESLSFSTASEQSR 845

Query: 858  IIGLRVEASKKPTNSIFALKELCMMEGLSVAYQTRPQFLAG--QKNEVYAEVEVDGQVFG 1031
            +   R+E SKK   S+ ALKELCMMEGL V +Q +P   +   QK+EVYA+VE+DGQV G
Sbjct: 846  LADPRLEGSKKSMGSVTALKELCMMEGLGVVFQPQPPSSSNALQKDEVYAQVEIDGQVLG 905

Query: 1032 KGIGLTWDEAKSKAAENALGALRPMLGQFPHKRQGSPRLMQEISSKRLKPEPSRILQRMP 1211
            KG GLTW+EAK +AAE ALG+LR MLGQ+  KRQGSPR +Q + +KRLKPE  R+LQRMP
Sbjct: 906  KGTGLTWEEAKMQAAEKALGSLRSMLGQYSQKRQGSPRSLQGMQNKRLKPEFPRVLQRMP 965

Query: 1212 SSTR 1223
            SS R
Sbjct: 966  SSGR 969


>ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Fragaria vesca subsp. vesca]
          Length = 955

 Score =  366 bits (940), Expect = 9e-99
 Identities = 211/422 (50%), Positives = 275/422 (65%), Gaps = 15/422 (3%)
 Frame = +3

Query: 3    LHSSPATEEGEIPQSELDPDTRRRLLILQHGQDVREQPQNETQFXXXXXXXXXXXXIHPQ 182
            LHSSPA EEGE+P+SELDPDTRRRLLILQHGQD RE   +E  F            +  +
Sbjct: 529  LHSSPAREEGEVPESELDPDTRRRLLILQHGQDTRESVPSEPSFPVRPQVQVSVPRVQSR 588

Query: 183  G-WFPVEEEMALRQLNRV---SPPLEFRAEALPVDNIRARHSTFVHEMQPSIPPGRV-RE 347
            G WFPVEEEM+ R+L+R+    PPL   +E + ++  R+ HS F  +++ S+P  R+ +E
Sbjct: 589  GGWFPVEEEMSPRKLSRMVPKEPPLN--SEPMQIEKHRSHHSAFFPKVENSMPSDRILQE 646

Query: 348  NQRLPKEELPRQDHLRLNEPLPDFRSISGEDGSMAQISSANKDLDLEAGQIDPSHLTSAG 527
            NQRLPKE   R + LR N+ +  + S SGE+  + + SS+N+D D E+G+   +  T AG
Sbjct: 647  NQRLPKEAFHRDNRLRFNQAMSGYHSFSGEEPPLNRSSSSNRDFDYESGRAISNAETPAG 706

Query: 528  ALQEIAFKSGTKVEFKQTLVSSTELQFFVEVLFAGQKIGEGIGRTRREAQHHAAEGSLFY 707
             LQEIA K GTKVEF+  LV STELQF+VE  FAG+KIGEG GRTRREA   AAEGSL  
Sbjct: 707  VLQEIAMKCGTKVEFRPALVPSTELQFYVEAWFAGEKIGEGTGRTRREAHFQAAEGSLKN 766

Query: 708  LADKYLSQLNPDSSNMPRDGISLGKLKDNYLSDDMSSREATP---------SRASASPRI 860
            LA+ Y+S+  PD+  +  D      + +N    +M+S    P         S +S   R 
Sbjct: 767  LANIYISRGKPDALPIHGDASKFSNVTNNGFMGNMNSFGTQPLPKEDSLSSSTSSEPSRP 826

Query: 861  IGLRVEASKKPTNSIFALKELCMMEGLSVAYQTR-PQFLAGQKNEVYAEVEVDGQVFGKG 1037
            +  R++ S+K  +S+ ALKELC MEGLSV YQ R P   + +K+EV+ + E+DG+V GKG
Sbjct: 827  LDPRLDNSRKSVSSVSALKELCTMEGLSVLYQPRPPPPNSTEKDEVHVQAEIDGEVLGKG 886

Query: 1038 IGLTWDEAKSKAAENALGALRPMLGQFPHKRQGSPRLMQEISSKRLKPEPSRILQRMPSS 1217
            IGLTWDEAK +AAE ALG LR  L  +  KRQGSPR +Q + SKRLK E  ++LQRMPSS
Sbjct: 887  IGLTWDEAKMQAAEKALGNLRSTL--YGQKRQGSPRPLQGMPSKRLKQEFPQVLQRMPSS 944

Query: 1218 TR 1223
            TR
Sbjct: 945  TR 946


>ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Populus trichocarpa]
            gi|550340277|gb|EEE85528.2| hypothetical protein
            POPTR_0004s04010g [Populus trichocarpa]
          Length = 996

 Score =  357 bits (916), Expect = 6e-96
 Identities = 202/420 (48%), Positives = 262/420 (62%), Gaps = 13/420 (3%)
 Frame = +3

Query: 3    LHSSPATEEGEIPQSELDPDTRRRLLILQHGQDVREQPQNETQFXXXXXXXXXXXXIHPQ 182
            L SSPA EEGE+P+SELDPDTRRRLLILQHG D R+   +E+ F            +   
Sbjct: 569  LQSSPAREEGEVPESELDPDTRRRLLILQHGHDSRDNAPSESPFPARPSTQVSAPRVQSV 628

Query: 183  G-WFPVEEEMALRQLNRVSPPLEFRAEALPVDNIRARHSTFVHEMQPSIPPGR-VRENQR 356
            G W PVEEEM+ RQLNR        ++ + ++  R  H +F H+++ +IP  R + ENQR
Sbjct: 629  GSWVPVEEEMSPRQLNRTPREFPLDSDPMNIEKHRTHHPSFFHKVESNIPSDRMIHENQR 688

Query: 357  LPKEELPRQDHLRLNEPLPDFRSISGEDGSMAQISSANKDLDLEAGQIDPSHLTSAGALQ 536
             PKE   R D ++LN    ++ S  GE+  +++ SS+N+DLDLE+ +   S  T    LQ
Sbjct: 689  QPKEATYRDDRMKLNHSTSNYPSFQGEESPLSR-SSSNRDLDLESERAFSSTETPVEVLQ 747

Query: 537  EIAFKSGTKVEFKQTLVSSTELQFFVEVLFAGQKIGEGIGRTRREAQHHAAEGSLFYLAD 716
            EIA K GTKVEF+  L+++++LQF +E  F G+K+GEG G+TRREAQ  AAEGS+  LA 
Sbjct: 748  EIAMKCGTKVEFRPALIATSDLQFSIETWFVGEKVGEGTGKTRREAQRQAAEGSIKKLAG 807

Query: 717  KYLSQLNPDSSNMPRDGISLGKLKDNYLSDDMSS---------REATPSRASASPRIIGL 869
             Y+S++ PDS  M  D        DN    DM+S            T S  S   R++  
Sbjct: 808  IYMSRVKPDSGPMLGDSSRYPSANDNGFLGDMNSFGNQPLLKDENITYSATSEPSRLLDQ 867

Query: 870  RVEASKKPTNSIFALKELCMMEGLSVAY--QTRPQFLAGQKNEVYAEVEVDGQVFGKGIG 1043
            R+E SKK   S+ ALKE CM EGL V +  QT     +    EV+A+VE+DGQV GKGIG
Sbjct: 868  RLEGSKKSMGSVTALKEFCMTEGLGVNFLAQTPLSTNSIPGEEVHAQVEIDGQVLGKGIG 927

Query: 1044 LTWDEAKSKAAENALGALRPMLGQFPHKRQGSPRLMQEISSKRLKPEPSRILQRMPSSTR 1223
            LTWDEAK +AAE ALG+LR M GQ+  KRQGSPRLMQ + +KRLK E  R+LQRMPSS R
Sbjct: 928  LTWDEAKMQAAEKALGSLRTMFGQYTPKRQGSPRLMQGMPNKRLKQEFPRVLQRMPSSAR 987


>ref|XP_002519032.1| double-stranded RNA binding protein, putative [Ricinus communis]
            gi|223541695|gb|EEF43243.1| double-stranded RNA binding
            protein, putative [Ricinus communis]
          Length = 978

 Score =  345 bits (886), Expect = 2e-92
 Identities = 201/423 (47%), Positives = 268/423 (63%), Gaps = 16/423 (3%)
 Frame = +3

Query: 3    LHSSPATEEGEIPQSELDPDTRRRLLILQHGQDVREQPQNETQFXXXXXXXXXXXXIHPQ 182
            L SSPA EEGE+P+SELDPDTRRRLLILQHGQD+R+   +E+ F               Q
Sbjct: 547  LQSSPAREEGEVPESELDPDTRRRLLILQHGQDLRDPAPSESPFPVRPSNSMQVSVPRVQ 606

Query: 183  G---WFPVEEEMALRQLNR-VSPPLEFRAEALPVDNIRARHSTFVHEMQPSIPPGRV-RE 347
                W PVEEEM+ RQLNR V+       E + +D  R  H +F  +++ SIP  R+  E
Sbjct: 607  SRGNWVPVEEEMSPRQLNRAVTREFPMDTEPMHIDKHRPHHPSFFPKVESSIPSERMPHE 666

Query: 348  NQRLPKEELPRQDHLRLNEPLPDFRSISGEDGSMAQISSANKDLDLEAGQIDPSHLTSAG 527
            NQRLPK    + D LRLN+ + +++S+SGE+ S+++ SS+N+DLD+E+ +   S  T   
Sbjct: 667  NQRLPKVAPYKDDRLRLNQTMSNYQSLSGEENSLSRSSSSNRDLDVESDRAVSSAETPVR 726

Query: 528  ALQEIAFKSGTKVEFKQTLVSSTELQFFVEVLFAGQKIGEGIGRTRREAQHHAAEGSLFY 707
             L EI+ K G KVEFK +LV+S +LQF VE  FAG+++GEG GRTRREAQ  AAE S+  
Sbjct: 727  VLHEISMKCGAKVEFKHSLVNSRDLQFSVEAWFAGERVGEGFGRTRREAQSVAAEASIKN 786

Query: 708  LADKYLSQLNPDSSNMPRDGISLGKLKDNYLSDDMSSREATP---------SRASASPRI 860
            LA+ Y+S+  PD+  +  D        DN     ++S  + P         S +S    +
Sbjct: 787  LANIYISRAKPDNGALHGDASKYSSANDNGFLGHVNSFGSQPLPKDEILSYSDSSEQSGL 846

Query: 861  IGLRVEASKKPTNSIFALKELCMMEGLSVAY--QTRPQFLAGQKNEVYAEVEVDGQVFGK 1034
            +  R+E+SKK  +S+ ALKE CMMEGL V +  QT     + Q  EV+A+VE+DGQV GK
Sbjct: 847  LDPRLESSKKSMSSVNALKEFCMMEGLGVNFLAQTPLSSNSVQNAEVHAQVEIDGQVMGK 906

Query: 1035 GIGLTWDEAKSKAAENALGALRPMLGQFPHKRQGSPRLMQEISSKRLKPEPSRILQRMPS 1214
            GIG T+DEAK +AAE ALG+LR   G+FP KRQGSPR +  + +K LKPE  R+LQRMPS
Sbjct: 907  GIGSTFDEAKMQAAEKALGSLRTTFGRFPPKRQGSPRPVPGMPNKHLKPEFPRVLQRMPS 966

Query: 1215 STR 1223
            S R
Sbjct: 967  SAR 969


>ref|XP_007025681.1| C-terminal domain phosphatase-like 1 isoform 2 [Theobroma cacao]
            gi|508781047|gb|EOY28303.1| C-terminal domain
            phosphatase-like 1 isoform 2 [Theobroma cacao]
          Length = 984

 Score =  345 bits (885), Expect = 2e-92
 Identities = 200/398 (50%), Positives = 254/398 (63%), Gaps = 17/398 (4%)
 Frame = +3

Query: 3    LHSSPATEEGEIPQSELDPDTRRRLLILQHGQDVREQPQNETQFXXXXXXXXXXXXIHPQ 182
            L SSPA EEGE+P+SELDPDTRRRLLILQHGQD R+    E  F              P+
Sbjct: 550  LQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTPPEPAFPPVRPTMQVSV---PR 606

Query: 183  G-----WFPVEEEMALRQLNRVSPPLEFRAEALPVDNIRARHSTFVHEMQPSIPPGRV-R 344
            G     WF  EEEM+ RQLNR +P  EF  ++  +   + RH  F  +++ SIP  R+ R
Sbjct: 607  GQSRGSWFAAEEEMSPRQLNRAAPK-EFPLDSERMHIEKHRHPPFFPKVESSIPSDRLLR 665

Query: 345  ENQRLPKEELPRQDHLRLNEPLPDFRSISGEDGSMAQISSANKDLDLEAGQIDPSHLTSA 524
            ENQRL KE L R D L LN     + S SGE+  ++Q SS+++DLD E+G+   S  TSA
Sbjct: 666  ENQRLSKEALHRDDRLGLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFESGRTVTSGETSA 725

Query: 525  GALQEIAFKSGTKVEFKQTLVSSTELQFFVEVLFAGQKIGEGIGRTRREAQHHAAEGSLF 704
            G LQ+IA K G KVEF+  LV+S +LQF +E  FAG+K+GEG+GRTRREAQ  AAE S+ 
Sbjct: 726  GVLQDIAMKCGAKVEFRPALVASLDLQFSIEAWFAGEKVGEGVGRTRREAQRQAAEESIK 785

Query: 705  YLADKYLSQLNPDSSNMPRDGISLGKLKDNYLSDDMSS---------REATPSRASASPR 857
             LA+ YLS++ PDS +   D   L  + DN    +++S            + S AS   R
Sbjct: 786  NLANTYLSRIKPDSGSAEGDLSRLHNINDNGFPSNVNSFGNQLLAKEESLSFSTASEQSR 845

Query: 858  IIGLRVEASKKPTNSIFALKELCMMEGLSVAYQTRPQFLAG--QKNEVYAEVEVDGQVFG 1031
            +   R+E SKK   S+ ALKELCMMEGL V +Q +P   +   QK+EVYA+VE+DGQV G
Sbjct: 846  LADPRLEGSKKSMGSVTALKELCMMEGLGVVFQPQPPSSSNALQKDEVYAQVEIDGQVLG 905

Query: 1032 KGIGLTWDEAKSKAAENALGALRPMLGQFPHKRQGSPR 1145
            KG GLTW+EAK +AAE ALG+LR MLGQ+  KRQGSPR
Sbjct: 906  KGTGLTWEEAKMQAAEKALGSLRSMLGQYSQKRQGSPR 943


>ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Populus trichocarpa]
            gi|550327613|gb|ERP55122.1| hypothetical protein
            POPTR_0011s04910g [Populus trichocarpa]
          Length = 990

 Score =  342 bits (878), Expect = 1e-91
 Identities = 198/420 (47%), Positives = 256/420 (60%), Gaps = 13/420 (3%)
 Frame = +3

Query: 3    LHSSPATEEGEIPQSELDPDTRRRLLILQHGQDVREQPQNETQFXXXXXXXXXXXXIHPQ 182
            L SSPA EEGE+P+SELDPDTRRRLLILQHGQD R+   +E+ F            +  +
Sbjct: 563  LQSSPAREEGEVPESELDPDTRRRLLILQHGQDSRDNAPSESPFPARPSAPVSAAHVQSR 622

Query: 183  G-WFPVEEEMALRQLNRVSPPLEFRAEALPVDNIRARHSTFVHEMQPSIPPGR-VRENQR 356
            G W PVEEEM  RQLNR        ++ + ++  +  H +F  +++ +IP  R + ENQR
Sbjct: 623  GSWVPVEEEMTPRQLNRTPREFPLDSDPMNIEKHQTHHPSFFPKVESNIPSDRMIHENQR 682

Query: 357  LPKEELPRQDHLRLNEPLPDFRSISGEDGSMAQISSANKDLDLEAGQIDPSHLTSAGALQ 536
            LPKE   R D +RLN   P++ S   E+  +++ SS+N+DLDLE+ +      T    LQ
Sbjct: 683  LPKEAPYRNDRMRLNHSTPNYHSFQVEETPLSR-SSSNRDLDLESERAFTISETPVEVLQ 741

Query: 537  EIAFKSGTKVEFKQTLVSSTELQFFVEVLFAGQKIGEGIGRTRREAQHHAAEGSLFYLAD 716
            EIA K  TKVEF+  LV+S +LQF +E  FAG+K+GEG G+TRREAQ  AAEGS+  LA 
Sbjct: 742  EIAMKCETKVEFRPALVASIDLQFSIEAWFAGEKVGEGTGKTRREAQRQAAEGSIKKLAG 801

Query: 717  KYLSQLNPDSSNMPRDGISLGKLKDNYLSDDMSSREATP---------SRASASPRIIGL 869
             Y+ +  PDS  M  D        DN    +M+     P         S AS   R++  
Sbjct: 802  IYMLRAKPDSGPMHGDSSRYPSANDNGFLGNMNLFGNQPLPKDELVAYSAASEPSRLLDP 861

Query: 870  RVEASKKPTNSIFALKELCMMEGLSVAYQTRPQFLAGQ--KNEVYAEVEVDGQVFGKGIG 1043
            R+E SKK + S+ ALKE C MEGL V +  +    A      EV+A+VE+DGQV GKGIG
Sbjct: 862  RLEGSKKSSGSVTALKEFCTMEGLVVNFLAQTPLSANSIPGEEVHAQVEIDGQVLGKGIG 921

Query: 1044 LTWDEAKSKAAENALGALRPMLGQFPHKRQGSPRLMQEISSKRLKPEPSRILQRMPSSTR 1223
             TWDEAK +AAE ALG+LR M GQ+  KRQGSPR MQ + +KRLK E  R+LQRMP S R
Sbjct: 922  STWDEAKMQAAEKALGSLRTMFGQYTQKRQGSPRPMQGMPNKRLKQEFPRVLQRMPPSAR 981


>ref|XP_007159305.1| hypothetical protein PHAVU_002G226900g [Phaseolus vulgaris]
            gi|561032720|gb|ESW31299.1| hypothetical protein
            PHAVU_002G226900g [Phaseolus vulgaris]
          Length = 964

 Score =  341 bits (874), Expect = 4e-91
 Identities = 203/424 (47%), Positives = 266/424 (62%), Gaps = 17/424 (4%)
 Frame = +3

Query: 3    LHSSPATEEGEIPQSELDPDTRRRLLILQHGQDVREQPQNETQFXXXXXXXXXXXXIHPQ 182
            LHSSPA EEGE+P+SELDPDTRRRLLILQHGQD R+   NE  +            +  +
Sbjct: 533  LHSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHTSNEPTYAIRHPVPVSAPRVSSR 592

Query: 183  G-WFPVEEEMALRQLNRVSPPLEFRAEA--LPVDNIRARHSTFVHEMQPSIPPGRVREN- 350
            G WFP EE++  + LNRV P  EF  ++  L ++  R  H +F  +++ SI   R+  + 
Sbjct: 593  GGWFPAEEDIGSQPLNRVVPK-EFSVDSGSLVIEKHRPHHPSFFSKVESSISSDRILHDS 651

Query: 351  -QRLPKEELPRQDHLRLNEPLPDFRSISGEDGSMAQISSANKDLDLEAGQIDPSHLTSAG 527
             QRLPKE   R D  R N  L  +RS+S ++   ++ SS+++DLD E+        T   
Sbjct: 652  HQRLPKEMYHRDDRPRSNHMLSSYRSLSVDEIPFSRSSSSHRDLDSESSHSVFHADTPVV 711

Query: 528  ALQEIAFKSGTKVEFKQTLVSSTELQFFVEVLFAGQKIGEGIGRTRREAQHHAAEGSLFY 707
             LQEIA K GTKVEF  +LV+STELQF +E  F+G+KIG G GRTR+EAQH AAE S+ +
Sbjct: 712  VLQEIALKCGTKVEFMSSLVASTELQFSIEAWFSGKKIGHGFGRTRKEAQHKAAEDSIKH 771

Query: 708  LADKYLSQLNPDSSNMPRDGISLGKLKDN---YLSDDMSSR------EATPSRASASPRI 860
            LAD YLS    +  +   D        DN    ++  +S++       A+ S AS   R+
Sbjct: 772  LADIYLSSAKDEPGSTYGDVGGFPNANDNGYMVIASSLSNQPLPKEDSASFSTASDPSRV 831

Query: 861  IGLRVEASKKPTNSIFALKELCMMEGLSVAYQTRPQFLAG---QKNEVYAEVEVDGQVFG 1031
            +  R+E SK+P  SI ALKELCMMEGL V + + P  ++    QK+EV+A+VE+DG+VFG
Sbjct: 832  LDPRLEVSKRPMGSISALKELCMMEGLGVNFLSAPAPVSTNSLQKDEVHAQVEIDGKVFG 891

Query: 1032 KGIGLTWDEAKSKAAENALGALRPMLGQFPHKRQGSPRLMQEISSKRLKPEPSRILQRMP 1211
            KGIGLTWDEAK +AAE ALG+LR  LGQ   KRQ SPR  Q  S+KRLK E  R +QR+P
Sbjct: 892  KGIGLTWDEAKMQAAEKALGSLRSKLGQSIQKRQSSPRSHQGFSNKRLKQEYPRAMQRIP 951

Query: 1212 SSTR 1223
            SSTR
Sbjct: 952  SSTR 955


>ref|XP_007214548.1| hypothetical protein PRUPE_ppa000988mg [Prunus persica]
            gi|462410413|gb|EMJ15747.1| hypothetical protein
            PRUPE_ppa000988mg [Prunus persica]
          Length = 940

 Score =  338 bits (867), Expect = 3e-90
 Identities = 200/421 (47%), Positives = 264/421 (62%), Gaps = 14/421 (3%)
 Frame = +3

Query: 3    LHSSPATEEGEIPQSELDPDTRRRLLILQHGQDVREQPQNETQFXXXXXXXXXXXXIHPQ 182
            L SSPA EEGE+P+SELDPDTRRRLLILQHGQD R+QP +E  F               +
Sbjct: 531  LQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDQPPSEPPFPVRPPMQASVPRAQSR 590

Query: 183  -GWFPVEEEMALRQLNRVSP-PLEFRAEALPVDNIRARHSTFVHEMQPSIPPGRV-RENQ 353
             GWFPVEEEM+ RQL+R+ P  L    E + ++  R  HS+F  +++ SIP  R+ +ENQ
Sbjct: 591  PGWFPVEEEMSPRQLSRMVPKDLPLDPETVQIEKHRPHHSSFFPKVENSIPSDRILQENQ 650

Query: 354  RLPKEELPRQDHLRLNEPLPDFRSISGEDGSMAQISSANKDLDLEAGQIDPSHLTSAGAL 533
            RLPKE   R D LR N  L  + S+SGE+  +++ SS+N+D+D E+G+   +  T AG L
Sbjct: 651  RLPKEAFHRDDRLRFNHALSGYHSLSGEEIPLSRSSSSNRDVDFESGRAISNAETPAGVL 710

Query: 534  QEIAFKSGTKVEFKQTLVSSTELQFFVEVLFAGQKIGEGIGRTRREAQHHAAEGSLFYLA 713
            QEIA K G K                    FAG+KIGEG G+TRREA + AAEGSL  LA
Sbjct: 711  QEIAMKCGAK------------------AWFAGEKIGEGSGKTRREAHYQAAEGSLKNLA 752

Query: 714  DKYLSQLNPDSSNMPRDGISLGKLKDNYLSDDMSS--------REATPSRASASP-RIIG 866
            + YLS++ PDS ++  D      +  N  + +++S         E+  S  S+ P R + 
Sbjct: 753  NIYLSRVKPDSVSVHGDMNKFPNVNSNGFAGNLNSFGIQPFPKEESLSSSTSSEPSRPLD 812

Query: 867  LRVEASKKPTNSIFALKELCMMEGLSVAYQTR--PQFLAGQKNEVYAEVEVDGQVFGKGI 1040
             R+E SKK  +S+  LKELCMMEGL V +Q R  P   + +K+EV+ +VE+DG+V GKGI
Sbjct: 813  PRLEGSKKSMSSVSTLKELCMMEGLGVVFQPRPPPSTNSVEKDEVHVQVEIDGEVLGKGI 872

Query: 1041 GLTWDEAKSKAAENALGALRPMLGQFPHKRQGSPRLMQEISSKRLKPEPSRILQRMPSST 1220
            GLTWDEAK +AAE ALG+L   L  +  KRQGSPR +Q +SSKR+K E  ++LQRMPSS 
Sbjct: 873  GLTWDEAKMQAAEKALGSLTSTL--YAQKRQGSPRSLQGMSSKRMKQEFPQVLQRMPSSA 930

Query: 1221 R 1223
            R
Sbjct: 931  R 931


>ref|XP_003542763.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Glycine max]
          Length = 960

 Score =  335 bits (858), Expect = 3e-89
 Identities = 202/424 (47%), Positives = 263/424 (62%), Gaps = 17/424 (4%)
 Frame = +3

Query: 3    LHSSPATEEGEIPQSELDPDTRRRLLILQHGQDVREQPQNETQFXXXXXXXXXXXXIHPQ 182
            LHSSPA EEGE+P+SELDPDTRRRLLILQHGQD R+    E  F            +   
Sbjct: 529  LHSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQASAPRVPSS 588

Query: 183  G--WFPVEEEMALRQLNRVSPPLEFRAEALP--VDNIRARHSTFVHEMQPSIPPGRVREN 350
               WFPVEEE+  + LNRV P  EF  ++ P  ++  R  H +F ++++ SI   R+  +
Sbjct: 589  RGVWFPVEEEIGSQPLNRVVPK-EFPVDSGPLGIEKPRLHHPSFFNKVESSISSDRILHD 647

Query: 351  --QRLPKEELPRQDHLRLNEPLPDFRSISGEDGSMAQISSANKDLDLEAGQIDPSHLTSA 524
              QRLPKE   R D  RLN  L  +RS SG+D   ++ SS+++DLD E+G       T  
Sbjct: 648  SHQRLPKEMYHRDDRPRLNHMLSSYRSFSGDDIPFSRSSSSHRDLDSESGHSVLHADTPV 707

Query: 525  GALQEIAFKSGTKVEFKQTLVSSTELQFFVEVLFAGQKIGEGIGRTRREAQHHAAEGSLF 704
              L EIA K GTKV+F  +LV+STEL+F +E  F+G+KIG G GRTR+EAQ+ AA+ S+ 
Sbjct: 708  AVLHEIALKCGTKVDFMSSLVASTELKFSLEAWFSGKKIGHGFGRTRKEAQNKAAKDSIE 767

Query: 705  YLADKYLSQLNPDSSNMPRDGISLGKLKDN-------YLSDDMSSREATPSRASASP-RI 860
            +LAD YLS    +  +   D      + DN        L +   S+E + S +SASP R 
Sbjct: 768  HLADIYLSSAKDEPGSTYGDVSGFPNVNDNGYMGIASSLGNQPLSKEDSASFSSASPSRA 827

Query: 861  IGLRVEASKKPTNSIFALKELCMMEGLSVAYQTRPQFLAG---QKNEVYAEVEVDGQVFG 1031
            +  R++ SK+   SI ALKELCMMEGL V + + P  ++    QK+EV+A+VE+DG++FG
Sbjct: 828  LDPRLDVSKRSMGSISALKELCMMEGLGVNFLSTPAPVSTNSVQKDEVHAQVEIDGKIFG 887

Query: 1032 KGIGLTWDEAKSKAAENALGALRPMLGQFPHKRQGSPRLMQEISSKRLKPEPSRILQRMP 1211
            KGIGLTWDEAK +AAE ALG LR  LGQ   K Q SPR  Q  S+KRLK E  R +QRMP
Sbjct: 888  KGIGLTWDEAKMQAAEKALGNLRSKLGQSIQKMQSSPRPHQGFSNKRLKQEYPRTMQRMP 947

Query: 1212 SSTR 1223
            SS R
Sbjct: 948  SSAR 951


>ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
          Length = 956

 Score =  331 bits (849), Expect = 3e-88
 Identities = 201/424 (47%), Positives = 264/424 (62%), Gaps = 17/424 (4%)
 Frame = +3

Query: 3    LHSSPATEEGEIPQSELDPDTRRRLLILQHGQDVREQPQNETQFXXXXXXXXXXXXIHPQ 182
            LHSSPA EEGE+P+SELDPDTRRRLLILQHGQD R+    E  F            +   
Sbjct: 525  LHSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDHASAEPPFPVRHPVQTSAPHVPSS 584

Query: 183  G--WFPVEEEMALRQLNRVSPPLEFRAEALPVDNIRAR--HSTFVHEMQPSIPPGRVREN 350
               WFP EEE+  + LNRV P  EF  ++ P+   + R  H +F  +++ SI   R+  +
Sbjct: 585  RGVWFPAEEEIGSQPLNRVVPK-EFPVDSGPLGIAKPRPHHPSFFSKVESSISSDRILHD 643

Query: 351  --QRLPKEELPRQDHLRLNEPLPDFRSISGEDGSMAQISSANKDLDLEAGQIDPSHLTSA 524
              QRLPKE   R D  RLN  L  +RS SG+D   ++  S+++DLD E+G       T  
Sbjct: 644  SHQRLPKEMYHRDDRPRLNHMLSSYRSFSGDDIPFSRSFSSHRDLDSESGHSVLHADTPV 703

Query: 525  GALQEIAFKSGTKVEFKQTLVSSTELQFFVEVLFAGQKIGEGIGRTRREAQHHAAEGSLF 704
              LQEIA K GTKV+F  +LV+STELQF +E  F+G+KIG  +GRTR+EAQ+ AAE S+ 
Sbjct: 704  AVLQEIALKCGTKVDFISSLVASTELQFSMEAWFSGKKIGHRVGRTRKEAQNKAAEDSIK 763

Query: 705  YLADKYLSQLNPDSSNMPRDGISLGKLKD-------NYLSDDMSSREATPSRASASP-RI 860
            +LAD YLS    +  +   D      + D       + L +   S+E + S ++ASP R+
Sbjct: 764  HLADIYLSSAKDEPGSTYGDVSGFPNVNDSGYMGIASSLGNQPLSKEDSASFSTASPSRV 823

Query: 861  IGLRVEASKKPTNSIFALKELCMMEGLSVAYQTRPQFLAG---QKNEVYAEVEVDGQVFG 1031
            +  R++ SK+   SI +LKELCMMEGL V + + P  ++    QK+EV+A+VE+DG+VFG
Sbjct: 824  LDPRLDVSKRSMGSISSLKELCMMEGLDVNFLSAPAPVSTNSVQKDEVHAQVEIDGKVFG 883

Query: 1032 KGIGLTWDEAKSKAAENALGALRPMLGQFPHKRQGSPRLMQEISSKRLKPEPSRILQRMP 1211
            KGIGLTWDEAK +AAE ALG+LR  LGQ   KRQ SPR  Q  S+KRLK E  R +QRMP
Sbjct: 884  KGIGLTWDEAKMQAAEKALGSLRSKLGQSIQKRQSSPRPHQGFSNKRLKQEYPRPMQRMP 943

Query: 1212 SSTR 1223
            SS R
Sbjct: 944  SSAR 947


>ref|XP_003543063.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
            gi|571500215|ref|XP_006594604.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 1-like
            isoform X2 [Glycine max]
          Length = 960

 Score =  329 bits (844), Expect = 1e-87
 Identities = 202/426 (47%), Positives = 262/426 (61%), Gaps = 19/426 (4%)
 Frame = +3

Query: 3    LHSSPATEEGEIPQSELDPDTRRRLLILQHGQDVREQPQNETQFXXXXXXXXXXXXIHP- 179
            LHSSPA EEGE+P+SELD DTRRR LILQHGQD RE+  +E  F                
Sbjct: 527  LHSSPAREEGELPESELDLDTRRRFLILQHGQDTRERMASEPPFPVRHPAQVSAPASSVP 586

Query: 180  --QGWFPVEEEMALRQLNRVSPPLEFRAEALP--VDNIRARHSTFVHEMQPSIPPGRV-- 341
              +GWF VEEEM  +QLN +  P EF  ++ P  ++    RH +F  ++  SI   RV  
Sbjct: 587  SRRGWFSVEEEMGPQQLN-LPVPKEFPVDSEPFHIEKRWPRHPSFFSKVGDSISSDRVFH 645

Query: 342  RENQRLPKEELPRQDHLRLNEPLPDFRSISGEDGSMAQISSANKDLDLEAGQIDPSHLTS 521
              +QRLPKE   R D  RL++ L  + S+ G+D  ++  S +N+D D E+G+      T+
Sbjct: 646  ESHQRLPKEVHHRDDRSRLSQSLSSYHSLPGDDIPLSGSSYSNRDFDSESGRSLFHADTT 705

Query: 522  AGALQEIAFKSGTKVEFKQTLVSSTELQFFVEVLFAGQKIGEGIGRTRREAQHHAAEGSL 701
            AG LQEIA   GTKVEF  +LV+STELQF +E  FAG+KIGEG GRTRREAQ  AA  S+
Sbjct: 706  AGVLQEIALNCGTKVEFLSSLVASTELQFSIEAWFAGKKIGEGFGRTRREAQSKAAGCSI 765

Query: 702  FYLADKYLSQLNPDSSNMPRDGISL-GKLKDNYLSDDMS--------SREATPSRASASP 854
              LAD Y+S    DS +   D     G   D ++S   S            + S AS S 
Sbjct: 766  KQLADIYMSHAKDDSGSTYGDVSGFHGSNNDGFVSSGNSLGNQLLPKEESGSFSTASESS 825

Query: 855  RIIGLRVEASKKPTNSIFALKELCMMEGLSVAYQTRPQFLAG---QKNEVYAEVEVDGQV 1025
            R+   R+E SK+ T+SI ALKELCMMEGL+ ++Q+ P   +    QK+EV+A+VE+DGQ+
Sbjct: 826  RVSDSRLEVSKRSTDSISALKELCMMEGLAASFQSPPASASTHLTQKDEVHAQVEIDGQI 885

Query: 1026 FGKGIGLTWDEAKSKAAENALGALRPMLGQFPHKRQGSPRLMQEISSKRLKPEPSRILQR 1205
            FGKG G+TW+EAK +AA+ ALG+LR M  Q   KR GSPR MQ +++KRLKPE    LQR
Sbjct: 886  FGKGFGVTWEEAKMQAAKKALGSLRTMFNQGSLKRHGSPRSMQGLANKRLKPEYPPTLQR 945

Query: 1206 MPSSTR 1223
            +P S R
Sbjct: 946  VPYSAR 951


>ref|XP_003545893.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
          Length = 958

 Score =  327 bits (838), Expect = 6e-87
 Identities = 198/423 (46%), Positives = 261/423 (61%), Gaps = 16/423 (3%)
 Frame = +3

Query: 3    LHSSPATEEGEIPQSELDPDTRRRLLILQHGQDVREQPQNETQFXXXXXXXXXXXXIHPQ 182
            LHSSPA EEGE+P+SELD DTRRRLLILQHGQD RE   +E               +  +
Sbjct: 528  LHSSPAREEGEVPESELDLDTRRRLLILQHGQDTREHTSSEPPLPVRHPTQVSAPSVPSR 587

Query: 183  -GWFPVEEEMALRQLNRVSPPLEFR--AEALPVDNIRARHSTFVHEMQPSIPPGRV--RE 347
             GWF VEEEM  +QLN++ P  EF   +E L ++    RH +   ++  S+   RV    
Sbjct: 588  RGWFSVEEEMGPQQLNQLVPK-EFPVGSEPLHIEKRWPRHPSLFSKVDDSVSSDRVFHES 646

Query: 348  NQRLPKEELPRQDHLRLNEPLPDFRSISGEDGSMAQISSANKDLDLEAGQIDPSHLTSAG 527
            +QRLPKE   R DH RL++ L  + S  G+D  ++  S +N+D D E+G+       +AG
Sbjct: 647  HQRLPKEVHHRDDHSRLSQSLSSYHSFPGDDIPLSGSSYSNRDFDSESGRSLFHADITAG 706

Query: 528  ALQEIAFKSGTKVEFKQTLVSSTELQFFVEVLFAGQKIGEGIGRTRREAQHHAAEGSLFY 707
             LQEIA K GTKVEF  +LV+ST LQF +E  FAG+K+GEG GRTRREAQ+ AAE S+  
Sbjct: 707  VLQEIALKCGTKVEFLSSLVASTALQFSIEAWFAGKKVGEGFGRTRREAQNKAAECSIKQ 766

Query: 708  LADKYLSQLNPDSSNMPRDGISLGKLKDN-------YLSDDMSSREATP-SRASASPRII 863
            LAD Y+S    DS +   D        +N        L + +  +E+   S +S S R+ 
Sbjct: 767  LADIYMSHAKDDSGSTYGDVSGFHGSNNNGFVSSGNSLGNQLLPKESVSFSTSSDSSRVS 826

Query: 864  GLRVEASKKPTNSIFALKELCMMEGLSVAYQTRPQFLA---GQKNEVYAEVEVDGQVFGK 1034
              R+E SK+ T+SI ALKE CMMEGL+  +Q+ P   +    QK+EV+A+VE+DGQ+FGK
Sbjct: 827  DPRLEVSKRSTDSISALKEFCMMEGLAANFQSSPAPASTHFAQKDEVHAQVEIDGQIFGK 886

Query: 1035 GIGLTWDEAKSKAAENALGALRPMLGQFPHKRQGSPRLMQEISSKRLKPEPSRILQRMPS 1214
            G GLTW+EAK +AA+ AL +LR M  Q   KR GSPR MQ +++KRLK E  R LQR+P 
Sbjct: 887  GFGLTWEEAKMQAAKKALESLRTMFNQGTRKRHGSPRSMQGLANKRLKQEYPRTLQRIPY 946

Query: 1215 STR 1223
            S R
Sbjct: 947  SAR 949


>ref|XP_006597421.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X3 [Glycine max]
          Length = 932

 Score =  320 bits (821), Expect = 6e-85
 Identities = 194/416 (46%), Positives = 253/416 (60%), Gaps = 9/416 (2%)
 Frame = +3

Query: 3    LHSSPATEEGEIPQSELDPDTRRRLLILQHGQDVREQPQNETQFXXXXXXXXXXXXIHPQ 182
            LHSSPA EEGE+P+SELD DTRRRLLILQHGQD RE   +E               +  +
Sbjct: 528  LHSSPAREEGEVPESELDLDTRRRLLILQHGQDTREHTSSEPPLPVRHPTQVSAPSVPSR 587

Query: 183  -GWFPVEEEMALRQLNRVSPPLEFR--AEALPVDNIRARHSTFVHEMQPSIPPGRV--RE 347
             GWF VEEEM  +QLN++ P  EF   +E L ++    RH +   ++  S+   RV    
Sbjct: 588  RGWFSVEEEMGPQQLNQLVPK-EFPVGSEPLHIEKRWPRHPSLFSKVDDSVSSDRVFHES 646

Query: 348  NQRLPKEELPRQDHLRLNEPLPDFRSISGEDGSMAQISSANKDLDLEAGQIDPSHLTSAG 527
            +QRLPKE   R DH RL++ L  + S  G+D  ++  S +N+D D E+G+       +AG
Sbjct: 647  HQRLPKEVHHRDDHSRLSQSLSSYHSFPGDDIPLSGSSYSNRDFDSESGRSLFHADITAG 706

Query: 528  ALQEIAFKSGTKVEFKQTLVSSTELQFFVEVLFAGQKIGEGIGRTRREAQHHAAEGSLFY 707
             LQEIA K GTKVEF  +LV+ST LQF +E  FAG+K+GEG GRTRREAQ+ AAE S+  
Sbjct: 707  VLQEIALKCGTKVEFLSSLVASTALQFSIEAWFAGKKVGEGFGRTRREAQNKAAECSIKQ 766

Query: 708  LADKYLSQLNPDSSNMPRDGISL-GKLKDNYLSDDMSSREATPSRASASPRIIGLRVEAS 884
            LAD Y+S    DS +   D     G   + ++S D                    R+E S
Sbjct: 767  LADIYMSHAKDDSGSTYGDVSGFHGSNNNGFVSSDP-------------------RLEVS 807

Query: 885  KKPTNSIFALKELCMMEGLSVAYQTRPQFLA---GQKNEVYAEVEVDGQVFGKGIGLTWD 1055
            K+ T+SI ALKE CMMEGL+  +Q+ P   +    QK+EV+A+VE+DGQ+FGKG GLTW+
Sbjct: 808  KRSTDSISALKEFCMMEGLAANFQSSPAPASTHFAQKDEVHAQVEIDGQIFGKGFGLTWE 867

Query: 1056 EAKSKAAENALGALRPMLGQFPHKRQGSPRLMQEISSKRLKPEPSRILQRMPSSTR 1223
            EAK +AA+ AL +LR M  Q   KR GSPR MQ +++KRLK E  R LQR+P S R
Sbjct: 868  EAKMQAAKKALESLRTMFNQGTRKRHGSPRSMQGLANKRLKQEYPRTLQRIPYSAR 923


>dbj|BAF01152.1| hypothetical protein [Arabidopsis thaliana]
          Length = 967

 Score =  320 bits (821), Expect = 6e-85
 Identities = 194/418 (46%), Positives = 256/418 (61%), Gaps = 11/418 (2%)
 Frame = +3

Query: 3    LHSSPATEEGEIPQSELDPDTRRRLLILQHGQDVREQPQNETQFXXXXXXXXXXXXIHPQ 182
            L SSPA EEGE+P+SELDPDTRRRLLILQHGQD R+   +E  F            +  +
Sbjct: 552  LQSSPAREEGEVPESELDPDTRRRLLILQHGQDTRDPAPSEPSFPQRPPVQAPPSHVQSR 611

Query: 183  -GWFPVEEEMALRQLNR-VSPPLEFRAEALPVDNIRARHSTFVHEMQPSIPPGRV-RENQ 353
             GWFPVEEEM   Q+ R VS      +E + ++  R RH +F  ++  S    R+  EN+
Sbjct: 612  NGWFPVEEEMDPAQIRRAVSKEYPLDSEMIHMEKHRPRHPSFFSKIDNSTQSDRMLHENR 671

Query: 354  RLPKEELPRQDHLRLNEPLPDFRSISGEDGSMAQISSANKDLDLEAGQIDPSHLTSAGAL 533
            R PKE L R + LR N  LPD     GED S  Q SS N DLD    +   +  TSA  L
Sbjct: 672  RPPKESLRRDEQLRSNNNLPDSHPFYGEDASWNQSSSRNSDLDFLPERSVSATETSADVL 731

Query: 534  QEIAFKSGTKVEFKQTLVSSTELQFFVEVLFAGQKIGEGIGRTRREAQHHAAEGSLFYLA 713
              IA K G KVE+K +LVSST+L+F VE   + QKIGEGIG++RREA H AAE S+  LA
Sbjct: 732  HGIAIKCGAKVEYKPSLVSSTDLRFSVEAWLSNQKIGEGIGKSRREALHKAAEASIQNLA 791

Query: 714  DKYL------SQLNPDSSNMPRDGISLGKLKDNYLSDDMSSREATPSRASASPRIIGLRV 875
            D Y+         + D++    + IS+G    N L++   +R+ T    S+ P     R+
Sbjct: 792  DGYMRANGDPGPSHRDATPFTNENISMGNA--NALNNQPFARDETALPVSSRP--TDPRL 847

Query: 876  EASKKPTNSIFALKELCMMEGLSVAYQTRPQFLAG--QKNEVYAEVEVDGQVFGKGIGLT 1049
            E S + T SI AL+ELC  EGL +A+Q++ Q  +    ++E++A+VE+DG+V G+G+G T
Sbjct: 848  EGSMRHTGSITALRELCASEGLEMAFQSQRQLPSDMVHRDELHAQVEIDGRVVGEGVGST 907

Query: 1050 WDEAKSKAAENALGALRPMLGQFPHKRQGSPRLMQEISSKRLKPEPSRILQRMPSSTR 1223
            WDEA+ +AAE AL ++R MLGQ  HKRQGSPR    +S+KRLKP+  R LQRMPSS R
Sbjct: 908  WDEARMQAAERALSSVRSMLGQPLHKRQGSPRSFGGMSNKRLKPDFQRSLQRMPSSGR 965

