BLASTX nr result

ID: Stemona21_contig00006182 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Stemona21_contig00006182
         (2212 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOY28304.1| C-terminal domain phosphatase-like 1 isoform 3 [T...   333   2e-88
gb|EOY28303.1| C-terminal domain phosphatase-like 1 isoform 2 [T...   331   7e-88
gb|EOY28302.1| C-terminal domain phosphatase-like 1 isoform 1 [T...   331   7e-88
gb|EMJ15747.1| hypothetical protein PRUPE_ppa000988mg [Prunus pe...   317   2e-83
ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal doma...   316   3e-83
ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal doma...   310   2e-81
ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citr...   308   8e-81
ref|XP_002519032.1| double-stranded RNA binding protein, putativ...   306   2e-80
ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Popu...   303   2e-79
ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Popu...   303   2e-79
ref|XP_006827806.1| hypothetical protein AMTR_s00009p00267690 [A...   294   1e-76
ref|XP_004953235.1| PREDICTED: RNA polymerase II C-terminal doma...   292   5e-76
ref|XP_003545893.1| PREDICTED: RNA polymerase II C-terminal doma...   284   1e-73
gb|AFW63149.1| hypothetical protein ZEAMMB73_795279 [Zea mays]        281   1e-72
ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal doma...   279   3e-72
ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal doma...   279   3e-72
ref|XP_002452510.1| hypothetical protein SORBIDRAFT_04g027200 [S...   278   5e-72
gb|ESW31299.1| hypothetical protein PHAVU_002G226900g [Phaseolus...   278   7e-72
gb|EEC73671.1| hypothetical protein OsI_08218 [Oryza sativa Indi...   277   1e-71
emb|CAN72816.1| hypothetical protein VITISV_004100 [Vitis vinifera]   277   1e-71

>gb|EOY28304.1| C-terminal domain phosphatase-like 1 isoform 3 [Theobroma cacao]
          Length = 870

 Score =  333 bits (853), Expect = 2e-88
 Identities = 207/428 (48%), Positives = 264/428 (61%), Gaps = 9/428 (2%)
 Frame = -1

Query: 2212 DGTSALNGSKGPLPFEGITDVEAEKRLKGLNFPIRAVPRRMVNNFEPRSTPSLQHAMSST 2033
            D TSALNG+K PL F+G+ D E E+RLK        V    +N  +PR TPSLQ+ M S+
Sbjct: 451  DDTSALNGNKDPLLFDGMADAEVERRLKEAISATSTVSSAAIN-LDPRLTPSLQYTMPSS 509

Query: 2032 SGVVPMSASQMIMPLPINQSSQSITLG----KPFGQTGFPEPRLQSSPAREEGEVPESEL 1865
            S  +P SASQ   P  ++ S+    L     KP      PEP LQSSPAREEGEVPESEL
Sbjct: 510  SSSIPPSASQ---PSIVSFSNMQFPLAAPVVKPVAPVAVPEPSLQSSPAREEGEVPESEL 566

Query: 1864 DPDTRRRLLILQHGQDIRGAXXXXXXXXXPLKLRAPIQVSMPL-QSRVGWFQSNGDVSPR 1688
            DPDTRRRLLILQHGQD R              +R  +QVS+P  QSR  WF +  ++SPR
Sbjct: 567  DPDTRRRLLILQHGQDTRDHTPPEPAFPP---VRPTMQVSVPRGQSRGSWFAAEEEMSPR 623

Query: 1687 QLKREA-KEFLLETEDVYFNKRRPQHPSFFGGGNNTNPSDRGLNGNQQCPMQTRQGDGYP 1511
            QL R A KEF L++E ++  K R  HP FF    ++ PSDR L  NQ+   +    D   
Sbjct: 624  QLNRAAPKEFPLDSERMHIEKHR--HPPFFPKVESSIPSDRLLRENQRLSKEALHRDDRL 681

Query: 1510 RSNHVISNYNSFTGEEKTMGRILFNHGDVHLESGQNMTKYAETPAGVLQKIALKCGTKVE 1331
              NH  S+Y+SF+GEE  + +   +H D+  ESG+ +T   ET AGVLQ IA+KCG KVE
Sbjct: 682  GLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFESGRTVTS-GETSAGVLQDIAMKCGAKVE 740

Query: 1330 YRAALPDSTELQFSFEVSIVGEQIGEGTGKTRKEAQAQAAEKSLQTLANKYLSDTMPNS- 1154
            +R AL  S +LQFS E    GE++GEG G+TR+EAQ QAAE+S++ LAN YLS   P+S 
Sbjct: 741  FRPALVASLDLQFSIEAWFAGEKVGEGVGRTRREAQRQAAEESIKNLANTYLSRIKPDSG 800

Query: 1153 SVHGDLPKLYYAKGNGF-VNSNTFRYQTL-RDGQIPVAVTSEDSRHLDKRLEGSKGSVAS 980
            S  GDL +L+    NGF  N N+F  Q L ++  +  +  SE SR  D RLEGSK S+ S
Sbjct: 801  SAEGDLSRLHNINDNGFPSNVNSFGNQLLAKEESLSFSTASEQSRLADPRLEGSKKSMGS 860

Query: 979  ISALKEVV 956
            ++ALKE+V
Sbjct: 861  VTALKELV 868


>gb|EOY28303.1| C-terminal domain phosphatase-like 1 isoform 2 [Theobroma cacao]
          Length = 984

 Score =  331 bits (849), Expect = 7e-88
 Identities = 206/427 (48%), Positives = 263/427 (61%), Gaps = 9/427 (2%)
 Frame = -1

Query: 2212 DGTSALNGSKGPLPFEGITDVEAEKRLKGLNFPIRAVPRRMVNNFEPRSTPSLQHAMSST 2033
            D TSALNG+K PL F+G+ D E E+RLK        V    +N  +PR TPSLQ+ M S+
Sbjct: 451  DDTSALNGNKDPLLFDGMADAEVERRLKEAISATSTVSSAAIN-LDPRLTPSLQYTMPSS 509

Query: 2032 SGVVPMSASQMIMPLPINQSSQSITLG----KPFGQTGFPEPRLQSSPAREEGEVPESEL 1865
            S  +P SASQ   P  ++ S+    L     KP      PEP LQSSPAREEGEVPESEL
Sbjct: 510  SSSIPPSASQ---PSIVSFSNMQFPLAAPVVKPVAPVAVPEPSLQSSPAREEGEVPESEL 566

Query: 1864 DPDTRRRLLILQHGQDIRGAXXXXXXXXXPLKLRAPIQVSMPL-QSRVGWFQSNGDVSPR 1688
            DPDTRRRLLILQHGQD R              +R  +QVS+P  QSR  WF +  ++SPR
Sbjct: 567  DPDTRRRLLILQHGQDTRDHTPPEPAFPP---VRPTMQVSVPRGQSRGSWFAAEEEMSPR 623

Query: 1687 QLKREA-KEFLLETEDVYFNKRRPQHPSFFGGGNNTNPSDRGLNGNQQCPMQTRQGDGYP 1511
            QL R A KEF L++E ++  K R  HP FF    ++ PSDR L  NQ+   +    D   
Sbjct: 624  QLNRAAPKEFPLDSERMHIEKHR--HPPFFPKVESSIPSDRLLRENQRLSKEALHRDDRL 681

Query: 1510 RSNHVISNYNSFTGEEKTMGRILFNHGDVHLESGQNMTKYAETPAGVLQKIALKCGTKVE 1331
              NH  S+Y+SF+GEE  + +   +H D+  ESG+ +T   ET AGVLQ IA+KCG KVE
Sbjct: 682  GLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFESGRTVTS-GETSAGVLQDIAMKCGAKVE 740

Query: 1330 YRAALPDSTELQFSFEVSIVGEQIGEGTGKTRKEAQAQAAEKSLQTLANKYLSDTMPNS- 1154
            +R AL  S +LQFS E    GE++GEG G+TR+EAQ QAAE+S++ LAN YLS   P+S 
Sbjct: 741  FRPALVASLDLQFSIEAWFAGEKVGEGVGRTRREAQRQAAEESIKNLANTYLSRIKPDSG 800

Query: 1153 SVHGDLPKLYYAKGNGF-VNSNTFRYQTL-RDGQIPVAVTSEDSRHLDKRLEGSKGSVAS 980
            S  GDL +L+    NGF  N N+F  Q L ++  +  +  SE SR  D RLEGSK S+ S
Sbjct: 801  SAEGDLSRLHNINDNGFPSNVNSFGNQLLAKEESLSFSTASEQSRLADPRLEGSKKSMGS 860

Query: 979  ISALKEV 959
            ++ALKE+
Sbjct: 861  VTALKEL 867


>gb|EOY28302.1| C-terminal domain phosphatase-like 1 isoform 1 [Theobroma cacao]
          Length = 978

 Score =  331 bits (849), Expect = 7e-88
 Identities = 206/427 (48%), Positives = 263/427 (61%), Gaps = 9/427 (2%)
 Frame = -1

Query: 2212 DGTSALNGSKGPLPFEGITDVEAEKRLKGLNFPIRAVPRRMVNNFEPRSTPSLQHAMSST 2033
            D TSALNG+K PL F+G+ D E E+RLK        V    +N  +PR TPSLQ+ M S+
Sbjct: 451  DDTSALNGNKDPLLFDGMADAEVERRLKEAISATSTVSSAAIN-LDPRLTPSLQYTMPSS 509

Query: 2032 SGVVPMSASQMIMPLPINQSSQSITLG----KPFGQTGFPEPRLQSSPAREEGEVPESEL 1865
            S  +P SASQ   P  ++ S+    L     KP      PEP LQSSPAREEGEVPESEL
Sbjct: 510  SSSIPPSASQ---PSIVSFSNMQFPLAAPVVKPVAPVAVPEPSLQSSPAREEGEVPESEL 566

Query: 1864 DPDTRRRLLILQHGQDIRGAXXXXXXXXXPLKLRAPIQVSMPL-QSRVGWFQSNGDVSPR 1688
            DPDTRRRLLILQHGQD R              +R  +QVS+P  QSR  WF +  ++SPR
Sbjct: 567  DPDTRRRLLILQHGQDTRDHTPPEPAFPP---VRPTMQVSVPRGQSRGSWFAAEEEMSPR 623

Query: 1687 QLKREA-KEFLLETEDVYFNKRRPQHPSFFGGGNNTNPSDRGLNGNQQCPMQTRQGDGYP 1511
            QL R A KEF L++E ++  K R  HP FF    ++ PSDR L  NQ+   +    D   
Sbjct: 624  QLNRAAPKEFPLDSERMHIEKHR--HPPFFPKVESSIPSDRLLRENQRLSKEALHRDDRL 681

Query: 1510 RSNHVISNYNSFTGEEKTMGRILFNHGDVHLESGQNMTKYAETPAGVLQKIALKCGTKVE 1331
              NH  S+Y+SF+GEE  + +   +H D+  ESG+ +T   ET AGVLQ IA+KCG KVE
Sbjct: 682  GLNHTPSSYHSFSGEEMPLSQSSSSHRDLDFESGRTVTS-GETSAGVLQDIAMKCGAKVE 740

Query: 1330 YRAALPDSTELQFSFEVSIVGEQIGEGTGKTRKEAQAQAAEKSLQTLANKYLSDTMPNS- 1154
            +R AL  S +LQFS E    GE++GEG G+TR+EAQ QAAE+S++ LAN YLS   P+S 
Sbjct: 741  FRPALVASLDLQFSIEAWFAGEKVGEGVGRTRREAQRQAAEESIKNLANTYLSRIKPDSG 800

Query: 1153 SVHGDLPKLYYAKGNGF-VNSNTFRYQTL-RDGQIPVAVTSEDSRHLDKRLEGSKGSVAS 980
            S  GDL +L+    NGF  N N+F  Q L ++  +  +  SE SR  D RLEGSK S+ S
Sbjct: 801  SAEGDLSRLHNINDNGFPSNVNSFGNQLLAKEESLSFSTASEQSRLADPRLEGSKKSMGS 860

Query: 979  ISALKEV 959
            ++ALKE+
Sbjct: 861  VTALKEL 867


>gb|EMJ15747.1| hypothetical protein PRUPE_ppa000988mg [Prunus persica]
          Length = 940

 Score =  317 bits (811), Expect = 2e-83
 Identities = 197/423 (46%), Positives = 254/423 (60%), Gaps = 5/423 (1%)
 Frame = -1

Query: 2212 DGTSALNGSKGPLPFEGITDVEAEKRLKGLNFPIRAVPRRMVNNFEPRSTPSLQHAMSST 2033
            D +SALNG++ PLPF+GITDVE E+R+K    P  ++   +  + +PR  P LQ+ +  +
Sbjct: 434  DDSSALNGNRDPLPFDGITDVEVERRMKEAT-PAASMVSSVFTSIDPRLAP-LQYTVPPS 491

Query: 2032 SGVVPMSASQMIMPLPINQSSQSITLGKPFGQTGFPEPRLQSSPAREEGEVPESELDPDT 1853
            S +   +    +M  P  Q  Q+ +L KP G  G  EP LQSSPAREEGEVPESELDPDT
Sbjct: 492  STLSLPTTQPSVMSFPSIQFPQAASLVKPLGHVGSAEPSLQSSPAREEGEVPESELDPDT 551

Query: 1852 RRRLLILQHGQDIRGAXXXXXXXXXPLKLRAPIQVSMP-LQSRVGWFQSNGDVSPRQLKR 1676
            RRRLLILQHGQD R           P  +R P+Q S+P  QSR GWF    ++SPRQL R
Sbjct: 552  RRRLLILQHGQDTR----DQPPSEPPFPVRPPMQASVPRAQSRPGWFPVEEEMSPRQLSR 607

Query: 1675 EA-KEFLLETEDVYFNKRRPQHPSFFGGGNNTNPSDRGLNGNQQCPMQTRQGDGYPRSNH 1499
               K+  L+ E V   K RP H SFF    N+ PSDR L  NQ+ P +    D   R NH
Sbjct: 608  MVPKDLPLDPETVQIEKHRPHHSSFFPKVENSIPSDRILQENQRLPKEAFHRDDRLRFNH 667

Query: 1498 VISNYNSFTGEEKTMGRILFNHGDVHLESGQNMTKYAETPAGVLQKIALKCGTKVEYRAA 1319
             +S Y+S +GEE  + R   ++ DV  ESG+ ++  AETPAGVLQ+IA+KCG K  +   
Sbjct: 668  ALSGYHSLSGEEIPLSRSSSSNRDVDFESGRAISN-AETPAGVLQEIAMKCGAKAWF--- 723

Query: 1318 LPDSTELQFSFEVSIVGEQIGEGTGKTRKEAQAQAAEKSLQTLANKYLSDTMPNS-SVHG 1142
                            GE+IGEG+GKTR+EA  QAAE SL+ LAN YLS   P+S SVHG
Sbjct: 724  ---------------AGEKIGEGSGKTRREAHYQAAEGSLKNLANIYLSRVKPDSVSVHG 768

Query: 1141 DLPKLYYAKGNGFV-NSNTFRYQTL-RDGQIPVAVTSEDSRHLDKRLEGSKGSVASISAL 968
            D+ K      NGF  N N+F  Q   ++  +  + +SE SR LD RLEGSK S++S+S L
Sbjct: 769  DMNKFPNVNSNGFAGNLNSFGIQPFPKEESLSSSTSSEPSRPLDPRLEGSKKSMSSVSTL 828

Query: 967  KEV 959
            KE+
Sbjct: 829  KEL 831


>ref|XP_004293503.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Fragaria vesca subsp. vesca]
          Length = 955

 Score =  316 bits (809), Expect = 3e-83
 Identities = 198/423 (46%), Positives = 258/423 (60%), Gaps = 5/423 (1%)
 Frame = -1

Query: 2212 DGTSALNGSKGPLPFEGITDVEAEKRLKGLNFPIRAVPRRMVNNFEPRSTPSLQHAMSST 2033
            D  SA NG++  LPF+G+ D E E+RLK        V   + NN +PR   SLQ+ +  +
Sbjct: 432  DDASASNGNRDQLPFDGMADAEVERRLKEATSAAPTVSSAVSNN-DPRLA-SLQYTVPLS 489

Query: 2032 SGVVPMSASQMIMPLPINQSSQSITLGKPFGQTGFPEPRLQSSPAREEGEVPESELDPDT 1853
            S V   +    +MP    Q  QS +L KP G  G  +  L SSPAREEGEVPESELDPDT
Sbjct: 490  STVSLPTNQPSMMPFHNVQFPQSASLVKPLGHVGPADLGLHSSPAREEGEVPESELDPDT 549

Query: 1852 RRRLLILQHGQDIRGAXXXXXXXXXPLKLRAPIQVSMP-LQSRVGWFQSNGDVSPRQLKR 1676
            RRRLLILQHGQD R +            +R  +QVS+P +QSR GWF    ++SPR+L R
Sbjct: 550  RRRLLILQHGQDTRESVPSEPS----FPVRPQVQVSVPRVQSRGGWFPVEEEMSPRKLSR 605

Query: 1675 EA-KEFLLETEDVYFNKRRPQHPSFFGGGNNTNPSDRGLNGNQQCPMQTRQGDGYPRSNH 1499
               KE  L +E +   K R  H +FF    N+ PSDR L  NQ+ P +    D   R N 
Sbjct: 606  MVPKEPPLNSEPMQIEKHRSHHSAFFPKVENSMPSDRILQENQRLPKEAFHRDNRLRFNQ 665

Query: 1498 VISNYNSFTGEEKTMGRILFNHGDVHLESGQNMTKYAETPAGVLQKIALKCGTKVEYRAA 1319
             +S Y+SF+GEE  + R   ++ D   ESG+ ++  AETPAGVLQ+IA+KCGTKVE+R A
Sbjct: 666  AMSGYHSFSGEEPPLNRSSSSNRDFDYESGRAISN-AETPAGVLQEIAMKCGTKVEFRPA 724

Query: 1318 LPDSTELQFSFEVSIVGEQIGEGTGKTRKEAQAQAAEKSLQTLANKYLSDTMPNS-SVHG 1142
            L  STELQF  E    GE+IGEGTG+TR+EA  QAAE SL+ LAN Y+S   P++  +HG
Sbjct: 725  LVPSTELQFYVEAWFAGEKIGEGTGRTRREAHFQAAEGSLKNLANIYISRGKPDALPIHG 784

Query: 1141 DLPKLYYAKGNGFV-NSNTFRYQTL-RDGQIPVAVTSEDSRHLDKRLEGSKGSVASISAL 968
            D  K      NGF+ N N+F  Q L ++  +  + +SE SR LD RL+ S+ SV+S+SAL
Sbjct: 785  DASKFSNVTNNGFMGNMNSFGTQPLPKEDSLSSSTSSEPSRPLDPRLDNSRKSVSSVSAL 844

Query: 967  KEV 959
            KE+
Sbjct: 845  KEL 847


>ref|XP_006467834.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Citrus sinensis]
          Length = 957

 Score =  310 bits (793), Expect = 2e-81
 Identities = 199/423 (47%), Positives = 253/423 (59%), Gaps = 5/423 (1%)
 Frame = -1

Query: 2212 DGTSALNGSKGPLPFEGITDVEAEKRLKGLNFPIRAVPRRMVNNFEPRSTPSLQHAMSST 2033
            D  +  NG K PL F+G+ D E E+RLK       A     V N +PR  P  Q+ M S+
Sbjct: 435  DDAATANGIKDPLSFDGMADAEVERRLKEA-IAASATISSAVANLDPRLAP-FQYTMPSS 492

Query: 2032 SGVVPMSASQM-IMPLPINQSSQSITLGKPFGQTGFPEPRLQSSPAREEGEVPESELDPD 1856
            S    +  SQ  +MPL   Q   + +L KP G  G PE  LQSSPAREEGEVPESELDPD
Sbjct: 493  SSTTTLPTSQAAVMPLANMQFPPATSLVKPLGHVGPPEQSLQSSPAREEGEVPESELDPD 552

Query: 1855 TRRRLLILQHGQDIRGAXXXXXXXXXPLKLRAPIQVSMP-LQSRVGWFQSNGDVSPRQLK 1679
            TRRRLLILQHG D R           P   R  +QVS+P + SR  WF    ++SPRQL 
Sbjct: 553  TRRRLLILQHGMDTR----ENAPSEAPFPARTQMQVSVPRVPSRGSWFPVEEEMSPRQLN 608

Query: 1678 REA-KEFLLETEDVYFNKRRPQHPSFFGGGNNTNPSDRGLNGNQQCPMQTRQGDGYPRSN 1502
            R   KEF L +E +   K RP HPSFF    N + SDR  + NQ+ P +  + D   R N
Sbjct: 609  RAVPKEFPLNSEAMQIEKHRPPHPSFFPKIENPSTSDRP-HENQRMPKEALRRDDRLRLN 667

Query: 1501 HVISNYNSFTGEEKTMGRILFNHGDVHLESGQNMTKYAETPAGVLQKIALKCGTKVEYRA 1322
            H +S+Y SF+GEE  + R   +  DV  ESG++++   ETP+GVLQ IA+KCGTKVE+R 
Sbjct: 668  HTLSDYQSFSGEEIPLSRSSSSSRDVDFESGRDVSS-TETPSGVLQDIAMKCGTKVEFRP 726

Query: 1321 ALPDSTELQFSFEVSIVGEQIGEGTGKTRKEAQAQAAEKSLQTLANKYLSDTMPNS-SVH 1145
            AL  STELQFS E    GE+IGEG G+TR+EAQ QAAE S++ LAN Y+     +S S H
Sbjct: 727  ALVASTELQFSIEAWFAGEKIGEGIGRTRREAQRQAAEGSIKHLANVYMLRVKSDSGSGH 786

Query: 1144 GDLPKLYYAKGNGFVNS-NTFRYQTLRDGQIPVAVTSEDSRHLDKRLEGSKGSVASISAL 968
            GD  +   A  N F+   N+F  Q L   +   +++SE S+ +D RLEGSK  + S+SAL
Sbjct: 787  GDGSRFSNANENCFMGEINSFGGQPLAKDE---SLSSEPSKLVDPRLEGSKKLMGSVSAL 843

Query: 967  KEV 959
            KE+
Sbjct: 844  KEL 846


>ref|XP_006449302.1| hypothetical protein CICLE_v10014168mg [Citrus clementina]
            gi|557551913|gb|ESR62542.1| hypothetical protein
            CICLE_v10014168mg [Citrus clementina]
          Length = 957

 Score =  308 bits (788), Expect = 8e-81
 Identities = 199/423 (47%), Positives = 253/423 (59%), Gaps = 5/423 (1%)
 Frame = -1

Query: 2212 DGTSALNGSKGPLPFEGITDVEAEKRLKGLNFPIRAVPRRMVNNFEPRSTPSLQHAMSST 2033
            D  +  NG K PL F+G+ D E E+RLK       A     V N +PR  P  Q+ M S+
Sbjct: 435  DDAATANGIKDPLSFDGMADAEVERRLKEA-IAASATISSAVANLDPRLAP-FQYTMPSS 492

Query: 2032 SGVVPMSASQM-IMPLPINQSSQSITLGKPFGQTGFPEPRLQSSPAREEGEVPESELDPD 1856
            S    +  SQ  +MPL   Q   + +L KP G  G PE  LQSSPAREEGEVPESELDPD
Sbjct: 493  SSTTTLPTSQAAVMPLANMQFPPATSLVKPLGHVGPPEQCLQSSPAREEGEVPESELDPD 552

Query: 1855 TRRRLLILQHGQDIRGAXXXXXXXXXPLKLRAPIQVSMP-LQSRVGWFQSNGDVSPRQLK 1679
            TRRRLLILQHG D R           P   R  +QVS+P + SR  WF    ++SPRQL 
Sbjct: 553  TRRRLLILQHGMDTR----ENAPSEAPFPARTQMQVSVPRVPSRGSWFPVEEEMSPRQLN 608

Query: 1678 REA-KEFLLETEDVYFNKRRPQHPSFFGGGNNTNPSDRGLNGNQQCPMQTRQGDGYPRSN 1502
            R   KEF L +E +   K RP HPSFF    N+  SDR  + NQ+ P +  + D   R N
Sbjct: 609  RAVPKEFPLNSEAMQIEKHRPPHPSFFPKIENSITSDRP-HENQRMPKEALRRDDRLRLN 667

Query: 1501 HVISNYNSFTGEEKTMGRILFNHGDVHLESGQNMTKYAETPAGVLQKIALKCGTKVEYRA 1322
            H +S+Y SF+GEE  + R   +  DV  ESG++++   ETP+GVLQ IA+KCGTKVE+R 
Sbjct: 668  HTLSDYQSFSGEEIPLSRSSSSSRDVDFESGRDVSS-TETPSGVLQDIAMKCGTKVEFRP 726

Query: 1321 ALPDSTELQFSFEVSIVGEQIGEGTGKTRKEAQAQAAEKSLQTLANKYLSDTMPNS-SVH 1145
            AL  STELQFS E    GE+IGEG G+TR+EAQ QAAE S++ LAN Y+     +S S H
Sbjct: 727  ALVASTELQFSIEAWFAGEKIGEGIGRTRREAQRQAAEGSIKHLANVYVLRVKSDSGSGH 786

Query: 1144 GDLPKLYYAKGNGFVNS-NTFRYQTLRDGQIPVAVTSEDSRHLDKRLEGSKGSVASISAL 968
            GD  +   A  N F+   N+F  Q L   +   +++SE S+ +D RLEGSK  + S+SAL
Sbjct: 787  GDGSRFSNANENCFMGEINSFGGQPLAKDE---SLSSEPSKLVDPRLEGSKKLMGSVSAL 843

Query: 967  KEV 959
            KE+
Sbjct: 844  KEL 846


>ref|XP_002519032.1| double-stranded RNA binding protein, putative [Ricinus communis]
            gi|223541695|gb|EEF43243.1| double-stranded RNA binding
            protein, putative [Ricinus communis]
          Length = 978

 Score =  306 bits (785), Expect = 2e-80
 Identities = 187/422 (44%), Positives = 260/422 (61%), Gaps = 5/422 (1%)
 Frame = -1

Query: 2212 DGTSALNGSKGPLPFEGITDVEAEKRLKGLNFPIRAVPRRMVNNFEPRSTPSLQHAMSST 2033
            D     NG++ PL F+G+ D E EKRLK       A P   V N + R  P LQ+ M+S+
Sbjct: 449  DDAFTSNGNRDPLSFDGMADAEVEKRLKEAISISSAFPST-VANLDARLVPPLQYTMASS 507

Query: 2032 SGVVPMSASQMIMPLPINQSSQSITLGKPFGQTGFPEPRLQSSPAREEGEVPESELDPDT 1853
            S +   ++   ++  P  Q  Q+  L KP GQ    EP LQSSPAREEGEVPESELDPDT
Sbjct: 508  SSIPVPTSQPAVVTFPSMQLPQAAPLVKPLGQVVPSEPSLQSSPAREEGEVPESELDPDT 567

Query: 1852 RRRLLILQHGQDIRGAXXXXXXXXXPLKLRAPIQVSMP-LQSRVGWFQSNGDVSPRQLKR 1676
            RRRLLILQHGQD+R           P++    +QVS+P +QSR  W     ++SPRQL R
Sbjct: 568  RRRLLILQHGQDLR--DPAPSESPFPVRPSNSMQVSVPRVQSRGNWVPVEEEMSPRQLNR 625

Query: 1675 E-AKEFLLETEDVYFNKRRPQHPSFFGGGNNTNPSDRGLNGNQQCPMQTRQGDGYPRSNH 1499
               +EF ++TE ++ +K RP HPSFF    ++ PS+R  + NQ+ P      D   R N 
Sbjct: 626  AVTREFPMDTEPMHIDKHRPHHPSFFPKVESSIPSERMPHENQRLPKVAPYKDDRLRLNQ 685

Query: 1498 VISNYNSFTGEEKTMGRILFNHGDVHLESGQNMTKYAETPAGVLQKIALKCGTKVEYRAA 1319
             +SNY S +GEE ++ R   ++ D+ +ES + ++  AETP  VL +I++KCG KVE++ +
Sbjct: 686  TMSNYQSLSGEENSLSRSSSSNRDLDVESDRAVSS-AETPVRVLHEISMKCGAKVEFKHS 744

Query: 1318 LPDSTELQFSFEVSIVGEQIGEGTGKTRKEAQAQAAEKSLQTLANKYLSDTMP-NSSVHG 1142
            L +S +LQFS E    GE++GEG G+TR+EAQ+ AAE S++ LAN Y+S   P N ++HG
Sbjct: 745  LVNSRDLQFSVEAWFAGERVGEGFGRTRREAQSVAAEASIKNLANIYISRAKPDNGALHG 804

Query: 1141 DLPKLYYAKGNGFV-NSNTFRYQTL-RDGQIPVAVTSEDSRHLDKRLEGSKGSVASISAL 968
            D  K   A  NGF+ + N+F  Q L +D  +  + +SE S  LD RLE SK S++S++AL
Sbjct: 805  DASKYSSANDNGFLGHVNSFGSQPLPKDEILSYSDSSEQSGLLDPRLESSKKSMSSVNAL 864

Query: 967  KE 962
            KE
Sbjct: 865  KE 866


>ref|XP_002305017.2| hypothetical protein POPTR_0004s04010g [Populus trichocarpa]
            gi|550340277|gb|EEE85528.2| hypothetical protein
            POPTR_0004s04010g [Populus trichocarpa]
          Length = 996

 Score =  303 bits (776), Expect = 2e-79
 Identities = 196/444 (44%), Positives = 255/444 (57%), Gaps = 27/444 (6%)
 Frame = -1

Query: 2212 DGTSALNGSKGPLPFEGITDVEAEKRLK---GLNFPIRAVPRRMVNNFEPRSTPSLQHAM 2042
            D  SA+NG++  L F+G+ D E E++LK     +  I +     V++ +PR   SLQ+ +
Sbjct: 447  DDASAVNGNRDQLSFDGMADAEVERQLKEAVSASSAILSTIPSTVSSLDPRLLQSLQYTI 506

Query: 2041 SSTSGVVPMSASQMIM--------------------PLPINQSSQSITLGKPFGQTGFPE 1922
            +S+S  +P S   M+                     P P  Q  Q     K  GQ   PE
Sbjct: 507  ASSSSSMPTSQPSMLASQQPMPALQPPKPPSQLSMTPFPNTQFPQVAPSVKQLGQVVPPE 566

Query: 1921 PRLQSSPAREEGEVPESELDPDTRRRLLILQHGQDIRGAXXXXXXXXXPLKLRAPIQVSM 1742
            P LQSSPAREEGEVPESELDPDTRRRLLILQHG D R           P   R   QVS 
Sbjct: 567  PSLQSSPAREEGEVPESELDPDTRRRLLILQHGHDSRD----NAPSESPFPARPSTQVSA 622

Query: 1741 PLQSRVG-WFQSNGDVSPRQLKREAKEFLLETEDVYFNKRRPQHPSFFGGGNNTNPSDRG 1565
            P    VG W     ++SPRQL R  +EF L+++ +   K R  HPSFF    +  PSDR 
Sbjct: 623  PRVQSVGSWVPVEEEMSPRQLNRTPREFPLDSDPMNIEKHRTHHPSFFHKVESNIPSDRM 682

Query: 1564 LNGNQQCPMQTRQGDGYPRSNHVISNYNSFTGEEKTMGRILFNHGDVHLESGQNMTKYAE 1385
            ++ NQ+ P +    D   + NH  SNY SF GEE  + R   N  D+ LES +  +   E
Sbjct: 683  IHENQRQPKEATYRDDRMKLNHSTSNYPSFQGEESPLSRSSSNR-DLDLESERAFSS-TE 740

Query: 1384 TPAGVLQKIALKCGTKVEYRAALPDSTELQFSFEVSIVGEQIGEGTGKTRKEAQAQAAEK 1205
            TP  VLQ+IA+KCGTKVE+R AL  +++LQFS E   VGE++GEGTGKTR+EAQ QAAE 
Sbjct: 741  TPVEVLQEIAMKCGTKVEFRPALIATSDLQFSIETWFVGEKVGEGTGKTRREAQRQAAEG 800

Query: 1204 SLQTLANKYLSDTMPNSS-VHGDLPKLYYAKGNGFV-NSNTFRYQ-TLRDGQIPVAVTSE 1034
            S++ LA  Y+S   P+S  + GD  +   A  NGF+ + N+F  Q  L+D  I  + TSE
Sbjct: 801  SIKKLAGIYMSRVKPDSGPMLGDSSRYPSANDNGFLGDMNSFGNQPLLKDENITYSATSE 860

Query: 1033 DSRHLDKRLEGSKGSVASISALKE 962
             SR LD+RLEGSK S+ S++ALKE
Sbjct: 861  PSRLLDQRLEGSKKSMGSVTALKE 884


>ref|XP_006377325.1| hypothetical protein POPTR_0011s04910g [Populus trichocarpa]
            gi|550327613|gb|ERP55122.1| hypothetical protein
            POPTR_0011s04910g [Populus trichocarpa]
          Length = 990

 Score =  303 bits (776), Expect = 2e-79
 Identities = 193/436 (44%), Positives = 252/436 (57%), Gaps = 19/436 (4%)
 Frame = -1

Query: 2212 DGTSALNGSKGPLPFEGITDVEAEKRLKGLNFPIRAVPRRM---VNNFEPRSTPSLQHAM 2042
            D  SA NG++ P  F+   D E E+RLK        +P  +   V++ +PR   SLQ+A+
Sbjct: 448  DDASAANGNRDPPSFDSTADAEVERRLKEAVSASSTIPSTIPSTVSSLDPRLLQSLQYAV 507

Query: 2041 SSTSGVVPMSASQMI-------------MPLPINQSSQSITLGKPFGQTGFPEPRLQSSP 1901
            +S+S ++P S   M+             MP P  Q  Q   L K  GQ   PEP LQSSP
Sbjct: 508  ASSSSLMPASQPSMLASQQPVPASQTSMMPFPNTQFPQVAPLVKQLGQVVHPEPSLQSSP 567

Query: 1900 AREEGEVPESELDPDTRRRLLILQHGQDIRGAXXXXXXXXXPLKLRAPIQVSMPLQSRVG 1721
            AREEGEVPESELDPDTRRRLLILQHGQD R           P +  AP+  +  +QSR  
Sbjct: 568  AREEGEVPESELDPDTRRRLLILQHGQDSRD--NAPSESPFPARPSAPVSAAH-VQSRGS 624

Query: 1720 WFQSNGDVSPRQLKREAKEFLLETEDVYFNKRRPQHPSFFGGGNNTNPSDRGLNGNQQCP 1541
            W     +++PRQL R  +EF L+++ +   K +  HPSFF    +  PSDR ++ NQ+ P
Sbjct: 625  WVPVEEEMTPRQLNRTPREFPLDSDPMNIEKHQTHHPSFFPKVESNIPSDRMIHENQRLP 684

Query: 1540 MQTRQGDGYPRSNHVISNYNSFTGEEKTMGRILFNHGDVHLESGQNMTKYAETPAGVLQK 1361
             +    +   R NH   NY+SF  EE  + R   N  D+ LES +  T  +ETP  VLQ+
Sbjct: 685  KEAPYRNDRMRLNHSTPNYHSFQVEETPLSRSSSNR-DLDLESERAFT-ISETPVEVLQE 742

Query: 1360 IALKCGTKVEYRAALPDSTELQFSFEVSIVGEQIGEGTGKTRKEAQAQAAEKSLQTLANK 1181
            IA+KC TKVE+R AL  S +LQFS E    GE++GEGTGKTR+EAQ QAAE S++ LA  
Sbjct: 743  IAMKCETKVEFRPALVASIDLQFSIEAWFAGEKVGEGTGKTRREAQRQAAEGSIKKLAGI 802

Query: 1180 YLSDTMPNSS-VHGDLPKLYYAKGNGFV-NSNTFRYQTL-RDGQIPVAVTSEDSRHLDKR 1010
            Y+    P+S  +HGD  +   A  NGF+ N N F  Q L +D  +  +  SE SR LD R
Sbjct: 803  YMLRAKPDSGPMHGDSSRYPSANDNGFLGNMNLFGNQPLPKDELVAYSAASEPSRLLDPR 862

Query: 1009 LEGSKGSVASISALKE 962
            LEGSK S  S++ALKE
Sbjct: 863  LEGSKKSSGSVTALKE 878


>ref|XP_006827806.1| hypothetical protein AMTR_s00009p00267690 [Amborella trichopoda]
            gi|548832426|gb|ERM95222.1| hypothetical protein
            AMTR_s00009p00267690 [Amborella trichopoda]
          Length = 942

 Score =  294 bits (753), Expect = 1e-76
 Identities = 188/425 (44%), Positives = 255/425 (60%), Gaps = 5/425 (1%)
 Frame = -1

Query: 2212 DGTSALNGSKG-PLPFEGITDVEAEKRLKGLNFPIRAVPRRMVNN-FEPRSTPSLQHAMS 2039
            D +S LNG+K  P+P EG+ D E E+RLK  NF ++A+P    NN FE R T SLQH ++
Sbjct: 429  DDSSVLNGNKDLPIP-EGMVDSEVERRLKDANFAMQAMPTSTSNNNFERRPTMSLQH-VA 486

Query: 2038 STSGVVPMSASQMIMPLPINQSSQSITLGKPFGQTGFPEPRLQSSPAREEGEVPESELDP 1859
            STS ++  S  Q  M L   Q + ++   KP G     +  LQ SP REEGEVPESELDP
Sbjct: 487  STSNMISQSPCQGPMSLNNKQYNHAVPSLKPSGHICSSDSTLQCSPGREEGEVPESELDP 546

Query: 1858 DTRRRLLILQHGQDIRGAXXXXXXXXXPLKLRAPIQVSM-PLQSRVGWFQSNGDVSPRQL 1682
            DTRRRLLILQHGQD R           P  LR  +Q+++ P QS   WF    ++SPRQL
Sbjct: 547  DTRRRLLILQHGQDTR-EHGTIDPPPPPFPLRPALQIAVPPAQSHGPWFPVEEEMSPRQL 605

Query: 1681 KREAKEFLLETEDVYFNKRRPQHPSFFGGGNNTNPSDRGLNGNQQCPMQTRQGDGYPRSN 1502
                +EF LE E V F++ R +   FF G + + P+DR  N  Q+   + +  D     N
Sbjct: 606  SHPLREFPLEPEAVQFDRHRAR--PFFHGVDGSIPADRVFNEAQRLSKEVQYRDDRLHQN 663

Query: 1501 HVISNYNSFTG-EEKTMGRILFNHGDVHLESGQNMTKYAETPAGVLQKIALKCGTKVEYR 1325
               ++Y+SF   EE   G+   N  DV   +GQ   +Y+ TP GVL+ IA+KCG+KV++R
Sbjct: 664  LPKTSYSSFPEVEEMPPGQSSSNTRDVPFATGQVPPQYSPTPVGVLKDIAIKCGSKVDFR 723

Query: 1324 AALPDSTELQFSFEVSIVGEQIGEGTGKTRKEAQAQAAEKSLQTLANKYLSDTMPNSSVH 1145
            + +  +TELQFS EV  VGE+IGEG GKTRKEAQ +A+E S++TLA  YL+   P+  + 
Sbjct: 724  SMVVPTTELQFSVEVWFVGEKIGEGIGKTRKEAQFKASEASIRTLARTYLAQISPDIGLG 783

Query: 1144 -GDLPKLYYAKGNGFVNSNTFRYQTLRDGQIPVAVTSEDSRHLDKRLEGSKGSVASISAL 968
             GD+        NG +  ++     LR+  +P+A TSE  R LD+RLEGSK S+  +S+L
Sbjct: 784  CGDMDDRSLGSDNGLM-GDSISSAGLREDSLPIASTSEQQRFLDQRLEGSKQSIGVVSSL 842

Query: 967  KEVVS 953
            KE+ S
Sbjct: 843  KELCS 847


>ref|XP_004953235.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Setaria italica]
            gi|514715399|ref|XP_004953236.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 1-like
            isoform X2 [Setaria italica]
          Length = 937

 Score =  292 bits (747), Expect = 5e-76
 Identities = 185/422 (43%), Positives = 247/422 (58%), Gaps = 4/422 (0%)
 Frame = -1

Query: 2212 DGTSALNGSKGPLPFEGITDVEAEKRLKGLNFPIRAVPRRMVNNFEPRSTPSLQHAMSST 2033
            +  +A+NG++  LPF+G+ D E E+R+K  +   +A    + +   P + P  Q+ +SS+
Sbjct: 437  ENVAAVNGNRDALPFDGMADAEVERRMKEASGNAQAFHPTVASFVMPVAPP--QNFISSS 494

Query: 2032 SGVVPMSASQMIMPLPINQSSQSITLGKPFGQTGFPEPRLQSSPAREEGEVPESELDPDT 1853
              V P++    +MP P NQ         P  Q GF +P LQ SPAREEGEVPESELDPDT
Sbjct: 495  --VAPIAPPLGMMPPPFNQ---------PVVQPGFSDP-LQGSPAREEGEVPESELDPDT 542

Query: 1852 RRRLLILQHGQDIRGAXXXXXXXXXPLKLRAPIQVSMP-LQSRVGWFQSNGDVSPRQLKR 1676
            RRRLLILQHGQD R A          L    P+QV +P +Q    WF +   ++P  L  
Sbjct: 543  RRRLLILQHGQDTRDATPP-------LPAIPPVQVPVPPVQPHGNWFPAEDGMNPSNLNI 595

Query: 1675 EAKEFLLETEDVYFNKRRPQHPSFFGGGNNTNPSDRGLNGNQQCPMQTRQGDGYP-RSNH 1499
                F +E++ + + K++P HPSFF GG+N   SDR    NQ+ P Q    D +    NH
Sbjct: 596  GPAGFTVESDSLLYEKKQPPHPSFFHGGDNPMSSDRFSYQNQRFPSQLPHADDHHILQNH 655

Query: 1498 VISNYNSFTGEEKTMGRILFNHGDVHLESGQNMTKYAETPAGVLQKIALKCGTKVEYRAA 1319
                Y SF+GEE +   +  N  +  LESG++  +Y  TPAG+L+ IALKCG+KVEYR+ 
Sbjct: 656  GPPKYRSFSGEELSGRHVPTNQRNNQLESGRHFAQYTGTPAGILEGIALKCGSKVEYRST 715

Query: 1318 LPDSTELQFSFEVSIVGEQIGEGTGKTRKEAQAQAAEKSLQTLANKYLSDTMPNSSVHGD 1139
            L D+ ELQFS EV IVGE+IGEG G+TR+EAQ QAAE SL+ LANKYLS          D
Sbjct: 716  LCDTAELQFSIEVWIVGEKIGEGIGRTRREAQRQAAEMSLRNLANKYLS---------SD 766

Query: 1138 LPKLYYAKGNGF-VNSNTFRYQ-TLRDGQIPVAVTSEDSRHLDKRLEGSKGSVASISALK 965
              K+   K NGF  N N F Y    RD  +PV  TSE+SR +      S+ +  S++ALK
Sbjct: 767  PNKMTDLKENGFSSNRNFFGYSGNNRDDILPVPSTSEESRFMKMEENNSRKTGGSVAALK 826

Query: 964  EV 959
            E+
Sbjct: 827  EL 828


>ref|XP_003545893.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
          Length = 958

 Score =  284 bits (726), Expect = 1e-73
 Identities = 187/425 (44%), Positives = 255/425 (60%), Gaps = 8/425 (1%)
 Frame = -1

Query: 2212 DGTSALNGSKGPLPFEGITDVEAEKRLKGLNFPIRAVPRRMVNNFEPRST--PSLQHAMS 2039
            D  SA NG+K  L F+G+ D E E+RLK        VP  M  N +PR     SLQ+ M 
Sbjct: 427  DDASASNGNKNLLLFDGMADAEVERRLKDAISASSTVPA-MTTNLDPRLAFNSSLQYTMV 485

Query: 2038 STSGVVPMSASQM-IMPLPINQSSQSITLGKPFGQTGFPEPRLQSSPAREEGEVPESELD 1862
            S+SG VP   +Q  I+     Q  Q  TL KP  Q   P P L SSPAREEGEVPESELD
Sbjct: 486  SSSGTVPPPTAQASIVQFGNVQFPQPNTLVKPICQVTPPGPSLHSSPAREEGEVPESELD 545

Query: 1861 PDTRRRLLILQHGQDIRGAXXXXXXXXXPLKLRAPIQVSMP-LQSRVGWFQSNGDVSPRQ 1685
             DTRRRLLILQHGQD R           PL +R P QVS P + SR GWF    ++ P+Q
Sbjct: 546  LDTRRRLLILQHGQDTR----EHTSSEPPLPVRHPTQVSAPSVPSRRGWFSVEEEMGPQQ 601

Query: 1684 LKREA-KEFLLETEDVYFNKRRPQHPSFFGGGNNTNPSDRGLN-GNQQCPMQTRQGDGYP 1511
            L +   KEF + +E ++  KR P+HPS F   +++  SDR  +  +Q+ P +    D + 
Sbjct: 602  LNQLVPKEFPVGSEPLHIEKRWPRHPSLFSKVDDSVSSDRVFHESHQRLPKEVHHRDDHS 661

Query: 1510 RSNHVISNYNSFTGEEKTMGRILFNHGDVHLESGQNMTKYAETPAGVLQKIALKCGTKVE 1331
            R +  +S+Y+SF G++  +    +++ D   ESG+++  +A+  AGVLQ+IALKCGTKVE
Sbjct: 662  RLSQSLSSYHSFPGDDIPLSGSSYSNRDFDSESGRSLF-HADITAGVLQEIALKCGTKVE 720

Query: 1330 YRAALPDSTELQFSFEVSIVGEQIGEGTGKTRKEAQAQAAEKSLQTLANKYLSDTMPNS- 1154
            + ++L  ST LQFS E    G+++GEG G+TR+EAQ +AAE S++ LA+ Y+S    +S 
Sbjct: 721  FLSSLVASTALQFSIEAWFAGKKVGEGFGRTRREAQNKAAECSIKQLADIYMSHAKDDSG 780

Query: 1153 SVHGDLPKLYYAKGNGFVNS-NTFRYQTLRDGQIPVAVTSEDSRHLDKRLEGSKGSVASI 977
            S +GD+   + +  NGFV+S N+   Q L    +  + +S+ SR  D RLE SK S  SI
Sbjct: 781  STYGDVSGFHGSNNNGFVSSGNSLGNQLLPKESVSFSTSSDSSRVSDPRLEVSKRSTDSI 840

Query: 976  SALKE 962
            SALKE
Sbjct: 841  SALKE 845


>gb|AFW63149.1| hypothetical protein ZEAMMB73_795279 [Zea mays]
          Length = 932

 Score =  281 bits (718), Expect = 1e-72
 Identities = 176/443 (39%), Positives = 249/443 (56%), Gaps = 3/443 (0%)
 Frame = -1

Query: 2212 DGTSALNGSKGPLPFEGITDVEAEKRLKGLNFPIRAVPRRMVNNFEPRSTPSLQHAMSST 2033
            +  + +NG++  LPF+G+ D E E+R+K  N        +   +F     P+     +S 
Sbjct: 435  ENVALVNGNRDSLPFDGMADAEVERRMKEANAQSF---HQTAGDFVMPVAPAQNFVSTSV 491

Query: 2032 SGVVPMSASQMIMPLPINQSSQSITLGKPFGQTGFPEPRLQSSPAREEGEVPESELDPDT 1853
            + + P      +MP P +Q         P    GF +  LQ SPAREEGEVPESELDPDT
Sbjct: 492  ASLAPPLG---MMPSPFSQ---------PVAPPGFSDS-LQGSPAREEGEVPESELDPDT 538

Query: 1852 RRRLLILQHGQDIRGAXXXXXXXXXPLKLRAPIQVSMP-LQSRVGWFQSNGDVSPRQLKR 1676
            RRRLLILQHGQD R            L    P+QV +P +Q    WF +   ++   L R
Sbjct: 539  RRRLLILQHGQDTRDPTSP-------LPAIPPVQVPVPPVQPHGNWFPTEDGINQSNLNR 591

Query: 1675 EAKEFLLETEDVYFNKRRPQHPSFFGGGNNTNPSDRGLNGNQQCPMQTRQGDGYPRSNHV 1496
             +  F +E++ + + K++P HPSFF GG++  PSDR    NQ+ P Q    D     NH 
Sbjct: 592  GSAGFTVESDSIVYEKKQPPHPSFFHGGDSPMPSDRFGYQNQRFPSQLPHEDHPMMQNHA 651

Query: 1495 ISNYNSFTGEEKTMGRILFNHGDVHLESGQNMTKYAETPAGVLQKIALKCGTKVEYRAAL 1316
               Y SF+GEE     +  +  +  +ESG++  +YA T AG+L+ IALKCG+KVEY++AL
Sbjct: 652  PPKYRSFSGEELASWHVPSSQRNNQIESGRHFAQYAGTSAGILEGIALKCGSKVEYKSAL 711

Query: 1315 PDSTELQFSFEVSIVGEQIGEGTGKTRKEAQAQAAEKSLQTLANKYLSDTMPNSSVHGDL 1136
             D+ ELQFS EV IVGE++GEG G+TR+EAQ QAAE SL+ LANKYLS          D 
Sbjct: 712  CDTAELQFSIEVWIVGEKVGEGIGRTRREAQRQAAEMSLRNLANKYLS---------SDP 762

Query: 1135 PKLYYAKGNGF-VNSNTFRYQ-TLRDGQIPVAVTSEDSRHLDKRLEGSKGSVASISALKE 962
             KL   K N F  N N F Y    RD  +P++ TSE+SR +      S+ + +S++ALKE
Sbjct: 763  NKLSDMKENDFSSNRNVFGYSGNTRDDMLPLSSTSEESRFMKMENNNSRKTGSSVAALKE 822

Query: 961  VVS***FRIIYRMDLEQTADNLL 893
            + +   + ++++     +AD L+
Sbjct: 823  LCTVEGYNLVFQA-CPSSADGLV 844


>ref|XP_003529311.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like isoform X1 [Glycine max]
          Length = 956

 Score =  279 bits (714), Expect = 3e-72
 Identities = 180/424 (42%), Positives = 244/424 (57%), Gaps = 6/424 (1%)
 Frame = -1

Query: 2212 DGTSALNGSKGPLPFEGITDVEAEKRLKGLNFPIRAVPRRMVNNFEPRSTPSLQHAMSST 2033
            D  S  NG + P  F+G+ D E E++LK        +P    N  +PR T SLQ+ M  +
Sbjct: 428  DDGSISNGHRDPFLFDGMADAEVERKLKDALSAASTIPVTTAN-LDPRLT-SLQYTMVPS 485

Query: 2032 SGVVPMSASQMIMPLPINQSSQSITLGKPFGQTGFPEPRLQSSPAREEGEVPESELDPDT 1853
              V P +A   +MP P  Q  Q  TL KP GQ    EP L SSPAREEGEVPESELDPDT
Sbjct: 486  GSVPPPTAQASMMPFPHVQFPQPATLVKPMGQAAPSEPSLHSSPAREEGEVPESELDPDT 545

Query: 1852 RRRLLILQHGQDIRGAXXXXXXXXXPLKLRAPIQVSMP--LQSRVGWFQSNGDVSPRQLK 1679
            RRRLLILQHGQD R           P  +R P+Q S P    SR  WF +  ++  + L 
Sbjct: 546  RRRLLILQHGQDTR----DHASAEPPFPVRHPVQTSAPHVPSSRGVWFPAEEEIGSQPLN 601

Query: 1678 REA-KEFLLETEDVYFNKRRPQHPSFFGGGNNTNPSDRGL-NGNQQCPMQTRQGDGYPRS 1505
            R   KEF +++  +   K RP HPSFF    ++  SDR L + +Q+ P +    D  PR 
Sbjct: 602  RVVPKEFPVDSGPLGIAKPRPHHPSFFSKVESSISSDRILHDSHQRLPKEMYHRDDRPRL 661

Query: 1504 NHVISNYNSFTGEEKTMGRILFNHGDVHLESGQNMTKYAETPAGVLQKIALKCGTKVEYR 1325
            NH++S+Y SF+G++    R   +H D+  ESG ++  +A+TP  VLQ+IALKCGTKV++ 
Sbjct: 662  NHMLSSYRSFSGDDIPFSRSFSSHRDLDSESGHSVL-HADTPVAVLQEIALKCGTKVDFI 720

Query: 1324 AALPDSTELQFSFEVSIVGEQIGEGTGKTRKEAQAQAAEKSLQTLANKYLSDTMPN-SSV 1148
            ++L  STELQFS E    G++IG   G+TRKEAQ +AAE S++ LA+ YLS       S 
Sbjct: 721  SSLVASTELQFSMEAWFSGKKIGHRVGRTRKEAQNKAAEDSIKHLADIYLSSAKDEPGST 780

Query: 1147 HGDLPKLYYAKGNGFVN-SNTFRYQTLRDGQIPVAVTSEDSRHLDKRLEGSKGSVASISA 971
            +GD+        +G++  +++   Q L         T+  SR LD RL+ SK S+ SIS+
Sbjct: 781  YGDVSGFPNVNDSGYMGIASSLGNQPLSKEDSASFSTASPSRVLDPRLDVSKRSMGSISS 840

Query: 970  LKEV 959
            LKE+
Sbjct: 841  LKEL 844


>ref|XP_006347069.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            1-like [Solanum tuberosum]
          Length = 953

 Score =  279 bits (714), Expect = 3e-72
 Identities = 185/424 (43%), Positives = 241/424 (56%), Gaps = 6/424 (1%)
 Frame = -1

Query: 2212 DGTSALNGSKGPLPFEGITDVEAEKRLKGLNFPIRAVPRRMVNNFEPRSTPSLQHAMSST 2033
            D  SA+NG+K  L F+G+ D E E+RLK       +VP +M N  +PR  P+LQ+ +   
Sbjct: 430  DDPSAVNGNKDSLGFDGMADSEVERRLKEAMLASTSVPSQMTN-LDPRLVPALQYPVPPV 488

Query: 2032 SGVVPMSASQMIMPLPINQSSQ-SITLGKPFGQTGFPEPRLQSSPAREEGEVPESELDPD 1856
              +   S    ++P P     Q +  L     Q    +  LQSSPAREEGEVPESELDPD
Sbjct: 489  --ISQPSIQSPVVPFPTQHLPQVTSVLKSSVTQISPQDTSLQSSPAREEGEVPESELDPD 546

Query: 1855 TRRRLLILQHGQDIRGAXXXXXXXXXPLKLRAPIQVSMPLQSRV-GWFQSNGDVSPRQLK 1679
            TRRRLLILQHGQD R              +  P+QVS+P + +  GWF +  ++SPRQL 
Sbjct: 547  TRRRLLILQHGQDTRDQVSSEPK----FPMGTPLQVSVPPRVQPHGWFPAEEEMSPRQLN 602

Query: 1678 REA--KEFLLETEDVYFNKRRPQHPSFFGGGNNTNPSDRGLNGNQQCPMQTRQGDGYPRS 1505
            R    KEF L  E ++ NK RP HP F      + PSDR L  NQ+ P +    D   R 
Sbjct: 603  RPLPPKEFPLNPESMHINKHRPPHPPFLPKMETSMPSDRVLFENQRLPKEVIPRDDRMRF 662

Query: 1504 NHVISNYNSFTGEEKTMGRILFNHGDVHLESGQNMTKYAETPAGVLQKIALKCGTKVEYR 1325
            +    ++    GEE  +GR   ++  + LE G +   Y ETPAG LQ IA KCG KVE+R
Sbjct: 663  SQSQPSFRP-PGEEVPLGRSSSSNRVLDLEPG-HYDPYLETPAGALQDIAFKCGAKVEFR 720

Query: 1324 AALPDSTELQFSFEVSIVGEQIGEGTGKTRKEAQAQAAEKSLQTLANKYLSDTMP-NSSV 1148
            ++   S ELQFS EV   GE++GEGTG+TR+EAQ +AAE+SL  LA+KYLS   P +SS 
Sbjct: 721  SSFLSSPELQFSLEVLFAGEKVGEGTGRTRREAQRRAAEESLMYLADKYLSCIKPDSSST 780

Query: 1147 HGDLPKLYYAKGNGFV-NSNTFRYQTLRDGQIPVAVTSEDSRHLDKRLEGSKGSVASISA 971
             GD  +   A  NGFV N + F YQ     ++  +  SE  R LD RLE  K SV S+ A
Sbjct: 781  QGDGFRFPNASDNGFVDNMSPFGYQ----DRVSHSFASEPPRVLDPRLEVFKKSVGSVGA 836

Query: 970  LKEV 959
            L+E+
Sbjct: 837  LREL 840


>ref|XP_002452510.1| hypothetical protein SORBIDRAFT_04g027200 [Sorghum bicolor]
            gi|241932341|gb|EES05486.1| hypothetical protein
            SORBIDRAFT_04g027200 [Sorghum bicolor]
          Length = 934

 Score =  278 bits (712), Expect = 5e-72
 Identities = 181/444 (40%), Positives = 251/444 (56%), Gaps = 4/444 (0%)
 Frame = -1

Query: 2212 DGTSALNGSKGPLPFEGITDVEAEKRLKGLNFPIRAVPRRMVNNFEPRSTPSLQHAMSST 2033
            +  + +NG++  LPF+G+ D E E+R+K  N        +   NF     P+     SS 
Sbjct: 437  ENAALVNGNRDSLPFDGMADAEVERRMKEAN---AQAFHQTAGNFVMPVAPAQNFVSSS- 492

Query: 2032 SGVVPMSASQMIMPLPINQSSQSITLGKPFGQTGFPEPRLQSSPAREEGEVPESELDPDT 1853
              V P++    +MP          T  +P  Q GF +  LQ SPAREEGEVPESELDPDT
Sbjct: 493  --VAPLAPPLGVMPP---------TFSQPVVQPGFSDS-LQGSPAREEGEVPESELDPDT 540

Query: 1852 RRRLLILQHGQDIRGAXXXXXXXXXPLKLRAPIQVSMP-LQSRVGWFQSNGDVSPRQLKR 1676
            RRRLLILQHGQDIR            L    P+QV +P +Q    WF +   ++P  L R
Sbjct: 541  RRRLLILQHGQDIRDPTPP-------LPAIPPVQVPVPPVQPHGNWFPTEDGLNPSNLNR 593

Query: 1675 EAKEFLLETEDVYFNKRRPQHPSFFGGGNNTNPSDRGLNGNQQCPMQT-RQGDGYPRSNH 1499
             +  F +E++ + + K++P HPSFF GG++   SDR    NQ+ P Q     D +   NH
Sbjct: 594  GSAGFTVESDPMLYEKKQPPHPSFFHGGDSPMSSDRFGYQNQRFPSQLPHTEDHHMLQNH 653

Query: 1498 VISNYNSFTGEEKTMGRILFNHGDVHLESGQNMTKYAETPAGVLQKIALKCGTKVEYRAA 1319
                Y SF+GEE     +  +  +  +ESG++  +YA T AG+L  IALKCG+KVEYR+ 
Sbjct: 654  APPKYRSFSGEELAARHVPSSQRNNQIESGRHFAQYAGTSAGILDGIALKCGSKVEYRST 713

Query: 1318 LPDSTELQFSFEVSIVGEQIGEGTGKTRKEAQAQAAEKSLQTLANKYLSDTMPNSSVHGD 1139
            L D+ ELQFS EV IVGE++GEG G+TR+EAQ +AAE SL+ LANKYLS          D
Sbjct: 714  LCDTAELQFSIEVWIVGEKVGEGIGRTRREAQHKAAEMSLRNLANKYLS---------SD 764

Query: 1138 LPKLYYAKGNGFV-NSNTFRYQ-TLRDGQIPVAVTSEDSRHLDKRLEGSKGSVASISALK 965
              KL   K NGF  N N F Y    RD  +P++ TSE+SR + K    S+ +  S++ALK
Sbjct: 765  PNKLTDMKENGFSGNRNVFGYSGNTRDDMLPLSSTSEESRFM-KMENNSRKTGGSVAALK 823

Query: 964  EVVS***FRIIYRMDLEQTADNLL 893
            E+ +   + ++++ +    AD L+
Sbjct: 824  ELCTVEGYNLVFQ-ERPSPADGLV 846


>gb|ESW31299.1| hypothetical protein PHAVU_002G226900g [Phaseolus vulgaris]
          Length = 964

 Score =  278 bits (711), Expect = 7e-72
 Identities = 187/441 (42%), Positives = 250/441 (56%), Gaps = 23/441 (5%)
 Frame = -1

Query: 2212 DGTSAL-NGSKGPLPFEGITDVEAEKRLKGLNFPIRAVPRRMVN---------------- 2084
            DG+SA+ NG++ P  F+ + D E E++ K        VP R  N                
Sbjct: 426  DGSSAISNGNRDPFLFDSMGDAEVERKSK--------VPTRAPNEHDALSAASTIPVTTA 477

Query: 2083 NFEPRSTPSLQHAMSSTSGVVPMSASQMIMPLPINQSSQSITLGKPFGQTGFPEPRLQSS 1904
            N +PR T SLQ+AM S+    P +A   +MP    Q  Q   L KP GQ    E  L SS
Sbjct: 478  NLDPRLT-SLQYAMVSSGSAPPPTAQASMMPFTHVQFPQPAALVKPMGQAAPSESSLHSS 536

Query: 1903 PAREEGEVPESELDPDTRRRLLILQHGQDIRGAXXXXXXXXXPLKLRAPIQVSMP-LQSR 1727
            PAREEGEVPESELDPDTRRRLLILQHGQD R              +R P+ VS P + SR
Sbjct: 537  PAREEGEVPESELDPDTRRRLLILQHGQDTR----DHTSNEPTYAIRHPVPVSAPRVSSR 592

Query: 1726 VGWFQSNGDVSPRQLKREA-KEFLLETEDVYFNKRRPQHPSFFGGGNNTNPSDRGL-NGN 1553
             GWF +  D+  + L R   KEF +++  +   K RP HPSFF    ++  SDR L + +
Sbjct: 593  GGWFPAEEDIGSQPLNRVVPKEFSVDSGSLVIEKHRPHHPSFFSKVESSISSDRILHDSH 652

Query: 1552 QQCPMQTRQGDGYPRSNHVISNYNSFTGEEKTMGRILFNHGDVHLESGQNMTKYAETPAG 1373
            Q+ P +    D  PRSNH++S+Y S + +E    R   +H D+  ES  ++  +A+TP  
Sbjct: 653  QRLPKEMYHRDDRPRSNHMLSSYRSLSVDEIPFSRSSSSHRDLDSESSHSVF-HADTPVV 711

Query: 1372 VLQKIALKCGTKVEYRAALPDSTELQFSFEVSIVGEQIGEGTGKTRKEAQAQAAEKSLQT 1193
            VLQ+IALKCGTKVE+ ++L  STELQFS E    G++IG G G+TRKEAQ +AAE S++ 
Sbjct: 712  VLQEIALKCGTKVEFMSSLVASTELQFSIEAWFSGKKIGHGFGRTRKEAQHKAAEDSIKH 771

Query: 1192 LANKYLSDTMPN-SSVHGDLPKLYYAKGNGF-VNSNTFRYQTL-RDGQIPVAVTSEDSRH 1022
            LA+ YLS       S +GD+     A  NG+ V +++   Q L ++     +  S+ SR 
Sbjct: 772  LADIYLSSAKDEPGSTYGDVGGFPNANDNGYMVIASSLSNQPLPKEDSASFSTASDPSRV 831

Query: 1021 LDKRLEGSKGSVASISALKEV 959
            LD RLE SK  + SISALKE+
Sbjct: 832  LDPRLEVSKRPMGSISALKEL 852


>gb|EEC73671.1| hypothetical protein OsI_08218 [Oryza sativa Indica Group]
          Length = 937

 Score =  277 bits (709), Expect = 1e-71
 Identities = 177/422 (41%), Positives = 242/422 (57%), Gaps = 4/422 (0%)
 Frame = -1

Query: 2212 DGTSALNGSKGPLPFEGITDVEAEKRLKGLNFPIRAVPRRMVNNFEPRSTPSLQHAMSST 2033
            +  +A+NG++ PL F+G+ D E E+R+K  +   +A      N       P L      +
Sbjct: 431  ENVAAVNGNRDPLAFDGMADAEVERRMKEASGNAQAFTTTAANFV----MPVLPGQNFVS 486

Query: 2032 SGVVPMSASQMIMPLPINQSSQSITLGKPFGQTGFPEPRLQSSPAREEGEVPESELDPDT 1853
            S V P++ S  ++PL  NQ        +P  Q    +P LQ SPAREEGEVPESELDPDT
Sbjct: 487  SSVAPVAPSLGMVPLSNNQGPPP-PFTQPVAQLSLSDP-LQGSPAREEGEVPESELDPDT 544

Query: 1852 RRRLLILQHGQDIRGAXXXXXXXXXPLKLRAPIQVSMP-LQSRVGWFQSNGDVSPRQLKR 1676
            RRRLLILQHGQD R            L    P+QV +P +Q    WF     ++P  L R
Sbjct: 545  RRRLLILQHGQDTRDPTPP-------LPAVPPVQVPVPPVQPHGNWFPVEDGMNPNNLNR 597

Query: 1675 EAKEFLLETEDVYFNKRRPQHPSFFGGGNNTNPSDRGLNGNQQCPMQTRQGDGYP-RSNH 1499
             +  F LE+E ++++K++P HP FF GG N   SDR    NQ+ P Q    + +    NH
Sbjct: 598  GSAGFPLESETMHYDKKQPPHP-FFHGGENPISSDRFSYQNQRYPSQLPHSEDHRVLQNH 656

Query: 1498 VISNYNSFTGEEKTMGRILFNHGDVHLESGQNMTKYAETPAGVLQKIALKCGTKVEYRAA 1319
              S Y SF GEE     +  +  +  +  GQ+  ++A + AG+L++IA+KCG+KVEYR+A
Sbjct: 657  APSRYRSFPGEELATRHVSSSQRNNQIVPGQHFARHAGSSAGILEEIAMKCGSKVEYRSA 716

Query: 1318 LPDSTELQFSFEVSIVGEQIGEGTGKTRKEAQAQAAEKSLQTLANKYLSDTMPNSSVHGD 1139
            L D+ +LQFS EV IVGE++GEG G+TRKEAQ QAAE SL+ LANKYLS          D
Sbjct: 717  LCDTADLQFSIEVWIVGEKVGEGIGRTRKEAQCQAAEISLRNLANKYLS---------SD 767

Query: 1138 LPKLYYAKGNGF-VNSNTFRYQ-TLRDGQIPVAVTSEDSRHLDKRLEGSKGSVASISALK 965
              K+   K NGF  N+N F Y    RD  +P+A TSE++R +      S+ +  SI+ALK
Sbjct: 768  PNKMTGMKENGFGSNTNIFGYPGNSRDDVLPIASTSEETRFVKMGENNSRKAGGSIAALK 827

Query: 964  EV 959
            E+
Sbjct: 828  EL 829


>emb|CAN72816.1| hypothetical protein VITISV_004100 [Vitis vinifera]
          Length = 894

 Score =  277 bits (709), Expect = 1e-71
 Identities = 188/425 (44%), Positives = 249/425 (58%), Gaps = 7/425 (1%)
 Frame = -1

Query: 2212 DGTSALNGSKGPLPFEGITDVEAEKRLK-GLNFPIRAVPRRMVNNFEPRSTPSLQHAMSS 2036
            D  S  NG++    F+G+ DVE E++LK  ++ P        V + +PR +P LQ A+++
Sbjct: 408  DDASVSNGNRDQPCFDGMADVEVERKLKDAISAP------STVTSLDPRLSPPLQFAVAA 461

Query: 2035 TSGVVPMSASQ-MIMPLPINQSSQSITLGKPFGQTGFPEPRLQSSPAREEGEVPESELDP 1859
            +SG+ P  A+Q  IMP    Q  QS +L KP      PEP +QSSPAREEGEVPESELDP
Sbjct: 462  SSGLAPQPAAQGSIMPFSNKQFPQSASLIKPLA----PEPTMQSSPAREEGEVPESELDP 517

Query: 1858 DTRRRLLILQHGQDIRGAXXXXXXXXXPLKLRAPIQVSMP-LQSRVGWFQSNGDVSPRQL 1682
            DTRRRLLILQHGQD R           P  +R PIQVS+P +QSR  WF ++ ++SPRQL
Sbjct: 518  DTRRRLLILQHGQDTR----EHASSDPPFPVRPPIQVSVPRVQSRGSWFPADEEMSPRQL 573

Query: 1681 KREA-KEFLLETEDVYFNKRRPQHPSFFGGGNNTNPSDRGLNGNQQCPMQTRQGDGYPRS 1505
             R   KEF L+++ ++  K RP HPSFF    ++  SDR L+ NQ+   +    D   R 
Sbjct: 574  NRAVPKEFPLDSDTMHIEKHRPHHPSFFHKVESSASSDRILHENQRLSKEVLHRDDRLRL 633

Query: 1504 NHVISNYNSFTGEEKTMGRILFNHGDVHLESGQNMTKYAETPA-GVLQKIALKCGTKVEY 1328
            NH +  Y+SF+GEE  +GR   N  D+  ESG+    YAETPA G+L+     C      
Sbjct: 634  NHSLPGYHSFSGEEVPLGRSSSNR-DLDFESGRG-APYAETPAVGLLR----NCN----- 682

Query: 1327 RAALPDSTELQFSFEVSIVGEQIGEGTGKTRKEAQAQAAEKSLQTLANKYLSDTMPNSSV 1148
                          EV   GE+IGEGTGKTR+EAQ QAAE SL  L+ +YL         
Sbjct: 683  --------------EVWNQGEKIGEGTGKTRREAQCQAAEASLMYLSYRYL--------- 719

Query: 1147 HGDLPKLYYAKGNGFV-NSNTFRYQTL-RDGQIPVAVTSEDSRHLDKRLEGSKGSVASIS 974
            HGD+ +   A  N F+ ++N+F YQ+  ++G +  +  SE SR LD RLE SK S+ SIS
Sbjct: 720  HGDVNRFPNASDNNFMSDTNSFGYQSFPKEGSMSFSTASESSRLLDPRLESSKKSMGSIS 779

Query: 973  ALKEV 959
            ALKE+
Sbjct: 780  ALKEL 784

