BLASTX nr result

ID: Glycyrrhiza23_contig00010005 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00010005
         (1229 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003518215.1| PREDICTED: RNA polymerase II C-terminal doma...   529   e-148
ref|XP_003550691.1| PREDICTED: RNA polymerase II C-terminal doma...   523   e-146
ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal doma...   428   e-117
ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative ...   422   e-116
dbj|BAB08870.1| unnamed protein product [Arabidopsis thaliana]        365   1e-98

>ref|XP_003518215.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Glycine max]
          Length = 428

 Score =  529 bits (1362), Expect = e-148
 Identities = 282/368 (76%), Positives = 303/368 (82%), Gaps = 10/368 (2%)
 Frame = -2

Query: 1081 MSVVTDSPVHSSSSDDFVAFLDSALDASSPGSLSDDKDVEKQD--ELESPRIKRRKFENI 908
            MSVVTDSPVHSSSSDDF+AFLD+ LDASSP S S DK+VEKQD  ELES  IKRRKFE+I
Sbjct: 1    MSVVTDSPVHSSSSDDFIAFLDAELDASSPDS-SPDKEVEKQDDDELESG-IKRRKFESI 58

Query: 907  XXXXXXXXXXXXEQKLVAAESSVKVDVCTHPGSFGDMCIRCGQKLDGESGVTFGYIHKGL 728
                               E S    VCTHPGSFG+MCIRCGQKLDGESGVTFGYIHKGL
Sbjct: 59   EE----------------TEGSTSEGVCTHPGSFGNMCIRCGQKLDGESGVTFGYIHKGL 102

Query: 727  RLHDEEISRLRNTDMKNLLCHKKXXXXXXXXXXXLNSTHLAHLSSEELYLLTQTDSLG-- 554
            RLHDEEISRLRNTDMK+LLC KK           LNSTHLAHL+SEE +LL QTDSL   
Sbjct: 103  RLHDEEISRLRNTDMKSLLCRKKLYLVLDLDHTLLNSTHLAHLTSEESHLLNQTDSLRDV 162

Query: 553  --GSIFKLEHMHMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEMAKLLDPQGEYFN 380
              GS+FKLEHM+MMTKLRPFVR FLKEASEMFEMYIYTMGDRPYALEMAKLLDPQGEYFN
Sbjct: 163  SKGSLFKLEHMNMMTKLRPFVRPFLKEASEMFEMYIYTMGDRPYALEMAKLLDPQGEYFN 222

Query: 379  AKVISRDDGTQKHQKGLDVVLGQESAVLILDDTEHAWMKHKENLILMERYHFFASSCQQF 200
            AKVISRDDGTQKHQKGLDVVLGQESAVLILDDTEHAWMKHK+NLILMERYHFF SSC+QF
Sbjct: 223  AKVISRDDGTQKHQKGLDVVLGQESAVLILDDTEHAWMKHKDNLILMERYHFFGSSCRQF 282

Query: 199  GFNCKSLAELKNDENETDGALAKILKVLKQVHCTFFDKLQEDLVDRDVRQVVML----LI 32
            GFNCKSLAELK+DENETDGALAKILKVLKQVHC FFDK QED  DRDVRQ++ L    ++
Sbjct: 283  GFNCKSLAELKSDENETDGALAKILKVLKQVHCMFFDK-QEDFDDRDVRQMLSLVRREVL 341

Query: 31   FACLLVLA 8
              C+++ +
Sbjct: 342  SGCVIIFS 349


>ref|XP_003550691.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Glycine max]
          Length = 442

 Score =  523 bits (1348), Expect = e-146
 Identities = 274/351 (78%), Positives = 298/351 (84%), Gaps = 5/351 (1%)
 Frame = -2

Query: 1081 MSVVTDSPVHSSSSDDFVAFLDSALDASSPGSLSDDKDVEKQDELESPRIKRRKFENIXX 902
            MSVVTDSPVHSSSSDDF+AFLD+ LDASSP S  D + V++ DEL+S R KRRKFE+I  
Sbjct: 1    MSVVTDSPVHSSSSDDFIAFLDAELDASSPDSSPDKEVVKQDDELQSVRTKRRKFESIEE 60

Query: 901  XXXXXXXXXXEQKLVAAESSVKVDVC-THPGSFGDMCIRCGQKLDGESGVTFGYIHKGLR 725
                      ++ L   E+S +VDVC THPGSFG+MCIRCGQKLDGESGVTFGYIHKGLR
Sbjct: 61   TEGSTSEGIVKRSL---EASSEVDVCCTHPGSFGNMCIRCGQKLDGESGVTFGYIHKGLR 117

Query: 724  LHDEEISRLRNTDMKNLLCHKKXXXXXXXXXXXLNSTHLAHLSSEELYLLTQTDSLG--- 554
            LHDEEISRLRNTDMK+LL  KK           LNSTHLA L+SEEL+LL QTDSL    
Sbjct: 118  LHDEEISRLRNTDMKSLLGRKKLYLVLDLDHTLLNSTHLAQLTSEELHLLNQTDSLTNVS 177

Query: 553  -GSIFKLEHMHMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEMAKLLDPQGEYFNA 377
             GS+FKLEHM+MMTKLRPFVR FLKEASEMFEMYIYTMGDRPYALEMAKLLDPQGEYFNA
Sbjct: 178  KGSLFKLEHMNMMTKLRPFVRPFLKEASEMFEMYIYTMGDRPYALEMAKLLDPQGEYFNA 237

Query: 376  KVISRDDGTQKHQKGLDVVLGQESAVLILDDTEHAWMKHKENLILMERYHFFASSCQQFG 197
            KVISRDDGTQKHQKGLDVVLGQESAV+ILDDTEHAWMKHK+NLILMERYHFF SSC+QFG
Sbjct: 238  KVISRDDGTQKHQKGLDVVLGQESAVIILDDTEHAWMKHKDNLILMERYHFFGSSCRQFG 297

Query: 196  FNCKSLAELKNDENETDGALAKILKVLKQVHCTFFDKLQEDLVDRDVRQVV 44
            FNCKSLAELK+DE+ETDGALAKILKVLKQVHC FFDK QED  D+DVRQV+
Sbjct: 298  FNCKSLAELKSDEDETDGALAKILKVLKQVHCMFFDK-QEDFDDQDVRQVL 347


>ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Cucumis sativus]
          Length = 452

 Score =  428 bits (1100), Expect = e-117
 Identities = 229/350 (65%), Positives = 267/350 (76%), Gaps = 4/350 (1%)
 Frame = -2

Query: 1081 MSVVTDSPVHSSSSDDFVAFLDSALDASSPGSLSDDKDVEKQDELESPRIKRRKFENIXX 902
            MS+ T+SP HSSSSDDF AFL   LD+ S  S S D++ E  +  ES RIKRRK E +  
Sbjct: 1    MSLATNSPAHSSSSDDFAAFLAVDLDSHSSDS-SPDEETEGDNNAESVRIKRRKVEKLEN 59

Query: 901  XXXXXXXXXXEQKLVAAESSVKVDVCTHPGSFGDMCIRCGQKLDGESGVTFGYIHKGLRL 722
                      EQ L   E   K  +C+HPGSFG+MCI CGQ+LD ESGVTFGYIHK LRL
Sbjct: 60   SEEDIMHEVEEQSL---EVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHKELRL 116

Query: 721  HDEEISRLRNTDMKNLLCHKKXXXXXXXXXXXLNSTHLAHLSSEELYLLTQTDSLG---- 554
            +++EI+R+RN +MK LL  KK           LNST L +L+ EE YL +QTDSL     
Sbjct: 117  NNDEINRMRNKEMKELLQRKKLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLDDVTK 176

Query: 553  GSIFKLEHMHMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEMAKLLDPQGEYFNAK 374
            GS+F L  +H MTKLRPFV +FLKEAS++FEMYIYTMG+R YA EMAKLLDP+ EYF++K
Sbjct: 177  GSLFLLNSVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSK 236

Query: 373  VISRDDGTQKHQKGLDVVLGQESAVLILDDTEHAWMKHKENLILMERYHFFASSCQQFGF 194
            VISRDDGTQKHQKGLDVVLG+ESAVLILDDTE+AW KHKENLILMERYHFFASSC+QFGF
Sbjct: 237  VISRDDGTQKHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGF 296

Query: 193  NCKSLAELKNDENETDGALAKILKVLKQVHCTFFDKLQEDLVDRDVRQVV 44
            NCKSL+ELKNDE+ETDGAL  ILKVLKQVH  FF+++  DLVDRDVRQV+
Sbjct: 297  NCKSLSELKNDESETDGALTTILKVLKQVHHMFFNEVSGDLVDRDVRQVL 346


>ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
            gi|223534449|gb|EEF36151.1| RNA polymerase II ctd
            phosphatase, putative [Ricinus communis]
          Length = 478

 Score =  422 bits (1086), Expect = e-116
 Identities = 229/378 (60%), Positives = 274/378 (72%), Gaps = 32/378 (8%)
 Frame = -2

Query: 1081 MSVVTDSPVHSS---SSDDFVAFLDSALDASSPGSLSDDKDV-------------EKQDE 950
            MS+VTDSP+HSS   SSDDF A LD+ LD+ S  S S  K +             E+++E
Sbjct: 1    MSLVTDSPLHSSHSSSSDDFAALLDAELDSKSSSSDSSPKAIKHDDASDANDDVNEEEEE 60

Query: 949  LESP----------RIKRRKFENIXXXXXXXXXXXXE--QKLVAAESSVKVDVCTHPGSF 806
             ES           RIKR + E +               Q LVA+ S V    CTHPGSF
Sbjct: 61   EESDSDDDSDIATNRIKRSRVETLENGENPKESTRVSLDQTLVASSSKV---ACTHPGSF 117

Query: 805  GDMCIRCGQKLDGESGVTFGYIHKGLRLHDEEISRLRNTDMKNLLCHKKXXXXXXXXXXX 626
            GDMCI CG++L  E+GVTFGYIHKGLRL ++EI RLRNTDMKNLL H+K           
Sbjct: 118  GDMCILCGERLIEETGVTFGYIHKGLRLANDEIVRLRNTDMKNLLRHRKLYLVLDLDHTL 177

Query: 625  LNSTHLAHLSSEELYLLTQTDSL----GGSIFKLEHMHMMTKLRPFVRTFLKEASEMFEM 458
            LNST L HL++EE YL +Q DS+     GS+F ++ MHMMTKLRPF+RTFLKEAS+MFEM
Sbjct: 178  LNSTQLMHLTAEEEYLKSQIDSMQDVSNGSLFMVDFMHMMTKLRPFIRTFLKEASQMFEM 237

Query: 457  YIYTMGDRPYALEMAKLLDPQGEYFNAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTE 278
            YIYTMGDR YALEMAK LDP  EYFNA+VISRDDGTQ+HQKGLD+VLGQESAVLILDDTE
Sbjct: 238  YIYTMGDRAYALEMAKFLDPGREYFNARVISRDDGTQRHQKGLDIVLGQESAVLILDDTE 297

Query: 277  HAWMKHKENLILMERYHFFASSCQQFGFNCKSLAELKNDENETDGALAKILKVLKQVHCT 98
            +AW KHK+NLILMERYHFFASSC+QFGF CKSL++LK+DENE+DGALA +LKVL+++H  
Sbjct: 298  NAWTKHKDNLILMERYHFFASSCRQFGFECKSLSQLKSDENESDGALASVLKVLRRIHHI 357

Query: 97   FFDKLQEDLVDRDVRQVV 44
            FFD+L++ +  RDVRQV+
Sbjct: 358  FFDELEDAIDGRDVRQVL 375


>dbj|BAB08870.1| unnamed protein product [Arabidopsis thaliana]
          Length = 1065

 Score =  365 bits (937), Expect = 1e-98
 Identities = 204/380 (53%), Positives = 259/380 (68%), Gaps = 12/380 (3%)
 Frame = -2

Query: 1111 FIQKHATHTQMSVVTDSPVHSSSS-DDFVAFLDSALDASSPGSLSDDKDVEKQDELESPR 935
            F+       +MSV +DSPVHSSSS DD  AFLD+ LD++S  S    ++ E +D++ES  
Sbjct: 616  FLSPPTNKFKMSVASDSPVHSSSSSDDLAAFLDAELDSASDASSGPSEEEEAEDDVESG- 674

Query: 934  IKRRKFENIXXXXXXXXXXXXEQKLVAAESSVKVDVCTHPGSFGDMCIRCGQKLDGESGV 755
            +KR+K E++                   E+S     C HPGSFG+MC  CGQKL+ E+GV
Sbjct: 675  LKRQKLEHLE------------------EASSSKGECEHPGSFGNMCFVCGQKLE-ETGV 715

Query: 754  TFGYIHKGLRLHDEEISRLRNTDMKNLLCHKKXXXXXXXXXXXLNSTHLAHLSSEELYLL 575
            +F YIHK +RL+++EISRLR++D + L   +K           LN+T L  L  EE YL 
Sbjct: 716  SFRYIHKEMRLNEDEISRLRDSDSRFLQRQRKLYLVLDLDHTLLNTTILRDLKPEEEYLK 775

Query: 574  TQTDSL-------GGSIFKLEHMHMMTKLRPFVRTFLKEASEMFEMYIYTMGDRPYALEM 416
            + T SL       GGS+F LE M MMTKLRPFV +FLKEASEMF MYIYTMGDR YA +M
Sbjct: 776  SHTHSLQDGCNVSGGSLFLLEFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQM 835

Query: 415  AKLLDPQGEYFNAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTEHAWMKHKENLILME 236
            AKLLDP+GEYF  +VISRDDGT +H+K LDVVLGQESAVLILDDTE+AW KHK+NLI++E
Sbjct: 836  AKLLDPKGEYFGDRVISRDDGTVRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIE 895

Query: 235  RYHFFASSCQQFGFNCKSLAELKNDENETDGALAKILKVLKQVHCTFFDKLQEDLVDRDV 56
            RYHFF+SSC+QF    KSL+ELK+DE+E DGALA +LKVLKQ H  FF+ + E + +RDV
Sbjct: 896  RYHFFSSSCRQFDHRYKSLSELKSDESEPDGALATVLKVLKQAHALFFENVDEGISNRDV 955

Query: 55   R----QVVMLLIFACLLVLA 8
            R    QV   ++  C +V +
Sbjct: 956  RLMLKQVRKEILKGCKIVFS 975


Top