BLASTX nr result

ID: Bupleurum21_contig00009632 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00009632
         (1791 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002275875.1| PREDICTED: transcription factor tau subunit ...   586   e-165
dbj|BAF00928.1| hypothetical protein [Arabidopsis thaliana]           491   e-136
ref|NP_190510.3| Transcription factor IIIC, subunit 5 [Arabidops...   490   e-136
ref|XP_003537671.1| PREDICTED: general transcription factor 3C p...   489   e-135
ref|XP_002875963.1| hypothetical protein ARALYDRAFT_485301 [Arab...   481   e-133

>ref|XP_002275875.1| PREDICTED: transcription factor tau subunit sfc1-like [Vitis
            vinifera]
          Length = 568

 Score =  586 bits (1510), Expect = e-165
 Identities = 296/482 (61%), Positives = 358/482 (74%), Gaps = 3/482 (0%)
 Frame = +2

Query: 11   MGVIKDGSISGVLPSTKLFAVHYPGYPSSIIRAIETLGGSDGIVKARGSQLNRLELHFRP 190
            MGVI++GSISG +PS + F+VHYP YPSS  RAIETLGG+  I KAR SQ N+LELHFRP
Sbjct: 1    MGVIEEGSISGYIPSNEAFSVHYPAYPSSTARAIETLGGTQAIRKARSSQSNKLELHFRP 60

Query: 191  EDPYSHPAFGEHYPCYNFLLKISKKGEGEHPTIVDAEITSICLAKDSNQDLKISFPESVE 370
            EDPYSHPAFGE  PC N LL+ISKK                               +S +
Sbjct: 61   EDPYSHPAFGELQPCNNLLLRISKK-------------------------------KSTD 89

Query: 371  TEYVSQVGSESISAPATVKAQASSKVQGNLCADIVAQVSESYHFNGMADYQHVLAVHADV 550
             +      SES++    V+AQ S +V   LCADI+A+VSE+YHFNGM DYQHVL VHADV
Sbjct: 90   GQ------SESVATGEEVEAQISGEVPIRLCADIIARVSEAYHFNGMVDYQHVLPVHADV 143

Query: 551  SRRKKRKWADVDPELEKRGLLDVDQEDLLILVPPLFSPKDMPEKVVLKPSLEVGLKHKQV 730
            +RRKKR WA+V+P LEK  L+DVDQEDL+IL+PPLFSPKD+PEK+VL+PS+ + LK KQ 
Sbjct: 144  ARRKKRNWAEVEPHLEKGDLVDVDQEDLMILLPPLFSPKDVPEKLVLRPSMTLNLKKKQE 203

Query: 731  RVTQHNREMNIEPGLALDFN-NDVPGKVNWERHLAKSSEQWQWQMAVCNLFDERPIWKKD 907
             V Q   EM IEP LA+DF   ++P KVNWE+++ K SEQW+WQMAV NLFDERPIW K 
Sbjct: 204  GVVQQRWEMGIEPCLAIDFEIKEIPKKVNWEQYIPKGSEQWEWQMAVSNLFDERPIWPKG 263

Query: 908  SLTEQLHNKGLMIGNNMLRRLLYRAAYYFSNGPYLRFWIRRGYDPRKDVESRIYQRIDFR 1087
            +LTE+L +KGL +G+  LRRLL+R AYYFSNGP+LRFWIR+GYDPRK+ +S IYQRIDFR
Sbjct: 264  ALTERLLDKGLNVGDYTLRRLLFRTAYYFSNGPFLRFWIRKGYDPRKNPDSCIYQRIDFR 323

Query: 1088 VPPSLRSYCDANI--GSKHRWSDVCSFRVFPYRCQTSLQLFELDDDYIQKEIRNASPQST 1261
            VPPSLRSYCDAN   G K RW D+CSFRVFPY+C TSLQLFEL DDYIQ+EIR    Q+T
Sbjct: 324  VPPSLRSYCDANAANGLKQRWEDICSFRVFPYKCHTSLQLFELADDYIQQEIRKPLKQTT 383

Query: 1262 CSLATGWFSTRVLDTLRFRVAVRFLEIYPKDGAESLLKNASIRFEKSKKLQFFVKDQRPD 1441
            C+ ATGWFS RVL++LR  V VRFL I P+  AE LLK+AS RFEKSK++  +  + RP+
Sbjct: 384  CTGATGWFSYRVLESLRLCVMVRFLSICPETSAEYLLKSASDRFEKSKRMHIYENNLRPN 443

Query: 1442 DK 1447
            ++
Sbjct: 444  EE 445


>dbj|BAF00928.1| hypothetical protein [Arabidopsis thaliana]
          Length = 574

 Score =  491 bits (1263), Expect = e-136
 Identities = 245/479 (51%), Positives = 328/479 (68%), Gaps = 3/479 (0%)
 Frame = +2

Query: 11   MGVIKDGSISGVLPSTKLFAVHYPGYPSSIIRAIETLGGSDGIVKARGSQLNRLELHFRP 190
            MG+I++G+ISG LPS + F VH+PGYPSSI RAIETLGG  GI +AR S  N+LEL FRP
Sbjct: 1    MGIIEEGTISGTLPSKEAFVVHFPGYPSSISRAIETLGGIQGITQARESISNKLELRFRP 60

Query: 191  EDPYSHPAFGEHYPCYNFLLKISKKGEGEHPTIVDAEITSICLAKDSNQDLKISFPESVE 370
            EDPY+HPA GE  PC  FLL+ISK                        QD+K    +SV 
Sbjct: 61   EDPYAHPALGEQRPCSGFLLRISK------------------------QDIKKPESQSV- 95

Query: 371  TEYVSQVGSESISAPATVKAQASSKVQGNLCADIVAQVSESYHFNGMADYQHVLAVHADV 550
                       +     V  + +S V   LCADIVA++SES+HF+GMADYQHV+ +HAD+
Sbjct: 96   -----------LDTSRDVCLEEASPV---LCADIVARLSESFHFDGMADYQHVIPIHADI 141

Query: 551  SRRKKRKWADVDPELEKRGLLDVDQEDLLILVPPLFSPKDMPEKVVLKPSLEVGLKHKQV 730
            +++KKRKW DVDP   K  L+ +  ED+++L+P  F+PKD+P+ V LKP    G K K  
Sbjct: 142  AQQKKRKWMDVDPLTGKSDLMGLADEDVMMLLPQFFAPKDIPDNVALKPPATSGPKKKDD 201

Query: 731  RVTQHNREMNIEPGLALDFN-NDVPGKVNWERHLAKSSEQWQWQMAVCNLFDERPIWKKD 907
              TQ+  E+++ P  A+DF+  ++P K+ WE  +++SS  WQWQ+AV  LF+ERPIW +D
Sbjct: 202  VATQNFYEIDVGPVFAIDFSVKEIPKKLKWEDFVSRSSNHWQWQVAVSALFEERPIWTRD 261

Query: 908  SLTEQLHNKGLMIGNNMLRRLLYRAAYYFSNGPYLRFWIRRGYDPRKDVESRIYQRIDFR 1087
            S+ ++L +KGL   ++ML R L RAAYYFS+GP+LRFWI+RGYDPR D ESR+YQR++FR
Sbjct: 262  SVVQRLLDKGLKCTHHMLNRFLLRAAYYFSSGPFLRFWIKRGYDPRNDPESRVYQRMEFR 321

Query: 1088 VPPSLRSYCDANI--GSKHRWSDVCSFRVFPYRCQTSLQLFELDDDYIQKEIRNASPQST 1261
            VPP LR YCDAN    SK  W+D+C+F++FP++CQT LQLFELDD+YIQ+EIR    Q+T
Sbjct: 322  VPPELRGYCDANATNNSKPSWNDICAFKLFPFKCQTFLQLFELDDEYIQREIRKPPKQTT 381

Query: 1262 CSLATGWFSTRVLDTLRFRVAVRFLEIYPKDGAESLLKNASIRFEKSKKLQFFVKDQRP 1438
            CS  +GWFS  +LDTLR RVAVRF+ ++P+ G E + K+    FE+SKK+Q   +  +P
Sbjct: 382  CSHKSGWFSEAMLDTLRLRVAVRFVSVFPETGFEDVFKSIQEEFERSKKVQIQKETLKP 440


>ref|NP_190510.3| Transcription factor IIIC, subunit 5 [Arabidopsis thaliana]
            gi|332645018|gb|AEE78539.1| Transcription factor IIIC,
            subunit 5 [Arabidopsis thaliana]
          Length = 574

 Score =  490 bits (1261), Expect = e-136
 Identities = 244/479 (50%), Positives = 328/479 (68%), Gaps = 3/479 (0%)
 Frame = +2

Query: 11   MGVIKDGSISGVLPSTKLFAVHYPGYPSSIIRAIETLGGSDGIVKARGSQLNRLELHFRP 190
            MG+I++G+ISG LPS + F VH+PGYPSSI RAIETLGG  GI +AR S  N+LEL FRP
Sbjct: 1    MGIIEEGTISGTLPSKEAFVVHFPGYPSSISRAIETLGGIQGITQARESISNKLELRFRP 60

Query: 191  EDPYSHPAFGEHYPCYNFLLKISKKGEGEHPTIVDAEITSICLAKDSNQDLKISFPESVE 370
            EDPY+HPA GE  PC  FLL+ISK                        QD+K    +SV 
Sbjct: 61   EDPYAHPALGEQRPCSGFLLRISK------------------------QDIKKPESQSV- 95

Query: 371  TEYVSQVGSESISAPATVKAQASSKVQGNLCADIVAQVSESYHFNGMADYQHVLAVHADV 550
                       +     V  + +S V   LCADIVA++SES+HF+GMADYQHV+ +HAD+
Sbjct: 96   -----------LDTSRDVCLEEASPV---LCADIVARLSESFHFDGMADYQHVIPIHADI 141

Query: 551  SRRKKRKWADVDPELEKRGLLDVDQEDLLILVPPLFSPKDMPEKVVLKPSLEVGLKHKQV 730
            +++KKRKW DVDP   K  L+ +  ED+++L+P  F+PKD+P+ V LKP    G K K  
Sbjct: 142  AQQKKRKWMDVDPLTGKSDLMGLADEDVMMLLPQFFAPKDIPDNVALKPPATSGPKKKDD 201

Query: 731  RVTQHNREMNIEPGLALDFN-NDVPGKVNWERHLAKSSEQWQWQMAVCNLFDERPIWKKD 907
              TQ+  E+++ P  A+DF+  ++P K+ WE  +++SS  WQWQ+AV  LF+ERPIW +D
Sbjct: 202  AATQNFYEIDVGPVFAIDFSVKEIPKKLKWEDFVSRSSNHWQWQVAVSALFEERPIWTRD 261

Query: 908  SLTEQLHNKGLMIGNNMLRRLLYRAAYYFSNGPYLRFWIRRGYDPRKDVESRIYQRIDFR 1087
            S+ ++L +KGL   ++ML R L RAAYYFS+GP+LRFWI+RGYDPR D ESR+YQR++FR
Sbjct: 262  SVVQRLLDKGLKCTHHMLNRFLLRAAYYFSSGPFLRFWIKRGYDPRNDPESRVYQRMEFR 321

Query: 1088 VPPSLRSYCDANI--GSKHRWSDVCSFRVFPYRCQTSLQLFELDDDYIQKEIRNASPQST 1261
            VPP LR YCDAN    SK  W+D+C+F++FP++CQT LQLFELDD+YIQ+EIR    Q+T
Sbjct: 322  VPPELRGYCDANATNNSKPSWNDICAFKLFPFKCQTFLQLFELDDEYIQREIRKPPKQTT 381

Query: 1262 CSLATGWFSTRVLDTLRFRVAVRFLEIYPKDGAESLLKNASIRFEKSKKLQFFVKDQRP 1438
            CS  +GWFS  +LDTLR RVAVRF+ ++P+ G E + K+    FE+S+K+Q   +  +P
Sbjct: 382  CSHKSGWFSEAMLDTLRLRVAVRFVSVFPETGFEDVFKSIQEEFERSEKVQIQKETLKP 440


>ref|XP_003537671.1| PREDICTED: general transcription factor 3C polypeptide 5-like
            [Glycine max]
          Length = 547

 Score =  489 bits (1259), Expect = e-135
 Identities = 248/470 (52%), Positives = 325/470 (69%), Gaps = 4/470 (0%)
 Frame = +2

Query: 11   MGVIKDGSISGVLPSTKLFAVHYPGYPSSIIRAIETLGGSDGIVKARGSQLNRLELHFRP 190
            MGVIKDG+ISGVLP  + F VHYP YPSSI RA++TLGG   I KAR S+ N+LEL FRP
Sbjct: 1    MGVIKDGTISGVLPEPQGFMVHYPAYPSSISRAVDTLGGIQAIQKARCSKSNKLELRFRP 60

Query: 191  EDPYSHPAFGEHYPCYNFLLKISKKGEGEHPTIVDAEITSICLAKDSNQDLKISFPESVE 370
            EDPYSHPAFGE  P  + LLKISK      P + DAE +S     + +Q+          
Sbjct: 61   EDPYSHPAFGELRPTNSLLLKISKTKPP--PPVHDAEASSSSTNGEQDQE---------- 108

Query: 371  TEYVSQVGSESISAPATVKAQASSKVQGNLCADIVAQVSESYHFNGMADYQHVLAVHADV 550
                                       G+LCADIVA+  E+Y F GMADYQHV+ VHADV
Sbjct: 109  ---------------------------GSLCADIVARFPEAYFFYGMADYQHVIPVHADV 141

Query: 551  SRRKKRKWADVDP-ELEKRGLLDVDQEDLLILVPPLFSPKDMPEKVVLKPSLEVGLKHKQ 727
            +RRKKR W++++    +K G +D+D ED++I+VPP+F+PKD+PE +VL+P+     K K 
Sbjct: 142  ARRKKRNWSELEELHFDKGGFMDLDHEDVMIIVPPIFAPKDVPENLVLRPATMSSSKKKP 201

Query: 728  VRVTQHNREMNIEPGLALDFN-NDVPGKVNWERHLAKSSEQWQWQMAVCNLFDERPIWKK 904
              V Q + EM++EP LA+DF+  ++P KVNWE ++ + S+QW+ QM V  +FDERPIW K
Sbjct: 202  EEVVQPHFEMDMEPVLAIDFDIKEIPKKVNWEEYIPQGSDQWELQMVVSRMFDERPIWSK 261

Query: 905  DSLTEQLHNKGLMIGNNMLRRLLYRAAYYFSNGPYLRFWIRRGYDPRKDVESRIYQRIDF 1084
            +SLTE L +KGL   ++MLRRLL R +YYFS+GP+LRFWI++GYDPRKD  SRIYQRID+
Sbjct: 262  NSLTELLLDKGLSFSHSMLRRLLSRISYYFSSGPFLRFWIKKGYDPRKDPNSRIYQRIDY 321

Query: 1085 RVPPSLRSYCDANIG--SKHRWSDVCSFRVFPYRCQTSLQLFELDDDYIQKEIRNASPQS 1258
            RVP  LRSYCDA+    SKHRW D+C+FRVFPY+ QTSLQ F+L DDYIQ EI     + 
Sbjct: 322  RVPVPLRSYCDAHSANKSKHRWKDICAFRVFPYKFQTSLQFFDLVDDYIQSEINKPPFRP 381

Query: 1259 TCSLATGWFSTRVLDTLRFRVAVRFLEIYPKDGAESLLKNASIRFEKSKK 1408
            TC+  TGWFS  +++ +R R+ VR+L ++PK GAE+LL+ A+++FEK K+
Sbjct: 382  TCTSGTGWFSQHMINCIRQRLMVRYLSVFPKPGAENLLRAATLKFEKLKR 431


>ref|XP_002875963.1| hypothetical protein ARALYDRAFT_485301 [Arabidopsis lyrata subsp.
            lyrata] gi|297321801|gb|EFH52222.1| hypothetical protein
            ARALYDRAFT_485301 [Arabidopsis lyrata subsp. lyrata]
          Length = 571

 Score =  481 bits (1238), Expect = e-133
 Identities = 241/470 (51%), Positives = 324/470 (68%), Gaps = 2/470 (0%)
 Frame = +2

Query: 11   MGVIKDGSISGVLPSTKLFAVHYPGYPSSIIRAIETLGGSDGIVKARGSQLNRLELHFRP 190
            MG+I++G ISG LPS + F VH+PGYPSSI RAIETLGG  GI +AR S  N+LEL FRP
Sbjct: 1    MGIIEEGIISGTLPSKEAFVVHFPGYPSSISRAIETLGGIQGISQARESISNKLELRFRP 60

Query: 191  EDPYSHPAFGEHYPCYNFLLKISKKGEGEHPTIVDAEITSICLAKDSNQDLKISFPESVE 370
            EDPY+HPA GE  PC  FLL+ISK                        QD+K   PES  
Sbjct: 61   EDPYAHPALGEQRPCCGFLLRISK------------------------QDIKK--PESQP 94

Query: 371  TEYVSQVGSESISAPATVKAQASSKVQGNLCADIVAQVSESYHFNGMADYQHVLAVHADV 550
                       ++  + V  + +S V   LCADI+A+VSES+HF+GMADYQHV+ +HAD+
Sbjct: 95   V----------LATSSDVCLEEASTV---LCADIIARVSESFHFDGMADYQHVIPIHADI 141

Query: 551  SRRKKRKWADVDPELEKRGLLDVDQEDLLILVPPLFSPKDMPEKVVLKPSLEVGLKHKQV 730
            +++KKRKW DVD       L+ +  ED+++L+P  F+PKD+P+ V LKP    G K K  
Sbjct: 142  AQQKKRKWMDVDSLTGNSDLMGLADEDVMMLLPQFFAPKDIPDNVALKPPATSGPKKKDD 201

Query: 731  RVTQHNREMNIEPGLALDFNNDVPGKVNWERHLAKSSEQWQWQMAVCNLFDERPIWKKDS 910
              TQ+  E+++ P  A+DF+  +P K+ WE  +++SS  WQWQ++V  LF+ERPIW +DS
Sbjct: 202  AATQNFYEIDVGPVFAIDFS--IPKKLKWEDFVSRSSNHWQWQVSVSALFEERPIWTRDS 259

Query: 911  LTEQLHNKGLMIGNNMLRRLLYRAAYYFSNGPYLRFWIRRGYDPRKDVESRIYQRIDFRV 1090
            + ++L +KGL   ++ML R L RAAYYFS+GP+LRFWI+RGYDPR D ESR+YQR++FRV
Sbjct: 260  VVQRLLDKGLKCTHHMLNRFLLRAAYYFSSGPFLRFWIKRGYDPRNDPESRVYQRMEFRV 319

Query: 1091 PPSLRSYCDANI--GSKHRWSDVCSFRVFPYRCQTSLQLFELDDDYIQKEIRNASPQSTC 1264
            PP LRSYCDAN    +K  W+D+C+F++FP++CQT LQLFELDD+YIQ+EIR    Q+TC
Sbjct: 320  PPELRSYCDANATNSAKPSWNDICAFKLFPFKCQTFLQLFELDDEYIQREIRKPPKQTTC 379

Query: 1265 SLATGWFSTRVLDTLRFRVAVRFLEIYPKDGAESLLKNASIRFEKSKKLQ 1414
            S  +GWFS  +LDTLR RVAVRF+ ++P+ G E + K+    FE+S+K+Q
Sbjct: 380  SHKSGWFSEALLDTLRLRVAVRFVSVFPEPGFEDVFKSIQEEFERSEKVQ 429

