BLASTX nr result
ID: Bupleurum21_contig00009632
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00009632
         (1791 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002275875.1| PREDICTED: transcription factor tau subunit ...   586   e-165
dbj|BAF00928.1| hypothetical protein [Arabidopsis thaliana]           491   e-136
ref|NP_190510.3| Transcription factor IIIC, subunit 5 [Arabidops...   490   e-136
ref|XP_003537671.1| PREDICTED: general transcription factor 3C p...   489   e-135
ref|XP_002875963.1| hypothetical protein ARALYDRAFT_485301 [Arab...   481   e-133

>ref|XP_002275875.1| PREDICTED: transcription factor tau subunit sfc1-like [Vitis vinifera]
          Length = 568

 Score = 586 bits (1510), Expect = e-165
 Identities = 296/482 (61%), Positives = 358/482 (74%), Gaps = 3/482 (0%)
 Frame = +2

Query: 11   MGVIKDGSISGVLPSTKLFAVHYPGYPSSIIRAIETLGGSDGIVKARGSQLNRLELHFRP 190
            MGVI++GSISG +PS + F+VHYP YPSS  RAIETLGG+  I KAR SQ N+LELHFRP
Sbjct: 1    MGVIEEGSISGYIPSNEAFSVHYPAYPSSTARAIETLGGTQAIRKARSSQSNKLELHFRP 60

Query: 191  EDPYSHPAFGEHYPCYNFLLKISKKGEGEHPTIVDAEITSICLAKDSNQDLKISFPESVE 370
            EDPYSHPAFGE  PC N LL+ISKK                               +S +
Sbjct: 61   EDPYSHPAFGELQPCNNLLLRISKK-------------------------------KSTD 89

Query: 371  TEYVSQVGSESISAPATVKAQASSKVQGNLCADIVAQVSESYHFNGMADYQHVLAVHADV 550
             +      SES++    V+AQ S +V   LCADI+A+VSE+YHFNGM DYQHVL VHADV
Sbjct: 90   GQ------SESVATGEEVEAQISGEVPIRLCADIIARVSEAYHFNGMVDYQHVLPVHADV 143

Query: 551  SRRKKRKWADVDPELEKRGLLDVDQEDLLILVPPLFSPKDMPEKVVLKPSLEVGLKHKQV 730
            +RRKKR WA+V+P LEK  L+DVDQEDL+IL+PPLFSPKD+PEK+VL+PS+ + LK KQ
Sbjct: 144  ARRKKRNWAEVEPHLEKGDLVDVDQEDLMILLPPLFSPKDVPEKLVLRPSMTLNLKKKQE 203

Query: 731  RVTQHNREMNIEPGLALDFN-NDVPGKVNWERHLAKSSEQWQWQMAVCNLFDERPIWKKD 907
             V Q   EM IEP LA+DF   ++P KVNWE+++ K SEQW+WQMAV NLFDERPIW K
Sbjct: 204  GVVQQRWEMGIEPCLAIDFEIKEIPKKVNWEQYIPKGSEQWEWQMAVSNLFDERPIWPKG 263

Query: 908  SLTEQLHNKGLMIGNNMLRRLLYRAAYYFSNGPYLRFWIRRGYDPRKDVESRIYQRIDFR 1087
            +LTE+L +KGL +G+  LRRLL+R AYYFSNGP+LRFWIR+GYDPRK+ +S IYQRIDFR
Sbjct: 264  ALTERLLDKGLNVGDYTLRRLLFRTAYYFSNGPFLRFWIRKGYDPRKNPDSCIYQRIDFR 323

Query: 1088 VPPSLRSYCDANI--GSKHRWSDVCSFRVFPYRCQTSLQLFELDDDYIQKEIRNASPQST 1261
            VPPSLRSYCDAN   G K RW D+CSFRVFPY+C TSLQLFEL DDYIQ+EIR    Q+T
Sbjct: 324  VPPSLRSYCDANAANGLKQRWEDICSFRVFPYKCHTSLQLFELADDYIQQEIRKPLKQTT 383

Query: 1262 CSLATGWFSTRVLDTLRFRVAVRFLEIYPKDGAESLLKNASIRFEKSKKLQFFVKDQRPD 1441
            C+ ATGWFS RVL++LR  V VRFL I P+  AE LLK+AS RFEKSK++  +  + RP+
Sbjct: 384  CTGATGWFSYRVLESLRLCVMVRFLSICPETSAEYLLKSASDRFEKSKRMHIYENNLRPN 443

Query: 1442 DK 1447
            ++
Sbjct: 444  EE 445

>dbj|BAF00928.1| hypothetical protein [Arabidopsis thaliana]
          Length = 574

 Score = 491 bits (1263), Expect = e-136
 Identities = 245/479 (51%), Positives = 328/479 (68%), Gaps = 3/479 (0%)
 Frame = +2

Query: 11   MGVIKDGSISGVLPSTKLFAVHYPGYPSSIIRAIETLGGSDGIVKARGSQLNRLELHFRP 190
            MG+I++G+ISG LPS + F VH+PGYPSSI RAIETLGG  GI +AR S  N+LEL FRP
Sbjct: 1    MGIIEEGTISGTLPSKEAFVVHFPGYPSSISRAIETLGGIQGITQARESISNKLELRFRP 60

Query: 191  EDPYSHPAFGEHYPCYNFLLKISKKGEGEHPTIVDAEITSICLAKDSNQDLKISFPESVE 370
            EDPY+HPA GE  PC  FLL+ISK                        QD+K    +SV
Sbjct: 61   EDPYAHPALGEQRPCSGFLLRISK------------------------QDIKKPESQSV- 95

Query: 371  TEYVSQVGSESISAPATVKAQASSKVQGNLCADIVAQVSESYHFNGMADYQHVLAVHADV 550
                       +     V  + +S V   LCADIVA++SES+HF+GMADYQHV+ +HAD+
Sbjct: 96   -----------LDTSRDVCLEEASPV---LCADIVARLSESFHFDGMADYQHVIPIHADI 141

Query: 551  SRRKKRKWADVDPELEKRGLLDVDQEDLLILVPPLFSPKDMPEKVVLKPSLEVGLKHKQV 730
            +++KKRKW DVDP   K  L+ +  ED+++L+P  F+PKD+P+ V LKP    G K K
Sbjct: 142  AQQKKRKWMDVDPLTGKSDLMGLADEDVMMLLPQFFAPKDIPDNVALKPPATSGPKKKDD 201

Query: 731  RVTQHNREMNIEPGLALDFN-NDVPGKVNWERHLAKSSEQWQWQMAVCNLFDERPIWKKD 907
              TQ+  E+++ P  A+DF+  ++P K+ WE  +++SS  WQWQ+AV  LF+ERPIW +D
Sbjct: 202  VATQNFYEIDVGPVFAIDFSVKEIPKKLKWEDFVSRSSNHWQWQVAVSALFEERPIWTRD 261

Query: 908  SLTEQLHNKGLMIGNNMLRRLLYRAAYYFSNGPYLRFWIRRGYDPRKDVESRIYQRIDFR 1087
            S+ ++L +KGL   ++ML R L RAAYYFS+GP+LRFWI+RGYDPR D ESR+YQR++FR
Sbjct: 262  SVVQRLLDKGLKCTHHMLNRFLLRAAYYFSSGPFLRFWIKRGYDPRNDPESRVYQRMEFR 321

Query: 1088 VPPSLRSYCDANI--GSKHRWSDVCSFRVFPYRCQTSLQLFELDDDYIQKEIRNASPQST 1261
            VPP LR YCDAN    SK  W+D+C+F++FP++CQT LQLFELDD+YIQ+EIR    Q+T
Sbjct: 322  VPPELRGYCDANATNNSKPSWNDICAFKLFPFKCQTFLQLFELDDEYIQREIRKPPKQTT 381

Query: 1262 CSLATGWFSTRVLDTLRFRVAVRFLEIYPKDGAESLLKNASIRFEKSKKLQFFVKDQRP 1438
            CS  +GWFS  +LDTLR RVAVRF+ ++P+ G E + K+    FE+SKK+Q   +  +P
Sbjct: 382  CSHKSGWFSEAMLDTLRLRVAVRFVSVFPETGFEDVFKSIQEEFERSKKVQIQKETLKP 440

>ref|NP_190510.3| Transcription factor IIIC, subunit 5 [Arabidopsis thaliana]
 gi|332645018|gb|AEE78539.1| Transcription factor IIIC, subunit 5 [Arabidopsis thaliana]
          Length = 574

 Score = 490 bits (1261), Expect = e-136
 Identities = 244/479 (50%), Positives = 328/479 (68%), Gaps = 3/479 (0%)
 Frame = +2

Query: 11   MGVIKDGSISGVLPSTKLFAVHYPGYPSSIIRAIETLGGSDGIVKARGSQLNRLELHFRP 190
            MG+I++G+ISG LPS + F VH+PGYPSSI RAIETLGG  GI +AR S  N+LEL FRP
Sbjct: 1    MGIIEEGTISGTLPSKEAFVVHFPGYPSSISRAIETLGGIQGITQARESISNKLELRFRP 60

Query: 191  EDPYSHPAFGEHYPCYNFLLKISKKGEGEHPTIVDAEITSICLAKDSNQDLKISFPESVE 370
            EDPY+HPA GE  PC  FLL+ISK                        QD+K    +SV
Sbjct: 61   EDPYAHPALGEQRPCSGFLLRISK------------------------QDIKKPESQSV- 95

Query: 371  TEYVSQVGSESISAPATVKAQASSKVQGNLCADIVAQVSESYHFNGMADYQHVLAVHADV 550
                       +     V  + +S V   LCADIVA++SES+HF+GMADYQHV+ +HAD+
Sbjct: 96   -----------LDTSRDVCLEEASPV---LCADIVARLSESFHFDGMADYQHVIPIHADI 141

Query: 551  SRRKKRKWADVDPELEKRGLLDVDQEDLLILVPPLFSPKDMPEKVVLKPSLEVGLKHKQV 730
            +++KKRKW DVDP   K  L+ +  ED+++L+P  F+PKD+P+ V LKP    G K K
Sbjct: 142  AQQKKRKWMDVDPLTGKSDLMGLADEDVMMLLPQFFAPKDIPDNVALKPPATSGPKKKDD 201

Query: 731  RVTQHNREMNIEPGLALDFN-NDVPGKVNWERHLAKSSEQWQWQMAVCNLFDERPIWKKD 907
              TQ+  E+++ P  A+DF+  ++P K+ WE  +++SS  WQWQ+AV  LF+ERPIW +D
Sbjct: 202  AATQNFYEIDVGPVFAIDFSVKEIPKKLKWEDFVSRSSNHWQWQVAVSALFEERPIWTRD 261

Query: 908  SLTEQLHNKGLMIGNNMLRRLLYRAAYYFSNGPYLRFWIRRGYDPRKDVESRIYQRIDFR 1087
            S+ ++L +KGL   ++ML R L RAAYYFS+GP+LRFWI+RGYDPR D ESR+YQR++FR
Sbjct: 262  SVVQRLLDKGLKCTHHMLNRFLLRAAYYFSSGPFLRFWIKRGYDPRNDPESRVYQRMEFR 321

Query: 1088 VPPSLRSYCDANI--GSKHRWSDVCSFRVFPYRCQTSLQLFELDDDYIQKEIRNASPQST 1261
            VPP LR YCDAN    SK  W+D+C+F++FP++CQT LQLFELDD+YIQ+EIR    Q+T
Sbjct: 322  VPPELRGYCDANATNNSKPSWNDICAFKLFPFKCQTFLQLFELDDEYIQREIRKPPKQTT 381

Query: 1262 CSLATGWFSTRVLDTLRFRVAVRFLEIYPKDGAESLLKNASIRFEKSKKLQFFVKDQRP 1438
            CS  +GWFS  +LDTLR RVAVRF+ ++P+ G E + K+    FE+S+K+Q   +  +P
Sbjct: 382  CSHKSGWFSEAMLDTLRLRVAVRFVSVFPETGFEDVFKSIQEEFERSEKVQIQKETLKP 440

>ref|XP_003537671.1| PREDICTED: general transcription factor 3C polypeptide 5-like [Glycine max]
          Length = 547

 Score = 489 bits (1259), Expect = e-135
 Identities = 248/470 (52%), Positives = 325/470 (69%), Gaps = 4/470 (0%)
 Frame = +2

Query: 11   MGVIKDGSISGVLPSTKLFAVHYPGYPSSIIRAIETLGGSDGIVKARGSQLNRLELHFRP 190
            MGVIKDG+ISGVLP  + F VHYP YPSSI RA++TLGG   I KAR S+ N+LEL FRP
Sbjct: 1    MGVIKDGTISGVLPEPQGFMVHYPAYPSSISRAVDTLGGIQAIQKARCSKSNKLELRFRP 60

Query: 191  EDPYSHPAFGEHYPCYNFLLKISKKGEGEHPTIVDAEITSICLAKDSNQDLKISFPESVE 370
            EDPYSHPAFGE  P  + LLKISK      P +  DAE +S     + +Q+
Sbjct: 61   EDPYSHPAFGELRPTNSLLLKISKTKPP--PPVHDAEASSSSTNGEQDQE---------- 108

Query: 371  TEYVSQVGSESISAPATVKAQASSKVQGNLCADIVAQVSESYHFNGMADYQHVLAVHADV 550
                                       G+LCADIVA+  E+Y F GMADYQHV+ VHADV
Sbjct: 109  ---------------------------GSLCADIVARFPEAYFFYGMADYQHVIPVHADV 141

Query: 551  SRRKKRKWADVDP-ELEKRGLLDVDQEDLLILVPPLFSPKDMPEKVVLKPSLEVGLKHKQ 727
            +RRKKR W++++    +K G +D+D ED++I+VPP+F+PKD+PE +VL+P+     K K
Sbjct: 142  ARRKKRNWSELEELHFDKGGFMDLDHEDVMIIVPPIFAPKDVPENLVLRPATMSSSKKKP 201

Query: 728  VRVTQHNREMNIEPGLALDFN-NDVPGKVNWERHLAKSSEQWQWQMAVCNLFDERPIWKK 904
              V Q + EM++EP LA+DF+  ++P KVNWE ++ + S+QW+ QM V  +FDERPIW K
Sbjct: 202  EEVVQPHFEMDMEPVLAIDFDIKEIPKKVNWEEYIPQGSDQWELQMVVSRMFDERPIWSK 261

Query: 905  DSLTEQLHNKGLMIGNNMLRRLLYRAAYYFSNGPYLRFWIRRGYDPRKDVESRIYQRIDF 1084
            +SLTE L +KGL   ++MLRRLL R +YYFS+GP+LRFWI++GYDPRKD  SRIYQRID+
Sbjct: 262  NSLTELLLDKGLSFSHSMLRRLLSRISYYFSSGPFLRFWIKKGYDPRKDPNSRIYQRIDY 321

Query: 1085 RVPPSLRSYCDANIG--SKHRWSDVCSFRVFPYRCQTSLQLFELDDDYIQKEIRNASPQS 1258
            RVP  LRSYCDA+    SKHRW D+C+FRVFPY+ QTSLQ F+L DDYIQ EI     +
Sbjct: 322  RVPVPLRSYCDAHSANKSKHRWKDICAFRVFPYKFQTSLQFFDLVDDYIQSEINKPPFRP 381

Query: 1259 TCSLATGWFSTRVLDTLRFRVAVRFLEIYPKDGAESLLKNASIRFEKSKK 1408
            TC+  TGWFS  +++ +R R+ VR+L ++PK GAE+LL+ A+++FEK K+
Sbjct: 382  TCTSGTGWFSQHMINCIRQRLMVRYLSVFPKPGAENLLRAATLKFEKLKR 431

>ref|XP_002875963.1| hypothetical protein ARALYDRAFT_485301 [Arabidopsis lyrata subsp. lyrata]
 gi|297321801|gb|EFH52222.1| hypothetical protein ARALYDRAFT_485301 [Arabidopsis lyrata subsp. lyrata]
          Length = 571

 Score = 481 bits (1238), Expect = e-133
 Identities = 241/470 (51%), Positives = 324/470 (68%), Gaps = 2/470 (0%)
 Frame = +2

Query: 11   MGVIKDGSISGVLPSTKLFAVHYPGYPSSIIRAIETLGGSDGIVKARGSQLNRLELHFRP 190
            MG+I++G ISG LPS + F VH+PGYPSSI RAIETLGG  GI +AR S  N+LEL FRP
Sbjct: 1    MGIIEEGIISGTLPSKEAFVVHFPGYPSSISRAIETLGGIQGISQARESISNKLELRFRP 60

Query: 191  EDPYSHPAFGEHYPCYNFLLKISKKGEGEHPTIVDAEITSICLAKDSNQDLKISFPESVE 370
            EDPY+HPA GE  PC  FLL+ISK                        QD+K   PES
Sbjct: 61   EDPYAHPALGEQRPCCGFLLRISK------------------------QDIKK--PESQP 94

Query: 371  TEYVSQVGSESISAPATVKAQASSKVQGNLCADIVAQVSESYHFNGMADYQHVLAVHADV 550
                       ++  + V  + +S V   LCADI+A+VSES+HF+GMADYQHV+ +HAD+
Sbjct: 95   V----------LATSSDVCLEEASTV---LCADIIARVSESFHFDGMADYQHVIPIHADI 141

Query: 551  SRRKKRKWADVDPELEKRGLLDVDQEDLLILVPPLFSPKDMPEKVVLKPSLEVGLKHKQV 730
            +++KKRKW DVD       L+ +  ED+++L+P  F+PKD+P+ V LKP    G K K
Sbjct: 142  AQQKKRKWMDVDSLTGNSDLMGLADEDVMMLLPQFFAPKDIPDNVALKPPATSGPKKKDD 201

Query: 731  RVTQHNREMNIEPGLALDFNNDVPGKVNWERHLAKSSEQWQWQMAVCNLFDERPIWKKDS 910
              TQ+  E+++ P  A+DF+  +P K+ WE  +++SS  WQWQ++V  LF+ERPIW +DS
Sbjct: 202   ATQNFYEIDVGPVFAIDFS--IPKKLKWEDFVSRSSNHWQWQVSVSALFEERPIWTRDS 259

Query: 911  LTEQLHNKGLMIGNNMLRRLLYRAAYYFSNGPYLRFWIRRGYDPRKDVESRIYQRIDFRV 1090
            + ++L +KGL   ++ML R L RAAYYFS+GP+LRFWI+RGYDPR D ESR+YQR++FRV
Sbjct: 260  VVQRLLDKGLKCTHHMLNRFLLRAAYYFSSGPFLRFWIKRGYDPRNDPESRVYQRMEFRV 319

Query: 1091 PPSLRSYCDANI--GSKHRWSDVCSFRVFPYRCQTSLQLFELDDDYIQKEIRNASPQSTC 1264
            PP LRSYCDAN    +K  W+D+C+F++FP++CQT LQLFELDD+YIQ+EIR    Q+TC
Sbjct: 320  PPELRSYCDANATNSAKPSWNDICAFKLFPFKCQTFLQLFELDDEYIQREIRKPPKQTTC 379

Query: 1265 SLATGWFSTRVLDTLRFRVAVRFLEIYPKDGAESLLKNASIRFEKSKKLQ 1414
            S  +GWFS  +LDTLR RVAVRF+ ++P+ G E + K+    FE+S+K+Q
Sbjct: 380  SHKSGWFSEALLDTLRLRVAVRFVSVFPEPGFEDVFKSIQEEFERSEKVQ 429