BLASTX nr result

ID: Scutellaria24_contig00012470

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria24_contig00012470
         (1806 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002276057.1| PREDICTED: probable RNA polymerase II transc...   687   0.0  
emb|CAN79528.1| hypothetical protein VITISV_026261 [Vitis vinifera]   635   e-179
ref|XP_002894525.1| hypothetical protein ARALYDRAFT_892577 [Arab...   563   e-158
ref|NP_175971.3| transcription initiation factor TFIIH subunit H...   557   e-156
ref|XP_002530618.1| TFIIH basal transcription factor complex sub...   553   e-155

>ref|XP_002276057.1| PREDICTED: probable RNA polymerase II transcription factor B subunit
            1-1 [Vitis vinifera] gi|296090002|emb|CBI39821.3| unnamed
            protein product [Vitis vinifera]
          Length = 602

 Score =  687 bits (1773), Expect = 0.0
 Identities = 343/511 (67%), Positives = 417/511 (81%), Gaps = 3/511 (0%)
 Frame = -3

Query: 1789 DVCRELVATAIAFHTESGRAPPEKSAAPVNNEQLSRAETERRIKLLQENSELQTLHKQFV 1610
            +VCRE V  A+A  +E+ +A  E+SA  + +EQLS  E ERRIKLL+E+SELQ LHKQFV
Sbjct: 90   EVCREFVGRALAKFSEASKAGSEQSAVKLFDEQLSTIEMERRIKLLREDSELQKLHKQFV 149

Query: 1609 FGGILTDAEFWATRKKLLEQNDSRRPKQRIALKNEMWT-VKPLSDGQSNRVTFNLTPEII 1433
              G+LT+AEFWATRKKLL+ N SR  KQR+  K+ M + +KPL+DG++NRVTFNLTPEII
Sbjct: 150  LSGVLTEAEFWATRKKLLDGNTSRTSKQRVGFKSAMISDLKPLTDGRTNRVTFNLTPEII 209

Query: 1432 HQIFAEKPAVRQAYLNFVPKKMTEKEFWTKYSRAEYLHSTKNVVXXXXXXXXXXXXAVFL 1253
            HQIFAEKPAV QA+LNFVP KMTEK+FW KY RAEYLH T+N V            AVFL
Sbjct: 210  HQIFAEKPAVHQAFLNFVPNKMTEKDFWNKYCRAEYLHCTRNTVAAAAEAAEDEELAVFL 269

Query: 1252 KRDDMLANEARRKIRRVDPTVDMEADEGDDYIHLPDHGLLQDEAKDVLESQYEPFRRSFA 1073
            K DD+LANEARRKIRRVDPT+DMEAD+GDDY+HLPDHG+ +D +K++++ QYE +RR+ +
Sbjct: 270  KHDDILANEARRKIRRVDPTLDMEADQGDDYMHLPDHGIFRDGSKEIIDPQYEQYRRTLS 329

Query: 1072 QDLNQHAAVVLQGRVVDVELSDTRSVAEALTRRKQAELSDETSNINLDRDLPDRISRTME 893
            QDLN+HAAVVL+GR +DVEL DTR+VAEAL + K+ E ++E S+ ++ R+  +RISR  E
Sbjct: 330  QDLNRHAAVVLEGRPIDVELEDTRTVAEALAKSKRVEAANEKSDGSVTRERLERISRMTE 389

Query: 892  IEDLQGPQDPAVAPLSIKDPRDYFDSQQANALKALGDAASGGKPLKASMSSREAYGSVRD 713
            IEDLQ P+D   A L IKDPRDYFDSQQANALK LGD  +G K +K S+S++EAYGS+R 
Sbjct: 390  IEDLQAPRDLPFAALCIKDPRDYFDSQQANALKTLGDTLAGSKQIKCSLSTQEAYGSLRG 449

Query: 712  LVSDIRVMGLSEPIMNQEVALKVLNGLTQNISSTKLHLGNNPNESVLDRLPKVIKEELLH 533
             +S+I+ +GLS+PI+  ++ALKVLNGLTQNISSTK HLG NP ESVLDRLP + KEELLH
Sbjct: 450  FISEIKSVGLSDPIVKPDIALKVLNGLTQNISSTKFHLGKNPQESVLDRLPIITKEELLH 509

Query: 532  HWTSVQELLKHFWSSYPITTKYLYNKVTRLKDAMSQVYPKLQEMKESVQSDFRHQVSLLV 353
            HWTS+QELL+HFWSSYPITT YLY K +RLKDAMSQ+YPKLQE+KESVQSDFRHQVSLLV
Sbjct: 510  HWTSIQELLRHFWSSYPITTTYLYTKASRLKDAMSQIYPKLQEIKESVQSDFRHQVSLLV 569

Query: 352  HPMLQALDAAFAQYDADVQRRSAKT--PPNG 266
             PMLQALDAAFA YDAD Q+RSA++   PNG
Sbjct: 570  QPMLQALDAAFAHYDADQQKRSARSGERPNG 600


>emb|CAN79528.1| hypothetical protein VITISV_026261 [Vitis vinifera]
          Length = 735

 Score =  635 bits (1637), Expect = e-179
 Identities = 325/528 (61%), Positives = 396/528 (75%), Gaps = 44/528 (8%)
 Frame = -3

Query: 1789 DVCRELVATAIAFHTESGRAPPEKSAAPVNNEQLSRAETERRIKLLQEN----------- 1643
            +VCRE V  A+A  +E+ +A  E+SA  + +EQLS  E ERRIKLL+E+           
Sbjct: 90   EVCREFVGKALAKFSEASKAGSEQSAVKLFDEQLSTIEMERRIKLLREDRHSVPEEKFGV 149

Query: 1642 --------------------------------SELQTLHKQFVFGGILTDAEFWATRKKL 1559
                                            SELQ LHKQFV  G+LT+AEFWATRKKL
Sbjct: 150  LLLKGILPLFGETIIAMKLIDSGIASKVKWDCSELQKLHKQFVLSGVLTEAEFWATRKKL 209

Query: 1558 LEQNDSRRPKQRIALKNEMWT-VKPLSDGQSNRVTFNLTPEIIHQIFAEKPAVRQAYLNF 1382
            L+ N SR  KQR+  K+ M + +KPL+DG++NRVTFNLTPEIIHQIFAEKPAV QA+LNF
Sbjct: 210  LDGNTSRTSKQRVGFKSAMISDLKPLTDGRTNRVTFNLTPEIIHQIFAEKPAVHQAFLNF 269

Query: 1381 VPKKMTEKEFWTKYSRAEYLHSTKNVVXXXXXXXXXXXXAVFLKRDDMLANEARRKIRRV 1202
            VP KMTEK+FW KY RAEYLH T+N V            AVFLK DD+LA+EARRKIRRV
Sbjct: 270  VPNKMTEKDFWNKYCRAEYLHCTRNTVAAAAEAAEDEELAVFLKHDDILASEARRKIRRV 329

Query: 1201 DPTVDMEADEGDDYIHLPDHGLLQDEAKDVLESQYEPFRRSFAQDLNQHAAVVLQGRVVD 1022
            DPT+DMEAD+GDDY+HLPDHG+ +D +K++++ QYE +RR+ +QDLN+HAAVVL+GR +D
Sbjct: 330  DPTLDMEADQGDDYMHLPDHGIFRDGSKEIIDPQYEQYRRTLSQDLNRHAAVVLEGRPID 389

Query: 1021 VELSDTRSVAEALTRRKQAELSDETSNINLDRDLPDRISRTMEIEDLQGPQDPAVAPLSI 842
            VEL DTR+VAEAL + K+ E ++E S+ ++ R+  +RISR  EIEDLQ P+D   A L I
Sbjct: 390  VELEDTRTVAEALAKSKRVEAANEKSDGSVTRERLERISRMTEIEDLQAPRDLPFAALCI 449

Query: 841  KDPRDYFDSQQANALKALGDAASGGKPLKASMSSREAYGSVRDLVSDIRVMGLSEPIMNQ 662
            KDPRDYFDSQQANALK LGD  +G K +K S+SS+EAYGS+R  +S+I+ +GLS+PI+  
Sbjct: 450  KDPRDYFDSQQANALKTLGDTLAGSKQIKCSLSSQEAYGSLRGFISEIKSVGLSDPIVKP 509

Query: 661  EVALKVLNGLTQNISSTKLHLGNNPNESVLDRLPKVIKEELLHHWTSVQELLKHFWSSYP 482
            ++ALKVLNGLTQNISSTK HLG NP ESVLDRLP + KEELLHHWTS+QELL+HFWSSYP
Sbjct: 510  DIALKVLNGLTQNISSTKFHLGKNPQESVLDRLPIITKEELLHHWTSIQELLRHFWSSYP 569

Query: 481  ITTKYLYNKVTRLKDAMSQVYPKLQEMKESVQSDFRHQVSLLVHPMLQ 338
            ITT YLY K +RLKDAMSQ+YPKLQE+KESVQSDFRHQVSLLV PMLQ
Sbjct: 570  ITTTYLYTKASRLKDAMSQIYPKLQEIKESVQSDFRHQVSLLVQPMLQ 617


>ref|XP_002894525.1| hypothetical protein ARALYDRAFT_892577 [Arabidopsis lyrata subsp.
            lyrata] gi|297340367|gb|EFH70784.1| hypothetical protein
            ARALYDRAFT_892577 [Arabidopsis lyrata subsp. lyrata]
          Length = 592

 Score =  563 bits (1450), Expect = e-158
 Identities = 301/512 (58%), Positives = 374/512 (73%), Gaps = 6/512 (1%)
 Frame = -3

Query: 1783 CRELVATAIAFHTESGRAPPEKSAAPVNNEQLSRAETERRIKLLQENSELQTLHKQFVFG 1604
            CR+ +  A+A   E     P KS    ++EQLS  E E R KLL+ENSELQ LHKQFV  
Sbjct: 91   CRDFITKALAKCEEE----PNKSVVSTSSEQLSIKELELRFKLLRENSELQRLHKQFVES 146

Query: 1603 GILTDAEFWATRKKLLEQNDSRRPKQRIALKNEMWT-VKPLSDGQSNRVTFNLTPEIIHQ 1427
             +LT+ EFWATRKKLL ++  R+ KQ++ LK+ M + +KP +DG++NRVTFNLTPEII Q
Sbjct: 147  KVLTEDEFWATRKKLLGKDSIRKSKQQVGLKSMMVSGIKPSTDGRTNRVTFNLTPEIIFQ 206

Query: 1426 IFAEKPAVRQAYLNFVPKKMTEKEFWTKYSRAEYLHSTKNVVXXXXXXXXXXXXAVFLKR 1247
            IFAEKPAVRQA++N+VP KMTEK+FWTKY RAEYL+STKN              AVFLK 
Sbjct: 207  IFAEKPAVRQAFINYVPSKMTEKDFWTKYFRAEYLYSTKNTAVAAAEAAEDEELAVFLKP 266

Query: 1246 DDMLANEARRKIRRVDPTVDMEADEGDDYIHLPDHGLLQDEAKDVLESQYEPFRRSFAQD 1067
            D++LA E R+KIRRVDPT+DMEAD+GDDY HL DHG+ +D   DV+E Q + FRRS  QD
Sbjct: 267  DEILARETRQKIRRVDPTLDMEADQGDDYTHLMDHGIQRDGTMDVVEPQNDQFRRSLLQD 326

Query: 1066 LNQHAAVVLQGRVVDVELSDTRSVAEALTRRKQAELSD--ETSNINLDRDLPDRISRTME 893
            LN+HAAVVL+GR +DVE  DTR VAEALTR KQ   +D   T + NL+R   +R+SR   
Sbjct: 327  LNRHAAVVLEGRSIDVESEDTRIVAEALTRVKQVSKADGETTKDANLER--LERMSRLAG 384

Query: 892  IEDLQGPQDPAVAPLSIKDPRDYFDSQQANALKALGDAASGGKPLKASMSSREAYGSVRD 713
            +EDLQ PQ+  +APLSIKDPRDYF+SQQ N L    +   G K LK ++   EAYG +++
Sbjct: 385  MEDLQAPQNFPLAPLSIKDPRDYFESQQGNVL----NVPRGAKGLKRNV--HEAYGLLKE 438

Query: 712  LVSDIRVMGLSEPIMNQEVALKVLNGLTQNISSTKLHLGNNPNESVLDRLPKVIKEELLH 533
             + +IR  GLS+P++  EV+ +V + LT+ IS+ K  +G NP ES LDRLPK  K+E+LH
Sbjct: 439  SILEIRATGLSDPLIRPEVSFEVFSSLTRTISTAKNIIGKNPRESFLDRLPKSTKDEVLH 498

Query: 532  HWTSVQELLKHFWSSYPITTKYLYNKVTRLKDAMSQVYPKLQEMKESVQSDFRHQVSLLV 353
            HWTS+QELL+HFWSSYPITT YL+ KV +LKDAMS  Y KL+ MKESVQSD RHQVSLLV
Sbjct: 499  HWTSIQELLRHFWSSYPITTTYLHTKVGKLKDAMSNTYSKLEAMKESVQSDLRHQVSLLV 558

Query: 352  HPMLQALDAAFAQYDADVQRRSAKT---PPNG 266
             PM QALDAAF  Y+AD+QRR+AK+    PNG
Sbjct: 559  RPMQQALDAAFQHYEADLQRRTAKSGGERPNG 590


>ref|NP_175971.3| transcription initiation factor TFIIH subunit H1 [Arabidopsis
            thaliana] gi|122215373|sp|Q3ECP0.1|TFB1A_ARATH RecName:
            Full=Probable RNA polymerase II transcription factor B
            subunit 1-1; AltName: Full=General transcription and DNA
            repair factor IIH subunit TFB1-1; Short=AtTFB1-1;
            Short=TFIIH subunit TFB1-1 gi|110741140|dbj|BAE98663.1|
            hypothetical protein [Arabidopsis thaliana]
            gi|332195172|gb|AEE33293.1| transcription initiation
            factor TFIIH subunit H1 [Arabidopsis thaliana]
          Length = 591

 Score =  557 bits (1436), Expect = e-156
 Identities = 294/509 (57%), Positives = 369/509 (72%), Gaps = 3/509 (0%)
 Frame = -3

Query: 1783 CRELVATAIAFHTESGRAPPEKSAAPVNNEQLSRAETERRIKLLQENSELQTLHKQFVFG 1604
            CR+ +  A+A         P KS    ++EQLS  E E R KLL+ENSELQ LHKQFV  
Sbjct: 91   CRDFITKALA----KCELEPNKSVVSTSSEQLSIKELELRFKLLRENSELQRLHKQFVES 146

Query: 1603 GILTDAEFWATRKKLLEQNDSRRPKQRIALKNEMWT-VKPLSDGQSNRVTFNLTPEIIHQ 1427
             +LT+ EFWATRKKLL ++  R+ KQ++ LK+ M + +KP +DG++NRVTFNLTPEII Q
Sbjct: 147  KVLTEDEFWATRKKLLGKDSIRKSKQQLGLKSMMVSGIKPSTDGRTNRVTFNLTPEIIFQ 206

Query: 1426 IFAEKPAVRQAYLNFVPKKMTEKEFWTKYSRAEYLHSTKNVVXXXXXXXXXXXXAVFLKR 1247
            IFAEKPAVRQA++N+VP KMTEK+FWTKY RAEYL+STKN              AVFLK 
Sbjct: 207  IFAEKPAVRQAFINYVPSKMTEKDFWTKYFRAEYLYSTKNTAVAAAEAAEDEELAVFLKP 266

Query: 1246 DDMLANEARRKIRRVDPTVDMEADEGDDYIHLPDHGLLQDEAKDVLESQYEPFRRSFAQD 1067
            D++LA E R KIRRVDPT+DMEAD+GDDY HL DHG+ +D   DV+E Q + F+RS  QD
Sbjct: 267  DEILARETRHKIRRVDPTLDMEADQGDDYTHLMDHGIQRDGTMDVVEPQNDQFKRSLLQD 326

Query: 1066 LNQHAAVVLQGRVVDVELSDTRSVAEALTRRKQAELSDETSNINLDRDLPDRISRTMEIE 887
            LN+HAAVVL+GR +DVE  DTR VAEALTR KQ   +D  +  + +++  +R+SR   +E
Sbjct: 327  LNRHAAVVLEGRSIDVESEDTRIVAEALTRVKQVSKADGETTKDANQERLERMSRVAGME 386

Query: 886  DLQGPQDPAVAPLSIKDPRDYFDSQQANALKALGDAASGGKPLKASMSSREAYGSVRDLV 707
            DLQ PQ+  +APLSIKDPRDYF+SQQ N L    +   G K LK ++   EAYG +++ +
Sbjct: 387  DLQAPQNFPLAPLSIKDPRDYFESQQGNVL----NVPRGAKGLKRNV--HEAYGLLKESI 440

Query: 706  SDIRVMGLSEPIMNQEVALKVLNGLTQNISSTKLHLGNNPNESVLDRLPKVIKEELLHHW 527
             +IR  GLS+P++  EV+ +V + LT+ I++ K   G NP ES LDRLPK  K+E+LHHW
Sbjct: 441  LEIRATGLSDPLIKPEVSFEVFSSLTRTIATAKNINGKNPRESFLDRLPKSTKDEVLHHW 500

Query: 526  TSVQELLKHFWSSYPITTKYLYNKVTRLKDAMSQVYPKLQEMKESVQSDFRHQVSLLVHP 347
            TS+QELLKHFWSSYPITT YL+ KV +LKDAMS  Y KL+ MKESVQSD RHQVSLLV P
Sbjct: 501  TSIQELLKHFWSSYPITTTYLHTKVGKLKDAMSNTYSKLEAMKESVQSDLRHQVSLLVRP 560

Query: 346  MLQALDAAFAQYDADVQRRSAKT--PPNG 266
            M QALDAAF  Y+ D+QRR+AK+   PNG
Sbjct: 561  MQQALDAAFHHYEVDLQRRTAKSGERPNG 589


>ref|XP_002530618.1| TFIIH basal transcription factor complex subunit, putative [Ricinus
            communis] gi|223529828|gb|EEF31761.1| TFIIH basal
            transcription factor complex subunit, putative [Ricinus
            communis]
          Length = 597

 Score =  553 bits (1425), Expect = e-155
 Identities = 294/513 (57%), Positives = 378/513 (73%), Gaps = 6/513 (1%)
 Frame = -3

Query: 1786 VCRELVATAIAFHTESGRAPPEKSAAPVNNEQLSRAETERRIKLLQENSELQTLHKQFVF 1607
            +C+E+V  A++   + G  P    A  V ++Q S  E   R+ LL+EN ELQ LHKQFV 
Sbjct: 89   ICKEIVGKALS---KLGDTPKPPDAPEVPSDQPSTEELLLRMNLLRENLELQKLHKQFVS 145

Query: 1606 GGILTDAEFWATRKKLLEQNDSRRPKQRIALKNEMWT-VKPLSDGQSNRVTFNLTPEIIH 1430
              +LTD+EFWATRKKLL    SR+ KQR+ LK+ M    KPL DGQ+N+VTFNLTPEI+ 
Sbjct: 146  DRVLTDSEFWATRKKLLNGEFSRKSKQRVGLKSVMLADSKPLIDGQTNKVTFNLTPEIVR 205

Query: 1429 QIFAEKPAVRQAYLNFVPKKMTEKEFWTKYSRAEYLHSTKNVVXXXXXXXXXXXXAVFLK 1250
            +IFAEKPAV QAYL+ VP KM+E++FWTKY RAEYL  ++N+             A+FLK
Sbjct: 206  EIFAEKPAVHQAYLSLVPNKMSERDFWTKYCRAEYLQRSRNIHAAAAEAAEDEELALFLK 265

Query: 1249 RDDMLANEARRKIRRVDPTVDMEADEGDDYIHLPDHGLLQDEAKDVLESQYEPFRRSFAQ 1070
             DD+LA+E R+KIR VDPT+DMEAD+GDDY HLPDHG+++D +KDV+ESQ+EP+RR+  Q
Sbjct: 266  PDDILASETRQKIRCVDPTLDMEADQGDDYTHLPDHGIVRDGSKDVIESQHEPYRRTLLQ 325

Query: 1069 DLNQHAAVVLQGRVVDVE-LSDTRSVAEALTRRKQA-ELSDETSNINLDRDLPDRISRTM 896
            DLN+HAAVVL+G  +D E L DT++VA+AL R K+  +  +  ++ N +++  +RIS+ M
Sbjct: 326  DLNRHAAVVLEGTAIDDEQLQDTKAVADALARSKRGIKTINREADGNANQERSNRISQMM 385

Query: 895  EIEDLQGPQDPAVAPLSIKDPRDYFDSQQANALKALGDAASGGKPLKASMSSREAYGSVR 716
            EIEDLQG  D  +APL IKDPRDYFDSQQA+ALK   D  SG +  + S+SS+EAY S+R
Sbjct: 386  EIEDLQGSNDHHLAPLCIKDPRDYFDSQQASALKNSRDIPSGTEAARCSLSSQEAYASLR 445

Query: 715  DLVSDIRVMGLSEPIMNQEVALKVLNGLTQNISSTKLHLGNNPNESVLDRLPKVIKEELL 536
            D ++  + MGL++PI+  E+A KVL+ LT NISSTK HLG N  ESVLDRLP  IKEELL
Sbjct: 446  DSITQTKAMGLNDPIVKPEIATKVLSILTHNISSTKYHLGKNSRESVLDRLPNTIKEELL 505

Query: 535  HHWTSVQELLKHFWSSYPITTKYLYNKVTRLKDAMSQVYPKLQEMKESVQSDFR-HQVSL 359
            HHW S++ELL+H+WSSYPITT YLY KV+RLKDAMS++  +LQEMKESVQSD   H  SL
Sbjct: 506  HHWMSIEELLRHYWSSYPITTAYLYAKVSRLKDAMSKIDSQLQEMKESVQSDLXFHATSL 565

Query: 358  LVHPMLQALDAAFAQYDADVQRRSAKTP--PNG 266
             + P   AL+AA   YDAD+Q+RSAK+   PNG
Sbjct: 566  GIVP---ALEAAMQHYDADLQKRSAKSAERPNG 595

