BLASTX nr result

ID: Cephaelis21_contig00019830 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00019830
         (1681 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002276057.1| PREDICTED: probable RNA polymerase II transc...   587   e-165
emb|CAN79528.1| hypothetical protein VITISV_026261 [Vitis vinifera]   540   e-151
ref|NP_175971.3| transcription initiation factor TFIIH subunit H...   496   e-138
ref|XP_002894525.1| hypothetical protein ARALYDRAFT_892577 [Arab...   494   e-137
ref|XP_002530618.1| TFIIH basal transcription factor complex sub...   479   e-133

>ref|XP_002276057.1| PREDICTED: probable RNA polymerase II transcription factor B subunit
            1-1 [Vitis vinifera] gi|296090002|emb|CBI39821.3| unnamed
            protein product [Vitis vinifera]
          Length = 602

 Score =  587 bits (1512), Expect = e-165
 Identities = 293/432 (67%), Positives = 349/432 (80%), Gaps = 1/432 (0%)
 Frame = +3

Query: 6    NYSRKQKQRVALKNDMWST-KPLSDGQSNRVTFNLTPEIILQIFAEKPAVRQAYLNFVPK 182
            N SR  KQRV  K+ M S  KPL+DG++NRVTFNLTPEII QIFAEKPAV QA+LNFVP 
Sbjct: 170  NTSRTSKQRVGFKSAMISDLKPLTDGRTNRVTFNLTPEIIHQIFAEKPAVHQAFLNFVPN 229

Query: 183  KMSEKEFWTKYSRAEYLHSTKNVVXXXXXXXXXXXXXVFLKQDDMLANEARRKIRRVDPT 362
            KM+EK+FW KY RAEYLH T+N V             VFLK DD+LANEARRKIRRVDPT
Sbjct: 230  KMTEKDFWNKYCRAEYLHCTRNTVAAAAEAAEDEELAVFLKHDDILANEARRKIRRVDPT 289

Query: 363  LDMEADEGDDYMHLPDHGLSRDDGKDFLDSQYEPFRRSFPQHLNQHAAVVLQGRVVDFEL 542
            LDMEAD+GDDYMHLPDHG+ RD  K+ +D QYE +RR+  Q LN+HAAVVL+GR +D EL
Sbjct: 290  LDMEADQGDDYMHLPDHGIFRDGSKEIIDPQYEQYRRTLSQDLNRHAAVVLEGRPIDVEL 349

Query: 543  GDTRSVAEALARKKQAELARDVSDVNVDKDRSERISQMAELEDLQAPRDPPVAPLCIKDP 722
             DTR+VAEALA+ K+ E A + SD +V ++R ERIS+M E+EDLQAPRD P A LCIKDP
Sbjct: 350  EDTRTVAEALAKSKRVEAANEKSDGSVTRERLERISRMTEIEDLQAPRDLPFAALCIKDP 409

Query: 723  RDYFDSQQANALNTAGETAFCANQLNFSISTSAAYGSLKECITEIRTLGLTEPIVTSDVA 902
            RDYFDSQQANAL T G+T   + Q+  S+ST  AYGSL+  I+EI+++GL++PIV  D+A
Sbjct: 410  RDYFDSQQANALKTLGDTLAGSKQIKCSLSTQEAYGSLRGFISEIKSVGLSDPIVKPDIA 469

Query: 903  LKVFDGLNRSISCTKYNMGRNPNDSVLDSLPKITKEELLHHWTSIQELLKHFWSSYPVTT 1082
            LKV +GL ++IS TK+++G+NP +SVLD LP ITKEELLHHWTSIQELL+HFWSSYP+TT
Sbjct: 470  LKVLNGLTQNISSTKFHLGKNPQESVLDRLPIITKEELLHHWTSIQELLRHFWSSYPITT 529

Query: 1083 NHLLSKATKLKDAMSQTYSKLQEMKETVQSDLRHQVSLLVQPMLQALDAAFAHYDTDVQK 1262
             +L +KA++LKDAMSQ Y KLQE+KE+VQSD RHQVSLLVQPMLQALDAAFAHYD D QK
Sbjct: 530  TYLYTKASRLKDAMSQIYPKLQEIKESVQSDFRHQVSLLVQPMLQALDAAFAHYDADQQK 589

Query: 1263 RSRRSTERPNGY 1298
            RS RS ERPNG+
Sbjct: 590  RSARSGERPNGF 601


>emb|CAN79528.1| hypothetical protein VITISV_026261 [Vitis vinifera]
          Length = 735

 Score =  540 bits (1391), Expect = e-151
 Identities = 277/433 (63%), Positives = 335/433 (77%), Gaps = 3/433 (0%)
 Frame = +3

Query: 6    NYSRKQKQRVALKNDMWST-KPLSDGQSNRVTFNLTPEIILQIFAEKPAVRQAYLNFVPK 182
            N SR  KQRV  K+ M S  KPL+DG++NRVTFNLTPEII QIFAEKPAV QA+LNFVP 
Sbjct: 213  NTSRTSKQRVGFKSAMISDLKPLTDGRTNRVTFNLTPEIIHQIFAEKPAVHQAFLNFVPN 272

Query: 183  KMSEKEFWTKYSRAEYLHSTKNVVXXXXXXXXXXXXXVFLKQDDMLANEARRKIRRVDPT 362
            KM+EK+FW KY RAEYLH T+N V             VFLK DD+LA+EARRKIRRVDPT
Sbjct: 273  KMTEKDFWNKYCRAEYLHCTRNTVAAAAEAAEDEELAVFLKHDDILASEARRKIRRVDPT 332

Query: 363  LDMEADEGDDYMHLPDHGLSRDDGKDFLDSQYEPFRRSFPQHLNQHAAVVLQGRVVDFEL 542
            LDMEAD+GDDYMHLPDHG+ RD  K+ +D QYE +RR+  Q LN+HAAVVL+GR +D EL
Sbjct: 333  LDMEADQGDDYMHLPDHGIFRDGSKEIIDPQYEQYRRTLSQDLNRHAAVVLEGRPIDVEL 392

Query: 543  GDTRSVAEALARKKQAELARDVSDVNVDKDRSERISQMAELEDLQAPRDPPVAPLCIKDP 722
             DTR+VAEALA+ K+ E A + SD +V ++R ERIS+M E+EDLQAPRD P A LCIKDP
Sbjct: 393  EDTRTVAEALAKSKRVEAANEKSDGSVTRERLERISRMTEIEDLQAPRDLPFAALCIKDP 452

Query: 723  RDYFDSQQANALNTAGETAFCANQLNFSISTSAAYGSLKECITEIRTLGLTEPIVTSDVA 902
            RDYFDSQQANAL T G+T   + Q+  S+S+  AYGSL+  I+EI+++GL++PIV  D+A
Sbjct: 453  RDYFDSQQANALKTLGDTLAGSKQIKCSLSSQEAYGSLRGFISEIKSVGLSDPIVKPDIA 512

Query: 903  LKVFDGLNRSISCTKYNMGRNPNDSVLDSLPKITKEELLHHWTSIQELLKHFWSSYPVTT 1082
            LKV +GL ++IS TK+++G+NP +SVLD LP ITKEELLHHWTSIQELL+HFWSSYP+TT
Sbjct: 513  LKVLNGLTQNISSTKFHLGKNPQESVLDRLPIITKEELLHHWTSIQELLRHFWSSYPITT 572

Query: 1083 NHLLSKATKLKDAMSQTYSKLQEMKETVQSDLRHQVSLLVQPMLQALDAAFAHYDTDVQK 1262
             +L +KA++LKDAMSQ Y KLQE+KE+VQSD RHQVSLLVQPMLQ           +  K
Sbjct: 573  TYLYTKASRLKDAMSQIYPKLQEIKESVQSDFRHQVSLLVQPMLQR--------RRETLK 624

Query: 1263 RSRRS--TERPNG 1295
            R RRS   E P G
Sbjct: 625  RRRRSEPQEGPGG 637


>ref|NP_175971.3| transcription initiation factor TFIIH subunit H1 [Arabidopsis
            thaliana] gi|122215373|sp|Q3ECP0.1|TFB1A_ARATH RecName:
            Full=Probable RNA polymerase II transcription factor B
            subunit 1-1; AltName: Full=General transcription and DNA
            repair factor IIH subunit TFB1-1; Short=AtTFB1-1;
            Short=TFIIH subunit TFB1-1 gi|110741140|dbj|BAE98663.1|
            hypothetical protein [Arabidopsis thaliana]
            gi|332195172|gb|AEE33293.1| transcription initiation
            factor TFIIH subunit H1 [Arabidopsis thaliana]
          Length = 591

 Score =  496 bits (1276), Expect = e-138
 Identities = 254/429 (59%), Positives = 315/429 (73%), Gaps = 1/429 (0%)
 Frame = +3

Query: 15   RKQKQRVALKNDMWS-TKPLSDGQSNRVTFNLTPEIILQIFAEKPAVRQAYLNFVPKKMS 191
            RK KQ++ LK+ M S  KP +DG++NRVTFNLTPEII QIFAEKPAVRQA++N+VP KM+
Sbjct: 168  RKSKQQLGLKSMMVSGIKPSTDGRTNRVTFNLTPEIIFQIFAEKPAVRQAFINYVPSKMT 227

Query: 192  EKEFWTKYSRAEYLHSTKNVVXXXXXXXXXXXXXVFLKQDDMLANEARRKIRRVDPTLDM 371
            EK+FWTKY RAEYL+STKN               VFLK D++LA E R KIRRVDPTLDM
Sbjct: 228  EKDFWTKYFRAEYLYSTKNTAVAAAEAAEDEELAVFLKPDEILARETRHKIRRVDPTLDM 287

Query: 372  EADEGDDYMHLPDHGLSRDDGKDFLDSQYEPFRRSFPQHLNQHAAVVLQGRVVDFELGDT 551
            EAD+GDDY HL DHG+ RD   D ++ Q + F+RS  Q LN+HAAVVL+GR +D E  DT
Sbjct: 288  EADQGDDYTHLMDHGIQRDGTMDVVEPQNDQFKRSLLQDLNRHAAVVLEGRSIDVESEDT 347

Query: 552  RSVAEALARKKQAELARDVSDVNVDKDRSERISQMAELEDLQAPRDPPVAPLCIKDPRDY 731
            R VAEAL R KQ   A   +  + +++R ER+S++A +EDLQAP++ P+APL IKDPRDY
Sbjct: 348  RIVAEALTRVKQVSKADGETTKDANQERLERMSRVAGMEDLQAPQNFPLAPLSIKDPRDY 407

Query: 732  FDSQQANALNTAGETAFCANQLNFSISTSAAYGSLKECITEIRTLGLTEPIVTSDVALKV 911
            F+SQQ N LN                +   AYG LKE I EIR  GL++P++  +V+ +V
Sbjct: 408  FESQQGNVLNVP------RGAKGLKRNVHEAYGLLKESILEIRATGLSDPLIKPEVSFEV 461

Query: 912  FDGLNRSISCTKYNMGRNPNDSVLDSLPKITKEELLHHWTSIQELLKHFWSSYPVTTNHL 1091
            F  L R+I+  K   G+NP +S LD LPK TK+E+LHHWTSIQELLKHFWSSYP+TT +L
Sbjct: 462  FSSLTRTIATAKNINGKNPRESFLDRLPKSTKDEVLHHWTSIQELLKHFWSSYPITTTYL 521

Query: 1092 LSKATKLKDAMSQTYSKLQEMKETVQSDLRHQVSLLVQPMLQALDAAFAHYDTDVQKRSR 1271
             +K  KLKDAMS TYSKL+ MKE+VQSDLRHQVSLLV+PM QALDAAF HY+ D+Q+R+ 
Sbjct: 522  HTKVGKLKDAMSNTYSKLEAMKESVQSDLRHQVSLLVRPMQQALDAAFHHYEVDLQRRTA 581

Query: 1272 RSTERPNGY 1298
            +S ERPNGY
Sbjct: 582  KSGERPNGY 590


>ref|XP_002894525.1| hypothetical protein ARALYDRAFT_892577 [Arabidopsis lyrata subsp.
            lyrata] gi|297340367|gb|EFH70784.1| hypothetical protein
            ARALYDRAFT_892577 [Arabidopsis lyrata subsp. lyrata]
          Length = 592

 Score =  494 bits (1273), Expect = e-137
 Identities = 256/430 (59%), Positives = 316/430 (73%), Gaps = 2/430 (0%)
 Frame = +3

Query: 15   RKQKQRVALKNDMWS-TKPLSDGQSNRVTFNLTPEIILQIFAEKPAVRQAYLNFVPKKMS 191
            RK KQ+V LK+ M S  KP +DG++NRVTFNLTPEII QIFAEKPAVRQA++N+VP KM+
Sbjct: 168  RKSKQQVGLKSMMVSGIKPSTDGRTNRVTFNLTPEIIFQIFAEKPAVRQAFINYVPSKMT 227

Query: 192  EKEFWTKYSRAEYLHSTKNVVXXXXXXXXXXXXXVFLKQDDMLANEARRKIRRVDPTLDM 371
            EK+FWTKY RAEYL+STKN               VFLK D++LA E R+KIRRVDPTLDM
Sbjct: 228  EKDFWTKYFRAEYLYSTKNTAVAAAEAAEDEELAVFLKPDEILARETRQKIRRVDPTLDM 287

Query: 372  EADEGDDYMHLPDHGLSRDDGKDFLDSQYEPFRRSFPQHLNQHAAVVLQGRVVDFELGDT 551
            EAD+GDDY HL DHG+ RD   D ++ Q + FRRS  Q LN+HAAVVL+GR +D E  DT
Sbjct: 288  EADQGDDYTHLMDHGIQRDGTMDVVEPQNDQFRRSLLQDLNRHAAVVLEGRSIDVESEDT 347

Query: 552  RSVAEALARKKQAELARDVSDVNVDKDRSERISQMAELEDLQAPRDPPVAPLCIKDPRDY 731
            R VAEAL R KQ   A   +  + + +R ER+S++A +EDLQAP++ P+APL IKDPRDY
Sbjct: 348  RIVAEALTRVKQVSKADGETTKDANLERLERMSRLAGMEDLQAPQNFPLAPLSIKDPRDY 407

Query: 732  FDSQQANALNTAGETAFCANQLNFSISTSAAYGSLKECITEIRTLGLTEPIVTSDVALKV 911
            F+SQQ N LN                +   AYG LKE I EIR  GL++P++  +V+ +V
Sbjct: 408  FESQQGNVLNVP------RGAKGLKRNVHEAYGLLKESILEIRATGLSDPLIRPEVSFEV 461

Query: 912  FDGLNRSISCTKYNMGRNPNDSVLDSLPKITKEELLHHWTSIQELLKHFWSSYPVTTNHL 1091
            F  L R+IS  K  +G+NP +S LD LPK TK+E+LHHWTSIQELL+HFWSSYP+TT +L
Sbjct: 462  FSSLTRTISTAKNIIGKNPRESFLDRLPKSTKDEVLHHWTSIQELLRHFWSSYPITTTYL 521

Query: 1092 LSKATKLKDAMSQTYSKLQEMKETVQSDLRHQVSLLVQPMLQALDAAFAHYDTDVQKRSR 1271
             +K  KLKDAMS TYSKL+ MKE+VQSDLRHQVSLLV+PM QALDAAF HY+ D+Q+R+ 
Sbjct: 522  HTKVGKLKDAMSNTYSKLEAMKESVQSDLRHQVSLLVRPMQQALDAAFQHYEADLQRRTA 581

Query: 1272 RS-TERPNGY 1298
            +S  ERPNGY
Sbjct: 582  KSGGERPNGY 591


>ref|XP_002530618.1| TFIIH basal transcription factor complex subunit, putative [Ricinus
            communis] gi|223529828|gb|EEF31761.1| TFIIH basal
            transcription factor complex subunit, putative [Ricinus
            communis]
          Length = 597

 Score =  479 bits (1234), Expect = e-133
 Identities = 247/434 (56%), Positives = 319/434 (73%), Gaps = 4/434 (0%)
 Frame = +3

Query: 9    YSRKQKQRVALKNDMWS-TKPLSDGQSNRVTFNLTPEIILQIFAEKPAVRQAYLNFVPKK 185
            +SRK KQRV LK+ M + +KPL DGQ+N+VTFNLTPEI+ +IFAEKPAV QAYL+ VP K
Sbjct: 166  FSRKSKQRVGLKSVMLADSKPLIDGQTNKVTFNLTPEIVREIFAEKPAVHQAYLSLVPNK 225

Query: 186  MSEKEFWTKYSRAEYLHSTKNVVXXXXXXXXXXXXXVFLKQDDMLANEARRKIRRVDPTL 365
            MSE++FWTKY RAEYL  ++N+              +FLK DD+LA+E R+KIR VDPTL
Sbjct: 226  MSERDFWTKYCRAEYLQRSRNIHAAAAEAAEDEELALFLKPDDILASETRQKIRCVDPTL 285

Query: 366  DMEADEGDDYMHLPDHGLSRDDGKDFLDSQYEPFRRSFPQHLNQHAAVVLQGRVVDFE-L 542
            DMEAD+GDDY HLPDHG+ RD  KD ++SQ+EP+RR+  Q LN+HAAVVL+G  +D E L
Sbjct: 286  DMEADQGDDYTHLPDHGIVRDGSKDVIESQHEPYRRTLLQDLNRHAAVVLEGTAIDDEQL 345

Query: 543  GDTRSVAEALARKKQA-ELARDVSDVNVDKDRSERISQMAELEDLQAPRDPPVAPLCIKD 719
             DT++VA+ALAR K+  +     +D N +++RS RISQM E+EDLQ   D  +APLCIKD
Sbjct: 346  QDTKAVADALARSKRGIKTINREADGNANQERSNRISQMMEIEDLQGSNDHHLAPLCIKD 405

Query: 720  PRDYFDSQQANALNTAGETAFCANQLNFSISTSAAYGSLKECITEIRTLGLTEPIVTSDV 899
            PRDYFDSQQA+AL  + +          S+S+  AY SL++ IT+ + +GL +PIV  ++
Sbjct: 406  PRDYFDSQQASALKNSRDIPSGTEAARCSLSSQEAYASLRDSITQTKAMGLNDPIVKPEI 465

Query: 900  ALKVFDGLNRSISCTKYNMGRNPNDSVLDSLPKITKEELLHHWTSIQELLKHFWSSYPVT 1079
            A KV   L  +IS TKY++G+N  +SVLD LP   KEELLHHW SI+ELL+H+WSSYP+T
Sbjct: 466  ATKVLSILTHNISSTKYHLGKNSRESVLDRLPNTIKEELLHHWMSIEELLRHYWSSYPIT 525

Query: 1080 TNHLLSKATKLKDAMSQTYSKLQEMKETVQSDLR-HQVSLLVQPMLQALDAAFAHYDTDV 1256
            T +L +K ++LKDAMS+  S+LQEMKE+VQSDL  H  SL + P   AL+AA  HYD D+
Sbjct: 526  TAYLYAKVSRLKDAMSKIDSQLQEMKESVQSDLXFHATSLGIVP---ALEAAMQHYDADL 582

Query: 1257 QKRSRRSTERPNGY 1298
            QKRS +S ERPNGY
Sbjct: 583  QKRSAKSAERPNGY 596