BLASTX nr result

ID: Cimicifuga21_contig00022155

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cimicifuga21_contig00022155
         (1693 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002267728.1| PREDICTED: uncharacterized protein LOC100256...   625   e-177
gb|AEI98626.1| hypothetical protein 111O18.13 [Coffea canephora]      617   e-174
ref|XP_002867001.1| transcription factor IIB family protein [Ara...   595   e-167
ref|NP_195383.1| transcription initiation factor TFIIB [Arabidop...   592   e-167
ref|XP_003611972.1| Transcription initiation factor IIB [Medicag...   592   e-166

>ref|XP_002267728.1| PREDICTED: uncharacterized protein LOC100256546 [Vitis vinifera]
          Length = 540

 Score =  625 bits (1613), Expect = e-177
 Identities = 343/519 (66%), Positives = 387/519 (74%), Gaps = 43/519 (8%)
 Frame = +3

Query: 78   VRCPYCSGTQGRCTTTPLGRSITECCSCGRVVEERQTQTNDLFLLRAQDSPLCLVTSDLA 257
            +RCPYC+  QGRC TT  GR +TEC SCGRV+EERQ+Q + LF LRAQD+PLCLVTSDL 
Sbjct: 1    MRCPYCTAAQGRCGTTGSGRLVTECTSCGRVMEERQSQIHHLFHLRAQDTPLCLVTSDLP 60

Query: 258  SIPSPSVVGLGPKNNEEDEDPFESTGFITTFSTWSLEPTPLFARSSLSFSGHLAELERCI 437
              P            E+++DPFESTGFI+ FSTWSL+P P+FARS LSFSGHLAELER +
Sbjct: 61   PPPPQQA----QTAQEQEDDPFESTGFISAFSTWSLDPCPIFARSCLSFSGHLAELERVL 116

Query: 438  DSPANSS-------------------GPVVVVDNLRAYLQIIDVSSILGLDYDISDHAFQ 560
            +S ++SS                   GP VVVDNLRAYLQIIDV+SILGLDYDI+DHAF+
Sbjct: 117  ESSSSSSSSSSSSAAAAASSSSLASSGPAVVVDNLRAYLQIIDVASILGLDYDIADHAFE 176

Query: 561  LFRDCSSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIAANLPQKEIGKYIKILGE 740
            LFRDCSSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIAAN+PQKEIGKYIKILGE
Sbjct: 177  LFRDCSSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIAANVPQKEIGKYIKILGE 236

Query: 741  ALQLSQPINSNSIAVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAI 920
            ALQLSQPINSNSI+VHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAI
Sbjct: 237  ALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAI 296

Query: 921  YLACQLEDKRKTQAEICKVTGLTEVTLRKVYKELLENWDDLLPANYTPAVPPEKAFPTTA 1100
            YLACQLEDKRKTQAEICKVTGLTEVTLRKVYKELLENWDDLLP+NYTPAVPPEKAFPTTA
Sbjct: 297  YLACQLEDKRKTQAEICKVTGLTEVTLRKVYKELLENWDDLLPSNYTPAVPPEKAFPTTA 356

Query: 1101 ITSTRSSAPRADAVEIISSSDKDKQPEI--TKPSLVSETDNQVKSKMEVEVKGTTSVDQQ 1274
            ITS RSSAPR D +EI+S  D+DKQPE   +KP+   ET +Q + K + E KG       
Sbjct: 357  ITSGRSSAPRVDLIEIVSCLDRDKQPESKPSKPNETLETSHQARGKEDTECKGNCQGAHT 416

Query: 1275 V-LSRGKTA-DSQQQFRVPSFK--GEMNQSKSRGIDINKTQVGP-DSEGKVERDSS---- 1427
            + L+R  T       F    F+  GE NQ  S G+D+N+    P + E K++  +     
Sbjct: 417  LGLNRPTTFWQLPNPFGGSGFRTSGEKNQIVSGGMDLNEGGANPQELEQKLDNGAKAATV 476

Query: 1428 ------------QRSSSNTWPF-HPQLTTSGSSHVQFVQ 1505
                           SS TWPF  P  + S SS+ Q +Q
Sbjct: 477  SVRAEQFPGAPPSSGSSITWPFRQPPSSGSSSSYAQLMQ 515


>gb|AEI98626.1| hypothetical protein 111O18.13 [Coffea canephora]
          Length = 526

 Score =  617 bits (1592), Expect = e-174
 Identities = 342/526 (65%), Positives = 387/526 (73%), Gaps = 29/526 (5%)
 Frame = +3

Query: 78   VRCPYCSGTQGRCTTTPLGRSITECCSCGRVVEERQTQTNDLFLLRAQDSPLCLVTSDLA 257
            +RCPYCS  QGRCTTT  GRSITEC SCGRVVEERQ+Q++ LF +RAQDSPLCLVTSDL 
Sbjct: 1    MRCPYCSAAQGRCTTTSSGRSITECTSCGRVVEERQSQSHHLFHIRAQDSPLCLVTSDLP 60

Query: 258  SIPSPSVVGLGPKNNEEDEDPFESTGFITTFSTWSLEPTPLFARSSLSFSGHLAELERCI 437
            ++P P+     P +N +D+DPFE TGFIT FSTWSLEP PLFA+SS+SF+GHLAELER +
Sbjct: 61   TLPVPATTDSNPTSNSDDDDPFEPTGFITAFSTWSLEPYPLFAQSSISFAGHLAELERVL 120

Query: 438  D-----------SPANSSGPVVVVDNLRAYLQIIDVSSILGLDYDISDHAFQLFRDCSSA 584
            +           S +NSSGP VVVDNLRAYLQIIDV+SILGLDYDISDHAFQLFRDCSSA
Sbjct: 121  ETTSSSSSCSSSSGSNSSGPSVVVDNLRAYLQIIDVASILGLDYDISDHAFQLFRDCSSA 180

Query: 585  TCLRNRSVEALATAALVQAIREAQEPRTLQEISIAANLPQKEIGKYIKILGEALQLSQPI 764
            TCLRNRSVEALATAALVQAIREAQEPRTLQEISIAANLPQKEIGKYIKILGEALQLSQPI
Sbjct: 181  TCLRNRSVEALATAALVQAIREAQEPRTLQEISIAANLPQKEIGKYIKILGEALQLSQPI 240

Query: 765  NSNSIAVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLED 944
            NSNSI+VHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLED
Sbjct: 241  NSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLED 300

Query: 945  KRKTQAEICKVTGLTEVTLRKVYKELLENWDDLLPANYTPAVPPEKAFPTTAITSTRSSA 1124
            KRKTQAEICKVTGLTEVTLRKVYKELLENWDDLLP+NY+P VPPEKAFP   I S RSS 
Sbjct: 301  KRKTQAEICKVTGLTEVTLRKVYKELLENWDDLLPSNYSPVVPPEKAFPMATIASGRSST 360

Query: 1125 PRADAVEIISSSDKDKQPEITKPSLVSETDNQVKSKMEVEVKGTTSVDQQ-------VLS 1283
            PRAD VE + SSDK  + +  + S V +T +  K+K E E + +    Q        +L 
Sbjct: 361  PRADLVE-VPSSDKQTESKNPRTSDVLDTCHVAKNKEETENRDSIHRSQSLPMHRTPILW 419

Query: 1284 RGKTADSQQQFRVPSFKGEMNQSKSRGIDINKTQVGPDSEGKVERDS---------SQRS 1436
            + +          P+ K   N ++   ID  ++Q G D   KV   S         +  +
Sbjct: 420  KSQPPVRSATVNTPADKIH-NITQEMDID-PRSQTGSDE--KVVASSMRPVLFSAPTSSA 475

Query: 1437 SSNTWPFHPQLTTSG--SSHVQFVQPPRFKESKADAYQNRSEDTPR 1568
             S +WP   Q T+SG   S  Q V P     +  D    R  +  +
Sbjct: 476  GSLSWPL--QTTSSGFSPSGQQLVHPRNMASNGLDEMTPRQNENKK 519


>ref|XP_002867001.1| transcription factor IIB family protein [Arabidopsis lyrata subsp.
            lyrata] gi|297312837|gb|EFH43260.1| transcription factor
            IIB family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 506

 Score =  595 bits (1533), Expect = e-167
 Identities = 324/510 (63%), Positives = 374/510 (73%), Gaps = 13/510 (2%)
 Frame = +3

Query: 78   VRCPYCSGTQGRCTTTPLGRSITECCSCGRVVEERQTQTNDLFLLRAQDSPLCLVTSDLA 257
            ++CPYCS TQGRC TT  GRSITEC SCGRV+EERQTQ + LF LRAQD+PLCLVTSDL 
Sbjct: 1    MKCPYCSSTQGRCATTSSGRSITECSSCGRVMEERQTQNHHLFHLRAQDTPLCLVTSDLQ 60

Query: 258  SIPSPSVVGLGPKNNEEDEDPFESTGFITTFSTWSLEPTPLFARSSLSFSGHLAELERCI 437
            +   PS+        E++EDPFE TGFIT FSTWSLEP+P+FARSSLSFSGHLAELER +
Sbjct: 61   TATQPSL--------EDEEDPFEPTGFITAFSTWSLEPSPIFARSSLSFSGHLAELERTL 112

Query: 438  D---SPANSSGPVVVVDNLRAYLQIIDVSSILGLDYDISDHAFQLFRDCSSATCLRNRSV 608
            +   S +NS+   VVVDNLRAY+QIIDV+SILGLD DIS+HAFQLFRDC SATCLRNRSV
Sbjct: 113  ELASSTSNSNSSTVVVDNLRAYMQIIDVASILGLDCDISEHAFQLFRDCCSATCLRNRSV 172

Query: 609  EALATAALVQAIREAQEPRTLQEISIAANLPQKEIGKYIKILGEALQLSQPINSNSIAVH 788
            EALATA LVQAIREAQEPRTLQEISIAAN+ QKEIGKYIKILGEALQLSQPINSNSI+VH
Sbjct: 173  EALATACLVQAIREAQEPRTLQEISIAANVQQKEIGKYIKILGEALQLSQPINSNSISVH 232

Query: 789  MPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEI 968
            MPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEI
Sbjct: 233  MPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEI 292

Query: 969  CKVTGLTEVTLRKVYKELLENWDDLLPANYTPAVPPEKAFPTTAITSTRSSAPRADAVEI 1148
            CK+TGLTEVTLRKVYKELLENWDDLLP+NYTPAVPPEKAFPTT I++TRS+ PRA     
Sbjct: 293  CKITGLTEVTLRKVYKELLENWDDLLPSNYTPAVPPEKAFPTTTISTTRSTTPRAADPPE 352

Query: 1149 ISSSDKDKQPEITKPSLVSETDNQVKSKMEVEVK-------GTTSV---DQQVLSRGKTA 1298
             S  D+DK P +        T  Q K K + + K       GT SV    + +    K  
Sbjct: 353  PSFVDRDK-PSVKPIETSDHTYQQPKGKEDKQPKFRQPWLFGTASVMNPGEMISEPAKPN 411

Query: 1299 DSQQQFRVPSFKGEMNQSKSRGIDINKTQVGPDSEGKVERDSSQRSSSNTWPFHPQLTTS 1478
            +   + +    + ++   ++  I + +    P +        S   S+  W F P  +  
Sbjct: 412  NMDYEKQQLDKQQQLGDKETLPIYLRQHNQFPSNP-----SPSTGISTINWSFRPSGSPG 466

Query: 1479 GSSHVQFVQPPRFKESKADAYQNRSEDTPR 1568
             SS++  V PP+     A+   + S+   R
Sbjct: 467  SSSNLPVVHPPKLPPGYAEIRGSGSQTGSR 496


>ref|NP_195383.1| transcription initiation factor TFIIB [Arabidopsis thaliana]
            gi|2464915|emb|CAB16810.1| transcription initiation
            factor like protein [Arabidopsis thaliana]
            gi|7270613|emb|CAB80331.1| transcription initiation
            factor like protein [Arabidopsis thaliana]
            gi|16509378|emb|CAC82714.1| TFIIB-related protein
            [Arabidopsis thaliana] gi|30102720|gb|AAP21278.1|
            At4g37010 [Arabidopsis thaliana]
            gi|39545884|gb|AAR28005.1| TFIIB5/pBrp [Arabidopsis
            thaliana] gi|332661282|gb|AEE86682.1| transcription
            initiation factor TFIIB [Arabidopsis thaliana]
          Length = 503

 Score =  592 bits (1527), Expect = e-167
 Identities = 326/513 (63%), Positives = 375/513 (73%), Gaps = 17/513 (3%)
 Frame = +3

Query: 78   VRCPYCSGTQGRCTTTPLGRSITECCSCGRVVEERQTQTNDLFLLRAQDSPLCLVTSDLA 257
            ++CPYCS  QGRCTTT  GRSITEC SCGRV+EERQTQ + LF LRAQD+PLCLVTSDL 
Sbjct: 1    MKCPYCSSAQGRCTTTSSGRSITECSSCGRVMEERQTQNHHLFHLRAQDTPLCLVTSDLQ 60

Query: 258  SIPSPSVVGLGPKNNEEDEDPFESTGFITTFSTWSLEPTPLFARSSLSFSGHLAELERCI 437
            +   PS         E++EDPFE TGFIT FSTWSLEP+P+FARSSLSFSGHLAELER +
Sbjct: 61   TAAQPSP--------EDEEDPFEPTGFITAFSTWSLEPSPIFARSSLSFSGHLAELERTL 112

Query: 438  D---SPANSSGPVVVVDNLRAYLQIIDVSSILGLDYDISDHAFQLFRDCSSATCLRNRSV 608
            +   S +NS+   VVVDNLRAY+QIIDV+SILGLD DIS+HAFQLFRDC SATCLRNRSV
Sbjct: 113  ELASSTSNSNSSTVVVDNLRAYMQIIDVASILGLDCDISEHAFQLFRDCCSATCLRNRSV 172

Query: 609  EALATAALVQAIREAQEPRTLQEISIAANLPQKEIGKYIKILGEALQLSQPINSNSIAVH 788
            EALATA LVQAIREAQEPRTLQEISIAAN+ QKEIGKYIKILGEALQLSQPINSNSI+VH
Sbjct: 173  EALATACLVQAIREAQEPRTLQEISIAANVQQKEIGKYIKILGEALQLSQPINSNSISVH 232

Query: 789  MPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEI 968
            MPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEI
Sbjct: 233  MPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEI 292

Query: 969  CKVTGLTEVTLRKVYKELLENWDDLLPANYTPAVPPEKAFPTTAITSTRSSAPRADAVEI 1148
            CK+TGLTEVTLRKVYKELLENWDDLLP+NYTPAVPPEKAFPTT I++TRS+ PRA     
Sbjct: 293  CKITGLTEVTLRKVYKELLENWDDLLPSNYTPAVPPEKAFPTTTISTTRSTTPRAVDPPE 352

Query: 1149 ISSSDKDKQPEITKPSLVSETDNQVKSKMEVEVK-------GTTSV--DQQVLS---RGK 1292
             S  +KDK P          T  Q K K + + K       GT SV    +++S   +  
Sbjct: 353  PSFVEKDK-PSAKPIETFDHTYQQPKGKEDKQPKFRQPWLFGTASVMNPAEMISEPAKPN 411

Query: 1293 TADSQQQFRVPSFKGEMNQSKSRGIDINKTQVGPDSEGKVERDSSQRSSSNTWPFHPQLT 1472
              D ++Q      + ++   ++  I +      P +        S   S+  W F P + 
Sbjct: 412  AMDYEKQQLDKQQQQQLGDKETLPIYLRDHNPFPSNP-----SPSTGISTINWSFRPSVV 466

Query: 1473 TSGSSHVQFVQPPRFKESKAD--AYQNRSEDTP 1565
               SS++  + PP+     A+     +R+ D P
Sbjct: 467  PGSSSNLPVIHPPKLPPGYAEIRGSGSRNADNP 499


>ref|XP_003611972.1| Transcription initiation factor IIB [Medicago truncatula]
            gi|358344389|ref|XP_003636272.1| Transcription initiation
            factor IIB [Medicago truncatula]
            gi|355502207|gb|AES83410.1| Transcription initiation
            factor IIB [Medicago truncatula]
            gi|355513307|gb|AES94930.1| Transcription initiation
            factor IIB [Medicago truncatula]
            gi|388496684|gb|AFK36408.1| unknown [Medicago truncatula]
          Length = 487

 Score =  592 bits (1526), Expect = e-166
 Identities = 316/479 (65%), Positives = 367/479 (76%), Gaps = 1/479 (0%)
 Frame = +3

Query: 78   VRCPYCSGTQGRCTTTPLGRSITECCSCGRVVEERQTQTNDLFLLRAQDSPLCLVTSDLA 257
            ++CPYCS  QGRCTTT  G+SITEC SCGRVVEERQ+  + +F LRAQD+PLCLVT DL 
Sbjct: 1    MKCPYCSAAQGRCTTTTTGKSITECTSCGRVVEERQSHPHHIFHLRAQDNPLCLVTPDL- 59

Query: 258  SIPSPSVVGLGPKNNEEDEDPFESTGFITTFSTWSLEPTPLFARSSLSFSGHLAELERCI 437
              P P++       NEE EDPFE TGFIT FSTWSLEP+PL+ +SSLSFSG+LAELER +
Sbjct: 60   --PPPTLNQTNTDTNEE-EDPFEPTGFITAFSTWSLEPSPLYLQSSLSFSGYLAELERTL 116

Query: 438  DSPANSSGPVVVVDNLRAYLQIIDVSSILGLDYDISDHAFQLFRDCSSATCLRNRSVEAL 617
            +S + +S   VVVDNLRAY+QIIDVSSILGL+ DISDHAFQLFRDC SATCLRNRSVEAL
Sbjct: 117  ESSSTNSSSSVVVDNLRAYMQIIDVSSILGLESDISDHAFQLFRDCCSATCLRNRSVEAL 176

Query: 618  ATAALVQAIREAQEPRTLQEISIAANLPQKEIGKYIKILGEALQLSQPINSNSIAVHMPR 797
            ATAALVQAIREAQEPRTLQEISIAAN+ QKEIGKYIKILGEALQLSQPINSNSI+VHMPR
Sbjct: 177  ATAALVQAIREAQEPRTLQEISIAANVAQKEIGKYIKILGEALQLSQPINSNSISVHMPR 236

Query: 798  FCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKV 977
            FCTLLQLNKSAQELATHIG+VVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKV
Sbjct: 237  FCTLLQLNKSAQELATHIGDVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKV 296

Query: 978  TGLTEVTLRKVYKELLENWDDLLPANYTPAVPPEKAFPTTAITSTRSSAPRADAVEIISS 1157
            TGLTEVTLRKVYKELLENWDDLLP+NYTPAVPPE+AFPTT I S RSS  + DA+E+ SS
Sbjct: 297  TGLTEVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTLIASGRSSTTKVDAIEVTSS 356

Query: 1158 SDKDKQPEITKPSLVSETDNQVKSKMEVEVKGTTSVDQQVLSRGKTADSQQQFRVPSFKG 1337
             D +  PE  KP+      N+ + K       +T++ Q    + +   + Q  + P    
Sbjct: 357  LDSENLPEF-KPNKA----NEAEGKSNARASQSTAIQQSTFWQSQLPSATQNHQNP---- 407

Query: 1338 EMNQSKSRGID-INKTQVGPDSEGKVERDSSQRSSSNTWPFHPQLTTSGSSHVQFVQPP 1511
              N  +S  ID + +    P+   +V   ++  SS N+   +    +S SS ++ +  P
Sbjct: 408  --NVLESMDIDGLQRNHQQPEPMVEVANGAAIGSSVNSNQLYSPPASSSSSVMRSLSAP 464

