BLASTX nr result

ID: Cephaelis21_contig00016191 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00016191
         (2455 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002285475.1| PREDICTED: transcription factor bHLH74 [Viti...   461   e-127
emb|CAN60403.1| hypothetical protein VITISV_034133 [Vitis vinifera]   432   e-118
gb|ABN51065.1| basic helix-loop-helix protein [Sesamum indicum]       426   e-116
ref|XP_002510047.1| DNA binding protein, putative [Ricinus commu...   423   e-115
ref|XP_002306505.1| predicted protein [Populus trichocarpa] gi|2...   399   e-108

>ref|XP_002285475.1| PREDICTED: transcription factor bHLH74 [Vitis vinifera]
            gi|302142156|emb|CBI19359.3| unnamed protein product
            [Vitis vinifera]
          Length = 430

 Score =  461 bits (1186), Expect = e-127
 Identities = 244/408 (59%), Positives = 295/408 (72%), Gaps = 10/408 (2%)
 Frame = -1

Query: 1195 MGTDDNGDMGFQQSGGGSILNCPSSGMDTNSMSNKVAGMGMCSETMFKCSTGVDPFYGSG 1016
            MG DDNG+MGF  +   SILNCPSSGM+T+ +S KV GM M S +M+K S G DPF+GSG
Sbjct: 1    MGIDDNGNMGFPNTSQ-SILNCPSSGMNTHPISEKVTGMTMSSASMYKSSNGGDPFFGSG 59

Query: 1015 WDSLVSLNHGENFGGSTVVPHHTEFANSHYPVALQNQPISSSSHVVHYPSDSGLGDMVPK 836
            WD +VSL+  ENFGGS++V H +EFANS YPV L+NQ I S+ H+V YPS+S L +MVPK
Sbjct: 60   WDPIVSLSQNENFGGSSMVSH-SEFANSAYPVVLENQGIGSTPHLVLYPSNSSLVEMVPK 118

Query: 835  IPSFGSGAFSEMVNSFGLPECGQITETSFHPNYAQK---------NGAATQDNCQVSEDR 683
            +P FGSG+FSEMV SFGLPECGQ   +   PN+            NGA +Q+  Q+SE  
Sbjct: 119  LPCFGSGSFSEMVASFGLPECGQTANSGCPPNFPPNKEGLTEKSLNGAQSQEGHQISEGD 178

Query: 682  VLGNSPNGKKKRKASDSHSPLNPKKNVEGEQHKDLSGNSSECSKEQDEMKHKMEQSTSTN 503
             +  SP+GK+++ + D   PLN  K+ +GEQ K L   +SE SKEQ+E K K++Q+ S N
Sbjct: 179  AVDASPSGKRRKSSFDPRPPLNTSKSADGEQPKGLPWENSEFSKEQEEKKLKIDQNMSPN 238

Query: 502  LRSKQAGKQSKENSNSGEPSKETYIHVRAKRGQATNSHSLAXXXXXXXXXXXXRLLQELV 323
            LR KQ  K +K+NS++GE  KE YIHVRA+RGQATNSHSLA            RLLQELV
Sbjct: 239  LRGKQPNKHAKDNSSNGEAPKENYIHVRARRGQATNSHSLAERVRREKISERMRLLQELV 298

Query: 322  PGCNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELNIDLERIFSKDILHSRGNN 143
            PGCNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELNID+ER+ SKDIL+SRG +
Sbjct: 299  PGCNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELNIDIERLLSKDILNSRGGS 358

Query: 142  AAAQGIGPGMNSAHAFA-GYPQGPFPGIPGATPQFHPLAQTVWDNELQ 2
             +  G GPGM+S+H +  G  QG  PGIP  TPQFH   Q VWD ELQ
Sbjct: 359  TSVLGFGPGMSSSHPYPHGISQGTLPGIP--TPQFHS-TQAVWDGELQ 403


>emb|CAN60403.1| hypothetical protein VITISV_034133 [Vitis vinifera]
          Length = 484

 Score =  432 bits (1112), Expect = e-118
 Identities = 232/408 (56%), Positives = 284/408 (69%), Gaps = 10/408 (2%)
 Frame = -1

Query: 1195 MGTDDNGDMGFQQSGGGSILNCPSSGMDTNSMSNKVAGMGMCSETMFKCSTGVDPFYGSG 1016
            MG DDNG+MGF  +   SILNCPSSGM+T+ +S KV GM M S +M+K S G DPF+GSG
Sbjct: 1    MGIDDNGNMGFPNTSQ-SILNCPSSGMNTHPISEKVTGMTMSSASMYKSSNGGDPFFGSG 59

Query: 1015 WDSLVSLNHGENFGGSTVVPHHTEFANSHYPVALQNQPISSSSHVVHYPSDSGLGDMVPK 836
            WD +VSL+  ENFGGS++V H +EFANS YPV L+NQ I S+ H+V YPS+S L +MVPK
Sbjct: 60   WDPIVSLSQNENFGGSSMVSH-SEFANSAYPVVLENQGIGSTPHLVLYPSNSSLVEMVPK 118

Query: 835  IPSFGSGAFSEMVNSFGLPECGQITETSFHPNYAQK---------NGAATQDNCQVSEDR 683
            +P FGSG+FSEMV SFGLPECGQ   +   PN+            NGA +Q+  Q+SE  
Sbjct: 119  LPCFGSGSFSEMVASFGLPECGQTANSGCPPNFPPNKEGLTEKSLNGAQSQEGHQISEGD 178

Query: 682  VLGNSPNGKKKRKASDSHSPLNPKKNVEGEQHKDLSGNSSECSKEQDEMKHKMEQSTSTN 503
             +  SP+GK+++ + D   PLN  K+ +GEQ K L   +SE SKEQ+E K K++Q+ S N
Sbjct: 179  AVDASPSGKRRKSSFDPRPPLNTSKSADGEQPKGLPWENSEFSKEQEEKKQKIDQNMSPN 238

Query: 502  LRSKQAGKQSKENSNSGEPSKETYIHVRAKRGQATNSHSLAXXXXXXXXXXXXRLLQELV 323
            LR KQ  K +K+NS++GE  KE YIHVRA+RGQATNSHSLA                   
Sbjct: 239  LRGKQPNKHAKDNSSNGEAPKENYIHVRARRGQATNSHSLA------------------- 279

Query: 322  PGCNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELNIDLERIFSKDILHSRGNN 143
                +ITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELNID+ER+ SKDIL+SRG +
Sbjct: 280  ---ERITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELNIDIERLLSKDILNSRGGS 336

Query: 142  AAAQGIGPGMNSAHAFA-GYPQGPFPGIPGATPQFHPLAQTVWDNELQ 2
             +  G GPGM+S+H +  G  QG  PGIP  TPQFH   Q VWD ELQ
Sbjct: 337  TSVLGFGPGMSSSHPYPHGISQGTLPGIP--TPQFHS-TQAVWDGELQ 381


>gb|ABN51065.1| basic helix-loop-helix protein [Sesamum indicum]
          Length = 400

 Score =  426 bits (1096), Expect = e-116
 Identities = 235/403 (58%), Positives = 277/403 (68%), Gaps = 11/403 (2%)
 Frame = -1

Query: 1177 GDMGFQQSGGGSILNCPSSGMDTNSMSNKVAGMGMCSETMFKCSTGVDPFYGS-GWDSLV 1001
            GD  FQ     SILNCPSS M T S+S+ VAGM +CSE+MFK   G+DPFY S GWD ++
Sbjct: 2    GDRVFQHRNSSSILNCPSSVMATTSISDNVAGMSICSESMFKPPNGIDPFYSSSGWDPVI 61

Query: 1000 SLNHGENFGGSTVVPHHTEFANSHYPVALQNQPISSSSHVVHYPSDSGLGDMVPKIPSFG 821
            S +   NFG S++V  + EFAN +YPV L+NQ + SSSH+VH+PSDSGL  MVPKIPSFG
Sbjct: 62   SQDQSGNFGNSSMVLQN-EFANPNYPVLLENQTMGSSSHLVHFPSDSGLVGMVPKIPSFG 120

Query: 820  SGAFSEMVNSFGLPECGQITETSFHPNYAQKNGAATQ----------DNCQVSEDRVLGN 671
            SG+FSE+V+SFG            H N+AQ NGA  Q          D+ Q SE+ VLG 
Sbjct: 121  SGSFSEIVSSFG------------HSNFAQNNGAGVQNTVKNVEDAQDHRQDSENGVLGA 168

Query: 670  SPNGKKKRKASDSHSPLNPKKNVEGEQHKDLSGNSSECSKEQDEMKHKMEQSTSTNLRSK 491
            SPNGK+KRK            NVE E+ KD + + +E  KE DE K+    S     RS+
Sbjct: 169  SPNGKRKRK------------NVEVEKQKDQTRDLAELPKEYDEKKNSGPSS-----RSR 211

Query: 490  QAGKQSKENSNSGEPSKETYIHVRAKRGQATNSHSLAXXXXXXXXXXXXRLLQELVPGCN 311
            QA K++K+NS+  E SKE YIHVRAKRGQATNSHSLA            RLLQELVPGCN
Sbjct: 212  QAVKEAKDNSSGAEASKENYIHVRAKRGQATNSHSLAERVRRERISERMRLLQELVPGCN 271

Query: 310  KITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELNIDLERIFSKDILHSRGNNAAAQ 131
            KITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELN+D+ER+ SKDILHSRG+NA A 
Sbjct: 272  KITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELNVDIERLLSKDILHSRGSNATAL 331

Query: 130  GIGPGMNSAHAFAGYPQGPFPGIPGATPQFHPLAQTVWDNELQ 2
            GIGPG++S+H F G PQG     PG  PQF  L Q +W+NELQ
Sbjct: 332  GIGPGLSSSHPFQGLPQGTLNAFPGTAPQFQSLPQNLWNNELQ 374


>ref|XP_002510047.1| DNA binding protein, putative [Ricinus communis]
            gi|223550748|gb|EEF52234.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 408

 Score =  423 bits (1087), Expect = e-115
 Identities = 230/410 (56%), Positives = 277/410 (67%), Gaps = 12/410 (2%)
 Frame = -1

Query: 1195 MGT--DDNGDMGFQQSGGGSILNCPSSGMDTNSMSNKVAGMGMCSETMFKCSTGVDPFYG 1022
            MGT  D+N  M FQ SGG S++NC SSGM  N                        PF+ 
Sbjct: 1    MGTSEDNNEGMAFQ-SGGESVMNCQSSGMSAN------------------------PFFP 35

Query: 1021 SGWDSLVSLNHGENFGGSTVVPHHTEFANSHYPVALQNQPISSSSHVVHYPSDSGLGDMV 842
              WD +VSLN  ENFG S V    +EF NSHY + ++NQ I+SSSH+VHY SDS   ++V
Sbjct: 36   PAWDPVVSLNQHENFGASMV--SQSEFTNSHYAIVMENQGINSSSHLVHYQSDSSYVELV 93

Query: 841  PKIPSFGSGAFSEMVNSFGLPECGQITETSFHPNYAQK----------NGAATQDNCQVS 692
            PK PS+GSG+FSEMV+SFGL +CGQI+ +  HPNY             N A +Q++ Q+S
Sbjct: 94   PKFPSYGSGSFSEMVSSFGLTDCGQISNSGCHPNYTSNSAANNERTITNSALSQEDHQLS 153

Query: 691  EDRVLGNSPNGKKKRKASDSHSPLNPKKNVEGEQHKDLSGNSSECSKEQDEMKHKMEQST 512
            E+ V+G SP+GK++++ ++  SP +P KN E E HKD SGNSS+  KEQDE K + EQ+T
Sbjct: 154  EEPVVGVSPDGKRRKRLAEPSSPFDPNKNAE-EMHKDPSGNSSDIPKEQDEKKSRTEQNT 212

Query: 511  STNLRSKQAGKQSKENSNSGEPSKETYIHVRAKRGQATNSHSLAXXXXXXXXXXXXRLLQ 332
            + NLR KQA KQ+KENS+SGE  KE YIHVRA+RGQATNSHSLA            RLLQ
Sbjct: 213  AANLRGKQAAKQAKENSHSGEAPKENYIHVRARRGQATNSHSLAERVRREKISERMRLLQ 272

Query: 331  ELVPGCNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELNIDLERIFSKDILHSR 152
            ELVPGCNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELNID+ERI SKDILHSR
Sbjct: 273  ELVPGCNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELNIDIERILSKDILHSR 332

Query: 151  GNNAAAQGIGPGMNSAHAFAGYPQGPFPGIPGATPQFHPLAQTVWDNELQ 2
            G NAA  G+ PG+N+     G      P IP   PQF P+  TV +N+LQ
Sbjct: 333  GGNAAIMGLSPGINAHPYSHGIFPPNIPVIPNTNPQFPPMPHTVLENDLQ 382


>ref|XP_002306505.1| predicted protein [Populus trichocarpa] gi|222855954|gb|EEE93501.1|
            predicted protein [Populus trichocarpa]
          Length = 407

 Score =  399 bits (1024), Expect = e-108
 Identities = 217/401 (54%), Positives = 263/401 (65%), Gaps = 6/401 (1%)
 Frame = -1

Query: 1186 DDNGDMGFQQSGGGSILNCPSSGMDTNSMSNKVAGMGMCSETMFKCSTGVDPFYGSGWDS 1007
            D+NGD+G+Q     S++ CPSSGM+TN                        PFY S WD 
Sbjct: 6    DNNGDLGYQNRVE-SVMKCPSSGMNTN------------------------PFYVSAWDP 40

Query: 1006 LVSLNHGENFGGSTVVPHHTEFANSHYPVALQNQPISSSSHVVHYPSDSGLGDMVPKIPS 827
            +VSL+   NFGGS+     +EF+NS +P+ ++N  IS++ H+VHYPSDSG  ++VPK P 
Sbjct: 41   VVSLSQLGNFGGSS-TGSQSEFSNSPFPIVMENPGISNTCHLVHYPSDSGFVELVPKFPG 99

Query: 826  FGSGAFSEMVNSFGLPECGQITETSFHPNYAQKN------GAATQDNCQVSEDRVLGNSP 665
            FGSG FSEMV S GL ECGQI      PNY + N      GA  +++ Q+SE+  +G  P
Sbjct: 100  FGSGNFSEMVGSVGLTECGQIVNAGCPPNYKEANNESTAHGAQREEDQQLSEETTIGALP 159

Query: 664  NGKKKRKASDSHSPLNPKKNVEGEQHKDLSGNSSECSKEQDEMKHKMEQSTSTNLRSKQA 485
            NGK++R  ++S+SP +P KN EGE  KD SG SS+ +KE DE K K+EQ+ S NLR KQ 
Sbjct: 160  NGKRRRLVAESNSPFDPNKNAEGEFQKDPSGESSDIAKELDEKKQKIEQNCSANLRGKQV 219

Query: 484  GKQSKENSNSGEPSKETYIHVRAKRGQATNSHSLAXXXXXXXXXXXXRLLQELVPGCNKI 305
             KQ+K+N  SGE  K+ YIHVRA+RGQATNSHSLA            R+LQELVPGCNKI
Sbjct: 220  AKQAKDNPQSGEAPKDDYIHVRARRGQATNSHSLAERVRREKISERMRMLQELVPGCNKI 279

Query: 304  TGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELNIDLERIFSKDILHSRGNNAAAQGI 125
            TGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPEL  D+E+I SKDILHSRG NAA  G 
Sbjct: 280  TGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELYNDVEKIQSKDILHSRGGNAAILGF 339

Query: 124  GPGMNSAHAFAGYPQGPFPGIPGATPQFHPLAQTVWDNELQ 2
             PG+NS     G  Q   P I  + PQF P    V DNELQ
Sbjct: 340  SPGINSHQYSHGIFQPGIPVILNSNPQFSPAHHAVLDNELQ 380

