BLASTX nr result

ID: Cnidium21_contig00003221

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cnidium21_contig00003221
         (1507 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002285475.1| PREDICTED: transcription factor bHLH74 [Viti...   374   e-101
emb|CAN60403.1| hypothetical protein VITISV_034133 [Vitis vinifera]   347   6e-93
ref|XP_002510047.1| DNA binding protein, putative [Ricinus commu...   327   4e-87
gb|ABN51065.1| basic helix-loop-helix protein [Sesamum indicum]       318   3e-84
ref|XP_002306505.1| predicted protein [Populus trichocarpa] gi|2...   295   3e-77

>ref|XP_002285475.1| PREDICTED: transcription factor bHLH74 [Vitis vinifera]
            gi|302142156|emb|CBI19359.3| unnamed protein product
            [Vitis vinifera]
          Length = 430

 Score =  374 bits (959), Expect = e-101
 Identities = 215/403 (53%), Positives = 270/403 (66%), Gaps = 29/403 (7%)
 Frame = +3

Query: 381  MGTEDHGDMRFHHRDGDGLLKVPSSVMSTNPLSDKVAGKVMGSASMFKAINGPDPF-GSG 557
            MG +D+G+M F +     +L  PSS M+T+P+S+KV G  M SASM+K+ NG DPF GSG
Sbjct: 1    MGIDDNGNMGFPNTS-QSILNCPSSGMNTHPISEKVTGMTMSSASMYKSSNGGDPFFGSG 59

Query: 558  WDPLVSLNQSENFXXXXXXXXXXXX-------VDNQAISSTSHLVHYQSDSGLGEMVPKL 716
            WDP+VSL+Q+ENF                   ++NQ I ST HLV Y S+S L EMVPKL
Sbjct: 60   WDPIVSLSQNENFGGSSMVSHSEFANSAYPVVLENQGIGSTPHLVLYPSNSSLVEMVPKL 119

Query: 717  SCFGSGSFSEMVNSYGIPDCGQMSYS-------LNKGGMEKAMLTGTHSRKECQIPEDKT 875
             CFGSGSFSEMV S+G+P+CGQ + S        NK G+ +  L G  S++  QI E   
Sbjct: 120  PCFGSGSFSEMVASFGLPECGQTANSGCPPNFPPNKEGLTEKSLNGAQSQEGHQISEGDA 179

Query: 876  ---SPDRKNKRKASDTDSLLHSNKN-DAEQQ-----DPSAISLEEDDKKQKIEQNITSNL 1028
               SP  K ++ + D    L+++K+ D EQ      + S  S E+++KK KI+QN++ NL
Sbjct: 180  VDASPSGKRRKSSFDPRPPLNTSKSADGEQPKGLPWENSEFSKEQEEKKLKIDQNMSPNL 239

Query: 1029 RGKQTGKQVKENSDSGDAAKDNYIHVRAKRGQATNSHSLAXXXXXXXXXXXXXLLQELVP 1208
            RGKQ  K  K+NS +G+A K+NYIHVRA+RGQATNSHSLA             LLQELVP
Sbjct: 240  RGKQPNKHAKDNSSNGEAPKENYIHVRARRGQATNSHSLAERVRREKISERMRLLQELVP 299

Query: 1209 GCNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPEVNIDIERILAKDLLNSRSSNA 1388
            GCNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPE+NIDIER+L+KD+LNSR  + 
Sbjct: 300  GCNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELNIDIERLLSKDILNSRGGST 359

Query: 1389 SILGY-QGLRPAHPFP----LGNLPGIPNAPPSYHSMPQAVWD 1502
            S+LG+  G+  +HP+P     G LPGIP   P +HS  QAVWD
Sbjct: 360  SVLGFGPGMSSSHPYPHGISQGTLPGIPT--PQFHS-TQAVWD 399


>emb|CAN60403.1| hypothetical protein VITISV_034133 [Vitis vinifera]
          Length = 484

 Score =  347 bits (889), Expect = 6e-93
 Identities = 205/403 (50%), Positives = 261/403 (64%), Gaps = 29/403 (7%)
 Frame = +3

Query: 381  MGTEDHGDMRFHHRDGDGLLKVPSSVMSTNPLSDKVAGKVMGSASMFKAINGPDPF-GSG 557
            MG +D+G+M F +     +L  PSS M+T+P+S+KV G  M SASM+K+ NG DPF GSG
Sbjct: 1    MGIDDNGNMGFPNTS-QSILNCPSSGMNTHPISEKVTGMTMSSASMYKSSNGGDPFFGSG 59

Query: 558  WDPLVSLNQSENFXXXXXXXXXXXX-------VDNQAISSTSHLVHYQSDSGLGEMVPKL 716
            WDP+VSL+Q+ENF                   ++NQ I ST HLV Y S+S L EMVPKL
Sbjct: 60   WDPIVSLSQNENFGGSSMVSHSEFANSAYPVVLENQGIGSTPHLVLYPSNSSLVEMVPKL 119

Query: 717  SCFGSGSFSEMVNSYGIPDCGQMSYS-------LNKGGMEKAMLTGTHSRKECQIPEDKT 875
             CFGSGSFSEMV S+G+P+CGQ + S        NK G+ +  L G  S++  QI E   
Sbjct: 120  PCFGSGSFSEMVASFGLPECGQTANSGCPPNFPPNKEGLTEKSLNGAQSQEGHQISEGDA 179

Query: 876  ---SPDRKNKRKASDTDSLLHSNKN-DAEQQ-----DPSAISLEEDDKKQKIEQNITSNL 1028
               SP  K ++ + D    L+++K+ D EQ      + S  S E+++KKQKI+QN++ NL
Sbjct: 180  VDASPSGKRRKSSFDPRPPLNTSKSADGEQPKGLPWENSEFSKEQEEKKQKIDQNMSPNL 239

Query: 1029 RGKQTGKQVKENSDSGDAAKDNYIHVRAKRGQATNSHSLAXXXXXXXXXXXXXLLQELVP 1208
            RGKQ  K  K+NS +G+A K+NYIHVRA+RGQATNSHSLA                    
Sbjct: 240  RGKQPNKHAKDNSSNGEAPKENYIHVRARRGQATNSHSLA-------------------- 279

Query: 1209 GCNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPEVNIDIERILAKDLLNSRSSNA 1388
               +ITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPE+NIDIER+L+KD+LNSR  + 
Sbjct: 280  --ERITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELNIDIERLLSKDILNSRGGST 337

Query: 1389 SILGY-QGLRPAHPFP----LGNLPGIPNAPPSYHSMPQAVWD 1502
            S+LG+  G+  +HP+P     G LPGIP   P +HS  QAVWD
Sbjct: 338  SVLGFGPGMSSSHPYPHGISQGTLPGIPT--PQFHS-TQAVWD 377


>ref|XP_002510047.1| DNA binding protein, putative [Ricinus communis]
            gi|223550748|gb|EEF52234.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 408

 Score =  327 bits (839), Expect = 4e-87
 Identities = 195/402 (48%), Positives = 243/402 (60%), Gaps = 27/402 (6%)
 Frame = +3

Query: 381  MGT-EDHGDMRFHHRDGDGLLKVPSSVMSTNPLSDKVAGKVMGSASMFKAINGPDPFGSG 557
            MGT ED+ +       G+ ++   SS MS NP                        F   
Sbjct: 1    MGTSEDNNEGMAFQSGGESVMNCQSSGMSANPF-----------------------FPPA 37

Query: 558  WDPLVSLNQSENFXXXXXXXXXXXX------VDNQAISSTSHLVHYQSDSGLGEMVPKLS 719
            WDP+VSLNQ ENF                  ++NQ I+S+SHLVHYQSDS   E+VPK  
Sbjct: 38   WDPVVSLNQHENFGASMVSQSEFTNSHYAIVMENQGINSSSHLVHYQSDSSYVELVPKFP 97

Query: 720  CFGSGSFSEMVNSYGIPDCGQMS-------YSLNKGGMEKAMLTGTH-SRKECQIPEDKT 875
             +GSGSFSEMV+S+G+ DCGQ+S       Y+ N     +  +T +  S+++ Q+ E+  
Sbjct: 98   SYGSGSFSEMVSSFGLTDCGQISNSGCHPNYTSNSAANNERTITNSALSQEDHQLSEEPV 157

Query: 876  ---SPDRKNKRKASDTDSLLHSNKNDAEQ-QDPSA----ISLEEDDKKQKIEQNITSNLR 1031
               SPD K +++ ++  S    NKN  E  +DPS     I  E+D+KK + EQN  +NLR
Sbjct: 158  VGVSPDGKRRKRLAEPSSPFDPNKNAEEMHKDPSGNSSDIPKEQDEKKSRTEQNTAANLR 217

Query: 1032 GKQTGKQVKENSDSGDAAKDNYIHVRAKRGQATNSHSLAXXXXXXXXXXXXXLLQELVPG 1211
            GKQ  KQ KENS SG+A K+NYIHVRA+RGQATNSHSLA             LLQELVPG
Sbjct: 218  GKQAAKQAKENSHSGEAPKENYIHVRARRGQATNSHSLAERVRREKISERMRLLQELVPG 277

Query: 1212 CNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPEVNIDIERILAKDLLNSRSSNAS 1391
            CNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPE+NIDIERIL+KD+L+SR  NA+
Sbjct: 278  CNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELNIDIERILSKDILHSRGGNAA 337

Query: 1392 ILGYQGLRPAHPFPLG----NLPGIPNAPPSYHSMPQAVWDN 1505
            I+G      AHP+  G    N+P IPN  P +  MP  V +N
Sbjct: 338  IMGLSPGINAHPYSHGIFPPNIPVIPNTNPQFPPMPHTVLEN 379


>gb|ABN51065.1| basic helix-loop-helix protein [Sesamum indicum]
          Length = 400

 Score =  318 bits (814), Expect = 3e-84
 Identities = 187/385 (48%), Positives = 238/385 (61%), Gaps = 16/385 (4%)
 Frame = +3

Query: 399  GDMRFHHRDGDGLLKVPSSVMSTNPLSDKVAGKVMGSASMFKAINGPDPF--GSGWDPLV 572
            GD  F HR+   +L  PSSVM+T  +SD VAG  + S SMFK  NG DPF   SGWDP++
Sbjct: 2    GDRVFQHRNSSSILNCPSSVMATTSISDNVAGMSICSESMFKPPNGIDPFYSSSGWDPVI 61

Query: 573  SLNQSENFXXXXXXXXXXXX-------VDNQAISSTSHLVHYQSDSGLGEMVPKLSCFGS 731
            S +QS NF                   ++NQ + S+SHLVH+ SDSGL  MVPK+  FGS
Sbjct: 62   SQDQSGNFGNSSMVLQNEFANPNYPVLLENQTMGSSSHLVHFPSDSGLVGMVPKIPSFGS 121

Query: 732  GSFSEMVNSYGIPDCGQMSYSLNKGGMEKAMLTGTHSRKECQIPEDKT---SPDRKNKRK 902
            GSFSE+V+S+G  +  Q     N  G++  +     ++   Q  E+     SP+ K KRK
Sbjct: 122  GSFSEIVSSFGHSNFAQN----NGAGVQNTVKNVEDAQDHRQDSENGVLGASPNGKRKRK 177

Query: 903  ASDTDSLLHSNKNDAEQQDPSAISLEEDDKKQKIEQNITSNLRGKQTGKQVKENSDSGDA 1082
              + +      K   + +D + +  E D+KK     N   + R +Q  K+ K+NS   +A
Sbjct: 178  NVEVE------KQKDQTRDLAELPKEYDEKK-----NSGPSSRSRQAVKEAKDNSSGAEA 226

Query: 1083 AKDNYIHVRAKRGQATNSHSLAXXXXXXXXXXXXXLLQELVPGCNKITGKAVMLDEIINY 1262
            +K+NYIHVRAKRGQATNSHSLA             LLQELVPGCNKITGKAVMLDEIINY
Sbjct: 227  SKENYIHVRAKRGQATNSHSLAERVRRERISERMRLLQELVPGCNKITGKAVMLDEIINY 286

Query: 1263 VQSLQQQVEFLSMKLATVNPEVNIDIERILAKDLLNSRSSNASILGY-QGLRPAHPF--- 1430
            VQSLQQQVEFLSMKLATVNPE+N+DIER+L+KD+L+SR SNA+ LG   GL  +HPF   
Sbjct: 287  VQSLQQQVEFLSMKLATVNPELNVDIERLLSKDILHSRGSNATALGIGPGLSSSHPFQGL 346

Query: 1431 PLGNLPGIPNAPPSYHSMPQAVWDN 1505
            P G L   P   P + S+PQ +W+N
Sbjct: 347  PQGTLNAFPGTAPQFQSLPQNLWNN 371


>ref|XP_002306505.1| predicted protein [Populus trichocarpa] gi|222855954|gb|EEE93501.1|
            predicted protein [Populus trichocarpa]
          Length = 407

 Score =  295 bits (754), Expect = 3e-77
 Identities = 183/397 (46%), Positives = 239/397 (60%), Gaps = 25/397 (6%)
 Frame = +3

Query: 390  EDHGDMRFHHRDGDGLLKVPSSVMSTNPLSDKVAGKVMGSASMFKAINGPDPFGSGWDPL 569
            +++GD+ + +R  + ++K PSS M+TNP                        + S WDP+
Sbjct: 6    DNNGDLGYQNRV-ESVMKCPSSGMNTNPF-----------------------YVSAWDPV 41

Query: 570  VSLNQSENFXXXXXXXXXXXX-------VDNQAISSTSHLVHYQSDSGLGEMVPKLSCFG 728
            VSL+Q  NF                   ++N  IS+T HLVHY SDSG  E+VPK   FG
Sbjct: 42   VSLSQLGNFGGSSTGSQSEFSNSPFPIVMENPGISNTCHLVHYPSDSGFVELVPKFPGFG 101

Query: 729  SGSFSEMVNSYGIPDCGQMSYS----LNKGGMEKAMLTGTHSRKECQIPEDKTS---PDR 887
            SG+FSEMV S G+ +CGQ+  +      K    ++   G    ++ Q+ E+ T    P+ 
Sbjct: 102  SGNFSEMVGSVGLTECGQIVNAGCPPNYKEANNESTAHGAQREEDQQLSEETTIGALPNG 161

Query: 888  KNKRKASDTDSLLHSNKNDAE---QQDPSA----ISLEEDDKKQKIEQNITSNLRGKQTG 1046
            K +R  ++++S    NKN AE   Q+DPS     I+ E D+KKQKIEQN ++NLRGKQ  
Sbjct: 162  KRRRLVAESNSPFDPNKN-AEGEFQKDPSGESSDIAKELDEKKQKIEQNCSANLRGKQVA 220

Query: 1047 KQVKENSDSGDAAKDNYIHVRAKRGQATNSHSLAXXXXXXXXXXXXXLLQELVPGCNKIT 1226
            KQ K+N  SG+A KD+YIHVRA+RGQATNSHSLA             +LQELVPGCNKIT
Sbjct: 221  KQAKDNPQSGEAPKDDYIHVRARRGQATNSHSLAERVRREKISERMRMLQELVPGCNKIT 280

Query: 1227 GKAVMLDEIINYVQSLQQQVEFLSMKLATVNPEVNIDIERILAKDLLNSRSSNASILGYQ 1406
            GKAVMLDEIINYVQSLQQQVEFLSMKLATVNPE+  D+E+I +KD+L+SR  NA+ILG+ 
Sbjct: 281  GKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELYNDVEKIQSKDILHSRGGNAAILGFS 340

Query: 1407 GLRPAHPFPLGNL-PGIP---NAPPSYHSMPQAVWDN 1505
                +H +  G   PGIP   N+ P +     AV DN
Sbjct: 341  PGINSHQYSHGIFQPGIPVILNSNPQFSPAHHAVLDN 377

