BLASTX nr result
ID: Cnidium21_contig00003221
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cnidium21_contig00003221
         (1507 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

ref|XP_002285475.1| PREDICTED: transcription factor bHLH74 [Viti...   374   e-101
emb|CAN60403.1| hypothetical protein VITISV_034133 [Vitis vinifera]   347   6e-93
ref|XP_002510047.1| DNA binding protein, putative [Ricinus commu...   327   4e-87
gb|ABN51065.1| basic helix-loop-helix protein [Sesamum indicum]       318   3e-84
ref|XP_002306505.1| predicted protein [Populus trichocarpa] gi|2...   295   3e-77

>ref|XP_002285475.1| PREDICTED: transcription factor bHLH74 [Vitis vinifera]
 gi|302142156|emb|CBI19359.3| unnamed protein product [Vitis vinifera]
          Length = 430

 Score = 374 bits (959), Expect = e-101
 Identities = 215/403 (53%), Positives = 270/403 (66%), Gaps = 29/403 (7%)
 Frame = +3

Query: 381  MGTEDHGDMRFHHRDGDGLLKVPSSVMSTNPLSDKVAGKVMGSASMFKAINGPDPF-GSG 557
            MG +D+G+M F + +L PSS M+T+P+S+KV G M SASM+K+ NG DPF GSG
Sbjct: 1    MGIDDNGNMGFPNTS-QSILNCPSSGMNTHPISEKVTGMTMSSASMYKSSNGGDPFFGSG 59

Query: 558  WDPLVSLNQSENFXXXXXXXXXXXX-------VDNQAISSTSHLVHYQSDSGLGEMVPKL 716
            WDP+VSL+Q+ENF ++NQ I ST HLV Y S+S L EMVPKL
Sbjct: 60   WDPIVSLSQNENFGGSSMVSHSEFANSAYPVVLENQGIGSTPHLVLYPSNSSLVEMVPKL 119

Query: 717  SCFGSGSFSEMVNSYGIPDCGQMSYS-------LNKGGMEKAMLTGTHSRKECQIPEDKT 875
            CFGSGSFSEMV S+G+P+CGQ + S NK G+ + L G S++ QI E
Sbjct: 120  PCFGSGSFSEMVASFGLPECGQTANSGCPPNFPPNKEGLTEKSLNGAQSQEGHQISEGDA 179

Query: 876  ---SPDRKNKRKASDTDSLLHSNKN-DAEQQ-----DPSAISLEEDDKKQKIEQNITSNL 1028
            SP K ++ + D L+++K+ D EQ + S S E+++KK KI+QN++ NL
Sbjct: 180  VDASPSGKRRKSSFDPRPPLNTSKSADGEQPKGLPWENSEFSKEQEEKKLKIDQNMSPNL 239

Query: 1029 RGKQTGKQVKENSDSGDAAKDNYIHVRAKRGQATNSHSLAXXXXXXXXXXXXXLLQELVP 1208
            RGKQ K K+NS +G+A K+NYIHVRA+RGQATNSHSLA LLQELVP
Sbjct: 240  RGKQPNKHAKDNSSNGEAPKENYIHVRARRGQATNSHSLAERVRREKISERMRLLQELVP 299

Query: 1209 GCNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPEVNIDIERILAKDLLNSRSSNA 1388
            GCNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPE+NIDIER+L+KD+LNSR +
Sbjct: 300  GCNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELNIDIERLLSKDILNSRGGST 359

Query: 1389 SILGY-QGLRPAHPFP----LGNLPGIPNAPPSYHSMPQAVWD 1502
            S+LG+ G+ +HP+P G LPGIP P +HS QAVWD
Sbjct: 360  SVLGFGPGMSSSHPYPHGISQGTLPGIPT--PQFHS-TQAVWD 399

>emb|CAN60403.1| hypothetical protein VITISV_034133 [Vitis vinifera]
          Length = 484

 Score = 347 bits (889), Expect = 6e-93
 Identities = 205/403 (50%), Positives = 261/403 (64%), Gaps = 29/403 (7%)
 Frame = +3

Query: 381  MGTEDHGDMRFHHRDGDGLLKVPSSVMSTNPLSDKVAGKVMGSASMFKAINGPDPF-GSG 557
            MG +D+G+M F + +L PSS M+T+P+S+KV G M SASM+K+ NG DPF GSG
Sbjct: 1    MGIDDNGNMGFPNTS-QSILNCPSSGMNTHPISEKVTGMTMSSASMYKSSNGGDPFFGSG 59

Query: 558  WDPLVSLNQSENFXXXXXXXXXXXX-------VDNQAISSTSHLVHYQSDSGLGEMVPKL 716
            WDP+VSL+Q+ENF ++NQ I ST HLV Y S+S L EMVPKL
Sbjct: 60   WDPIVSLSQNENFGGSSMVSHSEFANSAYPVVLENQGIGSTPHLVLYPSNSSLVEMVPKL 119

Query: 717  SCFGSGSFSEMVNSYGIPDCGQMSYS-------LNKGGMEKAMLTGTHSRKECQIPEDKT 875
            CFGSGSFSEMV S+G+P+CGQ + S NK G+ + L G S++ QI E
Sbjct: 120  PCFGSGSFSEMVASFGLPECGQTANSGCPPNFPPNKEGLTEKSLNGAQSQEGHQISEGDA 179

Query: 876  ---SPDRKNKRKASDTDSLLHSNKN-DAEQQ-----DPSAISLEEDDKKQKIEQNITSNL 1028
            SP K ++ + D L+++K+ D EQ + S S E+++KKQKI+QN++ NL
Sbjct: 180  VDASPSGKRRKSSFDPRPPLNTSKSADGEQPKGLPWENSEFSKEQEEKKQKIDQNMSPNL 239

Query: 1029 RGKQTGKQVKENSDSGDAAKDNYIHVRAKRGQATNSHSLAXXXXXXXXXXXXXLLQELVP 1208
            RGKQ K K+NS +G+A K+NYIHVRA+RGQATNSHSLA
Sbjct: 240  RGKQPNKHAKDNSSNGEAPKENYIHVRARRGQATNSHSLA-------------------- 279

Query: 1209 GCNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPEVNIDIERILAKDLLNSRSSNA 1388
            +ITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPE+NIDIER+L+KD+LNSR +
Sbjct: 280  --ERITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELNIDIERLLSKDILNSRGGST 337

Query: 1389 SILGY-QGLRPAHPFP----LGNLPGIPNAPPSYHSMPQAVWD 1502
            S+LG+ G+ +HP+P G LPGIP P +HS QAVWD
Sbjct: 338  SVLGFGPGMSSSHPYPHGISQGTLPGIPT--PQFHS-TQAVWD 377

>ref|XP_002510047.1| DNA binding protein, putative [Ricinus communis]
 gi|223550748|gb|EEF52234.1| DNA binding protein, putative [Ricinus communis]
          Length = 408

 Score = 327 bits (839), Expect = 4e-87
 Identities = 195/402 (48%), Positives = 243/402 (60%), Gaps = 27/402 (6%)
 Frame = +3

Query: 381  MGT-EDHGDMRFHHRDGDGLLKVPSSVMSTNPLSDKVAGKVMGSASMFKAINGPDPFGSG 557
            MGT ED+ + G+ ++ SS MS NP F
Sbjct: 1    MGTSEDNNEGMAFQSGGESVMNCQSSGMSANPF-----------------------FPPA 37

Query: 558  WDPLVSLNQSENFXXXXXXXXXXXX------VDNQAISSTSHLVHYQSDSGLGEMVPKLS 719
            WDP+VSLNQ ENF ++NQ I+S+SHLVHYQSDS E+VPK
Sbjct: 38   WDPVVSLNQHENFGASMVSQSEFTNSHYAIVMENQGINSSSHLVHYQSDSSYVELVPKFP 97

Query: 720  CFGSGSFSEMVNSYGIPDCGQMS-------YSLNKGGMEKAMLTGTH-SRKECQIPEDKT 875
            +GSGSFSEMV+S+G+ DCGQ+S Y+ N + +T + S+++ Q+ E+
Sbjct: 98   SYGSGSFSEMVSSFGLTDCGQISNSGCHPNYTSNSAANNERTITNSALSQEDHQLSEEPV 157

Query: 876  ---SPDRKNKRKASDTDSLLHSNKNDAEQ-QDPSA----ISLEEDDKKQKIEQNITSNLR 1031
            SPD K +++ ++ S NKN E +DPS I E+D+KK + EQN +NLR
Sbjct: 158  VGVSPDGKRRKRLAEPSSPFDPNKNAEEMHKDPSGNSSDIPKEQDEKKSRTEQNTAANLR 217

Query: 1032 GKQTGKQVKENSDSGDAAKDNYIHVRAKRGQATNSHSLAXXXXXXXXXXXXXLLQELVPG 1211
            GKQ KQ KENS SG+A K+NYIHVRA+RGQATNSHSLA LLQELVPG
Sbjct: 218  GKQAAKQAKENSHSGEAPKENYIHVRARRGQATNSHSLAERVRREKISERMRLLQELVPG 277

Query: 1212 CNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPEVNIDIERILAKDLLNSRSSNAS 1391
            CNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPE+NIDIERIL+KD+L+SR NA+
Sbjct: 278  CNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELNIDIERILSKDILHSRGGNAA 337

Query: 1392 ILGYQGLRPAHPFPLG----NLPGIPNAPPSYHSMPQAVWDN 1505
            I+G AHP+ G N+P IPN P + MP V +N
Sbjct: 338  IMGLSPGINAHPYSHGIFPPNIPVIPNTNPQFPPMPHTVLEN 379

>gb|ABN51065.1| basic helix-loop-helix protein [Sesamum indicum]
          Length = 400

 Score = 318 bits (814), Expect = 3e-84
 Identities = 187/385 (48%), Positives = 238/385 (61%), Gaps = 16/385 (4%)
 Frame = +3

Query: 399  GDMRFHHRDGDGLLKVPSSVMSTNPLSDKVAGKVMGSASMFKAINGPDPF--GSGWDPLV 572
            GD F HR+ +L PSSVM+T +SD VAG + S SMFK NG DPF SGWDP++
Sbjct: 2    GDRVFQHRNSSSILNCPSSVMATTSISDNVAGMSICSESMFKPPNGIDPFYSSSGWDPVI 61

Query: 573  SLNQSENFXXXXXXXXXXXX-------VDNQAISSTSHLVHYQSDSGLGEMVPKLSCFGS 731
            S +QS NF ++NQ + S+SHLVH+ SDSGL MVPK+ FGS
Sbjct: 62   SQDQSGNFGNSSMVLQNEFANPNYPVLLENQTMGSSSHLVHFPSDSGLVGMVPKIPSFGS 121

Query: 732  GSFSEMVNSYGIPDCGQMSYSLNKGGMEKAMLTGTHSRKECQIPEDKT---SPDRKNKRK 902
            GSFSE+V+S+G + Q N G++ + ++ Q E+ SP+ K KRK
Sbjct: 122  GSFSEIVSSFGHSNFAQN----NGAGVQNTVKNVEDAQDHRQDSENGVLGASPNGKRKRK 177

Query: 903  ASDTDSLLHSNKNDAEQQDPSAISLEEDDKKQKIEQNITSNLRGKQTGKQVKENSDSGDA 1082
            + + K + +D + + E D+KK N + R +Q K+ K+NS +A
Sbjct: 178  NVEVE------KQKDQTRDLAELPKEYDEKK-----NSGPSSRSRQAVKEAKDNSSGAEA 226

Query: 1083 AKDNYIHVRAKRGQATNSHSLAXXXXXXXXXXXXXLLQELVPGCNKITGKAVMLDEIINY 1262
            +K+NYIHVRAKRGQATNSHSLA LLQELVPGCNKITGKAVMLDEIINY
Sbjct: 227  SKENYIHVRAKRGQATNSHSLAERVRRERISERMRLLQELVPGCNKITGKAVMLDEIINY 286

Query: 1263 VQSLQQQVEFLSMKLATVNPEVNIDIERILAKDLLNSRSSNASILGY-QGLRPAHPF--- 1430
            VQSLQQQVEFLSMKLATVNPE+N+DIER+L+KD+L+SR SNA+ LG GL +HPF
Sbjct: 287  VQSLQQQVEFLSMKLATVNPELNVDIERLLSKDILHSRGSNATALGIGPGLSSSHPFQGL 346

Query: 1431 PLGNLPGIPNAPPSYHSMPQAVWDN 1505
            P G L P P + S+PQ +W+N
Sbjct: 347  PQGTLNAFPGTAPQFQSLPQNLWNN 371

>ref|XP_002306505.1| predicted protein [Populus trichocarpa]
 gi|222855954|gb|EEE93501.1| predicted protein [Populus trichocarpa]
          Length = 407

 Score = 295 bits (754), Expect = 3e-77
 Identities = 183/397 (46%), Positives = 239/397 (60%), Gaps = 25/397 (6%)
 Frame = +3

Query: 390  EDHGDMRFHHRDGDGLLKVPSSVMSTNPLSDKVAGKVMGSASMFKAINGPDPFGSGWDPL 569
            +++GD+ + +R + ++K PSS M+TNP + S WDP+
Sbjct: 6    DNNGDLGYQNRV-ESVMKCPSSGMNTNPF-----------------------YVSAWDPV 41

Query: 570  VSLNQSENFXXXXXXXXXXXX-------VDNQAISSTSHLVHYQSDSGLGEMVPKLSCFG 728
            VSL+Q NF ++N IS+T HLVHY SDSG E+VPK FG
Sbjct: 42   VSLSQLGNFGGSSTGSQSEFSNSPFPIVMENPGISNTCHLVHYPSDSGFVELVPKFPGFG 101

Query: 729  SGSFSEMVNSYGIPDCGQMSYS----LNKGGMEKAMLTGTHSRKECQIPEDKTS---PDR 887
            SG+FSEMV S G+ +CGQ+ + K ++ G ++ Q+ E+ T P+
Sbjct: 102  SGNFSEMVGSVGLTECGQIVNAGCPPNYKEANNESTAHGAQREEDQQLSEETTIGALPNG 161

Query: 888  KNKRKASDTDSLLHSNKNDAE---QQDPSA----ISLEEDDKKQKIEQNITSNLRGKQTG 1046
            K +R ++++S NKN AE Q+DPS I+ E D+KKQKIEQN ++NLRGKQ
Sbjct: 162  KRRRLVAESNSPFDPNKN-AEGEFQKDPSGESSDIAKELDEKKQKIEQNCSANLRGKQVA 220

Query: 1047 KQVKENSDSGDAAKDNYIHVRAKRGQATNSHSLAXXXXXXXXXXXXXLLQELVPGCNKIT 1226
            KQ K+N SG+A KD+YIHVRA+RGQATNSHSLA +LQELVPGCNKIT
Sbjct: 221  KQAKDNPQSGEAPKDDYIHVRARRGQATNSHSLAERVRREKISERMRMLQELVPGCNKIT 280

Query: 1227 GKAVMLDEIINYVQSLQQQVEFLSMKLATVNPEVNIDIERILAKDLLNSRSSNASILGYQ 1406
            GKAVMLDEIINYVQSLQQQVEFLSMKLATVNPE+ D+E+I +KD+L+SR NA+ILG+
Sbjct: 281  GKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELYNDVEKIQSKDILHSRGGNAAILGFS 340

Query: 1407 GLRPAHPFPLGNL-PGIP---NAPPSYHSMPQAVWDN 1505
            +H + G PGIP N+ P + AV DN
Sbjct: 341  PGINSHQYSHGIFQPGIPVILNSNPQFSPAHHAVLDN 377