BLASTX nr result

ID: Angelica22_contig00009188 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00009188
         (1953 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002285475.1| PREDICTED: transcription factor bHLH74 [Viti...   395   e-107
emb|CAN60403.1| hypothetical protein VITISV_034133 [Vitis vinifera]   363   9e-98
ref|XP_002510047.1| DNA binding protein, putative [Ricinus commu...   346   1e-92
gb|ABN51065.1| basic helix-loop-helix protein [Sesamum indicum]       346   1e-92
ref|XP_002306505.1| predicted protein [Populus trichocarpa] gi|2...   301   5e-79

>ref|XP_002285475.1| PREDICTED: transcription factor bHLH74 [Vitis vinifera]
            gi|302142156|emb|CBI19359.3| unnamed protein product
            [Vitis vinifera]
          Length = 430

 Score =  395 bits (1014), Expect = e-107
 Identities = 233/434 (53%), Positives = 291/434 (67%), Gaps = 33/434 (7%)
 Frame = -3

Query: 1576 MGTEDHGDMRFHHRDGDGLLNVPSSVMSTNPLSDKVAGKAMGSASMFKAINGPDPF-GSG 1400
            MG +D+G+M F +     +LN PSS M+T+P+S+KV G  M SASM+K+ NG DPF GSG
Sbjct: 1    MGIDDNGNMGFPNTS-QSILNCPSSGMNTHPISEKVTGMTMSSASMYKSSNGGDPFFGSG 59

Query: 1399 WDPLASLNQSENFXXXXXXXXXXXS-------VDNQAISGTSHLVHYQSDSGLGEMVPKL 1241
            WDP+ SL+Q+ENF           +       ++NQ I  T HLV Y S+S L EMVPKL
Sbjct: 60   WDPIVSLSQNENFGGSSMVSHSEFANSAYPVVLENQGIGSTPHLVLYPSNSSLVEMVPKL 119

Query: 1240 SCFGSGSFSEMVNSYGIPDCGQMSYS-------LNKGGMEKAMLTGTHSGKECQIPEDKI 1082
             CFGSGSFSEMV S+G+P+CGQ + S        NK G+ +  L G  S +  QI E   
Sbjct: 120  PCFGSGSFSEMVASFGLPECGQTANSGCPPNFPPNKEGLTEKSLNGAQSQEGHQISEGDA 179

Query: 1081 ---SPDRKNKRKASDTDSLLHSNKN-DAEQQ-----DPSAISLEEDDKKQKIEHNITSNS 929
               SP  K ++ + D    L+++K+ D EQ      + S  S E+++KK KI+ N++ N 
Sbjct: 180  VDASPSGKRRKSSFDPRPPLNTSKSADGEQPKGLPWENSEFSKEQEEKKLKIDQNMSPNL 239

Query: 928  RGKQTGKQAKENSDSGDAAKDSYIHVRAKRGQATNSHSLAXXXXXXXXXXXXRLLQELVP 749
            RGKQ  K AK+NS +G+A K++YIHVRA+RGQATNSHSLA            RLLQELVP
Sbjct: 240  RGKQPNKHAKDNSSNGEAPKENYIHVRARRGQATNSHSLAERVRREKISERMRLLQELVP 299

Query: 748  GCNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPEVNIDIERILSKDLLNSRSTSA 569
            GCNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPE+NIDIER+LSKD+LNSR  S 
Sbjct: 300  GCNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELNIDIERLLSKDILNSRGGST 359

Query: 568  AVLGY-QGLRPTHPFP----LGNLPAIPNMTPSYHSMPQAIWDNELHNLLQMGFDAN--- 413
            +VLG+  G+  +HP+P     G LP IP  TP +HS  QA+WD EL +LLQMGFD+N   
Sbjct: 360  SVLGFGPGMSSSHPYPHGISQGTLPGIP--TPQFHS-TQAVWDGELQSLLQMGFDSNPSS 416

Query: 412  NNLGP-TGRPKLDL 374
            NNLG   GR KL+L
Sbjct: 417  NNLGTNAGRSKLEL 430


>emb|CAN60403.1| hypothetical protein VITISV_034133 [Vitis vinifera]
          Length = 484

 Score =  363 bits (932), Expect = 9e-98
 Identities = 217/424 (51%), Positives = 275/424 (64%), Gaps = 32/424 (7%)
 Frame = -3

Query: 1576 MGTEDHGDMRFHHRDGDGLLNVPSSVMSTNPLSDKVAGKAMGSASMFKAINGPDPF-GSG 1400
            MG +D+G+M F +     +LN PSS M+T+P+S+KV G  M SASM+K+ NG DPF GSG
Sbjct: 1    MGIDDNGNMGFPNTS-QSILNCPSSGMNTHPISEKVTGMTMSSASMYKSSNGGDPFFGSG 59

Query: 1399 WDPLASLNQSENFXXXXXXXXXXXS-------VDNQAISGTSHLVHYQSDSGLGEMVPKL 1241
            WDP+ SL+Q+ENF           +       ++NQ I  T HLV Y S+S L EMVPKL
Sbjct: 60   WDPIVSLSQNENFGGSSMVSHSEFANSAYPVVLENQGIGSTPHLVLYPSNSSLVEMVPKL 119

Query: 1240 SCFGSGSFSEMVNSYGIPDCGQMSYS-------LNKGGMEKAMLTGTHSGKECQIPEDKI 1082
             CFGSGSFSEMV S+G+P+CGQ + S        NK G+ +  L G  S +  QI E   
Sbjct: 120  PCFGSGSFSEMVASFGLPECGQTANSGCPPNFPPNKEGLTEKSLNGAQSQEGHQISEGDA 179

Query: 1081 ---SPDRKNKRKASDTDSLLHSNKN-DAEQQ-----DPSAISLEEDDKKQKIEHNITSNS 929
               SP  K ++ + D    L+++K+ D EQ      + S  S E+++KKQKI+ N++ N 
Sbjct: 180  VDASPSGKRRKSSFDPRPPLNTSKSADGEQPKGLPWENSEFSKEQEEKKQKIDQNMSPNL 239

Query: 928  RGKQTGKQAKENSDSGDAAKDSYIHVRAKRGQATNSHSLAXXXXXXXXXXXXRLLQELVP 749
            RGKQ  K AK+NS +G+A K++YIHVRA+RGQATNSHSLA                    
Sbjct: 240  RGKQPNKHAKDNSSNGEAPKENYIHVRARRGQATNSHSLA-------------------- 279

Query: 748  GCNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPEVNIDIERILSKDLLNSRSTSA 569
               +ITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPE+NIDIER+LSKD+LNSR  S 
Sbjct: 280  --ERITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELNIDIERLLSKDILNSRGGST 337

Query: 568  AVLGY-QGLRPTHPFP----LGNLPAIPNMTPSYHSMPQAIWDNELHNLLQMGFDAN--- 413
            +VLG+  G+  +HP+P     G LP IP  TP +HS  QA+WD EL +LLQMGFD+N   
Sbjct: 338  SVLGFGPGMSSSHPYPHGISQGTLPGIP--TPQFHS-TQAVWDGELQSLLQMGFDSNPSS 394

Query: 412  NNLG 401
            NNLG
Sbjct: 395  NNLG 398


>ref|XP_002510047.1| DNA binding protein, putative [Ricinus communis]
            gi|223550748|gb|EEF52234.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 408

 Score =  346 bits (888), Expect = 1e-92
 Identities = 207/431 (48%), Positives = 262/431 (60%), Gaps = 30/431 (6%)
 Frame = -3

Query: 1576 MGT-EDHGDMRFHHRDGDGLLNVPSSVMSTNPLSDKVAGKAMGSASMFKAINGPDPFGSG 1400
            MGT ED+ +       G+ ++N  SS MS NP                        F   
Sbjct: 1    MGTSEDNNEGMAFQSGGESVMNCQSSGMSANPF-----------------------FPPA 37

Query: 1399 WDPLASLNQSENFXXXXXXXXXXXS------VDNQAISGTSHLVHYQSDSGLGEMVPKLS 1238
            WDP+ SLNQ ENF           +      ++NQ I+ +SHLVHYQSDS   E+VPK  
Sbjct: 38   WDPVVSLNQHENFGASMVSQSEFTNSHYAIVMENQGINSSSHLVHYQSDSSYVELVPKFP 97

Query: 1237 CFGSGSFSEMVNSYGIPDCGQMS-------YSLNKGGMEKAMLTGTH-SGKECQIPEDKI 1082
             +GSGSFSEMV+S+G+ DCGQ+S       Y+ N     +  +T +  S ++ Q+ E+ +
Sbjct: 98   SYGSGSFSEMVSSFGLTDCGQISNSGCHPNYTSNSAANNERTITNSALSQEDHQLSEEPV 157

Query: 1081 ---SPDRKNKRKASDTDSLLHSNKNDAEQ-QDPSA----ISLEEDDKKQKIEHNITSNSR 926
               SPD K +++ ++  S    NKN  E  +DPS     I  E+D+KK + E N  +N R
Sbjct: 158  VGVSPDGKRRKRLAEPSSPFDPNKNAEEMHKDPSGNSSDIPKEQDEKKSRTEQNTAANLR 217

Query: 925  GKQTGKQAKENSDSGDAAKDSYIHVRAKRGQATNSHSLAXXXXXXXXXXXXRLLQELVPG 746
            GKQ  KQAKENS SG+A K++YIHVRA+RGQATNSHSLA            RLLQELVPG
Sbjct: 218  GKQAAKQAKENSHSGEAPKENYIHVRARRGQATNSHSLAERVRREKISERMRLLQELVPG 277

Query: 745  CNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPEVNIDIERILSKDLLNSRSTSAA 566
            CNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPE+NIDIERILSKD+L+SR  +AA
Sbjct: 278  CNKITGKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELNIDIERILSKDILHSRGGNAA 337

Query: 565  VLGYQGLRPTHPFPLG----NLPAIPNMTPSYHSMPQAIWDNELHNLLQMGFD---ANNN 407
            ++G       HP+  G    N+P IPN  P +  MP  + +N+L NL QMGFD   A ++
Sbjct: 338  IMGLSPGINAHPYSHGIFPPNIPVIPNTNPQFPPMPHTVLENDLQNLFQMGFDSGSAIDS 397

Query: 406  LGPTGRPKLDL 374
            LGP GR K +L
Sbjct: 398  LGPNGRLKPEL 408


>gb|ABN51065.1| basic helix-loop-helix protein [Sesamum indicum]
          Length = 400

 Score =  346 bits (887), Expect = 1e-92
 Identities = 203/414 (49%), Positives = 263/414 (63%), Gaps = 19/414 (4%)
 Frame = -3

Query: 1558 GDMRFHHRDGDGLLNVPSSVMSTNPLSDKVAGKAMGSASMFKAINGPDPF--GSGWDPLA 1385
            GD  F HR+   +LN PSSVM+T  +SD VAG ++ S SMFK  NG DPF   SGWDP+ 
Sbjct: 2    GDRVFQHRNSSSILNCPSSVMATTSISDNVAGMSICSESMFKPPNGIDPFYSSSGWDPVI 61

Query: 1384 SLNQSENFXXXXXXXXXXXS-------VDNQAISGTSHLVHYQSDSGLGEMVPKLSCFGS 1226
            S +QS NF           +       ++NQ +  +SHLVH+ SDSGL  MVPK+  FGS
Sbjct: 62   SQDQSGNFGNSSMVLQNEFANPNYPVLLENQTMGSSSHLVHFPSDSGLVGMVPKIPSFGS 121

Query: 1225 GSFSEMVNSYGIPDCGQMSYSLNKGGMEKAMLTGTHSGKECQIPEDKI---SPDRKNKRK 1055
            GSFSE+V+S+G  +  Q     N  G++  +     +    Q  E+ +   SP+ K KRK
Sbjct: 122  GSFSEIVSSFGHSNFAQN----NGAGVQNTVKNVEDAQDHRQDSENGVLGASPNGKRKRK 177

Query: 1054 ASDTDSLLHSNKNDAEQQDPSAISLEEDDKKQKIEHNITSNSRGKQTGKQAKENSDSGDA 875
              + +      K   + +D + +  E D+KK     N   +SR +Q  K+AK+NS   +A
Sbjct: 178  NVEVE------KQKDQTRDLAELPKEYDEKK-----NSGPSSRSRQAVKEAKDNSSGAEA 226

Query: 874  AKDSYIHVRAKRGQATNSHSLAXXXXXXXXXXXXRLLQELVPGCNKITGKAVMLDEIINY 695
            +K++YIHVRAKRGQATNSHSLA            RLLQELVPGCNKITGKAVMLDEIINY
Sbjct: 227  SKENYIHVRAKRGQATNSHSLAERVRRERISERMRLLQELVPGCNKITGKAVMLDEIINY 286

Query: 694  VQSLQQQVEFLSMKLATVNPEVNIDIERILSKDLLNSRSTSAAVLGY-QGLRPTHPF--- 527
            VQSLQQQVEFLSMKLATVNPE+N+DIER+LSKD+L+SR ++A  LG   GL  +HPF   
Sbjct: 287  VQSLQQQVEFLSMKLATVNPELNVDIERLLSKDILHSRGSNATALGIGPGLSSSHPFQGL 346

Query: 526  PLGNLPAIPNMTPSYHSMPQAIWDNELHNLLQMGFDAN---NNLGPTGRPKLDL 374
            P G L A P   P + S+PQ +W+NEL N+LQ G+D+N    +LGP+G  K++L
Sbjct: 347  PQGTLNAFPGTAPQFQSLPQNLWNNELQNILQNGYDSNPSVGSLGPSGLSKMEL 400


>ref|XP_002306505.1| predicted protein [Populus trichocarpa] gi|222855954|gb|EEE93501.1|
            predicted protein [Populus trichocarpa]
          Length = 407

 Score =  301 bits (770), Expect = 5e-79
 Identities = 189/418 (45%), Positives = 249/418 (59%), Gaps = 28/418 (6%)
 Frame = -3

Query: 1567 EDHGDMRFHHRDGDGLLNVPSSVMSTNPLSDKVAGKAMGSASMFKAINGPDPFGSGWDPL 1388
            +++GD+ + +R  + ++  PSS M+TNP                        + S WDP+
Sbjct: 6    DNNGDLGYQNRV-ESVMKCPSSGMNTNPF-----------------------YVSAWDPV 41

Query: 1387 ASLNQSENFXXXXXXXXXXXS-------VDNQAISGTSHLVHYQSDSGLGEMVPKLSCFG 1229
             SL+Q  NF           S       ++N  IS T HLVHY SDSG  E+VPK   FG
Sbjct: 42   VSLSQLGNFGGSSTGSQSEFSNSPFPIVMENPGISNTCHLVHYPSDSGFVELVPKFPGFG 101

Query: 1228 SGSFSEMVNSYGIPDCGQMSYSLNKGGMEKAMLTGTHSG----KECQIPEDKIS---PDR 1070
            SG+FSEMV S G+ +CGQ+  +      ++A    T  G    ++ Q+ E+      P+ 
Sbjct: 102  SGNFSEMVGSVGLTECGQIVNAGCPPNYKEANNESTAHGAQREEDQQLSEETTIGALPNG 161

Query: 1069 KNKRKASDTDSLLHSNKNDAE---QQDPSA----ISLEEDDKKQKIEHNITSNSRGKQTG 911
            K +R  ++++S    NKN AE   Q+DPS     I+ E D+KKQKIE N ++N RGKQ  
Sbjct: 162  KRRRLVAESNSPFDPNKN-AEGEFQKDPSGESSDIAKELDEKKQKIEQNCSANLRGKQVA 220

Query: 910  KQAKENSDSGDAAKDSYIHVRAKRGQATNSHSLAXXXXXXXXXXXXRLLQELVPGCNKIT 731
            KQAK+N  SG+A KD YIHVRA+RGQATNSHSLA            R+LQELVPGCNKIT
Sbjct: 221  KQAKDNPQSGEAPKDDYIHVRARRGQATNSHSLAERVRREKISERMRMLQELVPGCNKIT 280

Query: 730  GKAVMLDEIINYVQSLQQQVEFLSMKLATVNPEVNIDIERILSKDLLNSRSTSAAVLGYQ 551
            GKAVMLDEIINYVQSLQQQVEFLSMKLATVNPE+  D+E+I SKD+L+SR  +AA+LG+ 
Sbjct: 281  GKAVMLDEIINYVQSLQQQVEFLSMKLATVNPELYNDVEKIQSKDILHSRGGNAAILGFS 340

Query: 550  GLRPTHPFPLG----NLPAIPNMTPSYHSMPQAIWDNELHNLLQMGFDAN---NNLGP 398
                +H +  G     +P I N  P +     A+ DNEL +  QMGFD++   ++LGP
Sbjct: 341  PGINSHQYSHGIFQPGIPVILNSNPQFSPAHHAVLDNELQSFFQMGFDSSSAVDSLGP 398

