BLASTX nr result

ID: Angelica23_contig00013420 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00013420
         (1868 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI28665.3| unnamed protein product [Vitis vinifera]              722   0.0  
ref|XP_002268533.1| PREDICTED: uncharacterized protein LOC100267...   720   0.0  
ref|XP_002320771.1| predicted protein [Populus trichocarpa] gi|2...   696   0.0  
ref|NP_001030933.1| SET domain-containing protein [Arabidopsis t...   678   0.0  
ref|NP_171694.3| SET domain-containing protein [Arabidopsis thal...   669   0.0  

>emb|CBI28665.3| unnamed protein product [Vitis vinifera]
          Length = 565

 Score =  722 bits (1864), Expect = 0.0
 Identities = 370/565 (65%), Positives = 442/565 (78%), Gaps = 7/565 (1%)
 Frame = -2

Query: 1864 DESKLEFFLEWLKVNKVELRGCKIKYCDSTKGFGIFSSDDVSKNDGILLVVPLDLAITPM 1685
            +E+KL+ FL+WL++N+VELRGC+IKYCDS KGFGIF ++D S  DGI LVVPLDLAITPM
Sbjct: 5    EEAKLQHFLQWLQLNRVELRGCEIKYCDSNKGFGIFYANDAS--DGIPLVVPLDLAITPM 62

Query: 1684 RVLQDPLLGPECSAMYKEGDVDDRFLMILFLMLERLRKKSTWKPYLDVLPTTFGNPLWFS 1505
            RVLQDP LGPEC AM++EG+VDDR LMILFL +ERLRK S+WKPYLD+LPTTFG PLWF 
Sbjct: 63   RVLQDPFLGPECRAMFEEGEVDDRLLMILFLTVERLRKNSSWKPYLDMLPTTFGTPLWFI 122

Query: 1504 DDELLELKGTTLYRATELQKQKLQSIYNDKVEMLVKRLLDLDGNSESKVSFEDFLWANSI 1325
            DDE +ELKGT+++RATELQK++LQS+Y+DKV+ LVK+LL LDG+S+ +V FEDFLWANSI
Sbjct: 123  DDEFIELKGTSVHRATELQKKQLQSLYDDKVKDLVKKLLILDGDSKGEVHFEDFLWANSI 182

Query: 1324 FWTRALNIPLPRSYVFPENHEPQDSTPSNGSSGNHTYNEELTSNN-----GGKSLELKGV 1160
            FWTRALNIPLPRSYVFP+  E Q+S   N    +  + ++++S N       KS ++ G 
Sbjct: 183  FWTRALNIPLPRSYVFPQIQEEQNSCIPNIIKDSGAFTDQISSGNLVSGMDEKSTDVHGF 242

Query: 1159 EXXXXXXXXXXXQDETVWVEGLVPAIDFCNHDIKASATWEVDGTGLTTGVPLSMYLLSVD 980
            E           Q+E +WVEGLVP IDFCNHD+KA+ATWEVD TGL TGVPLSMYLLSV+
Sbjct: 243  ESQVNRGTSSSMQEEILWVEGLVPGIDFCNHDLKAAATWEVDNTGLKTGVPLSMYLLSVE 302

Query: 979  QSSQPAKKEISISYGDKGNEELLYLYGFVVDNNPDDYLMVHYPSEAIQNVPFFESKLQLL 800
            QS    +KEISISYG+KGNEELLYLYGFV+DNNPDDYLMVHYP E  +NVPF ESK QLL
Sbjct: 303  QSPCHMQKEISISYGNKGNEELLYLYGFVIDNNPDDYLMVHYPMELFKNVPFSESKGQLL 362

Query: 799  EAQKGDMRCLLPRSLLGQGLFEASDKQKESTINGTASHVCNYSWSGQRKLPSYIGKLVFP 620
            EAQK +MRCLL ++LL +G F AS  + E     T   VCNYSWSGQRK PSY+ KLVFP
Sbjct: 363  EAQKAEMRCLLHKTLLDRGFFPASTLKNEQNGKSTDHQVCNYSWSGQRKTPSYLNKLVFP 422

Query: 619  ENFMASLRTIAMQEEDIYRVSSLLEELVGSREERQPSDTEVQAAIWEVCGDSGALQLLVD 440
            E F+ +LRTI+M+E+++ RVSSLLEEL  S   RQP D+E +AA+WE CGDSGALQ+LVD
Sbjct: 423  EAFLTALRTISMEEDELSRVSSLLEELAES-GGRQPLDSETRAAVWEACGDSGALQVLVD 481

Query: 439  LLNMKMMDLEEASGTEDTDTKLLQKANNSE--ECINNGAKISADEKTFSRNKWSSIIYRR 266
            LLN+KMMDLEE SGTED DT+LL+KA  +E  E   +G       K  SRN+WSSI+YRR
Sbjct: 482  LLNVKMMDLEEGSGTEDNDTELLEKALMTEIPEQHTSGTDSCIPHK-MSRNRWSSIVYRR 540

Query: 265  GQKQLTRLFLKXXXXXXXXXLSEGN 191
            GQKQLTRLFLK         LSEGN
Sbjct: 541  GQKQLTRLFLKEAEHALQLSLSEGN 565


>ref|XP_002268533.1| PREDICTED: uncharacterized protein LOC100267311 [Vitis vinifera]
          Length = 561

 Score =  720 bits (1859), Expect = 0.0
 Identities = 368/563 (65%), Positives = 440/563 (78%), Gaps = 5/563 (0%)
 Frame = -2

Query: 1864 DESKLEFFLEWLKVNKVELRGCKIKYCDSTKGFGIFSSDDVSKNDGILLVVPLDLAITPM 1685
            +E+KL+ FL+WL++N+VELRGC+IKYCDS KGFGIF ++D S  DGI LVVPLDLAITPM
Sbjct: 5    EEAKLQHFLQWLQLNRVELRGCEIKYCDSNKGFGIFYANDAS--DGIPLVVPLDLAITPM 62

Query: 1684 RVLQDPLLGPECSAMYKEGDVDDRFLMILFLMLERLRKKSTWKPYLDVLPTTFGNPLWFS 1505
            RVLQDP LGPEC AM++EG+VDDR LMILFL +ERLRK S+WKPYLD+LPTTFG PLWF 
Sbjct: 63   RVLQDPFLGPECRAMFEEGEVDDRLLMILFLTVERLRKNSSWKPYLDMLPTTFGTPLWFI 122

Query: 1504 DDELLELKGTTLYRATELQKQKLQSIYNDKVEMLVKRLLDLDGNSESKVSFEDFLWANSI 1325
            DDE +ELKGT+++RATELQK++LQS+Y+DKV+ LVK+LL LDG+S+ +V FEDFLWANSI
Sbjct: 123  DDEFIELKGTSVHRATELQKKQLQSLYDDKVKDLVKKLLILDGDSKGEVHFEDFLWANSI 182

Query: 1324 FWTRALNIPLPRSYVFPENHEPQDSTPSNGSSGNHTYNEELTSNN-----GGKSLELKGV 1160
            FWTRALNIPLPRSYVFP+  E Q+S   N    +  + ++++S N       KS ++ G 
Sbjct: 183  FWTRALNIPLPRSYVFPQIQEEQNSCIPNIIKDSGAFTDQISSGNLVSGMDEKSTDVHGF 242

Query: 1159 EXXXXXXXXXXXQDETVWVEGLVPAIDFCNHDIKASATWEVDGTGLTTGVPLSMYLLSVD 980
            E           Q+E +WVEGLVP IDFCNHD+KA+ATWEVD TGL TGVPLSMYLLSV+
Sbjct: 243  ESQVNRGTSSSMQEEILWVEGLVPGIDFCNHDLKAAATWEVDNTGLKTGVPLSMYLLSVE 302

Query: 979  QSSQPAKKEISISYGDKGNEELLYLYGFVVDNNPDDYLMVHYPSEAIQNVPFFESKLQLL 800
            QS    +KEISISYG+KGNEELLYLYGFV+DNNPDDYLMVHYP E  +NVPF ESK QLL
Sbjct: 303  QSPCHMQKEISISYGNKGNEELLYLYGFVIDNNPDDYLMVHYPMELFKNVPFSESKGQLL 362

Query: 799  EAQKGDMRCLLPRSLLGQGLFEASDKQKESTINGTASHVCNYSWSGQRKLPSYIGKLVFP 620
            EAQK +MRCLL ++LL +G F AS  + E     T   VCNYSWSGQRK PSY+ KLVFP
Sbjct: 363  EAQKAEMRCLLHKTLLDRGFFPASTLKNEQNGKSTDHQVCNYSWSGQRKTPSYLNKLVFP 422

Query: 619  ENFMASLRTIAMQEEDIYRVSSLLEELVGSREERQPSDTEVQAAIWEVCGDSGALQLLVD 440
            E F+ +LRTI+M+E+++ RVSSLLEEL  S   RQP D+E +AA+WE CGDSGALQ+LVD
Sbjct: 423  EAFLTALRTISMEEDELSRVSSLLEELAES-GGRQPLDSETRAAVWEACGDSGALQVLVD 481

Query: 439  LLNMKMMDLEEASGTEDTDTKLLQKANNSEECINNGAKISADEKTFSRNKWSSIIYRRGQ 260
            LLN+KMMDLEE SGTED DT+LL+KA  +E    +    S      SRN+WSSI+YRRGQ
Sbjct: 482  LLNVKMMDLEEGSGTEDNDTELLEKALMTEIPEQH---TSCIPHKMSRNRWSSIVYRRGQ 538

Query: 259  KQLTRLFLKXXXXXXXXXLSEGN 191
            KQLTRLFLK         LSEGN
Sbjct: 539  KQLTRLFLKEAEHALQLSLSEGN 561


>ref|XP_002320771.1| predicted protein [Populus trichocarpa] gi|222861544|gb|EEE99086.1|
            predicted protein [Populus trichocarpa]
          Length = 551

 Score =  696 bits (1795), Expect = 0.0
 Identities = 350/555 (63%), Positives = 420/555 (75%), Gaps = 22/555 (3%)
 Frame = -2

Query: 1831 LKVNKVELRGCKIKYCDSTKGFGIFSSDDVSKNDGILLVVPLDLAITPMRVLQDPLLGPE 1652
            ++VNKVELRGC IKYC   KGFG+FSS+DVS  DG+LLVVPLDLAITPMRVLQDPL+GPE
Sbjct: 1    IQVNKVELRGCNIKYCGQNKGFGVFSSNDVS--DGVLLVVPLDLAITPMRVLQDPLIGPE 58

Query: 1651 CSAMYKEGDVDDRFLMILFLMLERLRKKSTWKPYLDVLPTTFGNPLWFSDDELLELKGTT 1472
            C +M++EG+VDDRFLMILFLMLERLR  S+WKPYLD+LP TFGNPLWF+DDELLELKGTT
Sbjct: 59   CRSMFEEGEVDDRFLMILFLMLERLRNNSSWKPYLDMLPKTFGNPLWFTDDELLELKGTT 118

Query: 1471 LYRATELQKQKLQSIYNDKVEMLVKRLLDLDGNSESKVSFEDFLWANSIFWTRALNIPLP 1292
            LYRATELQ+++L S+Y DKV+ LV++LL LDG+ ES+V FEDFLWANS+FWTRALNIPLP
Sbjct: 119  LYRATELQRKRLLSLYEDKVKGLVQKLLILDGDLESEVCFEDFLWANSVFWTRALNIPLP 178

Query: 1291 RSYVFPENHEPQDSTPSNGSSGNHTYNEELTSNNGGKSLELKGVEXXXXXXXXXXXQDET 1112
            RSYVFP+  E QDS  S       ++ + L  +      ++ GV+            DET
Sbjct: 179  RSYVFPQVQEDQDSQSSLNIDSGVSHTKALLISGS----KVPGVD---------GQFDET 225

Query: 1111 VWVEGLVPAIDFCNHDIKASATWEVDGTGLTTGVPLSMYLLSVDQSSQPAKKEISISYGD 932
            VWVEGLVP IDFCNHD+KA ATWEVDGTG+TTGVP SMYLLS +++    +KEI+ISYG+
Sbjct: 226  VWVEGLVPGIDFCNHDLKAVATWEVDGTGMTTGVPHSMYLLSAEKTPFQMEKEITISYGN 285

Query: 931  KGNEELLYLYGFVVDNNPDDYLM----------------------VHYPSEAIQNVPFFE 818
            KGNEELLYLYGFV+DNNPD+YLM                      VHYP EAIQNVPF +
Sbjct: 286  KGNEELLYLYGFVIDNNPDEYLMVMPLFGFCNSDVVLLGQYFLLDVHYPVEAIQNVPFSD 345

Query: 817  SKLQLLEAQKGDMRCLLPRSLLGQGLFEASDKQKESTINGTASHVCNYSWSGQRKLPSYI 638
            SK+QLLEAQK +MRCLLP+ LL  G F A     +    G A  +C++SWSGQR++PSY 
Sbjct: 346  SKMQLLEAQKAEMRCLLPKRLLAHGFFPAGTTSNDDNGKGKADKICSFSWSGQRRMPSYA 405

Query: 637  GKLVFPENFMASLRTIAMQEEDIYRVSSLLEELVGSREERQPSDTEVQAAIWEVCGDSGA 458
             KLVFPE F+ +LRTIAMQE+++ + SS LEELVGS   RQP+DTEV+ A+WE CGDSGA
Sbjct: 406  NKLVFPEEFLTTLRTIAMQEDELLKASSFLEELVGSEGVRQPTDTEVRTAVWEACGDSGA 465

Query: 457  LQLLVDLLNMKMMDLEEASGTEDTDTKLLQKANNSEECINNGAKISADEKTFSRNKWSSI 278
            LQLL DLL  K+M+LEE  GTED DT+LL+KA + +   +     S   K  SRN+WSSI
Sbjct: 466  LQLLFDLLQTKVMNLEENFGTEDCDTELLEKAQDVKNIEHKDTDESGHYKFMSRNRWSSI 525

Query: 277  IYRRGQKQLTRLFLK 233
            +YR+GQKQL RLFLK
Sbjct: 526  VYRKGQKQLARLFLK 540


>ref|NP_001030933.1| SET domain-containing protein [Arabidopsis thaliana]
            gi|63003834|gb|AAY25446.1| At1g01920 [Arabidopsis
            thaliana] gi|332189233|gb|AEE27354.1| SET
            domain-containing protein [Arabidopsis thaliana]
          Length = 547

 Score =  678 bits (1749), Expect = 0.0
 Identities = 346/546 (63%), Positives = 414/546 (75%), Gaps = 2/546 (0%)
 Frame = -2

Query: 1864 DESKLEFFLEWLKVNKVELRGCKIKYCDSTKGFGIFSSDDVSKNDGILLVVPLDLAITPM 1685
            +E+KLE FL+WL+VN  ELRGC IKY DS KGFGIF+S     +D +LLVVPLDLAITPM
Sbjct: 6    EEAKLERFLDWLQVNGGELRGCNIKYSDSLKGFGIFASTSTQASDEVLLVVPLDLAITPM 65

Query: 1684 RVLQDPLLGPECSAMYKEGDVDDRFLMILFLMLERLRKKSTWKPYLDVLPTTFGNPLWFS 1505
            RVLQDPLLGPEC  M+++G VDDRFLMILFL LERLR  S+WKPYLD+LPT FGNPLWFS
Sbjct: 66   RVLQDPLLGPECQKMFEQGQVDDRFLMILFLTLERLRINSSWKPYLDMLPTRFGNPLWFS 125

Query: 1504 DDELLELKGTTLYRATELQKQKLQSIYNDKVEMLVKRLLDLDGNSESKVSFEDFLWANSI 1325
            DD++LELKGT LY ATELQK+KL S+Y+DKVE+LV +LL LDG+SESKVSFE FLWANS+
Sbjct: 126  DDDILELKGTNLYHATELQKKKLLSLYHDKVEVLVTKLLILDGDSESKVSFEHFLWANSV 185

Query: 1324 FWTRALNIPLPRSYVFPENHEPQDSTPSNGSSGNHTYNEELTSNNGGKSLELKGVEXXXX 1145
            FW+RALNIPLP S+VFP++   QD T   G   + + + E    N  +  E++       
Sbjct: 186  FWSRALNIPLPHSFVFPQS---QDDT---GECTSTSESPETAPVNSNEEKEIQA------ 233

Query: 1144 XXXXXXXQDETVWVEGLVPAIDFCNHDIKASATWEVDGTGLTTGVPLSMYLLSVDQSSQP 965
                     +T+WVEGLVP IDFCNHD+K  ATWEVDG G  + VP SMYLLSV Q   P
Sbjct: 234  QPAPSVGSGDTIWVEGLVPGIDFCNHDLKPVATWEVDGIGSVSRVPFSMYLLSVAQRPIP 293

Query: 964  AKKEISISYGDKGNEELLYLYGFVVDNNPDDYLMVHYPSEAIQNVPFFESKLQLLEAQKG 785
             KKEISISYG+KGNEELLYLYGFV+DNNPDDYLMVHYP EAI ++PF +SK QLLEAQ  
Sbjct: 294  -KKEISISYGNKGNEELLYLYGFVIDNNPDDYLMVHYPVEAIPSIPFSDSKGQLLEAQNA 352

Query: 784  DMRCLLPRSLLGQGLFEASDKQKESTINGTASHVCNYSWSGQRKLPSYIGKLVFPENFMA 605
             +RCLLP+S+L  G F  +      +        CN+SWSG+RK+P+Y+ KLVFPE+FM 
Sbjct: 353  QLRCLLPKSVLNHGFFPRTTSVIRESDEKETVRSCNFSWSGKRKMPTYMNKLVFPEDFMT 412

Query: 604  SLRTIAMQEEDIYRVSSLLEELVGSREERQPSDTEVQAAIWEVCGDSGALQLLVDLLNMK 425
             LRTIAMQEE+IY+VS++LEELV SR+  QPS+TEV+ A+WE CGDSGALQLLVDLLN K
Sbjct: 413  GLRTIAMQEEEIYKVSAMLEELVESRQGEQPSETEVRMAVWEACGDSGALQLLVDLLNSK 472

Query: 424  MMDLEEASGTEDTDTKLLQKANNSEECINNGAKIS--ADEKTFSRNKWSSIIYRRGQKQL 251
            MM LEE SGTE+ D +LL++A     C+    + S   D +  SRNKWSS++YRRGQKQL
Sbjct: 473  MMKLEENSGTEEQDARLLEEA-----CVLESHEESRDLDGRRMSRNKWSSVVYRRGQKQL 527

Query: 250  TRLFLK 233
            TRL LK
Sbjct: 528  TRLLLK 533


>ref|NP_171694.3| SET domain-containing protein [Arabidopsis thaliana]
            gi|332189232|gb|AEE27353.1| SET domain-containing protein
            [Arabidopsis thaliana]
          Length = 572

 Score =  669 bits (1726), Expect = 0.0
 Identities = 348/568 (61%), Positives = 412/568 (72%), Gaps = 24/568 (4%)
 Frame = -2

Query: 1864 DESKLEFFLEWLKVNKVELRGCKIKYCDSTKGFGIFSSDDVSKNDGILLVVPLDLAITPM 1685
            +E+KLE FL+WL+VN  ELRGC IKY DS KGFGIF+S     +D +LLVVPLDLAITPM
Sbjct: 6    EEAKLERFLDWLQVNGGELRGCNIKYSDSLKGFGIFASTSTQASDEVLLVVPLDLAITPM 65

Query: 1684 RVLQDPLLGPECSAMYKEGDVDDRFLMILFLMLERLRKKSTWKPYLDVLPTTFGNPLWFS 1505
            RVLQDPLLGPEC  M+++G VDDRFLMILFL LERLR  S+WKPYLD+LPT FGNPLWFS
Sbjct: 66   RVLQDPLLGPECQKMFEQGQVDDRFLMILFLTLERLRINSSWKPYLDMLPTRFGNPLWFS 125

Query: 1504 DDELLELKGTTLYRATELQKQKLQSIYNDKVEMLVKRLLDLDGNSESKVSFEDFLWANSI 1325
            DD++LELKGT LY ATELQK+KL S+Y+DKVE+LV +LL LDG+SESKVSFE FLWANS+
Sbjct: 126  DDDILELKGTNLYHATELQKKKLLSLYHDKVEVLVTKLLILDGDSESKVSFEHFLWANSV 185

Query: 1324 FWTRALNIPLPRSYVFPENHEPQDSTPSNGSSGNHTYNEELTSNNGGKSLELKGVEXXXX 1145
            FW+RALNIPLP S+VFP++   QD T    S+        + SN      E KG      
Sbjct: 186  FWSRALNIPLPHSFVFPQS---QDDTGECTSTSESPETAPVNSN------EEKGKSLTSA 236

Query: 1144 XXXXXXXQDETVWVEGLVPAIDFCNHDIKASATWEVDGTGLTTGVPLSMYLLSVDQSSQP 965
                     +T+WVEGLVP IDFCNHD+K  ATWEVDG G  + VP SMYLLSV Q   P
Sbjct: 237  QPAPSVGSGDTIWVEGLVPGIDFCNHDLKPVATWEVDGIGSVSRVPFSMYLLSVAQRPIP 296

Query: 964  AKKEISISYGDKGNEELLYLYGFVVDNNPDDYLM----------------------VHYP 851
             KKEISISYG+KGNEELLYLYGFV+DNNPDDYLM                      VHYP
Sbjct: 297  -KKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIKEMLVNFVLTSVVTFNNGFIQVHYP 355

Query: 850  SEAIQNVPFFESKLQLLEAQKGDMRCLLPRSLLGQGLFEASDKQKESTINGTASHVCNYS 671
             EAI ++PF +SK QLLEAQ   +RCLLP+S+L  G F  +      +        CN+S
Sbjct: 356  VEAIPSIPFSDSKGQLLEAQNAQLRCLLPKSVLNHGFFPRTTSVIRESDEKETVRSCNFS 415

Query: 670  WSGQRKLPSYIGKLVFPENFMASLRTIAMQEEDIYRVSSLLEELVGSREERQPSDTEVQA 491
            WSG+RK+P+Y+ KLVFPE+FM  LRTIAMQEE+IY+VS++LEELV SR+  QPS+TEV+ 
Sbjct: 416  WSGKRKMPTYMNKLVFPEDFMTGLRTIAMQEEEIYKVSAMLEELVESRQGEQPSETEVRM 475

Query: 490  AIWEVCGDSGALQLLVDLLNMKMMDLEEASGTEDTDTKLLQKANNSEECINNGAKIS--A 317
            A+WE CGDSGALQLLVDLLN KMM LEE SGTE+ D +LL++A     C+    + S   
Sbjct: 476  AVWEACGDSGALQLLVDLLNSKMMKLEENSGTEEQDARLLEEA-----CVLESHEESRDL 530

Query: 316  DEKTFSRNKWSSIIYRRGQKQLTRLFLK 233
            D +  SRNKWSS++YRRGQKQLTRL LK
Sbjct: 531  DGRRMSRNKWSSVVYRRGQKQLTRLLLK 558

