BLASTX nr result

ID: Cnidium21_contig00022389 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cnidium21_contig00022389
         (940 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002269094.2| PREDICTED: protein SET DOMAIN GROUP 40-like ...   270   5e-70
ref|XP_002871756.1| SET domain-containing protein [Arabidopsis l...   261   2e-67
emb|CBI27360.3| unnamed protein product [Vitis vinifera]              261   2e-67
ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus ...   259   5e-67
ref|NP_197226.2| protein SET DOMAIN GROUP 40 [Arabidopsis thalia...   259   8e-67

>ref|XP_002269094.2| PREDICTED: protein SET DOMAIN GROUP 40-like [Vitis vinifera]
          Length = 504

 Score =  270 bits (689), Expect = 5e-70
 Identities = 149/275 (54%), Positives = 184/275 (66%), Gaps = 13/275 (4%)
 Frame = -3

Query: 794 VNKFLGWAAKLGITDF-------PLNLNNPXXXXXXXXXXXXXXHFPNAGGRGLGATRDL 636
           + +FL WA +LGI+DF       P  L  P               FP+AGGRGL A RDL
Sbjct: 1   MERFLKWATELGISDFTTTPTTVPSRLQIPHCCVGHSLCVSH---FPHAGGRGLAAARDL 57

Query: 635 RKGELILRVPREALFTTQSVVLQDHNFSVALQKYQSLSCTQKLTVALLNEISKGKSSLWF 456
            +GELIL VP+ AL T+QS+ L+D   SVA++++ SLS  Q LT+ LL E+SKGKSS W 
Sbjct: 58  SQGELILTVPKSALMTSQSL-LKDEKLSVAVKRHTSLSSPQILTICLLAEMSKGKSSWWH 116

Query: 455 PYLKHLPQSYDILASFSQFETQALQVDDAIWAAEKAIGKAKSEWKEAIFLMNDLKIKNKL 276
           PYL  LP+SYD LA+FSQFE QALQVDDAIW  E+AI KA+ EWK+AI LM +LK+K +L
Sbjct: 117 PYLMQLPRSYDTLANFSQFEKQALQVDDAIWVTERAILKAELEWKKAIPLMEELKLKPQL 176

Query: 275 QSLRAWLWASGTISSRTLHIPWDEAGCLCPVGDLFNYAAPGEELGERDDLCAHGNASCMS 96
           Q+ RAWLWAS T+SSRT+HIPWD+AGCLCPVGD +NYAAPGEE    +DL    N S + 
Sbjct: 177 QNFRAWLWASSTVSSRTMHIPWDDAGCLCPVGDFYNYAAPGEEPCGWEDLKGSRNESSLQ 236

Query: 95  TSSCRVTDKPVVEHCDD------SVRLTDGSYEKD 9
            SS    +K    + D       S RLTDG Y++D
Sbjct: 237 DSS--FWNKDATSNSDAEQDDVLSQRLTDGGYKED 269


>ref|XP_002871756.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
           gi|297317593|gb|EFH48015.1| SET domain-containing
           protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score =  261 bits (666), Expect = 2e-67
 Identities = 133/259 (51%), Positives = 174/259 (67%)
 Frame = -3

Query: 785 FLGWAAKLGITDFPLNLNNPXXXXXXXXXXXXXXHFPNAGGRGLGATRDLRKGELILRVP 606
           FL WAA++GI+D   ++++                FP+AGGRGLGA R+L+KGEL+L+VP
Sbjct: 14  FLRWAAEIGISD---SIDSSRYRDSCLGHSLSVADFPHAGGRGLGAVRELKKGELVLKVP 70

Query: 605 REALFTTQSVVLQDHNFSVALQKYQSLSCTQKLTVALLNEISKGKSSLWFPYLKHLPQSY 426
           R AL TT+S++ +D   + A+  + SLS TQ L+V LL E+ KGK S W+PYL HLP+ Y
Sbjct: 71  RNALMTTESMIAKDRKLNDAVILHGSLSSTQILSVCLLYEMGKGKRSFWYPYLVHLPRDY 130

Query: 425 DILASFSQFETQALQVDDAIWAAEKAIGKAKSEWKEAIFLMNDLKIKNKLQSLRAWLWAS 246
           D+LA+F +FE QALQV+DA+WA EKAI K + EWKE   LM +L++K+K +S +AWLWAS
Sbjct: 131 DLLATFGEFEKQALQVEDAVWATEKAIAKCQFEWKEVGLLMEELELKSKFRSFQAWLWAS 190

Query: 245 GTISSRTLHIPWDEAGCLCPVGDLFNYAAPGEELGERDDLCAHGNASCMSTSSCRVTDKP 66
            TISSRTLH+PWD AGCLCPVGDLFNY APG++L        H      S +        
Sbjct: 191 ATISSRTLHVPWDSAGCLCPVGDLFNYDAPGDDL--------HTLEGPESANDVEEAGLV 242

Query: 65  VVEHCDDSVRLTDGSYEKD 9
           V  H   S RLTDG +E+D
Sbjct: 243 VETH---SERLTDGGFEED 258


>emb|CBI27360.3| unnamed protein product [Vitis vinifera]
          Length = 449

 Score =  261 bits (666), Expect = 2e-67
 Identities = 144/269 (53%), Positives = 177/269 (65%), Gaps = 7/269 (2%)
 Frame = -3

Query: 794 VNKFLGWAAKLGITDF-------PLNLNNPXXXXXXXXXXXXXXHFPNAGGRGLGATRDL 636
           + +FL WA +LGI+DF       P  L  P               FP+AGGRGL A RDL
Sbjct: 1   MERFLKWATELGISDFTTTPTTVPSRLQIPHCCVGHSLCVSH---FPHAGGRGLAAARDL 57

Query: 635 RKGELILRVPREALFTTQSVVLQDHNFSVALQKYQSLSCTQKLTVALLNEISKGKSSLWF 456
            +GELIL VP+ AL T+QS+ L+D   SVA++++ SLS  Q LT+ LL E+SKGKSS W 
Sbjct: 58  SQGELILTVPKSALMTSQSL-LKDEKLSVAVKRHTSLSSPQILTICLLAEMSKGKSSWWH 116

Query: 455 PYLKHLPQSYDILASFSQFETQALQVDDAIWAAEKAIGKAKSEWKEAIFLMNDLKIKNKL 276
           PYL  LP+SYD LA+FSQFE QALQVDDAIW  E+AI KA+ EWK+AI LM +LK+K +L
Sbjct: 117 PYLMQLPRSYDTLANFSQFEKQALQVDDAIWVTERAILKAELEWKKAIPLMEELKLKPQL 176

Query: 275 QSLRAWLWASGTISSRTLHIPWDEAGCLCPVGDLFNYAAPGEELGERDDLCAHGNASCMS 96
           Q+ RAWLWAS T+SSRT+HIPWD+AGCLCPVGD +NYAAPGEE    +DL          
Sbjct: 177 QNFRAWLWASSTVSSRTMHIPWDDAGCLCPVGDFYNYAAPGEEPCGWEDL---------- 226

Query: 95  TSSCRVTDKPVVEHCDDSVRLTDGSYEKD 9
                   K   +    S RLTDG Y++D
Sbjct: 227 --------KDAEQDDVLSQRLTDGGYKED 247


>ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
           gi|223527460|gb|EEF29592.1| Protein SET DOMAIN GROUP,
           putative [Ricinus communis]
          Length = 510

 Score =  259 bits (663), Expect = 5e-67
 Identities = 144/264 (54%), Positives = 181/264 (68%), Gaps = 5/264 (1%)
 Frame = -3

Query: 785 FLGWAA-KLGITDFPLNLNNPXXXXXXXXXXXXXXHFPNAGGRGLGATRDLRKGELILRV 609
           FL WAA +LGI+D   +  +               HFP+AGGRGLGA RDL+KGEL+LRV
Sbjct: 13  FLKWAAAELGISDSSNSSQSLEEPNSCLGISLTVSHFPDAGGRGLGAARDLKKGELVLRV 72

Query: 608 PREALFTTQSVVLQDHNFSVALQKYQSLSCTQKLTVALLNEISKGKSSLWFPYLKHLPQS 429
           P+ AL T  S  L+D     A+  + +LS TQ LTV LL E+SKG+SS W+PYL HLP+S
Sbjct: 73  PKSALLTKDSF-LKDGLLLSAINNHSALSPTQTLTVCLLYEMSKGQSSFWYPYLMHLPRS 131

Query: 428 YDILASFSQFETQALQVDDAIWAAEKAIGKAKSEWKEAIFLMNDLKIKNKLQSLRAWLWA 249
           Y+ILA+FS+FE QALQVDDAIW AEKAI KA+ + KEA  LM +L++K +  +LRAW+WA
Sbjct: 132 YEILATFSEFEKQALQVDDAIWTAEKAISKAELDRKEAYSLMQELRLKPQFLTLRAWIWA 191

Query: 248 SGTISSRTLHIPWDEAGCLCPVGDLFNYAAPGEELGERDDLCAHGNASCM---STSSCRV 78
             TISSRT+HIPWDEAGCLCPVGD FNYAAPGEE    ++  +   ASC+   S SS R 
Sbjct: 192 CATISSRTMHIPWDEAGCLCPVGDFFNYAAPGEESSSPENDESWKPASCLEDASLSSERS 251

Query: 77  TDKPVVEHCDDSVR-LTDGSYEKD 9
           T     E  D  ++ LTDG +++D
Sbjct: 252 TSNFCSETFDVQLKSLTDGGFDED 275


>ref|NP_197226.2| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
           gi|75271674|sp|Q6NQJ8.1|SDG40_ARATH RecName:
           Full=Protein SET DOMAIN GROUP 40
           gi|34222078|gb|AAQ62875.1| At5g17240 [Arabidopsis
           thaliana] gi|51969984|dbj|BAD43684.1| unknown protein
           [Arabidopsis thaliana] gi|332005020|gb|AED92403.1|
           protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
          Length = 491

 Score =  259 bits (661), Expect = 8e-67
 Identities = 133/259 (51%), Positives = 175/259 (67%)
 Frame = -3

Query: 785 FLGWAAKLGITDFPLNLNNPXXXXXXXXXXXXXXHFPNAGGRGLGATRDLRKGELILRVP 606
           FL WAA++GI+D   ++++                FP+AGGRGLGA R+L+KGEL+L+VP
Sbjct: 11  FLRWAAEIGISD---SIDSSRFRDSCLGHSLSVSDFPDAGGRGLGAARELKKGELVLKVP 67

Query: 605 REALFTTQSVVLQDHNFSVALQKYQSLSCTQKLTVALLNEISKGKSSLWFPYLKHLPQSY 426
           R+AL TT+S++ +D   S A+  + SLS TQ L+V LL E+SK K S W+PYL H+P+ Y
Sbjct: 68  RKALMTTESIIAKDLKLSDAVNLHNSLSSTQILSVCLLYEMSKEKKSFWYPYLFHIPRDY 127

Query: 425 DILASFSQFETQALQVDDAIWAAEKAIGKAKSEWKEAIFLMNDLKIKNKLQSLRAWLWAS 246
           D+LA+F  FE QALQV+DA+WA EKA  K +SEWKEA  LM +L++K K +S +AWLWAS
Sbjct: 128 DLLATFGNFEKQALQVEDAVWATEKATAKCQSEWKEAGSLMKELELKPKFRSFQAWLWAS 187

Query: 245 GTISSRTLHIPWDEAGCLCPVGDLFNYAAPGEELGERDDLCAHGNASCMSTSSCRVTDKP 66
            TISSRTLH+PWD AGCLCPVGDLFNY APG+          + N      S+  V +  
Sbjct: 188 ATISSRTLHVPWDSAGCLCPVGDLFNYDAPGD----------YSNTPQGPESANNVEEAG 237

Query: 65  VVEHCDDSVRLTDGSYEKD 9
           +V     S RLTDG +E+D
Sbjct: 238 LVVE-THSERLTDGGFEED 255


Top