BLASTX nr result

ID: Atractylodes22_contig00019562 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes22_contig00019562
         (1837 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002265243.2| PREDICTED: protein SET DOMAIN GROUP 41-like ...   370   e-100
ref|XP_002306703.1| SET domain protein [Populus trichocarpa] gi|...   351   3e-94
ref|XP_003548980.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...   343   7e-92
emb|CBI18219.3| unnamed protein product [Vitis vinifera]              343   7e-92
ref|XP_004138545.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...   328   2e-87

>ref|XP_002265243.2| PREDICTED: protein SET DOMAIN GROUP 41-like [Vitis vinifera]
          Length = 660

 Score =  370 bits (951), Expect = e-100
 Identities = 240/551 (43%), Positives = 300/551 (54%), Gaps = 69/551 (12%)
 Frame = +1

Query: 226  RIRGLMTNRHKLLM---SIDDDEFSARIKSGARVMAAATMMRDGAVVSESSEGYLLEEAV 396
            RI GL+TN H L+    + + DE   RI+ G + MA A  MRDG   S  S+   LEEA+
Sbjct: 119  RICGLLTNLHHLISPSHNSESDETLTRIRDGGKAMAVARCMRDGTEFSGDSK---LEEAL 175

Query: 397  LCLVVTNAVEVQDRAGRSVGIAVYDISFSWINHGCSPNARYRFL--PPES--HGGGQRCL 564
            LCLV+TNAVEVQ   G ++GIAVYD  FSWINH CSPNA YRFL   PE+    G  R  
Sbjct: 176  LCLVLTNAVEVQVNGGSALGIAVYDWCFSWINHSCSPNACYRFLLRSPETPQFSGESRLQ 235

Query: 565  ITPASSNGCHPIDSDFLTGSPEICGGPRIVVRSIKLIKKGEQVTIAYTDLLQPKELRQLE 744
            I P  ++      +           GPRI+VRSIK IKKGE+V +AY DLLQPKE+R  E
Sbjct: 236  IIPGGNDEIEVKKNR---------SGPRIIVRSIKAIKKGEEVWVAYIDLLQPKEIRHAE 286

Query: 745  LWSKYRFTCCCPRCVAMPLTYVDQCLQEISFSN-GTGRLSHDV------GVENFTEYIDD 903
            LW KY F+CCC RC A P TYVD  LQE S S+     LS+++       +   T+Y+DD
Sbjct: 287  LWVKYWFSCCCNRCNASPPTYVDLVLQEKSESSLEDSFLSNELLFYREEEIRKLTDYVDD 346

Query: 904  AIGDYLSSNDAESCCKKLENALLNGIRYEGLV---------IRLHPQHHLSLNAYTTLAS 1056
            AI DYLS  + E+CC+KLEN +  G+  E L           +LHP HHLSL AYTTLAS
Sbjct: 347  AIADYLSVGNPEACCEKLENVIAQGLPDEQLEPIEGKSQANFKLHPLHHLSLAAYTTLAS 406

Query: 1057 AYKVRA---------MDENLLYSLKMNRXXXXXXXXXXXXTHNXXXXXXXXXXXXXNFWI 1209
            AY+VRA         MD + L +L + +            TH              NFW+
Sbjct: 407  AYRVRASQLLDLHSEMDGDELEALSLIKTSAAYSLLLAGATHRIFLSDSSLIASIANFWM 466

Query: 1210 GAGESVLNLARSLASNS----------PSELQRCKCSRCGLIDVFEADFDDGQCPDKILD 1359
             AGES+L+LARS   NS           S LQ  KC+ C L D FEA+F   Q  +  L+
Sbjct: 467  NAGESLLSLARSSLLNSFVKGRLPVLNLSSLQSHKCNECSLADEFEANFFGSQAHNGGLE 526

Query: 1360 -ISKEFLNCISDITLKVWKFLVNGNGYLEGVKDPINLRWLESRE------FGVNDGCYL- 1515
             ISK+FLNC+S IT KVW FL+ G+   +  KDPI+  WL+  E      F  + GC   
Sbjct: 527  NISKQFLNCVSSITPKVWSFLIQGHHLCKKFKDPIDSNWLQKMETSKIWGFQAHSGCTAM 586

Query: 1516 ------------------EGQARVELLRLGGHCLLYGGILSNISGGHDSHLSCYVRQLVN 1641
                                Q R  L +LG HCLLYGG LS+I  G  S+L+ Y+R LV+
Sbjct: 587  DSSSWDEESTGGYEAQRDTNQERKNLFKLGIHCLLYGGFLSSICYGPSSYLTRYIRNLVD 646

Query: 1642 NEDG-TGIDLH 1671
             E+  TG   H
Sbjct: 647  GEESLTGNSSH 657


>ref|XP_002306703.1| SET domain protein [Populus trichocarpa] gi|222856152|gb|EEE93699.1|
            SET domain protein [Populus trichocarpa]
          Length = 626

 Score =  351 bits (901), Expect = 3e-94
 Identities = 226/548 (41%), Positives = 294/548 (53%), Gaps = 68/548 (12%)
 Frame = +1

Query: 208  PFKQYDRIRGLMTNRHKLLMSIDDDEFSARIKSGARVMAAATMMRDGAVVSESSEGYLLE 387
            P    +RI GL+TNR KL+    D+E SA ++ GA+ +AAA  +    V +E ++  LLE
Sbjct: 97   PSSSTNRICGLLTNREKLMA---DEEISAHVRYGAKAIAAARRIE--MVENEKNDAVLLE 151

Query: 388  EAVLCLVVTNAVEVQDRAGRSVGIAVYDISFSWINHGCSPNARYRFL--PPES---HGGG 552
             A LCLV+TNAVEV D  GRS+GIAVY  +FSWINH CSPNA YR +  PP++       
Sbjct: 152  -AALCLVLTNAVEVHDNEGRSIGIAVYGPNFSWINHSCSPNACYRSIISPPDNVLPFSDE 210

Query: 553  QRCLITPASSNGCHPIDSDFLTGSPEICGGPRIVVRSIKLIKKGEQVTIAYTDLLQPKEL 732
             R  I PA +             S E   GPR++VRSIK IK+GE+VT+AYTDLLQPKE+
Sbjct: 211  SRLRILPAGTE----------VKSHE--SGPRVIVRSIKRIKRGEEVTVAYTDLLQPKEI 258

Query: 733  RQLELWSKYRFTCCCPRCVAMPLTYVDQCLQEISFSN-GTGRLS------HDVGVENFTE 891
            R+ ELW+KYRF CCC RC+A P +YVD  LQEIS SN  +  LS       D      T+
Sbjct: 259  RRSELWAKYRFICCCTRCIASPPSYVDHVLQEISASNLASSSLSSELSFYRDEATRKLTD 318

Query: 892  YIDDAIGDYLSSNDAESCCKKLENALLNGIRYEGLVI---------RLHPQHHLSLNAYT 1044
            Y+D+   +YL+  D ESCCKK EN L+ G+  E L +         RLH  HHL+LN YT
Sbjct: 319  YVDEVTAEYLAVGDPESCCKKFENMLITGLLDEQLEVREGKSQLNFRLHALHHLALNTYT 378

Query: 1045 TLASAYKVRAMDENLLYS---------LKMNRXXXXXXXXXXXXTHNXXXXXXXXXXXXX 1197
             LASAYK+RA D   L+S         L M+R            T++             
Sbjct: 379  VLASAYKIRASDLFSLHSEVGGLPWEALSMSRISAAYSLLLATATYHLFCFESSLLVSVA 438

Query: 1198 NFWIGAGESVLNLARSLASNS----------PSELQRCKCSRCGLIDVFEADFDDGQCPD 1347
            NFW  AGES+L LA+S A +S           S L + KCS+C L++ FE +   GQ  D
Sbjct: 439  NFWTSAGESLLALAKSSAWDSLGKCGFPVLNLSPLAKHKCSKCSLLESFEVNLSFGQ--D 496

Query: 1348 KIL-----DISKEFLNCISDITLKVWKFLVNGNGYLEGVKDPINLRWL-----------E 1479
             I       +S  FL+CI  +  +VW FL+ G+ YL+  KDP +  WL           E
Sbjct: 497  HIRKAGFDSVSSRFLDCIGSLLQEVWGFLIQGDRYLKMFKDPTDFSWLGKSLDIWDFDAE 556

Query: 1480 SREFGVNDGCYLE------------GQARVELLRLGGHCLLYGGILSNISGGHDSHLSCY 1623
                 V+  C+               Q R+   +LG HCLLYGG L+ I  G  SH S +
Sbjct: 557  LTHNDVDFNCWTNKSVSGIEALGYTDQWRINTFQLGVHCLLYGGFLAGICYGPHSHWSSH 616

Query: 1624 VRQLVNNE 1647
            +R  +N E
Sbjct: 617  IRSALNYE 624


>ref|XP_003548980.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Glycine max]
          Length = 629

 Score =  343 bits (881), Expect = 7e-92
 Identities = 226/544 (41%), Positives = 288/544 (52%), Gaps = 61/544 (11%)
 Frame = +1

Query: 199  SHQPFKQYDRIRGLMTNRHKLLMSIDDDEFSARIKSGARVMAAATMMRDGAVVSESSEGY 378
            SH+P     R+ GL++NRH L      D+ S RI  GA  MA A   + G      ++  
Sbjct: 95   SHRPTSS-SRLAGLLSNRHILTSLSVHDDVSERISVGAGAMAEAIAKQRGI----PNDDA 149

Query: 379  LLEEAVLCL--VVTNAVEVQDRAGRSVGIAVYDISFSWINHGCSPNARYRF-LPPESHGG 549
            +LEEA + L  V+TNAVEV D  GR++GIAV+D  FSWINH CSPNA YRF L   SH G
Sbjct: 150  VLEEATIALSAVLTNAVEVHDNEGRALGIAVFDQIFSWINHSCSPNACYRFVLSSSSHSG 209

Query: 550  GQRCLITPASSNGCHPIDSDFLTGSPEICGGPRIVVRSIKLIKKGEQVTIAYTDLLQPKE 729
              +  I P   N           G   +  GPR+VVRSIK I KGE+VT+AYTDLLQPK 
Sbjct: 210  EAKLGIAPHLQN----------VGG--LGYGPRLVVRSIKKINKGEEVTVAYTDLLQPKA 257

Query: 730  LRQLELWSKYRFTCCCPRCVAMPLTYVDQCLQEIS-----FSNGTGRLSHDVGVENFTEY 894
            +RQ ELWSKYRF CCC RC A+P +YVD  LQEIS      S    +   D+     TE 
Sbjct: 258  MRQSELWSKYRFVCCCKRCSALPSSYVDHALQEISAITCESSGSCSKFLKDMADRRLTEC 317

Query: 895  IDDAIGDYLSSNDAESCCKKLENALLNGIRYEGLVIR--------LHPQHHLSLNAYTTL 1050
            IDD I +YLS  D ESCC+KLE  L  G++    VI         LHP HH S+ AYTTL
Sbjct: 318  IDDVILEYLSVGDPESCCEKLEEILTQGLKEHLEVIEVKPDCIFMLHPLHHHSIKAYTTL 377

Query: 1051 ASAYKVRA---------MDENLLYSLKMNRXXXXXXXXXXXXTHNXXXXXXXXXXXXXNF 1203
            ASAYKV A          D N L +  M+R            TH+             NF
Sbjct: 378  ASAYKVCACDLLSVDSETDINQLKAFDMSRISAAYSLVLAGATHHLFNSESSLIASVANF 437

Query: 1204 WIGAGESVLNLARS----------LASNSPSELQRCKCSRCGLIDVFEADFDDGQCPDKI 1353
            W GAGES+L+L++S          L   + +   + KC++C L+D F A   +GQ     
Sbjct: 438  WTGAGESLLSLSKSSGWSMCVNLGLVIPNLASAMKFKCTKCSLMDRFRAGMLNGQIKSAD 497

Query: 1354 LD-ISKEFLNCISDITLKVWKFLVNGNGYLEGVKDPINLRWLESR--------EFGVN-- 1500
             + +S EFL+C+SDIT KVW FL++   +L+  KDPI   WL S         E  VN  
Sbjct: 498  FENVSNEFLHCVSDITQKVWGFLISDCQFLQSCKDPIISSWLMSTKSSSTVDVEVCVNKT 557

Query: 1501 DGCY---------------LEGQARVELLRLGGHCLLYGGILSNISGGHDSHLSCYVRQL 1635
            + CY               L   A   + +LG HCL YGG+L++I  G  SHL C+V+ +
Sbjct: 558  NMCYTNESENSVSMCHEQTLADHAVACIFQLGVHCLAYGGLLASICYGPHSHLVCHVQNV 617

Query: 1636 VNNE 1647
            + +E
Sbjct: 618  LEHE 621


>emb|CBI18219.3| unnamed protein product [Vitis vinifera]
          Length = 533

 Score =  343 bits (881), Expect = 7e-92
 Identities = 222/515 (43%), Positives = 276/515 (53%), Gaps = 77/515 (14%)
 Frame = +1

Query: 337  MRDGAVVSESSEGYLLEEAVLCLVVTNAVEVQDRAGRSVGIAVYDISFSWINHGCSPNAR 516
            MRDG   S  S+   LEEA+LCLV+TNAVEVQ   G ++GIAVYD  FSWINH CSPNA 
Sbjct: 1    MRDGTEFSGDSK---LEEALLCLVLTNAVEVQVNGGSALGIAVYDWCFSWINHSCSPNAC 57

Query: 517  YRFL--PPES--HGGGQRCLITPASSNGCHPIDSDFLTGSPEICG----GPRIVVRSIKL 672
            YRFL   PE+    G  R  I P  ++      +  L  + E  G    GPRI+VRSIK 
Sbjct: 58   YRFLLRSPETPQFSGESRLQIIPGGNDEIEVKKNRSLFLNSEFKGCNIHGPRIIVRSIKA 117

Query: 673  IKKGEQVTIAYTDLLQPKELRQLELWSKYRFTCCCPRCVAMPLTYVDQCLQEISFSN--- 843
            IKKGE+V +AY DLLQPKE+R  ELW KY F+CCC RC A P TYVD  LQ     N   
Sbjct: 118  IKKGEEVWVAYIDLLQPKEIRHAELWVKYWFSCCCNRCNASPPTYVDLVLQVRLLWNKLH 177

Query: 844  -GTGRLSHDVG-----------VENFTEYIDDAIGDYLSSNDAESCCKKLENALLNGIRY 987
              +  L+H +            +   T+Y+DDAI DYLS  + E+CC+KLEN +  G+  
Sbjct: 178  PESETLAHSLNYIDDNMCREEEIRKLTDYVDDAIADYLSVGNPEACCEKLENVIAQGLPD 237

Query: 988  EGLV---------IRLHPQHHLSLNAYTTLASAYKVRA---------MDENLLYSLKMNR 1113
            E L           +LHP HHLSL AYTTLASAY+VRA         MD + L +L + +
Sbjct: 238  EQLEPIEGKSQANFKLHPLHHLSLAAYTTLASAYRVRASQLLDLHSEMDGDELEALSLIK 297

Query: 1114 XXXXXXXXXXXXTHNXXXXXXXXXXXXXNFWIGAGESVLNLARSLASNS----------P 1263
                        TH              NFW+ AGES+L+LARS   NS           
Sbjct: 298  TSAAYSLLLAGATHRIFLSDSSLIASIANFWMNAGESLLSLARSSLLNSFVKGRLPVLNL 357

Query: 1264 SELQRCKCSRCGLIDVFEADFDDGQCPDKILD-ISKEFLNCISDITLKVWKFLVNGNGYL 1440
            S LQ  KC+ C L D FEA+F   Q  +  L+ ISK+FLNC+S IT KVW FL+ G+   
Sbjct: 358  SSLQSHKCNECSLADEFEANFFGSQAHNGGLENISKQFLNCVSSITPKVWSFLIQGHHLC 417

Query: 1441 EGVKDPINLRWLESRE------FGVNDGCYL-------------------EGQARVELLR 1545
            +  KDPI+  WL+  E      F  + GC                       Q R  L +
Sbjct: 418  KKFKDPIDSNWLQKMETSKIWGFQAHSGCTAMDSSSWDEESTGGYEAQRDTNQERKNLFK 477

Query: 1546 LGGHCLLYGGILSNISGGHDSHLSCYVRQLVNNED 1650
            LG HCLLYGG LS+I  G  S+L+ Y+R LV+ E+
Sbjct: 478  LGIHCLLYGGFLSSICYGPSSYLTRYIRNLVDGEE 512


>ref|XP_004138545.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Cucumis sativus]
          Length = 659

 Score =  328 bits (842), Expect = 2e-87
 Identities = 213/531 (40%), Positives = 271/531 (51%), Gaps = 57/531 (10%)
 Frame = +1

Query: 223  DRIRGLMTNRHKLLMSIDDDEFSARIKSGARVMAAATMMRDGAVVSESSEGYLLEEAVLC 402
            DRI GL+TNRHKL+   +D E   +++ GA  +AA          ++   G  LEEAVLC
Sbjct: 140  DRIYGLLTNRHKLMTPQNDSEVFLKLREGANAIAALRRKN----YADIPPGTALEEAVLC 195

Query: 403  LVVTNAVEVQDRAGRSVGIAVYDISFSWINHGCSPNARYRFLPPESHGGGQRCLITPASS 582
            LV+TNAV+VQD  G+++GIAVY  +FSWINH CSPNA YRF  P S     R  I P+ +
Sbjct: 196  LVLTNAVDVQDSIGQTIGIAVYASTFSWINHSCSPNACYRFETP-SDSVTTRFRIAPSCT 254

Query: 583  NGCHPIDSDFLTGSPEICG-GPRIVVRSIKLIKKGEQVTIAYTDLLQPKELRQLELWSKY 759
                    DF++      G GPR+VVRSIK IKKGE VTIAY DLLQPK LRQ ELWS+Y
Sbjct: 255  --------DFMSDEGNFQGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKVLRQSELWSRY 306

Query: 760  RFTCCCPRCVAMPLTYVDQCLQEISF-------SNGTGRLSHDVGVENFTEYIDDAIGDY 918
            +F C C RC A+PLTYVD  LQEIS        S       HD  V    EY+D+AI +Y
Sbjct: 307  QFVCSCQRCSAVPLTYVDHALQEISSVKVELLDSTPISNFDHDTAVRRIDEYVDNAITEY 366

Query: 919  LSSNDAESCCKKLENALLNGIRYE---------GLVIRLHPQHHLSLNAYTTLASAYKVR 1071
            LS++  ESCC+KL+N L  G   E          + +RLHP H L LNAYT L SAYKVR
Sbjct: 367  LSTSSPESCCEKLQNLLTFGFHDEQVEDGEGKQHVSLRLHPLHFLLLNAYTALTSAYKVR 426

Query: 1072 AMDENLLYS------------LKMNRXXXXXXXXXXXXTHNXXXXXXXXXXXXXNFWIGA 1215
            + D   L S            L M +            TH              N W+ A
Sbjct: 427  SCDLVALSSEMDKDNGNRHNALTMGKTSAAYALFLAGATHRLFLFEPSLVASAANCWVVA 486

Query: 1216 GESVLNLAR--------SLASNSPSELQRCKCSRCGLIDVFEADFDDGQ-CPDKILDISK 1368
            GES+L LAR        +  SN    L +  C  C  +D F A    GQ       + S 
Sbjct: 487  GESLLILARHSSLWATTTNTSNWVFPLGKRMCYNCSWVDEFNASRIHGQPVQADFREFSI 546

Query: 1369 EFLNCISDITLKVWKFLVNGNGYLEGVKDPINLRWLESRE-----FGVNDGCYL------ 1515
               NCI+ I+ K W  L +G  YL+    P +  W ++ E      G++  C        
Sbjct: 547  GISNCIASISQKCWSSLTHGCPYLKAFTGPFDFSWPKTNEQDICGRGIDHSCACSKTQDV 606

Query: 1516 --------EGQARVELLRLGGHCLLYGGILSNISGGHDSHLSCYVRQLVNN 1644
                      Q R  +  LG HCL YGG L++I  GH SHL+  ++ ++N+
Sbjct: 607  CLECKPQDSNQERESISGLGIHCLYYGGYLASICYGHHSHLASQIQNILND 657

