BLASTX nr result

ID: Achyranthes23_contig00037032 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes23_contig00037032
         (854 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006473070.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...   132   2e-28
ref|XP_006434476.1| hypothetical protein CICLE_v10000601mg [Citr...   132   2e-28
ref|XP_002265243.2| PREDICTED: protein SET DOMAIN GROUP 41-like ...   126   9e-27
gb|EOY16760.1| SET domain-containing protein, putative isoform 3...   125   1e-26
gb|EOY16758.1| SET domain protein, putative isoform 1 [Theobroma...   125   1e-26
ref|XP_004290505.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...   124   4e-26
gb|EMJ00499.1| hypothetical protein PRUPE_ppa023162mg, partial [...   122   2e-25
ref|XP_002306703.2| hypothetical protein POPTR_0005s21560g [Popu...   117   4e-24
gb|ESW24006.1| hypothetical protein PHAVU_004G094200g, partial [...   115   2e-23
ref|XP_006395990.1| hypothetical protein EUTSA_v10003905mg [Eutr...   114   3e-23
ref|XP_006395991.1| hypothetical protein EUTSA_v10003905mg [Eutr...   114   4e-23
ref|XP_004138545.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...   112   1e-22
emb|CBI18219.3| unnamed protein product [Vitis vinifera]              112   2e-22
gb|EXC28030.1| Protein SET DOMAIN GROUP 41 [Morus notabilis]          111   3e-22
ref|XP_006599489.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...   111   3e-22
ref|XP_004238489.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...   111   3e-22
ref|XP_002893944.1| hypothetical protein ARALYDRAFT_314093 [Arab...   107   4e-21
ref|XP_006303699.1| hypothetical protein CARUB_v10011796mg [Caps...   107   5e-21
ref|XP_006359805.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...   105   2e-20
ref|NP_683372.2| SET domain-containing protein [Arabidopsis thal...   105   3e-20

>ref|XP_006473070.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Citrus sinensis]
          Length = 619

 Score =  132 bits (332), Expect = 2e-28
 Identities = 84/179 (46%), Positives = 105/179 (58%), Gaps = 10/179 (5%)
 Frame = -3

Query: 510 FGLLTNRDALLS--DLLQFSRIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCLVI 337
           FGLLTNRD L+S  D    S+I+ GAR +A AR                  EEA LCLV+
Sbjct: 81  FGLLTNRDKLMSSSDSDVASKIREGAREMARARG---------NLSDDVAWEEAALCLVM 131

Query: 336 TNAVEININ--GERLGIGVYDWRFSWINHSCSPNSCFRF------IPSFVVSEGPDCSLS 181
           TNAVE+  +  G  LGI VYD  FSWINHSCSPN+C+RF       PSF         ++
Sbjct: 132 TNAVEVQDDKTGRILGIAVYDKDFSWINHSCSPNACYRFSLSEPNAPSF--RNEKKMRIA 189

Query: 180 LLRIFASSDADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQP 4
              +F S++A+       C +CEL +G   +GPR+IVRSIK I + EEVT+ YTDLLQP
Sbjct: 190 PHVVFDSTEAETPGKSDVCISCELKEGSKRHGPRIIVRSIKPINKGEEVTVAYTDLLQP 248


>ref|XP_006434476.1| hypothetical protein CICLE_v10000601mg [Citrus clementina]
           gi|557536598|gb|ESR47716.1| hypothetical protein
           CICLE_v10000601mg [Citrus clementina]
          Length = 619

 Score =  132 bits (332), Expect = 2e-28
 Identities = 84/179 (46%), Positives = 106/179 (59%), Gaps = 10/179 (5%)
 Frame = -3

Query: 510 FGLLTNRDALLS--DLLQFSRIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCLVI 337
           FGLLTNRD L+S  D    S+I+ GAR +A AR                  EEA LCLV+
Sbjct: 81  FGLLTNRDKLMSSSDSDVASKIREGAREMARARG---------NLSDDVAWEEAALCLVM 131

Query: 336 TNAVEININ--GERLGIGVYDWRFSWINHSCSPNSCFRF------IPSFVVSEGPDCSLS 181
           TNAVE+  +  G  LGI VYD  FSWINHSCSPN+C+RF       PSF   +    +  
Sbjct: 132 TNAVEVQDDKTGRILGIAVYDKDFSWINHSCSPNACYRFSLSEPNAPSFRDEKKKRIAPH 191

Query: 180 LLRIFASSDADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQP 4
           +  +F S++A+       C +CEL +G   +GPR+IVRSIK I + EEVT+ YTDLLQP
Sbjct: 192 V--VFDSTEAETQGKSDVCISCELKEGSKRHGPRIIVRSIKPINKGEEVTVAYTDLLQP 248


>ref|XP_002265243.2| PREDICTED: protein SET DOMAIN GROUP 41-like [Vitis vinifera]
          Length = 660

 Score =  126 bits (317), Expect = 9e-27
 Identities = 82/176 (46%), Positives = 103/176 (58%), Gaps = 8/176 (4%)
 Frame = -3

Query: 507 GLLTNRDALLSDLLQ------FSRIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLC 346
           GLLTN   L+S           +RI+ G + +A+AR MRD     TE  G  +LEEA+LC
Sbjct: 122 GLLTNLHHLISPSHNSESDETLTRIRDGGKAMAVARCMRD----GTEFSGDSKLEEALLC 177

Query: 345 LVITNAVEINING-ERLGIGVYDWRFSWINHSCSPNSCFRFIPSFVVSEGPDCS-LSLLR 172
           LV+TNAVE+ +NG   LGI VYDW FSWINHSCSPN+C+RF+      E P  S  S L+
Sbjct: 178 LVLTNAVEVQVNGGSALGIAVYDWCFSWINHSCSPNACYRFL--LRSPETPQFSGESRLQ 235

Query: 171 IFASSDADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQP 4
           I               G  E+    +  GPR+IVRSIK I++ EEV + Y DLLQP
Sbjct: 236 IIPG------------GNDEIEVKKNRSGPRIIVRSIKAIKKGEEVWVAYIDLLQP 279


>gb|EOY16760.1| SET domain-containing protein, putative isoform 3 [Theobroma cacao]
           gi|508724865|gb|EOY16762.1| SET domain-containing
           protein, putative isoform 3 [Theobroma cacao]
           gi|508724866|gb|EOY16763.1| SET domain-containing
           protein, putative isoform 3 [Theobroma cacao]
           gi|508724867|gb|EOY16764.1| SET domain-containing
           protein, putative isoform 3 [Theobroma cacao]
          Length = 625

 Score =  125 bits (315), Expect = 1e-26
 Identities = 85/175 (48%), Positives = 107/175 (61%), Gaps = 7/175 (4%)
 Frame = -3

Query: 507 GLLTNRDALLSDLLQFS-RIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCLVITN 331
           GLLTN   L S   + + +I+ GA  IAMA A + + + +     GF LEEAVL LVITN
Sbjct: 121 GLLTNHHMLTSSSPEVAAKIRQGA--IAMAAARKSRNRDNEGQSDGFLLEEAVLSLVITN 178

Query: 330 AVEI-NINGERLGIGVYDWRFSWINHSCSPNSCFRF---IPSFVVSEGPDCSLSLLRIFA 163
           AVE+ + +G  LGI VYD  FSWINHSCSPN+C+RF    P   +S   D S S LRI  
Sbjct: 179 AVEVQDKSGRSLGIAVYDLSFSWINHSCSPNACYRFSISSPHATLSFREDSS-STLRIVP 237

Query: 162 SSDADKGDGRGDCGTCELADGVDNY--GPRVIVRSIKDIQRCEEVTITYTDLLQP 4
           S   ++ D    C   E   G   Y  GP++IVRSIK I++ EEV ++YTDLLQP
Sbjct: 238 SVLGEECDA---CSCVEHTKGNKGYELGPKIIVRSIKRIRKGEEVCVSYTDLLQP 289


>gb|EOY16758.1| SET domain protein, putative isoform 1 [Theobroma cacao]
           gi|508724862|gb|EOY16759.1| SET domain protein, putative
           isoform 1 [Theobroma cacao] gi|508724864|gb|EOY16761.1|
           SET domain protein, putative isoform 1 [Theobroma cacao]
          Length = 658

 Score =  125 bits (315), Expect = 1e-26
 Identities = 85/175 (48%), Positives = 107/175 (61%), Gaps = 7/175 (4%)
 Frame = -3

Query: 507 GLLTNRDALLSDLLQFS-RIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCLVITN 331
           GLLTN   L S   + + +I+ GA  IAMA A + + + +     GF LEEAVL LVITN
Sbjct: 121 GLLTNHHMLTSSSPEVAAKIRQGA--IAMAAARKSRNRDNEGQSDGFLLEEAVLSLVITN 178

Query: 330 AVEI-NINGERLGIGVYDWRFSWINHSCSPNSCFRF---IPSFVVSEGPDCSLSLLRIFA 163
           AVE+ + +G  LGI VYD  FSWINHSCSPN+C+RF    P   +S   D S S LRI  
Sbjct: 179 AVEVQDKSGRSLGIAVYDLSFSWINHSCSPNACYRFSISSPHATLSFREDSS-STLRIVP 237

Query: 162 SSDADKGDGRGDCGTCELADGVDNY--GPRVIVRSIKDIQRCEEVTITYTDLLQP 4
           S   ++ D    C   E   G   Y  GP++IVRSIK I++ EEV ++YTDLLQP
Sbjct: 238 SVLGEECDA---CSCVEHTKGNKGYELGPKIIVRSIKRIRKGEEVCVSYTDLLQP 289


>ref|XP_004290505.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Fragaria vesca subsp.
           vesca]
          Length = 645

 Score =  124 bits (311), Expect = 4e-26
 Identities = 83/171 (48%), Positives = 99/171 (57%), Gaps = 3/171 (1%)
 Frame = -3

Query: 507 GLLTNRDALLSDLLQFSRIKVGARLIAMARAMRDQRKFDTESGGGFEL-EEAVLCLVITN 331
           GLLTNR  L  DL    RI+ GAR + +AR M D      +      + EEA LCLV+TN
Sbjct: 115 GLLTNRRKLDDDL----RIRDGARAMFLARTMPDDNDAVLDVAHDDAVSEEAALCLVLTN 170

Query: 330 AVEININ-GERLGIGVYDWRFSWINHSCSPNSCFRFIPSFVVS-EGPDCSLSLLRIFASS 157
           AVE+  + G  LGI VYD  FSWINHSCSPN+C+RF+ S       P C  + LRI  + 
Sbjct: 171 AVEVQDHTGRTLGIAVYDSCFSWINHSCSPNACYRFLLSSPSQPTPPQCDETPLRIVPA- 229

Query: 156 DADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQP 4
                      G   +    + +GPRVIVRSIK I R EEVTITYTDLLQP
Sbjct: 230 -----------GQLIVNAECEKFGPRVIVRSIKRINRGEEVTITYTDLLQP 269


>gb|EMJ00499.1| hypothetical protein PRUPE_ppa023162mg, partial [Prunus persica]
          Length = 635

 Score =  122 bits (305), Expect = 2e-25
 Identities = 81/171 (47%), Positives = 98/171 (57%), Gaps = 3/171 (1%)
 Frame = -3

Query: 507 GLLTNRDALLSDLLQFSRIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCLVITNA 328
           GLLTN    L       RI+ GAR + +AR MRD+     ++     LEEA LCLV+TNA
Sbjct: 130 GLLTNHHKFLHHD-DHHRIRDGARAMFLARKMRDEAPNVYDA----VLEEAALCLVLTNA 184

Query: 327 VEI-NINGERLGIGVYDWRFSWINHSCSPNSCFRFIPSFVVSEGPDCSLSL--LRIFASS 157
           VE+ +  G  LGI VY   F WINHSCSPN+C+RF+ S      P CS     LRI    
Sbjct: 185 VEVQDKTGRTLGISVYGPSFCWINHSCSPNACYRFLVS--PPPPPPCSAERTPLRIAPLG 242

Query: 156 DADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQP 4
              +  G   C    +      YGPRVIVRSIK I++ EEVT+TYTDLLQP
Sbjct: 243 QGTQSCGIDICCRLRVVFVAIIYGPRVIVRSIKRIKKGEEVTVTYTDLLQP 293


>ref|XP_002306703.2| hypothetical protein POPTR_0005s21560g [Populus trichocarpa]
           gi|550339461|gb|EEE93699.2| hypothetical protein
           POPTR_0005s21560g [Populus trichocarpa]
          Length = 626

 Score =  117 bits (294), Expect = 4e-24
 Identities = 79/171 (46%), Positives = 97/171 (56%), Gaps = 3/171 (1%)
 Frame = -3

Query: 507 GLLTNRDALLSDLLQFSRIKVGARLIAMARA--MRDQRKFDTESGGGFELEEAVLCLVIT 334
           GLLTNR+ L++D    + ++ GA+ IA AR   M +  K D        L EA LCLV+T
Sbjct: 106 GLLTNREKLMADEEISAHVRYGAKAIAAARRIEMVENEKNDAV------LLEAALCLVLT 159

Query: 333 NAVEININ-GERLGIGVYDWRFSWINHSCSPNSCFRFIPSFVVSEGPDCSLSLLRIFASS 157
           NAVE++ N G  +GI VY   FSWINHSCSPN+C+R I S   +  P    S LRI  + 
Sbjct: 160 NAVEVHDNEGRSIGIAVYGPNFSWINHSCSPNACYRSIISPPDNVLPFSDESRLRILPAG 219

Query: 156 DADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQP 4
              K                   GPRVIVRSIK I+R EEVT+ YTDLLQP
Sbjct: 220 TEVKS---------------HESGPRVIVRSIKRIKRGEEVTVAYTDLLQP 255


>gb|ESW24006.1| hypothetical protein PHAVU_004G094200g, partial [Phaseolus
           vulgaris]
          Length = 530

 Score =  115 bits (288), Expect = 2e-23
 Identities = 76/170 (44%), Positives = 98/170 (57%), Gaps = 3/170 (1%)
 Frame = -3

Query: 507 GLLTNRDALLSDLLQFSRIKVGARLIAMARAMRDQRKFDTESGGGFELEEAV--LCLVIT 334
           GLL+NR  L S        ++      MA A+ +QR    +      LEEA   LC V+T
Sbjct: 107 GLLSNRRILTSHHHDHVSERIRLDATVMAEAIAEQRAVPHDDA---VLEEATIALCAVLT 163

Query: 333 NAVEININ-GERLGIGVYDWRFSWINHSCSPNSCFRFIPSFVVSEGPDCSLSLLRIFASS 157
           NAVE++ N G  LGI V+D  FSWINHSCSPN+C+RFI S   S  P+    LLRI  + 
Sbjct: 164 NAVEVHDNEGRALGIAVFDPTFSWINHSCSPNACYRFILSSFPSNEPE----LLRI--AP 217

Query: 156 DADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQ 7
               G G     + E A  +  YGPR++VRSIK I++ EEVT+ YTD+LQ
Sbjct: 218 HPQMGSGGVCVSSDEFAKEMLGYGPRLVVRSIKKIKKGEEVTVAYTDILQ 267


>ref|XP_006395990.1| hypothetical protein EUTSA_v10003905mg [Eutrema salsugineum]
           gi|557092629|gb|ESQ33276.1| hypothetical protein
           EUTSA_v10003905mg [Eutrema salsugineum]
          Length = 572

 Score =  114 bits (286), Expect = 3e-23
 Identities = 73/174 (41%), Positives = 99/174 (56%), Gaps = 3/174 (1%)
 Frame = -3

Query: 513 FFGLLTNRDALLSDLLQFSRIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCLVIT 334
           F GLLTN   L++D      I+  A  IA+   +R  RK         ELEEA +C V+T
Sbjct: 106 FGGLLTNHHRLMADSSFSVAIQCAANFIAVV--LRSDRK-------NTELEEAAICSVLT 156

Query: 333 NAVEI-NINGERLGIGVYDWRFSWINHSCSPNSCFRFIPSFVVSEGPDCSLSLLRIFASS 157
           NAVE+ + +G  LGI VYD RFSWINHSCSPN+C+RF+ S   +  P        +  ++
Sbjct: 157 NAVELQDSSGRALGIAVYDTRFSWINHSCSPNACYRFVISPHSTTTPSFQDYPKMLPHTT 216

Query: 156 DADKGDGRGDCGTCELADGV--DNYGPRVIVRSIKDIQRCEEVTITYTDLLQPT 1
           + +K       G C     +    YGP+V+ RSIK I+  EE+TI+Y DL+QPT
Sbjct: 217 NTEK----EQIGVCSRITSLWEVRYGPKVVARSIKRIKSGEEITISYIDLMQPT 266


>ref|XP_006395991.1| hypothetical protein EUTSA_v10003905mg [Eutrema salsugineum]
           gi|557092630|gb|ESQ33277.1| hypothetical protein
           EUTSA_v10003905mg [Eutrema salsugineum]
          Length = 575

 Score =  114 bits (285), Expect = 4e-23
 Identities = 75/174 (43%), Positives = 102/174 (58%), Gaps = 3/174 (1%)
 Frame = -3

Query: 513 FFGLLTNRDALLSDLLQFSRIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCLVIT 334
           F GLLTN   L++D      I+  A  IA+   +R  RK         ELEEA +C V+T
Sbjct: 106 FGGLLTNHHRLMADSSFSVAIQCAANFIAVV--LRSDRK-------NTELEEAAICSVLT 156

Query: 333 NAVEI-NINGERLGIGVYDWRFSWINHSCSPNSCFRFIPSFVVSEGPDCSLSLLRIFASS 157
           NAVE+ + +G  LGI VYD RFSWINHSCSPN+C+RF+ S   +  P        +  ++
Sbjct: 157 NAVELQDSSGRALGIAVYDTRFSWINHSCSPNACYRFVISPHSTTTPSFQDYPKMLPHTT 216

Query: 156 DADKGDGRGDCG-TCELADG-VDNYGPRVIVRSIKDIQRCEEVTITYTDLLQPT 1
           + +K +  G C     L +G    YGP+V+ RSIK I+  EE+TI+Y DL+QPT
Sbjct: 217 NTEK-EQIGVCSRITSLWEGKTVRYGPKVVARSIKRIKSGEEITISYIDLMQPT 269


>ref|XP_004138545.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Cucumis sativus]
          Length = 659

 Score =  112 bits (281), Expect = 1e-22
 Identities = 76/173 (43%), Positives = 97/173 (56%), Gaps = 4/173 (2%)
 Frame = -3

Query: 510 FGLLTNRDALLS---DLLQFSRIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCLV 340
           +GLLTNR  L++   D   F +++ GA  IA  R     RK   +   G  LEEAVLCLV
Sbjct: 143 YGLLTNRHKLMTPQNDSEVFLKLREGANAIAALR-----RKNYADIPPGTALEEAVLCLV 197

Query: 339 ITNAVEININ-GERLGIGVYDWRFSWINHSCSPNSCFRFIPSFVVSEGPDCSLSLLRIFA 163
           +TNAV++  + G+ +GI VY   FSWINHSCSPN+C+RF       E P  S++     A
Sbjct: 198 LTNAVDVQDSIGQTIGIAVYASTFSWINHSCSPNACYRF-------ETPSDSVTTRFRIA 250

Query: 162 SSDADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQP 4
            S  D     G+             GPRV+VRSIK I++ E VTI Y DLLQP
Sbjct: 251 PSCTDFMSDEGN---------FQGNGPRVVVRSIKRIKKGEAVTIAYCDLLQP 294


>emb|CBI18219.3| unnamed protein product [Vitis vinifera]
          Length = 533

 Score =  112 bits (279), Expect = 2e-22
 Identities = 67/133 (50%), Positives = 85/133 (63%), Gaps = 4/133 (3%)
 Frame = -3

Query: 390 TESGGGFELEEAVLCLVITNAVEINING-ERLGIGVYDWRFSWINHSCSPNSCFRFIPSF 214
           TE  G  +LEEA+LCLV+TNAVE+ +NG   LGI VYDW FSWINHSCSPN+C+RF+   
Sbjct: 5   TEFSGDSKLEEALLCLVLTNAVEVQVNGGSALGIAVYDWCFSWINHSCSPNACYRFL--L 62

Query: 213 VVSEGPDCS-LSLLRIF--ASSDADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRC 43
              E P  S  S L+I    + + +    R      E   G + +GPR+IVRSIK I++ 
Sbjct: 63  RSPETPQFSGESRLQIIPGGNDEIEVKKNRSLFLNSEF-KGCNIHGPRIIVRSIKAIKKG 121

Query: 42  EEVTITYTDLLQP 4
           EEV + Y DLLQP
Sbjct: 122 EEVWVAYIDLLQP 134


>gb|EXC28030.1| Protein SET DOMAIN GROUP 41 [Morus notabilis]
          Length = 661

 Score =  111 bits (278), Expect = 3e-22
 Identities = 78/180 (43%), Positives = 101/180 (56%), Gaps = 12/180 (6%)
 Frame = -3

Query: 507 GLLTNRDALLSDLLQ--FSRIKVGARLIAMARAMRDQRKFDTESGGGFE-LEEAVLCLVI 337
           GL TN   L +D  +   +RI+ GAR +A AR MRD+     ES G  E +  A LC V+
Sbjct: 131 GLSTNLHKLANDDEEEVAARIRDGARAMAAARRMRDRDCSGEESEGEEEAMAAAALCAVL 190

Query: 336 TNAVEINI-NGERLGIGVYDWR-FSWINHSCSPNSCFRF-------IPSFVVSEGPDCSL 184
           TN VE+ + +G  LG+ VY    FSWINHSCSPN+C+R          SF+    PD   
Sbjct: 191 TNGVEVQVKSGRTLGVAVYGGGGFSWINHSCSPNACYRISLHSDLQTTSFL----PDHET 246

Query: 183 SLLRIFASSDADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQP 4
           + +RI    + +       CG         +YGPR+IVRSIK IQ+ EEVT+ YTDLLQP
Sbjct: 247 AAMRIVPCCNKET-----QCGC--------SYGPRIIVRSIKRIQKGEEVTVAYTDLLQP 293


>ref|XP_006599489.1| PREDICTED: protein SET DOMAIN GROUP 41-like isoform X1 [Glycine
           max]
          Length = 642

 Score =  111 bits (278), Expect = 3e-22
 Identities = 76/174 (43%), Positives = 96/174 (55%), Gaps = 6/174 (3%)
 Frame = -3

Query: 507 GLLTNRDALLSDLLQ---FSRIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCL-- 343
           GLL+NR  L S  +      RI VGA   AMA A+  QR    +      LEEA + L  
Sbjct: 106 GLLSNRHILTSLSVHDDVSERISVGAG--AMAEAIAKQRGIPNDDA---VLEEATIALSA 160

Query: 342 VITNAVEININ-GERLGIGVYDWRFSWINHSCSPNSCFRFIPSFVVSEGPDCSLSLLRIF 166
           V+TNAVE++ N G  LGI V+D  FSWINHSCSPN+C+RF+ S     G        ++ 
Sbjct: 161 VLTNAVEVHDNEGRALGIAVFDQIFSWINHSCSPNACYRFVLSSSSHSGE------AKLG 214

Query: 165 ASSDADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQP 4
            +             + E A G   YGPR++VRSIK I + EEVT+ YTDLLQP
Sbjct: 215 IAPHLQMNSSGVSISSSEFAKGGLGYGPRLVVRSIKKINKGEEVTVAYTDLLQP 268


>ref|XP_004238489.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Solanum lycopersicum]
          Length = 677

 Score =  111 bits (278), Expect = 3e-22
 Identities = 68/159 (42%), Positives = 93/159 (58%), Gaps = 8/159 (5%)
 Frame = -3

Query: 456 RIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCLVITNAVEI-NINGERLGIGVYD 280
           RI+ GA+ +A +R MR       E+ G + +E AVLCLV+TNAVE+ + +G  LG+GVYD
Sbjct: 160 RIRDGAKALAASRRMR----VGLETNGEYTVEAAVLCLVLTNAVEVYDKDGRSLGVGVYD 215

Query: 279 WRFSWINHSCSPNSCFRFIPSFVVSEGPDCSLSLLRIFASSDADKGDGRGDCG------- 121
             FSW+NHSCSPN+ +RF      S+      S +   A+     G G            
Sbjct: 216 VPFSWVNHSCSPNASYRFC---TASDSGGILESRICPAATETGAAGIGHESISSNTELQK 272

Query: 120 TCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQP 4
           +  +  G +  GP++I+RSIK IQR EEV I+YTDLLQP
Sbjct: 273 SMSVIGGSEACGPKIILRSIKGIQRSEEVLISYTDLLQP 311


>ref|XP_002893944.1| hypothetical protein ARALYDRAFT_314093 [Arabidopsis lyrata subsp.
           lyrata] gi|297339786|gb|EFH70203.1| hypothetical protein
           ARALYDRAFT_314093 [Arabidopsis lyrata subsp. lyrata]
          Length = 567

 Score =  107 bits (268), Expect = 4e-21
 Identities = 73/172 (42%), Positives = 95/172 (55%), Gaps = 3/172 (1%)
 Frame = -3

Query: 507 GLLTNRDALLSDLLQFSRIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCLVITNA 328
           GLLTN   L++D      I   A  IA    +R  RK         ELEEA +C V+TNA
Sbjct: 103 GLLTNHHLLMADSSFSLAIHHAASFIATV--LRSNRK-------NTELEEAAICSVLTNA 153

Query: 327 VEI-NINGERLGIGVYDWRFSWINHSCSPNSCFRFIPSFVVSEGPDCSLSLLRIFASSDA 151
           VE+ + NG  LGI +YD RFSWINHSCSPNSC+RF+ +   S   D +  +     ++  
Sbjct: 154 VEVQDSNGLVLGIALYDSRFSWINHSCSPNSCYRFVNN-TTSYHDDLAYPITIPHVNNTE 212

Query: 150 DKGDGRGDCGTCELADGVD--NYGPRVIVRSIKDIQRCEEVTITYTDLLQPT 1
                       EL + V    YGP+VI R+IK I+  EE+T++Y DLLQPT
Sbjct: 213 -------TLSNLELQEQVRTMGYGPKVIARNIKRIKSGEEITVSYIDLLQPT 257


>ref|XP_006303699.1| hypothetical protein CARUB_v10011796mg [Capsella rubella]
           gi|482572410|gb|EOA36597.1| hypothetical protein
           CARUB_v10011796mg [Capsella rubella]
          Length = 572

 Score =  107 bits (267), Expect = 5e-21
 Identities = 70/173 (40%), Positives = 97/173 (56%), Gaps = 4/173 (2%)
 Frame = -3

Query: 507 GLLTNRDALLSDLLQFSRIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCLVITNA 328
           GLLTN   +++D      I+  A  I+    +R  R+         ELEEAV+C V+TNA
Sbjct: 103 GLLTNHHRIMADSSLSVAIQTAASFISTV--LRSNRE-------NTELEEAVICSVLTNA 153

Query: 327 VEININ-GERLGIGVYDWRFSWINHSCSPNSCFRFIPSFVVSEGPDCSLSLLR---IFAS 160
           VE+  + G  LGI +YD RFSWINHSCSPNSC+RF+     S   D +L+      I  +
Sbjct: 154 VEVQDSAGLALGIALYDSRFSWINHSCSPNSCYRFVTK-TTSFHDDLALAKTIPHIIITN 212

Query: 159 SDADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQPT 1
           ++           + +       YGP+VIVRSIK I+  EE+T++Y +LLQPT
Sbjct: 213 TETSSNLESKALSSLQEQGRRVGYGPKVIVRSIKRIKSGEEITVSYMNLLQPT 265


>ref|XP_006359805.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Solanum tuberosum]
          Length = 681

 Score =  105 bits (263), Expect = 2e-20
 Identities = 68/159 (42%), Positives = 95/159 (59%), Gaps = 8/159 (5%)
 Frame = -3

Query: 456 RIKVGARLIAMARAMRDQRKFDTESGGGFE---LEEAVLCLVITNAVEINI-NGERLGIG 289
           RI+ GA+ +A +R MR     DT     +E   +E AVLCLV+TNAVE++  +G  LG+G
Sbjct: 164 RIRHGAKALAASRRMR--LGLDTNRELLYEEYTVEAAVLCLVLTNAVEVHDKDGRSLGVG 221

Query: 288 VYDWRFSWINHSCSPNSCFRFI---PSFVVSEGPDCSLSLLRIFASSDADKGDGRGDC-G 121
           VYD  FSW+NHSCSPN+ +RF     S  +SE   C  +     A  +++      +   
Sbjct: 222 VYDVPFSWVNHSCSPNASYRFCTASDSGGISECRICPAATETGAAGIESESISSNPELQK 281

Query: 120 TCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQP 4
           +  +  G +  GP++I+RSIK I + EEV ITYTDLLQP
Sbjct: 282 SMSVIGGSETCGPKIILRSIKGINKSEEVLITYTDLLQP 320


>ref|NP_683372.2| SET domain-containing protein [Arabidopsis thaliana]
           gi|97190651|sp|Q3ECY6.1|SDG41_ARATH RecName:
           Full=Protein SET DOMAIN GROUP 41
           gi|332193843|gb|AEE31964.1| SET domain-containing
           protein [Arabidopsis thaliana]
          Length = 558

 Score =  105 bits (261), Expect = 3e-20
 Identities = 70/176 (39%), Positives = 93/176 (52%), Gaps = 8/176 (4%)
 Frame = -3

Query: 504 LLTNRDALLSDLLQFSRIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCLVITNAV 325
           LLTN   L++D      I   A  IA    +R  RK         ELEEA +C V+TNAV
Sbjct: 104 LLTNHHLLMADPSISVAIHHAANFIATV--IRSNRK-------NTELEEAAICAVLTNAV 154

Query: 324 EIN-INGERLGIGVYDWRFSWINHSCSPNSCFRFIPS-------FVVSEGPDCSLSLLRI 169
           E++  NG  LGI +Y+  FSWINHSCSPNSC+RF+ +        V +     +L L   
Sbjct: 155 EVHDSNGLALGIALYNSSFSWINHSCSPNSCYRFVNNRTSYHDVHVTNTETSSNLELQEQ 214

Query: 168 FASSDADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQPT 1
              +  + G+G                GP++IVRSIK I+  EE+T++Y DLLQPT
Sbjct: 215 VCGTSLNSGNGN---------------GPKLIVRSIKRIKSGEEITVSYIDLLQPT 255

