BLASTX nr result

ID: Mentha23_contig00006261 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00006261
         (671 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU36834.1| hypothetical protein MIMGU_mgv1a023205mg [Mimulus...   187   3e-45
ref|XP_004238489.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...   134   3e-29
ref|XP_007199300.1| hypothetical protein PRUPE_ppa023162mg, part...   127   2e-27
ref|XP_006359805.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...   126   5e-27
ref|XP_002265243.2| PREDICTED: protein SET DOMAIN GROUP 41-like ...   109   7e-22
gb|EXC28030.1| Protein SET DOMAIN GROUP 41 [Morus notabilis]          102   8e-20
ref|XP_007019535.1| SET domain-containing protein, putative isof...   100   7e-19
ref|XP_007019533.1| SET domain protein, putative isoform 1 [Theo...   100   7e-19
ref|XP_002306703.2| hypothetical protein POPTR_0005s21560g [Popu...    95   2e-17
ref|XP_004290505.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...    91   4e-16
ref|XP_004138545.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...    89   2e-15
ref|XP_006473070.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...    85   2e-14
ref|XP_006599490.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...    82   2e-13
ref|XP_006599489.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...    82   2e-13
ref|XP_003595407.1| Protein SET DOMAIN GROUP [Medicago truncatul...    77   5e-12
ref|XP_002893944.1| hypothetical protein ARALYDRAFT_314093 [Arab...    77   6e-12
ref|XP_007152012.1| hypothetical protein PHAVU_004G094200g, part...    74   5e-11
ref|XP_006395991.1| hypothetical protein EUTSA_v10003905mg [Eutr...    72   1e-10
ref|XP_006395990.1| hypothetical protein EUTSA_v10003905mg [Eutr...    72   1e-10
ref|NP_683372.2| SET domain-containing protein [Arabidopsis thal...    72   1e-10

>gb|EYU36834.1| hypothetical protein MIMGU_mgv1a023205mg [Mimulus guttatus]
          Length = 635

 Score =  187 bits (475), Expect = 3e-45
 Identities = 114/231 (49%), Positives = 139/231 (60%), Gaps = 11/231 (4%)
 Frame = +3

Query: 12  MEMRAIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQH 191
           MEMRA+E I IGEDLTP L PLA VL ++AV+S+CSACF  LPPQ FPP+      N  H
Sbjct: 1   MEMRAVEDIAIGEDLTPALPPLAFVLLETAVSSYCSACFSILPPQPFPPLNPNSRPNCSH 60

Query: 192 VPTDTPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQ 371
            P  +PTPLYCS+ CS+ DS LHFSS E  LLSLF  SPP +W+            H+F 
Sbjct: 61  FP--SPTPLYCSVNCSSIDSPLHFSSGELRLLSLFRQSPPFAWEDSSDLRLSLRLIHLF- 117

Query: 372 CSLLPQRSFLEGKNESISQEIEEAKGPDV--CLERIAGLMTNRENLVFATKQIEDSDENS 545
                             Q+IE+ + P+    +ERI GLMTNRE L+F      +  ENS
Sbjct: 118 ------------------QKIEKIECPEASEIIERIGGLMTNREKLIF------EESENS 153

Query: 546 EN-YLRIREGAKMMAKVR-----NNVNSDK---CFAFEEMVLCLVMTNAVE 671
           EN Y +IR GAKMMA+ R     + VN++K    F  EEMVLCLV+TNAVE
Sbjct: 154 ENVYQKIRSGAKMMAEARRASTDHYVNAEKKRDDFVLEEMVLCLVLTNAVE 204


>ref|XP_004238489.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Solanum lycopersicum]
          Length = 677

 Score =  134 bits (336), Expect = 3e-29
 Identities = 87/226 (38%), Positives = 120/226 (53%), Gaps = 6/226 (2%)
 Frame = +3

Query: 12  MEMRAIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQT---FPPIWRTFPEN 182
           MEMRA E I IG+DLTPP+ PL+  LH S + SHCS+CF  LPP     +PP +   P+N
Sbjct: 1   MEMRAKEAISIGQDLTPPIPPLSLCLHHSTLLSHCSSCFSPLPPPPSLHYPPFFS--PKN 58

Query: 183 LQHVPTDTPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAH 362
               P    +  YCSL CS+ DS +HFSS+E H   LF     +++             H
Sbjct: 59  ----PNSNHSIRYCSLQCSSLDSPIHFSSSEFHFFHLFPQPLYTNFPTSSDLRLSLRLLH 114

Query: 363 IFQCSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDEN 542
           +FQ   L                I+E+ G  + LERI GLMTN   ++F  +   D+D +
Sbjct: 115 LFQTLHL----------------IQESNGSLLNLERIGGLMTNFRKVMFLEEHCNDNDLS 158

Query: 543 SENYLRIREGAKMMA---KVRNNVNSDKCFAFEEMVLCLVMTNAVE 671
                RIR+GAK +A   ++R  + ++  +  E  VLCLV+TNAVE
Sbjct: 159 G----RIRDGAKALAASRRMRVGLETNGEYTVEAAVLCLVLTNAVE 200


>ref|XP_007199300.1| hypothetical protein PRUPE_ppa023162mg, partial [Prunus persica]
           gi|462394700|gb|EMJ00499.1| hypothetical protein
           PRUPE_ppa023162mg, partial [Prunus persica]
          Length = 635

 Score =  127 bits (320), Expect = 2e-27
 Identities = 92/225 (40%), Positives = 114/225 (50%), Gaps = 5/225 (2%)
 Frame = +3

Query: 12  MEMRAIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRT--FPENL 185
           MEMRA E I IGED+TPPL PL   LHDS ++SHCS+CF  LPP  FPP+  T  FP N 
Sbjct: 1   MEMRAEEDIEIGEDITPPLTPLGFALHDSLLSSHCSSCFSLLPPHPFPPLHFTPPFPHNP 60

Query: 186 QHVPTDTPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHI 365
            HV + +    YCS  CST+DS LH SSAE HLL L   S PS++             H 
Sbjct: 61  HHVLSSSS---YCSPLCSTSDSPLHVSSAELHLLHLL-QSHPSTY------------PHG 104

Query: 366 FQCSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENS 545
               L      L             A GP     RIAGL+TN    +           + 
Sbjct: 105 DSSDLRAALRLLHSL---------PATGPSA---RIAGLLTNHHKFL-----------HH 141

Query: 546 ENYLRIREGAKMM---AKVRNNVNSDKCFAFEEMVLCLVMTNAVE 671
           +++ RIR+GA+ M    K+R+   +      EE  LCLV+TNAVE
Sbjct: 142 DDHHRIRDGARAMFLARKMRDEAPNVYDAVLEEAALCLVLTNAVE 186


>ref|XP_006359805.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Solanum tuberosum]
          Length = 681

 Score =  126 bits (317), Expect = 5e-27
 Identities = 87/232 (37%), Positives = 122/232 (52%), Gaps = 12/232 (5%)
 Frame = +3

Query: 12  MEMRAIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQT----FPPIWRTFPE 179
           MEMRA E I IG+DLTPP+ PL+  LH S + SHCS+CF  LPP      +PP +   P+
Sbjct: 1   MEMRAKEAIPIGQDLTPPIPPLSLSLHHSTLLSHCSSCFSPLPPPPPSLHYPPFFS--PK 58

Query: 180 NLQHVPTDTPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXA 359
           N    P       YCSL CS+ DS +HFSS+E H   LF     +++             
Sbjct: 59  N----PNPNHFIRYCSLQCSSLDSPIHFSSSEFHFFHLFPQPLHTNFPTSSDLRLSLRLL 114

Query: 360 HIFQCSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDE 539
           H FQ   L                I+E+ G  + LERI GL+TN   ++F  +   D+D+
Sbjct: 115 HRFQTLNL----------------IQESNGSFLNLERIGGLVTNFRKVMFLEEHCNDNDD 158

Query: 540 NSENYLRIREGAKMMA---KVRNNVNSDKCFAFEE-----MVLCLVMTNAVE 671
           +  +  RIR GAK +A   ++R  +++++   +EE      VLCLV+TNAVE
Sbjct: 159 DDLSG-RIRHGAKALAASRRMRLGLDTNRELLYEEYTVEAAVLCLVLTNAVE 209


>ref|XP_002265243.2| PREDICTED: protein SET DOMAIN GROUP 41-like [Vitis vinifera]
          Length = 660

 Score =  109 bits (273), Expect = 7e-22
 Identities = 82/223 (36%), Positives = 101/223 (45%), Gaps = 3/223 (1%)
 Frame = +3

Query: 12  MEMRAIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQH 191
           MEMR  E   +G DLT PL PLA+ LHDS + SHCSACF  LPP             L +
Sbjct: 1   MEMRMREDTEMGLDLTHPLPPLASSLHDSHLRSHCSACFSPLPPTV-----------LVN 49

Query: 192 VPTDTPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQ 371
               +    YCS  CS +DS LHFSSAE HL  L  HS PS+              HI  
Sbjct: 50  TNPSSSFLCYCSPPCSASDSPLHFSSAEHHLFLLLRHSHPST-AHSSDLRAALRLLHILH 108

Query: 372 CSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENSEN 551
              L  +                       L RI GL+TN  +L+  +     + E+ E 
Sbjct: 109 LPPLHTQP----------------------LHRICGLLTNLHHLISPS----HNSESDET 142

Query: 552 YLRIREGAKMMAK---VRNNVNSDKCFAFEEMVLCLVMTNAVE 671
             RIR+G K MA    +R+          EE +LCLV+TNAVE
Sbjct: 143 LTRIRDGGKAMAVARCMRDGTEFSGDSKLEEALLCLVLTNAVE 185


>gb|EXC28030.1| Protein SET DOMAIN GROUP 41 [Morus notabilis]
          Length = 661

 Score =  102 bits (255), Expect = 8e-20
 Identities = 84/231 (36%), Positives = 103/231 (44%), Gaps = 9/231 (3%)
 Frame = +3

Query: 6   LKMEMRAIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIW-RTFPEN 182
           ++M MR  E+I +GEDLT PL PL+  LH S + SHCS+CF  LP    PPI+   FP +
Sbjct: 4   MEMMMRGREEIEMGEDLTRPLPPLSFSLHHSLLLSHCSSCFSPLPSSPLPPIFPPRFPPS 63

Query: 183 LQHVPTDTPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAH 362
                   P  LYCS  CS +DS LHFSSAE HLL L     PS+             A 
Sbjct: 64  -----NSNPKILYCSSQCSFSDSPLHFSSAEHHLLCLL----PSA-------------AA 101

Query: 363 IFQCSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDEN 542
                L      LE            A      + RIAGL TN   L         +D+ 
Sbjct: 102 ADSSDLRAALRLLES---------NPATRRSSSVSRIAGLSTNLHKLA--------NDDE 144

Query: 543 SENYLRIREGAKMMAKVRNNVNSD--------KCFAFEEMVLCLVMTNAVE 671
            E   RIR+GA+ MA  R   + D        +  A     LC V+TN VE
Sbjct: 145 EEVAARIRDGARAMAAARRMRDRDCSGEESEGEEEAMAAAALCAVLTNGVE 195


>ref|XP_007019535.1| SET domain-containing protein, putative isoform 3 [Theobroma cacao]
           gi|590600821|ref|XP_007019537.1| SET domain-containing
           protein, putative isoform 3 [Theobroma cacao]
           gi|590600825|ref|XP_007019538.1| SET domain-containing
           protein, putative isoform 3 [Theobroma cacao]
           gi|590600830|ref|XP_007019539.1| SET domain-containing
           protein, putative isoform 3 [Theobroma cacao]
           gi|508724863|gb|EOY16760.1| SET domain-containing
           protein, putative isoform 3 [Theobroma cacao]
           gi|508724865|gb|EOY16762.1| SET domain-containing
           protein, putative isoform 3 [Theobroma cacao]
           gi|508724866|gb|EOY16763.1| SET domain-containing
           protein, putative isoform 3 [Theobroma cacao]
           gi|508724867|gb|EOY16764.1| SET domain-containing
           protein, putative isoform 3 [Theobroma cacao]
          Length = 625

 Score = 99.8 bits (247), Expect = 7e-19
 Identities = 76/226 (33%), Positives = 104/226 (46%), Gaps = 5/226 (2%)
 Frame = +3

Query: 9   KMEMRAIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQ 188
           +MEMRA + +  G+D+TPP+ PL++ L+DS ++SHCS+CF  LPP        TFP   +
Sbjct: 12  EMEMRAKQDLDYGQDITPPILPLSSSLYDSFLSSHCSSCFSPLPP--------TFPHIPR 63

Query: 189 HVPTDTPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIF 368
           HV      PLYCS  CS++ S LH SSAE                               
Sbjct: 64  HV------PLYCSPTCSSSHSPLHSSSAE------------------------------- 86

Query: 369 QCSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENSE 548
             SLLP          +  + ++        L RI GL+TN   L  ++ ++        
Sbjct: 87  --SLLPPTCPDSSDLRTALRLLQSLPSTPPHLHRIDGLLTNHHMLTSSSPEVA------- 137

Query: 549 NYLRIREGAKMMAKVRNNVNSDK-----CFAFEEMVLCLVMTNAVE 671
              +IR+GA  MA  R + N D       F  EE VL LV+TNAVE
Sbjct: 138 --AKIRQGAIAMAAARKSRNRDNEGQSDGFLLEEAVLSLVITNAVE 181


>ref|XP_007019533.1| SET domain protein, putative isoform 1 [Theobroma cacao]
           gi|590600784|ref|XP_007019534.1| SET domain protein,
           putative isoform 1 [Theobroma cacao]
           gi|590600816|ref|XP_007019536.1| SET domain protein,
           putative isoform 1 [Theobroma cacao]
           gi|508724861|gb|EOY16758.1| SET domain protein, putative
           isoform 1 [Theobroma cacao] gi|508724862|gb|EOY16759.1|
           SET domain protein, putative isoform 1 [Theobroma cacao]
           gi|508724864|gb|EOY16761.1| SET domain protein, putative
           isoform 1 [Theobroma cacao]
          Length = 658

 Score = 99.8 bits (247), Expect = 7e-19
 Identities = 76/226 (33%), Positives = 104/226 (46%), Gaps = 5/226 (2%)
 Frame = +3

Query: 9   KMEMRAIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQ 188
           +MEMRA + +  G+D+TPP+ PL++ L+DS ++SHCS+CF  LPP        TFP   +
Sbjct: 12  EMEMRAKQDLDYGQDITPPILPLSSSLYDSFLSSHCSSCFSPLPP--------TFPHIPR 63

Query: 189 HVPTDTPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIF 368
           HV      PLYCS  CS++ S LH SSAE                               
Sbjct: 64  HV------PLYCSPTCSSSHSPLHSSSAE------------------------------- 86

Query: 369 QCSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENSE 548
             SLLP          +  + ++        L RI GL+TN   L  ++ ++        
Sbjct: 87  --SLLPPTCPDSSDLRTALRLLQSLPSTPPHLHRIDGLLTNHHMLTSSSPEVA------- 137

Query: 549 NYLRIREGAKMMAKVRNNVNSDK-----CFAFEEMVLCLVMTNAVE 671
              +IR+GA  MA  R + N D       F  EE VL LV+TNAVE
Sbjct: 138 --AKIRQGAIAMAAARKSRNRDNEGQSDGFLLEEAVLSLVITNAVE 181


>ref|XP_002306703.2| hypothetical protein POPTR_0005s21560g [Populus trichocarpa]
           gi|550339461|gb|EEE93699.2| hypothetical protein
           POPTR_0005s21560g [Populus trichocarpa]
          Length = 626

 Score = 94.7 bits (234), Expect = 2e-17
 Identities = 86/224 (38%), Positives = 103/224 (45%), Gaps = 4/224 (1%)
 Frame = +3

Query: 12  MEMRAIEK-IGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQ 188
           MEMRA E+ I IGED+TP + PL+  LHDS + SHCS+CF  LP   F           Q
Sbjct: 1   MEMRAGEEDIEIGEDITPSVIPLSYALHDSFIHSHCSSCFSRLPSANF----------TQ 50

Query: 189 HVPTDTPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIF 368
           H     PT LYCS  CS++    HFS AE HLL    HSPPSS               + 
Sbjct: 51  H--HHVPTLLYCSSICSSS----HFSPAELHLL----HSPPSS--------DLRAALRLL 92

Query: 369 QCSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENSE 548
             SL                       P     RI GL+TNRE L+        +DE  E
Sbjct: 93  PLSL-----------------------PSSSTNRICGLLTNREKLM--------ADE--E 119

Query: 549 NYLRIREGAKMMAKVR--NNVNSDKCFA-FEEMVLCLVMTNAVE 671
               +R GAK +A  R    V ++K  A   E  LCLV+TNAVE
Sbjct: 120 ISAHVRYGAKAIAAARRIEMVENEKNDAVLLEAALCLVLTNAVE 163


>ref|XP_004290505.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Fragaria vesca subsp.
           vesca]
          Length = 645

 Score = 90.5 bits (223), Expect = 4e-16
 Identities = 80/228 (35%), Positives = 101/228 (44%), Gaps = 8/228 (3%)
 Frame = +3

Query: 12  MEMRAIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQH 191
           MEMRA E+I +G DLTPPL PL + LHDS ++SHCS+CF  LP    P        N  H
Sbjct: 1   MEMRAGEEIELGRDLTPPLSPLYSALHDSLLSSHCSSCFSPLPTPPSP--------NNSH 52

Query: 192 VPTDTPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQ 371
                P  L+CS  CS++ S    S+AE  LL L  HS PS++                 
Sbjct: 53  -----PVLLFCSSLCSSSASV---STAEPRLLRLL-HSHPSTYPHG-------------- 89

Query: 372 CSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENSEN 551
                  S L      +      +  P     RI+GL+TNR  L        D D     
Sbjct: 90  -----DSSDLRAALRLLHSLPASSPAP-----RISGLLTNRRKL--------DDD----- 126

Query: 552 YLRIREGAKMMAKVRNNVNSDKCF--------AFEEMVLCLVMTNAVE 671
            LRIR+GA+ M   R   + +             EE  LCLV+TNAVE
Sbjct: 127 -LRIRDGARAMFLARTMPDDNDAVLDVAHDDAVSEEAALCLVLTNAVE 173


>ref|XP_004138545.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Cucumis sativus]
          Length = 659

 Score = 88.6 bits (218), Expect = 2e-15
 Identities = 75/224 (33%), Positives = 103/224 (45%), Gaps = 2/224 (0%)
 Frame = +3

Query: 6   LKMEMRAIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENL 185
           ++MEM A+E I + ED++PPL PL + LHDS + +HCS+CF  LP    PPI  + P + 
Sbjct: 28  MEMEMIAVEDIEMAEDISPPLFPLTSALHDSFLFTHCSSCFSLLPN---PPISHSIPLH- 83

Query: 186 QHVPTDTPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHI 365
                      YCSL CS +    H         S+      SS                
Sbjct: 84  -----------YCSLKCSLS----HSDPLTDAFFSIHPFPDASS------------DTSD 116

Query: 366 FQCSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENS 545
            + SL      L   + S+S        PD    RI GL+TNR  L+           +S
Sbjct: 117 LRASLRLLHLLLSHPSPSLSPP------PD----RIYGLLTNRHKLM-------TPQNDS 159

Query: 546 ENYLRIREGAKMMAKVR--NNVNSDKCFAFEEMVLCLVMTNAVE 671
           E +L++REGA  +A +R  N  +     A EE VLCLV+TNAV+
Sbjct: 160 EVFLKLREGANAIAALRRKNYADIPPGTALEEAVLCLVLTNAVD 203


>ref|XP_006473070.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Citrus sinensis]
          Length = 619

 Score = 84.7 bits (208), Expect = 2e-14
 Identities = 75/223 (33%), Positives = 96/223 (43%), Gaps = 1/223 (0%)
 Frame = +3

Query: 6   LKMEMRAIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENL 185
           ++MEMRA E+I  GED+TPPL PL    HDS +  HCS+CF                   
Sbjct: 1   MEMEMRASEEIRQGEDITPPLFPLTFAFHDSLLDGHCSSCF------------------- 41

Query: 186 QHVPTDTPTPLYCSLACSTADSALHFSSAERHLLSLFAHSP-PSSWQXXXXXXXXXXXAH 362
                 +P P  C        S+L  SSAE        HSP P+S               
Sbjct: 42  ------SPLPCCC--------SSLPLSSAELRAALYLLHSPLPTS--------------- 72

Query: 363 IFQCSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDEN 542
               SL P                           R+ GL+TNR+ L+ ++    DSD  
Sbjct: 73  ----SLPPP-------------------------PRLFGLLTNRDKLMSSS----DSDVA 99

Query: 543 SENYLRIREGAKMMAKVRNNVNSDKCFAFEEMVLCLVMTNAVE 671
           S    +IREGA+ MA+ R N++ D   A+EE  LCLVMTNAVE
Sbjct: 100 S----KIREGAREMARARGNLSDD--VAWEEAALCLVMTNAVE 136


>ref|XP_006599490.1| PREDICTED: protein SET DOMAIN GROUP 41-like isoform X2 [Glycine
           max]
          Length = 593

 Score = 81.6 bits (200), Expect = 2e-13
 Identities = 79/227 (34%), Positives = 99/227 (43%), Gaps = 7/227 (3%)
 Frame = +3

Query: 12  MEMRAIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQH 191
           MEMR+ E+I IG D+T  L PL+  LH   + +HCSACF +LP                 
Sbjct: 1   MEMRSKEEIEIGRDITATLTPLSFCLHTFYLHTHCSACFSSLP----------------- 43

Query: 192 VPTDTPTP---LYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAH 362
           +P   P P    YCS  CS A S LH SSAERHL       PPS+             +H
Sbjct: 44  IPNPNPNPNSLFYCSPPCSAALSPLHHSSAERHL-------PPSA-----------HSSH 85

Query: 363 IFQCSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDEN 542
           +  C+ L  R  L  +  S S              R+AGL++NR  L      +   D+ 
Sbjct: 86  L--CTAL--RLLLSHRPTSSS--------------RLAGLLSNRHILT----SLSVHDDV 123

Query: 543 SENYLRIREGAKMM----AKVRNNVNSDKCFAFEEMVLCLVMTNAVE 671
           SE   RI  GA  M    AK R   N D       + L  V+TNAVE
Sbjct: 124 SE---RISVGAGAMAEAIAKQRGIPNDDAVLEEATIALSAVLTNAVE 167


>ref|XP_006599489.1| PREDICTED: protein SET DOMAIN GROUP 41-like isoform X1 [Glycine
           max]
          Length = 642

 Score = 81.6 bits (200), Expect = 2e-13
 Identities = 79/227 (34%), Positives = 99/227 (43%), Gaps = 7/227 (3%)
 Frame = +3

Query: 12  MEMRAIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQH 191
           MEMR+ E+I IG D+T  L PL+  LH   + +HCSACF +LP                 
Sbjct: 1   MEMRSKEEIEIGRDITATLTPLSFCLHTFYLHTHCSACFSSLP----------------- 43

Query: 192 VPTDTPTP---LYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAH 362
           +P   P P    YCS  CS A S LH SSAERHL       PPS+             +H
Sbjct: 44  IPNPNPNPNSLFYCSPPCSAALSPLHHSSAERHL-------PPSA-----------HSSH 85

Query: 363 IFQCSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDEN 542
           +  C+ L  R  L  +  S S              R+AGL++NR  L      +   D+ 
Sbjct: 86  L--CTAL--RLLLSHRPTSSS--------------RLAGLLSNRHILT----SLSVHDDV 123

Query: 543 SENYLRIREGAKMM----AKVRNNVNSDKCFAFEEMVLCLVMTNAVE 671
           SE   RI  GA  M    AK R   N D       + L  V+TNAVE
Sbjct: 124 SE---RISVGAGAMAEAIAKQRGIPNDDAVLEEATIALSAVLTNAVE 167


>ref|XP_003595407.1| Protein SET DOMAIN GROUP [Medicago truncatula]
           gi|355484455|gb|AES65658.1| Protein SET DOMAIN GROUP
           [Medicago truncatula]
          Length = 683

 Score = 77.0 bits (188), Expect = 5e-12
 Identities = 43/95 (45%), Positives = 57/95 (60%)
 Frame = +3

Query: 6   LKMEMRAIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENL 185
           ++MEMR+ E I I  D+TPPL PL+  LH++ + +HCS+CF  + P   PPI    P N 
Sbjct: 11  MEMEMRSTEDINIATDITPPLTPLSFSLHNTHLHTHCSSCFSLITP---PPIPIPNPNN- 66

Query: 186 QHVPTDTPTPLYCSLACSTADSALHFSSAERHLLS 290
                  P   YCSL CST+ S++  SSAE HL S
Sbjct: 67  -------PPIHYCSLHCSTSHSSIPLSSAEHHLPS 94


>ref|XP_002893944.1| hypothetical protein ARALYDRAFT_314093 [Arabidopsis lyrata subsp.
           lyrata] gi|297339786|gb|EFH70203.1| hypothetical protein
           ARALYDRAFT_314093 [Arabidopsis lyrata subsp. lyrata]
          Length = 567

 Score = 76.6 bits (187), Expect = 6e-12
 Identities = 70/222 (31%), Positives = 92/222 (41%), Gaps = 2/222 (0%)
 Frame = +3

Query: 12  MEMRAIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQH 191
           ME+ A E I IG DL PPL PLA+ LHDS ++SHCS+CF  LPP                
Sbjct: 1   MEILAAEDIEIGTDLFPPLSPLASSLHDSFLSSHCSSCFSLLPP---------------- 44

Query: 192 VPTDTPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQ 371
                P PLYCS ACS  DS            + F   PP                    
Sbjct: 45  ---SPPQPLYCSAACSLTDS-----------FTNFPQFPPEI------------------ 72

Query: 372 CSLLPQ--RSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENS 545
             +LP   R+ L   N ++   ++ +  P     R+ GL+TN   L+           +S
Sbjct: 73  TPILPSDIRTALRLLNSTV---VDTSLSP----HRLNGLLTNHHLLM----------ADS 115

Query: 546 ENYLRIREGAKMMAKVRNNVNSDKCFAFEEMVLCLVMTNAVE 671
              L I   A  +A V    ++ K    EE  +C V+TNAVE
Sbjct: 116 SFSLAIHHAASFIATVLR--SNRKNTELEEAAICSVLTNAVE 155


>ref|XP_007152012.1| hypothetical protein PHAVU_004G094200g, partial [Phaseolus
           vulgaris] gi|561025321|gb|ESW24006.1| hypothetical
           protein PHAVU_004G094200g, partial [Phaseolus vulgaris]
          Length = 530

 Score = 73.6 bits (179), Expect = 5e-11
 Identities = 79/226 (34%), Positives = 93/226 (41%), Gaps = 6/226 (2%)
 Frame = +3

Query: 12  MEMRAIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTL-PPQTFPPIWRTFPENLQ 188
           MEMR+ E+I IG D+TP L PL   LHDS + +HCSACF  L  P    PI         
Sbjct: 1   MEMRSSEEIEIGRDITPTLTPLTFSLHDSNLNTHCSACFSPLSSPSPSIPI--------- 51

Query: 189 HVPTDTPTPL-YCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHI 365
                 P PL YCS  CS A S LH +SAE  LL   AHS                 A  
Sbjct: 52  ------PNPLIYCSPPCSAALSPLHHASAET-LLPSSAHS------------SHLRAALR 92

Query: 366 FQCSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENS 545
              S  P  SF                       R+AGL++NR  L          D  S
Sbjct: 93  LLRSHRPSPSF-----------------------RLAGLLSNRRILT-----SHHHDHVS 124

Query: 546 ENYLRIREGAKMMAKV----RNNVNSDKCFAFEEMVLCLVMTNAVE 671
           E   RIR  A +MA+     R   + D       + LC V+TNAVE
Sbjct: 125 E---RIRLDATVMAEAIAEQRAVPHDDAVLEEATIALCAVLTNAVE 167


>ref|XP_006395991.1| hypothetical protein EUTSA_v10003905mg [Eutrema salsugineum]
           gi|557092630|gb|ESQ33277.1| hypothetical protein
           EUTSA_v10003905mg [Eutrema salsugineum]
          Length = 575

 Score = 72.4 bits (176), Expect = 1e-10
 Identities = 66/220 (30%), Positives = 85/220 (38%)
 Frame = +3

Query: 12  MEMRAIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQH 191
           ME+ A + IGIG DL PPL PL   L+DS  TSHCS CF  L P                
Sbjct: 1   MEIMAADDIGIGVDLFPPLSPLTFSLYDSFFTSHCSCCFSLLSP---------------- 44

Query: 192 VPTDTPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQ 371
            P  +P  LYCS ACS  DS +       H L L +                        
Sbjct: 45  APPQSPASLYCSAACSLTDSPIVSQIIPDHSLILSSDI---------------------- 82

Query: 372 CSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENSEN 551
                 R+ L   N   S  +  A  P     R  GL+TN   L+ A      + + + N
Sbjct: 83  ------RAALRLLNSIPSYAVVAASLP----HRFGGLLTNHHRLM-ADSSFSVAIQCAAN 131

Query: 552 YLRIREGAKMMAKVRNNVNSDKCFAFEEMVLCLVMTNAVE 671
           ++     A ++   R N         EE  +C V+TNAVE
Sbjct: 132 FI-----AVVLRSDRKNTE------LEEAAICSVLTNAVE 160


>ref|XP_006395990.1| hypothetical protein EUTSA_v10003905mg [Eutrema salsugineum]
           gi|557092629|gb|ESQ33276.1| hypothetical protein
           EUTSA_v10003905mg [Eutrema salsugineum]
          Length = 572

 Score = 72.4 bits (176), Expect = 1e-10
 Identities = 66/220 (30%), Positives = 85/220 (38%)
 Frame = +3

Query: 12  MEMRAIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQH 191
           ME+ A + IGIG DL PPL PL   L+DS  TSHCS CF  L P                
Sbjct: 1   MEIMAADDIGIGVDLFPPLSPLTFSLYDSFFTSHCSCCFSLLSP---------------- 44

Query: 192 VPTDTPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQ 371
            P  +P  LYCS ACS  DS +       H L L +                        
Sbjct: 45  APPQSPASLYCSAACSLTDSPIVSQIIPDHSLILSSDI---------------------- 82

Query: 372 CSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENSEN 551
                 R+ L   N   S  +  A  P     R  GL+TN   L+ A      + + + N
Sbjct: 83  ------RAALRLLNSIPSYAVVAASLP----HRFGGLLTNHHRLM-ADSSFSVAIQCAAN 131

Query: 552 YLRIREGAKMMAKVRNNVNSDKCFAFEEMVLCLVMTNAVE 671
           ++     A ++   R N         EE  +C V+TNAVE
Sbjct: 132 FI-----AVVLRSDRKNTE------LEEAAICSVLTNAVE 160


>ref|NP_683372.2| SET domain-containing protein [Arabidopsis thaliana]
           gi|97190651|sp|Q3ECY6.1|SDG41_ARATH RecName:
           Full=Protein SET DOMAIN GROUP 41
           gi|332193843|gb|AEE31964.1| SET domain-containing
           protein [Arabidopsis thaliana]
          Length = 558

 Score = 72.4 bits (176), Expect = 1e-10
 Identities = 68/222 (30%), Positives = 94/222 (42%), Gaps = 2/222 (0%)
 Frame = +3

Query: 12  MEMRAIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQH 191
           ME+RA E I I  DL PPL PLA+ L+DS ++SHCS+CF  LPP                
Sbjct: 1   MEIRAAEDIEIRTDLFPPLSPLASSLYDSFLSSHCSSCFSLLPP---------------- 44

Query: 192 VPTDTPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQ 371
                P PLYCS ACS  DS              F +SP    +                
Sbjct: 45  ---SPPQPLYCSAACSLTDS--------------FTNSPQFPPEI--------------- 72

Query: 372 CSLLPQ--RSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENS 545
             +LP   R+ L   N   S  ++ +  P     R+  L+TN  +L+ A   I  +  ++
Sbjct: 73  TPILPSDIRTSLHLLN---STAVDTSSSP----HRLNNLLTN-HHLLMADPSISVAIHHA 124

Query: 546 ENYLRIREGAKMMAKVRNNVNSDKCFAFEEMVLCLVMTNAVE 671
            N++     A ++   R N         EE  +C V+TNAVE
Sbjct: 125 ANFI-----ATVIRSNRKNTE------LEEAAICAVLTNAVE 155

