BLASTX nr result

ID: Mentha27_contig00033329 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha27_contig00033329
         (1015 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU36834.1| hypothetical protein MIMGU_mgv1a023205mg [Mimulus...   310   6e-82
ref|XP_007199300.1| hypothetical protein PRUPE_ppa023162mg, part...   229   2e-57
ref|XP_004238489.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...   222   2e-55
ref|XP_002265243.2| PREDICTED: protein SET DOMAIN GROUP 41-like ...   211   3e-52
ref|XP_006359805.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...   209   1e-51
ref|XP_002306703.2| hypothetical protein POPTR_0005s21560g [Popu...   202   2e-49
ref|XP_007019533.1| SET domain protein, putative isoform 1 [Theo...   199   2e-48
gb|EXC28030.1| Protein SET DOMAIN GROUP 41 [Morus notabilis]          195   3e-47
ref|XP_004138545.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...   191   4e-46
ref|XP_007019535.1| SET domain-containing protein, putative isof...   188   3e-45
ref|XP_004290505.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...   187   8e-45
ref|XP_006599489.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...   185   2e-44
ref|XP_007152012.1| hypothetical protein PHAVU_004G094200g, part...   177   5e-42
ref|XP_006473070.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...   175   2e-41
ref|XP_002893944.1| hypothetical protein ARALYDRAFT_314093 [Arab...   172   3e-40
ref|XP_006395991.1| hypothetical protein EUTSA_v10003905mg [Eutr...   165   3e-38
ref|XP_004166625.1| PREDICTED: LOW QUALITY PROTEIN: protein SET ...   163   1e-37
ref|NP_683372.2| SET domain-containing protein [Arabidopsis thal...   163   1e-37
ref|XP_004516216.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...   161   3e-37
ref|XP_006395990.1| hypothetical protein EUTSA_v10003905mg [Eutr...   161   5e-37

>gb|EYU36834.1| hypothetical protein MIMGU_mgv1a023205mg [Mimulus guttatus]
          Length = 635

 Score =  310 bits (794), Expect = 6e-82
 Identities = 184/347 (53%), Positives = 219/347 (63%), Gaps = 11/347 (3%)
 Frame = +3

Query: 3    AIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQHVPTD 182
            A+E I IGEDLTP L PLA VL ++AV+S+CSACF  LPPQ FPP+      N  H P+ 
Sbjct: 5    AVEDIAIGEDLTPALPPLAFVLLETAVSSYCSACFSILPPQPFPPLNPNSRPNCSHFPS- 63

Query: 183  TPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQNLPQ 362
             PTPLYCS+ CS+ DS LHFSS E  LLSLF  SPP +W+            H+FQ + +
Sbjct: 64   -PTPLYCSVNCSSIDSPLHFSSGELRLLSLFRQSPPFAWEDSSDLRLSLRLIHLFQKIEK 122

Query: 363  -ECSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENS 539
             EC   P+ S +                    +ERI GLMTNRE L+F      +  ENS
Sbjct: 123  IEC---PEASEI--------------------IERIGGLMTNREKLIF------EESENS 153

Query: 540  EN-YLRIREGAKMMAKVR-----NNVNSDKS---FPLEEMVLCLVVTNAVEVLLKNGRCI 692
            EN Y +IR GAKMMA+ R     + VN++K    F LEEMVLCLV+TNAVEV  KNG  I
Sbjct: 154  ENVYQKIRSGAKMMAEARRASTDHYVNAEKKRDDFVLEEMVLCLVLTNAVEVQDKNGCTI 213

Query: 693  GIAVYDHTFSWINHSCSPNSCYRFLVGPEENDEQLLRL-RIAPGGCSYRNGDGSIMEGGL 869
            GIAVYD  FSWINHSCSPNSCYRF+   E + +  LR+   A  GC  R+G G I     
Sbjct: 214  GIAVYDTAFSWINHSCSPNSCYRFVSRLENHQQSSLRIASYATSGC--RHGYGDI----- 266

Query: 870  SVQVSDRNGYGPRVVVRSIKAISKGEEVTIAYTDLLQPKEMRQTELW 1010
                 +RNGYGPRV+VRSIKA+ KGEEVTIAYTDLLQPKEMR+ +LW
Sbjct: 267  -----ERNGYGPRVIVRSIKAVQKGEEVTIAYTDLLQPKEMRRAQLW 308


>ref|XP_007199300.1| hypothetical protein PRUPE_ppa023162mg, partial [Prunus persica]
            gi|462394700|gb|EMJ00499.1| hypothetical protein
            PRUPE_ppa023162mg, partial [Prunus persica]
          Length = 635

 Score =  229 bits (583), Expect = 2e-57
 Identities = 152/344 (44%), Positives = 185/344 (53%), Gaps = 8/344 (2%)
 Frame = +3

Query: 3    AIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRT--FPENLQHVP 176
            A E I IGED+TPPL PL   LHDS ++SHCS+CF  LPP  FPP+  T  FP N  HV 
Sbjct: 5    AEEDIEIGEDITPPLTPLGFALHDSLLSSHCSSCFSLLPPHPFPPLHFTPPFPHNPHHVL 64

Query: 177  TDTPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQNL 356
            + +    YCS  CST+DS LH SSAE HLL L    P  S               +  +L
Sbjct: 65   SSSS---YCSPLCSTSDSPLHVSSAELHLLHLLQSHP--STYPHGDSSDLRAALRLLHSL 119

Query: 357  PQECSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDEN 536
            P                         A GP     RIAGL+TN    +           +
Sbjct: 120  P-------------------------ATGPSA---RIAGLLTNHHKFL-----------H 140

Query: 537  SENYLRIREGAKMM---AKVRNNVNSDKSFPLEEMVLCLVVTNAVEVLLKNGRCIGIAVY 707
             +++ RIR+GA+ M    K+R+   +     LEE  LCLV+TNAVEV  K GR +GI+VY
Sbjct: 141  HDDHHRIRDGARAMFLARKMRDEAPNVYDAVLEEAALCLVLTNAVEVQDKTGRTLGISVY 200

Query: 708  DHTFSWINHSCSPNSCYRFLVGPEEN---DEQLLRLRIAPGGCSYRNGDGSIMEGGLSVQ 878
              +F WINHSCSPN+CYRFLV P        +   LRIAP G   ++    I      V 
Sbjct: 201  GPSFCWINHSCSPNACYRFLVSPPPPPPCSAERTPLRIAPLGQGTQSCGIDICCRLRVVF 260

Query: 879  VSDRNGYGPRVVVRSIKAISKGEEVTIAYTDLLQPKEMRQTELW 1010
            V+    YGPRV+VRSIK I KGEEVT+ YTDLLQPK MRQ+ELW
Sbjct: 261  VAII--YGPRVIVRSIKRIKKGEEVTVTYTDLLQPKAMRQSELW 302


>ref|XP_004238489.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Solanum lycopersicum]
          Length = 677

 Score =  222 bits (565), Expect = 2e-55
 Identities = 136/350 (38%), Positives = 191/350 (54%), Gaps = 14/350 (4%)
 Frame = +3

Query: 3    AIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQT---FPPIWRTFPENLQHV 173
            A E I IG+DLTPP+ PL+  LH S + SHCS+CF  LPP     +PP +   P+N    
Sbjct: 5    AKEAISIGQDLTPPIPPLSLCLHHSTLLSHCSSCFSPLPPPPSLHYPPFFS--PKN---- 58

Query: 174  PTDTPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQN 353
            P    +  YCSL CS+ DS +HFSS+E H   LF     +++             H+FQ 
Sbjct: 59   PNSNHSIRYCSLQCSSLDSPIHFSSSEFHFFHLFPQPLYTNFPTSSDLRLSLRLLHLFQT 118

Query: 354  LPQECSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDE 533
            L                       I+E+ G  + LERI GLMTN   ++F  +   D+D 
Sbjct: 119  L---------------------HLIQESNGSLLNLERIGGLMTNFRKVMFLEEHCNDNDL 157

Query: 534  NSENYLRIREGAKMMA---KVRNNVNSDKSFPLEEMVLCLVVTNAVEVLLKNGRCIGIAV 704
            +     RIR+GAK +A   ++R  + ++  + +E  VLCLV+TNAVEV  K+GR +G+ V
Sbjct: 158  SG----RIRDGAKALAASRRMRVGLETNGEYTVEAAVLCLVLTNAVEVYDKDGRSLGVGV 213

Query: 705  YDHTFSWINHSCSPNSCYRFLVGPEENDEQLLRLRIAP-------GGCSYRN-GDGSIME 860
            YD  FSW+NHSCSPN+ YRF    +     +L  RI P        G  + +    + ++
Sbjct: 214  YDVPFSWVNHSCSPNASYRFCTASDSGG--ILESRICPAATETGAAGIGHESISSNTELQ 271

Query: 861  GGLSVQVSDRNGYGPRVVVRSIKAISKGEEVTIAYTDLLQPKEMRQTELW 1010
              +SV +      GP++++RSIK I + EEV I+YTDLLQPK MRQ+ELW
Sbjct: 272  KSMSV-IGGSEACGPKIILRSIKGIQRSEEVLISYTDLLQPKVMRQSELW 320


>ref|XP_002265243.2| PREDICTED: protein SET DOMAIN GROUP 41-like [Vitis vinifera]
          Length = 660

 Score =  211 bits (538), Expect = 3e-52
 Identities = 140/340 (41%), Positives = 172/340 (50%), Gaps = 5/340 (1%)
 Frame = +3

Query: 9    EKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQHVPTDTP 188
            E   +G DLT PL PLA+ LHDS + SHCSACF  LPP             L +    + 
Sbjct: 7    EDTEMGLDLTHPLPPLASSLHDSHLRSHCSACFSPLPPTV-----------LVNTNPSSS 55

Query: 189  TPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQNLPQEC 368
               YCS  CS +DS LHFSSAE HL  L  HS PS+              HI    P   
Sbjct: 56   FLCYCSPPCSASDSPLHFSSAEHHLFLLLRHSHPST-AHSSDLRAALRLLHILHLPPLHT 114

Query: 369  SLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENSENY 548
              L                            RI GL+TN  +L+  +     + E+ E  
Sbjct: 115  QPL---------------------------HRICGLLTNLHHLISPSH----NSESDETL 143

Query: 549  LRIREGAKMMAK---VRNNVNSDKSFPLEEMVLCLVVTNAVEVLLKNGRCIGIAVYDHTF 719
             RIR+G K MA    +R+         LEE +LCLV+TNAVEV +  G  +GIAVYD  F
Sbjct: 144  TRIRDGGKAMAVARCMRDGTEFSGDSKLEEALLCLVLTNAVEVQVNGGSALGIAVYDWCF 203

Query: 720  SWINHSCSPNSCYRFLVGPEENDE--QLLRLRIAPGGCSYRNGDGSIMEGGLSVQVSDRN 893
            SWINHSCSPN+CYRFL+   E  +     RL+I PGG                ++V  +N
Sbjct: 204  SWINHSCSPNACYRFLLRSPETPQFSGESRLQIIPGGND-------------EIEVK-KN 249

Query: 894  GYGPRVVVRSIKAISKGEEVTIAYTDLLQPKEMRQTELWL 1013
              GPR++VRSIKAI KGEEV +AY DLLQPKE+R  ELW+
Sbjct: 250  RSGPRIIVRSIKAIKKGEEVWVAYIDLLQPKEIRHAELWV 289


>ref|XP_006359805.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Solanum tuberosum]
          Length = 681

 Score =  209 bits (532), Expect = 1e-51
 Identities = 136/356 (38%), Positives = 185/356 (51%), Gaps = 20/356 (5%)
 Frame = +3

Query: 3    AIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQT----FPPIWRTFPENLQH 170
            A E I IG+DLTPP+ PL+  LH S + SHCS+CF  LPP      +PP +   P+N   
Sbjct: 5    AKEAIPIGQDLTPPIPPLSLSLHHSTLLSHCSSCFSPLPPPPPSLHYPPFFS--PKN--- 59

Query: 171  VPTDTPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQ 350
             P       YCSL CS+ DS +HFSS+E H   LF     +++             H FQ
Sbjct: 60   -PNPNHFIRYCSLQCSSLDSPIHFSSSEFHFFHLFPQPLHTNFPTSSDLRLSLRLLHRFQ 118

Query: 351  NLPQECSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSD 530
             L                       I+E+ G  + LERI GL+TN   ++F  +   D+D
Sbjct: 119  TL---------------------NLIQESNGSFLNLERIGGLVTNFRKVMFLEEHCNDND 157

Query: 531  ENSENYLRIREGAKMMAKVRN-----NVNSD---KSFPLEEMVLCLVVTNAVEVLLKNGR 686
            ++  +  RIR GAK +A  R      + N +   + + +E  VLCLV+TNAVEV  K+GR
Sbjct: 158  DDDLSG-RIRHGAKALAASRRMRLGLDTNRELLYEEYTVEAAVLCLVLTNAVEVHDKDGR 216

Query: 687  CIGIAVYDHTFSWINHSCSPNSCYRFLVGPEENDEQLLRLRIAPGGC--------SYRNG 842
             +G+ VYD  FSW+NHSCSPN+ YRF    +     +   RI P           S    
Sbjct: 217  SLGVGVYDVPFSWVNHSCSPNASYRFCTASDSGG--ISECRICPAATETGAAGIESESIS 274

Query: 843  DGSIMEGGLSVQVSDRNGYGPRVVVRSIKAISKGEEVTIAYTDLLQPKEMRQTELW 1010
                ++  +SV +      GP++++RSIK I+K EEV I YTDLLQPK MRQ+ELW
Sbjct: 275  SNPELQKSMSV-IGGSETCGPKIILRSIKGINKSEEVLITYTDLLQPKVMRQSELW 329


>ref|XP_002306703.2| hypothetical protein POPTR_0005s21560g [Populus trichocarpa]
            gi|550339461|gb|EEE93699.2| hypothetical protein
            POPTR_0005s21560g [Populus trichocarpa]
          Length = 626

 Score =  202 bits (514), Expect = 2e-49
 Identities = 144/342 (42%), Positives = 171/342 (50%), Gaps = 8/342 (2%)
 Frame = +3

Query: 9    EKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQHVPTDTP 188
            E I IGED+TP + PL+  LHDS + SHCS+CF  LP   F           QH     P
Sbjct: 8    EDIEIGEDITPSVIPLSYALHDSFIHSHCSSCFSRLPSANFT----------QH--HHVP 55

Query: 189  TPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQNLPQEC 368
            T LYCS  CS++    HFS AE HLL    HSPPSS                  +L    
Sbjct: 56   TLLYCSSICSSS----HFSPAELHLL----HSPPSS------------------DLRAAL 89

Query: 369  SLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENSENY 548
             LLP                     P     RI GL+TNRE L+        +DE  E  
Sbjct: 90   RLLPLSL------------------PSSSTNRICGLLTNREKLM--------ADE--EIS 121

Query: 549  LRIREGAKMMAKVRNNV---NSDKSFPLEEMVLCLVVTNAVEVLLKNGRCIGIAVYDHTF 719
              +R GAK +A  R      N      L E  LCLV+TNAVEV    GR IGIAVY   F
Sbjct: 122  AHVRYGAKAIAAARRIEMVENEKNDAVLLEAALCLVLTNAVEVHDNEGRSIGIAVYGPNF 181

Query: 720  SWINHSCSPNSCYRFLVGPEEN-----DEQLLRLRIAPGGCSYRNGDGSIMEGGLSVQVS 884
            SWINHSCSPN+CYR ++ P +N     DE   RLRI P G   ++ +             
Sbjct: 182  SWINHSCSPNACYRSIISPPDNVLPFSDES--RLRILPAGTEVKSHES------------ 227

Query: 885  DRNGYGPRVVVRSIKAISKGEEVTIAYTDLLQPKEMRQTELW 1010
                 GPRV+VRSIK I +GEEVT+AYTDLLQPKE+R++ELW
Sbjct: 228  -----GPRVIVRSIKRIKRGEEVTVAYTDLLQPKEIRRSELW 264


>ref|XP_007019533.1| SET domain protein, putative isoform 1 [Theobroma cacao]
            gi|590600784|ref|XP_007019534.1| SET domain protein,
            putative isoform 1 [Theobroma cacao]
            gi|590600816|ref|XP_007019536.1| SET domain protein,
            putative isoform 1 [Theobroma cacao]
            gi|508724861|gb|EOY16758.1| SET domain protein, putative
            isoform 1 [Theobroma cacao] gi|508724862|gb|EOY16759.1|
            SET domain protein, putative isoform 1 [Theobroma cacao]
            gi|508724864|gb|EOY16761.1| SET domain protein, putative
            isoform 1 [Theobroma cacao]
          Length = 658

 Score =  199 bits (505), Expect = 2e-48
 Identities = 138/348 (39%), Positives = 178/348 (51%), Gaps = 12/348 (3%)
 Frame = +3

Query: 3    AIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQHVPTD 182
            A + +  G+D+TPP+ PL++ L+DS ++SHCS+CF  LPP        TFP   +HVP  
Sbjct: 17   AKQDLDYGQDITPPILPLSSSLYDSFLSSHCSSCFSPLPP--------TFPHIPRHVP-- 66

Query: 183  TPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQNLPQ 362
                LYCS  CS++ S LH SSAE    SL   + P S               + Q+LP 
Sbjct: 67   ----LYCSPTCSSSHSPLHSSSAE----SLLPPTCPDS-------SDLRTALRLLQSLP- 110

Query: 363  ECSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENSE 542
              S  P                         L RI GL+TN   L  ++ ++        
Sbjct: 111  --STPPH------------------------LHRIDGLLTNHHMLTSSSPEVA------- 137

Query: 543  NYLRIREGAKMMAKVRNNVNSDKS-----FPLEEMVLCLVVTNAVEVLLKNGRCIGIAVY 707
               +IR+GA  MA  R + N D       F LEE VL LV+TNAVEV  K+GR +GIAVY
Sbjct: 138  --AKIRQGAIAMAAARKSRNRDNEGQSDGFLLEEAVLSLVITNAVEVQDKSGRSLGIAVY 195

Query: 708  DHTFSWINHSCSPNSCYRFLVGPEE-----NDEQLLRLRIAPGGCSYRNGDGSIMEGGLS 872
            D +FSWINHSCSPN+CYRF +          ++    LRI P          S +E    
Sbjct: 196  DLSFSWINHSCSPNACYRFSISSPHATLSFREDSSSTLRIVPSVLGEECDACSCVE---- 251

Query: 873  VQVSDRNGY--GPRVVVRSIKAISKGEEVTIAYTDLLQPKEMRQTELW 1010
                   GY  GP+++VRSIK I KGEEV ++YTDLLQPK MRQ+ELW
Sbjct: 252  -HTKGNKGYELGPKIIVRSIKRIRKGEEVCVSYTDLLQPKAMRQSELW 298


>gb|EXC28030.1| Protein SET DOMAIN GROUP 41 [Morus notabilis]
          Length = 661

 Score =  195 bits (495), Expect = 3e-47
 Identities = 140/350 (40%), Positives = 173/350 (49%), Gaps = 16/350 (4%)
 Frame = +3

Query: 9    EKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIW-RTFPENLQHVPTDT 185
            E+I +GEDLT PL PL+  LH S + SHCS+CF  LP    PPI+   FP +        
Sbjct: 12   EEIEMGEDLTRPLPPLSFSLHHSLLLSHCSSCFSPLPSSPLPPIFPPRFPPS-----NSN 66

Query: 186  PTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQNLPQE 365
            P  LYCS  CS +DS LHFSSAE HLL L     PS+             A    +L   
Sbjct: 67   PKILYCSSQCSFSDSPLHFSSAEHHLLCLL----PSA------------AAADSSDLRAA 110

Query: 366  CSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENSEN 545
              LL               E   A      + RIAGL TN   L         +D+  E 
Sbjct: 111  LRLL---------------ESNPATRRSSSVSRIAGLSTNLHKLA--------NDDEEEV 147

Query: 546  YLRIREGAKMMAKVRNNVNSDKSFPLEE--------MVLCLVVTNAVEVLLKNGRCIGIA 701
              RIR+GA+ MA  R   + D S    E          LC V+TN VEV +K+GR +G+A
Sbjct: 148  AARIRDGARAMAAARRMRDRDCSGEESEGEEEAMAAAALCAVLTNGVEVQVKSGRTLGVA 207

Query: 702  VY-DHTFSWINHSCSPNSCYRFLVGPEEN------DEQLLRLRIAPGGCSYRNGDGSIME 860
            VY    FSWINHSCSPN+CYR  +  +        D +   +RI P  C+     G    
Sbjct: 208  VYGGGGFSWINHSCSPNACYRISLHSDLQTTSFLPDHETAAMRIVP-CCNKETQCGC--- 263

Query: 861  GGLSVQVSDRNGYGPRVVVRSIKAISKGEEVTIAYTDLLQPKEMRQTELW 1010
                        YGPR++VRSIK I KGEEVT+AYTDLLQPK +RQ++LW
Sbjct: 264  -----------SYGPRIIVRSIKRIQKGEEVTVAYTDLLQPKSVRQSDLW 302


>ref|XP_004138545.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Cucumis sativus]
          Length = 659

 Score =  191 bits (485), Expect = 4e-46
 Identities = 134/338 (39%), Positives = 172/338 (50%), Gaps = 2/338 (0%)
 Frame = +3

Query: 3    AIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQHVPTD 182
            A+E I + ED++PPL PL + LHDS + +HCS+CF  LP    PPI  + P +       
Sbjct: 34   AVEDIEMAEDISPPLFPLTSALHDSFLFTHCSSCFSLLPN---PPISHSIPLH------- 83

Query: 183  TPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQNLPQ 362
                 YCSL CS                   +HS P +              H F +   
Sbjct: 84   -----YCSLKCS------------------LSHSDPLT--------DAFFSIHPFPDASS 112

Query: 363  ECSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENSE 542
            + S L  R+ L   +  +S        P    +RI GL+TNR  L+           +SE
Sbjct: 113  DTSDL--RASLRLLHLLLSHPSPSLSPPP---DRIYGLLTNRHKLM-------TPQNDSE 160

Query: 543  NYLRIREGAKMMAKVRNNVNSD--KSFPLEEMVLCLVVTNAVEVLLKNGRCIGIAVYDHT 716
             +L++REGA  +A +R    +D      LEE VLCLV+TNAV+V    G+ IGIAVY  T
Sbjct: 161  VFLKLREGANAIAALRRKNYADIPPGTALEEAVLCLVLTNAVDVQDSIGQTIGIAVYAST 220

Query: 717  FSWINHSCSPNSCYRFLVGPEENDEQLLRLRIAPGGCSYRNGDGSIMEGGLSVQVSDRNG 896
            FSWINHSCSPN+CYRF      +D    R RIAP    + + +G+              G
Sbjct: 221  FSWINHSCSPNACYRF---ETPSDSVTTRFRIAPSCTDFMSDEGNF------------QG 265

Query: 897  YGPRVVVRSIKAISKGEEVTIAYTDLLQPKEMRQTELW 1010
             GPRVVVRSIK I KGE VTIAY DLLQPK +RQ+ELW
Sbjct: 266  NGPRVVVRSIKRIKKGEAVTIAYCDLLQPKVLRQSELW 303


>ref|XP_007019535.1| SET domain-containing protein, putative isoform 3 [Theobroma cacao]
            gi|590600821|ref|XP_007019537.1| SET domain-containing
            protein, putative isoform 3 [Theobroma cacao]
            gi|590600825|ref|XP_007019538.1| SET domain-containing
            protein, putative isoform 3 [Theobroma cacao]
            gi|590600830|ref|XP_007019539.1| SET domain-containing
            protein, putative isoform 3 [Theobroma cacao]
            gi|508724863|gb|EOY16760.1| SET domain-containing
            protein, putative isoform 3 [Theobroma cacao]
            gi|508724865|gb|EOY16762.1| SET domain-containing
            protein, putative isoform 3 [Theobroma cacao]
            gi|508724866|gb|EOY16763.1| SET domain-containing
            protein, putative isoform 3 [Theobroma cacao]
            gi|508724867|gb|EOY16764.1| SET domain-containing
            protein, putative isoform 3 [Theobroma cacao]
          Length = 625

 Score =  188 bits (477), Expect = 3e-45
 Identities = 134/347 (38%), Positives = 174/347 (50%), Gaps = 12/347 (3%)
 Frame = +3

Query: 3    AIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQHVPTD 182
            A + +  G+D+TPP+ PL++ L+DS ++SHCS+CF  LPP        TFP   +HVP  
Sbjct: 17   AKQDLDYGQDITPPILPLSSSLYDSFLSSHCSSCFSPLPP--------TFPHIPRHVP-- 66

Query: 183  TPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQNLPQ 362
                LYCS  CS++ S LH SSAE    SL   + P S               + Q+LP 
Sbjct: 67   ----LYCSPTCSSSHSPLHSSSAE----SLLPPTCPDS-------SDLRTALRLLQSLP- 110

Query: 363  ECSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENSE 542
              S  P                         L RI GL+TN   L  ++ ++        
Sbjct: 111  --STPPH------------------------LHRIDGLLTNHHMLTSSSPEVA------- 137

Query: 543  NYLRIREGAKMMAKVRNNVNSDKS-----FPLEEMVLCLVVTNAVEVLLKNGRCIGIAVY 707
               +IR+GA  MA  R + N D       F LEE VL LV+TNAVEV  K+GR +GIAVY
Sbjct: 138  --AKIRQGAIAMAAARKSRNRDNEGQSDGFLLEEAVLSLVITNAVEVQDKSGRSLGIAVY 195

Query: 708  DHTFSWINHSCSPNSCYRFLVGPEE-----NDEQLLRLRIAPGGCSYRNGDGSIMEGGLS 872
            D +FSWINHSCSPN+CYRF +          ++    LRI P          S +E    
Sbjct: 196  DLSFSWINHSCSPNACYRFSISSPHATLSFREDSSSTLRIVPSVLGEECDACSCVE---- 251

Query: 873  VQVSDRNGY--GPRVVVRSIKAISKGEEVTIAYTDLLQPKEMRQTEL 1007
                   GY  GP+++VRSIK I KGEEV ++YTDLLQPKE+    L
Sbjct: 252  -HTKGNKGYELGPKIIVRSIKRIRKGEEVCVSYTDLLQPKEISTCNL 297


>ref|XP_004290505.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Fragaria vesca subsp.
            vesca]
          Length = 645

 Score =  187 bits (474), Expect = 8e-45
 Identities = 136/348 (39%), Positives = 173/348 (49%), Gaps = 14/348 (4%)
 Frame = +3

Query: 9    EKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQHVPTDTP 188
            E+I +G DLTPPL PL + LHDS ++SHCS+CF  LP    P        N  H     P
Sbjct: 7    EEIELGRDLTPPLSPLYSALHDSLLSSHCSSCFSPLPTPPSP--------NNSH-----P 53

Query: 189  TPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQNLPQEC 368
              L+CS  CS++ S    S+AE  LL L  HS PS++              +  +LP   
Sbjct: 54   VLLFCSSLCSSSASV---STAEPRLLRLL-HSHPSTYPHGDSSDLRAAL-RLLHSLP--- 105

Query: 369  SLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENSENY 548
                                  A  P     RI+GL+TNR  L        D D      
Sbjct: 106  ----------------------ASSP---APRISGLLTNRRKL--------DDD------ 126

Query: 549  LRIREGAKMMAKVRNNVNSDKSF--------PLEEMVLCLVVTNAVEVLLKNGRCIGIAV 704
            LRIR+GA+ M   R   + + +           EE  LCLV+TNAVEV    GR +GIAV
Sbjct: 127  LRIRDGARAMFLARTMPDDNDAVLDVAHDDAVSEEAALCLVLTNAVEVQDHTGRTLGIAV 186

Query: 705  YDHTFSWINHSCSPNSCYRFLVG------PEENDEQLLRLRIAPGGCSYRNGDGSIMEGG 866
            YD  FSWINHSCSPN+CYRFL+       P + DE  LR                I+  G
Sbjct: 187  YDSCFSWINHSCSPNACYRFLLSSPSQPTPPQCDETPLR----------------IVPAG 230

Query: 867  LSVQVSDRNGYGPRVVVRSIKAISKGEEVTIAYTDLLQPKEMRQTELW 1010
              +  ++   +GPRV+VRSIK I++GEEVTI YTDLLQPK +R++ELW
Sbjct: 231  QLIVNAECEKFGPRVIVRSIKRINRGEEVTITYTDLLQPKAVRRSELW 278


>ref|XP_006599489.1| PREDICTED: protein SET DOMAIN GROUP 41-like isoform X1 [Glycine max]
          Length = 642

 Score =  185 bits (470), Expect = 2e-44
 Identities = 140/347 (40%), Positives = 174/347 (50%), Gaps = 13/347 (3%)
 Frame = +3

Query: 9    EKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQHVPTDTP 188
            E+I IG D+T  L PL+  LH   + +HCSACF +LP                 +P   P
Sbjct: 7    EEIEIGRDITATLTPLSFCLHTFYLHTHCSACFSSLP-----------------IPNPNP 49

Query: 189  TP---LYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQNLP 359
             P    YCS  CS A S LH SSAERHL       PPS+             +H+     
Sbjct: 50   NPNSLFYCSPPCSAALSPLHHSSAERHL-------PPSAHS-----------SHL----- 86

Query: 360  QECSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENS 539
              C+ L  R  L  +  S S              R+AGL++NR  L      +   D+ S
Sbjct: 87   --CTAL--RLLLSHRPTSSS--------------RLAGLLSNRHILT----SLSVHDDVS 124

Query: 540  ENYLRIREGAKMMA----KVRNNVNSDKSFPLEEMVLCLVVTNAVEVLLKNGRCIGIAVY 707
            E   RI  GA  MA    K R   N D       + L  V+TNAVEV    GR +GIAV+
Sbjct: 125  E---RISVGAGAMAEAIAKQRGIPNDDAVLEEATIALSAVLTNAVEVHDNEGRALGIAVF 181

Query: 708  DHTFSWINHSCSPNSCYRFLVGPEENDEQLLRLRIAP------GGCSYRNGDGSIMEGGL 869
            D  FSWINHSCSPN+CYRF++    +  +  +L IAP       G S  + +    +GGL
Sbjct: 182  DQIFSWINHSCSPNACYRFVLSSSSHSGE-AKLGIAPHLQMNSSGVSISSSE--FAKGGL 238

Query: 870  SVQVSDRNGYGPRVVVRSIKAISKGEEVTIAYTDLLQPKEMRQTELW 1010
                    GYGPR+VVRSIK I+KGEEVT+AYTDLLQPK MRQ+ELW
Sbjct: 239  --------GYGPRLVVRSIKKINKGEEVTVAYTDLLQPKAMRQSELW 277


>ref|XP_007152012.1| hypothetical protein PHAVU_004G094200g, partial [Phaseolus vulgaris]
            gi|561025321|gb|ESW24006.1| hypothetical protein
            PHAVU_004G094200g, partial [Phaseolus vulgaris]
          Length = 530

 Score =  177 bits (450), Expect = 5e-42
 Identities = 140/347 (40%), Positives = 171/347 (49%), Gaps = 13/347 (3%)
 Frame = +3

Query: 9    EKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLP-PQTFPPIWRTFPENLQHVPTDT 185
            E+I IG D+TP L PL   LHDS + +HCSACF  L  P    PI               
Sbjct: 7    EEIEIGRDITPTLTPLTFSLHDSNLNTHCSACFSPLSSPSPSIPI--------------- 51

Query: 186  PTPL-YCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQNLPQ 362
            P PL YCS  CS A S LH +SAE  LL   AHS                 +H+   L  
Sbjct: 52   PNPLIYCSPPCSAALSPLHHASAET-LLPSSAHS-----------------SHLRAALRL 93

Query: 363  ECSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENSE 542
              S  P  SF                       R+AGL++NR  L          D  SE
Sbjct: 94   LRSHRPSPSF-----------------------RLAGLLSNRRILTS-----HHHDHVSE 125

Query: 543  NYLRIREGAKMMAKVRNNVNSDKSFPLEEMVL-------CLVVTNAVEVLLKNGRCIGIA 701
               RIR  A +MA+    +   ++ P ++ VL       C V+TNAVEV    GR +GIA
Sbjct: 126  ---RIRLDATVMAEA---IAEQRAVPHDDAVLEEATIALCAVLTNAVEVHDNEGRALGIA 179

Query: 702  VYDHTFSWINHSCSPNSCYRFLVGPEENDEQLLRLRIAPGGCSYRNGDGSIMEGGLSVQ- 878
            V+D TFSWINHSCSPN+CYRF++    ++E  L LRIAP           +  GG+ V  
Sbjct: 180  VFDPTFSWINHSCSPNACYRFILSSFPSNEPEL-LRIAP--------HPQMGSGGVCVSS 230

Query: 879  ---VSDRNGYGPRVVVRSIKAISKGEEVTIAYTDLLQPKEMRQTELW 1010
                 +  GYGPR+VVRSIK I KGEEVT+AYTD+LQ K  RQ ELW
Sbjct: 231  DEFAKEMLGYGPRLVVRSIKKIKKGEEVTVAYTDILQTKATRQWELW 277


>ref|XP_006473070.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Citrus sinensis]
          Length = 619

 Score =  175 bits (444), Expect = 2e-41
 Identities = 130/344 (37%), Positives = 167/344 (48%), Gaps = 8/344 (2%)
 Frame = +3

Query: 3    AIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQHVPTD 182
            A E+I  GED+TPPL PL    HDS +  HCS+CF                         
Sbjct: 7    ASEEIRQGEDITPPLFPLTFAFHDSLLDGHCSSCF------------------------- 41

Query: 183  TPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQNLPQ 362
            +P P  CS        +L  SSAE        HSP                      LP 
Sbjct: 42   SPLPCCCS--------SLPLSSAELRAALYLLHSP----------------------LPT 71

Query: 363  ECSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENSE 542
                 P R F                          GL+TNR+ L+ ++    DSD  S 
Sbjct: 72   SSLPPPPRLF--------------------------GLLTNRDKLMSSS----DSDVAS- 100

Query: 543  NYLRIREGAKMMAKVRNNVNSDKSFPLEEMVLCLVVTNAVEVLL-KNGRCIGIAVYDHTF 719
               +IREGA+ MA+ R N++ D ++  EE  LCLV+TNAVEV   K GR +GIAVYD  F
Sbjct: 101  ---KIREGAREMARARGNLSDDVAW--EEAALCLVMTNAVEVQDDKTGRILGIAVYDKDF 155

Query: 720  SWINHSCSPNSCYRFLVG----PEENDEQLLRLRIAPGGCSYRNGDGSIMEGGLSVQVSD 887
            SWINHSCSPN+CYRF +     P   +E+  ++RIAP          +  +  + +    
Sbjct: 156  SWINHSCSPNACYRFSLSEPNAPSFRNEK--KMRIAPHVVFDSTEAETPGKSDVCISCEL 213

Query: 888  RNG---YGPRVVVRSIKAISKGEEVTIAYTDLLQPKEMRQTELW 1010
            + G   +GPR++VRSIK I+KGEEVT+AYTDLLQPK MRQ+ELW
Sbjct: 214  KEGSKRHGPRIIVRSIKPINKGEEVTVAYTDLLQPKGMRQSELW 257


>ref|XP_002893944.1| hypothetical protein ARALYDRAFT_314093 [Arabidopsis lyrata subsp.
            lyrata] gi|297339786|gb|EFH70203.1| hypothetical protein
            ARALYDRAFT_314093 [Arabidopsis lyrata subsp. lyrata]
          Length = 567

 Score =  172 bits (435), Expect = 3e-40
 Identities = 122/338 (36%), Positives = 161/338 (47%), Gaps = 2/338 (0%)
 Frame = +3

Query: 3    AIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQHVPTD 182
            A E I IG DL PPL PLA+ LHDS ++SHCS+CF  LPP                    
Sbjct: 5    AAEDIEIGTDLFPPLSPLASSLHDSFLSSHCSSCFSLLPPSP------------------ 46

Query: 183  TPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQNLPQ 362
             P PLYCS ACS  DS  +F                                   Q  P+
Sbjct: 47   -PQPLYCSAACSLTDSFTNFP----------------------------------QFPPE 71

Query: 363  ECSLLPQ--RSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDEN 536
               +LP   R+ L   N ++   ++ +  P     R+ GL+TN   L+           +
Sbjct: 72   ITPILPSDIRTALRLLNSTV---VDTSLSP----HRLNGLLTNHHLLM----------AD 114

Query: 537  SENYLRIREGAKMMAKVRNNVNSDKSFPLEEMVLCLVVTNAVEVLLKNGRCIGIAVYDHT 716
            S   L I   A  +A V  +  + K+  LEE  +C V+TNAVEV   NG  +GIA+YD  
Sbjct: 115  SSFSLAIHHAASFIATVLRS--NRKNTELEEAAICSVLTNAVEVQDSNGLVLGIALYDSR 172

Query: 717  FSWINHSCSPNSCYRFLVGPEENDEQLLRLRIAPGGCSYRNGDGSIMEGGLSVQVSDRNG 896
            FSWINHSCSPNSCYRF+       + L      P    + N   ++    L  QV    G
Sbjct: 173  FSWINHSCSPNSCYRFVNNTTSYHDDL----AYPITIPHVNNTETLSNLELQEQVRTM-G 227

Query: 897  YGPRVVVRSIKAISKGEEVTIAYTDLLQPKEMRQTELW 1010
            YGP+V+ R+IK I  GEE+T++Y DLLQP  +RQ++LW
Sbjct: 228  YGPKVIARNIKRIKSGEEITVSYIDLLQPTGLRQSDLW 265


>ref|XP_006395991.1| hypothetical protein EUTSA_v10003905mg [Eutrema salsugineum]
            gi|557092630|gb|ESQ33277.1| hypothetical protein
            EUTSA_v10003905mg [Eutrema salsugineum]
          Length = 575

 Score =  165 bits (417), Expect = 3e-38
 Identities = 118/341 (34%), Positives = 156/341 (45%), Gaps = 5/341 (1%)
 Frame = +3

Query: 3    AIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQHVPTD 182
            A + IGIG DL PPL PL   L+DS  TSHCS CF  L P                 P  
Sbjct: 5    AADDIGIGVDLFPPLSPLTFSLYDSFFTSHCSCCFSLLSP----------------APPQ 48

Query: 183  TPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQNLPQ 362
            +P  LYCS ACS  DS +                                   + Q +P 
Sbjct: 49   SPASLYCSAACSLTDSPI-----------------------------------VSQIIPD 73

Query: 363  ECSLLPQ--RSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDEN 536
               +L    R+ L   N   S  +  A  P     R  GL+TN   L+           +
Sbjct: 74   HSLILSSDIRAALRLLNSIPSYAVVAASLP----HRFGGLLTNHHRLM----------AD 119

Query: 537  SENYLRIREGAKMMAKVRNNVNSD-KSFPLEEMVLCLVVTNAVEVLLKNGRCIGIAVYDH 713
            S   + I+  A  +A V   + SD K+  LEE  +C V+TNAVE+   +GR +GIAVYD 
Sbjct: 120  SSFSVAIQCAANFIAVV---LRSDRKNTELEEAAICSVLTNAVELQDSSGRALGIAVYDT 176

Query: 714  TFSWINHSCSPNSCYRFLVGPEENDEQLLR--LRIAPGGCSYRNGDGSIMEGGLSVQVSD 887
             FSWINHSCSPN+CYRF++ P        +   ++ P   +       +     S+    
Sbjct: 177  RFSWINHSCSPNACYRFVISPHSTTTPSFQDYPKMLPHTTNTEKEQIGVCSRITSLWEGK 236

Query: 888  RNGYGPRVVVRSIKAISKGEEVTIAYTDLLQPKEMRQTELW 1010
               YGP+VV RSIK I  GEE+TI+Y DL+QP  +RQ++LW
Sbjct: 237  TVRYGPKVVARSIKRIKSGEEITISYIDLMQPTGLRQSDLW 277


>ref|XP_004166625.1| PREDICTED: LOW QUALITY PROTEIN: protein SET DOMAIN GROUP 41-like
            [Cucumis sativus]
          Length = 596

 Score =  163 bits (412), Expect = 1e-37
 Identities = 123/336 (36%), Positives = 154/336 (45%)
 Frame = +3

Query: 3    AIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQHVPTD 182
            A+E I + ED++PPL PL + LHDS + +HCS+CF  LP           P+ L   P+ 
Sbjct: 5    AVEDIEMAEDISPPLFPLTSALHDSFLFTHCSSCFSLLPN----------PQFLTPFPST 54

Query: 183  TPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQNLPQ 362
            T    +   +  T+D      ++ R L  L +H  PS                       
Sbjct: 55   TAPSNFPDASSDTSD----LRASLRLLHLLLSHPSPS----------------------- 87

Query: 363  ECSLLPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENSE 542
               L P                     PD    RI GL+TNR  L+        +    +
Sbjct: 88   ---LSPP--------------------PD----RIYGLLTNRHKLM-----TPKTTPRRK 115

Query: 543  NYLRIREGAKMMAKVRNNVNSDKSFPLEEMVLCLVVTNAVEVLLKNGRCIGIAVYDHTFS 722
            NY  I  G                  LEE VLCLV+TNAV+V    G+ IGIAVY  TFS
Sbjct: 116  NYADIPPGTA----------------LEEAVLCLVLTNAVDVQDSIGQTIGIAVYASTFS 159

Query: 723  WINHSCSPNSCYRFLVGPEENDEQLLRLRIAPGGCSYRNGDGSIMEGGLSVQVSDRNGYG 902
            WINHSCSPN+CYRF      +D    R RIAP    + + +G+              G G
Sbjct: 160  WINHSCSPNACYRF---ETPSDSVTTRFRIAPSCTDFMSDEGNF------------QGNG 204

Query: 903  PRVVVRSIKAISKGEEVTIAYTDLLQPKEMRQTELW 1010
            PRVVVRSIK I KGE VTIAY DLLQPK +RQ+ELW
Sbjct: 205  PRVVVRSIKRIKKGEAVTIAYCDLLQPKVLRQSELW 240


>ref|NP_683372.2| SET domain-containing protein [Arabidopsis thaliana]
            gi|97190651|sp|Q3ECY6.1|SDG41_ARATH RecName: Full=Protein
            SET DOMAIN GROUP 41 gi|332193843|gb|AEE31964.1| SET
            domain-containing protein [Arabidopsis thaliana]
          Length = 558

 Score =  163 bits (412), Expect = 1e-37
 Identities = 117/338 (34%), Positives = 165/338 (48%), Gaps = 2/338 (0%)
 Frame = +3

Query: 3    AIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQHVPTD 182
            A E I I  DL PPL PLA+ L+DS ++SHCS+CF  LPP                    
Sbjct: 5    AAEDIEIRTDLFPPLSPLASSLYDSFLSSHCSSCFSLLPPSP------------------ 46

Query: 183  TPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQNLPQ 362
             P PLYCS ACS  DS              F +SP                    Q  P+
Sbjct: 47   -PQPLYCSAACSLTDS--------------FTNSP--------------------QFPPE 71

Query: 363  ECSLLPQ--RSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDEN 536
               +LP   R+ L   N   S  ++ +  P     R+  L+TN  +L+ A   I  +  +
Sbjct: 72   ITPILPSDIRTSLHLLN---STAVDTSSSP----HRLNNLLTNH-HLLMADPSISVAIHH 123

Query: 537  SENYLRIREGAKMMAKVRNNVNSDKSFPLEEMVLCLVVTNAVEVLLKNGRCIGIAVYDHT 716
            + N++           +R+N    K+  LEE  +C V+TNAVEV   NG  +GIA+Y+ +
Sbjct: 124  AANFIA--------TVIRSN---RKNTELEEAAICAVLTNAVEVHDSNGLALGIALYNSS 172

Query: 717  FSWINHSCSPNSCYRFLVGPEENDEQLLRLRIAPGGCSYRNGDGSIMEGGLSVQVSDRNG 896
            FSWINHSCSPNSCYRF+     N      + +     +  + +  + E      ++  NG
Sbjct: 173  FSWINHSCSPNSCYRFV----NNRTSYHDVHVTN---TETSSNLELQEQVCGTSLNSGNG 225

Query: 897  YGPRVVVRSIKAISKGEEVTIAYTDLLQPKEMRQTELW 1010
             GP+++VRSIK I  GEE+T++Y DLLQP  +RQ++LW
Sbjct: 226  NGPKLIVRSIKRIKSGEEITVSYIDLLQPTGLRQSDLW 263


>ref|XP_004516216.1| PREDICTED: protein SET DOMAIN GROUP 41-like isoform X1 [Cicer
            arietinum]
          Length = 660

 Score =  161 bits (408), Expect = 3e-37
 Identities = 128/348 (36%), Positives = 164/348 (47%), Gaps = 18/348 (5%)
 Frame = +3

Query: 21   IGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQHVPTD--TPTP 194
            IG D+TPPL P +  LH++ + +HCS+CF  + P                +PT   + + 
Sbjct: 13   IGTDITPPLTPFSFSLHNTHLHTHCSSCFSLITPI---------------IPTTNHSHST 57

Query: 195  LYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQNLPQECSL 374
             YCS  CST+ S +H SSAERHL S    S                 A     L    SL
Sbjct: 58   FYCSPHCSTSHSPIHLSSAERHLPSSINSS-------------LLRTALRLLLLHHTTSL 104

Query: 375  LPQRSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDENSENYLR 554
             P                           RI  L+TNR   +  T Q +D +E       
Sbjct: 105  FP---------------------------RINHLLTNR---LLLTCQNDDVNET------ 128

Query: 555  IREGAKMMAKV----RNNVNSDKSFPLEEMVL-------CLVVTNAVEVLLKNGRCIGIA 701
            IR GA  MA      R   +   S P +  VL       C V+TNAVEV    G  +GIA
Sbjct: 129  IRLGAHAMATAIANHRGGGSGGFSEPYDNAVLEKSTDALCAVLTNAVEVHDNEGCAVGIA 188

Query: 702  VYDHTFSWINHSCSPNSCYRFLVGPEENDEQLLRLRIAPGGCSYRNGDGSIMEGGLSVQV 881
            V++  FSWINHSCSPN+CYRF         Q  +  IAP     RN     ++ G+S   
Sbjct: 189  VFEPAFSWINHSCSPNACYRFSFSSSSLLSQESKFLIAP---FTRNSQQPQIDCGVSGSS 245

Query: 882  SD--RNGY---GPRVVVRSIKAISKGEEVTIAYTDLLQPKEMRQTELW 1010
            S+  + G+   GPR++VRSIK I KGEEVT+AYTDLLQPK +RQ+ELW
Sbjct: 246  SEFAQEGWRICGPRLIVRSIKRIKKGEEVTVAYTDLLQPKALRQSELW 293


>ref|XP_006395990.1| hypothetical protein EUTSA_v10003905mg [Eutrema salsugineum]
            gi|557092629|gb|ESQ33276.1| hypothetical protein
            EUTSA_v10003905mg [Eutrema salsugineum]
          Length = 572

 Score =  161 bits (407), Expect = 5e-37
 Identities = 118/341 (34%), Positives = 156/341 (45%), Gaps = 5/341 (1%)
 Frame = +3

Query: 3    AIEKIGIGEDLTPPLRPLAAVLHDSAVTSHCSACFFTLPPQTFPPIWRTFPENLQHVPTD 182
            A + IGIG DL PPL PL   L+DS  TSHCS CF  L P                 P  
Sbjct: 5    AADDIGIGVDLFPPLSPLTFSLYDSFFTSHCSCCFSLLSP----------------APPQ 48

Query: 183  TPTPLYCSLACSTADSALHFSSAERHLLSLFAHSPPSSWQXXXXXXXXXXXAHIFQNLPQ 362
            +P  LYCS ACS  DS +                                   + Q +P 
Sbjct: 49   SPASLYCSAACSLTDSPI-----------------------------------VSQIIPD 73

Query: 363  ECSLLPQ--RSFLEGKNESISQEIEEAKGPDVCLERIAGLMTNRENLVFATKQIEDSDEN 536
               +L    R+ L   N   S  +  A  P     R  GL+TN   L+           +
Sbjct: 74   HSLILSSDIRAALRLLNSIPSYAVVAASLP----HRFGGLLTNHHRLM----------AD 119

Query: 537  SENYLRIREGAKMMAKVRNNVNSD-KSFPLEEMVLCLVVTNAVEVLLKNGRCIGIAVYDH 713
            S   + I+  A  +A V   + SD K+  LEE  +C V+TNAVE+   +GR +GIAVYD 
Sbjct: 120  SSFSVAIQCAANFIAVV---LRSDRKNTELEEAAICSVLTNAVELQDSSGRALGIAVYDT 176

Query: 714  TFSWINHSCSPNSCYRFLVGPEENDEQLLR--LRIAPGGCSYRNGDGSIMEGGLSVQVSD 887
             FSWINHSCSPN+CYRF++ P        +   ++ P   +       +     S+    
Sbjct: 177  RFSWINHSCSPNACYRFVISPHSTTTPSFQDYPKMLPHTTNTEKEQIGVCSRITSLW--- 233

Query: 888  RNGYGPRVVVRSIKAISKGEEVTIAYTDLLQPKEMRQTELW 1010
               YGP+VV RSIK I  GEE+TI+Y DL+QP  +RQ++LW
Sbjct: 234  EVRYGPKVVARSIKRIKSGEEITISYIDLMQPTGLRQSDLW 274

